BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 004054
(776 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255545664|ref|XP_002513892.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
gi|223546978|gb|EEF48475.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
Length = 774
Score = 1291 bits (3342), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 607/776 (78%), Positives = 681/776 (87%), Gaps = 7/776 (0%)
Query: 1 MIFHKLSLVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLV 60
+ F +FPLL L ++ ST+PPFSCDPSNPST +F FCKT+LPISQR RDLV
Sbjct: 6 LSFFTFFTIFPLLIL-------QITSTEPPFSCDPSNPSTSSFLFCKTSLPISQRVRDLV 58
Query: 61 SRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVI 120
SRLTLDEKISQLV+SAP+IPRLGIPAYEWWSEALHGVA VG+GI F G I+ ATSFPQVI
Sbjct: 59 SRLTLDEKISQLVSSAPSIPRLGIPAYEWWSEALHGVANVGRGIHFEGAIKAATSFPQVI 118
Query: 121 LTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDP 180
LTAASFD+Y WYRIGQ IG EARA+YNAGQA GMTFWAPNINIFRDPRWGRGQETPGEDP
Sbjct: 119 LTAASFDAYQWYRIGQVIGREARAVYNAGQATGMTFWAPNINIFRDPRWGRGQETPGEDP 178
Query: 181 LVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQ 240
LVTGKYAVSYVRGVQGD+F GGKLKG LQASACCKHFTAYDLDNWKG R+ FDARVTMQ
Sbjct: 179 LVTGKYAVSYVRGVQGDSFQGGKLKGHLQASACCKHFTAYDLDNWKGVNRFVFDARVTMQ 238
Query: 241 DLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCD 300
DLADTYQPPF+SCV+QG+ASGIMCAYNRVNGIPSCAD NLLS+TAR QW FHGYI SDCD
Sbjct: 239 DLADTYQPPFQSCVQQGKASGIMCAYNRVNGIPSCADFNLLSRTARGQWDFHGYIASDCD 298
Query: 301 AVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNL 360
AVSIIYD +GYAKSPEDAVVDVLKAGMDVNCGS+LQKHTKAAV+QKKLPE+ IDRALHNL
Sbjct: 299 AVSIIYDNQGYAKSPEDAVVDVLKAGMDVNCGSYLQKHTKAAVEQKKLPEASIDRALHNL 358
Query: 361 FSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSK 420
FSVRMRLGLFNGNPT QPF IG D VCS HQ+LAL+AA++GIVLLKNS LLPL KSK
Sbjct: 359 FSVRMRLGLFNGNPTEQPFSNIGPDQVCSQEHQILALEAARNGIVLLKNSARLLPLQKSK 418
Query: 421 SVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDK 480
+VSLA+IGPNANS +TLLGNYAGP C+++TPLQALQ YV+NT+YY GCDTV CSSASIDK
Sbjct: 419 TVSLAVIGPNANSVQTLLGNYAGPPCKTVTPLQALQYYVKNTIYYSGCDTVKCSSASIDK 478
Query: 481 AVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGG 540
AVDIAKG D VV++MGLDQTQE+EELDR+DLVLPG+QQELIT VA++AK P++LVLL GG
Sbjct: 479 AVDIAKGVDRVVMIMGLDQTQEREELDRLDLVLPGKQQELITNVAKSAKNPIVLVLLSGG 538
Query: 541 PVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTD 600
PVDI+FAKYD NIGSILWAGYPGEAG +ALAE+IFGDHNPGG+LPMTWYPQ+++KVPMTD
Sbjct: 539 PVDISFAKYDENIGSILWAGYPGEAGGIALAEIIFGDHNPGGKLPMTWYPQEFVKVPMTD 598
Query: 601 MKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVE 660
M+MRP +SG PGRTYRFY+G+ VF FG GLSYSKYSY+ K VSQ KLYLNQSS+ ++++
Sbjct: 599 MRMRPDPSSGYPGRTYRFYKGRNVFEFGYGLSYSKYSYELKYVSQTKLYLNQSSTMRIID 658
Query: 661 NQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLV 720
N D V V +LG EFC+ KF V +GV+N GEMAGKHPVLLF + AR GNGRP +QL+
Sbjct: 659 NSDPVRATLVAQLGAEFCKESKFSVKVGVENQGEMAGKHPVLLFARHARHGNGRPRRQLI 718
Query: 721 GFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
GF+SVILNA EKAEI FELSPCE SRA EDGL V+EEGTHFL+VG ++YPIS+ V
Sbjct: 719 GFKSVILNAGEKAEIEFELSPCEHFSRANEDGLRVMEEGTHFLMVGGDKYPISVVV 774
>gi|224066931|ref|XP_002302285.1| predicted protein [Populus trichocarpa]
gi|222844011|gb|EEE81558.1| predicted protein [Populus trichocarpa]
Length = 773
Score = 1260 bits (3260), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 591/756 (78%), Positives = 660/756 (87%)
Query: 21 LTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIP 80
+ RVDSTQPPFSCD SNPST+ FPFC+TTLPISQRARDLVSRLTLDEKISQLVNSAP IP
Sbjct: 18 VLRVDSTQPPFSCDSSNPSTKAFPFCETTLPISQRARDLVSRLTLDEKISQLVNSAPPIP 77
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIP YEWWSEALHGV+ G GI FN I+GATSFPQVILTAASFD+Y WYRIGQAIG
Sbjct: 78 RLGIPGYEWWSEALHGVSNAGPGIHFNDNIKGATSFPQVILTAASFDAYQWYRIGQAIGK 137
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
EARALYNAGQA GMTFWAPNINIFRDPRWGRGQETPGEDPLVTG YA SYV+GVQGD+F
Sbjct: 138 EARALYNAGQATGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGLYAASYVKGVQGDSFE 197
Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
GGK+KG LQASACCKHFTAYDLDNWKG R+ FDARVTMQDLADTYQPPF+SCV+QGRAS
Sbjct: 198 GGKIKGHLQASACCKHFTAYDLDNWKGMNRFVFDARVTMQDLADTYQPPFKSCVEQGRAS 257
Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
GIMCAYN+VNG+PSCAD NLLSKTAR QWGF GYITSDCDAVSII+D +GYAKSPEDAVV
Sbjct: 258 GIMCAYNKVNGVPSCADSNLLSKTARAQWGFRGYITSDCDAVSIIHDDQGYAKSPEDAVV 317
Query: 321 DVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFG 380
DVLKAGMDVNCGS+L KH K AV+QKKL ES+ID+ALHNLFSVRMRLGLFNG P Q FG
Sbjct: 318 DVLKAGMDVNCGSYLLKHAKVAVEQKKLSESDIDKALHNLFSVRMRLGLFNGRPEGQLFG 377
Query: 381 KIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGN 440
IG D VCS HQ+LAL+AA++GIVLLKNS LLPL KSK+ SLA+IGPNANS + LLGN
Sbjct: 378 NIGPDQVCSQEHQILALEAARNGIVLLKNSARLLPLSKSKTKSLAVIGPNANSGQMLLGN 437
Query: 441 YAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQT 500
YAGP CR +TPLQALQ+Y++ TVY+P CDTV CSSAS+D+AVD+AKGAD+VVLMMGLDQT
Sbjct: 438 YAGPPCRFVTPLQALQSYIKQTVYHPACDTVQCSSASVDRAVDVAKGADNVVLMMGLDQT 497
Query: 501 QEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAG 560
QE+EELDR DL+LPG+QQELI VA+AAK PV+LVL GGPVDI+FAK D+NIGSILWAG
Sbjct: 498 QEREELDRTDLLLPGKQQELIIAVAKAAKNPVVLVLFSGGPVDISFAKNDKNIGSILWAG 557
Query: 561 YPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYE 620
YPGE GA+ALAE++FGDHNPGGRLPMTWYPQ+++KVPMTDM MRP+A+SG PGRTYRFY
Sbjct: 558 YPGEGGAIALAEIVFGDHNPGGRLPMTWYPQEFVKVPMTDMGMRPEASSGYPGRTYRFYR 617
Query: 621 GKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCET 680
G+ VF FG G+SYSKYSY+ AVSQN LYLNQSS+ ++ + D V + ELGTEFCE
Sbjct: 618 GRSVFEFGYGISYSKYSYELTAVSQNTLYLNQSSTMHIINDFDSVRSTLISELGTEFCEQ 677
Query: 681 RKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELS 740
K IGVKNHGEMAGKHPVLLF + + GNGRP KQL+GFQSV+L A E+AEI FE+S
Sbjct: 678 NKCRARIGVKNHGEMAGKHPVLLFARQEKHGNGRPRKQLIGFQSVVLGAGERAEIEFEVS 737
Query: 741 PCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
PCE LSRA EDGLMV+EEG HFLVV +EYPIS+ +
Sbjct: 738 PCEHLSRANEDGLMVMEEGRHFLVVDGDEYPISVVI 773
>gi|224082152|ref|XP_002306583.1| predicted protein [Populus trichocarpa]
gi|222856032|gb|EEE93579.1| predicted protein [Populus trichocarpa]
Length = 745
Score = 1182 bits (3059), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 565/760 (74%), Positives = 645/760 (84%), Gaps = 28/760 (3%)
Query: 17 FTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSA 76
FT ++ +VDSTQPPFSCD SNPST+TFPFCKTTLPISQRA DLVSRLTL+EKISQLVNSA
Sbjct: 14 FTVIVLQVDSTQPPFSCDSSNPSTKTFPFCKTTLPISQRANDLVSRLTLEEKISQLVNSA 73
Query: 77 PAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQ 136
IPRLGIP Y+WWSEALHGVA G GI FNGTI+ ATSFPQVIL+AASFD+ WYRI Q
Sbjct: 74 QPIPRLGIPGYQWWSEALHGVAYAGPGIRFNGTIKRATSFPQVILSAASFDANQWYRISQ 133
Query: 137 AIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG 196
AIG EARALYNAGQA GMTFWAPNINIFRDPRWGRGQETPGEDPL+TGKYAVSYVRG+QG
Sbjct: 134 AIGKEARALYNAGQATGMTFWAPNINIFRDPRWGRGQETPGEDPLMTGKYAVSYVRGLQG 193
Query: 197 DTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQ 256
D+F GG++KG LQASACCKHFTAYDL+NW GT+RY FDA VT QDLADTYQPPF+SCV++
Sbjct: 194 DSFKGGEIKGPLQASACCKHFTAYDLENWNGTSRYVFDAYVTAQDLADTYQPPFKSCVEE 253
Query: 257 GRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE 316
GRASGIMCAYNRVNGIP+CAD N LS+TAR QWGF GYI SDCDAVSII+DA+GYAK+PE
Sbjct: 254 GRASGIMCAYNRVNGIPNCADSNFLSRTARAQWGFDGYIASDCDAVSIIHDAQGYAKTPE 313
Query: 317 DAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTM 376
DAVV VLKAGMDVNCGS+LQ+HTKAAV QKKL SEIDRALHNLFSVRMRLGLFNGNPT
Sbjct: 314 DAVVAVLKAGMDVNCGSYLQQHTKAAVDQKKLTISEIDRALHNLFSVRMRLGLFNGNPTG 373
Query: 377 QPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKT 436
Q FG IG D VCS +Q+LAL AA++GIVLLKNS GLLPL KSK++SLA+IGPNANS +T
Sbjct: 374 QQFGNIGPDQVCSQENQILALDAARNGIVLLKNSAGLLPLSKSKTMSLAVIGPNANSVQT 433
Query: 437 LLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG 496
LLGNYAGP C+ +TPLQALQ+Y+++T+ YPGCD+V CSSASI AV++AKGADHVVL+MG
Sbjct: 434 LLGNYAGPPCKLVTPLQALQSYIKHTIPYPGCDSVQCSSASIVGAVNVAKGADHVVLIMG 493
Query: 497 LDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSI 556
LD TQEKE LDR DLVLPG+QQELI VA+AAK PV+LVLL GGPVDI+FAK D+NIGSI
Sbjct: 494 LDDTQEKEGLDRRDLVLPGKQQELIISVAKAAKNPVVLVLLSGGPVDISFAKNDKNIGSI 553
Query: 557 LWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTY 616
LWAGYPGEAGA+ALAE+IFGDHNPGG+LPMTWYPQ+++KVPMTDM+MRP+ +SG PGRTY
Sbjct: 554 LWAGYPGEAGAIALAEIIFGDHNPGGKLPMTWYPQEFVKVPMTDMRMRPETSSGYPGRTY 613
Query: 617 RFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTE 676
RFY+G VF FG GLSYSKY+Y+ +A+ +G E
Sbjct: 614 RFYKGPTVFEFGYGLSYSKYTYELRAIY----------------------------IGEE 645
Query: 677 FCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIV 736
CE KF VT+ VKN G+MAGKHPVLLF + A+ G GRPIK+LVGFQ+V L A EK EI
Sbjct: 646 QCENIKFKVTVSVKNEGQMAGKHPVLLFARHAKPGKGRPIKKLVGFQTVKLGAGEKTEIE 705
Query: 737 FELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
+ELSPCE LS A EDG+MV+EEG+ L+VGD+E+P++I V
Sbjct: 706 YELSPCEHLSSANEDGVMVMEEGSQILLVGDKEHPVTIIV 745
>gi|225459350|ref|XP_002285805.1| PREDICTED: probable beta-D-xylosidase 7-like [Vitis vinifera]
Length = 774
Score = 1162 bits (3005), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 565/777 (72%), Positives = 651/777 (83%), Gaps = 4/777 (0%)
Query: 1 MIFHKLSLVFPLLCLCFTSLLTRVDSTQ-PPFSCDPSNPSTETFPFCKTTLPISQRARDL 59
M H L L+ + T +L V+STQ PPFSCD SNPST+++ FCKTTLPI R RDL
Sbjct: 1 MKLHSLLLIN---LIYVTVILVGVESTQSPPFSCDSSNPSTKSYHFCKTTLPIPDRVRDL 57
Query: 60 VSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQV 119
VSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVA G GI FNGTIR ATSFPQV
Sbjct: 58 VSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVADAGPGIRFNGTIRSATSFPQV 117
Query: 120 ILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGED 179
ILTAASFD +LWYRIG+AIG+EARA+YNAGQ GMTFWAPNINIFRDPRWGRGQETPGED
Sbjct: 118 ILTAASFDVHLWYRIGRAIGVEARAVYNAGQTKGMTFWAPNINIFRDPRWGRGQETPGED 177
Query: 180 PLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTM 239
PLVTG YAVSYVRGVQGD G K G+LQASACCKHFTAYDLD+WKG R+KFDARVTM
Sbjct: 178 PLVTGSYAVSYVRGVQGDCLRGLKRCGELQASACCKHFTAYDLDDWKGIDRFKFDARVTM 237
Query: 240 QDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDC 299
QDLADTYQPPF C+++GRASGIMCAYNRVNG+PSCAD NLL+ TAR++W F GYITSDC
Sbjct: 238 QDLADTYQPPFHRCIEEGRASGIMCAYNRVNGVPSCADFNLLTNTARKRWNFQGYITSDC 297
Query: 300 DAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHN 359
DAVS+I+D+ G+AK+PEDAVVDVLKAGMDVNCG++L HTK+AV QKKLPESE+DRAL N
Sbjct: 298 DAVSLIHDSYGFAKTPEDAVVDVLKAGMDVNCGTYLLNHTKSAVMQKKLPESELDRALEN 357
Query: 360 LFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKS 419
LF+VRMRLGLFNGNP QP+G IG + VCS HQ LAL AA+DGIVLLKNS LLPLPK
Sbjct: 358 LFAVRMRLGLFNGNPKGQPYGDIGPNQVCSVEHQTLALDAARDGIVLLKNSQRLLPLPKG 417
Query: 420 KSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASID 479
K++SLA+IGPNANS KTL+GNYAGP C+ ITPLQALQ+YV++T+Y+PGCD VACSS SI+
Sbjct: 418 KTMSLAVIGPNANSPKTLIGNYAGPPCKFITPLQALQSYVKSTMYHPGCDAVACSSPSIE 477
Query: 480 KAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCG 539
KAV+IA+ AD+VVL+MGLDQTQE+E DR+DLVLPG+QQ+LI VA AAKKPV+LVLL G
Sbjct: 478 KAVEIAQKADYVVLVMGLDQTQEREAHDRLDLVLPGKQQQLIICVANAAKKPVVLVLLSG 537
Query: 540 GPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMT 599
GPVDI+FAKY NIGSILWAGYPG AG A+AE IFGDHNPGGRLP+TWYPQD+ K+PMT
Sbjct: 538 GPVDISFAKYSNNIGSILWAGYPGGAGGAAIAETIFGDHNPGGRLPVTWYPQDFTKIPMT 597
Query: 600 DMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMV 659
DM+MRP++ SG PGRTYRFY G++VF FG GLSYS YS + V++NKLY NQSS+ +
Sbjct: 598 DMRMRPESNSGYPGRTYRFYTGEKVFEFGYGLSYSTYSCETIPVTRNKLYFNQSSTAHVY 657
Query: 660 ENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQL 719
EN D + Y SV ELG E C++ ++I V+N GEMAGKH VLLFV+ + G PIKQL
Sbjct: 658 ENTDSIRYTSVAELGKELCDSNNISISIRVRNDGEMAGKHSVLLFVRRLKASAGSPIKQL 717
Query: 720 VGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
V FQSV LN E A++ F L+PCE S +DGLMVIEEGTHFLVVGD+E+P+++ V
Sbjct: 718 VAFQSVHLNGGESADVGFLLNPCEHFSGPNKDGLMVIEEGTHFLVVGDQEHPVTVVV 774
>gi|224066929|ref|XP_002302284.1| predicted protein [Populus trichocarpa]
gi|222844010|gb|EEE81557.1| predicted protein [Populus trichocarpa]
Length = 742
Score = 1135 bits (2937), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 543/779 (69%), Positives = 633/779 (81%), Gaps = 42/779 (5%)
Query: 1 MIFHKLSL-VFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDL 59
M L L + L+ + TSL V+STQPP+SCD S+PST+ +PFC+T LPISQR DL
Sbjct: 1 MRLQNLCLRILILIAIHTTSLHLYVESTQPPYSCDSSDPSTKLYPFCQTKLPISQRVEDL 60
Query: 60 VSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVA---GVGKGIFFNGTIRGATSF 116
VSRLTLDEK+SQLV++APAIPRLGIPAYEWWSEALHGVA V +GI FNGTIR ATSF
Sbjct: 61 VSRLTLDEKVSQLVDTAPAIPRLGIPAYEWWSEALHGVALQTTVRQGIRFNGTIRFATSF 120
Query: 117 PQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETP 176
PQVILTAASFD++LWYRIGQ IG EAR +YNAGQA GMTFWAPNINIFRDPRWGRGQETP
Sbjct: 121 PQVILTAASFDAHLWYRIGQVIGKEARGIYNAGQATGMTFWAPNINIFRDPRWGRGQETP 180
Query: 177 GEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDAR 236
GEDPLV GKYAVSYVRGVQGD+F GG L +LQASACCKHFTAYDLD WKG R+ FDA
Sbjct: 181 GEDPLVAGKYAVSYVRGVQGDSFGGGTLGEQLQASACCKHFTAYDLDKWKGMNRFVFDA- 239
Query: 237 VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYIT 296
QDLADTYQPPF+SC+++G+ASGIMCAYNRVNG+P+CAD NLLSK AR QWGF+GYIT
Sbjct: 240 ---QDLADTYQPPFQSCIQEGKASGIMCAYNRVNGVPNCADYNLLSKKARGQWGFYGYIT 296
Query: 297 SDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRA 356
SDCDAV+II+D +GYAKSPEDAV DVLKAGMDVNCG +L+ +TK+AVK+KKLPESEIDRA
Sbjct: 297 SDCDAVAIIHDDQGYAKSPEDAVADVLKAGMDVNCGDYLKNYTKSAVKKKKLPESEIDRA 356
Query: 357 LHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPL 416
LHNLFS+RMRLGLFNGNPT QP+G I D VCS HQ LAL+AAQDGIVLLKN LLPL
Sbjct: 357 LHNLFSIRMRLGLFNGNPTKQPYGNIAPDQVCSQEHQALALKAAQDGIVLLKNPDKLLPL 416
Query: 417 PKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSA 476
K ++ SLA+IGPNAN++ LLGNY GP C+++TPLQ LQNY++NT Y+PGC VACSSA
Sbjct: 417 SKLETKSLAVIGPNANNSTKLLGNYFGPPCKTVTPLQGLQNYIKNTRYHPGCSRVACSSA 476
Query: 477 SIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVL 536
SI++AV IAKGAD V+L+MGLDQTQEKEE DRVDLVLPG+Q+ELIT VA+AAKKPV+LVL
Sbjct: 477 SINQAVKIAKGADQVILVMGLDQTQEKEEQDRVDLVLPGKQRELITAVAKAAKKPVVLVL 536
Query: 537 LCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKV 596
CGGPVD++FAKYD+NIGSI+WAGYPGEAG ALA++IFGDHNPGGRLPMTWYPQD+ KV
Sbjct: 537 FCGGPVDVSFAKYDQNIGSIIWAGYPGEAGGTALAQIIFGDHNPGGRLPMTWYPQDFTKV 596
Query: 597 PMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSST 656
PMTDM+MRPQ +SG PGRTYRFY GK+VF FG GLSYS YSY+ + +QNKLYL ++SS
Sbjct: 597 PMTDMRMRPQLSSGYPGRTYRFYNGKKVFEFGYGLSYSNYSYELASDTQNKLYL-RASSN 655
Query: 657 KMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPI 716
++ +N + + +K + +G E CE KF VT+ VKNHGEMAG
Sbjct: 656 QITKNSNTIRHKLISNIGKELCEKTKFTVTVRVKNHGEMAG------------------- 696
Query: 717 KQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIF 775
E AEI +ELSPCE LS + G+MV+EEG+ FL++GD+EYPI+I
Sbjct: 697 --------------ENAEIQYELSPCEHLSSPDDRGMMVMEEGSQFLLIGDKEYPITII 741
>gi|358349509|ref|XP_003638778.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
gi|355504713|gb|AES85916.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
Length = 776
Score = 1121 bits (2900), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 525/772 (68%), Positives = 629/772 (81%), Gaps = 3/772 (0%)
Query: 7 SLVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLD 66
+ F + F +L V + PPF+CD SNPST ++PFC LPI+QR +DLVSRLTLD
Sbjct: 6 TFTFVTIISLFLTLTYSVLAQLPPFACDYSNPSTRSYPFCNPKLPITQRTKDLVSRLTLD 65
Query: 67 EKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASF 126
EK++QLVNSAP IPRLGIPAYEWWSEALHG+ VG+GIFFNG+I ATSFPQVILTAASF
Sbjct: 66 EKLAQLVNSAPPIPRLGIPAYEWWSEALHGIGNVGRGIFFNGSITSATSFPQVILTAASF 125
Query: 127 DSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKY 186
DS+LWYRIGQAIG+EARA+YN GQA+GMTFWAPNINIFRDPRWGRGQET GEDP++T Y
Sbjct: 126 DSHLWYRIGQAIGVEARAIYNGGQAMGMTFWAPNINIFRDPRWGRGQETAGEDPMMTSNY 185
Query: 187 AVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTY 246
AVSYVRG+QGD+F GGKL+G LQASACCKHFTAYDLDNWKG R+ FDARV++QDLADTY
Sbjct: 186 AVSYVRGLQGDSFQGGKLRGHLQASACCKHFTAYDLDNWKGVNRFHFDARVSLQDLADTY 245
Query: 247 QPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIY 306
QPPF SC++QGRASGIMCAYNRVNGIPSCAD NLL+ T R+QW FHGYI SDC AV II+
Sbjct: 246 QPPFRSCIEQGRASGIMCAYNRVNGIPSCADFNLLTNTVRKQWEFHGYIVSDCGAVGIIH 305
Query: 307 DAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMR 366
D +GYAKS EDAV DVL AGMD+ CGS+L H K+AV+QKKLP IDRALHNLFS+R+R
Sbjct: 306 DEQGYAKSAEDAVADVLHAGMDLECGSYLTDHAKSAVQQKKLPIVRIDRALHNLFSIRIR 365
Query: 367 LGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLAL 426
LG F+GNP PFG IG + VCS H LAL+AA++GIVLLKN+ LLPLPK+ S+SLA+
Sbjct: 366 LGQFDGNPAKLPFGMIGPNHVCSENHLYLALEAARNGIVLLKNTASLLPLPKT-SISLAV 424
Query: 427 IGPNANSAK-TLLGNYAGPSCRSITPLQALQNYVENTVYYPGCD-TVACSSASIDKAVDI 484
IGPNAN++ TLLGNYAGP C+SIT LQ Q+YV+N V++PGCD C+SA IDKAV +
Sbjct: 425 IGPNANASPLTLLGNYAGPPCKSITILQGFQHYVKNAVFHPGCDGGPKCASAPIDKAVKV 484
Query: 485 AKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDI 544
AK AD+VVL+MGLDQ+ E+EE DRV L LPG+Q ELI VA+A+K+PVILVLLCGGP+DI
Sbjct: 485 AKNADYVVLVMGLDQSVEREERDRVHLDLPGKQLELINSVAKASKRPVILVLLCGGPIDI 544
Query: 545 TFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMR 604
+ AK + IG I+WAGYPGE G +ALA++IFGDHNPGGRLP+TWYP+DYIKVPMTDM+MR
Sbjct: 545 SSAKNNDKIGGIIWAGYPGELGGIALAQIIFGDHNPGGRLPITWYPKDYIKVPMTDMRMR 604
Query: 605 PQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDV 664
T+G PGRTYRFY+G V+ FG GLSY+KYSY+F +V+ +KL+ NQSS+ M EN +
Sbjct: 605 ADPTTGYPGRTYRFYKGPTVYEFGHGLSYTKYSYEFVSVTHDKLHFNQSSTHLMTENSET 664
Query: 665 VHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQS 724
+ YK V EL E C++ VT+GVKNHG + G+HP+LLF++P + P+KQLVGF S
Sbjct: 665 IRYKLVSELDEETCKSMSVSVTVGVKNHGNIVGRHPILLFMRPQKHRTRSPMKQLVGFHS 724
Query: 725 VILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
++L+A E + + FELSPCE LSRA E GL +IEEG+H L VG+EEY I I V
Sbjct: 725 LLLDAGEMSHVGFELSPCEHLSRANEAGLKIIEEGSHLLHVGEEEYLIDIIV 776
>gi|449465962|ref|XP_004150696.1| PREDICTED: probable beta-D-xylosidase 7-like [Cucumis sativus]
Length = 783
Score = 1120 bits (2897), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 528/773 (68%), Positives = 635/773 (82%)
Query: 4 HKLSLVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRL 63
HK+ L+ LL F SL+ S+QPP++CD SNP T+T PFCKT LPI RARDLVSRL
Sbjct: 11 HKIKLLTLLLSAAFLSLIVAGSSSQPPYACDSSNPLTKTLPFCKTYLPIKLRARDLVSRL 70
Query: 64 TLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTA 123
TLDEK+ QLVN+ P IPRLGIPAYEWWSEALHGVA VG GI NGTI ATSFPQVILTA
Sbjct: 71 TLDEKVLQLVNTVPPIPRLGIPAYEWWSEALHGVANVGYGIRLNGTITAATSFPQVILTA 130
Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVT 183
ASFD LWY+IGQAIG EARA+YNAGQA GMTFW PNINIFRDPRWGRGQETPGEDPL+T
Sbjct: 131 ASFDENLWYQIGQAIGTEARAVYNAGQAKGMTFWTPNINIFRDPRWGRGQETPGEDPLMT 190
Query: 184 GKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLA 243
GKY+V+YVRG+QGD GGKL +L+ASACCKHFTAYDLD W G TRY FDA+VTMQD+A
Sbjct: 191 GKYSVAYVRGIQGDAIEGGKLGNQLKASACCKHFTAYDLDRWNGMTRYVFDAKVTMQDMA 250
Query: 244 DTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVS 303
DTYQPPFESCV++G+ASGIMCAYNRVNG+PSCAD +LL+ TAR+QW F+GYITSDCDAVS
Sbjct: 251 DTYQPPFESCVEEGKASGIMCAYNRVNGVPSCADHHLLTATARKQWKFNGYITSDCDAVS 310
Query: 304 IIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSV 363
II+DA+GYAK PEDAV DVL+AGMDVNCG++L++HTK+AV+ KK+P IDRAL NLFSV
Sbjct: 311 IIHDAQGYAKIPEDAVADVLRAGMDVNCGTYLKEHTKSAVEMKKVPMLHIDRALRNLFSV 370
Query: 364 RMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVS 423
RMRLGLF+GNPT PFG+IG D VCS HQ LALQAA++GIVLLKNS LLPL KS + S
Sbjct: 371 RMRLGLFDGNPTKLPFGQIGRDQVCSQQHQNLALQAAREGIVLLKNSAKLLPLSKSNTHS 430
Query: 424 LALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVD 483
LA+IG N N KTL GNYAG C+S TP Q L NYV+NTVY+ GC+ C+ A+I +AV
Sbjct: 431 LAVIGHNGNDPKTLRGNYAGIPCKSATPFQGLNNYVKNTVYHRGCNYANCTEATIYQAVK 490
Query: 484 IAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVD 543
IAK D+VVL+MGLDQTQE+E+ DR +L LPG+Q +LI VA+AAK+PVILV+L GGPVD
Sbjct: 491 IAKSVDYVVLVMGLDQTQEREDFDRTELGLPGKQDKLIAEVAKAAKRPVILVILSGGPVD 550
Query: 544 ITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKM 603
I+ AKY+ IGSILWAGYPG+AG A+AE+IFGDHNPGGRLP+TWYP D+IK PMTDM+M
Sbjct: 551 ISSAKYNEKIGSILWAGYPGQAGGTAIAEIIFGDHNPGGRLPLTWYPHDFIKFPMTDMRM 610
Query: 604 RPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQD 663
R +++G PGRTYRFY G +V+ FG GLSYS + Y+F +VS++KL L+ +++ +N D
Sbjct: 611 RADSSTGYPGRTYRFYNGPKVYEFGYGLSYSNHIYEFTSVSESKLLLSHPKASQPAKNSD 670
Query: 664 VVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQ 723
+V Y+ V EL +FCE++ VT+GV+N GEM GKH VLLF+KP++ NG P+KQLVGF+
Sbjct: 671 LVSYRLVSELDKKFCESKTVNVTVGVRNEGEMGGKHSVLLFIKPSKPINGSPVKQLVGFK 730
Query: 724 SVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
V +NA E+ EI F +SPC+ +S+A E+GLM+IEEG++ LVVGD E+P+ IFV
Sbjct: 731 KVEINAGERREIEFLVSPCDHISKASEEGLMIIEEGSYSLVVGDVEHPLDIFV 783
>gi|302141935|emb|CBI19138.3| unnamed protein product [Vitis vinifera]
Length = 1411
Score = 1120 bits (2896), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 536/749 (71%), Positives = 619/749 (82%), Gaps = 32/749 (4%)
Query: 26 STQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIP 85
S+ PPF+CD S+P T+++ FC TTL ISQRA DL+SRLTLDEKISQL++SA +IPRLGIP
Sbjct: 693 SSSPPFACDSSDPLTKSYAFCNTTLRISQRASDLISRLTLDEKISQLISSAASIPRLGIP 752
Query: 86 AYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARAL 145
AYEWWSEALHG+ GI FNGTIR ATSFPQVILTAASFD++LWYRIGQAIG+E RA+
Sbjct: 753 AYEWWSEALHGIRDR-HGIRFNGTIRSATSFPQVILTAASFDAHLWYRIGQAIGIETRAM 811
Query: 146 YNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLK 205
YNAGQA+GMTFWAPNINIFRDPRWGRGQETPGEDP+V GKYAVSYVRG+QGDTF GGK+
Sbjct: 812 YNAGQAMGMTFWAPNINIFRDPRWGRGQETPGEDPVVAGKYAVSYVRGLQGDTFEGGKVD 871
Query: 206 GKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCA 265
LQASACCKHFTAYDLDNW RY FDARVTMQDLADTYQPPF SC+++GRASG+MCA
Sbjct: 872 -VLQASACCKHFTAYDLDNWTSIDRYTFDARVTMQDLADTYQPPFRSCIEEGRASGLMCA 930
Query: 266 YNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKA 325
YN VNG+P+CAD NLLSKTAR QWGF GYI SDCDAVS+++D +GYAKSPEDAV VL A
Sbjct: 931 YNLVNGVPNCADFNLLSKTARGQWGFDGYIVSDCDAVSLVHDVQGYAKSPEDAVAIVLTA 990
Query: 326 GMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD 385
GMDV CG +LQKH K+AV QKKL ESEIDRAL NLF+VRMRLGLFNGNP PFG IG D
Sbjct: 991 GMDVACGGYLQKHAKSAVSQKKLTESEIDRALLNLFTVRMRLGLFNGNPRKLPFGNIGPD 1050
Query: 386 VVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPS 445
VCS HQ LAL+AA+ GIVLLKNS LLPL K +++SLA+IGPNAN+ TLLGNYAGP
Sbjct: 1051 QVCSTEHQTLALEAARSGIVLLKNSDRLLPLSKGETLSLAVIGPNANATDTLLGNYAGPP 1110
Query: 446 CRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE 505
C+ I+PLQ LQ+YV NT+Y+ GC+ VACSSASI+ AVD+AK AD+VVL+MGLDQTQE+E+
Sbjct: 1111 CKFISPLQGLQSYVNNTMYHAGCNDVACSSASIENAVDVAKQADYVVLVMGLDQTQEREK 1170
Query: 506 LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEA 565
DR+DLVLPG+Q++LIT VA+AAKKPV+LVLLCGGPVDI+FAK NIGSILWAGYPGEA
Sbjct: 1171 YDRLDLVLPGKQEQLITGVAKAAKKPVVLVLLCGGPVDISFAKGSSNIGSILWAGYPGEA 1230
Query: 566 GAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVF 625
G A+AE IFGDHNPGGRLP+TWYP+D+IK+PMTDM+MRP+ SG PGRT+RFY GK VF
Sbjct: 1231 GGAAIAETIFGDHNPGGRLPVTWYPKDFIKIPMTDMRMRPEPQSGYPGRTHRFYTGKTVF 1290
Query: 626 PFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLV 685
FG GLSYS YSY+F +V+ NKLYLNQ S+T +VEN
Sbjct: 1291 EFGNGLSYSPYSYEFLSVTPNKLYLNQPSTTHVVENS----------------------- 1327
Query: 686 TIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESL 745
G+MAGKHPVLLFVK A+ GNG P+KQLVGFQ+V L+A E + + F LSPCE L
Sbjct: 1328 -------GKMAGKHPVLLFVKQAKAGNGSPMKQLVGFQNVFLDAGESSNVEFILSPCEHL 1380
Query: 746 SRAREDGLMVIEEGTHFLVVGDEEYPISI 774
SRA +DGLMV+E+G H LVVGD+EYPI+I
Sbjct: 1381 SRANKDGLMVMEQGIHLLVVGDKEYPIAI 1409
Score = 1051 bits (2717), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 519/720 (72%), Positives = 591/720 (82%), Gaps = 29/720 (4%)
Query: 1 MIFHKLSLVFPLLCLCFTSLLTRVDSTQ-PPFSCDPSNPSTETFPFCKTTLPISQRARDL 59
M H L L+ + T +L V+STQ PPFSCD SNPST+++ FCKTTLPI R RDL
Sbjct: 1 MKLHSLLLIN---LIYVTVILVGVESTQSPPFSCDSSNPSTKSYHFCKTTLPIPDRVRDL 57
Query: 60 VSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQV 119
VSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVA G GI FNGTIR ATSFPQV
Sbjct: 58 VSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVADAGPGIRFNGTIRSATSFPQV 117
Query: 120 ILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGED 179
ILTAASFD +LWYRIG+AIG+EARA+YNAGQ GMTFWAPNINIFRDPRWGRGQETPGED
Sbjct: 118 ILTAASFDVHLWYRIGRAIGVEARAVYNAGQTKGMTFWAPNINIFRDPRWGRGQETPGED 177
Query: 180 PLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTM 239
PLVTG YAVSYVRGVQGD G K G+LQASACCKHFTAYDLD+WKG R+KFDARVTM
Sbjct: 178 PLVTGSYAVSYVRGVQGDCLRGLKRCGELQASACCKHFTAYDLDDWKGIDRFKFDARVTM 237
Query: 240 QDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDC 299
QDLADTYQPPF C+++GRASGIMCAYNRVNG+PSCAD NLL+ TAR++W F GYITSDC
Sbjct: 238 QDLADTYQPPFHRCIEEGRASGIMCAYNRVNGVPSCADFNLLTNTARKRWNFQGYITSDC 297
Query: 300 DAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHN 359
DAVS+I+D+ G+AK+PEDAVVDVLKAGMDVNCG++L HTK+AV QKKLPESE+DRAL N
Sbjct: 298 DAVSLIHDSYGFAKTPEDAVVDVLKAGMDVNCGTYLLNHTKSAVMQKKLPESELDRALEN 357
Query: 360 LFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKS 419
LF+VRMRLGLFNGNP QP+G IG + VCS HQ LAL AA+DGIVLLKNS LLPLPK
Sbjct: 358 LFAVRMRLGLFNGNPKGQPYGDIGPNQVCSVEHQTLALDAARDGIVLLKNSQRLLPLPKG 417
Query: 420 KSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASID 479
K++SLA+IGPNANS KTL+GNYAGP C+ ITPLQALQ+YV++T+Y+PGCD VACSS SI+
Sbjct: 418 KTMSLAVIGPNANSPKTLIGNYAGPPCKFITPLQALQSYVKSTMYHPGCDAVACSSPSIE 477
Query: 480 KAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCG 539
KAV+IA+ AD+VVL+MGLDQTQE+E DR+DLVLPG+QQ+LI VA AAKKPV+LVLL G
Sbjct: 478 KAVEIAQKADYVVLVMGLDQTQEREAHDRLDLVLPGKQQQLIICVANAAKKPVVLVLLSG 537
Query: 540 GPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMT 599
GPVDI+FAKY NIGSILWAGYPG AG A+AE IFGDHNPGGRLP+TWYPQD+ K+PMT
Sbjct: 538 GPVDISFAKYSNNIGSILWAGYPGGAGGAAIAETIFGDHNPGGRLPVTWYPQDFTKIPMT 597
Query: 600 DMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMV 659
DM+MRP++ SG PGRTYRFY G++VF FG GLSYS YS + V++NKLY NQSS+ +
Sbjct: 598 DMRMRPESNSGYPGRTYRFYTGEKVFEFGYGLSYSTYSCETIPVTRNKLYFNQSSTAHVY 657
Query: 660 ENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQL 719
EN D + Y S MAGKH VLLFV+ + G PIKQL
Sbjct: 658 ENTDSIRYTS-------------------------MAGKHSVLLFVRRLKASAGSPIKQL 692
>gi|449508468|ref|XP_004163321.1| PREDICTED: LOW QUALITY PROTEIN: probable beta-D-xylosidase 7-like
[Cucumis sativus]
Length = 783
Score = 1118 bits (2893), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 528/773 (68%), Positives = 634/773 (82%)
Query: 4 HKLSLVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRL 63
HK+ L+ LL F SL+ S+QPP++CD SNP T+T PFCKT LPI RARDLVSRL
Sbjct: 11 HKIKLLTLLLSAAFLSLIVAGSSSQPPYACDSSNPLTKTLPFCKTYLPIKLRARDLVSRL 70
Query: 64 TLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTA 123
TLDEK+ QLVN+ P IPRLGIPAYEWWSEALHGVA VG GI NGTI ATSFPQVILTA
Sbjct: 71 TLDEKVLQLVNTVPPIPRLGIPAYEWWSEALHGVANVGYGIRLNGTITAATSFPQVILTA 130
Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVT 183
ASFD LWY+IGQAIG EARA+YNAGQA GMTFW PNINIFRDPRWGRGQETPGEDPL+T
Sbjct: 131 ASFDENLWYQIGQAIGTEARAVYNAGQAKGMTFWTPNINIFRDPRWGRGQETPGEDPLMT 190
Query: 184 GKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLA 243
GKY+V+YVRG+QGD GGKL +L+ASACCKHFTAYDLD W G TRY FDA+VTMQD+A
Sbjct: 191 GKYSVAYVRGIQGDAIEGGKLGNQLKASACCKHFTAYDLDRWNGMTRYVFDAKVTMQDMA 250
Query: 244 DTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVS 303
DTYQPPFESCV++G+ASGIMCAYNRVNG+PSCAD +LL+ TAR+QW F+GYITSDCDAVS
Sbjct: 251 DTYQPPFESCVEEGKASGIMCAYNRVNGVPSCADHHLLTATARKQWKFNGYITSDCDAVS 310
Query: 304 IIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSV 363
II+DA+GYAK PEDAV DVL+AGMDVNCG++L++HTK+AV+ KK+P IDRAL NLFSV
Sbjct: 311 IIHDAQGYAKIPEDAVADVLRAGMDVNCGTYLKEHTKSAVEMKKVPMLHIDRALRNLFSV 370
Query: 364 RMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVS 423
RMRLGLF+GNPT PFG+IG D VCS HQ LALQAA++GIVLLKNS LLPL KS + S
Sbjct: 371 RMRLGLFDGNPTKLPFGQIGRDQVCSQQHQNLALQAAREGIVLLKNSAKLLPLSKSNTHS 430
Query: 424 LALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVD 483
LA+IG N N KTL GNYAG C+S TP Q L NYV+NTVY+ GC+ C+ A+I +AV
Sbjct: 431 LAVIGHNGNDPKTLRGNYAGIPCKSATPFQGLNNYVKNTVYHRGCNYANCTEATIYQAVK 490
Query: 484 IAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVD 543
IAK D+VVL+MGLDQTQE+E+ DR +L LPG+Q +LI VA+AAK PVILV+L GGPVD
Sbjct: 491 IAKSVDYVVLVMGLDQTQEREDFDRTELGLPGKQDKLIAEVAKAAKXPVILVILSGGPVD 550
Query: 544 ITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKM 603
I+ AKY+ IGSILWAGYPG+AG A+AE+IFGDHNPGGRLP+TWYP D+IK PMTDM+M
Sbjct: 551 ISSAKYNEKIGSILWAGYPGQAGGTAIAEIIFGDHNPGGRLPLTWYPHDFIKFPMTDMRM 610
Query: 604 RPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQD 663
R +++G PGRTYRFY G +V+ FG GLSYS + Y+F +VS++KL L+ +++ +N D
Sbjct: 611 RADSSTGYPGRTYRFYNGPKVYEFGYGLSYSNHIYEFTSVSESKLLLSHPKASQPAKNSD 670
Query: 664 VVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQ 723
+V Y+ V EL +FCE++ VT+GV+N GEM GKH VLLF+KP++ NG P+KQLVGF+
Sbjct: 671 LVSYRLVSELDKKFCESKTVNVTVGVRNEGEMGGKHSVLLFIKPSKPINGSPVKQLVGFK 730
Query: 724 SVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
V +NA E+ EI F +SPC+ +S+A E+GLM+IEEG++ LVVGD E+P+ IFV
Sbjct: 731 KVEINAGERREIEFLVSPCDHISKASEEGLMIIEEGSYSLVVGDVEHPLDIFV 783
>gi|15218202|ref|NP_177929.1| putative beta-D-xylosidase 7 [Arabidopsis thaliana]
gi|259585708|sp|Q9SGZ5.2|BXL7_ARATH RecName: Full=Probable beta-D-xylosidase 7; Short=AtBXL7; Flags:
Precursor
gi|18086336|gb|AAL57631.1| At1g78060/F28K19_32 [Arabidopsis thaliana]
gi|332197942|gb|AEE36063.1| putative beta-D-xylosidase 7 [Arabidopsis thaliana]
Length = 767
Score = 1102 bits (2850), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 526/756 (69%), Positives = 628/756 (83%), Gaps = 8/756 (1%)
Query: 24 VDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLG 83
V+S PP SCDPSNP+T+ + FC+T LPI +RARDLVSRLT+DEKISQLVN+AP IPRLG
Sbjct: 17 VESAPPPHSCDPSNPTTKLYQFCRTDLPIGKRARDLVSRLTIDEKISQLVNTAPGIPRLG 76
Query: 84 IPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEAR 143
+PAYEWWSEALHGVA G GI FNGT++ ATSFPQVILTAASFDSY W+RI Q IG EAR
Sbjct: 77 VPAYEWWSEALHGVAYAGPGIRFNGTVKAATSFPQVILTAASFDSYEWFRIAQVIGKEAR 136
Query: 144 ALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGK 203
+YNAGQA GMTFWAPNINIFRDPRWGRGQETPGEDP++TG YAV+YVRG+QGD+F+G K
Sbjct: 137 GVYNAGQANGMTFWAPNINIFRDPRWGRGQETPGEDPMMTGTYAVAYVRGLQGDSFDGRK 196
Query: 204 -LKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGI 262
L LQASACCKHFTAYDLD WKG TRY F+A+V++ DLA+TYQPPF+ C+++GRASGI
Sbjct: 197 TLSNHLQASACCKHFTAYDLDRWKGITRYVFNAQVSLADLAETYQPPFKKCIEEGRASGI 256
Query: 263 MCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDV 322
MCAYNRVNGIPSCAD NLL++TAR QW F GYITSDCDAVSIIYDA+GYAKSPEDAV DV
Sbjct: 257 MCAYNRVNGIPSCADPNLLTRTARGQWAFRGYITSDCDAVSIIYDAQGYAKSPEDAVADV 316
Query: 323 LKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKI 382
LKAGMDVNCGS+LQKHTK+A++QKK+ E++IDRAL NLFSVR+RLGLFNG+PT P+G I
Sbjct: 317 LKAGMDVNCGSYLQKHTKSALQQKKVSETDIDRALLNLFSVRIRLGLFNGDPTKLPYGNI 376
Query: 383 GADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA 442
+ VCSPAHQ LAL AA++GIVLLKN+ LLP K SLA+IGPNA+ KTLLGNYA
Sbjct: 377 SPNEVCSPAHQALALDAARNGIVLLKNNLKLLPFSKRSVSSLAVIGPNAHVVKTLLGNYA 436
Query: 443 GPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQE 502
GP C+++TPL AL++YV+N VY+ GCD+VACS+A+ID+AV IAK ADHVVL+MGLDQTQE
Sbjct: 437 GPPCKTVTPLDALRSYVKNAVYHQGCDSVACSNAAIDQAVAIAKNADHVVLIMGLDQTQE 496
Query: 503 KEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYP 562
KE+ DRVDL LPG+QQELIT VA AAKKPV+LVL+CGGPVDI+FA + IGSI+WAGYP
Sbjct: 497 KEDFDRVDLSLPGKQQELITSVANAAKKPVVLVLICGGPVDISFAANNNKIGSIIWAGYP 556
Query: 563 GEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGK 622
GEAG +A++E+IFGDHNPGGRLP+TWYPQ ++ + MTDM+MR + +G PGRTY+FY+G
Sbjct: 557 GEAGGIAISEIIFGDHNPGGRLPVTWYPQSFVNIQMTDMRMR--SATGYPGRTYKFYKGP 614
Query: 623 EVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRK 682
+V+ FG GLSYS YSY+FK +++ LYLNQS K N D V Y V E+G E C+ K
Sbjct: 615 KVYEFGHGLSYSAYSYRFKTLAETNLYLNQS---KAQTNSDSVRYTLVSEMGKEGCDVAK 671
Query: 683 FLVTIGVKNHGEMAGKHPVLLFVKPARRGNG--RPIKQLVGFQSVILNAKEKAEIVFELS 740
VT+ V+N GEMAGKHPVL+F + R G R KQLVGF+S++L+ EKAE+ FE+
Sbjct: 672 TKVTVEVENQGEMAGKHPVLMFARHERGGEDGKRAEKQLVGFKSIVLSNGEKAEMEFEIG 731
Query: 741 PCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
CE LSRA E G+MV+EEG +FL VGD E P+ + V
Sbjct: 732 LCEHLSRANEFGVMVLEEGKYFLTVGDSELPLIVNV 767
>gi|371917286|dbj|BAL44719.1| SlArf/Xyl4 [Solanum lycopersicum]
Length = 775
Score = 1102 bits (2850), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 516/777 (66%), Positives = 627/777 (80%), Gaps = 3/777 (0%)
Query: 1 MIFHKLSLVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLV 60
M H +L+ +L SL++ V STQPPFSCD SNP T++ FC+T LPIS R DLV
Sbjct: 1 MKLHISTLITTILISL--SLVSIVQSTQPPFSCDSSNPQTKSLKFCQTGLPISVRVLDLV 58
Query: 61 SRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVI 120
SRLTLDEKISQLVNSAPAIPRLGIPAYEWWSE+LHGV GKGIFFNG+I GATSFPQVI
Sbjct: 59 SRLTLDEKISQLVNSAPAIPRLGIPAYEWWSESLHGVGSAGKGIFFNGSIAGATSFPQVI 118
Query: 121 LTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDP 180
LTAA+FD LWYRIGQ IG+EAR +YNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDP
Sbjct: 119 LTAATFDENLWYRIGQVIGVEARGVYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDP 178
Query: 181 LVTGKYAVSYVRGVQGDTFNGGKLK-GKLQASACCKHFTAYDLDNWKGTTRYKFDARVTM 239
++TGKYA+ YVRGVQGD+FNGG+LK G LQASACCKHFTAYDLD WK R+ F+A VT
Sbjct: 179 IMTGKYAIRYVRGVQGDSFNGGQLKKGHLQASACCKHFTAYDLDQWKNLDRFSFNAIVTP 238
Query: 240 QDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDC 299
QD+ADT+QPPF+ C+++ +ASGIMC+YN VNGIPSCA+ NLL+KTAR+QWGFHGYITSDC
Sbjct: 239 QDMADTFQPPFQDCIQKAQASGIMCSYNSVNGIPSCANYNLLTKTARQQWGFHGYITSDC 298
Query: 300 DAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHN 359
DAV +++D Y +PED+ LKAGMD++CG +L+K+TK+AV +KK+ + IDRALHN
Sbjct: 299 DAVQVMHDNHRYGNTPEDSTAFALKAGMDIDCGDYLKKYTKSAVMKKKVSQVHIDRALHN 358
Query: 360 LFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKS 419
LFS+RMRLGLFNG+P Q +G I VC+P HQ LAL+AA++GIVLLKN+ LLPL K+
Sbjct: 359 LFSIRMRLGLFNGDPRKQLYGNISPSQVCAPQHQQLALEAARNGIVLLKNTGKLLPLSKA 418
Query: 420 KSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASID 479
K+ SLA+IG NAN+A L GNY GP C+ I L+AL Y ++ Y GC+ C+SA+ID
Sbjct: 419 KTNSLAVIGHNANNAYILRGNYDGPPCKYIEILKALVGYAKSVQYQQGCNAANCTSANID 478
Query: 480 KAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCG 539
+AV+IA+ AD+VVL+MGLDQTQE+E+ DR DLVLPG+Q+ LI VA+AAKKPVILV+L G
Sbjct: 479 QAVNIARNADYVVLIMGLDQTQEREQFDRDDLVLPGQQENLINSVAKAAKKPVILVILSG 538
Query: 540 GPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMT 599
GPVDI+FAKY+ IGSILWAGYPGEAG +ALAE+IFG+HNPGG+LP+TWYPQ ++K+PMT
Sbjct: 539 GPVDISFAKYNPKIGSILWAGYPGEAGGIALAEIIFGEHNPGGKLPVTWYPQAFVKIPMT 598
Query: 600 DMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMV 659
DM+MRP +G PGRTYRFY+G +V+ FG GLSY+ YSY F + + N + LNQ S K V
Sbjct: 599 DMRMRPDPKTGYPGRTYRFYKGPKVYEFGYGLSYTTYSYGFHSATPNTIQLNQLLSVKTV 658
Query: 660 ENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQL 719
EN D + Y V E+G++ CE KF + V+N GEM GKHPVLLFVK + NG PIKQL
Sbjct: 659 ENSDSIRYTFVDEIGSDNCEKAKFSAHVSVENSGEMDGKHPVLLFVKQDKARNGSPIKQL 718
Query: 720 VGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
VGFQSV L A E +++VFE+SPCE LS A EDGLM+IEEG+ +LVVGD E+PI+I +
Sbjct: 719 VGFQSVSLKAGENSQLVFEISPCEHLSSANEDGLMMIEEGSRYLVVGDAEHPINIMI 775
>gi|297842585|ref|XP_002889174.1| glycosyl hydrolase family 3 protein [Arabidopsis lyrata subsp.
lyrata]
gi|297335015|gb|EFH65433.1| glycosyl hydrolase family 3 protein [Arabidopsis lyrata subsp.
lyrata]
Length = 766
Score = 1096 bits (2835), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 529/756 (69%), Positives = 628/756 (83%), Gaps = 8/756 (1%)
Query: 24 VDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLG 83
V+S PP SCDPSNP+T+ + FC+T LPISQRARDLVSRL +DEKISQL N+AP IPRLG
Sbjct: 16 VESAPPPHSCDPSNPTTKLYQFCRTDLPISQRARDLVSRLNIDEKISQLGNTAPGIPRLG 75
Query: 84 IPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEAR 143
+PAYEWWSEALHGVA G GI FNGT++ ATSFPQVILTAASFDSY W+RI Q IG EAR
Sbjct: 76 VPAYEWWSEALHGVAYAGPGIRFNGTVKAATSFPQVILTAASFDSYEWFRIAQVIGKEAR 135
Query: 144 ALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGK 203
+YNAGQA GMTFWAPNINIFRDPRWGRGQETPGEDP++TG YAV+YVRG+QGD+F+G K
Sbjct: 136 GVYNAGQAQGMTFWAPNINIFRDPRWGRGQETPGEDPIMTGTYAVAYVRGLQGDSFDGRK 195
Query: 204 -LKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGI 262
L LQASACCKHFTAYDLD WKG TRY F+A+V++ DLA+TYQPPF+ C+++GRASGI
Sbjct: 196 TLSIHLQASACCKHFTAYDLDRWKGITRYVFNAQVSLADLAETYQPPFKKCIEEGRASGI 255
Query: 263 MCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDV 322
MCAYNRVNGIPSCAD NLL++TAR W F GYITSDCDAVSII+DA+GYAK+PEDAV DV
Sbjct: 256 MCAYNRVNGIPSCADPNLLTRTARGLWRFRGYITSDCDAVSIIHDAQGYAKTPEDAVADV 315
Query: 323 LKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKI 382
LKAGMDVNCGS+LQKHTK+A++QKK+ E++IDRAL NLFSVR+RLGLFNG+PT P+G I
Sbjct: 316 LKAGMDVNCGSYLQKHTKSALQQKKVSETDIDRALLNLFSVRIRLGLFNGDPTKLPYGNI 375
Query: 383 GADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA 442
+ VCSPAHQ LAL+AA++GIVLLKN+ LLP K SLA+IGPNA+ AKTLLGNYA
Sbjct: 376 SPNDVCSPAHQALALEAARNGIVLLKNNLKLLPFSKRSVSSLAVIGPNAHVAKTLLGNYA 435
Query: 443 GPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQE 502
GP C+++TPL AL++YV+N VY+ GCD+VACS+A+ID+AV IA+ ADHVVL+MGLDQTQE
Sbjct: 436 GPPCKTVTPLDALRSYVKNAVYHNGCDSVACSNAAIDQAVAIARNADHVVLIMGLDQTQE 495
Query: 503 KEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYP 562
KE++DRVDL LPG+QQELIT VA AAKKPV+LVL+CGGPVDI+FA + IGSI+WAGYP
Sbjct: 496 KEDMDRVDLSLPGKQQELITSVANAAKKPVVLVLICGGPVDISFATNNDKIGSIMWAGYP 555
Query: 563 GEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGK 622
GEAG +ALAE+IFGDHNPGGRLP+TWYPQ ++ V MTDM+MR + +G PGRTY+FY+G
Sbjct: 556 GEAGGIALAEIIFGDHNPGGRLPVTWYPQSFVNVQMTDMRMR--SATGYPGRTYKFYKGP 613
Query: 623 EVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRK 682
+VF FG GLSYS YSY+FK + LYLNQS K N D V Y V E+G E C K
Sbjct: 614 KVFEFGHGLSYSTYSYRFKTLGATNLYLNQS---KAQLNSDSVRYTLVSEMGEEGCNIAK 670
Query: 683 FLVTIGVKNHGEMAGKHPVLLFVKPARRG-NG-RPIKQLVGFQSVILNAKEKAEIVFELS 740
V + V+N GEMAGKHPVL+F + R G NG R KQLVGF+S++L+ EKAE+ FE+
Sbjct: 671 TKVIVTVENQGEMAGKHPVLMFARHERGGENGKRAEKQLVGFKSIVLSNGEKAEMEFEIG 730
Query: 741 PCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
CE LSRA E G+MV+EEG +FL VGD E P++I V
Sbjct: 731 LCEHLSRANEVGVMVVEEGKYFLTVGDSELPLTINV 766
>gi|356515806|ref|XP_003526589.1| PREDICTED: probable beta-D-xylosidase 7-like [Glycine max]
Length = 772
Score = 1088 bits (2815), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 520/760 (68%), Positives = 623/760 (81%), Gaps = 11/760 (1%)
Query: 24 VDSTQP--PFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPR 81
V S++P PF+CD SNPS+ ++PFC LPI QR +DL+SRLTLDEK+SQLVN+AP IPR
Sbjct: 17 VQSSKPEAPFACDFSNPSSRSYPFCNPKLPIPQRTKDLLSRLTLDEKLSQLVNTAPPIPR 76
Query: 82 LGIPAYEWWSEALHGVAGVGKGIFF--NGTIRGATSFPQVILTAASFDSYLWYRIGQAIG 139
LGIPAY+WWSEALHGV+GVG GI F N TI ATSFPQVILTAASFDS LWYRIG AIG
Sbjct: 77 LGIPAYQWWSEALHGVSGVGPGILFDNNSTISSATSFPQVILTAASFDSRLWYRIGHAIG 136
Query: 140 LEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTF 199
+EARA++NAGQA G+TFWAPNINIFRDPRWGRGQET GEDPL+T +YAVS+VRG+QGD+F
Sbjct: 137 IEARAIFNAGQANGLTFWAPNINIFRDPRWGRGQETAGEDPLLTSRYAVSFVRGLQGDSF 196
Query: 200 NGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRA 259
G L ASACCKHFTAYDLDNWKG R+ FDARV++QDLADTYQPPF+SCV+QGRA
Sbjct: 197 KG----AHLLASACCKHFTAYDLDNWKGVDRFVFDARVSLQDLADTYQPPFQSCVQQGRA 252
Query: 260 SGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAV 319
SGIMCAYNRVNG+P+CAD LL++TAR QW F+GYITSDC AV I+D + YAKSPED V
Sbjct: 253 SGIMCAYNRVNGVPNCADYGLLTQTARNQWDFNGYITSDCGAVGFIHDRQRYAKSPEDVV 312
Query: 320 VDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
DVL+AGMD+ CGS+L H K+AV QKKL SEIDRAL NLFS+RMRLGLF+GNPT F
Sbjct: 313 ADVLRAGMDLECGSYLTYHAKSAVLQKKLGMSEIDRALQNLFSIRMRLGLFDGNPTRLSF 372
Query: 380 GKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPK-SKSVSLALIGPNANSAK-TL 437
G IG++ VCS HQ LAL+AA++GIVLLKNS LLPLPK S S+SLA+IGPNANS+ TL
Sbjct: 373 GLIGSNHVCSKEHQYLALEAARNGIVLLKNSPTLLPLPKTSPSISLAVIGPNANSSPLTL 432
Query: 438 LGNYAGPSCRSITPLQALQNYVENTVYYPGCD-TVACSSASIDKAVDIAKGADHVVLMMG 496
LGNYAGP C+ +T LQ ++YV+N Y+PGCD CSSA ID+AV++AK D+VVL+MG
Sbjct: 433 LGNYAGPPCKYVTILQGFRHYVKNAFYHPGCDGGPKCSSAQIDQAVEVAKKVDYVVLVMG 492
Query: 497 LDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSI 556
LDQ++E+EE DRV L LPG+Q ELI VAEA+KKPVILVLL GGP+DIT AKY+ IG I
Sbjct: 493 LDQSEEREERDRVHLDLPGKQLELINGVAEASKKPVILVLLSGGPLDITSAKYNHKIGGI 552
Query: 557 LWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTY 616
LWAGYPGE G +ALA++IFGDHNPGGRLP TWYP+DYIKVPMTDM+MR ++G PGRTY
Sbjct: 553 LWAGYPGELGGIALAQIIFGDHNPGGRLPTTWYPKDYIKVPMTDMRMRADPSTGYPGRTY 612
Query: 617 RFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTE 676
RFY+G +V+ FG GLSYSKYSY+F +V+ +KL+ NQSS+ MVEN + + YK V EL +
Sbjct: 613 RFYKGPKVYEFGYGLSYSKYSYEFVSVTHDKLHFNQSSTHLMVENSETISYKLVSELDEQ 672
Query: 677 FCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIV 736
C++ VT+ V+NHG M GKHPVLLF++P R+ +G P+KQLVGF+SV+L+A E A +
Sbjct: 673 TCQSMSLSVTVRVQNHGSMVGKHPVLLFIRPKRQKSGSPVKQLVGFESVMLDAGEMAHVE 732
Query: 737 FELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
FE+SPCE LSRA E G M+IEEG+H L+V D E+PI I+V
Sbjct: 733 FEVSPCEHLSRANEAGAMIIEEGSHMLLVDDLEHPIDIYV 772
>gi|356552866|ref|XP_003544783.1| PREDICTED: probable beta-D-xylosidase 7-like [Glycine max]
Length = 776
Score = 1072 bits (2771), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 508/768 (66%), Positives = 614/768 (79%), Gaps = 5/768 (0%)
Query: 12 LLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQ 71
+ +L +STQPP+SCD S+ ++ +PFC T LPIS+RA+DLVSRLTLDEK++Q
Sbjct: 11 FISFLLLTLHHHAESTQPPYSCDSSS-NSPYYPFCNTRLPISKRAQDLVSRLTLDEKLAQ 69
Query: 72 LVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLW 131
LVN+APAIPRLGIP+Y+WWSEALHGVA G GI FNGTI+ ATSFPQVILTAASFD LW
Sbjct: 70 LVNTAPAIPRLGIPSYQWWSEALHGVADAGFGIRFNGTIKSATSFPQVILTAASFDPNLW 129
Query: 132 YRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYV 191
Y+I + IG EARA+YNAGQA GMTFWAPNIN+FRDPRWGRGQET GEDPL+ KY V+YV
Sbjct: 130 YQISKTIGKEARAVYNAGQATGMTFWAPNINVFRDPRWGRGQETAGEDPLMNAKYGVAYV 189
Query: 192 RGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFE 251
RG+QGD+F GGKL +LQASACCKHFTAYDLD+WKG R+ +DARVT QDLADTYQPPF+
Sbjct: 190 RGLQGDSFEGGKLGERLQASACCKHFTAYDLDHWKGLDRFVYDARVTSQDLADTYQPPFQ 249
Query: 252 SCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGY 311
SC++QGRASGIMCAYNRVNG+P+CA+ NLL+KTAR+QW F GYITSDC AVSII+D +GY
Sbjct: 250 SCIEQGRASGIMCAYNRVNGVPNCANFNLLTKTARQQWKFDGYITSDCGAVSIIHDEQGY 309
Query: 312 AKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFN 371
AK+ EDA+ DV +AGMDV CG ++ KH K+AV QKKLP S+IDRAL NLFS+R+RLGL +
Sbjct: 310 AKTAEDAIADVFRAGMDVECGDYITKHGKSAVSQKKLPISQIDRALQNLFSIRIRLGLLD 369
Query: 372 GNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNA 431
GNPT PFG IG D VCS LAL+AA+DGIVLLKN++ LLPLPK+ ++ALIGPNA
Sbjct: 370 GNPTKLPFGTIGPDQVCSKQSLQLALEAARDGIVLLKNTNSLLPLPKTNP-TIALIGPNA 428
Query: 432 N-SAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDT-VACSSASIDKAVDIAKGAD 489
N S+K LGNY G C +T LQ + Y ++TVY+PGCD C+ A I+ AV++AK D
Sbjct: 429 NASSKVFLGNYYGRPCNLVTLLQGFEGYAKDTVYHPGCDDGPQCAYAQIEGAVEVAKKVD 488
Query: 490 HVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKY 549
+VVL+MGLDQ+QE+E DR L LPG+Q+ELI VA A+K+PV+LVLLCGGPVDIT AK+
Sbjct: 489 YVVLVMGLDQSQERESHDREYLGLPGKQEELIKSVARASKRPVVLVLLCGGPVDITSAKF 548
Query: 550 DRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATS 609
D +G ILWAGYPGE G VALA+V+FGDHNPGG+LP+TWYP+D+IKVPMTDM+MR S
Sbjct: 549 DDKVGGILWAGYPGELGGVALAQVVFGDHNPGGKLPITWYPKDFIKVPMTDMRMRADPAS 608
Query: 610 GNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKS 669
G PGRTYRFY G +V+ FG GLSY+KYSYK ++S N L++NQSS+ +N + + YK
Sbjct: 609 GYPGRTYRFYTGPKVYEFGYGLSYTKYSYKLLSLSHNTLHINQSSTHLTTQNSETIRYKL 668
Query: 670 VPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPAR-RGNGRPIKQLVGFQSVILN 728
V EL E C+T + +GV NHG MAGKHPVLLFV+ + R NG P+KQLVGFQSV LN
Sbjct: 669 VSELAEETCQTMLLSIALGVTNHGNMAGKHPVLLFVRQGKVRNNGNPVKQLVGFQSVKLN 728
Query: 729 AKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
A E ++ FELSPCE LS A E G MVIEEG++ L+VGD+EYPI I V
Sbjct: 729 AGETVQVGFELSPCEHLSVANEAGSMVIEEGSYLLLVGDQEYPIEITV 776
>gi|356548162|ref|XP_003542472.1| PREDICTED: probable beta-D-xylosidase 7-like [Glycine max]
Length = 778
Score = 1050 bits (2716), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 499/759 (65%), Positives = 608/759 (80%), Gaps = 8/759 (1%)
Query: 23 RVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRL 82
+ST+PP+SCD S+ ++ + FC T LPI++RA+DLVSRLTLDEK++QLVN+APAIPRL
Sbjct: 23 HAESTRPPYSCDSSS-NSPYYSFCNTKLPITKRAQDLVSRLTLDEKLAQLVNTAPAIPRL 81
Query: 83 GIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEA 142
GIP+Y+WWSEALHGVA G GI FNGTI+ ATSFPQVILTAASFD LWY+I + IG EA
Sbjct: 82 GIPSYQWWSEALHGVADAGFGIRFNGTIKSATSFPQVILTAASFDPNLWYQISKTIGREA 141
Query: 143 RALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGG 202
RA+YNAGQA GMTFWAPNIN+FRDPRWGRGQET GEDPL+ KY V+YVRG+QGD+F GG
Sbjct: 142 RAVYNAGQATGMTFWAPNINVFRDPRWGRGQETAGEDPLMNAKYGVAYVRGLQGDSFEGG 201
Query: 203 KLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGI 262
KL +LQASACCKHFTAYDLD WKG R+ FDARVT QDLADTYQPPF+SC++QGRASGI
Sbjct: 202 KLAERLQASACCKHFTAYDLDQWKGLDRFVFDARVTSQDLADTYQPPFQSCIEQGRASGI 261
Query: 263 MCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDV 322
MCAYNRVNG+P+CAD NLL+KTAR+QW F GYITSDC AVSII++ +GYAK+ EDA+ DV
Sbjct: 262 MCAYNRVNGVPNCADFNLLTKTARQQWKFDGYITSDCGAVSIIHEKQGYAKTAEDAIADV 321
Query: 323 LKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKI 382
+AGMDV CG ++ KH K+AV QKKLP S+IDRAL NLFS+R+RLGLF+GNPT PFG I
Sbjct: 322 FRAGMDVECGDYITKHAKSAVFQKKLPISQIDRALQNLFSIRIRLGLFDGNPTKLPFGTI 381
Query: 383 GADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNAN-SAKTLLGNY 441
G + VCS LAL+AA+DGIVLLKN++ LLPLPK+ ++ALIGPNAN S+K LGNY
Sbjct: 382 GPNEVCSKQSLQLALEAARDGIVLLKNTNSLLPLPKTNP-TIALIGPNANASSKVFLGNY 440
Query: 442 AGPSCRSITPLQALQNYVENTVYYPGCDT-VACSSASIDKAVDIAKGADHVVLMMGLDQT 500
G C +T LQ + Y + TVY+PGCD C+ A I++AV++AK D+VVL+MGLDQ+
Sbjct: 441 YGRPCNLVTLLQGFEGYAK-TVYHPGCDDGPQCAYAQIEEAVEVAKKVDYVVLVMGLDQS 499
Query: 501 QEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAG 560
QE+E DR L LPG+Q+ELI VA AAK+PV++VLLCGGPVDIT AK+D +G ILWAG
Sbjct: 500 QERESHDREYLGLPGKQEELIKSVARAAKRPVVVVLLCGGPVDITSAKFDDKVGGILWAG 559
Query: 561 YPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYE 620
YPGE G VALA+V+FGDHNPGG+LP+TWYP+D+IKVPMTDM+MR SG PGRTYRFY
Sbjct: 560 YPGELGGVALAQVVFGDHNPGGKLPITWYPKDFIKVPMTDMRMRADPASGYPGRTYRFYT 619
Query: 621 GKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCET 680
G +V+ FG GLSY+KYSYK ++S + L++NQSS+ M +N + + YK V EL E C+T
Sbjct: 620 GPKVYEFGYGLSYTKYSYKLLSLSHSTLHINQSSTHLMTQNSETIRYKLVSELAEETCQT 679
Query: 681 RKFLVTIGVKNHGEMAGKHPVLLFVKPARR---GNGRPIKQLVGFQSVILNAKEKAEIVF 737
+ +GV N G +AGKHPVLLFV+ + NG P+KQLVGFQSV +NA E ++ F
Sbjct: 680 MLLSIALGVTNRGNLAGKHPVLLFVRQGKVRNINNGNPVKQLVGFQSVKVNAGETVQVGF 739
Query: 738 ELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
ELSPCE LS A E G MVIEEG++ +VGD+EYPI + V
Sbjct: 740 ELSPCEHLSVANEAGSMVIEEGSYLFIVGDQEYPIEVTV 778
>gi|357489441|ref|XP_003615008.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
gi|355516343|gb|AES97966.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
Length = 798
Score = 1050 bits (2716), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 507/788 (64%), Positives = 612/788 (77%), Gaps = 23/788 (2%)
Query: 8 LVFPLLCLCFTSLLTRVDSTQ--PPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTL 65
+ F L L L DS PP+SCD +NP T++ PFC L I+QRA+D+VSRLTL
Sbjct: 11 ISFIFLFLTRYHRLVHADSPTHVPPYSCDTTNPLTKSLPFCNLNLTITQRAKDIVSRLTL 70
Query: 66 DEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAAS 125
DEKISQLVN+AP+IPRLGIP+Y+WW EALHGVA GKGI NG++ GATSFPQVILTAAS
Sbjct: 71 DEKISQLVNTAPSIPRLGIPSYQWWDEALHGVANAGKGIRLNGSVAGATSFPQVILTAAS 130
Query: 126 FDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGK 185
FDS LWY+I + IG EAR +YNAGQA GMTFWAPNINIFRDPRWGRGQET GEDPLV K
Sbjct: 131 FDSKLWYQISKVIGTEARGVYNAGQAQGMTFWAPNINIFRDPRWGRGQETAGEDPLVNSK 190
Query: 186 YAVSYVRGVQGDTFNGGKLKG-KLQASACCKHFTAYDLDNWKGTTRYKFDARV------- 237
Y VSYVRG+QGD+F GGKL G +L+ASACCKHFTAYDLDNWKG R+ FDA+V
Sbjct: 191 YGVSYVRGLQGDSFEGGKLIGDRLKASACCKHFTAYDLDNWKGLDRFDFDAKVSFLFSMA 250
Query: 238 ---------TMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQ 288
T+QDLADTYQPPF SC+ QGR+SGIMCAYNRVNG+P+CAD NLL+KTAR++
Sbjct: 251 YSPWMINYVTLQDLADTYQPPFHSCIVQGRSSGIMCAYNRVNGVPNCADYNLLTKTARQK 310
Query: 289 WGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKL 348
W F+GYITSDC+AV IIYD +GYAK+PEDAV DVL+AGMDV CG +L KH KAAV QKK+
Sbjct: 311 WNFNGYITSDCEAVRIIYDNQGYAKTPEDAVADVLQAGMDVECGDYLTKHAKAAVLQKKV 370
Query: 349 PESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLK 408
P S+IDRALHNLF++R+RLGLF+GNPT +G+IG + VCS + LAL+AA+ GIVLLK
Sbjct: 371 PISQIDRALHNLFTIRIRLGLFDGNPTKLQYGRIGPNQVCSKENLDLALEAARSGIVLLK 430
Query: 409 NSHGLLPLPKSKSVSLALIGPNAN-SAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPG 467
N+ +LPLP+ + L +IGPNAN S+K +LGNY G CR + L+ Y T Y G
Sbjct: 431 NTASILPLPRVNT--LGVIGPNANKSSKVVLGNYFGRPCRLVPILKGFYTYASQTHYRSG 488
Query: 468 C-DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAE 526
C D C+SA ID+AV++AK +D+V+L+MGLDQ+QE+E DR DL LPG+QQELI VA+
Sbjct: 489 CLDGTKCASAEIDRAVEVAKISDYVILVMGLDQSQERESRDRDDLELPGKQQELINSVAK 548
Query: 527 AAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPM 586
A+KKPVILVLLCGGPVDITFAK + IG I+WAGYPGE G ALA+V+FGD+NPGGRLPM
Sbjct: 549 ASKKPVILVLLCGGPVDITFAKNNDKIGGIIWAGYPGELGGRALAQVVFGDYNPGGRLPM 608
Query: 587 TWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQN 646
TWYP+D+IK+PMTDM+MR +SG PGRTYRFY G +V+ FG GLSYS YSY F +V N
Sbjct: 609 TWYPKDFIKIPMTDMRMRADPSSGYPGRTYRFYTGPKVYEFGYGLSYSNYSYNFISVKNN 668
Query: 647 KLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVK 706
L++NQS++ ++EN + ++YK V ELG E C+T VT+G+ N G MAGKHPVLLFVK
Sbjct: 669 NLHINQSTTHSILENSETIYYKLVSELGEETCKTMSISVTLGITNTGSMAGKHPVLLFVK 728
Query: 707 PARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
P + NG P+KQLVGF+SV + K E+ FE+S CE LSRA E G+ VIEEG H LVVG
Sbjct: 729 PKKGRNGNPVKQLVGFESVTVEGGGKGEVGFEVSVCEHLSRANESGVKVIEEGGHLLVVG 788
Query: 767 DEEYPISI 774
+EEY I+I
Sbjct: 789 EEEYSINI 796
>gi|357489463|ref|XP_003615019.1| hypothetical protein MTR_5g062650 [Medicago truncatula]
gi|355516354|gb|AES97977.1| hypothetical protein MTR_5g062650 [Medicago truncatula]
Length = 785
Score = 1041 bits (2692), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 500/774 (64%), Positives = 609/774 (78%), Gaps = 8/774 (1%)
Query: 8 LVFPLLCLCFTSLLTRVDSTQ--PPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTL 65
+ F L L + L DS PP+SCD +NP T+++ FC L QRA+D+VSRLTL
Sbjct: 11 ISFIFLFLTRYNQLVHADSPTLVPPYSCDITNPLTKSYTFCNLNLTTIQRAKDIVSRLTL 70
Query: 66 DEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNG--TIRGATSFPQVILTA 123
DEK++QLVN+APAIPRLGI +Y+WWSEALHGVA GKGI NG TI+ AT FPQVILTA
Sbjct: 71 DEKLAQLVNTAPAIPRLGIHSYQWWSEALHGVADYGKGIRLNGNVTIKAATIFPQVILTA 130
Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVT 183
ASFDS LWYRI + IG EARA+YNAGQA GMTFWAPNINIFRDPRWGRGQET GEDPLV+
Sbjct: 131 ASFDSKLWYRISKVIGTEARAVYNAGQAEGMTFWAPNINIFRDPRWGRGQETAGEDPLVS 190
Query: 184 GKYAVSYVRGVQGDTFNGGKL-KGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDL 242
KYAVS+VRG+QGD+F GGKL + +L+ASACCKHFTAYDLDNWKG R+ FDA VT+QDL
Sbjct: 191 AKYAVSFVRGLQGDSFEGGKLNEDRLKASACCKHFTAYDLDNWKGVDRFDFDANVTLQDL 250
Query: 243 ADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAV 302
ADTYQPPF SC+ QGR+SGIMCAYNRVNGIP+CAD NLL+ TAR++W F+GYITSDC AV
Sbjct: 251 ADTYQPPFHSCIVQGRSSGIMCAYNRVNGIPNCADYNLLTNTARKKWNFNGYITSDCSAV 310
Query: 303 SIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFS 362
II+D +GYAK+PEDAV DVL+AGMDV CG + H+K+AV QKK+P S+IDRALHNLFS
Sbjct: 311 DIIHDRQGYAKAPEDAVADVLQAGMDVECGDYFTSHSKSAVLQKKVPISQIDRALHNLFS 370
Query: 363 VRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSV 422
+R+RLGLF+G+PT +GKIG + VCS + +AL+AA+ GIVLLKN+ +LPLPKS
Sbjct: 371 IRIRLGLFDGHPTKLKYGKIGPNRVCSKQNLNIALEAARSGIVLLKNAASILPLPKSTD- 429
Query: 423 SLALIGPNANSA-KTLLGNYAGPSCRSITPLQALQNYVENTVYYPGC-DTVACSSASIDK 480
S+ +IGPNANS+ + +LGNY G C +T LQ +NY +N +Y+PGC D C SA ID+
Sbjct: 430 SIVVIGPNANSSSQVVLGNYFGRPCNLVTILQGFENYSDNLLYHPGCSDGTKCVSAEIDR 489
Query: 481 AVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGG 540
AV++AK D+VVL+MGLDQ+QE E DR DL LPG+QQELI VA+A+K+PVILVL CGG
Sbjct: 490 AVEVAKVVDYVVLVMGLDQSQESEGHDRDDLELPGKQQELINSVAKASKRPVILVLFCGG 549
Query: 541 PVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTD 600
PVDI+FAK D IG ILWAGYPGE G +ALA+V+FGD+NPGGRLPMTWYP+D+IK+PMTD
Sbjct: 550 PVDISFAKVDDKIGGILWAGYPGELGGMALAQVVFGDYNPGGRLPMTWYPKDFIKIPMTD 609
Query: 601 MKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVE 660
M+MR +SG PGRTYRFY G +V+ FG GLSYS YSY F +V N L++NQS++ ++E
Sbjct: 610 MRMRADPSSGYPGRTYRFYTGPKVYEFGYGLSYSNYSYNFISVKNNNLHINQSTTYSILE 669
Query: 661 NQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLV 720
+HYK V ELG + C+T VT+G+ N G MAGKHPVLLFVKP + NG P+KQLV
Sbjct: 670 KSQTIHYKLVSELGKKACKTMSISVTLGITNTGSMAGKHPVLLFVKPKKGRNGNPVKQLV 729
Query: 721 GFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
GF+SV + K E+ FE+S CE LSRA E G+ VIEEG + +VG+ EY I+I
Sbjct: 730 GFESVTVEGGGKGEVGFEVSVCEHLSRANESGVKVIEEGGYLFLVGELEYSINI 783
>gi|357489431|ref|XP_003615003.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
gi|355516338|gb|AES97961.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
Length = 780
Score = 1041 bits (2691), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 494/775 (63%), Positives = 607/775 (78%), Gaps = 8/775 (1%)
Query: 6 LSLVFPLLCLCFTSLLTRVDSTQ---PPFSCDPSNPSTETFPFCKTTLPISQRARDLVSR 62
+++ F L L L DS PP+SCD SNP T++FPFC L I+QRA+D+VSR
Sbjct: 6 ITITFIFLFLTRYHRLVHADSLATNVPPYSCDTSNPLTKSFPFCNLNLTITQRAKDIVSR 65
Query: 63 LTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILT 122
LTLDEKISQLVN+APAIPRLGIP+Y+WW+EALHGV+ VGKGI NG+I ATSFPQ+IL
Sbjct: 66 LTLDEKISQLVNTAPAIPRLGIPSYQWWNEALHGVSYVGKGIRLNGSITAATSFPQIILI 125
Query: 123 AASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLV 182
AASFD LWYRI + IG EAR +YNAGQA GMTFWAPNINIFRDPRWGRGQET GEDPLV
Sbjct: 126 AASFDPKLWYRISKVIGTEARGVYNAGQAQGMTFWAPNINIFRDPRWGRGQETAGEDPLV 185
Query: 183 TGKYAVSYVRGVQGDTFNGGKL-KGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQD 241
KY VSYVRG+QGD+F GGKL G+L+ASACCKHFTAYDL+NWKG RY FDA+VT+QD
Sbjct: 186 NSKYGVSYVRGLQGDSFEGGKLIGGRLKASACCKHFTAYDLENWKGVNRYVFDAKVTLQD 245
Query: 242 LADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDA 301
LADTYQP F SCV QGR+SGIMCAYNRVNG+P+CAD NLL+ TAR++W F+GYI SDCDA
Sbjct: 246 LADTYQPSFHSCVVQGRSSGIMCAYNRVNGVPNCADYNLLTNTARKKWNFNGYIASDCDA 305
Query: 302 VSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLF 361
V IY+ +GYAK+PED V DVL+AGMDV CG+++ KH K+AV QKK+P S+IDRALHNLF
Sbjct: 306 VRFIYEKQGYAKTPEDVVADVLRAGMDVECGNYMTKHAKSAVLQKKIPISQIDRALHNLF 365
Query: 362 SVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKS 421
++R+RLGLF+GNPT +G+IG + VCS + LAL+AA+ GIVLLKN+ +LPLP+ +
Sbjct: 366 TIRIRLGLFDGNPTKLQYGRIGPNQVCSKENLDLALEAARSGIVLLKNTASILPLPRVNT 425
Query: 422 VSLALIGPNAN-SAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGC-DTVACSSASID 479
L +IGPNAN S+ LLGNY G C+ ++ L+ Y T Y GC D V C+SA ID
Sbjct: 426 --LGVIGPNANKSSIVLLGNYFGQPCKQVSILKGFYTYASQTHYRSGCTDGVKCASAEID 483
Query: 480 KAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCG 539
+AV++AK +D+V+L+MGLDQ+QE E LDR L LPG+QQ+LI VA+A+KKPVILV+LCG
Sbjct: 484 RAVEVAKISDYVILVMGLDQSQETETLDRDHLELPGKQQKLINSVAKASKKPVILVILCG 543
Query: 540 GPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMT 599
GPVDITFAK + IG I+WAGYPGE G ALA+V+FGD+NPGGRLPMTWYP+D+IK+PMT
Sbjct: 544 GPVDITFAKNNDKIGGIIWAGYPGELGGRALAQVVFGDYNPGGRLPMTWYPKDFIKIPMT 603
Query: 600 DMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMV 659
DM+MR +SG PGRTYRFY G +V+ FG GLSYS YSY F +V N +++NQS++ ++
Sbjct: 604 DMRMRADPSSGYPGRTYRFYTGPKVYEFGYGLSYSNYSYNFISVKNNNIHINQSTTHSIL 663
Query: 660 ENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQL 719
EN + + YK V ELG + C+T VT+G+ N G MAGKHPVLLFVKP + NG P+KQL
Sbjct: 664 ENSETIRYKLVSELGKKACKTMSISVTLGITNTGSMAGKHPVLLFVKPKKGRNGNPVKQL 723
Query: 720 VGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
VGF+SV + K E+ FE+S CE LSRA E G+ VIEEG + +VG+ EY I+I
Sbjct: 724 VGFESVTVEGGGKGEVGFEVSVCEHLSRANESGVKVIEEGGYLFLVGELEYSINI 778
>gi|371917284|dbj|BAL44718.1| SlArf/Xyl3 [Solanum lycopersicum]
Length = 777
Score = 1018 bits (2632), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 473/774 (61%), Positives = 595/774 (76%), Gaps = 4/774 (0%)
Query: 2 IFHKLSLVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVS 61
I +K +++F + L L R +ST+PPFSCD SNP+T ++PFC LPI QR DLVS
Sbjct: 5 ITNKFTVMFIFVILVL--LFRRTESTKPPFSCDSSNPNTSSYPFCNAALPIPQRVNDLVS 62
Query: 62 RLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVIL 121
RLT+DEKI QLVN AP IPRLGI AYEWWSE LHG++ GKG FNGTI+ AT FPQ+IL
Sbjct: 63 RLTVDEKILQLVNGAPEIPRLGISAYEWWSEGLHGISRHGKGTLFNGTIKAATQFPQIIL 122
Query: 122 TAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPL 181
TA+SFD LWYRI QAIG EARA+YNAGQ G+T WAPNINI RDPRWGRGQETPGEDP+
Sbjct: 123 TASSFDENLWYRIAQAIGREARAVYNAGQLKGITLWAPNINILRDPRWGRGQETPGEDPM 182
Query: 182 VTGKYAVSYVRGVQGDTFNGGKLK-GKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQ 240
+ GKY V+YVRG+QGD+F GGKLK G LQ SACCKHF A D+DNW +RY FDA+V Q
Sbjct: 183 MVGKYGVAYVRGLQGDSFEGGKLKDGHLQTSACCKHFIAQDMDNWHNFSRYTFDAQVLKQ 242
Query: 241 DLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCD 300
DLAD+Y+PPF+ CV+QG+AS +MCAYN VNGIP+CA+ +LL+ TAR +WG GYI SDCD
Sbjct: 243 DLADSYEPPFKDCVEQGKASSVMCAYNLVNGIPNCANFDLLTTTARGKWGLQGYIVSDCD 302
Query: 301 AVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNL 360
AV +Y + YAK PEDAV LKAGMDVNCGS L+ +TK+A++++K+ ES+IDRALHNL
Sbjct: 303 AVDKMYSEQHYAKEPEDAVAATLKAGMDVNCGSHLKTYTKSALEKQKVKESDIDRALHNL 362
Query: 361 FSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSK 420
FSVRMRLGLFNG+P+ +G I A VCS H+ LA++AA+ G VLLKNS+ LLPL K K
Sbjct: 363 FSVRMRLGLFNGDPSKLEYGDISAAEVCSEEHRALAVEAARSGSVLLKNSNRLLPLSKMK 422
Query: 421 SVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDK 480
+ SLA+IGP AN ++ LLGNY G SC+++T Q LQ YV NT+Y+PGCD + C+S +ID+
Sbjct: 423 TASLAVIGPKANDSEVLLGNYEGFSCKNVTLFQGLQGYVANTMYHPGCDFINCTSPAIDE 482
Query: 481 AVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGG 540
AV+IAK AD+VVL+MGLDQT E+E+ DR +L LPG Q++LIT +AEAA KPVILVL+CGG
Sbjct: 483 AVNIAKKADYVVLVMGLDQTLEREKFDRTELGLPGMQEKLITSIAEAASKPVILVLMCGG 542
Query: 541 PVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTD 600
PVD+TFAK + IG ILW GYPGE GA ALA+++FG+HNPGGR P+TWYP+++ KV M D
Sbjct: 543 PVDVTFAKDNPKIGGILWVGYPGEGGAAALAQILFGEHNPGGRSPVTWYPKEFNKVAMND 602
Query: 601 MKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVE 660
M+MRP+++SG PGRTYRFY G +VF FG GLSY+ YSY F +VS+N+L + E
Sbjct: 603 MRMRPESSSGYPGRTYRFYNGPKVFEFGYGLSYTNYSYTFASVSKNQLLFKNPKINQSTE 662
Query: 661 NQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLV 720
V++ +V ++G E C + V + VKN GEMAGKHPVLLF+K + + P K L+
Sbjct: 663 KGSVLNI-AVSDVGPEVCNSAMITVKVAVKNQGEMAGKHPVLLFLKHSSTVDEVPKKTLI 721
Query: 721 GFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
GF+SV L A ++ F++ PCE +RA DG +VI+EG HFL++GD+EYPI +
Sbjct: 722 GFKSVNLEAGANTQVTFDVKPCEHFTRANRDGTLVIDEGKHFLLLGDQEYPIPV 775
>gi|242062502|ref|XP_002452540.1| hypothetical protein SORBIDRAFT_04g027700 [Sorghum bicolor]
gi|241932371|gb|EES05516.1| hypothetical protein SORBIDRAFT_04g027700 [Sorghum bicolor]
Length = 784
Score = 998 bits (2580), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 474/753 (62%), Positives = 586/753 (77%), Gaps = 7/753 (0%)
Query: 26 STQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIP 85
+++PP++C P PFC T LPI +R DLVSRLT+ EKISQL + +PAIPRLG+P
Sbjct: 35 ASEPPYTCGAGAPPN--IPFCDTALPIDRRVDDLVSRLTVAEKISQLGDESPAIPRLGVP 92
Query: 86 AYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARAL 145
AY+WWSEALHGVA G+GI +G +R ATSFPQVILTAASF+ +LWYRIGQ IG+EARA+
Sbjct: 93 AYKWWSEALHGVANAGRGIHLDGPLRAATSFPQVILTAASFNPHLWYRIGQVIGVEARAV 152
Query: 146 YNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLK 205
YN GQA G+TFWAPNIN+FRDPRWGRGQETPGEDP +TGKYA +VRGVQG G
Sbjct: 153 YNNGQAEGLTFWAPNINVFRDPRWGRGQETPGEDPTMTGKYAAVFVRGVQGYGVAGPVNS 212
Query: 206 GKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCA 265
L+ASACCKHFTAYDL+NWKG TRY +DA+VT QDL DTY PPF+SCV+ G ASGIMC+
Sbjct: 213 TDLEASACCKHFTAYDLENWKGITRYVYDAKVTAQDLEDTYNPPFKSCVEDGHASGIMCS 272
Query: 266 YNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKA 325
YNRVNG+P+CAD NLLSKTAR+ WGF+GYITSDCDAVSII+DA+GYAK+ EDAV DVLKA
Sbjct: 273 YNRVNGVPTCADYNLLSKTARQSWGFYGYITSDCDAVSIIHDAQGYAKTSEDAVADVLKA 332
Query: 326 GMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD 385
GMDVNCG ++QK+ +A++Q K+ E +I+RALHNLF+VRMRLGLFNG+P +G IG D
Sbjct: 333 GMDVNCGGYVQKYGASALQQGKITEQDINRALHNLFTVRMRLGLFNGDPRRNRYGNIGPD 392
Query: 386 VVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPS 445
VC+ HQ LAL+AAQDGIVLLKN G LPL KS SLA+IG NAN+A +LLGNY GP
Sbjct: 393 QVCTQEHQDLALEAAQDGIVLLKNDGGALPLSKSGVASLAVIGFNANNATSLLGNYFGPP 452
Query: 446 CRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE 505
C ++TPLQ LQ YV++T + GC++ AC+ +I +AV A AD VVL MGLDQ QE+EE
Sbjct: 453 CVTVTPLQVLQGYVKDTSFVAGCNSAACNVTTIPEAVQAASSADSVVLFMGLDQNQEREE 512
Query: 506 LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEA 565
+DR+DL LPG+QQ LI VA AAKKPVILVLLCGGPVD++FAK + IG+ILWAGYPGEA
Sbjct: 513 VDRLDLTLPGQQQTLIESVANAAKKPVILVLLCGGPVDVSFAKTNPKIGAILWAGYPGEA 572
Query: 566 GAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVF 625
G +A+A+V+FG+HNPGGRLP+TWYPQD+ KVPMTDM+MR +G PGRTYRFY G VF
Sbjct: 573 GGIAIAQVLFGEHNPGGRLPVTWYPQDFTKVPMTDMRMRADPATGYPGRTYRFYRGPTVF 632
Query: 626 PFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTK-MVENQDVVHYKSVPELGTEFCETRKFL 684
FG GLSYSKYS++F V++ ++ + K + V V +G+E C+ KF
Sbjct: 633 NFGYGLSYSKYSHRF--VTKPPPSMSNVAGLKALATTAGGVATYDVEAIGSETCDRLKFP 690
Query: 685 VTIGVKNHGEMAGKHPVLLFVK--PARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPC 742
+ V+NHG M GKHPVL+F++ A G+GRP +QL+GFQS+ L A + A + FE+SPC
Sbjct: 691 AVVRVQNHGPMDGKHPVLVFLRWPNATDGSGRPARQLIGFQSLHLRATQTAHVEFEVSPC 750
Query: 743 ESLSRAREDGLMVIEEGTHFLVVGDEEYPISIF 775
+ SRA EDG VI++G+HF++VGD+E+ +S
Sbjct: 751 KHFSRATEDGRKVIDQGSHFVMVGDDEFEMSFM 783
>gi|218191593|gb|EEC74020.1| hypothetical protein OsI_08964 [Oryza sativa Indica Group]
Length = 774
Score = 983 bits (2540), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 470/750 (62%), Positives = 571/750 (76%), Gaps = 9/750 (1%)
Query: 29 PPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYE 88
PPFSC PS+ F C LPI QRA DLVSRLTL+EKISQL + +PA+ RLG+PAY+
Sbjct: 28 PPFSC--GAPSSAAF--CNPRLPIEQRADDLVSRLTLEEKISQLGDQSPAVDRLGVPAYK 83
Query: 89 WWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA 148
WWSEALHGV+ G+GI +G +R ATSFPQVILTAASF+ +LWYRIGQ IG EARA+YN
Sbjct: 84 WWSEALHGVSNAGRGIHLDGPLRAATSFPQVILTAASFNPHLWYRIGQVIGTEARAVYNN 143
Query: 149 GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKL 208
GQA G+TFWAPNIN+FRDPRWGRGQETPGEDP VTGKYA +VRGVQG G L
Sbjct: 144 GQAEGLTFWAPNINVFRDPRWGRGQETPGEDPTVTGKYAAVFVRGVQGYALAGAINSTDL 203
Query: 209 QASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
+ASACCKHFTAYDL+NWKG TRY FDA+VT QDLADTY PPF SCV+ G ASGIMC+YNR
Sbjct: 204 EASACCKHFTAYDLENWKGVTRYAFDAKVTAQDLADTYNPPFRSCVEDGGASGIMCSYNR 263
Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
VNG+P+CAD NLLSKTAR W F+GYITSDCDAVSII+D +GYAK+ EDAV DVLKAGMD
Sbjct: 264 VNGVPTCADYNLLSKTARGDWRFYGYITSDCDAVSIIHDVQGYAKTAEDAVADVLKAGMD 323
Query: 329 VNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVC 388
VNCGS++Q+H +A++Q K+ E +I+RALHNLF+VRMRLGLFNGNP +G IG D VC
Sbjct: 324 VNCGSYVQEHGLSAIQQGKITEQDINRALHNLFAVRMRLGLFNGNPKYNRYGNIGPDQVC 383
Query: 389 SPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS 448
+ HQ LAL+AAQ G+VLLKN LPL KS+ S+A+IG NAN A LLGNY GP C S
Sbjct: 384 TQEHQNLALEAAQHGVVLLKNDANALPLSKSQVSSIAVIGHNANDATRLLGNYFGPPCIS 443
Query: 449 ITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDR 508
+TPLQ LQ YV++T + GC++ AC+ +SI +A +A D+VVL MGLDQ QE+EE+DR
Sbjct: 444 VTPLQVLQGYVKDTRFLAGCNSAACNVSSIGEAAQLASSVDYVVLFMGLDQDQEREEVDR 503
Query: 509 VDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAV 568
++L LPG Q+ LI VA AAKKPVILVLLCGGPVD+TFAKY+ IG+ILWAGYPGEAG +
Sbjct: 504 LELSLPGMQENLINTVANAAKKPVILVLLCGGPVDVTFAKYNPKIGAILWAGYPGEAGGI 563
Query: 569 ALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFG 628
A+A+V+FG+HNPGGRLP+TWYP+++ VPMTDM+MR ++G PGRTYRFY G V+ FG
Sbjct: 564 AIAQVLFGEHNPGGRLPVTWYPKEFTSVPMTDMRMRADPSTGYPGRTYRFYRGNTVYKFG 623
Query: 629 CGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVEN--QDVVHYKSVPELGTEFCETRKFLVT 686
GLSYSKYS+ F A L+ K + V Y V E+GTE C+ KF
Sbjct: 624 YGLSYSKYSHHFVANGTKLPSLSSIDGLKAMATAAAGTVSY-DVEEIGTETCDKLKFPAL 682
Query: 687 IGVKNHGEMAGKHPVLLFVK--PARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCES 744
+ V+NHG M G+HPVLLF++ GRP QL+GFQS+ L + + + FE+SPC+
Sbjct: 683 VRVQNHGPMDGRHPVLLFLRWPNGAADGGRPASQLIGFQSLHLKSMQTVHVEFEVSPCKH 742
Query: 745 LSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
SRA EDG VI+ G+HF++VGD+E+ +S
Sbjct: 743 FSRATEDGKKVIDHGSHFMMVGDDEFEMSF 772
>gi|18025342|gb|AAK38482.1| beta-D-xylosidase [Hordeum vulgare]
Length = 777
Score = 979 bits (2532), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 465/752 (61%), Positives = 572/752 (76%), Gaps = 6/752 (0%)
Query: 26 STQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIP 85
+ PPFSC PS+ F C LPI QRA DLVS+LTL+EKISQL + +PA+ RLG+P
Sbjct: 29 AADPPFSC--GAPSSAAF--CDRRLPIEQRAADLVSKLTLEEKISQLGDESPAVDRLGVP 84
Query: 86 AYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARAL 145
AY+WWSEALHGVA G+G+ +G +R ATSFPQVILTAASF+ +LWYRIGQ IG EAR +
Sbjct: 85 AYKWWSEALHGVANAGRGVHLDGPLRAATSFPQVILTAASFNPHLWYRIGQVIGTEARGV 144
Query: 146 YNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLK 205
YN GQA G+TFWAPNIN+FRDPRWGRGQETPGEDP +TGKYA +VRGVQG +G
Sbjct: 145 YNNGQAEGLTFWAPNINVFRDPRWGRGQETPGEDPTMTGKYAAVFVRGVQGYGMSGAINS 204
Query: 206 GKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCA 265
L+ASACCKHFTAYDL+NWKG TR+ FDA+VT QDLADTY PPF+SCV+ G ASGIMC+
Sbjct: 205 SDLEASACCKHFTAYDLENWKGVTRFAFDAKVTEQDLADTYNPPFKSCVEDGGASGIMCS 264
Query: 266 YNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKA 325
YNRVNG+P+CAD NLLSKTAR W F+GYITSDCDAV+II+D +GYAK+PEDAV DVLKA
Sbjct: 265 YNRVNGVPTCADHNLLSKTARGDWSFNGYITSDCDAVAIIHDVQGYAKAPEDAVADVLKA 324
Query: 326 GMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD 385
GMDVNCG ++Q H +A +Q K+ +IDRAL NLF++RMRLGLF+GNP +G IGAD
Sbjct: 325 GMDVNCGGYIQTHGVSAYQQGKITGEDIDRALRNLFAIRMRLGLFDGNPKYNRYGNIGAD 384
Query: 386 VVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPS 445
VCS HQ LALQAA+DGIVLLKN LPL KSK SLA+IGPN N+A LLGNY GP
Sbjct: 385 QVCSKEHQDLALQAARDGIVLLKNDGAALPLSKSKVSSLAVIGPNGNNASLLLGNYFGPP 444
Query: 446 CRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE 505
C S+TPLQALQ YV++ + GC+ C+ ++I +AV A AD+VVL MGLDQ QE+EE
Sbjct: 445 CISVTPLQALQGYVKDARFVQGCNAAVCNVSNIGEAVHAAGSADYVVLFMGLDQNQEREE 504
Query: 506 LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEA 565
+DR++L LPG Q+ L+ VA+AAKKPVILVLLCGGPVD+TFAK + IG+I+WAGYPG+A
Sbjct: 505 VDRLELGLPGMQESLVNSVADAAKKPVILVLLCGGPVDVTFAKNNPKIGAIVWAGYPGQA 564
Query: 566 GAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVF 625
G +A+A+V+FGDHNPGGRLP+TWYP+++ VPMTDM+MR ++G PGRTYRFY+GK V+
Sbjct: 565 GGIAIAQVLFGDHNPGGRLPVTWYPKEFTAVPMTDMRMRADPSTGYPGRTYRFYKGKTVY 624
Query: 626 PFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKS--VPELGTEFCETRKF 683
FG GLSYSKYS++F + ++ K S V E+G E C+ +F
Sbjct: 625 NFGYGLSYSKYSHRFASKGTKPPSMSGIEGLKATARASAAGTVSYDVEEMGAEACDRLRF 684
Query: 684 LVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCE 743
+ V+NHG M G H VLLF++ +GRP QL+GFQSV L A E A + FE+SPC+
Sbjct: 685 PAVVRVQNHGPMDGGHLVLLFLRWPNATDGRPASQLIGFQSVHLRADEAAHVEFEVSPCK 744
Query: 744 SLSRAREDGLMVIEEGTHFLVVGDEEYPISIF 775
LSRA EDG VI++G+HF+ VGD+E+ +S
Sbjct: 745 HLSRAAEDGRKVIDQGSHFVRVGDDEFELSFM 776
>gi|115448721|ref|NP_001048140.1| Os02g0752200 [Oryza sativa Japonica Group]
gi|46390122|dbj|BAD15557.1| putative beta-D-xylosidase [Oryza sativa Japonica Group]
gi|46390225|dbj|BAD15656.1| putative beta-D-xylosidase [Oryza sativa Japonica Group]
gi|113537671|dbj|BAF10054.1| Os02g0752200 [Oryza sativa Japonica Group]
gi|125583710|gb|EAZ24641.1| hypothetical protein OsJ_08409 [Oryza sativa Japonica Group]
Length = 780
Score = 979 bits (2531), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 469/750 (62%), Positives = 570/750 (76%), Gaps = 9/750 (1%)
Query: 29 PPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYE 88
PPFSC PS+ F C LPI QRA DLVSRLTL+EKISQL + +PA+ RLG+PAY+
Sbjct: 34 PPFSC--GAPSSAAF--CNPRLPIEQRADDLVSRLTLEEKISQLGDQSPAVDRLGVPAYK 89
Query: 89 WWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA 148
WWSEALHGV+ G+GI +G +R ATSFPQVILTAASF+ +LWYRIGQ IG EARA+YN
Sbjct: 90 WWSEALHGVSNAGRGIHLDGPLRAATSFPQVILTAASFNPHLWYRIGQVIGTEARAVYNN 149
Query: 149 GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKL 208
GQA G+TFWAPNIN+FRDPRWGRGQETPGEDP VTGKYA +VRGVQG G L
Sbjct: 150 GQAEGLTFWAPNINVFRDPRWGRGQETPGEDPTVTGKYAAVFVRGVQGYALAGAINSTDL 209
Query: 209 QASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
+ASACCKHFTAYDL+NWKG TRY FDA+VT QDLADTY PPF SCV+ G ASGIMC+YNR
Sbjct: 210 EASACCKHFTAYDLENWKGVTRYAFDAKVTAQDLADTYNPPFRSCVEDGGASGIMCSYNR 269
Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
VNG+P+CAD NLLSKTAR W F+GYITSDCDAVSII+D +GYAK+ EDAV DVLKAGMD
Sbjct: 270 VNGVPTCADYNLLSKTARGDWRFYGYITSDCDAVSIIHDVQGYAKTAEDAVADVLKAGMD 329
Query: 329 VNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVC 388
VNCGS++Q+H +A++Q K+ E +I+RALHNLF+VRMRLGLFNGNP +G IG D VC
Sbjct: 330 VNCGSYVQEHGLSAIQQGKITEQDINRALHNLFAVRMRLGLFNGNPKYNRYGNIGPDQVC 389
Query: 389 SPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS 448
+ HQ LAL+AAQ G+VLLKN LPL KS+ S+A+IG NAN A LLGNY GP C S
Sbjct: 390 TQEHQNLALEAAQHGVVLLKNDANALPLSKSQVSSIAVIGHNANDATRLLGNYFGPPCIS 449
Query: 449 ITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDR 508
+TPLQ LQ YV++T + GC++ AC+ +SI +A +A D+VVL MGLDQ QE+EE+DR
Sbjct: 450 VTPLQVLQGYVKDTRFLAGCNSAACNVSSIGEAAQLASSVDYVVLFMGLDQDQEREEVDR 509
Query: 509 VDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAV 568
++L LPG Q+ LI VA AAKKPVILVLLCGGPVD+TFAKY+ IG+ILWAGYPGEAG +
Sbjct: 510 LELSLPGMQENLINTVANAAKKPVILVLLCGGPVDVTFAKYNPKIGAILWAGYPGEAGGI 569
Query: 569 ALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFG 628
A+A+V+FG+HNPGGRLP+TWYP+++ VPMTDM+MR ++G PGRTYRFY G V+ FG
Sbjct: 570 AIAQVLFGEHNPGGRLPVTWYPKEFTSVPMTDMRMRADPSTGYPGRTYRFYRGNTVYKFG 629
Query: 629 CGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVEN--QDVVHYKSVPELGTEFCETRKFLVT 686
GLSYSKYS+ F A L+ K + V Y V E+G E C+ KF
Sbjct: 630 YGLSYSKYSHHFVANGTKLPSLSSIDGLKAMATAAAGTVSY-DVEEIGPETCDKLKFPAL 688
Query: 687 IGVKNHGEMAGKHPVLLFVK--PARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCES 744
+ V+NHG M G+HPVLLF++ GRP QL+GFQS+ L + + + FE+SPC+
Sbjct: 689 VRVQNHGPMDGRHPVLLFLRWPNGAADGGRPASQLIGFQSLHLKSMQTVHVEFEVSPCKH 748
Query: 745 LSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
SRA EDG VI+ G+HF++VGD+E+ +S
Sbjct: 749 FSRATEDGKKVIDHGSHFMMVGDDEFEMSF 778
>gi|212275712|ref|NP_001130324.1| uncharacterized protein LOC100191418 precursor [Zea mays]
gi|194688848|gb|ACF78508.1| unknown [Zea mays]
gi|413938927|gb|AFW73478.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 780
Score = 975 bits (2521), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 473/779 (60%), Positives = 585/779 (75%), Gaps = 13/779 (1%)
Query: 5 KLSLVFPLLCLCFTSLL---TRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVS 61
+LSLV L L V +++PP++C P PFC LPI +R DLVS
Sbjct: 6 RLSLVHAAAALVVLLQLHGGAVVVASEPPYTCGAGAPPN--IPFCDAGLPIDRRVDDLVS 63
Query: 62 RLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVIL 121
R+T+ EKISQL + +PAIPRLG+PAY+WWSEALHG++ G+GI +G +R ATSFPQVIL
Sbjct: 64 RMTVAEKISQLGDQSPAIPRLGVPAYKWWSEALHGISNQGRGIHLDGPLRAATSFPQVIL 123
Query: 122 TAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPL 181
TAASF+ +LWYRIGQ IG+EARA+YN GQA G+TFWAPNIN+FRDPRWGRGQETPGEDP
Sbjct: 124 TAASFNPHLWYRIGQVIGVEARAVYNNGQAEGLTFWAPNINVFRDPRWGRGQETPGEDPT 183
Query: 182 VTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQD 241
+TGKYA +VRGVQG G L+ASACCKHFTAYDL+NWKG TRY FDA+VT QD
Sbjct: 184 MTGKYAAVFVRGVQGYGLAGPVNSTGLEASACCKHFTAYDLENWKGVTRYVFDAKVTAQD 243
Query: 242 LADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDA 301
LADTY PPF+SCV+ G ASGIMC+YNRVNG+P+CAD NLLS TAR+ WGF+GYITSDCDA
Sbjct: 244 LADTYNPPFKSCVEDGHASGIMCSYNRVNGVPTCADYNLLSTTARQDWGFYGYITSDCDA 303
Query: 302 VSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLF 361
V+II+DA+GYAK+ EDAV DVLKAGMDVNCGS++Q H +A++Q K+ E +I+RALHNLF
Sbjct: 304 VAIIHDAQGYAKTAEDAVADVLKAGMDVNCGSYVQDHGASALQQGKITEQDINRALHNLF 363
Query: 362 SVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNS--HGLLPLPKS 419
+VRMRLGLFNG+P +G IG D VC+ HQ LAL+AAQDGIVLLKN G LPL K
Sbjct: 364 AVRMRLGLFNGDPRRNLYGDIGPDQVCTQEHQDLALEAAQDGIVLLKNDGGAGALPLSKP 423
Query: 420 KSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASID 479
SLA+IG NAN A L GNY GP C ++TPLQ LQ YV++T + GC++ AC+ +I
Sbjct: 424 NVASLAVIGFNANDAIRLRGNYFGPPCVTVTPLQVLQGYVKDTSFVAGCNSAACNVTTIP 483
Query: 480 KAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCG 539
+AV A AD VVL MGLDQ QE+EE+DR+DL LPG+QQ LI VA AAKKPVILVLLCG
Sbjct: 484 EAVQAASSADSVVLFMGLDQDQEREEVDRLDLTLPGQQQTLIESVANAAKKPVILVLLCG 543
Query: 540 GPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMT 599
GPVD++FAK + IG+ILWAGYPGEAG +A+A+V+FG+HNPGGRLP+TWYPQD+ +VPMT
Sbjct: 544 GPVDVSFAKTNPKIGAILWAGYPGEAGGIAIAQVLFGEHNPGGRLPVTWYPQDFTRVPMT 603
Query: 600 DMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMV 659
DM+MR +G PGRTYRFY G VF FG GLSYSKYS++F + + + K V
Sbjct: 604 DMRMRADPATGYPGRTYRFYRGPTVFNFGYGLSYSKYSHRF---ATKPPPTSNVAGLKAV 660
Query: 660 E-NQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVK--PARRGNGRPI 716
E + V +G+E C+ KF + V+NHG M GKH VL+F++ A G+GRP
Sbjct: 661 EATAGGMASYDVEAIGSETCDRLKFPAVVRVQNHGPMDGKHSVLVFMRWPNATDGSGRPA 720
Query: 717 KQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIF 775
QL+GFQS+ L A + A + FE+SPC+ SRA EDG VI++G+HF++VG++E+ +S
Sbjct: 721 SQLIGFQSLHLRATQTAHVEFEVSPCKHFSRATEDGRKVIDQGSHFVMVGEDEFEMSFM 779
>gi|357489437|ref|XP_003615006.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
gi|355516341|gb|AES97964.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
Length = 685
Score = 920 bits (2377), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 435/677 (64%), Positives = 535/677 (79%), Gaps = 5/677 (0%)
Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
GI NG+I ATSFPQVILTAASFD LWY+I + IG EAR +YNAGQA GM FWAPNIN
Sbjct: 2 GIILNGSIPAATSFPQVILTAASFDPKLWYQISKVIGTEARGVYNAGQAQGMNFWAPNIN 61
Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKL-KGKLQASACCKHFTAYD 221
IFRDPRWGRGQET GEDPLV KY VSYVRG+QGD+F GGKL G+L+ASACCKHFTAYD
Sbjct: 62 IFRDPRWGRGQETAGEDPLVNSKYGVSYVRGLQGDSFEGGKLIGGRLKASACCKHFTAYD 121
Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
L+NWKG RY FDA+VT+QDLADTYQP F SCV QGR+SGIMCAYNRVNG+P+CAD NLL
Sbjct: 122 LENWKGVNRYVFDAKVTLQDLADTYQPSFHSCVVQGRSSGIMCAYNRVNGVPNCADYNLL 181
Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA 341
+ TAR++W F+GYI SDCDAV IY+ +GYAK+PED V DVL+AGMD+ CG+++ KH K+
Sbjct: 182 TNTARKKWNFNGYIASDCDAVRFIYEKQGYAKTPEDVVADVLRAGMDLECGNYMTKHAKS 241
Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQ 401
AV QKK+P S+IDRALHNLF++R+RLGLF+GNPT +G+IG + VCS + LAL+AA+
Sbjct: 242 AVLQKKIPISQIDRALHNLFTIRIRLGLFDGNPTKLQYGRIGPNQVCSKENLDLALEAAR 301
Query: 402 DGIVLLKNSHGLLPLPKSKSVSLALIGPNAN-SAKTLLGNYAGPSCRSITPLQALQNYVE 460
GIVLLKN+ +LPLP+ + L +IGPNAN S+ LLGNY GP C++++ L+ Y
Sbjct: 302 SGIVLLKNTASILPLPRVNT--LGVIGPNANKSSIVLLGNYIGPPCKNVSILKGFYTYAS 359
Query: 461 NTVYYPGC-DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQE 519
T Y+ GC D C+SA ID+AV++AK +D+V+L+MGLDQ+QE E LDR L LPG+QQ+
Sbjct: 360 QTHYHSGCTDGTKCASAEIDRAVEVAKISDYVILVMGLDQSQETETLDRDHLELPGKQQK 419
Query: 520 LITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHN 579
LI VA+A+KKPVILVLLCGGPVDITFAK + IG I+WAGYPGE G ALA+V+FGD+N
Sbjct: 420 LINSVAKASKKPVILVLLCGGPVDITFAKNNDKIGGIIWAGYPGELGGRALAQVVFGDYN 479
Query: 580 PGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYK 639
PGGRLPMTWYP+D+IK+PMTDM+MR +SG PGRTYRFY G +V+ FG GLSYS YSY
Sbjct: 480 PGGRLPMTWYPKDFIKIPMTDMRMRADPSSGYPGRTYRFYTGPKVYEFGYGLSYSNYSYN 539
Query: 640 FKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKH 699
F +V N L++NQS++ ++EN + ++YK V ELG E C+T VT+G+ N G MAGKH
Sbjct: 540 FISVKNNNLHINQSTTYSILENSETINYKLVSELGEETCKTMSISVTLGITNTGSMAGKH 599
Query: 700 PVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEG 759
PVLLFVKP + NG P+KQLVGF+SV + K E+ FE+S CE LSRA E G+ VIEEG
Sbjct: 600 PVLLFVKPKKGRNGNPVKQLVGFESVTVEGGGKGEVGFEVSVCEHLSRANESGVKVIEEG 659
Query: 760 THFLVVGDEEYPISIFV 776
+ +VG EEY I+I +
Sbjct: 660 GYLFLVGQEEYSINIML 676
>gi|357152329|ref|XP_003576084.1| PREDICTED: probable beta-D-xylosidase 7-like [Brachypodium
distachyon]
Length = 779
Score = 896 bits (2316), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 420/751 (55%), Positives = 548/751 (72%), Gaps = 12/751 (1%)
Query: 29 PPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYE 88
PPFSC P + +T+ + FC LP+ +RA DLVSRLTL EK+SQL + A A+PRLG+PAY+
Sbjct: 34 PPFSCGPGS-ATQGYAFCDKALPVERRAADLVSRLTLAEKVSQLGDEADAVPRLGVPAYK 92
Query: 89 WWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA 148
WWSE LHG++ G G+ F+G +R TSFPQV+LTAASFD +WYRIGQAIG EARALYN
Sbjct: 93 WWSEGLHGLSFWGHGMHFDGAVRAITSFPQVLLTAASFDQDIWYRIGQAIGTEARALYNL 152
Query: 149 GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKL 208
GQA G+T W+PN+NI+RDPRWGRGQETPGEDP KYAV++V+G+QG + L
Sbjct: 153 GQAQGLTIWSPNVNIYRDPRWGRGQETPGEDPTTASKYAVAFVKGLQGTSAT------TL 206
Query: 209 QASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
Q SACCKH TAYDL++W G RY F+A+VT+QDLADT+ PPF+SCV++G+A+ +MCAY
Sbjct: 207 QTSACCKHATAYDLEDWNGVVRYNFNAKVTLQDLADTFNPPFKSCVEEGKATCVMCAYTN 266
Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
+NG+P+CA +L++KT + WG +GY++SDCDAV+++ DA+ Y +PED V LKAG+D
Sbjct: 267 INGVPACASSDLITKTFKGDWGLNGYVSSDCDAVALLRDAQRYRATPEDTVAVALKAGLD 326
Query: 329 VNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNP-TMQPFGKIGADVV 387
+NCG++ Q H +A++Q K+ E ++D AL NLF+VRMRLG F+G+P T +G +GA V
Sbjct: 327 LNCGNYTQVHGMSALQQGKMTEQDVDNALKNLFAVRMRLGHFDGDPRTSALYGSLGAADV 386
Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
CSPAH+ LAL+AAQ GIVLLKN G+LPL S S A IG NAN L GNY GP C
Sbjct: 387 CSPAHKNLALEAAQSGIVLLKNDAGILPLDPSAVASAAAIGHNANDPAALNGNYFGPPCE 446
Query: 448 SITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELD 507
+ TPLQ LQ YV+N + GCD+ AC A+ +AV +A +D+V+L MGL Q +E+E +D
Sbjct: 447 TTTPLQGLQGYVKNVKFLAGCDSAACGFAATGQAVTLASSSDYVILFMGLSQKEEQEGID 506
Query: 508 RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGA 567
R L+LPG+QQ LIT VA A+K+PVILVLL GG VDITFAK + IG+ILWAGYPG+AG
Sbjct: 507 RTSLLLPGKQQNLITAVASASKRPVILVLLTGGSVDITFAKSNPKIGAILWAGYPGQAGG 566
Query: 568 VALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPF 627
+A+A V+FGDHNP GRLP+TWYP+++ KVPMTDM+MR +G PGR+YRFY+GK V+ F
Sbjct: 567 LAIARVLFGDHNPSGRLPVTWYPEEFTKVPMTDMRMRADPATGYPGRSYRFYQGKTVYKF 626
Query: 628 GCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDV---VHYKSVPELGTEFCETRKFL 684
G GLSYSK+S + + + N + T + + Y V E+G E C+ KF
Sbjct: 627 GDGLSYSKFSRQLVSSTNTHQVPNTNLLTGLTARTATDGGMSYYHVEEIGVEGCDKLKFP 686
Query: 685 VTIGVKNHGEMAGKHPVLLFVK-PARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCE 743
+ V+NHG M GKH V++F++ P G GRP+ QLVGF+S L A EKA + F++SPCE
Sbjct: 687 AVVEVQNHGPMDGKHSVMMFLRWPNSTGTGRPVSQLVGFRSQHLKAGEKASLTFDVSPCE 746
Query: 744 SLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
+RAREDG VI+ G+HFLVVG +E IS
Sbjct: 747 HFARAREDGKKVIDRGSHFLVVGKDEREISF 777
>gi|357156390|ref|XP_003577440.1| PREDICTED: probable beta-D-xylosidase 7-like [Brachypodium
distachyon]
Length = 755
Score = 879 bits (2272), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 418/754 (55%), Positives = 543/754 (72%), Gaps = 20/754 (2%)
Query: 29 PPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYE 88
P FSC P P + FC LP QRA DLV++LTL+EK+SQL + AP +PR G+P Y
Sbjct: 12 PAFSCGP--PQQAQYAFCNRALPAEQRAADLVAKLTLEEKVSQLGDQAPGVPRFGVPGYN 69
Query: 89 WWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA 148
WWSE LHGV+ G G+ FNG +RG T+FPQV+LT ASFD +WYRIGQAIG EARA++N
Sbjct: 70 WWSEGLHGVSMWGHGMHFNGAVRGVTTFPQVLLTTASFDDSIWYRIGQAIGTEARAMFNL 129
Query: 149 GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKL 208
GQA G+T W+PN+NI+RDPRWGRGQETPGEDP KYAV++VRG+QG + L
Sbjct: 130 GQADGLTIWSPNVNIYRDPRWGRGQETPGEDPATASKYAVAFVRGLQGTSTT------TL 183
Query: 209 QASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
Q SACCKH TAYDLD+W RY F+A+VT QDL +T+ PPF+SCV +G+A+ +MCAY
Sbjct: 184 QTSACCKHATAYDLDDWNRIGRYNFNAKVTAQDLEETFNPPFKSCVVEGKATCVMCAYTS 243
Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
VNGIP+CAD LL+KT + +WG +GYI+SDCDAV+++Y Y+ +PEDAV +KAG+D
Sbjct: 244 VNGIPACADSGLLTKTIKGEWGMNGYISSDCDAVALLYGTR-YSGTPEDAVAAAIKAGLD 302
Query: 329 VNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQP-FGKIGADVV 387
+NCG+F Q H AA++Q+K+ E ++D+AL NLF++RMRLG F+G+P P +G++GA V
Sbjct: 303 MNCGNFSQVHGMAALQQRKMSEQDVDKALRNLFAIRMRLGHFDGDPLQSPLYGRLGAQDV 362
Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPL--PKSKSVSLALIGPNANSAKTLLGNYAGPS 445
CSPAH+ LAL+AAQ+GIVLLKN LPL P + S S A+IGPNAN LLGNY GP
Sbjct: 363 CSPAHKDLALEAAQNGIVLLKNDAATLPLSRPTAASASFAVIGPNANEPGALLGNYFGPP 422
Query: 446 CRSITPLQALQN-YVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKE 504
C + TPLQALQ Y +N + PGCD+ AC+ A +A +A +D+ +L MGL Q QE+E
Sbjct: 423 CETTTPLQALQKFYSKNVRFVPGCDSAACNVADTYQASGLAATSDYTILFMGLSQKQEQE 482
Query: 505 ELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGE 564
LDR L+LPG+Q+ LIT VA AAK+P+ILVLL GGPVDITFAK++ IG+ILWAGYPG+
Sbjct: 483 GLDRTSLLLPGKQESLITAVAAAAKRPIILVLLTGGPVDITFAKFNPKIGAILWAGYPGQ 542
Query: 565 AGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEV 624
AG +A+A+V+FG+HNP GRLP+TWYP++Y KVPM DM+MR +G PGR+YRFY+G V
Sbjct: 543 AGGLAIAKVLFGEHNPSGRLPVTWYPEEYTKVPMDDMRMRADPATGYPGRSYRFYKGNAV 602
Query: 625 FPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDV----VHYKSVPELGTEFCET 680
+ FG GLSYSK+S + + +N N++ +T+++ V Y V E+G E CE
Sbjct: 603 YKFGYGLSYSKFS---RQLVRNSSSNNRAPNTELLAAAAVDCGASRYYLVEEIGGEVCER 659
Query: 681 RKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELS 740
KF + V+NHG M GK VLLF++ GRP QLVGF+S L A EKA + F++S
Sbjct: 660 LKFPAVVEVENHGPMDGKQSVLLFLRWPTATEGRPASQLVGFRSQDLRAGEKASVSFDIS 719
Query: 741 PCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
PCE SR DG VI+ G+HFL+V ++E IS
Sbjct: 720 PCEHFSRTTVDGTKVIDRGSHFLMVDEDEMEISF 753
>gi|253761872|ref|XP_002489310.1| hypothetical protein SORBIDRAFT_0010s010920 [Sorghum bicolor]
gi|241946958|gb|EES20103.1| hypothetical protein SORBIDRAFT_0010s010920 [Sorghum bicolor]
Length = 772
Score = 878 bits (2268), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 418/748 (55%), Positives = 533/748 (71%), Gaps = 9/748 (1%)
Query: 29 PPFSCDP-SNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAY 87
PPFSC P S ++E FC TL +QRA DLVSRLT EKI+QL + A +PRLG+P Y
Sbjct: 26 PPFSCGPTSAEASEGLAFCDVTLSPAQRAADLVSRLTPAEKIAQLGDQATGVPRLGVPGY 85
Query: 88 EWWSEALHGVAGVGKGIFFN--GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARAL 145
+WW+EALHG+A GKG+ F+ G +R ATSFPQV+LTAA+FD LW+RIGQAIG EARAL
Sbjct: 86 KWWNEALHGLATSGKGLHFDVVGGVRAATSFPQVLLTAAAFDDDLWFRIGQAIGREARAL 145
Query: 146 YNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLK 205
+N GQA G+T W+PN+NIFRDPRWGRGQETPGEDP V +YAV++VRG+QG++ +
Sbjct: 146 FNVGQAEGLTIWSPNVNIFRDPRWGRGQETPGEDPAVASRYAVAFVRGIQGNSSSS---- 201
Query: 206 GKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCA 265
LQ SACCKH TAYDL++W G RY F ARVT QDL DT+ PPF SCV +G+AS IMCA
Sbjct: 202 -LLQTSACCKHATAYDLEDWNGVARYSFVARVTAQDLEDTFNPPFRSCVVEGKASCIMCA 260
Query: 266 YNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKA 325
Y +NG+P+CA+ +LL+ T R WG GY+ SDCDAV+I+ DA+ YA +PEDAV LKA
Sbjct: 261 YTAINGVPACANTDLLTGTVRGDWGLDGYVASDCDAVAIMRDAQRYAPTPEDAVAVSLKA 320
Query: 326 GMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD 385
G+D++CGS++Q+H AA++Q KL E +ID+AL NLF+VRMRLG F+G+P +G + A
Sbjct: 321 GLDIDCGSYIQQHATAAIQQGKLTELDIDKALVNLFAVRMRLGHFDGDPRKNMYGALSAA 380
Query: 386 VVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPS 445
+C+P H+ LAL+AAQDGIVLLKN G+LPL +S S A+IGPN+N L+ NY GP
Sbjct: 381 DICTPEHRSLALEAAQDGIVLLKNDGGILPLDRSTVTSAAVIGPNSNDGMALIANYFGPP 440
Query: 446 CRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE 505
C S TPLQ LQ+YV N + GC + AC A D+AV ++ D+V L MGL Q QE E
Sbjct: 441 CESTTPLQGLQSYVNNVRFLAGCSSAACDVAVTDQAVVLSGSEDYVFLFMGLSQQQESEG 500
Query: 506 LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEA 565
DR L+LPG QQ LIT VA+A+K+PVILVLL GGPVDITFA+ + IG+ILWAGYPG+A
Sbjct: 501 KDRTSLLLPGMQQSLITAVADASKRPVILVLLSGGPVDITFAQSNPKIGAILWAGYPGQA 560
Query: 566 GAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVF 625
G +A+A+V+FGDHNP GRLPMTWYP+D+ KVPMTDM+MR TSG PGR+YRFY+G V+
Sbjct: 561 GGLAIAKVLFGDHNPSGRLPMTWYPEDFTKVPMTDMRMRADPTSGYPGRSYRFYQGNAVY 620
Query: 626 PFGCGLSYSKYSYK-FKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFL 684
FG GLSYS +S + S L + + ++ + ++GT+ CE KF
Sbjct: 621 KFGYGLSYSTFSSRLLYGTSMPALSSTVLAGLRETVTEEGDRSYHIDDIGTDGCEQLKFP 680
Query: 685 VTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCES 744
+ V+NHG M GKH L+F++ GRP QL+GF S L A E A + F++SPCE
Sbjct: 681 AMVEVQNHGPMDGKHSALMFLRWPNTNGGRPASQLIGFMSQHLKAGETANLRFDISPCEH 740
Query: 745 LSRAREDGLMVIEEGTHFLVVGDEEYPI 772
SR R DG+ VI+ G+HFL V + I
Sbjct: 741 FSRVRADGMKVIDIGSHFLTVDNHAIEI 768
>gi|326517420|dbj|BAK00077.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 781
Score = 872 bits (2252), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 411/754 (54%), Positives = 536/754 (71%), Gaps = 8/754 (1%)
Query: 26 STQPPFSCDPSN-PSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGI 84
+ PPFSC PS+ +T+ + FC TLP++QRA DLV+RLT EK++QL + A +PRLG+
Sbjct: 33 AADPPFSCGPSSTAATQGYAFCDATLPVAQRAADLVARLTTAEKVAQLGDEAAGVPRLGV 92
Query: 85 PAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA 144
PAY+WW+EALHG+A GKG+ FNG +R ATSFPQV LTAA+FD LW RIGQAIG EARA
Sbjct: 93 PAYKWWNEALHGLATSGKGLHFNGAVRSATSFPQVSLTAAAFDDDLWLRIGQAIGREARA 152
Query: 145 LYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKL 204
LYN GQA G+T W+PN+NI+RDPRWGRGQETPGEDP +Y V++V+G+QG++
Sbjct: 153 LYNVGQAEGLTMWSPNVNIYRDPRWGRGQETPGEDPTTASRYGVAFVKGLQGNS----TS 208
Query: 205 KGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMC 264
LQ SACCKH TAYDL++W G RY FDARVT QDL DTY PPF SCV G+AS +MC
Sbjct: 209 SSLLQTSACCKHATAYDLEDWGGVARYNFDARVTAQDLEDTYNPPFRSCVVDGKASCVMC 268
Query: 265 AYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLK 324
AY +NG+P+CA+ LL+ T R WG GY+ SDCDAV+I+ DA+ YA +PEDAV LK
Sbjct: 269 AYTAINGVPACANSGLLTNTVRADWGLDGYVASDCDAVAIMRDAQRYAPTPEDAVALALK 328
Query: 325 AGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGA 384
AG+D++CG+++Q+H AA++Q K+ E ++D+AL NLF++RMRLG F+G+P +G + A
Sbjct: 329 AGLDIDCGTYMQQHAPAALQQGKITEDDVDKALKNLFAIRMRLGHFDGDPRANIYGGLNA 388
Query: 385 DVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGP 444
+C+P H+ LAL+AAQDGIVLLKN G+LPL ++ S A+IGPNAN+ L+GNY GP
Sbjct: 389 AHICTPEHRSLALEAAQDGIVLLKNDAGILPLDRAAIASAAVIGPNANNPGLLIGNYFGP 448
Query: 445 SCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKE 504
C S+TPL+ +Q YV++ + GC + AC A D+A +A +D+V+L MGL Q QE E
Sbjct: 449 PCESVTPLKGVQGYVKDVRFMAGCGSAACDVADTDQAATLAGSSDYVLLFMGLSQQQESE 508
Query: 505 ELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGE 564
DR L+LPG+QQ LIT VA+AAK+PVILVLL GGPVD+TFAK + IG+ILWAGYPG+
Sbjct: 509 GRDRTSLLLPGQQQSLITAVADAAKRPVILVLLTGGPVDVTFAKNNPKIGAILWAGYPGQ 568
Query: 565 AGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEV 624
AG +A+A V+FGDHNPGGRLP+TWYP+++ KVPMTDM+MR +G PGR+YRFY+G+ V
Sbjct: 569 AGGLAIARVLFGDHNPGGRLPVTWYPEEFTKVPMTDMRMRADPATGYPGRSYRFYQGETV 628
Query: 625 FPFGCG--LSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRK 682
+ FG G S L S+ E V Y V +G CE K
Sbjct: 629 YKFGYGLSYSSYSRRLLSSGTPNTDLLAGLSTMPTPAEEGGVASYH-VEHIGARGCEQLK 687
Query: 683 FLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPC 742
F + V+NHG M GKH VL++++ A GRP KQL+GF+ L A EKA + F++SPC
Sbjct: 688 FPAVVEVENHGPMDGKHSVLMYLRWANATAGRPAKQLIGFRRQHLKAGEKASLTFDISPC 747
Query: 743 ESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
E SR R+DG V++ G+HFL+V E I+ V
Sbjct: 748 EHFSRVRKDGNKVVDRGSHFLMVDMHEMEITFEV 781
>gi|414588273|tpg|DAA38844.1| TPA: putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 775
Score = 870 bits (2249), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 425/755 (56%), Positives = 543/755 (71%), Gaps = 12/755 (1%)
Query: 26 STQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIP 85
++ P FSC PS+ S +PFC +LP ++RA DLVSRLT+ EK+SQL + A +PRLG+P
Sbjct: 25 ASDPMFSCGPSSAS-RAYPFCDRSLPAARRAADLVSRLTVAEKVSQLGDEAAGVPRLGVP 83
Query: 86 AYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARAL 145
Y+WWSE LHG+A G G+ FNGT+ TSFPQV+LT ASFD LW+RIGQAIG EARAL
Sbjct: 84 PYKWWSEGLHGLAFWGHGMRFNGTVSAVTSFPQVLLTTASFDESLWFRIGQAIGREARAL 143
Query: 146 YNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLK 205
YN GQA G+T W+PN+NIFRDPRWGRGQETPGEDP V KYAV++VRG+QG G
Sbjct: 144 YNLGQAEGLTIWSPNVNIFRDPRWGRGQETPGEDPAVASKYAVAFVRGIQGSN-PAGAAA 202
Query: 206 GKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCA 265
LQASACCKH TAYDL++W G RY FDARVT+QDLADT+ PPF+SCV G+AS +MCA
Sbjct: 203 APLQASACCKHATAYDLEDWNGVARYNFDARVTLQDLADTFNPPFQSCVVDGKASCVMCA 262
Query: 266 YNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKA 325
Y +NG+P+CA +LL+KT R WG GY++SDCDAV+I+ DA+ Y +PED V LKA
Sbjct: 263 YTVINGVPACASSDLLTKTFRGAWGLDGYVSSDCDAVAIMRDAQRYEPTPEDTVAVALKA 322
Query: 326 GMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQP-FGKIGA 384
G+D+NCG++ Q+H AA++Q K+ E ++D+AL NLF+VRMRLG F+G+P +G++GA
Sbjct: 323 GLDLNCGTYTQQHGMAAIQQGKMTEKDVDKALTNLFAVRMRLGHFDGDPRGNALYGRLGA 382
Query: 385 DVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGP 444
VC+ H+ LAL+AAQDGIVLLKN G+LPL +S S A+IG NAN L GNY GP
Sbjct: 383 ADVCTADHKNLALEAAQDGIVLLKNDAGILPLDRSAVGSAAVIGHNANDPLVLSGNYFGP 442
Query: 445 SCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKE 504
+C + TPL+ LQ+YV N + GC + AC A+ +A +A A++V L MGL Q QEKE
Sbjct: 443 ACETTTPLEGLQSYVRNVRFLAGCSSAACGYAATGQAAALASSAEYVFLFMGLSQDQEKE 502
Query: 505 ELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGE 564
LDR L+LPG+QQ L+T VA AAK+PV+LVLL GGPVDITFA+ + IG+ILWAGYPG+
Sbjct: 503 GLDRTSLLLPGKQQSLVTAVASAAKRPVVLVLLTGGPVDITFAQSNPKIGAILWAGYPGQ 562
Query: 565 AGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEV 624
AG +A+A V+FGDHNP GRLP+TWY +D+ KVPMTDM+MR +G PGRTYRFY GK +
Sbjct: 563 AGGLAIARVLFGDHNPSGRLPVTWYTEDFTKVPMTDMRMRADPATGYPGRTYRFYRGKTI 622
Query: 625 FPFGCGLSYSKYSYKFKAVSQN-----KLYLNQSSSTKMVENQDVVHYKSVPELGTEFCE 679
+ FG GLSYSK+S + +N L + S+ T+ Y V ++GT CE
Sbjct: 623 YKFGYGLSYSKFSRQLVTGDKNLAPNTSLLAHLSAKTQHA----ATSYYHVDDIGTVGCE 678
Query: 680 TRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFEL 739
KF + V NHG M GKH VL+F++ +GRP++QL+GF+S + A EKA + F +
Sbjct: 679 QLKFPAEVEVLNHGPMDGKHSVLMFLRWPNATDGRPVRQLIGFRSQHIKAGEKANVRFHV 738
Query: 740 SPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
SPCE SR R DG VI+ G+HFL+VG EE IS
Sbjct: 739 SPCEHFSRTRADGKKVIDRGSHFLMVGKEELEISF 773
>gi|62701898|gb|AAX92971.1| beta-D-xylosidase [Oryza sativa Japonica Group]
gi|62733926|gb|AAX96035.1| beta-D-xylosidase [Oryza sativa Japonica Group]
gi|77550045|gb|ABA92842.1| Glycosyl hydrolase family 3 C terminal domain containing protein,
expressed [Oryza sativa Japonica Group]
gi|125576900|gb|EAZ18122.1| hypothetical protein OsJ_33667 [Oryza sativa Japonica Group]
Length = 771
Score = 867 bits (2241), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 421/756 (55%), Positives = 544/756 (71%), Gaps = 19/756 (2%)
Query: 28 QPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAY 87
+PP+SC P +PS+ + FC LP ++RA DLVSRLT EK++QL + A +PRLG+P Y
Sbjct: 24 EPPYSCGPRSPSS-GYAFCDARLPPARRAADLVSRLTAAEKVAQLGDEAGGVPRLGVPPY 82
Query: 88 EWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN 147
+WWSE LHG++ G G+ FNG + TSFPQV+LTAA+FD LW+RIGQAIG EARALYN
Sbjct: 83 KWWSEGLHGLSYWGHGMHFNGAVTAITSFPQVLLTAAAFDDRLWFRIGQAIGTEARALYN 142
Query: 148 AGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGK 207
GQA G+T W+PN+NI+RDPRWGRGQETPGEDP KYAV++V+G+QG T G
Sbjct: 143 LGQAEGLTIWSPNVNIYRDPRWGRGQETPGEDPTTASKYAVAFVKGLQGST------PGT 196
Query: 208 LQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYN 267
LQ SACCKH TAYDL+ W G RY F+A+VT QDLADT+ PPF+SCV +AS +MCAY
Sbjct: 197 LQTSACCKHATAYDLEEWNGVARYNFNAKVTAQDLADTFNPPFKSCVVDAKASCVMCAYT 256
Query: 268 RVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGM 327
+NG+P+CA +LLSKT R QWG GY++SDCDAV+++ DA+ YA +PED V +KAG+
Sbjct: 257 DINGVPACASSDLLSKTFRGQWGLDGYVSSDCDAVALLRDAQRYAPTPEDTVAVAIKAGL 316
Query: 328 DVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNP-TMQPFGKIGADV 386
D+NCG++ Q H AA++Q K+ ES++DRAL NLF+VRMRLG F+G+P + +G +GA
Sbjct: 317 DLNCGNYTQVHGMAALQQGKMRESDVDRALTNLFAVRMRLGHFDGDPRSNAAYGHLGAAD 376
Query: 387 VCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSC 446
VC+ AH+ LAL+AAQDGIVLLKN G LPL ++ S A+IGPNAN L GNY GP C
Sbjct: 377 VCTQAHRDLALEAAQDGIVLLKNDAGALPLDRATVRSAAVIGPNANDPAALNGNYFGPPC 436
Query: 447 RSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL 506
+ TPLQ +Q Y+ + + GCD+ AC A+ +A +A +D V++ MGL Q QEKE L
Sbjct: 437 ETTTPLQGVQRYISSVRFLAGCDSPACGFAATGQAAALASSSDQVIMFMGLSQDQEKEGL 496
Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
DR L+LPG+QQ LIT VA AA++PVILVLL GGPVD+TFAK + IG+ILWAGYPG+AG
Sbjct: 497 DRTSLLLPGKQQSLITAVASAARRPVILVLLTGGPVDVTFAKNNPKIGAILWAGYPGQAG 556
Query: 567 AVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFP 626
+A+A+V+FGDHNP GRLP+TWYP+++ ++PMTDM+MR +G PGR+YRFY+G V+
Sbjct: 557 GLAIAKVLFGDHNPSGRLPVTWYPEEFTRIPMTDMRMRADPATGYPGRSYRFYQGNPVYK 616
Query: 627 FGCGLSYSKYSYKFKAVSQ----NKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRK 682
FG GLSYSK+S + A ++ N+ L + + H V E+G E CE K
Sbjct: 617 FGYGLSYSKFSRRLVAAAKPRRPNRNLLAGVIPKPAGDGGESYH---VEEIGEEGCERLK 673
Query: 683 FLVTIGVKNHGEMAGKHPVLLFVK--PARRGNGRPIKQLVGFQSVILNAKEKAEIVFELS 740
F T+ V NHG M GKH VL+FV+ A G RP +QLVGF S + A EKA + E++
Sbjct: 674 FPATVEVHNHGPMDGKHSVLVFVRWPNATAGASRPARQLVGFSSQHVRAGEKARLTMEIN 733
Query: 741 PCESLSRAREDGLMVIEEGTHFLVVG--DEEYPISI 774
PCE LSRAREDG VI+ G+HFL VG D+E+ IS
Sbjct: 734 PCEHLSRAREDGTKVIDRGSHFLKVGEEDDEWEISF 769
>gi|413925162|gb|AFW65094.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 774
Score = 865 bits (2236), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 421/751 (56%), Positives = 541/751 (72%), Gaps = 16/751 (2%)
Query: 29 PPFSCDPSNP-STETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAY 87
PPFSC PS+ ++E FC TL +QRA DLVSRLT EKI+QL + AP +PRLG+P Y
Sbjct: 29 PPFSCGPSSAEASEGLAFCDVTLAPAQRAADLVSRLTAAEKIAQLGDQAPGVPRLGVPGY 88
Query: 88 EWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN 147
+WW+EALHG+A GKG+ F+ +R ATSFPQV+LTAA+FD LW RIGQAIG EARAL+N
Sbjct: 89 KWWNEALHGLATSGKGLHFDAAVRAATSFPQVLLTAAAFDDDLWLRIGQAIGREARALFN 148
Query: 148 AGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGK 207
GQA G+T W+PN+NIFRDPRWGRGQETPGEDP V +YAV++VRG+QG++ +
Sbjct: 149 VGQAEGLTIWSPNVNIFRDPRWGRGQETPGEDPAVASRYAVAFVRGIQGNSSSS-----L 203
Query: 208 LQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYN 267
LQ SACCKH TAYDL++W G RY F ARVT QDL DT+ PPF SCV + +AS +MCAY
Sbjct: 204 LQTSACCKHATAYDLEDWNGVARYSFVARVTEQDLEDTFNPPFRSCVVEAKASCVMCAYT 263
Query: 268 RVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGM 327
+NG+P+CA+ +LL+ T R WG GY+ SDCDAV+I+ DA+ YA +PEDAV LKAG+
Sbjct: 264 AINGVPACANSDLLTGTVRGDWGLDGYVASDCDAVAIMRDAQRYAPTPEDAVAVSLKAGL 323
Query: 328 DVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVV 387
D++CGS++Q+H AA++Q KL E +ID+AL NL++VRMRLG F+G+P +G +GA +
Sbjct: 324 DIDCGSYVQQHAAAAIQQGKLTEQDIDKALTNLYAVRMRLGHFDGDPRKNMYGVLGAADI 383
Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
C+P H+ LAL+AAQDGIVLLKN G+LPL +S S A+IGPNAN L+ NY GP C
Sbjct: 384 CTPEHRNLALEAAQDGIVLLKNDGGILPLDRSTVTSAAVIGPNANDGMALIANYFGPPCE 443
Query: 448 SITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELD 507
S TPL+ LQ+YV + + GC++ AC A+ D+AV +A D+V L MGL Q QE E D
Sbjct: 444 STTPLKGLQSYVNDVRFLAGCNSAACDVAATDQAVALAGSEDYVFLFMGLSQKQESEGKD 503
Query: 508 RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGA 567
R L+LPG QQ LIT VA+A+K+PVILVLL GGPVDITFA+ + IG+ILWAGYPG+AG
Sbjct: 504 RTSLLLPGMQQSLITAVADASKRPVILVLLSGGPVDITFAQSNPKIGAILWAGYPGQAGG 563
Query: 568 VALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPF 627
+A+A+V+FGDHNP GRLP+TWYP+++ KVPMTDM+MR TSG PGR+YRFY+G V+ F
Sbjct: 564 LAIAKVLFGDHNPSGRLPVTWYPEEFTKVPMTDMRMRADPTSGYPGRSYRFYQGNTVYKF 623
Query: 628 GCGLSYSKYSYKF------KAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETR 681
G GLSYS +S + A+S L + + T QD V +GTE CE
Sbjct: 624 GYGLSYSTFSRRLVHGTSVPALSSTLLTGLRETMTP----QDGDRSYHVDAIGTEGCEQL 679
Query: 682 KFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSP 741
KF + V+NHG M GKH VL+F++ GRP QL+GF+S L A E A++ F++SP
Sbjct: 680 KFPAMVEVQNHGPMDGKHSVLMFLRWPNTKQGRPASQLIGFRSQHLKAGETAKLRFDISP 739
Query: 742 CESLSRAREDGLMVIEEGTHFLVVGDEEYPI 772
C+ SR R DG VI+ G+HFL+V + E I
Sbjct: 740 CKHFSRVRADGRKVIDIGSHFLMVDNHEMEI 770
>gi|115485165|ref|NP_001067726.1| Os11g0297800 [Oryza sativa Japonica Group]
gi|62734696|gb|AAX96805.1| beta-D-xylosidase [Oryza sativa Japonica Group]
gi|77549999|gb|ABA92796.1| Glycosyl hydrolase family 3 C terminal domain containing protein,
expressed [Oryza sativa Japonica Group]
gi|113644948|dbj|BAF28089.1| Os11g0297800 [Oryza sativa Japonica Group]
gi|125534139|gb|EAY80687.1| hypothetical protein OsI_35869 [Oryza sativa Indica Group]
gi|215766717|dbj|BAG98945.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 782
Score = 863 bits (2231), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 412/755 (54%), Positives = 538/755 (71%), Gaps = 12/755 (1%)
Query: 24 VDSTQPPFSCDPSNPSTETFP-FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRL 82
V +PPFSC + FC TLP QRA DLV+RLT EK++QL + A +PRL
Sbjct: 30 VAGGEPPFSCGGAAAGGGQGYAFCDATLPAEQRAADLVARLTAAEKVAQLGDQAAGVPRL 89
Query: 83 GIPAYEWWSEALHGVAGVGKGIFFNG---TIRGATSFPQVILTAASFDSYLWYRIGQAIG 139
G+PAY+WWSEALHG+A G+G+ F+ R ATSFPQV+LTAA+FD LW+RIGQAIG
Sbjct: 90 GVPAYKWWSEALHGLATSGRGLHFDAPGSAARAATSFPQVLLTAAAFDDDLWFRIGQAIG 149
Query: 140 LEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTF 199
EARALYN GQA G+T W+PN+NIFRDPRWGRGQETPGEDP + KYAV++V+G+QG++
Sbjct: 150 TEARALYNIGQAEGLTMWSPNVNIFRDPRWGRGQETPGEDPTMASKYAVAFVKGMQGNS- 208
Query: 200 NGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRA 259
LQ SACCKH TAYDL++W G RY F+A+VT QDL DTY PPF SCV +A
Sbjct: 209 -----SAILQTSACCKHVTAYDLEDWNGVQRYNFNAKVTAQDLEDTYNPPFRSCVVDAKA 263
Query: 260 SGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAV 319
+ IMCAY +NG+P+CA+ +LL+KT R WG GYI SDCDAV+I+ DA+ Y ++PEDAV
Sbjct: 264 TCIMCAYTGINGVPACANADLLTKTVRGDWGLDGYIASDCDAVAIMRDAQRYTQTPEDAV 323
Query: 320 VDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNP-TMQP 378
LKAG+D+NCG+++Q+H AA++Q KL E +ID+AL NLF++RMRLG F+G+P +
Sbjct: 324 AVALKAGLDMNCGTYMQQHATAAIQQGKLTEEDIDKALKNLFAIRMRLGHFDGDPRSNSV 383
Query: 379 FGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLL 438
+G +GA +C+P H+ LAL+AA DGIVLLKN G+LPL ++ S A+IGPNAN L+
Sbjct: 384 YGGLGAADICTPEHRSLALEAAMDGIVLLKNDAGILPLDRTAVASAAVIGPNANDGLALI 443
Query: 439 GNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLD 498
GNY GP C S TPL + Y++N + GC++ AC A+ D+A +A +D+V L MGL
Sbjct: 444 GNYFGPPCESTTPLNGILGYIKNVRFLAGCNSAACDVAATDQAAAVASSSDYVFLFMGLS 503
Query: 499 QTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILW 558
Q QE E DR L+LPG QQ LIT VA+AAK+PVILVLL GGPVD+TFA+ + IG+ILW
Sbjct: 504 QKQESEGRDRTSLLLPGEQQSLITAVADAAKRPVILVLLTGGPVDVTFAQTNPKIGAILW 563
Query: 559 AGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRF 618
AGYPG+AG +A+A V+FGDHNPGGRLP+TWYP+++ KVPMTDM+MR +G PGR+YRF
Sbjct: 564 AGYPGQAGGLAIARVLFGDHNPGGRLPVTWYPEEFTKVPMTDMRMRADPATGYPGRSYRF 623
Query: 619 YEGKEVFPFGCGLSYSKYSYKFKAVSQ-NKLYLNQSSSTKMVENQDVVHYKSVPELGTEF 677
Y+GK V+ FG GLSYS YS + + + + Y N +S + + + E+GT+
Sbjct: 624 YQGKTVYKFGYGLSYSSYSRQLVSGGKPAESYTNLLASLRTTTTSEGDESYHIEEIGTDG 683
Query: 678 CETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVF 737
CE KF + V+NHG M GKH VL++++ GRP QL+GF+S L EKA I F
Sbjct: 684 CEQLKFPAVVEVQNHGPMDGKHSVLMYLRWPNAKGGRPTTQLIGFRSQHLKVGEKANIRF 743
Query: 738 ELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPI 772
++SPCE SR R+DG VI+ G+H+L+V +E I
Sbjct: 744 DISPCEHFSRVRKDGKKVIDRGSHYLMVDKDELEI 778
>gi|253761860|ref|XP_002489304.1| hypothetical protein SORBIDRAFT_0010s007570 [Sorghum bicolor]
gi|241946952|gb|EES20097.1| hypothetical protein SORBIDRAFT_0010s007570 [Sorghum bicolor]
Length = 750
Score = 860 bits (2221), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 414/750 (55%), Positives = 535/750 (71%), Gaps = 16/750 (2%)
Query: 26 STQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIP 85
++ P FSC PS+PS +PFC +LP ++RA DLVSRLT+ EK+SQL + A +PRLG+P
Sbjct: 14 ASDPLFSCGPSSPS-RAYPFCDRSLPAARRAADLVSRLTVAEKVSQLGDEAAGVPRLGVP 72
Query: 86 AYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARAL 145
Y+WWSE LHG+A G G+ FNGT+ G TSFPQV+LT ASFD LW+RIGQAIG EARAL
Sbjct: 73 PYKWWSEGLHGLAFWGHGMRFNGTVTGVTSFPQVLLTTASFDDGLWFRIGQAIGREARAL 132
Query: 146 YNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLK 205
YN GQA G+T W+PN+NIFRDPRWGRGQETPGEDP V KYAV++VRG+QG +
Sbjct: 133 YNLGQAEGLTIWSPNVNIFRDPRWGRGQETPGEDPAVASKYAVAFVRGIQGSSAA--GAA 190
Query: 206 GKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCA 265
LQASACCKH TAYDL++W G RY FDARVT QDLADT+ PPF+SCV G+A+ +MCA
Sbjct: 191 APLQASACCKHATAYDLEDWNGVARYNFDARVTAQDLADTFNPPFQSCVVDGKATCVMCA 250
Query: 266 YNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKA 325
Y +NG+P+CA +LL+KT R WG GY++SDCDAV+I++DA+ Y +PED V LK
Sbjct: 251 YTGINGVPACASSDLLTKTFRGAWGHDGYVSSDCDAVAIMHDAQRYVPTPEDTVAVALK- 309
Query: 326 GMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQP-FGKIGA 384
+H AA++Q K+ E ++D+AL NLF+VRMRLG F+G+P +G +GA
Sbjct: 310 -----------EHGMAAIQQGKMTEKDVDKALTNLFAVRMRLGHFDGDPRGNALYGHLGA 358
Query: 385 DVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGP 444
VC+ H+ LAL+AAQDGIVLLKN G+LPL +S S A+IG NAN A L GNY GP
Sbjct: 359 ADVCTADHKNLALEAAQDGIVLLKNDAGILPLDRSAMGSAAVIGHNANDALVLRGNYFGP 418
Query: 445 SCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKE 504
+C + TPLQ +Q+YV N + GC + AC A+ +A +A +++V L MGL Q QEKE
Sbjct: 419 ACETTTPLQGVQSYVSNVRFLAGCSSAACGYAATGQAAALASSSEYVFLFMGLSQDQEKE 478
Query: 505 ELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGE 564
LDR L+LPG+QQ LIT VA AAK+PVILVLL GGPVDITFA+ + IG+ILWAGYPG+
Sbjct: 479 GLDRTSLLLPGKQQSLITAVASAAKRPVILVLLTGGPVDITFAQSNPKIGAILWAGYPGQ 538
Query: 565 AGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEV 624
AG +A+A V+FGDHNP GRLP+TWYP+++ KVPMTDM+MR +G PGR+YRFY G +
Sbjct: 539 AGGLAIARVLFGDHNPSGRLPVTWYPEEFTKVPMTDMRMRADPANGYPGRSYRFYRGNTI 598
Query: 625 FPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFL 684
+ FG GLSYSK+S + +N+L + + ++ D Y V ++G + CE +F
Sbjct: 599 YKFGYGLSYSKFSRQLVTGGKNQLASLLAGLSATTKDDDATSYYHVDDIGADGCEQLRFP 658
Query: 685 VTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCES 744
+ V+NHG M GKH VL+F++ +GRP+ QL+GF S + A EKA + F++ PCE
Sbjct: 659 AEVEVQNHGPMDGKHSVLMFLRWPNATDGRPVSQLIGFTSQHIKAGEKANVRFDVRPCEH 718
Query: 745 LSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
SRAR DG VI+ G+HFL+VG EE +S
Sbjct: 719 FSRARADGKKVIDRGSHFLMVGKEEVEVSF 748
>gi|125534112|gb|EAY80660.1| hypothetical protein OsI_35838 [Oryza sativa Indica Group]
Length = 771
Score = 859 bits (2219), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 417/756 (55%), Positives = 542/756 (71%), Gaps = 19/756 (2%)
Query: 28 QPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAY 87
+PP+SC P +PS + FC LP ++RA DLVSRLT EK++QL + A + RLG+P Y
Sbjct: 24 EPPYSCGPRSPSL-GYAFCDARLPPARRAADLVSRLTAAEKVAQLGDEAGGVARLGVPPY 82
Query: 88 EWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN 147
+WWSE LHG++ G G+ FNG + TSFPQV+LTAA+FD LW+RIGQAIG EARALYN
Sbjct: 83 KWWSEGLHGLSYWGHGMHFNGAVTAITSFPQVLLTAAAFDDRLWFRIGQAIGTEARALYN 142
Query: 148 AGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGK 207
GQA G+T W+PN+NI+RDPRWGRGQETPGEDP KYAV++V+G+QG T G
Sbjct: 143 LGQAEGLTIWSPNVNIYRDPRWGRGQETPGEDPTTASKYAVAFVKGLQGST------PGT 196
Query: 208 LQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYN 267
LQ SACCKH TAYDL+ W G RY F+A+VT QDLADT+ PPF+SCV +AS +MCAY
Sbjct: 197 LQTSACCKHATAYDLEEWNGVARYNFNAKVTAQDLADTFNPPFKSCVVDAKASCVMCAYT 256
Query: 268 RVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGM 327
+NG+P+CA +LLSKT R QWG GY++SDCDAV+++ DA+ YA +PED V +KAG+
Sbjct: 257 DINGVPACASSDLLSKTFRGQWGLDGYVSSDCDAVALLRDAQRYAPTPEDTVAVAIKAGL 316
Query: 328 DVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNP-TMQPFGKIGADV 386
D+NCG++ Q H AA++Q K+ ES++DRAL NLF+VRMRLG F+G+P + +G +GA
Sbjct: 317 DLNCGNYTQVHGMAALQQGKMRESDVDRALTNLFAVRMRLGHFDGDPRSNAAYGHLGAAD 376
Query: 387 VCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSC 446
VC+ AH+ LAL+AAQ+GIVLLKN G LPL ++ S A+IGPNAN L GNY GP C
Sbjct: 377 VCTQAHRDLALEAAQNGIVLLKNDAGALPLDRATVRSAAVIGPNANDPAALNGNYFGPPC 436
Query: 447 RSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL 506
+ TPLQ +Q Y+ + + GCD+ AC A+ +A +A +D V++ MGL Q QEKE L
Sbjct: 437 ETTTPLQGVQRYISSVRFLAGCDSPACGFAATGQAAALASSSDQVIMFMGLSQDQEKEGL 496
Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
DR L+LPG+QQ LIT VA AA++PVILVLL GGPVD+TFAK + IG+ILWAGYPG+AG
Sbjct: 497 DRTSLLLPGKQQSLITAVASAARRPVILVLLTGGPVDVTFAKNNPKIGAILWAGYPGQAG 556
Query: 567 AVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFP 626
+A+A+V+FGDHNP GRLP+TWYP+++ ++PMTDM+MR +G PGR+YRFY+G V+
Sbjct: 557 GLAIAKVLFGDHNPSGRLPVTWYPEEFTRIPMTDMRMRADPATGYPGRSYRFYQGNPVYK 616
Query: 627 FGCGLSYSKYSYKFKAVSQ----NKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRK 682
FG GLSYSK++ + A ++ N+ L + + H V E+G E CE K
Sbjct: 617 FGYGLSYSKFTRRLVAAAKPRRPNRNLLAGVIPKPAGDGGESYH---VEEIGEEGCERLK 673
Query: 683 FLVTIGVKNHGEMAGKHPVLLFVK--PARRGNGRPIKQLVGFQSVILNAKEKAEIVFELS 740
F T+ V NHG M GKH VL+FV+ A G RP +QLVGF S + A EKA + E++
Sbjct: 674 FPATVEVHNHGPMDGKHSVLVFVQWPNATAGASRPARQLVGFSSQHVRAGEKARLTMEIN 733
Query: 741 PCESLSRAREDGLMVIEEGTHFLVVG--DEEYPISI 774
PCE LSRAR+DG VI+ G+HFL VG D+E+ IS
Sbjct: 734 PCEHLSRARDDGTKVIDRGSHFLKVGEEDDEWEISF 769
>gi|357138088|ref|XP_003570630.1| PREDICTED: probable beta-D-xylosidase 7-like [Brachypodium
distachyon]
Length = 1026
Score = 856 bits (2212), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 409/645 (63%), Positives = 499/645 (77%), Gaps = 17/645 (2%)
Query: 1 MIFHKLSLVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLV 60
+IFH +V PL CL + + +T PPFSC S ++PFC LPI QRA DL
Sbjct: 6 IIFH---VVLPL-CLVLQATM----ATDPPFSCG----SPSSYPFCDRKLPIGQRAADLA 53
Query: 61 SRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV---GKGIFFN-GTIRGATSF 116
SRLT++EK+S L + +P +PRLG+PAY+WWSEALHGVA G+ F+ G +R ATSF
Sbjct: 54 SRLTVEEKVSLLGDVSPGVPRLGVPAYKWWSEALHGVANAPADRAGVRFDDGPVRAATSF 113
Query: 117 PQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETP 176
PQV++TAASF+ +LWYRIGQ IG EAR +YN+GQA G+TFWAPNIN+FRDPRWGRGQETP
Sbjct: 114 PQVLVTAASFNPHLWYRIGQVIGREARGIYNSGQAEGLTFWAPNINVFRDPRWGRGQETP 173
Query: 177 GEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDAR 236
GEDP +TGKYA +VRGVQG +G L+ASACCKHFTAYDL+NW G TR+ F+A+
Sbjct: 174 GEDPTMTGKYAAVFVRGVQGYGASGAVNSSGLEASACCKHFTAYDLENWNGVTRFAFNAK 233
Query: 237 VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYIT 296
V+ QDLADTY PPF SCV+ G ASGIMC+YNRVNG+P+CAD NLLSKTAR W F+GYIT
Sbjct: 234 VSEQDLADTYNPPFRSCVEDGGASGIMCSYNRVNGVPTCADHNLLSKTARGDWRFNGYIT 293
Query: 297 SDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRA 356
SDCDAV+II+D +GYAK PEDAV DVLKAGMDVNCG ++QKH +A Q K+ E +IDRA
Sbjct: 294 SDCDAVAIIHDVQGYAKEPEDAVADVLKAGMDVNCGDYVQKHGVSAFHQGKITEQDIDRA 353
Query: 357 LHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPL 416
L NLF++RMRLGLF+GNP +G IGAD VC HQ LAL+AAQDGIVLLKN G LPL
Sbjct: 354 LQNLFAIRMRLGLFDGNPKYNRYGNIGADQVCKKEHQDLALEAAQDGIVLLKNDAGTLPL 413
Query: 417 PKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSA 476
PK K SLA+IG NAN A+ L GNY GP C S++PLQALQ YV T + GC+ C+ +
Sbjct: 414 PKQKISSLAVIGHNANDAQRLQGNYFGPPCISVSPLQALQGYVRETKFVAGCNAAVCNVS 473
Query: 477 SIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVL 536
I A A A++VVL MGLDQ QE+E+LDR++L LPG Q+ L+ VA+AAKKPV+LVL
Sbjct: 474 DIAGAAKAASEAEYVVLFMGLDQDQEREDLDRIELGLPGMQESLVNAVADAAKKPVVLVL 533
Query: 537 LCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDY-IK 595
LCGGPVD+TFAK + IG+I+WAGYPG+AG +A+A+V+FG+HNPGGRLP+TWYP++Y
Sbjct: 534 LCGGPVDVTFAKGNPKIGAIIWAGYPGQAGGIAIAQVLFGEHNPGGRLPVTWYPKEYATA 593
Query: 596 VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKF 640
V MTDM+MR A++G PGRTYRFY+GK V+ FG GLSYSKYS+ F
Sbjct: 594 VAMTDMRMRADASTGYPGRTYRFYKGKTVYNFGYGLSYSKYSHSF 638
>gi|253761874|ref|XP_002489311.1| hypothetical protein SORBIDRAFT_0010s012040 [Sorghum bicolor]
gi|241946959|gb|EES20104.1| hypothetical protein SORBIDRAFT_0010s012040 [Sorghum bicolor]
Length = 791
Score = 855 bits (2209), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 410/766 (53%), Positives = 536/766 (69%), Gaps = 10/766 (1%)
Query: 13 LCLCFTSLLTRVD-STQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQ 71
+ L +LL RV + PPFSC PS+PS + PFC LP SQRA DLVSR+T EK SQ
Sbjct: 30 VLLLVPALLMRVAVAGAPPFSCGPSSPS-KGLPFCNMKLPASQRAADLVSRMTPAEKASQ 88
Query: 72 LVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLW 131
L + A +PRLG+P+Y+WW+EALHGVA GKGI N +R ATSFPQV+ TAASF+ LW
Sbjct: 89 LGDIANGVPRLGVPSYKWWNEALHGVAISGKGIHMNQGVRSATSFPQVLHTAASFNDNLW 148
Query: 132 YRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYV 191
+RIGQA G EARA YN GQA G+T W+PN+NIFRDPRWGRGQETPGEDP V +Y ++V
Sbjct: 149 FRIGQATGKEARAFYNIGQAEGLTMWSPNVNIFRDPRWGRGQETPGEDPAVASRYGAAFV 208
Query: 192 RGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFE 251
RG+QG + N + LQ SACCKH TAYDL++WKG +RY F A VT+QDLADT+ PPF
Sbjct: 209 RGLQGSSSNTKSVPPVLQTSACCKHATAYDLEDWKGVSRYSFKATVTIQDLADTFNPPFR 268
Query: 252 SCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGY 311
SCV G+AS +MCAY VNG+PSCA+ +LL+KT R WG GY+ +DCDAV+I+ +++ Y
Sbjct: 269 SCVVDGKASCVMCAYTIVNGVPSCANGDLLTKTFRGSWGLDGYVAADCDAVAIMRNSQFY 328
Query: 312 AKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFN 371
+ ED V LKAG+D++CG ++Q++ AA+++ KL + ++D+A+ NL + RMRLG F+
Sbjct: 329 RPTAEDTVAATLKAGLDIDCGPYIQQYAMAAIQKGKLTQQDVDKAVKNLLTTRMRLGHFD 388
Query: 372 GNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNA 431
G+P +G +GA +C+ H+ LAL+AA DGIVLLKNS G+LPL + S A+IG NA
Sbjct: 389 GDPKTNVYGNLGAGHICTAEHKNLALEAALDGIVLLKNSAGVLPLKRGTVNSAAVIGHNA 448
Query: 432 NSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHV 491
N LLGNY GP C TPLQ +Q YV+N + GC+ AC+ A+ +A +A +D V
Sbjct: 449 NDVLALLGNYWGPPCAPTTPLQGIQGYVKNVKFLAGCNKAACNVAATPQATALASSSDAV 508
Query: 492 VLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDR 551
+L MGL Q QE E DR L+LPG QQ LI VA AAK+PVILVLL GGPVDITFA+ +
Sbjct: 509 ILFMGLSQEQESEGKDRTTLLLPGNQQSLINAVANAAKRPVILVLLTGGPVDITFAQANP 568
Query: 552 NIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGN 611
IG+ILWAGYPG+AG +A+A+V+FG+ NP G+LP TWYP+++ ++PMTDM+MR A
Sbjct: 569 KIGAILWAGYPGQAGGLAIAKVLFGEKNPSGKLPNTWYPEEFTRIPMTDMRMR--AAGSY 626
Query: 612 PGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLY---LNQSSSTKMVENQDVVHYK 668
PGRTYRFY GK ++ FG GLSYSK+S++ +N + L + M E+ H
Sbjct: 627 PGRTYRFYNGKTIYKFGYGLSYSKFSHRVVTGRKNPAHNTSLLAAGLAAMTEDNLSYH-- 684
Query: 669 SVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILN 728
V +G C+ KFL + V+NHG + GKH L+F++ +GRP +QL+GFQS +
Sbjct: 685 -VEHIGDVVCDQLKFLAVVKVQNHGPIDGKHTALMFLRWPSATDGRPTRQLIGFQSQHIK 743
Query: 729 AKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
A EKA + FE+SPCE SR R+DG VI++G+HFL VG E IS
Sbjct: 744 AGEKANLRFEVSPCEHFSRVRQDGRKVIDKGSHFLKVGKHELEISF 789
>gi|357156904|ref|XP_003577615.1| PREDICTED: probable beta-D-xylosidase 7-like [Brachypodium
distachyon]
Length = 767
Score = 851 bits (2199), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 410/750 (54%), Positives = 540/750 (72%), Gaps = 13/750 (1%)
Query: 29 PPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYE 88
PPFSC ++ ++ FC LP++QRA DLVSRLT EK++QL + A +PRLG+P Y+
Sbjct: 25 PPFSC---GQASSSYAFCDAALPVAQRAADLVSRLTAAEKVAQLGDEAAGVPRLGVPGYK 81
Query: 89 WWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA 148
WW+EALHG+A GKG+ F+G +R ATSFPQV LTAA+FD LW+RIGQAIG EARALYN
Sbjct: 82 WWNEALHGLATSGKGLHFDGAVRSATSFPQVCLTAAAFDDDLWFRIGQAIGREARALYNL 141
Query: 149 GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKL 208
GQA G+T W+PN+NI+RDPRWGRGQETPGEDP +YAV++VRG+QG++ + L
Sbjct: 142 GQAEGLTMWSPNVNIYRDPRWGRGQETPGEDPTTASRYAVAFVRGMQGNSTS------LL 195
Query: 209 QASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
QASACCKH TAYDL++W G RY FDA+VT QDL DT+ PPF SCV G+AS +MCAY
Sbjct: 196 QASACCKHATAYDLEDWNGVARYNFDAKVTAQDLEDTFNPPFRSCVVDGKASCVMCAYTG 255
Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
+NG+P+CA+ +LL+KT R WG GY SDCDAV+I+ DA+ YA+SPEDAV LKAG+D
Sbjct: 256 INGVPACANADLLTKTVRGDWGLDGYTASDCDAVAIMRDAQRYAQSPEDAVALALKAGLD 315
Query: 329 VNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVC 388
++CG+++Q+H AA++Q K+ E +ID+AL NLF++RMRLG F+G+P +G +GA +C
Sbjct: 316 IDCGTYMQQHAAAAIQQGKITEEDIDKALKNLFAIRMRLGHFDGDPRTNMYGGLGAADIC 375
Query: 389 SPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS 448
+ H+ LAL AAQDGIVLLKN G+LPL ++ S A+IGPNAN+ L+ NY GP C S
Sbjct: 376 TAEHRSLALDAAQDGIVLLKNDAGILPLDRAAVASTAVIGPNANNPGALIANYFGPPCES 435
Query: 449 ITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDR 508
TPL+ +Q YV++ + GC + AC A+ D+A +A +D+V L MGL Q QE E DR
Sbjct: 436 TTPLKGIQGYVKDARFLAGCSSTACDVATTDQAAALASTSDYVFLFMGLGQRQESEGRDR 495
Query: 509 VDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAV 568
L+LPG+QQ LIT VA+AA++PVILVLL GGPVD+TFA+ + IG+ILWAGYPG+AG +
Sbjct: 496 TSLLLPGKQQSLITAVADAAQRPVILVLLSGGPVDVTFAQTNPKIGAILWAGYPGQAGGL 555
Query: 569 ALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFG 628
A+A V+FGDHNP GRLP+TWYP+++ VPMTDM+MR +G PGR+YRFY+GK V+ FG
Sbjct: 556 AIARVLFGDHNPSGRLPVTWYPEEFTNVPMTDMRMRADPANGYPGRSYRFYQGKTVYKFG 615
Query: 629 CGLSYSKYSYKFKAVSQNKLYLN----QSSSTKMVENQDVVHYKSVPELGTEFCETRKFL 684
GLSYS YS + + + N S +T M ++++ V ++G + CE KF
Sbjct: 616 YGLSYSSYSRRLLSSGTSTPAPNADLLASLTTTMPSAENILGSYHVEQIGAQGCEMLKFP 675
Query: 685 VTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCES 744
+ V+NHG M GK VL++++ GRP +QL+GF+ L A EKA I FE+ PCE
Sbjct: 676 AVVEVQNHGPMDGKQSVLMYLRWPNATAGRPERQLIGFKKEHLKAGEKAHIKFEIRPCEH 735
Query: 745 LSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
LSR REDG VI+ G+HFL V E I+
Sbjct: 736 LSRVREDGNKVIDRGSHFLRVDKHELEITF 765
>gi|413925164|gb|AFW65096.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 829
Score = 848 bits (2191), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 412/750 (54%), Positives = 531/750 (70%), Gaps = 10/750 (1%)
Query: 29 PPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYE 88
PPFSC PS PFC T LP +QRA DLVSR+T EK SQL + A +PRLG+P+Y+
Sbjct: 84 PPFSCG-GGPSL-GLPFCNTKLPAAQRAADLVSRMTPAEKASQLGDVANGVPRLGVPSYK 141
Query: 89 WWSEALHGVAGVGKGIFFN-GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN 147
WW+EALHGVA GKGI + G +R ATSFPQV+LTAASF+ LW+RIGQA G EARA YN
Sbjct: 142 WWNEALHGVAISGKGIHMDRGAVRSATSFPQVLLTAASFNDNLWFRIGQATGKEARAFYN 201
Query: 148 AGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGK 207
GQA G+T W+PN+NIFRDPRWGRGQETPGEDP V +YA ++VRG+QG + N +
Sbjct: 202 IGQAEGLTMWSPNVNIFRDPRWGRGQETPGEDPAVASRYAAAFVRGLQGSSSNTKSVPPV 261
Query: 208 LQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYN 267
L SACCKH TAYDL++WKG TRY F A VT+QDLADT+ PPF SCV G+AS +MCAY
Sbjct: 262 LLTSACCKHATAYDLEDWKGVTRYSFRATVTVQDLADTFNPPFRSCVVDGKASCVMCAYT 321
Query: 268 RVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGM 327
VNG+PSCA+ +LL+KT R WG GY+ +DCDAVSI+ +++ Y + ED V LKAG+
Sbjct: 322 SVNGVPSCANADLLTKTFRGSWGLDGYVAADCDAVSIMRNSQFYRPTAEDTVATTLKAGL 381
Query: 328 DVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVV 387
D++CG ++Q+H AA+++ KL + ++D+A+ NLF+ RMRLG F+G+P +G +GA +
Sbjct: 382 DIDCGPYVQQHAMAAIQKGKLTQQDVDKAVKNLFTTRMRLGHFDGDPKAHVYGNLGAAHI 441
Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
C+ H+ LAL+AA DGIVLLKNS G+LPL + S A+IG NAN LLGNY GP C
Sbjct: 442 CTQEHKNLALEAALDGIVLLKNSAGVLPLKRGSVASAAVIGHNANDVLALLGNYWGPPCA 501
Query: 448 SITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELD 507
TPLQ +Q YV+N + GC AC+ A+ +A +A +D V+L MGL Q QE E D
Sbjct: 502 PTTPLQGIQGYVKNVRFLAGCHKAACNVAATPQAAALASTSDSVILFMGLSQEQESEGKD 561
Query: 508 RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGA 567
R L+LPG QQ LIT VA AAK+PVILVLL GGPVDITFA+ + IG+ILWAGYPG+AG
Sbjct: 562 RTTLLLPGNQQSLITAVANAAKRPVILVLLTGGPVDITFAQANPKIGAILWAGYPGQAGG 621
Query: 568 VALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPF 627
+A+A+V+FG+ NP GRLP+TWYP+++ KVPMTDM+MR + PGR+YRFY+GK ++ F
Sbjct: 622 LAIAKVLFGEKNPSGRLPVTWYPEEFTKVPMTDMRMR--SAGSYPGRSYRFYKGKTIYKF 679
Query: 628 GCGLSYSKYSYKFKAVSQNKLYLNQS---SSTKMVENQDVVHYKSVPELGTEFCETRKFL 684
G GLSYSK+S++ N + N + ++ +D + Y V +G E C KFL
Sbjct: 680 GYGLSYSKFSHRVVTARNNPAH-NTTLLLAAGHAATTEDNLSYH-VDHIGDELCRQLKFL 737
Query: 685 VTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCES 744
+ V+NHG M GKH L+F++ +GRP +QLVGFQS + A EKA + FE+SPCE
Sbjct: 738 AVVKVQNHGPMDGKHTALMFLRWPNATDGRPARQLVGFQSQHIKAGEKAHLRFEVSPCED 797
Query: 745 LSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
SR R+DG VI++G+HFL VG E IS
Sbjct: 798 FSRVRDDGRKVIDKGSHFLKVGKHELEISF 827
>gi|413925166|gb|AFW65098.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 830
Score = 844 bits (2180), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 412/751 (54%), Positives = 531/751 (70%), Gaps = 11/751 (1%)
Query: 29 PPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYE 88
PPFSC PS PFC T LP +QRA DLVSR+T EK SQL + A +PRLG+P+Y+
Sbjct: 84 PPFSCG-GGPSL-GLPFCNTKLPAAQRAADLVSRMTPAEKASQLGDVANGVPRLGVPSYK 141
Query: 89 WWSEALHGVAGVGKGIFFN-GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN 147
WW+EALHGVA GKGI + G +R ATSFPQV+LTAASF+ LW+RIGQA G EARA YN
Sbjct: 142 WWNEALHGVAISGKGIHMDRGAVRSATSFPQVLLTAASFNDNLWFRIGQATGKEARAFYN 201
Query: 148 AGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGK 207
GQA G+T W+PN+NIFRDPRWGRGQETPGEDP V +YA ++VRG+QG + N +
Sbjct: 202 IGQAEGLTMWSPNVNIFRDPRWGRGQETPGEDPAVASRYAAAFVRGLQGSSSNTKSVPPV 261
Query: 208 LQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYN 267
L SACCKH TAYDL++WKG TRY F A VT+QDLADT+ PPF SCV G+AS +MCAY
Sbjct: 262 LLTSACCKHATAYDLEDWKGVTRYSFRATVTVQDLADTFNPPFRSCVVDGKASCVMCAYT 321
Query: 268 RVNGIPSCADRNLLSKTARRQWGFHG-YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAG 326
VNG+PSCA+ +LL+KT R WG G Y+ +DCDAVSI+ +++ Y + ED V LKAG
Sbjct: 322 SVNGVPSCANADLLTKTFRGSWGLDGRYVAADCDAVSIMRNSQFYRPTAEDTVATTLKAG 381
Query: 327 MDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADV 386
+D++CG ++Q+H AA+++ KL + ++D+A+ NLF+ RMRLG F+G+P +G +GA
Sbjct: 382 LDIDCGPYVQQHAMAAIQKGKLTQQDVDKAVKNLFTTRMRLGHFDGDPKAHVYGNLGAAH 441
Query: 387 VCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSC 446
+C+ H+ LAL+AA DGIVLLKNS G+LPL + S A+IG NAN LLGNY GP C
Sbjct: 442 ICTQEHKNLALEAALDGIVLLKNSAGVLPLKRGSVASAAVIGHNANDVLALLGNYWGPPC 501
Query: 447 RSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL 506
TPLQ +Q YV+N + GC AC+ A+ +A +A +D V+L MGL Q QE E
Sbjct: 502 APTTPLQGIQGYVKNVRFLAGCHKAACNVAATPQAAALASTSDSVILFMGLSQEQESEGK 561
Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
DR L+LPG QQ LIT VA AAK+PVILVLL GGPVDITFA+ + IG+ILWAGYPG+AG
Sbjct: 562 DRTTLLLPGNQQSLITAVANAAKRPVILVLLTGGPVDITFAQANPKIGAILWAGYPGQAG 621
Query: 567 AVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFP 626
+A+A+V+FG+ NP GRLP+TWYP+++ KVPMTDM+MR + PGR+YRFY+GK ++
Sbjct: 622 GLAIAKVLFGEKNPSGRLPVTWYPEEFTKVPMTDMRMR--SAGSYPGRSYRFYKGKTIYK 679
Query: 627 FGCGLSYSKYSYKFKAVSQNKLYLNQS---SSTKMVENQDVVHYKSVPELGTEFCETRKF 683
FG GLSYSK+S++ N + N + ++ +D + Y V +G E C KF
Sbjct: 680 FGYGLSYSKFSHRVVTARNNPAH-NTTLLLAAGHAATTEDNLSYH-VDHIGDELCRQLKF 737
Query: 684 LVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCE 743
L + V+NHG M GKH L+F++ +GRP +QLVGFQS + A EKA + FE+SPCE
Sbjct: 738 LAVVKVQNHGPMDGKHTALMFLRWPNATDGRPARQLVGFQSQHIKAGEKAHLRFEVSPCE 797
Query: 744 SLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
SR R+DG VI++G+HFL VG E IS
Sbjct: 798 DFSRVRDDGRKVIDKGSHFLKVGKHELEISF 828
>gi|125534137|gb|EAY80685.1| hypothetical protein OsI_35867 [Oryza sativa Indica Group]
Length = 779
Score = 840 bits (2170), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 398/751 (52%), Positives = 532/751 (70%), Gaps = 15/751 (1%)
Query: 28 QPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAY 87
P F+C P++ + + F FC LP QRA DLV+RLT EK+ QL + AP +PRLGIP Y
Sbjct: 34 NPGFTCGPAS-AQKGFAFCNAALPAEQRAADLVARLTTAEKVGQLGDQAPGVPRLGIPVY 92
Query: 88 EWWSEALHGVAGVGKGIFF-NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY 146
+WWSEALHG+A GKGI F NG R ATSFPQVI TAA+FD LW+RIGQAIG E RA Y
Sbjct: 93 KWWSEALHGLAISGKGIHFGNGPARTATSFPQVIHTAAAFDDGLWFRIGQAIGKEGRAFY 152
Query: 147 NAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKG 206
N GQA G+ W+PN+NIFRDPRWGRGQETPGEDP KY ++V+G+QG +
Sbjct: 153 NLGQAEGLAMWSPNVNIFRDPRWGRGQETPGEDPATASKYGAAFVKGLQGSSLT------ 206
Query: 207 KLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAY 266
LQ SACCKH TAYD++ WKG +RY F+A+VT QDLADTY PPF SCV G+AS IMCAY
Sbjct: 207 NLQTSACCKHITAYDIEEWKGVSRYNFNAKVTPQDLADTYNPPFRSCVVDGKASCIMCAY 266
Query: 267 NRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAG 326
+NG+P+CA +LL+KT R +W GY SDCDAV+I++ +E + ++ E+AV LKAG
Sbjct: 267 TLINGVPACASSDLLTKTVRGEWKLDGYTASDCDAVAILHKSEHFTRTAEEAVAVALKAG 326
Query: 327 MDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQP-FGKIGAD 385
+D+NCG ++Q++ +A++Q K+ E ++D+AL NLF++RMRLG F+G+P +G++GA
Sbjct: 327 LDINCGVYMQQNAASALQQGKMTEKDVDKALKNLFAIRMRLGHFDGDPRGNKLYGRLGAA 386
Query: 386 VVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPS 445
VC+P H+ LAL+AA+ G+VLLKN LLPL S A+IG NAN LLGNY G
Sbjct: 387 DVCTPVHKALALEAARRGVVLLKNDARLLPLRAPTVSSAAVIGHNANDILALLGNYYGLP 446
Query: 446 CRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE 505
C + TP +Q YV++ + PGC + AC A+ D+A +AK +D+V L+MGL Q QE+E
Sbjct: 447 CETTTPFGGIQKYVKSAKFLPGCSSAACDVAATDQATALAKSSDYVFLVMGLSQKQEQEG 506
Query: 506 LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEA 565
LDR L+LPG+QQ LIT VA A+K+PVIL+LL GGPVDITFA+ + IG+ILWAGYPG+A
Sbjct: 507 LDRTSLLLPGKQQALITAVATASKRPVILILLTGGPVDITFAQTNPKIGAILWAGYPGQA 566
Query: 566 GAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVF 625
G A+A+V+FG+ NP G+LP+TWYP+++ K MTDM+MRP +G PGR+YRFY+GK V+
Sbjct: 567 GGQAIADVLFGEFNPSGKLPVTWYPEEFTKFTMTDMRMRPDPATGYPGRSYRFYKGKTVY 626
Query: 626 PFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKM----VENQDVVHYKSVPELGTEFCETR 681
FG GLSYSK++ + + + N +++ + D V+ V E+G + CE
Sbjct: 627 KFGYGLSYSKFACRIVSGAGNSSSYGKAALAGLRAATTPEGDAVY--RVDEIGDDRCERL 684
Query: 682 KFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSP 741
+F V + V+NHG M GKH VL+FV+ + GRP++QL+GF++ L EK ++ E+SP
Sbjct: 685 RFPVMVEVQNHGPMDGKHTVLMFVRWSSTDGGRPVRQLIGFRNQHLKVGEKKKLKMEISP 744
Query: 742 CESLSRAREDGLMVIEEGTHFLVVGDEEYPI 772
CE LSRAR DG VI+ G+HFL+V ++E I
Sbjct: 745 CEHLSRARVDGEKVIDRGSHFLMVEEDELEI 775
>gi|62734691|gb|AAX96800.1| Glycosyl hydrolase family 3 C terminal domain, putative [Oryza
sativa Japonica Group]
gi|77549994|gb|ABA92791.1| beta-D-xylosidase, putative, expressed [Oryza sativa Japonica
Group]
Length = 853
Score = 839 bits (2167), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 397/751 (52%), Positives = 531/751 (70%), Gaps = 15/751 (1%)
Query: 28 QPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAY 87
P F+C P++ + + F FC LP QRA DLV+RLT EK+ QL + AP +PRLGIP Y
Sbjct: 108 NPGFTCGPAS-AQKGFAFCNAALPAEQRAADLVARLTTAEKVGQLGDQAPGVPRLGIPVY 166
Query: 88 EWWSEALHGVAGVGKGIFF-NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY 146
+WWSEALHG+A GKGI F NG R ATSFPQVI TAA+FD LW+RIGQAIG E RA Y
Sbjct: 167 KWWSEALHGLAISGKGIHFGNGPARTATSFPQVIHTAAAFDDGLWFRIGQAIGKEGRAFY 226
Query: 147 NAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKG 206
N GQA G+ W+PN+NIFRDPRWGRGQETPGEDP KY ++V+G+QG +
Sbjct: 227 NLGQAEGLAMWSPNVNIFRDPRWGRGQETPGEDPATASKYGAAFVKGLQGSSLT------ 280
Query: 207 KLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAY 266
LQ SACCKH TAYD++ WKG +RY F+A+VT QDLADTY PPF SCV G+AS IMCAY
Sbjct: 281 NLQTSACCKHITAYDIEEWKGVSRYNFNAKVTPQDLADTYNPPFRSCVVDGKASCIMCAY 340
Query: 267 NRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAG 326
+NG+P+CA +LL+KT R +W GY SDCDAV+I++ +E + ++ E+AV LKAG
Sbjct: 341 TLINGVPACASSDLLTKTVRGEWKLDGYTASDCDAVAILHKSEHFTRTAEEAVAVALKAG 400
Query: 327 MDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQP-FGKIGAD 385
+D+NCG ++Q++ +A++Q K+ E ++D+AL NLF++RMRLG F+G+P +G++ A
Sbjct: 401 LDINCGVYMQQNAASALQQGKMTEKDVDKALKNLFAIRMRLGHFDGDPRGNKLYGRLSAA 460
Query: 386 VVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPS 445
VC+P H+ LAL+AA+ G+VLLKN LLPL S A+IG NAN LLGNY G
Sbjct: 461 DVCTPVHKALALEAARRGVVLLKNDARLLPLRAPTVASAAVIGHNANDILALLGNYYGLP 520
Query: 446 CRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE 505
C + TP +Q YV++ + PGC + AC A+ D+A +AK +D+V L+MGL Q QE+E
Sbjct: 521 CETTTPFGGIQKYVKSAKFLPGCSSAACDVAATDQATALAKSSDYVFLVMGLSQKQEQEG 580
Query: 506 LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEA 565
LDR L+LPG+QQ LIT VA A+K+PVIL+LL GGPVDITFA+ + IG+ILWAGYPG+A
Sbjct: 581 LDRTSLLLPGKQQALITAVATASKRPVILILLTGGPVDITFAQTNPKIGAILWAGYPGQA 640
Query: 566 GAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVF 625
G A+A+V+FG+ NP G+LP+TWYP+++ K MTDM+MRP +G PGR+YRFY+GK V+
Sbjct: 641 GGQAIADVLFGEFNPSGKLPVTWYPEEFTKFTMTDMRMRPDPATGYPGRSYRFYKGKTVY 700
Query: 626 PFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKM----VENQDVVHYKSVPELGTEFCETR 681
FG GLSYSK++ + + + N +++ + D V+ V E+G + CE
Sbjct: 701 KFGYGLSYSKFACRIVSGAGNSSSYGKAALAGLRAATTPEGDAVY--RVDEIGDDRCERL 758
Query: 682 KFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSP 741
+F V + V+NHG M GKH VL+FV+ + GRP++QL+GF++ L EK ++ E+SP
Sbjct: 759 RFPVMVEVQNHGPMDGKHTVLMFVRWSSTDGGRPVRQLIGFRNQHLKVGEKKKLKMEISP 818
Query: 742 CESLSRAREDGLMVIEEGTHFLVVGDEEYPI 772
CE LSRAR DG VI+ G+HFL+V ++E I
Sbjct: 819 CEHLSRARVDGEKVIDRGSHFLMVEEDELEI 849
>gi|296084630|emb|CBI25718.3| unnamed protein product [Vitis vinifera]
Length = 768
Score = 839 bits (2167), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 421/773 (54%), Positives = 538/773 (69%), Gaps = 19/773 (2%)
Query: 8 LVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDE 67
+F L L L + +ST P F C P P+ +PFC T+LPIS RA+ LVS LTL E
Sbjct: 7 FLFICLFLQVLPLFSISESTHPQFPCMP--PTNSDYPFCNTSLPISTRAQSLVSLLTLSE 64
Query: 68 KISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFD 127
KI QL + A AIPRL IPAYEWWSE+LHG+A G G+ FNGT+ ATSFPQV+LTAASF+
Sbjct: 65 KIQQLSDEAAAIPRLYIPAYEWWSESLHGIATNGPGVSFNGTVSAATSFPQVLLTAASFN 124
Query: 128 SYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYA 187
LW+ IG AI +EARA+YN GQA G+TFWAPNINIFRDPRWGRGQETPGEDP+V YA
Sbjct: 125 RSLWFSIGSAIAVEARAMYNVGQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVASAYA 183
Query: 188 VSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQ 247
V +VRG QGD+ G L SACCKH TAYDL+ W +RY FDA V+ QDL DTYQ
Sbjct: 184 VEFVRGFQGDSDGDG-----LMLSACCKHLTAYDLEKWGNFSRYSFDAVVSNQDLEDTYQ 238
Query: 248 PPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYD 307
PPF SCV+QG+AS +MC+YNRVNG+P+CA ++L K A+ +WGF GYITSDCDAV+ +Y+
Sbjct: 239 PPFRSCVQQGKASCLMCSYNRVNGVPACARQDLFQK-AKTEWGFKGYITSDCDAVATVYE 297
Query: 308 AEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRL 367
+ YA SPEDAV DVLKAG D+NCGS++ +HT++A+ Q K+ E +IDRAL NLFSV+MRL
Sbjct: 298 YQHYANSPEDAVADVLKAGTDINCGSYMLRHTQSAIDQGKVKEEDIDRALFNLFSVQMRL 357
Query: 368 GLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALI 427
GLF+G+P +G +G VC+ H+ LAL+AA+ GIVLLKN LPL KS+ SLA+I
Sbjct: 358 GLFDGDPANGLYGNLGPKDVCTKEHRTLALEAARQGIVLLKNDKKFLPLDKSRISSLAII 417
Query: 428 GPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVAC-SSASIDKAVDIAK 486
GP A+ L G Y G C+ + ++ L+ YVE T + GC V C S D+AV IA+
Sbjct: 418 GPQADQ-PFLGGGYTGIPCKPESLVEGLKTYVEKTSFAAGCVDVPCLSDTGFDEAVSIAR 476
Query: 487 GADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITF 546
AD VV++ GLD +QE E+ DRV L+LPG+Q LI+ VA A +KP++LVL GGP+D++F
Sbjct: 477 KADIVVVVAGLDLSQETEDHDRVSLLLPGKQMALISSVASAIQKPLVLVLTGGGPLDVSF 536
Query: 547 AKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQ 606
A+ D I SILW GYPGEAGA ALAE+IFGD NPGGRLPMTWYP+ + +VPM DM MR
Sbjct: 537 AEQDPRIASILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTRVPMNDMNMRAD 596
Query: 607 ATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVEN----- 661
G PGRTYRFY G V+ FG GLSY+K++Y+F + + NKL L +SS T +N
Sbjct: 597 PYRGYPGRTYRFYIGHRVYGFGQGLSYTKFAYQFVS-APNKLNLLRSSDTVSSKNLPRQR 655
Query: 662 QDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVG 721
++ V+Y + EL T C++ +F V I V N G+M G H V+LF + + G P KQL+G
Sbjct: 656 REEVNYFHIEELDT--CDSLRFHVEISVTNVGDMDGSHVVMLFSRVPKIVKGTPEKQLIG 713
Query: 722 FQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
F V ++ E + PCE S A E G ++ G H +++GD + +S+
Sbjct: 714 FSRVHTVSRRSTETSIMVDPCEHFSIANEQGKRIMPLGDHTIMLGDVVHSVSV 766
>gi|115485163|ref|NP_001067725.1| Os11g0297300 [Oryza sativa Japonica Group]
gi|113644947|dbj|BAF28088.1| Os11g0297300 [Oryza sativa Japonica Group]
Length = 779
Score = 838 bits (2166), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 397/751 (52%), Positives = 531/751 (70%), Gaps = 15/751 (1%)
Query: 28 QPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAY 87
P F+C P++ + + F FC LP QRA DLV+RLT EK+ QL + AP +PRLGIP Y
Sbjct: 34 NPGFTCGPAS-AQKGFAFCNAALPAEQRAADLVARLTTAEKVGQLGDQAPGVPRLGIPVY 92
Query: 88 EWWSEALHGVAGVGKGIFF-NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY 146
+WWSEALHG+A GKGI F NG R ATSFPQVI TAA+FD LW+RIGQAIG E RA Y
Sbjct: 93 KWWSEALHGLAISGKGIHFGNGPARTATSFPQVIHTAAAFDDGLWFRIGQAIGKEGRAFY 152
Query: 147 NAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKG 206
N GQA G+ W+PN+NIFRDPRWGRGQETPGEDP KY ++V+G+QG +
Sbjct: 153 NLGQAEGLAMWSPNVNIFRDPRWGRGQETPGEDPATASKYGAAFVKGLQGSSLT------ 206
Query: 207 KLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAY 266
LQ SACCKH TAYD++ WKG +RY F+A+VT QDLADTY PPF SCV G+AS IMCAY
Sbjct: 207 NLQTSACCKHITAYDIEEWKGVSRYNFNAKVTPQDLADTYNPPFRSCVVDGKASCIMCAY 266
Query: 267 NRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAG 326
+NG+P+CA +LL+KT R +W GY SDCDAV+I++ +E + ++ E+AV LKAG
Sbjct: 267 TLINGVPACASSDLLTKTVRGEWKLDGYTASDCDAVAILHKSEHFTRTAEEAVAVALKAG 326
Query: 327 MDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQP-FGKIGAD 385
+D+NCG ++Q++ +A++Q K+ E ++D+AL NLF++RMRLG F+G+P +G++ A
Sbjct: 327 LDINCGVYMQQNAASALQQGKMTEKDVDKALKNLFAIRMRLGHFDGDPRGNKLYGRLSAA 386
Query: 386 VVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPS 445
VC+P H+ LAL+AA+ G+VLLKN LLPL S A+IG NAN LLGNY G
Sbjct: 387 DVCTPVHKALALEAARRGVVLLKNDARLLPLRAPTVASAAVIGHNANDILALLGNYYGLP 446
Query: 446 CRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE 505
C + TP +Q YV++ + PGC + AC A+ D+A +AK +D+V L+MGL Q QE+E
Sbjct: 447 CETTTPFGGIQKYVKSAKFLPGCSSAACDVAATDQATALAKSSDYVFLVMGLSQKQEQEG 506
Query: 506 LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEA 565
LDR L+LPG+QQ LIT VA A+K+PVIL+LL GGPVDITFA+ + IG+ILWAGYPG+A
Sbjct: 507 LDRTSLLLPGKQQALITAVATASKRPVILILLTGGPVDITFAQTNPKIGAILWAGYPGQA 566
Query: 566 GAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVF 625
G A+A+V+FG+ NP G+LP+TWYP+++ K MTDM+MRP +G PGR+YRFY+GK V+
Sbjct: 567 GGQAIADVLFGEFNPSGKLPVTWYPEEFTKFTMTDMRMRPDPATGYPGRSYRFYKGKTVY 626
Query: 626 PFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKM----VENQDVVHYKSVPELGTEFCETR 681
FG GLSYSK++ + + + N +++ + D V+ V E+G + CE
Sbjct: 627 KFGYGLSYSKFACRIVSGAGNSSSYGKAALAGLRAATTPEGDAVY--RVDEIGDDRCERL 684
Query: 682 KFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSP 741
+F V + V+NHG M GKH VL+FV+ + GRP++QL+GF++ L EK ++ E+SP
Sbjct: 685 RFPVMVEVQNHGPMDGKHTVLMFVRWSSTDGGRPVRQLIGFRNQHLKVGEKKKLKMEISP 744
Query: 742 CESLSRAREDGLMVIEEGTHFLVVGDEEYPI 772
CE LSRAR DG VI+ G+HFL+V ++E I
Sbjct: 745 CEHLSRARVDGEKVIDRGSHFLMVEEDELEI 775
>gi|225469218|ref|XP_002264031.1| PREDICTED: probable beta-D-xylosidase 6-like [Vitis vinifera]
Length = 789
Score = 835 bits (2157), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 423/789 (53%), Positives = 541/789 (68%), Gaps = 30/789 (3%)
Query: 8 LVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDE 67
+F L L L + +ST P F C P P+ +PFC T+LPIS RA+ LVS LTL E
Sbjct: 7 FLFICLFLQVLPLFSISESTHPQFPCMP--PTNSDYPFCNTSLPISTRAQSLVSLLTLSE 64
Query: 68 KISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFD 127
KI QL + A AIPRL IPAYEWWSE+LHG+A G G+ FNGT+ ATSFPQV+LTAASF+
Sbjct: 65 KIQQLSDEAAAIPRLYIPAYEWWSESLHGIATNGPGVSFNGTVSAATSFPQVLLTAASFN 124
Query: 128 SYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYA 187
LW+ IG AI +EARA+YN GQA G+TFWAPNINIFRDPRWGRGQETPGEDP+V YA
Sbjct: 125 RSLWFSIGSAIAVEARAMYNVGQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVASAYA 183
Query: 188 VSYVRGVQGDTFNGGK-----------LKGK-----LQASACCKHFTAYDLDNWKGTTRY 231
V +VRG QG + GG L+G L SACCKH TAYDL+ W +RY
Sbjct: 184 VEFVRGFQGGNWKGGDEIRGAVGKKRVLRGDSDGDGLMLSACCKHLTAYDLEKWGNFSRY 243
Query: 232 KFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGF 291
FDA V+ QDL DTYQPPF SCV+QG+AS +MC+YNRVNG+P+CA ++L K A+ +WGF
Sbjct: 244 SFDAVVSNQDLEDTYQPPFRSCVQQGKASCLMCSYNRVNGVPACARQDLFQK-AKTEWGF 302
Query: 292 HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPES 351
GYITSDCDAV+ +Y+ + YA SPEDAV DVLKAG D+NCGS++ +HT++A+ Q K+ E
Sbjct: 303 KGYITSDCDAVATVYEYQHYANSPEDAVADVLKAGTDINCGSYMLRHTQSAIDQGKVKEE 362
Query: 352 EIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSH 411
+IDRAL NLFSV+MRLGLF+G+P +G +G VC+ H+ LAL+AA+ GIVLLKN
Sbjct: 363 DIDRALFNLFSVQMRLGLFDGDPANGLYGNLGPKDVCTKEHRTLALEAARQGIVLLKNDK 422
Query: 412 GLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTV 471
LPL KS+ SLA+IGP A+ L G Y G C+ + ++ L+ YVE T + GC V
Sbjct: 423 KFLPLDKSRISSLAIIGPQADQP-FLGGGYTGIPCKPESLVEGLKTYVEKTSFAAGCVDV 481
Query: 472 AC-SSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKK 530
C S D+AV IA+ AD VV++ GLD +QE E+ DRV L+LPG+Q LI+ VA A +K
Sbjct: 482 PCLSDTGFDEAVSIARKADIVVVVAGLDLSQETEDHDRVSLLLPGKQMALISSVASAIQK 541
Query: 531 PVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYP 590
P++LVL GGP+D++FA+ D I SILW GYPGEAGA ALAE+IFGD NPGGRLPMTWYP
Sbjct: 542 PLVLVLTGGGPLDVSFAEQDPRIASILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYP 601
Query: 591 QDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYL 650
+ + +VPM DM MR G PGRTYRFY G V+ FG GLSY+K++Y+F + + NKL L
Sbjct: 602 ESFTRVPMNDMNMRADPYRGYPGRTYRFYIGHRVYGFGQGLSYTKFAYQFVS-APNKLNL 660
Query: 651 NQSSSTKMVEN-----QDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFV 705
+SS T +N ++ V+Y + EL T C++ +F V I V N G+M G H V+LF
Sbjct: 661 LRSSDTVSSKNLPRQRREEVNYFHIEELDT--CDSLRFHVEISVTNVGDMDGSHVVMLFS 718
Query: 706 KPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVV 765
+ + G P KQL+GF V ++ E + PCE S A E G ++ G H +++
Sbjct: 719 RVPKIVKGTPEKQLIGFSRVHTVSRRSTETSIMVDPCEHFSIANEQGKRIMPLGDHTIML 778
Query: 766 GDEEYPISI 774
GD + +S+
Sbjct: 779 GDVVHSVSV 787
>gi|297611657|ref|NP_001067709.2| Os11g0291000 [Oryza sativa Japonica Group]
gi|255680005|dbj|BAF28072.2| Os11g0291000 [Oryza sativa Japonica Group]
Length = 764
Score = 835 bits (2156), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 409/737 (55%), Positives = 524/737 (71%), Gaps = 16/737 (2%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
FC L QRA DLV+ LTL EK+SQL + A + RLG+PAYEWWSE LHG++ G+GI
Sbjct: 31 FCDAWLTAEQRAADLVANLTLAEKVSQLGDRAAGVARLGVPAYEWWSEGLHGLSIWGRGI 90
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIF 164
FNGT+R TSFPQVILTAA+FD+ LW R+G+A+G EARALYN GQA G+T W+PN+NIF
Sbjct: 91 RFNGTVRAVTSFPQVILTAAAFDAGLWRRVGEAVGAEARALYNLGQANGLTIWSPNVNIF 150
Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
RDPRWGRGQETPGEDP+ +YAV++V G+QG GG +ASACCKH TAYDLD
Sbjct: 151 RDPRWGRGQETPGEDPVTASRYAVAFVTGLQGI---GG------EASACCKHATAYDLDY 201
Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
W RY +D++VT+QDL DTY PPF+SCV +G+A+ IMC YN +NG+P+CA +LL+K
Sbjct: 202 WNNVVRYNYDSKVTLQDLEDTYNPPFKSCVAEGKATCIMCGYNSINGVPACASSDLLTKK 261
Query: 285 ARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVK 344
R++WG +GY+ SDCDAV+ I DA Y SPED V +K GMDVNCG++ Q H AAV+
Sbjct: 262 VRQEWGMNGYVASDCDAVATIRDAHHYTLSPEDTVAVSIKVGMDVNCGNYTQVHAMAAVQ 321
Query: 345 QKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQP-FGKIGADVVCSPAHQVLALQAAQDG 403
+ L E +IDRAL NLF+VRMRLG F+G+P +G +GA VCSPAH+ LAL+AAQDG
Sbjct: 322 KGNLTEKDIDRALVNLFAVRMRLGHFDGDPRSNAVYGHLGAADVCSPAHKSLALEAAQDG 381
Query: 404 IVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV-ENT 462
IVLLKN G LPL S SLA+IGPNA++ L GNY GP C + TPLQ ++ Y+ +
Sbjct: 382 IVLLKNDAGALPLQPSAVTSLAVIGPNADNLGALHGNYFGPPCETTTPLQGIKGYLGDRA 441
Query: 463 VYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELIT 522
+ GCD+ AC+ A+ ++A +A +DHVVL MGL Q QE++ LDR L+LPG QQ LIT
Sbjct: 442 RFLAGCDSPACAVAATNEAAALASSSDHVVLFMGLSQKQEQDGLDRTSLLLPGEQQGLIT 501
Query: 523 RVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGG 582
VA AA++PVILVLL GGPVD+TFAK + IG+ILWAGYPG+AG +A+A+V+FGDHNP G
Sbjct: 502 AVANAARRPVILVLLTGGPVDVTFAKDNPKIGAILWAGYPGQAGGLAIAKVLFGDHNPSG 561
Query: 583 RLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYK-FK 641
RLP+TWYP+++ KVPMTDM+MR +G PGR+YRFY+G V+ FG GLSYSK+S + F
Sbjct: 562 RLPVTWYPEEFTKVPMTDMRMRADPATGYPGRSYRFYQGNTVYNFGYGLSYSKFSRRMFS 621
Query: 642 AVSQ----NKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAG 697
+ S N L + + ++ + V E+G E C F + V+NHG M G
Sbjct: 622 SFSTSNAGNLSLLAGVMARRAGDDGGGMSSYLVKEIGVERCSRLVFPAVVEVQNHGPMDG 681
Query: 698 KHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIE 757
KH VL++++ GRP +QL+GF+S + EKA + FE+SPCE S EDG VI+
Sbjct: 682 KHSVLMYLRWPTTSGGRPARQLIGFRSQHVKVGEKAMVSFEVSPCEHFSWVGEDGERVID 741
Query: 758 EGTHFLVVGDEEYPISI 774
G HFL+VGDEE S
Sbjct: 742 GGAHFLMVGDEELETSF 758
>gi|297811163|ref|XP_002873465.1| glycosyl hydrolase family 3 protein [Arabidopsis lyrata subsp.
lyrata]
gi|297319302|gb|EFH49724.1| glycosyl hydrolase family 3 protein [Arabidopsis lyrata subsp.
lyrata]
Length = 796
Score = 823 bits (2127), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 412/800 (51%), Positives = 547/800 (68%), Gaps = 37/800 (4%)
Query: 4 HKLSLVFPLLCLCFTSLLTR----VDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDL 59
H+L+L+ + FTS + +DS P F C P P ++PFC +L I QRA L
Sbjct: 3 HQLTLISLVF---FTSAIAETFKNLDS-HPQFPCKP--PHFSSYPFCNVSLSIKQRAISL 56
Query: 60 VSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQV 119
VS LTL EKI QL +A ++PRLGIP YEWWSE+LHG+A G G+ FNG+I ATSFPQV
Sbjct: 57 VSLLTLPEKIGQLSTTAASVPRLGIPPYEWWSESLHGLADNGPGVSFNGSISAATSFPQV 116
Query: 120 ILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGED 179
I++AASF+ LWY IG A+ +EARA+YN GQA G+TFWAPNIN+FRDPRWGRGQETPGED
Sbjct: 117 IVSAASFNRTLWYEIGSAVAVEARAMYNGGQA-GLTFWAPNINLFRDPRWGRGQETPGED 175
Query: 180 PLVTGKYAVSYVRGVQGDT--------FNGGKLK----------GKLQASACCKHFTAYD 221
P V +Y V +VRG Q F + GKL SACCKHFTAYD
Sbjct: 176 PKVVSEYGVEFVRGFQEKKKRKVLKTRFGSDNVDDDARYDDDADGKLMLSACCKHFTAYD 235
Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
L+ W TRY F+A VT QD+ DTYQPPFE+C+K G+AS +MC+YN VNG+P+CA +LL
Sbjct: 236 LEKWGNFTRYDFNAVVTEQDMEDTYQPPFETCIKDGKASCLMCSYNAVNGVPACAQGDLL 295
Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA 341
K AR +WGF GYITSDCDAV+ I++ +GY KSPE+AV D +KAG+D+NCG+++ ++T++
Sbjct: 296 QK-ARVEWGFDGYITSDCDAVATIFEYQGYTKSPEEAVADAIKAGVDINCGTYMLRNTQS 354
Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQ 401
A++Q K+ E +DRAL NLF+V++RLGLF+G+P +GK+G++ +CS H+ LAL+AA+
Sbjct: 355 AIEQGKVSEELVDRALLNLFAVQLRLGLFDGDPRGGHYGKLGSNDICSSDHRKLALEAAR 414
Query: 402 DGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN 461
GIVLLKN + LLPL K+ SLA++GP AN+ + G Y G C+ T L YV+
Sbjct: 415 QGIVLLKNDYKLLPLNKNHVSSLAIVGPMANNISNMGGTYTGKPCQRKTLFTELLEYVKK 474
Query: 462 TVYYPGCDTVAC-SSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQEL 520
T Y GC V+C S +AV IAKGAD V+++ GLD +QE E+ DR L LPG+Q++L
Sbjct: 475 TSYASGCSDVSCVSDTGFGEAVAIAKGADFVIVVAGLDLSQETEDKDRFSLSLPGKQKDL 534
Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
++ VA +KKPVILVL GGPVD+TFAK D IGSI+W GYPGE G ALAE+IFGD NP
Sbjct: 535 VSSVAAVSKKPVILVLTGGGPVDVTFAKTDPRIGSIIWIGYPGETGGQALAEIIFGDFNP 594
Query: 581 GGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKF 640
GGRLP+TWYP+ + VPM+DM MR ++ G PGRTYRFY G +V+ FG GLSY+K+ YK
Sbjct: 595 GGRLPITWYPESFADVPMSDMHMRADSSRGYPGRTYRFYTGPQVYSFGTGLSYTKFDYKI 654
Query: 641 KA----VSQNKLYLNQSSSTKMV--ENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGE 694
+ +S ++L QSS K + ++ + Y + ++ CE+ +F V + V+N GE
Sbjct: 655 ISAPIRLSLSELLPQQSSHKKQLLQHGEEQLQYIQLDDVMVNSCESLRFNVRVNVRNTGE 714
Query: 695 MAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLM 754
+ G H ++LF K AR +G P KQL+GF V + + E E VF + PC+ LS A + G
Sbjct: 715 IDGSHVLMLFSKMARVLSGVPEKQLIGFDRVHIRSNEMMETVFVIDPCKYLSVANDVGKR 774
Query: 755 VIEEGTHFLVVGDEEYPISI 774
VI G H L +GD ++ +S+
Sbjct: 775 VIPLGIHALFLGDLQHSLSV 794
>gi|15238197|ref|NP_196618.1| putative beta-D-xylosidase 6 [Arabidopsis thaliana]
gi|75264319|sp|Q9LXA8.1|BXL6_ARATH RecName: Full=Probable beta-D-xylosidase 6; Short=AtBXL6; Flags:
Precursor
gi|7671447|emb|CAB89387.1| beta-xylosidase-like protein [Arabidopsis thaliana]
gi|15982753|gb|AAL09717.1| AT5g10560/F12B17_90 [Arabidopsis thaliana]
gi|332004180|gb|AED91563.1| putative beta-D-xylosidase 6 [Arabidopsis thaliana]
Length = 792
Score = 822 bits (2122), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 407/785 (51%), Positives = 537/785 (68%), Gaps = 30/785 (3%)
Query: 15 LCFTSLLTR----VDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKIS 70
L FTS + +DS P F C P P ++PFC +L I QRA LVS L L EKI
Sbjct: 11 LFFTSAIAETFKNLDS-HPQFPCKP--PHFSSYPFCNVSLSIKQRAISLVSLLMLPEKIG 67
Query: 71 QLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYL 130
QL N+A ++PRLGIP YEWWSE+LHG+A G G+ FNG+I ATSFPQVI++AASF+ L
Sbjct: 68 QLSNTAASVPRLGIPPYEWWSESLHGLADNGPGVSFNGSISAATSFPQVIVSAASFNRTL 127
Query: 131 WYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSY 190
WY IG A+ +E RA+YN GQA G+TFWAPNIN+FRDPRWGRGQETPGEDP V +Y V +
Sbjct: 128 WYEIGSAVAVEGRAMYNGGQA-GLTFWAPNINVFRDPRWGRGQETPGEDPKVVSEYGVEF 186
Query: 191 VRGVQ----------------GDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFD 234
VRG Q D + GKL SACCKHFTAYDL+ W TRY F+
Sbjct: 187 VRGFQEKKKRKVLKRRFSDDVDDDRHDDDADGKLMLSACCKHFTAYDLEKWGNFTRYDFN 246
Query: 235 ARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGY 294
A VT QD+ DTYQPPFE+C++ G+AS +MC+YN VNG+P+CA +LL K AR +WGF GY
Sbjct: 247 AVVTEQDMEDTYQPPFETCIRDGKASCLMCSYNAVNGVPACAQGDLLQK-ARVEWGFEGY 305
Query: 295 ITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEID 354
ITSDCDAV+ I+ +GY KSPE+AV D +KAG+D+NCG+++ +HT++A++Q K+ E +D
Sbjct: 306 ITSDCDAVATIFAYQGYTKSPEEAVADAIKAGVDINCGTYMLRHTQSAIEQGKVSEELVD 365
Query: 355 RALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLL 414
RAL NLF+V++RLGLF+G+P +GK+G++ +CS H+ LAL+A + GIVLLKN H LL
Sbjct: 366 RALLNLFAVQLRLGLFDGDPRRGQYGKLGSNDICSSDHRKLALEATRQGIVLLKNDHKLL 425
Query: 415 PLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACS 474
PL K+ SLA++GP AN+ + G Y G C+ T L YV+ T Y GC V+C
Sbjct: 426 PLNKNHVSSLAIVGPMANNISNMGGTYTGKPCQRKTLFTELLEYVKKTSYASGCSDVSCD 485
Query: 475 SAS-IDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVI 533
S + +AV IAKGAD V+++ GLD +QE E+ DRV L LPG+Q++L++ VA +KKPVI
Sbjct: 486 SDTGFGEAVAIAKGADFVIVVAGLDLSQETEDKDRVSLSLPGKQKDLVSHVAAVSKKPVI 545
Query: 534 LVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDY 593
LVL GGPVD+TFAK D IGSI+W GYPGE G ALAE+IFGD NPGGRLP TWYP+ +
Sbjct: 546 LVLTGGGPVDVTFAKNDPRIGSIIWIGYPGETGGQALAEIIFGDFNPGGRLPTTWYPESF 605
Query: 594 IKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKA----VSQNKLY 649
V M+DM MR ++ G PGRTYRFY G +V+ FG GLSY+K+ YK + +S ++L
Sbjct: 606 TDVAMSDMHMRANSSRGYPGRTYRFYTGPQVYSFGTGLSYTKFEYKILSAPIRLSLSELL 665
Query: 650 LNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPAR 709
QSS K +++ + + Y + ++ CE+ +F V + V N GE+ G H V+LF K
Sbjct: 666 PQQSSHKKQLQHGEELRYLQLDDVIVNSCESLRFNVRVHVSNTGEIDGSHVVMLFSKMPP 725
Query: 710 RGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEE 769
+G P KQL+G+ V + + E E VF + PC+ LS A + G VI G+H L +GD +
Sbjct: 726 VLSGVPEKQLIGYDRVHVRSNEMMETVFVIDPCKQLSVANDVGKRVIPLGSHVLFLGDLQ 785
Query: 770 YPISI 774
+ +S+
Sbjct: 786 HSLSV 790
>gi|62701894|gb|AAX92967.1| beta-xylosidase, putative [Oryza sativa Japonica Group]
gi|77550041|gb|ABA92838.1| Glycosyl hydrolase family 3 C terminal domain containing protein
[Oryza sativa Japonica Group]
Length = 793
Score = 820 bits (2117), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 409/765 (53%), Positives = 524/765 (68%), Gaps = 44/765 (5%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
FC L QRA DLV+ LTL EK+SQL + A + RLG+PAYEWWSE LHG++ G+GI
Sbjct: 32 FCDAWLTAEQRAADLVANLTLAEKVSQLGDRAAGVARLGVPAYEWWSEGLHGLSIWGRGI 91
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIF 164
FNGT+R TSFPQVILTAA+FD+ LW R+G+A+G EARALYN GQA G+T W+PN+NIF
Sbjct: 92 RFNGTVRAVTSFPQVILTAAAFDAGLWRRVGEAVGAEARALYNLGQANGLTIWSPNVNIF 151
Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
RDPRWGRGQETPGEDP+ +YAV++V G+QG GG +ASACCKH TAYDLD
Sbjct: 152 RDPRWGRGQETPGEDPVTASRYAVAFVTGLQGI---GG------EASACCKHATAYDLDY 202
Query: 225 WKGTTRYKFDAR----------------------------VTMQDLADTYQPPFESCVKQ 256
W RY +D++ VT+QDL DTY PPF+SCV +
Sbjct: 203 WNNVVRYNYDSKDGASTGKSGETSSQVEKKHGPYEKGYFAVTLQDLEDTYNPPFKSCVAE 262
Query: 257 GRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE 316
G+A+ IMC YN +NG+P+CA +LL+K R++WG +GY+ SDCDAV+ I DA Y SPE
Sbjct: 263 GKATCIMCGYNSINGVPACASSDLLTKKVRQEWGMNGYVASDCDAVATIRDAHHYTLSPE 322
Query: 317 DAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTM 376
D V +K GMDVNCG++ Q H AAV++ L E +IDRAL NLF+VRMRLG F+G+P
Sbjct: 323 DTVAVSIKVGMDVNCGNYTQVHAMAAVQKGNLTEKDIDRALVNLFAVRMRLGHFDGDPRS 382
Query: 377 QP-FGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAK 435
+G +GA VCSPAH+ LAL+AAQDGIVLLKN G LPL S SLA+IGPNA++
Sbjct: 383 NAVYGHLGAADVCSPAHKSLALEAAQDGIVLLKNDAGALPLQPSAVTSLAVIGPNADNLG 442
Query: 436 TLLGNYAGPSCRSITPLQALQNYV-ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLM 494
L GNY GP C + TPLQ ++ Y+ + + GCD+ AC+ A+ ++A +A +DHVVL
Sbjct: 443 ALHGNYFGPPCETTTPLQGIKGYLGDRARFLAGCDSPACAVAATNEAAALASSSDHVVLF 502
Query: 495 MGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIG 554
MGL Q QE++ LDR L+LPG QQ LIT VA AA++PVILVLL GGPVD+TFAK + IG
Sbjct: 503 MGLSQKQEQDGLDRTSLLLPGEQQGLITAVANAARRPVILVLLTGGPVDVTFAKDNPKIG 562
Query: 555 SILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGR 614
+ILWAGYPG+AG +A+A+V+FGDHNP GRLP+TWYP+++ KVPMTDM+MR +G PGR
Sbjct: 563 AILWAGYPGQAGGLAIAKVLFGDHNPSGRLPVTWYPEEFTKVPMTDMRMRADPATGYPGR 622
Query: 615 TYRFYEGKEVFPFGCGLSYSKYSYK-FKAVSQ----NKLYLNQSSSTKMVENQDVVHYKS 669
+YRFY+G V+ FG GLSYSK+S + F + S N L + + ++ +
Sbjct: 623 SYRFYQGNTVYNFGYGLSYSKFSRRMFSSFSTSNAGNLSLLAGVMARRAGDDGGGMSSYL 682
Query: 670 VPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNA 729
V E+G E C F + V+NHG M GKH VL++++ GRP +QL+GF+S +
Sbjct: 683 VKEIGVERCSRLVFPAVVEVQNHGPMDGKHSVLMYLRWPTTSGGRPARQLIGFRSQHVKV 742
Query: 730 KEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
EKA + FE+SPCE S EDG VI+ G HFL+VGDEE S
Sbjct: 743 GEKAMVSFEVSPCEHFSWVGEDGERVIDGGAHFLMVGDEELETSF 787
>gi|9294427|dbj|BAB02547.1| beta-1,4-xylosidase [Arabidopsis thaliana]
Length = 876
Score = 797 bits (2059), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 399/775 (51%), Positives = 519/775 (66%), Gaps = 40/775 (5%)
Query: 10 FPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKI 69
F L L +L++ + +Q F+CD S P+T + FC +L RA+DLVSRL+L EK+
Sbjct: 6 FVRLSLLIIALVSSLCESQKNFACDISAPATAKYGFCNVSLSYEARAKDLVSRLSLKEKV 65
Query: 70 SQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSY 129
QLVN A +PRLG+P YEWWSEALHGV+ VG G+ FNGT+ GATSFP ILTAASF++
Sbjct: 66 QQLVNKATGVPRLGVPPYEWWSEALHGVSDVGPGVHFNGTVPGATSFPATILTAASFNTS 125
Query: 130 LWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVS 189
LW ++G+ + EARA++N G A G+T+W+PN+N+FRDPRWGRGQETPGEDPLV KYAV+
Sbjct: 126 LWLKMGEVVSTEARAMHNVGLA-GLTYWSPNVNVFRDPRWGRGQETPGEDPLVVSKYAVN 184
Query: 190 YVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPP 249
YV+G+Q D + GK + +L+ S+CCKH+TAYDLDNWKG R+ FDA+VT QDL DTYQ P
Sbjct: 185 YVKGLQ-DVHDAGKSR-RLKVSSCCKHYTAYDLDNWKGIDRFHFDAKVTKQDLEDTYQTP 242
Query: 250 FESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAE 309
F+SCV++G S +MC+YNRVNGIP+CAD NLL R QW GYI SDCD++ + ++
Sbjct: 243 FKSCVEEGDVSSVMCSYNRVNGIPTCADPNLLRGVIRGQWRLDGYIVSDCDSIQVYFNDI 302
Query: 310 GYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGL 369
Y K+ EDAV LKAG+++NCG FL K+T+ AVK KKL S++D AL + V MRLG
Sbjct: 303 HYTKTREDAVALALKAGLNMNCGDFLGKYTENAVKLKKLNGSDVDEALIYNYIVLMRLGF 362
Query: 370 FNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGP 429
F+G+P PFG +G VCS HQ+LAL+AA+ GIVLL+N G LPLPK+ LA+IGP
Sbjct: 363 FDGDPKSLPFGNLGPSDVCSKDHQMLALEAAKQGIVLLEN-RGDLPLPKTTVKKLAVIGP 421
Query: 430 NANSAKTLLGNYAGPSCRSITPLQALQNYV-ENTVYYPGCDTVACSSAS-IDKAVDIAKG 487
NAN+ K ++ NYAG C+ +P+Q LQ YV E VY PGC V C + I AV
Sbjct: 422 NANATKVMISNYAGVPCKYTSPIQGLQKYVPEKIVYEPGCKDVKCGDQTLISAAVKAVSE 481
Query: 488 ADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFA 547
AD VL++GLDQT E E LDRV+L LPG Q++L+ VA AAKK V+LV++ GP+DI+FA
Sbjct: 482 ADVTVLVVGLDQTVEAEGLDRVNLTLPGYQEKLVRDVANAAKKTVVLVIMSAGPIDISFA 541
Query: 548 KYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQ 606
K I ++LW GYPGEAG A+A+VIFGD+NP GRLP TWYPQ++ KV MTDM MRP
Sbjct: 542 KNLSTIRAVLWVGYPGEAGGDAIAQVIFGDYNPSGRLPETWYPQEFADKVAMTDMNMRPN 601
Query: 607 ATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVH 666
+TSG PGR+YRFY GK ++ FG GLSYS + ST ++ ++H
Sbjct: 602 STSGFPGRSYRFYTGKPIYKFGYGLSYSSF------------------STFVLSAPSIIH 643
Query: 667 YKSVP----------ELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPAR-----RG 711
K+ P ++ T C K + IGVKNHG +G H VL+F KP + G
Sbjct: 644 IKTNPIMNLNKTTSVDISTVNCHDLKIRIVIGVKNHGLRSGSHVVLVFWKPPKCSKSLVG 703
Query: 712 NGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
G P+ QLVGF+ V + + + C++LS G + G H LV+G
Sbjct: 704 GGVPLTQLVGFERVEVGRSMTEKFTVDFDVCKALSLVDTHGKRKLVTGHHKLVIG 758
>gi|302786124|ref|XP_002974833.1| hypothetical protein SELMODRAFT_101733 [Selaginella moellendorffii]
gi|300157728|gb|EFJ24353.1| hypothetical protein SELMODRAFT_101733 [Selaginella moellendorffii]
Length = 784
Score = 797 bits (2059), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/759 (50%), Positives = 517/759 (68%), Gaps = 12/759 (1%)
Query: 23 RVDSTQPPFSCD-PSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPR 81
R + Q ++CD SN S +FPFC T L I R +DLVSRLTLDEK+ ++VN+A IPR
Sbjct: 28 RASTAQLRYACDVSSNASLGSFPFCDTKLGIDVRVQDLVSRLTLDEKVDEMVNAAQGIPR 87
Query: 82 LGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLE 141
LG+P+Y+WW EALHGVA G+ F G ATSFP I TAASF+S L+Y IG+A+ E
Sbjct: 88 LGVPSYQWWQEALHGVAS-SPGVQFGGLAPAATSFPMPIATAASFNSTLFYSIGEAVSSE 146
Query: 142 ARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNG 201
ARAL+N G+A G+TFW+PN+NIFRDPRWGRGQETPGEDPL+ K+A YVRG+QG + G
Sbjct: 147 ARALHNLGRA-GLTFWSPNVNIFRDPRWGRGQETPGEDPLLASKFASLYVRGLQGGAYEG 205
Query: 202 GKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASG 261
G L+ SACCKH TAYD+DNWKG RY F+A V+ QDL DTY PPF+SC++ GR S
Sbjct: 206 SASDGFLKVSACCKHLTAYDVDNWKGMDRYHFNAEVSEQDLVDTYNPPFQSCIEDGRVSS 265
Query: 262 IMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVD 321
+MC+YNRVNG+P+CADRNLL++T R WGF+GYI SDCDA+ ++++ YA S EDAV D
Sbjct: 266 VMCSYNRVNGVPTCADRNLLTETVRNSWGFNGYIVSDCDALQVLFEDTTYAPSAEDAVAD 325
Query: 322 VLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGK 381
+ AG+D+NCG+FL KH K+A++ K+ E+++D A+ NL RMRLGLF+G+P QP+
Sbjct: 326 SILAGLDLNCGTFLGKHAKSALQAGKITEADLDHAVSNLMRTRMRLGLFDGDPNSQPYSS 385
Query: 382 IGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNY 441
+GA +CS HQ LAL AA G+VLLKN G LPL + ++ALIGPNAN+ T+LGNY
Sbjct: 386 LGATDICSNDHQQLALDAALQGVVLLKND-GSLPLSTALK-TVALIGPNANATYTMLGNY 443
Query: 442 AGPSCRSITPLQALQNYVENTVYYPGCDTVACSSAS-IDKAVDIAKGADHVVLMMGLDQT 500
G C+ I+PLQ +Q Y N +Y PGC VAC+ + AV++A AD VVL++GLDQ+
Sbjct: 444 EGIPCKYISPLQGMQIYSSNILYSPGCRNVACNEGDLVASAVEVATKADAVVLVVGLDQS 503
Query: 501 QEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAG 560
QE+E DR L+LPG Q +L++ +A A P++LV++ GPVDI+ K + I S++W G
Sbjct: 504 QERETFDRTSLLLPGMQSQLVSNIANAVTSPIVLVIMSAGPVDISTFKDNSRISSVIWLG 563
Query: 561 YPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYE 620
YPG++G ALA V+FG +NPGGRLP TWY +++ V M DM+MRP SG PGR+YRFY
Sbjct: 564 YPGQSGGAALAHVVFGAYNPGGRLPNTWYHEEFTNVSMLDMQMRPNPLSGYPGRSYRFYT 623
Query: 621 GKEVFPFGCGLSYSKYSYKF-KAVSQNKLYLNQSSSTKMVENQDVVHYKS----VPELGT 675
G ++ FG GLSYS Y YKF A ++ + + + +++ + KS +P
Sbjct: 624 GTPLYNFGDGLSYSTYFYKFLLAPTKLSFFKSNTGNSRGCPAVNRSKAKSGCFHLPADDL 683
Query: 676 EFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEI 735
E C + F V++ V N G +G H VL+F P G P+KQL+ FQ V L + +
Sbjct: 684 ETCNSILFQVSVEVSNLGPRSGSHSVLIFSAPPPV-EGAPLKQLIAFQKVHLESDTTQRL 742
Query: 736 VFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
+F + PC+ LS R +G + G H L++G+ + + I
Sbjct: 743 IFGIDPCKHLSSVRRNGKRFLHSGRHKLLIGNAVHILDI 781
>gi|15230897|ref|NP_188596.1| putative beta-D-xylosidase 5 [Arabidopsis thaliana]
gi|259585724|sp|Q9LJN4.2|BXL5_ARATH RecName: Full=Probable beta-D-xylosidase 5; Short=AtBXL5; Flags:
Precursor
gi|332642747|gb|AEE76268.1| putative beta-D-xylosidase 5 [Arabidopsis thaliana]
Length = 781
Score = 797 bits (2059), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 399/775 (51%), Positives = 519/775 (66%), Gaps = 40/775 (5%)
Query: 10 FPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKI 69
F L L +L++ + +Q F+CD S P+T + FC +L RA+DLVSRL+L EK+
Sbjct: 6 FVRLSLLIIALVSSLCESQKNFACDISAPATAKYGFCNVSLSYEARAKDLVSRLSLKEKV 65
Query: 70 SQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSY 129
QLVN A +PRLG+P YEWWSEALHGV+ VG G+ FNGT+ GATSFP ILTAASF++
Sbjct: 66 QQLVNKATGVPRLGVPPYEWWSEALHGVSDVGPGVHFNGTVPGATSFPATILTAASFNTS 125
Query: 130 LWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVS 189
LW ++G+ + EARA++N G A G+T+W+PN+N+FRDPRWGRGQETPGEDPLV KYAV+
Sbjct: 126 LWLKMGEVVSTEARAMHNVGLA-GLTYWSPNVNVFRDPRWGRGQETPGEDPLVVSKYAVN 184
Query: 190 YVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPP 249
YV+G+Q D + GK + +L+ S+CCKH+TAYDLDNWKG R+ FDA+VT QDL DTYQ P
Sbjct: 185 YVKGLQ-DVHDAGKSR-RLKVSSCCKHYTAYDLDNWKGIDRFHFDAKVTKQDLEDTYQTP 242
Query: 250 FESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAE 309
F+SCV++G S +MC+YNRVNGIP+CAD NLL R QW GYI SDCD++ + ++
Sbjct: 243 FKSCVEEGDVSSVMCSYNRVNGIPTCADPNLLRGVIRGQWRLDGYIVSDCDSIQVYFNDI 302
Query: 310 GYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGL 369
Y K+ EDAV LKAG+++NCG FL K+T+ AVK KKL S++D AL + V MRLG
Sbjct: 303 HYTKTREDAVALALKAGLNMNCGDFLGKYTENAVKLKKLNGSDVDEALIYNYIVLMRLGF 362
Query: 370 FNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGP 429
F+G+P PFG +G VCS HQ+LAL+AA+ GIVLL+N G LPLPK+ LA+IGP
Sbjct: 363 FDGDPKSLPFGNLGPSDVCSKDHQMLALEAAKQGIVLLEN-RGDLPLPKTTVKKLAVIGP 421
Query: 430 NANSAKTLLGNYAGPSCRSITPLQALQNYV-ENTVYYPGCDTVACSSAS-IDKAVDIAKG 487
NAN+ K ++ NYAG C+ +P+Q LQ YV E VY PGC V C + I AV
Sbjct: 422 NANATKVMISNYAGVPCKYTSPIQGLQKYVPEKIVYEPGCKDVKCGDQTLISAAVKAVSE 481
Query: 488 ADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFA 547
AD VL++GLDQT E E LDRV+L LPG Q++L+ VA AAKK V+LV++ GP+DI+FA
Sbjct: 482 ADVTVLVVGLDQTVEAEGLDRVNLTLPGYQEKLVRDVANAAKKTVVLVIMSAGPIDISFA 541
Query: 548 KYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQ 606
K I ++LW GYPGEAG A+A+VIFGD+NP GRLP TWYPQ++ KV MTDM MRP
Sbjct: 542 KNLSTIRAVLWVGYPGEAGGDAIAQVIFGDYNPSGRLPETWYPQEFADKVAMTDMNMRPN 601
Query: 607 ATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVH 666
+TSG PGR+YRFY GK ++ FG GLSYS + ST ++ ++H
Sbjct: 602 STSGFPGRSYRFYTGKPIYKFGYGLSYSSF------------------STFVLSAPSIIH 643
Query: 667 YKSVP----------ELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPAR-----RG 711
K+ P ++ T C K + IGVKNHG +G H VL+F KP + G
Sbjct: 644 IKTNPIMNLNKTTSVDISTVNCHDLKIRIVIGVKNHGLRSGSHVVLVFWKPPKCSKSLVG 703
Query: 712 NGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
G P+ QLVGF+ V + + + C++LS G + G H LV+G
Sbjct: 704 GGVPLTQLVGFERVEVGRSMTEKFTVDFDVCKALSLVDTHGKRKLVTGHHKLVIG 758
>gi|449496501|ref|XP_004160150.1| PREDICTED: probable beta-D-xylosidase 6-like, partial [Cucumis
sativus]
Length = 767
Score = 797 bits (2058), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 393/764 (51%), Positives = 520/764 (68%), Gaps = 25/764 (3%)
Query: 31 FSCDPS-NPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEW 89
F C+ S +PS +PFC +L + RA+ LVS LTLDEKI QL N+A +IPRLGIP+Y+W
Sbjct: 10 FPCNSSLHPS---YPFCNRSLSFTARAQSLVSLLTLDEKIQQLSNNASSIPRLGIPSYQW 66
Query: 90 WSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG 149
WSE LHG+A G G+ FNG+I ATSFPQV++TAASF+ LW+ IG AI +EARA++N G
Sbjct: 67 WSEGLHGIATNGPGVSFNGSITSATSFPQVLVTAASFNRTLWFLIGSAIAVEARAMFNVG 126
Query: 150 QAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQ-GDTFNGGKLK--- 205
Q G+T WAPNINIFRDPRWGRGQETPGEDP+V Y++ +VRG+Q G+ +++
Sbjct: 127 QC-GLTIWAPNINIFRDPRWGRGQETPGEDPMVASAYSIQFVRGLQSGNWMKEHEIRNEV 185
Query: 206 -------GKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGR 258
G L SACCKHFTAYDL+ W TRY FD+ VT QDL DTYQPPF SC++QG+
Sbjct: 186 LEEDNGMGSLMVSACCKHFTAYDLEKWNNFTRYTFDSVVTEQDLGDTYQPPFRSCIQQGK 245
Query: 259 ASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDA 318
AS +MC+YN VNG+P+CA+ +LL K AR WG GYITSDCDAV+ +Y+ + Y +PEDA
Sbjct: 246 ASCLMCSYNAVNGVPACANPDLLKK-ARNDWGLKGYITSDCDAVATVYEYQKYTDTPEDA 304
Query: 319 VVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQP 378
+ DVLKAGMD+NCG+F+ + TK+A+ Q K+ E E+D AL NLFSV+ RLG F+GNP
Sbjct: 305 IADVLKAGMDINCGTFMLRGTKSAIDQGKVREEELDSALINLFSVQARLGFFDGNPREGK 364
Query: 379 FGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLL 438
FG++GA VC+ H+ LAL+AA+ GIVLLKN + LPL K+ SL +IG AN + LL
Sbjct: 365 FGELGAQDVCTAQHKTLALEAARQGIVLLKNENKFLPLDKNAISSLTVIGSLANDSSKLL 424
Query: 439 GNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSAS-IDKAVDIAKGADHVVLMMGL 497
G YAG C ++ ++ Q Y E + GC V C+S + + A+ IAK AD V+ + GL
Sbjct: 425 GGYAGVPCSPMSLVEGFQEYAETIFFASGCLDVPCASDNRFEDAILIAKKADFVIAVAGL 484
Query: 498 DQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSIL 557
D +QE E+LDRV L+LPG+Q +L++ VA +KKP+ILVL+ GGP+DI+FAK D + SIL
Sbjct: 485 DASQETEDLDRVSLLLPGKQMDLVSSVASVSKKPIILVLIGGGPLDISFAKKDSRVASIL 544
Query: 558 WAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYR 617
W G PGEAG ALAEVIFGD+NPGGRLP+TWYPQ + VPM DM MRP + G PGRTYR
Sbjct: 545 WIGNPGEAGGKALAEVIFGDYNPGGRLPVTWYPQSFTNVPMNDMHMRPNPSRGYPGRTYR 604
Query: 618 FYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTK-----MVENQDVVHYKSVPE 672
FY G ++ FG GLSY+ + Y+ + + L ++ +++ V + + Y V E
Sbjct: 605 FYTGDRIYGFGEGLSYTSFKYRLLSAPKKVNLLGKAETSRRRIIPQVRDGVNMSYMEVEE 664
Query: 673 LGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEK 732
+ E C+ +F V + V N GE G H V++F + + G P +QL+GF + + +
Sbjct: 665 V--ESCDLLRFEVKLSVSNIGEFDGSHVVMMFSEFPKVLTGTPQRQLIGFDRLYVKRNQS 722
Query: 733 AEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
AE + PC +S A E G VI G H + +GD E+ ISI V
Sbjct: 723 AESSIMVDPCNHVSLADEYGKRVIPLGDHTISLGDLEHVISIQV 766
>gi|449451581|ref|XP_004143540.1| PREDICTED: probable beta-D-xylosidase 6-like [Cucumis sativus]
Length = 777
Score = 796 bits (2057), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 394/769 (51%), Positives = 522/769 (67%), Gaps = 25/769 (3%)
Query: 26 STQPPFSCDPS-NPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGI 84
S+ F C+ S +PS +PFC +L + RA+ LVS LTLDEKI QL N+A +IPRLGI
Sbjct: 15 SSDFQFPCNSSLHPS---YPFCNRSLSFTARAQSLVSLLTLDEKIQQLSNNASSIPRLGI 71
Query: 85 PAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA 144
P+Y+WWSE LHG+A G G+ FNG+I ATSFPQV++TAASF+ LW+ IG AI +EARA
Sbjct: 72 PSYQWWSEGLHGIATNGPGVSFNGSITSATSFPQVLVTAASFNRTLWFLIGSAIAVEARA 131
Query: 145 LYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQ-GDTFNGGK 203
++N GQ G+T WAPNINIFRDPRWGRGQETPGEDP+V Y++ +VRG+Q G+ +
Sbjct: 132 MFNVGQC-GLTIWAPNINIFRDPRWGRGQETPGEDPMVASAYSIQFVRGLQSGNWMKEHE 190
Query: 204 LK----------GKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESC 253
++ G L SACCKHFTAYDL+ W TRY FD+ VT QDL DTYQPPF SC
Sbjct: 191 IRNEVLEEDNGMGSLMVSACCKHFTAYDLEKWNNFTRYTFDSVVTEQDLGDTYQPPFRSC 250
Query: 254 VKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAK 313
++QG+AS +MC+YN VNG+P+CA+ +LL K AR WG GYITSDCDAV+ +Y+ + Y
Sbjct: 251 IQQGKASCLMCSYNAVNGVPACANPDLLKK-ARNDWGLKGYITSDCDAVATVYEYQKYTD 309
Query: 314 SPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGN 373
+PEDA+ DVLKAGMD+NCG+F+ + TK+A+ Q K+ E E+D AL NLFSV+ RLG F+GN
Sbjct: 310 TPEDAIADVLKAGMDINCGTFMLRGTKSAIDQGKVREEELDSALINLFSVQARLGFFDGN 369
Query: 374 PTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANS 433
P FG++GA VC+ H+ LAL+AA+ GIVLLKN + LPL K+ SL +IG AN
Sbjct: 370 PREGKFGELGAQDVCTAQHKTLALEAARQGIVLLKNENKFLPLDKNAISSLTVIGSLAND 429
Query: 434 AKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSAS-IDKAVDIAKGADHVV 492
+ LLG YAG C ++ ++ Q Y E + GC V C+S + + A+ IAK AD V+
Sbjct: 430 SSKLLGGYAGVPCSPMSLVEGFQEYAETIFFASGCLDVPCASDNRFEDAILIAKKADFVI 489
Query: 493 LMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRN 552
+ GLD +QE E+LDRV L+LPG+Q +L++ VA +KKP+ILVL+ GGP+DI+FAK D
Sbjct: 490 AVAGLDASQETEDLDRVSLLLPGKQMDLVSSVASVSKKPIILVLIGGGPLDISFAKKDSR 549
Query: 553 IGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNP 612
+ SILW G PGEAG ALAEVIFGD+NPGGRLP+TWYPQ + VPM DM MRP + G P
Sbjct: 550 VASILWIGNPGEAGGKALAEVIFGDYNPGGRLPVTWYPQSFTNVPMNDMHMRPNPSRGYP 609
Query: 613 GRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTK-----MVENQDVVHY 667
GRTYRFY G ++ FG GLSY+ + Y+ + + L ++ +++ V + + Y
Sbjct: 610 GRTYRFYTGDRIYGFGEGLSYTSFKYRLLSAPKKVNLLGKAETSRRRIIPQVRDGVNMSY 669
Query: 668 KSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVIL 727
V E+ E C+ +F V + V N GE G H V++F + + G P +QL+GF + +
Sbjct: 670 MEVEEV--ESCDLLRFEVKLSVSNIGEFDGSHVVMMFSEFPKVLTGTPQRQLIGFDRLYV 727
Query: 728 NAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
+ AE + PC +S A E G VI G H + +GD E+ ISI V
Sbjct: 728 KRNQSAESSIMVDPCNHVSLADEYGKRVIPLGDHTISLGDLEHVISIQV 776
>gi|222615852|gb|EEE51984.1| hypothetical protein OsJ_33664 [Oryza sativa Japonica Group]
Length = 753
Score = 791 bits (2043), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 394/738 (53%), Positives = 506/738 (68%), Gaps = 29/738 (3%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
FC L QRA DLV+ LTL EK+SQL + A + RLG+PAYEWWSE LHG++ G+GI
Sbjct: 31 FCDAWLTAEQRAADLVANLTLAEKVSQLGDRAAGVARLGVPAYEWWSEGLHGLSIWGRGI 90
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIF 164
FNGT+R TSFPQVILTAA+FD+ LW R+G+A+G EARALYN GQA G+T W+PN+NIF
Sbjct: 91 RFNGTVRAVTSFPQVILTAAAFDAGLWRRVGEAVGAEARALYNLGQANGLTIWSPNVNIF 150
Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRG-VQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
RDP R PG+ RG G+ GG +ASACCKH TAYDLD
Sbjct: 151 RDPSGTR----PGD-----------ARRGPRHGEQGIGG------EASACCKHATAYDLD 189
Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
W RY +D++VT+QDL DTY PPF+SCV +G+A+ IMC YN +NG+P+CA +LL+K
Sbjct: 190 YWNNVVRYNYDSKVTLQDLEDTYNPPFKSCVAEGKATCIMCGYNSINGVPACASSDLLTK 249
Query: 284 TARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAV 343
R++WG +GY+ SDCDAV+ I DA Y SPED V +K GMDVNCG++ Q H AAV
Sbjct: 250 KVRQEWGMNGYVASDCDAVATIRDAHHYTLSPEDTVAVSIKVGMDVNCGNYTQVHAMAAV 309
Query: 344 KQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQP-FGKIGADVVCSPAHQVLALQAAQD 402
++ L E +IDRAL NLF+VRMRLG F+G+P +G +GA VCSPAH+ LAL+AAQD
Sbjct: 310 QKGNLTEKDIDRALVNLFAVRMRLGHFDGDPRSNAVYGHLGAADVCSPAHKSLALEAAQD 369
Query: 403 GIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV-EN 461
GIVLLKN G LPL S SLA+IGPNA++ L GNY GP C + TPLQ ++ Y+ +
Sbjct: 370 GIVLLKNDAGALPLQPSAVTSLAVIGPNADNLGALHGNYFGPPCETTTPLQGIKGYLGDR 429
Query: 462 TVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELI 521
+ GCD+ AC+ + ++A +A +DHVVL MGL Q QE++ LDR L+LPG QQ LI
Sbjct: 430 ARFLAGCDSPACAVDATNEAAALASSSDHVVLFMGLSQKQEQDGLDRTSLLLPGEQQGLI 489
Query: 522 TRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
T VA AA++PVILVLL GGPVD+TFAK + IG+ILWAGYPG+AG +A+A+V+FGDHNP
Sbjct: 490 TAVANAARRPVILVLLTGGPVDVTFAKDNPKIGAILWAGYPGQAGGLAIAKVLFGDHNPS 549
Query: 582 GRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYK-F 640
GRLP+TWYP+++ KVPMTDM+MR +G PGR+YRFY+G V+ FG GLSYSK+S + F
Sbjct: 550 GRLPVTWYPEEFTKVPMTDMRMRADPATGYPGRSYRFYQGNTVYNFGYGLSYSKFSRRMF 609
Query: 641 KAVSQ----NKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMA 696
+ S N L + + ++ + V E+G E C F + V+NHG M
Sbjct: 610 SSFSTSNAGNLSLLAGVMARRAGDDGGGMSSYLVKEIGVERCSRLVFPAVVEVQNHGPMD 669
Query: 697 GKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVI 756
GKH VL++++ GRP +QL+GF+S + EKA + FE+SPCE S EDG VI
Sbjct: 670 GKHSVLMYLRWPTTSGGRPARQLIGFRSQHVKVGEKAMVSFEVSPCEHFSWVGEDGERVI 729
Query: 757 EEGTHFLVVGDEEYPISI 774
+ G HFL+VGDEE S
Sbjct: 730 DGGAHFLMVGDEELETSF 747
>gi|302760655|ref|XP_002963750.1| hypothetical protein SELMODRAFT_80102 [Selaginella moellendorffii]
gi|300169018|gb|EFJ35621.1| hypothetical protein SELMODRAFT_80102 [Selaginella moellendorffii]
Length = 785
Score = 790 bits (2041), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 380/760 (50%), Positives = 517/760 (68%), Gaps = 14/760 (1%)
Query: 23 RVDSTQPPFSCD-PSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPR 81
RV + QP ++CD SN S +FPFC T L + R +DLVSRLTLDEK+ ++VN+A IPR
Sbjct: 29 RVSTAQPRYACDVSSNASLGSFPFCDTKLGVDVRVQDLVSRLTLDEKVDEMVNAAQGIPR 88
Query: 82 LGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLE 141
LG+P+Y+WW EALHGVA G+ F G ATSFP I AASF+S L+Y IG+A+ E
Sbjct: 89 LGVPSYQWWQEALHGVAS-SPGVQFGGLAPAATSFPMPIAMAASFNSTLFYSIGEAVSSE 147
Query: 142 ARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNG 201
ARAL+N G+A G+TFW+PN+NIFRDPRWGRGQETPGEDPL+ K+A YVRG+QG + G
Sbjct: 148 ARALHNLGRA-GLTFWSPNVNIFRDPRWGRGQETPGEDPLLASKFASLYVRGLQGGAYGG 206
Query: 202 GKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASG 261
G L+ SACCKH TAYD+DNWKG RY F+A V+ QDL DTY PPF+SC++ GR S
Sbjct: 207 SASDGFLKVSACCKHLTAYDMDNWKGMDRYHFNAEVSEQDLVDTYNPPFQSCIEDGRVSS 266
Query: 262 IMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVD 321
+MC+YNRVNG+P+CADR+LL++T R WGF+GYI SDCDA+ ++++ YA S EDAV D
Sbjct: 267 VMCSYNRVNGVPTCADRSLLTETVRNSWGFNGYIVSDCDALQVLFEDTTYAPSAEDAVAD 326
Query: 322 VLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGK 381
+ AG+D+NCG+FL KH K+A++ K+ E+++D A+ NL RMRLGLF+G+ +P+
Sbjct: 327 SILAGLDLNCGTFLGKHAKSALQAGKVTEADLDHAISNLMRTRMRLGLFDGDLNTRPYSS 386
Query: 382 IGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNY 441
+GA +CS HQ LAL AA G+VLLKN G LPL + ++ALIGPNAN+ T+LGNY
Sbjct: 387 LGATDICSNDHQQLALDAALQGVVLLKND-GSLPLSTALK-TVALIGPNANATYTMLGNY 444
Query: 442 AGPSCRSITPLQALQNYVENTVYYPGCDTVACSSAS-IDKAVDIAKGADHVVLMMGLDQT 500
G C+ ++PLQ +Q Y N +Y PGC VACS + AV++A AD VVL++GLDQ+
Sbjct: 445 EGIPCKYVSPLQGMQIYNNNILYSPGCRDVACSEGDLVASAVEVATKADAVVLVVGLDQS 504
Query: 501 QEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAG 560
QE+E DR L+LPG Q +L++ +A A P++LV++ GPVDI+ K + I S++W G
Sbjct: 505 QERETFDRTSLLLPGMQSQLVSNIANAVTCPIVLVIMSAGPVDISTFKDNSRISSVIWIG 564
Query: 561 YPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYE 620
YPG++G ALA V+FG +NPGGRLP TWY +++ V M DM+MRP SG PGR+YRFY
Sbjct: 565 YPGQSGGAALAHVVFGAYNPGGRLPNTWYHEEFTNVSMLDMRMRPNPPSGYPGRSYRFYT 624
Query: 621 GKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKS------VPELG 674
G ++ FG GLSYS Y YKF ++ +L +S++ + V ++ +P
Sbjct: 625 GTPLYNFGDGLSYSTYLYKF-LLAPTRLSFFKSNTRNSRDCPTVNRSEAEFGCFHLPADD 683
Query: 675 TEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAE 734
E C + F V++ V N G +G H VL+F P G P+KQL+ FQ V L +
Sbjct: 684 LETCNSILFQVSVEVSNLGPRSGSHSVLIFSAPPPV-EGAPLKQLIAFQKVHLESDTTQR 742
Query: 735 IVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
++F + PC+ LS R +G + G H L++G+ + + +
Sbjct: 743 LIFGIDPCKHLSSVRRNGKRFLHSGRHKLLIGNAVHILDV 782
>gi|26449574|dbj|BAC41913.1| putative beta-xylosidase [Arabidopsis thaliana]
Length = 732
Score = 789 bits (2038), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/731 (52%), Positives = 509/731 (69%), Gaps = 23/731 (3%)
Query: 65 LDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAA 124
L EKI QL N+A ++PRLGIP YEWWSE+LHG+A G G+ FNG+I ATSFPQVI++AA
Sbjct: 2 LPEKIGQLSNTAASVPRLGIPPYEWWSESLHGLADNGPGVSFNGSISAATSFPQVIVSAA 61
Query: 125 SFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTG 184
SF+ LWY IG A+ +E RA+YN GQA G+TFWAPNIN+FRDPRWGRGQETPGEDP V
Sbjct: 62 SFNRTLWYEIGSAVAVEGRAMYNGGQA-GLTFWAPNINVFRDPRWGRGQETPGEDPKVVS 120
Query: 185 KYAVSYVRGVQ----------------GDTFNGGKLKGKLQASACCKHFTAYDLDNWKGT 228
+Y V +VRG Q D + GKL SACCKHFTAYDL+ W
Sbjct: 121 EYGVEFVRGFQEKKKRKVLKRRFSDDVDDDRHDDDADGKLMLSACCKHFTAYDLEKWGNF 180
Query: 229 TRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQ 288
TRY F+A VT QD+ DTYQPPFE+C++ G+AS +MC+YN VNG+P+CA +LL K AR +
Sbjct: 181 TRYDFNAVVTEQDMEDTYQPPFETCIRDGKASCLMCSYNAVNGVPACAQGDLLQK-ARVE 239
Query: 289 WGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKL 348
WGF GYITSDCDAV+ I+ +GY KSPE+AV D +KAG+D+NCG+++ +HT++A++Q K+
Sbjct: 240 WGFEGYITSDCDAVATIFAYQGYTKSPEEAVADAIKAGVDINCGTYMLRHTQSAIEQGKV 299
Query: 349 PESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLK 408
E +DRAL NLF+V++RLGLF+G+P +GK+G++ +CS H+ LAL+A + GIVLLK
Sbjct: 300 SEELVDRALLNLFAVQLRLGLFDGDPRRGQYGKLGSNDICSSDHRKLALEATRQGIVLLK 359
Query: 409 NSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGC 468
N H LLPL K+ SLA++GP AN+ + G Y G C+ T L YV+ T Y GC
Sbjct: 360 NDHKLLPLNKNHVSSLAIVGPMANNISNMGGTYTGKPCQRKTLFTELLEYVKKTSYASGC 419
Query: 469 DTVACSSAS-IDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEA 527
V+C S + +AV IAKGAD V+++ GLD +QE E+ DRV L LPG+Q++L++ VA
Sbjct: 420 SDVSCDSDTGFGEAVAIAKGADFVIVVAGLDLSQETEDKDRVSLSLPGKQKDLVSHVAAV 479
Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMT 587
+KKPVILVL GGPVD+TFAK D IGSI+W GYPGE G ALAE+IFGD NPGGRLP T
Sbjct: 480 SKKPVILVLTGGGPVDVTFAKNDPRIGSIIWIGYPGETGGQALAEIIFGDFNPGGRLPTT 539
Query: 588 WYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKA----V 643
WYP+ + V M+DM MR ++ G PGRTYRFY G +V+ FG GLSY+K+ YK + +
Sbjct: 540 WYPESFTDVAMSDMHMRANSSRGYPGRTYRFYTGPQVYSFGTGLSYTKFEYKILSAPIRL 599
Query: 644 SQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLL 703
S ++L QSS K +++ + + Y + ++ CE+ +F V + V N GE+ G H V+L
Sbjct: 600 SLSELLPQQSSHKKQLQHGEELRYLQLDDVIVNSCESLRFNVRVHVSNTGEIDGSHVVML 659
Query: 704 FVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFL 763
F K +G P KQL+G+ V + + E E VF + PC+ LS A + G VI G+H L
Sbjct: 660 FSKMPPVLSGVPEKQLIGYDRVHVRSNEMMETVFVIDPCKQLSVANDVGKRVIPLGSHVL 719
Query: 764 VVGDEEYPISI 774
+GD ++ +S+
Sbjct: 720 FLGDLQHSLSV 730
>gi|357485313|ref|XP_003612944.1| Beta-D-xylosidase [Medicago truncatula]
gi|355514279|gb|AES95902.1| Beta-D-xylosidase [Medicago truncatula]
Length = 783
Score = 788 bits (2034), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 395/783 (50%), Positives = 527/783 (67%), Gaps = 32/783 (4%)
Query: 12 LLCLCFTSLLTRVDS--TQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKI 69
++ F+ LL + T P + C P + +PFC +LPIS R L+S LTL +KI
Sbjct: 10 IIIFLFSLLLIHLPKFFTTPDYPCKPPH---SHYPFCNISLPISTRTTSLISLLTLSDKI 66
Query: 70 SQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSY 129
+QL N+A +I LGIP+Y+WWSEALHG+A G G+ FNG+++ AT+FPQVI++AA+F+
Sbjct: 67 NQLSNTASSISHLGIPSYQWWSEALHGIATNGPGVNFNGSVKSATNFPQVIVSAAAFNRS 126
Query: 130 LWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVS 189
LW+ IG A+G+E RA++N GQA G++FWAPN+N+FRDPRWGRGQETPGEDP+V YAV
Sbjct: 127 LWFLIGYAVGVEGRAMFNVGQA-GLSFWAPNVNVFRDPRWGRGQETPGEDPMVGSAYAVE 185
Query: 190 YVRGVQG-----------DTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVT 238
+VRG+QG D+ + G L SACCKHFTAYDL+ W +RY F+A VT
Sbjct: 186 FVRGIQGVDGIKKVLNDHDSDDDG-----LMVSACCKHFTAYDLEKWGEFSRYNFNAVVT 240
Query: 239 MQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSD 298
QDL DTYQPPF CV+QG+AS +MC+YN VNG+P+CA ++LL R +WGF GYI SD
Sbjct: 241 QQDLEDTYQPPFRGCVQQGKASCLMCSYNEVNGVPACASKDLLG-LVRNKWGFEGYIASD 299
Query: 299 CDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALH 358
CDAV+ +++ + YAKS EDAV DVLKAGMD+NCG+F+ +HT++A++Q + E ++DRAL
Sbjct: 300 CDAVATVFEYQKYAKSAEDAVADVLKAGMDINCGTFMLRHTESAIEQGLVKEEDLDRALF 359
Query: 359 NLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPK 418
NLFSV+MRLGLFNG+P FGK+G VC+P H+ LAL+AA+ GIVLLKN + LPL K
Sbjct: 360 NLFSVQMRLGLFNGDPEKGKFGKLGPQDVCTPEHKKLALEAARQGIVLLKNDNKFLPLDK 419
Query: 419 SKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSAS- 477
VSLA+IGP A +++ L G Y+G C + L+ YV+ Y GC V C S
Sbjct: 420 KDRVSLAIIGPMATTSE-LGGGYSGIPCSPRSLYDGLKEYVKTISYAFGCSDVKCDSDDG 478
Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLL 537
A+DIAK AD VV++ GLD T E E+LDRV L+LPG+Q +L++RVA A+K+PVILVL
Sbjct: 479 FAVAIDIAKQADFVVIVAGLDTTLETEDLDRVSLLLPGKQMDLVSRVAAASKRPVILVLT 538
Query: 538 CGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVP 597
GGP+D++FA+ ++ I SILW GYPGEAG ALAE+IFG+ NP GRLPMTWYP+ + VP
Sbjct: 539 GGGPLDVSFAESNQLITSILWIGYPGEAGGKALAEIIFGEFNPAGRLPMTWYPESFTNVP 598
Query: 598 MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTK 657
M DM MR + G PGRTYRFY G ++ FG GLSYS +SY+ + + +KL L+++++
Sbjct: 599 MNDMGMRADPSRGYPGRTYRFYTGSRIYGFGHGLSYSDFSYRVLS-APSKLSLSKTTNGG 657
Query: 658 MVEN------QDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRG 711
+ + +DV V + C + F V I V N G+M G H V+LF K +
Sbjct: 658 LRRSLLNKVEKDVFEVDHVHVDELQNCNSLSFSVHISVMNVGDMDGSHVVMLFSKWPKNI 717
Query: 712 NGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYP 771
G P QLVG + + + E PCE S A E G ++ G H L VGD E+
Sbjct: 718 QGSPESQLVGPSRLHTVSNKSIETSILADPCEHFSFADEQGKRILPLGNHILNVGDVEHI 777
Query: 772 ISI 774
+SI
Sbjct: 778 VSI 780
>gi|224058158|ref|XP_002299457.1| predicted protein [Populus trichocarpa]
gi|222846715|gb|EEE84262.1| predicted protein [Populus trichocarpa]
Length = 780
Score = 783 bits (2022), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 398/757 (52%), Positives = 522/757 (68%), Gaps = 16/757 (2%)
Query: 28 QPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAY 87
P F C P P+ T+ FC +LPI++RA+ L+S LTL EKI QL ++A IPRLGIP Y
Sbjct: 27 NPQFPCKP--PTHNTYSFCNKSLPITRRAQSLISHLTLQEKIQQLSDNASGIPRLGIPHY 84
Query: 88 EWWSEALHGVAGVGKGIFFN--GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARAL 145
EWWSE+LHG++ G G+ F G + AT FPQVI++AASF+ LW+ IG AI +EARA+
Sbjct: 85 EWWSESLHGISINGPGVSFKNGGPVTSATGFPQVIVSAASFNRTLWFLIGSAIAIEARAM 144
Query: 146 YNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTF---NGG 202
YN GQA G+TFWAPNINIFRDPRWGRGQETPGEDP+V YA+ +V+G QG + +G
Sbjct: 145 YNVGQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVASAYAIEFVKGFQGGHWKNEDGE 203
Query: 203 KLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGI 262
KL SACCKH TAYDL+ W +RY F+A VT QD+ DTYQPPF SC+++G+AS +
Sbjct: 204 INDDKLMLSACCKHSTAYDLEKWGNFSRYSFNAVVTEQDMEDTYQPPFRSCIQKGKASCL 263
Query: 263 MCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDV 322
MC+YN VNG+P+CA +LL K R +WGF GYITSDCDAV+ I++ + Y+KSPEDAV
Sbjct: 264 MCSYNEVNGVPACAREDLLQK-PRTEWGFKGYITSDCDAVATIFEYQNYSKSPEDAVAIA 322
Query: 323 LKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKI 382
LKAGMD+NCG+++ ++ ++AV++ KL E +IDRALHNLFSV++RLGLF+G+P FGK+
Sbjct: 323 LKAGMDINCGTYVLRNAQSAVEKGKLQEEDIDRALHNLFSVQLRLGLFDGDPRKGQFGKL 382
Query: 383 GADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA 442
G VC+ H+ LAL+AA+ GIVLLKN LLPL K SLA+IGP AN A +L G+Y
Sbjct: 383 GPKNVCTKEHKTLALEAARQGIVLLKNDKKLLPLNKKAVSSLAIIGPLANMANSLGGDYT 442
Query: 443 GPSCRSITPLQALQNYVENTVYYPGCDTVAC-SSASIDKAVDIAKGADHVVLMMGLDQTQ 501
G C + + L+ YV+ T Y GC VAC S KA+ +AK AD V+++ GLD +Q
Sbjct: 443 GYPCDPQSLFEGLKAYVKKTSYAIGCLDVACVSDTQFHKAIIVAKRADFVIIVAGLDLSQ 502
Query: 502 EKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGY 561
E EE DRV L+LPG+Q L++ VA A+KKPVILVL GGP+D++FAK D I SILW GY
Sbjct: 503 ETEEHDRVSLLLPGKQMSLVSSVAAASKKPVILVLTGGGPLDVSFAKGDPRIASILWIGY 562
Query: 562 PGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEG 621
PGEAGA ALAE+IFG++NPGGRLPMTWYP+ + +V MTDM MRP + G PGRTYRFY G
Sbjct: 563 PGEAGAKALAEIIFGEYNPGGRLPMTWYPESFTEVSMTDMNMRPNPSRGYPGRTYRFYTG 622
Query: 622 KEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTK----MVENQDVVHYKSVPELGTEF 677
V+ FG GLSY+ ++YK + SS + + + + Y ++ E+ +
Sbjct: 623 NRVYGFGGGLSYTNFTYKILSAPSKLSLSGSLSSNSRKRILQQGGERLSYININEITS-- 680
Query: 678 CETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVF 737
C++ +F + I V+N G M G H V+LF + G P KQLVGF V + E+
Sbjct: 681 CDSLRFYMQILVENVGNMDGGHVVMLFSRVPTVFRGAPEKQLVGFDRVHTISHRSTEMSI 740
Query: 738 ELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
+ PCE LS A E G ++ G H L++GD E+ ++I
Sbjct: 741 LVDPCEHLSVANEQGKKIMLLGGHGLMLGDLEHFVTI 777
>gi|255556320|ref|XP_002519194.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
gi|223541509|gb|EEF43058.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
Length = 782
Score = 783 bits (2022), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 390/767 (50%), Positives = 518/767 (67%), Gaps = 30/767 (3%)
Query: 17 FTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSA 76
+SL + ++ PF+CDP N T FC+ LPI R RDL+SRLTL EKI LVN+A
Sbjct: 28 LSSLPIHLVESRAPFACDPRNGVTRNLKFCRANLPIHVRVRDLISRLTLQEKIRLLVNNA 87
Query: 77 PAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQ 136
A+PRLGI YEWWSEALHGV+ VG G+ F G GATSFPQVI TAASF+ LW +IG+
Sbjct: 88 AAVPRLGIQGYEWWSEALHGVSNVGPGVKFGGAFPGATSFPQVITTAASFNQSLWEQIGR 147
Query: 137 AIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG 196
+ EARA+YN G A G+T+W+PN+N+FRDPRWGRGQETPGEDP++ GKYA SYVRG+Q
Sbjct: 148 VVSDEARAMYNGGLA-GLTYWSPNVNVFRDPRWGRGQETPGEDPVLAGKYAASYVRGLQS 206
Query: 197 DTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQ 256
T KL+ +ACCKH+TAYDLDNW G RY F+ARV+ QDL DTY PF++CV +
Sbjct: 207 ST------GLKLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYDVPFKACVVE 260
Query: 257 GRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE 316
G+ + +MC+YN+VNG P+CAD LL T R QWG +GYI SDCD+V ++YD + Y +PE
Sbjct: 261 GKVASVMCSYNQVNGKPTCADPILLKNTIRGQWGLNGYIVSDCDSVGVLYDNQHYTSTPE 320
Query: 317 DAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTM 376
+A +KAG+D++CG FL HT+ AVK+ L E +++ AL N +V+MRLG+F+G P+
Sbjct: 321 EAAAATIKAGLDLDCGPFLAIHTENAVKKGLLVEEDVNLALANTITVQMRLGMFDGEPSA 380
Query: 377 QPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKT 436
P+G +G VC+PAHQ LAL+AA+ GIVLL+N LPL S+ ++A+IGPN++ T
Sbjct: 381 HPYGNLGPRDVCTPAHQELALEAARQGIVLLENRGQALPLSSSRHHTIAVIGPNSDVTVT 440
Query: 437 LLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVAC-SSASIDKAVDIAKGADHVVLMM 495
++GNYAG +C+ +PLQ + Y + T++ GC VAC S+ A A+ AD VL+M
Sbjct: 441 MIGNYAGIACKYTSPLQGISRYAK-TLHQNGCGDVACHSNQQFGAAEAAARQADATVLVM 499
Query: 496 GLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGS 555
GLDQ+ E E DRV L+LPG QQEL++RVA A++ P ILVL+ GGP+D++FAK D +G+
Sbjct: 500 GLDQSIEAEFRDRVGLLLPGHQQELVSRVARASRGPTILVLMSGGPIDVSFAKNDPRVGA 559
Query: 556 ILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGR 614
ILWAGYPG+AG A+A+V+FG NPGG+LPMTWYPQ Y+ KVPMT+M MRP +G PGR
Sbjct: 560 ILWAGYPGQAGGAAIADVLFGTTNPGGKLPMTWYPQGYLAKVPMTNMGMRPDPATGYPGR 619
Query: 615 TYRFYEGKEVFPFGCGLSYSKYSYKF----KAVS---QNKLYLNQSSSTKMVENQDVVHY 667
TYRFY+G VFPFG G+SY+ +S+ K VS N LN + S+K + V H
Sbjct: 620 TYRFYKGNVVFPFGHGMSYTSFSHSLTQAPKEVSLPITNLYALNTTISSKAIR---VSHI 676
Query: 668 KSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVIL 727
LG + I VKN G M G H +L+F P KQL+GF+ V L
Sbjct: 677 NCQTSLG----------IDINVKNTGTMDGTHTLLVFSSPPSGEKESSNKQLIGFEKVDL 726
Query: 728 NAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
A + ++ ++ C+ LS G+ I G H + +GD ++ IS+
Sbjct: 727 VAGSQIQVKIDIHVCKHLSAVDRFGIRRIPIGDHHIYIGDLKHSISL 773
>gi|255545293|ref|XP_002513707.1| Beta-glucosidase, putative [Ricinus communis]
gi|223547158|gb|EEF48654.1| Beta-glucosidase, putative [Ricinus communis]
Length = 777
Score = 782 bits (2020), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 396/755 (52%), Positives = 515/755 (68%), Gaps = 19/755 (2%)
Query: 27 TQPPFSCD-PSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIP 85
+ P F+CD SNPS +F FC +L IS R DLV+RLTL EKI LVNSA ++ RLGIP
Sbjct: 37 SSPVFACDVKSNPSLASFGFCNVSLGISDRVTDLVNRLTLQEKIGFLVNSAGSVSRLGIP 96
Query: 86 AYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARAL 145
YEWWSEALHGV+ VG G F+ + GATSFPQVILTAASF++ L+ IG+ + EARA+
Sbjct: 97 KYEWWSEALHGVSYVGPGTHFSNIVPGATSFPQVILTAASFNASLFEAIGKVVSTEARAM 156
Query: 146 YNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLK 205
YN G A G+TFW+PNINIFRDPRWGRGQETPGEDPL++ KY YVRG+Q T NG
Sbjct: 157 YNVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLSSKYGSCYVRGLQ-QTDNGDS-- 212
Query: 206 GKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCA 265
+L+ +ACCKH+TAYDLDNWKGT RY F+A VT QDL DT+QPPF+SCV G + +MC+
Sbjct: 213 ERLKVAACCKHYTAYDLDNWKGTDRYHFNAVVTKQDLDDTFQPPFKSCVIDGNVASVMCS 272
Query: 266 YNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKA 325
YN+VNG P+CAD +LL+ R +W +GYI SDCD+V +IY+++ Y K+PE+A + A
Sbjct: 273 YNQVNGKPTCADPDLLAGIIRGEWKLNGYIVSDCDSVDVIYNSQHYTKTPEEAAAITILA 332
Query: 326 GMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD 385
G+D+NCGSFL KHT+AAV L S +D+A+ N F+ MRLG F+G+P+ Q +GK+G
Sbjct: 333 GLDLNCGSFLGKHTEAAVNAGLLNVSAVDKAVSNNFATLMRLGFFDGDPSKQLYGKLGPK 392
Query: 386 VVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPS 445
VC+ +Q LA +AA+ GIVLLKNS G LPL + +LA+IGPNAN KT++GNY G
Sbjct: 393 DVCTAVNQELAREAARQGIVLLKNSPGSLPLSPTAIKTLAVIGPNANVTKTMIGNYEGTP 452
Query: 446 CRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE 505
C+ TPLQ L V T Y GC VAC++A +D A +A AD VL+MG DQ+ E E
Sbjct: 453 CKYTTPLQGLTASVA-TTYLAGCSNVACAAAQVDDAKKLAASADATVLVMGADQSIEAES 511
Query: 506 LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEA 565
DRVD++LPG+QQ LIT+VA +K PVILV++ GG +D++FAK + I SILW GYPGEA
Sbjct: 512 RDRVDVLLPGQQQLLITQVANVSKGPVILVIMSGGGMDVSFAKTNDKITSILWVGYPGEA 571
Query: 566 GAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEV 624
G A+A+VIFG +NP GRLPMTWYPQ Y+ KVPMT+M MRP +SG PGRTYRFY G+ V
Sbjct: 572 GGAAIADVIFGYYNPSGRLPMTWYPQAYVDKVPMTNMNMRPDPSSGYPGRTYRFYTGETV 631
Query: 625 FPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKS---VPELGTEFCETR 681
+ FG GLSYS+Y ++ Q + +E+ V S + G + C+
Sbjct: 632 YSFGDGLSYSEYKHQLVQAPQ--------LVSIPLEDDHVCRSSSKCISVDAGEQNCQGL 683
Query: 682 KFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSP 741
F + + V+N G++ G H V LF P N P K LV F+ V L+AK + F++
Sbjct: 684 AFNIDLKVRNIGKVRGTHTVFLFFTPPSVHN-SPQKHLVDFEKVSLDAKTYGMVSFKVDV 742
Query: 742 CESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
C+ LS E G + G H L VG+ E+ +++ +
Sbjct: 743 CKHLSVVDEFGSRKVALGGHVLHVGNLEHSLTVRI 777
>gi|356556038|ref|XP_003546334.1| PREDICTED: beta-D-xylosidase 1-like [Glycine max]
Length = 775
Score = 781 bits (2016), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 398/770 (51%), Positives = 522/770 (67%), Gaps = 30/770 (3%)
Query: 16 CFT-SLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVN 74
C T S RV + PF+CDP N T F FC T +PI R +DL++RLTL EKI +VN
Sbjct: 21 CVTLSFFPRVTEGRVPFACDPRNGLTRGFKFCNTHVPIHVRVQDLIARLTLPEKIRLVVN 80
Query: 75 SAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRI 134
+A A+PRLGI YEWWSEALHGV+ VG G F G GAT FPQVI TAASF+ LW I
Sbjct: 81 NAIAVPRLGIQGYEWWSEALHGVSNVGPGTKFGGAFPGATMFPQVISTAASFNQSLWQEI 140
Query: 135 GQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGV 194
G+ + EARA+YN GQA G+T+W+PN+NIFRDPRWGRGQETPGEDP + KYA SYV+G+
Sbjct: 141 GRVVSDEARAMYNGGQA-GLTYWSPNVNIFRDPRWGRGQETPGEDPTLAAKYAASYVKGL 199
Query: 195 QGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCV 254
QGD+ L+ +ACCKH+TAYDLDNW G R+ F+A+V+ QDL DTY PF++CV
Sbjct: 200 QGDSAGN-----HLKVAACCKHYTAYDLDNWNGVDRFHFNAKVSKQDLEDTYDVPFKACV 254
Query: 255 KQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKS 314
+G+ + +MC+YN+VNG P+CAD +LL T R QW +GYI SDCD+V + +D + Y K+
Sbjct: 255 LEGQVASVMCSYNQVNGKPTCADPDLLRNTIRGQWRLNGYIVSDCDSVGVFFDNQHYTKT 314
Query: 315 PEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNP 374
PE+A + +KAG+D++CG FL HT +A+++ + E++++ AL NL SV+MRLG+F+G P
Sbjct: 315 PEEAAAEAIKAGLDLDCGPFLAIHTDSAIRKGLISENDLNLALANLISVQMRLGMFDGEP 374
Query: 375 TMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSA 434
+ QP+G +G VC+ AHQ LAL+AA++ IVLL+N LPL S+ ++ ++GPNA++
Sbjct: 375 STQPYGNLGPRDVCTSAHQQLALEAARESIVLLQNKGNSLPLSPSRLRTIGVVGPNADAT 434
Query: 435 KTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVD-IAKGADHVVL 493
T++GNYAG +C TPLQ + YV+ T + GC VAC + A + IA+ AD +VL
Sbjct: 435 VTMIGNYAGVACGYTTPLQGIARYVK-TAHQVGCRGVACRGNELFGAAETIARQADAIVL 493
Query: 494 MMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNI 553
+MGLDQT E E DRV L+LPG QQEL+TRVA AAK PVIL+++ GGPVDI+FAK D I
Sbjct: 494 VMGLDQTVEAETRDRVGLLLPGLQQELVTRVARAAKGPVILLIMSGGPVDISFAKNDPKI 553
Query: 554 GSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNP 612
+ILW GYPG+AG A+A+VIFG NPGGRLPMTWYPQ Y+ KVPMT+M MRP T+G P
Sbjct: 554 SAILWVGYPGQAGGTAIADVIFGTTNPGGRLPMTWYPQGYLAKVPMTNMDMRPNPTTGYP 613
Query: 613 GRTYRFYEGKEVFPFGCGLSYSKYSYKF----KAVSQNKLYL----NQSSSTKMVENQDV 664
GRTYRFY+G VFPFG GLSYS++S+ K VS + L N + S+K V+ V
Sbjct: 614 GRTYRFYKGPVVFPFGHGLSYSRFSHSLALAPKQVSVPIMSLQALTNSTLSSKAVK---V 670
Query: 665 VHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQS 724
H L EF + VKN G M G H +L+F +P G IKQLVGF
Sbjct: 671 SHANCDDSLEMEF--------HVDVKNEGSMDGTHTLLIFSQPP-HGKWSQIKQLVGFHK 721
Query: 725 VILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
+ A K + + C+ LS + G+ I G H L +GD ++ IS+
Sbjct: 722 THVLAGSKQRVKVGVHVCKHLSVVDQFGVRRIPTGEHELHIGDVKHSISV 771
>gi|225437531|ref|XP_002270249.1| PREDICTED: probable beta-D-xylosidase 2 [Vitis vinifera]
gi|297743965|emb|CBI36935.3| unnamed protein product [Vitis vinifera]
Length = 768
Score = 779 bits (2011), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 380/765 (49%), Positives = 516/765 (67%), Gaps = 23/765 (3%)
Query: 15 LCFTSLLTRVD-STQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLV 73
L F +L V + PF+CDP + + FPFC+ ++ I +R +DL+ RLTL+EK+ LV
Sbjct: 12 LIFLVVLAVVSGEARDPFACDPKDGANAGFPFCRKSIGIGERVKDLIGRLTLEEKVRLLV 71
Query: 74 NSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYR 133
N+A +PRLGI YEWWSEALHGV+ VG G F+G GATSFPQVI TAASF+S LW
Sbjct: 72 NNAAGVPRLGIKGYEWWSEALHGVSNVGPGTKFSGDFPGATSFPQVITTAASFNSSLWEA 131
Query: 134 IGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRG 193
IGQ + EARA+YN G A G+TFW+PN+NIFRDPRWGRGQETPGEDP++ GKYA YVRG
Sbjct: 132 IGQVVSDEARAMYNGGAA-GLTFWSPNVNIFRDPRWGRGQETPGEDPVLAGKYAARYVRG 190
Query: 194 VQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESC 253
+QG+ + +L+ +ACCKHFTAYDLDNW G R+ FDARV+ Q++ DT+ PF SC
Sbjct: 191 LQGNAGD------RLKVAACCKHFTAYDLDNWNGVDRFHFDARVSKQEMEDTFDVPFRSC 244
Query: 254 VKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAK 313
V +G+ + +MC+YN+VNG+P+CAD NLL T R+QW +GY+ SDCD+V + YD + Y
Sbjct: 245 VVEGKVASVMCSYNQVNGVPTCADPNLLRNTVRKQWHLNGYVVSDCDSVGVFYDNQHYTN 304
Query: 314 SPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGN 373
+PE+A D +KAG+D++CG FL HT+ A+K+ + E+++D AL N +V+MRLG+F+G
Sbjct: 305 TPEEAAADAIKAGLDLDCGPFLAVHTQDAIKKGLVSEADVDSALVNTVTVQMRLGMFDGE 364
Query: 374 PTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANS 433
P+ QPFG +G VCSPAHQ LA++AA+ GIVLLKN LPL S+A+IGPN+++
Sbjct: 365 PSAQPFGDLGPKDVCSPAHQELAIEAARQGIVLLKNHGHSLPLSTRSHRSIAVIGPNSDA 424
Query: 434 AKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASI-DKAVDIAKGADHVV 492
T++GNYAG C TPLQ + Y T++ GC VACS + A+D A AD V
Sbjct: 425 NVTMIGNYAGIPCEYTTPLQGIGRY-SRTIHQKGCADVACSEDQLFAGAIDAASQADATV 483
Query: 493 LMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRN 552
L+MGLDQ+ E E DR DL+LPGRQQEL+++VA A++ P +LVL+ GGPVD++FAK D
Sbjct: 484 LVMGLDQSIEAEAKDRADLLLPGRQQELVSKVAMASRGPTVLVLMSGGPVDVSFAKKDPR 543
Query: 553 IGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGN 611
I +I+WAGYPG+AG A+A+++FG NPGG+LPMTWYPQ+Y+ KVPMT M MR +
Sbjct: 544 IAAIVWAGYPGQAGGAAIADILFGVANPGGKLPMTWYPQEYLSKVPMTTMAMRAIPSKAY 603
Query: 612 PGRTYRFYEGKEVFPFGCGLSYSKYSYKF-KAVSQNKLYLNQSSSTKMVENQ-DVVHYKS 669
PGRTYRFY+G V+ FG GLSY+ + + +A + + L+ +T + V H K
Sbjct: 604 PGRTYRFYKGPVVYRFGHGLSYTNFVHTIAQAPTAVAIPLHGHHNTTVSGKAIRVTHAK- 662
Query: 670 VPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNA 729
C + + VKN G G H +L+F KP G+ P KQLV F+ V + A
Sbjct: 663 --------CNRLSIALHLDVKNVGNKDGSHTLLVFSKPP-AGHWAPHKQLVAFEKVHVAA 713
Query: 730 KEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
+ + + + C+ LS G+ I G H L +GD ++ +S+
Sbjct: 714 RTQQRVQINIHVCKYLSVVDRSGIRRIPMGQHGLHIGDTKHLVSL 758
>gi|183579871|dbj|BAG28345.1| arabinofuranosidase [Citrus unshiu]
Length = 769
Score = 778 bits (2008), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 378/743 (50%), Positives = 515/743 (69%), Gaps = 18/743 (2%)
Query: 16 CFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNS 75
CF S R ++ PF+CDP N T + FC+T++PI R +DL+ RLTL EKI LVN+
Sbjct: 16 CFGS---RFVESRVPFACDPRNGLTRSLRFCRTSVPIHVRVQDLIGRLTLQEKIRLLVNN 72
Query: 76 APAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIG 135
A A+PRLGI YEWWSEALHGV+ VG G F G GATSFPQVI TAA+F+ LW IG
Sbjct: 73 AAAVPRLGIQGYEWWSEALHGVSNVGPGTKFGGAFPGATSFPQVITTAAAFNESLWEEIG 132
Query: 136 QAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQ 195
+ + EARA+YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP++ GKYA SYVR +Q
Sbjct: 133 RVVSDEARAMYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLAGKYAASYVRRLQ 191
Query: 196 GDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVK 255
G+T +L+ +ACCKH+TAYDLDNW G RY F+ARV+ QDL DTY PF++CV
Sbjct: 192 GNT------GSRLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVV 245
Query: 256 QGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP 315
+G+ + +MC+YN+VNG P+CAD ++L T R QW GYI SDCD+V ++Y+ + Y ++P
Sbjct: 246 EGKVASVMCSYNQVNGKPTCADPDILKNTIRGQWRLDGYIVSDCDSVGVLYNTQHYTRTP 305
Query: 316 EDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPT 375
E+A D +KAG+D++CG FL HT+ AV+ L E +++ A +V+MRLG+F+G P+
Sbjct: 306 EEAAADAIKAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLASAYTITVQMRLGMFDGEPS 365
Query: 376 MQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAK 435
QPFG +G VC+PAHQ LALQAA GIVLLKNS LPL + ++A+IGPN++
Sbjct: 366 AQPFGNLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTV 425
Query: 436 TLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDI-AKGADHVVLM 494
T++GNYAG +C TPLQ + Y + T++ GC VAC+ + A ++ A+ AD VL+
Sbjct: 426 TMIGNYAGVACGYTTPLQGISRYAK-TIHQAGCLGVACNGNQLIGAAEVAARQADATVLV 484
Query: 495 MGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIG 554
MGLDQ+ E E +DR L+LPGRQQEL++RVA+A++ PV+LVL+CGGPVD++FAK D IG
Sbjct: 485 MGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIG 544
Query: 555 SILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPG 613
+ILW GYPG+AG A+A+V+FG NPGG+LPMTWYPQDY+ ++PMTDM+MR A G PG
Sbjct: 545 AILWVGYPGQAGGAAIADVLFGRANPGGKLPMTWYPQDYVARLPMTDMRMR--AGRGYPG 602
Query: 614 RTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPEL 673
RTYRFY+G VFPFG G+SY+ +++ + N+ + ++S +N + ++
Sbjct: 603 RTYRFYKGPVVFPFGHGMSYTTFAHTLSK-APNQFSVPIATSLYAFKNT-TISSNAIRVA 660
Query: 674 GTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKA 733
T + + + VKN G+MAG H +L+F KP GN P KQL+GF+ V + A
Sbjct: 661 HTNCNDAMSLGLHVDVKNTGDMAGTHTLLVFAKPP-AGNWSPNKQLIGFKKVHVTAGALQ 719
Query: 734 EIVFELSPCESLSRAREDGLMVI 756
+ ++ C+ LS + G+ I
Sbjct: 720 SVRLDIHVCKHLSVVDKFGIRRI 742
>gi|86553064|gb|AAS17751.2| beta xylosidase [Fragaria x ananassa]
Length = 772
Score = 777 bits (2007), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 387/777 (49%), Positives = 528/777 (67%), Gaps = 26/777 (3%)
Query: 4 HKLSLVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRL 63
+KLSL+ L LC ++LL + +PPF+CDP NP T F FC+T +P+ R +DL+ RL
Sbjct: 7 NKLSLI--ALVLCVSALLFNLVHARPPFACDPRNPLTRGFKFCRTRVPVHVRVQDLIGRL 64
Query: 64 TLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTA 123
TL EKI LVN+A A+PRLGI YEWWSEALHGV+ VG G F G GATSFPQVI TA
Sbjct: 65 TLQEKIRLLVNNAIAVPRLGIQGYEWWSEALHGVSNVGPGTKFGGAFPGATSFPQVITTA 124
Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVT 183
ASF+ LW IGQ + EARA+YN GQA G+T+W+PN+NIFRDPRWGRGQETPGEDP+++
Sbjct: 125 ASFNQSLWQEIGQVVSDEARAMYNGGQA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLS 183
Query: 184 GKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLA 243
KYA SYV+G+QGD +L+ +ACCKH+TAYDLDNW G R+ F+ARV+ QDLA
Sbjct: 184 AKYAASYVKGLQGD-----GAGNRLKVAACCKHYTAYDLDNWNGVDRFHFNARVSKQDLA 238
Query: 244 DTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVS 303
DTY PF CV +G+ + +MC+YN+VNG P+CAD +LL T R +W +GYI SDCD+V
Sbjct: 239 DTYDVPFRGCVLEGKVASVMCSYNQVNGKPTCADPDLLKNTIRGEWKLNGYIVSDCDSVG 298
Query: 304 IIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSV 363
+ YD + Y ++PE+A + +KAG+D++CG FL HT+ A+K LPE ++D AL N +V
Sbjct: 299 VFYDQQHYTRTPEEAAAEAIKAGLDLDCGPFLAIHTEGAIKAGLLPEIDVDYALANTLTV 358
Query: 364 RMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVS 423
+MRLG+F+G P+ Q +G +G VC+PAHQ LAL+A++ GIVLL+N+ LPL + +
Sbjct: 359 QMRLGMFDGEPSAQQYGNLGPRDVCTPAHQELALEASRQGIVLLQNNGHTLPLSTVRHRT 418
Query: 424 LALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASI-DKAV 482
+A++GPN++ +T++GNYAG +C TPLQ + Y + T++ GC VAC++ + A
Sbjct: 419 VAVVGPNSDVTETMIGNYAGVACGYTTPLQGIGRYTK-TIHQQGCTNVACTTNQLFGAAE 477
Query: 483 DIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPV 542
A+ AD VL+MGLDQ+ E E DR DLV+PG QQEL++RVA A++ P +LVL+ GGP+
Sbjct: 478 AAARQADATVLVMGLDQSIEAEFRDRTDLVMPGHQQELVSRVARASRGPTVLVLMSGGPI 537
Query: 543 DITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDM 601
D++FAK D IG+I+W GYPG+AG A+A+V+FG NP G+LPMTWYPQDY+ KVPMT+M
Sbjct: 538 DVSFAKNDPKIGAIIWVGYPGQAGGTAMADVLFGTTNPSGKLPMTWYPQDYVSKVPMTNM 597
Query: 602 KMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKF----KAVSQNKLYLNQSSSTK 657
MR A G PGRTYRFY+G VFPFG GLSY+ +++ +VS L+ ++++
Sbjct: 598 AMR--AGRGYPGRTYRFYKGPVVFPFGLGLSYTTFAHSLAQVPTSVSVPLTSLSATTNST 655
Query: 658 MVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIK 717
M+ + V + + C + + VKN G G H +L+F P G K
Sbjct: 656 MLSSAVRVSHTN--------CNPLSLALHVVVKNTGARDGTHTLLVFSSPP-SGKWAANK 706
Query: 718 QLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
QLVGF V + A + ++ C+ LS + G+ I G H L +GD E+ IS+
Sbjct: 707 QLVGFHKVHIVAGSHKRVKVDVHVCKHLSVVDQFGIRRIPIGEHKLQIGDLEHHISV 763
>gi|18378991|ref|NP_563659.1| beta-glucosidase [Arabidopsis thaliana]
gi|75250279|sp|Q94KD8.1|BXL2_ARATH RecName: Full=Probable beta-D-xylosidase 2; Short=AtBXL2; Flags:
Precursor
gi|14194121|gb|AAK56255.1|AF367266_1 At1g02640/T14P4_11 [Arabidopsis thaliana]
gi|23506063|gb|AAN28891.1| At1g02640/T14P4_11 [Arabidopsis thaliana]
gi|332189332|gb|AEE27453.1| beta-glucosidase [Arabidopsis thaliana]
Length = 768
Score = 777 bits (2007), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/780 (49%), Positives = 519/780 (66%), Gaps = 28/780 (3%)
Query: 1 MIFHKLSLVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLV 60
MI HK++ + +L +S V S + F+CD + +T T FC+ ++PI +R RDL+
Sbjct: 1 MILHKMAFLAVILFFLISSSSVCVHSRET-FACDTKDAATATLRFCQLSVPIPERVRDLI 59
Query: 61 SRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVI 120
RLTL EK+S L N+A AIPRLGI YEWWSEALHGV+ VG G F G ATSFPQVI
Sbjct: 60 GRLTLAEKVSLLGNTAAAIPRLGIKGYEWWSEALHGVSNVGPGTKFGGVYPAATSFPQVI 119
Query: 121 LTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDP 180
T ASF++ LW IG+ + EARA+YN G G+T+W+PN+NI RDPRWGRGQETPGEDP
Sbjct: 120 TTVASFNASLWESIGRVVSNEARAMYNGGVG-GLTYWSPNVNILRDPRWGRGQETPGEDP 178
Query: 181 LVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQ 240
+V GKYA SYVRG+QG+ + +L+ +ACCKHFTAYDLDNW G R+ F+A+V+ Q
Sbjct: 179 VVAGKYAASYVRGLQGND------RSRLKVAACCKHFTAYDLDNWNGVDRFHFNAKVSKQ 232
Query: 241 DLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCD 300
D+ DT+ PF CVK+G + IMC+YN+VNG+P+CAD NLL KT R QWG +GYI SDCD
Sbjct: 233 DIEDTFDVPFRMCVKEGNVASIMCSYNQVNGVPTCADPNLLKKTIRNQWGLNGYIVSDCD 292
Query: 301 AVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNL 360
+V ++YD + Y +PE+A D +KAG+D++CG FL HT AVK+ L ES++D AL N
Sbjct: 293 SVGVLYDTQHYTGTPEEAAADSIKAGLDLDCGPFLGAHTIDAVKKNLLRESDVDNALINT 352
Query: 361 FSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSK 420
+V+MRLG+F+G+ QP+G +G VC+P H+ LAL+AAQ GIVLLKN LPL +
Sbjct: 353 LTVQMRLGMFDGDIAAQPYGHLGPAHVCTPVHKGLALEAAQQGIVLLKNHGSSLPLSSQR 412
Query: 421 SVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASI-D 479
++A+IGPN+++ T++GNYAG +C +P+Q + Y T++ GC V C + D
Sbjct: 413 HRTVAVIGPNSDATVTMIGNYAGVACGYTSPVQGITGYAR-TIHQKGCVDVHCMDDRLFD 471
Query: 480 KAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCG 539
AV+ A+GAD VL+MGLDQ+ E E DR L+LPG+QQEL++RVA+AAK PVILVL+ G
Sbjct: 472 AAVEAARGADATVLVMGLDQSIEAEFKDRNSLLLPGKQQELVSRVAKAAKGPVILVLMSG 531
Query: 540 GPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPM 598
GP+DI+FA+ DR I +I+WAGYPG+ G A+A+++FG NPGG+LPMTWYPQDY+ +PM
Sbjct: 532 GPIDISFAEKDRKIPAIVWAGYPGQEGGTAIADILFGSANPGGKLPMTWYPQDYLTNLPM 591
Query: 599 TDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKF----KAVSQNKLYLNQSS 654
T+M MRP + PGRTYRFY+G V+PFG GLSY+++++ K + N +
Sbjct: 592 TEMSMRPVHSKRIPGRTYRFYDGPVVYPFGHGLSYTRFTHNIADAPKVIPIAVRGRNGTV 651
Query: 655 STKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGR 714
S K + V H + C+ V + V N G G H +L+F P G
Sbjct: 652 SGKSIR---VTHAR---------CDRLSLGVHVEVTNVGSRDGTHTMLVFSAPP-GGEWA 698
Query: 715 PIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
P KQLV F+ V + EK + + C+ LS G I G H + +GDE + +S+
Sbjct: 699 PKKQLVAFERVHVAVGEKKRVQVNIHVCKYLSVVDRAGNRRIPIGDHGIHIGDESHTVSL 758
>gi|255548487|ref|XP_002515300.1| Beta-glucosidase, putative [Ricinus communis]
gi|223545780|gb|EEF47284.1| Beta-glucosidase, putative [Ricinus communis]
Length = 768
Score = 777 bits (2006), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/753 (50%), Positives = 512/753 (67%), Gaps = 29/753 (3%)
Query: 30 PFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEW 89
PF+CD + +T+ PFC+ LPI R +DL+ RLTL EK+ LVN+A A+ RLGI YEW
Sbjct: 27 PFACDSKDGTTKNLPFCQVKLPIQDRVKDLIGRLTLAEKVGLLVNNAGAVSRLGIKGYEW 86
Query: 90 WSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG 149
WSEALHGV+ VG G F G+ GATSFPQVI TAASF+S LW IG+ + EARA+YN G
Sbjct: 87 WSEALHGVSNVGPGTKFGGSFPGATSFPQVITTAASFNSTLWEAIGRVVSDEARAMYNGG 146
Query: 150 QAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQ 209
A G+T+W+PN+NI RDPRWGRGQETPGEDPL+ GKYA SYV+G+QG N G+ +L+
Sbjct: 147 AA-GLTYWSPNVNILRDPRWGRGQETPGEDPLLVGKYAASYVKGLQG---NDGE---RLK 199
Query: 210 ASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRV 269
+ACCKHFTAYDLDNW G R+ F+A+V+ QD+ DT+ PF CVK+G+ + +MC+YN+V
Sbjct: 200 VAACCKHFTAYDLDNWNGVDRFHFNAKVSKQDMKDTFDVPFRMCVKEGKVASVMCSYNQV 259
Query: 270 NGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDV 329
NGIP+CAD NLL KT R QWG +GYI SDCD+V + YD + Y +PE+A D +KAG+D+
Sbjct: 260 NGIPTCADPNLLRKTVRTQWGLNGYIVSDCDSVGVFYDKQHYTSTPEEAAADAIKAGLDL 319
Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
+CG FL HT+ AVK+ + E++++ AL N +V+MRLG+F+G P+ QP+G +G VC+
Sbjct: 320 DCGPFLAVHTQDAVKRGLISEADVNGALFNTLTVQMRLGMFDGEPSAQPYGNLGPKDVCT 379
Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
PAHQ LAL+A + GIVLLKN LPL + ++A+IGPN+N T++GNYAG +C+
Sbjct: 380 PAHQELALEAGRQGIVLLKNHGPSLPLSPRRHRTVAIIGPNSNVTVTMIGNYAGVACQYT 439
Query: 450 TPLQALQNYVENTVYYPGCDTVACSSASI-DKAVDIAKGADHVVLMMGLDQTQEKEELDR 508
TPLQ + +Y + T++ GC V C + + A+D A+ AD VL+MGLDQ+ E E DR
Sbjct: 440 TPLQGIGSYAK-TIHQQGCADVGCVTDQLFSGAIDAARQADATVLVMGLDQSIEAEFRDR 498
Query: 509 VDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAV 568
L+LPGRQQEL+++VA A+K P ILVL+ GGP+D++FAK D I +ILWAGYPG+AG
Sbjct: 499 TGLLLPGRQQELVSKVAMASKGPTILVLMSGGPIDVSFAKKDPKIAAILWAGYPGQAGGA 558
Query: 569 ALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPF 627
A+A+V+FG NPGG+LPMTWYPQ+YI +PMT+M MR + G PGRTYRFY+GK V+PF
Sbjct: 559 AIADVLFGTINPGGKLPMTWYPQEYITNLPMTEMAMRSSQSKGYPGRTYRFYQGKVVYPF 618
Query: 628 GCGLSYSKYSYKFK------AVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETR 681
G G+SY+ + + +V + N S S K + V H K C
Sbjct: 619 GHGMSYTHFVHNIASAPTMVSVPLDGHRGNTSISGKAIR---VTHTK---------CNKL 666
Query: 682 KFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSP 741
+ + VKN G G H +L++ P G P KQLV F+ V ++A + + +
Sbjct: 667 SLGIQVDVKNVGSKDGTHTLLVYSAPP-AGRWSPHKQLVAFERVHVSAGTQERVGISIHV 725
Query: 742 CESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
C+ LS G+ I G H + +G+ ++ +S+
Sbjct: 726 CKLLSVVDRSGIRRIPIGEHSIHIGNVKHSVSL 758
>gi|356529243|ref|XP_003533205.1| PREDICTED: beta-D-xylosidase 1-like [Glycine max]
Length = 774
Score = 776 bits (2005), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 396/771 (51%), Positives = 518/771 (67%), Gaps = 30/771 (3%)
Query: 14 CLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLV 73
C+ T RV + PF+CDP N T F FC T +PI R +DL++RLTL EKI +V
Sbjct: 20 CVTLT-FFPRVTEGRVPFACDPRNGLTRGFKFCNTHVPIHVRVQDLIARLTLPEKIRLVV 78
Query: 74 NSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYR 133
N+A A+PRLGI YEWWSEALHGV+ VG G F G GAT FPQVI TAASF+ LW
Sbjct: 79 NNAIAVPRLGIQGYEWWSEALHGVSNVGPGTKFGGAFPGATMFPQVISTAASFNQSLWQE 138
Query: 134 IGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRG 193
IG+ + EARA+YN GQA G+T+W+PN+NIFRDPRWGRGQETPGEDP + KYA SYV+G
Sbjct: 139 IGRVVSDEARAMYNGGQA-GLTYWSPNVNIFRDPRWGRGQETPGEDPTLAAKYAASYVKG 197
Query: 194 VQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESC 253
+QGD +L+ +ACCKH+TAYDLDNW G R+ F+A+V+ QDL DTY PF++C
Sbjct: 198 LQGD-----GAGNRLKVAACCKHYTAYDLDNWNGVDRFHFNAKVSKQDLEDTYDVPFKAC 252
Query: 254 VKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAK 313
V +G+ + +MC+YN+VNG P+CAD +LL T R QWG +GYI SDCD+V + +D + Y +
Sbjct: 253 VLEGQVASVMCSYNQVNGKPTCADPDLLRNTIRGQWGLNGYIVSDCDSVGVFFDNQHYTR 312
Query: 314 SPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGN 373
+PE+A + +KAG+D++CG FL HT +A+++ + E++++ AL NL +V+MRLG+F+G
Sbjct: 313 TPEEAAAEAIKAGLDLDCGPFLAIHTDSAIRKGLISENDLNLALANLITVQMRLGMFDGE 372
Query: 374 PTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANS 433
P+ QPFG +G VC+PAHQ LAL+AA++ IVLL+N LPL S+ + +IGPN ++
Sbjct: 373 PSTQPFGNLGPRDVCTPAHQQLALEAARESIVLLQNKGNSLPLSPSRLRIVGVIGPNTDA 432
Query: 434 AKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVD-IAKGADHVV 492
T++GNYAG +C TPLQ + YV+ T + GC VAC + A + IA+ D V
Sbjct: 433 TVTMIGNYAGVACGYTTPLQGIARYVK-TAHQVGCRGVACRGNELFGAAEIIARQVDATV 491
Query: 493 LMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRN 552
L+MGLDQT E E DRV L+LPG QQEL+TRVA AAK PVILV++ GGPVD++FAK +
Sbjct: 492 LVMGLDQTIEAETRDRVGLLLPGLQQELVTRVARAAKGPVILVIMSGGPVDVSFAKNNPK 551
Query: 553 IGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGN 611
I +ILW GYPG+AG A+A+VIFG NPGGRLPMTWYPQ Y+ KVPMT+M MRP +G
Sbjct: 552 ISAILWVGYPGQAGGTAIADVIFGATNPGGRLPMTWYPQGYLAKVPMTNMDMRPNPATGY 611
Query: 612 PGRTYRFYEGKEVFPFGCGLSYSKYSYKF----KAVSQNKLYL----NQSSSTKMVENQD 663
PGRTYRFY+G VFPFG GLSYS++S K VS L L N + S+K V+
Sbjct: 612 PGRTYRFYKGPVVFPFGHGLSYSRFSQSLALAPKQVSVQILSLQALTNSTLSSKAVK--- 668
Query: 664 VVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQ 723
V H L TEF + VKN G M G H +L+F KP G IKQLV F
Sbjct: 669 VSHANCDDSLETEF--------HVDVKNEGSMDGTHTLLIFSKPP-PGKWSQIKQLVTFH 719
Query: 724 SVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
+ A K + + C+ LS + G+ I G H L +GD ++ I++
Sbjct: 720 KTHVPAGSKQRLKVNVHSCKHLSVVDQFGVRRIPTGEHELHIGDLKHSINV 770
>gi|225428983|ref|XP_002264114.1| PREDICTED: probable beta-D-xylosidase 5-like [Vitis vinifera]
Length = 818
Score = 776 bits (2003), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 394/781 (50%), Positives = 514/781 (65%), Gaps = 30/781 (3%)
Query: 1 MIFHKLSLVFPLLCLCFTSLLTRVDSTQPP--FSCDPSNPSTETFPFCKTTLPISQRARD 58
+I + L PLL + +QP ++CD ++P++ FPFC T+LP RA D
Sbjct: 30 LILLPIFLSLPLLAI-----------SQPTHRYACDRTDPNSSQFPFCNTSLPYQDRASD 78
Query: 59 LVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQ 118
LVSRLTL EK QL+NSA I RLG+P YEWWSEALHGV+ G G+ F+ I T FP
Sbjct: 79 LVSRLTLQEKAKQLINSATGISRLGVPDYEWWSEALHGVSNSGIGVHFHDPIPAVTIFPA 138
Query: 119 VILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGE 178
VIL+AASF+ LWY +GQ + E RA+YN GQA G+T+W+PN+NIFRDPRWGRGQETPGE
Sbjct: 139 VILSAASFNESLWYTMGQVVSTEGRAMYNVGQA-GLTYWSPNVNIFRDPRWGRGQETPGE 197
Query: 179 DPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVT 238
DPLV +YAV+YVRG+Q G +L+ S+CCKH+TAYD+D WKG R+ FDA+VT
Sbjct: 198 DPLVVSRYAVNYVRGLQEVGKEGNFAADRLKVSSCCKHYTAYDVDKWKGVDRFHFDAKVT 257
Query: 239 MQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSD 298
+QDL DTYQPPF+SCV++G S +MC+YNRVNG+P+CA+ LL R QWG GYI SD
Sbjct: 258 LQDLEDTYQPPFKSCVEEGHVSSVMCSYNRVNGVPTCANPELLKGVIRDQWGLDGYIVSD 317
Query: 299 CDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALH 358
CD++ + ++ Y ++PEDAV LKAG+++NCGS+L +TK AV K+ ES +++AL
Sbjct: 318 CDSIMVYHERMNYTETPEDAVALALKAGLNLNCGSYLGDYTKNAVNLGKVKESIVNQALI 377
Query: 359 NLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPK 418
+ V MRLG F+G+PTM PFGK+G VC+ HQ+LAL AA+ GIVLL N +G LPL
Sbjct: 378 YNYIVLMRLGFFDGDPTMLPFGKMGPSDVCTVDHQLLALDAAKQGIVLLHN-NGALPLSP 436
Query: 419 SKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSAS- 477
+ + +LA+IGPNA++ T+L NYAG CR +PLQ LQ YV Y GC V+CS +
Sbjct: 437 NTTKTLAVIGPNADATNTMLSNYAGVPCRYTSPLQGLQKYVSAVSYEKGCANVSCSEETL 496
Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLL 537
I+ A IA AD V+++GLD E E+LDRV+L LPG Q++L+ A+AA VILV++
Sbjct: 497 IEGAASIASMADATVVVVGLDLFIEAEDLDRVNLTLPGFQEKLVMEAAKAANGTVILVVM 556
Query: 538 CGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KV 596
GPVDI+F K IG ILW GYPG+AG A+++VIFGD+NPGGR P TWYPQ+Y+ +V
Sbjct: 557 SAGPVDISFVKNVSKIGGILWVGYPGQAGGDAISQVIFGDYNPGGRSPFTWYPQEYVDQV 616
Query: 597 PMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSST 656
PMTDM MRP ATS PGRTYRFY GK ++ FG GLSYS + YKF + + ++
Sbjct: 617 PMTDMNMRPNATSNFPGRTYRFYTGKSLYQFGHGLSYSTF-YKFIKSAPTTVLVHLLPQM 675
Query: 657 KMVENQDVVHYKSVPELGTEF---------CETRKFL-VTIGVKNHGEMAGKHPVLLFVK 706
M N +Y ++P T C + + IGVKN GE+ G H VL F K
Sbjct: 676 DM-PNIFSSNYPTMPNPNTNGQAIDISAIDCRNLSNIDIVIGVKNAGEIDGTHVVLAFWK 734
Query: 707 PARRG-NGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVV 765
P R G G P +LVGF+ V + + + L C +S E+G + G H LVV
Sbjct: 735 PPRSGVRGAPGVELVGFERVEVKRGKTEMVGMRLDVCGKISNVDEEGKRKLVMGMHTLVV 794
Query: 766 G 766
G
Sbjct: 795 G 795
>gi|359485890|ref|XP_002264183.2| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
[Vitis vinifera]
Length = 774
Score = 773 bits (1996), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 397/776 (51%), Positives = 523/776 (67%), Gaps = 26/776 (3%)
Query: 9 VFPLLCLCFTSLLTR----VDSTQPPFSCD-PSNPSTETFPFCKTTLPISQRARDLVSRL 63
VF CF+ L+ + + P F+CD +NP+ F FC T+L + R DLV RL
Sbjct: 13 VFLCFLSCFSHFLSSPKWVLAQSSPVFACDVENNPTLGQFGFCNTSLETAARVADLVKRL 72
Query: 64 TLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTA 123
TL+EKI LVNSA ++ RLGIP YEWWSEALHGV+ VG G FN + GATSFPQVILTA
Sbjct: 73 TLEEKIGFLVNSAASVSRLGIPKYEWWSEALHGVSYVGPGTHFNSVVPGATSFPQVILTA 132
Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVT 183
ASF++ L+ IG+A+ EARA+YN G A G+TFW+PN+NIFRDPRWGRGQETPGEDPL++
Sbjct: 133 ASFNASLFEAIGKAVSTEARAMYNVGLA-GLTFWSPNVNIFRDPRWGRGQETPGEDPLLS 191
Query: 184 GKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLA 243
KYA YVRG+Q + +G +L+ +ACCKH+TAYDLDNWKG R+ F+A VT QD+
Sbjct: 192 SKYASGYVRGLQ-QSDDGSP--DRLKVAACCKHYTAYDLDNWKGVDRFHFNAVVTKQDMD 248
Query: 244 DTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVS 303
DT+QPPF+SCV G + +MC+YN+VNG P+CAD +LLS R +W +GYI SDCD+V
Sbjct: 249 DTFQPPFKSCVIDGNVASVMCSYNQVNGKPACADPDLLSGIVRGEWKLNGYIVSDCDSVD 308
Query: 304 IIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSV 363
+ Y+++ Y K+PE+A + AG+D+NCGSFL +HT+AAVK + ES +D+A+ N F+
Sbjct: 309 VFYNSQHYTKTPEEAAAKAILAGLDLNCGSFLGQHTEAAVKGGLVDESAVDKAVSNNFAT 368
Query: 364 RMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVS 423
MRLG F+GNP+ +GK+G VC+ HQ LA +AA+ GIVLLKNS G LPL + +
Sbjct: 369 LMRLGFFDGNPSKAIYGKLGPKDVCTSEHQELAREAARQGIVLLKNSKGSLPLSPTAIKT 428
Query: 424 LALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVD 483
LA+IGPNAN KT++GNY G C+ TPLQ L V T Y PGC VAC +A ID+A
Sbjct: 429 LAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLTALVA-TTYLPGCSNVACGTAQIDEAKK 487
Query: 484 IAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVD 543
IA AD VL++G+DQ+ E E DRV++ LPG+Q LIT VA+A+K VILV++ GG D
Sbjct: 488 IAAAADATVLIVGIDQSIEAEGRDRVNIQLPGQQPLLITEVAKASKGNVILVVMSGGGFD 547
Query: 544 ITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMK 602
I+FAK D I SILW GYPGEAG A+A+VIFG +NP GRLPMTWYPQ Y+ KVPMT+M
Sbjct: 548 ISFAKNDDKITSILWVGYPGEAGGAAIADVIFGFYNPSGRLPMTWYPQSYVDKVPMTNMN 607
Query: 603 MRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKF----KAVSQNKLYLNQSSSTKM 658
MRP SG PGRTYRFY G+ ++ FG GLSY+++++ K+VS + S +
Sbjct: 608 MRPDPASGYPGRTYRFYTGETIYTFGDGLSYTQFNHHLVQAPKSVSI-PIEEGHSCHSSK 666
Query: 659 VENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQ 718
++ D V E C+ F + + V N G ++G H V LF P N P K
Sbjct: 667 CKSVDAVQ---------ESCQNLVFDIHLRVNNAGNISGSHTVFLFSSPPSVHN-SPQKH 716
Query: 719 LVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
L+GF+ V + AK KA + F++ C+ LS E G + G H L VG+ ++ +++
Sbjct: 717 LLGFEKVFVTAKAKALVRFKVDVCKDLSIVDELGTRKVALGLHVLHVGNLKHSLNV 772
>gi|357442285|ref|XP_003591420.1| Beta xylosidase [Medicago truncatula]
gi|355480468|gb|AES61671.1| Beta xylosidase [Medicago truncatula]
Length = 765
Score = 772 bits (1993), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 378/772 (48%), Positives = 512/772 (66%), Gaps = 30/772 (3%)
Query: 12 LLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQ 71
L+ + F LL ++ + PF+CDP N ST FPFCK +LPI R DL+ RLTL EK+S
Sbjct: 5 LITIVFLLLLMSSEA-RDPFACDPKNTSTNNFPFCKASLPIPTRVNDLIGRLTLQEKVSM 63
Query: 72 LVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLW 131
LVN+A A+PR+GI YEWWSEALHGV+ VG G F G ATSFPQVI T ASF++ LW
Sbjct: 64 LVNNAAAVPRVGIKGYEWWSEALHGVSNVGPGTKFAGQFPAATSFPQVITTVASFNASLW 123
Query: 132 YRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYV 191
IG+ EARA+YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP++ GKYA SYV
Sbjct: 124 EAIGRVASDEARAMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPILAGKYAASYV 182
Query: 192 RGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFE 251
RG+QG +L+ +A CKHFTAYDLDNW G R+ F+A+V+ QD+ DT+ PF
Sbjct: 183 RGLQGTD------SSRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDMEDTFNVPFR 236
Query: 252 SCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGY 311
CVK+G + +MC+YN+VNG+P+CAD NLL +T R QW GYI SDCD+V + Y + Y
Sbjct: 237 MCVKEGNVASVMCSYNQVNGVPTCADPNLLKRTIRGQWHLDGYIVSDCDSVGVFYTNQHY 296
Query: 312 AKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFN 371
+PE+A D +KAG+D++CG FL +HT+ AVK+ L E++++ AL N +V+MRLG+F+
Sbjct: 297 TSTPEEAAADAIKAGLDLDCGPFLAQHTQNAVKKGLLTETDVNGALANTLTVQMRLGMFD 356
Query: 372 GNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNA 431
G P+ QP+G +G VC+P HQ LAL AA+ GIVLLKN+ LPL ++A+IGPN+
Sbjct: 357 GEPSAQPYGNLGPTDVCTPTHQELALDAARQGIVLLKNTGPSLPLSTKNHQTVAVIGPNS 416
Query: 432 NSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACS-SASIDKAVDIAKGADH 490
N+ T++GNYAG +C +PLQ + Y T++ PGC VAC+ A++ A+ AD
Sbjct: 417 NATVTMIGNYAGIACGYTSPLQGIGKYAR-TIHEPGCANVACNDDKQFGSALNAARQADA 475
Query: 491 VVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYD 550
VL+MGLDQ+ E E +DR L+LPG QQ+L+++VA A++ P ILVL+ GGP+DITFAK D
Sbjct: 476 TVLVMGLDQSIEAEMVDRTGLLLPGHQQDLVSKVAAASRGPTILVLMSGGPIDITFAKND 535
Query: 551 RNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK-VPMTDMKMRPQATS 609
I ILWAGYPG+AG A+A+++FG NPG +LPMTWYPQ Y+K + MT+M MRP +++
Sbjct: 536 PRIMGILWAGYPGQAGGAAIADILFGTTNPGAKLPMTWYPQGYLKNLAMTNMAMRPSSST 595
Query: 610 GNPGRTYRFYEGKEVFPFGCGLSYSKYSY------KFKAVSQNKLYLNQSSSTKMVENQD 663
G PGRTYRFY G V+PFG GLSY+ + + K +V + SS+ +
Sbjct: 596 GYPGRTYRFYNGPVVYPFGYGLSYTNFVHTLASAPKVVSVPVDGHRRGNSSNKAAIR--- 652
Query: 664 VVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLF-VKPARRGNGRPIKQLVGF 722
V H + C + I VKN G G + +L+F V P G+ P KQLV F
Sbjct: 653 VTHAR---------CGKLSIRLDIDVKNVGSKDGTNTLLVFSVPPTGNGHWAPQKQLVAF 703
Query: 723 QSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
+ V + AK + + + C+ LS + G I G H + +GD ++ +S+
Sbjct: 704 EKVYVPAKAQQRVRINIHVCKLLSVVDKSGTRRIPMGAHSIHIGDVKHFVSL 755
>gi|297834874|ref|XP_002885319.1| beta-1,4-xylosidase [Arabidopsis lyrata subsp. lyrata]
gi|297331159|gb|EFH61578.1| beta-1,4-xylosidase [Arabidopsis lyrata subsp. lyrata]
Length = 865
Score = 772 bits (1993), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 390/775 (50%), Positives = 511/775 (65%), Gaps = 51/775 (6%)
Query: 10 FPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKI 69
F + L +L++ + +Q F+CD ++P+T + FC +L RA+DLVSRL+L EK+
Sbjct: 6 FVGVSLLIAALVSSLCESQKNFACDRNDPATAKYGFCNVSLSYEARAKDLVSRLSLKEKV 65
Query: 70 SQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSY 129
QLVN A + RLG+P YEWWSEALHGV+ VG G+ FNGT+ GATSFP ILTAASF++
Sbjct: 66 QQLVNKATGVSRLGVPPYEWWSEALHGVSDVGPGVRFNGTVPGATSFPATILTAASFNTS 125
Query: 130 LWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVS 189
LW ++G+ + EARA++N G A G+T+W+PN+NIFRDPRWGRGQETPGEDPLV KYAV+
Sbjct: 126 LWLKMGEVVSTEARAMHNVGLA-GLTYWSPNVNIFRDPRWGRGQETPGEDPLVVSKYAVN 184
Query: 190 YVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPP 249
YV+G+Q D + GK + +L+ S+CCKH+TAYDLDNWKG R+ FDA+VT QDL DTYQPP
Sbjct: 185 YVKGLQ-DVQDAGKSR-RLKVSSCCKHYTAYDLDNWKGIDRFHFDAKVTKQDLEDTYQPP 242
Query: 250 FESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAE 309
F+SCV++G S +MC+YNRVNGIP+CAD NLL R QW GYI SDCD++ + +D
Sbjct: 243 FKSCVEEGDVSSVMCSYNRVNGIPTCADPNLLRGVIRGQWRLDGYIVSDCDSIQVYFDDI 302
Query: 310 GYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGL 369
Y K+ +++NCG FL K+T+ AVK KKL SE+D AL + V MRLG
Sbjct: 303 HYTKTR-----------LNMNCGDFLGKYTENAVKLKKLNGSEVDEALIYNYIVLMRLGF 351
Query: 370 FNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGP 429
F+G+P PFG++G VCS HQ+LAL+AA+ GIVLL+N G LPL K+ +A+IGP
Sbjct: 352 FDGDPKSLPFGQLGPSDVCSKDHQMLALEAAKQGIVLLEN-RGDLPLSKTAVKKIAVIGP 410
Query: 430 NANSAKTLLGNYAGPSCRSITPLQALQNYV-ENTVYYPGCDTVACSSAS-IDKAVDIAKG 487
NAN+ K ++ NYAG C+ +PLQ LQ YV E VY PGC V C + I AV
Sbjct: 411 NANATKVMISNYAGVPCKYTSPLQGLQKYVPEKVVYEPGCKDVNCGEQTLISAAVKAVSE 470
Query: 488 ADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFA 547
AD VL++GLDQT E E LDRV+L LPG Q++L+ VA AAKK V+LV++ GP+DI+FA
Sbjct: 471 ADVTVLVVGLDQTVEAEGLDRVNLTLPGYQEKLVRDVANAAKKTVVLVIMSAGPIDISFA 530
Query: 548 KYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQ 606
K I ++LW GYPGEAG A+A+VIFGD+NP GRLP TWY Q++ KV MTDM MRP
Sbjct: 531 KNLSTISAVLWVGYPGEAGGDAIAQVIFGDYNPSGRLPETWYSQEFADKVAMTDMNMRPN 590
Query: 607 ATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVH 666
+TSG PGR+YRFY GK ++ FG GLSYS + ST ++ ++H
Sbjct: 591 STSGFPGRSYRFYTGKPIYKFGYGLSYSAF------------------STFVLSAPSIIH 632
Query: 667 YKSVP----------ELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPAR-----RG 711
K+ P ++ T C K + IGVKN G+ +G H VL+F KP + G
Sbjct: 633 IKTNPILNLNKTTSIDISTVNCHDLKIRIVIGVKNRGQRSGSHVVLVFWKPPKCSKTLVG 692
Query: 712 NGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
G P QLVGF+ V + ++ E C++LS G + G H LV+G
Sbjct: 693 AGVPQTQLVGFERVEVGRSMTEKVTVEFDVCKALSLVDTHGKRKLVTGHHTLVIG 747
>gi|115459584|ref|NP_001053392.1| Os04g0530700 [Oryza sativa Japonica Group]
gi|38346629|emb|CAD41212.2| OSJNBa0074L08.23 [Oryza sativa Japonica Group]
gi|38346760|emb|CAE03865.2| OSJNBa0081C01.11 [Oryza sativa Japonica Group]
gi|113564963|dbj|BAF15306.1| Os04g0530700 [Oryza sativa Japonica Group]
gi|218195263|gb|EEC77690.1| hypothetical protein OsI_16749 [Oryza sativa Indica Group]
Length = 770
Score = 771 bits (1992), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 388/747 (51%), Positives = 514/747 (68%), Gaps = 17/747 (2%)
Query: 36 SNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALH 95
++P+ +PFC TLP RAR LVS LTLDEKI+QL N+A PRLG+P +EWWSE+LH
Sbjct: 31 ASPAASAYPFCNATLPFPARARALVSLLTLDEKIAQLSNTAAGAPRLGVPPFEWWSESLH 90
Query: 96 GVAGVGKGI-FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGM 154
GV G G+ F +G +R AT FPQVIL+AA+F+ LW +AI +EARA++NAGQA G+
Sbjct: 91 GVCDNGPGVNFSSGPVRSATIFPQVILSAAAFNRSLWRAAARAIAVEARAMHNAGQA-GL 149
Query: 155 TFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACC 214
TFWAPNIN+FRDPRWGRGQETPGEDP V Y+V YV+G Q D +G++ SACC
Sbjct: 150 TFWAPNINVFRDPRWGRGQETPGEDPAVVSAYSVEYVKGFQRDYGE----EGRMMLSACC 205
Query: 215 KHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPS 274
KH+ AYDL+ W+G TRY F+A+V QD+ DTYQPPF+SC+++GRAS +MC+YN+VNG+P+
Sbjct: 206 KHYIAYDLEKWRGFTRYTFNAKVNAQDMEDTYQPPFKSCIQEGRASCLMCSYNQVNGVPA 265
Query: 275 CADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSF 334
CA +++L + AR +WGF GYITSDCDAV+II++ + Y S ED++ VLKAGMD+NCGSF
Sbjct: 266 CARKDILQR-ARDEWGFQGYITSDCDAVAIIHENQTYTASDEDSIAVVLKAGMDINCGSF 324
Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
L +HTK+A+++ K+ E +I+ AL NLFSV++RLG F+ Q F ++G + VC+ H+
Sbjct: 325 LIRHTKSAIEKGKVQEEDINHALFNLFSVQLRLGFFDKTNENQWFTQLGPNNVCTTEHRE 384
Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
LA +A + G VLLKN +G LPL +S+ +ALIGP AN L G+Y G C S T ++
Sbjct: 385 LAAEAVRQGTVLLKNDNGFLPLKRSEVGHIALIGPAANDPYILGGDYTGVPCHSTTFVKG 444
Query: 455 LQNYVENTVYYPGCDTVACSSAS-IDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVL 513
+Q YV T + GC V C+S +A++ AK AD VVL+ GL+ T+E E+ DRV L+L
Sbjct: 445 MQAYVPKTTFAAGCKDVPCNSTDGFGEAIEAAKRADVVVLIAGLNLTEETEDHDRVSLLL 504
Query: 514 PGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEV 573
PGRQ +LI VA KKPV+LVL+ GGPVD++FAK+D I SILW GYPGE G L E+
Sbjct: 505 PGRQMDLIHTVASVTKKPVVLVLMGGGPVDVSFAKHDPRIASILWIGYPGEVGGNVLPEI 564
Query: 574 IFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSY 633
+FG +NPGG+LP+TWYP+ + VPM DM MR A+ G PGRTYRFY G V+ FG GLSY
Sbjct: 565 LFGKYNPGGKLPITWYPESFTAVPMDDMNMRADASRGYPGRTYRFYTGDVVYGFGYGLSY 624
Query: 634 SKYSYKFKAVSQNKLYLNQSSSTKMVENQ------DVVHYKSVPELGTEFCETRKFLVTI 687
SKYSY + K+ L++SS ++ + D V Y V ++ + CE +F V I
Sbjct: 625 SKYSYSILQAPK-KISLSRSSVPDLISRKPAYTRRDGVDYVQVEDIAS--CEALQFPVHI 681
Query: 688 GVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSR 747
V N G M G H VLLF G PIKQLVGF+ V A ++ + PC+ +S
Sbjct: 682 SVSNDGAMDGSHAVLLFASSKPSFPGSPIKQLVGFERVHTAAGRSTDVEITVDPCKLMSF 741
Query: 748 AREDGLMVIEEGTHFLVVGDEEYPISI 774
A +G V+ GTH L+VGDEE+ + I
Sbjct: 742 ANTEGTRVLFLGTHVLMVGDEEHELLI 768
>gi|147844622|emb|CAN82161.1| hypothetical protein VITISV_035506 [Vitis vinifera]
Length = 925
Score = 770 bits (1989), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 392/781 (50%), Positives = 512/781 (65%), Gaps = 30/781 (3%)
Query: 1 MIFHKLSLVFPLLCLCFTSLLTRVDSTQPP--FSCDPSNPSTETFPFCKTTLPISQRARD 58
+I + L PLL + +QP ++CD ++P++ FPFC T+LP RA D
Sbjct: 6 LILLPIFLSLPLLAI-----------SQPTHRYACDRTDPNSSQFPFCNTSLPYQDRASD 54
Query: 59 LVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQ 118
LVSRLTL EK QL+NSA I RLG+P YEWWSEALHGV+ G G+ F+ I T FP
Sbjct: 55 LVSRLTLQEKAKQLINSATGISRLGVPDYEWWSEALHGVSNSGIGVHFHDPIPAVTIFPA 114
Query: 119 VILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGE 178
VIL+AASF+ LWY +GQ + E RA+YN GQA G+T+W+PN+NIFRDPRWGRGQETPGE
Sbjct: 115 VILSAASFNESLWYTMGQVVSTEGRAMYNVGQA-GLTYWSPNVNIFRDPRWGRGQETPGE 173
Query: 179 DPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVT 238
DPLV +YAV+YVRG+Q G +L+ S+CCKH+TAYD+D WKG R+ FDA+VT
Sbjct: 174 DPLVVSRYAVNYVRGLQEVGKEGNFAADRLKVSSCCKHYTAYDVDKWKGVDRFHFDAKVT 233
Query: 239 MQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSD 298
+QDL DTYQPPF+ CV++G S +MC+YNRVNG+P+CA+ LL R QWG GYI SD
Sbjct: 234 LQDLEDTYQPPFKXCVEEGHVSSVMCSYNRVNGVPTCANPELLKGVIRDQWGLDGYIVSD 293
Query: 299 CDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALH 358
CD++ + ++ Y ++PEDAV LKAG+++NCGS+L +TK AV K+ ES +B+AL
Sbjct: 294 CDSIMVYHERMNYTETPEDAVALALKAGLNLNCGSYLGDYTKNAVNLGKVKESIVBQALI 353
Query: 359 NLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPK 418
+ V MRLG F+G+PTM PFGK+G VC+ HQ+LAL AA+ GIVLL N+ G LPL
Sbjct: 354 YNYIVLMRLGFFDGDPTMLPFGKMGPSDVCTVDHQLLALDAAKQGIVLLHNN-GALPLSP 412
Query: 419 SKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSAS- 477
+ + +LA+IGPNA++ T+L NYAG CR +PLQ LQ YV Y GC V+CS +
Sbjct: 413 NTTKTLAVIGPNADATNTMLSNYAGVPCRYTSPLQGLQKYVSAVSYEKGCANVSCSEETL 472
Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLL 537
I+ A IA AD V+++GLD E E+LDRV+L LPG Q++L+ A+AA VILV++
Sbjct: 473 IEGAASIASMADATVVVVGLDLFIEAEDLDRVNLTLPGFQEKLVMEAAKAANGTVILVVM 532
Query: 538 CGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KV 596
GPVDI+F K IG ILW GYPG+AG A+++VIFGD+NPGGR P TWYPQ+Y+ +V
Sbjct: 533 SAGPVDISFVKNVSKIGGILWVGYPGQAGGDAISQVIFGDYNPGGRSPFTWYPQEYVDQV 592
Query: 597 PMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSST 656
PMTDM MRP AT PGRTYRFY GK ++ FG GLSYS + YKF + + ++
Sbjct: 593 PMTDMNMRPNATXNFPGRTYRFYTGKSLYQFGHGLSYSTF-YKFIKSAPXTVLVHLLPQM 651
Query: 657 KMVENQDVVHYKSVPELGTEF---------CETRKFL-VTIGVKNHGEMAGKHPVLLFVK 706
M N +Y ++P T C + + IGVKN GE+ G H VL F K
Sbjct: 652 DM-PNIFSSNYPTMPNPNTNGQAIDISAIDCRNLSNIDIVIGVKNAGEIDGTHVVLAFWK 710
Query: 707 PARRG-NGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVV 765
P R G G P +LVGF+ V + + + L C +S E+G + G H LVV
Sbjct: 711 PPRSGVRGAPGVELVGFERVEVKRGKTEMVGMRLDVCGKISNVDEEGKRKLVMGMHTLVV 770
Query: 766 G 766
G
Sbjct: 771 G 771
>gi|224111912|ref|XP_002316021.1| predicted protein [Populus trichocarpa]
gi|222865061|gb|EEF02192.1| predicted protein [Populus trichocarpa]
Length = 768
Score = 770 bits (1989), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/754 (50%), Positives = 511/754 (67%), Gaps = 24/754 (3%)
Query: 27 TQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPA 86
++ PF+CDP T + FC+ LPI R RDL+ RLTL EKI LVN+A A+PRLGI
Sbjct: 24 SREPFACDPKLGLTRSLKFCRVNLPIHVRVRDLIGRLTLQEKIRLLVNNAAAVPRLGIQG 83
Query: 87 YEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY 146
YEWWSEALHGV+ VG G F G GAT+FPQVI TAASF+ LW IG+ + EARA+Y
Sbjct: 84 YEWWSEALHGVSNVGPGTKFGGAFPGATAFPQVITTAASFNESLWEEIGRVVSDEARAMY 143
Query: 147 NAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKG 206
N G A G+T+W+PN+N+FRDPRWGRGQETPGEDP+V GKYA SYVRG+QG+ NG +LK
Sbjct: 144 NGGMA-GLTYWSPNVNVFRDPRWGRGQETPGEDPVVAGKYAASYVRGLQGN--NGLRLK- 199
Query: 207 KLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAY 266
+ACCKH+TAYDLDNW G RY F+ARV+ QDL DTY PF+SCV G+ + +MC+Y
Sbjct: 200 ---VAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKSCVVAGKVASVMCSY 256
Query: 267 NRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAG 326
N+VNG P+CAD LL T R +WG +GYI SDCD+V +++D + Y +PE+A ++AG
Sbjct: 257 NQVNGKPTCADPYLLKNTIRGEWGLNGYIVSDCDSVGVLFDTQHYTATPEEAAASTIRAG 316
Query: 327 MDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADV 386
+D++CG FL HT+ AVK L E +++ AL N +V+MRLG+F+G P+ QPFG +G
Sbjct: 317 LDLDCGPFLAIHTENAVKGGLLKEEDVNMALANTITVQMRLGMFDGEPSAQPFGNLGPRD 376
Query: 387 VCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSC 446
VC+PAHQ LALQAA+ GIVLL+N LPL ++ ++A+IGPN++ T++GNYAG +C
Sbjct: 377 VCTPAHQQLALQAARQGIVLLQNRGRTLPLSRTLQ-TVAVIGPNSDVTVTMIGNYAGVAC 435
Query: 447 RSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDI-AKGADHVVLMMGLDQTQEKEE 505
TPLQ ++ Y + TV++PGC+ V C+ A ++ A+ AD +L+MGLDQ+ E E
Sbjct: 436 GYTTPLQGIRRYAK-TVHHPGCNDVFCNGNQQFNAAEVAARHADATILVMGLDQSIEAEF 494
Query: 506 LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEA 565
DR L+LPG QQEL++ VA A++ P ILVL+ GGP+D++FAK D IG+ILW GYPG+A
Sbjct: 495 RDRKGLLLPGYQQELVSIVARASRGPTILVLMSGGPIDVSFAKNDPRIGAILWVGYPGQA 554
Query: 566 GAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEV 624
G A+A+V+FG NPGG+LPMTWYP +Y+ KVPMT+M MR + G PGRTYRFY+G V
Sbjct: 555 GGAAIADVLFGTANPGGKLPMTWYPHNYLAKVPMTNMGMRADPSRGYPGRTYRFYKGPVV 614
Query: 625 FPFGCGLSYSKYSYKF----KAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCET 680
FPFG G+SY+ +++ + VS L+ S +T N V + + CE
Sbjct: 615 FPFGHGMSYTTFAHSLVQAPREVSVPLASLHVSRNTTGASNAIRVSHAN--------CEA 666
Query: 681 RKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELS 740
V I VKN G+M G H +L+F P G KQL+GF+ V L + + ++
Sbjct: 667 LALGVHIDVKNTGDMDGTHTLLVFSSPP-GGKWSTQKQLIGFEKVHLVTGSQKRVKIDIH 725
Query: 741 PCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
C+ LS G+ I G H L +GD ++ IS+
Sbjct: 726 VCKHLSVVDRFGIRRIPIGEHDLYIGDLKHSISL 759
>gi|302811514|ref|XP_002987446.1| hypothetical protein SELMODRAFT_426206 [Selaginella moellendorffii]
gi|300144852|gb|EFJ11533.1| hypothetical protein SELMODRAFT_426206 [Selaginella moellendorffii]
Length = 772
Score = 770 bits (1989), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 393/759 (51%), Positives = 519/759 (68%), Gaps = 25/759 (3%)
Query: 28 QPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAY 87
Q P++CD SN + FPFC T+LPI+ R D V+RLTL+EKISQL+N+A IPRLG+P Y
Sbjct: 27 QAPYACDQSNATLAAFPFCNTSLPITDRVEDYVARLTLEEKISQLINTATGIPRLGVPKY 86
Query: 88 EWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN 147
+WW EALHGVA G+ F G++ ATSFP I TAASF++ L+Y IGQA+ EARA++N
Sbjct: 87 QWWQEALHGVAS-SPGVQFGGSVPAATSFPMPITTAASFNTSLFYGIGQAVSTEARAMHN 145
Query: 148 AGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGK 207
GQ+ G+TFW+PNINI+RDPRWGRGQETPGEDPL++ +A YVRG+Q KLK
Sbjct: 146 LGQS-GLTFWSPNINIYRDPRWGRGQETPGEDPLLSSNFATYYVRGLQESQAGSDKLK-- 202
Query: 208 LQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYN 267
SACCKH TAYD+DNW GT RY F+A VT QDL DTY PF+SCV+ G S +MC+YN
Sbjct: 203 --VSACCKHMTAYDVDNWLGTDRYHFNAIVTEQDLEDTYNAPFKSCVEDGGVSSVMCSYN 260
Query: 268 RVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGM 327
R+NG+P+CAD LL+ T R W +GYI SDCD++ + +D YA + EDA D L AG+
Sbjct: 261 RLNGVPTCADHELLTTTVRETWKLNGYIVSDCDSLQVFFDNTNYAATAEDAAADALLAGL 320
Query: 328 DVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVV 387
++NCG+FL KHT +A++QKK+ E+ I++AL L +V+MRLGL++G+P Q +G +GA V
Sbjct: 321 NLNCGTFLAKHTLSAIQQKKVTEATINQALTYLVTVQMRLGLYDGDPKSQTYGSLGASDV 380
Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
C+ HQ LAL+AA+ G+VLLKN G LPL SK SLA++GP+AN+ + ++GNYAG C+
Sbjct: 381 CTSEHQTLALEAARQGMVLLKN-LGALPLSTSKIKSLAVVGPHANATRAMIGNYAGIPCK 439
Query: 448 SITPLQALQNYVENTVYYPGCDTVACSSAS-IDKAVDIAKGADHVVLMMGLDQTQEKEEL 506
+PLQA Q Y + + Y PGC VACSS S I AV A AD VV+ +GLD T E E L
Sbjct: 440 YTSPLQAFQKYAQVS-YAPGCANVACSSDSLISGAVSAAAAADAVVVAVGLDLTIEAESL 498
Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
DR L+LPG+QQEL+++V +AAK PV++V+L G +DI FA D I ILWAGYPG+AG
Sbjct: 499 DRTSLLLPGKQQELVSQVMQAAKGPVVIVILSAGAIDIPFALSDSRIAGILWAGYPGQAG 558
Query: 567 AVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFP 626
A+AEVIFGDHNP G+LP TWYPQ++ + M DM MRP A++G PGRTYRFY G +F
Sbjct: 559 GAAIAEVIFGDHNPSGKLPATWYPQNFTSISMLDMNMRPNASTGYPGRTYRFYTGPTIFK 618
Query: 627 FGCGLSYSKYSYKF-KAVSQNKLYLNQSSS------TKMVENQDVVHYKSVPELGTEFCE 679
FG GLSY+ S KF KA S +L+ S+ T + ++ H + E + CE
Sbjct: 619 FGDGLSYTSLSAKFIKAPS----FLSIPSTAPMQPCTGLKKSSSCFHLDATDE---KSCE 671
Query: 680 TRKFLVTIGVKNHGEMAGKHPVLLFVKPARRG-NGRPIKQLVGFQSVILNAKEKAE-IVF 737
+ K V I V+N G MA H ++LF P G +G P +QLVGF + + + ++F
Sbjct: 672 SLKSQVAISVRNKGAMAISHTLMLFSTPPSAGSDGVPQRQLVGFNKIQIAGDSISNPVIF 731
Query: 738 ELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
+L PC A DG ++ GTH L G+E++ + + V
Sbjct: 732 DLDPCRHFVHADRDGKKLLRSGTHVLTAGNEQHSLRLLV 770
>gi|74355968|dbj|BAE44362.1| alpha-L-arabinofuranosidase [Raphanus sativus]
Length = 780
Score = 769 bits (1986), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 398/775 (51%), Positives = 514/775 (66%), Gaps = 18/775 (2%)
Query: 6 LSLVFPLLCLCFTSLLTRVDSTQPPFSCD-PSNPSTETFPFCKTTLPISQRARDLVSRLT 64
LSL+F LCL +S ST P F+CD NPS + FC T + I R DLV+RLT
Sbjct: 20 LSLIF--LCLLDSS---NAQST-PVFACDVAGNPSLAAYGFCNTAIKIEYRVADLVARLT 73
Query: 65 LDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAA 124
L EKI L + + RLGIP YEWWSEALHGV+ VG G F+G + GATSFPQVILTAA
Sbjct: 74 LQEKIGVLTSKLHGVARLGIPTYEWWSEALHGVSYVGPGTRFSGQVPGATSFPQVILTAA 133
Query: 125 SFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTG 184
SF+ L+ IG+ + EARA+YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDPL++
Sbjct: 134 SFNVSLFQAIGKVVSTEARAMYNVGLA-GLTYWSPNVNIFRDPRWGRGQETPGEDPLLSS 192
Query: 185 KYAVSYVRGVQ-GDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLA 243
KYA YV+G+Q D+ + +LK +ACCKH+TAYD+DNWKG RY F+A V QDL
Sbjct: 193 KYASGYVKGLQETDSSDANRLK----VAACCKHYTAYDVDNWKGVERYSFNAVVNQQDLD 248
Query: 244 DTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVS 303
DTYQPPF+SCV G + +MC+YN+VNG P+CAD +LLS R +W +GYI SDCD+V
Sbjct: 249 DTYQPPFKSCVVDGNVASVMCSYNKVNGKPTCADPDLLSGVIRGEWKLNGYIVSDCDSVD 308
Query: 304 IIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSV 363
++Y + Y K+PE+A + AG+D+NCG FL HT+AAVK + E+ ID+A+ N F
Sbjct: 309 VLYKNQHYTKTPEEAAAISINAGLDLNCGYFLGDHTEAAVKAGLVKEAAIDKAITNNFLT 368
Query: 364 RMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVS 423
MRLG F+G+P Q +G +G VC+PA+Q LA +AA+ GIVLLKN+ G LPL +
Sbjct: 369 LMRLGFFDGDPKKQIYGGLGPKDVCTPANQELAAEAARQGIVLLKNT-GALPLSPKTIKT 427
Query: 424 LALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVD 483
LA+IGPNAN KT++GNY G C+ TPLQ L V +T Y PGC VAC+ A + +
Sbjct: 428 LAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAGTV-HTTYLPGCSNVACAVADVAGSTK 486
Query: 484 IAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVD 543
+A +D VL++G DQ+ E E DRVDL LPG+QQEL+T+VA+AAK PV LV++ GG D
Sbjct: 487 LAAASDATVLVIGADQSIEAESRDRVDLNLPGQQQELVTQVAKAAKGPVFLVIMSGGGFD 546
Query: 544 ITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMK 602
ITFAK D I ILW GYPGEAG +A A+VIFG +NP GRLPMTWYPQ Y+ KVPMT+M
Sbjct: 547 ITFAKNDAKIAGILWVGYPGEAGGIATADVIFGRYNPSGRLPMTWYPQSYVEKVPMTNMN 606
Query: 603 MRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKF-KAVSQNKLYLNQSSSTKMVEN 661
MRP ++G PGRTYRFY G+ V+ FG GLSY+K+S+ KA L L ++ + E
Sbjct: 607 MRPDKSNGYPGRTYRFYTGETVYAFGDGLSYTKFSHSLVKAPRLVSLSLEENHVCRSSEC 666
Query: 662 QDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVG 721
Q + + + F V I V+N G+ G H V LF P +G P K L+G
Sbjct: 667 QSLNAIGPHCDNAVSGTGGKAFEVHIKVQNGGDREGIHTVFLFTTPPAV-HGSPRKHLLG 725
Query: 722 FQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
F+ + L E+A + F++ C+ LS E G I G H L VGD ++ +SI +
Sbjct: 726 FEKIRLGKMEEAVVKFKVDVCKDLSVVDEVGKRKIGLGQHLLHVGDVKHSLSIRI 780
>gi|9972374|gb|AAG10624.1|AC022521_2 Similar to xylosidase [Arabidopsis thaliana]
Length = 763
Score = 769 bits (1986), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/754 (49%), Positives = 507/754 (67%), Gaps = 27/754 (3%)
Query: 27 TQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPA 86
++ F+CD + +T T FC+ ++PI +R RDL+ RLTL EK+S L N+A AIPRLGI
Sbjct: 21 SRETFACDTKDAATATLRFCQLSVPIPERVRDLIGRLTLAEKVSLLGNTAAAIPRLGIKG 80
Query: 87 YEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY 146
YEWWSEALHGV+ VG G F G ATSFPQVI T ASF++ LW IG+ + EARA+Y
Sbjct: 81 YEWWSEALHGVSNVGPGTKFGGVYPAATSFPQVITTVASFNASLWESIGRVVSNEARAMY 140
Query: 147 NAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKG 206
N G G+T+W+PN+NI RDPRWGRGQETPGEDP+V GKYA SYVRG+QG+ +
Sbjct: 141 NGGVG-GLTYWSPNVNILRDPRWGRGQETPGEDPVVAGKYAASYVRGLQGND------RS 193
Query: 207 KLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAY 266
+L+ +ACCKHFTAYDLDNW G R+ F+A+V+ QD+ DT+ PF CVK+G + IMC+Y
Sbjct: 194 RLKVAACCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVKEGNVASIMCSY 253
Query: 267 NRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAG 326
N+VNG+P+CAD NLL KT R QWG +GYI SDCD+V ++YD + Y +PE+A D +KAG
Sbjct: 254 NQVNGVPTCADPNLLKKTIRNQWGLNGYIVSDCDSVGVLYDTQHYTGTPEEAAADSIKAG 313
Query: 327 MDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADV 386
+D++CG FL HT AVK+ L ES++D AL N +V+MRLG+F+G+ QP+G +G
Sbjct: 314 LDLDCGPFLGAHTIDAVKKNLLRESDVDNALINTLTVQMRLGMFDGDIAAQPYGHLGPAH 373
Query: 387 VCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSC 446
VC+P H+ LAL+AAQ GIVLLKN LPL + ++A+IGPN+++ T++GNYAG +C
Sbjct: 374 VCTPVHKGLALEAAQQGIVLLKNHGSSLPLSSQRHRTVAVIGPNSDATVTMIGNYAGVAC 433
Query: 447 RSITPLQALQNYVENTVYYPGCDTVACSSASI-DKAVDIAKGADHVVLMMGLDQTQEKEE 505
+P+Q + Y T++ GC V C + D AV+ A+GAD VL+MGLDQ+ E E
Sbjct: 434 GYTSPVQGITGYAR-TIHQKGCVDVHCMDDRLFDAAVEAARGADATVLVMGLDQSIEAEF 492
Query: 506 LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEA 565
DR L+LPG+QQEL++RVA+AAK PVILVL+ GGP+DI+FA+ DR I +I+WAGYPG+
Sbjct: 493 KDRNSLLLPGKQQELVSRVAKAAKGPVILVLMSGGPIDISFAEKDRKIPAIVWAGYPGQE 552
Query: 566 GAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEV 624
G A+A+++FG NPGG+LPMTWYPQDY+ +PMT+M MRP + PGRTYRFY+G V
Sbjct: 553 GGTAIADILFGSANPGGKLPMTWYPQDYLTNLPMTEMSMRPVHSKRIPGRTYRFYDGPVV 612
Query: 625 FPFGCGLSYSKYSYKF----KAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCET 680
+PFG GLSY+++++ K + N + S K + V H + C+
Sbjct: 613 YPFGHGLSYTRFTHNIADAPKVIPIAVRGRNGTVSGKSIR---VTHAR---------CDR 660
Query: 681 RKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELS 740
V + V N G G H +L+F P G P KQLV F+ V + EK + +
Sbjct: 661 LSLGVHVEVTNVGSRDGTHTMLVFSAPP-GGEWAPKKQLVAFERVHVAVGEKKRVQVNIH 719
Query: 741 PCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
C+ LS G I G H + +GDE + +S+
Sbjct: 720 VCKYLSVVDRAGNRRIPIGDHGIHIGDESHTVSL 753
>gi|359481045|ref|XP_002268626.2| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
[Vitis vinifera]
gi|296089342|emb|CBI39114.3| unnamed protein product [Vitis vinifera]
Length = 774
Score = 768 bits (1983), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 395/777 (50%), Positives = 520/777 (66%), Gaps = 28/777 (3%)
Query: 9 VFPLLCLCFTSLLTR----VDSTQPPFSCD-PSNPSTETFPFCKTTLPISQRARDLVSRL 63
VF CF+ L+ + + P F+CD +NP+ F FC T+L + R DLV RL
Sbjct: 13 VFLCFLSCFSHFLSSPKWVLGQSSPVFACDVENNPTLGQFGFCNTSLETAARVADLVKRL 72
Query: 64 TLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTA 123
TL+EKI LVNSA ++ RLGIP YEWWSEALHGV+ VG G FN + GATSFPQVILTA
Sbjct: 73 TLEEKIGFLVNSAASVSRLGIPKYEWWSEALHGVSYVGPGTHFNSIVPGATSFPQVILTA 132
Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVT 183
ASF++ L+ IG+ + EARA+YN G A G+TFW+PN+NIFRDPRWGRGQETPGEDPL++
Sbjct: 133 ASFNASLFEAIGKVVSTEARAMYNVGLA-GLTFWSPNVNIFRDPRWGRGQETPGEDPLLS 191
Query: 184 GKYAVSYVRGVQ-GDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDL 242
KYA +YVRG+Q GD + +LK +ACCKH+TAYDLDNWKG R F+A VT QD+
Sbjct: 192 SKYASAYVRGLQQGDDGSPDRLK----VAACCKHYTAYDLDNWKGVDRLHFNAVVTKQDM 247
Query: 243 ADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAV 302
DT+QPPF+SCV G + +MC++N+VNG P+CAD +LLS R +W +GYI SDCD+V
Sbjct: 248 DDTFQPPFKSCVIDGNVASVMCSFNQVNGKPTCADPDLLSGIVRGEWKLNGYIVSDCDSV 307
Query: 303 SIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFS 362
+ Y+++ Y K+PE+A + AG+D+NCGSFL +HT+AAVK + ES +D+A+ N F+
Sbjct: 308 DVFYNSQHYTKTPEEAAAKAILAGLDLNCGSFLGQHTEAAVKGGLVDESAVDKAVSNNFA 367
Query: 363 VRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSV 422
MRLG F+GNP+ +GK+G VC+ HQ +A +AA+ GIVLLKNS G LPL +
Sbjct: 368 TLMRLGFFDGNPSKAIYGKLGPKDVCTSEHQEMAREAARQGIVLLKNSKGSLPLSPTAIK 427
Query: 423 SLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAV 482
+LA+IGPNAN KT++GNY G C+ TPLQ L V T Y PGC VAC +A ID+A
Sbjct: 428 TLAIIGPNANVTKTMIGNYEGTPCKYTTPLQGLTALVA-TTYLPGCSNVACGTAQIDEAK 486
Query: 483 DIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPV 542
IA AD VL++G+DQ+ E E DRV + LPG+Q LIT VA+A+K VILV++ GG
Sbjct: 487 KIAAAADATVLIVGIDQSIEAEGRDRVSIQLPGQQPLLITEVAKASKGNVILVVMSGGGF 546
Query: 543 DITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDM 601
DI+FAK D I SILW GYPGEAG A+A+VIFG +NP GRLPMTWYPQ Y+ KVPMT+M
Sbjct: 547 DISFAKNDDKIASILWVGYPGEAGGAAIADVIFGFYNPSGRLPMTWYPQSYVDKVPMTNM 606
Query: 602 KMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKF----KAVSQNKLYLNQSSSTK 657
MRP SG PGRTYRFY G+ ++ FG GLSY+++++ K+VS + S +
Sbjct: 607 NMRPDPASGYPGRTYRFYTGETIYTFGDGLSYTQFNHHLVQAPKSVSI-PIEEGHSCHSS 665
Query: 658 MVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIK 717
++ D V E C+ F + + V N G ++G H V LF P N P K
Sbjct: 666 KCKSVDAVQ---------ESCQNLAFDIHLRVNNAGNISGSHTVFLFSSPPSVHN-SPQK 715
Query: 718 QLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
L+GF+ V + AK +A + F++ C+ LS E G + G H L VG ++ +++
Sbjct: 716 HLLGFEKVFVTAKAEALVRFKVDVCKDLSIVDELGTQKVALGLHVLHVGSLKHSLNV 772
>gi|297745522|emb|CBI40687.3| unnamed protein product [Vitis vinifera]
Length = 751
Score = 768 bits (1983), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 394/772 (51%), Positives = 516/772 (66%), Gaps = 41/772 (5%)
Query: 9 VFPLLCLCFTSLLTR----VDSTQPPFSCD-PSNPSTETFPFCKTTLPISQRARDLVSRL 63
VF CF+ L+ + + P F+CD +NP+ F FC T+L + R DLV RL
Sbjct: 13 VFLCFLSCFSHFLSSPKWVLAQSSPVFACDVENNPTLGQFGFCNTSLETAARVADLVKRL 72
Query: 64 TLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTA 123
TL+EKI LVNSA ++ RLGIP YEWWSEALHGV+ VG G FN + GATSFPQVILTA
Sbjct: 73 TLEEKIGFLVNSAASVSRLGIPKYEWWSEALHGVSYVGPGTHFNSVVPGATSFPQVILTA 132
Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVT 183
ASF++ L+ IG+A+ EARA+YN G A G+TFW+PN+NIFRDPRWGRGQETPGEDPL++
Sbjct: 133 ASFNASLFEAIGKAVSTEARAMYNVGLA-GLTFWSPNVNIFRDPRWGRGQETPGEDPLLS 191
Query: 184 GKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLA 243
KYA YVRG+Q + +G +L+ +ACCKH+TAYDLDNWKG R+ F+A VT QD+
Sbjct: 192 SKYASGYVRGLQ-QSDDGSP--DRLKVAACCKHYTAYDLDNWKGVDRFHFNAVVTKQDMD 248
Query: 244 DTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVS 303
DT+QPPF+SCV G + +MC+YN+VNG P+CAD +LLS R +W +GYI SDCD+V
Sbjct: 249 DTFQPPFKSCVIDGNVASVMCSYNQVNGKPACADPDLLSGIVRGEWKLNGYIVSDCDSVD 308
Query: 304 IIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSV 363
+ Y+++ Y K+PE+A + AG+D+NCGSFL +HT+AAVK + ES +D+A+ N F+
Sbjct: 309 VFYNSQHYTKTPEEAAAKAILAGLDLNCGSFLGQHTEAAVKGGLVDESAVDKAVSNNFAT 368
Query: 364 RMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVS 423
MRLG F+GNP+ +GK+G VC+ HQ LA +AA+ GIVLLKNS G LPL + +
Sbjct: 369 LMRLGFFDGNPSKAIYGKLGPKDVCTSEHQELAREAARQGIVLLKNSKGSLPLSPTAIKT 428
Query: 424 LALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVD 483
LA+IGPNAN KT++GNY G C+ TPLQ L V T Y PGC VAC +A ID+A
Sbjct: 429 LAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLTALVA-TTYLPGCSNVACGTAQIDEAKK 487
Query: 484 IAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVD 543
IA AD VL++G+DQ+ E E DRV++ LPG+Q LIT VA+A+K VILV++ GG D
Sbjct: 488 IAAAADATVLIVGIDQSIEAEGRDRVNIQLPGQQPLLITEVAKASKGNVILVVMSGGGFD 547
Query: 544 ITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMK 602
I+FAK D I SILW GYPGEAG A+A+VIFG +NP GRLPMTWYPQ Y+ KVPMT+M
Sbjct: 548 ISFAKNDDKITSILWVGYPGEAGGAAIADVIFGFYNPSGRLPMTWYPQSYVDKVPMTNMN 607
Query: 603 MRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQ 662
MRP SG PGRTYRFY G+ ++ FG GLSY++++
Sbjct: 608 MRPDPASGYPGRTYRFYTGETIYTFGDGLSYTQFN------------------------- 642
Query: 663 DVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGF 722
H+ SV + E C+ F + + V N G ++G H V LF P N P K L+GF
Sbjct: 643 ---HHLSVDAV-QESCQNLVFDIHLRVNNAGNISGSHTVFLFSSPPSVHN-SPQKHLLGF 697
Query: 723 QSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
+ V + AK KA + F++ C+ LS E G + G H L VG+ ++ +++
Sbjct: 698 EKVFVTAKAKALVRFKVDVCKDLSIVDELGTRKVALGLHVLHVGNLKHSLNV 749
>gi|297797477|ref|XP_002866623.1| beta-xylosidase 4 [Arabidopsis lyrata subsp. lyrata]
gi|297312458|gb|EFH42882.1| beta-xylosidase 4 [Arabidopsis lyrata subsp. lyrata]
Length = 784
Score = 768 bits (1983), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 401/782 (51%), Positives = 516/782 (65%), Gaps = 26/782 (3%)
Query: 6 LSLVFPLLCLCFTSLLTRVDSTQ--PPFSCD-PSNPSTETFPFCKTTLPISQRARDLVSR 62
+S VF L LCF + + Q P F+CD +NPS + FC T L I R DLV+R
Sbjct: 18 VSSVF-LTFLCFFLYFLDLSNAQSSPVFACDVAANPSLAAYGFCNTVLKIEYRVADLVAR 76
Query: 63 LTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILT 122
LTL EKI LV+ A + RLGIP YEWWSEALHGV+ +G G F+ + GATSFPQVILT
Sbjct: 77 LTLQEKIGFLVSKANGVTRLGIPTYEWWSEALHGVSYIGPGTHFSSQVPGATSFPQVILT 136
Query: 123 AASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLV 182
AASF+ L+ IG+ + EARA+YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDPL+
Sbjct: 137 AASFNVSLFQAIGKVVSTEARAMYNVGLA-GLTYWSPNVNIFRDPRWGRGQETPGEDPLL 195
Query: 183 TGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDL 242
KYA YV+G+Q +GG +L+ +ACCKH+TAYD+DNWKG RY F+A VT QD+
Sbjct: 196 ASKYASGYVKGLQ--ETDGGD-SNRLKVAACCKHYTAYDVDNWKGVERYSFNAVVTQQDM 252
Query: 243 ADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAV 302
DTYQPPF+SCV G + +MC+YN+VNG P+CAD +LLS R +W +GYI SDCD+V
Sbjct: 253 DDTYQPPFKSCVVDGNVASVMCSYNQVNGKPTCADPDLLSGVIRGEWKLNGYIVSDCDSV 312
Query: 303 SIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFS 362
++Y + Y K+P +A + AG+D+NCGSFL +HT+ AVK + E+ ID+A+ N F
Sbjct: 313 DVLYKNQHYTKTPAEAAAISILAGLDLNCGSFLGQHTEEAVKSGLVNEAAIDKAISNNFL 372
Query: 363 VRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSV 422
MRLG F+GNP Q +G +G VC+ A+Q LA AA+ GIVLLKN+ G LPL
Sbjct: 373 TLMRLGFFDGNPKNQIYGGLGPTDVCTSANQELAADAARQGIVLLKNT-GFLPLSPKSIK 431
Query: 423 SLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAV 482
+LA+IGPNAN KT++GNY G C+ TPLQ L V +T Y PGC VAC+ A + A
Sbjct: 432 TLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAGAV-STTYLPGCSNVACAVADVAGAT 490
Query: 483 DIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPV 542
+A AD VL++G DQ+ E E DRVDL LPG+QQEL+ +VA+AAK PV+LV++ GG
Sbjct: 491 KLAATADVTVLLIGADQSIEAESRDRVDLNLPGQQQELVIQVAKAAKGPVLLVIMSGGGF 550
Query: 543 DITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDM 601
DITFAK D I ILW GYPGEAG +A+A++IFG +NP GRLPMTWYPQ Y+ KVPMT M
Sbjct: 551 DITFAKNDPKIAGILWVGYPGEAGGIAIADIIFGRYNPSGRLPMTWYPQSYVEKVPMTIM 610
Query: 602 KMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKF-KAVSQNKLYLNQSSSTKMVE 660
MRP + G PGRTYRFY G+ V+ FG GLSY+K+S+ KA S L L ++ + E
Sbjct: 611 NMRPDKSKGYPGRTYRFYTGETVYAFGDGLSYTKFSHSLVKAPSLVSLSLEENHVCRSSE 670
Query: 661 NQDVVHYKSVPELGTEFCET------RKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGR 714
Q S+ +G CE F V I V+N G+ G H V LF P +G
Sbjct: 671 CQ------SLDAIGPH-CENAVSGGGSAFEVQIKVRNGGDREGIHTVFLFTTPPAI-HGS 722
Query: 715 PIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
P K L+GF+ + L E+A + F++ C+ LS E G I G H L VGD ++ +SI
Sbjct: 723 PRKHLLGFEKIRLGKMEEAVVRFKVEVCKDLSVVDEIGKRKIGLGKHLLHVGDLKHSLSI 782
Query: 775 FV 776
+
Sbjct: 783 RI 784
>gi|298364130|gb|ADI79208.1| alpha-L-arabinofuranosidase/beta-D-xylosidase [Malus x domestica]
Length = 774
Score = 768 bits (1982), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 378/757 (49%), Positives = 503/757 (66%), Gaps = 22/757 (2%)
Query: 24 VDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLG 83
V +PPF+CDP NP T T FC+ +PI R +DL+ RLTL EKI LVN+A A+PRLG
Sbjct: 25 VVHARPPFACDPRNPITRTLKFCRVRVPIHVRVQDLIGRLTLQEKIGLLVNNAIAVPRLG 84
Query: 84 IPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEAR 143
I YEWWSEALHGV+ VG G F GT GATSFPQVI TAASF+ LW IG+ + EAR
Sbjct: 85 IQGYEWWSEALHGVSNVGPGTKF-GTFLGATSFPQVITTAASFNESLWEEIGRVVSDEAR 143
Query: 144 ALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGK 203
A+YN G A G+TFW+PN+NIFRDPRWGRGQETPGEDP++ KY YV+G+QGD
Sbjct: 144 AMYNGGAA-GLTFWSPNVNIFRDPRWGRGQETPGEDPILAAKYGARYVKGLQGD-----G 197
Query: 204 LKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIM 263
+L+ +ACCKH+TAYDLDNW G R+ F+ARV+ QDL DTY PF +CV G + +M
Sbjct: 198 AGNRLKVAACCKHYTAYDLDNWNGVDRFHFNARVSKQDLEDTYNVPFRACVVDGNVASVM 257
Query: 264 CAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVL 323
C+YN+VNG P+CAD LL T R QW +GYI SDCD+V + YD + Y K+PE+A +
Sbjct: 258 CSYNQVNGKPTCADPELLKGTIRGQWKLNGYIVSDCDSVGVYYDNQHYTKTPEEAAAYAI 317
Query: 324 KAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIG 383
KAG+D++CG FL HT+AAV+ ++ E +I+ AL N +V+MRLG+F+G P+ Q +G +G
Sbjct: 318 KAGLDLDCGPFLGIHTEAAVRFGQVNEIDINYALANTITVQMRLGMFDGEPSAQRYGNLG 377
Query: 384 ADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG 443
VC P+ LAL+AA+ GIVLL+N LPL + ++A+IGPN++ +T++GNYAG
Sbjct: 378 LADVCKPSSNELALEAARQGIVLLENRGNSLPLSTMRHRTVAVIGPNSDVTETMIGNYAG 437
Query: 444 PSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDI-AKGADHVVLMMGLDQTQE 502
+C TPLQ + Y T++ GC V C+ + A ++ A+ AD VL++GLDQ+ E
Sbjct: 438 IACGYTTPLQGIARYTR-TIHQAGCTDVHCNGNQLIGAAEVAARQADATVLVIGLDQSIE 496
Query: 503 KEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYP 562
E DR DL+LPG QQEL++RVA A++ P ILV++ GGP+D+TFAK D IG+I+W GYP
Sbjct: 497 AEFRDRTDLLLPGHQQELVSRVARASRGPTILVIMSGGPIDVTFAKNDPRIGAIIWVGYP 556
Query: 563 GEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEG 621
G+AG A+A+V+FG NP G+LPMTWYPQ+Y+ +PMTDM MR G PGRTYRFY+G
Sbjct: 557 GQAGGTAIADVLFGTTNPSGKLPMTWYPQNYVANLPMTDMAMRADPARGYPGRTYRFYKG 616
Query: 622 KEVFPFGCGLSYSKYSYKFKA----VSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEF 677
VFPFG GLSY+++S+ VS L S +T M+ N D+ +
Sbjct: 617 PVVFPFGLGLSYTRFSHSLAQGPTLVSVPFTSLVASKNTTMLGNHDI-------RVSHTN 669
Query: 678 CETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVF 737
C++ V I +KN G M G H +L+F P G P KQLVGF V + A + +
Sbjct: 670 CDSLSLDVHIDIKNSGTMDGTHTLLVFATPP-TGKWAPNKQLVGFHKVHIVAGSERRVRV 728
Query: 738 ELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
+ C+ LS E G+ I G H L +GD ++ +S+
Sbjct: 729 GVQVCKHLSVVDELGIRRIPLGQHKLEIGDLQHHVSV 765
>gi|296083056|emb|CBI22460.3| unnamed protein product [Vitis vinifera]
Length = 896
Score = 767 bits (1981), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 389/771 (50%), Positives = 502/771 (65%), Gaps = 58/771 (7%)
Query: 1 MIFHKLSLVFPLLCLCFTSLLTRVDSTQPP--FSCDPSNPSTETFPFCKTTLPISQRARD 58
+I + L PLL + +QP ++CD ++P++ FPFC T+LP RA D
Sbjct: 30 LILLPIFLSLPLLAI-----------SQPTHRYACDRTDPNSSQFPFCNTSLPYQDRASD 78
Query: 59 LVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQ 118
LVSRLTL EK QL+NSA I RLG+P YEWWSEALHGV+ G G+ F+ I T FP
Sbjct: 79 LVSRLTLQEKAKQLINSATGISRLGVPDYEWWSEALHGVSNSGIGVHFHDPIPAVTIFPA 138
Query: 119 VILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGE 178
VIL+AASF+ LWY +GQ + E RA+YN GQA G+T+W+PN+NIFRDPRWGRGQETPGE
Sbjct: 139 VILSAASFNESLWYTMGQVVSTEGRAMYNVGQA-GLTYWSPNVNIFRDPRWGRGQETPGE 197
Query: 179 DPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVT 238
DPLV +YAV+YVRG+Q G +L+ S+CCKH+TAYD+D WKG R+ FDA+VT
Sbjct: 198 DPLVVSRYAVNYVRGLQEVGKEGNFAADRLKVSSCCKHYTAYDVDKWKGVDRFHFDAKVT 257
Query: 239 MQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSD 298
+QDL DTYQPPF+SCV++G S +MC+YNRVNG+P+CA+ LL R QWG GYI SD
Sbjct: 258 LQDLEDTYQPPFKSCVEEGHVSSVMCSYNRVNGVPTCANPELLKGVIRDQWGLDGYIVSD 317
Query: 299 CDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALH 358
CD++ + ++ Y ++PEDAV LKAG+++NCGS+L +TK AV K+ ES +++AL
Sbjct: 318 CDSIMVYHERMNYTETPEDAVALALKAGLNLNCGSYLGDYTKNAVNLGKVKESIVNQALI 377
Query: 359 NLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPK 418
+ V MRLG F+G+PTM PFGK+G VC+ HQ+LAL AA+ GIVLL N +G LPL
Sbjct: 378 YNYIVLMRLGFFDGDPTMLPFGKMGPSDVCTVDHQLLALDAAKQGIVLLHN-NGALPLSP 436
Query: 419 SKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSAS- 477
+ + +LA+IGPNA++ T+L NYAG CR +PLQ LQ YV Y GC V+CS +
Sbjct: 437 NTTKTLAVIGPNADATNTMLSNYAGVPCRYTSPLQGLQKYVSAVSYEKGCANVSCSEETL 496
Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLL 537
I+ A IA AD V+++GLD E E+LDRV+L LPG Q++L+ A+AA VILV++
Sbjct: 497 IEGAASIASMADATVVVVGLDLFIEAEDLDRVNLTLPGFQEKLVMEAAKAANGTVILVVM 556
Query: 538 CGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KV 596
GPVDI+F K IG ILW GYPG+AG A+++VIFGD+NPGGR P TWYPQ+Y+ +V
Sbjct: 557 SAGPVDISFVKNVSKIGGILWVGYPGQAGGDAISQVIFGDYNPGGRSPFTWYPQEYVDQV 616
Query: 597 PMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSST 656
PMTDM MRP ATS PGRTYRFY GK ++ FG GLSYS +
Sbjct: 617 PMTDMNMRPNATSNFPGRTYRFYTGKSLYQFGHGLSYSTF-------------------Y 657
Query: 657 KMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRG-NGRP 715
K + N D+V IGVKN GE+ G H VL F KP R G G P
Sbjct: 658 KNLSNIDIV---------------------IGVKNAGEIDGTHVVLAFWKPPRSGVRGAP 696
Query: 716 IKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
+LVGF+ V + + + L C +S E+G + G H LVVG
Sbjct: 697 GVELVGFERVEVKRGKTEMVGMRLDVCGKISNVDEEGKRKLVMGMHTLVVG 747
>gi|32481073|gb|AAP83934.1| auxin-induced beta-glucosidase [Chenopodium rubrum]
Length = 767
Score = 767 bits (1981), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 383/764 (50%), Positives = 515/764 (67%), Gaps = 21/764 (2%)
Query: 12 LLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQ 71
C +L ++ P +CDP + T FC+ LPI R +DL+ RL L EK+
Sbjct: 7 FFCFLVLFILLSAEARAAPLACDPKSGLTRALRFCRVNLPIRARVQDLIGRLNLQEKVKL 66
Query: 72 LVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLW 131
LVN+A +PRLGI YEWWSEALHGV+ VG G F G ATSFPQVI TAASF++ LW
Sbjct: 67 LVNNAAPVPRLGISGYEWWSEALHGVSNVGPGTKFRGAFPAATSFPQVITTAASFNASLW 126
Query: 132 YRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYV 191
IGQ + EARA+YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP + +YA SYV
Sbjct: 127 EAIGQVVSDEARAMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPTLASQYAASYV 185
Query: 192 RGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFE 251
RG+QG +N K +L+ +ACCKH+TAYDLDNW R+ F+A+V+ QDL DTY PF+
Sbjct: 186 RGLQG-IYN----KNRLKVAACCKHYTAYDLDNWNAVDRFHFNAKVSKQDLEDTYNVPFK 240
Query: 252 SCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGY 311
CV++GR + +MC+YN+VNG P+CAD +LL T R QW +GYI SDCD+V ++YD + Y
Sbjct: 241 GCVQEGRVASVMCSYNQVNGKPTCADPDLLRNTIRGQWRLNGYIVSDCDSVGVLYDDQHY 300
Query: 312 AKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFN 371
++PE+A D +KAG+D++CG FL HT+AAVK+ L E+++++AL N F+V+MRLG+F+
Sbjct: 301 TRTPEEAAADTIKAGLDLDCGPFLAVHTEAAVKRGLLTEADVNQALTNTFTVQMRLGMFD 360
Query: 372 GNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNA 431
G QPFG +G VCSPAHQ LALQAA+ GIVLL+N LPL ++ ++A+IGPNA
Sbjct: 361 GEAAAQPFGHLGPKDVCSPAHQDLALQAARQGIVLLQNRGRSLPLSTARHRNIAVIGPNA 420
Query: 432 NSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSS-ASIDKAVDIAKGADH 490
++ T++GNYAG +C +PLQ + Y + TV+ GC VAC+S A A AD
Sbjct: 421 DATVTMIGNYAGVACGYTSPLQGIARYAK-TVHQAGCIGVACTSNQQFGAATAAAAHADA 479
Query: 491 VVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYD 550
VL+MGLDQ+ E E DR ++LPG QQEL+++VA A++ P ILVL+CGGPVD+TFAK D
Sbjct: 480 TVLVMGLDQSIEAEFRDRASVLLPGHQQELVSKVALASRGPTILVLMCGGPVDVTFAKND 539
Query: 551 RNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATS 609
I +ILW GYPG+AG A+A+V+FG NPGG+LP TWYPQ Y+ KVPMTD+ MR ++
Sbjct: 540 PKISAILWVGYPGQAGGTAIADVLFGTTNPGGKLPNTWYPQSYVAKVPMTDLAMRANPSN 599
Query: 610 GNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYL---NQSSSTKMVE-NQDVV 665
G PGRTYRFY+G VFPFG GLSY++++ A + K+ + NQ +++ + N+D +
Sbjct: 600 GYPGRTYRFYKGPVVFPFGFGLSYTRFTQSL-AHAPTKVMVPLANQFTNSNITSFNKDAL 658
Query: 666 HYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSV 725
L T C+ + I VKN G++ G H +L+F P +G KQL+GF+ V
Sbjct: 659 KV-----LHTN-CDNIPLSLHIDVKNKGKVDGSHTILVFSTPP-KGTKSSEKQLIGFKRV 711
Query: 726 ILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEE 769
+ A K + + C LSRA E G+ I G H L +GD+
Sbjct: 712 HVFAGSKQRVRMNIHVCNHLSRADEFGVRRIPIGEHTLHIGDDH 755
>gi|15237736|ref|NP_201262.1| beta-D-xylosidase 4 [Arabidopsis thaliana]
gi|75262663|sp|Q9FLG1.1|BXL4_ARATH RecName: Full=Beta-D-xylosidase 4; Short=AtBXL4; Flags: Precursor
gi|10178060|dbj|BAB11424.1| beta-xylosidase [Arabidopsis thaliana]
gi|332010539|gb|AED97922.1| beta-D-xylosidase 4 [Arabidopsis thaliana]
Length = 784
Score = 767 bits (1980), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 399/776 (51%), Positives = 513/776 (66%), Gaps = 25/776 (3%)
Query: 12 LLCLCFTSLLTRVDSTQ--PPFSCD-PSNPSTETFPFCKTTLPISQRARDLVSRLTLDEK 68
L+ LCF + Q P F+CD +NPS + FC T L I R DLV+RLTL EK
Sbjct: 23 LIFLCFFLYFLNFSNAQSSPVFACDVAANPSLAAYGFCNTVLKIEYRVADLVARLTLQEK 82
Query: 69 ISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDS 128
I LV+ A + RLGIP YEWWSEALHGV+ +G G F+ + GATSFPQVILTAASF+
Sbjct: 83 IGFLVSKANGVTRLGIPTYEWWSEALHGVSYIGPGTHFSSQVPGATSFPQVILTAASFNV 142
Query: 129 YLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAV 188
L+ IG+ + EARA+YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDPL+ KYA
Sbjct: 143 SLFQAIGKVVSTEARAMYNVGLA-GLTYWSPNVNIFRDPRWGRGQETPGEDPLLASKYAS 201
Query: 189 SYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQP 248
YV+G+Q +GG +L+ +ACCKH+TAYD+DNWKG RY F+A VT QD+ DTYQP
Sbjct: 202 GYVKGLQ--ETDGGD-SNRLKVAACCKHYTAYDVDNWKGVERYSFNAVVTQQDMDDTYQP 258
Query: 249 PFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDA 308
PF+SCV G + +MC+YN+VNG P+CAD +LLS R +W +GYI SDCD+V ++Y
Sbjct: 259 PFKSCVVDGNVASVMCSYNQVNGKPTCADPDLLSGVIRGEWKLNGYIVSDCDSVDVLYKN 318
Query: 309 EGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLG 368
+ Y K+P +A + AG+D+NCGSFL +HT+ AVK + E+ ID+A+ N F MRLG
Sbjct: 319 QHYTKTPAEAAAISILAGLDLNCGSFLGQHTEEAVKSGLVNEAAIDKAISNNFLTLMRLG 378
Query: 369 LFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIG 428
F+GNP Q +G +G VC+ A+Q LA AA+ GIVLLKN+ G LPL +LA+IG
Sbjct: 379 FFDGNPKNQIYGGLGPTDVCTSANQELAADAARQGIVLLKNT-GCLPLSPKSIKTLAVIG 437
Query: 429 PNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGA 488
PNAN KT++GNY G C+ TPLQ L V +T Y PGC VAC+ A + A +A A
Sbjct: 438 PNANVTKTMIGNYEGTPCKYTTPLQGLAGTV-STTYLPGCSNVACAVADVAGATKLAATA 496
Query: 489 DHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAK 548
D VL++G DQ+ E E DRVDL LPG+QQEL+ +VA+AAK PV+LV++ GG DITFAK
Sbjct: 497 DVSVLVIGADQSIEAESRDRVDLHLPGQQQELVIQVAKAAKGPVLLVIMSGGGFDITFAK 556
Query: 549 YDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQA 607
D I ILW GYPGEAG +A+A++IFG +NP G+LPMTWYPQ Y+ KVPMT M MRP
Sbjct: 557 NDPKIAGILWVGYPGEAGGIAIADIIFGRYNPSGKLPMTWYPQSYVEKVPMTIMNMRPDK 616
Query: 608 TSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKF-KAVSQNKLYLNQSSSTKMVENQDVVH 666
SG PGRTYRFY G+ V+ FG GLSY+K+S+ KA S L L ++ + E Q
Sbjct: 617 ASGYPGRTYRFYTGETVYAFGDGLSYTKFSHTLVKAPSLVSLGLEENHVCRSSECQ---- 672
Query: 667 YKSVPELGTEFCET------RKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLV 720
S+ +G CE F V I V+N G+ G H V LF P +G P K LV
Sbjct: 673 --SLDAIGPH-CENAVSGGGSAFEVHIKVRNGGDREGIHTVFLFTTPPAI-HGSPRKHLV 728
Query: 721 GFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
GF+ + L +E+A + F++ C+ LS E G I G H L VGD ++ +SI +
Sbjct: 729 GFEKIRLGKREEAVVRFKVEICKDLSVVDEIGKRKIGLGKHLLHVGDLKHSLSIRI 784
>gi|302796583|ref|XP_002980053.1| hypothetical protein SELMODRAFT_112087 [Selaginella moellendorffii]
gi|300152280|gb|EFJ18923.1| hypothetical protein SELMODRAFT_112087 [Selaginella moellendorffii]
Length = 772
Score = 767 bits (1980), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 392/759 (51%), Positives = 517/759 (68%), Gaps = 25/759 (3%)
Query: 28 QPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAY 87
Q P++CD SN + FPFC T+L I+ R D V+RLTL+EKISQL+N+A IPRLG+P Y
Sbjct: 27 QAPYACDQSNATLAAFPFCNTSLAITDRVEDYVARLTLEEKISQLINTATGIPRLGVPKY 86
Query: 88 EWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN 147
+WW EALHGVA G+ F G++ ATSFP I TAASF++ L+Y IGQA+ EARA++N
Sbjct: 87 QWWQEALHGVAS-SPGVQFGGSVPAATSFPMPITTAASFNTSLFYGIGQAVSTEARAMHN 145
Query: 148 AGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGK 207
GQ+ G+TFW+PNINI+RDPRWGRGQETPGEDPL++ +A YVRG+Q KLK
Sbjct: 146 LGQS-GLTFWSPNINIYRDPRWGRGQETPGEDPLLSSNFATYYVRGLQESQAGSDKLK-- 202
Query: 208 LQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYN 267
SACCKH TAYD+DNW GT RY F+A VT QDL DTY PF+SCV+ G S +MC+YN
Sbjct: 203 --VSACCKHMTAYDVDNWLGTDRYHFNAIVTEQDLEDTYNAPFKSCVEDGGVSSVMCSYN 260
Query: 268 RVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGM 327
R+NG+P+CAD LL+ T R W +GYI SDCD++ + +D YA + EDA D L AG+
Sbjct: 261 RLNGVPTCADHELLTTTVRETWKLNGYIVSDCDSLQVFFDNTNYAATAEDAAADALLAGL 320
Query: 328 DVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVV 387
++NCG+FL KHT +A++QKK+ E+ I++AL L +V+MRLGL++G+P Q +G +GA V
Sbjct: 321 NLNCGTFLAKHTLSAIQQKKVTEATINQALTYLVTVQMRLGLYDGDPKSQTYGSLGASDV 380
Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
C+ HQ LAL+AA+ G+VLLKN G LPL SK SLA++GP+AN+ + ++GNYAG C+
Sbjct: 381 CTSEHQTLALEAARQGMVLLKN-LGALPLSTSKIKSLAVVGPHANATRAMIGNYAGIPCK 439
Query: 448 SITPLQALQNYVENTVYYPGCDTVACSSAS-IDKAVDIAKGADHVVLMMGLDQTQEKEEL 506
+PLQA Q Y + Y PGC VACSS S I AV A AD VV+ +GLD T E E L
Sbjct: 440 YTSPLQAFQKYAQ-VSYAPGCANVACSSDSLISGAVSAAAAADAVVVAVGLDLTIEAESL 498
Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
DR L+LPG+QQEL+++V +AAK PV++V+L G +DI FA D I ILWAGYPG+AG
Sbjct: 499 DRTSLLLPGKQQELVSQVMQAAKGPVVIVILSAGAIDIPFALSDSRIAGILWAGYPGQAG 558
Query: 567 AVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFP 626
A+AEVIFGDHNP G+LP TWYPQ++ + M DM MRP A++G PGRTYRFY G +F
Sbjct: 559 GAAIAEVIFGDHNPSGKLPATWYPQNFTSISMLDMNMRPNASTGYPGRTYRFYTGPTIFK 618
Query: 627 FGCGLSYSKYSYKF-KAVSQNKLYLNQSSS------TKMVENQDVVHYKSVPELGTEFCE 679
FG GLSY+ S KF KA S +L+ S+ T + ++ H + E + CE
Sbjct: 619 FGDGLSYTSLSAKFIKAPS----FLSIPSTAPMQPCTGLKKSSSCFHLDATDE---KSCE 671
Query: 680 TRKFLVTIGVKNHGEMAGKHPVLLFVKPARRG-NGRPIKQLVGFQSVILNAKEKAE-IVF 737
+ K V I V+N G MA H ++LF P G +G P +QLVGF + + + ++F
Sbjct: 672 SLKSQVAISVRNKGAMAISHTLMLFSTPPNAGSDGVPQRQLVGFNKIQIAGDSISNPVIF 731
Query: 738 ELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
+L PC A DG ++ GTH L G+E++ + + V
Sbjct: 732 DLDPCRHFVHADPDGKKLLRSGTHVLTAGNEQHSLRLLV 770
>gi|242076578|ref|XP_002448225.1| hypothetical protein SORBIDRAFT_06g023450 [Sorghum bicolor]
gi|241939408|gb|EES12553.1| hypothetical protein SORBIDRAFT_06g023450 [Sorghum bicolor]
Length = 766
Score = 766 bits (1978), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/750 (50%), Positives = 514/750 (68%), Gaps = 17/750 (2%)
Query: 33 CDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSE 92
C + +PFC +L I RAR LVS LTLDEKI+QL N+A +PRLGIP Y+WWSE
Sbjct: 24 CASPYSGSSAYPFCDASLSIPARARALVSLLTLDEKIAQLSNTAGGVPRLGIPPYQWWSE 83
Query: 93 ALHGVAGVGKGI-FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA 151
+LHG+A G G+ F +G +R AT+FPQVIL+ A+F+ LW + +A+ EA ++NAGQA
Sbjct: 84 SLHGLADNGPGVNFSSGPVRAATTFPQVILSTAAFNRSLWRAVAEAVATEALGMHNAGQA 143
Query: 152 IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQAS 211
G+T+WAPNINIFRDPRWGRGQET GEDP V Y++ YV+G QG+ +G+++ S
Sbjct: 144 -GLTYWAPNINIFRDPRWGRGQETSGEDPAVAAAYSLEYVKGFQGEQGE----EGRIRLS 198
Query: 212 ACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
ACCKH+TAYD++ W+G +RY F+A+V QDL DTYQPPF++C+++ RAS +MCAYN+VNG
Sbjct: 199 ACCKHYTAYDMEKWEGFSRYTFNAKVNAQDLEDTYQPPFKTCIQEARASCLMCAYNQVNG 258
Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC 331
+P CA+++LL KT R +WGF GYITSDCDAV+II++ + Y KS ED++ VLKAGMD+NC
Sbjct: 259 VPMCANKDLLQKT-RDEWGFQGYITSDCDAVAIIHENQTYTKSDEDSIAIVLKAGMDINC 317
Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
GSFL +HTK+AV++ K+ E +IDRAL NLFSV++RLG+F+ Q ++G + VC+
Sbjct: 318 GSFLVRHTKSAVEKGKVQEQDIDRALFNLFSVQLRLGIFDKPNNNQWSTQLGPNNVCTKE 377
Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITP 451
H+ LA +A + G VLLKN H LPL +S+ +A+IGP+AN + G+Y G +C T
Sbjct: 378 HRELAAEAVRQGAVLLKNDHSFLPLKRSEVRHVAIIGPSANDVYAMGGDYTGVACNPTTF 437
Query: 452 LQALQNYVENTVYYPGCDTVACSSASI-DKAVDIAKGADHVVLMMGLDQTQEKEELDRVD 510
L+ +Q Y T + GC V+C+S + +A+ AK AD VV++ GL+ T+E+E+ DRV
Sbjct: 438 LKGIQAYATQTTFAAGCKDVSCNSTELFGEAIAAAKRADIVVVVAGLNLTEEREDFDRVS 497
Query: 511 LVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVAL 570
L+LPG+Q LI VA AKKP++LVLL GGPVD++FAK D I SILW GYPGE G L
Sbjct: 498 LLLPGKQMSLIHAVASVAKKPLVLVLLGGGPVDVSFAKQDPRIASILWLGYPGEVGGQVL 557
Query: 571 AEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCG 630
E++FG++NPGG+L MTWYP+ + +PMTDM MR + G PGRTYRFY G V+ FG G
Sbjct: 558 PEILFGEYNPGGKLAMTWYPESFTAIPMTDMNMRADPSRGYPGRTYRFYTGDVVYGFGYG 617
Query: 631 LSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQ------DVVHYKSVPELGTEFCETRKFL 684
LSYSKYSY + + K+ +++SS ++ + D + + ++ + CE F
Sbjct: 618 LSYSKYSYSILSAPK-KITMSRSSVLDIISRKPSYIRRDGLDFVKTEDIAS--CEALAFS 674
Query: 685 VTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCES 744
V + V NHG M G H VLLF + G PIKQLVGF+ V A + + + PC+
Sbjct: 675 VHVAVSNHGSMDGSHAVLLFARSKSSVPGFPIKQLVGFERVHTAAGSASNVEISVDPCKH 734
Query: 745 LSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
+S A +G V+ G H L VGDEE+ + I
Sbjct: 735 MSAANPEGKRVLLLGDHVLTVGDEEFELFI 764
>gi|224054312|ref|XP_002298197.1| predicted protein [Populus trichocarpa]
gi|222845455|gb|EEE83002.1| predicted protein [Populus trichocarpa]
Length = 741
Score = 765 bits (1975), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 391/759 (51%), Positives = 514/759 (67%), Gaps = 28/759 (3%)
Query: 28 QPPFSCDP-SNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPA 86
P F+CD SNPS +F FC T+L +S R DLV RLTL EKI LVNSA ++ RLGIP
Sbjct: 1 SPVFACDVVSNPSLASFGFCNTSLGVSDRVVDLVKRLTLQEKILFLVNSAGSVSRLGIPK 60
Query: 87 YEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY 146
YEWWSEALHGV+ VG G F+ + GATSFPQVILTAASF++ L+ IG+ + EARA+Y
Sbjct: 61 YEWWSEALHGVSYVGPGTHFSSVVPGATSFPQVILTAASFNTSLFVAIGKVVSTEARAMY 120
Query: 147 NAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKG 206
N G A G+TFW+PNINIFRDPRWGRGQETPGEDPL++ KY YV+G+Q + G G
Sbjct: 121 NVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLSSKYGSGYVKGLQ--QRDDGNPDG 177
Query: 207 KLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAY 266
L+ +ACCKH+TAYDLDNWKG RY F+A VT QD+ DT+QPPF+SCV G + +MC+Y
Sbjct: 178 -LKVAACCKHYTAYDLDNWKGVDRYHFNAVVTKQDMDDTFQPPFKSCVVDGNVASVMCSY 236
Query: 267 NRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAG 326
N+VNGIP+CAD +LLS R +W +GYI +DCD++ + Y+++ Y K+PE+A + AG
Sbjct: 237 NKVNGIPTCADPDLLSGVIRGEWKLNGYIVTDCDSIDVFYNSQHYTKTPEEAAAKAILAG 296
Query: 327 --MDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGA 384
+D+NCGSFL KHT+AAV + ES IDRA+ N F+ MRLG F+G+P+ Q +GK+G
Sbjct: 297 IRLDLNCGSFLGKHTEAAVTAGLVNESAIDRAVSNNFATLMRLGFFDGDPSKQLYGKLGP 356
Query: 385 DVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGP 444
VC+ +Q LA +AA+ GIVLLKN+ G LPL + +LA+IGPNAN KT++GNY G
Sbjct: 357 KDVCTAENQELAREAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGT 416
Query: 445 SCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKE 504
C+ TPLQ L V T Y PGC VACS+A +D A IA AD VL+MG D + E E
Sbjct: 417 PCKYTTPLQGLAALVA-TTYLPGCSNVACSTAQVDDAKKIAAAADATVLVMGADLSIEAE 475
Query: 505 ELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGE 564
DRVD++LPG+QQ LIT VA A+ PVILV++ GG +D++FAK + I SILW GYPGE
Sbjct: 476 SRDRVDILLPGQQQLLITAVANASTGPVILVIMSGGGMDVSFAKTNDKITSILWVGYPGE 535
Query: 565 AGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKE 623
AG A+A++IFG +NP GRLPMTWYPQ Y+ KVPMT+M MRP ++G PGRTYRFY G+
Sbjct: 536 AGGAAIADIIFGSYNPSGRLPMTWYPQSYVDKVPMTNMNMRPDPSNGYPGRTYRFYTGET 595
Query: 624 VFPFGCGLSYSKYSYKFK------AVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEF 677
V+ FG GLSYS++S++ +V + ++ SS K V + +
Sbjct: 596 VYSFGDGLSYSEFSHELTQAPGLVSVPLEENHVCYSSECKSVAAAE------------QT 643
Query: 678 CETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVF 737
C+ F V + +KN G +G H V LF P N P K LVGF+ V L+A+ + + F
Sbjct: 644 CQNLTFDVHLRIKNTGTTSGSHTVFLFSTPPSVHN-SPQKHLVGFEKVFLHAQTDSHVGF 702
Query: 738 ELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
++ C+ LS E G + G H L +G ++ +++ +
Sbjct: 703 KVDVCKDLSVVDELGSKKVALGEHVLHIGSLKHSMTVRI 741
>gi|297843058|ref|XP_002889410.1| hypothetical protein ARALYDRAFT_470222 [Arabidopsis lyrata subsp.
lyrata]
gi|297335252|gb|EFH65669.1| hypothetical protein ARALYDRAFT_470222 [Arabidopsis lyrata subsp.
lyrata]
Length = 763
Score = 764 bits (1973), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 375/754 (49%), Positives = 505/754 (66%), Gaps = 27/754 (3%)
Query: 27 TQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPA 86
++ F+CD + +T T FC+ ++PI++R +DL+ RLTL EK+S L N+A AIPRLGI
Sbjct: 21 SRETFACDIKDAATATLRFCQLSVPITERVKDLIGRLTLVEKVSLLGNTAAAIPRLGIKG 80
Query: 87 YEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY 146
YEWWSEALHGV+ VG G F G ATSFPQVI T ASF++ LW IG+ + EARA+Y
Sbjct: 81 YEWWSEALHGVSNVGPGTKFGGVYPAATSFPQVITTVASFNASLWESIGRVVSNEARAMY 140
Query: 147 NAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKG 206
N G G+T+W+PN+NI RDPRWGRGQETPGEDP+V GKYA SYVRG+QG+ +
Sbjct: 141 NGGVG-GLTYWSPNVNILRDPRWGRGQETPGEDPVVAGKYAASYVRGLQGND------RS 193
Query: 207 KLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAY 266
+L+ +ACCKHFTAYDLDNW G R+ F+A+V+ QD+ DT+ PF CVK+G + IMC+Y
Sbjct: 194 RLKVAACCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVKEGNVASIMCSY 253
Query: 267 NRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAG 326
N VNG+P+CAD NLL KT R +WG +GYI SDCD+V ++YD + Y +PE+A D +KAG
Sbjct: 254 NEVNGVPTCADPNLLKKTIRNEWGLNGYIVSDCDSVGVLYDTQHYTGTPEEAAADSIKAG 313
Query: 327 MDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADV 386
+D++CG FL HT AVK+ L ES++D AL N +V+MRLG+F+G+ QP+G +G
Sbjct: 314 LDLDCGPFLGAHTIDAVKKNLLRESDVDNALINTLTVQMRLGMFDGDIAAQPYGHLGPAH 373
Query: 387 VCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSC 446
VC+P H+ LAL+AAQ GIVLLKN LPL + ++A+IGPN+++ ++GNYAG +C
Sbjct: 374 VCTPVHKGLALEAAQQGIVLLKNHGSSLPLSSQRHRTVAVIGPNSDATVAMIGNYAGIAC 433
Query: 447 RSITPLQALQNYVENTVYYPGCDTVACSSASI-DKAVDIAKGADHVVLMMGLDQTQEKEE 505
+P+Q + Y TV+ GC V C + D AV+ A+GAD VL+MGLDQ+ E E
Sbjct: 434 GYTSPVQGITGYAR-TVHQKGCVDVHCMDDRLFDAAVEAARGADATVLVMGLDQSIEAEF 492
Query: 506 LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEA 565
DR L+LPG+QQELI+RVA+AAK PVILVL+ GGP+DI+FA+ DR I +I+WAGYPG+
Sbjct: 493 KDRNSLLLPGKQQELISRVAKAAKGPVILVLMSGGPIDISFAEKDRKIPAIVWAGYPGQE 552
Query: 566 GAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEV 624
G A+A+++FG NPGG+LPMTWYPQDY+ +PMT+M MRP + PGRTYRFY+G V
Sbjct: 553 GGTAIADILFGSANPGGKLPMTWYPQDYLTNLPMTEMSMRPIHSKRIPGRTYRFYDGPVV 612
Query: 625 FPFGCGLSYSKYSYKF----KAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCET 680
+PFG GLSY+++++ K + N + S K + V H + C
Sbjct: 613 YPFGHGLSYTRFTHSIADAPKVIPIAVRGRNGTVSGKSIR---VTHAR---------CNR 660
Query: 681 RKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELS 740
V + V N G G H +L+F P G P KQLV F+ V + EK + +
Sbjct: 661 LSLGVHVDVTNVGSRDGTHTMLVFSAPP-GGEWAPKKQLVAFERVHVAVGEKKRVQVNIH 719
Query: 741 PCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
C+ LS G I G H + +GDE + +S+
Sbjct: 720 VCKYLSVVDRAGNRRIPIGDHGIHIGDESHTVSL 753
>gi|357444469|ref|XP_003592512.1| Xylosidase [Medicago truncatula]
gi|355481560|gb|AES62763.1| Xylosidase [Medicago truncatula]
Length = 781
Score = 764 bits (1973), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 385/759 (50%), Positives = 507/759 (66%), Gaps = 10/759 (1%)
Query: 12 LLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQ 71
LC+ F+ + + S + +CD +P T FPFC T+L RA+DLVSRLTL EK Q
Sbjct: 7 FLCIFFSFFILPITSQK--HACDKGSPKTSNFPFCNTSLSYETRAKDLVSRLTLQEKAQQ 64
Query: 72 LVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLW 131
LVN + I RLG+PAYEWWSEALHGV+ VG G F+ + GATSFP VIL+AASF+ LW
Sbjct: 65 LVNPSTGISRLGVPAYEWWSEALHGVSNVGPGTRFDSRVPGATSFPAVILSAASFNETLW 124
Query: 132 YRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYV 191
Y +GQ + EARA+YN A G+TFW+PN+N+FRDPRWGRGQETPGEDPLV +YAV+YV
Sbjct: 125 YTMGQVVSNEARAMYNVDLA-GLTFWSPNVNVFRDPRWGRGQETPGEDPLVVSRYAVNYV 183
Query: 192 RGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFE 251
RG+Q +L+ S+CCKH+TAYD+DNWKG R+ FDA+VT QDL DTYQPPF+
Sbjct: 184 RGLQEVGDEASAKGDRLKVSSCCKHYTAYDVDNWKGVDRFHFDAKVTKQDLEDTYQPPFK 243
Query: 252 SCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGY 311
SCV +G S +MC+YNRVNGIP+CAD +LL R QWG GYI SDCD+V + Y++ Y
Sbjct: 244 SCVLEGHVSSVMCSYNRVNGIPTCADPDLLQGVIRGQWGLDGYIVSDCDSVEVYYNSIHY 303
Query: 312 AKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFN 371
K+PEDAV LKAG+++NCG FL+K+T AV KK+ S +D+AL + V MRLG F
Sbjct: 304 TKTPEDAVALALKAGLNMNCGDFLKKYTANAVNLKKVDVSIVDQALVYNYIVLMRLGFFE 363
Query: 372 GNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNA 431
NP PF +G VC+ +Q LAL+AA+ GIVLL+N+ G LPL K+K +LA+IGPNA
Sbjct: 364 -NPKSLPFANLGPSDVCTKENQQLALEAAKQGIVLLENNKGALPLSKTKIKNLAVIGPNA 422
Query: 432 NSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASI-DKAVDIAKGADH 490
N+ ++ NYAG CR +PLQ LQ Y+ + Y GC V CS+ ++ AV A AD
Sbjct: 423 NATTVMISNYAGIPCRYSSPLQGLQKYISSVTYARGCSDVKCSNQNLFAAAVKAAASADA 482
Query: 491 VVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYD 550
VVL++GLDQ+ E E LDRV+L LPG Q++L+ VA A K +ILV++ GP+DI+F K
Sbjct: 483 VVLVVGLDQSIEAEGLDRVNLTLPGFQEKLVKDVAAATKGTLILVIMAAGPIDISFTKSV 542
Query: 551 RNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATS 609
NIG ILW GYPG+ G A+A+VIFGD+NPGGR P TWYPQ Y+ +VPMTDM MR ++
Sbjct: 543 SNIGGILWVGYPGQDGGNAIAQVIFGDYNPGGRSPFTWYPQSYVDQVPMTDMNMRANSSR 602
Query: 610 GNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKS 669
PGRTYRFY GK ++ FG GLSYS +S + + +S +K + N +
Sbjct: 603 NFPGRTYRFYNGKSLYEFGYGLSYSTFSTHIASAPSTIMLQKNTSISKPLNN--IFLDDQ 660
Query: 670 VPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKP--ARRGNGRPIKQLVGFQSVIL 727
V ++ T C F + IGVKN+G G H VL+F++P + +G P+KQL+GF+ +
Sbjct: 661 VIDISTISCFNLTFSLVIGVKNNGPFDGSHVVLVFLEPPSSEAVSGVPLKQLIGFERAQV 720
Query: 728 NAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
+ + ++ C+ LS DG + G H ++VG
Sbjct: 721 KVGKTEFVTVKIDICKMLSNVDSDGKRKLVIGQHNILVG 759
>gi|302796585|ref|XP_002980054.1| hypothetical protein SELMODRAFT_419541 [Selaginella moellendorffii]
gi|300152281|gb|EFJ18924.1| hypothetical protein SELMODRAFT_419541 [Selaginella moellendorffii]
Length = 779
Score = 764 bits (1972), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 381/773 (49%), Positives = 499/773 (64%), Gaps = 38/773 (4%)
Query: 23 RVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRL 82
+V + Q P++CD N + F FC T LP S R DL+SR+TL EKI QLVN+A IPRL
Sbjct: 24 QVVAGQAPYACDQRNATLLQFGFCNTRLPTSTRVEDLISRMTLQEKIIQLVNNAAGIPRL 83
Query: 83 GIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEA 142
G+P YEWW EALHGVA V G+ F G GATSFP ILTAASFD A+ EA
Sbjct: 84 GLPRYEWWQEALHGVA-VSPGVKFGGKFPGATSFPMPILTAASFD---------AVSTEA 133
Query: 143 RALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGG 202
RA++N Q G+T+W+PN+NI+RDPRWGRGQETPGEDPL++ KYA YVRG+Q G
Sbjct: 134 RAMHNY-QRAGLTYWSPNVNIYRDPRWGRGQETPGEDPLLSSKYATFYVRGLQDTNLGGD 192
Query: 203 KLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGI 262
KLK SACCKH TAYD+DNWKGTTR+KF+A VT QDL+DTY PPF+SCV+ + S +
Sbjct: 193 KLK----VSACCKHMTAYDVDNWKGTTRFKFNAIVTQQDLSDTYNPPFQSCVEDAKVSSV 248
Query: 263 MCAYNRVNGIPSCADRNLLSKTARRQWGFHG----------------YITSDCDAVSIIY 306
MC+YNRVNG+P+CAD NLLS T R W +G YI SDCD++ +
Sbjct: 249 MCSYNRVNGVPTCADYNLLSATVRSSWNLNGSILLTCEVLLLYLPCSYIVSDCDSLQTFF 308
Query: 307 DAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMR 366
D YAK+ ED V D L AG++++CG FL HT++A+ K+ E+ +++AL L++V+MR
Sbjct: 309 DNTNYAKTAEDVVADALLAGLNLDCGPFLAIHTQSAITNGKITEANVNQALRYLYNVQMR 368
Query: 367 LGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLAL 426
LGL++GNP QP+G +G VC+ +Q LAL AA++GIVLLKN+ +LP KS ++A
Sbjct: 369 LGLYDGNPRSQPYGNLGPQSVCTGENQQLALDAAKEGIVLLKNNGNVLPFSKSNIRTVAA 428
Query: 427 IGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSAS-IDKAVDIA 485
IGP+A + + ++GNY G C+ TP L Y VY GC VAC S S I A A
Sbjct: 429 IGPHAKATRAMIGNYQGIPCKYTTPHDGLSAYAR-VVYSAGCSDVACYSNSLIGSAASTA 487
Query: 486 KGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
AD VVL +GLD QE E DR L+LPG+QQEL+T V +AAK PV+LV+ GG VD++
Sbjct: 488 SQADAVVLFVGLDLNQEAEGKDRTSLLLPGKQQELVTEVTKAAKGPVVLVIFSGGSVDVS 547
Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRP 605
FAKYD+ + +LWAGYPGEAG A+A+V+FGDHNPGGRLP+TWYP+ + + M DM MRP
Sbjct: 548 FAKYDKKVQGMLWAGYPGEAGGAAIAQVLFGDHNPGGRLPVTWYPESFTGITMLDMNMRP 607
Query: 606 QATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVE-NQDV 664
A+ G PGRTYRFY G+ V+ FG G +YSK S+KFK + + ++ + + N
Sbjct: 608 DASRGYPGRTYRFYTGQSVYNFGYGKTYSKLSHKFKEAPLSLGFPEAAAVKRSCDGNLTC 667
Query: 665 VHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRG-NGRPIKQLVGFQ 723
H + E+ C T V I V N G+ VLL+ P G +G PI+QL GF
Sbjct: 668 FHLNAHDEI---TCSTLTSKVRILVHNEGDRPSNRAVLLYSSPPNAGRDGAPIRQLAGFG 724
Query: 724 SVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
V + + E+ PC+ LS A +G+ ++ G H L VG+ +P+ I +
Sbjct: 725 KVSVAPGAVENVEIEIDPCKHLSHAGANGVRILHGGIHTLAVGNARHPLPILL 777
>gi|357130854|ref|XP_003567059.1| PREDICTED: probable beta-D-xylosidase 2-like [Brachypodium
distachyon]
Length = 779
Score = 764 bits (1972), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/752 (50%), Positives = 506/752 (67%), Gaps = 19/752 (2%)
Query: 27 TQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPA 86
T+PPF+C P PST PFC+ LP RARDLV+RLT EK+ LVN+A +PRLG+
Sbjct: 23 TRPPFACAPGGPSTR-LPFCRQALPPRARARDLVARLTRAEKVRLLVNNAAGVPRLGVEG 81
Query: 87 YEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY 146
YEWWSEALHGV+ G G+ F G GAT+FPQVI TAASF++ LW IG+A+ E RA+Y
Sbjct: 82 YEWWSEALHGVSDTGPGVRFGGAFPGATAFPQVIGTAASFNASLWELIGRAVSDEGRAIY 141
Query: 147 NAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKG 206
N QA G+TFW+PN+NIFRDPRWGRGQETPGEDP V+G+YA +YVRG+Q + G
Sbjct: 142 NGRQA-GLTFWSPNVNIFRDPRWGRGQETPGEDPAVSGRYAAAYVRGLQQ------QHAG 194
Query: 207 KLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAY 266
+L+ +ACCKHFTAYDLD W G R+ F+A VT QDL DT+ PF +CV +GRA+ +MC+Y
Sbjct: 195 RLKTAACCKHFTAYDLDRWSGADRFHFNAIVTPQDLEDTFNAPFRACVVEGRAAAVMCSY 254
Query: 267 NRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAG 326
N+VNG+P+CAD+ L T R +W GYI SDCD+V + Y + Y ++ EDAV L+AG
Sbjct: 255 NQVNGVPTCADQGFLRGTIRGKWKLDGYIVSDCDSVDVFYREQHYTRTREDAVAATLRAG 314
Query: 327 MDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADV 386
+D++CG FL ++T+AAV Q K+ E++ID A+ N +V+MRLG+F+G+ QPFG +G
Sbjct: 315 LDLDCGPFLAQYTEAAVAQGKVKEADIDAAVVNTVTVQMRLGMFDGDVAAQPFGHLGPQH 374
Query: 387 VCSPAHQVLALQAAQDGIVLLKNSHG---LLPLPK-SKSVSLALIGPNANSAKTLLGNYA 442
VC+PAH+ LAL+AA IVLLKN G LPL + ++A++GP++ + ++GNYA
Sbjct: 375 VCTPAHRELALEAACQSIVLLKNGGGNNMRLPLSSHHRRGTVAVVGPHSEATVAMIGNYA 434
Query: 443 GPSCRSITPLQALQNYVENTVYYPGCDTVAC--SSASIDKAVDIAKGADHVVLMMGLDQT 500
G C TPLQ + Y TV+ GC VAC S ID AVD A+ AD V+++GLDQ+
Sbjct: 435 GKPCAYTTPLQGVGRYARATVHQAGCTDVACQGSGQPIDAAVDAARHADATVVVVGLDQS 494
Query: 501 QEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAG 560
E E LDR L+LPGRQ EL++ VA A+K PVILVL+ GGPVDI FA+ DRN+ +ILWAG
Sbjct: 495 VEAEGLDRTTLLLPGRQAELVSAVARASKGPVILVLMSGGPVDIAFAQNDRNVAAILWAG 554
Query: 561 YPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFY 619
YPG+AG A+A+VIFG HNPGG+LP+TWYP+DY+ K PMT+M MR G PGRTYRFY
Sbjct: 555 YPGQAGGQAIADVIFGHHNPGGKLPVTWYPEDYLRKAPMTNMAMRADPARGYPGRTYRFY 614
Query: 620 EGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEF-- 677
G + PFG GLSY+K+++ A + L + +++ + + + ++
Sbjct: 615 AGPTIHPFGHGLSYTKFAHTL-AHAPAHLTVRRAAGHRTTAAINTTTASHLNDVRVAHAQ 673
Query: 678 CETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRG-NGRPIKQLVGFQSVILNAKEKAEIV 736
CE V + VKN G G H V ++ P +G P++QLV F+ V + A A +
Sbjct: 674 CEGLSVSVHVDVKNVGSRDGAHTVFVYASPPIAAIHGAPVRQLVAFEKVHVAAGAVARVK 733
Query: 737 FELSPCESLSRAREDGLMVIEEGTHFLVVGDE 768
+ C SLS A ++G+ I G H L++G+E
Sbjct: 734 MGVDVCGSLSIADQEGVRRIPIGEHRLMIGEE 765
>gi|302786474|ref|XP_002975008.1| hypothetical protein SELMODRAFT_103038 [Selaginella moellendorffii]
gi|300157167|gb|EFJ23793.1| hypothetical protein SELMODRAFT_103038 [Selaginella moellendorffii]
Length = 772
Score = 763 bits (1970), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 370/750 (49%), Positives = 509/750 (67%), Gaps = 18/750 (2%)
Query: 36 SNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALH 95
++ + +FPFC +LP+ R DLV R+ L EKI+Q+V++A IPRLGIP Y+WW EALH
Sbjct: 26 TDSRSSSFPFCDVSLPVPDRVADLVGRMNLSEKIAQIVSNASGIPRLGIPGYQWWEEALH 85
Query: 96 GVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT 155
GVA G+ F + ATSFPQVILT ASF+S LW +I QAI +EA A+YNAG++ G+T
Sbjct: 86 GVAE-SPGVKFAAPVPSATSFPQVILTVASFNSSLWNKIAQAISIEAIAMYNAGRS-GLT 143
Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGG-------KLKGKL 208
FW+PNINIFRDPRWGRGQETPGEDPL++ KYA +VRG+Q ++ G + +L
Sbjct: 144 FWSPNINIFRDPRWGRGQETPGEDPLLSSKYAAYFVRGLQEGDYDEGTAISTMQRRPTRL 203
Query: 209 QASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
+ S+CCKHFTAYD++ +GT + F+A+VT+QDL DT+ PPF SC+ G+ASG+MC+YNR
Sbjct: 204 KVSSCCKHFTAYDMEKSEGTDCFHFNAQVTVQDLQDTFDPPFRSCIVDGQASGLMCSYNR 263
Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
VNG+PSCAD L++T R WGF GYI SDCDAV+++Y+ Y + EDAV DVL AGMD
Sbjct: 264 VNGVPSCADYTFLTETVRNSWGFEGYIVSDCDAVALLYEYINYTTTAEDAVADVLSAGMD 323
Query: 329 VNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVC 388
+NCG+FL +HT AA++Q K+ E+ +DRAL N+ +VRMRLGLF+GN + + + IG D VC
Sbjct: 324 LNCGTFLLRHTAAAIEQGKVTEAAVDRALSNVMTVRMRLGLFDGN-SGETYNSIGPDAVC 382
Query: 389 SPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS 448
+ H+ L+L+AA+ GIVLLKNS +LP P++ +++A+IGP+ N+ +T+LGNYAG C+
Sbjct: 383 TREHRQLSLEAAEQGIVLLKNSGNVLPFPRNDLMTIAVIGPSGNATETMLGNYAGVPCQY 442
Query: 449 ITPLQALQNYVENTVYYPGCDTVACSSASID-KAVDIAKGADHVVLMMGLDQTQEKEELD 507
ITP Q LQ Y + V+ PGC + C+ ++ AV A+ +D VV+++GLD+ QE+E LD
Sbjct: 443 ITPFQGLQEYTKGVVFEPGCKDIMCNDTTLFLAAVRAAENSDAVVIVVGLDKDQEREGLD 502
Query: 508 RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGA 567
R L+LPG QQ+L+ V++ AK PVILV++ GGP+D+TFAK + I S+LW GYPGEAG
Sbjct: 503 RTSLLLPGYQQDLVLEVSKVAKGPVILVVMSGGPIDVTFAKGNCKISSVLWVGYPGEAGG 562
Query: 568 VALAEVIFGDHNPGGRLPMTWYPQDYIK-VPMTDMKMRPQATSGNPGRTYRFYEGKEVFP 626
A+A VIFGDHNP GRLPMTWYPQ + + V + +M +RP ++G PGRTYRFY G+ V+
Sbjct: 563 KAIARVIFGDHNPAGRLPMTWYPQAFAEHVSILNMHLRPNTSTGFPGRTYRFYTGENVYE 622
Query: 627 FGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVT 686
FG GLSY+ ++Y + N N + + H+ + CE F V
Sbjct: 623 FGHGLSYTNFTYTNFSAPSNITARNTVAIRTPLREDGARHFP----IDYTGCEALAFKVV 678
Query: 687 IGVKNHGEMAGKHPVLLFVKP--ARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCES 744
+ N G H LL+ P A P KQL+ F+ L A A++ F++ C+
Sbjct: 679 AYISNTGTRDSDHISLLYAIPPAASSSLSPPRKQLISFKRQHLIAGRCAKVEFDVDTCKD 738
Query: 745 LSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
L E G V+ G + L +GD E+ IS+
Sbjct: 739 LGLTNEAGTKVLVHGDYKLSLGDIEHVISL 768
>gi|15242492|ref|NP_196535.1| beta-xylosidase 3 [Arabidopsis thaliana]
gi|75264323|sp|Q9LXD6.1|BXL3_ARATH RecName: Full=Beta-D-xylosidase 3; Short=AtBXL3; AltName:
Full=Alpha-L-arabinofuranosidase; Flags: Precursor
gi|7671416|emb|CAB89357.1| beta-xylosidase-like protein [Arabidopsis thaliana]
gi|9759004|dbj|BAB09531.1| beta-xylosidase [Arabidopsis thaliana]
gi|15450735|gb|AAK96639.1| AT5g09730/F17I14_80 [Arabidopsis thaliana]
gi|332004056|gb|AED91439.1| beta-xylosidase 3 [Arabidopsis thaliana]
Length = 773
Score = 763 bits (1970), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 392/773 (50%), Positives = 518/773 (67%), Gaps = 22/773 (2%)
Query: 13 LCLCFTSLLTRVDSTQ--PPFSCDPS-NPSTETFPFCKTTLPISQRARDLVSRLTLDEKI 69
L LCF ++ + Q P F+CD + NPS FC L I R DLV RLTL+EKI
Sbjct: 14 LFLCFIVCISEQSNNQSSPVFACDVTGNPSLAGLRFCNAGLSIKARVTDLVGRLTLEEKI 73
Query: 70 SQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSY 129
L + A + RLGIP+Y+WWSEALHGV+ VG G F G + GATSFPQVILTAASF+
Sbjct: 74 GFLTSKAIGVSRLGIPSYKWWSEALHGVSNVGGGSRFTGQVPGATSFPQVILTAASFNVS 133
Query: 130 LWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVS 189
L+ IG+ + EARA+YN G A G+TFW+PN+NIFRDPRWGRGQETPGEDP ++ KYAV+
Sbjct: 134 LFQAIGKVVSTEARAMYNVGSA-GLTFWSPNVNIFRDPRWGRGQETPGEDPTLSSKYAVA 192
Query: 190 YVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPP 249
YV+G+Q +GG +L+ +ACCKH+TAYD+DNW+ R F+A V QDLADT+QPP
Sbjct: 193 YVKGLQ--ETDGGD-PNRLKVAACCKHYTAYDIDNWRNVNRLTFNAVVNQQDLADTFQPP 249
Query: 250 FESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAE 309
F+SCV G + +MC+YN+VNG P+CAD +LLS R QW +GYI SDCD+V +++ +
Sbjct: 250 FKSCVVDGHVASVMCSYNQVNGKPTCADPDLLSGVIRGQWQLNGYIVSDCDSVDVLFRKQ 309
Query: 310 GYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGL 369
YAK+PE+AV L AG+D+NC F +H AVK + E+ ID+A+ N F+ MRLG
Sbjct: 310 HYAKTPEEAVAKSLLAGLDLNCDHFNGQHAMGAVKAGLVNETAIDKAISNNFATLMRLGF 369
Query: 370 FNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGP 429
F+G+P Q +G +G VC+ +Q LA A+ GIVLLKNS G LPL S +LA+IGP
Sbjct: 370 FDGDPKKQLYGGLGPKDVCTADNQELARDGARQGIVLLKNSAGSLPLSPSAIKTLAVIGP 429
Query: 430 NANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGAD 489
NAN+ +T++GNY G C+ TPLQ L V +T Y GC+ VAC A I AVD+A AD
Sbjct: 430 NANATETMIGNYHGVPCKYTTPLQGLAETVSST-YQLGCN-VACVDADIGSAVDLAASAD 487
Query: 490 HVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKY 549
VVL++G DQ+ E+E DRVDL LPG+QQEL+TRVA AA+ PV+LV++ GG DITFAK
Sbjct: 488 AVVLVVGADQSIEREGHDRVDLYLPGKQQELVTRVAMAARGPVVLVIMSGGGFDITFAKN 547
Query: 550 DRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQAT 608
D+ I SI+W GYPGEAG +A+A+VIFG HNP G LPMTWYPQ Y+ KVPM++M MRP +
Sbjct: 548 DKKITSIMWVGYPGEAGGLAIADVIFGRHNPSGNLPMTWYPQSYVEKVPMSNMNMRPDKS 607
Query: 609 SGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKF-KAVSQNKLYLNQSSSTKMVENQ--DVV 665
G PGR+YRFY G+ V+ F L+Y+K+ ++ KA L L+++ + E Q D +
Sbjct: 608 KGYPGRSYRFYTGETVYAFADALTYTKFDHQLIKAPRLVSLSLDENHPCRSSECQSLDAI 667
Query: 666 --HYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQ 723
H ++ E G++ F V + VKN G+ AG H V LF + +G PIKQL+GF+
Sbjct: 668 GPHCENAVEGGSD------FEVHLNVKNTGDRAGSHTVFLFTTSPQV-HGSPIKQLLGFE 720
Query: 724 SVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
+ L E+A + F ++ C+ LS E G I G H L VG ++ ++I V
Sbjct: 721 KIRLGKSEEAVVRFNVNVCKDLSVVDETGKRKIALGHHLLHVGSLKHSLNISV 773
>gi|356501877|ref|XP_003519750.1| PREDICTED: probable beta-D-xylosidase 2-like [Glycine max]
Length = 772
Score = 762 bits (1968), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 372/755 (49%), Positives = 507/755 (67%), Gaps = 29/755 (3%)
Query: 30 PFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEW 89
PF+CDP N +T+ PFCK +L R +DL+ RLTL EK++ LVN+A A+PRLGI YEW
Sbjct: 27 PFACDPKNTATKNLPFCKASLATGARVKDLIGRLTLQEKVNLLVNNAAAVPRLGIKGYEW 86
Query: 90 WSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG 149
WSEALHGV+ VG G F G ATSFPQVI TAASF++ LW IG+ EARA+YN G
Sbjct: 87 WSEALHGVSNVGPGTKFGGQFPAATSFPQVITTAASFNASLWEAIGRVASDEARAMYNGG 146
Query: 150 QAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQ 209
A G+T+W+PN+NIFRDPRWGRGQETPGEDP++ GKYA SYVRG+QG N +L+
Sbjct: 147 TA-GLTYWSPNVNIFRDPRWGRGQETPGEDPILAGKYAASYVRGLQGTDGN------RLK 199
Query: 210 ASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRV 269
+A CKHFTAYDLDNW G R+ F+A+V+ QD+ DT+ PF CVK+G+ + +MC+YN+V
Sbjct: 200 VAASCKHFTAYDLDNWNGVDRFHFNAQVSKQDIEDTFNVPFRMCVKEGKVASVMCSYNQV 259
Query: 270 NGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDV 329
NG+P+CAD LL +T R QWG +GYI SDCD+V + Y+++ Y +PE+A D +KAG+D+
Sbjct: 260 NGVPTCADPILLKRTVRGQWGLNGYIVSDCDSVGVFYNSQHYTSTPEEAAADAIKAGLDL 319
Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
+CG FL +HT+ AVK+ + E++++ AL N +V+MRLG+++G P+ P+ +G VC+
Sbjct: 320 DCGPFLGQHTQNAVKKGLISEADVNGALLNTLTVQMRLGMYDGEPSSHPYNNLGPRDVCT 379
Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
+HQ LAL+AA+ GIVLLKN LPL + ++A+IGPN+N T++GNYAG +C
Sbjct: 380 QSHQELALEAARQGIVLLKNKGPSLPLSTRRGRTVAVIGPNSNVTFTMIGNYAGIACGYT 439
Query: 450 TPLQALQNYVENTVYYPGCDTVACS-SASIDKAVDIAKGADHVVLMMGLDQTQEKEELDR 508
+PLQ + Y + T+Y GC VAC+ +A++ A+ AD VL+MGLDQ+ E E +DR
Sbjct: 440 SPLQGIGTYTK-TIYEHGCANVACTDDKQFGRAINAAQQADATVLVMGLDQSIEAETVDR 498
Query: 509 VDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAV 568
L+LPG QQ+L+++VA A+K P ILV++ GGPVDITFAK D I ILWAGYPG+AG
Sbjct: 499 ASLLLPGHQQDLVSKVAAASKGPTILVIMSGGPVDITFAKNDPRIQGILWAGYPGQAGGA 558
Query: 569 ALAEVIFGDHNPGGRLPMTWYPQDYIK-VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPF 627
A+A+++FG NPGG+LPMTWYPQ YIK +PMT+M MR + G PGRTYRFY G V+PF
Sbjct: 559 AIADILFGTSNPGGKLPMTWYPQGYIKNLPMTNMAMRASRSKGYPGRTYRFYNGPVVYPF 618
Query: 628 GCGLSYSKYSYKFKAVSQNKL-------YLNQSSSTKMVENQDVVHYKSVPELGTEFCET 680
G GLSY+ + + S KL + + +SS + V H + C
Sbjct: 619 GYGLSYTHFVHTL--TSAPKLVSIPVDGHRHGNSSNIANKAIKVTHAR---------CGK 667
Query: 681 RKFLVTIGVKNHGEMAGKHPVLLF-VKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFEL 739
+ + VKN G G H +L+F PA G+ P KQLV F+ V + AK + + ++
Sbjct: 668 LSINLHVDVKNVGSKDGIHTLLVFSAPPAGNGHWAPHKQLVAFEKVHIPAKAQQRVRVKI 727
Query: 740 SPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
C+ LS G I G H L +GD ++ +S+
Sbjct: 728 HVCKLLSVVDRSGTRRIPMGLHSLHIGDVKHSVSL 762
>gi|356534827|ref|XP_003535953.1| PREDICTED: probable beta-D-xylosidase 2-like [Glycine max]
Length = 771
Score = 762 bits (1967), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 370/755 (49%), Positives = 512/755 (67%), Gaps = 29/755 (3%)
Query: 30 PFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEW 89
PF+CDP N +T+ PFCK L R +DL+ RLTL EK++ LVN+A A+PRLGI YEW
Sbjct: 26 PFACDPKNTATKNLPFCKAWLATGARVKDLIGRLTLQEKVNLLVNNAAAVPRLGIKGYEW 85
Query: 90 WSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG 149
WSEALHGV+ VG G F G ATSFPQVI TAASF++ LW IG+ EARA+YN G
Sbjct: 86 WSEALHGVSNVGPGTKFGGQFPAATSFPQVITTAASFNASLWEAIGRVASDEARAMYNGG 145
Query: 150 QAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQ 209
A G+T+W+PN+NIFRDPRWGRGQETPGEDP++ GKYA SYVRG+Q N +L+
Sbjct: 146 TA-GLTYWSPNVNIFRDPRWGRGQETPGEDPILAGKYAASYVRGLQETDGN------RLK 198
Query: 210 ASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRV 269
+A CKHFTAYDLDNW G R+ F+A+V+ QD+ DT+ PF CVK+G+ + +MC+YN+V
Sbjct: 199 VAASCKHFTAYDLDNWNGVDRFHFNAQVSKQDIEDTFNVPFRMCVKEGKVASVMCSYNQV 258
Query: 270 NGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDV 329
NG+P+CAD LL +T R QWG +GYI SDCD+V + Y+++ Y +PE+A D +KAG+D+
Sbjct: 259 NGVPTCADPILLKRTVRGQWGLNGYIVSDCDSVGVFYNSQHYTSTPEEAAADAIKAGLDL 318
Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
+CG FL +HT+ AVK+ + E++++ AL N +V+MRLG+++G P+ P+GK+G VC+
Sbjct: 319 DCGPFLGQHTQNAVKKGLISETDVNGALLNTLTVQMRLGMYDGEPSSHPYGKLGPRDVCT 378
Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
P+HQ LAL+AA+ GIVLLKN LPL + ++A+IGPN+N T++GNYAG +C
Sbjct: 379 PSHQELALEAARQGIVLLKNKGPSLPLSTRRHPTVAVIGPNSNVTVTMIGNYAGIACGYT 438
Query: 450 TPLQALQNYVENTVYYPGCDTVACSS-ASIDKAVDIAKGADHVVLMMGLDQTQEKEELDR 508
+PL+ + Y + T++ GC VAC++ +A+++A+ AD VL+MGLDQ+ E E +DR
Sbjct: 439 SPLEGIGRYTK-TIHELGCANVACTNDKQFGRAINVAQQADATVLVMGLDQSIEAETVDR 497
Query: 509 VDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAV 568
L+LPGRQQ+L+++VA A+K P ILV++ GGPVDITFAK + I +ILWAGYPG+AG
Sbjct: 498 AGLLLPGRQQDLVSKVAAASKGPTILVIMSGGPVDITFAKNNPRIQAILWAGYPGQAGGA 557
Query: 569 ALAEVIFGDHNPGGRLPMTWYPQDYIK-VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPF 627
A+A+++FG NPGG+LPMTWYPQ YIK +PMT+M MR + G PGRTYRFY G V+PF
Sbjct: 558 AIADILFGTSNPGGKLPMTWYPQGYIKNLPMTNMAMRASRSKGYPGRTYRFYNGPVVYPF 617
Query: 628 GCGLSYSKYSYKFKAVSQNKL-------YLNQSSSTKMVENQDVVHYKSVPELGTEFCET 680
G GLSY+ + + S KL + + +SS+ + V H + C
Sbjct: 618 GYGLSYTHFVHTL--ASAPKLVSIPVDGHRHGNSSSIANKAIKVTHAR---------CGK 666
Query: 681 RKFLVTIGVKNHGEMAGKHPVLLF-VKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFEL 739
+ + VKN G G H +L+F PA G+ P KQLV FQ + + +K + + +
Sbjct: 667 LSISLQVDVKNVGSKDGTHTLLVFSAPPAGNGHWAPHKQLVAFQKLHIPSKAQQRVNVNI 726
Query: 740 SPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
C+ LS G + G H L +GD ++ +S+
Sbjct: 727 HVCKLLSVVDRSGTRRVPMGLHSLHIGDVKHYVSL 761
>gi|302811516|ref|XP_002987447.1| hypothetical protein SELMODRAFT_426207 [Selaginella moellendorffii]
gi|300144853|gb|EFJ11534.1| hypothetical protein SELMODRAFT_426207 [Selaginella moellendorffii]
Length = 779
Score = 762 bits (1967), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 381/773 (49%), Positives = 499/773 (64%), Gaps = 38/773 (4%)
Query: 23 RVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRL 82
+V + Q P++CD N + F FC T LP S R DL+SR+TL EKI QLVN+A IPRL
Sbjct: 24 QVVAGQAPYACDQRNATLLQFGFCNTRLPTSTRVEDLISRMTLQEKIIQLVNNAAGIPRL 83
Query: 83 GIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEA 142
G+P YEWW EALHGVA V G+ F G GATSFP ILTAASFD A+ EA
Sbjct: 84 GLPRYEWWQEALHGVA-VSPGVKFGGKFPGATSFPMPILTAASFD---------AVSTEA 133
Query: 143 RALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGG 202
RA++N +A G+T+W+PN+NI+RDPRWGRGQETPGEDPL++ KYA YVRG+Q G
Sbjct: 134 RAMHNYQRA-GLTYWSPNVNIYRDPRWGRGQETPGEDPLLSSKYATFYVRGLQDTNLGGD 192
Query: 203 KLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGI 262
KLK SACCKH TAYD+DNWKGTTR+KF+A VT QDL+DTY PPF+SCV+ + S +
Sbjct: 193 KLK----VSACCKHMTAYDVDNWKGTTRFKFNAIVTQQDLSDTYNPPFQSCVEDAKVSSV 248
Query: 263 MCAYNRVNGIPSCADRNLLSKTARRQWGFHG----------------YITSDCDAVSIIY 306
MC+YNRVNG+P+CAD NLLS T R W +G YI SDCD++ +
Sbjct: 249 MCSYNRVNGVPTCADYNLLSATVRSSWNLNGSILLTCEVLLLYLPCSYIVSDCDSLQTFF 308
Query: 307 DAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMR 366
D YAK+ ED V D L AG++++CG FL HT++A+ K+ E+ +++AL L++V+MR
Sbjct: 309 DNTNYAKTAEDVVADALLAGLNLDCGPFLAIHTQSAITNGKITEANVNQALRYLYNVQMR 368
Query: 367 LGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLAL 426
LGL++GNP QP+G +G VC+ +Q LAL AA++GIVLLKN+ +LP KS ++A
Sbjct: 369 LGLYDGNPRSQPYGNLGPQSVCTGENQQLALDAAKEGIVLLKNNGNVLPFSKSNIRTVAA 428
Query: 427 IGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSAS-IDKAVDIA 485
IGP+A + + ++GNY G C+ TP L Y VY GC VAC S S I AV A
Sbjct: 429 IGPHAKATRAMIGNYQGIPCKYTTPHDGLSAYAR-VVYSAGCSDVACYSDSLIGSAVSTA 487
Query: 486 KGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
AD VVL +GLD QE E DR L+LPG+QQEL+T V +AAK P +LV+ GG VD++
Sbjct: 488 SQADAVVLFVGLDLNQEAEGKDRTSLLLPGKQQELVTEVTKAAKGPAVLVIFSGGSVDVS 547
Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRP 605
FAKY+ + ILWAGYPGEAG A+A+V+FGDHNPGGRLP+TWYP+ + + M DM MRP
Sbjct: 548 FAKYNNKVQGILWAGYPGEAGGAAIAQVLFGDHNPGGRLPVTWYPESFTGITMLDMNMRP 607
Query: 606 QATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVE-NQDV 664
A+ G PGRTYRFY G+ V+ FG G +YSK S+KFK + + ++ + + N
Sbjct: 608 DASRGYPGRTYRFYTGQSVYNFGYGKTYSKLSHKFKEAPLSLGFPEAAAVKRSCDGNLTC 667
Query: 665 VHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRG-NGRPIKQLVGFQ 723
H + E+ C T V I V N G+ VLL+ P G +G PI+QL GF
Sbjct: 668 FHLNAHDEI---TCSTLTSKVRILVHNKGDRPSNRAVLLYSSPPNAGRDGAPIRQLAGFG 724
Query: 724 SVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
V + + E+ PC+ LS A +G+ ++ G H L VG+ +P+ I +
Sbjct: 725 KVSVAPGAVENVEIEIDPCKHLSHAGANGVRILHGGIHTLAVGNARHPLPILL 777
>gi|302791321|ref|XP_002977427.1| hypothetical protein SELMODRAFT_106899 [Selaginella moellendorffii]
gi|300154797|gb|EFJ21431.1| hypothetical protein SELMODRAFT_106899 [Selaginella moellendorffii]
Length = 772
Score = 761 bits (1966), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 372/750 (49%), Positives = 512/750 (68%), Gaps = 18/750 (2%)
Query: 36 SNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALH 95
++ + +FPFC +LP+ R DLV R+ L EKI+Q+V++A IPRLGIP Y+WW EALH
Sbjct: 26 TDSRSSSFPFCDVSLPVPDRVADLVGRMNLSEKIAQIVSNASGIPRLGIPGYQWWEEALH 85
Query: 96 GVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT 155
GVA G+ F + ATSFPQVILT ASF+S LW +I QAI +EA A+YNAG++ G+T
Sbjct: 86 GVAE-SPGVKFAAPVPSATSFPQVILTVASFNSSLWNKIAQAISIEAIAMYNAGRS-GLT 143
Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGK----LKG---KL 208
FW+PNINIFRDPRWGRGQETPGEDPL++ KYA +VRG+Q ++ G ++G +L
Sbjct: 144 FWSPNINIFRDPRWGRGQETPGEDPLLSSKYAAYFVRGLQEGDYDEGTAISTMQGSPTRL 203
Query: 209 QASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
+ S+CCKHFTAYD++ +GT + F+A+VT+QDL DT+ PPF SC+ G+ASG+MC+YNR
Sbjct: 204 KVSSCCKHFTAYDMEKSEGTDCFHFNAQVTVQDLQDTFDPPFRSCIVDGQASGLMCSYNR 263
Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
VNG+PSCAD L++T R WGF GYI SDCDAV+++Y+ Y + EDAV DVL AGMD
Sbjct: 264 VNGVPSCADYTFLTETVRNSWGFEGYIVSDCDAVALLYEYINYTTTAEDAVADVLSAGMD 323
Query: 329 VNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVC 388
+NCG+FL +HT AA++Q K+ E+ +DRAL N+ +VRMRLGLF+GN + + + IG D VC
Sbjct: 324 LNCGTFLLRHTAAAIEQGKVTEAAVDRALSNVMTVRMRLGLFDGN-SGETYNSIGPDAVC 382
Query: 389 SPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS 448
+P H+ L+L+AA+ GIVLLKNS +LP P++ +++A+IGP+ N+ +T+LGNYAG C+
Sbjct: 383 TPEHRQLSLEAAEQGIVLLKNSGNVLPFPRNDLMTIAVIGPSGNATETMLGNYAGVPCQY 442
Query: 449 ITPLQALQNYVENTVYYPGCDTVACSSASID-KAVDIAKGADHVVLMMGLDQTQEKEELD 507
ITP Q LQ Y + V+ PGC + C+ ++ AV A+ +D VV+++GLD+ QE+E LD
Sbjct: 443 ITPFQGLQEYTKCVVFEPGCKDIMCNDTTLFLAAVRAAENSDAVVIVVGLDKDQEREGLD 502
Query: 508 RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGA 567
R L+LPG QQ L+ V++ AK PVILV++ GGP+D+TFAK + I ++LW GYPGEAG
Sbjct: 503 RTSLLLPGNQQGLVLEVSKVAKGPVILVVMSGGPIDVTFAKENCKISNVLWVGYPGEAGG 562
Query: 568 VALAEVIFGDHNPGGRLPMTWYPQDYIK-VPMTDMKMRPQATSGNPGRTYRFYEGKEVFP 626
A+A VIFGDHNP GRLPMTWYPQ + + V + +M +RP ++G PGRTYRFY G+ V+
Sbjct: 563 KAIARVIFGDHNPAGRLPMTWYPQAFAEHVSILNMHLRPNTSTGFPGRTYRFYTGENVYE 622
Query: 627 FGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVT 686
FG GLSY+ ++Y N N + + + +D + G CE F V
Sbjct: 623 FGHGLSYTNFTYTNFCAPSNITARN-TVAIRTPLREDGARQFPIDYTG---CEALAFKVV 678
Query: 687 IGVKNHGEMAGKHPVLLFVKP--ARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCES 744
+ N G H LL+ P A P KQL+ F+ L A A++ F++ C+
Sbjct: 679 AYISNTGTRDSDHISLLYAIPPAASSSLSPPRKQLISFKRQHLIAGRCAKVEFDVDTCKD 738
Query: 745 LSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
L E G V+ G + L +GD E+ IS+
Sbjct: 739 LGLTNEAGTKVLVHGDYKLSLGDIEHVISL 768
>gi|371917282|dbj|BAL44717.1| SlArf/Xyl2 [Solanum lycopersicum]
Length = 774
Score = 760 bits (1962), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 380/762 (49%), Positives = 504/762 (66%), Gaps = 14/762 (1%)
Query: 15 LCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVN 74
F L + + +PPF+CD N + FPFC+T LPI R RDL+ RLTL EK+ L N
Sbjct: 15 FIFIFLFVSIQAARPPFACDQKNRAFRNFPFCQTNLPIGDRVRDLIGRLTLQEKVKLLGN 74
Query: 75 SAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRI 134
+A A+PRLGI YEWWSEALHGV+ VG G F G GATSFPQVI TAASF++ LW I
Sbjct: 75 NAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGEFPGATSFPQVITTAASFNASLWEEI 134
Query: 135 GQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGV 194
G+ + EARA+YN G+ G+T+W+PN+NIFRDPRWGRGQETPGEDP+V YA YVRG+
Sbjct: 135 GRVVSDEARAMYN-GEMGGLTYWSPNVNIFRDPRWGRGQETPGEDPVVAALYAERYVRGL 193
Query: 195 QGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCV 254
QG+ +G LK +ACCKH+TAYDLDNW G R+ F+A+VT QD+ DT+ PF SCV
Sbjct: 194 QGNE-DGDSLK----VAACCKHYTAYDLDNWGGVDRFHFNAKVTKQDIEDTFDVPFRSCV 248
Query: 255 KQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKS 314
KQG+ + IMC+YN+VNGIP+CAD LL KT R WG +GYI SDCD+V + YD + Y +
Sbjct: 249 KQGKVASIMCSYNQVNGIPTCADPQLLRKTIRGGWGLNGYIVSDCDSVGVFYDTQHYTST 308
Query: 315 PEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNP 374
PE+A +KAG+D++CG FL +HT+ AV L E+ ID L N +V+MRLG+F+G P
Sbjct: 309 PEEAAAAAIKAGLDLDCGPFLSQHTENAVHIGILKEAAIDTNLANTVAVQMRLGMFDGEP 368
Query: 375 TMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSA 434
+ Q +G +G VCSPAHQ LA++AA+ GIVLLKN LPL + ++A+IGPN++
Sbjct: 369 SAQQYGHLGPRDVCSPAHQELAVEAARQGIVLLKNHGPALPLSPRRHRTVAVIGPNSDVT 428
Query: 435 KTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASI-DKAVDIAKGADHVVL 493
T++GNYAG +C +PLQ + Y + T++ GC VACS + AV+ A+ AD VL
Sbjct: 429 VTMIGNYAGVACGYTSPLQGISKYAK-TIHEKGCGDVACSDDKLFAGAVNAARQADATVL 487
Query: 494 MMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNI 553
+MGLDQ+ E E DR L+LPG QQELI+ V++A++ PV+LVL+ GGPVD+TFA D I
Sbjct: 488 VMGLDQSIEAEFRDRTGLLLPGFQQELISEVSKASRGPVVLVLMSGGPVDVTFANNDPRI 547
Query: 554 GSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNP 612
G+I+WAGYPG+ G A+A+V+FG HNPGG+LPMTWYPQ+Y+ +PMT M MR G P
Sbjct: 548 GAIVWAGYPGQGGGAAIADVLFGAHNPGGKLPMTWYPQEYLNNLPMTTMDMRSNLAKGYP 607
Query: 613 GRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPE 672
GRTYRFY+G V+PFG GLSY+K+ + + L + N + KS+
Sbjct: 608 GRTYRFYKGPLVYPFGHGLSYTKF---ITTIFEAPKTLAIPIDGRHTYNSSTISNKSI-R 663
Query: 673 LGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEK 732
+ C + + VKN G G H +L+F KP P KQLV FQ V + A+ K
Sbjct: 664 VTHAKCSKISVQIHVDVKNVGPKDGSHTLLVFSKPPVD-IWVPHKQLVAFQKVYVPARSK 722
Query: 733 AEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
+ + C+ LS G+ I G H + +GD ++ +S+
Sbjct: 723 QRVAINIHVCKYLSVVDRAGVRRIPIGEHSIHIGDAKHSLSL 764
>gi|356503923|ref|XP_003520749.1| PREDICTED: probable beta-D-xylosidase 2-like [Glycine max]
Length = 775
Score = 760 bits (1962), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 372/780 (47%), Positives = 520/780 (66%), Gaps = 27/780 (3%)
Query: 3 FHKLSLVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSR 62
F L + + L F L+ + PF+CDP N +TE PFCK +L I +R +DLV R
Sbjct: 5 FSPLLNLIAVFLLLF--LVRHTCEARDPFACDPKNGATENMPFCKASLAIPERVKDLVGR 62
Query: 63 LTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILT 122
LTL EK+ LVN+A A+PRLG+ YEWWSEALHGV+ VG G+ FN GATSFPQVI T
Sbjct: 63 LTLQEKVRLLVNNAAAVPRLGMKGYEWWSEALHGVSNVGPGVKFNAQFPGATSFPQVITT 122
Query: 123 AASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLV 182
AASF++ LW IGQ + EARA+YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP++
Sbjct: 123 AASFNASLWEAIGQVVSDEARAMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVL 181
Query: 183 TGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDL 242
G YA SYVRG+QG N +L+ +ACCKHFTAYDLDNW G R+ F+A+V+ QD+
Sbjct: 182 AGTYAASYVRGLQGTDGN------RLKVAACCKHFTAYDLDNWNGMDRFHFNAQVSKQDI 235
Query: 243 ADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAV 302
+T+ PF CV +G+ + +MC+YN+VNG+P+CAD NLL KT R W GYI SDCD+V
Sbjct: 236 EETFDVPFRMCVSEGKVASVMCSYNQVNGVPTCADPNLLKKTVRGLWQLDGYIVSDCDSV 295
Query: 303 SIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFS 362
+ YD + Y +PE+A D +KAG+D++CG FL HT+ AV++ L E++++ AL N +
Sbjct: 296 GVFYDNQHYTPTPEEAAADAIKAGLDLDCGPFLAVHTQNAVEKGLLSEADVNGALVNTLT 355
Query: 363 VRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSV 422
V+MRLG+F+G P+ +GK+G VC PAHQ LAL+AA+ GIVLLKN+ +LPL +
Sbjct: 356 VQMRLGMFDGEPSAHAYGKLGPKDVCKPAHQELALEAARQGIVLLKNTGPVLPLSPQRHH 415
Query: 423 SLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASI-DKA 481
++A+IGPN+ + T++GNYAG +C PLQ + Y + T++ GC+ VAC + + A
Sbjct: 416 TVAVIGPNSKATVTMIGNYAGVACGYTNPLQGIGRYAK-TIHQLGCENVACKNDKLFGSA 474
Query: 482 VDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGP 541
++ A+ AD VL+MGLDQ+ E E +DR L+LPGRQQ+L+++VA A+K P ILV++ GG
Sbjct: 475 INAARQADATVLVMGLDQSIEAETVDRTGLLLPGRQQDLVSKVAAASKGPTILVIMSGGS 534
Query: 542 VDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTD 600
VDITFAK + I ILWAGYPG+AG A+A+++FG NPGG+LP+TWYPQ+Y+ K+PMT+
Sbjct: 535 VDITFAKNNPRIVGILWAGYPGQAGGAAIADILFGTTNPGGKLPVTWYPQEYLTKLPMTN 594
Query: 601 MKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKA---VSQNKLYLNQSSSTK 657
M MR ++G PGRTYRFY G V+PFG GL+Y+ + + + V L ++ ++
Sbjct: 595 MAMRGSKSAGYPGRTYRFYNGPVVYPFGHGLTYTHFVHTLASAPTVVSVPLNGHRRANVT 654
Query: 658 MVENQ--DVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLF-VKPARRGNGR 714
+ N+ V H + C+ + + +KN G G H +L+F PA G+
Sbjct: 655 NISNRAIRVTHAR---------CDKLSISLEVDIKNVGSRDGTHTLLVFSAPPAGFGHWA 705
Query: 715 PIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
KQLV F+ + + AK + + C+ LS + G+ I G H +GD ++ +S+
Sbjct: 706 LEKQLVAFEKIHVPAKGLQRVGVNIHVCKLLSVVDKSGIRRIPLGEHSFNIGDVKHSVSL 765
>gi|242077366|ref|XP_002448619.1| hypothetical protein SORBIDRAFT_06g030270 [Sorghum bicolor]
gi|241939802|gb|EES12947.1| hypothetical protein SORBIDRAFT_06g030270 [Sorghum bicolor]
Length = 767
Score = 760 bits (1962), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 385/750 (51%), Positives = 503/750 (67%), Gaps = 16/750 (2%)
Query: 29 PPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYE 88
P F+CD SN + ++ FC + S RA DLVSRLTL EK+ LV+ A+PRLGIP YE
Sbjct: 32 PVFACDASNATLASYGFCNRSASASARAADLVSRLTLAEKVGFLVDKQAALPRLGIPLYE 91
Query: 89 WWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA 148
WWSEALHGV+ VG G F+ + ATSFPQ ILTAASF++ L+ IG+ + EARA++N
Sbjct: 92 WWSEALHGVSYVGPGTRFSSLVPAATSFPQPILTAASFNATLFRAIGEVVSNEARAMHNV 151
Query: 149 GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKL 208
G A G+TFW+PNINIFRDPRWGRGQETPGEDPL+T KYAV YV G+Q + G G L
Sbjct: 152 GLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLTSKYAVGYVTGLQ----DAGSGSGSL 206
Query: 209 QASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
+ +ACCKH+TAYD+DNWKG RY F+A V+ QDL DT+QPPF+SCV G + +MC+YN+
Sbjct: 207 KVAACCKHYTAYDVDNWKGVERYTFNAVVSQQDLDDTFQPPFKSCVVDGNVASVMCSYNQ 266
Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
VNG P+CAD++LLS R W +GYI+SDCD+V ++Y+ + Y K+PEDA +KAG+D
Sbjct: 267 VNGKPTCADKDLLSGVIRGDWKLNGYISSDCDSVDVLYNNQHYTKTPEDAAAISIKAGLD 326
Query: 329 VNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVC 388
+NCG+FL +HT AAV+ KL ES++DRA+ N F MRLG F+G+P PFG +G VC
Sbjct: 327 LNCGNFLAQHTVAAVQAGKLSESDVDRAITNNFITLMRLGFFDGDPRKLPFGNLGPSDVC 386
Query: 389 SPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS 448
+ ++Q LA +AA+ GIVLLKNS G LPL S SLA+IGPNAN++ T++GNY G C+
Sbjct: 387 TSSNQELAREAARQGIVLLKNS-GALPLSASSIKSLAVIGPNANASFTMIGNYEGTPCKY 445
Query: 449 ITPLQALQNYVENTVYYPGCDTVACS--SASIDKAVDIAKGADHVVLMMGLDQTQEKEEL 506
TPLQ L V TVY PGC V CS S +D A A AD VL++G DQ+ E+E L
Sbjct: 446 TTPLQGLGANVA-TVYQPGCTNVGCSGNSLQLDAATKAAASADVTVLVVGADQSIERESL 504
Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
DR L+LPG+Q +L++ VA A++ P ILV++ GGP DI+FAK I +ILW GYPGEAG
Sbjct: 505 DRTSLLLPGQQPQLVSAVANASRGPCILVIMSGGPFDISFAKSSDKIAAILWVGYPGEAG 564
Query: 567 AVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFP 626
A+A+V+FG HNP GRLP+TWYP+ + KVPM DM+MRP A++G PGRTYRFY G V+
Sbjct: 565 GAAIADVLFGHHNPSGRLPVTWYPESFTKVPMIDMRMRPDASTGYPGRTYRFYTGDTVYA 624
Query: 627 FGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVT 686
FG GLSY+ +++ + + T + E V E CE F V
Sbjct: 625 FGDGLSYTSFAHHLVSAPKQVALQLAEGHTCLTEQCPSV------EAEGAHCEGLAFDVH 678
Query: 687 IGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLS 746
+ V+N G+M+G H V LF P N P K L+GF+ V L + + F++ C+ LS
Sbjct: 679 LRVRNAGDMSGAHTVFLFSSPPAVHNA-PAKHLLGFEKVSLEPGQAGVVAFKVDVCKDLS 737
Query: 747 RAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
E G + G H L VGD ++ +S+ V
Sbjct: 738 VVDELGNRKVALGNHTLHVGDLKHTLSLGV 767
>gi|357511337|ref|XP_003625957.1| Beta-xylosidase [Medicago truncatula]
gi|355500972|gb|AES82175.1| Beta-xylosidase [Medicago truncatula]
Length = 771
Score = 759 bits (1959), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 368/772 (47%), Positives = 517/772 (66%), Gaps = 27/772 (3%)
Query: 11 PLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKIS 70
PL+ L F LL + F+CD + +T+ PFC L I +R +DL+ RLT+ EK++
Sbjct: 9 PLITL-FILLLQSSCDARDSFACDAKDAATKNLPFCNVKLAIPERVKDLIGRLTMQEKVN 67
Query: 71 QLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYL 130
LVN+APA+PR+G+ +YEWWSEALHGV+ VG G F G ATSFPQVI TAASF++ L
Sbjct: 68 LLVNNAPAVPRVGMKSYEWWSEALHGVSNVGPGTRFGGVFPAATSFPQVITTAASFNASL 127
Query: 131 WYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSY 190
W IG+ + EARA+YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP++ G+YA SY
Sbjct: 128 WEAIGRVVSDEARAMYNGGAA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLAGRYAASY 186
Query: 191 VRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPF 250
V+G+QG N KL+ +ACCKHFTAYD+DNW G R+ F+A V+ QD+ DT+ PF
Sbjct: 187 VKGLQGTDGN------KLKVAACCKHFTAYDVDNWNGVDRFHFNALVSKQDIEDTFDVPF 240
Query: 251 ESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEG 310
CVK+G+ + +MC+YN+VNG+P+CAD NLL KT R WG GYI SDCD+V ++Y+++
Sbjct: 241 RMCVKEGKVASVMCSYNQVNGVPTCADPNLLKKTVRGVWGLDGYIVSDCDSVGVLYNSQH 300
Query: 311 YAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLF 370
Y +PE+A D +KAG+D++CG FL HT+ AVK+ L E++++ AL N V+MRLG+F
Sbjct: 301 YTSTPEEAAADAIKAGLDLDCGPFLGVHTQDAVKKGLLTEADVNNALVNTLKVQMRLGMF 360
Query: 371 NGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPN 430
+G P+ Q +G++G VC PAHQ LAL+AA+ GIVLLKN+ LPL + ++A+IGPN
Sbjct: 361 DGEPSAQAYGRLGPKDVCKPAHQELALEAARQGIVLLKNTGPTLPLSPQRHRTVAVIGPN 420
Query: 431 ANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVAC-SSASIDKAVDIAKGAD 489
++ T++GNYAG +C +PLQ + Y + T++ GC VAC A+D A+ AD
Sbjct: 421 SDVTVTMIGNYAGIACGYTSPLQGIGRYAK-TIHQQGCSNVACRDDKQFGPALDAARHAD 479
Query: 490 HVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKY 549
+L++GLDQ+ E E +DR L+LPG QQ+L+++VA A+K P ILVL+ GGPVDITFAK
Sbjct: 480 ATILVIGLDQSIEAETVDRTSLLLPGHQQDLVSKVAAASKGPTILVLMSGGPVDITFAKN 539
Query: 550 DRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK-VPMTDMKMRPQAT 608
D + ILWAGYPG+AG A+A+++FG +PGG+LP+TWYPQ+Y+K + MT+M MRP
Sbjct: 540 DPKVAGILWAGYPGQAGGAAIADILFGTASPGGKLPVTWYPQEYLKNLAMTNMAMRPSKI 599
Query: 609 SGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKA---VSQNKLYLNQSSSTKMVENQ--D 663
G PGRTYRFY+G V+PFG GL+Y+ + ++ + V ++ ++ + + N+
Sbjct: 600 -GYPGRTYRFYKGPVVYPFGHGLTYTHFVHELSSAPTVVSVPVHGHRHGNNTNISNKAIR 658
Query: 664 VVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNG-RPIKQLVGF 722
V H + C + + VKN G G H +L+F P GN P K LV F
Sbjct: 659 VTHAR---------CGKLSIALHVDVKNVGSRDGTHTLLVFSAPPNGGNHWVPQKSLVAF 709
Query: 723 QSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
+ V + AK K + + C+ LS + G+ I G H L +GD ++ +S+
Sbjct: 710 EKVHVPAKTKQRVRVNIHVCKLLSVVDKSGIRRIPMGEHSLHIGDVKHSVSL 761
>gi|449438167|ref|XP_004136861.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
[Cucumis sativus]
Length = 782
Score = 758 bits (1958), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 396/761 (52%), Positives = 510/761 (67%), Gaps = 28/761 (3%)
Query: 26 STQPP--FSCDP-SNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRL 82
S Q P F+CD +NPS F FC ++L R DLV RLTL EKI L+N+A + RL
Sbjct: 40 SAQSPTAFACDAETNPSVSGFAFCDSSLGFEARVEDLVKRLTLQEKIGFLINNARNVTRL 99
Query: 83 GIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEA 142
GIP YEWWSEALHGV+ VG G F+ + GATSFPQVILTAASF++ L+ IG+ + EA
Sbjct: 100 GIPKYEWWSEALHGVSYVGPGTKFSNVVPGATSFPQVILTAASFNASLFEAIGKVVSTEA 159
Query: 143 RALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQ----GDT 198
RA+YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDPL++ KYA YVRG+Q GD
Sbjct: 160 RAMYNVGLA-GLTYWSPNVNIFRDPRWGRGQETPGEDPLLSSKYAAGYVRGLQQRDDGD- 217
Query: 199 FNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGR 258
+L+ +ACCKH+TAYDLDNWKGT RY F+A V+ QDL DT+QPPF+SCV G
Sbjct: 218 ------PDRLKVAACCKHYTAYDLDNWKGTDRYHFNAVVSPQDLEDTFQPPFKSCVIDGN 271
Query: 259 ASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDA 318
+ +MC+YN+VNG P+CAD +LL+ R QW +GYI SDCD+V ++Y+++ Y KSPE+A
Sbjct: 272 VASVMCSYNQVNGKPTCADPDLLAGVIRGQWKLNGYIVSDCDSVDVLYNSQHYTKSPEEA 331
Query: 319 VVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQP 378
+ AG+D++CG FL KHT+AAV + E+ I +A+ N MRLG F+GNP+ Q
Sbjct: 332 AAKTILAGLDLDCGDFLGKHTEAAVTGGLVNEAAISKAVFNNLLTLMRLGFFDGNPSKQL 391
Query: 379 FGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLL 438
+GK+G VC+P HQ LA +AA+ GIVLLKNS LPL S SLA+IGPNAN KT++
Sbjct: 392 YGKLGPKDVCTPEHQELAREAARQGIVLLKNSPKSLPLSSSAIKSLAVIGPNANVTKTMI 451
Query: 439 GNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLD 498
GNY G C+ TPLQ L V +T + PGC VAC+SA +D+A IA AD VL++G D
Sbjct: 452 GNYEGTPCKYTTPLQGLSAVV-STSFQPGCANVACTSAQLDEAKKIAASADATVLVVGSD 510
Query: 499 QTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILW 558
Q+ E E DRVDL LPG+Q LIT VA+A+K PVILV++ GG +DITFAK D I SILW
Sbjct: 511 QSIEAESRDRVDLNLPGQQALLITEVAKASKGPVILVIMTGGGMDITFAKKDDKITSILW 570
Query: 559 AGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYR 617
G+PGEAG A+A+VIFG NP GRLPMTWYPQ Y+ KVPMTDM+MRP A++G PGRTYR
Sbjct: 571 VGFPGEAGGAAIADVIFGSFNPSGRLPMTWYPQSYVEKVPMTDMRMRPSASNGFPGRTYR 630
Query: 618 FYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVP--ELGT 675
FY G+ ++ FG GLSYS + + V KL + +E + H E+
Sbjct: 631 FYTGETIYSFGDGLSYSDFKHHL--VKAPKLV------SIPLEEGHICHSSKCHSLEVVQ 682
Query: 676 EFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEI 735
E C+ F V + VKN G+ +G H V L+ P N P K L+GF+ V L + +
Sbjct: 683 ESCQNLGFDVHLRVKNVGQRSGSHTVFLYSTPPSVHN-SPQKHLLGFEKVSLGRGGETVV 741
Query: 736 VFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
F++ C+ LS A E G + G H L VG ++ +++ V
Sbjct: 742 RFKVDVCKDLSVADEVGSRKVALGLHILHVGTLKHSLNVKV 782
>gi|224070626|ref|XP_002303181.1| predicted protein [Populus trichocarpa]
gi|222840613|gb|EEE78160.1| predicted protein [Populus trichocarpa]
Length = 773
Score = 758 bits (1956), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 396/775 (51%), Positives = 519/775 (66%), Gaps = 27/775 (3%)
Query: 9 VFPLLCLCFTSLLTRVDS-TQPPFSCDP-SNPSTETFPFCKTTLPISQRARDLVSRLTLD 66
VF L C+ L T V + + P F+CD SNPS + FC T++ I+ R DLV RLTL
Sbjct: 15 VFLLFCMFLVFLSTHVSAQSSPVFACDVVSNPSLASLGFCNTSIGINDRVVDLVKRLTLQ 74
Query: 67 EKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASF 126
EKI LVNSA + RLGIP YEWWSEALHGV+ VG G F+ + GATSFPQVILTAASF
Sbjct: 75 EKIVFLVNSAGNVSRLGIPKYEWWSEALHGVSYVGPGTHFSDDVAGATSFPQVILTAASF 134
Query: 127 DSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKY 186
++ L+ IG+ + EARA+YN G A G+TFW+PNINIFRDPRWGRGQETPGEDPL++ KY
Sbjct: 135 NTSLFEAIGKVVSTEARAMYNVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLSSKY 193
Query: 187 AVSYVRGVQ----GDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDL 242
YV+G+Q GD KL+ +ACCKH+TAYDLDNWKG+ RY F+A VT QD+
Sbjct: 194 GSCYVKGLQQRDDGD-------PDKLKVAACCKHYTAYDLDNWKGSDRYHFNAVVTKQDM 246
Query: 243 ADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAV 302
DT+QPPF+SCV G + +MC+YN+VNG P+CAD +LLS R +W +GYI +DCD++
Sbjct: 247 DDTFQPPFKSCVIDGNVASVMCSYNQVNGKPTCADPDLLSGVIRGEWNLNGYIVTDCDSL 306
Query: 303 SIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFS 362
+ Y ++ Y K+PE+A + AG+D+NCGSFL +HT+AAVK + E ID A+ N F+
Sbjct: 307 DVFYKSQNYTKTPEEAAAAAILAGVDLNCGSFLGQHTEAAVKGGLVNEHAIDIAVSNNFA 366
Query: 363 VRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSV 422
MRLG F+G+P+ Q +GK+G VC+ +Q LA +AA+ GIVLLKN+ G LPL +
Sbjct: 367 TLMRLGFFDGDPSKQLYGKLGPKDVCTAENQELAREAARQGIVLLKNTAGSLPLSPTAIK 426
Query: 423 SLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAV 482
+LA+IGPNAN KT++GNY G C+ TPLQ L V T Y PGC VACS+A +D A
Sbjct: 427 NLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAASVA-TTYLPGCSNVACSTAQVDDAK 485
Query: 483 DIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPV 542
+A AD VL+MG D + E E DRVD++LPG+QQ LIT VA + PVILV++ GG +
Sbjct: 486 KLAAAADATVLVMGADLSIEAESRDRVDVLLPGQQQLLITAVANVSCGPVILVIMSGGGM 545
Query: 543 DITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDM 601
D++FA+ + I SILW GYPGEAG A+A++IFG +NP GRLPMTWYPQ Y+ KVPMT+M
Sbjct: 546 DVSFARTNDKITSILWVGYPGEAGGAAIADIIFGYYNPSGRLPMTWYPQSYVDKVPMTNM 605
Query: 602 KMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLY--LNQSSSTKMV 659
MRP ++G PGRTYRFY G+ V+ FG GLSYS+++++ Q +Y L +S
Sbjct: 606 NMRPDPSNGYPGRTYRFYTGETVYSFGDGLSYSQFTHELIQAPQ-LVYVPLEESHVCHSS 664
Query: 660 ENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQL 719
E Q VV + + C+ F + + VKN G ++G H V LF P N P K L
Sbjct: 665 ECQSVVASE-------QTCQNSTFDMLLRVKNEGTISGSHTVFLFSSPPAVHN-SPQKHL 716
Query: 720 VGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
VGF+ V LNA+ + F++ C+ LS E G + G H L VG ++ +S+
Sbjct: 717 VGFEKVFLNAQTGRHVRFKVDICKDLSVVDELGSKKVALGEHVLHVGSLKHFLSV 771
>gi|357164885|ref|XP_003580200.1| PREDICTED: probable beta-D-xylosidase 6-like [Brachypodium
distachyon]
Length = 771
Score = 757 bits (1955), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/736 (50%), Positives = 508/736 (69%), Gaps = 15/736 (2%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
+PFC +LP RAR LVS LTLDEKI+QL N+A +PRLGIP YEWWSE+LHG+A G
Sbjct: 37 YPFCDASLPFPVRARALVSLLTLDEKIAQLSNTAAGVPRLGIPPYEWWSESLHGLADNGP 96
Query: 103 GI-FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNI 161
G+ F +G + AT FPQVIL+AASF+ LW + +A+ +EARA++NAGQA G+T+WAPNI
Sbjct: 97 GVNFSSGPVGAATIFPQVILSAASFNRSLWRAVAEAVAVEARAMHNAGQA-GLTYWAPNI 155
Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
N+FRDPRWGRGQETPGEDP V Y+V YV+G QG+ + GK +G++ SACCKH+ AYD
Sbjct: 156 NVFRDPRWGRGQETPGEDPAVIAAYSVEYVKGFQGE-YGDGK-EGRMMLSACCKHYVAYD 213
Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
L+ W TRY F+A+V QD DTY+PPF+SC+++GRAS +MC+YN+VNG+P+CA ++LL
Sbjct: 214 LEKWGNFTRYTFNAKVNEQDFEDTYEPPFKSCIQEGRASCLMCSYNQVNGVPACARKDLL 273
Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA 341
K R +WGF GY+ SDCDAV IIY + Y S ED++ VLKAGMD+NCGSFL +HTK+
Sbjct: 274 QKV-RDEWGFQGYVVSDCDAVGIIYGYQNYTNSDEDSIAIVLKAGMDINCGSFLIRHTKS 332
Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQ 401
A+++ K+ E +I+ AL NLFSV++RLGLF+ Q F ++G +C+ H+ LA +AA+
Sbjct: 333 AIQKGKITEEDINHALFNLFSVQLRLGLFDKTSGNQWFTQLGPSNICTKEHRELAAEAAR 392
Query: 402 DGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN 461
G VLLKN + LPL +S+ +A+IGP AN A + G+Y G C T L+ +Q V
Sbjct: 393 QGTVLLKNDNSFLPLKRSEVSHIAIIGPVANDAYIMGGDYTGVPCNPTTFLKGMQAVVPQ 452
Query: 462 TVYYPGCDTVACSSAS-IDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQEL 520
T GC ++C+S +A+++AK AD VVL+ GL+ TQE E+LDRV L+LPG+Q +L
Sbjct: 453 TTIAAGCKDISCNSTDGFGEAIEVAKRADIVVLIAGLNLTQETEDLDRVSLLLPGKQMDL 512
Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
I +A KKP++LV+ GGPVD++FAK D+ I S+LW GYPGE G L E++FG++NP
Sbjct: 513 INSIASVTKKPLVLVITGGGPVDVSFAKQDKRIASVLWIGYPGEVGGQVLPEILFGEYNP 572
Query: 581 GGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKF 640
GG+LP+TWYP+ + VPM DM MR + PGRTYRFY G V+ FG GLSYSKYSY
Sbjct: 573 GGKLPITWYPESFTAVPMNDMNMRADPSRSYPGRTYRFYTGDVVYGFGYGLSYSKYSYNI 632
Query: 641 KAVSQNKLYLNQSSSTKMVENQ------DVVHYKSVPELGTEFCETRKFLVTIGVKNHGE 694
+ K+ L++SS+ + + D + Y V ++ + CE+ KF V I V N G
Sbjct: 633 IQ-APTKISLSRSSAVDFISTKRAHTRRDGLDYVQVEDIAS--CESIKFSVHISVANDGA 689
Query: 695 MAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLM 754
M G H VLLF + G P+KQLVGF+ + A + + + PC+ +S A +G
Sbjct: 690 MDGSHAVLLFTRSKSSVPGFPLKQLVGFERLYAAAGKATNVEITVDPCKLMSSANTEGRR 749
Query: 755 VIEEGTHFLVVGDEEY 770
V+ G+H L+VGDEE+
Sbjct: 750 VLLLGSHLLMVGDEEH 765
>gi|449479116|ref|XP_004155509.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
[Cucumis sativus]
Length = 809
Score = 756 bits (1953), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 397/761 (52%), Positives = 510/761 (67%), Gaps = 28/761 (3%)
Query: 26 STQPP--FSCDP-SNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRL 82
S Q P F+CD +NPS F FC ++L R DLV RLTL EKI L+N+A + RL
Sbjct: 67 SAQSPTAFACDAETNPSVSGFAFCDSSLGFEARVEDLVKRLTLQEKIGFLINNARNVTRL 126
Query: 83 GIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEA 142
GIP YEWWSEALHGV+ VG G F+ + GATSFPQVILTAASF++ L+ IG+ + EA
Sbjct: 127 GIPKYEWWSEALHGVSYVGPGTKFSNVVPGATSFPQVILTAASFNASLFEAIGKVVSTEA 186
Query: 143 RALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQ----GDT 198
RA+YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDPL++ KYA YVRG+Q GD
Sbjct: 187 RAMYNVGLA-GLTYWSPNVNIFRDPRWGRGQETPGEDPLLSSKYAAGYVRGLQQRDDGD- 244
Query: 199 FNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGR 258
+L+ +ACCKH+TAYDLDNWKGT RY F+A V+ QDL DT+QPPF+SCV G
Sbjct: 245 ------PDRLKVAACCKHYTAYDLDNWKGTDRYHFNAVVSPQDLEDTFQPPFKSCVIDGN 298
Query: 259 ASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDA 318
+ +MC+YN+VNG P+CAD +LL+ R QW +GYI SDCD+V ++Y+++ Y KSPE+A
Sbjct: 299 VASVMCSYNQVNGKPTCADPDLLAGVIRGQWKLNGYIVSDCDSVDVLYNSQHYTKSPEEA 358
Query: 319 VVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQP 378
+ AG+D++CG FL KHT+AAV + E+ I +A+ N MRLG F+GNP+ Q
Sbjct: 359 AAKTILAGLDLDCGDFLGKHTEAAVTGGLVNEAAISKAVFNNLLTLMRLGFFDGNPSKQL 418
Query: 379 FGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLL 438
+GK+G VC+P HQ LA +AA+ GIVLLKNS LPL S SLA+IGPNAN KT++
Sbjct: 419 YGKLGPKDVCTPEHQELAREAARQGIVLLKNSPKSLPLSSSAIKSLAVIGPNANVTKTMI 478
Query: 439 GNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLD 498
GNY G C+ TPLQ L V +T + PGC VAC+SA +D+A IA AD VL++G D
Sbjct: 479 GNYEGTPCKYTTPLQGLSAVV-STSFQPGCANVACTSAQLDEAKKIAASADATVLVVGSD 537
Query: 499 QTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILW 558
Q+ E E DRVDL LPG+Q LIT VA+A+K PVILV++ GG +DITFAK D I SILW
Sbjct: 538 QSIEAESRDRVDLNLPGQQALLITEVAKASKGPVILVIMTGGGMDITFAKKDDKITSILW 597
Query: 559 AGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYR 617
G+PGEAG A+A+VIFG NP GRLPMTWYPQ Y+ KVPMTDM+MRP A++G PGRTYR
Sbjct: 598 VGFPGEAGGAAIADVIFGSFNPSGRLPMTWYPQSYVEKVPMTDMRMRPSASNGFPGRTYR 657
Query: 618 FYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVP--ELGT 675
FY G+ ++ FG GLSYS +K V KL + +E + H E+
Sbjct: 658 FYTGETIYSFGDGLSYS--DFKHHLVKAPKLV------SIPLEEGHICHSSKCHSLEVVQ 709
Query: 676 EFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEI 735
E C+ F V + VKN G+ +G H V L+ P N P K L+GF+ V L + +
Sbjct: 710 ESCQNLGFDVHLRVKNVGQRSGSHTVFLYSTPPSVHN-SPQKHLLGFEKVSLGRGGETVV 768
Query: 736 VFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
F++ C+ LS A E G + G H L VG ++ +++ V
Sbjct: 769 RFKVDVCKDLSVADEVGSRKVALGLHILHVGTLKHSLNVKV 809
>gi|413919688|gb|AFW59620.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 773
Score = 755 bits (1950), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 383/752 (50%), Positives = 506/752 (67%), Gaps = 20/752 (2%)
Query: 29 PPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYE 88
P F+CD SN + ++ FC + + RA DLVSRLTL EK+ LV+ A+PRLG+P YE
Sbjct: 38 PAFACDASNATLASYGFCNRSAAAAARAADLVSRLTLAEKVGFLVDKQAALPRLGVPLYE 97
Query: 89 WWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA 148
WWSEALHGV+ VG G F+ + GATSFPQ ILTAASF++ L+ IG+ + EARA++N
Sbjct: 98 WWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNATLFRAIGEVVSNEARAMHNV 157
Query: 149 GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKL 208
G A G+TFW+PNINIFRDPRWGRGQETPGEDPL+T KYAV YV G+QG G LK
Sbjct: 158 GLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLTSKYAVGYVTGLQGAVSGAGALK--- 213
Query: 209 QASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
+ACCKH+TAYD+DNWKG RY FDA V+ QDL DT+QPPF+SCV G + +MC+YN+
Sbjct: 214 -VAACCKHYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQPPFKSCVVDGNVASVMCSYNQ 272
Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
VNG P+CAD++LLS R W +GYI+SDCD+V ++Y+ + Y K+PEDA +KAG+D
Sbjct: 273 VNGKPTCADKDLLSGVIRGDWKLNGYISSDCDSVDVLYNNQHYTKTPEDAAAISIKAGLD 332
Query: 329 VNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVC 388
+NCG+FL +HT AAV+ KL ES++DRA+ N MRLG F+G+P PFG +G VC
Sbjct: 333 LNCGTFLAQHTVAAVQAGKLSESDVDRAVTNNLVTLMRLGFFDGDPRELPFGNLGPSDVC 392
Query: 389 SPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS 448
+P++Q LA +AA+ GIVLLKN+ G LPL + S+A+IGPNAN++ T++GNY G C+
Sbjct: 393 TPSNQELAREAARQGIVLLKNT-GKLPLSATSIKSMAVIGPNANASFTMIGNYEGTPCKY 451
Query: 449 ITPLQALQNYVENTVYYPGCDTVACS--SASIDKAVDIAKGADHVVLMMGLDQTQEKEEL 506
TPLQ L V TVY PGC V CS S +D A A AD VL++G DQ+ E+E L
Sbjct: 452 TTPLQGLGANVA-TVYQPGCTNVGCSGNSLQLDAATKAAASADVTVLVVGADQSIERESL 510
Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
DR L+LPG+Q +L++ VA A+ P ILV++ GGP DI+FAK I +ILW GYPGEAG
Sbjct: 511 DRTSLLLPGQQPQLVSAVANASSGPCILVVMSGGPFDISFAKSSDKIAAILWVGYPGEAG 570
Query: 567 AVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFP 626
A+A+V+FG HNP GRLP+TWYP+ + KVPMTDM+MRP ++G PGRTYRFY G V+
Sbjct: 571 GAAIADVLFGYHNPSGRLPVTWYPESFTKVPMTDMRMRPDPSTGYPGRTYRFYTGDTVYA 630
Query: 627 FGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTE--FCETRKFL 684
FG GLSY+ +++ + + +L L + + Q P + E CE F
Sbjct: 631 FGDGLSYTSFAHHLVSAPK-QLALQLAEGHACLTEQ-------CPSVEAEGAHCEGLAFD 682
Query: 685 VTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCES 744
V + V+N GE +G H V LF P N P K L+GF+ V L + + F++ C+
Sbjct: 683 VHLRVRNAGERSGGHTVFLFSSPPAVHNA-PAKHLLGFEKVSLEPGQAGVVAFKVDVCKD 741
Query: 745 LSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
LS E G + G+H L VGD ++ +++ V
Sbjct: 742 LSVVDELGNRKVALGSHTLHVGDLKHTLNLGV 773
>gi|115460876|ref|NP_001054038.1| Os04g0640700 [Oryza sativa Japonica Group]
gi|38344900|emb|CAE02971.2| OSJNBb0079B02.3 [Oryza sativa Japonica Group]
gi|113565609|dbj|BAF15952.1| Os04g0640700 [Oryza sativa Japonica Group]
gi|116310882|emb|CAH67823.1| OSIGBa0138H21-OSIGBa0138E01.14 [Oryza sativa Indica Group]
gi|218195682|gb|EEC78109.1| hypothetical protein OsI_17615 [Oryza sativa Indica Group]
Length = 765
Score = 755 bits (1950), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 385/752 (51%), Positives = 505/752 (67%), Gaps = 19/752 (2%)
Query: 29 PPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYE 88
P F+CD SN + + FC T + RA DL+ RLTL EK+ LVN A+PRLGIPAYE
Sbjct: 29 PVFACDASNATVSGYGFCDRTKSSAARAADLLGRLTLAEKVGFLVNKQAALPRLGIPAYE 88
Query: 89 WWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA 148
WWSEALHGV+ VG G F+ + GATSFPQ ILTAASF++ L+ IG+ + EARA++N
Sbjct: 89 WWSEALHGVSYVGPGTRFSTLVPGATSFPQPILTAASFNASLFRAIGEVVSTEARAMHNV 148
Query: 149 GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKL 208
G A G+TFW+PNINIFRDPRWGRGQETPGEDPL+ KYAV YV G+Q + G L
Sbjct: 149 GLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQ----DAGGGSDAL 203
Query: 209 QASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
+ +ACCKH+TAYD+DNWKG RY FDA V+ QDL DT+QPPF+SCV G + +MC+YN+
Sbjct: 204 KVAACCKHYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQPPFKSCVIDGNVASVMCSYNK 263
Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
VNG P+CAD++LLS R W +GYI SDCD+V ++Y+ + Y K+PEDA +K+G+D
Sbjct: 264 VNGKPTCADKDLLSGVIRGDWKLNGYIVSDCDSVDVLYNNQHYTKNPEDAAAITIKSGLD 323
Query: 329 VNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVC 388
+NCG+FL +HT AAV+ KL ES++DRA+ N F V MRLG F+G+P PFG +G VC
Sbjct: 324 LNCGNFLAQHTVAAVQAGKLSESDVDRAITNNFIVLMRLGFFDGDPRKLPFGSLGPKDVC 383
Query: 389 SPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS 448
+ ++Q LA +AA+ GIVLLKN+ G LPL S+A+IGPNAN++ T++GNY G C+
Sbjct: 384 TSSNQELAREAARQGIVLLKNT-GALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPCKY 442
Query: 449 ITPLQALQNYVENTVYYPGCDTVACSSASID--KAVDIAKGADHVVLMMGLDQTQEKEEL 506
TPLQ L V TVY PGC V CS S+ A A AD VL++G DQ+ E+E L
Sbjct: 443 TTPLQGLGANVA-TVYQPGCTNVGCSGNSLQLSAATQAAASADVTVLVVGADQSVERESL 501
Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
DR L+LPG+Q +L++ VA A++ PVILV++ GGP DI+FAK I +ILW GYPGEAG
Sbjct: 502 DRTSLLLPGQQPQLVSAVANASRGPVILVVMSGGPFDISFAKSSDKISAILWVGYPGEAG 561
Query: 567 AVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVF 625
ALA+++FG HNPGGRLP+TWYP + KV MTDM+MRP +++G PGRTYRFY G V+
Sbjct: 562 GAALADILFGYHNPGGRLPVTWYPASFADKVSMTDMRMRPDSSTGYPGRTYRFYTGDTVY 621
Query: 626 PFGCGLSYSKYSYKF-KAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFL 684
FG GLSY+K+++ A Q + L + + H SV G E C + F
Sbjct: 622 AFGDGLSYTKFAHSLVSAPEQVAVQLAEGHACH------TEHCFSVEAAG-EHCGSLSFD 674
Query: 685 VTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCES 744
V + V+N G MAG H V LF P + P K L+GF+ V L + + F++ C+
Sbjct: 675 VHLRVRNAGGMAGGHTVFLFSSPPSV-HSAPAKHLLGFEKVSLEPGQAGVVAFKVDVCKD 733
Query: 745 LSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
LS E G + G+H L VGD ++ +++ V
Sbjct: 734 LSVVDELGNRKVALGSHTLHVGDLKHTLNLRV 765
>gi|326494302|dbj|BAJ90420.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326521150|dbj|BAJ96778.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326527851|dbj|BAK08165.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 775
Score = 755 bits (1949), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 388/756 (51%), Positives = 502/756 (66%), Gaps = 15/756 (1%)
Query: 25 DSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGI 84
++ P F+CD SN + + FC S RARDLVSRLTL EK+ LVN PA+ RLGI
Sbjct: 31 EAQAPVFACDASNATLAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPALGRLGI 90
Query: 85 PAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA 144
PAYEWWSEALHGV+ VG G F+ + GATSFPQ ILTAASF++ L+ IG+ + EARA
Sbjct: 91 PAYEWWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVSTEARA 150
Query: 145 LYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKL 204
++N G A G+TFW+PNINIFRDPRWGRGQETPGEDPL+ KYAV YV G+Q D GG
Sbjct: 151 MHNVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQ-DAGAGGVT 208
Query: 205 KGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMC 264
G L+ +ACCKH+TAYD+DNWKG RY FDA+V+ QDL DT+QPPF+SCV G + +MC
Sbjct: 209 DGALKVAACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVLDGNVASVMC 268
Query: 265 AYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLK 324
+YN+VNG P+CAD++LL R W +GYI SDCD+V ++Y + Y K+PE+A +K
Sbjct: 269 SYNKVNGKPTCADKDLLEGVIRGDWKLNGYIVSDCDSVDVLYTQQHYTKTPEEAAAITIK 328
Query: 325 AGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGA 384
+G+D+NCG+FL +HT AAV+ +L E ++DRA+ N F + MRLG F+G+P FG +G
Sbjct: 329 SGLDLNCGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLGFFDGDPRQLAFGSLGP 388
Query: 385 DVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGP 444
VC+ +++ LA + A+ GIVLLKNS G LPL S+A+IGPNAN++ T++GNY G
Sbjct: 389 KDVCTSSNRELARETARQGIVLLKNS-GALPLSAKSIKSMAVIGPNANASFTMIGNYEGT 447
Query: 445 SCRSITPLQALQNYVENTVYYPGCDTVACSSASID--KAVDIAKGADHVVLMMGLDQTQE 502
C+ TPLQ L V NTVY PGC V CS S+ AV A AD VL++G DQ+ E
Sbjct: 448 PCKYTTPLQGLGAKV-NTVYQPGCTNVGCSGNSLQLSTAVAAAASADVTVLVVGADQSIE 506
Query: 503 KEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYP 562
+E LDR L+LPG+Q +L++ VA A+ PVILV++ GGP DI+FAK I +ILW GYP
Sbjct: 507 RESLDRTSLLLPGQQTQLVSAVANASSGPVILVVMSGGPFDISFAKASDKIAAILWVGYP 566
Query: 563 GEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK-VPMTDMKMRPQATSGNPGRTYRFYEG 621
GEAG ALA+++FG HNP GRLP+TWYP Y V MTDM+MRP ++G PGRTYRFY G
Sbjct: 567 GEAGGAALADILFGSHNPSGRLPVTWYPASYADTVTMTDMRMRPDTSTGYPGRTYRFYTG 626
Query: 622 KEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVP-ELGTEFCET 680
VF FG GLSY+K S+ VS Y+ S ++ E+ + E + C+
Sbjct: 627 DTVFAFGDGLSYTKMSHSL--VSAPPSYV----SMRLAEDHPCRAEECASVEAAGDHCDD 680
Query: 681 RKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELS 740
F V + V+N GE+AG H VLLF P N P K L+GF+ V L E + F +
Sbjct: 681 LAFDVKLQVRNAGEVAGAHSVLLFSSPPPAHNA-PAKHLLGFEKVSLAPGEAGTVAFRVD 739
Query: 741 PCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
C LS E G + G H L VGD ++ + + V
Sbjct: 740 VCRDLSVVDELGGRKVALGGHTLHVGDLKHTVELRV 775
>gi|356572781|ref|XP_003554544.1| PREDICTED: probable beta-D-xylosidase 2-like [Glycine max]
Length = 771
Score = 754 bits (1948), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 365/753 (48%), Positives = 507/753 (67%), Gaps = 25/753 (3%)
Query: 30 PFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEW 89
PF+CDP N T+ FCK +L I++R +DL+ RLTL+EK+ LVN+A A+PRLG+ YEW
Sbjct: 26 PFACDPKNGGTKKMAFCKVSLAIAERVKDLIGRLTLEEKVRLLVNNAAAVPRLGMKGYEW 85
Query: 90 WSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG 149
WSEALHGV+ +G + FN ATSFPQVI TAASF++ LW IGQ + EARA+YN G
Sbjct: 86 WSEALHGVSNLGPAVKFNAQFPAATSFPQVITTAASFNASLWEAIGQVVSDEARAMYNGG 145
Query: 150 QAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQ 209
A G+T+W+PN+NIFRDPRWGRGQETPGEDP++ G YA +YVRG+QG N +L+
Sbjct: 146 TA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLAGTYAATYVRGLQGTHAN------RLK 198
Query: 210 ASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRV 269
+ACCKHFTAYDLDNW G R+ F+A+V+ QD+ DT+ PF+ CV +G+ + +MC+YN+V
Sbjct: 199 VAACCKHFTAYDLDNWNGMDRFHFNAQVSKQDIEDTFDVPFKMCVSEGKVASVMCSYNQV 258
Query: 270 NGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDV 329
NG+P+CAD NLL KT R W GYI SDCD+V + YD + Y +PE+A D +KAG+D+
Sbjct: 259 NGVPTCADPNLLKKTVRGLWQLDGYIVSDCDSVGVFYDNQHYTPTPEEAAADAIKAGLDL 318
Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
+CG FL HT+ AVK+ L E++++ AL N +V+MRLG+F+G PT P+G +G VC
Sbjct: 319 DCGPFLAVHTQNAVKKGLLSEADVNGALVNTLTVQMRLGMFDGEPTAHPYGHLGPKDVCK 378
Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
PAHQ LAL+AA+ GIVLLKN+ +LPL ++A+IGPN+ + T++GNYAG +C
Sbjct: 379 PAHQELALEAARQGIVLLKNTGPVLPLSSQLHRTVAVIGPNSKATITMIGNYAGVACGYT 438
Query: 450 TPLQALQNYVENTVYYPGCDTVACSSASI-DKAVDIAKGADHVVLMMGLDQTQEKEELDR 508
PLQ + Y TV+ GC VAC + + A++ A+ AD VL+MGLDQ+ E E +DR
Sbjct: 439 NPLQGIGRYAR-TVHQLGCQNVACKNDKLFGPAINAARQADATVLVMGLDQSIEAETVDR 497
Query: 509 VDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAV 568
L+LPGRQ +L+++VA A+K P ILVL+ GGPVDITFAK + I ILWAGYPG+AG
Sbjct: 498 TGLLLPGRQPDLVSKVAAASKGPTILVLMSGGPVDITFAKNNPRIVGILWAGYPGQAGGA 557
Query: 569 ALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPF 627
A+A+++FG NPGG+LP+TWYP++Y+ K+PMT+M MR ++G PGRTYRFY G V+PF
Sbjct: 558 AIADILFGTANPGGKLPVTWYPEEYLTKLPMTNMAMRATKSAGYPGRTYRFYNGPVVYPF 617
Query: 628 GCGLSYSKYSYKFKA---VSQNKLYLNQSSSTKMVENQ--DVVHYKSVPELGTEFCETRK 682
G GL+Y+ + + + V L ++ ++ + N+ V H + C+
Sbjct: 618 GHGLTYTHFVHTLASAPTVVSVPLNGHRRANVTNISNRAIRVTHAR---------CDKLS 668
Query: 683 FLVTIGVKNHGEMAGKHPVLLF-VKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSP 741
+ + +KN G G H +L+F PA G+ KQLV F+ V + AK + + +
Sbjct: 669 ITLQVDIKNVGSRDGTHTLLVFSAPPAGFGHWALEKQLVAFEKVHVPAKGQHRVGVNIHV 728
Query: 742 CESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
C+ LS G+ I G H +GD ++ +S+
Sbjct: 729 CKLLSVVDRSGIRRIPLGEHSFNIGDVKHSVSL 761
>gi|357449039|ref|XP_003594795.1| Beta xylosidase [Medicago truncatula]
gi|355483843|gb|AES65046.1| Beta xylosidase [Medicago truncatula]
Length = 762
Score = 754 bits (1948), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 378/767 (49%), Positives = 506/767 (65%), Gaps = 30/767 (3%)
Query: 15 LCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVN 74
+ S+L + + PF+CDP N T ++ FC T +PI R +DL+ RL L EKI +VN
Sbjct: 11 MLLISILVTLSEGRVPFACDPKNGLTRSYKFCNTRVPIHARVQDLIGRLALPEKIRLVVN 70
Query: 75 SAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRI 134
+A A+PRLGI YEWWSEALHGV+ VG G F G ATSFPQVI TAASF+ LW I
Sbjct: 71 NAIAVPRLGIQGYEWWSEALHGVSNVGPGTKFGGAFSAATSFPQVITTAASFNQSLWLEI 130
Query: 135 GQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGV 194
G+ + EARA+YN G A G+TFW+PN+NIFRDPRWGRGQETPGEDP V GKYA SYV+G+
Sbjct: 131 GRIVSDEARAMYNGGAA-GLTFWSPNVNIFRDPRWGRGQETPGEDPTVAGKYAASYVQGL 189
Query: 195 QGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCV 254
QG NG +L+ +ACCKH+TAYDLDNW G R+ F+A+V+ QDLADTY PF++CV
Sbjct: 190 QG---NGAG--NRLKVAACCKHYTAYDLDNWNGVDRFHFNAKVSKQDLADTYDVPFKACV 244
Query: 255 KQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKS 314
+ G+ + +MC+YN+VNG P+CAD LL T R +WG +GYI SDCD+V ++YD + Y ++
Sbjct: 245 RDGKVASVMCSYNQVNGKPTCADPELLRNTIRGEWGLNGYIVSDCDSVGVLYDNQHYTRT 304
Query: 315 PEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNP 374
PE A +KAG+D++CG FL HT A+KQ + E++++ AL NL +V+MRLG+F+G+
Sbjct: 305 PEQAAAAAIKAGLDLDCGPFLALHTDGAIKQGLISENDLNLALANLITVQMRLGMFDGD- 363
Query: 375 TMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSA 434
QP+G +G VC P+H +AL+AA+ GIVLL+N LPL ++ ++ +IGPN++
Sbjct: 364 -AQPYGNLGTRDVCLPSHNDVALEAARQGIVLLQNKGNALPLSPTRYRTVGVIGPNSDVT 422
Query: 435 KTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASI-DKAVDIAKGADHVVL 493
T++GNYAG +C TPLQ + YV+ T++ GC V C + + +A+ AD VL
Sbjct: 423 VTMIGNYAGIACGYTTPLQGIARYVK-TIHQAGCKDVGCGGNQLFGLSEQVARQADATVL 481
Query: 494 MMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNI 553
+MGLDQ+ E E DR L+LPG QQEL++RVA AA+ PVILVL+ GGP+D+TFAK D I
Sbjct: 482 VMGLDQSIEAEFRDRTGLLLPGHQQELVSRVARAARGPVILVLMSGGPIDVTFAKNDPKI 541
Query: 554 GSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNP 612
+ILW GYPG++G A+A+VIFG NP GRLP TWYPQDY+ KVPMT+M MR +G P
Sbjct: 542 SAILWVGYPGQSGGTAIADVIFGRTNPSGRLPNTWYPQDYVRKVPMTNMDMRANPATGYP 601
Query: 613 GRTYRFYEGKEVFPFGCGLSYSKYSYKF----KAVS-QNKLYLNQSSSTKMVENQDVVHY 667
GRTYRFY+G VFPFG GLSYS++++ K VS Q L Q+ + + V H
Sbjct: 602 GRTYRFYKGPVVFPFGHGLSYSRFTHSLALAPKQVSVQFTTPLTQAFTNSSNKAMKVSHA 661
Query: 668 KSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVIL 727
C+ + + VKN G M G H +L++ K NG +KQLV F +
Sbjct: 662 N---------CDELEVGFHVDVKNEGSMDGAHTLLVYSKAP---NG--VKQLVNFHKTYV 707
Query: 728 NAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
A K + + C LS E G+ I G H L +GD ++ I +
Sbjct: 708 PAGSKTRVKVGVHVCNHLSAVDEFGVRRIPMGEHELQIGDLKHSILV 754
>gi|356531391|ref|XP_003534261.1| PREDICTED: probable beta-D-xylosidase 6-like [Glycine max]
Length = 780
Score = 754 bits (1948), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 394/747 (52%), Positives = 506/747 (67%), Gaps = 17/747 (2%)
Query: 40 TETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAG 99
+++ PFC T+LP RAR LVS LTL EKI L N+A +IPRLGIPAY+WWSE+LHG+A
Sbjct: 36 SQSHPFCDTSLPTLTRARSLVSLLTLPEKILLLSNNASSIPRLGIPAYQWWSESLHGLAL 95
Query: 100 VGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAP 159
G G+ F G + ATSFPQVIL+AASF+ LW R AI EARA++N GQA G+TFWAP
Sbjct: 96 NGPGVSFAGAVPSATSFPQVILSAASFNRSLWLRTAAAIAREARAMFNVGQA-GLTFWAP 154
Query: 160 NINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG----DTFNGGKLKGKLQASACCK 215
NIN+FRDPRWGRGQETPGEDP++ YAV YVRG+QG L SACCK
Sbjct: 155 NINLFRDPRWGRGQETPGEDPMLASAYAVEYVRGLQGLSGIQDAVVVDDDDTLMVSACCK 214
Query: 216 HFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSC 275
HFTAYDLD W +RY F+A V+ QDL DTYQPPF SC++QG+AS +MC+YN VNG+P+C
Sbjct: 215 HFTAYDLDMWGQFSRYNFNAVVSQQDLEDTYQPPFRSCIQQGKASCLMCSYNEVNGVPAC 274
Query: 276 ADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFL 335
A LL AR +WGF GYITSDCDAV+ +Y+ + YAKS EDAV DVLKAGMD+NCG+F+
Sbjct: 275 ASEELLG-LARDKWGFKGYITSDCDAVATVYEYQKYAKSQEDAVADVLKAGMDINCGTFM 333
Query: 336 QKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVL 395
+HT++A++Q K+ E ++DRAL NLFSV++RLGLF+G+P FGK+G VC+ H+ L
Sbjct: 334 LRHTESAIEQGKVKEEDLDRALLNLFSVQLRLGLFDGDPIRGRFGKLGPKDVCTQEHKTL 393
Query: 396 ALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL 455
AL AA+ GIVLLKN LPL + SLA+IGP A + K L G Y+G C S + + L
Sbjct: 394 ALDAARQGIVLLKNDKKFLPLDRDIGASLAVIGPLATTTK-LGGGYSGIPCSSSSLYEGL 452
Query: 456 QNYVENTVYYPGCDTVACSSAS-IDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLP 514
+ E Y GC V C S +A+D AK AD VV++ GLD TQE E+ DRV L+LP
Sbjct: 453 GEFAERISYAFGCYDVPCDSDDGFAEAIDTAKQADFVVIVAGLDATQETEDHDRVSLLLP 512
Query: 515 GRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVI 574
G+Q L++ VA+A+K PVILVL+ GGP+D++FA+ + I SI+W GYPGEAG ALAE+I
Sbjct: 513 GKQMNLVSSVADASKNPVILVLIGGGPLDVSFAEKNPQIASIIWLGYPGEAGGKALAEII 572
Query: 575 FGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYS 634
FG+ NP GRLPMTWYP+ + VPM +M MR + G PGRTYRFY G V+ FG GLS+S
Sbjct: 573 FGEFNPAGRLPMTWYPEAFTNVPMNEMSMRADPSRGYPGRTYRFYTGGRVYGFGHGLSFS 632
Query: 635 KYSYKFKAVSQNKLYLNQS----SSTKM---VENQDVVHYKSVPELGTEFCETRKFLVTI 687
+SY F + + +K+ L+++ S ++ VEN+ V VP + C F V I
Sbjct: 633 DFSYNFLS-APSKISLSRTIKDGSRKRLLYQVENE-VYGVDYVPVNQLQNCNKLSFSVHI 690
Query: 688 GVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSR 747
V N G + G H V+LF K + +G P QLVGF + + + E + PCE LS
Sbjct: 691 SVMNLGGLDGSHVVMLFSKGPKVVDGSPETQLVGFSRLHTISSKPTETSILVHPCEHLSF 750
Query: 748 AREDGLMVIEEGTHFLVVGDEEYPISI 774
A + G ++ G H L VGD E+ +SI
Sbjct: 751 ADKQGKRILPLGPHTLSVGDLEHVVSI 777
>gi|408354264|gb|AFU54451.1| alpha-L-arabinofuranosidase/beta-D-xylosidase [Prunus salicina]
Length = 775
Score = 754 bits (1947), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/757 (49%), Positives = 502/757 (66%), Gaps = 21/757 (2%)
Query: 24 VDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLG 83
V +PPF+CDP NP T FC+ T+PI R +DL+ RLTL EKI LVN+A A+PRLG
Sbjct: 25 VVHARPPFACDPHNPITRGLKFCRVTVPIHVRVQDLIGRLTLQEKIRLLVNNAIAVPRLG 84
Query: 84 IPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEAR 143
I YEWWSEALHGV+ VG G F G GATSFPQVI TAASF+ LW IG+ + EAR
Sbjct: 85 IQGYEWWSEALHGVSNVGPGTKFGGAFPGATSFPQVITTAASFNESLWQEIGRGVPDEAR 144
Query: 144 ALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGK 203
A+YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP++ KYA YV+G+QGD
Sbjct: 145 AMYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLASKYAARYVKGLQGD-----G 198
Query: 204 LKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIM 263
+L+ +ACCKH+TAYDLDNW G R+ F+ARV+ QDLADTY PF++CV +G + +M
Sbjct: 199 AGNRLKVAACCKHYTAYDLDNWNGVNRFHFNARVSKQDLADTYNVPFKACVVEGHVASVM 258
Query: 264 CAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVL 323
C+YN+VNG P+CAD +LL T R QW +GYI SDCD+V ++Y+ + Y ++PE+A D +
Sbjct: 259 CSYNQVNGKPTCADPDLLKGTIRGQWRLNGYIVSDCDSVGVLYEEQHYTRTPEEAAADAI 318
Query: 324 KAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIG 383
KAG+D++CG FL HT+AAV++ + + EI+ AL N +V+MRLG+F+G P+ +G +G
Sbjct: 319 KAGLDLDCGPFLAIHTEAAVRRGLVSQLEINWALANTMTVQMRLGMFDGEPSAHQYGNLG 378
Query: 384 ADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG 443
VC+PAHQ LAL+AA+ GIVLL+N LPL + ++A+IGPN++ T++GNYAG
Sbjct: 379 PRDVCTPAHQQLALEAARQGIVLLENRGRSLPLSIRRHRTVAVIGPNSDVTVTMIGNYAG 438
Query: 444 PSCRSITPLQALQNYVENTVYYPGCDTVACSSASI-DKAVDIAKGADHVVLMMGLDQTQE 502
+C TPLQ + Y T++ GC V C+ + A A+ AD VL+MGLDQ+ E
Sbjct: 439 VACGYTTPLQGIGRYTR-TIHQAGCTDVHCNGNQLFGAAEAAARQADATVLVMGLDQSIE 497
Query: 503 KEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYP 562
E +DRV L+LPG QQEL++RVA A++ P ILVL+ GGP+D+TFAK D I +I+W GYP
Sbjct: 498 AEFVDRVGLLLPGHQQELVSRVARASRGPTILVLMSGGPIDVTFAKNDPRISAIIWVGYP 557
Query: 563 GEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEG 621
G+AG A+A+V+FG NPGG+LPMTWYPQ+Y+ +PMTDM MR G PGRTYRFY G
Sbjct: 558 GQAGGTAIADVLFGTTNPGGKLPMTWYPQNYVTHLPMTDMAMRADPARGYPGRTYRFYRG 617
Query: 622 KEVFPFGCGLSYSKYSYKF----KAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEF 677
VFPFG GLSY+ +++ +VS L ++++ M+ V + L
Sbjct: 618 PVVFPFGLGLSYTTFAHNLAHGPTSVSVPLTSLKATANSTMLSKAVRVSHADCNALS--- 674
Query: 678 CETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVF 737
V + VKN G M G H +L+F P G KQLVGF + + A + +
Sbjct: 675 ----PLDVHVDVKNTGSMDGTHTLLVFTSPP-DGKWAASKQLVGFHKIHIAAGSETRVRI 729
Query: 738 ELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
+ C+ LS G+ I G H L +GD + +S+
Sbjct: 730 AVHVCKHLSVVDRFGIRRIPLGEHKLQIGDLSHHVSL 766
>gi|408354266|gb|AFU54452.1| alpha-L-arabinofuranosidase/beta-D-xylosidase [Prunus salicina]
Length = 775
Score = 754 bits (1946), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/757 (49%), Positives = 502/757 (66%), Gaps = 21/757 (2%)
Query: 24 VDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLG 83
V +PPF+CDP NP T FC+ T+PI R +DL+ RLTL EKI LVN+A A+PRLG
Sbjct: 25 VVHARPPFACDPHNPITRGLKFCRVTVPIHVRVQDLIGRLTLQEKIRLLVNNAIAVPRLG 84
Query: 84 IPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEAR 143
I YEWWSEALHGV+ VG G F G GATSFPQVI TAASF+ LW IG+ + EAR
Sbjct: 85 IQGYEWWSEALHGVSNVGPGTKFGGAFPGATSFPQVITTAASFNESLWQEIGRVVPDEAR 144
Query: 144 ALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGK 203
A+YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP++ KYA YV+G+QGD
Sbjct: 145 AMYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLASKYAARYVKGLQGD-----G 198
Query: 204 LKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIM 263
+L+ +ACCKH+TAYDLDNW G R+ F+ARV+ QDLADTY PF++CV +G + +M
Sbjct: 199 AGNRLKVAACCKHYTAYDLDNWNGVNRFHFNARVSKQDLADTYNVPFKACVVEGHVASVM 258
Query: 264 CAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVL 323
C+YN+VNG P+CAD +LL T R QW +GYI SDCD+V ++Y+ + Y ++PE+A D +
Sbjct: 259 CSYNQVNGKPTCADPDLLKGTIRGQWRLNGYIVSDCDSVGVLYEEQHYTRTPEEAAADAI 318
Query: 324 KAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIG 383
KAG+D++CG FL HT+AAV++ + + EI+ AL N +V+MRLG+F+G P+ +G +G
Sbjct: 319 KAGLDLDCGPFLAIHTEAAVRRGLVSQLEINWALANTMTVQMRLGMFDGEPSAHQYGNLG 378
Query: 384 ADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG 443
VC+PAHQ LAL+AA+ GIVLL+N LPL + ++A+IGPN++ T++GNYAG
Sbjct: 379 PRDVCTPAHQQLALEAARQGIVLLENRGRSLPLSIRRHRTVAVIGPNSDVTVTMIGNYAG 438
Query: 444 PSCRSITPLQALQNYVENTVYYPGCDTVACSSASI-DKAVDIAKGADHVVLMMGLDQTQE 502
+C TPLQ + Y T++ GC V C+ + A A+ AD VL+MGLDQ+ E
Sbjct: 439 VACGYTTPLQGIGRYTR-TIHQAGCTDVHCNGNQLFGAAEAAARQADATVLVMGLDQSIE 497
Query: 503 KEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYP 562
E +DRV L+LPG QQEL++RVA A++ P ILVL+ GGP+D+TFAK D I +I+W GYP
Sbjct: 498 AEFVDRVGLLLPGHQQELVSRVARASRGPTILVLMSGGPIDVTFAKNDPRISAIIWVGYP 557
Query: 563 GEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEG 621
G+AG A+A+V+FG NPGG+LPMTWYPQ+Y+ +PMTDM MR G PGRTYRFY G
Sbjct: 558 GQAGGTAIADVLFGTTNPGGKLPMTWYPQNYVTHLPMTDMAMRADPARGYPGRTYRFYRG 617
Query: 622 KEVFPFGCGLSYSKYSYKF----KAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEF 677
VFPFG GLSY+ +++ +VS L ++++ M+ V + L
Sbjct: 618 PVVFPFGLGLSYTTFAHNLAHGPTSVSVPLTSLKATANSTMLSKAVRVSHADCNALS--- 674
Query: 678 CETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVF 737
V + VKN G M G H +L+F P G KQLVGF + + A + +
Sbjct: 675 ----PLDVHVDVKNTGSMDGTHTLLVFTSPP-DGKWAASKQLVGFHKIHIAAGSETRVRI 729
Query: 738 ELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
+ C+ LS G+ I G H L +GD + +S+
Sbjct: 730 AVHVCKHLSVVDRFGIRRIPLGEHKLQIGDLSHHVSL 766
>gi|326492918|dbj|BAJ90315.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 775
Score = 753 bits (1945), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 387/756 (51%), Positives = 502/756 (66%), Gaps = 15/756 (1%)
Query: 25 DSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGI 84
++ P F+CD SN + + FC S RARDLVSRLTL EK+ LVN PA+ RLGI
Sbjct: 31 EAQAPVFACDASNATLAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPALGRLGI 90
Query: 85 PAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA 144
PAYEWWSEALHGV+ VG G F+ + GATSFPQ ILTAASF++ L+ IG+ + EARA
Sbjct: 91 PAYEWWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVSTEARA 150
Query: 145 LYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKL 204
++N G A G+TFW+PNINIFRDPRWGRGQETPGEDPL+ KYAV YV G+Q D GG
Sbjct: 151 MHNVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQ-DAGAGGVT 208
Query: 205 KGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMC 264
G L+ +ACCKH+TAYD+DNWKG RY FDA+V+ QDL DT+QPPF+SCV G + +MC
Sbjct: 209 DGALKVAACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVLDGNVASVMC 268
Query: 265 AYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLK 324
+YN+VNG P+CAD++LL R W +GYI SDCD+V ++Y + Y K+PE+A +K
Sbjct: 269 SYNKVNGKPTCADKDLLEGVIRGDWKLNGYIVSDCDSVDVLYTQQHYTKTPEEAAAITIK 328
Query: 325 AGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGA 384
+G+D+NCG+FL +HT AAV+ +L E ++DRA+ N F + MRLG F+G+P FG +G
Sbjct: 329 SGLDLNCGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLGFFDGDPRQLAFGSLGP 388
Query: 385 DVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGP 444
VC+ +++ LA + A+ GIVLLKNS G LPL S+A+IGPNAN++ T++GNY G
Sbjct: 389 KDVCTSSNRELARETARQGIVLLKNS-GALPLSAKSIKSMAVIGPNANASFTMIGNYEGT 447
Query: 445 SCRSITPLQALQNYVENTVYYPGCDTVACSSASID--KAVDIAKGADHVVLMMGLDQTQE 502
C+ TPLQ L V NTVY PGC V CS S+ AV A AD VL++G DQ+ E
Sbjct: 448 PCKYTTPLQGLGAKV-NTVYQPGCTNVGCSGNSLQLSTAVAAAASADVTVLVVGADQSIE 506
Query: 503 KEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYP 562
+E LDR L+LPG+Q +L++ VA A+ PVILV++ GGP DI+FAK I +ILW GYP
Sbjct: 507 RESLDRTSLLLPGQQTQLVSAVANASSGPVILVVMSGGPFDISFAKASDKIAAILWVGYP 566
Query: 563 GEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK-VPMTDMKMRPQATSGNPGRTYRFYEG 621
GEAG ALA+++FG HNP G+LP+TWYP Y V MTDM+MRP ++G PGRTYRFY G
Sbjct: 567 GEAGGAALADILFGSHNPSGKLPVTWYPASYADTVTMTDMRMRPDTSTGYPGRTYRFYTG 626
Query: 622 KEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVP-ELGTEFCET 680
VF FG GLSY+K S+ VS Y+ S ++ E+ + E + C+
Sbjct: 627 DTVFAFGDGLSYTKMSHSL--VSAPPSYV----SMRLAEDHPCRAEECASVEAAGDHCDD 680
Query: 681 RKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELS 740
F V + V+N GE+AG H VLLF P N P K L+GF+ V L E + F +
Sbjct: 681 LAFDVKLQVRNAGEVAGAHSVLLFSSPPPAHNA-PAKHLLGFEKVSLAPGEAGTVAFRVD 739
Query: 741 PCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
C LS E G + G H L VGD ++ + + V
Sbjct: 740 VCRDLSVVDELGGRKVALGGHTLHVGDLKHTVELRV 775
>gi|414586138|tpg|DAA36709.1| TPA: putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 769
Score = 753 bits (1945), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 381/742 (51%), Positives = 514/742 (69%), Gaps = 13/742 (1%)
Query: 39 STETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVA 98
ST +PFC +L I RAR LVS LTLDEKI+QL N+A +PRLGIP Y+WWSE+LHG+A
Sbjct: 33 STSAYPFCDASLSIPARARALVSLLTLDEKIAQLSNTAGGVPRLGIPPYQWWSESLHGLA 92
Query: 99 GVGKGI-FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFW 157
G G+ F +G +R AT FPQVIL+ A+F+ LW + +A+ EA ++NAGQA G+T+W
Sbjct: 93 DNGPGVNFSSGPVRAATDFPQVILSTAAFNRSLWRAVAEAVATEALGMHNAGQA-GLTYW 151
Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
APNINIFRDPRWGRGQET GEDP V Y++ YV+G QG+ +G+++ SACCKH+
Sbjct: 152 APNINIFRDPRWGRGQETSGEDPAVAAAYSLEYVKGFQGEE----GEEGRIRLSACCKHY 207
Query: 218 TAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCAD 277
TAYD++ W+G +RY F+A+V QDL DTYQPPF++C+++ RAS +MCAYN+VNG+P CA
Sbjct: 208 TAYDMEKWEGFSRYTFNAKVNAQDLEDTYQPPFKTCIQEARASCLMCAYNQVNGVPMCAH 267
Query: 278 RNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQK 337
++LL KT R +WGF GYITSDCDAV+II++ + Y KS ED++ VLKAGMD+NCGSFL +
Sbjct: 268 KDLLQKT-RDEWGFQGYITSDCDAVAIIHENQTYTKSGEDSIAIVLKAGMDINCGSFLVR 326
Query: 338 HTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLAL 397
HTK+A+++ K+ E +IDRAL NLFSV++RLG+F+ Q F ++G + VC+ H+ LA
Sbjct: 327 HTKSAIEKGKIQEEDIDRALFNLFSVQLRLGIFDKPSNNQWFSQLGPNSVCTKEHRELAA 386
Query: 398 QAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQN 457
+A + G VLLKN H LPL +S+ +A+IGP+AN A + G+Y G C T L+ +Q
Sbjct: 387 EAVRQGAVLLKNDHNFLPLKRSEVRHVAIIGPSANDAYAMGGDYTGVPCNPTTFLKGIQA 446
Query: 458 YVENTVYYPGCDTVACSSASI-DKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGR 516
Y T + PGC +C+S + +AV+ AK AD VV++ GL+ T+E+E+ DRV L+LPG+
Sbjct: 447 YATQTSFAPGCKDASCNSTDLFGEAVEAAKRADIVVVIAGLNLTEEREDFDRVSLLLPGK 506
Query: 517 QQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFG 576
Q LI +A AKKP++LVLL GGPVD++FAK D I SILW GYPGE G L E++FG
Sbjct: 507 QMGLIHAIASVAKKPLVLVLLGGGPVDVSFAKQDPRIASILWLGYPGEVGGQVLPEILFG 566
Query: 577 DHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKY 636
++NPGG+LP+TWYP+ + +PMTDM MR + G PGRTYRFY G V+ FG GLSYSKY
Sbjct: 567 EYNPGGKLPITWYPESFTAIPMTDMNMRADPSRGYPGRTYRFYTGDVVYGFGYGLSYSKY 626
Query: 637 SYKFKAVSQNKLYLNQSSSTKMVENQDVVHYK----SVPELGTEFCETRKFLVTIGVKNH 692
SY + + K+ +++SS ++ + + SV CE F V + V NH
Sbjct: 627 SYSISSAPK-KITVSRSSDLGIISRKPAYTRRDGLGSVKTEDIASCEALVFSVHVAVSNH 685
Query: 693 GEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDG 752
G M G H VLLF + G PIKQLVGF+SV A + + + PC+ +S A +G
Sbjct: 686 GSMDGSHAVLLFARSKSSVPGFPIKQLVGFESVHTAAGSASNVEITVDPCKQMSAANPEG 745
Query: 753 LMVIEEGTHFLVVGDEEYPISI 774
V+ G H L VGDEE+ +SI
Sbjct: 746 KRVLLLGAHVLTVGDEEFELSI 767
>gi|292630923|sp|A5JTQ3.1|XYL2_MEDVA RecName: Full=Beta-xylosidase/alpha-L-arabinofuranosidase 2;
AltName: Full=Xylan
1,4-beta-xylosidase/Alpha-N-arabinofuranosidase 2;
Short=MsXyl2; Includes: RecName: Full=Beta-xylosidase;
AltName: Full=1,4-beta-D-xylan xylohydrolase; AltName:
Full=Xylan 1,4-beta-xylosidase; Includes: RecName:
Full=Alpha-N-arabinofuranosidase; AltName:
Full=Alpha-L-arabinofuranosidase; Short=Arabinosidase;
Flags: Precursor
gi|146762263|gb|ABQ45228.1| beta-xylosidase/alpha-L-arabinosidase [Medicago sativa subsp. x
varia]
Length = 774
Score = 753 bits (1944), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/777 (48%), Positives = 510/777 (65%), Gaps = 24/777 (3%)
Query: 9 VFPLLCLCFTSLLT---RVDS-TQPPFSCD-PSNPSTETFPFCKTTLPISQRARDLVSRL 63
VF + F +LL RV S T F+CD NP+ + FC L + R +DLV RL
Sbjct: 13 VFLCFFVLFATLLLSGGRVSSQTSAVFACDVAKNPALANYGFCNKKLSVDARVKDLVRRL 72
Query: 64 TLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTA 123
TL EK+ LVNSA + RLGIP YEWWSEALHGV+ +G G F+ I GATSFP IL A
Sbjct: 73 TLQEKVGNLVNSAVDVSRLGIPKYEWWSEALHGVSNIGPGTHFSNVIPGATSFPMPILIA 132
Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVT 183
ASF++ L+ IG+ + EARA++N G A G+T+W+PNINIFRDPRWGRGQETPGEDPL+
Sbjct: 133 ASFNASLFQTIGKVVSTEARAMHNVGLA-GLTYWSPNINIFRDPRWGRGQETPGEDPLLA 191
Query: 184 GKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLA 243
KYA YV+G+Q T +G KL+ +ACCKH+TAYD+D+WKG RY F+A VT QDL
Sbjct: 192 SKYAAGYVKGLQ-QTDDGDS--NKLKVAACCKHYTAYDVDDWKGVQRYTFNAVVTQQDLD 248
Query: 244 DTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVS 303
DTYQPPF+SCV G + +MC+YN+VNG P+CAD +LL R +W +GYI SDCD+V
Sbjct: 249 DTYQPPFKSCVIDGNVASVMCSYNQVNGKPTCADPDLLKGVIRGKWKLNGYIVSDCDSVD 308
Query: 304 IIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSV 363
+++ + Y K+PE+A + AG+D+NCGSFL ++T+ AVKQ + E+ I+ A++N F+
Sbjct: 309 VLFKNQHYTKTPEEAAAKSILAGLDLNCGSFLGRYTEGAVKQGLIGEASINNAVYNNFAT 368
Query: 364 RMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVS 423
MRLG F+G+P+ QP+G +G VC+ A+Q LA +AA+ GIVLLKN G LPL S
Sbjct: 369 LMRLGFFDGDPSKQPYGNLGPKDVCTSANQELAREAARQGIVLLKNCAGSLPLNAKAIKS 428
Query: 424 LALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVD 483
LA+IGPNAN+ + ++GNY G C+ +PLQ L V T + GC V C++A++D A
Sbjct: 429 LAVIGPNANATRAMIGNYEGIPCKYTSPLQGLTALVP-TSFAAGCPDVQCTNAALDDAKK 487
Query: 484 IAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVD 543
IA AD V+++G + E E DR++++LPG+QQ+L+T VA AK PVIL ++ GG +D
Sbjct: 488 IAASADATVIVVGANLAIEAESHDRINILLPGQQQQLVTEVANVAKGPVILAIMSGGGMD 547
Query: 544 ITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMK 602
++FAK ++ I SILW GYPGEAG A+A+VIFG HNP GRLPMTWYPQ Y+ KVPMT+M
Sbjct: 548 VSFAKTNKKITSILWVGYPGEAGGAAIADVIFGYHNPSGRLPMTWYPQSYVDKVPMTNMN 607
Query: 603 MRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQ---NKLYLNQSSSTKMV 659
MRP +G PGRTYRFY+G+ VF FG G+SYS + +K Q L + +
Sbjct: 608 MRPDPATGYPGRTYRFYKGETVFSFGDGISYSTFEHKLVKAPQLVSVPLAEDHVCRSSKC 667
Query: 660 ENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQL 719
++ DVV E C+ F + + +KN G+M+ V LF P N P K L
Sbjct: 668 KSLDVV---------GEHCQNLAFDIHLRIKNKGKMSSSQTVFLFSTPPAVHNA-PQKHL 717
Query: 720 VGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
+ F+ V+L K +A + F++ C+ L E G + G H L VGD ++P+S+ +
Sbjct: 718 LAFEKVLLTGKSEALVSFKVDVCKDLGLVDELGNRKVALGKHMLHVGDLKHPLSVMI 774
>gi|356558612|ref|XP_003547598.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 1-like
[Glycine max]
Length = 776
Score = 753 bits (1944), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 380/780 (48%), Positives = 514/780 (65%), Gaps = 22/780 (2%)
Query: 1 MIFHKLSLVFPLLCLCFTSLLTRVD-STQPPFSCD-PSNPSTETFPFCKTTLPISQRARD 58
+ F + V +L C RV T F+CD NP+ + FC +L + R D
Sbjct: 15 LCFFSFTFVASVLLNC-----NRVSGQTSAVFACDVAKNPALAGYGFCDKSLSVEDRVAD 69
Query: 59 LVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQ 118
LV RLTL EKI LVNSA ++ RLGIP YEWWSEALHGV+ VG G F+ + GATSFP
Sbjct: 70 LVKRLTLQEKIGSLVNSATSVSRLGIPKYEWWSEALHGVSNVGPGTHFSSLVPGATSFPM 129
Query: 119 VILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGE 178
ILTAASF++ L+ IG+ + EARA+YN G A G+T+W+PNINIFRDPRWGRGQETPGE
Sbjct: 130 PILTAASFNASLFEAIGRVVSTEARAMYNVGLA-GLTYWSPNINIFRDPRWGRGQETPGE 188
Query: 179 DPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVT 238
DPL++ KYA YV+G+Q T +G KL+ +ACCKH+TAYDLDNWKG RY F+A VT
Sbjct: 189 DPLLSSKYATGYVKGLQ-QTDDGDS--NKLKVAACCKHYTAYDLDNWKGIQRYTFNAVVT 245
Query: 239 MQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSD 298
QD+ DT+QPPF+SCV G + +MC+YN+VNG P+CAD +LL R +W +GYI SD
Sbjct: 246 QQDMDDTFQPPFKSCVIDGNVASVMCSYNQVNGKPTCADPDLLKGIIRGEWKLNGYIVSD 305
Query: 299 CDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALH 358
CD+V +++ + Y K+PE+A + AG+D+NCG++L ++T+ AVKQ L E+ I+ A+
Sbjct: 306 CDSVEVLFKDQHYTKTPEEAAAQTILAGLDLNCGNYLGQYTEGAVKQGLLDEASINNAVS 365
Query: 359 NLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPK 418
N F+ MRLG F+G+P+ QP+G +G VC+ ++ LA +AA+ GIVLLKNS G LPL
Sbjct: 366 NNFATLMRLGFFDGDPSKQPYGNLGPKDVCTSENRELAREAARQGIVLLKNSPGSLPLNA 425
Query: 419 SKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASI 478
SLA+IGPNAN+ + ++GNY G C I+PLQ L V T Y GC V C++A +
Sbjct: 426 KTIKSLAVIGPNANATRVMIGNYEGIPCNYISPLQTLTALVP-TSYAAGCPNVQCANAEL 484
Query: 479 DKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLC 538
D A IA AD V+++G E E LDR++++LPG+QQ L++ VA A+K PVILV++
Sbjct: 485 DDATQIAASADATVIIVGASLAIEAESLDRINILLPGQQQLLVSEVANASKGPVILVIMS 544
Query: 539 GGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVP 597
GG +D++FAK + I SILW GYPGEAG A+A+VIFG +NP GRLPMTWYPQ Y+ KVP
Sbjct: 545 GGGMDVSFAKSNDKITSILWVGYPGEAGGAAIADVIFGFYNPSGRLPMTWYPQAYVNKVP 604
Query: 598 MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTK 657
MT+M MR +G PGRTYRFY+G+ VF FG G+S+S +K Q S
Sbjct: 605 MTNMNMRADPATGYPGRTYRFYKGETVFSFGDGISFSSIEHKIVKAP-------QLVSVP 657
Query: 658 MVENQDVVHYKSVP-ELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPI 716
+ E+ + + + ++ E C+ F + +GVKN G+M+ H VLLF P N P
Sbjct: 658 LAEDHECRSSECMSLDIADEHCQNLAFDIHLGVKNTGKMSTSHVVLLFFTPPDVHNA-PQ 716
Query: 717 KQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
K L+GF+ V L K +A++ F++ C+ LS E G + G H L VG+ ++P+S+ V
Sbjct: 717 KHLLGFEKVHLPGKSEAQVRFKVDVCKDLSVVDELGNRKVPLGQHLLHVGNLKHPLSLRV 776
>gi|350534908|ref|NP_001233910.1| beta-D-xylosidase 1 precursor [Solanum lycopersicum]
gi|37359706|dbj|BAC98298.1| LEXYL1 [Solanum lycopersicum]
Length = 770
Score = 753 bits (1943), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/766 (49%), Positives = 507/766 (66%), Gaps = 17/766 (2%)
Query: 15 LCFTSLLTRVDSTQPP-FSCD-PSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQL 72
+ +SLL +V + P F+CD SNP+ FC +L + R DLV+RLTL EKI L
Sbjct: 18 ILLSSLLKQVLAQNSPVFACDVTSNPALGNLTFCDASLAVENRVNDLVNRLTLGEKIGFL 77
Query: 73 VNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWY 132
V+ A + RLGIP YEWWSEALHGVA G G+ F + GATSFPQVILTAASF+ L+
Sbjct: 78 VSGAGGVSRLGIPKYEWWSEALHGVAYTGPGVHFTSLVPGATSFPQVILTAASFNVTLFQ 137
Query: 133 RIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVR 192
IG+ + EARA+YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP +T KY V+YV
Sbjct: 138 TIGKVVSTEARAMYNVGLA-GLTYWSPNVNIFRDPRWGRGQETPGEDPTLTSKYGVAYVE 196
Query: 193 GVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFES 252
G+Q T +G KL+ +ACCKH+TAYD+DNWKG RY F+A V QDL DT+QPPF S
Sbjct: 197 GLQ-QTDDGST--NKLKVAACCKHYTAYDVDNWKGIERYSFNAVVRQQDLDDTFQPPFRS 253
Query: 253 CVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYA 312
CV +G + +MC+YN+VNG P+C D NLL+ R +W +GYI +DCD++ +I+ ++ Y
Sbjct: 254 CVLEGAVASVMCSYNQVNGKPTCGDPNLLAGIVRGEWKLNGYIVTDCDSLQVIFKSQNYT 313
Query: 313 KSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNG 372
K+PE+A L +G+D+NCGS+L +T+ AV QK + ES IDRA+ N F+ MRLG F+G
Sbjct: 314 KTPEEAAALGLNSGVDLNCGSWLSTYTQGAVNQKLVNESVIDRAISNNFATLMRLGFFDG 373
Query: 373 NPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNAN 432
NP + +G +G VC+P +Q LA +AA+ GIVLLKN+ G LPL + SLA+IGPNAN
Sbjct: 374 NPKSRIYGNLGPKDVCTPENQELAREAARQGIVLLKNTAGSLPLTPTAIKSLAVIGPNAN 433
Query: 433 SAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVV 492
KT++GNY G C+ TPLQ L V T+Y PGC V+C++A ID A IA AD VV
Sbjct: 434 VTKTMIGNYEGIPCKYTTPLQGLTASVA-TIYKPGCADVSCNTAQIDDAKQIATTADAVV 492
Query: 493 LMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRN 552
L+MG DQ+ EKE LDR + LPG+Q L+ VA+ AK PVILV++ GG +D+ FA +
Sbjct: 493 LVMGSDQSIEKESLDRTSITLPGQQSILVAEVAKVAKGPVILVIMSGGGMDVQFAVDNPK 552
Query: 553 IGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK-VPMTDMKMRPQATSGN 611
I SILW G+PGEAG ALA+VIFG +NP GRLPMTWYPQ Y VPMTDM MRP +
Sbjct: 553 ITSILWVGFPGEAGGAALADVIFGYYNPSGRLPMTWYPQSYADVVPMTDMNMRPNPATNY 612
Query: 612 PGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQ-NKLYLNQSSSTKMVENQDVVHYKSV 670
PGRTYRFY G VF FG GLSYS++ + Q L L + + ++ + K+V
Sbjct: 613 PGRTYRFYTGPTVFTFGHGLSYSQFKHHLDKAPQFVSLPLGEKHTCRLSK------CKTV 666
Query: 671 PELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAK 730
+G + C F + + VKN G+++G H + LF P N P K L+GF+ V L +
Sbjct: 667 DAVG-QSCSNMGFDIHLRVKNVGKISGSHIIFLFTSPPSVHNA-PKKHLLGFEKVHLTPQ 724
Query: 731 EKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
+ + F ++ C+ LS E G + G H L +GD ++ +++ +
Sbjct: 725 GEGVVKFNVNVCKHLSVHDELGNRKVALGPHVLHIGDLKHSLTVRI 770
>gi|255573163|ref|XP_002527511.1| Beta-glucosidase, putative [Ricinus communis]
gi|223533151|gb|EEF34909.1| Beta-glucosidase, putative [Ricinus communis]
Length = 810
Score = 753 bits (1943), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/756 (49%), Positives = 497/756 (65%), Gaps = 18/756 (2%)
Query: 31 FSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWW 90
F+CD ++P T + FC T+L RA+DL+SRLTL EK+ Q+VN A IPRLGIPAYEWW
Sbjct: 25 FACDKNSPQTNDYSFCNTSLSYQDRAKDLISRLTLQEKVQQVVNHAAGIPRLGIPAYEWW 84
Query: 91 SEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ 150
SEALHGV+ VG G+ FNGT+ GATSFP +IL+AASF+ LW ++GQ + EAR +++ G
Sbjct: 85 SEALHGVSNVGFGVRFNGTVPGATSFPAMILSAASFNETLWLKMGQVVSTEARTMHSVGL 144
Query: 151 AIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQA 210
A G+T+W+PN+N+FRDPRWGRGQETPGEDPLV +YAV+YVRG+Q G KL+
Sbjct: 145 A-GLTYWSPNVNVFRDPRWGRGQETPGEDPLVVSRYAVNYVRGLQEVGDEGNSTADKLKV 203
Query: 211 SACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVN 270
S+CCKH+TAYDLD WKG R+ FDA+VT QDL DTYQPPF SCV++ S +MC+YNRVN
Sbjct: 204 SSCCKHYTAYDLDKWKGVDRFHFDAKVTKQDLEDTYQPPFRSCVEEAHVSSVMCSYNRVN 263
Query: 271 GIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVN 330
GIP+CAD +LL R +W GYI SDCD++ + YD+ Y +PEDAV LKAG+++N
Sbjct: 264 GIPTCADPDLLKGIIRGEWNLDGYIVSDCDSIEVYYDSINYTATPEDAVALALKAGLNMN 323
Query: 331 CGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSP 390
CG FL K+T AVK K+ ES +D+AL F V MRLG F+G+P FG +G VCS
Sbjct: 324 CGEFLGKYTVDAVKLNKVEESVVDQALIYNFIVLMRLGFFDGDPKSLLFGNLGPSDVCSD 383
Query: 391 AHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSIT 450
HQ LAL AA+ GIVLL N G LPL K+ + +LA+IGPNAN T++ NYAG C+ T
Sbjct: 384 GHQKLALDAARQGIVLLYNK-GALPLSKNNTRNLAVIGPNANVTTTMISNYAGIPCKYTT 442
Query: 451 PLQALQNYVENTVYYPGCDTVACSSAS-IDKAVDIAKGADHVVLMMGLDQTQEKEELDRV 509
PLQ LQ YV Y GC +V+CS + ID A A AD VVL++GLDQ+ E+E LDR
Sbjct: 443 PLQGLQKYVSTVTYAAGCKSVSCSDDTLIDAATQAAAAADAVVLLVGLDQSIEREGLDRE 502
Query: 510 DLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVA 569
+L LPG Q++L+ V A V+LV++ P+D++FA I ILW GYPG+AG A
Sbjct: 503 NLTLPGFQEKLVVDVVNATNGTVVLVVMSSSPIDVSFAVNKSKIKGILWVGYPGQAGGDA 562
Query: 570 LAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFG 628
+A+V+FGD+NP GR P TWYPQ+Y +VPMTDM MR +T+ PGRTYRFY G ++ FG
Sbjct: 563 VAQVMFGDYNPAGRSPFTWYPQEYAHQVPMTDMNMRANSTANFPGRTYRFYAGNTLYKFG 622
Query: 629 CGLSYSKYSYKFKAVSQNKLYLNQSSSTK---------MVENQDVVHYKSVPELGTEFCE 679
GLSYS +S F + L L +S K E ++ +++ ++ T C
Sbjct: 623 HGLSYSTFS-NFIISGPSTLLLKTNSDLKPDIILSTHNSTEEHPFINSQAM-DITTLNCT 680
Query: 680 TRKFLVTIGVKNHGEMAGKHPVLLFVKP--ARRGNGRPIKQLVGFQSVILNAKEKAEIVF 737
+ +GV+N+G ++G H VL+F KP + G QLVGF V +N + +
Sbjct: 681 NSLLSLILGVRNNGPVSGDHVVLVFWKPPNSSEVTGAANVQLVGFSRVEVNRGKTQNVTL 740
Query: 738 ELSPCESLSRAREDGLMVIEEGTHFLVVG-DEEYPI 772
E+ C+ LS +G + G H +G D E+ +
Sbjct: 741 EIDVCKRLSLVDSEGKRKLVTGQHIFTIGSDTEHQV 776
>gi|356525896|ref|XP_003531557.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 1-like
[Glycine max]
Length = 776
Score = 751 bits (1940), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 379/775 (48%), Positives = 513/775 (66%), Gaps = 26/775 (3%)
Query: 15 LCFTSLL---------TRVD-STQPPFSCD-PSNPSTETFPFCKTTLPISQRARDLVSRL 63
LCF S + RV T F+CD NP+ + FC +L + R DLV RL
Sbjct: 15 LCFFSFMFVATVLLNCDRVSGQTSSVFACDVAKNPALAGYGFCDKSLSLEDRVADLVKRL 74
Query: 64 TLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTA 123
TL EKI LVNSA ++ RLGIP YEWWSEALHGV+ VG G F+ + GATSFP ILTA
Sbjct: 75 TLQEKIGSLVNSATSVSRLGIPKYEWWSEALHGVSNVGPGTHFSSLVPGATSFPMPILTA 134
Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVT 183
ASF++ L+ IG+ + EARA+YN G A G+T+W+PNINIFRDPRWGRGQETPGEDPL++
Sbjct: 135 ASFNASLFEAIGRVVSTEARAMYNVGLA-GLTYWSPNINIFRDPRWGRGQETPGEDPLLS 193
Query: 184 GKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLA 243
KYA YV+G+Q T +G KL+ +ACCKH+TAYDLDNWKG RY F+A VT QD+
Sbjct: 194 SKYATGYVKGLQ-QTDDGDS--NKLKVAACCKHYTAYDLDNWKGIQRYTFNAVVTQQDMD 250
Query: 244 DTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVS 303
DT+QPPF+SCV G + +MC+YN+VNG P+CAD +LL R +W +GYI SDCD+V
Sbjct: 251 DTFQPPFKSCVIDGNVASVMCSYNQVNGKPTCADPDLLKGVIRGEWKLNGYIVSDCDSVE 310
Query: 304 IIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSV 363
+++ + Y K+PE+A + + AG+D+NCG++L ++T+ AVKQ L E+ I+ A+ N F+
Sbjct: 311 VLFKDQHYTKTPEEAAAETILAGLDLNCGNYLGQYTEGAVKQGLLDEASINNAVSNNFAT 370
Query: 364 RMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVS 423
MRLG F+G+P+ Q +G +G + VC+ ++ LA +AA+ GIVLLKNS G LPL S
Sbjct: 371 LMRLGFFDGDPSKQTYGNLGPNDVCTSENRELAREAARQGIVLLKNSLGSLPLNAKAIKS 430
Query: 424 LALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVD 483
LA+IGPNAN+ + ++GNY G C I+PLQAL V T Y GC V C++A +D A
Sbjct: 431 LAVIGPNANATRVMIGNYEGIPCNYISPLQALTALVP-TSYAAGCPNVQCANAELDDATQ 489
Query: 484 IAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVD 543
IA AD V+++G E E LDR++++LPG+QQ L++ VA A+K PVILV++ GG +D
Sbjct: 490 IAASADATVIVVGASLAIEAESLDRINILLPGQQQLLVSEVANASKGPVILVIMSGGGMD 549
Query: 544 ITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMK 602
++FAK + I SILW GYPGEAG A+A+VIFG +NP GRLPMTWYPQ Y+ KVPMT+M
Sbjct: 550 VSFAKSNDKITSILWVGYPGEAGGAAIADVIFGFYNPSGRLPMTWYPQSYVNKVPMTNMN 609
Query: 603 MRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQ 662
MR +G PGRTYRFY+G+ VF FG G+S+S +K Q S + E+
Sbjct: 610 MRADPATGYPGRTYRFYKGETVFSFGDGISFSNIEHKIVKAP-------QLVSVPLAEDH 662
Query: 663 DVVHYKSVP-ELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVG 721
+ + + ++ E C+ F + +GVKN G+M+ H VLLF P N P K L+G
Sbjct: 663 ECRSSECMSLDVADEHCQNLAFDIHLGVKNMGKMSSSHVVLLFFTPPDVHNA-PQKHLLG 721
Query: 722 FQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
F+ V L K +A++ F++ C+ LS E G + G H L VG+ ++ +S+ V
Sbjct: 722 FEKVHLPGKSEAQVRFKVDICKDLSVVDELGNRKVPLGQHLLHVGNLKHQLSVSV 776
>gi|225431898|ref|XP_002276351.1| PREDICTED: beta-D-xylosidase 1-like [Vitis vinifera]
Length = 770
Score = 751 bits (1940), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/748 (50%), Positives = 515/748 (68%), Gaps = 16/748 (2%)
Query: 30 PFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEW 89
PF+CDP N T PFC+ +LPI +RARDLV RLTL EKI LVN+A +PRLGI YEW
Sbjct: 26 PFACDPRNGVTRNLPFCRVSLPIQERARDLVGRLTLQEKIRLLVNNAIDVPRLGIKGYEW 85
Query: 90 WSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG 149
WSEALHGV+ VG G F G+ GATSFPQVI TAASF++ LW IG+ + EARA+YN G
Sbjct: 86 WSEALHGVSNVGPGTKFGGSFPGATSFPQVITTAASFNASLWEEIGRVVSDEARAMYNGG 145
Query: 150 QAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQ 209
A G+T+W+PN+NIFRDPRWGRGQETPGEDP V KYA +YVRG+QG+ + +L+
Sbjct: 146 MA-GLTYWSPNVNIFRDPRWGRGQETPGEDPAVAAKYAAAYVRGLQGNA------RDRLK 198
Query: 210 ASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRV 269
+ACCKH+TAYDLD+W G R+ F+ARV+ QDL DTY PF++CV +G + +MC+YN+V
Sbjct: 199 VAACCKHYTAYDLDHWGGIDRFHFNARVSKQDLEDTYDVPFKACVVEGNVASVMCSYNQV 258
Query: 270 NGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDV 329
NG P+CAD +LL T R +W +GYI SDCD+V + YD + Y +PE+A +KAG+D+
Sbjct: 259 NGKPTCADPHLLRDTIRGEWKLNGYIVSDCDSVGVFYDEQHYTATPEEAAAVAIKAGLDL 318
Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
+CG FL HT+AA++ KL E++++ AL N SV+MRLG+F+G P+ QP+G +G VC+
Sbjct: 319 DCGPFLAIHTEAAIRGGKLTEADVNGALMNTISVQMRLGMFDGEPSAQPYGNLGPRDVCT 378
Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
PAHQ LAL+AA+ GIVL++N LPL S+ ++A+IGPN++ +T++GNYAG +C
Sbjct: 379 PAHQQLALEAARQGIVLVQNRGPALPLSTSRHRTIAVIGPNSDVTETMIGNYAGVACGYT 438
Query: 450 TPLQALQNYVENTVYYPGCDTVAC-SSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDR 508
TPLQ + Y T++ GC VAC AV A+ AD VL+MGLDQ+ E E DR
Sbjct: 439 TPLQGIGRYAR-TIHQAGCSGVACRDDQQFGAAVAAARQADATVLVMGLDQSIEAEFRDR 497
Query: 509 VDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAV 568
VD++LPGRQQEL+++VA A++ P +LVL+ GGP+D++FAK D I +I+W GYPG+AG
Sbjct: 498 VDILLPGRQQELVSKVAVASRGPTVLVLMSGGPIDVSFAKNDPRIAAIIWVGYPGQAGGT 557
Query: 569 ALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPF 627
A+A+V+FG NPGG+LP+TWYPQ Y+ K PMT+M MR + G PGRTYRFY G VFPF
Sbjct: 558 AIADVLFGRTNPGGKLPVTWYPQSYLRKAPMTNMAMRAIPSRGYPGRTYRFYNGPVVFPF 617
Query: 628 GCGLSYSKYSYKFKAVSQNKLYLNQS-SSTKMVENQDVVHYKSVPELGTEFCETRKFLVT 686
G GLSYS +++ +++Q ++ S +S + ++N +V ++ + C T+
Sbjct: 618 GHGLSYSTFAH---SLAQAPTTVSVSLASLQTIKNSTIVSSGAI-RISHANCNTQPLGFH 673
Query: 687 IGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLS 746
I VKN G M G H +LLF P G P K+L+ F+ V + A + + F++ C+ LS
Sbjct: 674 IDVKNTGTMDGSHTLLLFSTPP-PGTWSPNKRLLAFEKVHVGAGSQERVRFDVHVCKHLS 732
Query: 747 RAREDGLMVIEEGTHFLVVGDEEYPISI 774
G+ I G H +GD ++ IS+
Sbjct: 733 VVDHFGIHRIPMGEHHFHIGDLKHSISL 760
>gi|224099193|ref|XP_002311398.1| predicted protein [Populus trichocarpa]
gi|222851218|gb|EEE88765.1| predicted protein [Populus trichocarpa]
Length = 755
Score = 750 bits (1937), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 381/747 (51%), Positives = 505/747 (67%), Gaps = 16/747 (2%)
Query: 30 PFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEW 89
PF+CD N T + FC+ +P+ R RDL+ RLTL EKI LVN+A A+PRLGI YEW
Sbjct: 19 PFACDAKNGLTRSLKFCRVNMPLHVRVRDLIGRLTLQEKIRLLVNNAAAVPRLGIQGYEW 78
Query: 90 WSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG 149
WSEALHGV+ VG G F G GATSFPQVI TAASF+ LW IG+ + EARA++N G
Sbjct: 79 WSEALHGVSNVGPGTKFGGAFPGATSFPQVITTAASFNKSLWEEIGRVVSDEARAMFNGG 138
Query: 150 QAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQ 209
A G+T+W+PN+N+FRDPRWGRGQETPGEDP+V GKYA SYVRG+QG++ +L+
Sbjct: 139 MA-GLTYWSPNVNVFRDPRWGRGQETPGEDPVVAGKYAASYVRGLQGNS------GFRLK 191
Query: 210 ASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRV 269
+ACCKH+TAYDLDNW G RY F+ARV+ QDL DTY PF+SCV +G+ + +MC+YN+V
Sbjct: 192 VAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYDVPFKSCVVEGKVASVMCSYNQV 251
Query: 270 NGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDV 329
NG P+CAD NLL T R +W +GYI SDCD+V ++Y+ + Y +PE+A +KAG+D+
Sbjct: 252 NGKPTCADPNLLKNTIRGEWRLNGYIVSDCDSVGVLYENQHYTATPEEAAAATIKAGLDL 311
Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
+CG FL HT+ AVK L E +++ AL N +V+MRLGLF+G P+ QPFGK+G VC+
Sbjct: 312 DCGPFLAIHTENAVKGGLLNEEDVNMALANTITVQMRLGLFDGEPSAQPFGKLGPRDVCT 371
Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
PAHQ LAL AAQ GIVLL+NS LPL + ++++A+IGP A+ T++GNYAG +C
Sbjct: 372 PAHQQLALHAAQQGIVLLQNSGRTLPLSRP-NLTVAVIGPIADVTVTMIGNYAGVACGYT 430
Query: 450 TPLQALQNYVENTVYYPGCDTVACS-SASIDKAVDIAKGADHVVLMMGLDQTQEKEELDR 508
TPLQ + Y + T++ GC VAC+ + A A AD VL+MGLDQ+ E E DR
Sbjct: 431 TPLQGISRYAK-TIHQSGCIDVACNGNQQFGMAEAAASQADATVLVMGLDQSIEAEFRDR 489
Query: 509 VDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAV 568
DL+LPG QQELI+RVA A++ P ILVL+ GGP+D++FAK D IG+ILWAGYPG+AG
Sbjct: 490 KDLLLPGYQQELISRVARASRGPTILVLMSGGPIDVSFAKNDPRIGAILWAGYPGQAGGA 549
Query: 569 ALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPF 627
A+A+V+FG NPGG+LPMTWYPQDY+ KVPMT+M MR + G PGRTYRFY+G VFPF
Sbjct: 550 AIADVLFGTTNPGGKLPMTWYPQDYLAKVPMTNMGMRADPSRGYPGRTYRFYKGPVVFPF 609
Query: 628 GCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTI 687
G G+SY+ +++ Q T + Q+ ++ + CE V I
Sbjct: 610 GHGMSYTTFAHSLVQAPQEVAV----PFTSLYALQNTTAARNSIRVSHANCEPLVLGVHI 665
Query: 688 GVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSR 747
VKN G+M G +L+F P G K+L+GF+ V + A K + ++ C+ LS
Sbjct: 666 DVKNTGDMDGIQTLLVFSSPP-EGKWSANKKLIGFEKVHIVAGSKKRVKIDIPVCKHLSV 724
Query: 748 AREDGLMVIEEGTHFLVVGDEEYPISI 774
G+ + G H L +GD ++ IS+
Sbjct: 725 VDRFGIRRLPIGKHDLHIGDLKHSISL 751
>gi|449505346|ref|XP_004162442.1| PREDICTED: LOW QUALITY PROTEIN: probable beta-D-xylosidase 2-like
[Cucumis sativus]
Length = 772
Score = 750 bits (1936), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 369/775 (47%), Positives = 517/775 (66%), Gaps = 29/775 (3%)
Query: 9 VFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEK 68
+ P+L + +++ + PF+CDP + + +PFC+ LPI +R +DL+ RLTL EK
Sbjct: 8 IIPILII-LSAIFRHGGGAREPFACDPKDAALSRYPFCRVALPIPERVKDLIGRLTLQEK 66
Query: 69 ISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDS 128
+ LVN+A A+PRLGI YEWWSEALHGV+ VG G F G GATSFPQVI T ASF+
Sbjct: 67 VRLLVNNAAAVPRLGIKGYEWWSEALHGVSNVGPGTEFGGDFPGATSFPQVITTVASFNV 126
Query: 129 YLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAV 188
LW IG+ + EARA+YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP+V G+YA
Sbjct: 127 SLWEAIGRVVSDEARAMYNGGAA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVVAGEYAA 185
Query: 189 SYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQP 248
Y++G+QG+ +G +LK +ACCKHFTAYDLDNW GT R+ F+A+VT QD+ DT++
Sbjct: 186 RYIKGLQGN--DGDRLK----VAACCKHFTAYDLDNWNGTDRFHFNAKVTRQDMVDTFEV 239
Query: 249 PFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDA 308
PF CVK+G+ + +MC+YN+VNG+P+CAD NLL T R QWG +GYI SDCD+V + YD
Sbjct: 240 PFRKCVKEGKVASVMCSYNQVNGVPTCADPNLLKGTIRNQWGLNGYIVSDCDSVGVFYDN 299
Query: 309 EGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLG 368
+ Y + E+A D +KAG+D++CG FL HT+ AVK+ L ++ I+ AL N +V+MRLG
Sbjct: 300 QHYTSTAEEAAADAIKAGLDLDCGPFLAVHTEDAVKKXLLTQTHINNALANTITVQMRLG 359
Query: 369 LFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIG 428
+F+G P+ +GK+G VCSP+HQ LAL AA+ GIVLLKN LPL ++A+IG
Sbjct: 360 MFDGAPSSHAYGKLGPKNVCSPSHQQLALDAARQGIVLLKNRLPGLPLSAXHHRTVAVIG 419
Query: 429 PNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSA-SIDKAVDIAKG 487
PN++ T++GNYAG +C +TPL+ ++ Y V+ GCD VAC++ S A+ A
Sbjct: 420 PNSDVNVTMIGNYAGVACGYVTPLEGIKRYT-TVVHRKGCDNVACATDYSFTDALAAAST 478
Query: 488 ADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFA 547
AD VL+MGLDQ+ E E DR L+LPGRQQEL+ +VA A++ P +++L+ GGP+D++FA
Sbjct: 479 ADATVLVMGLDQSVEAETKDRDGLLLPGRQQELVLKVAAASRGPTVVILMSGGPIDVSFA 538
Query: 548 KYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQ 606
D I +ILW GYPG+AG A+A+V+FG NPGG+LPMTWYPQ Y+ +PMT+M MR
Sbjct: 539 DNDPRISAILWVGYPGQAGGAAIADVLFGTTNPGGKLPMTWYPQSYLSNLPMTNMAMR-- 596
Query: 607 ATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSY------KFKAVSQNKLYLNQSSSTKMVE 660
+TS PGRTYRFY G V+ FG GLSY+ + + ++S + S+ST +
Sbjct: 597 STSSYPGRTYRFYAGPVVYEFGHGLSYTNFIHTIVKAPTIVSISLSGHRQTHSASTLSSK 656
Query: 661 NQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRG-NGRPIKQL 719
V H K C+ ++ + V+N G+ G H +L+F P G P KQL
Sbjct: 657 AIRVTHAK---------CQKLSLVIHVDVENKGDRDGFHTMLVFSTPPANGATWVPRKQL 707
Query: 720 VGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
V F+ + L ++EK + + C+ LS + G+ I G H++ +G+ ++ +S+
Sbjct: 708 VAFEKLHLASREKRRLQVHVHVCKYLSVVDKLGVRRIPLGDHYIHIGNVKHTVSL 762
>gi|449436749|ref|XP_004136155.1| PREDICTED: probable beta-D-xylosidase 2-like [Cucumis sativus]
Length = 772
Score = 749 bits (1935), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 369/775 (47%), Positives = 517/775 (66%), Gaps = 29/775 (3%)
Query: 9 VFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEK 68
+ P+L + +++ + PF+CDP + + +PFC+ LPI +R +DL+ RLTL EK
Sbjct: 8 IIPILII-LSAIFRHGGGAREPFACDPKDAALSRYPFCRVALPIPERVKDLIGRLTLQEK 66
Query: 69 ISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDS 128
+ LVN+A A+PRLGI YEWWSEALHGV+ VG G F G GATSFPQVI T ASF+
Sbjct: 67 VRLLVNNAAAVPRLGIKGYEWWSEALHGVSNVGPGTEFGGDFPGATSFPQVITTVASFNV 126
Query: 129 YLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAV 188
LW IG+ + EARA+YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP+V G+YA
Sbjct: 127 SLWEAIGRVVSDEARAMYNGGAA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVVAGEYAA 185
Query: 189 SYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQP 248
Y++G+QG+ +G +LK +ACCKHFTAYDLDNW GT R+ F+A+VT QD+ DT++
Sbjct: 186 RYIKGLQGN--DGDRLK----VAACCKHFTAYDLDNWNGTDRFHFNAKVTRQDMVDTFEV 239
Query: 249 PFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDA 308
PF CVK+G+ + +MC+YN+VNG+P+CAD NLL T R QWG +GYI SDCD+V + YD
Sbjct: 240 PFRKCVKEGKVASVMCSYNQVNGVPTCADPNLLKGTIRNQWGLNGYIVSDCDSVGVFYDN 299
Query: 309 EGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLG 368
+ Y + E+A D +KAG+D++CG FL HT+ AVK+ L ++ I+ AL N +V+MRLG
Sbjct: 300 QHYTSTAEEAAADAIKAGLDLDCGPFLAVHTEDAVKKGLLTQTHINNALANTITVQMRLG 359
Query: 369 LFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIG 428
+F+G P+ +GK+G VCSP+HQ LAL AA+ GIVLLKN LPL ++A+IG
Sbjct: 360 MFDGAPSSHAYGKLGPKNVCSPSHQQLALDAARQGIVLLKNRLPGLPLSADHHRTVAVIG 419
Query: 429 PNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSA-SIDKAVDIAKG 487
PN++ T++GNYAG +C +TPL+ ++ Y V+ GCD VAC++ S A+ A
Sbjct: 420 PNSDVNVTMIGNYAGVACGYVTPLEGIKRYT-TVVHRKGCDNVACATDYSFTDALAAAST 478
Query: 488 ADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFA 547
AD VL+MGLDQ+ E E DR L+LPGRQQEL+ +VA A++ P +++L+ GGP+D++FA
Sbjct: 479 ADATVLVMGLDQSVEAETKDRDGLLLPGRQQELVLKVAAASRGPTVVILMSGGPIDVSFA 538
Query: 548 KYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQ 606
D I +ILW GYPG+AG A+A+V+FG NPGG+LPMTWYPQ Y+ +PMT+M MR
Sbjct: 539 DNDPRISAILWVGYPGQAGGAAIADVLFGTTNPGGKLPMTWYPQSYLSNLPMTNMAMR-- 596
Query: 607 ATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSY------KFKAVSQNKLYLNQSSSTKMVE 660
+TS PGRTYRFY G V+ FG GLSY+ + + ++S + S+ST +
Sbjct: 597 STSSYPGRTYRFYAGPVVYEFGHGLSYTNFIHTIVKAPTIVSISLSGHRQTHSASTLSSK 656
Query: 661 NQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRG-NGRPIKQL 719
V H K C+ ++ + V+N G+ G H +L+F P G P KQL
Sbjct: 657 AIRVTHAK---------CQKLSLVIHVDVENKGDRDGFHTMLVFSTPPANGATWVPRKQL 707
Query: 720 VGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
V F+ + L ++EK + + C+ LS + G+ I G H++ +G+ ++ +S+
Sbjct: 708 VAFEKLHLASREKRRLQVHVHVCKYLSVVDKLGVRRIPLGDHYIHIGNVKHTVSL 762
>gi|168065036|ref|XP_001784462.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162663987|gb|EDQ50724.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 726
Score = 749 bits (1934), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 380/738 (51%), Positives = 495/738 (67%), Gaps = 30/738 (4%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
FC T+L R DLVSRLTL+EK++QLVN+A AIPRL IPAYEWW E LHGVA V
Sbjct: 3 FCDTSLSDEIRVFDLVSRLTLEEKVTQLVNTASAIPRLSIPAYEWWQEGLHGVAHVS--- 59
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIF 164
F G++ ATSFP ILT ASF+ LW +IGQA EARA YN G A G+T+W+P INI
Sbjct: 60 -FGGSLPRATSFPLPILTTASFNKDLWNQIGQAFSTEARAFYNDGIA-GLTYWSPVINIA 117
Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
RDPRWGR QET GEDP T YA +V+G+Q N +LK SACCKHFTAYD+DN
Sbjct: 118 RDPRWGRIQETSGEDPYTTSAYATHFVQGMQEGDANSKRLK----LSACCKHFTAYDVDN 173
Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
W+G RY FDA+ +LADTY PPF+SCV++GR++ +MC+YN+VNG+P+CA+ + L T
Sbjct: 174 WEGIDRYHFDAKA---NLADTYNPPFQSCVQEGRSASLMCSYNKVNGVPTCANYDFLENT 230
Query: 285 ARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVK 344
RR WG +GYI SDCD+V +++++ YA + EDA D L AG+D+NCG +L +T+ AV
Sbjct: 231 VRRAWGLNGYIVSDCDSVLVMHESTNYAPTTEDAAADALNAGLDLNCGDYLASYTEGAVA 290
Query: 345 QKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGI 404
K+ S +D A++N+F VRMRLG+F+GNP Q FG IG VC+PAHQ LA++AA+ GI
Sbjct: 291 MGKVNASRVDNAVYNVFLVRMRLGMFDGNPANQEFGNIGVADVCTPAHQELAVEAARQGI 350
Query: 405 VLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNY----VE 460
VLLKN +LPL SK+++ A+IGPNAN+ T+LGNY G C+ ITPLQ L +
Sbjct: 351 VLLKNDGNILPL--SKNINTAVIGPNANATHTMLGNYEGIPCQYITPLQGLVKFGSGDYH 408
Query: 461 NTVYYPGCDTVAC-SSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQE 519
+ GC AC I AV A AD VVL++GL Q QE E LDR L+LPG QQ
Sbjct: 409 KVWFSEGCVNTACQQDDQISSAVSTAAVADAVVLVVGLSQVQESEALDRTSLLLPGYQQT 468
Query: 520 LITRVA-EAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDH 578
LI VA AA +PV+LVL+C GPVDI FAK D+ I SILW GYPG++G A+AEVIFG H
Sbjct: 469 LIDEVAGAAAGRPVVLVLMCAGPVDINFAKNDKRIQSILWVGYPGQSGGQAIAEVIFGAH 528
Query: 579 NPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSY 638
NPGG+LPM+WYP+DY K+ MT+M MRP + S PGRTYRFY G++++ FG GLSY++Y +
Sbjct: 529 NPGGKLPMSWYPEDYTKISMTNMNMRPDSRSNYPGRTYRFYTGEKIYDFGYGLSYTEYKH 588
Query: 639 KFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGK 698
F + T + +Q +++ G++ C + F V I V+N G MAG
Sbjct: 589 SFALAPTTVM-------TPSIHSQLCDPHQT--SAGSKTCSSSNFDVHINVENIGAMAGN 639
Query: 699 HPVLL-FVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIE 757
H +LL F P+ NG P+KQL F SV + + + ++V L+PC+ L EDG ++E
Sbjct: 640 HTLLLFFTAPSAGKNGTPLKQLAAFDSVYIRSGSQEKVVLTLNPCQHLGTVAEDGTRMLE 699
Query: 758 EGTHFLVVGDEEYPISIF 775
G H L VGD ++ +S+
Sbjct: 700 AGNHILSVGDAKHSLSVL 717
>gi|157041199|dbj|BAF79669.1| beta-D-xylosidase [Pyrus pyrifolia]
Length = 774
Score = 749 bits (1933), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 375/758 (49%), Positives = 503/758 (66%), Gaps = 24/758 (3%)
Query: 24 VDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLG 83
V +PPF+CDP NP T T FC+ +PI R +DL+ RLTL EKI LVN+A A+PRLG
Sbjct: 25 VVHARPPFACDPRNPITRTLKFCRVRVPIHVRVQDLIGRLTLQEKIGLLVNNAIAVPRLG 84
Query: 84 IPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEAR 143
I YEWWSEALHGV+ VG G F GT GATSFPQVI TAASF+ LW IG+ + EAR
Sbjct: 85 IQGYEWWSEALHGVSNVGPGTKF-GTFLGATSFPQVITTAASFNESLWEEIGRVVSDEAR 143
Query: 144 ALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGK 203
A+YN G A G+TFW+PN+NIFRDPRWGRGQETPGEDP++ KY YV+G+QGD
Sbjct: 144 AMYNGGAA-GLTFWSPNVNIFRDPRWGRGQETPGEDPVLAAKYGARYVKGLQGD-----G 197
Query: 204 LKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIM 263
+L+ +ACCKH+TAYDLDNW G R+ F+ARV+ QDL DTY PF++CV G + +M
Sbjct: 198 AGNRLKVAACCKHYTAYDLDNWNGVDRFHFNARVSKQDLEDTYNVPFKACVVDGNVASVM 257
Query: 264 CAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVL 323
C+YN+VNG P+CAD +LL T R QW +GYI SDCD+V + YD + Y K+PE A +
Sbjct: 258 CSYNQVNGKPTCADPDLLKGTIRGQWKLNGYIVSDCDSVGVYYDNQHYTKTPEAAAAYAI 317
Query: 324 KAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIG 383
KAG+D++CG FL HT+AA++ ++ E +I+ AL N +V+MRLG+F+G P+ Q +G +G
Sbjct: 318 KAGLDLDCGPFLGIHTEAAIRTGQVNEIDINYALANTITVQMRLGMFDGEPSTQRYGNLG 377
Query: 384 ADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG 443
VC P+ LAL+AA+ GIVLL+N LPL + ++A+IGPN++ +T++GNYAG
Sbjct: 378 LADVCKPSSNELALEAARQGIVLLENRGNSLPLSTIRHRTVAVIGPNSDVTETMIGNYAG 437
Query: 444 PSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDI-AKGADHVVLMMGLDQTQE 502
+C TPLQ + Y T++ GC V C+ + A ++ A+ AD VL++GLDQ+ E
Sbjct: 438 IACGYTTPLQGIARYTR-TIHQAGCTDVHCNGNQLIGAAEVAARQADATVLVIGLDQSIE 496
Query: 503 KEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYP 562
E DR L+LPG QQEL++RVA A++ P ILV++ GGP+D+TFAK D IG+I+W GYP
Sbjct: 497 AEFRDRTGLLLPGHQQELVSRVARASRGPTILVIMSGGPIDVTFAKNDPRIGAIIWVGYP 556
Query: 563 GEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEG 621
G+AG A+A+V+FG NP G+LPMTWYPQ+Y+ +PMTDM MR G PGRTYRFY+G
Sbjct: 557 GQAGGTAIADVLFGTTNPSGKLPMTWYPQNYVANLPMTDMAMRADPARGYPGRTYRFYKG 616
Query: 622 KEVFPFGCGLSYSKYSYKFKA----VSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEF 677
VFPFG GLSY+++S+ VS L + +T M+ N V +
Sbjct: 617 PVVFPFGMGLSYTRFSHSLAQGPTLVSVPLTSLVAAKNTTMLSNHGV-------RVSHTN 669
Query: 678 CETRKFLVTIGVKNHGEMAGKHPVLLF-VKPARRGNGRPIKQLVGFQSVILNAKEKAEIV 736
C++ I +KN G M G H +L+F +PA G P KQLVGF V + A + +
Sbjct: 670 CDSLSLDFHIDIKNTGTMDGTHTLLVFATQPA--GKWAPNKQLVGFHKVHIVAGSERRVR 727
Query: 737 FELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
+ C+ LS + G+ I G H L +GD ++ +SI
Sbjct: 728 VGVHVCKHLSIVDKLGIRRIPLGQHKLEIGDLKHYVSI 765
>gi|65736613|dbj|BAD98523.1| alpha-L-arabinofuranosidase / beta-D-xylosidase [Pyrus pyrifolia]
Length = 774
Score = 748 bits (1930), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 375/758 (49%), Positives = 503/758 (66%), Gaps = 24/758 (3%)
Query: 24 VDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLG 83
V +PPF+CDP NP T T FC+ +PI R +DL+ RLTL EKI LVN+A A+PRLG
Sbjct: 25 VVHARPPFACDPRNPITRTLKFCRVRVPIHVRVQDLIGRLTLQEKIGLLVNNAIAVPRLG 84
Query: 84 IPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEAR 143
I YEWWSEALHGV+ VG G F GT GATSFPQVI TAASF+ LW IG+ + EAR
Sbjct: 85 IQGYEWWSEALHGVSNVGPGTKF-GTFLGATSFPQVITTAASFNESLWEEIGRVVSDEAR 143
Query: 144 ALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGK 203
A+YN G A G+TFW+PN+NIFRDPRWGRGQETPGEDP++ KY YV+G+QGD
Sbjct: 144 AMYNGGAA-GLTFWSPNVNIFRDPRWGRGQETPGEDPVLAAKYGARYVKGLQGD-----G 197
Query: 204 LKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIM 263
+L+ +ACCKH+TAYDLDNW G R+ F+ARV+ QDL DTY PF++CV G + +M
Sbjct: 198 AGNRLKVAACCKHYTAYDLDNWNGVDRFHFNARVSKQDLEDTYNVPFKACVVDGNVASVM 257
Query: 264 CAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVL 323
C+YN+VNG P+CAD +LL T R QW +GYI SDCD+V + YD + Y K+PE A +
Sbjct: 258 CSYNQVNGKPTCADPDLLKGTIRGQWKLNGYIVSDCDSVGVYYDNQHYTKTPEAAAAYAI 317
Query: 324 KAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIG 383
KAG+D++CG FL HT+AA++ ++ E +I+ AL N +V+MRLG+F+G P+ Q +G +G
Sbjct: 318 KAGLDLDCGPFLGIHTEAAIRTGQVNEIDINYALANTITVQMRLGMFDGEPSTQRYGNLG 377
Query: 384 ADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG 443
VC P+ LAL+AA+ GIVLL+N LPL + ++A+IGPN++ +T++GNYAG
Sbjct: 378 LADVCKPSSNELALEAARQGIVLLENRGNSLPLSTIRHRTVAVIGPNSDVTETMIGNYAG 437
Query: 444 PSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDI-AKGADHVVLMMGLDQTQE 502
+C TPLQ + Y T++ GC V C+ + A ++ A+ AD VL++GLDQ+ E
Sbjct: 438 IACGYTTPLQGIARYTR-TIHQAGCTDVHCNGNQLIGAAEVAARQADATVLVIGLDQSIE 496
Query: 503 KEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYP 562
E DR L+LPG QQEL++RVA A++ P ILV++ GGP+D+TFAK D IG+I+W GYP
Sbjct: 497 AEFRDRTGLLLPGHQQELVSRVARASRGPTILVIMSGGPIDVTFAKNDPCIGAIIWVGYP 556
Query: 563 GEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEG 621
G+AG A+A+V+FG NP G+LPMTWYPQ+Y+ +PMTDM MR G PGRTYRFY+G
Sbjct: 557 GQAGGTAIADVLFGTTNPSGKLPMTWYPQNYVANLPMTDMAMRADPARGYPGRTYRFYKG 616
Query: 622 KEVFPFGCGLSYSKYSYKFKA----VSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEF 677
VFPFG GLSY+++S+ VS L + +T M+ N V +
Sbjct: 617 PVVFPFGMGLSYTRFSHSLAQGPTLVSVPLTSLVAAKNTTMLSNHGV-------RVSHTN 669
Query: 678 CETRKFLVTIGVKNHGEMAGKHPVLLF-VKPARRGNGRPIKQLVGFQSVILNAKEKAEIV 736
C++ I +KN G M G H +L+F +PA G P KQLVGF V + A + +
Sbjct: 670 CDSLSLDFHIDIKNTGTMDGTHTLLVFATQPA--GKWAPNKQLVGFHKVHIVAGSERRVR 727
Query: 737 FELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
+ C+ LS + G+ I G H L +GD ++ +SI
Sbjct: 728 VGVHVCKHLSIVDKLGIRRIPLGQHKLEIGDLKHYVSI 765
>gi|326491679|dbj|BAJ94317.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 772
Score = 747 bits (1929), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/754 (49%), Positives = 507/754 (67%), Gaps = 15/754 (1%)
Query: 29 PPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYE 88
PP + ++ ++ FC +LP RAR LVS LTLDEKI+QL N+A +PRLG+P YE
Sbjct: 24 PPNARACASAEANSYAFCDGSLPFPVRARALVSLLTLDEKIAQLSNTAAGVPRLGVPPYE 83
Query: 89 WWSEALHGVAGVGKGI-FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN 147
WWSE+LHG+A G G+ F +G + AT FPQVIL+AA+F+ LW + +A+ +EARA++N
Sbjct: 84 WWSESLHGLADNGPGVNFSSGPVAAATIFPQVILSAAAFNRSLWRAVAEAVAVEARAMHN 143
Query: 148 AGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGK 207
AGQA G+T+WAPNIN+FRDPRWGRGQETPGEDP + Y+V YV+G QG+ +G +G+
Sbjct: 144 AGQA-GLTYWAPNINVFRDPRWGRGQETPGEDPAMIAAYSVEYVKGFQGEYGDG--REGR 200
Query: 208 LQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYN 267
+ SACCKH+ AYDL+ W RY F+A V QD DTY+PPF+SC+++GRAS +MC+YN
Sbjct: 201 MMLSACCKHYIAYDLEKWGKFARYTFNAEVNAQDFEDTYEPPFKSCIQEGRASCLMCSYN 260
Query: 268 RVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGM 327
+VNG+P+CA ++LL K R +WGF GYI SDCDAV+II++ + Y S ED+V VLKAGM
Sbjct: 261 QVNGVPACARKDLLQKI-RDEWGFKGYIVSDCDAVAIIHENQTYTSSDEDSVAIVLKAGM 319
Query: 328 DVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVV 387
DVNCGSFL +HTK+A+++ K+ E +I+ AL+NLFSV++RLGLF Q F ++G V
Sbjct: 320 DVNCGSFLIRHTKSAIEKGKIQEEDINHALYNLFSVQLRLGLFEKANENQWFTRLGPSNV 379
Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
C+ H+ LA +A + G VLLKN + LPL +SK +ALIG AN A + G+Y G C
Sbjct: 380 CTKEHRELAAEAVRQGTVLLKNDNSFLPLKRSKVSHIALIGAAANDAYIMGGDYTGVPCD 439
Query: 448 SITPLQALQNYVENTVYYPGCDTVACSSAS-IDKAVDIAKGADHVVLMMGLDQTQEKEEL 506
IT L+ +Q +V T GC V+C S +A++ AK AD VV++ GL+ TQE E+L
Sbjct: 440 PITFLKGMQAFVPQTTVAAGCKDVSCDSPDGFGEAIEAAKRADIVVVIAGLNLTQESEDL 499
Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
DRV L+LPGRQQ+L+ +A KKP++LV+ GGPVD+ FAK D I S+LW GYPGE G
Sbjct: 500 DRVTLLLPGRQQDLVNIIASVTKKPIVLVITGGGPVDVAFAKQDPRIASVLWIGYPGEVG 559
Query: 567 AVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFP 626
L E++FG++NPGG+LPMTWYP+ + VPM DM MR + G PGRTYRFY G+ V+
Sbjct: 560 GQVLPEILFGEYNPGGKLPMTWYPESFTAVPMNDMNMRADPSRGYPGRTYRFYTGEVVYG 619
Query: 627 FGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQ------DVVHYKSVPELGTEFCET 680
FG GLSYSKYSY Q ++ L+ S ++ + D + Y V ++ + CE+
Sbjct: 620 FGYGLSYSKYSYNIVQAPQ-RISLSHSPVPGLISRKPAYTRRDGLDYVQVEDIAS--CES 676
Query: 681 RKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELS 740
F V I V N G M G H VLLF + G P+KQLVGF+ V A + +
Sbjct: 677 LVFSVHISVANDGAMDGSHAVLLFARSKSSVPGFPLKQLVGFERVYTAAGSSKNVAITVD 736
Query: 741 PCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
PC+ +S A +G V+ G+H L+VGDE + I
Sbjct: 737 PCKYMSAANTEGRRVLLLGSHHLMVGDEVHEFVI 770
>gi|384872601|gb|AFI25186.1| putative beta-D-xylosidase [Nicotiana tabacum]
Length = 791
Score = 746 bits (1926), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 372/756 (49%), Positives = 493/756 (65%), Gaps = 31/756 (4%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
+ FC LPIS R + L+S LT+DEKI L ++ +IPRLG+PAYEWWSE+LHG+A G
Sbjct: 41 YTFCNKNLPISTRVQSLISLLTIDEKILHLSDNTTSIPRLGLPAYEWWSESLHGIATNGP 100
Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
+ FNG I+G TSFPQVILTAA+F+ LW+ I AI +EARA+YN GQA G+TFWAPNIN
Sbjct: 101 AVNFNGQIKGVTSFPQVILTAAAFNRTLWHSIATAIAVEARAMYNLGQA-GLTFWAPNIN 159
Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKG---------------- 206
I RDPRWGRGQETPGEDP+V YA+ YV G QG N KG
Sbjct: 160 ILRDPRWGRGQETPGEDPMVVSAYAIEYVTGFQG--LNPKAKKGNRNGYGKKRRVLKEDD 217
Query: 207 ----KLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGI 262
+L SACCKHFTAYDL+ W TRY F+A VT QD+ DT+Q PF SC++QG+AS +
Sbjct: 218 NDGERLMLSACCKHFTAYDLEKWGDATRYDFNAVVTKQDMEDTFQAPFRSCIQQGKASCL 277
Query: 263 MCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDV 322
MC+YN VNG+P+CAD+ LL K R WGF GYITSDCDAV+ IY+ + Y K+PEDAV
Sbjct: 278 MCSYNSVNGVPACADKELLDKV-RTDWGFDGYITSDCDAVATIYENQKYTKTPEDAVAVA 336
Query: 323 LKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKI 382
LKAG ++NCG+++ +H K+A +Q + E ++DRAL LFSV+ RLGLF+GNP F
Sbjct: 337 LKAGTNINCGTYMLRHMKSAFQQGSVLEEDLDRALQYLFSVQFRLGLFDGNPADGQFANF 396
Query: 383 GADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA 442
GA VC+ H LAL AA+ GIVLLKN LPL K+ +LA++GP AN + G Y+
Sbjct: 397 GAQDVCTSNHLNLALDAARQGIVLLKNDQKFLPLDKTSVSTLAIVGPMAN-VSSPGGTYS 455
Query: 443 GPSCRSITPLQALQNYVENTVYYPGCDTVAC-SSASIDKAVDIAKGADHVVLMMGLDQTQ 501
G C+ + + ++ T+Y GC V C S+A A+ I K AD+V+++ G D ++
Sbjct: 456 GVPCKLKSIREGFHRHINRTLYAAGCLDVGCNSTAGFQDAISIVKEADYVIVVAGSDLSE 515
Query: 502 EKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGY 561
E E+ DR L+LPG+Q L+T +A A+KKP+ILVL GGPVD++FA+ D I SILW Y
Sbjct: 516 ETEDHDRYSLLLPGQQTNLVTTLAAASKKPIILVLTGGGPVDVSFAEKDPRIASILWVAY 575
Query: 562 PGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEG 621
PGE G AL+E+IFG NPGG+LPMTWY + + KVPMTDM MR ++G PGRTYRFY G
Sbjct: 576 PGETGGKALSEIIFGYQNPGGKLPMTWYLESFTKVPMTDMNMRADPSNGYPGRTYRFYTG 635
Query: 622 KEVFPFGCGLSYSKYSYK-FKAVSQNKLYLNQSSSTK--MVENQDVVHYKSVPELGTEFC 678
++ FG GLSY+ +S + A S+ L L +S+ + + + + + Y V E+ E C
Sbjct: 636 DVLYGFGHGLSYTSFSSQLLSAPSRLSLSLAKSNRKRSILAKGRSRLGYIHVDEV--ESC 693
Query: 679 ETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFE 738
+ KF V I V N G+M G H ++LF + + G P KQLVGF V + A++ E
Sbjct: 694 HSSKFFVHISVTNDGDMDGSHVLMLFSRVLQNFQGAPQKQLVGFDRVHVPARKYVETSLL 753
Query: 739 LSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
+ PCE S A + G ++ G H ++ D E+ + +
Sbjct: 754 VDPCELFSFANDQGNRILALGEHTFILDDIEHVVFV 789
>gi|449484229|ref|XP_004156823.1| PREDICTED: LOW QUALITY PROTEIN: probable beta-D-xylosidase 2-like
[Cucumis sativus]
Length = 769
Score = 745 bits (1924), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 369/771 (47%), Positives = 513/771 (66%), Gaps = 40/771 (5%)
Query: 13 LCLCFTSLLTRVD--STQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKIS 70
L L LL+ + +++ PF+CDP+N T +PFC+ +L + +R +DL+ RLTL+EK+
Sbjct: 7 LILSIFILLSAIHGRASRAPFACDPNNSVTTDYPFCRRSLVVEERVKDLIGRLTLEEKVK 66
Query: 71 QLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYL 130
LV++A +PRLGI AY+WWSEALHGV+ VG G F G ATSFPQVI TAASF++ L
Sbjct: 67 LLVSNAGGVPRLGIKAYQWWSEALHGVSNVGPGTRFGGEFPAATSFPQVISTAASFNASL 126
Query: 131 WYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSY 190
W IG+ + EARA+YN G G+T+W+PN+NIFRDPRWGRGQETPGEDP++ G YAV+Y
Sbjct: 127 WEAIGRVVSDEARAMYNGGVG-GLTYWSPNVNIFRDPRWGRGQETPGEDPILAGTYAVNY 185
Query: 191 VRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPF 250
VRG+QG N +L+ +ACCKHFTAYDLDNW G R+ F+A+V+ QD+ DT++ PF
Sbjct: 186 VRGLQGTEGN------RLKVAACCKHFTAYDLDNWNGVDRFHFNAQVSKQDIEDTFEVPF 239
Query: 251 ESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEG 310
CVK G+ S +MC+YN+VNG+P+CAD NLL+ T R QW GYI SDCD+V + Y+++
Sbjct: 240 RMCVKGGKVSSVMCSYNQVNGVPTCADPNLLTNTLRSQWHLDGYIVSDCDSVGVFYNSQH 299
Query: 311 YAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLF 370
Y +PE+A +KAG+D++CGSFL+ HT+ AVK+ L ES I+ AL N SV+MRLG+F
Sbjct: 300 YTSTPEEAAAMAIKAGLDLDCGSFLETHTENAVKRGLLNESHINGALSNTLSVQMRLGMF 359
Query: 371 NGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPN 430
+G+ QP+ +GA VCS ++ LA+ AA+ GIVLL+N G LPL ++ +A++GPN
Sbjct: 360 DGDLKTQPYAHLGAKHVCSDHNRQLAVDAARQGIVLLENRRGSLPLSTNRHRIVAVVGPN 419
Query: 431 ANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSAS-IDKAVDIAKGAD 489
+N+ T++GNYAG +C ITPLQ + Y T++ GC VAC S A++ A+ AD
Sbjct: 420 SNATLTMIGNYAGIACEYITPLQGISKYTR-TIHQEGCRGVACRSNKFFGGAIEAARVAD 478
Query: 490 HVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKY 549
VVL+MGLDQ+ E E DR L+LPG Q +L+ +VA AK PVILVL+ GGP+D++FAK
Sbjct: 479 AVVLVMGLDQSIEAEFRDRAGLLLPGLQPDLVLKVASVAKGPVILVLMSGGPIDVSFAKD 538
Query: 550 DRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQAT 608
I I+W GYPG+AG +A+A+V+FG NPGG+LPMTWYPQDY+ K+PMT M +RP +
Sbjct: 539 HPKISGIIWGGYPGQAGGLAIADVLFGQTNPGGKLPMTWYPQDYVSKLPMTTMSLRPGTS 598
Query: 609 SGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYK 668
PGRTYRFY+G V+PFG GLSY+ +++K + + +T V H
Sbjct: 599 Y--PGRTYRFYKGPVVYPFGHGLSYTAFTHKILS----------APTTLTVPVTGHRH-- 644
Query: 669 SVPELGTEF-----------CETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIK 717
P G+EF C+ ++ + V+N G G H +L++ P G P K
Sbjct: 645 --PHNGSEFWGKAVRVTHAKCDRLSLVIKVAVRNIGARDGAHTLLVYSIPP-MGVWVPQK 701
Query: 718 QLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDE 768
QLV F+ V ++A+ E+ + C+ LS + G+ + G H + +GD
Sbjct: 702 QLVAFEKVHIDAQALKEVQINIHVCKLLSVVDKYGIRRVPMGEHGIDIGDN 752
>gi|449469042|ref|XP_004152230.1| PREDICTED: probable beta-D-xylosidase 2-like [Cucumis sativus]
Length = 769
Score = 745 bits (1923), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 369/771 (47%), Positives = 514/771 (66%), Gaps = 40/771 (5%)
Query: 13 LCLCFTSLLTRVD--STQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKIS 70
L L LL+ + +++ PF+CDP+N T +PFC+ +L + +R +DL+ RLTL+EK+
Sbjct: 7 LILSIFILLSAIHGRASRAPFACDPNNSVTTDYPFCRRSLVVGERVKDLIGRLTLEEKVK 66
Query: 71 QLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYL 130
LV++A +PRLGI AY+WWSEALHGV+ VG G F G ATSFPQVI TAASF++ L
Sbjct: 67 LLVSNAGGVPRLGIKAYQWWSEALHGVSNVGPGTRFGGEFPAATSFPQVISTAASFNASL 126
Query: 131 WYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSY 190
W IG+ + EARA+YN G G+T+W+PN+NIFRDPRWGRGQETPGEDP++ G YAV+Y
Sbjct: 127 WEAIGRVVSDEARAMYNGGVG-GLTYWSPNVNIFRDPRWGRGQETPGEDPILAGTYAVNY 185
Query: 191 VRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPF 250
VRG+QG N +L+ +ACCKHFTAYDLDNW G R+ F+A+V+ QD+ DT++ PF
Sbjct: 186 VRGLQGTEGN------RLKVAACCKHFTAYDLDNWNGVDRFHFNAQVSKQDIEDTFEVPF 239
Query: 251 ESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEG 310
CVK G+ S +MC+YN+VNG+P+CAD NLL+ T R QW GYI SDCD+V + Y+++
Sbjct: 240 RMCVKGGKVSSVMCSYNQVNGVPTCADPNLLTNTLRSQWHLDGYIVSDCDSVGVFYNSQH 299
Query: 311 YAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLF 370
Y +PE+A +KAG+D++CGSFL+ HT+ AVK+ L ES I+ AL N SV+MRLG+F
Sbjct: 300 YTSTPEEAAAMAIKAGLDLDCGSFLETHTENAVKRGLLNESHINGALSNTLSVQMRLGMF 359
Query: 371 NGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPN 430
+G+ QP+ +GA VCS ++ LA+ AA+ GIVLL+N G LPL ++ +A++GPN
Sbjct: 360 DGDLKTQPYAHLGAKHVCSDHNRQLAVDAARQGIVLLENRRGSLPLSTNRHRIVAVVGPN 419
Query: 431 ANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVAC-SSASIDKAVDIAKGAD 489
+N+ T++GNYAG +C ITPLQ + Y T++ GC VAC S+ A++ A+ AD
Sbjct: 420 SNATLTMIGNYAGIACEYITPLQGISKYTR-TIHQEGCRGVACRSNKFFGGAIEAARVAD 478
Query: 490 HVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKY 549
VVL+MGLDQ+ E E DR L+LPG Q +L+ +VA AK PVILVL+ GGP+D++FAK
Sbjct: 479 AVVLVMGLDQSIEAEFRDRAGLLLPGLQPDLVLKVASVAKGPVILVLMSGGPIDVSFAKD 538
Query: 550 DRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQAT 608
I I+W GYPG+AG +A+A+V+FG NPGG+LPMTWYPQDY+ K+PMT M +RP +
Sbjct: 539 HPKISGIIWGGYPGQAGGLAIADVLFGQTNPGGKLPMTWYPQDYVSKLPMTTMSLRPGTS 598
Query: 609 SGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYK 668
PGRTYRFY+G V+PFG GLSY+ +++K + + +T V H
Sbjct: 599 Y--PGRTYRFYKGPVVYPFGHGLSYTAFTHKILS----------APTTLTVPVTGHRH-- 644
Query: 669 SVPELGTEF-----------CETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIK 717
P G+EF C+ ++ + V+N G G H +L++ P G P K
Sbjct: 645 --PHNGSEFWGKAVRVTHAKCDRLSLVIKVAVRNIGARDGAHTLLVYSIPP-MGVWVPQK 701
Query: 718 QLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDE 768
QLV F+ V ++A+ E+ + C+ LS + G+ + G H + +GD
Sbjct: 702 QLVAFEKVHIDAQALKEVQINIHVCKLLSVVDKYGIRRVPMGEHGIDIGDN 752
>gi|15239867|ref|NP_199747.1| beta-xylosidase 1 [Arabidopsis thaliana]
gi|75262458|sp|Q9FGY1.1|BXL1_ARATH RecName: Full=Beta-D-xylosidase 1; Short=AtBXL1; AltName:
Full=Alpha-L-arabinofuranosidase; Flags: Precursor
gi|9759419|dbj|BAB09906.1| xylosidase [Arabidopsis thaliana]
gi|21539545|gb|AAM53325.1| xylosidase [Arabidopsis thaliana]
gi|332008419|gb|AED95802.1| beta-xylosidase 1 [Arabidopsis thaliana]
Length = 774
Score = 744 bits (1922), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 387/774 (50%), Positives = 514/774 (66%), Gaps = 28/774 (3%)
Query: 8 LVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDE 67
LVF LLCL +S +S +P F+CDP+N T T FC+ +PI R +DL+ RLTL E
Sbjct: 18 LVF-LLCLVHSS-----ESLRPLFACDPANGLTRTLRFCRANVPIHVRVQDLLGRLTLQE 71
Query: 68 KISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFD 127
KI LVN+A A+PRLGI YEWWSEALHG++ VG G F G GATSFPQVI TAASF+
Sbjct: 72 KIRNLVNNAAAVPRLGIGGYEWWSEALHGISDVGPGAKFGGAFPGATSFPQVITTAASFN 131
Query: 128 SYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYA 187
LW IG+ + EARA+YN G A G+T+W+PN+NI RDPRWGRGQETPGEDP+V KYA
Sbjct: 132 QSLWEEIGRVVSDEARAMYNGGVA-GLTYWSPNVNILRDPRWGRGQETPGEDPIVAAKYA 190
Query: 188 VSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQ 247
SYVRG+QG T G +LK +ACCKH+TAYDLDNW G R+ F+A+VT QDL DTY
Sbjct: 191 ASYVRGLQG-TAAGNRLK----VAACCKHYTAYDLDNWNGVDRFHFNAKVTQQDLEDTYN 245
Query: 248 PPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYD 307
PF+SCV +G+ + +MC+YN+VNG P+CAD NLL T R QW +GYI SDCD+V + ++
Sbjct: 246 VPFKSCVYEGKVASVMCSYNQVNGKPTCADENLLKNTIRGQWRLNGYIVSDCDSVDVFFN 305
Query: 308 AEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRL 367
+ Y +PE+A +KAG+D++CG FL T+ AVK+ L E++I+ AL N +V+MRL
Sbjct: 306 QQHYTSTPEEAAARSIKAGLDLDCGPFLAIFTEGAVKKGLLTENDINLALANTLTVQMRL 365
Query: 368 GLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALI 427
G+F+GN + P+ +G VC+PAH+ LAL+AA GIVLLKNS LPL + ++A+I
Sbjct: 366 GMFDGN--LGPYANLGPRDVCTPAHKHLALEAAHQGIVLLKNSARSLPLSPRRHRTVAVI 423
Query: 428 GPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVAC-SSASIDKAVDIAK 486
GPN++ +T++GNYAG +C +PLQ + Y T++ GC VAC + A A+
Sbjct: 424 GPNSDVTETMIGNYAGKACAYTSPLQGISRYAR-TLHQAGCAGVACKGNQGFGAAEAAAR 482
Query: 487 GADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITF 546
AD VL+MGLDQ+ E E DR L+LPG QQ+L+TRVA+A++ PVILVL+ GGP+D+TF
Sbjct: 483 EADATVLVMGLDQSIEAETRDRTGLLLPGYQQDLVTRVAQASRGPVILVLMSGGPIDVTF 542
Query: 547 AKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRP 605
AK D + +I+WAGYPG+AG A+A +IFG NPGG+LPMTWYPQDY+ KVPMT M MR
Sbjct: 543 AKNDPRVAAIIWAGYPGQAGGAAIANIIFGAANPGGKLPMTWYPQDYVAKVPMTVMAMR- 601
Query: 606 QATSGN-PGRTYRFYEGKEVFPFGCGLSYSKYSYKF--KAVSQNKLYLNQSSSTKMVENQ 662
SGN PGRTYRFY+G VFPFG GLSY+ +++ ++Q + L+ +S + N
Sbjct: 602 --ASGNYPGRTYRFYKGPVVFPFGFGLSYTTFTHSLAKSPLAQLSVSLSNLNSANTILNS 659
Query: 663 DVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRG-NGRPI-KQLV 720
S+ T K + + V N GE G H V +F +P G G + KQL+
Sbjct: 660 S---SHSIKVSHTNCNSFPKMPLHVEVSNTGEFDGTHTVFVFAEPPINGIKGLGVNKQLI 716
Query: 721 GFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
F+ V + A K + ++ C+ L E G I G H L +GD ++ I +
Sbjct: 717 AFEKVHVMAGAKQTVQVDVDACKHLGVVDEYGKRRIPMGEHKLHIGDLKHTILV 770
>gi|18025340|gb|AAK38481.1| alpha-L-arabinofuranosidase/beta-D-xylosidase isoenzyme ARA-I
[Hordeum vulgare]
Length = 777
Score = 744 bits (1921), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 385/752 (51%), Positives = 495/752 (65%), Gaps = 15/752 (1%)
Query: 29 PPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYE 88
P F+CD SN + + FC S RARDLVSRLTL EK+ LVN PA+ RLGIPAYE
Sbjct: 37 PVFACDASNATLAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPALGRLGIPAYE 96
Query: 89 WWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA 148
WWSEALHGV+ VG G F+ + GATSFPQ ILTAASF++ L+ IG+ + EARA++N
Sbjct: 97 WWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVSTEARAMHNV 156
Query: 149 GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKL 208
G A G+TFW+PNINIFRDPRWGRGQETPGEDPL+ KYAV YV G+Q D GG G L
Sbjct: 157 GLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQ-DAGAGGVTDGAL 214
Query: 209 QASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
+ +ACCKH+TAYD+DNWKG RY FDA+V+ QDL DT+QPPF+SCV G + +MC+YN+
Sbjct: 215 KVAACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVLDGNVASVMCSYNK 274
Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
VNG P+CAD++LL R W +GYI SDCD+V ++Y + Y K+PE+A +K+G+D
Sbjct: 275 VNGKPTCADKDLLEGVIRGDWKLNGYIVSDCDSVDVLYTQQHYTKTPEEAAAITIKSGVD 334
Query: 329 VNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVC 388
+NCG+FL +HT AAV+ +L E ++DRA+ N F + MRLG F+G+P FG +G VC
Sbjct: 335 LNCGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLGFFDGDPRQLAFGSLGPKDVC 394
Query: 389 SPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS 448
+ +++ LA + A+ GIVLLKNS G LPL S+A+IGPNAN++ T++GNY G C+
Sbjct: 395 TSSNRELARETARQGIVLLKNS-GALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPCKY 453
Query: 449 ITPLQALQNYVENTVYYPGCDTVACSSASID--KAVDIAKGADHVVLMMGLDQTQEKEEL 506
TPLQ L V NTVY PGC V CS S+ AV A AD VL++G DQ+ E+E L
Sbjct: 454 TTPLQGLGAKV-NTVYQPGCTNVGCSGNSLQLSTAVAAAASADVTVLVVGADQSIERESL 512
Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
DR L+LPG+Q +L++ VA A+ PVILV++ GGP DI+FAK I + LW GYPGEAG
Sbjct: 513 DRTSLLLPGQQTQLVSAVANASSGPVILVVMSGGPFDISFAKASDKIAATLWVGYPGEAG 572
Query: 567 AVALAEVIFGDHNPGGRLPMTWYPQDYIK-VPMTDMKMRPQATSGNPGRTYRFYEGKEVF 625
AL + +FG HNP GRLP+TWYP Y V MTDM+MRP ++G PGRTYRFY G VF
Sbjct: 573 GAALDDTLFGSHNPSGRLPVTWYPASYADTVTMTDMRMRPDTSTGYPGRTYRFYTGDTVF 632
Query: 626 PFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVP-ELGTEFCETRKFL 684
FG GLSY+K S+ VS Y+ S ++ E+ + E + C+
Sbjct: 633 AFGDGLSYTKMSHSL--VSAPPSYV----SMRLAEDHLCRAEECASVEAAGDHCDDLALD 686
Query: 685 VTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCES 744
V + V+N GE+AG H VLLF P N P K LVGF+ V L E + F + C
Sbjct: 687 VKLQVRNAGEVAGAHSVLLFSSPPPAHNA-PAKHLVGFEKVSLAPGEAGTVAFRVDVCRD 745
Query: 745 LSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
LS E G + G H L GD ++ + + V
Sbjct: 746 LSVVDELGGRKVALGGHTLHDGDLKHTVELRV 777
>gi|356524862|ref|XP_003531047.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
[Glycine max]
Length = 765
Score = 744 bits (1920), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 375/748 (50%), Positives = 500/748 (66%), Gaps = 14/748 (1%)
Query: 31 FSCD-PSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEW 89
F+CD +P+ + FC +L + R +DLV RLTL EKI LVNSA + RLGIP YEW
Sbjct: 30 FACDVGKSPAVAGYGFCDKSLGVEARVKDLVGRLTLQEKIGNLVNSAVDVSRLGIPKYEW 89
Query: 90 WSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG 149
WSEALHGV+ VG G F+ I GATSFP ILTAASF++ L+ IG+ + EARA+YN G
Sbjct: 90 WSEALHGVSNVGPGTRFSNVIPGATSFPMPILTAASFNTSLFEVIGRVVSTEARAMYNVG 149
Query: 150 QAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQ 209
A G+T+W+PNINIFRDPRWGRG ETPGEDP++T KYA YV+G+Q +GG KL+
Sbjct: 150 LA-GLTYWSPNINIFRDPRWGRGLETPGEDPVLTSKYAAGYVKGLQ--QTDGGD-PNKLK 205
Query: 210 ASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRV 269
+ACCKH+TAYD+DNWKG RY F+A VT QD+ DT+QPPF+SCV G + +MC+YN+V
Sbjct: 206 VAACCKHYTAYDVDNWKGIQRYTFNAVVTKQDMEDTFQPPFKSCVIDGNVASVMCSYNKV 265
Query: 270 NGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDV 329
NG P+CAD +LL R +W +GYI SDCD+V ++Y + Y K+PE+A + AG+D+
Sbjct: 266 NGKPTCADPDLLKGVVRGEWKLNGYIVSDCDSVEVLYKDQHYTKTPEEAAAISILAGLDL 325
Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
NCG FL ++T+ AVKQ + E+ I+ A+ N F+ MRLG F+G+P QP+G +G VC+
Sbjct: 326 NCGRFLGQYTEGAVKQGLIDEASINNAVTNNFATLMRLGFFDGDPRKQPYGNLGPKDVCT 385
Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
+Q LA +AA+ GIVLLKNS LPL SLA+IGPNAN+ + ++GNY G C+ I
Sbjct: 386 QENQELAREAARQGIVLLKNSPASLPLNAKAIKSLAVIGPNANATRVMIGNYEGIPCKYI 445
Query: 450 TPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRV 509
+PLQ L + T Y GC V C + +D A IA AD V+++G E E LDRV
Sbjct: 446 SPLQGLTAFAP-TSYAAGCLDVRCPNPVLDDAKKIAASADATVIVVGASLAIEAESLDRV 504
Query: 510 DLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVA 569
+++LPG+QQ L++ VA A+K PVILV++ GG +D++FAK + I SILW GYPGEAG A
Sbjct: 505 NILLPGQQQLLVSEVANASKGPVILVIMSGGGMDVSFAKNNNKITSILWVGYPGEAGGAA 564
Query: 570 LAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFG 628
+A+VIFG HNP GRLPMTWYPQ Y+ KVPMT+M MRP +G PGRTYRFY+G+ VF FG
Sbjct: 565 IADVIFGFHNPSGRLPMTWYPQSYVDKVPMTNMNMRPDPATGYPGRTYRFYKGETVFAFG 624
Query: 629 CGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIG 688
GLSYS +K V +L Q + + + + KS+ +G E C+ F + +
Sbjct: 625 DGLSYSSIVHKL--VKAPQLVSVQLAEDHVCRSSEC---KSIDVVG-EHCQNLVFDIHLR 678
Query: 689 VKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRA 748
+KN G+M+ H V LF P N P K L+GF+ V L K +A + F++ C+ LS
Sbjct: 679 IKNKGKMSSAHTVFLFSTPPAVHNA-PQKHLLGFEKVHLIGKSEALVSFKVDVCKDLSIV 737
Query: 749 REDGLMVIEEGTHFLVVGDEEYPISIFV 776
E G + G H L VGD ++P+S+ +
Sbjct: 738 DELGNRKVALGQHLLHVGDLKHPLSVMI 765
>gi|357166259|ref|XP_003580652.1| PREDICTED: beta-D-xylosidase 4-like [Brachypodium distachyon]
Length = 774
Score = 743 bits (1919), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/753 (50%), Positives = 498/753 (66%), Gaps = 19/753 (2%)
Query: 29 PPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYE 88
P F+CD +N + + FC S RA DLVSRLTL +K+ LVN PA+ RLGIPAYE
Sbjct: 36 PVFACDAANSTVAGYAFCDRAKSASARAADLVSRLTLADKVGFLVNKQPALARLGIPAYE 95
Query: 89 WWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA 148
WWSEALHGV+ VG G F+ + GATSFPQ ILTAASF++ L+ IG+ + EARA++N
Sbjct: 96 WWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVSNEARAMHNV 155
Query: 149 GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKL 208
G A G+TFW+PNINIFRDPRWGRGQETPGEDPL+ +YAV YV G+Q G G L
Sbjct: 156 GLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLASRYAVGYVSGLQD---AGADADGPL 211
Query: 209 QASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
+ +ACCKH+TAYD+DNWKG RY FDA+V+ QDL DT+QPPF+SCV G+ + +MC+YN+
Sbjct: 212 KVAACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVIDGKVASVMCSYNK 271
Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
VNG P+CAD++LLS R W +GYI SDCD+V ++Y + Y K+PE+A +K+G+D
Sbjct: 272 VNGKPTCADKDLLSGVIRGDWKLNGYIVSDCDSVDVLYSQQHYTKTPEEAAAITIKSGLD 331
Query: 329 VNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVC 388
+NCG FL KHT AAV+ L ES++DRA+ N F + MRLG F+G+P +G +G VC
Sbjct: 332 LNCGDFLAKHTVAAVQAGNLSESDVDRAITNNFIMLMRLGFFDGDPRKLAYGSLGPKDVC 391
Query: 389 SPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS 448
+ ++Q LA + A+ GIVLLKN G LPL S+A+IGPNAN++ T++GNY G C+
Sbjct: 392 TSSNQELARETARQGIVLLKND-GALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPCKY 450
Query: 449 ITPLQALQNYVENTVYYPGCDTVACSSASID--KAVDIAKGADHVVLMMGLDQTQEKEEL 506
TPL L N V TVY PGC V CS S+ A A AD VL++G DQ+ E+E L
Sbjct: 451 TTPLHGLGNNVA-TVYQPGCSNVGCSGNSLQLSAATAAAASADVTVLVVGADQSIEREAL 509
Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
DR L+LPG+Q +LI+ VA A+K VILV++ GGP DI+FAK I +ILW GYPGEAG
Sbjct: 510 DRTSLLLPGQQPDLISAVANASKGHVILVVMSGGPFDISFAKASDKISAILWVGYPGEAG 569
Query: 567 AVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVF 625
A+A++IFG +NP GRLP+TWYP + KVPMTDM+MRP ++G PGRTYRFY G+ VF
Sbjct: 570 GAAIADIIFGKYNPSGRLPVTWYPASFADKVPMTDMRMRPDNSTGYPGRTYRFYTGETVF 629
Query: 626 PFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVP--ELGTEFCETRKF 683
FG GLSY+ S+ A +++ S ++ E H K E + CE F
Sbjct: 630 AFGDGLSYTTMSHNLVAAPPSEV------SMQLAEGH-ACHTKECASVEAAGDHCEGMAF 682
Query: 684 LVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCE 743
V + V N GEMAG H VLLF P N P K L+GF+ + L + F++ C+
Sbjct: 683 EVRLRVHNTGEMAGAHTVLLFSSPPAVHNA-PAKHLLGFEKLNLEPGQAGVAAFKVDVCK 741
Query: 744 SLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
LS E G + G H L VGD ++ +++ V
Sbjct: 742 DLSVVDELGNRKVALGGHTLHVGDLKHTLNLGV 774
>gi|371917280|dbj|BAL44716.1| SlArf/Xyl1 [Solanum lycopersicum]
Length = 771
Score = 742 bits (1916), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/765 (49%), Positives = 500/765 (65%), Gaps = 23/765 (3%)
Query: 12 LLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQ 71
+ + F ++ PF+CDP+N FCKT+LPI R +DL++RLTL EKI
Sbjct: 6 FILIIFVLAFAYSGESRQPFACDPANAGIRNLRFCKTSLPIHVRVQDLIARLTLQEKIRL 65
Query: 72 LVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLW 131
LVN+A + RLGI YEWWSEALHGV+ G G+ F G GATSFPQVI TAASF++ LW
Sbjct: 66 LVNNAAPVQRLGISGYEWWSEALHGVSNTGYGVKFGGAFPGATSFPQVITTAASFNASLW 125
Query: 132 YRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYV 191
IG+ + E RA+YN G A G+TFW+PN+NIFRDPRWGRGQETPGEDP + +Y VSYV
Sbjct: 126 EEIGRVVSEEGRAMYNGGAA-GLTFWSPNVNIFRDPRWGRGQETPGEDPHLVAQYGVSYV 184
Query: 192 RGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFE 251
+G+QG + +L+ +ACCKH+TAYDLD+W G RY F+A+V+MQDL DTY PF+
Sbjct: 185 KGLQGGGG---RGNTRLKVAACCKHYTAYDLDDWNGYDRYHFNAKVSMQDLEDTYNAPFK 241
Query: 252 SCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGY 311
+CV +G + +MC+YN++NG PSCAD LL T R QW +GYI SDCD+V ++++ + Y
Sbjct: 242 ACVVEGNVASVMCSYNQINGKPSCADPTLLRDTIRNQWHLNGYIVSDCDSVGVLFEKQHY 301
Query: 312 AKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFN 371
+ PEDA +KAG+D++CG FL HT AV K+ + EI+ AL N +V+MRLG+F+
Sbjct: 302 TRYPEDAAAITIKAGLDLDCGPFLAIHTDKAVHTGKVSQVEINNALANTITVQMRLGMFD 361
Query: 372 GNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNA 431
G P+ +G VCSPAHQ LALQAA++GIVLLKN LPL + ++A+IGPN+
Sbjct: 362 G--PNGPYANLGPKDVCSPAHQQLALQAAREGIVLLKNIGQALPLSTKRHRTVAVIGPNS 419
Query: 432 NSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVAC-SSASIDKAVDIAKGADH 490
++ ++GNYAG C I+PLQ + Y T++ GC VAC + + A A+ AD
Sbjct: 420 DATLAMIGNYAGVPCGYISPLQGISRYAR-TIHQQGCMGVACPGNQNFGLAEVAARHADA 478
Query: 491 VVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYD 550
VL+MGLDQ+ E E DRV L+LPG QQ+LI+RVA A+K PV+LVL+ GGP+D+TFAK D
Sbjct: 479 TVLVMGLDQSIEAEAKDRVTLLLPGHQQDLISRVAMASKGPVVLVLMSGGPIDVTFAKND 538
Query: 551 RNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATS 609
+ SI+W GYPG+AG A+A+V+FG NPGG+LPMTWYPQDY+ KV M +M MR +
Sbjct: 539 PRVSSIVWVGYPGQAGGAAIADVLFGATNPGGKLPMTWYPQDYVAKVSMANMDMRANPSK 598
Query: 610 GNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKA----VSQNKLYLNQSSSTKMVENQDVV 665
G PGRTYRFY+G VFPFG G+SY+ +S + VS L+ S +V N
Sbjct: 599 GYPGRTYRFYKGPTVFPFGAGISYTTFSQHLVSAPITVSVPTLH-----SHDLVSNNTTT 653
Query: 666 HYKSVPELGT--EFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQ 723
K+ + T CE+ + I VKN G+M G H VL+F P + KQLV F+
Sbjct: 654 LMKAKATVRTIHTNCESLDIDMHIDVKNTGDMDGTHAVLIFSTPP---DPTETKQLVAFE 710
Query: 724 SVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDE 768
V + A K + ++ C+ LS A E G+ I G H + VGD
Sbjct: 711 KVHVVAGAKQRVKINMNACKHLSVADEYGVRRIYMGEHKIHVGDH 755
>gi|297795695|ref|XP_002865732.1| beta-xylosidase 1 [Arabidopsis lyrata subsp. lyrata]
gi|297311567|gb|EFH41991.1| beta-xylosidase 1 [Arabidopsis lyrata subsp. lyrata]
Length = 774
Score = 742 bits (1915), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 388/774 (50%), Positives = 513/774 (66%), Gaps = 28/774 (3%)
Query: 8 LVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDE 67
LVF LLCL +S +S +P F+CDP+N T T FC+ +PI R +DL+ RLTL E
Sbjct: 18 LVF-LLCLVHSS-----ESLRPLFACDPANGLTRTLRFCRVNVPIHVRVQDLIGRLTLQE 71
Query: 68 KISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFD 127
KI LVN+A A+PRLGI YEWWSEALHGV+ VG G F G GATSFPQVI TAASF+
Sbjct: 72 KIRNLVNNAAAVPRLGIGGYEWWSEALHGVSDVGPGSKFGGAFPGATSFPQVITTAASFN 131
Query: 128 SYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYA 187
LW IG+ + EARA+YN G A G+T+W+PN+NI RDPRWGRGQETPGEDP+V KYA
Sbjct: 132 QSLWEEIGRVVSDEARAMYNGGVA-GLTYWSPNVNILRDPRWGRGQETPGEDPIVAAKYA 190
Query: 188 VSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQ 247
SYVRG+QG T G +LK +ACCKH+TAYDLDNW G R+ F+A+VT QDL DTY
Sbjct: 191 ASYVRGLQG-TAAGNRLK----VAACCKHYTAYDLDNWNGVDRFHFNAKVTQQDLEDTYN 245
Query: 248 PPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYD 307
PF+SCV +G+ + +MC+YN+VNG P+CAD NLL T R +W +GYI SDCD+V + ++
Sbjct: 246 VPFKSCVYEGKVASVMCSYNQVNGKPTCADENLLKNTIRGKWRLNGYIVSDCDSVDVFFN 305
Query: 308 AEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRL 367
+ Y +PE+A +KAG+D++CG FL T+ AVK+ L E++I+ AL N +V+MRL
Sbjct: 306 QQHYTSTPEEAAAASIKAGLDLDCGPFLAIFTEGAVKKGLLTENDINLALANTLTVQMRL 365
Query: 368 GLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALI 427
G+F+GN + P+ +G VCS AH+ LAL+AA GIVLLKNS LPL + ++A+I
Sbjct: 366 GMFDGN--LGPYANLGPRDVCSLAHKHLALEAAHQGIVLLKNSGRSLPLSPRRHRTVAVI 423
Query: 428 GPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVAC-SSASIDKAVDIAK 486
GPN++ +T++GNYAG +C TPLQ + Y T++ GC VAC + A A+
Sbjct: 424 GPNSDVTETMIGNYAGKACAYTTPLQGISRYAR-TLHQAGCAGVACKGNQGFGAAEAAAR 482
Query: 487 GADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITF 546
AD VL+MGLDQ+ E E DR L+LPG QQ+L+TRVA+A++ PVILVL+ GGP+D+TF
Sbjct: 483 EADATVLVMGLDQSIEAETRDRTGLLLPGYQQDLVTRVAQASRGPVILVLMSGGPIDVTF 542
Query: 547 AKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRP 605
AK D + +I+WAGYPG+AG A+A +IFG NPGG+LPMTWYPQDY+ KVPMT M MR
Sbjct: 543 AKNDPRVAAIIWAGYPGQAGGAAIANIIFGAANPGGKLPMTWYPQDYVAKVPMTVMAMR- 601
Query: 606 QATSGN-PGRTYRFYEGKEVFPFGCGLSYSKYSYKF--KAVSQNKLYLNQSSSTKMVENQ 662
SGN PGRTYRFY+G VFPFG GLSY+ ++ ++Q + L+ +S + N
Sbjct: 602 --ASGNYPGRTYRFYKGPVVFPFGFGLSYTTFTNSLAKSPLAQLSVSLSNLNSANAILNS 659
Query: 663 DVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRG-NGRPI-KQLV 720
S+ T K + + V N GE G H V +F +P + G G + KQL+
Sbjct: 660 T---SHSIKVSHTNCNSFPKMPLHVEVSNTGEFDGTHTVFVFAEPPKNGIKGLGVNKQLI 716
Query: 721 GFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
F+ V + A K + ++ C+ L E G I G H L +GD ++ I +
Sbjct: 717 AFEKVHVMAGAKQTVRVDVDACKHLGVVDEYGKRRIPMGKHKLHIGDLKHTILV 770
>gi|40363751|dbj|BAD06320.1| putative beta-xylosidase [Triticum aestivum]
Length = 573
Score = 740 bits (1911), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 344/568 (60%), Positives = 433/568 (76%)
Query: 208 LQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYN 267
L+ASACCKHFTAYDL+NWKG TR+ FDA+VT QDLADTY PPF+SCV+ G ASGIMC+YN
Sbjct: 5 LEASACCKHFTAYDLENWKGVTRFAFDAKVTEQDLADTYNPPFKSCVEDGGASGIMCSYN 64
Query: 268 RVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGM 327
RVNG+P+CAD NLLSKTAR W F+GYITSDCDAV+II+D +GYAK+PEDAV DVLKAGM
Sbjct: 65 RVNGVPTCADHNLLSKTARGDWSFNGYITSDCDAVAIIHDVQGYAKAPEDAVADVLKAGM 124
Query: 328 DVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVV 387
DVNCG ++Q H +A +Q K+ +IDRAL NLF++RMRLGLFNGNP +G IGAD V
Sbjct: 125 DVNCGGYIQTHGVSAYQQGKITGEDIDRALRNLFAIRMRLGLFNGNPKYNRYGNIGADQV 184
Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
C HQ LALQAAQDGIVLLKN G LPL KSK S+A+IGPN N+A LLGNY GP C
Sbjct: 185 CKKEHQDLALQAAQDGIVLLKNDAGALPLSKSKVSSVAVIGPNGNNASLLLGNYFGPPCI 244
Query: 448 SITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELD 507
S+TP QALQ YV++ + GC+ C+ ++I +AV A AD+VVL MGLDQ QE+EE+D
Sbjct: 245 SVTPFQALQGYVKDATFVQGCNAAVCNVSNIGEAVHAASSADYVVLFMGLDQNQEREEVD 304
Query: 508 RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGA 567
R++L LPG Q+ L+ +VA+AAKKPVILVLLCGGPVD+TFAK + IG+I+WAGYPG+AG
Sbjct: 305 RLELGLPGMQESLVNKVADAAKKPVILVLLCGGPVDVTFAKNNPKIGAIVWAGYPGQAGG 364
Query: 568 VALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPF 627
+A+A+V+FG+HNPGGRLP+TWYP+++ VPMTDM+MR ++G PGRTYRFY+GK V+ F
Sbjct: 365 IAIAQVLFGEHNPGGRLPVTWYPKEFTAVPMTDMRMRADPSTGYPGRTYRFYKGKTVYNF 424
Query: 628 GCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTI 687
G GLSYSKYS++F + ++ K + V E+G E C+ +F +
Sbjct: 425 GYGLSYSKYSHRFASEGTKPPSMSGIEGLKATASAAGTVSYDVEEMGAEACDRLRFPAVV 484
Query: 688 GVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSR 747
V+NHG M G+HPVLLF++ +GRP QL+GFQSV L A E A + FE+SPC+ SR
Sbjct: 485 RVQNHGPMDGRHPVLLFLRWPNATDGRPASQLIGFQSVHLRADEAAHVEFEVSPCKHFSR 544
Query: 748 AREDGLMVIEEGTHFLVVGDEEYPISIF 775
A EDG VI++G+HF+ VGD+E+ +S
Sbjct: 545 AAEDGRKVIDQGSHFVKVGDDEFELSFM 572
>gi|449466797|ref|XP_004151112.1| PREDICTED: beta-D-xylosidase 1-like [Cucumis sativus]
Length = 770
Score = 739 bits (1909), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 370/744 (49%), Positives = 495/744 (66%), Gaps = 18/744 (2%)
Query: 33 CDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSE 92
CD N T FC+ +L I +R +DL+ RLTL EKI LVN+A A+PRLGI YEWWSE
Sbjct: 34 CDKRNVGTRNMGFCQESLGIEERVKDLIGRLTLGEKIRLLVNNAIAVPRLGIRGYEWWSE 93
Query: 93 ALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI 152
ALHGV+ VG G F GT GATSFPQVI TAASF+ LW IG+ + EARA+YN G A
Sbjct: 94 ALHGVSNVGPGTKFGGTFPGATSFPQVITTAASFNQSLWLLIGRVVSDEARAMYNGGTA- 152
Query: 153 GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASA 212
G+T+W+PN+NIFRDPRWGRGQETPGEDP++ KYA +YV+G+QG N GK +L+ +A
Sbjct: 153 GLTYWSPNVNIFRDPRWGRGQETPGEDPILAAKYAANYVQGLQG---NDGK--KRLKVAA 207
Query: 213 CCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGI 272
CCKH+TAYDLDNW G RY F+A+V+ QDL DTY PF++CV +G+ + +MC+YN+VNG
Sbjct: 208 CCKHYTAYDLDNWNGVDRYHFNAKVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 267
Query: 273 PSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG 332
P+CAD +LL T R WG GYI SDCD+V ++YD++ + +PE+A +KAG+D++CG
Sbjct: 268 PTCADPDLLKNTIRGAWGLDGYIVSDCDSVGVLYDSQHFTPTPEEAAASTIKAGLDLDCG 327
Query: 333 SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAH 392
FL HT AV + L E +++ AL NL SV+MRLG+F+G P QP+G +G VC+PAH
Sbjct: 328 PFLAVHTATAVGRGLLKEVDLNNALANLLSVQMRLGMFDGEPAAQPYGNLGPKDVCTPAH 387
Query: 393 QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
+ LAL+AA+ GIVLL+N G LPL ++ ++A+IGPN+++ T++GNYAG +C TP+
Sbjct: 388 KHLALEAARQGIVLLQNRAGALPLSPTRHRTVAVIGPNSDATVTMIGNYAGVACEYTTPV 447
Query: 453 QALQNYVENTVYYPGCDTVAC-SSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDL 511
Q + YV+ T++ GC VAC I +A A+ AD V+++GLDQ+ E E DR +
Sbjct: 448 QGISKYVK-TIHAKGCANVACVGDQLIGEAEAAARVADAAVVVVGLDQSIEAESRDRNGV 506
Query: 512 VLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALA 571
+LPG+Q+EL+ R+ A K P ++VL+ GGP+D++FAK D I ILW GYPG+AG A+A
Sbjct: 507 LLPGKQEELVRRIGLACKGPTVVVLMSGGPIDVSFAKNDGKISGILWVGYPGQAGGAAIA 566
Query: 572 EVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCG 630
+V+FG NPGG+LPMTWYPQ Y+ KVPMT+M +RP ++G PGRTYRFY+G VFPFG G
Sbjct: 567 DVLFGATNPGGKLPMTWYPQSYLAKVPMTNMGLRPDPSTGYPGRTYRFYKGPVVFPFGFG 626
Query: 631 LSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVK 690
LSYSK+S F S S V H T+ + I VK
Sbjct: 627 LSYSKFSQSFAEAPTKISLPLSSLSPNSSATVKVSH--------TDCASVSDLPIMIDVK 678
Query: 691 NHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRARE 750
N G + G H +L+F + P K L+GF+ V L A + + + C+ LSR E
Sbjct: 679 NTGTVDGSHTILVFSTVPNQ-TWSPEKHLIGFEKVHLIAGSQKRVRIGIHVCDHLSRVDE 737
Query: 751 DGLMVIEEGTHFLVVGDEEYPISI 774
G I G H L +GD + IS+
Sbjct: 738 FGTRRIPMGEHKLHIGDLTHSISL 761
>gi|297811069|ref|XP_002873418.1| beta-xylosidase 3 [Arabidopsis lyrata subsp. lyrata]
gi|297319255|gb|EFH49677.1| beta-xylosidase 3 [Arabidopsis lyrata subsp. lyrata]
Length = 780
Score = 739 bits (1909), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 387/770 (50%), Positives = 513/770 (66%), Gaps = 15/770 (1%)
Query: 13 LCLCFTSLLTRVDSTQ--PPFSCDPS-NPSTETFPFCKTTLPISQRARDLVSRLTLDEKI 69
L LCF ++ + Q P F+CD + NPS FC T L I R DLV RLTL+EKI
Sbjct: 19 LLLCFLLCISEQSNAQSSPVFACDVTGNPSLAGLRFCNTGLNIKSRVTDLVGRLTLEEKI 78
Query: 70 SQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSY 129
L ++A + RLGIPAY+WWSEALHGV+ VG G F+G + GATSFPQVILTAASF+
Sbjct: 79 GFLGSNAIGVSRLGIPAYKWWSEALHGVSNVGGGSSFSGQVPGATSFPQVILTAASFNVS 138
Query: 130 LWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVS 189
L+ IG+ + EARA+YN G A G+TFW+PN+NIFRDPRWGRGQETPGEDP ++ KYAV+
Sbjct: 139 LFQAIGKVVSTEARAMYNVGSA-GLTFWSPNVNIFRDPRWGRGQETPGEDPELSSKYAVA 197
Query: 190 YVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPP 249
YVRG+Q +GG +L+ +ACCKH+TAYD+DNWK R+ F+A V QD+ADT+QPP
Sbjct: 198 YVRGLQ--ETDGGD-PNRLKVAACCKHYTAYDVDNWKDVHRFTFNAVVNQQDMADTFQPP 254
Query: 250 FESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAE 309
F+SCV G + +MC+YN+VNG P+CAD +LLS R QW +GYI SDCD+V ++Y +
Sbjct: 255 FKSCVVDGNVASVMCSYNQVNGKPTCADPDLLSGVIRGQWKLNGYIVSDCDSVDVLYTKQ 314
Query: 310 GYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGL 369
Y K+PE+AV + AG+D+NC F ++ AVK + E+ ID+A+ N F+ MRLG
Sbjct: 315 HYTKTPEEAVAKSILAGLDLNCDHFTGQYAMKAVKVGLVNETAIDKAISNNFATLMRLGF 374
Query: 370 FNGNPTMQP-FGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIG 428
F+G+P Q +G +G + VC+ +Q LA AA+ GIVLLKNS G LPL S +LA+IG
Sbjct: 375 FDGDPKKQQLYGGLGPNDVCTANNQELARDAARQGIVLLKNSAGSLPLSPSAIKTLAVIG 434
Query: 429 PNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGA 488
PNAN+ +T++GNY G C+ TPLQ L V +T Y GC+ VAC+ + A +A A
Sbjct: 435 PNANATETMIGNYNGIPCKYTTPLQGLAETVSST-YQLGCN-VACAEPDLGSAAALAASA 492
Query: 489 DHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAK 548
D VVL+MG DQ+ E+E LDR+DL LPG+QQEL+T+VA+ AK PV+LV++ GG DITFAK
Sbjct: 493 DAVVLVMGADQSIEQENLDRLDLYLPGKQQELVTQVAKVAKGPVVLVIMSGGAFDITFAK 552
Query: 549 YDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQA 607
+ I I+W GYPGEAG +A+A+VIFG HNP G LPMTWYPQ Y+ KVPMT+M MRP
Sbjct: 553 NEEKITGIMWVGYPGEAGGLAIADVIFGRHNPSGNLPMTWYPQSYVEKVPMTNMNMRPDK 612
Query: 608 TSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYK-FKAVSQNKLYLNQSSSTKMVENQDVVH 666
++G PGRTYRFY G+ V+ FG GLSY+ ++++ KA L L+++ + + E Q V
Sbjct: 613 SNGYPGRTYRFYTGETVYAFGDGLSYTNFNHQILKAPKLVSLDLDENHACRSSECQSVDA 672
Query: 667 YKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVI 726
P F V + V+N G+ G H V LF P +G P K L+GF+ +
Sbjct: 673 IG--PHCDNAVGGGLNFEVQLKVRNVGDREGSHTVFLFTTPPEV-HGSPRKHLLGFEKIR 729
Query: 727 LNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
L KE+ I F + C+ LS E G I G + L VG ++ ++I V
Sbjct: 730 LGEKEETVIRFNVDVCKDLSVVDEIGKRKIALGHYLLHVGSFKHSLTISV 779
>gi|356574315|ref|XP_003555294.1| PREDICTED: LOW QUALITY PROTEIN: probable beta-D-xylosidase 5-like
[Glycine max]
Length = 901
Score = 739 bits (1909), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 375/739 (50%), Positives = 489/739 (66%), Gaps = 14/739 (1%)
Query: 40 TETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAG 99
T FPFC T+L RA+DLVSRLTL EK QLVN + I RLG+PAYEWWSEALHGV+
Sbjct: 31 TSNFPFCDTSLSYEDRAKDLVSRLTLQEKTQQLVNPSAGISRLGVPAYEWWSEALHGVSN 90
Query: 100 VGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAP 159
+G G F+ + GATSFP VIL+AASF++ LW ++GQ + EARA+YN A G+TFW+P
Sbjct: 91 LGPGTRFDKKVPGATSFPAVILSAASFNASLWQKMGQVVSTEARAMYNVDLA-GLTFWSP 149
Query: 160 NINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTA 219
N+N+FRDPRWGRGQETPGEDPLV +YAV Y+RG+Q +L+ S+CCKH+TA
Sbjct: 150 NVNVFRDPRWGRGQETPGEDPLVVSRYAVMYLRGLQEVEDEASAKADRLKVSSCCKHYTA 209
Query: 220 YDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRN 279
YDLDNWKG R+ FDA+VT QDL D+YQPPF+SCV +G S +MC+YNRVNGIP+CAD +
Sbjct: 210 YDLDNWKGIDRFHFDAKVTKQDLEDSYQPPFKSCVVEGHVSSVMCSYNRVNGIPTCADPD 269
Query: 280 LLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT 339
LL R QWG GYI SDCD+V + Y+A Y +PEDAV LKAG+++NCG FL+K+T
Sbjct: 270 LLKGIIRGQWGLDGYIVSDCDSVEVYYNAIHYTATPEDAVALALKAGLNMNCGDFLKKYT 329
Query: 340 KAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQA 399
AV KK+ + +D+AL + V MRLG F+ +P PF +G VC+ +Q LAL A
Sbjct: 330 ANAVNLKKVDVATVDQALVYNYIVLMRLGFFD-DPKSLPFANLGPSDVCTKDNQQLALDA 388
Query: 400 AQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV 459
A+ GIVLL+N++G LPL ++ LA+IGPNAN+ ++ NYAG CR +PLQ LQ Y+
Sbjct: 389 AKQGIVLLENNNGALPLSQTNIKKLAVIGPNANATTVMISNYAGIPCRYTSPLQGLQKYI 448
Query: 460 ENTVYYPGCDTVACSSAS-IDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQ 518
+ Y PGC V C + S I AV A AD VVL++GLDQ+ E E LDR +L LPG Q+
Sbjct: 449 SSVNYAPGCSNVKCDNQSLIAAAVKAAASADAVVLVVGLDQSIEAEGLDRENLTLPGFQE 508
Query: 519 ELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDH 578
+ + VA A K VILV++ GP+DI+ K NIG ILW GYPG+AG A+A+VIFGD+
Sbjct: 509 KFVKDVAGATKGKVILVIMAAGPIDISSTKSVSNIGGILWVGYPGQAGGDAIAQVIFGDY 568
Query: 579 NPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYS 637
NPGGR P TWYPQ Y+ +VPMTDM MR + PGRTYRFY G ++ FG GLSYS +S
Sbjct: 569 NPGGRSPFTWYPQSYVDQVPMTDMNMRANKSRNFPGRTYRFYNGNSLYEFGHGLSYSTFS 628
Query: 638 YKFKAVSQNKLYLNQSSS---TKMVENQDVVHYKSVP-----ELGTEFCETRKFLVTIGV 689
+ + + N S S + N +S+ ++ T C+ FL+ IGV
Sbjct: 629 MYVASAPSSIMIENTSISEPHNMLSSNNSGTQVESLSDGQAIDISTINCQDLTFLLVIGV 688
Query: 690 KNHGEMAGKHPVLLFVKPARRGN--GRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSR 747
KN+G + G H VL+F +PA G PIKQL+GF+ V + + ++ C+ +S
Sbjct: 689 KNNGPLNGSHVVLVFWEPATSEFVIGAPIKQLIGFERVQVVVGVTEFVTVKIDICQLISN 748
Query: 748 AREDGLMVIEEGTHFLVVG 766
DG + G H ++VG
Sbjct: 749 VDSDGKRKLVIGQHTILVG 767
>gi|296083274|emb|CBI22910.3| unnamed protein product [Vitis vinifera]
Length = 738
Score = 735 bits (1898), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/747 (49%), Positives = 501/747 (67%), Gaps = 46/747 (6%)
Query: 30 PFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEW 89
PF+CDP N T PFC+ +LPI +RARDLV RLTL EKI LVN+A +PRLGI YEW
Sbjct: 26 PFACDPRNGVTRNLPFCRVSLPIQERARDLVGRLTLQEKIRLLVNNAIDVPRLGIKGYEW 85
Query: 90 WSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG 149
WSEALHGV+ VG G F G+ GATSFPQVI TAASF++ LW IG+ + EARA+YN G
Sbjct: 86 WSEALHGVSNVGPGTKFGGSFPGATSFPQVITTAASFNASLWEEIGRVVSDEARAMYNGG 145
Query: 150 QAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQ 209
A G+T+W+PN+NIFRDPRWGRGQETPGEDP V KYA +YVRG+QG+ + +L+
Sbjct: 146 MA-GLTYWSPNVNIFRDPRWGRGQETPGEDPAVAAKYAAAYVRGLQGNA------RDRLK 198
Query: 210 ASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRV 269
+ACCKH+TAYDLD+W G R+ F+ARV+ QDL DTY PF++CV +G + +MC+YN+V
Sbjct: 199 VAACCKHYTAYDLDHWGGIDRFHFNARVSKQDLEDTYDVPFKACVVEGNVASVMCSYNQV 258
Query: 270 NGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDV 329
NG P+CAD +LL T R +W +GYI SDCD+V + YD + Y +PE+A +KAG+D+
Sbjct: 259 NGKPTCADPHLLRDTIRGEWKLNGYIVSDCDSVGVFYDEQHYTATPEEAAAVAIKAGLDL 318
Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
+CG FL HT+AA++ KL E++++ AL N SV+MRLG+F+G P+ QP+G +G VC+
Sbjct: 319 DCGPFLAIHTEAAIRGGKLTEADVNGALMNTISVQMRLGMFDGEPSAQPYGNLGPRDVCT 378
Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
PAHQ LAL+AA+ GIVL++N LPL S+ ++A+IGPN++ +T++GNYAG +C
Sbjct: 379 PAHQQLALEAARQGIVLVQNRGPALPLSTSRHRTIAVIGPNSDVTETMIGNYAGVACGYT 438
Query: 450 TPLQALQNYVENTVYYPGCDTVAC-SSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDR 508
TPLQ + Y T++ GC VAC AV A+ AD VL+MGLDQ+ E E DR
Sbjct: 439 TPLQGIGRYAR-TIHQAGCSGVACRDDQQFGAAVAAARQADATVLVMGLDQSIEAEFRDR 497
Query: 509 VDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAV 568
VD++LPGRQQEL+++VA A++ P +LVL+ GGP+D++FAK D I +I+W GYPG+AG
Sbjct: 498 VDILLPGRQQELVSKVAVASRGPTVLVLMSGGPIDVSFAKNDPRIAAIIWVGYPGQAGGT 557
Query: 569 ALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPF 627
A+A+V+FG NPGG+LP+TWYPQ Y+ K PMT+M MR + G PGRTYRFY G VFPF
Sbjct: 558 AIADVLFGRTNPGGKLPVTWYPQSYLRKAPMTNMAMRAIPSRGYPGRTYRFYNGPVVFPF 617
Query: 628 GCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTI 687
G GLSYS +++ L Q+ +T + + D
Sbjct: 618 GHGLSYSTFAHS----------LAQAPTTPLGFHID------------------------ 643
Query: 688 GVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSR 747
VKN G M G H +LLF P G P K+L+ F+ V + A + + F++ C+ LS
Sbjct: 644 -VKNTGTMDGSHTLLLFSTPP-PGTWSPNKRLLAFEKVHVGAGSQERVRFDVHVCKHLSV 701
Query: 748 AREDGLMVIEEGTHFLVVGDEEYPISI 774
G+ I G H +GD ++ IS+
Sbjct: 702 VDHFGIHRIPMGEHHFHIGDLKHSISL 728
>gi|357445735|ref|XP_003593145.1| Beta-xylosidase/alpha-L-arabinofuranosidase [Medicago truncatula]
gi|355482193|gb|AES63396.1| Beta-xylosidase/alpha-L-arabinofuranosidase [Medicago truncatula]
Length = 775
Score = 734 bits (1894), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 371/779 (47%), Positives = 511/779 (65%), Gaps = 20/779 (2%)
Query: 5 KLSLVFPLLCLCFTSLLTRVD----STQPPFSCD-PSNPSTETFPFCKTTLPISQRARDL 59
K+S VF + + ++L + T F+CD N + ++ FC +L + R DL
Sbjct: 10 KVSSVFLCFSIFYVAVLLNCNHVYGQTSTVFACDVAKNTNVSSYGFCDKSLSVEDRVSDL 69
Query: 60 VSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQV 119
V RLTL EKI L NSA + RLGIP YEWWSEALHGV+ +G G F+ + GATSFP
Sbjct: 70 VKRLTLQEKIGNLGNSAVEVSRLGIPKYEWWSEALHGVSNIGPGTHFSSLVPGATSFPMP 129
Query: 120 ILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGED 179
ILTAASF++ L+ IG + EARA+YN G A G+T+W+PNINIFRDPRWGRGQETPGED
Sbjct: 130 ILTAASFNTSLFQAIGSVVSNEARAMYNVGLA-GLTYWSPNINIFRDPRWGRGQETPGED 188
Query: 180 PLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTM 239
PL++ KYA YV+G+Q T +G KL+ +ACCKH+TAYD+DNWKG RY FDA V+
Sbjct: 189 PLLSSKYAAGYVKGLQ-QTDDGDS--DKLKVAACCKHYTAYDVDNWKGVQRYTFDAVVSQ 245
Query: 240 QDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDC 299
QDL DT+QPPF+SCV G + +MC+YN+VNG P+CAD +LL R +W +GYI SDC
Sbjct: 246 QDLDDTFQPPFKSCVIDGNVASVMCSYNKVNGKPTCADPDLLKGVIRGKWKLNGYIVSDC 305
Query: 300 DAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHN 359
D+V +++ + Y K+PE+A + +G+D++CGS+L ++T AVKQ + E+ I+ A+ N
Sbjct: 306 DSVEVLFKDQHYTKTPEEAAAKTILSGLDLDCGSYLGQYTGGAVKQGLVDEASINNAVSN 365
Query: 360 LFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKS 419
F+ MRLG F+G+P+ QP+G +G VC+P +Q LA +AA+ GIVLLKNS G LPL
Sbjct: 366 NFATLMRLGFFDGDPSKQPYGNLGPKDVCTPENQELAREAARQGIVLLKNSPGSLPLSSK 425
Query: 420 KSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASID 479
SLA+IGPNAN+ + ++GNY G C+ +PLQ L +V T Y PGC V C++A ID
Sbjct: 426 AIKSLAVIGPNANATRVMIGNYEGIPCKYTSPLQGLTAFVP-TSYAPGCPDVQCANAQID 484
Query: 480 KAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCG 539
A IA AD ++++G + E E LDRV+++LPG+QQ+L+ VA +K PVILV++ G
Sbjct: 485 DAAKIAASADATIIVVGANLAIEAESLDRVNILLPGQQQQLVNEVANVSKGPVILVIMSG 544
Query: 540 GPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPM 598
G +D++FAK + I SILW GYPGEAG A+A+VIFG +NP GRLPMTWYPQ Y+ K+PM
Sbjct: 545 GGMDVSFAKTNDKITSILWVGYPGEAGGAAIADVIFGSYNPSGRLPMTWYPQSYVEKIPM 604
Query: 599 TDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKF-KAVSQNKLYLNQSSSTK 657
T+M MR +G PGRTYRFY+G+ VF FG G+S+ +K KA + L + +
Sbjct: 605 TNMNMRSDPATGYPGRTYRFYKGETVFSFGDGMSFGTVEHKIVKAPQLVSVPLAEDHECR 664
Query: 658 MVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIK 717
+E + + ++ E C+ F + + VKN G+M+ H VLLF P N P K
Sbjct: 665 SLECKSL-------DVADEHCQNLAFDIHLSVKNMGKMSSSHSVLLFFTPPNVHNA-PQK 716
Query: 718 QLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
L+GF+ V L K + + F++ C LS E G + G H L VG+ ++ +S+ +
Sbjct: 717 HLLGFEKVQLAGKSEGMVRFKVDVCNDLSVVDELGNRKVPLGDHMLHVGNLKHSLSVRI 775
>gi|292630922|sp|A5JTQ2.1|XYL1_MEDVA RecName: Full=Beta-xylosidase/alpha-L-arabinofuranosidase 1;
AltName: Full=Xylan
1,4-beta-xylosidase/Alpha-N-arabinofuranosidase 1;
Short=MsXyl1; Includes: RecName: Full=Beta-xylosidase;
AltName: Full=1,4-beta-D-xylan xylohydrolase; AltName:
Full=Xylan 1,4-beta-xylosidase; Includes: RecName:
Full=Alpha-N-arabinofuranosidase; AltName:
Full=Alpha-L-arabinofuranosidase; Short=Arabinosidase;
Flags: Precursor
gi|146762261|gb|ABQ45227.1| beta-xylosidase/alpha-L-arabinosidase [Medicago sativa subsp. x
varia]
Length = 774
Score = 730 bits (1884), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 370/779 (47%), Positives = 509/779 (65%), Gaps = 20/779 (2%)
Query: 5 KLSLVFPLLCLCFTSLLTRVD----STQPPFSCD-PSNPSTETFPFCKTTLPISQRARDL 59
K+S VF + + ++L + T F+CD N + ++ FC +L + R DL
Sbjct: 9 KVSSVFLCFSIFYVTVLLNCNHVYGQTSTVFACDVAKNTNVSSYGFCDNSLSVEDRVSDL 68
Query: 60 VSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQV 119
V RLTL EKI L NSA + RLGIP YEWWSEALHGV+ +G G F+ + GAT+FP
Sbjct: 69 VKRLTLQEKIGNLGNSAVEVSRLGIPKYEWWSEALHGVSNIGPGTHFSSLVPGATNFPMP 128
Query: 120 ILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGED 179
ILTAASF++ L+ IG + EARA+YN G A G+T+W+PNINIFRDPRWGRGQETPGED
Sbjct: 129 ILTAASFNTSLFQAIGSVVSNEARAMYNVGLA-GLTYWSPNINIFRDPRWGRGQETPGED 187
Query: 180 PLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTM 239
PL++ KYA YV+G+Q T +G KL+ +ACCKH+TAYD+DNWKG RY FDA V+
Sbjct: 188 PLLSSKYAAGYVKGLQ-QTDDGDS--DKLKVAACCKHYTAYDVDNWKGVQRYTFDAVVSQ 244
Query: 240 QDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDC 299
QDL DT+QPPF+SCV G + +MC+YN+VNG P+CAD +LL R +W +GYI SDC
Sbjct: 245 QDLDDTFQPPFKSCVIDGNVASVMCSYNKVNGKPTCADPDLLKGVIRGKWKLNGYIVSDC 304
Query: 300 DAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHN 359
D+V ++Y + Y K+PE+A + +G+D++CGS+L ++T AVKQ + E+ I A+ N
Sbjct: 305 DSVEVLYKDQHYTKTPEEAAAKTILSGLDLDCGSYLGQYTGGAVKQGLVDEASITNAVSN 364
Query: 360 LFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKS 419
F+ MRLG F+G+P+ QP+G +G VC+P +Q LA +AA+ GIVLLKNS LPL
Sbjct: 365 NFATLMRLGFFDGDPSKQPYGNLGPKDVCTPENQELAREAARQGIVLLKNSPRSLPLSSK 424
Query: 420 KSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASID 479
SLA+IGPNAN+ + ++GNY G C+ +PLQ L +V T Y PGC V C++A ID
Sbjct: 425 AIKSLAVIGPNANATRVMIGNYEGIPCKYTSPLQGLTAFVP-TSYAPGCPDVQCANAQID 483
Query: 480 KAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCG 539
A IA AD ++++G + E E LDRV+++LPG+QQ+L+ VA +K PVILV++ G
Sbjct: 484 DAAKIAASADATIIVVGANLAIEAESLDRVNILLPGQQQQLVNEVANVSKGPVILVIMSG 543
Query: 540 GPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPM 598
G +D++FAK + I SILW GYPGEAG A+A+VIFG +NP GRLPMTWYPQ Y+ KVPM
Sbjct: 544 GGMDVSFAKTNDKITSILWVGYPGEAGGAAIADVIFGSYNPSGRLPMTWYPQSYVEKVPM 603
Query: 599 TDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKF-KAVSQNKLYLNQSSSTK 657
T+M MR +G PGRTYRFY+G+ VF FG G+S+ +K KA + L + +
Sbjct: 604 TNMNMRADPATGYPGRTYRFYKGETVFSFGDGMSFGTVEHKIVKAPQLVSVPLAEDHECR 663
Query: 658 MVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIK 717
+E + + ++ + C+ F + + VKN G+M+ H VLLF P N P K
Sbjct: 664 SLECKSL-------DVADKHCQNLAFDIHLSVKNMGKMSSSHSVLLFFTPPNVHNA-PQK 715
Query: 718 QLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
L+GF+ V L K + + F++ C LS E G + G H L VG+ ++ +S+ +
Sbjct: 716 HLLGFEKVQLAGKSEGMVRFKVDVCNDLSVVDELGNRKVPLGDHMLHVGNLKHSLSVRI 774
>gi|222629257|gb|EEE61389.1| hypothetical protein OsJ_15562 [Oryza sativa Japonica Group]
Length = 771
Score = 720 bits (1858), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/755 (49%), Positives = 499/755 (66%), Gaps = 32/755 (4%)
Query: 36 SNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPR---------LGIPA 86
++P+ +PFC TLP RAR LVS LTLDEKI+QL+ P +G+P+
Sbjct: 31 ASPAASAYPFCNATLPFPARARALVSLLTLDEKIAQLLQHRRGRPPPRRPALRVVVGVPS 90
Query: 87 YEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY 146
+ A G G +R AT FPQVIL+AA+F+ LW +AI +EARA++
Sbjct: 91 T---ASATTGPGSTSP----RGPVRSATIFPQVILSAAAFNRSLWRAAARAIAVEARAMH 143
Query: 147 NAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKG 206
NAGQA G+TFWAPNIN+FRDPRWGRGQETPGEDP V Y+V YV+G Q D +G
Sbjct: 144 NAGQA-GLTFWAPNINVFRDPRWGRGQETPGEDPAVVSAYSVEYVKGFQRDYGE----EG 198
Query: 207 KLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAY 266
++ SACCKH+ AYDL+ W+G TRY F+A+V QD+ DTYQPPF+SC+++GRAS +MC+Y
Sbjct: 199 RMMLSACCKHYIAYDLEKWRGFTRYTFNAKVNAQDMEDTYQPPFKSCIQEGRASCLMCSY 258
Query: 267 NRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAG 326
N+VNG+P+CA +++L + AR +WGF GYITSDCDAV+II++ + Y S ED++ VLKAG
Sbjct: 259 NQVNGVPACARKDILQR-ARDEWGFQGYITSDCDAVAIIHENQTYTASDEDSIAVVLKAG 317
Query: 327 MDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADV 386
MD+NCGSFL +HTK+A+++ K+ E +I+ AL NLFSV++RLG F+ Q F ++G +
Sbjct: 318 MDINCGSFLIRHTKSAIEKGKVQEEDINHALFNLFSVQLRLGFFDKTNENQWFTQLGPNN 377
Query: 387 VCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSC 446
VC+ H+ LA +A + G VLLKN +G LPL +S+ +ALIGP AN L G+Y G C
Sbjct: 378 VCTTEHRELAAEAVRQGTVLLKNDNGFLPLKRSEVGHIALIGPAANDPYILGGDYTGVPC 437
Query: 447 RSITPLQALQNYVENTVYYPGCDTVACSSAS-IDKAVDIAKGADHVVLMMGLDQTQEKEE 505
S T ++ +Q YV T + GC V C+S +A++ AK AD VVL+ GL+ T+E E+
Sbjct: 438 HSTTFVKGMQAYVPKTTFAAGCKDVPCNSTDGFGEAIEAAKRADVVVLIAGLNLTEETED 497
Query: 506 LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEA 565
DRV L+LPGRQ +LI VA KKPV+LVL+ GGPVD++FAK+D I SILW GYPGE
Sbjct: 498 HDRVSLLLPGRQMDLIHTVASVTKKPVVLVLMGGGPVDVSFAKHDPRIASILWIGYPGEV 557
Query: 566 GAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVF 625
G L E++FG +NPGG+LP+TWYP+ + VPM DM MR A+ G PGRTYRFY G V+
Sbjct: 558 GGNVLPEILFGKYNPGGKLPITWYPESFTAVPMDDMNMRADASRGYPGRTYRFYTGDVVY 617
Query: 626 PFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQ------DVVHYKSVPELGTEFCE 679
FG GLSYSKYSY + K+ L++SS ++ + D V Y V ++ + CE
Sbjct: 618 GFGYGLSYSKYSYSILQAPK-KISLSRSSVPDLISRKPAYTRRDGVDYVQVEDIAS--CE 674
Query: 680 TRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFEL 739
+F V I V N G M G H VLLF G PIKQLVGF+ V A ++ +
Sbjct: 675 ALQFPVHISVSNDGAMDGSHAVLLFASSKPSFPGSPIKQLVGFERVHTAAGRSTDVEITV 734
Query: 740 SPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
PC+ +S A +G V+ GTH L+VGDEE+ + I
Sbjct: 735 DPCKLMSFANTEGTRVLFLGTHVLMVGDEEHELLI 769
>gi|226531269|ref|NP_001145980.1| uncharacterized protein LOC100279508 precursor [Zea mays]
gi|219885199|gb|ACL52974.1| unknown [Zea mays]
gi|413920228|gb|AFW60160.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 794
Score = 718 bits (1853), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 365/766 (47%), Positives = 499/766 (65%), Gaps = 23/766 (3%)
Query: 31 FSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWW 90
F+C P P+ + PFC+ +LP+ RARDLVSRLT EK+ LVN+A +PRLG+ YEWW
Sbjct: 27 FACAPGGPAA-SLPFCRQSLPLRARARDLVSRLTRAEKVRLLVNNAAGVPRLGVAGYEWW 85
Query: 91 SEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ 150
SEALHGV+ G G+ F G GAT+FPQVI TAAS ++ LW +G+A+ EARA+YN G+
Sbjct: 86 SEALHGVSDTGPGVRFGGAFPGATAFPQVIGTAASLNATLWELVGRAVSDEARAMYNGGR 145
Query: 151 AIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGD--TFNGGKLKGKL 208
A G+TFW+PN+NIFRDPRWGRGQETPGEDP V+ +YA +YVRG+Q NGG + +L
Sbjct: 146 A-GLTFWSPNVNIFRDPRWGRGQETPGEDPAVSARYAAAYVRGLQQPYAAPNGGH-RNRL 203
Query: 209 QASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
+ +ACCKHFTAYDLD W GT R+ F+A V QDL DT+ PF +CV+ GRA+ +MC+YN+
Sbjct: 204 KLAACCKHFTAYDLDKWGGTDRFHFNAVVAAQDLEDTFNVPFRACVEDGRAASVMCSYNQ 263
Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
VNG+P+CAD L T R +WG GYI SDCD+V + + + Y ++PEDA L+AG+D
Sbjct: 264 VNGVPTCADAAFLRGTIRGRWGLDGYIVSDCDSVDVFFRDQHYTRTPEDAAAATLRAGLD 323
Query: 329 VNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVC 388
++CG FL + +AV K+ ++++D AL N +V+MRLG+F+G+P PFG++G VC
Sbjct: 324 LDCGPFLALYAGSAVAAGKVADADVDAALLNTVTVQMRLGMFDGDPAAGPFGRLGPADVC 383
Query: 389 SPAHQVLALQAAQDGIVLLKNSHG------LLPLPKSKSVSLALIGPNANSAKTLLGNYA 442
+ HQ LAL AA+ G+VLLKN G +LPL + +A++GP+A++ ++GNYA
Sbjct: 384 TREHQDLALDAARQGVVLLKNRRGARHNRDVLPLRPAAHRVVAVVGPHADATVAMIGNYA 443
Query: 443 GPSCRSITPLQALQNYVENTVYYPGCDTVAC-SSASIDKAVDIAKGADHVVLMMGLDQTQ 501
G CR TPLQ + Y + GC VAC + I AV+ A+ AD V++ GLDQ
Sbjct: 444 GKPCRYTTPLQGVAAYAARVAHQAGCTDVACRGNQPIAAAVEAARQADATVVVAGLDQRV 503
Query: 502 EKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGY 561
E E LDR L+LPGRQ ELI+ VA+A+K PVILVL+ GGP+DI FA+ D I ILW GY
Sbjct: 504 EAEGLDRTTLLLPGRQAELISAVAKASKGPVILVLMSGGPIDIAFAQNDPRIDGILWVGY 563
Query: 562 PGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYE 620
PG+AG A+A+VIFG HNPG +LP+TWY QDY+ KVPMT+M MR G PGRTYRFY
Sbjct: 564 PGQAGGQAIADVIFGHHNPGAKLPVTWYHQDYLQKVPMTNMAMRANPARGYPGRTYRFYT 623
Query: 621 GKEVFPFGCGLSYSKYSYKF-KAVSQNKLYLNQSSSTKMVENQDVVHYKSVP----ELGT 675
G ++PFG GLSY+++++ A +Q + L+ S + + + P +
Sbjct: 624 GPTIYPFGHGLSYTQFTHTLAHAPTQLTVRLSGSGHSAASAASLLNATLARPVRAVRVAH 683
Query: 676 EFCETRKFLVTIGVKNHGEMAGKHPVLLF-VKP----ARRGNGRPIKQLVGFQSVILNAK 730
CE V + V N G+ G H VL++ P A G P +QLV F+ V + A
Sbjct: 684 ARCEGLTVPVHVDVSNVGDRDGAHAVLVYHAAPSPSHAAPGADAPARQLVAFEKVHVPAG 743
Query: 731 EKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
A + + C+ LS A +G+ + G H L++G+ + +S+ V
Sbjct: 744 GVARVEMRIGVCDRLSVADRNGVRRVPVGEHRLMIGELTHSVSLGV 789
>gi|326489197|dbj|BAK01582.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 709
Score = 714 bits (1844), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 367/714 (51%), Positives = 477/714 (66%), Gaps = 15/714 (2%)
Query: 67 EKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASF 126
+K+ LVN PA+ RLGIPAYEWWSEALHGV+ VG G F+ + GATSFPQ ILTAASF
Sbjct: 7 QKVGFLVNKQPALGRLGIPAYEWWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASF 66
Query: 127 DSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKY 186
++ L+ IG+ + EARA++N G A G+TFW+PNINIFRDPRWGRGQETPGEDPL+ KY
Sbjct: 67 NASLFRAIGEVVSTEARAMHNVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKY 125
Query: 187 AVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTY 246
AV YV G+Q D GG G L+ +ACCKH+TAYD+DNWKG RY FDA+V+ QDL DT+
Sbjct: 126 AVGYVTGLQ-DAGAGGVTDGALKVAACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTF 184
Query: 247 QPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIY 306
QPPF+SCV G + +MC+YN+VNG P+CAD++LL R W +GYI SDCD+V ++Y
Sbjct: 185 QPPFKSCVLDGNVASVMCSYNKVNGKPTCADKDLLEGVIRGDWKLNGYIVSDCDSVDVLY 244
Query: 307 DAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMR 366
+ Y K+PE+A +K+G+D+NCG+FL +HT AAV+ +L E ++DRA+ N F + MR
Sbjct: 245 TQQHYTKTPEEAAAITIKSGLDLNCGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMR 304
Query: 367 LGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLAL 426
LG F+G+P FG +G VC+ +++ LA + A+ GIVLLKNS G LPL S+A+
Sbjct: 305 LGFFDGDPRQLAFGSLGPKDVCTSSNRELARETARQGIVLLKNS-GALPLSAKSIKSMAV 363
Query: 427 IGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASID--KAVDI 484
IGPNAN++ T++GNY G C+ TPLQ L V NTVY PGC V CS S+ AV
Sbjct: 364 IGPNANASFTMIGNYEGTPCKYTTPLQGLGAKV-NTVYQPGCTNVGCSGNSLQLSTAVAA 422
Query: 485 AKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDI 544
A AD VL++G DQ+ E+E LDR L+LPG+Q +L++ VA A+ PVILV++ GGP DI
Sbjct: 423 AASADVTVLVVGADQSIERESLDRTSLLLPGQQTQLVSAVANASSGPVILVVMSGGPFDI 482
Query: 545 TFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK-VPMTDMKM 603
+FAK I +ILW GYPGEAG ALA+++FG HNP GRLP+TWYP Y V MTDM+M
Sbjct: 483 SFAKASDKIAAILWVGYPGEAGGAALADILFGSHNPSGRLPVTWYPASYADTVTMTDMRM 542
Query: 604 RPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQD 663
RP ++G PGRTYRFY G VF FG GLSY+K S+ VS Y+ S ++ E+
Sbjct: 543 RPDTSTGYPGRTYRFYTGDTVFAFGDGLSYTKMSHSL--VSAPPSYV----SMRLAEDHP 596
Query: 664 VVHYKSVP-ELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGF 722
+ E + C+ F V + V+N GE+AG H VLLF P N P K L+GF
Sbjct: 597 CRAEECASVEAAGDHCDDLAFDVKLQVRNAGEVAGAHSVLLFSSPPPAHNA-PAKHLLGF 655
Query: 723 QSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
+ V L E + F + C LS E G + G H L VGD ++ + + V
Sbjct: 656 EKVSLAPGEAGTVAFRVDVCRDLSVVDELGGRKVALGGHTLHVGDLKHTVELRV 709
>gi|168046596|ref|XP_001775759.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162672911|gb|EDQ59442.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 784
Score = 714 bits (1842), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 360/771 (46%), Positives = 498/771 (64%), Gaps = 27/771 (3%)
Query: 23 RVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRL 82
R+ + ++CDP P+ FPFC T++ R DL+SRLT+ EKI QLVN+A + RL
Sbjct: 12 RIKVYKLQYACDPDGPADLLFPFCNTSISDDDRVEDLISRLTIQEKIEQLVNTAANVSRL 71
Query: 83 GIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEA 142
GIP Y+WW E LHGVA + ++F G ATSFP L+ S++ LW +IGQ + E
Sbjct: 72 GIPPYQWWGEGLHGVA-ISPSVYFGGATPAATSFPLPCLSVCSYNRTLWNKIGQVVSTEG 130
Query: 143 RALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGG 202
RA+YN G++ G+T+W+PNINI RDPRWGR QETPGEDP ++ YAV +V+G+Q ++
Sbjct: 131 RAMYNQGRS-GLTYWSPNINIARDPRWGRTQETPGEDPKLSSGYAVHFVKGLQEGDYDQN 189
Query: 203 KLKG------KLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQ 256
+ + +L+ SACCKHFTA+DLD WK R FD++VT QDL DTY P F+SCVK+
Sbjct: 190 QPQAVSRGPRRLKISACCKHFTAHDLDRWKDYDRDHFDSKVTQQDLEDTYNPSFKSCVKE 249
Query: 257 GRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE 316
G++S +MC+YNR+NGIP C LL+ T R QWGF GYI SDCDAV++I+D YA + E
Sbjct: 250 GQSSSVMCSYNRLNGIPMCTHYELLTLTVRNQWGFDGYIVSDCDAVALIHDYINYAPTSE 309
Query: 317 DAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTM 376
DAV V+ AGMD+NCGS H AA+ +K + E ID L NLF VRMRLG+F+GNP+
Sbjct: 310 DAVSYVMLAGMDLNCGSTTLVHGLAALDKKLIWEGLIDMHLRNLFRVRMRLGMFDGNPST 369
Query: 377 QPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKT 436
P+G +G + +C+ +Q LAL+AA+ +VLLKN LP K+ + LA+IG +A++ +
Sbjct: 370 LPYGSLGPEDMCTEDNQHLALEAARQSLVLLKNEKNALPWKKTHGLKLAVIGHHADATRE 429
Query: 437 LLGNYAGPSCRSITPLQA----LQNYVENTVYYPGCDTVACSSA-SIDKAVDIAKGADHV 491
+LGNY G C+ ++PLQ L ++ + GC AC I A + A AD V
Sbjct: 430 MLGNYEGYPCKFVSPLQGFAKVLSDHSPRISHERGCSDAACEDQFYIYAAKEAAAQADAV 489
Query: 492 VLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEA-AKKPVILVLLCGGPVDITFAKYD 550
VL++G+ Q QEKE DR L+LPGRQ EL++ V EA A +PV+LVLL G P+D++FA D
Sbjct: 490 VLVLGISQAQEKEGRDRDSLLLPGRQMELVSSVVEASAGRPVVLVLLSGSPLDVSFANDD 549
Query: 551 RNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSG 610
I SI+WAGYPG++G A+AE IFG NPGGRL +WY ++Y + M++M MRP A++G
Sbjct: 550 PRIQSIIWAGYPGQSGGEAIAEAIFGLVNPGGRLAQSWYYENYTNIDMSNMNMRPNASTG 609
Query: 611 NPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQN------KLYLNQSSSTKMVENQDV 664
PGRTYRF+ ++ FG GLSYS + Y + Q+ + L S M + +
Sbjct: 610 YPGRTYRFFTDTPLWEFGHGLSYSDFKYTMVSAPQSIMAPHLRYQLCSSDRAVMTSDLNC 669
Query: 665 VHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRG-NGRPIKQLVGFQ 723
+HY+ E C+ F V + V NHG ++G H VLLF KP RG +G P+KQLV F+
Sbjct: 670 LHYEK------EACKESSFHVRVWVINHGPLSGDHSVLLFSKPPSRGIDGIPLKQLVSFE 723
Query: 724 SVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
V L A EI+F+++PCE L +DG+ +E G H L+VG ++ +++
Sbjct: 724 RVHLEAGAGQEILFKVNPCEDLGTVGDDGIRTVELGEHTLMVGMVQHVLTV 774
>gi|189380221|gb|ACD93208.1| beta xylosidase [Camellia sinensis]
Length = 767
Score = 711 bits (1836), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 372/766 (48%), Positives = 505/766 (65%), Gaps = 48/766 (6%)
Query: 27 TQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPA 86
++P F+CD + T PFC+ +LPI R RDL+ RLTL EKI LVN+A A+PRLGI
Sbjct: 27 SRPAFACDGA---TRNLPFCRVSLPIQDRVRDLIGRLTLQEKIRLLVNNAAAVPRLGIKG 83
Query: 87 YEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY 146
YEWWSEALHGV+ G+ F G GATSFPQVI TAASF++ LW IG+ + EARA+Y
Sbjct: 84 YEWWSEALHGVSNADPGVKFGGAFPGATSFPQVISTAASFNASLWEHIGRVVSDEARAMY 143
Query: 147 NAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKG 206
N G A G+T+W+PN+NIFRDPRWGRGQETPGEDP++ GKYA SYVRG+QG++ N
Sbjct: 144 NGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLAGKYAASYVRGLQGNSGN------ 196
Query: 207 KLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAY 266
+L+ +ACCKH+TAYDLDNW RY+F+ARV+ QDLADTY PF++CV +G+ + CA+
Sbjct: 197 QLKVAACCKHYTAYDLDNWNSVDRYRFNARVSKQDLADTYDVPFKACVVEGKYQ-VYCAH 255
Query: 267 NRVNGIPSCADRNLLSKTARRQ--WGFHGYITSDCDAVSIIYDAEGYA------KSPEDA 318
I A+ +L+ + + W +H ++ C +Y G+ +PEDA
Sbjct: 256 T----IKLMANPLVLTLISPQHHPWSWHSWL--HCFR---LYRCWGFICHSTLHSTPEDA 306
Query: 319 VVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQP 378
+KAG+D+ CG FL HT+ AV+Q KL E++++ AL N SV+MRLG+F+G P+ QP
Sbjct: 307 AAATIKAGLDLECGPFLAIHTEQAVRQGKLGEADVNGALINTLSVQMRLGMFDGEPSSQP 366
Query: 379 FGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLL 438
+G +G VC+PAHQ LAL+AA+ GIVLL+N LPL ++A+IGPN++ T+L
Sbjct: 367 YGNLGPRDVCTPAHQQLALEAARQGIVLLQNRGRSLPLSTQLHRTVAVIGPNSDVTVTML 426
Query: 439 GNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASI-DKAVDIAKGADHVVLMMGL 497
GNYAG +C TPLQ ++ YV T++ GCD+VACS+ + A A+ AD VL+MGL
Sbjct: 427 GNYAGVACGFTTPLQGIERYVR-TIHQSGCDSVACSNNQLFGVAETAARQADATVLVMGL 485
Query: 498 DQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSIL 557
DQ+ E E DRV L+LPG QQEL++RVA A++ PV+LVL+ GGP+D++FAK D IG+IL
Sbjct: 486 DQSIETEFKDRVGLLLPGPQQELVSRVAMASRGPVVLVLMSGGPIDVSFAKNDPRIGAIL 545
Query: 558 WAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTY 616
W GYPG+AG A+A+V+FG NPGGRLPMTWYPQDY+ K PMT+M MR +SG PGRTY
Sbjct: 546 WVGYPGQAGGTAIADVLFGRTNPGGRLPMTWYPQDYLAKAPMTNMAMRANPSSGYPGRTY 605
Query: 617 RFYEGKEVFPFGCGLSYSKYSYKFK------AVSQNKLYLNQSSSTKMVENQDVVHYKSV 670
RFY+G VFPFG G+SY+ ++++ +V LY Q+S+T + +
Sbjct: 606 RFYKGPVVFPFGHGMSYTTFAHELAHAPTTVSVPLTSLYGLQNSTT----------FNNG 655
Query: 671 PELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAK 730
+ C+T + I VKN G+M G H VL+F P G KQL+GF+ V + A+
Sbjct: 656 IRVTHTNCDTLILGIHIDVKNTGDMDGTHTVLVFSTPP-VGKWGANKQLIGFKKVHVVAR 714
Query: 731 EKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
+ + + C LS + G+ I G H L +GD ++ IS+ V
Sbjct: 715 GRQRVKIHVHVCNQLSVVDQFGIRRIPIGEHSLHIGDIKHSISLQV 760
>gi|90399376|emb|CAJ86207.1| B1011H02.4 [Oryza sativa Indica Group]
Length = 738
Score = 707 bits (1824), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 368/747 (49%), Positives = 486/747 (65%), Gaps = 49/747 (6%)
Query: 36 SNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALH 95
++P+ +PFC TLP RAR LVS LTLDEKI+QL N+A PRLG+P +EWWSE+LH
Sbjct: 31 ASPAASAYPFCNATLPFPARARALVSLLTLDEKIAQLSNTAAGAPRLGVPPFEWWSESLH 90
Query: 96 GVAGVGKGI-FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGM 154
GV G G+ F +G +R AT FPQVIL+AA+F+ LW +AI +EARA++NAGQA G+
Sbjct: 91 GVCDNGPGVNFSSGPVRSATIFPQVILSAAAFNRSLWRAAARAIAVEARAMHNAGQA-GL 149
Query: 155 TFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACC 214
TFWAPNIN+FRDPRWGRGQETPGEDP V Y+V YV+G Q D +G++ SACC
Sbjct: 150 TFWAPNINVFRDPRWGRGQETPGEDPAVVSAYSVEYVKGFQRDYGE----EGRMMLSACC 205
Query: 215 KHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPS 274
KH+ AYDL+ W+G TRY F+A+V NG+P+
Sbjct: 206 KHYIAYDLEKWRGFTRYTFNAKV--------------------------------NGVPA 233
Query: 275 CADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSF 334
CA +++L + AR +WGF GYITSDCDAV+II++ + Y S ED++ VLKAGMD+NCGSF
Sbjct: 234 CARKDILQR-ARDEWGFQGYITSDCDAVAIIHENQTYTASDEDSIAVVLKAGMDINCGSF 292
Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
L +HTK+A+++ K+ E +I+ AL NLFSV++RLG F+ Q F ++G + VC+ H+
Sbjct: 293 LIRHTKSAIEKGKVQEEDINHALFNLFSVQLRLGFFDKTNENQWFTQLGPNNVCTTEHRE 352
Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
LA +A + G VLLKN +G LPL +S+ +ALIGP AN L G+Y G C S T ++
Sbjct: 353 LAAEAVRQGTVLLKNDNGFLPLKRSEVGHIALIGPAANDPYILGGDYTGVPCHSTTFVKG 412
Query: 455 LQNYVENTVYYPGCDTVACSSAS-IDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVL 513
+Q YV T + GC V C+S +A++ AK AD VVL+ GL+ T+E E+ DRV L+L
Sbjct: 413 MQAYVPKTTFAAGCKDVPCNSTDGFGEAIEAAKRADVVVLIAGLNLTEETEDHDRVSLLL 472
Query: 514 PGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEV 573
PGRQ +LI VA KKPV+LVL+ GGPVD++FAK+D I SILW GYPGE G L E+
Sbjct: 473 PGRQMDLIHTVASVTKKPVVLVLMGGGPVDVSFAKHDPRIASILWIGYPGEVGGNVLPEI 532
Query: 574 IFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSY 633
+FG +NPGG+LP+TWYP+ + VPM DM MR A+ G PGRTYRFY G V+ FG GLSY
Sbjct: 533 LFGKYNPGGKLPITWYPESFTAVPMDDMNMRADASRGYPGRTYRFYTGDVVYGFGYGLSY 592
Query: 634 SKYSYKFKAVSQNKLYLNQSSSTKMVENQ------DVVHYKSVPELGTEFCETRKFLVTI 687
SKYSY + K+ L++SS ++ + D V Y V ++ + CE +F V I
Sbjct: 593 SKYSYSILQAPK-KISLSRSSVPDLISRKPAYTRRDGVDYVQVEDIAS--CEALQFPVHI 649
Query: 688 GVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSR 747
V N G M G H VLLF G PIKQLVGF+ V A ++ + PC+ +S
Sbjct: 650 SVSNDGAMDGSHAVLLFASSKPSFPGSPIKQLVGFERVHTAAGRSTDVEITVDPCKLMSF 709
Query: 748 AREDGLMVIEEGTHFLVVGDEEYPISI 774
A +G V+ GTH L+VGDEE+ + I
Sbjct: 710 ANTEGTRVLFLGTHVLMVGDEEHELLI 736
>gi|115486595|ref|NP_001068441.1| Os11g0673200 [Oryza sativa Japonica Group]
gi|113645663|dbj|BAF28804.1| Os11g0673200 [Oryza sativa Japonica Group]
Length = 822
Score = 704 bits (1818), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/781 (48%), Positives = 501/781 (64%), Gaps = 53/781 (6%)
Query: 42 TFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVG 101
T PFC+ +LP RARDLV+RLT EK+ LVN+A +PRLG+ YEWWSEALHGV+ G
Sbjct: 40 TLPFCRRSLPARARARDLVARLTRAEKVRLLVNNAAGVPRLGVAGYEWWSEALHGVSDTG 99
Query: 102 KGIFFNGTIRGATSFPQVILTAASFDSYLWYRIG-------------------------- 135
G+ F G GAT+FPQVI TAASF++ LW IG
Sbjct: 100 PGVRFGGAFPGATAFPQVIGTAASFNATLWELIGQVMPILKGGHARCNQRPSCIRISVFM 159
Query: 136 ------QAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVS 189
QA+ E RA+YN GQA G+TFW+PN+NIFRDPRWGRGQETPGEDP V +YA +
Sbjct: 160 YVYVCAQAVSDEGRAMYNGGQA-GLTFWSPNVNIFRDPRWGRGQETPGEDPAVAARYAAA 218
Query: 190 YVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPP 249
YVRG+Q + G+LK +ACCKHFTAYDLDNW GT R+ F+A VT QDL DT+ P
Sbjct: 219 YVRGLQQQQPSSGRLK----LAACCKHFTAYDLDNWSGTDRFHFNAVVTRQDLEDTFNVP 274
Query: 250 FESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAE 309
F SCV GRA+ +MC+YN+VNG+P+CAD L T RR+WG GYI SDCD+V + Y +
Sbjct: 275 FRSCVVDGRAASVMCSYNQVNGVPTCADAAFLRGTIRRRWGLAGYIVSDCDSVDVFYSDQ 334
Query: 310 GYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGL 369
Y ++ EDAV L+AG+D++CG FL ++T+ AV Q K+ + +ID A+ N +V+MRLG+
Sbjct: 335 HYTRTREDAVAATLRAGLDLDCGPFLAQYTEGAVAQGKVGDGDIDAAVTNTVTVQMRLGM 394
Query: 370 FNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPL-PKSKSVSLALIG 428
F+G+P QPFG +G VC+ AHQ LA++AA+ GIVLLKN LPL P + ++A++G
Sbjct: 395 FDGDPAAQPFGHLGPQHVCTAAHQELAVEAARQGIVLLKNDGRALPLSPATARRAVAVVG 454
Query: 429 PNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSAS--IDKAVDIAK 486
P+A + ++GNYAG CR TPLQ + Y + PGC VAC+ + I AVD A+
Sbjct: 455 PHAEATVAMIGNYAGKPCRYTTPLQGVARYAARAAHQPGCTDVACAGSGQPIAAAVDAAR 514
Query: 487 GADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITF 546
AD +++ GLDQ E E LDR L+LPGRQ ELI+ VA+A+K PVILVL+ GGP+DI F
Sbjct: 515 RADATIVVAGLDQKIEAEGLDRASLLLPGRQAELISSVAKASKGPVILVLMSGGPIDIGF 574
Query: 547 AKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRP 605
A+ D I ILWAGYPG+AG A+A+VIFG HNPGG+LP+TWYPQDY+ KVPMT+M MR
Sbjct: 575 AQNDPKIAGILWAGYPGQAGGQAIADVIFGHHNPGGKLPVTWYPQDYLQKVPMTNMAMRA 634
Query: 606 QATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKF-KAVSQNKLYL----NQSSSTKMVE 660
G PGRTYRFY G + PFG GLSY+ +++ A SQ + L +S++ +
Sbjct: 635 NPAKGYPGRTYRFYTGPTIHPFGHGLSYTSFTHSIAHAPSQLTVRLSAHHAAASASASLN 694
Query: 661 NQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLF-------VKPARRGNG 713
+ + + CE + V + V+N GE G H VL++ A G+G
Sbjct: 695 ATARLSRAAAVRVAHARCEELRMPVHVDVRNVGERDGAHTVLVYAAAPASSAAEAAAGHG 754
Query: 714 RPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPIS 773
P++QLV F+ V + A A + + C+ LS A +G+ I G H L++G+ + ++
Sbjct: 755 APVRQLVAFEKVHVGAGGTARVEMGIDVCDGLSVADRNGVRRIPVGEHRLIIGELTHTVT 814
Query: 774 I 774
I
Sbjct: 815 I 815
>gi|85813772|emb|CAJ65922.1| xylan 1,4-beta-xylosidase [Populus tremula x Populus alba]
Length = 757
Score = 702 bits (1813), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/787 (48%), Positives = 499/787 (63%), Gaps = 68/787 (8%)
Query: 8 LVFPLLC-LCFTSLLTRVDSTQPPFSCDP-SNPSTETFPFCKTTLPISQRARDLVSRLTL 65
L F L+C L F+S + + P F+CD SNPS +F FC T+L +S R DLV RLTL
Sbjct: 17 LFFSLVCFLLFSSSHVVLAQSSPVFACDVVSNPSLASFGFCNTSLGVSDRVVDLVKRLTL 76
Query: 66 DEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAAS 125
EKI LVNSA ++ RLGIP YEWWSEALHGV+ VG G F+ + GATSFPQVILTAAS
Sbjct: 77 QEKILFLVNSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSSVVPGATSFPQVILTAAS 136
Query: 126 FDSYLWYRIG----QAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPL 181
F++ L+ IG Q + EARA+YN G A G+TFW+PNINIFRDPRWGRGQETPGEDPL
Sbjct: 137 FNTSLFVAIGKVISQVVSTEARAMYNVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPL 195
Query: 182 VTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDA-RVTMQ 240
++ KY YV+G+Q + G G L+ +ACCKH+TAYDLDNWKG RY F+A VT Q
Sbjct: 196 LSSKYGSGYVKGLQQR--DDGNPDG-LKVAACCKHYTAYDLDNWKGVDRYHFNAVVVTKQ 252
Query: 241 DLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHG--YITSD 298
D+ DT+QPPF+SCV G + +MC+YN+VNGIP+CAD +LLS R +W +G YI +D
Sbjct: 253 DMDDTFQPPFKSCVVDGNVASVMCSYNKVNGIPTCADPDLLSGVIRGEWKLNGYVYIVTD 312
Query: 299 CDAVSIIYDAEGYAKSPEDAVVDVLKA--GMDVNCGSFLQKHTKAAVKQKKLPESEIDRA 356
CD++ + Y+++ Y K+PE+A + A G+D+NCGSFL KHT+AAV + ES IDRA
Sbjct: 313 CDSIDVFYNSQHYTKTPEEAAAKAILAGIGLDLNCGSFLGKHTEAAVTAGLVNESAIDRA 372
Query: 357 LHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPL 416
+ N F+ MRLG F+G+P+ Q +GK+G VC+ +Q LA +AA+ GIVLLKN+
Sbjct: 373 VSNNFATLMRLGFFDGDPSKQLYGKLGPKDVCTAENQELAREAARQGIVLLKNT------ 426
Query: 417 PKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSA 476
G C+ TPLQ L V T Y PGC VACS+A
Sbjct: 427 --------------------------GTPCKYTTPLQGLAALVATT-YLPGCSNVACSTA 459
Query: 477 SIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVL 536
+D A IA AD VL+MG D + E E DRVD++LPG+QQ LIT VA A+ PVILV+
Sbjct: 460 QVDDAKKIAAAADATVLVMGADLSIEAESRDRVDILLPGQQQLLITAVANASTGPVILVI 519
Query: 537 LCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHN------PGGRLPMTWYP 590
+ GG +D++FAK + I SILW GYPGEAG A+A++IFG +N PGGRLPMTWYP
Sbjct: 520 MSGGGMDVSFAKTNDKITSILWVGYPGEAGGAAIADIIFGSYNPSTHQPPGGRLPMTWYP 579
Query: 591 QDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLY 649
Q Y+ KVPMT+M MRP ++G PGRTYRFY G+ V+ FG GLSYS++S++
Sbjct: 580 QSYVDKVPMTNMNMRPDPSNGYPGRTYRFYTGETVYSFGDGLSYSEFSHE---------- 629
Query: 650 LNQSSSTKMV--ENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKP 707
L Q+ V E V + + + F V + +KN G +G H V LF P
Sbjct: 630 LTQAPGLVSVPLEENHVCYSSECKSVAAAEQTCQNFDVHLRIKNTGTTSGSHTVFLFSTP 689
Query: 708 ARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGD 767
N P K LVGF+ V L+A+ + + F++ C+ LS E G + G H L +G
Sbjct: 690 PSVHN-SPQKHLVGFEKVFLHAQTDSHVGFKVDVCKDLSVVDELGSKKVALGEHVLHIGS 748
Query: 768 EEYPISI 774
++ +++
Sbjct: 749 LKHSMTV 755
>gi|242071935|ref|XP_002451244.1| hypothetical protein SORBIDRAFT_05g026400 [Sorghum bicolor]
gi|241937087|gb|EES10232.1| hypothetical protein SORBIDRAFT_05g026400 [Sorghum bicolor]
Length = 790
Score = 692 bits (1786), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 358/765 (46%), Positives = 488/765 (63%), Gaps = 40/765 (5%)
Query: 35 PSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEAL 94
P P+T T PFC+ +LP+ RARDLVSRLT EK+ LVN+A + RLG+ YEWWSEAL
Sbjct: 38 PGGPAT-TLPFCRQSLPLHARARDLVSRLTRAEKVRLLVNNAAGVARLGVGGYEWWSEAL 96
Query: 95 HGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGM 154
HGV+ G G+ F G GAT+FPQVI AA+ ++ LW IG+A+ EARA+YN G+A G+
Sbjct: 97 HGVSDTGPGVKFGGAFPGATAFPQVIGAAAALNATLWELIGRAVSDEARAMYNGGRA-GL 155
Query: 155 TFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACC 214
TFW+PN+NIFRDPRWGRGQETPGEDP ++ +YA +YVRG+Q ++ +LK +ACC
Sbjct: 156 TFWSPNVNIFRDPRWGRGQETPGEDPAISSRYAAAYVRGLQ-QPYDHNRLK----LAACC 210
Query: 215 KHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPS 274
KHFTAYDLD+W GT R+ F+A V+ QDL DT+ PF +CV GRA+ +MC+YN+VNG+P+
Sbjct: 211 KHFTAYDLDSWGGTDRFHFNAVVSPQDLEDTFNVPFRACVAGGRAASVMCSYNQVNGVPT 270
Query: 275 CADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSF 334
CAD+ L T R+ WG GYI SDCD+V + + + Y ++ EDAV L+AG+D++CG F
Sbjct: 271 CADQGFLRGTIRKAWGLDGYIVSDCDSVDVFFRDQHYTRTAEDAVAATLRAGLDLDCGPF 330
Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
L +T+ AV +KK+ ++++D AL N +V+MRLG+F+G+P PFG +GA VC+ AHQ
Sbjct: 331 LALYTENAVARKKVSDADVDAALLNTVTVQMRLGMFDGDPASGPFGHLGAADVCTKAHQD 390
Query: 395 LALQAAQDGIVLLKNSHG-------LLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
LAL AA+ +VLLKN G +LPL + +A++GP+A++ ++GNYAG CR
Sbjct: 391 LALDAARQSVVLLKNQRGRKHRDRDVLPLRPAAHRVVAVVGPHADATVAMIGNYAGKPCR 450
Query: 448 SITPLQALQNYVENTVYYPGCDTVACSSAS--IDKAVDIAKGADHVVLMMGLDQTQEKEE 505
TPLQ + Y V+ GC VAC + I AVD A+ L
Sbjct: 451 YTTPLQGVAAYAARVVHQAGCADVACQGKNQPIAAAVDAAR---------RLTPPSSSPG 501
Query: 506 LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEA 565
L R L+LPGRQ ELI+ VA+AAK PVILVL+ GGP+DI FA+ D I ILW GYPG+A
Sbjct: 502 LTR-SLLLPGRQAELISAVAKAAKGPVILVLMSGGPIDIAFAQNDPRIDGILWVGYPGQA 560
Query: 566 GAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEV 624
G A+A+VIFG HNPGG+LP+TWYPQDY+ KVPMT+M MR G PGRTYRFY G +
Sbjct: 561 GGQAIADVIFGQHNPGGKLPVTWYPQDYLEKVPMTNMAMRANPARGYPGRTYRFYTGPTI 620
Query: 625 FPFGCGLSYSKYSYKF-KAVSQNKLYLNQSSSTKMVENQ-----DVVHYKSVPELGTEFC 678
FG GLSY+++++ A +Q + L+ SS++ + + C
Sbjct: 621 HAFGHGLSYTQFTHTLAHAPAQLTVRLSTSSASASASASAASLLNATRPSRAVRVAHARC 680
Query: 679 ETRKFLVTIGVKNHGEMAGKHPVLLF-------VKPARRGNGRPIKQLVGFQSVILNAKE 731
E V + V+N G+ G H VL++ A G P +QLV F+ V + A
Sbjct: 681 EGLTVPVHVDVRNVGDRDGAHAVLVYHVAPSSSSSSAPAGTDAPARQLVAFEKVHVPAGG 740
Query: 732 KAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
A + + C+ LS A DG+ I G H L++G+ + +++ V
Sbjct: 741 VARVEMGIDVCDRLSVADRDGVRRIPVGEHRLMIGELTHSVTLGV 785
>gi|449433577|ref|XP_004134574.1| PREDICTED: probable beta-D-xylosidase 2-like [Cucumis sativus]
gi|449530107|ref|XP_004172038.1| PREDICTED: probable beta-D-xylosidase 2-like [Cucumis sativus]
Length = 812
Score = 688 bits (1775), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 353/763 (46%), Positives = 489/763 (64%), Gaps = 28/763 (3%)
Query: 31 FSCDPSNPST-----ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIP 85
F CDPS +F FC ++L +RA+DL+ R+TL EK +QL + A + RLG+P
Sbjct: 56 FVCDPSRYDKLGLDFSSFGFCDSSLSFPERAKDLIDRMTLSEKAAQLGHVASGVDRLGLP 115
Query: 86 AYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARAL 145
Y WWSEALHGV+ VG G F+ + GATSFP VI TA+SF+ LW IGQA+ EARA+
Sbjct: 116 PYNWWSEALHGVSNVGPGTQFDKVVPGATSFPNVITTASSFNEDLWKTIGQAVSTEARAM 175
Query: 146 YNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG--DTFNGGK 203
YN G+A G+T+W+P IN+ RDPRWGR ETPGEDP V GKYA +YVRG+Q + N
Sbjct: 176 YNLGRA-GLTYWSPTINVIRDPRWGRTVETPGEDPFVVGKYAKNYVRGLQDVEGSENVTD 234
Query: 204 LKGK-LQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGI 262
L + L+ S+CCKH+ AYD+DNW G RY FDARVT QD+ +T+ PFE CVK+G S +
Sbjct: 235 LNSRPLKVSSCCKHYAAYDVDNWLGVERYSFDARVTEQDMLETFNKPFEMCVKEGDVSSV 294
Query: 263 MCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSI-IYDAEGYAKSPEDAVVD 321
MC+YNRVNGIP+CAD LL T R WG HGYI SDCD+V + + DA + EDAV
Sbjct: 295 MCSYNRVNGIPTCADPVLLKDTIRGNWGLHGYIVSDCDSVKVMVEDAHYLQDTNEDAVAQ 354
Query: 322 VLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGK 381
LKAG+D++CG +T++ V+Q K+ ID AL+NL+ V MRLG F+GN + GK
Sbjct: 355 TLKAGLDLDCGQIYPNYTESTVRQGKVGMRNIDNALNNLYVVLMRLGYFDGNTGFESLGK 414
Query: 382 IGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNY 441
+CS H LA +AA+ G VLLKN + LP S +LA++GP+AN+ +LGNY
Sbjct: 415 PD---ICSDEHIELATEAARQGTVLLKNDNDTLPFDPSNYKTLAVVGPHANATSAMLGNY 471
Query: 442 AGPSCRSITPLQALQNYVENTVYYPGCDTVACSSAS-IDKAVDIAKGADHVVLMMGLDQT 500
AG CR +P+ L Y + Y GCD+VAC + + I A++ A+ +D V+ +G+D +
Sbjct: 472 AGVPCRMNSPMDGLSEYAK-VKYQMGCDSVACKNDTFIFGAMEAARTSDATVIFVGIDLS 530
Query: 501 QEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAG 560
E E LDRVDL+LPG Q +L+ +VA +K PV+LV+L G +D++FAK + NI +I+WAG
Sbjct: 531 IEAESLDRVDLLLPGYQTQLVQQVATVSKGPVVLVILSAGGIDVSFAKNNSNIKAIIWAG 590
Query: 561 YPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFY 619
YPGE G A+A+VIFG NPGGRLP+TWY DY+ ++PMT M +RP + G PGRTY+FY
Sbjct: 591 YPGEEGGRAIADVIFGKFNPGGRLPLTWYENDYVYQLPMTSMPLRPVKSLGYPGRTYKFY 650
Query: 620 EGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSV--PE----L 673
+G V+PFG GLSY+ + + + ++ + ++ S+ T+ +D+ + PE L
Sbjct: 651 DGPVVYPFGHGLSYTFFLHNLTS-AKRSIAIDLSNRTQC---RDIAYTNGTFKPECPAVL 706
Query: 674 GTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKA 733
+ T + + V+N GE G +L++ P + IKQ+VGFQ V L A +
Sbjct: 707 VDDLTCTEEIEFQMEVENTGERDGSQVLLVYSVPPGGISSTHIKQVVGFQRVFLKAGDSE 766
Query: 734 EIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEE--YPISI 774
+ F+L+ C+SL G ++ G H +VVGD E +P+ +
Sbjct: 767 TVTFKLNACKSLGLVDFTGYNLLPAGGHTIVVGDGEVSFPVEL 809
>gi|125576920|gb|EAZ18142.1| hypothetical protein OsJ_33692 [Oryza sativa Japonica Group]
Length = 618
Score = 682 bits (1759), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 318/622 (51%), Positives = 439/622 (70%), Gaps = 13/622 (2%)
Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCK 215
W+PN+NIFRDPRWGRGQETPGEDP KY ++V+G+QG + LQ SACCK
Sbjct: 1 MWSPNVNIFRDPRWGRGQETPGEDPATASKYGAAFVKGLQGSSLT------NLQTSACCK 54
Query: 216 HFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSC 275
H TAYD++ WKG +RY F+A+VT QDLADTY PPF SCV G+AS IMCAY +NG+P+C
Sbjct: 55 HITAYDIEEWKGVSRYNFNAKVTPQDLADTYNPPFRSCVVDGKASCIMCAYTLINGVPAC 114
Query: 276 ADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFL 335
A +LL+KT R +W GY SDCDAV+I++ +E + ++ E+AV LKAG+D+NCG ++
Sbjct: 115 ASSDLLTKTVRGEWKLDGYTASDCDAVAILHKSEHFTRTAEEAVAVALKAGLDINCGVYM 174
Query: 336 QKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPT-MQPFGKIGADVVCSPAHQV 394
Q++ +A++Q K+ E ++D+AL NLF++RMRLG F+G+P + +G++ A VC+P H+
Sbjct: 175 QQNAASALQQGKMTEKDVDKALKNLFAIRMRLGHFDGDPRGNKLYGRLSAADVCTPVHKA 234
Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
LAL+AA+ G+VLLKN LLPL S A+IG NAN LLGNY G C + TP
Sbjct: 235 LALEAARRGVVLLKNDARLLPLRAPTVASAAVIGHNANDILALLGNYYGLPCETTTPFGG 294
Query: 455 LQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLP 514
+Q YV++ + PGC + AC A+ D+A +AK +D+V L+MGL Q QE+E LDR L+LP
Sbjct: 295 IQKYVKSAKFLPGCSSAACDVAATDQATALAKSSDYVFLVMGLSQKQEQEGLDRTSLLLP 354
Query: 515 GRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVI 574
G+QQ LIT VA A+K+PVIL+LL GGPVDITFA+ + IG+ILWAGYPG+AG A+A+V+
Sbjct: 355 GKQQALITAVATASKRPVILILLTGGPVDITFAQTNPKIGAILWAGYPGQAGGQAIADVL 414
Query: 575 FGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYS 634
FG+ NP G+LP+TWYP+++ K MTDM+MRP +G PGR+YRFY+GK V+ FG GLSYS
Sbjct: 415 FGEFNPSGKLPVTWYPEEFTKFTMTDMRMRPDPATGYPGRSYRFYKGKTVYKFGYGLSYS 474
Query: 635 KYSYKFKAVSQNKLYLNQSSSTKM----VENQDVVHYKSVPELGTEFCETRKFLVTIGVK 690
K++ + + + N +++ + D V+ V E+G + CE +F V + V+
Sbjct: 475 KFACRIVSGAGNSSSYGKAALAGLRAATTPEGDAVY--RVDEIGDDRCERLRFPVMVEVQ 532
Query: 691 NHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRARE 750
NHG M GKH VL+FV+ + GRP++QL+GF++ L EK ++ E+SPCE LSRAR
Sbjct: 533 NHGPMDGKHTVLMFVRWSSTDGGRPVRQLIGFRNQHLKVGEKKKLKMEISPCEHLSRARV 592
Query: 751 DGLMVIEEGTHFLVVGDEEYPI 772
DG VI+ G+HFL+V ++E I
Sbjct: 593 DGEKVIDRGSHFLMVEEDELEI 614
>gi|255557375|ref|XP_002519718.1| Beta-glucosidase, putative [Ricinus communis]
gi|223541135|gb|EEF42691.1| Beta-glucosidase, putative [Ricinus communis]
Length = 802
Score = 675 bits (1742), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 341/742 (45%), Positives = 480/742 (64%), Gaps = 14/742 (1%)
Query: 42 TFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVG 101
TF FC ++L RA+DLV+++TL EK+ QL + A +PRLGIP YEWWSEALHGV+ VG
Sbjct: 65 TFGFCDSSLSYEVRAKDLVNQMTLKEKVQQLGDLAYGVPRLGIPKYEWWSEALHGVSDVG 124
Query: 102 KGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNI 161
G FF+ + GATSFP ILT ASF+ LW IGQA +ARA+YN G+A G+T+W+PN+
Sbjct: 125 PGTFFDDLVPGATSFPTTILTTASFNESLWKNIGQA-SAKARAMYNLGRA-GLTYWSPNV 182
Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG--DTFNGGKLKGK-LQASACCKHFT 218
N+ RDPRWGR ETPGEDP V G+YAV+YVRG+Q T N L + L+ S+CCKH+
Sbjct: 183 NVVRDPRWGRTVETPGEDPYVVGRYAVNYVRGLQDVEGTENYTDLNTRPLKVSSCCKHYA 242
Query: 219 AYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADR 278
AYD++ W+G R FDARVT QD+ +T+ PFE CVK+G S +MC++NRVNGIP+CAD
Sbjct: 243 AYDVEKWQGVERLTFDARVTEQDMVETFLRPFEMCVKEGDVSSVMCSFNRVNGIPTCADP 302
Query: 279 NLLSKTARRQWGFHGYITSDCDAVSIIYDAEGY-AKSPEDAVVDVLKAGMDVNCGSFLQK 337
LL++T R W HGYI SDCD++ ++ D + + EDAV VLKAG+D++CG +
Sbjct: 303 KLLNQTIRGDWDLHGYIVSDCDSIEVMVDNHKFLGDTNEDAVAQVLKAGLDLDCGGYYTN 362
Query: 338 HTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLAL 397
T+ +VKQ K E IDR+L L+ V MRLG F+G P Q K+G +C+ + LA
Sbjct: 363 FTETSVKQGKAREEYIDRSLKYLYVVLMRLGFFDGTPQYQ---KLGKKDICTKENVELAK 419
Query: 398 QAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQN 457
QAA++GIVLLKN+ LPL K +LA++GP+AN+ + ++GNYAG CR ++P+
Sbjct: 420 QAAREGIVLLKNND-TLPLSMDKVKNLAVVGPHANATRVMIGNYAGVPCRYVSPIDGFSI 478
Query: 458 YVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQ 517
Y N Y GCD + + + AV AK AD +++ GLD T E E LDR DL+LPG Q
Sbjct: 479 Y-SNVTYEIGCDVPCKNESLVFPAVHAAKNADATIIVAGLDLTIEAEGLDRNDLLLPGYQ 537
Query: 518 QELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGD 577
+LI +VA AA PVILV++ G VDI+FA+ + I +ILW GYPG+ G A+A+V+FG
Sbjct: 538 TQLINQVAGAANGPVILVIMAAGGVDISFARDNEKIKAILWVGYPGQEGGHAIADVVFGK 597
Query: 578 HNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKY 636
+NPGGRLP+TWY D++ +VPMT M++RP G PG+TY+FY+G V+PFG GLSY+ +
Sbjct: 598 YNPGGRLPITWYEADFVEQVPMTYMQLRPDEELGYPGKTYKFYDGSTVYPFGYGLSYTTF 657
Query: 637 SYKFKAVSQNK-LYLNQSSSTKMVENQDVVHYKSVPELGTEFCETR-KFLVTIGVKNHGE 694
SY + ++K + LN+ + + + S P + T+ F + + V+N G
Sbjct: 658 SYNITSAKRSKHIALNKFQHCRDLRYGNETFKPSCPAVLTDHLPCNDDFELEVEVENTGS 717
Query: 695 MAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLM 754
G V+++ K G IKQ++GF+ V + A ++ F + C+S +
Sbjct: 718 RDGSEVVMVYSKTPEGIVGSYIKQVIGFKRVFVQAGSVEKVNFRFNVCKSFRIIDYNAYS 777
Query: 755 VIEEGTHFLVVGDEEYPISIFV 776
++ G H ++VGD+ I +++
Sbjct: 778 ILPSGGHTIMVGDDIVSIPLYI 799
>gi|163889365|gb|ABY48135.1| beta-D-xylosidase [Medicago truncatula]
Length = 776
Score = 675 bits (1741), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 360/796 (45%), Positives = 495/796 (62%), Gaps = 64/796 (8%)
Query: 12 LLCLCFTSLLTRVDS--TQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKI 69
++ F+ LL + T P + C P + +PFC +LPIS R L+S LTL +KI
Sbjct: 9 IIIFLFSLLLIHLPKFFTTPDYPCKPPH---SHYPFCNISLPISTRTTSLISLLTLSDKI 65
Query: 70 SQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSY 129
+QL N+A +I LGIP+Y+WWSEALHG+A G G+ FNG+++ AT+FPQVI++AA+F+
Sbjct: 66 NQLSNTASSISHLGIPSYQWWSEALHGIATNGPGVNFNGSVKSATNFPQVIVSAAAFNRS 125
Query: 130 LWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVS 189
LW+ IG A+G+E RA++N GQA G++FWAPN+N+FRDPRWGRGQETPGEDP+V YAV
Sbjct: 126 LWFLIGYAVGVEGRAMFNVGQA-GLSFWAPNVNVFRDPRWGRGQETPGEDPMVGSAYAVE 184
Query: 190 YVRGVQG-----------DTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVT 238
+VRG+QG D+ + G L SACCKHFTAYDL+ W +RY F+A V
Sbjct: 185 FVRGIQGVDGIKKVLNDHDSDDDG-----LMVSACCKHFTAYDLEKWGEFSRYNFNAVV- 238
Query: 239 MQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGY-ITS 297
+TYQPPF CV+QG+AS +MC+YN VNG+P+CA ++LL R +WGF G I
Sbjct: 239 -----NTYQPPFRGCVQQGKASCLMCSYNEVNGVPACASKDLLG-LVRNKWGFEGVGILP 292
Query: 298 DCDAVSIIY-DAEGYAKSPEDAVVDVLKA-----------GMDVNCGSFLQKHTKAAVKQ 345
+ +++ + P+ ++ LK MD+NCG+F+ +HT++A++Q
Sbjct: 293 QTVMLWLLFLSIKSMQNLPKMLLLMFLKQVFFYVFENLWFCMDINCGTFMLRHTESAIEQ 352
Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
+ E ++DRAL NLFSV+MRLGLFNG+P FGK+G VC+P H+ LAL+AA+ GIV
Sbjct: 353 GLVKEEDLDRALFNLFSVQMRLGLFNGDPEKGKFGKLGPQDVCTPEHKKLALEAARQGIV 412
Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYY 465
LLKN + LPL K VSLA+IGP A +++ L G Y+G C + L+ YV+ Y
Sbjct: 413 LLKNDNKFLPLDKKDRVSLAIIGPMATTSE-LGGGYSGIPCSPRSLYDGLKEYVKTISYA 471
Query: 466 PGCDTVACSSAS-IDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRV 524
GC V C S A+DIAK AD VV++ GLD T E E+LDRV L+LPG+Q +L++RV
Sbjct: 472 FGCSDVKCDSDDGFAVAIDIAKQADFVVIVAGLDTTLETEDLDRVSLLLPGKQMDLVSRV 531
Query: 525 AEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRL 584
A A+K+PVILVL GGP+D++FA+ ++ I SILW GYP D + GRL
Sbjct: 532 AAASKRPVILVLTGGGPLDVSFAESNQLITSILWIGYP-------------VDFDAAGRL 578
Query: 585 PMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVS 644
PMTWYP+ + VPM DM MR + G PGRTYRFY G ++ FG GLSYS +SY+ + +
Sbjct: 579 PMTWYPESFTNVPMNDMGMRADPSRGYPGRTYRFYTGSRIYGFGHGLSYSDFSYRVLS-A 637
Query: 645 QNKLYLNQSSSTKMVEN------QDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGK 698
+KL L+++++ + + +DV V + C + F V I V N G+M G
Sbjct: 638 PSKLSLSKTTNGGLRRSLLNKVEKDVFEVDHVHVDELQNCNSLSFSVHISVMNVGDMDGS 697
Query: 699 HPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEE 758
H V+LF K + G P QLVG + + + E PCE S A E G ++
Sbjct: 698 HVVMLFSKWPKNIQGSPESQLVGPSRLHTVSNKSIETSILADPCEHFSFADEQGKRILPL 757
Query: 759 GTHFLVVGDEEYPISI 774
G H L VGD E+ +SI
Sbjct: 758 GNHILNVGDVEHIVSI 773
>gi|85813770|emb|CAJ65921.1| xylan 1,4-beta-xylosidase [Populus tremula x Populus alba]
Length = 704
Score = 672 bits (1735), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 355/669 (53%), Positives = 459/669 (68%), Gaps = 40/669 (5%)
Query: 8 LVFPLLCLCFTSLLTRVDS-TQPPFSCDP-SNPSTETFPFCKTTLPISQRARDLVSRLTL 65
+VF L C+ L T V + + P F+CD SNPS + FC T++ I+ R DLV RLTL
Sbjct: 15 VVFLLFCMFLVFLSTHVSAQSSPVFACDVVSNPSLASLGFCNTSIGINDRVVDLVKRLTL 74
Query: 66 DEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAAS 125
EKI LVNSA + RLGIP YEWWSEALHGV+ VG G F+ + GATSFPQVILTAAS
Sbjct: 75 QEKIVFLVNSAGNVSRLGIPKYEWWSEALHGVSYVGPGTHFSDDVAGATSFPQVILTAAS 134
Query: 126 FDSYLWYRIG-----QAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDP 180
F++ L+ IG Q + EARA+YN G A G+TFW+PNINIFRDPRWGRGQETPGEDP
Sbjct: 135 FNTSLFEAIGKVYYTQVVSTEARAMYNVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDP 193
Query: 181 LVTGKYAVSYVRGVQ----GDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDAR 236
L++ KY YV+G+Q GD KL+ +ACCKH+TAYDLDNWKG+ RY F+A
Sbjct: 194 LLSSKYGSCYVKGLQQRDDGD-------PDKLKVAACCKHYTAYDLDNWKGSDRYHFNAV 246
Query: 237 V-TMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGY- 294
V T QD+ DT+QPPF+SCV G + +MC+YN+VNG P+CAD +LLS R +W +GY
Sbjct: 247 VVTKQDMDDTFQPPFKSCVIDGNVASVMCSYNQVNGKPTCADPDLLSGVIRGEWNLNGYQ 306
Query: 295 ------ITSDCDAVSIIYDAEGYAKSPEDAVV------DVLKAGMDVNCGSFLQKHTKAA 342
I +DCD++ + Y ++ Y K+PE+A + L G+D+NCGSFL +HT+AA
Sbjct: 307 WGCCRYIVTDCDSLDVFYKSQNYTKTPEEAAAAAILAGNSLVTGVDLNCGSFLGQHTEAA 366
Query: 343 VKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQD 402
VK + E ID A+ N F+ MRLG F+G+P+ Q +GK+G VC+ +Q LA +AA+
Sbjct: 367 VKGGLVNEHAIDIAVSNNFATLMRLGFFDGDPSKQLYGKLGPKDVCTAENQELAREAARQ 426
Query: 403 GIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPS-CRSITPLQALQNYVEN 461
GIVLLKN+ G LPL + +LA+IGPNAN KT++GNY G + C+ TPLQ L V
Sbjct: 427 GIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGGTPCKYTTPLQGLAASVA- 485
Query: 462 TVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELI 521
T Y PGC VACS+A +D A +A AD VL+MG D + E E DRVD++LPG+QQ LI
Sbjct: 486 TTYLPGCSNVACSTAQVDDAKKLAAAADATVLVMGADLSIEAESRDRVDVLLPGQQQLLI 545
Query: 522 TRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
T VA + PVILV++ GG +D++FA+ + I SILW GYPGEAG A+A++IFG +NP
Sbjct: 546 TAVANVSCGPVILVIMSGGGMDVSFARTNDKITSILWVGYPGEAGGAAIADIIFGYYNPS 605
Query: 582 ----GRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKY 636
GRLPMTWYPQ Y+ KVPMT+M MRP ++G PGRTYRFY G+ V+ FG GLSYS++
Sbjct: 606 THQPGRLPMTWYPQSYVDKVPMTNMNMRPDPSNGYPGRTYRFYTGETVYSFGDGLSYSQF 665
Query: 637 SYKFKAVSQ 645
+++ Q
Sbjct: 666 THELIQAPQ 674
>gi|255590044|ref|XP_002535159.1| Thermostable beta-glucosidase B, putative [Ricinus communis]
gi|223523880|gb|EEF27223.1| Thermostable beta-glucosidase B, putative [Ricinus communis]
Length = 449
Score = 661 bits (1706), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 308/450 (68%), Positives = 378/450 (84%), Gaps = 1/450 (0%)
Query: 327 MDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADV 386
MDVNCG++L+ +TK+AV++KK+ ESEIDRALHNLFS+RMRLGLFNGNPT P+G I AD
Sbjct: 1 MDVNCGNYLKNYTKSAVEKKKVSESEIDRALHNLFSIRMRLGLFNGNPTKLPYGDISADQ 60
Query: 387 VCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSC 446
VCS HQ +AL+AA+DGIVLLKNS+ LLPL KSK+ SLA+IGPNA+++ L+GNYAGP C
Sbjct: 61 VCSQEHQAVALEAARDGIVLLKNSNQLLPLSKSKTTSLAIIGPNADNSTILVGNYAGPPC 120
Query: 447 RSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL 506
+++TP Q LQNY++ T Y+PGC TVACSSA+ID+A+ IAK AD VVL+MGLDQTQE+EE
Sbjct: 121 KTVTPFQGLQNYIKTTKYHPGCSTVACSSAAIDQAIKIAKEADQVVLVMGLDQTQEREEH 180
Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
DRVDLVLPG+QQELI VA AAKKPV+LVLLCGGPVDI+FAKYDRNIG ILWAGYPGEAG
Sbjct: 181 DRVDLVLPGKQQELIISVARAAKKPVVLVLLCGGPVDISFAKYDRNIGGILWAGYPGEAG 240
Query: 567 AVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFP 626
+ALAE+IFG+HNPGGRLP+TWYPQD+ KVPMTDM+MRPQ +SG PGRTYRFY+GK+VF
Sbjct: 241 GIALAEIIFGNHNPGGRLPVTWYPQDFTKVPMTDMRMRPQPSSGYPGRTYRFYKGKKVFE 300
Query: 627 FGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVT 686
FG GLSYS YSY+ +V+QNK+ L +SS + EN + YK++ E+ E CE KF VT
Sbjct: 301 FGYGLSYSNYSYELVSVTQNKISL-RSSIDQKAENSSPIGYKTISEIEEELCERSKFSVT 359
Query: 687 IGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLS 746
+ VKN GEM GKHPVLLF + + G+G PIK+L+ FQSV LNA E AEI ++++PCE LS
Sbjct: 360 VRVKNQGEMTGKHPVLLFARQDKPGSGGPIKKLIAFQSVKLNAGENAEIEYKVNPCEHLS 419
Query: 747 RAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
RA EDGLMV+EEG+ +L+VGD+EYPI+I +
Sbjct: 420 RANEDGLMVMEEGSQYLLVGDKEYPINITI 449
>gi|225432136|ref|XP_002274651.1| PREDICTED: probable beta-D-xylosidase 5-like [Vitis vinifera]
Length = 809
Score = 654 bits (1686), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 334/753 (44%), Positives = 469/753 (62%), Gaps = 22/753 (2%)
Query: 31 FSCDPSNPST-----ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIP 85
+ CD S + + F +C ++ P RA+DLV R+TL EK+ Q + A + R+G+P
Sbjct: 51 YVCDESRFAALGLDMKDFHYCDSSSPYEVRAKDLVDRMTLSEKVMQTGDQASGVERIGLP 110
Query: 86 AYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARAL 145
Y WWSEALHGV+ G+ +FF+ + GATSFP VIL+AASF+ LW +GQA+ EARA+
Sbjct: 111 KYNWWSEALHGVSNFGRCVFFDEVVPGATSFPTVILSAASFNQSLWKTLGQAVSTEARAM 170
Query: 146 YNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG--DTFNGGK 203
YN+G A G+TFW+PNIN+ RDPRWGR ETPGEDP + G YAV+YVRG+Q N
Sbjct: 171 YNSGNA-GLTFWSPNINVVRDPRWGRILETPGEDPHLVGLYAVNYVRGLQDVVGAENTTD 229
Query: 204 LKGK-LQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGI 262
L + L+ S+CCKH+ AYDLDNWKG R FDARV++QD+A+T+ PFE CVK+G S +
Sbjct: 230 LNSRPLKVSSCCKHYAAYDLDNWKGADRVHFDARVSVQDMAETFVLPFEMCVKEGDVSSV 289
Query: 263 MCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSII-YDAEGYAKSPEDAVVD 321
MC+YN++NGIPSCAD LL +T R +W HGYI SDCD+V ++ D + S D+
Sbjct: 290 MCSYNKINGIPSCADSRLLKQTIRGEWDLHGYIVSDCDSVEVMAVDQKWLDSSFSDSAAQ 349
Query: 322 VLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGK 381
L AGM+++CG+F + AV Q K ++++D +L L+ + MR+G F+G P F
Sbjct: 350 ALNAGMNLDCGTFNNRSLTEAVNQGKANQADLDHSLRYLYVLLMRVGFFDGIPA---FAS 406
Query: 382 IGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNY 441
+G D +CS H LA +AA+ GIVLLKN + LPL K++ AL+GP+AN+ ++GNY
Sbjct: 407 LGKDDICSAEHIELAREAARQGIVLLKNDNATLPLKSVKNI--ALVGPHANATDAMIGNY 464
Query: 442 AGPSCRSITPLQALQNYVENTVYYPGCDTVACSSAS-IDKAVDIAKGADHVVLMMGLDQT 500
AG C ++PL A + E Y GC V C + + I A++ AK AD ++ G D +
Sbjct: 465 AGIPCYYVSPLDAFSSMGE-VRYEKGCADVQCLNETYIFNAMEAAKRADATIIFAGTDLS 523
Query: 501 QEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAG 560
E E LDRVDL+LPG Q +LI +VA+ + PV+LV++ GG VDI+FA+ + I +ILWAG
Sbjct: 524 IEAEALDRVDLLLPGYQTQLINQVADLSTGPVVLVIMSGGGVDISFARDNPKIAAILWAG 583
Query: 561 YPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK-VPMTDMKMRPQATSGNPGRTYRFY 619
YPGE G A+A+VI G +NPGGRLP+TWY DY+ +PMT M +RP + G PGRTY+F+
Sbjct: 584 YPGEQGGNAIADVILGKYNPGGRLPITWYEADYVDMLPMTSMALRPVDSLGYPGRTYKFF 643
Query: 620 EGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTK---MVENQDVVHYKSVPELGTE 676
G V+PFG G+SY+ +SY + SQ +N + MV D L +
Sbjct: 644 NGSTVYPFGYGMSYTNFSYSL-STSQRWTNINLRKLQRCRSMVYINDTFVPDCPAVLVDD 702
Query: 677 FCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIV 736
+ VKN G M G V+++ P G IK++VGF+ V + ++
Sbjct: 703 LSCKESIEFEVAVKNVGRMDGSEVVVVYSSPPLGIAGTHIKKVVGFERVFVKVGGTEKVK 762
Query: 737 FELSPCESLSRAREDGLMVIEEGTHFLVVGDEE 769
F ++ C+SL G ++ G+H + VG +
Sbjct: 763 FSMNVCKSLGIVDSTGYALLPSGSHTIKVGGDN 795
>gi|125576923|gb|EAZ18145.1| hypothetical protein OsJ_33695 [Oryza sativa Japonica Group]
Length = 591
Score = 652 bits (1681), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 311/593 (52%), Positives = 418/593 (70%), Gaps = 8/593 (1%)
Query: 182 VTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQD 241
+ KYAV++V+G+QG++ LQ SACCKH TAYDL++W G RY F+A+VT QD
Sbjct: 1 MASKYAVAFVKGMQGNS------SAILQTSACCKHVTAYDLEDWNGVQRYNFNAKVTAQD 54
Query: 242 LADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDA 301
L DTY PPF SCV +A+ IMCAY +NG+P+CA+ +LL+KT R WG GYI SDCDA
Sbjct: 55 LEDTYNPPFRSCVVDAKATCIMCAYTGINGVPACANADLLTKTVRGDWGLDGYIASDCDA 114
Query: 302 VSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLF 361
V+I+ DA+ Y ++PEDAV LKAG+D+NCG+++Q+H AA++Q KL E +ID+AL NLF
Sbjct: 115 VAIMRDAQRYTQTPEDAVAVALKAGLDMNCGTYMQQHATAAIQQGKLTEEDIDKALKNLF 174
Query: 362 SVRMRLGLFNGNP-TMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSK 420
++RMRLG F+G+P + +G +GA +C+P H+ LAL+AA DGIVLLKN G+LPL ++
Sbjct: 175 AIRMRLGHFDGDPRSNSVYGGLGAADICTPEHRSLALEAAMDGIVLLKNDAGILPLDRTA 234
Query: 421 SVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDK 480
S A+IGPNAN L+GNY GP C S TPL + Y++N + GC++ AC A+ D+
Sbjct: 235 VASAAVIGPNANDGLALIGNYFGPPCESTTPLNGILGYIKNVRFLAGCNSAACDVAATDQ 294
Query: 481 AVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGG 540
A +A +D+V L MGL Q QE E DR L+LPG QQ LIT VA+AAK+PVILVLL GG
Sbjct: 295 AAAVASSSDYVFLFMGLSQKQESEGRDRTSLLLPGEQQSLITAVADAAKRPVILVLLTGG 354
Query: 541 PVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTD 600
PVD+TFA+ + IG+ILWAGYPG+AG +A+A V+FGDHNPGGRLP+TWYP+++ KVPMTD
Sbjct: 355 PVDVTFAQTNPKIGAILWAGYPGQAGGLAIARVLFGDHNPGGRLPVTWYPEEFTKVPMTD 414
Query: 601 MKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQ-NKLYLNQSSSTKMV 659
M+MR +G PGR+YRFY+GK V+ FG GLSYS YS + + + + Y N +S +
Sbjct: 415 MRMRADPATGYPGRSYRFYQGKTVYKFGYGLSYSSYSRQLVSGGKPAESYTNLLASLRTT 474
Query: 660 ENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQL 719
+ + E+GT+ CE KF + V+NHG M GKH VL++++ GRP QL
Sbjct: 475 TTSEGDESYHIEEIGTDGCEQLKFPAVVEVQNHGPMDGKHSVLMYLRWPNAKGGRPTTQL 534
Query: 720 VGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPI 772
+GF+S L EKA I F++SPCE SR R+DG VI+ G+H+L+V +E I
Sbjct: 535 IGFRSQHLKVGEKANIRFDISPCEHFSRVRKDGKKVIDRGSHYLMVDKDELEI 587
>gi|222629651|gb|EEE61783.1| hypothetical protein OsJ_16354 [Oryza sativa Japonica Group]
Length = 771
Score = 645 bits (1664), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 349/748 (46%), Positives = 462/748 (61%), Gaps = 63/748 (8%)
Query: 75 SAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRI 134
S A+PRLGIPAYEWWSEALHGV+ VG G F+ + GATSFPQ ILTAASF++ L+ I
Sbjct: 41 SGRALPRLGIPAYEWWSEALHGVSYVGPGTRFSTLVPGATSFPQPILTAASFNASLFRAI 100
Query: 135 GQA------------------------------------------IGLEARALYNAGQAI 152
G++ + EARA++N G A
Sbjct: 101 GESACNNTSQFFFSSKSPFSICIAMENLHCDFRSRLVRFYRGARVVSTEARAMHNVGLA- 159
Query: 153 GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASA 212
G+TFW+PNINIFRDPRWGRGQETPGEDPL+ KYAV YV G+Q + G L+ +A
Sbjct: 160 GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQ----DAGGGSDALKVAA 215
Query: 213 CCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGI 272
CCKH+TAYD+DNWKG RY FDA V+ QDL DT+QPPF+SCV G + +MC+YN+VNG
Sbjct: 216 CCKHYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQPPFKSCVIDGNVASVMCSYNKVNGK 275
Query: 273 PSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG 332
P+CAD++LLS R W +GYI SDCD+V ++Y+ + Y K+PEDA +K+G+D+NCG
Sbjct: 276 PTCADKDLLSGVIRGDWKLNGYIVSDCDSVDVLYNNQHYTKNPEDAAAITIKSGLDLNCG 335
Query: 333 SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAH 392
+FL +HT AAV+ KL ES++DRA+ N F V MRLG F+G+P PFG +G VC+ ++
Sbjct: 336 NFLAQHTVAAVQAGKLSESDVDRAITNNFIVLMRLGFFDGDPRKLPFGSLGPKDVCTSSN 395
Query: 393 QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
Q LA +AA+ GIVLLKN+ G LPL S+A+IGPNAN++ T++GNY G C+ TPL
Sbjct: 396 QELAREAARQGIVLLKNT-GALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPCKYTTPL 454
Query: 453 QALQNYVENTVYYPGCDTVACSSASID--KAVDIAKGADHVVLMMGLDQTQEKEELDRVD 510
Q L V TVY PGC V CS S+ A A AD VL++G DQ+ E+E LDR
Sbjct: 455 QGLGANVA-TVYQPGCTNVGCSGNSLQLSAATQAAASADVTVLVVGADQSVERESLDRTS 513
Query: 511 LVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVAL 570
L+LPG+Q +L++ VA A++ PVILV++ GGP DI+FAK I +ILW GYP +
Sbjct: 514 LLLPGQQPQLVSAVANASRGPVILVVMSGGPFDISFAKSSDKISAILWVGYPRRSRWRRP 573
Query: 571 AEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGC 629
LP+TWYP + KV MTDM+MRP +++G PGRTYRFY G V+ FG
Sbjct: 574 RRHPLRIPQ--SWLPVTWYPASFADKVSMTDMRMRPDSSTGYPGRTYRFYTGDTVYAFGD 631
Query: 630 GLSYSKYSYKF-KAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIG 688
GLSY+K+++ A Q + L + + H SV G E C + F V +
Sbjct: 632 GLSYTKFAHSLVSAPEQVAVQLAEGHACH------TEHCFSVEAAG-EHCGSLSFDVHLR 684
Query: 689 VKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRA 748
V+N G MAG H V LF P + P K L+GF+ V L + + F++ C+ LS
Sbjct: 685 VRNAGGMAGGHTVFLFSSPPSV-HSAPAKHLLGFEKVSLEPGQAGVVAFKVDVCKDLSVV 743
Query: 749 REDGLMVIEEGTHFLVVGDEEYPISIFV 776
E G + G+H L VGD ++ +++ V
Sbjct: 744 DELGNRKVALGSHTLHVGDLKHTLNLRV 771
>gi|293336530|ref|NP_001167905.1| uncharacterized protein LOC100381616 [Zea mays]
gi|223944757|gb|ACN26462.1| unknown [Zea mays]
Length = 630
Score = 645 bits (1663), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 322/635 (50%), Positives = 438/635 (68%), Gaps = 12/635 (1%)
Query: 145 LYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKL 204
++NAGQA G+T+WAPNINIFRDPRWGRGQET GEDP V Y++ YV+G QG+
Sbjct: 1 MHNAGQA-GLTYWAPNINIFRDPRWGRGQETSGEDPAVAAAYSLEYVKGFQGEE----GE 55
Query: 205 KGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMC 264
+G+++ SACCKH+TAYD++ W+G +RY F+A+V QDL DTYQPPF++C+++ RAS +MC
Sbjct: 56 EGRIRLSACCKHYTAYDMEKWEGFSRYTFNAKVNAQDLEDTYQPPFKTCIQEARASCLMC 115
Query: 265 AYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLK 324
AYN+VNG+P CA ++LL KT R +WGF GYITSDCDAV+II++ + Y KS ED++ VLK
Sbjct: 116 AYNQVNGVPMCAHKDLLQKT-RDEWGFQGYITSDCDAVAIIHENQTYTKSGEDSIAIVLK 174
Query: 325 AGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGA 384
AGMD+NCGSFL +HTK+A+++ K+ E +IDRAL NLFSV++RLG+F+ Q F ++G
Sbjct: 175 AGMDINCGSFLVRHTKSAIEKGKIQEEDIDRALFNLFSVQLRLGIFDKPSNNQWFSQLGP 234
Query: 385 DVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGP 444
+ VC+ H+ LA +A + G VLLKN H LPL +S+ +A+IGP+AN A + G+Y G
Sbjct: 235 NSVCTKEHRELAAEAVRQGAVLLKNDHNFLPLKRSEVRHVAIIGPSANDAYAMGGDYTGV 294
Query: 445 SCRSITPLQALQNYVENTVYYPGCDTVACSSASI-DKAVDIAKGADHVVLMMGLDQTQEK 503
C T L+ +Q Y T + PGC +C+S + +AV+ AK AD VV++ GL+ T+E+
Sbjct: 295 PCNPTTFLKGIQAYATQTSFAPGCKDASCNSTDLFGEAVEAAKRADIVVVIAGLNLTEER 354
Query: 504 EELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPG 563
E+ DRV L+LPG+Q LI +A AKKP++LVLL GGPVD++FAK D I SILW GYPG
Sbjct: 355 EDFDRVSLLLPGKQMGLIHAIASVAKKPLVLVLLGGGPVDVSFAKQDPRIASILWLGYPG 414
Query: 564 EAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKE 623
E G L E++FG++NPGG+LP+TWYP+ + +PMTDM MR + G PGRTYRFY G
Sbjct: 415 EVGGQVLPEILFGEYNPGGKLPITWYPESFTAIPMTDMNMRADPSRGYPGRTYRFYTGDV 474
Query: 624 VFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYK----SVPELGTEFCE 679
V+ FG GLSYSKYSY + + K+ +++SS ++ + + SV CE
Sbjct: 475 VYGFGYGLSYSKYSYSISSAPK-KITVSRSSDLGIISRKPAYTRRDGLGSVKTEDIASCE 533
Query: 680 TRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFEL 739
F V + V NHG M G H VLLF + G PIKQLVGF+SV A + + +
Sbjct: 534 ALVFSVHVAVSNHGSMDGSHAVLLFARSKSSVPGFPIKQLVGFESVHTAAGSASNVEITV 593
Query: 740 SPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
PC+ +S A +G V+ G H L VGDEE+ +SI
Sbjct: 594 DPCKQMSAANPEGKRVLLLGAHVLTVGDEEFELSI 628
>gi|224093292|ref|XP_002309869.1| predicted protein [Populus trichocarpa]
gi|222852772|gb|EEE90319.1| predicted protein [Populus trichocarpa]
Length = 694
Score = 643 bits (1659), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 327/722 (45%), Positives = 463/722 (64%), Gaps = 39/722 (5%)
Query: 58 DLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFP 117
DLV+++TL+EK+ QL N A +PRLG+ Y+WWSEALHGV+ VG G FF+ I G+TSFP
Sbjct: 2 DLVNQMTLNEKVLQLGNKAYGVPRLGLAEYQWWSEALHGVSNVGPGTFFDDLIPGSTSFP 61
Query: 118 QVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPG 177
VI TAA+F+ LW IGQA+ EARA+YN G+A G+T+W+PNIN+ RDPRWGR ETPG
Sbjct: 62 TVITTAAAFNESLWKVIGQAVSTEARAMYNLGRA-GLTYWSPNINVVRDPRWGRAIETPG 120
Query: 178 EDPLVTGKYAVSYVRGVQ---GDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFD 234
EDP + G+YAV+YVRG+Q G L+ S+CCKH+ AYD+DNWKG RY FD
Sbjct: 121 EDPYLVGRYAVNYVRGLQDVEGSENYTDPNSRPLKVSSCCKHYAAYDVDNWKGVERYTFD 180
Query: 235 ARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGY 294
ARV+ QD+ +T+ PFE CVK G S +MC+YNRVNGIP+CAD LL++T R W HGY
Sbjct: 181 ARVSEQDMVETFLRPFEMCVKDGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWDLHGY 240
Query: 295 ITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEID 354
I SDCD++ ++ + + G+D++CG++ ++ +AAV+Q K+ E++ID
Sbjct: 241 IVSDCDSLQVMVENHKWL-------------GLDLDCGAYYTENVEAAVRQGKVREADID 287
Query: 355 RALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLL 414
++L+ L+ V MRLG F+G P FGK + VCS + LA +AA++G VLLKN + L
Sbjct: 288 KSLNFLYVVLMRLGFFDGIPQYNSFGK---NDVCSKENIELATEAAREGAVLLKNENDSL 344
Query: 415 PLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACS 474
PL K +LA+IGP++N+ ++GNYAG C+ ITP++ L Y + Y GC +AC
Sbjct: 345 PLSIEKVKTLAVIGPHSNATSAMIGNYAGIPCQIITPIEGLSKYAK-VDYQMGCSDIACK 403
Query: 475 SAS-IDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVI 533
S I A++ AK AD +++ G+D + E E LDR DL+LPG Q +LI +VA + PV+
Sbjct: 404 DESFIFPAMESAKKADATIILAGIDLSIEAESLDRDDLLLPGYQTQLINQVASVSNGPVV 463
Query: 534 LVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDY 593
LVL+ G VDI+FAK + +I SILW GYPGE G A+A+VIFG +NPGGRLP+TW+ DY
Sbjct: 464 LVLMSAGGVDISFAKSNGDIKSILWVGYPGEEGGNAIADVIFGKYNPGGRLPLTWHEADY 523
Query: 594 IK-VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQ 652
+ +PMT M +RP + G PGRTY+F+ G V+PFG GLSY++++YK
Sbjct: 524 VDMLPMTSMPLRPIDSLGYPGRTYKFFNGSTVYPFGHGLSYTQFTYKL------------ 571
Query: 653 SSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGN 712
+ST + + Y+ +LG + F + V N G G V+++ KP +
Sbjct: 572 -TSTIRSLDIKLDKYQYCHDLGY---KNDSFKPSFEVLNAGAKDGSEVVIVYAKPPEGID 627
Query: 713 GRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPI 772
IKQ++GF+ V + A ++ FE + +SL + V+ G H +++GD+
Sbjct: 628 ATYIKQVIGFKRVFVPAGGSEKVKFEFNASKSLQVVDFNAYSVLPSGGHTIMLGDDIISF 687
Query: 773 SI 774
S+
Sbjct: 688 SV 689
>gi|225432132|ref|XP_002274591.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 1-like
[Vitis vinifera]
Length = 805
Score = 642 bits (1657), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 321/754 (42%), Positives = 470/754 (62%), Gaps = 28/754 (3%)
Query: 31 FSCDPSNPS-----TETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIP 85
+ CD S + ++F FC +L +RA+DLVSR+TL EK+ Q V++A + RLG+P
Sbjct: 47 YVCDESRYALLGLDMKSFAFCDKSLSYKERAKDLVSRMTLQEKVMQSVHTASGVRRLGLP 106
Query: 86 AYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARAL 145
Y WWSEALHG++ +G G+FF+ TI GATS P VIL+ A+F+ LW +G+ + E RA+
Sbjct: 107 EYSWWSEALHGISNLGPGVFFDETIPGATSLPTVILSTAAFNQTLWKTLGRVVSTEGRAM 166
Query: 146 YNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG--DTFNGGK 203
YN G A G+TFW+PNIN+ RD RWGR QET GEDP + G++AV+YVRG+Q T N
Sbjct: 167 YNLGHA-GLTFWSPNINVVRDTRWGRTQETSGEDPFIVGEFAVNYVRGLQDVEGTENVTD 225
Query: 204 LKGK-LQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGI 262
L + L+ S+CCKH+ AYD+D+W R+ FDARV+ QD+ +T+ PFE CV++G S +
Sbjct: 226 LNSRPLKVSSCCKHYAAYDIDSWLNVDRHTFDARVSEQDMKETFVSPFERCVREGDVSSV 285
Query: 263 MCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAK-SPEDAVVD 321
MC++N++NGIP C+D LL R +W HGYI SDC + +I D + Y S DAV
Sbjct: 286 MCSFNKINGIPPCSDPRLLKGVIRDEWDLHGYIVSDCYGLEVIVDNQNYLNDSKVDAVAK 345
Query: 322 VLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGK 381
L+AG+D+ CG + +V K+ + E+DRAL N++ + MR+G F+G P +
Sbjct: 346 TLQAGLDLECGHYYTDALNESVLTGKVSQYELDRALKNIYVLLMRVGYFDGIPAYE---S 402
Query: 382 IGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNY 441
+G +C+ H LA +AA+ GIVLLKN + +LPL K + AL+GP+AN+ + ++GNY
Sbjct: 403 LGLKDICAADHIELAREAARQGIVLLKNDYEVLPLKPGKKI--ALVGPHANATEVMIGNY 460
Query: 442 AGPSCRSITPLQALQNYVENTVYYPGCDTVACSSAS-IDKAVDIAKGADHVVLMMGLDQT 500
AG C+ ++PL+A + N Y GC +CS+ + +A + AK A+ ++ +G D +
Sbjct: 461 AGLPCKYVSPLEAFSA-IGNVTYATGCLDASCSNDTYFSEAKEAAKSAEVTIIFVGTDLS 519
Query: 501 QEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAG 560
E E +DRVD +LPG Q ELI +VAE + PVILV+L G +DITFAK + I +ILW G
Sbjct: 520 IEAEFVDRVDFLLPGNQTELIKQVAEVSSGPVILVVLSGSNIDITFAKNNPRISAILWVG 579
Query: 561 YPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK-VPMTDMKMRPQATSGNPGRTYRFY 619
+PGE G A+A+V+FG +NPGGRLP+TWY DY+ +PM+ M +RP G PGRTY+F+
Sbjct: 580 FPGEQGGHAIADVVFGKYNPGGRLPVTWYEADYVDMLPMSSMSLRPVDELGYPGRTYKFF 639
Query: 620 EGKEVFPFGCGLSYSKYSYKFKA------VSQNKLYLNQSSSTKMVENQDVVHYKSVPEL 673
+G V+PFG G+SY+K+SY + NK + + E+Q V +V
Sbjct: 640 DGSTVYPFGYGMSYTKFSYSLATSKISIDIDLNK--FQKCRTVAYTEDQKVPSCPAVLLD 697
Query: 674 GTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKA 733
+T +F + V N G + G ++++ P G IKQ++GFQ V + A +
Sbjct: 698 DMSCDDTIEF--EVAVTNVGMVDGSEVLMVYSIPPSGIVGTHIKQVIGFQKVFVAAGDTE 755
Query: 734 EIVFELSPCESLSRAREDGLMVIEEGTHFLVVGD 767
+ F ++ C+SL G ++ G+H + VGD
Sbjct: 756 RVKFSMNACKSLRIVDSTGYSLLPSGSHTIRVGD 789
>gi|6573772|gb|AAF17692.1|AC009243_19 F28K19.27 [Arabidopsis thaliana]
Length = 696
Score = 641 bits (1653), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 315/481 (65%), Positives = 387/481 (80%), Gaps = 7/481 (1%)
Query: 298 DCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRAL 357
DCDAVSIIYDA+GYAKSPEDAV DVLKAGMDVNCGS+LQKHTK+A++QKK+ E++IDRAL
Sbjct: 221 DCDAVSIIYDAQGYAKSPEDAVADVLKAGMDVNCGSYLQKHTKSALQQKKVSETDIDRAL 280
Query: 358 HNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLP 417
NLFSVR+RLGLFNG+PT P+G I + VCSPAHQ LAL AA++GIVLLKN+ LLP
Sbjct: 281 LNLFSVRIRLGLFNGDPTKLPYGNISPNEVCSPAHQALALDAARNGIVLLKNNLKLLPFS 340
Query: 418 KSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSAS 477
K SLA+IGPNA+ KTLLGNYAGP C+++TPL AL++YV+N VY+ GCD+VACS+A+
Sbjct: 341 KRSVSSLAVIGPNAHVVKTLLGNYAGPPCKTVTPLDALRSYVKNAVYHQGCDSVACSNAA 400
Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLL 537
ID+AV IAK ADHVVL+MGLDQTQEKE+ DRVDL LPG+QQELIT VA AAKKPV+LVL+
Sbjct: 401 IDQAVAIAKNADHVVLIMGLDQTQEKEDFDRVDLSLPGKQQELITSVANAAKKPVVLVLI 460
Query: 538 CGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVP 597
CGGPVDI+FA + IGSI+WAGYPGEAG +A++E+IFGDHNPGGRLP+TWYPQ ++ +
Sbjct: 461 CGGPVDISFAANNNKIGSIIWAGYPGEAGGIAISEIIFGDHNPGGRLPVTWYPQSFVNIQ 520
Query: 598 MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTK 657
MTDM+MR + +G PGRTY+FY+G +V+ FG GLSYS YSY+FK +++ LYLNQS K
Sbjct: 521 MTDMRMR--SATGYPGRTYKFYKGPKVYEFGHGLSYSAYSYRFKTLAETNLYLNQS---K 575
Query: 658 MVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNG--RP 715
N D V Y V E+G E C+ K VT+ V+N GEMAGKHPVL+F + R G R
Sbjct: 576 AQTNSDSVRYTLVSEMGKEGCDVAKTKVTVEVENQGEMAGKHPVLMFARHERGGEDGKRA 635
Query: 716 IKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIF 775
KQLVGF+S++L+ EKAE+ FE+ CE LSRA E G+MV+EEG +FL VGD E P+ +
Sbjct: 636 EKQLVGFKSIVLSNGEKAEMEFEIGLCEHLSRANEFGVMVLEEGKYFLTVGDSELPLIVN 695
Query: 776 V 776
V
Sbjct: 696 V 696
Score = 350 bits (897), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 159/204 (77%), Positives = 178/204 (87%), Gaps = 1/204 (0%)
Query: 24 VDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLG 83
V+S PP SCDPSNP+T+ + FC+T LPI +RARDLVSRLT+DEKISQLVN+AP IPRLG
Sbjct: 17 VESAPPPHSCDPSNPTTKLYQFCRTDLPIGKRARDLVSRLTIDEKISQLVNTAPGIPRLG 76
Query: 84 IPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEAR 143
+PAYEWWSEALHGVA G GI FNGT++ ATSFPQVILTAASFDSY W+RI Q IG EAR
Sbjct: 77 VPAYEWWSEALHGVAYAGPGIRFNGTVKAATSFPQVILTAASFDSYEWFRIAQVIGKEAR 136
Query: 144 ALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGK 203
+YNAGQA GMTFWAPNINIFRDPRWGRGQETPGEDP++TG YAV+YVRG+QGD+F+G K
Sbjct: 137 GVYNAGQANGMTFWAPNINIFRDPRWGRGQETPGEDPMMTGTYAVAYVRGLQGDSFDGRK 196
Query: 204 -LKGKLQASACCKHFTAYDLDNWK 226
L LQASACCKHFTAYDLD WK
Sbjct: 197 TLSNHLQASACCKHFTAYDLDRWK 220
>gi|32488698|emb|CAE03635.1| OSJNBb0003B01.27 [Oryza sativa Japonica Group]
Length = 839
Score = 635 bits (1638), Expect = e-179, Method: Compositional matrix adjust.
Identities = 325/653 (49%), Positives = 433/653 (66%), Gaps = 19/653 (2%)
Query: 128 SYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYA 187
S ++ I + EARA++N G A G+TFW+PNINIFRDPRWGRGQETPGEDPL+ KYA
Sbjct: 202 SMMYNLIVLVVSTEARAMHNVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYA 260
Query: 188 VSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQ 247
V YV G+Q + G L+ +ACCKH+TAYD+DNWKG RY FDA V+ QDL DT+Q
Sbjct: 261 VGYVTGLQ----DAGGGSDALKVAACCKHYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQ 316
Query: 248 PPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYD 307
PPF+SCV G + +MC+YN+VNG P+CAD++LLS R W +GYI SDCD+V ++Y+
Sbjct: 317 PPFKSCVIDGNVASVMCSYNKVNGKPTCADKDLLSGVIRGDWKLNGYIVSDCDSVDVLYN 376
Query: 308 AEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRL 367
+ Y K+PEDA +K+G+D+NCG+FL +HT AAV+ KL ES++DRA+ N F V MRL
Sbjct: 377 NQHYTKNPEDAAAITIKSGLDLNCGNFLAQHTVAAVQAGKLSESDVDRAITNNFIVLMRL 436
Query: 368 GLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALI 427
G F+G+P PFG +G VC+ ++Q LA +AA+ GIVLLKN+ G LPL S+A+I
Sbjct: 437 GFFDGDPRKLPFGSLGPKDVCTSSNQELAREAARQGIVLLKNT-GALPLSAKSIKSMAVI 495
Query: 428 GPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASID--KAVDIA 485
GPNAN++ T++GNY G C+ TPLQ L V TVY PGC V CS S+ A A
Sbjct: 496 GPNANASFTMIGNYEGTPCKYTTPLQGLGANVA-TVYQPGCTNVGCSGNSLQLSAATQAA 554
Query: 486 KGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
AD VL++G DQ+ E+E LDR L+LPG+Q +L++ VA A++ PVILV++ GGP DI+
Sbjct: 555 ASADVTVLVVGADQSVERESLDRTSLLLPGQQPQLVSAVANASRGPVILVVMSGGPFDIS 614
Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMR 604
FAK I +ILW GYPGEAG ALA+++FG HNPGGRLP+TWYP + KV MTDM+MR
Sbjct: 615 FAKSSDKISAILWVGYPGEAGGAALADILFGYHNPGGRLPVTWYPASFADKVSMTDMRMR 674
Query: 605 PQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKF-KAVSQNKLYLNQSSSTKMVENQD 663
P +++G PGRTYRFY G V+ FG GLSY+K+++ A Q + L + +
Sbjct: 675 PDSSTGYPGRTYRFYTGDTVYAFGDGLSYTKFAHSLVSAPEQVAVQLAEGHACH------ 728
Query: 664 VVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQ 723
H SV G E C + F V + V+N G MAG H V LF P + P K L+GF+
Sbjct: 729 TEHCFSVEAAG-EHCGSLSFDVHLRVRNAGGMAGGHTVFLFSSPPSV-HSAPAKHLLGFE 786
Query: 724 SVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
V L + + F++ C+ LS E G + G+H L VGD ++ +++ V
Sbjct: 787 KVSLEPGQAGVVAFKVDVCKDLSVVDELGNRKVALGSHTLHVGDLKHTLNLRV 839
Score = 127 bits (318), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 62/108 (57%), Positives = 76/108 (70%)
Query: 29 PPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYE 88
P F+CD SN + + FC T + RA DL+ RLTL EK+ LVN A+PRLGIPAYE
Sbjct: 29 PVFACDASNATVSGYGFCDRTKSSAARAADLLGRLTLAEKVGFLVNKQAALPRLGIPAYE 88
Query: 89 WWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQ 136
WWSEALHGV+ VG G F+ + GATSFPQ ILTAASF++ L+ IG+
Sbjct: 89 WWSEALHGVSYVGPGTRFSTLVPGATSFPQPILTAASFNASLFRAIGE 136
>gi|225432134|ref|XP_002274619.1| PREDICTED: probable beta-D-xylosidase 5-like [Vitis vinifera]
Length = 805
Score = 635 bits (1638), Expect = e-179, Method: Compositional matrix adjust.
Identities = 321/742 (43%), Positives = 457/742 (61%), Gaps = 19/742 (2%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
F +C ++LP R +DLV R+TL+EK +++ A +PR+G+P Y+WWSEALHGVA VG
Sbjct: 65 FVYCDSSLPYDVRVKDLVDRITLEEKARNVIDVASGVPRIGLPPYKWWSEALHGVANVGS 124
Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
FF+ + GATSFP VIL+AASF+ LW +GQ + EARA+YN G A G+TFW+PNIN
Sbjct: 125 ATFFDEVVPGATSFPNVILSAASFNQSLWKTLGQVVSTEARAMYNLGHA-GLTFWSPNIN 183
Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG--DTFNGGKLKGK-LQASACCKHFTA 219
+ RDPRWGR ETPGEDPL G Y V+YVRG+Q T N L + L+ ++ CKHF A
Sbjct: 184 VARDPRWGRILETPGEDPLTVGVYGVNYVRGLQDIEGTENTTDLNSRPLKIASSCKHFAA 243
Query: 220 YDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRN 279
YDLD W R FDA+V+ QD+ +T+ PFE CVK+G S +MC++N +NGIP CAD
Sbjct: 244 YDLDQWFNVDRRHFDAKVSEQDMTETFLRPFEMCVKEGDTSSVMCSFNNINGIPPCADPR 303
Query: 280 LLSKTARRQWGFHGYITSDCDAV-SIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKH 338
L R QW HGYI SDC A+ +I+ D + + E+ V +KAG+D+ CG +
Sbjct: 304 FLKGVIREQWNLHGYIVSDCWAIDTIVQDQKFLDVTSEEGVALSMKAGLDLECGHYYNDS 363
Query: 339 TKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQ 398
AV++ ++ E ++D++L L+ V MR+G F+G P++ GK +C+ H LA +
Sbjct: 364 LATAVREGRVSEHDVDKSLSYLYVVLMRVGFFDGIPSLASLGKKD---ICNDEHIELARE 420
Query: 399 AAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNY 458
AA+ GIVLLKN + LPL K LAL+GP+AN+ ++GNYAG C ++PL A
Sbjct: 421 AARQGIVLLKNDNATLPLKPVKK--LALVGPHANATVAMIGNYAGIPCHYVSPLDAFSE- 477
Query: 459 VENTVYYPGCDTVAC-SSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQ 517
+ + Y GC V C + + KA + AK AD ++++G D + E EE DR DL+LPG Q
Sbjct: 478 LGDVTYEVGCADVKCHNDTHVYKAAEAAKNADATIILVGTDLSIEAEERDREDLLLPGYQ 537
Query: 518 QELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGD 577
E++ +V + + PVILV++CGGP+DI+FAK + I +ILWAG+PGE G A+A+++FG
Sbjct: 538 TEMVNQVTDLSTGPVILVVMCGGPIDISFAKNNPKIAAILWAGFPGEQGGNAIADIVFGK 597
Query: 578 HNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKY 636
+NPGGR P+TWY Y+ +PMT M +RP + G PGRTY+F+ G V+PFG GLSY+ +
Sbjct: 598 YNPGGRSPITWYENGYVGMLPMTSMALRPIESLGYPGRTYKFFNGSTVYPFGYGLSYTNF 657
Query: 637 SYKFKA----VSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNH 692
SY A V + L Q S M + D + L + F + VKN
Sbjct: 658 SYSLTAPTRSVHISLTRLQQCRS--MAYSSDSFQPECSAVLVDDLSCDESFEFQVAVKNV 715
Query: 693 GEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDG 752
G M G V+++ P G IKQ++GF+ V + ++ F ++ C+SL G
Sbjct: 716 GSMDGSEVVMVYSSPPSGIVGTHIKQVIGFERVFVKVGNTEKVKFSMNVCKSLGLVDSSG 775
Query: 753 LMVIEEGTHFLVVGDEEYPISI 774
+++ G+H ++ GD +S
Sbjct: 776 YILLPSGSHTIMAGDNSTSVSF 797
>gi|348667575|gb|EGZ07400.1| xylosidase [Phytophthora sojae]
Length = 751
Score = 629 bits (1623), Expect = e-177, Method: Compositional matrix adjust.
Identities = 328/750 (43%), Positives = 451/750 (60%), Gaps = 56/750 (7%)
Query: 33 CDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSE 92
CD +P + PFC +LPI R DLV+R+ L++ + LVN A A P + +P+YEWW+E
Sbjct: 24 CD--DPKVSSLPFCDGSLPIDARVSDLVNRIPLEQAVGLLVNKASAAPSVNVPSYEWWNE 81
Query: 93 ALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI 152
ALHGVA + G+ F G + ATSFPQV+ TAASF+ L+Y+I +AI EARA YN A
Sbjct: 82 ALHGVA-LSPGVTFKGPLTAATSFPQVLSTAASFNRTLFYQIAEAISTEARAFYNEKNA- 139
Query: 153 GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGK---LQ 209
G+TFW PN+NIFRDPRWGRGQETPGEDP +TG+YAV++VRG+QG+ G + K L+
Sbjct: 140 GLTFWTPNVNIFRDPRWGRGQETPGEDPYLTGEYAVAFVRGLQGEAMEGHENKDDNKFLK 199
Query: 210 ASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRV 269
S+CCKHF+AY + R++ DA VT QD ADTY P FE CVK+G S IMC+YN V
Sbjct: 200 ISSCCKHFSAYSQE----VPRHRNDAIVTKQDQADTYFPAFEDCVKRGHVSSIMCSYNAV 255
Query: 270 NGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDV 329
NGIPSCAD+ LL+ R QW F GYITSDC+AV+ + + +SPE L AGMD+
Sbjct: 256 NGIPSCADKGLLTDLVRNQWKFDGYITSDCEAVADVIYRHHFTQSPEQTCATTLDAGMDL 315
Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
NCG FL++H +A++Q + + AL N F V MRLG+F QPF I D V +
Sbjct: 316 NCGEFLRQHLSSAIEQGIVSTEMVHNALKNQFRVMMRLGMFEKG--TQPFSNITKDAVDT 373
Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPK---SKSVSLALIGPNANSAKTLLGNYAGPSC 446
AH+ LAL+AA+ +VLLKN LPL SK SLALIGP+ N++ LLGNY G
Sbjct: 374 AAHRQLALEAARQSVVLLKNEDNTLPLATDVFSKDGSLALIGPHFNASTALLGNYFGIPS 433
Query: 447 RSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL 506
+TPL+ + +YV N Y GC D+A+++ K AD VV+ MGLDQ+QE+EE+
Sbjct: 434 HIVTPLKGVSSYVPNVAYSLGCKVSGEVLPDFDEAIEVVKKADRVVVFMGLDQSQEREEI 493
Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
DR L LPG Q L+ R+ AA P++LVL+ GG VD++ K +G+I++ GY G+AG
Sbjct: 494 DRYHLKLPGFQIALLNRILAAASHPIVLVLISGGSVDLSLYKNHPKVGAIVFGGYLGQAG 553
Query: 567 AVALAEVIFGDHNPGGRLPMTWYPQDYIK-VPMTDMKMRPQATSGNPGRTYRFYEGKEVF 625
ALA+++FG ++P GRL T+Y DY+ +P+ DM MRP +GNPGRTYRF+ G V+
Sbjct: 554 GQALADMLFGKYSPAGRLTQTFYDSDYVNTMPIYDMHMRPTFVTGNPGRTYRFFSGAPVY 613
Query: 626 PFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLV 685
FG GLSY+ + + C +
Sbjct: 614 EFGFGLSYTTFH--------------------------------------KACRSCVASF 635
Query: 686 TIGVKNHGEMAGKHPVLLFVKPARRGN-GRPIKQLVGFQSVILNAKEKAEIVFELSPCES 744
I V N G++ G+ +L++ +P G GRP++ LV F+ L K ++
Sbjct: 636 EITVTNLGDVEGEDAILIYAEPPHAGEGGRPLRSLVAFERTALVTTGKTATADFCLEAKA 695
Query: 745 LSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
+ A +G V+E+G + V ++ +++
Sbjct: 696 FALANAEGSWVVEQGNWTIHVDTLQHRVNV 725
>gi|223945397|gb|ACN26782.1| unknown [Zea mays]
Length = 516
Score = 629 bits (1623), Expect = e-177, Method: Compositional matrix adjust.
Identities = 306/518 (59%), Positives = 387/518 (74%), Gaps = 8/518 (1%)
Query: 263 MCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDV 322
MC+YNRVNG+P+CAD NLLS TAR+ WGF+GYITSDCDAV+II+DA+GYAK+ EDAV DV
Sbjct: 1 MCSYNRVNGVPTCADYNLLSTTARQDWGFYGYITSDCDAVAIIHDAQGYAKTAEDAVADV 60
Query: 323 LKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKI 382
LKAGMDVNCGS++Q H +A++Q K+ E +I+RALHNLF+VRMRLGLFNG+P +G I
Sbjct: 61 LKAGMDVNCGSYVQDHGASALQQGKITEQDINRALHNLFAVRMRLGLFNGDPRRNLYGDI 120
Query: 383 GADVVCSPAHQVLALQAAQDGIVLLKNS--HGLLPLPKSKSVSLALIGPNANSAKTLLGN 440
G D VC+ HQ LAL+AAQDGIVLLKN G LPL K SLA+IG NAN A L GN
Sbjct: 121 GPDQVCTQEHQDLALEAAQDGIVLLKNDGGAGALPLSKPNVASLAVIGFNANDAIRLRGN 180
Query: 441 YAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQT 500
Y GP C ++TPLQ LQ YV++T + GC++ AC+ +I +AV A AD VVL MGLDQ
Sbjct: 181 YFGPPCVTVTPLQVLQGYVKDTSFVAGCNSAACNVTTIPEAVQAASSADSVVLFMGLDQD 240
Query: 501 QEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAG 560
QE+EE+DR+DL LPG+QQ LI VA AAKKPVILVLLCGGPVD++FAK + IG+ILWAG
Sbjct: 241 QEREEVDRLDLTLPGQQQTLIESVANAAKKPVILVLLCGGPVDVSFAKTNPKIGAILWAG 300
Query: 561 YPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYE 620
YPGEAG +A+A+V+FG+HNPGGRLP+TWYPQD+ +VPMTDM+MR +G PGRTYRFY
Sbjct: 301 YPGEAGGIAIAQVLFGEHNPGGRLPVTWYPQDFTRVPMTDMRMRADPATGYPGRTYRFYR 360
Query: 621 GKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVE-NQDVVHYKSVPELGTEFCE 679
G VF FG GLSYSKYS++F + + + K VE + V +G+E C+
Sbjct: 361 GPTVFNFGYGLSYSKYSHRF---ATKPPPTSNVAGLKAVEATAGGMASYDVEAIGSETCD 417
Query: 680 TRKFLVTIGVKNHGEMAGKHPVLLFVK--PARRGNGRPIKQLVGFQSVILNAKEKAEIVF 737
KF + V+NHG M GKH VL+F++ A G+GRP QL+GFQS+ L A + A + F
Sbjct: 418 RLKFPAVVRVQNHGPMDGKHSVLVFMRWPNATDGSGRPASQLIGFQSLHLRATQTAHVEF 477
Query: 738 ELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIF 775
E+SPC+ SRA EDG VI++G+HF++VG++E+ +S
Sbjct: 478 EVSPCKHFSRATEDGRKVIDQGSHFVMVGEDEFEMSFM 515
>gi|297736787|emb|CBI25988.3| unnamed protein product [Vitis vinifera]
Length = 774
Score = 627 bits (1617), Expect = e-177, Method: Compositional matrix adjust.
Identities = 316/738 (42%), Positives = 451/738 (61%), Gaps = 42/738 (5%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
F +C ++LP R +DLV R+TL+EK +++ A +PR+G+P Y+WWSEALHGVA VG
Sbjct: 65 FVYCDSSLPYDVRVKDLVDRITLEEKARNVIDVASGVPRIGLPPYKWWSEALHGVANVGS 124
Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
FF+ + GATSFP VIL+AASF+ LW +GQ + EARA+YN G A G+TFW+PNIN
Sbjct: 125 ATFFDEVVPGATSFPNVILSAASFNQSLWKTLGQVVSTEARAMYNLGHA-GLTFWSPNIN 183
Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG--DTFNGGKLKGK-LQASACCKHFTA 219
+ RDPRWGR ETPGEDPL G Y V+YVRG+Q T N L + L+ ++ CKHF A
Sbjct: 184 VARDPRWGRILETPGEDPLTVGVYGVNYVRGLQDIEGTENTTDLNSRPLKIASSCKHFAA 243
Query: 220 YDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRN 279
YDLD W R FDA+V+ QD+ +T+ PFE CVK+G S +MC++N +NGIP CAD
Sbjct: 244 YDLDQWFNVDRRHFDAKVSEQDMTETFLRPFEMCVKEGDTSSVMCSFNNINGIPPCADPR 303
Query: 280 LLSKTARRQWGFHGYITSDCDAV-SIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKH 338
L R QW HGYI SDC A+ +I+ D + + E+ V +KAG+D+ CG +
Sbjct: 304 FLKGVIREQWNLHGYIVSDCWAIDTIVQDQKFLDVTSEEGVALSMKAGLDLECGHYYNDS 363
Query: 339 TKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQ 398
AV++ ++ E ++D++L L+ V MR+G F+G P++ GK +C+ H LA +
Sbjct: 364 LATAVREGRVSEHDVDKSLSYLYVVLMRVGFFDGIPSLASLGKKD---ICNDEHIELARE 420
Query: 399 AAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNY 458
AA+ GIVLLKN + LPL K LAL+GP+AN+ ++GNYAG C ++PL A
Sbjct: 421 AARQGIVLLKNDNATLPLKPVKK--LALVGPHANATVAMIGNYAGIPCHYVSPLDAFSE- 477
Query: 459 VENTVYYPGCDTVAC-SSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQ 517
+ + Y GC V C + + KA + AK AD ++++G D + E EE DR DL+LPG Q
Sbjct: 478 LGDVTYEVGCADVKCHNDTHVYKAAEAAKNADATIILVGTDLSIEAEERDREDLLLPGYQ 537
Query: 518 QELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGD 577
E++ +V + + PVILV++CGGP+DI+FAK + I +ILWAG+PGE G A+A+++FG
Sbjct: 538 TEMVNQVTDLSTGPVILVVMCGGPIDISFAKNNPKIAAILWAGFPGEQGGNAIADIVFGK 597
Query: 578 HNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKY 636
+NPGGR P+TWY Y+ +PMT M +RP + G PGRTY+F+ G V+PFG GLSY+ +
Sbjct: 598 YNPGGRSPITWYENGYVGMLPMTSMALRPIESLGYPGRTYKFFNGSTVYPFGYGLSYTNF 657
Query: 637 SYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMA 696
SY A +++ VH F + VKN G M
Sbjct: 658 SYSLTAPTRS------------------VH-----------ISLTSFEFQVAVKNVGSMD 688
Query: 697 GKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVI 756
G V+++ P G IKQ++GF+ V + ++ F ++ C+SL G +++
Sbjct: 689 GSEVVMVYSSPPSGIVGTHIKQVIGFERVFVKVGNTEKVKFSMNVCKSLGLVDSSGYILL 748
Query: 757 EEGTHFLVVGDEEYPISI 774
G+H ++ GD +S
Sbjct: 749 PSGSHTIMAGDNSTSVSF 766
>gi|297736788|emb|CBI25989.3| unnamed protein product [Vitis vinifera]
Length = 746
Score = 624 bits (1609), Expect = e-176, Method: Compositional matrix adjust.
Identities = 322/747 (43%), Positives = 452/747 (60%), Gaps = 73/747 (9%)
Query: 31 FSCDPSNPST-----ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIP 85
+ CD S + + F +C ++ P RA+DLV R+TL EK+ Q + A + R+G+P
Sbjct: 51 YVCDESRFAALGLDMKDFHYCDSSSPYEVRAKDLVDRMTLSEKVMQTGDQASGVERIGLP 110
Query: 86 AYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARAL 145
Y WWSEALHGV+ G+ +FF+ + GATSFP VIL+AASF+ LW +GQA+ EARA+
Sbjct: 111 KYNWWSEALHGVSNFGRCVFFDEVVPGATSFPTVILSAASFNQSLWKTLGQAVSTEARAM 170
Query: 146 YNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLK 205
YN+G A G+TFW+PNIN+ RDPRWGR ETPGEDP + G YAV+Y
Sbjct: 171 YNSGNA-GLTFWSPNINVVRDPRWGRILETPGEDPHLVGLYAVNY--------------- 214
Query: 206 GKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCA 265
H+ AYDLDNWKG R FDARV++QD+A+T+ PFE CVK+G S +MC+
Sbjct: 215 ----------HYAAYDLDNWKGADRVHFDARVSVQDMAETFVLPFEMCVKEGDVSSVMCS 264
Query: 266 YNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSII-YDAEGYAKSPEDAVVDVLK 324
YN++NGIPSCAD LL +T R +W HGYI SDCD+V ++ D + S D+ L
Sbjct: 265 YNKINGIPSCADSRLLKQTIRGEWDLHGYIVSDCDSVEVMAVDQKWLDSSFSDSAAQALN 324
Query: 325 AGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGA 384
AGM+++CG+F + AV Q K ++++D +L L+ + MR+G F+G P F +G
Sbjct: 325 AGMNLDCGTFNNRSLTEAVNQGKANQADLDHSLRYLYVLLMRVGFFDGIPA---FASLGK 381
Query: 385 DVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGP 444
D +CS H LA +AA+ GIVLLKN + LPL K++ AL+GP+AN+ ++GNYAG
Sbjct: 382 DDICSAEHIELAREAARQGIVLLKNDNATLPLKSVKNI--ALVGPHANATDAMIGNYAGI 439
Query: 445 SCRSITPLQALQNYVENTVYYPGCDTVACSSAS-IDKAVDIAKGADHVVLMMGLDQTQEK 503
C ++PL A + E Y GC V C + + I A++ AK AD ++ G D + E
Sbjct: 440 PCYYVSPLDAFSSMGE-VRYEKGCADVQCLNETYIFNAMEAAKRADATIIFAGTDLSIEA 498
Query: 504 EELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPG 563
E LDRVDL+LPG Q +LI +VA+ + PV+LV++ GG VDI+FA+ + I +ILWAGYPG
Sbjct: 499 EALDRVDLLLPGYQTQLINQVADLSTGPVVLVIMSGGGVDISFARDNPKIAAILWAGYPG 558
Query: 564 EAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK-VPMTDMKMRPQATSGNPGRTYRFYEGK 622
E G A+A+VI G +NPGGRLP+TWY DY+ +PMT M +RP + G PGRTY+F+ G
Sbjct: 559 EQGGNAIADVILGKYNPGGRLPITWYEADYVDMLPMTSMALRPVDSLGYPGRTYKFFNGS 618
Query: 623 EVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRK 682
V+PFG G+SY+ +SY L+ S S K E+ +
Sbjct: 619 TVYPFGYGMSYTNFSYS----------LSTSQSCK---------------------ESIE 647
Query: 683 FLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPC 742
F + VKN G M G V+++ P G IK++VGF+ V + ++ F ++ C
Sbjct: 648 F--EVAVKNVGRMDGSEVVVVYSSPPLGIAGTHIKKVVGFERVFVKVGGTEKVKFSMNVC 705
Query: 743 ESLSRAREDGLMVIEEGTHFLVVGDEE 769
+SL G ++ G+H + VG +
Sbjct: 706 KSLGIVDSTGYALLPSGSHTIKVGGDN 732
>gi|85813774|emb|CAJ65923.1| xylan 1,4-beta-xylosidase [Populus tremula x Populus alba]
Length = 704
Score = 620 bits (1600), Expect = e-175, Method: Compositional matrix adjust.
Identities = 300/493 (60%), Positives = 385/493 (78%), Gaps = 9/493 (1%)
Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQK 346
+ + H DCDAV++++ + YAK+PEDAV D LK+G+ S+L+ +TK+AV++K
Sbjct: 218 KHYTAHDLDNWDCDAVNVLHVEQKYAKTPEDAVADALKSGI-----SYLRNYTKSAVEKK 272
Query: 347 KLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVL 406
K+ SEIDRALHNLFS RMRLGLFNG+PT Q + IG D VCS HQ LAL+AA DGIVL
Sbjct: 273 KVTVSEIDRALHNLFSTRMRLGLFNGDPTKQLYSDIGPDQVCSQEHQALALEAALDGIVL 332
Query: 407 LKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYP 466
LKN+ LLPL KS SLA+IGPNA+++ LLGNY GP+C+++T L+ L+NYV + Y
Sbjct: 333 LKNADRLLPLSKSGISSLAVIGPNAHNSTNLLGNYFGPACKNVTILEGLRNYVSSASYEK 392
Query: 467 GCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAE 526
GC+ V+C+SA+ K V++A+ D V+L+MGLDQ+QEKE LDR+DLVLPG+Q LIT VA+
Sbjct: 393 GCNNVSCTSAAKKKPVEMAQTEDQVILVMGLDQSQEKERLDRMDLVLPGKQPTLITAVAK 452
Query: 527 AAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP---GGR 583
AAK+P++LVLL G P+D+TFAK +R IGSILWAGYPG+AGA ALA++IFG+HNP GGR
Sbjct: 453 AAKRPIVLVLLGGSPMDVTFAKNNRKIGSILWAGYPGQAGATALAQIIFGEHNPGNAGGR 512
Query: 584 LPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAV 643
LPMTWYPQD+ KVPMTDM+MRPQ ++GNPGRTYRFYEG++VF FG GLSYS YSY F +V
Sbjct: 513 LPMTWYPQDFTKVPMTDMRMRPQPSTGNPGRTYRFYEGEKVFEFGYGLSYSDYSYTFASV 572
Query: 644 SQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLL 703
+QN+L + + SS + EN + YK V ++G E CE KF VT+ VKN G+MAGKHPVLL
Sbjct: 573 AQNQLNV-KDSSNQQPENSETPGYKLVSDIGEEQCENIKFKVTVSVKNEGQMAGKHPVLL 631
Query: 704 FVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFL 763
F + A+ G GRPIK+LVGFQ+V L A EK EI +ELSPCE LS A EDG+MV+EEG+ L
Sbjct: 632 FARHAKPGKGRPIKKLVGFQTVKLGAGEKTEIEYELSPCEHLSSANEDGVMVMEEGSQIL 691
Query: 764 VVGDEEYPISIFV 776
+VGD+E+P++I V
Sbjct: 692 LVGDKEHPVTIIV 704
Score = 324 bits (830), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 151/208 (72%), Positives = 176/208 (84%), Gaps = 6/208 (2%)
Query: 23 RVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRL 82
R ++QPP+SCD S+PST+T+ FCKTTLPIS+RA DLVSRLT +EK +QLV+++PAIPRL
Sbjct: 22 RKQASQPPYSCDSSDPSTKTYDFCKTTLPISRRAEDLVSRLTFEEKATQLVDTSPAIPRL 81
Query: 83 GIPAYEWWSEALHGVA---GVGKGI-FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAI 138
GIPAYEWWSE LHG+ V +GI FFN TI+ ATSFPQVILTAASFD+++WYRIGQ +
Sbjct: 82 GIPAYEWWSEGLHGIGFLTRVQQGISFFNRTIQHATSFPQVILTAASFDAHIWYRIGQ-V 140
Query: 139 GLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDT 198
G EARALYNAGQ G+ FWAPN+NIFRDPRWGRGQETPGEDPLV GKY S+VRGVQGD+
Sbjct: 141 GKEARALYNAGQVTGLGFWAPNVNIFRDPRWGRGQETPGEDPLVVGKYGASFVRGVQGDS 200
Query: 199 FNG-GKLKGKLQASACCKHFTAYDLDNW 225
F G L LQASACCKH+TA+DLDNW
Sbjct: 201 FEGESTLGDHLQASACCKHYTAHDLDNW 228
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 17/24 (70%), Positives = 20/24 (83%)
Query: 695 MAGKHPVLLFVKPARRGNGRPIKQ 718
MAGKHPVLLF + ++GNGRP KQ
Sbjct: 1 MAGKHPVLLFARQTKQGNGRPRKQ 24
>gi|326431595|gb|EGD77165.1| beta-glucosidase [Salpingoeca sp. ATCC 50818]
Length = 900
Score = 620 bits (1598), Expect = e-174, Method: Compositional matrix adjust.
Identities = 334/759 (44%), Positives = 468/759 (61%), Gaps = 31/759 (4%)
Query: 24 VDSTQP-PFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRL 82
+D+ P P +CD ++ PFC T L R RDL+SR+ + LVNSA + L
Sbjct: 164 LDAGSPTPRTCDVE--PGKSLPFCNTALSYDDRIRDLISRINDSDLPGLLVNSATGVEHL 221
Query: 83 GIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEA 142
+PAY+WWSEALHGV G G+ F G + ATSFPQVI T A+F+ L+ +IG I EA
Sbjct: 222 NLPAYQWWSEALHGV-GHSPGVHFGGDVPAATSFPQVIHTGATFNKTLYRKIGTVISTEA 280
Query: 143 RALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGG 202
RA+ N Q G TFWAPNINI RDPRWGRGQETPGEDP TG+YA ++V G F G
Sbjct: 281 RAMNNV-QRAGNTFWAPNINIIRDPRWGRGQETPGEDPFATGEYAANFVSG-----FQDG 334
Query: 203 KLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGI 262
+ ++AS+CCKHF Y+L+NW G R+ ++A T QD+ADTY P FE+CV+ GRASG+
Sbjct: 335 EDMNYIKASSCCKHFFDYNLENWHGVDRHHYNAIATDQDIADTYLPSFEACVRYGRASGL 394
Query: 263 MCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDV 322
MC+YN VNG+PSCA+ ++++ AR WGF GYITSDC AV+ + ++ + ++ + + V
Sbjct: 395 MCSYNAVNGVPSCANGDIMTVMARESWGFDGYITSDCGAVADVLNSHKFTRNTSETIRAV 454
Query: 323 LKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKI 382
L+AGMD +CGSF+Q++ A+++ +P ++ ALH LF V+ RLGLF+ + QP+
Sbjct: 455 LEAGMDTDCGSFVQQYLAKAMQEGVVPRELVNTALHRLFMVQFRLGLFD-PVSKQPYTNY 513
Query: 383 GADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA 442
V +PA+Q LAL+AAQ GIVLLKN++ LPL + +ALIGPNA++ + GNY
Sbjct: 514 SVARVNTPANQQLALEAAQQGIVLLKNTNARLPL--KTGLHVALIGPNADATTVMQGNYQ 571
Query: 443 GPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQE 502
G + I+P++ +NY Y GCD ++ D AV AK AD VV+++GLDQ QE
Sbjct: 572 GTAPFLISPVRGFKNYSAAVTYAKGCDVACKDTSGFDAAVAAAKEADAVVVVVGLDQGQE 631
Query: 503 KEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYP 562
E DR + LPG Q++L+ +VA AAK P+++ ++ GG VD++ K ++N+ ILW GYP
Sbjct: 632 SEGHDRTSITLPGHQEDLVAQVAAAAKSPIVVFVMTGGAVDLSTIKANKNVAGILWCGYP 691
Query: 563 GEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKV-PMTDMKMRPQATSGNPGRTYRFYEG 621
G++G A+A+V+FG +PGGRLP T YP Y+ M D MRP TSGNPGRTYRFY G
Sbjct: 692 GQSGGQAMADVVFGAVSPGGRLPYTIYPGSYVDACSMLDNGMRPNKTSGNPGRTYRFYTG 751
Query: 622 KEVFPFGCGLSYSKYSYKFKAVS--QNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCE 679
K V+ +G GLSY+ +SY ++ L Q+ +N + Y + PE
Sbjct: 752 KPVYEYGTGLSYTSFSYHIHYLNTMDTSLATVQTYVQDAKQNHKFIRYDA-PEF------ 804
Query: 680 TRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGN-GRPIKQLVGFQSVILNAKEKAEIVFE 738
TR V + V N G +AG V +FV+P G PIK L+GF+ V LN + + F
Sbjct: 805 TR---VEVNVTNVGRVAGADVVQVFVEPKTPAELGAPIKTLIGFERVFLNPGQWTIVQFS 861
Query: 739 LSPCESLSRAREDGLMVIEEGTHFLVVGDEE---YPISI 774
++ L+ G V G + +G + +P+ +
Sbjct: 862 VN-AHDLTFVDASGKRVARAGEWLVHIGHDSRLTFPVHV 899
>gi|297736786|emb|CBI25987.3| unnamed protein product [Vitis vinifera]
Length = 745
Score = 619 bits (1597), Expect = e-174, Method: Compositional matrix adjust.
Identities = 308/744 (41%), Positives = 452/744 (60%), Gaps = 68/744 (9%)
Query: 31 FSCDPSNPS-----TETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIP 85
+ CD S + ++F FC +L +RA+DLVSR+TL EK+ Q V++A + RLG+P
Sbjct: 47 YVCDESRYALLGLDMKSFAFCDKSLSYKERAKDLVSRMTLQEKVMQSVHTASGVRRLGLP 106
Query: 86 AYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARAL 145
Y WWSEALHG++ +G G+FF+ TI GATS P VIL+ A+F+ LW +G+ + E RA+
Sbjct: 107 EYSWWSEALHGISNLGPGVFFDETIPGATSLPTVILSTAAFNQTLWKTLGRVVSTEGRAM 166
Query: 146 YNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLK 205
YN G A G+TFW+PNIN+ RD RWGR QET GEDP + G++AV+YVRG+Q ++
Sbjct: 167 YNLGHA-GLTFWSPNINVVRDTRWGRTQETSGEDPFIVGEFAVNYVRGLQ-------DVE 218
Query: 206 GKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCA 265
G S+CCKH+ AYD+D+W R+ FDARV+ QD+ +T+ PFE CV++G S +MC+
Sbjct: 219 GTENVSSCCKHYAAYDIDSWLNVDRHTFDARVSEQDMKETFVSPFERCVREGDVSSVMCS 278
Query: 266 YNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAK-SPEDAVVDVLK 324
+N++NGIP C+D LL R +W HGYI SDC + +I D + Y S DAV L+
Sbjct: 279 FNKINGIPPCSDPRLLKGVIRDEWDLHGYIVSDCYGLEVIVDNQNYLNDSKVDAVAKTLQ 338
Query: 325 AGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGA 384
AG+D+ CG + +V K+ + E+DRAL N++ + MR+G F+G P + +G
Sbjct: 339 AGLDLECGHYYTDALNESVLTGKVSQYELDRALKNIYVLLMRVGYFDGIPAYE---SLGL 395
Query: 385 DVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGP 444
+C+ H LA +AA+ GIVLLKN + +LPL K + AL+GP+AN+ + ++GNYAG
Sbjct: 396 KDICAADHIELAREAARQGIVLLKNDYEVLPLKPGKKI--ALVGPHANATEVMIGNYAGL 453
Query: 445 SCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKE 504
C+ ++PL+A + N Y G ++ +G D + E E
Sbjct: 454 PCKYVSPLEAFSA-IGNVTYATG----------------------FTIIFVGTDLSIEAE 490
Query: 505 ELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGE 564
+DRVD +LPG Q ELI +VAE + PVILV+L G +DITFAK + I +ILW G+PGE
Sbjct: 491 FVDRVDFLLPGNQTELIKQVAEVSSGPVILVVLSGSNIDITFAKNNPRISAILWVGFPGE 550
Query: 565 AGAVALAEVIFGDHNPGGRLPMTWYPQDYIK-VPMTDMKMRPQATSGNPGRTYRFYEGKE 623
G A+A+V+FG +NPGGRLP+TWY DY+ +PM+ M +RP G PGRTY+F++G
Sbjct: 551 QGGHAIADVVFGKYNPGGRLPVTWYEADYVDMLPMSSMSLRPVDELGYPGRTYKFFDGST 610
Query: 624 VFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKF 683
V+PFG G+SY+K+SY +++K+ + D+ +F + R F
Sbjct: 611 VYPFGYGMSYTKFSYSL-------------ATSKISIDIDL----------NKFQKCRTF 647
Query: 684 LVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCE 743
+ V N G + G ++++ P G IKQ++GFQ V + A + + F ++ C+
Sbjct: 648 --EVAVTNVGMVDGSEVLMVYSIPPSGIVGTHIKQVIGFQKVFVAAGDTERVKFSMNACK 705
Query: 744 SLSRAREDGLMVIEEGTHFLVVGD 767
SL G ++ G+H + VGD
Sbjct: 706 SLRIVDSTGYSLLPSGSHTIRVGD 729
>gi|357153280|ref|XP_003576399.1| PREDICTED: probable beta-D-xylosidase 2-like [Brachypodium
distachyon]
Length = 807
Score = 615 bits (1586), Expect = e-173, Method: Compositional matrix adjust.
Identities = 337/787 (42%), Positives = 466/787 (59%), Gaps = 98/787 (12%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVG- 101
+ +C LP R RDL+ +T++EK+S L + A PR+G+P Y+WWSEALHG++ G
Sbjct: 60 YRYCDAKLPYGDRVRDLIGWMTVEEKVSNLGDWAAGAPRVGLPPYKWWSEALHGLSSTGP 119
Query: 102 ---------------KGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY 146
+ FNGT+ F VI +AASF+ LW IGQAI EARA+Y
Sbjct: 120 TTKFDDLKKPRLHSGRAAVFNGTV-----FANVINSAASFNESLWRSIGQAISTEARAMY 174
Query: 147 NAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQ-----GDTFNG 201
N G+ G+T+W+PNIN+ RDPRWGR ETPGEDP V G+YAV++VRG+Q FNG
Sbjct: 175 NLGKG-GLTYWSPNINVVRDPRWGRALETPGEDPFVVGRYAVNFVRGMQDVDDAAAGFNG 233
Query: 202 GKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASG 261
L L+ SACCKH+ AYD+D+W G TR+KFDARVT +D+ +T+Q PFE CV+ G AS
Sbjct: 234 DPLSRPLKTSACCKHYAAYDVDDWYGHTRFKFDARVTERDMVETFQRPFEMCVRDGDASA 293
Query: 262 IMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYD-AEGYAKSPEDAVV 320
+MC+YNRVNGIP+CAD LL+ T RR WG HGYI SDCDAV ++ D A +P +A
Sbjct: 294 VMCSYNRVNGIPACADARLLAGTLRRDWGLHGYIVSDCDAVRVMTDNATWLGYTPAEASA 353
Query: 321 DVLKAGMDVNCG------------SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLG 368
LKAG+D++CG FL + AAV+Q K+ ES+ID AL NL++ MRLG
Sbjct: 354 ASLKAGLDLDCGESWIVQKGKPVMDFLSTYGMAAVRQGKMRESDIDNALVNLYTTLMRLG 413
Query: 369 LFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIG 428
F+G P + + +CS AH+ LAL A+ +VLLKN GLLPL SK S+A+ G
Sbjct: 414 YFDGMPRYE---SLDEKDICSEAHRSLALDGARQSMVLLKNLDGLLPLDASKLASVAVRG 470
Query: 429 PNANS-AKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIA-K 486
P+A + K + G+Y GP CR ITP + I K V+I+ +
Sbjct: 471 PHAEAPEKVMDGDYTGPPCRYITPREG-----------------------ISKDVNISQQ 507
Query: 487 GADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITF 546
G D + M G++ E+E DR DL+LP Q E I RVA A+ P++LV+L GG +D++F
Sbjct: 508 GGDVTIYMGGINMHIEREGNDREDLLLPKNQTEEILRVAAASPSPIVLVILSGGGIDVSF 567
Query: 547 AKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRP 605
A+ IG+ILWAGYPG G A+A+VIFG +NPGGRLP+TW+ YI ++PMT M +RP
Sbjct: 568 AQSHPKIGAILWAGYPGGEGGHAIADVIFGRYNPGGRLPLTWFKNKYIHQLPMTSMALRP 627
Query: 606 QATSGNPGRTYRFYEGKEV-FPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQ-- 662
+ G PGRTY+FY+G +V +PFG GLSY+K+ Y+ LN+ ++ + +
Sbjct: 628 RPEHGYPGRTYKFYDGPDVLYPFGYGLSYTKFRYEL---------LNKETAVTLAPGRRH 678
Query: 663 -DVVHYKS--------VPELGTEFC-ETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGN 712
+ YK+ ++ + C ET F V+ V N G+ G + VL++ P
Sbjct: 679 CRQLSYKTGSVGPDCPAVDVASHACAETVSFNVS--VVNAGKADGANAVLVYTAPPAELA 736
Query: 713 GRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVV--GDE-- 768
G PIKQ+ F+ V + A +VF L+ C++ + V+ G ++V GD
Sbjct: 737 GAPIKQVAAFRRVAVKAGAAETVVFTLNVCKAFGIVEKTAYTVVPSGVSTVIVENGDSSA 796
Query: 769 -EYPISI 774
+P+ I
Sbjct: 797 VSFPVQI 803
>gi|326523729|dbj|BAJ93035.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 810
Score = 612 bits (1579), Expect = e-172, Method: Compositional matrix adjust.
Identities = 332/773 (42%), Positives = 463/773 (59%), Gaps = 43/773 (5%)
Query: 33 CDPSNPST-----ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAY 87
CDP+ + F +C +LP + R RDLV RLTL+EK+ L + A R+G+P Y
Sbjct: 46 CDPARFAALGLEMAGFRYCDASLPYADRVRDLVGRLTLEEKVRNLGDRAEGAARVGLPPY 105
Query: 88 EWWSEALHGVAGVGKG-IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY 146
WW EALHGV+ G G F + GATSFP VI +AA+F+ LW IG A+ E RA+Y
Sbjct: 106 LWWGEALHGVSDTGPGGTRFGDVVPGATSFPLVINSAAAFNETLWGAIGGAVSTEIRAMY 165
Query: 147 NAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKG 206
N G A +T+W+PNIN+ RDPRWGR ETPGEDP V G+YAVS+VR +Q D G G
Sbjct: 166 NLGHA-ELTYWSPNINVVRDPRWGRASETPGEDPFVVGRYAVSFVRAMQ-DIDGAGPGAG 223
Query: 207 K------LQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
++ S+CCKH+ AYD+D W R FDA+V +D+ +T++ PFE CV+ G AS
Sbjct: 224 ADPFARPIKVSSCCKHYAAYDVDAWLTADRLTFDAQVEERDMIETFERPFEMCVRDGDAS 283
Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSI-IYDAEGYAKSPEDAV 319
+MC+YNR+NG+P+CA+ LLS+T R +W HGYI SDCD+V + + DA+ + +A
Sbjct: 284 CVMCSYNRINGVPACANARLLSETVRGEWQLHGYIVSDCDSVRVMVRDAKWLGYNGVEAT 343
Query: 320 VDVLKAGMDVNCGSFLQKHTK-------AAVKQKKLPESEIDRALHNLFSVRMRLGLFNG 372
+KAG+D++CG F + AV+Q KL ESE+D AL NL+ MRLG F+G
Sbjct: 344 AAAMKAGLDLDCGMFWEGAQDFFTAFGLDAVRQGKLRESEVDNALRNLYLTLMRLGFFDG 403
Query: 373 NPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIG--PN 430
P ++ +GA+ VC+ H+ LA AA+ G+VL+KN HG LPL SK SL+L+G +
Sbjct: 404 IPELE---SLGANDVCTEEHKELAADAARQGMVLIKNDHGRLPLDTSKVNSLSLVGLLQH 460
Query: 431 ANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADH 490
N+ +LG+Y G CR +TP A++ V T CD ACS+A+ K VD
Sbjct: 461 INATDVMLGDYRGKPCRVVTPYDAIRKVVSATSMQV-CDHGACSTAANGKTVDA------ 513
Query: 491 VVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYD 550
+++ GL+ + EKE DR DL+LP Q I VAEA+ P+ILV++ G VD++FA+ +
Sbjct: 514 TIVIAGLNMSVEKEGNDREDLLLPWNQTNWINAVAEASPYPIILVIISAGGVDVSFAQNN 573
Query: 551 RNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATS 609
IG+I+WAGYPGE G A+A+V+FG +NPGGRLP+TWY +YI K+PMT M +RP A
Sbjct: 574 PKIGAIVWAGYPGEEGGTAIADVLFGKYNPGGRLPLTWYKSEYISKIPMTSMALRPVADK 633
Query: 610 GNPGRTYRFYEGKEV-FPFGCGLSYSKYSYKFKAVSQN-KLYLNQSSSTKMVENQ----D 663
G PGRTY+FY G EV +PFG GLSYS +SY + + + S K + +
Sbjct: 634 GYPGRTYKFYGGPEVLYPFGHGLSYSNFSYASDTTGASVTVRVGAWESCKQLTRKPGTTA 693
Query: 664 VVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQ 723
+ +V G E F +T V N G G H V+++ P + P+KQLV F+
Sbjct: 694 PLACPAVNVAGHGCKEEVSFSLT--VANRGSRDGAHVVMVYTVPPAEVDDAPLKQLVAFR 751
Query: 724 SVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
V + A ++ F L+ C++ + E V+ G ++VGD+ S V
Sbjct: 752 RVFVPAGAAVQVPFTLNVCKAFAIVEETAYTVVPSGVSTVLVGDDALSFSFSV 804
>gi|242052713|ref|XP_002455502.1| hypothetical protein SORBIDRAFT_03g012290 [Sorghum bicolor]
gi|241927477|gb|EES00622.1| hypothetical protein SORBIDRAFT_03g012290 [Sorghum bicolor]
Length = 825
Score = 612 bits (1579), Expect = e-172, Method: Compositional matrix adjust.
Identities = 320/761 (42%), Positives = 461/761 (60%), Gaps = 34/761 (4%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
F +C +LP ++R RDLV RL+L+EK+ L + A PR+G+P Y+WW EALHGV+ VG
Sbjct: 66 FRYCDASLPYAERVRDLVGRLSLEEKVRNLGDQAEGAPRVGLPPYKWWGEALHGVSDVGP 125
Query: 103 G-IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNI 161
G +F + GATSFP VI +AA+F+ LW IG + E RA+YN G A +T+W+PNI
Sbjct: 126 GGTWFGDVVPGATSFPLVINSAAAFNESLWRAIGGVVSTEIRAMYNLGHA-ELTYWSPNI 184
Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGK------LKGKLQASACCK 215
N+ RDPRWGR ETPGEDP V G+YAV++VRG+Q G ++ S+CCK
Sbjct: 185 NVVRDPRWGRASETPGEDPFVVGRYAVNFVRGMQDVVIAAGAAATADPFSRPIKVSSCCK 244
Query: 216 HFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSC 275
HF AYD+D W R FDA+V +D+ +T++ PFE C++ G AS +MC+YNR+NGIP+C
Sbjct: 245 HFAAYDVDAWFKADRLTFDAQVEERDMVETFERPFEMCIRDGDASCVMCSYNRINGIPAC 304
Query: 276 ADRNLLSKTARRQWGFHGYITSDCDAVSI-IYDAEGYAKSPEDAVVDVLKAGMDVNCG-- 332
AD LLS+T R QW HGYI SDCD+V + + DA+ + +A +KAG+D++CG
Sbjct: 305 ADARLLSETVRSQWQLHGYIVSDCDSVRVMVRDAKWLNYTGVEATAAAMKAGLDLDCGMF 364
Query: 333 -----SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVV 387
F + AV+Q K+ E+++D AL N+++ MRLG F+G P F +GAD V
Sbjct: 365 WEGARDFFTTYGVDAVRQGKIKEADVDNALGNVYTTLMRLGFFDGMPE---FESLGADDV 421
Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIG--PNANSAKTLLGNYAGPS 445
C+ H+ LA AA+ G+VLLKN LPL SK S++L+G + N+ +LG+Y G
Sbjct: 422 CTRDHKELAADAARQGMVLLKNDARRLPLDPSKINSVSLVGLLEHINATDVMLGDYRGKP 481
Query: 446 CRSITPLQALQNYVENTVYYPGCDTVACSSAS-IDKAVDIAKGADHVVLMMGLDQTQEKE 504
CR +TP A++ V N Y CD+ ACS+A + +A AK AD +++ GL+ + E+E
Sbjct: 482 CRIVTPYDAIRQVV-NATYVHACDSGACSTAEGMGRASRTAKIADATIVIAGLNMSVERE 540
Query: 505 ELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGE 564
DR DL+LP Q I VAEA+ P++LV++ G VD++FA+ + IG+I+WAGYPGE
Sbjct: 541 SNDREDLLLPWNQSSWINAVAEASTTPIVLVIMSAGGVDVSFAQNNTKIGAIVWAGYPGE 600
Query: 565 AGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKE 623
G A+A+V+FG +NPGGRLP+TW+ +Y+ ++PMT M +RP A G PGRTY+FY G
Sbjct: 601 EGGTAIADVLFGKYNPGGRLPLTWFKNEYVNQIPMTSMALRPDAAHGYPGRTYKFYGGPA 660
Query: 624 V-FPFGCGLSYSKYSYKFKAVSQN-KLYLNQSSSTKMVENQDVVHYKSVP-----ELGTE 676
V +PFG GLSY+ ++Y + + KM+ + P + +
Sbjct: 661 VLYPFGHGLSYTSFTYASGTTGATVTIPIGAWEHCKMLTYKSGKAPSPSPACPALNVASH 720
Query: 677 FC-ETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEI 735
C E F ++ V N G + G H V ++ P P KQLV F+ V + A ++
Sbjct: 721 RCDEVVSF--SLRVANTGGVGGDHVVPVYTAPPPEVGDAPRKQLVEFRRVFVPAGAAVDV 778
Query: 736 VFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
F L+ C++ + E V+ G ++VGD+ +S V
Sbjct: 779 PFALNVCKTFAIVEETAYTVVPSGVSTVIVGDDALALSFAV 819
>gi|226506870|ref|NP_001146482.1| uncharacterized protein LOC100280070 precursor [Zea mays]
gi|219887469|gb|ACL54109.1| unknown [Zea mays]
gi|413947917|gb|AFW80566.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 835
Score = 612 bits (1577), Expect = e-172, Method: Compositional matrix adjust.
Identities = 325/781 (41%), Positives = 467/781 (59%), Gaps = 51/781 (6%)
Query: 33 CDPSN-----PSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAY 87
CDP+ F +C +LP + R RDLV RL L+EK+ L + A PR+G+P Y
Sbjct: 63 CDPARFVALGLDMSRFRYCDASLPYADRVRDLVGRLALEEKVRNLGDQAEGAPRVGLPPY 122
Query: 88 EWWSEALHGVAGVGKG-IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY 146
+WW EALHGV+ VG G +F + GATSFP VI +AA+F+ LW IG + E RA+Y
Sbjct: 123 KWWGEALHGVSDVGPGGTWFGDVVPGATSFPLVINSAAAFNESLWRAIGGVVSTEIRAMY 182
Query: 147 NAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQ----------G 196
N G A +T+W+PNIN+ RDPRWGR ETPGEDP V G+YAV++VRG+Q
Sbjct: 183 NLGHA-ELTYWSPNINVVRDPRWGRASETPGEDPFVVGRYAVNFVRGMQDVDDRPYAAAA 241
Query: 197 DTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQ 256
D F+ ++ S+CCKHF AYD+D W R FDA+V +D+ +T++ PFE C++
Sbjct: 242 DPFSR-----PIKVSSCCKHFAAYDVDAWFKADRLTFDAQVEERDMVETFERPFEMCIRD 296
Query: 257 GRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSI-IYDAEGYAKSP 315
G AS +MC+YNR+NGIP+CAD LLS+T R QW HGYI SDCD+V + + DA+ +
Sbjct: 297 GDASCVMCSYNRINGIPACADARLLSETVRSQWQLHGYIVSDCDSVRVMVRDAKWLNYTG 356
Query: 316 EDAVVDVLKAGMDVNCG-------SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLG 368
+A +KAG+D++CG F + AV+Q K+ E ++D AL N+++ MRLG
Sbjct: 357 VEATAAAMKAGLDLDCGMFWEGARDFFTTYGVDAVRQGKIKEGDVDNALSNVYTTLMRLG 416
Query: 369 LFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIG 428
F+G P F +GA VC+ H+ LA AA+ G+VLLKN LPL +K S++L+G
Sbjct: 417 FFDGMPE---FESLGASNVCTDGHKELAADAARQGMVLLKNDARRLPLDPNKINSVSLVG 473
Query: 429 --PNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSAS-IDKAVDIA 485
+ N+ +LG+Y G CR +TP A++N V N Y CD+ AC++A + +A A
Sbjct: 474 LLEHINATDVMLGDYRGKPCRIVTPYNAIRNMV-NATYVHACDSGACNTAEGMGRASSTA 532
Query: 486 KGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
K AD +++ GL+ + E+E DR DL+LP Q I VA A+ P++LV++ G VD++
Sbjct: 533 KIADATIVIAGLNMSVERESNDREDLLLPWNQSSWINAVAMASPTPIVLVIMSAGGVDVS 592
Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMR 604
FA + IG+I+WAGYPGE G A+A+V+FG +NPGGRLP+TW+ +Y+ ++PMT M +R
Sbjct: 593 FAHNNTKIGAIVWAGYPGEEGGTAIADVLFGKYNPGGRLPLTWFKNEYVNQIPMTSMALR 652
Query: 605 PQATSGNPGRTYRFYEGKEV-FPFGCGLSYSKYSYKFKAVSQN-KLYLNQSSSTKMVENQ 662
P A G PGRTY+FY G V +PFG GLSY+ +SY +++ KM+ +
Sbjct: 653 PDAALGYPGRTYKFYGGPAVLYPFGHGLSYTNFSYASGTTGATVTIHIGAWEHCKMLTYK 712
Query: 663 DVVHYKS--VPEL--GTEFC-ETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIK 717
S P L + C E F ++ V N G + G H V ++ P P+K
Sbjct: 713 MGAPSPSPACPALNVASHMCSEVVSF--SLRVANTGGVGGDHVVPVYTAPPPEVGDAPLK 770
Query: 718 QLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDE----EYPIS 773
QLV F+ V + A ++ F L+ C++ + E V+ G +VVGD+ +P++
Sbjct: 771 QLVAFRRVFVPAGAAVDVPFALNVCKTFAIVEETAYTVVPSGVSTVVVGDDALVLSFPVT 830
Query: 774 I 774
I
Sbjct: 831 I 831
>gi|167525174|ref|XP_001746922.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774702|gb|EDQ88329.1| predicted protein [Monosiga brevicollis MX1]
Length = 1620
Score = 610 bits (1572), Expect = e-171, Method: Compositional matrix adjust.
Identities = 342/760 (45%), Positives = 455/760 (59%), Gaps = 55/760 (7%)
Query: 24 VDSTQP-PFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRL 82
+D+ P P CD + FPFC +L + R RD++SRL++ +K++ N+A A
Sbjct: 906 IDAAPPRPRPCD--ELPAKNFPFCNASLDLDTRIRDVISRLSIQDKVALTANTAGAAADA 963
Query: 83 GIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEA 142
G+PAY+WWSEALHGV G G+ F G ++ ATSFPQVI T+ASF+ LW+ IG I EA
Sbjct: 964 GLPAYQWWSEALHGV-GFSPGVTFMGKVQAATSFPQVIHTSASFNKTLWHHIGMTISTEA 1022
Query: 143 RALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGG 202
RA+ N QA G+TFWAPNINI RDPRWGRGQETPGEDP TG YA ++V G+Q G
Sbjct: 1023 RAMNNVNQA-GLTFWAPNINIIRDPRWGRGQETPGEDPYATGLYAANFVPGMQE-----G 1076
Query: 203 KLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGI 262
+ ++AS+CCKHF Y+L++W R+ F+A T QD+ADTY P FESCV+ GRAS +
Sbjct: 1077 EDTRYIKASSCCKHFFDYNLEDWHNVDRHHFNAIATDQDIADTYLPAFESCVRFGRASSL 1136
Query: 263 MCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDV 322
MC+YN VNG+PSCA+ ++++ AR WGF GYITSDC AV +Y Y + V V
Sbjct: 1137 MCSYNAVNGVPSCANADIMTTLAREAWGFDGYITSDCGAVEDVYSNHKYYNTTGATVNGV 1196
Query: 323 LKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKI 382
L AGMDV+CGSFL +H A+ + + +D+AL+NLF V+ RLG+F+ QP+ +
Sbjct: 1197 LSAGMDVDCGSFLSQHLADAIDSGDVTNATVDQALYNLFRVQFRLGMFDPAED-QPYLNL 1255
Query: 383 GADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA 442
D V +P HQ LAL+AA+ G+ LL+N LPL S LALIGPNAN+ + GNY
Sbjct: 1256 TTDAVNTPEHQQLALEAARQGMTLLENRDSRLPLDASSIKQLALIGPNANATGVMQGNYN 1315
Query: 443 GPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQE 502
G + I+P Q +Q YV N G AV AK AD VV+++GLDQTQE
Sbjct: 1316 GKAPFLISPQQGVQQYVSNVALELG-------------AVTAAKAADTVVMVIGLDQTQE 1362
Query: 503 KEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYP 562
E DR + LPG Q EL+ +VA A+ P+++V++ GG VD+T K N+
Sbjct: 1363 SEGHDREIIALPGMQAELVAQVANASSSPIVVVVMTGGAVDLTPVKDLDNV--------- 1413
Query: 563 GEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEG 621
G+AG ALAE +FGD+NPGGRLP T YP D + +V M D MRP ATSGNPGRTYRFY G
Sbjct: 1414 GQAGGQALAETLFGDNNPGGRLPYTLYPADLVNQVSMFDDGMRPNATSGNPGRTYRFYTG 1473
Query: 622 KEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETR 681
V+ +G GLSY+ +SY ++S+ + + + V T F
Sbjct: 1474 TPVYAYGTGLSYTSFSY-------------ETSTPSLRVSAERVRAWVAARGQTSFIRDE 1520
Query: 682 ---KFLVTIGVKNHGEMAGKHPVLLFVKPARRG-NGRPIKQLVGFQSVILNAKEKAEIVF 737
+ +T+ V+N+G +AG V +F+K G +G PIK L GF+ V L E I F
Sbjct: 1521 VDAEDYITVTVQNNGTVAGADVVQVFIKTTTPGADGNPIKSLCGFERVFLKPGETTSIQF 1580
Query: 738 ELSPCESLSRAREDGLMVIEEGTHFLVVGDE---EYPISI 774
++P LS G V GT + V E PIS+
Sbjct: 1581 PVTP-HDLSVVNSRGERVAVPGTWTVEVHHEARLSIPISV 1619
>gi|357128056|ref|XP_003565692.1| PREDICTED: beta-D-xylosidase 3-like [Brachypodium distachyon]
Length = 821
Score = 605 bits (1561), Expect = e-170, Method: Compositional matrix adjust.
Identities = 328/805 (40%), Positives = 470/805 (58%), Gaps = 53/805 (6%)
Query: 16 CFTSLLTRVDSTQPPFS---------CDPSNPST-----ETFPFCKTTLPISQRARDLVS 61
C S+ DS P + CDP+ ++ F +C +LP ++R RDLV
Sbjct: 20 CVVSVAFASDSVLGPITTNGRNYTKVCDPARFASLGLDMAGFRYCDASLPYAERVRDLVG 79
Query: 62 RLTLDEKISQLVNSAP-AIPRLGIPAYEWWSEALHGVAGVG-KGIFFNGTIRGATSFPQV 119
RLTL+EK++ L + A A R+G+P Y WW EALHGV+ G F + GATSFP V
Sbjct: 80 RLTLEEKVANLGDQAKGAEQRVGLPRYMWWGEALHGVSDTNPGGTRFGDVVPGATSFPLV 139
Query: 120 ILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGED 179
+ +AA+F+ LW IG A E RA+YN G A +T+W+PNIN+ RDPRWGR ETPGED
Sbjct: 140 LNSAAAFNETLWRAIGGATSTEIRAMYNLGHA-ELTYWSPNINVVRDPRWGRASETPGED 198
Query: 180 PLVTGKYAVSYVRGVQG-------DTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYK 232
P + G++AVS+VR +Q +L+ S+CCKH+ AYD+D W G R
Sbjct: 199 PFLVGRFAVSFVRAMQDIDDGANAGAGAADPFARRLKVSSCCKHYAAYDVDKWFGADRLS 258
Query: 233 FDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFH 292
FDA V +D+ +T++ PFE CV+ G AS +MC+YNR+NG+P+CA+ LL+ T RR W H
Sbjct: 259 FDANVQERDMVETFERPFEMCVRDGDASCVMCSYNRINGVPACANGRLLTGTVRRDWQLH 318
Query: 293 GYITSDCDAVSI-IYDAEGYAKSPEDAVVDVLKAGMDVNCG-------SFLQKHTKAAVK 344
GYI SDCD+V + + DA+ A +KAG+D++CG F + AV+
Sbjct: 319 GYIVSDCDSVRVMVRDAKWLGYDGVQATAAAMKAGLDLDCGMFWEGAKDFFTAYGLQAVR 378
Query: 345 QKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGI 404
Q KL E+E+D AL +L+ MRLG F+G+P Q +GA VC+ H+ +A +AA+ G+
Sbjct: 379 QGKLKEAEVDEALGHLYLTLMRLGFFDGSPEFQ---SLGASDVCTEEHKEMAAEAARQGM 435
Query: 405 VLLKNSHGLLPLPKSKSVSLALIG--PNANSAKTLLGNYAGPSCRSITPLQALQNYVENT 462
VLLKN H LPL +K SLAL+G + N+ +LG+Y G CR +TP +A++ V T
Sbjct: 436 VLLKNDHDRLPLDANKVNSLALVGLLQHINATDVMLGDYRGKPCRVVTPYEAIRKVVSGT 495
Query: 463 VYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELIT 522
CD AC + ++ A+ AK D +++ GL+ + E+E DR DL+LP Q + I
Sbjct: 496 -SMQACDKGACGTTALGAAI-AAKTVDATIVITGLNMSVEREGNDREDLLLPWDQTQWIN 553
Query: 523 RVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGG 582
VAEA++ P+ LV++ G VDI+FA+ + IG+ILWAGYPGE G +A+V+FG +NPGG
Sbjct: 554 AVAEASRDPITLVIISAGGVDISFAQNNPKIGAILWAGYPGEEGGTGIADVLFGKYNPGG 613
Query: 583 RLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEV-FPFGCGLSYSKY---S 637
RLP+TWY +YI K+PMT M +RP A G PGRTY+FY G +V +PFG GLSY+ + S
Sbjct: 614 RLPLTWYKNEYIGKLPMTSMALRPVADKGYPGRTYKFYSGPDVLYPFGHGLSYTNFTYDS 673
Query: 638 YKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVP----ELGTEFCETRKFLVTIGVKNHG 693
Y A K+ S K + + + P + C+ + T+ V N G
Sbjct: 674 YTTGASVTVKIGTAWEDSCKNLTYKPGTTASTAPCPAINVAGHGCQ-EEVSFTLKVSNTG 732
Query: 694 EMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGL 753
+ G H V ++ P + P+KQLV F+ + + A + E+ F LS C++ +
Sbjct: 733 GIGGSHVVPVYTAPPAEVDDAPLKQLVAFRRMFVPAGDAVEVPFTLSVCKAFAIVEGTAY 792
Query: 754 MVIEEGTHFLVVGDE----EYPISI 774
V+ G ++VGDE +P+ I
Sbjct: 793 TVVPAGVSRVLVGDESLSFSFPVKI 817
>gi|359477633|ref|XP_003632006.1| PREDICTED: LOW QUALITY PROTEIN: beta-D-xylosidase 3-like [Vitis
vinifera]
Length = 781
Score = 605 bits (1561), Expect = e-170, Method: Compositional matrix adjust.
Identities = 321/756 (42%), Positives = 462/756 (61%), Gaps = 25/756 (3%)
Query: 33 CDPSNPST-----ETFPFCKTTLPISQ-RARDLVSRLTLDEKISQLVNSAPAIPRLGIPA 86
CDP+ + + F +C ++LPI R +DLV R+TL+EK + ++ A + R+G+P
Sbjct: 17 CDPARFAALGFDMKDFVYCNSSLPIYDVRVKDLVDRMTLEEKATNVIYKAAGVERIGLPP 76
Query: 87 YEWWSEALHGVAGVGKG--IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA 144
Y+WWSEALHGV+ V FF+ T+ GATSFP VIL+AASF+ LW I Q + EARA
Sbjct: 77 YQWWSEALHGVSSVSINGPTFFDETVPGATSFPNVILSAASFNQSLWKTIRQVVSKEARA 136
Query: 145 LYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG--DTFNGG 202
YN G A G+TFW PN+N+ RDPRWGR QET GEDP YAVSYVRG+Q T N
Sbjct: 137 TYNLGHA-GLTFWCPNVNVARDPRWGRTQETXGEDPFTVSVYAVSYVRGLQDVEGTENTT 195
Query: 203 KLKGK-LQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASG 261
L + L+ S+ KHF AYDLDNW R F+ARV+ QD+A+T+ PFE+CV++G SG
Sbjct: 196 DLNSRPLKVSSSGKHFAAYDLDNWLNVDRNHFNARVSEQDMAETFLRPFEACVREGDVSG 255
Query: 262 IMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAV-SIIYDAEGYAKSPEDAVV 320
+MC++N +NGIP CAD L T R +W HGYI SDC ++ +I+ D + + E+AV
Sbjct: 256 VMCSFNNINGIPPCADPRLFKGTIRDEWNLHGYIVSDCWSIETIVEDQKFLDVTGEEAVA 315
Query: 321 DVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFG 380
LKAG+D+ CG + +AV ++ + ++D++L NL+ V MRLG F+G P + G
Sbjct: 316 LNLKAGLDLECGHYYNDSPASAVMAGRVGQHDLDQSLSNLYVVLMRLGFFDGIPALASLG 375
Query: 381 KIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGN 440
K D+ S H LA +AA+ GIVLLKN + LPL K +LAL+GPNA++ ++GN
Sbjct: 376 K--DDICLSAEHIELAREAARQGIVLLKNDNATLPLKSVK--NLALVGPNADAYGAMMGN 431
Query: 441 YAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSAS-IDKAVDIAKGADHVVLMMGL-D 498
YAGP CRS++P A + + N Y GC V C + + + KAV+ AK AD ++++G+ D
Sbjct: 432 YAGPPCRSVSPRDAF-SAIGNVTYEMGCGDVLCHNDTYVYKAVEAAKHADTTIIVVGITD 490
Query: 499 QTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLL--CGGPVDITFAKYDRNIGSI 556
+ E+ DRVDL+LPG Q L+ ++A+A P+ILV+ CGGP+DI+FA+ + I I
Sbjct: 491 VSIGTEDKDRVDLLLPGYQTHLVNQIAKATTAPIILVVCGHCGGPIDISFARDNPGIEPI 550
Query: 557 LWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRT 615
LWAG+PGE G A+A+V++G +NPGGRLP+TWY Y+ +PMT M +R + G PGR
Sbjct: 551 LWAGFPGEEGGNAIADVVYGKYNPGGRLPVTWYENGYVGMLPMTSMALRSVESLGYPGRK 610
Query: 616 YRFYEGKEVFPFGCGLSYSKYSYKFKAVSQN-KLYLNQSSSTKMVENQDVVHYKSVPE-L 673
Y+F+ G V+PFGCGLSY+ +SY A +++ +L + + + P L
Sbjct: 611 YKFFSGSTVYPFGCGLSYTNFSYSLTAPTRSIHTHLKKLQPCRSMAYSICSVIPQCPAVL 670
Query: 674 GTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKA 733
+ F + VK G M G V+++ P G IKQ++GF+ V +
Sbjct: 671 VDDLSCNETFEFEVAVKTVGSMDGSEVVIVYSSPPSGIVGTHIKQVIGFERVFVKVGXVE 730
Query: 734 EIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEE 769
++ F ++ C+SL G ++ G+ + G +
Sbjct: 731 KVKFSMNVCKSLGIVHSSGHTLLPSGSDIIKAGGDN 766
>gi|14164501|dbj|BAB55751.1| putative alpha-L-arabinofuranosidase/beta-D- xylosidase isoenzyme
ARA-I [Oryza sativa Japonica Group]
Length = 818
Score = 605 bits (1560), Expect = e-170, Method: Compositional matrix adjust.
Identities = 326/777 (41%), Positives = 462/777 (59%), Gaps = 46/777 (5%)
Query: 33 CDPSNPSTET-----FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAY 87
CDP+ + FP+C +LP + R RDLV R+TL+EK++ L + A PR+G+P Y
Sbjct: 47 CDPARFAAAGLDMAGFPYCDASLPYADRVRDLVGRMTLEEKVANLGDRAGGAPRVGLPRY 106
Query: 88 EWWSEALHGVAGVGKG-IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY 146
WW EALHGV+ VG G +F + GATSFP VI +AASF+ LW IG + E RA+Y
Sbjct: 107 LWWGEALHGVSDVGPGGTWFGDAVPGATSFPLVINSAASFNETLWRAIGGVVSTEIRAMY 166
Query: 147 NAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG---------D 197
N G A +T+W+PNIN+ RDPRWGR ETPGEDP V G+YAV++VRG+Q
Sbjct: 167 NLGHA-ELTYWSPNINVVRDPRWGRASETPGEDPFVVGRYAVNFVRGMQDIDGATTAASA 225
Query: 198 TFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQG 257
++ S+CCKH+ AYD+D W GT R FDARV +D+ +T++ PFE C++ G
Sbjct: 226 AAATDAFSRPIKVSSCCKHYAAYDVDAWNGTDRLTFDARVQERDMVETFERPFEMCIRDG 285
Query: 258 RASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSI-IYDAEGYAKSPE 316
AS +MC+YNR+NG+P+CAD LL++T RR W HGYI SDCD+V + + DA+ +
Sbjct: 286 DASCVMCSYNRINGVPACADARLLTETVRRDWQLHGYIVSDCDSVRVMVRDAKWLGYTGV 345
Query: 317 DAVVDVLKAGMDVNCG-------SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGL 369
+A +KAG+D++CG F + AV+Q KL ES +D AL NL+ MRLG
Sbjct: 346 EATAAAMKAGLDLDCGMFWEGVHDFFTTYGVDAVRQGKLKESAVDNALTNLYLTLMRLGF 405
Query: 370 FNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGP 429
F+G P ++ +GA VC+ H+ LA AA+ G+VLLKN LLPL K S+AL G
Sbjct: 406 FDGIPELE---SLGAADVCTEEHKELAADAARQGMVLLKNDAALLPLSPEKVNSVALFGQ 462
Query: 430 --NANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKG 487
+ N+ +LG+Y G CR +TP ++ V +T + AC S D A AK
Sbjct: 463 LQHINATDVMLGDYRGKPCRVVTPYDGVRKVVSSTSVH------ACDKGSCDTAAAAAKT 516
Query: 488 ADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFA 547
D +++ GL+ + E+E DR DL+LP Q I VAEA+ P++LV++ G VD++FA
Sbjct: 517 VDATIVVAGLNMSVERESNDREDLLLPWSQASWINAVAEASPSPIVLVIMSAGGVDVSFA 576
Query: 548 KYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQ 606
+ + IG+++WAGYPGE G A+A+V+FG +NPGGRLP+TWY +Y+ K+PMT M +RP
Sbjct: 577 QDNPKIGAVVWAGYPGEEGGTAIADVLFGKYNPGGRLPLTWYKNEYVSKIPMTSMALRPD 636
Query: 607 ATSGNPGRTYRFYEGKEV-FPFGCGLSYSKYSYKFKAVSQNKLYLNQSS---STKMVENQ 662
A G PGRTY+FY G +V +PFG GLSY+ ++Y A + + + + ++
Sbjct: 637 AEHGYPGRTYKFYGGADVLYPFGHGLSYTNFTYA-SATAAAPVTVKVGAWEYCKQLTYKA 695
Query: 663 DVVHYKSVP--ELGTEFC-ETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQL 719
V + P + + C E F VT V N G G H V ++ P +G P KQL
Sbjct: 696 GVSSPPACPAVNVASHACQEEVSFAVT--VANTGGRDGTHVVPMYTAPPAEVDGAPRKQL 753
Query: 720 VGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
V F+ V + A E+ F L+ C++ + E V+ G ++VGD+ +S V
Sbjct: 754 VAFRRVRVAAGAAVEVAFALNVCKAFAIVEETAYTVVPSGVSRVLVGDDALSLSFPV 810
>gi|125535275|gb|EAY81823.1| hypothetical protein OsI_36995 [Oryza sativa Indica Group]
Length = 885
Score = 603 bits (1556), Expect = e-170, Method: Compositional matrix adjust.
Identities = 320/661 (48%), Positives = 430/661 (65%), Gaps = 23/661 (3%)
Query: 132 YRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYV 191
Y QA+ E RA+YN GQA G+TFW+PN+NIFRDPRWGRGQETPGEDP V +YA +YV
Sbjct: 223 YVCAQAVSDEGRAMYNGGQA-GLTFWSPNVNIFRDPRWGRGQETPGEDPAVAARYAAAYV 281
Query: 192 RGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFE 251
RG+Q + G+LK +ACCKHFTAYDLDNW GT R+ F+A VT QDL DT+ PF
Sbjct: 282 RGLQQQQPSSGRLK----LAACCKHFTAYDLDNWSGTDRFHFNAVVTRQDLEDTFNVPFR 337
Query: 252 SCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGY 311
SCV GRA+ +MC+YN+VNG+P+CAD L T RR+WG GYI SDCD+V + Y + Y
Sbjct: 338 SCVVDGRAASVMCSYNQVNGVPTCADAAFLRGTIRRRWGLAGYIVSDCDSVDVFYSDQHY 397
Query: 312 AKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFN 371
++ EDAV L+AG+D++CG FL ++T+ AV Q K+ + +ID A+ N +V+MRLG+F+
Sbjct: 398 TRTREDAVAATLRAGLDLDCGPFLAQYTEGAVAQGKVGDGDIDAAVTNTVTVQMRLGMFD 457
Query: 372 GNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPL-PKSKSVSLALIGPN 430
G+P QPFG +G VC+ AHQ LA++AA+ GIVLLKN LPL P + ++A++GP+
Sbjct: 458 GDPAAQPFGHLGPQHVCTAAHQELAVEAARQGIVLLKNDGRALPLSPATARRAVAVVGPH 517
Query: 431 ANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSAS--IDKAVDIAKGA 488
A + ++GNYAG CR TPLQ + Y + PGC VAC+ + I AVD A+ A
Sbjct: 518 AEATVAMIGNYAGKPCRYTTPLQGVARYAARAAHQPGCTDVACAGSGQPIAAAVDAARRA 577
Query: 489 DHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAK 548
D +++ GLDQ E E LDR L+LPGRQ ELI+ VA+A+K PVILVL+ GGP+DI FA+
Sbjct: 578 DATIVVAGLDQKIEAEGLDRASLLLPGRQAELISSVAKASKGPVILVLMSGGPIDIGFAQ 637
Query: 549 YDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQA 607
D I ILWAGYPG+AG A+A+VIFG HNPGG+LP+TWYPQDY+ KVPMT+M MR
Sbjct: 638 NDPKIAGILWAGYPGQAGGQAIADVIFGHHNPGGKLPVTWYPQDYLQKVPMTNMAMRANP 697
Query: 608 TSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKF-KAVSQNKLYLNQSSSTKMVENQDVVH 666
G PGRTYRFY G + PFG GLSY+ +++ A SQ + L + ++
Sbjct: 698 AKGYPGRTYRFYTGPTIHPFGHGLSYTSFTHSIAHAPSQLTVRLAAHHAAASASASASLN 757
Query: 667 YKS------VPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLF-------VKPARRGNG 713
+ + CE + V + V+N GE G H VL++ A G+G
Sbjct: 758 ATARLSRAAAVRVAHARCEELRMPVHVDVRNVGERDGAHTVLVYAAAPASSAAEAAAGHG 817
Query: 714 RPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPIS 773
P++QLV F+ V + A A + + C+ LS A +G+ I G H L++G+ + ++
Sbjct: 818 APVRQLVAFEKVHVGAGGTARVEMGIDVCDGLSVADRNGVRRIPVGEHRLIIGELTHTVT 877
Query: 774 I 774
I
Sbjct: 878 I 878
Score = 126 bits (316), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 57/95 (60%), Positives = 70/95 (73%)
Query: 42 TFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVG 101
T PFC+ +LP RARDLV+R+T EK+ LVN+A +PRLG+ YEWWSEALHGV+ G
Sbjct: 40 TLPFCRRSLPARARARDLVARMTRAEKVRLLVNNAAGVPRLGVAGYEWWSEALHGVSDTG 99
Query: 102 KGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQ 136
G+ F G GAT+FPQVI TAASF++ LW IGQ
Sbjct: 100 PGVRFGGAFPGATAFPQVIGTAASFNATLWELIGQ 134
>gi|77552476|gb|ABA95273.1| Beta-D-xylosidase, putative, expressed [Oryza sativa Japonica
Group]
Length = 883
Score = 603 bits (1555), Expect = e-169, Method: Compositional matrix adjust.
Identities = 321/659 (48%), Positives = 433/659 (65%), Gaps = 21/659 (3%)
Query: 132 YRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYV 191
Y QA+ E RA+YN GQA G+TFW+PN+NIFRDPRWGRGQETPGEDP V +YA +YV
Sbjct: 223 YVCAQAVSDEGRAMYNGGQA-GLTFWSPNVNIFRDPRWGRGQETPGEDPAVAARYAAAYV 281
Query: 192 RGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFE 251
RG+Q + G+LK +ACCKHFTAYDLDNW GT R+ F+A VT QDL DT+ PF
Sbjct: 282 RGLQQQQPSSGRLK----LAACCKHFTAYDLDNWSGTDRFHFNAVVTRQDLEDTFNVPFR 337
Query: 252 SCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGY 311
SCV GRA+ +MC+YN+VNG+P+CAD L T RR+WG GYI SDCD+V + Y + Y
Sbjct: 338 SCVVDGRAASVMCSYNQVNGVPTCADAAFLRGTIRRRWGLAGYIVSDCDSVDVFYSDQHY 397
Query: 312 AKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFN 371
++ EDAV L+AG+D++CG FL ++T+ AV Q K+ + +ID A+ N +V+MRLG+F+
Sbjct: 398 TRTREDAVAATLRAGLDLDCGPFLAQYTEGAVAQGKVGDGDIDAAVTNTVTVQMRLGMFD 457
Query: 372 GNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPL-PKSKSVSLALIGPN 430
G+P QPFG +G VC+ AHQ LA++AA+ GIVLLKN LPL P + ++A++GP+
Sbjct: 458 GDPAAQPFGHLGPQHVCTAAHQELAVEAARQGIVLLKNDGRALPLSPATARRAVAVVGPH 517
Query: 431 ANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSAS--IDKAVDIAKGA 488
A + ++GNYAG CR TPLQ + Y + PGC VAC+ + I AVD A+ A
Sbjct: 518 AEATVAMIGNYAGKPCRYTTPLQGVARYAARAAHQPGCTDVACAGSGQPIAAAVDAARRA 577
Query: 489 DHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAK 548
D +++ GLDQ E E LDR L+LPGRQ ELI+ VA+A+K PVILVL+ GGP+DI FA+
Sbjct: 578 DATIVVAGLDQKIEAEGLDRASLLLPGRQAELISSVAKASKGPVILVLMSGGPIDIGFAQ 637
Query: 549 YDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQA 607
D I ILWAGYPG+AG A+A+VIFG HNPGG+LP+TWYPQDY+ KVPMT+M MR
Sbjct: 638 NDPKIAGILWAGYPGQAGGQAIADVIFGHHNPGGKLPVTWYPQDYLQKVPMTNMAMRANP 697
Query: 608 TSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKF-KAVSQNKLYL----NQSSSTKMVENQ 662
G PGRTYRFY G + PFG GLSY+ +++ A SQ + L +S++ +
Sbjct: 698 AKGYPGRTYRFYTGPTIHPFGHGLSYTSFTHSIAHAPSQLTVRLSAHHAAASASASLNAT 757
Query: 663 DVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLF-------VKPARRGNGRP 715
+ + + CE + V + V+N GE G H VL++ A G+G P
Sbjct: 758 ARLSRAAAVRVAHARCEELRMPVHVDVRNVGERDGAHTVLVYAAAPASSAAEAAAGHGAP 817
Query: 716 IKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
++QLV F+ V + A A + + C+ LS A +G+ I G H L++G+ + ++I
Sbjct: 818 VRQLVAFEKVHVGAGGTARVEMGIDVCDGLSVADRNGVRRIPVGEHRLIIGELTHTVTI 876
Score = 127 bits (318), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 58/95 (61%), Positives = 70/95 (73%)
Query: 42 TFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVG 101
T PFC+ +LP RARDLV+RLT EK+ LVN+A +PRLG+ YEWWSEALHGV+ G
Sbjct: 40 TLPFCRRSLPARARARDLVARLTRAEKVRLLVNNAAGVPRLGVAGYEWWSEALHGVSDTG 99
Query: 102 KGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQ 136
G+ F G GAT+FPQVI TAASF++ LW IGQ
Sbjct: 100 PGVRFGGAFPGATAFPQVIGTAASFNATLWELIGQ 134
>gi|318136853|gb|ADV41671.1| alpha-L-arabinofuranosidase/beta-D-xylosidase [Actinidia deliciosa
var. deliciosa]
Length = 634
Score = 597 bits (1540), Expect = e-168, Method: Compositional matrix adjust.
Identities = 301/640 (47%), Positives = 415/640 (64%), Gaps = 22/640 (3%)
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
EARA+YN G A G+TFW+PN+NIFRDPRWGRGQETPGEDP++ G YA SYVRG+QG N
Sbjct: 2 EARAMYNGGMA-GLTFWSPNVNIFRDPRWGRGQETPGEDPMLAGNYAASYVRGLQG---N 57
Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
G+ +L+ +ACCKH+TAYDLDNW+G R+ F+ARV+ QD+ DT++ PF CV G+ +
Sbjct: 58 DGE---RLKVAACCKHYTAYDLDNWRGVDRFHFNARVSKQDIKDTFEIPFRECVLGGKVA 114
Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
+MC+YN+VNGIP+CA+ LL T R W +GYI SDCD+V + ++ + Y PE+AV
Sbjct: 115 SVMCSYNQVNGIPTCANPKLLKGTIRGSWRLNGYIVSDCDSVGVFFENQHYTSKPEEAVA 174
Query: 321 DVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFG 380
+KAG+D++CG FL HT+AAV++ + + EI+ AL N + +MRLG+F+G P+ +G
Sbjct: 175 AAIKAGLDLDCGPFLAIHTEAAVRRGLVSQLEINWALANTMTAQMRLGMFDGEPSAHQYG 234
Query: 381 KIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGN 440
+G VC+PAHQ LAL+AA+ GIVLL+N LPL + ++A+IGPN++ T++GN
Sbjct: 235 NLGPRDVCTPAHQQLALEAARQGIVLLENRGRSLPLSIRRHRTVAVIGPNSDVTVTMIGN 294
Query: 441 YAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASI-DKAVDIAKGADHVVLMMGLDQ 499
YAG +C TPLQ + Y T++ GC V C+ + A A+ AD VL+MGLDQ
Sbjct: 295 YAGVACGYTTPLQGIGRYTR-TIHQAGCTDVHCNGNQLFGAAEAAARQADATVLVMGLDQ 353
Query: 500 TQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWA 559
+ E E +DR +LPG QQEL++RVA A++ P ILVL+ GGP+D+TFAK D I +I+W
Sbjct: 354 SIEAEFVDRAGPLLPGHQQELVSRVARASRGPTILVLMSGGPIDVTFAKNDPRISAIIWV 413
Query: 560 GYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRF 618
GYPG+AG A+A+V+FG NPGG+LPMTWYPQ+Y+ +PMTDM MR G PGRTYRF
Sbjct: 414 GYPGQAGGTAIADVLFGTTNPGGKLPMTWYPQNYVTHLPMTDMAMRADPARGYPGRTYRF 473
Query: 619 YEGKEVFPFGCGLSYSKYSYKFKA----VSQNKLYLNQSSSTKMVENQDVVHYKSVPELG 674
Y G VFPFG GLSY+ +++ VS L ++++ M+ V + L
Sbjct: 474 YRGPVVFPFGLGLSYTTFAHNLAHGPTLVSVPLTSLKATANSTMLSKAVRVSHADCNALS 533
Query: 675 TEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAE 734
V + VKN G M G H +L+F P G KQLVGF + + A +
Sbjct: 534 -------PLDVHVDVKNTGSMDGTHTLLVFTSPP-DGKWAASKQLVGFHKIHIAAGSETR 585
Query: 735 IVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
+ + C+ LS G+ I G H L +GD + +S+
Sbjct: 586 VRIAVHVCKHLSVVDRFGIRRIPLGEHKLQIGDLSHHVSL 625
>gi|224128360|ref|XP_002320310.1| predicted protein [Populus trichocarpa]
gi|222861083|gb|EEE98625.1| predicted protein [Populus trichocarpa]
Length = 635
Score = 597 bits (1539), Expect = e-168, Method: Compositional matrix adjust.
Identities = 304/648 (46%), Positives = 420/648 (64%), Gaps = 33/648 (5%)
Query: 136 QAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQ 195
Q + EARA++N G A G+T+W+PN+NIFRDPRWGRGQETPGEDP+V GKYA SYVRG+Q
Sbjct: 2 QVVSDEARAMFNGGVA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVVVGKYAASYVRGLQ 60
Query: 196 GDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVK 255
G N +L+ +ACCKHFTAYDLDNW G R+ F+A V+ QD+ DT+ PF CVK
Sbjct: 61 GSDGN------RLKVAACCKHFTAYDLDNWNGVDRFHFNAEVSKQDMEDTFDVPFRMCVK 114
Query: 256 QGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP 315
+G+ + +MC+YN+VNGIP+CAD NLL KT R G + + I + +
Sbjct: 115 EGKVASVMCSYNQVNGIPTCADPNLLKKTVR------GTLFQTVTLLEFIMGSNTILQPR 168
Query: 316 EDAVVDVLK-AGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNP 374
+LK A +D++CG FL +HT+ AVK+ L E+EI+ AL N +V+MRLG+F+G P
Sbjct: 169 RKQPRMLLKQASLDLDCGPFLGQHTEDAVKKGLLNEAEINNALLNTLTVQMRLGMFDGEP 228
Query: 375 TMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSA 434
+ Q +G +G + VC+PAHQ LAL+AA+ GIVLLKN LPL + +S+A++GPN+N
Sbjct: 229 SSQLYGNLGPNDVCTPAHQELALEAARQGIVLLKNHGPSLPLSTRRHLSVAIVGPNSNVT 288
Query: 435 KTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVAC-SSASIDKAVDIAKGADHVVL 493
T++GNYAG +C TPLQ +Q Y + T++ GC VAC S A+D A+ AD VL
Sbjct: 289 ATMIGNYAGLACGYTTPLQGIQRYAQ-TIHRQGCADVACVSDQQFSAAIDAARQADATVL 347
Query: 494 MMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNI 553
+MGLDQ+ E E DR L+LPGRQQEL+++VA A+K P ILVL+ GGP+D++FA+ D I
Sbjct: 348 VMGLDQSIEAEFRDRTGLLLPGRQQELVSKVAAASKGPTILVLMSGGPIDVSFAENDPKI 407
Query: 554 GSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNP 612
GSI+WAGYPG+AG A+++V+FG NPGG+LPMTWYPQDYI +PMT+M MR + G P
Sbjct: 408 GSIVWAGYPGQAGGAAISDVLFGITNPGGKLPMTWYPQDYITNLPMTNMAMRSSKSKGYP 467
Query: 613 GRTYRFYEGKEVFPFGCGLSYSKYSYKFKA------VSQNKLYLNQSSSTKMVENQDVVH 666
GRTYRFY+GK V+PFG G+SY+ + + + V + ++T + V H
Sbjct: 468 GRTYRFYKGKVVYPFGHGISYTNFVHTIASAPTMVSVPLDGHRHGSGNATISGKAIRVTH 527
Query: 667 YKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVI 726
+ C + + VKN G M G H +L++ +P R + P KQLV F+ V
Sbjct: 528 AR---------CNRLSLGMQVDVKNTGSMDGTHTLLVYSRPPAR-HWAPHKQLVAFEKVH 577
Query: 727 LNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
+ A + + + C+SLS G+ I G H L +GD ++ +S+
Sbjct: 578 VAAGTQQRVGINIHVCKSLSVVDGSGIRRIPMGEHSLHIGDVKHSVSL 625
>gi|356510699|ref|XP_003524073.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
[Glycine max]
Length = 613
Score = 594 bits (1531), Expect = e-167, Method: Compositional matrix adjust.
Identities = 291/551 (52%), Positives = 385/551 (69%), Gaps = 7/551 (1%)
Query: 31 FSCD-PSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEW 89
F+CD +P+ + FC +L + R +DLV RLTL EKI LVNSA + RLGIP YEW
Sbjct: 30 FACDVGKSPAVAGYGFCDKSLGVEARVKDLVGRLTLQEKIGNLVNSAGDVSRLGIPRYEW 89
Query: 90 WSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG 149
WSEALHGV+ VG G F+ + GATSFP ILTAASF++ L+ IG+ + EA A+YN G
Sbjct: 90 WSEALHGVSNVGLGTRFSNVVPGATSFPMPILTAASFNTSLFEVIGRVVSTEAGAMYNVG 149
Query: 150 QAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQ 209
A G+T+W+PNINIFRDPRWGRG ETPGEDP++T KYA YV+G+Q +GG KL+
Sbjct: 150 LA-GLTYWSPNINIFRDPRWGRGLETPGEDPVLTSKYAAGYVKGLQ--QTDGGD-PNKLK 205
Query: 210 ASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRV 269
+ACCKH+TAYD+D WKG RY F+A +T QDL DT+QPPF+SCV G + +MC+YN+V
Sbjct: 206 VAACCKHYTAYDVDKWKGIQRYTFNAVLTKQDLEDTFQPPFKSCVIDGNVASVMCSYNKV 265
Query: 270 NGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDV 329
NG P+CAD +LL R +W +GY+ SDCD+V ++Y + Y K+PE+A + AG+D+
Sbjct: 266 NGKPTCADPDLLKGVVRGEWKLNGYMVSDCDSVEVLYKYQHYTKTPEEAAAISILAGLDL 325
Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
NCG FL ++T+ AVKQ + ES I+ A+ N F+ MRLG F+G+P QP+G +G VC+
Sbjct: 326 NCGRFLGQYTEGAVKQGLIDES-INNAVSNNFATLMRLGFFDGDPRKQPYGNLGPKDVCT 384
Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
PA+Q LA +AA+ GIV LKNS LPL SLA+IGPNAN+ + ++GNY G C+ I
Sbjct: 385 PANQELAREAARQGIVSLKNSPASLPLNAKAIKSLAVIGPNANATRVMIGNYEGIPCKYI 444
Query: 450 TPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRV 509
+PLQ L +V T Y GC V C + +D A I+ D V+++G E E LDRV
Sbjct: 445 SPLQGLTAFVP-TSYAAGCLDVRCPNPVLDDAKKISASGDATVIVVGASLAIEAESLDRV 503
Query: 510 DLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVA 569
+++LPG+QQ L+T VA A+K PVILV++ GG +D++FAK + I SILW GYPGEAG A
Sbjct: 504 NILLPGQQQLLVTEVANASKGPVILVIMSGGGMDVSFAKDNNKITSILWVGYPGEAGGAA 563
Query: 570 LAEVIFGDHNP 580
+A+VIFG HNP
Sbjct: 564 IADVIFGFHNP 574
>gi|37359708|dbj|BAC98299.1| LEXYL2 [Solanum lycopersicum]
Length = 633
Score = 591 bits (1523), Expect = e-166, Method: Compositional matrix adjust.
Identities = 305/645 (47%), Positives = 424/645 (65%), Gaps = 15/645 (2%)
Query: 134 IGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRG 193
IG+ + E RA+YN GQA G+T+W+PN+NI+RDPRWGRGQET GEDP ++ +Y V+YV+G
Sbjct: 2 IGKVVSTEGRAMYNVGQA-GLTYWSPNVNIYRDPRWGRGQETAGEDPTLSSRYGVAYVKG 60
Query: 194 VQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESC 253
+Q + GK K L+ ++CCKH+TAYD+D+WKG RY F+A+VT QDL DT+ PPF+SC
Sbjct: 61 LQ--QRDDGK-KDMLKVASCCKHYTAYDVDDWKGIQRYNFNAKVTQQDLDDTFNPPFKSC 117
Query: 254 VKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAK 313
V G + +MC+YN+V+G P+C D +LL+ R QW +GYI +DCD+++ +Y A+ Y K
Sbjct: 118 VLDGNVASVMCSYNQVDGKPTCGDYDLLAGVIRGQWKLNGYIVTDCDSLNEMYWAQHYTK 177
Query: 314 SPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGN 373
+PE+ L AG+ +NCGS+L K+T+ AV Q + ES IDRA+ N F+ MRLG F+GN
Sbjct: 178 TPEETAALSLNAGLGLNCGSWLGKYTQGAVNQGLVNESVIDRAVTNNFATLMRLGFFDGN 237
Query: 374 PTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANS 433
P Q +G +G +C+ HQ LA +AA+ GIVLLKN+ G LPL SLA+IGPNAN
Sbjct: 238 PKNQLYGNLGPKDICTEDHQELAREAARQGIVLLKNTAGSLPLSPKSIKSLAVIGPNANL 297
Query: 434 AKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVL 493
A T++G+Y G C+ TPL L V +TVY GCD +AC++A +D A +A AD VVL
Sbjct: 298 AYTMVGSYEGSPCKYTTPLDGLGASV-STVYQQGCD-IACATAQVDNAKKVAAAADAVVL 355
Query: 494 MMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNI 553
+MG DQT E+E DR ++ LPG+Q L+T VA +K PVILV++ GG +D+ FA + +
Sbjct: 356 VMGSDQTIERESKDRFNITLPGQQSLLVTEVASVSKGPVILVIMSGGGMDVKFAVDNPKV 415
Query: 554 GSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNP 612
SILW G+PGEAG ALA+V+FG HNPGGRLPMTWYPQ Y+ KV MT+M MR +G P
Sbjct: 416 TSILWVGFPGEAGGAALADVVFGYHNPGGRLPMTWYPQSYVDKVDMTNMNMRADPKTGFP 475
Query: 613 GRTYRFYEGKEVFPFGCGLSYSKYSYKF-KAVSQNKLYLNQSSSTKMVENQDVVHYKSVP 671
GR+YRFY+G VF FG GLSY++Y + KA + L + + + + KS+
Sbjct: 476 GRSYRFYKGPTVFNFGDGLSYTQYKHHLVKAPKFVSIPLEEGHACRSTK------CKSID 529
Query: 672 ELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKE 731
+ + C + + V+N G+M G H VLLF P N P K L+ FQ + L +
Sbjct: 530 AVNEQGCNNLGLDIHLKVQNVGKMRGSHTVLLFTSPPSVHNA-PQKHLLDFQKIHLTPQS 588
Query: 732 KAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
+ + F L C+ LS E G + G H L +GD ++ +++ +
Sbjct: 589 EGVVKFNLDVCKHLSVVDEVGNRKVALGLHVLHIGDLKHSLTLRI 633
>gi|413954831|gb|AFW87480.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 814
Score = 588 bits (1515), Expect = e-165, Method: Compositional matrix adjust.
Identities = 318/773 (41%), Positives = 450/773 (58%), Gaps = 71/773 (9%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
FP+C +LP + R RDL+ +T++EK+ L + + PR+G+P Y+WWSEALHGV+ G
Sbjct: 68 FPYCDASLPYADRVRDLIGWMTVEEKVGNLGDISHGAPRVGLPPYKWWSEALHGVSSTGP 127
Query: 103 GIFFN------------GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ 150
+ F+ T+ AT F VI +AASF+ LW IGQA+ EARA+YN G+
Sbjct: 128 TMLFDDLHSKPGNHSGRATVNNATVFANVINSAASFNETLWNSIGQAVSTEARAMYNLGK 187
Query: 151 AIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG--DTFNGGKLKGKL 208
G+T+W+PNIN+ RDPRWGR ETPGEDP V G+YAV++VRG+Q ++G +
Sbjct: 188 G-GLTYWSPNINVVRDPRWGRALETPGEDPYVAGRYAVNFVRGMQDIPGHYSGDPSARPI 246
Query: 209 QASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
+ SACCKH AYD+DNW TR+ +DARV+ +D+A+T+ PFE CV++G S +MC+YNR
Sbjct: 247 KTSACCKHHAAYDVDNWHNQTRFTYDARVSERDMAETFLRPFEMCVREGDVSSVMCSYNR 306
Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAK--SPEDAVVDVLKAG 326
VNG+P+CAD LLS T R +W +GYI SDCDAV ++ D + + E + V L+AG
Sbjct: 307 VNGVPACADARLLSGTVRGEWHLNGYIVSDCDAVRVMTDNATWLNFTAAESSAVS-LRAG 365
Query: 327 MDVNCGS------------FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNP 374
MD++C +L ++ AAV Q K+ ES+ID AL NL+ MRLG F+ P
Sbjct: 366 MDLDCAESWIEEEGRPLRDYLSEYGMAAVAQGKMRESDIDNALTNLYMTLMRLGYFDNIP 425
Query: 375 TMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSA 434
+ + VC+ H+ LAL A+ GIVLLKN HGLLPL K++++A+ GP+A +
Sbjct: 426 R---YASLNETDVCTDEHKSLALDGARQGIVLLKNDHGLLPLDPKKTLAVAVHGPHARAP 482
Query: 435 KTLL-GNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVL 493
+ ++ G+Y GP CR +TP Q I + V I+ A +
Sbjct: 483 EKIMDGDYTGPPCRYVTPRQG-----------------------ISRDVKISHKAKMTIY 519
Query: 494 MMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNI 553
+ G++ E+E DR DL+LP Q E I A+A+ P+ILV+L GG +DI+FA+ I
Sbjct: 520 LGGINLYIEREGNDREDLLLPKNQTEEILHFAQASPTPIILVILSGGGIDISFAQKHPKI 579
Query: 554 GSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNP 612
G+ILWAGYPG G A+A+VIFG +NPGGRLP+TW+ YI ++PMT M+ RP G P
Sbjct: 580 GAILWAGYPGGEGGNAIADVIFGRYNPGGRLPLTWFKNKYIEQIPMTSMEFRPVPEKGYP 639
Query: 613 GRTYRFYEGKEV-FPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVP 671
GRTY+FY+G EV +PFG GLSY+K+ Y+ + K + + V +VP
Sbjct: 640 GRTYKFYDGPEVLYPFGYGLSYTKFQYETSTDGVSVSLPAPGGHCKGLSYKPSV--ATVP 697
Query: 672 -----ELGTEFC-ETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSV 725
+ C ET F + V N G G H VL++ P PIKQ+ F+ V
Sbjct: 698 ACQAVNVADHACTETVSF--NVSVTNAGGRGGAHVVLVYTAPPPEVAEAPIKQVAAFRRV 755
Query: 726 ILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVV--GDEEYPISIFV 776
+ A+ A + F L+ C++ V+ G ++V GD +S V
Sbjct: 756 FVAARSTATVPFALNVCKAFGIVERTAYTVVPSGVSKVLVENGDSSSSVSFPV 808
>gi|115486735|ref|NP_001068511.1| Os11g0696400 [Oryza sativa Japonica Group]
gi|77552754|gb|ABA95551.1| Glycosyl hydrolase family 3 C terminal domain containing protein
[Oryza sativa Japonica Group]
gi|113645733|dbj|BAF28874.1| Os11g0696400 [Oryza sativa Japonica Group]
Length = 816
Score = 586 bits (1510), Expect = e-164, Method: Compositional matrix adjust.
Identities = 316/770 (41%), Positives = 447/770 (58%), Gaps = 67/770 (8%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
F +C +LP + R RDL+ R+T++EK+ L + R+G+PAY WWSEALHG++ G
Sbjct: 68 FRYCDASLPYADRVRDLIGRMTVEEKVGALGDWTDGAARIGLPAYRWWSEALHGLSSTGP 127
Query: 103 GIFFN-----------GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA 151
F+ + AT F VI +AASF+ LW IGQA+ EARA+YN G+
Sbjct: 128 TTKFDDLATPHLHSGVSAVYNATVFANVINSAASFNETLWKSIGQAVSTEARAMYNMGKG 187
Query: 152 IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-----DTFNGGKLKG 206
G+T+W+PNIN+ RDPRWGR ETPGEDP V G+YAV++VRG+Q GG
Sbjct: 188 -GLTYWSPNINVVRDPRWGRALETPGEDPYVVGRYAVNFVRGMQDIPGHEAVAAGGDPNT 246
Query: 207 K-LQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCA 265
+ L+ SACCKH+ AYDLD+W TR++FDARV +D+ +T+Q PFE CV+ G S +MC+
Sbjct: 247 RPLKTSACCKHYAAYDLDDWHNHTRFEFDARVDERDMVETFQRPFEMCVRDGDVSSVMCS 306
Query: 266 YNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYD-AEGYAKSPEDAVVDVLK 324
YNRVNGIP+CAD LLS+T RR WG HGYI SDCDAV ++ D A + +A LK
Sbjct: 307 YNRVNGIPACADARLLSQTIRRDWGLHGYIVSDCDAVRVMTDNATWLGYTGAEASAAALK 366
Query: 325 AGMDVNCG-------------SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFN 371
AG+D++CG FL + AV + K+ ES+ID AL N + MRLG F+
Sbjct: 367 AGLDLDCGESWKNDTDGHPLMDFLTTYGMEAVNKGKMRESDIDNALTNQYMTLMRLGYFD 426
Query: 372 GNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNA 431
+ + +G +C+ H+ LAL A+ GIVLLKN + LLPL +K + + GP+
Sbjct: 427 ---DIAQYSSLGRQDICTDQHKTLALDGARQGIVLLKNDNKLLPLDANKVGFVNVRGPHV 483
Query: 432 NSAKTLL-GNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADH 490
+ + ++ G+Y GP CR +TP Q + YV + A+
Sbjct: 484 QAPEKIMDGDYTGPPCRYVTPRQGVSKYVR-----------------------FSHRANT 520
Query: 491 VVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYD 550
+ GL+ E+E DR D++LP Q E I RVA+A+ P+ILV+L GG +D++FA+ +
Sbjct: 521 TIYFGGLNLNIEREGNDREDILLPKNQTEEIIRVAKASPNPIILVILSGGGIDVSFAQNN 580
Query: 551 RNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATS 609
IG+ILWAGYPG G A+A+VIFG HNP GRLP+TW+ YI ++PMT M +RP A
Sbjct: 581 PKIGAILWAGYPGGEGGNAIADVIFGKHNPSGRLPLTWFKNKYIYQLPMTSMDLRPVAKH 640
Query: 610 GNPGRTYRFYEGKEV-FPFGCGLSYSKYSYKFKAVSQNKLY-LNQSSSTKMVENQDVVHY 667
G PGRTY+FY+G +V +PFG GLSY+K+ Y+ + + K+ V
Sbjct: 641 GYPGRTYKFYDGPDVLYPFGYGLSYTKFLYEMGTNGTALIVPVAGGHCKKLSYKSGVSTA 700
Query: 668 KSVPEL---GTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQS 724
+ P + G ET F + V N G+ G HPV++F KP + P+KQ+V F+S
Sbjct: 701 PACPAINVNGHVCTETVSF--NVSVTNGGDTGGSHPVIVFSKPPAEVDDAPMKQVVAFKS 758
Query: 725 VILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
V + A + FEL+ C++ + V+ G ++V + + +S
Sbjct: 759 VFVPAWSTVSVSFELNVCKAFGIVEKTAYTVVPSGVSTILVENVDSSVSF 808
>gi|301110280|ref|XP_002904220.1| beta-D-xylosidase, putative [Phytophthora infestans T30-4]
gi|262096346|gb|EEY54398.1| beta-D-xylosidase, putative [Phytophthora infestans T30-4]
Length = 709
Score = 585 bits (1507), Expect = e-164, Method: Compositional matrix adjust.
Identities = 316/740 (42%), Positives = 436/740 (58%), Gaps = 64/740 (8%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
C TT R+ ++R+ LD+ + LVN A P + IP+YEWW+EALHGVA + G+
Sbjct: 1 MCATT----PRSLHCLTRIPLDQAVGLLVNKAAPAPSVNIPSYEWWNEALHGVA-LSPGV 55
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIF 164
F G+I ATSFPQV+ TAASF+ L+Y+I I EARA +NA A G+TFW PN+NIF
Sbjct: 56 TFKGSITAATSFPQVLSTAASFNRSLFYQIADVISTEARAFHNAKDA-GLTFWTPNVNIF 114
Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGK--LQASACCKHFTAYDL 222
RDPRWGRGQETPGEDP +TG+YAV++VRG+QG+ G +++ L+ S+CCKHF+AY
Sbjct: 115 RDPRWGRGQETPGEDPYLTGEYAVAFVRGLQGEGMEGREVENSKFLKISSCCKHFSAYSQ 174
Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
+ R++ +A VT QD ADTY P FE CVK+G S IMC+YN VNGIPSCAD+ LL+
Sbjct: 175 E----VPRHRNNAMVTKQDQADTYFPAFEDCVKRGHVSSIMCSYNAVNGIPSCADKGLLT 230
Query: 283 KTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAA 342
R QW F GYI SDC+AV+ + D Y +SPE L AGMD+NCG FL++H A
Sbjct: 231 DLVRGQWKFDGYIASDCEAVADVIDHHHYTQSPEQTCATTLDAGMDLNCGEFLRQHLPKA 290
Query: 343 VKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQD 402
++Q + I AL N F V MRLG+F ++PF I D V + H+ LAL+AA+
Sbjct: 291 LEQGIVTTEMIHNALKNQFRVLMRLGMFE---KVEPFANITKDSVDTTMHRQLALEAARQ 347
Query: 403 GIVLLKNSHGLLPLPK---SKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV 459
IVLLKN LPL ++ SLALIGP+ N++ LLGNY G +TPL+ + +V
Sbjct: 348 SIVLLKNDGNTLPLATKDFTRDRSLALIGPHFNASAALLGNYFGIPSHIVTPLEGISQFV 407
Query: 460 ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQE 519
N + GC D A+ +AK AD +++ +GLDQ+QE+EE+DR + LP Q
Sbjct: 408 PNVAHSLGCKVSGEVLPDFDDAIAVAKKADRLIVFVGLDQSQEREEIDRYHIGLPAFQST 467
Query: 520 LITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHN 579
L+ RV E A P++ V++ GG VD++ K +G+I++ GY G+AG ALA+V+FG +N
Sbjct: 468 LLKRVLEVASHPIVFVVISGGCVDLSAYKNHPKVGAIVFGGYLGQAGGQALADVLFGKYN 527
Query: 580 PGGRLPMTWYPQDYIK-VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSY 638
P G+LP T+Y +Y+ + + DM MRP +GN GRTYRF+ G V+ FG GLSY+ +
Sbjct: 528 PSGKLPQTFYDSEYVNAMSIYDMHMRPTPVTGNSGRTYRFFTGVPVYEFGFGLSYTTFHK 587
Query: 639 KFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGK 698
C I V N G ++G+
Sbjct: 588 N--------------------------------------CHACVATFNITVTNAGAISGE 609
Query: 699 HPVLLFVKPARRGN-GRPIKQLVGFQSVILNA---KEKAEIVFELSPCESLSRAREDGLM 754
+L +V+P G GRP+K LV F+ L A + A+I E ++ + A E G
Sbjct: 610 DVILTYVEPPLAGEGGRPLKSLVAFERTPLIAAGQRATAKICLE---AKAFALANEAGNW 666
Query: 755 VIEEGTHFLVVGDEEYPISI 774
V+E G + V ++ + I
Sbjct: 667 VVEPGNWTIHVDTLQHKVDI 686
>gi|242093144|ref|XP_002437062.1| hypothetical protein SORBIDRAFT_10g020500 [Sorghum bicolor]
gi|241915285|gb|EER88429.1| hypothetical protein SORBIDRAFT_10g020500 [Sorghum bicolor]
Length = 809
Score = 584 bits (1506), Expect = e-164, Method: Compositional matrix adjust.
Identities = 317/772 (41%), Positives = 445/772 (57%), Gaps = 67/772 (8%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
FP+C +LP + R RDL+ +T++EK+ L + + PR+G+P Y+WWSEALHGV+ G
Sbjct: 61 FPYCDASLPYADRVRDLIGWMTVEEKVGNLGDVSHGAPRVGLPPYKWWSEALHGVSSTGP 120
Query: 103 GIFFN------------GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ 150
+ F+ T+ AT F VI +AASF+ LW IGQA+ EARA+YN G+
Sbjct: 121 TMLFDDLHSKPGNHSGRATVNNATVFANVINSAASFNETLWKSIGQAVSTEARAMYNLGK 180
Query: 151 AIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQ---GDTFNGGKLKGK 207
G+T+W+PNIN+ RDPRWGR ETPGEDP V G+YAV++VRG+Q G G +
Sbjct: 181 G-GLTYWSPNINVVRDPRWGRALETPGEDPFVAGRYAVNFVRGMQDIPGHDGGGDDPSTR 239
Query: 208 -LQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAY 266
++ SACCKH+ AYD+D+W TR+ FDARV+ +D+A+T+ PFE CV+ G ASG+MC+Y
Sbjct: 240 PIKTSACCKHYAAYDVDDWHNHTRFTFDARVSERDMAETFLRPFEMCVRDGDASGVMCSY 299
Query: 267 NRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAK-SPEDAVVDVLKA 325
NRVNGIP+CAD LLS T R W HGYI SDCDAV ++ D + + ++ ++A
Sbjct: 300 NRVNGIPACADARLLSGTIRGDWQLHGYIVSDCDAVRVMTDNATWLHFTGAESSAASIRA 359
Query: 326 GMDVNCG------------SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGN 373
G+D++C FL ++ KAAV Q K+ ES+ID AL N + MRLG F+
Sbjct: 360 GLDLDCAESWIEEKGRPLRDFLSEYGKAAVAQGKMRESDIDSALRNQYMTLMRLGYFDNI 419
Query: 374 PTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANS 433
P + + +C+ H+ LA A+ G+VLLKN GLLPL K +++A+ GP+A +
Sbjct: 420 PR---YASLNETDICTDEHKSLAHDGARQGMVLLKNDDGLLPLDPEKILAVAVHGPHARA 476
Query: 434 AKTLL-GNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVV 492
+ ++ G+Y GP CR +TP Q I K V I+ A+ +
Sbjct: 477 PEKIMDGDYTGPPCRYVTPRQG-----------------------ISKDVKISHRANTTI 513
Query: 493 LMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRN 552
+ G++ E+E DR DL+LP Q E I A+A+ P+ILV+L GG +DI+FA
Sbjct: 514 YLGGINLHIEREGNDREDLLLPKNQTEEILHFAKASPNPIILVILSGGGIDISFAHKHPK 573
Query: 553 IGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK-VPMTDMKMRPQATSGN 611
IG+ILWAGYPG G A+A+VIFG +NPGGRLP+TW+ YI+ +PMT M+ RP G
Sbjct: 574 IGAILWAGYPGGEGGNAIADVIFGRYNPGGRLPLTWFKNKYIQQIPMTSMEFRPVPEKGY 633
Query: 612 PGRTYRFYEGKEV-FPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHY--- 667
PGRTY+FY+G EV +PFG GLSY+K+ Y+ K + + V
Sbjct: 634 PGRTYKFYDGPEVLYPFGYGLSYTKFLYETSTNGTAVTLPATGGHCKGLSYKPSVATTPA 693
Query: 668 -KSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVI 726
++V G ET F I V N G G H VL++ P PIKQ+ F+ V
Sbjct: 694 CQAVDVAGHACTETVSF--NISVTNAGGRGGAHVVLVYTAPPPEVAQAPIKQVAAFRRVF 751
Query: 727 LNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVV--GDEEYPISIFV 776
+ A+ A + F L+ C++ V+ G ++V GD +S V
Sbjct: 752 VPARSTATVPFTLNVCKAFGIVERTAYTVVPSGVSKVLVQNGDSSSSVSFPV 803
>gi|125535311|gb|EAY81859.1| hypothetical protein OsI_37025 [Oryza sativa Indica Group]
Length = 816
Score = 583 bits (1503), Expect = e-163, Method: Compositional matrix adjust.
Identities = 318/771 (41%), Positives = 446/771 (57%), Gaps = 68/771 (8%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
F +C +LP + R RDL+ R+T++EK+ L + R+G+PAY WWSEALHG++ G
Sbjct: 67 FRYCDASLPYADRVRDLIGRMTVEEKVGALGDWTDGAARIGLPAYRWWSEALHGLSSTGP 126
Query: 103 GIFFN-----------GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA 151
F+ + AT F VI +AASF+ LW IGQA+ EARA+YN G+
Sbjct: 127 TTKFDDLATPHLHSGVSAVYNATVFANVINSAASFNETLWKSIGQAVSTEARAMYNMGKG 186
Query: 152 IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-----DTFNGGKLKG 206
G+T+W+PNIN+ RDPRWGR ETPGEDP V G+YAV++VRG+Q GG
Sbjct: 187 -GLTYWSPNINVVRDPRWGRALETPGEDPYVVGRYAVNFVRGMQDIPGHEAVAAGGDPNT 245
Query: 207 K-LQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCA 265
+ L+ SACCKH+ AYDLD+W TR++FDARV +D+ +T+Q PFE CV+ G S +MC+
Sbjct: 246 RPLKTSACCKHYAAYDLDDWHNHTRFEFDARVDERDMVETFQRPFEMCVRDGDVSSVMCS 305
Query: 266 YNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYD-AEGYAKSPEDAVVDVLK 324
YNRVNGIP+CAD LLS+T RR WG HGYI SDCDAV ++ D A + +A LK
Sbjct: 306 YNRVNGIPACADARLLSQTIRRDWGLHGYIVSDCDAVRVMTDNATWLGYTGAEASAAALK 365
Query: 325 AGMDVNCG-------------SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFN 371
AG+D++CG FL + AV + K+ ES+ID AL N + MRLG F+
Sbjct: 366 AGLDLDCGESWKNDTEGHPLMDFLTTYGMEAVNKGKMRESDIDNALTNQYMTLMRLGYFD 425
Query: 372 GNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNA 431
+ + +G +C+ H+ LAL A+ GIVLLKN + LLPL +K + + GP+
Sbjct: 426 ---DITQYSSLGRQDICTDQHKTLALDGARQGIVLLKNDNKLLPLDANKVGFVNVRGPHV 482
Query: 432 NSAKTLL-GNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADH 490
+ + ++ G+Y GP CR +TP Q + YV + A+
Sbjct: 483 QAPEKIMDGDYTGPPCRYVTPRQGVSKYVR-----------------------FSHRANT 519
Query: 491 VVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYD 550
+ GL+ E+E DR D++LP Q E I RVA+A+ P+ILV+L GG +D++FA+ +
Sbjct: 520 TIYFGGLNLNIEREGNDREDILLPKNQTEEIIRVAKASPNPIILVILSGGGIDVSFAQNN 579
Query: 551 RNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATS 609
IG+ILWAGYPG G A+A+VIFG HNP GRLP+TW+ YI ++PMT M +RP A
Sbjct: 580 PKIGAILWAGYPGGEGGNAIADVIFGKHNPSGRLPLTWFKNKYIYQLPMTSMDLRPVAKH 639
Query: 610 GNPGRTYRFYEGKEV-FPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYK 668
G PGRTY+FY G +V +PFG GLSY+K+ Y+ K + + V
Sbjct: 640 GYPGRTYKFYNGPDVLYPFGYGLSYTKFLYEMGTNGTALTVPVAGGHCKKLSYKSGVSSA 699
Query: 669 --SVPEL---GTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQ 723
+ P + G ET F V+ V N G+ G HPV++F KP + PIKQ+V F+
Sbjct: 700 APACPAINVNGHACTETVSFNVS--VTNGGDTGGSHPVIVFSKPPAEVDDAPIKQVVAFR 757
Query: 724 SVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
SV + A + FEL+ C++ + V+ G ++V + + +S
Sbjct: 758 SVFVPAWSTVSVSFELNVCKAFGIVEKTAYTVVPSGVSTVLVENVDSSVSF 808
>gi|195614824|gb|ACG29242.1| auxin-induced beta-glucosidase [Zea mays]
gi|413920229|gb|AFW60161.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 655
Score = 583 bits (1503), Expect = e-163, Method: Compositional matrix adjust.
Identities = 300/652 (46%), Positives = 416/652 (63%), Gaps = 22/652 (3%)
Query: 145 LYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG--DTFNGG 202
+YN G+A G+TFW+PN+NIFRDPRWGRGQETPGEDP V+ +YA +YVRG+Q NGG
Sbjct: 1 MYNGGRA-GLTFWSPNVNIFRDPRWGRGQETPGEDPAVSARYAAAYVRGLQQPYAAPNGG 59
Query: 203 KLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGI 262
+ +L+ +ACCKHFTAYDLD W GT R+ F+A V QDL DT+ PF +CV+ GRA+ +
Sbjct: 60 H-RNRLKLAACCKHFTAYDLDKWGGTDRFHFNAVVAAQDLEDTFNVPFRACVEDGRAASV 118
Query: 263 MCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDV 322
MC+YN+VNG+P+CAD L T R +WG GYI SDCD+V + + + Y ++PEDA
Sbjct: 119 MCSYNQVNGVPTCADAAFLRGTIRGRWGLDGYIVSDCDSVDVFFRDQHYTRTPEDAAAAT 178
Query: 323 LKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKI 382
L+AG+D++CG FL + +AV K+ ++++D AL N +V+MRLG+F+G+P PFG++
Sbjct: 179 LRAGLDLDCGPFLALYAGSAVAAGKVADADVDAALLNTVTVQMRLGMFDGDPAAGPFGRL 238
Query: 383 GADVVCSPAHQVLALQAAQDGIVLLKNSHG------LLPLPKSKSVSLALIGPNANSAKT 436
G VC+ HQ LAL AA+ G+VLLKN G +LPL + +A++GP+A++
Sbjct: 239 GPADVCTREHQDLALDAARQGVVLLKNRRGARHNRDVLPLRPAAHRVVAVVGPHADATVA 298
Query: 437 LLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVAC-SSASIDKAVDIAKGADHVVLMM 495
++GNYAG CR TPLQ + Y + GC VAC + I AV+ A+ AD V++
Sbjct: 299 MIGNYAGKPCRYTTPLQGVAAYAARVAHQAGCTDVACRGNQPIAAAVEAARQADATVVVA 358
Query: 496 GLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGS 555
GLDQ E E LDR L+LPGRQ ELI+ VA+A+K PVILVL+ GGP+DI FA+ D I
Sbjct: 359 GLDQRVEAEGLDRTTLLLPGRQAELISAVAKASKGPVILVLMSGGPIDIAFAQNDPRIDG 418
Query: 556 ILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGR 614
ILW GYPG+AG A+A+VIFG HNPG +LP+TWY QDY+ KVPMT+M MR G PGR
Sbjct: 419 ILWVGYPGQAGGQAIADVIFGHHNPGAKLPVTWYHQDYLQKVPMTNMAMRANPARGYPGR 478
Query: 615 TYRFYEGKEVFPFGCGLSYSKYSYKF-KAVSQNKLYLNQSSSTKMVENQDVVHYKSVP-- 671
TYRFY G ++PFG GLSY+++++ A +Q + L+ S + + + P
Sbjct: 479 TYRFYTGPTIYPFGHGLSYTQFTHTLAHAPTQLTVRLSGSGHSAASAASLLNATLARPVR 538
Query: 672 --ELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKP-----ARRGNGRPIKQLVGFQS 724
+ CE V + V N G+ G H VL++ A G P +QLV F+
Sbjct: 539 AVRVAHARCEGLTVPVHVDVSNVGDRDGAHAVLVYHAAPSPSHAAPGADAPARQLVAFEK 598
Query: 725 VILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
V + A A + + C+ LS A +G+ + G H L++G+ + +S+ V
Sbjct: 599 VHVPAGGVARVEMRIGVCDRLSVADRNGVRRVPVGEHRLMIGELTHSVSLGV 650
>gi|340370204|ref|XP_003383636.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
queenslandica]
Length = 755
Score = 571 bits (1472), Expect = e-160, Method: Compositional matrix adjust.
Identities = 306/738 (41%), Positives = 440/738 (59%), Gaps = 44/738 (5%)
Query: 38 PSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGV 97
P + +C + I++R +DL+SRLT+ EK+SQ +A AI RL IPAY+WWSE LHG+
Sbjct: 51 PPFNAYLYCNYSASITERVKDLLSRLTVLEKMSQTATNASAIERLDIPAYDWWSECLHGL 110
Query: 98 AGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFW 157
A G+FF + ATSFPQVI A+F+ L +GQ I EARA N GQ+ G+TF+
Sbjct: 111 AQ-SPGVFFENDLTSATSFPQVIGLGATFNMSLVLAMGQVISTEARAFANNGQS-GLTFF 168
Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
APNINI+RDPRWGRGQETPGEDP +T +YA ++V+G+Q G + + L+A A CKH+
Sbjct: 169 APNINIYRDPRWGRGQETPGEDPYLTSQYAANFVKGIQ----EGSEDRRYLKAIATCKHY 224
Query: 218 TAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCAD 277
AY+L+ + R F+A V+ QDL +TY P F++CV++G+ IMC+YN +NG+P+CA+
Sbjct: 225 AAYNLERYLDVRRVNFNAIVSDQDLEETYLPAFKACVQEGQVGSIMCSYNAINGVPNCAN 284
Query: 278 RNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQK 337
+ +K AR WGF GYI SDC A+ I Y V D LK G D+NCG F +K
Sbjct: 285 DFINNKIARDTWGFEGYIVSDCGAILDIQYKHNYTSDTNITVADALKGGCDLNCGHFYEK 344
Query: 338 HTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLAL 397
+ + A + E +ID++L LF+ RMRLG+F+ P +QPF + V +P Q LAL
Sbjct: 345 YMEDAFDNSTITEEDIDKSLTRLFTSRMRLGMFDP-PEIQPFRQYSVKDVNTPEAQDLAL 403
Query: 398 QAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQN 457
AA++GIVLL+N +LPL K ++A IGPNA++ + GNY G + I+PLQ N
Sbjct: 404 NAAREGIVLLQNKGSVLPLDIVKHSNIAAIGPNADATHIMQGNYHGIAPYLISPLQGFSN 463
Query: 458 YVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQ 517
N Y GC + AV +G D V+ ++GL+ TQE E DR + LPG Q
Sbjct: 464 LGINATYQIGCPVACNDTEGFPDAVKAVQGVDAVIAVIGLNNTQEGESHDRTSIALPGHQ 523
Query: 518 QELITRVAEAAKK--PVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIF 575
++L+ + + A K P+I+V++ GG VD+T K + +ILWAGYPG++G A+AEVI+
Sbjct: 524 EDLLLELKKNAAKGTPLIVVVMSGGSVDLTGVKDIAD--AILWAGYPGQSGGQAIAEVIY 581
Query: 576 GDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYS 634
G NP GRLP+T+YP YI ++P T+M MR PGR+Y+FY G VFPFG GLSY+
Sbjct: 582 GKVNPSGRLPVTFYPASYINEIPYTNMSMRVP-----PGRSYKFYTGTPVFPFGFGLSYT 636
Query: 635 KYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGE 694
+ K+K S K Y +++ + +VV+Y++ V N G
Sbjct: 637 TFEIKWKDTSTAKDYYLKTT------HDEVVNYEAT------------------VTNSGS 672
Query: 695 MAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLM 754
G VL F+ + G P+K+L F+ + L E ++ F P + + G+
Sbjct: 673 RPGSVSVLAFITSSVP--GAPMKELFAFKKIYLEPTESVDVSFVAEP-KVFTTVDIYGIR 729
Query: 755 VIEEGTHFLVVGDEEYPI 772
I G + +++GD+++ I
Sbjct: 730 KIRPGAYKIIIGDDDHHI 747
>gi|326513064|dbj|BAK03439.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 694
Score = 570 bits (1470), Expect = e-160, Method: Compositional matrix adjust.
Identities = 299/622 (48%), Positives = 396/622 (63%), Gaps = 14/622 (2%)
Query: 159 PNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFT 218
P + P + ETPGEDPL+ KYAV YV G+Q D GG G L+ +ACCKH+T
Sbjct: 83 PALGRLGIPAYEWWSETPGEDPLLASKYAVGYVTGLQ-DAGAGGVTDGALKVAACCKHYT 141
Query: 219 AYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADR 278
AYD+DNWKG RY FDA+V+ QDL DT+QPPF+SCV G + +MC+YN+VNG P+CAD+
Sbjct: 142 AYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVLDGNVASVMCSYNKVNGKPTCADK 201
Query: 279 NLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKH 338
+LL R W +GYI SDCD+V ++Y + Y K+PE+A +K+G+D+NCG+FL +H
Sbjct: 202 DLLEGVIRGDWKLNGYIVSDCDSVDVLYTQQHYTKTPEEAAAITIKSGLDLNCGNFLAQH 261
Query: 339 TKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQ 398
T AAV+ +L E ++DRA+ N F + MRLG F+G+P FG +G VC+ +++ LA +
Sbjct: 262 TVAAVQAGELSEEDVDRAITNNFIMLMRLGFFDGDPRQLAFGSLGPKDVCTSSNRELARE 321
Query: 399 AAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNY 458
A+ GIVLLKNS G LPL S+A+IGPNAN++ T++GNY G C+ TPLQ L
Sbjct: 322 TARQGIVLLKNS-GALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPCKYTTPLQGLGAK 380
Query: 459 VENTVYYPGCDTVACSSASID--KAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGR 516
V NTVY PGC V CS S+ AV A AD VL++G DQ+ E+E LDR L+LPG+
Sbjct: 381 V-NTVYQPGCTNVGCSGNSLQLSTAVAAAASADVTVLVVGADQSIERESLDRTSLLLPGQ 439
Query: 517 QQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFG 576
Q +L++ VA A+ PVILV++ GGP DI+FAK I +ILW GYPGEAG ALA+++FG
Sbjct: 440 QTQLVSAVANASSGPVILVVMSGGPFDISFAKASDKIAAILWVGYPGEAGGAALADILFG 499
Query: 577 DHNPGGRLPMTWYPQDYIK-VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSK 635
HNP GRLP+TWYP Y V MTDM+MRP ++G PGRTYRFY G VF FG GLSY+K
Sbjct: 500 SHNPSGRLPVTWYPASYADTVTMTDMRMRPDTSTGYPGRTYRFYTGDTVFAFGDGLSYTK 559
Query: 636 YSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVP-ELGTEFCETRKFLVTIGVKNHGE 694
S+ VS Y+ S ++ E+ + E + C+ F V + V+N GE
Sbjct: 560 MSHSL--VSAPPSYV----SMRLAEDHPCRAEECASVEAAGDHCDDLAFDVKLQVRNAGE 613
Query: 695 MAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLM 754
+AG H VLLF P N P K L+GF+ V L E + F + C LS E G
Sbjct: 614 VAGAHSVLLFSSPPPAHNA-PAKHLLGFEKVSLAPGEAGTVAFRVDVCRDLSVVDELGGR 672
Query: 755 VIEEGTHFLVVGDEEYPISIFV 776
+ G H L VGD ++ + + V
Sbjct: 673 KVALGGHTLHVGDLKHTVELRV 694
Score = 85.5 bits (210), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 39/68 (57%), Positives = 46/68 (67%)
Query: 25 DSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGI 84
++ P F+CD SN + + FC S RARDLVSRLTL EK+ LVN PA+ RLGI
Sbjct: 31 EAQAPVFACDASNATLAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPALGRLGI 90
Query: 85 PAYEWWSE 92
PAYEWWSE
Sbjct: 91 PAYEWWSE 98
>gi|340370206|ref|XP_003383637.1| PREDICTED: probable beta-D-xylosidase 5-like [Amphimedon
queenslandica]
Length = 728
Score = 570 bits (1468), Expect = e-159, Method: Compositional matrix adjust.
Identities = 306/734 (41%), Positives = 442/734 (60%), Gaps = 53/734 (7%)
Query: 42 TFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVG 101
T+ +C T I +R DL+SR+T+ +KI QL+ SAPAIP L IPAY+WWSE LHGVAG
Sbjct: 28 TYKYCDYTQSIPERVNDLLSRMTILDKIPQLITSAPAIPSLDIPAYQWWSEGLHGVAG-S 86
Query: 102 KGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNI 161
G+ F G ATSFPQVI A+F+ L + Q I EARA N GQA G+T++APNI
Sbjct: 87 PGVHFGGNFPNATSFPQVIGLGATFNMSLVLAMAQVISTEARAFANGGQA-GLTYFAPNI 145
Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
NIFRDPRWGRGQETPGEDP ++ +YA ++V+G+Q G L+ A CKH+ AYD
Sbjct: 146 NIFRDPRWGRGQETPGEDPYLSSQYAANFVKGMQ----EGADDTRYLKTIATCKHYAAYD 201
Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
L+N+ +R+ F+A V+ QD +TY P F SCV++G+ IMC+YN VNG+PSCA+ +
Sbjct: 202 LENYLNLSRHTFNAIVSDQDFEETYFPAFRSCVEEGKVGSIMCSYNAVNGVPSCANDFIN 261
Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA 341
++ AR +WGF GY+ SDC A+S I ++ Y + +D V L+ G D+NCG F H +A
Sbjct: 262 NEVARGKWGFEGYVVSDCGAISDIINSHKYTSNTDDTVAAGLRGGCDLNCGHFYSDHAQA 321
Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQ 401
A + + +IDRA+ LF+ RMRLG+F+ P+MQPF D V + H+ LAL A++
Sbjct: 322 AYDNGAITDDDIDRAMTRLFTYRMRLGMFD-PPSMQPFRDYTNDKVDTKQHEALALDASR 380
Query: 402 DGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN 461
+ IVLL+N+ +LPL + +AL+GP+ + + GNY G + I+P+Q LQ+ +
Sbjct: 381 ESIVLLQNNKDILPLSLTTHRKIALVGPHGQAQGAMQGNYKGTAPYLISPMQGLQDLGLS 440
Query: 462 TVYYPGCDTVACSS----ASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQ 517
+ GC VAC + + + K V+ + ++ ++GLD++QE E DR L LPG+Q
Sbjct: 441 VTFAAGCTQVACPTIAGFSEVTKLVE-EHSIEAIIAVIGLDESQESEGHDRTSLTLPGQQ 499
Query: 518 QELITRVAEAAKK--PVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIF 575
+L+ + + A P I+V++ GGPVD++ K + +ILWAGYPG++G A+AEVI+
Sbjct: 500 VQLLEDIKKKAVPGIPFIVVVMSGGPVDLSGVKDIAD--AILWAGYPGQSGGQAIAEVIY 557
Query: 576 GDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYS 634
G NP GRLP+T+YP YI ++P T+M MR PGR+Y+FY G VFPFG GLSY+
Sbjct: 558 GKVNPSGRLPVTFYPASYINEIPYTNMSMRVP-----PGRSYKFYTGTPVFPFGFGLSYT 612
Query: 635 KYSYKFKAVSQNKLYLNQSSSTKMVENQDV-VHYKSVPELGTEFCETRKFLVTIGVKNHG 693
+ K+K N + T + DV V+Y+ V V N G
Sbjct: 613 TFEMKWK---------NPPNVTHLKTTHDVDVNYEVV------------------VTNAG 645
Query: 694 EMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGL 753
+ +G VL ++ G P+K+L GFQ + L ++ + F P + + + G
Sbjct: 646 KRSGSVSVLAYITSTVP--GAPMKELFGFQKIYLKPEQSMTLSFVAEP-KVFTTVDKHGE 702
Query: 754 MVIEEGTHFLVVGD 767
I GT+ + +GD
Sbjct: 703 RKIRPGTYKITIGD 716
>gi|326488213|dbj|BAJ89945.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 525
Score = 551 bits (1419), Expect = e-154, Method: Compositional matrix adjust.
Identities = 274/499 (54%), Positives = 354/499 (70%), Gaps = 6/499 (1%)
Query: 25 DSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGI 84
++ P F+CD SN + + FC S RARDLVSRLTL EK+ LVN PA+ RLGI
Sbjct: 31 EAQAPVFACDASNATLAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPALGRLGI 90
Query: 85 PAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA 144
PAYEWWSEALHGV+ VG G F+ + GATSFPQ ILTAASF++ L+ IG+ + EARA
Sbjct: 91 PAYEWWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVSTEARA 150
Query: 145 LYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKL 204
++N G A G+TFW+PNINIFRDPRWGRGQETPGEDPL+T KYAV YV G+Q D GG
Sbjct: 151 MHNVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLTSKYAVGYVTGLQ-DAGAGGVT 208
Query: 205 KGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMC 264
G L+ +ACCKH+TAYD+DNWKG RY FDA+V+ QDL DT+QPPF+SCV G + +MC
Sbjct: 209 DGALKVAACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVLDGNVASVMC 268
Query: 265 AYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLK 324
+YN+VNG P+CAD++LL R W +GYI SDCD+V ++Y + Y K+PE+A +K
Sbjct: 269 SYNKVNGKPTCADKDLLEGVIRGDWKLNGYIVSDCDSVDVLYTQQHYTKTPEEAAAITIK 328
Query: 325 AGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGA 384
+G+D+NCG+FL +HT AAV+ +L E ++DRA+ N F + MRLG F+G+P FG +G
Sbjct: 329 SGLDLNCGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLGFFDGDPRQLAFGSLGP 388
Query: 385 DVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGP 444
VC+ +++ LA + A+ GIVLLKNS G LPL S+A+IGPNAN++ T++GNY G
Sbjct: 389 KDVCTSSNRELARETARQGIVLLKNS-GALPLSAKSIKSMAVIGPNANASFTMIGNYEGT 447
Query: 445 SCRSITPLQALQNYVENTVYYPGCDTVACSSASID--KAVDIAKGADHVVLMMGLDQTQE 502
C+ TPLQ L V NTVY PGC V CS S+ AV A AD VL++G DQ+ E
Sbjct: 448 PCKYTTPLQGLGAKV-NTVYQPGCTNVGCSGNSLQLSTAVAAAASADVTVLVVGADQSIE 506
Query: 503 KEELDRVDLVLPGRQQELI 521
+E LDR L+LPG+Q +L+
Sbjct: 507 RESLDRTSLLLPGQQTQLV 525
>gi|340370208|ref|XP_003383638.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
queenslandica]
Length = 732
Score = 541 bits (1393), Expect = e-151, Method: Compositional matrix adjust.
Identities = 295/744 (39%), Positives = 426/744 (57%), Gaps = 65/744 (8%)
Query: 41 ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
++F +C +LPIS R +DL+SR+TL EKI+QL N+A +I RL IPAY+WWSE LHGVA
Sbjct: 30 QSFSYCNYSLPISDRVKDLLSRMTLAEKITQLGNTAGSIDRLDIPAYQWWSEGLHGVAD- 88
Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPN 160
G+ FNG ATSFPQVI TA+SF+ L++ I + EARA N G + ++ +
Sbjct: 89 SPGVHFNGMFHNATSFPQVITTASSFNKTLYHEIAAVMSTEARAFANQG----IVYFKQH 144
Query: 161 INIF--------RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASA 212
+ RDPRWGR QETPGEDP + +YA+ +V G QGD+ L+
Sbjct: 145 QQLLSNYLLFYCRDPRWGRAQETPGEDPYLNSQYAIQFVTGAQGDS-------KYLKVVT 197
Query: 213 CCKHFTAYDLDNW-KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
CKHF YDL+++ G TR+ F+A++T QD +TY P F++CV++ + IMC+YN VNG
Sbjct: 198 TCKHFAGYDLEDYVDGETRHSFNAKITPQDFEETYYPAFKACVEEANVASIMCSYNEVNG 257
Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC 331
+PSCAD + +K AR WGF G+I SDC A+ I + Y + +D V LK G D+NC
Sbjct: 258 VPSCADGQINNKLARDTWGFDGFIASDCGAIDDIQNKHHYTNNTDDTVAAALKGGCDLNC 317
Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
GS+ Q H ++A + EI+ AL LF+ RM+LG+F+ P +QP+ I DVV S
Sbjct: 318 GSYYQSHAQSAFLNGTITIGEINLALTRLFTARMKLGMFDP-PELQPYNAISPDVVNSLE 376
Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITP 451
HQ LAL AA++ IVLL+N++ +LPL K ++A++GP+A + + GNY G + I+P
Sbjct: 377 HQALALNAARESIVLLQNNNDVLPLNFEKHSTIAVVGPHAMATDVMQGNYNGVAPYLISP 436
Query: 452 LQALQNY-VENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVD 510
++ +N +++ + GCD + A DIA AD V+ ++GLDQ+ E E DR D
Sbjct: 437 VEGFENLGIDSVLTASGCDVNCEVTDGFQDAFDIAVKADAVIAVLGLDQSHESEGHDRED 496
Query: 511 LVLPGRQQELI-----TRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEA 565
L LP Q + + T A P+I+V++ G VD+T K ++ +ILWAGYPG++
Sbjct: 497 LFLPNLQDKFVQDLKNTLKAAGTNAPLIVVVMSGSSVDLTVTK--KHADAILWAGYPGQS 554
Query: 566 GAVALAEVIFGDHNPGGRLPMTWYPQDYIK-VPMTDMKMRPQATSGNPGRTYRFYEGKEV 624
G A+AE+I+G NP GRLP+T+YP YI V M MR PGRTY+FY
Sbjct: 555 GGQAIAEIIYGKVNPSGRLPVTFYPGSYIDLVAFRHMSMREY-----PGRTYKFYNDTPD 609
Query: 625 FPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFL 684
F FG GLSY+ + YL S M + V S P + +
Sbjct: 610 FSFGDGLSYTTF------------YLEWSKPVNMSGVRSV----SYPTV----------V 643
Query: 685 VTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCES 744
+ V N G+M G VL ++ + +G P K+L GF+ V LN + + F + ++
Sbjct: 644 YNVTVTNTGKMPGAISVLAYI--SYNNSGAPKKKLFGFEKVFLNPLQSVSVTFP-ADSKA 700
Query: 745 LSRAREDGLMVIEEGTHFLVVGDE 768
S + G + G + + +GD+
Sbjct: 701 FSTVDKSGKRSVNPGDYHVTIGDQ 724
>gi|320170454|gb|EFW47353.1| beta-xylosidase [Capsaspora owczarzaki ATCC 30864]
Length = 779
Score = 537 bits (1383), Expect = e-149, Method: Compositional matrix adjust.
Identities = 313/757 (41%), Positives = 426/757 (56%), Gaps = 50/757 (6%)
Query: 10 FPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKI 69
F C+ V T +C+ + + PFC L QRA DLV RLTL EKI
Sbjct: 4 FGWTCITIAVAALVVAPTARALTCE--DAALRNLPFCNPNLAWEQRADDLVGRLTLQEKI 61
Query: 70 SQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVI--------L 121
SQ +AP + RLG+ AYEWWSEALHGVA G+ F G +T FPQ+I
Sbjct: 62 SQFGTTAPGVARLGVNAYEWWSEALHGVAE-SPGVNFTGNTPVSTCFPQIIGNNCSSLSR 120
Query: 122 TAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPL 181
A+F+ + Q I EARA NAG A G+T++ PNINIFRDPRWGRGQETPGEDP
Sbjct: 121 VGATFNLDSVAAMAQVISTEARAFANAGHA-GLTYFTPNINIFRDPRWGRGQETPGEDPY 179
Query: 182 VTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQD 241
+T +Y + V+ +Q G+ L+ A CKH+TAYD+++W G R+ F+A V+ QD
Sbjct: 180 LTSRYVETLVQNLQN-----GEDARYLKVVATCKHYTAYDMEDWGGIDRFHFNAVVSDQD 234
Query: 242 LADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDA 301
L +T+ PPFE+CV+ G+ + +MC+YN VNGIPSCAD + ++ AR QWGF GYI SDC A
Sbjct: 235 LVETFMPPFEACVRVGKGASLMCSYNAVNGIPSCADDFINNEIAREQWGFDGYIVSDCGA 294
Query: 302 VSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLF 361
+ I Y + + ++ G D++CG F Q H A+ L E+++D +L LF
Sbjct: 295 IDCIQYTHNYTNTTQATCAAGIQGGCDLDCGDFYQSHLMDAIGNATLHEADLDFSLRRLF 354
Query: 362 SVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKS 421
R+RLG F+ ++QP+ +I + S HQ LALQ A++ IVLL N + LP +
Sbjct: 355 GHRIRLGEFDA-ASIQPYRQIPVSAINSQEHQELALQIARESIVLLGNDNNTLPFSLATV 413
Query: 422 VSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNY--VENTVYYPGCDTVACSSASID 479
LA+IGPNA+ A+TLLGNY G + ITPL+ Q + + GCD + ++
Sbjct: 414 RKLAIIGPNADDAETLLGNYYGDAPYLITPLKGFQQLDPTLSITFVKGCDVNSTDTSGFV 473
Query: 480 KAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCG 539
A AK AD ++++GL+QT E E LDR LVLPG Q ELI + AA+ PVILV++ G
Sbjct: 474 AAAAAAKAADATIVVVGLNQTVESENLDRTTLVLPGVQAELILALTAAARGPVILVVMSG 533
Query: 540 GPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPM 598
P+D++ + + + LW GYPG+AG ALAE +FG +P GRLP T YP DY+ ++PM
Sbjct: 534 SPIDLSNVIHP--VRAALWIGYPGQAGGRALAEAVFGVFSPAGRLPFTVYPADYVNQLPM 591
Query: 599 TDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSS---S 655
T+M MR PGRTYRFY G +F FG GLSYS + Y + S + S S
Sbjct: 592 TNMDMR-----AGPGRTYRFYTGTPLFEFGHGLSYSTFQYTWSNSSSSSSSSATSQHSLS 646
Query: 656 TKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVK--------- 706
T + Q + V E F V V+N G+MA VL F
Sbjct: 647 TAALAAQHLAARAPV--------EAVSFRVL--VQNTGKMASDDVVLAFASFNASSIIDQ 696
Query: 707 PARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCE 743
+ + PI+ LVGF+ + L EI F ++ +
Sbjct: 697 SSSQFASPPIRSLVGFRRIHLAPGASQEIFFAVTSSQ 733
>gi|340377241|ref|XP_003387138.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
queenslandica]
Length = 733
Score = 530 bits (1365), Expect = e-147, Method: Compositional matrix adjust.
Identities = 309/755 (40%), Positives = 425/755 (56%), Gaps = 54/755 (7%)
Query: 27 TQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPA 86
T+PP+ P+ T +C L R +DL+SRLTL+EKISQL NSA AI RLGIP
Sbjct: 23 TEPPYKTRPN----FTPAYCNYRLSFKDRVKDLLSRLTLEEKISQLGNSASAIDRLGIPG 78
Query: 87 YEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY 146
Y+WWSE LHGVA V G+ G + TSFPQ+I TA+SF+ L+Y IG+A+ EAR
Sbjct: 79 YQWWSEGLHGVA-VSPGLHLGGNLTCTTSFPQIITTASSFNKSLFYEIGEAVSTEARGFA 137
Query: 147 NAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKG 206
+ GQ G+T++ PNINI RDPRWGRGQET GEDP +T +YAV+ VRG QG+ K+
Sbjct: 138 DNGQG-GLTYFTPNINIVRDPRWGRGQETAGEDPYLTSQYAVNLVRGAQGNDSEYKKII- 195
Query: 207 KLQASACCKHFTAYDLDNW-KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCA 265
A CKHF AYDL+++ G R F+A VT QDL +TY P F SCV G IMC+
Sbjct: 196 -----ATCKHFAAYDLESYINGDVRDSFNAEVTKQDLEETYFPAFRSCVTAGGVGSIMCS 250
Query: 266 YNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKA 325
YN VNG+PSC D +K AR +W F GY+ SDC A+ + + Y +P D V LK
Sbjct: 251 YNSVNGVPSCVDGVFNNKIARNKWKFDGYLVSDCGAIDDVMNKHHYTSTPTDTVAAGLKG 310
Query: 326 GMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD 385
G D+NCGSF Q H A + E +IDRA+ LF+ RMRLGLF+ P QP+ D
Sbjct: 311 GTDLNCGSFYQTHAMDAFLNGSITEVDIDRAVGRLFTARMRLGLFD-LPKYQPYSYFNTD 369
Query: 386 VVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPS 445
VV + HQ LALQAA++ IVLL+N +G LPL +A++GPN + T+ G +
Sbjct: 370 VVNTKQHQDLALQAARESIVLLQN-NGKLPLSYEDHHKIAVVGPNILANVTMQGISQVIA 428
Query: 446 CRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE 505
I+P+ ++ + Y GCD + A + K A VV +MGLDQ E+E
Sbjct: 429 PYLISPVDGFKSKGLHVTYSLGCDVKCIVTDGFHDAFKLVKDAKAVVAVMGLDQGIERET 488
Query: 506 LDRVDLVLPGRQQELITRVAEAAKK-----PVILVLLCGGPVDITFAKYDRNIGSILWAG 560
+DR D+ LPG Q + + + + P+I+V++ G VD++ +K + +ILW G
Sbjct: 489 VDREDIFLPGLQDKFLLGLRDTLTNLQSPVPLIVVIMSGSSVDLSESKSLAD--AILWVG 546
Query: 561 YPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK-VPMTDMKMRPQATSGNPGRTYRFY 619
YPG++G A+AEVI+G+ NP GRLP+T+YP +YI V M MR PGRTYRFY
Sbjct: 547 YPGQSGGQAIAEVIYGEVNPSGRLPLTFYPGEYIDLVAYRHMSMREP-----PGRTYRFY 601
Query: 620 EGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCE 679
VFPFG GLSY+ + S + KM ++V SV ++ +F
Sbjct: 602 TENPVFPFGHGLSYTTFEL--------------SWTNKMNNVTEIVISDSV-DINIDF-- 644
Query: 680 TRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFEL 739
I V N G ++G VL +V + P+++L F V ++ E +I
Sbjct: 645 ------DITVVNTGYLSGAVSVLGYV--SSNIPDAPLRELFDFDKVFIDKYESKKISL-F 695
Query: 740 SPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
+ ++ + E G I G + + + + + I I
Sbjct: 696 ATNDAFTTVDEKGRRNILPGEYDIAIENLSHKIII 730
>gi|300121549|emb|CBK22068.2| unnamed protein product [Blastocystis hominis]
Length = 690
Score = 527 bits (1357), Expect = e-146, Method: Compositional matrix adjust.
Identities = 301/710 (42%), Positives = 410/710 (57%), Gaps = 53/710 (7%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
+RAR LV+ LTL EK+S + ++A + RL IP Y+WWSEALHGVA G+ F A
Sbjct: 3 ERARALVAELTLAEKMSLMGHTASEVKRLNIPKYQWWSEALHGVAA-SPGVVFQEPTPFA 61
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQ 173
T+FPQV LTA SFD L++ I I EAR + NA +A +T+W+PN+N++RDPRWGRGQ
Sbjct: 62 TAFPQVALTAQSFDKPLFHDIASIISTEARVMNNAERA-NLTYWSPNVNVYRDPRWGRGQ 120
Query: 174 ETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKF 233
ETPGEDP + YAV +VRG+Q G+ L+ SACCKH++AYDL+NW G R++F
Sbjct: 121 ETPGEDPFLVATYAVEFVRGLQE-----GEDPRYLKVSACCKHYSAYDLENWHGVERFEF 175
Query: 234 DARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHG 293
DA V+ +D+ DT+Q PFE CVK+G S +MC+YN +NGIP+CADR LL TAR WGF G
Sbjct: 176 DAIVSDRDMTDTFQVPFEQCVKKGHVSSLMCSYNAINGIPACADRELLYGTARGGWGFEG 235
Query: 294 YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEI 353
YITSDC A+ I Y + + ++A D++CG F Q+H +V+ +L E+E+
Sbjct: 236 YITSDCGAIDTIIYNHHYTNDTDTTAMLGVRATCDLDCGGFYQQHILHSVESGRLKEAEV 295
Query: 354 DRALHNLFSVRMRLGLFNGNPT-MQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHG 412
D AL NLF V+MRLGLF +P Q + G D + + HQ +AL+AA++GI LLKN +
Sbjct: 296 DDALANLFKVQMRLGLF--DPVEQQVYTHYGLDKLNTKEHQAMALRAAREGIALLKNQND 353
Query: 413 LLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVA 472
LPL K + ++GP A A +LGNY G +T Q L+N CD V
Sbjct: 354 FLPL-SLKDKHVVVMGPYAEDAGVMLGNYNGIPEFIVTVAQGLRNV---------CDHVD 403
Query: 473 CSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPV 532
+ +A+ +G D +V+ +GL+Q E+E LDR DL+LP Q+ L+ + PV
Sbjct: 404 VVKSL--EALSKLEGVDLIVVTVGLNQEIEREGLDREDLLLPASQRALLDGLLAQTDVPV 461
Query: 533 ILVLLC-GGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQ 591
+L LL GG VDI+ + + ++ +L GY G G A+AEVI GD NP GRL T Y
Sbjct: 462 VLTLLSGGGSVDISAYEQNEHVVGVLAVGYGGMFGGQAIAEVIVGDVNPSGRLVNTMYYN 521
Query: 592 DYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYL 650
DY+ + DM MRP+ +G PGRTYRF+ G + PFG GLSY+ +++ AV ++
Sbjct: 522 DYVTNLDYFDMNMRPKEETGFPGRTYRFFAGPVIHPFGFGLSYTTFAH---AVEIGQMRN 578
Query: 651 NQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVK-PAR 709
++ S ++ V + V N G G VLLFVK P
Sbjct: 579 HRLRSALAID------------------------VYVKVTNTGSRQGDESVLLFVKSPLA 614
Query: 710 RGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEG 759
G P+K L F V L E + F L E L A E V+ G
Sbjct: 615 GKQGYPLKSLADFSRVSLAPGETQTVHFVLGE-EQLHLANEQAKYVLLRG 663
>gi|194700280|gb|ACF84224.1| unknown [Zea mays]
Length = 452
Score = 518 bits (1335), Expect = e-144, Method: Compositional matrix adjust.
Identities = 256/454 (56%), Positives = 328/454 (72%), Gaps = 8/454 (1%)
Query: 327 MDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADV 386
MDVNCGS++Q H +A++Q K+ E +I+RALHNLF+VRMRLGLFNG+P +G IG D
Sbjct: 1 MDVNCGSYVQDHGASALQQGKITEQDINRALHNLFAVRMRLGLFNGDPRRNLYGDIGPDQ 60
Query: 387 VCSPAHQVLALQAAQDGIVLLKNS--HGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGP 444
VC+ HQ LAL+AAQDGIVLLKN G LPL K SLA+IG NAN A L GNY GP
Sbjct: 61 VCTQEHQDLALEAAQDGIVLLKNDGGAGALPLSKPNVASLAVIGFNANDAIRLRGNYFGP 120
Query: 445 SCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKE 504
C ++TPLQ LQ YV++T + GC++ AC+ +I +AV A AD VVL MGLDQ QE+E
Sbjct: 121 PCVTVTPLQVLQGYVKDTSFVAGCNSAACNVTTIPEAVQAASSADSVVLFMGLDQDQERE 180
Query: 505 ELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGE 564
E+DR+DL LPG+QQ LI VA AAKKPVILVLLCGGPVD++FAK + IG+ILWAGYPGE
Sbjct: 181 EVDRLDLTLPGQQQTLIESVANAAKKPVILVLLCGGPVDVSFAKTNPKIGAILWAGYPGE 240
Query: 565 AGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEV 624
AG +A+A+V+FG+HNPGGRLP+TWYPQD+ +VPMTDM+MR +G PGRTYRFY G V
Sbjct: 241 AGGIAIAQVLFGEHNPGGRLPVTWYPQDFTRVPMTDMRMRADPATGYPGRTYRFYRGPTV 300
Query: 625 FPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVE-NQDVVHYKSVPELGTEFCETRKF 683
F FG GLSYSKYS++F + + + K VE + V +G+E C+ KF
Sbjct: 301 FNFGYGLSYSKYSHRF---ATKPPPTSNVAGLKAVEATAGGMASYDVEAIGSETCDRLKF 357
Query: 684 LVTIGVKNHGEMAGKHPVLLFVK--PARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSP 741
+ V+NHG M GKH VL+F++ A G+GRP QL+GFQS+ L A + A + FE+SP
Sbjct: 358 PAVVRVQNHGPMDGKHSVLVFMRWPNATDGSGRPASQLIGFQSLHLRATQTAHVEFEVSP 417
Query: 742 CESLSRAREDGLMVIEEGTHFLVVGDEEYPISIF 775
C+ SRA EDG VI++G+HF++VG++E+ +S
Sbjct: 418 CKHFSRATEDGRKVIDQGSHFVMVGEDEFEMSFM 451
>gi|147857580|emb|CAN78858.1| hypothetical protein VITISV_030325 [Vitis vinifera]
Length = 699
Score = 510 bits (1313), Expect = e-141, Method: Compositional matrix adjust.
Identities = 283/647 (43%), Positives = 385/647 (59%), Gaps = 86/647 (13%)
Query: 133 RIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVR 192
R+ + + EARA+YN G A G+TFW+PN+NIF+DPRWGRGQETPGEDPL++ KYA YVR
Sbjct: 132 RLRKVVSTEARAMYNVGLA-GLTFWSPNVNIFQDPRWGRGQETPGEDPLLSSKYASGYVR 190
Query: 193 GVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFES 252
G+Q + +G +L+ +ACCKH+TAYDLDNWKG + F+A VT QD+ DT+QPPF+S
Sbjct: 191 GLQ-QSDDGSP--DRLKVAACCKHYTAYDLDNWKGVDCFHFNAVVTNQDMDDTFQPPFKS 247
Query: 253 CVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYA 312
CV G + ++ YI SDCD+V + Y+++ Y
Sbjct: 248 CVIDGNVASVI------------------------------YIVSDCDSVDVFYNSQHYT 277
Query: 313 KSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNG 372
K+PE+A + AG+D+NCGSFL +HT+AAVK + ES +D+A+ N F+ MRLG F+G
Sbjct: 278 KTPEEAAAKAILAGLDLNCGSFLGQHTEAAVKGGLVDESAVDKAVSNNFATLMRLGFFDG 337
Query: 373 NPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNAN 432
NP+ +GK+G VC+ HQ A +A + GIV
Sbjct: 338 NPSKAIYGKLGPKDVCTSEHQERAREAPRQGIV--------------------------- 370
Query: 433 SAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVV 492
+AG C+ TPLQ L V T Y PGC VAC +A ID+A IA AD V
Sbjct: 371 --------FAGTPCKYTTPLQGLTALVA-TTYLPGCSNVACGTAQIDEAKKIAAAADATV 421
Query: 493 LMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRN 552
L++G+DQ+ E E DRV++ LPG+Q LIT VA+ +K VILV++ GG DI+FAK D
Sbjct: 422 LIVGIDQSIEAEGRDRVNIQLPGQQPLLITEVAKXSKGNVILVVMSGGGFDISFAKNDDK 481
Query: 553 IGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGN 611
I SI W GYPGEAG A+A+VIFG +NP G+LPMTWYPQ Y+ KVPMT+M MRP SG
Sbjct: 482 ITSIQWVGYPGEAGGAAIADVIFGFYNPSGKLPMTWYPQSYVDKVPMTNMNMRPDPASGY 541
Query: 612 PGRTYRFYEGKEVFPFGCGLSYSKYSYKF----KAVSQNKLYLNQSSSTKMVENQDVVHY 667
PGRTYRFY G+ ++ FG GLSY+++++ K+VS + S+K
Sbjct: 542 PGRTYRFYTGETIYTFGDGLSYTQFNHHLVQAPKSVSIPIEEAHSCHSSKC--------- 592
Query: 668 KSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVIL 727
KSV + E C+ F + + V N G ++G H V LF P N P K L+GF+ V +
Sbjct: 593 KSVDAV-QESCQNLAFDIHLRVNNAGNISGSHTVFLFSSPPSVHN-SPQKHLLGFEKVFV 650
Query: 728 NAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
AK KA + F++ C+ LS E G + G H L VG+ ++ +++
Sbjct: 651 TAKAKALVRFKVDVCKDLSIVDELGTRKVALGLHVLHVGNLKHSLNV 697
>gi|440799679|gb|ELR20723.1| betaxylosidase [Acanthamoeba castellanii str. Neff]
Length = 748
Score = 502 bits (1292), Expect = e-139, Method: Compositional matrix adjust.
Identities = 302/755 (40%), Positives = 411/755 (54%), Gaps = 91/755 (12%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
PFC T+L QR DLVSRLTLD+ I Q+ + APA+P LGIPAY WW+E LHGV
Sbjct: 13 LPFCNTSLTAGQRTDDLVSRLTLDQLIGQMGHQAPAVPSLGIPAYNWWTECLHGV----- 67
Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNI- 161
TSFP A+F+ L +++ +AI EARAL N G G+ FWAPNI
Sbjct: 68 --LTKCGTNCPTSFPAPCALGAAFNMKLIHKMARAISNEARALNNEGIG-GLDFWAPNIK 124
Query: 162 ----------------------NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTF 199
+I RDPRWGR E PGEDP +T +Y ++RG+Q
Sbjct: 125 YSTQPTNKTRQESQLRNAMVCISINRDPRWGRNMEVPGEDPFMTAQYVAHFMRGLQE--- 181
Query: 200 NGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRA 259
G+ Q CKHF AY L+ WK R+ FDA V+ D +TY P F+ C+ +GRA
Sbjct: 182 --GEDSRYPQVVGTCKHFAAYSLEAWKDYDRFMFDAIVSDYDFVETYLPAFKGCIVEGRA 239
Query: 260 SGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAV 319
IMC+YN VNG+PSCA+ LL R W F GY+ SDCDAV IY+ + K+PE A
Sbjct: 240 RSIMCSYNSVNGVPSCANDFLLRTILRDSWSFDGYVVSDCDAVDTIYNNHHFTKTPEGAC 299
Query: 320 VDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
L AG D+NCG F QKH A + ++ E E+ A+ LF RM LG+++ P QP+
Sbjct: 300 AVALHAGTDLNCGDFYQKHLGKAHSEGRVTEDEVRLAVKRLFRQRMELGMWD-PPAEQPY 358
Query: 380 GKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLG 439
+ VV S H LALQAA++ +VLL+N G+LPL KS +A+IGPNAN+ +T+LG
Sbjct: 359 KQYPPSVVGSREHSDLALQAARESMVLLQNRRGVLPLRKSVR-RVAVIGPNANATETMLG 417
Query: 440 NYAGPSCRS------ITPLQALQNYVENTV--YYPGCDTVACSSASIDKAVDIAKGADHV 491
NY G C ++P A++ + + Y GCD + ++ I +AV A+ AD
Sbjct: 418 NYYGSRCHDGTYDCIVSPYLAIKAKLPQALVTYNLGCDVDSTNTTGIPEAVKAAQAADVA 477
Query: 492 VLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDR 551
++++GL+ + E E DRV + LPG Q LI + A P ++V++ GG V I + K
Sbjct: 478 IVVLGLNTSVESEGKDRVAITLPGMQDHLIKSIV-ATNTPTVVVMMHGGAVAIEWIK--D 534
Query: 552 NIGSILWAGYPGEAGAVALAEVIFGDHNPG----------GRLPMTWYPQDYIK-VPMTD 600
+ I+ A YPGE G A+A+V+FGD+NPG GRLP+T P +Y+ VP+T+
Sbjct: 535 QVDGIVDAFYPGENGGQAIADVLFGDYNPGDNKTDGTTLLGRLPVTVLPANYVDMVPLTN 594
Query: 601 MKMRPQATSGNPGRTYRFYEG-KEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMV 659
M MR A+ NPGRTYR+Y G ++ FG GLSY+ + ++ L Q S+ K
Sbjct: 595 MSMR--ASGNNPGRTYRYYTGPAPLWEFGFGLSYTTFKTEW-------LSTPQPSALKSY 645
Query: 660 ENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQL 719
+ V ++ + V N G +AG VL FV G P+KQL
Sbjct: 646 ARDEAVSFR------------------VRVTNVGPVAGDEVVLAFVTRDNADRG-PLKQL 686
Query: 720 VGFQSVILNAKEKAEIVFELSPCESLSRAREDGLM 754
F+ V LN E EI F P ++L+ A DG M
Sbjct: 687 FAFERVHLNPGESKEIFFNTGP-DTLAVA-TDGAM 719
>gi|125534110|gb|EAY80658.1| hypothetical protein OsI_35835 [Oryza sativa Indica Group]
Length = 511
Score = 500 bits (1288), Expect = e-139, Method: Compositional matrix adjust.
Identities = 258/488 (52%), Positives = 335/488 (68%), Gaps = 7/488 (1%)
Query: 294 YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEI 353
Y+ SDCDAV+ I DA Y SPED V +KAGMDVNCG++ Q H AAV++ L E +I
Sbjct: 16 YVASDCDAVATIRDAHHYTLSPEDTVAVSIKAGMDVNCGNYTQVHAMAAVQKGNLTEKDI 75
Query: 354 DRALHNLFSVRMRLGLFNGNPTMQP-FGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHG 412
DRAL NLF+VRMRLG F+G+P +G +GA VCSPAH+ LAL+AAQDGIVLLKN G
Sbjct: 76 DRALVNLFAVRMRLGHFDGDPRSNAVYGHLGAADVCSPAHKSLALEAAQDGIVLLKNDAG 135
Query: 413 LLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV-ENTVYYPGCDTV 471
LPL S SLA+IGPNA++ L GNY GP C + TPLQ ++ Y+ + + GCD+
Sbjct: 136 ALPLQPSAVTSLAVIGPNADNLGALHGNYFGPPCETTTPLQGIKGYLGDRARFLAGCDSP 195
Query: 472 ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKP 531
AC+ A+ ++A +A +DHVVL MGL Q QE+E LDR L+LPG QQ LIT VA AA++P
Sbjct: 196 ACAVAATNEAAALASSSDHVVLFMGLSQKQEQEGLDRTSLLLPGEQQGLITAVANAARRP 255
Query: 532 VILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQ 591
VILVLL GGPVD+TFAK + IG+IL AGYPG+AG +A+A+V+FGDHNP GRLP+TWYP+
Sbjct: 256 VILVLLTGGPVDVTFAKDNPKIGAILLAGYPGQAGGLAIAKVLFGDHNPSGRLPVTWYPE 315
Query: 592 DYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYK-FKAVSQ----N 646
++ KVPMTDM+MR +G PGR+YRFY+G V+ FG GLSYSK+S + F + S N
Sbjct: 316 EFTKVPMTDMRMRADPATGYPGRSYRFYQGNTVYNFGYGLSYSKFSRRMFSSFSTSNAGN 375
Query: 647 KLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVK 706
L + + ++ + V E+G E C F + V+NHG M GKH VL++++
Sbjct: 376 LSLLAGVMARRAGDDGGGMSSYLVKEIGVERCSRLVFPAVVEVQNHGPMDGKHSVLMYLR 435
Query: 707 PARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
+ GRP +QL+GF+S + EKA + FE+SPCE S EDG VI+ G HFL+VG
Sbjct: 436 WPTKSGGRPARQLIGFRSQHVKVGEKAMVSFEVSPCEHFSWVGEDGERVIDGGAHFLMVG 495
Query: 767 DEEYPISI 774
DEE S+
Sbjct: 496 DEELETSL 503
>gi|336377735|gb|EGO18896.1| glycoside hydrolase family 3 protein [Serpula lacrymans var.
lacrymans S7.9]
Length = 766
Score = 499 bits (1284), Expect = e-138, Method: Compositional matrix adjust.
Identities = 297/723 (41%), Positives = 422/723 (58%), Gaps = 35/723 (4%)
Query: 46 CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
C T+L RA +V T+DE I+ V+++P +PRLG+P Y+WWSE LHGVA G+
Sbjct: 37 CDTSLDPISRATAVVDLFTIDELINNTVSTSPGVPRLGLPPYQWWSEGLHGVAD-SPGVN 95
Query: 106 FN--GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
F+ G ATSFPQ I+ A+FD L +G +G+E R+ N G+A G+ FW PNIN
Sbjct: 96 FSASGEFSYATSFPQPIIMGAAFDDELIKSVGAIVGMEGRSFNNYGRA-GLDFWTPNINP 154
Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
F+DPRWGRGQETPGEDP +Y + V+G+QG G K Q + CKHF AYDL+
Sbjct: 155 FKDPRWGRGQETPGEDPYHLAQYVYNLVQGLQG----GLDPKPYYQVISTCKHFAAYDLE 210
Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
+W G RY FDA VT QDL++ Y P F+SC + + MC+YN VNGIPSCA+ LL
Sbjct: 211 DWDGNYRYGFDAIVTTQDLSEYYLPSFQSCYRDAKVGAAMCSYNAVNGIPSCANTYLLQS 270
Query: 284 TARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA 341
R WGF ++TSDCDAV IYD Y K+PE+AV D LKAG D++CG+F ++
Sbjct: 271 ILRDFWGFAEDRWVTSDCDAVDNIYDPHNYTKTPEEAVADALKAGTDIDCGTFYSEYLPG 330
Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPT-MQPFGKIGADVVCSPAHQVLALQAA 400
A Q + E+E+ +AL ++ +RLG F +PT +QP+ + + V +P Q LA QAA
Sbjct: 331 AYNQSLITETELRQALIRQYASLVRLGYF--DPTDIQPYRQYNWNNVDTPQAQQLAYQAA 388
Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE 460
+GIVLLKN G LPL S ++ALIGP N+ + GNY G + I+PL
Sbjct: 389 AEGIVLLKND-GTLPL-SSDIKNIALIGPWGNATGEMQGNYYGVAPYLISPLMGAVATGY 446
Query: 461 NTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQEL 520
N Y G + + ++ A+ A+GAD V+ G+D+T E E DR + PG Q +L
Sbjct: 447 NVTYVFGTNITSNDTSGFAAAIAAAQGADVVIYAGGIDETVESEGNDRNYITWPGNQLDL 506
Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
+ +A A KP+++V GG VD T K + + ++LWAGYPG++G AL ++I G P
Sbjct: 507 VGELA-AVGKPLVVVQFGGGQVDDTSLKANSTVNALLWAGYPGQSGGSALFDIISGKVAP 565
Query: 581 GGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYK 639
GRLP+T YP DY+ ++PMTDM +RP ATS PGRTY++Y G ++ FG GL Y+ +SYK
Sbjct: 566 AGRLPVTQYPADYVYEIPMTDMDLRPNATS--PGRTYKWYTGTPIYDFGYGLHYTTFSYK 623
Query: 640 FKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKH 699
+ A + + Y Q+ +V++ ++ Y + T T+ V N G +
Sbjct: 624 W-AKAPSSTYNIQT----LVQSGNLYSYLDLAPFDT---------FTVNVTNTGNVTSDF 669
Query: 700 PVLLFVKPARRGNGRPIKQLVGFQSV-ILNAKEKAEIVFELSPCESLSRAREDGLMVIEE 758
LLFV + P K L+ + + + + + A + ++ S++RA G M +
Sbjct: 670 ASLLFVNGTYGPSPYPNKSLITYARLHDIASGDTASVALGVT-LGSIARADTYGNMWLYP 728
Query: 759 GTH 761
GT+
Sbjct: 729 GTY 731
>gi|336365124|gb|EGN93476.1| glycoside hydrolase family 3 protein [Serpula lacrymans var.
lacrymans S7.3]
Length = 732
Score = 498 bits (1282), Expect = e-138, Method: Compositional matrix adjust.
Identities = 297/723 (41%), Positives = 422/723 (58%), Gaps = 35/723 (4%)
Query: 46 CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
C T+L RA +V T+DE I+ V+++P +PRLG+P Y+WWSE LHGVA G+
Sbjct: 22 CDTSLDPISRATAVVDLFTIDELINNTVSTSPGVPRLGLPPYQWWSEGLHGVAD-SPGVN 80
Query: 106 FN--GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
F+ G ATSFPQ I+ A+FD L +G +G+E R+ N G+A G+ FW PNIN
Sbjct: 81 FSASGEFSYATSFPQPIIMGAAFDDELIKSVGAIVGMEGRSFNNYGRA-GLDFWTPNINP 139
Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
F+DPRWGRGQETPGEDP +Y + V+G+QG G K Q + CKHF AYDL+
Sbjct: 140 FKDPRWGRGQETPGEDPYHLAQYVYNLVQGLQG----GLDPKPYYQVISTCKHFAAYDLE 195
Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
+W G RY FDA VT QDL++ Y P F+SC + + MC+YN VNGIPSCA+ LL
Sbjct: 196 DWDGNYRYGFDAIVTTQDLSEYYLPSFQSCYRDAKVGAAMCSYNAVNGIPSCANTYLLQS 255
Query: 284 TARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA 341
R WGF ++TSDCDAV IYD Y K+PE+AV D LKAG D++CG+F ++
Sbjct: 256 ILRDFWGFAEDRWVTSDCDAVDNIYDPHNYTKTPEEAVADALKAGTDIDCGTFYSEYLPG 315
Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPT-MQPFGKIGADVVCSPAHQVLALQAA 400
A Q + E+E+ +AL ++ +RLG F +PT +QP+ + + V +P Q LA QAA
Sbjct: 316 AYNQSLITETELRQALIRQYASLVRLGYF--DPTDIQPYRQYNWNNVDTPQAQQLAYQAA 373
Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE 460
+GIVLLKN G LPL S ++ALIGP N+ + GNY G + I+PL
Sbjct: 374 AEGIVLLKND-GTLPL-SSDIKNIALIGPWGNATGEMQGNYYGVAPYLISPLMGAVATGY 431
Query: 461 NTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQEL 520
N Y G + + ++ A+ A+GAD V+ G+D+T E E DR + PG Q +L
Sbjct: 432 NVTYVFGTNITSNDTSGFAAAIAAAQGADVVIYAGGIDETVESEGNDRNYITWPGNQLDL 491
Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
+ +A A KP+++V GG VD T K + + ++LWAGYPG++G AL ++I G P
Sbjct: 492 VGELA-AVGKPLVVVQFGGGQVDDTSLKANSTVNALLWAGYPGQSGGSALFDIISGKVAP 550
Query: 581 GGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYK 639
GRLP+T YP DY+ ++PMTDM +RP ATS PGRTY++Y G ++ FG GL Y+ +SYK
Sbjct: 551 AGRLPVTQYPADYVYEIPMTDMDLRPNATS--PGRTYKWYTGTPIYDFGYGLHYTTFSYK 608
Query: 640 FKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKH 699
+ A + + Y Q+ +V++ ++ Y + T T+ V N G +
Sbjct: 609 W-AKAPSSTYNIQT----LVQSGNLYSYLDLAPFDT---------FTVNVTNTGNVTSDF 654
Query: 700 PVLLFVKPARRGNGRPIKQLVGFQSV-ILNAKEKAEIVFELSPCESLSRAREDGLMVIEE 758
LLFV + P K L+ + + + + + A + ++ S++RA G M +
Sbjct: 655 ASLLFVNGTYGPSPYPNKSLITYARLHDIASGDTASVALGVT-LGSIARADTYGNMWLYP 713
Query: 759 GTH 761
GT+
Sbjct: 714 GTY 716
>gi|413919687|gb|AFW59619.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 451
Score = 497 bits (1279), Expect = e-137, Method: Compositional matrix adjust.
Identities = 238/419 (56%), Positives = 308/419 (73%), Gaps = 6/419 (1%)
Query: 29 PPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYE 88
P F+CD SN + ++ FC + + RA DLVSRLTL EK+ LV+ A+PRLG+P YE
Sbjct: 38 PAFACDASNATLASYGFCNRSAAAAARAADLVSRLTLAEKVGFLVDKQAALPRLGVPLYE 97
Query: 89 WWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA 148
WWSEALHGV+ VG G F+ + GATSFPQ ILTAASF++ L+ IG+ + EARA++N
Sbjct: 98 WWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNATLFRAIGEVVSNEARAMHNV 157
Query: 149 GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKL 208
G A G+TFW+PNINIFRDPRWGRGQETPGEDPL+T KYAV YV G+QG G LK
Sbjct: 158 GLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLTSKYAVGYVTGLQGAVSGAGALK--- 213
Query: 209 QASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
+ACCKH+TAYD+DNWKG RY FDA V+ QDL DT+QPPF+SCV G + +MC+YN+
Sbjct: 214 -VAACCKHYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQPPFKSCVVDGNVASVMCSYNQ 272
Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
VNG P+CAD++LLS R W +GYI+SDCD+V ++Y+ + Y K+PEDA +KAG+D
Sbjct: 273 VNGKPTCADKDLLSGVIRGDWKLNGYISSDCDSVDVLYNNQHYTKTPEDAAAISIKAGLD 332
Query: 329 VNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVC 388
+NCG+FL +HT AAV+ KL ES++DRA+ N MRLG F+G+P PFG +G VC
Sbjct: 333 LNCGTFLAQHTVAAVQAGKLSESDVDRAVTNNLVTLMRLGFFDGDPRELPFGNLGPSDVC 392
Query: 389 SPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
+P++Q LA +AA+ GIVLLKN+ G LPL + S+A+IGPNAN++ T++GNY G SC
Sbjct: 393 TPSNQELAREAARQGIVLLKNT-GKLPLSATSIKSMAVIGPNANASFTMIGNYEGTSCN 450
>gi|407922988|gb|EKG16078.1| Glycoside hydrolase family 3 [Macrophomina phaseolina MS6]
Length = 800
Score = 497 bits (1279), Expect = e-137, Method: Compositional matrix adjust.
Identities = 281/738 (38%), Positives = 414/738 (56%), Gaps = 37/738 (5%)
Query: 46 CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV--GKG 103
C ++ RA LV LTL+EK++ N++P +PRLGIP Y+WW+EALHGVA G+
Sbjct: 41 CDSSATPLARATALVKELTLEEKLNNTGNTSPGVPRLGIPEYQWWNEALHGVAFTYPGQP 100
Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
+ +G ATSFPQ IL A+FD L Y + + EARA N G++ G+ +W PNIN
Sbjct: 101 MTESGNFSSATSFPQPILMGAAFDDELIYEVASVVSTEARAYSNGGRS-GLDYWTPNINP 159
Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
++DPRWGRGQETPGEDP Y + +RG++G+ + K + A CKHFT YD++
Sbjct: 160 YKDPRWGRGQETPGEDPFHLASYVQNLIRGLEGNQNDPYK-----KIVATCKHFTGYDME 214
Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
NW G RY+FDA++ M+D+ + Y PPF++C ++ + MC+YN VNG+P+CAD LL
Sbjct: 215 NWNGNFRYQFDAQINMRDMVEYYMPPFQACAREAKVGAFMCSYNAVNGVPTCADPWLLQT 274
Query: 284 TARRQWGFH---GYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTK 340
R WG++ ++ SDCDA+ +Y +A+S E AV D L AG D+NCG++ Q++
Sbjct: 275 VLREHWGWNQEDQWVVSDCDAIQNVYLPHEWAESREQAVADTLNAGTDLNCGTYYQRYLP 334
Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
A +Q + ++ +DRAL +S ++LG F+ N QP+ +IG V S Q LAL+AA
Sbjct: 335 GAYEQGLINDTTLDRALTRTYSSLIKLGYFD-NADSQPYRQIGWQDVNSQHAQELALKAA 393
Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE 460
Q+GIVLLKN GLLPL S+ALIG AN+ + + GNYAG + +PL A +
Sbjct: 394 QEGIVLLKND-GLLPLSLDGVSSIALIGSWANATEQMQGNYAGVAPYLHSPLYAAEQLGV 452
Query: 461 NTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQEL 520
Y G ++ A+ +D ++++ G+D E EELDRV + G Q ++
Sbjct: 453 KVNYAEGASQSNPTTDQWGAEYTAAENSDVIIVVGGIDNDIESEELDRVAIAWSGPQLDM 512
Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
IT++A KPVI+V + G +D T + NI ++LW GYPG+ G AL ++I G P
Sbjct: 513 ITKLATYG-KPVIVVQMGAGQLDSTPLVSNANISALLWGGYPGQDGGTALFDIITGAVAP 571
Query: 581 GGRLPMTWYPQDYIK-VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYK 639
GRLP+T YP Y K V MTDM +RP +TS GRTY++Y G VFPFG GL Y+ +S
Sbjct: 572 AGRLPITQYPARYTKEVAMTDMSLRPSSTSA--GRTYKWYNGTAVFPFGFGLHYTNFSAA 629
Query: 640 FKAVSQNKLYLNQ---SSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMA 696
+ + ++ S S D+ + S + + + N G A
Sbjct: 630 IPSPPASSFAISDLVASCSANDTSKLDLCPFTS---------------LAVDIANDGTRA 674
Query: 697 GKHPVLLFVKPARRGNGRPIKQLVGFQSV-ILNAKEKAEIVFELSPCESLSRAREDGLMV 755
L F+ + P LV +Q + + A E L+ SL R E+G +
Sbjct: 675 SDFVALAFLTGEFGPSPHPKSSLVAYQRLHAIAAGETQTARLNLT-LGSLVRVDENGDKL 733
Query: 756 IEEGTHFLVVGDEEYPIS 773
+ G + +++ + P++
Sbjct: 734 LYPGDYSVLIDVPDKPLA 751
>gi|78482949|emb|CAJ41429.1| beta (1,4)-xylosidase [Populus tremula x Populus alba]
Length = 732
Score = 494 bits (1273), Expect = e-137, Method: Compositional matrix adjust.
Identities = 254/527 (48%), Positives = 342/527 (64%), Gaps = 22/527 (4%)
Query: 13 LCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQL 72
L + F + + PF+CDP + + PFC+ LPI R DL+ R+TL EK+ L
Sbjct: 9 LSVLFLGVSLQTSKALDPFACDPKDGTNRDLPFCQVNLPIHTRVNDLIGRMTLQEKVGLL 68
Query: 73 VNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWY 132
VN+A A+PRLGI YEWWSEALHGV+ VG G F G ATSFPQVI TAASF++ LW
Sbjct: 69 VNNAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGAFPVATSFPQVITTAASFNATLWE 128
Query: 133 RIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVR 192
IG+ + EARA++N G A G+T+W+PN+ PRWGRGQETPGEDP+V GKYA SYVR
Sbjct: 129 AIGRVVSDEARAMFNGGVA-GLTYWSPNVTYSVYPRWGRGQETPGEDPVVVGKYAASYVR 187
Query: 193 GVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFES 252
G+QG +G +LK +ACCKHFTAYDLDNW G R+ F+A+V+ QD+ DT+ PF
Sbjct: 188 GLQGS--DGIRLK----VAACCKHFTAYDLDNWNGVDRFHFNAKVSKQDMVDTFDVPFRM 241
Query: 253 CVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYA 312
CVK+G+ + +MC+YN+VNGIP+CAD NLL KT R QW +GYI SDCD+ + Y + +
Sbjct: 242 CVKEGKVASVMCSYNQVNGIPTCADPNLLKKTVRGQWRLNGYIVSDCDSFGVYYGQQHFT 301
Query: 313 KSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNG 372
SP + + KAG+D++CG FL H A +K E+EI+ A + ++ LG+F+G
Sbjct: 302 -SPRRSSLGCYKAGLDLDCGPFLVTHRDAV--KKAAEEAEINNAWLKTLTFQISLGIFDG 358
Query: 373 NPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNA- 431
+P +Q G + + P +Q LA+ A + + + KN LL P+ + GP A
Sbjct: 359 SP-LQAVGDV-VPTMGPPTNQDLAVNAPKR-LFIFKNRAFLLYSPRH------IFGPVAL 409
Query: 432 -NSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADH 490
S +LGNY G C+ + PLQ L +V + +Y PGC V C+ A + AVD+A AD
Sbjct: 410 FKSLPFMLGNYEGLPCKYLFPLQGLAGFV-SLLYLPGCSNVICAVADVGSAVDLAASADA 468
Query: 491 VVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLL 537
VVL++G DQ+ E+E DRVD LPG+QQEL+TRVA AAK PV+LV++
Sbjct: 469 VVLVVGADQSIEREGHDRVDFYLPGKQQELVTRVAMAAKGPVLLVIM 515
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 48/88 (54%), Gaps = 1/88 (1%)
Query: 687 IGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLS 746
+ VKN G M G H +L++ +P R + P KQLV F+ V + A + + + C+SLS
Sbjct: 636 VDVKNTGSMDGTHTLLVYFRPPAR-HWAPHKQLVAFEKVHVAAGTQQRVGINIHVCKSLS 694
Query: 747 RAREDGLMVIEEGTHFLVVGDEEYPISI 774
G+ I G H L +GD ++ +S+
Sbjct: 695 VVDGSGIRRIPMGEHSLHIGDVKHSVSL 722
>gi|396473219|ref|XP_003839293.1| similar to beta-1,4-xylosidase [Leptosphaeria maculans JN3]
gi|312215862|emb|CBX95814.1| similar to beta-1,4-xylosidase [Leptosphaeria maculans JN3]
Length = 789
Score = 492 bits (1266), Expect = e-136, Method: Compositional matrix adjust.
Identities = 293/744 (39%), Positives = 412/744 (55%), Gaps = 41/744 (5%)
Query: 34 DPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEA 93
D N +T C T+ RA+ LV+ TL+EKI+ + +P +PRLGIP Y+WWSE
Sbjct: 23 DCQNGPLQTETICNTSASPLDRAKSLVTLYTLEEKINATSSGSPGVPRLGIPPYQWWSEG 82
Query: 94 LHGVAGVGKGIFFNGTIRG-ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI 152
LHG+AG +G +TSFPQ IL A+FD +L + + I EARA NA +
Sbjct: 83 LHGIAGPYTNFSTSGIEYSYSTSFPQPILMGAAFDDHLITDVAKVISTEARAFNNANRT- 141
Query: 153 GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASA 212
G+ FW PNIN FRDPRWGRGQETPGED Y + + G+QG+T + K + A
Sbjct: 142 GLDFWTPNINPFRDPRWGRGQETPGEDAFHLSSYVKALIAGLQGETTDPYK-----RVVA 196
Query: 213 CCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGI 272
CKHF YD+++W G RY+FDA+++ QDL + Y PF++CV Q MC+YN VNG+
Sbjct: 197 TCKHFAGYDIEDWNGNLRYQFDAQISQQDLVEYYLQPFQACV-QANVGAFMCSYNAVNGV 255
Query: 273 PSCADRNLLSKTARRQWGF---HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDV 329
P+CAD LL R WG+ ++TSDCDAV IY ++ + E AV D L AG D+
Sbjct: 256 PTCADPYLLQTILREHWGWTNEEQWVTSDCDAVQNIYLPHQWSATREQAVADALIAGTDL 315
Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
+CG+++Q+H A Q + E+ +D+AL +S +RLG F+ + QP+ + G D V +
Sbjct: 316 DCGTYMQEHLPGAFAQGLVNENVLDQALVRQYSSLVRLGWFD-DAADQPYRQFGWDSVAT 374
Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
A Q LA +AA +GIVLLKN G+LPL SVSL + G AN+ LLGNYAG
Sbjct: 375 DASQALARRAAVEGIVLLKND-GVLPLSIDSSVSLGVFGDWANATSQLLGNYAGVPTYLH 433
Query: 450 TPLQALQNYVEN-TVYYPGCDTVACSSASIDKAVDIAKG---ADHVVLMMGLDQTQEKEE 505
+PL ALQ EN T+ Y G + + ++ ++ +D ++ + G+D + E+E
Sbjct: 434 SPLWALQQ--ENLTINYAGGNPGGQGDPTTNRWSSLSGAIATSDILIYIGGIDNSIEEEG 491
Query: 506 LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEA 565
DR L G Q ++I ++A A KP I+V++ GG +D + NI +ILWAGYPG+
Sbjct: 492 HDRTSLAWTGAQLDVIFQLA-ATGKPTIVVVMGGGQIDSAPLANNANISAILWAGYPGQD 550
Query: 566 GAVALAEVIFGDHNPGGRLPMTWYPQDYIK-VPMTDMKMRPQATSGNPGRTYRFYEGKEV 624
G A+ +++ G P GRLP T YP Y VPMTDM +RP + NPGRTY++Y G
Sbjct: 551 GGPAIVDILTGKSPPAGRLPQTQYPASYTSLVPMTDMGLRP--SENNPGRTYKWYNGTAT 608
Query: 625 FPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFL 684
+ FG GL Y+ +S + Q + ST K+ + E C
Sbjct: 609 YEFGHGLHYTNFSATVTSPMQQSYRIADLMSTC----------KNATSITLERCAFTS-- 656
Query: 685 VTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQ---SVILNAKEKAEIVFELSP 741
V I V N G +A + L ++ + P K LVG+Q + A + A I L
Sbjct: 657 VDISVTNTGAVASDYVTLCYISGSHGPAPHPKKSLVGYQRLFGIAAGASDTARIDLTL-- 714
Query: 742 CESLSRAREDGLMVIEEGTHFLVV 765
ESL+R E G V+ G + L+V
Sbjct: 715 -ESLARVDEVGNKVLYPGEYSLMV 737
>gi|389748500|gb|EIM89677.1| glycoside hydrolase family 3 protein [Stereum hirsutum FP-91666
SS1]
Length = 770
Score = 491 bits (1264), Expect = e-136, Method: Compositional matrix adjust.
Identities = 285/728 (39%), Positives = 405/728 (55%), Gaps = 30/728 (4%)
Query: 46 CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
C T+ RA+ LV+ +TL+E ++ VN++P +PRLG+P YEWWSEALHGVA G+
Sbjct: 36 CNTSANFLDRAKALVNAMTLEEMVNNTVNTSPGVPRLGLPPYEWWSEALHGVAS-SPGVT 94
Query: 106 F--NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
F +G GATSFP+ IL +A+FD L + + I EARA N + G+ F+ PNIN
Sbjct: 95 FETSGDFSGATSFPEPILMSAAFDDDLIFSVASTISTEARAFGNTNHS-GLDFFTPNINP 153
Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
F+DPRWGRGQETPGEDPL T +Y + G+QG G + A CKHF AYDL+
Sbjct: 154 FKDPRWGRGQETPGEDPLHTSRYVYQLITGLQG----GVGPSPYYKIIADCKHFAAYDLE 209
Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
NW+G R F+A V+ QDLA+ Y P F+SCV+ + +MC+YN VNG+P+C LL
Sbjct: 210 NWEGNNRMAFNAIVSTQDLAEFYTPSFQSCVRDAKVGSVMCSYNAVNGVPACGSPYLLQD 269
Query: 284 TARR--QWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA 341
R + G +ITSDCDAV I+D Y + +A L AG DV+CG+ +
Sbjct: 270 LVRDYFELGNDTWITSDCDAVGNIFDPHNYTTTLTNASAVALLAGTDVDCGTSYSETLGE 329
Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQ 401
AV + + +S+++RAL L+ +RLG F+ ++ P+ +GA V +PA Q LA AA
Sbjct: 330 AVSEGLVSKSDVERALVRLYGSLVRLGYFDPEDSV-PYRALGASDVNTPAAQTLAYTAAV 388
Query: 402 DGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN 461
+GIVLLKN GLLPL + S +ALIGP AN+ + GNY G + I+PL + N
Sbjct: 389 EGIVLLKN-DGLLPLSSNVS-HIALIGPWANATTQMQGNYEGIAPLLISPLDGFTSAGFN 446
Query: 462 TVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELI 521
+ G S++ A+ +A AD +V + G+D T E E DR + PG Q ELI
Sbjct: 447 VSFTNGTTISGNSTSGFADALSMASAADVIVYIGGIDDTVEAEGQDRTSITWPGNQLELI 506
Query: 522 TRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
+ A KP +++ + GG VD T K + ++ ++LW GYPG+AG ALA++I G P
Sbjct: 507 GELG-AFGKPFVVIQMGGGQVDDTELKANSSVNALLWGGYPGQAGGKALADIITGVQAPA 565
Query: 582 GRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKF 640
GRL T YP Y+ +V MTDM +RP ++G+PGRTY++Y G VF FG GL Y+ + ++
Sbjct: 566 GRLTTTQYPASYVDQVAMTDMSVRPSNSTGSPGRTYKWYTGTPVFEFGFGLHYTTFDVEW 625
Query: 641 KAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHP 700
S Y Q + V + L T T+ V N G + +
Sbjct: 626 AEGSPAASYSIQDLVASANSSSSAVAHVDSAILDT---------FTVQVTNTGNVTSDYV 676
Query: 701 VLLFVKPARRGNGRPIKQLVGF---QSVILNAKEKAEIVFELSPCESLSRAREDGLMVIE 757
LLF + P+++LV + + + A + L +++R EDG +I
Sbjct: 677 ALLFSNTTAGPSPAPLQELVSYARVKGITPGVSATASLNVTLG---TIARVDEDGNSIIY 733
Query: 758 EGTHFLVV 765
G + L V
Sbjct: 734 PGVYNLWV 741
>gi|238508313|ref|XP_002385353.1| beta-xylosidase, putative [Aspergillus flavus NRRL3357]
gi|296439537|sp|B8NYD8.1|BXLB_ASPFN RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
Flags: Precursor
gi|220688872|gb|EED45224.1| beta-xylosidase, putative [Aspergillus flavus NRRL3357]
Length = 776
Score = 490 bits (1262), Expect = e-135, Method: Compositional matrix adjust.
Identities = 303/784 (38%), Positives = 431/784 (54%), Gaps = 44/784 (5%)
Query: 2 IFHKLSLVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVS 61
+ H L+ PL F + L +S P S P + + C T+L RA+ LV+
Sbjct: 1 MVHLSPLLRPLAAFSFFTSLASTESIFPDCSTGPLSKNN----VCDTSLDPVSRAKSLVA 56
Query: 62 RLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFN--GTIRGATSFPQV 119
+TL+EKI+ + PRLG+PAY WW+EALHGVA G G+ F+ G ATSFP
Sbjct: 57 AMTLEEKINNTKYDSSGAPRLGLPAYNWWNEALHGVA-EGHGVSFSDSGNFSYATSFPMP 115
Query: 120 ILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGED 179
IL A+FD L ++ I EARA N G A G+ +W PNIN FRDPRWGRGQETPGED
Sbjct: 116 ILLGAAFDDDLVKQVATVISTEARAFANGGHA-GLDYWTPNINPFRDPRWGRGQETPGED 174
Query: 180 PLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTM 239
PL +Y V G+Q G + + A CKHF AYDL+NW+G RY FDA V+
Sbjct: 175 PLHLSRYVYHLVDGLQD-----GIGPERPKVVATCKHFAAYDLENWEGIERYAFDAVVSP 229
Query: 240 QDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFH---GYIT 296
QDL++ Y P F++C + + +MC+YN +NGIP+CADR LL R WG+ ++T
Sbjct: 230 QDLSEYYLPSFKTCTRDAKVDAVMCSYNSLNGIPTCADRWLLQTLLREHWGWEQTGHWVT 289
Query: 297 SDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRA 356
DC A+ IY Y A L AG D++CGS ++ ++A++Q ++ A
Sbjct: 290 GDCGAIDNIYADHHYVADGAHAAAAALNAGTDLDCGSVFPEYLRSALQQGLYNNQTLNNA 349
Query: 357 LHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPL 416
L L+S ++LG F+ QP+ IG + V +PA + LA +A +GIV+LKN G LPL
Sbjct: 350 LIRLYSSLVKLGYFDPADD-QPYRSIGWNEVFTPAAEELAHKATVEGIVMLKND-GTLPL 407
Query: 417 PKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSA 476
+ +V A+IGP AN+ L GNY GP T + A + + G D + SSA
Sbjct: 408 KSNGTV--AIIGPFANATTQLQGNYEGPPKYIRTLIWAAVHNGYKVKFSQGTDINSNSSA 465
Query: 477 SIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVL 536
+A+ AK AD V+ G+D T EKE DR +V PG Q +LI ++++ +KP+I+V
Sbjct: 466 GFAEAISAAKEADTVIYAGGIDNTIEKESQDRTTIVWPGNQLDLIEQLSD-LEKPLIVVQ 524
Query: 537 LCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-K 595
GG VD + + +G++LWAGYP +AG A+ +++ G P GRLP+T YP Y+ +
Sbjct: 525 FGGGQVDDSSLLANAGVGALLWAGYPSQAGGAAVFDILTGKSAPAGRLPVTQYPASYVDE 584
Query: 596 VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSS 655
VPMTDM +RP S NPGRTYR+Y+ K V PFG GL Y+ ++ + N S
Sbjct: 585 VPMTDMTLRPG--SNNPGRTYRWYD-KAVLPFGFGLHYTTFNVSWNHAEYGP--YNTDSV 639
Query: 656 TKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRG-NGR 714
N V TE +T +I V N G +A + LLF+ R G
Sbjct: 640 ASGTTNAPV---------DTELFDT----FSITVTNTGNVASDYIALLFLTADRVGPEPY 686
Query: 715 PIKQLVGFQSVI-LNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG-DEEYPI 772
PIK LVG+ + + ++ ++S S++R E+G +V+ G++ L V +++P
Sbjct: 687 PIKTLVGYSRAKGIEPGQSQQVKLDVS-VGSVARTAENGDLVLYPGSYKLEVDVGQDFPT 745
Query: 773 SIFV 776
+ F
Sbjct: 746 ATFT 749
>gi|393247584|gb|EJD55091.1| beta-xylosidase [Auricularia delicata TFB-10046 SS5]
Length = 763
Score = 489 bits (1260), Expect = e-135, Method: Compositional matrix adjust.
Identities = 277/724 (38%), Positives = 400/724 (55%), Gaps = 37/724 (5%)
Query: 46 CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
C TT RA+ L+ T +E ++ VN +P +PRLG+P Y+WWSEALHGVAG G+
Sbjct: 37 CNTTANFMDRAKALIDEFTTEELVNNTVNGSPGVPRLGLPPYQWWSEALHGVAGANPGVH 96
Query: 106 F---NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
F ATSFPQ IL A+FD L + + I EARA N G + G+ F+ PNIN
Sbjct: 97 FAPAGEDFDHATSFPQPILMGAAFDDELIHEVATVISTEARAFNNFGFS-GIDFFTPNIN 155
Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL 222
FRDPRWGRGQETPGEDPL +Y V +QG G + A CKHF YDL
Sbjct: 156 PFRDPRWGRGQETPGEDPLHISRYVFQLVTALQG----GLGPSPYYKIVADCKHFAGYDL 211
Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
++W+G R+ FDA +T QDLA+ Y P F+SCV+ + +MC+YN VNG+P+CA LL
Sbjct: 212 ESWEGIDRFHFDAVITTQDLAEFYTPSFQSCVRDAKVGSVMCSYNSVNGVPACASSYLLQ 271
Query: 283 KTARRQWGF-HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA 341
R +G G+ITSDCDAV ++ + + +A LKAG DV+CG+ +
Sbjct: 272 DIVRDFYGLGDGWITSDCDAVQNVFTTHNFTTTQANASAISLKAGTDVDCGNVYAQSLGD 331
Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQ 401
A+ Q + E ++ +AL L+ +R G F+ +P QPF ++G V +PA + LAL AA+
Sbjct: 332 ALDQGLVEEDDLKQALVRLYGSLVRTGYFD-SPEEQPFRQLGWADVDTPASRRLALLAAE 390
Query: 402 DGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN 461
+GIVLLKN GLLPL ++ ++GP N+ + GNY G + ++P Q + N
Sbjct: 391 EGIVLLKND-GLLPLSSRDVPNVIMVGPWGNATTMMQGNYFGNAPYLVSPRQGFVDAGFN 449
Query: 462 TVYYPG-CDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQEL 520
++ G T ++ D+AV A D +V + G D E+E DR+++ PG Q +L
Sbjct: 450 VTFFNGTVGTNGTDTSGFDEAVAAAGDTDLIVFVGGPDNVVERESRDRINITWPGVQLDL 509
Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
I +A KP+I++ + G VD T+ K I +++W GYPG++G ALA ++ G P
Sbjct: 510 IKELA-GVGKPMIVLQMGAGQVDDTWLKESDAINALIWGGYPGQSGGTALANIVTGKTAP 568
Query: 581 GGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKF 640
RLP+T YP+DYI +PMTDM +RP +S PGRTY+++ G+ +F FG GL YSK+ + +
Sbjct: 569 AARLPITQYPEDYISLPMTDMNVRPSNSS--PGRTYKWFTGEPIFEFGFGLHYSKFDFAW 626
Query: 641 KAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHP 700
+ D+V S P + F V V N G +A
Sbjct: 627 AEEPPASFAIG-----------DLVANASSP---VDLATFHTFQVN--VTNLGPVASDFV 670
Query: 701 VLLFVKPARRGNGRPIKQLVGF---QSVILNAKEKAEIVFELSPCESLSRAREDGLMVIE 757
+LF + P+K+LVG+ ++ + A A + L +++RA EDG V+
Sbjct: 671 AMLFGNTTAGPSPAPLKELVGYTRLTNIPVGATVTASVPVTLG---TIARADEDGNSVLF 727
Query: 758 EGTH 761
G +
Sbjct: 728 PGQY 731
>gi|332982588|ref|YP_004464029.1| glycoside hydrolase [Mahella australiensis 50-1 BON]
gi|332700266|gb|AEE97207.1| glycoside hydrolase family 3 domain protein [Mahella australiensis
50-1 BON]
Length = 714
Score = 489 bits (1259), Expect = e-135, Method: Compositional matrix adjust.
Identities = 290/746 (38%), Positives = 414/746 (55%), Gaps = 83/746 (11%)
Query: 41 ETFPFCK-TTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAG 99
E P K +L RA+DLVSR+TL EKISQ++ APAIPRL IPAY WW+E LHGVA
Sbjct: 8 EKLPAYKDVSLSFEDRAKDLVSRMTLPEKISQMIYDAPAIPRLDIPAYNWWNECLHGVAR 67
Query: 100 VGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY-----NAGQAI-- 152
G AT FPQ I AA+F+ L +++ +AI EARA + N + I
Sbjct: 68 AGI----------ATVFPQAIAMAATFNPELIHKVAEAISDEARAKHHEAVRNGDRGIYK 117
Query: 153 GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASA 212
G+TFW+PNINIFRDPRWGRG ET GEDP +T + V++V+G+QGD L+ A
Sbjct: 118 GLTFWSPNINIFRDPRWGRGHETYGEDPYLTSRMGVAFVKGLQGDD------PKYLKVVA 171
Query: 213 CCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGI 272
KH+ + + + R+ FDARV+ +DL +TY P FE CVK+G+A IM AYNR NG
Sbjct: 172 TPKHYAVH---SGPESQRHSFDARVSQKDLRETYLPAFEECVKEGKAVSIMGAYNRTNGE 228
Query: 273 PSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG 332
P CA + LL R +WGF GY+ SDC A+ I+ K+ ++ + G ++NCG
Sbjct: 229 PCCASKTLLKDILRDEWGFDGYVVSDCGAIDDIHMHHKVTKTAAESAALAVNNGCELNCG 288
Query: 333 SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAH 392
+ + AV+Q + E ID+A+ LF+ RMRLG+F+ P M + I DV SP H
Sbjct: 289 KTYE-YLCQAVEQGLISEETIDQAVIKLFTARMRLGMFD-PPEMVRYAHIPYDVNDSPEH 346
Query: 393 QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
+ LAL+ A+ IVLLKN +LPL K K ++A+IGPNA+ LL NY G + +TPL
Sbjct: 347 RELALETARQSIVLLKNDENILPLSK-KLKTIAVIGPNADDLDVLLANYFGTPSKYVTPL 405
Query: 453 QALQNYVE---NTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---- 505
+ ++N V +Y GC+ S D+AV+IA+ AD V++ +GL E EE
Sbjct: 406 EGIKNKVSPDTKVLYAKGCEVTGNSVDGFDEAVNIAEMADIVIMCLGLSPRIEGEEGDVA 465
Query: 506 -----LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAG 560
DR+ + LPG Q++L+ + KP++LVLL G + I +A ++ +I+ A
Sbjct: 466 DSDGGGDRLHIDLPGMQEQLLETIY-GTGKPIVLVLLNGSAIAINWAH--EHVPAIIEAW 522
Query: 561 YPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYE 620
YPGE G A+A+V+FGD+NP GRLP+T+ P TD M+ GRTYR++E
Sbjct: 523 YPGEEGGTAIADVLFGDYNPAGRLPITFVRSLDDLPPFTDYNMK--------GRTYRYFE 574
Query: 621 GKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCET 680
+ ++PFG GLSY+ + Y +S +L N D
Sbjct: 575 KEPLYPFGYGLSYTSFKYSNLRLSAMRLPAGN--------NLD----------------- 609
Query: 681 RKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELS 740
+ + V+N G++AG+ V L++ P++QL G Q + L +K + F +
Sbjct: 610 ----INVDVENTGKLAGREVVQLYISDVEASVEVPMRQLCGIQCITLEPGQKQTVSFTVE 665
Query: 741 PCESLSRAREDGLMVIEEGTHFLVVG 766
P + +S DG ++E G + VG
Sbjct: 666 P-QHMSLFDYDGKRILEPGQFIIAVG 690
>gi|115436902|ref|XP_001217674.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|121734342|sp|Q0CB82.1|BXLB_ASPTN RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
Flags: Precursor
gi|114188489|gb|EAU30189.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 765
Score = 489 bits (1258), Expect = e-135, Method: Compositional matrix adjust.
Identities = 298/772 (38%), Positives = 432/772 (55%), Gaps = 49/772 (6%)
Query: 13 LCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQL 72
+ C SL T++ Q PF D N C TTL RA+ L++ +TL+EKI+
Sbjct: 12 ILACIVSL-TQLGFAQSPFP-DCENGPLSKNAVCDTTLDPVTRAQALLAAMTLEEKINNT 69
Query: 73 VNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFF--NGTIRGATSFPQVILTAASFDSYL 130
++P +PRLG+PAY WWSEALHGVAG G+ F +G ATSFP I A+FD L
Sbjct: 70 QYNSPGVPRLGLPAYNWWSEALHGVAG-SPGVHFADSGNFSYATSFPSPITLGAAFDDDL 128
Query: 131 WYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSY 190
+I IG E RA NAG A G+ +W PNIN +RDPRWGRGQETPGEDP T +Y
Sbjct: 129 VKQIATVIGTEGRAFGNAGHA-GLDYWTPNINPYRDPRWGRGQETPGEDPFHTSRYVYHL 187
Query: 191 VRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPF 250
+ G+Q G K K+ A+ CKHF YD+++W+G RY FDA ++ QD+A+ Y PPF
Sbjct: 188 IDGLQDGI---GPEKPKIVAT--CKHFAGYDIEDWEGNERYAFDAVISDQDMAEYYFPPF 242
Query: 251 ESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHG---YITSDCDAVSIIYD 307
++C + + +MC+YN VNGIP+CAD LL R W + G ++TSDC A+ IY
Sbjct: 243 KTCTRDAKVDAVMCSYNSVNGIPTCADPWLLQTVLREHWEWEGVGHWVTSDCGAIDNIYK 302
Query: 308 AEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRL 367
Y A + AG D++CGS + +A+ Q L +DRAL L+S ++L
Sbjct: 303 DHKYVADGAHAAAVAVNAGTDLDCGSVYPQFLGSAISQGLLGNRTLDRALTRLYSSLVKL 362
Query: 368 GLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALI 427
G F+ QP+ IG V +P + LA AA +G VLLKN G LPL K+ +V A++
Sbjct: 363 GYFD-PAADQPYRSIGWSDVATPDAEQLAHTAAVEGTVLLKND-GTLPLKKNGTV--AIV 418
Query: 428 GPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKG 487
GP AN+ L GNY G + T L A Y PG + S++ ++A++ AKG
Sbjct: 419 GPYANATTQLQGNYEGTAKYIHTMLSAAAQQGYKVKYAPGTGINSNSTSGFEQALNAAKG 478
Query: 488 ADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFA 547
+D V+ G+D E E LDR + PG Q +LI ++++ KKP+++V GG VD +
Sbjct: 479 SDLVIYFGGIDHEVEAEALDRTSIAWPGNQLDLIQQLSD-LKKPLVVVQFGGGQVDDSSL 537
Query: 548 KYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQ 606
+ + +LWAGYP +AG A+ +++ G P GRLP+T YP++Y+ +VPMTDM +RP
Sbjct: 538 LSNAGVNGLLWAGYPSQAGGAAVFDILTGKTAPAGRLPVTQYPEEYVDQVPMTDMNLRPG 597
Query: 607 ATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVH 666
+ NPGRTYR+Y+ K V PFG G+ Y+ + +K K Y +++ EN
Sbjct: 598 PS--NPGRTYRWYD-KAVIPFGYGMHYTTFDVSWK----RKNYGPYNTAAVKAEN----- 645
Query: 667 YKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRG-NGRPIKQLVGFQSV 725
ET ++ VKN G++ + L+F+ G PIK LVG+Q V
Sbjct: 646 ---------AVLET----FSLQVKNTGKVTSDYVALVFLTTTDAGPKPYPIKTLVGYQRV 692
Query: 726 -ILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG-DEEYPISIF 775
+ E+ + +++ S++R +G +V+ G++ L V +++YP + F
Sbjct: 693 KAIRPGERKVVDIDVT-VGSVARTAANGDLVLYPGSYKLQVDVEKDYPTAGF 743
>gi|392590128|gb|EIW79457.1| glycoside hydrolase family 3 protein [Coniophora puteana RWD-64-598
SS2]
Length = 770
Score = 486 bits (1251), Expect = e-134, Method: Compositional matrix adjust.
Identities = 289/730 (39%), Positives = 407/730 (55%), Gaps = 38/730 (5%)
Query: 46 CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
C T+L +QRA LV T++E I+ VN +P +PRLG+PAY+WWSE LHGVA G+
Sbjct: 37 CDTSLNATQRAAALVELFTVEELINNTVNGSPGVPRLGLPAYQWWSEGLHGVAD-SPGVN 95
Query: 106 FN--GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
F+ G ATSFPQ I+ +A+FD L +G +G+E R+ N G A G+ FW PNIN
Sbjct: 96 FSTSGPFSYATSFPQPIVMSAAFDDALIKAVGGVVGMEGRSFNNYGHA-GLDFWTPNINP 154
Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
F+DPRWGRGQETPGEDP +Y + ++G+QG G + Q A CKHF YDL+
Sbjct: 155 FKDPRWGRGQETPGEDPYHIAQYVYNLIQGLQG----GVNPEPYFQVVATCKHFAGYDLE 210
Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
+W+ RY FDA +T QDL++ Y P F+SC + +A MC+YN VNGIP+CAD LL
Sbjct: 211 DWENNFRYGFDALITTQDLSEFYLPSFQSCYRDAQAGASMCSYNAVNGIPTCADTYLLQD 270
Query: 284 TARRQWGFHG--YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA 341
R W F ++TSDCDAV IY+ Y P+ A D L+AG D++CG+F ++
Sbjct: 271 ILRDYWNFDETRWVTSDCDAVENIYNPHNYTALPQQAAADALRAGTDLDCGTFYTEYLPL 330
Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQ 401
A Q + E+E+ AL ++ +RLG F+ QP+ + G V +P Q LA AA
Sbjct: 331 AYNQSLITETELRAALTRQYASLVRLGYFD-PAAQQPYRQYGWSNVDTPYAQQLAYTAAT 389
Query: 402 DGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN 461
+GI LLKN G LPLP + ++ALIGP AN+ + GNY G + ++PLQ N
Sbjct: 390 EGITLLKND-GTLPLPSTLK-NIALIGPWANATNQMQGNYFGVAPYLVSPLQGALAAGYN 447
Query: 462 TVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELI 521
Y G + + S+A A+ A+ AD VV G+D T E E +DR ++ PG Q +LI
Sbjct: 448 VTYVFGTNITSNSTAGFAAAIAAAREADAVVYAGGIDVTVEAEAMDRYNVTWPGNQLQLI 507
Query: 522 TRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
+A A KP ++ GG VD T K + ++ S++WAGYPG++G AL ++I G P
Sbjct: 508 GELA-ALGKPFVVAQFGGGQVDDTEIKANASVNSLIWAGYPGQSGGQALFDIISGKVAPA 566
Query: 582 GRLPMTWYPQDYI-KVPMTDMKMRPQAT-SGNPGRTYRFYEGKEVFPFGCGLSYSKYSYK 639
GRL T YP DY+ ++PMTDM +RP A + +PGRTY++Y G V+ FG GL Y+ ++Y
Sbjct: 567 GRLVTTQYPADYVYEIPMTDMNLRPNANGTTSPGRTYKWYTGAPVYEFGYGLHYTNFTYT 626
Query: 640 FKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKF-LVTIGVKNHGEMAGK 698
+ +ST + Q +V S G + F +++ V N G +
Sbjct: 627 WTKA---------PASTYNI--QTLVSAAS----GAAHIDLAPFDTLSVAVTNAGAVTSD 671
Query: 699 HPVLLFVKPARRGNGRPIKQLVGF---QSVILNAKEKAEIVFELSPCESLSRAREDGLMV 755
+ LLFV P K L + SV A + A L+ ++RA G
Sbjct: 672 YSALLFVNGTYGPAPYPNKALAAYTRLHSVAAGAAQTATFDVVLN---QIARADAYGNFW 728
Query: 756 IEEGTHFLVV 765
+ G + L +
Sbjct: 729 LYPGAYELAL 738
>gi|317158006|ref|XP_001826724.2| exo-1,4-beta-xylosidase xlnD [Aspergillus oryzae RIB40]
Length = 776
Score = 486 bits (1251), Expect = e-134, Method: Compositional matrix adjust.
Identities = 302/784 (38%), Positives = 429/784 (54%), Gaps = 44/784 (5%)
Query: 2 IFHKLSLVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVS 61
+ H L+ PL F + L +S P S P + + C T+L RA+ LV+
Sbjct: 1 MVHLSPLLRPLAAFSFFTSLASTESIFPDCSTGPLSKNN----VCDTSLDPVSRAKSLVA 56
Query: 62 RLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFN--GTIRGATSFPQV 119
+TL+EKI+ + PRLG+PAY WW+EALHGVA G G+ F+ G ATSFP
Sbjct: 57 AMTLEEKINNTKYDSSGAPRLGLPAYNWWNEALHGVA-EGHGVSFSDSGNFSYATSFPMP 115
Query: 120 ILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGED 179
IL A+FD L ++ I EARA N G A G+ +W PNIN FRDPRWGRGQETPGED
Sbjct: 116 ILLGAAFDDDLVKQVATVISTEARAFANGGHA-GLDYWTPNINPFRDPRWGRGQETPGED 174
Query: 180 PLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTM 239
PL +Y V G+Q G + + A CKHF AYDL+NW+G RY FDA V+
Sbjct: 175 PLHLSRYVYHLVDGLQD-----GIGPERPKVVATCKHFAAYDLENWEGIERYAFDAVVSP 229
Query: 240 QDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFH---GYIT 296
QDL++ Y P F++C + + +MC+YN +NGIP+CADR LL R WG+ ++T
Sbjct: 230 QDLSEYYLPSFKTCTRDAKVDAVMCSYNSLNGIPTCADRWLLQTLLREHWGWEQTGHWVT 289
Query: 297 SDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRA 356
DC A+ IY Y A L AG D++CGS ++ +A++Q ++ A
Sbjct: 290 GDCGAIDNIYADHHYVADGAHAAAAALNAGTDLDCGSVFPEYLGSALQQGLYNNQTLNNA 349
Query: 357 LHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPL 416
L L+S ++LG F+ QP+ IG + V +PA + LA +A +GIV+LKN G LPL
Sbjct: 350 LIRLYSSLVKLGYFD-PADDQPYRSIGWNEVFTPAAEELAHKATVEGIVMLKND-GTLPL 407
Query: 417 PKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSA 476
+ +V A+IGP AN+ L GNY GP T + A + + G D + SSA
Sbjct: 408 KSNGTV--AIIGPFANATTQLQGNYEGPPKYIRTLIWAAVHNGYKVKFSQGTDINSNSSA 465
Query: 477 SIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVL 536
+A+ AK AD V+ G+D T EKE DR +V PG Q +LI ++++ +KP+I+V
Sbjct: 466 GFAEAISAAKEADTVIYAGGIDNTIEKESQDRTTIVWPGNQLDLIEQLSD-LEKPLIVVQ 524
Query: 537 LCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-K 595
GG VD + + +G++LWAGYP +AG A+ +++ G P GRLP+T YP Y+ +
Sbjct: 525 FGGGQVDDSSLLANAGVGALLWAGYPSQAGGAAVFDILTGKSAPAGRLPVTQYPASYVDE 584
Query: 596 VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSS 655
VPMTDM +RP S NPGRTYR+Y+ K V PFG GL Y+ ++ + N S
Sbjct: 585 VPMTDMTLRPG--SNNPGRTYRWYD-KAVLPFGFGLHYTTFNVSWNHAEYGP--YNTDSV 639
Query: 656 TKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRG-NGR 714
N V TE +T +I V N G +A + LLF+ G
Sbjct: 640 ASGTTNAPV---------DTELFDT----FSITVTNTGNVASDYIALLFLTADGVGPEPY 686
Query: 715 PIKQLVGFQSVI-LNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG-DEEYPI 772
PIK LVG+ + + ++ ++S S++R E+G +V+ G++ L V +++P
Sbjct: 687 PIKTLVGYSRAKGIEPGQSQQVKLDVS-VGSVARTAENGDLVLYPGSYKLEVDVGQDFPT 745
Query: 773 SIFV 776
+ F
Sbjct: 746 ATFT 749
>gi|409079878|gb|EKM80239.1| hypothetical protein AGABI1DRAFT_120267 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 786
Score = 483 bits (1242), Expect = e-133, Method: Compositional matrix adjust.
Identities = 282/746 (37%), Positives = 424/746 (56%), Gaps = 43/746 (5%)
Query: 34 DPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEA 93
D +N ++ P C + + RA+ L+ T DE I N++P +PRLG+P YEWWSEA
Sbjct: 26 DCTNGPLKSTPVCDSAKDPATRAQSLIQMFTDDELIQNGDNASPGVPRLGLPPYEWWSEA 85
Query: 94 LHGVAGVGKGIFF--NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA 151
LHGV G G+ F +G ATSFPQ I+ A+FD L + + EARA N G+A
Sbjct: 86 LHGV-GHSPGVVFAPSGDFSSATSFPQPIVIGAAFDDDLVKAVANVVSTEARAFNNFGRA 144
Query: 152 IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQAS 211
G+ ++ PNIN F+DPRWGRGQETPGEDP +Y V G+QG G ++ +
Sbjct: 145 -GLNYFTPNINPFKDPRWGRGQETPGEDPFHLSQYVYHLVDGLQG----GIDPWPYIKVA 199
Query: 212 ACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
A CKHF AYDL+NW+G R+ FDA+V+ QDL++ Y PPF+SCV+ +A+ +MC+YN VNG
Sbjct: 200 ADCKHFAAYDLENWEGIDRFHFDAQVSQQDLSEYYLPPFQSCVRDAKAASVMCSYNSVNG 259
Query: 272 IPSCADRNLLSKTARRQWGFHG--YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDV 329
+P+CA LL R WGF ++TSDC A+ I+D+ + +S +A LKAG D+
Sbjct: 260 VPACASTYLLQDILRDAWGFDDDRWVTSDCWALDKIFDSHNFTRSFAEAAAISLKAGTDI 319
Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
+CGS H AA+ Q + ++ RA ++ +RLG F+ + + Q + + V +
Sbjct: 320 DCGSTFADHLPAALNQSLISRDDLTRAFIRQYTSLIRLGYFDPSDS-QTYRQFDWSDVNT 378
Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPL-PKSKSVSLALIGPNANSAKTLLGNYAGPSCRS 448
P Q L+ +AA +G+VLLKN GLLPL P K++ A+IGP N+ ++ GNY G +
Sbjct: 379 PEAQALSRRAAVEGLVLLKND-GLLPLAPDGKTI--AIIGPYTNATSSMQGNYFGNAPII 435
Query: 449 ITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDR 508
+P Q Q+ V G SSA +A++ AK AD VV + G+D T E+E LDR
Sbjct: 436 TSPFQGAQDVGFKVVSAAGTTVNGTSSAGFAEAINTAKAADVVVFVGGIDNTLEREGLDR 495
Query: 509 VDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAV 568
+ PG Q +L+ +A KP+I+V GG VD T ++ + +I+WAGYPG++G
Sbjct: 496 SSISWPGNQLDLVKDLASLG-KPLIVVQFGGGQVDDTEILANKKVQAIIWAGYPGQSGGT 554
Query: 569 ALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPF 627
A+ ++I G P GRLP+T YP DY +V MTDM +RP +S NPGRTY++Y+ V +
Sbjct: 555 AIFDIIVGSTAPAGRLPVTQYPADYTHQVRMTDMSLRP--SSHNPGRTYKWYK-TPVLEY 611
Query: 628 GCGLSYSKYSYKFK---AVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFL 684
G GL ++ + + ++ A + L ++S +K + D+ H+ + E C
Sbjct: 612 GHGLHFTTFDFSWQRQPAAEYDIQELIRASHSKFL---DLAHFDTF-----EIC------ 657
Query: 685 VTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSV-ILNAKEKAEIVFELSPCE 743
V+N G + + LLF+ PIK LV + V + A + +++
Sbjct: 658 ----VRNTGNITSDYVGLLFLSGNTGPGPHPIKSLVAYSRVHDIQGGTSATLTLKVT-LG 712
Query: 744 SLSRAREDGLMVIEEGTHFLVVGDEE 769
S++R ++G + + G + LV+ ++
Sbjct: 713 SVARVDKNGDLWLFPGPYRLVLDTKD 738
>gi|395334835|gb|EJF67211.1| beta-xylosidase [Dichomitus squalens LYAD-421 SS1]
Length = 774
Score = 481 bits (1238), Expect = e-133, Method: Compositional matrix adjust.
Identities = 290/730 (39%), Positives = 408/730 (55%), Gaps = 26/730 (3%)
Query: 46 CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
C T+ RA L+ T +E + VN++P +PRLG+PAY WWSE LHGVA G+
Sbjct: 41 CDTSKDPITRATALIDLWTDEELTNNTVNASPGVPRLGLPAYNWWSEGLHGVA-QSPGVT 99
Query: 106 F--NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
F +G ATSFPQ IL A+FD L + + E RA N G+A G+ +W PNIN
Sbjct: 100 FAPSGNFSYATSFPQPILMGAAFDDPLIQAVASVVSTEGRAFNNVGRA-GLDYWTPNINP 158
Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
F+DPRWGRGQETPGEDP Y + + G+QG G + A CKHF AYD+D
Sbjct: 159 FKDPRWGRGQETPGEDPFHLQGYVYNLILGLQG----GLDPTPYFKVVADCKHFAAYDMD 214
Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
NW+G RY F+A VT QDL++ Y P F++CV+ + + +MC+YN VNGIPSCA+ LL
Sbjct: 215 NWEGNVRYGFNAVVTQQDLSEYYLPSFQTCVRDAKVASVMCSYNAVNGIPSCANSFLLQD 274
Query: 284 TARRQWGFHG--YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA 341
R WGF ++TSDCDAV IY Y +P A D L AG D++CG+F +
Sbjct: 275 ILRDYWGFDDTRWVTSDCDAVQNIYTPHNYTDNPAQAAADALLAGTDIDCGTFSSTYLPD 334
Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQ 401
A+ Q + +++ RA ++ +RLG F+ P QP+ ++G V +P Q LA AA
Sbjct: 335 ALSQGLVNATDLKRAAIRQYASLVRLGYFD-PPESQPYRQLGWSDVNTPEAQQLAHTAAV 393
Query: 402 DGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN 461
+G+VLLKN G LPL K LALIGP AN+ + GNYAG + I+PL Q +
Sbjct: 394 EGMVLLKND-GTLPLSKHVR-KLALIGPWANATTLMQGNYAGIAPYLISPLLGAQQAGFD 451
Query: 462 TVYYPGCD-TVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQEL 520
Y G + T ++ AV AK AD V+ GLD+T E+EE+DR+++ PG Q +L
Sbjct: 452 VEYVFGTNVTTTNDTSGFAAAVAAAKRADAVIFAGGLDETVEREEVDRLNVTWPGNQLDL 511
Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
+ +A KP+I+ GG +D + K R++ +I+W GYPG++G AL +++ G P
Sbjct: 512 VAELASVG-KPLIVAQFGGGQLDDSALKSKRSVNAIIWGGYPGQSGGTALFDILTGKAAP 570
Query: 581 GGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYK 639
GRLP+T YP +Y +VPMTDM +RP AT NPGRTY++Y G VF FG GL Y+ +S+
Sbjct: 571 AGRLPITQYPAEYANQVPMTDMTLRPSAT--NPGRTYKWYTGTPVFEFGFGLHYTTFSFA 628
Query: 640 FKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKH 699
+ + + N +++ ++ KS L +T + V N G+M +
Sbjct: 629 WAS----NAHANTPAASYSIDALMASGNKSAAFLDLAPLDT----FAVRVTNTGKMTSDY 680
Query: 700 PVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEG 759
LLF P KQLV + V A +++ I +++RA E G + G
Sbjct: 681 VALLFASGTFGPAPHPNKQLVAYTRVHGVAPKQSTIAELTVTLGAIARADESGAKWVYPG 740
Query: 760 THFLVVGDEE 769
T+ L + E
Sbjct: 741 TYTLALDTTE 750
>gi|426198356|gb|EKV48282.1| hypothetical protein AGABI2DRAFT_67675 [Agaricus bisporus var.
bisporus H97]
Length = 763
Score = 481 bits (1238), Expect = e-133, Method: Compositional matrix adjust.
Identities = 282/746 (37%), Positives = 424/746 (56%), Gaps = 43/746 (5%)
Query: 34 DPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEA 93
D +N ++ P C +T + RA+ L+ T DE I N++P +PRLG+P YEWWSEA
Sbjct: 26 DCTNGPLKSTPVCDSTKDPATRAQSLIQMFTDDELIQNGDNASPGVPRLGLPPYEWWSEA 85
Query: 94 LHGVAGVGKGIFF--NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA 151
LHGV G G+ F +G ATSFPQ I+ A+FD L + + EARA N G+A
Sbjct: 86 LHGV-GHSPGVVFAPSGDFSSATSFPQPIVIGAAFDDDLVKAVANVVSTEARAFNNFGRA 144
Query: 152 IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQAS 211
G+ ++ PNIN F+DPRWGRGQETPGEDP +Y V G+QG G ++ +
Sbjct: 145 -GLNYFTPNINPFKDPRWGRGQETPGEDPFHLSQYVYHLVDGLQG----GIDPWPYIKVA 199
Query: 212 ACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
A CKHF AYDL+NW+G R+ FDA+V+ QDL++ Y PPF+SCV+ +A+ +MC+YN VNG
Sbjct: 200 ADCKHFAAYDLENWEGIDRFHFDAQVSQQDLSEYYLPPFQSCVRDAKAASVMCSYNSVNG 259
Query: 272 IPSCADRNLLSKTARRQWGFHG--YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDV 329
+P+CA LL R WGF ++TSDC A+ I+D+ + +S +A LKAG D+
Sbjct: 260 VPACASTYLLQDILRDAWGFDDDRWVTSDCWALDKIFDSHNFTRSFAEAAAISLKAGTDI 319
Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
+CGS H AA+ Q + ++ RA ++ +RLG F+ + + Q + + V +
Sbjct: 320 DCGSTFADHLPAALNQSLISRDDLTRAFIRQYTSLIRLGYFDPSHS-QTYRQFDWSDVNT 378
Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPL-PKSKSVSLALIGPNANSAKTLLGNYAGPSCRS 448
P Q L+ +AA +G+VLLKN GLLPL P K++ A+IGP N+ ++ GNY G +
Sbjct: 379 PEAQALSRRAAVEGLVLLKND-GLLPLAPDGKTI--AIIGPYTNATSSMQGNYFGNAPFI 435
Query: 449 ITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDR 508
+P Q Q+ V G SSA +A++ A+ AD VV + G+D T E+E LDR
Sbjct: 436 TSPFQGAQDVGFKVVSAAGTIVNGTSSAGFAEAINTARAADVVVFVGGIDNTLEREGLDR 495
Query: 509 VDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAV 568
+ PG Q +L+ +A KP+I+V GG VD T + + +I+WAGYPG++G
Sbjct: 496 SSISWPGNQLDLVKDLASLG-KPLIVVQFGGGQVDDTEILANEKVQAIIWAGYPGQSGGT 554
Query: 569 ALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPF 627
A+ ++I G P GRLP+T YP DY +V MTDM +RP +S NPGRTY++Y+ V +
Sbjct: 555 AIFDIIVGATAPAGRLPVTQYPADYTHQVRMTDMSLRP--SSHNPGRTYKWYK-TPVLEY 611
Query: 628 GCGLSYSKYSYKFK---AVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFL 684
G GL ++ + + ++ A + L ++S +K + D+ H+ + E C
Sbjct: 612 GHGLHFTTFDFSWQRQPAAEYDIQELIRASHSKFL---DLAHFDTF-----EIC------ 657
Query: 685 VTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSV-ILNAKEKAEIVFELSPCE 743
V+N G + + LLF+ PIK LV + V + A + +++
Sbjct: 658 ----VRNTGNITSDYVGLLFLSGNSGPGPHPIKSLVAYSRVHDIQGGTSATLTLKVT-LG 712
Query: 744 SLSRAREDGLMVIEEGTHFLVVGDEE 769
S++R ++G + + G + LV+ ++
Sbjct: 713 SVARVDKNGDLWLFPGPYRLVLDTKD 738
>gi|452989371|gb|EME89126.1| glycoside hydrolase family 3 protein [Pseudocercospora fijiensis
CIRAD86]
Length = 790
Score = 480 bits (1235), Expect = e-132, Method: Compositional matrix adjust.
Identities = 280/730 (38%), Positives = 400/730 (54%), Gaps = 37/730 (5%)
Query: 46 CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV-GKGI 104
C T RA+ L++ TL EKI+ +++P +PRLG+ YEWW EALHGVA G
Sbjct: 39 CDTAADPLTRAKALIAEFTLAEKINNTGSTSPGVPRLGLLPYEWWQEALHGVASSPGVNF 98
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIF 164
+G R ATSFPQ IL A+FD L + + I EARA N +A G+ FW PNIN F
Sbjct: 99 SVSGEFRYATSFPQPILMGAAFDDQLIHDVASVISTEARAFSNDDRA-GLDFWTPNINPF 157
Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
+DPRWGRGQETPGEDP Y S +RG+QGD + K+ A CKHF AYD++N
Sbjct: 158 KDPRWGRGQETPGEDPYHLSSYVHSLIRGLQGDNPSYKKVV------ATCKHFVAYDVEN 211
Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
W G RY+ DA + QDL + Y PPF SC + MC+YN +NG+P+CAD LL
Sbjct: 212 WNGNFRYQLDAHINSQDLVEYYMPPFRSCARDSNVGAFMCSYNSLNGVPTCADPYLLQTV 271
Query: 285 ARRQWGFHG---YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA 341
R W + ++TSDCD+V ++ YA S E+A LKAG D+NCG++ Q+H
Sbjct: 272 LREHWNWTAEEQWVTSDCDSVQNVFLYHNYASSREEAAAISLKAGTDINCGTYYQEHLPR 331
Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQ 401
A +Q + E+++D +L + +RLG F+G+ P+ + + V +P Q LAL+AA
Sbjct: 332 AYEQGLINETDVDTSLIRQYGSLIRLGYFDGD--RVPYRNLTWNDVSTPYAQDLALKAAT 389
Query: 402 DGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN 461
GI LLKN G+LPL + +ALIG AN+ +LGNY G +PL A Q
Sbjct: 390 SGITLLKND-GILPLQITNGTKIALIGDWANATDQMLGNYHGIPPYFHSPLWAAQQTGAE 448
Query: 462 TVYYPGCDTVACSSASIDKAV-DIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQEL 520
Y G + + + + A +D ++ + G+D+ E EE DRV + G Q ++
Sbjct: 449 VTYVQGPGGQSDPTTYTWRPIWSAANKSDVIIYIGGMDERVEAEEKDRVSIAWSGPQLDV 508
Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
I ++A+ KP I+V + GG +D + + NI ++LW GYPG+ G A+ +++ G P
Sbjct: 509 IGQLADYYDKPTIVVQMGGGSLDSSPLVKNPNIRALLWGGYPGQDGGKAIFDILQGISAP 568
Query: 581 GGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYK 639
GRLP+T Y DYI KVPMTD +RP ATSG+PGRTY + + VF FG GL Y+ ++
Sbjct: 569 AGRLPITQYRADYISKVPMTDTSLRPNATSGSPGRTYIWLNEEPVFEFGYGLHYTNFTAT 628
Query: 640 F-KAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGK 698
A S + Y S ++ E+ + C + F +I V N G +
Sbjct: 629 IPDAESSDTTYSIDSLASDCTESY------------LDRCPFKTF--SIDVTNTGSVTSD 674
Query: 699 HPVLLFVKPARRGNGRPIKQLVGFQ---SVILNAKEKAEIVFELSPCESLSRAREDGLMV 755
+ L F+ A P K+LV +Q ++ + + A + L SLSR + G V
Sbjct: 675 YVTLGFLTGAHGPEPCPNKRLVSYQRLHNITAGSTQTAALNLTLG---SLSRVDDKGNTV 731
Query: 756 IEEGTHFLVV 765
+ G++ L+V
Sbjct: 732 LFPGSYALLV 741
>gi|121797681|sp|Q2TYT2.1|BXLB_ASPOR RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
Flags: Precursor
gi|83775471|dbj|BAE65591.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 797
Score = 479 bits (1233), Expect = e-132, Method: Compositional matrix adjust.
Identities = 298/769 (38%), Positives = 423/769 (55%), Gaps = 44/769 (5%)
Query: 17 FTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSA 76
F + T +S P S P + + C T+L RA+ LV+ +TL+EKI+ +
Sbjct: 37 FNWVSTSTESIFPDCSTGPLSKNN----VCDTSLDPVSRAKSLVAAMTLEEKINNTKYDS 92
Query: 77 PAIPRLGIPAYEWWSEALHGVAGVGKGIFFN--GTIRGATSFPQVILTAASFDSYLWYRI 134
PRLG+PAY WW+EALHGVA G G+ F+ G ATSFP IL A+FD L ++
Sbjct: 93 SGAPRLGLPAYNWWNEALHGVA-EGHGVSFSDSGNFSYATSFPMPILLGAAFDDDLVKQV 151
Query: 135 GQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGV 194
I EARA N G A G+ +W PNIN FRDPRWGRGQETPGEDPL +Y V G+
Sbjct: 152 ATVISTEARAFANGGHA-GLDYWTPNINPFRDPRWGRGQETPGEDPLHLSRYVYHLVDGL 210
Query: 195 QGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCV 254
Q G + + A CKHF AYDL+NW+G RY FDA V+ QDL++ Y P F++C
Sbjct: 211 QD-----GIGPERPKVVATCKHFAAYDLENWEGIERYAFDAVVSPQDLSEYYLPSFKTCT 265
Query: 255 KQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFH---GYITSDCDAVSIIYDAEGY 311
+ + +MC+YN +NGIP+CADR LL R WG+ ++T DC A+ IY Y
Sbjct: 266 RDAKVDAVMCSYNSLNGIPTCADRWLLQTLLREHWGWEQTGHWVTGDCGAIDNIYADHHY 325
Query: 312 AKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFN 371
A L AG D++CGS ++ +A++Q ++ AL L+S ++LG F+
Sbjct: 326 VADGAHAAAAALNAGTDLDCGSVFPEYLGSALQQGLYNNQTLNNALIRLYSSLVKLGYFD 385
Query: 372 GNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNA 431
QP+ IG + V +PA + LA +A +GIV+LKN G LPL + +V A+IGP A
Sbjct: 386 -PADDQPYRSIGWNEVFTPAAEELAHKATVEGIVMLKND-GTLPLKSNGTV--AIIGPFA 441
Query: 432 NSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHV 491
N+ L GNY GP T + A + + G D + SSA +A+ AK AD V
Sbjct: 442 NATTQLQGNYEGPPKYIRTLIWAAVHNGYKVKFSQGTDINSNSSAGFAEAISAAKEADTV 501
Query: 492 VLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDR 551
+ G+D T EKE DR +V PG Q +LI ++++ +KP+I+V GG VD + +
Sbjct: 502 IYAGGIDNTIEKESQDRTTIVWPGNQLDLIEQLSD-LEKPLIVVQFGGGQVDDSSLLANA 560
Query: 552 NIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSG 610
+G++LWAGYP +AG A+ +++ G P GRLP+T YP Y+ +VPMTDM +RP S
Sbjct: 561 GVGALLWAGYPSQAGGAAVFDILTGKSAPAGRLPVTQYPASYVDEVPMTDMTLRPG--SN 618
Query: 611 NPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSV 670
NPGRTYR+Y+ K V PFG GL Y+ ++ + N S N V
Sbjct: 619 NPGRTYRWYD-KAVLPFGFGLHYTTFNVSWNHAEYGP--YNTDSVASGTTNAPV------ 669
Query: 671 PELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRG-NGRPIKQLVGFQSVI-LN 728
TE +T +I V N G +A + LLF+ G PIK LVG+ +
Sbjct: 670 ---DTELFDT----FSITVTNTGNVASDYIALLFLTADGVGPEPYPIKTLVGYSRAKGIE 722
Query: 729 AKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG-DEEYPISIFV 776
+ ++ ++S S++R E+G +V+ G++ L V +++P + F
Sbjct: 723 PGQSQQVKLDVS-VGSVARTAENGDLVLYPGSYKLEVDVGQDFPTATFT 770
>gi|297039776|gb|ADH95739.1| beta-xylosidase [Aspergillus fumigatus]
Length = 771
Score = 479 bits (1232), Expect = e-132, Method: Compositional matrix adjust.
Identities = 302/766 (39%), Positives = 422/766 (55%), Gaps = 51/766 (6%)
Query: 32 SCDPSNPSTETFP-----------FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIP 80
S P S TFP C T+L ++ RA+ LV+ +T +EK++ ++P +P
Sbjct: 18 SLAPGALSLNTFPDCSSGPLSKLAVCDTSLDVTTRAQSLVNAMTFEEKVNNTQYNSPGVP 77
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFF--NGTIRGATSFPQVILTAASFDSYLWYRIGQAI 138
RLG+PAY WWSEALHGVAG G+ F +G ATSFPQ IL A+FD L ++ +
Sbjct: 78 RLGLPAYNWWSEALHGVAG-SPGVEFADSGPFSYATSFPQPILLGATFDDDLIKQVATVV 136
Query: 139 GLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDT 198
E RA NAG++ G+ FW PNIN FRD RWGRGQETPGEDPL +Y V G+Q
Sbjct: 137 STEGRAFGNAGRS-GLDFWTPNINPFRDARWGRGQETPGEDPLHVSRYVYHLVDGLQN-- 193
Query: 199 FNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGR 258
G + A CKHF AY L++W G R+ F+A V+ QDL++ Y PPF+SC + R
Sbjct: 194 ---GIGPANPKVVATCKHFAAYGLEDWNGVVRHSFNAEVSTQDLSEFYLPPFKSCARDAR 250
Query: 259 ASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHG---YITSDCDAVSIIYDAEGYAKSP 315
+MC+YN +NG+P+CAD LL R W + +ITSDC A+ IY+ + +P
Sbjct: 251 VDAVMCSYNALNGVPACADSYLLQTILREHWKWDEPGRWITSDCGAIDDIYNGHNFTTTP 310
Query: 316 EDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPT 375
+A L AG D++CG+ K+ A + +DRAL L+S ++LG F+
Sbjct: 311 AEAAATALNAGTDLDCGTVFPKYLGQAADEGLYSNQTLDRALVRLYSSFVKLGYFDPAED 370
Query: 376 MQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAK 435
QP+ IG V +PA + LA +AA +GIVLLKN LPL K+K +LALIGP AN+ K
Sbjct: 371 -QPYRSIGWTDVDTPAVEALAHKAAGEGIVLLKNDK-TLPL-KAKG-TLALIGPYANATK 426
Query: 436 TLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMM 495
+ GNY GP+ T L A + Y G S+A D A+ AK AD VV
Sbjct: 427 QMQGNYEGPAKYIRTLLWAATQAGYDVKYAAGTAINTNSTAGFDAALSAAKQADVVVYAG 486
Query: 496 GLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGS 555
G+D T E E DR + PG Q LI ++++ KP+++V GG VD + + + +
Sbjct: 487 GIDNTIEAEGRDRTTIAWPGNQVNLIDQLSKIG-KPLVVVQFGGGQVDDSSLLSNPRVNA 545
Query: 556 ILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGR 614
+LWAGYP + G A+ +++ G P GRLP+T YP DY+ +VPMTDM +RP S PGR
Sbjct: 546 LLWAGYPSQEGGSAIFDILTGKTAPAGRLPVTQYPADYVNQVPMTDMALRPG--SNTPGR 603
Query: 615 TYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELG 674
TYR+Y+ K V PFG GL Y+ + +S + L ++ +V K+VP
Sbjct: 604 TYRWYD-KAVLPFGFGLHYTTFK-----ISWPRRALGPYNTAALVSRSP----KNVPIDR 653
Query: 675 TEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRG-NGRPIKQLVGF-QSVILNAKEK 732
F I V N G+ + LLF+K G P+K LVG+ ++ + EK
Sbjct: 654 AAFDTFH-----IQVTNTGKTTSDYVALLFLKTIDAGPKPYPLKTLVGYTRAKQIKPGEK 708
Query: 733 AEIVFELSPCESLSRAREDGLMVIEEGTHFLV--VGDEEYPISIFV 776
+ E+S SL+R E+G +V+ G + L VG+ +YP + F
Sbjct: 709 RSVDIEVS-LGSLARTAENGDLVLYPGRYTLEVDVGESQYPTASFT 753
>gi|62321271|dbj|BAD94481.1| beta-xylosidase [Arabidopsis thaliana]
Length = 523
Score = 478 bits (1230), Expect = e-132, Method: Compositional matrix adjust.
Identities = 258/533 (48%), Positives = 339/533 (63%), Gaps = 18/533 (3%)
Query: 252 SCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGY 311
S V G + +MC+YN+VNG P+CAD +LLS R +W +GYI SDCD+V ++Y + Y
Sbjct: 1 SYVVDGNVASVMCSYNQVNGKPTCADPDLLSGVIRGEWKLNGYIVSDCDSVDVLYKNQHY 60
Query: 312 AKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFN 371
K+P +A + AG+D+NCGSFL +HT+ AVK + E+ ID+A+ N F MRLG F+
Sbjct: 61 TKTPAEAAAISILAGLDLNCGSFLGQHTEEAVKSGLVNEAAIDKAISNNFLTLMRLGFFD 120
Query: 372 GNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNA 431
GNP Q +G +G VC+ A+Q LA AA+ GIVLLKN+ G LPL +LA+IGPNA
Sbjct: 121 GNPKNQIYGGLGPTDVCTSANQELAADAARQGIVLLKNT-GCLPLSPKSIKTLAVIGPNA 179
Query: 432 NSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHV 491
N KT++GNY G C+ TPLQ L V +T Y PGC VAC+ A + A +A AD
Sbjct: 180 NVTKTMIGNYEGTPCKYTTPLQGLAGTV-STTYLPGCSNVACAVADVAGATKLAATADVS 238
Query: 492 VLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDR 551
VL++G DQ+ E E DRVDL LPG+QQEL+ +VA+AAK PV+LV++ GG DITFAK D
Sbjct: 239 VLVIGADQSIEAESRDRVDLRLPGQQQELVIQVAKAAKGPVLLVIMSGGGFDITFAKNDP 298
Query: 552 NIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSG 610
I ILW GYPGEAG +A+A++IFG +NP G+LPMTWYPQ Y+ KVPMT M MRP SG
Sbjct: 299 KIAGILWVGYPGEAGGIAIADIIFGRYNPSGKLPMTWYPQSYVEKVPMTIMNMRPDKASG 358
Query: 611 NPGRTYRFYEGKEVFPFGCGLSYSKYSYKF-KAVSQNKLYLNQSSSTKMVENQDVVHYKS 669
PGRTYRFY G+ V+ FG GLSY+K+S+ KA S L L ++ + E Q S
Sbjct: 359 YPGRTYRFYTGETVYAFGDGLSYTKFSHTLVKAPSLVSLGLEENHVCRSSECQ------S 412
Query: 670 VPELGTEFCET------RKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQ 723
+ +G CE F V I V+N G+ G H V LF P +G P K LVGF+
Sbjct: 413 LDAIGPH-CENAVSGGGSAFEVHIKVRNGGDREGIHTVFLFTTPPAI-HGSPRKHLVGFE 470
Query: 724 SVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
+ L +E+A + F++ C+ LS E G I G H L VGD ++ +SI +
Sbjct: 471 KIRLGKREEAVVRFKVEICKDLSVVDEIGKRKIGLGKHLLHVGDLKHSLSIRI 523
>gi|388857998|emb|CCF48443.1| related to Beta-xylosidase [Ustilago hordei]
Length = 782
Score = 478 bits (1230), Expect = e-132, Method: Compositional matrix adjust.
Identities = 286/760 (37%), Positives = 428/760 (56%), Gaps = 46/760 (6%)
Query: 32 SCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWS 91
SC P +P ++ C T+P RA LV++ T +E ++ +N AP +PRLGIP Y+WW+
Sbjct: 23 SCAPDSPLSKIPDICDPTIPFYTRATSLVNQFTTEELLNNTINYAPGVPRLGIPNYQWWT 82
Query: 92 EALHGVAGVGKGIFFN-----GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY 146
EALHGVA G+ F+ AT FPQ I A+FD L+ +I I E RA
Sbjct: 83 EALHGVAK-SPGVNFDLSDPHAEFTSATQFPQTINLGATFDDDLYQQIASVIASEVRAYN 141
Query: 147 NAGQAIGMTFWAP-NINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLK 205
NAG+A G+ ++P NIN FRDPRWGRGQET GEDPL ++AVS V G+QG
Sbjct: 142 NAGKA-GLNLYSPLNINCFRDPRWGRGQETVGEDPLHMSRFAVSIVHGLQGPHAQNEAEG 200
Query: 206 GKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCA 265
KL +A CKHF AYDL+ + RY+FDA V+ QDL+D + P F +CV+ G A+ +M +
Sbjct: 201 NKLTVAATCKHFLAYDLEQYDRGERYQFDAIVSKQDLSDFHLPQFRACVRDGGATTLMTS 260
Query: 266 YNRVNGIPSCADRNLLSKTARRQWGF---HGYITSDCDAVSIIYDAEGYAKSPEDAVVDV 322
YN VN +P A + L AR+ WG H Y+TSDCDAV+ +YD YA++ +A
Sbjct: 261 YNAVNNVPPSASKYYLQTLARQAWGLDKTHNYVTSDCDAVANVYDGHRYAQNYVEAAAKS 320
Query: 323 LKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKI 382
+ AG D++CG+ ++ AA+KQK + I RA+ +++ +RLG F+ +P QP ++
Sbjct: 321 INAGTDLDCGATYSENLGAALKQKLTDIATIRRAVIRMYASLVRLGYFD-DPASQPLRQL 379
Query: 383 GADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA 442
V SP+ Q LA +A I LLKN LP+ K K +A+IGP N + + GNYA
Sbjct: 380 TWKDVNSPSSQRLAYTSALSSITLLKNLDSTLPI-KQKPTKIAIIGPYTNVSTSFSGNYA 438
Query: 443 GPSCRSITPLQALQNYVENT--VYYPGCDTVACSSAS-IDKAVDIAKGADHVVLMMGLDQ 499
GP+ ++T + A + V+ G D S AV + AD VV G+D
Sbjct: 439 GPAAFNMTMVHAASQVFPDAKIVWVNGTDISGPYIPSDAQDAVKLTSDADSVVFAGGIDA 498
Query: 500 TQEKEELDRVDLVLPGRQQELITRVAEAAKK----PVILVLLCGGPVDITFAKYDRNIGS 555
+ E+E DR D+ P Q LI ++++ KK +++V GG +D K D +G+
Sbjct: 499 SIERESHDRKDIAWPPNQLRLIHELSQSRKKDKKSKLVVVQFGGGQLDGASLKSDDAVGA 558
Query: 556 ILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK-VPMTDMKMRPQATSGNPGR 614
++WAGYPG++ ++A+ +++ G P GRLP+T YP YI +P + M +RP+A G PGR
Sbjct: 559 LVWAGYPGQSASLAVWDILAGKAVPAGRLPVTQYPASYIDGLPESAMSLRPKA--GYPGR 616
Query: 615 TYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELG 674
TY++Y+G +PFG GL Y+ +S ++++ + Y +T + + VH + +
Sbjct: 617 TYKWYKGVPTYPFGHGLHYTTFS---ASLAKPQPY--AIPTTPAAKGPEGVHAEHISVAD 671
Query: 675 TEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNG---RPIKQLVGFQSVI-LNAK 730
V +KN G++A + LLF AR NG P K LVG+ V L+A
Sbjct: 672 ----------VQANIKNTGKVASDYTALLF---ARHSNGPAPYPRKTLVGYTKVKNLSAG 718
Query: 731 EKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEY 770
E++ + +++ +L+RA E+G + G++ L + EE+
Sbjct: 719 EESSVTIKITQA-ALARADEEGNQFLYPGSYQLELDTEEH 757
>gi|119473971|ref|XP_001258861.1| beta-xylosidase [Neosartorya fischeri NRRL 181]
gi|292495290|sp|A1DJS5.1|XYND_NEOFI RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|119407014|gb|EAW16964.1| beta-xylosidase [Neosartorya fischeri NRRL 181]
Length = 771
Score = 478 bits (1230), Expect = e-132, Method: Compositional matrix adjust.
Identities = 308/788 (39%), Positives = 432/788 (54%), Gaps = 51/788 (6%)
Query: 1 MIFHKLSLVFPLLCL--CFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARD 58
M +H + +CL SL T D + P S C T+L ++ RAR
Sbjct: 5 MSWHYGKAITLFVCLGPVALSLDTFPDCSSGPLS---------KLAVCDTSLDVTTRARS 55
Query: 59 LVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFF--NGTIRGATSF 116
LV+ +T +EK++ ++P +PRLG+PAY WWSEALHGVAG G+ F +G ATSF
Sbjct: 56 LVNAMTFEEKVNNTQYNSPGVPRLGLPAYNWWSEALHGVAG-SPGVEFADSGPFSYATSF 114
Query: 117 PQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETP 176
PQ IL A+FD L ++ + E RA NAG+A G+ FW PNIN FRD RWGRGQETP
Sbjct: 115 PQPILLGATFDDDLIKQVATVVSTEGRAFGNAGRA-GLDFWTPNINPFRDARWGRGQETP 173
Query: 177 GEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDAR 236
GEDPL +Y V G+Q G + A CKHF AYDL++W G R+ F+A
Sbjct: 174 GEDPLHVSRYVYHLVDGLQN-----GIGPANPKVVATCKHFAAYDLEDWNGVVRHSFNAE 228
Query: 237 VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFH---G 293
V+ QDL++ Y PPF+SC + + +MC+YN +NG+P+CAD LL R W +
Sbjct: 229 VSTQDLSEFYLPPFKSCARDAKVDAVMCSYNALNGVPACADSYLLQTILREHWKWDEPGH 288
Query: 294 YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEI 353
+IT DC A+ IY+ Y K+P +A L AG D++CG+ K+ A + +
Sbjct: 289 WITGDCGAIDDIYNGHNYTKTPAEAAATALNAGTDLDCGTVFPKYLGQAADEGLYTNKTL 348
Query: 354 DRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGL 413
D+AL L+S ++LG F+ QP+ IG V SPA + LA +AA +GIVLLKN
Sbjct: 349 DKALVRLYSSLVKLGYFDPAED-QPYRSIGWKDVDSPAAEALAHKAAVEGIVLLKNDK-T 406
Query: 414 LPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVAC 473
LPL K+K +LALIGP AN+ K + GNY GP T L A + Y G A
Sbjct: 407 LPL-KAKG-TLALIGPYANATKQMQGNYEGPPKYIRTLLWAATQAGYDVKYVAGTAINAN 464
Query: 474 SSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVI 533
S+A D A+ AK AD VV G+D T E E DR +V PG Q +LI ++++ KP++
Sbjct: 465 STAGFDAALSAAKQADVVVYAGGIDNTIEAEGHDRTTIVWPGNQLDLIDQLSKIG-KPLV 523
Query: 534 LVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDY 593
+V GG VD + + ++ ++LW GYP + G A+ +++ G P GRLP+T YP DY
Sbjct: 524 VVQFGGGQVDDSSLLSNPHVNALLWTGYPSQEGGSAIFDILTGKTAPAGRLPVTQYPADY 583
Query: 594 I-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQ 652
+ +VP+TDM +RP S PGRTYR+Y+ K V PFG GL Y+ + +S + L
Sbjct: 584 VNQVPLTDMALRPG--SNTPGRTYRWYD-KAVLPFGFGLHYTTFK-----ISWPRRALGP 635
Query: 653 SSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRG- 711
+ +V K+VP F I V N G+ + LLF+K G
Sbjct: 636 YDTAALVSRSP----KNVPIDRAAFDTFH-----IQVTNTGKTTSDYVALLFLKTIDAGP 686
Query: 712 NGRPIKQLVGF-QSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLV--VGDE 768
P+K LVG+ ++ + EK + ++S SL+R E+G +V+ G + L VG+
Sbjct: 687 KPYPLKTLVGYTRAKQIKPGEKRSVDIKVS-LGSLARTAENGDLVLYPGRYTLEVDVGEN 745
Query: 769 EYPISIFV 776
+YP + F
Sbjct: 746 QYPTASFT 753
>gi|391864313|gb|EIT73609.1| beta-glucosidase-related glycosidase [Aspergillus oryzae 3.042]
Length = 797
Score = 477 bits (1228), Expect = e-132, Method: Compositional matrix adjust.
Identities = 298/769 (38%), Positives = 422/769 (54%), Gaps = 44/769 (5%)
Query: 17 FTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSA 76
F + T +S P S P + + C T+L RA+ LV+ +TL+EKI+ +
Sbjct: 37 FNWVSTSTESIFPDCSTGPLSKNN----VCDTSLDPVSRAKSLVAAMTLEEKINNTKYDS 92
Query: 77 PAIPRLGIPAYEWWSEALHGVAGVGKGIFFN--GTIRGATSFPQVILTAASFDSYLWYRI 134
PRLG+PAY WW+EALHGVA G G+ F+ G ATSFP IL A+FD L ++
Sbjct: 93 SGAPRLGLPAYNWWNEALHGVA-EGHGVSFSDSGNFSYATSFPMPILLGAAFDDDLVKQV 151
Query: 135 GQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGV 194
I EARA N G A G+ +W PNIN FRDPRWGRGQETPGEDPL +Y V G+
Sbjct: 152 ATVISTEARAFANGGHA-GLDYWTPNINPFRDPRWGRGQETPGEDPLHLSRYVYHLVDGL 210
Query: 195 QGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCV 254
Q G + + A CKHF AYDL+NW+G RY FDA V+ QDL++ Y P F++C
Sbjct: 211 QD-----GIGPERPKVVATCKHFAAYDLENWEGIERYAFDAVVSPQDLSEYYLPSFKTCT 265
Query: 255 KQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFH---GYITSDCDAVSIIYDAEGY 311
+ + +MC+YN +NGIP+CADR LL R WG+ ++T DC A+ IY Y
Sbjct: 266 RDAKVDAVMCSYNSLNGIPTCADRWLLQTLLREHWGWEQTGHWVTGDCGAIDNIYADHHY 325
Query: 312 AKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFN 371
A L AG D++CGS ++ +A++Q + AL L+S ++LG F+
Sbjct: 326 VADGAHAAAAALNAGTDLDCGSVFPEYLGSALQQGLYNNQTLYNALIRLYSSLVKLGYFD 385
Query: 372 GNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNA 431
QP+ IG + V +PA + LA +A +GIV+LKN G LPL + +V A+IGP A
Sbjct: 386 -PADDQPYRSIGWNEVFTPAAEELAHKATVEGIVMLKND-GTLPLKSNGTV--AIIGPFA 441
Query: 432 NSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHV 491
N+ L GNY GP T + A + + G D + SSA +A+ AK AD V
Sbjct: 442 NATTQLQGNYEGPPKYIRTLIWAAVHNGYKVKFSQGTDINSNSSAGFAEAISAAKEADTV 501
Query: 492 VLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDR 551
+ G+D T EKE DR +V PG Q +LI ++++ +KP+I+V GG VD + +
Sbjct: 502 IYAGGIDNTIEKESQDRTTIVWPGNQLDLIEQLSD-LEKPLIVVQFGGGQVDDSSLLANA 560
Query: 552 NIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSG 610
+G++LWAGYP +AG A+ +++ G P GRLP+T YP Y+ +VPMTDM +RP S
Sbjct: 561 GVGALLWAGYPSQAGGAAVFDILTGKSAPAGRLPVTQYPASYVDEVPMTDMTLRPG--SN 618
Query: 611 NPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSV 670
NPGRTYR+Y+ K V PFG GL Y+ ++ + N S N V
Sbjct: 619 NPGRTYRWYD-KAVLPFGFGLHYTTFNVSWNHAEYGP--YNTDSVASGTTNAPV------ 669
Query: 671 PELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRG-NGRPIKQLVGFQSVI-LN 728
TE +T +I V N G +A + LLF+ G PIK LVG+ +
Sbjct: 670 ---DTELFDT----FSITVTNTGNVASDYIALLFLTADGVGPEPYPIKTLVGYSRAKGIE 722
Query: 729 AKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG-DEEYPISIFV 776
+ ++ ++S S++R E+G +V+ G++ L V +++P + F
Sbjct: 723 PGQSQQVKLDVS-VGSVARTAENGDLVLYPGSYKLEVDVGQDFPTATFT 770
>gi|409041356|gb|EKM50841.1| glycoside hydrolase family 3 protein [Phanerochaete carnosa
HHB-10118-sp]
Length = 764
Score = 477 bits (1228), Expect = e-131, Method: Compositional matrix adjust.
Identities = 283/729 (38%), Positives = 409/729 (56%), Gaps = 37/729 (5%)
Query: 46 CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
C + + RA LV LTL+E ++ VN++P +PRLG+P Y WWSEALHGVA + G
Sbjct: 38 CNPSADPTSRANALVDALTLEELVNNTVNASPGVPRLGLPPYNWWSEALHGVA-LSPGTN 96
Query: 106 FN---GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
F+ ATSFPQ I+ A+FD L I I EARA NAG+A G+ F+ PNIN
Sbjct: 97 FSVPGSPFSSATSFPQPIILGATFDDDLVTSIATVISTEARAFNNAGRA-GLDFFTPNIN 155
Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL 222
F+DPRWGRGQETPGEDP +Y V G+QG G + A CKHF YDL
Sbjct: 156 PFKDPRWGRGQETPGEDPFHIAQYVYQLVTGLQG----GLSPDPYYKVIADCKHFAGYDL 211
Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
+NW+G +R F+A ++ QDLA+ Y P F+SCV+ +MC+YN VNGIPSCA+ LL
Sbjct: 212 ENWEGNSRMAFNAIISTQDLAEYYTPSFQSCVRDAHVGSVMCSYNAVNGIPSCANSYLLQ 271
Query: 283 KTARRQWGF-HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA 341
R +G G+ITSDCDAV+ I+ Y + +A LKAG DV+CG+ +
Sbjct: 272 DIIRGHFGLGDGWITSDCDAVANIFSPHQYTTTLVNASAVALKAGTDVDCGTTYSQTLVD 331
Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQ 401
AV Q + E +I ++ L+ +RLG F+ +P QPF ++G V +P+ Q LAL AA+
Sbjct: 332 AVDQNLVTEDDIKNSMIRLYRSLVRLGYFD-SPAEQPFRQLGWSDVNTPSSQALALTAAE 390
Query: 402 DGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN 461
+G+ LLKN G LPL S +AL+GP AN+ + GNY G + ++PLQALQ+
Sbjct: 391 EGVTLLKN-DGTLPL-SSAIKRIALVGPWANATTQMQGNYQGIAPFLVSPLQALQDAGFQ 448
Query: 462 TVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELI 521
+ G + + AV + AD V+ G+D+T E E DR + PG Q +L+
Sbjct: 449 VTFANGTAINSTDDSGFAAAVSAVQVADAVIYAGGIDETIESEGNDREIITWPGNQLDLV 508
Query: 522 TRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
+++A A KP +++ + GG VD + K ++ + +++W GYPG++G A+ ++ G P
Sbjct: 509 SQLA-AVGKPFVVLQMGGGQVDSSSLKSNKAVNALIWGGYPGQSGGAAIVNILTGKIAPA 567
Query: 582 GRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKF 640
GRLP+T YP DY+ ++PMTDM +RP TS PGRTY+++ G +F FG GL Y+ +S +
Sbjct: 568 GRLPITQYPADYVNEIPMTDMALRPNGTS--PGRTYKWFTGTPIFGFGFGLHYTTFSLDW 625
Query: 641 KAVSQNKLYLNQSSSTKMVE-NQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKH 699
+ + ST + E N V + ++ L F + VKN G++ +
Sbjct: 626 APTPPSSFAI----STLVSEANTAGVSFTNLAPL---------FTFRVNVKNTGKVGSDY 672
Query: 700 PVLLFVKPARRGNGRPIKQLVGF---QSVILNAKEKAEIVFELSPCESLSRAREDGLMVI 756
LLF P+KQLV + + + E AE+ L S++R E+G +
Sbjct: 673 VALLFSNTTAGPQPAPLKQLVSYTRVKGIAPGQTETAELKVTLG---SIARIDENGDSAL 729
Query: 757 EEGTHFLVV 765
G + + V
Sbjct: 730 YPGRYNIWV 738
>gi|156062754|ref|XP_001597299.1| hypothetical protein SS1G_01493 [Sclerotinia sclerotiorum 1980]
gi|154696829|gb|EDN96567.1| hypothetical protein SS1G_01493 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 758
Score = 477 bits (1228), Expect = e-131, Method: Compositional matrix adjust.
Identities = 279/739 (37%), Positives = 403/739 (54%), Gaps = 41/739 (5%)
Query: 46 CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
C TT RA LVS TL EKI+ N++P +PR+G+PAY+WW+EALHG+A
Sbjct: 34 CDTTADPYTRATALVSLFTLAEKINNTGNTSPGVPRIGLPAYQWWNEALHGIAYGTHFAA 93
Query: 106 FNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFR 165
ATSFPQ IL A+FD L + + I EARA NA + G+ FW PNIN ++
Sbjct: 94 AGSNYSYATSFPQPILMGAAFDDALIHDVASQISTEARAFSNANR-YGLNFWTPNINPYK 152
Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
DPRWGRGQETPGEDP Y + V G+QG + KG A CKH+ YDL+N
Sbjct: 153 DPRWGRGQETPGEDPFHVSSYVNALVTGLQGGLDDLPYKKGV----ATCKHYAGYDLENG 208
Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
G RY FDA + QDL D Y P F+ C + IMC+YN VNG+P+CAD LL
Sbjct: 209 GGIQRYAFDAIINSQDLRDYYLPSFQQCARDSNVQSIMCSYNAVNGVPTCADDWLLQSLL 268
Query: 286 RRQWGF---HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAA 342
R WG+ ++TSDCDAV I+D+ Y +PE A D L AG D++CG F + +A
Sbjct: 269 REHWGWVEEDQWVTSDCDAVQNIWDSHNYTSTPEQAAADALNAGTDLDCGGFWPTYLGSA 328
Query: 343 VKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQD 402
Q S +DR+L ++ +RLG F+ ++QP+ ++G V +P+ + LALQAA+D
Sbjct: 329 YNQSLYNISTLDRSLTRRYASLVRLGYFDP-ASIQPYRQLGWSDVSTPSAEQLALQAAED 387
Query: 403 GIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENT 462
GIVLLKN G+LPLP S ++ALIGP AN+ + GNY G + +PL A QN +
Sbjct: 388 GIVLLKND-GILPLP-SNITNVALIGPWANATTQMQGNYYGQAPYLHSPLIAAQNAGFHV 445
Query: 463 VYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELIT 522
Y G D + ++ A+ AK AD ++ + G+D + E E DR + P Q L+
Sbjct: 446 TYVQGADIDSTNTTEFTAAIAAAKKADVIIYIGGIDNSIEAEAKDRKTIAWPSSQISLVN 505
Query: 523 RVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGG 582
++A + P+I+ + G +D + +R + I+WAGYPG+ G A+ ++ G P G
Sbjct: 506 QLANLS-IPLIISQM-GTMIDSSSLLTNRGVNGIIWAGYPGQDGGTAIFNILTGKTAPAG 563
Query: 583 RLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFK 641
RLP+T YP DY+ +V M +M + P A NPGRTY+++ G +F FG GL Y+ ++ K
Sbjct: 564 RLPITQYPSDYVNEVSMNNMNLHPGAN--NPGRTYKWFNGTSIFDFGFGLHYTTFNAKIT 621
Query: 642 AVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPV 701
S N ++ +S +D+ + ++P I + N G +
Sbjct: 622 PPSSNTFEISHLTSNTSTH-KDLTPFLTLP---------------ISISNTGTTTSDYVA 665
Query: 702 LLFVKPARRGNGRPIKQLVGF---QSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEE 758
LLF+ + P K LV + + A A++ L+ SL+R E G +V+
Sbjct: 666 LLFLTGSFGPTPYPKKSLVAYTRLHDIKGGASSTAQLKLNLA---SLARGNEKGDLVLYP 722
Query: 759 GTHFLVV---GDEEYPISI 774
G + +VV G +E+ ++
Sbjct: 723 GDYKVVVDVDGKDEWSFTL 741
>gi|443893988|dbj|GAC71176.1| hypothetical protein PANT_1d00031 [Pseudozyma antarctica T-34]
Length = 759
Score = 477 bits (1227), Expect = e-131, Method: Compositional matrix adjust.
Identities = 292/754 (38%), Positives = 421/754 (55%), Gaps = 61/754 (8%)
Query: 33 CDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSE 92
C P P + C T+L RA LV+ T E I+ +N+AP +PRLGIP Y+WW+E
Sbjct: 24 CSPGTPLSAN-AVCDTSLDYWTRATSLVAEFTTQELINNTINTAPGVPRLGIPPYQWWTE 82
Query: 93 ALHGVAGVGKGIFFNGTIRG----ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA 148
ALHGVAG G+ F + AT+FPQ+I A+FD L+ ++ I E RA NA
Sbjct: 83 ALHGVAG-SPGVNFADDVEAPYGSATNFPQIINLGATFDDALYEQVATHIANETRAFNNA 141
Query: 149 GQAIGMTFWAP-NINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGK 207
G+A G+ ++P NIN FRDPRWGRGQET GEDPL +YAV V+G+QG + +
Sbjct: 142 GKA-GLNMYSPLNINCFRDPRWGRGQETTGEDPLHMSRYAVKMVQGLQGPN------QDE 194
Query: 208 LQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYN 267
L+ +A CKH+ AYDL+ W G RY+FDA+V+ Q+LA+ Y P F +CV+ G+A +M +YN
Sbjct: 195 LRLAATCKHYLAYDLEKWDGVERYQFDAQVSRQELAEFYLPQFRACVRDGKAVTLMTSYN 254
Query: 268 RVNGIPSCADRNLLSKTARRQWGF---HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLK 324
VN +P A R L AR++WG H Y+TSDCDAV+ ++D YA S A D +
Sbjct: 255 AVNNVPPSASRYYLETLARKEWGLDKKHNYVTSDCDAVANVFDGHHYADSYVQAAADSIN 314
Query: 325 AGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGA 384
AG D+NCG+ + A++Q I A+ +++ ++RLGLF+ QP ++G
Sbjct: 315 AGTDLNCGATYSDNLGQALEQNLTDVETIRTAVARMYASQVRLGLFDPK-QGQPLRELGW 373
Query: 385 DVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGP 444
+ V + A Q LA +A + LLKN +G LP+ + V A+IGP +N+ L GNYAGP
Sbjct: 374 EHVNTKAAQDLAYSSAAASVTLLKN-NGTLPVDGATKV--AVIGPYSNATFALRGNYAGP 430
Query: 445 SCRSITPLQALQN-YVENTVYYPGCDTVACSSASID--KAVDIAKGADHVVLMMGLDQTQ 501
+IT +A Q + + T+ T++ + D A+ +AK AD V+ G+D T
Sbjct: 431 GPFAITMTEAAQRVFSQATISSANGTTISGTYNHTDAEAAMQLAKEADLVIFAGGIDPTI 490
Query: 502 EKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGY 561
E EELDR + P Q +LI + AKK + +V GG +D K D NIG++LWAGY
Sbjct: 491 ESEELDRATIAWPPNQLQLIHALGGMAKK-MAVVQFGGGQIDGASIKADGNIGALLWAGY 549
Query: 562 PGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK-VPMTDMKMRPQATSGNPGRTYRFYE 620
PG++GA+A+ +VI G+ P GRLP+T YP +YI + T M +RP AT PGRTY++Y
Sbjct: 550 PGQSGALAVMDVIAGNTAPAGRLPITQYPAEYIDGLAETTMALRPNATY--PGRTYKWYS 607
Query: 621 GKEVFPFGCGLSYSKYSYKFKA-VSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCE 679
G +P+ GL Y+ +FKA ++Q Y ++ E
Sbjct: 608 GTPTYPYAHGLHYT----EFKAELAQPAPYTIATAGYAEFE------------------- 644
Query: 680 TRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNG---RPIKQLVGFQSV-ILNAKEKAEI 735
R V + N G+ + L+F AR NG P K LVG++ V + E +
Sbjct: 645 -RVATVQATITNAGQRTSDYAALVF---ARHTNGPAPHPNKTLVGYKKVKAIAPGESRSV 700
Query: 736 VFELSPCESLSRAREDGLMVIEEGTHFLVVGDEE 769
E++ +L+R E+G +V+ G + L + EE
Sbjct: 701 EVEITQA-ALARGDEEGNLVLYPGKYELELDTEE 733
>gi|344303941|gb|EGW34190.1| hypothetical protein SPAPADRAFT_65353 [Spathaspora passalidarum
NRRL Y-27907]
Length = 788
Score = 477 bits (1227), Expect = e-131, Method: Compositional matrix adjust.
Identities = 276/742 (37%), Positives = 411/742 (55%), Gaps = 45/742 (6%)
Query: 9 VFPLLCLCFTSLLTRVDSTQPPFSCD---PSNPSTE---------TFPFCKT-------- 48
+ P+ L +L + PP S D +NP TFP C+
Sbjct: 1 MLPIKLLATLALAIFANGETPPSSPDYNTEANPQLPPITFEAVHYTFPDCQNGPLKHNAV 60
Query: 49 ---TLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
LP QRA+ +V T+DE I+ + N++P + RLG+P Y+WWSEALHG+A
Sbjct: 61 CNPHLPTEQRAKAVVDLFTVDELIANMGNTSPGVERLGLPPYQWWSEALHGIAR--SNFT 118
Query: 106 FNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFR 165
+G ATSFPQ IL +F++ L+ ++G IG EARA N G+A G+ F++PNIN FR
Sbjct: 119 ASGEYSHATSFPQPILMGGAFNNDLYKQVGNVIGTEARAFNNVGRA-GLDFYSPNINPFR 177
Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
D RWGRGQE E P++ G YA++YV+G+QG + + LQ +A CKHF YD+++W
Sbjct: 178 DARWGRGQEVASESPVLVGNYALNYVQGLQGG-LDSNQNDDTLQVAATCKHFVGYDMESW 236
Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
+R ++A ++ QDLAD Y P F+SCV+ +A+G MC+YN VNG+P+CA L+
Sbjct: 237 NQHSRLGYNAIISDQDLADFYLPTFQSCVRDAKAAGAMCSYNAVNGVPACASEFFLNTVL 296
Query: 286 RRQWGF-HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVK 344
R + F +G I SDCDA+ +++ YA+ A D +KAG+DVNCG Q + A+
Sbjct: 297 RDGFDFQNGVIHSDCDAIYNVWNPHLYAQDLGGAAADAIKAGVDVNCGDTYQNNLGYALG 356
Query: 345 QKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGI 404
K + E++I ++ +S +RLG F+ +P + K + V +P LA QAA +GI
Sbjct: 357 NKTINENQIRTSVTRQYSNLIRLGYFD-SPQTNKYRKYDWNDVSTPQANQLAYQAAVEGI 415
Query: 405 VLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVY 464
LLKN G LP K K +A+IGP AN+ +LG+YAG I+PLQ Q+ Y
Sbjct: 416 ALLKND-GTLPFNKQKVRKVAVIGPWANATTQMLGDYAGTPPYMISPLQGAQSEGFQVEY 474
Query: 465 YPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRV 524
G ++ A++ AKGAD +V G+D + E E LDR L PG Q +L++++
Sbjct: 475 ALGTQINTTDTSGYTAALNAAKGADAIVYFGGIDNSVENEALDRESLAWPGNQLDLVSKL 534
Query: 525 AEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRL 584
+ KKP++++ GG +D T K ++N+ +I++AGYPG++G A+ +++ G + P GRL
Sbjct: 535 S-GLKKPLVVLQFGGGQIDDTEIKNNKNVNAIVYAGYPGQSGGTAIWDILSGKYAPAGRL 593
Query: 585 PMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAV 643
T YP Y +VPMTDM +RP+ G PGRT+ +Y G+ V+ FG GL Y+ +S
Sbjct: 594 TTTQYPASYADQVPMTDMTLRPR--QGYPGRTFMWYNGEPVYEFGYGLHYTTFSASLANA 651
Query: 644 SQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLL 703
+ +QS + + V V K + T T + +KN G+ + LL
Sbjct: 652 PRGG---HQSFNIEQV----VAAAKRSQYVDTGLITT----FDVNIKNTGKTTSDYAALL 700
Query: 704 FVKPARRGNGRPIKQLVGFQSV 725
+ K P K LV F +
Sbjct: 701 YSKTTAGPGPHPNKILVSFDKL 722
>gi|302683060|ref|XP_003031211.1| glycoside hydrolase family 3 protein [Schizophyllum commune H4-8]
gi|300104903|gb|EFI96308.1| glycoside hydrolase family 3 protein [Schizophyllum commune H4-8]
Length = 761
Score = 475 bits (1223), Expect = e-131, Method: Compositional matrix adjust.
Identities = 291/727 (40%), Positives = 409/727 (56%), Gaps = 35/727 (4%)
Query: 46 CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
C T+L +RAR LV LT+ E I+ V++AP +PRLG+P Y WW+EALHGVA G+
Sbjct: 35 CDTSLGHVERARALVEELTVAEMINNTVHTAPGVPRLGLPPYNWWNEALHGVA-ASPGVV 93
Query: 106 FNG---TIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
F ATSFP I ++FD L +G EARA NAG A G+ +W PNIN
Sbjct: 94 FTSPGEEFSSATSFPMPINMGSAFDDALMLAVGNVTSTEARAFNNAGLA-GLDYWTPNIN 152
Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL 222
F+DPRWGRG ETPGEDPL +Y + V G+QG G L+ +A CKH+ AYDL
Sbjct: 153 PFKDPRWGRGAETPGEDPLHAARYVRTLVEGLQG-----GIDPPSLKVAADCKHWAAYDL 207
Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
++W G RY FDA VT QDLA+ Y PPF+SCV+ RA+ +MC+YN VNG+P+CA LL
Sbjct: 208 EDWGGVARYAFDAVVTPQDLAEYYSPPFKSCVRDARAASVMCSYNAVNGVPACASPYLLK 267
Query: 283 KTARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTK 340
R WG ++TSDCDAV +YD GY + + LKAG D++CG+ ++
Sbjct: 268 TVLRDAWGLAEDRWVTSDCDAVGNVYDPHGYTEDFVNGSAVSLKAGSDLDCGTTYSQYLP 327
Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
A + + E ++ AL L++ + LG F+ P QP+ +I V +PA Q LA AA
Sbjct: 328 EAYDRGLIDEDDLKAALTRLYASLVWLGYFDA-PEDQPYRQISWADVNTPAAQALAYTAA 386
Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNAN-SAKTLLGNYAGPSCRSITPLQALQNYV 459
+ VLLKN G LPL S S+S+ALIGP AN SA L GNY G +I PLQ +
Sbjct: 387 IESFVLLKN-DGTLPLTDS-SLSIALIGPMANASAVQLQGNYNGIPPFAIAPLQGFLDAG 444
Query: 460 ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQE 519
N Y G + + ID AV A+ AD V+ + G+D T E+E DR ++ P Q
Sbjct: 445 FNVTYVLGTNVTGNDADDIDGAVAAAEAADVVIYVGGIDSTVEEEAKDRTEISWPDNQLA 504
Query: 520 LITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHN 579
L++ + EA KP+++V + GG +D T K + +ILWAGYPG++G A+A+ + G
Sbjct: 505 LLSALEEAG-KPLVVVQMGGGQLDDTPLKESDAVNAILWAGYPGQSGGTAIADTVMGKVA 563
Query: 580 PGGRLPMTWYPQDYIK-VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSY 638
P GRL +T YP Y+ V MTDM +RP ++GNPGRTY++Y G V+P+G GL Y+ +S
Sbjct: 564 PAGRLSITQYPASYVDAVAMTDMTLRPDNSTGNPGRTYKWYTGTPVYPYGYGLHYTNFSV 623
Query: 639 KFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGK 698
+ + + Y Q ++ D+ + F VT V N G++A
Sbjct: 624 AWASDAPEACYSIQDLTSSADGFVDLAPLDT-------------FRVT--VTNDGDVASD 668
Query: 699 HPVLLFVKPARRGNGRPIKQLVGF-QSVILNAKEKAEIVFELSPCESLSRAREDGLMVIE 757
LLFV P+K+LV + ++ + + ++ E++ +L+R+ E G +
Sbjct: 669 FVALLFVSTQAGPAPAPMKELVAYARASDVQPGDSTDVDLEVT-LGALARSDESGDASLY 727
Query: 758 EGTHFLV 764
G + L
Sbjct: 728 PGDYELT 734
>gi|442803736|ref|YP_007371885.1| beta-xylosidase BxlB [Clostridium stercorarium subsp. stercorarium
DSM 8532]
gi|442739586|gb|AGC67275.1| beta-xylosidase BxlB [Clostridium stercorarium subsp. stercorarium
DSM 8532]
Length = 715
Score = 474 bits (1220), Expect = e-131, Method: Compositional matrix adjust.
Identities = 292/750 (38%), Positives = 417/750 (55%), Gaps = 93/750 (12%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
+ + +RA+DLVSR+T++EK+SQ++ ++PAI RLGIPAY WW+EALHGVA G
Sbjct: 7 YLDPSYSFEERAKDLVSRMTIEEKVSQMLYNSPAIERLGIPAYNWWNEALHGVARAGT-- 64
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTFW 157
AT FPQ I AA+FD L Y++ I E RA Y+A G G+TFW
Sbjct: 65 --------ATMFPQAIGMAATFDEELIYKVADVISTEGRAKYHASSKKGDRGIYKGLTFW 116
Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
+PNINIFRDPRWGRGQET GEDP +T + V++V+G+QG+ L+A+AC KHF
Sbjct: 117 SPNINIFRDPRWGRGQETYGEDPYLTARLGVAFVKGLQGNH------PKYLKAAACAKHF 170
Query: 218 TAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCAD 277
+ + + R++F+A V+ +DL +TY P F++ V++ + +M AYNR NG P C
Sbjct: 171 AVH---SGPESLRHEFNAVVSKKDLYETYLPAFKALVQEAKVESVMGAYNRTNGEPCCGS 227
Query: 278 RNLLSKTARRQWGFHGYITSDCDAVSIIY-DAEGYAKSPEDAVVDVLKAGMDVNCGSFLQ 336
+ LLS R +WGF G++ SDC A+ + A +PE A + V + G D+NCG+
Sbjct: 228 KTLLSDILRGEWGFKGHVVSDCWAIRDFHMHHHVTATAPESAALAV-RNGCDLNCGNMF- 285
Query: 337 KHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQ-PFGKIGADVVCSPAHQVL 395
+ A+K+ + E EIDRA+ L RM+LG+F +P Q P+ I D V H+ L
Sbjct: 286 GNLLIALKEGLITEEEIDRAVTRLMITRMKLGMF--DPEDQVPYASISYDFVDCKEHREL 343
Query: 396 ALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL 455
AL A+ IVLLKN GLLPL + K S+A+IGPNA+S + L+GNY G + +T L +
Sbjct: 344 ALDVAKKSIVLLKND-GLLPLDRKKIRSIAVIGPNADSRQALIGNYEGTASEYVTVLDGI 402
Query: 456 QNYVENTV---YYPGC-------DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE 505
+ + V Y GC + + I +AV A+ AD V++ +GLD T E EE
Sbjct: 403 REMAGDDVRIYYSVGCHLYKDRVENLGEPGDRIAEAVTCAEHADVVIMCLGLDSTIEGEE 462
Query: 506 L---------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSI 556
+ D+ DL LPG+QQEL+ V A KP++LVLL G + +T+A D +I +I
Sbjct: 463 MHESNIYGSGDKPDLNLPGQQQELLEAVY-ATGKPIVLVLLTGSALAVTWA--DEHIPAI 519
Query: 557 LWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTY 616
L A YPG G A+A V+FG+ NP G+LP+T+Y TD M RTY
Sbjct: 520 LNAWYPGALGGRAIASVLFGETNPSGKLPVTFYRTTEELPDFTDYSME--------NRTY 571
Query: 617 RFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTE 676
RF + + ++PFG GLSY+ + Y +S++ + +
Sbjct: 572 RFMKNEALYPFGFGLSYTTFDYSDLKLSKDTIRAGEG----------------------- 608
Query: 677 FCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIV 736
F V++ V N G+MAG+ V +++K P QL G + V L + E AEI
Sbjct: 609 ------FNVSVKVTNTGKMAGEEVVQVYIKDLEASWRVPNWQLSGMKRVRLESGETAEIT 662
Query: 737 FELSPCESLSRAREDGLMVIEEGTHFLVVG 766
FE+ P E L+ ++G VIE G + VG
Sbjct: 663 FEIRP-EQLAVVTDEGKSVIEPGEFEIYVG 691
>gi|302683012|ref|XP_003031187.1| glycoside hydrolase family 3 protein [Schizophyllum commune H4-8]
gi|300104879|gb|EFI96284.1| glycoside hydrolase family 3 protein [Schizophyllum commune H4-8]
Length = 752
Score = 473 bits (1216), Expect = e-130, Method: Compositional matrix adjust.
Identities = 294/727 (40%), Positives = 401/727 (55%), Gaps = 41/727 (5%)
Query: 44 PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
P C +L +RAR LV T+ E I+ VN+A +PRLG+P YEWW+EALHGV G+ G
Sbjct: 34 PVCDASLGHVERARALVEEFTVPEMINNTVNAAFGVPRLGLPPYEWWNEALHGV-GLSPG 92
Query: 104 I-FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
+ FF ATSFP I ++FD L +G I EARA NAG+A G+ +W PNIN
Sbjct: 93 VVFFEPEPAVATSFPMPINMGSAFDDALMLAMGDVISTEARAFSNAGRA-GLDYWTPNIN 151
Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL 222
F+DPRWGRG ETPGEDPL +Y S V G+QG G L+ +A CKH+ AYDL
Sbjct: 152 PFKDPRWGRGAETPGEDPLHAARYVRSLVEGLQG-----GIDPPSLKVAAACKHWAAYDL 206
Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
+NW G TRY FDA VT QDLA+ Y PPF SCV+ RA+ MC+YN VNG+P+CA LL
Sbjct: 207 ENWGGVTRYAFDAVVTPQDLAEYYAPPFRSCVRDARAASAMCSYNAVNGVPACASPYLLK 266
Query: 283 KTARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTK 340
R WG ++TSDC AV +YD GY + +A LKAG D+NCG+ ++
Sbjct: 267 TVLRDAWGLAEDRWVTSDCGAVGNVYDPHGYTEDLVNASTVSLKAGTDLNCGTNYTQYLP 326
Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
A + + E ++ AL L++ + LG F+ P QP+ +I V +P Q LA AA
Sbjct: 327 EAYDRGLIDEDDLKAALTRLYASLVWLGYFDA-PEDQPYRQITWADVNTPEAQALAYTAA 385
Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNAN-SAKTLLGNYAGPSCRSITPLQALQNYV 459
VLLKN G LPL S ++SLALIGP AN SA +LGNY G I PLQ +
Sbjct: 386 IKSFVLLKN-DGTLPLTDS-TLSLALIGPMANASALQMLGNYFGIPPFVIAPLQGFLDAG 443
Query: 460 ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQE 519
N Y G + + S D AV A+ AD V+ + G+D T E EE DR ++ P Q
Sbjct: 444 FNVTYVLGTNVTGNDAGSFDAAVAAAEAADVVIYVGGIDNTLEMEEKDRTEISWPDNQLA 503
Query: 520 LITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHN 579
L++ + E KP+++V + GG +D T K + +ILWAGYPG++G A+A+ + G
Sbjct: 504 LLSAL-EGVGKPLVVVQMGGGQLDDTPLKESDAVNAILWAGYPGQSGGTAIADTVTGKVA 562
Query: 580 PGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSY 638
P GRL Y+ +V MTDM +RP +GNPGRTY++Y G V+P+G GL Y+ S
Sbjct: 563 PAGRL--------YVDEVAMTDMTLRPDNATGNPGRTYKWYTGTPVYPYGYGLHYTNISV 614
Query: 639 KFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGK 698
+ + + Y Q + + D+ + F VT V N G++A
Sbjct: 615 AWASDAPEACYSIQDLTGEASGFVDLAPLDT-------------FRVT--VTNEGDIASD 659
Query: 699 HPVLLFVKPARRGNGRPIKQLVGF-QSVILNAKEKAEIVFELSPCESLSRAREDGLMVIE 757
LLFV PIK++V + ++ + E+ E++ +L+R E G +
Sbjct: 660 FVALLFVSTQAGPAPAPIKEMVAYARASDVQPGNSTEVELEVT-LGALARTDESGDASLY 718
Query: 758 EGTHFLV 764
G + L
Sbjct: 719 PGKYELT 725
>gi|70986056|ref|XP_748529.1| beta-xylosidase [Aspergillus fumigatus Af293]
gi|74668295|sp|Q4WFI6.1|BXLB_ASPFU RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
Flags: Precursor
gi|296439536|sp|B0Y0I4.1|BXLB_ASPFC RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
Flags: Precursor
gi|66846158|gb|EAL86491.1| beta-xylosidase, putative [Aspergillus fumigatus Af293]
gi|159128339|gb|EDP53454.1| beta-xylosidase [Aspergillus fumigatus A1163]
Length = 771
Score = 473 bits (1216), Expect = e-130, Method: Compositional matrix adjust.
Identities = 303/766 (39%), Positives = 423/766 (55%), Gaps = 51/766 (6%)
Query: 32 SCDPSNPSTETFP-----------FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIP 80
S P S TFP C T+L ++ RA+ LV+ +T +EK++ ++P +P
Sbjct: 18 SLAPGALSLNTFPDCSSGPLSKLAVCDTSLDVTTRAQSLVNAMTFEEKVNNTQYNSPGVP 77
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFF--NGTIRGATSFPQVILTAASFDSYLWYRIGQAI 138
RLG+PAY WWSEALHGVAG G+ F +G ATSFPQ IL A+FD L ++ +
Sbjct: 78 RLGLPAYNWWSEALHGVAG-SPGVEFADSGPFSYATSFPQPILLGATFDDDLIKQVATVV 136
Query: 139 GLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDT 198
E RA NAG++ G+ FW PNIN FRD RWGRGQETPGEDPL +Y V G+Q
Sbjct: 137 STEGRAFGNAGRS-GLDFWTPNINPFRDARWGRGQETPGEDPLHVSRYVYHLVDGLQN-- 193
Query: 199 FNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGR 258
G + A CKHF AYDL++W G R+ F+A V+ QDL++ Y PPF+SC + R
Sbjct: 194 ---GIGPANPKVVATCKHFAAYDLEDWNGVVRHSFNAEVSTQDLSEFYLPPFKSCARDAR 250
Query: 259 ASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHG---YITSDCDAVSIIYDAEGYAKSP 315
+MC+YN +NG+P+CAD LL R W + +ITSDC A+ IY+ + +P
Sbjct: 251 VDAVMCSYNALNGVPACADSYLLQTILREHWKWDEPGRWITSDCGAIDDIYNGHNFTTTP 310
Query: 316 EDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPT 375
+A L AG D++CG+ K+ A + +DRAL L+S ++LG F+
Sbjct: 311 AEAAATALNAGTDLDCGTVFPKYLGQAADEGLYSNQTLDRALVRLYSSLVKLGYFDPAED 370
Query: 376 MQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAK 435
QP+ IG V +PA + LA +AA +GIVLLKN LPL K+K +LALIGP AN+ K
Sbjct: 371 -QPYRSIGWTDVDTPAAEALAHKAAGEGIVLLKNDK-TLPL-KAKG-TLALIGPYANATK 426
Query: 436 TLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMM 495
+ GNY GP+ T L A + Y G S+A D A+ AK AD VV
Sbjct: 427 QMQGNYEGPAKYIRTLLWAATQAGYDVKYAAGTAINTNSTAGFDAALSAAKQADVVVYAG 486
Query: 496 GLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGS 555
G+D T E E DR + PG Q LI ++++ KP+++V GG VD + + + +
Sbjct: 487 GIDNTIEAEGRDRTTIAWPGNQVNLIDQLSKIG-KPLVVVQFGGGQVDDSSLLSNPRVNA 545
Query: 556 ILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGR 614
+LWAGYP + G A+ +++ G P GRLP+T YP DY+ +VPMTDM +RP S PGR
Sbjct: 546 LLWAGYPSQEGGSAIFDILTGKTAPAGRLPVTQYPADYVNQVPMTDMALRPG--SNTPGR 603
Query: 615 TYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELG 674
TYR+Y+ K V PFG GL Y+ + +S + L ++ +V K+VP
Sbjct: 604 TYRWYD-KAVLPFGFGLHYTTFK-----ISWPRRALGPYNTAALVSRSP----KNVPIDR 653
Query: 675 TEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRG-NGRPIKQLVGF-QSVILNAKEK 732
F I V N G+ + LLF+K G P+K LVG+ ++ + EK
Sbjct: 654 AAFDTFH-----IQVTNTGKTTSDYVALLFLKTTDAGPKPYPLKTLVGYTRAKQIKPGEK 708
Query: 733 AEIVFELSPCESLSRAREDGLMVIEEGTHFLV--VGDEEYPISIFV 776
+ E+S SL+R E+G +V+ G + L VG+ +YP + F
Sbjct: 709 RSVDIEVS-LGSLARTAENGDLVLYPGRYTLEVDVGESQYPTASFT 753
>gi|392560759|gb|EIW53941.1| glycoside hydrolase family 3 protein [Trametes versicolor FP-101664
SS1]
Length = 783
Score = 472 bits (1215), Expect = e-130, Method: Compositional matrix adjust.
Identities = 288/727 (39%), Positives = 403/727 (55%), Gaps = 19/727 (2%)
Query: 46 CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
C T RA L+ T +E S VN++P +PRLG+PAY WWSE LHGVA G+
Sbjct: 41 CDITKDPITRATALIGLWTDEELTSNTVNASPGVPRLGLPAYNWWSEGLHGVA-QSPGVT 99
Query: 106 F--NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
F +G ATSFPQ IL A+FD L I + E RA NAG+A G+ +W PNIN
Sbjct: 100 FAPSGNFSHATSFPQPILMGAAFDDTLIQAIATIVSTEGRAFNNAGRA-GLDYWTPNINP 158
Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
F+DPRWGRGQETPGEDP +Y + + G+QG G K + A CKHF AYDL+
Sbjct: 159 FKDPRWGRGQETPGEDPFHLSQYVYNLILGLQG----GLDPKPYFKVVADCKHFAAYDLE 214
Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
NW+G R FDA V+ QDL++ Y PPF++CV+ + + +MC+YN VNGIPSCA+ LL
Sbjct: 215 NWEGIVRNGFDAIVSQQDLSEFYLPPFQTCVRDAKVASVMCSYNAVNGIPSCANSFLLQD 274
Query: 284 TARRQWGFHG--YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA 341
R WGF ++TSDCDAV I Y P A D L AG D++CG+F +
Sbjct: 275 VLRDHWGFTDDRWVTSDCDAVENILTPHKYTTDPAQAAADALLAGTDIDCGTFSSTYLPE 334
Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQ 401
A+++ + +++ RA ++ +RLG F+ +P QP+ ++G V +P Q LA AA
Sbjct: 335 ALQRGLVNSTDLRRAAIRQYASLVRLGYFD-DPAAQPYRQLGWSDVNTPQAQQLAHTAAV 393
Query: 402 DGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN 461
+GIVLLKN G+LP K LALIGP AN+ L G+Y G + ++PLQ Q
Sbjct: 394 EGIVLLKND-GVLPFSKHVR-KLALIGPWANATSLLQGSYIGVAPYLVSPLQGAQEAGFE 451
Query: 462 TVYYPGCDTVACSSAS-IDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQEL 520
Y G + + S AV + AD VV GLD+T E E DR+++ PG Q +L
Sbjct: 452 VEYVLGTNVTTQNDMSGFAAAVAAVRRADAVVFAGGLDETVECEGTDRLNVTWPGNQLDL 511
Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
+ + E KP+I+ GG +D T K+ + + +I+W GYPG++G AL +++ G P
Sbjct: 512 VAEL-ERVGKPLIVAQFGGGQLDDTALKHSKAVNAIIWGGYPGQSGGTALFDILTGKAAP 570
Query: 581 GGRLPMTWYPQDYIK-VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYK 639
GRLP+T YP Y K VPMTDM +RP AT NPGRTY++Y G VF FG GL Y+ + +
Sbjct: 571 AGRLPITQYPAAYTKQVPMTDMSLRPSAT--NPGRTYKWYSGTPVFEFGFGLHYTTFVFS 628
Query: 640 FKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKF-LVTIGVKNHGEMAGK 698
+ A S + +S + ++ + + + T F + + V N G +A
Sbjct: 629 WAAPSAAAAVDSTASFGSLAKSYSISQLVAHGQESTAFLDLAPLDTFAVRVTNTGRVASD 688
Query: 699 HPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEE 758
+ LLFV A P KQLV + V A + + +++RA ++G +
Sbjct: 689 YVALLFVSGAFGPAPHPKKQLVAYTRVHGLAPRGSTVAQLPVTLGAIARADKNGEKWVHP 748
Query: 759 GTHFLVV 765
GT+ L +
Sbjct: 749 GTYTLAL 755
>gi|296439595|sp|A1CCL9.2|BXLB_ASPCL RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
Flags: Precursor
Length = 771
Score = 471 bits (1213), Expect = e-130, Method: Compositional matrix adjust.
Identities = 297/741 (40%), Positives = 411/741 (55%), Gaps = 46/741 (6%)
Query: 46 CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
C T+ ++ RA+ LV ++ EK++ AP +PRLG+PAY WWSEALHGVAG G+
Sbjct: 43 CDTSRDVTTRAQSLVDAMSFAEKVNNTQYEAPGVPRLGLPAYNWWSEALHGVAGA-PGVH 101
Query: 106 F--NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
F +G ATSF Q IL ASFD L ++ +G E RA NAG+A G+ +W PNIN
Sbjct: 102 FADSGPFSYATSFAQPILLGASFDDELVKQVATVVGTEGRAFGNAGRA-GLDYWTPNINP 160
Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
FRDPRWGRGQETPGEDPL +Y V G+QG G + Q +A CKHF AYD++
Sbjct: 161 FRDPRWGRGQETPGEDPLHVSRYVYHLVDGLQG-----GIGPARPQIAATCKHFAAYDME 215
Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
+W G +R++FDARV+ QDLA+ Y P F+SCV+ + +MC+YN +NG+P+CAD LL
Sbjct: 216 DWNGVSRHEFDARVSTQDLAEFYLPSFKSCVRDAQVDAVMCSYNALNGVPTCADPYLLQT 275
Query: 284 TARRQWGFH---GYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTK 340
R W + ++ SDC A+ IY Y K+ +A L AG D++CG+ KH
Sbjct: 276 LLREHWDWDQPGHWVVSDCGAIDDIYIGHNYTKTGAEAAAVALNAGTDLDCGTVFPKHLG 335
Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
A +Q +DRAL L+S ++LG F+ QP+G IG V +PA + LA +AA
Sbjct: 336 EAAEQGLYTNQTLDRALVRLYSSLVKLGYFD-PAEKQPYGSIGWKDVDTPAAEQLAHKAA 394
Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE 460
+GIVLLKN LPL K+K +LALIGP AN+ K + GNY GP T A +
Sbjct: 395 VEGIVLLKNDQ-TLPL-KAKG-TLALIGPYANATKQMQGNYQGPPKYIRTLEWAATQHGY 451
Query: 461 NTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQEL 520
Y PG S+A A+ AK AD V+ G+D T E E LDR + PG Q L
Sbjct: 452 QVQYSPGTAINNSSTAGFAAALAAAKDADVVLYAGGIDNTIESETLDRTTITWPGNQLSL 511
Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
I+ ++ KP+I++ GG VD T + ++ ++LWAGYP + G A+ +++ G P
Sbjct: 512 ISELSN-LHKPLIVIQFGGGQVDDTPLLTNPHVNALLWAGYPSQEGGAAIFDILTGKAAP 570
Query: 581 GGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYK 639
GRLP+T YP Y +VPMT+M +R A NPGRTYR+Y+ K V PFG GL Y+ +
Sbjct: 571 AGRLPITQYPAAYTAQVPMTEMGLR--AGGDNPGRTYRWYD-KAVVPFGFGLHYTSFE-- 625
Query: 640 FKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKF-LVTIGVKNHGEMAGK 698
VS ++ L ++ +V G + F + V+N G +
Sbjct: 626 ---VSWDRGRLGPYNTAALVNRAP----------GGSHVDRALFDTFRVQVQNTGTVTSD 672
Query: 699 HPVLLFVKPARRG-NGRPIKQLVGF---QSVILNAKEKAEIVFELSPCESLSRAREDGLM 754
+ LLFVK G P+K LVG+ Q V + EI L +++R +G +
Sbjct: 673 YVALLFVKTEDAGPEPYPLKTLVGYTRVQQVKPGERRSVEIEVTLG---AMARTAANGDL 729
Query: 755 VIEEGTHFLV--VGDEEYPIS 773
V+ G + L VG+ YP +
Sbjct: 730 VLYPGKYTLQVDVGERGYPTA 750
>gi|121712174|ref|XP_001273702.1| beta-xylosidase [Aspergillus clavatus NRRL 1]
gi|119401854|gb|EAW12276.1| beta-xylosidase [Aspergillus clavatus NRRL 1]
Length = 803
Score = 471 bits (1213), Expect = e-130, Method: Compositional matrix adjust.
Identities = 297/741 (40%), Positives = 411/741 (55%), Gaps = 46/741 (6%)
Query: 46 CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
C T+ ++ RA+ LV ++ EK++ AP +PRLG+PAY WWSEALHGVAG G+
Sbjct: 75 CDTSRDVTTRAQSLVDAMSFAEKVNNTQYEAPGVPRLGLPAYNWWSEALHGVAGA-PGVH 133
Query: 106 F--NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
F +G ATSF Q IL ASFD L ++ +G E RA NAG+A G+ +W PNIN
Sbjct: 134 FADSGPFSYATSFAQPILLGASFDDELVKQVATVVGTEGRAFGNAGRA-GLDYWTPNINP 192
Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
FRDPRWGRGQETPGEDPL +Y V G+QG G + Q +A CKHF AYD++
Sbjct: 193 FRDPRWGRGQETPGEDPLHVSRYVYHLVDGLQG-----GIGPARPQIAATCKHFAAYDME 247
Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
+W G +R++FDARV+ QDLA+ Y P F+SCV+ + +MC+YN +NG+P+CAD LL
Sbjct: 248 DWNGVSRHEFDARVSTQDLAEFYLPSFKSCVRDAQVDAVMCSYNALNGVPTCADPYLLQT 307
Query: 284 TARRQWGFH---GYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTK 340
R W + ++ SDC A+ IY Y K+ +A L AG D++CG+ KH
Sbjct: 308 LLREHWDWDQPGHWVVSDCGAIDDIYIGHNYTKTGAEAAAVALNAGTDLDCGTVFPKHLG 367
Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
A +Q +DRAL L+S ++LG F+ QP+G IG V +PA + LA +AA
Sbjct: 368 EAAEQGLYTNQTLDRALVRLYSSLVKLGYFD-PAEKQPYGSIGWKDVDTPAAEQLAHKAA 426
Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE 460
+GIVLLKN LPL K+K +LALIGP AN+ K + GNY GP T A +
Sbjct: 427 VEGIVLLKNDQ-TLPL-KAKG-TLALIGPYANATKQMQGNYQGPPKYIRTLEWAATQHGY 483
Query: 461 NTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQEL 520
Y PG S+A A+ AK AD V+ G+D T E E LDR + PG Q L
Sbjct: 484 QVQYSPGTAINNSSTAGFAAALAAAKDADVVLYAGGIDNTIESETLDRTTITWPGNQLSL 543
Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
I+ ++ KP+I++ GG VD T + ++ ++LWAGYP + G A+ +++ G P
Sbjct: 544 ISELSN-LHKPLIVIQFGGGQVDDTPLLTNPHVNALLWAGYPSQEGGAAIFDILTGKAAP 602
Query: 581 GGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYK 639
GRLP+T YP Y +VPMT+M +R A NPGRTYR+Y+ K V PFG GL Y+ +
Sbjct: 603 AGRLPITQYPAAYTAQVPMTEMGLR--AGGDNPGRTYRWYD-KAVVPFGFGLHYTSFE-- 657
Query: 640 FKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKF-LVTIGVKNHGEMAGK 698
VS ++ L ++ +V G + F + V+N G +
Sbjct: 658 ---VSWDRGRLGPYNTAALVNRAP----------GGSHVDRALFDTFRVQVQNTGTVTSD 704
Query: 699 HPVLLFVKPARRG-NGRPIKQLVGF---QSVILNAKEKAEIVFELSPCESLSRAREDGLM 754
+ LLFVK G P+K LVG+ Q V + EI L +++R +G +
Sbjct: 705 YVALLFVKTEDAGPEPYPLKTLVGYTRVQQVKPGERRSVEIEVTLG---AMARTAANGDL 761
Query: 755 VIEEGTHFLV--VGDEEYPIS 773
V+ G + L VG+ YP +
Sbjct: 762 VLYPGKYTLQVDVGERGYPTA 782
>gi|389748262|gb|EIM89440.1| hypothetical protein STEHIDRAFT_182874, partial [Stereum hirsutum
FP-91666 SS1]
Length = 772
Score = 471 bits (1211), Expect = e-130, Method: Compositional matrix adjust.
Identities = 283/733 (38%), Positives = 398/733 (54%), Gaps = 39/733 (5%)
Query: 46 CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVA---GVGK 102
C TT RA L+ L + ++ VN +P + RLG+P Y+WW+EALHGV GV
Sbjct: 39 CNTTAHFVDRATSLIEEFNLTDLVNNTVNGSPGVDRLGLPPYQWWNEALHGVGSSPGVNW 98
Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
G + ATSFP IL A+F+ L I I EARA N A G+TF+ PNIN
Sbjct: 99 GSGPDANFTSATSFPAPILLGATFNDSLIASIADVISTEARAFNNFNYA-GLTFFTPNIN 157
Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL 222
FRDPRWGRGQETPGEDP +Y YV G+QG G + A CKH AYD+
Sbjct: 158 PFRDPRWGRGQETPGEDPYHLSRYVYQYVVGLQG----GLSPDPYYKVLANCKHVLAYDV 213
Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
+NW+G R F+A VT QDL++ Y P F+ C++ + + MC+YN VNG+PSCA +L
Sbjct: 214 ENWEGNDRTGFNAVVTTQDLSEFYTPSFQGCLRDAQGASAMCSYNAVNGVPSCASSYILK 273
Query: 283 KTARRQWGF---HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT 339
R WG G+IT DC AV IY GY + +A + AG D++CG +
Sbjct: 274 DLVRDFWGLGEREGWITGDCGAVQNIYQPHGYTDTLVNATAVAMDAGTDLDCGDVYSPNL 333
Query: 340 KAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQA 399
AV + + +I AL L+ +RLG F+ QP+ V +P+ Q LA A
Sbjct: 334 WTAVVEGLITAGQIQTALIRLYGSLIRLGYFD-PAEQQPYRSFDWSNVNTPSSQDLAYNA 392
Query: 400 AQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV 459
A GIVLL+N GLLPL + ++ALIGP AN+ +L GNYAG + I+P QA +
Sbjct: 393 AVQGIVLLEN-DGLLPL-STNVKNIALIGPMANATLSLQGNYAGIAPFVISPQQAFETAG 450
Query: 460 ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQE 519
N + G ++ +A++ A+GAD VV + G+D + E E DR + PG Q +
Sbjct: 451 YNVTFAFGTGISNSDNSGYSEALEAAQGADVVVFVGGIDNSIEAEGQDRTSIEWPGSQLD 510
Query: 520 LITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHN 579
LI ++ E KP+++V + GG D + K + + ++LWAGYPG++G AL ++I G +
Sbjct: 511 LIGQLGELG-KPLVVVRMGGGQCDDSTLKANATVNALLWAGYPGQSGGTALVDIISGKQS 569
Query: 580 PGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSY 638
P GRLP+T YP Y+ ++ MTDM +RP +SG+PGRTY++Y G ++PFG G+ Y+ +
Sbjct: 570 PSGRLPVTQYPSSYVSEIDMTDMAIRPN-SSGSPGRTYKWYTGAPIYPFGYGIHYTTF-- 626
Query: 639 KFKAVSQNKLYLNQSSSTKMVENQDVVHY--KSVPELGTEFCETRKFLVTIGVKNHGEMA 696
+L + SSST QD+V KS TE +T LVT N+
Sbjct: 627 --------RLAWSDSSSTTY-NIQDIVSSANKSGGFADTEILDTFSLLVTNTGSNY---T 674
Query: 697 GKHPVLLFVKPARRGNGRPIKQLVGFQSV---ILNAKEKAEIVFELSPCESLSRAREDGL 753
+ LLF + P+++LVG+ V AE+ L S+SR E+G
Sbjct: 675 SDYVALLFANSTSGPSPAPLQELVGYTRVPHITPGGTATAELNVTLG---SISRVDENGN 731
Query: 754 MVIEEGTHFLVVG 766
++ GT+ L VG
Sbjct: 732 WILYPGTYNLWVG 744
>gi|449531013|ref|XP_004172482.1| PREDICTED: beta-D-xylosidase 1-like, partial [Cucumis sativus]
Length = 534
Score = 470 bits (1210), Expect = e-129, Method: Compositional matrix adjust.
Identities = 242/535 (45%), Positives = 339/535 (63%), Gaps = 12/535 (2%)
Query: 242 LADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDA 301
L DTY PF++CV +G+ + +MC+YN+VNG P+CAD +LL T R WG GYI SDCD+
Sbjct: 1 LEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDLLKNTIRGAWGLDGYIVSDCDS 60
Query: 302 VSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLF 361
V ++YD++ + +PE+A +KAG+D++CG FL HT AV + L E +++ AL NL
Sbjct: 61 VGVLYDSQHFTPTPEEAAASTIKAGLDLDCGPFLAVHTATAVGRGLLKEVDLNNALANLL 120
Query: 362 SVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKS 421
SV+MRLG+F+G P QP+G +G VC+PAH+ LAL+AA+ GIVLL+N G LPL ++
Sbjct: 121 SVQMRLGMFDGEPAAQPYGNLGPKDVCTPAHKHLALEAARQGIVLLQNRAGALPLSPTRH 180
Query: 422 VSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVAC-SSASIDK 480
++A+IGPN+++ T++GNYAG +C TP+Q + YV+ T++ GC VAC I +
Sbjct: 181 RTVAVIGPNSDATVTMIGNYAGVACEYTTPVQGISKYVK-TIHAKGCANVACVGDQLIGE 239
Query: 481 AVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGG 540
A A+ AD V+++GLDQ+ E E DR ++LPG+Q+EL+ R+ A K P ++VL+ GG
Sbjct: 240 AEAAARVADAAVVVVGLDQSIEAESRDRNGVLLPGKQEELVRRIGLACKGPTVVVLMSGG 299
Query: 541 PVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMT 599
P+D++FAK D I ILW GYPG+AG A+A+V+FG NPGG+LPMTWYPQ Y+ KVPMT
Sbjct: 300 PIDVSFAKNDGKISGILWVGYPGQAGGAAIADVLFGATNPGGKLPMTWYPQSYLAKVPMT 359
Query: 600 DMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMV 659
+M +RP ++G PGRTYRFY+G VFPFG GLSYSK+S F S S
Sbjct: 360 NMGLRPDPSTGYPGRTYRFYKGPVVFPFGFGLSYSKFSQSFAEAPTKISLPLSSLSPNSS 419
Query: 660 ENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQL 719
V H T+ + I VKN G + G H +L+F + P K L
Sbjct: 420 ATVKVSH--------TDCASVSDLPIMIDVKNTGTVDGSHTILVFSTVPNQ-TWSPEKHL 470
Query: 720 VGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
+GF+ V L A + + + C+ LSR E G I G H L +GD + IS+
Sbjct: 471 IGFEKVHLIAGSQKRVRIGIHVCDHLSRVDEFGTRRIPMGEHKLHIGDLTHSISL 525
>gi|226491558|ref|NP_001146416.1| uncharacterized protein LOC100279996 [Zea mays]
gi|223975771|gb|ACN32073.1| unknown [Zea mays]
Length = 507
Score = 468 bits (1205), Expect = e-129, Method: Compositional matrix adjust.
Identities = 243/518 (46%), Positives = 334/518 (64%), Gaps = 15/518 (2%)
Query: 263 MCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDV 322
MC+YN+VNG P+CAD++LLS R W +GYI+SDCD+V ++Y+ + Y K+PEDA
Sbjct: 1 MCSYNQVNGKPTCADKDLLSGVIRGDWKLNGYISSDCDSVDVLYNNQHYTKTPEDAAAIS 60
Query: 323 LKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKI 382
+KAG+D+NCG+FL +HT AAV+ KL ES++DRA+ N MRLG F+G+P PFG +
Sbjct: 61 IKAGLDLNCGTFLAQHTVAAVQAGKLSESDVDRAVTNNLVTLMRLGFFDGDPRELPFGNL 120
Query: 383 GADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA 442
G VC+P++Q LA +AA+ GIVLLKN+ G LPL + S+A+IGPNAN++ T++GNY
Sbjct: 121 GPSDVCTPSNQELAREAARQGIVLLKNT-GKLPLSATSIKSMAVIGPNANASFTMIGNYE 179
Query: 443 GPSCRSITPLQALQNYVENTVYYPGCDTVACSSASI--DKAVDIAKGADHVVLMMGLDQT 500
G C+ TPLQ L V TVY PGC V CS S+ D A A AD VL++G DQ+
Sbjct: 180 GTPCKYTTPLQGLGANVA-TVYQPGCTNVGCSGNSLQLDAATKAAASADVTVLVVGADQS 238
Query: 501 QEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAG 560
E+E LDR L+LPG+Q +L++ VA A+ P ILV++ GGP DI+FAK I +ILW G
Sbjct: 239 IERESLDRTSLLLPGQQPQLVSAVANASSGPCILVVMSGGPFDISFAKSSDKIAAILWVG 298
Query: 561 YPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYE 620
YPGEAG A+A+V+FG HNP GRLP+TWYP+ + KVPMTDM+MRP ++G PGRTYRFY
Sbjct: 299 YPGEAGGAAIADVLFGYHNPSGRLPVTWYPESFTKVPMTDMRMRPDPSTGYPGRTYRFYT 358
Query: 621 GKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTE--FC 678
G V+ FG GLSY+ +++ + + +L L + + Q P + E C
Sbjct: 359 GDTVYAFGDGLSYTSFAHHLVSAPK-QLALQLAEGHACLTEQ-------CPSVEAEGAHC 410
Query: 679 ETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFE 738
E F V + V+N GE +G H V LF P N P K L+GF+ V L + + F+
Sbjct: 411 EGLAFDVHLRVRNAGERSGGHTVFLFSSPPAVHNA-PAKHLLGFEKVSLEPGQAGVVAFK 469
Query: 739 LSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
+ C+ LS E G + G+H L VGD ++ +++ V
Sbjct: 470 VDVCKDLSVVDELGNRKVALGSHTLHVGDLKHTLNLGV 507
>gi|452846807|gb|EME48739.1| glycoside hydrolase family 3 protein [Dothistroma septosporum
NZE10]
Length = 802
Score = 468 bits (1203), Expect = e-129, Method: Compositional matrix adjust.
Identities = 281/743 (37%), Positives = 404/743 (54%), Gaps = 31/743 (4%)
Query: 46 CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
C TT RA L++ TL EK++ +++P +PRLG+PAY WW EALHGVA G+
Sbjct: 39 CDTTADPLTRATALINAFTLQEKLNNTGSTSPGVPRLGLPAYTWWQEALHGVAS-SPGVN 97
Query: 106 FN--GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
F+ G R ATSFPQ IL A+FD L + I EARA N +A G+ FW PNIN
Sbjct: 98 FSDSGPFRYATSFPQPILMGAAFDDDLIRDVATVISTEARAFNNDKRA-GLDFWTPNINP 156
Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
F+D RWGRGQETPGEDP Y + + G+QG + K + A CKHF AYD++
Sbjct: 157 FKDSRWGRGQETPGEDPYHLSSYVAALIEGLQGSPDDKYK-----RVVATCKHFVAYDME 211
Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
+W G RY+FDA+V+ QDL + Y PPF+ C + MC+YN +NG+P+CAD LL
Sbjct: 212 SWNGNFRYQFDAQVSSQDLVEYYMPPFQQCARDSNVGAFMCSYNALNGVPTCADPWLLQT 271
Query: 284 TARRQWGF---HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTK 340
R +W + ++TSDCDAV ++ YA + E+A LKAG D+NCG++ Q H
Sbjct: 272 VLREKWNWTSEQQWVTSDCDAVQNVFLPHDYASTREEAAALSLKAGTDINCGTYYQDHLP 331
Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
AA Q + +++D +L +S +RLG F+G P+ + + V +P Q LA +AA
Sbjct: 332 AAYDQGLINTTDLDISLIRQYSSLVRLGYFDG--LAVPYRNLTWNDVSTPHAQQLAYKAA 389
Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE 460
+GI LLKN G+LPL S S+ALIG AN+ +LGNY G +PL A Q
Sbjct: 390 AEGITLLKND-GVLPLTISNGTSIALIGDWANATDQMLGNYDGIPPFFHSPLYAAQQTGA 448
Query: 461 NTVYYPGCDTVACSSASIDKAVDI---AKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQ 517
+ G + D + + A +D ++ G+D + E E +DRV L G Q
Sbjct: 449 TVNFATGPG--GQGDPTTDHWLPVWAAANKSDVIIYAGGIDNSVESEGMDRVSLTWTGAQ 506
Query: 518 QELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGD 577
++I ++A KPVI++ + GG +D + + N+ +++W GYPG+ G VAL ++I G
Sbjct: 507 LDMIGQLAMYG-KPVIVLQMGGGQIDSSPLVNNPNVSALIWGGYPGQDGGVALFDIIRGI 565
Query: 578 HNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKY 636
P GRLP T YP YI +VPMTDM +RP +T+G+PGRTY +Y VFP+G GL Y+ +
Sbjct: 566 TAPAGRLPTTQYPAKYISQVPMTDMTLRPNSTTGSPGRTYIWYNENAVFPYGLGLHYTNF 625
Query: 637 SYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMA 696
+ K S Y + SS++ + + C F V+I N GE+
Sbjct: 626 TAAIKP-SFPSTYDSSSSNSGSASYDISTLTSNCTATYKDLCPFTSFSVSI--TNTGEIM 682
Query: 697 GKHPVLLFVKPARRGNGRPIKQLVGFQ---SVILNAKEKAEIVFELSPCESLSRAREDGL 753
+ L F+ P K+LV +Q ++ + + A + L SL+R E G
Sbjct: 683 SDYVTLGFLAGIHGPAPHPNKRLVSYQRLHNITAGSSQTAWLNLTLG---SLARVDEMGN 739
Query: 754 MVIEEGTHFLVVGDEEYPISIFV 776
V+ G + L+V + + F
Sbjct: 740 KVLYPGDYALLVDTQPLAMGNFT 762
>gi|392570764|gb|EIW63936.1| glycoside hydrolase family 3 protein [Trametes versicolor FP-101664
SS1]
Length = 781
Score = 467 bits (1202), Expect = e-128, Method: Compositional matrix adjust.
Identities = 289/734 (39%), Positives = 410/734 (55%), Gaps = 35/734 (4%)
Query: 46 CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
C T RA L+S T +E + VN++P +PRLG+PAY WWSE LHGVA G+
Sbjct: 41 CDVTKDPITRATALISIWTDEELTNNTVNASPGVPRLGLPAYNWWSEGLHGVA-QSPGVT 99
Query: 106 F--NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
F +G ATSFPQ IL A+FD L I + E RA NAG+A G+ +W PNIN
Sbjct: 100 FAPSGNFSYATSFPQPILMGAAFDDPLIQAIATIVSTEGRAFNNAGRA-GLDYWTPNINP 158
Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
F+DPRWGRGQETPGEDP +Y + + G+QG G K + A CKHF AYD+D
Sbjct: 159 FKDPRWGRGQETPGEDPFHLSQYVYNLILGLQG----GLDPKPYFKVVADCKHFAAYDMD 214
Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
NW+G RY F+A V+ QDL++ Y PPF++CV+ + + +MC+YN VNGIPSCA+ LL
Sbjct: 215 NWEGVVRYGFNAVVSQQDLSEFYLPPFQTCVRDAKVASVMCSYNAVNGIPSCANSFLLQD 274
Query: 284 TARRQWGFHG--YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA 341
R WGF ++TSDCDAV I+ Y P A D L AG D++CG+F +
Sbjct: 275 VLRDHWGFTDDRWVTSDCDAVQNIFTPHNYTTDPAQAAADALLAGTDIDCGTFSSTYLPE 334
Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQ 401
A+++ + +++ RA ++ +RLG F+ +P QP+ ++G V + Q LA AA
Sbjct: 335 ALQRGLVNSTDLRRAAIRQYASLVRLGYFD-DPAAQPYRQLGWSDVNTLQAQQLAHTAAV 393
Query: 402 DGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN 461
+G+VLLKN GLLPL K + LALIGP AN+ + L GNY G + ++P+Q Q
Sbjct: 394 EGMVLLKND-GLLPLSK-RVRKLALIGPWANATRLLQGNYFGIAPYLVSPVQGAQQAGFE 451
Query: 462 TVYYPGCD-TVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQEL 520
Y G + T ++ AV AK AD VV GLD+T E+EE+DR+++ PG Q +L
Sbjct: 452 VEYVFGTNVTTRNDTSGFAAAVAAAKRADAVVFAGGLDETVEREEIDRLNVTWPGNQLDL 511
Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
+ + E KP+I+ GG +D T K + + +I+W GYPG++G AL +++ G P
Sbjct: 512 VAEL-ERVGKPLIVAQFGGGQLDNTALKRSKAVNAIIWGGYPGQSGGTALFDILTGKAAP 570
Query: 581 GGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYK 639
GRLP+T YP Y +VPMTDM +RP AT NPGRTY++Y G VF FG GL Y+ +++
Sbjct: 571 AGRLPITQYPAAYAEQVPMTDMTLRPSAT--NPGRTYKWYSGTPVFEFGFGLHYTTFAFA 628
Query: 640 F--------KAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKN 691
+ S + S S + Q+ + + L T + V N
Sbjct: 629 WAAPGAAADSTASFGGPAKSYSISQLVAHGQESAAFLDLAPLDT---------FAVRVTN 679
Query: 692 HGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRARED 751
G++A + LLFV + P K LV + + A + + +++RA E+
Sbjct: 680 TGKVASDYVALLFVSGSFGPAPHPKKTLVAYTRIHGLAPRGSTVGQLPVTLGAIARADEN 739
Query: 752 GLMVIEEGTHFLVV 765
G + GT+ L +
Sbjct: 740 GEKWVHPGTYTLAL 753
>gi|451992719|gb|EMD85198.1| glycoside hydrolase family 3 protein [Cochliobolus heterostrophus
C5]
Length = 781
Score = 467 bits (1201), Expect = e-128, Method: Compositional matrix adjust.
Identities = 279/732 (38%), Positives = 402/732 (54%), Gaps = 42/732 (5%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
C + RA+ LV+ TL+EKI+ NSAP + RLG+P Y+WW+E LHG+AG
Sbjct: 36 ICDPSASTLARAKSLVALYTLEEKINATSNSAPGVARLGVPPYQWWNEGLHGIAGPFTSF 95
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIF 164
G +TSFPQ IL A+FD L + + I EARA NA + G+ FW PNIN F
Sbjct: 96 AKQGDYSYSTSFPQPILMGAAFDDDLITEVAKVISTEARAFNNANRT-GLDFWTPNINPF 154
Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
RDPRWGRGQETPGED Y + + G+QG+ + + + A CKH+ YD++N
Sbjct: 155 RDPRWGRGQETPGEDSYHLSSYVKALIHGLQGNATDPYR-----RVVATCKHYAGYDIEN 209
Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
W G RY+ D +++ QDL + Y PFE+CV Q MC+YN VNG P CAD LL
Sbjct: 210 WNGNLRYQNDVQISQQDLVEYYLAPFEACV-QANVGAFMCSYNAVNGAPPCADPYLLQTV 268
Query: 285 ARRQWGFHG---YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA 341
R WG+ ++TSDCDA+ +Y ++ + E A D L AG D++CG++LQ H
Sbjct: 269 LREHWGWSSDDHWVTSDCDAIQNVYLPHQWSSTREGAAADSLNAGTDLDCGTYLQTHLPG 328
Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQ 401
AVKQ E+ +D+AL +S ++LG F+ P QP+ ++G D V + A Q LAL+AA+
Sbjct: 329 AVKQGLTDETTLDKALIRQYSSLIKLGYFDA-PENQPYRQLGFDAVATSASQALALKAAE 387
Query: 402 DGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN 461
+GIVLLKN G+LP+ S + + G AN+ L GNY G + +PL ALQN +
Sbjct: 388 EGIVLLKND-GVLPI-NLGSKQVGIYGDWANATSQLQGNYFGVAKFLTSPLMALQNLGVD 445
Query: 462 TVY---YPG--CDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGR 516
Y PG D + +S+ + +D + + G+D E E+ DR L L G
Sbjct: 446 VKYAGNLPGGQGDPTTGAWSSLSGVITT---SDVHIWVGGIDNGVESEDRDRSWLTLTGG 502
Query: 517 QQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFG 576
Q ++I ++A+ KPVI+V++ GG +D + + I ++LWAGYPG+ G A+ ++ G
Sbjct: 503 QLDVIGQLADTG-KPVIVVIMGGGQIDTSPLIRNPKISAVLWAGYPGQDGGTAIVNILTG 561
Query: 577 DHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSK 635
P GRLP T YP Y+ +VPMTDM MRP + NPGRTY++Y G+ +F FG GL Y+
Sbjct: 562 KAAPAGRLPQTQYPSKYVSEVPMTDMAMRP--SDKNPGRTYKWYTGEPIFEFGYGLHYTN 619
Query: 636 YSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFL-VTIGVKNHGE 694
+S NQ + + D+V K G F E F +T+ V+N G+
Sbjct: 620 FSASIT---------NQPKQSYAIS--DLV--KGCNSTGG-FLERCPFTGITVSVQNTGK 665
Query: 695 MAGKHPVLLFVKPARRGNGRPIKQLVGFQSVI-LNAKEKAEIVFELSPCESLSRAREDGL 753
++ + L F+ + P K LV + + + A + L+ SL+R E G
Sbjct: 666 ISSDYVTLGFLTGSFGPKPYPKKSLVAYDRLFNIAAGSSSTATLNLT-LASLARVDESGN 724
Query: 754 MVIEEGTHFLVV 765
V+ G + L +
Sbjct: 725 KVLYPGDYELQI 736
>gi|398403795|ref|XP_003853364.1| putative xylan 1,4-beta-Xylosidase [Zymoseptoria tritici IPO323]
gi|339473246|gb|EGP88340.1| putative xylan 1,4-beta-Xylosidase [Zymoseptoria tritici IPO323]
Length = 785
Score = 466 bits (1199), Expect = e-128, Method: Compositional matrix adjust.
Identities = 289/743 (38%), Positives = 399/743 (53%), Gaps = 49/743 (6%)
Query: 46 CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
C T RA L++ T++EKI+ ++AP +PRLG+PAY WW EALHGVA G+
Sbjct: 39 CDFTADPLTRATALIAAFTIEEKINNTGSTAPGVPRLGLPAYTWWQEALHGVA-QSPGVN 97
Query: 106 FN--GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
F+ G R ATSFPQ IL A+FD L + I EARA N ++ G+ +W PNIN
Sbjct: 98 FSDSGDFRYATSFPQPILMGAAFDDDLIKDVATVISTEARAFNNDARS-GLDYWTPNINP 156
Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
F+D RWGRGQETPGEDP Y S + G+QGD GK K + A CKHF AYDL+
Sbjct: 157 FKDSRWGRGQETPGEDPYHLSSYVKSLIAGLQGD----GKYK---KVVATCKHFVAYDLE 209
Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
W G RY+FD V Q+L + Y PPF++C + MC+YN +NGIP+CAD LL
Sbjct: 210 TWNGNFRYQFDPHVGSQELVEYYMPPFQACARDANVGAFMCSYNSLNGIPTCADPYLLQT 269
Query: 284 TARRQWGF---HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTK 340
R W + ++TSDCD++ +Y Y + E+AV LKAG DVNCG++ Q+
Sbjct: 270 ILREHWNWTSEEQWVTSDCDSIQNVYLPHEYTSTREEAVAVSLKAGTDVNCGTYYQEFLP 329
Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
A+ + E +ID AL +S +RLG F+G T + + V +P Q LAL+AA
Sbjct: 330 GALSLGLVTEKDIDMALIRQYSSLVRLGYFDG--TAVEYRSLSWKDVSTPYAQQLALKAA 387
Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE 460
+GI LLKN G+LPL +K +A+IG AN+ + +LGNY G +PL A Q
Sbjct: 388 VEGITLLKND-GILPLAITKDTKIAVIGDWANATEQMLGNYDGIPPYLHSPLWAAQQTGA 446
Query: 461 NTVYY--PG--CDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGR 516
N Y PG D + I AVD AD ++ G+D E E +DRV + G
Sbjct: 447 NVTYSGNPGGQGDPTTNNWLHIWTAVD---EADVILFAGGIDNGVEAEGMDRVSIAWTGA 503
Query: 517 QQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFG 576
Q ++I ++A KPVI+ + VD T ++NI ++LW GYPG+ G VAL ++I G
Sbjct: 504 QLDVIGQLASRG-KPVIVAQMGTNGVDSTPLLNNQNISALLWGGYPGQDGGVALLDIIQG 562
Query: 577 DHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSK 635
P GRLP T YP YI KVPMTDM +RP +T+G PGRTY +Y K VF FG GL Y+
Sbjct: 563 KSAPAGRLPTTQYPASYISKVPMTDMHLRPNSTTGFPGRTYMWYNEKPVFEFGYGLHYTN 622
Query: 636 YSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEM 695
+S S S + HY F + + I V N G +
Sbjct: 623 FSATISPTD------TTSFSIADLTKDCTEHYMDR----CPFADMK-----IAVTNTGNV 667
Query: 696 AGKHPVLLFVKPARRGNGRPIKQLVGFQ---SVILNAKEKAEIVFELSPCESLSRAREDG 752
+ L F+ P K+LV +Q ++ A + + L+ SL+R + G
Sbjct: 668 TSDYVTLGFLAGEHGPAPCPNKRLVNYQRLHNITAGASQTTSLNLTLA---SLARVDDMG 724
Query: 753 LMVIEEGTHFLVVGDEEYPISIF 775
V+ G++ L++ + P+++F
Sbjct: 725 NTVLYPGSYALLI--DTQPLAMF 745
>gi|344302281|gb|EGW32586.1| hypothetical protein SPAPADRAFT_51129 [Spathaspora passalidarum
NRRL Y-27907]
Length = 788
Score = 465 bits (1196), Expect = e-128, Method: Compositional matrix adjust.
Identities = 263/682 (38%), Positives = 385/682 (56%), Gaps = 22/682 (3%)
Query: 46 CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
C LP +QRA+ +V T+DE I+ + N++P + RLG+P Y+WWSE LHG+A
Sbjct: 61 CNPYLPNNQRAKAVVDLFTVDELIANMGNTSPGVERLGLPPYQWWSEGLHGIAR--SNFT 118
Query: 106 FNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFR 165
+G ATSFPQ IL +F+S L+ ++G IG EARA N G+A G+ +++PNIN F+
Sbjct: 119 ASGEYSHATSFPQPILMGGAFNSDLYKQVGNVIGTEARAFNNVGRA-GLDYYSPNINPFK 177
Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
DPRWGRGQE E P++ G YA++YV+G+QG + LQ +A CKHF YD+++W
Sbjct: 178 DPRWGRGQEVASESPVLVGNYALNYVQGLQGG-IDSNPNDDTLQVAATCKHFAGYDMESW 236
Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
K +R ++A ++ QDLAD Y P F+SCV+ +A+G MC+YN +NGIP CA L
Sbjct: 237 KQHSRLGYNAIISDQDLADYYFPTFQSCVRDAKAAGAMCSYNAINGIPVCASEFFLGTVI 296
Query: 286 RRQWGF-HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVK 344
R + F +G I SDCD++ I++ Y + A D +KAG+DVNCG Q + A+
Sbjct: 297 REGFDFQNGVIHSDCDSLYSIWNPHLYVQDLGAAAADGIKAGVDVNCGDTYQNNLGYALG 356
Query: 345 QKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGI 404
K + E +I ++ +S +RLG F+ T + +DV S A+Q LA QAA +GI
Sbjct: 357 NKTINEDQIRASVTRQYSNLIRLGYFDSPQTNKYRTYNWSDVSTSQANQ-LAYQAAVEGI 415
Query: 405 VLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVY 464
LLKN G LP K K ++A+IGP AN+ +LG+YAG I+PLQ Q+ Y
Sbjct: 416 TLLKND-GTLPFNKDKVKNVAVIGPWANATTDMLGDYAGTPPYLISPLQGAQDSGFKVQY 474
Query: 465 YPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRV 524
G + + A++ AKGAD +V G+D + E E LDR L PG Q +L++++
Sbjct: 475 AYGTQINTTLTTNYTAALNAAKGADAIVYFGGIDNSIENEALDRESLAWPGNQLDLVSKL 534
Query: 525 AEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRL 584
+ KP+++V G VD T K + N+ SI++AGYPG++G A+ +V+ G + P GRL
Sbjct: 535 S-GLNKPLVVVQFGAGQVDDTEIKNNNNVNSIVYAGYPGQSGGTAIWDVLNGIYAPAGRL 593
Query: 585 PMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAV 643
T YP Y +VPMTDM +RP+ G PGRT+ +Y G+ V+ FG GL Y+ +S
Sbjct: 594 STTQYPASYADQVPMTDMTLRPR--DGYPGRTFMWYNGEPVYEFGYGLHYTTFSVSLANA 651
Query: 644 SQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLL 703
+ + + KS + T T + +KN G++ + LL
Sbjct: 652 P-------PKGAPQSFNIDQFIAAKSSQYVDTSLITT----FDVNIKNTGKVTSDYAALL 700
Query: 704 FVKPARRGNGRPIKQLVGFQSV 725
+ P K LV F +
Sbjct: 701 YSNTTSGPGPHPNKILVSFDKL 722
>gi|421060771|ref|ZP_15523202.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
B3]
gi|421065248|ref|ZP_15527033.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
A12]
gi|421073214|ref|ZP_15534285.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
A11]
gi|392444242|gb|EIW21677.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
A11]
gi|392454445|gb|EIW31278.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
B3]
gi|392459366|gb|EIW35779.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
A12]
Length = 724
Score = 464 bits (1195), Expect = e-128, Method: Compositional matrix adjust.
Identities = 285/752 (37%), Positives = 411/752 (54%), Gaps = 88/752 (11%)
Query: 41 ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
E F + TL QRA+DLVSR+TL+EK++Q+V +PAIPRLG+PAY WWSEALHGVA
Sbjct: 2 EIFAYQDETLSFEQRAKDLVSRMTLEEKVTQMVYISPAIPRLGVPAYNWWSEALHGVARA 61
Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIG 153
G AT FPQ I AA+FD L + + + I +E RA ++ G G
Sbjct: 62 GV----------ATVFPQAIGLAATFDEKLIFNVAEVISIEGRAKFHEFQRKGDHGIYKG 111
Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
+TFW+PN+NIFRDPRWGRGQET GEDP +TG+ VS+++G+QG K L+A+AC
Sbjct: 112 LTFWSPNVNIFRDPRWGRGQETFGEDPYLTGRLGVSFIKGLQGQD------KKYLRAAAC 165
Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
KHF + + + R+ FDA V+ +DL +TY P F+ CVK+ +M AYNRVNG P
Sbjct: 166 AKHFAVH---SGPESERHSFDAVVSPKDLRETYLPAFKECVKEANVEAVMGAYNRVNGEP 222
Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGS 333
C LL +T RR+WGF G++ SDC A+ ++ S ++V L G D+NCG+
Sbjct: 223 CCGSNMLLKETLRREWGFTGHVVSDCWAIKDFHENHRVTSSAPESVAMALNNGCDLNCGN 282
Query: 334 FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQ 393
+ + A ++ + E I+ A+ L RM+LGLF+ + P+ KIG H+
Sbjct: 283 -MYLNLLIAYQEGLVTEEAINTAVTRLMLTRMKLGLFDTAENV-PYTKIGFHQNDCQEHR 340
Query: 394 VLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQ 453
AL+ ++ +VLLKN + LLPL ++ S+A+IGPNANS + L GNY G + IT L+
Sbjct: 341 EFALEVSKKTLVLLKNENNLLPLDRNTISSIAVIGPNANSREALTGNYCGTASNYITVLE 400
Query: 454 ALQNYV-ENTV--YYPGC-------DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEK 503
++ V ++T+ Y GC + + + +AV A+ AD VV+ MGLD + E
Sbjct: 401 GIREAVGKDTMVSYAQGCHLYRDKAENLGEARDRFAEAVSTAERADIVVMCMGLDASIEG 460
Query: 504 EELD---------RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIG 554
EE D ++ L LPG QQEL+ + + K P+ILVLL G + +T+A I
Sbjct: 461 EEGDVSNEYASGDKLGLNLPGLQQELLEVIYQTGK-PIILVLLAGSALAVTWAA--EKIP 517
Query: 555 SILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGR 614
+I+ A YPG G ALA IFG+++P G+LP+T+Y TD M+ R
Sbjct: 518 AIIQAWYPGAEGGKALASAIFGEYSPVGKLPITFYRTTEELPEFTDYSMK--------NR 569
Query: 615 TYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELG 674
TYR+ + ++PFG GL Y+ ++Y+ Q +L Q S + V+
Sbjct: 570 TYRYMTKEALYPFGYGLGYTTFAYR-----QLQLNRTQISVGENVQG------------- 611
Query: 675 TEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAE 734
++ VKN G A V L++K + PI L G Q V L + E
Sbjct: 612 -----------SVLVKNTGNFASDETVQLYIKDVKASVEVPIWALQGIQKVHLLPGTEQE 660
Query: 735 IVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
+ F L+P L+ E+G ++E G + VG
Sbjct: 661 VFFTLTP-RQLALINEEGNCILEPGVFEIYVG 691
>gi|343172466|gb|AEL98937.1| beta-xylosidase, partial [Silene latifolia]
gi|343172468|gb|AEL98938.1| beta-xylosidase, partial [Silene latifolia]
Length = 374
Score = 464 bits (1193), Expect = e-127, Method: Compositional matrix adjust.
Identities = 208/380 (54%), Positives = 281/380 (73%), Gaps = 7/380 (1%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLG+ YEWWSEALHGV+ VG G F G ATSFPQVI TAASF++ LW IGQA+
Sbjct: 1 RLGLQGYEWWSEALHGVSNVGPGTKFQGAFPAATSFPQVITTAASFNASLWQAIGQAVSD 60
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
EARA+YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP ++ +YA SYV G+QG+ N
Sbjct: 61 EARAMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPTLSAQYAASYVTGLQGNYGN 119
Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
+L+ +ACCKH+TAYDLDNW G R+ F+A+V+ QDL DTY PF++CV +G+ +
Sbjct: 120 ------RLKVAACCKHYTAYDLDNWNGMDRFHFNAKVSKQDLEDTYNVPFKACVLEGKVA 173
Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
+MC+YN+VNG P+CAD ++L T R QW +GYI SDCD+V ++YD + Y ++PE+A
Sbjct: 174 SVMCSYNQVNGKPTCADPDILRNTIRGQWHLNGYIVSDCDSVGVLYDDQHYTRTPEEAAA 233
Query: 321 DVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFG 380
D + AG+D++CG FL HT+ A++Q + E+ +++AL N +V+MRLG+F+G P+ QPFG
Sbjct: 234 DTINAGLDLDCGPFLAVHTEGAIRQGLVTEAAVNQALANTITVQMRLGMFDGEPSAQPFG 293
Query: 381 KIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGN 440
+G VC+PAHQ LALQAA++GIVLLKN G LPL + ++A+IGPNA + T++GN
Sbjct: 294 NLGPRDVCTPAHQDLALQAAREGIVLLKNQVGSLPLSTVRHRNIAVIGPNAQATTTMIGN 353
Query: 441 YAGPSCRSITPLQALQNYVE 460
YAG +C +PLQ + Y
Sbjct: 354 YAGIACGYTSPLQGISRYAR 373
>gi|23304843|emb|CAD48309.1| beta-xylosidase B [Clostridium stercorarium]
Length = 715
Score = 463 bits (1192), Expect = e-127, Method: Compositional matrix adjust.
Identities = 289/750 (38%), Positives = 415/750 (55%), Gaps = 93/750 (12%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
+ + +RA+DLVSR+T++EK+SQ++ ++PAI RLGIPAY WW+EALHGVA G
Sbjct: 7 YLDPSYSFEERAKDLVSRMTIEEKVSQMLYNSPAIERLGIPAYNWWNEALHGVARAGT-- 64
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTFW 157
AT FPQ I AA+FD L Y++ I E RA Y+A G G+TFW
Sbjct: 65 --------ATMFPQAIGMAATFDEELIYKVADVISTEGRAKYHASSKKGDRGIYKGLTFW 116
Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
+PNINIFRDPRWGRGQET GEDP +T + V++V+G+QG+ L+A CK+
Sbjct: 117 SPNINIFRDPRWGRGQETYGEDPYLTARLGVAFVKGLQGNH------PKYLKAGGMCKNI 170
Query: 218 TAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCAD 277
+ + + R++F+A V+ +DL +TY P F++ V++ + +M AYNR NG P C
Sbjct: 171 LPFTV--VPESLRHEFNAVVSKKDLYETYLPAFKALVQEAKVESVMGAYNRTNGEPCCGS 228
Query: 278 RNLLSKTARRQWGFHGYITSDCDAVSIIY-DAEGYAKSPEDAVVDVLKAGMDVNCGSFLQ 336
+ LLS R +WGF G++ SDC A+ + A +PE A + V + G D+NCG+
Sbjct: 229 KTLLSDILRGEWGFKGHVVSDCWAIRDFHMHHHVTATAPESAALAV-RNGCDLNCGNMF- 286
Query: 337 KHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQ-PFGKIGADVVCSPAHQVL 395
+ A+K+ + E EIDRA+ L RM+LG+F +P Q P+ I + V C H+ L
Sbjct: 287 GNLLIALKEGLITEEEIDRAVTRLMITRMKLGMF--DPEDQVPYASISSFVDCK-EHREL 343
Query: 396 ALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL 455
AL A+ IVLLKN GLLPL + K S+A+IGPNA+S + L+GNY G + +T L +
Sbjct: 344 ALDVAKKSIVLLKND-GLLPLDRKKIRSIAVIGPNADSRQALIGNYEGTASEYVTVLDGI 402
Query: 456 QNYVENTV---YYPGC-------DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE 505
+ + V Y GC + + I +AV A+ AD V++ +GLD T E EE
Sbjct: 403 REMAGDDVRIYYSVGCHLYKDRVENLGEPGDRIAEAVTCAEHADVVIMCLGLDSTIEGEE 462
Query: 506 L---------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSI 556
+ D+ DL LPG+QQEL+ V A KP++LVLL G + +T+A D +I +I
Sbjct: 463 MHESNIYGSGDKPDLNLPGQQQELLEAVY-ATGKPIVLVLLTGSALAVTWA--DEHIPAI 519
Query: 557 LWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTY 616
L A YPG G A+A V+FG+ NP G+LP+T+Y TD M RTY
Sbjct: 520 LNAWYPGALGGRAIASVLFGETNPSGKLPVTFYRTTEELPDFTDYSME--------NRTY 571
Query: 617 RFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTE 676
RF + + ++PFG GLSY+ + Y +S++ + +
Sbjct: 572 RFMKNEALYPFGFGLSYTTFDYSDLKLSKDTIRAGEG----------------------- 608
Query: 677 FCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIV 736
F V++ V N G+MAG+ V +++K P QL G + V L + E AEI
Sbjct: 609 ------FNVSVKVTNTGKMAGEEVVQVYIKDLEASWRVPNWQLSGMKRVRLESGETAEIT 662
Query: 737 FELSPCESLSRAREDGLMVIEEGTHFLVVG 766
FE+ P E L+ ++G VIE G + VG
Sbjct: 663 FEIRP-EQLAVVTDEGKSVIEPGEFEIYVG 691
>gi|347832625|emb|CCD48322.1| glycoside hydrolase family 3 protein [Botryotinia fuckeliana]
Length = 772
Score = 463 bits (1191), Expect = e-127, Method: Compositional matrix adjust.
Identities = 276/742 (37%), Positives = 404/742 (54%), Gaps = 44/742 (5%)
Query: 46 CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
C T+ RA L+S TL EK++ N++P +PR+G+P+YEWW+EALHG+A G
Sbjct: 34 CDTSSDPYTRAAALISLFTLAEKVNNTGNTSPGVPRIGLPSYEWWNEALHGIA-RSPGTT 92
Query: 106 FNGTIRG---ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
F T +TSFPQ IL A+FD L +++ + EARA N + G+ FW PNIN
Sbjct: 93 FAATGSNYSYSTSFPQPILMGATFDDELIHKVATQVSTEARAFNNVNR-FGLNFWTPNIN 151
Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL 222
++DPRWGRGQETPGEDP T Y + + G+QG + KG A CKHF YDL
Sbjct: 152 PYKDPRWGRGQETPGEDPFHTSSYVNALITGLQGGLDDLPYKKGV----ATCKHFAGYDL 207
Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
++ G RY FDA + QDL D Y PPF+ C + +MC+YN +NG+P+CAD LL
Sbjct: 208 ESSDGAIRYGFDAIIKSQDLRDYYLPPFQQCARDSNVQSVMCSYNAMNGVPTCADDWLLQ 267
Query: 283 KTARRQWGF---HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT 339
R WG+ ++TSDCDAV I+D Y +PE + D L AG D++CG+F +
Sbjct: 268 TLLREHWGWTEEDQWVTSDCDAVKNIWDYHNYTLTPEQSAADALNAGTDLDCGTFWPTYL 327
Query: 340 KAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQA 399
+A Q S +DR+L ++ +RLG F+ P++QP+ ++ D V +PA Q LALQA
Sbjct: 328 GSAYDQGLYDISTLDRSLARRYASLVRLGYFDP-PSVQPYRQLNWDNVSTPAAQQLALQA 386
Query: 400 AQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV 459
A+DGIVLLKN G+LPL S ++ALIGP AN+ K + GNY G + +PL A QN
Sbjct: 387 AEDGIVLLKND-GILPL-SSNITNVALIGPLANATKQMQGNYYGTAPYLRSPLIAAQNAG 444
Query: 460 ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQE 519
Y G D + ++ A+ A+ AD V+ + G+D + E EE+DR + P Q
Sbjct: 445 FKVTYVQGADIDSQNTTDFSAAISAAQSADLVIYVGGIDNSIEAEEIDRTSISWPSSQLS 504
Query: 520 LITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHN 579
LI ++A + +I + C +D + + + ++LWAGYPG+ G A+ ++ G
Sbjct: 505 LINQLANLSTPLIISQMGCM--IDSSSLLSNTGVNALLWAGYPGQDGGTAIFNILTGKTA 562
Query: 580 PGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSY 638
P GRLP+T YP +Y+ +V MTDM ++P + NPGRTY++Y G+ VF +G GL Y+ +
Sbjct: 563 PAGRLPITQYPSNYVNQVTMTDMNLQP--SRFNPGRTYKWYNGEPVFEYGYGLQYTTFDA 620
Query: 639 KFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGK 698
K S N + +D+ + +P I V N G
Sbjct: 621 KITPSSPNNTFEISELLANASNYKDLTPFVKIP---------------ITVSNTGTTTSD 665
Query: 699 HPVLLFVKPARRGNGRPIKQLVGF---QSVILNAKEKAEIVFELSPCESLSRAREDGLMV 755
+ L F+ P K LV + + A AE+ L+ SL+R +G ++
Sbjct: 666 YVALFFLSGTFGPAPHPKKSLVAYTRLHDITGGANATAEVSLNLA---SLARGNWNGDLI 722
Query: 756 IEEGTHFLVV---GDEEYPISI 774
+ G + +VV G +E+ ++
Sbjct: 723 LYPGDYKVVVDIDGKDEWSFTL 744
>gi|83774566|dbj|BAE64689.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 822
Score = 462 bits (1190), Expect = e-127, Method: Compositional matrix adjust.
Identities = 288/739 (38%), Positives = 413/739 (55%), Gaps = 45/739 (6%)
Query: 42 TFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVG 101
+ P C T+L I++R LV LTL+EKI LV+++ RLG+P+YEWWSEA HGV G
Sbjct: 76 SHPVCDTSLSIAERVDSLVKSLTLEEKILNLVDASAGSTRLGLPSYEWWSEATHGV-GSA 134
Query: 102 KGIFFN---GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWA 158
G+ F ATSFP ILTAASFD L +I + IG E RA N G + G FWA
Sbjct: 135 PGVQFTSKPANFSYATSFPAPILTAASFDDTLIRKIAEVIGREGRAFGNNGFS-GFDFWA 193
Query: 159 PNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFT 218
PNIN FRDPRWGRGQETPGEDPLV Y ++V G+QGD K K Q A CKH+
Sbjct: 194 PNINGFRDPRWGRGQETPGEDPLVAQNYIRNFVPGLQGD-----DPKNK-QVIATCKHYA 247
Query: 219 AYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADR 278
YDL+ T RY + T QDL+D + PF++CV+ IMC+YN V+GIP+CA+
Sbjct: 248 VYDLE----TGRYGNNYNPTQQDLSDYFLAPFKTCVRDTDVGSIMCSYNSVSGIPACANE 303
Query: 279 NLLSKTARRQWGFHG---YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SF 334
LLS+ R+ W F+ Y+ SDC AV+ I+ + + E A L AG+D+ CG S+
Sbjct: 304 YLLSEVLRKHWNFNSDYHYVVSDCGAVTDIWQYHNFTDTEEAAASVALNAGVDLECGSSY 363
Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
L+ + A Q + +D++L L+S +G F+G + K+ V +P Q
Sbjct: 364 LKLNESLAANQTSV--KVMDQSLARLYSALFTVGFFDGG----KYDKLDFSDVSTPDAQA 417
Query: 395 LALQAAQDGIVLLKNSHGLLPLPK-SKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQ 453
LA +AA +G+ LLKN LLPL K S+A+IGP AN+ + G+Y+G + I+PL+
Sbjct: 418 LAYEAAVEGMTLLKNDD-LLPLDSPHKYKSVAVIGPFANATTQMQGDYSGDAPYLISPLE 476
Query: 454 ALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVL 513
A + Y G +++ ++A+ A +D ++ + G+D + E E LDR L
Sbjct: 477 AFGDSRWKVNYALGTAMNNQNTSGFEEALAAANKSDLIIYLGGIDNSLESETLDRTSLTW 536
Query: 514 PGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEV 573
PG Q +LIT +++ + KP+++V GG VD + +++I +++WAGYP ++G AL +V
Sbjct: 537 PGNQLDLITSLSKLS-KPLVVVQFGGGQVDDSDILKNKDIQALVWAGYPSQSGGTALLDV 595
Query: 574 IFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLS 632
+ G +P GRLP+T YP Y +V + D+ +RP T PGRTY++Y GK V PFG GL
Sbjct: 596 LVGKRSPAGRLPVTQYPASYADQVNIFDINLRP--TDSYPGRTYKWYTGKPVLPFGYGLH 653
Query: 633 YSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNH 692
Y+K+ + ++ LN+ + QD+V G T V + VKN
Sbjct: 654 YTKFMFDWEKT------LNREYNI-----QDLVASCRNSSGGPINDNTPLTTVKVRVKNV 702
Query: 693 GEMAGKHPVLLFVKPARRGNG-RPIKQLVGFQSVILNAKEKAEIVFELS-PCESLSRARE 750
G + LLF+ G RP K LV + +LN ++ V EL SL+RA E
Sbjct: 703 GHKTSDYVSLLFLSSKNAGPAPRPNKSLVSYVR-LLNIARGSDQVAELPLTLGSLARADE 761
Query: 751 DGLMVIEEGTHFLVVGDEE 769
+G +VI G + + + E
Sbjct: 762 NGSLVIFPGRYKIALDHSE 780
>gi|421077748|ref|ZP_15538711.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
JBW45]
gi|392524151|gb|EIW47314.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
JBW45]
Length = 750
Score = 462 bits (1190), Expect = e-127, Method: Compositional matrix adjust.
Identities = 281/752 (37%), Positives = 410/752 (54%), Gaps = 88/752 (11%)
Query: 41 ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
E F + TL QRA+DLVSR+TL+EK++Q+V +PAIPRLG+PAY WWSEALHGVA
Sbjct: 28 EIFDYQDETLSFEQRAKDLVSRMTLEEKVTQMVYISPAIPRLGVPAYNWWSEALHGVARA 87
Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIG 153
G AT FPQ I AA+FD L + + + I +E RA ++ G G
Sbjct: 88 GV----------ATVFPQAIGLAATFDEKLIHDVAEVISIEGRAKFHEFQRKGDHGIYKG 137
Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
+TFW+PN+NIFRDPRWGRGQET GEDP +TG+ VS+++G+QG K L+A+AC
Sbjct: 138 LTFWSPNVNIFRDPRWGRGQETFGEDPYLTGRLGVSFIKGLQGQD------KKYLRAAAC 191
Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
KHF + + + R+ FDA V+ +DL +TY P F+ CVK+ +M AYNRVNG P
Sbjct: 192 AKHFAVH---SGPESERHSFDAVVSPKDLRETYLPAFKECVKEANVEAVMGAYNRVNGEP 248
Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGS 333
C LL +T R++WGF G++ SDC A+ ++ S ++V L G D+NCG+
Sbjct: 249 CCGSNMLLKETLRQEWGFTGHVVSDCWAIKDFHENHRVTSSAPESVALALNNGCDLNCGN 308
Query: 334 FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQ 393
+ + A ++ + E I+ A+ L RM+LGLF+ + P+ IG H+
Sbjct: 309 -MYLNLLIAYQEGLVTEEAINTAVTRLMLTRMKLGLFDTAENV-PYTNIGFHQNDCQEHR 366
Query: 394 VLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQ 453
AL+ ++ +VLLKN + LLPL ++ S+A+IGPNANS + L GNY G + IT L+
Sbjct: 367 EFALEVSKKTLVLLKNENNLLPLDRNTISSIAVIGPNANSREALTGNYCGTASNYITVLE 426
Query: 454 ALQNYV-ENTV--YYPGC-------DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEK 503
++ V ++T+ Y GC + + + +AV A+ AD VV+ MGLD + E
Sbjct: 427 GIREAVGKDTIVSYAQGCHLYRDKAENLGEARDRFAEAVSTAERADIVVMCMGLDASIEG 486
Query: 504 EE---------LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIG 554
EE D++ L LPG QQEL+ + + KP+ILVLL G + +T+A +
Sbjct: 487 EEGDVSNEYASGDKLGLNLPGLQQELLEVIYQTG-KPIILVLLAGSALAVTWAA--EKVP 543
Query: 555 SILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGR 614
+I+ A YPG G ALA IFG+++P G+LP+T+Y TD M+ R
Sbjct: 544 AIIQAWYPGAEGGKALASAIFGEYSPVGKLPITFYRTTEELPEFTDYSMK--------NR 595
Query: 615 TYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELG 674
TYR+ + ++PFG GL Y+ ++Y+ +L LN+
Sbjct: 596 TYRYMTKEALYPFGYGLGYTTFAYR-------QLQLNR---------------------- 626
Query: 675 TEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAE 734
T+ ++ VKN G A V L++K + PI +L G Q V L + E
Sbjct: 627 TQISAGENVQCSVLVKNTGNFASDETVQLYIKDVKASVEVPILELQGIQKVHLLPGTEQE 686
Query: 735 IVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
+ F L+P L+ E+G ++E G + VG
Sbjct: 687 VFFTLTP-RQLALINEEGNCILEPGAFEIYVG 717
>gi|403412992|emb|CCL99692.1| predicted protein [Fibroporia radiculosa]
Length = 760
Score = 462 bits (1189), Expect = e-127, Method: Compositional matrix adjust.
Identities = 286/733 (39%), Positives = 408/733 (55%), Gaps = 46/733 (6%)
Query: 46 CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
C T+ RA L+ TL+EKI+ N++P +PRLG+PAY+WW EALHGVA G+
Sbjct: 34 CDTSASPVARATALIGLFTLEEKINNTGNTSPGVPRLGLPAYQWWQEALHGVA-ESPGVI 92
Query: 106 F--NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
F G ATSFPQ IL A+FD L ++ + EARA NA ++ G+ FW PNIN
Sbjct: 93 FAETGEYSYATSFPQPILMGAAFDDELINQVATIVSTEARAFNNANRS-GLDFWTPNINP 151
Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
F+DPRWGRGQETPGEDP Y + + G+QG G + A CKH+ YDL+
Sbjct: 152 FKDPRWGRGQETPGEDPFHLQSYVYNLITGLQG-----GLDPEYKRIVATCKHYAGYDLE 206
Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
NW+G RY FDA +++QDL++ Y FE+C + MC+YN VNG+PSCA+ LL
Sbjct: 207 NWEGNVRYGFDALISIQDLSEFYTRSFETCARDANVGAFMCSYNAVNGVPSCANSYLLQD 266
Query: 284 TARRQWGFHG---YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTK 340
R W + +ITSDCDA+ IY+ YA + E V D L AG D++CG++ ++
Sbjct: 267 ILRGHWNWTSDDQWITSDCDAIQNIYEPHYYAPTRELTVADALNAGADLDCGTYYPENLG 326
Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
AA + ES +DRAL ++ ++LG F+ QP+ +IG V +P + LA +AA
Sbjct: 327 AAYDEGLFAESTLDRALIRQYASLVKLGYFDPAEN-QPYRQIGWANVSTPEAEELAYRAA 385
Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQ--NY 458
+GI L+KN G LPL S SLALIGP AN+ + GNY G I+PL A + NY
Sbjct: 386 VEGITLIKND-GTLPLSPSIK-SLALIGPWANATTQMQGNYYGQPPYLISPLMAAEALNY 443
Query: 459 VENTVYY-PGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQ 517
TVYY PG +++S A A+ AD ++ + G+D T E E +DR L PG Q
Sbjct: 444 ---TVYYSPGPGVDDPTTSSFPAAFAAAQAADAIIYIGGIDTTVEAEAMDRYTLDWPGVQ 500
Query: 518 QELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGD 577
+ I ++++ KP++++ + GG VD + + N+ +++W GYPG++G AL ++I G+
Sbjct: 501 PDFIDQLSQFG-KPLVVLQMGGGQVDDSCLLPNTNVNALIWGGYPGQSGGTALMDIIVGN 559
Query: 578 HNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKY 636
P GRLP T YP DY+ +V MTDM +RP AT NPGRTY +Y G + FG GL Y+ +
Sbjct: 560 AAPAGRLPTTQYPLDYVYQVAMTDMSLRPSAT--NPGRTYMWYTGTPIVEFGFGLHYTNF 617
Query: 637 SYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHG-EM 695
S + L+Q S+ V + V L + C + T+ V N G ++
Sbjct: 618 SAE----------LSQPSAPSYDIASLVGACEGVAHL--DLCAFESY--TVNVTNIGSKV 663
Query: 696 AGKHPVLLFVKPARRGNGRPIKQLVGF---QSVILNAKEKAEIVFELSPCESLSRAREDG 752
+ LLFV P K L + ++ + ++A + L SLSR E G
Sbjct: 664 TSDYVALLFVAGEHGPAPIPNKVLAAYDRLHTIAPLSSQQATLNLTLG---SLSRVDEYG 720
Query: 753 LMVIEEGTHFLVV 765
V+ G + L++
Sbjct: 721 NRVLYPGEYTLIL 733
>gi|391865040|gb|EIT74331.1| beta-glucosidase-related glycosidase [Aspergillus oryzae 3.042]
Length = 822
Score = 462 bits (1188), Expect = e-127, Method: Compositional matrix adjust.
Identities = 288/739 (38%), Positives = 412/739 (55%), Gaps = 45/739 (6%)
Query: 42 TFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVG 101
+ P C T+L I++R LV LTL+EKI LV+++ RLG+P+YEWWSEA HGV G
Sbjct: 76 SHPVCDTSLSIAERVDSLVKSLTLEEKILNLVDASAGSTRLGLPSYEWWSEATHGV-GSA 134
Query: 102 KGIFFN---GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWA 158
G+ F ATSFP ILTAASFD L +I + IG E RA N G + G FWA
Sbjct: 135 PGVQFTSKPANFSYATSFPAPILTAASFDDTLIRKIAEVIGREGRAFGNNGFS-GFDFWA 193
Query: 159 PNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFT 218
PNIN FRDPRWGRGQETPGEDPLV Y ++V G+QGD K K Q A CKH+
Sbjct: 194 PNINGFRDPRWGRGQETPGEDPLVAQNYIRNFVPGLQGD-----DPKNK-QVIATCKHYA 247
Query: 219 AYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADR 278
YDL+ T RY + T QDL+D + PF++CV+ IMC+YN V+GIP+CA+
Sbjct: 248 VYDLE----TGRYGNNYNPTQQDLSDYFLAPFKTCVRDTDVGSIMCSYNSVSGIPACANE 303
Query: 279 NLLSKTARRQWGFHG---YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SF 334
LL + R+ W F+ Y+ SDC AV+ I+ + + E A L AG+D+ CG S+
Sbjct: 304 YLLDEVLRKHWNFNSDYYYVVSDCGAVTDIWQYHNFTDTEEAAASVALNAGVDLECGSSY 363
Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
L+ + A Q + +DR+L L+S +G F+G + K+ V +P Q
Sbjct: 364 LKLNESLAANQTSV--KVMDRSLARLYSALFTVGFFDGG----KYDKLDFSDVSTPDAQA 417
Query: 395 LALQAAQDGIVLLKNSHGLLPLP-KSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQ 453
LA +AA +G+ LLKN LLPL K S+A+IGP AN+ + G+Y+G + I+PL+
Sbjct: 418 LAYEAAVEGMTLLKNDD-LLPLDFPHKYKSVAVIGPFANATTQMQGDYSGDAPYLISPLE 476
Query: 454 ALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVL 513
A + Y G +++ ++A+ A +D ++ + G+D + E E LDR L
Sbjct: 477 AFGDSRWKVNYALGTAINNQNTSGFEEALAAANKSDLIIYLGGIDNSLESETLDRTSLAW 536
Query: 514 PGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEV 573
PG Q +LIT +++ + KP+++V GG VD + +++I +++WAGYP ++G AL +V
Sbjct: 537 PGNQLDLITSLSKLS-KPLVVVQFGGGQVDDSAILKNKDIQALVWAGYPSQSGGTALLDV 595
Query: 574 IFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLS 632
+ G +P GRLP+T YP Y +V + D+ +RP T PGRTY++Y GK V PFG GL
Sbjct: 596 LVGKRSPAGRLPVTQYPASYADQVNIFDINLRP--TDSYPGRTYKWYTGKPVLPFGYGLH 653
Query: 633 YSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNH 692
Y+K+ + ++ LN+ + QD+V G T V VKN
Sbjct: 654 YTKFMFDWEKT------LNREYNI-----QDLVASCRNSSGGPINDNTPLTTVKARVKNV 702
Query: 693 GEMAGKHPVLLFVKPARRGNG-RPIKQLVGFQSVILNAKEKAEIVFELS-PCESLSRARE 750
G + LLF+ G RP K LV + +LN ++ V EL SL+RA E
Sbjct: 703 GHKTSDYVSLLFLSSKNAGPAPRPNKSLVSYVR-LLNIARGSDQVAELPLTLGSLARADE 761
Query: 751 DGLMVIEEGTHFLVVGDEE 769
+G +VI G + + + + E
Sbjct: 762 NGSLVIFPGRYKIALDNSE 780
>gi|392962219|ref|ZP_10327666.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
DSM 17108]
gi|392452977|gb|EIW29882.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
DSM 17108]
Length = 724
Score = 462 bits (1188), Expect = e-127, Method: Compositional matrix adjust.
Identities = 284/752 (37%), Positives = 409/752 (54%), Gaps = 88/752 (11%)
Query: 41 ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
E F + TL QRA+DLVSR+T++EK++Q+V S+PAI RLGIPAY WWSEALHGVA
Sbjct: 2 EIFDYQDETLSFEQRAKDLVSRMTIEEKVTQMVYSSPAISRLGIPAYNWWSEALHGVARA 61
Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIG 153
G AT FPQ I AA+FD L Y + + I +EARA ++ G G
Sbjct: 62 GV----------ATVFPQAIGLAATFDEKLIYDVAEIISIEARAKFHEFQRKGDHGIYKG 111
Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
+TFW+PN+NIFRDPRWGRGQET GEDP +TG+ VS+++G+QG K L+A+AC
Sbjct: 112 LTFWSPNVNIFRDPRWGRGQETFGEDPYLTGRLGVSFIKGLQGQD------KKYLRAAAC 165
Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
KHF + + + R++FDA V+ +DL +TY P F+ CVK+ +M AYNRVNG P
Sbjct: 166 AKHFAVH---SGPESERHRFDAVVSPKDLRETYLPAFKECVKEANVEAVMGAYNRVNGEP 222
Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGS 333
C LL +T R++WGF G++ SDC A+ ++ S ++V L G D+NCG+
Sbjct: 223 CCGSNILLKETLRQEWGFTGHVVSDCWAIKDFHENHRVTSSAPESVALALNNGCDLNCGN 282
Query: 334 FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQ 393
+ + A ++ + E I+ A+ L RM+LGLF+ + P+ IG H+
Sbjct: 283 -MYLNLLIAYQEGLVTEEAINTAVTRLMLTRMKLGLFDAAENV-PYTNIGFHQNDCQEHR 340
Query: 394 VLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQ 453
AL+ ++ +VLLKN + LLPL ++ S+A+IGPNANS + L GNY G + IT L+
Sbjct: 341 EFALEVSKKTLVLLKNENHLLPLDRNTISSIAVIGPNANSREALTGNYFGTASNYITVLE 400
Query: 454 ALQNYV-ENTV--YYPGC-------DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEK 503
++ V ++T+ Y GC + + +AV A+ AD VV+ MGLD + E
Sbjct: 401 GIREAVGKDTMVSYAQGCHLYRDKAENLGEERDRFAEAVSTAERADLVVMCMGLDASIEG 460
Query: 504 EE---------LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIG 554
EE D++ L LPG QQEL+ + + KP+ILVLL G + +T+A +
Sbjct: 461 EEGDVSNEYASGDKLGLNLPGLQQELLEVIYKTG-KPIILVLLAGSALAVTWAA--EKVP 517
Query: 555 SILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGR 614
+I+ A YPG G ALA IFG+++P G+LP+T+Y TD M+ R
Sbjct: 518 AIIQAWYPGAEGGKALASAIFGEYSPVGKLPITFYRTTEELPEFTDYSMK--------NR 569
Query: 615 TYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELG 674
TYR+ + ++PFG GL Y+ ++Y+ +L LN+
Sbjct: 570 TYRYMTKEALYPFGYGLGYTTFAYR-------QLQLNR---------------------- 600
Query: 675 TEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAE 734
T+ C +I VKN G A V L++K + PI L G Q + L + E
Sbjct: 601 TKICAGENVQCSILVKNTGNFASDETVQLYIKDVKASVEVPIWALQGIQKIHLLPGAEQE 660
Query: 735 IVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
I F L+ L+ E G ++E G + VG
Sbjct: 661 ISFTLTS-RQLALINEKGNCILEPGIFEIYVG 691
>gi|317156541|ref|XP_001825822.2| exo-1,4-beta-xylosidase xlnD [Aspergillus oryzae RIB40]
Length = 882
Score = 461 bits (1186), Expect = e-127, Method: Compositional matrix adjust.
Identities = 288/739 (38%), Positives = 413/739 (55%), Gaps = 45/739 (6%)
Query: 42 TFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVG 101
+ P C T+L I++R LV LTL+EKI LV+++ RLG+P+YEWWSEA HGV G
Sbjct: 136 SHPVCDTSLSIAERVDSLVKSLTLEEKILNLVDASAGSTRLGLPSYEWWSEATHGV-GSA 194
Query: 102 KGIFFN---GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWA 158
G+ F ATSFP ILTAASFD L +I + IG E RA N G + G FWA
Sbjct: 195 PGVQFTSKPANFSYATSFPAPILTAASFDDTLIRKIAEVIGREGRAFGNNGFS-GFDFWA 253
Query: 159 PNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFT 218
PNIN FRDPRWGRGQETPGEDPLV Y ++V G+QGD K K Q A CKH+
Sbjct: 254 PNINGFRDPRWGRGQETPGEDPLVAQNYIRNFVPGLQGD-----DPKNK-QVIATCKHYA 307
Query: 219 AYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADR 278
YDL+ T RY + T QDL+D + PF++CV+ IMC+YN V+GIP+CA+
Sbjct: 308 VYDLE----TGRYGNNYNPTQQDLSDYFLAPFKTCVRDTDVGSIMCSYNSVSGIPACANE 363
Query: 279 NLLSKTARRQWGFHG---YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SF 334
LLS+ R+ W F+ Y+ SDC AV+ I+ + + E A L AG+D+ CG S+
Sbjct: 364 YLLSEVLRKHWNFNSDYHYVVSDCGAVTDIWQYHNFTDTEEAAASVALNAGVDLECGSSY 423
Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
L+ + A Q + +D++L L+S +G F+G + K+ V +P Q
Sbjct: 424 LKLNESLAANQTSV--KVMDQSLARLYSALFTVGFFDGG----KYDKLDFSDVSTPDAQA 477
Query: 395 LALQAAQDGIVLLKNSHGLLPLPK-SKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQ 453
LA +AA +G+ LLKN LLPL K S+A+IGP AN+ + G+Y+G + I+PL+
Sbjct: 478 LAYEAAVEGMTLLKNDD-LLPLDSPHKYKSVAVIGPFANATTQMQGDYSGDAPYLISPLE 536
Query: 454 ALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVL 513
A + Y G +++ ++A+ A +D ++ + G+D + E E LDR L
Sbjct: 537 AFGDSRWKVNYALGTAMNNQNTSGFEEALAAANKSDLIIYLGGIDNSLESETLDRTSLTW 596
Query: 514 PGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEV 573
PG Q +LIT +++ + KP+++V GG VD + +++I +++WAGYP ++G AL +V
Sbjct: 597 PGNQLDLITSLSKLS-KPLVVVQFGGGQVDDSDILKNKDIQALVWAGYPSQSGGTALLDV 655
Query: 574 IFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLS 632
+ G +P GRLP+T YP Y +V + D+ +RP T PGRTY++Y GK V PFG GL
Sbjct: 656 LVGKRSPAGRLPVTQYPASYADQVNIFDINLRP--TDSYPGRTYKWYTGKPVLPFGYGLH 713
Query: 633 YSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNH 692
Y+K+ + ++ LN+ + QD+V G T V + VKN
Sbjct: 714 YTKFMFDWEKT------LNREYNI-----QDLVASCRNSSGGPINDNTPLTTVKVRVKNV 762
Query: 693 GEMAGKHPVLLFVKPARRGNG-RPIKQLVGFQSVILNAKEKAEIVFELS-PCESLSRARE 750
G + LLF+ G RP K LV + +LN ++ V EL SL+RA E
Sbjct: 763 GHKTSDYVSLLFLSSKNAGPAPRPNKSLVSYVR-LLNIARGSDQVAELPLTLGSLARADE 821
Query: 751 DGLMVIEEGTHFLVVGDEE 769
+G +VI G + + + E
Sbjct: 822 NGSLVIFPGRYKIALDHSE 840
>gi|378730020|gb|EHY56479.1| beta-glucosidase, variant [Exophiala dermatitidis NIH/UT8656]
gi|378730021|gb|EHY56480.1| beta-glucosidase [Exophiala dermatitidis NIH/UT8656]
Length = 783
Score = 460 bits (1184), Expect = e-126, Method: Compositional matrix adjust.
Identities = 284/741 (38%), Positives = 408/741 (55%), Gaps = 42/741 (5%)
Query: 46 CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
C T ++ RA+ LV+ LT +EK + N++P +PRLG+ +Y+WW EALHGVA G+
Sbjct: 35 CNTNASVADRAKALVAALTNEEKFNLTGNTSPGVPRLGLYSYQWWQEALHGVAS-SPGVN 93
Query: 106 FN--GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
F+ G ATSFPQ IL +A+FD L + + EARA N ++ G+ FW PNIN
Sbjct: 94 FSTSGDFSHATSFPQPILMSAAFDDALINAVATVVSTEARAFNNVNRS-GLDFWTPNINP 152
Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
++DPRWGRGQETPGED Y + + G+QG G + A CKHF AYDL+
Sbjct: 153 YKDPRWGRGQETPGEDTFHLKSYVAALIDGLQG-----GLNPPIKKVIATCKHFVAYDLE 207
Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
+W T RY FDA V+ QDLA+ Y PF++C + R IMC+YN +NG+P+CAD +L
Sbjct: 208 DWITTDRYNFDAIVSTQDLAEYYMQPFQTCARDARVGSIMCSYNAMNGVPTCADPYILQT 267
Query: 284 TARRQWGFHG---YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTK 340
R W + Y+TSDCDA+ IY Y + E AV D L AG D+NCG++ Q H
Sbjct: 268 VLREHWNWTDDGQYVTSDCDAIQNIYAPHYYEPTREQAVADALTAGTDLNCGTYYQTHLP 327
Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
AA + ++ ID+ + L+S ++LG F+ P+ P+ + V +PA + LAL+AA
Sbjct: 328 AAFSEGLFNQTVIDQTITRLYSALIKLGYFDP-PSATPYRSLNWSDVSTPAAEALALKAA 386
Query: 401 QDGIVLLKNSHGLLPL--PKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNY 458
++GIVLLKN GLLPL P K+ ++A+IG AN+ T+ GNY G + +PL ALQ
Sbjct: 387 EEGIVLLKND-GLLPLSFPTDKNTTVAIIGGWANATTTMQGNYFGIAPYLHSPLYALQQL 445
Query: 459 VENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQ 518
Y G V + D+ + A AD +++ GL + E E DR +
Sbjct: 446 PNINAVYGGGFGVPTTDG-WDELLGAAGEADLIIIADGLTTSDESESNDRYTIGWQPAAI 504
Query: 519 ELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDH 578
++I +++ K V L + G +D T + NI +++W GYPG AG AL ++ G
Sbjct: 505 DIINQLSGMGKPTVFLQM--GDQLDNTPLLNNPNISALIWGGYPGMAGGDALINILTGKA 562
Query: 579 NPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYS 637
P GRLP+T YP DY+ +V MTDM++RP ATSGNPGRTY++Y V PFG GL Y+ +S
Sbjct: 563 APAGRLPVTQYPADYVNQVNMTDMELRPNATSGNPGRTYKWYN-NAVLPFGYGLHYTNFS 621
Query: 638 YKFKAVSQNKLYLNQSSS-----------TKMVENQDVVHYKSVPELGTEFCETRKFLVT 686
A Q + SS+ + +V + D Y + + C F
Sbjct: 622 VAASAQGQAQTQSGPSSNSSQGQGTSYNISSLVSSCDRSQYAYL-----DLCPFESF--N 674
Query: 687 IGVKNHG-EMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVI-LNAKEKAEIVFELSPCES 744
+ V N G ++A L F+ + PIKQLV +Q + ++A A L+ S
Sbjct: 675 VNVTNTGSKLASDFVALGFISGSYGPQPYPIKQLVAYQRLFNISAGASATATLNLT-LGS 733
Query: 745 LSRAREDGLMVIEEGTHFLVV 765
L+R E+G V+ G + L++
Sbjct: 734 LARHDENGNAVLYPGDYGLLI 754
>gi|222618262|gb|EEE54394.1| hypothetical protein OsJ_01415 [Oryza sativa Japonica Group]
Length = 776
Score = 459 bits (1180), Expect = e-126, Method: Compositional matrix adjust.
Identities = 289/800 (36%), Positives = 414/800 (51%), Gaps = 134/800 (16%)
Query: 33 CDPSNPSTET-----FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAY 87
CDP+ + FP+C +LP + R RDLV R+TL+EK++ L + A PR+G+P Y
Sbjct: 47 CDPARFAAAGLDMAGFPYCDASLPYADRVRDLVGRMTLEEKVANLGDRAGGAPRVGLPRY 106
Query: 88 EWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEAR---- 143
G G R T+ P T+A D +W R + L AR
Sbjct: 107 ------------CGGG-------RRCTACP----TSARRD-VVWRRRARRHQLPARHQQR 142
Query: 144 --------------------ALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVT 183
+YN G A +T+W+PNIN+ RDPRWGR ETPGEDP V
Sbjct: 143 RVVQRDAVARHRRRGVDGDQGMYNLGHA-ELTYWSPNINVVRDPRWGRASETPGEDPFVV 201
Query: 184 GKYAVSYVRGVQG---------DTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFD 234
G+YAV++VRG+Q ++ S+CCKH+ A
Sbjct: 202 GRYAVNFVRGMQDIDGATTAASAAAATDAFSRPIKVSSCCKHYAA--------------- 246
Query: 235 ARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGY 294
CV MC+YNR+NG+P+CAD LL++T RR W HGY
Sbjct: 247 ------------------CV--------MCSYNRINGVPACADARLLTETVRRDWQLHGY 280
Query: 295 ITSDCDAVSI-IYDAEGYAKSPEDAVVDVLKAGMDVNCG-------SFLQKHTKAAVKQK 346
I SDCD+V + + DA+ + +A +KAG+D++CG F + AV+Q
Sbjct: 281 IVSDCDSVRVMVRDAKWLGYTGVEATAAAMKAGLDLDCGMFWEGVHDFFTTYGVDAVRQG 340
Query: 347 KLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVL 406
KL ES +D AL NL+ MRLG F+G P ++ +GA VC+ H+ LA AA+ G+VL
Sbjct: 341 KLKESAVDNALTNLYLTLMRLGFFDGIPELE---SLGAADVCTEEHKELAADAARQGMVL 397
Query: 407 LKNSHGLLPLPKSKSVSLALIGP--NANSAKTLLGNYAGPSCRSITPLQALQNYVENTVY 464
LKN LLPL K S+AL G + N+ +LG+Y G CR +TP ++ V +T
Sbjct: 398 LKNDAALLPLSPEKVNSVALFGQLQHINATDVMLGDYRGKPCRVVTPYDGVRKVVSSTSV 457
Query: 465 YPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRV 524
+ AC S D A AK D +++ GL+ + E+E DR DL+LP Q I V
Sbjct: 458 H------ACDKGSCDTAAAAAKTVDATIVVAGLNMSVERESNDREDLLLPWSQASWINAV 511
Query: 525 AEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRL 584
AEA+ P++LV++ G VD++FA+ + IG+++WAGYPGE G A+A+V+FG +NPGGRL
Sbjct: 512 AEASPSPIVLVIMSAGGVDVSFAQDNPKIGAVVWAGYPGEEGGTAIADVLFGKYNPGGRL 571
Query: 585 PMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEV-FPFGCGLSYSKYSYKFKA 642
P+TWY +Y+ K+PMT M +RP A G PGRTY+FY G +V +PFG GLSY+ ++Y A
Sbjct: 572 PLTWYKNEYVSKIPMTSMALRPDAEHGYPGRTYKFYGGADVLYPFGHGLSYTNFTYA-SA 630
Query: 643 VSQNKLYLNQSS---STKMVENQDVVHYKSVP--ELGTEFC-ETRKFLVTIGVKNHGEMA 696
+ + + + ++ V + P + + C E F VT V N G
Sbjct: 631 TAAAPVTVKVGAWEYCKQLTYKAGVSSPPACPAVNVASHACQEEVSFAVT--VANTGGRD 688
Query: 697 GKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVI 756
G H V ++ P +G P KQLV F+ V + A E+ F L+ C++ + E V+
Sbjct: 689 GTHVVPMYTAPPAEVDGAPRKQLVAFRRVRVAAGAAVEVAFALNVCKAFAIVEETAYTVV 748
Query: 757 EEGTHFLVVGDEEYPISIFV 776
G ++VGD+ +S V
Sbjct: 749 PSGVSRVLVGDDALSLSFPV 768
>gi|333381510|ref|ZP_08473192.1| hypothetical protein HMPREF9455_01358 [Dysgonomonas gadei ATCC
BAA-286]
gi|332830480|gb|EGK03108.1| hypothetical protein HMPREF9455_01358 [Dysgonomonas gadei ATCC
BAA-286]
Length = 738
Score = 456 bits (1173), Expect = e-125, Method: Compositional matrix adjust.
Identities = 275/748 (36%), Positives = 403/748 (53%), Gaps = 86/748 (11%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
+PF T L +R DLVSRLTL+EK+ Q++N+ PAI RL IPAY WW+E LHG+
Sbjct: 24 YPFRDTKLSTDKRVSDLVSRLTLEEKVLQMLNNTPAIERLNIPAYNWWNECLHGIGR--- 80
Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------MT 155
T T FPQ I AA++D+ L + AI E RA+YN A G +T
Sbjct: 81 ------TEYKVTVFPQAIGMAAAWDARLLKDVANAISDEGRAIYNDASAKGNYSIYHGLT 134
Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCK 215
+W PN+NIFRDPRWGRGQET GEDP +TG S+V G+QGD L+A+AC K
Sbjct: 135 YWTPNVNIFRDPRWGRGQETYGEDPYLTGALGKSFVAGLQGDD------SQYLKAAACAK 188
Query: 216 HFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSC 275
H+ + + TR+ F+ VT DL DTY P F V + +G+MCAYN +G P C
Sbjct: 189 HYAVH---SGPENTRHTFNTFVTTFDLWDTYLPAFRDLVVDAKVAGVMCAYNAFSGEPCC 245
Query: 276 ADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFL 335
+ L+ + R +WGF GY+TSDC A+ Y + A D + +G D++CG+
Sbjct: 246 GNNLLMQEILRDKWGFTGYVTSDCGAIDDFYRHHKTHPDAKYAAADAVYSGTDIDCGNEA 305
Query: 336 QKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVL 395
K AVK + E +ID +L LF +R RLG+F+ ++ F KI V+ S H+ L
Sbjct: 306 YKALVDAVKTGLITEEQIDISLKRLFEIRFRLGMFDPAEDVK-FSKIPLSVLESQPHKDL 364
Query: 396 ALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL 455
AL+ ++ IVLLKN + LPL K K +A+IGPNA++ ++LGNY G + ITP +A+
Sbjct: 365 ALKITRESIVLLKNENNFLPLSK-KLKKVAVIGPNADNEVSVLGNYNGFPTQIITPYKAI 423
Query: 456 QNYVENT--VYYPGCDTVACSSASIDKAVDIA---KGADHVVLMMGLDQTQEKEEL---- 506
+N ++NT +Y G D V S S ++ +A KG D V+ G+ E EE+
Sbjct: 424 KNKLKNTEVIYEKGIDFVKPSENSKEEIAALAKRLKGMDVVIFAGGISPELEGEEMPVKI 483
Query: 507 ------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAG 560
DR + LP Q EL+ + +A + P + V++ G + + +N+ +IL A
Sbjct: 484 EGFTGGDRTSIKLPKIQTELM-QALKAERIPTVFVMMTGSAIAAEWES--QNVPAILNAW 540
Query: 561 YPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYE 620
Y G+ A+A+V+FGD+NP G+LP+T+Y +D +M+ RTYR+++
Sbjct: 541 YGGQDAGTAIADVLFGDYNPSGKLPVTFYTKDSDLPAFNSYEMK--------NRTYRYFD 592
Query: 621 GKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCET 680
G+ ++PFG GLSY+K+ Y + +S K EN +
Sbjct: 593 GQVLYPFGYGLSYTKFEYS---------PIQMPASIKAGENME----------------- 626
Query: 681 RKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGR--PIKQLVGFQSVILNAKEKAEIVFE 738
V+I VKN G+ G+ V L++ G R P+ L F+ + L A E + F+
Sbjct: 627 ----VSITVKNTGKTDGEEVVQLYISHDNNGTNRQLPLYALKSFERISLKAGESKSVTFK 682
Query: 739 LSPCESLSRAREDGLMVIEEGTHFLVVG 766
LSP E ++ A EDG++ + +G L +G
Sbjct: 683 LSPRE-MALADEDGVLKMTKGKSKLYIG 709
>gi|343428088|emb|CBQ71612.1| related to Beta-xylosidase [Sporisorium reilianum SRZ2]
Length = 698
Score = 456 bits (1172), Expect = e-125, Method: Compositional matrix adjust.
Identities = 267/614 (43%), Positives = 351/614 (57%), Gaps = 31/614 (5%)
Query: 41 ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
T P C T+L RA LV++ T E I+ VN AP +PRLGIP Y+WW+EALHGVA
Sbjct: 30 STLPVCDTSLDFYTRATSLVAQFTTAELINNTVNHAPGVPRLGIPQYQWWTEALHGVA-R 88
Query: 101 GKGIFFNGTIRG----ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTF 156
G+ FN G ATSFPQVI A+FD L+ + I E RA NAG+A G+
Sbjct: 89 SPGVNFNPDAAGEFGCATSFPQVINLGATFDDALYEAVAAHIANETRAFSNAGRA-GLNM 147
Query: 157 WAP-NINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCK 215
++P NIN FRDPRWGRGQET GEDPL +YAV VRG+QG + +L +A CK
Sbjct: 148 YSPLNINAFRDPRWGRGQETVGEDPLHLSRYAVRVVRGLQGPAAQ-DEANPRLTLAATCK 206
Query: 216 HFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSC 275
H+ AYDL+ G RY+FDA V+ QDLAD + P F +CV+ G A+ +M +YN VNG+P
Sbjct: 207 HYLAYDLEASAGVERYQFDALVSNQDLADLHLPQFRACVRDGGATTLMTSYNAVNGVPPS 266
Query: 276 ADRNLLSKTARRQWGF---HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG 332
A + L AR WG H Y+TSDCDAV+ +YDA YA A L AG D++CG
Sbjct: 267 ASKYYLETLARDTWGLDKHHNYVTSDCDAVANVYDAHHYAADYVHAAAASLNAGTDLDCG 326
Query: 333 SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAH 392
+ + AA+ Q + I RA+ ++ +RLG F+ QP ++G V +PA
Sbjct: 327 ATYRDSLAAALAQNLTDVATIRRAVTRMYGSLVRLGYFDAA-EAQPLRQLGWKDVNAPAA 385
Query: 393 QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
Q LA +AA I LLKN LPL ++ ++ALIGP N+ L GNYAGPS ITP
Sbjct: 386 QKLAYEAAAASITLLKNRQSTLPLRETAGKTIALIGPYTNATFALRGNYAGPSPLVITPF 445
Query: 453 QALQNYVENTVYYPGCDTVACSSASIDKAVD---------IAKGADHVVLMMGLDQTQEK 503
A + + V+ + SI D AK AD +V G+D T E
Sbjct: 446 DAARRTFSD------AHIVSANGTSIAGPYDTATASAALATAKSADIIVYAGGIDPTVEG 499
Query: 504 EELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPG 563
E LDR D+ P Q LI +A A K +++V GG VD K D +G+++WAGYPG
Sbjct: 500 ESLDRRDIAWPANQLRLIQELA-ALGKVLVVVQFGGGQVDGALLKGDDGVGALVWAGYPG 558
Query: 564 EAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK-VPMTDMKMRPQATSGNPGRTYRFYEGK 622
++GA+AL +++ G P GRLP+T YP +Y + T M +RP AT PGRTY++Y G
Sbjct: 559 QSGALALMDILAGKRAPAGRLPITQYPANYTHALRETTMALRPTATY--PGRTYKWYTGT 616
Query: 623 EVFPFGCGLSYSKY 636
FPFG GL Y+ +
Sbjct: 617 PTFPFGFGLHYTTF 630
>gi|238492365|ref|XP_002377419.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
gi|220695913|gb|EED52255.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
Length = 775
Score = 455 bits (1170), Expect = e-125, Method: Compositional matrix adjust.
Identities = 285/739 (38%), Positives = 411/739 (55%), Gaps = 45/739 (6%)
Query: 42 TFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVG 101
+ P C T+L I++R LV LTL+EKI LV+++ RLG+P+YEWWSEA HGV G
Sbjct: 29 SHPVCDTSLSIAERVDSLVKSLTLEEKILNLVDASAGSTRLGLPSYEWWSEATHGV-GSA 87
Query: 102 KGIFFN---GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWA 158
G+ F ATSFP ILTAASFD L +I + IG E R N G + G FWA
Sbjct: 88 PGVQFTSKPANFSYATSFPAPILTAASFDDTLIRKIAEVIGREGRVFGNNGFS-GFDFWA 146
Query: 159 PNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFT 218
PNIN FRDPRWGRGQETPGEDPLV Y ++V G+QGD K K Q A CKH+
Sbjct: 147 PNINGFRDPRWGRGQETPGEDPLVAQNYIRNFVPGLQGD-----DPKNK-QVIATCKHYA 200
Query: 219 AYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADR 278
YDL+ T RY + T QDL++ + PF++CV+ IMC+YN V+GIP+CA+
Sbjct: 201 VYDLE----TGRYGNNYNPTQQDLSEYFLAPFKTCVRDTDVGSIMCSYNSVSGIPACANE 256
Query: 279 NLLSKTARRQWGFHG---YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SF 334
LL + R+ W F+ Y+ SDC AV+ I+ + + E A L AG+D+ CG S+
Sbjct: 257 YLLDEVLRKHWNFNSDYHYVVSDCGAVTDIWQYHNFTDTEEAAASVALNAGVDLECGSSY 316
Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
L+ + A Q + +D++L L+S +G F+G + K+ V +P Q
Sbjct: 317 LKLNESLAANQTSV--KVMDQSLARLYSALFTVGFFDGG----KYDKLDFSDVSTPDAQA 370
Query: 395 LALQAAQDGIVLLKNSHGLLPLPK-SKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQ 453
LA +AA +G+ LLKN LLPL K S+A+IGP AN+ + G+Y+G + I+PL+
Sbjct: 371 LAYEAAVEGMTLLKNDD-LLPLDSPHKYKSVAVIGPFANATTQMQGDYSGDAPYLISPLE 429
Query: 454 ALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVL 513
A + Y G +++ ++A+ A +D ++ + G+D + E E LDR L
Sbjct: 430 AFGDSRWKVNYALGTAINNQNTSGFEEALAAANKSDLIIYLGGIDNSLESETLDRTSLAW 489
Query: 514 PGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEV 573
PG Q +LIT +++ + KP+++V GG VD + +++I +++WAGYP ++G AL +V
Sbjct: 490 PGNQLDLITSLSKLS-KPLVVVQFGGGQVDDSAILKNKDIQALVWAGYPSQSGGTALLDV 548
Query: 574 IFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLS 632
+ G +P GRLP+T YP Y +V + D+ +RP T PGRTY++Y GK V PFG GL
Sbjct: 549 LVGKRSPAGRLPVTQYPASYADQVNIFDINLRP--TDLYPGRTYKWYTGKPVLPFGYGLH 606
Query: 633 YSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNH 692
Y+K+ + ++ LN+ + QD+V G T V VKN
Sbjct: 607 YTKFMFDWEKT------LNREYNI-----QDLVASCRNSSGGPINDNTPLTTVKARVKNV 655
Query: 693 GEMAGKHPVLLFVKPARRGNG-RPIKQLVGFQSVILNAKEKAEIVFELS-PCESLSRARE 750
G + LLF+ G RP K LV + +LN ++ V EL SL+RA E
Sbjct: 656 GHKTSDYVSLLFLSSKNAGPAPRPNKSLVSYVR-LLNIARGSDQVAELPLTLGSLARADE 714
Query: 751 DGLMVIEEGTHFLVVGDEE 769
+G +VI G + + + + E
Sbjct: 715 NGSLVIFPGRYKIALDNSE 733
>gi|189203341|ref|XP_001938006.1| beta-xylosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
gi|187985105|gb|EDU50593.1| beta-xylosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
Length = 761
Score = 454 bits (1169), Expect = e-125, Method: Compositional matrix adjust.
Identities = 274/729 (37%), Positives = 397/729 (54%), Gaps = 41/729 (5%)
Query: 48 TTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFN 107
T+ P RA+ LV+ TL+EKI+ + AP +PRLG+P Y+WWSE LHG+AG +
Sbjct: 3 TSRPPLARAQSLVALYTLEEKINATSSGAPGVPRLGVPPYQWWSEGLHGIAGPYTNFSDS 62
Query: 108 GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDP 167
G +TSFPQ IL A+FD L + + I EARA NA + G+ FW PNIN FRDP
Sbjct: 63 GEWSYSTSFPQPILMGAAFDDDLITDVAKVISTEARAFNNANR-TGLDFWTPNINPFRDP 121
Query: 168 RWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKG 227
RWGRGQETPGED Y + + G+QG++ + K + A CKHF YD+++W G
Sbjct: 122 RWGRGQETPGEDAYHLSSYVQALIHGLQGESTDPYK-----RVVATCKHFAGYDVEDWNG 176
Query: 228 TTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARR 287
RY+ D ++T Q+L + Y PF++CV Q MC+YN VNG P CAD LL R
Sbjct: 177 NLRYQNDVQITQQELVEYYLAPFQACV-QANVGAFMCSYNAVNGAPPCADPYLLQTILRE 235
Query: 288 QWGFHG---YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVK 344
WG+ ++T DCDAV +Y ++ + A D L AG DV CG+++Q+H AA +
Sbjct: 236 HWGWTNEEQWVTGDCDAVQNVYLPHQWSPTRAGAAADSLVAGTDVTCGTYMQEHLPAAFQ 295
Query: 345 QKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGI 404
QK L ES +D+AL +S +RLG F+ + QP+ ++G D V + A Q LA +AA +GI
Sbjct: 296 QKLLNESSLDQALIRQYSSLVRLGYFDASEN-QPYRQLGFDAVATNASQALARRAAAEGI 354
Query: 405 VLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVY 464
VLLKN G LPL SV++ L G AN+ LLGNYAG + +PL AL+ +
Sbjct: 355 VLLKND-GTLPLSLDSSVTVGLFGDWANATSQLLGNYAGVATYLHSPLYALEQ-TGVKIN 412
Query: 465 YPGCDTVACSSASIDKAVDIAKG---ADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELI 521
Y G + + ++ ++ +D ++ + G+D + E+E DR L G Q ++I
Sbjct: 413 YAGGNPGGQGDPTTNRWSNLYGAYSTSDVLIYVGGIDNSVEEEGRDRGYLTWTGAQLDVI 472
Query: 522 TRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
++A+ KPVI+V+ GG +D + + NI +I+WAGYPG+ G A+ ++I G P
Sbjct: 473 GQLADTG-KPVIVVVTGGGQIDSSPLVNNPNISAIMWAGYPGQDGGSAIIDIIGGKTAPA 531
Query: 582 GRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKF 640
GRLP T YP +Y V M +M +RP S PGRTY++Y G F FG G+ Y+ +S +
Sbjct: 532 GRLPQTQYPANYTAAVSMMNMNLRPGENS--PGRTYKWYNGSATFEFGYGMHYTNFSAEI 589
Query: 641 KAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFL-VTIGVKNHGEMAGKH 699
+T+M ++ + S F E F V + V N G + +
Sbjct: 590 --------------TTQMQQSYAISSLASGCNSTGGFLERCPFASVNVQVHNTGNVTSDY 635
Query: 700 PVLLFVKPARRGNGRPIKQLVGFQ---SVILNAKEKAEIVFELSPCESLSRAREDGLMVI 756
L ++ P K LV ++ S+ A A + L+ SL+R E G V+
Sbjct: 636 ITLGYMAGTFGPAPHPRKTLVSYKRLHSIAGGATSTATLNLTLA---SLARVDEHGNKVL 692
Query: 757 EEGTHFLVV 765
G + L +
Sbjct: 693 YPGDYSLQI 701
>gi|451849522|gb|EMD62825.1| glycoside hydrolase family 3 protein [Cochliobolus sativus ND90Pr]
Length = 849
Score = 454 bits (1167), Expect = e-124, Method: Compositional matrix adjust.
Identities = 271/720 (37%), Positives = 393/720 (54%), Gaps = 38/720 (5%)
Query: 55 RARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGAT 114
RA+ LV+ TL+EKI+ NSAP + RLGIP Y+WW+E LHG+AG G +T
Sbjct: 114 RAKSLVALYTLEEKINATSNSAPGVARLGIPPYQWWNEGLHGIAGPFTSFAKQGDYSYST 173
Query: 115 SFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQE 174
SFPQ IL A+FD L + I EARA N + G+ FW PNIN FRDPRWGRGQE
Sbjct: 174 SFPQPILMGAAFDDNLITEVANVISTEARAFNNVNRT-GLDFWTPNINPFRDPRWGRGQE 232
Query: 175 TPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFD 234
TPGED Y + + G+QG+ + + + A CKH+ YD++NW G RY+ D
Sbjct: 233 TPGEDSYHLSSYVKALIHGLQGNETDPYR-----RVVATCKHYAGYDIENWNGNLRYQND 287
Query: 235 ARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHG- 293
+++ QDL + Y PFE+CV Q MC+YN VNG P CAD +L R WG+
Sbjct: 288 VQISQQDLVEYYLAPFEACV-QANVGAFMCSYNAVNGAPPCADPYMLQTVLREHWGWSSD 346
Query: 294 --YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPES 351
++TSDCD++ +Y ++ + E A D L AG D++CG++LQ H AVKQ E+
Sbjct: 347 EHWVTSDCDSIQNVYLPHQWSSTREGAAADSLNAGTDLDCGTYLQSHLPGAVKQGLTNET 406
Query: 352 EIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSH 411
+D AL +S ++LG F+ P QP+ ++G D V + A Q LAL+AA++GIVLLKN
Sbjct: 407 TLDNALIRQYSSLIKLGYFD-IPENQPYRQLGFDAVATSASQALALKAAEEGIVLLKND- 464
Query: 412 GLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVY---YPGC 468
G+LP+ S ++ + G AN+ L GNY G + +P AL+ N Y PG
Sbjct: 465 GVLPI-NFGSKNVGIYGDWANATSQLQGNYFGVAKFLTSPYMALEKLGVNVRYAGNLPGG 523
Query: 469 DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAA 528
++ S + + +D + + G+D E E+ DR L L G Q ++I ++A+
Sbjct: 524 QG-DPTTGSWPRLSGVITTSDVHIWVGGMDNGIESEDRDRSWLTLTGSQLDVIGQLADTG 582
Query: 529 KKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTW 588
KPVI++++ GG +D + + I ++LWAGYPG+ G A+ ++ G P GRLP T
Sbjct: 583 -KPVIVIIMGGGQIDTSPLIKNPKISAVLWAGYPGQDGGTAIVNILTGKAAPAGRLPQTQ 641
Query: 589 YPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNK 647
Y Y+ +VPMTDM MRP ++ NPGRTY++Y GK +F FG GL Y+ +S
Sbjct: 642 YLYKYVSEVPMTDMAMRP--SNKNPGRTYKWYTGKPIFEFGYGLHYTNFSASIT------ 693
Query: 648 LYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFL-VTIGVKNHGEMAGKHPVLLFVK 706
NQ + + D+V K G F E F + + V+N G+ + + L F+
Sbjct: 694 ---NQPKQSYAIS--DLV--KGCNSTGG-FLERCPFTGINVSVQNTGKTSSDYVTLGFLT 745
Query: 707 PARRGNGRPIKQLVGFQSVI-LNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVV 765
+ P K LV + + + A + L+ SL+R E G V+ G + L +
Sbjct: 746 GSFGPKPYPKKSLVAYDRLFNIAASSSSTATLNLT-LASLARVDESGNKVLYPGDYELQI 804
>gi|426198365|gb|EKV48291.1| hypothetical protein AGABI2DRAFT_219902 [Agaricus bisporus var.
bisporus H97]
Length = 767
Score = 453 bits (1165), Expect = e-124, Method: Compositional matrix adjust.
Identities = 271/731 (37%), Positives = 407/731 (55%), Gaps = 37/731 (5%)
Query: 46 CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
C T + RA+ L+ T +E + N +P +PRLG+P+Y+WWSEALHGVAG G+
Sbjct: 38 CDPTKAPAARAKTLIQMFTDEELMQNTDNVSPGVPRLGVPSYQWWSEALHGVAG-SPGVS 96
Query: 106 F--NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
F +G ATSFPQ I+ ++FD L + I EARA N +A G+ ++ PNIN
Sbjct: 97 FAPSGEFSSATSFPQSIVLGSTFDIDLVKAVATVISTEARAFNNFHRA-GLDYFTPNINP 155
Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
F+DPRWGRGQETPGEDP +Y S + G+QG G + + +A CKH+ AYDLD
Sbjct: 156 FKDPRWGRGQETPGEDPFHVSQYVYSLIDGLQG----GIDPRPYFKVAADCKHYAAYDLD 211
Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
+W+G R+ FDA+V++QDL++ Y P F+SCV+ + + +MC+YN VNGIP+CA+ LL
Sbjct: 212 SWEGIDRFHFDAKVSLQDLSEYYLPSFQSCVRDAKVASVMCSYNSVNGIPACANPYLLQD 271
Query: 284 TARRQWGFHG--YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA 341
R WGF ++TSDCDA+ I+ + + +AV D LKAG DV+CG+ H
Sbjct: 272 ILRDFWGFDDDRWVTSDCDAIGNIFTTHNFTDTFAEAVADALKAGTDVDCGTSYSTHLPD 331
Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQ 401
A+ Q + +++RAL ++ MRLG F+ P QP ++ V P Q LA AA
Sbjct: 332 ALNQSLITRDDLERALTRQYTSLMRLGYFD-PPESQPLRQLAWSDVNKPDAQALAHTAAV 390
Query: 402 DGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQ-ALQNYVE 460
+G+VLLKN G LP+ S ++A+IGP AN+ K + GNY G + +TP Q A+
Sbjct: 391 EGLVLLKND-GFLPVSASGK-TIAIIGPYANATKDMQGNYFGTAPFIVTPFQGAVDAGFN 448
Query: 461 NTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQEL 520
V G S A A+ +A +D ++ G++ + E E DR+ + G Q L
Sbjct: 449 EVVSAAGTSINGTSEADFAAAIAVANSSDIIIFAGGINNSIESEAKDRLTIAWTGNQLSL 508
Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
+ ++A KPV++V GG +D + + + +++WAGYPG++G A+ +VI G P
Sbjct: 509 VKQLASLG-KPVVVVQFGGGQLDDSDLLDNDAVRAVIWAGYPGQSGGTAIFDVITGAVAP 567
Query: 581 GGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYK 639
GRL +T YP+D++ +V MTDM +RP S NPGRTY++Y G+ V FG GL ++ + +
Sbjct: 568 AGRLSVTQYPEDFVNQVGMTDMALRPG--SANPGRTYKWYTGRPVLEFGHGLHFTTFDFS 625
Query: 640 FKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKH 699
++ K + ++ D K P+L F V I +N G + +
Sbjct: 626 WRGRPGRKYNIQH-----LLHTAD----KKFPDL----IPLDTFHVNI--RNTGNITSDY 670
Query: 700 PVLLFVKPARRGNGRPIKQLVGF-QSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEE 758
LLF++ P K LV F ++ ++A A + ++ S++R E G +
Sbjct: 671 VALLFLRSNAGFAPHPKKSLVSFARAHRIDAGSSATVDLGVN-LGSIARVDEHGDSWLFA 729
Query: 759 GTHFLV--VGD 767
G + LV +GD
Sbjct: 730 GDYQLVLDIGD 740
>gi|67902828|ref|XP_681670.1| hypothetical protein AN8401.2 [Aspergillus nidulans FGSC A4]
gi|74592887|sp|Q5ATH9.1|BXLB_EMENI RecName: Full=Exo-1,4-beta-xylosidase bxlB; AltName:
Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
Flags: Precursor
gi|40747867|gb|EAA67023.1| hypothetical protein AN8401.2 [Aspergillus nidulans FGSC A4]
gi|259484335|tpe|CBF80465.1| TPA: beta-1,4-xylosidase (Eurofung) [Aspergillus nidulans FGSC A4]
Length = 763
Score = 452 bits (1164), Expect = e-124, Method: Compositional matrix adjust.
Identities = 294/778 (37%), Positives = 418/778 (53%), Gaps = 57/778 (7%)
Query: 7 SLVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLD 66
SL P LC D T P S P C T+L +RA+ LVS LTL+
Sbjct: 14 SLFIPALCQS-----NYPDCTTGPLS---------ELPICDTSLSPLERAKSLVSALTLE 59
Query: 67 EKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFF--NGTIRGATSFPQVILTAA 124
EKI+ + A RLG+PAY WW+EALHGVA G+ F +G ATSFP I+ A
Sbjct: 60 EKINNTGHEAAGSSRLGLPAYNWWNEALHGVA-EKHGVSFEESGDFSYATSFPAPIVLGA 118
Query: 125 SFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTG 184
+F+ L R+ + I EARA N+ A G+ +W PN+N F+DPRWGRGQETPGEDPL
Sbjct: 119 AFNDALIRRVAEIISTEARAFSNSDHA-GIDYWTPNVNPFKDPRWGRGQETPGEDPLHCS 177
Query: 185 KYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLAD 244
+Y +V G+QGD K + A CKH AYDL+ W G +R++FDA+V+ DL +
Sbjct: 178 RYVKEFVGGLQGDD------PEKPKVVATCKHLAAYDLEEWGGVSRFEFDAKVSAVDLLE 231
Query: 245 TYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHG---YITSDCDA 301
Y PPF++C MC+YN +NG+P+CADR LL R WG+ G ++T DC A
Sbjct: 232 YYLPPFKTCAVDASVGAFMCSYNALNGVPACADRYLLQTVLREHWGWEGPGHWVTGDCGA 291
Query: 302 VSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLF 361
V I Y +S +A L AG+D++CG++L + A +Q + +D AL L+
Sbjct: 292 VERIQTYHHYVESGPEAAAAALNAGVDLDCGTWLPSYLGEAERQGLISNETLDAALTRLY 351
Query: 362 SVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKS 421
+ ++LG F+ QP +G D V + + LA A G VLLKN LPL +
Sbjct: 352 TSLVQLGYFD-PAEGQPLRSLGWDDVATSEAEELAKTVAIQGTVLLKNIDWTLPLKANG- 409
Query: 422 VSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKA 481
+LALIGP N L NYAGP+ T ++A + N + PG + + S+ D A
Sbjct: 410 -TLALIGPFINFTTELQSNYAGPAKHIPTMIEAAERLGYNVLTAPGTEVNSTSTDGFDDA 468
Query: 482 VDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGP 541
+ IA AD ++ G+D T E+E LDR + PG Q+ELI +AE +P+ +V GG
Sbjct: 469 LAIAAEADALIFFGGIDNTVEEESLDRTRIDWPGNQEELILELAELG-RPLTVVQFGGGQ 527
Query: 542 VDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTD 600
VD + +G+I+WAGYP +AG + +V+ G P GRLP+T YP+ Y+ +VPMTD
Sbjct: 528 VDDSALLASAGVGAIVWAGYPSQAGGAGVFDVLTGKAAPAGRLPITQYPKSYVDEVPMTD 587
Query: 601 MKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVE 660
M ++P + NPGRTYR+YE V PFG GL Y+ + N S + K
Sbjct: 588 MNLQPG--TDNPGRTYRWYE-DAVLPFGFGLHYTTF--------------NVSWAKKAFG 630
Query: 661 NQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVK-PARRGNGRPIKQL 719
D + +T ++ V N G++A + L+F P PIK L
Sbjct: 631 PYDAATLARGKNPSSNIVDT----FSLAVTNTGDVASDYVALVFASAPELGAQPAPIKTL 686
Query: 720 VGF-QSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG-DEEYPISIF 775
VG+ ++ ++ E ++ E++ L+RA EDG +V+ G + L+V ++EYP + F
Sbjct: 687 VGYSRASLIKPGETRKVDVEVT-VAPLTRATEDGRVVLYPGEYTLLVDVNDEYPTAKF 743
>gi|347531439|ref|YP_004838202.1| beta-glucosidase [Roseburia hominis A2-183]
gi|345501587|gb|AEN96270.1| beta-glucosidase [Roseburia hominis A2-183]
Length = 716
Score = 452 bits (1163), Expect = e-124, Method: Compositional matrix adjust.
Identities = 284/748 (37%), Positives = 401/748 (53%), Gaps = 89/748 (11%)
Query: 49 TLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNG 108
+L + A+ LV ++TL+EKISQ+ +PAI RL IPAY WW+EALHGVA G
Sbjct: 2 SLETKEYAKRLVEQMTLEEKISQMRYESPAIERLHIPAYNWWNEALHGVARSGV------ 55
Query: 109 TIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTFWAPNI 161
AT FPQ I AA+FD L +IG + E RA + A G G+TFWAPNI
Sbjct: 56 ----ATMFPQAIALAATFDEELIEKIGDVVSTEGRAKFEAYSGRGDRGIYKGLTFWAPNI 111
Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
NIFRDPRWGRG ET GEDP +T K +Y+RG+QG K L+A+AC KHF +
Sbjct: 112 NIFRDPRWGRGHETYGEDPCLTAKLGCAYIRGIQG------KDPDHLKAAACAKHFAVH- 164
Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
+ R++FDA+V++ DL DTY F+ CVK +M AYNRVNG P+C + LL
Sbjct: 165 --SGPEALRHEFDAKVSLHDLYDTYLYAFKRCVKDAGVEAVMGAYNRVNGEPACGSKTLL 222
Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTK 340
R Q+GF G++ SDC A+ ++ K+ E++ + G D+NCG +FL +
Sbjct: 223 QDILREQFGFEGHVVSDCWAILDFHEHHHVTKTVEESAAMAVNHGCDLNCGKAFL--YLS 280
Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
A +Q + E I A+ L VR+RLG+ P+ P+ I DVV P H L+L+A+
Sbjct: 281 RACEQGLVEEKTITEAVERLMDVRIRLGMMEDYPS--PYANIPYDVVECPEHIALSLEAS 338
Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV- 459
+ +VLLKN + LPL + + ++A+IGPNANS L+GNY G S R ITPL+ +Q Y
Sbjct: 339 KRSMVLLKNDNHFLPLKQEQVHTIAVIGPNANSRAALVGNYEGTSSRYITPLEGIQEYTG 398
Query: 460 ENT--VYYPGC----DTVACSSASIDK---AVDIAKGADHVVLMMGLDQTQEKEE----- 505
E T +Y GC D V D+ A+ A+ AD +V+ +GLD E EE
Sbjct: 399 EKTRVLYAQGCHLYKDQVEFLGEPKDRFKEALIAAERADVIVMCLGLDAGIEGEEGDAGN 458
Query: 506 ----LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGY 561
D++ L LPG QQEL+ VA A KP++L +L G +D+++A+ I +IL Y
Sbjct: 459 EYASGDKLGLKLPGLQQELLEAVA-AVGKPIVLTVLAGSALDLSWAQEHAQIRAILDCWY 517
Query: 562 PGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEG 621
PG G A+AE +FG+ +P G+LP+T+Y TD M GRTYR+ +
Sbjct: 518 PGARGGKAIAEALFGEFSPCGKLPVTFYEGTEFLPDFTDYSM--------AGRTYRYTDR 569
Query: 622 KEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETR 681
++PFG GL+YS+ Y + DV T+F
Sbjct: 570 HVLYPFGYGLTYSQIRYSDA-------------------HADV----------TDFGILE 600
Query: 682 KFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSP 741
V + V+N G + V ++V+ + R P QL G +SV L EK E+ LSP
Sbjct: 601 PVTVHVTVENTGTYPVQEAVQVYVRFSEREAYDPGYQLKGIRSVALECGEKKEVCITLSP 660
Query: 742 CESLSRAREDGLMVIEEGTHFLVVGDEE 769
+ E+G ++ G++ + VG ++
Sbjct: 661 -RDFALISEEGKCLVHPGSYEIAVGGQQ 687
>gi|409079872|gb|EKM80233.1| hypothetical protein AGABI1DRAFT_57801 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 767
Score = 452 bits (1163), Expect = e-124, Method: Compositional matrix adjust.
Identities = 272/731 (37%), Positives = 406/731 (55%), Gaps = 37/731 (5%)
Query: 46 CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
C T + RA L+ T +E + N +P +PRLG+P+Y+WWSEALHGVAG G+
Sbjct: 38 CDPTKAPAARATTLIQMFTDEELMQNTDNVSPGVPRLGVPSYQWWSEALHGVAG-SPGVS 96
Query: 106 F--NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
F +G ATSFPQ I+ ++FD L + I EARA N +A G+ ++ PNIN
Sbjct: 97 FAPSGEFSSATSFPQSIVLGSTFDIDLVKAVATVISTEARAFNNFHRA-GLDYFTPNINP 155
Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
F+DPRWGRGQETPGEDP +Y S + G+QG G + + +A CKH+ AYDLD
Sbjct: 156 FKDPRWGRGQETPGEDPFHVSQYVYSLIDGLQG----GIDPRPYFKVAADCKHYAAYDLD 211
Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
+W+G R+ FDA+V++QDL++ Y P F+SCV+ + + +MC+YN VNGIP+CA+ LL
Sbjct: 212 SWEGIDRFHFDAKVSLQDLSEYYLPSFQSCVRDAKVASVMCSYNSVNGIPACANPYLLQD 271
Query: 284 TARRQWGFHG--YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA 341
R WGF ++TSDCDA+ I+ + + +AV D LKAG DV+CG+ H
Sbjct: 272 ILRDFWGFDDDRWVTSDCDAIGNIFTTHNFTDTFAEAVADALKAGTDVDCGTSYSTHLPD 331
Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQ 401
A+ Q + +++RAL ++ MRLG F+ P QP ++ V P Q LA AA
Sbjct: 332 ALNQSLITRDDLERALTRQYTSLMRLGYFD-PPESQPLRQLAWSDVNKPDAQALAHTAAV 390
Query: 402 DGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQ-ALQNYVE 460
+G+VLLKN G LP+ S ++A+IGP AN+ K + GNY G + +TP Q A+
Sbjct: 391 EGLVLLKND-GFLPVSASGK-TIAIIGPYANATKDMQGNYFGTAPFIVTPFQGAVDAGFN 448
Query: 461 NTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQEL 520
V G S A A+ +A +D ++ G++ + E E DR+ + G Q L
Sbjct: 449 EVVSAAGTSINGTSEADFAAAIAVANSSDIIIFAGGINNSIESEAKDRLTIAWTGNQLSL 508
Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
+ ++A KPV++V GG +D + + + +++WAGYPG++G A+ +VI G P
Sbjct: 509 VKQLASLG-KPVVVVQFGGGQLDDSDLLDNDAVRAVIWAGYPGQSGGTAIFDVITGAVAP 567
Query: 581 GGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYK 639
GRL +T YP+D++ +V MTDM +RP S NPGRTY++Y G+ V FG GL ++ + +
Sbjct: 568 AGRLSVTQYPEDFVNQVGMTDMALRPG--SANPGRTYKWYTGRPVLEFGHGLHFTTFDFS 625
Query: 640 FKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKH 699
++ K + ++ D K P+L F V I +N G + +
Sbjct: 626 WRGRPGRKYNIQH-----LLHTAD----KKFPDL----IPLDTFHVNI--RNTGNITSDY 670
Query: 700 PVLLFVKPARRGNGRPIKQLVGF-QSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEE 758
LLF+K P K LV F ++ ++A A + ++ S++R E G +
Sbjct: 671 VALLFLKSNAGFAPHPKKSLVSFARAHRIDAGSSATVDLGVN-LGSIARVDEHGDSWLFA 729
Query: 759 GTHFLV--VGD 767
G + LV +GD
Sbjct: 730 GDYQLVLDIGD 740
>gi|334187562|ref|NP_196532.2| Glycosyl hydrolase family protein [Arabidopsis thaliana]
gi|332004052|gb|AED91435.1| Glycosyl hydrolase family protein [Arabidopsis thaliana]
Length = 526
Score = 451 bits (1160), Expect = e-124, Method: Compositional matrix adjust.
Identities = 233/485 (48%), Positives = 316/485 (65%), Gaps = 5/485 (1%)
Query: 294 YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEI 353
YI SDCD++ I+Y ++ Y K+PE+A + AG+D+NCGSFL HT+ AVK+ + E+ I
Sbjct: 45 YIVSDCDSLGILYGSQHYTKTPEEAAAKSILAGLDLNCGSFLGNHTENAVKKGLIDEAAI 104
Query: 354 DRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGL 413
++A+ N F+ MRLG F+GNP QP+G +G VC+ ++ LA++ A+ GIVLLKNS G
Sbjct: 105 NKAISNNFATLMRLGFFDGNPKNQPYGGLGPKDVCTVENRELAVETARQGIVLLKNSAGS 164
Query: 414 LPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVAC 473
LPL S +LA+IGPNAN KT++GNY G +C+ TPLQ L+ V T Y+ GC V C
Sbjct: 165 LPLSPSAIKTLAVIGPNANVTKTMIGNYEGVACKYTTPLQGLERTVLTTKYHRGCFNVTC 224
Query: 474 SSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVI 533
+ A +D A +A AD VL+MG DQT EKE LDR+DL LPG+QQEL+T+VA+AA+ PV+
Sbjct: 225 TEADLDSAKTLAASADATVLVMGADQTIEKETLDRIDLNLPGKQQELVTQVAKAARGPVV 284
Query: 534 LVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDY 593
LV++ GG DITFAK D I SI+W GYPGEAG +A+A+VIFG HNP G+LPMTWYPQ Y
Sbjct: 285 LVIMSGGGFDITFAKNDEKITSIMWVGYPGEAGGIAIADVIFGRHNPSGKLPMTWYPQSY 344
Query: 594 I-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKF-KAVSQNKLYLN 651
+ KVPMT+M MRP ++G GRTYRFY G+ V+ FG GLSY+ +S++ KA L L+
Sbjct: 345 VEKVPMTNMNMRPDKSNGYLGRTYRFYIGETVYAFGDGLSYTNFSHQLIKAPKFVSLNLD 404
Query: 652 QSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRG 711
+S S + E Q + P E F V + V+N G+ G V LF P
Sbjct: 405 ESQSCRSPECQSLDAIG--PHCEKAVGERSDFEVQLKVRNVGDREGTETVFLFTTPPEV- 461
Query: 712 NGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYP 771
+G P KQL+GF+ + L KE+ + F++ C+ L E G + G H L VG ++
Sbjct: 462 HGSPRKQLLGFEKIRLGKKEETVVRFKVDVCKDLGVVDEIGKRKLALGHHLLHVGSLKHS 521
Query: 772 ISIFV 776
+I V
Sbjct: 522 FNISV 526
>gi|392596548|gb|EIW85871.1| hypothetical protein CONPUDRAFT_80240 [Coniophora puteana
RWD-64-598 SS2]
Length = 770
Score = 451 bits (1159), Expect = e-123, Method: Compositional matrix adjust.
Identities = 274/724 (37%), Positives = 383/724 (52%), Gaps = 34/724 (4%)
Query: 46 CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAG-VGKGI 104
C T+L +QRA L+ T+DE I VN AP +PRLG+PAYEWWSE LHGVA G
Sbjct: 37 CDTSLNATQRAAALIDLFTVDELIVNTVNWAPGVPRLGLPAYEWWSEGLHGVANSAGVTW 96
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIF 164
G ATSFPQ IL +A+FD L +G IG+E RA N G A G+ FW PNIN F
Sbjct: 97 SITGPFSYATSFPQPILMSAAFDDALIKAVGGVIGMEGRAFNNYGHA-GLDFWTPNINPF 155
Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
+DPRWGRGQETPGEDP +Y + ++G+QG G + Q A CKHF YDL++
Sbjct: 156 KDPRWGRGQETPGEDPYHIAQYVYNLIQGLQG----GLDPEPYFQVVATCKHFAGYDLED 211
Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
W RY ++A ++ QDL++ Y P F+SC + A MC+YN +NGIP+CAD LL
Sbjct: 212 WDFNYRYGYNAIISTQDLSEYYLPSFQSCYRDAFAGASMCSYNAINGIPTCADTYLLQDI 271
Query: 285 ARRQWGFHG--YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAA 342
R WGF ++T DCD+V IYD Y P+ A D LKAG D++CG F A
Sbjct: 272 LRGFWGFDQTRWVTGDCDSVEDIYDFHHYTALPQQAAADALKAGSDIDCGIFYTTWLPLA 331
Query: 343 VKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQD 402
+ + E ++ AL ++ +RLG F+ + QP+ + V + Q LA AA +
Sbjct: 332 YTESLITEQDLRAALTRQYASLVRLGYFD-PASEQPYRQYNWSNVDTSYAQELAYTAAVE 390
Query: 403 GIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENT 462
GI LLKN G LP S ++ALIGP + + GNY G + I+P Q Q N
Sbjct: 391 GITLLKND-GTLPF-SSAIKNIALIGPWTFATTQMQGNYYGNAPYLISPYQGAQLAGYNI 448
Query: 463 VYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELIT 522
Y + + ++ A A+GAD +V + G+D T E E +DR D+ P Q LI
Sbjct: 449 SYVLETNVTSNTTDGYAAAFTAAQGADAIVFVGGIDNTVEAEAMDRNDITWPAFQLWLIG 508
Query: 523 RVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGG 582
+ + KP+++V GG VD T + ++ ++LW GYPG++G AL ++I G P G
Sbjct: 509 ELGKLG-KPLVVVQFGGGQVDDTEINANPDVNALLWGGYPGQSGGQALFDIISGKVAPAG 567
Query: 583 RLPMTWYPQDYI-KVPMTDMKMRPQAT-SGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKF 640
RL T YP DY+ ++PMT+M +RP A + +PGRTY++Y G V+ FG GL Y+ ++Y +
Sbjct: 568 RLVSTQYPADYVNEIPMTNMNLRPDANGTTSPGRTYKWYTGTPVYEFGYGLHYTNFTYAW 627
Query: 641 KAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHP 700
S + Q H P +T +++ V N G + +
Sbjct: 628 TKAPAATY----SIEALVAAGQGSAHIDLAP------FDT----LSVEVTNAGAVTSDYS 673
Query: 701 VLLFVKPARRGNGRPIKQLVGF---QSVILNAKEKAEIVFELSPCESLSRAREDGLMVIE 757
LLFV P K L + +V A + A FE+ ++RA G +
Sbjct: 674 ALLFVNGTYGPAPYPNKSLAAYTRLHNVTAGASQTA--TFEVV-LNQIARADVQGNFWLY 730
Query: 758 EGTH 761
G +
Sbjct: 731 PGAY 734
>gi|429850127|gb|ELA25427.1| glycoside hydrolase family 3 protein [Colletotrichum
gloeosporioides Nara gc5]
Length = 918
Score = 450 bits (1157), Expect = e-123, Method: Compositional matrix adjust.
Identities = 273/742 (36%), Positives = 410/742 (55%), Gaps = 32/742 (4%)
Query: 46 CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVA-GVGKGI 104
C +L QRA LV+ LT+ EK+ LVN AP IPRL +P YEWWSE LHGVA G
Sbjct: 170 CDESLSDKQRAAALVAELTIWEKLDNLVNEAPGIPRLRVPPYEWWSEGLHGVARSPGTKF 229
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIF 164
G ATSFPQ IL ++FD L +G+ + EARA NAG++ G+ ++PNIN F
Sbjct: 230 TSKGNFSYATSFPQPILLGSAFDDELVRAVGEVVSREARAFSNAGRS-GLDLYSPNINAF 288
Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
+DPRWGRGQETPGED KY + + G++GD + + A CKH+ A D +N
Sbjct: 289 KDPRWGRGQETPGEDTFHLQKYVSAMLSGLEGDDPDKKLI-------ATCKHYAANDFEN 341
Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
+KG R F+A ++ QDL++ Y PPF++C + MC+YN +NG P CA+ L+
Sbjct: 342 YKGVDRSGFNAVISTQDLSEYYLPPFKTCAVEKNVGSFMCSYNGINGTPLCANSYLIEDI 401
Query: 285 ARRQWGFHG---YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFL-QKHTK 340
R+ WG++G Y+++DCD V+++ YA A ++AG D+ C +F + +
Sbjct: 402 LRKHWGWNGDGQYVSTDCDCVALMVSYHHYAPDLGHAAAWSMQAGTDLECNAFPGSEALQ 461
Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
+A Q + E ++D+AL +++ + +GLF+ + P +G D V + Q LA +AA
Sbjct: 462 SAWNQSLISEKDVDKALTRMYTSLVSVGLFDLD-RKDPLRSLGWDEVNTKEAQDLAYRAA 520
Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE 460
+G VL+KN G+LPL S ALIGP ++ + GNY GP+ I+P +A ++
Sbjct: 521 VEGAVLMKND-GILPLSPDSSKKYALIGPWVSATTQMQGNYFGPAPYLISPRKAAKDLGL 579
Query: 461 NTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQEL 520
+ Y+ G T S +S +A+ A+ AD V+ M G+D T E+E LDR L P Q +L
Sbjct: 580 DFTYFLGSRTNK-SDSSFAQAIKAAQAADVVIFMGGVDNTLEQETLDRNTLAWPEPQLQL 638
Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
+ ++E KP++++ GG VD T + ++ +ILW GYPG++G A+ +++FG P
Sbjct: 639 LRALSEVG-KPLVVLQFGGGQVDDTELLANDSVNAILWGGYPGQSGGKAILDIVFGRAAP 697
Query: 581 GGRLPMTWYPQDYIK-VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYK 639
GRL +T YP Y VP TDM +RP + GRTYR+Y G+ P+G GL Y+K+S
Sbjct: 698 AGRLSVTQYPASYNDAVPATDMNLRPGPGNSGLGRTYRWYTGETPVPYGFGLHYTKFSVD 757
Query: 640 FKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKH 699
K S ++ +M + +P E R VT+ KN G + +
Sbjct: 758 MKPASN----VHNIDIAQMAAEANDDAASEIPSWQRGL-ERRMVTVTVSAKNEGNVISDY 812
Query: 700 PVLLFVKPARRGNGRPIKQLVGF---QSVILNAKEKAEIVFELSPCESLSRAREDGLMVI 756
L+F++ P K LVG+ +++ + K EI+ ++ E L R E G V+
Sbjct: 813 VALVFLRSEAGPKPWPQKTLVGYTRLRNIKPGEERKEEIIIKM---EQLVRVDEVGNRVL 869
Query: 757 EEGTH--FLVVGDE-EYPISIF 775
EG + FL V D+ + I+IF
Sbjct: 870 YEGLYSLFLDVDDKAQNVITIF 891
>gi|242216161|ref|XP_002473890.1| beta-xylosidase [Postia placenta Mad-698-R]
gi|220726990|gb|EED80923.1| beta-xylosidase [Postia placenta Mad-698-R]
Length = 741
Score = 449 bits (1155), Expect = e-123, Method: Compositional matrix adjust.
Identities = 285/741 (38%), Positives = 402/741 (54%), Gaps = 40/741 (5%)
Query: 34 DPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEA 93
D +N T C T+ +RA L+S TL+EKI+ N+AP +PRLG+PAY+WW EA
Sbjct: 22 DCANGPLTTNTVCDTSATPLERATALISLFTLEEKINNTGNTAPGVPRLGLPAYQWWQEA 81
Query: 94 LHGVAGVGKGIFF--NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA 151
LHGVA G+ F +G ATSFPQ IL A+FD L + + EARA NA ++
Sbjct: 82 LHGVA-ESPGVIFAPSGEYSYATSFPQPILMGAAFDDALINHVATIVSTEARAFNNANRS 140
Query: 152 IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQAS 211
G+ FW PNIN F+DPRWGRGQETPGEDP Y + + G+QG G +
Sbjct: 141 -GIDFWTPNINPFKDPRWGRGQETPGEDPFHLQSYVYNLITGLQG-----GLDPEYKRIV 194
Query: 212 ACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
A CKHF AYDL+NW+G RY FDA V++QDL++ Y F +C + MC+YN VNG
Sbjct: 195 ATCKHFAAYDLENWEGNVRYGFDALVSLQDLSEFYTRSFRTCARDANVGSFMCSYNAVNG 254
Query: 272 IPSCADRNLLSKTARRQWGFHG---YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
+PSCA+ LL R WG+ YITSDCDA+ IY+ Y + + V D L AG D
Sbjct: 255 VPSCANSYLLQDILRDHWGWTNEDQYITSDCDAIQNIYEPHYYTATRAETVADALNAGTD 314
Query: 329 VNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVC 388
++CG + ++ AA Q ES ++RAL ++ ++LG F+ +QP+ +IG V
Sbjct: 315 LDCGEYYPENLGAAYDQGLFTESTLNRALIRQYAALVKLGYFD-PADIQPYRQIGWANVS 373
Query: 389 SPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS 448
+P + LA AA +GI LLKN G LPL S ++ALIGP AN+ + GNY G +
Sbjct: 374 TPEAEELAYTAAVEGITLLKND-GTLPLSPSIK-TIALIGPWANATTQMQGNYYGVAPYL 431
Query: 449 ITPLQALQNYVENTVYY---PGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE 505
I+PL A + + TVYY PG D +++S A A+ AD ++ G+D T E E
Sbjct: 432 ISPLMAAEE-LGFTVYYSAGPGVDD--PTTSSFPAAFAAAEAADAIIYAGGIDITVEAEA 488
Query: 506 LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEA 565
+DR L PG Q + I +++ KP+I++ GG +D + + + +++W GYPG++
Sbjct: 489 MDRYTLDWPGVQPDFIDQLS-LLGKPLIVLQFGGGQIDDSALLPNPGVNALVWGGYPGQS 547
Query: 566 GAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEV 624
G A+ ++I G+ P GRLP+T YP DY+ +V MTDM +RP T NPGRTY +Y G +
Sbjct: 548 GGKAIMDIIVGNAAPAGRLPITQYPLDYVYQVAMTDMSLRPSPT--NPGRTYMWYTGTPI 605
Query: 625 FPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFL 684
FG GL Y+ ++ S + +T + V H P+L C +
Sbjct: 606 VEFGFGLHYTTFTASLSQPSAPSYDI----ATLVSLCSGVAH----PDL----CPFASY- 652
Query: 685 VTIGVKNHGEMAGKHPV-LLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCE 743
T V N G V LLF+ P K LV + + A ++
Sbjct: 653 -TANVTNTGSSVTSDFVSLLFLAGEHGPAPYPNKVLVAYDRLHAIAPLASQTTTLNLTLG 711
Query: 744 SLSRAREDGLMVIEEGTHFLV 764
SLSR + G ++ G + L+
Sbjct: 712 SLSRVDDYGNTILYPGEYTLI 732
>gi|330934749|ref|XP_003304687.1| hypothetical protein PTT_17336 [Pyrenophora teres f. teres 0-1]
gi|311318569|gb|EFQ87188.1| hypothetical protein PTT_17336 [Pyrenophora teres f. teres 0-1]
Length = 798
Score = 449 bits (1155), Expect = e-123, Method: Compositional matrix adjust.
Identities = 285/784 (36%), Positives = 411/784 (52%), Gaps = 59/784 (7%)
Query: 1 MIFHKLSLVFPLLCLCFTSLLTRVDSTQPPFS----CDPSNPSTETFPFCKTTLPISQRA 56
M+ + SL L + T D T P CDPS P++ RA
Sbjct: 1 MLTSRRSLPALLFATGAYAQFTFPDCTNGPLKNVTICDPS------------ASPLA-RA 47
Query: 57 RDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRG-ATS 115
+ LV+ TL+EKI+ + AP +PRLG+P Y+WW+E LHG+AG +G +TS
Sbjct: 48 KSLVALYTLEEKINATSSGAPGVPRLGVPPYQWWNEGLHGIAGPYTNFSHSGVEWSYSTS 107
Query: 116 FPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQET 175
FPQ IL A+FD L + + I EARA NA + G+ FW PNIN FRDPRWGRGQET
Sbjct: 108 FPQPILMGAAFDDDLITEVAKVISTEARAFNNANR-TGLDFWTPNINPFRDPRWGRGQET 166
Query: 176 PGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDA 235
PGED Y + + G+QG+ + K + A CKHF YD+++W G RY+ D
Sbjct: 167 PGEDAYHLSSYVQALIHGLQGEATDPYK-----RVVATCKHFAGYDVEDWNGNLRYQNDV 221
Query: 236 RVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHG-- 293
++T QDL + Y PF++CV Q MC+YN VNG P CAD LL R WG++
Sbjct: 222 QITQQDLVEYYLAPFQACV-QANVGAFMCSYNAVNGAPPCADPYLLQTILREHWGWNKEE 280
Query: 294 -YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESE 352
++T DCDAV +Y ++ + A D L AG D+ CG+++Q+H AA +QK L ES
Sbjct: 281 QWVTGDCDAVQNVYFPHQWSSTRAGAAADSLVAGTDITCGTYMQEHLPAAFRQKLLNESS 340
Query: 353 IDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHG 412
+D AL +S +RLG F+ P QP+ ++G D V + A Q LA +AA +GIVLLKN G
Sbjct: 341 LDLALIRQYSSLVRLGYFDA-PENQPYRQLGFDAVATNASQALARRAAAEGIVLLKND-G 398
Query: 413 LLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVA 472
LPL S+++ L G AN+ LLGNYAG + +PL AL+ + Y G
Sbjct: 399 TLPLSLDSSMTVGLFGDWANATTQLLGNYAGVATYLHSPLYALKQ-TGVKINYAGGKPGG 457
Query: 473 CSSASIDKAVDIAKG---ADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAK 529
+ ++ ++ +D ++ + G+D E+E DR L G Q ++I ++AE
Sbjct: 458 QGDPTTNRWSNLYGAYSTSDVLIYVGGIDNGVEEEGHDRGYLTWTGPQLDVIGQLAETG- 516
Query: 530 KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWY 589
KPVI+V+ GG +D + + NI +I+WAGYPG+ G A+ ++I G P GRLP T Y
Sbjct: 517 KPVIVVVTGGGQIDSSPLVNNPNISAIMWAGYPGQDGGSAIIDIISGKTAPAGRLPQTQY 576
Query: 590 PQDY-IKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKL 648
P Y V M +M +RP NPGRTY++Y G VF FG G+ Y+ +S
Sbjct: 577 PASYAAAVSMMNMNLRP--GENNPGRTYKWYNGSAVFEFGYGMHYTNFSAAI-------- 626
Query: 649 YLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFL-VTIGVKNHGEMAGKHPVLLFVKP 707
ST+M ++ + S F E F V + V N G++ + L ++
Sbjct: 627 ------STQMQQSYAISSLASGCNSTGGFLERCPFASVDVQVHNTGKVTSDYVTLGYMAG 680
Query: 708 ARRGNGRPIKQLVGFQ---SVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLV 764
P K LV ++ ++ A A++ L+ S++R E G V+ G + L
Sbjct: 681 TFGPAPHPRKTLVSYKRLHNIAGGATSTAKLNLTLA---SVARVDEYGNKVLYPGHYSLQ 737
Query: 765 VGDE 768
+ +
Sbjct: 738 IDNN 741
>gi|291167620|dbj|BAI82526.1| 1,4-beta-D-xylosidase [Aureobasidium pullulans var. melanogenum]
Length = 805
Score = 448 bits (1152), Expect = e-123, Method: Compositional matrix adjust.
Identities = 267/731 (36%), Positives = 399/731 (54%), Gaps = 45/731 (6%)
Query: 47 KTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFF 106
K+ P++ RA+ LV+ T+ EK++ N++P +PRLG+P Y+WW EALHGVA G+ F
Sbjct: 46 KSADPVA-RAKALVAAFTVAEKLNLTGNNSPGVPRLGLPVYQWWQEALHGVAS-SPGVTF 103
Query: 107 N--GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIF 164
N G ATSFPQ IL A+FD L + + + EARA N G+A G+ FW PNIN +
Sbjct: 104 NATGQFDSATSFPQPILMGAAFDDALIQSVAEVVSTEARAFNNYGRA-GLDFWTPNINPY 162
Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
RDPRWGRGQETPGEDP Y S + G+QG G+ + +A CKHF YD+++
Sbjct: 163 RDPRWGRGQETPGEDPYHLSSYVHSLIMGLQG-----GEDPEIRKITATCKHFAGYDIES 217
Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
W G RY+ D ++ +DL + Y P F SC + MC Y+ +NG+P+CAD LL+
Sbjct: 218 WNGNLRYQNDVQIPQRDLVEYYLPSFRSCARDSNVGAFMCTYSALNGVPTCADPWLLNDV 277
Query: 285 ARRQWGFHG---YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA 341
R WG+ ++TSDCD++ I+ ++ + + A L AG D++CG++ Q H
Sbjct: 278 LREHWGWTNEEQWVTSDCDSIQNIFLPHNFSDTRQGAAAAALNAGTDLDCGTYYQHHLPL 337
Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQ 401
A Q + ++ +D+AL L++ +R G F+G P +DV + A Q LALQAA+
Sbjct: 338 AYSQGLINQTTVDQALVRLYTSLVRTGYFDG-PNAMYRNLTWSDVGTTHAQQ-LALQAAE 395
Query: 402 DGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN 461
+G+VLLKN GLLPL S +ALIG AN+ + GNY G +PL A Q
Sbjct: 396 EGMVLLKND-GLLPLSISNGTKIALIGSWANATTQMQGNYYGVPTYLHSPLYAAQQTGAQ 454
Query: 462 TVYYPGCDTVACSSASIDKAVDI---AKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQ 518
Y G + D + + A+ AD ++ + G+D + E E +DR D+ G Q
Sbjct: 455 VFYAQGPG--GQGDPTTDHWLPVWTAAEKADIIIYIGGVDISVEAEGMDREDINWTGAQL 512
Query: 519 ELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDH 578
++I +A KP++L + G +D T + NI +++W GYPG+ G VAL +I G
Sbjct: 513 DIIGELAMYG-KPMVLAQM-GDQLDNTPIVNNANISALIWGGYPGQDGGVALFNIITGKT 570
Query: 579 NPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYS 637
P GRLP+T YP YI +PMTDM +RP AT+G+PGRTY++Y G VF FG G+ Y+K+S
Sbjct: 571 APAGRLPVTQYPAHYIADIPMTDMTLRPNATTGSPGRTYKWYNGTAVFEFGYGMHYTKFS 630
Query: 638 YKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAG 697
+S++ ++ S +D ++S +++ V N G +
Sbjct: 631 ADISPMSKSSYDISSLLSGCNETYKDRCAFES---------------ISVNVHNTGNVTS 675
Query: 698 KHPVLLFVKPARRGNGRPIKQLVGFQ---SVILNAKEKAEIVFELSPCESLSRAREDGLM 754
+ L F+ + P K LV +Q ++ + + A + L SLSR + G
Sbjct: 676 DYAALGFIAGQFGPSPYPKKSLVNYQRLHNIAGGSSQTATLNLTLG---SLSRVDDHGNT 732
Query: 755 VIEEGTHFLVV 765
+ G + L++
Sbjct: 733 YLYPGDYALMI 743
>gi|291537442|emb|CBL10554.1| Beta-glucosidase-related glycosidases [Roseburia intestinalis
M50/1]
Length = 710
Score = 448 bits (1152), Expect = e-123, Method: Compositional matrix adjust.
Identities = 271/739 (36%), Positives = 401/739 (54%), Gaps = 90/739 (12%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
+RA +LV ++TL+EK++Q + APA+ RL I AY WW+EALHGVA G A
Sbjct: 15 KRAAELVGKMTLEEKVAQTLYQAPAVERLNIKAYNWWNEALHGVARAGT----------A 64
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTFWAPNINIFRD 166
T FPQ I AA+FD L ++G A+ EARA +N Q G+TFWAPN+NIFRD
Sbjct: 65 TVFPQAIGLAATFDEDLLEQVGDAVSTEARAKFNMQQEGKDTDIYKGLTFWAPNVNIFRD 124
Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
PRWGRG ET GEDP +T + V Y+ G+QG N L+A+AC KHF + +
Sbjct: 125 PRWGRGHETFGEDPYLTSRLGVRYIEGLQGHDEN------YLKAAACAKHFAVH---SGP 175
Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
R++FDA VT QDL +TY P FE+CVK+G+ +M AYNR NG+P C ++ LL R
Sbjct: 176 EAVRHEFDAEVTEQDLRETYLPAFEACVKEGKVEAVMGAYNRTNGVPCCGNKRLLIDILR 235
Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQK 346
++WGF G++TSDC A+ ++ + ++V + G D+NCG+ + AV+Q
Sbjct: 236 KEWGFSGHVTSDCWAIRDFHEGHHVTGTAIESVAMAMNNGCDLNCGTLFGFLVQ-AVRQG 294
Query: 347 KLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVL 406
+ E +D A+ NLF RM+LG+F+ P+ KI S + L A+ +VL
Sbjct: 295 LVKEERLDEAVTNLFMARMKLGVFD-KKEENPYDKIPYLAADSREMKKLNEAVARRTVVL 353
Query: 407 LKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV--- 463
LKN +LPL K+K ++ +IGPNA+S + L+GNY G + R IT L+ +++YV + V
Sbjct: 354 LKNKEHILPLDKNKIKTIGVIGPNADSRRALVGNYEGTASRYITVLEGIEDYVGDDVRVL 413
Query: 464 YYPGC-------DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---------LD 507
Y GC +A + + + + + K +D VV ++GLD E EE D
Sbjct: 414 YSEGCHLYKDRTSNLAQENDRMSEVLGVCKESDVVVAVLGLDAGIEGEEGDAGNEYGSGD 473
Query: 508 RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGA 567
+ DL LPG Q+E++ A + KPVILVLL G + + +A D ++ +I+ YPG G
Sbjct: 474 KPDLNLPGLQEEIL-EAAVSCGKPVILVLLSGSALAVNWA--DEHVDAIVQGWYPGARGG 530
Query: 568 VALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPF 627
A+A+++FG+ NP G+LP+T+Y D M+ GRTYR+ E + ++PF
Sbjct: 531 AAIADILFGEANPEGKLPVTFYRTTEELPDFEDYSMQ--------GRTYRYMEQEALYPF 582
Query: 628 GCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTI 687
G GLSY++Y+Y+ + + +E + VV + + +
Sbjct: 583 GYGLSYTEYAYQ---------------NVRFLEQEPVV--------------SEGVTIGL 613
Query: 688 GVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSR 747
VKN G+M G V ++VK + P QL + L A E+ EI L E+
Sbjct: 614 SVKNTGKMDGTETVQVYVKAEH--SKMPHGQLKKIVKLPLCAGEEKEINIRLE-SEAFML 670
Query: 748 AREDGLMVIEEGTHFLVVG 766
E+G ++ G + VG
Sbjct: 671 YDENGEKILPSGHFEIFVG 689
>gi|449303062|gb|EMC99070.1| glycoside hydrolase family 3 protein [Baudoinia compniacensis UAMH
10762]
Length = 786
Score = 447 bits (1151), Expect = e-123, Method: Compositional matrix adjust.
Identities = 265/737 (35%), Positives = 401/737 (54%), Gaps = 35/737 (4%)
Query: 37 NPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHG 96
N + T P C +L RA LV TL+E + N+AP +PRLG+PAYE W+EALHG
Sbjct: 51 NSTLSTTPVCNRSLSAWDRAHALVQLFTLEELANNTGNTAPGVPRLGLPAYEVWNEALHG 110
Query: 97 VAGVGKGIF-FNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT 155
++ G F NGT ATSFP IL+ AS + L +IG I + RA NAG+ G+
Sbjct: 111 IS---HGHFATNGTWSWATSFPSPILSMASMNRTLINQIGDIISTQGRAFSNAGR-YGLD 166
Query: 156 FWAPNINIFRDPRWGRGQETPGEDPL-VTGKYAVSYVRGVQGDTFNGGKLKGKLQASACC 214
+APNIN FR P WGRGQETPGED ++ YA Y+ G+QG GK + A
Sbjct: 167 SYAPNINGFRSPVWGRGQETPGEDAFFLSSLYAYEYITGMQG-----GKAPAVPKLVAVP 221
Query: 215 KHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPS 274
KHF YD++NW +R D +T QDLA Y P F S ++ +A G+MC+YN VNG+PS
Sbjct: 222 KHFAGYDIENWNNNSRLGLDVNITQQDLAGYYTPQFRSAIQNAKALGLMCSYNAVNGVPS 281
Query: 275 CADRNLLSKTARRQWGF-HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGS 333
C++ L AR WGF +G+++SDCDAV +Y+ GYA + AV D L+AG D++CG+
Sbjct: 282 CSNSFFLQTLARDTWGFGNGFVSSDCDAVYNVYNPHGYAANTTGAVADSLRAGTDIDCGT 341
Query: 334 FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQ 393
+ A + ++I+ AL +S + G F+GN ++ + +G + V +
Sbjct: 342 SYPFYLVPAFNAGLVSRNDIELALTRYYSGLVMQGYFDGNSSL--YRNLGWNDVLTTDAW 399
Query: 394 VLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQ 453
++ +AA +GI LLKN G LPL KS S+ALIGP AN+ L GNY + I+PLQ
Sbjct: 400 NISYEAAVEGITLLKND-GTLPLSKSTR-SVALIGPWANATLQLQGNYYAAAPYLISPLQ 457
Query: 454 ALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVL 513
A + + G + +++ +A+ +A+ +D ++ G+D + E E LDR ++
Sbjct: 458 AFRASGMTVNFVNGTTISSTNTSGFAEAITLAQQSDVIIYAGGIDNSIEAEGLDRQNITW 517
Query: 514 PGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEV 573
PG Q +LI ++++ KP++++ + GG VD + K + + +++W GYPG++G AL ++
Sbjct: 518 PGNQLDLIYQLSQVG-KPLVVLQMGGGQVDSSALKNNSKVNALVWGGYPGQSGGQALFDI 576
Query: 574 IFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSY 633
I G+ P GRL T YP Y + M +G+ G+TY +Y G V+PFG GL Y
Sbjct: 577 IMGNRAPAGRLVTTQYPASY-ATSFNQLNMNMAPVNGSLGQTYMWYTGTPVYPFGHGLFY 635
Query: 634 SKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFL-VTIGVKNH 692
+ ++ +S+ V ++ + P G EF E + V N
Sbjct: 636 TNFT--------------TTSTMGPVTTYNLTSIFAAPHPGYEFVEEVPIMDFNFIVNNT 681
Query: 693 GEMAGKHPVLLFVKPARRGNGRPIKQLVGF-QSVILNAKEKAEIVFELSPCESLSRARED 751
G A +LF RPIK LVG + I+ A + ++ P +L+RA +
Sbjct: 682 GRTASDWSGMLFASTTSGPTPRPIKWLVGIDREAIIVPGGLASVTIKV-PVGALARADAN 740
Query: 752 GLMVIEEGTHFLVVGDE 768
G +V+ G++ L++ +E
Sbjct: 741 GNLVVYPGSYSLMLNNE 757
>gi|240146254|ref|ZP_04744855.1| beta-glucosidase [Roseburia intestinalis L1-82]
gi|257201613|gb|EEU99897.1| beta-glucosidase [Roseburia intestinalis L1-82]
gi|291539969|emb|CBL13080.1| Beta-glucosidase-related glycosidases [Roseburia intestinalis
XB6B4]
Length = 710
Score = 447 bits (1151), Expect = e-123, Method: Compositional matrix adjust.
Identities = 271/739 (36%), Positives = 401/739 (54%), Gaps = 90/739 (12%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
+RA +LV ++TL+EK++Q + APA+ RL I AY WW+EALHGVA G A
Sbjct: 15 KRAAELVGKMTLEEKVAQTLYQAPAVERLNIKAYNWWNEALHGVARAGT----------A 64
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTFWAPNINIFRD 166
T FPQ I AA+FD L ++G A+ EARA +N Q G+TFWAPN+NIFRD
Sbjct: 65 TVFPQAIGLAATFDEDLLEQVGDAVSTEARAKFNMQQEGKDTDIYKGLTFWAPNVNIFRD 124
Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
PRWGRG ET GEDP +T + V Y+ G+QG N L+A+AC KHF + +
Sbjct: 125 PRWGRGHETFGEDPYLTSRLGVRYIEGLQGHDEN------YLKAAACAKHFAVH---SGP 175
Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
R++FDA VT QDL +TY P FE+CVK+G+ +M AYNR NG+P C ++ LL R
Sbjct: 176 EAVRHEFDAEVTEQDLRETYLPAFEACVKEGKVEAVMGAYNRTNGVPCCGNKRLLIDILR 235
Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQK 346
++WGF G++TSDC A+ ++ + ++V + G D+NCG+ + AV+Q
Sbjct: 236 KEWGFSGHVTSDCWAIRDFHEGHHVTGTAIESVAMAMNNGCDLNCGTLFGFLVQ-AVRQG 294
Query: 347 KLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVL 406
+ E +D A+ NLF RM+LG+F+ P+ KI S + L A+ +VL
Sbjct: 295 LVKEERLDEAVTNLFMARMKLGVFD-KKEENPYDKIPYLAADSREMKKLNEAVARRTVVL 353
Query: 407 LKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV--- 463
LKN +LPL K+K ++ +IGPNA+S + L+GNY G + R IT L+ +++YV + V
Sbjct: 354 LKNKEHILPLDKNKIKTVGVIGPNADSRRALVGNYEGTASRYITVLEGIEDYVGDDVRVL 413
Query: 464 YYPGC-------DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---------LD 507
Y GC +A + + + + + K +D VV ++GLD E EE D
Sbjct: 414 YSEGCHLYKDRTSNLAQENDRMSEVLGVCKESDVVVAVLGLDAGIEGEEGDAGNEYGSGD 473
Query: 508 RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGA 567
+ DL LPG Q+E++ A + KPVILVLL G + + +A D ++ +I+ YPG G
Sbjct: 474 KPDLNLPGLQEEIL-EAAVSCGKPVILVLLSGSALAVNWA--DEHVDAIVQGWYPGARGG 530
Query: 568 VALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPF 627
A+A+++FG+ NP G+LP+T+Y D M+ GRTYR+ E + ++PF
Sbjct: 531 AAIADILFGEANPEGKLPVTFYRTTEELPDFEDYSMQ--------GRTYRYMEQEALYPF 582
Query: 628 GCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTI 687
G GLSY++Y+Y+ + + +E + VV + + +
Sbjct: 583 GYGLSYTEYAYQ---------------NVRFLEQEPVV--------------SEGVTIGL 613
Query: 688 GVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSR 747
VKN G+M G V ++VK + P QL + L A E+ EI L E+
Sbjct: 614 SVKNTGKMDGTETVQVYVKAEH--SKMPHGQLKKIVKLPLCAGEEKEINIRLE-SEAFML 670
Query: 748 AREDGLMVIEEGTHFLVVG 766
E+G ++ G + VG
Sbjct: 671 YDENGEKILPSGHFEIFVG 689
>gi|402225863|gb|EJU05924.1| hypothetical protein DACRYDRAFT_113532 [Dacryopinax sp. DJM-731
SS1]
Length = 778
Score = 446 bits (1148), Expect = e-122, Method: Compositional matrix adjust.
Identities = 274/731 (37%), Positives = 395/731 (54%), Gaps = 37/731 (5%)
Query: 46 CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
C + L RAR LV LT+ EK + VN++P +PRLG+P Y WWSE LHGVA G+
Sbjct: 42 CDSALDPLTRARALVGMLTMAEKFNNTVNASPGVPRLGLPPYNWWSEGLHGVAS-SPGVT 100
Query: 106 F---NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
F ATSFP+ IL A+FD L Y I I EARA N + G+ FW PNIN
Sbjct: 101 FAPAGQNFSYATSFPEPILMGAAFDDNLIYDIATIISTEARAFNNFNHS-GLDFWTPNIN 159
Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL 222
RDPRWGR ETPGEDP Y V G+Q F G K + + A CKH+ YDL
Sbjct: 160 PVRDPRWGRSLETPGEDPFHLASYVAKLVTGLQ---FGGDDPKYQ-KLVATCKHYAGYDL 215
Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
+NW G RY FDA ++ QDL + + PPF++C + + +MC+YN VNGIPSCA+ LL
Sbjct: 216 ENWGGYARYGFDAVISNQDLVEYFLPPFQTCARDVNVTSVMCSYNAVNGIPSCANDYLLQ 275
Query: 283 KTARRQWGFH--------GYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSF 334
R WG+ Y+TSDCDAVS IY Y +PE AV LKAG D++CG+F
Sbjct: 276 SLLRTYWGWEPDSESLNAHYVTSDCDAVSNIYYPHNYTITPEQAVAVSLKAGTDLDCGTF 335
Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
+ ++ +Q +++IDRAL ++ LG F+ Q + + + + Q
Sbjct: 336 YAEWLPSSYEQGLFHQTDIDRALIRSYAALFLLGYFD-PAEGQIYRQYNWANINTDYAQQ 394
Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
LA AA +GI LLKN +LPLP S ++ALIGP AN+ + GNY G + +PL A
Sbjct: 395 LAYTAAWEGITLLKNIDDMLPLP-STMTNIALIGPWANATTQMQGNYQGIAPFLHSPLYA 453
Query: 455 LQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLP 514
LQ N Y G + + S+A A+ A+ AD + + G+D T E E +DRV++ P
Sbjct: 454 LQQRGINVTYVLGTNITSNSTAGFAAALAAAQTADLTLYIGGIDITVEAEAMDRVNITWP 513
Query: 515 GRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVI 574
G Q +LI ++A + +I+ + GG +D T + + +LW GYPG+ G A+ +++
Sbjct: 514 GNQLDLIAQLANVSTH-LIVYQMGGGQIDDTVLLENPKVHGLLWGGYPGQDGGTAMIDIL 572
Query: 575 FGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSY 633
+G P GRLP++ YP ++I +VPMTDM++ P G PGRTY++Y G V PFG GL Y
Sbjct: 573 YGSRAPAGRLPLSQYPANFINEVPMTDMRLHPAL--GTPGRTYKWYSGDLVLPFGYGLHY 630
Query: 634 SKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHG 693
+ ++ KA ++ + +T + E + +S L F + + V N G
Sbjct: 631 TTFA---KAALKDHSPRSSDIATLVNEAK-----QSSAWLDKAFFD----VFAAEVTNTG 678
Query: 694 EMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIV-FELSPCESLSRAREDG 752
+ + L ++ P LV + + + ++V F+L+ S++RA G
Sbjct: 679 SLTSDYVALGYLTGEFGPAPYPKSSLVSYTRLSQVTPGETQVVNFDLT-LGSIARADYYG 737
Query: 753 LMVIEEGTHFL 763
+ + GT+ L
Sbjct: 738 DLYLYPGTYTL 748
>gi|255690205|ref|ZP_05413880.1| xylosidase/arabinosidase [Bacteroides finegoldii DSM 17565]
gi|260624224|gb|EEX47095.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
finegoldii DSM 17565]
Length = 1425
Score = 445 bits (1145), Expect = e-122, Method: Compositional matrix adjust.
Identities = 289/779 (37%), Positives = 403/779 (51%), Gaps = 86/779 (11%)
Query: 12 LLCLCFTSLLTRVDSTQPPFSCDPSNPST--ETFPFCKTTLPISQRARDLVSRLTLDEKI 69
L +C L D T S P T + +PF L I QR DLVSRLTL+EK+
Sbjct: 679 LRSICVDKLYHNPDGTIKLVKQTKSTPITVQKRYPFRNPQLSIEQRVDDLVSRLTLEEKV 738
Query: 70 SQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSY 129
Q++N+APAI RLGIPAY WW+E LHGV T T FPQ I AAS++
Sbjct: 739 RQMLNNAPAIKRLGIPAYNWWNECLHGVGR---------TKYHVTVFPQAIGMAASWNDV 789
Query: 130 LWYRIGQAIGLEARALYNAGQAIG-------MTFWAPNINIFRDPRWGRGQETPGEDPLV 182
L + +I E RA+YN Q G +T+W PNINIFRDPRWGRGQET GEDP +
Sbjct: 790 LMKEVASSIADEGRAIYNDAQKRGDYSQYHALTYWTPNINIFRDPRWGRGQETYGEDPYL 849
Query: 183 TGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDL 242
T K ++V G+QGD L+ASAC KH+ + R+ F++ V+ DL
Sbjct: 850 TSKIGKAFVLGLQGDD------PRYLKASACAKHYAVHSGPE---KNRHSFNSDVSTYDL 900
Query: 243 ADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAV 302
DTY P F + V SG+MCAYN G P C + L+ R +W F GY+TSDC A+
Sbjct: 901 WDTYLPAFRTLVVDANVSGVMCAYNAFKGQPCCGNDLLMQSILRDKWNFKGYVTSDCGAI 960
Query: 303 SIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFS 362
I++ A D + G D++CG AVK + E ++D ++ LF+
Sbjct: 961 DDIFNHHKAHPDAATAAADAVFHGTDLDCGQSAYLALVKAVKNGIITEKQLDVSVKRLFT 1020
Query: 363 VRMRLGLFNGNPTMQ-PFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKS 421
+R RLGLF +P Q + I V+ HQ LA Q A++ +VLLKN LLPL K+K
Sbjct: 1021 IRFRLGLF--DPAEQVDYAHIPISVLECKKHQDLAKQLARESMVLLKNDR-LLPLQKNKL 1077
Query: 422 VSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNY---VENTVYYPGCDTV-ACSSAS 477
+ ++GPNA+ LLGNY G R +TPLQA++ V VY G D + S
Sbjct: 1078 KKVVVMGPNADCKDALLGNYNGHPSRMLTPLQAIRERLKGVAEVVYVSGIDYINTVSEDE 1137
Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
+ + V+ AKGAD V+ + G+ E EE+ DR + LP Q +L+ + A
Sbjct: 1138 LKRYVNQAKGADAVIFIGGISPRLEGEEMSVNKDGFDGGDRTSIALPTVQTQLMKALV-A 1196
Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMT 587
+ P + V++ G + I + +++ +IL A Y G+ G A+A+V+FGD+NP G+LP+T
Sbjct: 1197 GRIPTVFVMMTGSALAIPWEA--KHVPAILNAWYGGQYGGEAIADVLFGDYNPSGKLPVT 1254
Query: 588 WYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNK 647
+Y +D M+ GRTYR+++GK ++PFG GLSY+ + Y
Sbjct: 1255 FYAKDSDLPDFESYDMQ--------GRTYRYFKGKALYPFGYGLSYTDFRY--------- 1297
Query: 648 LYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKP 707
SS KM + K +P VT+ VKN G+M G+ V L+V
Sbjct: 1298 ------SSLKMPTACNTTD-KEIP-------------VTVTVKNTGKMDGEEVVQLYVSH 1337
Query: 708 ARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
+ P+ L GF+ + L A E +I F LS E LS E+G+ + GT + VG
Sbjct: 1338 PDKKILVPVTALKGFKRIYLKAGEAKQITFSLS-SEDLSCVDENGIRKVLPGTVKIQVG 1395
>gi|373955483|ref|ZP_09615443.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
gi|373892083|gb|EHQ27980.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
Length = 738
Score = 444 bits (1143), Expect = e-122, Method: Compositional matrix adjust.
Identities = 279/764 (36%), Positives = 397/764 (51%), Gaps = 95/764 (12%)
Query: 31 FSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWW 90
S P N T +PF L + +R DLV R+TL+EK+SQ++NSAPAI RLG+PAY WW
Sbjct: 20 LSFKPQN-HTYKYPFNNPALSMDERVADLVGRMTLEEKVSQMLNSAPAIERLGVPAYNWW 78
Query: 91 SEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ 150
+E LHGVA T T +PQ I AA++D + +G E RA+YN
Sbjct: 79 NECLHGVA---------RTPFKVTVYPQAIAMAATWDKTSMHVMGDYTAEEGRAVYNESI 129
Query: 151 A-------IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGK 203
+G+T+W PNINIFRDPRWGRGQET GEDP +TG+ ++V+G+QGD
Sbjct: 130 KNDKHDIYLGLTYWTPNINIFRDPRWGRGQETYGEDPFLTGEMGSAFVKGLQGDD----- 184
Query: 204 LKGKLQASACCKHFTAY----DLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRA 259
L+A+ C KH+ + DL R+KF+ ++ DL DTY P F V +
Sbjct: 185 -PRYLKAAGCAKHYAVHSGPEDL-------RHKFNTDISDYDLWDTYLPAFRKLVVDAKV 236
Query: 260 SGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--ED 317
+G+MCAYN G P C L++ +W F GY+TSDC + Y + P E
Sbjct: 237 TGVMCAYNAFKGQPCCGSDLLMNSILHDKWKFTGYVTSDCGGIDDFYRENTHQTQPDAES 296
Query: 318 AVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQ 377
A D + G DV CG+ K AVK KL E +ID++L LFSVR +LG+F+ ++
Sbjct: 297 AAADAVLHGTDVECGNVTYKSLVKAVKDGKLSEKQIDQSLKRLFSVRFKLGMFDPADAVK 356
Query: 378 PFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTL 437
+ +IG D + +PAH AL+ A IVLLKN LLPL K+ +A++GPNA++A ++
Sbjct: 357 -YNQIGKDALEAPAHGAQALKMAHQSIVLLKNEGNLLPLSKNLK-KIAVLGPNADNAVSV 414
Query: 438 LGNYAGPSCRSITPLQALQNYV---ENTVYYPGCDTVACSSASIDKAVDIA--KGADHVV 492
LGNY G R +T LQ ++N + +Y D VA S+A + A A K AD ++
Sbjct: 415 LGNYNGTPSRIVTALQGIKNKLPAGTEVIYDKAVDYVADSAARYNYAAMAAKVKDADAII 474
Query: 493 LMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPV 542
+ G+ E EE+ DR ++LPG Q EL+ + +A KPV+ V++ G +
Sbjct: 475 YIGGISPELEGEEMPVSKPGFHGGDRSTILLPGVQTELL-KALKATGKPVVFVMMTGSAI 533
Query: 543 DITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMK 602
+ N+ +I+ A Y G+A A+A+V+FGD+NP GRLP+T+Y D TD
Sbjct: 534 ATPWEA--ENLPAIVNAWYGGQAAGTAIADVLFGDYNPAGRLPVTFYGSDKDLPSFTDYS 591
Query: 603 MRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQ 662
M RTYR+++GK ++ FG GLSYSK+ Y A L L +
Sbjct: 592 MD--------NRTYRYFKGKPLYAFGYGLSYSKFEY---APLDAPLTLKAGEAL------ 634
Query: 663 DVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGF 722
VH K V N +M G+ L++ I+ L GF
Sbjct: 635 -TVHVK--------------------VTNKSKMDGEEVTELYLSHIGIKQKTAIRALKGF 673
Query: 723 QSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
+ ++ A E +I F+LS + LS +G +V G + VG
Sbjct: 674 ERTLIKAGETKDITFKLSSAD-LSITDLNGNLVKASGKIAISVG 716
>gi|348604625|dbj|BAK96214.1| beta-xylosidase [Acremonium cellulolyticus]
Length = 797
Score = 444 bits (1141), Expect = e-121, Method: Compositional matrix adjust.
Identities = 269/730 (36%), Positives = 407/730 (55%), Gaps = 34/730 (4%)
Query: 46 CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
C T+ RA L++ TL+E I+ N+AP +PRLG+P Y+ WSEALHG+
Sbjct: 64 CDTSANYVDRAEGLIALFTLEELINNTQNTAPGVPRLGLPPYQVWSEALHGLDRANFATS 123
Query: 106 FNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFR 165
+ ATSFP IL+ A+ + L +I IG +ARA NAG+ G+ +APNIN FR
Sbjct: 124 GD-EWTWATSFPMPILSMAALNRTLINQIAGIIGTQARAFNNAGR-YGLDAYAPNINGFR 181
Query: 166 DPRWGRGQETPGEDP-LVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
P WGRGQETPGED ++ YA Y+ G+QG G L+ A KHF YDL+N
Sbjct: 182 SPLWGRGQETPGEDANFLSSSYAYEYITGLQG-----GVDPDHLKVVATAKHFAGYDLEN 236
Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
W G +R FDA +T QDLA+ Y P F + + +A MC+YN VNG+PSC+ LL
Sbjct: 237 WGGNSRLGFDASITQQDLAEYYTPQFLAASRYAKARSFMCSYNSVNGVPSCSSSFLLQTL 296
Query: 285 ARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAA 342
R W F +GY++SDCDAV +++ GYA + A D L+AG D++CG + +
Sbjct: 297 LRDNWDFPEYGYVSSDCDAVYNVFNPHGYASNQSAAAADSLRAGTDIDCGQTYPWNLNQS 356
Query: 343 VKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQD 402
+ + EI+R++ L+S ++LG F+G+ + + ++G + V + ++ +AA +
Sbjct: 357 FIEGSVTRGEIERSIVRLYSNLVKLGYFDGDKS--EYRQLGWNDVVTTDAWNISYEAAVE 414
Query: 403 GIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENT 462
GIVLLKN G+LPL K S+ALIGP AN+ + L GNY G + ITPLQ +
Sbjct: 415 GIVLLKND-GILPLSKHVK-SIALIGPWANATEQLQGNYYGTAPYLITPLQGASDAGYKV 472
Query: 463 VYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELIT 522
Y G + + ++ A+ AK +D +V + G+D T E E DR+++ PG Q +LI
Sbjct: 473 NYALGTNILGNTTEGFADALSAAKKSDVIVYLGGIDNTIEAEGTDRMNVTWPGNQLDLIQ 532
Query: 523 RVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGG 582
++++ KP++++ + GG VD + K + + +++W GYPG++G A+ +++ G P G
Sbjct: 533 QLSQTG-KPLVVLQMGGGQVDSSSIKANSKVNALVWGGYPGQSGGTAIFDILSGKRVPAG 591
Query: 583 RLPMTWYPQDY-IKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFK 641
RL T YP +Y + P TDM +RP S NPG+TY +Y G V+ FG GL Y+ FK
Sbjct: 592 RLVTTQYPAEYATQFPATDMNLRPDGAS-NPGQTYMWYTGTPVYDFGYGLFYT----TFK 646
Query: 642 AVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFL-VTIGVKNHGEMAGKHP 700
+Q KL SSS + E +V P E+ E F+ +T +KN G+ A +
Sbjct: 647 ETAQ-KL---GSSSFDISE---IVAAPRSPSY--EYSELVPFVNITATIKNTGKTASPYT 697
Query: 701 VLLFVKPARRGNG-RPIKQLVGFQSVI-LNAKEKAEIVFELSPCESLSRAREDGLMVIEE 758
+LF G P K LVG+ + + + A++V + P +++R E+G ++
Sbjct: 698 AMLFANTTNAGPAPYPNKWLVGYDRLASIEPGKSADLVIPV-PIGAIARVDENGNRIVYP 756
Query: 759 GTHFLVVGDE 768
G + L + E
Sbjct: 757 GDYQLALNVE 766
>gi|410617070|ref|ZP_11328046.1| beta-glucosidase [Glaciecola polaris LMG 21857]
gi|410163339|dbj|GAC32184.1| beta-glucosidase [Glaciecola polaris LMG 21857]
Length = 731
Score = 443 bits (1139), Expect = e-121, Method: Compositional matrix adjust.
Identities = 267/737 (36%), Positives = 389/737 (52%), Gaps = 80/737 (10%)
Query: 50 LPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGT 109
+ +QRA LV+ +T+DEKI+QL ++ PAI RL +P Y WW+EALHG+A GK
Sbjct: 34 ISFAQRANLLVNAMTVDEKIAQLSHATPAIARLNVPQYNWWNEALHGIARNGK------- 86
Query: 110 IRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY-------NAGQAIGMTFWAPNIN 162
AT FPQ I AA+FD L +++ AI EARA Y N GQ G+TFW PN+N
Sbjct: 87 ---ATIFPQAIGLAATFDPDLAHQVASAISDEARAKYAIAQSIGNQGQYAGLTFWTPNVN 143
Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL 222
IFRDPRWGRGQET GEDP +T + ++V+G+QGD L+++ KHF +
Sbjct: 144 IFRDPRWGRGQETYGEDPFLTAQMGTAFVKGLQGDD------PKYLKSAGVAKHFAVH-- 195
Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
+ + R+ FD + +DL +TY P FE+ V Q + +G+MCAYN VNG P+CA LL
Sbjct: 196 -SGPESLRHHFDVEPSQKDLYETYLPAFEALVTQAKVAGVMCAYNAVNGEPACASAQLLD 254
Query: 283 KTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAA 342
++QWGFHGYI SDC A++ KS ++ L++G+++NCGS + KAA
Sbjct: 255 GILKKQWGFHGYIVSDCGALNDFQAGHKVTKSGPESAALALQSGVNLNCGSTYEHFLKAA 314
Query: 343 VKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQD 402
++Q +P ID+ L L +R +LG F+ + P+ ++ DV+ SP H L+ A+
Sbjct: 315 LEQNLVPLELIDQRLTQLLMIRFQLGFFDP-AGLNPYNEVTPDVIHSPEHINLSRDVARK 373
Query: 403 GIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE-- 460
IVLLKN + +LPL K V + GP A S+ L+GNY G S ++ L+ + V
Sbjct: 374 SIVLLKNDNHVLPLSKDIKVPY-VTGPFAASSDMLIGNYYGISDSLVSVLEGIAGKVSLG 432
Query: 461 -NTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELD---------RVD 510
+ Y G + ++ A +AK AD V+ ++G+ E EE+D RV
Sbjct: 433 SSLNYRSGSLPFHNNINPLNWAPQVAKTADAVIAVVGVSADMEGEEVDAIASADRGDRVA 492
Query: 511 LVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVAL 570
+ LP Q + + ++A K P+ILV+ G PVDI+ + +ILW YPGE G A+
Sbjct: 493 ITLPQNQVDYVKQLAAHKKGPLILVVAAGSPVDIS--DLEPLADAILWIWYPGEQGGNAV 550
Query: 571 AEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCG 630
A+V+FGD NP G LP+T+ P D M GRTY+F E ++PFG G
Sbjct: 551 ADVLFGDTNPSGHLPLTFVKSIDDLPPFDDYAMT--------GRTYKFLEKAPLYPFGFG 602
Query: 631 LSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVK 690
SY+++S+ VSQ K E +++ V+
Sbjct: 603 RSYTEFSFNDLTVSQGKA-----------------------------IEGEALTLSVEVE 633
Query: 691 NHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRARE 750
N G++AG+ V ++ P R N I L F+ + L KE + + + L +
Sbjct: 634 NRGDIAGETVVQAYLSPIARMNNEAISSLKSFKRIHLAPKETRWVELTIQ-GKDLYQVNN 692
Query: 751 DGLMVIEEGTHFLVVGD 767
G V +G + L VGD
Sbjct: 693 AGETVWPQGRYSLAVGD 709
>gi|242813865|ref|XP_002486253.1| beta-xylosidase, putative [Talaromyces stipitatus ATCC 10500]
gi|218714592|gb|EED14015.1| beta-xylosidase, putative [Talaromyces stipitatus ATCC 10500]
Length = 893
Score = 443 bits (1139), Expect = e-121, Method: Compositional matrix adjust.
Identities = 273/748 (36%), Positives = 401/748 (53%), Gaps = 49/748 (6%)
Query: 44 PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
P C T+L RA+ LV +T +EK+ N +P RLG+PAY+WW+EALHGVAG G
Sbjct: 163 PICDTSLDPLTRAKGLVDAMTFEEKVQNTQNGSPGAARLGLPAYQWWNEALHGVAG-SPG 221
Query: 104 IFF--NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNI 161
+ F +G ATSFPQ IL +A+FD L +G + +E RA N G A G+ FW PNI
Sbjct: 222 VTFQPSGNFSYATSFPQPILMSAAFDDALIKEVGTVVSIEGRAFNNYGNA-GLDFWTPNI 280
Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
N FRDPRWGRGQETPGEDP +Y + V G+Q G + A CKHF YD
Sbjct: 281 NPFRDPRWGRGQETPGEDPYHIARYVYNLVDGLQN-----GIAPANPRVVATCKHFAGYD 335
Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
+++W+G +RY F+A ++ QDL++ Y PPF+SC + + IMC+YN VNGIP+CAD LL
Sbjct: 336 IEDWEGNSRYGFNAIISTQDLSEYYLPPFKSCARDAQVDAIMCSYNAVNGIPTCADSYLL 395
Query: 282 SKTARRQWGFH---GYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKH 338
R W ++ ++TSDCDAV IY Y S A D L AG +++CG+ + +
Sbjct: 396 DTILRDHWNWNQTGHWVTSDCDAVDNIYSDHRYTSSLAAAAADALNAGTNLDCGTTMSNN 455
Query: 339 TKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQ 398
AA Q + ++ AL L+S +RLG F+ + + +G V + A Q LA +
Sbjct: 456 LAAAAAQDLFKNATLNSALVYLYSSLVRLGWFDSEDSQ--YSSLGWSDVGTTASQQLANR 513
Query: 399 AAQDGIVLLKNSH-GLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQN 457
AA +GIVLLKN H +LPL + ++ALIGP AN+ L GNY G T + +
Sbjct: 514 AAVEGIVLLKNDHKKVLPLSQHGQ-TIALIGPYANATTQLQGNYYGTPAYIRTLVWGAEQ 572
Query: 458 YVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQ 517
Y G + ++ AV AK AD V+ G+D + E E +DR + G Q
Sbjct: 573 MGYTVQYEAGTGINSTDTSGFAAAVAAAKTADIVIYAGGIDNSIEAEAMDRNTIAWTGNQ 632
Query: 518 QELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGD 577
+LI ++++ KP++++ GG +D + + N+ ++LW GYP + G A+ +++ G
Sbjct: 633 LQLIDQLSQVG-KPLVVLQFGGGQLDDSALLQNENVNALLWCGYPSQTGGQAVFDILTGQ 691
Query: 578 HNPGGRLPMTWYPQDYIK-VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKY 636
P GRLP+T YP +Y +PMTDM +RP ++ PGRTYR+Y+ V PFG GL Y+ +
Sbjct: 692 SAPAGRLPVTQYPANYTNAIPMTDMSLRPNGST--PGRTYRWYD-DAVIPFGFGLHYTTF 748
Query: 637 SY-----KFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKN 691
KF + L S S + QD + S + VKN
Sbjct: 749 DASWADKKFGPYNTASLVAKASKS----KYQDTAPFDS---------------FHVNVKN 789
Query: 692 HGEMAGKHPVLLFVKPARRG-NGRPIKQLVGF-QSVILNAKEKAEIVFELSPCESLSRAR 749
G++ LLF G PIK L+ + ++ + E + +++ S++R
Sbjct: 790 TGKVTSDFVALLFASTDNAGPKPYPIKTLISYARASSIKPGETRTVSIDVT-IGSIARTA 848
Query: 750 EDGLMVIEEGTHFLVVG-DEEYPISIFV 776
+G +V+ G++ L + + YP + F
Sbjct: 849 TNGDLVLYPGSYTLQLDVGQHYPTAEFT 876
>gi|313202830|ref|YP_004041487.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
gi|312442146|gb|ADQ78502.1| glycoside hydrolase family 3 domain protein [Paludibacter
propionicigenes WB4]
Length = 742
Score = 442 bits (1138), Expect = e-121, Method: Compositional matrix adjust.
Identities = 285/737 (38%), Positives = 400/737 (54%), Gaps = 83/737 (11%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
+PF T+ I +R +DLVSRLTLDEK Q++++APAI RLGI Y WW+EALHGVA G+
Sbjct: 38 YPFQDTSKTIDERVKDLVSRLTLDEKAGQMLHNAPAIKRLGILPYSWWNEALHGVARTGR 97
Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMT 155
AT FP+ + AA+FD L YRIGQAI EA A YN GQ G+T
Sbjct: 98 ----------ATVFPENVGLAATFDEDLVYRIGQAISDEAWAKYNIAQRLENYGQYSGIT 147
Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCK 215
F+APN+NIFRDPRWGRGQET GEDP +T + V+YV+G+QG+ L+ +AC K
Sbjct: 148 FYAPNVNIFRDPRWGRGQETYGEDPFLTSRMGVAYVKGMQGND------PKYLKTAACAK 201
Query: 216 HFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSC 275
H+ + + R+ +DA M+D +TY P FE+ VK+G+ +MCAYNR G P C
Sbjct: 202 HYVVH---SGPEALRHSYDAEPPMKDFMETYVPAFETLVKEGKVESVMCAYNRTFGKPCC 258
Query: 276 ADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFL 335
LL R +WGF GY+T+DC A+ Y G AK +A +K+G+++NCG+
Sbjct: 259 GSSFLLHDLLREKWGFTGYVTTDCWAIQNFYLHHGAAKDSLEACALAIKSGVNLNCGNEF 318
Query: 336 QKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVL 395
+ AAV++ + E E+D AL L R RLGLF+ +P P+ KI +V+ S + L
Sbjct: 319 N-YLPAAVRKGLVTEKEVDEALSQLLRTRFRLGLFD-SPNENPYAKIKEEVIGSQQNIDL 376
Query: 396 ALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL 455
A +AA +VLL+N + LPL K SL ++GP A + LLGNY G + R T +QA+
Sbjct: 377 AYEAAAKSLVLLQNKNNTLPLKKDMK-SLYVVGPYAANQDILLGNYNGVNSRLTTIMQAI 435
Query: 456 QNYVE---NTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE------- 505
V + Y G + A + S++ ++ A AD VV + G+ E EE
Sbjct: 436 VGKVSAGTSVNYRIGVEPSAPNKNSMNYSIGEAADADAVVAVFGISGVFEGEEGESTAST 495
Query: 506 --LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPG 563
DR+DL LP Q + + + + KKP+ILVL G P I + + +IL+ YPG
Sbjct: 496 SRGDRLDLNLPQNQLDYLRELKKKCKKPIILVLTGGSP--ICTPELADMVDAILFVWYPG 553
Query: 564 EAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFYEGK 622
+ G A+A+VIFGD NP GRL +T +P+ ++P D M+ GRTYR+ +
Sbjct: 554 QEGGHAVADVIFGDVNPSGRLCIT-FPKSVSQLPAFEDYSMK--------GRTYRYMTEE 604
Query: 623 EVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRK 682
++PFG GLSY+ YSY ++K+ QS VH
Sbjct: 605 PLYPFGFGLSYTNYSYSNIKTDKDKIKKGQS-----------VH---------------- 637
Query: 683 FLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPC 742
VT V N G+ AG+ L++ + P+ L G + V L A E E+ FE++P
Sbjct: 638 --VTATVSNTGKTAGEEVAQLYITDVKASAPTPLYALKGTKRVKLAAGESKEVSFEVTP- 694
Query: 743 ESLSRAREDGLMVIEEG 759
+ + G VIE G
Sbjct: 695 QMMELVTVTGEKVIEPG 711
>gi|60729621|pir||JC7966 xylan 1,4-beta-xylosidase (EC 3.2.1.37) - Talaromyces emersonii
gi|21326570|gb|AAL32053.2|AF439746_1 beta-xylosidase [Rasamsonia emersonii]
Length = 796
Score = 442 bits (1137), Expect = e-121, Method: Compositional matrix adjust.
Identities = 271/756 (35%), Positives = 408/756 (53%), Gaps = 41/756 (5%)
Query: 32 SCDPSNPSTETFPF-----CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPA 86
+ D S P E P C T+ RA LVS TL+E I+ N+AP +PRLG+P
Sbjct: 45 TIDLSFPDCENGPLSTNLVCNTSADPWARAEALVSLFTLEELINNTQNTAPGVPRLGLPQ 104
Query: 87 YEWWSEALHGVAGVGKGIFFN-GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARAL 145
Y+ W+EALHG+ + F + G ATSFP IL+ ASF+ L +I I +ARA
Sbjct: 105 YQVWNEALHGL---DRANFSDSGEYSWATSFPMPILSMASFNRTLINQIASIIATQARAF 161
Query: 146 YNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPL-VTGKYAVSYVRGVQGDTFNGGKL 204
NAG+ G+ +APNIN FR P WGRGQETPGED ++ YA Y+ G+QG G
Sbjct: 162 NNAGR-YGLDSYAPNINGFRSPLWGRGQETPGEDAFFLSSAYAYEYITGLQG-----GVD 215
Query: 205 KGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMC 264
++ A KHF YDL+NW +R +A +T QDL++ Y P F + + + +MC
Sbjct: 216 PEHVKIVATAKHFAGYDLENWGNVSRLGSNAIITQQDLSEYYTPQFLASARYAKTRSLMC 275
Query: 265 AYNRVNGIPSCADRNLLSKTARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDV 322
+YN VNG+PSC++ L R + F GY++SDCDAV +++ GYA + A D
Sbjct: 276 SYNAVNGVPSCSNSFFLQTLLRESFNFVDDGYVSSDCDAVYNVFNPHGYALNQSGAAADS 335
Query: 323 LKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKI 382
L AG D++CG + H + ++ + +I+++L L++ +RLG F+GN ++ + +
Sbjct: 336 LLAGTDIDCGQTMPWHLNESFYERYVSRGDIEKSLTRLYANLVRLGYFDGNNSV--YRNL 393
Query: 383 GADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA 442
+ V + ++ +AA +GI LLKN G LPL K K S+ALIGP AN+ + GNY
Sbjct: 394 NWNDVVTTDAWNISYEAAVEGITLLKND-GTLPLSK-KVRSIALIGPWANATVQMQGNYY 451
Query: 443 GPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQE 502
G I+PL+A + Y G + S+ +A+ AK +D ++ G+D T E
Sbjct: 452 GTPPYLISPLEAAKASGFTVNYAFGTNISTDSTQWFAEAISAAKKSDVIIYAGGIDNTIE 511
Query: 503 KEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYP 562
E DR DL PG Q +LI ++++ KP++++ + GG VD + K ++N+ +++W GYP
Sbjct: 512 AEGQDRTDLKWPGNQLDLIEQLSKVG-KPLVVLQMGGGQVDSSSLKANKNVNALVWGGYP 570
Query: 563 GEAGAVALAEVIFGDHNPGGRLPMTWYPQDY-IKVPMTDMKMRPQATSGNPGRTYRFYEG 621
G++G AL +++ G P GRL T YP +Y + P DM +RP + NPG+TY +Y G
Sbjct: 571 GQSGGAALFDILTGKRAPAGRLVSTQYPAEYATQFPANDMNLRPNGS--NPGQTYIWYTG 628
Query: 622 KEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETR 681
V+ FG GL Y+++ A T D++ P G E+ E
Sbjct: 629 TPVYEFGHGLFYTEFQESAAA------------GTNKTSTLDILDLVPTPHPGYEYIELV 676
Query: 682 KFL-VTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSV-ILNAKEKAEIVFEL 739
FL VT+ VKN G + LLF P K LVGF + ++ + A++ F +
Sbjct: 677 PFLNVTVDVKNVGHTPSPYTGLLFANTTAGPKPYPNKWLVGFDRLATIHPAKTAQVTFPV 736
Query: 740 SPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIF 775
P +++RA E+G VI G + L + +E + F
Sbjct: 737 -PLGAIARADENGNKVIFPGEYELALNNERSVVVSF 771
>gi|154313073|ref|XP_001555863.1| hypothetical protein BC1G_05538 [Botryotinia fuckeliana B05.10]
Length = 755
Score = 442 bits (1137), Expect = e-121, Method: Compositional matrix adjust.
Identities = 271/742 (36%), Positives = 396/742 (53%), Gaps = 61/742 (8%)
Query: 46 CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
C T+ RA L+S TL EK++ N++P +PR+G+P+YEWW+EALHG+A G
Sbjct: 34 CDTSSDPYTRAAALISLFTLAEKVNNTGNTSPGVPRIGLPSYEWWNEALHGIA-RSPGTT 92
Query: 106 FNGTIRG---ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
F T +TSFPQ IL A+FD L +++ + EARA N + G+ FW PNIN
Sbjct: 93 FAATGSNYSYSTSFPQPILMGATFDDELIHKVATQVSTEARAFNNVNR-FGLNFWTPNIN 151
Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL 222
++DPRWGRGQETPGEDP T Y + + G+QG + KG A CKHF YDL
Sbjct: 152 PYKDPRWGRGQETPGEDPFHTSSYVNALITGLQGGLDDLPYKKGV----ATCKHFAGYDL 207
Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
+N G RY FDA + QDL D Y PPF+ C + +MC+YN +NG+P+CAD LL
Sbjct: 208 ENSDGAIRYGFDAIIKSQDLRDYYLPPFQQCARDSNVQSVMCSYNAMNGVPTCADDWLLQ 267
Query: 283 KTARRQWGF---HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT 339
R WG+ ++TSDCDAV I+D Y +PE + D L AG D++CG+F +
Sbjct: 268 TLLREHWGWTEEDQWVTSDCDAVKNIWDYHNYTLTPEQSAADALNAGTDLDCGTFWPTYL 327
Query: 340 KAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQA 399
+A Q S +DR+L ++ +RLG F+ P++QP+ ++ D V +PA Q LALQA
Sbjct: 328 GSAYDQGLYDISTLDRSLARRYASLVRLGYFDP-PSVQPYRQLNWDNVSTPAAQQLALQA 386
Query: 400 AQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV 459
A+DGIVLLKN G+LPL S ++ALIGP AN+ K + GNY G + +PL A QN
Sbjct: 387 AEDGIVLLKND-GILPL-SSNITNVALIGPLANATKQMQGNYYGTAPYLRSPLIAAQNAG 444
Query: 460 ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQE 519
Y G D + ++ A+ A+ AD V+ + G+D + E EE+ +L P
Sbjct: 445 FKVTYVQGADIDSQNTTDFSAAISAAQSADLVIYVGGIDNSIEAEEI-LANLSTP----- 498
Query: 520 LITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHN 579
+++ G +D + + + ++LWAGYPG+ G A+ ++ G
Sbjct: 499 -------------LIISQMGCMIDSSSLLSNTGVNALLWAGYPGQDGGTAIFNILTGKTA 545
Query: 580 PGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSY 638
P GRLP+T YP +Y+ +V MTDM ++P + NPGRTY++Y G+ VF +G GL Y+ +
Sbjct: 546 PAGRLPITQYPSNYVNQVTMTDMNLQP--SRFNPGRTYKWYNGEPVFEYGYGLQYTTFDA 603
Query: 639 KFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGK 698
K S N + +D+ + +P I V N G
Sbjct: 604 KITPSSPNNTFEISELLANASNYKDLTPFVKIP---------------ITVSNTGTTTSD 648
Query: 699 HPVLLFVKPARRGNGRPIKQLVGF---QSVILNAKEKAEIVFELSPCESLSRAREDGLMV 755
+ L F+ P K LV + + A AE+ L+ SL+R +G ++
Sbjct: 649 YVALFFLSGTFGPAPHPKKSLVAYTRLHDITGGANATAEVSLNLA---SLARGNWNGDLI 705
Query: 756 IEEGTHFLVV---GDEEYPISI 774
+ G + +VV G +E+ ++
Sbjct: 706 LYPGDYKVVVDIDGKDEWSFTL 727
>gi|212531051|ref|XP_002145682.1| beta-xylosidase XylA [Talaromyces marneffei ATCC 18224]
gi|210071046|gb|EEA25135.1| beta-xylosidase XylA [Talaromyces marneffei ATCC 18224]
Length = 799
Score = 442 bits (1136), Expect = e-121, Method: Compositional matrix adjust.
Identities = 264/729 (36%), Positives = 400/729 (54%), Gaps = 36/729 (4%)
Query: 46 CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
C T+ RA L++ TL+E I+ NS P +PRLG+P YE WSE LHG+ + F
Sbjct: 64 CDTSANYVDRAEGLIALFTLEELINNTQNSGPGVPRLGLPPYEVWSEGLHGL---DRAHF 120
Query: 106 FNG--TIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
ATSFP IL+ A+ + L +I I +ARA N G+ G+ +APNIN
Sbjct: 121 VKSGDEWTWATSFPMPILSMAALNRTLINQIASIIATQARAFNNVGR-YGLDAYAPNING 179
Query: 164 FRDPRWGRGQETPGEDP-LVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL 222
FR P WGRGQETPGED +T YA Y+ G+QG G L+ +A KHF YDL
Sbjct: 180 FRSPLWGRGQETPGEDANFLTSSYAYEYITGLQG-----GIDPDNLKIAATAKHFAGYDL 234
Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
+NW G +R FDAR+T QDLA+ Y P F + + +A MC+YN VN IPSC+ LL
Sbjct: 235 ENWGGNSRLGFDARITQQDLAEYYTPQFLAASRYAKARSFMCSYNSVNAIPSCSSSFLLQ 294
Query: 283 KTARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTK 340
R QW F +GY++SDCDAV +++ GYA + A + L+AG D++CG H
Sbjct: 295 TLLREQWDFPEYGYVSSDCDAVYNVFNPHGYASNQSSAAAESLRAGTDIDCGQTYSWHLN 354
Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
+ + + EI+R++ L+S ++LG F+G+ + ++G + V + ++ +AA
Sbjct: 355 QSFIEGSVTRGEIERSILRLYSNLVKLGYFDGD--KNEYRQLGWNDVVTTDAWNISYEAA 412
Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE 460
+GIVLLKN G+LPL K+ S+AL+GP AN+ K L GNY G + ITPLQ +
Sbjct: 413 VEGIVLLKN-DGVLPLSKNVK-SVALVGPWANATKQLQGNYFGTAPYLITPLQGASDAGY 470
Query: 461 NTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQEL 520
Y G + ++ A+ AK +D +V + G+D T E E DR+++ P Q +L
Sbjct: 471 KVNYALGTNISGNTTDGFANALSAAKKSDVIVYLGGIDNTIEAEGTDRMNVTWPRNQLDL 530
Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
I ++++ KP++++ + GG VD + K + + +++W GYPG++G A+ +++ G P
Sbjct: 531 IQQLSQTG-KPLVVLQMGGGQVDSSSIKSNSKVNALIWGGYPGQSGGKAIFDILKGKRAP 589
Query: 581 GGRLPMTWYPQDY-IKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYK 639
GRL T YP +Y + P TDM +RP S NPG+TY +Y GK V+ FG GL Y+ +
Sbjct: 590 AGRLVSTQYPAEYATQFPATDMSLRPDGKS-NPGQTYMWYIGKPVYEFGYGLFYTTFKET 648
Query: 640 FKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFL-VTIGVKNHGEMAGK 698
K + + + S ++V P E+ E FL VT +KN G+ A
Sbjct: 649 AKKLGSSSSSFDIS---------EIVSSPRSPSY--EYSELVPFLNVTATIKNTGKTASP 697
Query: 699 HPVLLFVKPARRGNG-RPIKQLVGFQSV-ILNAKEKAEIVFELSPCESLSRAREDGLMVI 756
+ +LF G P K LVG+ + + + A++V + P +++R ++G ++
Sbjct: 698 YTAMLFANTTNAGPAPYPNKWLVGYDRLPSIEPGKSADLVIPV-PIGAIARVDKNGNRIV 756
Query: 757 EEGTHFLVV 765
G + L +
Sbjct: 757 YPGDYQLTL 765
>gi|261368518|ref|ZP_05981401.1| beta-glucosidase [Subdoligranulum variabile DSM 15176]
gi|282569400|gb|EFB74935.1| glycosyl hydrolase family 3 C-terminal domain protein
[Subdoligranulum variabile DSM 15176]
Length = 717
Score = 441 bits (1133), Expect = e-120, Method: Compositional matrix adjust.
Identities = 271/739 (36%), Positives = 393/739 (53%), Gaps = 91/739 (12%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
+RAR LV+++TL EKISQ+++ APAIPRLGIPAY WW+E +HGV G A
Sbjct: 13 ERARALVAQMTLKEKISQMLSWAPAIPRLGIPAYNWWNEGIHGVGRAGT----------A 62
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-------GMTFWAPNINIFRD 166
T FPQ I AASFD L ++G+A+G+EAR YN ++ G+T WAPN+NIFRD
Sbjct: 63 TVFPQAIGLAASFDEDLLGQVGEAVGVEARGKYNMYRSYQDRDIYKGLTIWAPNVNIFRD 122
Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
PRWGRG ET GEDP +T + V +V G+QGD + L+A+AC KHF + +
Sbjct: 123 PRWGRGHETYGEDPYLTSRLGVRFVEGMQGDDPD------YLRAAACAKHFAVH---SGP 173
Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
R+ FDA+V+ QDL +TY P F + VK+ +M AYNR NG P C + LL R
Sbjct: 174 EDQRHYFDAKVSQQDLWETYLPAFRALVKEAGVEAVMGAYNRTNGEPCCGSKTLLVDILR 233
Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQK 346
+W F G++TSDC A+ ++ P D+V + G D+NCG L + + AV +
Sbjct: 234 GKWNFQGHVTSDCWAIKDFHEGHMVTSGPVDSVALAVNNGCDLNCGD-LYAYLEEAVAEG 292
Query: 347 KLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVL 406
K+ E IDR+L LF+ RM+LG+F+ + P+ KIG D V S Q L L+ A+ +VL
Sbjct: 293 KVKEETIDRSLVRLFTTRMKLGMFDAEEKV-PYNKIGYDAVDSREMQALNLEVAEKILVL 351
Query: 407 LKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV--- 463
LKN + LPL KSK +A++GPNA++ K L+GNY G + R +T L +Q Y+ V
Sbjct: 352 LKNENHTLPLDKSKLHRVAVVGPNADNRKALVGNYEGTASRYVTVLDGIQEYLGEDVQVR 411
Query: 464 YYPGC-------DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---------LD 507
Y GC +A S+ I + + D V+ +GLD E EE D
Sbjct: 412 YSEGCHLYADKIQGLAKSNELISEVRGVCAECDVVICCLGLDAGLEGEEGDQGNQFASGD 471
Query: 508 RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGA 567
+ L LPG Q+ ++ E+ KPV++V+L G + + A+ ++L A YPG G
Sbjct: 472 KQSLSLPGNQESVLKACIESG-KPVVVVVLSGSALALGTAQ--EGAAAVLQAWYPGAQGG 528
Query: 568 VALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPF 627
A+A +FG+ NP G+LP+T+Y D TD M+ GRTYR+ E + ++PF
Sbjct: 529 RAVARALFGECNPQGKLPVTFYHSDEDLPAFTDYAMK--------GRTYRYMEKEPLYPF 580
Query: 628 GCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTI 687
G GLSYS ++++ ++G + + R +
Sbjct: 581 GYGLSYSHFTFRDAKA-------------------------DAAQIGPDGVDVR-----V 610
Query: 688 GVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSR 747
V N G+ G+ V ++VK R G P QL V L E+ + L C + +
Sbjct: 611 TVVNDGQYRGRETVEVYVKAERPGT--PNAQLKALAKVDLMPGEEKCVTLHLPQC-AFAL 667
Query: 748 AREDGLMVIEEGTHFLVVG 766
E+G+ + G + + +G
Sbjct: 668 CNEEGISEVLPGEYTVWLG 686
>gi|367046937|ref|XP_003653848.1| glycoside hydrolase family 3 protein [Thielavia terrestris NRRL
8126]
gi|347001111|gb|AEO67512.1| glycoside hydrolase family 3 protein [Thielavia terrestris NRRL
8126]
Length = 923
Score = 439 bits (1128), Expect = e-120, Method: Compositional matrix adjust.
Identities = 288/750 (38%), Positives = 402/750 (53%), Gaps = 66/750 (8%)
Query: 44 PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
P C T+LPI+ R R LV +LTL EKI+ LV+ A R+G+P YEWWSEALHGVA G
Sbjct: 163 PACNTSLPIADRVRWLVGQLTLQEKITNLVDGASGSARVGLPPYEWWSEALHGVA-ASPG 221
Query: 104 IFF---NGT-IRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAP 159
+ F NGT ATSFP I +A+FD L +I +G E RA N G + G FW P
Sbjct: 222 VTFAGPNGTAFSYATSFPMPITISAAFDDDLVSQIAAVVGREGRAFANHGLS-GFDFWTP 280
Query: 160 NINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASACCKHFT 218
NIN FRDPRWGRG ETPGED +Y + G+QG D + Q A CKH+
Sbjct: 281 NINPFRDPRWGRGPETPGEDAFRIQQYIRHLIPGLQGSDPLDK-------QIIATCKHYA 333
Query: 219 AYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADR 278
YD++ T RY++D DLA+ Y PF++CV+ +MC+YN V+GIP+CA
Sbjct: 334 VYDVE----TGRYEYDYDPQPHDLAEYYLAPFKTCVRDVGIGSVMCSYNAVDGIPACASE 389
Query: 279 NLLSKTARRQWGF---HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGS-F 334
LL R WGF + Y+ SDCDAV IY + SP A L AG D+ CGS +
Sbjct: 390 YLLQSVLRDHWGFTEPYQYVVSDCDAVRFIYSPHNFTDSPAAAAAVALNAGTDLECGSTY 449
Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
L + A E+ +DRAL L++ +G F+G+ +G +G D V + QV
Sbjct: 450 LNLNQSLASNMTT--EAALDRALTRLYTALHTIGFFDGSAR---YGGLGWDAVGTGDAQV 504
Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
LA QAA DG VLLKN LLPL + LA+IGP AN+ + GNY G + ++PL A
Sbjct: 505 LAYQAAVDGAVLLKNEKSLLPLDSKRLRKLAVIGPWANATTQMQGNYFGQAAYLVSPLAA 564
Query: 455 LQNY--VENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLV 512
Q+ +N ++ G S+A A+ AK AD VV + G+D + E E LDR +
Sbjct: 565 FQSAWGADNVLFANGTGIAGNSTAGFAAALAAAKAADAVVFLGGVDNSVESESLDRTAIS 624
Query: 513 LPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAE 572
PG Q +LI ++A A KP+++V GG +D + + +G++LWAGYPG+AG A+A+
Sbjct: 625 WPGNQLDLIAQLA-AVGKPLVVVQCGGGQLDDSALLANPRVGALLWAGYPGQAGGAAIAD 683
Query: 573 VIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGN------PGRTYRFYEGKEVF 625
++ G P GRLP+T Y Y +V + D +RP+ + G+ PGRTY++Y GK V
Sbjct: 684 LLTGKQAPAGRLPVTQYAASYTSEVSLFDPSLRPRRSGGSKSHSTFPGRTYKWYTGKPVL 743
Query: 626 PFGCGLSYSKYSYKFKAVSQNKLY-------LNQSSSTKMVENQDVVHYKSVPELGTEFC 678
PFG GL Y+ + + + + Y N ++++ D + P L
Sbjct: 744 PFGYGLHYTTFRTAWADEPRGRAYDIAGLFPANTTTTSSAFSAAD-----TYPVLN---- 794
Query: 679 ETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNG-RPIKQLVGF---QSVILNAKEKAE 734
V++ V N G A + LLF++ G P K LVG+ + + + + E
Sbjct: 795 ------VSVTVTNTGRGASDYVGLLFLRTRNAGPAPYPNKWLVGYARARGLAPGSSARLE 848
Query: 735 IVFELSPCESLSRAREDGLMVIEEGTHFLV 764
+ L SL+RA EDG V+ G + L+
Sbjct: 849 LAVALG---SLARADEDGRRVVYPGDYELL 875
>gi|323447708|gb|EGB03620.1| hypothetical protein AURANDRAFT_72703 [Aureococcus anophagefferens]
Length = 744
Score = 438 bits (1126), Expect = e-120, Method: Compositional matrix adjust.
Identities = 283/766 (36%), Positives = 384/766 (50%), Gaps = 100/766 (13%)
Query: 37 NPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHG 96
N + E PFC TL I RA D VSR+T+ EKI L I LG+PAY WWSEA G
Sbjct: 31 NATFEALPFCDATLAIDLRAADAVSRMTIPEKIDALDTKTGPIASLGLPAYNWWSEASSG 90
Query: 97 VAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTF 156
V G R T F + TA SF+ LW G AIG EARAL NAG A T+
Sbjct: 91 VMGS----------RPTTKFAYPVTTAMSFNRTLWRATGAAIGREARALMNAGAAYS-TY 139
Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG---DTFNGGKLKGKLQASAC 213
WAP +N+ R+PRWGR E PGEDP +TG+YA +V G Q D ++ LQASAC
Sbjct: 140 WAPVVNLAREPRWGRNIEVPGEDPYLTGEYATEFVGGFQAAPEDPYH-------LQASAC 192
Query: 214 CKHFTAYDLDNWKGTT-----RYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
CKH+ A +L+N + R D+ VT +DL D+Y PF++CV++G+ S +MC+YN
Sbjct: 193 CKHYVANELENTRQPDGEQWDRQHVDSNVTQRDLVDSYMVPFQACVEKGKVSSLMCSYNA 252
Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
VNG+PSCA+ LL AR W F GYITSDCDA S +YDA YA +PE+AV DVLKAG D
Sbjct: 253 VNGVPSCANDWLLRTVARDAWHFDGYITSDCDADSNVYDAHHYAATPEEAVADVLKAGTD 312
Query: 329 VNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLG----LFNGNPTMQPFGKIGA 384
V+C SF+ +H ++A+ + + E+++D L NLF VR+RLG F+ P +I A
Sbjct: 313 VDCQSFVGQHARSALDKGLITEADMDARLVNLFKVRLRLGHFDLSFDAAKPRGPLDEIDA 372
Query: 385 D-VVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG 443
D VVCS AH +++ LLKN G LPL S + A++GPNA +K G Y G
Sbjct: 373 DAVVCSDAHLDASMEGLAQSATLLKND-GALPLKPSGTA--AVVGPNALLSKADAGYY-G 428
Query: 444 PSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEK 503
P+ AD VVL +G D T
Sbjct: 429 PT----------------------------------------DAADAVVLAVGTDLTWAA 448
Query: 504 EELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT--FAKYDRNIGSILWAGY 561
E D +V Q ELI VA A+ PV++V+ P+D+T A+ D +G+++ G
Sbjct: 449 EGKDATSIVFTAAQLELIDAVATASATPVVVVVFSATPLDLTPLLARSDGKVGAVVHVGQ 508
Query: 562 PGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATS----------- 609
P L ++++G + GR T YP Y ++ + D MRP ++
Sbjct: 509 P-SVTVKGLGDLLYGRRSFAGRAVQTVYPAAYADQISIFDFNMRPGPSAFARPDCATNES 567
Query: 610 -----GNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDV 664
NPGRTYRFY + V PFG GLSY+ ++Y AV ++ +
Sbjct: 568 ACPRGTNPGRTYRFYVDEPVVPFGFGLSYTTFAY---AVRSAPTTVDLAPLRAAYAGVAA 624
Query: 665 VHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRG-NGRPIKQLVGFQ 723
P + + + V N G++ VL FV P G +G P+K+L GF+
Sbjct: 625 ARGDGGPAFLSLHDDAAAATYAVDVTNTGDIDADDVVLGFVTPPGAGVDGVPLKELFGFE 684
Query: 724 SVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEE 769
V + A E + V+ EDG + G + + G E
Sbjct: 685 RVHVKAGET-KTVYLYPALSKFKTVAEDGALAARPGDYAIEFGIPE 729
>gi|238923424|ref|YP_002936940.1| beta-glucosidase [Eubacterium rectale ATCC 33656]
gi|238875099|gb|ACR74806.1| beta-glucosidase [Eubacterium rectale ATCC 33656]
Length = 714
Score = 437 bits (1125), Expect = e-120, Method: Compositional matrix adjust.
Identities = 273/743 (36%), Positives = 395/743 (53%), Gaps = 91/743 (12%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
+ A+ LVS++T+DEKISQ++ +PAI RLGIP Y WW+EALHGVA G A
Sbjct: 7 EYAKKLVSQMTIDEKISQMLYESPAIERLGIPEYNWWNEALHGVARAGV----------A 56
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTFWAPNINIFRD 166
T FPQ I AA+FD+ L +IG + E R +N G G+TFWAPN+NIFRD
Sbjct: 57 TVFPQAIGLAATFDTDLIEKIGDVVSTEGRGKFNEFSKKGDHGIYKGLTFWAPNVNIFRD 116
Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
PRWGRG ET GEDP +TGK +Y+RG+QGD + L+++AC KHF + +
Sbjct: 117 PRWGRGHETYGEDPYLTGKLGCAYIRGLQGDDPD------HLKSAACAKHFAVH---SGP 167
Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
R++FDA+ + D+ DTY F+ CVK + +M AYNRVNG P+C R LL R
Sbjct: 168 EAIRHEFDAKASKHDMYDTYLYAFKRCVKDAKVEAVMGAYNRVNGEPACGSRTLLKDILR 227
Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGS-FLQKHTKAAVKQ 345
++GF G++ SDC A+ ++ + E++ + G D+NCGS FL H K A +
Sbjct: 228 DEFGFEGHVVSDCWAILDFHEHHHVTDTVEESAAMAVNNGCDLNCGSAFL--HLKDAYDK 285
Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
+ + I A+ L VR+RLG+ P+ P+ I +VV H L+++AA+ +V
Sbjct: 286 GMVSDEAITAAVERLMEVRIRLGMMKDYPS--PYEDISYEVVECKEHVELSVEAARRSLV 343
Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV-ENT-- 462
LLKN LPL + ++A+IGPNANS L+GNY G S R ITPL+ LQ Y+ E+T
Sbjct: 344 LLKNKDNFLPLDRKNVKTIAVIGPNANSRDALIGNYYGTSSRYITPLEGLQQYLGEDTRV 403
Query: 463 VYYPGC----DTVACSSASIDK---AVDIAKGADHVVLMMGLDQTQEKEE---------L 506
+Y GC D V + D+ A+ +A+ +D VV+ +GLD T E EE
Sbjct: 404 LYAEGCHLYKDKVQGLAEEKDRFKEALIMAEQSDVVVMCLGLDATIEGEEGDAGNEYASG 463
Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
D++ L+LPG Q+EL+ VA A KPVILVL G +D+++A + ++ +I+ + YPG G
Sbjct: 464 DKLGLMLPGLQEELLEAVA-AVGKPVILVLSAGSAIDLSWA--EEHVDAIIDSWYPGARG 520
Query: 567 AVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFP 626
A+AE IFG+++P G+LP+T+Y TD M RTYR+ ++P
Sbjct: 521 GKAVAEAIFGEYSPNGKLPVTFYQGTENLPEFTDYSM--------AHRTYRYTNENVLYP 572
Query: 627 FGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVT 686
FG GL Y + +Y +V + + +N+ V
Sbjct: 573 FGYGLHYGETNYDGLSVDKAESDVNEPVE-----------------------------VF 603
Query: 687 IGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLS 746
+ V N V L+++ P QL G + V L E ++ LSP +
Sbjct: 604 VNVTNDSRYTVNEIVQLYIRHVDAAEYEPGYQLKGIEVVKLEPHETKKVKLTLSP-RDFA 662
Query: 747 RAREDGLMVIEEGTHFLVVGDEE 769
EDG V G + + G ++
Sbjct: 663 VIEEDGSCVAVPGIYEISAGGQQ 685
>gi|374372635|ref|ZP_09630297.1| Beta-glucosidase [Niabella soli DSM 19437]
gi|373235166|gb|EHP54957.1| Beta-glucosidase [Niabella soli DSM 19437]
Length = 734
Score = 437 bits (1124), Expect = e-119, Method: Compositional matrix adjust.
Identities = 269/754 (35%), Positives = 397/754 (52%), Gaps = 97/754 (12%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
PF L R DLVSRLTL+EK+ Q++N APAIPRLGIPAY+WWSE LHGVA
Sbjct: 26 LPFWNYKLSFEARVNDLVSRLTLEEKVKQMLNHAPAIPRLGIPAYDWWSEVLHGVAR--- 82
Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGM 154
T T +PQ I AA++D+ Y + E RA++N + +G+
Sbjct: 83 ------TPYHTTVYPQAIAMAATWDTVALYTMADQSAREGRAIHNKATEEGKNGDRYVGL 136
Query: 155 TFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACC 214
T+W PNINIFRDPRWGRGQET GEDP +T ++VRG+QG+ L+A+AC
Sbjct: 137 TYWTPNINIFRDPRWGRGQETYGEDPFLTAMLGRAFVRGLQGED------PKYLKAAACA 190
Query: 215 KHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPS 274
KH Y + + R+ FD V+ DL +TY P F+ V + +G+MCAYN P
Sbjct: 191 KH---YAIHSGPEAVRHSFDVDVSDYDLWNTYLPAFKELVTHAKVAGVMCAYNAFRKKPC 247
Query: 275 CADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSF 334
C L++ RRQWGF GY+TSDC A+ ++ + E A +D + G DV CG+
Sbjct: 248 CGSDLLMTDILRRQWGFTGYVTSDCGAIDDFFNYHKTHPNAEAAAIDAVTNGTDVECGNR 307
Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
AVK ++ E EIDR++ LF +RMRLG+F+ +M + + V+ S H+
Sbjct: 308 AYLTLTDAVKTGRIAEKEIDRSVKRLFMIRMRLGMFD-PVSMVSYAQTSPAVLESAPHKA 366
Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
AL+ AQ+ IVLLKN + LLPL KS +A++GPNA+++ +LGNY G + +T L
Sbjct: 367 QALKMAQESIVLLKNENHLLPLSKSIK-KIAVVGPNADNSIAVLGNYNGTPSKIVTALDG 425
Query: 455 LQNYV--ENTVYYPGCDTVACSSASI-DKAVDIA------KGADHVVLMMGLDQTQEKEE 505
++ + +V Y V ++A + + D A K AD ++ + G+ E EE
Sbjct: 426 IKAKLGTNGSVVYEKA--VNFTNAMLPEGKTDFAALTSRVKDADAIIFVGGISPQLEGEE 483
Query: 506 L----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGS 555
+ DR ++LP Q E + + +A KPV+ V++ G + I + + NI +
Sbjct: 484 MKVNEPGFNSGDRTTILLPTVQTEAM-KALKATGKPVVFVMMTGSALAIPWEQ--ENIPA 540
Query: 556 ILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRT 615
I+ A Y G+A A+A+V+FGD+NP GRLP+T+Y D D +M RT
Sbjct: 541 IVNAWYGGQAAGTAIADVLFGDYNPSGRLPVTFYKSDADLPAFDDYRME--------NRT 592
Query: 616 YRFYEGKEVFPFGCGLSYSKYSY---KFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPE 672
YR++ G+ ++PFG GLSY+ + Y K +NK+
Sbjct: 593 YRYFSGQALYPFGYGLSYTTFRYEGLKVPTTVKNKV------------------------ 628
Query: 673 LGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEK 732
+ V+I + N G G+ V L++ + +P+K L GFQ V LN +
Sbjct: 629 ---------RIPVSIQLTNTGAKGGEEVVQLYISYQGQPIKKPLKALKGFQRVWLNRGQT 679
Query: 733 AEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
I F L+P ++L+ A E+G ++ +G + VG
Sbjct: 680 KTIKFLLTP-DALAIAGENGKLLNPKGKLRISVG 712
>gi|291525508|emb|CBK91095.1| Beta-glucosidase-related glycosidases [Eubacterium rectale DSM
17629]
Length = 714
Score = 437 bits (1124), Expect = e-119, Method: Compositional matrix adjust.
Identities = 273/743 (36%), Positives = 395/743 (53%), Gaps = 91/743 (12%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
+ A+ LVS++T+DEKISQ++ +PAI RLGIP Y WW+EALHGVA G A
Sbjct: 7 EYAKKLVSQMTIDEKISQMLYESPAIERLGIPEYNWWNEALHGVARAGV----------A 56
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTFWAPNINIFRD 166
T FPQ I AA+FD+ L +IG + E R +N G G+TFWAPN+NIFRD
Sbjct: 57 TVFPQAIGLAATFDTDLIEKIGDVVSTEGRGKFNEFSKKGDHGIYKGLTFWAPNVNIFRD 116
Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
PRWGRG ET GEDP +TGK +Y+RG+QGD + L+++AC KHF + +
Sbjct: 117 PRWGRGHETYGEDPYLTGKLGCAYIRGLQGDDPD------HLKSAACAKHFAVH---SGP 167
Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
R++FDA+ + D+ DTY F+ CVK + +M AYNRVNG P+C R LL R
Sbjct: 168 EAIRHEFDAKASKHDMYDTYLYAFKRCVKDAKVEAVMGAYNRVNGEPACGSRTLLKDILR 227
Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGS-FLQKHTKAAVKQ 345
++GF G++ SDC A+ ++ + E++ + G D+NCGS FL H K A +
Sbjct: 228 DEFGFEGHVVSDCWAILDFHEHHHVTDTVEESAAMAVNNGCDLNCGSAFL--HLKDAYDK 285
Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
+ + I A+ L VR+RLG+ P+ P+ I +VV H L+++AA+ +V
Sbjct: 286 GLVSDEAITAAVERLMEVRIRLGMMKDYPS--PYEDISYEVVECKEHVELSVEAARRSLV 343
Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV-ENT-- 462
LLKN LPL + ++A+IGPNANS L+GNY G S R ITPL+ LQ Y+ E+T
Sbjct: 344 LLKNKDNFLPLDRKNVKTIAVIGPNANSRDALIGNYYGTSSRYITPLEGLQQYLGEDTRV 403
Query: 463 VYYPGC----DTVACSSASIDK---AVDIAKGADHVVLMMGLDQTQEKEE---------L 506
+Y GC D V + D+ A+ +A+ +D VV+ +GLD T E EE
Sbjct: 404 LYAEGCHLYKDKVQGLAEEKDRFKEALIMAEQSDVVVMCLGLDATIEGEEGDAGNEYASG 463
Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
D++ L+LPG Q+EL+ VA A KPVILVL G +D+++A + ++ +I+ + YPG G
Sbjct: 464 DKLGLMLPGLQEELLEAVA-AVGKPVILVLSAGSAIDLSWA--EEHVDAIIDSWYPGARG 520
Query: 567 AVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFP 626
A+AE IFG+++P G+LP+T+Y TD M RTYR+ ++P
Sbjct: 521 GKAVAEAIFGEYSPSGKLPVTFYQGTENLPEFTDYSM--------AHRTYRYTNENVLYP 572
Query: 627 FGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVT 686
FG GL Y + +Y +V + + +N+ V
Sbjct: 573 FGYGLHYGETNYDGMSVDKAESDVNEPVE-----------------------------VF 603
Query: 687 IGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLS 746
+ V N V L+++ P QL G + V L E ++ LSP +
Sbjct: 604 VNVTNDSRYTVNEIVQLYIRHVDAAEYEPGYQLKGIEVVKLEPYETKKVKLTLSP-RDFA 662
Query: 747 RAREDGLMVIEEGTHFLVVGDEE 769
EDG V G + + G ++
Sbjct: 663 VIEEDGSCVAVPGIYEISAGGQQ 685
>gi|410628680|ref|ZP_11339398.1| beta-glucosidase [Glaciecola mesophila KMM 241]
gi|410151684|dbj|GAC26167.1| beta-glucosidase [Glaciecola mesophila KMM 241]
Length = 732
Score = 437 bits (1123), Expect = e-119, Method: Compositional matrix adjust.
Identities = 272/737 (36%), Positives = 391/737 (53%), Gaps = 80/737 (10%)
Query: 50 LPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGT 109
L RA+ LV+ +T+DEKI+QL +S PAIPRL +P Y WW+EALHG+A GK
Sbjct: 35 LSFETRAQALVNAMTIDEKITQLSHSTPAIPRLEVPQYNWWNEALHGIARNGK------- 87
Query: 110 IRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY-------NAGQAIGMTFWAPNIN 162
AT FPQ I A+FD L + AI EARA Y N GQ G+TFW PN+N
Sbjct: 88 ---ATIFPQAIGLGATFDPELAQEVANAISDEARAKYAIAQSIGNQGQYAGLTFWTPNVN 144
Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL 222
IFRDPRWGRGQET GEDPL+T + ++V+G+QGD L+++ KHF +
Sbjct: 145 IFRDPRWGRGQETYGEDPLLTSQMGTAFVKGLQGDD------PKYLKSAGVAKHFAVH-- 196
Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
+ + R++FD + +DL +TY P FE+ V Q + +G+MCAYN V G PSCA LL
Sbjct: 197 -SGPESLRHQFDVEPSKKDLYETYLPAFEALVTQAKVAGVMCAYNGVYGQPSCASEFLLG 255
Query: 283 KTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAA 342
+ +++W F+GY+ SDC A+ + + ++ L+AG+D+NCG +K KAA
Sbjct: 256 EMLKKKWQFNGYVVSDCGALHDFHSGHKVTHNRVESAALALRAGVDLNCGFTYEKSLKAA 315
Query: 343 VKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQD 402
++ + +S ID+ L NL +R RLGLF+ + + P IG +V+ S H LA + A
Sbjct: 316 FEEGLITQSLIDQRLKNLLMIRFRLGLFDPS-ELNPHNAIGQEVIHSLEHIELARKVAAK 374
Query: 403 GIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV--- 459
IVLLKN +LPL K V + GP A S+ L+GNY G S +T L+ + V
Sbjct: 375 SIVLLKNEKQVLPLSKDIKVPY-VTGPFAASSDMLMGNYYGISDSLVTVLEGIAGKVSLG 433
Query: 460 ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL---------DRVD 510
+ Y G + ++ A ++AK AD V+ ++G+ E EE+ DRV
Sbjct: 434 SSLNYRAGALPFHSNINPLNWAPEVAKTADAVIAVVGISADMEGEEVDAIASADRGDRVA 493
Query: 511 LVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVAL 570
+ LP Q + + ++AE K P+ILV+ G PVDI+ + D +ILW YPGE G A+
Sbjct: 494 ITLPQNQVDYVKQLAENKKGPLILVVAAGSPVDIS--ELDPLADAILWIWYPGEQGGNAV 551
Query: 571 AEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCG 630
A+VIFGD NP G LP+T+ P D M GRTY+F + ++PFG G
Sbjct: 552 ADVIFGDTNPSGHLPLTFVKTIDDLPPFDDYTMT--------GRTYKFLKKLPLYPFGFG 603
Query: 631 LSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVK 690
LSY+++ + KL L+ K P+ G +++ V+
Sbjct: 604 LSYTQFKF-------GKLSLS----------------KRAPQEGENIN------ISVEVE 634
Query: 691 NHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRARE 750
N + G+ V +++ P I L F+ V + A EK I F + ++L R +
Sbjct: 635 NSTALDGETVVQVYLSPQVPLKNEAITNLKAFKRVHIGAYEKRLIEFTIE-GKNLYRVND 693
Query: 751 DGLMVIEEGTHFLVVGD 767
G V G + L VGD
Sbjct: 694 AGENVWPSGAYTLAVGD 710
>gi|292495282|sp|B0XP71.1|XYND_ASPFC RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|159131796|gb|EDP56909.1| beta-xylosidase XylA [Aspergillus fumigatus A1163]
Length = 792
Score = 437 bits (1123), Expect = e-119, Method: Compositional matrix adjust.
Identities = 266/756 (35%), Positives = 405/756 (53%), Gaps = 39/756 (5%)
Query: 32 SCDPSNPSTETFPF-----CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPA 86
+ D S P E P C T+ RA LVS T +E ++ N++P +PRLG+P
Sbjct: 44 TIDLSFPDCENGPLSKTLVCDTSARPHDRAAALVSMFTFEELVNNTGNTSPGVPRLGLPP 103
Query: 87 YEWWSEALHGVAGVGKGIFFN-GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARAL 145
Y+ WSEALHG+ + F + G ATSFP ILT ++ + L +I I + RA
Sbjct: 104 YQVWSEALHGL---DRANFTDEGEYSWATSFPMPILTMSALNRTLINQIATIIATQGRAF 160
Query: 146 YNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPL-VTGKYAVSYVRGVQGDTFNGGKL 204
N G+ G+ +APNIN FR WGRGQETPGED + YA Y+ G+QG G
Sbjct: 161 NNVGR-YGLDVYAPNINAFRSAMWGRGQETPGEDAYCLASAYAYEYITGIQG-----GVD 214
Query: 205 KGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMC 264
L+ A KH+ YDL+NW G +R D +T Q+L++ Y P F + + +MC
Sbjct: 215 PEHLKLVATAKHYAGYDLENWDGHSRLGNDMNITQQELSEYYTPQFLVAARDAKVHSVMC 274
Query: 265 AYNRVNGIPSCADRNLLSKTARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDV 322
+YN VNG+PSCA+ L R +GF GY++SDCD+ +++ +A + A D
Sbjct: 275 SYNAVNGVPSCANSFFLQTLLRDTFGFVEDGYVSSDCDSAYNVWNPHEFAANITGAAADS 334
Query: 323 LKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKI 382
++AG D++CG+ Q + A ++++ +EI+R + L+S +RLG F+GN ++
Sbjct: 335 IRAGTDIDCGTTYQYYFGEAFDEQEVTRAEIERGVIRLYSNLVRLGYFDGNGSVYR-DLT 393
Query: 383 GADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA 442
DVV + A + + +AA +GIVLLKN G LPL KS S+ALIGP N L GNY
Sbjct: 394 WNDVVTTDAWNI-SYEAAVEGIVLLKND-GTLPLAKSVR-SVALIGPWMNVTTQLQGNYF 450
Query: 443 GPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQE 502
GP+ I+PL A QN + Y G + + S+ +A+ AK +D ++ G+D T E
Sbjct: 451 GPAPYLISPLNAFQNSDFDVNYAFGTNISSHSTDGFSEALSAAKKSDVIIFAGGIDNTLE 510
Query: 503 KEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYP 562
E +DR+++ PG Q +LI ++++ KP+I++ + GG VD + K ++N+ S++W GYP
Sbjct: 511 AEAMDRMNITWPGNQLQLIDQLSQLG-KPLIVLQMGGGQVDSSSLKSNKNVNSLIWGGYP 569
Query: 563 GEAGAVALAEVIFGDHNPGGRLPMTWYPQDY-IKVPMTDMKMRPQATSGNPGRTYRFYEG 621
G++G AL ++I G P GRL +T YP +Y + P TDM +RP NPG+TY +Y G
Sbjct: 570 GQSGGQALLDIITGKRAPAGRLVVTQYPAEYATQFPATDMSLRPHGN--NPGQTYMWYTG 627
Query: 622 KEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETR 681
V+ FG GL Y+ + ++K N QD++ + P G E
Sbjct: 628 TPVYEFGHGLFYTTFHASLPGTGKDKTSFNI---------QDLL---TQPHPGFANVEQM 675
Query: 682 KFL-VTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELS 740
L T+ + N G++A + +LF P K LVGF + +++ +
Sbjct: 676 PLLNFTVTITNTGKVASDYTAMLFANTTAGPAPYPNKWLVGFDRLASLEPHRSQTMTIPV 735
Query: 741 PCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
+S++R E G V+ G + L + +E + FV
Sbjct: 736 TIDSVARTDEAGNRVLYPGKYELALNNERSVVLQFV 771
>gi|336425135|ref|ZP_08605165.1| hypothetical protein HMPREF0994_01171 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336013044|gb|EGN42933.1| hypothetical protein HMPREF0994_01171 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 705
Score = 437 bits (1123), Expect = e-119, Method: Compositional matrix adjust.
Identities = 274/744 (36%), Positives = 392/744 (52%), Gaps = 102/744 (13%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
++A +LVS++TL+EK SQL APAIPRLG+P Y WW+EALHGVA G A
Sbjct: 9 EKAHELVSQMTLEEKASQLRYDAPAIPRLGVPTYNWWNEALHGVARAGV----------A 58
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTFWAPNINIFRD 166
TSFPQ I AA+FD L +G A+ E RA YN G+TFW+PN+NIFRD
Sbjct: 59 TSFPQAIAMAAAFDDELLKTVGDAVAAEGRAKYNEYSRHDDRDIYKGLTFWSPNVNIFRD 118
Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQG---DTFNGGKLKGKLQASACCKHFTAYDLD 223
PRWGRG ET GEDP +T + V+YV G+QG D F ++ +AC KHF +
Sbjct: 119 PRWGRGHETYGEDPYLTSRLGVAYVEGLQGSQDDDF--------MKTAACAKHFAVH--- 167
Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
+ + R++FDA+ + +D+ +TY P FE+CVK+ +M AYNR NG P C L+
Sbjct: 168 SGPESVRHEFDAQASKKDMYETYLPAFEACVKEAGVEAVMGAYNRTNGEPCCGSPTLIQN 227
Query: 284 TARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTKAA 342
R +W F G+ SDC A++ + K+PE++ LK+G DVNCG ++L H A
Sbjct: 228 ILREEWDFQGHYVSDCWAIADFHMHHMVTKTPEESAALALKSGCDVNCGVTYL--HLLKA 285
Query: 343 VKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQD 402
+Q + E EI +A LF+ R LG F+ N + I +VV H LA + A++
Sbjct: 286 YQQGLVTEEEITQAAERLFTTRFLLGCFDKN----EYDDIPYEVVECKEHLELAQKMAKE 341
Query: 403 GIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENT 462
+VLLKN G+LPL K ++ +IGPNA+S L+GNY G S R IT L+ +Q++V
Sbjct: 342 SMVLLKND-GILPLNKDGLKTIGVIGPNADSRTPLVGNYHGTSSRYITLLEGIQDFVGED 400
Query: 463 V---YYPGC-------DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE------- 505
V Y GC + + I +A+ +A+ +D VVL +GLD+ E EE
Sbjct: 401 VRVYYSEGCHIYKDRVEGLGWKQDRISEALTVAEHSDVVVLCLGLDENLEGEEGDTGNSY 460
Query: 506 --LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPG 563
D+ DL LP Q+EL+ VA KPV+L ++ G +D+ FA ++ +IL YPG
Sbjct: 461 ASGDKKDLELPESQRELLEAVA-GCGKPVVLCMMSGSAIDMQFAA--EHVNAILQVWYPG 517
Query: 564 EAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFYEGK 622
G A AE++FG +P G+LP+T+Y +D P D M+ GRTYR+ E +
Sbjct: 518 ARGGKAAAEILFGACSPSGKLPVTFY-KDLEGFPAFEDYSMK--------GRTYRYLEKE 568
Query: 623 EVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRK 682
++PFG GL+Y + K EL E ++
Sbjct: 569 PLYPFGYGLTYGQVCVK------------------------------AAELTGAVEEGKE 598
Query: 683 FLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPC 742
+ V+N G+ + +++K N P L F+ V L EKAEI+ ++ P
Sbjct: 599 LTIKAMVENSGKYDTDDVIQVYIKDLDSKNAVPNHSLCAFKRVSLKKGEKAEILLKV-PY 657
Query: 743 ESLSRAREDGLMVIEEGTHFLVVG 766
E+L E+G ++ L VG
Sbjct: 658 EALMAVDEEGKKYVDSSHFVLSVG 681
>gi|76160898|gb|ABA40420.1| Xld [Aspergillus fumigatus]
Length = 792
Score = 436 bits (1122), Expect = e-119, Method: Compositional matrix adjust.
Identities = 266/756 (35%), Positives = 405/756 (53%), Gaps = 39/756 (5%)
Query: 32 SCDPSNPSTETFPF-----CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPA 86
+ D S P E P C T+ RA LVS T +E ++ N++P +PRLG+P
Sbjct: 44 TIDLSFPDCENGPLSKTLVCDTSARPHDRAAALVSMFTFEELVNNTGNTSPGVPRLGLPP 103
Query: 87 YEWWSEALHGVAGVGKGIFFN-GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARAL 145
Y+ WSEALHG+ + F + G ATSFP ILT ++ + L +I I + RA
Sbjct: 104 YQVWSEALHGL---DRANFTDEGEYSWATSFPMPILTMSALNRTLINQIATIIATQGRAF 160
Query: 146 YNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPL-VTGKYAVSYVRGVQGDTFNGGKL 204
N G+ G+ +APNIN FR WGRGQETPGED + YA Y+ G+QG G
Sbjct: 161 NNVGR-YGLDVYAPNINAFRSAMWGRGQETPGEDAYCLASAYAYEYITGIQG-----GVD 214
Query: 205 KGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMC 264
L+ A KH+ YDL+NW G +R D +T Q+L++ Y P F + + +MC
Sbjct: 215 PEHLKLVATAKHYAGYDLENWDGHSRLGNDMNITQQELSEYYTPQFLVAARDAKVHSVMC 274
Query: 265 AYNRVNGIPSCADRNLLSKTARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDV 322
+YN VNG+PSCA+ L R +GF GY++SDCD+ +++ +A + A D
Sbjct: 275 SYNAVNGVPSCANSFFLQTLLRDTFGFVEDGYVSSDCDSAYNVWNPHEFAANITGAAADS 334
Query: 323 LKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKI 382
++AG D++CG+ Q + A ++++ +EI+R + L+S +RLG F+GN ++
Sbjct: 335 IRAGTDIDCGTTYQYYFGEAFDEQEVTRAEIERGVIRLYSNLVRLGYFDGNGSVYR-DLT 393
Query: 383 GADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA 442
DVV + A + + +AA +GIVLLKN G LPL KS S+ALIGP N L GNY
Sbjct: 394 WNDVVTTDAWNI-SYEAAVEGIVLLKND-GTLPLAKSVR-SVALIGPWMNVTTQLQGNYF 450
Query: 443 GPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQE 502
GP+ I+PL A QN + Y G + + S+ +A+ AK +D ++ G+D T E
Sbjct: 451 GPAPYLISPLNAFQNSDFDVNYAFGTNISSHSTDGFSEALSAAKKSDVIIFAGGIDNTLE 510
Query: 503 KEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYP 562
E +DR+++ PG Q +LI ++++ KP+I++ + GG VD + K ++N+ S++W GYP
Sbjct: 511 AEAMDRMNITWPGNQLQLIDQLSQLG-KPLIVLQMGGGQVDSSSLKSNKNVNSLIWGGYP 569
Query: 563 GEAGAVALAEVIFGDHNPGGRLPMTWYPQDY-IKVPMTDMKMRPQATSGNPGRTYRFYEG 621
G++G AL ++I G P GRL +T YP +Y + P TDM +RP NPG+TY +Y G
Sbjct: 570 GQSGGQALLDIITGKRAPAGRLVVTQYPAEYATQFPATDMSLRPHGN--NPGQTYMWYTG 627
Query: 622 KEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETR 681
V+ FG GL Y+ + ++K N QD++ + P G E
Sbjct: 628 TPVYEFGHGLFYTTFHASLPGTGKDKTSFNI---------QDLL---TQPHPGFANVEQM 675
Query: 682 KFL-VTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELS 740
L T+ + N G++A + +LF P K LVGF + +++ +
Sbjct: 676 PLLNFTVTITNTGKVASDYTAMLFANTTAGPAPYPNKWLVGFDRLASLEPHRSQTMTIPV 735
Query: 741 PCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
+S++R E G V+ G + L + +E + FV
Sbjct: 736 TIDSVARTDEAGNRVLYPGKYELALNNERSVVLQFV 771
>gi|380696433|ref|ZP_09861292.1| glycoside hydrolase [Bacteroides faecis MAJ27]
Length = 739
Score = 436 bits (1122), Expect = e-119, Method: Compositional matrix adjust.
Identities = 272/751 (36%), Positives = 396/751 (52%), Gaps = 86/751 (11%)
Query: 39 STETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVA 98
+ E FPF LP+ QR DLVSRLTL+EK+ Q++NS P + RLGIPAY WW+E LHG+
Sbjct: 21 AQEKFPFRDPQLPVEQRVEDLVSRLTLEEKVKQMLNSTPPVERLGIPAYNWWNECLHGIG 80
Query: 99 GVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQA 151
T T FPQ I AA+++ L + +I E RA+YN Q
Sbjct: 81 R---------TKYHVTVFPQAIGMAAAWNDALIKEVASSIADEGRAIYNDTQRKEDYSQY 131
Query: 152 IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQAS 211
+T+W PNINIFRDPRWGRGQET GEDP +T + ++V+G+QGD L+AS
Sbjct: 132 HALTYWTPNINIFRDPRWGRGQETYGEDPYLTARIGEAFVQGLQGDN------PRYLKAS 185
Query: 212 ACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
AC KH+ + + R+ F++ V+ DL DTY P F + V + SG+MCAYN G
Sbjct: 186 ACAKHYAVH---SGPEKNRHSFNSDVSTYDLWDTYLPAFRTLVVDAKVSGVMCAYNAFQG 242
Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC 331
P C + L+ R +W F GY+TSDC A+ I++ A D + G D++C
Sbjct: 243 QPCCGNDLLMQSILRDKWNFTGYVTSDCGAIDDIFNHHKTHPDAATAAADAVFHGTDLDC 302
Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
G AVK + E ++D ++ LF++R RLGLF+ + + +I ++
Sbjct: 303 GHSAYLALVKAVKDGIITEKQLDVSVKRLFTIRFRLGLFDPVELVD-YARIPISILECRK 361
Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITP 451
HQ LA Q A++ +VLLKN LLPL K+K + ++GPNA+S ++LLGNY G R +TP
Sbjct: 362 HQDLAKQLARESMVLLKNDQ-LLPLQKNKLKKVVVMGPNADSRESLLGNYNGNPSRMLTP 420
Query: 452 LQALQNYV---ENTVYYPGCDTVACSSAS-IDKAVDIAKGADHVVLMMGLDQTQEKEEL- 506
LQA++ + Y G D V SA + + V+ AKGAD V+ + G+ E EE+
Sbjct: 421 LQAIRERLGGWTEVEYIEGVDHVNTISADDLKQYVNRAKGADAVIFIGGISPRLEGEEMP 480
Query: 507 ---------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSIL 557
DR + LP Q +++ + A P + V++ G + I + +N+ +IL
Sbjct: 481 VSKDGFDGGDRTTIALPAVQTQMM-KAWVAEHIPTVFVMMTGSALAIPWEA--QNVPAIL 537
Query: 558 WAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYR 617
A Y G+ G A+A+V+FGD+NP G+LP+T+Y +D M+ GRTYR
Sbjct: 538 NAWYGGQYGGEAIADVLFGDYNPSGKLPVTFYAKDSDLPDFESYDMQ--------GRTYR 589
Query: 618 FYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEF 677
++ GK ++PFG GLSY+ ++Y SS K+ +
Sbjct: 590 YFNGKALYPFGYGLSYTSFAY---------------SSLKL----------------PKV 618
Query: 678 CET--RKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEI 735
C T ++ VT+ VKN G G+ V L+V + P+ L GF+ + L A E +
Sbjct: 619 CRTTDKEIEVTVTVKNTGHTEGEEVVQLYVSHPDKKILVPLTALKGFKRIQLKAGEAQRV 678
Query: 736 VFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
F LS E LS E+G+ + GT + VG
Sbjct: 679 TFSLS-SEDLSCVDENGIRKVWAGTVKIQVG 708
>gi|70996610|ref|XP_753060.1| beta-xylosidase XylA [Aspergillus fumigatus Af293]
gi|74672055|sp|Q4WRB0.1|XYND_ASPFU RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|66850695|gb|EAL91022.1| beta-xylosidase XylA [Aspergillus fumigatus Af293]
Length = 792
Score = 436 bits (1122), Expect = e-119, Method: Compositional matrix adjust.
Identities = 266/756 (35%), Positives = 405/756 (53%), Gaps = 39/756 (5%)
Query: 32 SCDPSNPSTETFPF-----CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPA 86
+ D S P E P C T+ RA LVS T +E ++ N++P +PRLG+P
Sbjct: 44 TIDLSFPDCENGPLSKTLVCDTSARPHDRAAALVSMFTFEELVNNTGNTSPGVPRLGLPP 103
Query: 87 YEWWSEALHGVAGVGKGIFFN-GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARAL 145
Y+ WSEALHG+ + F + G ATSFP ILT ++ + L +I I + RA
Sbjct: 104 YQVWSEALHGL---DRANFTDEGEYSWATSFPMPILTMSALNRTLINQIATIIATQGRAF 160
Query: 146 YNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPL-VTGKYAVSYVRGVQGDTFNGGKL 204
N G+ G+ +APNIN FR WGRGQETPGED + YA Y+ G+QG G
Sbjct: 161 NNVGR-YGLDVYAPNINAFRSAMWGRGQETPGEDAYCLASAYAYEYITGIQG-----GVD 214
Query: 205 KGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMC 264
L+ A KH+ YDL+NW G +R D +T Q+L++ Y P F + + +MC
Sbjct: 215 PEHLKLVATAKHYAGYDLENWDGHSRLGNDMNITQQELSEYYTPQFLVAARDAKVHSVMC 274
Query: 265 AYNRVNGIPSCADRNLLSKTARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDV 322
+YN VNG+PSCA+ L R +GF GY++SDCD+ +++ +A + A D
Sbjct: 275 SYNAVNGVPSCANSFFLQTLLRDTFGFVEDGYVSSDCDSAYNVWNPHEFAANITGAAADS 334
Query: 323 LKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKI 382
++AG D++CG+ Q + A ++++ +EI+R + L+S +RLG F+GN ++
Sbjct: 335 IRAGTDIDCGTTYQYYFGEAFDEQEVTRAEIERGVIRLYSNLVRLGYFDGNGSVYR-DLT 393
Query: 383 GADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA 442
DVV + A + + +AA +GIVLLKN G LPL KS S+ALIGP N L GNY
Sbjct: 394 WNDVVTTDAWNI-SYEAAVEGIVLLKND-GTLPLAKSVR-SVALIGPWMNVTTQLQGNYF 450
Query: 443 GPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQE 502
GP+ I+PL A QN + Y G + + S+ +A+ AK +D ++ G+D T E
Sbjct: 451 GPAPYLISPLNAFQNSDFDVNYAFGTNISSHSTDGFSEALSAAKKSDVIIFAGGIDNTLE 510
Query: 503 KEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYP 562
E +DR+++ PG Q +LI ++++ KP+I++ + GG VD + K ++N+ S++W GYP
Sbjct: 511 AEAMDRMNITWPGNQLQLIDQLSQLG-KPLIVLQMGGGQVDSSSLKSNKNVNSLIWGGYP 569
Query: 563 GEAGAVALAEVIFGDHNPGGRLPMTWYPQDY-IKVPMTDMKMRPQATSGNPGRTYRFYEG 621
G++G AL ++I G P GRL +T YP +Y + P TDM +RP NPG+TY +Y G
Sbjct: 570 GQSGGQALLDIITGKRAPAGRLVVTQYPAEYATQFPATDMSLRPHGN--NPGQTYMWYTG 627
Query: 622 KEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETR 681
V+ FG GL Y+ + ++K N QD++ + P G E
Sbjct: 628 TPVYEFGHGLFYTTFHASLPGTGKDKTSFNI---------QDLL---TQPHPGFANVEQM 675
Query: 682 KFL-VTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELS 740
L T+ + N G++A + +LF P K LVGF + +++ +
Sbjct: 676 PLLNFTVTITNTGKVASDYTAMLFANTTAGPAPYPNKWLVGFDRLASLEPHRSQTMTIPV 735
Query: 741 PCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
+S++R E G V+ G + L + +E + FV
Sbjct: 736 TIDSVARTDEAGNRVLYPGKYELALNNERSVVLQFV 771
>gi|348684866|gb|EGZ24681.1| hypothetical protein PHYSODRAFT_325770 [Phytophthora sojae]
Length = 805
Score = 436 bits (1122), Expect = e-119, Method: Compositional matrix adjust.
Identities = 292/789 (37%), Positives = 415/789 (52%), Gaps = 73/789 (9%)
Query: 8 LVFPLL-CLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLD 66
+V PLL + SL+T +T P +N + PFC T+L + R DL+SRL L
Sbjct: 2 IVLPLLLAIASASLVTPTAATIPRACVGKAN---QELPFCNTSLSTADRVEDLLSRLPLQ 58
Query: 67 EKISQLVNSAPAIPR-----LGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVIL 121
EK + L +A A PR +G+P Y W + +HGV GT TSFP +
Sbjct: 59 EKATLL--TARASPRGNMSSIGLPEYNWGANCVHGVQSTC------GT-NCPTSFPNPVN 109
Query: 122 TAASFDSYLWYRIGQAIGLEARAL--------YNAGQAIGMTFWAPNINIFRDPRWGRGQ 173
A FD + + + Q IG E RAL Y G +G+ W+PNINI RDPRWGR
Sbjct: 110 LGAIFDPQVVFDMAQVIGWELRALWLEGATENYKGGPHLGLDCWSPNININRDPRWGRNT 169
Query: 174 ETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKF 233
ETP EDPLV KY V+Y RG+Q + LQA KH+ AY +N+ G R +F
Sbjct: 170 ETPSEDPLVNSKYGVAYTRGLQEGKRQDPRF---LQAVVTLKHYAAYSYENYGGVNRMEF 226
Query: 234 DARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHG 293
DA V+ D ADTY P F S V G A G+MC+YN VNGIP CA++ L+ R GF G
Sbjct: 227 DAIVSPYDFADTYFPAFRSSVVDGNAKGVMCSYNSVNGIPMCANKELVETLLRGTLGFDG 286
Query: 294 YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEI 353
Y+TSD AV I D YA S +A + AG D+N G + K V +L E +
Sbjct: 287 YVTSDSGAVEAISDMHHYADSQCEAARLAILAGTDINSGKSYEACLKTLVDDNQLEEKAL 346
Query: 354 DRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGL 413
D AL + +R LGLF+ QP+ + V + A + L+L A + +V+L+N+ +
Sbjct: 347 DDALRHTLKLRFELGLFDPIDD-QPYWNVTPSEVNTAAAKALSLNATRKSLVMLQNNASV 405
Query: 414 LPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS--------ITPLQALQ--NYVENTV 463
LPL K V LA++GP+A S + LLGNY G C TPL A++ N NT
Sbjct: 406 LPL--QKGVKLAVLGPHAKSKRGLLGNYLGQMCHGDYDEVGCVQTPLDAIRAANGASNTT 463
Query: 464 YYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITR 523
+ GC S+A +KAV AK AD VVL +G+D++ E E DR ++ LP Q +L+ R
Sbjct: 464 FAEGCGISGNSTAGFEKAVAAAKEADAVVLFLGIDKSIEGEVGDRNNIDLPNIQMQLLQR 523
Query: 524 VAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGR 583
V A +P ++VL+ GG I + +++ A YPG GA A+A+V+FGD NP G+
Sbjct: 524 V-HAVGRPTVVVLINGGV--IGAEEIIERTDALVEAFYPGFFGARAMADVLFGDTNPSGK 580
Query: 584 LPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKA 642
LP+T Y DY+ +V M M M + +PGRTYR+++G+ VFPFG GLSY+ +S +
Sbjct: 581 LPVTMYRSDYVDQVEMKSMDM-----TAHPGRTYRYFKGEPVFPFGWGLSYTTFSLSVDS 635
Query: 643 VSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVL 702
+ + + N ++ + G E +T +++ VKN GE+AG VL
Sbjct: 636 GTNSSSHSNNAAFS-----------------GGEVSDTANVTISVVVKNDGEVAGDEVVL 678
Query: 703 LFVKPARRGNGRPI----KQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEE 758
F +P P +QL +Q V L + E+ F + +L+ E+G +
Sbjct: 679 AFFRPVNSNVTGPATLLNEQLFDYQRVSLGPLDSTEVSFTIER-STLALPDEEGNLASFP 737
Query: 759 GTHFLVVGD 767
G++ ++V +
Sbjct: 738 GSYEVIVSN 746
>gi|291528382|emb|CBK93968.1| Beta-glucosidase-related glycosidases [Eubacterium rectale M104/1]
Length = 714
Score = 436 bits (1121), Expect = e-119, Method: Compositional matrix adjust.
Identities = 271/743 (36%), Positives = 393/743 (52%), Gaps = 91/743 (12%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
+ A+ LVS++T+DEKISQ++ +PAI RLGIP Y WW+EALHGVA G A
Sbjct: 7 EYAKKLVSQMTIDEKISQMLYESPAIERLGIPEYNWWNEALHGVARAGV----------A 56
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTFWAPNINIFRD 166
T FPQ I AA+FD+ L +IG + E R +N G G+TFWAPN+NIFRD
Sbjct: 57 TVFPQAIGLAAAFDADLIEKIGDVVSTEGRGKFNEFSKKGDHGIYKGLTFWAPNVNIFRD 116
Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
PRWGRG ET GEDP +TGK +Y+RG+QGD + L+++AC KHF + +
Sbjct: 117 PRWGRGHETYGEDPYLTGKLGCAYIRGLQGDDPD------HLKSAACAKHFAVH---SGP 167
Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
R++FDA+ + D+ DTY F+ CVK + +M AYNRVNG P+C R LL R
Sbjct: 168 EAIRHEFDAKASKHDMYDTYLYAFKRCVKDAKVEAVMGAYNRVNGEPACGSRTLLKDILR 227
Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGS-FLQKHTKAAVKQ 345
++GF G++ SDC A+ ++ + E++ + G D+NCGS FL H K A +
Sbjct: 228 DEFGFEGHVVSDCWAILDFHEHHHVTDTVEESAAMAVNNGCDLNCGSAFL--HLKDAYDK 285
Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
+ + I A+ L VR+RLG+ P+ P+ I +VV H L+++AA+ +V
Sbjct: 286 GLVSDEAITAAVERLMEVRIRLGMMKDYPS--PYEDISYEVVECKEHVELSVEAARRSLV 343
Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN---T 462
LLKN LPL + ++A+IGPNANS L+GNY G S R ITPL+ LQ Y+ +
Sbjct: 344 LLKNKDNFLPLDRKNVKTIAVIGPNANSRDALIGNYYGTSSRYITPLEGLQQYLGDDTRV 403
Query: 463 VYYPGC----DTVACSSASIDK---AVDIAKGADHVVLMMGLDQTQEKEE---------L 506
+Y GC D V + D+ A+ +A+ +D VV+ +GLD T E EE
Sbjct: 404 LYAEGCHLYKDKVQGLAEEKDRFKEALIMAEQSDVVVMCLGLDATIEGEEGDAGNEYASG 463
Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
D++ L+LPG Q+EL+ VA A KPVILVL G +D+++A + ++ +I+ + YPG G
Sbjct: 464 DKLGLMLPGLQEELLEAVA-AVGKPVILVLSAGSAIDLSWA--EEHVDAIIDSWYPGARG 520
Query: 567 AVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFP 626
A+AE IFG+++P G+LP+T+Y TD M RTYR+ ++P
Sbjct: 521 GKAVAEAIFGEYSPSGKLPVTFYQGTENLPEFTDYSM--------AHRTYRYTNENVLYP 572
Query: 627 FGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVT 686
FG GL Y + +Y +V + + +N+ V
Sbjct: 573 FGYGLHYGETNYDGLSVDKAESDVNEPVE-----------------------------VF 603
Query: 687 IGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLS 746
+ V N V L+++ P QL G + V L E ++ LSP +
Sbjct: 604 VNVTNDSRYTVNEIVQLYIRHVDAAEYEPGYQLKGIEVVKLEPHETKKVKLTLSP-RDFA 662
Query: 747 RAREDGLMVIEEGTHFLVVGDEE 769
EDG V G + + G ++
Sbjct: 663 VIEEDGSCVAVPGIYEISAGGQQ 685
>gi|330836687|ref|YP_004411328.1| Beta-glucosidase [Sphaerochaeta coccoides DSM 17374]
gi|329748590|gb|AEC01946.1| Beta-glucosidase [Sphaerochaeta coccoides DSM 17374]
Length = 709
Score = 436 bits (1120), Expect = e-119, Method: Compositional matrix adjust.
Identities = 255/607 (42%), Positives = 352/607 (57%), Gaps = 64/607 (10%)
Query: 56 ARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATS 115
AR +VSR+TLDEKISQ+ A AIPRL IP Y WW+EALHGVA G AT
Sbjct: 14 ARRIVSRMTLDEKISQIDYRASAIPRLDIPEYNWWNEALHGVARAGI----------ATV 63
Query: 116 FPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI---------GMTFWAPNINIFRD 166
FPQ I AA FDS + RIG I E RA YN +A+ G+TFW+PN+NIFRD
Sbjct: 64 FPQAIGLAAMFDSDMMERIGAVISTEGRAKYN--EAVRHGDRDIYKGLTFWSPNVNIFRD 121
Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
PRWGRGQET GEDP +T + AV+++RG+QGD GK L+A+AC KHF + +
Sbjct: 122 PRWGRGQETYGEDPYLTARLAVAFIRGIQGD----GKY---LKAAACAKHFAVH---SGP 171
Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
R++FDARV+ +DL +TY F++ VK+ + +M AYNRVNG+P+CA LLS R
Sbjct: 172 EALRHEFDARVSQKDLHETYLSAFKAAVKEAQVEIVMGAYNRVNGVPACASHELLSDILR 231
Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQK 346
+WGF G++ SD +A+ I+ Y + LKAG ++ C + +H +++V +
Sbjct: 232 SEWGFEGHVVSDYEALEDIFKHHHYVADEAHTMAVALKAGCNL-CAGKIARHLRSSVDEG 290
Query: 347 KLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVL 406
+ E EI A+ LF+ R+ +G+ + P+ IG + +P H LA++AA VL
Sbjct: 291 LISEDEITEAVERLFTTRIMMGMMADDC---PYDSIGYEENDTPEHHQLAVEAASRSFVL 347
Query: 407 LKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV--- 463
LKN GLLPL K S+A+IGPNANS K L GNY G + R +T L+ +Q+ V ++V
Sbjct: 348 LKND-GLLPLEMEKISSIAVIGPNANSRKMLEGNYNGTASRYVTVLEGIQDLVGDSVRVW 406
Query: 464 YYPGC-------DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---------LD 507
Y GC +++ + + +AV A+ AD VVL +GLD T E EE D
Sbjct: 407 YSEGCHLYKNFHSSLSGRNDRLAEAVSAAQHADVVVLCLGLDATLEGEEGDVEVGFGSGD 466
Query: 508 RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGA 567
+ +L LPGRQQ L+ + KPVIL+L G + + + D N+ +IL YPG G
Sbjct: 467 KPNLSLPGRQQLLLDTMLTVG-KPVILLLASGSALTLGGRENDENLKAILQIWYPGAMGG 525
Query: 568 VALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPF 627
A+A+V+FG P G+LP+T+Y D M GRTYR+ +G ++PF
Sbjct: 526 KAVADVLFGRRAPAGKLPVTFYASADELPAFEDYSM--------AGRTYRYMKGNALYPF 577
Query: 628 GCGLSYS 634
G GL+YS
Sbjct: 578 GYGLTYS 584
>gi|333379783|ref|ZP_08471502.1| hypothetical protein HMPREF9456_03097 [Dysgonomonas mossii DSM
22836]
gi|332884929|gb|EGK05184.1| hypothetical protein HMPREF9456_03097 [Dysgonomonas mossii DSM
22836]
Length = 737
Score = 436 bits (1120), Expect = e-119, Method: Compositional matrix adjust.
Identities = 260/748 (34%), Positives = 392/748 (52%), Gaps = 84/748 (11%)
Query: 41 ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
+ +PF T L I +R DLVS+LTL+EK++Q++N+ PAI RL IPAY WW+E LHG+
Sbjct: 23 QNYPFQNTNLSIDERVNDLVSKLTLEEKVAQMLNNTPAIERLNIPAYNWWNECLHGIGR- 81
Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IG 153
T T FPQ I AA+++ L + AI E RA+YN + G
Sbjct: 82 --------TDYKVTVFPQAIGMAAAWNKELMKEVASAISDEGRAIYNDATSKGNREIYYG 133
Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
+T+W PNINIFRDPRWGRGQET GEDP +TG S+V G+QGD L+A+AC
Sbjct: 134 LTYWTPNINIFRDPRWGRGQETYGEDPFLTGVLGKSFVAGLQGDD------TKYLKAAAC 187
Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
KH Y + + TR+ F+ VT DL DTY P F + V + + +G+MCAYN NG P
Sbjct: 188 AKH---YAVHSGPENTRHTFNTFVTDYDLWDTYLPAFRNLVVEAKVAGVMCAYNAYNGEP 244
Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGS 333
C + L+ + R +W F GY+TSDC A+ Y + A D + G D++CG+
Sbjct: 245 CCGNNFLMQEILREKWNFTGYVTSDCGAIDDFYQHHKTHPDAKYAAADAVYNGTDIDCGN 304
Query: 334 FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQ 393
K AVK + E +ID +L LF++R RLG+F+ ++ + +I V+ S H+
Sbjct: 305 EAYKALVDAVKTGIITEKQIDISLKRLFTIRFRLGMFDPAENVK-YSQISTSVLESQKHK 363
Query: 394 VLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQ 453
LAL+ ++ IVLLKN + LPL K K +A++GPNAN+ ++LGNY G +TP +
Sbjct: 364 DLALKITRESIVLLKNENNTLPLSK-KLKKVAVVGPNANNEVSVLGNYNGFPTEIVTPYE 422
Query: 454 ALQNYVE--NTVYYPGCDTVACSSAS---IDKAVDIAKGADHVVLMMGLDQTQEKEEL-- 506
A++ ++ +Y G D V S+ S + V K D V+ + G+ E EE+
Sbjct: 423 AVKQKLKGAEVIYEKGIDFVTPSTNSKEEVSALVKRLKDVDVVIFVGGISPELEGEEMPV 482
Query: 507 --------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILW 558
DR + LP Q + + + A K P + V++ G + + +NI +I+
Sbjct: 483 KIEGFTGGDRTSIKLPKIQTDFMKALV-AEKIPTVFVMMTGSAIATEWES--QNIPAIVN 539
Query: 559 AGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRF 618
A Y G+ A+A+V+FGD+NP G+LP+T+Y +D +M+ RTYR+
Sbjct: 540 AWYGGQDAGTAIADVLFGDYNPSGKLPVTFYAKDSDLPAFNSYEMK--------NRTYRY 591
Query: 619 YEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFC 678
+ G+ ++PFG GLSY+K+ Y V ++ ++ K
Sbjct: 592 FNGEVLYPFGYGLSYTKFEYSPIQVPST---IDTGNNAK--------------------- 627
Query: 679 ETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFE 738
V++ +KN G++ G+ V L++ +P+ L GF V L A E + F
Sbjct: 628 ------VSVSIKNTGKVEGEEVVQLYISYPDTKGQKPLYALKGFNRVSLKAGESKTVEFN 681
Query: 739 LSPCESLSRAREDGLMVIEEGTHFLVVG 766
LSP E L + G++ + G + +G
Sbjct: 682 LSPRE-LGLVDDAGILKVSAGKRKIFIG 708
>gi|67523807|ref|XP_659963.1| hypothetical protein AN2359.2 [Aspergillus nidulans FGSC A4]
gi|74597492|sp|Q5BAS1.1|XYND_EMENI RecName: Full=Exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|40745314|gb|EAA64470.1| hypothetical protein AN2359.2 [Aspergillus nidulans FGSC A4]
gi|259487761|tpe|CBF86686.1| TPA: Beta-xylosidase (EC 3.2.1.37)
[Source:UniProtKB/TrEMBL;Acc:O42810] [Aspergillus
nidulans FGSC A4]
Length = 803
Score = 435 bits (1118), Expect = e-119, Method: Compositional matrix adjust.
Identities = 257/738 (34%), Positives = 393/738 (53%), Gaps = 35/738 (4%)
Query: 44 PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
P C +L RA LVS T DE ++ N+ + RLG+P Y+ W EALHGV G+
Sbjct: 59 PVCDRSLSPKDRATALVSLFTFDELVNNTGNTGLGVSRLGLPNYQVWGEALHGV---GRA 115
Query: 104 IFF-NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
F +G ATSFP I A+ + L ++IG + + RA NAG G+ ++PNIN
Sbjct: 116 NFVESGNFSWATSFPMPITMMAALNKTLIHQIGTIVSTQLRAFSNAGLG-GVDVYSPNIN 174
Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL 222
FR P WGRGQETPGED +T Y Y+ +QG G L+ A KH+ YD+
Sbjct: 175 TFRHPVWGRGQETPGEDAFLTSVYGYEYITALQG-----GVDPETLKIIATAKHYAGYDI 229
Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
++W +R D ++T Q+L++ Y PPF + + +MC+YN VNG+PSCA++ L
Sbjct: 230 ESWNNHSRLGNDMQITQQELSEYYTPPFIVASRDAKVRSVMCSYNAVNGVPSCANKFFLQ 289
Query: 283 KTARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTK 340
R + F GY++ DC AV +++ GYA + A D + AG D++CG+ Q H++
Sbjct: 290 TLLRDTFEFSEDGYVSGDCGAVYNVWNPHGYASNEAAASADSILAGTDIDCGTSYQWHSE 349
Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
A + + S+I+R + L+S ++ G F+G P+ I D V S +A +AA
Sbjct: 350 DAFEDSLVSRSDIERGVIRLYSNLVQAGYFDGEDA--PYRDITWDDVLSTDAWNIAYEAA 407
Query: 401 QDGIVLLKNSHGLLPLPKSKSV-SLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV 459
+GIVLLKN LP SK + S+A+IGP AN + L GNY GP+ I+PL ++
Sbjct: 408 VEGIVLLKNDE---TLPLSKDIKSVAVIGPWANVTEELQGNYFGPAPYLISPLTGFRDSG 464
Query: 460 ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQE 519
+ Y G + + S++ ++A+ AK AD ++ G+D T E E +DR ++ PG Q +
Sbjct: 465 LDVHYALGTNLTSHSTSGFEEALTAAKQADAIIFAGGIDNTIEAEAMDRENITWPGNQLD 524
Query: 520 LITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHN 579
LI++++E KP++++ + GG VD + K + N+ +++W GYPG++G ALA++I G
Sbjct: 525 LISKLSELG-KPLVVLQMGGGQVDSSSLKDNDNVNALIWGGYPGQSGGHALADIITGKRA 583
Query: 580 PGGRLPMTWYPQDYIKV-PMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSY 638
P GRL T YP +Y +V P DM +RP TSGNPG+TY +Y G V+ FG GL Y+ +
Sbjct: 584 PAGRLVTTQYPAEYAEVFPAIDMNLRPNETSGNPGQTYMWYTGTPVYEFGHGLFYTTFEE 643
Query: 639 KFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFL-VTIGVKNHGEMAG 697
+ + +T P G E + + L T VKN GE
Sbjct: 644 STETTDAGSFNIQTVLTT--------------PHSGYEHAQQKTLLNFTATVKNTGERES 689
Query: 698 KHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIE 757
+ L++V P K +VGF + ++ + ES++R E G V+
Sbjct: 690 DYTALVYVNTTAGPAPYPKKWVVGFDRLGGLEPGDSQTLTVPVTVESVARTDEQGNRVLY 749
Query: 758 EGTHFLVVGDEEYPISIF 775
G++ L + +E + F
Sbjct: 750 PGSYELALNNERSVVVKF 767
>gi|366163035|ref|ZP_09462790.1| glycoside hydrolase family 3 [Acetivibrio cellulolyticus CD2]
Length = 705
Score = 435 bits (1118), Expect = e-119, Method: Compositional matrix adjust.
Identities = 270/739 (36%), Positives = 391/739 (52%), Gaps = 94/739 (12%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
++A +LV+++TL+EK SQL ++PAI RLGIPAY WW+EALHGVA G A
Sbjct: 9 KKAEELVAQMTLEEKASQLTYNSPAIERLGIPAYNWWNEALHGVARAGT----------A 58
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTFWAPNINIFRD 166
T FPQ I AA FD +I AI +EARA YN G+T W+PNINIFRD
Sbjct: 59 TVFPQAIGLAAMFDDEFLMKIANAIAIEARAKYNESSKHGDRDIYKGLTIWSPNINIFRD 118
Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
PRWGRG ET GEDP ++GK V++++G+QGD K + +AC KHF AY +
Sbjct: 119 PRWGRGHETYGEDPFLSGKLGVAFIKGLQGD-------KDVMMTAACVKHFAAY---SGP 168
Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
R+ F+A VT +DL +TY P FE+CVK + +M YNR NG P C LL R
Sbjct: 169 EDLRHGFNAEVTKKDLWETYLPAFETCVKDAKVEAVMGGYNRTNGEPCCGSYTLLRDILR 228
Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQK 346
+WGF G++ SDC A+ + K+PE++V + AG D+NCG+ + A+++
Sbjct: 229 EKWGFEGHVVSDCWAIKDFHTDHMVTKTPEESVALAIDAGCDLNCGN-MYLMLLIALQEG 287
Query: 347 KLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVL 406
+ E I RA +F+ R +LGLF G+ F I +VV H+ +A++AA+ VL
Sbjct: 288 LITEEHITRAAVRIFTTRFKLGLFEGS----EFDNIPYEVVECSEHKEMAIEAARKSAVL 343
Query: 407 LKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV--- 463
LKN G+LP+ K ++ +IGPNANS L GNY G S R IT L+ +Q+ V + V
Sbjct: 344 LKND-GILPINKGAIKTIGVIGPNANSRIALKGNYHGTSSRYITLLEGIQDEVGDEVRVL 402
Query: 464 YYPGCDTV-------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL---------D 507
Y GC+ V A ++ + +AV +A+ +D VVL +GLD+T E E+ D
Sbjct: 403 YSNGCELVKDRTEVLAYANDRLAEAVTVAEHSDLVVLCLGLDETIEGEQSDEGNNGGSGD 462
Query: 508 RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGA 567
+ DL LP Q+ L+ ++ A KP +L L+ G +++++A N IL YPG G
Sbjct: 463 KKDLDLPEVQKSLLEKIV-ATGKPTVLCLMAGSAINLSYAHEHCN--GILLTWYPGARGG 519
Query: 568 VALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPF 627
A+A+++FG+ +P G+LP+T+Y P+TD M+ RTYR+ E ++PF
Sbjct: 520 KAVADILFGNASPSGKLPVTFYRSLDNLPPITDYSMK--------NRTYRYIEEAPLYPF 571
Query: 628 GCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTI 687
G GL+Y V K V GT E + +T+
Sbjct: 572 GYGLTYGD-----------------------------VELKHVEIKGTVEIE-KDIYITV 601
Query: 688 GVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSR 747
++N G +A + V ++K + L F V L A E+ ++ + P +SL
Sbjct: 602 TLQNRGSVAVEEVVQAYIKDEQSMYAVTNTSLCAFMRVGLGANEEKQVSMRI-PFDSLKV 660
Query: 748 AREDGLMVIEEGTHFLVVG 766
DG V++ L G
Sbjct: 661 VNLDGEKVLDSKKFTLFAG 679
>gi|410098444|ref|ZP_11293422.1| hypothetical protein HMPREF1076_02600 [Parabacteroides goldsteinii
CL02T12C30]
gi|409222318|gb|EKN15263.1| hypothetical protein HMPREF1076_02600 [Parabacteroides goldsteinii
CL02T12C30]
Length = 738
Score = 434 bits (1117), Expect = e-119, Method: Compositional matrix adjust.
Identities = 272/754 (36%), Positives = 390/754 (51%), Gaps = 89/754 (11%)
Query: 40 TETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAG 99
T +PF LP+ R +D++SRLTL+EK+ + ++APA+PRLGIPAY WW+EALHGVA
Sbjct: 23 TYDYPFRNPDLPLDVRVQDIISRLTLEEKVQLMKHAAPAVPRLGIPAYNWWNEALHGVAR 82
Query: 100 VGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN----AGQA---- 151
+ + T FPQ I AA+FD+ ++G E RAL+N AG+
Sbjct: 83 TKEKV---------TVFPQAIGMAATFDTEALQKMGDMTSSEGRALFNEDLKAGKTGEIY 133
Query: 152 IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQAS 211
G+T+W PNINIFRDPRWGRGQET GEDP +T K + V G++G+ L++
Sbjct: 134 RGLTYWTPNINIFRDPRWGRGQETYGEDPYLTAKMGSAIVHGLEGNN------PEYLKSV 187
Query: 212 ACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
AC KH+ + + R+ +DARV+M DL DTY P F V + + G+MCAYNR G
Sbjct: 188 ACAKHYAVH---SGPEHNRHSYDARVSMYDLWDTYLPAFRELVTKAKVHGVMCAYNRFEG 244
Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC 331
P C LL R QW F GY+TSDC AVS + +AV D + G D+ C
Sbjct: 245 TPCCGHNELLQDILRNQWKFDGYVTSDCWAVSDFAKYHKTHSNDTEAVADAVLNGTDLEC 304
Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
G+ QK + V++ + E +I+ +L LF ++ +LG+++ + P+ IG +V+ A
Sbjct: 305 GNLYQK-LQQGVEKGLISEKDINVSLARLFEIQFKLGMYDPADRV-PYASIGREVIECDA 362
Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITP 451
H+ A + AQ +VLLKN+ +LPL SK +ALIGPN ++ TLL NY G ITP
Sbjct: 363 HKKHAYEMAQKSMVLLKNNKNILPLNASKIKRIALIGPNMDNGSTLLANYFGTPSEIITP 422
Query: 452 LQALQNYVENTVYYPGCDTVAC-----SSASIDKAVDIAKGADHVVLMMGLDQTQE---- 502
++LQ N++ V + S + AK AD ++ + G+ E
Sbjct: 423 YKSLQKRFGNSIQIDTLTGVGIVQKLEGAPSFAQVAAQAKKADIIIFVGGISADYEGEAG 482
Query: 503 ---------KEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNI 553
DR + LP Q EL+ + + +P+ILV + G ++F RN
Sbjct: 483 DAGAAGYGGFASGDRTTMKLPPVQTELMKELKKTG-RPLILVNMSGSV--MSFDWESRNA 539
Query: 554 GSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPG 613
+IL A Y G+A A+ +V+FGD+NP GR+P+T Y D D M
Sbjct: 540 DAILQAWYGGQAAGDAITDVLFGDYNPAGRMPLTTYMNDEDLPDFEDYSM--------AN 591
Query: 614 RTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPEL 673
RTYR+++G +PFG GLSY+ + Y L +S+ K E+
Sbjct: 592 RTYRYFKGDVRYPFGYGLSYTTFGYA---------PLQNASTVKTGES------------ 630
Query: 674 GTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGR-PIKQLVGFQSVILNAKEK 732
VT V N G+ AG V L++ + GN R P++ L GF+ + L+ E
Sbjct: 631 ---------IQVTTTVTNTGKRAGDEVVQLYISHPQNGNTRVPLRALKGFKRIHLDTGES 681
Query: 733 AEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
++ F LSP E LS E G V +EGT L +G
Sbjct: 682 RQVTFTLSP-EELSLVDEKGNQVEKEGTVELYIG 714
>gi|242771939|ref|XP_002477942.1| beta-xylosidase XylA [Talaromyces stipitatus ATCC 10500]
gi|218721561|gb|EED20979.1| beta-xylosidase XylA [Talaromyces stipitatus ATCC 10500]
Length = 797
Score = 434 bits (1115), Expect = e-118, Method: Compositional matrix adjust.
Identities = 265/729 (36%), Positives = 405/729 (55%), Gaps = 38/729 (5%)
Query: 46 CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGV--AGVGKG 103
C T++ +RA L+S TL+E I+ NSAP +PRLG+P Y+ WSE LHG+ A K
Sbjct: 64 CNTSVNYVERAEGLISLFTLEELINNTQNSAPGVPRLGLPPYQVWSEGLHGLDRANWAKS 123
Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
+ ATSFP IL+ A+ + L +I I +ARA N G+ G+ +APNIN
Sbjct: 124 ---GEEWKWATSFPMPILSMAALNRTLINQIASIIATQARAFNNVGR-YGLDAYAPNING 179
Query: 164 FRDPRWGRGQETPGEDP-LVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL 222
FR P WGRGQETPGED ++ YA Y+ G+QG G L+ A KHF YDL
Sbjct: 180 FRSPLWGRGQETPGEDAGFLSSSYAYEYITGLQG-----GVDPEHLKIVATAKHFAGYDL 234
Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
+NW +R FDA +T QDLA+ Y P F + + +A MC+YN VNG+PSC+ LL
Sbjct: 235 ENWNNNSRLGFDASITQQDLAEYYTPQFLAASRYAKARSFMCSYNSVNGVPSCSSSFLLQ 294
Query: 283 KTARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTK 340
R W F +GY++SDCDA +++ GYA + A D L+AG D++CG +
Sbjct: 295 TLLRENWDFPDYGYVSSDCDAAYNVFNPHGYAINISAAAADSLRAGTDIDCGQTYPWYLN 354
Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
+ + + EI+R+L L+S ++LG F+GN + ++G + V + ++ +AA
Sbjct: 355 QSFIEGSVTRGEIERSLIRLYSNLVKLGYFDGN--QSEYRQLGWNDVVATDAWNISYEAA 412
Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE 460
+GIVLLKN G+LPL + K S+A+IGP AN+ + L GNY GP+ ITPLQA ++
Sbjct: 413 VEGIVLLKN-DGVLPLSE-KLKSVAVIGPWANATQQLQGNYFGPAPYLITPLQAARDAGY 470
Query: 461 NTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQEL 520
Y G + + ++ A+ AK +D ++ + G+D T E E DR+++ PG Q +L
Sbjct: 471 KVNYAFGTNILGNTTDGFAAALSAAKKSDVIIYLGGIDNTIEAEGTDRMNVTWPGNQLDL 530
Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
I ++++ KP++++ + GG VD + K + N+ +++W GYPG++G A+ +++ G P
Sbjct: 531 IQQLSQTG-KPLVVLQMGGGQVDSSSLKSNNNVNALVWGGYPGQSGGKAIFDILSGKRAP 589
Query: 581 GGRLPMTWYPQDY-IKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYK 639
GRL T YP +Y + P TDM +RP S NPG+TY +Y GK V+ FG L Y+
Sbjct: 590 AGRLVTTQYPAEYATQFPATDMNLRPDGKS-NPGQTYIWYTGKPVYEFGYALFYT----T 644
Query: 640 FKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFL-VTIGVKNHGEMAGK 698
FK ++ KL + + ++ + Y + E F+ VT +KN G+ A
Sbjct: 645 FKETAE-KLASSSFDISDIIASPRSSSYA--------YSELVPFVNVTATIKNTGKTASP 695
Query: 699 HPVLLFVKPARRG-NGRPIKQLVGFQSV-ILNAKEKAEIVFELSPCESLSRAREDGLMVI 756
+ +LF G P K LVG+ + + + E+V + P ++SR E+G ++
Sbjct: 696 YTAMLFANTTNAGPTPYPNKWLVGYDRLPSIEPGKSTELVIPV-PIGAISRVDENGNRIV 754
Query: 757 EEGTHFLVV 765
G + L +
Sbjct: 755 YPGDYQLAL 763
>gi|145230215|ref|XP_001389416.1| exo-1,4-beta-xylosidase xlnD [Aspergillus niger CBS 513.88]
gi|74626559|sp|O00089.2|XYND_ASPNG RecName: Full=Exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|292495287|sp|A2QA27.1|XYND_ASPNC RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|2181180|emb|CAB06417.1| xylosidase [Aspergillus niger]
gi|134055533|emb|CAK37179.1| xylosidase xlnD-Aspergillus niger
gi|350638468|gb|EHA26824.1| hypothetical protein ASPNIDRAFT_205670 [Aspergillus niger ATCC
1015]
Length = 804
Score = 433 bits (1114), Expect = e-118, Method: Compositional matrix adjust.
Identities = 266/739 (35%), Positives = 393/739 (53%), Gaps = 40/739 (5%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
C T RA L+S TLDE I+ N+ + RLG+PAY+ WSEALHG+ +
Sbjct: 68 ICDETATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHGL---DRAN 124
Query: 105 FFN-GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
F + G ATSFPQ ILT A+ + L ++I I + RA NAG+ G+ +APNIN
Sbjct: 125 FSDSGAYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGR-YGLDVYAPNINT 183
Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
FR P WGRGQETPGED + YA Y+ G+QG + L+ +A KH+ YD++
Sbjct: 184 FRHPVWGRGQETPGEDVSLAAVYAYEYITGIQGP-----DPESNLKLAATAKHYAGYDIE 238
Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
NW +R D +T QDL++ Y P F + + +MCAYN VNG+P+CAD L
Sbjct: 239 NWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVQSVMCAYNAVNGVPACADSYFLQT 298
Query: 284 TARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA 341
R +GF HGY++SDCDA IY+ GYA S A + + AG D++CG+ Q H
Sbjct: 299 LLRDTFGFVDHGYVSSDCDAAYNIYNPHGYASSQAAAAAEAILAGTDIDCGTTYQWHLNE 358
Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTM--QPFGKIGADVVCSPAHQVLALQA 399
++ L +I++ + L++ ++ G F+ N T P+ + V ++ QA
Sbjct: 359 SIAAGDLSRDDIEQGVIRLYTTLVQAGYFDSNTTKANNPYRDLSWSDVLETDAWNISYQA 418
Query: 400 AQDGIVLLKNSHGLLPLPKS----KSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL 455
A GIVLLKNS+ +LPL + + ++ALIGP AN+ LLGNY G + I+P A
Sbjct: 419 ATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYMISPRAAF 478
Query: 456 QNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPG 515
+ + G + S++ A+ A+ AD ++ G+D T E E LDR + PG
Sbjct: 479 EEAGYKVNFAEGTGISSTSTSGFAAALSAAQSADVIIYAGGIDNTLEAEALDRESIAWPG 538
Query: 516 RQQELITRVAEAA-KKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVI 574
Q +LI ++A AA KKP+I++ + GG VD + K + N+ ++LW GYPG++G AL ++I
Sbjct: 539 NQLDLIQKLASAAGKKPLIVLQMGGGQVDSSSLKNNTNVSALLWGGYPGQSGGFALRDII 598
Query: 575 FGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSY 633
G NP GRL T YP Y + P TDM +RP+ NPG+TY++Y G+ V+ FG GL Y
Sbjct: 599 TGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGD--NPGQTYKWYTGEAVYEFGHGLFY 656
Query: 634 SKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHG 693
+ ++ + ++ LN ++D+ +P L T ++N G
Sbjct: 657 TTFAESSSNTTTKEVKLNIQDILSQT-HEDLASITQLPVLN----------FTANIRNTG 705
Query: 694 EMAGKHPVLLFVKPARRGNG-RPIKQLVGFQSVILNAKEKAEIVFELS-PCE--SLSRAR 749
++ + ++F + G P K LVG+ + + K EL P E S +R
Sbjct: 706 KLESDYTAMVFANTSDAGPAPYPKKWLVGWDRL---GEVKVGETRELRVPVEVGSFARVN 762
Query: 750 EDGLMVIEEGTHFLVVGDE 768
EDG V+ GT L + E
Sbjct: 763 EDGDWVVFPGTFELALNLE 781
>gi|290889355|gb|ADD69953.1| xylosidase HistTag [synthetic construct]
Length = 810
Score = 433 bits (1114), Expect = e-118, Method: Compositional matrix adjust.
Identities = 266/739 (35%), Positives = 393/739 (53%), Gaps = 40/739 (5%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
C T RA L+S TLDE I+ N+ + RLG+PAY+ WSEALHG+ +
Sbjct: 68 ICDETATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHGL---DRAN 124
Query: 105 FFN-GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
F + G ATSFPQ ILT A+ + L ++I I + RA NAG+ G+ +APNIN
Sbjct: 125 FSDSGAYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGR-YGLDVYAPNINT 183
Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
FR P WGRGQETPGED + YA Y+ G+QG + L+ +A KH+ YD++
Sbjct: 184 FRHPVWGRGQETPGEDVSLAAVYAYEYITGIQGP-----DPESNLKLAATAKHYAGYDIE 238
Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
NW +R D +T QDL++ Y P F + + +MCAYN VNG+P+CAD L
Sbjct: 239 NWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVQSVMCAYNAVNGVPACADSYFLQT 298
Query: 284 TARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA 341
R +GF HGY++SDCDA IY+ GYA S A + + AG D++CG+ Q H
Sbjct: 299 LLRDTFGFVDHGYVSSDCDAAYNIYNPHGYASSQAAAAAEAILAGTDIDCGTTYQWHLNE 358
Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTM--QPFGKIGADVVCSPAHQVLALQA 399
++ L +I++ + L++ ++ G F+ N T P+ + V ++ QA
Sbjct: 359 SIAAGDLSRDDIEQGVIRLYTTLVQAGYFDSNTTKANNPYRDLSWSDVLETDAWNISYQA 418
Query: 400 AQDGIVLLKNSHGLLPLPKS----KSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL 455
A GIVLLKNS+ +LPL + + ++ALIGP AN+ LLGNY G + I+P A
Sbjct: 419 ATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYMISPRAAF 478
Query: 456 QNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPG 515
+ + G + S++ A+ A+ AD ++ G+D T E E LDR + PG
Sbjct: 479 EEAGYKVNFAEGTGISSTSTSGFAAALSAAQSADVIIYAGGIDNTLEAEALDRESIAWPG 538
Query: 516 RQQELITRVAEAA-KKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVI 574
Q +LI ++A AA KKP+I++ + GG VD + K + N+ ++LW GYPG++G AL ++I
Sbjct: 539 NQLDLIQKLASAAGKKPLIVLQMGGGQVDSSSLKNNTNVSALLWGGYPGQSGGFALRDII 598
Query: 575 FGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSY 633
G NP GRL T YP Y + P TDM +RP+ NPG+TY++Y G+ V+ FG GL Y
Sbjct: 599 TGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGD--NPGQTYKWYTGEAVYEFGHGLFY 656
Query: 634 SKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHG 693
+ ++ + ++ LN ++D+ +P L T ++N G
Sbjct: 657 TTFAESSSNTTTKEVKLNIQDILSQT-HEDLASITQLPVLN----------FTANIRNTG 705
Query: 694 EMAGKHPVLLFVKPARRGNG-RPIKQLVGFQSVILNAKEKAEIVFELS-PCE--SLSRAR 749
++ + ++F + G P K LVG+ + + K EL P E S +R
Sbjct: 706 KLESDYTAMVFANTSDAGPAPYPKKWLVGWDRL---GEVKVGETRELRVPVEVGSFARVN 762
Query: 750 EDGLMVIEEGTHFLVVGDE 768
EDG V+ GT L + E
Sbjct: 763 EDGDWVVFPGTFELALNLE 781
>gi|288870210|ref|ZP_06113312.2| beta-glucosidase [Clostridium hathewayi DSM 13479]
gi|288868024|gb|EFD00323.1| beta-glucosidase [Clostridium hathewayi DSM 13479]
Length = 730
Score = 433 bits (1114), Expect = e-118, Method: Compositional matrix adjust.
Identities = 270/739 (36%), Positives = 387/739 (52%), Gaps = 90/739 (12%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
++A LV ++TL+EK+ Q +N APAI RLGI AY WW+E LHGVA G A
Sbjct: 23 EKAEYLVKQMTLEEKVFQTMNQAPAIERLGIKAYNWWNEGLHGVARAGV----------A 72
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------MTFWAPNINIFRD 166
T FPQ I AA+FD L +G+A+ EARA Y+ Q G +T WAPNINIFRD
Sbjct: 73 TIFPQAIGLAATFDEDLIETVGEAVSTEARAKYHMQQRYGDTDIYKGLTLWAPNINIFRD 132
Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
PRWGRG ET GEDP +T + + Y+RG+QG + L+ +AC KHF + +
Sbjct: 133 PRWGRGHETYGEDPWLTSRLGIRYIRGLQGSH------EKYLKTAACVKHFAVH---SGP 183
Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
R+ FDA V+ +DL +TY P FE+CVK G +M AYNRVNG+P C + LL R
Sbjct: 184 EELRHSFDAEVSEKDLRETYLPAFEACVKDGDVEAVMGAYNRVNGVPCCGNEYLLETILR 243
Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQK 346
++WGFHG++ SDC A+ ++ G SP ++V + G D+NCG+ + AVK+
Sbjct: 244 KEWGFHGHVVSDCWAIKDFHEGHGVTDSPVESVSMAMNHGCDLNCGNLFT-YLIQAVKEG 302
Query: 347 KLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVL 406
K+ E +D A+ LF+ R++LG P+ I V SPA + L AA +VL
Sbjct: 303 KVKEERLDEAVIRLFTTRLKLGALGKMEEDDPYAGISYLEVDSPAMKKLNRSAAGKSVVL 362
Query: 407 LKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE---NTV 463
LKN+ GLLP+ + ++ +IGPNA+S + L+GNY G + +T L+ ++ E +
Sbjct: 363 LKNTEGLLPIDTKRYKTIGVIGPNADSRRALVGNYEGTASEYVTVLEGIREAAEPEARVL 422
Query: 464 YYPGCDTVACSSASIDKAVD-------IAKGADHVVLMMGLDQTQEKEE---------LD 507
Y GC + + + D I + +D V+ MGLD T E E+ D
Sbjct: 423 YSEGCHLYKSNVSGLGARNDRLSEVKGICRESDIVIACMGLDSTLEGEQGDTGNIYAGGD 482
Query: 508 RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGA 567
+ DL+LPG QQ+++ A + KPV+LVLL G + +T+A D ++ +IL A YPG G
Sbjct: 483 KPDLMLPGLQQKIL-ETAYDSGKPVVLVLLAGSAMAVTWA--DEHLPAILTAWYPGAEGG 539
Query: 568 VALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPF 627
+A+V+FG NP GRLP+T+Y T+ M GRTYRF + K ++PF
Sbjct: 540 RGVADVLFGTVNPEGRLPVTFYRTTEELPDFTNYSME--------GRTYRFMKQKALYPF 591
Query: 628 GCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTI 687
G GLSY+++S VS+ N G E V +
Sbjct: 592 GFGLSYTEFSCSGLEVSERDSVDN----------------------GVE--------VKL 621
Query: 688 GVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSR 747
V N GE G+ + ++V + P QL V L E+ I L P ++ +
Sbjct: 622 CVANCGERWGRETIQVYVGRKEDHDRNP--QLKAAVKVGLEPGEEKTISIHL-PADAFAV 678
Query: 748 AREDGLMVIEEGTHFLVVG 766
E+G I+ T+ + G
Sbjct: 679 YDENGKRYIDACTYQIFAG 697
>gi|436410475|gb|AGB57183.1| beta-xylosidase [Aspergillus sp. BCC125]
Length = 804
Score = 433 bits (1114), Expect = e-118, Method: Compositional matrix adjust.
Identities = 261/734 (35%), Positives = 391/734 (53%), Gaps = 40/734 (5%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
C + RA L+S TLDE I+ N+ + RLG+PAY+ WSEALHG+ +
Sbjct: 68 ICDESATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHGL---DRAN 124
Query: 105 FFN-GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
F + G+ ATSFPQ ILT A+ + L ++I I + RA NAG+ G+ +APNIN
Sbjct: 125 FSDLGSYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGR-YGLDVYAPNINT 183
Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
FR P WGRGQETPGED + YA Y+ G+QG L+ +A KH+ YD++
Sbjct: 184 FRHPVWGRGQETPGEDVSLAAIYAYEYITGIQGP-----DPDSNLKLAATAKHYAGYDIE 238
Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
NW +R D +T QDL++ Y P F + + +MCAYN VNG+P+CAD L
Sbjct: 239 NWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVHSVMCAYNAVNGVPACADSYFLQT 298
Query: 284 TARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA 341
R +GF HGY++SDCDA IY+ GYA S A + + AG D++CG+ Q H
Sbjct: 299 LLRDTFGFVDHGYVSSDCDAAYNIYNPHGYASSQAAAAAEAILAGTDIDCGTTYQWHLNE 358
Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTM--QPFGKIGADVVCSPAHQVLALQA 399
++ L +I++ + L++ ++ G F+ N T P+ + V ++ QA
Sbjct: 359 SITAGDLSRDDIEKGVIRLYTTLVQAGYFDSNTTKANNPYRDLTWSDVLETDAWNISYQA 418
Query: 400 AQDGIVLLKNSHGLLPLPKS----KSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL 455
A GIVLLKNS+ +LPL + + ++ALIGP AN+ LLGNY G + I+P A
Sbjct: 419 ATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYMISPRAAF 478
Query: 456 QNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPG 515
+ N + G + S++ A+ A+ AD ++ G+D T E E LDR + PG
Sbjct: 479 EEAGYNVNFAEGTGISSTSTSGFAAALSAAQSADVIIYAGGIDNTLEAEALDRESIAWPG 538
Query: 516 RQQELITRVAEAA-KKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVI 574
Q +LI ++A +A KP+I++ + GG VD + K + N+ ++LW GYPG++G AL ++I
Sbjct: 539 NQLDLIQKLASSAGSKPLIVLQMGGGQVDSSSLKNNTNVSALLWGGYPGQSGGFALRDII 598
Query: 575 FGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSY 633
G NP GRL T YP Y + P TDM +RP+ NPG+TY++Y G+ V+ FG GL Y
Sbjct: 599 TGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGD--NPGQTYKWYTGEAVYEFGHGLFY 656
Query: 634 SKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHG 693
+ ++ + ++ LN ++D+ +P L T +KN G
Sbjct: 657 TTFAESSSNTTTREIKLNIQDILSQT-HEDLASITQLPVLN----------FTANIKNTG 705
Query: 694 EMAGKHPVLLFVKPARRGNG-RPIKQLVGFQ---SVILNAKEKAEIVFELSPCESLSRAR 749
++ + ++F + G P+K LVG+ V + + + E+ S +R
Sbjct: 706 KVESDYTAMVFANTSDAGPAPYPVKWLVGWDRLGDVKVGETRELRVPIEVG---SFARVN 762
Query: 750 EDGLMVIEEGTHFL 763
EDG V+ GT L
Sbjct: 763 EDGDWVLFPGTFEL 776
>gi|242786966|ref|XP_002480909.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
gi|218721056|gb|EED20475.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
Length = 757
Score = 433 bits (1113), Expect = e-118, Method: Compositional matrix adjust.
Identities = 272/728 (37%), Positives = 392/728 (53%), Gaps = 51/728 (7%)
Query: 53 SQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNG---T 109
+QR + L+ LTL+EKI LV+++ RLG+P+YEWW+EA HGV G G+ F
Sbjct: 23 TQRVKSLIDSLTLEEKILNLVDASAGSERLGLPSYEWWNEATHGV-GSAPGVQFTEKPVN 81
Query: 110 IRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRW 169
ATSFP ILTAASFD L I IG E RA N G + G FWAPNIN FRDPRW
Sbjct: 82 FSYATSFPAPILTAASFDDALVREIASVIGREGRAFGNNGFS-GFDFWAPNINPFRDPRW 140
Query: 170 GRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTT 229
GRGQETPGED V Y +++ G+QGD Q A CKH+ AYDL+ T
Sbjct: 141 GRGQETPGEDSFVVQSYIRNFIPGLQGDD------PEDKQVIATCKHYAAYDLE----TG 190
Query: 230 RYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQW 289
RY D T QDLAD + PF++CV+ IMCAYN V+GIP+CA LL + R+ W
Sbjct: 191 RYGNDYNPTQQDLADYFLAPFKTCVRDTGVGSIMCAYNAVDGIPTCASEYLLDQVLRKHW 250
Query: 290 GF---HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTKAAVKQ 345
F + Y+ SDC AV+ I+ + + E A L AG+D+ CG S+L+ + A Q
Sbjct: 251 NFTADYNYVVSDCGAVTDIWQYHNFTDTEEAAASVSLNAGVDLECGSSYLKLNESLAANQ 310
Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
+ +D+AL L+S +G F+G + +G V +P Q LA +AA +G+
Sbjct: 311 TTV--QALDQALTRLYSALFTVGFFDGG----KYTALGFADVSTPEAQSLAYEAAVEGMT 364
Query: 406 LLKNSHGLLPLPKS-KSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVY 464
LLKN LLP+ S K S+ALIGP AN+ + G+Y+G I+PL+A + + Y
Sbjct: 365 LLKNDKRLLPIRSSHKYKSVALIGPFANATTQMQGDYSGIPPFLISPLEAFKGHDWEVNY 424
Query: 465 YPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRV 524
G ++ A+ A+ +D V+ + G+D + E E LDR L PG Q +L+T++
Sbjct: 425 AMGTGINNQTTTGFASALAAAEKSDLVIYLGGIDNSIEAETLDRTSLTWPGNQLDLVTQL 484
Query: 525 AEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRL 584
++ KP+I+V GG +D + + + +++WAGYP ++G AL +V+ G + GRL
Sbjct: 485 SK-LHKPLIVVQFGGGQLDDSALLQNEGVQALVWAGYPSQSGGSALLDVLLGKRSIAGRL 543
Query: 585 PMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAV 643
P+T YP Y +V + D+ +RP + PGRTY++Y G V PFG GL Y+K+ +++
Sbjct: 544 PVTQYPASYADQVSIFDINIRPNDSY--PGRTYKWYTGMPVVPFGYGLHYTKFEFEWAQT 601
Query: 644 SQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLL 703
++ + Q + S G T V VKN G A + LL
Sbjct: 602 LNHEYNIQQLVA-------------SCQSTGPISDNTPFTTVKAHVKNIGPEASDYVGLL 648
Query: 704 FVKPARRGNG-RPIKQLVGF---QSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEG 759
F+ G RP K LV + ++ ++ ++ L S++RA E+G +VI G
Sbjct: 649 FLSSPDAGPAPRPNKSLVSYLRLHNITSGSQGTLDLPLTLG---SMARADENGNLVIFPG 705
Query: 760 THFLVVGD 767
H+ + D
Sbjct: 706 -HYKIALD 712
>gi|310795958|gb|EFQ31419.1| glycosyl hydrolase family 3 N terminal domain-containing protein
[Glomerella graminicola M1.001]
Length = 824
Score = 432 bits (1112), Expect = e-118, Method: Compositional matrix adjust.
Identities = 283/810 (34%), Positives = 413/810 (50%), Gaps = 73/810 (9%)
Query: 13 LCLCFTSLLT--RVDSTQPPF---SCDPS------NPSTETFP-----------FCKTTL 50
L C ++L+ DS Q P +C P+ N + +FP C TL
Sbjct: 10 LLFCGSALVAGFDADSHQKPLVSRTCLPAVEHAANNNAVNSFPDCSRDPLCSNDVCDETL 69
Query: 51 PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV-GKGIFFNGT 109
+RA LV+ LT+ EK+ LVN AP +PRL IP YEWWSE LHGVA G +G
Sbjct: 70 SPKERAAALVAELTIWEKLDNLVNEAPGVPRLAIPPYEWWSEGLHGVASSPGTKFAKSGN 129
Query: 110 IRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFW------------ 157
ATSFPQ I+ ++FD L IG+ + EARA N G++ G+ +
Sbjct: 130 FSYATSFPQPIVLGSAFDDDLVKAIGEVVSKEARAFSNRGRS-GLDLYVSSISRHIEPEV 188
Query: 158 ---------APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKL 208
+PNIN F+DPRWGRGQETPGEDP Y + + G++G G KL
Sbjct: 189 RDDMLTEPESPNINAFKDPRWGRGQETPGEDPFHLQNYVAAMLTGLEG-----GDPSKKL 243
Query: 209 QASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
A+ CKH+ A D +N+KG R FDA +T QDL++ Y PPF++C + MC+YN
Sbjct: 244 IAT--CKHYAANDFENYKGVDRAGFDANITTQDLSEYYLPPFKTCAVDKKVGSFMCSYNA 301
Query: 269 VNGIPSCADRNLLSKTARRQWGFHG---YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKA 325
+NG P CA+ LL R+ WG++G Y+++DCD V+++ YA A +KA
Sbjct: 302 INGEPLCANPYLLEDILRQHWGWNGDGQYVSTDCDCVALMVSHHHYAPDLGHAAAWAMKA 361
Query: 326 GMDVNCGSFL-QKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGA 384
G D+ C +F + + A Q + E E+D++L +++ + +G F+ + QP +
Sbjct: 362 GTDLECNAFPGSEALQLAWNQSLISEKEVDKSLTRMYTALVSVGQFD-SARGQPLRSLSW 420
Query: 385 DVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKS-KSVSLALIGPNANSAKTLLGNYAG 443
D V + Q LA QA +G VLLKN G+LPL + + ALIGP N+ + GNY G
Sbjct: 421 DDVNTKEAQKLAYQAVIEGAVLLKND-GILPLSAAWREKKYALIGPWINATTQMQGNYFG 479
Query: 444 PSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEK 503
P+ I+ QA + + + Y G + + S +A+D A A +V G+D T E
Sbjct: 480 PAPYLISLYQAAKEFGLDFTYSLGS-RINSTDDSFKQALDSAHAAALIVFAGGVDNTLEA 538
Query: 504 EELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPG 563
E DR L P Q +L+ R A KPVI++ GG VD T + +I ++LW GYPG
Sbjct: 539 ETRDRKTLAWPESQLDLL-RAVSALGKPVIVLQFGGGQVDDTELLANHSINALLWGGYPG 597
Query: 564 EAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK-VPMTDMKMRPQATSGNPGRTYRFYEGK 622
++G A+ +++FG P GRL +T YP Y + VP TDM +RP + GRTY +Y G
Sbjct: 598 QSGGKAVIDLLFGRAAPAGRLSVTQYPASYNEDVPSTDMNLRPGPGNSGLGRTYMWYNGD 657
Query: 623 EVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRK 682
V P+G GL Y+ + K KA + L + S+ + + Y S + +
Sbjct: 658 AVVPYGFGLHYTTFDAKLKARQASALIKTEEVSSLLSND-----YVSGTLVWQQILTKPV 712
Query: 683 FLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVG---FQSVILNAKEKAEIVFEL 739
V I V N G +A + LLF++ +P K L G F+++ + + E+ +
Sbjct: 713 VSVLITVSNTGNVASDYVALLFLRSNAGPTPQPTKTLAGYHRFRNIQPGDRSEREVSITI 772
Query: 740 SPCESLSRAREDGLMVIEEGTHFLVVGDEE 769
E L R E G V+ G++ L V +E
Sbjct: 773 ---ERLVRVDELGNRVLHPGSYELFVDLDE 799
>gi|292495634|sp|A1CND4.2|XYND_ASPCL RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
Length = 792
Score = 432 bits (1110), Expect = e-118, Method: Compositional matrix adjust.
Identities = 257/743 (34%), Positives = 400/743 (53%), Gaps = 38/743 (5%)
Query: 33 CDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSE 92
CD + P ++T C T RA L+S TL+E ++ N++P +PRLG+P Y+ W+E
Sbjct: 52 CD-NGPLSKTI-VCDTLTSPYDRAAALISLFTLEELVNATGNTSPGVPRLGLPPYQVWNE 109
Query: 93 ALHGVAGVGKGIFFN-GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA 151
ALHG+ + F + G +TSFP ILT ++ + L ++ I + RA NAG+
Sbjct: 110 ALHGL---DRAYFTDEGQFSWSTSFPMPILTMSALNRTLINQVASIISTQGRAFSNAGR- 165
Query: 152 IGMTFWAPNINIFRDPRWGRGQETPGEDPL-VTGKYAVSYVRGVQGDTFNGGKLKGKLQA 210
G+ ++PNIN FR P WGRGQETPGED ++ YA Y+ G+QG G L+
Sbjct: 166 YGLDVYSPNINSFRHPVWGRGQETPGEDAYCLSSAYAYEYITGIQG-----GVDPKSLKL 220
Query: 211 SACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVN 270
A KH+ YD++NW G +R D +T QDL++ Y P F + + +MC+YN VN
Sbjct: 221 VATAKHYAGYDIENWDGHSRLGNDMNITQQDLSEYYTPQFLVAARDAKVRSVMCSYNAVN 280
Query: 271 GIPSCADRNLLSKTARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
G+PSCA+ L R +GF GYI+SDCD+ +++ YA + A D ++AG D
Sbjct: 281 GVPSCANSFFLQTLLRDTFGFVEDGYISSDCDSAYNVFNPHEYAANVSSAAADSIRAGTD 340
Query: 329 VNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVC 388
++CG+ Q + AV Q L ++I+R + L+S MRLG F+GN + DVV
Sbjct: 341 IDCGTTYQYYFDEAVDQNLLSRADIERGVIRLYSNLMRLGYFDGNSSAYR-NLTWNDVVT 399
Query: 389 SPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS 448
+ + + +G VLLKN G LPL +S S+AL+GP N + L GNY GP+
Sbjct: 400 TNSWNI---SYEVEGTVLLKND-GTLPLSESIR-SIALVGPWMNVSTQLQGNYFGPAPYL 454
Query: 449 ITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDR 508
I+PL A ++ + Y G + + S+ KA+ AK +D ++ G+D + E E LDR
Sbjct: 455 ISPLDAFRDSHLDVNYAFGTNISSNSTDGFSKALSAAKKSDAIIFAGGIDNSLEAETLDR 514
Query: 509 VDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAV 568
+++ PG+Q ELI ++++ KP+I++ + GG VD + K ++N+ S++W GYPG++G
Sbjct: 515 MNITWPGKQLELIDQLSQLG-KPLIVLQMGGGQVDSSLLKSNKNVNSLIWGGYPGQSGGQ 573
Query: 569 ALAEVIFGDHNPGGRLPMTWYPQDY-IKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPF 627
AL ++I G P GRL +T YP +Y + P TDM +RP NPG+TY +Y G V+ F
Sbjct: 574 ALLDIITGKRAPAGRLVVTQYPAEYATQFPATDMSLRPHGN--NPGQTYMWYTGTPVYEF 631
Query: 628 GCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFL-VT 686
G GL Y+ + ++ + + K+ ++ + P G E FL T
Sbjct: 632 GHGLFYTTFR------------VSHARAVKIKPTYNIQDLLAQPHPGYIHVEQMPFLNFT 679
Query: 687 IGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLS 746
+ + N G+ + + +LF P K LVGF + ++++ S++
Sbjct: 680 VDITNTGKASSDYTAMLFANTTAGPAPYPKKWLVGFDRLPTLGPSTSKLMTIPVTINSMA 739
Query: 747 RAREDGLMVIEEGTHFLVVGDEE 769
R E G V+ G + L + +E
Sbjct: 740 RTDELGNRVLYPGKYELALNNER 762
>gi|238483831|ref|XP_002373154.1| beta-xylosidase XylA [Aspergillus flavus NRRL3357]
gi|292495283|sp|B8MYV0.1|XYND_ASPFN RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|220701204|gb|EED57542.1| beta-xylosidase XylA [Aspergillus flavus NRRL3357]
Length = 797
Score = 431 bits (1107), Expect = e-118, Method: Compositional matrix adjust.
Identities = 263/742 (35%), Positives = 394/742 (53%), Gaps = 36/742 (4%)
Query: 36 SNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALH 95
S P ++T C T+ RA LVS LT +E ++ N+ PR+G+PAY+ W+EALH
Sbjct: 54 SGPLSKTL-VCDTSAKPHDRAAALVSLLTFEELVNNTANTGHGAPRIGLPAYQVWNEALH 112
Query: 96 GVAGVGKGIFFN-GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGM 154
GVA F + G +TSFPQ I T A+ + L ++I I + RA NAG+ G+
Sbjct: 113 GVA---HADFSDAGDFSWSTSFPQPISTMAALNRTLIHQIATIISTQGRAFMNAGR-YGL 168
Query: 155 TFWAPNINIFRDPRWGRGQETPGEDPL-VTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
++PNIN FR P WGRGQETPGED + YA Y+ G+QG G L+ A
Sbjct: 169 DVYSPNINTFRHPVWGRGQETPGEDAYCLASTYAYEYITGIQG-----GVDANPLKLIAT 223
Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
KH+ YD++NW +R D ++T QDLA+ Y P F + + +MC+YN VNG+P
Sbjct: 224 AKHYAGYDIENWDNHSRLGNDMQITQQDLAEYYTPQFLVASRDAKVHSVMCSYNAVNGVP 283
Query: 274 SCADRNLLSKTARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC 331
SC++ L R + F GY++ DC AV +++ GYA + A D ++AG D++C
Sbjct: 284 SCSNSFFLQTLLRDTFDFVEDGYVSGDCGAVYNVFNPHGYATNESSAAADSIRAGTDIDC 343
Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
G +H + + +++ +++R + L++ +R G F+G + P+ I V S
Sbjct: 344 GVSYPRHFQESFHDQEVSRQDLERGVTRLYASLIRAGYFDGKTS--PYRNITWSDVVSTN 401
Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSV-SLALIGPNANSAKTLLGNYAGPSCRSIT 450
Q L+ +AA IVLLKN G+LPL S S ++ALIGP AN+ +LGNY GP+ I+
Sbjct: 402 AQNLSYEAAAQSIVLLKND-GILPLTTSSSTKTIALIGPWANATTQMLGNYYGPAPYLIS 460
Query: 451 PLQALQNYVENTVYYPGCDTVAC--SSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDR 508
PLQA Q+ E + Y S S A+ AK AD ++ G+D T E E DR
Sbjct: 461 PLQAFQD-SEYKITYTIGTNTTTDPDSTSQSTALTTAKEADLIIFAGGIDNTLETEAQDR 519
Query: 509 VDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAV 568
++ P Q LIT++A+ KP+I++ + GG VD + K ++N+ +++W GYPG++G
Sbjct: 520 SNITWPSNQLSLITKLADLG-KPLIVLQMGGGQVDSSALKNNKNVNALIWGGYPGQSGGQ 578
Query: 569 ALAEVIFGDHNPGGRLPMTWYPQDYIKV-PMTDMKMRPQATSGNPGRTYRFYEGKEVFPF 627
ALA++I G P RL T YP +Y +V P DM +RP + NPG+TY +Y G V+ F
Sbjct: 579 ALADIITGKRAPAARLVTTQYPAEYAEVFPAIDMNLRPNGS--NPGQTYMWYTGTPVYEF 636
Query: 628 GCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFL-VT 686
G GL Y+ ++ A S TK + ++ P G + E L T
Sbjct: 637 GHGLFYTNFTASASA----------GSGTKNRTSFNIDEVLGRPHPGYKLVEQMPLLNFT 686
Query: 687 IGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLS 746
+ VKN G+ + + FV P K LVGF + A+ + +SL+
Sbjct: 687 VDVKNTGDRVSDYTAMAFVNTTAGPAPHPNKWLVGFDRLSAVEPGSAKTMVIPVTVDSLA 746
Query: 747 RAREDGLMVIEEGTHFLVVGDE 768
R E+G V+ G + + + +E
Sbjct: 747 RTDEEGNRVLYPGRYEVALNNE 768
>gi|376259588|ref|YP_005146308.1| beta-glucosidase-like glycosyl hydrolase [Clostridium sp. BNL1100]
gi|373943582|gb|AEY64503.1| beta-glucosidase-like glycosyl hydrolase [Clostridium sp. BNL1100]
Length = 712
Score = 431 bits (1107), Expect = e-117, Method: Compositional matrix adjust.
Identities = 276/750 (36%), Positives = 391/750 (52%), Gaps = 95/750 (12%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
+ +L +RA DLVSR+TL+EK SQL A + RLGIP Y WW+EALHGVA G
Sbjct: 6 YLDKSLSFKERAADLVSRMTLEEKASQLRYDAQPVERLGIPRYNWWNEALHGVARAGV-- 63
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------MTFW 157
AT FPQ I AA FD +I I E RA YN G +TFW
Sbjct: 64 --------ATVFPQAIGMAAIFDDEFLEKIADVIATEGRAKYNENAKKGDRDIYKGITFW 115
Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
+PN+NIFRDPRWGRG ET GEDP +T + V++V+G+QGD GK L+ +AC KHF
Sbjct: 116 SPNVNIFRDPRWGRGHETYGEDPYLTSRLGVAFVKGLQGD----GKY---LKTAACAKHF 168
Query: 218 TAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCAD 277
+ + R+ FDA V+ +DL +TY P FE+ VK+ + +M AYNR NG P
Sbjct: 169 AVH---SGPEDDRHHFDAVVSQKDLYETYLPAFEALVKEAKVESVMGAYNRTNGEPCNGS 225
Query: 278 RNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQK 337
+ LL R WGF G++ SDC A+ ++ G K+P ++V LK+G D+NCG+ +
Sbjct: 226 KTLLKDILRDGWGFDGHVVSDCWAIKDFHEGHGVTKTPTESVALALKSGCDLNCGN-MYL 284
Query: 338 HTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLAL 397
A+K+ ++ E +IDRA L + RMRLG+F+ + F KI ++ S H L+L
Sbjct: 285 LILLALKEGRITEEDIDRAAIRLMTTRMRLGMFDDDC---EFDKIPYELNDSVEHNKLSL 341
Query: 398 QAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQN 457
+AA+ +VLLKN GLLPL K ++A+IGPNA+S+ L NY+G ++IT L ++
Sbjct: 342 EAAKKSMVLLKND-GLLPLDSKKIKNIAVIGPNADSSLALRANYSGTPSQNITILDGIRK 400
Query: 458 YV-ENT-VYYP--------GCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL- 506
V E+T V+Y + +A + +AV +A+ +D VVL +GLD + E E+
Sbjct: 401 RVSEDTRVWYSVGSHLFMNREEDLAQPDDRLKEAVSVAERSDVVVLCLGLDASVEGEQND 460
Query: 507 ----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSI 556
D+ DL LP Q+ L+ V A KP I+ LL G + I A +I
Sbjct: 461 QGTVILDAGGDKADLNLPESQRNLLNAVL-ATGKPTIVALLSGSALSIGDAA--DKAAAI 517
Query: 557 LWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTY 616
+ YPG G +A AE+IFGD++P GRLP+T+Y P D M RTY
Sbjct: 518 VQCWYPGSRGGLAFAEMIFGDYSPAGRLPVTFYKSTEELPPFADYSME--------NRTY 569
Query: 617 RFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTE 676
+F +G+ ++PFG GLSY+ + Y QN V N +
Sbjct: 570 KFMKGEALYPFGFGLSYTNFEYSNIVCPQN------------VNNGE------------- 604
Query: 677 FCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIV 736
V++ V+N G + V +++K P L GF+ + L + EK +
Sbjct: 605 -----NLSVSVDVQNAGSVDSDEVVQVYIKDMDASVRVPKYSLCGFKRIHLKSGEKKTVT 659
Query: 737 FELSPCESLSRAREDGLMVIEEGTHFLVVG 766
FE+ +++ E G IE G L VG
Sbjct: 660 FEID-SNAMTIVDEAGKRYIENGEFTLYVG 688
>gi|194400335|gb|ACF61038.1| beta-xylosidase [Aspergillus awamori]
Length = 804
Score = 430 bits (1106), Expect = e-117, Method: Compositional matrix adjust.
Identities = 270/770 (35%), Positives = 406/770 (52%), Gaps = 64/770 (8%)
Query: 36 SNPSTETFPFCKTTLPIS----------------------QRARDLVSRLTLDEKISQLV 73
+NP + +P C T+P+S RA L+S TLDE I+
Sbjct: 39 ANP--DLYPLCIETIPLSFPDCQNGPLRSHLICDESATPYDRAASLISLFTLDELIANTG 96
Query: 74 NSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFN-GTIRGATSFPQVILTAASFDSYLWY 132
N+ + RLG+PAY+ WSEALHG+ + F + G+ ATSFPQ ILT A+ + L +
Sbjct: 97 NTGLGVSRLGLPAYQVWSEALHGL---DRANFSDSGSYNWATSFPQPILTTAALNRTLIH 153
Query: 133 RIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVR 192
+I I + RA NAG+ G+ +APNIN FR P WGRGQETPGED + YA Y+
Sbjct: 154 QIASIISTQGRAFNNAGR-YGLDVYAPNINTFRHPVWGRGQETPGEDVSLAAVYAYEYIT 212
Query: 193 GVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFES 252
G+QG L+ +A KH+ YD++NW +R D +T QDL++ Y P F
Sbjct: 213 GIQGP-----DPDSNLKLAATAKHYAGYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHV 267
Query: 253 CVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGF--HGYITSDCDAVSIIYDAEG 310
+ + +MCAYN VNG+P+CAD L R +GF HGY++SDCDA IY+ G
Sbjct: 268 AARDAKVHSVMCAYNAVNGVPACADSYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHG 327
Query: 311 YAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLF 370
YA S A + + AG D++CG+ Q H ++ L +I++ + L++ ++ G F
Sbjct: 328 YASSQAAAAAEAILAGTDIDCGTTYQWHLNESITAGDLSRDDIEKGVIRLYTTLVQAGYF 387
Query: 371 NGNPTM--QPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKS----KSVSL 424
+ N T P+ + V ++ QAA GIVLLKNS+ +LPL + + ++
Sbjct: 388 DSNTTKANNPYRDLTWSDVLETDAWNISYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTV 447
Query: 425 ALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDI 484
ALIGP AN+ LLGNY G + I+P A + + G + S++ A+
Sbjct: 448 ALIGPWANATTQLLGNYYGNAPYMISPRAAFEEAGYKVNFAEGTGISSTSTSGFAAALSA 507
Query: 485 AKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAA-KKPVILVLLCGGPVD 543
A+ AD ++ G+D T E E LDR + PG Q +LI ++A +A KP+I++ + GG VD
Sbjct: 508 ARSADVIIYAGGIDNTLEAEALDRESIAWPGNQLDLIQKLASSAGSKPLIVLQMGGGQVD 567
Query: 544 ITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMK 602
+ K + N+ ++LW GYPG++G AL ++I G NP GRL T YP Y + P TDM
Sbjct: 568 SSSLKNNTNVTALLWGGYPGQSGGFALRDIITGKKNPAGRLVTTQYPASYAEEFPATDMN 627
Query: 603 MRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQ 662
+RP+ NPG+TY++Y G+ V+ FG GL Y+ ++ + ++ LN Q
Sbjct: 628 LRPEGD--NPGQTYKWYTGEAVYEFGHGLFYTTFAESSSNTTTKEVKLNI---------Q 676
Query: 663 DVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNG-RPIKQLVG 721
D++ ++ EL + + T +KN G++ + ++F + G P+K LVG
Sbjct: 677 DILS-QTHEELAS-ITQLPVLNFTANIKNTGKLESDYTAMVFANTSDAGPAPYPVKWLVG 734
Query: 722 FQSV-ILNAKEKAEIVFELSPCE--SLSRAREDGLMVIEEGTHFLVVGDE 768
+ + + E E+ P E S +R EDG V+ GT L + E
Sbjct: 735 WDRLGDVKVGETRELRV---PVEVGSFARVNEDGDWVLFPGTFELALNLE 781
>gi|2791278|emb|CAA93248.1| beta-xylosidase [Trichoderma reesei]
gi|340519464|gb|EGR49702.1| glycoside hydrolase family 3 [Trichoderma reesei QM6a]
Length = 797
Score = 430 bits (1105), Expect = e-117, Method: Compositional matrix adjust.
Identities = 267/734 (36%), Positives = 399/734 (54%), Gaps = 41/734 (5%)
Query: 46 CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVG---K 102
C ++ +RA+ L+S TL+E I NS P +PRLG+P Y+ W+EALHG+ K
Sbjct: 63 CDSSAGYVERAQALISLFTLEELILNTQNSGPGVPRLGLPNYQVWNEALHGLDRANFATK 122
Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
G G ATSFP ILT A+ + L ++I I +ARA N+G+ G+ +APN+N
Sbjct: 123 G----GQFEWATSFPMPILTTAALNRTLIHQIADIISTQARAFSNSGR-YGLDVYAPNVN 177
Query: 163 IFRDPRWGRGQETPGEDPL-VTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
FR P WGRGQETPGED ++ Y Y+ G+QG G L+ +A KHF YD
Sbjct: 178 GFRSPLWGRGQETPGEDAFFLSSAYTYEYITGIQG-----GVDPEHLKVAATVKHFAGYD 232
Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
L+NW +R FDA +T QDL++ Y P F + + ++ +MCAYN VNG+PSCA+ L
Sbjct: 233 LENWNNQSRLGFDAIITQQDLSEYYTPQFLAAARYAKSRSLMCAYNSVNGVPSCANSFFL 292
Query: 282 SKTARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT 339
R WGF GY++SDCDAV +++ YA + A L+AG D++CG H
Sbjct: 293 QTLLRESWGFPEWGYVSSDCDAVYNVFNPHDYASNQSSAAASSLRAGTDIDCGQTYPWHL 352
Query: 340 KAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQA 399
+ ++ EI+R++ L++ +RLG F+ + G DVV + A + + +A
Sbjct: 353 NESFVAGEVSRGEIERSVTRLYANLVRLGYFDKKNQYRSLG--WKDVVKTDAWNI-SYEA 409
Query: 400 AQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV 459
A +GIVLLKN G LPL K K S+ALIGP AN+ + GNY GP+ I+PL+A +
Sbjct: 410 AVEGIVLLKND-GTLPLSK-KVRSIALIGPWANATTQMQGNYYGPAPYLISPLEAAKKAG 467
Query: 460 ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQE 519
+ + G + S+ KA+ AK +D ++ + G+D T E+E DR D+ PG Q +
Sbjct: 468 YHVNFELGTEIAGNSTTGFAKAIAAAKKSDAIIYLGGIDNTIEQEGADRTDIAWPGNQLD 527
Query: 520 LITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHN 579
LI +++E KP++++ + GG VD + K ++ + S++W GYPG++G VAL +++ G
Sbjct: 528 LIKQLSEVG-KPLVVLQMGGGQVDSSSLKSNKKVNSLVWGGYPGQSGGVALFDILSGKRA 586
Query: 580 PGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSY 638
P GRL T YP +Y+ + P DM +RP S NPG+TY +Y GK V+ FG GL Y+ +
Sbjct: 587 PAGRLVTTQYPAEYVHQFPQNDMNLRPDGKS-NPGQTYIWYTGKPVYEFGSGLFYTTFKE 645
Query: 639 KFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRK-FLVTIGVKNHGEMAG 697
A L N SS S P G + E F +KN G+
Sbjct: 646 TL-ASHPKSLKFNTSSIL------------SAPHPGYTYSEQIPVFTFEANIKNSGKTES 692
Query: 698 KHPVLLFVKPARRGNG-RPIKQLVGFQSVI-LNAKEKAEIVFELSPCESLSRAREDGLMV 755
+ +LFV+ + G P K LVGF + + +++ + P +L+R G +
Sbjct: 693 PYTAMLFVRTSNAGPAPYPNKWLVGFDRLADIKPGHSSKLSIPI-PVSALARVDSHGNRI 751
Query: 756 IEEGTHFLVVGDEE 769
+ G + L + +E
Sbjct: 752 VYPGKYELALNTDE 765
>gi|2920706|emb|CAA73902.1| beta-xylosidase [Emericella nidulans]
Length = 802
Score = 430 bits (1105), Expect = e-117, Method: Compositional matrix adjust.
Identities = 256/738 (34%), Positives = 394/738 (53%), Gaps = 35/738 (4%)
Query: 44 PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
P C +L RA LVS T DE ++ N+ + RLG+P Y+ W EALHGV G+
Sbjct: 58 PVCDRSLSPKDRATALVSLFTFDELVNNTGNTGLGVSRLGLPNYQVWGEALHGV---GRA 114
Query: 104 IFF-NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
F +G ATSFP I A+ + L ++IG + + RA NAG G+ ++PNIN
Sbjct: 115 NFVESGNFSWATSFPMPITMMAALNKTLIHQIGTIVSTQLRAFSNAGLG-GVDVYSPNIN 173
Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL 222
FR P WGRGQETPGED +T Y Y+ +QG K+ A+A KH+ YD+
Sbjct: 174 TFRHPVWGRGQETPGEDAFLTSVYGYEYITALQGAV---DPETSKIIATA--KHYAGYDI 228
Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
++W +R D ++T Q+L++ Y PPF + + +MC+YN VNG+PSCA++ L
Sbjct: 229 ESWNNHSRLGNDMQITQQELSEYYTPPFIVASRDAKVRSVMCSYNAVNGVPSCANKFFLQ 288
Query: 283 KTARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTK 340
R + F GY++ DC AV +++ GYA + A D + AG D++CG+ Q H++
Sbjct: 289 TLLRDTFEFSEDGYVSGDCGAVYNVWNPHGYASNEAAASADSILAGTDIDCGTSYQWHSE 348
Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
A + + S+I+R + L+S ++ G F+G P+ I D V S +A +AA
Sbjct: 349 DAFEDSLVSRSDIERGVIRLYSNLVQAGYFDGEDA--PYRDITWDDVLSTDAWNIAYEAA 406
Query: 401 QDGIVLLKNSHGLLPLPKSKSV-SLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV 459
+GIVLLKN LP SK + S+A+IGP AN + L GNY GP+ I+PL ++
Sbjct: 407 VEGIVLLKNDE---TLPLSKDIKSVAVIGPWANVTEELQGNYFGPAPYLISPLTGFRDSG 463
Query: 460 ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQE 519
+ Y G + + S++ ++A+ AK AD ++ G+D T E E +DR ++ PG Q +
Sbjct: 464 LDVHYALGTNLTSHSTSGFEEALTAAKQADAIIFAGGIDNTIEAEAMDRENITWPGNQLD 523
Query: 520 LITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHN 579
LI++++E KP++++ + GG VD + K + N+ +++W GYPG++G ALA++I G
Sbjct: 524 LISKLSELG-KPLVVLQMGGGQVDSSSLKDNDNVNALIWGGYPGQSGGHALADIITGKRA 582
Query: 580 PGGRLPMTWYPQDYIKV-PMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSY 638
P GRL T YP +Y +V P DM +RP TSGNPG+TY +Y G V+ FG GL Y+ +
Sbjct: 583 PAGRLVTTQYPAEYAEVFPAIDMNLRPNETSGNPGQTYMWYTGTPVYEFGHGLFYTTFEE 642
Query: 639 KFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFL-VTIGVKNHGEMAG 697
+ + +T P G E + + L T VKN GE
Sbjct: 643 STETTDAGSFNIQTVLTT--------------PHSGYEHAQQKTLLNFTATVKNTGERES 688
Query: 698 KHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIE 757
+ L++V P K +VGF + ++ + ES++R E G V+
Sbjct: 689 DYTALVYVNTTAGPAPYPKKWVVGFDRLGGLEPGDSQTLTVPVTVESVARTDEQGNRVLY 748
Query: 758 EGTHFLVVGDEEYPISIF 775
G++ + + +E + F
Sbjct: 749 PGSYDVALNNERSVVVKF 766
>gi|164429277|ref|XP_958209.2| hypothetical protein NCU09923 [Neurospora crassa OR74A]
gi|16945419|emb|CAB91343.2| related to xylan 1, 4-beta-xylosidase [Neurospora crassa]
gi|157073010|gb|EAA28973.2| hypothetical protein NCU09923 [Neurospora crassa OR74A]
Length = 774
Score = 430 bits (1105), Expect = e-117, Method: Compositional matrix adjust.
Identities = 283/783 (36%), Positives = 406/783 (51%), Gaps = 57/783 (7%)
Query: 13 LCLCFTSLLTRVDSTQPPFSCDPS--NPSTETFPFCKTTLPISQRARDLVSRLTLDEKIS 70
L C ++L++ +D PF P N + C TL QRA LV+ +T +EK+
Sbjct: 10 LLSCTSALVSAIDL---PFQTYPDCVNGPLASLKVCDATLSPPQRAAALVAAMTTEEKLQ 66
Query: 71 QLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFF---NGTIRGATSFPQVILTAASFD 127
LV+ + PR+G+PAY WWSEALHGVA G F +G +TSFP +L AA+FD
Sbjct: 67 NLVSKSKGAPRIGLPAYNWWSEALHGVA-YAPGTQFRSGDGPFNSSTSFPMPLLMAATFD 125
Query: 128 SYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYA 187
L ++G+ IG E RA NAG + G +W PN+N F+DPRWGRG ETPGED L +YA
Sbjct: 126 DELIEKVGEVIGTEGRAFGNAGFS-GFDYWTPNVNPFKDPRWGRGSETPGEDILRIKRYA 184
Query: 188 VSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQ 247
S +RG+QG + + A CKH+ A D ++W G+TR+ FDA+VT+QDLA+ Y
Sbjct: 185 ASMIRGLQGPL-------PERRVVATCKHYAANDFEDWNGSTRHDFDAKVTLQDLAEYYL 237
Query: 248 PPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHG---YITSDCDAVSI 304
PF+ C + + IMC+YN VNG+P+CA+ L+ R W + YITSDC+AV
Sbjct: 238 SPFQQCARDSKVGSIMCSYNAVNGVPACANTYLMQTILREHWNWTAPGNYITSDCEAVLD 297
Query: 305 IYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVR 364
I+ YAK+ + +AG D +C A Q L +S +DRAL L+
Sbjct: 298 IFANHHYAKTNAEGTALAFEAGTDSSCEYESSSDIPGAWTQGLLEQSTVDRALTRLYEGL 357
Query: 365 MRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSH----GLLPLPKSK 420
+R+G F+GN + +G V SP Q +ALQ A +GIVLLKN GL PKSK
Sbjct: 358 VRVGYFDGN--HSEYASLGWKDVNSPKSQEVALQTAVEGIVLLKNDQTLPLGLKTDPKSK 415
Query: 421 SVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQ--NYVENTVYYPGCDTVACSSASI 478
LA+IG AN KTL G Y+G +P+ A + + T P +
Sbjct: 416 ---LAMIGFWANDPKTLSGGYSGKPAFEHSPVYAAEAMGFNVTTAGGPVLQNSTSNDTWT 472
Query: 479 DKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLC 538
A++ A+ A++++ GLD + E DR + P Q +LI + + KP+++V +
Sbjct: 473 QAALEAAQDANYILYFGGLDTSAAGETKDRTTINWPEAQLQLIKTLTKLG-KPLVVVQM- 530
Query: 539 GGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVP 597
G +D T + + SILWA +PG+ G A+ +++ G +P GRLP+T YP +Y VP
Sbjct: 531 GDQLDNTPLLATKTVNSILWANWPGQDGGTAVMQILTGLKSPAGRLPVTQYPANYTAAVP 590
Query: 598 MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTK 657
MTDM +RP + PGRTYR+Y V PFG GL Y+ + K A +L + S
Sbjct: 591 MTDMNLRP--SDRLPGRTYRWYP-TAVQPFGFGLHYTTFQAKI-AAPLPRLAIQDLLSRC 646
Query: 658 MVENQDVV-HYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPI 716
+N + ++P L E V N G + + VL F+ PI
Sbjct: 647 GGDNANAYPDTCALPPLKVE------------VTNSGNRSSDYVVLAFLAGDAGPRPYPI 694
Query: 717 KQLVGF---QSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPIS 773
K LV + + V K A + + L ++R E G V+ GT+ + V + +
Sbjct: 695 KTLVSYTRLRDVSPGHKTTAHLEWTLG---DIARYDEQGNTVLYPGTYTVTVDEPAQASA 751
Query: 774 IFV 776
FV
Sbjct: 752 SFV 754
>gi|292495285|sp|B6EY09.1|XYND_ASPJA RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|211970990|dbj|BAG82824.1| 1,4-beta-D-xylosidase [Aspergillus japonicus]
Length = 804
Score = 429 bits (1104), Expect = e-117, Method: Compositional matrix adjust.
Identities = 264/735 (35%), Positives = 393/735 (53%), Gaps = 34/735 (4%)
Query: 46 CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
C +T RA LVS TL+E I+ N++P +PRLG+P Y+ WSEALHG+A
Sbjct: 60 CDSTASPYDRAAALVSLFTLEELIANTGNTSPGVPRLGLPPYQVWSEALHGLARAN--FT 117
Query: 106 FNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFR 165
NG ATSFP IL+AA+F+ L +I I + RA NAG+ G+ ++PNIN FR
Sbjct: 118 DNGAYSWATSFPSPILSAAAFNRTLINQIASIISTQGRAFNNAGR-FGLDVYSPNINTFR 176
Query: 166 DPRWGRGQETPGEDPL-VTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
P WGRGQETPGED +T YA Y+ G+QG G L+ +A KHF YD++N
Sbjct: 177 HPVWGRGQETPGEDAYTLTAAYAYEYITGIQG-----GVNPEHLKLAATAKHFAGYDIEN 231
Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
W +R D +T QDLA+ Y P F + MC+YN VNG+PSC++ L
Sbjct: 232 WDNHSRLGNDVNITQQDLAEYYTPQFLVAARDAHVHSFMCSYNAVNGVPSCSNTFFLQTL 291
Query: 285 ARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAA 342
R + F HGY++ DC AV +++ GYA + A D + AG D++CG+ Q H +
Sbjct: 292 LRDTFSFVDHGYVSGDCGAVYGVFNPHGYAANEPSAAADAILAGTDIDCGTSYQYHFNES 351
Query: 343 VKQKKLPESEIDRALHNLFSVRMRLGLFNGN-PTMQPFGKIGADVVCSPAHQVLALQAAQ 401
+ + +I+R L++ + LG F+GN + P+ +G V ++ +AA
Sbjct: 352 ITTGAVARDDIERGFIRLYANLVELGYFDGNSSSSNPYRSLGWPDVQKTDAWNISYEAAV 411
Query: 402 DGIVLLKNSHGLLPL---PKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNY 458
+GIVLLKN G LPL + K+ S+ALIGP AN+ L GNY G + I+P+ A
Sbjct: 412 EGIVLLKND-GTLPLASPSEGKNKSIALIGPWANATTQLQGNYYGDAPYLISPVDAFTAA 470
Query: 459 VENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQ 518
Y PG + S+A+ A+ A+ AD +V + G+D T E E DR + PG Q
Sbjct: 471 GYTVHYAPGTEISTNSTANFSAALSAARAADTIVFLGGIDNTIEAEAQDRSSIAWPGNQL 530
Query: 519 ELITRVA--EAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFG 576
ELI+++A ++ +P+++ + GG VD + K + + ++LW GYPG++G +AL +++ G
Sbjct: 531 ELISQLAAQKSDDQPLVVYQMGGGQVDSSALKSNAKVNALLWGGYPGQSGGLALRDILTG 590
Query: 577 DHNPGGRLPMTWYPQDYIK-VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSK 635
P GRL T YP Y + DM +RP T+ NPG+TY +Y G+ V+ FG GL Y+
Sbjct: 591 ARAPAGRLTTTQYPAAYAESFSALDMNLRPNETTQNPGQTYMWYTGEPVYAFGHGLFYT- 649
Query: 636 YSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEM 695
F A S Q++ TK N + + P+ T + F T + N G+
Sbjct: 650 ---TFNASSA------QAAKTKYTFNITDLTSAAHPDT-TTVGQRTLFNFTASITNSGQR 699
Query: 696 AGKHPVLLFVKPARRG-NGRPIKQLVGFQSVILNAKE--KAEIVFELSPCESLSRAREDG 752
+ L++ + G + P K LVGF + AKE AE+ ++ + L+R E G
Sbjct: 700 DSDYTALVYANTSTAGPSPYPNKWLVGFDRLAAVAKEGGTAELNVPVA-VDRLARVDEAG 758
Query: 753 LMVIEEGTHFLVVGD 767
V+ G + + + +
Sbjct: 759 NTVLFPGRYEVALNN 773
>gi|169767016|ref|XP_001817979.1| exo-1,4-beta-xylosidase xlnD [Aspergillus oryzae RIB40]
gi|121805502|sp|Q2UR38.1|XYND_ASPOR RecName: Full=Exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|83765834|dbj|BAE55977.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 798
Score = 429 bits (1104), Expect = e-117, Method: Compositional matrix adjust.
Identities = 264/743 (35%), Positives = 396/743 (53%), Gaps = 37/743 (4%)
Query: 36 SNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALH 95
S P ++T C T+ RA LVS LT +E ++ N+ PR+G+PAY+ W+EALH
Sbjct: 54 SGPLSKTL-VCDTSAKPHDRAAALVSLLTFEELVNNTANTGHGAPRIGLPAYQVWNEALH 112
Query: 96 GVAGVGKGIFFN-GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGM 154
GVA F + G +TSFPQ I T A+ + L ++I I + RA NAG+ G+
Sbjct: 113 GVA---HADFSDAGGFSWSTSFPQPISTMAALNRTLIHQIATIISTQGRAFMNAGR-YGL 168
Query: 155 TFWAPNINIFRDPRWGRGQETPGEDPL-VTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
++PNIN FR P WGRGQETPGED + YA Y+ G+QG G L+ A
Sbjct: 169 DVYSPNINTFRHPVWGRGQETPGEDAYCLASTYAYEYITGIQG-----GVDANPLKLIAT 223
Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
KH+ YD++NW +R D ++T QDLA+ Y P F + + +MC+YN VNG+P
Sbjct: 224 AKHYAGYDIENWDNHSRLGNDMQITQQDLAEYYTPQFLVASRDAKVHSVMCSYNAVNGVP 283
Query: 274 SCADRNLLSKTARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC 331
SC++ L R + F GY++ DC AV +++ GYA + A D ++AG D++C
Sbjct: 284 SCSNSFFLQTLLRDTFDFVEDGYVSGDCGAVYNVFNPHGYATNESSAAADSIRAGTDIDC 343
Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
G +H + + +++ +++R + L++ +R G F+G + P+ I V S
Sbjct: 344 GVSYPRHFQESFHDQEVSRQDLERGVTRLYASLIRAGYFDGKTS--PYRNITWSDVVSTN 401
Query: 392 HQVLALQAAQDGIVLLKNSHGLLPL--PKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
Q L+ +AA IVLLKN G+LPL S + ++ALIGP AN+ +LGNY GP+ I
Sbjct: 402 AQNLSYEAAAQSIVLLKND-GILPLTSTSSSTKTIALIGPWANATTQMLGNYYGPAPYLI 460
Query: 450 TPLQALQNYVENTVYYPGCDTVAC--SSASIDKAVDIAKGADHVVLMMGLDQTQEKEELD 507
+PLQA Q+ E + Y S S A+ AK AD ++ G+D T E E D
Sbjct: 461 SPLQAFQD-SEYKITYTIGTNTTTDPDSTSQSTALTTAKEADLIIFAGGIDNTLETEAQD 519
Query: 508 RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGA 567
R ++ P Q LIT++A+ KP+I++ + GG VD + K ++N+ +++W GYPG++G
Sbjct: 520 RSNITWPSNQLSLITKLADLG-KPLIVLQMGGGQVDSSALKNNKNVNALIWGGYPGQSGG 578
Query: 568 VALAEVIFGDHNPGGRLPMTWYPQDYIKV-PMTDMKMRPQATSGNPGRTYRFYEGKEVFP 626
ALA++I G P RL T YP +Y +V P DM +RP + NPG+TY +Y G V+
Sbjct: 579 QALADIITGKRAPAARLVTTQYPAEYAEVFPAIDMNLRPNGS--NPGQTYMWYTGTPVYE 636
Query: 627 FGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFL-V 685
FG GL Y+ ++ A SS TK + ++ P LG + E L
Sbjct: 637 FGHGLFYTNFTASASA----------SSGTKNRTSFNIDEVLGRPHLGYKLVEQMPLLNF 686
Query: 686 TIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESL 745
T+ VKN G+ + + FV P K LVGF + A+ + +SL
Sbjct: 687 TVDVKNTGDRVSDYTAMAFVNTTAGPAPHPNKWLVGFDRLSAVEPGSAKTMVIPVTVDSL 746
Query: 746 SRAREDGLMVIEEGTHFLVVGDE 768
+R E+G V+ G + + + +E
Sbjct: 747 ARTDEEGNRVLYPGRYEVALNNE 769
>gi|386347261|ref|YP_006045510.1| glycoside hydrolase family protein [Spirochaeta thermophila DSM
6578]
gi|339412228|gb|AEJ61793.1| glycoside hydrolase family 3 domain protein [Spirochaeta
thermophila DSM 6578]
Length = 693
Score = 429 bits (1103), Expect = e-117, Method: Compositional matrix adjust.
Identities = 279/733 (38%), Positives = 399/733 (54%), Gaps = 88/733 (12%)
Query: 53 SQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRG 112
++R L+S+++++EK +++ A IPRLGIP Y WW+EALHGVA G+
Sbjct: 4 NERMTSLLSKMSIEEKAGLMLHRAKGIPRLGIPHYNWWNEALHGVANSGE---------- 53
Query: 113 ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTFWAPNINIFR 165
AT FPQ I AA+FD L R+ +AI EARA +NA G+TFW+PNINI+R
Sbjct: 54 ATVFPQAIGLAATFDPDLVRRVAEAISTEARAKFNAIGKERAAEYERGLTFWSPNINIYR 113
Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
DPRWGRGQET GEDP +T K VS+V+G+QGD ++ +AC KH+ +
Sbjct: 114 DPRWGRGQETYGEDPFLTSKIGVSFVKGLQGDH------PYYMRVAACAKHYAVHSGPE- 166
Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
R+ FDARV+ +DL +TY P FE+ VK G +M AYNRVNG P+C + LL +
Sbjct: 167 --GLRHVFDARVSEKDLWETYLPAFEALVKAG-VEAVMGAYNRVNGEPACGSKRLLDEIL 223
Query: 286 RRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQ 345
R++WGF G++ SDC A++ + K P +++ L+AG D+NCG+ + H AVK
Sbjct: 224 RKRWGFKGHVVSDCWAIADFHLHHKVTKDPIESIAMALEAGCDLNCGNTYE-HLLDAVKA 282
Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
+ E +DR++ L S RLGLF + P+ ++ + AH+ LA +AA+ +V
Sbjct: 283 GVVSEELVDRSVARLLSTLDRLGLFTDD---HPYARLSLSDIDWEAHRALAREAAEKSVV 339
Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE---NT 462
LLKN +G+LP + K + + GPNA + LLGNYAG S R +T L+ + Y
Sbjct: 340 LLKN-NGILPFDRQKLRYIYVTGPNAANPVALLGNYAGVSSRLVTVLEGITGYAGPGITV 398
Query: 463 VYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRV---------DLVL 513
Y GC ID A +A+ AD V +MG D T E EE D + DL L
Sbjct: 399 TYKIGCPLQGNKINPIDWASGVARYADVTVAVMGRDSTVEGEEGDAIFSDNYGDLSDLDL 458
Query: 514 PGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEV 573
P Q E + R+ E K P+++VLL G PV + + +I++A YPGE G A+A V
Sbjct: 459 PREQIEYLRRIKEIGK-PLVVVLLSGAPV--CSPELEELADAIVYAWYPGEEGGNAIARV 515
Query: 574 IFGDHNPGGRLPMTWYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLS 632
+FG+ +P GRLP+T +P+ ++P TD M GRTYR+ + ++PFG GLS
Sbjct: 516 LFGEISPSGRLPIT-FPRGVDQLPPFTDYSME--------GRTYRYMREEPLYPFGFGLS 566
Query: 633 YSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNH 692
Y+ +SY+ QSS+++ + + + EL CE V+N
Sbjct: 567 YATFSYRGL----------QSSASRWDKRETL-------EL---VCE---------VENT 597
Query: 693 GEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDG 752
+ V L+V+ P+ L GF V L A E+ ++ F LSP E LS E+G
Sbjct: 598 SSIPADEVVQLYVRWEDAPFRVPLWSLKGFTRVSLGAGERKQVRFVLSP-EELSFIDEEG 656
Query: 753 LMVIEEGT-HFLV 764
V+ EG HF V
Sbjct: 657 RKVLPEGRLHFHV 669
>gi|220927661|ref|YP_002504570.1| glycoside hydrolase [Clostridium cellulolyticum H10]
gi|219997989|gb|ACL74590.1| glycoside hydrolase family 3 domain protein [Clostridium
cellulolyticum H10]
Length = 712
Score = 429 bits (1103), Expect = e-117, Method: Compositional matrix adjust.
Identities = 274/750 (36%), Positives = 396/750 (52%), Gaps = 95/750 (12%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
+ +L +RA DLVSR+TL+EK SQL A + RLGIP Y WW+EALHGVA G
Sbjct: 6 YLDKSLSFKERAVDLVSRMTLEEKASQLRYDAQPVERLGIPRYNWWNEALHGVARAGV-- 63
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------MTFW 157
AT FPQ I AA FD +I I E RA YN G +TFW
Sbjct: 64 --------ATVFPQAIGLAAIFDDEFLEKIADVIATEGRAKYNESSKKGDRDIYKGITFW 115
Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
+PN+NIFRDPRWGRG ET GEDP +T + V++V+G+QGD GK L+++AC KHF
Sbjct: 116 SPNVNIFRDPRWGRGHETYGEDPYLTSRLGVAFVKGLQGD----GKY---LKSAACAKHF 168
Query: 218 TAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCAD 277
+ + R+ F+A + +D+ +TY P FE+ VK+ + +M AYNR NG P
Sbjct: 169 AVH---SGPEDDRHHFNAVASQKDMYETYLPAFEALVKEAKVESVMGAYNRTNGEPCNGS 225
Query: 278 RNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQK 337
+ LL R WGF G++ SDC A+ ++ G K+P ++V LK G D+NCG+ +
Sbjct: 226 KTLLKDILRDDWGFDGHVVSDCWAIKDFHEGHGVTKTPTESVALALKNGCDLNCGN-MYL 284
Query: 338 HTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLAL 397
A+K+ K+ E +IDRA L + RM+LG+F+ + F KI +V S H L+L
Sbjct: 285 LILLALKEGKITEEDIDRAAIRLMTTRMKLGMFDDDCE---FDKIPYEVNDSIEHNKLSL 341
Query: 398 QAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQN 457
+AA+ +VLLKN +GLLPL K ++A+IGPNA+S+ L NY+G +IT L +++
Sbjct: 342 EAARKSMVLLKN-NGLLPLDSKKIKNIAVIGPNADSSLALRANYSGTPSHNITILDGVRS 400
Query: 458 YV-ENT-VYYP--------GCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL- 506
V E+T V+Y + +A + +AV +A+ +D VVL +GLD + E E+
Sbjct: 401 RVSEDTRVWYSLGSHLFMNREEDLAQPDDRLKEAVSMAERSDVVVLCLGLDASVEGEQND 460
Query: 507 ----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSI 556
D+ DL LP Q+ L+ V A KP I+ LL G + I A +I
Sbjct: 461 QGTVILDAGGDKADLNLPESQRNLLNAVL-ATGKPTIVALLSGSALSIGDAA--DKAAAI 517
Query: 557 LWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTY 616
+ YPG G +A AE+IFGD++P GRLP+T+Y P D M RTY
Sbjct: 518 VQCWYPGSKGGLAFAEMIFGDYSPAGRLPVTFYKSTEELPPFEDYSME--------NRTY 569
Query: 617 RFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTE 676
+F +G+ ++PFG GLSY+ + Y + +V Q V + +S+
Sbjct: 570 KFMKGEALYPFGFGLSYTNFEY-----------------SNIVCPQAVNNGESLS----- 607
Query: 677 FCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIV 736
V++ V+N G + V +++K P L GF+ + L + EK +
Sbjct: 608 --------VSVDVQNAGSVDSDEVVQVYIKDMEASVRVPNHSLCGFKRIFLKSGEKKTVT 659
Query: 737 FELSPCESLSRAREDGLMVIEEGTHFLVVG 766
FE+ +++ E+G IE G L VG
Sbjct: 660 FEID-SRAMTIVDEEGKRYIENGDFTLYVG 688
>gi|121809149|sp|Q4AEG8.1|XYND_ASPAW RecName: Full=Exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|73486695|dbj|BAE19756.1| beta-xylosidase [Aspergillus awamori]
Length = 804
Score = 429 bits (1103), Expect = e-117, Method: Compositional matrix adjust.
Identities = 266/739 (35%), Positives = 395/739 (53%), Gaps = 40/739 (5%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
C T RA L+S TLDE I+ N+ + RLG+PAY+ WSEALHG+ +
Sbjct: 68 ICDETATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHGL---DRAN 124
Query: 105 FFN-GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
F + G ATSFPQ ILT A+ + L ++I I + RA NAG+ G+ +APNIN
Sbjct: 125 FSDSGAYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGR-YGLDVYAPNINT 183
Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
FR P WGRGQETPGED + YA Y+ G+QG + L+ +A KH+ YD++
Sbjct: 184 FRHPVWGRGQETPGEDVSLAAVYAYEYITGIQGP-----DPESNLKLAATAKHYAGYDIE 238
Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
NW +R D +T QDL++ Y P F + + +MCAYN VNG+P+CAD L
Sbjct: 239 NWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVQSVMCAYNAVNGVPACADSYFLQT 298
Query: 284 TARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA 341
R +GF HGY++SDCDA IY+ GYA S A + + AG D++CG+ Q H
Sbjct: 299 LLRDTFGFVDHGYVSSDCDAAYNIYNPHGYASSQAAAAAEAILAGTDIDCGTTYQWHLNE 358
Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTM--QPFGKIGADVVCSPAHQVLALQA 399
++ L +I++ + L++ ++ G F+ N T P+ + V ++ QA
Sbjct: 359 SITAGDLSRDDIEQGVIRLYTTLVQAGYFDSNTTKANNPYRDLSWSDVLETDAWNISYQA 418
Query: 400 AQDGIVLLKNSHGLLPLPKS----KSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL 455
A GIVLLKNS+ +LPL + + ++ALIGP AN+ LLGNY G + I+P A
Sbjct: 419 ATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYMISPRAAF 478
Query: 456 QNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPG 515
+ + G + S++ A+ A+ AD ++ G+D T E E LDR + PG
Sbjct: 479 EEAGYKVNFAEGTGISSTSTSGFAAALSAAQSADVIIYAGGIDNTLEAEALDRESIAWPG 538
Query: 516 RQQELITRVAEAA-KKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVI 574
Q +LI ++A AA KKP+I++ + GG VD + K + + ++LW GYPG++G AL ++I
Sbjct: 539 NQLDLIQKLASAAGKKPLIVLQMGGGQVDSSSLKNNTKVSALLWGGYPGQSGGFALRDII 598
Query: 575 FGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSY 633
G NP GRL T YP Y + P TDM +RP+ NPG+TY++Y G+ V+ FG GL Y
Sbjct: 599 TGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGD--NPGQTYKWYTGEAVYEFGHGLFY 656
Query: 634 SKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHG 693
+ ++ + ++ LN QD++ ++ EL + + T ++N G
Sbjct: 657 TTFAESSSNTTTKEVKLNI---------QDILS-RTHEELAS-ITQLPVLNFTANIRNTG 705
Query: 694 EMAGKHPVLLFVKPARRGNG-RPIKQLVGFQSVILNAKEKAEIVFELS-PCE--SLSRAR 749
++ + ++F + G P K LVG+ + + K EL P E S +R
Sbjct: 706 KLESDYTAMVFANTSDAGPAPYPKKWLVGWDRL---GEVKVGETRELRVPVEVGSFARVN 762
Query: 750 EDGLMVIEEGTHFLVVGDE 768
EDG V+ GT L + E
Sbjct: 763 EDGDWVLFPGTFELALNLE 781
>gi|255545660|ref|XP_002513890.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
gi|223546976|gb|EEF48473.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
Length = 336
Score = 429 bits (1102), Expect = e-117, Method: Compositional matrix adjust.
Identities = 207/341 (60%), Positives = 261/341 (76%), Gaps = 6/341 (1%)
Query: 436 TLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMM 495
TL+GNYAGP C+++TPLQ LQNY++NT Y+ GC+TVACS A+I +AV IA+ D VVL+M
Sbjct: 2 TLVGNYAGPPCKTVTPLQGLQNYIKNTKYHRGCNTVACSKATIREAVQIAREVDQVVLVM 61
Query: 496 GLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGS 555
LDQTQE E +D ++L LPG QQ+LI VA AA KPV+LVL+CGG VD++FA + IGS
Sbjct: 62 ELDQTQEAERIDLLNLRLPGNQQKLIISVARAANKPVVLVLICGGLVDVSFAITEPKIGS 121
Query: 556 ILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRT 615
ILWAGYPGEAG ALAE+IFGDHNPGG+LP+TWYPQ Y K+PMTD++MRPQ SG PGR+
Sbjct: 122 ILWAGYPGEAGGTALAEIIFGDHNPGGKLPVTWYPQQYTKIPMTDVRMRPQIASGYPGRS 181
Query: 616 YRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGT 675
YRFYEGK+VF FG GLSYS SY+ ++ Q+K++L SS K V+ Y V ELG
Sbjct: 182 YRFYEGKKVFEFGYGLSYSNCSYEIASIPQDKIFLRSPSSIKGVKTSS---YTLVSELGK 238
Query: 676 EFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEI 735
E CE KF VT+ VKN G++ GKHPVL+F++ + G+GRP+K+LV FQ+V LNA + AEI
Sbjct: 239 ELCERSKFSVTVKVKNEGKIIGKHPVLVFLRQPKPGSGRPVKKLVAFQTVRLNAGQNAEI 298
Query: 736 VFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
+LSPCE L+RA EDG MVI+ G LVVG++ Y I+I V
Sbjct: 299 QRKLSPCEPLTRANEDGSMVIDGG---LVVGEKPYQITIIV 336
>gi|4235093|gb|AAD13106.1| beta-xylosidase [Aspergillus niger]
Length = 804
Score = 428 bits (1101), Expect = e-117, Method: Compositional matrix adjust.
Identities = 269/770 (34%), Positives = 406/770 (52%), Gaps = 64/770 (8%)
Query: 36 SNPSTETFPFCKTTLPIS----------------------QRARDLVSRLTLDEKISQLV 73
+NP + +P C T+P+S RA L+S TLDE I+
Sbjct: 39 ANP--DLYPLCIETIPLSFPDCQNGPLRSHLICDESATPYDRAASLISLFTLDELIANTG 96
Query: 74 NSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFN-GTIRGATSFPQVILTAASFDSYLWY 132
N+ + RLG+PAY+ WSEALHG+ + F + G+ ATSFPQ ILT A+ + L +
Sbjct: 97 NTGLGVSRLGLPAYQVWSEALHGL---DRANFSDSGSYNWATSFPQPILTTAALNRTLIH 153
Query: 133 RIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVR 192
+I I + RA NAG+ G+ +APNIN FR P WGRGQETPGED + YA Y+
Sbjct: 154 QIASIISTQGRAFNNAGR-YGLDVYAPNINTFRHPVWGRGQETPGEDVSLAAVYAYEYIT 212
Query: 193 GVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFES 252
G+QG L+ +A KH+ YD++NW +R D +T QDL++ Y P F
Sbjct: 213 GIQGP-----DPDSNLKLAATAKHYAGYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHV 267
Query: 253 CVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGF--HGYITSDCDAVSIIYDAEG 310
+ + +MCAYN V+G+P+CAD L R +GF HGY++SDCDA IY+ G
Sbjct: 268 AARDAKVHSVMCAYNAVDGVPACADSYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHG 327
Query: 311 YAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLF 370
YA S A + + AG D++CG+ Q H ++ L +I++ + L++ ++ G F
Sbjct: 328 YASSQAAAAAEAILAGTDIDCGTTYQWHLNESITAGDLSRDDIEKGVIRLYTTLVQAGYF 387
Query: 371 NGNPTM--QPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKS----KSVSL 424
+ N T P+ + V ++ QAA GIVLLKNS+ +LPL + + ++
Sbjct: 388 DSNTTKANNPYRDLTWSDVLETDAWNISYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTV 447
Query: 425 ALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDI 484
ALIGP AN+ LLGNY G + I+P A + + G + S++ A+
Sbjct: 448 ALIGPWANATTQLLGNYYGNAPYMISPRAAFEEAGYKVNFAEGTGISSTSTSGFAAALSA 507
Query: 485 AKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAA-KKPVILVLLCGGPVD 543
A+ AD ++ G+D T E E LDR + PG Q +LI ++A +A KP+I++ + GG VD
Sbjct: 508 ARSADVIIYAGGIDNTLEAEALDRESIAWPGNQLDLIQKLASSAGSKPLIVLQMGGGQVD 567
Query: 544 ITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMK 602
+ K + N+ ++LW GYPG++G AL ++I G NP GRL T YP Y + P TDM
Sbjct: 568 SSSLKNNTNVTALLWGGYPGQSGGFALRDIITGKKNPAGRLVTTQYPASYAEEFPATDMN 627
Query: 603 MRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQ 662
+RP+ NPG+TY++Y G+ V+ FG GL Y+ ++ + ++ LN Q
Sbjct: 628 LRPEGD--NPGQTYKWYTGEAVYEFGHGLFYTTFAESSSNTTTKEVKLNI---------Q 676
Query: 663 DVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNG-RPIKQLVG 721
D++ ++ EL + + T +KN G++ + ++F + G P+K LVG
Sbjct: 677 DILS-QTHEELAS-ITQLPVLNFTANIKNTGKLESDYTAMVFANTSDAGPAPYPVKWLVG 734
Query: 722 FQSV-ILNAKEKAEIVFELSPCE--SLSRAREDGLMVIEEGTHFLVVGDE 768
+ + + E E+ P E S +R EDG V+ GT L + E
Sbjct: 735 WDRLGDVKVGETRELRV---PVEVGSFARVNEDGDWVLFPGTFELALNLE 781
>gi|354508473|gb|AER26905.1| beta-xylosidase 3 [synthetic construct]
Length = 778
Score = 428 bits (1101), Expect = e-117, Method: Compositional matrix adjust.
Identities = 269/770 (34%), Positives = 406/770 (52%), Gaps = 64/770 (8%)
Query: 36 SNPSTETFPFCKTTLPIS----------------------QRARDLVSRLTLDEKISQLV 73
+NP + +P C T+P+S RA L+S TLDE I+
Sbjct: 13 ANP--DLYPLCIETIPLSFPDCQNGPLRSHLICDESATPYDRAASLISLFTLDELIANTG 70
Query: 74 NSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFN-GTIRGATSFPQVILTAASFDSYLWY 132
N+ + RLG+PAY+ WSEALHG+ + F + G+ ATSFPQ ILT A+ + L +
Sbjct: 71 NTGLGVSRLGLPAYQVWSEALHGL---DRANFSDSGSYNWATSFPQPILTTAALNRTLIH 127
Query: 133 RIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVR 192
+I I + RA NAG+ G+ +APNIN FR P WGRGQETPGED + YA Y+
Sbjct: 128 QIASIISTQGRAFNNAGR-YGLDVYAPNINTFRHPVWGRGQETPGEDVSLAAVYAYEYIT 186
Query: 193 GVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFES 252
G+QG L+ +A KH+ YD++NW +R D +T QDL++ Y P F
Sbjct: 187 GIQGP-----DPDSNLKLAATAKHYAGYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHV 241
Query: 253 CVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGF--HGYITSDCDAVSIIYDAEG 310
+ + +MCAYN V+G+P+CAD L R +GF HGY++SDCDA IY+ G
Sbjct: 242 AARDAKVHSVMCAYNAVDGVPACADSYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHG 301
Query: 311 YAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLF 370
YA S A + + AG D++CG+ Q H ++ L +I++ + L++ ++ G F
Sbjct: 302 YASSQAAAAAEAILAGTDIDCGTTYQWHLNESITAGDLSRDDIEKGVIRLYTTLVQAGYF 361
Query: 371 NGNPTM--QPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKS----KSVSL 424
+ N T P+ + V ++ QAA GIVLLKNS+ +LPL + + ++
Sbjct: 362 DSNTTKANNPYRDLTWSDVLETDAWNISYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTV 421
Query: 425 ALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDI 484
ALIGP AN+ LLGNY G + I+P A + + G + S++ A+
Sbjct: 422 ALIGPWANATTQLLGNYYGNAPYMISPRAAFEEAGYKVNFAEGTGISSTSTSGFAAALSA 481
Query: 485 AKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAA-KKPVILVLLCGGPVD 543
A+ AD ++ G+D T E E LDR + PG Q +LI ++A +A KP+I++ + GG VD
Sbjct: 482 ARSADVIIYAGGIDNTLEAEALDRESIAWPGNQLDLIQKLASSAGSKPLIVLQMGGGQVD 541
Query: 544 ITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMK 602
+ K + N+ ++LW GYPG++G AL ++I G NP GRL T YP Y + P TDM
Sbjct: 542 SSSLKNNTNVTALLWGGYPGQSGGFALRDIITGKKNPAGRLVTTQYPASYAEEFPATDMN 601
Query: 603 MRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQ 662
+RP+ NPG+TY++Y G+ V+ FG GL Y+ ++ + ++ LN Q
Sbjct: 602 LRPEGD--NPGQTYKWYTGEAVYEFGHGLFYTTFAESSSNTTTKEVKLNI---------Q 650
Query: 663 DVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNG-RPIKQLVG 721
D++ ++ EL + + T +KN G++ + ++F + G P+K LVG
Sbjct: 651 DILS-QTHEELAS-ITQLPVLNFTANIKNTGKLESDYTAMVFANTSDAGPAPYPVKWLVG 708
Query: 722 FQSV-ILNAKEKAEIVFELSPCE--SLSRAREDGLMVIEEGTHFLVVGDE 768
+ + + E E+ P E S +R EDG V+ GT L + E
Sbjct: 709 WDRLGDVKVGETRELRV---PVEVGSFARVNEDGDWVLFPGTFELALNLE 755
>gi|336471692|gb|EGO59853.1| hypothetical protein NEUTE1DRAFT_99999 [Neurospora tetrasperma FGSC
2508]
gi|350292807|gb|EGZ74002.1| glycoside hydrolase [Neurospora tetrasperma FGSC 2509]
Length = 770
Score = 428 bits (1100), Expect = e-117, Method: Compositional matrix adjust.
Identities = 286/791 (36%), Positives = 412/791 (52%), Gaps = 57/791 (7%)
Query: 5 KLSLVFPLLCLCFTSLLTRVDSTQPPFSCDPS--NPSTETFPFCKTTLPISQRARDLVSR 62
K SLV L C + L++ +D PF P N + C TL QRA LV+
Sbjct: 2 KSSLVPYCLLSCSSVLVSAIDL---PFQTYPDCVNGPLASLKVCDVTLSPPQRAAALVAA 58
Query: 63 LTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFF--NGTIRGATSFPQVI 120
+T +EK+ LV+ + PR+G+PAY WWSEALHGVA F+ +G +TSFP +
Sbjct: 59 MTTEEKLQNLVSKSKGAPRIGLPAYNWWSEALHGVAYAPGTQFWSGDGPFNASTSFPMPL 118
Query: 121 LTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDP 180
L AA+FD L ++G+ IG E RA NAG + G +W PN+N F+DPRWGRG ETPGED
Sbjct: 119 LMAATFDDELIEKVGEVIGTEGRAFGNAGFS-GFDYWTPNVNPFKDPRWGRGSETPGEDI 177
Query: 181 LVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQ 240
L +YA S +RG+QG + + A CKH+ A D ++W G+TR+ F+A+VT+Q
Sbjct: 178 LRIKRYAASMIRGLQGPAR-------ERRVVATCKHYAANDFEDWNGSTRHDFNAKVTLQ 230
Query: 241 DLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHG---YITS 297
DLA+ Y PF+ C + + IMC+YN VNG+P+CA+ L+ R W + YITS
Sbjct: 231 DLAEYYLSPFQQCARDSKVGSIMCSYNAVNGVPACANTYLMQTILREHWNWTAPGNYITS 290
Query: 298 DCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRAL 357
DC+AV I YA++ + +AG+D +C A Q L +S +DRAL
Sbjct: 291 DCEAVLDISANHHYAETNAEGTALAFEAGIDSSCEYESSSDIPGAWTQGLLEQSTVDRAL 350
Query: 358 HNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPL- 416
++ +R+G F+GN + +G V SP Q +ALQAA +GIVLLKN LPL
Sbjct: 351 KRIYEGLVRVGYFDGN--HSEYASLGWKDVNSPKSQEVALQAAVEGIVLLKNDK-TLPLD 407
Query: 417 ----PKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQ--NYVENTVYYPGCDT 470
PKSK LA+IG AN KTL G Y+G +P+ A Q + T P
Sbjct: 408 LRTDPKSK---LAMIGFWANDPKTLSGGYSGKPAFEHSPVYAAQAMGFSVTTAGGPVLQN 464
Query: 471 VACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKK 530
+ A++ AK A++++ G D + E DR + P Q +LIT +++ K
Sbjct: 465 STSNDTWTQAALEAAKDANYILYFGGQDTSAAGETKDRTTINWPEAQLQLITTLSKLG-K 523
Query: 531 PVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYP 590
P+++V + G +D T + + +ILWA + G+ G A+ +++ G NP GRLP+T YP
Sbjct: 524 PLVVVQM-GDQLDNTPLLAAKAVNAILWANWLGQDGGTAVMQILTGLKNPAGRLPVTQYP 582
Query: 591 QDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLY 649
+Y VPMTDM +RP + PGRTYR+Y V PFG GL Y+ + K AV +L
Sbjct: 583 ANYTAAVPMTDMNLRP--SDKLPGRTYRWYP-TAVQPFGFGLHYTTFQTKI-AVPLPRLA 638
Query: 650 LNQSSSTKMVENQDVV-HYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPA 708
+ S +N + ++P L E V N G + + VL F+
Sbjct: 639 IQDLLSRCGGDNANAYPDTCALPPLKVE------------VTNSGNRSSDYVVLAFLAGD 686
Query: 709 RRGNGRPIKQLVGF---QSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVV 765
PIK LV + + + K A + + L ++R E G V+ GT+ + V
Sbjct: 687 VGPKPYPIKTLVSYTRLRDLSPGHKTTAHLKWTLG---DIARYDEQGNTVLYPGTYTVTV 743
Query: 766 GDEEYPISIFV 776
+ + FV
Sbjct: 744 DEPAQASARFV 754
>gi|358365439|dbj|GAA82061.1| beta-xylosidase [Aspergillus kawachii IFO 4308]
Length = 788
Score = 427 bits (1098), Expect = e-116, Method: Compositional matrix adjust.
Identities = 262/733 (35%), Positives = 397/733 (54%), Gaps = 41/733 (5%)
Query: 51 PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFN-GT 109
P++++ L+S TLDE I+ N+ + RLG+PAY+ WSEALHG+ + F + G+
Sbjct: 59 PMTEQ-HSLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHGL---DRANFSDSGS 114
Query: 110 IRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRW 169
ATSFPQ ILT A+ + L ++I I + RA NAG+ G+ +APNIN FR P W
Sbjct: 115 YNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGR-YGLDVYAPNINTFRHPVW 173
Query: 170 GRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTT 229
GRGQETPGED + YA Y+ G+QG L+ +A KH+ YD++NW +
Sbjct: 174 GRGQETPGEDVSLAAVYAYEYITGIQGP-----DPDSNLKLAATAKHYAGYDIENWHNHS 228
Query: 230 RYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQW 289
R D +T QDL++ Y P F + + +MCAYN VNG+P+CAD L R +
Sbjct: 229 RLGNDMNITQQDLSEYYTPQFHVAARDAKVHSVMCAYNAVNGVPACADSYFLQTLLRDTF 288
Query: 290 GF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKK 347
GF HGY++SDCDA IY+ GYA S A + + AG D++CG+ Q H ++
Sbjct: 289 GFVDHGYVSSDCDAAYNIYNPHGYASSQAAAAAEAILAGTDIDCGTTYQWHLNESITAGD 348
Query: 348 LPESEIDRALHNLFSVRMRLGLFNGNPTM--QPFGKIGADVVCSPAHQVLALQAAQDGIV 405
L +I++ + L++ ++ G F+ N T P+ + V ++ QAA GIV
Sbjct: 349 LSRDDIEKGVIRLYTTLVQAGYFDSNTTKANNPYRDLTWSDVLETDAWNISYQAATQGIV 408
Query: 406 LLKNSHGLLPLPKS----KSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN 461
LLKNS+ +LPL + + ++ALIGP AN+ LLGNY G + I+P A +
Sbjct: 409 LLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYMISPRAAFEEAGYK 468
Query: 462 TVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELI 521
+ G + S++ A+ A+ AD ++ G+D T E E LDR + PG Q +LI
Sbjct: 469 VNFAEGTGISSTSTSGFAAALSAARSADVIIYAGGIDNTLEAEALDRESIAWPGNQLDLI 528
Query: 522 TRVAEAA-KKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
++A +A KP+I++ + GG VD + K + N+ ++LW GYPG++G AL ++I G NP
Sbjct: 529 QKLASSAGSKPLIVLQMGGGQVDSSSLKNNTNVTALLWGGYPGQSGGFALRDIITGKKNP 588
Query: 581 GGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYK 639
GRL T YP Y + P TDM +RP+ NPG+TY++Y G+ V+ FG GL Y+ ++
Sbjct: 589 AGRLVTTQYPASYAEEFPATDMNLRPEGD--NPGQTYKWYTGEAVYEFGHGLFYTTFAES 646
Query: 640 FKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKH 699
+ ++ LN QD++ ++ EL + + T +KN G++ +
Sbjct: 647 SSNTTTKEVKLNI---------QDILS-QTHEELAS-ITQLPVLNFTANIKNTGKLESDY 695
Query: 700 PVLLFVKPARRGNG-RPIKQLVGFQSV-ILNAKEKAEIVFELSPCE--SLSRAREDGLMV 755
++F + G P+K LVG+ + + E E+ P E S +R EDG V
Sbjct: 696 TAMVFANTSDAGPAPYPVKWLVGWDRLGDVKVGETRELRV---PVEVGSFARVNEDGDWV 752
Query: 756 IEEGTHFLVVGDE 768
+ GT L + E
Sbjct: 753 LFPGTFELALNLE 765
>gi|115387056|ref|XP_001210069.1| predicted protein [Aspergillus terreus NIH2624]
gi|114191067|gb|EAU32767.1| predicted protein [Aspergillus terreus NIH2624]
Length = 908
Score = 427 bits (1098), Expect = e-116, Method: Compositional matrix adjust.
Identities = 279/751 (37%), Positives = 412/751 (54%), Gaps = 53/751 (7%)
Query: 38 PSTETFPFCK-----TTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSE 92
P+ +T P C T+L I++R LV LTL+EKI LV++A RLG+P YEWW+E
Sbjct: 150 PACQTQPLCSHRVCDTSLSIAERVNSLVKSLTLEEKILNLVDAAAGSTRLGLPFYEWWNE 209
Query: 93 ALHGVAGVGKGIFFN---GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG 149
A HGV G G+ F ATSFP IL AASFD+ L +I + IG E RA N G
Sbjct: 210 ATHGV-GSAPGVQFTSKPANFSYATSFPAPILIAASFDNALIRKIAEVIGKEGRAFANNG 268
Query: 150 QAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQ 209
+ G FWAPNIN FRDPRWGRGQETPGED V Y +++ G+QGD K K Q
Sbjct: 269 FS-GFDFWAPNINGFRDPRWGRGQETPGEDTFVAQNYIRNFIPGLQGD-----DPKNK-Q 321
Query: 210 ASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRV 269
A CKH+ YDL+ T RY + T QDL+D + PF++CV+ IMC+YN V
Sbjct: 322 VIATCKHYAVYDLE----TGRYGNNYNPTQQDLSDYFLAPFKTCVRDTDVGSIMCSYNSV 377
Query: 270 NGIPSCADRNLLSKTARRQWGFHG---YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAG 326
+GIP+CA+ LL + R+ WGF+ Y+ SDC+AV+ I+ + + E A L AG
Sbjct: 378 SGIPACANEYLLDEVLRKHWGFNADYHYVVSDCNAVTDIWQYHNFTDTEEAAAAVALNAG 437
Query: 327 MDVNCG-SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD 385
+D+ CG S+L+ + A Q + +D++L L+S +G F+G + +
Sbjct: 438 VDLECGSSYLKLNESLAANQTSV--KAMDQSLARLYSALFTIGFFDGG----KYDHLDFS 491
Query: 386 VVCSPAHQVLALQAAQDGIVLLKNSHGLLPL-PKSKSVSLALIGPNANSAKTLLGNYAGP 444
V PA Q LA +AA +G+ LLKN GLLPL + K S+A+IGP AN+ + G Y+G
Sbjct: 492 DVSIPAAQALAYEAAVEGMTLLKND-GLLPLHSQHKYKSVAVIGPFANATTQMQGGYSGN 550
Query: 445 SCRSITPLQALQNYVENTVYYP-GCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEK 503
+ I+PL A ++ V Y G ++ + ++ AK +D +V + G+D + E
Sbjct: 551 APYLISPLVAFESDHRWKVNYAVGTAINDQNTTGFEASLAAAKKSDLIVYLGGIDNSIES 610
Query: 504 EELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPG 563
E +DR L PG Q +LI ++ + KP+++V GG VD + +++I +++WAGYP
Sbjct: 611 ETIDRTSLAWPGNQLDLIKSLSNLS-KPMVVVQFGGGQVDDSALLENKDIQALIWAGYPS 669
Query: 564 EAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGK 622
++G AL +++ G +P GRLP+T YP Y ++ + D+ +RP + +PGRTY++Y GK
Sbjct: 670 QSGGTALLDILVGKRSPAGRLPVTQYPASYADQINIFDINLRPNSKDSHPGRTYKWYTGK 729
Query: 623 EVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRK 682
V PFG GL Y+K+ + ++ LN+ S Q++V G T
Sbjct: 730 PVIPFGHGLHYTKFKFGWEET------LNREYSI-----QELVASCQRSSGGPIKDNTPF 778
Query: 683 FLVTIGVKNHGEMAGKHPVLLFVKPARRGNG-RPIKQLVGFQ---SVILNAKEKAEIVFE 738
V V+N G + LLF+ G RP K LV ++ ++ + A++
Sbjct: 779 TTVKARVRNVGHETSDYVSLLFLSSKNAGPAPRPNKSLVSYKRLHNIAPGSDRVADLPLT 838
Query: 739 LSPCESLSRAREDGLMVIEEGTHFLVVGDEE 769
L SL+RA E+G + I G + + + + E
Sbjct: 839 LG---SLARADENGSLYIFPGRYKIALDNSE 866
>gi|330947691|ref|XP_003306937.1| hypothetical protein PTT_20252 [Pyrenophora teres f. teres 0-1]
gi|311315273|gb|EFQ84970.1| hypothetical protein PTT_20252 [Pyrenophora teres f. teres 0-1]
Length = 756
Score = 427 bits (1097), Expect = e-116, Method: Compositional matrix adjust.
Identities = 267/737 (36%), Positives = 386/737 (52%), Gaps = 37/737 (5%)
Query: 34 DPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEA 93
D N ++ C T ++RA LV+ + EK+ LV+ + + RLG+PAY WW EA
Sbjct: 23 DCKNGPLKSNAICDVTASPAKRAAALVAAMQTQEKLENLVSKSKGVARLGLPAYNWWGEA 82
Query: 94 LHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG 153
LHGVAG GI F G+ R ATSFP +L +A+FD L ++I IG EARA N G A
Sbjct: 83 LHGVAGA-PGINFTGSYRTATSFPMPLLMSAAFDDDLIHQIAIVIGNEARAFGNGGIA-P 140
Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
+ FW P+IN FRDPRWGRG ETPGED L Y S + G++GD K + + A
Sbjct: 141 VDFWTPDINPFRDPRWGRGSETPGEDILRIKGYTKSLLSGLEGD-------KAQRKIIAT 193
Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
CKH+ YD++NW GT R+ FDA++T QDLA+ + PPF+ C + + MC+YN VNG+P
Sbjct: 194 CKHYVGYDVENWNGTDRHHFDAKITTQDLAEYFMPPFQQCARDSKVGSFMCSYNAVNGVP 253
Query: 274 SCADRNLLSKTARRQWGF---HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVN 330
+CAD +L R+ W + + YITSDC+AV I Y + ++A GMD++
Sbjct: 254 TCADTYVLEDILRKHWNWTDSNNYITSDCEAVKDISLRHKYVATLQEATAIAFNNGMDLS 313
Query: 331 CGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSP 390
C A Q L S IDRAL + + G F+G + +G + +P
Sbjct: 314 CEYSGTSDIPGAFSQGLLNVSVIDRALTRQYEGLVHAGYFDG--AAATYAHLGVQDINTP 371
Query: 391 AHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSIT 450
Q L LQ A +G+ LLKN LPL +A++G AN+ L G Y+GP+ T
Sbjct: 372 EAQKLVLQVAAEGLTLLKNDD-TLPLSLKSGSKVAMVGFWANTTSKLSGIYSGPAPYLHT 430
Query: 451 PLQALQNYVENTVYYPG--CDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDR 508
P+ A + G T + A++ AK +D ++ GLD + E DR
Sbjct: 431 PVYAGNKLGLDMAVATGPILQTSGAADNWTTTALNAAKKSDFILYFGGLDPSAAAEGSDR 490
Query: 509 VDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAV 568
D+ P Q +LIT++A A KP++++ L G VD T + + S++WA +PG+ G
Sbjct: 491 TDISWPSAQIDLITKLA-ALGKPLVVIAL-GDMVDHTPILKMKGVNSLIWANWPGQDGGT 548
Query: 569 ALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFG 628
A+ +VI G+H GRLP+T YP +Y ++ M DM MRP NPGRTYR+Y + V PFG
Sbjct: 549 AVMQVITGEHAIAGRLPITQYPAEYTQLSMLDMNMRPGGN--NPGRTYRWYN-ESVQPFG 605
Query: 629 CGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIG 688
GL Y+K++ KF SSS V QD++ KS + + C+ + +
Sbjct: 606 FGLHYTKFAAKFG-----------SSSGLTVNIQDIM--KSCTKDHPDLCDVPP--IEVA 650
Query: 689 VKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRA 748
V N G L F+K P+K LV + + + + ++ +LSR
Sbjct: 651 VTNEGNRTSDFIALAFIKGEVGPKPYPLKTLVSYARLRDISGSQTKMASLALTLGALSRV 710
Query: 749 REDGLMVIEEGTHFLVV 765
+ G +V G + L++
Sbjct: 711 DQSGNLVAYPGEYTLLL 727
>gi|365120422|ref|ZP_09338009.1| hypothetical protein HMPREF1033_01355 [Tannerella sp.
6_1_58FAA_CT1]
gi|363647477|gb|EHL86692.1| hypothetical protein HMPREF1033_01355 [Tannerella sp.
6_1_58FAA_CT1]
Length = 735
Score = 427 bits (1097), Expect = e-116, Method: Compositional matrix adjust.
Identities = 270/756 (35%), Positives = 402/756 (53%), Gaps = 88/756 (11%)
Query: 33 CDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSE 92
C + + TFPF L +R DLVSRLTL+EKISQ++N APAI RLGIPAY+WW+E
Sbjct: 17 CAHTGKAQNTFPFQNPDLSFEKRVDDLVSRLTLEEKISQMLNKAPAIERLGIPAYDWWNE 76
Query: 93 ALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI 152
LHGV T T FPQ I AA++D L+ ++ +I E RA+Y+ +
Sbjct: 77 CLHGVG---------RTPYKVTVFPQAIGMAATWDDALFQQVASSIADEGRAIYHDAISK 127
Query: 153 -------GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLK 205
G+T+W PNINIFRDPRWGRGQET GEDP +TG ++V G+QGD
Sbjct: 128 GVHEIYHGLTYWTPNINIFRDPRWGRGQETYGEDPYLTGTLGKAFVNGLQGDD------P 181
Query: 206 GKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCA 265
L+ASAC KH+ + + +R+ F+ V+M DL DTY P F V + S +MCA
Sbjct: 182 KYLKASACAKHYAVH---SGPEISRHFFNTEVSMYDLWDTYLPAFRDLVVDAKVSSVMCA 238
Query: 266 YNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKA 325
YN + G P C + L+ R+QW F GY+TSDC A+ + +A + + VL
Sbjct: 239 YNALAGQPCCGNDLLMQDILRKQWKFTGYVTSDCGAIDDFLKHKTHADAAHASADAVLH- 297
Query: 326 GMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD 385
G D+ CG + AVKQ + E++ID ++ LF R RLGLF+ ++ +
Sbjct: 298 GTDLECGQNIYVKLVDAVKQGLITEAQIDESVKRLFMTRFRLGLFDPADRVK-YADTPLS 356
Query: 386 VVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPS 445
V+ H+ LAL+ +++ +VLLKN + +LPL K+ +A+IGPNA+ + +LGNY G
Sbjct: 357 VLECDEHKALALKMSRESVVLLKNDN-VLPLRKNLK-KIAVIGPNADDSTVVLGNYNGFP 414
Query: 446 CRSITPLQALQNYV---ENTVYYPGCDTVACS-SASIDKAVDIAKGADHVVLMMGLDQTQ 501
+ ITPL+A+++ V +Y D V S +++ ++ KG D V+ + G+
Sbjct: 415 SKVITPLEAIRSKVGKRTQVIYDRAIDCVKPSDEKTLNALIERLKGVDQVIFVGGISPRL 474
Query: 502 EKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDR 551
E EEL DR + LP Q EL+ ++ EA PVI V++ G + I + +
Sbjct: 475 EGEELPISVDGFRGGDRTTIALPEVQTELMKKMKEAG-LPVIFVMMTGSALGIEWES--Q 531
Query: 552 NIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGN 611
NI +IL A Y G+ A+A+V+FGD+NP G+LP+T+Y +D + P
Sbjct: 532 NIPAILNAWYGGQFAGQAIADVLFGDYNPSGKLPVTFY--------RSDSDLPPFGAFSM 583
Query: 612 PGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVP 671
RTYR+++G+ ++PFG GLSY+ + Y SVP
Sbjct: 584 ANRTYRYFKGEALYPFGFGLSYTMFDY------------------------------SVP 613
Query: 672 ELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGR-PIKQLVGFQSVILNAK 730
++ + V++ VKN G+ G V L++ + G + PI L GF+ V L A
Sbjct: 614 QVVSGGKVGEPIKVSVKVKNIGKKNGDEVVQLYL--SHEGVEKAPITALKGFKRVYLKAG 671
Query: 731 EKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
E+ + FE+SP +S ++G++ + G + G
Sbjct: 672 EEKTLSFEISP-RDMSLPDDNGIITVFPGKKTIYAG 706
>gi|340519849|gb|EGR50086.1| glycoside hydrolase family 3 [Trichoderma reesei QM6a]
Length = 796
Score = 426 bits (1096), Expect = e-116, Method: Compositional matrix adjust.
Identities = 277/763 (36%), Positives = 405/763 (53%), Gaps = 77/763 (10%)
Query: 35 PSNPSTETFP-----------FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLG 83
P+ P T +FP C TT I++RA +V +TL+EK++ + +SA RLG
Sbjct: 53 PNPPLTPSFPDCTRDPLCSNDVCDTTKSIAERAAAIVKPMTLNEKVANVGSSASGSARLG 112
Query: 84 IPAYEWWSEALHGVAGVGKGIFFN----GTIRGATSFPQVILTAASFDSYLWYRIGQAIG 139
+PAY+W +EALHGVAG G+ F ATSFP IL +A+FD L + AI
Sbjct: 113 LPAYQWQNEALHGVAG-STGVQFQSPLGANFSAATSFPMPILLSAAFDDALVKSVATAIS 171
Query: 140 LEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTF 199
EARA N G A G+ FW PNIN FRDPRWGRG ETPGED Y ++ V G+QG
Sbjct: 172 TEARAFANYGFA-GLDFWTPNINPFRDPRWGRGMETPGEDAFRIQGYVLALVDGLQG--- 227
Query: 200 NGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRA 259
G + + CKHF AYD++N R + T QD+AD Y P FE+CV+ +
Sbjct: 228 --GIDPDFYRTLSTCKHFAAYDIEN----GRTANNLSPTQQDMADYYLPMFETCVRDAKV 281
Query: 260 SGIMCAYNRVNGIPSCADRNLLSKTARRQWGF---HGYITSDCDAVSIIYDAEGYAKSPE 316
+ IMCAYN V+G+P+CAD LL R +GF Y+ SDCDAV ++D YA +
Sbjct: 282 ASIMCAYNAVDGVPACADSYLLQDVLRDTYGFTEDFNYVVSDCDAVENVFDPHHYAANLT 341
Query: 317 DAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTM 376
A + AG D++CGS A+V+ E+ +D++L L+S +++G F+
Sbjct: 342 QAAAMSINAGTDLDCGSSYNV-LNASVQAGLTTEATLDKSLIRLYSALVKVGYFD----- 395
Query: 377 QP--FGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSA 434
QP + +G V + Q LA AA +G+ LLKN G LPL ++ S ++A+IGP AN
Sbjct: 396 QPAEYNSLGWGNVNTTQSQALAHDAATEGMTLLKN-DGTLPLSRTLS-NVAVIGPWANVT 453
Query: 435 KTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLM 494
+ GNYAG + + PL Q N Y G + ++ + A+ A +D +V +
Sbjct: 454 TQMQGNYAGTAPLLVNPLSVFQQKWRNVKYAQGTAINSQDTSGFNAALSAASSSDVIVYL 513
Query: 495 MGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIG 554
G+D + E E DR + PG Q LI+++A KP+++V GG +D + + +
Sbjct: 514 GGIDISVENEGFDRSSITWPGNQLNLISQLANLG-KPLVIVQFGGGQIDDSALLSNSKVN 572
Query: 555 SILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPG 613
SILWAGYPG+ G A+ +V+ G + P GRLP+T YP +Y+ + DM +RP ++G PG
Sbjct: 573 SILWAGYPGQDGGNAIFDVLTGANPPAGRLPVTQYPANYVNNNNIQDMNLRP--SNGIPG 630
Query: 614 RTYRFYEGKEVFPFGCGLSYSKYSYKFKAV----SQNKLYLNQSSSTKMVENQDVVHYKS 669
RTY +Y G V PFG GL Y+ +S F++ S +N + S N+D+ + +
Sbjct: 631 RTYAWYTGTPVLPFGYGLHYTNFSLSFQSTKTAGSDIATLVNNAGS-----NKDLATFAT 685
Query: 670 VPELGTEFCETRKFLVTIGVKNHG---EMAGKHPVLLFVKPARRGNG-RPIKQLVGF--- 722
+ + VKN G +A + LLF+K G P KQL +
Sbjct: 686 ---------------IVVNVKNTGGKANLASDYVGLLFLKSTNAGPAPHPNKQLAAYGRV 730
Query: 723 QSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVV 765
++V + A ++ + L SL+RA +G I G + L++
Sbjct: 731 RNVGVGATQQLTLTVNLG---SLARADTNGDRWIYPGAYTLIL 770
>gi|358393086|gb|EHK42487.1| glycoside hydrolase family 3 protein [Trichoderma atroviride IMI
206040]
Length = 794
Score = 426 bits (1095), Expect = e-116, Method: Compositional matrix adjust.
Identities = 267/733 (36%), Positives = 396/733 (54%), Gaps = 40/733 (5%)
Query: 46 CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVG---K 102
C +T +RA+ L+S TL+E I NS P +PRLG+P Y+ W+EALHG+ K
Sbjct: 64 CDSTAGYVERAQALISLFTLEELILNTQNSGPGVPRLGLPNYQVWNEALHGLDRANFATK 123
Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
G G TSFP IL+ A+ + L ++I I +ARA N G+ G+ +APNIN
Sbjct: 124 G----GEFEWGTSFPMPILSMAALNRTLIHQIADIISTQARAFSNNGR-YGLDVYAPNIN 178
Query: 163 IFRDPRWGRGQETPGEDP-LVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
FR P WGRGQETPGED ++T Y Y+ G+QG G L+ +A KHF YD
Sbjct: 179 GFRSPLWGRGQETPGEDANVLTSAYTYEYITGMQG-----GVDPENLKIAATAKHFAGYD 233
Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
L+N+ +R FDA +T QDL++ Y P F + + ++ MCAYN VNG+PSC++ L
Sbjct: 234 LENYNNQSRLGFDAIITQQDLSEYYTPQFLAASRYAKSHSFMCAYNSVNGVPSCSNSFFL 293
Query: 282 SKTARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT 339
R WGF +GY++SDCDA+ +++ YA S A D LKAG D++CG H
Sbjct: 294 QTLLRESWGFPEYGYVSSDCDAIYNVWNPHNYANSQSSAAADSLKAGTDIDCGQTYPWHL 353
Query: 340 KAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQA 399
+ + EI+R++ L++ +RLG F+ + G DVV + A + + +A
Sbjct: 354 NESFVAGTVSRGEIERSVTRLYANLVRLGYFDKKNEYRSLG--WKDVVKTDAWNI-SYEA 410
Query: 400 AQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV 459
A +GIVLLKN G LPL K K S+ALIGP N+ + L GNY G + I+PLQA +
Sbjct: 411 AVEGIVLLKND-GTLPLSK-KVRSIALIGPWVNATEQLQGNYFGTAPYLISPLQAAKKAG 468
Query: 460 ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQE 519
Y G ++A KA+ AK +D ++ + G+D T E+E DR D+ PG Q +
Sbjct: 469 YEVNYELGTGINNQTTAGFAKAIAAAKKSDAIIFIGGIDNTIEQEGADRTDIAWPGNQLD 528
Query: 520 LITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHN 579
LI +++E KP++++ + GG VD + K ++ + S++W GYPG++G AL +++ G
Sbjct: 529 LIKQLSEVG-KPLVVLQMGGGQVDSSSIKSNKKVNSLVWGGYPGQSGGYALFDILSGKRA 587
Query: 580 PGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSY 638
P GRL T YP +Y+ + DM +RP NPG+TY +Y GK V+ FG GL Y+ +
Sbjct: 588 PAGRLVSTQYPAEYVHQFAQNDMNLRPDGKK-NPGQTYIWYTGKPVYQFGDGLFYTTF-- 644
Query: 639 KFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCE-TRKFLVTIGVKNHGEMAG 697
K L + S+ K +Q + G + E T F T ++N G+ A
Sbjct: 645 --------KETLGKQSTLKFNASQIL----GAGHPGYTYSEQTPVFTFTANIQNSGKTAS 692
Query: 698 KHPVLLFVKPARRG-NGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVI 756
+ + FV+ + G P K LVGF + + + P +LSR +G ++
Sbjct: 693 PYSAMAFVRTSNAGPKPYPNKWLVGFDRLATIKPGHSSTLSIPIPLNALSRVDSNGNKIV 752
Query: 757 EEGTHFLVVGDEE 769
G + LV+ +E
Sbjct: 753 YPGKYELVLNTDE 765
>gi|292495281|sp|C0STH4.1|XYND_ASPAC RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|225878711|dbj|BAH30675.1| beta-xylosidase [Aspergillus aculeatus]
Length = 805
Score = 426 bits (1095), Expect = e-116, Method: Compositional matrix adjust.
Identities = 264/737 (35%), Positives = 396/737 (53%), Gaps = 37/737 (5%)
Query: 46 CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
C +T RA LVS TL+E I+ N++P +PRLG+P Y+ WSEALHG+ G+ F
Sbjct: 60 CDSTASPYDRAAALVSLFTLEELIANTGNTSPGVPRLGLPPYQVWSEALHGL---GRANF 116
Query: 106 F-NGTIR-GATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
NG + G SFP IL+AA+F+ L +I I + RA NAG+ G+ ++PNIN
Sbjct: 117 TDNGALHAGRPSFPSPILSAAAFNRTLINQIASIISTQGRAFNNAGR-FGLDVYSPNINT 175
Query: 164 FRDPRWGRGQETPGEDPL-VTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL 222
FR P WGRGQETPGED +T YA Y+ G+QG G L+ +A KHF YD+
Sbjct: 176 FRHPVWGRGQETPGEDAYTLTAAYAYEYITGIQG-----GVNPEHLKLAATAKHFAGYDI 230
Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
+NW +R D +T QDLA+ Y P F + MC+YN VNG+PSC++ L
Sbjct: 231 ENWDNHSRLGNDVNITQQDLAEYYTPQFLVAARDAHVHSFMCSYNAVNGVPSCSNTFFLQ 290
Query: 283 KTARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTK 340
R + F HGY++ DC AV +++ GYA + A D + AG D++CG+ Q H
Sbjct: 291 TLLRDTFSFVDHGYVSGDCGAVYGVFNPHGYAANEPSAAADAILAGTDIDCGTSYQYHFN 350
Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGN-PTMQPFGKIGADVVCSPAHQVLALQA 399
++ + +I+R L++ + LG F+GN + P+ +G V ++ +A
Sbjct: 351 ESITTGAVARDDIERGFIRLYANLVELGYFDGNSSSSNPYRSLGWPDVQKTDAWNISYEA 410
Query: 400 AQDGIVLLKNSHGLLPL---PKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQ 456
A +GIVLLKN G LPL + K+ S+ALIGP AN+ L GNY G + I+P+ A
Sbjct: 411 AVEGIVLLKND-GTLPLASPSEGKNKSIALIGPWANATTQLQGNYYGDAPYLISPVDAFT 469
Query: 457 NYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGR 516
Y PG + S+A+ A+ A+ AD +V + G+D T E E DR + PG
Sbjct: 470 AAGYTVHYAPGTEISTNSTANFSAALSAARAADTIVFLGGIDNTIEAEAQDRSSIAWPGN 529
Query: 517 QQELITRVA--EAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVI 574
Q ELI+++A ++ +P+++ + GG VD + K++ + ++LW GYPG++G +AL +++
Sbjct: 530 QLELISQLAAQKSDDQPLVVYQMGGGQVDSSSLKFNAKVNALLWGGYPGQSGGLALRDIL 589
Query: 575 FGDHNPGGRLPMTWYPQDYIK-VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSY 633
G P GRL T YP Y + DM +RP T+ NPG+TY +Y G+ V+ FG GL Y
Sbjct: 590 TGARAPAGRLTTTQYPAAYAESFSALDMNLRPNETTQNPGQTYMWYTGEPVYAFGHGLFY 649
Query: 634 SKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHG 693
+ F A S Q++ TK N + + P+ T + F T + N G
Sbjct: 650 T----TFNASSA------QAAKTKYTFNITDLTSAAHPDT-TTVGQRTLFNFTASITNSG 698
Query: 694 EMAGKHPVLLFVKPARRG-NGRPIKQLVGFQSVILNAKE--KAEIVFELSPCESLSRARE 750
+ + L++ + G + P K LVGF + AKE AE+ ++ + L+R E
Sbjct: 699 QRDSDYTALVYANTSTAGPSPYPNKWLVGFDRLAAVAKEGGTAELNVPVA-VDRLARVDE 757
Query: 751 DGLMVIEEGTHFLVVGD 767
G V+ G + + + +
Sbjct: 758 AGNTVLFPGRYEVALNN 774
>gi|391872736|gb|EIT81831.1| beta-glucosidase-related glycosidase [Aspergillus oryzae 3.042]
Length = 798
Score = 426 bits (1094), Expect = e-116, Method: Compositional matrix adjust.
Identities = 262/743 (35%), Positives = 394/743 (53%), Gaps = 37/743 (4%)
Query: 36 SNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALH 95
S P ++T C T+ RA LVS LT +E ++ N+ PR+G+PAY+ W+EALH
Sbjct: 54 SGPLSKTL-VCDTSAKPHDRAAALVSLLTFEELVNNTANTGHGAPRIGLPAYQVWNEALH 112
Query: 96 GVAGVGKGIFFN-GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGM 154
GVA F + G +TSFPQ I T A+ + L ++I I + RA NAG+ G+
Sbjct: 113 GVA---HADFSDAGDFSWSTSFPQPISTMAALNRTLIHQIATIISTQGRAFMNAGR-YGL 168
Query: 155 TFWAPNINIFRDPRWGRGQETPGEDPL-VTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
++PNIN FR P WGRGQETPGED + YA Y+ G+QG G L+ A
Sbjct: 169 DVYSPNINTFRHPVWGRGQETPGEDAYCLASTYAYEYITGIQG-----GVDANPLKLIAT 223
Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
KH+ YD++NW +R D ++T QDLA+ Y P F + + +MC+YN VNG+P
Sbjct: 224 AKHYAGYDIENWDNHSRLGNDMQITQQDLAEYYTPQFLVASRDAKVHSVMCSYNAVNGVP 283
Query: 274 SCADRNLLSKTARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC 331
SC++ L R + F GY++ DC AV +++ GYA + A D ++AG D++C
Sbjct: 284 SCSNSFFLQTLLRDTFDFVEDGYVSGDCGAVYNVFNPHGYATNESSAAADSIRAGTDIDC 343
Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
G +H + + +++ +++R + L++ +R G F+G + P+ I V S
Sbjct: 344 GVSYPRHFQESFHDQEVSRQDLERGVTRLYASLIRAGYFDGKTS--PYRNITWSDVVSTN 401
Query: 392 HQVLALQAAQDGIVLLKNSHGLLPL--PKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
Q L+ +AA IVLLKN G+LPL S + ++ALIGP AN+ +LGNY GP+ I
Sbjct: 402 AQNLSYEAAAQSIVLLKND-GILPLTSTSSSTKTIALIGPWANATTQMLGNYYGPAPYLI 460
Query: 450 TPLQALQNYVENTVYYPGCDTVAC--SSASIDKAVDIAKGADHVVLMMGLDQTQEKEELD 507
+PLQA Q+ E + Y S S A+ AK AD ++ G+D T E E D
Sbjct: 461 SPLQAFQD-SEYKITYTIGTNTTTDPDSTSQSTALTTAKEADLIIFAGGIDNTLETEAQD 519
Query: 508 RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGA 567
R ++ P Q LIT++A+ KP+I++ + GG VD + K ++N+ +++W GYPG++G
Sbjct: 520 RSNITWPSNQLSLITKLADLG-KPLIVLQMGGGQVDSSALKNNKNVNALIWGGYPGQSGG 578
Query: 568 VALAEVIFGDHNPGGRLPMTWYPQDYIKV-PMTDMKMRPQATSGNPGRTYRFYEGKEVFP 626
ALA++I G P RL T YP +Y +V P DM +RP + NPG+TY +Y G V+
Sbjct: 579 QALADIITGKRAPAARLVTTQYPAEYAEVFPAIDMNLRPNGS--NPGQTYMWYTGTPVYE 636
Query: 627 FGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFL-V 685
FG GL Y+ ++ A S TK + ++ P G + E L
Sbjct: 637 FGHGLFYTNFTASASA----------GSGTKNRTSFNIDEVLGRPHPGYKLVEQMPLLNF 686
Query: 686 TIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESL 745
T+ VKN G+ + + FV P K LVGF + A+ + +SL
Sbjct: 687 TVDVKNTGDRVSDYTAMAFVNTTAGPAPHPNKWLVGFDRLSAVEPGSAKTMVIPVTVDSL 746
Query: 746 SRAREDGLMVIEEGTHFLVVGDE 768
+R E+G V+ G + + + +E
Sbjct: 747 ARTDEEGNRVLYPGRYEVALNNE 769
>gi|358385386|gb|EHK22983.1| glycoside hydrolase family 3 protein [Trichoderma virens Gv29-8]
Length = 795
Score = 425 bits (1092), Expect = e-116, Method: Compositional matrix adjust.
Identities = 269/733 (36%), Positives = 400/733 (54%), Gaps = 39/733 (5%)
Query: 46 CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVG---K 102
C ++ ++RA+ L+S TL+E I NS P +PRLG+P Y+ W+EALHG+ K
Sbjct: 63 CDSSAGYAERAQALISLFTLEELILNTQNSGPGVPRLGLPNYQVWNEALHGLDRANFATK 122
Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
G G + ATSFP IL+ A+ + L ++I I +ARA N+G+ G+ +APNIN
Sbjct: 123 G----GQFQWATSFPMPILSMAALNRTLIHQIADIISTQARAFSNSGR-YGLDVYAPNIN 177
Query: 163 IFRDPRWGRGQETPGEDP-LVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
FR P WGRGQETPGED ++T Y Y+ G+QG G L+ +A KHF YD
Sbjct: 178 GFRSPLWGRGQETPGEDANVLTSAYTYEYITGMQG-----GVDPENLKIAATAKHFAGYD 232
Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
L+NW +R FDA +T QDL++ Y P F + + ++ MCAYN VNG+PSCA+ L
Sbjct: 233 LENWNNQSRLGFDAIITQQDLSEYYTPQFLAASRYAKSHSFMCAYNSVNGVPSCANSFFL 292
Query: 282 SKTARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT 339
R WGF GY++SDCDAV +++ YA + A L+AG D++CG H
Sbjct: 293 QTLLRESWGFPEWGYVSSDCDAVYNVWNPHDYASNQSSAAASSLRAGTDIDCGQTYPWHL 352
Query: 340 KAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQA 399
+ ++ EI+R++ L++ +RLG F+ + G DVV + A + + +A
Sbjct: 353 NESFVAGEVSRGEIERSVTRLYANLVRLGYFDKKNEYRSLG--WKDVVKTDAWNI-SYEA 409
Query: 400 AQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV 459
A +GIVLLKN G LPL K K S+ALIGP AN+ + GNY G + I+PL+A +
Sbjct: 410 AVEGIVLLKND-GTLPLSK-KVRSIALIGPWANATTQMQGNYFGAAPYLISPLEAAKKAG 467
Query: 460 ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQE 519
+ G +T + S+A KA+ AK +D ++ G+D T E+E DR D+ PG Q +
Sbjct: 468 YQVNFELGTETASTSTAGFAKAIAAAKKSDAIIFAGGIDNTVEQEGADRTDIAWPGNQLD 527
Query: 520 LITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHN 579
LI +++E KP++++ + GG VD + K ++ + S++W GYPG++G VAL +++ G
Sbjct: 528 LIKQLSELG-KPLVVLQMGGGQVDSSSLKSNKKVNSLVWGGYPGQSGGVALFDILSGKRA 586
Query: 580 PGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSY 638
P GRL T YP DY+ + P DM +RP S NPG+TY +Y GK V+ FG G+ Y+ +
Sbjct: 587 PAGRLVSTQYPADYVHQFPQNDMNLRPDGKS-NPGQTYIWYTGKPVYQFGDGIFYTTFKE 645
Query: 639 KFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLV-TIGVKNHGEMAG 697
S+ L N SS + P G + E L T ++N G+
Sbjct: 646 TLSGSSKG-LKFNVSSVL------------AAPHPGYTYSEQTPVLTFTANIENSGKTDS 692
Query: 698 KHPVLLFVKPARRGNG-RPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVI 756
+ +LFV+ A G P K LVGF + + + P +L+R G ++
Sbjct: 693 PYSAMLFVRTANAGPAPYPNKWLVGFDRLATIKPGHSSKLSIPIPVSALARVDSLGNRIV 752
Query: 757 EEGTHFLVVGDEE 769
G + L + +E
Sbjct: 753 YPGKYELALNTDE 765
>gi|359473427|ref|XP_002265788.2| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 1-like
[Vitis vinifera]
Length = 464
Score = 425 bits (1092), Expect = e-116, Method: Compositional matrix adjust.
Identities = 208/455 (45%), Positives = 297/455 (65%), Gaps = 12/455 (2%)
Query: 145 LYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG--DTFNGG 202
+YN G A G+TFW+PNIN+ RD RWGR QET EDP + G++AV+YVRG+Q T N
Sbjct: 1 MYNLGHA-GLTFWSPNINVVRDTRWGRTQETSREDPFMVGEFAVNYVRGLQDVEGTENVT 59
Query: 203 KLKGK-LQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASG 261
L + L+ S+CCKH+ AYD+D+W R+ FDARV+ QD+ +T+ PFE CV++G S
Sbjct: 60 DLNSRPLKVSSCCKHYAAYDIDSWLNIDRHTFDARVSEQDMKETFVSPFERCVREGDVSS 119
Query: 262 IMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAK-SPEDAVV 320
+MC++N++NGIP C+D LL R +W HGYI SDC + +I D + Y S DAV
Sbjct: 120 VMCSFNKINGIPPCSDPRLLKGVIRDEWDLHGYIVSDCYGLEVIVDNQNYLNDSKVDAVA 179
Query: 321 DVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFG 380
L+AG+D+ CG + V K+ + E+DRAL N++ + MR+G F+G P +
Sbjct: 180 KTLQAGLDLECGHYYTDALNELVLTGKVSQYELDRALKNIYVLLMRVGYFDGIPAYE--- 236
Query: 381 KIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGN 440
+G +C+ H LA +AA+ GIVLLKN + + PL K LAL+GP+AN+ + ++GN
Sbjct: 237 SLGLKDICAADHIELAREAARQGIVLLKNDYEVFPLKPGKK--LALVGPHANATEVMIGN 294
Query: 441 YAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSAS-IDKAVDIAKGADHVVLMMGLDQ 499
YAG + ++PL+A + + N Y GC +CS+ + +A + AK A+ ++ +G D
Sbjct: 295 YAGLPRKYVSPLEAF-SAIGNVTYTTGCLDASCSNDTYFSEAKEAAKSAEVTIIFVGTDL 353
Query: 500 TQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWA 559
+ E E +DRVD +LPG Q ELI +VAE + PVILV+L G +DITFAK + I +ILW
Sbjct: 354 SIEAEFVDRVDFLLPGNQTELIKQVAEVSSGPVILVVLSGSNIDITFAKNNPRISAILWV 413
Query: 560 GYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI 594
G+PGE G A+A+V+FG +NPGGRLP+TWY DY+
Sbjct: 414 GFPGEQGGHAIADVVFGKYNPGGRLPVTWYEADYV 448
>gi|3135209|dbj|BAA28267.1| beta-xylosidase A [Aspergillus oryzae]
Length = 798
Score = 425 bits (1092), Expect = e-116, Method: Compositional matrix adjust.
Identities = 262/743 (35%), Positives = 394/743 (53%), Gaps = 37/743 (4%)
Query: 36 SNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALH 95
S P ++T C T+ RA LVS LT +E ++ N+ PR+G+PAY+ W+EALH
Sbjct: 54 SGPLSKTL-VCDTSAKPHDRAAALVSLLTFEELVNNTANTGHGAPRIGLPAYQVWNEALH 112
Query: 96 GVAGVGKGIFFN-GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGM 154
GVA F + G +TSFPQ I T A+ + L ++I I + RA NAG+ G+
Sbjct: 113 GVA---HADFSDAGDFSWSTSFPQPISTMAALNRTLIHQIATIISTQGRAFMNAGR-YGL 168
Query: 155 TFWAPNINIFRDPRWGRGQETPGEDPL-VTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
++PNIN FR P WGRGQETPGED + YA Y+ G+QG G L+ A
Sbjct: 169 DVYSPNINTFRHPVWGRGQETPGEDAYCLASTYAYEYITGIQG-----GVDANPLKLIAT 223
Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
KH+ YD++NW +R D ++T QDLA+ Y P F + + +MC+YN VNG+P
Sbjct: 224 AKHYAGYDIENWDNHSRLGNDMQITQQDLAEYYTPQFLVASRDAKVHSVMCSYNAVNGVP 283
Query: 274 SCADRNLLSKTARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC 331
SC++ L R + F GY++ DC AV +++ GYA + A D ++AG D++C
Sbjct: 284 SCSNSFFLQTLLRDTFDFVEDGYVSGDCGAVYNVFNPHGYATNESSAAADSIRAGTDIDC 343
Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
G +H + + +++ +++R + L++ +R G F+G + P+ I V S
Sbjct: 344 GVSYPRHFQESFHDQEVSRQDLERGVIRLYASLIRAGYFDGKTS--PYRNITWSDVVSTN 401
Query: 392 HQVLALQAAQDGIVLLKNSHGLLPL--PKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
Q L+ +AA IVLLKN G+LPL S + ++ALIGP AN+ +LGNY GP+ I
Sbjct: 402 AQNLSYEAAAQSIVLLKND-GILPLTSTSSSTKTIALIGPWANATTQMLGNYYGPAPYLI 460
Query: 450 TPLQALQNYVENTVYYPGCDTVAC--SSASIDKAVDIAKGADHVVLMMGLDQTQEKEELD 507
+PLQA Q+ E + Y S S A+ AK AD ++ G+D T E E D
Sbjct: 461 SPLQAFQD-SEYKITYTIGTNTTTDPDSTSQSTALTTAKEADLIIFAGGIDNTLETEAQD 519
Query: 508 RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGA 567
R ++ P Q LIT++A+ KP+I++ + GG VD + K ++N+ +++W GYPG++G
Sbjct: 520 RSNITWPSNQLSLITKLADLG-KPLIVLQMGGGQVDSSALKNNKNVNALIWGGYPGQSGG 578
Query: 568 VALAEVIFGDHNPGGRLPMTWYPQDYIKV-PMTDMKMRPQATSGNPGRTYRFYEGKEVFP 626
ALA++I G P RL T YP +Y +V P DM +RP + NPG+TY +Y G V+
Sbjct: 579 QALADIITGKRAPAARLVTTQYPAEYAEVFPAIDMNLRPNGS--NPGQTYMWYTGTPVYE 636
Query: 627 FGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFL-V 685
FG GL Y+ ++ A S TK + ++ P G + E L
Sbjct: 637 FGHGLFYTNFTASASA----------GSGTKNRTSFNIDEVLGRPHPGYKLVEQMPLLNF 686
Query: 686 TIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESL 745
T+ VKN G+ + + FV P K LVGF + A+ + +SL
Sbjct: 687 TVDVKNTGDRVSDYTAMAFVNTTAGPAPHPNKWLVGFDRLSAVEPGSAKTMVIPVTVDSL 746
Query: 746 SRAREDGLMVIEEGTHFLVVGDE 768
+R E+G V+ G + + + +E
Sbjct: 747 ARTDEEGNRVLYPGRYEVALNNE 769
>gi|2723496|dbj|BAA24107.1| beta-1,4-xylosidase [Aspergillus oryzae]
Length = 798
Score = 424 bits (1091), Expect = e-116, Method: Compositional matrix adjust.
Identities = 262/743 (35%), Positives = 394/743 (53%), Gaps = 37/743 (4%)
Query: 36 SNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALH 95
S P ++T C T+ RA LVS LT +E ++ N+ PR+G+PAY+ W+EALH
Sbjct: 54 SGPLSKTL-VCDTSAKPHDRAAALVSLLTFEELVNNTANTGHGAPRIGLPAYQVWNEALH 112
Query: 96 GVAGVGKGIFFN-GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGM 154
GVA F + G +TSFPQ I T A+ + L ++I I + RA NAG+ G+
Sbjct: 113 GVA---HADFSDAGDFSWSTSFPQPISTMAALNRTLIHQIATIISTQGRAFMNAGR-YGL 168
Query: 155 TFWAPNINIFRDPRWGRGQETPGEDPL-VTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
++PNIN FR P WGRGQETPGED + YA Y+ G+QG G L+ A
Sbjct: 169 DVYSPNINTFRHPVWGRGQETPGEDAYCLASTYAYEYITGIQG-----GVDANPLKLIAT 223
Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
KH+ YD++NW +R D ++T QDLA+ Y P F + + +MC+YN VNG+P
Sbjct: 224 AKHYAGYDIENWDNHSRLGNDMQITQQDLAEYYTPQFLVASRDAKVHSVMCSYNAVNGVP 283
Query: 274 SCADRNLLSKTARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC 331
SC++ L R + F GY++ DC AV +++ GYA + A D ++AG D++C
Sbjct: 284 SCSNSFFLQTLLRDTFDFVEDGYVSGDCGAVYNVFNPHGYATNESSAAADSIRAGTDIDC 343
Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
G +H + + +++ +++R + L++ +R G F+G + P+ I V S
Sbjct: 344 GVSYPRHFQESFHDQEVSRQDLERGVIRLYASLIRAGYFDGKTS--PYRNITWSDVVSTN 401
Query: 392 HQVLALQAAQDGIVLLKNSHGLLPL--PKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
Q L+ +AA IVLLKN G+LPL S + ++ALIGP AN+ +LGNY GP+ I
Sbjct: 402 AQNLSYEAAAQSIVLLKND-GILPLTSTSSSTKTIALIGPWANATTQMLGNYYGPAPYLI 460
Query: 450 TPLQALQNYVENTVYYPGCDTVAC--SSASIDKAVDIAKGADHVVLMMGLDQTQEKEELD 507
+PLQA Q+ E + Y S S A+ AK AD ++ G+D T E E D
Sbjct: 461 SPLQAFQD-SEYKITYTIGTNTTTDPDSTSQSTALTTAKEADLIIFAGGIDNTLETEAQD 519
Query: 508 RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGA 567
R ++ P Q LIT++A+ KP+I++ + GG VD + K ++N+ +++W GYPG++G
Sbjct: 520 RSNITWPSNQLSLITKLADLG-KPLIVLQMGGGQVDSSALKNNKNVNALIWGGYPGQSGG 578
Query: 568 VALAEVIFGDHNPGGRLPMTWYPQDYIKV-PMTDMKMRPQATSGNPGRTYRFYEGKEVFP 626
ALA++I G P RL T YP +Y +V P DM +RP + NPG+TY +Y G V+
Sbjct: 579 QALADIITGKRAPAARLVTTQYPAEYAEVFPAIDMNLRPNGS--NPGQTYMWYTGTPVYE 636
Query: 627 FGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFL-V 685
FG GL Y+ ++ A S TK + ++ P G + E L
Sbjct: 637 FGHGLFYTNFTASASA----------GSGTKNRTSFNIDEVLGRPHPGYKLVEQMPLLNF 686
Query: 686 TIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESL 745
T+ VKN G+ + + FV P K LVGF + A+ + +SL
Sbjct: 687 TVDVKNTGDRVSDYTAMAFVNTTAGPAPHPNKWLVGFDRLSAVEPGSAKTMVIPVTVDSL 746
Query: 746 SRAREDGLMVIEEGTHFLVVGDE 768
+R E+G V+ G + + + +E
Sbjct: 747 ARTDEEGNRVLYPGRYEVALNNE 769
>gi|380293100|gb|AFD50200.1| beta-xylosidase [Hypocrea orientalis]
Length = 797
Score = 424 bits (1089), Expect = e-115, Method: Compositional matrix adjust.
Identities = 270/735 (36%), Positives = 404/735 (54%), Gaps = 43/735 (5%)
Query: 46 CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVG---K 102
C ++ +RA+ L+S TL+E I NS P +PRLG+P Y+ W+EALHG+ K
Sbjct: 63 CDSSAGYVERAQALISLFTLEELILNTQNSGPGVPRLGLPNYQVWNEALHGLDRANFATK 122
Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
G G ATSFP ILT A+ + L ++I I +ARA N+G+ G+ +APN+N
Sbjct: 123 G----GQFEWATSFPMPILTTAALNRTLIHQIADIISTQARAFSNSGR-YGLDVYAPNVN 177
Query: 163 IFRDPRWGRGQETPGEDPL-VTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
FR P WGRGQETPGED ++ Y Y+ G+QG G +L+ +A KHF YD
Sbjct: 178 GFRSPLWGRGQETPGEDAFFLSSAYTYEYITGIQG-----GVDPEQLKVAATVKHFAGYD 232
Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
L+NW +R FDA +T QDL++ Y P F + + ++ +MC+YN VNG+PSCA+ L
Sbjct: 233 LENWNNQSRLGFDAIITQQDLSEYYTPQFLAAARYAKSRSLMCSYNSVNGVPSCANSFFL 292
Query: 282 SKTARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT 339
R WGF GY++SDCDAV +++ YA + A L+AG D++CG H
Sbjct: 293 QTLLRESWGFPEWGYVSSDCDAVYNVFNPHDYASNQSSAAASSLRAGTDIDCGQTYPWHL 352
Query: 340 KAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQA 399
+ ++ EI+R++ L++ +RLG F+ + G DVV + A + + +A
Sbjct: 353 NESFVAGEVTRGEIERSVTRLYANLVRLGYFDKKNQYRSLG--WKDVVKTDAWNI-SYEA 409
Query: 400 AQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV 459
A +GIVLLKN G LPL K K S+ALIGP AN+ + GNY GP+ I+PL+A +
Sbjct: 410 AVEGIVLLKND-GTLPLSK-KVRSIALIGPWANATTQMQGNYFGPAPYLISPLEAAKKAG 467
Query: 460 ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQE 519
+ + G + S+A KA+ AK +D +V + G+D T E+E DR D+ PG Q +
Sbjct: 468 YHVNFELGTEIAGNSTAGFAKAIAAAKKSDAIVYLGGIDNTIEQEGADRTDIAWPGNQLD 527
Query: 520 LITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHN 579
LI +++E KP++++ + GG VD + K ++ + S++W GYPG++G VAL +++ G
Sbjct: 528 LIKQLSEVG-KPLVVLQMGGGQVDSSSLKSNKKVNSLVWGGYPGQSGGVALFDILSGKRA 586
Query: 580 PGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSY 638
P GRL T YP +Y+ + P DM +RP S NPG+TY +Y GK V+ FG GL Y+ ++
Sbjct: 587 PAGRLITTQYPAEYVHQFPQNDMNLRPDGKS-NPGQTYIWYTGKPVYEFGSGLFYT--TF 643
Query: 639 KFKAVSQNK-LYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRK-FLVTIGVKNHGEMA 696
K S K L N SS S P G + E F +KN G+
Sbjct: 644 KETLASHPKCLKFNTSSIL------------SAPHPGYTYSEQIPVFTFEANIKNSGKTE 691
Query: 697 GKHPVLLFVKPARRGNG-RPIKQLVGFQSVI-LNAKEKAEIVFELSPCESLSRAREDGLM 754
+ +LFV+ + G P K LVGF + + +++ + P +L+R G
Sbjct: 692 SPYTAMLFVRTSNAGPAPYPNKWLVGFDRLADIKPGHSSKLSIPI-PVSALARVDSYGNR 750
Query: 755 VIEEGTHFLVVGDEE 769
++ G + L + +E
Sbjct: 751 IVYPGKYELALNTDE 765
>gi|336435507|ref|ZP_08615222.1| hypothetical protein HMPREF0988_00807 [Lachnospiraceae bacterium
1_4_56FAA]
gi|336000960|gb|EGN31106.1| hypothetical protein HMPREF0988_00807 [Lachnospiraceae bacterium
1_4_56FAA]
Length = 717
Score = 424 bits (1089), Expect = e-115, Method: Compositional matrix adjust.
Identities = 261/702 (37%), Positives = 387/702 (55%), Gaps = 85/702 (12%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
++A +LV ++TL EK SQL APAIPRL IPAY WW+E+LHGVA G A
Sbjct: 12 KQAEELVDQMTLMEKASQLRYDAPAIPRLHIPAYNWWNESLHGVARGGT----------A 61
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-------GMTFWAPNINIFRD 166
T FPQ I AASFD + IG+AI LE RA YNA + G+TFWAPN+NIFRD
Sbjct: 62 TVFPQAIGLAASFDREMLEEIGEAIALEGRAKYNAAVKLDDRDIYKGLTFWAPNVNIFRD 121
Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
PRWGRG ET GEDP ++ + VSY+RG+QGD ++A+AC KHF + +
Sbjct: 122 PRWGRGHETYGEDPYLSSRLGVSYIRGLQGDGET-------MKAAACAKHFAVH---SGP 171
Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
R++FDA V+ +DL +TY P F++CV++G +M AYN VNG P C LL K R
Sbjct: 172 EALRHEFDAEVSEKDLRETYLPAFQACVQEGHVEAVMGAYNCVNGEPCCGSETLLKKILR 231
Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTKAAVKQ 345
+WGF G++ SDC A+ ++ +P + ++AG D+NCG ++L H A ++
Sbjct: 232 EEWGFDGHVVSDCWAIKDFHENHLVTGTPVQSAALAMEAGCDLNCGVTYL--HLVHACQE 289
Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
+ E++I A LF+ R LG+F+G+ + + VV H+ L+ +AA++ IV
Sbjct: 290 GLVTEAQITEAAIRLFTTRFLLGMFDGS----EYDSVPYTVVECKEHRDLSERAARESIV 345
Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV-- 463
LLKN+ G+LPL + K ++ +IGPNA+S K L+GNY G S IT L+ ++ V + V
Sbjct: 346 LLKNN-GILPLDREKLKTIGIIGPNADSRKALIGNYHGTSSEYITVLEGVRRLVGDEVRI 404
Query: 464 -YYPGC-------DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---------L 506
Y GC + +A + +A +A+ +D V+L +GLD+T E EE
Sbjct: 405 LYSDGCHLYENKTENLAREQDRLSEARIVARESDVVILCLGLDETLEGEEGDTGNSYASG 464
Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
D+VDL LP Q+ L+ VA KKP +L L+ G +D++FA+ + LW YPG G
Sbjct: 465 DKVDLRLPKSQRMLMEAVA-MEKKPTVLCLMAGSDIDLSFAEKHFDAIVDLW--YPGAYG 521
Query: 567 AVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFP 626
A A+++FG +P G+LP+T+Y + D MR GRTYR+ E K +P
Sbjct: 522 GAAAADILFGKCSPSGKLPITFYESLEVLPSFEDYSMR--------GRTYRYLEQKAQYP 573
Query: 627 FGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVT 686
FG GL+Y+K + ++L +++ M E D + ++ +V
Sbjct: 574 FGYGLTYTKMKIR-------NVWL-ENAEKDMKEVTDGENAEAA------------VIVC 613
Query: 687 IGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILN 728
V+N G M + + ++++ + P L GF+ + +
Sbjct: 614 AEVENCGGMDSQEVLQIYIRDTESEHETPHPHLAGFERIFVE 655
>gi|367028614|ref|XP_003663591.1| glycoside hydrolase family 3 protein [Myceliophthora thermophila
ATCC 42464]
gi|347010860|gb|AEO58346.1| glycoside hydrolase family 3 protein [Myceliophthora thermophila
ATCC 42464]
Length = 760
Score = 423 bits (1088), Expect = e-115, Method: Compositional matrix adjust.
Identities = 286/749 (38%), Positives = 397/749 (53%), Gaps = 57/749 (7%)
Query: 34 DPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEA 93
D +N ++ C T+ RA LVS + +EK++ LVN++P + RLG+ AY+WW+EA
Sbjct: 27 DCANGPLKSNTVCDTSASPGARAAALVSVMNNNEKLANLVNNSPGVSRLGLSAYQWWNEA 86
Query: 94 LHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG 153
LHGVA +GI + G AT FPQ I T+A+FD L +IG I EARA N G+A
Sbjct: 87 LHGVAH-NRGITWGGEFSAATQFPQAITTSATFDDALIEQIGTIISTEARAFANNGRA-H 144
Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
+ FW PN+N FRDPRWGRG ETPGED K+A ++V+G+QG + A
Sbjct: 145 LDFWTPNVNPFRDPRWGRGHETPGEDAFKNKKWAEAFVKGMQGP-------GPTHRVIAT 197
Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
CKH+ AYDL+N TTR+ FDA+V+ QDLA+ Y PPF+ C + + IMC+YN VN IP
Sbjct: 198 CKHYAAYDLENSGSTTRFNFDAKVSTQDLAEYYLPPFQQCARDSKVGSIMCSYNAVNEIP 257
Query: 274 SCADRNLLSKTARRQWGF---HGYITSDCDAVSIIYDAEG---YAKSPEDAVVDVLKAGM 327
+CA+ L+ R+ W + H YI SDCDAV + +A G Y S A+ L+AG
Sbjct: 258 ACANPYLMDTILRKHWNWTDEHQYIVSDCDAVYYLGNANGGHRYKPSYAAAIGASLEAGC 317
Query: 328 DVNCGSF--LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD 385
D C + +A + ++ +D A+ + G F+G M + +
Sbjct: 318 DNMCWATGGTAPDPASAFNSGQFSQTTLDTAILRQMQGLVLAGYFDGPGGM--YRNLSVA 375
Query: 386 VVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKS-VSLALIGPNANSAKTLLGNYAGP 444
V + Q AL+AA+ GIVLLKN G+LPL + S +A+IG AN+A +LG Y+G
Sbjct: 376 DVNTQTAQDTALKAAEGGIVLLKND-GILPLSVNGSNFQVAMIGFWANAADKMLGGYSGS 434
Query: 445 SCRSITPLQALQ------NYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLD 498
+ P+ A + NYV + P DT A A++ A+ ++ VV G+D
Sbjct: 435 PPFNHDPVTAARSMGITVNYVNGPLTQPNGDTSA--------ALNAAQKSNAVVFFGGID 486
Query: 499 QTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILW 558
T EKE DR + P Q LI R+AE KPVI+V L G VD T N+ +ILW
Sbjct: 487 NTVEKESQDRTSIEWPSGQLALIRRLAETG-KPVIVVRL-GTHVDDTPLLSIPNVRAILW 544
Query: 559 AGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYR 617
AGYPG+ G A+ ++I G +P GRLP T YP Y + P T+M +RP +S PGRTYR
Sbjct: 545 AGYPGQDGGTAVVKIITGLASPAGRLPATVYPSSYTSQAPFTNMALRP--SSSYPGRTYR 602
Query: 618 FYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEF 677
+Y VFPFG GL Y+ +S VS + + + + D V Y +
Sbjct: 603 WYS-NAVFPFGHGLHYTNFS-----VSVRDFPASFAIADLLASCGDSVAY-------LDL 649
Query: 678 CETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVF 737
C V++ V N G + L F+ + PIK L ++ V N + V
Sbjct: 650 CPFPS--VSLNVTNTGTRVSDYVALGFLSGDFGPSPHPIKTLATYKRV-FNIEPGETQVA 706
Query: 738 ELS-PCESLSRAREDGLMVIEEGTHFLVV 765
EL ESL R E G V+ GT+ L+V
Sbjct: 707 ELDWKLESLVRVDEKGNRVLYPGTYTLLV 735
>gi|346225847|ref|ZP_08846989.1| beta-glucosidase [Anaerophaga thermohalophila DSM 12881]
gi|346227016|ref|ZP_08848158.1| beta-glucosidase [Anaerophaga thermohalophila DSM 12881]
Length = 718
Score = 423 bits (1087), Expect = e-115, Method: Compositional matrix adjust.
Identities = 266/759 (35%), Positives = 385/759 (50%), Gaps = 82/759 (10%)
Query: 31 FSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWW 90
F+ S + E F + + +RA +V +LT++EKI+QL+N+APA+ RL IP Y+WW
Sbjct: 3 FTFSFSLKAQEDCSFRNPDISLDERAECIVKQLTVEEKINQLMNAAPAVDRLEIPEYDWW 62
Query: 91 SEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-- 148
+E LHGVA G+ AT FPQ I AA++D+ L YR+G AI EARA YN
Sbjct: 63 NECLHGVARAGR----------ATVFPQAIGMAATWDTTLVYRVGDAISTEARAKYNVFS 112
Query: 149 -----GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGK 203
GQ G+TFW PN+NIFRDPRWGRGQET GEDP +T + VS+V+G+QG+
Sbjct: 113 KHGYRGQYKGLTFWTPNVNIFRDPRWGRGQETYGEDPFLTSRIGVSFVKGLQGNH----- 167
Query: 204 LKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIM 263
L+ +A KH+ + N R++FDA+V+M+DL +TY P FE+ VK+ G+M
Sbjct: 168 -PKYLKVAALAKHYAVH---NGPEALRHEFDAKVSMKDLWETYLPAFEALVKEAGVEGVM 223
Query: 264 CAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVL 323
AYNR NG P CA L+ + R +WGF GY SDC A+ Y +PE+A L
Sbjct: 224 GAYNRTNGDPCCAHPYLMQEVLREKWGFDGYYVSDCGAIMDFYTGHKIVDTPEEAAAMAL 283
Query: 324 KAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIG 383
AG ++NCG K+ +++ E EIDR++ LF R+RLGLF + P+ I
Sbjct: 284 NAGCNLNCGDTYASLLKS-LEKGLTTEEEIDRSVKQLFKTRLRLGLFAPEGAV-PYDTIS 341
Query: 384 ADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG 443
DV+ S HQ LAL+AA+ +VLLKN LP+ + + + GP A + LL NY G
Sbjct: 342 TDVIRSKEHQKLALEAARKSVVLLKNEANTLPVARDVK-KVYVTGPTATHVQALLANYYG 400
Query: 444 PSCRSITPLQALQNYVE---NTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQT 500
S T L+ + V + Y G + ++D A AD V +G+ Q
Sbjct: 401 VSEDMTTILEGIVGKVSPQTSVQYRQGALLYEANRNTMDWFSGAAASADVTVACLGISQL 460
Query: 501 QEKEEL---------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDR 551
E EE DR LP Q + + R+ +AKK +++V+ G + + YD
Sbjct: 461 IEGEEGEAIASEHRGDRERTRLPQNQIDFLKRIRASAKK-LVVVITSGSAISLP-EIYDM 518
Query: 552 NIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGN 611
++L+ YPGE G A+A+V+FGD P GRLP+T V + + P
Sbjct: 519 -ADALLYVWYPGEQGGKAVADVLFGDAVPSGRLPVT--------VVKSVDDLPPYENYDM 569
Query: 612 PGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVP 671
GRTYR+ E FPFG GLSY+ ++Y + NK+ +S
Sbjct: 570 KGRTYRYMEVSPQFPFGFGLSYTDFTYSNLTLESNKVKSGESVR---------------- 613
Query: 672 ELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKE 731
++ + N GE V ++ P + L+GF+ V L A E
Sbjct: 614 -------------LSFDLTNEGEYDADEVVQFYITDVEASVNVPKQSLIGFKRVGLAAGE 660
Query: 732 KAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEY 770
+I F ++P + + +G ++E G + +G Y
Sbjct: 661 STKIEFTVTP-DMMKIVDNNGEKILESGEFKIYIGGSSY 698
>gi|425780840|gb|EKV18836.1| Beta-xylosidase XylA [Penicillium digitatum PHI26]
gi|425783077|gb|EKV20946.1| Beta-xylosidase XylA [Penicillium digitatum Pd1]
Length = 792
Score = 423 bits (1087), Expect = e-115, Method: Compositional matrix adjust.
Identities = 264/765 (34%), Positives = 403/765 (52%), Gaps = 33/765 (4%)
Query: 16 CFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNS 75
++ + VD + P D SN C TT RA L S TL+E ++ N
Sbjct: 36 LYSETMAHVDFSFP----DCSNGPLSKTIVCDTTAKPHDRAAALTSMFTLEELVNSTGNV 91
Query: 76 APAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIG 135
PA+PRLG+P Y+ WSEALHG+ + +G ATSFP IL A+ + L +IG
Sbjct: 92 IPAVPRLGLPPYQVWSEALHGLDRAN--LTESGDYSWATSFPSPILIMAALNRTLINQIG 149
Query: 136 QAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQ 195
+ I + RA N G+ G+ +APNIN FR P WGRGQETPGED + Y V Y+ G+Q
Sbjct: 150 EIISTQGRAFNNGGR-YGLDVYAPNINSFRHPVWGRGQETPGEDVQLCSIYGVEYITGIQ 208
Query: 196 GDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVK 255
G G L+ +A KHF YDL+NW +R + ++ DLA Y P F + V+
Sbjct: 209 G-----GLNPRDLKLAATAKHFAGYDLENWGNHSRLGNNVAISSFDLASYYTPQFITAVR 263
Query: 256 QGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGF--HGYITSDCDAVSIIYDAEGYAK 313
R +M +YN VNG+PS A+ LL R W F GY++SDCDAV +++ GYA
Sbjct: 264 DARVHSVMSSYNAVNGVPSSANSFLLQTLLRETWNFVEDGYVSSDCDAVFNVFNPHGYAS 323
Query: 314 SPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGN 373
S A ++AG D++CG+ Q + ++ ++ SEI+RA+ +S + LG F+G+
Sbjct: 324 SASLAAAKSIQAGTDIDCGATYQLYLNESLSHDEISRSEIERAVTRFYSTLVSLGYFDGD 383
Query: 374 PTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANS 433
+ DVV + A + + +AA +GIVLLKN G LPL + + S+ALIGP AN
Sbjct: 384 NSKYRHLHW-PDVVATDAWNI-SYEAAVEGIVLLKND-GTLPL-SNNTRSVALIGPWANV 439
Query: 434 AKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVL 493
TL GNY G + PL ALQ + Y G + + S++ + A+ A ++ ++
Sbjct: 440 TTTLQGNYYGAAPYLTGPLAALQASNLDVNYAFGTNISSDSTSGFEAALSAAGKSEVIIF 499
Query: 494 MMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNI 553
G+D T E E +DR + PG Q +LI ++++ KP++++ + GG VD + K ++N+
Sbjct: 500 AGGIDNTVEAEGVDRESITWPGNQLQLIEQLSKLG-KPLVVLQMGGGQVDSSSLKANKNV 558
Query: 554 GSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDY-IKVPMTDMKMRPQATSGNP 612
S++W GYPG++G A+ +++ G P GRL +T YP +Y ++ P TDM +RP+ NP
Sbjct: 559 NSLVWGGYPGQSGGPAILDILTGKRAPAGRLTVTQYPAEYALQFPATDMSLRPKGN--NP 616
Query: 613 GRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPE 672
G+TY +Y GK V+ FG GL Y+ + K+ L + + D+V S P
Sbjct: 617 GQTYMWYTGKPVYEFGHGLFYTTF----------KVSLAHFHGAENGTSFDIVQLLSRPN 666
Query: 673 LGTEFCETRKFL-VTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKE 731
G E F+ T+ V N G + + + FV + P K LVGF + +
Sbjct: 667 AGYSVVEQIPFINYTVEVMNTGNVTSDYTAMAFVNTKAGPSPHPNKWLVGFDRLGGISPR 726
Query: 732 KAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
+ + ++++R E G ++ G + L + +E + F
Sbjct: 727 TTQTMTIPITLDNVARTDERGNRIVYPGKYELTLNNERSAVLSFT 771
>gi|225878709|dbj|BAH30674.1| beta-xylosidase [Aspergillus aculeatus]
Length = 785
Score = 422 bits (1085), Expect = e-115, Method: Compositional matrix adjust.
Identities = 262/737 (35%), Positives = 387/737 (52%), Gaps = 35/737 (4%)
Query: 46 CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
C T RA L S +TL+E ++ N PAIPRLG+P Y+ W+EALHG+
Sbjct: 62 CDRTASAHDRAAALTSMMTLEELMNSTGNRIPAIPRLGLPPYQIWNEALHGL--YLANFT 119
Query: 106 FNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFR 165
+G +TSFP ILT A+ + L ++I Q I + RA NAG+ G+ ++PNIN FR
Sbjct: 120 ESGPFSWSTSFPSPILTMATLNRTLIHQIAQIIATQGRAFNNAGR-YGLNAFSPNINAFR 178
Query: 166 DPRWGRGQETPGEDP-LVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
P WGRGQETPGED + YA Y+ G+QG+ N + A KH+ YD++N
Sbjct: 179 HPVWGRGQETPGEDANCLCSAYAYEYITGLQGNATNPKII-------ATAKHYAGYDIEN 231
Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
W+ +R+ D +T QDLA+ + P F V+ + +M +YN VNG+PS A+ LL
Sbjct: 232 WRQRSRFGNDLNITQQDLAEYFTPQFVVAVRDAQVRSVMPSYNAVNGVPSSANTFLLQTL 291
Query: 285 ARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAA 342
R WGF GY+ SDCDAV +++ GYA + A L+AG D++CG +
Sbjct: 292 VRDSWGFIQDGYMASDCDAVYNVFNPHGYAANLSSASAMSLRAGTDIDCGISYLTTLNES 351
Query: 343 VKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQD 402
+ Q ++ SEI+RA+ +S + G F+G P+ + V +A +AA
Sbjct: 352 LTQGQISRSEIERAVTRFYSNLVSAGYFDGPDA--PYRDLSWSDVVRTNRWNVAYEAAVA 409
Query: 403 GIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENT 462
G+VLLKN G+LPL KS +ALIGP AN+ + + GNY G + +PL A+Q
Sbjct: 410 GVVLLKND-GVLPLSKSVQ-RVALIGPWANATEQMQGNYHGVAPYLTSPLAAVQASGLEV 467
Query: 463 VYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELIT 522
Y G + + + A+ A+ +D ++ G+D T E EELDR ++ PG Q ELI
Sbjct: 468 NYAFGTNITSNVTNCFAAALAAAEKSDIIIFAGGIDNTLEAEELDRANITWPGNQLELIH 527
Query: 523 RVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGG 582
R+ E KP++++ + GG VD + K +G++LW GYPG+AG AL +++ G P G
Sbjct: 528 RLGELG-KPLVVLQMGGGQVDSSALKASEKVGALLWGGYPGQAGGQALWDILTGQRAPAG 586
Query: 583 RLPMTWYPQDY-IKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFK 641
RL T YP +Y ++ P TDM +RP+ NPG+TY +Y G+ V+ FG GL Y+ ++
Sbjct: 587 RLTTTQYPAEYALQFPATDMSLRPRGD--NPGQTYMWYTGEPVYAFGHGLFYTTFATALA 644
Query: 642 AVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFL-VTIGVKNHGEMAGKHP 700
Q + + D+ + P G E FL T+ V N GE+ +
Sbjct: 645 GPGQ-----------EPERSFDIGALLARPHAGYNLVEQLPFLNFTVKVTNTGEVISDYT 693
Query: 701 VLLFVKPARRGNGRPIKQLVGFQSV-ILNAKEKAEIVFELSPCESLSRAREDGLMVIEEG 759
+ F P K LVGF + L+ + A + +S +SL+R G VI G
Sbjct: 694 AMAFANTTAGPRPHPNKWLVGFDRIGPLDPRVSARMSVPVS-LDSLARTDAQGNRVIYPG 752
Query: 760 THFLVVGDEEYPISIFV 776
+ L + +E P+ F
Sbjct: 753 PYELALNNERSPVLSFT 769
>gi|291518645|emb|CBK73866.1| Beta-glucosidase-related glycosidases [Butyrivibrio fibrisolvens
16/4]
Length = 713
Score = 422 bits (1085), Expect = e-115, Method: Compositional matrix adjust.
Identities = 266/742 (35%), Positives = 383/742 (51%), Gaps = 90/742 (12%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
+RA++LVS++T++EK SQ+++ A AI RLGIP Y WW+EALHGVA G A
Sbjct: 7 KRAKELVSQMTIEEKCSQMLHHAEAIDRLGIPKYCWWNEALHGVARAGD----------A 56
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYN-----AGQAI--GMTFWAPNINIFRD 166
T FPQ I A+FD L ++ E RA YN + I G+T+WAPN+NIFRD
Sbjct: 57 TVFPQAIGLGATFDEELVEKVADVTSTEGRAKYNEFTKHGDRDIYKGLTYWAPNVNIFRD 116
Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
PRWGRG ET GEDP +TG+ ++YVRG+QGD + K ++AC KHF + +
Sbjct: 117 PRWGRGHETYGEDPYLTGQLGMAYVRGLQGDDLDNPK------SAACAKHFAVH---SGP 167
Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
R+ FDA+V QDL DTY F+ VK + +M AYNRVNG P+C + LL R
Sbjct: 168 EAERHHFDAKVNDQDLYDTYLYAFKRLVKDAKVEAVMGAYNRVNGEPACGSKRLLKDILR 227
Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQK 346
WGF G++ SDC A+ ++ ++ + G D+NCG +K A K
Sbjct: 228 GDWGFEGHVVSDCWAIRDFHENHKVTGCEVESAALAVNNGCDLNCGCVYEK-LLYAYKAN 286
Query: 347 KLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVL 406
+ E I ++ L +R+RLG + + I +VV H+ LA++AA+ +VL
Sbjct: 287 LVTEETITESVERLIELRLRLGTLPERRS--KYDDIPYEVVECKEHKELAIEAAKRSMVL 344
Query: 407 LKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYP 466
LKN GLLPL K + ++ +IGPN+NS L+GNY G S IT L+ +Q YV + V
Sbjct: 345 LKND-GLLPLKKDEIKTIGVIGPNSNSRMALVGNYEGISSEYITVLEGIQQYVGDDVRVF 403
Query: 467 GCD----------TVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---------LD 507
D ++ + + +A+ +A+ +D VVL MGLD T E EE D
Sbjct: 404 HSDGTPLWKDRMHVLSEARDTFAEAMAVAEHSDVVVLAMGLDSTIEGEEGDAGNEFGSGD 463
Query: 508 RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGA 567
+ L LPG QQEL+ ++ A KPV+L++L G +D+++A + N+ +I+ YPG G
Sbjct: 464 KKGLKLPGLQQELLEKIT-AIGKPVVLLVLAGSAMDLSWA--NENVNAIMHCWYPGARGG 520
Query: 568 VALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPF 627
A+A+V+FG+ +P G+LP+T+Y D P D M GRTYR+++G ++PF
Sbjct: 521 KAIAQVLFGEDSPSGKLPLTFYKSDADLPPFEDYSME--------GRTYRYFKGTPLYPF 572
Query: 628 GCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTI 687
G GLSYS Y + + TE KF V +
Sbjct: 573 GYGLSYSDIQYSNAGIDK-----------------------------TEGAIGDKFTVKV 603
Query: 688 GVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSR 747
VKN G+ V ++VK L V L E E+ ELS +
Sbjct: 604 TVKNAGDYKAHETVQVYVKDVEASTRVANCSLRKIAKVELLPGESKEVSLELS-ARDFAI 662
Query: 748 AREDGLMVIEEGTHFLVVGDEE 769
E G ++E G + VG ++
Sbjct: 663 IDEKGHCIVEPGKFKVFVGGQQ 684
>gi|329745495|gb|AEB98984.1| xylosidase precursor [synthetic construct]
Length = 804
Score = 422 bits (1084), Expect = e-115, Method: Compositional matrix adjust.
Identities = 266/757 (35%), Positives = 397/757 (52%), Gaps = 40/757 (5%)
Query: 27 TQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPA 86
T P S D N + C + RA L+S TLDE I+ N+ + RLG+P
Sbjct: 50 TIPLSSPDCQNGPLRSHLICDESATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPV 109
Query: 87 YEWWSEALHGVAGVGKGIFFN-GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARAL 145
Y+ WSEALHG+ + F + G+ ATSFPQ ILT A+ + L ++I I + RA
Sbjct: 110 YQVWSEALHGL---DRANFSDSGSYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAF 166
Query: 146 YNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLK 205
NAG+ G+ +APNIN FR P GRGQETPGED + YA Y+ G+QG
Sbjct: 167 NNAGR-YGLDVYAPNINTFRHPVRGRGQETPGEDVSLAAVYAYEYITGIQGP-----DPD 220
Query: 206 GKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCA 265
L+ +A KH+ YD++NW +R D +T QDL++ Y P F + + +MCA
Sbjct: 221 SNLKLAATAKHYAGYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVHSVMCA 280
Query: 266 YNRVNGIPSCADRNLLSKTARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVL 323
YN VNG+P+CAD L R +GF HGY++SDCDA IY+ GYA S A + +
Sbjct: 281 YNAVNGVPACADSYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYASSQAAAAAEAI 340
Query: 324 KAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTM--QPFGK 381
AG D++CG+ Q H ++ L +I++ + L++ ++ G F+ N T P+
Sbjct: 341 LAGTDIDCGTTYQWHLNESITAGDLSRDDIEKGVIRLYTTLVQAGYFDSNTTKANNPYRD 400
Query: 382 IGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKS----KSVSLALIGPNANSAKTL 437
+ V ++ QAA GIVLLKNS+ +LPL + + ++ALIGP AN+ L
Sbjct: 401 LTWSDVLETDAWNISYQAATQGIVLLKNSNKVLPLTEKAYPPSNTTVALIGPWANATTQL 460
Query: 438 LGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGL 497
LGNY G + I+P A + N + + +++ A+ A+ AD ++ G+
Sbjct: 461 LGNYYGNAPYMISPRVAFEEAGYNVNFAERTGISSTNTSGFAAALSAAQSADVIIYAGGI 520
Query: 498 DQTQEKEELDRVDLVLPGRQQELITRVAEAA-KKPVILVLLCGGPVDITFAKYDRNIGSI 556
D T E E LDR + PG Q +LI ++A +A KP+I++ + GG VD + K + N+ ++
Sbjct: 521 DNTLEAEALDRESIAWPGNQLDLIQKLASSAGSKPLIVLQMGGGQVDSSSLKNNTNVSAL 580
Query: 557 LWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRT 615
LW GYPG++G AL ++I G NP GRL T YP Y + P TDM +RP+ NPG+T
Sbjct: 581 LWGGYPGQSGGFALRDIITGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGD--NPGQT 638
Query: 616 YRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGT 675
Y++Y G+ V+ FG GL Y+ ++ + ++ LN ++D+ +P L
Sbjct: 639 YKWYTGEAVYEFGHGLFYTTFAESSSNTTTREIKLNIQDILSQT-HEDLASITQLPVLN- 696
Query: 676 EFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNG-RPIKQLVGFQSVILNAKEKAE 734
T +KN G++ + ++F + G P+K LVG+ + + K
Sbjct: 697 ---------FTANIKNTGKVESDYTAMVFANTSDAGPAPYPVKWLVGWDRL---GEVKVG 744
Query: 735 IVFELS-PCE--SLSRAREDGLMVIEEGTHFLVVGDE 768
EL P E S +R EDG V+ GT L + E
Sbjct: 745 ETRELRVPVEVGSFARVNEDGDWVLFPGTFELALNLE 781
>gi|398406144|ref|XP_003854538.1| hypothetical protein MYCGRDRAFT_38178 [Zymoseptoria tritici IPO323]
gi|339474421|gb|EGP89514.1| hypothetical protein MYCGRDRAFT_38178 [Zymoseptoria tritici IPO323]
Length = 884
Score = 421 bits (1082), Expect = e-115, Method: Compositional matrix adjust.
Identities = 274/750 (36%), Positives = 400/750 (53%), Gaps = 47/750 (6%)
Query: 28 QPPFSCDPSNPSTETFPFC-----KTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRL 82
PP +P T PFC T+L R L+S++T++EK + LV+ A +PR+
Sbjct: 123 HPPPKGVKRSPVCLTDPFCANKACDTSLSQDDRIAALISQMTVEEKATNLVDGALGLPRI 182
Query: 83 GIPAYEWWSEALHGVAGVGKGIFF---NGT-IRGATSFPQVILTAASFDSYLWYRIGQAI 138
G+P YEWW+EALHGVAG +G+ F NG+ ATSFP IL A+FD L Y + I
Sbjct: 183 GLPPYEWWNEALHGVAG-SRGVSFDSPNGSDFSYATSFPLPILMGAAFDDPLIYDVASII 241
Query: 139 GLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDT 198
G EARA N + G FW PN+N F DPRWGRG E P ED +Y S V G+QG
Sbjct: 242 GKEARAFANYAHS-GYDFWTPNMNTFLDPRWGRGLEVPTEDSFHAQRYVASLVPGLQG-- 298
Query: 199 FNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGR 258
G + Q A CKHF YD++ T R+ + T QDL + Y P F++CV+
Sbjct: 299 --GKEKTDHKQIIATCKHFAVYDVE----TNRHAQNYEPTPQDLGEYYLPAFKTCVRDVN 352
Query: 259 ASGIMCAYNRVNGIPSCADRNLLSKTARRQWGF---HGYITSDCDAVSIIYDAEGYAKSP 315
IMC+YN V G+P+CA L R QW F + Y+TSDC+AV I+ + +
Sbjct: 353 VGSIMCSYNAVYGVPACASEYFLQDVLRDQWNFNEPYHYVTSDCEAVKDIWTPHNFTDTE 412
Query: 316 EDAVVDVLKAGMDVNCG-SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNP 374
A L AG D NCG S+LQ +T +V E+++D +L L++ +G F+G P
Sbjct: 413 PAAAAVALNAGTDTNCGTSYLQLNT--SVANNWTTEAQMDISLTRLYNALFTVGYFDGQP 470
Query: 375 TMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSA 434
G ADV +P Q A +AA +GI LLKN GLLPL KS + S+ALIGP AN+
Sbjct: 471 EYD--GLSFADV-STPFAQATAYRAASEGITLLKND-GLLPLKKSYN-SVALIGPWANAT 525
Query: 435 KTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLM 494
+ G Y G + ++PL A Q + + G + ++ A+ A+ AD ++
Sbjct: 526 TQMQGIYQGIAPYLVSPLAAAQAQWGHISFTNGTAINSTNTTGFASALSAARDADVIIYA 585
Query: 495 MGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIG 554
G+D + EKE DR + PG Q +L+ +++E K P+++V GG VD + ++N+
Sbjct: 586 GGIDSSIEKESRDRTSISWPGNQLDLVQQLSELGK-PLVVVQFGGGQVDDSALLRNKNVN 644
Query: 555 SILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPG 613
S++WAGYPG+ G AL +V+ G +P GRL +T YP DYI ++ + D +RP +S PG
Sbjct: 645 SLVWAGYPGQDGGSALIDVLVGKQSPAGRLTITQYPADYINQISLFDPNLRPSDSS--PG 702
Query: 614 RTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPEL 673
RTY++Y + V PFG GL Y+ + + + Q + +V++ Y + P+
Sbjct: 703 RTYKWYNKEPVLPFGYGLHYTTFEFDWAKAPQASYDI-----ASLVDS--TASYTTSPKK 755
Query: 674 GTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNG-RPIKQLVGFQSVI-LNAKE 731
T ++I V N G + + L+F++ G P K L + + L+A
Sbjct: 756 NDASPWTE---LSIKVHNSGSLGSDYVGLVFLRTPNAGPAPYPNKWLASYARLHGLSAGA 812
Query: 732 KAEIVFELSPCESLSRAREDGLMVIEEGTH 761
AE+ F LS +L+R E G ++I G +
Sbjct: 813 SAELSFSLS-LGALARGDEHGDLIIYPGDY 841
>gi|171678585|ref|XP_001904242.1| hypothetical protein [Podospora anserina S mat+]
gi|170937362|emb|CAP62020.1| unnamed protein product [Podospora anserina S mat+]
Length = 800
Score = 421 bits (1082), Expect = e-115, Method: Compositional matrix adjust.
Identities = 269/733 (36%), Positives = 394/733 (53%), Gaps = 47/733 (6%)
Query: 46 CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
C TTL +RA LV+ LT +EK+ +V+ + PR+G+PAY WWSEALHGVA F
Sbjct: 40 CNTTLSPPERAAALVAALTPEEKLQNIVSKSLGAPRIGLPAYNWWSEALHGVAYAPGTQF 99
Query: 106 F--NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
+ +G +TSFP +L AA+FD L +I + IG+E RA NAG + G+ +W PN+N
Sbjct: 100 WQGDGPFNSSTSFPMPLLMAATFDDELLEKIAEVIGIEGRAFGNAGFS-GLDYWTPNVNP 158
Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
F+DPRWGRG ETPGED L+ +YA + ++G++G + + + A CKH+ A D +
Sbjct: 159 FKDPRWGRGSETPGEDVLLVKRYAAAMIKGLEGPV-----PEKERRVVATCKHYAANDFE 213
Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
+W G TR+ F+A++++QD+A+ Y PF+ CV+ R IMCAYN VNG+PSCA LL
Sbjct: 214 DWNGATRHNFNAKISLQDMAEYYFMPFQQCVRDSRVGSIMCAYNAVNGVPSCASPYLLQT 273
Query: 284 TARRQWGF---HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTK 340
R W + + YITSDC+AV + YA + + +AGMD +C
Sbjct: 274 ILREHWNWTEHNNYITSDCEAVLDVSLNHKYAATNAEGTAISFEAGMDTSCEYEGSSDIP 333
Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
A Q L ES +DRAL L+ +R G F+G ++ + +G V P+ Q L+LQAA
Sbjct: 334 GAWSQGLLKESTVDRALLRLYEGIVRAGYFDGKQSL--YSSLGWADVNKPSAQKLSLQAA 391
Query: 401 QDGIVLLKNSHGLLPLP----KSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQ 456
DG VLLKN G LPL KS+ +A+IG +++ L G Y+G + TP A
Sbjct: 392 VDGTVLLKND-GTLPLSDLLDKSRPKKVAMIGFWSDAKDKLRGGYSGTAAYLHTPAYAAS 450
Query: 457 NY--VENTVYYPGCDT-VACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVL 513
+T P + +A + + D A+ AK AD+++ G+D + E DR DL
Sbjct: 451 QLGIPFSTASGPILHSDLASNQSWTDNAMAAAKDADYILYFGGIDTSAAGETKDRYDLDW 510
Query: 514 PGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEV 573
PG Q LI + +K ++VL G +D T + I +ILWA +PG+ G A+ E+
Sbjct: 511 PGAQLSLINLLTTLSKP--LIVLQMGDQLDNTPLLSNPKINAILWANWPGQDGGTAVMEL 568
Query: 574 IFGDHNPGGRLPMTWYPQDYIK-VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLS 632
+ G +P GRLP+T YP ++ + VPMTDM +RP A + GRTYR+Y+ V FG GL
Sbjct: 569 VTGLKSPAGRLPVTQYPSNFTELVPMTDMALRPSAGNSQLGRTYRWYK-TPVQAFGFGLH 627
Query: 633 YSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNH 692
Y+ +S KF K + +++E D + + P + + V+N
Sbjct: 628 YTTFSPKF-----GKKFPAVIDVDEVLEGCDDKYLDTCPLPD----------LPVVVENR 672
Query: 693 GEMAGKHPVLLFVKPARRGNG-RPIKQLVGF---QSVILNAKEKAEIVFELSPCESLSRA 748
G + L FV G G PIK L F + V K + + + L +L+R
Sbjct: 673 GNRTSDYVALAFVSAPGVGPGPWPIKTLGAFTRLRGVKGGEKREGGLKWNLG---NLARH 729
Query: 749 REDGLMVIEEGTH 761
E+G V+ G +
Sbjct: 730 DEEGNTVVYPGKY 742
>gi|167519969|ref|XP_001744324.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163777410|gb|EDQ91027.1| predicted protein [Monosiga brevicollis MX1]
Length = 721
Score = 420 bits (1080), Expect = e-114, Method: Compositional matrix adjust.
Identities = 259/726 (35%), Positives = 388/726 (53%), Gaps = 64/726 (8%)
Query: 36 SNPSTE-TFPFCKTTLPISQRARDLVSRLTLDEKISQLVNS--------APAIPRLGIPA 86
SNP+ + PFC +L RA DL RLTLDE ++Q +N+ AP +PRLG+
Sbjct: 34 SNPNCQLDLPFCDLSLDFRDRAWDLAQRLTLDE-LAQQLNTYSFTPQAYAPGVPRLGLRN 92
Query: 87 YEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY 146
Y + +E LHG+ N AT +PQV AA+ ++ L + + +G E RA+
Sbjct: 93 YSYHAEGLHGIRDA------NVVNYPATLYPQVTAMAATANASLIHEMSTIMGTELRAVN 146
Query: 147 NAGQAIGMTF--------WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDT 198
N Q +G F + P +NI RD RWGR QE+ EDP + G YAV++V G++
Sbjct: 147 NRAQELGEIFGRGGALSIYGPTMNIIRDGRWGRSQESVSEDPWLNGLYAVNFVLGLEQR- 205
Query: 199 FNGGKLKGKLQASACCKHFTAYDLDNWKGT-TRYKFDARVTMQDLADTYQPPFESCVKQG 257
N K LQA+ CKH AY + + T TR+ F+A + D+ DTY P F +CV+ G
Sbjct: 206 -NSSKY---LQAATSCKHLFAYSFEGYNNTLTRHSFNAVIDELDIHDTYLPAFRACVELG 261
Query: 258 RASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPED 317
IMC+YN VNGIP+CA ++ + R+ WGF G I SDCDAV+ IY+ Y ++PED
Sbjct: 262 HVQQIMCSYNSVNGIPACARGDVQNDRVRKAWGFEGLIVSDCDAVADIYNTHNYTRTPED 321
Query: 318 AVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQ 377
AV L+ G D++CG F +H +AV+Q + + +++ + +R LG F+ + T
Sbjct: 322 AVTVALQGGCDLDCGDFYSQHLASAVQQNLTTLAALQQSMTRVLEMRFLLGEFDPD-TSV 380
Query: 378 PFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTL 437
P+ ++G + + +P + +L+A+++ +VLL+N LLP+ S + +ALIGP N +
Sbjct: 381 PYRQLGREAIDTPFARDSSLRASRESVVLLENRIKLLPVTLSADIKVALIGPYVNLTTIM 440
Query: 438 LGNYAGPSCRSITP-LQALQNY-VENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMM 495
+G + IT Q Q + + PGC+ A ++DKAV IA AD VVL +
Sbjct: 441 MGGKLDYTPSFITTYFQGFQAIGITHLTSSPGCNITAPLPGALDKAVQIATQADLVVLTL 500
Query: 496 GLDQTQEKEELDRVDLVLPGRQQELITRVAEAA-KKPVILVLLCGGPVDITFAKYD-RNI 553
GL E E DR L LP QQ+L ++ A +++VL+ GGPV + KY
Sbjct: 501 GLSSDIEHEGGDRETLGLPTPQQDLYDAISAAIPSSKLVVVLVNGGPVSVDRIKYGIART 560
Query: 554 GSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMT-WYPQDYIKVPMTDMKMRPQATSGNP 612
+I+ A Y G++ ALAE IFG +NP G LP T ++ VP TDM +RP A +G P
Sbjct: 561 PTIIEAFYGGQSAGTALAETIFGQNNPSGTLPYTVFFSNITAHVPFTDMHLRPDAATGFP 620
Query: 613 GRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPE 672
GRT+RF++ ++PFG GLSYS +S + QD ++VP
Sbjct: 621 GRTHRFFDAPVMWPFGHGLSYSTFSLAW---------------------QD----ETVPS 655
Query: 673 LGT-EFCETRKF--LVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNA 729
+ T +F + L+++ V NHG + G+ + L+V P++ LVG Q L
Sbjct: 656 ITTGDFTQPTLMHQLLSVNVTNHGPLPGRRALHLYVTVPVTNVSVPLRNLVGLQKHWLAV 715
Query: 730 KEKAEI 735
+ +
Sbjct: 716 DQSMTV 721
>gi|189201569|ref|XP_001937121.1| beta-xylosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
gi|187984220|gb|EDU49708.1| beta-xylosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
Length = 756
Score = 420 bits (1079), Expect = e-114, Method: Compositional matrix adjust.
Identities = 258/707 (36%), Positives = 372/707 (52%), Gaps = 43/707 (6%)
Query: 67 EKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASF 126
EK+ LV+ + + RLG+PAY WW EALHGVAG GI F G R ATSFP +L +A+F
Sbjct: 56 EKLDNLVSKSKGVARLGLPAYNWWGEALHGVAGA-PGINFTGPYRTATSFPMPLLMSAAF 114
Query: 127 DSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKY 186
D L ++I IG EARA N G A + FW P+IN FRDPRWGRG ETPGED L Y
Sbjct: 115 DDDLIHQIAIVIGNEARAFGNGGIA-PVDFWTPDINPFRDPRWGRGSETPGEDILRIKGY 173
Query: 187 AVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTY 246
S + G++GD K + + A CKH+ YD+++W GT R+ FDA++T QDLA+ +
Sbjct: 174 TKSLLSGLEGD-------KAQRKIIATCKHYVGYDMEDWNGTDRHSFDAKITTQDLAEYF 226
Query: 247 QPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGF---HGYITSDCDAVS 303
PPF+ C + + MC+YN VNG+P+CAD +L R+ W + + YITSDC+AV
Sbjct: 227 MPPFQQCARDSKVGSFMCSYNAVNGVPTCADTYVLEDILRKHWNWTDSNNYITSDCEAVK 286
Query: 304 IIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSV 363
I Y + ++A GMD++C A Q L S IDRAL +
Sbjct: 287 DISLRHKYVATLQEATAIAFNNGMDLSCEYSGSSDIPGAFSQGLLNVSVIDRALTRQYEG 346
Query: 364 RMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVS 423
+ G F+G + +G + +P Q L LQ A +G+ LLKN LPL
Sbjct: 347 LVHAGYFDG--AAATYANLGVQDINTPEAQKLVLQVAAEGLTLLKNDD-TLPLSLKSGSK 403
Query: 424 LALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSAS--IDKA 481
+A++G AN + L G Y+GP+ P+ A + G +A KA
Sbjct: 404 VAMVGFWANDSSKLSGIYSGPAPYLHNPVYAGNKLGLDMAVATGPILQKSGAADNWTTKA 463
Query: 482 VDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGP 541
+D AK +D ++ GLD + E DR D+ P Q +LIT++A A KP++++ L G
Sbjct: 464 LDAAKKSDTILYFGGLDPSAAAEGSDRTDISWPSAQIDLITKLA-ALGKPLVVIAL-GDM 521
Query: 542 VDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDM 601
VD + + S++WA +PG+ G A+ +VI G+H GRLP+T YP Y ++ M DM
Sbjct: 522 VDHMPILNMKGVNSLIWANWPGQDGGTAVMQVITGEHAIAGRLPITQYPAKYTQLSMLDM 581
Query: 602 KMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVEN 661
+RP NPGRTYR+Y + V PFG GL Y+K++ KF S+S+ V
Sbjct: 582 NLRPGGN--NPGRTYRWYN-ESVQPFGFGLHYTKFAAKFG-----------SNSSLTVNI 627
Query: 662 QDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVG 721
QD++ KS + + C+ + + V N G L F+K P+K LV
Sbjct: 628 QDIM--KSCTKDHPDLCDVPP--IEVAVTNKGNRTSDFIALAFIKGEVGPKPYPLKTLVS 683
Query: 722 F---QSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVV 765
+ + + + + A + L +LSR + G +V G + L++
Sbjct: 684 YARLRDISGSQTKTASLALTLG---TLSRVDQSGNLVAYPGEYTLLL 727
>gi|358382857|gb|EHK20527.1| hypothetical protein TRIVIDRAFT_192759 [Trichoderma virens Gv29-8]
Length = 860
Score = 420 bits (1079), Expect = e-114, Method: Compositional matrix adjust.
Identities = 277/759 (36%), Positives = 393/759 (51%), Gaps = 69/759 (9%)
Query: 35 PSNPSTETFP-----------FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLG 83
P+ P T +FP C TT I+ RA +V +TL+EK++ + +SA RLG
Sbjct: 117 PNPPLTPSFPDCTRDPLCSNDVCDTTKSIAARAAAIVKPMTLNEKVANVGSSASGSGRLG 176
Query: 84 IPAYEWWSEALHGVAGVGKGIFFN----GTIRGATSFPQVILTAASFDSYLWYRIGQAIG 139
+PAY+W +EALHGVAG G+ F ATSFP IL +A+FD L + AI
Sbjct: 177 LPAYQWQNEALHGVAG-STGVQFQSPLGANFSAATSFPMPILLSAAFDDALVQSVATAIS 235
Query: 140 LEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTF 199
EARA N G A G+ FW PNIN FRDPRWGRG ETPGED Y +S + G+QG
Sbjct: 236 TEARAFANYGFA-GLDFWTPNINPFRDPRWGRGMETPGEDAFRIQGYVLSLINGLQG--- 291
Query: 200 NGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRA 259
G + + CKHF AYD++N R + T QD+AD Y P FE+CV+ +
Sbjct: 292 --GIDPDFFRTISTCKHFAAYDIEN----GRTANNLSPTQQDMADYYLPMFETCVRDAKV 345
Query: 260 SGIMCAYNRVNGIPSCADRNLLSKTARRQWGF---HGYITSDCDAVSIIYDAEGYAKSPE 316
IMCAYN VNG+P+CAD LL R +GF Y+ SDCDAV +YD YA +
Sbjct: 346 GSIMCAYNSVNGVPACADSYLLQSVLRDGYGFTEDFNYVVSDCDAVENVYDPHHYAANLT 405
Query: 317 DAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTM 376
A L AG D++CGS A+V+ E+ +D++L L+S +++G F+
Sbjct: 406 QAAAMSLNAGTDLDCGSSYNV-LNASVQAGMTTEATLDKSLIRLYSALIKVGWFD----- 459
Query: 377 QP--FGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSA 434
QP + +G V + + LA AA G+ LLKN G LPL + ++A+IGP N+
Sbjct: 460 QPAKYSSLGWGNVNTTQTRALAHDAATGGMTLLKND-GTLPLSPTLQ-NVAVIGPWVNAT 517
Query: 435 KTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLM 494
L GNYAG + + PL Q N Y G + ++ + A+ A +D +V +
Sbjct: 518 TQLQGNYAGTAPVLVNPLTVFQQKWRNVKYAQGTAINSQDTSGFNAAISAASSSDVIVYL 577
Query: 495 MGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIG 554
G+D + E E DR + PG Q LI+++A KP+++V GG +D + + +
Sbjct: 578 GGIDISVENEGFDRTAITWPGNQLSLISQLANLG-KPLVIVQFGGGQIDDSSLLSNSKVN 636
Query: 555 SILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPG 613
SILWAGYPG+ G AL +V+ G + P GRLP+T YP +Y+ + DM +RP + PG
Sbjct: 637 SILWAGYPGQEGGNALFDVLTGANPPAGRLPITQYPANYVNNNNIQDMNLRPSGSI--PG 694
Query: 614 RTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPEL 673
RTY +Y G V PFG GL Y+ +S F QS+ T + +V+ +
Sbjct: 695 RTYAWYTGTPVLPFGYGLHYTNFSVSF-----------QSTKTSGTDVATIVNNAGSNKD 743
Query: 674 GTEFCETRKFLVTIGVKNHG---EMAGKHPVLLFVKPARRGNG-RPIKQLVGF---QSVI 726
F + + VKN G +A + LLF+K G P KQL + + V
Sbjct: 744 RATFAT-----LVVNVKNTGGKANLASDYVGLLFLKSTNAGPAPHPNKQLAAYGRVKKVG 798
Query: 727 LNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVV 765
+ A ++ + L SL+RA +G + G + L +
Sbjct: 799 VGATQQLTLTVNLG---SLARADTNGDRWVYPGAYTLTL 834
>gi|307719075|ref|YP_003874607.1| glycoside hydrolase family protein [Spirochaeta thermophila DSM
6192]
gi|306532800|gb|ADN02334.1| glycoside hydrolase family 3 [Spirochaeta thermophila DSM 6192]
Length = 693
Score = 420 bits (1079), Expect = e-114, Method: Compositional matrix adjust.
Identities = 273/734 (37%), Positives = 391/734 (53%), Gaps = 87/734 (11%)
Query: 53 SQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRG 112
++R L+SR++++EK +V+ A +PRLGIP Y WW+EALHGVA G+
Sbjct: 4 NERMTSLLSRMSIEEKAGLMVHRAKGVPRLGIPNYNWWNEALHGVANSGE---------- 53
Query: 113 ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTFWAPNINIFR 165
AT FPQ I AA+FD L R+ AI EARA +NA G+TFW+PNINI+R
Sbjct: 54 ATVFPQAIGLAATFDPDLVRRVADAISREARAKFNAVGKERAAEYERGLTFWSPNINIYR 113
Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
DPRWGRGQET GEDP +T K V++V+G+QGD L+ +AC KH+ +
Sbjct: 114 DPRWGRGQETYGEDPFLTSKIGVAFVKGLQGDH------PYYLRVAACAKHYAVHSGPE- 166
Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
R+ FDARV+ +DL +TY P FE+ VK G +M AYNRVNG P+C + LL +
Sbjct: 167 --GLRHVFDARVSEKDLWETYLPAFEALVKAG-VEAVMGAYNRVNGEPACGSKRLLEEIL 223
Query: 286 RRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQ 345
R++WGF G++ SDC A++ + K P +++ L+AG D+NCG+ + H AVK
Sbjct: 224 RKKWGFKGHVVSDCWAIADFHLHHKVTKDPIESIAMALEAGCDLNCGNTYE-HLLDAVKA 282
Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
+ E +DR++ L S RLGLF + P+ ++ + AH+ LA +AA+ +V
Sbjct: 283 GAVSEELVDRSVARLLSTLDRLGLFTDD---HPYVRLSLADIDWEAHRALAREAAEKSVV 339
Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE---NT 462
LLKN +G+LPL + K + + GPNA + LLGNYAG S R +T L+ + Y
Sbjct: 340 LLKN-NGILPLDRRKLRYIYVTGPNAANPVALLGNYAGVSSRLVTVLEGITGYAGPGITV 398
Query: 463 VYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRV---------DLVL 513
Y GC ID A +A+ AD V +MG D E EE D + DL L
Sbjct: 399 TYKIGCPLQGNKINPIDWASGVARYADVTVAVMGRDSAVEGEEGDAIFSDNYGDLSDLNL 458
Query: 514 PGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEV 573
Q + + R+ E K P+++VLL G PV + + +I++A YPGE G A+A V
Sbjct: 459 SREQIDYLRRIKEIGK-PLVVVLLSGAPV--CSPELEELADAIVYAWYPGEEGGNAIARV 515
Query: 574 IFGDHNPGGRLPMTWYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLS 632
+FG+ +P GRLP+T +P+ ++P TD M GRTYR+ + + ++PFG GLS
Sbjct: 516 LFGEVSPSGRLPIT-FPKGVDQLPPFTDYSME--------GRTYRYMKEEPLYPFGFGLS 566
Query: 633 YSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNH 692
Y+ +SY+ + SS + ++ + CE V+N
Sbjct: 567 YATFSYR-----------DPKSSASRWDKRETLEV---------VCE---------VENT 597
Query: 693 GEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDG 752
+ V L+V+ P+ L GF V L E+ ++ F LSP E LS E G
Sbjct: 598 SSIPADEVVQLYVRWEDAPFRVPLWSLKGFTRVSLGTGERIQVRFVLSP-EDLSFIDEKG 656
Query: 753 LMVIEEGTHFLVVG 766
V+ EG VG
Sbjct: 657 RKVLPEGRLRFHVG 670
>gi|255957137|ref|XP_002569321.1| Pc21g23540 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211591032|emb|CAP97251.1| Pc21g23540 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 791
Score = 419 bits (1078), Expect = e-114, Method: Compositional matrix adjust.
Identities = 260/739 (35%), Positives = 398/739 (53%), Gaps = 37/739 (5%)
Query: 46 CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
C TT RA L++ T +E ++ N PAIPRLG+P Y+ W+EALHG+ F
Sbjct: 61 CDTTAKPHDRAAALIAMFTFEELVNSTGNVMPAIPRLGLPPYQVWNEALHGLDRANLTEF 120
Query: 106 FNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFR 165
G ATSFP ILT A+ + L +IG + + RA N G+ G+ ++PNIN FR
Sbjct: 121 --GDYSWATSFPSPILTMAALNRTLINQIGGIVSTQGRAFNNGGR-YGLDVYSPNINSFR 177
Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
P WGRGQETPGED + Y + Y+ G+QG G +L+ +A KHF YD++NW
Sbjct: 178 HPVWGRGQETPGEDIQLCSVYGLEYITGLQG-----GLDPKELKLAATAKHFAGYDIENW 232
Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
+R D ++ D A Y P F + V+ R +M +YN VNG+P+ A+ LL
Sbjct: 233 GNHSRLGNDMSISAFDFASYYAPQFVTAVRDARVHSVMASYNAVNGVPASANSFLLQTLL 292
Query: 286 RRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAV 343
R W F GY++SDCD+V +++ GYA S A ++AG D++CG+ Q + +
Sbjct: 293 RDTWNFVEDGYVSSDCDSVYNVFNPHGYASSASLAAAKSIQAGTDIDCGATYQLYLNQSF 352
Query: 344 KQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIG-ADVVCSPAHQVLALQAAQD 402
Q ++ SEI+RA +S + LG F+G+ + + + +DVV + A + + +AA +
Sbjct: 353 TQGEISRSEIERAATRFYSNLVSLGYFDGDNSK--YRDLDWSDVVATDAWNI-SYEAAVE 409
Query: 403 GIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENT 462
GIVLLKN G LPL K S+ALIGP AN T+ GNY G + PL ALQ +
Sbjct: 410 GIVLLKND-GTLPLSKDTH-SVALIGPWANVTTTMQGNYYGAAPYLTGPLAALQASDLDV 467
Query: 463 VYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELIT 522
Y G + + +++ + A+ A+ +D V+ G+D + E E +DR + PG Q +LI
Sbjct: 468 NYAFGTNISSETTSGFEAALSAARKSDVVIFAGGIDNSVEAEGVDRETITWPGNQLQLIE 527
Query: 523 RVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGG 582
+++E KP++++ + GG VD + K ++N+ S++W GYPG++G A+ +++ G P G
Sbjct: 528 QLSELG-KPLVVLQMGGGQVDSSSLKANKNVNSLVWGGYPGQSGGPAILDILTGKRAPAG 586
Query: 583 RLPMTWYPQDY-IKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFK 641
RL +T YP +Y ++ P TDM +RP+ + NPG+TY +Y GK V+ FG GL Y+ F+
Sbjct: 587 RLTVTQYPAEYALQFPATDMSLRPKGS--NPGQTYMWYTGKPVYEFGHGLFYT----TFE 640
Query: 642 AVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFL-VTIGVKNHGEMAGKHP 700
N N +S D+V S G E F+ TI V+N G + +
Sbjct: 641 TSLANSHGANNGASF------DIVKLLSRSNAGYNVIEQVPFMNYTIEVENTGTVTSDYT 694
Query: 701 VLLFVKPARRGNGRPIKQLVGFQ---SVILNAKEKAEIVFELSPCESLSRAREDGLMVIE 757
+ FV + P K LVGF + +A + I L ++++R EDG ++
Sbjct: 695 AMAFVNTKAGPSPHPNKWLVGFDRLGGIEPHATQTMTIPVSL---DNVARTDEDGNRIVY 751
Query: 758 EGTHFLVVGDEEYPISIFV 776
G + L + +E + F
Sbjct: 752 PGKYELALNNERSAVLSFT 770
>gi|322512556|gb|ADX05682.1| putative carbohydrate-active enzyme [uncultured organism]
Length = 717
Score = 419 bits (1078), Expect = e-114, Method: Compositional matrix adjust.
Identities = 271/745 (36%), Positives = 400/745 (53%), Gaps = 95/745 (12%)
Query: 49 TLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNG 108
T +RA+ LV +TL+EK+ Q + +APAI RLG+PAY +W+EALHGVA G
Sbjct: 10 TKTFEERAQALVCEMTLEEKVFQTLFNAPAIERLGVPAYNYWNEALHGVARAGV------ 63
Query: 109 TIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------MTFWAPNI 161
AT FPQ I AASFD L ++ I EARA +N Q G +TFW+PN+
Sbjct: 64 ----ATVFPQAIGLAASFDEELLGQVADTISTEARAKFNMQQKFGDRDIYKGLTFWSPNV 119
Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
NIFRDPRWGRG ET GEDP ++G+ VS++RG+QGD + ++ +AC KHF +
Sbjct: 120 NIFRDPRWGRGHETFGEDPFLSGRLGVSFIRGMQGDD------ERYMKVAACAKHFAVH- 172
Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
+ R+ F+A V+ QDL +TY P F +CV + +M AYNR NG C + LL
Sbjct: 173 --SGPEDQRHSFNAVVSEQDLRETYLPAFHACVTEAGVEAVMGAYNRTNGEACCGSKKLL 230
Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA 341
R +WGF G++TSDC A+ ++ K+ E+ V + +G D+NCG+ L H
Sbjct: 231 VDILRGEWGFRGHVTSDCWALKDFHEFHMVTKNQEETVALAMNSGCDLNCGN-LYVHLLQ 289
Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQ 401
AV+ + ES IDRA+ LF+ RM+LGLF+ + + P+ IG D V + A++ L +A++
Sbjct: 290 AVRDGLVEESVIDRAVTRLFTTRMKLGLFDRSEEV-PYNGIGYDRVDTEANRKLNREASR 348
Query: 402 DGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN 461
+ LLKN+ GLLPL SK ++ ++GPNA++ K L+GNY G + +T L ++ +
Sbjct: 349 RTVCLLKNADGLLPLDISKLRTIGVVGPNADNRKALVGNYEGTASEYVTVLDGIRELAGD 408
Query: 462 ---TVYYPGC-------DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE------ 505
VY GC + + I +A +A+ +D V+ +MGLD E EE
Sbjct: 409 DVRVVYSEGCHLFRDRVQGLGQPNDRIAEARAVAELSDVVIAVMGLDPGLEGEEGDQGNE 468
Query: 506 ---LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYP 562
D+ +L LPG Q E++ + E+ KPV+LVLL G + I +A + ++ +IL A YP
Sbjct: 469 FASGDKPNLELPGLQGEVLKALVESG-KPVVLVLLGGSALAIPWA--EEHVPAILDAWYP 525
Query: 563 GEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGK 622
G G A+A+V+FG P G+LP+T+Y TD M+ RTYR+ +
Sbjct: 526 GAQGGRAVADVLFGRACPEGKLPVTFYRTSEELPAFTDYSMK--------NRTYRYMKQP 577
Query: 623 EVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRK 682
++PFG GLSY+ + L +++ V+ D V ++V
Sbjct: 578 ALYPFGYGLSYTSWE------------LTNTTAEGSVD--DGVVCRAV------------ 611
Query: 683 FLVTIGVKNHGEMAGKHPVLLFVK-PARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSP 741
++N G MAG V ++VK P G P QL G + + L E AE+ L
Sbjct: 612 ------LRNTGAMAGAQTVQVYVKAPLATG---PNAQLKGLRKIRLQPGESAEVAISLDK 662
Query: 742 CESLSRAREDGLMVIEEGTHFLVVG 766
E+ E GL V+ G + + +G
Sbjct: 663 -EAFGVYNEKGLRVLLPGEYKIYIG 686
>gi|297745533|emb|CBI40698.3| unnamed protein product [Vitis vinifera]
Length = 461
Score = 419 bits (1077), Expect = e-114, Method: Compositional matrix adjust.
Identities = 207/379 (54%), Positives = 273/379 (72%), Gaps = 5/379 (1%)
Query: 145 LYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKL 204
+YN G A G+TFW+PN+NIFRDPRWGRGQETPGEDPL++ KYA YVRG+Q + +G
Sbjct: 1 MYNVGLA-GLTFWSPNVNIFRDPRWGRGQETPGEDPLLSSKYASGYVRGLQ-QSDDGSP- 57
Query: 205 KGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMC 264
+L+ +ACCKH+TAYDLDNWKG R+ F+A VT QD+ DT+QPPF+SCV G + +MC
Sbjct: 58 -DRLKIAACCKHYTAYDLDNWKGVDRFHFNAVVTKQDMDDTFQPPFKSCVIDGNVASVMC 116
Query: 265 AYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLK 324
+YN+VNG P+CAD +LLS R +W +GYI SDCD+V + Y+++ Y K+PE+A +
Sbjct: 117 SYNQVNGKPACADPDLLSGIVRGEWKLNGYIVSDCDSVDVFYNSQHYTKTPEEAAAKAIL 176
Query: 325 AGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGA 384
AG+D+NCGSFL +HT+AAVK + ES +D+A+ N F+ MRLG F+GNP+ +GK+G
Sbjct: 177 AGLDLNCGSFLGQHTEAAVKGGLVDESAVDKAVSNNFATLMRLGFFDGNPSKAIYGKLGP 236
Query: 385 DVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGP 444
VC+ HQ LA +AA+ GI+LLKNS G LPL + +LA+IGPNAN KT++GNY G
Sbjct: 237 KDVCTLEHQELAREAARQGIMLLKNSKGSLPLSPTAIKTLAIIGPNANVTKTMIGNYEGT 296
Query: 445 SCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKE 504
C+ TPLQ L V T Y GC VACS+A ID+A IA AD VL++G+DQ+ E E
Sbjct: 297 PCKYTTPLQGLMALVA-TTYLSGCSNVACSTAQIDEAKKIAAAADATVLIVGIDQSIEAE 355
Query: 505 ELDRVDLVLPGRQQELITR 523
DRV++ LPG+Q LIT
Sbjct: 356 GRDRVNIQLPGQQPLLITE 374
>gi|326202986|ref|ZP_08192853.1| glycoside hydrolase family 3 domain protein [Clostridium
papyrosolvens DSM 2782]
gi|325987063|gb|EGD47892.1| glycoside hydrolase family 3 domain protein [Clostridium
papyrosolvens DSM 2782]
Length = 712
Score = 419 bits (1077), Expect = e-114, Method: Compositional matrix adjust.
Identities = 271/750 (36%), Positives = 392/750 (52%), Gaps = 95/750 (12%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
+ +L +RA DLVS++TL+EK SQL A + RLGIP Y WW+EALHGVA G
Sbjct: 6 YLDKSLSFKERAADLVSKMTLEEKASQLRYDAQPVERLGIPRYNWWNEALHGVARAGV-- 63
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------MTFW 157
AT FPQ I AA FD +I I E RA YN G +TFW
Sbjct: 64 --------ATVFPQAIGMAAMFDDEFLEKIADVIATEGRAKYNESAKKGDRDIYKGITFW 115
Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
+PN+NIFRDPRWGRG ET GEDP +T + V++V+G+QGD GK L+ +AC KH+
Sbjct: 116 SPNVNIFRDPRWGRGHETYGEDPYLTSRLGVAFVKGLQGD----GKY---LKTAACAKHY 168
Query: 218 TAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCAD 277
+ + R+ FDA V+ +DL +TY P FE+ VK+ + IM AYNR NG P
Sbjct: 169 AVH---SGPEDDRHFFDAIVSQKDLYETYLPAFEALVKEAKVESIMGAYNRTNGEPCNGS 225
Query: 278 RNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQK 337
+ LL R WGF G++ SDC A+ ++ G K+P ++V LK+G D+NCG+ +
Sbjct: 226 KTLLKDILRDGWGFDGHVVSDCWAIKDFHEGHGVTKTPTESVALALKSGCDLNCGN-MYL 284
Query: 338 HTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLAL 397
A+K+ + E +IDRA L + RM+LG+F+ + F I ++ S H ++L
Sbjct: 285 LILLALKEGLITEEDIDRAAIRLMTTRMKLGMFDDDC---EFDNIPYELNDSAEHNKISL 341
Query: 398 QAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQN 457
+AA+ +VLLKN GLLPL K ++A+IGPNA+S+ L NY+G +++T ++ ++
Sbjct: 342 EAAKKSMVLLKND-GLLPLDSKKIKNVAVIGPNADSSLALRANYSGTPSQNVTIIEGIRK 400
Query: 458 YV-ENT-VYYPGC--------DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL- 506
V ENT V+Y + +A + +AV A+ +D VVL +GLD + E E+
Sbjct: 401 RVSENTRVWYAMGSHLFLNRDEDLAQPDDRLKEAVSAAERSDVVVLCLGLDASVEGEQND 460
Query: 507 ----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSI 556
D+ DL LP Q+ L+ V A KP I+ LL G + I A +I
Sbjct: 461 QGTVILDAGGDKADLNLPESQRNLLNAVL-ATGKPTIVALLSGSALSIGDAA--DKAAAI 517
Query: 557 LWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTY 616
+ YPG G +A AE+IFGD++P GRLP+T+Y P D M RTY
Sbjct: 518 VQCWYPGAIGGLAFAEMIFGDYSPAGRLPVTFYKSTEELPPFADYSME--------NRTY 569
Query: 617 RFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTE 676
+F +G ++PFG GLSY+ + Y + MV Q V + +++
Sbjct: 570 KFMKGDALYPFGFGLSYTSFEY-----------------SNMVCPQTVNNGENLS----- 607
Query: 677 FCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIV 736
V++ V+N G + V +++K P L GF+ + L + EK +
Sbjct: 608 --------VSVDVQNTGSVDSDEVVQVYIKDMDASVRVPKYSLCGFKRIHLKSGEKKTVT 659
Query: 737 FELSPCESLSRAREDGLMVIEEGTHFLVVG 766
FE++ ++S E G IE G L G
Sbjct: 660 FEVA-SNAMSIVDEAGKRHIENGEFTLYAG 688
>gi|266619450|ref|ZP_06112385.1| beta-glucosidase [Clostridium hathewayi DSM 13479]
gi|288869013|gb|EFD01312.1| beta-glucosidase [Clostridium hathewayi DSM 13479]
Length = 714
Score = 419 bits (1077), Expect = e-114, Method: Compositional matrix adjust.
Identities = 246/616 (39%), Positives = 350/616 (56%), Gaps = 63/616 (10%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
+R RDLVS++TL+EK+SQL APA+ RLGIP+Y WW+EALHGVA G A
Sbjct: 14 ERVRDLVSQMTLEEKVSQLRYDAPAVERLGIPSYNWWNEALHGVARAG----------AA 63
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTFWAPNINIFRD 166
T FPQ I AA FD L +IG LE RA Y+ G G+TFW+PNINIFRD
Sbjct: 64 TVFPQAIGLAAMFDEALLEKIGDVTALEGRAKYHEAVRNGDRGLYKGITFWSPNINIFRD 123
Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
PRWGRG ET GEDP +TG+ +Y++G+QG NG +LK A+AC KHF A+ K
Sbjct: 124 PRWGRGHETYGEDPCLTGRMGTAYIKGMQG---NGKRLK----AAACVKHFAAHSGPE-K 175
Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
G R+ F++ V+ +DL +TY P FE CVK+ G+M YNR+NG +C +L+++ R
Sbjct: 176 G--RHSFNSVVSKKDLTETYFPAFERCVKEAGVEGVMGGYNRLNGEAACGSHHLITEILR 233
Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQK 346
+WGF GY SDC A+ + G +P+++ LK+G D+NCG+ + H +A Q
Sbjct: 234 EKWGFDGYYVSDCGAIKDFHMHHGLTDTPQESAALALKSGCDLNCGA-VYLHVMSAYNQG 292
Query: 347 KLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVL 406
+ +IDRA+ +L RMRLG+F+ + F +I ++ H LAL+AA++ +VL
Sbjct: 293 LVSAEDIDRAVTHLMMTRMRLGMFDQH---TEFDEIPYEINDCAEHHGLALKAAEESMVL 349
Query: 407 LKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV------- 459
LKN G+LPL K+ ++A+IGPN +S + L GNY G + T L+ ++ +
Sbjct: 350 LKND-GILPLDKTALKTVAVIGPNGDSEEILKGNYNGTATEKYTILEGIRAVLGKETRIF 408
Query: 460 ---ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---------LD 507
+ +Y + +A + + +AV +A +D V L +GL+ T E EE D
Sbjct: 409 CSEGSHLYRDNVENLAEADDRLKEAVSMAVRSDVVFLCLGLNGTLEGEEGDANNSYAGAD 468
Query: 508 RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGA 567
+ DL LP Q L+ V PVIL+L G + I +A + +IL YPG+ G
Sbjct: 469 KADLNLPESQMRLLKAVC-GTGTPVILLLAAGSAMAINYAA--EHCSAILHIWYPGQMGG 525
Query: 568 VALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPF 627
+A A ++ G+ P GRLP+T+Y TD M+ GRTYR+ E + ++PF
Sbjct: 526 LAAARLLTGEAVPSGRLPVTFYQTTEELPEFTDYSMK--------GRTYRYMEREALYPF 577
Query: 628 GCGLSYSKYSY-KFKA 642
G GLSY + Y FKA
Sbjct: 578 GYGLSYGDFEYSNFKA 593
>gi|402308386|ref|ZP_10827395.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
sp. MSX73]
gi|400375830|gb|EJP28725.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
sp. MSX73]
Length = 721
Score = 418 bits (1075), Expect = e-114, Method: Compositional matrix adjust.
Identities = 263/735 (35%), Positives = 385/735 (52%), Gaps = 85/735 (11%)
Query: 52 ISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIR 111
+S+ A+++++R+T+ EKISQL+N +PAI LGI Y+WWSE LHGV G+
Sbjct: 30 MSRHAKEIIARMTVSEKISQLMNESPAIEHLGIKPYDWWSEGLHGVGRDGR--------- 80
Query: 112 GATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-------GMTFWAPNINIF 164
AT FPQ I A+FD L IG A+ E RA +N Q + G+TFW+PN+NIF
Sbjct: 81 -ATVFPQPIALGATFDEALVREIGDAVATEGRAKFNVAQKLKNYSRNAGLTFWSPNVNIF 139
Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
RDPRWGRG ET GEDPL++G +YVRG+QGD L+ AC KH Y + +
Sbjct: 140 RDPRWGRGMETYGEDPLLSGMLGTAYVRGLQGDD------AFYLKTGACAKH---YAVHS 190
Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
TR++ D + +DL +TY P F+ V+QGR +M AYNRV G P + LL+
Sbjct: 191 GPEGTRHEADIHPSRRDLFETYLPQFKMLVQQGRVEAVMSAYNRVYGEPCGGSKYLLTDI 250
Query: 285 ARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVK 344
R+ WGF+G+I SDCDA++ Y Y K+PE+A +KAG++V CG K + A+
Sbjct: 251 LRKSWGFNGHIVSDCDAINDFYGGHRYVKTPEEACAAAIKAGLNVECGHTF-KAMQGALD 309
Query: 345 QKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGI 404
Q L E+++DRAL L R++LG+ + P+ +CSPAH LAL+AA + +
Sbjct: 310 QGLLAEADLDRALFPLVMTRLKLGILEPDSAC-PYNSYDESEICSPAHTALALRAADEAM 368
Query: 405 VLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV- 463
VLLKN +G+LPL K+ +L + GP A+ A L+GNY G S R T LQ + + V +
Sbjct: 369 VLLKN-NGILPLDKNIR-TLFVAGPGASDAFYLMGNYFGLSNRYSTYLQGIVSRVSSGTS 426
Query: 464 --YYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---------LDRVDLV 512
+ P + ++ AV+ A A+ +++MG + E EE DRV +
Sbjct: 427 VNFRPAFMQITEELNDMNWAVNEACAAEVAIVVMGNNGNMEGEEGEAIASASRGDRVGIG 486
Query: 513 LPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAE 572
LP Q + RV +++VL G P+D+ K + +++ A YPG+ G AL +
Sbjct: 487 LPASQLNYLRRVKARKGGRIVVVLTGGSPIDLR--KISKLADAVVMAWYPGQEGGEALGD 544
Query: 573 VIFGDHNPGGRLPMTWYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGL 631
++FGD N GRLP+T +P D +P D M GRTY++ G ++PFG GL
Sbjct: 545 LLFGDKNFSGRLPIT-FPADVDSLPAFDDYSMN--------GRTYKYMSGNVMYPFGYGL 595
Query: 632 SYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKN 691
SY + +Y + + V K L E T N
Sbjct: 596 SYGRVTY--------------------TDARVVGRIKKGEPLAVEVVLT----------N 625
Query: 692 HGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRARED 751
+G+ ++ G G P+ LVGF+ V + K + VF++ P E L + D
Sbjct: 626 NGDRTIDEVAQAYIATPTAGKGSPMASLVGFRRVSIPPKSSVKAVFKIVP-ERLMTIQSD 684
Query: 752 GLMVIEEGTHFLVVG 766
G + +G + L +G
Sbjct: 685 GSSKLLKGNYTLTIG 699
>gi|358397360|gb|EHK46735.1| glycoside hydrolase family 3 protein [Trichoderma atroviride IMI
206040]
Length = 865
Score = 418 bits (1074), Expect = e-114, Method: Compositional matrix adjust.
Identities = 280/762 (36%), Positives = 403/762 (52%), Gaps = 65/762 (8%)
Query: 26 STQPPFSCDPSNPSTETFPFCK-----TTLPISQRARDLVSRLTLDEKISQLVNSAPAIP 80
S PP + PS P P C TTL +++RA +V +TLDEK++ + +SA
Sbjct: 121 SANPPLT--PSFPDCTRDPLCSNAICDTTLSMAERAAAIVKPMTLDEKVANVGSSASGSA 178
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFN----GTIRGATSFPQVILTAASFDSYLWYRIGQ 136
RLG+PAY+W +EALHGVAG G+ F ATSFP IL +A+FD L +
Sbjct: 179 RLGLPAYQWQNEALHGVAG-STGVQFQSPLGANFSAATSFPMPILLSAAFDDALVQNVAT 237
Query: 137 AIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG 196
AI EARA N G A G+ FW PNIN FRDPRWGRG ETPGED Y ++ + G+QG
Sbjct: 238 AISTEARAFANYGFA-GLDFWTPNINPFRDPRWGRGMETPGEDAFRIQGYVLALISGLQG 296
Query: 197 DTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQ 256
G + A CKHF AYD++N R + T QD+AD Y P FE+CV+
Sbjct: 297 -----GINPDFFRIIATCKHFAAYDIEN----GRTGNNLNPTQQDMADYYLPMFETCVRD 347
Query: 257 GRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGF---HGYITSDCDAVSIIYDAEGYAK 313
+ +MCAYN V+GIP+CA LL R +GF Y+ SDCDAV ++D YA
Sbjct: 348 AKVGSVMCAYNAVDGIPACASEYLLQDVLRDGFGFTEDFNYVVSDCDAVDNVFDPHHYAS 407
Query: 314 SPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGN 373
+ +A L AG D++CGS A+V+ E+ ++++L L+S +++G F+
Sbjct: 408 NLTEAAALSLNAGTDLDCGSSYNV-LNASVEAALTSEAALNQSLVRLYSALIKVGYFD-- 464
Query: 374 PTMQP--FGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNA 431
QP + + V + +Q LA AA G+ LLKN G LPL ++ S ++A+IGP
Sbjct: 465 ---QPSEYKSLSWANVNTTQNQALAHDAATGGMTLLKN-DGTLPLSRTLS-NVAIIGPWV 519
Query: 432 NSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHV 491
N+ + GNYAG + + PL Q N Y G + ++ A+ A +D +
Sbjct: 520 NATTQMQGNYAGTAPFLVNPLDVFQQKWGNVKYAQGTAINSQDTSGFSAALSAASSSDVI 579
Query: 492 VLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDR 551
V + G+D T E E DR +V PG Q +LI+++A K P+++V GG +D + +
Sbjct: 580 VYLGGIDITVENEGFDRGSIVWPGNQLDLISQLANLGK-PLVIVQFGGGQIDDSSLLSNP 638
Query: 552 NIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSG 610
N+ SILWAGYPG+ G A+ +V+ G + P GRLP+T YP YI + DM +RP ++G
Sbjct: 639 NVRSILWAGYPGQDGGNAVFDVLTGANPPAGRLPITQYPASYINNNNIQDMNLRP--SNG 696
Query: 611 NPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSV 670
PGRTY +Y G V PFG GL Y+ +S F++++ + +V N V SV
Sbjct: 697 IPGRTYAWYTGTPVLPFGYGLHYTNFSVSFQSINTAG-----TDVATIVNNAGAVIDTSV 751
Query: 671 PELGTEFCETRKFLVTIGVKNHG---EMAGKHPVLLFVKPARRG-NGRPIKQLVGF---Q 723
F + + V N G +A + L+F+ G + P KQL + +
Sbjct: 752 ------FAT-----LVVSVHNTGGKANLASDYVGLVFLSSTNAGPSPYPNKQLAAYGRAK 800
Query: 724 SVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVV 765
SV + A ++ + L SL+RA +G I G + L +
Sbjct: 801 SVGVGATQQLTLKINLG---SLARADTNGDRWIYPGDYKLTL 839
>gi|372209074|ref|ZP_09496876.1| glycoside hydrolase family protein [Flavobacteriaceae bacterium
S85]
Length = 727
Score = 417 bits (1073), Expect = e-114, Method: Compositional matrix adjust.
Identities = 271/746 (36%), Positives = 386/746 (51%), Gaps = 85/746 (11%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
F T I +RA LVS++TL EKI+QL N+APAI RL +P Y+WW+EALHGVA GK
Sbjct: 21 FLDTDKSIEERAEILVSQMTLKEKIAQLKNTAPAISRLKVPDYDWWNEALHGVARNGK-- 78
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------MTFW 157
AT FPQ I A+FD L R+ AI EARA Y Q +G +TFW
Sbjct: 79 --------ATIFPQGIGIGATFDPDLALRVASAISTEARAKYTISQQMGNHSRYAGLTFW 130
Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
PN+NIFRDPRWGRGQET GEDP + + V++V+G+QGD N L+++AC KH+
Sbjct: 131 TPNVNIFRDPRWGRGQETFGEDPYLMTQMGVAFVKGLQGDDPN------YLKSAACAKHY 184
Query: 218 TAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCAD 277
+ + + R +F+A T QDL +TY P FE+ VK G+M A+N V G P A+
Sbjct: 185 AVH---SGPESLRLEFNAVPTQQDLYETYLPAFEALVKDANVEGVMPAHNAVFGAPMAAN 241
Query: 278 RNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQK 337
+ LL+ R +WGF GY+ +DC A+ I Y S A LKAG ++NCG+ K
Sbjct: 242 KFLLTDVLRDRWGFDGYVVTDCGAIKQIKVGHKYVDSEVAAAAVALKAGTNLNCGA-TYK 300
Query: 338 HTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLAL 397
K A+ Q + E + LF R RLG+F+ + + P+ KIG +++ S H LA
Sbjct: 301 ELKKAIDQGLVTEELVHERTKQLFKTRFRLGMFDKDLSKNPYSKIGPELIHSKEHIELAR 360
Query: 398 QAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQN 457
+AAQ IV+LKN + LLPLP V + GP ANS+ L+G+Y G S +T L + +
Sbjct: 361 EAAQKSIVMLKNKNNLLPLPTDIKVPY-VTGPFANSSDMLMGSYYGVSPGVVTILAGITD 419
Query: 458 YVE---NTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL-------- 506
V + Y G + + A ++A +D + ++GL +E E +
Sbjct: 420 AVSLGTSLNYRSGALPFQKNINPKNWAPNVAGMSDVTICVVGLTADREGEGVDAIASNHK 479
Query: 507 -DRVDLVLPGRQQELITRVAEAAK-KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGE 564
DR+DL LP Q + ++A K KP++LV+ G PV + + + +IL YPGE
Sbjct: 480 GDRLDLKLPENQINYVKQLAAKKKDKPLVLVIASGSPV--SLEGIEEHCDAILQIWYPGE 537
Query: 565 AGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFYEGKE 623
G A+A+V+FG +P G LPMT +P+ ++P D M+ GRTY++ +
Sbjct: 538 QGGNAVADVLFGKVSPTGHLPMT-FPKSVAQLPDYKDYSMK--------GRTYKYMTEEP 588
Query: 624 VFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKF 683
+FPFG GL+YSK +K V KL +S
Sbjct: 589 MFPFGFGLTYSKTEFKNLVVEDAKLRKKES-----------------------------L 619
Query: 684 LVTIGVKNHGEMAGKHPVLLFVKPA--RRGNGRPIKQLVGFQSVILNAKEKAEIVFELSP 741
V++ V N G+ V L++ P + G G P L F+ V L E ++ F + P
Sbjct: 620 KVSVEVTNVGDFDIDEIVQLYISPKSQKEGEGLPFTTLKAFKRVALKKGETQKVEFTIHP 679
Query: 742 CESLSRAREDGLMVIEEGTHFLVVGD 767
ESL G V +G + + VG+
Sbjct: 680 -ESLKVINVKGQKVWRKGAYKVTVGN 704
>gi|288924872|ref|ZP_06418809.1| beta-glucosidase [Prevotella buccae D17]
gi|288338659|gb|EFC77008.1| beta-glucosidase [Prevotella buccae D17]
Length = 721
Score = 416 bits (1070), Expect = e-113, Method: Compositional matrix adjust.
Identities = 261/735 (35%), Positives = 385/735 (52%), Gaps = 85/735 (11%)
Query: 52 ISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIR 111
+S+ A+++++R+T+ EKISQL+N +PAI LGI Y+WWSE LHGV G+
Sbjct: 30 MSRHAKEIIARMTVSEKISQLMNESPAIEHLGIKPYDWWSEGLHGVGRDGR--------- 80
Query: 112 GATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-------GMTFWAPNINIF 164
AT FPQ I A+FD L IG A+ E RA +N + + G+TFW+PN+NIF
Sbjct: 81 -ATVFPQPIALGATFDEALVREIGDAVATEGRAKFNVARKLKNYSRNAGLTFWSPNVNIF 139
Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
RDPRWGRG ET GEDPL++G +YVRG+QGD L+ AC KH Y + +
Sbjct: 140 RDPRWGRGMETYGEDPLLSGMLGTAYVRGLQGDD------AFYLKTGACAKH---YAVHS 190
Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
TR++ D + +DL +TY P F+ V+QGR +M AYNRV G P + LL+
Sbjct: 191 GPEGTRHEADIHPSRRDLFETYLPQFKMLVQQGRVEAVMSAYNRVYGEPCGGSKYLLTDI 250
Query: 285 ARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVK 344
R+ WGF+G+I SDCDA++ Y Y K+PE+A +KAG++V CG K + A+
Sbjct: 251 LRKSWGFNGHIVSDCDAINDFYGGHRYVKTPEEACAAAIKAGLNVECGHTF-KAMQGALD 309
Query: 345 QKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGI 404
Q L E+++DRAL L R++LG+ + P+ +CSPAH LAL+AA + +
Sbjct: 310 QGLLAEADLDRALFPLVMTRLKLGILEPDSAC-PYNSYDESEICSPAHTALALRAADEAM 368
Query: 405 VLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV- 463
VLLKN +G+LPL K+ +L + GP A+ A L+GNY G S R T LQ + + V +
Sbjct: 369 VLLKN-NGILPLDKNIR-TLFVAGPGASDAFYLMGNYFGLSNRYSTYLQGIVSRVSSGTS 426
Query: 464 --YYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---------LDRVDLV 512
+ P + ++ AV+ A A+ +++MG + E EE DRV +
Sbjct: 427 VNFRPAFMQITEELNDMNWAVNEACAAEVAIVVMGNNGNMEGEEGEAIASASRGDRVGIG 486
Query: 513 LPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAE 572
LP Q + RV +++VL G P+D+ + + +++ A YPG+ G AL +
Sbjct: 487 LPASQMNYLRRVKARKGGRIVVVLTGGSPIDLR--EISKLADAVVMAWYPGQEGGEALGD 544
Query: 573 VIFGDHNPGGRLPMTWYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGL 631
++FGD N GRLP+T +P D +P D M GRTY++ G ++PFG GL
Sbjct: 545 LLFGDKNFSGRLPIT-FPADVDSLPAFDDYSMN--------GRTYKYMSGNVMYPFGYGL 595
Query: 632 SYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKN 691
SY + +Y + + V K L E T N
Sbjct: 596 SYGRVTY--------------------TDARVVGRIKKGEPLAVEVVLT----------N 625
Query: 692 HGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRARED 751
+G+ ++ G G P+ LVGF+ V + K + VF++ P E L + D
Sbjct: 626 NGDRTIDEVAQAYIATPTAGKGSPMASLVGFRRVSIPPKSSVKAVFKIVP-ERLMTIQSD 684
Query: 752 GLMVIEEGTHFLVVG 766
G + +G + L +G
Sbjct: 685 GSSKLLKGNYTLTIG 699
>gi|182415033|ref|YP_001820099.1| Beta-glucosidase [Opitutus terrae PB90-1]
gi|177842247|gb|ACB76499.1| Beta-glucosidase [Opitutus terrae PB90-1]
Length = 905
Score = 416 bits (1069), Expect = e-113, Method: Compositional matrix adjust.
Identities = 262/731 (35%), Positives = 386/731 (52%), Gaps = 98/731 (13%)
Query: 48 TTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFN 107
++ P+ RA DL+ R++L EK+SQL N+AP IPRLG+PAY++W+EA HG+A G
Sbjct: 208 SSKPLRVRADDLIRRMSLAEKVSQLKNAAPGIPRLGLPAYDYWNEAAHGIANNGI----- 262
Query: 108 GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN---------AGQAIGMTFWA 158
AT FPQ I AA+++ L ++ G IG+E RA +N + G+T+WA
Sbjct: 263 -----ATVFPQAIGAAAAWNPALLHQEGTVIGIEGRAKFNDYANRHNGDSKWWTGLTYWA 317
Query: 159 PNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFT 218
PNIN+FRDPRWGRGQET GEDP +T + + +V+GVQGD + A AC KH
Sbjct: 318 PNINLFRDPRWGRGQETYGEDPFLTAEIGIEFVKGVQGDD------PRYMLAMACAKH-- 369
Query: 219 AYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADR 278
Y + + TR+ F+A + +DL DTY P FE V++G+ +G+M AYN VNG+P+ A+
Sbjct: 370 -YAVHSGPERTRHSFNAEIPERDLFDTYLPHFERVVREGKVAGVMSAYNAVNGVPASANS 428
Query: 279 NLLSKTARRQWGFHGYITSDCDAVSIIYDAE--GYAKSPEDAVVDVLKAGMDVNCGSFLQ 336
LL++ R++WGF GY+ SDCDA+ IY + Y K+ E+A +KAG ++ CG
Sbjct: 429 FLLTELLRKRWGFEGYVPSDCDAIRDIYGEKQHHYVKTAEEAAALAVKAGCNLCCGGDYN 488
Query: 337 KHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQ-PFGKIGADVVCSPAHQVL 395
+ AV+Q + E ++D AL++ R RLGLF +P Q PF PAH +
Sbjct: 489 ALVR-AVQQGLVTEKDLDGALYHTLWTRFRLGLF--DPAEQVPFSGYTLKDNDLPAHSQV 545
Query: 396 ALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL 455
AL+ A+ IVLLKN G LPL ++K +A+IGPNA S L GNY G + RSI+ L +
Sbjct: 546 ALELARQAIVLLKND-GTLPLDRTKLKQIAVIGPNAASKSMLEGNYHGSASRSISILDDI 604
Query: 456 QNYVENTVYY-----------PGCDTVACSSASIDKAV--------DIAKGADHVVLMMG 496
+N V + + PG + + D+ V +A AD ++ + G
Sbjct: 605 RNLVGSEIKITHAMGSPVTTKPGTAPWSGQDNTTDRPVAELKAEALKLAAEADAIIYVGG 664
Query: 497 LDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSI 556
+ QE E DR + LP Q++LI R A KPV++V G + +T+ D N+ +I
Sbjct: 665 ITPAQEGESFDRESIELPSEQEDLI-RALHATGKPVVMVNCSGSAMALTW--QDENLPAI 721
Query: 557 LWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTY 616
+ A YPG+ G A+AEV+FG+ NP G LP+T+Y +D M+ RTY
Sbjct: 722 VQAWYPGQEGGRAVAEVLFGETNPSGHLPITFYRSTADLPDFSDYSMK--------NRTY 773
Query: 617 RFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTE 676
R++ G+ ++ FG GLSYS + Y + + P
Sbjct: 774 RYFTGRPLYAFGHGLSYSTFEY--------------------------ANLRVAP----- 802
Query: 677 FCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIV 736
VT+ + N G+ G V L+ P + ++ L GF+ + A E +
Sbjct: 803 -AANGALTVTLDLTNSGKRDGDDVVQLYATPPASSQPQELRALCGFRRTHVKAGETRTVT 861
Query: 737 FELSPCESLSR 747
+ P +L R
Sbjct: 862 VTV-PAVALRR 871
>gi|255572559|ref|XP_002527213.1| Thermostable beta-glucosidase B, putative [Ricinus communis]
gi|223533389|gb|EEF35139.1| Thermostable beta-glucosidase B, putative [Ricinus communis]
Length = 454
Score = 416 bits (1069), Expect = e-113, Method: Compositional matrix adjust.
Identities = 216/454 (47%), Positives = 300/454 (66%), Gaps = 9/454 (1%)
Query: 327 MDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADV 386
MD+NCGS+ ++ ++AV + KL E +IDRAL NLFSV++RLGLF+G+ F K+G +
Sbjct: 1 MDINCGSYAIRNAQSAVDKGKLREEDIDRALLNLFSVQLRLGLFDGDRINGHFSKLGPED 60
Query: 387 VCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSC 446
VC+ H+ LAL+AA+ GIVLLKN LPL K SLA+IGP AN+ +L G+Y G SC
Sbjct: 61 VCTEEHKKLALEAARQGIVLLKNEKKFLPLNKKAVSSLAIIGPLANNGGSLGGDYTGYSC 120
Query: 447 RSITPLQALQNYVENTVYYPGCDTVACSSAS-IDKAVDIAKGADHVVLMMGLDQTQEKEE 505
+ +Q Y++ T Y GC V+C S +A+ IAK AD V+++ G+D +QE E+
Sbjct: 121 NPQSLFDGVQAYIKRTSYAVGCSNVSCDSDDQFPEAIHIAKTADFVIVVAGIDLSQETED 180
Query: 506 LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEA 565
DR+ L+LPG+Q L++ VA A+KKPVILVL GGPVD++FAK D I SILW GYPGEA
Sbjct: 181 RDRISLLLPGKQMALVSYVAAASKKPVILVLTGGGPVDVSFAKRDSRIASILWIGYPGEA 240
Query: 566 GAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVF 625
GA ALA++IFG++NPGGRLPMTWYP+ + VPM DM MR G PGRTYRFY G+ V+
Sbjct: 241 GAKALADIIFGEYNPGGRLPMTWYPESFTNVPMNDMNMRANPNRGYPGRTYRFYTGERVY 300
Query: 626 PFGCGLSYSKYSYKFKAVSQNKLYLNQS---SSTKMVENQ--DVVHYKSVPELGTEFCET 680
FG GLSY+ Y+YKF + + +KL L+ S +S K + +Q D + Y + E+ + C +
Sbjct: 301 GFGEGLSYTNYAYKFLS-APSKLSLSGSLTATSRKRILHQRGDRLDYIFIDEISS--CNS 357
Query: 681 RKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELS 740
+F V I V N G+M G H V+LF + + G P KQLVGF+ + + + E L
Sbjct: 358 LRFTVQISVMNVGDMDGSHVVMLFSRVPQVSEGTPEKQLVGFERINTVSHKSTETSILLD 417
Query: 741 PCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
PC+ LS A G ++ G+H L++GD ++ ++I
Sbjct: 418 PCKHLSIANGQGKRIMPVGSHVLLLGDLQHFVTI 451
>gi|348684865|gb|EGZ24680.1| family 3 glycoside hydrolase [Phytophthora sojae]
Length = 769
Score = 416 bits (1068), Expect = e-113, Method: Compositional matrix adjust.
Identities = 265/685 (38%), Positives = 369/685 (53%), Gaps = 64/685 (9%)
Query: 39 STETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPR-----LGIPAYEWWSEA 93
+ + PFC T+L + R DL+SRL L EK + L +A A PR +G+P Y W +
Sbjct: 30 TNQELPFCNTSLSTADRVEDLLSRLPLQEKATLL--TARASPRGNMSSIGLPEYNWGANC 87
Query: 94 LHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARAL-------- 145
+HGV GT TSFP + A FD + + + Q IG E RAL
Sbjct: 88 VHGVQSTC------GT-NCPTSFPNPVNLGAIFDPQVVFDMAQVIGWELRALWLEGATEN 140
Query: 146 YNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLK 205
Y G +G+ W+PNINI RDPRWGR ETP EDPLV KY V+Y RG+Q +
Sbjct: 141 YKGGPHLGLDCWSPNININRDPRWGRNTETPSEDPLVNSKYGVAYTRGLQEGKRQDPRF- 199
Query: 206 GKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCA 265
LQA KH+ AY +N+ G R +FDA V+ D ADTY P F S V G A G+MC+
Sbjct: 200 --LQAVVTLKHYAAYSYENYGGVNRMEFDAIVSPYDFADTYFPAFRSSVVDGNAKGVMCS 257
Query: 266 YNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKA 325
YN VNGIP CA++ L+ R GF GY+TSD AV I D YA S +A + A
Sbjct: 258 YNSVNGIPMCANKELVETLLRGTLGFDGYVTSDSGAVEAISDMHHYADSQCEAARLAILA 317
Query: 326 GMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD 385
G D+N G + K V +L E +D AL + +R LGLF+ QP+ +
Sbjct: 318 GTDINSGKSYEACLKTLVDDNQLEEKALDDALRHTLKLRFELGLFDPIDD-QPYWNVTPS 376
Query: 386 VVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPS 445
V + A + L+L A + +V+L+N+ +LPL K V LA++GP+A S + LLGNY G
Sbjct: 377 EVNTAAAKALSLNATRKSLVMLQNNASVLPL--QKGVKLAVLGPHAKSKRGLLGNYLGQM 434
Query: 446 CRS--------ITPLQALQ--NYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMM 495
C TPL A++ N NT + GC S+A +KAV AK AD VVL +
Sbjct: 435 CHGDYDEVGCVQTPLDAIRAANGASNTTFAEGCGISGNSTAGFEKAVAAAKEADAVVLFL 494
Query: 496 GLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGS 555
G+D++ E E DR ++ LP Q +L+ RV A +P ++VL+ GG I + +
Sbjct: 495 GIDKSIEGEVGDRNNIDLPNIQMQLLQRV-HAVGRPTVVVLINGGV--IGAEEIIERTDA 551
Query: 556 ILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGR 614
++ A YPG GA A+A+V+FGD NP G+LP+T Y DY+ +V M M M + +PGR
Sbjct: 552 LVEAFYPGFFGARAMADVLFGDTNPSGKLPVTMYRSDYVDQVEMKSMDM-----TAHPGR 606
Query: 615 TYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELG 674
TYR+++G+ VFPFG GLSY+ +S + + + + N ++ + G
Sbjct: 607 TYRYFKGEPVFPFGWGLSYTTFSLSVDSGTNSSSHSNNAAFS-----------------G 649
Query: 675 TEFCETRKFLVTIGVKNHGEMAGKH 699
E +T +++ VKN GE+AG
Sbjct: 650 GEVSDTANVTISVVVKNDGEVAGDE 674
>gi|402074909|gb|EJT70380.1| hypothetical protein GGTG_11406 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 793
Score = 415 bits (1066), Expect = e-113, Method: Compositional matrix adjust.
Identities = 275/743 (37%), Positives = 398/743 (53%), Gaps = 54/743 (7%)
Query: 46 CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
C +L S+RA LV+ L + EK++ LV++A R+G+P Y WWSEALHGVA G
Sbjct: 44 CDRSLSPSERAAALVAALNVTEKMANLVSNANGSARIGLPKYNWWSEALHGVA-YAPGTQ 102
Query: 106 FN---GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
F G +TSFP +L AASFD L +IG IG E+RA N G+ G+ +W PN+N
Sbjct: 103 FRRGPGDFNSSTSFPMPLLLAASFDDSLIEKIGDVIGTESRAFGN-GRWSGLDYWTPNVN 161
Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL 222
F+DPRWGRG ETPGED L +YA S ++G++G + + + + CKH+ A D
Sbjct: 162 PFKDPRWGRGSETPGEDILRIKRYAASMIKGLEGP-----HPEKERRVVSTCKHYAANDF 216
Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
++W GT+R+ FDAR++ QDLA+ Y PF+ C + R IMCAYN VNG+PSCA+ LL
Sbjct: 217 EDWNGTSRHDFDARISAQDLAEYYLMPFQQCARDSRVGSIMCAYNAVNGVPSCANSYLLD 276
Query: 283 KTARRQWGFHG---YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT 339
R+ WG+ G Y+TSDC+AV + YA++ + +AG D +C
Sbjct: 277 TVLRKHWGWTGHNNYVTSDCEAVLDVSAGHKYARTNAEGTAMCFEAGTDTSCEYTPSSDI 336
Query: 340 KAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQA 399
+ A Q L E +DRAL L+ +R+G F+GN + F I V +PA Q L+LQ+
Sbjct: 337 RGAYAQGLLREETMDRALLRLYEGLVRVGYFDGNSSA--FSDISWADVNAPAAQDLSLQS 394
Query: 400 AQDGIVLLKNSHGLLPLPKSKSV-------------SLALIGPNANSAKTLLGNYAGPSC 446
A +GIV+LKN G LPLP LA+IG A++ + L G Y+G +
Sbjct: 395 AVEGIVMLKND-GTLPLPLGAKCSSKSKKRSSSGGPKLAMIGFWADAPEKLRGGYSGTAA 453
Query: 447 RSITPLQALQNYVENTVYYPGCDTVACSSASIDK----AVDIAKGADHVVLMMGLDQTQE 502
TP A + + V G ++A+ D A+ A+GAD++V GLD+T
Sbjct: 454 YLRTPAYAARQMGLDVVTAGGPVLQGAAAAAADNWTAPALAAAEGADYIVYFGGLDETAA 513
Query: 503 KEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYP 562
E DR D+ PG Q L+ R+A A KP+++V + G +D T + +G++LWA +P
Sbjct: 514 GENKDRWDVEWPGAQLALVKRLA-ALGKPLVVVQM-GDQLDGTPLLANAGVGAVLWASWP 571
Query: 563 GEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK-VPMTDMKMRPQATSGNPGRTYRFYEG 621
G+ G A+ ++ G +P GRLP+T YP +Y + VPMT+M +RP A+ PGRTYR+Y
Sbjct: 572 GQDGGPAVMRLLSGAASPAGRLPVTQYPANYTRLVPMTEMALRPSASGSRPGRTYRWYS- 630
Query: 622 KEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETR 681
V PFG GL Y+ ++ V + +++ ++E H E C
Sbjct: 631 TPVLPFGFGLHYTNFTPAVT-VPPALAAASGVTTSSLLEACRDPH--------PERCALP 681
Query: 682 KFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGF---QSVILNAKEKAEIVFE 738
V V N G A + L FV RPIK L + + V +A++ +
Sbjct: 682 PLRVA--VANTGRRASDYVALAFVSGDYGPRPRPIKTLAAYARLRGVRAGGSAEADLAWT 739
Query: 739 LSPCESLSRAREDGLMVIEEGTH 761
L ++R EDG V+ GT+
Sbjct: 740 LG---DIARHDEDGNTVLYPGTY 759
>gi|292495632|sp|Q0CMH8.2|XYND_ASPTN RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
Length = 793
Score = 415 bits (1066), Expect = e-113, Method: Compositional matrix adjust.
Identities = 263/744 (35%), Positives = 396/744 (53%), Gaps = 39/744 (5%)
Query: 36 SNPSTETFPFCKT-----TLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWW 90
S P E P KT + RA LVS TL+E ++ N+ +PRLG+P Y+ W
Sbjct: 48 SFPDCENGPLSKTLVCDKSARPHDRAAALVSMFTLEELVNNTGNTGTGVPRLGLPKYQVW 107
Query: 91 SEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ 150
SE+LHGV G ATSFPQ ILT A+ + L ++IG + +ARA N G+
Sbjct: 108 SESLHGVYRANWAS--EGDYSWATSFPQPILTMAALNRTLIHQIGDILSTQARAFSNVGR 165
Query: 151 AIGMTFWAPNINIFRDPRWGRGQETPGEDPL-VTGKYAVSYVRGVQGDTFNGGKLKGKLQ 209
G+ +APNIN FR P WGRGQETPGED + YA Y+ G+QG G L+
Sbjct: 166 -YGLDTYAPNINSFRHPVWGRGQETPGEDAYYLASTYAYEYITGIQG-----GVDPETLK 219
Query: 210 ASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRV 269
A KH+ YD++NW G +R D ++T QDL++ Y P F + + +MC+YN V
Sbjct: 220 LVATAKHYAGYDIENWDGHSRLGNDMQITQQDLSEYYTPQFLVSARDAKVHSVMCSYNAV 279
Query: 270 NGIPSCADRNLLSKTARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGM 327
NG+PSC++ L R +GF GY++ DC AV ++ YA + A D ++AG
Sbjct: 280 NGVPSCSNSFFLQTLLRETFGFVEDGYVSGDCGAVYNAFNPHEYAANESSASADSIRAGT 339
Query: 328 DVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVV 387
D++CG+ Q H A + ++ +I+R + L++ +RLG F+GN + Q +DV
Sbjct: 340 DIDCGTSYQYHFTNAFDEGEISRQDIERGVIRLYTNLVRLGYFDGNSS-QYRDLTWSDVQ 398
Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
+ A + + +AA +G VLLKN G LPL S S+ALIGP AN+ + GNY GP+
Sbjct: 399 TTDAWNI-SHEAAVEGTVLLKND-GTLPLADSIR-SVALIGPWANATTQMQGNYYGPAPY 455
Query: 448 SITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELD 507
+PL AL+ + Y G + + ++A A+ A+ AD ++ G+D T E E LD
Sbjct: 456 LTSPLAALEASDLDVHYAFGTNISSTTTAGFADALAAARKADAIIFAGGIDNTIEGEALD 515
Query: 508 RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGA 567
R+++ PG Q +LI +++ A KP++++ + GG VD + K++ N+ ++LW GYPG++G
Sbjct: 516 RMNITWPGNQLDLINQLS-ALGKPLVVLQMGGGQVDSSALKHNTNVSALLWGGYPGQSGG 574
Query: 568 VALAEVIFGDHNPGGRLPMTWYPQDY-IKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFP 626
AL ++I G P GRL T YP Y + P DM +RP T NPG+TY +Y G V+
Sbjct: 575 TALLDIIRGVRAPAGRLVTTQYPAGYATQFPAIDMGLRPNGT--NPGQTYMWYTGTPVYE 632
Query: 627 FGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFL-V 685
FG GL Y+ + K + + N N +D++ + P G + + R FL
Sbjct: 633 FGHGLFYTTFEAKRASTATNHSSFNI---------EDLL---TAPHPGYAYPQLRPFLNF 680
Query: 686 TIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSV-ILNAKEKAEIVFELSPCES 744
T + N G + +LF P K LVGF + L + F ++ ++
Sbjct: 681 TAHITNTGRTTSDYTAMLFANTTAGPAPHPNKWLVGFDRLGALEPGASQTMTFPIT-IDN 739
Query: 745 LSRAREDGLMVIEEGTHFLVVGDE 768
++R E G V+ G + L + +E
Sbjct: 740 VARTDELGNRVLYPGRYELALNNE 763
>gi|345519864|ref|ZP_08799275.1| beta-glucosidase [Bacteroides sp. 4_3_47FAA]
gi|254836262|gb|EET16571.1| beta-glucosidase [Bacteroides sp. 4_3_47FAA]
Length = 736
Score = 415 bits (1066), Expect = e-113, Method: Compositional matrix adjust.
Identities = 277/791 (35%), Positives = 406/791 (51%), Gaps = 110/791 (13%)
Query: 5 KLSLVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLT 64
K+SL F LCF S + E PF LP+ R +DLV+RLT
Sbjct: 3 KISLYF---VLCFWSF--------------GAQAQVENLPFRNADLPLEVRVKDLVARLT 45
Query: 65 LDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAA 124
L+EK+ + + +PAIPRLGIPAY+WW+EALHGVA T+ T FPQ I AA
Sbjct: 46 LEEKVLLMQHHSPAIPRLGIPAYDWWNEALHGVAR---------TLEKVTVFPQAIGMAA 96
Query: 125 SFDSYLWYRIGQAIGLEARALYN----AGQA----IGMTFWAPNINIFRDPRWGRGQETP 176
+FD+ ++G E RAL+N AG+ G+T+W PNINIFRDPRWGRGQET
Sbjct: 97 TFDTEALQKMGDITSTEGRALFNEDWKAGKTGTRYRGLTYWTPNINIFRDPRWGRGQETY 156
Query: 177 GEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDAR 236
GEDP +T K + VRG++G+ + L++ AC KH+ + + R+ FDAR
Sbjct: 157 GEDPYLTAKMGAAIVRGLEGEDPH------YLKSVACAKHYAVHSGPEY---NRHSFDAR 207
Query: 237 VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYIT 296
++ DL DTY P F V + + G+MCAYNR+NG P C + LL R QW F GY+T
Sbjct: 208 PSVFDLWDTYMPAFRELVTKAKVHGVMCAYNRLNGQPCCGNDPLLVDILRNQWHFDGYVT 267
Query: 297 SDCDAVSIIYDAEGYAKSPED--AVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEID 354
SDC A+ AE + PE A+ D L AG D+ CG+ L VK+ E +I+
Sbjct: 268 SDCWALKDF--AEFHKTHPEHTIAMSDALLAGTDLECGN-LYHLLAEGVKKGLHSERDIN 324
Query: 355 RALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLL 414
+L LF++ ++G+F+ + P+ IG +V+ AH+ A + A++ IVLL+N + +L
Sbjct: 325 VSLSRLFTILFKIGMFDPAERV-PYSSIGREVLECEAHKQHAERMAKESIVLLENKNHIL 383
Query: 415 PLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV---YYPGCDTV 471
PL SK S+ALIGPNA++ +T L NY G +TP +L+ + + + Y PG V
Sbjct: 384 PLDASKIKSIALIGPNADNGQTQLANYFGTPSEIVTPYMSLKRRLGDKIKINYLPGVGIV 443
Query: 472 --ACSSASIDKAVDIAKGADHVVLMMGLDQTQE-------------KEELDRVDLVLPGR 516
+ S + A +D +V + G+ E DR + LP
Sbjct: 444 DKLKDAPSFVQVAHKAAQSDVIVFVSGISADYEGEAGDAGAAGYGGFASGDRTTMQLPLV 503
Query: 517 QQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFG 576
Q EL+ ++ + +P+I+V + G ++F +N ++L A Y G+A A+ +V+FG
Sbjct: 504 QIELLKKLKKTG-RPLIIVNMSGSV--MSFEWESQNADALLQAWYGGQAAGDAIVDVLFG 560
Query: 577 DHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKY 636
NP GR+P+T Y D P + M GRTYR+++G+ +PFG GLSY+ +
Sbjct: 561 HCNPAGRMPLTTYKSDNDLPPFENYSML--------GRTYRYFKGEPRYPFGYGLSYTTF 612
Query: 637 SYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMA 696
+Y S + V+ T +T + VT+ V N G+
Sbjct: 613 AY---------------SDVQCVDE-------------THTGDTAR--VTVTVSNTGDCD 642
Query: 697 GKHPVLLFVKPARRGNGR-PIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMV 755
G V L+V + G + P+ L GF+ + L E + F L+P E L+ DG +V
Sbjct: 643 GDEVVQLYVVHPQDGRKQIPLCALKGFKRIHLKRGESTSVSFTLTP-EELALTETDGNLV 701
Query: 756 IEEGTHFLVVG 766
+ G L VG
Sbjct: 702 EKNGQVTLFVG 712
>gi|160881137|ref|YP_001560105.1| glycoside hydrolase family 3 [Clostridium phytofermentans ISDg]
gi|160429803|gb|ABX43366.1| glycoside hydrolase family 3 domain protein [Clostridium
phytofermentans ISDg]
Length = 717
Score = 413 bits (1062), Expect = e-112, Method: Compositional matrix adjust.
Identities = 264/749 (35%), Positives = 388/749 (51%), Gaps = 90/749 (12%)
Query: 47 KTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFF 106
KT QRA +LV ++TL+EK+ Q ++SAP+IPRL I AY +W+EALHGVA G
Sbjct: 5 KTQQEFQQRATELVKKMTLEEKVFQTLHSAPSIPRLDIKAYNYWNEALHGVARAGV---- 60
Query: 107 NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------MTFWAP 159
AT FPQ I AA+FD L I I E R +NA Q G +TFW+P
Sbjct: 61 ------ATVFPQAIGLAATFDEDLIEEIADTISTEGRGKFNAQQKYGDHDIYKGLTFWSP 114
Query: 160 NINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTA 219
N+NIFRDPRWGRG ET GEDP ++G +V G+QG + L+A+AC KHF
Sbjct: 115 NVNIFRDPRWGRGHETFGEDPFLSGTLGGRFVDGIQGHD------ETYLKAAACAKHFAV 168
Query: 220 YDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRN 279
+ + R+ F+A V+ QDL +TY P F+ VK+ + +M AYNR NG P C +
Sbjct: 169 H---SGPEDIRHSFNAEVSEQDLRETYLPAFKKLVKEHKVEAVMGAYNRTNGEPCCGSKT 225
Query: 280 LLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT 339
LL R +W F G++TSDC A+ ++ + ++V + G D+NCG+ L +
Sbjct: 226 LLEDILRGEWEFVGHVTSDCWAIKDFHEHHMVTSNAVESVALAMNRGCDLNCGN-LYVNL 284
Query: 340 KAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQA 399
AV+ + E ID AL LF+ RM+LGLF+ ++ PF I D V + + + L ++A
Sbjct: 285 LQAVRDGLVEEETIDTALIRLFTTRMKLGLFDKEESI-PFNTITYDQVDTKSSKELNIKA 343
Query: 400 AQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV 459
++ +VLLKN +LPL K S+ +IGPNAN+ L+GNY G + IT L+ ++ V
Sbjct: 344 SKKCVVLLKNEDNILPLNPKKITSVGVIGPNANNRNALVGNYEGTASEYITVLEGIKQVV 403
Query: 460 ENTV---YYPGCDTVACSSASIDKAVD-------IAKGADHVVLMMGLDQTQEKEELDRV 509
V + GC +++ + D + + +D V+ +GLD E EE D+
Sbjct: 404 PEDVRVYFSEGCHLFKNKLSNLSQENDRIAEVRAVCEHSDVVIACLGLDPGLEGEEGDQG 463
Query: 510 D---------LVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAG 560
+ L LPG Q++++ + E KPVIL+LL G + + +A D +I +IL
Sbjct: 464 NQFASGDKKTLALPGIQEDVLKTIYECG-KPVILILLSGSALAVPWA--DEHIPAILQGW 520
Query: 561 YPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYE 620
YPG G A+AE+IFGD NP G+LP+T+Y TD M+ RTYR+ +
Sbjct: 521 YPGAQGGRAIAELIFGDGNPEGKLPVTFYRTTEELPEFTDYAMK--------NRTYRYMK 572
Query: 621 GKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCET 680
+ ++PFG GLSY+ + + LY+N + K + C
Sbjct: 573 NEALYPFGYGLSYTTFEHTL-------LYVNTDTLGKGSNVE---------------CMV 610
Query: 681 RKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELS 740
R VKN G+ G +VK G P QL G + V L E+ +I+ EL
Sbjct: 611 R-------VKNTGDYEGSVTTQAYVKYV--GEDAPNCQLKGLKKVSLLPGEEKDIMIELD 661
Query: 741 PCESLSRAREDGLMVIEEGTHFLVVGDEE 769
+ E+G ++ +G + L + D +
Sbjct: 662 D-RAFGLYNEEGEFILNQGEYELYLSDSQ 689
>gi|375150455|ref|YP_005012896.1| Beta-glucosidase [Niastella koreensis GR20-10]
gi|361064501|gb|AEW03493.1| Beta-glucosidase [Niastella koreensis GR20-10]
Length = 711
Score = 413 bits (1062), Expect = e-112, Method: Compositional matrix adjust.
Identities = 265/713 (37%), Positives = 376/713 (52%), Gaps = 85/713 (11%)
Query: 37 NPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHG 96
N +T F P+ R DL+ +LTL EKIS L + + RLGIPAY WW+EALHG
Sbjct: 9 NTQAQTSVFRNPQQPMEARVNDLLHQLTLPEKISLLGYRSKEVERLGIPAYNWWNEALHG 68
Query: 97 VAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG------- 149
VA G AT FPQ I AA+F+ L I EARA YN
Sbjct: 69 VARAGV----------ATVFPQAIGMAATFNDDLLKEAATVISTEARAKYNLSLAQGRHL 118
Query: 150 QAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQ 209
Q +G+TFW+PNINIFRDPRWGRGQET GEDP +T ++V+G+QG+ L+
Sbjct: 119 QYMGLTFWSPNINIFRDPRWGRGQETYGEDPFLTAHMGTAFVKGLQGND------PRYLK 172
Query: 210 ASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRV 269
ASAC KHF + + R+ F+A V +DL +TY F + V G S +MCAYNRV
Sbjct: 173 ASACAKHFAVH---SGPENGRHTFNAIVDEKDLRETYLYAFHALVDAGVES-VMCAYNRV 228
Query: 270 NGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDV 329
N P C+ LL+ R +W F G++ +DC A+ I+ S + +KAG+++
Sbjct: 229 NDQPCCSGNFLLNSILRNEWKFKGHVVTDCGALDDIFMRHKVMPSGVEVAAAAIKAGVNL 288
Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
+C + LQK + AV+QK L E +ID +L +L +++LG ++ +PT PF K GAD V +
Sbjct: 289 DCSNVLQKDVEKAVEQKLLNEKDIDSSLAHLLRTQIKLGFYD-DPTANPFYKYGADSVAN 347
Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
AH LA AQ +VLLKNS+ LLPL K K ++ ++G N+ S LLGNY G S R++
Sbjct: 348 TAHATLARAMAQQSMVLLKNSNQLLPLDKKKYPAIMVVGTNSASMDALLGNYHGVSNRAV 407
Query: 450 TPLQALQNYVE---NTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE- 505
+ ++ + N V+ Y G D + + + A AD V ++GL E EE
Sbjct: 408 SFVEGITNAVDAGTRVEYDQGSD---YNDTTHFGGIWAAGNADITVAVIGLTPVYEGEEG 464
Query: 506 --------LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSIL 557
D+ D+ LP + + +A KKP+I V+ G VDI+ + +IL
Sbjct: 465 DAFLAAKGGDKPDMSLPAAHIAFMKALRKANKKPIIAVITAGSAVDIS--AIEPYADAIL 522
Query: 558 WAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYR 617
A YPGE G ALA+++FG +P GRLP+T+Y Q + VP D GRTYR
Sbjct: 523 LAWYPGEQGGNALADILFGKVSPAGRLPVTFY-QSFADVPAYD-------NYAMKGRTYR 574
Query: 618 FYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEF 677
++ GK +PFG GLSY+ ++Y+++ + N + +D V +
Sbjct: 575 YFNGKVQYPFGYGLSYTSFAYEWQQMPAN-----------IRTAKDSVSF---------- 613
Query: 678 CETRKFLVTIGVKNHGEMAGKHPVLLFVK-PARRGNGRPIKQLVGFQSVILNA 729
+I VKN G M G V ++V+ PA P+K+L F+ V + A
Sbjct: 614 --------SIKVKNTGSMDGDEVVQVYVEYPAVE--RMPLKELKAFKRVHVKA 656
>gi|373460527|ref|ZP_09552278.1| hypothetical protein HMPREF9944_00542 [Prevotella maculosa OT 289]
gi|371955145|gb|EHO72949.1| hypothetical protein HMPREF9944_00542 [Prevotella maculosa OT 289]
Length = 699
Score = 413 bits (1061), Expect = e-112, Method: Compositional matrix adjust.
Identities = 263/735 (35%), Positives = 386/735 (52%), Gaps = 86/735 (11%)
Query: 52 ISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIR 111
+ Q+AR L++ +TLDEKISQ++N P IPRLGI Y+WW+E LHGV G+
Sbjct: 9 MQQKARRLINMMTLDEKISQMMNETPGIPRLGIKPYDWWNEGLHGVGRDGR--------- 59
Query: 112 GATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTFWAPNINIF 164
AT FPQ I A+F+ L +IG AI E RA YN Q G+TFW+PNINIF
Sbjct: 60 -ATVFPQPIGMGATFNPALIRQIGDAIATEGRAKYNVAQRNNNYARYTGLTFWSPNINIF 118
Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASACCKHFTAYDLD 223
RDPRWGRG ET GEDP +TG ++YV+G+QG D F L+ +AC KH Y +
Sbjct: 119 RDPRWGRGMETYGEDPFLTGTLGIAYVQGMQGNDPF-------YLKVAACGKH---YAVH 168
Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
+ TR++ + T +DL +TY P F+ V+QG IM AYNRV G + LL+
Sbjct: 169 SGPEATRHEANVSPTKRDLFETYLPAFKMLVQQGHVEAIMGAYNRVYGEACSGSKYLLTD 228
Query: 284 TARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAV 343
R+QWGF G+I SDCDAV+ I+ K+ +A +KAG+++ CG + K AV
Sbjct: 229 VLRKQWGFRGHIVSDCDAVADIHAGHKIVKTEAEACAIAIKAGLNIECGHTFEA-MKQAV 287
Query: 344 KQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDG 403
QK L E EIDRAL L R++LG+ + P+ ++ +CSP H LA +AA +
Sbjct: 288 AQKLLTEQEIDRALLPLMMTRLKLGILEYDAEC-PYNEVKETEICSPEHIALARKAATES 346
Query: 404 IVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV 463
+VLLKN +G+LPL K+ +L + GP A+ + L+GNY G S R T LQ + + V +
Sbjct: 347 MVLLKN-NGILPLDKNLH-TLFIAGPGASDSFWLMGNYFGISNRYCTYLQGIADKVSSGT 404
Query: 464 ---YYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---------LDRVDL 511
+ P + +I+ A+D A A+ +++MG + E EE DRV +
Sbjct: 405 AVNFRPAFGESTPTKNTINWALDEAIAAEKTIVVMGNNGNLEGEEGESIASETRGDRVSM 464
Query: 512 VLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALA 571
LP Q + + R +A K +++VL G P+D+ + R +++ A YPG+ G ALA
Sbjct: 465 RLPASQMKFL-RDLKARKNGIVVVLTGGSPIDVR--EISRLADAVVMAWYPGQEGGYALA 521
Query: 572 EVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGL 631
+++FGD N GRLP+T+ P D M+ GRTY++ +PFG GL
Sbjct: 522 DLLFGDENFSGRLPVTFPESTDALPPFEDYAMK--------GRTYKYQTAHIQYPFGYGL 573
Query: 632 SYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKN 691
SY+ +Y V +++P+ G R V+ +KN
Sbjct: 574 SYTTVTYAHAKV------------------------ETMPQKG------RGMTVSAVLKN 603
Query: 692 HGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRARED 751
G A +++ G + LV F+ + L E+ + F++ P + L +ED
Sbjct: 604 TGNKAVDEVAQVYLSAPGAGTTAALASLVAFKRIGLQPGEQQLVRFDI-PFDRLLTVQED 662
Query: 752 GLMVIEEGTHFLVVG 766
G + +G + + VG
Sbjct: 663 GTAQLLKGNYTITVG 677
>gi|315607899|ref|ZP_07882892.1| beta-glucosidase [Prevotella buccae ATCC 33574]
gi|315250368|gb|EFU30364.1| beta-glucosidase [Prevotella buccae ATCC 33574]
Length = 721
Score = 412 bits (1059), Expect = e-112, Method: Compositional matrix adjust.
Identities = 260/735 (35%), Positives = 384/735 (52%), Gaps = 85/735 (11%)
Query: 52 ISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIR 111
+S+ A+++++R+T+ EKISQL+N +PAI LGI Y+WWSE LHGV G+
Sbjct: 30 MSRHAKEIIARMTVSEKISQLMNESPAIEHLGIKPYDWWSEGLHGVGRDGR--------- 80
Query: 112 GATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-------GMTFWAPNINIF 164
AT FPQ I A+FD L IG A+ E RA +N + + G+TFW+PN+NIF
Sbjct: 81 -ATVFPQPIALGATFDEALVREIGDAVATEGRAKFNVARKLKNYSRNAGLTFWSPNVNIF 139
Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
RD RWGRG ET GEDPL++G +YVRG+QGD L+ AC KH Y + +
Sbjct: 140 RDLRWGRGMETYGEDPLLSGMLGTAYVRGLQGDD------AFYLKTGACAKH---YAVHS 190
Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
TR++ D + +DL +TY P F+ V+QGR +M AYNRV G P + LL+
Sbjct: 191 GPEGTRHEADIHPSRRDLFETYLPQFKMLVQQGRVEAVMSAYNRVYGEPCGGSKYLLTDI 250
Query: 285 ARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVK 344
R+ WGF+G+I SDCDA++ Y Y K+PE+A +KAG++V CG K + A+
Sbjct: 251 LRKSWGFNGHIVSDCDAINDFYGGHRYVKTPEEACAAAIKAGLNVECGHTF-KAMQGALD 309
Query: 345 QKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGI 404
Q L E+++DRAL L R++LG+ + P+ +CSPAH LAL+AA + +
Sbjct: 310 QGLLAEADLDRALFPLVMTRLKLGILEPDSAC-PYNSYDESEICSPAHTALALRAADEAM 368
Query: 405 VLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV- 463
VLLKN +G+LPL K+ +L + GP A+ A L+GNY G S R T LQ + + V +
Sbjct: 369 VLLKN-NGILPLDKNIR-TLFVAGPGASDAFYLMGNYFGLSNRYSTYLQGIVSRVSSGTS 426
Query: 464 --YYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---------LDRVDLV 512
+ P + ++ AV+ A A+ +++MG + E EE DRV +
Sbjct: 427 VNFRPAFMQITEELNDMNWAVNEACAAEVAIVVMGNNGNMEGEEGEAIASASRGDRVGIG 486
Query: 513 LPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAE 572
LP Q + RV +++VL G P+D+ + + +++ A YPG+ G AL +
Sbjct: 487 LPASQLNYLRRVKARKGGRIVVVLTGGSPIDLR--EISKLADAVVMAWYPGQEGGEALGD 544
Query: 573 VIFGDHNPGGRLPMTWYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGL 631
++FGD N GRLP+T +P D +P D M GRTY++ G ++PFG GL
Sbjct: 545 LLFGDKNFSGRLPIT-FPADVDSLPAFDDYSMN--------GRTYKYMSGNVMYPFGYGL 595
Query: 632 SYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKN 691
SY + +Y + + V K L E T N
Sbjct: 596 SYGRVTY--------------------TDARVVGRIKKGEPLAVEVVLT----------N 625
Query: 692 HGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRARED 751
+G+ ++ G G P+ LVGF+ V + K + VF++ P E L + D
Sbjct: 626 NGDRTIDEVAQAYIATPTAGKGSPMASLVGFRRVSIPPKSSVKAVFKIVP-ERLMTVQSD 684
Query: 752 GLMVIEEGTHFLVVG 766
G + +G + L +G
Sbjct: 685 GSSKLLKGNYTLTIG 699
>gi|301090543|ref|XP_002895482.1| beta-glucosidase, putative [Phytophthora infestans T30-4]
gi|262098232|gb|EEY56284.1| beta-glucosidase, putative [Phytophthora infestans T30-4]
Length = 809
Score = 412 bits (1058), Expect = e-112, Method: Compositional matrix adjust.
Identities = 265/737 (35%), Positives = 390/737 (52%), Gaps = 75/737 (10%)
Query: 30 PFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPR-----LGI 84
P +C S+P + F FC +L ++R DL+ RL LDEK++ L +A A P+ +G+
Sbjct: 24 PRAC--SSPEHQQFAFCNASLSTAERVEDLLRRLPLDEKVTLL--TARASPKGNMSSIGL 79
Query: 85 PAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA 144
P Y W + +HGV GT ATSFP + A FD + + Q +G E RA
Sbjct: 80 PEYNWGANCVHGVQSTC------GT-NCATSFPNPVNLGAIFDPRAVFDMAQVVGWELRA 132
Query: 145 L--------YNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG 196
L Y G +G+ W+PNINI RDPRWGR ETP EDPLV KY V+Y +G+Q
Sbjct: 133 LWLEGARENYATGPHLGLDCWSPNININRDPRWGRNMETPSEDPLVNSKYGVAYTKGLQE 192
Query: 197 DTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQ 256
GK K LQA KH+ AY +++ G R F+A V+ D ADTY P FE+ V
Sbjct: 193 -----GKDKRFLQAVVTLKHYAAYSYEHYDGIDRMAFNAVVSRYDFADTYLPAFEASVVH 247
Query: 257 GRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE 316
G+A G+MC+YN VNG+P CA+ L SK R GF GYITSD A++ IY Y K+
Sbjct: 248 GKAKGVMCSYNSVNGMPMCANEQLNSKLLRDALGFDGYITSDSGAIAGIYHQRHYTKTLC 307
Query: 317 DAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTM 376
+A + +G DVN GS ++ V +LPE +D A+ +R LGLF+
Sbjct: 308 EAGRLAILSGTDVNSGSVYKQCLAELVTSGQLPEKAVDDAMRRTLKLRFELGLFDPIDD- 366
Query: 377 QPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKT 436
QP+ + + V + + L+L ++ IVLL+N +LPL K K LA+IGP+A + +
Sbjct: 367 QPYWHVAPNEVNTAESKQLSLDLSRKSIVLLQNHGNILPLAKGK--KLAVIGPHAAAKRA 424
Query: 437 LLGNYAGPSCRS--------ITPLQALQ--NYVENTVYYPGCDTVACSSASIDKAVDIAK 486
LLGNY G C TPL+A+ N NT+Y G S+A D+A A+
Sbjct: 425 LLGNYLGQMCHGDYLEVGCVQTPLEAITIANGASNTLYAKGSGINDTSTAGFDEAEAAAR 484
Query: 487 GADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITF 546
A+ VVL +G+D + E+E DR ++ +P Q +L+ RV A KP ++VL GG V
Sbjct: 485 KAETVVLFLGIDTSIEREAWDRENIDMPNIQMQLLKRVRRAG-KPTVVVLFNGGVVGA-- 541
Query: 547 AKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQ 606
+ + ++ A YPG GA A+++++FGD P G+LP+T YP +Y+ T + M+
Sbjct: 542 EELILHTDGVVEAFYPGFFGAQAVSDILFGDAIPSGKLPVTMYPSNYV----TSVDMKSM 597
Query: 607 ATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVH 666
+ + PGR+YR+Y+ VFPFG GLSY++++ SSS ++ +V
Sbjct: 598 SMTKYPGRSYRYYKEVPVFPFGWGLSYTRFTMAL-----------DSSSGVTDPSEPIVV 646
Query: 667 YKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPAR---RGNGRPI-KQLVGF 722
+ + + VT+ + N G + G V F +P + GN + +QL +
Sbjct: 647 TRQLDQ-----------TVTVILSNDGNLVGDEVVFAFFRPLKVNATGNAALLNEQLFDY 695
Query: 723 QSVILNAKEKAEIVFEL 739
+ V L + ++ F +
Sbjct: 696 RRVSLRPTQYRKLKFRI 712
>gi|115397385|ref|XP_001214284.1| hypothetical protein ATEG_05106 [Aspergillus terreus NIH2624]
gi|114192475|gb|EAU34175.1| hypothetical protein ATEG_05106 [Aspergillus terreus NIH2624]
Length = 776
Score = 411 bits (1057), Expect = e-112, Method: Compositional matrix adjust.
Identities = 261/735 (35%), Positives = 391/735 (53%), Gaps = 39/735 (5%)
Query: 36 SNPSTETFPFCKT-----TLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWW 90
S P E P KT + RA LVS TL+E ++ N+ +PRLG+P Y+ W
Sbjct: 66 SFPDCENGPLSKTLVCDKSARPHDRAAALVSMFTLEELVNNTGNTGTGVPRLGLPKYQVW 125
Query: 91 SEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ 150
SE+LHGV G ATSFPQ ILT A+ + L ++IG + +ARA N G+
Sbjct: 126 SESLHGVYRANWAS--EGDYSWATSFPQPILTMAALNRTLIHQIGDILSTQARAFSNVGR 183
Query: 151 AIGMTFWAPNINIFRDPRWGRGQETPGEDPL-VTGKYAVSYVRGVQGDTFNGGKLKGKLQ 209
G+ +APNIN FR P WGRGQETPGED + YA Y+ G+QG G L+
Sbjct: 184 -YGLDTYAPNINSFRHPVWGRGQETPGEDAYYLASTYAYEYITGIQG-----GVDPETLK 237
Query: 210 ASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRV 269
A KH+ YD++NW G +R D ++T QDL++ Y P F + + +MC+YN V
Sbjct: 238 LVATAKHYAGYDIENWDGHSRLGNDMQITQQDLSEYYTPQFLVSARDAKVHSVMCSYNAV 297
Query: 270 NGIPSCADRNLLSKTARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGM 327
NG+PSC++ L R +GF GY++ DC AV ++ YA + A D ++AG
Sbjct: 298 NGVPSCSNSFFLQTLLRETFGFVEDGYVSGDCGAVYNAFNPHEYAANESSASADSIRAGT 357
Query: 328 DVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVV 387
D++CG+ Q H A + ++ +I+R + L++ +RLG F+GN + Q +DV
Sbjct: 358 DIDCGTSYQYHFTNAFDEGEISRQDIERGVIRLYTNLVRLGYFDGN-SSQYRDLTWSDVQ 416
Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
+ A + + +AA +G VLLKN G LPL S S+ALIGP AN+ + GNY GP+
Sbjct: 417 TTDAWNI-SHEAAVEGTVLLKND-GTLPLADSIR-SVALIGPWANATTQMQGNYYGPAPY 473
Query: 448 SITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELD 507
+PL AL+ + Y G + + ++A A+ A+ AD ++ G+D T E E LD
Sbjct: 474 LTSPLAALEASDLDVHYAFGTNISSTTTAGFADALAAARKADAIIFAGGIDNTIEGEALD 533
Query: 508 RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGA 567
R+++ PG Q +LI +++ A KP++++ + GG VD + K++ N+ ++LW GYPG++G
Sbjct: 534 RMNITWPGNQLDLINQLS-ALGKPLVVLQMGGGQVDSSALKHNTNVSALLWGGYPGQSGG 592
Query: 568 VALAEVIFGDHNPGGRLPMTWYPQDY-IKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFP 626
AL ++I G P GRL T YP Y + P DM +RP T NPG+TY +Y G V+
Sbjct: 593 TALLDIIRGVRAPAGRLVTTQYPAGYATQFPAIDMGLRPNGT--NPGQTYMWYTGTPVYE 650
Query: 627 FGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFL-V 685
FG GL Y+ + K + + N N +D++ + P G + + R FL
Sbjct: 651 FGHGLFYTTFEAKRASTATNHSSFNI---------EDLL---TAPHPGYAYPQLRPFLNF 698
Query: 686 TIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSV-ILNAKEKAEIVFELSPCES 744
T + N G + +LF P K LVGF + L + F ++ ++
Sbjct: 699 TAHITNTGRTTSDYTAMLFANTTAGPAPHPNKWLVGFDRLGALEPGASQTMTFPIT-IDN 757
Query: 745 LSRAREDGLMVIEEG 759
++R E G V+ G
Sbjct: 758 VARTDELGNRVLYPG 772
>gi|301118693|ref|XP_002907074.1| glycoside hydrolase, putative [Phytophthora infestans T30-4]
gi|262105586|gb|EEY63638.1| glycoside hydrolase, putative [Phytophthora infestans T30-4]
Length = 809
Score = 410 bits (1054), Expect = e-111, Method: Compositional matrix adjust.
Identities = 264/737 (35%), Positives = 389/737 (52%), Gaps = 75/737 (10%)
Query: 30 PFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPR-----LGI 84
P +C S+P + F FC +L ++R DL+ RL LDEK++ L +A A P+ +G+
Sbjct: 24 PRAC--SSPEHQQFAFCNASLSTAERVEDLLRRLPLDEKVTLL--TARASPKGNMSSIGL 79
Query: 85 PAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA 144
P Y W + +HGV GT ATSFP + A FD + + Q +G E RA
Sbjct: 80 PEYNWGANCVHGVQSTC------GT-NCATSFPNPVNLGAIFDPRAVFDMAQVVGWELRA 132
Query: 145 L--------YNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG 196
L Y G +G+ W+PNINI RDPRWGR ETP EDPLV KY V+Y +G+Q
Sbjct: 133 LWLEGARENYATGPHLGLDCWSPNININRDPRWGRNMETPSEDPLVNSKYGVAYTKGLQE 192
Query: 197 DTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQ 256
GK K LQA KH+ AY +++ G R F+A V+ D ADTY P FE+ V
Sbjct: 193 -----GKDKRFLQAVVTLKHYAAYSYEHYDGIDRMAFNAVVSRYDFADTYLPAFEASVVH 247
Query: 257 GRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE 316
G+A G+MC+YN VNG+P CA+ L SK R GF GYITSD A++ IY Y K+
Sbjct: 248 GKAKGVMCSYNSVNGMPMCANEQLNSKLLRDALGFDGYITSDSGAIAGIYHQRHYTKTLC 307
Query: 317 DAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTM 376
+A + +G DVN GS ++ V +LPE +D A+ +R LGLF+
Sbjct: 308 EAGRLAILSGTDVNSGSVYKQCLAELVTSGQLPEKAVDDAMRRTLKLRFELGLFDPIDD- 366
Query: 377 QPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKT 436
QP+ + + V + + L+L ++ IVLL+N +LPL K K LA+IGP+A + +
Sbjct: 367 QPYWHVAPNEVNTAESKQLSLDLSRKSIVLLQNHGNILPLAKGK--KLAVIGPHAAAKRA 424
Query: 437 LLGNYAGPSCRS--------ITPLQALQ--NYVENTVYYPGCDTVACSSASIDKAVDIAK 486
LLGNY G C TPL+A+ N NT+Y G S+ D+A A+
Sbjct: 425 LLGNYLGQMCHGDYLEVGCVQTPLEAITIANGASNTLYAKGSGINDTSTGGFDEAEAAAR 484
Query: 487 GADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITF 546
A+ VVL +G+D + E+E DR ++ +P Q +L+ RV A KP ++VL GG V
Sbjct: 485 KAETVVLFLGIDTSIEREAWDRENIDMPNIQMQLLKRVRRAG-KPTVVVLFNGGVVGA-- 541
Query: 547 AKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQ 606
+ + ++ A YPG GA A+++++FGD P G+LP+T YP +Y+ T + M+
Sbjct: 542 EELILHTDGVVEAFYPGFFGAQAVSDILFGDAIPSGKLPVTMYPSNYV----TSVDMKSM 597
Query: 607 ATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVH 666
+ + PGR+YR+Y+ VFPFG GLSY++++ SSS ++ +V
Sbjct: 598 SMTKYPGRSYRYYKEVPVFPFGWGLSYTRFTMAL-----------DSSSGVTDPSEPIVV 646
Query: 667 YKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPAR---RGNGRPI-KQLVGF 722
+ + + VT+ + N G + G V F +P + GN + +QL +
Sbjct: 647 TRQLDQ-----------TVTVILSNDGNLVGDEVVFAFFRPLKVNATGNAALLNEQLFDY 695
Query: 723 QSVILNAKEKAEIVFEL 739
+ V L + ++ F +
Sbjct: 696 RRVSLRPTQYRKLKFRI 712
>gi|326791674|ref|YP_004309495.1| beta-glucosidase [Clostridium lentocellum DSM 5427]
gi|326542438|gb|ADZ84297.1| Beta-glucosidase [Clostridium lentocellum DSM 5427]
Length = 696
Score = 410 bits (1054), Expect = e-111, Method: Compositional matrix adjust.
Identities = 262/716 (36%), Positives = 372/716 (51%), Gaps = 105/716 (14%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
++A+ LV+ +TL+E+ SQL +PAI RLG+PAY WW+EALHGVA G A
Sbjct: 8 KKAKALVAEMTLEERASQLKYDSPAIKRLGVPAYNWWNEALHGVARAGV----------A 57
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTFWAPNINIFRD 166
TSFPQ I AA+FD L R+ + I E RA YNA G+TFW+PN+NIFRD
Sbjct: 58 TSFPQAIGMAATFDDELLKRVAEVIAEEGRAKYNAYSQEGDRDIYKGLTFWSPNVNIFRD 117
Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
PRWGRG ET GEDP +T + V++V+G+QG+ L+ +AC KHF + +
Sbjct: 118 PRWGRGHETYGEDPYLTSRLGVAFVKGLQGEE--------GLKTAACAKHFAVH---SGP 166
Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
R+ FDARV+ +DL +TY P FE+ VK+ +M AYNR NG P C L+ R
Sbjct: 167 EADRHHFDARVSQKDLWETYLPAFEALVKEAEVESVMGAYNRTNGEPCCGSPTLMKDILR 226
Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTKAAVKQ 345
+WGF G+ SDC A+ ++ + +++ LK+G D+NCG ++L H A +
Sbjct: 227 EKWGFQGHYVSDCWAIKDFHEHHMVTSTAQESAALALKSGCDLNCGNTYL--HILMAYQN 284
Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
+ E EI A LF+ R LGLF+G+ + I +VV S H +A +A IV
Sbjct: 285 GLVTEEEITTAAERLFTTRYLLGLFDGS----TYDAIPYEVVESKPHLSVADEATAKSIV 340
Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV-- 463
LLKN+ GLLPL K ++ +IGPNANS K L+GNY G S + IT L+ LQ V + V
Sbjct: 341 LLKNN-GLLPLNKESIKTIGVIGPNANSRKALIGNYHGTSSQYITILEGLQKEVGDEVRI 399
Query: 464 --------YYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---------L 506
Y + +A + +A +AK +D V++ +GLD+T E EE
Sbjct: 400 LYSEGSHLYADRVEPLAYQRDRLSEAKIVAKHSDVVIVCVGLDETLEGEEGDTGNAYASG 459
Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
D+ DL LP QQEL+ +A+ KPVIL L G +D+ +A D + ++L A YPG G
Sbjct: 460 DKRDLALPEPQQELVEAMAKMG-KPVILCLSAGSAIDLQYA--DAHYDAVLQAWYPGARG 516
Query: 567 AVALAEVIFGDHNPGGRLPMTWYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFYEGKEVF 625
+A+ + G+ P G+LP+T+Y +D +P D M+ GRTYR+ + + ++
Sbjct: 517 GQVIAKALLGEIVPSGKLPVTFY-RDLSGLPAFEDYSMQ--------GRTYRYMQEEALY 567
Query: 626 PFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLV 685
PFG GL+Y K + + Q L
Sbjct: 568 PFGYGLTYGKCRIEEASYDQGSL------------------------------------- 590
Query: 686 TIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSP 741
+ V N + + V L++K P L GF+ V L A E EI +SP
Sbjct: 591 RVLVHNEVDFKLEEVVQLYIKNLDSEFAVPNHSLCGFKRVSLEAGETKEIQINVSP 646
>gi|310792973|gb|EFQ28434.1| glycosyl hydrolase family 3 N terminal domain-containing protein
[Glomerella graminicola M1.001]
Length = 728
Score = 410 bits (1054), Expect = e-111, Method: Compositional matrix adjust.
Identities = 257/720 (35%), Positives = 378/720 (52%), Gaps = 49/720 (6%)
Query: 63 LTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFF---NGTIRGATSFPQV 119
++++EK+ LV+++ + LG+P + WW+E LHGV G G+ F + ATSFP
Sbjct: 1 MSVEEKVRNLVDASAGVKSLGLPPHGWWNEGLHGV-GFSPGVLFAQDSEPFGYATSFPLP 59
Query: 120 ILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGED 179
ILTAASFD L+ IGQ IG E RA N G A G FW PN+N FRDPRWGRGQETPGED
Sbjct: 60 ILTAASFDDDLFNAIGQVIGREGRAFSNYGYA-GFNFWTPNMNAFRDPRWGRGQETPGED 118
Query: 180 PLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTM 239
LV Y SYV G+QG + A CKHF AYD++ T R + T
Sbjct: 119 VLVVSNYVQSYVTGLQGSDPTDKVII------AACKHFAAYDIE----TARRANNYNPTQ 168
Query: 240 QDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGF---HGYIT 296
QDL D Y P F CV+ +MC+YN V+GIP+C+ LL + R WGF + ++
Sbjct: 169 QDLQDYYLPAFRRCVRDSHVGTVMCSYNSVDGIPACSSEYLLKEVLRDTWGFTNDYQFVV 228
Query: 297 SDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRA 356
SDC AV+ ++ + + +DA + AG D+ CGS H ++ K++ + +D A
Sbjct: 229 SDCGAVTDVWLLHNFTNTEQDAASVSMAAGTDLECGSSY-LHLNGSLADKQVTQERVDEA 287
Query: 357 LHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPL 416
L L+ +G F+G+ +G V + Q +A +AA+ G+ LLKN G+LPL
Sbjct: 288 LTRLYKALFTVGYFDGS----SHSSLGWSDVSTIDAQQIACEAARAGMTLLKND-GVLPL 342
Query: 417 PKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL-QNYVENTVYYPGCDTVACSS 475
K S+ALIGP AN+ + GNY G + +PL A Q Y G D + S
Sbjct: 343 ADGKYKSVALIGPFANATTQMQGNYFGRAPFVRSPLWAFTQQSSLQVNYAAGTDINSTSD 402
Query: 476 ASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILV 535
+ A+ AK +D V+ G+D T E E LDRV + PG Q +LI++++ KP+++
Sbjct: 403 SGFADALAAAKNSDIVIFCGGIDTTIEAETLDRVSITWPGNQLDLISQLSMLG-KPLVVA 461
Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK 595
GG VD T + N+ ++ WAG PG+AG +A+ +++ G + GRLP T YP Y
Sbjct: 462 QFGGGQVDDTALVDNANVNALFWAGLPGQAGGLAMYDLVVGKASFAGRLPTTQYPASYAD 521
Query: 596 -VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSS 654
V + ++ +RP T PGRTY++Y G+ VFPFG GL Y+K+++ +K + ++
Sbjct: 522 LVSIFNINLRPNGTF--PGRTYKWYIGEPVFPFGFGLHYTKFNFTWKDTLEPTYDISNII 579
Query: 655 STKMVENQDVVHYKSVPELGTEFCETRKFL-VTIGVKNHGEMAGKHPVLLFVKPARRGN- 712
S +N V +T F V + VKN G + + LLF+ G
Sbjct: 580 SWARSQNNGHV------------TDTTPFTSVNVTVKNVGNVRSDYVGLLFLSSKNAGPV 627
Query: 713 GRPIKQLVGF---QSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEE 769
RP K L + + A ++ + L S +R+ G + I G + L + +++
Sbjct: 628 PRPNKSLASYSRAHDIETGASDQLTLKLTLG---SFARSDSQGNLTIFPGDYKLELDNDK 684
>gi|320161274|ref|YP_004174498.1| beta-D-xylosidase [Anaerolinea thermophila UNI-1]
gi|319995127|dbj|BAJ63898.1| beta-D-xylosidase [Anaerolinea thermophila UNI-1]
Length = 712
Score = 409 bits (1052), Expect = e-111, Method: Compositional matrix adjust.
Identities = 261/736 (35%), Positives = 379/736 (51%), Gaps = 82/736 (11%)
Query: 51 PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
P+ +R DL+SR+TL+EKISQ+ NS AIPRLGIPAY++WSEALHGVA GK
Sbjct: 14 PLEERVNDLISRMTLEEKISQMCNSCAAIPRLGIPAYDYWSEALHGVARNGK-------- 65
Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-----GQA---IGMTFWAPNIN 162
AT FPQ I AA++D+ L R+ AI EARA ++ G+ G+T W+PNIN
Sbjct: 66 --ATVFPQAIGMAATWDTELIERVADAIASEARAKFHETLRKFGKTDIYQGLTMWSPNIN 123
Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL 222
IFRDPRWGRGQET GEDP +TG+ ++VRG+Q GK L+ +AC KH+T +
Sbjct: 124 IFRDPRWGRGQETWGEDPYLTGEMGAAFVRGLQ------GKDPHYLKTAACAKHYTVH-- 175
Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
+ R+ F+A VT ++L DTY P F+ V + + +M AYNR G P C LL
Sbjct: 176 -SGPEKERHTFNAIVTRRELFDTYLPAFKKLVTEAKVEAVMGAYNRTLGEPCCGSPYLLK 234
Query: 283 KTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC-GSFLQKHTKA 341
+ R QWGF G++ SDC A++ + K ++ +K G D+ C ++ ++
Sbjct: 235 EILRNQWGFKGHVVSDCGAINDFHLHHQVTKDGAESAALGIKNGCDMACICTYSYENLTE 294
Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQ 401
A+ + + E +ID AL N R +LGLF+ + P+ I VV AH+ LA + A
Sbjct: 295 ALNRGLITEEDIDHALRNTLRTRFKLGLFDPQEKV-PYAHISMSVVGCEAHRKLAYETAV 353
Query: 402 DGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN 461
VLLKN + +LP+ K S+ ++GPNA + LLGNY G S T ++ L +
Sbjct: 354 KSAVLLKNHNHILPV-KPDVKSILIVGPNAGNVHVLLGNYYGLSDSMTTFMEGLVGRLPE 412
Query: 462 TV---YYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE--------LDRVD 510
V + PG D +V A D V+ MGL E EE DR D
Sbjct: 413 GVRMEFMPGSLLTDSKKIKNDWSVASAASFDLVIAFMGLSPLLEGEEGEAILSDNGDRED 472
Query: 511 LVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVAL 570
+ LP QQE I +A K I+++L GG I + + +ILW GYPG+ G A+
Sbjct: 473 IALPKAQQEYIRDLAATGAK--IVLVLTGGSA-IALNGIEDLVEAILWVGYPGQEGGRAI 529
Query: 571 AEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCG 630
A++IFGDH+P G+LP+T+ P++ ++ P RTYR+ +FPFG G
Sbjct: 530 ADLIFGDHSPSGKLPITF--------PVSTDQLPPFREYSMKERTYRYMTSSPLFPFGFG 581
Query: 631 LSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVK 690
LSY+++ YK + L ++ T +
Sbjct: 582 LSYTQFEYKNLQLEHPVLSAGEA-----------------------------LRGTFELA 612
Query: 691 NHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRARE 750
N GE G+ V +++ P+++L+ FQ V L E ++ F + P E++ +
Sbjct: 613 NVGEYEGEEVVQVYLSDLEASTIVPLQKLISFQRVRLKPGETVQLSFAIQP-EAMMMIDD 671
Query: 751 DGLMVIEEGTHFLVVG 766
+G V+E G L +G
Sbjct: 672 EGNQVLEPGKFKLTIG 687
>gi|348684872|gb|EGZ24687.1| family 3 glycoside hydrolase [Phytophthora sojae]
Length = 805
Score = 409 bits (1050), Expect = e-111, Method: Compositional matrix adjust.
Identities = 273/739 (36%), Positives = 385/739 (52%), Gaps = 83/739 (11%)
Query: 30 PFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPR-----LGI 84
P +C S+ + FPFC +L S+R DL+ RL LDEK++ L +A A P+ +G+
Sbjct: 23 PRAC--SDTEHQKFPFCDASLSTSERVEDLLRRLPLDEKVTLL--TARASPKGNMSSIGL 78
Query: 85 PAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA 144
P Y W + +HGV GT ATSFP + A FD + + Q IG E RA
Sbjct: 79 PEYNWGANCVHGVQSTC------GT-NCATSFPNPVNLGAIFDPQAVFDMAQVIGWELRA 131
Query: 145 L--------YNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG 196
L Y AG +G+ W+PNINI RDPRWGR ETP EDPLV KY V+Y RG+Q
Sbjct: 132 LWLEGARENYAAGPHLGLDCWSPNININRDPRWGRNMETPSEDPLVNSKYGVAYTRGLQE 191
Query: 197 DTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQ 256
GK K LQA KH+ AY +++ G R F+A+V+ D ADTY P F + V +
Sbjct: 192 -----GKDKRFLQAVVTLKHYAAYSYEHYDGIDRMAFNAQVSRYDFADTYLPAFHASVVE 246
Query: 257 GRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE 316
G+A G+MC+YN VNG+P CA+ L +K R GF GYITSD A+ IY Y KS
Sbjct: 247 GKAKGVMCSYNSVNGMPMCANEQLNTKLLREALGFDGYITSDSGAIEGIYRQRHYTKSLC 306
Query: 317 DAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTM 376
+A + +G DVN GS +K V +LPE +D A+ +R LGLF+
Sbjct: 307 EAGRLAIMSGTDVNSGSVYKKCLADLVTSGQLPEKAVDDAMRRTLKLRFELGLFDPIDD- 365
Query: 377 QPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKT 436
QP+ + V + L+L+ + IVLL+N +LPL K K LA+IGP+A + +
Sbjct: 366 QPYWHVAPSEVGKTESKQLSLELTRKSIVLLQNHGNVLPLRKGK--KLAVIGPHAKAKRA 423
Query: 437 LLGNYAGPSCRS--------ITPLQALQ--NYVENTVYYPGCDTVACSSASIDKAVDIAK 486
LLGNY G C TPL+A+ N NTVY G S+A D A A+
Sbjct: 424 LLGNYLGQMCHGDYLEVGCVQTPLEAITAANGASNTVYAKGSGINDTSTADFDAAEAAAR 483
Query: 487 GADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITF 546
GAD VVL +G+D + E+E DR ++ +P Q +L+ RV A KP ++VL GG V
Sbjct: 484 GADAVVLFLGIDTSIEREAWDRENIDMPNIQMQLLKRVRRAG-KPTVVVLFNGGVVGA-- 540
Query: 547 AKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQ 606
+ + + A YPG GA A+++++FGD P G+LP+T YP +YI + M+
Sbjct: 541 EELILHTDGVAEAFYPGFFGAQAVSDILFGDAIPSGKLPVTMYPSNYIN----SVDMKSM 596
Query: 607 ATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVH 666
+ + PGR+YR+Y+ VFPFG GLSY+K++ +M ++ V+
Sbjct: 597 SMTKYPGRSYRYYKEVPVFPFGWGLSYTKFTLALDG--------------EMPDDPIVI- 641
Query: 667 YKSVPELGTEFCETRKF--LVTIGVKNHGEMAGKHPVLLFVKPAR---RGNGRPI-KQLV 720
TR VT+ V N G++ G V F +P G+ + +QL
Sbjct: 642 -------------TRDLDQTVTVIVSNDGDLVGDEVVFAFFRPLNVNATGDAALLNEQLF 688
Query: 721 GFQSVILNAKEKAEIVFEL 739
++ V L + ++ F +
Sbjct: 689 DYRRVSLRPTQYRKLTFRI 707
>gi|295134875|ref|YP_003585551.1| beta-glucosidase [Zunongwangia profunda SM-A87]
gi|294982890|gb|ADF53355.1| beta-glucosidase [Zunongwangia profunda SM-A87]
Length = 735
Score = 409 bits (1050), Expect = e-111, Method: Compositional matrix adjust.
Identities = 264/785 (33%), Positives = 403/785 (51%), Gaps = 99/785 (12%)
Query: 5 KLSLVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLT 64
K +F ++ + F + + T+ D S F F T L + +R DL+SRLT
Sbjct: 3 KYRQLFAIMLMLFLTAIMHAQQTK----IDKSE-----FDFYDTDLSMDERIDDLISRLT 53
Query: 65 LDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAA 124
L+EK Q++N++PAI RLGIPAY+WW+EALHG+ G AT FPQ I A
Sbjct: 54 LEEKAQQMLNASPAIERLGIPAYDWWNEALHGLGRSGV----------ATVFPQAIGMGA 103
Query: 125 SFDSYLWYRIGQAIGLEARALYNAG-------QAIGMTFWAPNINIFRDPRWGRGQETPG 177
+FD L ++ AI EARA +N + G+TFW PN+NIFRDPRWGRGQET G
Sbjct: 104 TFDDDLILKVSTAISDEARANFNNAVKHGYHRKYGGLTFWTPNVNIFRDPRWGRGQETYG 163
Query: 178 EDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARV 237
EDP +T K ++V+G+QGD L+ +A KH+ + + R++F+A V
Sbjct: 164 EDPYLTSKLGEAFVKGLQGDN------DKYLKTAAAAKHYAVH---SGPEKLRHEFNADV 214
Query: 238 TMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITS 297
+ +DL +TY P F++ V IMCAYN NG P CA+ L++ R +WGF+G++ S
Sbjct: 215 SEKDLWETYLPAFKTLV-DANVETIMCAYNSTNGEPCCANNRLINDILRDKWGFNGHVVS 273
Query: 298 DCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG---SFLQKHTKAAVKQKKLPESEID 354
DC A+ +SPE A ++ G+++NCG +FL K AV+ + E +D
Sbjct: 274 DCWALQDFVSGHDIVESPEAAAALAVEVGIELNCGDTYNFLAK----AVEDGLVSEELVD 329
Query: 355 RALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLL 414
+ LH L R +LGLF+ + P+ KIG +V+ S H+ LA + A+ IVLLKN G+L
Sbjct: 330 KRLHKLLETRFKLGLFDPEES-NPYNKIGVEVMNSDEHRALARETARKSIVLLKND-GVL 387
Query: 415 PLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE---NTVYYPGCDTV 471
PL K+ + GPNA + + LLGNY G + +T L+ + ++ Y G
Sbjct: 388 PL-KNNLSKYFITGPNATNIEVLLGNYHGVNPDMVTVLEGIAKAIKPESQLQYRMGTRLN 446
Query: 472 ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---------LDRVDLVLPGRQQELIT 522
+ D A A +D ++MG+ E EE DR+D LP Q + +
Sbjct: 447 LPNENPQDWASPNAGNSDATFVVMGISGLLEGEEGESIASPTFGDRMDYNLPQNQIDYLQ 506
Query: 523 RVAEAAK-KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
+V+EAA+ +PV+ ++ G P+++T + + ++L YPGE G A+A++IFG ++P
Sbjct: 507 KVSEAAEDRPVVAIVTGGSPMNLT--EVHKLADAVLLVWYPGEEGGNAVADIIFGKNSPS 564
Query: 582 GRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFK 641
GRLP+T+ PMT + GRTY++ + ++PFG GLSY+ + Y
Sbjct: 565 GRLPITF--------PMTIEDLPAYEDYTMEGRTYKYMDVVPMYPFGYGLSYTDFEYS-- 614
Query: 642 AVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPV 701
+ E E R I V N G+ V
Sbjct: 615 ----------------------EIKLSKDKIKKKESVEAR-----ISVTNTGDFEADEVV 647
Query: 702 LLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTH 761
+++K + + P +LV F+++ L E E+ FE++P E LS ++G +E+G
Sbjct: 648 QVYLKDVKASSRVPNFELVAFKNIHLKRGESKELTFEITP-EMLSFIDDNGKEKLEKGAF 706
Query: 762 FLVVG 766
+ +G
Sbjct: 707 EIYIG 711
>gi|310797011|gb|EFQ32472.1| glycosyl hydrolase family 3 N terminal domain-containing protein
[Glomerella graminicola M1.001]
Length = 767
Score = 408 bits (1049), Expect = e-111, Method: Compositional matrix adjust.
Identities = 246/599 (41%), Positives = 339/599 (56%), Gaps = 22/599 (3%)
Query: 46 CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
C TL +RA LV LT++EK+ LV+ A PR+G+PAY WWSEALHGVA F
Sbjct: 43 CDRTLSPPERAAALVKALTVEEKLQNLVSKAQGAPRIGLPAYNWWSEALHGVAYAPGTYF 102
Query: 106 FNGTIR--GATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
G + +TS+P +L AA+FD L +IG AIG+EARA NAG A G+ +W PN+N
Sbjct: 103 PEGDVEFNSSTSYPMPLLMAAAFDDELIEQIGAAIGIEARAWGNAGWA-GLDYWTPNVNP 161
Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
F+DPRWGRG ETPGED L +YA RG+ G + + + CKH+ D +
Sbjct: 162 FKDPRWGRGSETPGEDVLRVKRYAEYITRGLDGPVPGEQR-----RVISTCKHYAGNDFE 216
Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
+W GT+R+ FDA++T QDLA+ Y PF+ C + + IMCAYN VNG+PSCA+ LL
Sbjct: 217 DWNGTSRHDFDAKITAQDLAEYYLMPFQQCARDSKVGSIMCAYNAVNGVPSCANEYLLQN 276
Query: 284 TARRQWGF---HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTK 340
R W + + Y+TSDC+AV + YA + +AGMD +C
Sbjct: 277 ILREHWNWTEHNNYVTSDCEAVLDVSANHKYAPTNAAGTAICFEAGMDTSCEYTGSSDIP 336
Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
A Q L E +DRAL L+ +R G F+G+ + + K+G V S Q LALQAA
Sbjct: 337 GAWSQGLLKEETVDRALLRLYEGLVRAGYFDGHEAI--YAKLGWKDVNSAEAQSLALQAA 394
Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE 460
+GIVLLKN +G LPL S +A+IG A++ L G Y+G + TP A +
Sbjct: 395 VEGIVLLKN-NGTLPLDLKPSHKVAMIGFWADAPDKLQGGYSGRAAHLHTPAYAARQLGL 453
Query: 461 NTVYYPG--CDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQ 518
+ G S A++ A+GAD+++ GLD + E LDR DL P Q
Sbjct: 454 DITLASGPVLQRNNASDNWTAAALEAAEGADYILYFGGLDTSAAGETLDRTDLEWPEAQL 513
Query: 519 ELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDH 578
LI +++ A KP+++ LL D + D + SILWA +PG+ G VA+ ++I G+
Sbjct: 514 MLIKKLS-ALGKPLVVNLLGDQLDDTPLLQLDE-VSSILWANWPGQDGGVAIMKLITGEK 571
Query: 579 NPGGRLPMTWYPQDYIK-VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKY 636
+P GRLP+T YP +Y +PMT M +RP TS PGRTYR+Y+ K + FG GL Y+ +
Sbjct: 572 SPAGRLPVTQYPSNYTDLIPMTSMDLRP--TSQYPGRTYRWYD-KPIKRFGFGLHYTTF 627
>gi|224068498|ref|XP_002302758.1| predicted protein [Populus trichocarpa]
gi|222844484|gb|EEE82031.1| predicted protein [Populus trichocarpa]
Length = 462
Score = 407 bits (1045), Expect = e-110, Method: Compositional matrix adjust.
Identities = 210/462 (45%), Positives = 295/462 (63%), Gaps = 25/462 (5%)
Query: 324 KAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIG 383
+A +D++CG FL +HT+ AV++ L E+EI+ AL N +V+MRLG+F+G P+ +P+G +G
Sbjct: 5 QASLDLDCGPFLGQHTEDAVRKGLLTEAEINNALLNTLTVQMRLGMFDGEPSSKPYGNLG 64
Query: 384 ADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG 443
VC+PAHQ LAL+AA+ GIVLLKN LPL S+A+IGPN+N T++GNYAG
Sbjct: 65 PTDVCTPAHQELALEAARQGIVLLKNHGPPLPLSTRHHQSVAIIGPNSNVTVTMIGNYAG 124
Query: 444 PSCRSITPLQALQNYVENTVYYPGCDTVAC-SSASIDKAVDIAKGADHVVLMMGLDQTQE 502
+C TPLQ + Y + T+Y GC VAC S A+D A+ AD VL+MGLDQ+ E
Sbjct: 125 VACGYTTPLQGIGRYAK-TIYQQGCADVACVSDQQFVAAMDAARQADATVLVMGLDQSIE 183
Query: 503 KEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYP 562
E DR +L+LPGRQQELI++VA A+K P ILVL+ GGP+D++FA+ D IG I+WAGYP
Sbjct: 184 AESRDRTELLLPGRQQELISKVAAASKGPTILVLMSGGPIDVSFAENDPKIGGIVWAGYP 243
Query: 563 GEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEG 621
G+AG A+++V+FG NPGG+LPMTWYPQDY+ +PMT+M MRP ++G PGRTYRFY+G
Sbjct: 244 GQAGGAAISDVLFGTTNPGGKLPMTWYPQDYVTNLPMTNMAMRPSKSNGYPGRTYRFYKG 303
Query: 622 KEVFPFGCGLSYSKYSYKFKAV---------SQNKLYLNQSSSTKMVENQDVVHYKSVPE 672
K V+PFG G+SY+ + + + + N + S K + V H +
Sbjct: 304 KVVYPFGHGISYTNFVHTIASAPTMVSVPLDGHRQASRNATISGKAIR---VTHAR---- 356
Query: 673 LGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEK 732
C F V + VKN G M G H +L++ KP G+ P+KQLV F+ V + A +
Sbjct: 357 -----CNRLSFGVQVDVKNTGSMDGTHTLLVYSKPP-AGHWAPLKQLVAFEKVHVAAGTQ 410
Query: 733 AEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
+ + C+ LS G+ I G H L +GD ++ +S+
Sbjct: 411 QRVGINVHVCKFLSVVDRSGIRRIPMGAHSLHIGDVKHSVSL 452
>gi|280977785|gb|ACZ98610.1| glucosidase [Cellulosilyticum ruminicola]
Length = 711
Score = 406 bits (1044), Expect = e-110, Method: Compositional matrix adjust.
Identities = 260/743 (34%), Positives = 383/743 (51%), Gaps = 90/743 (12%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
A++LV ++ L EK SQL APAI RLGIP Y WW+EALHGVA G A
Sbjct: 6 NEAKELVRQMDLLEKASQLRYDAPAIKRLGIPTYNWWNEALHGVARAGV----------A 55
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-------GMTFWAPNINIFRD 166
T FPQ I AA FD I I +E RA YN GMTFWAPNINIFRD
Sbjct: 56 TVFPQAIGLAAMFDEEKLGEIADIIAIEGRAKYNQFSQKEDRDIYKGMTFWAPNINIFRD 115
Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
PRWGRG ET GEDP +T + V++++G+QGD + + L+A+AC KHF + +
Sbjct: 116 PRWGRGHETYGEDPYLTARLGVAFIKGLQGD-----ENEDYLKAAACAKHFAVH---SGP 167
Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
R+ FDA V+ +DL +TY P FE+ VK+ G+M AYNRVNG P+C + LL +
Sbjct: 168 EEDRHHFDAIVSKKDLYETYLPAFEAAVKEANVIGVMGAYNRVNGEPACGSKTLLVDILK 227
Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGS----FLQKHTKAA 342
+ WGF GYI SDC A+ + + ++ + G ++NCG+ L+ H +
Sbjct: 228 KDWGFDGYIVSDCWAIRDFHTEHMVTHTAAESAALAINNGCELNCGNTYLHMLEAHQEGL 287
Query: 343 VKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQD 402
VK++ + E A L +RM+LGLF+ N + +I V H+ +AL+A++
Sbjct: 288 VKEEIITE-----AAEKLMRIRMQLGLFDKNC---KYNEIPYAVNDCKVHREVALEASRR 339
Query: 403 GIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENT 462
+V+LKN G+LPL K K S+ +IGP AN+ L GNY G + R T ++ +Q+YV +
Sbjct: 340 SMVMLKND-GILPLNKDKLKSIGIIGPTANNRTVLEGNYNGTASRYTTFVEGIQDYVGDD 398
Query: 463 V---YYPGCDTVACSSASI-------DKAVDIAKGADHVVLMMGLDQTQEKEE------- 505
V Y GC A +++ +A+ +A+ +D VVL +GLD T E E+
Sbjct: 399 VRVYYSEGCHLFANGMSNLAWENDREAEALIVAEQSDVVVLCLGLDSTIEGEQGDTGNAF 458
Query: 506 --LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPG 563
D++ L L GRQQ+L+ +V A KPVILVL G + I +A D + +I YPG
Sbjct: 459 AGGDKLSLNLIGRQQQLLEKVV-AVGKPVILVLSTGSAMAINYA--DEHCNAIFQTWYPG 515
Query: 564 EAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKE 623
G ALA+++FG+++P G+LP+T+Y D M+ RTYR+ +
Sbjct: 516 AQGGKALAQLLFGEYSPSGKLPVTFYKTTEELPAFEDYSMKD--------RTYRYMPNEA 567
Query: 624 VFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKF 683
++PFG GLSY+ V++ V+ E+ K+
Sbjct: 568 LYPFGYGLSYADIK---------------------VQSVKVLDGAKGEEITNFSAGQTKY 606
Query: 684 LVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCE 743
V + ++N + V +++K P L F+SV L A E E+ + +
Sbjct: 607 KVKVELENKSNVDSYDVVQIYIKDMESQYAVPNFSLCSFKSVFLKAGESKEVTLNVGE-K 665
Query: 744 SLSRAREDGLMVIEEGTHFLVVG 766
+ + E+G +++ L +G
Sbjct: 666 AFTVINEEGKRIVDSKKFKLFIG 688
>gi|372208556|ref|ZP_09496358.1| beta-glucosidase [Flavobacteriaceae bacterium S85]
Length = 729
Score = 406 bits (1044), Expect = e-110, Method: Compositional matrix adjust.
Identities = 259/743 (34%), Positives = 383/743 (51%), Gaps = 82/743 (11%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
+ T+L +R LV +TL EKI+QL + +P + RL IP Y WW+EALHGVA GK
Sbjct: 26 WLDTSLTFEERIHHLVKAMTLKEKIAQLDSGSPEVKRLDIPEYNWWNEALHGVARNGK-- 83
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------MTFW 157
+T FPQ I AA+FD L ++ AI EARA +N Q+IG +TFW
Sbjct: 84 --------STVFPQAIGLAATFDPVLAKQVASAISDEARAKFNISQSIGNRGQYAGLTFW 135
Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
PN+NIFRDPRWGRGQET GEDP +T + V++V+G+QG+ L+++AC KHF
Sbjct: 136 TPNVNIFRDPRWGRGQETYGEDPYLTSQMGVAFVKGLQGNH------PKYLKSAACAKHF 189
Query: 218 TAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCAD 277
+ + R+ F+A + +DL +TY P FE+ VKQ G+M AYN V G+P+ +
Sbjct: 190 AVH---SGPEELRHHFNANPSKKDLYETYLPAFEALVKQANVEGVMSAYNAVYGVPAGSS 246
Query: 278 RNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQK 337
LL +T R+ WGF GYI SDC A+ I+ K+ +A LKAG+++NCG
Sbjct: 247 EFLLKETLRKSWGFDGYIVSDCGALGDIFKGHKQVKTMPEAAAVALKAGVNLNCGYVYNG 306
Query: 338 HTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLAL 397
+ AV+Q + E ID L L R +LG F+ P+ I V+ S H LA
Sbjct: 307 ALEKAVQQGLVSEELIDTRLKQLLKTRFKLGFFDPKEA-NPYNAIPTSVIHSDDHIALAR 365
Query: 398 QAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQN 457
+ AQ IVLLKN + LPL K+ V + GP A+S+ LL NY G + ++ L+ + +
Sbjct: 366 KTAQKSIVLLKNKNHTLPLDKNIKVPY-VTGPFASSSDVLLANYYGMTTNLVSVLEGIAD 424
Query: 458 YVE---NTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRV----- 509
V + Y G + + A ++AK AD V+ ++GL E EE+D +
Sbjct: 425 KVSLGTSLNYRMGALPFNKNLNPKNWAPNVAKTADAVIAVVGLSADFEGEEVDAIASPNK 484
Query: 510 ----DLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEA 565
DL LP Q + + +A K P+ILV+ G V + YD +I+ YPGE
Sbjct: 485 GDKKDLKLPQNQIDYVKEMAAKKKGPLILVVASGSAVALG-ELYDL-ADAIVLMWYPGEQ 542
Query: 566 GAVALAEVIFGDHNPGGRLPMTWYPQDYIKV-PMTDMKMRPQATSGNPGRTYRFYEGKEV 624
G A+A+V+FGD +P G LP+T +P+ ++ P D M+ GRTY++ E + +
Sbjct: 543 GGNAVADVLFGDVSPSGHLPVT-FPKSVAQLPPFEDYSMQ--------GRTYKYMEEEPL 593
Query: 625 FPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFL 684
FPFG GLSY+ + + +S+ K+ S F
Sbjct: 594 FPFGFGLSYTDFKFSNVQISEEKIKKKDS-----------------------------FT 624
Query: 685 VTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCES 744
V+ V N+G++ G+ V L++ P P QL+ F+ + + + F L +
Sbjct: 625 VSCSVANNGKVDGEEVVQLYLVPLNSNKDLPKYQLLKFKRIEIQKNTSKTVSFNLE-AKD 683
Query: 745 LSRAREDGLMVIEEGTHFLVVGD 767
L + ++G +G + LVV +
Sbjct: 684 LFQVNKEGKKTWIKGKYKLVVAN 706
>gi|291548352|emb|CBL21460.1| Beta-glucosidase-related glycosidases [Ruminococcus sp. SR1/5]
Length = 697
Score = 405 bits (1041), Expect = e-110, Method: Compositional matrix adjust.
Identities = 270/738 (36%), Positives = 381/738 (51%), Gaps = 101/738 (13%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
++A LV+R+TL+EK SQL APAI RLGIPAY WW+E LHGVA G+ A
Sbjct: 8 KKAEALVARMTLEEKASQLRYDAPAIKRLGIPAYNWWNEGLHGVARAGQ----------A 57
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTFWAPNINIFRD 166
T FPQ I AA+FD + + E RA YNA G+TFW+PN+NIFRD
Sbjct: 58 TVFPQAIGMAAAFDRKSVAEMAGIVATEGRAKYNAYSVNGDRDIYKGLTFWSPNVNIFRD 117
Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
PRWGRG ET GEDP +T + VS+V+ +QG NG +K A+AC KHF + +
Sbjct: 118 PRWGRGHETYGEDPYLTKELGVSFVKALQG---NGDTMK----AAACAKHFAVH---SGP 167
Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
R++FDA + +D+ +TY P FE VK+ + +M AYNR NG P C L K R
Sbjct: 168 EALRHEFDAEASAKDMEETYLPAFEGLVKEAKVEAVMGAYNRTNGEPCCGSPT-LQKKLR 226
Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTKAAVKQ 345
+W F G+ SDC A+ ++ + ++ + G D+NCG ++L H A ++
Sbjct: 227 GEWKFQGHFVSDCWAIRDFHEHHMVTDTAVESAALAINNGCDLNCGNTYL--HIMKAYEK 284
Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
+ E I RA LF+ R LGLF+G+ + + V SP H A +AA+ V
Sbjct: 285 GLVTEETITRAAVRLFTTRYLLGLFDGS----EYDNLSYMEVESPRHLDAAEKAAEKSFV 340
Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYY 465
LLKN +G+LPL K K ++ +IGPNA+S + L+GNY G + R IT + +Q+YV + V
Sbjct: 341 LLKN-NGILPLDKEKLKTIGIIGPNADSRQALIGNYHGTASRYITIQEGIQDYVGDDVRI 399
Query: 466 ---PGCDT-------VACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---------L 506
GCD +A + I +A +A+ +D V+L MGLD+T E EE
Sbjct: 400 LTSRGCDLFRDRTEHLAFTRDRIAEAKVVAENSDVVILCMGLDETLEGEEGDTGNSYVSG 459
Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
D+ D+ LPG Q+EL+ +A+ KPV+ LL G +D+ +A + +LW YPG G
Sbjct: 460 DKEDIELPGVQRELMEAIADTG-KPVVFCLLAGSDLDLKYAAEKFDAVMMLW--YPGCQG 516
Query: 567 AVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFP 626
A A+V+FG+ +P G+LP+T+Y TD M+ GRTYR+ E K FP
Sbjct: 517 GKAAAKVLFGEISPSGKLPVTFYESLEELPDFTDYSMK--------GRTYRYMERKAQFP 568
Query: 627 FGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVT 686
FG GL+YSK + V V G +K V
Sbjct: 569 FGYGLTYSKVA---------------------------VDKAEVKTCG------QKINVE 595
Query: 687 IGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLS 746
+ V+N+G + V ++VK N P L GFQ + L A E +I E+ E
Sbjct: 596 VEVQNNGAYDTEDVVQIYVKNIDSKNAIPNPMLAGFQRIFLKAGECRKI--EIPIWEKAF 653
Query: 747 RAREDGLMVIEEGTHFLV 764
++ +EEG F +
Sbjct: 654 TVVDETGKRMEEGKKFEI 671
>gi|402493386|ref|ZP_10840139.1| beta-glucosidase [Aquimarina agarilytica ZC1]
Length = 734
Score = 405 bits (1040), Expect = e-110, Method: Compositional matrix adjust.
Identities = 280/785 (35%), Positives = 407/785 (51%), Gaps = 101/785 (12%)
Query: 4 HKLSLVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRL 63
HKL +L L F +LL + + Q N +T+ F + T +RA+ LV+ L
Sbjct: 7 HKLLPTKVVLMLIFLTLLPKHLAAQ-------KNENTKNFEWFDTNKSFEKRAKALVASL 59
Query: 64 TLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTA 123
TL+EKIS +V+ + I RL IP Y WW+E LHGVA G+ AT FPQ I A
Sbjct: 60 TLEEKISLMVDQSAPIDRLNIPEYNWWNECLHGVARNGR----------ATVFPQAIGLA 109
Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIG-------MTFWAPNINIFRDPRWGRGQETP 176
A+FD L +++ AI EARA +NA AIG +TFW PNINIFRDPRWGRGQET
Sbjct: 110 ATFDQDLIFKVADAISTEARAKFNASIAIGNRGKYAGLTFWTPNINIFRDPRWGRGQETY 169
Query: 177 GEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDAR 236
GEDP +T + V++V+G+QG+ L+++AC KH Y + + R++FDA
Sbjct: 170 GEDPYLTSQIGVNFVKGLQGNH------PKYLKSAACAKH---YAVHSGPEELRHEFDAI 220
Query: 237 VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYIT 296
+ +D+A+TY P FE+ VK+ + G+M AYNRVNG +CA LL K + WGF GYI
Sbjct: 221 ASKKDMAETYLPAFEALVKEAKVEGVMGAYNRVNGEGACASPYLLEKLLKDTWGFKGYIV 280
Query: 297 SDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRA 356
SDC A+S ++ ++ E++ L G++VNCG+ + A+KQ E ++D
Sbjct: 281 SDCWALSDLHKFHKVTQTAEESAAAALNVGLNVNCGN-VYPALDGAIKQGLTSEKQLDNV 339
Query: 357 LHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPL 416
L + R +LG F+ + P+ KI DVV S AH+ +AL+AAQ IVLLKN++ LLPL
Sbjct: 340 LQHQLLTRFKLGFFDPS-NNNPYNKITTDVVDSEAHRAIALEAAQKSIVLLKNNNNLLPL 398
Query: 417 PKS-KSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE---NTVYYPGCDTVA 472
K KSV +A GPNA LLGNY G + ++ T L + + V + Y G
Sbjct: 399 KKDLKSVYVA--GPNAAREDVLLGNYYGVTSKTQTILDGIVSKVSAGTSINYKQGLLPFQ 456
Query: 473 CSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---------LDRVDLVLPGRQQELITR 523
+ ID + AD +++MGL E EE DRVD+ LP Q + I +
Sbjct: 457 KNVNPIDWSTGEISRADVGIIVMGLSGNYEGEEGEAIASESKGDRVDIRLPQNQIDYIKK 516
Query: 524 V-AEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGG 582
+ A+ P++LVL G P+ + YD + +I++A YPGE G A+A+++FGD P G
Sbjct: 517 IKAKNTGNPLVLVLTGGSPIAMP-EVYDL-VDAIVFAWYPGEEGGQAVADILFGDVVPSG 574
Query: 583 RLPMTWYPQDYIKV-PMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFK 641
+LP+T +P+ + P D M+ GRTY++ FPFG GLSY+ + Y
Sbjct: 575 KLPIT-FPKSVDDLPPYNDYAMK--------GRTYKYMTKTPQFPFGFGLSYTSFKYDNL 625
Query: 642 AVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPV 701
V + K + + N+G + +
Sbjct: 626 KVYKEK-------------------------------------ASFSITNNGNVDAEEVA 648
Query: 702 LLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTH 761
++V G G P+ LVGF V L A ++ S ++ + DG + +GT+
Sbjct: 649 QVYVSSPNAGKGDPLNTLVGFTRVSLKAGATKQVSIPFSK-KAFVQFDSDGKEITRKGTY 707
Query: 762 FLVVG 766
+ VG
Sbjct: 708 TIHVG 712
>gi|291240563|ref|XP_002740191.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 747
Score = 405 bits (1040), Expect = e-110, Method: Compositional matrix adjust.
Identities = 264/738 (35%), Positives = 374/738 (50%), Gaps = 91/738 (12%)
Query: 11 PLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKIS 70
P C+ FTS+L + ST FPF T+LP S+R DLV RLTL+E +
Sbjct: 4 PGKCVVFTSVLFSLIST-----------ILGDFPFRNTSLPWSERVDDLVGRLTLEEIVL 52
Query: 71 QLVNS-------APAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTA 123
Q+ AP I RLGI Y W +E LHG G ATSFPQ A
Sbjct: 53 QMSRGGTGSNGPAPPIDRLGIGPYSWNTECLHGDVAAGP----------ATSFPQAFGLA 102
Query: 124 ASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTFWAPNINIFRDPRWGRGQETP 176
A+FD+ L +I A E RA YN G G++ ++P INI R P WGR QET
Sbjct: 103 ATFDAVLIEQIANATAYEVRAKYNNYAKHKEYGDHKGLSCFSPVINIARHPLWGRIQETY 162
Query: 177 GEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDAR 236
GEDP ++G A SYV G+QG+ + A+A CKHF AY ++R FDA+
Sbjct: 163 GEDPYLSGTLAASYVNGLQGNH------PRYVTANAGCKHFDAYAGPEDIPSSRSTFDAK 216
Query: 237 VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYIT 296
V+ +DL T+ P F C++ G S +MC+YN +NG+P+CA++ LL+ R +W F GY+
Sbjct: 217 VSDRDLRMTFLPAFHECIQAGTHS-LMCSYNSINGVPACANKKLLTDILRTEWNFTGYVI 275
Query: 297 SDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKH----TKAAVKQKKLPESE 352
SD AV +YDA Y K D + + +G+++ S L+ + T AVKQ +
Sbjct: 276 SDQSAVEKVYDAHHYTKDMLDTAIACVNSGLNLELSSNLEDNVMMQTTKAVKQGNVTMKT 335
Query: 353 IDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHG 412
+ + LF RMRLG F+ P M P+ K+ ++ S HQ L+L+AA VLLKN +
Sbjct: 336 VKARVSPLFYTRMRLGEFDP-PEMNPYSKLDLSIIQSQEHQELSLKAAAKSFVLLKNENR 394
Query: 413 LLPLPKSKSVSLALIGPNANSAKTLLGNY-AGPSCRSITPLQALQNYVENTVYYPGCDTV 471
LPL K K LA++GP A++ L G+Y A P+ ++TP L NT Y GCD
Sbjct: 395 FLPL-KEKIDKLAVVGPLADNVDALYGDYSATPNNYTVTPRNGLARLAGNTSYASGCDNP 453
Query: 472 ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKP 531
C + GAD VV+ +G E E DR +L LPG+Q L+ + KP
Sbjct: 454 KCRKYDSGQVKSAVSGADMVVVCVGTGTDIESEGNDRHELALPGKQLSLLQDAVKFGTKP 513
Query: 532 VILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFG---DHNPGGRLPMTW 588
VIL+L GP+D+++A + + +I+ +P +A AL + + NP GRLPMTW
Sbjct: 514 VILLLFNAGPLDVSWAVENPAVQTIVACFFPAQATGDALYRMFMNTSPESNPAGRLPMTW 573
Query: 589 YPQDYIKV-PMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNK 647
P+ +V PMTD M+ GRTYR+ + +FPFG GLSY+ +
Sbjct: 574 -PRSMEQVPPMTDYTMK--------GRTYRYSDADPLFPFGFGLSYTLFK---------- 614
Query: 648 LYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKP 707
Y N S+S ++++ D V + + V N G+ G + +++
Sbjct: 615 -YYNTSASPTVIKSCDTV------------------TIPLTVTNVGDFPGDEVMQVYISW 655
Query: 708 ARRGNGRPIKQLVGFQSV 725
+ P QLVGF+ V
Sbjct: 656 SNASVTVPKLQLVGFRRV 673
>gi|116197206|ref|XP_001224415.1| hypothetical protein CHGG_05201 [Chaetomium globosum CBS 148.51]
gi|88181114|gb|EAQ88582.1| hypothetical protein CHGG_05201 [Chaetomium globosum CBS 148.51]
Length = 735
Score = 404 bits (1039), Expect = e-110, Method: Compositional matrix adjust.
Identities = 263/711 (36%), Positives = 384/711 (54%), Gaps = 62/711 (8%)
Query: 74 NSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYR 133
+++P + RLG+ AY+WW+EALHGVA +GI + G AT FPQ I ++A+FD +L R
Sbjct: 43 SNSPGVSRLGLSAYQWWNEALHGVAH-NRGITWGGQFSAATQFPQAITSSAAFDDHLIER 101
Query: 134 IGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRG 193
IG I EARA N G+A + FW PN+N FRDPRWGRG ETPGED K+A ++V+G
Sbjct: 102 IGVIISTEARAFANNGRA-HLDFWTPNVNPFRDPRWGRGHETPGEDAFRNKKWAEAFVQG 160
Query: 194 VQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESC 253
+QG + + A CKH+ AYDL+N TTR+ FDA+V+ QDLA+ Y PPF+ C
Sbjct: 161 MQG-------TESTHRVIATCKHYAAYDLENSGSTTRFNFDAKVSTQDLAEYYLPPFQQC 213
Query: 254 VKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGF---HGYITSDCDAVSIIYDAEG 310
+ + IMC+YN VNG+P+CA L+ R+ W + + Y+ SDCDAV + +A G
Sbjct: 214 ARDSKVGSIMCSYNAVNGVPACASPYLMDTILRKHWNWTDQNQYVVSDCDAVYYLGNANG 273
Query: 311 ---YAKSPEDAVVDVLKAGMDVNCGSF--LQKHTKAAVKQKKLPESEIDRALHNLFSVRM 365
Y S A+ L+AG D C + +A ++ ++ +D+A+ +
Sbjct: 274 GHRYKSSYAAAIGASLEAGCDNMCWATGGTTPDPASAFNSRQFTQATLDKAMLRQMQGLV 333
Query: 366 RLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKS-VSL 424
+ G F+G ++ + + A V + + AL+AA++GIVLLKN + +LPL S +
Sbjct: 334 KAGYFDGPNSL--YRNLTAADVNTQVARDTALKAAEEGIVLLKNDN-ILPLTLGGSNTQV 390
Query: 425 ALIGPNANSAKTLLGNYAGPSCRSITPLQALQ------NYVENTVYYPGCDTVACSSASI 478
A+IG AN+A +LG Y+G S P+ A + NYV + DT A
Sbjct: 391 AMIGFWANAADKMLGGYSGSPPFSHDPVTAARSMGITVNYVNGPLTQTNADTSA------ 444
Query: 479 DKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLC 538
AV+ A+ + V+ G+D T EKE DR + P Q +I R+A+ KPVI+V +
Sbjct: 445 --AVNAAQKSSVVIFFGGIDNTVEKESQDRTSIAWPSGQLTMIQRLAQTG-KPVIVVRM- 500
Query: 539 GGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVP 597
G VD T N+ +ILWAGYPG+ G A+ +I G +P GRLP+T YP Y + P
Sbjct: 501 GTHVDDTPLLSIPNVKAILWAGYPGQDGGTAVMNLITGLASPAGRLPVTVYPSSYTNQAP 560
Query: 598 MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTK 657
T+M +RP +S PGRTYR+Y+ VFPFG GL Y+ +S + +S K
Sbjct: 561 YTNMALRP--SSSYPGRTYRWYK-DPVFPFGHGLHYTNFSVAPLDFPATFSIADLLASCK 617
Query: 658 MVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIK 717
V ++ + S V++ V N G A + VL F+ RPIK
Sbjct: 618 GVTYLELCPFPS---------------VSVSVTNTGSRASDYVVLGFLAGDFGPTPRPIK 662
Query: 718 QLVGFQSVI---LNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVV 765
L ++ V + AE+ ++L ESL+R G V+ GT+ L++
Sbjct: 663 SLATYKRVFDVQPGKTQSAELDWKL---ESLARVDGKGNRVLYPGTYTLLL 710
>gi|451996250|gb|EMD88717.1| glycoside hydrolase family 3 protein [Cochliobolus heterostrophus
C5]
Length = 763
Score = 404 bits (1039), Expect = e-110, Method: Compositional matrix adjust.
Identities = 255/742 (34%), Positives = 374/742 (50%), Gaps = 43/742 (5%)
Query: 34 DPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEA 93
D +N T C P +RA LV+ + EK+ LV+ + + RLG+PAY WW EA
Sbjct: 25 DCTNGPLSTNAICDVNAPPHERAAALVAAMEPQEKLDNLVSKSKGVSRLGLPAYNWWGEA 84
Query: 94 LHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG 153
LHGVAG GI F + ATSFP IL +A+FD L ++I IG EARA N G A
Sbjct: 85 LHGVAGA-PGIKFVEPYKNATSFPMPILMSAAFDDDLIFKIANIIGNEARAFGNGGVA-P 142
Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
M +W P+IN RD RWGR E+PGED Y + + G++GD + + + A
Sbjct: 143 MDYWTPDINPVRDIRWGRASESPGEDIRRIKGYTKALLAGLEGD-------QAQRKIIAT 195
Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
CKH+ YD++ W G R+ F A++TMQDLA+ Y PPF+ C + + MC+YN VNG+P
Sbjct: 196 CKHYVGYDMEAWGGYDRHNFSAKITMQDLAEYYMPPFQQCARDSKVGSFMCSYNAVNGVP 255
Query: 274 SCADRNLLSKTARRQWGF---HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVN 330
+CAD +L R W + + YITSDC+AV+ I + Y ++ GMD++
Sbjct: 256 TCADTYVLQTILRDHWNWTDSNNYITSDCEAVADISENHKYVETLAQGTALAFAKGMDLS 315
Query: 331 CGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSP 390
C A Q L S ID+AL + + G F+G + + + + +P
Sbjct: 316 CEYSGSSDIPGAWSQGLLNLSVIDKALTRQYEGLVHAGYFDG--AKATYANLSYNDINTP 373
Query: 391 AHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSIT 450
+ L+LQ +G+V+LKN H LPLP +K +A+IG AN + L G Y+GP +
Sbjct: 374 EARQLSLQVTSEGLVMLKNDH-TLPLPLTKGSKVAMIGFWANDSSKLQGIYSGPPPYRHS 432
Query: 451 PLQALQNYVEN--TVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDR 508
P+ A + + + P + A+D A+ +D+++ G D T +E DR
Sbjct: 433 PVFAGEQMGLDMAIAWGPMIQNSSVPDNWTTNALDAAEKSDYILYFGGQDWTVAQEGYDR 492
Query: 509 VDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAV 568
+ P Q +L+ ++A+ K V++ L G D + I SI+WA +PG+ G
Sbjct: 493 TTISFPQVQIDLLAKLAKLGKPLVVITL--GDMTDHSPLLSMEGINSIIWANWPGQDGGP 550
Query: 569 ALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFG 628
A+ VI G H P GRLP+T YP DY+K+ M DM +RP A S PGRTYR++ + V PFG
Sbjct: 551 AILNVISGVHAPAGRLPITEYPADYVKLSMLDMNLRPHAES--PGRTYRWFN-ESVQPFG 607
Query: 629 CGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIG 688
GL Y+ + F S+ L + + Q YK + CE VT
Sbjct: 608 FGLHYTTFEAGF--ASEEGLTYDIQETLDSCTQQ----YK-------DLCEVAPLEVT-- 652
Query: 689 VKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGF---QSVILNAKEKAEIVFELSPCESL 745
V N G L F+K P+K L+ + + + AK+ A + L L
Sbjct: 653 VANKGNRTSDFVALAFIKGEVGPKPYPLKTLITYGRLRDIHGGAKKSASLPLTLG---EL 709
Query: 746 SRAREDGLMVIEEGTHFLVVGD 767
+R + G VI G + L++ +
Sbjct: 710 ARVDQSGNTVIYPGEYTLLLDE 731
>gi|319641744|ref|ZP_07996426.1| beta-glucosidase [Bacteroides sp. 3_1_40A]
gi|317386631|gb|EFV67528.1| beta-glucosidase [Bacteroides sp. 3_1_40A]
Length = 702
Score = 404 bits (1038), Expect = e-110, Method: Compositional matrix adjust.
Identities = 264/741 (35%), Positives = 390/741 (52%), Gaps = 93/741 (12%)
Query: 55 RARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGAT 114
R +DLV+RLTL+EK+ + + +PAIPRLGIPAY+WW+EALHGVA T+ T
Sbjct: 2 RVKDLVARLTLEEKVLLMQHHSPAIPRLGIPAYDWWNEALHGVAR---------TLEKVT 52
Query: 115 SFPQVILTAASFDSYLWYRIGQAIGLEARALYN----AGQA----IGMTFWAPNINIFRD 166
FPQ I AA+FD+ ++G E RAL+N AG+ G+T+W PNINIFRD
Sbjct: 53 VFPQAIGMAATFDTEALQKMGDITSTEGRALFNEDWKAGKTGTRYRGLTYWTPNINIFRD 112
Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
PRWGRGQET GEDP +T K + VRG++G+ + L++ AC KH+ + +
Sbjct: 113 PRWGRGQETYGEDPYLTAKMGAAIVRGLEGEDPH------YLKSVACAKHYAVHSGPEY- 165
Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
R+ FDAR ++ DL DTY P F V + + G+MCAYNR+NG P C + LL R
Sbjct: 166 --NRHSFDARPSVFDLWDTYMPAFRELVTKAKVHGVMCAYNRLNGQPCCGNDPLLVDILR 223
Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPED--AVVDVLKAGMDVNCGSFLQKHTKAAVK 344
QW F GY+TSDC A+ AE + PE A+ D L AG D+ CG+ L VK
Sbjct: 224 NQWHFDGYVTSDCWALKDF--AEFHKTHPEHTIAMSDALLAGTDLECGN-LYHLLAEGVK 280
Query: 345 QKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGI 404
+ E +I+ +L LF++ ++G+F+ + P+ IG +V+ AH+ A + A++ I
Sbjct: 281 KGLHSERDINVSLSRLFTILFKIGMFDPAERV-PYSSIGREVLECEAHKQHAERMAKESI 339
Query: 405 VLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV- 463
VLL+N + +LPL SK S+ALIGPNA++ +T L NY G +TP +L+ + + +
Sbjct: 340 VLLENKNHILPLDASKIKSIALIGPNADNGQTQLANYFGTPSEIVTPYMSLKRRLGDKIK 399
Query: 464 --YYPGCDTV--ACSSASIDKAVDIAKGADHVVLMMGLDQTQE-------------KEEL 506
Y PG V + S + A +D +V + G+ E
Sbjct: 400 INYLPGVGIVDKLKDAPSFVQVAHKAAQSDVIVFVSGISADYEGEAGDAGAAGYGGFASG 459
Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
DR + LP Q EL+ ++ + +P+I+V + G ++F +N ++L A Y G+A
Sbjct: 460 DRTTMQLPLVQIELLKKLKKTG-RPLIIVNMSGSV--MSFEWESQNADALLQAWYGGQAA 516
Query: 567 AVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFP 626
A+ +V+FG NP GR+P+T Y D P + M GRTYR+++G+ +P
Sbjct: 517 GDAIVDVLFGHCNPAGRMPLTTYKSDNDLPPFENYSML--------GRTYRYFKGEPRYP 568
Query: 627 FGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVT 686
FG GLSY+ ++Y S + V+ T +T + VT
Sbjct: 569 FGYGLSYTTFAY---------------SDVQCVDE-------------THTGDTAR--VT 598
Query: 687 IGVKNHGEMAGKHPVLLFVKPARRGNGR-PIKQLVGFQSVILNAKEKAEIVFELSPCESL 745
+ V N G+ G V L+V + G + P+ L GF+ + L E + F L+P E L
Sbjct: 599 VTVSNTGDCDGDEVVQLYVVHPQDGRKQIPLCALKGFKRIHLKRGESTSVSFTLTP-EEL 657
Query: 746 SRAREDGLMVIEEGTHFLVVG 766
+ DG +V + G L VG
Sbjct: 658 ALTETDGNLVEKNGQVTLFVG 678
>gi|253579611|ref|ZP_04856880.1| glycoside hydrolase, family 3 domain-containing protein
[Ruminococcus sp. 5_1_39B_FAA]
gi|251849112|gb|EES77073.1| glycoside hydrolase, family 3 domain-containing protein
[Ruminococcus sp. 5_1_39BFAA]
Length = 706
Score = 404 bits (1038), Expect = e-109, Method: Compositional matrix adjust.
Identities = 267/737 (36%), Positives = 387/737 (52%), Gaps = 99/737 (13%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
++A LVS++TL EK SQL A + RLG+PAY +W+EALHGVA G A
Sbjct: 13 KKAEKLVSQMTLLEKASQLKYDAAPVKRLGVPAYNYWNEALHGVARAGV----------A 62
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-------GMTFWAPNINIFRD 166
T FPQ I AA FD ++G I E RA YNA A G+TFW+PN+NIFRD
Sbjct: 63 TMFPQAIAMAAVFDDEEMKKVGDIIATEGRAKYNAYSAKEDRDIYKGLTFWSPNVNIFRD 122
Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
PRWGRG ET GEDP +T + V +V G+QGD G + ++A+AC KH+ + +
Sbjct: 123 PRWGRGHETYGEDPYLTSRLGVKFVEGIQGD----GPV---MKAAACAKHYAVH---SGP 172
Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
+ R++FDA+ +M+D+ +TY P FE+ V + +M AYNR NG P CA + L+ R
Sbjct: 173 ESLRHEFDAQASMKDMWETYLPAFEALVTEADVEAVMGAYNRTNGEPCCAHKYLMEDVLR 232
Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTKAAVKQ 345
+W F G+ TSDC A+ ++ +P + L AG D+NCG ++L H A +
Sbjct: 233 GKWKFEGHYTSDCWAIRDFHEHHMVTSTPRQSAAMALNAGCDLNCGNTYL--HMMGAYQD 290
Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
+ E +I + L + R LGLF+G+ + KI VV H AL+ A+ V
Sbjct: 291 GLVTEEKITESAVRLLTTRYLLGLFDGS----EYDKIPYSVVECKEHIDEALKMARKSCV 346
Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV-- 463
LLKN G+LP+ K+K ++ +IGPNA+S L+GNY G S IT L+ ++ + V
Sbjct: 347 LLKND-GVLPIDKTKVNTIGVIGPNADSRAALIGNYHGTSSEYITVLEGIREEAGDDVRI 405
Query: 464 -YYPGCD-------TVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---------L 506
Y GCD +A I +AV A+ +D V+L +GL++T E EE
Sbjct: 406 LYSQGCDLYKDKVENLAWDQDRISEAVITAENSDVVILCVGLNETLEGEEGDTGNSDASG 465
Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
D+VDL LP Q+ELI +V A KP I+VL+ G +D+ +A+ N IL A YPG G
Sbjct: 466 DKVDLHLPKVQEELIEKVT-AVGKPTIVVLMAGSAIDLNYAQ--DNCNGILLAWYPGARG 522
Query: 567 AVALAEVIFGDHNPGGRLPMTWYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFYEGKEVF 625
A+A+++FG +P G+LP+T+Y +D +P TD M+ RTYR+ E + ++
Sbjct: 523 GRAIADLLFGKESPSGKLPITFY-KDLEGMPEFTDYSMK--------NRTYRYMEKEALY 573
Query: 626 PFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLV 685
PFG GL+YS + V + V E+ E ++
Sbjct: 574 PFGYGLTYS--------------------------DTCVTEAEVVGEVSAE----SDIVL 603
Query: 686 TIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESL 745
VKN+G + V +++K L GF+ V L A E+ + F +S +++
Sbjct: 604 KATVKNNGTVDTDEVVQVYIKDLDSPLAVRNYSLCGFKRVSLKAGEEKSVEFTISN-KAM 662
Query: 746 SRAREDGLMVIEEGTHF 762
+ EDG I G HF
Sbjct: 663 NIVDEDGNRYI-AGKHF 678
>gi|410723195|ref|ZP_11362440.1| beta-glucosidase-like glycosyl hydrolase [Clostridium sp.
Maddingley MBC34-26]
gi|410603399|gb|EKQ57833.1| beta-glucosidase-like glycosyl hydrolase [Clostridium sp.
Maddingley MBC34-26]
Length = 709
Score = 403 bits (1035), Expect = e-109, Method: Compositional matrix adjust.
Identities = 260/738 (35%), Positives = 383/738 (51%), Gaps = 95/738 (12%)
Query: 56 ARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATS 115
A++LVS++TL E+ QL +PAI L +P Y WW+E LHGVA G AT
Sbjct: 16 AKELVSKMTLQERAEQLTYQSPAIKHLNVPEYNWWNEGLHGVARAGT----------ATV 65
Query: 116 FPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTFWAPNINIFRDPR 168
FPQ I AA FD I I E RA YN G G+T+W+PN+NIFRDPR
Sbjct: 66 FPQAIGLAAIFDEEFLGEIADIISTEGRAKYNEYSKKDDRGIYKGLTYWSPNVNIFRDPR 125
Query: 169 WGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGT 228
WGRG ET GEDP +T + V++++G+QG+ GK L+ +AC KHF + +
Sbjct: 126 WGRGHETYGEDPYLTSRLGVAFIKGLQGE----GKY---LKLAACAKHFAVH---SGPEG 175
Query: 229 TRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQ 288
R++F+A V +DL +TY P FE+CVK+ +M AYNR NG P C + LL R +
Sbjct: 176 LRHEFNAVVEKKDLYETYLPAFEACVKEANVESVMGAYNRTNGEPCCGSKTLLKDILRGK 235
Query: 289 WGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKL 348
WGF G++ SDC A++ + + ++V ++ G D+NCG+ + + A K+ +
Sbjct: 236 WGFKGHVVSDCWALADFHLHHMITSTATESVALAIENGCDLNCGN-MYLNLLLAYKEGLV 294
Query: 349 PESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLK 408
E +I A L + R +LG+F+ + + +I +V H +AL A++ +VLLK
Sbjct: 295 TEEQITTAAERLMTTRFKLGMFDEDC---EYNRIPYEVNDCKEHNEIALIASRKSMVLLK 351
Query: 409 NSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV---YY 465
N G LPL KS S+A+IGPNANS L GNY+G + + T L+ + N V + + Y
Sbjct: 352 ND-GTLPLDKSSLKSIAVIGPNANSEIMLKGNYSGTASKYTTILEGIHNAVGDNIRVYYS 410
Query: 466 PGC-------DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELD---------RV 509
GC + +A + +A+ +A+ +D V+L +GLD T E E+ D +
Sbjct: 411 EGCHLFKDKVEDLAGPDDRLSEAISVAERSDVVILCLGLDSTIEGEQGDAGNSYGAGDKE 470
Query: 510 DLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVA 569
L LPGRQQ L+ +V E K PVI+VL G +TF + +IL A YPG G A
Sbjct: 471 SLNLPGRQQNLLEKVLEVGK-PVIVVLGAGSA--LTFNGAEEKCAAILNAWYPGSHGGTA 527
Query: 570 LAEVIFGDHNPGGRLPMTWYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFYEGKEVFPFG 628
+A+++FG +P G+LP+T+Y +D +P TD M+ GRTYR+ E + ++PFG
Sbjct: 528 VADILFGKCSPSGKLPVTFY-KDTANLPEFTDYSMK--------GRTYRYLEHESLYPFG 578
Query: 629 CGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIG 688
GL+YSK VE ++ VP + +F F ++I
Sbjct: 579 YGLTYSK-----------------------VELSNL----QVPFVKADF---ESFDISID 608
Query: 689 VKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRA 748
++N G + V +VK + L GF+ V L E + ELS S
Sbjct: 609 IRNTGNYGIEEVVQCYVKDLKSKYAVLNHSLAGFKRVSLKKGESKTVTIELSK-RSFEAV 667
Query: 749 REDGLMVIEEGTHFLVVG 766
DG +++ + L VG
Sbjct: 668 NNDGERLLDSKSFKLFVG 685
>gi|367053033|ref|XP_003656895.1| glycoside hydrolase family 3 protein [Thielavia terrestris NRRL
8126]
gi|347004160|gb|AEO70559.1| glycoside hydrolase family 3 protein [Thielavia terrestris NRRL
8126]
Length = 758
Score = 400 bits (1028), Expect = e-108, Method: Compositional matrix adjust.
Identities = 271/746 (36%), Positives = 387/746 (51%), Gaps = 59/746 (7%)
Query: 46 CKTTLPISQRARDLVSRLTLDEKISQLV------NSAPAIPRLGIPAYEWWSEALHGVAG 99
C TT +RA LV + + EK++ LV +S+ PRLG+P YEWWSEALHGVA
Sbjct: 11 CDTTASPPKRAAALVEAMNITEKLANLVEYVMARSSSKGAPRLGLPPYEWWSEALHGVA- 69
Query: 100 VGKGIFFN---GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTF 156
G+ FN G ATSF I +A+FD L ++ I EARA NAG A G+ F
Sbjct: 70 ASPGVSFNWSGGPFSYATSFANPITLSAAFDDELVQKVADVISTEARAFANAGSA-GLDF 128
Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKH 216
W PNIN +RDPRWGRG ETPGEDP+ Y S +RG++G+ + + A CKH
Sbjct: 129 WTPNINPWRDPRWGRGSETPGEDPVRIKGYVRSLLRGLEGE-------ESIKKVIATCKH 181
Query: 217 FTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
+ AYDL+ W TRY+FDA V++QDL++ Y PPF+ C + + IMC+YN +NG P+CA
Sbjct: 182 YAAYDLERWHNITRYEFDAIVSLQDLSEYYLPPFQQCARDSKVGSIMCSYNSLNGTPACA 241
Query: 277 DRNLLSKTARRQWGF---HGYITSDCDAV-SIIYDAEGYAKSPEDAVVDVLKAGMDVNC- 331
+ L+ R+ W + + YITSDC+A+ + D + ++ +A AG D C
Sbjct: 242 NTYLMDDILRKHWRWTEDNNYITSDCNAIKDFLPDEHNFTQTAAEAAAAAYTAGTDTVCE 301
Query: 332 --GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
GS A QK L E IDRAL L+ +R G F+ + P+ IG V +
Sbjct: 302 VAGSPPYTDVVGAYDQKLLSEEVIDRALRRLYEGLVRAGYFDPA-SASPYRDIGWSDVNT 360
Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
Q LALQ+A DG+VLLKN G LP+ K + ++ALIG A+ +++LG Y+G
Sbjct: 361 AEAQALALQSASDGLVLLKND-GTLPI-KLEGKTVALIGHWASGTRSMLGGYSGIPPYYH 418
Query: 450 TPLQALQNYVENTVYYPGCDTVACSSASID----KAVDIAKGADHVVLMMGLDQTQEKEE 505
+P+ A N Y VA +SA+ D A+ A +D ++ GLDQ+ E+
Sbjct: 419 SPVYAAGQL--NLTYKYASGPVAPASAARDTWTADALSAANKSDVILYFGGLDQSVASED 476
Query: 506 LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEA 565
DR + P Q LI +A KP++++ L G VD T + N+ +ILWAGYPG++
Sbjct: 477 KDRDSIAWPPAQLTLIQTLA-GLGKPLVVIQL-GDQVDDTPLLTNPNVSAILWAGYPGQS 534
Query: 566 GAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFY-EGKE 623
G A+ I G P GRLP+T YP Y ++P+TDM +RP SG PGRTYR+
Sbjct: 535 GGTAVLNAITGVSPPAGRLPVTQYPSSYTSQLPLTDMSLRPDPASGRPGRTYRWLPRNAT 594
Query: 624 VFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKF 683
V PFG GL Y+ ++ + L + + ++ + H P +
Sbjct: 595 VLPFGYGLHYTNFTARPNPAQNFTL-----TPSALLAPCKLAHRDLCP---------LPY 640
Query: 684 LVTIGVKNHGEMAGKHPVLLFVKPARRG-NGRPIKQLVGF---QSVILNAKEKAEIVFEL 739
VT+ V N G + L+F G P+K LV + + + +A++ L
Sbjct: 641 PVTVEVTNTGARTSDYVGLVFATTRDAGPPPHPLKTLVAYARLRGIAPGRTARAQVQVAL 700
Query: 740 SPCESLSRAREDGLMVIEEGTHFLVV 765
L+R G V+ G + V+
Sbjct: 701 G---DLARVDAAGNRVLYPGRYGFVL 723
>gi|169611757|ref|XP_001799296.1| hypothetical protein SNOG_08993 [Phaeosphaeria nodorum SN15]
gi|160702362|gb|EAT83185.2| hypothetical protein SNOG_08993 [Phaeosphaeria nodorum SN15]
Length = 755
Score = 400 bits (1028), Expect = e-108, Method: Compositional matrix adjust.
Identities = 260/736 (35%), Positives = 375/736 (50%), Gaps = 53/736 (7%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
C T ++RA LV + +EK+ L+ + RLG+P Y WW EALHGVAG GI
Sbjct: 33 ICDVTAAPAERAAALVEAMQTNEKLDNLMR---GVTRLGLPKYNWWGEALHGVAGA-PGI 88
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIF 164
F G + ATSFP +L +A+FD L ++I IG EARA N G A + FW P+IN F
Sbjct: 89 NFTGAYKTATSFPMPLLMSAAFDDDLIFKIANIIGNEARAFGNGGVA-PVDFWTPDINPF 147
Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
RDPRWGRG ETPGED + Y + G++GD K + + A CKH+ YD++
Sbjct: 148 RDPRWGRGSETPGEDIVRIKGYTKHLLAGLEGD-------KPQRKIIATCKHYVGYDMEA 200
Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
W G R+ F+A++ MQDLA+ Y PPF+ C + + MC+YN VNG+P+CAD +L
Sbjct: 201 WGGIDRHSFNAKINMQDLAEYYMPPFQQCARDSKVGSFMCSYNAVNGVPTCADTYVLQTI 260
Query: 285 ARRQWGF---HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA 341
R W + + YITSDC+AV I YAK+ + AGMD +C
Sbjct: 261 LRDHWNWTESNNYITSDCEAVKDISLKHKYAKTNAEGTGLAFTAGMDNSCEYTGSSDIPG 320
Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQ 401
A Q L IDRAL + +R G F+G + +G + +P Q L+LQ A
Sbjct: 321 AFNQSYLSIPTIDRALKRQYEGLVRAGYFDG--AAATYANLGVKDINTPEAQQLSLQVAS 378
Query: 402 DGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN 461
+G+VLLKN LPL + +A++G AN L G Y+GP+ +P+ A Q +
Sbjct: 379 EGLVLLKNDD-TLPLSLTNGSKVAMLGFWANDTSKLSGIYSGPAPYLRSPVWAGQKLGLD 437
Query: 462 TVYYPGCDTVACSSASID----KAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQ 517
G +S++ D A+ A+ +D+++ GLD + E DR + P Q
Sbjct: 438 MAIASGPILQQSNSSTRDNWTTNALAAAEKSDYILYFGGLDPSAAAEGFDRNSIAWPTAQ 497
Query: 518 QELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGD 577
+LI ++A K V+LVL G +D + + S++WA +PG+ G A+ +V+ G
Sbjct: 498 VDLIKKLAAIGKPLVVLVL--GDLMDNSPLLELDGVNSVIWANWPGQDGGSAVMQVVTGA 555
Query: 578 HNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYS 637
GRLP+T YP +Y ++ M DM MRP ++S PGRTYR++ G V PFG GL Y+ +
Sbjct: 556 VAVAGRLPITQYPANYTELSMLDMNMRPSSSS--PGRTYRWFNGA-VQPFGTGLHYTTFD 612
Query: 638 YKFKAVSQNKLYLNQSSSTKMVENQ--DVVHYKSVPELGTEFCETRKFLVTIGVKNHGEM 695
KF A S + + S+ TK NQ D S+P + V N G
Sbjct: 613 AKFAANS--TIEYDISNITKECTNQYPDTCSVPSIP---------------VAVTNSGNR 655
Query: 696 AGKHPVLLFVKPARRGNGRPIKQLVGF---QSVILNAKEKAEIVFELSPCESLSRAREDG 752
L F+K P+K L+ + + V + AE+ L +L+R + G
Sbjct: 656 TSDFIALAFIKGENGPAPYPLKTLISYTRVRDVKGGQTKSAEMQLTLG---NLARVDQMG 712
Query: 753 LMVIEEGTHFLVVGDE 768
V+ G + V+ DE
Sbjct: 713 NTVLYPG-EYTVLLDE 727
>gi|302669556|ref|YP_003829516.1| beta-xylosidase [Butyrivibrio proteoclasticus B316]
gi|302394029|gb|ADL32934.1| beta-xylosidase Xyl3A [Butyrivibrio proteoclasticus B316]
Length = 709
Score = 400 bits (1028), Expect = e-108, Method: Compositional matrix adjust.
Identities = 256/733 (34%), Positives = 385/733 (52%), Gaps = 104/733 (14%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
+RA++LV+++T++EK SQL APAI RLGIPAY WW+EALHGVA G A
Sbjct: 8 KRAKELVAKMTVEEKASQLRYDAPAIDRLGIPAYNWWNEALHGVARAGT----------A 57
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTFWAPNINIFRD 166
T FPQ I AA+FD L +G+ I EARA YN G+TFWAPN+NIFRD
Sbjct: 58 TMFPQAIGLAAAFDEELMSEVGEVIAEEARAKYNEQSKREDRDIYKGLTFWAPNVNIFRD 117
Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
PRWGRG ET GEDP +T + AV +V+ +QGD G+ ++A+AC KHF + +
Sbjct: 118 PRWGRGHETYGEDPFLTSRLAVPFVKAMQGD----GEY---MKAAACAKHFAVH---SGP 167
Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
R+ FDA+ + +DL +TY P FE+ VK+ +M AYNR NG P CA++ L+ T R
Sbjct: 168 EGERHFFDAKASKKDLEETYLPAFEALVKEAEVEAVMGAYNRTNGEPCCANKPLMVDTLR 227
Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQK 346
+WGF G+ SDC A+ ++ SPE++ L+ G D+NCG Q V+
Sbjct: 228 GKWGFQGHFVSDCWAIKDFHENHKVTSSPEESAKLALEMGCDLNCGCTYQS-IMNGVRAG 286
Query: 347 KLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVL 406
+ E I + LF+ R LG+F+ F +I + V H +A +AA++ +VL
Sbjct: 287 LIDEKLITESCERLFTTRFLLGMFD----KTEFDEIPYEKVECKEHLAVAKRAARESVVL 342
Query: 407 LKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV--- 463
LKN GLLPL K ++ ++GPNANS +L+GNY G S R IT L+ +Q+ V + V
Sbjct: 343 LKND-GLLPLNKDSIKTIGVVGPNANSRLSLIGNYHGTSSRYITVLEGIQDKVGDDVRVL 401
Query: 464 YYPGCDTVACSSAS---------IDKAVDIAKGADHVVLMMGLDQTQEKEE--------- 505
Y GCD + ++ + +A +A +D VV+++GLD+ E EE
Sbjct: 402 YSEGCDIFQNNISNLADPNLPDRLSEAQAVADHSDVVVVVVGLDENLEGEEGDAGNQFAS 461
Query: 506 LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEA 565
D+++L LP Q++L+ V + KP I++ + G +D++ A+ + N ++L A YPG
Sbjct: 462 GDKINLNLPLSQRQLLNAVLDCG-KPTIVIDMAGSAIDLSKAQDEAN--AVLQAFYPGAR 518
Query: 566 GAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVF 625
G +A+++FGD +P G+LP+T+Y D M+ RTY+++ G ++
Sbjct: 519 GGADVADILFGDVSPSGKLPVTFYKSADDLPDFKDYSMK--------NRTYKYFTGTPLY 570
Query: 626 PFGCGLSYS------KYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCE 679
PFG GL+Y Y + K +K+ G E
Sbjct: 571 PFGYGLTYGDCYVKPDYDFNVKYADADKVS------------------------GAE--- 603
Query: 680 TRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFEL 739
+T+ V N G++ V L++K LVGF+ V + A + + +
Sbjct: 604 -----ITVTVVNDGKLDTDEVVQLYIKDMDSYFATTNPSLVGFKRVHVPAGGETRVTLTV 658
Query: 740 SPCESLSRAREDG 752
S ++ + E+G
Sbjct: 659 SE-KAFTSVNEEG 670
>gi|451851086|gb|EMD64387.1| glycoside hydrolase family 3 protein [Cochliobolus sativus ND90Pr]
Length = 763
Score = 400 bits (1028), Expect = e-108, Method: Compositional matrix adjust.
Identities = 253/742 (34%), Positives = 372/742 (50%), Gaps = 43/742 (5%)
Query: 34 DPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEA 93
D +N T C P +RA LV+ + EK+ LV+ + + RLG+PAY WW EA
Sbjct: 25 DCTNGPLSTNAICDVNAPPHERAAALVAAMEPQEKLDNLVSKSKGVSRLGLPAYNWWGEA 84
Query: 94 LHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG 153
LHGVAG GI F + ATSFP IL +A+FD L ++I IG EARA N G A
Sbjct: 85 LHGVAGA-PGIKFVEPYKNATSFPMPILMSAAFDDDLIFKIANIIGNEARAFGNGGVA-P 142
Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
+ +W P+IN RD RWGR E+PGED Y + + G++GD + + + A
Sbjct: 143 VDYWTPDINPVRDIRWGRASESPGEDIRRIKGYTKALLAGLEGD-------QAQRKIIAT 195
Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
CKH+ YD++ W G R+ F A++TMQDLA+ Y PPF+ C + + MC+YN VNGIP
Sbjct: 196 CKHYVGYDMEAWGGYDRHNFSAKITMQDLAEYYMPPFQQCARDSKVGSFMCSYNAVNGIP 255
Query: 274 SCADRNLLSKTARRQWGF---HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVN 330
+CAD +L R W + + YITSDC+AV+ I + Y ++ GMD++
Sbjct: 256 TCADTYVLQTILRDHWNWTDSNNYITSDCEAVADISENHKYVETLAQGTALAFAKGMDLS 315
Query: 331 CGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSP 390
C A Q L S ID+AL + + G F+G + + + +P
Sbjct: 316 CEYTGSSDIPGAWAQGLLNISVIDKALTRQYEGLVHAGYFDG--AKATYANLSYKDINTP 373
Query: 391 AHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSIT 450
+ L+LQ +G+V+LKN H LPLP +K +A+IG AN + L G Y+GP +
Sbjct: 374 EARQLSLQVTSEGLVMLKNDH-TLPLPLTKGSKVAMIGFWANDSSKLQGIYSGPPPYRHS 432
Query: 451 PLQALQNYVENT--VYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDR 508
P+ A + + + P + A+D A+ +D+++ G D T +E DR
Sbjct: 433 PVFAGEQMGLDMAIAWGPMIQNSSVPDNWTTNALDAAEKSDYILYFGGQDWTVAQEGYDR 492
Query: 509 VDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAV 568
+ P Q +L+T++A+ K V++ L G D + + SI+WA +PG+ G
Sbjct: 493 TTISFPQVQIDLLTKLAKLGKPLVVITL--GDMTDHSPLLSMEGVNSIIWANWPGQDGGP 550
Query: 569 ALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFG 628
A+ V+ G H P GRLP+T YP DY+K+ M DM +RP S PGRTYR++ + V PFG
Sbjct: 551 AILNVVSGAHAPAGRLPITEYPADYVKLSMLDMNLRPHTES--PGRTYRWFN-ESVQPFG 607
Query: 629 CGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIG 688
GL Y+ + F S+ L + Q YK + CE VT
Sbjct: 608 FGLHYTTFEASF--ASEEGLTYDIEEILDGCTQQ----YK-------DLCEVAPLEVT-- 652
Query: 689 VKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGF---QSVILNAKEKAEIVFELSPCESL 745
V N G L F+K P+K L+ + + + AK+ A + L L
Sbjct: 653 VANKGNRTSDFVALAFIKGEVGPKPYPLKTLITYGRLRDIHGGAKKSASLPLTLG---EL 709
Query: 746 SRAREDGLMVIEEGTHFLVVGD 767
+R + G VI G + L++ +
Sbjct: 710 ARVDQSGNTVIYPGEYTLLLDE 731
>gi|365135698|ref|ZP_09343911.1| hypothetical protein HMPREF1032_03710 [Subdoligranulum sp.
4_3_54A2FAA]
gi|363612160|gb|EHL63713.1| hypothetical protein HMPREF1032_03710 [Subdoligranulum sp.
4_3_54A2FAA]
Length = 643
Score = 399 bits (1026), Expect = e-108, Method: Compositional matrix adjust.
Identities = 240/605 (39%), Positives = 335/605 (55%), Gaps = 58/605 (9%)
Query: 52 ISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIR 111
+QRAR LV+++TL+EK+SQ+ APAI RLGIPAY WW+E LHGV G
Sbjct: 4 FAQRARALVAQMTLEEKVSQMRYDAPAIERLGIPAYNWWNECLHGVGRSGT--------- 54
Query: 112 GATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN-------AGQAIGMTFWAPNINIF 164
AT FPQ I AASFD L + QAI EARA YN G G+TFW+PNIN+F
Sbjct: 55 -ATVFPQPIGMAASFDESLLEHVAQAISDEARAKYNQYKTFGETGIYQGLTFWSPNINLF 113
Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
RDPRWGRG ET GEDPL+TG+ +++RG+Q G+ + A KHF A+ +
Sbjct: 114 RDPRWGRGHETYGEDPLLTGRMGTAFIRGLQE-----GEDSQYRKLDATVKHFAAH---S 165
Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
R+ F+A V+ +D+AD+Y F C++ + + +M AYNR+NG P+CA L
Sbjct: 166 GPEAGRHSFNAEVSAEDMADSYLWAFRYCIEHAKPAAVMGAYNRINGEPACASSTYLKGV 225
Query: 285 ARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVK 344
+W F GY+ SDC A+ I + K+ +++ + G +NCG KAAV+
Sbjct: 226 LYEEWKFDGYVVSDCGAIQDINENHHVTKNEKESAALAVNNGCQLNCGKAYH-WVKAAVE 284
Query: 345 QKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGI 404
+ E + A+ LF R RLG+F+ + + I +V+ H+ L + AQ+ I
Sbjct: 285 DGLISEDTVTCAVERLFEARFRLGMFDSDCV---YDSIPMNVIECRKHRELNRKMAQESI 341
Query: 405 VLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVY 464
VLLKN+ G+LPL K++ A+IGPNA+ LLGNY G T L+ +Q+ VY
Sbjct: 342 VLLKNN-GILPLNPEKTI--AVIGPNADDKTVLLGNYNGTPSHWTTLLRGIQDQARGEVY 398
Query: 465 YPGCDTVACSSA------SIDKAVDIAKGADHVVLMMGLDQTQEKEE---------LDRV 509
Y + A + +A+ AK AD VVL +GL E EE DR
Sbjct: 399 YARGSVLVEKEALPWAEKPLHEAIYTAKAADVVVLCLGLSPLLEGEEGDAYNGADSGDRK 458
Query: 510 DLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVA 569
D+ LP QQ+L+ + + +KPV+LV + GG VD+ A D +IL YPG G A
Sbjct: 459 DISLPDIQQQLLCAILD-TEKPVVLVNVSGGCVDLRQA--DERCAAILQCFYPGAEGGNA 515
Query: 570 LAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGC 629
LA+++FG +P GRLP+T+Y P TD M+ GRTYRF++GK ++PFG
Sbjct: 516 LADILFGRVSPSGRLPVTFYRTVEDLPPFTDYSMK--------GRTYRFFDGKPLYPFGH 567
Query: 630 GLSYS 634
GL+Y+
Sbjct: 568 GLTYA 572
>gi|291556907|emb|CBL34024.1| Beta-glucosidase-related glycosidases [Eubacterium siraeum V10Sc8a]
Length = 691
Score = 399 bits (1025), Expect = e-108, Method: Compositional matrix adjust.
Identities = 249/648 (38%), Positives = 361/648 (55%), Gaps = 71/648 (10%)
Query: 50 LPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGT 109
L +RA L L+ +E+ QL APAI + G+P+Y WW+E LHGVA G
Sbjct: 9 LSAYERAAALADTLSTEEQAQQLKYDAPAIEKAGLPSYNWWNEGLHGVARAGT------- 61
Query: 110 IRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTFWAPNIN 162
AT FPQ I AA+FD + YR+G+ I EARA+YN+ G+T WAPNIN
Sbjct: 62 ---ATVFPQAIALAAAFDKDMMYRVGEVISTEARAMYNSAAKHGDTDIYKGLTLWAPNIN 118
Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL 222
IFRDPRWGRG ET GEDP +T + VS+V+G+QG+ + L+A+AC KHF +
Sbjct: 119 IFRDPRWGRGHETYGEDPYLTSRLGVSFVKGIQGE-------EEYLRAAACAKHFAVH-- 169
Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
+ + R++FDARV+ +D+ +TY P F++ VK+GR G+M AYNRVNG PSCA L+
Sbjct: 170 -SGPESLRHEFDARVSEKDMEETYLPAFKALVKEGRVEGVMGAYNRVNGEPSCASEKLMG 228
Query: 283 KTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTKA 341
K R+WGF GY SDC A+ + + + LKAG DVNCG ++L H A
Sbjct: 229 KL--REWGFDGYFVSDCWAIRDFHTTHKITDTAPQSAAMALKAGCDVNCGNTYL--HILA 284
Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQ 401
A+++ + + +I A + R+RLG + N F + D++ ++ L+L+AA+
Sbjct: 285 ALEEGLITKQDIRTACIHALRTRIRLGQLDDN----EFDDLPFDIIACDGNKALSLEAAE 340
Query: 402 DGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN 461
+VLL N G+LPL KS+ S+A+IGPNA+S LLGNY G RS+T L+ +Q+ +
Sbjct: 341 KSMVLLHND-GILPLDKSRISSIAVIGPNADSRAALLGNYNGTPDRSVTFLEGIQDAFDG 399
Query: 462 TVYYP-GCDT-------VACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE-------- 505
VYY GC +A +AV + AD V+ +GLD T E EE
Sbjct: 400 RVYYAEGCQLFRDRTQGLALPGDRYAEAVAACEAADVTVICVGLDATLEGEEGDTGNEFA 459
Query: 506 -LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGE 564
D+ DL LP Q+ L+ + + KP+I+VL G V+ +++ A YPG+
Sbjct: 460 SGDKPDLRLPEVQRVLLQNLKDTG-KPLIIVLAAGSSVNTECEG-----NALINAWYPGQ 513
Query: 565 AGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFY--EGK 622
G ALAE++FG+ +P G+LP+T+Y + TD M+ RTYRF E
Sbjct: 514 YGGKALAEILFGEVSPSGKLPVTFYKSADMLPDFTDYSMK--------NRTYRFCDDESN 565
Query: 623 EVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSS-TKMVENQDVVHYKS 669
++PFG GL+YS + + N L +N +++ ++ E+ V+ KS
Sbjct: 566 VLYPFGYGLTYSHFECGDVSYKDNTLAVNVTNTGSRSAEDVLQVYIKS 613
>gi|440472411|gb|ELQ41274.1| beta-xylosidase [Magnaporthe oryzae Y34]
gi|440484691|gb|ELQ64724.1| beta-xylosidase [Magnaporthe oryzae P131]
Length = 792
Score = 399 bits (1024), Expect = e-108, Method: Compositional matrix adjust.
Identities = 267/765 (34%), Positives = 397/765 (51%), Gaps = 62/765 (8%)
Query: 34 DPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEA 93
D N T C ++RA LV + LDEK+ LVN +P PR+G+PAYEWWSEA
Sbjct: 29 DCENGPLSTNIVCDQAATPAERAAGLVDIMELDEKLENLVNKSPGAPRIGLPAYEWWSEA 88
Query: 94 LHGVAGVGKGIFFN----GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG 149
LHGVA G+ FN ATSF I+ +A+FD L + I EARA NAG
Sbjct: 89 LHGVAK-SPGVTFNKSSGAAFSSATSFSNPIVLSAAFDDELVEAVATQISTEARAFSNAG 147
Query: 150 QAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQ 209
A G+ +W PNIN ++DPRWGRG ETPGED L KY + +RG++G K+
Sbjct: 148 LA-GLDWWTPNINPYKDPRWGRGMETPGEDALRISKYVKALLRGLEGSDPTTRKM----- 201
Query: 210 ASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRV 269
A CKH+ A DL+ W G TRY FDA VT+QDL++ Y P F+ C + MCAYN +
Sbjct: 202 -VANCKHYAANDLERWNGVTRYNFDAPVTLQDLSEYYLPAFKQCARDSNVGSFMCAYNAM 260
Query: 270 ---------NGIPSCADRNLLSKTARRQWGF---HGYITSDCDAVSIIYDAEGYAKSPED 317
NG P CA + L++ R WG+ + +ITSDC+AV +++ ++ + E+
Sbjct: 261 SIKGKDLSWNGTPVCASKYLMNDILREHWGWKEHNNWITSDCNAVLHMWNQHHWSDTREE 320
Query: 318 AVVDVLKAGMDVNC--GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPT 375
A AG D C ++ + K A + L E +DRAL L+ +R G F+G
Sbjct: 321 AAGSAYTAGTDTVCEVSNYDKTAVKGAFDRGLLDEDVVDRALKRLYEGLVRAGYFDGPDA 380
Query: 376 MQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLP----KSKSVSLALIGPNA 431
P+ I V +P + LA ++A +G+VL KN +G+LP+ + K ++ALIG
Sbjct: 381 --PYRNITWADVNTPEARKLAHRSAVEGMVLTKN-NGVLPIKLEELQKKGKTVALIGNWV 437
Query: 432 NSAKTLLGNYAGPSCRSITPLQALQ--NYVENTVYYPGCDTVACSSASIDKAVDIAKGAD 489
++ + +LG Y+G + TPL A + N T P + + A++ A AD
Sbjct: 438 DNGEQMLGTYSGIAPFRNTPLAAAKALNLKMVTAGGPVNQSTGSRDSWTRPALNAAIQAD 497
Query: 490 HVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKY 549
V+ G+D + E E+ DR L P Q +L++ ++ A KP ++V L G +D T
Sbjct: 498 VVLYFGGIDLSVEAEDRDRYSLAWPSAQAKLLSDIS-ALGKPTVVVQL-GTMLDDTALLD 555
Query: 550 DRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQ-- 606
++NI +I+WAGYPG+ G A ++I G P GRLP+T YP Y +VPMTDM++RP
Sbjct: 556 NKNISAIIWAGYPGQDGGTAAFDIITGKTAPSGRLPVTQYPAKYANQVPMTDMEVRPSKD 615
Query: 607 ---ATSGNPGRTYRFYEGKEVFPFGCGLSYSKY-SYKFKAVSQNKLYLNQSSSTKMVENQ 662
+ NPGRTYR+Y+ + V PFG GL ++ + + + S + S K ++
Sbjct: 616 TKGGAASNPGRTYRWYD-EAVHPFGFGLHFTNFTTSVAVSSSSAISTSDLESGCKSEKHI 674
Query: 663 DVVHYKSVPELGTEFCETRKFLVTIGVKNHGE-MAGKHPVLLFVKPARRGNGRPIKQLVG 721
D + S E + V N G+ + L FV+ P+K LV
Sbjct: 675 DKCSFPSSLE--------------VSVTNDGKSTTSSYAALAFVRGEYGPKPYPLKTLVA 720
Query: 722 FQSV-ILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVV 765
+ + + + ++ EL+ L+R E+G +V+ G + ++V
Sbjct: 721 YGKLHDIAPGQTKKVKLELT-LGDLARTAENGDLVLYPGKYEVLV 764
>gi|150019782|ref|YP_001312036.1| glycoside hydrolase family protein [Clostridium beijerinckii NCIMB
8052]
gi|149906247|gb|ABR37080.1| glycoside hydrolase, family 3 domain protein [Clostridium
beijerinckii NCIMB 8052]
Length = 709
Score = 398 bits (1023), Expect = e-108, Method: Compositional matrix adjust.
Identities = 263/738 (35%), Positives = 385/738 (52%), Gaps = 95/738 (12%)
Query: 56 ARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATS 115
A++LVS++TL EK QL +PAI L +P Y WW+E LHGVA G AT
Sbjct: 16 AKELVSKMTLQEKAEQLTYQSPAIKHLNVPEYNWWNEGLHGVARAGT----------ATV 65
Query: 116 FPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTFWAPNINIFRDPR 168
FPQ I AA FD ++ I E RA YN G G+T+W+PNINIFRDPR
Sbjct: 66 FPQAIGLAAIFDDEFLGKVANIIATEGRAKYNEYSKKDDRGIYKGLTYWSPNINIFRDPR 125
Query: 169 WGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGT 228
WGRG ET GEDP +T + V++++G+QG+ GK L+ +AC KHF + +
Sbjct: 126 WGRGHETYGEDPYLTSRLGVAFIKGLQGE----GKY---LKLAACAKHFAVH---SGPEG 175
Query: 229 TRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQ 288
R++F+A V +DL +TY P FE+CVK+ +M AYNR NG P C + LL R +
Sbjct: 176 LRHEFNAVVNKKDLYETYLPAFEACVKEANVESVMGAYNRTNGEPCCGSKTLLKDILRGK 235
Query: 289 WGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKL 348
WGF G++ SDC A++ + + ++V ++ G D+NCG+ + + A K+ +
Sbjct: 236 WGFKGHVVSDCWALADFHLHHMVTSTATESVALAIENGCDLNCGN-MYLNLLLAYKEGLV 294
Query: 349 PESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLK 408
E +I A L + R +LG+F+ + KI +V S H +AL A++ +VLLK
Sbjct: 295 TEEQITTAAERLMTTRFKLGMFDEEC---EYNKIPYEVNDSREHNEVALIASRKSMVLLK 351
Query: 409 NSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV---YY 465
N +G LPL KS S+A+IGPNANS L GNY+G + + T L+ + + V N V Y
Sbjct: 352 N-NGTLPLDKSNLKSIAVIGPNANSEIMLKGNYSGTASKYTTILEGIHDAVGNDVRVYYS 410
Query: 466 PGC-------DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---------LDRV 509
GC + +A + +A+ +A+ +D VVL +GLD T E E+ D+
Sbjct: 411 EGCHLFKDKVEDLARPDDRLSEAISVAERSDVVVLCLGLDSTIEGEQGDAGNSYGAGDKE 470
Query: 510 DLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVA 569
+L LPGRQQ L+ +V E KPVI+VL G + + A + +IL A YPG G A
Sbjct: 471 NLNLPGRQQNLLEKVLEVG-KPVIVVLGAGSALTLNGA--EEKCAAILNAWYPGSHGGTA 527
Query: 570 LAEVIFGDHNPGGRLPMTWYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFYEGKEVFPFG 628
+A+++FG +P G+LP+T+Y +D K+P TD M+ GRTYR+ + ++PFG
Sbjct: 528 VADILFGKCSPSGKLPVTFY-KDTAKLPDFTDYSMK--------GRTYRYLGHESLYPFG 578
Query: 629 CGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIG 688
GL+YS VE ++ VP + F F ++I
Sbjct: 579 YGLTYS-----------------------TVELSNL----QVPSVKQGFG---SFDISIE 608
Query: 689 VKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRA 748
+KN GE + V +VK L GF+ V L E + +L+ +S
Sbjct: 609 IKNTGEYDIEEVVQCYVKDIESKYAVLNHSLAGFKRVSLKKGESKIVTIKLNK-KSFEVV 667
Query: 749 REDGLMVIEEGTHFLVVG 766
+DG +++ L VG
Sbjct: 668 NDDGERLLDSKKFKLFVG 685
>gi|291544853|emb|CBL17962.1| Beta-glucosidase-related glycosidases [Ruminococcus champanellensis
18P13]
Length = 697
Score = 398 bits (1022), Expect = e-108, Method: Compositional matrix adjust.
Identities = 249/654 (38%), Positives = 360/654 (55%), Gaps = 74/654 (11%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
+ +L +RA DL RLT++E+ SQL A IPRLGIPAY WW+E LHGVA G
Sbjct: 9 YLNPSLTPDERAEDLADRLTVEEQASQLRYDALPIPRLGIPAYNWWNEGLHGVARAGT-- 66
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTFW 157
AT FPQ I AA+FD+ L ++IG+ EARA + A + G+T W
Sbjct: 67 --------ATMFPQAIGMAATFDTALLHQIGEITATEARAKHMAAREHGDFDIYKGLTLW 118
Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
APNIN+FRDPRWGRG ET GEDP +T + V++V+G+QG+ GK+ L+A+AC KHF
Sbjct: 119 APNINLFRDPRWGRGHETYGEDPFLTARLGVAFVKGMQGE----GKV---LKAAACAKHF 171
Query: 218 TAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCAD 277
+ + R+ FDA+V+ +DL ++Y P F + V + + G+M AYNRVNG PSCA
Sbjct: 172 AVH---SGPEALRHSFDAQVSPKDLEESYLPAFHALVAEAKVEGVMGAYNRVNGEPSCAS 228
Query: 278 RNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQ 336
L+ K QWGF GY SDC A+ + G K+ ++ L+ G D+NCG ++L
Sbjct: 229 PMLMDKL--HQWGFAGYFVSDCWAIQDFHKHHGVTKNVTESAALALRTGCDLNCGNTYL- 285
Query: 337 KHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLA 396
+ AA+++ + ++I RA + R+RLGLF+ P F D + SPAH+ ++
Sbjct: 286 -YVLAALEEGLIDAADIRRACIRVLRTRIRLGLFDPEPH---FAACTYDTIASPAHKAVS 341
Query: 397 LQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQ 456
L A+ +VLLKN G+LPL SK ++A+IGPNA+S L GNY G + R +T L+ +Q
Sbjct: 342 LSCAEKSMVLLKND-GILPLDLSKLHAIAVIGPNADSRAALEGNYCGTADRYVTFLEGIQ 400
Query: 457 NYVENTVYYP-GCDTVACSSASIDKAVD-------IAKGADHVVLMMGLDQTQEKEE--- 505
+ V+Y GC ++++ A D A+ +D V+L +GLD T E EE
Sbjct: 401 DAFPGRVHYAQGCHLYKDRTSNLAMADDRYAEALAAAEASDVVILCLGLDATLEGEEGDT 460
Query: 506 ------LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWA 559
D+ DL LP Q +L+ ++ A KPVILVL G ++ + ++L A
Sbjct: 461 GNEFSSGDKADLRLPPPQCKLLEKL-HAVGKPVILVLAAGSALNPEIS-----CNAVLQA 514
Query: 560 GYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFY 619
YPG+ G ALA ++FG +P G+LP+T+Y TD M+ RTYR+
Sbjct: 515 WYPGQCGGQALAHILFGKVSPSGKLPVTFYETAEQLPDFTDYSMQ--------NRTYRYA 566
Query: 620 EGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPEL 673
++PFG GL+Y K + +L + V NQ + + V +L
Sbjct: 567 RNNVLYPFGYGLTYGK-------IVCTELSYENGCARMTVTNQGIRFTEDVVQL 613
>gi|167751044|ref|ZP_02423171.1| hypothetical protein EUBSIR_02029 [Eubacterium siraeum DSM 15702]
gi|167655962|gb|EDS00092.1| glycosyl hydrolase family 3 C-terminal domain protein [Eubacterium
siraeum DSM 15702]
Length = 691
Score = 398 bits (1022), Expect = e-108, Method: Compositional matrix adjust.
Identities = 248/648 (38%), Positives = 361/648 (55%), Gaps = 71/648 (10%)
Query: 50 LPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGT 109
L +RA L L+ +E+ QL APAI + G+P+Y WW+E LHGVA G
Sbjct: 9 LSAYERAAALADTLSTEEQAQQLKYDAPAIEKAGLPSYNWWNEGLHGVARAGT------- 61
Query: 110 IRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTFWAPNIN 162
AT FPQ I AA+FD + YR+G+ I EARA+YN+ G+T WAPNIN
Sbjct: 62 ---ATVFPQAIALAAAFDKDMMYRVGEVISTEARAMYNSAAKHGDTDIYKGLTLWAPNIN 118
Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL 222
IFRDPRWGRG ET GEDP +T + V++V+G+QG+ + L+A+AC KHF +
Sbjct: 119 IFRDPRWGRGHETYGEDPYLTSRLGVNFVKGIQGE-------EEYLRAAACAKHFAVH-- 169
Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
+ + R++FDARV+ +D+ +TY P F++ VK+GR G+M AYNRVNG PSCA L+
Sbjct: 170 -SGPESLRHEFDARVSEKDMEETYLPAFKALVKEGRVEGVMGAYNRVNGEPSCASEKLMG 228
Query: 283 KTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTKA 341
K R+WGF GY SDC A+ + + + LKAG DVNCG ++L H A
Sbjct: 229 KL--REWGFDGYFVSDCWAIRDFHTTHKITDTAPQSAAMALKAGCDVNCGNTYL--HILA 284
Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQ 401
A+++ + + I A + R+RLG + N F + D++ ++ L+L+AA+
Sbjct: 285 ALEEGLITKQNIRTACIHALRTRIRLGQLDDN----EFDDLPFDIIACDGNKALSLEAAE 340
Query: 402 DGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN 461
+VLL N G+LPL KS+ S+A+IGPNA+S LLGNY G RS+T L+ +Q+ +
Sbjct: 341 KSMVLLHND-GILPLDKSRISSIAVIGPNADSRAALLGNYNGTPDRSVTFLEGIQDAFDG 399
Query: 462 TVYYP-GCDT-------VACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE-------- 505
VYY GC +A +AV + AD V+ +GLD T E EE
Sbjct: 400 RVYYAEGCQLFRDRTQGLALPGDRYAEAVAACEAADVTVVCVGLDATLEGEEGDTGNEFA 459
Query: 506 -LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGE 564
D+ DL LP Q+ L+ ++ + KP+I+VL G V+ +++ A YPG+
Sbjct: 460 SGDKPDLRLPEVQRVLLQKLKDTG-KPLIIVLAAGSSVNTECEG-----NALINAWYPGQ 513
Query: 565 AGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFY--EGK 622
G ALAE++FG+ +P G+LP+T+Y + TD M+ RTYRF E
Sbjct: 514 YGGKALAEILFGEVSPSGKLPVTFYKSADMLPDFTDYSMK--------NRTYRFCDDESN 565
Query: 623 EVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSS-TKMVENQDVVHYKS 669
++PFG GL+YS + + N L +N +++ ++ E+ V+ KS
Sbjct: 566 VLYPFGYGLTYSHFECGDISYKDNTLAVNVTNTGSRSAEDVLQVYIKS 613
>gi|389632743|ref|XP_003714024.1| beta-xylosidase [Magnaporthe oryzae 70-15]
gi|351646357|gb|EHA54217.1| beta-xylosidase [Magnaporthe oryzae 70-15]
Length = 847
Score = 397 bits (1021), Expect = e-108, Method: Compositional matrix adjust.
Identities = 267/765 (34%), Positives = 397/765 (51%), Gaps = 62/765 (8%)
Query: 34 DPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEA 93
D N T C ++RA LV + LDEK+ LVN +P PR+G+PAYEWWSEA
Sbjct: 84 DCENGPLSTNIVCDQAATPAERAAGLVDIMELDEKLENLVNKSPGAPRIGLPAYEWWSEA 143
Query: 94 LHGVAGVGKGIFFN----GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG 149
LHGVA G+ FN ATSF I+ +A+FD L + I EARA NAG
Sbjct: 144 LHGVAK-SPGVTFNKSSGAAFSSATSFSNPIVLSAAFDDELVEAVATQISTEARAFSNAG 202
Query: 150 QAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQ 209
A G+ +W PNIN ++DPRWGRG ETPGED L KY + +RG++G K+
Sbjct: 203 LA-GLDWWTPNINPYKDPRWGRGMETPGEDALRISKYVKALLRGLEGSDPTTRKM----- 256
Query: 210 ASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRV 269
A CKH+ A DL+ W G TRY FDA VT+QDL++ Y P F+ C + MCAYN +
Sbjct: 257 -VANCKHYAANDLERWNGVTRYNFDAPVTLQDLSEYYLPAFKQCARDSNVGSFMCAYNAM 315
Query: 270 ---------NGIPSCADRNLLSKTARRQWGF---HGYITSDCDAVSIIYDAEGYAKSPED 317
NG P CA + L++ R WG+ + +ITSDC+AV +++ ++ + E+
Sbjct: 316 SIKGKDLSWNGTPVCASKYLMNDILREHWGWKEHNNWITSDCNAVLHMWNQHHWSDTREE 375
Query: 318 AVVDVLKAGMDVNC--GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPT 375
A AG D C ++ + K A + L E +DRAL L+ +R G F+G
Sbjct: 376 AAGSAYTAGTDTVCEVSNYDKTAVKGAFDRGLLDEDVVDRALKRLYEGLVRAGYFDGPDA 435
Query: 376 MQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLP----KSKSVSLALIGPNA 431
P+ I V +P + LA ++A +G+VL KN +G+LP+ + K ++ALIG
Sbjct: 436 --PYRNITWADVNTPEARKLAHRSAVEGMVLTKN-NGVLPIKLEELQKKGKTVALIGNWV 492
Query: 432 NSAKTLLGNYAGPSCRSITPLQALQ--NYVENTVYYPGCDTVACSSASIDKAVDIAKGAD 489
++ + +LG Y+G + TPL A + N T P + + A++ A AD
Sbjct: 493 DNGEQMLGTYSGIAPFRNTPLAAAKALNLKMVTAGGPVNQSTGSRDSWTRPALNAAIQAD 552
Query: 490 HVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKY 549
V+ G+D + E E+ DR L P Q +L++ ++ A KP ++V L G +D T
Sbjct: 553 VVLYFGGIDLSVEAEDRDRYSLAWPSAQAKLLSDIS-ALGKPTVVVQL-GTMLDDTALLD 610
Query: 550 DRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQ-- 606
++NI +I+WAGYPG+ G A ++I G P GRLP+T YP Y +VPMTDM++RP
Sbjct: 611 NKNISAIIWAGYPGQDGGTAAFDIITGKTAPSGRLPVTQYPAKYANQVPMTDMEVRPSKD 670
Query: 607 ---ATSGNPGRTYRFYEGKEVFPFGCGLSYSKY-SYKFKAVSQNKLYLNQSSSTKMVENQ 662
+ NPGRTYR+Y+ + V PFG GL ++ + + + S + S K ++
Sbjct: 671 TKGGAASNPGRTYRWYD-EAVHPFGFGLHFTNFTTSVAVSSSSAISTSDLESGCKSEKHI 729
Query: 663 DVVHYKSVPELGTEFCETRKFLVTIGVKNHGE-MAGKHPVLLFVKPARRGNGRPIKQLVG 721
D + S E + V N G+ + L FV+ P+K LV
Sbjct: 730 DKCSFPSSLE--------------VSVTNDGKSTTSSYAALAFVRGEYGPKPYPLKTLVA 775
Query: 722 FQSV-ILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVV 765
+ + + + ++ EL+ L+R E+G +V+ G + ++V
Sbjct: 776 YGKLHDIAPGQTKKVKLELT-LGDLARTAENGDLVLYPGKYEVLV 819
>gi|413925161|gb|AFW65093.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 323
Score = 397 bits (1019), Expect = e-107, Method: Compositional matrix adjust.
Identities = 182/297 (61%), Positives = 226/297 (76%), Gaps = 6/297 (2%)
Query: 29 PPFSCDPSNP-STETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAY 87
PPFSC PS+ ++E FC TL +QRA DLVSRLT EKI+QL + AP +PRLG+P Y
Sbjct: 29 PPFSCGPSSAEASEGLAFCDVTLAPAQRAADLVSRLTAAEKIAQLGDQAPGVPRLGVPGY 88
Query: 88 EWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN 147
+WW+EALHG+A GKG+ F+ +R ATSFPQV+LTAA+FD LW RIGQAIG EARAL+N
Sbjct: 89 KWWNEALHGLATSGKGLHFDAAVRAATSFPQVLLTAAAFDDDLWLRIGQAIGREARALFN 148
Query: 148 AGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGK 207
GQA G+T W+PN+NIFRDPRWGRGQETPGEDP V +YAV++VRG+QG++ +
Sbjct: 149 VGQAEGLTIWSPNVNIFRDPRWGRGQETPGEDPAVASRYAVAFVRGIQGNSSS-----SL 203
Query: 208 LQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYN 267
LQ SACCKH TAYDL++W G RY F ARVT QDL DT+ PPF SCV + +AS +MCAY
Sbjct: 204 LQTSACCKHATAYDLEDWNGVARYSFVARVTEQDLEDTFNPPFRSCVVEAKASCVMCAYT 263
Query: 268 RVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLK 324
+NG+P+CA+ +LL+ T R WG GY+ SDCDAV+I+ DA+ YA +PEDAV LK
Sbjct: 264 AINGVPACANSDLLTGTVRGDWGLDGYVASDCDAVAIMRDAQRYAPTPEDAVAVSLK 320
>gi|291530120|emb|CBK95705.1| Beta-glucosidase-related glycosidases [Eubacterium siraeum 70/3]
Length = 689
Score = 396 bits (1018), Expect = e-107, Method: Compositional matrix adjust.
Identities = 243/631 (38%), Positives = 353/631 (55%), Gaps = 68/631 (10%)
Query: 50 LPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGT 109
L +RA L L+ +E+ QL APAI + G+P+Y WW+E LHGVA G
Sbjct: 9 LSAYERAAALADTLSTEEQAQQLKYDAPAIEKAGLPSYNWWNEGLHGVARAGT------- 61
Query: 110 IRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTFWAPNIN 162
AT FPQ I AA+FD + R+G+ + EARA+YN+ G+T WAPNIN
Sbjct: 62 ---ATVFPQAIALAAAFDKDMMCRVGEVVSTEARAMYNSAAKHGDTDIYKGLTLWAPNIN 118
Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL 222
IFRDPRWGRG ET GEDP +T + V++V+G+QG+ + L+A+AC KHF +
Sbjct: 119 IFRDPRWGRGHETYGEDPYLTSRLGVNFVKGIQGE-------EKYLRAAACAKHFAVH-- 169
Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
+ + R++FDARV+ +DL +TY P F++ VK+GR G+M AYNRVNG PSCA L+
Sbjct: 170 -SGPESLRHEFDARVSEKDLEETYLPAFKALVKEGRVEGVMGAYNRVNGEPSCASEKLMG 228
Query: 283 KTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTKA 341
K R+WGF GY SDC A+ + + + LKAG DVNCG ++L H A
Sbjct: 229 KL--REWGFDGYFVSDCGAIRDFHTNHKITDTAPQSAAMALKAGCDVNCGNTYL--HILA 284
Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQ 401
A+++ + + +I A + R+RLG + N F + D++ ++ L+L+AA+
Sbjct: 285 ALEEGLITKQDIRTACIHALRTRIRLGQLDDN----EFDDLPFDIIACDGNKALSLEAAE 340
Query: 402 DGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN 461
+VLL N G+LPL KS+ S+A+IGPNA+S LLGNY G RS+T L+ +Q+ +
Sbjct: 341 KSMVLLHND-GILPLDKSRISSIAVIGPNADSRAALLGNYEGTPDRSVTFLEGIQDAFDG 399
Query: 462 TVYY-PGCDT-------VACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE-------L 506
VYY GC +A +AV + AD V+ +GLD T E EE
Sbjct: 400 RVYYAEGCQLFRDRTQGLALPGDRYAEAVAACEAADVTVVCVGLDSTLEGEEGDTENKSG 459
Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
D+ DL LP Q+ L+ ++ + KP+I+VL G V+ +++ A YPG+ G
Sbjct: 460 DKPDLRLPEVQRVLLQKLKDTG-KPLIIVLAAGSSVNTECEG-----NALINAWYPGQYG 513
Query: 567 AVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFY--EGKEV 624
ALAE++FG+ +P G+LP+T+Y + TD M+ RTYRF E +
Sbjct: 514 GKALAEILFGEVSPSGKLPVTFYKSADMLPDFTDYSMK--------NRTYRFCDDESNVL 565
Query: 625 FPFGCGLSYSKYSYKFKAVSQNKLYLNQSSS 655
+PFG GL+YS + + N L +N +++
Sbjct: 566 YPFGYGLTYSHFECGDISYKDNTLAVNVTNT 596
>gi|336261464|ref|XP_003345521.1| hypothetical protein SMAC_07509 [Sordaria macrospora k-hell]
gi|380088197|emb|CCC13872.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 762
Score = 395 bits (1014), Expect = e-107, Method: Compositional matrix adjust.
Identities = 278/778 (35%), Positives = 396/778 (50%), Gaps = 71/778 (9%)
Query: 16 CFTSLLTRVDSTQPPFSCDPS--NPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLV 73
C +L+ +D PF P N + C TL QRA LV+ +T +EK+ LV
Sbjct: 20 CSAALVYAIDL---PFQTYPDCVNGPLASLKVCDATLSPPQRAAALVAAMTTEEKLQNLV 76
Query: 74 NSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFF---NGTIRGATSFPQVILTAASFDSYL 130
+ + PR+G+PAY WWSEALHGVA G F NGT +TSFP +L AA+FD L
Sbjct: 77 SKSKGAPRIGLPAYNWWSEALHGVA-YAPGTQFRSGNGTFNSSTSFPMPLLMAATFDDEL 135
Query: 131 WYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSY 190
R+G+ IG+E RA NAG + G +W PN+N F+DPRWGRG ETPGED L +YA S
Sbjct: 136 IERVGEVIGIEGRAFGNAGFS-GFDYWTPNVNPFKDPRWGRGSETPGEDILRIKRYAASM 194
Query: 191 VRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPF 250
+RG++G + + + A CKH+ A D ++W G+TR+ F+A+VT+QDLA+ Y PF
Sbjct: 195 IRGLEGPV-----RERERRIVATCKHYAANDFEDWNGSTRHDFNAKVTLQDLAEYYLSPF 249
Query: 251 ESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHG---YITSDCDAVSIIYD 307
+ C + + IMC+YN VNG+P+CA+ L+ R W + YITSDC+AV I
Sbjct: 250 QQCARDSKVGSIMCSYNAVNGVPACANTYLMQTILRDHWNWTAPGNYITSDCEAVLDISA 309
Query: 308 AEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRL 367
YAK+ + +AG+D +C A Q L +S +DRAL L+ +++
Sbjct: 310 NHHYAKTNAEGTALAFEAGIDSSCEYEGSSDILGAWTQGLLKQSTVDRALRRLYEGLVQV 369
Query: 368 GLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKS---VSL 424
G F+GN + +G + V P Q +ALQAA +GIVLLKN LPL K+ + L
Sbjct: 370 GYFDGN--RSEYASLGWNHVNRPKSQEVALQAAVEGIVLLKNDK-TLPLGVKKNGPKLKL 426
Query: 425 ALIGPNANSAKTLLGNYAGPSCRSITPLQALQ--NYVENTVYYPGCDTVACSSASIDKAV 482
A+IG AN KTL G Y+G +P+ A Q + T P A+
Sbjct: 427 AMIGFWANDPKTLSGGYSGTPAFEHSPVYATQAMGFKVTTAGGPVLQNSTSKDTWTQAAL 486
Query: 483 DIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPV 542
AK A++++ G D + E DR + P Q +LIT +++ KP+++V + G +
Sbjct: 487 AAAKDANYILYFGGQDTSAAGETKDRTTINWPEAQLQLITDLSKLG-KPLVVVQM-GDQL 544
Query: 543 DITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDM 601
D T + I SILWA +P P GRLP+T Y +Y VPMTDM
Sbjct: 545 DNTPLLASKAINSILWANWP----------------VPAGRLPVTQYHANYTAAVPMTDM 588
Query: 602 KMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVEN 661
+RP + PGRTYR+Y V PFG GL Y+ ++K K V + + S
Sbjct: 589 TLRP--SDKLPGRTYRWYP-TPVQPFGFGLHYT--TFKTKIVRLPRFAIKDLLSRCGNAY 643
Query: 662 QDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVG 721
D +P L E V N G+ + + VL F+K PIK LV
Sbjct: 644 PDTC---GLPPLKVE------------VTNTGKRSSDYVVLAFLKGDVGPKPYPIKTLVS 688
Query: 722 F---QSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
+ + + K A + + L ++R E G V+ GT+ ++V + + FV
Sbjct: 689 YTRLRDLSPGRKTTAHLDWTLG---DIARYDEQGNTVLYPGTYTVIVDEPAQASAKFV 743
>gi|325192664|emb|CCA27085.1| unnamed protein product [Albugo laibachii Nc14]
Length = 2278
Score = 394 bits (1013), Expect = e-107, Method: Compositional matrix adjust.
Identities = 263/750 (35%), Positives = 390/750 (52%), Gaps = 77/750 (10%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAP---AIPRLGIPAYEWWSEALHGVAG 99
FPFC ++L + R DL+ RL LDEK+ L A +IPRLG+P Y W + +HGV
Sbjct: 34 FPFCNSSLSLDLRVEDLLQRLQLDEKVRMLTARASTHGSIPRLGVPEYNWGANCVHGVQS 93
Query: 100 VGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARAL--------YNAGQA 151
GT ATSFP + A FD Y++ Q IG E RAL Y G
Sbjct: 94 TC------GT-HCATSFPNPVNLGAIFDPNEIYKMAQVIGKELRALRLEGARENYARGPH 146
Query: 152 IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQAS 211
IG+ W+PNINI RDPRWGR ETP EDP V KY V+Y +G+Q G+ LQA
Sbjct: 147 IGLDCWSPNININRDPRWGRAMETPSEDPYVNAKYGVAYTKGLQE-----GQDSRFLQAV 201
Query: 212 ACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
KH+ AY +N+ GT R +FDA V+ D ADTY P FE+ V G+A GIMC+YN +NG
Sbjct: 202 VTLKHYLAYSYENYGGTDRTQFDAIVSAYDFADTYFPAFEASVVDGKAKGIMCSYNSLNG 261
Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC 331
IP+CA++ L++ R F GYITSD A+ I+D Y K+ +A +++G+D+
Sbjct: 262 IPTCANK-WLNQLLRDDLEFDGYITSDTGAIQGIFDGHKYTKTLCEATKIAMESGVDICS 320
Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
G+ K S ID A+ +R +LGLF+ FG DV + +
Sbjct: 321 GNAYWNCLKQLANSTNFSAS-IDEAIRRTLKLRFQLGLFDAIGDQPHFGP--EDVRTAKS 377
Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSC----- 446
Q L+L A+ IVLL+N LPL + +A+IGP++ + + ++GNY G C
Sbjct: 378 LQ-LSLDLARKSIVLLQNHGNTLPL--RLGLRIAVIGPHSMTRRGIMGNYYGQLCHGDYD 434
Query: 447 --RSI-TPLQALQ--NYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQ 501
R I +PL+A+Q N NT + GC S+A D A+ + AD VL +G+D +
Sbjct: 435 EVRCIQSPLEAIQSVNGRNNTHHVNGCGINDTSTAEFDDALQAVRTADVAVLFLGIDISI 494
Query: 502 EKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGY 561
E+E DR ++ +P Q EL+ + A KP ++VL GG + I K S+L A Y
Sbjct: 495 ERESKDRDNIDVPHIQLELL-KAIRVAGKPTVVVLFNGGILGI--EKLILYADSVLEAFY 551
Query: 562 PGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEG 621
PG GA A+AE++FG NP G+LP+T Y ++I D+ M+ + + PGR+YR+Y
Sbjct: 552 PGFFGAQAIAEILFGSINPSGKLPVTMYRSNFI----NDVDMKSMSMTLYPGRSYRYYTE 607
Query: 622 KEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETR 681
V+ FG GLSY+ +S +++ D +++ + T +
Sbjct: 608 VPVYSFGWGLSYTTFS---------------------IQSIDSHDTRAMNHVLTAQPKMY 646
Query: 682 KFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPI----KQLVGFQSVILNAKEKAEIVF 737
+ L+T N+G+ G+ + F +P P+ +QL + V L+ + E+
Sbjct: 647 RILIT----NNGKYYGEEVLFAFFRPLDIHATGPVESLQQQLFNYTRVRLDPGDMREVPL 702
Query: 738 ELSPCESLSRAREDGLMVIEEGTHFLVVGD 767
+ E+L+ +G + + EG + L++ +
Sbjct: 703 HVKD-ENLALHDRNGNLCVFEGFYELIISN 731
>gi|358380569|gb|EHK18247.1| glycoside hydrolase family 3 protein, partial [Trichoderma virens
Gv29-8]
Length = 722
Score = 394 bits (1012), Expect = e-107, Method: Compositional matrix adjust.
Identities = 255/723 (35%), Positives = 384/723 (53%), Gaps = 53/723 (7%)
Query: 63 LTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI-------RGATS 115
LTLDEK + LVN+AP + RLG+P YEW +EALHG+AGV G N T +T
Sbjct: 12 LTLDEKAANLVNNAPGVKRLGLPPYEWRNEALHGLAGVSPGQGINSTFTQGNVAFNSSTQ 71
Query: 116 FPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQET 175
FP I+ A+FD +L + I A+ EARA N +A G+ +WAPNIN +RDPRWGRGQET
Sbjct: 72 FPSPIVLGAAFDDHLVHDIATAVSTEARAFSNHLKA-GLDYWAPNINPYRDPRWGRGQET 130
Query: 176 PGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDA 235
PGEDP +YA +YV G++G G K + + CKHF YD+++ G R ++A
Sbjct: 131 PGEDPYHVAQYAYNYVVGLKG-----GVGPAKSKVVSTCKHFAGYDIEDSDGVVRGSYNA 185
Query: 236 RVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHG-- 293
++ QDLA+ Y P F SC + + +MC+YN VNG PSCA+ +L R WG+
Sbjct: 186 IISTQDLAEYYLPSFRSCFRDAKTGAVMCSYNAVNGHPSCANSYMLDTVLRDHWGWGSSA 245
Query: 294 -YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESE 352
++T DC AV +++ +S V + G D++CG+ + +AV+ E++
Sbjct: 246 HWVTGDCGAVDGVFNQHHVGQSAAQGVAFAINNGTDLDCGTAYASNIASAVQNNYTTEAQ 305
Query: 353 IDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHG 412
+D+AL L+S + LG F+ P Q + +G V +P+ Q LA A +GI
Sbjct: 306 LDQALSRLYSSLIVLGYFD-PPEGQEYRTLGVSDVNTPSTQKLAYTALVEGI-------N 357
Query: 413 LLPLPKSKSVSLALIGPNANSAK-TLLGNYAGPSCRSITPLQALQN--YVENTVYYPGCD 469
+LP+ + ++ +GP AN+A ++ GNY G + P+ + Y N Y G
Sbjct: 358 ILPI-RPMGQTVLFVGPWANNASVSMFGNYNGVAPYKTIPVPTANSSAYNWNVTYSQGLQ 416
Query: 470 TVACSSAS-IDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAA 528
V + S AV A+ AD VV + G+D+ E E DR + PG Q LI ++ AA
Sbjct: 417 YVLSNDTSQFAAAVSAAQEADVVVYIGGIDEQVEAEAHDRTSIDWPGAQLNLIKQL--AA 474
Query: 529 KKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTW 588
KPV++V + GG VD + ++N+ +LW GYPG+ L +++ G P GRLP+T
Sbjct: 475 VKPVVVVQVGGGQVDDSSLLQNKNVKGLLWMGYPGQEFGSGLIDILSGASAPAGRLPVTQ 534
Query: 589 YPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNK 647
YP +YI +VPMTD +RP +S NPGRTYR+Y G V PFG G+ Y+K++ +K +
Sbjct: 535 YPANYITQVPMTDQSLRP--SSSNPGRTYRWYNGS-VIPFGTGIHYTKFNISWKTGGSGR 591
Query: 648 LYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKP 707
+ + + +D+ + + I V+N G + LLFVK
Sbjct: 592 GTYDTADFINAEDPKDLAEFD---------------VFQINVENVGSTTSDYVALLFVKS 636
Query: 708 ARRG-NGRPIKQLVGF-QSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVV 765
+ G P+K LV + ++ E +I ++ ++R G +V+ G + L +
Sbjct: 637 SDSGPQPYPLKTLVSYARAHGTQPGETTKIDLRVN-VGQIARNDSSGNLVLYPGAYTLEI 695
Query: 766 GDE 768
E
Sbjct: 696 DVE 698
>gi|413919686|gb|AFW59618.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 475
Score = 394 bits (1012), Expect = e-107, Method: Compositional matrix adjust.
Identities = 214/464 (46%), Positives = 291/464 (62%), Gaps = 11/464 (2%)
Query: 315 PEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNP 374
P + V AG+D+NCG+FL +HT AAV+ KL ES++DRA+ N MRLG F+G+P
Sbjct: 21 PVFTCIAVAAAGLDLNCGTFLAQHTVAAVQAGKLSESDVDRAVTNNLVTLMRLGFFDGDP 80
Query: 375 TMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSA 434
PFG +G VC+P++Q LA +AA+ GIVLLKN+ G LPL + S+A+IGPNAN++
Sbjct: 81 RELPFGNLGPSDVCTPSNQELAREAARQGIVLLKNT-GKLPLSATSIKSMAVIGPNANAS 139
Query: 435 KTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASI--DKAVDIAKGADHVV 492
T++GNY G C+ TPLQ L V TVY PGC V CS S+ D A A AD V
Sbjct: 140 FTMIGNYEGTPCKYTTPLQGLGANVA-TVYQPGCTNVGCSGNSLQLDAATKAAASADVTV 198
Query: 493 LMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRN 552
L++G DQ+ E+E LDR L+LPG+Q +L++ VA A+ P ILV++ GGP DI+FAK
Sbjct: 199 LVVGADQSIERESLDRTSLLLPGQQPQLVSAVANASSGPCILVVMSGGPFDISFAKSSDK 258
Query: 553 IGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNP 612
I +ILW GYPGEAG A+A+V+FG HNP GRLP+TWYP+ + KVPMTDM+MRP ++G P
Sbjct: 259 IAAILWVGYPGEAGGAAIADVLFGYHNPSGRLPVTWYPESFTKVPMTDMRMRPDPSTGYP 318
Query: 613 GRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPE 672
GRTYRFY G V+ FG GLSY+ +++ + + +L L + + Q SV
Sbjct: 319 GRTYRFYTGDTVYAFGDGLSYTSFAHHLVSAPK-QLALQLAEGHACLTEQ----CPSVEA 373
Query: 673 LGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEK 732
G CE F V + V+N GE +G H V LF P N P K L+GF+ V L +
Sbjct: 374 EGAH-CEGLAFDVHLRVRNAGERSGGHTVFLFSSPPAVHNA-PAKHLLGFEKVSLEPGQA 431
Query: 733 AEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
+ F++ C+ LS E G + G+H L VGD ++ +++ V
Sbjct: 432 GVVAFKVDVCKDLSVVDELGNRKVALGSHTLHVGDLKHTLNLGV 475
>gi|150019484|ref|YP_001311738.1| glycoside hydrolase family protein [Clostridium beijerinckii NCIMB
8052]
gi|149905949|gb|ABR36782.1| glycoside hydrolase, family 3 domain protein [Clostridium
beijerinckii NCIMB 8052]
Length = 709
Score = 394 bits (1012), Expect = e-106, Method: Compositional matrix adjust.
Identities = 256/740 (34%), Positives = 383/740 (51%), Gaps = 95/740 (12%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
++A++LV ++TL+EK QL + A+ RL +P Y WW+E LHGVA G A
Sbjct: 14 EKAKELVGKMTLEEKAEQLTYKSSAVKRLNVPRYNWWNEGLHGVARAGT----------A 63
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-------GMTFWAPNINIFRD 166
T FPQ I AA FD L I + I E RA YN G+TFW+PN+NIFRD
Sbjct: 64 TVFPQAIGLAAMFDDELLNYIAKVISTEGRAKYNENSKKDDRDIYKGITFWSPNVNIFRD 123
Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
PRWGRG ET GEDP +T + V++V+G+QG+ GK L+A+AC KHF + +
Sbjct: 124 PRWGRGHETYGEDPYLTSRLGVAFVKGLQGE----GKY---LKAAACAKHFAVH---SGP 173
Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
R++FDA V+ +DL +TY P FE+CVK+G +M AYNR NG P C + LL R
Sbjct: 174 EGLRHEFDAVVSKKDLYETYLPAFEACVKEGDVEAVMGAYNRTNGEPCCGSKTLLRDILR 233
Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGS-FLQKHTKAAVKQ 345
+W F G++ SDC A++ + + ++ +K G D+NCG+ +LQ A K+
Sbjct: 234 GKWNFKGHVVSDCWAIADFHLHHRVTSTATESAALAMKNGCDLNCGNVYLQ--LLLAYKE 291
Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
+ E +I A L + R+RLG+F+ + KI ++ H L+L+AA++ +V
Sbjct: 292 GLVTEEDITTAAERLMATRIRLGMFDEEC---EYNKIPYELNDCKEHNELSLKAARNSMV 348
Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV-- 463
LLKN +G+LPL K+ S+A+IGPNA+S L GNY+G + R IT L+ + V V
Sbjct: 349 LLKN-NGILPLNKNNLKSIAVIGPNADSQIMLKGNYSGTASRYITVLEGIHEAVGEDVRV 407
Query: 464 -YYPGC-------DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---------L 506
Y GC + +A + + +A+ IA+ +D +L +GLD T E E+
Sbjct: 408 YYSEGCHLFRDRVEELAEPNDRLKEAISIAERSDVAILCLGLDSTIEGEQGDAGNSEGAG 467
Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
D+ L LPGRQQEL+ ++ E PVILV+ G +TF + +IL A YPG G
Sbjct: 468 DKASLNLPGRQQELLEKIIETG-TPVILVIGAGSA--LTFNNAEDKCSAILDAWYPGSRG 524
Query: 567 AVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFP 626
A+A++IFG +P G+LP+T+Y D M+ RTYR+ + ++P
Sbjct: 525 GRAVADLIFGKCSPSGKLPITFYRNTKDLPEFIDYSMKD--------RTYRYMSCESLYP 576
Query: 627 FGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVT 686
FG GL+Y S+ K+ E +H VP++ ++F + V+
Sbjct: 577 FGYGLTY--------------------STVKLSE----LH---VPDVKSDFEDVE---VS 606
Query: 687 IGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLS 746
+ + N G + + ++K L GF+ V L E ++ S
Sbjct: 607 VKITNTGNFDIEEVIQCYIKDLESKYAVRNHSLAGFKRVRLKIGESKIAKMKIKKS-SFE 665
Query: 747 RAREDGLMVIEEGTHFLVVG 766
+DG +++ L VG
Sbjct: 666 VVNDDGERILDSKRFKLFVG 685
>gi|429738050|ref|ZP_19271875.1| glycosyl hydrolase family 3 protein [Prevotella saccharolytica
F0055]
gi|429161155|gb|EKY03583.1| glycosyl hydrolase family 3 protein [Prevotella saccharolytica
F0055]
Length = 722
Score = 394 bits (1012), Expect = e-106, Method: Compositional matrix adjust.
Identities = 256/728 (35%), Positives = 384/728 (52%), Gaps = 85/728 (11%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
Q+A+ ++S+LTLDEKISQL A I RLGI Y W +EALHGV G+ A
Sbjct: 33 QKAKSIISQLTLDEKISQLTQDAKGIDRLGIKPYYWLNEALHGVGRDGR----------A 82
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALY-------NAGQAIGMTFWAPNINIFRD 166
T FPQ I A+FD + ++IG AI E RA + N G+TFWAPN+NIFRD
Sbjct: 83 TVFPQPINLGATFDPKIVHQIGDAIATEGRAKFIVAQRQKNYSMYAGLTFWAPNVNIFRD 142
Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
PRWGRG ET GEDP +TG ++V+G+QGD L+A+AC KHF + +
Sbjct: 143 PRWGRGMETYGEDPFLTGTLGTAFVKGMQGDD------PFYLKAAACGKHFAVH---SGP 193
Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
TR+ + T +DL +TY P F+ V++G+ IM AY R+ G + LL+ R
Sbjct: 194 ERTRHTANVEPTKRDLYETYLPAFKMLVQKGKVESIMGAYQRLYGESCSGSKYLLTDILR 253
Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQK 346
+ WGF G++ SDC AV+ +Y+ KS +AV +KAG+++ CG+ + + K A++QK
Sbjct: 254 KDWGFKGHVVSDCGAVTDMYEGHKLVKSEAEAVAFAIKAGLNLECGNSM-RTMKDAIQQK 312
Query: 347 KLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVL 406
+ E ++D+AL L R++LG+ + P+ + V+ S A++ +A QAA++ +VL
Sbjct: 313 LITEKDLDKALLPLMMTRLKLGILQPDAAC-PYNEFPESVIGSEANRKIAEQAAEESMVL 371
Query: 407 LKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV--- 463
LKN +G+LP+ K +L + GP A A L+GNY G S R T L+ + V N
Sbjct: 372 LKN-NGVLPIAKDIR-TLFVTGPGATDAYYLMGNYFGLSNRYSTYLEGIVGKVSNGTSVN 429
Query: 464 YYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELD---------RVDLVLP 514
Y G V + ++ +V ++GA+ +L+MG E EE D RV+L LP
Sbjct: 430 YKQGFMQVFKNLNDVNWSVSESRGAEVSILIMGNSGNTEGEEGDAIASAERGDRVNLRLP 489
Query: 515 GRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVI 574
Q E + V++ +++VL G P+D+ + +++ A YPG+ G VALA ++
Sbjct: 490 DSQMEYLREVSKDRTNKLVVVLTGGSPIDV--KEITELADAVVMAWYPGQEGGVALANLL 547
Query: 575 FGDHNPGGRLPMTWYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSY 633
FGD N GRLP+T +P+ ++P D M+ GRTY++ ++PFG GLSY
Sbjct: 548 FGDANFSGRLPVT-FPESADRLPAFDDYSMK--------GRTYKYMTDNILYPFGYGLSY 598
Query: 634 SKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHG 693
SK +Y AV TKM +T V + V N+G
Sbjct: 599 SKVTYSNAAV------------TKMP------------------TKTTPMTVYVDVTNNG 628
Query: 694 EMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGL 753
+M V +++ GN PI+ L+GF+ V + F++ P E L + DG
Sbjct: 629 DMPVDEVVQVYLSTPGAGNTSPIESLIGFKRVKIYPHITVTKDFQI-PMELLETVQADGT 687
Query: 754 MVIEEGTH 761
+ +G +
Sbjct: 688 SKLLKGEY 695
>gi|449299051|gb|EMC95065.1| glycoside hydrolase family 3 protein [Baudoinia compniacensis UAMH
10762]
Length = 849
Score = 394 bits (1011), Expect = e-106, Method: Compositional matrix adjust.
Identities = 236/608 (38%), Positives = 333/608 (54%), Gaps = 21/608 (3%)
Query: 46 CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
C T QRA +++ + + EK++ L++ + RLG+P YEWWSEALHGVAG G+
Sbjct: 43 CDTNATPYQRASAIINAMNITEKLANLLDVSYGSARLGLPPYEWWSEALHGVAG-SPGVN 101
Query: 106 F--NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
F +G ATSFP I +++FD I I EARA NA + G+ ++ PNIN
Sbjct: 102 FTSSGNYSYATSFPMPITFSSAFDDPSVQNIASVISTEARAYSNAARG-GLDYFTPNINP 160
Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG--DTFNGGKLKGKLQASACCKHFTAYD 221
F+DPRWGRG ETPGEDPL Y + + G++G D + G + A CKHF YD
Sbjct: 161 FKDPRWGRGSETPGEDPLRIQGYVKNLLIGLEGTDDGYFNTSHSGYKKMIATCKHFAGYD 220
Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
L++W G RY +DA +T QDLA+ Y PPF++C + + IMC+YN VN +P+CA+ L
Sbjct: 221 LEDWDGYIRYGYDAEITTQDLAEYYLPPFQTCARDQNVASIMCSYNSVNSVPACANSYLQ 280
Query: 282 SKTARRQWGF---HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC---GSFL 335
R WG+ + YITSDC+A+S IY Y+ + A L GMD C + +
Sbjct: 281 ETILREHWGWTIDNNYITSDCNAISDIYYNHNYSVNNAAAAGLSLSNGMDTACIVANTGV 340
Query: 336 QKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVL 395
+ + E+ I AL + + G F+ + P+ IG V +PA Q L
Sbjct: 341 MTDVNGSYYGGYVTEATITTALIRQYEALVIAGYFDP-ASSNPYRSIGWSSVNTPAAQTL 399
Query: 396 ALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL 455
A QAA +G LLKN+ GLLP + +A+IG AN + G Y+GP+ +PL A
Sbjct: 400 ARQAATEGTTLLKNT-GLLPYKFTSQTKVAMIGMWANGTSQMQGGYSGPAPYLHSPLYAA 458
Query: 456 QNYVENTVYYPG-CDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLP 514
+ Y G + +S A A+ AD ++ G+D + E E +DR + P
Sbjct: 459 SQLGLSYNYANGPINQTTLTSNYSQNATAAAQNADVILFFGGIDWSVEAEAMDRYQIAWP 518
Query: 515 GRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVI 574
G QQ LI ++A A KP+I VL G +D T + NI +++W GYPG+ G VA +++
Sbjct: 519 GAQQALIAQLA-ALGKPMI-VLQMGSMLDATPILSNNNISALVWVGYPGQDGGVAAFDIL 576
Query: 575 FGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSY 633
G P GRLP+T YP DY+ +VPMT+M +RP GNPGRTY++Y V PF GL Y
Sbjct: 577 TGAVAPAGRLPVTMYPADYVNQVPMTNMSLRPG--PGNPGRTYKWYN-NAVLPFAYGLHY 633
Query: 634 SKYSYKFK 641
+ + F
Sbjct: 634 TTFKATFN 641
>gi|326789672|ref|YP_004307493.1| beta-glucosidase [Clostridium lentocellum DSM 5427]
gi|326540436|gb|ADZ82295.1| Beta-glucosidase [Clostridium lentocellum DSM 5427]
Length = 704
Score = 393 bits (1009), Expect = e-106, Method: Compositional matrix adjust.
Identities = 253/739 (34%), Positives = 382/739 (51%), Gaps = 92/739 (12%)
Query: 55 RARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGAT 114
+A LV+++ L EK S L +PAI RLG+P Y WWSEALHGVA G AT
Sbjct: 8 KAGQLVAQMDLLEKASMLRYDSPAIKRLGVPTYNWWSEALHGVARAGV----------AT 57
Query: 115 SFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-------GMTFWAPNINIFRDP 167
FPQ I AA FD Y I I EARA YN GMT WAPNINIFRDP
Sbjct: 58 VFPQAIGMAAMFDEEYLYEIADIIATEARAKYNEFAKKEDRDIYKGMTLWAPNINIFRDP 117
Query: 168 RWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKG 227
RWGRG ET GEDP +T + V+++ G+QGD + +A+AC KHF + +
Sbjct: 118 RWGRGHETYGEDPYLTSRLGVAFIHGLQGDENHH-----YWKAAACAKHFAVH---SGPE 169
Query: 228 TTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARR 287
R+ FDA V+ +DL +TY P FE+ V +G+ +G+M AYNRVNG P+C + LL +
Sbjct: 170 EERHHFDAVVSKKDLYETYLPAFEAAVTKGKVAGMMGAYNRVNGEPACGSKVLLQDILKE 229
Query: 288 QWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTKAAVKQK 346
+WGF GY+ SDC A+ + + ++ + G +NCG ++L H A K+
Sbjct: 230 EWGFDGYVVSDCWAIRDFHTEHMVTHTATESAALAINNGCQLNCGNTYL--HMLQAYKEG 287
Query: 347 KLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVL 406
+ E I ++ L ++RM+LGLF+ N + KI +V H+ +AL A+ +VL
Sbjct: 288 LVTEETITKSAQKLMAIRMKLGLFDKNC---EYNKIPYEVNDCKVHRDIALDVARRSMVL 344
Query: 407 LKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENT--VY 464
LKN+ G+LPL ++ ++ +IGP ANS L GNY G + R T L+ +Q+YV + VY
Sbjct: 345 LKNN-GILPLNLKQTKAIGVIGPTANSRTVLQGNYFGTASRYTTFLEGIQDYVGDAARVY 403
Query: 465 Y-PGCDT-------VACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---------LD 507
Y GC ++ + + +A+ +A+ +D V+L +GLD + E E+ D
Sbjct: 404 YAEGCHLFKNSISGLSWENDRLSEALIVAEQSDVVILCLGLDASIEGEQGDTGNAFAAGD 463
Query: 508 RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGA 567
+ DL L GRQQ L+ V + KP IL+L G + I A+ +IL YPG++G
Sbjct: 464 KSDLNLIGRQQLLLEEVLKIG-KPTILILSSGSAMAIHTAQ--EYCEAILETWYPGQSGG 520
Query: 568 VALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPF 627
ALA+++FG+++P G+LP+T+Y D M GRTYR+ + + ++PF
Sbjct: 521 KALAQLLFGEYSPSGKLPITFYKTTEELPDFRDYSM--------AGRTYRYMKNEALYPF 572
Query: 628 GCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTI 687
G GL+Y+K K + + + +EN+ + + + +
Sbjct: 573 GYGLNYAKVEVKDAVIKE-----------RNIENEII------------------YEIQL 603
Query: 688 GVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSR 747
V N E+ V +++K P L ++S+ L A ++ +I ++ +
Sbjct: 604 QVTNQSEVCTYDVVQVYIKDMESRWAVPNYSLCAYKSIYLAAYDEPQITLQIKQS-AFEI 662
Query: 748 AREDGLMVIEEGTHFLVVG 766
E+G I+ L +G
Sbjct: 663 VDEEGKRYIDSHHFKLFIG 681
>gi|333995841|ref|YP_004528454.1| beta-glucosidase [Treponema azotonutricium ZAS-9]
gi|333737309|gb|AEF83258.1| periplasmic beta-glucosidase (Gentiobiase)(Cellobiase)
(Beta-D-glucoside glucohydrolase) [Treponema
azotonutricium ZAS-9]
Length = 706
Score = 393 bits (1009), Expect = e-106, Method: Compositional matrix adjust.
Identities = 256/752 (34%), Positives = 376/752 (50%), Gaps = 98/752 (13%)
Query: 55 RARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGAT 114
R ++++S++TL+EK+SQL APA+ GIP Y WW+E LHGVA G AT
Sbjct: 6 RIKEMISKMTLEEKVSQLSYDAPAVESAGIPKYNWWNECLHGVARAGL----------AT 55
Query: 115 SFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG-------QAIGMTFWAPNINIFRDP 167
FPQ I AA+FD + AI E RA YN Q G+TFW PN+NIFRDP
Sbjct: 56 VFPQAIALAATFDEAFIRSVADAISDEGRAKYNEAVKRGNRSQYYGLTFWTPNVNIFRDP 115
Query: 168 RWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKG 227
RWGRGQET GEDP +TG+ +++++G+QGD L+ +AC KH+ + +
Sbjct: 116 RWGRGQETYGEDPYLTGRIGLAFMKGLQGDDTE------HLKVAACAKHYAVH---SGPE 166
Query: 228 TTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARR 287
R+ FDA V+ +DL +TY P F+ V+ G +M AYNR G P LL + R
Sbjct: 167 KLRHTFDAVVSKKDLFETYLPAFKLLVENG-VEAVMGAYNRTLGEPCGGSTYLLKEILRG 225
Query: 288 QWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKK 347
+WGF G++TSDC A+ ++ KSPE++ L AG D+NCG + + K+
Sbjct: 226 RWGFKGHVTSDCWAIRDFHENHKVTKSPEESAAMALNAGCDLNCGC-TYPYLTVSHKKGL 284
Query: 348 LPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLL 407
+ + ID AL L R +LGLF+ P P+ +G D+V H+ LAL+AAQ IVLL
Sbjct: 285 VTDETIDTALTRLLRTRFKLGLFD-PPEQDPYRNLGNDIVGCEKHRNLALEAAQKSIVLL 343
Query: 408 KNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE------- 460
KN +LPL S + + L+GP A + TLL NY G S R +T L+ L ++
Sbjct: 344 KNDSNILPLDDS-ARKILLMGPGAANILTLLANYYGMSSRLVTILEGLAEKIKTKTAISF 402
Query: 461 -----NTVYYPG-CDTVACSSASIDKAVDI--AKGADHVVLMMGLDQTQEKEE------- 505
+ +Y P V S +D I D V+ + GLD + E EE
Sbjct: 403 EYRQGSLMYEPNHLSNVPFGSTGVDAEAPIYGLDEIDLVIAVYGLDGSMEGEEGDSIASD 462
Query: 506 --LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPG 563
DR + LP Q + R+ +A KK V+L+L G P+ D ++L+A YPG
Sbjct: 463 ANGDRDTIELPSWQLNFLRRIRKAGKK-VVLILTGGSPIAFPEDLAD----AVLFAWYPG 517
Query: 564 EAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKE 623
E G A+A+++FGD +P G+LP+T +PQ ++P P GRTYR+ +
Sbjct: 518 EQGGNAVADILFGDVSPSGKLPIT-FPQSTAQLP-------PYDDYALKGRTYRYMKETP 569
Query: 624 VFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKF 683
++PFG GLSY+ + + +S +K+ S K
Sbjct: 570 LYPFGFGLSYTSFRFDSVELSSSKISAGNSVKAK-------------------------- 603
Query: 684 LVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCE 743
+ V N G+ + V L++ R P L GF+ + + A + A + EL P
Sbjct: 604 ---VQVSNTGKRDAEEVVQLYIAKDNRSEDEPASSLRGFRRLKILAGKSASVEIEL-PAS 659
Query: 744 SLSRAREDGLMVIEEGTHFLVVGDEE-YPISI 774
+ +G V+ G++ ++ D P+S+
Sbjct: 660 AFETINAEGASVLIPGSYTVIAADAAPLPVSV 691
>gi|121700633|ref|XP_001268581.1| beta-xylosidase XylA [Aspergillus clavatus NRRL 1]
gi|119396724|gb|EAW07155.1| beta-xylosidase XylA [Aspergillus clavatus NRRL 1]
Length = 743
Score = 392 bits (1007), Expect = e-106, Method: Compositional matrix adjust.
Identities = 239/743 (32%), Positives = 374/743 (50%), Gaps = 87/743 (11%)
Query: 33 CDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSE 92
CD + P ++T C T RA L+S TL+E ++ N++P +PRLG+P Y+ W+E
Sbjct: 52 CD-NGPLSKTI-VCDTLTSPYDRAAALISLFTLEELVNATGNTSPGVPRLGLPPYQVWNE 109
Query: 93 ALHGVAGVGKGIFFN-GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA 151
ALHG+ + F + G +TSFP ILT ++ + L ++ I + RA NAG+
Sbjct: 110 ALHGL---DRAYFTDEGQFSWSTSFPMPILTMSALNRTLINQVASIISTQGRAFSNAGR- 165
Query: 152 IGMTFWAPNINIFRDPRWGRGQETPGEDPL-VTGKYAVSYVRGVQGDTFNGGKLKGKLQA 210
G+ ++PNIN FR P WGRGQETPGED ++ YA Y+ G+QG G L+
Sbjct: 166 YGLDVYSPNINSFRHPVWGRGQETPGEDAYCLSSAYAYEYITGIQG-----GVDPKSLKL 220
Query: 211 SACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVN 270
A KH+ YD++NW G +R D +T QDL++ Y P F + + +MC+YN VN
Sbjct: 221 VATAKHYAGYDIENWDGHSRLGNDMNITQQDLSEYYTPQFLVAARDAKVRSVMCSYNAVN 280
Query: 271 GIPSCADRNLLSKTARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
G+PSCA+ L R +GF GYI+SDCD+ +++ YA + A D ++AG D
Sbjct: 281 GVPSCANSFFLQTLLRDTFGFVEDGYISSDCDSAYNVFNPHEYAANVSSAAADSIRAGTD 340
Query: 329 VNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVC 388
++CG+ Q + AV Q L ++I+R + L+S MRLG F+ P M
Sbjct: 341 IDCGTTYQYYFDEAVDQNLLSRADIERGVIRLYSNLMRLGYFDVGPWM------------ 388
Query: 389 SPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS 448
+VS L G NY GP+
Sbjct: 389 --------------------------------NVSTQLQG-----------NYFGPAPYL 405
Query: 449 ITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDR 508
I+PL A ++ + Y G + + S+ KA+ AK +D ++ G+D + E E LDR
Sbjct: 406 ISPLDAFRDSHLDVNYAFGTNISSNSTDGFSKALSAAKKSDAIIFAGGIDNSLEAETLDR 465
Query: 509 VDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAV 568
+++ PG+Q ELI ++++ KP+I++ + GG VD + K ++N+ S++W GYPG++G
Sbjct: 466 MNITWPGKQLELIDQLSQLG-KPLIVLQMGGGQVDSSLLKSNKNVNSLIWGGYPGQSGGQ 524
Query: 569 ALAEVIFGDHNPGGRLPMTWYPQDY-IKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPF 627
AL ++I G P GRL +T YP +Y + P TDM +RP NPG+TY +Y G V+ F
Sbjct: 525 ALLDIITGKRAPAGRLVVTQYPAEYATQFPATDMSLRPHGN--NPGQTYMWYTGTPVYEF 582
Query: 628 GCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFL-VT 686
G GL Y+ + ++ + + K+ ++ + P G E FL T
Sbjct: 583 GHGLFYTTFR------------VSHARAVKIKPTYNIQDLLAQPHPGYIHVEQMPFLNFT 630
Query: 687 IGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLS 746
+ + N G+ + + +LF P K LVGF + ++++ S++
Sbjct: 631 VDITNTGKASSDYTAMLFANTTAGPAPYPKKWLVGFDRLPTLGPSTSKLMTIPVTINSMA 690
Query: 747 RAREDGLMVIEEGTHFLVVGDEE 769
R E G V+ G + L + +E
Sbjct: 691 RTDELGNRVLYPGKYELALNNER 713
>gi|359409694|ref|ZP_09202159.1| Beta-glucosidase [Clostridium sp. DL-VIII]
gi|357168578|gb|EHI96752.1| Beta-glucosidase [Clostridium sp. DL-VIII]
Length = 723
Score = 392 bits (1007), Expect = e-106, Method: Compositional matrix adjust.
Identities = 234/607 (38%), Positives = 347/607 (57%), Gaps = 65/607 (10%)
Query: 56 ARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATS 115
A++LV+++TL EK QL ++PA+ RL IP Y WW+E LHGVA G AT
Sbjct: 29 AKELVAKMTLQEKAEQLTYNSPAVKRLNIPEYNWWNEGLHGVARAGT----------ATV 78
Query: 116 FPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-------GMTFWAPNINIFRDPR 168
FPQ I AA FD ++ I E RA YN G+T+W+PN+NIFRDPR
Sbjct: 79 FPQAIGLAAMFDEEFLGKVAGIIATEGRAKYNENSKKEDRDIYKGLTYWSPNVNIFRDPR 138
Query: 169 WGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGT 228
WGRG ET GEDP +T + V++V+G+QGD GK L+ SAC KHF + + +
Sbjct: 139 WGRGHETYGEDPYLTSRLGVAFVKGLQGD----GKY---LKLSACAKHFAVH---SGPES 188
Query: 229 TRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQ 288
R++F+A V+ +DL +TY P FE+CVK+ +M AYNR NG P C + LL R +
Sbjct: 189 LRHEFNAVVSQKDLHETYLPAFEACVKEANVESVMGAYNRTNGEPCCGSKALLKDILRGK 248
Query: 289 WGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKL 348
WGF G++ SDC A++ + + ++V ++ G D+NCG+ + + A K+ +
Sbjct: 249 WGFKGHVVSDCWALADFHMHHKVTSTATESVALAIENGCDLNCGN-MYLNLLLAYKEGLV 307
Query: 349 PESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLK 408
E +I A L + R +LG+F+ + + +I +V H ++L+A++ +VLLK
Sbjct: 308 TEEQITTAAERLMTTRFKLGMFDEDC---EYNQIPYEVNDCKEHNQVSLEASRKSMVLLK 364
Query: 409 NSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV---YY 465
N +G+LPL KSK ++A+IGPNANS L GNY+G + + T L + + +++ V Y
Sbjct: 365 N-NGILPLDKSKLKAVAVIGPNANSEIMLKGNYSGTASKYTTILDGIHDVLDDDVRVYYS 423
Query: 466 PGC-------DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---------LDRV 509
GC + +A + +AV +A+ AD V+L +GLD T E E+ D++
Sbjct: 424 EGCHLYKEKVEDLARRDDRLAEAVSVAERADVVILCLGLDSTIEGEQGDAGNGYGAGDKL 483
Query: 510 DLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVA 569
DL LPG QQEL+ +V E KPV++VL G + + A + +IL A YPG G A
Sbjct: 484 DLNLPGIQQELLEKVLETG-KPVVVVLGTGSGLTLNGA--EERCAAILNAWYPGSHGGTA 540
Query: 570 LAEVIFGDHNPGGRLPMTWYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFY-EGKEVFPF 627
A+++FG +P G+LP+T+Y +D K+P TD M+ GRTYR+ E ++PF
Sbjct: 541 AADILFGKCSPSGKLPVTFY-KDTDKLPEFTDYAMK--------GRTYRYMDESNCLYPF 591
Query: 628 GCGLSYS 634
G GL+YS
Sbjct: 592 GYGLTYS 598
>gi|373954937|ref|ZP_09614897.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
gi|373891537|gb|EHQ27434.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
Length = 723
Score = 391 bits (1005), Expect = e-106, Method: Compositional matrix adjust.
Identities = 254/737 (34%), Positives = 376/737 (51%), Gaps = 88/737 (11%)
Query: 51 PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
P R RDL+S+LTL+EK+ Q+++ +P++PRL +P Y WW+EALHGVA G
Sbjct: 30 PTDVRVRDLISKLTLEEKVHQMMDVSPSVPRLNLPKYNWWNEALHGVARSGV-------- 81
Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------MTFWAPNINI 163
AT FPQ I A+FD L R AI EARA+YNA G +TFW PNINI
Sbjct: 82 --ATIFPQAIALGATFDQDLAKRESTAISDEARAMYNAAMVNGYNEKYGGLTFWTPNINI 139
Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
FRDPRWGRGQET GEDP +T + V++++G+QGD L+ +AC KHF +
Sbjct: 140 FRDPRWGRGQETYGEDPFLTSQIGVAFIQGLQGDD------PEHLKVAACAKHFAVH--- 190
Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
+ R+ F+A + +DL +TY P F++ V R +MCAYNR N C LL +
Sbjct: 191 SGPERLRHSFNAIASPKDLRETYLPAFKALV-NARVEAVMCAYNRTNSEVCCGSNLLLDQ 249
Query: 284 TARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAV 343
R +W F G++ SDC A+ Y +AV +K G+D+NCG + AV
Sbjct: 250 ILRDEWHFTGHVVSDCGAIVDFYMGHKVVPGQPEAVALAVKHGVDLNCGDEYPALIE-AV 308
Query: 344 KQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDG 403
K+ + E EID+AL L R +LGLF+ P+ I V+ S H+ LA + A
Sbjct: 309 KRGLITEKEIDKALATLLKTRFKLGLFDPKQN-SPYNNIPVSVINSTDHRALAKEVALKS 367
Query: 404 IVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE--- 460
IVLLKN LPL K+ + GPNA S L+GNY G + T L+ + ++
Sbjct: 368 IVLLKNEK-CLPL-KNNLSKYYITGPNAASVDALMGNYYGVNPHMSTILEGIAGAIQPGS 425
Query: 461 NTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---------LDRVDL 511
Y PG ++ ID AK +D ++MG+ E EE DR+D
Sbjct: 426 QMQYKPGILLDRDNNNPIDWTTGDAKASDVTFVVMGITGLLEGEEGEAIASPNYGDRLDY 485
Query: 512 VLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALA 571
LP Q + + ++ + K V+ ++ G P++++ + ++L A YPGE G A+A
Sbjct: 486 NLPKNQIDFLRKIRKGNKNKVVAIITGGSPMNLS--EVHELADAVLLAWYPGEEGGNAVA 543
Query: 572 EVIFGDHNPGGRLPMTWYPQDYIKV-PMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCG 630
+++FG +P GRLP+T +P+ + ++ P D M+ GRTYR+ + ++ FG G
Sbjct: 544 DILFGKVSPSGRLPVT-FPKSFAQLPPYEDYSMK--------GRTYRYMTAEPMYTFGYG 594
Query: 631 LSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVK 690
LSYS Y+Y +S+ ++ N T ET V
Sbjct: 595 LSYSTYTYSSLTLSEKQIKKNM----------------------TIIAETM-------VT 625
Query: 691 NHGEMAGKHPVLLFVK-PARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAR 749
N G+M G+ V L++ P N P L GF+ V L A E ++ F+++P + +
Sbjct: 626 NTGKMEGEEVVQLYITVPQTEKN--PQYSLKGFKRVNLKAGESRKVQFQITP-DLMKSVD 682
Query: 750 EDGLMVIEEGTHFLVVG 766
+G V+ G++ + +G
Sbjct: 683 ANGSEVLLSGSYVVRIG 699
>gi|367032987|ref|XP_003665776.1| glycoside hydrolase family 3 protein [Myceliophthora thermophila
ATCC 42464]
gi|347013048|gb|AEO60531.1| glycoside hydrolase family 3 protein [Myceliophthora thermophila
ATCC 42464]
Length = 835
Score = 390 bits (1003), Expect = e-105, Method: Compositional matrix adjust.
Identities = 248/643 (38%), Positives = 343/643 (53%), Gaps = 41/643 (6%)
Query: 22 TRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPR 81
T D T+PP S C TLP ++RA LV+ LT +EK+ LV+ AP PR
Sbjct: 14 TYPDCTKPPLS---------DIKVCDRTLPEAERAAALVAALTDEEKLQNLVSKAPGAPR 64
Query: 82 LGIPAYEWWSEALHGVAGVGKGIFFNG--TIRGATSFPQVILTAASFDSYLWYRIGQAIG 139
+G+PAY WWSEALHGVA F +G +TSFP +L AA+FD L +G IG
Sbjct: 65 IGLPAYNWWSEALHGVAHAPGTQFRDGPGDFNSSTSFPMPLLMAAAFDDELIEAVGDVIG 124
Query: 140 LEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTF 199
EARA NAG + G+ +W PN+N FRDPRWGRG ETPGED + +YA S +RG++G +
Sbjct: 125 TEARAFGNAGWS-GLDYWTPNVNPFRDPRWGRGSETPGEDVVRLKRYAASMIRGLEGRSS 183
Query: 200 NGGKLKGKLQAS-----ACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCV 254
+ + CKH+ D ++W GTTR+ FDA ++ QDLA+ Y PF+ C
Sbjct: 184 SSSSCSFGSGGEPPRVISTCKHYAGNDFEDWNGTTRHDFDAVISAQDLAEYYLAPFQQCA 243
Query: 255 KQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGF---HGYITSDCDAVSIIYDAEGY 311
+ R +MCAYN VNG+PSCA+ L++ R W + Y+TSDC+AV + Y
Sbjct: 244 RDSRVGSVMCAYNAVNGVPSCANSYLMNTILRGHWNWTEHDNYVTSDCEAVLDVSAHHHY 303
Query: 312 AKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFN 371
A + + +AGMD +C A L +DRAL L+ +R+G F+
Sbjct: 304 ADTNAEGTGLCFEAGMDTSCEYEGSSDIPGASAGGFLTWPAVDRALTRLYRSLVRVGYFD 363
Query: 372 GNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSV--------- 422
G P +G V P Q LAL+AA +GIVLLKN + LPLP V
Sbjct: 364 G--PESPHASLGWADVNRPEAQELALRAAVEGIVLLKNDNDTLPLPLPDDVVVTADGGRR 421
Query: 423 SLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPG----CDTVACSSASI 478
+A+IG A++ L G Y+G + +P A + N G D+
Sbjct: 422 RVAMIGFWADAPDKLFGGYSGAPPFARSPASAARQLGWNVTVAGGPVLEGDSDEEEDTWT 481
Query: 479 DKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLC 538
AV+ A AD++V GLD + E DR+ + P Q LI+ +A KPV++V +
Sbjct: 482 APAVEAAADADYIVYFGGLDTSAAGETKDRMTIGWPAAQLALISELARLG-KPVVVVQMG 540
Query: 539 GGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK-VP 597
D + D +G++LWA +PG+ G A+ ++ G +P GRLP+T YP +Y VP
Sbjct: 541 DQLDDTPLFELD-GVGAVLWANWPGQDGGTAVVRLLSGAESPAGRLPVTQYPANYTDAVP 599
Query: 598 MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKF 640
+TDM +RP AT NPGRTYR+Y V PFG GL Y+ + +F
Sbjct: 600 LTDMTLRPSAT--NPGRTYRWYP-TPVRPFGFGLHYTTFRAEF 639
>gi|333494646|gb|AEF56854.1| putative glycosyl hydrolase [synthetic construct]
Length = 743
Score = 390 bits (1002), Expect = e-105, Method: Compositional matrix adjust.
Identities = 265/746 (35%), Positives = 383/746 (51%), Gaps = 96/746 (12%)
Query: 49 TLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNG 108
L +RARDLVSR+TL+EKI+Q+ + AP+I RLG+PAY WW+EALHGVA G
Sbjct: 34 NLSFEERARDLVSRMTLEEKIAQMQHEAPSIERLGVPAYNWWNEALHGVARAGV------ 87
Query: 109 TIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------MTFWAPNI 161
+T FPQ I AA+FD+ L + I E RA Y+ Q G +TFW+P I
Sbjct: 88 ----STMFPQAIGMAATFDAELIEKTADVISTEGRARYHEFQRKGDRDIYKGLTFWSPTI 143
Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGK-LQASACCKHFTAY 220
NI RDPRWGRGQET GEDP +T + AVS++RG+QG +G+ L+A+AC KHF +
Sbjct: 144 NIDRDPRWGRGQETYGEDPYLTSRLAVSFIRGIQG--------RGRYLKAAACAKHFAVH 195
Query: 221 DLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNL 280
+ R++F+A V+ +DL +TY P FE+ VK+ + +G+M AYNRVNG P C L
Sbjct: 196 SGPE---SERHQFNAEVSQKDLWETYLPAFEASVKEAKVAGVMGAYNRVNGEPCCGSGTL 252
Query: 281 LSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTK 340
L R +W F GY+TSDC A+ I + G K+ E++ +K+G D+NCG K
Sbjct: 253 LGDVLRGEWEFGGYVTSDCWAIKDINEGHGVTKTIEESSALAVKSGCDLNCGCAYASLVK 312
Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
A + + E EID A+H L RMRLG+F+ P P+ I + H+ AL+ A
Sbjct: 313 -AYRAGLIGEKEIDTAVHRLMLTRMRLGMFDA-PEKVPYSSIPYEKNDCAEHRAFALEVA 370
Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE 460
+ +VLL+N G LPL +S+ S+A+IGPNA+S L GNY G + +T L ++ V
Sbjct: 371 EKSLVLLRNRSGFLPLDRSRIRSVAVIGPNADSRVALEGNYNGTASEYVTVLDGIREAVG 430
Query: 461 N--TVYYP-GCDTVACSSASIDK-------AVDIAKGADHVVLMMGLDQTQEKEE----- 505
+ VYY G S + + A A+ AD V+ +GL++ E EE
Sbjct: 431 DRARVYYAEGSHLFRNSMGGLSQKNDRLAEAAAAAERADVAVVCLGLNRDIEGEEGDPSN 490
Query: 506 ----LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGY 561
D+ DL LPG Q+EL+ V +A PV+LVLL G + + +A D N +++ A Y
Sbjct: 491 EYPAGDKRDLRLPGLQEELLETV-KATGTPVVLVLLSGSALAVNWA--DENADAVVQAWY 547
Query: 562 PGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYR-FYE 620
PG A A +FG P G P + ++ R G +
Sbjct: 548 PG-AQAEGRRGALFGIIRPAGGFP-----------SRSTVRTRTSRIFGTIHENRLPLLQ 595
Query: 621 GKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCET 680
G ++PFG GLSY+K+ Y ++ +++ + +
Sbjct: 596 GDPLYPFGYGLSYTKFQYGDLKLAASEIPAGEDAE------------------------- 630
Query: 681 RKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELS 740
V++ V+N GE V L+++ P QL GF+ V L E A + F ++
Sbjct: 631 ----VSVTVRNAGERDSDEVVQLYLQDLESSVPVPKWQLAGFRRVHLKPGESAGVRFTVA 686
Query: 741 PCESLSRAREDGLMVIEEGTHFLVVG 766
++ EDG V+E G + G
Sbjct: 687 -ARQMALIDEDGRCVLEPGGFRVYAG 711
>gi|451821678|ref|YP_007457879.1| periplasmic beta-glucosidase BglX [Clostridium
saccharoperbutylacetonicum N1-4(HMT)]
gi|451787657|gb|AGF58625.1| periplasmic beta-glucosidase BglX [Clostridium
saccharoperbutylacetonicum N1-4(HMT)]
Length = 710
Score = 390 bits (1002), Expect = e-105, Method: Compositional matrix adjust.
Identities = 236/619 (38%), Positives = 340/619 (54%), Gaps = 65/619 (10%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
++A++LVS++TL E+ QL APAI L I Y WW+E LHGVA G A
Sbjct: 14 EKAKELVSKMTLQERAEQLTYKAPAIKHLNISRYNWWNEGLHGVARAGT----------A 63
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-------GMTFWAPNINIFRD 166
T FPQ I AA FD L +I I E RA YN G+TFW+PN+NIFRD
Sbjct: 64 TVFPQAIGLAAIFDDELLEKIAGIIATEGRAKYNENSKKEDKDIYKGLTFWSPNVNIFRD 123
Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
PRWGRG ET GEDP +T + V++V+G+QGD + L+ +AC KHF + +
Sbjct: 124 PRWGRGHETYGEDPYLTSRLGVAFVKGLQGD-------EKYLKIAACAKHFAVH---SGP 173
Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
R++F+A V+ +DL +TY P FE+CVK+ +M AYNR N P C LL R
Sbjct: 174 EGLRHEFNAVVSKKDLYETYLPAFEACVKEADVEAVMGAYNRTNDEPCCGSSLLLKDILR 233
Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGS-FLQKHTKAAVKQ 345
+W F G++ SDC A++ + G + ++ +K G D+NCG+ +LQ A K+
Sbjct: 234 GKWQFKGHVVSDCWAIADFHLYHGVTSTATESAALAIKNGCDLNCGNVYLQ--MLLAYKE 291
Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
+ E +I RA L + R+RLG+F+ F KI + H ++L A++ IV
Sbjct: 292 GLVTEEDITRAAERLMATRIRLGMFDEEC---EFNKIPYTMNDCKEHHEVSLMASRKSIV 348
Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN---- 461
+L+N +GLLPL KSK S+ +IGPNA+S L GNY G + + IT L+ + V++
Sbjct: 349 MLRN-NGLLPLDKSKLKSIGIIGPNADSELMLKGNYFGTASKYITVLEGIHEAVDSENIR 407
Query: 462 TVYYPGC-------DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE--------- 505
Y GC +A + +AV +A+ +D V+L +GLD + E E+
Sbjct: 408 IFYSEGCHLYKDRVQDLAEPDDRMAEAVTVAEHSDVVILCLGLDSSIEGEQGDAGNSDGA 467
Query: 506 LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEA 565
D+++L LPG+QQEL+ +V A KPVI+VL G +T + N +IL A YPG
Sbjct: 468 GDKLNLNLPGKQQELLEKVI-ATGKPVIVVLGAGSA--LTLQGQEENCAAILNAWYPGSF 524
Query: 566 GAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVF 625
G A+A++IFG +P G+LP+T+Y TD M+ RTYR+ + + ++
Sbjct: 525 GGRAIADLIFGKCSPSGKLPVTFYKTTEELPEFTDYSMK--------NRTYRYMKNESLY 576
Query: 626 PFGCGLSYSKYSYKFKAVS 644
PFG GL+YSK +VS
Sbjct: 577 PFGFGLTYSKVQLSDLSVS 595
>gi|167537541|ref|XP_001750439.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163771117|gb|EDQ84789.1| predicted protein [Monosiga brevicollis MX1]
Length = 834
Score = 390 bits (1002), Expect = e-105, Method: Compositional matrix adjust.
Identities = 258/747 (34%), Positives = 377/747 (50%), Gaps = 68/747 (9%)
Query: 35 PSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQL-VNSAPAIPRLGIPAYEWWSEA 93
P ++ ++PFC T L + R +DLVSR++ + +QL + I +G+PAY W + A
Sbjct: 99 PCMGASSSYPFCDTKLSVDDRLKDLVSRVSTADAATQLRARESAQIDNIGLPAYYWGTNA 158
Query: 94 LHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG 153
+HG+ +G + TSFP +A+F+ L +G+ IG E RA YN G
Sbjct: 159 IHGMQNTA--CLADG--QCPTSFPAPNGLSATFNYSLVKDMGRIIGRELRAYYNTKFHNG 214
Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
+ W+P IN RDPRWGR E+PGE P V G+Y +Y G+Q G K QA
Sbjct: 215 LDTWSPTINPSRDPRWGRNVESPGESPFVCGQYGAAYTEGLQN-----GDDKDYTQAVVT 269
Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
KH+ AY ++++ TRY+++A V+ DL DTY P +E VK + G+MC+YN +NG+P
Sbjct: 270 LKHWVAYSVEDYDNVTRYEYNAIVSEYDLMDTYFPGWEYVVKNAKPLGVMCSYNSLNGVP 329
Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGS 333
+C + L+ R WGF GYITSD D++ I+ Y + A D L G D++ G
Sbjct: 330 TCGNP-ALTAYLREDWGFEGYITSDSDSIHCIWADHHYESNAVLATRDGLLGGCDIDSGD 388
Query: 334 FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQ 393
+ +AAV Q + S +D AL N + +R LGLF+ N T + +I AD V + Q
Sbjct: 389 TYADNLEAAVNQSLVNRSAVDAALTNSYRMRFNLGLFDPNVT-NAYDRISADEVGMSSSQ 447
Query: 394 VLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS--ITP 451
+L AA+ + LLKN LP K V A+IG ++NSA+ +LGNY GP C S
Sbjct: 448 ETSLLAARKSMTLLKNDGQTLPFATGKKV--AVIGKSSNSAEDILGNYVGPICPSGAFDC 505
Query: 452 LQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDL 511
+Q L V G T++ A I+ A+ +A AD VVL + + Q E DR +
Sbjct: 506 VQTLYQGVA-AANQGGATTLSDDVADINTAIQLAMDADQVVLTIS-NYGQAGEGKDRTYI 563
Query: 512 VLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALA 571
L QQEL+ V + K P +V+L GG + + + K + +IL A PG G A+A
Sbjct: 564 GLDTDQQELVAAVLKVGK-PTAIVMLNGGLISLDWIKDEAQ--AILVAFAPGVHGGQAVA 620
Query: 572 EVIFGDHNPGGRLPMTWYPQDYIK-VPMTDMKMRPQAT-----------SGNPGRTYRFY 619
E IFG +NPGG+LP+T Y DY+ V +M M+ A PGR+Y++Y
Sbjct: 621 ETIFGANNPGGKLPVTMYASDYVNDVDFLNMSMQAVAVLHLMNVNGERDDTGPGRSYKYY 680
Query: 620 EGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCE 679
G+ ++PF GLSY+ ++ + + P + T
Sbjct: 681 TGEPLYPFAYGLSYTTFN---------------------------LSWSPAPPMTTFTST 713
Query: 680 TRKFLVTIGVKNHGEMAGKHPVLLFVKPARR-------GNGRPIKQLVGFQSVILNAKEK 732
R T V N G + G V F KP GN PIK++ GFQ V L +
Sbjct: 714 LRSTTYTATVTNTGSVGGDEVVFAFYKPKSESLKTLPVGNPVPIKEIFGFQRVALGPGQS 773
Query: 733 AEIVFELSPCESLSRAREDGLMVIEEG 759
++ FEL+ E+L++ DG + G
Sbjct: 774 TQVTFELN-AETLAQVTLDGHRELHSG 799
>gi|323344407|ref|ZP_08084632.1| beta-glucosidase [Prevotella oralis ATCC 33269]
gi|323094534|gb|EFZ37110.1| beta-glucosidase [Prevotella oralis ATCC 33269]
Length = 722
Score = 388 bits (996), Expect = e-105, Method: Compositional matrix adjust.
Identities = 260/747 (34%), Positives = 386/747 (51%), Gaps = 87/747 (11%)
Query: 40 TETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAG 99
T TF K + Q+A+ ++S+LTLDEKISQL A I RLGI Y W +EALHGV
Sbjct: 19 TFTFAQSKKEKEMIQKAKSIISQLTLDEKISQLTQDAKGIDRLGIKPYYWLNEALHGVGR 78
Query: 100 VGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY-------NAGQAI 152
G+ AT FPQ I A+FD + +IG AI E RA + N
Sbjct: 79 DGR----------ATVFPQPISLGATFDPEIVQQIGDAIATEGRAKFIVAQRQKNYSMYA 128
Query: 153 GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQAS 211
G+TFWAPN+NIFRDPRWGRG ET GEDP +TG ++V+G+QG D F L+A+
Sbjct: 129 GLTFWAPNVNIFRDPRWGRGMETYGEDPFLTGVLGTAFVKGMQGNDPF-------YLKAA 181
Query: 212 ACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
AC KHF + + TR+ + T DL +TY P F+ V+QG+ IM AY R+ G
Sbjct: 182 ACGKHFAVH---SGPERTRHTANVEPTKHDLYETYLPAFKMLVQQGKVESIMGAYQRLYG 238
Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC 331
+ LL+ R+ WGF G++ SDC AV+ +Y+ KS +AV +KAG+++ C
Sbjct: 239 ESCSGSKYLLTDILRKDWGFKGHVVSDCGAVTDMYEGHKLVKSEAEAVAFAIKAGLNLEC 298
Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
G+ + + K A+KQK + E ++D+AL L R++LG+ + P+ + V+ S
Sbjct: 299 GNSM-RTMKDALKQKLITEKDLDKALLPLMMTRLKLGILQPDVAC-PYNEFPESVIGSID 356
Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITP 451
++ +A +AA++ +VLLKN G+LP+ K +L + GP A A L+GNY G S R T
Sbjct: 357 NRNIAQRAAEESMVLLKND-GVLPIAKDIR-TLFVTGPGATDAYYLMGNYFGLSDRYSTY 414
Query: 452 LQALQNYVENTV---YYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELD- 507
L+ + V N Y G V + ++ +V ++GA+ +++MG E EE D
Sbjct: 415 LEGIVGKVSNGTSVNYKQGFMQVFKNLNDVNWSVSESRGAEVSIIIMGNSGNTEGEEGDA 474
Query: 508 --------RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWA 559
RVDL LP Q + + V++ +++VL G P+D+ + +++ A
Sbjct: 475 IASSERGDRVDLRLPEPQMQYLREVSKDRTNKLVVVLTGGSPIDV--KEITELADAVVMA 532
Query: 560 GYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVP-MTDMKMRPQATSGNPGRTYRF 618
YPG+ G VALA ++FGD N GRLP+T +P+ K+P D M+ GRTY++
Sbjct: 533 WYPGQEGGVALANLLFGDANFSGRLPVT-FPETTDKLPSFDDYSMK--------GRTYKY 583
Query: 619 YEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFC 678
++PFG GLSY K +Y V+ KL SS T
Sbjct: 584 MTDNILYPFGYGLSYGKVAYGNATVT--KLPTKHSSMT---------------------- 619
Query: 679 ETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFE 738
V++ + N G M V +++ G PI+ LV F+ V + FE
Sbjct: 620 ------VSVDLSNDGNMPVDEVVQVYLSTPSAGVTSPIESLVAFKRVKIAPHATVTTDFE 673
Query: 739 LSPCESLSRAREDGLMVIEEGTHFLVV 765
+ P E L +EDG + +G + +++
Sbjct: 674 I-PVERLETVQEDGTSKLLKGEYRVMI 699
>gi|373952439|ref|ZP_09612399.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
gi|373889039|gb|EHQ24936.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
Length = 721
Score = 388 bits (996), Expect = e-105, Method: Compositional matrix adjust.
Identities = 245/698 (35%), Positives = 358/698 (51%), Gaps = 87/698 (12%)
Query: 52 ISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIR 111
+S R +DL+SRLTL EK+S L + A+PRL IPAY WW+E LHGVA G+
Sbjct: 37 LSTRVQDLISRLTLAEKVSLLGYRSQAVPRLNIPAYNWWNEGLHGVARAGE--------- 87
Query: 112 GATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG-------QAIGMTFWAPNINIF 164
AT FPQ I AA+FD L ++ + EARA YN Q +G+TFW+PNINIF
Sbjct: 88 -ATIFPQAIAMAATFDDNLVKQVANVVSTEARAKYNLSTAMGRHLQYMGLTFWSPNINIF 146
Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASACCKHFTAYDLD 223
RDPRWGRGQET GEDP +T K +YV G+QG D + L+ SA KHF A+
Sbjct: 147 RDPRWGRGQETYGEDPFLTSKMGNAYVHGLQGTDPLH-------LKTSATAKHFVAHS-- 197
Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
+G Y FDA V +DL DTY F+S V G S IM AYNRVNG+P+ ++ L++
Sbjct: 198 GPEGERDY-FDALVDEKDLRDTYLYAFKSLVDGGVES-IMTAYNRVNGVPNSINKTLVND 255
Query: 284 TARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAV 343
++WGF G++ +DC A+ +Y + + +KAG+D++C S Q A+
Sbjct: 256 IVIKEWGFKGHVVTDCGALDDVYKTHKVLPNRMEVAAAAIKAGVDLDCSSIFQTDIINAI 315
Query: 344 KQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDG 403
K L E ++D AL + S + +LG F+ P+ PF GAD + + +H +LA Q AQ
Sbjct: 316 NNKLLTEKQVDAALAAVLSTQFKLGFFDA-PSSSPFYSFGADSIHNDSHVMLARQMAQKS 374
Query: 404 IVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV 463
+VLLKN +LPL S+ ++GPNA S L+ +Y G S +++ ++ + V+
Sbjct: 375 MVLLKNDKQILPLKMQNYSSIMVVGPNAASLDALVASYHGVSSKAVNFVEGITAAVDKGT 434
Query: 464 ---YYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---------LDRVDL 511
Y G D + + A AD V ++GL E E D+ DL
Sbjct: 435 RVEYDLGADYRDTTHFG---GIWGAGNADVTVAVIGLTPVLEGEAGDAFLSQTGGDKKDL 491
Query: 512 VLPGRQQELITRVAEAAKKPVILVLLCGGPVDI-TFAKYDRNIGSILWAGYPGEAGAVAL 570
LP + + ++ KKP+I V+ G VDI A Y +++ A YPGE G AL
Sbjct: 492 SLPAGDIAFMKALRKSVKKPIIAVVTSGSDVDIAAIAPY---ADAVILAWYPGEQGGNAL 548
Query: 571 AEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCG 630
A+++FG +P G LP+T+Y + M+ GRTYR++ G +PFG G
Sbjct: 549 ADILFGKISPSGHLPLTFYNSVNDLPAYNNYSMK--------GRTYRYFAGAVQYPFGFG 600
Query: 631 LSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVK 690
LSY+ ++Y+++ + T + +++ VK
Sbjct: 601 LSYTTFNYQWQQQPK-----------------------------TSYSAKDTIQLSVVVK 631
Query: 691 NHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILN 728
N G ++ V ++ N P+K+L GF+ + LN
Sbjct: 632 NTGNISADEVVQAYIGYPTL-NRMPLKELKGFKRITLN 668
>gi|413925165|gb|AFW65097.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 412
Score = 387 bits (994), Expect = e-104, Method: Compositional matrix adjust.
Identities = 185/301 (61%), Positives = 224/301 (74%), Gaps = 4/301 (1%)
Query: 29 PPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYE 88
PPFSC PS PFC T LP +QRA DLVSR+T EK SQL + A +PRLG+P+Y+
Sbjct: 84 PPFSCG-GGPSLG-LPFCNTKLPAAQRAADLVSRMTPAEKASQLGDVANGVPRLGVPSYK 141
Query: 89 WWSEALHGVAGVGKGIFFN-GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN 147
WW+EALHGVA GKGI + G +R ATSFPQV+LTAASF+ LW+RIGQA G EARA YN
Sbjct: 142 WWNEALHGVAISGKGIHMDRGAVRSATSFPQVLLTAASFNDNLWFRIGQATGKEARAFYN 201
Query: 148 AGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGK 207
GQA G+T W+PN+NIFRDPRWGRGQETPGEDP V +YA ++VRG+QG + N +
Sbjct: 202 IGQAEGLTMWSPNVNIFRDPRWGRGQETPGEDPAVASRYAAAFVRGLQGSSSNTKSVPPV 261
Query: 208 LQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYN 267
L SACCKH TAYDL++WKG TRY F A VT+QDLADT+ PPF SCV G+AS +MCAY
Sbjct: 262 LLTSACCKHATAYDLEDWKGVTRYSFRATVTVQDLADTFNPPFRSCVVDGKASCVMCAYT 321
Query: 268 RVNGIPSCADRNLLSKTARRQWGFHG-YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAG 326
VNG+PSCA+ +LL+KT R WG G Y+ +DCDAVSI+ +++ Y + ED V LKAG
Sbjct: 322 SVNGVPSCANADLLTKTFRGSWGLDGRYVAADCDAVSIMRNSQFYRPTAEDTVATTLKAG 381
Query: 327 M 327
M
Sbjct: 382 M 382
>gi|332307852|ref|YP_004435703.1| glycoside hydrolase family protein [Glaciecola sp. 4H-3-7+YE-5]
gi|332175181|gb|AEE24435.1| glycoside hydrolase family 3 domain protein [Glaciecola sp.
4H-3-7+YE-5]
Length = 733
Score = 387 bits (993), Expect = e-104, Method: Compositional matrix adjust.
Identities = 251/744 (33%), Positives = 383/744 (51%), Gaps = 77/744 (10%)
Query: 44 PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
P+ T LP +R L+ +TL EK SQLVN AI RLG+P Y++W+EALHGVA G+
Sbjct: 25 PWFDTQLPTQERIDLLIDAMTLKEKTSQLVNGNVAIERLGLPEYDFWNEALHGVARNGR- 83
Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------MTF 156
AT FPQ I AA+FD +L + I EARA +N IG +TF
Sbjct: 84 ---------ATVFPQAIGMAATFDQHLLLKAASVISDEARAKFNVSSEIGNRSKYSGLTF 134
Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKH 216
W PNINIFRDPRWGRGQET GEDP +T + + V G+QGD L+ +A KH
Sbjct: 135 WTPNINIFRDPRWGRGQETYGEDPYLTAQMGKAMVNGLQGDH------PKYLKTAAAAKH 188
Query: 217 FTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
F + + R++FDA + +D+ +TY P FE+ + + +M AYNRVNG P+
Sbjct: 189 FAVH---SGPEALRHEFDAIASPKDMYETYFPAFEALITEANVETVMAAYNRVNGHPAGG 245
Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQ 336
LL+ R +WGF G++ SDC ++ + + ++ + G D+NCG+ +
Sbjct: 246 SDFLLNTVLRDKWGFSGHVVSDCWGLADFHQYHKVTANAVESAALAINTGTDLNCGA-VY 304
Query: 337 KHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLA 396
AV+ + E ID+ L + + + +LG F+ P+ I ADVV S AH +A
Sbjct: 305 NALPDAVEAGLVDEKTIDKRLSKVLATKFKLGFFDPKDD-NPYNNISADVVNSEAHAQVA 363
Query: 397 LQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQ 456
+ A IVLL+N + +LPL ++ +L + GP A+S++ LLGNY G S ++ L +
Sbjct: 364 YEMAVKSIVLLQNKNNILPLDRNIR-NLYVTGPFASSSEVLLGNYYGLSGKTTNILDGIT 422
Query: 457 NYVE--NTVYYP-GCDTVACSSASIDKAVDIAKG-ADHVVLMMGLDQTQEKEE------- 505
V T+ Y G + ID AK D ++ +MGL E EE
Sbjct: 423 ANVSVGTTINYKQGILPYQANVNPIDWTTGEAKQMGDVIIAVMGLSGAYEGEEGEAIASP 482
Query: 506 --LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPG 563
DR+ L LP Q + ++ + KPVI+VL G PV++T + +I++A YPG
Sbjct: 483 HKGDRLSLDLPEHQIAFLRKLRKDNDKPVIVVLTAGTPVNLT--EIAELADAIVFAWYPG 540
Query: 564 EAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKE 623
+ G A+A+++FG+ +P GRLP+T+ P ++ ++ P GRTYR+ +
Sbjct: 541 QEGGKAVADILFGERSPSGRLPITF--------PKSEAQLPPYDDYSMQGRTYRYMTQEP 592
Query: 624 VFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKF 683
++PFG GLSY++ KF ++ + N + K+ P+
Sbjct: 593 MYPFGFGLSYAQ--VKFDNIT--------------LGNTQALASKNEPQ--------ENM 628
Query: 684 LVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCE 743
VT+ V N GE + V L++K G +P+ L GF + L A + +++F + P +
Sbjct: 629 TVTVNVTNTGEREFEEVVQLYLKTPDAGVSQPLHSLKGFTRIKLAAGQTEQVLFSI-PKK 687
Query: 744 SLSRAREDGLMVIEEGTHFLVVGD 767
L E G V+ +G + ++VG+
Sbjct: 688 HLYSINEQGKPVLLKGQYSVIVGN 711
>gi|238578959|ref|XP_002388893.1| hypothetical protein MPER_12044 [Moniliophthora perniciosa FA553]
gi|215450599|gb|EEB89823.1| hypothetical protein MPER_12044 [Moniliophthora perniciosa FA553]
Length = 658
Score = 387 bits (993), Expect = e-104, Method: Compositional matrix adjust.
Identities = 250/688 (36%), Positives = 364/688 (52%), Gaps = 66/688 (9%)
Query: 87 YEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY 146
Y WWSEAL+ ATSFP I A+FD L + I I EARA
Sbjct: 1 YNWWSEALN--------------FSSATSFPAPITMGATFDDGLIHAIATVISTEARAFN 46
Query: 147 NAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKG 206
N + G+ F+ PNIN F+DPRWGRGQETPGEDP +Y V G+QG G
Sbjct: 47 NVNRG-GLDFFTPNINPFKDPRWGRGQETPGEDPFHISQYVYQLVTGLQG-----GVGPT 100
Query: 207 KLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAY 266
L+ +A CKH+ AYDL+N G +R++FDA+VTMQDLA+ Y P F+SC++ + + IMC+Y
Sbjct: 101 NLKIAADCKHWAAYDLEN-LGVSRFEFDAKVTMQDLAEFYSPSFQSCIRDAKVASIMCSY 159
Query: 267 NRVNGIPSCADRNLLSKTARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLK 324
N VNGIPSCA+R LL AR WG +IT DC AV I+ Y P + L
Sbjct: 160 NAVNGIPSCANRYLLQTLARDFWGLGEEQWITGDCGAVGNIFARHHYTDDPANGTAVALN 219
Query: 325 AGMDVNCGSFLQKHTK---AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGK 381
AG D++C S +++ A+ + + E ++ A+ ++ +RL
Sbjct: 220 AGTDIDCDSGAAAYSQNLGQALNRSLVSEDQLRTAVTRQYNSLVRLSW------------ 267
Query: 382 IGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNY 441
DV PA Q LA QAA +GIVLLKN G+LPL S +A++GP AN+ + NY
Sbjct: 268 --DDVNTEPAQQ-LAYQAAVEGIVLLKN-DGILPLASSVK-KVAVVGPMANATTQMQSNY 322
Query: 442 AGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQ 501
G + ++P QA +N N + G + ++ A+ A AD V + G+D T
Sbjct: 323 NGIAPFLVSPQQAFRNAGFNVTFANGTGLNSSDTSGFSAAIAAADDADVVFYVGGIDTTI 382
Query: 502 EKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGY 561
E+E+ DR ++ G Q L+ ++A KP+I++ + GG VD + + + ++ +++W GY
Sbjct: 383 EREDRDRPEISWTGNQLALVQQLASLG-KPLIVLQMGGGQVDSSSLRDNTSVNALIWGGY 441
Query: 562 PGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK-VPMTDMKMRPQATSGNPGRTYRFYE 620
PG++G AL ++I G P GRLP+T YP Y+ PMTDM +RP +S NPGRTY++Y
Sbjct: 442 PGQSGGTALVDLITGKQAPAGRLPITQYPASYVDGFPMTDMTLRP--SSSNPGRTYKWYT 499
Query: 621 GKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCET 680
G +F FG GL Y+ + ++ + + + SS K N V H +LG
Sbjct: 500 GAPIFEFGFGLHYTTFDAEWASGGDSFSVQDLVSSAK---NSGVAHV----DLGV----L 548
Query: 681 RKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGF---QSVILNAKEKAEIVF 737
F VT V N G +A + LLF + + P K+LV + + + A A +
Sbjct: 549 DTFNVT--VTNSGTVASDYVALLFSRTTAGPSPAPNKELVSYTRVKGIEPGASSAASLKV 606
Query: 738 ELSPCESLSRAREDGLMVIEEGTHFLVV 765
L +++R E G V+ G + L++
Sbjct: 607 TLG---AVARTDEQGNRVLYPGEYVLLL 631
>gi|268610157|ref|ZP_06143884.1| glycoside hydrolase family 3 protein [Ruminococcus flavefaciens
FD-1]
Length = 690
Score = 386 bits (992), Expect = e-104, Method: Compositional matrix adjust.
Identities = 239/645 (37%), Positives = 360/645 (55%), Gaps = 70/645 (10%)
Query: 49 TLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNG 108
+L +RA DL +RLTL+E+ SQL APA+ RL IPAY WWSE LHGVA G
Sbjct: 8 SLSAQERAEDLTNRLTLEEQASQLKYDAPAVDRLDIPAYNWWSEGLHGVARAGT------ 61
Query: 109 TIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTFWAPNI 161
AT FPQ I AA FD ++G IG EARA YN A G+ W+PN+
Sbjct: 62 ----ATMFPQAIGLAAMFDEEAMNKVGSIIGDEARAKYNEYSAHGDHDIYKGLCLWSPNV 117
Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
NIFRDPRWGRGQET GEDP +T + V++ +G+QG+ G++ L+ +AC KH +
Sbjct: 118 NIFRDPRWGRGQETYGEDPYLTTRLGVAFAKGLQGE----GEV---LKTAACAKHLAVH- 169
Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
+ R++FDA + +D+ +TY P FE+ VK+ + G+M AYNRVNG P+CA + L+
Sbjct: 170 --SGPEAIRHEFDAVASPKDMEETYLPAFEALVKEAKVEGVMGAYNRVNGEPACASKFLM 227
Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTK 340
K +WGF GY SDC A+ + K+ ++ LK G D+NCG ++L H
Sbjct: 228 GKL--DEWGFDGYFVSDCWAIRDFHTNHMVTKTAPESAAMALKLGCDLNCGNTYL--HLL 283
Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
A + + + +I +A +L R+RLG+F+ + K+ +V + ++ A + +
Sbjct: 284 HAYNEGLINDEDIKKACTHLMRTRVRLGMFDDE---TEYDKLDYSIVANEENKAYARKCS 340
Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQN--- 457
+ +V+LKN +G+LPL SK ++ +IGPNA+S L GNY G + R IT L+ +Q+
Sbjct: 341 ERSMVMLKN-NGILPLDPSKIKTIGVIGPNADSRPALEGNYNGRADRYITFLEGIQDAFG 399
Query: 458 ----YVENT-VYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE------- 505
Y E + +Y C +A + + +A + + +D VVL +GLD T E EE
Sbjct: 400 GRVLYSEGSHLYKDRCMGLAVADDRLSEAEIVTEHSDVVVLCVGLDATIEGEEGDTGNEF 459
Query: 506 --LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPG 563
D+ DL LP Q++L+ V KPVI+V G +++ + + +++ A YPG
Sbjct: 460 SSGDKNDLRLPEAQRKLVETVMRKG-KPVIIVTAAGSAINV-----EADCDALIHAWYPG 513
Query: 564 EAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFYEGK 622
+ G ALA+++FG +P G+LP+T+Y D K+P TD M+ GRTYR+ +
Sbjct: 514 QFGGTALADILFGKISPSGKLPVTFY-TDTTKLPEFTDYSMK--------GRTYRYTQDN 564
Query: 623 EVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHY 667
++PFG GL+YSK K + + ++T + +DVV +
Sbjct: 565 ILYPFGYGLTYSKTEVSDLKFENGKASV-KVTNTGDFDTEDVVQF 608
>gi|410648100|ref|ZP_11358515.1| beta-glucosidase [Glaciecola agarilytica NO2]
gi|410132388|dbj|GAC06914.1| beta-glucosidase [Glaciecola agarilytica NO2]
Length = 733
Score = 386 bits (991), Expect = e-104, Method: Compositional matrix adjust.
Identities = 250/744 (33%), Positives = 381/744 (51%), Gaps = 77/744 (10%)
Query: 44 PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
P+ T LP +R L+ +TL EK SQLVN AI RLG+P Y++W+EALHGVA G+
Sbjct: 25 PWFDTQLPTQKRIDLLIDAMTLKEKTSQLVNGNVAIERLGLPEYDFWNEALHGVARNGR- 83
Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------MTF 156
AT FPQ I AA+FD +L + I EARA +N IG +TF
Sbjct: 84 ---------ATVFPQAIGMAATFDQHLLLKAASVISDEARAKFNVSSEIGNRSKYSGLTF 134
Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKH 216
W PNINIFRDPRWGRGQET GEDP +T + + V G+QGD L+ +A KH
Sbjct: 135 WTPNINIFRDPRWGRGQETYGEDPYLTAQMGKAMVNGLQGDH------PKYLKTAAAAKH 188
Query: 217 FTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
F + + R++FDA + +D+ +TY P FE+ V + +M AYNRVNG P+
Sbjct: 189 FAVH---SGPEALRHEFDAIASPKDMYETYFPAFEALVTEANVETVMAAYNRVNGHPAGG 245
Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQ 336
LL+ R +WGF G++ SDC ++ + + ++ + G D+NCG+ +
Sbjct: 246 SDFLLNTVLRDKWGFSGHVVSDCWGLADFHQYHKVTANAVESAALAINTGTDLNCGA-VY 304
Query: 337 KHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLA 396
AV+ + E ID+ L + + + +LG F+ P+ I ADVV S AH +A
Sbjct: 305 NALPDAVEAGLVDEKTIDKRLSKVLATKFKLGFFDPKDD-NPYNNISADVVNSEAHAQVA 363
Query: 397 LQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQ 456
+ A IVLL+N + +LPL ++ +L + GP A+S++ LLGNY G S ++ L +
Sbjct: 364 YEMAVKSIVLLQNKNNILPLDRNIR-NLYVTGPFASSSEVLLGNYYGLSGKTTNILDGIT 422
Query: 457 NYVE--NTVYYP-GCDTVACSSASIDKAVDIAKG-ADHVVLMMGLDQTQEKEE------- 505
V T+ Y G + ID AK D ++ +MGL E EE
Sbjct: 423 ANVSVGTTINYKQGILPYQANVNPIDWTTGEAKQMGDVIIAVMGLSGAYEGEEGEAIASP 482
Query: 506 --LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPG 563
DR+ L LP Q + ++ + KPVI+VL G PV++T + +I++A YPG
Sbjct: 483 HKGDRLSLDLPEHQIAFLRKLRKDNDKPVIVVLTAGTPVNLT--EIAELADAIVFAWYPG 540
Query: 564 EAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKE 623
+ G A+A+++FG+ +P GRLP+T+ P ++ ++ P GRTYR+ +
Sbjct: 541 QEGGKAVADILFGERSPSGRLPITF--------PKSEAQLPPYDDYSMQGRTYRYMTQEP 592
Query: 624 VFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKF 683
++PFG GLSY++ + + + +S ++ EN
Sbjct: 593 MYPFGFGLSYAQVKFDNITLGNTQAL---ASKNELQEN---------------------M 628
Query: 684 LVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCE 743
VT+ V N GE + V L++K G +P+ L GF + L A + +++F + P +
Sbjct: 629 TVTVNVTNTGEREFEEVVQLYLKTPDAGVSQPLHSLKGFTRIKLAAGQTEQVLFNI-PKK 687
Query: 744 SLSRAREDGLMVIEEGTHFLVVGD 767
L E G V+ +G + ++VG+
Sbjct: 688 HLYSINEQGKPVLLKGQYSVIVGN 711
>gi|373852136|ref|ZP_09594936.1| Beta-glucosidase [Opitutaceae bacterium TAV5]
gi|372474365|gb|EHP34375.1| Beta-glucosidase [Opitutaceae bacterium TAV5]
Length = 740
Score = 385 bits (990), Expect = e-104, Method: Compositional matrix adjust.
Identities = 239/629 (37%), Positives = 339/629 (53%), Gaps = 60/629 (9%)
Query: 44 PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
PF L + R RDLVSRLTL EK+SQ+ ++A AIPRLGIPAY +W+E LHGVA G+
Sbjct: 22 PFRDPDLALDHRVRDLVSRLTLAEKVSQMEHAAAAIPRLGIPAYNYWNECLHGVARNGR- 80
Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG----------QAIG 153
AT FPQ+I AA++D+ L YR+ AI EARA ++A Q G
Sbjct: 81 ---------ATVFPQIIGLAATWDTDLVYRVATAISDEARAKHHAALARQGFAQTQQYQG 131
Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
+TFW PNIN+FRDPRWGRGQET GEDP +T + A ++VRG+QGDT + L+ +AC
Sbjct: 132 LTFWTPNINLFRDPRWGRGQETWGEDPHLTARLAAAFVRGLQGDTPDT-----HLKLAAC 186
Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
KH+ + + R+ F+ARVT DL D+Y P FE V+ R +M AYNR P
Sbjct: 187 AKHYAVH---SGPENERHTFNARVTPHDLWDSYLPAFEHLVRHARVESVMGAYNRTLDEP 243
Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGS 333
CA + LL R +WGF G++ SDC A+ I++ P ++ L G D+ CG+
Sbjct: 244 CCASQFLLLDILRERWGFEGHVVSDCWALRDIHETHRITTDPVESAALALTKGCDLACGT 303
Query: 334 FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLF-----NGNPTMQPFGKIGADVVC 388
+ AV++ + E++IDRAL R +LG+F N NP P A V C
Sbjct: 304 TFEL-LGEAVQRGLITEADIDRALSRHLRARFKLGMFDPADDNRNPWSNPPAP-EAIVTC 361
Query: 389 SPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS 448
+ AH LA +AA VLL+N + +LPL + S+ + GP A + LLGNY G R+
Sbjct: 362 A-AHTALACEAAVASCVLLQNHNHILPL-RPDVRSIYITGPLAATQDALLGNYYGLPPRA 419
Query: 449 ITPLQALQNYVENTV---YYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE 505
IT L L + + Y PG +++ A D + +GL E EE
Sbjct: 420 ITLLDGLAAALPEGIRADYRPGALLSTPKQNALEWAEFDCASCDVTIACLGLTALLEGEE 479
Query: 506 ---------LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSI 556
DR D+ LP Q+ + + + + ++V+L GG ++ + +I
Sbjct: 480 GEAIASSLHGDRDDISLPPPQRLFLESLIQRGAR--VIVILFGGSA-LSLGPLADKVEAI 536
Query: 557 LWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTY 616
LWAGYPG+ G ALA+++ G +P GRLP+T+Y P + MR GRT+
Sbjct: 537 LWAGYPGQEGGRALADILLGRASPSGRLPITFYENINDLPPYANYSMR--------GRTH 588
Query: 617 RFYEGKEVFPFGCGLSYSKYSYKFKAVSQ 645
R+++G +PFG GL+Y++++Y VS
Sbjct: 589 RWFDGTPAWPFGFGLTYTRFTYSDLRVSD 617
>gi|109897152|ref|YP_660407.1| beta-glucosidase [Pseudoalteromonas atlantica T6c]
gi|109699433|gb|ABG39353.1| Beta-glucosidase [Pseudoalteromonas atlantica T6c]
Length = 733
Score = 385 bits (989), Expect = e-104, Method: Compositional matrix adjust.
Identities = 263/776 (33%), Positives = 392/776 (50%), Gaps = 94/776 (12%)
Query: 12 LLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQ 71
LL LC S P + D P+ T LP ++R L+ +TL EK SQ
Sbjct: 10 LLALCVAS---------PTLAND--------HPWFDTQLPTNERIESLIDAMTLKEKASQ 52
Query: 72 LVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLW 131
LVN AI RLG+P Y++W+EALHGVA G+ AT FPQ I AA+FD L
Sbjct: 53 LVNGNVAIERLGLPEYDFWNEALHGVARNGR----------ATVFPQAIGMAATFDQDLL 102
Query: 132 YRIGQAIGLEARALYNAGQAIG-------MTFWAPNINIFRDPRWGRGQETPGEDPLVTG 184
+ I EARA +N IG +TFW PNINIFRDPRWGRGQET GEDP +T
Sbjct: 103 LQAATVISDEARAKFNVSSEIGNRSKYSGLTFWTPNINIFRDPRWGRGQETYGEDPYLTA 162
Query: 185 KYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLAD 244
+ + V G+QGD L+ +A KHF + + R++FDA + +D+ +
Sbjct: 163 QMGKAMVNGLQGDH------PKYLKTAAAAKHFAVH---SGPEALRHEFDAIASEKDMYE 213
Query: 245 TYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSI 304
TY P FE+ V + +M AYNRVNG P+ LL+ R +WGF G+I SDC ++
Sbjct: 214 TYFPAFEALVTEADVETVMAAYNRVNGHPAGGSDFLLNTVLRDKWGFSGHIVSDCWGLAD 273
Query: 305 IYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVR 364
++ + ++ + G D+NCGS + AV+ + E ID LH + + +
Sbjct: 274 FHEYHKVTANAVESAALAINTGTDLNCGS-VYTALPDAVEAGLVDEKTIDTRLHKVLATK 332
Query: 365 MRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSL 424
+LG F+ P+ I ADVV S AH +A + A IVLL+N + +LPL K+ ++
Sbjct: 333 FKLGFFDPKDD-NPYNSISADVVNSDAHADVAYEMAVKSIVLLQNENQVLPLDKNIR-NV 390
Query: 425 ALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE--NTVYYP-GCDTVACSSASIDKA 481
+ GP A+S++ LLGNY G S ++ L + V T+ Y G + ID
Sbjct: 391 YVTGPFASSSEVLLGNYYGLSGKTTNILDGITANVSVGTTINYKQGILPYQANVNPIDWT 450
Query: 482 VDIAKG-ADHVVLMMGLDQTQEKEE---------LDRVDLVLPGRQQELITRVAEAAKKP 531
AK D ++ +MGL E EE DR+ L LP Q E + ++ + KP
Sbjct: 451 TGEAKQMGDVIIAVMGLSGAYEGEEGEAIASPHKGDRLSLDLPEHQIEFLRKLRKDNDKP 510
Query: 532 VILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQ 591
VI+VL G PV++T + + +I++A YPG+ G A+A+++FG+ +P GRLP+T+
Sbjct: 511 VIVVLTAGTPVNVT--EIAQLADAIVFAWYPGQEGGKAVADILFGERSPSGRLPITF--- 565
Query: 592 DYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLN 651
P ++ ++ P GRTYR+ + ++PFG GLSY+ + KF ++ L
Sbjct: 566 -----PKSEAQLPPYDDYSMQGRTYRYMTEEPMYPFGFGLSYA--TVKFDNIT-----LG 613
Query: 652 QSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRG 711
+ + + Q GT V++ V N G + V L++K G
Sbjct: 614 NAEALSSTDGQK----------GT-------LDVSVNVTNTGTRELEEVVQLYLKTPNAG 656
Query: 712 NGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGD 767
+PI+ L GFQ + L + ++ F +S + L G V+ EG + ++VG+
Sbjct: 657 IDQPIQSLKGFQRIKLAPGQTGQVSFTVSK-KQLYSINAKGKPVLLEGDYHVIVGN 711
>gi|171695518|ref|XP_001912683.1| hypothetical protein [Podospora anserina S mat+]
gi|170948001|emb|CAP60165.1| unnamed protein product [Podospora anserina S mat+]
Length = 805
Score = 385 bits (989), Expect = e-104, Method: Compositional matrix adjust.
Identities = 264/734 (35%), Positives = 372/734 (50%), Gaps = 88/734 (11%)
Query: 46 CKTTLPISQRARDLVSRLTLDEKISQLVN-----SAP---------------AIPRLGIP 85
C TT RA LV L + EK+ LV AP R+G+P
Sbjct: 36 CDTTASPPARAAALVQALNITEKLVNLVEYVKSREAPLGISIQLITPHSMSLGAERIGLP 95
Query: 86 AYEWWSEALHGVAGVGKGIFFNGT---IRGATSFPQVILTAASFDSYLWYRIGQAIGLEA 142
AY WW+EALHGVA G+ FN ATSF I AA+FD+ L Y + I EA
Sbjct: 96 AYAWWNEALHGVAA-SPGVSFNQAGQEFSHATSFANTITLAAAFDNDLVYEVADTISTEA 154
Query: 143 RALYNAGQAIGMTFWAPNINIFRDPRWGRG------------------QETPGEDPLVTG 184
RA NA A G+ +W PNIN ++DPRWGRG Q+TPGEDP+
Sbjct: 155 RAFSNAELA-GLDYWTPNINPYKDPRWGRGHEVCYLSLLFRAVQLLRTQKTPGEDPVHIK 213
Query: 185 KYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLAD 244
Y + + G++G K++ + A CKHF AYDL+ W+G RY+F+A VT QDL++
Sbjct: 214 GYVQALLEGLEGR----DKIR---KVIATCKHFAAYDLERWQGALRYRFNAVVTSQDLSE 266
Query: 245 TYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGF---HGYITSDCDA 301
Y PF+ C + + MC+YN +NG P+CA L+ R+ W + + YITSDC+A
Sbjct: 267 YYLQPFQQCARDSKVGSFMCSYNALNGTPACASTYLMDDILRKHWNWTEHNNYITSDCNA 326
Query: 302 V-SIIYDAEGYAKSPEDAVVDVLKAGMDVNC---GSFLQKHTKAAVKQKKLPESEIDRAL 357
+ + + ++++P A D AG D C G A Q L E IDRAL
Sbjct: 327 IQDFLPNFHNFSQTPAQAAADAYNAGTDTVCEVPGYPPLTDVIGAYNQSLLSEEIIDRAL 386
Query: 358 HNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLP 417
L+ +R G + + + P+ KI V +P Q LALQ+A DGIVLLKN +GLLPL
Sbjct: 387 RRLYEGLIRAGYLD-SASPHPYTKISWSQVNTPKAQALALQSATDGIVLLKN-NGLLPLD 444
Query: 418 KSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSAS 477
+ ++ALIG AN+ + +LG Y+G P+ A N ++ V SS S
Sbjct: 445 LTNK-TIALIGHWANATRQMLGGYSGIPPYYANPIYAATQL--NVTFHHAPGPVNQSSPS 501
Query: 478 IDK-----AVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPV 532
+ A+ A +D ++ + G D + E+ DR + P Q L+T +A+ K +
Sbjct: 502 TNDTWTSPALSAASKSDIILYLGGTDLSIAAEDRDRDSIAWPSAQLSLLTSLAQMGKPTI 561
Query: 533 ILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQD 592
+ L G VD T + NI SILW GYPG++G AL +I G +P RLP+T YP+
Sbjct: 562 VARL--GDQVDDTPLLSNPNISSILWVGYPGQSGGTALLNIITGVSSPAARLPVTVYPET 619
Query: 593 YIK-VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLN 651
Y +P+T M +RP TS PGRTYR+Y V PFG GL Y+ ++ KF L +N
Sbjct: 620 YTSLIPLTAMSLRP--TSARPGRTYRWYP-SPVLPFGHGLHYTTFTAKFGVF--ESLTIN 674
Query: 652 QSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRG 711
+ ++V N + E + C R V++ V N GE+ + L+FV+
Sbjct: 675 IA---ELVSNCN--------ERYLDLC--RFPQVSVWVSNTGELKSDYVALVFVRGEYGP 721
Query: 712 NGRPIKQLVGFQSV 725
PIK LVG++ +
Sbjct: 722 EPYPIKTLVGYKRI 735
>gi|317474362|ref|ZP_07933636.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
gi|316909043|gb|EFV30723.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
Length = 723
Score = 384 bits (986), Expect = e-103, Method: Compositional matrix adjust.
Identities = 253/731 (34%), Positives = 369/731 (50%), Gaps = 112/731 (15%)
Query: 44 PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
P+ +L ++RA DLVSRLTL+EKI+ + N++ A+ RLGI YEWW+EALHGVA G
Sbjct: 24 PYQNKSLSPTERAADLVSRLTLEEKITLMQNNSSAVKRLGIKPYEWWNEALHGVARNGL- 82
Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY-------NAGQAIGMTF 156
AT +PQ I ASF+ L Y++ +I EAR Y N + G+TF
Sbjct: 83 ---------ATVYPQAIGMGASFNDTLLYQVFTSISDEARVKYRQAREAGNYKRYTGLTF 133
Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG---DTFNGGKLKGKLQASAC 213
W PNINIFRDPRWGRGQET GEDP +T + +S V G+QG +N + AC
Sbjct: 134 WTPNINIFRDPRWGRGQETYGEDPYLTSRMGLSVVNGLQGPQNTKYN--------KTHAC 185
Query: 214 CKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGI 272
KH+ + W R+ F+A + +DL +TY P F+ V QG +MCAYNR G
Sbjct: 186 AKHYAVHSGPEW---NRHSFNAENINPRDLWETYLPAFQDLVIQGNVKEVMCAYNRFEGD 242
Query: 273 PSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYA--KSPEDAVVDVLKAGMDVN 330
P C LL R +W + G + SDC A+ Y + K+ DA + +G D+
Sbjct: 243 PCCGSDRLLINILRNEWNYKGLVVSDCGAIDNFYFKGRHETHKNKADASAAAVLSGTDLE 302
Query: 331 CGSFLQKHTK--AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVC 388
CG + +T +AVK+ + ES ID++L L R LG + P+ ++ ++
Sbjct: 303 CG---RSYTGLISAVKEGLINESAIDQSLCRLMKARFELGEMD---DTTPWDQLPDSLLS 356
Query: 389 SPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS 448
AHQ LALQ A++ + LL+N +LPL K +++ALIGPNAN + NY G +
Sbjct: 357 CHAHQQLALQMARESMTLLQNHKNILPL--DKEMTVALIGPNANDSVMQWANYNGFPVHT 414
Query: 449 ITPLQALQNYV--ENTVYYPGCDT-------VACSSASIDKAVDIAKGADHVVLMMGLDQ 499
IT L+ L Y+ E +Y P + V I ++ A AD ++ G+
Sbjct: 415 ITLLEGLTQYLPQERLIYIPQKNIEVQKYPWVNYYPNDIQAVINQAAKADVIIYAGGISA 474
Query: 500 TQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKY 549
+ E EE+ DR + LP Q++L+ + +A KP++ V G + +
Sbjct: 475 SLEGEEMDVDAEGFRGGDRTTIELPNVQRKLV-KALKATGKPIVFVNFSGCAMGL--QPE 531
Query: 550 DRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATS 609
+ +IL A YPG+AG A+AEV+FGD+NP GRLP+T+Y +D D M+
Sbjct: 532 SQICDAILQAWYPGQAGGTAIAEVLFGDYNPAGRLPITFYKKDNQLPDFEDYNMQ----- 586
Query: 610 GNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKS 669
GRTYR+ + ++PFG GLSY+ +SY ST +EN
Sbjct: 587 ---GRTYRYLNYEPLYPFGHGLSYTTFSY----------------STPFIENGK------ 621
Query: 670 VPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNA 729
+ + V N G G + L++K +G P+K L GFQ + + A
Sbjct: 622 ---------------LKVKVTNSGNYNGDEVIQLYIKRYDDPDG-PLKTLRGFQRIHIPA 665
Query: 730 KEKAEIVFELS 740
+ +E+ F L+
Sbjct: 666 GQTSEVSFPLT 676
>gi|325970053|ref|YP_004246244.1| beta-glucosidase [Sphaerochaeta globus str. Buddy]
gi|324025291|gb|ADY12050.1| Beta-glucosidase [Sphaerochaeta globus str. Buddy]
Length = 698
Score = 384 bits (985), Expect = e-103, Method: Compositional matrix adjust.
Identities = 259/743 (34%), Positives = 386/743 (51%), Gaps = 96/743 (12%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
QRA++LV R+ L + +SQL + APAI LGIPAY WW+E LHG A G A
Sbjct: 5 QRAQELVERMNLPQMMSQLRHDAPAIESLGIPAYNWWNEGLHGSARSGT----------A 54
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTFWAPNINIFRD 166
T FPQ I A+ FD Y + + E RA YN G+T W+PN+NIFRD
Sbjct: 55 TVFPQAIGLASLFDPDFLYAVASVVSTEQRAKYNLFTHENDRDIYKGLTVWSPNVNIFRD 114
Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
PRWGRGQET GEDP +T + AV+++RG+QG+ G + L+ ++C KHF A+ +
Sbjct: 115 PRWGRGQETFGEDPYLTARLAVAFIRGLQGE----GPV---LKTASCVKHFAAH---SGP 164
Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
R+ F+A V +DL +TY P F S VK+ +A +M AY+ +N P CA L+ +T R
Sbjct: 165 EPLRHGFNAVVGKKDLEETYLPAFASAVKEAKADAVMGAYSALNDEPCCASSFLMEETLR 224
Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQK 346
+WGF G SDC A+ + K+ E++ LK G D+ CG Q K A ++
Sbjct: 225 LRWGFEGMYISDCWAIRDFHLNHKVTKNEEESAALALKRGCDLACGCEYQSLEK-AFQKG 283
Query: 347 KLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVL 406
+ +I +A + + R +LG F+ + +G + + S H LA +A+ +VL
Sbjct: 284 LITREQIKKAAIRVMTTRFKLGQFDQGTA---YDTLGLESLDSDEHAALAFEASCRSLVL 340
Query: 407 LKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENT---V 463
LKN LLPL K LA+IGPNA+S + L GNY G S R +T L+ L++YV ++ +
Sbjct: 341 LKND-ALLPLKKEAVSCLAVIGPNADSRQALWGNYHGTSSRYVTILEGLRDYVGSSTRIL 399
Query: 464 YYPG-------CDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL---------D 507
Y G + +A + +AV +AK +D VVL +GL++T E E D
Sbjct: 400 YSEGSNLTKNKVERLAKDDDRLSEAVFMAKASDVVVLCLGLNETVEGEMHDDGNGGWAGD 459
Query: 508 RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGA 567
+ DL LP Q++L+ VAE KP+I+VLL GG +D +Y N+ +++ A YPG+ G
Sbjct: 460 KDDLRLPLCQRKLLKAVAETG-KPIIVVLLSGGSLDPEIEQY-ANVKALIQAWYPGQEGG 517
Query: 568 VALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEV-FP 626
A+A +++G P G+LP+T+Y + P TD + RTYR+ + +V +P
Sbjct: 518 KAIAHLLYGALCPSGKLPVTFYKAEAKLPPFTDYSL--------IRRTYRYCDDPDVLYP 569
Query: 627 FGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVT 686
FG GLSY+ +S+ A + + N ++T +V N + ++V V
Sbjct: 570 FGFGLSYASFSFCLSAAQETE--QNGVAATVLVRNTSALDARTV--------------VQ 613
Query: 687 IGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLS 746
+ + G+ HPV L G +SV L A E+ +I F L + +
Sbjct: 614 LYLAMEGKDLPPHPV-----------------LCGMKSVHLKAGEETQITFILEE-KQFT 655
Query: 747 RAREDGLMVIEEGTHFLVVGDEE 769
+EDG G + L G +
Sbjct: 656 AVQEDGNRYAVRGGYTLYAGSSQ 678
>gi|339499234|ref|YP_004697269.1| beta-glucosidase [Spirochaeta caldaria DSM 7334]
gi|338833583|gb|AEJ18761.1| Beta-glucosidase [Spirochaeta caldaria DSM 7334]
Length = 699
Score = 382 bits (980), Expect = e-103, Method: Compositional matrix adjust.
Identities = 264/729 (36%), Positives = 381/729 (52%), Gaps = 90/729 (12%)
Query: 59 LVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQ 118
L+S ++L+EKI +++ A IPRLGIP Y WW+EALHGVA G+ AT FPQ
Sbjct: 15 LISNMSLEEKIGLMIHRAKGIPRLGIPDYNWWNEALHGVANNGE----------ATVFPQ 64
Query: 119 VILTAASFDSYLWYRIGQAIGLEARALYNA-GQAI------GMTFWAPNINIFRDPRWGR 171
I A+FD L +R+ +AI +EARA +NA G+ G+TFWAPNINIFRDPRWGR
Sbjct: 65 AIALGATFDEDLVHRVAEAISIEARAKFNAVGKEKAEQYHRGLTFWAPNINIFRDPRWGR 124
Query: 172 GQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTR 230
GQET GEDP++T + +YVRG+QG D + L+A+AC KHF + R
Sbjct: 125 GQETYGEDPVLTSRLGTAYVRGLQGSDPY-------YLRAAACAKHFAVHSGPE---GLR 174
Query: 231 YKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWG 290
+ F+A V+ +DL +TY P F++ VK G S +M AYNRVNG P+C LL + R +W
Sbjct: 175 HTFNAEVSQKDLEETYLPAFKALVKSGVES-VMGAYNRVNGEPACGSTYLLKQKLREEWQ 233
Query: 291 FHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPE 350
F G++ SDC A+ + +++ L++G D+NCG + AV + + E
Sbjct: 234 FQGHVVSDCWAICDFHKNHKVTNDILESIALALRSGCDLNCGDAYN-YLAEAVLKGYVTE 292
Query: 351 SEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNS 410
+I+RA+ L +LGL + + P+ I + H LAL+AA+ IVLLKN+
Sbjct: 293 DDINRAVVRLLITLDKLGLIHDD---GPYQGITIHQIDWKKHDSLALEAAEKSIVLLKNN 349
Query: 411 HGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV--ENTVYY-PG 467
G+LPL K K + + GPNA ++ LLGNYAG S R +T L+A+ E TV Y G
Sbjct: 350 -GVLPLKKDKISYIYVTGPNATNSDALLGNYAGVSSRLLTVLEAIVEEAGPEITVTYKKG 408
Query: 468 CDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRV---------DLVLPGRQQ 518
C D A + K AD + +MG D + E EE D + DL L Q
Sbjct: 409 CPLAERRVNPNDWASGVTKYADVTIAVMGRDTSVEGEEGDAILSSTYGDFEDLNLNDEQL 468
Query: 519 ELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDH 578
+ ++ E+ KP+I+VL+ G P I + +IL A YPG+AG A++ ++FG
Sbjct: 469 SYLHKLKESG-KPLIVVLMGGAP--ICSPELHEIADAILVAWYPGQAGGTAVSNIVFGKT 525
Query: 579 NPGGRLPMTWYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYS 637
NP G+LP+T +P+ ++P + M+ GRTYR+ + ++PFG GLSY+K
Sbjct: 526 NPSGKLPVT-FPKSVRQLPEFENYSMQ--------GRTYRYMTEEPLYPFGFGLSYTK-- 574
Query: 638 YKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAG 697
+FK V+ +KS E + +V+ + N G + G
Sbjct: 575 MEFKHVTG--------------------RWKS--------PEKDELIVSTELYNQGTIDG 606
Query: 698 KHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIE 757
+ V L+ P L+ F+ V++ A F++ P E L G VI
Sbjct: 607 EEVVQLYYHWKDAPFAVPNWSLIDFKRVLVAAGASCICEFKI-PLEKLQCIDPSGKGVIP 665
Query: 758 EGTHFLVVG 766
GT VG
Sbjct: 666 TGTLQFYVG 674
>gi|410639677|ref|ZP_11350222.1| beta-glucosidase [Glaciecola chathamensis S18K6]
gi|410140558|dbj|GAC08409.1| beta-glucosidase [Glaciecola chathamensis S18K6]
Length = 733
Score = 381 bits (979), Expect = e-103, Method: Compositional matrix adjust.
Identities = 250/744 (33%), Positives = 381/744 (51%), Gaps = 77/744 (10%)
Query: 44 PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
P+ T LP +R L+ +TL EK SQLVN AI RLG+P Y++W+EALHGVA G+
Sbjct: 25 PWFDTQLPTQKRIDLLIDAMTLKEKTSQLVNGNVAIERLGLPEYDFWNEALHGVARNGR- 83
Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------MTF 156
AT FPQ I AA+FD +L + I EARA +N IG +TF
Sbjct: 84 ---------ATVFPQAIGMAATFDQHLLLKAASVISDEARAKFNVSSEIGNRSKYSGLTF 134
Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKH 216
W PNINIFRDPRWGRGQET GEDP +T + + V G+QGD L+ +A KH
Sbjct: 135 WTPNINIFRDPRWGRGQETYGEDPYLTAQMGKAMVNGLQGDH------PKYLKTAAAAKH 188
Query: 217 FTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
F + + R++FDA + +D+ +TY P FE+ V + +M AYNRVNG P+
Sbjct: 189 FAVH---SGPEALRHEFDAIASPKDMYETYFPAFEALVTEANVETVMAAYNRVNGHPAGG 245
Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQ 336
LL+ R +WGF G++ SDC ++ + + ++ + G D+NCG+ +
Sbjct: 246 SDFLLNTVLRDKWGFSGHVVSDCWGLADFHQYHKVTANAVESAALAINTGTDLNCGA-VY 304
Query: 337 KHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLA 396
AV+ + E ID+ L + + + +LG F+ P+ I ADVV S AH +A
Sbjct: 305 NALPDAVEAGLVDEKTIDKRLSKVLATKFKLGFFDPKDD-NPYNNISADVVNSEAHAQVA 363
Query: 397 LQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQ 456
+ A IVLL+N + +LPL ++ +L + GP A+S++ LLGNY G S ++ L +
Sbjct: 364 YEMAVKSIVLLQNKNNILPLDRNIR-NLYVTGPFASSSEVLLGNYYGLSGKTTNILDGIT 422
Query: 457 NYVE--NTVYYP-GCDTVACSSASIDKAVDIAKG-ADHVVLMMGLDQTQEKEE------- 505
V T+ Y G + ID AK D ++ +MGL E EE
Sbjct: 423 ANVSVGTTINYKQGILPYQANVNPIDWTTGEAKQMGDVIIAVMGLSGAYEGEEGEAIASP 482
Query: 506 --LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPG 563
DR+ L LP Q + ++ + KPVI+VL G PV++T + +I++A YPG
Sbjct: 483 HKGDRLSLDLPEHQIAFLRKLRKDNDKPVIVVLTAGTPVNLT--EIAELADAIVFAWYPG 540
Query: 564 EAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKE 623
+ G A+A+++FG+ +P GRLP+T+ P ++ ++ P RTYR+ +
Sbjct: 541 QEGGKAVADILFGERSPSGRLPITF--------PKSEAQLPPYDDYSMQERTYRYMTQEP 592
Query: 624 VFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKF 683
++PFG GLSY++ KF ++ + N + K+ P+
Sbjct: 593 MYPFGFGLSYAQ--VKFDNIT--------------LGNTQALASKNEPQ--------ENM 628
Query: 684 LVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCE 743
VT+ V N GE + V L++K G +P+ L GF + L A + +++F + P +
Sbjct: 629 TVTVNVTNTGEREFEEVVQLYLKTPDAGVSQPLHSLKGFTRIKLAAGQTEQVLFNI-PKK 687
Query: 744 SLSRAREDGLMVIEEGTHFLVVGD 767
L G V+ +G + ++VG+
Sbjct: 688 HLYSINAQGKPVLLKGQYSVIVGN 711
>gi|255284060|ref|ZP_05348615.1| beta-glucosidase [Bryantella formatexigens DSM 14469]
gi|255265405|gb|EET58610.1| glycosyl hydrolase family 3 C-terminal domain protein
[Marvinbryantia formatexigens DSM 14469]
Length = 700
Score = 381 bits (979), Expect = e-103, Method: Compositional matrix adjust.
Identities = 233/607 (38%), Positives = 335/607 (55%), Gaps = 66/607 (10%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
+RA LV+++T++EK SQL APAI RLGIPAY WW+EALHGVA G+ A
Sbjct: 8 KRAEALVAQMTVEEKASQLKYDAPAIKRLGIPAYNWWNEALHGVARAGQ----------A 57
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-------GMTFWAPNINIFRD 166
T FPQ I A+FD L I I E RA YNA A G+TFW+PN+NIFRD
Sbjct: 58 TVFPQAIGLGATFDEALLGEIADVIATEGRAKYNAYAAKEDRDIYKGLTFWSPNVNIFRD 117
Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
PRWGRG ET GEDP +T + V++V+G+QGD ++A+AC KHF + +
Sbjct: 118 PRWGRGHETYGEDPCLTSRLGVAFVKGLQGDGET-------MKAAACAKHFAVH---SGP 167
Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
R++F+A + +D+ +TY P FE+ VK+ +M AYNR NG CA +L K R
Sbjct: 168 EAVRHEFNAEASAKDMEETYLPAFEALVKEADVEAVMGAYNRTNGEACCAS-PVLQKILR 226
Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTKAAVKQ 345
WGF G+ SDC A+ ++ + +++ + +G D+NCG ++L H A +
Sbjct: 227 EDWGFEGHFVSDCWAIRDFHEHHMLTATAKESAAMAINSGCDLNCGNTYL--HILHAYRD 284
Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
+ E I A LF+ R LGLF+G+ + I VV S H LA +AA + V
Sbjct: 285 GLVSEETITEAAVRLFTTRFLLGLFDGS----EYDDIPYTVVESKEHLALAEKAALESAV 340
Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYY 465
LLKN +G+LPL K + ++ +IGPNA+S L GNY G + R T Q LQ+Y+ V
Sbjct: 341 LLKN-NGILPLKKERLRTVGVIGPNADSRAALAGNYHGTASRYETIQQGLQDYLGEDVRV 399
Query: 466 ---PGC-------DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---------L 506
GC + +A + + +A +A+ +D V+L +GLD+T E EE
Sbjct: 400 LTSVGCALSEDRTEKLALAGDRLAEAQIVAENSDVVILCLGLDETLEGEEGDTGNSYASG 459
Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
D+ L+LP Q++L+ VA A KPV+L ++ G +D+++A + +IL YPG G
Sbjct: 460 DKETLLLPEAQRDLMEAVA-ATGKPVVLCMMSGSDLDMSYAA--EHFDAILQLWYPGSQG 516
Query: 567 AVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFP 626
A A+++FG+ +P G+LP+T+Y D M+ GRTYR+ +P
Sbjct: 517 GSAAAKLLFGEVSPSGKLPVTFYETLEELPAFEDYSMK--------GRTYRYMGHPAQYP 568
Query: 627 FGCGLSY 633
FG GL+Y
Sbjct: 569 FGFGLTY 575
>gi|7671419|emb|CAB89360.1| beta-glucosidase-like protein [Arabidopsis thaliana]
gi|9758998|dbj|BAB09525.1| unnamed protein product [Arabidopsis thaliana]
Length = 411
Score = 380 bits (977), Expect = e-102, Method: Compositional matrix adjust.
Identities = 201/414 (48%), Positives = 266/414 (64%), Gaps = 5/414 (1%)
Query: 365 MRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSL 424
MRLG F+GNP QP+G +G VC+ ++ LA++ A+ GIVLLKNS G LPL S +L
Sbjct: 1 MRLGFFDGNPKNQPYGGLGPKDVCTVENRELAVETARQGIVLLKNSAGSLPLSPSAIKTL 60
Query: 425 ALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDI 484
A+IGPNAN KT++GNY G +C+ TPLQ L+ V T Y+ GC V C+ A +D A +
Sbjct: 61 AVIGPNANVTKTMIGNYEGVACKYTTPLQGLERTVLTTKYHRGCFNVTCTEADLDSAKTL 120
Query: 485 AKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDI 544
A AD VL+MG DQT EKE LDR+DL LPG+QQEL+T+VA+AA+ PV+LV++ GG DI
Sbjct: 121 AASADATVLVMGADQTIEKETLDRIDLNLPGKQQELVTQVAKAARGPVVLVIMSGGGFDI 180
Query: 545 TFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKM 603
TFAK D I SI+W GYPGEAG +A+A+VIFG HNP G+LPMTWYPQ Y+ KVPMT+M M
Sbjct: 181 TFAKNDEKITSIMWVGYPGEAGGIAIADVIFGRHNPSGKLPMTWYPQSYVEKVPMTNMNM 240
Query: 604 RPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKF-KAVSQNKLYLNQSSSTKMVENQ 662
RP ++G GRTYRFY G+ V+ FG GLSY+ +S++ KA L L++S S + E Q
Sbjct: 241 RPDKSNGYLGRTYRFYIGETVYAFGDGLSYTNFSHQLIKAPKFVSLNLDESQSCRSPECQ 300
Query: 663 DVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGF 722
+ P E F V + V+N G+ G V LF P +G P KQL+GF
Sbjct: 301 SLDAIG--PHCEKAVGERSDFEVQLKVRNVGDREGTETVFLFTTPPEV-HGSPRKQLLGF 357
Query: 723 QSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
+ + L KE+ + F++ C+ L E G + G H L VG ++ +I V
Sbjct: 358 EKIRLGKKEETVVRFKVDVCKDLGVVDEIGKRKLALGHHLLHVGSLKHSFNISV 411
>gi|374316077|ref|YP_005062505.1| beta-glucosidase-like glycosyl hydrolase [Sphaerochaeta pleomorpha
str. Grapes]
gi|359351721|gb|AEV29495.1| beta-glucosidase-like glycosyl hydrolase [Sphaerochaeta pleomorpha
str. Grapes]
Length = 701
Score = 380 bits (977), Expect = e-102, Method: Compositional matrix adjust.
Identities = 229/611 (37%), Positives = 333/611 (54%), Gaps = 61/611 (9%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
++A+ LV+ ++L E SQL++ APAIPRLG+P Y WW+EALHG A G A
Sbjct: 8 EQAKQLVAHMSLKEMFSQLLHEAPAIPRLGLPRYNWWNEALHGAARSGT----------A 57
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------MTFWAPNINIFRD 166
T FPQ I AA FD I I E RA YN A+G +T W+PN+NIFRD
Sbjct: 58 TVFPQAIGLAAMFDDVFLKEIATVISTEQRAKYNTFSALGDRGIYKGLTLWSPNVNIFRD 117
Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
PRWGRGQET GEDP + + VS+++G+QGD L+ +AC KHF + +
Sbjct: 118 PRWGRGQETYGEDPYLASQLGVSFIQGLQGD-------GPYLKTAACVKHFAVH---SGP 167
Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
R+ F+A V+ +DL +TY P FE+CVK+G + +M AY+ VNG P C L++ R
Sbjct: 168 EPLRHDFNAIVSRKDLYETYLPAFEACVKEGEVNAVMGAYSAVNGEPCCGSPFLITDILR 227
Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQK 346
WGF G SDC A+ + K+ D+V L AG D+NCG K A +Q
Sbjct: 228 NDWGFEGMYISDCWAIRDFHLNHAVTKNQVDSVALALNAGCDLNCGCEYLSLEK-AYQQG 286
Query: 347 KLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVL 406
+ I +A + + R LGLF+ + T + IG + + H+ +A +A+ + +VL
Sbjct: 287 LIDRKTITQACIRVMTTRFALGLFSEDCT---YSNIGYEQNDTEEHRKVAFKASCNSLVL 343
Query: 407 LKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV--- 463
LKN G+LPL ++A+IGPNA+S + L GNY G S T L+ + + +V
Sbjct: 344 LKND-GMLPLDSRSLHAIAIIGPNADSREALWGNYHGTSSTYTTVLEGFRKTLGESVKVK 402
Query: 464 YYPGC-------DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL---------D 507
Y G + +A + I +A+ +A +D ++L +G D+T E E D
Sbjct: 403 YSQGSAIQKEKLERLAEPNDRIAEAIAVATVSDTIILCLGYDETVEGEMHDDGNGGWAGD 462
Query: 508 RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGA 567
+ DL LP Q+ L+ VA KP++LVLL GG +D ++ N+ ++L YPG+ G
Sbjct: 463 KQDLRLPPCQRALLKAVASTG-KPIVLVLLSGGAIDPEIERFP-NVKALLQGWYPGQEGG 520
Query: 568 VALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPF 627
+A+A I G +NP G LP+T+Y + + D +M GRTYR+ + K ++PF
Sbjct: 521 LAIAHTILGLNNPSGHLPVTFYRSETVLPDFCDYRME--------GRTYRYVQEKVLYPF 572
Query: 628 GCGLSYSKYSY 638
G GLSY+ +SY
Sbjct: 573 GFGLSYTTFSY 583
>gi|443695317|gb|ELT96258.1| hypothetical protein CAPTEDRAFT_179825 [Capitella teleta]
Length = 750
Score = 378 bits (970), Expect = e-102, Method: Compositional matrix adjust.
Identities = 231/645 (35%), Positives = 340/645 (52%), Gaps = 48/645 (7%)
Query: 14 CLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLV 73
+C T + T S PS+ + ++FPF +LPI R DL+SRLT+++ I+Q V
Sbjct: 8 VVCLTLISTASFSQDNVLRFAPSSHALDSFPFRNVSLPIETRLNDLISRLTIEDAINQTV 67
Query: 74 NS----APAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSY 129
P I RLGI E+ +E L GV AT FPQ + AASF
Sbjct: 68 ARYGKFTPGIERLGIKPIEYITECLRGVRR-----------ENATGFPQALGLAASFSRD 116
Query: 130 LWYRIGQAIGLEARALYNAG------QAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVT 183
L R+ A+ +E RA YN A G+T ++P INI R P WGR QET GEDP ++
Sbjct: 117 LMQRVATAVSVEVRAFYNHDIQRETYGAHGITCFSPVINILRHPLWGRNQETYGEDPYLS 176
Query: 184 GKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLA 243
G+ A YV G+QGD L+ SA CKHF A+ + ++ FDA++ +DL
Sbjct: 177 GELASQYVSGLQGDD------PRYLRVSAGCKHFDAHGGPDTIPVRKFGFDAKIEERDLQ 230
Query: 244 DTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVS 303
T+ P F+ C+ + +MC++N +NG+PSCA++ LL+ R QWG+ G++ SD AV
Sbjct: 231 MTFLPAFKKCIA-AKPYNVMCSFNSINGVPSCANKRLLTDVLRAQWGYEGFVVSDDAAVE 289
Query: 304 IIYDAEGYAKSPEDAVVDVLKAGMDVN-CGSFLQKHTK--AAVKQKKLPESEIDRALHNL 360
I+ Y S E A V+ +K+G ++ G F + + A+ + + + E+ + +
Sbjct: 290 YIFTEHHYNSSFETAAVEAIKSGCNMELVGKFDPSYWQLTKALNEHLITKDELMENVRPV 349
Query: 361 FSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSK 420
F R LG F+ P + PF +I DVV S HQ LAL+AA VLLKN LPL K+
Sbjct: 350 FLTRFLLGEFDP-PALNPFNQITKDVVLSAEHQRLALEAAVKSFVLLKNDRNFLPLLKNS 408
Query: 421 SVSLALIGPNANSAKTLLGNYAGPSCRS--ITPLQALQNYVENTVYYPGCDTVACSSASI 478
++A++GP +N L+G+Y+ + S +TPL ++ N + GC C+
Sbjct: 409 LKTVAVVGPMSNYTDGLIGDYSTDTDPSLILTPLHGIKKLAPNVQFASGCSNSTCTDYRA 468
Query: 479 DKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITR-VAEAAKKPVILVLL 537
GA V + +G E E DR D+VLPG Q +L+ V A +PV+L+L
Sbjct: 469 TDVAAAVDGAQVVFVALGTGFIVEAENNDRSDIVLPGAQLQLLKDAVYHANGRPVVLLLF 528
Query: 538 CGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIF---GDHNPGGRLPMTWYPQDYI 594
GGP+D+TFA+ I SI+ +P A+ ++ G +P GRLP+TW P
Sbjct: 529 NGGPLDVTFAQLTSGIVSIVECFFPAMMTGEAIYRMLINNEGISSPAGRLPLTW-PAYLN 587
Query: 595 KVP-MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSY 638
+VP +TD M+ GRTYR+Y ++PFG GLSY+++ Y
Sbjct: 588 QVPNITDYTMK--------GRTYRYYTEDPLYPFGYGLSYTQFKY 624
>gi|317057539|ref|YP_004106006.1| glycoside hydrolase family protein [Ruminococcus albus 7]
gi|315449808|gb|ADU23372.1| glycoside hydrolase family 3 domain protein [Ruminococcus albus 7]
Length = 691
Score = 377 bits (968), Expect = e-101, Method: Compositional matrix adjust.
Identities = 244/653 (37%), Positives = 350/653 (53%), Gaps = 67/653 (10%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
+ TL +RA L +T +E+ SQL APA+ RLGIPAY WW+E +HG+A G
Sbjct: 4 YLDETLSAQERAEALTDEMTTEEQASQLRYDAPAVERLGIPAYNWWNEGIHGLARSGV-- 61
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTFW 157
AT FPQ I AA FD L + + EARA YNA G+T W
Sbjct: 62 --------ATMFPQAIGLAAMFDDELTKKTAEVTSEEARAKYNAYSGEEDRDIYKGLTLW 113
Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
APNINIFRDPRWGRG ET GEDP +T K ++ VRG+QGD GK+ ++A+AC KHF
Sbjct: 114 APNINIFRDPRWGRGHETFGEDPYLTTKNGMAVVRGLQGD----GKV---IKAAACAKHF 166
Query: 218 TAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCAD 277
+ + R+ FDA+ +D+ +TY P FE+ VK+ + +M AYNRVNG P+CA
Sbjct: 167 AVH---SGPEAIRHSFDAKANAKDMEETYLPAFEALVKEAKVESVMGAYNRVNGEPACAS 223
Query: 278 RNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQK 337
L+ K ++W F GY SDC A+ ++ + ++ LKAG DVNCG Q
Sbjct: 224 NFLMDKL--KEWEFDGYFVSDCWAIRDFHENHMVTANAIESTAMALKAGCDVNCGCTYQ- 280
Query: 338 HTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLAL 397
+ A+++ + + +I A +L R+RLG+F+ + I D V H+ ++L
Sbjct: 281 NLLVALEKGAVTKEDIRTACVHLMRTRIRLGMFDKK---TEYDDIPYDKVACKEHKAISL 337
Query: 398 QAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQN 457
+ A+ +V+L+N +G+LP+ SK ++A+IGPNA+S L GNY G S R T L +Q+
Sbjct: 338 ECAEKSLVMLEN-NGILPVDTSKYKTIAVIGPNADSRTALEGNYNGLSDRYTTFLNGIQD 396
Query: 458 YVENTVYY-PGCDTVACSSASIDKAVD-------IAKGADHVVLMMGLDQTQEKEE---- 505
+ V + GC +++ +A D AK AD +L +GLD T E EE
Sbjct: 397 RFDGRVIFAEGCHLYKDRVSNLAQAGDRYAEAVAAAKFADMTILCLGLDATIEGEEGDTG 456
Query: 506 -----LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAG 560
D+ L LP Q+EL+ ++ A KPV+ V +C G T +K D +++ A
Sbjct: 457 NEFSSGDKNGLTLPPPQRELVKKIM-AVGKPVVTV-VCAGSAINTESKPD----ALIHAF 510
Query: 561 YPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFY 619
YPG G ALAEV+FGD +P G+LP+T+Y +D K+P TD M+ GRTYR+
Sbjct: 511 YPGAEGGKALAEVLFGDVSPSGKLPVTFY-EDTDKLPEFTDYSMK--------GRTYRYT 561
Query: 620 EGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPE 672
++PFG GL+Y K + +S K E+ ++ K E
Sbjct: 562 TENVLYPFGYGLTYGSVKVTKVEYKDGKAVVTAENSGKATEDVIQLYIKDYSE 614
>gi|350295750|gb|EGZ76727.1| glycoside hydrolase [Neurospora tetrasperma FGSC 2509]
Length = 839
Score = 377 bits (967), Expect = e-101, Method: Compositional matrix adjust.
Identities = 271/792 (34%), Positives = 388/792 (48%), Gaps = 95/792 (11%)
Query: 46 CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
C T +RA LV +LT+DEK+ LV+ A R+G+P Y WWSE LHGVAG G+
Sbjct: 37 CDVTGTAPERAASLVDQLTIDEKLVNLVDQALGASRIGLPKYAWWSEGLHGVAG-SPGVT 95
Query: 106 FNGT---IRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
FN T ATSF I ASFD L Y +G AI EARA N G G+ +W PN+N
Sbjct: 96 FNTTGYPFSYATSFANAINLGASFDDDLVYEVGTAISTEARAFANFGFG-GLDYWTPNVN 154
Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL 222
++DPRWGRG ETPGEDPL Y + + G++G+ + + A CKH+ AYDL
Sbjct: 155 PYKDPRWGRGAETPGEDPLHIKGYVKAMLAGLEGN-------ETVRKVIATCKHYAAYDL 207
Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRV------------- 269
+ W G TRY+F+A VT+QDL++ Y PPF+ C + + IMC+YN +
Sbjct: 208 ERWHGLTRYEFEAIVTLQDLSEYYLPPFQQCARDSKVGSIMCSYNALTIRDMAGGSKPDE 267
Query: 270 -----NGIPSCADRNLLSKTARRQWGF---HGYITSDCDAV-SIIYDAEGYAKSPEDAVV 320
P+CA+ L++ R W + + YITSDC+A+ + D ++++P +A
Sbjct: 268 IINLTTAQPACANTYLMT-ILRDHWNWTEHNNYITSDCNAILDFLPDNHNFSQTPAEAAA 326
Query: 321 DVLKAGMDVNC---GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFN------ 371
KAG D C GS L A Q LPE+ ID AL L+ +R G +
Sbjct: 327 AAYKAGTDTVCEVSGSPLTD-VVGAYNQSLLPEAVIDTALRRLYEGLIRAGYLDHGRSAV 385
Query: 372 -----GNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLAL 426
G+ + + + + V +P+ Q LAL++A +GIVLLKNS LLPL S +AL
Sbjct: 386 AGGDGGSFSSPAYDALNWNDVNTPSTQELALRSATEGIVLLKNSGSLLPLDFSGK-KVAL 444
Query: 427 IGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAK 486
IG AN+ T+ G Y+G PL A Q + Y G A + A
Sbjct: 445 IGHWANATGTMRGPYSGIPPFYHNPLYAAQQLNLSLSYANGPVVNASDPDTWTAPALAAA 504
Query: 487 GADHVVLMM-GLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
VVL G D T E+LDR + P Q +L++ +A KPV+++ L G VD +
Sbjct: 505 EGADVVLYFGGTDTTVASEDLDRESIAWPEAQMKLLSELA-GLGKPVVVIQL-GDQVDDS 562
Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMR 604
+ N+ SILW GYPG++G A+ +V+ G P GRLP+T YP+ Y+ +VP+T+M +R
Sbjct: 563 SLLNNGNVSSILWVGYPGQSGGTAVFDVLTGKKAPAGRLPVTQYPEGYVDEVPLTEMALR 622
Query: 605 P------------------------QATSGN-----PGRTYRFYEGKEVFPFGCGLSYSK 635
P ++T GN PGRTY++Y V PFG GL Y+
Sbjct: 623 PFNHSSSNLEEEVSVQGGASLTIQARSTPGNKTLSSPGRTYKWYS-TPVLPFGYGLHYTT 681
Query: 636 YSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEM 695
++ S S ++ H P F + +++ + N G
Sbjct: 682 FNVSLSLSSNASSPSFSIPS--LLTPCTATHLDLCP-----FSPSANSALSVSITNTGTH 734
Query: 696 AGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCE--SLSRAREDGL 753
+ LLF+ P+K LV ++ V + K + + P ++SR DG
Sbjct: 735 TSDYVALLFLSGEFGPEPYPLKTLVSYKRV-KDIKPGETVTVKDVPVSLGAISRVDGDGN 793
Query: 754 MVIEEGTHFLVV 765
V+ GT+ VV
Sbjct: 794 TVLYPGTYRFVV 805
>gi|443692971|gb|ELT94448.1| hypothetical protein CAPTEDRAFT_221920 [Capitella teleta]
Length = 757
Score = 376 bits (965), Expect = e-101, Method: Compositional matrix adjust.
Identities = 245/624 (39%), Positives = 332/624 (53%), Gaps = 51/624 (8%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQLVNS-----APAIPRLGIPAYEWWSEALHGV 97
FPF +L RA DLV+RLTL+E Q S PAI RLGI Y W +E L G
Sbjct: 20 FPFQDPSLSWDDRADDLVARLTLEEIAPQTQASYGGQHTPAIERLGIKPYVWITECLAGQ 79
Query: 98 AGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG---- 153
AT++PQ I AASF L + + + I E RA +NA +A+G
Sbjct: 80 VNTN-----------ATAYPQPIGMAASFSEELLFNVSRDISYEVRAHWNANRAVGKYST 128
Query: 154 ---MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQA 210
++ ++P INI R P WGR QET GEDPL++G A S+VRG+QGD L+A
Sbjct: 129 KVGLSCFSPVINIMRHPLWGRNQETYGEDPLLSGTLAQSFVRGLQGDD------PRYLRA 182
Query: 211 SACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVN 270
+A CKHF + +R+ FDA+V M+D T+ P F+ CV G S +MC+YNR+N
Sbjct: 183 NAGCKHFDVHGGPEDIPVSRFSFDAKVNMRDWRMTFLPQFKMCVDAGSYS-LMCSYNRIN 241
Query: 271 GIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVN 330
GIP+CA++ LL+ R +WGFHGYI SD A+S I + Y S VV +KAG ++
Sbjct: 242 GIPACANKQLLTDITRDEWGFHGYIVSDSGAISNIKEQHHYTNSTVATVVAAIKAGTNLE 301
Query: 331 CG---SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVV 387
G + A+KQ L E EI + L R+RLG F+ M + KIG DV+
Sbjct: 302 LGGGSNMYYPKQLDAMKQGLLTEKEIRDNVRPLLYTRLRLGEFDPE-AMVDYNKIGVDVI 360
Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
SP H+ A++AA G VLLKN + LLP+ K S LA++GP N+ L G Y+
Sbjct: 361 QSPEHREQAVKAAYMGFVLLKNHNNLLPIKKQYS-KLAIVGPFTNATSELFGTYSSEVNL 419
Query: 448 SITP--LQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE 505
T + L +T GC ACS D GAD V++ +G Q E E
Sbjct: 420 KFTSTIFEGLSPLGGSTRSANGCTNSACSGYVRDDVETAVAGADLVIVALGSGQRFESEG 479
Query: 506 LDRVDLVLPGRQQELITR-VAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGE 564
DR L L G Q +++ V + PVILVL+ GP+DIT+AK D + +IL GYP +
Sbjct: 480 NDRAYLDLHGHQLDILKDAVFFSNGAPVILVLINAGPLDITWAKLDPGVTAILSCGYPAQ 539
Query: 565 AGAVALAEVIFGDH---NPGGRLPMTWYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFYE 620
+ AL + P GRL TW P + +VP +TD M+ GRTYR+Y
Sbjct: 540 STGEALRRSLTMSEPQAAPAGRLQATW-PLNLDQVPKITDYTMQ--------GRTYRYYV 590
Query: 621 GKEVFPFGCGLSYSKYSYKFKAVS 644
G+ ++PFG GLSY+ +SY ++S
Sbjct: 591 GEPLYPFGFGLSYTSFSYTRLSIS 614
>gi|424661938|ref|ZP_18098975.1| hypothetical protein HMPREF1205_02324 [Bacteroides fragilis HMW
616]
gi|404578249|gb|EKA82984.1| hypothetical protein HMPREF1205_02324 [Bacteroides fragilis HMW
616]
Length = 722
Score = 374 bits (959), Expect = e-100, Method: Compositional matrix adjust.
Identities = 241/721 (33%), Positives = 381/721 (52%), Gaps = 79/721 (10%)
Query: 51 PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
PI+ R + L+ ++TL EK SQLV+ + +IPRL +PAY +W+E LHGVA G+
Sbjct: 57 PIAVRVKTLIQQMTLAEKASQLVSESDSIPRLNLPAYNYWNECLHGVARAGE-------- 108
Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY-NAGQAIGMTFWAPNINIFRDPRW 169
T FPQ I A+++D+ L R+ AI EAR Y G+ G+T+W+P IN+ RDPRW
Sbjct: 109 --VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGK--GLTYWSPTINMARDPRW 164
Query: 170 GRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTT 229
GR +ET GEDP +T + V++V+G+QGD L+ A KHF A + +N
Sbjct: 165 GRNEETYGEDPYLTSRLGVAFVKGLQGDH------PAYLKTVATIKHFVANNEEN----N 214
Query: 230 RYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQW 289
R+ +++ + L + Y P +E+CVK+ +M AYN NG+P R LL + R++W
Sbjct: 215 RFSSSSQIPTKQLYEYYFPAYEACVKEADVQSVMTAYNAFNGVPPSGSRWLLGEVLRKEW 274
Query: 290 GFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLP 349
GF G++ SDC A+ ++ S E+A + +G D+ CG+ ++ AVKQ +
Sbjct: 275 GFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVKQGLIS 334
Query: 350 ESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKN 409
E+ ID+AL + + R +LG F+ + P+ ++ LA +AA +VLLKN
Sbjct: 335 EATIDQALTRVLTARFKLGEFD-PMELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLLKN 393
Query: 410 SHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCD 469
+ LLPL K K+ S+A++GP A+ LG Y+G SIT L+ +++ + +
Sbjct: 394 EN-LLPLSKEKTKSVAVVGPFAD--HNYLGGYSGQPPYSITLLKGVKDLMGKRGKVNYLN 450
Query: 470 TVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAK 529
+ S SI AV KG D V++ +G D+ +E D + LP Q++L+ + +
Sbjct: 451 GIGASRDSIVAAV---KGVDVVLVALGSDEKMARENHDMTSIYLPEEQEKLLKAIYQVNP 507
Query: 530 KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWY 589
+ ++LV G P+ +A D +I +I+ A YPG+ ALA ++FG+ NP G+LPMT Y
Sbjct: 508 R-IVLVFHSGNPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLLFGNENPSGKLPMTIY 564
Query: 590 PQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLY 649
+ + D M GRTYR+ +G+ ++ FG GLSY+ S++F + N
Sbjct: 565 KTEEQLPDILDFDMW-------KGRTYRYMKGEPLYSFGHGLSYT--SFEFDNIQGN--- 612
Query: 650 LNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPAR 709
D + ++ + E N G++AG+ V ++V
Sbjct: 613 -------------DTLQPDAILQCSVELS------------NSGQLAGEEVVQVYVS--- 644
Query: 710 RGN----GRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVV 765
R N P+K+LV F+ V L + EK ++ F ++P E LS EDG + G + L +
Sbjct: 645 RENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTIAPRE-LS-VWEDGKWRMLSGKYTLFI 702
Query: 766 G 766
G
Sbjct: 703 G 703
>gi|332638085|ref|ZP_08416948.1| glycoside hydrolase family 3 protein [Weissella cibaria KACC 11862]
Length = 713
Score = 373 bits (958), Expect = e-100, Method: Compositional matrix adjust.
Identities = 247/717 (34%), Positives = 366/717 (51%), Gaps = 94/717 (13%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
++A+ +V ++T+DEKI Q+ APAI RL IP Y +W+EALHGVA G A
Sbjct: 12 RQAKVIVDQMTIDEKIGQIKYEAPAIERLNIPEYNYWNEALHGVARAGV----------A 61
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTFWAPNINIFRD 166
T FPQ I AA+FD L I IG E RA YN G+TFW+PN+NIFRD
Sbjct: 62 TVFPQAIGLAATFDDQLINDIADVIGTEGRAKYNEFTKHEDRDIYKGLTFWSPNVNIFRD 121
Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
PRWGRG ET GEDP +T K+ V++++G+QG L+ +A KHF + +
Sbjct: 122 PRWGRGHETYGEDPFLTSKFGVAFIKGLQGQA-------KYLKLAATAKHFAVH---SGP 171
Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
R+ FDA V+ +DL +TY P F++ V++ IM AYN V+G+P+ LL
Sbjct: 172 EGLRHGFDAVVSDKDLYETYLPAFKAAVEEADVESIMTAYNAVDGVPASVSEMLLRDILH 231
Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQK 346
+W F G++ SD A +++ Y K + + +KAG+++ G +++ A+ +
Sbjct: 232 DKWSFEGHVVSDYMAPEDVHENHKYTKDAAETMGLAIKAGLNLVAGH-IEQSLHEALNRG 290
Query: 347 KLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVL 406
+ E EI A+ +L++ R+RLG+F T + I + + AH L+ AA+ VL
Sbjct: 291 LVTEEEITNAVISLYATRVRLGMF---ATDNEYDAIPYEANDTKAHNNLSEIAAEKSFVL 347
Query: 407 LKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV--- 463
LKN G+LPL K ++A++GPNA+S LLGNY G RS T L+ +Q + + V
Sbjct: 348 LKND-GVLPLRKETMEAIAVVGPNAHSEIALLGNYFGTPSRSYTILEGIQERLGDDVRVH 406
Query: 464 YYPGC----DTVACSSASIDK----AVDIAKGADHVVLMMGLDQTQEKEELD-------- 507
Y G D A A D+ A+ A+ +D +V ++GLD T E EE D
Sbjct: 407 YSIGSGVFQDHAAEPLAKADERESEAIIAAEHSDVIVAVLGLDSTIEGEEGDAGNSQGAG 466
Query: 508 -RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
+ +L LPGRQ++L+ R+ A KPV+++L G + + + N+ +I+ YPG G
Sbjct: 467 DKPNLSLPGRQRQLLERLL-AVGKPVVVLLASGSSLQLDGLENHPNLRAIMQIWYPGARG 525
Query: 567 AVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFP 626
+A+A+V+FG +P G+LP+T+Y D M GRTYR+ + ++P
Sbjct: 526 GLAVADVLFGTVSPSGKLPVTFYKNTDNLPAFEDYNM--------AGRTYRYMTEEALYP 577
Query: 627 FGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVT 686
FG GL+YS VE D+ KS E T T
Sbjct: 578 FGYGLTYSS-----------------------VELSDL-QVKSYEETAT---------AT 604
Query: 687 IGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCE 743
+ ++N G V ++VK P QL GF+ V L K I F+L P +
Sbjct: 605 VTIQNTGNFDTDEVVQVYVKDLESEFAVPNAQLKGFKRVFLGKGSKQTITFDLRPQD 661
>gi|325679939|ref|ZP_08159508.1| glycosyl hydrolase family 3 C-terminal domain protein [Ruminococcus
albus 8]
gi|324108377|gb|EGC02624.1| glycosyl hydrolase family 3 C-terminal domain protein [Ruminococcus
albus 8]
Length = 691
Score = 373 bits (958), Expect = e-100, Method: Compositional matrix adjust.
Identities = 239/616 (38%), Positives = 338/616 (54%), Gaps = 71/616 (11%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
+ +L +RA L +T +E+ SQL APAI RLGIPAY WW+E +HG+A G
Sbjct: 4 YLDESLSAEERAEALTDEMTTEEQASQLRYDAPAIERLGIPAYNWWNEGIHGLARSGV-- 61
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTFW 157
AT FPQ I AA FD L R + EARA YNA G+T W
Sbjct: 62 --------ATMFPQAIGLAAMFDDELTKRTAEITSEEARAKYNAYTVEGDRDIYKGLTLW 113
Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
APNINIFRDPRWGRG ET GEDP +T + + VRG+QGD GK+ ++A+AC KHF
Sbjct: 114 APNINIFRDPRWGRGHETFGEDPYLTAQNGKAVVRGLQGD----GKV---MKAAACAKHF 166
Query: 218 TAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCAD 277
+ + R+ FDA+ +D+ +TY P FE+ VK+ + +M AYNRVNG P+CA
Sbjct: 167 AVH---SGPEALRHSFDAKADAKDMEETYLPAFEALVKEAKVESVMGAYNRVNGEPACAS 223
Query: 278 RNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQK 337
L+ K ++W F GY SDC A+ ++ + ++ LKAG DVNCG Q
Sbjct: 224 DYLMEKL--KEWEFDGYFVSDCWAIRDFHEHHMVTANAVESAAMALKAGCDVNCGCTYQ- 280
Query: 338 HTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQ--PFGKIGADVVCSPAHQVL 395
+ AA+ + + + +I A +L R+RLG+F+ + P+ K V C+ H+ +
Sbjct: 281 NLLAALDKGLITKEQIRTACVHLMRTRIRLGMFDKHTDFDDIPYSK----VACAE-HKAV 335
Query: 396 ALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL 455
+L+ A+ +VLLKN +G+LPL K ++A+IGPNA+S L GNY G S R T L +
Sbjct: 336 SLECAEKSLVLLKN-NGILPLDDKKYKTIAVIGPNADSRTALEGNYNGLSDRYTTFLNGI 394
Query: 456 QNYVENTVYYP-GCDTVACSSASIDKAVD-------IAKGADHVVLMMGLDQTQEKEE-- 505
Q+ E V + GC S + + +A D AK AD V++ +GLD T E EE
Sbjct: 395 QDRFEGRVIFAEGCHLYKKSISGLAQAGDRYAEAVAAAKNADLVIMCVGLDATIEGEEGD 454
Query: 506 -------LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILW 558
D+ L LP Q+ L+ ++ KPV+ V+ G ++ + +++
Sbjct: 455 TGNEFSSGDKNGLTLPPPQKILVEKIMSVG-KPVVTVVCAGSAINT-----ESQPDALIH 508
Query: 559 AGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVP-MTDMKMRPQATSGNPGRTYR 617
A YPG G+ ALAEV+FGD +P G+LP+T+Y +D K+P TD M+ GRTYR
Sbjct: 509 AFYPGAEGSKALAEVLFGDVSPSGKLPVTFY-EDTDKLPEFTDYSMK--------GRTYR 559
Query: 618 FYEGKEVFPFGCGLSY 633
+ +FPFG GL+Y
Sbjct: 560 YTTDNILFPFGYGLTY 575
>gi|358061481|ref|ZP_09148135.1| hypothetical protein HMPREF9473_00197 [Clostridium hathewayi
WAL-18680]
gi|356700240|gb|EHI61746.1| hypothetical protein HMPREF9473_00197 [Clostridium hathewayi
WAL-18680]
Length = 695
Score = 372 bits (955), Expect = e-100, Method: Compositional matrix adjust.
Identities = 230/632 (36%), Positives = 343/632 (54%), Gaps = 65/632 (10%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
++A LV ++TL+E+ SQ+ APA+PRLGIPAY WW E LHGVA G A
Sbjct: 8 RKAVRLVEQMTLEERASQMRYDAPAVPRLGIPAYNWWGEGLHGVARAGT----------A 57
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYN-----AGQAI--GMTFWAPNINIFRD 166
T FPQ I AA FD L I + E RA YN + I G+TFW+PN+NIFRD
Sbjct: 58 TMFPQAIAMAAMFDVELTEEIANVVSTEGRAKYNQFCEEGDRDIYKGLTFWSPNVNIFRD 117
Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
PRWGRG ET GEDP +T + ++VRG+QGD + L+ +AC KHF + +
Sbjct: 118 PRWGRGHETYGEDPYLTSRLGTAFVRGLQGDGEH-------LKIAACAKHFAVH---SGP 167
Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
R++F A + +DL +TY P FE+CVK+ +M AYN +G P CA+ L+ + R
Sbjct: 168 EALRHEFWADTSKKDLWETYLPAFEACVKEAHVESVMGAYNSYHGEPCCANTLLMEEILR 227
Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTKAAVKQ 345
QWGF G+ SDC A+ + + ++ +K G D+NCG ++LQ A ++
Sbjct: 228 GQWGFEGHFVSDCWAIRDFHMNYMVTDTAMESAALAVKKGCDLNCGNTYLQ--VLKACEE 285
Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
L ++ + A+ LF+ R LG+ G T + I +VV H+ LA++AA+ +V
Sbjct: 286 GLLDDACVTEAVVRLFTTRYLLGM--GEET--EYDDIPYEVVECKEHRELAVEAARRSMV 341
Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV-- 463
LLKN GLLPL K ++A+IGPNA++ L+GNY G S T L+ +Q+ V V
Sbjct: 342 LLKND-GLLPLHAEKLNTIAVIGPNADNRTALIGNYHGTSSCYTTILEGIQDAVGEDVRV 400
Query: 464 -YYPGC-------DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---------L 506
Y GC + +A + + +A +AK +D VVL +GLD+T E EE
Sbjct: 401 LYAEGCHLFKDRVEHLAVAGDRLSEARIVAKHSDVVVLCVGLDETLEGEEGDTGNSHASG 460
Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
D+ DL+LP Q+ L+ + KPV++ + G +D++ A+ G+++ YPG G
Sbjct: 461 DKKDLLLPESQRRLMEEILNLG-KPVVVCNMSGSAIDLSLAQ--EKAGAVIQVWYPGAEG 517
Query: 567 AVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFP 626
ALA+++FG +P G+LP+T+Y P D M GRTYR+ + ++P
Sbjct: 518 GRALADLLFGKASPSGKLPVTFYKDLENLPPFEDYSM--------DGRTYRYLTAEPLYP 569
Query: 627 FGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKM 658
FG GL+Y +++ ++ +S+ +
Sbjct: 570 FGFGLTYGTVELSEGEMTEEAAWVTVKNSSDI 601
>gi|371776901|ref|ZP_09483223.1| beta-glucosidase [Anaerophaga sp. HS1]
Length = 720
Score = 370 bits (951), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 250/728 (34%), Positives = 374/728 (51%), Gaps = 96/728 (13%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
F L I RA+ L+S LTL EKIS L + P + RL IPAY WW+EALHGVA G+
Sbjct: 29 FRDEALDIETRAKALLSELTLKEKISLLGYNNPPVERLQIPAYNWWNEALHGVARAGE-- 86
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN-------AGQAIGMTFW 157
AT FPQ I AA+FD+ L YRI AI EAR+ YN Q +G+TFW
Sbjct: 87 --------ATVFPQAIALAATFDTTLVYRIADAISTEARSKYNINRSKGFQNQYLGITFW 138
Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
PNINIFRDPRWGRGQET GEDP +T ++V+G+QG + + +L+ +A KHF
Sbjct: 139 TPNINIFRDPRWGRGQETYGEDPFLTASMGKAFVKGLQG-----SEPERRLKTAAGAKHF 193
Query: 218 TAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCAD 277
+ + R+ F+A V +DL +TY P F++ V+ G + IMCAYNRVNG P C
Sbjct: 194 AVH---SGPEADRHHFNAVVDEKDLRETYLPAFKALVENG-VTTIMCAYNRVNGEPCCTG 249
Query: 278 RNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQK 337
+ LL R +WGF G + +DC A+ I+ + + +KAG++++C + LQ+
Sbjct: 250 KTLLQDILRDEWGFKGQVVTDCWALDDIWLRHKTIPTRVEVAAAAVKAGVNLDCANILQE 309
Query: 338 HTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLAL 397
+ A++++ L ++D AL +++LG ++ +P+ P+ G D V + H LA
Sbjct: 310 DVQDAIEKRLLTLEQVDSALLPTLQTQLKLGFYD-DPSHSPYRHYGIDSVNNSYHISLAK 368
Query: 398 QAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQN 457
+AA+ +VLLKN G+LPL K S+ ++G NA S L GNY G S +T
Sbjct: 369 EAAEKSMVLLKND-GILPLKKDTISSIMVVGENAASISALTGNYHGLSGNMVT------- 420
Query: 458 YVENTVYY--PGCDT---VACSSASIDK--AVDIAKGADHVVLMMGLDQTQEKEE----- 505
+VE V PG CS A + A D + ++GL E E
Sbjct: 421 FVEGLVKAGGPGMSVQYDYGCSFADTSHFGGIWAAGFTDVTIAVIGLSPLLEGEHGDAFL 480
Query: 506 ----LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGY 561
D+ DL +P + + ++ E+ PVI V+ G +DI+ + +I++A Y
Sbjct: 481 SNWGGDKKDLRMPRSHEIYLKKLRESHNHPVIAVVTGGSALDIS--AIEPYADAIIYAWY 538
Query: 562 PGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEG 621
PGE G ALA++IFG+ +P GRLP+T+Y P D M RTYR+++G
Sbjct: 539 PGEQGGTALADLIFGEVSPSGRLPITFYKDIKDLPPYHDYNM--------TNRTYRYFQG 590
Query: 622 KEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETR 681
++PFG GLSY+ + Y++ + STK+ E+ D++
Sbjct: 591 DVLYPFGYGLSYTSFHYEWLS----------KPSTKVSED-DIIS--------------- 624
Query: 682 KFLVTIGVKNHGEMAGKHPVLLFV--KPARRGNGRPIKQLVGFQSVILNAKEKAEIVFEL 739
V I V N G M + +++ R P+++L GF + + A + ++
Sbjct: 625 ---VNIAVTNTGTMDADEVIQVYIVYPDIER---MPLRELKGFSRIHIKAGQTQNTDIQI 678
Query: 740 SPCESLSR 747
P ++L +
Sbjct: 679 -PVKNLKK 685
>gi|332377068|gb|AEE64772.1| Xyl3A [Ruminococcus albus 8]
Length = 691
Score = 370 bits (950), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 238/616 (38%), Positives = 336/616 (54%), Gaps = 71/616 (11%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
+ +L +RA L +T +E+ SQL APAI RLGIPAY WW+E +HG+A G
Sbjct: 4 YLDESLSAEERAEALTDEMTTEEQASQLRYDAPAIERLGIPAYNWWNEGIHGLARSGV-- 61
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTFW 157
AT FPQ I AA FD L R + EARA YNA G+T W
Sbjct: 62 --------ATMFPQAIGLAAMFDDELTKRTAEITSEEARAKYNAYTVEGDRDIYKGLTLW 113
Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
APNINIFRDPRWGR ET GEDP +T + + VRG+QGD GK+ ++A+AC KHF
Sbjct: 114 APNINIFRDPRWGRSHETFGEDPYLTAQNGKAVVRGLQGD----GKV---MKAAACAKHF 166
Query: 218 TAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCAD 277
+ + R+ FDA+ +D+ +TY P FE+ VK+ + +M AYNRVNG P+CA
Sbjct: 167 AVH---SGPEALRHSFDAKADAKDMEETYLPAFEALVKEAKVESVMGAYNRVNGEPACAS 223
Query: 278 RNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQK 337
L+ K ++W F GY SDC A+ ++ + ++ LKAG DVNCG Q
Sbjct: 224 DYLMEKL--KEWEFDGYFVSDCWAIRDFHEHHMVTANAVESAAMALKAGCDVNCGCTYQ- 280
Query: 338 HTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQ--PFGKIGADVVCSPAHQVL 395
+ AA+ + + + +I A +L R+RLG+F+ + P+ K V C+ H+ +
Sbjct: 281 NLLAALDKGLITKEQIRTACVHLMRTRIRLGMFDKHTDFDDIPYSK----VACAE-HKAV 335
Query: 396 ALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL 455
+L+ A+ +VLLKN +G+LPL K ++A+IGPNA+S L GNY G S R T L +
Sbjct: 336 SLECAEKSLVLLKN-NGILPLDDKKYKTIAVIGPNADSRTALEGNYNGLSDRYTTFLNGI 394
Query: 456 QNYVENTVYYP-GCDTVACSSASIDKAVD-------IAKGADHVVLMMGLDQTQEKEE-- 505
Q+ E V + GC S + + +A D AK AD V++ +GLD T E EE
Sbjct: 395 QDRFEGRVIFAEGCHLYKKSISGLAQAGDRYAEAVAAAKNADLVIMCVGLDATIEGEEGD 454
Query: 506 -------LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILW 558
D+ L LP Q+ L+ ++ KPV+ V+ G ++ + +++
Sbjct: 455 TGNEFSSGDKNGLTLPPPQKILVEKIMSVG-KPVVTVVCAGSAINT-----ESQPDALIH 508
Query: 559 AGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVP-MTDMKMRPQATSGNPGRTYR 617
A YPG G ALAEV+FGD +P G+LP+T+Y +D K+P TD M+ GRTYR
Sbjct: 509 AFYPGAEGGKALAEVLFGDVSPSGKLPVTFY-EDTDKLPEFTDYSMK--------GRTYR 559
Query: 618 FYEGKEVFPFGCGLSY 633
+ +FPFG GL+Y
Sbjct: 560 YTTDNILFPFGYGLTY 575
>gi|5690010|emb|CAB51937.1| Family 3 Glycoside Hydrolase [Ruminococcus flavefaciens]
Length = 690
Score = 370 bits (950), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 259/744 (34%), Positives = 376/744 (50%), Gaps = 109/744 (14%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
+RA D+ RL+ +EK Q APA RLG AY WWSE LHGVA G A
Sbjct: 15 ERAEDITDRLSTEEKAEQQKYDAPAEERLGKDAYNWWSEGLHGVARAGT----------A 64
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTFWAPNINIFRD 166
T FPQ I AA FD +R G+ EARA YN A G+T W+PN+NIFRD
Sbjct: 65 TMFPQTIGMAAMFDDEAVHRAGETTSREARAKYNEYSAHDDRDIYKGLTLWSPNVNIFRD 124
Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
PRWGRGQET GEDP +T V+Y +G+QGD GK+ L+ +AC KHF + +
Sbjct: 125 PRWGRGQETYGEDPYLTSCLGVAYAKGLQGD----GKV---LRTAACAKHFAVH---SGP 174
Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
TR++FDA+ M+D+ +TY FE+ VK + +M AYNRVNG P+CA +++K
Sbjct: 175 EATRHEFDAKANMKDMTETYIAAFEALVKDAKVESVMGAYNRVNGEPACASDFVMNKL-- 232
Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTKAAVKQ 345
+WGF G+ SDC A+ + G K+ ++ LK G D+NCG ++L H AA +
Sbjct: 233 EEWGFDGHFVSDCWAIRDFHTNHGVTKTAPESAALALKKGCDLNCGNTYL--HLLAAFNE 290
Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
+ E ++ R+ L R+RLG+F+ + + + D+V H+ +L+ ++ +V
Sbjct: 291 GLINEEDLRRSCIKLMRTRVRLGMFDKST---EYDGLDYDIVACDEHKEFSLRCSERSMV 347
Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQN-------Y 458
LLKN +G+LPL SK ++ +IGPNA+S L GNY G + IT L ++ Y
Sbjct: 348 LLKN-NGILPLDGSKYKTIGVIGPNADSVPALEGNYNGKADEYITFLSGIREAHDGRVLY 406
Query: 459 VENT-VYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---------LDR 508
E + +Y C +A + +A I + + LD T E EE D+
Sbjct: 407 TEGSHLYKDRCMGLALPDDRLSEAEIITRTLRCSGSLCWLDATIEGEEGDTGNEFSSGDK 466
Query: 509 VDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAV 568
DL LP Q++L+ V A KPVI+V G +++ + + +++ A YPG+ G
Sbjct: 467 NDLRLPESQRKLVKTVM-AKGKPVIIVTAAGSAINV-----EADCDALIQAWYPGQLGGR 520
Query: 569 ALAEVIFGDHNPGGRLPMTWYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFYEGKEVFPF 627
ALA ++FG +P G+LP+T+Y +D K+P +D M+ RTYR+ EG +FPF
Sbjct: 521 ALANILFGKVSPSGKLPVTFY-EDASKLPDFSDYSMK--------NRTYRYSEGNILFPF 571
Query: 628 GCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTI 687
G GL+YS+ ++L +T V N G+ F E +V I
Sbjct: 572 GYGLTYSE-------TECSELSFENGVATVKVTNT-----------GSRFTED---VVQI 610
Query: 688 GVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSR 747
+K + E N P L GF+ V L+A E + IV P +
Sbjct: 611 YIKGYSE-----------------NAVPNHSLCGFKRVALDAGE-SRIVQITLPERAFMA 652
Query: 748 AREDGLMVIEEGTHFLVVGDEEYP 771
E G I+EG+ F + P
Sbjct: 653 VNEKGEW-IKEGSEFTLYAGTSQP 675
>gi|291240561|ref|XP_002740190.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 763
Score = 370 bits (950), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 239/628 (38%), Positives = 334/628 (53%), Gaps = 56/628 (8%)
Query: 40 TETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIP--RLGIPAYEWWSEALHGV 97
+ +PF T+L +R DLVSRLTLDE + Q+ ++PA P RLGI Y W SE LHGV
Sbjct: 23 SAAYPFQNTSLSWEERVDDLVSRLTLDEMVLQMARTSPAPPIDRLGIKPYVWNSECLHGV 82
Query: 98 AGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQ 150
G+ AT+FPQ I AASF L + +AIGLE RA +N +
Sbjct: 83 VP-PDGL--------ATAFPQSIGLAASFSPDLLSDVAKAIGLEVRAKHNDYVQRGVYQE 133
Query: 151 AIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQA 210
G++ ++P INI R P WGR QET GEDP + G+ +YVRG+QGD + A
Sbjct: 134 HTGLSCFSPVINIARHPLWGRNQETYGEDPFLIGELGSAYVRGLQGDH------PRYVLA 187
Query: 211 SACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVN 270
+A CKHF + +R+ FDA+V +D T+ P F CVK G S +MC+YNR+N
Sbjct: 188 NAGCKHFDVHGGPEDIPVSRFSFDAKVFERDWQMTFLPAFHECVKAGVYS-VMCSYNRIN 246
Query: 271 GIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVN 330
+P+CA+ LL+ R++WGF GY+ SD AV I + Y S D V + AG +++
Sbjct: 247 EVPACANTRLLTDILRKEWGFDGYVVSDEGAVEFIMTSHHYTDSIVDTVASAVNAGCNLD 306
Query: 331 ----CGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADV 386
G + AV K+ E + + LF RMRLG F+ P + P+ + V
Sbjct: 307 LAFPVGDGMYIKIGDAVTAGKIKEKTVVERVKPLFYTRMRLGEFD-PPELNPYANLNLSV 365
Query: 387 VCSPAHQVLALQAAQD-----GIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNY 441
V S H+ LA++AA VLLK +LPL + LA+IGP A++ L G+Y
Sbjct: 366 VQSEEHRELAVKAALQSFVLLNFVLLKREGRVLPL-DTLVNKLAVIGPFADNPSYLFGDY 424
Query: 442 AGPSCRS--ITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQ 499
+ + +TP + L N +T PGC T C++ + GAD +V+ +G
Sbjct: 425 SPNPDKEFVVTPCKGLSNAARDTRCTPGCLTAPCTTYFSEMVKAAVTGADLIVVCLGTGV 484
Query: 500 TQEKEELDRVDLVLPGRQQELITRVAEAAK-KPVILVLLCGGPVDITFAKYDRNIGSILW 558
E E +DR DL LPG+Q +L+ V + A KP+IL+L GP+DI +A + I I+
Sbjct: 485 KIEAEFVDRSDLSLPGKQFQLLQDVVKYANGKPIILLLFNAGPLDIVWAVENPAIQVIVA 544
Query: 559 AGYPGEAGAVALAEVIFGDH-------NPGGRLPMTWYPQDYIKV-PMTDMKMRPQATSG 610
+P +A AL + H NPGGRLP+TW P+ +V PMT+ M
Sbjct: 545 CFFPSQATGDALYRMFMNTHGVDTGNGNPGGRLPITW-PRSMNQVPPMTNYTME------ 597
Query: 611 NPGRTYRFYEGKEVFPFGCGLSYSKYSY 638
GRTYR++ G +FPFG GLSY +SY
Sbjct: 598 --GRTYRYFNGDPLFPFGYGLSYGSFSY 623
>gi|336408348|ref|ZP_08588841.1| hypothetical protein HMPREF1018_00856 [Bacteroides sp. 2_1_56FAA]
gi|423248801|ref|ZP_17229817.1| hypothetical protein HMPREF1066_00827 [Bacteroides fragilis
CL03T00C08]
gi|423253750|ref|ZP_17234681.1| hypothetical protein HMPREF1067_01325 [Bacteroides fragilis
CL03T12C07]
gi|335937826|gb|EGM99722.1| hypothetical protein HMPREF1018_00856 [Bacteroides sp. 2_1_56FAA]
gi|392655379|gb|EIY49022.1| hypothetical protein HMPREF1067_01325 [Bacteroides fragilis
CL03T12C07]
gi|392657742|gb|EIY51373.1| hypothetical protein HMPREF1066_00827 [Bacteroides fragilis
CL03T00C08]
Length = 722
Score = 370 bits (950), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 240/726 (33%), Positives = 380/726 (52%), Gaps = 89/726 (12%)
Query: 51 PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
P++ R R L+ ++TL EK++QLV+ + +IPRL +PAY +W+E LHGVA G+
Sbjct: 57 PVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE-------- 108
Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY-NAGQAIGMTFWAPNINIFRDPRW 169
T FPQ I A+++D+ L R+ AI EAR Y G+ G+T+W+P IN+ RDPRW
Sbjct: 109 --VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGK--GLTYWSPTINMARDPRW 164
Query: 170 GRGQETPGEDPLVTGKYAVSYVRGVQGD--TFNGGKLKGKLQASACCKHFTAYDLDNWKG 227
GR +ET GEDP +T + V++V+G+QGD T+ L+ A KHF A + +N
Sbjct: 165 GRNEETYGEDPHLTSRLGVAFVKGLQGDHPTY--------LKTVATIKHFVANNEEN--- 213
Query: 228 TTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARR 287
R+ +++ + L + Y P +E+CVK+ A +M AYN NG+P LL R+
Sbjct: 214 -NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHWLLDDVLRK 272
Query: 288 QWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKK 347
+WGF G++ SDC A+ ++ S E+A + +G D+ CG+ ++ AV+Q
Sbjct: 273 EWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVEQGL 332
Query: 348 LPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLL 407
+ E+ IDRAL + + R +LG F+ + P+ ++ LA +AA +VLL
Sbjct: 333 ISEAAIDRALTRVLTARFKLGEFD-PMELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLL 391
Query: 408 KNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV---ENTVY 464
KN LLPL K K S+A++GP A+ LG Y+G S++ L+ ++ + Y
Sbjct: 392 KND-ALLPLNKEKIKSVAVVGPFADY--NYLGGYSGQPPYSVSLLKGVKELIGKKGKVTY 448
Query: 465 YPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRV 524
G T S D + KGAD V++ +G D+ +E D + LP Q++L+ +
Sbjct: 449 LNGMGT------SADSIAQVVKGADIVLVALGSDEKMARENHDMPSIYLPEEQEKLLKEI 502
Query: 525 AEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRL 584
+ + ++LV G P+ +A D +I +I+ A YPG+ ALA ++FG+ NP G+L
Sbjct: 503 YQVNPR-IVLVFHTGNPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLLFGNENPSGKL 559
Query: 585 PMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVS 644
PMT Y + + D M GRTYR+ +G+ ++ FG GLSY+ S++F +
Sbjct: 560 PMTIYKTEEQLPDILDFDMW-------KGRTYRYMKGEPLYGFGHGLSYT--SFEFDNIQ 610
Query: 645 QNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLF 704
N D + ++ + E N G++AG+ V ++
Sbjct: 611 GN----------------DTLQSDAILQCSVELS------------NSGQLAGEEVVQVY 642
Query: 705 VKPARRGN----GRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGT 760
V R N P+K+LV F+ V L + EK ++ F ++P E LS EDG + G
Sbjct: 643 VS---RENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTIAPRE-LS-VWEDGKWRMLSGK 697
Query: 761 HFLVVG 766
+ L +G
Sbjct: 698 YTLFIG 703
>gi|265765457|ref|ZP_06093732.1| beta-xylosidase [Bacteroides sp. 2_1_16]
gi|263254841|gb|EEZ26275.1| beta-xylosidase [Bacteroides sp. 2_1_16]
Length = 722
Score = 370 bits (949), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 240/726 (33%), Positives = 381/726 (52%), Gaps = 89/726 (12%)
Query: 51 PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
P++ R R L+ ++TL EK++QLV+ + +IPRL +PAY +W+E LHGVA G+
Sbjct: 57 PVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE-------- 108
Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY-NAGQAIGMTFWAPNINIFRDPRW 169
T FPQ I A+++D+ L R+ AI EAR Y G+ G+T+W+P IN+ RDPRW
Sbjct: 109 --VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGK--GLTYWSPTINMARDPRW 164
Query: 170 GRGQETPGEDPLVTGKYAVSYVRGVQGD--TFNGGKLKGKLQASACCKHFTAYDLDNWKG 227
GR +ET GEDP +T + V++V+G+QGD T+ L+ A KHF A + +N
Sbjct: 165 GRNEETYGEDPHLTSRLGVAFVKGLQGDHPTY--------LKTVATIKHFVANNEEN--- 213
Query: 228 TTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARR 287
R+ +++ + L + Y P +E+CVK+ A +M AYN NG+P LL R+
Sbjct: 214 -NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHWLLDDVLRK 272
Query: 288 QWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKK 347
+WGF G++ SDC A+ ++ S E+A + +G D+ CG+ ++ AV+Q
Sbjct: 273 EWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVEQGL 332
Query: 348 LPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLL 407
+ E+ IDRAL + + R +LG F+ + P+ ++ LA +AA +VLL
Sbjct: 333 ISEAAIDRALTRVLTARFKLGEFDPM-ELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLL 391
Query: 408 KNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV---ENTVY 464
KN LLPL K K S+A++GP A+ LG Y+G S++ L+ ++ + Y
Sbjct: 392 KND-ALLPLNKEKIKSVAVVGPFADY--NYLGGYSGQPPYSVSLLKGVKELIGKKGKVTY 448
Query: 465 YPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRV 524
G T S D + KGAD V++ +G D+ +E D + LP Q++L+ ++
Sbjct: 449 LNGMGT------SADSIAQVVKGADIVLVALGSDEKMARENHDMPSIYLPEEQEKLLKKI 502
Query: 525 AEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRL 584
+ + ++LV G P+ +A D +I +I+ A YPG+ ALA ++FG+ NP G+L
Sbjct: 503 YQVNPR-IVLVFHTGNPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLLFGNENPSGKL 559
Query: 585 PMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVS 644
PMT Y + + D M GRTYR+ +G+ ++ FG GLSY+ S++F +
Sbjct: 560 PMTIYKTEEQLPDILDFDMW-------KGRTYRYMKGEPLYGFGHGLSYT--SFEFDNIQ 610
Query: 645 QNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLF 704
N D + ++ + E N G++AG+ V ++
Sbjct: 611 GN----------------DTLQPDAILQCSVELS------------NSGQLAGEEVVQVY 642
Query: 705 VKPARRGN----GRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGT 760
V R N P+K+LV F+ V L + EK ++ F ++P E LS EDG + G
Sbjct: 643 VS---RENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTIAPRE-LS-VWEDGKWRMLSGK 697
Query: 761 HFLVVG 766
+ L +G
Sbjct: 698 YTLFIG 703
>gi|375357164|ref|YP_005109936.1| putative glycosyl hydrolase [Bacteroides fragilis 638R]
gi|301161845|emb|CBW21389.1| putative glycosyl hydrolase [Bacteroides fragilis 638R]
Length = 722
Score = 370 bits (949), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 240/726 (33%), Positives = 381/726 (52%), Gaps = 89/726 (12%)
Query: 51 PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
P++ R R L+ ++TL EK++QLV+ + +IPRL +PAY +W+E LHGVA G+
Sbjct: 57 PVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE-------- 108
Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY-NAGQAIGMTFWAPNINIFRDPRW 169
T FPQ I A+++D+ L R+ AI EAR Y G+ G+T+W+P IN+ RDPRW
Sbjct: 109 --VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGK--GLTYWSPTINMARDPRW 164
Query: 170 GRGQETPGEDPLVTGKYAVSYVRGVQGD--TFNGGKLKGKLQASACCKHFTAYDLDNWKG 227
GR +ET GEDP +T + V++V+G+QGD T+ L+ A KHF A + +N
Sbjct: 165 GRNEETYGEDPHLTSRLGVAFVKGLQGDHPTY--------LKTVATIKHFVANNEEN--- 213
Query: 228 TTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARR 287
R+ +++ + L + Y P +E+CVK+ A +M AYN NG+P LL R+
Sbjct: 214 -NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHWLLDDVLRK 272
Query: 288 QWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKK 347
+WGF G++ SDC A+ ++ S E+A + +G D+ CG+ ++ AV+Q
Sbjct: 273 EWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVEQGL 332
Query: 348 LPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLL 407
+ E+ IDRAL + + R +LG F+ + P+ ++ LA +AA +VLL
Sbjct: 333 ISEAAIDRALTRVLTARFKLGEFDPM-ELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLL 391
Query: 408 KNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV---ENTVY 464
KN LLPL K K S+A++GP A+ LG Y+G S++ L+ ++ + Y
Sbjct: 392 KND-ALLPLNKEKIKSVAVVGPFADY--NYLGGYSGQPPYSVSLLKGVKELIGKKGKVTY 448
Query: 465 YPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRV 524
G T S D + KGAD V++ +G D+ +E D + LP Q++L+ ++
Sbjct: 449 LNGMGT------SADSIAQVVKGADIVLVALGSDEKMARENHDMPSIYLPEEQEKLLKKI 502
Query: 525 AEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRL 584
+ + ++LV G P+ +A D +I +I+ A YPG+ ALA ++FG+ NP G+L
Sbjct: 503 YQVNPR-IVLVFHTGNPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLLFGNENPSGKL 559
Query: 585 PMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVS 644
PMT Y + + D M GRTYR+ +G+ ++ FG GLSY+ S++F +
Sbjct: 560 PMTIYKTEEQLPDILDFDMW-------KGRTYRYMKGEPLYGFGHGLSYT--SFEFDNIQ 610
Query: 645 QNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLF 704
N D + ++ + E N G++AG+ V ++
Sbjct: 611 GN----------------DTLQPDAILQCSVELS------------NSGQLAGEEVVQVY 642
Query: 705 VKPARRGN----GRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGT 760
V R N P+K+LV F+ V L + EK ++ F ++P E LS EDG + G
Sbjct: 643 VS---RENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTIAPRE-LS-VWEDGKWRMLSGK 697
Query: 761 HFLVVG 766
+ L +G
Sbjct: 698 YTLFIG 703
>gi|423258868|ref|ZP_17239791.1| hypothetical protein HMPREF1055_02068 [Bacteroides fragilis
CL07T00C01]
gi|423264161|ref|ZP_17243164.1| hypothetical protein HMPREF1056_00851 [Bacteroides fragilis
CL07T12C05]
gi|387776448|gb|EIK38548.1| hypothetical protein HMPREF1055_02068 [Bacteroides fragilis
CL07T00C01]
gi|392706427|gb|EIY99550.1| hypothetical protein HMPREF1056_00851 [Bacteroides fragilis
CL07T12C05]
Length = 722
Score = 369 bits (948), Expect = 3e-99, Method: Compositional matrix adjust.
Identities = 240/726 (33%), Positives = 380/726 (52%), Gaps = 89/726 (12%)
Query: 51 PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
P++ R R L+ ++TL EK++QLV+ + +IPRL +PAY +W+E LHGVA G+
Sbjct: 57 PVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE-------- 108
Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY-NAGQAIGMTFWAPNINIFRDPRW 169
T FPQ I A+++D+ L R+ AI EAR Y G+ G+T+W+P IN+ RDPRW
Sbjct: 109 --VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGK--GLTYWSPTINMARDPRW 164
Query: 170 GRGQETPGEDPLVTGKYAVSYVRGVQGD--TFNGGKLKGKLQASACCKHFTAYDLDNWKG 227
GR +ET GEDP +T + V++V+G+QGD T+ L+ A KHF A + +N
Sbjct: 165 GRNEETYGEDPHLTSRLGVAFVKGLQGDHPTY--------LKTVATIKHFVANNEEN--- 213
Query: 228 TTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARR 287
R+ +++ + L + Y P +E+CVK+ A +M AYN NG+P LL R+
Sbjct: 214 -NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHWLLDDVLRK 272
Query: 288 QWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKK 347
+WGF G++ SDC A+ ++ S E+A + +G D+ CG+ ++ AV+Q
Sbjct: 273 EWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVEQGL 332
Query: 348 LPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLL 407
+ E+ IDRAL + + R +LG F+ + P+ ++ LA +AA +VLL
Sbjct: 333 ISEAAIDRALTRVLTARFKLGEFDPM-ELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLL 391
Query: 408 KNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV---ENTVY 464
KN LLPL K K S+A++GP A+ LG Y+G S++ L+ ++ + Y
Sbjct: 392 KND-ALLPLNKEKIKSVAVVGPFADY--NYLGGYSGQPPYSVSLLKGVKELIGKKGKVTY 448
Query: 465 YPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRV 524
G T S D + KGAD V++ +G D+ +E D + LP Q++L+ +
Sbjct: 449 LNGMGT------SADSIAQVVKGADIVLVALGSDEKMARENHDMPSIYLPEEQEKLLKEI 502
Query: 525 AEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRL 584
+ + ++LV G P+ +A D +I +I+ A YPG+ ALA ++FG+ NP G+L
Sbjct: 503 YQVNPR-IVLVFHTGNPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLLFGNENPSGKL 559
Query: 585 PMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVS 644
PMT Y + + D M GRTYR+ +G+ ++ FG GLSY+ S++F +
Sbjct: 560 PMTIYKTEEQLPDILDFDMW-------KGRTYRYMKGEPLYGFGHGLSYT--SFEFDNIQ 610
Query: 645 QNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLF 704
N D + ++ + E N G++AG+ V ++
Sbjct: 611 GN----------------DTLQPDAILQCSVELS------------NSGQLAGEEVVQVY 642
Query: 705 VKPARRGN----GRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGT 760
V R N P+K+LV F+ V L + EK ++ F ++P E LS EDG + G
Sbjct: 643 VS---RENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTIAPRE-LS-VWEDGKWRMLSGK 697
Query: 761 HFLVVG 766
+ L +G
Sbjct: 698 YTLFIG 703
>gi|383117083|ref|ZP_09937830.1| hypothetical protein BSHG_0813 [Bacteroides sp. 3_2_5]
gi|251947612|gb|EES87894.1| hypothetical protein BSHG_0813 [Bacteroides sp. 3_2_5]
Length = 722
Score = 369 bits (947), Expect = 3e-99, Method: Compositional matrix adjust.
Identities = 240/726 (33%), Positives = 380/726 (52%), Gaps = 89/726 (12%)
Query: 51 PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
P++ R R L+ ++TL EK++QLV+ + +IPRL +PAY +W+E LHGVA G+
Sbjct: 57 PVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE-------- 108
Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY-NAGQAIGMTFWAPNINIFRDPRW 169
T FPQ I A+++D+ L R+ AI EAR Y G+ G+T+W+P IN+ RDPRW
Sbjct: 109 --VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGK--GLTYWSPTINMARDPRW 164
Query: 170 GRGQETPGEDPLVTGKYAVSYVRGVQGD--TFNGGKLKGKLQASACCKHFTAYDLDNWKG 227
GR +ET GEDP +T + V++V+G+QGD T+ L+ A KHF A + +N
Sbjct: 165 GRNEETYGEDPHLTSRLGVAFVKGLQGDHPTY--------LKTVATIKHFVANNEEN--- 213
Query: 228 TTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARR 287
R+ +++ + L + Y P +E+CVK+ A +M AYN NG+P LL R+
Sbjct: 214 -NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHWLLDDVLRK 272
Query: 288 QWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKK 347
+WGF G++ SDC A+ ++ S E+A + +G D+ CG+ ++ AV+Q
Sbjct: 273 EWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVEQGL 332
Query: 348 LPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLL 407
+ E IDRAL + + R +LG F+ + P+ ++ LA +AA +VLL
Sbjct: 333 ISEVAIDRALTRVLTARFKLGEFDPM-ELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLL 391
Query: 408 KNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV---ENTVY 464
KN LLPL K K S+A++GP A+ LG Y+G S++ L+ ++ + Y
Sbjct: 392 KND-ALLPLNKEKIKSVAVVGPFADY--NYLGGYSGQPPYSVSLLKGVKELIGKKGKVTY 448
Query: 465 YPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRV 524
G T S D + KGAD V++ +G D+ +E D + LP Q++L+ ++
Sbjct: 449 LNGMGT------SADSIAQVVKGADIVLVALGSDEKMARENHDMPSIYLPEEQEKLLKKI 502
Query: 525 AEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRL 584
+ + ++LV G P+ +A D +I +I+ A YPG+ ALA ++FG+ NP G+L
Sbjct: 503 YQVNPR-IVLVFHTGNPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLLFGNENPSGKL 559
Query: 585 PMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVS 644
PMT Y + + D M GRTYR+ +G+ ++ FG GLSY+ S++F +
Sbjct: 560 PMTIYKTEEQLPDILDFDMW-------KGRTYRYMKGEPLYGFGHGLSYT--SFEFDNIQ 610
Query: 645 QNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLF 704
N D + ++ + E N G++AG+ V ++
Sbjct: 611 GN----------------DTLQPDAILQCSVELS------------NSGQLAGEEVVQVY 642
Query: 705 VKPARRGN----GRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGT 760
V R N P+K+LV F+ V L + EK ++ F ++P E LS EDG + G
Sbjct: 643 VS---RENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTIAPRE-LS-VWEDGKWRMLSGK 697
Query: 761 HFLVVG 766
+ L +G
Sbjct: 698 YTLFIG 703
>gi|60680313|ref|YP_210457.1| glycosyl hydrolase [Bacteroides fragilis NCTC 9343]
gi|60491747|emb|CAH06504.1| putative glycosyl hydrolase [Bacteroides fragilis NCTC 9343]
Length = 722
Score = 369 bits (946), Expect = 5e-99, Method: Compositional matrix adjust.
Identities = 239/726 (32%), Positives = 380/726 (52%), Gaps = 89/726 (12%)
Query: 51 PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
P++ R R L+ ++TL EK++QLV+ + +IPRL +PAY +W+E LHGVA G+
Sbjct: 57 PVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE-------- 108
Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY-NAGQAIGMTFWAPNINIFRDPRW 169
T FPQ I A+++D+ L R+ AI EAR Y G+ G+T+W+P IN+ RDPRW
Sbjct: 109 --VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGK--GLTYWSPTINMARDPRW 164
Query: 170 GRGQETPGEDPLVTGKYAVSYVRGVQGD--TFNGGKLKGKLQASACCKHFTAYDLDNWKG 227
GR +ET GEDP +T + V++V+G+QGD T+ L+ A KHF A + +N
Sbjct: 165 GRNEETYGEDPHLTSRLGVAFVKGLQGDHPTY--------LKTVATIKHFVANNEEN--- 213
Query: 228 TTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARR 287
R+ +++ + L + Y P +E+CVK+ A +M AYN NG+P LL R+
Sbjct: 214 -NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHWLLDDVLRK 272
Query: 288 QWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKK 347
+WGF G++ SDC A+ ++ S E+A + +G D+ CG+ ++ AV+Q
Sbjct: 273 EWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVEQGL 332
Query: 348 LPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLL 407
+ E+ IDRAL + + R +LG F+ + P+ ++ LA +AA +VLL
Sbjct: 333 ISEAAIDRALTRVLTARFKLGEFDPM-ELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLL 391
Query: 408 KNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV---ENTVY 464
KN LLPL K K S+A++GP A+ LG Y+G S++ L+ ++ + Y
Sbjct: 392 KND-ALLPLNKEKIKSVAVVGPFADY--NYLGGYSGQPPYSVSLLKGVKELIGKKGKVTY 448
Query: 465 YPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRV 524
G T S D + KGAD V++ +G D+ +E D + LP Q++ + ++
Sbjct: 449 LNGMGT------SADSIAQVVKGADIVLVALGSDEKMARENHDMPSIYLPEEQEKFLKKI 502
Query: 525 AEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRL 584
+ + ++LV G P+ +A D +I +I+ A YPG+ ALA ++FG+ NP G+L
Sbjct: 503 YQVNPR-IVLVFHTGNPLTSEWA--DTHILAIMQAWYPGQEAGRALANLLFGNENPSGKL 559
Query: 585 PMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVS 644
PMT Y + + D M GRTYR+ +G+ ++ FG GLSY+ S++F +
Sbjct: 560 PMTIYKTEEQLPDILDFDMW-------KGRTYRYMKGEPLYGFGHGLSYT--SFEFDNIQ 610
Query: 645 QNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLF 704
N D + ++ + E N G++AG+ V ++
Sbjct: 611 GN----------------DTLQPDAILQCSVELS------------NSGQLAGEEVVQVY 642
Query: 705 VKPARRGN----GRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGT 760
V R N P+K+LV F+ V L + EK ++ F ++P E LS EDG + G
Sbjct: 643 VS---RENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTIAPRE-LS-VWEDGKWRMLSGK 697
Query: 761 HFLVVG 766
+ L +G
Sbjct: 698 YTLFIG 703
>gi|53712125|ref|YP_098117.1| beta-xylosidase [Bacteroides fragilis YCH46]
gi|52214990|dbj|BAD47583.1| beta-xylosidase [Bacteroides fragilis YCH46]
Length = 722
Score = 368 bits (945), Expect = 7e-99, Method: Compositional matrix adjust.
Identities = 240/726 (33%), Positives = 380/726 (52%), Gaps = 89/726 (12%)
Query: 51 PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
P++ R R L+ ++TL EK++QLV+ + +IPRL +PAY +W+E LHGVA G+
Sbjct: 57 PVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE-------- 108
Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY-NAGQAIGMTFWAPNINIFRDPRW 169
T FPQ I A+++D+ L R+ AI EAR Y G+ G+T+W+P IN+ RDPRW
Sbjct: 109 --VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGK--GLTYWSPTINMARDPRW 164
Query: 170 GRGQETPGEDPLVTGKYAVSYVRGVQGD--TFNGGKLKGKLQASACCKHFTAYDLDNWKG 227
GR +ET GEDP +T + V++V+G+QGD T+ L+ A KHF A + +N
Sbjct: 165 GRNEETYGEDPHLTSRLGVAFVKGLQGDHPTY--------LKTVATIKHFVANNEEN--- 213
Query: 228 TTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARR 287
R+ +++ + L + Y P +E+CVK+ A +M AYN NG+P LL R+
Sbjct: 214 -NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHWLLDDVLRK 272
Query: 288 QWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKK 347
+WGF G++ SDC A+ ++ S E+A + +G D+ CG+ ++ AV+Q
Sbjct: 273 EWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVEQGL 332
Query: 348 LPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLL 407
+ E+ IDRAL + + R +LG F+ + P+ ++ LA +AA +VLL
Sbjct: 333 ISEAAIDRALTRVLTARFKLGEFDPM-ELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLL 391
Query: 408 KNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV---ENTVY 464
KN LLPL K K S+A++GP A+ LG Y+G S++ L+ ++ + Y
Sbjct: 392 KND-ALLPLNKEKIKSVAVVGPFADY--NYLGGYSGQPPYSVSLLKGVKELIGKKGKVTY 448
Query: 465 YPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRV 524
G T S D + KGAD V++ +G D+ +E D + LP Q++L+ +
Sbjct: 449 LNGMGT------SADSIAQVVKGADIVLVALGSDEKMARENHDMPSIYLPEGQEKLLKEI 502
Query: 525 AEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRL 584
+ + ++LV G P+ +A D +I +I+ A YPG+ ALA ++FG+ NP G+L
Sbjct: 503 YQVNPR-IVLVFHTGNPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLLFGNENPSGKL 559
Query: 585 PMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVS 644
PMT Y + + D M GRTYR+ +G+ ++ FG GLSY+ S++F +
Sbjct: 560 PMTIYKTEEQLPDILDFDMW-------KGRTYRYMKGEPLYGFGHGLSYT--SFEFDNIQ 610
Query: 645 QNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLF 704
N D + ++ + E N G++AG+ V ++
Sbjct: 611 GN----------------DTLQPDAILQCSVELS------------NSGQLAGEEVVQVY 642
Query: 705 VKPARRGN----GRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGT 760
V R N P+K+LV F+ V L + EK ++ F ++P E LS EDG + G
Sbjct: 643 VS---RENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTIAPRE-LS-VWEDGKWRMLSGK 697
Query: 761 HFLVVG 766
+ L +G
Sbjct: 698 YTLFIG 703
>gi|30316196|sp|P83344.1|XYNB_PRUPE RecName: Full=Putative beta-D-xylosidase; AltName: Full=PpAz152
gi|19879972|gb|AAM00218.1|AF362990_1 beta-D-xylosidase, partial [Prunus persica]
Length = 461
Score = 367 bits (943), Expect = 9e-99, Method: Compositional matrix adjust.
Identities = 199/460 (43%), Positives = 280/460 (60%), Gaps = 15/460 (3%)
Query: 321 DVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFG 380
D +KAG+D++CG FL HT+AAV++ + + EI+ AL N +V+MRLG+F+G P+ +G
Sbjct: 2 DAIKAGLDLDCGPFLAIHTEAAVRRGLVSQLEINWALANTMTVQMRLGMFDGEPSAHQYG 61
Query: 381 KIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGN 440
+G VC+PAHQ LAL+AA+ GIVLL+N LPL + ++A+IGPN++ T++GN
Sbjct: 62 NLGPRDVCTPAHQQLALEAARQGIVLLENRGRSLPLSTRRHRTVAVIGPNSDVTVTMIGN 121
Query: 441 YAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASI-DKAVDIAKGADHVVLMMGLDQ 499
YAG +C TPLQ + Y T++ GC V C+ + A A+ AD VL+MGLDQ
Sbjct: 122 YAGVACGYTTPLQGIGRYTR-TIHQAGCTDVHCNGNQLFGAAEAAARQADATVLVMGLDQ 180
Query: 500 TQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWA 559
+ E E +DR L+LPG QQEL++RVA A++ P ILVL+ GGP+D+TFAK D I +I+W
Sbjct: 181 SIEAEFVDRAGLLLPGHQQELVSRVARASRGPTILVLMSGGPIDVTFAKNDPRISAIIWV 240
Query: 560 GYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRF 618
GYPG+AG A+A V+FG NPGG+LPMTWYPQ+Y+ +PMTDM MR G PGRTYRF
Sbjct: 241 GYPGQAGGTAIANVLFGTANPGGKLPMTWYPQNYVTHLPMTDMAMRADPARGYPGRTYRF 300
Query: 619 YEGKEVFPFGCGLSYSKYSYKF----KAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELG 674
Y G VFPFG GLSY+ +++ VS L ++++ M+ V + L
Sbjct: 301 YIGPVVFPFGLGLSYTTFAHNLAHGPTLVSVPLTSLKATANSTMLSKTVRVSHPDCNALS 360
Query: 675 TEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAE 734
V + VKN G M G H +L+F P G KQL+GF + + +
Sbjct: 361 -------PLDVHVDVKNTGSMDGTHTLLVFTSPP-DGKWASSKQLMGFHKIHIATGSEKR 412
Query: 735 IVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
+ + C+ LS G+ I G H L +GD + +S+
Sbjct: 413 VRIAVHVCKHLSVVDRFGIRRIPLGEHKLQIGDLSHHVSL 452
>gi|423281966|ref|ZP_17260851.1| hypothetical protein HMPREF1204_00389 [Bacteroides fragilis HMW
615]
gi|404582453|gb|EKA87147.1| hypothetical protein HMPREF1204_00389 [Bacteroides fragilis HMW
615]
Length = 722
Score = 367 bits (942), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 240/726 (33%), Positives = 379/726 (52%), Gaps = 89/726 (12%)
Query: 51 PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
P++ R R L+ ++TL EK++QLV+ + +IPRL +PAY +W+E LHGVA G+
Sbjct: 57 PVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE-------- 108
Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY-NAGQAIGMTFWAPNINIFRDPRW 169
T FPQ I A+++D+ L R+ AI EAR Y G+ G+T+W+P IN+ RDPRW
Sbjct: 109 --VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGK--GLTYWSPTINMARDPRW 164
Query: 170 GRGQETPGEDPLVTGKYAVSYVRGVQGD--TFNGGKLKGKLQASACCKHFTAYDLDNWKG 227
GR +ET GEDP +T + V++V+G+QGD T+ L+ A KHF A + +N
Sbjct: 165 GRNEETYGEDPHLTSRLGVAFVKGLQGDHPTY--------LKTVATIKHFVANNEEN--- 213
Query: 228 TTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARR 287
R+ +++ + L + Y P +E+CVK+ A +M AYN NG+P LL R+
Sbjct: 214 -NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHWLLDDVLRK 272
Query: 288 QWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKK 347
+WGF G++ SDC A+ ++ S E+A + +G D+ CG+ ++ AV+Q
Sbjct: 273 EWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVEQGL 332
Query: 348 LPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLL 407
+ E+ IDRAL + + R +LG F+ + P+ ++ LA +AA +VLL
Sbjct: 333 ISEAAIDRALTRVLTARFKLGEFDPM-ELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLL 391
Query: 408 KNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV---ENTVY 464
KN LLPL K K S+A++GP A+ LG Y+G S++ L+ ++ + Y
Sbjct: 392 KND-ALLPLNKEKIKSVAVVGPFADY--NYLGGYSGQPPYSVSLLKGVKELIGKKGKVTY 448
Query: 465 YPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRV 524
G T S D + KGAD V++ +G D+ +E D + LP Q++L+ +
Sbjct: 449 LNGMGT------SADSIAQVVKGADIVLVALGSDEKMARENHDMPSIYLPEEQEKLLKEI 502
Query: 525 AEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRL 584
+ + + LV G P+ +A D +I +I+ A YPG+ ALA ++FG+ NP G+L
Sbjct: 503 YQVNPR-IALVFHTGNPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLLFGNENPSGKL 559
Query: 585 PMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVS 644
PMT Y + + D M GRTYR+ +G+ ++ FG GLSY+ S++F +
Sbjct: 560 PMTIYKTEEQLPDILDFDMW-------KGRTYRYMKGEPLYGFGHGLSYT--SFEFDNIQ 610
Query: 645 QNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLF 704
N D + ++ + E N G++AG+ V ++
Sbjct: 611 GN----------------DTLQPDAILQCSVELS------------NSGQLAGEEVVQVY 642
Query: 705 VKPARRGN----GRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGT 760
V R N P+K+LV F+ V L + EK ++ F ++P E LS EDG + G
Sbjct: 643 VS---RENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTIAPRE-LS-VWEDGKWRMLSGK 697
Query: 761 HFLVVG 766
+ L +G
Sbjct: 698 YTLFIG 703
>gi|255545654|ref|XP_002513887.1| hypothetical protein RCOM_1034150 [Ricinus communis]
gi|223546973|gb|EEF48470.1| hypothetical protein RCOM_1034150 [Ricinus communis]
Length = 246
Score = 367 bits (942), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 174/245 (71%), Positives = 202/245 (82%), Gaps = 5/245 (2%)
Query: 1 MIFHKLSLVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLV 60
M K+SL+ F + + QPPFSCD SNPST+++ FC+T+LPISQRA+DLV
Sbjct: 1 MKLRKISLLTLFHLTSFLIFIFADSAPQPPFSCDYSNPSTKSYLFCQTSLPISQRAKDLV 60
Query: 61 SRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV-----GKGIFFNGTIRGATS 115
SRLTLDEKISQLV++AP+IPRLG+PAY WWSEALHGVA + +GI FNGTIR ATS
Sbjct: 61 SRLTLDEKISQLVDTAPSIPRLGVPAYRWWSEALHGVAFLPDVSERQGIHFNGTIRSATS 120
Query: 116 FPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQET 175
FPQVI+TAASFD+ LWYRIGQ G EARA+YNAGQAIGMTFWAPNINIFRDPRWGRGQET
Sbjct: 121 FPQVIITAASFDTKLWYRIGQVTGKEARAIYNAGQAIGMTFWAPNINIFRDPRWGRGQET 180
Query: 176 PGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDA 235
PGEDP+VTGKYAVS+VRGVQGD+F GG L KLQASACCKHFTAYDL+ W GT R+ F+A
Sbjct: 181 PGEDPMVTGKYAVSFVRGVQGDSFEGGVLGEKLQASACCKHFTAYDLEKWNGTNRFIFNA 240
Query: 236 RVTMQ 240
+V+ +
Sbjct: 241 QVSAR 245
>gi|291240559|ref|XP_002740189.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 745
Score = 367 bits (941), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 262/783 (33%), Positives = 375/783 (47%), Gaps = 94/783 (12%)
Query: 11 PLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKIS 70
P C+ FTS+L + ST FPF T+LP ++R DLV RL L+E +
Sbjct: 4 PGKCVVFTSVLFSLIST-----------ILSDFPFRNTSLPWNKRVEDLVGRLKLEEIVL 52
Query: 71 QLVNS-------APAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTA 123
Q+ AP I RL I Y W +E L G G ATSFPQ A
Sbjct: 53 QMSRGGRYSNGPAPPIDRLNIGPYSWNTECLRGDLSAGP----------ATSFPQAFGLA 102
Query: 124 ASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTFWAPNINIFRDPRWGRGQETP 176
A+FD+ L +I A E RA YN G G++ ++P INI R P WGR QET
Sbjct: 103 ATFDAVLIKQIANATAYEVRAKYNNYTKHKEYGDHKGLSCFSPVINIARHPLWGRIQETY 162
Query: 177 GEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDAR 236
GEDP ++G A S+V G+QG+ + A+A CKHF AY ++R FDA+
Sbjct: 163 GEDPYLSGTLAASFVTGLQGNH------PRYVTANAGCKHFDAYAGPENIPSSRSTFDAK 216
Query: 237 VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYIT 296
V+ +DL T+ P F C++ G S +MC+YN +NG+P+CA++ LL+ R +W F GY+
Sbjct: 217 VSDRDLRMTFLPAFHECIQAGTYS-LMCSYNSINGVPACANKKLLTDILRTEWNFTGYVI 275
Query: 297 SDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKH----TKAAVKQKKLPESE 352
SD AV +YDA Y K D + + +G+++ S L + T AVKQ +
Sbjct: 276 SDQSAVEKVYDAHHYTKDMLDTAIACVNSGLNLELSSNLTDNVMMQTTKAVKQGNVTMKT 335
Query: 353 IDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHG 412
+ + LF RMRLG F+ P M P+ K+ ++ S HQ L+L+AA VLLKN +
Sbjct: 336 VKARVSPLFYTRMRLGEFDP-PEMNPYSKLDLSIIQSQEHQELSLKAAAKSFVLLKNENR 394
Query: 413 LLPLPKSKSVSLALIGPNANSAKTLLGNYA-GPSCRSITPLQALQNYVE-NTVYYPGCDT 470
LPL K K LA++GP ++ + G+ + S ++TP L T + GC +
Sbjct: 395 FLPL-KEKIDKLAVVGPFGDNPIEIYGSKSPDVSNLTVTPRYGLSKIARLATTFASGCLS 453
Query: 471 VACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITR-VAEAAK 529
AC+ D VV+ +G E E DR +L LPG+Q L+ V AA
Sbjct: 454 PACTEYDPKSTKQAIDRVDMVVVCLGTGNEVENEAHDRSELTLPGQQLRLLQDAVTFAAD 513
Query: 530 KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGD--HNPGGRLPMT 587
KPVIL+L GP+DIT+A + I I+ +P + AL + NPGGRLP+T
Sbjct: 514 KPVILLLFNAGPLDITWAVSNPAIPVIVECFFPAQTTGTALYHLFVNSPGSNPGGRLPIT 573
Query: 588 WYPQDYIKV-PMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQN 646
W P+ +V PM D M GRTYR++ G +FPFG GLSY+ + Y +
Sbjct: 574 W-PKSMSQVPPMEDYTME--------GRTYRYFNGDPLFPFGYGLSYTTFHY-------S 617
Query: 647 KLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVK 706
L + S+ K + + + + ++N G++ G ++
Sbjct: 618 DLLITPSTPIKPCSSIN---------------------IDVFLENTGDVTGDEVTQFYLS 656
Query: 707 PARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
P QLVG L +K A I + P A VIE G + + G
Sbjct: 657 WKNASIPVPKWQLVGVSRTQLQSKTFANIAIIVPP---RLMAVYTNKWVIEPGVYTVYAG 713
Query: 767 DEE 769
++
Sbjct: 714 GQQ 716
>gi|323451996|gb|EGB07871.1| hypothetical protein AURANDRAFT_71699 [Aureococcus anophagefferens]
Length = 1202
Score = 367 bits (941), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 256/666 (38%), Positives = 346/666 (51%), Gaps = 77/666 (11%)
Query: 29 PPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYE 88
PP D +PS +P+C LPI R DL +R T++E ISQ+ A A+PRLG+PA
Sbjct: 327 PPKLGDVESPSAAAYPYCDRALPIRARVADLAARFTVNETISQMGTMAAAVPRLGLPALN 386
Query: 89 WWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA 148
+ EALHGV R T FP ASFD LW +G A GLEARAL+
Sbjct: 387 YGGEALHGVWST------CAAGRCPTQFPAPHAMGASFDRDLWRAVGAASGLEARALFRW 440
Query: 149 GQ-------------AIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQ 195
Q +G+TF+APN+N+ RDPRWGR +E P EDPL+ G Y +VRG Q
Sbjct: 441 NQRHNASDCARSLEGCLGLTFYAPNVNLARDPRWGRIEEVPSEDPLLNGVYGAEFVRGFQ 500
Query: 196 GDTFNGGKLKGKLQASACCKHFTAYDLD---------NWKGTT-------RYKFDARVTM 239
GD G + A+A KHF Y+L+ +W G+ R+ FDARV+
Sbjct: 501 GD----GAYR---VANAVVKHFAVYNLEVDVEDTPPADWCGSAACAPPNDRHSFDARVSP 553
Query: 240 QDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDC 299
+D +TY PF + A+ MC+YN VNG P+C D LL R F G + +DC
Sbjct: 554 RDFEETYVGPFVA-PVAAGAAAAMCSYNAVNGEPACTDGALLRGALRGALNFTGVLATDC 612
Query: 300 DAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHN 359
A+ + +A + AG+D NCG L A+ + + L
Sbjct: 613 GALEDAVARHKRYATEAEAAAAAIAAGVDSNCGKVLTSALPEALAAGLVRPDALRPPLER 672
Query: 360 LFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKS 419
L R+RLGL + P + D V SPAH+ LAL+AA++G+VLL+N + +LPL
Sbjct: 673 LLEARLRLGLLDDWDADAPVPRPDVDAVDSPAHRALALRAAREGLVLLQNPNQILPLDGR 732
Query: 420 KSVSLALIGPNANSAKTLLGNYAG--PSCRSITPLQALQNYVE--NTVYYPGCDTVACSS 475
+LA+IGPNAN++ LL Y G P +PLQ L+ VY GC+ ++
Sbjct: 733 G--TLAVIGPNANASMNLLSGYHGTPPPDLLRSPLQELEARWRGGKVVYAVGCNASGAAT 790
Query: 476 ASIDKAVDIAKGADHVVLMMGL------------DQT----QEKEELDRVDLVLPGRQQE 519
A++D+AVD+AK AD VVL +GL D T E E +DR L LPG Q+
Sbjct: 791 AALDEAVDLAKTADVVVLGLGLCGDNYGGGPPKEDATCFSIDEAESVDRTSLKLPGAQEA 850
Query: 520 LITRVAEAAKKPVILV--LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGD 577
L +++ A KPV + L+ G VD +FAK + ++L AGY GE G VA+A+ + G
Sbjct: 851 LFSKI-WALGKPVAVAVFLVSAGAVDASFAK---DKAALLLAGYGGEFGGVAVADALLGA 906
Query: 578 HNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFP---FGCGLSYS 634
+NPGG L T P + P DM MRP A S PGRTYRF + + V P FG GLSY+
Sbjct: 907 YNPGGALTATMLPDAGLP-PFRDMAMRPSAAS--PGRTYRFLDERRVAPLWRFGFGLSYT 963
Query: 635 KYSYKF 640
++
Sbjct: 964 AFAVSL 969
>gi|423269271|ref|ZP_17248243.1| hypothetical protein HMPREF1079_01325 [Bacteroides fragilis
CL05T00C42]
gi|423273165|ref|ZP_17252112.1| hypothetical protein HMPREF1080_00765 [Bacteroides fragilis
CL05T12C13]
gi|392701693|gb|EIY94850.1| hypothetical protein HMPREF1079_01325 [Bacteroides fragilis
CL05T00C42]
gi|392708197|gb|EIZ01305.1| hypothetical protein HMPREF1080_00765 [Bacteroides fragilis
CL05T12C13]
Length = 722
Score = 366 bits (940), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 239/726 (32%), Positives = 380/726 (52%), Gaps = 89/726 (12%)
Query: 51 PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
P++ R R L+ ++TL EK++QLV+ + +IPRL +PAY +W+E LHGVA G+
Sbjct: 57 PVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE-------- 108
Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY-NAGQAIGMTFWAPNINIFRDPRW 169
T FPQ I A+++D+ L R+ AI EAR Y G+ G+T+W+P IN+ RDPRW
Sbjct: 109 --VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGK--GLTYWSPTINMARDPRW 164
Query: 170 GRGQETPGEDPLVTGKYAVSYVRGVQGD--TFNGGKLKGKLQASACCKHFTAYDLDNWKG 227
GR +ET GE+P +T + V++V+G+QGD T+ L+ A KHF A + +N
Sbjct: 165 GRNEETYGEEPHLTSRLGVAFVKGLQGDHPTY--------LKTVATIKHFVANNEEN--- 213
Query: 228 TTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARR 287
R+ +++ + L + Y P +E+CVK+ A +M AYN NG+P LL R+
Sbjct: 214 -NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHWLLDDVLRK 272
Query: 288 QWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKK 347
+WGF G++ SDC A+ ++ S E+A + +G D+ CG+ ++ AV+Q
Sbjct: 273 EWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVEQGL 332
Query: 348 LPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLL 407
+ E+ IDRAL + + R +LG F+ + P+ ++ LA +AA +VLL
Sbjct: 333 ISEAAIDRALTRVLTARFKLGEFDPM-ELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLL 391
Query: 408 KNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV---ENTVY 464
KN LLPL K K S+A++GP A+ LG Y+G S++ L+ ++ + Y
Sbjct: 392 KND-ALLPLNKEKIKSVAVVGPFADY--NYLGGYSGQPPYSVSLLKGVKELIGKKGKVTY 448
Query: 465 YPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRV 524
G T S D + KGAD V++ +G D+ +E D + LP Q++L+ +
Sbjct: 449 LNGMGT------SADSIAQVVKGADIVLVALGSDEKMARENHDMPSIYLPEGQEKLLKEI 502
Query: 525 AEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRL 584
+ + ++LV G P+ +A D +I +I+ A YPG+ ALA ++FG+ NP G+L
Sbjct: 503 YQVNPR-IVLVFHTGNPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLLFGNENPSGKL 559
Query: 585 PMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVS 644
PMT Y + + D M GRTYR+ +G+ ++ FG GLSY+ S++F +
Sbjct: 560 PMTIYKTEEQLPDILDFDMW-------KGRTYRYMKGEPLYGFGHGLSYT--SFEFDNIQ 610
Query: 645 QNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLF 704
N D + ++ + E N G++AG+ V ++
Sbjct: 611 GN----------------DTLQPDAILQCSVELS------------NSGQLAGEEVVQVY 642
Query: 705 VKPARRGN----GRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGT 760
V R N P+K+LV F+ V L + EK ++ F ++P E LS EDG + G
Sbjct: 643 VS---RENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTIAPRE-LS-VWEDGKWRMLSGK 697
Query: 761 HFLVVG 766
+ L +G
Sbjct: 698 YTLFIG 703
>gi|282877070|ref|ZP_06285912.1| glycosyl hydrolase family 3 C-terminal domain protein [Prevotella
buccalis ATCC 35310]
gi|281300752|gb|EFA93079.1| glycosyl hydrolase family 3 C-terminal domain protein [Prevotella
buccalis ATCC 35310]
Length = 721
Score = 365 bits (937), Expect = 5e-98, Method: Compositional matrix adjust.
Identities = 247/736 (33%), Positives = 359/736 (48%), Gaps = 97/736 (13%)
Query: 42 TFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVG 101
T+PF L QRA DL RLTL+EK + N++ +PRLGI ++WW EALHG A G
Sbjct: 23 TYPFQDARLSFEQRADDLCKRLTLEEKAGLMQNNSKPVPRLGIKQFQWWGEALHGSARTG 82
Query: 102 KGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------M 154
AT FPQ I AASFD L ++ EARA YN G +
Sbjct: 83 L----------ATVFPQTIGMAASFDDELLLQVFNIASTEARAKYNVAAKKGYFDTSWSV 132
Query: 155 TFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGK-LQASAC 213
+ W PN+NIFRDPRWGRGQET GEDP +T + + V G+Q GGK K +A AC
Sbjct: 133 SLWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGCAVVEGLQ-----GGKGPHKYYKAFAC 187
Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
KHF + W + D V+ +D +TY P F+ V+ G +MCAYN ++G P
Sbjct: 188 AKHFAVHSGPEWNRHSISIDD--VSPRDFHETYLPAFKHLVQVGGVKEVMCAYNSIDGEP 245
Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE--DAVVDVLKAGMDVNC 331
C+D+ LL + R +WGF G + SDC A+ I+ + P+ A +K G D++C
Sbjct: 246 CCSDQRLLEQLLRDEWGFKGIVVSDCGAIDDIWRKGFHEVEPDAAHASARAVKGGTDMSC 305
Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
G AV+ K+ E ID++L L RM+LG F+ + ++ + I V +PA
Sbjct: 306 GQTYGS-LPEAVRLGKVTEERIDKSLKRLIVGRMQLGEFDPD-SITRWNAISMKDVSTPA 363
Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITP 451
+ +AL+ A++ + LL N LPL K + ++GPNAN + + GNY G ++T
Sbjct: 364 SREVALKMARETMTLLHNPMHALPLSKQLK-QVVVMGPNANDSVMMWGNYNGTPHHTVTI 422
Query: 452 LQALQNYV--ENTVYYPGCDTV-----ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKE 504
L ++ + + + GC V + + + V+ V+ + G+ E E
Sbjct: 423 LDGIRRKIGAQRVKFIEGCGLVEPHRRGNQALTTQQLVEEVGDNKTVIFVGGISPQLEGE 482
Query: 505 EL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIG 554
+L DRV + LP Q+E+I + A K+ ++++ C G I +
Sbjct: 483 QLEVEAKGFKGGDRVTIELPQVQREMIAALHAAGKQ--VIMVNCSGSA-IGLVPEVTHTD 539
Query: 555 SILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGR 614
+IL A YPGE G A+A+V+FGD+NP G+LP+T+Y D D MR R
Sbjct: 540 AILQAWYPGERGGEAVADVLFGDYNPAGKLPVTFYRDDSQLPDYLDYNMR--------NR 591
Query: 615 TYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELG 674
TYR+++GK +FPFG GLSY+ + + KL
Sbjct: 592 TYRYFKGKPLFPFGHGLSYTSFKIGKAKMRNGKL-------------------------- 625
Query: 675 TEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAE 734
T+ VKN G+ G+ V L++ NG PIK L GF+ + L A E+
Sbjct: 626 -----------TVSVKNTGKRDGEEVVQLYISCLDDPNG-PIKSLRGFKRMALQAGEQRT 673
Query: 735 IVFELSPCESLSRARE 750
+ L P +S R E
Sbjct: 674 VTLNL-PRKSFERFDE 688
>gi|198425898|ref|XP_002119549.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 754
Score = 363 bits (933), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 260/768 (33%), Positives = 387/768 (50%), Gaps = 105/768 (13%)
Query: 12 LLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQ 71
++ LC LL+ F S ++E FPF +LPI +R DLV+RLT++E I Q
Sbjct: 1 MIRLCLFLLLSY-------FHFASSKVTSEEFPFRNFSLPIEERLEDLVNRLTIEEVILQ 53
Query: 72 LVNS-------APAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAA 124
L APAI RLGI Y+W +E L G A G AT FPQ I AA
Sbjct: 54 LSRGGVRDNGPAPAITRLGIGPYQWNTECLRGYAMNGD----------ATCFPQPIGLAA 103
Query: 125 SFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTFWAPNINIFRDPRWGRGQETPG 177
+FD L Y++ + + LEARA +N G G++ ++P INI R P WGR QET G
Sbjct: 104 TFDQGLIYKMAKTVALEARAKHNNFTKNGNFGDHTGLSCFSPVINILRHPLWGRNQETYG 163
Query: 178 EDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARV 237
EDP++T A +YV G+QGD L A+A CKHF AY TTR+ F A V
Sbjct: 164 EDPVLTSLMARAYVTGLQGDEI-------YLPATAVCKHFVAYGGPENIPTTRFSFSANV 216
Query: 238 TMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITS 297
+ D+ T+ P F CV G A G+MC+YN +NG+PSCA+ +L T R+++ F GY+ S
Sbjct: 217 SDHDIGTTFYPAFRECVHAG-AQGVMCSYNAINGVPSCANP-MLETTLRKKFHFDGYVVS 274
Query: 298 DCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA----AVKQKKLPESEI 353
D +A+ I + KS + L AG+D+ F + + + AV+Q + E+ +
Sbjct: 275 DENALENIDLYFNFTKSKLETAAVALNAGVDLELTGFGKTNRYSLLNQAVEQGLVTEAAL 334
Query: 354 DRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGL 413
R+ LF RM LG F+ P + + DVV S AH+ A++ A VLLKN G+
Sbjct: 335 RRSAKRLFRTRMALGEFDP-PEFNHWLNVPIDVVQSLAHRKQAVEVAAKSFVLLKND-GI 392
Query: 414 LPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI----TPLQALQNYVENTV--YYPG 467
LPL K ++++GP N+++ L G+Y P+ ++ +PL A + + V + G
Sbjct: 393 LPL-KQLYDKVSIVGPFINNSEALTGDY--PAEFNLKYFSSPLFAANSLSSSGVARFTTG 449
Query: 468 C------DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELI 521
C + C++ + ++ G+D V++ +G + E E DR D+ LPG+Q +LI
Sbjct: 450 CVGTNNQNLPICATYNSTNVKEVVTGSDIVLVTLGTGRGVEAESNDRRDINLPGKQLQLI 509
Query: 522 TRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
V + A PVI+VL GP+D+++ N +++ + + A+ EV+ G NP
Sbjct: 510 QDVVKYANGPVIVVLFNAGPLDVSWVM--GNTAAVIACHFSAQMTGEAMLEVLTGVVNPA 567
Query: 582 GRLPMTWYPQDYIKV-PMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKF 640
GRLP TW P +V PMTD M RTYR+ +FPFG GLSY+K+ Y
Sbjct: 568 GRLPNTW-PASMEQVPPMTDYSMHE--------RTYRYSTSSPLFPFGYGLSYTKFWYLD 618
Query: 641 KAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHP 700
V + Q +P +V + ++N G + G+
Sbjct: 619 AVVEPTTIQRCQ-----------------IP------------VVRVLIQNTGHLDGEEV 649
Query: 701 VLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRA 748
V +++ ++ + ++QLV FQ V + A E+ I P LSRA
Sbjct: 650 VQIYMTSKKKRDRELLRQLVAFQRVPIKAGEEVSISL---PIPHLSRA 694
>gi|308208211|gb|ADO20356.1| putative beta-D-xylosidase/alpha-L-arabinosidase [uncultured rumen
bacterium]
Length = 780
Score = 362 bits (930), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 267/797 (33%), Positives = 374/797 (46%), Gaps = 150/797 (18%)
Query: 44 PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
P+ +LP +RA+DLVSRLTL+EK S ++ + + LGI AY WWSEALHGVA G
Sbjct: 45 PYKDRSLPPEERAKDLVSRLTLEEKASLSMHPSAPVEALGIKAYNWWSEALHGVARNG-- 102
Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY-------NAGQAIGMTF 156
AT FPQ I AASFD L Y + A+ EAR Y + GQ G+TF
Sbjct: 103 --------AATVFPQPIGMAASFDEPLLYEVFTAVSDEARVKYKIAKESGHIGQYQGVTF 154
Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKH 216
W PNINIFRDPRWGRG ET GEDP +TG+ ++ VRG+QG + + L+A AC KH
Sbjct: 155 WTPNINIFRDPRWGRGMETYGEDPYLTGQMGMAVVRGLQGPSDS-----PVLKAHACAKH 209
Query: 217 FTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
+ + W R+ +DA V+ +DL +TY P F+ V + +M AYNR G P A
Sbjct: 210 YAVHSGPEW---NRHSYDAEVSERDLRETYLPAFKDLVTKANVQEVMTAYNRFRGEPCGA 266
Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIY--DAEGYAKSPEDAVVDVLKAGMDVNCGSF 334
L++ R +WG+ G ITSDC AV Y GY+ A + AG+D CG
Sbjct: 267 SDYLINTILRGEWGYKGLITSDCWAVEDFYVQGRHGYSPDVASAAAAAVHAGVDTECGQ- 325
Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
+H AV++ L E ++DR L LF+ R +LG + + + + A ++ P H
Sbjct: 326 AYRHIPEAVERGLLDEKDLDRNLIRLFTARYQLGEMD---DISLWDDLPASILEGPEHLA 382
Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
L+ + AQ+ +VLL+N G+LPL + V +AL+GPN + + GNY R++T A
Sbjct: 383 LSRKMAQESMVLLQNKGGILPL--APDVRVALVGPNGDDREMQWGNYNPVPGRTVTLYDA 440
Query: 455 LQNYVENTVYYPGCDTVACS---------------------------------------- 474
L+ Y GC V
Sbjct: 441 LKERFPGIKYVRGCGIVGAEFAPKPDPNNPLSQALGKSREEMEAIARQYAIGVQDILNYV 500
Query: 475 ------SASIDKAVDIA------KGADHVVLMMGLDQTQEKEEL----------DRVDLV 512
AS +D+ +G D V+ G+ E EE+ DR D+
Sbjct: 501 RRQERMQASFLPELDVQSVLKELEGIDVVIFAGGISPRFEGEEMPVNLPGFKGGDRTDIQ 560
Query: 513 LPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAE 572
LP Q++L+ + +A KK VILV G I + +IL A YPGE G +A+ +
Sbjct: 561 LPQVQRDLMKALHDAGKK-VILVNFSG--CAIGLVPETESCDAILQAWYPGEEGGLAITD 617
Query: 573 VIFGDHNPGGRLPMTWYP--QDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCG 630
V+FGD NP G+LP+T+Y +D DMK G TYR+++GK +FPFG G
Sbjct: 618 VLFGDVNPSGKLPVTFYRSVEDLPDFENYDMK----------GHTYRYFKGKPLFPFGYG 667
Query: 631 LSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVK 690
LSYS + YK V N L I VK
Sbjct: 668 LSYSTFRYKRAKVRNNSL-------------------------------------IIPVK 690
Query: 691 NHGEMAGKHPVLLFVKPARRGN-GRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAR 749
N G+ V ++V+ R+G+ P+K L F+ V + A + ++ L L +
Sbjct: 691 NTGKREATEVVQVYVR--RKGDPDGPVKTLRAFRRVTIPAGKTVKVCIPLEDETFLWWSE 748
Query: 750 EDGLMVIEEGTHFLVVG 766
E MV G + L+ G
Sbjct: 749 EAQDMVPLPGKYELLYG 765
>gi|340368019|ref|XP_003382550.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
queenslandica]
Length = 742
Score = 361 bits (927), Expect = 7e-97, Method: Compositional matrix adjust.
Identities = 261/759 (34%), Positives = 374/759 (49%), Gaps = 91/759 (11%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQLVNS-------APAIPRLGIPAYEWWSEALH 95
FPF T+L I R +D+V LTL+E + Q+ + AP IPRL I Y+W +E L
Sbjct: 24 FPFQNTSLSIEDRVKDIVDNLTLEELVEQMAHGGATLNGPAPGIPRLHINPYQWGTECLS 83
Query: 96 GVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA---- 151
G G ATSFP I AASF+ L R+ A E RA + A
Sbjct: 84 GNVSAGD----------ATSFPMPIGMAASFNYDLLKRVTNATAYEVRAKHAAAVKDGSY 133
Query: 152 ---IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKL 208
G++ W+P +NI RDPRWGR QET GEDP ++G +YV G+QG+ +
Sbjct: 134 AFHTGLSCWSPVLNIMRDPRWGRNQETYGEDPYLSGYLGQAYVNGLQGNN------SRYI 187
Query: 209 QASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
A+A CKHF + T+R+ FDA+V+M+D T+ P F++CV+ G S +MC+YNR
Sbjct: 188 IANAGCKHFDVHGGPENIPTSRFSFDAKVSMRDWRMTFLPQFKACVEAGALS-LMCSYNR 246
Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
+NG+P+CA++ LL+ R +W F GY+ SD A+ I YA A D AG
Sbjct: 247 INGVPACANKALLTDILRNEWDFKGYVVSDQGALEFIVIEHHYAPDFMKAAADAANAGTC 306
Query: 329 VNCGSFLQK------HTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKI 382
+ G+ +K H AVK + + A+ LF VRM+LG F+ P P+ I
Sbjct: 307 LEDGNIGRKFFNVFEHLVDAVKNNLVSVDTLKNAVSRLFYVRMKLGEFDP-PDNNPYANI 365
Query: 383 GADVVCSPAHQVLALQAAQDGIVLLKNSHGLLP--LPKSKSVSLA-LIGPNANSAKTLLG 439
V+ S AH L+LQAA + IVL+KN G LP + V A ++GP ++ + L G
Sbjct: 366 PLSVIQSDAHINLSLQAAMESIVLMKNDDGFRSPFLPITNEVKKACMVGPFSDDPEVLFG 425
Query: 440 NYAGPSCRS--ITPLQALQNY---VENTVYYPGC-DTVACSSASIDKAVDIAKGADHVVL 493
+Y+ R IT L L+N + Y GC D AC + K G + +++
Sbjct: 426 DYSPTLMRDYVITSLAGLKNANIGTDTLNYAVGCEDGPACRNYDSAKVRSACDGVELIIV 485
Query: 494 MMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKK--PVILVLLCGGPVDITFAKYDR 551
GL + E E D D+ LPG Q +L+ + AEAA K VIL+L P+DI +AK D
Sbjct: 486 TAGLSKHLESEGKDLSDINLPGHQLDLM-QDAEAASKNASVILILFNASPLDIRYAKTDP 544
Query: 552 NIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVP-MTDMKMRPQATSG 610
I IL A YPG+ A+A V+ G++NP GRLP TW P +VP +T+ M+
Sbjct: 545 RIVGILEAYYPGQTAGKAIANVLTGEYNPSGRLPNTW-PASLDQVPGITNYTMKE----- 598
Query: 611 NPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSV 670
RTYR++ + ++PFG GLSY+ + Y +S +S M+
Sbjct: 599 ---RTYRYFTQEPLYPFGYGLSYTTFHYSNLNISSTA----TASGAGMIA---------- 641
Query: 671 PELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAK 730
V++ V N G M G ++V + P QLVG ++
Sbjct: 642 --------------VSVLVTNTGSMDGTEVTQVYV--WCNISYAPKLQLVGVNKDFISKG 685
Query: 731 EKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEE 769
+ E+ F + P E L +D VI T+ L V ++
Sbjct: 686 KTLEVSFSIKP-EQLQVWTDDDKWVIPVSTYDLSVSGQQ 723
>gi|348688508|gb|EGZ28322.1| family 3 glycoside hydrolase [Phytophthora sojae]
Length = 701
Score = 361 bits (926), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 236/652 (36%), Positives = 338/652 (51%), Gaps = 86/652 (13%)
Query: 8 LVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDE 67
+V PLL L + + VD+ + P +C + + FC T+LP+S R DL++RL LDE
Sbjct: 1 MVLPLLLLVASVVAASVDA-EIPRACVGEH---QKLQFCNTSLPVSARVEDLLARLPLDE 56
Query: 68 KISQLVNSAPAIPR-----LGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILT 122
K L +A A PR +G+P Y W + +HGV GT TSFP +
Sbjct: 57 KAILL--TARASPRGNMSSIGLPEYNWGANCVHGVRSTC------GT-NCPTSFPNPV-- 105
Query: 123 AASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFR--DPRWGRGQETPGEDP 180
N++I R DPRWGR ETP EDP
Sbjct: 106 -------------------------------------NLSIHRRRDPRWGRNTETPSEDP 128
Query: 181 LVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQ 240
LV KY V+Y +G+Q + LQA KH+ AY +N+ G R F+A V+
Sbjct: 129 LVNSKYGVAYTKGLQEGKHEDPRY---LQAVVTLKHYVAYSYENYGGGNRKTFNAIVSPY 185
Query: 241 DLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCD 300
D ADTY P F S + G A G+MC+YN VNG+P+CA+ L +K R GF GYITSD
Sbjct: 186 DFADTYFPAFRSSIVDGNAKGVMCSYNSVNGVPACANNELENKLLRGMLGFDGYITSDSG 245
Query: 301 AVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFL--QKHTKAAVKQKKLPESEIDRALH 358
A+ I D Y + +A + AG DVN G K V+ +L +D L
Sbjct: 246 AIEAISDWLHYVPTRCEAARLAILAGTDVNSGRGFGYMACLKELVESNQLDVKVVDDVLR 305
Query: 359 NLFSVRMRLGLFNGNPTM-QPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLP 417
+ +R LGLF +P QP+ K+ + V + A + L+L A+ IVLL+N+ +LPL
Sbjct: 306 HTLKLRFELGLF--DPIEDQPYWKVTPNDVNTDAAKKLSLDLARKSIVLLQNNQPVLPL- 362
Query: 418 KSKSVSLALIGPNANSAKTLLGNYAGPSCRS--------ITPLQAL--QNYVENTVYYPG 467
+ V LA++GP+A + + LLGNY G C TP +A+ N +T Y G
Sbjct: 363 -RRGVKLAVVGPHAQAKRALLGNYLGQMCHGDYNEVGCIKTPFEAVSASNGDSSTTYALG 421
Query: 468 CDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEA 527
C+ S+A +AV +GA+ VVL +G+D++ E E DR ++ LP Q +L+ RV A
Sbjct: 422 CNVTGNSTAGFVEAVKAVQGAEAVVLFLGIDKSVEAEVRDRNNIDLPAIQVQLLQRV-RA 480
Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMT 587
KP ++VL+ GG +T +++ A YPG GA A+ +++FGD NPGG+LP+T
Sbjct: 481 VGKPTVVVLMNGGV--LTAEDIIGQTDALVEAFYPGFFGAQAMTDILFGDANPGGKLPVT 538
Query: 588 WYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYK 639
Y DY+ + M+ + PGR+YR+++G+ VFPFG GLSY+ +S K
Sbjct: 539 MYRSDYVNT----VDMKSMNVTAYPGRSYRYFKGEPVFPFGWGLSYTSFSLK 586
>gi|157676888|emb|CAP07659.1| beta-xylosidase [uncultured rumen bacterium]
Length = 761
Score = 360 bits (925), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 266/823 (32%), Positives = 385/823 (46%), Gaps = 149/823 (18%)
Query: 13 LCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQL 72
L LCF + L SC P S + + P RA+ L+ +L+L+EK +
Sbjct: 4 LRLCFRAALAAACMMAAFASCAPQEIS-----YTDKSQPAELRAKALLPKLSLEEKAGLV 58
Query: 73 VNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWY 132
++PA+ RLGI AY WWSEALHGVA G AT FPQ I AASFD
Sbjct: 59 QYNSPAVERLGIKAYNWWSEALHGVARNGS----------ATVFPQPIGMAASFDVEKIE 108
Query: 133 RIGQAIGLEARALYNAG-------QAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGK 185
+ A+ EAR Q G++FW PNINIFRDPRWGRG ET GEDP + G+
Sbjct: 109 TVFTAVSDEARVKNRIAAEDGRVYQYAGLSFWTPNINIFRDPRWGRGMETYGEDPYLMGQ 168
Query: 186 YAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADT 245
++ VRG+QGD L+ AC KH+ + + + R++FDA+V+ +DL +T
Sbjct: 169 LGMAVVRGLQGDPD-----ADVLKTHACAKHYAVH---SGLESNRHRFDAQVSERDLRET 220
Query: 246 YQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSII 305
Y P F+ V + +M AYNR G P A L+ K R +WG+ G + SDC A+
Sbjct: 221 YLPAFKDLVTKAGVKEVMTAYNRFRGYPCAASEYLVQKILREEWGYKGLVVSDCWAIPDF 280
Query: 306 YDA--EGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSV 363
++ G+ + E+A + G+DV CGS K AA+ Q L E ++DR L + +
Sbjct: 281 FEPGRHGFVATGEEAAALAVANGLDVECGSTFSK-IPAAIDQGLLKEEDLDRNLLRVLTE 339
Query: 364 RMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVS 423
R RLG +G P+ + +V P H+ L+L A++ +VLL+N +G+LPL + +
Sbjct: 340 RFRLGEMDGE---SPWDDLDPAIVEGPEHRALSLDIARETMVLLRN-NGVLPLKAGEKI- 394
Query: 424 LALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGC--------------- 468
ALIGPNA+ A+ GNY +IT LQA+Q V VY C
Sbjct: 395 -ALIGPNADDAQMQWGNYNPVPKSTITLLQAMQARVPGLVYDRACGILDAEYAPQGSAYA 453
Query: 469 DTVACSSASIDKAV--------DIA--------------------------KGADHVVLM 494
+ + S A ++ A DI +G D VV
Sbjct: 454 NLIGASEAQLEAAARRYAVSVNDIKNYIRRDEEQRRSFMPALDEAAVLKKLEGVDVVVFA 513
Query: 495 MGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDI 544
G+ E EE+ DR D+ LPG Q+ L+ + +A KK V+LV G I
Sbjct: 514 GGISPRLEGEEMRVQVPGFSGGDRTDIELPGVQRRLLKALHDAGKK-VVLVNFSG--CAI 570
Query: 545 TFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVP-MTDMKM 603
+ +IL A YPG+ G A+A+V+FGD NP G+LP+T+Y ++ ++P + D M
Sbjct: 571 GLVPETESCDAILQAWYPGQEGGTAIADVLFGDVNPSGKLPVTFY-KNVDQLPDVEDYNM 629
Query: 604 RPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQD 663
G TYR++ G+ ++PFG GLSY+ +++ V L
Sbjct: 630 E--------GHTYRYFRGEPLYPFGYGLSYTSFAFGEPKVKGKNL--------------- 666
Query: 664 VVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQ 723
I V N G +AG V L+V+ G P+K L F+
Sbjct: 667 ----------------------EIDVTNTGSVAGTEVVQLYVRKPDDTAG-PVKTLRAFR 703
Query: 724 SVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
V + A + ++ L L + +D MV G + L+ G
Sbjct: 704 RVSVPAGQTVKVSIPLDKETFLWWSEKDQDMVPVRGRYELLCG 746
>gi|405955586|gb|EKC22647.1| Putative beta-D-xylosidase 2 [Crassostrea gigas]
Length = 745
Score = 360 bits (924), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 239/757 (31%), Positives = 378/757 (49%), Gaps = 88/757 (11%)
Query: 41 ETFPFCKTTLPISQRARDLVSRLTLDEKISQL--------VNSAPAIPRLGIPAYEWWSE 92
+ +PF T+LP R +DLV RLT++E + Q+ + APA+PRLG+ + W +E
Sbjct: 24 QDYPFRNTSLPWDARVKDLVDRLTIEEIVVQMSRGGSGPRASPAPAVPRLGVGPFSWNTE 83
Query: 93 ALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA---- 148
L G G ATSFPQ + AA+F + + + A +E RA +N
Sbjct: 84 CLRGDVYAGN----------ATSFPQALGLAATFSTEVICDVASATSIEVRAKFNDYQRR 133
Query: 149 ---GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLK 205
G G++ ++P INI R P WGR QET GEDP ++G+ A +V+ +QGD
Sbjct: 134 KIYGDHKGISCFSPVINIMRHPLWGRNQETYGEDPFLSGELAAIFVKCLQGDD------P 187
Query: 206 GKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCA 265
++A+A CKHF + +R+ FDA+V+ +D T+ P F+ CV+ G S +MC+
Sbjct: 188 TYIRANAGCKHFDVHGGPENIPVSRFSFDAKVSERDWRLTFLPAFKRCVQAGSYS-LMCS 246
Query: 266 YNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKA 325
+NR+NG+P+C ++ LL+ R +WGF GY+ SD +A+ I Y + D +KA
Sbjct: 247 FNRINGVPACGNKRLLTDILRTEWGFTGYVVSDQEAIENIMTYHHYTNNSVDTAALCVKA 306
Query: 326 GMDVNCGSFLQKHTK----AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGK 381
G ++ + K T A+K KL + ++ +++ LF RMRLG F+ P P+
Sbjct: 307 GCNLELSTNEVKPTYFYIIDALKAGKLDKEDLVKSVSPLFYTRMRLGEFDP-PDHNPYNF 365
Query: 382 IGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNY 441
I V+ S H+ ++L AA VLLKN G LP+ K +++++GP A++ +G+Y
Sbjct: 366 IDLSVIQSEEHRAISLNAAMKSFVLLKNKGGFLPITKLFD-TISVLGPMADNKYQQIGSY 424
Query: 442 AG---PSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLD 498
A PS + TPLQ L + Y GC+ ACS + + +D + +G
Sbjct: 425 APDVMPS-YTTTPLQGLSKLSKRVQYAAGCNDNACSKYNRTEIQRAVNSSDIFFVCLGTG 483
Query: 499 QTQEKEELDRVDLVLPGRQQELITRVAEAAKK--PVILVLLCGGPVDITFAKYDRNIGSI 556
E E+ DR + LPG+Q +L+ + K P++L+L GGPV+IT+A + +I
Sbjct: 484 PMIENEDHDRASMELPGQQAQLLKDAIMFSAKGVPIVLLLFNGGPVNITWADRSDRVVAI 543
Query: 557 LWAGYPGEAGAVALAEVIFG---DHNPGGRLPMTWYPQDYIKVP-MTDMKMRPQATSGNP 612
+ +P + A+ V+ NP GRLP TW P+ ++P M + M
Sbjct: 544 MECFFPAQETGEAVLRVVTNTGNSSNPAGRLPYTW-PKYQDQIPSMVNYSME-------- 594
Query: 613 GRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPE 672
GRTYR++ G ++PFG GLSYS +++ N + + + QD+
Sbjct: 595 GRTYRYFHGDPLYPFGYGLSYSTFNFT-----------NAWMNPIISQGQDLT------- 636
Query: 673 LGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEK 732
V + V N G G + +++K PI QLVGF+ V L AKE
Sbjct: 637 ------------VRVEVCNEGPTDGDEVIQVYLKWLDTNETMPIHQLVGFERVSLRAKET 684
Query: 733 AEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEE 769
+ + E+++ E IE G + L +G ++
Sbjct: 685 LSWLITVR-AENMAVWNESRGFYIEPGRYRLYIGGQQ 720
>gi|363742357|ref|XP_003642627.1| PREDICTED: probable beta-D-xylosidase 5-like [Gallus gallus]
Length = 748
Score = 360 bits (923), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 230/644 (35%), Positives = 339/644 (52%), Gaps = 51/644 (7%)
Query: 41 ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNS-------APAIPRLGIPAYEWWSEA 93
+ FPF TLP +R DL+ RLT E + Q+ AP IPRLGI Y W +E
Sbjct: 25 QPFPFRDPTLPWHRRLEDLLGRLTPAEMVLQMARGGALGNGPAPPIPRLGIAPYNWNTEC 84
Query: 94 LHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-- 151
L G A AT+FPQ + AA+F L YR+ A E RA +N+ A
Sbjct: 85 LRGDAEAPGW---------ATAFPQALGLAAAFSPELVYRVANATATEVRAKHNSFVAAG 135
Query: 152 -----IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKG 206
G++ ++P +NI R P WGR QET GEDP +T + A S+V+G+QG +
Sbjct: 136 RYDDHTGLSCFSPVLNIMRHPLWGRNQETYGEDPYLTAELATSFVQGLQG------QHPR 189
Query: 207 KLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAY 266
++ASA CKHF+ + +R FDA+V +D T+ P F++CV+ G S MC+Y
Sbjct: 190 YIKASAGCKHFSVHGGPENIPVSRLSFDAKVLERDWHTTFLPQFQACVRAGSYS-FMCSY 248
Query: 267 NRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAG 326
NR+NG+P+CA++ LL+ R +WGF GY+ SD AV +I Y + + + + AG
Sbjct: 249 NRINGVPACANKKLLTDILRGEWGFEGYVVSDEGAVELILLGHRYTHTFLETAIASVNAG 308
Query: 327 MDVNCGSFLQK----HTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKI 382
+++ ++ H A+ + + + LF R+RLG F+ P M P+ +
Sbjct: 309 LNLELSYGMRNNVFMHIPKALAMGNITLEMLRDRVRPLFYTRLRLGEFD-PPAMNPYNAL 367
Query: 383 GADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA 442
VV S H+ L+L+AA VLLKN LPL + LA++GP A++ + L G+YA
Sbjct: 368 ELSVVQSSEHRNLSLEAAIKSFVLLKNQRDTLPLRELHGKRLAVVGPFADNPRVLFGDYA 427
Query: 443 G-PSCRSI-TPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQT 500
P + I TP + LQ N + GC C S D+ + +GAD V++ +G
Sbjct: 428 PVPEPQYIYTPRRGLQTLPANVSFAAGCREPRCWVYSRDEVENAVRGADVVLVCLGTGID 487
Query: 501 QEKEELDRVDLVLPGRQQELIT-RVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWA 559
E E DR DL LPG Q +L+ V AA PVIL+L GP+D+++A+ +G+IL
Sbjct: 488 VEMEARDRKDLSLPGHQLQLLQDAVRAAAGHPVILLLFNAGPLDVSWAQLHDGVGAILAC 547
Query: 560 GYPGEAGAVALAEVIFGDH--NPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYR 617
+P +A +A+A V+ G +P GRLP TW + PM + M GRTYR
Sbjct: 548 FFPAQATGLAIASVLLGKQGASPAGRLPATWPAGMHQVPPMENYTME--------GRTYR 599
Query: 618 FYEGKE--VFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMV 659
+Y G+E ++PFG GLSY+ + Y+ +S L + + S +V
Sbjct: 600 YY-GQEAPLYPFGYGLSYTTFHYRDLVLSPPVLPICANLSVSVV 642
>gi|255572557|ref|XP_002527212.1| beta-glucosidase, putative [Ricinus communis]
gi|223533388|gb|EEF35138.1| beta-glucosidase, putative [Ricinus communis]
Length = 349
Score = 359 bits (922), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 176/348 (50%), Positives = 230/348 (66%), Gaps = 26/348 (7%)
Query: 1 MIFHKLSLVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLV 60
M + SL F + L F ++ +S + + C P P ++ FC +L + RA L+
Sbjct: 1 MTLYLRSLFFSCIVL-FELSVSNSESHKLQYPCQP--PLHNSYTFCNQSLSVPTRAHSLI 57
Query: 61 SRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFN-GTIRGATSFPQV 119
S LTL+EKI QL ++A IPR GIP YEWWSE+LHG+A G G+ F G + AT FPQV
Sbjct: 58 SLLTLEEKIKQLSDNASGIPRFGIPPYEWWSESLHGIAINGPGVSFTIGPVSAATGFPQV 117
Query: 120 ILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGED 179
I++AA+F+ LW+ IG AI +EARA++N GQ+ G+TFWAPN+NIFRDPRWGRGQETPGED
Sbjct: 118 IISAAAFNRTLWFLIGSAIAIEARAMHNVGQS-GLTFWAPNVNIFRDPRWGRGQETPGED 176
Query: 180 PLVTGKYAVSYVRGVQGDTFNGGKLKGKLQA--------------------SACCKHFTA 219
P++T YA+ +V+G QG + G SACCKH TA
Sbjct: 177 PMLTSAYAIEFVKGFQGGNWKSGVSGSGSGRYGFGEKRMLRDDDGDDGLMLSACCKHLTA 236
Query: 220 YDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRN 279
YDL+ W +RY F+A VT QDL DTYQPPF SC+++G+AS +MC+YN VNG+P+CA +
Sbjct: 237 YDLEKWGNFSRYSFNAVVTEQDLEDTYQPPFRSCIEEGKASCLMCSYNEVNGVPACARED 296
Query: 280 LLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGM 327
LL K AR +WGF GYI SDCDAV+ I++ + Y+KS EDAV LKAGM
Sbjct: 297 LLQK-AREEWGFEGYIVSDCDAVATIFEYQNYSKSAEDAVAIALKAGM 343
>gi|405968899|gb|EKC33925.1| Putative beta-D-xylosidase 5 [Crassostrea gigas]
Length = 748
Score = 359 bits (921), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 246/752 (32%), Positives = 367/752 (48%), Gaps = 91/752 (12%)
Query: 39 STETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNS--------APAIPRLGIPAYEWW 90
++ FPF +L S+R DLV RLTLD+ + QL APAI LGI Y+W
Sbjct: 20 ASSNFPFQNVSLSWSERVDDLVGRLTLDQIVQQLARGGAGLNGGPAPAIENLGIGPYQWN 79
Query: 91 SEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-- 148
+E L G G ATSFPQ I AA+F L + + +A E RA +N
Sbjct: 80 TECLRGDVEAGN----------ATSFPQAIGLAAAFSKDLIFNVSKAAATEVRAKHNDFV 129
Query: 149 -----GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGK 203
G++ ++P +NI R P WGR QET GEDP ++G YA +V+G+QGD
Sbjct: 130 KRGIFTDHTGLSCFSPVVNIMRHPLWGRNQETYGEDPYLSGTYASYFVQGLQGDH----- 184
Query: 204 LKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIM 263
+QA+A CKHF A+ +R FDA+V+M+DL T+ P F+ CV+ G A +M
Sbjct: 185 -DRYIQANAGCKHFDAHGGPEDIPESRMGFDAKVSMRDLRLTFLPAFQKCVQAG-AYSLM 242
Query: 264 CAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVL 323
C+YN +NG+P+C+++ L+ R +W F GY+ SD A+ Y + EDA +
Sbjct: 243 CSYNSINGVPACSNKLLMMDILRGEWNFTGYVVSDEGAIENQISFHHYYNNSEDAAAGSV 302
Query: 324 KAGMDVNCGSFLQK----HTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
AG ++ L + AVK KL ES + + LF RMRLG F+ P M P+
Sbjct: 303 NAGCNLELSGNLTEPVFMKIGDAVKSGKLEESVVRNRVKPLFYTRMRLGEFD-PPEMNPY 361
Query: 380 GKIGADVVCSPAHQVLALQAAQDGIVLLK------NSHGLLPLPKSKSVSLALIGPNANS 433
+ V+ S H+ L+L AA +VLLK H + P + +A+IGP AN+
Sbjct: 362 SSVNLSVIQSEEHRNLSLTAAAKSLVLLKRPSKFSKRHLIGGFPSER---MAVIGPMANN 418
Query: 434 AKTLLGNYAGPSCRSI--TPLQALQNYVENTVYYPGC-DTVACSSASIDKAVDIAKGADH 490
+ G+Y+ + TPL+ L + Y GC D C + S D GAD
Sbjct: 419 TDQIFGDYSPTTDPRFVKTPLKGLTELNFSMNYAAGCVDGTRCLNYSQDDVKTALVGADL 478
Query: 491 VVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYD 550
VV+ +G + E E +DR D++LPG+Q +L+ V K V L++ GPV+IT+A+
Sbjct: 479 VVVCLGTGKDLESENVDRKDMMLPGKQLQLLQDVVSMTNKAVYLLVFSAGPVNITWAQES 538
Query: 551 RNIGSILWAGYPGEAGAVALAEVIF---GDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQA 607
+ IL YP ++ A+ + + G NP GRLP TWY MTD M
Sbjct: 539 ERVLIILQCFYPAQSAGDAITQALIMRDGRFNPAGRLPYTWYRYTEQIPEMTDYSM---- 594
Query: 608 TSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHY 667
+TYR++ G ++PFG GLSYS + + +KLY
Sbjct: 595 ----ARKTYRYFTGVPLYPFGYGLSYSTFVF-------SKLYF----------------- 626
Query: 668 KSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVIL 727
+P++ +V + V N G G + +++K P QLV F+ V +
Sbjct: 627 --LPKVNA----GDPNVVQVRVFNEGPFDGDEVLQVYIKWMSTKERMPRVQLVAFERVFI 680
Query: 728 NAKEKAEIVFELSPCESLSRAREDGLMVIEEG 759
+++ +I ++P +S++ ++ VIE G
Sbjct: 681 RSQQYVDISTAITP-QSMAIWKDGIGFVIEPG 711
>gi|336463686|gb|EGO51926.1| hypothetical protein NEUTE1DRAFT_125528 [Neurospora tetrasperma
FGSC 2508]
Length = 788
Score = 358 bits (919), Expect = 7e-96, Method: Compositional matrix adjust.
Identities = 257/762 (33%), Positives = 369/762 (48%), Gaps = 97/762 (12%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGT---IRGATSFPQVILTAASFDSYLWYRIGQA 137
R+G+P Y WWSE LHGVAG G+ FN T ATSF I ASFD L Y +G A
Sbjct: 13 RIGLPKYAWWSEGLHGVAG-SPGVTFNTTGYPFSYATSFANAINLGASFDDDLVYEVGTA 71
Query: 138 IGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGD 197
I EARA N G G+ +W PN+N ++DPRWGRG ETPGEDPL Y + + G++G+
Sbjct: 72 ISTEARAFANFGFG-GLDYWTPNVNPYKDPRWGRGAETPGEDPLHIKGYVKAMLAGLEGN 130
Query: 198 TFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQG 257
+ + A CKH+ AYDL+ W G TRY+F+A VT+QDL++ Y PPF+ C +
Sbjct: 131 -------ETVRKVIATCKHYAAYDLERWHGLTRYEFEAIVTLQDLSEYYLPPFQQCARDS 183
Query: 258 RASGIMCAYNRV-----------------NGIPSCADRNLLSKTARRQWGF---HGYITS 297
+ IMC+YN + P+CA+ L++ R W + + YITS
Sbjct: 184 KVGSIMCSYNALTIRDMAGGNPDEIINLTTAQPACANTYLMT-ILRDHWNWTEHNNYITS 242
Query: 298 DCDAV-SIIYDAEGYAKSPEDAVVDVLKAGMDVNC---GSFLQKHTKAAVKQKKLPESEI 353
DC+A+ + D ++++P +A KAG D C GS L A Q LPE+ I
Sbjct: 243 DCNAILDFLPDNHNFSQTPAEAAAAAYKAGTDTVCEVSGSPLTD-VVGAYNQSLLPEAVI 301
Query: 354 DRALHNLFSVRMRLGLFN------------GNPTMQPFGKIGADVVCSPAHQVLALQAAQ 401
D AL L+ +R G + G+ + + + + V +P+ Q LAL++A
Sbjct: 302 DTALRRLYEGLIRAGYLDHGRSSAVAGGDGGSFSSPAYDALNWEDVNTPSTQELALRSAT 361
Query: 402 DGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN 461
+GIVLLKNS LLPL S +ALIG AN+ T+ G Y+G PL A Q +
Sbjct: 362 EGIVLLKNSGSLLPLDFSSGKKVALIGHWANATGTMRGPYSGIPPFYHNPLYAAQQLNLS 421
Query: 462 TVYYPGCDTVACSSASIDKAVDIAKGADHVVLMM-GLDQTQEKEELDRVDLVLPGRQQEL 520
Y G A + A VVL G D T E+LDR + P Q +L
Sbjct: 422 FSYANGPVVNASDPDTWTAPALAAAEGADVVLYFGGTDTTVASEDLDRESIAWPKAQMKL 481
Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
++ +A KP++++ L G VD +F + N+ SILW GYPG++G A+ +V+ G P
Sbjct: 482 LSELA-GLGKPLVVIQL-GDQVDDSFLLENGNVSSILWVGYPGQSGGTAVFDVLTGKKAP 539
Query: 581 GGRLPMTWYPQDYI-KVPMTDMKMR-----------------------------PQATSG 610
GRLP+T YP+ Y+ +VP+T+M +R P++T G
Sbjct: 540 AGRLPVTQYPEGYVDEVPLTEMALRPFNHSSSTSSSSNPEEEVSVQGSGSLTIQPRSTPG 599
Query: 611 N-----PGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVV 665
N PGRTY++Y V PFG GL Y+ ++ S S ++
Sbjct: 600 NKTLSSPGRTYKWYS-NPVLPFGYGLHYTTFNVSLSLSSNASSPSPSFSIPSLLTPCTAT 658
Query: 666 HYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSV 725
H P F + ++I + N G + LLF+ P+K LV ++ V
Sbjct: 659 HLDLCP-----FSPSANSALSISITNTGTHTSDYVALLFLSGEFGPKPYPLKTLVSYKRV 713
Query: 726 ILNAKEKAEIVFELSPCE--SLSRAREDGLMVIEEGTHFLVV 765
+ K + + P ++SR DG V+ GT+ V
Sbjct: 714 -KDIKPGETVTVKDVPVSLGAISRVDGDGNTVLYPGTYRFAV 754
>gi|256394357|ref|YP_003115921.1| glycoside hydrolase family 3 [Catenulispora acidiphila DSM 44928]
gi|256360583|gb|ACU74080.1| glycoside hydrolase family 3 domain protein [Catenulispora
acidiphila DSM 44928]
Length = 1357
Score = 358 bits (918), Expect = 8e-96, Method: Compositional matrix adjust.
Identities = 262/801 (32%), Positives = 383/801 (47%), Gaps = 109/801 (13%)
Query: 34 DPSNPSTETFP-FCKTTLPISQRARDLVSRLTLDEKISQL-VNSAPAIPRLGIPAYEWWS 91
D + P+ + P + T+ P RA DLVSR+TL EK +QL SAPAIPRLG+ Y + +
Sbjct: 37 DTAAPAASSTPIYLNTSYPFEARAADLVSRMTLAEKAAQLNTTSAPAIPRLGVQQYTYQA 96
Query: 92 EALHGVAGVGKGIFFNGTIRG----ATSFPQVILTAASFDSYLWYRIGQAIGLEAR---- 143
EA HG+ +G G +G++ G ATSFP ++ S+D L Y+ A+ EAR
Sbjct: 97 EAQHGINYLG-GDQNSGSVAGNPPVATSFPTNFASSMSWDPALVYQETTAVSDEARGLVD 155
Query: 144 -ALYNAGQA-IG--------MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRG 193
+L+ GQ +G +TFWAP +N+ RDPRWGR E GEDP + G+ A ++V G
Sbjct: 156 KSLFGTGQNNLGPSASDYGSLTFWAPTVNLDRDPRWGRTDEAFGEDPYLVGQMAGAFVNG 215
Query: 194 VQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESC 253
QG++ G L G L+A+A KH+ D++ R + V+ DL D Y F
Sbjct: 216 FQGNSMTGQSLDGYLKAAATAKHYALNDVEQ----NRTGISSNVSDTDLRDYYTKQFADL 271
Query: 254 VKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDA--EGY 311
++ +G+M +YN +NG PS AD ++ A+R +GF+GY+TSDC AV Y G+
Sbjct: 272 IENSHVAGLMTSYNAINGTPSVADTYTANQLAQRTYGFNGYVTSDCGAVGTAYRNFPAGH 331
Query: 312 AKSPEDAVVD--------------------------VLKAGMDVNCGS--FLQKHTKAAV 343
A +P D L+AG VNCG F ++ +AA+
Sbjct: 332 AWAPPGWTTDGGDTNSIWTNTSTGAKISGAAGGEAYSLRAGTQVNCGGDEFSLQNIQAAI 391
Query: 344 KQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQ-PFGKIGADVVCSPAHQVLALQAAQD 402
L E ID L LF++RM G F+ P + P+ I + SPAHQ LA A +
Sbjct: 392 SAGILSEGVIDSDLTKLFTIRMETGEFD--PASKVPYTSITKAQIQSPAHQALATSVADN 449
Query: 403 GIVLLKN------SHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQ 456
+VLLKN S LLP SK ++ ++G AN + LG+Y+G + +Q L
Sbjct: 450 SLVLLKNANVSGTSAPLLPASASKLANVVILGDMAN--QVTLGDYSGAPSLQVNAVQGLT 507
Query: 457 NYVE------NTVYYPGCDTVACSSASIDKAVDIA--KGADHVVLMMGLDQTQEKEELDR 508
++ N ++ + +SA+ + A K AD VV+ +G +Q +E DR
Sbjct: 508 TAIKAANPSANILFDAAGTSSTTTSAATLSSATQAAIKKADLVVMFVGTNQNNAQEGNDR 567
Query: 509 VDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAV 568
L +PG LIT+ LV+ GPV I+ ++ +++++GY GE+
Sbjct: 568 TTLNMPGNYDSLITQTTALGNPKTALVVQSDGPVKIS--DVQGSVPAVVFSGYNGESQGT 625
Query: 569 ALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFG 628
ALA+V+ G NP G L TWY D M++ + P TSG GRTY+++ G +PFG
Sbjct: 626 ALADVLLGKQNPSGHLNFTWYADDSQLPAMSNYGLTPGDTSG-LGRTYQYFTGTPTYPFG 684
Query: 629 CGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIG 688
GLSYS ++Y V N S + N V+
Sbjct: 685 YGLSYSAFTYSAATVD------NASPNADGTVN-----------------------VSFK 715
Query: 689 VKNHGEMAGKHPVLLFVKP--ARRGNGRPIKQLVGFQSV-ILNAKEKAEIVFELSPCESL 745
V N G AG L+ G P K+LVGFQ +LN +I + +
Sbjct: 716 VTNSGSTAGATVAQLYAATQFTESGVQLPTKRLVGFQKTGVLNPGAAQQITIPVKISDLS 775
Query: 746 SRAREDGLMVIEEGTHFLVVG 766
V+ +GT+ L VG
Sbjct: 776 FWNATTMKSVVYDGTYALQVG 796
>gi|427385932|ref|ZP_18882239.1| hypothetical protein HMPREF9447_03272 [Bacteroides oleiciplenus YIT
12058]
gi|425726971|gb|EKU89834.1| hypothetical protein HMPREF9447_03272 [Bacteroides oleiciplenus YIT
12058]
Length = 732
Score = 357 bits (917), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 250/753 (33%), Positives = 386/753 (51%), Gaps = 99/753 (13%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVN---SAPAIPRLGIPAYEWWSEALHGVAGVG 101
F + + R DL+SRLTL++K +QL+N + I A + W++ LHGV
Sbjct: 33 FLNQEMSMEARVADLMSRLTLEQK-AQLLNHRGKTVVVDGFSIRA-DQWNQCLHGVKWT- 89
Query: 102 KGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA--------GQAIG 153
T+FP I A++D+ L +R+ I EARA+YN G+ G
Sbjct: 90 ---------EPTTNFPTSIALGATWDTELIHRVATVISDEARAIYNGWKQDPEFRGEHKG 140
Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
+ + +P INI R+P WGR E GEDP TG+ V+YV+G+QGD + KL ++
Sbjct: 141 LIYRSPVINISRNPYWGRINEIFGEDPYHTGRMGVAYVKGLQGDDSHYLKL------AST 194
Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
KH+ +++ R K A+V + L + + P F+ C+ +G+A +M +YN +NG+P
Sbjct: 195 LKHYAVNNVE----VDRMKLSAQVPERMLYEYWLPHFKDCIVEGKAQSVMASYNAINGVP 250
Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAK---SPEDAVVDVLKAGMDVN 330
+ ++ LL+ + QWG G++ SD V + EG+ + S E+AV + AG D +
Sbjct: 251 NNINKLLLTDILKNQWGHEGFVVSDLGGVKTM--VEGHHQRQISCEEAVGRSIMAGCDFS 308
Query: 331 CGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSP 390
+ +K+ A+++ L E ++ AL + VR RLG F+ ++ P+ +I DV+
Sbjct: 309 DAEY-EKYIPDALRKGYLTEERLNDALRRVLLVRFRLGEFDDFKSV-PYSRISPDVIGCK 366
Query: 391 AHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSIT 450
H+ L+L+AA+ IVLLKN LLP+ +S +A+IGP A+ GNY G +T
Sbjct: 367 EHRNLSLEAARKSIVLLKNEKKLLPIDRSIIKRVAVIGPYADLFNQ--GNYGGVPKDPVT 424
Query: 451 PLQALQNYVENTVYYPGCDTVACS----------------SASIDKAVDIAKGADHVVLM 494
PLQ ++N V N V C + A + KAV++A+ +D V L
Sbjct: 425 PLQGIKNAVGNNVEVLYCKGAQITPVKVRKGQPIPPRFDKEAEMKKAVEMARNSDVVFLF 484
Query: 495 MGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIG 554
+G E E DR LVLPG Q EL+ V E KK V++VL+ GPV + K +NI
Sbjct: 485 VGTTADIEVEGRDRKTLVLPGNQNELVKAVYEVNKK-VVVVLMSAGPVAVPEVK--KNIP 541
Query: 555 SILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGR 614
++L A +PG+ G A+A+V+FGD+NPGG+LP T Y D +VP TD + G
Sbjct: 542 AVLQAWWPGDEGGNAIADVLFGDYNPGGKLPYTMYASDE-QVPSTD------EYDISKGF 594
Query: 615 TYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELG 674
TY + + K +F FG GLSYSK+ Y +S + +N + S
Sbjct: 595 TYMYLKKKPLFAFGHGLSYSKFHYSDLQISSPVVSVNDTVS------------------- 635
Query: 675 TEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAE 734
V + VKN G+ G+ V L+V+ + RP K+L GF+ + L E+ E
Sbjct: 636 ----------VVLKVKNMGKRTGEEVVQLYVRDVKAKVVRPTKELRGFKRIALQPNEEQE 685
Query: 735 IVFELSPCESLSRARED-GLMVIEEGTHFLVVG 766
I L P +SL+ E G ++E G+ +++G
Sbjct: 686 IRLML-PVKSLAFYDESIGDFLVEPGSFEILLG 717
>gi|423279990|ref|ZP_17258903.1| hypothetical protein HMPREF1203_03120 [Bacteroides fragilis HMW
610]
gi|404584326|gb|EKA88991.1| hypothetical protein HMPREF1203_03120 [Bacteroides fragilis HMW
610]
Length = 722
Score = 357 bits (916), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 237/721 (32%), Positives = 374/721 (51%), Gaps = 79/721 (10%)
Query: 51 PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
PI+ R + L+ ++TL EK SQLV+ + +IPRL +PAY +W+E LHGVA G+
Sbjct: 57 PIAVRVKTLIQQMTLAEKASQLVSESDSIPRLNLPAYNYWNECLHGVARAGE-------- 108
Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY-NAGQAIGMTFWAPNINIFRDPRW 169
T FPQ I A+++D+ L R+ AI EAR Y G+ G+T+W+P IN+ RDPRW
Sbjct: 109 --VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGK--GLTYWSPTINMARDPRW 164
Query: 170 GRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTT 229
GR +ET GEDP +T + V++V+G+QGD L+ A KHF A + +N
Sbjct: 165 GRNEETYGEDPYLTSRLGVAFVKGLQGDH------PAYLKTVATIKHFVANNEEN----N 214
Query: 230 RYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQW 289
R+ +++ + L + Y P +E+CVK+ +M AYN NG+P R LL + R++W
Sbjct: 215 RFSSSSQIPTKQLYEYYFPAYEACVKEAGVQSVMTAYNAFNGVPPSGSRWLLGEVLRKEW 274
Query: 290 GFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLP 349
GF G++ SDC A+ ++ S E+A + +G D+ CG+ ++ AVKQ +
Sbjct: 275 GFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVKQGLIS 334
Query: 350 ESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKN 409
E+ ID+AL + + R +LG F+ + P+ ++ LA +AA +VLLKN
Sbjct: 335 EATIDQALTRVLTARFKLGEFD-PMELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLLKN 393
Query: 410 SHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCD 469
+ LLPL K K+ S+A++GP A+ LG Y+G S+T L+ +++ + +
Sbjct: 394 EN-LLPLSKEKTKSVAVVGPFAD--HNYLGGYSGQPPYSVTLLKGVKDLMGKRGKVNYLN 450
Query: 470 TVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAK 529
+ S SI AV KG D V++ +G D+ +E D + LP Q++L+ + +
Sbjct: 451 GIGASRDSIVAAV---KGVDVVLVALGSDEKMARENHDMTSIYLPEEQEKLLKAIYQVNP 507
Query: 530 KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWY 589
+ ++LV G P+ +A D +I +I+ A YPG+ ALA+++FG+ NP G+LPMT Y
Sbjct: 508 R-IVLVFHSGNPLTSEWA--DVHIPAIMQAWYPGQEAGRALADLLFGNENPSGKLPMTIY 564
Query: 590 PQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLY 649
+ + D M GRTYR+ + ++ FG GLSY+ S+ F +
Sbjct: 565 RAEDQLPDILDFDMW-------KGRTYRYMKEDPLYGFGHGLSYT--SFGFDGI------ 609
Query: 650 LNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPAR 709
Q S T + ++ + N G+ G+ V ++V
Sbjct: 610 --QGSDT--------------------LKSGARLQCSVELSNTGKWTGEEVVQVYVS--- 644
Query: 710 RGN----GRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVV 765
R N P+K+LV F+ V L EK + F + P E LS E+G + G + L +
Sbjct: 645 RENTPVYTYPLKKLVAFKKVKLAPGEKKRVEFNIPPRE-LS-VWENGNWRMLTGKYTLFI 702
Query: 766 G 766
G
Sbjct: 703 G 703
>gi|313145345|ref|ZP_07807538.1| beta-glucosidase [Bacteroides fragilis 3_1_12]
gi|313134112|gb|EFR51472.1| beta-glucosidase [Bacteroides fragilis 3_1_12]
Length = 722
Score = 357 bits (915), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 234/721 (32%), Positives = 372/721 (51%), Gaps = 79/721 (10%)
Query: 51 PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
PI+ R + L+ ++TL EK SQLV+ + +IPRL +PAY +W+E LHGVA G+
Sbjct: 57 PIAVRVKTLIQQMTLAEKASQLVSESDSIPRLNLPAYNYWNECLHGVARAGE-------- 108
Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY-NAGQAIGMTFWAPNINIFRDPRW 169
T FPQ I A+++D+ L R+ AI EAR Y G+ G+T+W+P IN+ RDPRW
Sbjct: 109 --VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGK--GLTYWSPTINMARDPRW 164
Query: 170 GRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTT 229
GR +ET GEDP +T + V++V+G+QGD L+ A KHF A + +N
Sbjct: 165 GRNEETYGEDPYLTSRLGVAFVKGLQGDH------PAYLKTVATIKHFVANNEEN----N 214
Query: 230 RYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQW 289
R+ +++ + L + Y P +E+CVK+ +M AYN NG+P R LL + R++W
Sbjct: 215 RFSSSSQIPTKQLYEYYFPAYEACVKEAGVQSVMTAYNAFNGVPPSGSRWLLGEVLRKEW 274
Query: 290 GFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLP 349
GF G++ SDC A+ ++ S E+A + +G D+ CG+ ++ AVKQ +
Sbjct: 275 GFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVKQGLIS 334
Query: 350 ESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKN 409
E+ ID+AL + + R +LG F+ + P+ ++ LA +AA +VLLKN
Sbjct: 335 EATIDQALTRVLTARFKLGEFD-PMELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLLKN 393
Query: 410 SHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCD 469
+ LLPL K K+ S+A++GP A+ LG Y+G S+T L+ +++ + +
Sbjct: 394 EN-LLPLSKEKTKSVAVVGPFAD--HNYLGGYSGQPPYSVTLLKGVKDLMGKRGKVNYLN 450
Query: 470 TVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAK 529
+ S SI AV KG D V++ +G D+ +E D + LP Q++L+ + +
Sbjct: 451 GIGASRDSIVAAV---KGVDVVLVALGSDEKMARENHDMTSIYLPEEQEKLLKAIYQVNP 507
Query: 530 KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWY 589
+ ++LV G P+ +A D +I +I+ A YPG+ ALA+++FG+ NP G+LPMT Y
Sbjct: 508 R-IVLVFHSGNPLTSEWA--DVHIPAIMQAWYPGQEAGRALADLLFGNENPSGKLPMTIY 564
Query: 590 PQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLY 649
+ + D M GRTYR+ + ++ FG GLSY+ + +
Sbjct: 565 RAEDQLPDILDFDMW-------KGRTYRYMKEDPLYGFGHGLSYTSFGF----------- 606
Query: 650 LNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPAR 709
++ D + + + E N G+ G+ V ++V
Sbjct: 607 -------DGIQGSDTLKSGTTLQCSVELS------------NTGKWTGEEVVQVYVS--- 644
Query: 710 RGN----GRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVV 765
R N P+K+LV F+ V L EK + F + P E LS E+G + G + L +
Sbjct: 645 RENTPVYTYPLKKLVAFKKVKLAPGEKKRVEFNIPPRE-LS-VWENGNWRMLTGKYTLFI 702
Query: 766 G 766
G
Sbjct: 703 G 703
>gi|308080460|ref|NP_001183746.1| uncharacterized protein LOC100502339 [Zea mays]
gi|238014360|gb|ACR38215.1| unknown [Zea mays]
Length = 344
Score = 356 bits (913), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 176/342 (51%), Positives = 230/342 (67%), Gaps = 10/342 (2%)
Query: 437 LLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG 496
L+ NY GP C S TPL+ LQ+YV + + GC++ AC A+ D+AV +A D+V L MG
Sbjct: 3 LIANYFGPPCESTTPLKGLQSYVNDVRFLAGCNSAACDVAATDQAVALAGSEDYVFLFMG 62
Query: 497 LDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSI 556
L Q QE E DR L+LPG QQ LIT VA+A+K+PVILVLL GGPVDITFA+ + IG+I
Sbjct: 63 LSQKQESEGKDRTSLLLPGMQQSLITAVADASKRPVILVLLSGGPVDITFAQSNPKIGAI 122
Query: 557 LWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTY 616
LWAGYPG+AG +A+A+V+FGDHNP GRLP+TWYP+++ KVPMTDM+MR TSG PGR+Y
Sbjct: 123 LWAGYPGQAGGLAIAKVLFGDHNPSGRLPVTWYPEEFTKVPMTDMRMRADPTSGYPGRSY 182
Query: 617 RFYEGKEVFPFGCGLSYSKYSYKF------KAVSQNKLYLNQSSSTKMVENQDVVHYKSV 670
RFY+G V+ FG GLSYS +S + A+S L + + + QD V
Sbjct: 183 RFYQGNTVYKFGYGLSYSTFSRRLVHGTSVPALSSTLL----TGLRETMTPQDGDRSYHV 238
Query: 671 PELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAK 730
+GTE CE KF + V+NHG M GKH VL+F++ GRP QL+GF+S L A
Sbjct: 239 DAIGTEGCEQLKFPAMVEVQNHGPMDGKHSVLMFLRWPNTKQGRPASQLIGFRSQHLKAG 298
Query: 731 EKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPI 772
E A++ F++SPC+ SR R DG VI+ G+HFL+V + E I
Sbjct: 299 ETAKLRFDISPCKHFSRVRADGRKVIDIGSHFLMVDNHEMEI 340
>gi|372209036|ref|ZP_09496838.1| glycoside hydrolase [Flavobacteriaceae bacterium S85]
Length = 859
Score = 355 bits (910), Expect = 6e-95, Method: Compositional matrix adjust.
Identities = 248/729 (34%), Positives = 378/729 (51%), Gaps = 82/729 (11%)
Query: 52 ISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIR 111
I R DL++ +TL+EKIS + P I RLGIP +EW+ EALHG+ I +N
Sbjct: 32 IEDRVNDLLANMTLEEKISYCGSRIPEIKRLGIPYFEWYGEALHGI------ISWN---- 81
Query: 112 GATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGR 171
T FPQ I A+++ L + + AI EARAL NAG+ M F +P +N+ RDPRWGR
Sbjct: 82 -CTQFPQNIAMGATWNPDLMFDVATAISNEARALKNAGKKEVMMF-SPTVNMARDPRWGR 139
Query: 172 GQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRY 231
E EDP + + A YVRG+QG N K ++ KH+ A +++ T R
Sbjct: 140 NGECYAEDPHLMSEMARMYVRGMQG---NDPKY---VKTVTTVKHYVANNVE----TKRE 189
Query: 232 KFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGF 291
+ + +DL + Y P +++C+ A+GIM A N +NGIP A L++ R +WGF
Sbjct: 190 WIHSNIGKKDLYEYYFPAYKTCIVDEEATGIMTALNGLNGIPCSAHDWLVNGVLRNEWGF 249
Query: 292 HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA--------AV 343
GY+ +D AV + YA S A +KAG+D C F K +A A+
Sbjct: 250 KGYVIADWAAVQGLEKRMKYASSQAQAAAMAIKAGVDQEC--FRNKVRQAPMVQALPDAL 307
Query: 344 KQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDG 403
+Q + E E+D + L +R G F+ +P++ P+ I V+ AH+ LAL+AA+
Sbjct: 308 QQGLITEKELDVTVKRLLRLRFMTGDFD-DPSLNPYSAIPTSVLECDAHKQLALKAAEQS 366
Query: 404 IVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV 463
IVLLKN +LPL K S+A+IGP A+ + +G Y+G ++PL ++ Y V
Sbjct: 367 IVLLKND-AVLPLKKDLK-SIAMIGPFAD--RCWMGIYSGHPKSKVSPLDGIKAYTNAKV 422
Query: 464 YYP-GCDTVACS--SASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQEL 520
+ GC+ A I +AV +AK ++ V+L++G D+T E DR + LPG Q +L
Sbjct: 423 SFAQGCEVTAKEDDEQKIAEAVALAKKSEQVILVVGNDETTSTENTDRKSIKLPGNQHQL 482
Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
I + +A K VILVL+ GP +T+ + +NI I+ A G+ ALA+V+FGD NP
Sbjct: 483 I-KAVQAVNKNVILVLVPSGPTAVTWEQ--KNIPGIVCAWPNGQEQGTALAKVLFGDVNP 539
Query: 581 GGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKF 640
GG+L TWY D D KM GN RTY +++GK ++PFG GLSY+ ++
Sbjct: 540 GGKLNATWYQSDKDLPNFHDYKM----AGGN--RTYMYFKGKPLYPFGYGLSYTNFTISD 593
Query: 641 KAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHP 700
++++ L N+ + K V N G +AG
Sbjct: 594 VSINKKTLQANEYVTVKA-----------------------------KVNNTGAVAGDEV 624
Query: 701 VLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSR--AREDGLMVIEE 758
V ++++ + P+K L GFQ + + A + ++ P E+ S +++ LMV +
Sbjct: 625 VQVYIRDVKSKEKTPLKALKGFQRISVAAGASKWVEIKI-PYEAFSHYNTKKEALMV-AK 682
Query: 759 GTHFLVVGD 767
G ++VG+
Sbjct: 683 GEFEILVGN 691
>gi|256393789|ref|YP_003115353.1| beta-glucosidase [Catenulispora acidiphila DSM 44928]
gi|256360015|gb|ACU73512.1| Beta-glucosidase [Catenulispora acidiphila DSM 44928]
Length = 1548
Score = 353 bits (907), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 253/783 (32%), Positives = 362/783 (46%), Gaps = 132/783 (16%)
Query: 47 KTTLPI--------SQRARDLVSRLTLDEKISQL-VNSAPAIPRLGIPAYEWWSEALHGV 97
KT LPI RA DLVSR+TL EK+ QL NS PAIPRLG+ Y +WSE HGV
Sbjct: 58 KTPLPIYLDTHYSAQARAADLVSRMTLPEKVEQLSTNSGPAIPRLGVQQYTYWSEGQHGV 117
Query: 98 A--GVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN-------- 147
G + NG +TSFP + S+D L Y+ AI EAR +
Sbjct: 118 NTLGANQDNGGNGGAVHSTSFPTNFASTMSWDPSLIYQETTAISDEARGFLDKSLFGVNQ 177
Query: 148 ------AGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNG 201
A +TFWAP +N+ RDPRWGR E GEDP +T A ++V G +G+T G
Sbjct: 178 NNLGPSAADYGSLTFWAPTVNMDRDPRWGRTDEAFGEDPYLTSTMAGAFVNGYEGNTPTG 237
Query: 202 GKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASG 261
G L+ +A KH+ D++ R + V+ DL D Y F S ++ SG
Sbjct: 238 QSKTGTLKVAATAKHYALNDVEQ----DRTGISSNVSDTDLHDYYTKQFASLIENAHVSG 293
Query: 262 IMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDA--EGYAKSPEDAV 319
+M +YN +NG PS AD ++ A+RQ+GF+GY+TSDC A+ Y + G+ +P
Sbjct: 294 LMTSYNAINGTPSVADTYTANQLAQRQFGFNGYVTSDCGAIGTAYQSFPSGHDWAPPGWT 353
Query: 320 VD--------------------------VLKAGMDVNC--GSFLQKHTKAAVKQKKLPES 351
D L+AG D+NC G AA+ L E
Sbjct: 354 TDGKSSTGTWTNTATGATVPAQAGGQAYALRAGTDLNCAGGENTYAQITAAISAGVLSEG 413
Query: 352 EIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKN-- 409
ID AL +F+VR+ G F+ P+ I + SPAHQ LA + A + +VLLKN
Sbjct: 414 VIDNALVKIFTVRVETGEFD-PAGSNPYTGITKAQIQSPAHQALATKVADNSLVLLKNQP 472
Query: 410 -----------------------SHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSC 446
+ LLPL + + + ++G AN+ LGNY+
Sbjct: 473 PAASGTSTTPPAASSAASSAAAAAKPLLPLSAAATAKIVIVGDMANAVT--LGNYSSDPA 530
Query: 447 RSITPLQALQNYVEN-----TVYYPGCDTVACSSASIDKA----VDIAKGADHVVLMMGL 497
++P+Q + V +V + C T +SA+ + D+A GAD V++ +G
Sbjct: 531 LKVSPVQGITAAVRKANPGASVTFDACGTSTTASAAASCSAQTLADVA-GADAVIVFVGT 589
Query: 498 DQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSIL 557
+Q E DR + +PG LI++VA ++L + GGPV I ++ SI+
Sbjct: 590 NQQIADEGKDRTSIAMPGNYDSLISQVAAVGNPRMVLAVQSGGPVRID--DVQKDFASIV 647
Query: 558 WAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYR 617
++G+ GE+ ALA+V+FG NP G L TWY D M++ + P A +G GRTY
Sbjct: 648 FSGFNGESQGTALADVLFGAQNPDGHLDFTWYADDSQLPAMSNYGLTP-AQTGGLGRTYM 706
Query: 618 FYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEF 677
++ G +PFG GLSYS +S+ + N S S
Sbjct: 707 YFTGTPTYPFGYGLSYSTFSFSGVHAEGRSVDANGSQS---------------------- 744
Query: 678 CETRKFLVTIGVKNHGEMAGKHPVLLFVKP--ARRGNGRPIKQLVGF-QSVILNAKEKAE 734
V++ VKN G+ AG L+ +P G P +QLVGF +S +L E
Sbjct: 745 -------VSVTVKNTGKTAGSTVAQLYAQPKFTVAGQTFPNEQLVGFAKSKVLKPGESQH 797
Query: 735 IVF 737
+
Sbjct: 798 LTI 800
>gi|116621778|ref|YP_823934.1| glycoside hydrolase family 3 protein [Candidatus Solibacter
usitatus Ellin6076]
gi|116224940|gb|ABJ83649.1| glycoside hydrolase, family 3 domain protein [Candidatus Solibacter
usitatus Ellin6076]
Length = 850
Score = 353 bits (905), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 204/456 (44%), Positives = 275/456 (60%), Gaps = 37/456 (8%)
Query: 39 STETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVA 98
+T PF L +RA DLV+R+TLDEK+ Q+ NSAPAIPRLGIPAY+WW+EALHGVA
Sbjct: 20 TTSQLPFMDPDLSAERRAADLVARMTLDEKVLQMQNSAPAIPRLGIPAYDWWNEALHGVA 79
Query: 99 GVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI------ 152
G AT FPQ I AA++D+ L +RI + I EARA YN +AI
Sbjct: 80 RAGL----------ATVFPQAIGLAATWDATLMHRIAETISTEARAKYN--EAIRNDDHS 127
Query: 153 ---GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQ 209
G+TFW+PNINIFRDPRWGRGQET GEDP +T + AV++++G+QG+ + K+
Sbjct: 128 RYRGLTFWSPNINIFRDPRWGRGQETYGEDPFLTSRMAVAFIKGMQGEDPHYYKVI---- 183
Query: 210 ASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRV 269
A KH Y + + ++R++FD + + +DLADTY P F + + + RA +MCAYNRV
Sbjct: 184 --ATAKH---YAVHSGPESSRHQFDVKPSPRDLADTYLPAFRASIVEARADSLMCAYNRV 238
Query: 270 NGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDV 329
+GIP+CA +LL K R +WGF G++ SDC AVS I+ Y A +KAG D+
Sbjct: 239 DGIPACASTDLLEKRLRGEWGFQGFVVSDCGAVSDIFRGHHYQPDAASASAVAVKAGTDL 298
Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
CG+ + AVK + E EI+R+L LF R +LG+F+ P PF I V S
Sbjct: 299 TCGNEYRALVD-AVKTGLITEPEINRSLERLFVARFKLGMFD-PPERVPFSNIPYSEVDS 356
Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
H+ +AL+AA+ IVLLKN G LPL KS +A+IGP A+ A+ LLGNY G S +
Sbjct: 357 AGHRKIALEAARKSIVLLKND-GTLPL-KSSIKKIAVIGPAADDAEALLGNYNGFSSLQV 414
Query: 450 TPLQALQNYVENTV---YYPGCDTVACSSASIDKAV 482
TPL +++ Y G + A S A + +V
Sbjct: 415 TPLAGIEHQWAGKAEVRYALGANYTAQSQAPLPASV 450
Score = 102 bits (255), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 80/269 (29%), Positives = 123/269 (45%), Gaps = 55/269 (20%)
Query: 481 AVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKK 530
A++ AD + +GL+ + E EE+ DR +L LP Q++LI A A K
Sbjct: 597 AIEAVSNADVTLAFVGLNPSLEGEEMPVSVPGFQGGDRTNLELPEPQEKLI-EAAIATGK 655
Query: 531 PVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYP 590
PV++VL G V + FA ++ ++L Y GE A+A+ + G +NP GRLP+T+Y
Sbjct: 656 PVVVVLASGSAVAMNFAA--QHASALLETWYNGEETGTAIADTLAGINNPSGRLPVTFYR 713
Query: 591 QDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYL 650
P + M+ GRTYR++ G ++ FG GLSYSK+ Y + L
Sbjct: 714 SVDQLPPFEEYAMK--------GRTYRYFNGDALYSFGFGLSYSKFQY-------SALKT 758
Query: 651 NQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARR 710
++ S +V ++ V+N + G V L+V
Sbjct: 759 RRAGSGTIVASR--------------------------VRNASSIEGDEVVQLYVN-GSG 791
Query: 711 GNGRPIKQLVGFQSVILNAKEKAEIVFEL 739
+G PI+ L GFQ + L E E+ F L
Sbjct: 792 ADGDPIRSLRGFQRIHLRPGESREVHFPL 820
>gi|115436096|ref|NP_001042806.1| Os01g0296700 [Oryza sativa Japonica Group]
gi|113532337|dbj|BAF04720.1| Os01g0296700, partial [Oryza sativa Japonica Group]
Length = 522
Score = 352 bits (903), Expect = 4e-94, Method: Compositional matrix adjust.
Identities = 205/526 (38%), Positives = 300/526 (57%), Gaps = 30/526 (5%)
Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSI-IYDAEGYAKSPEDAVVDVLKAGM 327
+NG+P+CAD LL++T RR W HGYI SDCD+V + + DA+ + +A +KAG+
Sbjct: 1 INGVPACADARLLTETVRRDWQLHGYIVSDCDSVRVMVRDAKWLGYTGVEATAAAMKAGL 60
Query: 328 DVNCG-------SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFG 380
D++CG F + AV+Q KL ES +D AL NL+ MRLG F+G P ++
Sbjct: 61 DLDCGMFWEGVHDFFTTYGVDAVRQGKLKESAVDNALTNLYLTLMRLGFFDGIPELE--- 117
Query: 381 KIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGP--NANSAKTLL 438
+GA VC+ H+ LA AA+ G+VLLKN LLPL K S+AL G + N+ +L
Sbjct: 118 SLGAADVCTEEHKELAADAARQGMVLLKNDAALLPLSPEKVNSVALFGQLQHINATDVML 177
Query: 439 GNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLD 498
G+Y G CR +TP ++ V +T + AC S D A AK D +++ GL+
Sbjct: 178 GDYRGKPCRVVTPYDGVRKVVSSTSVH------ACDKGSCDTAAAAAKTVDATIVVAGLN 231
Query: 499 QTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILW 558
+ E+E DR DL+LP Q I VAEA+ P++LV++ G VD++FA+ + IG+++W
Sbjct: 232 MSVERESNDREDLLLPWSQASWINAVAEASPSPIVLVIMSAGGVDVSFAQDNPKIGAVVW 291
Query: 559 AGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYR 617
AGYPGE G A+A+V+FG +NPGGRLP+TWY +Y+ K+PMT M +RP A G PGRTY+
Sbjct: 292 AGYPGEEGGTAIADVLFGKYNPGGRLPLTWYKNEYVSKIPMTSMALRPDAEHGYPGRTYK 351
Query: 618 FYEGKEV-FPFGCGLSYSKYSYKFKAVSQNKLYLNQSS---STKMVENQDVVHYKSVP-- 671
FY G +V +PFG GLSY+ ++Y A + + + + ++ V + P
Sbjct: 352 FYGGADVLYPFGHGLSYTNFTYA-SATAAAPVTVKVGAWEYCKQLTYKAGVSSPPACPAV 410
Query: 672 ELGTEFC-ETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAK 730
+ + C E F VT V N G G H V ++ P +G P KQLV F+ V + A
Sbjct: 411 NVASHACQEEVSFAVT--VANTGGRDGTHVVPMYTAPPAEVDGAPRKQLVAFRRVRVAAG 468
Query: 731 EKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
E+ F L+ C++ + E V+ G ++VGD+ +S V
Sbjct: 469 AAVEVAFALNVCKAFAIVEETAYTVVPSGVSRVLVGDDALSLSFPV 514
>gi|336275603|ref|XP_003352555.1| hypothetical protein SMAC_01389 [Sordaria macrospora k-hell]
gi|380094444|emb|CCC07823.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 833
Score = 350 bits (898), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 256/791 (32%), Positives = 373/791 (47%), Gaps = 131/791 (16%)
Query: 46 CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
C +T RA LV +LT+DEK+ LV+ + PRLG+P Y WWSE LHGVAG G+
Sbjct: 37 CDSTASAPDRAASLVEQLTIDEKLVNLVDQSKGAPRLGLPPYAWWSEGLHGVAG-SPGVV 95
Query: 106 FNGT---IRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
FN + ATSF VI A+ D L Y +G AI EARA G G+ +W PNIN
Sbjct: 96 FNTSGYPFSYATSFANVITLGAALDDDLVYEVGTAISTEARAFAKFGFG-GLDYWTPNIN 154
Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL 222
++DPRWGRG ETPGEDPL Y + V G++G+ G ++ + A CKHF AYDL
Sbjct: 155 PYKDPRWGRGAETPGEDPLRIKGYVKAMVAGLEGN----GTVR---KVIATCKHFAAYDL 207
Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMC------------------ 264
+ W+G TRY FDA V++QDL++ Y PPF+ C + R IMC
Sbjct: 208 ERWRGLTRYDFDAVVSLQDLSEYYLPPFQQCARDSRVGSIMCRYVSFFLPPFPSFPRLVT 267
Query: 265 --------------AYNRVNGIPSCADRNLLSKTARRQWGF---HGYITSDCDAV-SIIY 306
+YN +NG P+CA L++ R W + + YITSDC+A+ +
Sbjct: 268 RQSGNQVDIVDNFRSYNALNGTPACASTYLMTNILRDHWNWTNHNNYITSDCNAIQDFLP 327
Query: 307 DAEGYAKSPEDAVVDVLKAGMDVNC---GSFLQKHTKAAVKQKKLPESEIDRALHNLFSV 363
D ++++P +A AG D C G A Q L ES ID AL L+
Sbjct: 328 DNHNFSQTPAEAAAAAYIAGTDTVCEVSGWPPYTDVVGAYNQSLLSESVIDTALRRLYEG 387
Query: 364 RMRLGLFN------GNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLP 417
+R G + +P PF +S LPL
Sbjct: 388 LIRAGYLDHGRPASSSPDKAPF-----------------------------SSPDFLPLD 418
Query: 418 KSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSAS 477
+ ++ALIG AN+ +T+ G Y+G P+ A++ ++ + YY V + A
Sbjct: 419 LTGK-TVALIGHWANATRTIRGPYSGLPPFYHNPMYAVRQ-LKLSFYYANGPVVNSTDAD 476
Query: 478 --IDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILV 535
A+ A+ AD V+ G D T E+LDR + P Q LI ++A+ K V++
Sbjct: 477 TWTAAAMLAAESADVVLYFGGTDTTVASEDLDRESIAWPKTQLTLIEKLAQVGKPMVVIQ 536
Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI- 594
L G VD T ++NI SILW GYPG++G A+ +V+ G GRLP+T YP Y+
Sbjct: 537 L--GDQVDDTPLLNNKNISSILWVGYPGQSGGTAVFDVLTGKKASAGRLPVTQYPAGYVD 594
Query: 595 KVPMTDMKMRP---------------------------QATSGN-----PGRTYRFYEGK 622
+VP+T+M +RP ++T GN PGRTY++Y +
Sbjct: 595 EVPLTEMGLRPFNHSSSTTSSDVSQSGVEEGNGLTIQTRSTRGNKTLSSPGRTYKWYP-R 653
Query: 623 EVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRK 682
V PFG GL Y+ ++ + + +++ + + ++ ++ + + C
Sbjct: 654 PVLPFGYGLHYTPFNISLSLSTSSNASSTTDNTS--ISIRSLLTSQTCTAIHLDLCPFSP 711
Query: 683 FLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVI-LNAKEKAEIVFELSP 741
F V+I N G + LLF+ P+K LVG++ V + E + E P
Sbjct: 712 FSVSI--TNTGSHTSDYVALLFLSGKFGPKPDPLKTLVGYKRVKDIKPGETRVVGGEDIP 769
Query: 742 CESLSRAREDG 752
+ AR DG
Sbjct: 770 VNLAAVARVDG 780
>gi|443717728|gb|ELU08656.1| hypothetical protein CAPTEDRAFT_228276 [Capitella teleta]
Length = 731
Score = 349 bits (896), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 247/713 (34%), Positives = 370/713 (51%), Gaps = 67/713 (9%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDE----KISQLVNSAPAIPRLGIPAYEWWSEALHGVA 98
FPF TL +R DLV RLT++E ++Q S + RLG+ Y++ +E + GV
Sbjct: 20 FPFEDVTLSWDKRVDDLVQRLTIEEVVNISVAQYGKSTIPVDRLGVKPYQFINECITGV- 78
Query: 99 GVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI------ 152
+T+FPQ I ASF L + + QAI E R YN
Sbjct: 79 ----------RWENSTAFPQAIGLGASFSPDLAFNMSQAIARELRGFYNTEVKSQIYGHR 128
Query: 153 GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASA 212
G+ + P INI R P WGR QET GEDP ++G+ +V +V+G+QGD +QAS
Sbjct: 129 GVNCFTPVINIMRHPLWGRNQETYGEDPWLSGQLSVGFVKGLQGDH------PRYIQASG 182
Query: 213 CCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGI 272
CKHF ++ +R+ FDA+V+ +D T+ P F++CV+ G + IMC+YNR+NG+
Sbjct: 183 GCKHFDVHNGPENIPVSRFGFDAKVSERDWRMTFLPQFKTCVEAGSIN-IMCSYNRINGV 241
Query: 273 PSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVN-- 330
P+CA++ LL+ R++WGF+GY+ SD A+ I Y K+ +A D +KAG +V
Sbjct: 242 PACANKKLLTDILRKEWGFNGYVISDSGAIENIVYHHKYTKTLAEAAADSVKAGCNVELT 301
Query: 331 --CGSFLQK-HTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPT-MQPFGKIGADV 386
GS + + AVKQ + E E+ L RMR G F +P M PF KI V
Sbjct: 302 GATGSGVAYFNLLNAVKQNLISEEELRENLKKPMYSRMRQGEF--DPVDMNPFTKIDMSV 359
Query: 387 VCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA---G 443
V S HQ LA++A+ VL+KN + +LPL K + LA+IGP A++A+TL G+Y
Sbjct: 360 VLSQEHQDLAVKASAMSFVLMKNLNRVLPL-KKRFDRLAIIGPFADNAETLFGDYIPNWD 418
Query: 444 PSCRSITPLQALQNYVENTVYYPGCDTVACSS---ASIDKAVDIAKGADHVVLMMGLDQT 500
P S TP + L++ ++ Y GCD +C++ +I+KAV KGA V + +G+
Sbjct: 419 PKFVS-TPYEGLKSLGDDVRYASGCDDPSCTNYDPKAIEKAV---KGAQFVFVCLGVGSN 474
Query: 501 QEKEELDRVDLVLPGRQQELITRVAEAAKK-PVILVLLCGGPVDITFAKYDRNIGSILWA 559
E+E DR DL LPG Q +++ +++ P++LVL GPVD+T+ K + I+
Sbjct: 475 LEREGHDRADLDLPGYQLQILKDAEFFSREAPLVLVLFNAGPVDLTWPKLSPEVDGIIEC 534
Query: 560 GYPGEAGAVALAEVIFG---DHNPGGRLPMTWYPQDYIKVP-MTDMKMRPQATSGNPGRT 615
YP AL +V+ D P RLP TW P +VP +TD M G T
Sbjct: 535 FYPAMGTGKALYQVVTATGDDGVPAARLPSTW-PAQLHQVPSITDYNMT--------GHT 585
Query: 616 YRFYEGKE-VFPFGCGLSYSKYSYKFKAVSQNKLYLNQS-SSTKMVENQDVVHYKSVPEL 673
YR+++G + ++PFG GLSY+ + Y+ +VS + + + T V N+ + V ++
Sbjct: 586 YRYFDGGDPLYPFGYGLSYTSFHYQTVSVSPTSVRAGGNVTVTVQVLNRGPYNADEVTQV 645
Query: 674 GTEFCETR---KFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQ 723
+ E +G K H + L FV A + + + GFQ
Sbjct: 646 YMSWMEATVPVPRWTLVGFKRHRHTVNQSSSLSFVVSAEQ-MAVWVDEATGFQ 697
>gi|164428543|ref|XP_964543.2| hypothetical protein NCU00709 [Neurospora crassa OR74A]
gi|157072187|gb|EAA35307.2| hypothetical protein NCU00709 [Neurospora crassa OR74A]
Length = 786
Score = 349 bits (896), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 259/761 (34%), Positives = 371/761 (48%), Gaps = 97/761 (12%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGT---IRGATSFPQVILTAASFDSYLWYRIGQA 137
RLG+P Y WWSE LHGVAG G+ FN T ATSF I ASFD L Y +G A
Sbjct: 13 RLGLPKYAWWSEGLHGVAG-SPGVKFNTTGYPFSYATSFANAINLGASFDDDLVYEVGTA 71
Query: 138 IGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGD 197
I EARA N G G+ +W PN+N ++DPRWGRG ETPGEDPL Y + + G++G+
Sbjct: 72 ISTEARAFANFGFG-GLDYWTPNVNPYKDPRWGRGAETPGEDPLHIKGYVKAILAGLEGN 130
Query: 198 TFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQG 257
+ + A CKH+ AYDL+ W G TRY+F+A VT+QDL++ Y PPF+ C +
Sbjct: 131 -------ETVRKVIATCKHYAAYDLERWHGLTRYEFEAIVTLQDLSEYYLPPFQQCARDS 183
Query: 258 RASGIMCAYNRV-----------------NGIPSCADRNLLSKTARRQWGF---HGYITS 297
+ IMC+YN + P+CA L++ R W + + YITS
Sbjct: 184 KVGSIMCSYNALTIRDMASGKPDEEINLTTAQPACAKPYLMT-ILRDHWNWTEHNNYITS 242
Query: 298 DCDAV-SIIYDAEGYAKSPEDAVVDVLKAGMDVNC---GSFLQKHTKAAVKQKKLPESEI 353
DC+A+ + D ++++P +A KAG D C GS L A Q LPE+ I
Sbjct: 243 DCNAILDFLPDNHNFSQTPAEAAAAAYKAGTDTVCEVSGSPLTD-VVGAYNQSLLPEAVI 301
Query: 354 DRALHNLFSVRMRLGLFN------------GNPTMQPFGKIGADVVCSPAHQVLALQAAQ 401
D AL L+ +R G + G+ + + + + V +P+ Q LAL++A
Sbjct: 302 DTALRRLYEGLIRAGYLDHGRSSAVAGGDGGSFSSPAYDALNWEDVNTPSTQELALRSAT 361
Query: 402 DGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN 461
+GIVLLKN+ LLPL S +ALIG AN+ T+ G Y+G PL A Q +
Sbjct: 362 EGIVLLKNAGSLLPLDFSGK-KVALIGHWANATGTMRGPYSGIPPFYHNPLYAAQQLNLS 420
Query: 462 TVYYPGCDTVACSSASIDKAVDIAKGADHVVLMM-GLDQTQEKEELDRVDLVLPGRQQEL 520
Y G A + A VVL G D T E+LDR + P Q +L
Sbjct: 421 FSYANGPVVNASDPDTWTAPALAAAEGADVVLYFGGTDTTVASEDLDRESIAWPETQMQL 480
Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
++ +A KP++++ L G VD + + N+ SILW GYPG++G A+ +V+ G P
Sbjct: 481 LSELA-GLGKPLVVIQL-GDQVDDSSLLNNGNVSSILWVGYPGQSGGTAVFDVLTGKKAP 538
Query: 581 GGRLPMTWYPQDYI-KVPMTDMKMR-------------------------PQATSGN--- 611
GRLP+T YP+ Y+ +VP+T+M +R P++T GN
Sbjct: 539 AGRLPVTQYPEGYVDEVPLTEMALRPFNYSSSSNLEQEVSVQGRGSLTIQPRSTPGNKTL 598
Query: 612 --PGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQ---DVVH 666
PGRTY++Y V PFG GL Y+ ++ S N + S S + H
Sbjct: 599 SSPGRTYKWYS-SPVLPFGYGLHYTTFNVSLSLSSSNASSSSSSPSFSIPSLLTPCTATH 657
Query: 667 YKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVI 726
P F + +++ + N G + VLLF+ P+K LV ++ V
Sbjct: 658 LDLCP-----FSPSANSALSVSITNTGTHTSDYVVLLFLSGEFGPKPYPLKTLVSYKRV- 711
Query: 727 LNAKEKAEIVFELSPCE--SLSRAREDGLMVIEEGTHFLVV 765
+ K + + P ++SR DG V+ GT+ VV
Sbjct: 712 KDIKPGETVTVKDVPVSLGAISRVDGDGNTVLYPGTYRFVV 752
>gi|219887077|gb|ACL53913.1| unknown [Zea mays]
gi|224035251|gb|ACN36701.1| unknown [Zea mays]
gi|413919685|gb|AFW59617.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 405
Score = 349 bits (895), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 191/414 (46%), Positives = 259/414 (62%), Gaps = 11/414 (2%)
Query: 365 MRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSL 424
MRLG F+G+P PFG +G VC+P++Q LA +AA+ GIVLLKN+ G LPL + S+
Sbjct: 1 MRLGFFDGDPRELPFGNLGPSDVCTPSNQELAREAARQGIVLLKNT-GKLPLSATSIKSM 59
Query: 425 ALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASI--DKAV 482
A+IGPNAN++ T++GNY G C+ TPLQ L V TVY PGC V CS S+ D A
Sbjct: 60 AVIGPNANASFTMIGNYEGTPCKYTTPLQGLGANVA-TVYQPGCTNVGCSGNSLQLDAAT 118
Query: 483 DIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPV 542
A AD VL++G DQ+ E+E LDR L+LPG+Q +L++ VA A+ P ILV++ GGP
Sbjct: 119 KAAASADVTVLVVGADQSIERESLDRTSLLLPGQQPQLVSAVANASSGPCILVVMSGGPF 178
Query: 543 DITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMK 602
DI+FAK I +ILW GYPGEAG A+A+V+FG HNP GRLP+TWYP+ + KVPMTDM+
Sbjct: 179 DISFAKSSDKIAAILWVGYPGEAGGAAIADVLFGYHNPSGRLPVTWYPESFTKVPMTDMR 238
Query: 603 MRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQ 662
MRP ++G PGRTYRFY G V+ FG GLSY+ +++ + + +L L + + Q
Sbjct: 239 MRPDPSTGYPGRTYRFYTGDTVYAFGDGLSYTSFAHHLVSAPK-QLALQLAEGHACLTEQ 297
Query: 663 DVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGF 722
SV G CE F V + V+N GE +G H V LF P N P K L+GF
Sbjct: 298 ----CPSVEAEGAH-CEGLAFDVHLRVRNAGERSGGHTVFLFSSPPAVHNA-PAKHLLGF 351
Query: 723 QSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIFV 776
+ V L + + F++ C+ LS E G + G+H L VGD ++ +++ V
Sbjct: 352 EKVSLEPGQAGVVAFKVDVCKDLSVVDELGNRKVALGSHTLHVGDLKHTLNLGV 405
>gi|390340546|ref|XP_001186857.2| PREDICTED: probable beta-D-xylosidase 2-like [Strongylocentrotus
purpuratus]
Length = 623
Score = 347 bits (891), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 226/607 (37%), Positives = 324/607 (53%), Gaps = 52/607 (8%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQLVNS-------APAIPRLGIPAYEWWSEALH 95
PF +LP QR DL+SRL +D+ QL APAI RL I Y W +E L
Sbjct: 30 LPFWNQSLPWDQRLDDLLSRLKVDDMTYQLARGGADPNGPAPAIGRLQIGKYVWNTECLR 89
Query: 96 GVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA------- 148
G A G AT+FPQ + +A+F L + + A G E RA YN
Sbjct: 90 GDAQAGN----------ATAFPQALGLSAAFSRDLLFEVANATGYEVRAKYNYYLQKGDF 139
Query: 149 GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKL 208
G+ ++P INI R P WGR QET GEDP +TG+ A S+V G+QG+ L
Sbjct: 140 NNHQGLNCFSPVINIMRHPYWGRNQETYGEDPYLTGELAKSFVWGLQGNH------PRYL 193
Query: 209 QASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
+A CKHF AY ++R+ FDA+V+ +DL T+ P F+ C+K G S +MC+YN
Sbjct: 194 LTNAGCKHFAAYSGPENYPSSRFSFDAKVSDKDLQVTFFPAFKECIKAGTYS-VMCSYNS 252
Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
VNGIP+CA+ LL+ R +WGF GY+ SD A+ + A Y S D + LKAG +
Sbjct: 253 VNGIPACANSYLLNDVLRTEWGFKGYVVSDQRALELEELAHNYTTSYLDTAIKSLKAGCN 312
Query: 329 VNCGSF---LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD 385
++ G+ + + AV+ L ++ ++ LF R+RLG F+ P P+ K+ D
Sbjct: 313 LDLGTTKPAVYDYLAEAVELGMLTAQDLRDSIAPLFYTRLRLGEFDP-PDHNPYVKLNVD 371
Query: 386 -VVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA-G 443
VV SP HQ +AL+AA VL+KN LP+ + +LA++GP AN++K L G+YA
Sbjct: 372 QVVESPEHQEIALKAALKSFVLVKNDGSTLPIEGTIH-TLAVVGPFANNSKLLFGDYAPN 430
Query: 444 PSCRSITP-LQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQE 502
P R +T L+ L T + GC + C + ++ GAD VV+ +G E
Sbjct: 431 PDPRFVTTVLEGLSPMATKTRHASGCPSPKCVTYDQQGVLNAVTGADVVVVCLGTGIELE 490
Query: 503 KEELDRVDLVLPGRQQELITRVAE-AAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGY 561
E DR D++LPG+Q++L+ A AA KPVIL+L GP++IT+A ++ +I+ +
Sbjct: 491 SEGNDRRDMLLPGKQEQLLQDAARYAAGKPVILLLFNAGPLNITWALSSPSVQAIVECFF 550
Query: 562 PGEAGAVALAEVIFGD---HNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRF 618
P +A VAL ++F + NPGGRLP TW P T ++ P GRTYR+
Sbjct: 551 PAQATGVAL-RMMFQNAPGANPGGRLPSTW--------PATVAQIPPMENYSMDGRTYRY 601
Query: 619 YEGKEVF 625
+ + F
Sbjct: 602 FMATQCF 608
>gi|46127231|ref|XP_388169.1| hypothetical protein FG07993.1 [Gibberella zeae PH-1]
Length = 712
Score = 347 bits (889), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 260/784 (33%), Positives = 376/784 (47%), Gaps = 99/784 (12%)
Query: 1 MIFHKLSLVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLV 60
M+ HKL + L L L QP + +P + C TT ++RA LV
Sbjct: 1 MMLHKLQVAASALALLNGLALAAAPYQQPDCTKEP----LRSNGICDTTASPAERAAALV 56
Query: 61 SRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV-GKGIFFNGTIRGATSFPQV 119
S LT EK++ LV++A PR+G+P Y WW+EALHGVAG G ATSFP
Sbjct: 57 SALTPREKVNNLVSNATGAPRIGLPRYNWWNEALHGVAGAPGNDYNDKPPYDSATSFPMP 116
Query: 120 ILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGED 179
+L ++FD L + IG+ IG EARA N G G+ +W PN+N F+DPRWGRG ETPGED
Sbjct: 117 LLMGSTFDDDLIHDIGEVIGTEARAWNNGGWGGGVDYWTPNVNPFKDPRWGRGSETPGED 176
Query: 180 PLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTM 239
L +YA +A C + DA+V
Sbjct: 177 ALHVSRYA---------------------RAMECTR------------------DAKV-- 195
Query: 240 QDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGF---HGYIT 296
IMC+YN VNGIP+CA+ L R+ W + + +IT
Sbjct: 196 --------------------GSIMCSYNAVNGIPACANSYLQETLLRKHWNWTHTNNWIT 235
Query: 297 SDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRA 356
SDC A+ I+ Y K+ +A + G D +C K + +Q L E +DRA
Sbjct: 236 SDCGAMQDIWQHHNYTKTGAEAAKAAFENGQDSSCEYTTTKDISDSYEQGLLTEKVMDRA 295
Query: 357 LHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPL 416
L LF + G F+G+ + + + D V + Q LALQ+A G VLLKN + LPL
Sbjct: 296 LKRLFEGLVHTGFFDGDKS--EWSSLDFDDVNTRHAQDLALQSAVRGAVLLKNDN-TLPL 352
Query: 417 PKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENT--VYYPGCDTVACS 474
K S+ALIG A+ L G Y+GP+ TP A + NT + P +
Sbjct: 353 NIKKKESVALIGFWADDKTKLQGGYSGPAPHVRTPAYAAKMLGLNTNVAWGPTLQNSSVP 412
Query: 475 SASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVIL 534
A++ AK +D++V + GLD T EE DR DL P Q L+ +++ KP+++
Sbjct: 413 DNWTTNALEAAKKSDYIVYLGGLDATAAGEERDRTDLDWPSTQLTLLKKLSNLG-KPLVV 471
Query: 535 VLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI 594
V L G VD T ++ + SILW YPG+ G A+ E+I G P GRLP+T YP Y
Sbjct: 472 VQL-GDQVDDTPLLKNKGVNSILWVNYPGQEGGTAVMELITGRKGPAGRLPLTQYPSKYT 530
Query: 595 -KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQS 653
+V M +M++RP T +PGRTYR+Y V PFG G Y+ + FK+ K+ +N
Sbjct: 531 EQVGMLEMELRP--TKSSPGRTYRWYS-DSVLPFGFGKHYTTFKAMFKS---QKIEMNIQ 584
Query: 654 SSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNG 713
K+++ D + + P + + VKN G L+F++
Sbjct: 585 ---KILKGCDATYVDTCPLPP----------IHLSVKNTGRTTSDFVSLVFIQGKVGPKP 631
Query: 714 RPIKQLVGFQSVILNAKEKAEIVFELS-PCESLSRAREDGLMVIEEGTHFLVVGDEEYPI 772
P+K L + S + K +A EL ++++R ++G +V+ GT+ L++ DE +
Sbjct: 632 YPLKTLAAY-SRSHDIKPRATKDVELQWTMDNIARREKNGDLVVYPGTYTLLL-DEPTQV 689
Query: 773 SIFV 776
I V
Sbjct: 690 KIKV 693
>gi|409385818|ref|ZP_11238358.1| Beta-glucosidase [Lactococcus raffinolactis 4877]
gi|399206850|emb|CCK19273.1| Beta-glucosidase [Lactococcus raffinolactis 4877]
Length = 695
Score = 345 bits (885), Expect = 5e-92, Method: Compositional matrix adjust.
Identities = 222/632 (35%), Positives = 341/632 (53%), Gaps = 65/632 (10%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
+ A +VS++TL EKISQ+ A AI RL IP Y +W+E LHGVA G A
Sbjct: 10 EEAIKIVSQMTLAEKISQIDFDASAIERLNIPHYNYWNEGLHGVARAGV----------A 59
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTFWAPNINIFRD 166
T FPQ I AA+FD+ L I + I +E RA YNA G+TFW+PNIN+FRD
Sbjct: 60 TVFPQAIGLAATFDTELVKHIAEVISIEGRAKYNAYTKHGDRDIYKGLTFWSPNINLFRD 119
Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
PRWGRGQET GEDP +T + V++++G+QG+ GK L+ +AC KHF + +
Sbjct: 120 PRWGRGQETYGEDPFLTAQIGVAFIKGLQGE----GKY---LRLAACTKHFAVH---SGP 169
Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
R+ FDA V +DL + Y P F++ +++ M AYN +NG P+C + L++KT
Sbjct: 170 EADRHYFDAVVNPKDLNEFYLPQFKAAIEEADVESFMGAYNAINGQPACVNEELIAKTLL 229
Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQK 346
+WGF G++ SD A+ +++ Y ++ + + +K G ++ C + AV +
Sbjct: 230 GKWGFEGHVVSDYAALEDVHENHHYTQTAAETMALAMKIGTNL-CAGKISDALFEAVGKG 288
Query: 347 KLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVL 406
+ E+EI ++ L++ +RLG+F + + I +V S H++L+L+AA+ +VL
Sbjct: 289 LVTETEITASVVKLYTTHVRLGMFAED---NDYDTIPYEVNASAEHEMLSLKAAEKSMVL 345
Query: 407 LKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN---TV 463
LKN + LPL +S+ S+A+IGP A + L GNYAG + T + +Q + N
Sbjct: 346 LKNDN-FLPLSQSEIKSVAVIGPTARNIGALEGNYAGTANHYETFVSGIQQALSNQARVT 404
Query: 464 YYPGCDTVACSSASI--------DKAVDIAKGADHVVLMMGLDQTQEKEE---------L 506
Y GC A + S +A+ A+ AD VL +GLD T E E+
Sbjct: 405 YALGCHLYADHAESSLSRANERESEAIIAAEHADIAVLCVGLDPTIEGEQGDAGNVYGSG 464
Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
D+ L LPG+Q+ LI +V E K VILVL G + + + + +I+ A YPG G
Sbjct: 465 DKPSLSLPGQQKRLIEKVLETG-KTVILVLTSGSALSLEGLEKHTGVKAIIQAWYPGAHG 523
Query: 567 AVALAEVIFGDHNPGGRLPMTWYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFYEGKEVF 625
ALA ++ G +P G+LP+T + +D +P +D M RTY+ + + ++
Sbjct: 524 GTALANILLGKVSPSGKLPVT-FCKDTQGLPDFSDYSMAE--------RTYQNTQLEVLY 574
Query: 626 PFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTK 657
PFG GL+Y + K + + L L+ ++ K
Sbjct: 575 PFGYGLTYGH--AEIKTLQLDDLTLSVTAENK 604
>gi|374313710|ref|YP_005060140.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
gi|358755720|gb|AEU39110.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
Length = 883
Score = 345 bits (884), Expect = 7e-92, Method: Compositional matrix adjust.
Identities = 187/419 (44%), Positives = 254/419 (60%), Gaps = 31/419 (7%)
Query: 32 SCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWS 91
S P+ T P+ TTLP QRA DLV RLTLDEK +QLV SAP IPRLG+PAY++WS
Sbjct: 24 SPSPAGTRTPLLPYQDTTLPAEQRAADLVGRLTLDEKAAQLVTSAPGIPRLGVPAYDFWS 83
Query: 92 EALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA 151
E LHG+A G AT FPQ + AA+FD L ++IG+ I EARA YN A
Sbjct: 84 EGLHGIARSGY----------ATLFPQAVGMAATFDEPLLHQIGEVISTEARAKYNDAVA 133
Query: 152 -------IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKL 204
G+T W+PNINIFRDPRWGRGQET GEDP +T + ++V G+QGD N
Sbjct: 134 HDLRSIFYGLTIWSPNINIFRDPRWGRGQETYGEDPFLTARLGTAFVEGLQGDDPN---- 189
Query: 205 KGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMC 264
+A KHF + + + R++F+A + DL DTY P F + + +G+A IMC
Sbjct: 190 --YYRAIGTPKHFAVH---SGPESERHRFNADPSPHDLWDTYLPAFRATIVEGKAGSIMC 244
Query: 265 AYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEG--YAKSPEDAVVDV 322
AYN + G P+CA LL + R+ W F G++TSDC A+ ++ +G Y+K E A VD
Sbjct: 245 AYNAIEGKPACASDLLLDEVLRKDWAFKGFVTSDCGAIDNFFEKDGHHYSKDAEQASVDG 304
Query: 323 LKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKI 382
++AG D NCG ++ +AV++ + ESE+D L LF R +LGLF+ P+ + +
Sbjct: 305 IRAGTDTNCGG-TYRNLASAVRKGMIQESELDVPLRRLFLARFKLGLFD-PPSQVKYASM 362
Query: 383 GADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNY 441
S +H LALQAA++ +VLLKN H LPL ++ ++A+IGPNA+S +L GNY
Sbjct: 363 PITENMSSSHTELALQAAREAVVLLKNEHHTLPL-DARVKTIAVIGPNASSLISLEGNY 420
Score = 137 bits (345), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 99/297 (33%), Positives = 147/297 (49%), Gaps = 52/297 (17%)
Query: 480 KAVDIAKGADHVVLMMGLDQTQEKEELD----------RVDLVLPGRQQELITRVAEAAK 529
+A++ K AD VV +GL E EE+D R DLVLP QQ+L+ A+A+
Sbjct: 605 QAMEAVKQADAVVAFVGLSPELEGEEMDVHIPGFSGGDRTDLVLPAAQQQLL-EAAKASG 663
Query: 530 KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWY 589
KP+++VLL G + + +A+ + +IL A YPG+AGA A+AE + G +NP GRLP+T+Y
Sbjct: 664 KPLVVVLLNGSALAVNWAQ--EHADAILEAWYPGQAGAQAIAETLSGKNNPSGRLPVTFY 721
Query: 590 PQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLY 649
P TD M RTYR+++GK ++ FG GLSYS +SY +S+ +L
Sbjct: 722 RSVNDLPPFTDYAM--------ANRTYRYFKGKPLYEFGYGLSYSTFSYSNAHLSKERL- 772
Query: 650 LNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPAR 709
+ T VE VKN +AG L++ P +
Sbjct: 773 --DAGDTLRVEAD--------------------------VKNTSTLAGDEVAELYLTPPQ 804
Query: 710 RGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
G P++ L GF+ V L + + F L P + LS E G+ + G + + VG
Sbjct: 805 NGV-YPLRSLEGFEHVHLLPGQSKHVSFTLDPRQ-LSEVDEKGIRAVRAGVYSVTVG 859
>gi|390630430|ref|ZP_10258413.1| Beta-xylosidase B [Weissella confusa LBAE C39-2]
gi|390484359|emb|CCF30761.1| Beta-xylosidase B [Weissella confusa LBAE C39-2]
Length = 674
Score = 344 bits (883), Expect = 8e-92, Method: Compositional matrix adjust.
Identities = 235/689 (34%), Positives = 346/689 (50%), Gaps = 94/689 (13%)
Query: 82 LGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLE 141
+ IP Y +W+EALHGVA G AT FPQ I AA+FD +L I IG E
Sbjct: 1 MNIPEYNYWNEALHGVARAGV----------ATVFPQAIGLAATFDDHLINEIADVIGTE 50
Query: 142 ARALYNAGQA-------IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGV 194
RA YN G+TFW+PN+NIFRDPRWGRG ET GEDP +T K+ V++++G+
Sbjct: 51 GRAKYNEFTKHDDRDIYKGLTFWSPNVNIFRDPRWGRGHETYGEDPFLTSKFGVAFIKGL 110
Query: 195 QGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCV 254
QG L+ +A KHF + + R+ FDA V+ +DL +TY P F++ V
Sbjct: 111 QGQA-------KYLKLAATAKHFAVH---SGPEGLRHGFDAVVSDKDLYETYLPAFKAAV 160
Query: 255 KQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKS 314
++ IM AYN V+G+P+ LL +W F G++ SD A +++ Y K
Sbjct: 161 EEADVESIMTAYNAVDGVPASVSEMLLKDILHDKWSFEGHVVSDYMAPEDVHENHKYTKD 220
Query: 315 PEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNP 374
+ + +KAG+++ G +++ A+ + + E EI A+ +L++ R+RLG+F
Sbjct: 221 AAETMGLAIKAGLNLVAGH-IEQSLHEALDRGLVTEEEITNAVISLYATRVRLGMF---A 276
Query: 375 TMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSA 434
T + I + + AH L+ AA+ VLLKN G+LPL K ++A++GPNA+S
Sbjct: 277 TDNEYDAIPYEANDTKAHNNLSEIAAEKSFVLLKND-GVLPLRKETMEAIAVVGPNAHSE 335
Query: 435 KTLLGNYAGPSCRSITPLQALQNYVENTV---YYPGC----DTVACSSASID----KAVD 483
LLGNY G RS T L+ +Q + + V Y G D A A D +AV
Sbjct: 336 IALLGNYFGTPSRSYTILEGIQERLGDDVRVHYSIGSGLFQDHAAEPLAKADERESEAVI 395
Query: 484 IAKGADHVVLMMGLDQTQEKEE---------LDRVDLVLPGRQQELITRVAEAAKKPVIL 534
A+ +D VV ++GLD T E EE D+ +L LPGRQ++L+ R+ A KPV++
Sbjct: 396 AAEHSDVVVAVLGLDSTIEGEEGDAGNSQGAGDKPNLSLPGRQRQLLERLL-AVGKPVVV 454
Query: 535 VLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI 594
+L G + + + N+ +I+ YPG G +A+A+V+FG +P G+LP+T+Y
Sbjct: 455 LLASGSSLQLDGLENHPNLRAIMQIWYPGARGGLAVADVLFGAVSPSGKLPVTFYKNVDN 514
Query: 595 KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSS 654
D M GRTYR+ + ++PFG GL+YS
Sbjct: 515 LPAFEDYNM--------AGRTYRYMTDEALYPFGYGLTYSS------------------- 547
Query: 655 STKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGR 714
VE D + KS + T VT ++N G V ++VK
Sbjct: 548 ----VELSD-LQVKSYEDTAT---------VTATIQNTGNFDTDEVVQVYVKDLGSEFAV 593
Query: 715 PIKQLVGFQSVILNAKEKAEIVFELSPCE 743
P QL GF+ V L K I F+L P +
Sbjct: 594 PNAQLKGFKRVYLGKGAKQTITFDLRPQD 622
>gi|256394020|ref|YP_003115584.1| glycoside hydrolase family 3 [Catenulispora acidiphila DSM 44928]
gi|256360246|gb|ACU73743.1| glycoside hydrolase family 3 domain protein [Catenulispora
acidiphila DSM 44928]
Length = 1212
Score = 344 bits (883), Expect = 9e-92, Method: Compositional matrix adjust.
Identities = 249/781 (31%), Positives = 376/781 (48%), Gaps = 103/781 (13%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQL-VNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
+ +T ++RA DLVSR+TL EK++QL NSAPAIPRLG+ +Y +WSE HG+ +G
Sbjct: 45 YLQTRYSFAERAADLVSRMTLPEKVAQLHTNSAPAIPRLGVQSYTYWSEGQHGINLLGAD 104
Query: 104 IFFNGTIRG--ATSFPQVILTAASFDSYLWYRIGQAIGLEAR-----ALYNAGQA-IG-- 153
G G ATSFP + + S+D L Y+ AI E R +L+ Q IG
Sbjct: 105 SNNGGAAGGPHATSFPTNLSSTMSWDPALVYQETTAISDEVRGELDKSLWGVAQNNIGPS 164
Query: 154 ------MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGK 207
+T+WAP +N+ RDPRWGR E GEDP + GK A ++V G QG+T +G
Sbjct: 165 ADDYGSLTYWAPTVNMDRDPRWGRTDEAFGEDPYLVGKMAGAFVAGYQGETIDGTPTSPY 224
Query: 208 LQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYN 267
L+ +A KHF + +N R+ A + D+ D Y F S V+ +G+M +YN
Sbjct: 225 LKVAATAKHFALNNNEN----DRHADSADASESDIRDYYTAQFRSLVEDSHVAGLMTSYN 280
Query: 268 RVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDV----- 322
+NG PS AD A+R WGF GYITSDC AV + + + +P V V
Sbjct: 281 AINGTPSPADTYTTDALAQRTWGFDGYITSDCGAVGDVTASSSHDWAPPGWTVSVVNGTS 340
Query: 323 ---------------------LKAGMDVNC--GSFLQKHTKAAVKQKKLPESEIDRALHN 359
L+AG D NC G + +AA+K + L E ID AL
Sbjct: 341 TWTNTATGVQVPADAGGQAYALRAGTDANCTGGDATLGNIEAAIKAEILSEGVIDHALVQ 400
Query: 360 LFSVRMRLGLFNGNPTMQ-PFGKIGADVVCSPAHQVLALQAAQDGIVLLKN------SHG 412
LF+VRM+ G F +P + + +I + SP HQ LA + A + +VLLKN +
Sbjct: 401 LFTVRMQTGEF--DPANKVAYTRITKAQIQSPEHQALAEKVAANSLVLLKNDPMPGSAAK 458
Query: 413 LLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE-----NTVYYPG 467
+LP + ++ ++G AN+ LG Y+G + +Q + + V+ TV +
Sbjct: 459 VLPANPASLNNVVVVGDLANTVT--LGGYSGDPTLQVNAVQGITSAVKAANPNATVTFDA 516
Query: 468 CDTVACSSASIDKAVDIA---KGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRV 524
C T ++A+ + K AD VV+ +G D + E DR L +PG LI++V
Sbjct: 517 CGTSTTATAAASCSAATQAAIKTADLVVVFVGTDGSTAGESNDRASLAMPGNYDSLISQV 576
Query: 525 AEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRL 584
A +L + GPVDI K D +I+++ Y GE+ ALA+V+FG NP G L
Sbjct: 577 AALGNPRTVLSMQTDGPVDIENVKGD--FPAIVYSAYNGESQGTALADVLFGKQNPSGHL 634
Query: 585 PMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVS 644
TWY D + + + P A +G GRTY+++ G +PFG GLSY+ ++Y
Sbjct: 635 DFTWYKDDSQLPSIKNYGLNP-ADTGGLGRTYQYFTGTPTYPFGYGLSYTDFAY------ 687
Query: 645 QNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLF 704
S + ++ D K V V N G+ G L+
Sbjct: 688 ---------SKVQATDHADA---------------QGKATVRFDVTNTGKTPGATVAQLY 723
Query: 705 VK-PARRGNGRPIKQLVGF-QSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHF 762
+ P+ G +P +QL GF ++ +L + + ++ + + ++ + +GT+
Sbjct: 724 ITPPSVPGTQQPAEQLEGFAKTAVLKPGQTQHLSVSVNIADLATWDAQNAKNAVTDGTYT 783
Query: 763 L 763
L
Sbjct: 784 L 784
>gi|340369765|ref|XP_003383418.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
queenslandica]
Length = 748
Score = 343 bits (881), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 243/760 (31%), Positives = 364/760 (47%), Gaps = 89/760 (11%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQLVNS-------APAIPRLGIPAYEWWSEALH 95
FPF +LPI +R +D+V +L+LD+ + Q+ + AP IP+ I Y+W +E L
Sbjct: 27 FPFRDPSLPIEERVKDIVDQLSLDQLVEQMAHGGAGSNGPAPGIPKFNIKPYQWGTECLS 86
Query: 96 GVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA-----LYNAGQ 150
G G ATSFP I AASF+ L ++ A E RA + N
Sbjct: 87 GDVNAGD----------ATSFPMSIGMAASFNYDLLKQVSNATAYEVRAKNTAAVLNGSY 136
Query: 151 AI--GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKL 208
A G++ W+P +NI RDPRWGR QET GEDP ++G ++V G+QGD +
Sbjct: 137 AFHTGLSCWSPVLNIMRDPRWGRNQETYGEDPYLSGYLGQAFVTGLQGDD------PTYV 190
Query: 209 QASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
A+A CKHF + R FDA VTM D T+ P F++CV+ G S +MC+YNR
Sbjct: 191 IANAGCKHFDVHGGPEDTPLPRASFDANVTMIDWRMTFLPQFKACVEAGALS-LMCSYNR 249
Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYA------KSPEDAVVDV 322
+NG+P+CA++ LL+ R +W F GY+ SD A+ I YA +
Sbjct: 250 INGVPACANKKLLTDILRNEWNFKGYVVSDQGALENIVTQHHYAPDFVTAAADAANAGTC 309
Query: 323 LKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKI 382
L+ G G + + AV++ + + A+ LF VR +LG F+ P+ I
Sbjct: 310 LEDGNSEGKGGNVFDNLDDAVEKGLVSVDTLKDAVSRLFYVRTKLGEFDPPDNNNPYANI 369
Query: 383 GADVVCSPAHQVLALQAAQDGIVLLKNSHG---LLPLPKSKSVSLALIGPNANSAKTLLG 439
++ S H L++QAA + IVL+KN + LPL ++GP +A T+ G
Sbjct: 370 PLSIIQSDEHIKLSIQAAMETIVLMKNDNDGSPFLPLAADDFKKACVVGPFIENADTMFG 429
Query: 440 NYAGPSCRS---ITPLQALQNY---VENTVYYPGC-DTVACSSASIDKAVDIAKGADHVV 492
+Y+ P+ + +TPL ++ + Y GC D AC K +G D V+
Sbjct: 430 DYS-PTMMTDYIVTPLAGIKTTQIGSDLLNYEDGCTDGPACEIYDGYKVRTACEGVDLVI 488
Query: 493 LMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAA-KKPVILVLLCGGPVDITFAKYDR 551
+ GL + E E D D+ LPG Q L+T A+ P+IL+L P+DI++AK +
Sbjct: 489 VTAGLSRYLEHEGHDISDIYLPGHQMSLLTDAESASGSAPIILLLFNANPLDISYAKSNP 548
Query: 552 NIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVP-MTDMKMRPQATSG 610
+IL A YPG+ VA+A V+ G +NP GRLP TW P +VP M D M+
Sbjct: 549 RFAAILEAYYPGQEAGVAIANVLTGSYNPAGRLPNTW-PASLDQVPDMIDYTMKE----- 602
Query: 611 NPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSV 670
RTYR++ + ++PFG GLS++ ++Y LN +S+ +
Sbjct: 603 ---RTYRYFTQEPLYPFGYGLSFTTFNYS---------DLNVASTANTNGEGSIA----- 645
Query: 671 PELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAK 730
V++ V N G M G +VK P QLVG ++
Sbjct: 646 --------------VSVTVMNTGTMDGDEVTQAYVKWDNVAEA-PNIQLVGVSRKFISKG 690
Query: 731 EKAEIVFELSPCE-SLSRAREDGLMVIEEGTHFLVVGDEE 769
+ + F + P + + +DG I GT+ L VG ++
Sbjct: 691 QSITVSFTIKPEQLQVWINGDDGKWSIPGGTYSLFVGGQQ 730
>gi|389736853|ref|ZP_10190363.1| glucan 1,4-beta-glucosidase [Rhodanobacter sp. 115]
gi|388438821|gb|EIL95541.1| glucan 1,4-beta-glucosidase [Rhodanobacter sp. 115]
Length = 868
Score = 342 bits (878), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 192/413 (46%), Positives = 252/413 (61%), Gaps = 38/413 (9%)
Query: 55 RARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGAT 114
RA LV+++TL EK++Q+ N APAIPRLG+PAY+WWSE LHG+A G AT
Sbjct: 32 RAVALVAKMTLPEKVAQMQNDAPAIPRLGVPAYDWWSEGLHGIARNGY----------AT 81
Query: 115 SFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTFWAPNINIFRDP 167
FPQ I AAS+D+ L + +G I EARA +NA G G+T W+PNINIFRDP
Sbjct: 82 VFPQAIGLAASWDTSLLHAVGTVISTEARAKFNASGSGRAHGLFQGLTLWSPNINIFRDP 141
Query: 168 RWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKG 227
RWGRGQET GEDP +TG+ AV++VRG+QGD +A A KHF A+ +
Sbjct: 142 RWGRGQETYGEDPYLTGQLAVAFVRGIQGDDPQ------HPRAIATPKHFVAH---SGPE 192
Query: 228 TTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARR 287
R FD V+ DL DTY P F + V G A +MCAYN ++G P+CA+ LL R+
Sbjct: 193 AGRDSFDVDVSPHDLEDTYLPAFRTAVVDGHAGSVMCAYNALHGTPACANAGLLDTRLRK 252
Query: 288 QWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA----AV 343
WGF GY+ SDCDAV I + A V ++AG D++CG HT A AV
Sbjct: 253 DWGFAGYVVSDCDAVGDIASYHYFKPDDVQASVAAVQAGTDLDCG-----HTYASLAQAV 307
Query: 344 KQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDG 403
+Q + ES +D +L LF+ R RLG G+ P+ +IGAD + SPAH+ LALQAA +
Sbjct: 308 RQGDIAESALDASLVRLFTARYRLGEL-GSRGNDPYARIGADQIDSPAHRKLALQAALES 366
Query: 404 IVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQ 456
+VLLKN+H LPL + LA+IGP+A++ +TL NY G + +TPLQ L+
Sbjct: 367 LVLLKNAHSTLPL--HAGMRLAVIGPDADALETLEANYHGTARHPVTPLQGLR 417
Score = 128 bits (322), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 94/291 (32%), Positives = 145/291 (49%), Gaps = 56/291 (19%)
Query: 488 ADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLL 537
AD VV +GL E E+L DR D+ LP Q+ L+ R A A+ KP+I+VLL
Sbjct: 598 ADAVVAFIGLSPDVEGEQLRIDVPGFDGGDRTDIGLPAPQRALLER-ARASGKPLIVVLL 656
Query: 538 CGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVP 597
G V + +A+ ++ +IL A YPG+AG A+A+V+ GD+NPGGRLP+T+Y P
Sbjct: 657 SGSAVALDWAQ--QHADAILAAWYPGQAGGTAIAQVLAGDYNPGGRLPVTFYRSTRDLPP 714
Query: 598 MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTK 657
M+ GRTYR+++G+ ++PFG GLSY++++Y +S L ++ T
Sbjct: 715 YVSYAMQ--------GRTYRYFDGRPLYPFGYGLSYTRFTYAAPTLSAATL---KAGGTL 763
Query: 658 MVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVK--PARRGNGRP 715
V + V+N G+ AG V +++ P+ P
Sbjct: 764 QVSAE--------------------------VRNAGQRAGDEVVQVYLDTPPSPLA---P 794
Query: 716 IKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
LVGF+ + L A E+ + F L+P LS G +E G + + +G
Sbjct: 795 RHALVGFRRIHLAAGEQRLVRFTLAP-RQLSSVDAAGARAVEPGQYRVFIG 844
>gi|224068504|ref|XP_002302759.1| predicted protein [Populus trichocarpa]
gi|222844485|gb|EEE82032.1| predicted protein [Populus trichocarpa]
Length = 273
Score = 342 bits (878), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 155/255 (60%), Positives = 194/255 (76%), Gaps = 7/255 (2%)
Query: 30 PFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEW 89
PF+CDP + ++ FPFC+ LPI R DL+ R+TL EK+ LVN A A+PRLGI YEW
Sbjct: 26 PFACDPEDGTSRNFPFCQVKLPIQSRVSDLIGRMTLQEKVGLLVNDAAAVPRLGIKGYEW 85
Query: 90 WSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG 149
WSEALHGV+ VG G F G GATSFPQVI TAASF++ LW IG+ + EARA++N G
Sbjct: 86 WSEALHGVSNVGPGTQFGGAFPGATSFPQVITTAASFNATLWEAIGRVVSDEARAMFNGG 145
Query: 150 QAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQ 209
A G+T+W+PN+NIFRDPRWGRGQETPGEDP+V GKYA SYVRG+QG+ +G +LK
Sbjct: 146 VA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVVAGKYAASYVRGLQGN--DGDRLK---- 198
Query: 210 ASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRV 269
+ACCKHFTAYDLDNW G R+ F+A+V+ QD+ DT+ PF CVK+G+ + +MC+YN+V
Sbjct: 199 VAACCKHFTAYDLDNWNGVDRFHFNAQVSKQDMEDTFDVPFRMCVKEGKVASVMCSYNQV 258
Query: 270 NGIPSCADRNLLSKT 284
NGIP+CAD LL KT
Sbjct: 259 NGIPTCADPKLLKKT 273
>gi|90021134|ref|YP_526961.1| Beta-glucosidase [Saccharophagus degradans 2-40]
gi|89950734|gb|ABD80749.1| b-xylosidase-like protein [Saccharophagus degradans 2-40]
Length = 893
Score = 341 bits (875), Expect = 8e-91, Method: Compositional matrix adjust.
Identities = 194/468 (41%), Positives = 270/468 (57%), Gaps = 37/468 (7%)
Query: 21 LTRVDSTQPPFSCDPSN--PSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPA 78
LT + P + P + ST T+PF +L + R DLVSRLT EKI+Q+ N PA
Sbjct: 19 LTACSKSAAPNANQPQDTAASTATYPFRDASLSVDARVDDLVSRLTTTEKIAQMFNDTPA 78
Query: 79 IPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAI 138
I RLGIPAY WW+E+LHGVA GK AT +PQ I A++FD L R+ +I
Sbjct: 79 IERLGIPAYNWWNESLHGVARAGK----------ATVYPQAIGLASTFDEDLMLRVATSI 128
Query: 139 GLEARALYNAGQAI-------GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYV 191
E RA Y+ + G+TFW+PNINIFRDPRWGRGQET GEDP +TG+ A+++V
Sbjct: 129 SDEGRAKYHDFLSKDVRTIYGGLTFWSPNINIFRDPRWGRGQETYGEDPFLTGRMAINFV 188
Query: 192 RGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFE 251
+G+QG+ N LK A A KH+ + + TR+ D T +DL +TY P F
Sbjct: 189 KGIQGENDNSDYLK----AVATIKHYAVH---SGPEKTRHSDDYHPTRKDLFETYLPAFR 241
Query: 252 SCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGY 311
+ + +MCAYNRV+G P+C + L+ + R GF+GY+ SDC A++ Y++ +
Sbjct: 242 MAIAETNVQSLMCAYNRVDGAPACGNNELMQEILRGDMGFNGYVVSDCGAIADFYESRSH 301
Query: 312 --AKSPEDAVVDVLKAGMDVNCG-SFLQKHTKA--AVKQKKLPESEIDRALHNLFSVRMR 366
SP +A +K+G D+NCG S +T A++Q + E ID A+ LF R++
Sbjct: 302 HVVDSPAEAAAWAVKSGTDLNCGDSHGNTYTNLHYALQQGLITEDYIDIAVKRLFKARIK 361
Query: 367 LGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLAL 426
LG+F+ + P+ +IG DVV SP H L +AA+ IVLLKN +G+LPL V +A+
Sbjct: 362 LGMFDEQDRV-PYSEIGMDVVGSPKHLALTQEAAEKSIVLLKN-NGVLPL--KAGVKVAV 417
Query: 427 IGPNANSAKTLLGNYAGPSCRSITPLQALQNYV--ENTVYYPGCDTVA 472
IGPNA L+GNY G + + PL+ + N V N Y PG +A
Sbjct: 418 IGPNAVDEDVLVGNYHGVPVKPVLPLEGIVNRVGEANVFYAPGSAQIA 465
Score = 99.4 bits (246), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 80/298 (26%), Positives = 131/298 (43%), Gaps = 52/298 (17%)
Query: 479 DKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAA 528
++A+ A+ AD ++ M G+D E EE+ DR + LP Q L+ ++ +A
Sbjct: 618 EEALAAARKADVIIFMGGIDAHLEGEEMPLELDGFTHGDRTHINLPKVQTNLLKQL-KAT 676
Query: 529 KKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTW 588
KPV++V G + + + + +IL A YPGEA ALA +++GD +P GRLP+T+
Sbjct: 677 GKPVVMVNFSGSAMALNWES--EKLDAILQAFYPGEATGTALANILWGDVSPSGRLPVTF 734
Query: 589 YPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKL 648
Y D M RTY+FY G+ ++ FG GL Y ++Y N L
Sbjct: 735 YKGVDDLPAFNDYHME--------NRTYKFYRGEPLYAFGHGLGYVDFAY-------NNL 779
Query: 649 YLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPA 708
+ ++ K++P + + V N G+M + +++
Sbjct: 780 VVANTAEAG----------KALP-------------IAVSVTNTGKMQAEDVAQVYISLL 816
Query: 709 RRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
PI+ L F+ L A E E+ F L P L+ ++G G + VG
Sbjct: 817 DAPANTPIRDLKAFKRTKLAAGESTELEFNL-PARVLTYIDDNGKTQTYTGRVEVTVG 873
>gi|289668505|ref|ZP_06489580.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. musacearum
NCPPB 4381]
Length = 902
Score = 340 bits (873), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 195/442 (44%), Positives = 261/442 (59%), Gaps = 32/442 (7%)
Query: 44 PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
P+ T QRA DLVSR+TL+EK +Q+ N+APAIPRLG+ AY+WW+EALHGVA G
Sbjct: 34 PYLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVAAYDWWNEALHGVARAG-- 91
Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTF 156
GAT FPQ I AA+FD L + + AI EARA ++ + G+TF
Sbjct: 92 --------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHERYQGLTF 143
Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNG-----GKLKGKLQAS 211
W+PNINIFRDPRWGRGQET GEDP +T + V++VRG+QG+ + G+ KL A+
Sbjct: 144 WSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVRGLQGEGGDAPKNAQGESYRKLDAT 203
Query: 212 ACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
A KHF + + R+ FDAR + +DL +TY P FE+ VK G+ +M AYNRV G
Sbjct: 204 A--KHFAVH---SGPEADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYG 258
Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC 331
+ A + LL R+QWGF GY+ SDC A+ I+ + E A +K G ++ C
Sbjct: 259 ESASASKFLLQDVLRQQWGFKGYVVSDCWAIVDIWKHHKIVATREQAAALAVKHGTELEC 318
Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
G AAV Q + E++ID +L L + RMRLG+F+ P P+ KI A V SPA
Sbjct: 319 GEEYST-LPAAVHQGLIEEAQIDTSLQTLMTARMRLGMFD-PPGQLPWSKIPASVNQSPA 376
Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITP 451
H LA + A++ +VLLKN GLLPL ++K +A+IGP A+ LLGNY G +T
Sbjct: 377 HDALARRTARESLVLLKND-GLLPLSRTKLKRIAVIGPTADDTMALLGNYYGTPAAPVTV 435
Query: 452 LQALQNYVENT--VYYPGCDTV 471
LQ ++ N +Y G D V
Sbjct: 436 LQGIRAAAPNAQVLYARGADLV 457
Score = 140 bits (352), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 99/312 (31%), Positives = 151/312 (48%), Gaps = 58/312 (18%)
Query: 452 LQALQNYVENTVYYPGCDTVACSSA--------SIDKAVDIAKGADHVVLMMGLDQTQEK 503
LQA + Y Y+ G A A + +A+D+A+ A+ VV + GL E
Sbjct: 592 LQAGRTYDLRLEYFEGERDAAVRLAWRQPGAKPPLQEALDVARSAEVVVFVGGLTGDVEG 651
Query: 504 EEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNI 553
EE+ DR DL LP Q+EL+ + +A KPV+ VL G + I +A+ +++
Sbjct: 652 EEMKVNYPGFAGGDRTDLRLPKPQRELLEAL-QATGKPVVAVLTAGSALAIDWAQ--QHV 708
Query: 554 GSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPG 613
+IL A YPG+ G A+A+ +FGD NPGGRLP+T+Y + D M G
Sbjct: 709 PAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYKESEALPAFDDYAMH--------G 760
Query: 614 RTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPEL 673
RTYR++ G ++PFG GLSY++++Y + +N + + S
Sbjct: 761 RTYRYFGGTPLYPFGHGLSYTQFAYSDLRLDRNTVAADGS-------------------- 800
Query: 674 GTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKA 733
F T+ VKN G+ AG L++ P R K+L GFQ V L+ E+
Sbjct: 801 ---------FTATVTVKNTGQRAGDEVAQLYLHPLTPQRERAGKELRGFQRVALHPGEQR 851
Query: 734 EIVFELSPCESL 745
E+ F ++ E+L
Sbjct: 852 ELRFPINAKEAL 863
>gi|289666226|ref|ZP_06487807.1| beta-glucosidase precursor [Xanthomonas campestris pv. vasculorum
NCPPB 702]
Length = 902
Score = 340 bits (872), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 195/442 (44%), Positives = 261/442 (59%), Gaps = 32/442 (7%)
Query: 44 PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
P+ T QRA DLVSR+TL+EK +Q+ N+APAIPRLG+ AY+WW+EALHGVA G
Sbjct: 34 PYLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVAAYDWWNEALHGVARAG-- 91
Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTF 156
GAT FPQ I AA+FD L + + AI EARA ++ + G+TF
Sbjct: 92 --------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHERYQGLTF 143
Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNG-----GKLKGKLQAS 211
W+PNINIFRDPRWGRGQET GEDP +T + V++VRG+QG+ + G+ KL A+
Sbjct: 144 WSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVRGLQGEGGDAPKNAQGESYRKLDAT 203
Query: 212 ACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
A KHF + + R+ FDAR + +DL +TY P FE+ VK G+ +M AYNRV G
Sbjct: 204 A--KHFAVH---SGPEADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYG 258
Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC 331
+ A + LL R+QWGF GY+ SDC A+ I+ + E A +K G ++ C
Sbjct: 259 ESASASKFLLQDLLRQQWGFKGYVVSDCWAIVDIWKHHKIVATREQAAALAVKHGTELEC 318
Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
G AAV Q + E++ID +L L + RMRLG+F+ P P+ KI A V SPA
Sbjct: 319 GEEYST-LPAAVHQGLIEEAQIDTSLQTLMTARMRLGMFD-PPGQLPWSKIPASVNQSPA 376
Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITP 451
H LA + A++ +VLLKN GLLPL ++K +A+IGP A+ LLGNY G +T
Sbjct: 377 HDALARRTARESLVLLKND-GLLPLSRTKLKRIAVIGPTADDTMALLGNYYGTPAAPVTV 435
Query: 452 LQALQNYVENT--VYYPGCDTV 471
LQ ++ N +Y G D V
Sbjct: 436 LQGIRAAAPNAQVLYARGADLV 457
Score = 140 bits (353), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 99/312 (31%), Positives = 151/312 (48%), Gaps = 58/312 (18%)
Query: 452 LQALQNYVENTVYYPGCDTVACSSA--------SIDKAVDIAKGADHVVLMMGLDQTQEK 503
LQA + Y Y+ G A A + +A+D+A+ A+ VV + GL E
Sbjct: 592 LQAGRTYDLRLEYFEGERDAAVRLAWRQPGAKPPLQEALDVARSAEVVVFVGGLTGDVEG 651
Query: 504 EEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNI 553
EE+ DR DL LP Q+EL+ + +A KPV+ VL G + I +A+ +++
Sbjct: 652 EEMKVNYPGFAGGDRTDLRLPKPQRELLEAL-QATGKPVVAVLTAGSALAIDWAQ--QHV 708
Query: 554 GSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPG 613
+IL A YPG+ G A+A+ +FGD NPGGRLP+T+Y + D M G
Sbjct: 709 PAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYKESEALPAFDDYAMH--------G 760
Query: 614 RTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPEL 673
RTYR++ G ++PFG GLSY++++Y + +N + + S
Sbjct: 761 RTYRYFGGTPLYPFGHGLSYTQFAYSDLRLDRNTVAADGS-------------------- 800
Query: 674 GTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKA 733
F T+ VKN G+ AG L++ P R K+L GFQ V L+ E+
Sbjct: 801 ---------FTATVTVKNTGQRAGDEVAQLYLHPLTPQRERAGKELRGFQRVALHPGEQR 851
Query: 734 EIVFELSPCESL 745
E+ F ++ E+L
Sbjct: 852 ELSFPINAKEAL 863
>gi|58584046|ref|YP_203062.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae KACC
10331]
gi|84625823|ref|YP_453195.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae MAFF
311018]
gi|58428640|gb|AAW77677.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae KACC
10331]
gi|84369763|dbj|BAE70921.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae MAFF
311018]
Length = 904
Score = 339 bits (870), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 193/442 (43%), Positives = 261/442 (59%), Gaps = 32/442 (7%)
Query: 44 PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
P+ +T QRA DLVSR+TL+EK +Q+ N+APAIPRLG+PAY+WW+EALHGVA G
Sbjct: 36 PYLQTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG-- 93
Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTF 156
GAT FPQ I AA+FD L + + AI EARA ++ + G+TF
Sbjct: 94 --------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHRFLRQHQHARYQGLTF 145
Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNG-----GKLKGKLQAS 211
W+PNINIFRDPRWGRGQET GEDP +T + V++V+G+QG+ + G+ KL A+
Sbjct: 146 WSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGSDAPKNAQGERYRKLDAT 205
Query: 212 ACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
A KHF + + R+ FDAR + +DL +TY P FE+ VK G+ +M AYNRV G
Sbjct: 206 A--KHFAVH---SGPEADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYG 260
Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC 331
+ A + LL R+QWGF GY+ SDC A+ ++ + E A + G ++ C
Sbjct: 261 ESASASKFLLQDVLRQQWGFKGYVVSDCWAIVDVWKHHKIVATREQAAALAVTHGTELEC 320
Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
G AAV Q + E++ID AL L + RMRLG+F+ P P+ KI A V SPA
Sbjct: 321 GEEYST-LPAAVHQGLIDEAQIDTALQTLMTARMRLGMFD-PPGQLPWSKIPASVNQSPA 378
Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITP 451
H LA + A++ +VLLKN GLLPL ++ +A+IGP A+ LLGNY G +T
Sbjct: 379 HDALARRTARESLVLLKND-GLLPLSRATLKRIAVIGPTADDTMALLGNYYGTPAAPVTV 437
Query: 452 LQALQNYVENT--VYYPGCDTV 471
LQ ++ N +Y G D V
Sbjct: 438 LQGIRAAAPNAQVLYARGADLV 459
Score = 139 bits (349), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 98/312 (31%), Positives = 154/312 (49%), Gaps = 58/312 (18%)
Query: 452 LQALQNYVENTVYYPGCDTVACSSA--------SIDKAVDIAKGADHVVLMMGLDQTQEK 503
LQA ++Y Y+ G A A + +A+D+A+ AD VV + GL E
Sbjct: 594 LQAGRSYDLRLDYFEGERDAAVRLAWRQPGAKPPLQEALDVARSADVVVFVGGLTGDVEG 653
Query: 504 EEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNI 553
EE+ DR DL LP Q+EL+ + +A KPV+ VL G + + +A+ +++
Sbjct: 654 EEMKVSYPGFAGGDRTDLRLPKPQRELLEAL-QATGKPVVAVLTAGSALAVDWAQ--QHV 710
Query: 554 GSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPG 613
+IL A YPG+ G A+A+ +FGD NPGGRLP+T+Y + D M G
Sbjct: 711 PAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYKESETLPAFDDYAMH--------G 762
Query: 614 RTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPEL 673
RTYR++ G ++PFG GLSY++++Y + L L++S+ T
Sbjct: 763 RTYRYFGGTPLYPFGHGLSYTQFAY-------SDLRLDRSTLTA---------------- 799
Query: 674 GTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKA 733
T+ VKN G+ AG V L++ P + R K+L GFQ + L ++
Sbjct: 800 ------DGALTATVAVKNTGQRAGDEVVQLYLHPLKPQRERAGKELRGFQRLALQPGQQR 853
Query: 734 EIVFELSPCESL 745
E+ F ++ ++L
Sbjct: 854 ELRFTINAKDAL 865
>gi|365118446|ref|ZP_09337032.1| hypothetical protein HMPREF1033_00378 [Tannerella sp.
6_1_58FAA_CT1]
gi|363649697|gb|EHL88801.1| hypothetical protein HMPREF1033_00378 [Tannerella sp.
6_1_58FAA_CT1]
Length = 1283
Score = 339 bits (870), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 241/747 (32%), Positives = 372/747 (49%), Gaps = 97/747 (12%)
Query: 39 STETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNS--APAIPRLGIPAYEWWSEALHG 96
S ET + +PI +R DL+ RLTL+EK+ QL +S + I RL IPA +E LHG
Sbjct: 66 SKETPKYLNPNIPIEERIDDLLPRLTLEEKVIQLSDSWGSKGIARLKIPAM-LKTEGLHG 124
Query: 97 VAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTF 156
G+T FP I ++FD+ L +G+A +EA+A A
Sbjct: 125 ----------QSYATGSTIFPHGINMGSTFDTELIQEVGKATAIEAKA------ANLRVS 168
Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKH 216
W+P +++ RD RWGR +ET GEDP + G+ V++++G QG+ AC KH
Sbjct: 169 WSPVLDVARDARWGRVEETYGEDPYLVGRIGVAWIKGFQGE-----------HMFACPKH 217
Query: 217 FTAYDLD-NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSC 275
F + + + Y RV + + + PF +K+ A G+M AY NG+P
Sbjct: 218 FAGHGQPVGGRDSHDYGLSDRV----MRNIHLAPFRDVIKEANAFGVMAAYGLWNGVPDN 273
Query: 276 ADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFL 335
+ LL K R +WGF G++ SDC I + + E+A ++AG+D+ CGS
Sbjct: 274 GSKELLQKILREEWGFEGFVVSDCSGPENIQRKQSVVGTMEEAAAMAVRAGVDIECGSAY 333
Query: 336 QKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVL 395
+K +AVK+ + ESE+D L +F +MRLGLF+ P+++ +P H+ L
Sbjct: 334 KKALASAVKKGIIKESELDANLRRVFRAKMRLGLFD-RPSIENMVWNKLPEYDTPEHRAL 392
Query: 396 ALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSC--RSITPLQ 453
A + A VLLKN + LLPL K+ ++A+IGPNA+ +T G+Y+ + I+ L+
Sbjct: 393 ARKVAVKSTVLLKNENNLLPLDKNIK-TIAVIGPNADQGQT--GDYSAKYAPGQIISVLE 449
Query: 454 ALQNYVE---NTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEK------- 503
++N+V +Y GC + + +AV+IAK AD V+L++G + + +
Sbjct: 450 GVKNHVSPSTKVLYAQGCTQLDMDTTGFAEAVNIAKQADAVILVVGDNSNRHENGNKKST 509
Query: 504 --EELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGY 561
E +D L +PG Q++LI + EA KPV+LVL+ G P +T+ D NI SIL Y
Sbjct: 510 TGENVDGATLEIPGVQRQLI-KAVEATGKPVVLVLVNGKPFTLTWE--DENIESILETWY 566
Query: 562 PGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEG 621
PGE G A A++IFGD NP GRLP++ +P+ ++P+ GR Y +Y+
Sbjct: 567 PGEEGGNATADIIFGDENPSGRLPIS-FPRHPGQLPL-------WYNYETSGRNYDYYDM 618
Query: 622 --KEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCE 679
++ FG GLSY+ + Y N ++TK + P T
Sbjct: 619 PFTPLYRFGHGLSYTTFRYS-----------NLKATTKSGD----------PGFVT---- 653
Query: 680 TRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFEL 739
V++ ++N G+ G+ L++ + L GF+ V L EK + FEL
Sbjct: 654 -----VSVDIENTGKRPGEEVAQLYITDLVASVNTAVIDLKGFKRVFLKPGEKKTVTFEL 708
Query: 740 SPCESLSRAREDGLMVIEEGTHFLVVG 766
+P LS D V+E G + VG
Sbjct: 709 NPY-LLSLLNPDMKRVLEAGKFRMHVG 734
>gi|188574621|ref|YP_001911550.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae PXO99A]
gi|188519073|gb|ACD57018.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae PXO99A]
Length = 904
Score = 339 bits (869), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 193/442 (43%), Positives = 261/442 (59%), Gaps = 32/442 (7%)
Query: 44 PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
P+ +T QRA DLVSR+TL+EK +Q+ N+APAIPRLG+PAY+WW+EALHGVA G
Sbjct: 36 PYLQTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG-- 93
Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTF 156
GAT FPQ I AA+FD L + + AI EARA ++ + G+TF
Sbjct: 94 --------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHRFLRQHQHARYQGLTF 145
Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNG-----GKLKGKLQAS 211
W+PNINIFRDPRWGRGQET GEDP +T + V++V+G+QG+ + G+ KL A+
Sbjct: 146 WSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGSDAPKNAQGERYRKLDAT 205
Query: 212 ACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
A KHF + + R+ FDAR + +DL +TY P FE+ VK G+ +M AYNRV G
Sbjct: 206 A--KHFAVH---SGPEADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYG 260
Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC 331
+ A + LL R+QWGF GY+ SDC A+ ++ + E A + G ++ C
Sbjct: 261 ESASASKFLLQDVLRQQWGFKGYVVSDCWAIVDVWKHHKIVATREQAAALAVTHGTELEC 320
Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
G AAV Q + E++ID AL L + RMRLG+F+ P P+ KI A V SPA
Sbjct: 321 GEEYST-LPAAVHQGLIDEAQIDTALQTLMTARMRLGMFD-PPGQLPWSKIPASVNQSPA 378
Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITP 451
H LA + A++ +VLLKN GLLPL ++ +A+IGP A+ LLGNY G +T
Sbjct: 379 HDALARRTARESLVLLKND-GLLPLSRATLKRIAVIGPTADDTMALLGNYYGTPAAPVTV 437
Query: 452 LQALQNYVENT--VYYPGCDTV 471
LQ ++ N +Y G D V
Sbjct: 438 LQGIRAAAPNAQVLYARGADLV 459
Score = 139 bits (351), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 99/312 (31%), Positives = 154/312 (49%), Gaps = 58/312 (18%)
Query: 452 LQALQNYVENTVYYPGCDTVACSSA--------SIDKAVDIAKGADHVVLMMGLDQTQEK 503
LQA ++Y Y+ G A A + +A+D+A+ AD VV + GL E
Sbjct: 594 LQAGRSYDLRLDYFEGERDAAVRLAWRQPGAKPPLQEALDVARSADVVVFVGGLTGDVEG 653
Query: 504 EEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNI 553
EE+ DR DL LP Q+EL+ + +A KPV+ VL G + I +A+ +++
Sbjct: 654 EEMKVSYPGFAGGDRTDLRLPKPQRELLEAL-QATGKPVVAVLTAGSALAIDWAQ--QHV 710
Query: 554 GSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPG 613
+IL A YPG+ G A+A+ +FGD NPGGRLP+T+Y + D M G
Sbjct: 711 PAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYKESETLPAFDDYAMH--------G 762
Query: 614 RTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPEL 673
RTYR++ G ++PFG GLSY++++Y + L L++S+ T
Sbjct: 763 RTYRYFGGTPLYPFGHGLSYTQFAY-------SDLRLDRSTLTA---------------- 799
Query: 674 GTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKA 733
T+ VKN G+ AG V L++ P + R K+L GFQ + L ++
Sbjct: 800 ------DGALTATVAVKNTGQRAGDEVVQLYLHPLKPQRERAGKELRGFQRLALQPGQQR 853
Query: 734 EIVFELSPCESL 745
E+ F ++ ++L
Sbjct: 854 ELRFTINAKDAL 865
>gi|418518550|ref|ZP_13084692.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB1386]
gi|418522850|ref|ZP_13088880.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB2388]
gi|410700720|gb|EKQ59264.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB2388]
gi|410703176|gb|EKQ61671.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB1386]
Length = 901
Score = 339 bits (869), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 196/455 (43%), Positives = 262/455 (57%), Gaps = 32/455 (7%)
Query: 31 FSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWW 90
C T T P+ T RA DLVSR+TL+EK +Q+ N+APAIPRL +PAY+WW
Sbjct: 20 LGCATGAAQTATPPYLDTQRSFEARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWW 79
Query: 91 SEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-- 148
+EALHGVA G GAT FPQ I AA+FD L + + AI EARA ++
Sbjct: 80 NEALHGVARAG----------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFL 129
Query: 149 -----GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNG-- 201
+ G+TFW+PNINIFRDPRWGRGQET GEDP +T + V++V+G+QG+ +
Sbjct: 130 RQNQHARYQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGADAPK 189
Query: 202 ---GKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGR 258
G+ KL A+A KHF + + R+ FDAR + +DL +TY P FE+ VK G+
Sbjct: 190 NAQGERYRKLDATA--KHFAVH---SGPEADRHHFDARPSQRDLYETYLPAFEALVKDGK 244
Query: 259 ASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDA 318
+M AYNRV G + A + LL R QWGF GY+ SDC A+ I+ + E A
Sbjct: 245 VDAVMGAYNRVYGESASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVATREQA 304
Query: 319 VVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQP 378
+K G ++ CG AAV+Q + E++ID AL L + RMRLG+F+ P P
Sbjct: 305 AALAVKHGTELECGEEYAT-LPAAVRQGLIDEAQIDTALKTLMTARMRLGMFD-PPGQLP 362
Query: 379 FGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLL 438
+ I A V SPAH LA + A++ +VLLKN GLLPL ++K +A+IGP A+ LL
Sbjct: 363 WSTIPASVNQSPAHDALARRTARESLVLLKND-GLLPLSRAKLKRIAVIGPTADDTMALL 421
Query: 439 GNYAGPSCRSITPLQALQNYVENT--VYYPGCDTV 471
GNY G +T LQ ++ N +Y G D V
Sbjct: 422 GNYYGTPAAPVTVLQGIRAAAPNAQVLYARGADLV 456
Score = 137 bits (346), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 99/312 (31%), Positives = 151/312 (48%), Gaps = 58/312 (18%)
Query: 452 LQALQNYVENTVYYPGCDTVACSSA--------SIDKAVDIAKGADHVVLMMGLDQTQEK 503
LQA + Y Y+ G A A + +A+D+A+ AD VV + GL E
Sbjct: 591 LQAGRAYEVRLEYFEGERDAAVRLAWRQPGAKPPLQEALDVARSADVVVFVGGLTGDVEG 650
Query: 504 EEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNI 553
EE+ DR DL LP Q++L+ + +A KPV+ VL G + I +A+ +++
Sbjct: 651 EEMKVNYPGFAGGDRTDLRLPKPQRDLLEAL-QATGKPVVAVLTAGSALAIDWAQ--QHL 707
Query: 554 GSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPG 613
+IL A YPG+ G A+A+ +FGD NPGGRLP+T+Y + D MR G
Sbjct: 708 PAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYKESETLPAFDDYAMR--------G 759
Query: 614 RTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPEL 673
RTYR++ G ++PFG GLSY++++Y + L L++
Sbjct: 760 RTYRYFGGTPLYPFGHGLSYTQFAY-------SGLRLDR--------------------- 791
Query: 674 GTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKA 733
T T+ VKN G+ AG V L++ P R K+L GFQ + L E+
Sbjct: 792 -TTIATDGSLTATVTVKNTGQRAGDEVVQLYLHPLAPQRERAGKELHGFQRIALQPGEQR 850
Query: 734 EIVFELSPCESL 745
E+ F ++ ++L
Sbjct: 851 ELGFTINAKDAL 862
>gi|381170979|ref|ZP_09880130.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
citri pv. mangiferaeindicae LMG 941]
gi|380688543|emb|CCG36617.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
citri pv. mangiferaeindicae LMG 941]
Length = 901
Score = 338 bits (868), Expect = 5e-90, Method: Compositional matrix adjust.
Identities = 196/455 (43%), Positives = 262/455 (57%), Gaps = 32/455 (7%)
Query: 31 FSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWW 90
C T P+ T QRA DLVSR+TL+EK +Q+ N+APAIPRL +PAY+WW
Sbjct: 20 LGCATGAAQAATPPYLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWW 79
Query: 91 SEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-- 148
+EALHGVA G GAT FPQ I AA+FD L + + AI EARA ++
Sbjct: 80 NEALHGVARAG----------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFL 129
Query: 149 -----GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNG-- 201
+ G+TFW+PNINIFRDPRWGRGQET GEDP +T + V++V+G+QG+
Sbjct: 130 RQNQHARYQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGAAAPK 189
Query: 202 ---GKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGR 258
G+ KL A+A KHF + + R+ FDAR + +DL +TY P FE+ VK+G+
Sbjct: 190 NAQGEPYRKLDATA--KHFAVH---SGPEADRHHFDARPSQRDLYETYLPAFEALVKEGK 244
Query: 259 ASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDA 318
+M AYNRV G + A + LL R QWGF GY+ SDC A+ I+ + E A
Sbjct: 245 VDAVMGAYNRVYGESASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVATREQA 304
Query: 319 VVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQP 378
+K G ++ CG AAV+Q + E++ID AL L + RMRLG+F+ P P
Sbjct: 305 AALAVKHGTELECGEEYAT-LPAAVRQGLIDEAQIDTALKTLMTARMRLGMFD-PPGQLP 362
Query: 379 FGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLL 438
+ I A V SPAH LA + A++ +VLLKN GLLPL ++K +A+IGP A+ LL
Sbjct: 363 WSTIPASVNQSPAHDALARRTARESLVLLKND-GLLPLSRAKLKRIAVIGPTADDTMALL 421
Query: 439 GNYAGPSCRSITPLQALQNYVENT--VYYPGCDTV 471
GNY G +T LQ ++ N +Y G D V
Sbjct: 422 GNYYGTPAAPVTVLQGIRAAAPNAQVLYARGADLV 456
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 98/312 (31%), Positives = 151/312 (48%), Gaps = 58/312 (18%)
Query: 452 LQALQNYVENTVYYPGCDTVACSSA--------SIDKAVDIAKGADHVVLMMGLDQTQEK 503
LQA + Y Y+ G A A + +A+D+A+ AD VV + GL E
Sbjct: 591 LQAGRAYDVRLEYFEGERDAAVRLAWRQPGAKPPLQEALDVARSADVVVFVGGLTGDVEG 650
Query: 504 EEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNI 553
EE+ DR DL LP Q++L+ + +A +PV+ VL G + I +A+ +++
Sbjct: 651 EEMKVNYPGFAGGDRTDLRLPKPQRDLLEAL-QATGRPVVAVLTTGSALAIDWAQ--QHL 707
Query: 554 GSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPG 613
+IL A YPG+ G A+A+ +FGD NPGGRLP+T+Y + D MR G
Sbjct: 708 PAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYKESETLPAFDDYAMR--------G 759
Query: 614 RTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPEL 673
RTYR++ G ++PFG GLSY++++Y + L L++
Sbjct: 760 RTYRYFGGTPLYPFGHGLSYTQFAY-------SGLRLDR--------------------- 791
Query: 674 GTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKA 733
T T+ VKN G+ AG V L++ P R K+L GFQ + L E+
Sbjct: 792 -TTIATDGSLTATVTVKNTGQRAGDEVVQLYLHPLAPQRERAGKELHGFQRIALQPGEQR 850
Query: 734 EIVFELSPCESL 745
E+ F ++ ++L
Sbjct: 851 ELGFTINAKDAL 862
>gi|346726970|ref|YP_004853639.1| beta-glucosidase-related glycosidase [Xanthomonas axonopodis pv.
citrumelo F1]
gi|346651717|gb|AEO44341.1| Beta-glucosidase-related glycosidase [Xanthomonas axonopodis pv.
citrumelo F1]
Length = 902
Score = 338 bits (867), Expect = 7e-90, Method: Compositional matrix adjust.
Identities = 194/441 (43%), Positives = 260/441 (58%), Gaps = 32/441 (7%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
+ T QRA DLVSR+TL+EK +Q+ N+APAIPRLG+PAY+WW+EALHGVA G
Sbjct: 35 YLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 91
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTFW 157
GAT FPQ I AA+FD L + + AI EARA ++ + G+TFW
Sbjct: 92 -------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHARYQGLTFW 144
Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNG-----GKLKGKLQASA 212
+PNINIFRDPRWGRGQET GEDP +T + V++V+G+QG+ + G+ KL A+A
Sbjct: 145 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGADAPKNAQGEPYRKLDATA 204
Query: 213 CCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGI 272
KHF + + R+ FDAR + +DL +TY P FE+ VK G+ +M AYNRV G
Sbjct: 205 --KHFAVH---SGPEADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGE 259
Query: 273 PSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG 332
+ A + LL R+QWGF GY+ SDC A+ I+ + E A +K G ++ CG
Sbjct: 260 SASASKFLLQDVLRQQWGFKGYVVSDCWAIVDIWKHHKIVATREQAAALAVKHGTELECG 319
Query: 333 SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAH 392
AAV Q + E++ID AL L + RMRLG+F+ P P+ I A V SPAH
Sbjct: 320 EEYST-LPAAVHQGLIDEAQIDTALKTLMTARMRLGMFD-PPGQLPWSTIPASVNQSPAH 377
Query: 393 QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
LA + A++ +VLLKN GLLPL ++K +A+IGP A+ LLGNY G +T L
Sbjct: 378 DALARRTARESLVLLKND-GLLPLSRAKLKRIAVIGPTADDTMALLGNYYGTPAAPVTVL 436
Query: 453 QALQNYVENT--VYYPGCDTV 471
Q ++ N +Y G D V
Sbjct: 437 QGIRAAAPNAQVLYARGADLV 457
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 91/278 (32%), Positives = 138/278 (49%), Gaps = 50/278 (17%)
Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
+ +A+D+A+ AD VV + GL E EE+ DR DL LP Q++L+ + +A
Sbjct: 626 LQEALDVARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEAL-QA 684
Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMT 587
KPV+ VL G + I +A+ +++ +IL A YPG+ G A+A+ +FGD NPGGRLP+T
Sbjct: 685 TGKPVVAVLTTGSALAIDWAQ--QHLPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVT 742
Query: 588 WYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNK 647
+Y + D MR GRTYR++ G ++PFG GLSY++++Y +
Sbjct: 743 FYKESETLPAFDDYAMR--------GRTYRYFGGTPLYPFGHGLSYTQFAY-------SG 787
Query: 648 LYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKP 707
L L++ T T+ VKN G+ AG V L++ P
Sbjct: 788 LRLDR----------------------TTIAADGSLTATVTVKNTGQRAGDEVVQLYLHP 825
Query: 708 ARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESL 745
R K+L GFQ + L E+ + F L +L
Sbjct: 826 LTPQRERAGKELHGFQRIALQPGEQRALHFTLDAKNAL 863
>gi|78049893|ref|YP_366068.1| beta-glucosidase precursor [Xanthomonas campestris pv. vesicatoria
str. 85-10]
gi|78038323|emb|CAJ26068.1| beta-glucosidase precursor [Xanthomonas campestris pv. vesicatoria
str. 85-10]
Length = 902
Score = 337 bits (865), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 193/441 (43%), Positives = 261/441 (59%), Gaps = 32/441 (7%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
+ T QRA DLVSR+TL+EK +Q+ N+APAIPRLG+PAY+WW+EALHGVA G
Sbjct: 35 YLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 91
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTFW 157
GAT FPQ I AA+FD L + + AI EARA ++ + G+TFW
Sbjct: 92 -------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHARYQGLTFW 144
Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNG-----GKLKGKLQASA 212
+PNINIFRDPRWGRGQET GEDP +T + V++V+G++G+ + G+ KL A+A
Sbjct: 145 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLRGEGADAPKNAQGEPYRKLDATA 204
Query: 213 CCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGI 272
KHF + + R+ FDAR + +DL +TY P FE+ VK G+ +M AYNRV G
Sbjct: 205 --KHFAVH---SGPEADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGE 259
Query: 273 PSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG 332
+ A + LL R+QWGF GY+ SDC A+ I+ + E A +K G ++ CG
Sbjct: 260 SASASKFLLQDVLRQQWGFKGYVVSDCWAIVDIWKHHKIVATREQAAALAVKHGTELECG 319
Query: 333 SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAH 392
AAV+Q + E++ID AL L + RMRLG+F+ P P+ I A V SPAH
Sbjct: 320 EEYST-LPAAVRQGLIDEAQIDTALTTLMTARMRLGMFD-PPGQLPWSTIPASVNQSPAH 377
Query: 393 QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
LA + A++ +VLLKN GLLPL ++K +A+IGP A+ LLGNY G +T L
Sbjct: 378 DALARRTARESLVLLKND-GLLPLSRAKLKRIAVIGPTADDTMALLGNYYGTPAAPVTVL 436
Query: 453 QALQNYVENT--VYYPGCDTV 471
Q ++ N +Y G D V
Sbjct: 437 QGIRAAAPNAQVLYARGADLV 457
Score = 135 bits (339), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 92/278 (33%), Positives = 138/278 (49%), Gaps = 50/278 (17%)
Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
+ +A+D+A AD VV + GL E EE+ DR DL LP Q++L+ + +A
Sbjct: 626 LQEALDVASSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEAL-QA 684
Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMT 587
KPV+ VL G + I +A+ +++ +IL A YPG+ G A+A+ +FGD NPGGRLP+T
Sbjct: 685 TGKPVVAVLTTGSALAIDWAQ--QHLPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVT 742
Query: 588 WYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNK 647
+Y + D MR GRTYR++ G ++PFG GLSY++++Y +
Sbjct: 743 FYKESETLPAFDDYAMR--------GRTYRYFGGTPLYPFGHGLSYTQFAY-------SG 787
Query: 648 LYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKP 707
L L++ T T+ VKN G+ AG V L++ P
Sbjct: 788 LRLDR----------------------TTIAADGSLTATVTVKNTGQRAGDEVVQLYLHP 825
Query: 708 ARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESL 745
R K+L GFQ + L A E+ + F L +L
Sbjct: 826 LTPQRERAGKELHGFQRITLQAGEQRALHFILDAKNAL 863
>gi|389737578|ref|ZP_10190998.1| beta-glucosidase [Rhodanobacter sp. 115]
gi|388434298|gb|EIL91245.1| beta-glucosidase [Rhodanobacter sp. 115]
Length = 898
Score = 336 bits (862), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 192/436 (44%), Positives = 252/436 (57%), Gaps = 32/436 (7%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
+ T +RA DLVSR+TL EK++Q+ NSAPAIPRLG+PAY+WW+EALHGVA G+
Sbjct: 43 YLDTAHSFQERAADLVSRMTLAEKVAQMQNSAPAIPRLGVPAYDWWNEALHGVARAGE-- 100
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN-------AGQAIGMTFW 157
AT FPQ I AA+FD L + AI EARA YN G+ G+TFW
Sbjct: 101 --------ATVFPQAIGLAATFDPALLHHEATAISDEARAKYNDFQRRGMRGRYEGLTFW 152
Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
+PN NIFRDPRWGRGQET GEDP +T + V++VRG++GD KL A KHF
Sbjct: 153 SPNTNIFRDPRWGRGQETYGEDPYLTSRMGVAFVRGLEGDDPTYQKL------DATAKHF 206
Query: 218 TAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCAD 277
+ + + R++FD + +DL +TY P F++ V+QG +M AYNRV+G+P+ A
Sbjct: 207 AVH---SGPESERHRFDVHPSERDLHETYLPAFQALVQQGGVDAVMGAYNRVDGVPATAS 263
Query: 278 RNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQK 337
LL RR WGF GY+ SDCDAV+ IY + E A + G D+NCG+
Sbjct: 264 HRLLQDILRRDWGFKGYVVSDCDAVADIYQFHKVVPTAEQAAALAVNNGDDLNCGTTYAT 323
Query: 338 HTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLAL 397
K AV + E ID A+ L R RLG+F+ P P+ + VV SP H LAL
Sbjct: 324 LVK-AVHDGLVNEHTIDTAVTRLMLARFRLGMFD-PPGRVPWSTLPMSVVQSPQHDALAL 381
Query: 398 QAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQN 457
+ AQ+ +VLLKN GLLPL + +A+IGP A++ LLGNY G +T LQ ++
Sbjct: 382 RTAQESMVLLKND-GLLPLSHNVR-RIAVIGPTADNVTALLGNYHGTPKAPVTILQGIRE 439
Query: 458 YVEN--TVYYPGCDTV 471
V N Y G + V
Sbjct: 440 AVPNAQVTYVQGTELV 455
Score = 140 bits (352), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 89/282 (31%), Positives = 144/282 (51%), Gaps = 50/282 (17%)
Query: 474 SSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITR 523
+ + + A+D A+ AD V+ GL E EE+ DR L LP Q++L+ +
Sbjct: 619 AKSPFEAALDAARHADVVIFAGGLSSDLEGEEMPVDYPGFAGGDRTTLALPATQRKLL-Q 677
Query: 524 VAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGR 583
+ KPV+LVL G + I +AK +++ +IL A YPG+ G A+A+ +FG+ +P GR
Sbjct: 678 ALQVTGKPVVLVLTTGSALAIDWAK--QHLPAILLAWYPGQDGGHAVADALFGNVDPAGR 735
Query: 584 LPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAV 643
LP+T+Y P D M+ GRTYR++ G+ +FPFG GLSY++++Y
Sbjct: 736 LPVTFYKSARQLPPFDDYAMK--------GRTYRYFTGQPLFPFGFGLSYTRFAY----- 782
Query: 644 SQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLL 703
+ L L++ + + + +++ VKN G+ AG V L
Sbjct: 783 --SDLQLDRDT----------------------LGPSDRMRISLRVKNTGQRAGDEVVQL 818
Query: 704 FVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESL 745
+++P R + R IK L GFQ + L E+ + F++SP L
Sbjct: 819 YLRPLRAPHARAIKSLRGFQRISLKPGEERSVSFDISPQTDL 860
>gi|390956994|ref|YP_006420751.1| beta-glucosidase-like glycosyl hydrolase [Terriglobus roseus DSM
18391]
gi|390411912|gb|AFL87416.1| beta-glucosidase-like glycosyl hydrolase [Terriglobus roseus DSM
18391]
Length = 742
Score = 335 bits (858), Expect = 7e-89, Method: Compositional matrix adjust.
Identities = 235/750 (31%), Positives = 371/750 (49%), Gaps = 80/750 (10%)
Query: 13 LCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQL 72
+ L ++L V S + + P++ + + PI R DL+ R TL EK QL
Sbjct: 6 VALSTAAVLLSVASCVSASAQGSNAPASGGEVYRDMSRPIEDRITDLIKRFTLQEKAMQL 65
Query: 73 VNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWY 132
++ +PRLG+P + W++ LHGV + + T FP A++D L +
Sbjct: 66 NHTNRGVPRLGLPMWGGWNQTLHGV----------WSKQPTTLFPIPTAMGATWDPELVH 115
Query: 133 RIGQAIGLEARALYNAGQA-----IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYA 187
+ A+ EARALYNA G+ + +P INI RDPRWGR QE EDPL+TG+
Sbjct: 116 TVADAMSDEARALYNAHAEGPRTPHGLVYRSPVINISRDPRWGRIQEVFSEDPLLTGRMG 175
Query: 188 VSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQ 247
V+YVRG+QGD L+ +A KHF +++ + R +A V ++L + +
Sbjct: 176 VAYVRGLQGDDLQ------HLKLAATVKHFAVNNVE----SGRQHLNADVDERNLFEFWL 225
Query: 248 PPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYD 307
P + + + + A +M +YN +NG+P + LL+ R++WGF G++T D AV+++
Sbjct: 226 PHWRAAIMEAHAQSVMSSYNAINGMPDAVNHWLLTDVLRKKWGFDGFVTDDLGAVALLSG 285
Query: 308 ---------AEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALH 358
+ +++ P A ++AG D + F + + AV++ L E ++D AL
Sbjct: 286 TRATNTSEPGQHFSEDPVVAAAAAIRAGNDSDDVEF-ETNLPLAVQRGLLTEKDVDGALR 344
Query: 359 NLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPK 418
N+ V RLG ++ P + +IG DVV S AH+ L+ + A++ + LL N LPL +
Sbjct: 345 NVLRVGFRLGAYD-PPQASKYSRIGMDVVRSQAHRDLSQRVAEESMTLLLNRRQFLPLQR 403
Query: 419 SKSVSLALIGPNANSAKTLLGNYAGPSCR-SITP-LQALQNYVENTVYYPGCDTVACS-S 475
+ S+A+IGP A Y P+ + S+T L+AL Y G V +
Sbjct: 404 DQVKSVAVIGPAGGEAYETGNYYGTPAVKTSVTEGLRALLGSGVKVEYEKGAGYVDLADD 463
Query: 476 ASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILV 535
I++A ++A+ +D VVL +G + E E DR DL LPG QQ L+ V AA V LV
Sbjct: 464 KEIERAANLARKSDVVVLCLGTNLQVEAEGRDRRDLNLPGAQQRLLEAVY-AANPKVALV 522
Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK 595
L+ GP+ +T+A ++ +IL A YPGE G A+A +FG +NPGG LP T Y
Sbjct: 523 LMNAGPLGVTWAH--DHVPAILSAWYPGELGGAAIARTLFGLNNPGGHLPYTVYANLDGV 580
Query: 596 VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSS 655
P + + + G TY++++G ++PFG GLSY+ + Y +KL + Q+S
Sbjct: 581 PPQNEYDV-------SRGYTYQYFKGVPLYPFGHGLSYTHFDY-------SKLKVTQTS- 625
Query: 656 TKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRP 715
G T F +T N G+ AG L+ + +P
Sbjct: 626 ------------------GDHANVTVSFTLT----NTGQSAGAEVTQLYSHQVKSSEVQP 663
Query: 716 IKQLVGFQSVILNAKEKAEIVFELSPCESL 745
++ L GF+ V L E + + P +L
Sbjct: 664 LRTLRGFERVTLQPGESKAVAISI-PTSAL 692
>gi|384421334|ref|YP_005630694.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzicola
BLS256]
gi|353464247|gb|AEQ98526.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzicola
BLS256]
Length = 904
Score = 335 bits (858), Expect = 7e-89, Method: Compositional matrix adjust.
Identities = 192/442 (43%), Positives = 258/442 (58%), Gaps = 32/442 (7%)
Query: 44 PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
P+ T QRA DLVSR+TL+EK +Q+ N+APAIPRL +PAY+WW+EALHGVA G
Sbjct: 36 PYLDTARSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWWNEALHGVARAG-- 93
Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTF 156
GAT FPQ I AA+FD L + + AI EARA ++ + G+TF
Sbjct: 94 --------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHARYQGLTF 145
Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNG-----GKLKGKLQAS 211
W+PNINIFRDPRWGRGQET GEDP +T + V++V+G+QG+ G+ KL A+
Sbjct: 146 WSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGAAAPKNAQGEPYRKLDAT 205
Query: 212 ACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
A KHF + + R+ FDAR + +DL +TY P FE+ VK G+ +M AYNRV G
Sbjct: 206 A--KHFAVH---SGPEAERHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYG 260
Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC 331
+ A + LL R+QWGF GY+ SDC A+ ++ + E A + G ++ C
Sbjct: 261 ESASASKFLLQDVLRQQWGFKGYVVSDCWAIVDVWKHHKIVATREQAAALAVTHGTELEC 320
Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
G AAV Q + E++ID AL L + RMRLG+F+ P P+ KI A V SPA
Sbjct: 321 GEEYST-LPAAVHQGLIDEAQIDTALQTLMTARMRLGMFD-PPGQLPWSKIPASVNQSPA 378
Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITP 451
H LA + A++ +VLLKN GLLPL ++ +A+IGP A+ LLGNY G +T
Sbjct: 379 HDALARRTARESLVLLKND-GLLPLSRATLKRIAVIGPTADDTMALLGNYYGTPAAPVTV 437
Query: 452 LQALQNYVENT--VYYPGCDTV 471
LQ ++ N +Y G D V
Sbjct: 438 LQGIRAAAPNAQVLYARGADLV 459
Score = 142 bits (357), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 100/312 (32%), Positives = 154/312 (49%), Gaps = 58/312 (18%)
Query: 452 LQALQNYVENTVYYPGCDTVACSSA--------SIDKAVDIAKGADHVVLMMGLDQTQEK 503
LQA ++Y Y+ G A A + +A+D+A+ AD VV + GL E
Sbjct: 594 LQAGRSYDLRLDYFEGERDAAVRLAWRQPGAKPPLQEALDVARSADVVVFVGGLTGDVEG 653
Query: 504 EEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNI 553
EE+ DR DL LP Q+EL+ + +A KPV+ VL G + I +A+ +++
Sbjct: 654 EEMKVNYPGFAGGDRTDLRLPKPQRELLEAL-QATGKPVVAVLTAGSALAIDWAQ--QHV 710
Query: 554 GSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPG 613
+IL A YPG+ G A+A+ +FGD NPGGRLP+T+Y + D M G
Sbjct: 711 PAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYKESETLPAFDDYTMH--------G 762
Query: 614 RTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPEL 673
RTYR++ G ++PFG GLSY++++Y + L L++S+ T
Sbjct: 763 RTYRYFGGTPLYPFGHGLSYTQFAY-------SDLRLDRSTLTA---------------- 799
Query: 674 GTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKA 733
T+ VKN G+ AG V L++ P + R K+L GFQ + L E+
Sbjct: 800 ------DGALTATVAVKNTGQRAGDEVVQLYLHPLKPQRERAGKELRGFQRLALQPGEQR 853
Query: 734 EIVFELSPCESL 745
E+ F ++ ++L
Sbjct: 854 ELRFTINATDAL 865
>gi|389636381|ref|XP_003715843.1| beta-xylosidase [Magnaporthe oryzae 70-15]
gi|351648176|gb|EHA56036.1| beta-xylosidase [Magnaporthe oryzae 70-15]
gi|440480767|gb|ELQ61414.1| beta-xylosidase [Magnaporthe oryzae P131]
Length = 517
Score = 334 bits (857), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 200/499 (40%), Positives = 272/499 (54%), Gaps = 18/499 (3%)
Query: 34 DPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEA 93
D N T C TL +RA LV L+++EK+ LV+ + PR+G+PAY WWSEA
Sbjct: 29 DCVNGPLSTNNVCDRTLSPPERAAALVEALSIEEKLQNLVSKSQGAPRIGLPAYNWWSEA 88
Query: 94 LHGVAGVGKGIFF---NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ 150
LHGVA G +F N +TS+P +L AA FD L +IG AIG+EARA N+G
Sbjct: 89 LHGVA-YAPGTYFPQGNVEFNSSTSYPMPLLMAAGFDDNLIEKIGTAIGIEARAWGNSGW 147
Query: 151 AIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQA 210
A G +W PN+N F+DPRWGRG ETPGED L +YA RG+ G N + +
Sbjct: 148 A-GFDYWTPNVNAFKDPRWGRGSETPGEDVLRIKRYAEYITRGLDGPVPNEQR-----RI 201
Query: 211 SACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVN 270
+ CKH+ D ++W GTTR+ F+A++TMQDLA+ Y PF+ C + + IMCAYN VN
Sbjct: 202 ISTCKHYAGNDFEDWNGTTRHDFNAKITMQDLAEYYLKPFQQCARDSKVGSIMCAYNAVN 261
Query: 271 GIPSCADRNLLSKTARRQWGF---HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGM 327
G+PSCA++ LL R W + + Y+TSDC+AV + YA + +AGM
Sbjct: 262 GVPSCANKYLLQTILRDHWKWTEHNNYVTSDCEAVLDVSANHHYAPTNAAGTAICFEAGM 321
Query: 328 DVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVV 387
D +C A Q L E +DRAL L+ +R G F+G M + + V
Sbjct: 322 DTSCEYTGSSDIPGAWSQGLLKEETVDRALLRLYEGLVRAGYFDGEEAM--YADLDWQHV 379
Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
S Q LALQAA +G+VLLKN +G LPL S +A+IG A++ + L G Y+G +
Sbjct: 380 NSAEAQSLALQAAVEGMVLLKN-NGTLPLDLDPSHKIAMIGFWADAPEKLQGGYSGRAHH 438
Query: 448 SITP-LQALQNYVENTVYY-PGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE 505
+P A Q ++ TV P S A++ A GAD+++ GLD + E
Sbjct: 439 LYSPAFAARQLGLDITVASGPVLQDNNASDNWTTNALEAASGADYILYFGGLDTSAAGET 498
Query: 506 LDRVDLVLPGRQQELITRV 524
LDR DL P Q L+ V
Sbjct: 499 LDRTDLDWPEAQLTLVKVV 517
>gi|449489074|ref|XP_002195511.2| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
[Taeniopygia guttata]
Length = 685
Score = 334 bits (856), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 241/716 (33%), Positives = 363/716 (50%), Gaps = 86/716 (12%)
Query: 76 APAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRG-ATSFPQVILTAASFDSYLWYRI 134
AP IPRLGI Y W +E L G +G G AT+FPQ + AA+F L YR+
Sbjct: 6 APPIPRLGIAPYNWNTECLRG----------DGEAPGWATAFPQALGLAAAFSPELIYRV 55
Query: 135 GQAIGLEARALYNAGQA-------IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYA 187
A E RA +N+ A G++ ++P +NI R P WGR QET GEDP ++G+ A
Sbjct: 56 ANATATEVRAKHNSFAAAGRYSDHTGLSCFSPVLNIMRHPLWGRNQETYGEDPFLSGELA 115
Query: 188 VSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQ 247
S+V+G+QG ++ASA CKHF+ + + Y V +D T+
Sbjct: 116 RSFVQGLQGPH------PRYVKASAGCKHFSVHG--GHENILLYLLT--VLERDWRMTFL 165
Query: 248 PPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYD 307
P F++CV+ G S MC+YNR+NG+P+CA++ LL+ R +WGF GY+ SD AV +I
Sbjct: 166 PQFQACVRAGSYS-FMCSYNRINGVPACANKKLLTDILRGEWGFDGYVVSDEGAVELIML 224
Query: 308 AEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT----KAAVKQKKLPESEIDRALHNLFSV 363
Y +S + V + AG ++ ++ + A+ + + + LF
Sbjct: 225 GHHYTRSFLETAVASVNAGCNLELSYGMRNNVFMRIPEALAMGNITLQMLRDRVRPLFYT 284
Query: 364 RMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVS 423
RMRLG F+ P M P+ + VV SP H+ L+L+AA VLLKN G LPL K++ +S
Sbjct: 285 RMRLGEFD-PPAMNPYSSLDLSVVQSPEHRNLSLEAAVKSFVLLKNVRGTLPL-KAQDLS 342
Query: 424 ---LALIGPNANSAKTLLGNYAG-PSCRSI-TPLQALQNYVENTVYYPGCDTVACSSASI 478
LA++GP A++ + L G+YA P R I TP + L+ N + GC C S
Sbjct: 343 SQHLAVVGPFADNPRVLFGDYAPVPEPRYIYTPRRGLEMLGANVSFAAGCSEPRCQRYSR 402
Query: 479 DKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAK-KPVILVLL 537
+ V + AD V++ +G E E DR DL LPG Q EL+ +AA +PVIL+L
Sbjct: 403 AELVKVVGAADVVLVCLGTGVDVETEAKDRSDLSLPGHQLELLQDAVQAAAGRPVILLLF 462
Query: 538 CGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGD--HNPGGRLPMTWYPQDYIK 595
GP+D+++A+ +G+IL +P +A +A+A V+ G+ +P GRLP TW +
Sbjct: 463 NAGPLDVSWAQAHDGVGAILACFFPAQATGLAIARVLLGEAGASPAGRLPATWPAGMHQV 522
Query: 596 VPMTDMKMRPQATSGNPGRTYRFYEGKE--VFPFGCGLSYSKYSYKFKAVSQNKLYLNQS 653
PM + M GRTYR+Y G+E ++PFG GLSY+ + Y+ +S L L +
Sbjct: 523 PPMENYTME--------GRTYRYY-GQEAPLYPFGYGLSYTTFRYRDLVLSPPVLPLCAN 573
Query: 654 SSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNG 713
S +V L G+++ E+ V L+++
Sbjct: 574 LSVSVV------------------------LENTGLRDSEEV-----VQLYLRWEHSSVP 604
Query: 714 RPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEE 769
P QLV F+ V + A +A++ F++ + RA +E GT L G ++
Sbjct: 605 VPRWQLVAFRRVAVPAGREAKLSFQVL---AEQRAVWAQHWHLEPGTFTLFAGGQQ 657
>gi|167524198|ref|XP_001746435.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163775197|gb|EDQ88822.1| predicted protein [Monosiga brevicollis MX1]
Length = 834
Score = 334 bits (856), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 242/739 (32%), Positives = 355/739 (48%), Gaps = 90/739 (12%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQL-------VNSAPAIPRLGIPAYEWWSEALH 95
+PF LP + R DLV RLTL+EK+ QL + APA+ RLGI + W SE
Sbjct: 34 YPFRNPDLPWAARLDDLVGRLTLEEKLQQLQHGGAAQMTPAPAVERLGIGPFVWGSEC-- 91
Query: 96 GVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA---- 151
V G+G +G T+FPQ + AA+FD L R I LE RA N +
Sbjct: 92 -VTGLGT----DGNDPHGTAFPQPLGMAATFDPALLKRAAGTIALELRAQRNFDRENGVV 146
Query: 152 ---IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKL 208
G++ W+P +NI R P WGR ET GE P+++ A S+V G+QG+
Sbjct: 147 KFHHGLSCWSPVVNINRHPLWGRNDETFGECPVLSSFMARSFVEGIQGNHTR------YY 200
Query: 209 QASACCKHFTAYD-LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYN 267
A+A CKH Y DN RY FDA V+ DL T+ FE C G G MC+YN
Sbjct: 201 AAAAACKHLDVYGGPDNL----RYVFDADVSQADLTGTFLMAFEECAAAG-VMGYMCSYN 255
Query: 268 RVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGM 327
+ G+P+CA+ ++ AR QWGF GY+ SD AV I ++ Y + V L AG
Sbjct: 256 SIRGVPACANYRTMTFFAREQWGFEGYVVSDQGAVFRITESHNYTANQTLGAVAALNAGC 315
Query: 328 DVNCGSFLQK----HTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIG 383
D+ Q + A+ K + ID ++ LF VRMRLG F+ P P+ +
Sbjct: 316 DMEDSDDAQHVAYYNLSLALDLKLTDMATIDASVSRLFYVRMRLGEFD-PPENDPWRSLN 374
Query: 384 ADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPL-PKSKSVSLALIGPNANSAKTLLGNYA 442
+V SPAH +A A IVLLKN + LPL +K+ S L+GP A++A ++G Y+
Sbjct: 375 MSIVSSPAHVEMARDVATASIVLLKNQNETLPLSAAAKNASYCLLGPFADNADLMMGKYS 434
Query: 443 GPSCRSITPL------QALQNYVENTV--YYPGCDTVACSSASIDKAVD-IAKGADHVVL 493
++T ALQN + Y GC C I +G D V+L
Sbjct: 435 PHGSTNVTVTYRAGLAAALQNASQTASFQYLEGCTGPFCDGLDTAAVTTFIQQGCDTVLL 494
Query: 494 MMGLDQTQEKEELDRVDLVLPGRQQELITRVAEA--AKKPVILVLLCGGPVDITFAKYDR 551
+G E E LDR ++ PG Q L+ V EA K+ ++L++ GPVD+ + D
Sbjct: 495 AVGTSYHVESESLDRSNMSFPGAQPTLVQTVLEALGTKQRLVLLVSTAGPVDLAALEQDT 554
Query: 552 NIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGN 611
+ +IL Y G+ ALA+++ G+ +P GRLP +W + P+ D M+
Sbjct: 555 RVAAILDLIYLGQTAGTALADILLGETSPSGRLPFSWPNKVSDVPPIDDYTMQ------- 607
Query: 612 PGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVP 671
GRTYRF + +FPFG GLSY++++ + + + P
Sbjct: 608 -GRTYRFAQADVLFPFGYGLSYTQFN---------------------------LSHLAAP 639
Query: 672 ELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKE 731
+ C+ + +++ V N G ++G P+ ++V+ G PI+QL V ++A
Sbjct: 640 YI-LPVCQALR--LSVNVTNTGRLSGAIPLQVYVE-WPNAVGGPIRQLATTTRVFVDAAS 695
Query: 732 KAEIVFELSPCESLSRARE 750
+ + P E L+RA +
Sbjct: 696 SKTVQLSIRPRE-LARASD 713
>gi|440476402|gb|ELQ45004.1| beta-xylosidase, partial [Magnaporthe oryzae Y34]
Length = 515
Score = 333 bits (855), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 199/496 (40%), Positives = 271/496 (54%), Gaps = 18/496 (3%)
Query: 34 DPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEA 93
D N T C TL +RA LV L+++EK+ LV+ + PR+G+PAY WWSEA
Sbjct: 29 DCVNGPLSTNNVCDRTLSPPERAAALVEALSIEEKLQNLVSKSQGAPRIGLPAYNWWSEA 88
Query: 94 LHGVAGVGKGIFF---NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ 150
LHGVA G +F N +TS+P +L AA FD L +IG AIG+EARA N+G
Sbjct: 89 LHGVA-YAPGTYFPQGNVEFNSSTSYPMPLLMAAGFDDNLIEKIGTAIGIEARAWGNSGW 147
Query: 151 AIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQA 210
A G +W PN+N F+DPRWGRG ETPGED L +YA RG+ G N + +
Sbjct: 148 A-GFDYWTPNVNAFKDPRWGRGSETPGEDVLRIKRYAEYITRGLDGPVPNEQR-----RI 201
Query: 211 SACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVN 270
+ CKH+ D ++W GTTR+ F+A++TMQDLA+ Y PF+ C + + IMCAYN VN
Sbjct: 202 ISTCKHYAGNDFEDWNGTTRHDFNAKITMQDLAEYYLKPFQQCARDSKVGSIMCAYNAVN 261
Query: 271 GIPSCADRNLLSKTARRQWGF---HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGM 327
G+PSCA++ LL R W + + Y+TSDC+AV + YA + +AGM
Sbjct: 262 GVPSCANKYLLQTILRDHWKWTEHNNYVTSDCEAVLDVSANHHYAPTNAAGTAICFEAGM 321
Query: 328 DVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVV 387
D +C A Q L E +DRAL L+ +R G F+G M + + V
Sbjct: 322 DTSCEYTGSSDIPGAWSQGLLKEETVDRALLRLYEGLVRAGYFDGEEAM--YADLDWQHV 379
Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
S Q LALQAA +G+VLLKN +G LPL S +A+IG A++ + L G Y+G +
Sbjct: 380 NSAEAQSLALQAAVEGMVLLKN-NGTLPLDLDPSHKIAMIGFWADAPEKLQGGYSGRAHH 438
Query: 448 SITP-LQALQNYVENTVYY-PGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE 505
+P A Q ++ TV P S A++ A GAD+++ GLD + E
Sbjct: 439 LYSPAFAARQLGLDITVASGPVLQDNNASDNWTTNALEAASGADYILYFGGLDTSAAGET 498
Query: 506 LDRVDLVLPGRQQELI 521
LDR DL P Q L+
Sbjct: 499 LDRTDLDWPEAQLTLV 514
>gi|294667502|ref|ZP_06732718.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
gi|292602731|gb|EFF46166.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
Length = 901
Score = 333 bits (855), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 193/455 (42%), Positives = 261/455 (57%), Gaps = 32/455 (7%)
Query: 31 FSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWW 90
C T P+ T RA DLVSR+TL+EK +Q+ N+APAIPRL +PAY+WW
Sbjct: 20 LGCATGAAQAATPPYLDTQRSFEARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWW 79
Query: 91 SEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-- 148
+EALHGVA G GAT FPQ I AA+FD L + + AI EARA ++
Sbjct: 80 NEALHGVARAG----------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFL 129
Query: 149 -----GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNG-- 201
+ G+TFW+PNINIFRDPRWGRGQET GEDP +T + V++V+G+QG+ +
Sbjct: 130 RQNQHERYQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGGDAPK 189
Query: 202 ---GKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGR 258
G+ KL A+A KHF + + R+ FDA + +DL +TY P FE+ VK G+
Sbjct: 190 NAQGERYRKLDATA--KHFAVH---SGPEADRHHFDAHPSQRDLYETYLPAFEALVKDGK 244
Query: 259 ASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDA 318
+M AYNRV G + A + LL R QWGF GY+ SDC A+ I+ + E A
Sbjct: 245 VDAVMGAYNRVYGESASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVATREQA 304
Query: 319 VVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQP 378
+K G ++ CG AAV+Q + E++ID AL L + RMRLG+F+ P P
Sbjct: 305 AALAVKHGTELECGEEYST-LPAAVRQGLIDEAQIDTALKTLMTARMRLGMFD-PPGQLP 362
Query: 379 FGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLL 438
+ +I A V SPAH LA + A++ +VLLKN GLLPL +++ +A+IGP A+ LL
Sbjct: 363 WSQIPASVNQSPAHDALARRTARESLVLLKND-GLLPLSRARLKRIAVIGPTADDTMALL 421
Query: 439 GNYAGPSCRSITPLQALQNYVENT--VYYPGCDTV 471
GNY G +T LQ ++ N +Y G D V
Sbjct: 422 GNYYGTPAAPVTVLQGIRAAAPNAQVLYARGADLV 456
Score = 135 bits (340), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 102/334 (30%), Positives = 159/334 (47%), Gaps = 59/334 (17%)
Query: 452 LQALQNYVENTVYYPGCDTVACSSA--------SIDKAVDIAKGADHVVLMMGLDQTQEK 503
LQA + Y Y+ G A A + +A+D+A+ A+ VV + GL E
Sbjct: 591 LQAGRAYEVRLEYFEGERDAAVRLAWRQPGAKPPLQEALDVARSAEVVVFVGGLTGDVEG 650
Query: 504 EEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNI 553
EE+ DR DL LP Q++L+ + A KPV+ VL G + I +A+ +++
Sbjct: 651 EEMKVNYPGFAGGDRTDLRLPKPQRDLLEAL-HATGKPVVAVLTTGSALAIDWAQ--QHL 707
Query: 554 GSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPG 613
+IL A YPG+ G A+A+ +FGD NPGGRLP+T+Y + D MR G
Sbjct: 708 PAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYKESETLPAFDDYAMR--------G 759
Query: 614 RTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPEL 673
RTYR++ G ++PFG GLSY++++Y + L L++
Sbjct: 760 RTYRYFGGTPLYPFGHGLSYTQFAY-------SGLRLDR--------------------- 791
Query: 674 GTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKA 733
T T+ VKN G+ AG V L++ P R K+L GFQ + L E+
Sbjct: 792 -TTIATDGSLTATVTVKNTGQRAGDEVVQLYLHPLTPQRERAGKELHGFQRIALTPGEQR 850
Query: 734 EIVFELSPCESLSRARED-GLMVIEEGTHFLVVG 766
E+ F ++ ++L E V++ G + + +G
Sbjct: 851 ELGFTINAKDALRLYDEQRKAYVVDPGAYEVQIG 884
>gi|21244948|ref|NP_644530.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. citri str.
306]
gi|21110666|gb|AAM39066.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. citri str.
306]
Length = 901
Score = 333 bits (855), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 194/455 (42%), Positives = 260/455 (57%), Gaps = 32/455 (7%)
Query: 31 FSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWW 90
C T P+ T RA DLVSR+TL+EK +Q+ N+APAIPRL +PAY+WW
Sbjct: 20 LGCATGAAHAATPPYLDTQRSFEARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWW 79
Query: 91 SEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-- 148
+EALHGVA G GAT FPQ I AA+FD L + + AI EARA ++
Sbjct: 80 NEALHGVARAG----------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFL 129
Query: 149 -----GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNG-- 201
+ G+TFW+PNINIFRDPRWGRGQET GEDP +T + V++V+G+QG+
Sbjct: 130 RQNQHARYQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGAAAPK 189
Query: 202 ---GKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGR 258
G+ KL A+A KH + + R+ FDAR + +DL +TY P FE+ VK+G+
Sbjct: 190 NAQGEPYRKLDATA--KHLAVH---SGPEADRHHFDARPSQRDLYETYLPAFEALVKEGK 244
Query: 259 ASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDA 318
+M AYNRV G + A + LL R QWGF GY+ SDC A+ I+ + E A
Sbjct: 245 VDAVMGAYNRVYGESASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVATREQA 304
Query: 319 VVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQP 378
+K G ++ CG AAV+Q + E++ID AL L + RMRLG+F+ P P
Sbjct: 305 AALAVKHGTELECGEEYAT-LPAAVRQGLIDEAQIDTALKTLMTARMRLGMFD-PPGQLP 362
Query: 379 FGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLL 438
+ I A V SPAH LA + A++ +VLLKN GLLPL ++K +A+IGP A+ LL
Sbjct: 363 WSTIPASVNQSPAHDALARRTARESLVLLKND-GLLPLSRAKFKRIAVIGPTADDTMALL 421
Query: 439 GNYAGPSCRSITPLQALQNYVENT--VYYPGCDTV 471
GNY G +T LQ ++ N +Y G D V
Sbjct: 422 GNYYGTPAAPVTVLQGIRAAAPNAQVLYARGADLV 456
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 98/312 (31%), Positives = 151/312 (48%), Gaps = 58/312 (18%)
Query: 452 LQALQNYVENTVYYPGCDTVACSSA--------SIDKAVDIAKGADHVVLMMGLDQTQEK 503
LQA + Y Y+ G A A + +A+D+A+ AD VV + GL E
Sbjct: 591 LQAGRAYEVRLEYFEGERDAAVRLAWRQPGARPPLQEALDVARSADVVVFVGGLTGDVEG 650
Query: 504 EEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNI 553
EE+ DR DL LP Q++L+ + +A +PV+ VL G + I +A+ +++
Sbjct: 651 EEMKVNYPGFAGGDRTDLRLPKPQRDLLEAL-QATGRPVVAVLTTGSALAIDWAQ--QHL 707
Query: 554 GSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPG 613
+IL A YPG+ G A+A+ +FGD NPGGRLP+T+Y + D MR G
Sbjct: 708 PAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYKESETLPAFDDYAMR--------G 759
Query: 614 RTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPEL 673
RTYR++ G ++PFG GLSY++++Y + L L++
Sbjct: 760 RTYRYFGGTPLYPFGHGLSYTQFAY-------SGLRLDR--------------------- 791
Query: 674 GTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKA 733
T T+ VKN G+ AG V L++ P R K+L GFQ + L E+
Sbjct: 792 -TTIATDGSLAATVTVKNTGQRAGDEVVQLYLHPLAPQRERAGKELHGFQRIALQPGEQR 850
Query: 734 EIVFELSPCESL 745
E+ F ++ ++L
Sbjct: 851 ELGFTINAKDAL 862
>gi|326427096|gb|EGD72666.1| hypothetical protein PTSG_04397 [Salpingoeca sp. ATCC 50818]
Length = 614
Score = 333 bits (854), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 220/631 (34%), Positives = 328/631 (51%), Gaps = 57/631 (9%)
Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
+PNINI RDPRWGR QE P EDPL+ G++ Y G+Q G+ + KH+
Sbjct: 11 SPNININRDPRWGRNQEVPSEDPLLNGEFGKLYTMGLQQ-----GEDSRYTKVVVTLKHW 65
Query: 218 TAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCAD 277
AY L++ G TR+ FDA+V+ L DTY P F V +G A G+MC+YN +NG P+C
Sbjct: 66 DAYSLEDSDGFTRHNFDAKVSNFALMDTYWPAFRKAVMEGNAKGVMCSYNALNGRPTCT- 124
Query: 278 RNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGM-DVNCGSFLQ 336
LL+K R W F GY+TSD A+ IY Y + AV L+ G D++ G+
Sbjct: 125 HPLLTKVLRDIWKFDGYVTSDTGAIEDIYAKHHYTANASAAVAAALRDGRCDMDSGAVYH 184
Query: 337 KHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLA 396
AV + ++DRAL+N +R LGLF+ QP+ +I A + + Q L
Sbjct: 185 DALLDAVNSGECSMDDVDRALYNTLKLRFELGLFDPIED-QPYWRINASSINTTYAQDLN 243
Query: 397 LQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS------IT 450
++ + ++LL+N + LP K + V A+IGP+ N+ + L+GNY G C +
Sbjct: 244 MKITLESMILLQNHNNALPFKKGRKV--AVIGPHINAQEALVGNYLGQLCPDDSFDCITS 301
Query: 451 PLQALQ--NYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDR 508
PL A++ N + NTV G +AC+ ASI +AV++AK AD+VVL++G++ T E E DR
Sbjct: 302 PLAAIEAINGMSNTVSAMGSGVLACTDASIQEAVNVAKDADYVVLLIGINDTIEAESNDR 361
Query: 509 VDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAV 568
+ LP Q +L +A K VL+ GG + I K + + +I+ AGYPG G
Sbjct: 362 TSIDLPQCQHKLTAAIAH-LNKTTAAVLINGGMLAIEQEK--KQLPAIIEAGYPGFYGGA 418
Query: 569 ALAEVIFGDHNP-GGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFP 626
A+A+ IFGD+N GG+LP T YP DYI K+ M+DM+M + +PGR+YR+Y G+ ++P
Sbjct: 419 AIAKTIFGDNNHLGGKLPYTVYPADYIHKINMSDMEM-----TNSPGRSYRYYTGQPLWP 473
Query: 627 FGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVT 686
FG GL+Y+ +S + S + ++S F +
Sbjct: 474 FGFGLAYTTFSVQSPGPSASTFATGSNTS---------------------------FSLP 506
Query: 687 IGVKNHGEMAGKHPVLLFVKPAR--RGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCES 744
+ V N G+ G V +++ P + KQL+ F+ V L ++ + LS
Sbjct: 507 VHVVNTGKRTGDTVVQVYMAPVSLPHRSFSLKKQLIAFERVHLTPNQRLGVTIPLSADVF 566
Query: 745 LSRAREDGLMVIEEGTHFLVVGDEEYPISIF 775
G +V G++ LVV D P F
Sbjct: 567 NMVDPVTGNVVSTPGSYRLVVSDGVAPTLRF 597
>gi|390991557|ref|ZP_10261819.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
axonopodis pv. punicae str. LMG 859]
gi|372553724|emb|CCF68794.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
axonopodis pv. punicae str. LMG 859]
Length = 901
Score = 333 bits (854), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 194/455 (42%), Positives = 259/455 (56%), Gaps = 32/455 (7%)
Query: 31 FSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWW 90
C T P+ T RA DLVSR+TL+EK +Q+ N+APAIPRL +PAY+WW
Sbjct: 20 LGCATGAAHAATPPYLDTQRSFEARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWW 79
Query: 91 SEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-- 148
+EALHGVA G GAT FPQ I AA+FD L + + AI EARA ++
Sbjct: 80 NEALHGVARAG----------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFL 129
Query: 149 -----GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNG-- 201
+ G+TFW+PNINIFRDPRWGRGQET GEDP +T + V++V+G+QG+
Sbjct: 130 RQNQHARYQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGAAAPK 189
Query: 202 ---GKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGR 258
G+ KL A+A KHF + + R+ FDAR + +DL +TY P FE+ VK G+
Sbjct: 190 NAQGEPYRKLDATA--KHFAVH---SGPEADRHHFDARPSQRDLYETYLPAFEALVKDGK 244
Query: 259 ASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDA 318
+M AYNRV G + A + LL R QWGF GY+ SDC A+ I+ + E A
Sbjct: 245 VDAVMGAYNRVYGESASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVATREQA 304
Query: 319 VVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQP 378
+K G ++ CG AAV+Q + E++ID AL L + RMRLG+F+ P P
Sbjct: 305 AALAVKHGTELECGEEYAT-LPAAVRQGLIDEAQIDTALKTLMTARMRLGMFD-PPGQLP 362
Query: 379 FGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLL 438
+ I A V SPAH LA + A++ +VLLKN GLLPL ++K +A+IGP A+ LL
Sbjct: 363 WSTIPASVNQSPAHDALARRTARESLVLLKND-GLLPLSRAKLKRIAVIGPTADDTMALL 421
Query: 439 GNYAGPSCRSITPLQALQNYV--ENTVYYPGCDTV 471
GNY G +T LQ ++ +Y G D V
Sbjct: 422 GNYYGTPAAPVTVLQGIRAAAPKAQVLYARGADLV 456
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 98/312 (31%), Positives = 151/312 (48%), Gaps = 58/312 (18%)
Query: 452 LQALQNYVENTVYYPGCDTVACSSA--------SIDKAVDIAKGADHVVLMMGLDQTQEK 503
LQA + Y Y+ G A A + +A+D+A+ AD VV + GL E
Sbjct: 591 LQAGRAYEVRLEYFEGERDAAVRLAWRQPGAKPPLQEALDVARSADVVVFVGGLTGDVEG 650
Query: 504 EEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNI 553
EE+ DR DL LP Q++L+ + +A +PV+ VL G + I +A+ +++
Sbjct: 651 EEMKVNYPGFAGGDRTDLRLPKPQRDLLEAL-QATGRPVVAVLTTGSALAIDWAQ--QHL 707
Query: 554 GSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPG 613
+IL A YPG+ G A+A+ +FGD NPGGRLP+T+Y + D MR G
Sbjct: 708 PAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYKESETLPAFDDYAMR--------G 759
Query: 614 RTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPEL 673
RTYR++ G ++PFG GLSY++++Y + L L++
Sbjct: 760 RTYRYFGGTPLYPFGHGLSYTQFAY-------SGLRLDR--------------------- 791
Query: 674 GTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKA 733
T T+ VKN G+ AG V L++ P R K+L GFQ + L E+
Sbjct: 792 -TTIATDGSLTATVTVKNTGQRAGDEVVQLYLHPLAPQRERAGKELHGFQRIALQPGEQR 850
Query: 734 EIVFELSPCESL 745
E+ F ++ ++L
Sbjct: 851 ELGFTINAKDAL 862
>gi|319788503|ref|YP_004147978.1| glycoside hydrolase [Pseudoxanthomonas suwonensis 11-1]
gi|317467015|gb|ADV28747.1| glycoside hydrolase family 3 domain protein [Pseudoxanthomonas
suwonensis 11-1]
Length = 916
Score = 333 bits (853), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 196/455 (43%), Positives = 259/455 (56%), Gaps = 37/455 (8%)
Query: 44 PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
P+ T+L +RA LVSR+TL+EK +Q+ N +PAI RLG+PAY+WW+EALHGVA G
Sbjct: 49 PWLDTSLSFEERAAALVSRMTLEEKAAQMQNDSPAIERLGLPAYDWWNEALHGVARAG-- 106
Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN-------AGQAIGMTF 156
GAT FPQ I AASFD L ++ AI EARA ++ G+ G+TF
Sbjct: 107 --------GATVFPQAIGMAASFDVPLMDQVSAAISDEARAKHHDFLRKGEHGRYQGLTF 158
Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQA-SACC 214
W+PNINIFRDPRWGRGQET GEDP +T + VS+VRG+QG D G L K + A
Sbjct: 159 WSPNINIFRDPRWGRGQETYGEDPFLTTRMGVSFVRGLQGMDPQTGQPLDPKYRKLDATA 218
Query: 215 KHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPS 274
KHF + + R+ FD + QDL DTY P FES VK+ +M AYNRV G +
Sbjct: 219 KHFAVH---SGPEADRHTFDVHPSKQDLYDTYLPAFESLVKEADVYAVMGAYNRVYGESA 275
Query: 275 CADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSF 334
+ LL T RR WGF GY+ SDC A+ I+ ++PE+A +K G ++NCGS
Sbjct: 276 SGSKFLLLDTLRRDWGFDGYVMSDCWAIVDIWKNHKIVETPEEAAALAVKNGTELNCGST 335
Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
H AVK+ + E+E+D AL LF RM LG+F+ P + ++ V S H
Sbjct: 336 YADHLPVAVKKGLISEAELDDALTRLFVARMELGMFD-PPEQVRWAQVPYSVNQSAEHDA 394
Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
LA + AQ+ +VLLKN G+LPL K LA++GP A+ LLGNY G +T L+
Sbjct: 395 LARKMAQESLVLLKND-GVLPLSKDIR-RLAVVGPTADDTMALLGNYYGTPADPVTILRG 452
Query: 455 LQNYVENTVYYPGCDTVACSSASIDKAVDIAKGAD 489
++ PG D V + VD+ +G D
Sbjct: 453 IREAA------PGVDVVYA------RGVDLVEGRD 475
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 101/300 (33%), Positives = 146/300 (48%), Gaps = 51/300 (17%)
Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
+ A++ A AD VV + GL E EE+ DR D+ LP QQ+L+ V A
Sbjct: 640 FETALEAANSADAVVFVGGLTGDVEGEEMKVDYPGFAGGDRTDIRLPATQQKLLEAV-HA 698
Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMT 587
KPV++VL G + I +A+ RN+ IL A YPG+ G A+ E +FGD+NPGGRLP+T
Sbjct: 699 TGKPVVMVLTTGSALGIDWAR--RNVPGILVAWYPGQRGGTAVGEALFGDYNPGGRLPVT 756
Query: 588 WYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNK 647
+Y D P D M+ RTYR++ G+ +FPFG GLSY+ + Y +
Sbjct: 757 FYSADEKLPPFDDYAMKE--------RTYRYFTGQPLFPFGHGLSYTSFGY-------SG 801
Query: 648 LYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKP 707
L L++ + G E V++ VKN G+ AG V L++ P
Sbjct: 802 LKLDRKRAGA----------------GDEVT------VSVTVKNQGKRAGDEVVQLYLAP 839
Query: 708 ARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRARE-DGLMVIEEGTHFLVVG 766
+ R +K+L GFQ V L E + F + P L E G ++ G + + VG
Sbjct: 840 VKPQRERALKELRGFQRVHLQPGESRTVTFSIVPERDLRVYDEAAGRYTVDPGRYEVQVG 899
>gi|285016879|ref|YP_003374590.1| beta-glucosidase [Xanthomonas albilineans GPE PC73]
gi|283472097|emb|CBA14604.1| putative beta-glucosidase protein [Xanthomonas albilineans GPE
PC73]
Length = 914
Score = 332 bits (852), Expect = 4e-88, Method: Compositional matrix adjust.
Identities = 188/447 (42%), Positives = 266/447 (59%), Gaps = 36/447 (8%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
+ + +QRA DLV+R+TL+EK++Q+ N+APAIPRLG+PAY+WW+E LHGVA G
Sbjct: 34 YLDSQRTFAQRADDLVARMTLEEKVAQMQNAAPAIPRLGVPAYDWWNEGLHGVARAG--- 90
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTFW 157
GAT FPQ I AA+FD L + + AI EARA ++ G+ G+TFW
Sbjct: 91 -------GATVFPQAIGLAATFDLPLMHEVSTAISDEARAKHHEALRRGEHGRYQGLTFW 143
Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNG-----GKLKGKLQASA 212
+PNINIFRDPRWGRGQET GEDP +T + V++V+G+QG+ + G+ KL A+A
Sbjct: 144 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGMQGEGADAPKNAQGETYRKLDATA 203
Query: 213 CCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGI 272
KHF + + + R+ FDAR + +DL +TY P FE+ VK+G+ +M AYNR+ G
Sbjct: 204 --KHFAVH---SGPESERHHFDARPSQRDLYETYLPAFEALVKEGKVDAVMGAYNRLFGE 258
Query: 273 PSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG 332
+ A + LL R +WGFHGY+ SDC A+ I+ + E A +K G + CG
Sbjct: 259 SASASKFLLRDVLRERWGFHGYVVSDCWAIVDIWKNHKIVATREQAAALAVKNGTQLECG 318
Query: 333 SFLQKHTK--AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSP 390
Q++ AAV+Q + E++ID AL L + RMRLG+F+ P + ++ V SP
Sbjct: 319 ---QEYATLPAAVQQGLIGETDIDAALRTLMTARMRLGMFD-PPGQLRWAQLPISVNQSP 374
Query: 391 AHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSIT 450
H LA + A++ +VLLKN GLLPL ++K +A+IGP A+ LLGNY G +T
Sbjct: 375 EHDALARRTARESLVLLKND-GLLPLSRAKHKRIAVIGPTADDTMALLGNYYGTPATPVT 433
Query: 451 PLQALQNYV--ENTVYYPGCDTVACSS 475
LQ ++ + +Y G D V S
Sbjct: 434 ILQGIRAAAPDADVLYARGADLVEGRS 460
Score = 139 bits (349), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 97/307 (31%), Positives = 146/307 (47%), Gaps = 58/307 (18%)
Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
+ +A+D A+ AD VV + GL E EE+ DR DL LP Q+EL+ ++ A
Sbjct: 625 LQEALDTARRADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRELLQALS-A 683
Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMT 587
KPV+ VL G + I +A+ ++ +IL A YPG+ G A+A+V+FGD NPGGRLP+T
Sbjct: 684 TGKPVVAVLTTGSALAIDWAQ--EHVPAILLAWYPGQRGGSAVADVLFGDTNPGGRLPVT 741
Query: 588 WYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNK 647
+Y D MR GRTYR++ G ++PFG GLSY++++Y + + K
Sbjct: 742 FYKASETLPAFDDYAMR--------GRTYRYFAGTPLYPFGHGLSYTQFAYSDLRLDRRK 793
Query: 648 LYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKP 707
+ + S T+ V N G AG V L++ P
Sbjct: 794 VAADGQLS-----------------------------ATLKVTNTGTRAGDEVVQLYLHP 824
Query: 708 ARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGD 767
R IK+L GFQ + L E ++ F +SP + L + +E VV
Sbjct: 825 LAPTRARAIKELRGFQRIALAPGESRDVHFTISP--------QTDLRIYDEAQKHYVVDP 876
Query: 768 EEYPISI 774
+Y + +
Sbjct: 877 GDYELQV 883
>gi|386819249|ref|ZP_10106465.1| beta-glucosidase-like glycosyl hydrolase [Joostella marina DSM
19592]
gi|386424355|gb|EIJ38185.1| beta-glucosidase-like glycosyl hydrolase [Joostella marina DSM
19592]
Length = 878
Score = 331 bits (849), Expect = 7e-88, Method: Compositional matrix adjust.
Identities = 185/427 (43%), Positives = 257/427 (60%), Gaps = 30/427 (7%)
Query: 40 TETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAG 99
+E +PF T LP +R DL++RLT+DEKI+QL+ +PAI RLGIPAY WW+E+LHGVA
Sbjct: 21 SEKYPFQNTELPEDERVNDLINRLTVDEKIAQLLYQSPAIERLGIPAYNWWNESLHGVAR 80
Query: 100 VGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA----GQA---I 152
G AT FPQ I AAS+D L + I EARA ++ GQ
Sbjct: 81 AGY----------ATVFPQSITIAASWDDELVAEVANVISDEARAKHHEYLRRGQHDIYQ 130
Query: 153 GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASA 212
G+TFW+PNINIFRDPRWGRG ET GEDP +TG YV+G+QG N K L+ A
Sbjct: 131 GLTFWSPNINIFRDPRWGRGHETYGEDPYLTGVLGTEYVKGLQG---NNAKY---LKVVA 184
Query: 213 CCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGI 272
KHF + R++FD + +DL +TY P F + VK G IM AYNR+ G
Sbjct: 185 TAKHFAVHSGPE---PLRHEFDVAPSQRDLWETYLPAFRTLVKDGNVYSIMTAYNRIYGE 241
Query: 273 PSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG 332
+ A +L S R +WGF+GY+ SDC A++ ++ AK +A +K G D+NCG
Sbjct: 242 AASASNSLYS-ILRDKWGFNGYVVSDCGAIADMWKTHHVAKDAAEASAMAVKEGCDLNCG 300
Query: 333 SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAH 392
+ +K T A++ + E+++D ALH L R +LG+F+ + + P+ KI V +P H
Sbjct: 301 NSYEKLTD-ALQDGLITEADLDVALHRLMRARFKLGMFDSDEKV-PYAKIPFSVNNNPKH 358
Query: 393 QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
+VLAL+AAQ IVLLKN + +LPL K+ ++A+IGPNA++ ++L GNY G +T L
Sbjct: 359 KVLALKAAQKSIVLLKNENAILPLSKNLK-NIAVIGPNADNIQSLWGNYNGMPKNPVTVL 417
Query: 453 QALQNYV 459
+ ++N V
Sbjct: 418 EGIKNKV 424
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 102/299 (34%), Positives = 147/299 (49%), Gaps = 52/299 (17%)
Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
++KAV A +D VVL +GL++ E EE+ DR L LP +Q EL+ V A
Sbjct: 589 LEKAVLAANKSDVVVLALGLNERLEGEEMKVEVEGFADGDRTSLNLPKKQVELMKEVV-A 647
Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMT 587
KPV+LVLL G + I +A NI +I+ AGYPG+ G A+A V+FGD+NP GRLP+T
Sbjct: 648 TGKPVVLVLLNGSALSINWAS--ENIPAIISAGYPGQEGGNAIANVLFGDYNPAGRLPVT 705
Query: 588 WYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNK 647
+Y P D M GRTY++++ + ++PFG GLSY+K+ Y
Sbjct: 706 YYKSVDDLPPFEDYNMD--------GRTYKYFKKEPLYPFGYGLSYTKFKY--------- 748
Query: 648 LYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKP 707
S E+ E V++ V N G+ G V L+V+
Sbjct: 749 ---------------------SNLEIPLEIKINEPIKVSVQVANEGDFDGDEVVQLYVRD 787
Query: 708 ARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
RPI +LVGF+ + L + ++ F + P E L+ +D VIE G + VG
Sbjct: 788 EEGSTPRPICELVGFKRIHLKKGARQKVEFTIQPRE-LAMINKDDKFVIEPGWFSISVG 845
>gi|225873993|ref|YP_002755452.1| beta-xylosidase B [Acidobacterium capsulatum ATCC 51196]
gi|225791521|gb|ACO31611.1| beta-xylosidase B [Acidobacterium capsulatum ATCC 51196]
Length = 894
Score = 330 bits (846), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 200/504 (39%), Positives = 269/504 (53%), Gaps = 45/504 (8%)
Query: 1 MIFHKLSLVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLV 60
M HKL V L LC LL + + PS P+ + +LP RARDLV
Sbjct: 1 MRQHKLH-VAAGLALCAGMLLMLPSAFAQSQTQSPSTPA-----YLNPSLPPVVRARDLV 54
Query: 61 SRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVI 120
SR+TL EK SQLVN+A AIPRL +PAY WWSEALHGVA + G T FP+ I
Sbjct: 55 SRMTLKEKASQLVNAARAIPRLKVPAYNWWSEALHGVA-----------VNGTTEFPEPI 103
Query: 121 LTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTFWAPNINIFRDPRWGRGQ 173
A+FD + + IG E R +Y + G+ FWAPN+NIFRDPRWGRGQ
Sbjct: 104 GLGATFDVPAIHEMAVDIGTEGRVVYEENEKDGSSKIFHGLDFWAPNLNIFRDPRWGRGQ 163
Query: 174 ETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKF 233
ET GEDP +TGK V++V G+QGD + A KHF D+ + TR+
Sbjct: 164 ETYGEDPFLTGKMGVAFVSGMQGDN------PKYYRVIATPKHF---DVHSGPEPTRHFA 214
Query: 234 DARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHG 293
D V++ D DTY+P F + + QG A +MC+YN +NG P+CA++ L R WGF G
Sbjct: 215 DVDVSLHDQLDTYEPAFRAAIMQGHADSVMCSYNAINGQPACANQFTLQHQLRGAWGFKG 274
Query: 294 YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQ-------KHTKAAVKQK 346
Y+ SDCDAV IY Y + A ++ GMD +C F Q K AV+Q
Sbjct: 275 YVVSDCDAVHDIYSGHKYRPTLAQAAAISMERGMDNDCADFAQPKGDDDYKAYIDAVQQG 334
Query: 347 KLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVL 406
L + +D AL LF+ R++LGLF+ M P+ + SPAH+ A + A + +VL
Sbjct: 335 YLSQQAMDTALVRLFTARIKLGLFDPK-GMDPYADTPHSELNSPAHRAYARKLADESMVL 393
Query: 407 LKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENT--VY 464
LKN G LPL S+A++GP A+ LLGNY G +++ L+ L+ NT Y
Sbjct: 394 LKND-GTLPLKPGSVHSIAVVGPLADQTAVLLGNYNGVPTHTVSFLEGLRAEYPNTKITY 452
Query: 465 YPGCDTVACSSASI-DKAVDIAKG 487
PG ++ ++ + D A+ G
Sbjct: 453 VPGTQFLSDTANPVPDSALTTPDG 476
Score = 96.7 bits (239), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 78/285 (27%), Positives = 132/285 (46%), Gaps = 52/285 (18%)
Query: 492 VLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGP 541
+ ++G+ E EE+ DR +L +P ++ L+ VA+ K PV++VL+ G
Sbjct: 627 IAVVGITSKLEGEEMPVDQPGFLGGDRTNLQMPEPEEALVEAVAKTGK-PVVVVLMNGSA 685
Query: 542 VDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDM 601
+ + + N ++L A Y GE G A+A+ + G ++P GRLP+T+Y D
Sbjct: 686 LAVNWISQHAN--AVLEAWYSGEEGGAAIADTLSGKNDPAGRLPVTFYKSVNQLPNFEDY 743
Query: 602 KMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVEN 661
M RTYR+++GK ++PFG GLSY+ + Y
Sbjct: 744 SME--------NRTYRYFKGKPLYPFGYGLSYTTFRYS---------------------- 773
Query: 662 QDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVG 721
D+ S+P + + + T V N G++AG V L++K + +G P L G
Sbjct: 774 -DL----SIPHATVDAGQPVEASAT--VTNTGKVAGDEVVQLYLKFPKV-DGAPDIALRG 825
Query: 722 FQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
FQ + L + ++ FEL LS G +++ +G + L +G
Sbjct: 826 FQRIHLEPGQSQQVHFELK-KRDLSMVTALGQIIVAQGDYTLSIG 869
>gi|361127339|gb|EHK99311.1| putative exo-1,4-beta-xylosidase bxlB [Glarea lozoyensis 74030]
Length = 569
Score = 330 bits (846), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 198/544 (36%), Positives = 284/544 (52%), Gaps = 55/544 (10%)
Query: 46 CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
C TT P + RA LV + EK+ +++ + + RLG+P Y WWSEALHGVAG GI
Sbjct: 65 CDTTAPPADRAAALVKAMQSSEKLQNIISKSAGVSRLGLPPYNWWSEALHGVAGA-PGIQ 123
Query: 106 FNGT--IRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
F+ + ATS P IL AA+FD L ++G IG EARA N G G+ FW PNIN
Sbjct: 124 FSSSSPWNYATSLPMPILMAAAFDDDLIEKVGTLIGTEARAFGN-GNHSGIDFWTPNINP 182
Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
F+DPRWGRG ETPGED L Y + +RG++G+ K + + A CKH+ A DL+
Sbjct: 183 FKDPRWGRGSETPGEDTLRLKGYVAALLRGLEGN-------KAQRRIIATCKHYAANDLE 235
Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
+W G TR+ FDA+++MQDLA+ Y PF+ C + + MC+YN VNG+P+CA++ LL
Sbjct: 236 SWNGVTRHDFDAKISMQDLAEYYLQPFQQCARDSKVGSFMCSYNSVNGVPACANKYLLQT 295
Query: 284 TARRQWGF---HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTK 340
R W + + Y+TSDC+AV I YA + AG D +C
Sbjct: 296 ILRDHWNWTSENQYVTSDCEAVQDISLNHHYASTNAAGTALAFNAGTDSSC--------- 346
Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
G F+G+ + + +G V +P Q LALQA
Sbjct: 347 -------------------------EAGYFDGSKAL--YSSLGWSDVNTPQAQQLALQAT 379
Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQN--Y 458
DGIV+LKN G LPL +A+IG A+ + L G Y+G + TP+ A Q +
Sbjct: 380 VDGIVMLKN-DGTLPLKLDSKSKVAMIGFWASDSSKLQGGYSGKAPYLRTPVYAAQQLGF 438
Query: 459 VENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQ 518
N P + + + A+ A +D+++ GLD + E +DR L P Q
Sbjct: 439 TPNVATGPVQQSASATDNWTTNALAAASKSDYILYFGGLDTSAAAEGVDRTSLEWPSAQL 498
Query: 519 ELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDH 578
LI +++ A KP+I++ G +D T ++ + SILWA +PG+ G A+ ++I G
Sbjct: 499 ALIKKLS-ALGKPLIIIQE-GDQMDNTPLLTNKGVSSILWASWPGQDGGPAVMQIISGAK 556
Query: 579 NPGG 582
+P G
Sbjct: 557 SPAG 560
>gi|397690575|ref|YP_006527829.1| glucan 1,4-beta-glucosidase [Melioribacter roseus P3M]
gi|395812067|gb|AFN74816.1| glucan 1,4-beta-glucosidase [Melioribacter roseus P3M]
Length = 860
Score = 328 bits (842), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 185/435 (42%), Positives = 256/435 (58%), Gaps = 33/435 (7%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
+ LP +RA DL+ RL+LDEKIS +V+ +PAI RLGIP Y WW+EALHGVA G+
Sbjct: 23 YLNVNLPFEERAEDLLQRLSLDEKISLMVHQSPAIERLGIPEYNWWNEALHGVARNGR-- 80
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTFW 157
AT FP I AA++D L YRI I EARA YN+ G G++ W
Sbjct: 81 --------ATVFPMPIGLAATWDRDLIYRIADVISNEARAKYNSALKKNQRGIYQGISLW 132
Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
APNINIFRDPRWGRG ET GEDP +TG+ AVS+++G+QG K L+ A KH
Sbjct: 133 APNINIFRDPRWGRGMETYGEDPYLTGELAVSFIKGLQGQD------KKYLKTIATPKHL 186
Query: 218 TAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCAD 277
+ + R+ F+A V+ DL +TY P F+ + +G+A +MCAYNR+ G C
Sbjct: 187 AVH---SGPEPERHHFNALVSNYDLNETYLPHFKKSIMKGKAYSVMCAYNRLRGKACCGH 243
Query: 278 RNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQ 336
LL+ R +WGF G + SDC AV I+++ SPE A + +G D+ CG +FL
Sbjct: 244 DTLLTDILRNKWGFEGIVVSDCWAVYDIFNSHKIVDSPEKAAALAVSSGTDLECGNTFLS 303
Query: 337 KHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLA 396
K A + + E EID AL + R +LG+F+ P + + +I + + ++ +A
Sbjct: 304 --LKNAYRDGLITEKEIDSALRRVLLARFKLGMFDP-PEIVSYSQIDESYLDNSYNREIA 360
Query: 397 LQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQ 456
L+AA+ IVLLKN + LLPL S + +A+IGPNA++ ++LLGNY G ITPLQA++
Sbjct: 361 LEAARKSIVLLKNDNKLLPLDSSIN-KIAVIGPNADNLESLLGNYHGFPSEYITPLQAIR 419
Query: 457 NYVEN--TVYYPGCD 469
++N Y GCD
Sbjct: 420 RVLKNGEVFYEKGCD 434
Score = 127 bits (320), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 87/275 (31%), Positives = 136/275 (49%), Gaps = 52/275 (18%)
Query: 479 DKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAA 528
D+A A +D V++ MGL E E L DR+ L LP Q +LI ++ +
Sbjct: 589 DRAYKTALKSDAVIMFMGLCPRMEGEALKIKLDGFKGGDRLKLSLPANQLKLIKKI-HST 647
Query: 529 KKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTW 588
KPVILVLL GGP+ + NI +IL A YPG+AG A+ +VI+G +NP G+LP+T
Sbjct: 648 GKPVILVLLNGGPISTVWES--ENIPAILEAWYPGQAGGRAITDVIWGKYNPSGKLPVTI 705
Query: 589 YPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKL 648
Y + P + M GRTYR+++G+ ++PFG GL+Y+ + +S N+
Sbjct: 706 YKSENDLPPFENYDME--------GRTYRYFKGEVLYPFGWGLNYTDITISNIELSANE- 756
Query: 649 YLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPA 708
+++ D + V + +KN+G +AG+ V L+ K
Sbjct: 757 ----------IKDNDTIR------------------VVVKLKNNGNLAGEETVQLYTKAL 788
Query: 709 RRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCE 743
+ + R IK L GF+ + L + + F LS +
Sbjct: 789 K--DNRTIKTLRGFEKIKLEPGTEGMVEFYLSKSD 821
>gi|189467437|ref|ZP_03016222.1| hypothetical protein BACINT_03826 [Bacteroides intestinalis DSM
17393]
gi|189435701|gb|EDV04686.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 863
Score = 328 bits (842), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 181/429 (42%), Positives = 244/429 (56%), Gaps = 26/429 (6%)
Query: 42 TFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVG 101
T F LPI +R DLV RLTL+EKISQ++N+APAI RLGIPAY WW+E LHGVA
Sbjct: 23 TCKFLNPELPIVERVNDLVGRLTLEEKISQMLNNAPAIDRLGIPAYNWWNECLHGVAR-- 80
Query: 102 KGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN-------AGQAIGM 154
+ TSFPQ I AA++D+ +++ E RA+Y+ G G+
Sbjct: 81 -------SPYPVTSFPQAIAMAATWDTESVHQMAVYASDEGRAIYHDATRKGTPGIFRGL 133
Query: 155 TFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACC 214
T+W+PNINIFRDPRWGRGQET GEDP +T VS+V+G+QGD L++SAC
Sbjct: 134 TYWSPNINIFRDPRWGRGQETYGEDPFLTASIGVSFVKGLQGDD------PVYLKSSACA 187
Query: 215 KHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPS 274
KH+ + W R+ +DA+V DL DTY P F+ V +G+ +G+MCAYN G P
Sbjct: 188 KHYAVHSGPEW---NRHTYDAKVNNHDLWDTYLPAFKELVVEGKVTGVMCAYNSFFGQPC 244
Query: 275 CADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSF 334
C + L+ R W F GY+TSDC AV Y+ + A D + G D CG+
Sbjct: 245 CGNDLLMMDILRNHWKFGGYVTSDCGAVEDFYNTHKTHQDAAAASADAVLHGTDCECGNG 304
Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
+ AV + + E +ID +L LF +R RLG+F+ + + P+ I V+ AH+
Sbjct: 305 AYRALADAVLRGLITEKQIDESLKKLFEIRFRLGMFDPDDRV-PYSNIPLSVLECDAHKA 363
Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
AL+ A+ IVLLKN LLPL K+K +A++GPNA+ LL NY G T L+
Sbjct: 364 HALKIARQSIVLLKNQDQLLPLNKNKIKKIAVVGPNADDKSVLLANYYGYPSHITTALEG 423
Query: 455 LQNYVENTV 463
+Q V N V
Sbjct: 424 IQKKVGNQV 432
Score = 105 bits (262), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 81/268 (30%), Positives = 121/268 (45%), Gaps = 53/268 (19%)
Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
K AD ++ + GL E EE+ DR + +P QQ L+ + A KPV+ V
Sbjct: 597 KDADVIIFVGGLSAKVEGEEMGVEIEGFKRGDRTSISIPSVQQNLLKELY-ATGKPVVFV 655
Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK 595
++ G + + + ++ +IL A Y G+AG A+A+V+FGD+NP GRLP+T+Y
Sbjct: 656 MMTGSALGLEWES--AHLPAILNAWYGGQAGGQAIADVLFGDYNPSGRLPLTFYKSVNDL 713
Query: 596 VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSS 655
D M RTYR++ G V+PFG GLSY+ + Y SS
Sbjct: 714 PDFEDYSME--------NRTYRYFTGTPVYPFGYGLSYTTFQY---------------SS 750
Query: 656 TKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRP 715
K+ + D R VT + N G+M G+ L+V R P
Sbjct: 751 LKLQPSPD----------------KRSVKVTAKITNTGKMEGEEVAQLYVSNPRDFV-TP 793
Query: 716 IKQLVGFQSVILNAKEKAEIVFELSPCE 743
I+ L GF+ + L E + F L+ E
Sbjct: 794 IRALKGFKRINLKPGESQTVEFVLTSKE 821
>gi|254786805|ref|YP_003074234.1| glycoside hydrolase family 3 domain-containing protein
[Teredinibacter turnerae T7901]
gi|237686035|gb|ACR13299.1| glycoside hydrolase family 3 domain protein [Teredinibacter
turnerae T7901]
Length = 888
Score = 328 bits (840), Expect = 8e-87, Method: Compositional matrix adjust.
Identities = 194/477 (40%), Positives = 271/477 (56%), Gaps = 44/477 (9%)
Query: 11 PLLCLCFTSLL---TRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDE 67
P+L L SLL D+ P + + E + TTL I R DLVSR+ L E
Sbjct: 10 PILGLTLASLLFTGCSPDNNPVPKPVSERSTANEQPAYMDTTLDIDTRVDDLVSRMDLAE 69
Query: 68 KISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFD 127
KISQ+ N +PAI LGI Y+WW+EALHGVA GK AT FPQ I AA +D
Sbjct: 70 KISQMYNESPAIEHLGIAEYDWWNEALHGVARAGK----------ATVFPQAIGMAAMWD 119
Query: 128 SYLWYRIGQAIGLEARALYN-------AGQAIGMTFWAPNINIFRDPRWGRGQETPGEDP 180
+ I +A+ EARA ++ + G+TFW+PNINIFRDPRWGRGQET GEDP
Sbjct: 120 RETMFDIAEAVSDEARAKHHYFVENGVHFRYTGLTFWSPNINIFRDPRWGRGQETYGEDP 179
Query: 181 LVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQ 240
+TG+ A+ Y+ G+QG+ L+ +A KHF + + +R+ + + +
Sbjct: 180 YLTGELALPYISGLQGEN------PKYLKTAAMAKHFAVH---SGPEKSRHSDNYIASPK 230
Query: 241 DLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCD 300
DL +TY P FE V +G +MCAYNRVN P+C + LL +T R +WGF G++ SDC
Sbjct: 231 DLNETYLPAFEKAVVEGDVESVMCAYNRVNDEPACGNDMLLKETLRGKWGFKGHVVSDCG 290
Query: 301 AVSIIYDAEGY--AKSPEDAVVDVLKAGMDVNCG-----SFLQKHTKAAVKQKKLPESEI 353
A++ Y E + +P A +++G D+NCG +F H A++++ + + EI
Sbjct: 291 AIADFYAPEAHHVVMAPAAAAAWAVRSGTDLNCGTDRLSTFANLHF--ALQREMITQDEI 348
Query: 354 DRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGL 413
D+++ L R +LG+F+ + + P+ KI DVV S AH L +AA+ VLLKNS G+
Sbjct: 349 DQSVKRLMKTRFKLGMFDPDDQV-PYSKIPMDVVGSQAHLALTQKAAEKSFVLLKNS-GI 406
Query: 414 LPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV--ENTVYYPGC 468
LPL KS V A+IGPNA + L+GNY G + +TPL +Q Y+ EN Y PG
Sbjct: 407 LPLKKSSKV--AIIGPNATNPTVLVGNYFGDPIKPVTPLDGIQQYLGEENVFYAPGS 461
Score = 90.1 bits (222), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 75/292 (25%), Positives = 130/292 (44%), Gaps = 54/292 (18%)
Query: 491 VVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGG 540
++ G+ E EE+ DR D+ LP Q++L+ + + KP++LV G
Sbjct: 624 IIFTGGISADLEGEEMSVEIEGFDHGDRTDIRLPEPQRKLLATLKKL-NKPIVLVNFSGS 682
Query: 541 PVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVP-MT 599
+ + +A + N+ +IL YPGEA ALA +++G+ +P GRLP+T+Y + +P
Sbjct: 683 AIALNWA--NNNVDAILQGFYPGEATGTALARILWGEVSPSGRLPITFY-RSLDDLPGFK 739
Query: 600 DMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMV 659
D M RTY++Y+G ++PFG GLSY++++Y S+ M
Sbjct: 740 DYAMT--------NRTYKYYQGDVLYPFGYGLSYTQFAYS-----------ELSAPATMA 780
Query: 660 ENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQL 719
+ + +T V N G++A V ++V G P ++L
Sbjct: 781 SGEPLA-------------------ITAQVSNSGKVASDEVVQVYVSMKVPGLSLPQREL 821
Query: 720 VGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYP 771
F+ + L + F ++ + LS + G+ G L VG + P
Sbjct: 822 KEFKRIYLEPGASQTVEFSIA-GKDLSYVDDQGVRHPYHGPLTLSVGGGQQP 872
>gi|197106390|ref|YP_002131767.1| glucan 1,4-beta-glucosidase [Phenylobacterium zucineum HLK1]
gi|196479810|gb|ACG79338.1| glucan 1,4-beta-glucosidase [Phenylobacterium zucineum HLK1]
Length = 888
Score = 326 bits (836), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 187/462 (40%), Positives = 262/462 (56%), Gaps = 40/462 (8%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
+ T LP +RA DLV+R+TL+EK Q+ ++APAIPRLG+PAY WW+E LHGVA G
Sbjct: 38 YRDTRLPAERRAADLVARMTLEEKSRQIGHTAPAIPRLGVPAYNWWNEGLHGVARAGI-- 95
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY--------NAGQAIGMTF 156
AT FPQ I AA++D IG E RA Y + G+T
Sbjct: 96 --------ATVFPQAIGMAATWDVDRMRGTADVIGTEFRAKYAERVHPDGSTDWYRGLTV 147
Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKH 216
W+PNINIFRDPRWGRGQET GEDP +TG+ V+++RG+QG N + A KH
Sbjct: 148 WSPNINIFRDPRWGRGQETYGEDPYLTGRMGVAFIRGLQGQDPN------FFKTIATAKH 201
Query: 217 FTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
+ + + R++ D + DL DTY P F + V +G+ +MCAYN V+G+P+CA
Sbjct: 202 YAVHSGPE---SNRHREDVHPSAYDLEDTYLPAFRAAVTEGKVQAVMCAYNAVDGVPACA 258
Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIY--DAEGYAKSPEDAVVDVLKAGMDVNCGSF 334
+L+ + RR WGF G++ SDC A + IY D+ Y K+PE+ + L AGMD+ CG +
Sbjct: 259 SEDLMDQRLRRDWGFSGHVVSDCGAAANIYREDSLAYVKTPEEGITRALNAGMDLVCGDY 318
Query: 335 L------QKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVC 388
+ T +AV++ L E+ +D AL LF+ R+RLGLF+ P PF KI A
Sbjct: 319 RADWNTEAEATVSAVRKGMLDETVLDGALVRLFADRIRLGLFD-PPAEVPFSKITAAQND 377
Query: 389 SPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS 448
+P H+ ++L+ A+ + LLKN G+LPL K + +A++GPNA+S L+GNY G
Sbjct: 378 TPEHRAMSLEMAKASMTLLKND-GVLPL-KGEPRRIAVVGPNADSVDALIGNYYGTPSNP 435
Query: 449 ITPLQALQNYV--ENTVYYPGCDTVACSSASIDKAVDIAKGA 488
+T L ++ VY G V +S + AV A A
Sbjct: 436 VTVLAGIRARFPKAEVVYAEGTGLVGPASLPVPDAVLCADAA 477
Score = 121 bits (303), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 99/296 (33%), Positives = 144/296 (48%), Gaps = 55/296 (18%)
Query: 489 DHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLC 538
D VV + GL E EE+ DR L LP QQ+L+ R+ A KPV+LVL+
Sbjct: 613 DLVVFVGGLTARVEGEEMKLQVPGFAGGDRTSLDLPAPQQDLLRRL-HATGKPVVLVLMN 671
Query: 539 GGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPM 598
G + + +A D N+ +I+ A YPG G A+A+++ GD++P GRLP+T+Y P
Sbjct: 672 GSALSVNWA--DANLPAIVEAWYPGGEGGHAVAQLLAGDYSPAGRLPVTFYRSAGDLPPF 729
Query: 599 TDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKM 658
D M+ GRTYR++ G+ ++PFG GLSY+++SY +S
Sbjct: 730 ADYAMK--------GRTYRYFGGEVLYPFGYGLSYTRFSYGAPQLSA------------- 768
Query: 659 VENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVK-PARRGNGRPIK 717
+SV G + VT V N G M G+ V L+V P R +G PI+
Sbjct: 769 ---------RSVSADG-------EITVTTQVTNTGGMDGEEVVQLYVSHPGR--DGTPIR 810
Query: 718 QLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPIS 773
L GFQ + L E + F L + LS +G +E G + VG + P+S
Sbjct: 811 ALQGFQRIGLKRGETRPVSFTLKDRQ-LSVVDAEGNRRVEPGRVEVWVGGGQ-PVS 864
>gi|359450637|ref|ZP_09240068.1| beta-glucosidase [Pseudoalteromonas sp. BSi20480]
gi|358043611|dbj|GAA76317.1| beta-glucosidase [Pseudoalteromonas sp. BSi20480]
Length = 468
Score = 326 bits (835), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 180/437 (41%), Positives = 250/437 (57%), Gaps = 36/437 (8%)
Query: 33 CDPSNPSTETFP-FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWS 91
C ST+T P + + I +R DLV+RLTL+EK++QL + +PAI RL +P Y WW+
Sbjct: 21 CAKDEQSTQTPPLYLNKSASIDERVNDLVTRLTLEEKVAQLFDKSPAIERLNMPEYNWWN 80
Query: 92 EALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA--- 148
EALHGVA GK AT FPQ I AA+FD L R+G AI E RA ++A
Sbjct: 81 EALHGVARAGK----------ATVFPQAIGLAATFDEDLMLRVGTAISDEGRAKHHAFLE 130
Query: 149 ----GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKL 204
G+T+W+PNINIFRDPRWGRGQET GEDP +T + AV+++ G+QGD
Sbjct: 131 ENNRSMYTGLTYWSPNINIFRDPRWGRGQETYGEDPYLTTRIAVNFINGLQGDN------ 184
Query: 205 KGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMC 264
L++ A KH+ + + +R+ D + +DLA+TY P F+ + Q + + +MC
Sbjct: 185 AEYLKSVATLKHYAVH---SGPEVSRHSDDYTASEKDLAETYLPAFKDVIAQTKVASVMC 241
Query: 265 AYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGY--AKSPEDAVVDV 322
AYN VNG P+C + L+ R ++ F GYI SDC A++ YD + + + A
Sbjct: 242 AYNSVNGTPACGNDELIQNKLRDEFNFDGYIVSDCGAIADFYDVKSHNIVNTGAKAAAMA 301
Query: 323 LKAGMDVNCGSF---LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
LK G D+NCG + AVK+ + E ++D+AL L R +LG+F+ NP P+
Sbjct: 302 LKTGTDLNCGDHHGNTYSYLTQAVKEGLVEEKDVDKALKRLMYARFKLGMFD-NPENVPY 360
Query: 380 GKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLG 439
DVV S H L +AAQ +VLLKN +LPL ++ + ALIGPNA++ LLG
Sbjct: 361 SDTSIDVVGSNKHLALTQEAAQKSLVLLKNEQ-VLPLKGNEKI--ALIGPNADNEAILLG 417
Query: 440 NYAGPSCRSITPLQALQ 456
NY G ITP AL+
Sbjct: 418 NYNGMPIVPITPKLALE 434
>gi|392537607|ref|ZP_10284744.1| Beta-glucosidase [Pseudoalteromonas marina mano4]
Length = 870
Score = 325 bits (833), Expect = 5e-86, Method: Compositional matrix adjust.
Identities = 180/437 (41%), Positives = 250/437 (57%), Gaps = 36/437 (8%)
Query: 33 CDPSNPSTETFP-FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWS 91
CD S + P + + I +R DLV+RLTL+EK++QL + +PAI RL IP Y WW+
Sbjct: 21 CDNDEQSMQPPPLYLNESASIDERVNDLVTRLTLEEKVAQLFDKSPAIERLNIPEYNWWN 80
Query: 92 EALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA 151
EALHGVA GK AT FPQ I AA+FD L R+G AI E RA ++A A
Sbjct: 81 EALHGVARAGK----------ATVFPQAIGLAATFDEDLMLRVGTAISDEGRAKHHAFLA 130
Query: 152 -------IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKL 204
G+T+W+PNINIFRDPRWGRGQET GEDP +T + AV+++ G+QGD
Sbjct: 131 ENNRSMYTGLTYWSPNINIFRDPRWGRGQETYGEDPYLTTRIAVNFINGLQGDN------ 184
Query: 205 KGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMC 264
L++ A KH+ + + +R+ D + +DLA+TY P F+ + Q + + +MC
Sbjct: 185 TEYLKSVATLKHYAVH---SGPEVSRHSDDYTASKKDLAETYLPAFKDVIAQTKVASVMC 241
Query: 265 AYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGY--AKSPEDAVVDV 322
AYN VNG P+C + L+ R ++ F GYI SDC A++ YD + + + A
Sbjct: 242 AYNSVNGTPACGNDELIQNKLRDEFNFDGYIVSDCGAIADFYDVKSHNIVNTEAKAAAMA 301
Query: 323 LKAGMDVNCGSF---LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
LK G D+NCG + AVK+ + E ++D+AL L R +LG+F+ NP P+
Sbjct: 302 LKTGTDLNCGDHHGNTYSYLSQAVKEGLVEEKDVDKALKRLMYARFKLGMFD-NPENVPY 360
Query: 380 GKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLG 439
D+V S H L +AA+ +VLLKN +LPL ++ V ALIGPNA++ LLG
Sbjct: 361 SDTSIDIVGSNKHLALTQEAAKKSLVLLKNEQ-VLPLKGNEKV--ALIGPNADNEAILLG 417
Query: 440 NYAGPSCRSITPLQALQ 456
NY G ITP AL+
Sbjct: 418 NYNGMPIVPITPKLALE 434
Score = 112 bits (279), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 92/361 (25%), Positives = 153/361 (42%), Gaps = 59/361 (16%)
Query: 418 KSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSS-- 475
KS +V L G + N A TL N + ++ + V P +
Sbjct: 538 KSNNVDFTLNGQSVNGAITLEKN----QSYQLNAESKFNHFWHSNVINPTASLTWLKNPQ 593
Query: 476 ASIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVA 525
+ +A++ A AD +V + G+ E EE+ DR ++ LP Q L+ ++
Sbjct: 594 SLTQQALNNANEADVIVFVGGISANLEGEEMPLQIDGFSHGDRTNINLPKSQLNLLKKLK 653
Query: 526 EAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLP 585
+ K P++LV + G + + + + NI +I+ YPGEA AL +++G+++P G+LP
Sbjct: 654 QTGK-PIVLVNMSGSAMALNWE--NENIDAIIQGFYPGEAAGSALVSLLYGEYSPSGKLP 710
Query: 586 MTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQ 645
+T+Y D M+ RTY++YEG+ ++PFG GLSY+ + YK
Sbjct: 711 ITFYKSVSDLPDFKDYSMK--------NRTYKYYEGEVLYPFGFGLSYADFKYK------ 756
Query: 646 NKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFV 705
N S + +T + N + V ++V
Sbjct: 757 -----NTRHSID--------------------AGSGDLNLTTTITNQSSFSADDVVQVYV 791
Query: 706 KPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVV 765
P KQLVGF+ + L + K +I F + P LS E G+ V +G + V
Sbjct: 792 SMPDAPIKTPNKQLVGFKHITLKNESKNDIKFTI-PKNKLSYINEQGIAVAYKGRLIITV 850
Query: 766 G 766
G
Sbjct: 851 G 851
>gi|380509734|ref|ZP_09853141.1| beta-glucosidase-related glycosidase [Xanthomonas sacchari NCPPB
4393]
Length = 883
Score = 325 bits (832), Expect = 7e-86, Method: Compositional matrix adjust.
Identities = 192/444 (43%), Positives = 265/444 (59%), Gaps = 36/444 (8%)
Query: 39 STETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVA 98
+ + P+ T+ RA LV+++TL+EK +Q+ N+APAI RLG+PAY+WW+EALHGVA
Sbjct: 18 AVDAKPWQDTSASFEARAAALVAQMTLEEKAAQMQNAAPAIERLGVPAYDWWNEALHGVA 77
Query: 99 GVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQA 151
G+ AT FPQ I AA+FD L ++ I EARA ++ G+
Sbjct: 78 RAGQ----------ATVFPQAIGLAATFDVPLMGQVATTISDEARAKHHQFLREGAHGRY 127
Query: 152 IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQAS 211
G+TFW+PNINIFRDPRWGRGQET GEDP +T + V++V+G+QGD + KL A+
Sbjct: 128 QGLTFWSPNINIFRDPRWGRGQETYGEDPYLTARMGVAFVQGLQGDD----PVYRKLDAT 183
Query: 212 ACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
A KHF + + R+ FDAR + +DL DTY P FE+ VK+G+ +M AYNRV G
Sbjct: 184 A--KHFAVH---SGPEADRHHFDARPSKRDLYDTYLPAFEALVKEGKVDAVMGAYNRVYG 238
Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC 331
+ A + LL RR WGF GY+ SDC A+ I+ A S E A +K G ++ C
Sbjct: 239 ESASASQFLLRDVLRRDWGFTGYVVSDCWAIVDIWKHHHLAPSREAAAALAVKNGTELEC 298
Query: 332 GSFLQKHTK--AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
G Q++ AAV+Q + E+EID A+ LF+ RMRLG+F+ P + +I A V
Sbjct: 299 G---QEYATLPAAVRQGLIGEAEIDDAVTRLFTARMRLGMFD-PPERVRWARIPASVNQV 354
Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
PAH LALQAAQ+ +VLLKN G+LPL ++ +A++GP A+ LLGNY G +
Sbjct: 355 PAHDALALQAAQESLVLLKND-GVLPLSRTLK-RIAVVGPTADDTMALLGNYFGTPAAPV 412
Query: 450 TPLQALQNYVE--NTVYYPGCDTV 471
T LQ +++ + Y G D V
Sbjct: 413 TILQGIRDAAKGIEVRYARGVDLV 436
Score = 124 bits (311), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 91/301 (30%), Positives = 142/301 (47%), Gaps = 53/301 (17%)
Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
D+A+D A+ AD VV + GL E EE+ DR DL LP Q+ L+ + A
Sbjct: 605 FDEALDAARNADVVVFVGGLTGDVEGEEMKVDYPGFAGGDRTDLRLPAPQRALLEAL-HA 663
Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMT 587
KPV++VL G + + +A+ ++ +IL + YPG+ G A+ + +FG+ NP GRLP+T
Sbjct: 664 TGKPVVMVLTGGSALAVDWAQA--HLPAILMSWYPGQRGGTAVGQALFGEVNPAGRLPVT 721
Query: 588 WYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNK 647
+Y D D M GRTYR++ G ++PFG GLSY+++ Y K
Sbjct: 722 FYRADQALPAFDDYAME--------GRTYRYFRGTPLYPFGHGLSYTRFDY-------GK 766
Query: 648 LYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKP 707
L+L+ + + + + V N G+ AG L+V+
Sbjct: 767 LHLD----------------------APRIADDGRLKLQVEVANTGKRAGDEVAQLYVRR 804
Query: 708 ARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRARED--GLMVIEEGTHFLVV 765
G + L GFQ V L E+ + FEL ++L R +D G V+ G + + +
Sbjct: 805 LAAAPGDAQQTLRGFQRVHLAPGERRTLTFELDAQQAL-RQYDDARGAYVVPAGRYEVRI 863
Query: 766 G 766
G
Sbjct: 864 G 864
>gi|322437617|ref|YP_004219707.1| glycoside hydrolase family protein [Granulicella tundricola
MP5ACTX9]
gi|321165510|gb|ADW71213.1| glycoside hydrolase family 3 domain protein [Granulicella
tundricola MP5ACTX9]
Length = 892
Score = 325 bits (832), Expect = 8e-86, Method: Compositional matrix adjust.
Identities = 193/445 (43%), Positives = 257/445 (57%), Gaps = 40/445 (8%)
Query: 36 SNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALH 95
S+ + P+ L QR DLVSR+TL+EK+SQ +NSAPAI RL +P Y++WSE LH
Sbjct: 25 SHQDAKPLPYMDPALTTQQRVDDLVSRMTLEEKVSQTINSAPAISRLNVPEYDYWSEGLH 84
Query: 96 GVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI--- 152
G+A G AT FPQ I AA++D+ L +IG I +EARA +N +AI
Sbjct: 85 GIARSGY----------ATMFPQAIGMAATWDAPLLQQIGDVISIEARAKFN--EAIRHN 132
Query: 153 ------GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKG 206
G+T W+PNINIFRDPRWGRGQET GEDP +TG+ V++V+G+QG N
Sbjct: 133 IHSIYYGLTIWSPNINIFRDPRWGRGQETYGEDPFLTGRLGVAFVKGIQGPDPN------ 186
Query: 207 KLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAY 266
+A A KHF + + +TR+ + T DL DTY P F + + + A IMCAY
Sbjct: 187 YFRAIATPKHFAVH---SGPESTRHSANIEPTPHDLHDTYLPAFRATITEAHADSIMCAY 243
Query: 267 NRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIY--DAEGYAKSP--EDAVVDV 322
N V G P+CA + LL T RR WGF G++TSDC A+ Y D + SP E A
Sbjct: 244 NAVEGSPACASKLLLQDTLRRDWGFKGFVTSDCGAIDDFYATDYPSHHTSPDKEAAAAAG 303
Query: 323 LKAGMDVNCG-SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGK 381
+KAG D NCG ++L +AVK+ + E+EID AL +LF+ R +LGLF+ + F
Sbjct: 304 IKAGTDSNCGQTYLT--LGSAVKKGLVTEAEIDTALKHLFTARFQLGLFDPAAKVA-FNA 360
Query: 382 IGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNY 441
I V SPAHQ LAL+AA++ IVLLKN LP K ++A+IGP+A + L GNY
Sbjct: 361 IPFSEVNSPAHQALALKAAEESIVLLKNDAHTLPF-KPSVRTIAVIGPSAATLNNLEGNY 419
Query: 442 AGPSCRSITPLQA-LQNYVENTVYY 465
+ PL L + + V Y
Sbjct: 420 NAIPLHPVLPLDGILTQFKSSKVLY 444
Score = 107 bits (268), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 86/283 (30%), Positives = 127/283 (44%), Gaps = 54/283 (19%)
Query: 494 MMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVD 543
+GL E EE+ DR D+ LP QQ+++ VA A KP+++VLL G +
Sbjct: 629 FLGLSPELEGEEMPIHIEGFAGGDRTDIKLPAAQQQMLEAVA-ATGKPLVVVLLNGSALA 687
Query: 544 ITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVP-MTDMK 602
+ +A + + +IL A YPG+AG A+AE + G +NP GRLP+T+Y ++P D
Sbjct: 688 VNWA--NDHAAAILEAWYPGQAGGTAIAETLAGKNNPAGRLPVTFY-SSIDQIPAFDDYS 744
Query: 603 MRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQ 662
M RTYR+ + K +F FG GLSY+ ++Y N ST+ +
Sbjct: 745 MA--------NRTYRYSKAKPLFEFGYGLSYTTFTYS-----------NIKLSTQTLHAG 785
Query: 663 DVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGF 722
D V V+N G +AG L++ P P + L F
Sbjct: 786 D------------------PLTVEADVRNTGRVAGDEVAELYLTPPHTAVS-PQRALSAF 826
Query: 723 QSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVV 765
V L E + F L P +LS+ E G + G + L V
Sbjct: 827 TRVHLAPGELRHVTFTLDP-RTLSQVDEKGARAVTPGNYTLSV 868
>gi|320105647|ref|YP_004181237.1| glycoside hydrolase family protein [Terriglobus saanensis SP1PR4]
gi|319924168|gb|ADV81243.1| glycoside hydrolase family 3 domain protein [Terriglobus saanensis
SP1PR4]
Length = 885
Score = 323 bits (829), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 199/467 (42%), Positives = 268/467 (57%), Gaps = 51/467 (10%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
+ TL RARDLV R+TL+EK +Q++N+APAI RLG+PAY++WSE LHGVA G
Sbjct: 30 YLDPTLSPPARARDLVHRMTLEEKTAQMINTAPAIDRLGVPAYDFWSEGLHGVARSGY-- 87
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTFW 157
AT FPQ I AA++D L + IG + EARA YN G+T W
Sbjct: 88 --------ATLFPQAIGMAATWDEPLMHEIGTVVSTEARAKYNDAVQHGVHSIYFGLTIW 139
Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
+PNINIFRDPRWGRGQET GEDP +T + ++VRG+QGD N + A KHF
Sbjct: 140 SPNINIFRDPRWGRGQETYGEDPFLTARMGTAFVRGIQGDDPN------YFRTIATPKHF 193
Query: 218 TAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCAD 277
+ + +TR+ F+ V+ DL DTY P F S + +G+A IMCAYNR++G P+CA
Sbjct: 194 AVH---SGPESTRHTFNVDVSQHDLWDTYLPAFRSTIIEGKADSIMCAYNRIDGQPACAS 250
Query: 278 RNLLSKTARRQWGFHGYITSDCDAVSIIYDAEG--YAKSPEDAVVDVLKAGMDVNCGSFL 335
LL + R WGF G++TSDC A+ Y G ++K EDA +KAG D CG
Sbjct: 251 DLLLKQILRGDWGFRGFVTSDCGAIDDFYTKIGHHFSKEKEDASAAGVKAGTDTACGKTY 310
Query: 336 QKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVL 395
T +AVK + E E+D +L LF R+RLGLF+ +P P+ ++ V SPAH+ L
Sbjct: 311 LGLT-SAVKSGLITEHEMDISLERLFEARIRLGLFD-DPARMPYARLTMAEVNSPAHRAL 368
Query: 396 ALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL 455
AL+AA++ IVLLKN++ LLPL K++ A+IGPNA S L GNY + P+ +
Sbjct: 369 ALRAARESIVLLKNANNLLPLHGVKNI--AVIGPNAASLDALEGNYNAIARDPAMPVDGI 426
Query: 456 QNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVV-LMMGLDQTQ 501
+PG V A+GA +V LM+ + +TQ
Sbjct: 427 ------AAAFPGAKVV------------YAQGAPYVEGLMLPIPRTQ 455
Score = 106 bits (264), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 87/288 (30%), Positives = 128/288 (44%), Gaps = 54/288 (18%)
Query: 489 DHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLC 538
D VV +GL E EE+ DR D+ LP Q EL+ R +A KP+I+VL+
Sbjct: 620 DVVVAFVGLSPELEGEEMPIKVKGFAGGDRTDIELPQTQLELL-RAVKATGKPLIVVLMN 678
Query: 539 GGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPM 598
G + A D ++L A YPGEAGA A+AE + G +NP GRLP+T+Y
Sbjct: 679 GSAI----ALKDSETDALLEAWYPGEAGAQAIAETLAGKNNPSGRLPLTFYSNIDQLPAF 734
Query: 599 TDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKM 658
D M RTYR+++G+ ++ FG GLSY+ + Y ++S L+ +
Sbjct: 735 DDYSMA--------NRTYRYFKGQPLYAFGGGLSYTTFRYGKVSLSATHLHAGED----- 781
Query: 659 VENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQ 718
V V N G++AG +++ P + P
Sbjct: 782 ------------------------LTVEAEVTNTGKVAGDEVAQVYLTPPQTSIA-PRFA 816
Query: 719 LVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
LVG+Q V L + + F L P E LS+ G+ G + + VG
Sbjct: 817 LVGYQRVHLLPGQSKPMRFTLHPRE-LSQVDAQGVRAASAGHYEIKVG 863
>gi|218262493|ref|ZP_03476939.1| hypothetical protein PRABACTJOHN_02617 [Parabacteroides johnsonii
DSM 18315]
gi|218223341|gb|EEC95991.1| hypothetical protein PRABACTJOHN_02617 [Parabacteroides johnsonii
DSM 18315]
Length = 868
Score = 323 bits (829), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 180/449 (40%), Positives = 250/449 (55%), Gaps = 41/449 (9%)
Query: 41 ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
E +PF LPI +R DL+ RLT +EK+ Q++N+ PAI RLGIP Y+WW+EALHGVA
Sbjct: 24 EDYPFRNPDLPIDERIDDLLKRLTAEEKVGQMMNTTPAIERLGIPQYDWWNEALHGVARA 83
Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IG 153
GK AT FPQ I AA+FD Y + EARA Y+ Q G
Sbjct: 84 GK----------ATVFPQAIAMAATFDDDALYETFTMVSDEARAKYHQYQKDKEYDRYKG 133
Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
+TFW PNINIFRDPRWGRG ET GEDP +T + V+ V+G+QGD + AC
Sbjct: 134 LTFWTPNINIFRDPRWGRGMETYGEDPYLTERMGVAVVKGLQGDD------PKYFKTHAC 187
Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
KH+ + W R++FD VT +DL TY P FE+ VK+G +MCAYNR G P
Sbjct: 188 AKHYAVHSGPEW---NRHEFDVTVTPRDLWQTYLPAFEALVKEGNVQEVMCAYNRYQGKP 244
Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--------EDAVVDVLKA 325
C+ LL R WG+ I SDC A++ ++ + ++P E A D +
Sbjct: 245 CCSSDKLLIDILRNSWGYENIILSDCGAINDFWERD--ERTPRHETHPDAESASADAVLN 302
Query: 326 GMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD 385
G D+ CG+ + K A+K K+ E+++D +L L R LG+F+ + + P+ +I +
Sbjct: 303 GTDLECGNSYRALVK-ALKDGKISENDLDVSLRRLLKGRFELGMFDPDEQV-PYAQIPYN 360
Query: 386 VVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPS 445
VV SP H AL+ A +VLLKN + LPL K+ +A++GPNA + L NY G
Sbjct: 361 VVESPEHVAQALEMAHKSMVLLKNKNNTLPLSKTIR-KIAVVGPNAADSTMLWANYNGFP 419
Query: 446 CRSITPLQALQNYVENT--VYYPGCDTVA 472
++T L+ ++N V +T +Y GC+ A
Sbjct: 420 THTVTILEGIRNKVPDTEVIYELGCNHAA 448
Score = 112 bits (279), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 83/291 (28%), Positives = 133/291 (45%), Gaps = 51/291 (17%)
Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
K AD +V + G+ E EE+ DR ++ LP QQE++ + +A KPV+ V
Sbjct: 603 KDADVIVYVGGISPRLEGEEMPVNVEGFKKGDRTNIELPKVQQEMV-KALKATGKPVVYV 661
Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK 595
L G + + + + NI +IL A Y G+ A+A+++FGD+NP GRLP+T+Y
Sbjct: 662 LCTGSALALNWE--EANIDAILNAWYGGQEAGTAVADILFGDYNPSGRLPVTFYKSIDQL 719
Query: 596 VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSS 655
D M+ GRTYR+ ++PFG GLSY+ ++Y+ +S K+ +QS +
Sbjct: 720 PDFEDYSMK--------GRTYRYMTETPLYPFGYGLSYTNFAYRNAKLSSGKIAKDQSVT 771
Query: 656 TKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRP 715
+T + N G+M G +++K G P
Sbjct: 772 -----------------------------LTFDIANTGKMDGDEVAQIYIKNPNDPEG-P 801
Query: 716 IKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
IK L F V + A + E+ EL+P S M + G + ++ G
Sbjct: 802 IKALKAFLRVHVKAGDSQEVNIELAPETFHSFNDNTQTMEVRPGKYQILYG 852
>gi|154493680|ref|ZP_02033000.1| hypothetical protein PARMER_03021 [Parabacteroides merdae ATCC
43184]
gi|423723902|ref|ZP_17698051.1| hypothetical protein HMPREF1078_02038 [Parabacteroides merdae
CL09T00C40]
gi|154086890|gb|EDN85935.1| glycosyl hydrolase family 3 C-terminal domain protein
[Parabacteroides merdae ATCC 43184]
gi|409240709|gb|EKN33484.1| hypothetical protein HMPREF1078_02038 [Parabacteroides merdae
CL09T00C40]
Length = 868
Score = 323 bits (828), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 181/449 (40%), Positives = 250/449 (55%), Gaps = 41/449 (9%)
Query: 41 ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
E +PF LPI +R DL+ RLT +EKI Q++N+ PAI RLGIP Y+WW+EALHGVA
Sbjct: 24 EDYPFRNPDLPIDERIDDLLKRLTAEEKIGQMMNTTPAIERLGIPEYDWWNEALHGVARA 83
Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IG 153
GK AT FPQ I AA+FD Y + EARA Y+ Q G
Sbjct: 84 GK----------ATVFPQAIAMAATFDDDALYETFTMVSDEARAKYHQYQKNKEYDRYKG 133
Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
+TFW PNINIFRDPRWGRG ET GEDP +T + V+ V+G+QGD + AC
Sbjct: 134 LTFWTPNINIFRDPRWGRGMETYGEDPYLTERMGVAVVKGLQGDD------PKYFKTHAC 187
Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
KH+ + W R++FD VT +DL TY P FE+ VK+G +MCAYNR G P
Sbjct: 188 AKHYAVHSGPEW---NRHEFDVTVTPRDLWQTYLPAFEALVKKGNVQEVMCAYNRYQGKP 244
Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--------EDAVVDVLKA 325
C+ LL R WG+ I SDC A++ + + ++P E A D +
Sbjct: 245 CCSSDKLLIDILRNSWGYENIILSDCGAINDFWQRD--ERTPRHETHPDAESASADAVLN 302
Query: 326 GMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD 385
G D+ CG+ + K A+K+ K+ E+++D +L L R LG+F+ + + P+ +I +
Sbjct: 303 GTDLECGNSYKALIK-ALKEGKISENDLDVSLRRLLKGRFELGMFDPDERV-PYAQIPYN 360
Query: 386 VVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPS 445
VV SP H AL+ A +VLLKN + LPL K+ +A++GPNA + L NY G
Sbjct: 361 VVESPEHVAQALEMAHKSMVLLKNKNNTLPLSKTIR-KIAVVGPNAADSTMLWANYNGFP 419
Query: 446 CRSITPLQALQNYVENT--VYYPGCDTVA 472
++T L+ ++N V +T +Y GC+ A
Sbjct: 420 THTVTILEGIRNKVPDTEVIYELGCNHAA 448
Score = 113 bits (282), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 83/291 (28%), Positives = 133/291 (45%), Gaps = 51/291 (17%)
Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
K AD +V + G+ E EE+ DR ++ +P QQE++ + +A KPV+ V
Sbjct: 603 KDADVIVYVGGISPRLEGEEMPVNVEGFKKGDRTNIEIPKVQQEMV-KALKATGKPVVYV 661
Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK 595
L G + + + D NI +IL A Y G+ A+A+++FGD+NP GRLP+T+Y
Sbjct: 662 LCTGSALALNWE--DANIDAILNAWYGGQEAGTAVADILFGDYNPSGRLPVTFYKSIDQL 719
Query: 596 VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSS 655
D M+ GRTYR+ ++PFG GLSY+ ++Y+ +S K+ +QS +
Sbjct: 720 PDFEDYSMK--------GRTYRYMTETPLYPFGYGLSYTNFAYRNAKLSSGKITKDQSVT 771
Query: 656 TKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRP 715
+T + N G+M G +++K G P
Sbjct: 772 -----------------------------LTFDIANTGKMDGDEVAQIYIKNPNDPEG-P 801
Query: 716 IKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
IK L F V + A + E+ EL+P S M + G + ++ G
Sbjct: 802 IKALKAFLRVHVKAGDSQEVNIELTPEAFHSFNDNTQTMEVRPGKYQILYG 852
>gi|371777036|ref|ZP_09483358.1| glycoside hydrolase [Anaerophaga sp. HS1]
Length = 890
Score = 323 bits (827), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 190/438 (43%), Positives = 249/438 (56%), Gaps = 40/438 (9%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
+ TLP +RA DLVS++TL+EK+SQ+ ++APAI RLGIP Y WW+E LHGV G
Sbjct: 40 YLDPTLPFEERAADLVSKMTLEEKVSQMQHAAPAIERLGIPEYNWWNECLHGVGRAGI-- 97
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN-------AGQAIGMTFW 157
AT FPQ I AA +D YRI A+ EARA ++ G G+TFW
Sbjct: 98 --------ATVFPQAIGMAAMWDDEEMYRIATAVSDEARAKHHDFARRGKRGIYQGLTFW 149
Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
PNINIFRDPRWGRG ET GEDP +TG+ AV Y++G+QGD L+ A KHF
Sbjct: 150 TPNINIFRDPRWGRGMETYGEDPFLTGELAVDYIKGLQGDD------DRYLKLVATSKHF 203
Query: 218 TAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCAD 277
+ + R+ FDAR + +D TY P F+ +++ +MCAYNR NG+P C
Sbjct: 204 LVH---SGPEPDRHHFDARTSARDSLMTYTPHFKKTIQEAGVYSVMCAYNRYNGLPCCGS 260
Query: 278 R---NLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDVNCG 332
+ NLL R +WGF GYI SDC AV+ Y + P E+A +KAG D+NCG
Sbjct: 261 KPVENLL----RNEWGFKGYIVSDCWAVADFYKKGHHEVVPTVEEAAAMAVKAGTDLNCG 316
Query: 333 SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAH 392
+ AVKQ + E EID + L R+RLG+F+ P M P+ I VV S H
Sbjct: 317 NSYPALVD-AVKQGLVSEEEIDVLVKRLMEARLRLGMFDP-PEMVPYTNIPYSVVDSKEH 374
Query: 393 QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
+ LAL AA+ +VLLKN + LPL K+ ++A+IGPNAN+ LL NY G +TPL
Sbjct: 375 RELALIAARKSMVLLKNDNNTLPLDKNVK-NVAVIGPNANNLDVLLANYNGYPSNPVTPL 433
Query: 453 QALQNYV--ENTVYYPGC 468
++ + N Y GC
Sbjct: 434 DGIRQKLPNANVQYALGC 451
Score = 131 bits (330), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 106/334 (31%), Positives = 153/334 (45%), Gaps = 54/334 (16%)
Query: 453 QALQNYVENTVYYPGCDTV-ACSSASIDK-AVDIAKGADHVVLMMGLDQTQEKEEL---- 506
+ L Y +N YP + ++ K A+ IA +D V++ MGL E EE+
Sbjct: 580 KVLIEYTQNNTEYPVMKFLWDVPGRNLKKEAIQIAAASDVVLMFMGLSPNLEGEEMPVNV 639
Query: 507 ------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAG 560
DRVD+ LP Q +L+ + K PV+LVLL G + I + N+ +IL A
Sbjct: 640 PGFSGGDRVDIKLPQIQTDLVKAIMSLGK-PVVLVLLNGSALAINWEA--ENVPAILEAW 696
Query: 561 YPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYE 620
YPG+AG A+A+V+FGD+NP GRLP+T+Y P D M GRTY++++
Sbjct: 697 YPGQAGGTAIADVLFGDYNPAGRLPVTFYKSVTQLPPFEDYSMD--------GRTYQYFK 748
Query: 621 GKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCET 680
G+ +FPFG GLSY+ + Y V +KL E G E
Sbjct: 749 GEALFPFGYGLSYTSFKYD-NLVVPDKL-----------------------EAGKEVT-- 782
Query: 681 RKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELS 740
V + V N G G V L+V + PI+ L GF + L A E + F L
Sbjct: 783 ----VHVDVTNTGNRDGDEVVQLYVSHPDVESA-PIRSLQGFDRIALKAGETKTVSFTLK 837
Query: 741 PCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
P + ++GL+V V G + P +I
Sbjct: 838 PEQLAVYQPQNGLVVPAGNLKLSVGGRQPNPEAI 871
>gi|423342048|ref|ZP_17319763.1| hypothetical protein HMPREF1077_01193 [Parabacteroides johnsonii
CL02T12C29]
gi|409219455|gb|EKN12417.1| hypothetical protein HMPREF1077_01193 [Parabacteroides johnsonii
CL02T12C29]
Length = 868
Score = 323 bits (827), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 180/449 (40%), Positives = 250/449 (55%), Gaps = 41/449 (9%)
Query: 41 ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
E +PF LPI +R DL+ RLT +EK+ Q++N+ PAI RLGIP Y+WW+EALHGVA
Sbjct: 24 EDYPFRNPDLPIDERIDDLLKRLTAEEKVGQMMNTTPAIERLGIPQYDWWNEALHGVARA 83
Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IG 153
GK AT FPQ I AA+FD Y + EARA Y+ Q G
Sbjct: 84 GK----------ATVFPQAIAMAATFDDDALYETFTMVSDEARAKYHQYQKDKEYDRYKG 133
Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
+TFW PNINIFRDPRWGRG ET GEDP +T + V+ V+G+QGD + AC
Sbjct: 134 LTFWTPNINIFRDPRWGRGMETYGEDPYLTERMGVAVVKGLQGDD------PKYFKTHAC 187
Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
KH+ + W R++FD VT +DL TY P FE+ VK+G +MCAYNR G P
Sbjct: 188 AKHYAVHSGPEW---NRHEFDVTVTPRDLWQTYLPAFEALVKEGNVQEVMCAYNRYQGKP 244
Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--------EDAVVDVLKA 325
C+ LL R WG+ I SDC A++ ++ + ++P E A D +
Sbjct: 245 CCSSDKLLIDILRNSWGYENIILSDCGAINDFWERD--ERTPRHETHPDAESASADAVLN 302
Query: 326 GMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD 385
G D+ CG+ + K A+K K+ E+++D +L L R LG+F+ + + P+ +I +
Sbjct: 303 GTDLECGNSYRALVK-ALKDGKISENDLDVSLRRLLKGRFELGMFDPDERV-PYAQIPYN 360
Query: 386 VVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPS 445
VV SP H AL+ A +VLLKN + LPL K+ +A++GPNA + L NY G
Sbjct: 361 VVESPEHVAQALEMAHKSMVLLKNKNNTLPLSKTIR-KIAVVGPNAADSTMLWANYNGFP 419
Query: 446 CRSITPLQALQNYVENT--VYYPGCDTVA 472
++T L+ ++N V +T +Y GC+ A
Sbjct: 420 THTVTILEGIRNKVPDTEVIYELGCNHAA 448
Score = 112 bits (279), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 83/291 (28%), Positives = 133/291 (45%), Gaps = 51/291 (17%)
Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
K AD +V + G+ E EE+ DR ++ LP QQE++ + +A KPV+ V
Sbjct: 603 KDADVIVYVGGISPRLEGEEMPVNVEGFKKGDRTNIELPKVQQEMV-KALKATGKPVVYV 661
Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK 595
L G + + + + NI +IL A Y G+ A+A+++FGD+NP GRLP+T+Y
Sbjct: 662 LCTGSALALNWE--EANIDAILNAWYGGQEAGTAVADILFGDYNPSGRLPVTFYKSIDQL 719
Query: 596 VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSS 655
D M+ GRTYR+ ++PFG GLSY+ ++Y+ +S K+ +QS +
Sbjct: 720 PDFEDYSMK--------GRTYRYMTETPLYPFGYGLSYTNFAYRNAKLSSGKIAKDQSVT 771
Query: 656 TKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRP 715
+T + N G+M G +++K G P
Sbjct: 772 -----------------------------LTFDIANTGKMDGDEIAQIYIKNPNDPEG-P 801
Query: 716 IKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
IK L F V + A + E+ EL+P S M + G + ++ G
Sbjct: 802 IKALKAFLRVHVKAGDSQEVNIELAPETFHSFNDNTQTMEVRPGKYQILYG 852
>gi|325929067|ref|ZP_08190221.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas perforans
91-118]
gi|325540562|gb|EGD12150.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas perforans
91-118]
Length = 850
Score = 323 bits (827), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 185/423 (43%), Positives = 251/423 (59%), Gaps = 32/423 (7%)
Query: 63 LTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILT 122
+TL+EK +Q+ N+APAIPRLG+PAY+WW+EALHGVA G GAT FPQ I
Sbjct: 1 MTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG----------GATVFPQAIGM 50
Query: 123 AASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTFWAPNINIFRDPRWGRGQET 175
AA+FD L + + AI EARA ++ + G+TFW+PNINIFRDPRWGRGQET
Sbjct: 51 AATFDLPLMHEVATAISDEARAKHHQFLRQNQHARYQGLTFWSPNINIFRDPRWGRGQET 110
Query: 176 PGEDPLVTGKYAVSYVRGVQGDTFNG-----GKLKGKLQASACCKHFTAYDLDNWKGTTR 230
GEDP +T + V++V+G+QG+ + G+ KL A+A KHF + + R
Sbjct: 111 YGEDPFLTARMGVTFVQGLQGEGADAPKNAQGEPYRKLDATA--KHFAVH---SGPEADR 165
Query: 231 YKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWG 290
+ FDAR + +DL +TY P FE+ VK G+ +M AYNRV G + A + LL R+QWG
Sbjct: 166 HHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGESASASKFLLQDVLRQQWG 225
Query: 291 FHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPE 350
F GY+ SDC A+ I+ + E A +K G ++ CG AAV+Q + E
Sbjct: 226 FKGYVVSDCWAIVDIWKHHKIVATREQAAALAVKHGTELECGEEYST-LPAAVRQGLIDE 284
Query: 351 SEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNS 410
++ID AL L + RMRLG+F+ P P+ I A V SPAH LA + A++ +VLLKN
Sbjct: 285 AQIDTALTTLMTARMRLGMFD-PPGQLPWSTIPASVNQSPAHDALARRTARESLVLLKND 343
Query: 411 HGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENT--VYYPGC 468
GLLPL ++K +A+IGP A+ LLGNY G +T LQ ++ N +Y G
Sbjct: 344 -GLLPLSRAKLKRIAVIGPTADDTMALLGNYYGTPAAPVTVLQGIRAAAPNAQVLYARGA 402
Query: 469 DTV 471
D V
Sbjct: 403 DLV 405
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 91/278 (32%), Positives = 138/278 (49%), Gaps = 50/278 (17%)
Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
+ +A+D+A+ AD VV + GL E EE+ DR DL LP Q++L+ + +A
Sbjct: 574 LQEALDVARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEAL-QA 632
Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMT 587
KPV+ VL G + I +A+ +++ +IL A YPG+ G A+A+ +FGD NPGGRLP+T
Sbjct: 633 TGKPVVAVLTTGSALAIDWAQ--QHLPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVT 690
Query: 588 WYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNK 647
+Y + D MR GRTYR++ G ++PFG GLSY++++Y +
Sbjct: 691 FYKESETLPAFDDYAMR--------GRTYRYFGGTPLYPFGHGLSYTQFAY-------SG 735
Query: 648 LYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKP 707
L L++ T T+ VKN G+ AG V L++ P
Sbjct: 736 LRLDR----------------------TTIAADGSLTATVTVKNTGQRAGDEVVQLYLHP 773
Query: 708 ARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESL 745
R K+L GFQ + L E+ + F L +L
Sbjct: 774 LTPQRERAGKELHGFQRIALQPGEQRALHFTLDAKNAL 811
>gi|146301622|ref|YP_001196213.1| glycoside hydrolase family 3 protein [Flavobacterium johnsoniae
UW101]
gi|146156040|gb|ABQ06894.1| Candidate beta-xylosidase; Glycoside hydrolase family 3
[Flavobacterium johnsoniae UW101]
Length = 875
Score = 323 bits (827), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 173/425 (40%), Positives = 244/425 (57%), Gaps = 28/425 (6%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
F F +L QR DLVSRLTL+EK+SQ++NS+P I RLGIPAY+WW+E LHGVA
Sbjct: 27 FQFQNPSLSFEQRVDDLVSRLTLEEKVSQMLNSSPEIARLGIPAYDWWNETLHGVAR--- 83
Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGM 154
T T +PQ I AA+FD + + LE RA+YN + +G+
Sbjct: 84 ------TPFKTTVYPQAIGMAATFDKNSLFTMADYSALEGRAIYNKAVELKRTNERYLGL 137
Query: 155 TFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACC 214
T+W PNINIFRDPRWGRGQET GEDP +T ++V+G+QGD L+A+AC
Sbjct: 138 TYWTPNINIFRDPRWGRGQETYGEDPYLTAVLGDAFVKGLQGDD------PKYLKAAACA 191
Query: 215 KHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPS 274
KH Y + + + R+ FD VT +L DTY P F + + +G+MCAYN P
Sbjct: 192 KH---YAVHSGPESLRHTFDVDVTPYELWDTYLPAFRKLITESNVAGVMCAYNAFRTQPC 248
Query: 275 CADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSF 334
CA L++ R++W F GY+TSDC A+ + E A D + G D++CG+
Sbjct: 249 CASDILMNDILRKEWKFDGYVTSDCWAIDDFFKNHKTHPDAESAAADAVFHGTDIDCGTD 308
Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
K AVK K+ E +ID ++ LF +R RLG+F+ +M + + + V+ S HQ+
Sbjct: 309 AYKALVQAVKNGKISEKQIDISVKRLFMIRFRLGMFDP-VSMVKYAQTPSSVLESKEHQL 367
Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
AL+ A+ IVLLKN +LPL K+ + ++GPNA++A ++LGNY G + T LQ
Sbjct: 368 HALKMARQSIVLLKNEKNILPLNKNLK-KIVVLGPNADNAISILGNYNGTPSKLTTVLQG 426
Query: 455 LQNYV 459
++ V
Sbjct: 427 IKEKV 431
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 80/295 (27%), Positives = 129/295 (43%), Gaps = 52/295 (17%)
Query: 482 VDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKP 531
++ K AD + G+ E EE+ DR ++ P Q +L+ + +++ KP
Sbjct: 600 IEHHKNADAFIFAGGISPQLEGEEMPVDFPGFKGGDRTSILFPEVQTKLL-KALQSSGKP 658
Query: 532 VILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQ 591
V+ ++ G + I + NI +IL Y G++ A A+VIFGD+NP GRLP+T+Y
Sbjct: 659 VVFAMMTGSAIAIPWEA--ENIPAILNIWYGGQSAGTAAADVIFGDYNPAGRLPVTFYKN 716
Query: 592 DYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLN 651
D D KM +TYR+++G ++ FG GLSY+ + Y +
Sbjct: 717 DSDLPSFVDYKM--------DNKTYRYFKGTPLYGFGYGLSYTSFKYS-----------D 757
Query: 652 QSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRG 711
+ K+ + Q V + + V N G+ G+ L++
Sbjct: 758 LKTPVKIKKGQSVS-------------------ILVKVANTGKTEGEEVAQLYLINQDTA 798
Query: 712 NGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
P+K L GF+ L E I F LSP E LS +G + EG + +G
Sbjct: 799 IKTPLKSLKGFERFNLKPGENKTITFNLSP-EDLSYVTPEGSLKQYEGKIKISIG 852
>gi|167521708|ref|XP_001745192.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776150|gb|EDQ89770.1| predicted protein [Monosiga brevicollis MX1]
Length = 614
Score = 322 bits (826), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 207/585 (35%), Positives = 304/585 (51%), Gaps = 48/585 (8%)
Query: 63 LTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILT 122
LT E + R+G+P Y+W A+HGV +GT+ TSFP +
Sbjct: 56 LTAREGGGGSPGPPGNVSRIGLPEYDWGMNAIHGVQSSCIKDD-DGTVYCPTSFPNPVNY 114
Query: 123 AASFDSYLWYRIGQAIGLEARALYNAGQA----------IGMTFWAPNINIFRDPRWGRG 172
+++ + +G+ IG+E RAL+ AG IG+ W+PNINI R P WGR
Sbjct: 115 GFTWNYSAYLELGRIIGVETRALWLAGAVEASTWSGRPHIGLDTWSPNINIARSPLWGRN 174
Query: 173 QETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYK 232
QE PGEDP + G++ +Y G+QGD LQA KH+ AY L++ G TR+
Sbjct: 175 QEVPGEDPFMNGQFGKAYTLGLQGDD------DTYLQAIVTLKHWDAYSLEDSDGATRHN 228
Query: 233 FDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFH 292
F+A V+ L DTY P F V +G+A G+MC+YN VNGIP+CA LL R W F
Sbjct: 229 FNAIVSNFSLMDTYWPAFRVAVTEGKAKGVMCSYNAVNGIPTCA-HPLLRTVLRDLWKFD 287
Query: 293 GYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGM-DVNCGSFLQKHTKAAVKQKKLPES 351
GY++SD AV I D Y S A ++ G D++ G+ K V +
Sbjct: 288 GYVSSDTGAVEDISDNHKYTPSWATAACAAIRDGQTDIDSGAVYMKSLLQGVSEGHCRME 347
Query: 352 EIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSH 411
++D AL N +R LGLF+ Q + + V + A + + + +VLL+N +
Sbjct: 348 DVDNALRNTLRLRFELGLFDPVEN-QSYWHVPLAAVNTNASRATNMLHTLESMVLLQNKN 406
Query: 412 GLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS------ITPLQALQNYV--ENTV 463
+LPL + V ALIGP+A + + ++GNY G C ++P AL + + +
Sbjct: 407 NVLPLASNTKV--ALIGPHAKAQEDMVGNYLGQLCPDNNFDCVVSPHDALVSILGTDAVT 464
Query: 464 YYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITR 523
Y PG + CS + ID+AV +A AD VLM+G+D++ E E DR + LP Q +L +
Sbjct: 465 YAPGTNVTTCSQSHIDEAVSVATAADVAVLMLGIDESIEAESNDRKSIDLPECQHQLASA 524
Query: 524 VAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGR 583
+ A KP ++VLL GG + I K + +I+ AGYPG G A+A+ + G + G
Sbjct: 525 IF-AVGKPTVIVLLNGGMLAIENEK--QQADAIIEAGYPGFYGGTAIAQTLTGQNEHLG- 580
Query: 584 LPMTWYPQDYIK-VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPF 627
DYI + M+DM+M TSG PGRTYR+Y+ + ++ F
Sbjct: 581 --------DYINWINMSDMEM----TSG-PGRTYRYYKNETLWAF 612
>gi|315500297|ref|YP_004089100.1| glycoside hydrolase family 3 domain protein [Asticcacaulis
excentricus CB 48]
gi|315418309|gb|ADU14949.1| glycoside hydrolase family 3 domain protein [Asticcacaulis
excentricus CB 48]
Length = 882
Score = 322 bits (826), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 183/437 (41%), Positives = 256/437 (58%), Gaps = 35/437 (8%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
+ + P RA DLVSR+TL+EK +QL+N APAIPRL + Y WW+E LHGVA G
Sbjct: 35 YQDASKPPEARAADLVSRMTLEEKTAQLINDAPAIPRLNVREYNWWNEGLHGVAAAGY-- 92
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY--------NAGQAIGMTF 156
AT FPQ + AA++D L +R+ + I +E RA Y + G+T
Sbjct: 93 --------ATVFPQAVGLAATWDEPLIHRVAETISVEFRAKYLKERHRFGGSDWFGGLTV 144
Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKH 216
W+PNINIFRDPRWGRGQET GEDP +T + V++VRG+QGD + A KH
Sbjct: 145 WSPNINIFRDPRWGRGQETYGEDPYLTARMGVAFVRGLQGDD------PVYYRTVATPKH 198
Query: 217 FTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
Y + + R++ + + DLADTY P F + + +G+A IMCAYN +NG P+CA
Sbjct: 199 ---YAVHSGPEAGRHRDNVNPSPYDLADTYLPAFRATITEGQAGSIMCAYNAINGQPACA 255
Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIY--DAEGYAKSPEDAVVDVLKAGMDVNCGSF 334
+ +LL K R+ WGF GY+ SDCDAV IY + Y +PE+ V + G D+ CG+
Sbjct: 256 NEDLLVKYLRKDWGFKGYVVSDCDAVGDIYYKTSHAYRPTPEEGVTAAYQVGTDLICGNA 315
Query: 335 LQ-KHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQ 393
+ H AV+Q LPE +D AL LF+ R +LG F +P + F KI A+ +PA++
Sbjct: 316 NEADHLTRAVRQGLLPEKTLDTALIRLFTARFKLGQF--DPPAKVFPKITAEDYDTPANR 373
Query: 394 VLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQ 453
+ + A+ +VLLKN + LLPL K + +A+IGPNA+S +L+GNY G +T L
Sbjct: 374 DFSQKVAESAMVLLKNENNLLPL-KGEPRQIAVIGPNADSMDSLVGNYNGDPSHPVTVLS 432
Query: 454 ALQ-NYVENTVYY-PGC 468
++ + + TV Y PG
Sbjct: 433 GIRARFPKATVTYAPGS 449
Score = 122 bits (305), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 96/304 (31%), Positives = 144/304 (47%), Gaps = 54/304 (17%)
Query: 474 SSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITR 523
S AV AK AD VV + GL Q E EE+ DR L LP QQ+++ +
Sbjct: 592 SDTGAQSAVAAAKEADLVVFVAGLSQRVEGEEMRVETEGFSGGDRTTLNLPPAQQKVLEQ 651
Query: 524 VAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGR 583
V+ AA KPV+LVL+ G + I +A D+N+ +I+ A YPG G A+A +I GD++P GR
Sbjct: 652 VS-AAGKPVVLVLINGSALGINWA--DKNVPAIIEAWYPGGQGGAAVARLIAGDYSPAGR 708
Query: 584 LPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAV 643
LP+T+Y D M+ GRTYR+++G+ ++PFG GLS++ + Y +
Sbjct: 709 LPVTFYRSADQLPAFNDYNMK--------GRTYRYFKGEALYPFGYGLSFTTFRYAPLTL 760
Query: 644 SQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLL 703
S ++ + S V+ V N G V L
Sbjct: 761 SARQVAGDGQVS-----------------------------VSADVTNSGSRDSDEVVQL 791
Query: 704 FVK-PARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHF 762
+V P ++ PI+ L F+ + L A E + F L P ++LS DG ++ G
Sbjct: 792 YVSYPGQKLA--PIRALARFERIHLKAGETKTVRFTLDP-QALSTVNADGSRSVKPGKVE 848
Query: 763 LVVG 766
L +G
Sbjct: 849 LWLG 852
>gi|329850151|ref|ZP_08264997.1| beta-xylosidase B [Asticcacaulis biprosthecum C19]
gi|328842062|gb|EGF91632.1| beta-xylosidase B [Asticcacaulis biprosthecum C19]
Length = 877
Score = 322 bits (826), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 186/441 (42%), Positives = 252/441 (57%), Gaps = 38/441 (8%)
Query: 32 SCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWS 91
+ DP + T L RA DLVSR+TL+EK +QL ++APAIPRLG+P Y WW+
Sbjct: 10 ALDPVPADVAAMAYRDTALDPKARAADLVSRMTLEEKAAQLGHTAPAIPRLGVPKYNWWN 69
Query: 92 EALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY----- 146
E LHGVA G AT FPQ I AA++D + +G + E RA Y
Sbjct: 70 EGLHGVARAGV----------ATVFPQAIGMAATWDEPMMTTVGDVVSTEFRAKYVERVH 119
Query: 147 -NAGQAI--GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGK 203
+ G G+T W+PNINIFRDPRWGRGQET GEDP +T + + Y+ G+QG N K
Sbjct: 120 PDGGTDWYRGLTVWSPNINIFRDPRWGRGQETYGEDPYLTSRIGIGYIHGLQG---NDPK 176
Query: 204 LKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIM 263
+ A KHF + + R+K D + DL DTY P F + V +G+A +M
Sbjct: 177 F---FKTVATSKHFAVHSGPE---SNRHKEDVYPSKFDLEDTYLPAFRATVTEGKAYSVM 230
Query: 264 CAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIY--DAEGYAKSPEDAVVD 321
C YN V G+P CA L+ + R+ WGF G++ SDC A + I+ DA Y K+ E+ V
Sbjct: 231 CVYNAVYGVPGCASDFLMEEKLRQNWGFPGFVVSDCGAAANIFREDALHYTKTAEEGVAV 290
Query: 322 VLKAGMDVNCGSFLQKHTKA------AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPT 375
LKAGMD+ CG + K + AVK +LP + +D+AL LF R+RLG+F+ P
Sbjct: 291 GLKAGMDLICGDYRNKMSTEVQPIINAVKAGQLPIAVVDQALVRLFEGRIRLGMFD-PPA 349
Query: 376 MQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAK 435
PF I AD +PAH +AL A+ +VLLKN GLLPL K++ ++A+IGPNA+S
Sbjct: 350 SLPFAHITADDSDTPAHHAVALDMAKKSMVLLKND-GLLPL-KAEPKTIAVIGPNADSLD 407
Query: 436 TLLGNYAGPSCRSITPLQALQ 456
L+GNY G + +T L ++
Sbjct: 408 ALVGNYYGKPSKPVTVLDGIR 428
Score = 107 bits (266), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 63/169 (37%), Positives = 98/169 (57%), Gaps = 21/169 (12%)
Query: 480 KAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAK 529
+AVD+AK AD VV + GL E EE+ DR + LP QQ+L+ +V
Sbjct: 590 QAVDVAKTADFVVFVGGLSARVEGEEMKVEAEGFAGGDRTSIDLPKPQQQLLEKVI-GTG 648
Query: 530 KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWY 589
KP +LVL+ G + + +A D+++ +I+ A YPG G A+A++I GD++P GRLP+T+Y
Sbjct: 649 KPTVLVLMSGSALGVNWA--DKHVPAIIEAWYPGGEGGHAVAQLIAGDYSPAGRLPVTFY 706
Query: 590 PQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSY 638
+D M+ RTYR++ G+ ++PFG GLSY+ ++Y
Sbjct: 707 RSVDALPGFSDYTMK--------NRTYRYFNGEVLYPFGHGLSYTTFAY 747
>gi|440731995|ref|ZP_20911965.1| glucan 1,4-beta-glucosidase [Xanthomonas translucens DAR61454]
gi|440370332|gb|ELQ07251.1| glucan 1,4-beta-glucosidase [Xanthomonas translucens DAR61454]
Length = 913
Score = 322 bits (825), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 181/441 (41%), Positives = 259/441 (58%), Gaps = 32/441 (7%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
+ T QRA DLV+R+TL+EK +Q+ N+APAIPRLG+PAY+WW+EALHGVA G
Sbjct: 37 YLDTQRSFEQRAADLVARMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 93
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTFW 157
GAT FPQ I AA+FD L + + AI EARA ++ + G+TFW
Sbjct: 94 -------GATVFPQAIGMAATFDVPLMHEVSTAISDEARAKHHEALRHDQHARYQGLTFW 146
Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNG-----GKLKGKLQASA 212
+PNINIFRDPRWGRGQET GEDP +T + V++V+G+QG+ + G+ KL A+A
Sbjct: 147 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGADAPKNAQGEAYRKLDATA 206
Query: 213 CCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGI 272
KHF + + R+ FDA + +DL +TY P FE+ VK+G+ +M AYNRV G
Sbjct: 207 --KHFAVH---SGPEADRHHFDAHPSQRDLYETYLPAFEALVKEGKVDAVMGAYNRVYGE 261
Query: 273 PSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG 332
+ A + LL R +WGF GY+ SDC A+ I+ + E+A +K G ++ CG
Sbjct: 262 SASASKFLLRDVLRDRWGFDGYVVSDCWAIVDIWKNHKIVATREEAAALAVKHGTELECG 321
Query: 333 SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAH 392
+ +AV++ + E+++D+AL L RMRLG+F+ P + +I SP H
Sbjct: 322 AEYST-LPSAVRKGLISEADVDKALQKLMYSRMRLGMFD-PPEKLAWAQIPLSANQSPEH 379
Query: 393 QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
LA + A++ +VLLKN G+LPL ++K +A++GP A+ LLGNY G +T L
Sbjct: 380 DALARRTARESLVLLKND-GVLPLSRAKIKRIAVVGPTADDTMALLGNYYGTPAAPVTVL 438
Query: 453 QALQNYVENT--VYYPGCDTV 471
Q ++ + +Y G D V
Sbjct: 439 QGIREAAPDAEVLYARGADLV 459
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 100/302 (33%), Positives = 148/302 (49%), Gaps = 55/302 (18%)
Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
+ A+D A+ AD VV + GL E EE+ DR DL LP Q+ L+ +
Sbjct: 628 LQDALDAARRADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRALLEAL-HG 686
Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMT 587
KPV+ VL G + I +A+ +++ +IL A YPG+ G A+A+V+FGD NPGGRLP+T
Sbjct: 687 TGKPVVAVLTTGSALAIDWAQ--QHVPAILLAWYPGQRGGSAVADVLFGDANPGGRLPVT 744
Query: 588 WYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNK 647
+Y + D MR GRTYR++ G ++PFG GLSY++++Y +
Sbjct: 745 FYKESETLPAFDDYAMR--------GRTYRYFAGTPLYPFGHGLSYTQFAY-------SD 789
Query: 648 LYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKP 707
L L++S + + T+ VKN G+ AG V L+++P
Sbjct: 790 LRLDRS----------------------KLAADGRLHATLKVKNTGQRAGDEVVQLYLQP 827
Query: 708 ARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESL---SRAREDGLMVIEEGTHFLV 764
R K L GFQ + L E E+ F +SP L AR+ V++ G + L
Sbjct: 828 LSPQRERASKDLRGFQRIALQPGETREVRFAISPQSDLRLYDEARKG--YVVDPGDYELQ 885
Query: 765 VG 766
VG
Sbjct: 886 VG 887
>gi|423344787|ref|ZP_17322476.1| hypothetical protein HMPREF1060_00148 [Parabacteroides merdae
CL03T12C32]
gi|409224378|gb|EKN17311.1| hypothetical protein HMPREF1060_00148 [Parabacteroides merdae
CL03T12C32]
Length = 866
Score = 322 bits (825), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 180/449 (40%), Positives = 250/449 (55%), Gaps = 41/449 (9%)
Query: 41 ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
E +PF LPI +R DL+ RLT +EKI Q++N+ PAI RLGIP Y+WW+EALHGVA
Sbjct: 22 EDYPFRNPDLPIDERIDDLLKRLTAEEKIGQMMNTTPAIERLGIPEYDWWNEALHGVARA 81
Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IG 153
GK AT FPQ I AA+FD Y + EARA Y+ Q G
Sbjct: 82 GK----------ATVFPQAIAMAATFDDDALYETFTMVSDEARAKYHQYQKNKEYDRYKG 131
Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
+TFW PNINIFRDPRWGRG ET GEDP +T + ++ V+G+QGD + AC
Sbjct: 132 LTFWTPNINIFRDPRWGRGMETYGEDPYLTERMGLAVVKGLQGDD------PKYFKTHAC 185
Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
KH+ + W R++FD VT +DL TY P FE+ VK+G +MCAYNR G P
Sbjct: 186 AKHYAVHSGPEW---NRHEFDVTVTPRDLWQTYLPAFEALVKKGNVQEVMCAYNRYQGKP 242
Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--------EDAVVDVLKA 325
C+ LL R WG+ I SDC A++ + + ++P E A D +
Sbjct: 243 CCSSDKLLIDILRNSWGYENIILSDCGAINDFWQRD--ERTPRHETHPDAESASADAVLN 300
Query: 326 GMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD 385
G D+ CG+ + K A+K+ K+ E+++D +L L R LG+F+ + + P+ +I +
Sbjct: 301 GTDLECGNSYKALIK-ALKEGKISENDLDVSLRRLLKGRFELGMFDPDERV-PYAQIPYN 358
Query: 386 VVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPS 445
VV SP H AL+ A +VLLKN + LPL K+ +A++GPNA + L NY G
Sbjct: 359 VVESPEHVAQALEMAHKSMVLLKNKNNTLPLSKTIR-KIAVVGPNAADSTMLWANYNGFP 417
Query: 446 CRSITPLQALQNYVENT--VYYPGCDTVA 472
++T L+ ++N V +T +Y GC+ A
Sbjct: 418 THTVTILEGIRNKVPDTEVIYELGCNHAA 446
Score = 113 bits (282), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 83/291 (28%), Positives = 133/291 (45%), Gaps = 51/291 (17%)
Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
K AD +V + G+ E EE+ DR ++ +P QQE++ + +A KPV+ V
Sbjct: 601 KDADVIVYVGGISPRLEGEEMPVNVEGFKKGDRTNIEIPKVQQEMV-KALKATGKPVVYV 659
Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK 595
L G + + + D NI +IL A Y G+ A+A+++FGD+NP GRLP+T+Y
Sbjct: 660 LCTGSALALNWE--DANIDAILNAWYGGQEAGTAVADILFGDYNPSGRLPVTFYKSIDQL 717
Query: 596 VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSS 655
D M+ GRTYR+ ++PFG GLSY+ ++Y+ +S K+ +QS +
Sbjct: 718 PDFEDYSMK--------GRTYRYMTETPLYPFGYGLSYTNFAYRNAKLSSGKITKDQSVT 769
Query: 656 TKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRP 715
+T + N G+M G +++K G P
Sbjct: 770 -----------------------------LTFDIANTGKMDGDEVAQIYIKNPNDPEG-P 799
Query: 716 IKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
IK L F V + A + E+ EL+P S M + G + ++ G
Sbjct: 800 IKALKAFLRVHVKAGDSQEVNIELTPEAFHSFNDNTQTMEVRPGKYQILYG 850
>gi|433677589|ref|ZP_20509555.1| beta-glucosidase [Xanthomonas translucens pv. translucens DSM
18974]
gi|430817300|emb|CCP39963.1| beta-glucosidase [Xanthomonas translucens pv. translucens DSM
18974]
Length = 913
Score = 321 bits (822), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 181/441 (41%), Positives = 257/441 (58%), Gaps = 32/441 (7%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
+ T QRA DLV+R+TL+EK +Q+ N+APAIPRLG+PAY+WW+EALHGVA G
Sbjct: 37 YLDTQRSFEQRAADLVARMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 93
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTFW 157
GAT FPQ I AA+FD L + + AI EARA ++ + G+TFW
Sbjct: 94 -------GATVFPQAIGMAATFDLPLMHEVSTAISDEARAKHHEALRHDQHARYQGLTFW 146
Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN-----GGKLKGKLQASA 212
+PNINIFRDPRWGRGQET GEDP +T + V++V+G+QG+ + G+ KL A+A
Sbjct: 147 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEDVDVPKNAQGEAYRKLDATA 206
Query: 213 CCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGI 272
KHF + + R+ FDA + +DL +TY P FE+ VK+G+ +M AYNRV G
Sbjct: 207 --KHFAVH---SGPEADRHHFDAHPSQRDLYETYLPAFEALVKEGKVDAVMGAYNRVYGE 261
Query: 273 PSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG 332
+ A + LL R +WGF GY+ SDC A+ I+ + E+A +K G ++ CG
Sbjct: 262 SASASKFLLRDVLRDRWGFDGYVVSDCWAIVDIWKNHKIVATREEAAALAVKHGTELECG 321
Query: 333 SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAH 392
+ AV++ + E+++D AL L RMRLG+F+ P + +I SP H
Sbjct: 322 AEYST-LPTAVRKGLISEADVDNALQKLMYSRMRLGMFD-PPEKLAWAQIPLSANQSPEH 379
Query: 393 QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
LA + A++ +VLLKN G+LPL ++K +A++GP A+ LLGNY G +T L
Sbjct: 380 DALARRTARESLVLLKND-GVLPLSRAKIKRIAVVGPTADDTMALLGNYYGTPAAPVTVL 438
Query: 453 QALQNYVENT--VYYPGCDTV 471
Q ++ + +Y G D V
Sbjct: 439 QGIREAAPDAEVLYARGADLV 459
Score = 139 bits (349), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 100/302 (33%), Positives = 148/302 (49%), Gaps = 55/302 (18%)
Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
+ A+D A+ AD VV + GL E EE+ DR DL LP Q+ L+ +
Sbjct: 628 LQDALDAARRADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRALLEAL-HG 686
Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMT 587
KPV+ VL G + I +A+ +++ +IL A YPG+ G A+A+V+FGD NPGGRLP+T
Sbjct: 687 TGKPVVAVLTTGSALAIDWAQ--QHVPAILLAWYPGQRGGSAVADVLFGDANPGGRLPVT 744
Query: 588 WYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNK 647
+Y + D MR GRTYR++ G ++PFG GLSY++++Y +
Sbjct: 745 FYKESETLPAFDDYAMR--------GRTYRYFAGTALYPFGHGLSYTQFAY-------SD 789
Query: 648 LYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKP 707
L L++S + + T+ VKN G+ AG V L+++P
Sbjct: 790 LRLDRS----------------------KLAADGRLHATLKVKNTGQRAGDEVVQLYLQP 827
Query: 708 ARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESL---SRAREDGLMVIEEGTHFLV 764
R K L GFQ + L E E+ F +SP L AR+ V++ G + L
Sbjct: 828 LSPQRERASKDLRGFQRIALQPGETREVRFAISPQSDLRLYDEARK--AYVVDPGDYELQ 885
Query: 765 VG 766
VG
Sbjct: 886 VG 887
>gi|424796589|ref|ZP_18222299.1| exported beta-glucosidase [Xanthomonas translucens pv. graminis
ART-Xtg29]
gi|422794891|gb|EKU23686.1| exported beta-glucosidase [Xanthomonas translucens pv. graminis
ART-Xtg29]
Length = 913
Score = 320 bits (820), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 183/441 (41%), Positives = 255/441 (57%), Gaps = 32/441 (7%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
+ T QRA DLVSR+TL+EK +Q+ N+APAIPRLG+PAY+WW+EALHGVA G
Sbjct: 37 YLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 93
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTFW 157
GAT FPQ I AA+FD L + + AI EARA ++ + G+TFW
Sbjct: 94 -------GATVFPQAIGMAATFDLPLMHEVSTAISDEARAKHHEALRHDQHARYQGLTFW 146
Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNG-----GKLKGKLQASA 212
+PNINIFRDPRWGRGQET GEDP +T + V++V+G+QG+ + G KL A+A
Sbjct: 147 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGADAPKNAQGDAYRKLDATA 206
Query: 213 CCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGI 272
KHF + + R+ FDA + +DL +TY P FE+ VK+G+ +M AYNRV G
Sbjct: 207 --KHFAVH---SGPEADRHHFDAHPSQRDLYETYLPAFEALVKEGKVDAVMGAYNRVYGE 261
Query: 273 PSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG 332
+ A + LL R WGF GY+ SDC A+ I+ + E A + G ++ CG
Sbjct: 262 SASASKFLLRDVLRDTWGFDGYVVSDCWAIVDIWKNHKIVATREQAAALAVNNGTELECG 321
Query: 333 SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAH 392
AAV++ + E+++D+AL L RMRLG+F+ T++ + +I SP H
Sbjct: 322 EEYST-LPAAVRKGLISEADVDKALQKLMYSRMRLGMFDPPDTLR-WAQIPLSANQSPEH 379
Query: 393 QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
LA + A++ +VLLKN G+LPL + K +A+IGP A+ LLGNY G +T L
Sbjct: 380 DALARRTARESLVLLKND-GVLPLSRGKIKRIAVIGPTADDTMALLGNYYGTPAAPVTVL 438
Query: 453 QALQNYVENT--VYYPGCDTV 471
Q ++ + +Y G D V
Sbjct: 439 QGIREAAPDAEVLYARGADLV 459
Score = 143 bits (361), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 108/336 (32%), Positives = 159/336 (47%), Gaps = 63/336 (18%)
Query: 452 LQALQNYVENTVYYPGCDTVACSSA--------SIDKAVDIAKGADHVVLMMGLDQTQEK 503
L+A ++Y YY G A + A+D A+ AD VV + GL E
Sbjct: 594 LEAGRSYALRLEYYEGERDAGIRLAWRQPGAKPPLQDALDAARRADVVVFVGGLTGDVEG 653
Query: 504 EEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNI 553
EE+ DR DL LP Q+EL+ + + KPV+ VL G + I +A+ +++
Sbjct: 654 EEMKVNYPGFAGGDRTDLRLPKPQRELLEAL-QGTGKPVVAVLTTGSALAIDWAQ--QHV 710
Query: 554 GSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPG 613
+IL A YPG+ G A+A+V+FGD NPGGRLP+T+Y + D MR G
Sbjct: 711 PAILLAWYPGQRGGSAVADVLFGDANPGGRLPVTFYKESEKLPAFDDYAMR--------G 762
Query: 614 RTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPEL 673
RTYR++ G ++PFG GLSY++++Y + ++KL + S
Sbjct: 763 RTYRYFAGTALYPFGHGLSYTQFAYSDLRLDRSKLATDGS-------------------- 802
Query: 674 GTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKA 733
T+ VKN G+ AG V L++ P R K+L GFQ + L E
Sbjct: 803 ---------LHATLKVKNTGQRAGDEVVQLYLHPLSPQRERARKELRGFQRIALQPGETR 853
Query: 734 EIVFELSPCESL---SRAREDGLMVIEEGTHFLVVG 766
E+ F +SP L AR+ V++ G + L VG
Sbjct: 854 EVSFAISPQTDLRLYDEARK--AYVVDPGDYELQVG 887
>gi|399030621|ref|ZP_10730998.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
gi|398071229|gb|EJL62496.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
Length = 876
Score = 320 bits (819), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 174/426 (40%), Positives = 242/426 (56%), Gaps = 30/426 (7%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
F F L +R DLV+RLTL+EK+SQ++NS+PAIPRL IPAY+WW+E LHGVA
Sbjct: 29 FLFQNPDLSFEKRVDDLVNRLTLEEKVSQMLNSSPAIPRLDIPAYDWWNETLHGVAR--- 85
Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGM 154
T T +PQ I AA+FD Y++ LE RA+YN + +G+
Sbjct: 86 ------TPFKVTVYPQAIAMAATFDKNSLYKMADFSALEGRAIYNKAVESGRTNERYLGL 139
Query: 155 TFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACC 214
T+W PNINIFRDPRWGRGQET GEDP +TG S+V+G+QGD L+A+AC
Sbjct: 140 TYWTPNINIFRDPRWGRGQETYGEDPYLTGVLGDSFVKGLQGDD------PKYLKAAACA 193
Query: 215 KHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPS 274
KH Y + + R+ FD VT +L DTY P F+ V + + +G+MCAYN P
Sbjct: 194 KH---YAVHSGPEPLRHTFDVDVTPYELWDTYLPAFQKLVTESKVAGVMCAYNAFRTQPC 250
Query: 275 CADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSF 334
CA L++ R QW F GY+TSDC A+ + E A D + G D++CG+
Sbjct: 251 CASDILMTDILRNQWKFEGYVTSDCWAIDDFFKNHKTHPDAESASADAVFHGTDIDCGTD 310
Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPT-MQPFGKIGADVVCSPAHQ 393
K AVK K+ E +ID ++ LF +R RLG+F +P M + + V+ + H+
Sbjct: 311 AYKALVQAVKDGKISEKQIDISVKRLFMIRFRLGMF--DPVEMVKYAQTPTSVLENDEHK 368
Query: 394 VLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQ 453
AL+ A+ IVLL+N + LPL K K + ++GPN ++A +LGNY G + T L+
Sbjct: 369 AHALKMARQSIVLLRNENKTLPLSK-KLKKIVVLGPNVDNAIAILGNYNGTPSKLTTVLE 427
Query: 454 ALQNYV 459
++ V
Sbjct: 428 GIKEKV 433
Score = 123 bits (308), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 97/329 (29%), Positives = 152/329 (46%), Gaps = 68/329 (20%)
Query: 452 LQALQNYVENTV--YY--PGCDTVACSSASIDKA-----VDIAKGADHVVLMMGLDQTQE 502
L+ +N V N V Y+ G VA + + +K V+ K AD V + G+ E
Sbjct: 563 LKTKRNTVYNLVVEYWQGEGKANVALHTGNYEKTDFNTLVNRVKDADAFVFVGGISPQLE 622
Query: 503 KEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRN 552
EE+ DR ++LP Q +L+ + + KP++ V++ G + I + N
Sbjct: 623 GEEMKVNFPGFKGGDRTSILLPKIQTDLM-KALKTTGKPIVFVMMTGSAIAIPWEA--EN 679
Query: 553 IGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNP 612
I +I A Y G+A A+A+V+FG++NP GRLP+T+Y D P D KM
Sbjct: 680 IPAIANAWYGGQAAGTAVADVLFGNYNPAGRLPVTFYKSDADLSPFVDYKM--------D 731
Query: 613 GRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPE 672
RTYR+++GK ++ FG GLSY+ + Y ++ + V+ K+VP
Sbjct: 732 NRTYRYFKGKPLYGFGYGLSYTTFKYDNLKIAPS-----------------VIKGKNVP- 773
Query: 673 LGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEK 732
+T+ V N G+++G+ V L+V P+K L GF+ + L A +
Sbjct: 774 ------------ITVKVTNTGKVSGEEVVQLYVINQNTAIKAPLKTLKGFERISLKAGKS 821
Query: 733 AEIVFELSPCESLSRAREDGLMVIEEGTH 761
I F LSP ED + EG H
Sbjct: 822 KTITFTLSP--------EDLSYITAEGNH 842
>gi|329851587|ref|ZP_08266344.1| beta-xylosidase B [Asticcacaulis biprosthecum C19]
gi|328840433|gb|EGF90005.1| beta-xylosidase B [Asticcacaulis biprosthecum C19]
Length = 883
Score = 319 bits (818), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 185/437 (42%), Positives = 259/437 (59%), Gaps = 35/437 (8%)
Query: 41 ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
E+ + TT QRA DLVSR++L+EK +QL+N APAIPRLG+ Y WW+E LHGVA
Sbjct: 33 ESPAYQDTTKTAEQRAADLVSRMSLEEKAAQLINDAPAIPRLGVREYNWWNEGLHGVAAH 92
Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-------- 152
G AT FPQ + AA+FD L +R+ I +E RA Y A +
Sbjct: 93 GY----------ATVFPQAVGMAATFDEPLIHRVADTISVEFRAKYVASRHRFGGSDWFR 142
Query: 153 GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASA 212
G+T W+PNINIFRDPRWGRGQET GEDP +T + V++V+G+QG+ + A
Sbjct: 143 GLTVWSPNINIFRDPRWGRGQETYGEDPYLTARIGVAFVKGLQGED------PVYYRTIA 196
Query: 213 CCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGI 272
KH Y + + +R++ + + DL DTY P F + + +G+A IMCAYN ++G
Sbjct: 197 TPKH---YAVHSGPEASRHRDNINPSRYDLEDTYLPAFRATIVEGKAVSIMCAYNAIDGQ 253
Query: 273 PSCADRNLLSKTARRQWGFHGYITSDCDAVSIIY--DAEGYAKSPEDAVVDVLKAGMDVN 330
P+CA+ +LL K R+ WGF G++ SDCDAV IY + Y +PE+ V +AG D+
Sbjct: 254 PACANDDLLVKHLRQDWGFKGFVVSDCDAVGDIYYKTSHHYRPTPEEGVTVAYQAGTDLI 313
Query: 331 CGSFLQ-KHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
CG+ + H +AV++ LPES +D AL LFS R +LG F +P Q F I AD +
Sbjct: 314 CGNANEADHVASAVRKGILPESLVDTALVRLFSARFKLGQF--DPPAQVFPAITADDYDT 371
Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
A++ + A+ +VLLKN GLLPL KS+ ++A+IGPNA++ +L+GNY G +
Sbjct: 372 QANRDFSQHVAESAMVLLKND-GLLPL-KSEPRTIAVIGPNADTMDSLVGNYNGDPSHPV 429
Query: 450 TPLQALQNYVEN-TVYY 465
T L ++ N TV Y
Sbjct: 430 TVLAGIKARFPNATVRY 446
Score = 128 bits (322), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 100/305 (32%), Positives = 153/305 (50%), Gaps = 56/305 (18%)
Query: 474 SSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITR 523
S +AV AK +D V+ + GL Q E EE+ DR L LP QQ+++ +
Sbjct: 593 SDTGAQEAVAAAKESDLVIFVAGLSQRVEGEEMRVETPGFSGGDRTSLDLPPVQQKVLEQ 652
Query: 524 VAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGR 583
V+ A KPV+LVL+ G + + +A D+N+ +I+ A YPG G A+A +I GD +P GR
Sbjct: 653 VS-ATGKPVVLVLINGSALSVNWA--DKNVPAIVEAWYPGGQGGAAVARLIAGDFSPAGR 709
Query: 584 LPMTWYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKA 642
LP+T+Y + ++P TD M+ GRTYR+++G+ ++PFG GLSY+K+SY
Sbjct: 710 LPVTFY-RSADQIPAFTDYTMK--------GRTYRYFKGEALYPFGYGLSYTKFSY---- 756
Query: 643 VSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVL 702
+ KL S+ K+ N +V V++ V N G G V
Sbjct: 757 -APAKL-----SAAKVAGNGEVT-------------------VSVDVTNSGARDGDEVVQ 791
Query: 703 LFVK-PARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTH 761
L++ P ++ PI+ L F + L A E + F L +LS DG ++ G
Sbjct: 792 LYLSHPGQKDT--PIRALARFDRIHLKAGETKTVTFTLD-SRALSTVNADGSRSVKPGKV 848
Query: 762 FLVVG 766
L +G
Sbjct: 849 NLWLG 853
>gi|386718620|ref|YP_006184946.1| glucan 1,4-beta-glucosidase [Stenotrophomonas maltophilia D457]
gi|384078182|emb|CCH12773.1| Glucan 1,4-beta-glucosidase [Stenotrophomonas maltophilia D457]
Length = 897
Score = 319 bits (817), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 186/439 (42%), Positives = 257/439 (58%), Gaps = 36/439 (8%)
Query: 44 PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
P+ + QRA LV+++TLDEK +Q+ N+APAI RLG+PAY+WW+E LHGVA G+
Sbjct: 37 PWLDVSASFEQRAASLVAQMTLDEKAAQMQNAAPAIERLGVPAYDWWNEGLHGVARAGQ- 95
Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTF 156
AT FPQ I AA+FD L ++ I EARA ++ G+ G+TF
Sbjct: 96 ---------ATVFPQAIGLAATFDVPLMGQVATTISDEARAKHHQFLRQGAHGRYQGLTF 146
Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKH 216
W+PN+NIFRDPRWGRGQET GEDP +T + V++VRG+QGD + KL A+A KH
Sbjct: 147 WSPNVNIFRDPRWGRGQETYGEDPYLTARMGVAFVRGLQGDD----PVYRKLDATA--KH 200
Query: 217 FTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
+ + R+ FDAR + +DL DTY P FE+ VK+G +M AYNRV G + A
Sbjct: 201 LAVH---SGPEADRHHFDARPSRRDLYDTYLPAFEALVKEGDVDAVMGAYNRVYGESASA 257
Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQ 336
R LL RR WGF GY+ SDC A+ I+ + E A ++ G ++ CG Q
Sbjct: 258 SRFLLRDVLRRDWGFKGYVVSDCWAIVDIWKHHHIVTTREAAAALAVRNGTELECG---Q 314
Query: 337 KHTK--AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
++ +AV+Q + E+EID A+ LF+ RMRLG+F+ P + +I A V +P+H
Sbjct: 315 EYATLPSAVRQGLISEAEIDDAVTRLFTARMRLGMFD-PPERVRWARIPASVNQAPSHDA 373
Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
LAL+AAQ +VLLKN G+LPL + +A++GP A+ LLGNY G +T LQ
Sbjct: 374 LALKAAQASLVLLKND-GILPLSRDIK-RIAVVGPTADDTMALLGNYFGTPAAPVTILQG 431
Query: 455 LQNYVE--NTVYYPGCDTV 471
++ + Y G D V
Sbjct: 432 IREAAKGVEVRYARGVDLV 450
Score = 127 bits (318), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 88/282 (31%), Positives = 135/282 (47%), Gaps = 50/282 (17%)
Query: 474 SSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITR 523
+ A D+A+D A+ AD VV + GL E EE+ DR DL LP Q+ L+
Sbjct: 615 AKAPFDEALDAAREADVVVFVGGLTGDVEGEEMTVNYPGFAGGDRTDLRLPAPQRTLLEA 674
Query: 524 VAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGR 583
+ A KPV++VL G + + +A+ ++ +IL + YPG+ G A+ + +FGD NP GR
Sbjct: 675 L-HATGKPVVMVLTGGSAIAVDWAQ--SHLPAILMSWYPGQRGGTAVGQALFGDVNPAGR 731
Query: 584 LPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAV 643
LP+T+Y D M GRTYR++ G ++PFG GLSY+++ Y
Sbjct: 732 LPVTFYKASEALPAFDDYAME--------GRTYRYFRGTPLYPFGHGLSYTRFDY----- 778
Query: 644 SQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLL 703
L L+ S ++ LG V + V N G +G V L
Sbjct: 779 --GTLRLDAGS------------LRADGRLG----------VAVDVTNAGTRSGDEVVQL 814
Query: 704 FVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESL 745
+V+ G+G +++L GFQ + L E + F L ++L
Sbjct: 815 YVRREHAGSGDAVQELRGFQRIHLAPGEHRTVTFTLEAAQAL 856
>gi|325916103|ref|ZP_08178390.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
ATCC 35937]
gi|325537647|gb|EGD09356.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
ATCC 35937]
Length = 896
Score = 319 bits (817), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 187/437 (42%), Positives = 251/437 (57%), Gaps = 33/437 (7%)
Query: 44 PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
P+ T LP RA DLVSR+TL+EK +Q+ N+APAIPRL +PAY+WW+EALHGVA G
Sbjct: 39 PYLDTQLPFETRAADLVSRMTLEEKAAQMQNAAPAIPRLRVPAYDWWNEALHGVARAG-- 96
Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTF 156
GAT FPQ I AA+FD+ L + AI EARA ++A A G+TF
Sbjct: 97 --------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLARDEHKRYQGLTF 148
Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKH 216
W+PNINIFRDPRWGRGQET GEDP +T + V++V+G+Q KL A KH
Sbjct: 149 WSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQQGPYRKL------DATAKH 202
Query: 217 FTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
+ + + R+ FD + +DL +TY P F++ V++G + +M AYNRVNG + A
Sbjct: 203 YAVH---SGPEADRHHFDVHPSERDLHETYLPAFQALVQEGHVAAVMGAYNRVNGESASA 259
Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQ 336
L RR WGF GYI SDC A+ I+ +PE A +K G D++CG
Sbjct: 260 S-TRLEGILRRDWGFDGYIVSDCAAIRDIWQNHKIVPTPEAAAALGVKHGTDLDCGDTYA 318
Query: 337 KHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLA 396
K AV+ + E+ ID +L L + RMRLG+F+ P + +I A V SP H LA
Sbjct: 319 ALPK-AVRAGLIDEATIDTSLKRLMTTRMRLGMFD-PPAKVAWAQIPASVNQSPQHDALA 376
Query: 397 LQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQ 456
+ A++ +VLLKN GLLPL K +A++GP A+ +LLGNY G +T LQ ++
Sbjct: 377 RRTARESLVLLKND-GLLPL-KPTLKRIAVVGPTADDPMSLLGNYYGTPAAPVTILQGIR 434
Query: 457 NYV--ENTVYYPGCDTV 471
+ VY G D V
Sbjct: 435 DAAPQAEVVYARGSDLV 451
Score = 128 bits (321), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 88/278 (31%), Positives = 140/278 (50%), Gaps = 50/278 (17%)
Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEELD----------RVDLVLPGRQQELITRVAEA 527
+ +AVD A+ A+ VV + GL E EE+D R D LP Q+EL+ + +A
Sbjct: 620 LQEAVDAARNAEVVVFVGGLTGDVEGEEMDVNYPGFAGGDRTDTRLPKPQRELL-QALQA 678
Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMT 587
PV+ VL G + + +A+ +++ +IL A YPG+ G A+ +V+FG +PGGRLP+T
Sbjct: 679 TGTPVVAVLTTGSALAVDWAQ--QHVPAILLAWYPGQRGGSAVGDVLFGQASPGGRLPIT 736
Query: 588 WYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNK 647
+Y + D MR GRTYR++ G ++PFG GLSY++++Y +
Sbjct: 737 FYKEAERLPAFDDYAMR--------GRTYRYFTGTALYPFGHGLSYTQFAY-------SD 781
Query: 648 LYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKP 707
L L++++ LG + T+ V+N G+ AG V L++ P
Sbjct: 782 LRLDRTT------------------LGADGT----LRATLKVRNTGKRAGDEVVQLYLHP 819
Query: 708 ARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESL 745
R K+L GFQ + L E+ E+ F L ++L
Sbjct: 820 LDPKRERAGKELRGFQRMTLQPGEQREVAFTLKAADAL 857
>gi|294462719|gb|ADE76904.1| unknown [Picea sitchensis]
Length = 304
Score = 319 bits (817), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 151/296 (51%), Positives = 206/296 (69%), Gaps = 5/296 (1%)
Query: 480 KAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCG 539
+AV+ AK D V++++GLDQTQEKEE DR+ L LPG+Q+ L+ +V+ AAK+PV+ V+L G
Sbjct: 4 EAVETAKKVDEVIVVVGLDQTQEKEERDRIKLTLPGQQENLVYQVSRAAKRPVVFVILSG 63
Query: 540 GPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMT 599
GPVD++FA D I SI+WAGYPG+AG ALAE+IFGD+NPGGRLPMTWYPQD++K+PMT
Sbjct: 64 GPVDVSFAVNDPQISSIIWAGYPGQAGGQALAEIIFGDYNPGGRLPMTWYPQDFVKIPMT 123
Query: 600 DMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQS-SSTKM 658
DM MRP +G PGRTYRFY GK+VF FG GLSYS YSY F + + K+ LN + +
Sbjct: 124 DMNMRPNHDTGYPGRTYRFYTGKKVFEFGQGLSYSAYSYNFSSTTIQKIDLNVTMEHFEA 183
Query: 659 VENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQ 718
+ N+ H + + C KF +I V+NH +M G+H VLL+ K G P KQ
Sbjct: 184 LGNRGKGHVR----VENTPCRKLKFRSSIFVRNHDKMDGRHAVLLYSKSPATHKGAPQKQ 239
Query: 719 LVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
L+GF+SV + K A++ F + PC+ S E+G ++ G+H L+VGD +YP+S+
Sbjct: 240 LIGFRSVHVQGKHTAKVTFVVKPCDHFSTVEENGQRLLAIGSHSLIVGDTQYPVSL 295
>gi|408824590|ref|ZP_11209480.1| Glucan 1,4-beta-glucosidase [Pseudomonas geniculata N1]
Length = 897
Score = 318 bits (815), Expect = 7e-84, Method: Compositional matrix adjust.
Identities = 186/439 (42%), Positives = 259/439 (58%), Gaps = 36/439 (8%)
Query: 44 PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
P+ + QRA LV+++TL+EK +Q+ N+APAI RLG+PAY+WW+E LHGVA G+
Sbjct: 37 PWLDVSASFEQRAAALVAQMTLEEKAAQMQNAAPAIERLGVPAYDWWNEGLHGVARAGQ- 95
Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTF 156
AT FPQ I AA+FD L ++ I EARA ++ G+ G+TF
Sbjct: 96 ---------ATVFPQAIGLAATFDVPLMGQVAATISDEARAKHHQFLREGAHGRYQGLTF 146
Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKH 216
W+PN+NIFRDPRWGRGQET GEDP +T + V++VRG+QGD + KL A+A KH
Sbjct: 147 WSPNVNIFRDPRWGRGQETYGEDPYLTARMGVAFVRGLQGDD----PVYRKLDATA--KH 200
Query: 217 FTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
+ + R+ FDAR + +DL DTY P FE+ VK+G +M AYNRV G + A
Sbjct: 201 LAVH---SGPEADRHHFDARPSRRDLYDTYLPAFEALVKEGDVDAVMGAYNRVYGESASA 257
Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQ 336
R LL RR WGF GY+ SDC A+ I+ + E A ++ G ++ CG Q
Sbjct: 258 SRFLLRDVLRRDWGFKGYVVSDCWAIVDIWKHHRIVTTREAAAALAVRNGTELECG---Q 314
Query: 337 KHTK--AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
++ +AV+Q + E+EID A+ LF+ RMRLG+F+ P + +I A V +PAH
Sbjct: 315 EYATLPSAVRQGLISEAEIDDAVTRLFTARMRLGMFD-PPERVRWARIPASVNQAPAHDA 373
Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
LAL+AAQ +VLLKN G+LPL ++ + +A++GP A+ LLGNY G +T LQ
Sbjct: 374 LALKAAQASLVLLKND-GILPLSRN-TRRIAVVGPTADDTMALLGNYFGTPAAPVTILQG 431
Query: 455 LQNYVE--NTVYYPGCDTV 471
++ + Y G D V
Sbjct: 432 IREAAKGVEVRYARGVDLV 450
Score = 114 bits (286), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 81/265 (30%), Positives = 124/265 (46%), Gaps = 50/265 (18%)
Query: 491 VVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGG 540
VV + GL E EE+ DR DL LP Q+ L+ + KPV++VL G
Sbjct: 632 VVFVGGLTGDVEGEEMTVNYPGFAGGDRTDLRLPAPQRTLLEAL-HGTGKPVVMVLTGGS 690
Query: 541 PVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTD 600
+ + +A+ ++ +IL + YPG+ G A+ + +FGD NP GRLP+T+Y D
Sbjct: 691 AIAVDWAQA--HLPAILMSWYPGQRGGTAVGQALFGDVNPSGRLPVTFYKAGEAMPAFDD 748
Query: 601 MKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVE 660
M GRTYR++ G ++PFG GLSY+++ Y L L+ S
Sbjct: 749 YAME--------GRTYRYFRGTPLYPFGHGLSYTRFDY-------GTLRLDADS------ 787
Query: 661 NQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLV 720
++ LG V + V N G +G V L+V+ G+G +++L
Sbjct: 788 ------LRADGRLG----------VAVDVANTGTRSGDEVVQLYVRREHAGSGDAVQELR 831
Query: 721 GFQSVILNAKEKAEIVFELSPCESL 745
GFQ V L E+ + F L ++L
Sbjct: 832 GFQRVQLAPGERRTVTFTLEAAQAL 856
>gi|325919363|ref|ZP_08181395.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas gardneri ATCC
19865]
gi|325550152|gb|EGD20974.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas gardneri ATCC
19865]
Length = 876
Score = 318 bits (814), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 185/437 (42%), Positives = 250/437 (57%), Gaps = 33/437 (7%)
Query: 44 PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
P+ T P RA DLV+R+TL+EK +Q+ N+APAIPRL +P Y+WW+EALHGVA G
Sbjct: 19 PYLDTQRPFDARAADLVARMTLEEKAAQMQNAAPAIPRLQVPEYDWWNEALHGVARAG-- 76
Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTF 156
GAT FPQ I AA+FD+ L + AI EARA ++A A G+TF
Sbjct: 77 --------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLARGEYKRYQGLTF 128
Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKH 216
W+PNINIFRDPRWGRGQET GEDP +T + V++V+G+Q KL A KH
Sbjct: 129 WSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQQGPYRKL------DATAKH 182
Query: 217 FTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
F + + R+ FD + +DL +TY P F++ V++G+ + +M AYNRVNG S +
Sbjct: 183 FAVH---SGPEADRHHFDVHPSERDLHETYLPAFQALVQEGKVAAVMGAYNRVNG-ESAS 238
Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQ 336
L RR WGF GYI SDC A+ I+ +PE A +K G D++CG
Sbjct: 239 ASTRLEGILRRDWGFDGYIVSDCAAIRDIWQNHKIVPTPEAAAALGVKHGTDLDCGDTYA 298
Query: 337 KHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLA 396
AAV+ + E+ ID AL L + RMRLG+F+ P P+ +I A SP H LA
Sbjct: 299 A-LPAAVRAGLIDEATIDTALKRLMTTRMRLGMFD-PPAKVPWAQIPASANQSPQHDALA 356
Query: 397 LQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQ 456
+ A++ +VLLKN G+LPL K +A+IGP A+ +LLGNY G +T LQ ++
Sbjct: 357 RRTARESLVLLKND-GVLPL-KPTLKRIAVIGPTADDPMSLLGNYYGTPAAPVTILQGIR 414
Query: 457 NYV--ENTVYYPGCDTV 471
+ +Y G D V
Sbjct: 415 DAAPQAQVIYARGSDLV 431
Score = 128 bits (322), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 88/278 (31%), Positives = 143/278 (51%), Gaps = 50/278 (17%)
Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEELD----------RVDLVLPGRQQELITRVAEA 527
+ +AVD A+ A+ VV + GL E EE+D R D LP Q+EL+ + +A
Sbjct: 600 LQEAVDAARDAEVVVFVGGLTGDVEGEEMDVNYPGFAGGDRTDTRLPKPQRELL-QALQA 658
Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMT 587
PV+ VL G + I +A+ +++ +IL A YPG+ G A+ +V+FG +PGGRLP+T
Sbjct: 659 TGTPVVAVLTTGSALAIDWAQ--QHVPAILLAWYPGQRGGSAVGDVLFGQASPGGRLPVT 716
Query: 588 WYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNK 647
+Y + D MR GRTYR+++GK ++PFG GLSY++++Y +
Sbjct: 717 FYKEAERLPAFDDYAMR--------GRTYRYFQGKPLYPFGHGLSYTQFAY-------SD 761
Query: 648 LYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKP 707
L L+++ +V GT T+ +KN G+ AG V L++ P
Sbjct: 762 LRLDRT---------------TVAADGT-------LTATVTLKNTGQRAGDEVVQLYLHP 799
Query: 708 ARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESL 745
+ R +K+L G Q + L E+ ++ F + ++L
Sbjct: 800 LKPQRERALKELHGLQRITLQPGEQRQLRFTIKAQDAL 837
>gi|395803818|ref|ZP_10483061.1| glycoside hydrolase family 3 protein [Flavobacterium sp. F52]
gi|395434089|gb|EJG00040.1| glycoside hydrolase family 3 protein [Flavobacterium sp. F52]
Length = 875
Score = 317 bits (813), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 175/433 (40%), Positives = 248/433 (57%), Gaps = 30/433 (6%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
FPF T L +R +LVS+LTL+EK++Q++N+APAIPRLGIPAY+WW+E LHGVA
Sbjct: 27 FPFQNTDLTFEERVENLVSQLTLEEKVAQMLNAAPAIPRLGIPAYDWWNETLHGVAR--- 83
Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGM 154
T T FPQ I AA+FD +++ LE RA+YN + +G+
Sbjct: 84 ------TPFKTTVFPQAIAMAATFDKNSLFKMADYSALEGRAIYNKAVELNRTKERYLGL 137
Query: 155 TFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACC 214
T+W PNINIFRDPRWGRGQET GEDP +T ++V+G+QGD L+A+AC
Sbjct: 138 TYWTPNINIFRDPRWGRGQETYGEDPYLTAVLGDAFVKGLQGDD------PKYLKAAACA 191
Query: 215 KHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPS 274
KH Y + + + R+ FD VT +L DTY P F+ V + +G+MCAYN P
Sbjct: 192 KH---YAVHSGPESLRHTFDVDVTPYELWDTYLPAFKKLVTNSKVAGVMCAYNAFRTQPC 248
Query: 275 CADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSF 334
CA L++ R QW F GY+TSDC A+ + A D + G D++CG+
Sbjct: 249 CASDILMNDILRNQWKFTGYVTSDCWAIDDFFKNHKTHPDAASASADAVLHGTDIDCGTD 308
Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
K AVK ++ E +ID ++ LF +R RLG+F+ +M + + + V+ S H+
Sbjct: 309 AYKSLVQAVKNGQITEKQIDVSVKRLFMIRFRLGMFDP-VSMVKYAQTPSSVLESEEHKE 367
Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
AL+ A+ IVLLKN LPL K K + ++GPNA+++ ++LGNY G + T LQ
Sbjct: 368 HALKMARQSIVLLKNEKNTLPLSK-KLKKIVVLGPNADNSISILGNYNGTPSKLTTVLQG 426
Query: 455 LQNYV--ENTVYY 465
++ + E V Y
Sbjct: 427 IKEKISPETEVVY 439
Score = 105 bits (262), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 81/295 (27%), Positives = 132/295 (44%), Gaps = 52/295 (17%)
Query: 482 VDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKP 531
++ K AD + G+ E EE+ DR ++LP Q L+ + +++ KP
Sbjct: 600 IERHKNADAFIFAGGISPQLEGEEMPVDAPGFNGGDRTSILLPEVQTRLL-KALQSSGKP 658
Query: 532 VILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQ 591
V+ +++ G + + + NI +IL Y G++ A A+VIFGD+NP GRLP+T+Y
Sbjct: 659 VVFLIMTGSAIAVPWEA--ENIPAILNIWYGGQSAGTASADVIFGDYNPAGRLPVTFYKG 716
Query: 592 DYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLN 651
D D KM +TYR+++G ++ FG GLSY+++ Y
Sbjct: 717 DSDLSSFVDYKMD--------NKTYRYFKGIPLYGFGYGLSYTEFKYS-----------G 757
Query: 652 QSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRG 711
+ K+ + Q V +++ V N G+M G+ L++
Sbjct: 758 LKTPDKIKKGQPVT-------------------ISVKVTNTGKMEGEEVAQLYLINPNTS 798
Query: 712 NGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
P+K L GF+ L + + F LSP E LS E G + EG + VG
Sbjct: 799 IKSPLKSLKGFERFNLKPGQSTVVNFTLSP-EDLSYVTESGNLKPYEGKIQIAVG 852
>gi|188993706|ref|YP_001905716.1| beta-glucosidase [Xanthomonas campestris pv. campestris str. B100]
gi|167735466|emb|CAP53681.1| exported beta-glucosidase [Xanthomonas campestris pv. campestris]
Length = 896
Score = 317 bits (813), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 186/445 (41%), Positives = 255/445 (57%), Gaps = 33/445 (7%)
Query: 36 SNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALH 95
++ +T P+ T P+ RA DLVSR+TL+EK +Q+ N+APAIPRL +P Y+WW+EALH
Sbjct: 31 THAATPPPPYLDPTQPLQARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPEYDWWNEALH 90
Query: 96 GVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA---- 151
GVA G GAT FPQ I AA+FD+ L + AI EARA ++A A
Sbjct: 91 GVARAG----------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLAGGEH 140
Query: 152 ---IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKL 208
G+TFW+PNINIFRDPRWGRGQET GEDP +T + V++V+G+Q KL
Sbjct: 141 KRYQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQQGPYRKL---- 196
Query: 209 QASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
A KH+ + + R+ FD + +DL +TY P F++ V++G + +M AYNR
Sbjct: 197 --DATAKHYAVH---SGPEADRHHFDVHPSERDLYETYLPAFQALVQEGHVAAVMGAYNR 251
Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
VNG S + L RR WGF GYI SDC A+ I+ +PE A +K G D
Sbjct: 252 VNG-ESASASTRLEGILRRDWGFDGYIVSDCAAIRDIWQNHKIVPTPEAAAALGVKHGTD 310
Query: 329 VNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVC 388
++CG AAV+ + E+ IDR+L L + R+RLG+F+ P P+ +I A
Sbjct: 311 LDCGDTYAA-LPAAVRAGLIDEATIDRSLTRLMAARLRLGMFD-PPAKVPWAQIPASANQ 368
Query: 389 SPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS 448
SP H LA + A++ +VLLKN GLLPL K +A++GP A+ +LLGNY G
Sbjct: 369 SPQHDALARRTARESLVLLKND-GLLPL-KPTLKRIAVVGPTADDPMSLLGNYYGTPAAP 426
Query: 449 ITPLQALQNYV--ENTVYYPGCDTV 471
+T LQ +++ VY G D V
Sbjct: 427 VTILQGIRDAAPQAEVVYARGSDLV 451
Score = 143 bits (360), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 113/361 (31%), Positives = 170/361 (47%), Gaps = 66/361 (18%)
Query: 411 HGLLPLPKSKSVSLALIGPNAN----SAKTLLGNYA-GPSCRSITP---LQALQNYVENT 462
HG L P S + L + + K L+ ++ P RS T LQA + Y
Sbjct: 537 HGQLLPPVSGNYELQIAADDGVRLYLDGKPLIDQWSDAPRMRSSTATVALQAGKAYDLRV 596
Query: 463 VYYPGCDTVACSSA--------SIDKAVDIAKGADHVVLMMGLDQTQEKEELD------- 507
YY A + +AVD A+ AD VV + GL E EE+D
Sbjct: 597 EYYEATRDAGVRLAWRMPGAKPPLQEAVDAARNADVVVFVGGLTGDVEGEEMDVNYPGFA 656
Query: 508 ---RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGE 564
R D LP Q+EL+ + +A PV+ VL G + I +A+ +++ +IL A YPG+
Sbjct: 657 GGDRTDTRLPKPQRELL-QALQATGTPVVAVLTTGSALAIDWAQ--QHVPAILLAWYPGQ 713
Query: 565 AGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEV 624
G A+ +V+FG +PGGRLP+T+Y +D D MR GRTYR+++GK +
Sbjct: 714 RGGTAVGDVLFGQASPGGRLPITFYKEDERLPAFDDYAMR--------GRTYRYFDGKPL 765
Query: 625 FPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFL 684
+PFG GL+Y++++Y + L L+++ +V GT
Sbjct: 766 YPFGHGLAYTQFAY-------SNLRLDRT---------------TVAADGT-------LR 796
Query: 685 VTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCES 744
T+ VKN G+ AG V L++ P R K+L GFQ + L E E+ F ++P E+
Sbjct: 797 ATVSVKNTGQRAGDEVVQLYLHPLNPQRERARKELRGFQRITLQPGEHREVSFNITPREA 856
Query: 745 L 745
L
Sbjct: 857 L 857
>gi|255013451|ref|ZP_05285577.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. 2_1_7]
gi|410103695|ref|ZP_11298616.1| hypothetical protein HMPREF0999_02388 [Parabacteroides sp. D25]
gi|409236424|gb|EKN29231.1| hypothetical protein HMPREF0999_02388 [Parabacteroides sp. D25]
Length = 868
Score = 316 bits (810), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 179/447 (40%), Positives = 244/447 (54%), Gaps = 37/447 (8%)
Query: 41 ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
+ +PF LP+ +R DL+SRLT +EKI Q++N PAI RLGIP Y+WW+EALHGVA
Sbjct: 24 QDYPFRNPDLPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVARA 83
Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IG 153
G+ AT FPQ I AA+FD + + EARA Y+ Q G
Sbjct: 84 GR----------ATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQYQKDKEYDRYKG 133
Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
+TFW PNINIFRDPRWGRG ET GEDP +T K V+ RG+QGD N + AC
Sbjct: 134 LTFWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQGDDPN------YYKTHAC 187
Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
KH+ + W R++FD T +DL +TY P FE+ VK+G +MCAYNR G P
Sbjct: 188 AKHYAVHSGPEW---NRHEFDVEATPRDLYETYLPAFEALVKEGDVQEVMCAYNRFEGKP 244
Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEG----YAKSP--EDAVVDVLKAGM 327
C+ LL R WG+ I SDC A+ + + + P E A D + G
Sbjct: 245 CCSSDKLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETHPDAESASADAVLNGT 304
Query: 328 DVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVV 387
D+ CG + K A+ K+ E ++D +L L R LG+F+ + + P+ KI VV
Sbjct: 305 DLECGGSYRALNK-ALADGKISEKDLDVSLRRLLKGRFELGMFDPDERV-PYSKIPYSVV 362
Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
SP H AL A+ IVLLKN + +LPL K+ +A++GPNA + L NY G +
Sbjct: 363 ESPEHIAKALDMARKSIVLLKNKNNMLPLDKNIK-KIAVVGPNAADSTMLWANYNGFPTK 421
Query: 448 SITPLQALQNYVEN--TVYYPGCDTVA 472
++T ++ ++N V N +Y GC+ A
Sbjct: 422 TVTIVEGIRNKVPNAEVIYELGCNHTA 448
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 76/291 (26%), Positives = 133/291 (45%), Gaps = 51/291 (17%)
Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
K AD +V + G+ E EE+ DR ++ +P Q+E++ + A KPV+ V
Sbjct: 603 KDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEMVKALV-ATGKPVVYV 661
Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK 595
+ G + + + + ++ +IL A Y G+ G A+A+V+FGD+NP GRLP+T+Y
Sbjct: 662 VCTGSALALNWE--NDHVNAILNAWYGGQEGGTAVADVLFGDYNPAGRLPITFYKSVDQL 719
Query: 596 VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSS 655
D M+ GRTYR+ ++PFG GLSY+ + YK +S++K+ N+S +
Sbjct: 720 PDFQDYSMK--------GRTYRYMTQTPLYPFGYGLSYTTFDYKNAKLSKDKIASNESVT 771
Query: 656 TKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRP 715
++ + N G+M G +++K G P
Sbjct: 772 -----------------------------LSFDIANTGKMDGDEVAQIYIKNPNDPAG-P 801
Query: 716 IKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
+K + F+ V + A + + +L P S M + G + ++ G
Sbjct: 802 LKAMKAFKRVNVKAGSEQPVSIQLEPKAFQSFNDNTQTMEVRPGKYQILYG 852
>gi|150007848|ref|YP_001302591.1| glycoside hydrolase family protein [Parabacteroides distasonis ATCC
8503]
gi|301310124|ref|ZP_07216063.1| beta-glucosidase [Bacteroides sp. 20_3]
gi|423336365|ref|ZP_17314112.1| hypothetical protein HMPREF1059_00064 [Parabacteroides distasonis
CL09T03C24]
gi|149936272|gb|ABR42969.1| glycoside hydrolase family 3, candidate beta-glycosidase
[Parabacteroides distasonis ATCC 8503]
gi|300831698|gb|EFK62329.1| beta-glucosidase [Bacteroides sp. 20_3]
gi|409240840|gb|EKN33614.1| hypothetical protein HMPREF1059_00064 [Parabacteroides distasonis
CL09T03C24]
Length = 868
Score = 316 bits (810), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 179/447 (40%), Positives = 245/447 (54%), Gaps = 37/447 (8%)
Query: 41 ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
+ +PF LP+ +R DL+SRLT +EKI Q++N PAI RLGIP Y+WW+EALHGVA
Sbjct: 24 QDYPFRNPDLPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVARA 83
Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IG 153
G+ AT FPQ I AA+FD + + EARA Y+ Q G
Sbjct: 84 GR----------ATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQYQKDKEYDRYKG 133
Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
+TFW PNINIFRDPRWGRG ET GEDP +T K V+ RG+QGD N + AC
Sbjct: 134 LTFWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQGDDPN------YYKTHAC 187
Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
KH+ + W R++F+A T +DL +TY P FE+ VK+G +MCAYNR G P
Sbjct: 188 AKHYAVHSGPEW---NRHEFNAEATPRDLYETYLPAFEALVKEGDVQEVMCAYNRFEGKP 244
Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEG----YAKSP--EDAVVDVLKAGM 327
C+ LL R WG+ I SDC A+ + + + P E A D + G
Sbjct: 245 CCSSDKLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETHPDAESASADAVLNGT 304
Query: 328 DVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVV 387
D+ CG + K A+ K+ E ++D +L L R LG+F+ + + P+ KI VV
Sbjct: 305 DLECGGSYRALNK-ALADGKISEKDLDVSLRRLLKGRFELGMFDPDERV-PYSKIPYSVV 362
Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
SP H AL A+ IVLLKN + +LPL K+ +A++GPNA + L NY G +
Sbjct: 363 ESPEHIAKALDMARKSIVLLKNKNNMLPLDKNIK-KIAVVGPNAADSTMLWANYNGFPTK 421
Query: 448 SITPLQALQNYVEN--TVYYPGCDTVA 472
++T ++ ++N V N +Y GC+ A
Sbjct: 422 TVTIVEGIRNKVPNAEVIYELGCNHTA 448
Score = 105 bits (262), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 76/291 (26%), Positives = 133/291 (45%), Gaps = 51/291 (17%)
Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
K AD +V + G+ E EE+ DR ++ +P Q+E++ + A KPV+ V
Sbjct: 603 KDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEMVKALV-ATGKPVVYV 661
Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK 595
+ G + + + + ++ +IL A Y G+ G A+A+V+FGD+NP GRLP+T+Y
Sbjct: 662 VCTGSALALNWE--NDHVNAILNAWYGGQEGGTAVADVLFGDYNPAGRLPITFYKSVDQL 719
Query: 596 VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSS 655
D M+ GRTYR+ ++PFG GLSY+ + YK +S++K+ N+S +
Sbjct: 720 PDFQDYSMK--------GRTYRYMTQTPLYPFGYGLSYTTFDYKNAKLSKDKIASNESVT 771
Query: 656 TKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRP 715
++ + N G+M G +++K G P
Sbjct: 772 -----------------------------LSFDIANTGKMDGDEVAQIYIKNPNDPAG-P 801
Query: 716 IKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
+K + F+ V + A + + +L P S M + G + ++ G
Sbjct: 802 LKAMKAFKRVNVKAGSEQPVSIQLEPKAFQSFNDNTQTMEVRPGKYQILYG 852
>gi|298376791|ref|ZP_06986746.1| beta-glucosidase [Bacteroides sp. 3_1_19]
gi|298266669|gb|EFI08327.1| beta-glucosidase [Bacteroides sp. 3_1_19]
Length = 868
Score = 316 bits (810), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 179/447 (40%), Positives = 245/447 (54%), Gaps = 37/447 (8%)
Query: 41 ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
+ +PF LP+ +R DL+SRLT +EKI Q++N PAI RLGIP Y+WW+EALHGVA
Sbjct: 24 QDYPFRNPELPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVARA 83
Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IG 153
G+ AT FPQ I AA+FD + + EARA Y+ Q G
Sbjct: 84 GR----------ATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQYQKDKEYDRYKG 133
Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
+TFW PNINIFRDPRWGRG ET GEDP +T K V+ RG+QGD N + AC
Sbjct: 134 LTFWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQGDDPN------YYKTHAC 187
Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
KH+ + W R++F+A T +DL +TY P FE+ VK+G +MCAYNR G P
Sbjct: 188 AKHYAVHSGPEW---NRHEFNAEATPRDLYETYLPAFEALVKEGDVQEVMCAYNRFEGKP 244
Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEG----YAKSP--EDAVVDVLKAGM 327
C+ LL R WG+ I SDC A+ + + + P E A D + G
Sbjct: 245 CCSSDKLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETHPDAESASADAVLNGT 304
Query: 328 DVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVV 387
D+ CG + K A+ K+ E ++D +L L R LG+F+ + + P+ KI VV
Sbjct: 305 DLECGGSYRALNK-ALADGKISEKDLDVSLRRLLKGRFELGMFDPDERV-PYSKIPYSVV 362
Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
SP H AL A+ IVLLKN + +LPL K+ +A++GPNA + L NY G +
Sbjct: 363 ESPEHIAKALDMARKSIVLLKNKNNMLPLDKNIK-KIAVVGPNAADSTMLWANYNGFPSK 421
Query: 448 SITPLQALQNYVEN--TVYYPGCDTVA 472
++T ++ ++N V N +Y GC+ A
Sbjct: 422 TVTIVEGIRNKVPNAEVIYELGCNHTA 448
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 76/291 (26%), Positives = 133/291 (45%), Gaps = 51/291 (17%)
Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
K AD +V + G+ E EE+ DR ++ +P Q+E++ + A KPV+ V
Sbjct: 603 KDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEMVKALV-ATGKPVVYV 661
Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK 595
+ G + + + + ++ +IL A Y G+ G A+A+V+FGD+NP GRLP+T+Y
Sbjct: 662 VCTGSALALNWE--NDHVNAILNAWYGGQEGGTAVADVLFGDYNPAGRLPITFYKSVDQL 719
Query: 596 VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSS 655
D M+ GRTYR+ ++PFG GLSY+ + YK +S++K+ N+S +
Sbjct: 720 PDFQDYSMK--------GRTYRYMTQTPLYPFGYGLSYTTFDYKNAKLSKDKIASNESVT 771
Query: 656 TKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRP 715
++ + N G+M G +++K G P
Sbjct: 772 -----------------------------LSFDIANTGKMDGDEVAQIYIKNPNDPAG-P 801
Query: 716 IKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
+K + F+ V + A + + +L P S M + G + ++ G
Sbjct: 802 LKAMKAFKRVNVKAGSEQPVSIQLEPKAFQSFNDNTQTMEVRPGKYQILYG 852
>gi|375149998|ref|YP_005012439.1| Beta-glucosidase [Niastella koreensis GR20-10]
gi|361064044|gb|AEW03036.1| Beta-glucosidase [Niastella koreensis GR20-10]
Length = 875
Score = 316 bits (809), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 175/422 (41%), Positives = 238/422 (56%), Gaps = 28/422 (6%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
FPF L R DLVSRLTL+EK++Q++N+AP IPRL IPAY+WW+E LHGVA
Sbjct: 28 FPFQNYRLSFEDRVNDLVSRLTLEEKVAQMLNAAPGIPRLDIPAYDWWNETLHGVAR--- 84
Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA--------IGM 154
T T FPQ I AA++D+ YR+ LE R ++N A +G+
Sbjct: 85 ------TPYNVTVFPQAIAMAATWDTAALYRMADCSALEGRVIHNKAIAAGKEKDRYLGL 138
Query: 155 TFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACC 214
T+W PNINIFRDPRWGRGQET GEDP +T A ++VRG+QG+ L+A+AC
Sbjct: 139 TYWTPNINIFRDPRWGRGQETYGEDPYLTAALADAFVRGLQGND------PKYLKAAACA 192
Query: 215 KHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPS 274
KH Y + + +R+ FD VT DL DTY P F+ V +G+MCAYN P
Sbjct: 193 KH---YAVHSGPEPSRHVFDVDVTPYDLWDTYLPSFKKLVTVSNVAGVMCAYNAFRKQPC 249
Query: 275 CADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSF 334
CA L++ R QW F GY+TSDC A+ Y A D + G D++CG+
Sbjct: 250 CASDVLMTDILRNQWSFKGYVTSDCGAIDDFYRNHKTHPDAAAASADAVFHGTDIDCGNE 309
Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
+ AVK+ K+ E +ID ++ LF +R RLG+F+ P+M + + A + S AH
Sbjct: 310 AYRALVQAVKENKITEKQIDISVKRLFMIRFRLGMFDP-PSMVKYAQTPATELESAAHAK 368
Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
AL A + IVLLKN++ LPL K + ++GPNA + LGNY+G + IT Q
Sbjct: 369 HALLMAHESIVLLKNANNTLPLKKGLK-KIVVLGPNATNVIAPLGNYSGTPSKLITLFQG 427
Query: 455 LQ 456
++
Sbjct: 428 IK 429
Score = 101 bits (252), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 79/275 (28%), Positives = 120/275 (43%), Gaps = 52/275 (18%)
Query: 488 ADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLL 537
AD + G+ E EE+ DR ++LP Q EL+ + +A+ KPV+ V++
Sbjct: 607 ADAFIFAGGISPQLEGEEMKVSDPGFKGGDRTTILLPAIQTELM-KALQASGKPVVFVMM 665
Query: 538 CGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVP 597
G + + NI +I+ A Y G+A ALA+V+FGD+NP GRLP+T+Y D
Sbjct: 666 TGSALATPWES--ENIPAIVNAWYGGQAAGTALADVLFGDYNPSGRLPVTFYGSDNDLPS 723
Query: 598 MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTK 657
D M+ RTYR++ GK ++ FG GLSY+ + Y +Q +
Sbjct: 724 FEDYSMK--------NRTYRYFTGKPLYGFGYGLSYTTFRY------------DQLTMPV 763
Query: 658 MVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIK 717
+N V VT+ V N G+ G ++V +K
Sbjct: 764 TAQNGKPVK------------------VTVRVTNTGKTTGDEVAQIYVVNENTSIQTALK 805
Query: 718 QLVGFQSVILNAKEKAEIVFELSPCESLSRAREDG 752
L GFQ + L E + F L + L+ DG
Sbjct: 806 TLKGFQRISLRPAESKMVSFVLQ-SDDLTYVDADG 839
>gi|225873995|ref|YP_002755454.1| beta-xylosidase B [Acidobacterium capsulatum ATCC 51196]
gi|225792796|gb|ACO32886.1| beta-xylosidase B [Acidobacterium capsulatum ATCC 51196]
Length = 896
Score = 316 bits (809), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 178/450 (39%), Positives = 252/450 (56%), Gaps = 39/450 (8%)
Query: 44 PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
P+ PI +R +LVS++TL E+ +Q++N+APAIPRLG+PAY WWSE LHG+A G
Sbjct: 38 PWDNPNQPIQKRVHELVSQMTLQEEAAQMMNTAPAIPRLGVPAYNWWSEGLHGIARSGY- 96
Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN-------AGQAIGMTF 156
AT FPQ I +A+FD +++G + EARA YN G+T
Sbjct: 97 ---------ATVFPQAIGMSATFDPAAIHQMGTTVSTEARAKYNWAIRHDIHSIYFGLTL 147
Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKH 216
WAPNINI RDPRWGRGQET GEDP +TG A YV G+QG+ L+ A KH
Sbjct: 148 WAPNINIVRDPRWGRGQETYGEDPFLTGTMAAEYVSGLQGNN------PKYLKTVATPKH 201
Query: 217 FTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
F+ Y N + R+K +A + D+ DTY F + +G A +MC+YN V G+PSCA
Sbjct: 202 FSVY---NGPESMRHKINANPSAHDMQDTYLAAFRMAITKGHADSMMCSYNAVYGVPSCA 258
Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIY--DAEGYAKSPEDAVVDVLKAGMDVNCGSF 334
++ LL+ R +WGF GYITSDC A+S Y A GY+ A + AG D +CG+
Sbjct: 259 NK-LLADVVRGKWGFDGYITSDCGAISDFYRPGAHGYSPDAVHAAASAVLAGTDTDCGTG 317
Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
K +V+Q + ++ IDRA+ LF+ R RLG+F+ + P+ I VV S AH+
Sbjct: 318 Y-KVLPQSVQQGLISKAAIDRAVERLFTARFRLGMFDPKADV-PYNSIPYSVVDSAAHRA 375
Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
AL+ A +VLLKN G+LPL ++++ A++GPNA + ++ GNY P+
Sbjct: 376 QALEDASKSMVLLKNEGGILPLRNARTI--AVVGPNAANLNSIEGNYNAIPSHPSLPVDG 433
Query: 455 LQNYVENTVYYPGCDTVACSSASIDKAVDI 484
++ +P V +S D + +
Sbjct: 434 IE------AAFPQAHVVYAQGSSFDAVLPV 457
Score = 132 bits (332), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 95/285 (33%), Positives = 135/285 (47%), Gaps = 52/285 (18%)
Query: 495 MGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDI 544
MGL T EE+ DR L LP QQ+L+ + A KPV+LVLL G + I
Sbjct: 633 MGLSPTFVSEEMPIKIPGFDGGDRTRLSLPQTQQDLLHALV-ATGKPVVLVLLNGSALSI 691
Query: 545 TFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMR 604
+AK +++ IL A YPGEAG A+ E + G ++PGG+LP+T+Y P TD M+
Sbjct: 692 DWAK--QHVQGILEAWYPGEAGGEAIGETLSGQNDPGGKLPITFYTSVKDLPPFTDYSMK 749
Query: 605 PQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDV 664
GRTYR+Y GK +FPFG GLSY+ + Y +S + L +
Sbjct: 750 --------GRTYRYYTGKPLFPFGYGLSYTTFEYSHVRLSTSNLKAGE------------ 789
Query: 665 VHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQS 724
V VKN G +AG ++V P + G P+K+L GF
Sbjct: 790 -----------------PLTVEAEVKNTGHVAGDAVTEVYVTPPQNGV-NPLKELKGFDR 831
Query: 725 VILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEE 769
V L + ++ F L+P LS E G ++ G + + VG +
Sbjct: 832 VHLAPGQSRQLTFTLNP-RDLSLVDEAGKRSVQPGVYSIFVGGSQ 875
>gi|423331656|ref|ZP_17309440.1| hypothetical protein HMPREF1075_01453 [Parabacteroides distasonis
CL03T12C09]
gi|409230226|gb|EKN23094.1| hypothetical protein HMPREF1075_01453 [Parabacteroides distasonis
CL03T12C09]
Length = 868
Score = 316 bits (809), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 179/447 (40%), Positives = 244/447 (54%), Gaps = 37/447 (8%)
Query: 41 ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
+ +PF LP+ +R DL+SRLT +EKI Q++N PAI RLGIP Y+WW+EALHGVA
Sbjct: 24 QDYPFRNPDLPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVARA 83
Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IG 153
G+ AT FPQ I AA+FD + + EARA Y+ Q G
Sbjct: 84 GR----------ATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQYQKDKEYDRYKG 133
Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
+TFW PNINIFRDPRWGRG ET GEDP +T K V+ RG+QGD N + AC
Sbjct: 134 LTFWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQGDDPN------YYKTHAC 187
Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
KH+ + W R++FD T +DL +TY P FE+ VK+G +MCAYNR G P
Sbjct: 188 AKHYAVHSGPEW---NRHEFDVEATPRDLYETYLPAFEALVKEGDVQEVMCAYNRFEGKP 244
Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEG----YAKSP--EDAVVDVLKAGM 327
C+ LL R WG+ I SDC A+ + + + P E A D + G
Sbjct: 245 CCSSDKLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETHPDAESASADAVLNGT 304
Query: 328 DVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVV 387
D+ CG + K A+ K+ E ++D +L L R LG+F+ + + P+ KI VV
Sbjct: 305 DLECGGSYRALNK-ALADGKISEKDLDVSLRRLLKGRFELGMFDPDERV-PYSKIPYSVV 362
Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
SP H AL A+ IVLLKN + +LPL K+ +A++GPNA + L NY G +
Sbjct: 363 ESPEHIAKALDMARKSIVLLKNKNNMLPLDKNIK-KIAVVGPNAADSTMLWANYNGFPTK 421
Query: 448 SITPLQALQNYVEN--TVYYPGCDTVA 472
++T ++ ++N V N +Y GC+ A
Sbjct: 422 TVTIVEGIRNKVPNAEVIYELGCNHTA 448
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 76/291 (26%), Positives = 132/291 (45%), Gaps = 51/291 (17%)
Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
K AD +V + G+ E EE+ DR ++ +P Q+E++ + A KPV+ V
Sbjct: 603 KDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEMVKALV-ATGKPVVYV 661
Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK 595
+ G + + + + ++ +IL A Y G+ G A+A+V+FGD+NP GRLP+T+Y
Sbjct: 662 VCTGSALALNWE--NDHVNAILNAWYGGQEGGTAVADVLFGDYNPAGRLPITFYKSVDQL 719
Query: 596 VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSS 655
D M+ GRTYR+ ++PFG GLSY+ + YK +S++K+ N+S +
Sbjct: 720 PDFQDYSMK--------GRTYRYMTQTPLYPFGYGLSYTTFDYKNAKLSKDKIASNESVT 771
Query: 656 TKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRP 715
++ + N G+M G +++K G P
Sbjct: 772 -----------------------------LSFDIANTGKMDGDEVAQIYIKNPNDPAG-P 801
Query: 716 IKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
+K + F+ V + A + +L P S M + G + ++ G
Sbjct: 802 LKAMKAFKRVNVKAGSAQPVSIQLEPKAFQSFNDNTQTMEVRPGKYQILYG 852
>gi|256840106|ref|ZP_05545615.1| glycoside hydrolase family beta-glycosidase [Parabacteroides sp.
D13]
gi|256739036|gb|EEU52361.1| glycoside hydrolase family beta-glycosidase [Parabacteroides sp.
D13]
Length = 868
Score = 316 bits (809), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 179/447 (40%), Positives = 245/447 (54%), Gaps = 37/447 (8%)
Query: 41 ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
+ +PF LP+ +R DL+SRLT +EKI Q++N PAI RLGIP Y+WW+EALHGVA
Sbjct: 24 QDYPFRNPDLPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVARA 83
Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IG 153
G+ AT FPQ I AA+FD + + EARA Y+ Q G
Sbjct: 84 GR----------ATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQYQKDKEYDRYKG 133
Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
+TFW PNINIFRDPRWGRG ET GEDP +T K V+ RG+QGD N + AC
Sbjct: 134 LTFWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQGDDPN------YYKTHAC 187
Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
KH+ + W R++F+A T +DL +TY P FE+ VK+G +MCAYNR G P
Sbjct: 188 AKHYAVHSGPEW---NRHEFNAEATPRDLYETYLPAFEALVKEGDVQEVMCAYNRFEGKP 244
Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEG----YAKSP--EDAVVDVLKAGM 327
C+ LL R WG+ I SDC A+ + + + P E A D + G
Sbjct: 245 CCSSDKLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETHPDAESASADAVLNGT 304
Query: 328 DVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVV 387
D+ CG + K A+ K+ E ++D +L L R LG+F+ + + P+ KI VV
Sbjct: 305 DLECGGSYRALNK-ALADGKISEKDLDVSLRRLLKGRFELGMFDPDERV-PYSKIPYSVV 362
Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
SP H AL A+ IVLLKN + +LPL K+ +A++GPNA + L NY G +
Sbjct: 363 ESPEHIAKALDMARKSIVLLKNKNNMLPLDKNIK-KIAVVGPNAADSTMLWANYNGFPSK 421
Query: 448 SITPLQALQNYVEN--TVYYPGCDTVA 472
++T ++ ++N V N +Y GC+ A
Sbjct: 422 TVTIVEGIRNKVPNAEVIYELGCNHTA 448
Score = 105 bits (263), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 77/291 (26%), Positives = 133/291 (45%), Gaps = 51/291 (17%)
Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
K AD +V + G+ E EE+ DR ++ +P Q+E++ + A KPV+ V
Sbjct: 603 KDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEMVKALV-ATGKPVVYV 661
Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK 595
+ G + + + + ++ +IL A Y G+ G A+A+V+FGD+NP GRLP+T+Y
Sbjct: 662 VCTGSALALNWE--NDHVNAILNAWYGGQEGGTAVADVLFGDYNPAGRLPITFYKSVDQL 719
Query: 596 VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSS 655
D M+ GRTYR+ ++PFG GLSY+ + YK +S++K+ N+S +
Sbjct: 720 PDFQDYSMK--------GRTYRYMTQTPLYPFGYGLSYTTFDYKNAKLSKDKIASNESVT 771
Query: 656 TKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRP 715
++ + N G+M G +++K G P
Sbjct: 772 -----------------------------LSFDIANTGKMDGDEVAQIYIKNPNDPAG-P 801
Query: 716 IKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
+K + F+ V + A + + +L P S M I G + ++ G
Sbjct: 802 LKAMKAFKRVNVKAGSEQPVSIQLEPKAFQSFNDNTQTMEIRPGKYQILYG 852
>gi|373951852|ref|ZP_09611812.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
gi|373888452|gb|EHQ24349.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
Length = 871
Score = 316 bits (809), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 169/425 (39%), Positives = 243/425 (57%), Gaps = 28/425 (6%)
Query: 40 TETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAG 99
T +P+ L + R DLV R+TL+EK+SQ++NS+PAIPRL IPAY+WW+E LHGVA
Sbjct: 23 TSDYPYQNYHLDFTTRVNDLVKRMTLEEKVSQMLNSSPAIPRLKIPAYDWWNEVLHGVAR 82
Query: 100 VGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QA 151
T T +PQ I AA+FD ++ LE RA++N +
Sbjct: 83 ---------TPFKVTVYPQAIAMAATFDRQSLNQMADYAALEGRAVHNKALQMRKPGEKY 133
Query: 152 IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQAS 211
+G+T+W PNINIFRDPRWGRGQET GEDP +TG ++V G+QG N K L+A+
Sbjct: 134 LGLTYWTPNINIFRDPRWGRGQETYGEDPFLTGAMGSAFVSGLQG---NDPKY---LKAA 187
Query: 212 ACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
AC KH Y + + R+ F+A ++ DL DTY P F+ V + +G+MCAYN
Sbjct: 188 ACAKH---YAVHSGPEPLRHVFNADISTYDLWDTYLPAFKKLVVDDKVAGVMCAYNAFKT 244
Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC 331
P C L+ R QW F GY+TSDC + + + EDA D + G D+ C
Sbjct: 245 QPCCGSDLLMVDILRNQWKFSGYVTSDCGGIDDFFKNHKTHATAEDASTDAVLHGTDIEC 304
Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
G+ K AAVK+ K+ E++ID ++ LF +R RLG+F+ + ++ + + V+ SP
Sbjct: 305 GTDAYKSLVAAVKEGKISETQIDISVKRLFMIRFRLGMFDPSDVVK-YAQTPVSVLESPE 363
Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITP 451
HQ AL+ A+ +VLLKN++ LPL K+ + ++GPNA++ +LGNY G T
Sbjct: 364 HQAHALKMARQSVVLLKNANHTLPLSKTIR-KIVVLGPNADNPIAILGNYNGTPSNLTTV 422
Query: 452 LQALQ 456
Q ++
Sbjct: 423 YQGIR 427
Score = 113 bits (282), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 83/280 (29%), Positives = 135/280 (48%), Gaps = 51/280 (18%)
Query: 488 ADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLL 537
AD +V + G+ E EE+ DR + LP Q L+ + +A KPV+ V++
Sbjct: 603 ADAIVYVGGISPQLEGEEMQVNYPGFNGGDRTSIQLPAAQTNLM-KTLQATGKPVVFVMM 661
Query: 538 CGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVP 597
G + + NI +I+ A Y G+A A+A+V+FGD+NP GRLP+T+Y D
Sbjct: 662 TGSALATPWEA--ENIPAIVNAWYGGQAAGTAVADVLFGDYNPAGRLPVTFYKSDTDLPD 719
Query: 598 MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTK 657
TD M RTYR+++G ++ FG GLSY+++ Y +KL + +
Sbjct: 720 FTDYSMT--------NRTYRYFKGIPLYGFGYGLSYTQFKY-------DKLIVPAT---- 760
Query: 658 MVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIK 717
V++ +H +++ V N G++AG V +++K + P+K
Sbjct: 761 -VKSGKAIH------------------LSVTVTNSGQIAGDEVVQIYMKHHSQRIKVPLK 801
Query: 718 QLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIE 757
L GF V L A E+ + F LSP + + GL+ I+
Sbjct: 802 ALKGFARVYLKAGERRTLNFILSPDDLAVTSSNGGLVPIK 841
>gi|374312362|ref|YP_005058792.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
gi|358754372|gb|AEU37762.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
Length = 874
Score = 315 bits (808), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 175/428 (40%), Positives = 249/428 (58%), Gaps = 31/428 (7%)
Query: 47 KTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFF 106
+T I R +L++++T+ E+I+QL + APAI RLG+P+Y WW+E LHG+A G
Sbjct: 30 QTPQQIDARIDELIAKMTVSERIAQLQDRAPAIERLGLPSYNWWNEGLHGLARDGY---- 85
Query: 107 NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY------NAGQAIGMTFWAPN 160
AT FPQ I AA++D+ L + +G + EARA + N + G+T W+PN
Sbjct: 86 ------ATVFPQAIGLAATWDAPLLHEVGDVVSTEARAKFYSHGGENTPRFGGLTVWSPN 139
Query: 161 INIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASACCKHFTA 219
INIFRDPRWGRGQET GEDP +T +V GVQG D F L+A A KHF A
Sbjct: 140 INIFRDPRWGRGQETYGEDPFLTATLGTQFVEGVQGNDPF-------YLKADATPKHFAA 192
Query: 220 YDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRN 279
+ + R F+A V+ DLADTY P F + A+ +MC+YN ++G PSCA N
Sbjct: 193 H---SGPEEGRDSFNAVVSPHDLADTYLPAFHALTTNAHAAALMCSYNEIDGTPSCASGN 249
Query: 280 LLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT 339
L R +WGF GY+ SDCDAV I +A D L AG+D++CG+ +
Sbjct: 250 NLQDLVRERWGFKGYVVSDCDAVGNIAGYHHFATDNAHGAADALNAGVDLDCGNTYAALS 309
Query: 340 KAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQA 399
K ++ Q E+++++ALH L R+RLG+ + + P+ IGA+ + SPAH LAL+A
Sbjct: 310 K-SLDQNLTTEAKLNQALHRLLLARVRLGMLD-PLSCSPYRDIGAEELDSPAHHTLALRA 367
Query: 400 AQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV 459
A++ IVLLKN G+LPL ++ + +++IGP A+ K L NY G + ITPL ++
Sbjct: 368 AEESIVLLKND-GVLPL-QASTQKVSVIGPTADMVKVLEANYHGTALHPITPLDGFRSRF 425
Query: 460 ENTVYYPG 467
+ Y G
Sbjct: 426 HDVSYAQG 433
Score = 118 bits (295), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 95/303 (31%), Positives = 139/303 (45%), Gaps = 52/303 (17%)
Query: 474 SSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITR 523
++A +D+AV A +D +V +GL E E L DR L LP Q+ L++R
Sbjct: 591 AAALLDQAVQTAAKSDVIVAFVGLSPDLEGEALQLRLKGFNGGDRTSLDLPEAQRTLLSR 650
Query: 524 VAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGR 583
+ + KPVI+VL G V ++ +L A YPGEAG ALA ++ G+ NP GR
Sbjct: 651 LTQL-HKPVIIVLTSGSGV--ALGPEAKDAAGVLEAWYPGEAGGEALAGILAGNVNPSGR 707
Query: 584 LPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAV 643
LP+T+Y TD M RTYR+++G +FPFG GLSYS + Y
Sbjct: 708 LPVTFYRSVDDLPAFTDYSM--------AHRTYRYFDGPVLFPFGYGLSYSHFQY----- 754
Query: 644 SQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLL 703
Q +L ST M++ + + + V N + G L
Sbjct: 755 GQLRL------STHMLKT------------------SEPLVAMVTVHNESQREGTEVAEL 790
Query: 704 FVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFL 763
+++P + +G P L G Q V L E E+ F+L+P + LS G + G + L
Sbjct: 791 YLQPP-QASGAPRLTLQGVQRVALRPGETRELTFKLAPGQ-LSTVDTSGARTVRAGEYKL 848
Query: 764 VVG 766
VG
Sbjct: 849 FVG 851
>gi|262381651|ref|ZP_06074789.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
2_1_33B]
gi|262296828|gb|EEY84758.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
2_1_33B]
Length = 868
Score = 315 bits (808), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 179/447 (40%), Positives = 245/447 (54%), Gaps = 37/447 (8%)
Query: 41 ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
+ +PF LP+ +R DL+SRLT +EKI Q++N PAI RLGIP Y+WW+EALHGVA
Sbjct: 24 QDYPFRNPELPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVARA 83
Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IG 153
G+ AT FPQ I AA+FD + + EARA Y+ Q G
Sbjct: 84 GR----------ATVFPQAIAMAATFDDNAVHETFTIVSDEARAKYHQYQKDKEYDRYKG 133
Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
+TFW PNINIFRDPRWGRG ET GEDP +T K V+ RG+QGD N + AC
Sbjct: 134 LTFWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQGDDPN------YYKTHAC 187
Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
KH+ + W R++F+A T +DL +TY P FE+ VK+G +MCAYNR G P
Sbjct: 188 AKHYAVHSGPEW---NRHEFNAEATPRDLYETYLPAFEALVKEGDVQEVMCAYNRFEGKP 244
Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEG----YAKSP--EDAVVDVLKAGM 327
C+ LL R WG+ I SDC A+ + + + P E A D + G
Sbjct: 245 CCSSDKLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETHPDAESASADAVLNGT 304
Query: 328 DVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVV 387
D+ CG + K A+ K+ E ++D +L L R LG+F+ + + P+ KI VV
Sbjct: 305 DLECGGSYRALNK-ALADGKISEKDLDVSLRRLLKGRFELGMFDPDERV-PYSKIPYSVV 362
Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
SP H AL A+ IVLLKN + +LPL K+ +A++GPNA + L NY G +
Sbjct: 363 ESPEHIAKALDMARKSIVLLKNKNNMLPLDKNIK-KIAVVGPNAADSTMLWANYNGFPSK 421
Query: 448 SITPLQALQNYVEN--TVYYPGCDTVA 472
++T ++ ++N V N +Y GC+ A
Sbjct: 422 TVTIVEGIRNKVPNAEVIYELGCNHTA 448
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 76/291 (26%), Positives = 133/291 (45%), Gaps = 51/291 (17%)
Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
K AD +V + G+ E EE+ DR ++ +P Q+E++ + A KPV+ V
Sbjct: 603 KDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEMVKALV-ATGKPVVYV 661
Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK 595
+ G + + + + ++ +IL A Y G+ G A+A+V+FGD+NP GRLP+T+Y
Sbjct: 662 VCTGSALALNWE--NDHVNAILNAWYGGQEGGTAVADVLFGDYNPAGRLPITFYKSVDQL 719
Query: 596 VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSS 655
D M+ GRTYR+ ++PFG GLSY+ + YK +S++K+ N+S +
Sbjct: 720 PDFQDYSMK--------GRTYRYMTQTPLYPFGYGLSYTTFDYKNAKLSKDKIASNESVT 771
Query: 656 TKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRP 715
++ + N G+M G +++K G P
Sbjct: 772 -----------------------------LSFDIANTGKMDGDEVAQIYIKNPNDPAG-P 801
Query: 716 IKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
+K + F+ V + A + + +L P S M + G + ++ G
Sbjct: 802 LKAMKAFKRVNVKAGSEQPVSIQLEPKAFQSFNDNTQTMEVRPGKYQILYG 852
>gi|21233528|ref|NP_639445.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
str. ATCC 33913]
gi|66770493|ref|YP_245255.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
str. 8004]
gi|21115383|gb|AAM43327.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
str. ATCC 33913]
gi|66575825|gb|AAY51235.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
str. 8004]
Length = 896
Score = 315 bits (808), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 185/445 (41%), Positives = 254/445 (57%), Gaps = 33/445 (7%)
Query: 36 SNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALH 95
++ +T P+ T P+ RA DLVSR+TL+EK +Q+ N+APAIPRL +P Y+WW+EALH
Sbjct: 31 THAATPPPPYLDPTQPLQARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPEYDWWNEALH 90
Query: 96 GVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA---- 151
GVA G GAT FPQ I AA+FD+ L + AI EARA ++A A
Sbjct: 91 GVARAG----------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLAGGEH 140
Query: 152 ---IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKL 208
G+TFW+PNINIFRDPRWGRGQET GEDP +T + V++V+G+Q KL
Sbjct: 141 KRYQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQQGPYRKL---- 196
Query: 209 QASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
A KH+ + + R+ FD + +DL +TY P F++ V++G + +M AYNR
Sbjct: 197 --DATAKHYAVH---SGPEADRHHFDVHPSERDLYETYLPAFQALVQEGHVAAVMGAYNR 251
Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
VNG S + L RR WGF GYI SDC A+ I+ +PE A +K G D
Sbjct: 252 VNG-ESASASTRLEGILRRDWGFDGYIVSDCAAIRDIWQNHKIVPTPEAAAALGVKHGTD 310
Query: 329 VNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVC 388
++CG AAV+ + E+ IDR+L L + R+RLG+F+ P P+ + A
Sbjct: 311 LDCGDTYAA-LPAAVRAGLIDEATIDRSLTRLMAARLRLGMFD-PPAKVPWAQTPASANQ 368
Query: 389 SPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS 448
SP H LA + A++ +VLLKN GLLPL K +A++GP A+ +LLGNY G
Sbjct: 369 SPQHDALARRTARESLVLLKND-GLLPL-KPTLKRIAVVGPTADDPMSLLGNYYGTPAAP 426
Query: 449 ITPLQALQNYV--ENTVYYPGCDTV 471
+T LQ +++ VY G D V
Sbjct: 427 VTILQGIRDAAPQAEVVYARGSDLV 451
Score = 142 bits (359), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 113/361 (31%), Positives = 170/361 (47%), Gaps = 66/361 (18%)
Query: 411 HGLLPLPKSKSVSLALIGPNAN----SAKTLLGNYA-GPSCRSITP---LQALQNYVENT 462
HG L P S + L + + K L+ ++ P RS T LQA + Y
Sbjct: 537 HGQLLPPVSGNYELQIAADDGVRLYLDGKPLIDQWSDAPRMRSSTATVALQAGKAYDLRV 596
Query: 463 VYYPGCDTVACSSA--------SIDKAVDIAKGADHVVLMMGLDQTQEKEELD------- 507
YY A + +AVD A+ AD VV + GL E EE+D
Sbjct: 597 EYYEATRDAGVRLAWRMPGAKPPLQEAVDAARNADVVVFVGGLTGDVEGEEMDVNYPGFA 656
Query: 508 ---RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGE 564
R D LP Q+EL+ + +A PV+ VL G + I +A+ +++ +IL A YPG+
Sbjct: 657 GGDRTDTRLPKPQRELL-QALQATGTPVVAVLTTGSALAIDWAQ--QHVPAILLAWYPGQ 713
Query: 565 AGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEV 624
G A+ +V+FG +PGGRLP+T+Y +D D MR GRTYR+++GK +
Sbjct: 714 RGGTAVGDVLFGQASPGGRLPITFYKEDERLPAFDDYAMR--------GRTYRYFDGKPL 765
Query: 625 FPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFL 684
+PFG GL+Y++++Y + L L+++ +V GT
Sbjct: 766 YPFGHGLAYTQFAY-------SNLRLDRT---------------TVAADGT-------LR 796
Query: 685 VTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCES 744
T+ VKN G+ AG V L++ P R K+L GFQ + L E E+ F ++P E+
Sbjct: 797 ATVSVKNTGQRAGDEVVQLYLHPLNPQRERARKELRGFQRITLQPGEHREVSFNITPREA 856
Query: 745 L 745
L
Sbjct: 857 L 857
>gi|384430040|ref|YP_005639401.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. raphani
756C]
gi|341939144|gb|AEL09283.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. raphani
756C]
Length = 896
Score = 315 bits (808), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 185/445 (41%), Positives = 254/445 (57%), Gaps = 33/445 (7%)
Query: 36 SNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALH 95
++ +T P+ T P+ RA DLVSR+TL+EK +Q+ N+APAIPRL +P Y+WW+EALH
Sbjct: 31 THAATPPPPYLDPTQPLQARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPEYDWWNEALH 90
Query: 96 GVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA---- 151
GVA G GAT FPQ I AA+FD+ L + AI EARA ++A A
Sbjct: 91 GVARAG----------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLARGEH 140
Query: 152 ---IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKL 208
G+TFW+PNINIFRDPRWGRGQET GEDP +T + V++V+G+Q KL
Sbjct: 141 KRYQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQQGPYRKL---- 196
Query: 209 QASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
A KH+ + + R+ FD + +DL +TY P F++ V++G + +M AYNR
Sbjct: 197 --DATAKHYAVH---SGPEADRHHFDVHPSERDLYETYLPAFQALVQEGHVAAVMGAYNR 251
Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
VNG + A L RR WGF GYI SDC A+ I+ +PE A +K G D
Sbjct: 252 VNGESASAS-TRLEGILRRDWGFDGYIVSDCAAIRDIWQNHKIVPTPEAAAALGVKHGTD 310
Query: 329 VNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVC 388
++CG AAV+ + E+ IDR+L L + R+RLG+F+ P P+ + A
Sbjct: 311 LDCGDTYAA-LPAAVRAGLIDEATIDRSLTRLMAARLRLGMFD-PPAKVPWAQTPASANQ 368
Query: 389 SPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS 448
SP H LA + A++ +VLLKN GLLPL K +A++GP A+ +LLGNY G
Sbjct: 369 SPQHDALARRTARESLVLLKND-GLLPL-KPTLKRIAVVGPTADDPMSLLGNYYGTPAAP 426
Query: 449 ITPLQALQNYV--ENTVYYPGCDTV 471
+T LQ +++ VY G D V
Sbjct: 427 VTILQGIRDAAPQAEVVYARGSDLV 451
Score = 141 bits (356), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 113/361 (31%), Positives = 170/361 (47%), Gaps = 66/361 (18%)
Query: 411 HGLLPLPKSKSVSLALIGPNAN----SAKTLLGNYA-GPSCRSITP---LQALQNYVENT 462
HG L P S + L + + K L+ ++ P RS T LQA + Y
Sbjct: 537 HGQLLPPVSGNYELQIAADDGVRLYLDGKPLIDQWSDAPRMRSGTATVALQAGKAYDLRV 596
Query: 463 VYYPGCDTVACSSA--------SIDKAVDIAKGADHVVLMMGLDQTQEKEELD------- 507
YY A + +AVD A+ AD VV + GL E EE+D
Sbjct: 597 EYYEATRDAGVRLAWRMPGAKPPLQEAVDAARNADVVVFVGGLTGDVEGEEMDVNYPGFA 656
Query: 508 ---RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGE 564
R D LP Q+EL+ + +A PV+ VL G + I +A+ +++ +IL A YPG+
Sbjct: 657 GGDRTDTRLPKPQRELL-QALQATGTPVVAVLTTGSALAIDWAQ--QHVPAILLAWYPGQ 713
Query: 565 AGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEV 624
G A+ +V+FG +PGGRLP+T+Y +D D MR GRTYR+++GK +
Sbjct: 714 RGGTAVGDVLFGQASPGGRLPITFYKEDERLPAFDDYAMR--------GRTYRYFDGKPL 765
Query: 625 FPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFL 684
+PFG GL+Y++++Y + L L+++ +V GT
Sbjct: 766 YPFGHGLAYTQFAY-------SNLRLDRT---------------TVAADGT-------LR 796
Query: 685 VTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCES 744
T+ VKN G+ AG V L++ P R K+L GFQ + L E E+ F ++P E+
Sbjct: 797 ATVWVKNTGQRAGDEVVQLYLHPLNPQRERARKELRGFQRITLQPGEHREVSFTITPREA 856
Query: 745 L 745
L
Sbjct: 857 L 857
>gi|389794400|ref|ZP_10197553.1| beta-glucosidase-related glycosidase [Rhodanobacter fulvus Jip2]
gi|388432423|gb|EIL89432.1| beta-glucosidase-related glycosidase [Rhodanobacter fulvus Jip2]
Length = 902
Score = 315 bits (807), Expect = 6e-83, Method: Compositional matrix adjust.
Identities = 185/448 (41%), Positives = 250/448 (55%), Gaps = 33/448 (7%)
Query: 38 PSTETFPFCKT-TLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHG 96
PS + P + + RA DLV+ +TL+EK +Q+ N+APAIPRLG+ AY+WW+E LHG
Sbjct: 39 PSAASEPVYRDLSRSFHDRAADLVAHMTLEEKAAQMQNTAPAIPRLGVAAYDWWNEGLHG 98
Query: 97 VAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------G 149
VA G+ AT FPQ I AA+FD L + + AI EARA YN G
Sbjct: 99 VARAGQ----------ATVFPQAIGLAATFDVPLMHEVATAISDEARAKYNEFQRKGSHG 148
Query: 150 QAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQ 209
+ G+T+W+PNINIFRDPRWGRGQET GEDP +T + V++V G+QGD KL
Sbjct: 149 RYEGLTYWSPNINIFRDPRWGRGQETYGEDPYLTERMGVAFVTGLQGDNPTYRKL----- 203
Query: 210 ASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRV 269
A KHF + + R+ FD + +DL +TY P F++ V++ +M AYNRV
Sbjct: 204 -DATAKHFAVH---SGPEADRHHFDVHPSERDLYETYLPAFQTLVQEADVDAVMSAYNRV 259
Query: 270 NGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDV 329
NG P+ LL + R+ WGF GY+ SDC AV IY + E A +K G+D+
Sbjct: 260 NGEPATGSPRLLGQILRKDWGFKGYVVSDCGAVEDIYKHHKVVDTVEAASALAVKNGVDL 319
Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
+CG+ K AV + ESEID AL L RMRLG+F+ + P+ + V S
Sbjct: 320 DCGTEYAALVK-AVHDGLIKESEIDAALTRLMQARMRLGMFDPASKV-PWSDVPYSVNQS 377
Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
P H LA +AA++ +VLLKN G+LPL K +A+IGP A+ L+GNY G +
Sbjct: 378 PQHDALARRAARESMVLLKND-GVLPLSKDIK-HIAVIGPTADDVMALVGNYHGTPADPV 435
Query: 450 TPLQALQNYV--ENTVYYPGCDTVACSS 475
T L+ ++ VY G D V S
Sbjct: 436 TILRGIREAAPQAKVVYARGVDLVEGRS 463
Score = 122 bits (307), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 82/265 (30%), Positives = 129/265 (48%), Gaps = 50/265 (18%)
Query: 491 VVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGG 540
VV GL E EE+ DR DL LP Q++L+ + +A KPV+LVL G
Sbjct: 641 VVFAGGLTSDVEGEEMKVNYPGFAGGDRTDLRLPATQRKLLEAL-QATGKPVVLVLTSGS 699
Query: 541 PVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTD 600
+ + +A ++++ ++L A YPG+ G A+A+V+FG +P GRLP+T+Y D
Sbjct: 700 ALAVDWA--NQHLPAVLLAWYPGQRGGNAVADVLFGKADPAGRLPVTFYKASEKLPAFDD 757
Query: 601 MKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVE 660
+M GRTYR+++G+ ++PFG GLSY+K++Y + NK+ N
Sbjct: 758 YRM--------DGRTYRYFKGEPLYPFGYGLSYTKFTYADLKLDHNKIGKND-------- 801
Query: 661 NQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLV 720
K VT+ V N G+ AG V L+++ + R K L
Sbjct: 802 ---------------------KLHVTVKVHNAGKRAGDEVVQLYLRGVGTPHERSNKDLR 840
Query: 721 GFQSVILNAKEKAEIVFELSPCESL 745
G Q + L + ++ F++SP L
Sbjct: 841 GIQRITLQPGQTRDVSFDVSPATDL 865
>gi|427386425|ref|ZP_18882622.1| hypothetical protein HMPREF9447_03655 [Bacteroides oleiciplenus YIT
12058]
gi|425726465|gb|EKU89330.1| hypothetical protein HMPREF9447_03655 [Bacteroides oleiciplenus YIT
12058]
Length = 864
Score = 314 bits (805), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 173/430 (40%), Positives = 245/430 (56%), Gaps = 28/430 (6%)
Query: 42 TFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVG 101
+ F LP+ +R DLV LTL+EKISQ++N+APAI RLGIPAY WW+E LHGVA
Sbjct: 23 VYKFQNPDLPVEERVNDLVGHLTLEEKISQMMNNAPAIERLGIPAYNWWNECLHGVAR-- 80
Query: 102 KGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGM 154
+ TSFPQ I AA++D+ Y++ + E RA+Y+ G G+
Sbjct: 81 -------SPYPVTSFPQAIAMAATWDTKSVYQMAEYASDEGRAIYHDAARKGTPGIFRGL 133
Query: 155 TFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACC 214
T+W+PNINIFRDPRWGRGQET GEDP +T V++V+G+QGD L++SAC
Sbjct: 134 TYWSPNINIFRDPRWGRGQETYGEDPYLTAAIGVAFVKGLQGDD------PVYLKSSACA 187
Query: 215 KHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPS 274
KH+ + W R+ ++A V+ DL DTY P F V + +G+MCAYN P
Sbjct: 188 KHYAVHSGPEW---NRHTYNAEVSNHDLWDTYLPAFRELVVDAKVTGVMCAYNSFFEQPC 244
Query: 275 CADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSF 334
C + L+ R QW F GY+TSDC A+ Y+ + +A D + G D CG+
Sbjct: 245 CGNDLLMMDILRNQWKFDGYVTSDCGAIEDFYNTHNTHEDAAEASADAVLHGTDCECGNG 304
Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
+ A+ + + E ++D +L LF +R RLG+F+ + + P+ I V+ AH+
Sbjct: 305 AYRALADAIVRGLITEEQVDVSLKKLFEIRFRLGMFDPDDRV-PYSDIPISVLECDAHKA 363
Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQ 453
AL+ A+ IVLLKN LLPL +K +A++GPNA+ LL NY G PSC + T L+
Sbjct: 364 HALKMARQSIVLLKNEKQLLPLDMNKIKKIAVVGPNADDKSVLLANYYGYPSCVT-TVLE 422
Query: 454 ALQNYVENTV 463
++ V + V
Sbjct: 423 GIKGKVGDQV 432
Score = 103 bits (257), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 84/268 (31%), Positives = 123/268 (45%), Gaps = 53/268 (19%)
Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
K AD VV + GL E EE+ DR + +P QQ L+ + A KPVI +
Sbjct: 597 KDADVVVFVGGLSAKVEGEEMKVEIDGFKRGDRTSISIPVVQQNLLKELY-ATGKPVIFI 655
Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK 595
L+ G V + + ++ +IL A Y G+AG A+A+V+FGD+NP GRLP+T+Y
Sbjct: 656 LMTGSAVGLEWES--EHLPAILNAWYGGQAGGQAIADVLFGDYNPSGRLPLTFYKNVNDL 713
Query: 596 VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSS 655
D M+ RTYR++ G V+PFG GLSY+ + Y ++
Sbjct: 714 PDFEDYSMK--------NRTYRYFTGIPVYPFGYGLSYTDFQY---------------NT 750
Query: 656 TKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRP 715
K+ + D + K VT V N G+ G+ V L+V R P
Sbjct: 751 IKVQPSLDKLSVK----------------VTAEVSNVGKYEGEEVVQLYVSNP-RDFVTP 793
Query: 716 IKQLVGFQSVILNAKEKAEIVFELSPCE 743
I+ L GF+ + L E + F L+ E
Sbjct: 794 IRALKGFRRINLKPGESQMVEFVLTSKE 821
>gi|399029285|ref|ZP_10730258.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
gi|398072895|gb|EJL64089.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
Length = 871
Score = 314 bits (804), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 182/441 (41%), Positives = 249/441 (56%), Gaps = 33/441 (7%)
Query: 39 STETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVA 98
+ E F F L QR DLVSR+++DEKISQL++S+PAI RLG+P Y WW+E+LHGVA
Sbjct: 20 AQENFAFKNPNLTTEQRVDDLVSRMSIDEKISQLMDSSPAIERLGVPEYNWWNESLHGVA 79
Query: 99 GVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQA 151
G AT FPQ I A+S+D L + + I EARA ++ G
Sbjct: 80 RAGY----------ATVFPQSISIASSWDRQLIFDVANVISDEARAKHHEYLRRGQHGMY 129
Query: 152 IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQAS 211
G+TFW+PN+NIFRDPRWGRG ET GEDP +TG+ + YV G+QG + L+
Sbjct: 130 QGLTFWSPNVNIFRDPRWGRGHETYGEDPFLTGQLGLKYVNGLQGTN------EKYLKVI 183
Query: 212 ACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
A KH+ + +R+ F+A + DL +TY P F + VK+G +M AYNR G
Sbjct: 184 ATAKHYAVHSGPE---PSRHLFNAETSDIDLYETYLPAFRTLVKEGHVYSVMGAYNRFRG 240
Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC 331
SC+ L R WGF GYI SDC AV+ I+ A LK G+D+ C
Sbjct: 241 -ESCSASPFLFNILRNVWGFDGYIVSDCGAVTDIWKYHKITGDAATASALALKDGLDLEC 299
Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
GS K K A+ +K + E++ID A+ LF+ R +LG+F+ + + +I V + A
Sbjct: 300 GSSF-KSLKEAIDRKLISEADIDIAVKRLFTARFKLGMFDPEEIVS-YAQIPYSVNNNSA 357
Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITP 451
H LA A+Q IVLLKN + LPL + ++A+IGPNAN ++L GNY+G IT
Sbjct: 358 HDWLARVASQKSIVLLKNQNNTLPLSRDIK-TVAVIGPNANDVQSLWGNYSGVPSNPITV 416
Query: 452 LQALQNYVE-NT--VYYPGCD 469
L+ +QN +E NT +Y G D
Sbjct: 417 LKGIQNKLEPNTKVLYAKGTD 437
Score = 152 bits (383), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 101/302 (33%), Positives = 153/302 (50%), Gaps = 52/302 (17%)
Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
+ +AV +A AD +VL++GL++ E EE+ DR L LP Q+EL+ + A
Sbjct: 589 LQEAVQVAGQADAIVLVLGLNERLEGEEMKVEADGFEGGDRTSLDLPSNQEELM-KAMTA 647
Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMT 587
KPVILVL+ G + I +A + ++ +IL AGYPG+ G A+A+V+FGD+NP GRLP+T
Sbjct: 648 TGKPVILVLINGSALSINWA--NDHVPAILTAGYPGQQGGNAIADVLFGDYNPAGRLPVT 705
Query: 588 WYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNK 647
+Y + M+ GRTYR+++ K ++PFG GLSY+K+ Y
Sbjct: 706 YYKSTEQLPAFENYDMK--------GRTYRYFQKKPLYPFGFGLSYTKFKY--------- 748
Query: 648 LYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKP 707
S +L T + F + + V N GE G + L++K
Sbjct: 749 ---------------------SNLKLPTNVTPEKDFEILVDVTNIGERDGDEVIELYLKD 787
Query: 708 ARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGD 767
+ RPI QL GF+ V L E + F ++P + LS + G VIE G + VG
Sbjct: 788 EKASTPRPILQLEGFERVNLKKGETKTVRFTITPRQ-LSLINKKGQRVIEPGWFTISVGG 846
Query: 768 EE 769
++
Sbjct: 847 KQ 848
>gi|332665860|ref|YP_004448648.1| beta-glucosidase [Haliscomenobacter hydrossis DSM 1100]
gi|332334674|gb|AEE51775.1| Beta-glucosidase [Haliscomenobacter hydrossis DSM 1100]
Length = 887
Score = 314 bits (804), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 172/427 (40%), Positives = 241/427 (56%), Gaps = 28/427 (6%)
Query: 39 STETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVA 98
S + FP T L R +DLVSRLTL+EK+ Q++N+APAIPRLGIPAY+WW+E LHGVA
Sbjct: 36 SFKGFPMWDTNLSFEVRVKDLVSRLTLEEKVGQMLNAAPAIPRLGIPAYDWWNEVLHGVA 95
Query: 99 GVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG----- 153
T T +PQ I AA +DS + LE RA++N A+G
Sbjct: 96 R---------TPFHVTVYPQAIGMAAGWDSTSLAMMAHYSALEGRAVFNKATALGRNNER 146
Query: 154 ---MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQA 210
+T+W PNINIFRDPRWGRGQET GEDP +T ++VRG+QGD L+A
Sbjct: 147 YLGLTYWTPNINIFRDPRWGRGQETYGEDPFLTSMLGRAFVRGLQGDD------PKYLKA 200
Query: 211 SACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVN 270
+AC KHF + +R+ + + DL DTY P F+ V + + G+MCAYN +
Sbjct: 201 AACAKHFAVHSGPE---PSRHSDNFSPSNYDLWDTYLPAFKELVTKAKVEGVMCAYNAFH 257
Query: 271 GIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVN 330
G P C L++ R+QW F GY+TSDC A+ + A VD + G DV
Sbjct: 258 GQPCCGSDVLMNDILRKQWQFKGYVTSDCWAIDDFFKFHKTHPDATSASVDAVLHGTDVE 317
Query: 331 CGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSP 390
CG+ + K VK+ + E+++D +L LF+ R RLG+F+ +M + + ++ +
Sbjct: 318 CGTDVYKSLLDGVKKGMIAEAQLDISLIRLFTTRYRLGMFD-PVSMVKYAQTPESILETA 376
Query: 391 AHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSIT 450
H+ +L+ AQ IVLLKN LPL K+ +A++GPNA++ +LGNY G IT
Sbjct: 377 EHKAHSLKMAQQSIVLLKNEGNTLPLSKNIK-KIAVLGPNADNRIVVLGNYNGQPSEIIT 435
Query: 451 PLQALQN 457
LQ ++N
Sbjct: 436 ALQGIKN 442
Score = 112 bits (280), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 84/301 (27%), Positives = 140/301 (46%), Gaps = 52/301 (17%)
Query: 476 ASIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVA 525
+++ V+ K AD +V + G+ E EE+ DR ++LP Q EL+ ++
Sbjct: 607 SNLSAIVNRVKDADAIVYVGGISPQLEGEEMRVDFPGFNGGDRTSILLPAVQTELL-KML 665
Query: 526 EAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLP 585
+ KP++ V++ G + + + D+NI +I+ A Y G++ A+A+V+FGD+NP GRLP
Sbjct: 666 KGTGKPLVFVVMTGSAIALPYE--DQNIPAIVNAWYGGQSAGTAIADVLFGDYNPAGRLP 723
Query: 586 MTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQ 645
+T+Y D + D K + RTYR+++G ++PFG GLSY+ + Y K +
Sbjct: 724 VTFYKAD---SDLPDFK-----SYDMNNRTYRYFKGDALYPFGHGLSYTSFQYS-KLKTP 774
Query: 646 NKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFV 705
K+ S F V+ + N G+ G V L++
Sbjct: 775 GKIKSGAS-----------------------------FKVSATLTNTGKKDGDEVVQLYL 805
Query: 706 KPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVV 765
PI+ L GF + L A E + F LSP E E+G + +G + +
Sbjct: 806 AYPEVAGKAPIRALKGFNRIRLKAGESKTVSFTLSP-EQCQLVNEEGALYQPKGKMEISL 864
Query: 766 G 766
G
Sbjct: 865 G 865
>gi|408369545|ref|ZP_11167326.1| glycoside hydrolase [Galbibacter sp. ck-I2-15]
gi|407745291|gb|EKF56857.1| glycoside hydrolase [Galbibacter sp. ck-I2-15]
Length = 881
Score = 313 bits (801), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 184/434 (42%), Positives = 250/434 (57%), Gaps = 32/434 (7%)
Query: 41 ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
+ +PF L S R DL+ RLT++EKI QL+ ++PAI RLGIP Y WW+E+LHGVA
Sbjct: 25 QQYPFQNPELDDSARVADLLERLTVEEKIDQLLYTSPAIERLGIPEYNWWNESLHGVARA 84
Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIG 153
G AT FPQ I AA++DS L + AI EARA ++ G G
Sbjct: 85 GY----------ATVFPQSITIAAAWDSDLLKEVADAISDEARAKHHEYIRRGQRGIYQG 134
Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
+TFW+PNINIFRDPRWGRG ET GEDP +TG+ ++YV+G+QG+ N L+ A
Sbjct: 135 LTFWSPNINIFRDPRWGRGHETYGEDPYLTGQLGIAYVKGLQGNDPN------YLKLVAT 188
Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
KHF + + R++FD + +DL +TY P F VKQG +M AYNRV G
Sbjct: 189 AKHFAVH---SGPEPLRHEFDVSPSKRDLWETYLPAFRYLVKQGDVKSVMTAYNRVYGEA 245
Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGS 333
+ A L + R W F GY+ SDC A+S I+ AK +A + G D+NCG
Sbjct: 246 ASASDTLFT-ILRDYWDFDGYVVSDCFAISDIWKYHKIAKDAAEASAMAVIEGCDLNCGD 304
Query: 334 FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQ 393
+K + A +Q + E +ID AL L R++LG+F+ + P+ +I +V S H
Sbjct: 305 SYEKLNQ-AYQQGMVTEKDIDIALSRLMEARIKLGMFDPE-QLVPYAQIPFNVNTSEKHN 362
Query: 394 VLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQ 453
LAL+AA++ IVLLKN LLPL K S+A+IGPNA++ ++L GNY G IT LQ
Sbjct: 363 QLALKAAKESIVLLKNQGDLLPLSKDLK-SVAVIGPNADNIQSLWGNYNGNPKDPITVLQ 421
Query: 454 ALQNYV--ENTVYY 465
+QN + + TV Y
Sbjct: 422 GIQNALGPQTTVVY 435
Score = 133 bits (334), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 93/288 (32%), Positives = 144/288 (50%), Gaps = 54/288 (18%)
Query: 493 LMMGLDQTQEKEELD----------RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPV 542
+++GL++ E EE+D R L LP Q+ L+ VA+ K P++LVLL G +
Sbjct: 607 MVLGLNERLEGEEMDVVVEGFAGGDRTALDLPASQRTLLKEVAKTGK-PIVLVLLNGSAL 665
Query: 543 DITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMK 602
I +A NI +I+ AGY G+ G A+AEV+FGD+NP RLP+T+Y D
Sbjct: 666 SINWAA--ENIPAIMTAGYAGQQGGNAVAEVLFGDYNPAARLPVTYYKSVEDLPDFEDYN 723
Query: 603 MRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSY-KFKAVSQNKLYLNQSSSTKMVEN 661
M GRTYR++E + ++PFG GLSY+ + Y KF+ S K+ +N+S
Sbjct: 724 MD--------GRTYRYFEKEPLYPFGYGLSYTTFDYSKFQLPS--KIDMNES-------- 765
Query: 662 QDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVG 721
+++ V N G G V +++ + RPI++LVG
Sbjct: 766 ---------------------IELSVEVTNTGAYDGDEVVQVYLTDEKGSTPRPIRELVG 804
Query: 722 FQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEE 769
F+ + L E ++ F + P + LS + G +VIE G + VG E+
Sbjct: 805 FKRIHLKKGESQKVQFTIEPRQ-LSMIDDKGDLVIEPGVFSISVGGEQ 851
>gi|289670678|ref|ZP_06491753.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. musacearum
NCPPB 4381]
Length = 886
Score = 312 bits (800), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 175/421 (41%), Positives = 242/421 (57%), Gaps = 32/421 (7%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
QRA LV++++ +EK++Q +N APAIPRLGIPAYEWWSE LHG+A G A
Sbjct: 36 QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGH----------A 85
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGMTFWAPNINIFR 165
T FPQ I AAS+++ L ++G + EARA +N + G+T W+PNINIFR
Sbjct: 86 TVFPQAIGLAASWNTNLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFR 145
Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
DPRWGRG ET GEDP +TG+ AV ++RG+QG+ N + A KH + +
Sbjct: 146 DPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLN------HPRTIATPKHLAVH---SG 196
Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
R+ FD V+ D+ TY P F + + QG+A +MCAYN ++G P+CA LL+
Sbjct: 197 PEPGRHGFDVDVSPHDVEATYTPAFRAALVQGQAGSVMCAYNSLHGTPACAADWLLNGRV 256
Query: 286 RRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQ 345
R WGF G++ SDCDAV + + + LKAG D+NCG + A+++
Sbjct: 257 RGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCG-HAYRELGTAIER 315
Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
+ E+ +D++L LF+ R RLG P P+ ++GA V + AH+ LALQAA + IV
Sbjct: 316 GDVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRALALQAAAESIV 374
Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL-QNYVENTVY 464
LLKN LPL + LA+IGPNA++ L NY G S +TPL L Q + V
Sbjct: 375 LLKNDANTLPL--NAGTRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQVR 432
Query: 465 Y 465
Y
Sbjct: 433 Y 433
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 96/291 (32%), Positives = 141/291 (48%), Gaps = 56/291 (19%)
Query: 488 ADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLL 537
+D VV +GL E EEL DR D+ LP QQ L+ R A+A+ KP+++VL+
Sbjct: 615 SDAVVAFVGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLM 673
Query: 538 CGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWY--PQDYIK 595
G V + +AK + +I+ A YPG++G A+A ++ GD NPGGRLP+T+Y +D
Sbjct: 674 SGSAVALNWAKTHAD--AIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFYRSTKDLPA 731
Query: 596 VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSS 655
DMK GRTYR+++G+ +FPFG GLSY++++Y +S L
Sbjct: 732 YVSYDMK----------GRTYRYFKGEPLFPFGYGLSYTRFAYDAPQLSSTTLQAGN--- 778
Query: 656 TKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRP 715
VT V+N G AG ++++ R P
Sbjct: 779 --------------------------PLQVTTTVRNTGTHAGDEVAQVYLQYPDRPQS-P 811
Query: 716 IKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
++ LVGFQ V L A E+ + F L +LS G +E G + L VG
Sbjct: 812 LRSLVGFQRVHLAAGEQRTLTFNLD-ARALSDVDRSGQRAVEAGNYTLFVG 861
>gi|224538282|ref|ZP_03678821.1| hypothetical protein BACCELL_03173 [Bacteroides cellulosilyticus
DSM 14838]
gi|224520107|gb|EEF89212.1| hypothetical protein BACCELL_03173 [Bacteroides cellulosilyticus
DSM 14838]
Length = 864
Score = 312 bits (799), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 183/445 (41%), Positives = 247/445 (55%), Gaps = 39/445 (8%)
Query: 37 NPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHG 96
N E FP+ T+L +RA DL+ RLTL+EK S ++N +PAIPRL I AY WW+EALHG
Sbjct: 21 NKLPEKFPYQDTSLTAEERADDLLKRLTLEEKASLMMNGSPAIPRLSIKAYGWWNEALHG 80
Query: 97 VAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY-------NAG 149
+A G AT FPQ I ASFD L Y + A+ EARA N
Sbjct: 81 LARTGL----------ATVFPQAIGMGASFDDSLLYEVFTAVSDEARAKSRRLDSKGNLT 130
Query: 150 QAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKL 208
+ +T W PN+NIFRDPRWGRGQET GEDP +T + V+ V G+QG DT KL
Sbjct: 131 RYQALTVWTPNVNIFRDPRWGRGQETYGEDPYLTSRLGVAVVNGLQGPDTARYNKLH--- 187
Query: 209 QASACCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYN 267
AC KH+ + W R+ F+A ++ +DL +TY P F++ V++ + +MCAYN
Sbjct: 188 ---ACAKHYAVHSGPEW---NRHSFNAENISPRDLWETYLPAFKTLVQEAKVKEVMCAYN 241
Query: 268 RVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKA-- 325
R G P C LL++ R +WGF G + SDC AVS + + P+ A
Sbjct: 242 RFEGEPCCGSNRLLTQILRDEWGFDGVVVSDCGAVSDFWQKRKHETHPDAASASADAVLN 301
Query: 326 GMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD 385
G DV CG+ K AVK + E++ID ++ L R LG + N + I +D
Sbjct: 302 GTDVECGNSY-KSLPDAVKAGLITENQIDISVKRLLKARFELGEMDEN----VWTGISSD 356
Query: 386 VVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPS 445
VV SP H+ LALQ A++ + LL+N++ +LPL SK +ALIGPNAN + GNY G
Sbjct: 357 VVDSPKHRQLALQMARETMTLLQNNNNILPL--SKQAKIALIGPNANDSVMQWGNYNGLP 414
Query: 446 CRSITPLQALQNYV--ENTVYYPGC 468
+IT L+ +Q Y+ N +Y P C
Sbjct: 415 SHTITLLEGMQRYLPTSNLIYEPVC 439
Score = 101 bits (251), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 82/292 (28%), Positives = 132/292 (45%), Gaps = 53/292 (18%)
Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
K D V+ G+ E EE+ DR ++ LP Q+ ++ + A K+ ++ V
Sbjct: 599 KDVDVVIFAGGISPALEGEEMPVDAAGFRGGDRTEIELPAVQRRVVEALKTAGKR-IVFV 657
Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK 595
G I +N +IL A YPG+AG A+AEV+FGD+NP G+LP+T+Y ++ +
Sbjct: 658 NFSGAA--IALEPESQNCEAILQAWYPGQAGGQAVAEVLFGDYNPAGKLPLTFY-RNLAQ 714
Query: 596 VP-MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSS 654
+P D M GRTYR+ + +FPFG GLSY+ + Y KL +N
Sbjct: 715 IPDFEDYNM--------TGRTYRYMKETPLFPFGHGLSYTTFKY-------GKLKMNDD- 758
Query: 655 STKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGR 714
K+ Q++ + I V N G G V +++K G
Sbjct: 759 --KIAAGQNLN-------------------LAIPVTNTGSRDGDEVVQVYLKKMDDTEG- 796
Query: 715 PIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
P+K L F+ V + A + E+ F L + + M + G + +++G
Sbjct: 797 PVKTLRAFKRVRIPAGKTVEVKFSLDDTQLEWWDEQSNTMRVCPGNYTVMIG 848
>gi|423215029|ref|ZP_17201557.1| hypothetical protein HMPREF1074_03089 [Bacteroides xylanisolvens
CL03T12C04]
gi|392692292|gb|EIY85530.1| hypothetical protein HMPREF1074_03089 [Bacteroides xylanisolvens
CL03T12C04]
Length = 861
Score = 312 bits (799), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 179/444 (40%), Positives = 247/444 (55%), Gaps = 38/444 (8%)
Query: 41 ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
+ P+ T+L QRA DL+ RLTL+EK+S + N++PAIPRLGI YEWW+EALHGV
Sbjct: 22 QLLPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRA 81
Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA---LYNAGQAI----G 153
G AT FPQ I ASF+ L Y + A EAR ++ + G
Sbjct: 82 GL----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQG 131
Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASA 212
+TFW PN+NIFRDPRWGRGQET GEDP +TG+ ++ VRG+QG + KL A
Sbjct: 132 LTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLH------A 185
Query: 213 CCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
C KHF + W R+ FDA + +DL +TY P F+ V++ +MCAYNR G
Sbjct: 186 CAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEG 242
Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDV 329
P C LL + R +WG+ G + SDC A+S Y + P E A D ++AG D+
Sbjct: 243 EPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPDKEHASADAVRAGTDL 302
Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
CGS AVK + E EID +L L + R LG + P + +I V+ S
Sbjct: 303 ECGSEYASLAD-AVKAGLIDEKEIDISLKRLLTARFELGEMDEQPA---WSEIPTSVLNS 358
Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
HQ LAL+ A++ +VLL+N + +LPL + + +A++GPNAN + GNY G ++
Sbjct: 359 KEHQALALRMARESLVLLQNKNNILPL--NTHLKVAVMGPNANDSVMQWGNYNGIPAHTV 416
Query: 450 TPLQALQNYVE--NTVYYPGCDTV 471
T L+A++ + +Y PGCD V
Sbjct: 417 TLLEAVRAKLPEGQIIYEPGCDRV 440
Score = 85.1 bits (209), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 83/300 (27%), Positives = 139/300 (46%), Gaps = 53/300 (17%)
Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
++ AV AD ++ G+ + E EE+ DR D+ LP Q++L+ + +A
Sbjct: 588 LNFAVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKA 647
Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMT 587
KK V+ + G I +IL A YPG+AG A+ + ++G++NPGGRLP+T
Sbjct: 648 GKK-VVFINYSGSA--IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVT 704
Query: 588 WYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQN 646
+Y +D ++P D M+ GRTYR+ + + +FPFG GLSY+ ++Y +S+N
Sbjct: 705 FY-KDVNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTDFTYGEAKLSKN 755
Query: 647 KLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVK 706
+ + ++VV +TI V N G+ G+ V ++++
Sbjct: 756 TI----------AKGENVV-------------------LTIPVSNVGQRDGEEVVQVYLR 786
Query: 707 PARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
G P L F+ V + A + + L+ E M EGT+ L+ G
Sbjct: 787 RPGDKEG-PRYTLRAFKRVHIPAGKTESVAIPLTGVNFEWFDVESNTMRPLEGTYELLYG 845
>gi|423221630|ref|ZP_17208100.1| hypothetical protein HMPREF1062_00286 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392645869|gb|EIY39591.1| hypothetical protein HMPREF1062_00286 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 864
Score = 311 bits (798), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 183/445 (41%), Positives = 247/445 (55%), Gaps = 39/445 (8%)
Query: 37 NPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHG 96
N E FP+ T+L +RA DL+ RLTL+EK S ++N +PAIPRL I AY WW+EALHG
Sbjct: 21 NKLPEKFPYQDTSLTAEERADDLLKRLTLEEKASLMMNGSPAIPRLSIKAYGWWNEALHG 80
Query: 97 VAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY-------NAG 149
+A G AT FPQ I ASFD L Y + A+ EARA N
Sbjct: 81 LARTGL----------ATVFPQAIGMGASFDDSLLYEVFTAVSDEARAKSRRLDSKGNLT 130
Query: 150 QAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKL 208
+ +T W PN+NIFRDPRWGRGQET GEDP +T + V+ V G+QG DT KL
Sbjct: 131 RYQALTVWTPNVNIFRDPRWGRGQETYGEDPYLTSRLGVAVVNGLQGPDTARYNKLH--- 187
Query: 209 QASACCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYN 267
AC KH+ + W R+ F+A ++ +DL +TY P F++ V++ + +MCAYN
Sbjct: 188 ---ACAKHYAVHSGPEW---NRHSFNAENISPRDLWETYLPAFKTLVQEAKVKEVMCAYN 241
Query: 268 RVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKA-- 325
R G P C LL++ R +WGF G + SDC AVS + + P+ A
Sbjct: 242 RFEGEPCCGSNRLLTQILRDEWGFDGVVVSDCGAVSDFWQKRKHETHPDAASASADAVLN 301
Query: 326 GMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD 385
G DV CG+ K AVK + E++ID ++ L R LG + N + I +D
Sbjct: 302 GTDVECGNSY-KSLPDAVKAGLITENQIDISVKRLLKARFELGEMDEN----VWTGISSD 356
Query: 386 VVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPS 445
VV SP H+ LALQ A++ + LL+N++ +LPL SK +ALIGPNAN + GNY G
Sbjct: 357 VVDSPKHRQLALQMARETMTLLQNNNNILPL--SKQAKIALIGPNANDSVMQWGNYNGLP 414
Query: 446 CRSITPLQALQNYV--ENTVYYPGC 468
+IT L+ +Q Y+ N +Y P C
Sbjct: 415 SHTITLLEGMQRYLPTSNLIYEPVC 439
Score = 100 bits (248), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 82/292 (28%), Positives = 131/292 (44%), Gaps = 53/292 (18%)
Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
K D V+ G+ E EE+ DR ++ LP Q+ ++ + A K+ ++ V
Sbjct: 599 KDVDVVIFAGGISPALEGEEMPVDAAGFRGGDRTEIELPAVQRRVVEALKTAGKR-IVFV 657
Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK 595
G I N +IL A YPG+AG A+AEV+FGD+NP G+LP+T+Y ++ +
Sbjct: 658 NFSGAA--IALEPESLNCEAILQAWYPGQAGGQAVAEVLFGDYNPAGKLPLTFY-RNLAQ 714
Query: 596 VP-MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSS 654
+P D M GRTYR+ + +FPFG GLSY+ + Y KL +N
Sbjct: 715 IPDFEDYNM--------TGRTYRYMKETPLFPFGHGLSYTTFKY-------GKLKMNDD- 758
Query: 655 STKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGR 714
K+ Q++ + I V N G G V +++K G
Sbjct: 759 --KIAAGQNLN-------------------LVIPVTNTGSRDGDEVVQVYLKKMDDTEG- 796
Query: 715 PIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
P+K L F+ V + A + E+ F L + + M + G + +++G
Sbjct: 797 PVKTLRAFKRVRIPAGKTVEVKFSLDDTQLEWWDEQSNTMRVCPGNYTVMIG 848
>gi|289664871|ref|ZP_06486452.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. vasculorum
NCPPB 702]
Length = 886
Score = 311 bits (798), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 175/421 (41%), Positives = 241/421 (57%), Gaps = 32/421 (7%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
QRA LV+ ++ +EK++Q +N APAIPRLGIPAYEWWSE LHG+A G A
Sbjct: 36 QRAAALVAHMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGH----------A 85
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGMTFWAPNINIFR 165
T FPQ I AAS+++ L ++G + EARA +N + G+T W+PNINIFR
Sbjct: 86 TVFPQAIGLAASWNTNLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFR 145
Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
DPRWGRG ET GEDP +TG+ AV ++RG+QG+ N + A KH + +
Sbjct: 146 DPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLN------HPRTIATPKHLAVH---SG 196
Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
R+ FD V+ D+ TY P F + + QG+A +MCAYN ++G P+CA LL+
Sbjct: 197 PEPGRHGFDVDVSPHDVEATYTPAFRAALVQGQAGSVMCAYNSLHGTPACAADWLLNGRV 256
Query: 286 RRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQ 345
R WGF G++ SDCDAV + + + LKAG D+NCG + A+++
Sbjct: 257 RGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCG-HAYRELGTAIER 315
Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
+ E+ +D++L LF+ R RLG P P+ ++GA V + AH+ LALQAA + IV
Sbjct: 316 GDVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRALALQAAAESIV 374
Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL-QNYVENTVY 464
LLKN LPL + LA+IGPNA++ L NY G S +TPL L Q + V
Sbjct: 375 LLKNDANTLPL--NAGTRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQVR 432
Query: 465 Y 465
Y
Sbjct: 433 Y 433
Score = 135 bits (339), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 98/292 (33%), Positives = 144/292 (49%), Gaps = 58/292 (19%)
Query: 488 ADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLL 537
+D VV +GL E EEL DR D+ LP QQ L+ R A+A+ KP+++VL+
Sbjct: 615 SDAVVAFVGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLM 673
Query: 538 CGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWY--PQDYIK 595
G V + +AK + +I+ A YPG++G A+A ++ GD NPGGRLP+T+Y +D
Sbjct: 674 SGSAVALNWAKTHAD--AIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFYRSTKDLPA 731
Query: 596 VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSS 655
DMK GRTYR+++G+ +FPFG GLSY++++Y
Sbjct: 732 YVSYDMK----------GRTYRYFKGEPLFPFGYGLSYTRFAY----------------- 764
Query: 656 TKMVENQDVVHYKSVPELGTEFCETRKFL-VTIGVKNHGEMAGKHPVLLFVKPARRGNGR 714
P+L T + L VT V+N G AG ++++ R
Sbjct: 765 -------------DAPQLSTTALQAGNPLQVTTTVRNTGTRAGDEVAQVYLQYPDRPQS- 810
Query: 715 PIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
P++ LVGFQ V L A E+ + F L +LS G +E G + L VG
Sbjct: 811 PLRSLVGFQRVHLAAGEQRTLTFNLD-ARALSDVDRSGQRAVEAGNYTLFVG 861
>gi|299147288|ref|ZP_07040353.1| beta-glucosidase [Bacteroides sp. 3_1_23]
gi|298514566|gb|EFI38450.1| beta-glucosidase [Bacteroides sp. 3_1_23]
Length = 861
Score = 311 bits (798), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 179/444 (40%), Positives = 247/444 (55%), Gaps = 38/444 (8%)
Query: 41 ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
+ P+ T+L QRA DL+ RLTL+EK+S + N++PAIPRLGI YEWW+EALHGV
Sbjct: 22 QLLPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRA 81
Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA---LYNAGQAI----G 153
G AT FPQ I ASF+ L Y + A EAR ++ + G
Sbjct: 82 GL----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQG 131
Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASA 212
+TFW PN+NIFRDPRWGRGQET GEDP +TG+ ++ VRG+QG + KL A
Sbjct: 132 LTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLH------A 185
Query: 213 CCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
C KHF + W R+ FDA + +DL +TY P F+ V++ +MCAYNR G
Sbjct: 186 CAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEG 242
Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDV 329
P C LL + R +WG+ G + SDC A+S Y + P E A D ++AG D+
Sbjct: 243 EPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPDKEHASADAVRAGTDL 302
Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
CGS AVK + E EID +L L + R LG + P + +I V+ S
Sbjct: 303 ECGSEYASLAD-AVKAGLIDEKEIDISLKRLLTARFELGEMDEQPA---WSEIPTSVLNS 358
Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
HQ LAL+ A++ +VLL+N + +LPL + + +A++GPNAN + GNY G ++
Sbjct: 359 KEHQALALRMARESLVLLQNKNNILPL--NTHLKVAVMGPNANDSVMQWGNYNGIPAHTV 416
Query: 450 TPLQALQNYVE--NTVYYPGCDTV 471
T L+A++ + +Y PGCD V
Sbjct: 417 TLLEAVRAKLPEGQIIYEPGCDRV 440
Score = 83.2 bits (204), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 81/299 (27%), Positives = 133/299 (44%), Gaps = 51/299 (17%)
Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
++ AV AD ++ G+ + E EE+ DR D+ LP Q+ L+ + +A
Sbjct: 588 LNLAVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRNLLKALKKA 647
Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMT 587
KK V+ + G I +IL A YPG+AG A+ + ++G++NPGGRLP+T
Sbjct: 648 GKK-VVFINYSGSA--IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVT 704
Query: 588 WYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNK 647
+Y D M+ GRTYR+ + + +FPFG GLSY+ ++Y +S+N
Sbjct: 705 FYKNVNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTTFTYGEAKLSKNT 756
Query: 648 LYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKP 707
+ + ++VV +TI V N G+ G+ V ++++
Sbjct: 757 I----------AKGENVV-------------------LTIPVSNVGQRDGEEVVQVYLRR 787
Query: 708 ARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
G P L F+ V + A + + L+ E M EGT+ L+ G
Sbjct: 788 PGDKEG-PRYTLRAFKRVHIPAGKTESVAIPLTGVNFEWFDAESNTMRPLEGTYELLYG 845
>gi|423290405|ref|ZP_17269254.1| hypothetical protein HMPREF1069_04297 [Bacteroides ovatus
CL02T12C04]
gi|392665792|gb|EIY59315.1| hypothetical protein HMPREF1069_04297 [Bacteroides ovatus
CL02T12C04]
Length = 861
Score = 311 bits (798), Expect = 7e-82, Method: Compositional matrix adjust.
Identities = 179/444 (40%), Positives = 247/444 (55%), Gaps = 38/444 (8%)
Query: 41 ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
+ P+ T+L QRA DL+ RLTL+EK+S + N++PAIPRLGI YEWW+EALHGV
Sbjct: 22 QLLPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRA 81
Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA---LYNAGQAI----G 153
G AT FPQ I ASF+ L Y + A EAR ++ + G
Sbjct: 82 GL----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQG 131
Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASA 212
+TFW PN+NIFRDPRWGRGQET GEDP +TG+ ++ VRG+QG + KL A
Sbjct: 132 LTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLH------A 185
Query: 213 CCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
C KHF + W R+ FDA + +DL +TY P F+ V++ +MCAYNR G
Sbjct: 186 CAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEG 242
Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDV 329
P C LL + R +WG+ G + SDC A+S Y + P E A D ++AG D+
Sbjct: 243 EPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHETHPDKEHASADAVRAGTDL 302
Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
CGS AVK + E EID +L L + R LG + P + +I V+ S
Sbjct: 303 ECGSEYASLAD-AVKAGLIDEKEIDISLKRLLTARFELGEMDEQPA---WAEIPTSVLNS 358
Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
HQ LAL+ A++ +VLL+N + +LPL + + +A++GPNAN + GNY G ++
Sbjct: 359 KEHQALALRMARESLVLLQNKNNILPL--NTHLKVAVMGPNANDSVMQWGNYNGIPAHTV 416
Query: 450 TPLQALQNYVE--NTVYYPGCDTV 471
T L+A++ + +Y PGCD V
Sbjct: 417 TLLEAVRAKLPEGQIIYEPGCDRV 440
Score = 84.3 bits (207), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 83/300 (27%), Positives = 139/300 (46%), Gaps = 53/300 (17%)
Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
++ AV AD ++ G+ + E EE+ DR D+ LP Q++L+ + +A
Sbjct: 588 LNFAVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKA 647
Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMT 587
KK V+ + G I +IL A YPG+AG A+ + ++G++NPGGRLP+T
Sbjct: 648 GKK-VVFINYSGSA--IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVT 704
Query: 588 WYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQN 646
+Y +D ++P D M+ GRTYR+ + + +FPFG GLSY+ ++Y +S+N
Sbjct: 705 FY-KDVNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTTFTYGEAKLSKN 755
Query: 647 KLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVK 706
+ + ++VV +TI V N G+ G+ V ++++
Sbjct: 756 TI----------AKGENVV-------------------LTIPVSNVGQRDGEEVVQVYLR 786
Query: 707 PARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
G P L F+ V + A + + L+ E M EGT+ L+ G
Sbjct: 787 RPGDKEG-PRYTLRAFKRVHIPAGKTESVAIPLTGENFEWFDAESNTMRPLEGTYELLYG 845
>gi|298481648|ref|ZP_06999839.1| beta-glucosidase [Bacteroides sp. D22]
gi|298272189|gb|EFI13759.1| beta-glucosidase [Bacteroides sp. D22]
Length = 861
Score = 311 bits (798), Expect = 7e-82, Method: Compositional matrix adjust.
Identities = 179/444 (40%), Positives = 247/444 (55%), Gaps = 38/444 (8%)
Query: 41 ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
+ P+ T+L QRA DL+ RLTL+EK+S + N++PAIPRLGI YEWW+EALHGV
Sbjct: 22 QLLPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRA 81
Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA---LYNAGQAI----G 153
G AT FPQ I ASF+ L Y + A EAR ++ + G
Sbjct: 82 GL----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQG 131
Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASA 212
+TFW PN+NIFRDPRWGRGQET GEDP +TG+ ++ VRG+QG + KL A
Sbjct: 132 LTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLH------A 185
Query: 213 CCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
C KHF + W R+ FDA + +DL +TY P F+ V++ +MCAYNR G
Sbjct: 186 CAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEG 242
Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDV 329
P C LL + R +WG+ G + SDC A+S Y + P E A D ++AG D+
Sbjct: 243 EPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPDKEHASADAVRAGTDL 302
Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
CGS AVK + E EID +L L + R LG + P + +I V+ S
Sbjct: 303 ECGSEYASLAD-AVKAGLIDEKEIDISLKRLLTARFELGEMDEQPA---WSEIPTSVLNS 358
Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
HQ LAL+ A++ +VLL+N + +LPL + + +A++GPNAN + GNY G ++
Sbjct: 359 KEHQALALRMARESLVLLQNKNNILPL--NTHLKVAVMGPNANDSVMQWGNYNGIPAHTV 416
Query: 450 TPLQALQNYVE--NTVYYPGCDTV 471
T L+A++ + +Y PGCD V
Sbjct: 417 TLLEAVRAKLPEGQIIYEPGCDRV 440
Score = 83.6 bits (205), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 83/300 (27%), Positives = 138/300 (46%), Gaps = 53/300 (17%)
Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
++ AV AD ++ G+ + E EE+ DR D+ LP Q+ L+ + +A
Sbjct: 588 LNLAVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRNLLKALKKA 647
Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMT 587
KK V+ + G I +IL A YPG+AG A+ + ++G++NPGGRLP+T
Sbjct: 648 GKK-VVFINYSGSA--IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVT 704
Query: 588 WYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQN 646
+Y +D ++P D M+ GRTYR+ + + +FPFG GLSY+ ++Y +S+N
Sbjct: 705 FY-KDVNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTDFTYGEAKLSKN 755
Query: 647 KLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVK 706
+ + ++VV +TI V N G+ G+ V ++++
Sbjct: 756 TI----------AKGENVV-------------------LTIPVSNVGQCDGEEVVQVYLR 786
Query: 707 PARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
G P L F+ V + A + + L+ E M EGT+ L+ G
Sbjct: 787 RPGDKEG-PRYTLRAFKRVHIPAGKTESVAIPLTGENFEWFDVESNTMRPLEGTYELLYG 845
>gi|94970273|ref|YP_592321.1| Beta-glucosidase [Candidatus Koribacter versatilis Ellin345]
gi|94552323|gb|ABF42247.1| Beta-glucosidase [Candidatus Koribacter versatilis Ellin345]
Length = 881
Score = 311 bits (798), Expect = 7e-82, Method: Compositional matrix adjust.
Identities = 174/426 (40%), Positives = 242/426 (56%), Gaps = 37/426 (8%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
+ +L +RA DLV R+T++EK+SQL N + A+PRL +P Y+WWSEALHGVA G
Sbjct: 30 YLNPSLAPEKRAADLVHRMTVEEKVSQLTNDSRAVPRLNVPDYDWWSEALHGVAQPG--- 86
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTFW 157
T +PQ + AA+FD+ R+ + IG+E R + G G+ FW
Sbjct: 87 --------VTEYPQPVALAATFDNDKVQRMARFIGIEGRIKHEEGMKDGHSDIFQGLDFW 138
Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
APNINIFRDPRWGRGQET GEDP +T + V+YV+G+QGD A + KH+
Sbjct: 139 APNINIFRDPRWGRGQETYGEDPFLTARMGVAYVKGLQGDD------PKYYLAISTPKHY 192
Query: 218 TAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCAD 277
+ + TTR+ D +V+ D DTY P F + V + +A +MCAYN +NG P+C +
Sbjct: 193 AVH---SGPETTRHFADVKVSKHDELDTYLPAFRATVTEAKAGSVMCAYNSINGQPACVN 249
Query: 278 RNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQK 337
LL R +W F GY+ SDC+A+ IY + K+ +A ++ GMD C F ++
Sbjct: 250 EFLLQDQLRGKWNFQGYVVSDCEAIINIYRDHKFTKTQAEASALAVQRGMDNECVDFGKQ 309
Query: 338 HTKA-------AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSP 390
A KQ L ESEID AL LF+ RM+LG+F+ P M P+ KI + S
Sbjct: 310 KDDHDYRPYFDAYKQGILKESEIDTALVRLFTARMKLGMFD-PPEMVPYSKIDPKELESA 368
Query: 391 AHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSIT 450
H+ LA A + +VLLKN G LPL KS + +A+IGP A + LLGNY G +++
Sbjct: 369 EHRELARTLANESMVLLKND-GTLPLKKS-GLKIAVIGPLAEQTRYLLGNYNGTPSHTVS 426
Query: 451 PLQALQ 456
L+ L+
Sbjct: 427 VLEGLR 432
Score = 128 bits (322), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 95/296 (32%), Positives = 141/296 (47%), Gaps = 52/296 (17%)
Query: 481 AVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKK 530
AV AK AD V+ ++G+ E EE+ DR L LP +Q+L+ ++ AA K
Sbjct: 603 AVTAAKNADVVIAVLGITSDLEGEEMPVSEEGFNGGDRTSLDLPKPEQQLLESIS-AAGK 661
Query: 531 PVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYP 590
PV+LVL G + + +A+ N +IL YPGE G A+A+ + G +NP GRLP+T+Y
Sbjct: 662 PVVLVLSNGSALSVNWAQQHAN--AILEGWYPGEEGGTAIAQTLSGKNNPAGRLPVTFYT 719
Query: 591 QDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYL 650
P D M+ GRTYR++EGK ++PFG GLSY+ +SY+ A+ + L
Sbjct: 720 GTEQLPPFEDYAMK--------GRTYRYFEGKPLYPFGYGLSYTTFSYRDLALPKAPLNA 771
Query: 651 NQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARR 710
+ ++ V N G++ G L++
Sbjct: 772 GDPVTAQVT-----------------------------VTNTGKVEGDEVAQLYLSFPNI 802
Query: 711 GNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
G P++ L GF+ + L A E I FEL LS E G +I EG + + VG
Sbjct: 803 A-GAPLRALRGFRRIHLKAGESQTIKFELK-DRDLSMVNEAGDPIIAEGEYSVSVG 856
>gi|84623339|ref|YP_450711.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae MAFF
311018]
gi|188577358|ref|YP_001914287.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae PXO99A]
gi|84367279|dbj|BAE68437.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae MAFF
311018]
gi|188521810|gb|ACD59755.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae PXO99A]
Length = 889
Score = 311 bits (796), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 174/421 (41%), Positives = 245/421 (58%), Gaps = 32/421 (7%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
QRA DLV+ ++ +EK++Q +N APAIPRLGIPAYEWWSE LHG+A G A
Sbjct: 39 QRAADLVAHMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------A 88
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYN-AGQA-------IGMTFWAPNINIFR 165
T FPQ I AAS++++L ++G + EARA +N AG+ G+T W+PNINIFR
Sbjct: 89 TVFPQAIGLAASWNTHLMQQVGTVVSTEARAKFNQAGRPGKDHKRYAGLTIWSPNINIFR 148
Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
DPRWGRG ET GEDP +TG+ AV ++ G+QGD + + A KH + +
Sbjct: 149 DPRWGRGMETYGEDPFLTGQMAVGFIHGLQGDDLD------HPRTIATPKHLAVH---SG 199
Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
R+ FD V+ +D+ TY P F + + +G+A +MCAYN ++G P+CA L++
Sbjct: 200 PEPGRHGFDVDVSPRDVEATYTPAFRAAIVEGQAGAVMCAYNSLHGTPACAADWLINGRV 259
Query: 286 RRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQ 345
R WGF G++ SDCDAV + + + LKAG D+NCG + A+ +
Sbjct: 260 RGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCG-HAYRELGTAIAR 318
Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
++ E+ +D++L LF+ R RLG P P+ ++GA V + H+ LALQAA + IV
Sbjct: 319 GEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAQHRALALQAAAESIV 377
Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL-QNYVENTVY 464
LLKN+ LPL + LA+IGPNA++ L NY G S +TPL L Q + V
Sbjct: 378 LLKNNANTLPL--NAGTRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQVS 435
Query: 465 Y 465
Y
Sbjct: 436 Y 436
Score = 131 bits (329), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 95/283 (33%), Positives = 140/283 (49%), Gaps = 56/283 (19%)
Query: 496 GLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
GL E EEL DR D+ LP QQ L+ R A+A+ KP+++VL+ G V +
Sbjct: 626 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALN 684
Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWY--PQDYIKVPMTDMKM 603
+AK + +I+ A YPG++G A+A ++ GD NPGGRLP+T+Y +D DMK
Sbjct: 685 WAKTHAD--AIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFYRSTKDLPAYVSYDMK- 741
Query: 604 RPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQD 663
GRTYR+++G+ +FPFG GLSY++++Y +S + Q+ ST
Sbjct: 742 ---------GRTYRYFKGEPLFPFGYGLSYTRFAYDAPQLSSTAV---QAGST------- 782
Query: 664 VVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQ 723
VT V+N G AG ++++ R P++ LVGFQ
Sbjct: 783 -------------------LQVTTTVRNTGARAGDEVAQVYLQYPDRPQS-PLRSLVGFQ 822
Query: 724 SVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
V L A E+ + F L +LS G +E G + L VG
Sbjct: 823 RVHLAAGEQRTLTFNLD-ARALSDVDRSGQRAVEAGNYTLFVG 864
>gi|325914134|ref|ZP_08176487.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
ATCC 35937]
gi|325539637|gb|EGD11280.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
ATCC 35937]
Length = 874
Score = 311 bits (796), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 173/413 (41%), Positives = 240/413 (58%), Gaps = 31/413 (7%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
QRA LV++++ DEK++Q +N APAIPRL IPAYEWWSE LHG+A G A
Sbjct: 24 QRAAALVAQMSRDEKVAQAMNDAPAIPRLDIPAYEWWSEGLHGIARNGY----------A 73
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGMTFWAPNINIFR 165
T FPQ I AAS+++ L ++G + EARA +N + G+T W+PNINIFR
Sbjct: 74 TVFPQAIGLAASWNTALMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFR 133
Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
DPRWGRG ET GEDP +TG+ AV ++RG+QGD N + A KH + +
Sbjct: 134 DPRWGRGMETYGEDPFLTGQLAVGFIRGLQGDDLN------HPRTIATPKHIAVH---SG 184
Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
R+ FD V+ +D+ TY P F + + G+A +MCAYN ++G P+CA LL+
Sbjct: 185 PEPGRHGFDVDVSPRDMEATYTPAFRAALVDGQAWSVMCAYNSLHGTPACAADWLLNGRV 244
Query: 286 RRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQ 345
R WGF G++ SDCDAV + + + LKAG D+NCG + A+++
Sbjct: 245 RGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCG-HAYRELGTAIER 303
Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
++ E+ +D++L LF+ R RLG P P+ ++GA V + AH+ LALQAA + IV
Sbjct: 304 GEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRALALQAAAESIV 362
Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNY 458
LLKN+ LPL LA+IGPNA++ L NY G S ITPL L+ +
Sbjct: 363 LLKNTATTLPL--KAGTRLAVIGPNADALAALEANYQGTSATPITPLLGLRQH 413
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 98/292 (33%), Positives = 141/292 (48%), Gaps = 58/292 (19%)
Query: 488 ADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLL 537
+D VV +GL E EEL DR D+ LP QQ L+ R A+A+ KP+++VL+
Sbjct: 603 SDAVVAFVGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLM 661
Query: 538 CGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWY--PQDYIK 595
G V + +AK N +I+ A YPG++G A+A + GD NPGGRLP+T+Y +D
Sbjct: 662 SGSAVALNWAKA--NADAIVAAWYPGQSGGTAIARALAGDDNPGGRLPVTFYRSTKDLPA 719
Query: 596 VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSS 655
DMK GRTYR+++G+ +FPFG GLSY+ ++Y
Sbjct: 720 YVSYDMK----------GRTYRYFKGEPLFPFGYGLSYTSFAY----------------- 752
Query: 656 TKMVENQDVVHYKSVPELGTEFCETRKFL-VTIGVKNHGEMAGKHPVLLFVKPARRGNGR 714
P L T + L VT V+N G AG ++++ R
Sbjct: 753 -------------DAPRLSTRTLQAGNPLQVTTTVRNTGSRAGDEVAQVYLQYPDRPQS- 798
Query: 715 PIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
P++ LVGFQ V L E+ E+ F L +LS G +E G + + VG
Sbjct: 799 PLRSLVGFQRVHLKPGEQRELTFTLD-ARALSDVDRSGQRAVEAGEYRVFVG 849
>gi|427385138|ref|ZP_18881643.1| hypothetical protein HMPREF9447_02676 [Bacteroides oleiciplenus YIT
12058]
gi|425727306|gb|EKU90166.1| hypothetical protein HMPREF9447_02676 [Bacteroides oleiciplenus YIT
12058]
Length = 863
Score = 310 bits (795), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 188/469 (40%), Positives = 257/469 (54%), Gaps = 54/469 (11%)
Query: 13 LCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQL 72
LCLCF S +QPP+ NP+ L +RA DLV RLTL+EK S +
Sbjct: 14 LCLCFLS------CSQPPYK----NPA----------LTPEERAADLVGRLTLEEKASLM 53
Query: 73 VNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWY 132
N++PAIPRLGI AY+WW+EALHGV G AT FPQ I ASF++ L Y
Sbjct: 54 QNTSPAIPRLGIKAYDWWNEALHGVGRAGL----------ATVFPQAIGMGASFNNDLLY 103
Query: 133 RIGQAIGLEARA----LYNAG---QAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGK 185
+ A+ EARA G + G+T W PN+NIFRDPRWGRGQET GEDP +TG+
Sbjct: 104 DVFTAVSDEARAKTAEFSKEGGLKRYQGLTMWTPNVNIFRDPRWGRGQETYGEDPYLTGQ 163
Query: 186 YAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLAD 244
++ VRG+QG GGK + AC KHF + W R+ FDA V +DL +
Sbjct: 164 MGMAVVRGLQGP--EGGKYD---KLHACAKHFAVHSGPEW---NRHSFDAENVDPRDLWE 215
Query: 245 TYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSI 304
TY P F+ V++ +MCAYNR G P C LL + R +W + G I SDC A++
Sbjct: 216 TYLPAFKDLVQKAHVKEVMCAYNRFEGEPCCGSNRLLVQILRDEWAYDGIIVSDCWAIND 275
Query: 305 IYDAEGYAKSP--EDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFS 362
++ + P E A + G DV CG + AVK + E +ID +L L
Sbjct: 276 FFNKGAHETEPDKEHASAKAVLTGTDVECGESYASLPQ-AVKAGLIDEKKIDISLKRLMK 334
Query: 363 VRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSV 422
R LG + NP + + +I VV S H+ LAL+ A++ +VLL+N+ +LPL +KS+
Sbjct: 335 ARFELGEMD-NPELVSWAQIPYSVVDSKEHRELALRMARESLVLLQNNQNVLPL--NKSL 391
Query: 423 SLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENT--VYYPGCD 469
+A++GPNAN + GNY G ++T L+ ++ Y+ +Y PGCD
Sbjct: 392 KVAVVGPNANDSVMQWGNYNGFPGHTVTLLEGIRQYLPEAQLIYEPGCD 440
Score = 99.8 bits (247), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 85/299 (28%), Positives = 129/299 (43%), Gaps = 51/299 (17%)
Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
++K + K AD +V G+ E EE+ DR + LP Q L+ + +A
Sbjct: 590 LNKTIQRVKDADIIVFAGGISPAVEGEEMRVTIPGFKGGDRETIELPSIQSRLLAELKKA 649
Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMT 587
KK V+ V G + +T + +IL A YPG+AG A+A V+FGD+NP GRLP+T
Sbjct: 650 GKK-VVFVNFSGSAIALT--PETKTCDAILQAWYPGQAGGTAIANVLFGDYNPAGRLPVT 706
Query: 588 WYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNK 647
+Y D M+ GRTYR+ +FPFG GLSY+ + Y
Sbjct: 707 FYKSTSQLPDFEDYSMK--------GRTYRYMAEAPLFPFGHGLSYTTFRYG-------- 750
Query: 648 LYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKP 707
+ S ST+ V+ E + ++TI V N GE G+ V ++++
Sbjct: 751 ---DASLSTQEVK------------------EGEQAILTIPVSNTGERDGEEVVQVYLRR 789
Query: 708 ARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
G P L F+ V + + LS + E M EG + ++ G
Sbjct: 790 PGDKEG-PSHALRAFKRVNIAKGTTGNVTISLSKEDFEWFDTETNTMRPIEGDYEILYG 847
>gi|384420163|ref|YP_005629523.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzicola
BLS256]
gi|353463076|gb|AEQ97355.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzicola
BLS256]
Length = 889
Score = 310 bits (795), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 174/421 (41%), Positives = 244/421 (57%), Gaps = 32/421 (7%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
QRA DLV+ ++ +EK++Q +N APAIPRLGIPAYEWWSE LHG+A G A
Sbjct: 39 QRAADLVAHMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------A 88
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYN-AGQA-------IGMTFWAPNINIFR 165
T FPQ I AAS++++L ++G + EARA +N AG+ G+T W+PNINIFR
Sbjct: 89 TVFPQAIGLAASWNTHLMQQVGTVVSTEARAKFNQAGRPGKDHKRYAGLTIWSPNINIFR 148
Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
DPRWGRG ET GEDP +TG+ AV ++ G+QGD + + A KH + +
Sbjct: 149 DPRWGRGMETYGEDPFLTGQMAVGFIHGLQGDDLD------HPRTIATPKHLAVH---SG 199
Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
R+ FD V+ +D+ TY P F + + +G+A +MCAYN ++G P+CA L++
Sbjct: 200 PEPGRHGFDVDVSPRDVEATYTPAFRAAIVEGQAGAVMCAYNSLHGTPACAADWLINGRV 259
Query: 286 RRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQ 345
R WGF G++ SDCDAV + + + LKAG D+NCG + A+ +
Sbjct: 260 RGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCG-HAYRELGTAIAR 318
Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
++ E+ +D++L LF+ R RLG P P+ ++GA V + H+ LALQAA + IV
Sbjct: 319 GEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAQHRALALQAAAESIV 377
Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL-QNYVENTVY 464
LLKN+ LPL LA+IGPNA++ L NY G S +TPL L Q + V
Sbjct: 378 LLKNNANTLPL--KAGTRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQVS 435
Query: 465 Y 465
Y
Sbjct: 436 Y 436
Score = 128 bits (321), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 95/283 (33%), Positives = 139/283 (49%), Gaps = 56/283 (19%)
Query: 496 GLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
GL E EEL DR D+ LP QQ L+ R A+A+ KP+++VL+ G V +
Sbjct: 626 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALN 684
Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWY--PQDYIKVPMTDMKM 603
+AK + +I+ A YPG++G A+A ++ GD NPGGRLP+T+Y +D DMK
Sbjct: 685 WAKTHAD--AIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFYRSTKDLPAYVSYDMK- 741
Query: 604 RPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQD 663
GRTYR+++G+ +FPFG GLSY+ ++Y +S + Q+ ST
Sbjct: 742 ---------GRTYRYFKGEPLFPFGYGLSYTCFAYDAPQLSSTAV---QAGST------- 782
Query: 664 VVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQ 723
VT V+N G AG ++++ R P++ LVGFQ
Sbjct: 783 -------------------LQVTTTVRNTGARAGDEVAQVYLQYPDRPQS-PLRSLVGFQ 822
Query: 724 SVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
V L A E+ + F L +LS G +E G + L VG
Sbjct: 823 RVHLAAGEQRTLTFNLD-ARALSDVDPSGQRAVEAGNYTLFVG 864
>gi|423294294|ref|ZP_17272421.1| hypothetical protein HMPREF1070_01086 [Bacteroides ovatus
CL03T12C18]
gi|392675485|gb|EIY68926.1| hypothetical protein HMPREF1070_01086 [Bacteroides ovatus
CL03T12C18]
Length = 861
Score = 310 bits (794), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 179/444 (40%), Positives = 247/444 (55%), Gaps = 38/444 (8%)
Query: 41 ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
+ P+ T+L QRA DL+ RLTL+EK+S + N++PAIPRLGI YEWW+EALHGV
Sbjct: 22 QLLPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRA 81
Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA---LYNAGQAI----G 153
G AT FPQ I ASF+ L Y + A EAR ++ A+ G
Sbjct: 82 GL----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGALKRYQG 131
Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASA 212
+TFW PN+NIFRDPRWGRGQET GEDP +TG+ ++ VRG+QG + KL A
Sbjct: 132 LTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDTKYDKLH------A 185
Query: 213 CCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
C KHF + W R+ FDA + +DL +TY P F+ V++ +MCAYNR G
Sbjct: 186 CAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEG 242
Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDV 329
P C LL + R +WG+ G + SDC A+S Y + P E A ++ G D+
Sbjct: 243 EPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPDKEHASAAAVRTGTDL 302
Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
CGS AVK + E EID +L L + R LG + P + +I A V+ S
Sbjct: 303 ECGSEYASLAD-AVKAGLIDEKEIDISLKRLLTARFELGEMDEQPA---WSEIPASVLNS 358
Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
HQ LAL+ A++ +VLL+N + +LPL + + +A++GPNAN + GNY G ++
Sbjct: 359 KEHQALALRMARESLVLLQNKNNILPL--NTHLKVAVMGPNANDSVMQWGNYNGIPAHTV 416
Query: 450 TPLQALQNYVE--NTVYYPGCDTV 471
T L+A++ + +Y PGCD V
Sbjct: 417 TLLEAVRAKLPEGQIIYEPGCDRV 440
Score = 83.2 bits (204), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 83/300 (27%), Positives = 138/300 (46%), Gaps = 53/300 (17%)
Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
++ AV AD ++ G+ + E EE+ DR D+ LP Q+ L+ + +A
Sbjct: 588 LNFAVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRNLLKALKKA 647
Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMT 587
KK V+ + G I +IL A YPG+AG A+ + ++G++NPGGRLP+T
Sbjct: 648 GKK-VVFINYSGSA--IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVT 704
Query: 588 WYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQN 646
+Y +D ++P D M+ GRTYR+ + + +FPFG GLSY+ ++Y +S+N
Sbjct: 705 FY-KDVNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTTFTYGEAKLSKN 755
Query: 647 KLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVK 706
+ + ++VV +TI V N G+ G+ V ++++
Sbjct: 756 TI----------AKGENVV-------------------LTIPVSNVGQRDGEEVVQVYLR 786
Query: 707 PARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
G P L F+ V + A + + L+ E M EGT+ L+ G
Sbjct: 787 RPGDKEG-PRYTLRAFKRVHIPAGKTESVAIPLTGENFEWFDVESNTMRPLEGTYELLYG 845
>gi|325925754|ref|ZP_08187127.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas perforans
91-118]
gi|325543811|gb|EGD15221.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas perforans
91-118]
Length = 874
Score = 310 bits (794), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 174/421 (41%), Positives = 242/421 (57%), Gaps = 32/421 (7%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
QRA LV++++ +EK++Q +N APAIPRLGIPAYEWWSE LHG+A G A
Sbjct: 24 QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------A 73
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGMTFWAPNINIFR 165
T FPQ I AAS+++ L ++G + EARA +N + G+T W+PNINIFR
Sbjct: 74 TVFPQAIGLAASWNTRLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFR 133
Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
DPRWGRG ET GEDP +TG+ AV ++RG+QG+ N + A KH + +
Sbjct: 134 DPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLN------HPRTIATPKHIAVH---SG 184
Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
R+ FD V+ +D+ TY P F + + +G+A +MCAYN ++G P+CA LL+
Sbjct: 185 PEPGRHGFDVDVSPRDVEATYTPAFRAAIVEGQAGSVMCAYNSLHGTPACAADWLLNGRV 244
Query: 286 RRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQ 345
R WGF G++ SDCDAV + + + LKAG D+NCG + A+ +
Sbjct: 245 RGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCG-HAYRELGTAIAR 303
Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
++ E+ +D++L LF+ R RLG P P+ ++GA V + AH+ LALQAA + IV
Sbjct: 304 GEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRALALQAAAESIV 362
Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL-QNYVENTVY 464
LLKN LPL LA+IGPNA++ L NY G S +TPL L Q + V
Sbjct: 363 LLKNDANTLPL--KAGTRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQVS 420
Query: 465 Y 465
Y
Sbjct: 421 Y 421
Score = 127 bits (319), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 93/283 (32%), Positives = 136/283 (48%), Gaps = 56/283 (19%)
Query: 496 GLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
GL E EEL DR D+ LP QQ L+ R A+A+ KP+++VL+ G V +
Sbjct: 611 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALN 669
Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWY--PQDYIKVPMTDMKM 603
+AK + +I+ A YPG++G A+A ++ GD NPGGRLP+T+Y +D DMK
Sbjct: 670 WAKTHAD--AIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFYRSTKDLPAYVSYDMK- 726
Query: 604 RPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQD 663
GRTYR+++G+ +F FG GLSY++++Y +S L S
Sbjct: 727 ---------GRTYRYFKGEPLFAFGYGLSYTRFAYDAPQLSTTTLQAGSS---------- 767
Query: 664 VVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQ 723
VT V+N G AG ++++ R P++ LVGFQ
Sbjct: 768 -------------------LQVTTTVRNTGARAGDEVAQVYLQYPDRPQS-PLRSLVGFQ 807
Query: 724 SVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
V L A E+ + F L +LS G +E G + L VG
Sbjct: 808 RVHLAAGEQRTLTFNLD-ARALSDVDRSGQRAVEAGNYTLFVG 849
>gi|336404627|ref|ZP_08585320.1| hypothetical protein HMPREF0127_02633 [Bacteroides sp. 1_1_30]
gi|335941531|gb|EGN03384.1| hypothetical protein HMPREF0127_02633 [Bacteroides sp. 1_1_30]
Length = 861
Score = 310 bits (793), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 176/443 (39%), Positives = 246/443 (55%), Gaps = 36/443 (8%)
Query: 41 ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
+ P+ T+L QRA DL+ RLTL+EK+S + N++PAIPRLGI YEWW+EALHGV
Sbjct: 22 QLLPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRA 81
Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA---LYNAGQAI----G 153
G AT FPQ I ASF+ L Y + A EAR ++ + G
Sbjct: 82 GL----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQG 131
Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
+TFW PN+NIFRDPRWGRGQET GEDP +TG+ ++ VRG+QG + G + AC
Sbjct: 132 LTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGP-----EDAGYDKLHAC 186
Query: 214 CKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGI 272
KHF + W R+ FDA + +DL +TY P F+ V++ +MCAYNR G
Sbjct: 187 AKHFAVHSGPEW---NRHSFDAENIAPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGE 243
Query: 273 PSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDVN 330
P C LL + R +WG+ G + SDC A+S Y + P E A ++ G D+
Sbjct: 244 PCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHETHPDKEHASAAAVRTGTDLE 303
Query: 331 CGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSP 390
CGS AVK + E EID +L L + R LG + P + +I V+ S
Sbjct: 304 CGSEYASLAD-AVKAGLIDEKEIDISLKRLLTARFELGEMDEQPA---WAEIPTSVLNSK 359
Query: 391 AHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSIT 450
HQ LAL+ A++ +VLL+N + +LPL + ++ +A++GPNAN + GNY G ++T
Sbjct: 360 EHQALALRMARESLVLLQNKNNILPL--NTNLKIAVMGPNANDSVMQWGNYNGIPAHTVT 417
Query: 451 PLQALQNYVE--NTVYYPGCDTV 471
L+A++ + +Y PGCD V
Sbjct: 418 LLEAVRAKLPEGQIIYEPGCDRV 440
Score = 84.7 bits (208), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 83/300 (27%), Positives = 139/300 (46%), Gaps = 53/300 (17%)
Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
++ AV AD ++ G+ + E EE+ DR D+ LP Q++L+ + +A
Sbjct: 588 LNLAVKKVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKA 647
Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMT 587
KK V+ + G I +IL A YPG+AG A+ + ++G++NPGGRLP+T
Sbjct: 648 GKK-VVFINYSGSA--IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVT 704
Query: 588 WYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQN 646
+Y +D ++P D M+ GRTYR+ + + +FPFG GLSY+ ++Y +S+N
Sbjct: 705 FY-KDVNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTDFTYGEAKLSKN 755
Query: 647 KLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVK 706
+ + ++VV +TI V N G+ G+ V ++++
Sbjct: 756 TI----------AKGENVV-------------------LTIPVSNVGQRDGEEVVQVYLR 786
Query: 707 PARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
G P L F+ V + A + + L+ E M EGT+ L+ G
Sbjct: 787 RPGDKEG-PRYTLRAFKRVHIPAGKTESVAIPLTGENFEWFDAESNTMRPLEGTYELLYG 845
>gi|346725879|ref|YP_004852548.1| beta-glucosidase-related glycosidase [Xanthomonas axonopodis pv.
citrumelo F1]
gi|346650626|gb|AEO43250.1| Beta-glucosidase-related glycosidase [Xanthomonas axonopodis pv.
citrumelo F1]
Length = 889
Score = 310 bits (793), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 174/421 (41%), Positives = 242/421 (57%), Gaps = 32/421 (7%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
QRA LV++++ +EK++Q +N APAIPRLGIPAYEWWSE LHG+A G A
Sbjct: 39 QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------A 88
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGMTFWAPNINIFR 165
T FPQ I AAS+++ L ++G + EARA +N + G+T W+PNINIFR
Sbjct: 89 TVFPQSIGLAASWNTRLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFR 148
Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
DPRWGRG ET GEDP +TG+ AV ++RG+QG+ N + A KH + +
Sbjct: 149 DPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLN------HPRTIATPKHIAVH---SG 199
Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
R+ FD V+ +D+ TY P F + + +G+A +MCAYN ++G P+CA LL+
Sbjct: 200 PEPGRHGFDVDVSPRDVEATYTPAFRAAIVEGQAGSVMCAYNSLHGTPACAADWLLNGRV 259
Query: 286 RRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQ 345
R WGF G++ SDCDAV + + + LKAG D+NCG + A+ +
Sbjct: 260 RGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCG-HAYRELGTAIAR 318
Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
++ E+ +D++L LF+ R RLG P P+ ++GA V + AH+ LALQAA + IV
Sbjct: 319 GEVDEALLDQSLVRLFATRYRLGELEA-PRKDPYARLGAKDVDNAAHRALALQAAAESIV 377
Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL-QNYVENTVY 464
LLKN LPL LA+IGPNA++ L NY G S +TPL L Q + V
Sbjct: 378 LLKNDANTLPL--KAGTRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQVS 435
Query: 465 Y 465
Y
Sbjct: 436 Y 436
Score = 127 bits (320), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 93/283 (32%), Positives = 136/283 (48%), Gaps = 56/283 (19%)
Query: 496 GLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
GL E EEL DR D+ LP QQ L+ R A+A+ KP+++VL+ G V +
Sbjct: 626 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALN 684
Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWY--PQDYIKVPMTDMKM 603
+AK + +I+ A YPG++G A+A ++ GD NPGGRLP+T+Y +D DMK
Sbjct: 685 WAKTHAD--AIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFYRSTKDLPAYVSYDMK- 741
Query: 604 RPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQD 663
GRTYR+++G+ +F FG GLSY++++Y +S L S
Sbjct: 742 ---------GRTYRYFKGEPLFAFGYGLSYTRFAYDAPQLSTTTLQAGSS---------- 782
Query: 664 VVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQ 723
VT V+N G AG ++++ R P++ LVGFQ
Sbjct: 783 -------------------LQVTTTVRNTGARAGDEVAQVYLQYPDRPQS-PLRSLVGFQ 822
Query: 724 SVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
V L A E+ + F L +LS G +E G + L VG
Sbjct: 823 RVHLAAGEQRTLTFNLD-ARALSDVDRSGQRAVEAGNYTLFVG 864
>gi|78048767|ref|YP_364942.1| beta-glucosidase precursor [Xanthomonas campestris pv. vesicatoria
str. 85-10]
gi|78037197|emb|CAJ24942.1| beta-glucosidase precursor [Xanthomonas campestris pv. vesicatoria
str. 85-10]
Length = 889
Score = 310 bits (793), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 174/421 (41%), Positives = 242/421 (57%), Gaps = 32/421 (7%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
QRA LV++++ +EK++Q +N APAIPRLGIPAYEWWSE LHG+A G A
Sbjct: 39 QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------A 88
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGMTFWAPNINIFR 165
T FPQ I AAS+++ L ++G + EARA +N + G+T W+PNINIFR
Sbjct: 89 TVFPQSIGLAASWNTRLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFR 148
Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
DPRWGRG ET GEDP +TG+ AV ++RG+QG+ N + A KH + +
Sbjct: 149 DPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLN------HPRTIATPKHIAVH---SG 199
Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
R+ FD V+ +D+ TY P F + + +G+A +MCAYN ++G P+CA LL+
Sbjct: 200 PEPGRHGFDVDVSPRDVEATYTPAFRAAIVEGQAGSVMCAYNSLHGTPACAADWLLNGRV 259
Query: 286 RRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQ 345
R WGF G++ SDCDAV + + + LKAG D+NCG + A+ +
Sbjct: 260 RGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCG-HAYRELGTAIAR 318
Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
++ E+ +D++L LF+ R RLG P P+ ++GA V + AH+ LALQAA + IV
Sbjct: 319 GEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRALALQAAAESIV 377
Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL-QNYVENTVY 464
LLKN LPL LA+IGPNA++ L NY G S +TPL L Q + V
Sbjct: 378 LLKNDANTLPL--KAGTRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQVS 435
Query: 465 Y 465
Y
Sbjct: 436 Y 436
Score = 127 bits (320), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 93/283 (32%), Positives = 136/283 (48%), Gaps = 56/283 (19%)
Query: 496 GLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
GL E EEL DR D+ LP QQ L+ R A+A+ KP+++VL+ G V +
Sbjct: 626 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALN 684
Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWY--PQDYIKVPMTDMKM 603
+AK + +I+ A YPG++G A+A ++ GD NPGGRLP+T+Y +D DMK
Sbjct: 685 WAKTHAD--AIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFYRSTKDLPAYVSYDMK- 741
Query: 604 RPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQD 663
GRTYR+++G+ +F FG GLSY++++Y +S L S
Sbjct: 742 ---------GRTYRYFKGEPLFAFGYGLSYTRFAYDAPQLSTTTLQAGSS---------- 782
Query: 664 VVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQ 723
VT V+N G AG ++++ R P++ LVGFQ
Sbjct: 783 -------------------LQVTTTVRNTGARAGDEVAQVYLQYPDRPQS-PLRSLVGFQ 822
Query: 724 SVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
V L A E+ + F L +LS G +E G + L VG
Sbjct: 823 RVHLAAGEQRTLTFNLD-ARALSDVDRSGQRAVEAGNYTLFVG 864
>gi|15837447|ref|NP_298135.1| family 3 glycoside hydrolase [Xylella fastidiosa 9a5c]
gi|9105751|gb|AAF83655.1|AE003924_1 family 3 glycoside hydrolase [Xylella fastidiosa 9a5c]
Length = 882
Score = 309 bits (792), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 174/411 (42%), Positives = 240/411 (58%), Gaps = 31/411 (7%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
Q A LV+++TL EKI+Q +N+APAIPRLGIPAY+WWSE LHG+A G A
Sbjct: 32 QHAAALVAKMTLQEKITQTMNAAPAIPRLGIPAYDWWSEGLHGIARNGY----------A 81
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYN-AG-------QAIGMTFWAPNINIFR 165
T FPQ I AAS+++ L +G EARA +N AG + G+T W+PNINIFR
Sbjct: 82 TVFPQAIGLAASWNTDLLQHVGTVTSTEARAKFNLAGGPGKDHPRYAGLTLWSPNINIFR 141
Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
DPRWGRG ET GEDP +TG+ AVS++RG+QG+ + + A KHF + +
Sbjct: 142 DPRWGRGMETYGEDPYLTGQLAVSFIRGLQGN------IPDHPRTIATPKHFAVH---SG 192
Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
R+ FD V+ DL TY P F + + G A +MCAYN ++G P+CA LL+
Sbjct: 193 PEPGRHSFDVDVSAYDLEATYTPAFRAAIVDGHAGSVMCAYNALHGTPACASDWLLNTRL 252
Query: 286 RRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQ 345
R WGF+G++ SDCDA+ + + + A LK+G D+NCG+ + A+ +
Sbjct: 253 RNDWGFNGFVVSDCDAIDDMTRFHFFRQDNASASAAALKSGNDLNCGN-TYRDLNQAIAR 311
Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
+ E+ +D+AL LF+ R RLG P+ IG + +PAH+ LALQAA +V
Sbjct: 312 GDIDEALLDQALIRLFAARQRLGTLQPR-EHDPYATIGIKHIDTPAHRALALQAAVQSLV 370
Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQ 456
LLKNS LPL + +LA++GP+A+S L NY G S +TPL L+
Sbjct: 371 LLKNSGNTLPL--TPGTTLAVLGPDADSLTALEANYQGTSSTPVTPLTGLR 419
Score = 133 bits (334), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 99/312 (31%), Positives = 150/312 (48%), Gaps = 64/312 (20%)
Query: 471 VACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQEL 520
+A ++ + +A AD +V +GL E EEL DR + LP Q+ L
Sbjct: 594 LAPAAPQLAEAERAVAHADAIVAFVGLSPEVEGEELHIDTPGFSGGDRTTIDLPATQETL 653
Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
+ V + KP+I+VL+ G V + +A++ N +IL A YPG++G A+A+ + GD NP
Sbjct: 654 LQHV-KTTGKPLIVVLMSGSAVALNWAQHHAN--AILAAWYPGQSGGTAIAQALAGDVNP 710
Query: 581 GGRLPMTWY--PQD---YIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSK 635
GGRLP+T+Y QD YI MT GRTYR+++G+ ++PFG GLSY++
Sbjct: 711 GGRLPVTFYRSTQDLPPYISYDMT-------------GRTYRYFKGQPLYPFGYGLSYTQ 757
Query: 636 YSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFL-VTIGVKNHGE 694
++Y+ P+L T + L VT V+N G
Sbjct: 758 FTYE------------------------------APQLSTATLKAGDTLTVTAHVRNTGT 787
Query: 695 MAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLM 754
AG V L+++P P++ LVGF+ V L E + F L LS ++ G
Sbjct: 788 RAGDEVVQLYLEPPHSPQA-PLRNLVGFKRVTLRPGESRLLTFTLD-TRQLSSVQQTGQR 845
Query: 755 VIEEGTHFLVVG 766
+E G + L VG
Sbjct: 846 SVEAGHYHLFVG 857
>gi|423226625|ref|ZP_17213090.1| hypothetical protein HMPREF1062_05276 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392628884|gb|EIY22909.1| hypothetical protein HMPREF1062_05276 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 863
Score = 309 bits (791), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 184/472 (38%), Positives = 260/472 (55%), Gaps = 54/472 (11%)
Query: 13 LCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQL 72
LCLCF S +QPP+ NP+ L +RA DLV RLTL+EK + +
Sbjct: 14 LCLCFLSC------SQPPYK----NPA----------LSPEERANDLVGRLTLEEKAALM 53
Query: 73 VNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWY 132
N++PAIPRLGI AY+WW+EALHGV G AT FPQ I ASF++ L Y
Sbjct: 54 QNTSPAIPRLGIKAYDWWNEALHGVGRAGL----------ATVFPQAIGMGASFNNELLY 103
Query: 133 RIGQAIGLEARALYNA-------GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGK 185
+ A+ EARA + G+T W PNINIFRDPRWGRGQET GEDP +TG+
Sbjct: 104 DVFTAVSDEARAKNTEFSKEGGLKRYQGLTMWTPNINIFRDPRWGRGQETYGEDPYLTGQ 163
Query: 186 YAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLAD 244
++ VRG+QG G+ KL AC KH+ + W R+ F+A + +DL +
Sbjct: 164 MGMAVVRGLQGPE---GEKYDKLH--ACAKHYAVHSGPEW---NRHSFNAENIDPRDLWE 215
Query: 245 TYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSI 304
TY P F+ V++ +MCAYNR G P C LL + R +WG+ + SDC A+S
Sbjct: 216 TYLPAFKDLVQKAHVKEVMCAYNRFEGEPCCGSNRLLMQILRDEWGYKEIVVSDCWAISD 275
Query: 305 IYDAEGYAKSP--EDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFS 362
Y+ + + P + A + +G DV CG + AVK+ + E +ID +L L
Sbjct: 276 FYNKDAHETDPDKQHASAKAVLSGTDVECGDSYASLPE-AVKEGLIDEKQIDISLKRLMK 334
Query: 363 VRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSV 422
R LG + P+ + +I VV S H+ LAL+ A++ +VLL+N+ LLPL +K++
Sbjct: 335 ARFELGEMD-EPSQVSWAQIPYSVVDSKEHRELALRMARESLVLLQNNQSLLPL--NKNL 391
Query: 423 SLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENT--VYYPGCDTVA 472
+A++GPNAN + GNY G +IT L+ ++ Y+ + +Y PGCD +
Sbjct: 392 KVAVVGPNANDSVMQWGNYNGFPSHTITLLEGIREYLPESQIIYEPGCDLTS 443
Score = 97.4 bits (241), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 59/171 (34%), Positives = 86/171 (50%), Gaps = 21/171 (12%)
Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
+ + VD K AD ++ G+ E EE+ DR + LP Q L+ + +A
Sbjct: 590 LKQTVDKVKEADVIIFAGGISPAVEGEEMHVNIPGFKGGDRETIELPSIQSRLLAELKKA 649
Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMT 587
KK ++ V G + +T + +IL A YPG+AG A+A V+FGD+NP GRLP+T
Sbjct: 650 GKK-IVFVNFSGSAIALT--PESKTCDAILQAWYPGQAGGTAIANVLFGDYNPAGRLPVT 706
Query: 588 WYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSY 638
+Y D M+ GRTYR+ +FPFG GLSY+ + Y
Sbjct: 707 FYKSTKQLPDFEDYSMK--------GRTYRYMTENPLFPFGHGLSYTTFQY 749
>gi|58581402|ref|YP_200418.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae KACC
10331]
gi|58425996|gb|AAW75033.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae KACC
10331]
Length = 889
Score = 309 bits (791), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 173/421 (41%), Positives = 245/421 (58%), Gaps = 32/421 (7%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
QRA DLV+ ++ +EK++Q +N APAIPRLGIPAYEWWSE LHG+A G A
Sbjct: 39 QRAADLVAHMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------A 88
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYN-AGQA-------IGMTFWAPNINIFR 165
T FPQ I AAS++++L ++G + EARA +N AG+ G+T W+PNINIFR
Sbjct: 89 TVFPQAIGLAASWNTHLMQQVGTVVSTEARAKFNQAGRPGNDHKRYAGLTIWSPNINIFR 148
Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
DPRWGRG ET GEDP +TG+ AV ++ G+QG+ + + A KH + +
Sbjct: 149 DPRWGRGMETYGEDPFLTGQMAVGFIHGLQGEDLD------HPRTIATPKHLAVH---SG 199
Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
R+ FD V+ +D+ TY P F + + +G+A +MCAYN ++G P+CA L++
Sbjct: 200 PEPGRHGFDVDVSPRDVEATYTPAFRAAIVEGQAGAVMCAYNSLHGTPACAADWLINGRV 259
Query: 286 RRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQ 345
R WGF G++ SDCDAV + + + LKAG D+NCG + A+ +
Sbjct: 260 RGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCG-HAYRELGTAIAR 318
Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
++ E+ +D++L LF+ R RLG P P+ ++GA V + H+ LALQAA + IV
Sbjct: 319 GEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAQHRALALQAAAESIV 377
Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL-QNYVENTVY 464
LLKN+ LPL + LA+IGPNA++ L NY G S +TPL L Q + V
Sbjct: 378 LLKNNANTLPL--NAGTRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQVS 435
Query: 465 Y 465
Y
Sbjct: 436 Y 436
Score = 130 bits (328), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 95/283 (33%), Positives = 140/283 (49%), Gaps = 56/283 (19%)
Query: 496 GLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
GL E EEL DR D+ LP QQ L+ R A+A+ KP+++VL+ G V +
Sbjct: 626 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALN 684
Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWY--PQDYIKVPMTDMKM 603
+AK + +I+ A YPG++G A+A ++ GD NPGGRLP+T+Y +D DMK
Sbjct: 685 WAKTHAD--AIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFYRSTKDLPAYVSYDMK- 741
Query: 604 RPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQD 663
GRTYR+++G+ +FPFG GLSY++++Y +S + Q+ ST
Sbjct: 742 ---------GRTYRYFKGEPLFPFGYGLSYTRFAYDAPQLSSTAV---QAGST------- 782
Query: 664 VVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQ 723
VT V+N G AG ++++ R P++ LVGFQ
Sbjct: 783 -------------------LQVTTTVRNTGARAGDEVAQVYLQYPDRPQS-PLRSLVGFQ 822
Query: 724 SVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
V L A E+ + F L +LS G +E G + L VG
Sbjct: 823 RVHLAAGEQRTLTFNLD-ARALSDVDRSGQRAVEAGNYTLFVG 864
>gi|189468358|ref|ZP_03017143.1| hypothetical protein BACINT_04755 [Bacteroides intestinalis DSM
17393]
gi|189436622|gb|EDV05607.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 865
Score = 308 bits (790), Expect = 6e-81, Method: Compositional matrix adjust.
Identities = 177/444 (39%), Positives = 255/444 (57%), Gaps = 33/444 (7%)
Query: 44 PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
P+ L S+RA DL+ R+TL+EKISQ+ N +PAI RLGIPAY+WW+EALHGVA GK
Sbjct: 24 PYRNPNLSPSERAWDLLKRMTLEEKISQMKNGSPAIERLGIPAYDWWNEALHGVARAGK- 82
Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN-------AGQAIGMTF 156
AT FPQ I AA+FD+ Y + EARA Y+ G G+TF
Sbjct: 83 ---------ATVFPQAIGLAATFDNQAVYETFDIVSDEARAKYHDFQRKGERGGYKGLTF 133
Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKH 216
W PNINI+RDPRWGRG ET GEDP +T ++ V+G+QG+ GK +A AC KH
Sbjct: 134 WTPNINIYRDPRWGRGMETYGEDPYLTSLMGLAVVKGLQGN--GAGKYD---KAHACAKH 188
Query: 217 FTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSC 275
+ + W R+ FD++ ++ +DL +TY P F++ V +G+ +MCAYNR G P C
Sbjct: 189 YAVHSGPEW---NRHSFDSKNISQRDLWETYLPAFKTLVTEGKVKEVMCAYNRFEGEPCC 245
Query: 276 ADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDVNCGS 333
+++ LL + R WG+ + SDC A+ Y + P E A D + +G D+ CG
Sbjct: 246 SNKQLLIRILREDWGYDDIVVSDCGAIGDFYYPNHHETHPSAEAASADAVVSGTDLECGG 305
Query: 334 FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQ 393
+ AVK+ + E +I+ ++ L R +LG+F+ + T+ + +I VV S H
Sbjct: 306 SYSSLNE-AVKKGLITEDKINESVFRLLRARFQLGMFD-DDTLVSWSEIPYSVVESKEHV 363
Query: 394 VLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQ 453
AL+ A+ +VLL N + LPL KS +A++GPNAN + L NY G +S+T L+
Sbjct: 364 DKALEMARKSMVLLTNKNNSLPLSKSIR-KVAVLGPNANDSVMLWANYNGFPTKSVTILE 422
Query: 454 ALQNYV-ENTVYYP-GCDTVACSS 475
+++ + E VYY GCD V+ +
Sbjct: 423 GIRSKLPEGAVYYEKGCDFVSTQT 446
Score = 113 bits (282), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 78/259 (30%), Positives = 121/259 (46%), Gaps = 51/259 (19%)
Query: 491 VVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGG 540
++ + GL E EE+ DR ++ LP Q+E++ + + K PVI V+ G
Sbjct: 604 IIFVGGLSSALEGEEMPVDLPGFKKGDRTNIDLPRVQEEMLKALKKTGK-PVIFVVCSGS 662
Query: 541 PVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTD 600
+ + + N+ ++L A YPG+ G A+A+V+FGD+NP GRLP+T+Y D D
Sbjct: 663 TLALPWEA--ENLDAMLEAWYPGQQGGTAVADVLFGDYNPAGRLPLTFYASDSDLPDFED 720
Query: 601 MKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVE 660
M RTYR+++GK +FPFG GLSY+ + Y V + + S +
Sbjct: 721 YNMS--------NRTYRYFKGKPLFPFGYGLSYTTFDYGKAKVDKKSIKTGDSMT----- 767
Query: 661 NQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLV 720
+TI +KN G+M G V ++++ G PIK L
Sbjct: 768 ------------------------LTIPLKNTGKMDGDEVVQVYLRNPADKEG-PIKMLR 802
Query: 721 GFQSVILNAKEKAEIVFEL 739
F+ V L A + I EL
Sbjct: 803 AFRRVSLKAGQAENIQIEL 821
>gi|431798021|ref|YP_007224925.1| beta-glucosidase-like glycosyl hydrolase [Echinicola vietnamensis
DSM 17526]
gi|430788786|gb|AGA78915.1| beta-glucosidase-like glycosyl hydrolase [Echinicola vietnamensis
DSM 17526]
Length = 906
Score = 308 bits (789), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 178/428 (41%), Positives = 243/428 (56%), Gaps = 30/428 (7%)
Query: 40 TETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAG 99
T F F +R LV +++L+EK+SQ++N++PAIPRL +P Y WW+E LHGVA
Sbjct: 47 TLDFSFLDMEKNFEERVDILVDQMSLEEKVSQMMNASPAIPRLKVPEYNWWNECLHGVAR 106
Query: 100 VGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAI 152
G AT FPQ I AASFD L IG I EARA ++ G
Sbjct: 107 AGY----------ATVFPQSISVAASFDKNLMKDIGSVISDEARAKHHEFIRNGKRGIYT 156
Query: 153 GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASA 212
G+ FW+PNINIFRDPRWGRG ET GEDP +TG+ A ++ G+Q + GK L+ A
Sbjct: 157 GLDFWSPNINIFRDPRWGRGHETYGEDPYLTGELASQFIEGLQD---SDGKY---LKTIA 210
Query: 213 CCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGI 272
KHF + + R+ FD V+ +DL +TY P F VK+ + IM AYNR G
Sbjct: 211 TSKHFAVH---SGPEPLRHTFDVDVSDRDLYETYLPAFRKTVKEAKVYSIMGAYNRFRG- 266
Query: 273 PSCADRN-LLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC 331
SC+ + LL++ R QWGF GY+ SDC A+ I+ A + +A + G D+NC
Sbjct: 267 ESCSGHDFLLNQLLREQWGFEGYVVSDCGAIQDIHTGHKIASTAAEAAAIGVSGGCDLNC 326
Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
G++ H AV + + E EID A+ LF R RLG+F+ + + +I +VCS A
Sbjct: 327 GNYY-THLTEAVAEGLISEEEIDIAVKRLFLARFRLGMFDPEEAVS-YAQIPFGIVCSEA 384
Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITP 451
H LA QAAQ +VLLKN LLPL K +A+IGPNA++ ++LLGNY G + +T
Sbjct: 385 HNTLARQAAQKSMVLLKNQKNLLPLSVDKIKRIAVIGPNADNVESLLGNYHGIPKKPVTF 444
Query: 452 LQALQNYV 459
L +++ V
Sbjct: 445 LDGIKHKV 452
Score = 162 bits (409), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 112/304 (36%), Positives = 152/304 (50%), Gaps = 52/304 (17%)
Query: 476 ASIDKAVDIAKGADHVVLMMGLDQTQEKEELD----------RVDLVLPGRQQELITRVA 525
+ ID+AV +AK AD V+++GL Q E E +D R + LP +Q+ L+ V
Sbjct: 615 SKIDEAVAMAKSADLAVVVLGLSQRLEGESMDVVTPGFDRGDRTAITLPAQQEALLKAVK 674
Query: 526 EAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLP 585
E K PVILVL G + I +AK N+ +I+ AGYPGE G ALA+V+FGD+NP GRLP
Sbjct: 675 ETGK-PVILVLNAGSAMAINWAK--ENVDAIISAGYPGEEGGNALADVVFGDYNPAGRLP 731
Query: 586 MTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQ 645
+T+Y P D M+ GRTYR++EGK ++PFG GLSY+++SYK V
Sbjct: 732 ITYYQSVEDLPPFEDYDMK--------GRTYRYFEGKPLYPFGYGLSYTRFSYKDLEVPA 783
Query: 646 NKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFV 705
V D V +++ V N G AG V L++
Sbjct: 784 K------------VNAGDPVQ------------------ISVTVTNIGSRAGDEVVQLYL 813
Query: 706 KPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVV 765
RPI+QL GFQ + L E + F LS LS + VIEEG + V
Sbjct: 814 NDKEASTMRPIRQLEGFQRIHLKPGESKVVNFTLS-ARQLSMINGESKRVIEEGVFSIHV 872
Query: 766 GDEE 769
G E+
Sbjct: 873 GGEQ 876
>gi|224537384|ref|ZP_03677923.1| hypothetical protein BACCELL_02262 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521009|gb|EEF90114.1| hypothetical protein BACCELL_02262 [Bacteroides cellulosilyticus
DSM 14838]
Length = 863
Score = 308 bits (789), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 184/472 (38%), Positives = 260/472 (55%), Gaps = 54/472 (11%)
Query: 13 LCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQL 72
LCLCF S +QPP+ NP+ L +RA DLV RLTL+EK + +
Sbjct: 14 LCLCFLSC------SQPPYK----NPA----------LSPEERANDLVGRLTLEEKAALM 53
Query: 73 VNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWY 132
N++PAIPRLGI AY+WW+EALHGV G AT FPQ I ASF++ L Y
Sbjct: 54 QNTSPAIPRLGIKAYDWWNEALHGVGRAGL----------ATVFPQAIGMGASFNNELLY 103
Query: 133 RIGQAIGLEARALYNA-------GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGK 185
+ A+ EARA + G+T W PNINIFRDPRWGRGQET GEDP +TG+
Sbjct: 104 DVFTAVSDEARAKNTEFSKEGGLKRYQGLTMWTPNINIFRDPRWGRGQETYGEDPYLTGQ 163
Query: 186 YAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLAD 244
++ VRG+QG G+ KL AC KH+ + W R+ F+A + +DL +
Sbjct: 164 MGMAVVRGLQGPE---GEKYDKLH--ACAKHYAVHSGPEW---NRHSFNAENIDPRDLWE 215
Query: 245 TYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSI 304
TY P F++ V++ +MCAYNR G P C LL + R +WG+ + SDC A+S
Sbjct: 216 TYLPAFKNLVQKAHVKEVMCAYNRFEGEPCCGSNRLLMQILRDEWGYKEIVVSDCWAISD 275
Query: 305 IYDAEGYAKSPED--AVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFS 362
Y+ + P+ A + +G DV CG + AVK+ + E +ID +L L
Sbjct: 276 FYNKGAHETDPDKQHASAKAVLSGTDVECGDSYASLPE-AVKEGLIDEKQIDISLKRLMK 334
Query: 363 VRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSV 422
R LG + P+ + +I VV S H+ LAL+ A++ +VLL+N+ LLPL +K++
Sbjct: 335 ARFELGEMD-EPSQVSWAQIPYSVVDSKEHRELALRMARESLVLLQNNQSLLPL--NKNL 391
Query: 423 SLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENT--VYYPGCDTVA 472
+A++GPNAN + GNY G +IT L+ ++ Y+ + +Y PGCD +
Sbjct: 392 KVAVVGPNANDSVMQWGNYNGFPSHTITLLEGIREYLPESQIIYEPGCDLTS 443
Score = 97.4 bits (241), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 59/171 (34%), Positives = 86/171 (50%), Gaps = 21/171 (12%)
Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
+ + VD K AD ++ G+ E EE+ DR + LP Q L+ + +A
Sbjct: 590 LKQTVDKVKEADVIIFAGGISPAVEGEEMHVNIPGFKGGDRETIELPSIQSRLLAELKKA 649
Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMT 587
KK ++ V G + +T + +IL A YPG+AG A+A V+FGD+NP GRLP+T
Sbjct: 650 GKK-IVFVNFSGSAIALT--PESKTCDAILQAWYPGQAGGTAIANVLFGDYNPAGRLPVT 706
Query: 588 WYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSY 638
+Y D M+ GRTYR+ +FPFG GLSY+ + Y
Sbjct: 707 FYKSTKQLPDFEDYSMK--------GRTYRYMTENPLFPFGHGLSYTTFQY 749
>gi|418518029|ref|ZP_13084183.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB1386]
gi|410705279|gb|EKQ63755.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB1386]
Length = 886
Score = 308 bits (788), Expect = 9e-81, Method: Compositional matrix adjust.
Identities = 174/421 (41%), Positives = 241/421 (57%), Gaps = 32/421 (7%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
QRA LV++++ +EK++Q +N APAIPRLGIPAYEWWSE LHG+A G A
Sbjct: 36 QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------A 85
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGMTFWAPNINIFR 165
T FPQ I AAS+++ L ++G + EARA +N + G+T W+PNINIFR
Sbjct: 86 TVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSPNINIFR 145
Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
DPRWGRG ET GEDP +TG+ AV ++RG+QG+ N + A KH + +
Sbjct: 146 DPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLN------HPRTIATPKHIAVH---SG 196
Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
R+ FD V+ D+ TY P F + + +G+A +MCAYN ++G P CA LL+
Sbjct: 197 PEPGRHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNALHGTPVCAADWLLNGRV 256
Query: 286 RRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQ 345
R WGF G++ SDCDA+ + + + V LKAG D+NCG + A+ +
Sbjct: 257 RGDWGFKGFVVSDCDAIDDMTQFHYFRPDNAGSSVAALKAGHDLNCG-HAYRELGTAIAR 315
Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
++ E+ +D++L LF+ R RLG P P+ ++GA V + AH+ LALQAA + IV
Sbjct: 316 GEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNVAHRALALQAAAESIV 374
Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL-QNYVENTVY 464
LLKN LPL LA+IGPNA++ L NY G S +TPL L Q + V
Sbjct: 375 LLKNDANTLPL--RAGTRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQVS 432
Query: 465 Y 465
Y
Sbjct: 433 Y 433
Score = 128 bits (322), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 94/284 (33%), Positives = 138/284 (48%), Gaps = 58/284 (20%)
Query: 496 GLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
GL E EEL DR D+ LP QQ L+ R A+A+ KP+++VL+ G V +
Sbjct: 623 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALN 681
Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWY--PQDYIKVPMTDMKM 603
+AK + +I+ A YPG++G A+A ++ GD NPGGRLP+T+Y +D DMK
Sbjct: 682 WAKTHAD--AIMAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFYRSTKDLPAYVSYDMK- 738
Query: 604 RPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQD 663
GRTYR+++G+ +FPFG GLSY++++Y
Sbjct: 739 ---------GRTYRYFKGEPLFPFGYGLSYTRFAY------------------------- 764
Query: 664 VVHYKSVPELGTEFCETRKFLVTIG-VKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGF 722
P+L T + L I V+N G AG ++++ R P++ LVGF
Sbjct: 765 -----DAPQLSTTTLQAGNPLQVIATVRNTGARAGDEVAQVYLQYPDRPQS-PLRSLVGF 818
Query: 723 QSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
Q V L A E+ + F L +LS G +E G + L VG
Sbjct: 819 QRVHLAAGEQRTLTFHLD-ARALSDVDRSGQRAVEAGDYTLFVG 861
>gi|381169747|ref|ZP_09878910.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
citri pv. mangiferaeindicae LMG 941]
gi|380689765|emb|CCG35397.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
citri pv. mangiferaeindicae LMG 941]
Length = 874
Score = 307 bits (787), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 177/437 (40%), Positives = 243/437 (55%), Gaps = 32/437 (7%)
Query: 38 PSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGV 97
P P K QRA LV++++ +EK++Q +N APAIPRLGIPAYEWWSE LHG+
Sbjct: 8 PCLALAPPAKPAGSPEQRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGI 67
Query: 98 AGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG-------- 149
A G AT FPQ I AAS+++ L ++G + EARA +N
Sbjct: 68 ARNGY----------ATVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQ 117
Query: 150 QAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQ 209
+ G+T W+PNINIFRDPRWGRG ET GEDP +TG+ AV ++RG+QG+ N +
Sbjct: 118 RYAGLTIWSPNINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLN------HPR 171
Query: 210 ASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRV 269
A KH + + R+ FD V+ D+ TY P F + + +G+A +MCAYN +
Sbjct: 172 TIATPKHIAVH---SGPEPGRHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNAL 228
Query: 270 NGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDV 329
+G P CA LL+ R WGF G++ SDCDAV + + + LKAG D+
Sbjct: 229 HGTPVCAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDL 288
Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
NCG + A+ + ++ E+ +D++L LF+ R RLG P P+ ++GA V +
Sbjct: 289 NCG-HAYRELGTAIARGEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDN 346
Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
AH+ LALQAA + IVLLKN LPL LA+IGPNA++ L NY G S +
Sbjct: 347 AAHRALALQAAAESIVLLKNDANTLPL--RAGTRLAVIGPNADALAALEANYQGTSSAPV 404
Query: 450 TPLQAL-QNYVENTVYY 465
TPL L Q + V Y
Sbjct: 405 TPLLGLRQRFGAQQVSY 421
Score = 130 bits (328), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 93/284 (32%), Positives = 138/284 (48%), Gaps = 54/284 (19%)
Query: 494 MMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVD 543
+GL E EEL DR D+ LP QQ L+ R A+A+ KP+++VL+ G V
Sbjct: 609 FVGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVA 667
Query: 544 ITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKM 603
+ +AK + +I+ A YPG++G A+A ++ GD NPGGRLP+T+Y P M
Sbjct: 668 LNWAKMHAD--AIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFYRSTKDLPPYVSYDM 725
Query: 604 RPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQD 663
+ GRTYR+++G+ +FPFG GLSY++++Y
Sbjct: 726 K--------GRTYRYFKGEPLFPFGYGLSYTRFAY------------------------- 752
Query: 664 VVHYKSVPELGTEFCETRKFL-VTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGF 722
P+L T + L VT V+N G AG ++++ R P++ LVGF
Sbjct: 753 -----DAPQLSTTTLQAGNPLQVTATVRNTGARAGDEVAQVYLQYPDRPQS-PLRSLVGF 806
Query: 723 QSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
Q V L A E+ + F L +LS G +E G + L VG
Sbjct: 807 QRVHLAAGEQRTLTFHLD-ARALSDVDRSGQRAVEAGDYTLFVG 849
>gi|418519424|ref|ZP_13085476.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB2388]
gi|410704868|gb|EKQ63347.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB2388]
Length = 886
Score = 307 bits (787), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 174/421 (41%), Positives = 240/421 (57%), Gaps = 32/421 (7%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
QRA LV++++ +EK++Q +N APAIPRLGIPAYEWWSE LHG+A G A
Sbjct: 36 QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------A 85
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGMTFWAPNINIFR 165
T FPQ I AAS+++ L ++G + EARA +N + G+T W+PNINIFR
Sbjct: 86 TVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSPNINIFR 145
Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
DPRWGRG ET GEDP +TG+ AV ++RG+QG+ N + A KH + +
Sbjct: 146 DPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLN------HPRTIATPKHIAVH---SG 196
Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
R+ FD V+ D+ TY P F + + +G+A +MCAYN ++G P CA LL+
Sbjct: 197 PEPGRHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNALHGTPVCAADWLLNGRV 256
Query: 286 RRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQ 345
R WGF G++ SDCDAV + + + LKAG D+NCG + A+ +
Sbjct: 257 RGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCG-HAYRELGTAIAR 315
Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
++ E+ +D++L LF+ R RLG P P+ ++GA V + AH+ LALQAA + IV
Sbjct: 316 GEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRALALQAAAESIV 374
Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL-QNYVENTVY 464
LLKN LPL LA+IGPNA++ L NY G S +TPL L Q + V
Sbjct: 375 LLKNDANTLPL--RAGTRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQVS 432
Query: 465 Y 465
Y
Sbjct: 433 Y 433
Score = 130 bits (326), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 95/286 (33%), Positives = 140/286 (48%), Gaps = 58/286 (20%)
Query: 494 MMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVD 543
+GL E EEL DR D+ LP QQ L+ R A+A+ KP+++VL+ G V
Sbjct: 621 FVGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQTLLER-AKASGKPLVVVLMSGSAVA 679
Query: 544 ITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWY--PQDYIKVPMTDM 601
+ +AK + +I+ A YPG++G A+A ++ GD NPGGRLP+T+Y +D DM
Sbjct: 680 LNWAKTHAD--AIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFYRSTKDLPAYVSYDM 737
Query: 602 KMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVEN 661
K GRTYR+++G+ +FPFG GLSY++++Y
Sbjct: 738 K----------GRTYRYFKGEPLFPFGYGLSYTRFAY----------------------- 764
Query: 662 QDVVHYKSVPELGTEFCETRKFL-VTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLV 720
P+L T + L VT V+N G AG ++++ R P++ LV
Sbjct: 765 -------DAPQLSTTTLQAGNPLQVTATVRNTGARAGDEVAQVYLQYPDRPQS-PLRSLV 816
Query: 721 GFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
GFQ V L A E+ + F L +LS G +E G + L VG
Sbjct: 817 GFQRVHLAAGEQRTLTFHLD-ARALSDVDRSGQRAVEAGDYTLFVG 861
>gi|21243803|ref|NP_643385.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. citri str.
306]
gi|21109396|gb|AAM37921.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. citri str.
306]
Length = 886
Score = 307 bits (786), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 174/421 (41%), Positives = 240/421 (57%), Gaps = 32/421 (7%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
QRA LV++++ +EK++Q +N APAIPRLGIPAYEWWSE LHG+A G A
Sbjct: 36 QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------A 85
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGMTFWAPNINIFR 165
T FPQ I AAS+++ L ++G + EARA +N + G+T W+PNINIFR
Sbjct: 86 TVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSPNINIFR 145
Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
DPRWGRG ET GEDP +TG+ AV ++RG+QG+ N + A KH + +
Sbjct: 146 DPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLN------HPRTIATPKHIAVH---SG 196
Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
R+ FD V+ D+ TY P F + + +G+A +MCAYN ++G P CA LL+
Sbjct: 197 PEPGRHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNALHGTPVCAADWLLNGRV 256
Query: 286 RRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQ 345
R WGF G++ SDCDAV + + + LKAG D+NCG + A+ +
Sbjct: 257 RGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCG-HAYRELGTAIAR 315
Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
++ E+ +D++L LF+ R RLG P P+ ++GA V + AH+ LALQAA + IV
Sbjct: 316 GEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRALALQAAAESIV 374
Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL-QNYVENTVY 464
LLKN LPL LA+IGPNA++ L NY G S +TPL L Q + V
Sbjct: 375 LLKNDANTLPL--RAGTRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQVS 432
Query: 465 Y 465
Y
Sbjct: 433 Y 433
Score = 131 bits (329), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 93/284 (32%), Positives = 138/284 (48%), Gaps = 54/284 (19%)
Query: 494 MMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVD 543
+GL E EEL DR D+ LP QQ L+ R A+A+ KP+++VL+ G V
Sbjct: 621 FVGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVA 679
Query: 544 ITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKM 603
+ +AK + +I+ A YPG++G A+A ++ GD NPGGRLP+T+Y P M
Sbjct: 680 LNWAKMHAD--AIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFYRSTKDLPPYVSYDM 737
Query: 604 RPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQD 663
+ GRTYR+++G+ +FPFG GLSY++++Y
Sbjct: 738 K--------GRTYRYFKGEPLFPFGYGLSYTRFAY------------------------- 764
Query: 664 VVHYKSVPELGTEFCETRKFL-VTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGF 722
P+L T + L VT V+N G AG ++++ R P++ LVGF
Sbjct: 765 -----DAPQLSTTTLQAGNPLQVTATVRNTGARAGDEVAQVYLQYPDRPQS-PLRSLVGF 818
Query: 723 QSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
Q V L A E+ + F L +LS G +E G + L VG
Sbjct: 819 QRVHLAAGEQRTLTFHLD-ARALSDVDRSGQRAVEAGDYTLFVG 861
>gi|94969405|ref|YP_591453.1| Beta-glucosidase [Candidatus Koribacter versatilis Ellin345]
gi|94551455|gb|ABF41379.1| Beta-glucosidase [Candidatus Koribacter versatilis Ellin345]
Length = 902
Score = 307 bits (786), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 182/464 (39%), Positives = 256/464 (55%), Gaps = 40/464 (8%)
Query: 30 PFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEW 89
P +P+T + T P ++RA DLV R+TLDEK +QL + A AIPRLG+P Y+
Sbjct: 25 PIQLPAQSPATPVY--RDATRPANERAHDLVQRMTLDEKAAQLEDWATAIPRLGVPDYQT 82
Query: 90 WSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG 149
WSEALHGVA G AT FPQ I AA++D+ + ++G I EAR YN
Sbjct: 83 WSEALHGVARAGH----------ATVFPQAIGMAATWDTEMVKQMGDVISTEARGKYNEA 132
Query: 150 QA-------IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGG 202
Q G+TFW+PNINIFRDPRWGRGQET GEDP +TGK ++++ GVQG
Sbjct: 133 QREGNHRIFWGLTFWSPNINIFRDPRWGRGQETYGEDPFLTGKMGIAFIDGVQGPD---- 188
Query: 203 KLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGI 262
+A A KHF + + R+ FD +V+ +DL +TY F + V G +
Sbjct: 189 --AAHPKAVATSKHFAVHSGPE---SLRHGFDVKVSPRDLEETYLAAFRATVTDGHVKSV 243
Query: 263 MCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE--DAVV 320
MCAYN V+G+ +CA++ LL + ++ WGF G++ SDC A+ + +G+ +P+ A
Sbjct: 244 MCAYNAVDGMGACANKMLLEEHLKQAWGFKGFVVSDCGAIMDV--TQGHKNAPDIVHAAA 301
Query: 321 DVLKAGMDVNCGSFLQKHTK--AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQP 378
L AG D++C + AV++ + E + RA L++ R LG+F+ P P
Sbjct: 302 ISLAAGTDLSCSIWEPGFNTLADAVRKGLVTEDMVTRAAERLYAARFELGMFD-EPGSNP 360
Query: 379 FGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLL 438
KI V S H+ AL+AA++ IVLLKN GLLPL +K++ A+IGP A +L
Sbjct: 361 NDKIDMSQVASEEHRAEALKAAEESIVLLKND-GLLPLKNAKTI--AVIGPTAELLASLE 417
Query: 439 GNYAGPSCRSITPLQAL--QNYVENTVYYPGCDTVACSSASIDK 480
GNY G R +TPL + Q EN Y G A + + +
Sbjct: 418 GNYNGQPVRPVTPLDGIVKQFGAENVRYAQGSSLAAGAPVPVPR 461
Score = 115 bits (287), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 88/284 (30%), Positives = 131/284 (46%), Gaps = 53/284 (18%)
Query: 496 GLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
GL E EE+ DR + LP Q++L+ + AA KPV++V L G V +
Sbjct: 640 GLSPNLEGEEMPIKIEGFSGGDRTSIDLPATQEKLLEALG-AAGKPVVVVNLSGSAVALN 698
Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRP 605
+A +++ G+IL A YPG G A+A+ + G+ NP GRLP+T+Y T+ M+
Sbjct: 699 WA--NQHAGAILQAWYPGVEGGTAIAKTLAGESNPAGRLPVTFYASVQDLPAFTEYAMK- 755
Query: 606 QATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVV 665
RTYR+Y GK ++ FG GLSYS + Y ++ + +S
Sbjct: 756 -------NRTYRYYAGKPLWGFGFGLSYSTFKYGEVKLASTSVDAGKS------------ 796
Query: 666 HYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSV 725
T+ V N ++AG V ++K ++G P LVGFQ V
Sbjct: 797 -----------------LTATVTVTNTSQVAGDEVVEAYLKTPQKGG--PSHSLVGFQRV 837
Query: 726 ILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEE 769
LN E E+ E+SP SLS + G I G + L +G +
Sbjct: 838 PLNPGESREVAIEVSP-RSLSAVDDSGKRSILAGEYRLSIGSTQ 880
>gi|390992294|ref|ZP_10262532.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
axonopodis pv. punicae str. LMG 859]
gi|372552957|emb|CCF69507.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
axonopodis pv. punicae str. LMG 859]
Length = 886
Score = 307 bits (786), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 174/421 (41%), Positives = 240/421 (57%), Gaps = 32/421 (7%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
QRA LV++++ +EK++Q +N APAIPRLGIPAYEWWSE LHG+A G A
Sbjct: 36 QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------A 85
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGMTFWAPNINIFR 165
T FPQ I AAS+++ L ++G + EARA +N + G+T W+PNINIFR
Sbjct: 86 TVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSPNINIFR 145
Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
DPRWGRG ET GEDP +TG+ AV ++RG+QG+ N + A KH + +
Sbjct: 146 DPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLN------HPRTIATPKHIAVH---SG 196
Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
R+ FD V+ D+ TY P F + + +G+A +MCAYN ++G P CA LL+
Sbjct: 197 PEPGRHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNALHGTPVCAADWLLNGRV 256
Query: 286 RRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQ 345
R WGF G++ SDCDAV + + + LKAG D+NCG + A+ +
Sbjct: 257 RGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCG-HAYRELGTAIAR 315
Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
++ E+ +D++L LF+ R RLG P P+ ++GA V + AH+ LALQAA + IV
Sbjct: 316 GEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRALALQAAAESIV 374
Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL-QNYVENTVY 464
LLKN LPL LA+IGPNA++ L NY G S +TPL L Q + V
Sbjct: 375 LLKNDANTLPL--RAGTRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQVS 432
Query: 465 Y 465
Y
Sbjct: 433 Y 433
Score = 130 bits (326), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 95/286 (33%), Positives = 140/286 (48%), Gaps = 58/286 (20%)
Query: 494 MMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVD 543
+GL E EEL DR D+ LP QQ L+ R A+A+ KP+++VL+ G V
Sbjct: 621 FVGLSPDVEGEELRIDVPGFDGGDRNDIALPAAQQTLLER-AKASGKPLVVVLMSGSAVA 679
Query: 544 ITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWY--PQDYIKVPMTDM 601
+ +AK + +I+ A YPG++G A+A ++ GD NPGGRLP+T+Y +D DM
Sbjct: 680 LNWAKTHAD--AIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFYRSTKDLPAYVSYDM 737
Query: 602 KMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVEN 661
K GRTYR+++G+ +FPFG GLSY++++Y
Sbjct: 738 K----------GRTYRYFKGEPLFPFGYGLSYTRFAY----------------------- 764
Query: 662 QDVVHYKSVPELGTEFCETRKFL-VTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLV 720
P+L T + L VT V+N G AG ++++ R P++ LV
Sbjct: 765 -------DAPQLSTTTLQAGNPLQVTATVRNTGARAGDEVAQVYLQYPDRPQS-PLRSLV 816
Query: 721 GFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
GFQ V L A E+ + F L +LS G +E G + L VG
Sbjct: 817 GFQRVHLAAGEQRTLTFNLD-ARALSDVDRSGQRAVEAGNYTLFVG 861
>gi|294665226|ref|ZP_06730524.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
gi|292605014|gb|EFF48367.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
Length = 886
Score = 307 bits (786), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 173/421 (41%), Positives = 241/421 (57%), Gaps = 32/421 (7%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
QRA LV++++ +EK++Q +N APAIPRLGIPAYEWWSE LHG+A G A
Sbjct: 36 QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------A 85
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGMTFWAPNINIFR 165
T FPQ I AAS+++ L ++G + EARA +N + G+T W+PNINIFR
Sbjct: 86 TVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSPNINIFR 145
Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
DPRWGRG ET GEDP +TG+ AV ++RG+QG+ + + A KH + +
Sbjct: 146 DPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLD------HPRTIATPKHIAVH---SG 196
Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
R+ FD V+ D+ TY P F + + +G+A +MCAYN ++G P+CA LL+
Sbjct: 197 PEPGRHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACAADWLLNGRV 256
Query: 286 RRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQ 345
R WGF G++ SDCDAV + + + LKAG D+NCG + A+++
Sbjct: 257 RGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCG-HAYRDLGTAIER 315
Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
+ E+ +D++L LF+ R RLG P P+ ++GA V + AH+ LALQAA + IV
Sbjct: 316 GDVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRALALQAAAESIV 374
Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL-QNYVENTVY 464
LLKN LPL LA+IGPNA++ L NY G S +TPL L Q + V
Sbjct: 375 LLKNDANTLPL--KAGTRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQVS 432
Query: 465 Y 465
Y
Sbjct: 433 Y 433
Score = 130 bits (326), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 95/285 (33%), Positives = 140/285 (49%), Gaps = 58/285 (20%)
Query: 495 MGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDI 544
+GL E EEL DR D+ LP QQ L+ R A+A+ KP+++VL+ G V +
Sbjct: 622 VGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVAL 680
Query: 545 TFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWY--PQDYIKVPMTDMK 602
+AK + +I+ A YPG++G A+A ++ GD NPGGRLP+T+Y +D DMK
Sbjct: 681 NWAKTHAD--AIVAAWYPGQSGGTAMARMLAGDDNPGGRLPVTFYRSTKDLPAYVSYDMK 738
Query: 603 MRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQ 662
GRTYR+++G+ +FPFG GLSY++++Y
Sbjct: 739 ----------GRTYRYFKGEPLFPFGYGLSYTRFAY------------------------ 764
Query: 663 DVVHYKSVPELGTEFCETRKFL-VTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVG 721
P+L T + L VT V+N G AG ++++ R P++ LVG
Sbjct: 765 ------DAPQLSTTTLQAGNPLQVTTTVRNTGARAGDEVAQVYLQYPDRPQS-PLRSLVG 817
Query: 722 FQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
FQ V L A E+ + F L +LS G +E G + L VG
Sbjct: 818 FQRVHLAAGEQRTLTFHLD-ARALSDVDRSGQRAVEAGDYTLFVG 861
>gi|294627323|ref|ZP_06705909.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 11122]
gi|292598405|gb|EFF42556.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 11122]
Length = 886
Score = 307 bits (786), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 173/421 (41%), Positives = 241/421 (57%), Gaps = 32/421 (7%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
QRA LV++++ +EK++Q +N APAIPRLGIPAYEWWSE LHG+A G A
Sbjct: 36 QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------A 85
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGMTFWAPNINIFR 165
T FPQ I AAS+++ L ++G + EARA +N + G+T W+PNINIFR
Sbjct: 86 TVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSPNINIFR 145
Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
DPRWGRG ET GEDP +TG+ AV ++RG+QG+ + + A KH + +
Sbjct: 146 DPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLD------HPRTIATPKHIAVH---SG 196
Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
R+ FD V+ D+ TY P F + + +G+A +MCAYN ++G P+CA LL+
Sbjct: 197 PEPGRHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACAADWLLNGRV 256
Query: 286 RRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQ 345
R WGF G++ SDCDAV + + + LKAG D+NCG + A+++
Sbjct: 257 RGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCG-HAYRDLGTAIER 315
Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
+ E+ +D++L LF+ R RLG P P+ ++GA V + AH+ LALQAA + IV
Sbjct: 316 GDVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRALALQAAAESIV 374
Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL-QNYVENTVY 464
LLKN LPL LA+IGPNA++ L NY G S +TPL L Q + V
Sbjct: 375 LLKNDANTLPL--KAGTRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQVS 432
Query: 465 Y 465
Y
Sbjct: 433 Y 433
Score = 130 bits (326), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 95/286 (33%), Positives = 140/286 (48%), Gaps = 58/286 (20%)
Query: 494 MMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVD 543
+GL E EEL DR D+ LP QQ L+ R A+A+ KP+++VL+ G V
Sbjct: 621 FVGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVA 679
Query: 544 ITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWY--PQDYIKVPMTDM 601
+ +AK + +I+ A YPG++G A+A ++ GD NPGGRLP+T+Y +D DM
Sbjct: 680 LNWAKTHAD--AIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFYRSTKDLPAYVSYDM 737
Query: 602 KMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVEN 661
K GRTYR+++G+ +FPFG GLSY++++Y
Sbjct: 738 K----------GRTYRYFKGEPLFPFGYGLSYTRFAY----------------------- 764
Query: 662 QDVVHYKSVPELGTEFCETRKFL-VTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLV 720
P+L T + L VT V+N G AG ++++ R P++ LV
Sbjct: 765 -------DAPQLSTTTLQAGNPLQVTTTVRNTGARAGDEVAQVYLQYPDRPQS-PLRSLV 816
Query: 721 GFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
GFQ V L A E+ + F L +LS G +E G + L VG
Sbjct: 817 GFQRVHLAAGEQRTLTFHLD-ARALSDVDRSGQRAVEAGDYTLFVG 861
>gi|293370605|ref|ZP_06617157.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
gi|292634339|gb|EFF52876.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
Length = 861
Score = 306 bits (785), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 175/443 (39%), Positives = 244/443 (55%), Gaps = 36/443 (8%)
Query: 41 ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
+ P+ T+L QRA DL+ RLTL+EK+S + N++PAIPRLGI YEWW+EALHGV
Sbjct: 22 QLLPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRA 81
Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA---LYNAGQAI----G 153
G AT FPQ I ASF+ L Y + A EAR ++ + G
Sbjct: 82 GL----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQG 131
Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
+TFW PN+NIFRDPRWGRGQET GEDP +TG+ ++ VRG+QG + G + AC
Sbjct: 132 LTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGP-----EDAGYDKLHAC 186
Query: 214 CKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGI 272
KHF + W R+ FDA + +DL +TY P F+ V++ +MCAYNR G
Sbjct: 187 AKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGE 243
Query: 273 PSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDVN 330
P C LL + R +WG+ G + SDC A+S Y + P E A ++ G D+
Sbjct: 244 PCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPDKEHASAAAVRTGTDLE 303
Query: 331 CGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSP 390
CGS AVK + E EID +L L + R LG + + +I V+ S
Sbjct: 304 CGSEYASLAD-AVKAGLIDEKEIDISLKRLLTARFELGEMD---EQSAWSEIPTSVLNSK 359
Query: 391 AHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSIT 450
HQ LAL+ A++ +VLL+N + +LPL + + +A++GPNAN + GNY G ++T
Sbjct: 360 EHQALALRMARESLVLLQNKNNILPL--NTHLKVAVMGPNANDSVMQWGNYNGIPAHTVT 417
Query: 451 PLQALQNYVE--NTVYYPGCDTV 471
L+A++ + +Y PGCD V
Sbjct: 418 LLEAVRAKLPEGQIIYEPGCDRV 440
Score = 84.3 bits (207), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 83/300 (27%), Positives = 139/300 (46%), Gaps = 53/300 (17%)
Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
++ AV AD ++ G+ + E EE+ DR D+ LP Q++L+ + +A
Sbjct: 588 LNLAVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKA 647
Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMT 587
KK V+ + G I +IL A YPG+AG A+ + ++G++NPGGRLP+T
Sbjct: 648 GKK-VVFINYSGSA--IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVT 704
Query: 588 WYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQN 646
+Y +D ++P D M+ GRTYR+ + + +FPFG GLSY+ ++Y +S+N
Sbjct: 705 FY-KDVNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTDFTYGEAKLSKN 755
Query: 647 KLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVK 706
+ + ++VV +TI V N G+ G+ V ++++
Sbjct: 756 TI----------AKGENVV-------------------LTIPVSNVGQRDGEEVVQVYLR 786
Query: 707 PARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
G P L F+ V + A + + L+ E M EGT+ L+ G
Sbjct: 787 RPGDKEG-PRYTLRAFKRVHIPAGKTESVAIPLTGENFEWFDVESNTMRPLEGTYELLYG 845
>gi|262405256|ref|ZP_06081806.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|294644754|ref|ZP_06722499.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CC 2a]
gi|294810589|ref|ZP_06769241.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
xylanisolvens SD CC 1b]
gi|345508031|ref|ZP_08787672.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D1]
gi|229444722|gb|EEO50513.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D1]
gi|262356131|gb|EEZ05221.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|292639876|gb|EFF58149.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CC 2a]
gi|294442250|gb|EFG11065.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
xylanisolvens SD CC 1b]
Length = 861
Score = 306 bits (784), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 177/444 (39%), Positives = 245/444 (55%), Gaps = 38/444 (8%)
Query: 41 ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
+ P+ T+L QRA DL+ RLTL+EK+S + N++PAIPRLGI YEWW+EALHGV
Sbjct: 22 QLLPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRA 81
Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA---LYNAGQAI----G 153
G AT FPQ I ASF+ L Y + A EAR ++ + G
Sbjct: 82 GL----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQG 131
Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASA 212
+TFW PN+NIFRDPRWGRGQET GEDP +TG+ ++ VRG+QG + KL A
Sbjct: 132 LTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLH------A 185
Query: 213 CCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
C KHF + W R+ FDA + +DL +TY P F+ V++ +MCAYNR G
Sbjct: 186 CAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEG 242
Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDV 329
P C LL + R +WG+ G + SDC A+S Y + P E A ++AG D+
Sbjct: 243 EPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPDKEHASAGAVRAGTDL 302
Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
CGS AVK + E EID +L L + R LG + + +I V+ S
Sbjct: 303 ECGSEYASLAD-AVKAGLIDEKEIDISLKRLLTARFELGEMD---EQSAWSEIPTSVLNS 358
Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
HQ LAL+ A++ +VLL+N + +LPL + + +A++GPNAN + GNY G ++
Sbjct: 359 KEHQALALRMARESLVLLQNKNNILPL--NTHLKVAVMGPNANDSVMQWGNYNGIPAHTV 416
Query: 450 TPLQALQNYVE--NTVYYPGCDTV 471
T L+A++ + +Y PGCD V
Sbjct: 417 TLLEAVRAKLPEGQIIYEPGCDRV 440
Score = 84.0 bits (206), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 60/182 (32%), Positives = 99/182 (54%), Gaps = 23/182 (12%)
Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
++ AV AD ++ G+ + E EE+ DR D+ LP Q++L+ + +A
Sbjct: 588 LNLAVKKVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKA 647
Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMT 587
KK V+ + G I +IL A YPG+AG A+ + ++G++NPGGRLP+T
Sbjct: 648 GKK-VVFINYSGSA--IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVT 704
Query: 588 WYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQN 646
+Y +D ++P D M+ GRTYR+ + + +FPFG GLSY+ ++Y +S+N
Sbjct: 705 FY-KDVNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTTFTYGEAKLSKN 755
Query: 647 KL 648
+
Sbjct: 756 TI 757
>gi|336415363|ref|ZP_08595703.1| hypothetical protein HMPREF1017_02811 [Bacteroides ovatus
3_8_47FAA]
gi|335940959|gb|EGN02821.1| hypothetical protein HMPREF1017_02811 [Bacteroides ovatus
3_8_47FAA]
Length = 861
Score = 306 bits (784), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 177/444 (39%), Positives = 245/444 (55%), Gaps = 38/444 (8%)
Query: 41 ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
+ P+ T+L QRA DL+ RLTL+EK+S + N++PAIPRLGI YEWW+EALHGV
Sbjct: 22 QLLPYQDTSLAAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRA 81
Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA---LYNAGQAI----G 153
G AT FPQ I ASF+ L Y + A EAR ++ + G
Sbjct: 82 GL----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQG 131
Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASA 212
+TFW PN+NIFRDPRWGRGQET GEDP +TG+ ++ VRG+QG + KL A
Sbjct: 132 LTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLH------A 185
Query: 213 CCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
C KHF + W R+ FDA + +DL +TY P F+ V++ +MCAYNR G
Sbjct: 186 CAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEG 242
Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDV 329
P C LL + R +WG+ G + SDC A+S Y + P E A ++AG D+
Sbjct: 243 EPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPDKEHASAGAVRAGTDL 302
Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
CGS AVK + E EID +L L + R LG + + +I V+ S
Sbjct: 303 ECGSEYASLAD-AVKAGLIDEKEIDISLKRLLTARFELGEMD---EQSAWSEIPTSVLNS 358
Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
HQ LAL+ A++ +VLL+N + +LPL + + +A++GPNAN + GNY G ++
Sbjct: 359 KEHQALALRMARESLVLLQNKNNILPL--NTHLKVAVMGPNANDSVMQWGNYNGIPAHTV 416
Query: 450 TPLQALQNYVE--NTVYYPGCDTV 471
T L+A++ + +Y PGCD V
Sbjct: 417 TLLEAVRAKLPEGQIIYEPGCDRV 440
Score = 92.4 bits (228), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 80/290 (27%), Positives = 134/290 (46%), Gaps = 53/290 (18%)
Query: 488 ADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLL 537
AD ++ G+ + E EE+ DR D+ LP Q++L+ + + KK V+ +
Sbjct: 598 ADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKVGKK-VVFINY 656
Query: 538 CGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVP 597
G I +IL A YPG+AG A+ + ++G++NPGGRLP+T+Y +D ++P
Sbjct: 657 SGSA--IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFY-KDVNQLP 713
Query: 598 -MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSST 656
D M+ GRTYR+ + + +FPFG GLSY+ ++Y +S+N +
Sbjct: 714 DFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTDFTYGEAKLSKNTI-------- 757
Query: 657 KMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPI 716
+ ++VV +TI V N G+ G+ V ++++ G P
Sbjct: 758 --AKGENVV-------------------LTIPVSNVGQRDGEEVVQVYLRRPGDKEG-PR 795
Query: 717 KQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
L F+ V + A + + L+ E M EGT+ L+ G
Sbjct: 796 YTLRAFKRVHIPAGKTESVAISLTGENFEWFDVESNTMRPLEGTYELLYG 845
>gi|295086418|emb|CBK67941.1| Beta-glucosidase-related glycosidases [Bacteroides xylanisolvens
XB1A]
Length = 861
Score = 306 bits (783), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 177/444 (39%), Positives = 245/444 (55%), Gaps = 38/444 (8%)
Query: 41 ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
+ P+ T+L QRA DL+ RLTL+EK+S + N++PAIPRLGI YEWW+EALHGV
Sbjct: 22 QLLPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRA 81
Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA---LYNAGQAI----G 153
G AT FPQ I ASF+ L Y + A EAR ++ + G
Sbjct: 82 GL----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQG 131
Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASA 212
+TFW PN+NIFRDPRWGRGQET GEDP +TG+ ++ VRG+QG + KL A
Sbjct: 132 LTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLH------A 185
Query: 213 CCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
C KHF + W R+ FDA + +DL +TY P F+ V++ +MCAYNR G
Sbjct: 186 CAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEG 242
Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDV 329
P C LL + R +WG+ G + SDC A+S Y + P E A ++AG D+
Sbjct: 243 EPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHETHPDKEHASAGAVRAGTDL 302
Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
CGS AVK + E EID +L L + R LG + + +I V+ S
Sbjct: 303 ECGSEYASLAD-AVKAGLIDEKEIDISLKRLLTARFELGEMD---EQSAWSEIPTSVLNS 358
Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
HQ LAL+ A++ +VLL+N + +LPL + + +A++GPNAN + GNY G ++
Sbjct: 359 KEHQALALRMARESLVLLQNKNNILPL--NTHLKVAVMGPNANDSVMQWGNYNGIPAHTV 416
Query: 450 TPLQALQNYVE--NTVYYPGCDTV 471
T L+A++ + +Y PGCD V
Sbjct: 417 TLLEAVRAKLPEGQIIYEPGCDRV 440
Score = 85.1 bits (209), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 83/300 (27%), Positives = 139/300 (46%), Gaps = 53/300 (17%)
Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
++ AV AD ++ G+ + E EE+ DR D+ LP Q++L+ + +A
Sbjct: 588 LNLAVKKVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKA 647
Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMT 587
KK V+ + G I +IL A YPG+AG A+ + ++G++NPGGRLP+T
Sbjct: 648 GKK-VVFINYSGSA--IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVT 704
Query: 588 WYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQN 646
+Y +D ++P D M+ GRTYR+ + + +FPFG GLSY+ ++Y +S+N
Sbjct: 705 FY-KDVNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTDFTYGEAKLSKN 755
Query: 647 KLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVK 706
+ + ++VV +TI V N G+ G+ V ++++
Sbjct: 756 TI----------AKGENVV-------------------LTIPVSNVGQRDGEEVVQVYLR 786
Query: 707 PARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
G P L F+ V + A + + L+ E M EGT+ L+ G
Sbjct: 787 RPGDKEG-PRYTLRAFKRVHIPAGKTESVAIPLTGENFEWFDVESNTMCPLEGTYELLYG 845
>gi|189463167|ref|ZP_03011952.1| hypothetical protein BACCOP_03878 [Bacteroides coprocola DSM 17136]
gi|189430146|gb|EDU99130.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
coprocola DSM 17136]
Length = 865
Score = 306 bits (783), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 180/445 (40%), Positives = 248/445 (55%), Gaps = 38/445 (8%)
Query: 41 ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
+ FP+ T+L QRA DL+ RLTL+EK+S + N++PAIPRLGI AY+WW+EALHGV
Sbjct: 23 QQFPYQNTSLTPEQRASDLLERLTLEEKVSLMQNASPAIPRLGIKAYDWWNEALHGVGRA 82
Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY-------NAGQAIG 153
G AT FPQ I AASFD L Y++ A+ EARA Y N + G
Sbjct: 83 GI----------ATVFPQTIGMAASFDDELIYKVFTAVSDEARAKYTEFSKSGNLKRYQG 132
Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASA 212
+TFW PNINIFRDPRWGRGQET GEDP +T + V+ VRG+QG D KL A
Sbjct: 133 LTFWTPNINIFRDPRWGRGQETYGEDPYLTSRMGVAVVRGLQGPDNMKYDKLH------A 186
Query: 213 CCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
C KH+ + W R+ F+A + +DL +TY P F++ V++ +MCAYNR G
Sbjct: 187 CAKHYAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKALVQEADVKEVMCAYNRFEG 243
Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDV 329
P C LL + R +W + G I SDC A+S + + P E A + +G D+
Sbjct: 244 EPCCGSNRLLMQILRDEWKYKGIIVSDCGAISDFWRKGDHETHPDKETASAGAVLSGTDL 303
Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
CG+ K AV++ + E +ID ++ L + R LG + + + I VV S
Sbjct: 304 ECGNNY-KSLPEAVQKGLIDEKQIDISVKRLLTARFELGEMDEHVC---WDSIPYSVVDS 359
Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
AH+ LAL+ A+ IVLL+N + +LPL + + +ALIGPNAN + GNY G +
Sbjct: 360 KAHKDLALEIARKSIVLLQNRNNILPL--KEDMKIALIGPNANDSVMQWGNYNGFPSHTS 417
Query: 450 TPLQALQNYV--ENTVYYPGCDTVA 472
T +AL+ + +Y GCD +
Sbjct: 418 TLYEALKERIPANQLIYDFGCDRTS 442
Score = 99.4 bits (246), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 84/315 (26%), Positives = 137/315 (43%), Gaps = 56/315 (17%)
Query: 477 SIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAE 526
++ ++D K AD +V G+ + E EE+ DR + LP Q+ LI+ + +
Sbjct: 590 NLQASIDKVKAADVIVFAGGISPSLEGEEMPVNAEGFKGGDRTTIELPAIQRRLISELKK 649
Query: 527 AAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPM 586
K P+I V G V + + +IL A YPG+AG A+A+V+FGD+NP G+LP+
Sbjct: 650 LGK-PIIFVNYSGSAVGL--EPESKICDAILQAWYPGQAGGTAVADVLFGDYNPSGKLPV 706
Query: 587 TWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQN 646
T+Y D M+ GRTYR+ ++ FG GLSY+ ++Y +SQ
Sbjct: 707 TFYKHTDQLPDFQDYSMK--------GRTYRYMTESPLYSFGHGLSYTNFTYGPATLSQQ 758
Query: 647 KLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVK 706
+ + ++ +TI V+N G G+ V +++
Sbjct: 759 TI-----------------------------SQGKEVTLTIPVQNTGNYDGEEVVQVYLS 789
Query: 707 PARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
+ G P L F+ V + ++A + F L M + EG + L+ G
Sbjct: 790 CSGDKEG-PSHTLRAFKRVHIAKGQRANVSFTLDSETFQWFDTNTNTMRMVEGNYELLYG 848
Query: 767 -----DEEYPISIFV 776
D+ ISI V
Sbjct: 849 GTSRIDQLQKISIKV 863
>gi|237719778|ref|ZP_04550259.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
gi|229451047|gb|EEO56838.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
Length = 861
Score = 305 bits (782), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 177/444 (39%), Positives = 245/444 (55%), Gaps = 38/444 (8%)
Query: 41 ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
+ P+ T+L QRA DL+ RLTL+EK+S + N++PAIPRLGI YEWW+EALHGV
Sbjct: 22 QLLPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRA 81
Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA---LYNAGQAI----G 153
G AT FPQ I ASF+ L Y + A EAR ++ + G
Sbjct: 82 GL----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQG 131
Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASA 212
+TFW PN+NIFRDPRWGRGQET GEDP +TG+ ++ VRG+QG + KL A
Sbjct: 132 LTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLH------A 185
Query: 213 CCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
C KHF + W R+ FDA + +DL +TY P F+ V++ +MCAYNR G
Sbjct: 186 CAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEG 242
Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDV 329
P C LL + R +WG+ G + SDC A+S Y + P E A ++AG D+
Sbjct: 243 EPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHETYPDKEHASAGAVRAGTDL 302
Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
CGS AVK + E EID +L L + R LG + + +I V+ S
Sbjct: 303 ECGSEYASLAD-AVKAGLIDEKEIDISLKRLLTARFELGEMD---EQSAWSEIPTSVLNS 358
Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
HQ LAL+ A++ +VLL+N + +LPL + + +A++GPNAN + GNY G ++
Sbjct: 359 KEHQALALRMARESLVLLQNKNNILPL--NTHLKVAVMGPNANDSVMQWGNYNGIPAHTV 416
Query: 450 TPLQALQNYVE--NTVYYPGCDTV 471
T L+A++ + +Y PGCD V
Sbjct: 417 TLLEAVRAKLPEGQIIYEPGCDRV 440
Score = 85.1 bits (209), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 83/300 (27%), Positives = 139/300 (46%), Gaps = 53/300 (17%)
Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
++ AV AD ++ G+ + E EE+ DR D+ LP Q++L+ + +A
Sbjct: 588 LNLAVKKVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKA 647
Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMT 587
KK V+ + G I +IL A YPG+AG A+ + ++G++NPGGRLP+T
Sbjct: 648 GKK-VVFINYSGSA--IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVT 704
Query: 588 WYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQN 646
+Y +D ++P D M+ GRTYR+ + + +FPFG GLSY+ ++Y +S+N
Sbjct: 705 FY-KDVNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTDFTYGEAKLSKN 755
Query: 647 KLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVK 706
+ + ++VV +TI V N G+ G+ V ++++
Sbjct: 756 TI----------AKGENVV-------------------LTIPVSNVGQRDGEEVVQVYLR 786
Query: 707 PARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
G P L F+ V + A + + L+ E M EGT+ L+ G
Sbjct: 787 RPGDKEG-PRYTLRAFKRVHIPAGKTESVAIPLTGENFEWFDVESNTMCPLEGTYELLYG 845
>gi|189464583|ref|ZP_03013368.1| hypothetical protein BACINT_00926 [Bacteroides intestinalis DSM
17393]
gi|189436857|gb|EDV05842.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 879
Score = 305 bits (780), Expect = 8e-80, Method: Compositional matrix adjust.
Identities = 183/472 (38%), Positives = 257/472 (54%), Gaps = 54/472 (11%)
Query: 13 LCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQL 72
LCLCF S +QPP+ NP+ L +RA DLV RLTL+EK + +
Sbjct: 30 LCLCFLSC------SQPPYK----NPA----------LSPEERANDLVGRLTLEEKAALM 69
Query: 73 VNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWY 132
N++PAIPRLGI AY+WW+EALHGV G AT FPQ I ASF++ L Y
Sbjct: 70 QNTSPAIPRLGIKAYDWWNEALHGVGRAGL----------ATVFPQAIGMGASFNNELLY 119
Query: 133 RIGQAIGLEARALYNA-------GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGK 185
+ AI EARA + G+T W PNINIFRDPRWGRGQET GEDP +T +
Sbjct: 120 DVFTAISDEARAKNTEFSKEGGLKRYQGLTMWTPNINIFRDPRWGRGQETYGEDPYLTSQ 179
Query: 186 YAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLAD 244
++ VRG+QG G+ KL AC KH+ + W R+ F+A + +DL +
Sbjct: 180 MGMAVVRGLQGPE---GEKYDKLH--ACAKHYAVHSGPEW---NRHSFNAENIDPRDLWE 231
Query: 245 TYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSI 304
TY P F+ V++ +MCAYNR G P C LL R +WG+ + SDC A+S
Sbjct: 232 TYLPAFKDLVQKAHVKEVMCAYNRFEGEPCCGSNRLLMHILRDEWGYKEIVVSDCWAISD 291
Query: 305 IYDAEGYAKSP--EDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFS 362
Y+ + P + A + +G D+ CG + AVK+ + E +ID +L L
Sbjct: 292 FYNKGAHETDPDKQHASAKAVLSGTDIECGDSYGSLPE-AVKEGLIDEKQIDISLKRLMK 350
Query: 363 VRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSV 422
R LG + P+ + +I VV S H+ LAL+ A++ +VLL+N+ LLPL +K++
Sbjct: 351 ARFELGEMD-EPSQVSWAQIPYSVVDSKEHRELALRMARESLVLLQNNQSLLPL--NKNL 407
Query: 423 SLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENT--VYYPGCDTVA 472
+A++GPNAN + GNY G +IT L+ ++ Y+ + +Y PGCD +
Sbjct: 408 KVAVVGPNANDSVMQWGNYNGFPSHTITLLEGIREYLPESQIIYEPGCDLTS 459
Score = 92.0 bits (227), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 57/171 (33%), Positives = 85/171 (49%), Gaps = 21/171 (12%)
Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
+ + V+ K AD ++ G+ E EE+ DR + LP Q L+ + +A
Sbjct: 606 LKQTVNKVKEADVIIFAGGISPAVEGEEMHVNIPGFKGGDRETIELPSIQSRLLAELKKA 665
Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMT 587
KK ++ V G + +T + +IL A YPG+AG A+A V+FGD+NP GRLP+T
Sbjct: 666 GKK-IVFVNFSGSAIALT--PESKTCDAILQAWYPGQAGGTAIANVLFGDYNPAGRLPVT 722
Query: 588 WYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSY 638
+Y D M+ RTYR+ +FPFG GLSY+ + Y
Sbjct: 723 FYKSTSQLPGFEDYSMKE--------RTYRYMTEAPLFPFGHGLSYTTFRY 765
>gi|384428895|ref|YP_005638255.1| beta-glucosidase [Xanthomonas campestris pv. raphani 756C]
gi|341937998|gb|AEL08137.1| beta-glucosidase [Xanthomonas campestris pv. raphani 756C]
Length = 888
Score = 305 bits (780), Expect = 8e-80, Method: Compositional matrix adjust.
Identities = 170/421 (40%), Positives = 246/421 (58%), Gaps = 32/421 (7%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
QRA LV++++ +EK++Q +N+APAIPRLGIPAYEWW+E LHG+A G A
Sbjct: 38 QRAAALVAQMSREEKVAQAMNAAPAIPRLGIPAYEWWNEGLHGIARNGY----------A 87
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGMTFWAPNINIFR 165
T FPQ I AAS+++ L ++G + EARA +N + G+T W+PNINIFR
Sbjct: 88 TVFPQAIGLAASWNTQLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFR 147
Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
DPRWGRG ET GEDP +TG+ AV ++RG+QGD + A KH + +
Sbjct: 148 DPRWGRGMETYGEDPFLTGQLAVGFIRGLQGDDLE------HPRTIATPKHIAVH---SG 198
Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
R+ FD V+ +D+ TY P F + + +G+A +MCAYN ++G P+CA LL+
Sbjct: 199 PEPGRHGFDVDVSPRDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACAADWLLNGRV 258
Query: 286 RRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQ 345
R WGF G++ SDCDAV + + + LKAG D+NCG+ + A+++
Sbjct: 259 RGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGTAY-RALGTAIER 317
Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
++ E+ +D++L LF+ R RLG P + ++GA + + ++ LALQAA + IV
Sbjct: 318 GEVDEALLDQSLVRLFAARYRLGELQA-PRKDRYARLGAKDIDNAGNRALALQAAAESIV 376
Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL-QNYVENTVY 464
LLKN++ LPL S LA+IGPNA++ L NY G S + +TPL L Q + V
Sbjct: 377 LLKNANATLPL--KASTRLAVIGPNADALAALEANYQGTSSQPVTPLLGLRQRFGAQQVR 434
Query: 465 Y 465
Y
Sbjct: 435 Y 435
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 96/289 (33%), Positives = 142/289 (49%), Gaps = 52/289 (17%)
Query: 488 ADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLL 537
+D VV +GL E EEL DR D+ LP QQ L+ R A+A+ KP+++VL+
Sbjct: 617 SDAVVAFVGLSPDVEGEELRIDVPGFDGGDRNDIALPAAQQALLER-AKASGKPLVVVLM 675
Query: 538 CGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVP 597
G V + +AK + +I+ A YPG++G A+A + GD NPGGRLP+T+Y P
Sbjct: 676 SGSAVALNWAKTHAD--AIVAAWYPGQSGGTAIARALAGDDNPGGRLPVTFYRSTKDLPP 733
Query: 598 MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTK 657
M+ GRTYR+++G+ +FPFG GLSY++++Y+ +S L Q+ S
Sbjct: 734 YVSYDMK--------GRTYRYFKGEALFPFGYGLSYTRFAYETPRLSATTL---QAGS-- 780
Query: 658 MVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIK 717
VT V+N GE AG ++++ R P++
Sbjct: 781 ------------------------PLQVTTTVRNTGERAGDEVAQVYLQYPERPQS-PLR 815
Query: 718 QLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
LVGFQ V L E+ + F L +LS G +E G + L VG
Sbjct: 816 SLVGFQRVHLQPGEQRTLTFTLD-ARALSDVDRTGTRAVEAGDYRLFVG 863
>gi|383110854|ref|ZP_09931672.1| hypothetical protein BSGG_1962 [Bacteroides sp. D2]
gi|313694427|gb|EFS31262.1| hypothetical protein BSGG_1962 [Bacteroides sp. D2]
Length = 861
Score = 305 bits (780), Expect = 9e-80, Method: Compositional matrix adjust.
Identities = 181/472 (38%), Positives = 257/472 (54%), Gaps = 51/472 (10%)
Query: 13 LCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQL 72
L +C SLL FSC + P+ T+L QRA DL+ RLTL+EK++ +
Sbjct: 9 LGVCSLSLL---------FSC------AQKLPYQDTSLTAEQRAEDLLPRLTLEEKVALM 53
Query: 73 VNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWY 132
N++PAIPRLGI Y+WW+EALHGV G AT FPQ I ASF+ L Y
Sbjct: 54 QNASPAIPRLGIKEYDWWNEALHGVGRAGL----------ATVFPQSIGMGASFNDSLLY 103
Query: 133 RIGQAIGLEARA---LYNAGQAI----GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGK 185
+ A+ EAR +++ + G+TFW PN+NIFRDPRWGRGQET GEDP +TG+
Sbjct: 104 EVFDAVSDEARVKSRIFSENGVLKRYQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQ 163
Query: 186 YAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLAD 244
++ VRG+QG GK + AC KHF + W R+ FDA +T +DL +
Sbjct: 164 LGMAVVRGLQGP--ENGKYD---KLHACAKHFAVHSGPEW---NRHSFDAENITPRDLWE 215
Query: 245 TYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSI 304
TY P F+ V++ +MCAYNR G P C LL + R +WG+ G + SDC A+S
Sbjct: 216 TYLPAFKDLVQKADVKEVMCAYNRFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISD 275
Query: 305 IYDAEGYAKSP--EDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFS 362
Y + P E A + +G D+ CG AVK + E +ID +L L +
Sbjct: 276 FYRPGTHGTHPDKEHASAGAVLSGTDLECGGEYGSLAD-AVKAGLIDEKQIDVSLKRLLT 334
Query: 363 VRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSV 422
R LG + P + +I A + S HQ LAL+ A++ +VLL+N + +LPL + +
Sbjct: 335 ARFELGEMDEQPA---WAEIPASTLNSKEHQDLALRMARESLVLLQNKNDILPL--NTDL 389
Query: 423 SLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE--NTVYYPGCDTVA 472
+A++GPNAN + GNY G ++T L+A+++ + +Y PGCD +
Sbjct: 390 KVAVMGPNANDSVMQWGNYNGIPGHTVTLLEAVRSKLPEGQVMYEPGCDRTS 441
Score = 90.5 bits (223), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 63/183 (34%), Positives = 101/183 (55%), Gaps = 23/183 (12%)
Query: 477 SIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAE 526
+++ AV+ K AD V+ G+ + E EE+ DR D+ LP Q++L+ + +
Sbjct: 587 NLNLAVEKVKDADVVLFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPAVQRDLLKALKK 646
Query: 527 AAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPM 586
A KK V+ + G I +IL YPG+AG A+ +V+FGD+NP GRLP+
Sbjct: 647 AGKK-VVFINYSGSA--IGLVPESNTCEAILQGWYPGQAGGTAIVDVLFGDYNPAGRLPV 703
Query: 587 TWYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQ 645
T+Y +D ++P D M+ GRTYR+ + + +FPFG GLSY+ ++Y +S+
Sbjct: 704 TFY-KDAGQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTTFTYGEADLSK 754
Query: 646 NKL 648
N +
Sbjct: 755 NTI 757
>gi|160885419|ref|ZP_02066422.1| hypothetical protein BACOVA_03419 [Bacteroides ovatus ATCC 8483]
gi|156109041|gb|EDO10786.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus ATCC 8483]
Length = 861
Score = 304 bits (779), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 177/444 (39%), Positives = 245/444 (55%), Gaps = 38/444 (8%)
Query: 41 ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
+ P+ T+L QR DL+ RLTL+EK+S + N++PAIPRLGI YEWW+EALHGV
Sbjct: 22 QLLPYQDTSLAAEQRTEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRA 81
Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA----LYNAG---QAIG 153
G AT FPQ I ASF+ L Y + A EAR ++G + G
Sbjct: 82 GL----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGDSGVLKRYQG 131
Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASA 212
+TFW PN+NIFRDPRWGRGQET GEDP +TG+ ++ VRG+QG + KL A
Sbjct: 132 LTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLH------A 185
Query: 213 CCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
C KHF + W R+ FDA + +DL +TY P F+ V++ +MCAYNR G
Sbjct: 186 CAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEG 242
Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDV 329
P C LL + R +WG+ G + SDC A+S Y + P E A ++AG D+
Sbjct: 243 EPCCGSNRLLMQILRDEWGYEGIVVSDCGAISDFYRPGTHGTHPDKEHASAGAVRAGTDL 302
Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
CGS AVK + E EID +L L + R LG + + +I V+ S
Sbjct: 303 ECGSEYASLAD-AVKAGLIDEKEIDISLKRLLTARFELGEMD---EQSAWSEIPTSVLNS 358
Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
HQ LAL+ A++ +VLL+N + +LPL + + +A++GPNAN + GNY G ++
Sbjct: 359 KEHQALALRMARESLVLLQNKNNILPL--NTHLKVAVMGPNANDSVMQWGNYNGIPAHTV 416
Query: 450 TPLQALQNYVE--NTVYYPGCDTV 471
T L+A++ + +Y PGCD V
Sbjct: 417 TLLEAVRAKLPEGQIIYEPGCDRV 440
Score = 84.0 bits (206), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 81/290 (27%), Positives = 135/290 (46%), Gaps = 53/290 (18%)
Query: 488 ADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLL 537
AD ++ G+ + E EE+ DR D+ LP Q++L+ + +A KK V+ +
Sbjct: 598 ADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGKK-VVFINY 656
Query: 538 CGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVP 597
G I +IL A YPG+AG A+ + ++G++NPGGRLP+T+Y +D ++P
Sbjct: 657 SGSA--IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFY-KDVNQLP 713
Query: 598 -MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSST 656
D M+ GRTYR+ + + +FPFG GLSY+ ++Y +S+N +
Sbjct: 714 DFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTDFTYGEAKLSKNTI-------- 757
Query: 657 KMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPI 716
+ ++VV +TI V N G+ G+ V ++++ G P
Sbjct: 758 --AKGENVV-------------------LTIPVSNVGQRDGEEVVQVYLRRPGDKEG-PR 795
Query: 717 KQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
L F+ V + A + + L+ E M EGT+ L+ G
Sbjct: 796 YTLRAFKRVHIPAGKTESVAIPLTGENFEWFDVESNTMCPLEGTYELLYG 845
>gi|188990656|ref|YP_001902666.1| beta-glucosidase [Xanthomonas campestris pv. campestris str. B100]
gi|167732416|emb|CAP50610.1| exported beta-glucosidase [Xanthomonas campestris pv. campestris]
Length = 888
Score = 304 bits (778), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 170/421 (40%), Positives = 245/421 (58%), Gaps = 32/421 (7%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
QRA LV++++ +EK++Q +N+APAIPRLGIPAYEWWSE LHG+A G A
Sbjct: 38 QRAAALVAQMSREEKVAQAMNAAPAIPRLGIPAYEWWSEGLHGIARNGY----------A 87
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGMTFWAPNINIFR 165
T FPQ I AAS+++ L ++G + EARA +N + G+T W+PNINIFR
Sbjct: 88 TVFPQAIGLAASWNTQLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFR 147
Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
DPRWGRG ET GEDP +TG+ AV ++RG+QGD + A KH + +
Sbjct: 148 DPRWGRGMETYGEDPFLTGQLAVGFIRGLQGDDLE------HPRTIATPKHIAVH---SG 198
Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
R+ FD V+ +D+ TY P F + + +G+A +MCAYN ++G P+CA LL+
Sbjct: 199 PEPGRHGFDVDVSPRDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACAADWLLNGRV 258
Query: 286 RRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQ 345
R WGF G++ SDCDAV + + + LKAG D+NCG+ + A+++
Sbjct: 259 RGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAASLKAGHDLNCGTAY-RALGTAIER 317
Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
++ E+ +D++L LF+ R RLG P + ++GA + + ++ LALQAA + IV
Sbjct: 318 GEVDEALLDQSLVRLFAARYRLGELQA-PRKDRYARLGAKDIDNAGNRALALQAAAESIV 376
Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL-QNYVENTVY 464
LLKN++ LPL LA+IGPNA++ L NY G S + +TPL L Q + V
Sbjct: 377 LLKNANATLPL--KAGTRLAVIGPNADALAALEANYQGTSSQPVTPLLGLRQRFGAQQVR 434
Query: 465 Y 465
Y
Sbjct: 435 Y 435
Score = 132 bits (331), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 94/281 (33%), Positives = 138/281 (49%), Gaps = 52/281 (18%)
Query: 496 GLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
GL E EEL DR D+ LP QQ L+ R A+A+ KP+++VL+ G V +
Sbjct: 625 GLSPDVEGEELRIDVPGFDGGDRNDIALPAAQQALLER-AKASGKPLVVVLMSGSAVALN 683
Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRP 605
+AK + +I+ A YPG++G A+A + GD NPGGRLP+T+Y P M+
Sbjct: 684 WAKTHAD--AIVAAWYPGQSGGTAIARALAGDDNPGGRLPVTFYRSTKDLPPYVSYDMK- 740
Query: 606 QATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVV 665
GRTYR+++G+ +FPFG GLSY++++Y+ +S L Q+ S
Sbjct: 741 -------GRTYRYFKGEALFPFGYGLSYTRFAYETPRLSVTTL---QAGS---------- 780
Query: 666 HYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSV 725
VT V+N GE AG ++++ R P++ LVGFQ V
Sbjct: 781 ----------------PLQVTTTVRNTGERAGDEVAQVYLQYPDRPQS-PLRSLVGFQRV 823
Query: 726 ILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
L E+ + F L +LS G V+E G + L VG
Sbjct: 824 HLQPGEQRTLTFTLD-ARALSDVDRTGTRVVEAGDYRLFVG 863
>gi|21232323|ref|NP_638240.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
str. ATCC 33913]
gi|21114093|gb|AAM42164.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
str. ATCC 33913]
Length = 888
Score = 303 bits (777), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 170/421 (40%), Positives = 245/421 (58%), Gaps = 32/421 (7%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
QRA LV++++ +EK++Q +N+APAIPRLGIPAYEWWSE LHG+A G A
Sbjct: 38 QRAAALVAQMSREEKVAQAMNAAPAIPRLGIPAYEWWSEGLHGIARNGY----------A 87
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGMTFWAPNINIFR 165
T FPQ I AAS+++ L ++G + EARA +N + G+T W+PNINIFR
Sbjct: 88 TVFPQAIGLAASWNTQLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFR 147
Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
DPRWGRG ET GEDP +TG+ AV ++RG+QGD + A KH + +
Sbjct: 148 DPRWGRGMETYGEDPFLTGQLAVGFIRGLQGDDLE------HPRTIATPKHIAVH---SG 198
Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
R+ FD V+ +D+ TY P F + + +G+A +MCAYN ++G P+CA LL+
Sbjct: 199 PEPGRHGFDVDVSPRDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACAADWLLNGRV 258
Query: 286 RRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQ 345
R WGF G++ SDCDAV + + + LKAG D+NCG+ + A+++
Sbjct: 259 RGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAASLKAGHDLNCGTAY-RALGTAIER 317
Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
++ E+ +D++L LF+ R RLG P + ++GA + + ++ LALQAA + IV
Sbjct: 318 GEVDEALLDQSLVRLFAARYRLGELQA-PRKDRYARLGAKDIDNAGNRALALQAAAESIV 376
Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL-QNYVENTVY 464
LLKN++ LPL LA+IGPNA++ L NY G S + +TPL L Q + V
Sbjct: 377 LLKNANATLPL--KAGTRLAVIGPNADALAALEANYQGTSSQPVTPLLGLRQRFGAQQVR 434
Query: 465 Y 465
Y
Sbjct: 435 Y 435
Score = 127 bits (319), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 92/281 (32%), Positives = 134/281 (47%), Gaps = 52/281 (18%)
Query: 496 GLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
GL E EEL DR D+ LP QQ L+ R A+A+ KP+++VL+ G V +
Sbjct: 625 GLSPDVEGEELRIDVPGFDGGDRNDIALPAAQQALLER-AKASGKPLVVVLMSGSAVALN 683
Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRP 605
+AK + +I+ A YPG++G A+A + GD NPGGRLP+T+Y P M+
Sbjct: 684 WAKTHAD--AIVAAWYPGQSGGTAIARALAGDDNPGGRLPVTFYRSTKDLPPYVSYDMK- 740
Query: 606 QATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVV 665
GRTYR+++G+ +FPFG GLSY+ ++Y +S L Q+ S
Sbjct: 741 -------GRTYRYFKGEALFPFGYGLSYTSFAYDAPQLSSTTL---QAGS---------- 780
Query: 666 HYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSV 725
VT V+N G AG ++++ R P++ LVGFQ V
Sbjct: 781 ----------------PLQVTTTVRNTGTRAGDEVAQVYLQYPDRPQS-PLRSLVGFQRV 823
Query: 726 ILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
L E+ + F L +LS G +E G + L VG
Sbjct: 824 HLQPGEQRTLTFTLD-ARALSDVDRTGTRAVEAGDYRLFVG 863
>gi|66767544|ref|YP_242306.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
str. 8004]
gi|66572876|gb|AAY48286.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
str. 8004]
Length = 888
Score = 303 bits (776), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 170/421 (40%), Positives = 245/421 (58%), Gaps = 32/421 (7%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
QRA LV++++ +EK++Q +N+APAIPRLGIPAYEWWSE LHG+A G A
Sbjct: 38 QRAAALVAQMSREEKVAQSMNAAPAIPRLGIPAYEWWSEGLHGIARNGY----------A 87
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGMTFWAPNINIFR 165
T FPQ I AAS+++ L ++G + EARA +N + G+T W+PNINIFR
Sbjct: 88 TVFPQAIGLAASWNTQLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFR 147
Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
DPRWGRG ET GEDP +TG+ AV ++RG+QGD + A KH + +
Sbjct: 148 DPRWGRGMETYGEDPFLTGQLAVGFIRGLQGDDLE------HPRTIATPKHIAVH---SG 198
Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
R+ FD V+ +D+ TY P F + + +G+A +MCAYN ++G P+CA LL+
Sbjct: 199 PEPGRHGFDVDVSPRDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACAADWLLNGRV 258
Query: 286 RRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQ 345
R WGF G++ SDCDAV + + + LKAG D+NCG+ + A+++
Sbjct: 259 RGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAASLKAGHDLNCGTAY-RALGTAIER 317
Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
++ E+ +D++L LF+ R RLG P + ++GA + + ++ LALQAA + IV
Sbjct: 318 GEVDEALLDQSLVRLFAARYRLGELQA-PRKDRYARLGAKDIDNAGNRALALQAAAESIV 376
Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL-QNYVENTVY 464
LLKN++ LPL LA+IGPNA++ L NY G S + +TPL L Q + V
Sbjct: 377 LLKNANATLPL--KAGTRLAVIGPNADALAALEANYQGTSSQPVTPLLGLRQRFGAQQVR 434
Query: 465 Y 465
Y
Sbjct: 435 Y 435
Score = 127 bits (319), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 92/281 (32%), Positives = 134/281 (47%), Gaps = 52/281 (18%)
Query: 496 GLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
GL E EEL DR D+ LP QQ L+ R A+A+ KP+++VL+ G V +
Sbjct: 625 GLSPDVEGEELRIDVPGFDGGDRNDIALPAAQQALLER-AKASGKPLVVVLMSGSAVALN 683
Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRP 605
+AK + +I+ A YPG++G A+A + GD NPGGRLP+T+Y P M+
Sbjct: 684 WAKTHAD--AIVAAWYPGQSGGTAIARALAGDDNPGGRLPVTFYRSTKDLPPYVSYDMK- 740
Query: 606 QATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVV 665
GRTYR+++G+ +FPFG GLSY+ ++Y +S L Q+ S
Sbjct: 741 -------GRTYRYFKGEALFPFGYGLSYTSFAYDAPQLSSTTL---QAGS---------- 780
Query: 666 HYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSV 725
VT V+N G AG ++++ R P++ LVGFQ V
Sbjct: 781 ----------------PLQVTTTVRNTGTRAGDEVAQVYLQYPDRPQS-PLRSLVGFQRV 823
Query: 726 ILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
L E+ + F L +LS G +E G + L VG
Sbjct: 824 HLQPGEQRTLTFTLD-ARALSDVDRTGTRAVEAGDYRLFVG 863
>gi|285018984|ref|YP_003376695.1| beta-glucosidase [Xanthomonas albilineans GPE PC73]
gi|283474202|emb|CBA16703.1| putative beta-glucosidase protein [Xanthomonas albilineans GPE
PC73]
Length = 904
Score = 303 bits (776), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 176/420 (41%), Positives = 242/420 (57%), Gaps = 32/420 (7%)
Query: 55 RARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGAT 114
RA LV+++T EKI+Q +N APAIPRLGIPAYEWWSE LHG+A G+ AT
Sbjct: 55 RATALVAKMTRAEKIAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGE----------AT 104
Query: 115 SFPQVILTAASFDSYLWYRIGQAIGLEARALYN-AG-------QAIGMTFWAPNINIFRD 166
FPQ I AAS+++ L + +G EARA +N AG + G+T W+PNINIFRD
Sbjct: 105 VFPQAIGLAASWNTDLLHAVGTVTSTEARAKFNLAGGPGKNHARYGGLTIWSPNINIFRD 164
Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
PRWGRG ET GEDP +TG+ AV ++ G+QGD + A KH + +
Sbjct: 165 PRWGRGMETYGEDPYLTGQLAVGFIHGLQGDDPT------HPRTIATPKHLAVH---SGP 215
Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
+ R+ FD V+ D TY P F + + +G A +MCAYN ++GIP+CA L+ R
Sbjct: 216 ESGRHGFDVDVSPHDFEATYSPAFRAAIVEGHAGSVMCAYNALHGIPACAADWLIDGRVR 275
Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQK 346
WGF G++ SDCDA+ + Y + LKAG D+NCG + + A+ +
Sbjct: 276 GNWGFKGFVVSDCDAIDDMTQFHYYRADNAGSAAAALKAGHDLNCG-YAYRDLGTALDRG 334
Query: 347 KLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVL 406
+ E+ +DR+L LF+ R RLG + P+ ++GA + SP H+ LALQAAQ +VL
Sbjct: 335 EAEEAMLDRSLVRLFAARYRLGELQPR-SKDPYARLGAKDIDSPTHRALALQAAQQSLVL 393
Query: 407 LKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENT-VYY 465
L+N + LPL + LA+IGPNA++ L NY G S +TPLQ L+ T V+Y
Sbjct: 394 LQNRNDTLPL--RPGLRLAVIGPNADALAALEANYQGTSVAPVTPLQGLRARFGTTQVHY 451
Score = 134 bits (338), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 97/281 (34%), Positives = 137/281 (48%), Gaps = 52/281 (18%)
Query: 496 GLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
GL E EEL DR DL LP QQ L+ R A+A+ KP+I+VL+ G V +
Sbjct: 641 GLSPDVEGEELRIDVPGFDGGDRNDLSLPAAQQALLER-AKASGKPLIVVLMSGSAVALN 699
Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRP 605
+AK ++ +IL A YPG++G A+A+ + GD NPGGRLP+T+Y P M+
Sbjct: 700 WAK--QHADAILAAWYPGQSGGTAIAQALAGDINPGGRLPVTFYRSTKDLPPYVSYDMK- 756
Query: 606 QATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVV 665
GRTYR+++G+ +FPFG GLSY+ ++Y +S L + D +
Sbjct: 757 -------GRTYRYFKGEALFPFGYGLSYTHFAYTAPQLSSTTL-----------QAGDTL 798
Query: 666 HYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSV 725
H VT V+N G AG V ++++ R P++ LVGFQ V
Sbjct: 799 H------------------VTTTVRNTGARAGDEVVQVYLQYPPRAQS-PLRALVGFQRV 839
Query: 726 ILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
L E + F L P LS G +E G + L VG
Sbjct: 840 SLQPGEARTLSFALEP-RQLSDVDRSGQRAVEAGDYRLFVG 879
>gi|219118959|ref|XP_002180246.1| beta-xylosidase [Phaeodactylum tricornutum CCAP 1055/1]
gi|217408503|gb|EEC48437.1| beta-xylosidase [Phaeodactylum tricornutum CCAP 1055/1]
Length = 682
Score = 303 bits (776), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 209/619 (33%), Positives = 309/619 (49%), Gaps = 61/619 (9%)
Query: 37 NPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPA----------IPRLGIPA 86
N + P+C +L I +R DL+S LTLDEK+ ++ + P + R+G+P
Sbjct: 64 NDVAKALPYCDMSLSIDERLEDLLSHLTLDEKVD-MIGADPTQDVCMTHTMNVSRIGLPD 122
Query: 87 YEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY 146
Y W E VG + AT F + AASF+ W+ G G E RAL
Sbjct: 123 YYWLVET---NTAVGSACIAEN--KCATEFSGPLSIAASFNRSSWFLKGSVFGTEQRALM 177
Query: 147 N---------AGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGD 197
N +G+ IG+T + PNIN RDPR+GR E PGEDP ++G+YA V+G+Q
Sbjct: 178 NVHGERFHTHSGRHIGLTAFGPNINQQRDPRFGRSSELPGEDPFLSGQYAAHMVQGMQER 237
Query: 198 TFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQG 257
NG + A KHFTAY + +G Y ++M DL DTY P +E + QG
Sbjct: 238 DANGYP-----KVLAYLKHFTAYSREEGRGNDDYN----ISMYDLFDTYLPQYEMGMVQG 288
Query: 258 RASGIMCAYNRVNGIPSCADRNLLSKTARRQWG-FHGYITSDCDAVSIIYDAEGYAKSPE 316
A+G+MC+YN VNGIP+CA+ LL+K R++W ++T+DC AV+ + A
Sbjct: 289 GATGVMCSYNAVNGIPACANDYLLNKILRQRWNRSDAHVTTDCGAVNNLRGKPIQAADEA 348
Query: 317 DAVVDVLKAGMDVNCGSFLQKHT-KAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPT 375
A L G D+ GS L H A+ E +++A+ + G F+ +PT
Sbjct: 349 QAAAMALMNGADIEMGSTLFVHNLTTAITLGYATEEAVNQAIRRSYRPHFIAGRFD-DPT 407
Query: 376 MQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAK 435
+ + +G D + S HQ + L+AA G+VLLK+ +LP+ + LA++GP +
Sbjct: 408 LSEWFSLGLDDIQSKKHQEIQLEAALQGLVLLKHEDSILPI--AAGTKLAVLGPLGMTRS 465
Query: 436 TLLGNY--------AGPSC-RSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAK 486
L+ +Y G C ++ N E TV G D + +++ +++ + +A
Sbjct: 466 GLMSDYESDQSCFGGGHDCIPTLAESIGFINGKEFTVAAAGVDVDSRNTSDVERILQLAA 525
Query: 487 GADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITF 546
D +VL +G +TQE+E DR D LPG+Q L V +KPV+LVL+ GG + +
Sbjct: 526 DRDLIVLCLGNTKTQEQEGFDRKDTALPGQQYALFEAVL-TLRKPVVLVLVNGGQIAL-- 582
Query: 547 AKYDRNIG---SILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKM 603
D G +I+ A P G ALA +FG N G+LP T YP M M
Sbjct: 583 ---DGMTGYPSAIIEAFNPNGIGGTALAASLFGQENRWGKLPYTIYPYSV----MQSFDM 635
Query: 604 RPQATSGNPGRTYRFYEGK 622
+ + S PGRTYR++ GK
Sbjct: 636 KDHSMSAPPGRTYRYFTGK 654
>gi|427387362|ref|ZP_18883418.1| hypothetical protein HMPREF9447_04451 [Bacteroides oleiciplenus YIT
12058]
gi|425725523|gb|EKU88394.1| hypothetical protein HMPREF9447_04451 [Bacteroides oleiciplenus YIT
12058]
Length = 865
Score = 303 bits (775), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 177/440 (40%), Positives = 251/440 (57%), Gaps = 33/440 (7%)
Query: 44 PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
P+ L S+RA DL+ R+TL+EKISQ+ N +PAI RLGIPAY WW+EALHGVA GK
Sbjct: 24 PYKNPDLTPSERAWDLLKRMTLEEKISQMKNGSPAIERLGIPAYNWWNEALHGVARAGK- 82
Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTF 156
AT FPQ I AA+FD+ + + EARA Y+ Q G+TF
Sbjct: 83 ---------ATVFPQAIGLAATFDNQAVHETFSIVSDEARAKYHDFQRKGERDGYKGLTF 133
Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKH 216
W PNINI+RDPRWGRG ET GEDP +T ++ V+G+QGD GK + AC KH
Sbjct: 134 WTPNINIYRDPRWGRGMETYGEDPYLTSLMGLAVVKGLQGD--GTGKYD---KTHACAKH 188
Query: 217 FTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSC 275
+ + W R+ FDA+ ++ +DL +TY P F++ V +G+ +MCAYNR G P C
Sbjct: 189 YAVHSGPEW---NRHSFDAKNISQRDLWETYLPAFKTLVTEGKVKEVMCAYNRYEGEPCC 245
Query: 276 ADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDVNCGS 333
+++ LL + R WG+ + SDC A+ Y + P A D + +G D+ CG
Sbjct: 246 SNKQLLIRILREDWGYDDIVVSDCGAIGDFYYPNHHETHPTAAAASADAVVSGTDLECGG 305
Query: 334 FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQ 393
+ AV++ + E +I+ ++ L R +LG+F+ N T+ + +I VV S H
Sbjct: 306 SYSSLNE-AVRKGLISEDKINESVFRLLRARFQLGMFDDN-TLVSWSEIPYSVVESKEHV 363
Query: 394 VLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQ 453
AL+ A+ +VLL N + +LPL KS +A++GPNAN + L NY G +S+T L+
Sbjct: 364 AKALEMARKSMVLLTNKNNILPLSKSVR-KVAVLGPNANDSVMLWANYNGFPTKSVTILE 422
Query: 454 ALQNYV-ENTVYYP-GCDTV 471
++N + E VYY GCD V
Sbjct: 423 GIRNKLPEGAVYYEKGCDFV 442
Score = 115 bits (287), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 80/267 (29%), Positives = 130/267 (48%), Gaps = 51/267 (19%)
Query: 483 DIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPV 532
D A AD ++ + GL + E EE+ DR ++ LP Q+E++ + + K PV
Sbjct: 596 DKAAEADVIIFVGGLSSSLEGEEMPVDLPGFRKGDRTNIDLPQVQEEMLKALKKTGK-PV 654
Query: 533 ILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQD 592
+ VL G + + + N+ +I+ A YPG+ G A+A+V+FGD+NP GRLP+T+Y
Sbjct: 655 VFVLCSGSTLALPWEA--ENLDAIIEAWYPGQQGGTAVADVLFGDYNPAGRLPLTFYASS 712
Query: 593 YIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQ 652
D M RTYR+++G+ +FPFG GLSY+ + Y KA + K+
Sbjct: 713 SDLPDFEDYDM--------SNRTYRYFKGRPLFPFGHGLSYTTFDYG-KAKADKKILRAG 763
Query: 653 SSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGN 712
T +TI +KN G+++G V ++++
Sbjct: 764 EGLT----------------------------LTIPLKNIGKLSGDEVVQVYLRNPGDKE 795
Query: 713 GRPIKQLVGFQSVILNAKEKAEIVFEL 739
G PIK L F+ + L A + +++FEL
Sbjct: 796 G-PIKTLRAFRRISLEAGQAEDVLFEL 821
>gi|390957160|ref|YP_006420917.1| beta-glucosidase-like glycosyl hydrolase [Terriglobus roseus DSM
18391]
gi|390412078|gb|AFL87582.1| beta-glucosidase-like glycosyl hydrolase [Terriglobus roseus DSM
18391]
Length = 908
Score = 303 bits (775), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 185/438 (42%), Positives = 248/438 (56%), Gaps = 33/438 (7%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
+ L QRA DLV R+TL+EK Q+VN A AIPRL +PAY++W+E LHGVA G
Sbjct: 24 YLNPALTPQQRAADLVGRMTLEEKSLQMVNGAAAIPRLNVPAYDYWNEGLHGVARSGY-- 81
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTFW 157
AT FPQ I AA++D+ L +IG I EARA N G+TFW
Sbjct: 82 --------ATMFPQAIGMAATWDAPLLKQIGDVIATEARAKNNEALRRNNHDIYFGLTFW 133
Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
+PNINIFRDPRWGRGQET GEDP +T + V+++ G+QG K+ A KHF
Sbjct: 134 SPNINIFRDPRWGRGQETYGEDPHLTTQLGVNFIEGLQGTDPKFYKV------IATPKHF 187
Query: 218 TAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCAD 277
+ + R+KFD T DL DTY P F + + +A IMCAYNR++G P+C
Sbjct: 188 AVH---SGPEEGRHKFDVEPTPHDLWDTYLPQFRAAIVDAKADSIMCAYNRIDGQPACGS 244
Query: 278 RNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVD--VLKAGMDVNCGSFL 335
+ LL R W F G++TSDC A+ + + P+ D L AG D NCGS
Sbjct: 245 KLLLVDILRNDWKFQGFVTSDCGAIDDFFRPNTHQTEPDAEHADKAALLAGTDTNCGSTY 304
Query: 336 QKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVL 395
+K AVK + ES+ID +L LF R+RLGLF+ ++ P+ +I V SPA+ +
Sbjct: 305 RKLGD-AVKSGLIKESDIDVSLRRLFEARVRLGLFDPAGSV-PYAQIPFSQVNSPANAAV 362
Query: 396 ALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL 455
A +AA++ +VLLKN G+LPL K ++A+IGPN S +L GNY G + P+ AL
Sbjct: 363 AKRAAEESMVLLKND-GILPLKAGKYKTIAVIGPNGASLSSLEGNYNGMAHDPRMPVDAL 421
Query: 456 QNYVE--NTVYYPGCDTV 471
++ + N VY PG V
Sbjct: 422 RSALSGTNVVYAPGAPYV 439
Score = 139 bits (350), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 96/301 (31%), Positives = 148/301 (49%), Gaps = 53/301 (17%)
Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
+ +A++ A +D VV M+GL E EE+ DR D+ LP QQ L+ + A
Sbjct: 621 LPEALEAANKSDLVVAMLGLSPDLEGEEMPVKLPGFVGGDRTDISLPASQQALLQGLI-A 679
Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMT 587
KP I+VLL G + I A D +IL + YPGEAG+ ALA+ + G +NP GRLP+T
Sbjct: 680 TGKPTIVVLLNGSALAINLA--DEKANAILESWYPGEAGSTALADTLVGRNNPSGRLPIT 737
Query: 588 WYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNK 647
+Y + D M+ RTYR+++G ++ FG GLSY+K++Y +++ K
Sbjct: 738 FYKSESDLPGFEDYSMQ--------NRTYRYFKGAPLYGFGFGLSYTKFAYSGLKLAKAK 789
Query: 648 LYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKP 707
L + + ++ VKN G++AG+ L++ P
Sbjct: 790 LNAGDTLTAEVT-----------------------------VKNTGKVAGEEVAELYLLP 820
Query: 708 ARRGNG--RPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVV 765
GN P +QL GFQ V+L E ++ F L+P + LS G I+ GT+ + +
Sbjct: 821 PAEGNAGLSPKQQLEGFQRVMLKPGESRKLTFTLTPRQ-LSEVDAKGTRAIQPGTYAIAI 879
Query: 766 G 766
G
Sbjct: 880 G 880
>gi|325922365|ref|ZP_08184139.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas gardneri ATCC
19865]
gi|325547147|gb|EGD18227.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas gardneri ATCC
19865]
Length = 889
Score = 303 bits (775), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 168/412 (40%), Positives = 238/412 (57%), Gaps = 31/412 (7%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
QRA LV++++ +EK++Q +N APAIPRLGIPAYEWWSE LHG+A G A
Sbjct: 39 QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------A 88
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGMTFWAPNINIFR 165
T FPQ I AAS+++ L ++G + EARA +N + G+T W+PNINIFR
Sbjct: 89 TVFPQAIGLAASWNTQLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFR 148
Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
DPRWGRG ET GEDP +TG+ AV ++RG+QGD + + A KH + +
Sbjct: 149 DPRWGRGMETYGEDPFLTGQLAVGFIRGLQGDDLD------HPRTIATPKHIAVH---SG 199
Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
R+ FD V+ +D+ TY P F + + G+A +MCAYN ++G P+CA LL+
Sbjct: 200 PEPGRHSFDVDVSPRDVEATYTPAFRAALIDGQAGSVMCAYNSLHGTPACAADWLLNGRV 259
Query: 286 RRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQ 345
R WGF G++ SDCDAV + + + LKAG D+NCG + + A+++
Sbjct: 260 RGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAASLKAGHDLNCG-YAYRALGTAIER 318
Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
++ E+ +D++L LF+ R RLG P P+ +GA + + A++ LAL+AA IV
Sbjct: 319 GEVDEALLDQSLVRLFAARYRLGELEA-PHKDPYATLGAKDIDNTANRALALKAAAQSIV 377
Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQN 457
LLKN LPL LA+IGPNA++ L NY G S +TPL L+
Sbjct: 378 LLKNDANTLPL--KAGARLAVIGPNADALAALEANYQGTSSTPVTPLLGLRQ 427
Score = 129 bits (323), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 93/281 (33%), Positives = 136/281 (48%), Gaps = 52/281 (18%)
Query: 496 GLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
GL E EEL DR D+ LP QQ L+ R A+A+ KP+++VL+ G V +
Sbjct: 626 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALN 684
Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRP 605
+AK + +I+ A YPG++G A+A ++ GD NPGGRLP+T+Y P M+
Sbjct: 685 WAKTHAD--AIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFYRSTKDLPPYVSYDMK- 741
Query: 606 QATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVV 665
GRTYR+++G+ +FPFG GLSY+ ++Y +S L Q+ ST
Sbjct: 742 -------GRTYRYFKGEPLFPFGYGLSYTSFAYGAPQLSSTTL---QAGST--------- 782
Query: 666 HYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSV 725
VT V+N G AG ++++ R P++ LVGFQ V
Sbjct: 783 -----------------LQVTTTVRNTGTRAGDEVAQVYLQYPDRPQS-PLRSLVGFQRV 824
Query: 726 ILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
L E+ + F L +LS G +E G + L VG
Sbjct: 825 HLKPGEQRTLTFTLD-ARALSDVDRTGQRAVEAGDYTLFVG 864
>gi|86143269|ref|ZP_01061671.1| beta-glucosidase precursor [Leeuwenhoekiella blandensis MED217]
gi|85830174|gb|EAQ48634.1| beta-glucosidase precursor [Leeuwenhoekiella blandensis MED217]
Length = 873
Score = 302 bits (774), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 176/436 (40%), Positives = 242/436 (55%), Gaps = 30/436 (6%)
Query: 31 FSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWW 90
F C + + FPF L + R DLVSR+TL+EKISQL++ APAI RL IP Y WW
Sbjct: 12 FCCCIQIEAQKQFPFQNEQLDLETRLNDLVSRMTLEEKISQLMSDAPAIERLNIPKYNWW 71
Query: 91 SEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA-----L 145
+E+LHGVA G AT FPQ I AAS+D+ L + AI EARA L
Sbjct: 72 NESLHGVARAGY----------ATVFPQSISIAASWDAQLVREVATAISDEARAKHHEYL 121
Query: 146 YNAGQAI--GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGK 203
I G+T W+PNINIFRDPRWGRG ET GEDP +TG YV+G+QGD
Sbjct: 122 RRDQHDIYQGLTMWSPNINIFRDPRWGRGHETYGEDPFLTGTLGAQYVKGLQGDD----- 176
Query: 204 LKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIM 263
L+ A KHF + + +R+ FDA + +DL +TY P F VK + +M
Sbjct: 177 -PEYLKVVATAKHFAVH---SGPEESRHYFDANTSERDLWETYLPAFRMLVKDAQVQSVM 232
Query: 264 CAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVL 323
AYNR G + +++ LL R +WGF GY+ SDC A++ I++ A L
Sbjct: 233 TAYNRFRGEAASSNK-LLFDILRNKWGFDGYVVSDCGAINDIWEDHKITADAASASALAL 291
Query: 324 KAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIG 383
+ G D+NCG+ K K A+ + E +I+ A+ LF R++LG+F+ + + I
Sbjct: 292 ETGTDLNCGA-TYKSLKEAIANGLITEEKINIAIERLFRARLKLGMFDTEENLS-YATIP 349
Query: 384 ADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG 443
V + +H LA +AAQ+ IVLLKN +LPL K +A+IGPNA++ ++L GNY G
Sbjct: 350 FSVNTNASHTALARKAAQESIVLLKNEAHMLPLSKDLK-QIAVIGPNAHNVQSLWGNYNG 408
Query: 444 PSCRSITPLQALQNYV 459
+T +Q ++N V
Sbjct: 409 TPKNPVTVVQGIRNKV 424
Score = 142 bits (357), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 97/299 (32%), Positives = 150/299 (50%), Gaps = 52/299 (17%)
Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
+++AV++A+ +D +L++GL++ E EE+ DR L LP Q+EL+ R A
Sbjct: 589 LERAVNLAEDSDVTILVLGLNERLEGEEMRIDVEGFSKGDRTALDLPLEQRELM-RALVA 647
Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMT 587
KP++LVLL G + I +A+ ++ +IL AGYPG+ G A+A+V+FGD+NP GRLP+T
Sbjct: 648 TGKPIVLVLLNGSALAINYAQ--EHVPAILSAGYPGQEGGNAIADVLFGDYNPAGRLPVT 705
Query: 588 WYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNK 647
+Y D M+ GRTYR++EG+ ++PFG GLSY+++SY
Sbjct: 706 YYKSVDDLPDFEDYSMK--------GRTYRYFEGEALYPFGYGLSYTQFSY--------- 748
Query: 648 LYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKP 707
D + K+ L + + V + V N G+ G V L++K
Sbjct: 749 ---------------DAI--KTSGRLAAD----KVLNVQVTVTNSGDRDGDEVVQLYLKD 787
Query: 708 ARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
RP QLVGF+ + L E + F L S + +V+E G L G
Sbjct: 788 EVASTTRPQVQLVGFKRIHLQKGETQTVEFRLD-ARQFSMINDQEQLVVEPGWFTLYAG 845
>gi|116181370|ref|XP_001220534.1| hypothetical protein CHGG_01313 [Chaetomium globosum CBS 148.51]
gi|88185610|gb|EAQ93078.1| hypothetical protein CHGG_01313 [Chaetomium globosum CBS 148.51]
Length = 549
Score = 302 bits (774), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 193/520 (37%), Positives = 277/520 (53%), Gaps = 43/520 (8%)
Query: 33 CDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSE 92
CDP K T P +RA LV L ++EK+ LV+ + RLG+PAY WWSE
Sbjct: 39 CDP-----------KATPP--ERAAALVKALNIEEKLQNLVDMSKGAERLGLPAYAWWSE 85
Query: 93 ALHGVAGVGKGIFFNGTIRG----ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA 148
ALHGVA G+ FN T G ATSF I +A+FD L Y++ I EARA NA
Sbjct: 86 ALHGVA-ASPGVRFNRTAGGRFSSATSFANSITLSAAFDDELVYKVADTISTEARAFANA 144
Query: 149 GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKL 208
G A G+ +W PNIN ++DPRWGRG ETPGEDP+ Y + + G++GD + K+
Sbjct: 145 GLA-GLDYWTPNINPYKDPRWGRGHETPGEDPVRIKGYVKALLAGLEGDDPSIRKV---- 199
Query: 209 QASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
A CKH+ AYDL+ W+GTTR++FDA V++QDL++ Y PPF+ C + + MC+YN
Sbjct: 200 --VATCKHYAAYDLERWQGTTRHRFDAVVSLQDLSEYYLPPFQQCARDSKVGSFMCSYNA 257
Query: 269 VNGIPSCADRNLLSKTARRQWGF---HGYITSDCDAVSIIYDAEGY-----AKSPEDAVV 320
+NG P+CA L+ R+ WG+ + YITSDC+A+ + ++ +A
Sbjct: 258 LNGTPACASTYLMDDILRKHWGWTEHNNYITSDCNAIQDFLPGPKWHNFSSTQTEAEAAA 317
Query: 321 DVLKAGMDVNC---GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQ 377
+AG D C G A Q L E ID AL L+ +R+G F+ +
Sbjct: 318 VAYQAGTDTVCEVPGWPPYTDVIGAYNQTLLSEEVIDTALKRLYEGLVRVGYFD-PASGS 376
Query: 378 PFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLP-KSKSVSLALIGPNANSAKT 436
P+ IG + V +P Q LALQ+ DG+VLLKN G LPL + K+V+L N+ +
Sbjct: 377 PYRSIGWEDVNTPEAQELALQSGTDGLVLLKND-GTLPLNLEDKTVALIGFWANSTNGGR 435
Query: 437 LLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASID----KAVDIAKGADHVV 492
+LG Y+G +P+ A + Y G + A+ID KA++ AK ++ ++
Sbjct: 436 ILGGYSGFPPYIHSPVDAAEKLNLTYHYASGPLAENITQAAIDDWVAKALEPAKKSNVIL 495
Query: 493 LMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPV 532
G D + E+LDR + P Q +I ++ + P
Sbjct: 496 YFGGTDTSIAAEDLDRDSIAWPEIQLAVIEALSALRQAPA 535
>gi|423227459|ref|ZP_17213920.1| hypothetical protein HMPREF1062_06106 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392623089|gb|EIY17195.1| hypothetical protein HMPREF1062_06106 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 864
Score = 302 bits (773), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 178/447 (39%), Positives = 256/447 (57%), Gaps = 33/447 (7%)
Query: 37 NPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHG 96
N + + P+ L S+RA DL+ R+TL+EK+SQ+ N +PAI RLGIPAY+WW+EALHG
Sbjct: 16 NVTAQNEPYKNPELSPSERAWDLLKRMTLEEKVSQMKNGSPAIERLGIPAYDWWNEALHG 75
Query: 97 VAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA----- 151
VA GK AT FPQ I AA+FD+ Y + EARA Y+ Q
Sbjct: 76 VARAGK----------ATVFPQAIGLAATFDNQAVYETFDIVSDEARAKYHDFQRKGERD 125
Query: 152 --IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQ 209
G+TFW PNINI+RDPRWGRG ET GEDP +T ++ V+G+QG GG K +
Sbjct: 126 GYKGLTFWTPNINIYRDPRWGRGMETYGEDPYLTSLMGLAVVKGLQG----GGTGKYD-K 180
Query: 210 ASACCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
A AC KH+ + W R+ FDA+ ++ +DL +TY P F++ VK+G+ +MCAYNR
Sbjct: 181 AHACAKHYAVHSGPEW---NRHSFDAKNISQRDLWETYLPAFKTLVKEGKVKEVMCAYNR 237
Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAG 326
G P C+++ LL + R WG+ + SDC A+ Y + P A D + +G
Sbjct: 238 FEGEPCCSNKQLLIRILREDWGYDDIVVSDCGAIGDFYYPNHHETHPTAAAASADAVVSG 297
Query: 327 MDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADV 386
D+ CG + AV++ + E +I+ ++ L R +LG+F+ + + + +I V
Sbjct: 298 TDLECGGSYSSLNE-AVRKGLISEEKINESVFRLLRARFQLGMFD-DDALVSWSEIPYSV 355
Query: 387 VCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSC 446
V S H AL+ A+ +VLL N + LPL KS +A++GPNAN + L NY G
Sbjct: 356 VESKEHVAKALEMARKSMVLLTNKNHTLPLSKSIR-KVAVLGPNANDSVMLWANYNGFPT 414
Query: 447 RSITPLQALQNYV-ENTVYYP-GCDTV 471
+S+T L+ +++ + E TVYY GCD V
Sbjct: 415 KSVTILEGIKSKLPEGTVYYEKGCDYV 441
Score = 112 bits (280), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 81/270 (30%), Positives = 125/270 (46%), Gaps = 51/270 (18%)
Query: 480 KAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAK 529
+ D A AD ++ + GL T E EE+ DR ++ LP Q E++ + + K
Sbjct: 592 EVADKAAEADAIIFVGGLSPTLEGEEMPVDLPGFRKGDRTNIDLPHVQAEMLKALKKTGK 651
Query: 530 KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWY 589
PVI VL G + + + N+ +IL A YPG+ G A+A+V+FGD+NP GRLP+T+Y
Sbjct: 652 -PVIFVLCSGSTLALPWEA--ENLDAILEAWYPGQQGGTAVADVLFGDYNPAGRLPLTFY 708
Query: 590 PQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLY 649
D M RTYR+++GK +FPFG GLSY+ + Y V + +
Sbjct: 709 ASSDDLPDFEDYDMS--------NRTYRYFKGKALFPFGHGLSYTIFDYGKAKVDKQNVR 760
Query: 650 LNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPAR 709
+ + +TI +KN G++ G + ++++
Sbjct: 761 AGEGMT-----------------------------LTIPLKNTGKLDGDEVIQVYLRNPA 791
Query: 710 RGNGRPIKQLVGFQSVILNAKEKAEIVFEL 739
G PIK L F+ V L A + I EL
Sbjct: 792 DKEG-PIKTLRAFRRVSLPAGQTENIRIEL 820
>gi|383114360|ref|ZP_09935124.1| hypothetical protein BSGG_1469 [Bacteroides sp. D2]
gi|313693934|gb|EFS30769.1| hypothetical protein BSGG_1469 [Bacteroides sp. D2]
Length = 863
Score = 302 bits (773), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 170/439 (38%), Positives = 247/439 (56%), Gaps = 38/439 (8%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
+P+ T L + QRA DL+ RLTL+EK++ + N++PAIPRLGI YEWW+EALHGVA G
Sbjct: 26 YPYQDTKLTVEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARAGL 85
Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA----LYNAGQAI---GMT 155
AT FPQ I AASF+ L Y + A+ EARA GQ G+T
Sbjct: 86 ----------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNEKGQYKRYQGLT 135
Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASACC 214
W PN+NIFRDPRWGRGQET GEDP ++G+ ++ VRG+QG + KL AC
Sbjct: 136 MWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYDKLH------ACA 189
Query: 215 KHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
KHF + W R+ F+A + +DL +TY P F+ V++ +MCAYNR G P
Sbjct: 190 KHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYNRFEGDP 246
Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE--DAVVDVLKAGMDVNC 331
C LL++ R WGF G + +DC A+ + + + P+ A D + +G D+ C
Sbjct: 247 CCGSNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKKHETHPDAAHASADAVLSGTDLEC 306
Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
G + T AVK+ + E +I+ ++ L R LG N + P+ I V+ P
Sbjct: 307 GGNFKSITD-AVKKGLISEEKINTSVKRLLKARFELGEMN---STHPWSNIPFSVIDCPK 362
Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITP 451
H+ LAL+ A + +VLL+N++ +LPL ++ + +A+IGPNAN + GNY G ++T
Sbjct: 363 HKELALKMAHESLVLLQNNNNILPL--NRQMKVAVIGPNANDSVMQWGNYNGFPSHTVTL 420
Query: 452 LQALQNYVENT--VYYPGC 468
L+ ++ + + +Y P C
Sbjct: 421 LEGIRAKLPDAQIIYEPVC 439
Score = 103 bits (257), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 82/264 (31%), Positives = 120/264 (45%), Gaps = 51/264 (19%)
Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
+ AD V+ G+ E E + DR ++ LP Q+E++ + + KK + V
Sbjct: 598 QSADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGKK-TVFV 656
Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK 595
G + I +N +IL A YPG+AG A+A+V+FGD+NP GRLP+T+Y
Sbjct: 657 NFSGSAMAIV--PETQNCDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITFYKSMQQL 714
Query: 596 VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSS 655
D M+ GRTYRF ++PFG GLSY+++SY K LNQS
Sbjct: 715 PDYEDYSMK--------GRTYRFMTETPLYPFGYGLSYTRFSY-------GKATLNQSKL 759
Query: 656 TKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRP 715
TK K ++TI V N G+ G+ V +++ G P
Sbjct: 760 TK----------------------GEKAILTIPVSNVGQRDGEEVVQVYICRPDDKEG-P 796
Query: 716 IKQLVGFQSVILNAKEKAEIVFEL 739
K L GFQ V + + + EL
Sbjct: 797 QKTLRGFQRVSIAKGKTQNVQIEL 820
>gi|217967241|ref|YP_002352747.1| glycoside hydrolase family 3 [Dictyoglomus turgidum DSM 6724]
gi|217336340|gb|ACK42133.1| glycoside hydrolase family 3 domain protein [Dictyoglomus turgidum
DSM 6724]
Length = 762
Score = 301 bits (772), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 225/715 (31%), Positives = 361/715 (50%), Gaps = 101/715 (14%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIPA E L G +GAT FPQ I A++F+ L R+ I
Sbjct: 87 RLGIPAI-IHEECLSGFMA-----------KGATVFPQAIGMASTFEPELIRRVSDVIRQ 134
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
RA N Q + +P ++I RDPRWGR +ET GEDP + + A YV+G+QG+ +
Sbjct: 135 HMRAA-NVHQGL-----SPVLDIPRDPRWGRTEETFGEDPYLVSRMAAEYVKGLQGEDWR 188
Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
G + A KHFTAY + R A+V ++L + + PFE +K+G+A
Sbjct: 189 EGII-------ATVKHFTAYGISE---GARNLGPAKVGERELREVFLFPFEVAIKEGQAG 238
Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
+M AY+ ++G+P + + LL+K R +WGF GY+ SD A+ ++ + AK ++A V
Sbjct: 239 SLMNAYHEIDGVPCASSKFLLTKILRWEWGFKGYVVSDYIAIRMLENFHRVAKDAKEAAV 298
Query: 321 DVLKAGMDVNCGSF--LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQP 378
L+AG+D+ S + AVK+ + E I+ ++ + + LGLF+G+ P
Sbjct: 299 LALEAGIDIELPSVDCYGEPLIQAVKEGLISEEVINASVERVLRAKFMLGLFDGDLEKDP 358
Query: 379 FGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLL 438
K D+ P + L+ + A+ IVLLKN G+LPL K+ ++A+IGPNA++ + L
Sbjct: 359 --KKVYDIFDKPEFRELSREVARRSIVLLKND-GILPLSKNIR-TVAVIGPNADNPRNLH 414
Query: 439 GNYAG----PS---------------CRSITPLQALQNYVE---NTVYYPGCDTVACSSA 476
G+Y+ PS R+++ L+ ++N V +Y GC+ ++ S
Sbjct: 415 GDYSYTAHIPSVSETLEGVKIPEECAVRTVSILEGIKNKVSAETQVLYAKGCEILSDSKE 474
Query: 477 SIDKAVDIAKGADHVVLMMG-----LDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKP 531
D+A++IAK AD ++ +MG + E DR L L G Q++L+ + + KP
Sbjct: 475 GFDEAIEIAKRADVIIAVMGEESGLFHRGISGEGNDRTTLELFGIQRDLLRELHKLG-KP 533
Query: 532 VILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQ 591
++LVL+ G P + + N+ +IL A YPGE G A+A+VIFGD+NP G+LP++ +P
Sbjct: 534 IVLVLVNGRPQALKWEH--ENLNAILEAWYPGEEGGDAVADVIFGDYNPSGKLPIS-FPA 590
Query: 592 DYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLN 651
+VP+ +P A + Y K ++PFG GLSY+ + Y S K++
Sbjct: 591 VTGQVPVY-YNRKPSAFT-----DYVEESAKPLYPFGHGLSYTTFEY-----SNLKIHPE 639
Query: 652 QSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRG 711
+ ++ + VE ++ +KN G G+ V L+V
Sbjct: 640 KVNALEKVE------------------------ISFTIKNTGVREGEEVVQLYVHDQVAS 675
Query: 712 NGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
RP+K+L GF+ + L E + F L P E L+ E V+E+G +++G
Sbjct: 676 LERPVKELKGFKKIHLKPGESKRVTFILYP-EQLAFYDEFMRFVVEKGIFEIMIG 729
>gi|293370402|ref|ZP_06616956.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
gi|292634550|gb|EFF53085.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
Length = 863
Score = 301 bits (771), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 169/439 (38%), Positives = 247/439 (56%), Gaps = 38/439 (8%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
+P+ T L + QRA DL+ RLTL+EK++ + N++PAIPRLGI YEWW+EALHGVA G
Sbjct: 26 YPYQDTKLTVEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARAGL 85
Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA----LYNAGQAI---GMT 155
AT FPQ I AASF+ L Y + A+ EARA GQ G+T
Sbjct: 86 ----------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNEKGQYKRYQGLT 135
Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASACC 214
W PN+NIFRDPRWGRGQET GEDP ++G+ ++ VRG+QG + KL AC
Sbjct: 136 MWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYDKLH------ACA 189
Query: 215 KHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
KHF + W R+ F+A + +DL +TY P F+ V++ +MCAYNR G P
Sbjct: 190 KHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYNRFEGDP 246
Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE--DAVVDVLKAGMDVNC 331
C LL++ R WGF G + +DC A+ + + + P+ A D + +G D+ C
Sbjct: 247 CCGSNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKKHETHPDAAHASADAVLSGTDLEC 306
Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
G + T AVK+ + E +I+ ++ + R LG N + P+ I V+ P
Sbjct: 307 GGNFKSITD-AVKKDLISEEKINTSVKRVLKARFELGEMN---STHPWSNIPFSVIDCPK 362
Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITP 451
H+ LAL+ A + +VLL+N++ +LPL ++ + +A+IGPNAN + GNY G ++T
Sbjct: 363 HKELALKMAHESLVLLQNNNNILPL--NRQMKVAVIGPNANDSVMQWGNYNGFPSHTVTL 420
Query: 452 LQALQNYVENT--VYYPGC 468
L+ ++ + + +Y P C
Sbjct: 421 LEGIRAKLPDAQIIYEPVC 439
Score = 103 bits (257), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 82/264 (31%), Positives = 120/264 (45%), Gaps = 51/264 (19%)
Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
+ AD V+ G+ E E + DR ++ LP Q+E++ + + KK + V
Sbjct: 598 QSADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGKK-TVFV 656
Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK 595
G + I +N +IL A YPG+AG A+A+V+FGD+NP GRLP+T+Y
Sbjct: 657 NFSGSAMAIV--PETQNCDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITFYKSMQQL 714
Query: 596 VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSS 655
D M+ GRTYRF ++PFG GLSY+++SY K LNQS
Sbjct: 715 PDYEDYSMK--------GRTYRFMTETPLYPFGYGLSYTRFSY-------GKATLNQSKL 759
Query: 656 TKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRP 715
TK K ++TI V N G+ G+ V +++ G P
Sbjct: 760 TK----------------------GEKAILTIPVSNVGQRDGEEVVQVYICRPDDKEG-P 796
Query: 716 IKQLVGFQSVILNAKEKAEIVFEL 739
K L GFQ V + + + EL
Sbjct: 797 QKTLRGFQRVSIAKGKTQNVQIEL 820
>gi|297740661|emb|CBI30843.3| unnamed protein product [Vitis vinifera]
Length = 401
Score = 301 bits (771), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 172/434 (39%), Positives = 250/434 (57%), Gaps = 42/434 (9%)
Query: 345 QKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGI 404
Q K E ++D +L NL+ V ++G F+G P+ + K +C+ H LA AA+ GI
Sbjct: 2 QGKAREEDVDTSLRNLYIVLTQVGFFDGIPSYESLDK---KDLCTKEHIELAADAARQGI 58
Query: 405 VLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVY 464
VLLKN + LPL +K +LALIGP+AN+ +LGNYAG C+ +PL Y + T Y
Sbjct: 59 VLLKNINETLPLDPAKLKNLALIGPHANATIEMLGNYAGVPCQYSSPLDGFSAYGKVT-Y 117
Query: 465 YPGCDTVACSSAS-IDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITR 523
GC+ V C + + I AV+ +K AD +L++GLD+T E E LDR DL+LPG Q ELI +
Sbjct: 118 EMGCNNVTCDNKTFIMPAVEASKNADATILLVGLDKTVEGEGLDRNDLLLPGYQTELILQ 177
Query: 524 VAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGR 583
V A+K P+ILV++ G VDI+F+K D + +ILWAGYPGE G A+A+V++G +NPGGR
Sbjct: 178 VIVASKGPIILVIMSGSAVDISFSKTDDRVKAILWAGYPGEEGGRAIADVVYGKYNPGGR 237
Query: 584 LPMTWYPQDYIK-VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKA 642
LP+TW+ DY+ +PMT M +RP + PGRTY+F+ G V+PFG GLSY+K++Y ++
Sbjct: 238 LPLTWHQNDYLSMLPMTSMSLRP--VNNYPGRTYKFFNGSVVYPFGHGLSYTKFNYTLRS 295
Query: 643 VSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVL 702
+ + C+ F + I VKN G G VL
Sbjct: 296 SNMS-------------------------------CKDH-FELDIEVKNIGAKHGNEVVL 323
Query: 703 LFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHF 762
++ KP G KQ++GF+ V + A + FE + C+SL + ++ G H
Sbjct: 324 VYSKPPTGIVGTHAKQVIGFKRVFVPAGGSQNVKFEFNVCKSLGIVGYNAYKLLPSGEHK 383
Query: 763 LVVGDE--EYPISI 774
+++GD PI I
Sbjct: 384 IIIGDSPTSLPIDI 397
>gi|380512525|ref|ZP_09855932.1| beta-glucosidase [Xanthomonas sacchari NCPPB 4393]
Length = 885
Score = 301 bits (770), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 168/407 (41%), Positives = 234/407 (57%), Gaps = 31/407 (7%)
Query: 59 LVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQ 118
LV+++T EKI+Q +N+APAIPRLG+PAYEWWSE LHG+A G+ AT FPQ
Sbjct: 40 LVAKMTRAEKIAQAMNAAPAIPRLGVPAYEWWSEGLHGIARNGE----------ATVFPQ 89
Query: 119 VILTAASFDSYLWYRIGQAIGLEARALYN-AG-------QAIGMTFWAPNINIFRDPRWG 170
I AA+++ L + +G EARA +N AG + G+T W+PNINIFRDPRWG
Sbjct: 90 AIGLAATWNPELLHDVGTVTSTEARAKFNLAGGPGKDHPRYAGLTIWSPNINIFRDPRWG 149
Query: 171 RGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTR 230
RG ET GEDP +TG+ AV ++ G+QGD + A KH + + R
Sbjct: 150 RGMETYGEDPYLTGRLAVGFIHGLQGDD------PAHPRTIATPKHLAVH---SGPEPGR 200
Query: 231 YKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWG 290
+ FD V+ D TY P F + + G+A +MCAYN ++G P+CA L+ R WG
Sbjct: 201 HGFDVDVSPHDFEATYSPAFRAAIVDGQAGSVMCAYNSLHGTPACAADWLIDGRVRGDWG 260
Query: 291 FHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPE 350
F G++ SDCDA+ + Y + LKAG D+NCG+ + A + + E
Sbjct: 261 FKGFVVSDCDAIDDMTQFHYYRPDNAGSSAAALKAGHDLNCGTAY-RELGIAFDRGEADE 319
Query: 351 SEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNS 410
+ +DR+L LF+ R RLG P+ ++GA + S AH+ LALQAAQ +VLLKN+
Sbjct: 320 ALLDRSLVRLFAARYRLGELQPR-RNDPYARLGARDIDSAAHRALALQAAQQSLVLLKNA 378
Query: 411 HGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQN 457
+ LPL + LA++GPNA++ L NY G S + +TPLQ L+
Sbjct: 379 NATLPL--RPGLRLAVLGPNADALAALEANYQGTSVQPVTPLQGLRT 423
Score = 129 bits (325), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 91/281 (32%), Positives = 135/281 (48%), Gaps = 52/281 (18%)
Query: 496 GLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
GL E EEL DR DL LP QQ L+ R A+A+ KP+++VL+ G V +
Sbjct: 622 GLSPDVEGEELRIDVPGFDGGDRNDLALPAAQQALLER-AKASGKPLVVVLMSGSAVALN 680
Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRP 605
+A +++ +I+ A YPG++G A+A+ + GD NPGGRLP+T+Y P M+
Sbjct: 681 WA--EQHADAIIAAWYPGQSGGTAIAQALAGDINPGGRLPVTFYRSTKDLPPYVSYDMK- 737
Query: 606 QATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVV 665
GRTYR+++G+ +FPFG GLSY++++Y +S L Q
Sbjct: 738 -------GRTYRYFKGEPLFPFGYGLSYTQFAYDAPQLSTTTLQAGQ------------- 777
Query: 666 HYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSV 725
V+ V+N G AG V ++++ +R P++ LVGFQ V
Sbjct: 778 ----------------PLQVSTTVRNTGARAGDEVVQVYLQYPQRAQS-PLRSLVGFQRV 820
Query: 726 ILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
L E + F L LS G +E G + L VG
Sbjct: 821 HLQPGEARTLSFALD-ARQLSDVDRSGQRAVEAGDYRLFVG 860
>gi|71731103|gb|EAO33170.1| Beta-glucosidase [Xylella fastidiosa subsp. sandyi Ann-1]
Length = 882
Score = 301 bits (770), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 179/439 (40%), Positives = 247/439 (56%), Gaps = 39/439 (8%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
Q A LV+++T EKI+Q +N+APAIPRLGIPAY+WWSE LHG+A G A
Sbjct: 32 QHAAALVAQMTRQEKIAQTMNAAPAIPRLGIPAYDWWSEGLHGIARNGY----------A 81
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGMTFWAPNINIFR 165
T FPQ I AAS+++ L +G EARA +N + G+T W+PNINIFR
Sbjct: 82 TVFPQAIGLAASWNTDLLQHVGTVTSTEARAKFNLTGGPGKDHPRYAGLTLWSPNINIFR 141
Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
DPRWGRG ET GEDP +TG+ AVS++RG+QGDT + + A KHF + +
Sbjct: 142 DPRWGRGMETYGEDPYLTGQLAVSFIRGLQGDTPD------HPRTIATPKHFAVH---SG 192
Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
R+ FD V+ DL TY P F + + G A +MCAYN ++G P+CA LL+
Sbjct: 193 PEQGRHSFDVDVSAYDLEATYTPAFRAAIVDGHAGSVMCAYNALHGTPACASDWLLNTRL 252
Query: 286 RRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQ 345
R WGF+G++ SDCDA+ + + + A LK+G D+NCG+ + + A+ +
Sbjct: 253 RNDWGFNGFVVSDCDAIEDMTRFHFFRQDNASASAAALKSGDDLNCGNTYRDLNQ-AIAR 311
Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
+ ES +D+AL LF+ R RLG P+ IG + +PAH+ LALQAA +V
Sbjct: 312 GDIDESTLDQALIRLFTARQRLGTLQPR-EHDPYAAIGIKHIDTPAHRALALQAAAQSLV 370
Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQ--------N 457
LLKNS LPLP +LA++GP+A+S L NY G S +TPL L+ +
Sbjct: 371 LLKNSGNTLPLP--PETTLAVLGPDADSLTALEANYQGTSSTPVTPLTGLRTRFGTAKVH 428
Query: 458 YVENTVYYPGCDTVACSSA 476
Y + PG + +A
Sbjct: 429 YAQGASLAPGVPSTIPETA 447
Score = 130 bits (326), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 98/312 (31%), Positives = 150/312 (48%), Gaps = 64/312 (20%)
Query: 471 VACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQEL 520
+A ++ + +A AD +V +GL E EEL DR + LP Q+ L
Sbjct: 594 LAPAAPQLAEAERAVAHADAIVAFVGLSPEVEGEELHIDTPGFSGGDRTTIDLPATQETL 653
Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
+ V + KP+I+VL+ G V + +A++ + +IL A YPG++G A+A+ + GD NP
Sbjct: 654 LQHV-KTTGKPLIVVLMSGSAVALNWAQHHAD--AILAAWYPGQSGGTAIAQALAGDVNP 710
Query: 581 GGRLPMTWY--PQD---YIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSK 635
GGRLP+T+Y QD YI MT GRTYR+++G+ ++PFG GLSY++
Sbjct: 711 GGRLPVTFYRSTQDLPPYISYDMT-------------GRTYRYFKGQPLYPFGYGLSYTQ 757
Query: 636 YSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFL-VTIGVKNHGE 694
++Y+ P+L T + L VT V+N G
Sbjct: 758 FAYE------------------------------APQLSTATLKAGNTLTVTAHVRNTGT 787
Query: 695 MAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLM 754
AG V L+++P P++ LVGF+ V L E + F L LS ++ G
Sbjct: 788 RAGDEVVQLYLEPPYSPQA-PLRSLVGFKRVTLRPGESRLLTFTLD-ARQLSGVQQTGQR 845
Query: 755 VIEEGTHFLVVG 766
+E G + L VG
Sbjct: 846 SVEAGHYHLFVG 857
>gi|313205375|ref|YP_004044032.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
gi|312444691|gb|ADQ81047.1| glycoside hydrolase family 3 domain protein [Paludibacter
propionicigenes WB4]
Length = 858
Score = 300 bits (768), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 183/463 (39%), Positives = 250/463 (53%), Gaps = 42/463 (9%)
Query: 41 ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
+ P+ L RA DL++RLTL EK + + N++PAIPRLGI AYEWW+EALHGV
Sbjct: 20 QQLPYQNPKLSAEVRATDLLARLTLAEKAALMQNNSPAIPRLGIKAYEWWNEALHGVGRS 79
Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIG 153
G AT FPQ I AASF++ L + A+ EARA N + G
Sbjct: 80 GV----------ATVFPQAIGMAASFNNGLLFDAFTAVSDEARAKSNKFSEQGGLKRYQG 129
Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASA 212
+T+W PN+NIFRDPRWGRGQET GEDP +T V+ V+G+QG D KL A
Sbjct: 130 LTYWTPNVNIFRDPRWGRGQETYGEDPYLTSLMGVAVVKGLQGPDNAEYDKLH------A 183
Query: 213 CCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
C KHF + W R+ F+A + +DL +TY P F++ V++ +MCAYNR
Sbjct: 184 CAKHFAVHSGPEW---NRHSFNAENINPRDLWETYLPAFKALVQKADVKEVMCAYNRFED 240
Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE--DAVVDVLKAGMDV 329
P C LL++ R W F G + SDC A+S Y +A P+ A + + G D+
Sbjct: 241 EPCCGSNRLLTQILRNDWKFDGLVVSDCWAISDFYKPNAHATQPDATHAAANAVLNGTDL 300
Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
CGS ++ AVK + E ID +L L R LG N + I VV S
Sbjct: 301 ECGSDF-RNLPEAVKAGLIEEKRIDVSLKRLLKARFELGEMNSDQVW----PISYSVVNS 355
Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
HQ LAL+ A++ IVLL+N++ +LPL SK + +A++GPNAN + GNY G ++
Sbjct: 356 EKHQNLALRMAEESIVLLQNNNNILPL--SKKLKIAVMGPNANDSVMQWGNYNGFPAHTV 413
Query: 450 TPLQALQNYV--ENTVYYPGCD---TVACSSASIDKAVDIAKG 487
T L+A++ +Y PGCD VA SS + ++D KG
Sbjct: 414 TLLEAMRKSFPGAQLIYEPGCDRTMDVAVSSLFQECSIDGKKG 456
Score = 102 bits (255), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 81/277 (29%), Positives = 135/277 (48%), Gaps = 57/277 (20%)
Query: 475 SASIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRV 524
SASI K D AD VV G+ + E EE+ DR D+ LP Q+ L+ +
Sbjct: 586 SASIAKVKD----ADVVVFAGGIAPSLEGEEMRVTVPGFKGGDRTDIELPAIQRRLLQAL 641
Query: 525 AEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRL 584
+A KK V+ V G + + ++ +IL A YPG+AG A+A V+ G++NP GRL
Sbjct: 642 KDAGKK-VVFVNFSGSAMGLV--PETQSCEAILQAWYPGQAGGTAVANVLLGNYNPSGRL 698
Query: 585 PMTWYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAV 643
P+T+Y ++ ++P D M+ GRTYR+ K +F FG GLSY+K+ +
Sbjct: 699 PVTFY-KNVAQLPDFEDYSMK--------GRTYRYMTEKPLFSFGYGLSYTKFVLGTAKL 749
Query: 644 SQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLL 703
+++ + N++ +T+ V N G++AG + +
Sbjct: 750 NKSSIKANET-----------------------------LKITVPVTNAGKVAGTEVLQV 780
Query: 704 FVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELS 740
+V+ + +G P K L GF+ V + + ++I +L+
Sbjct: 781 YVRKVKDVDG-PAKTLRGFKKVNIEPGKTSQISIDLT 816
>gi|160886913|ref|ZP_02067916.1| hypothetical protein BACOVA_04927 [Bacteroides ovatus ATCC 8483]
gi|423288977|ref|ZP_17267828.1| hypothetical protein HMPREF1069_02871 [Bacteroides ovatus
CL02T12C04]
gi|423294866|ref|ZP_17272993.1| hypothetical protein HMPREF1070_01658 [Bacteroides ovatus
CL03T12C18]
gi|156107324|gb|EDO09069.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus ATCC 8483]
gi|392668741|gb|EIY62235.1| hypothetical protein HMPREF1069_02871 [Bacteroides ovatus
CL02T12C04]
gi|392676057|gb|EIY69498.1| hypothetical protein HMPREF1070_01658 [Bacteroides ovatus
CL03T12C18]
Length = 863
Score = 300 bits (768), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 170/439 (38%), Positives = 246/439 (56%), Gaps = 38/439 (8%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
+P+ T L QRA DL+ RLTL+EK++ + N++PAIPRLGI YEWW+EALHGVA G
Sbjct: 26 YPYQDTKLTAEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARAGL 85
Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA----LYNAGQAI---GMT 155
AT FPQ I AASF+ L Y + A+ EARA GQ G+T
Sbjct: 86 ----------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNEKGQYKRYQGLT 135
Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASACC 214
W PN+NIFRDPRWGRGQET GEDP ++G+ ++ VRG+QG + KL AC
Sbjct: 136 MWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYDKLH------ACA 189
Query: 215 KHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
KHF + W R+ F+A + +DL +TY P F+ V++ +MCAYNR G P
Sbjct: 190 KHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYNRFEGDP 246
Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE--DAVVDVLKAGMDVNC 331
C LL++ R WGF G + +DC A+ + + + P+ A D + +G D+ C
Sbjct: 247 CCGSNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKKHETHPDAAHASADAVLSGTDLEC 306
Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
G + T AVK+ + E +I+ ++ L R LG N + P+ I V+ P
Sbjct: 307 GGNFKSITD-AVKKGLISEEKINTSVKRLLKARFELGEMN---STHPWSNIPFSVIDCPK 362
Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITP 451
H+ LAL+ A + +VLL+N++ +LPL ++ + +A+IGPNAN + GNY G ++T
Sbjct: 363 HKELALKMAHESLVLLQNNNNILPL--NRQMKVAVIGPNANDSVMQWGNYNGFPSHTVTL 420
Query: 452 LQALQNYVENT--VYYPGC 468
L+ ++ + + +Y P C
Sbjct: 421 LEGIRAKLPDAQIIYEPVC 439
Score = 103 bits (257), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 82/264 (31%), Positives = 120/264 (45%), Gaps = 51/264 (19%)
Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
+ AD V+ G+ E E + DR ++ LP Q+E++ + + KK + V
Sbjct: 598 QSADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGKK-TVFV 656
Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK 595
G + I +N +IL A YPG+AG A+A+V+FGD+NP GRLP+T+Y
Sbjct: 657 NFSGSAMAIV--PETQNCDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITFYKSMQQL 714
Query: 596 VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSS 655
D M+ GRTYRF ++PFG GLSY+++SY K LNQS
Sbjct: 715 PDYEDYSMK--------GRTYRFMTETPLYPFGYGLSYTRFSY-------GKATLNQSKL 759
Query: 656 TKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRP 715
TK K ++TI V N G+ G+ V +++ G P
Sbjct: 760 TK----------------------GEKAILTIPVSNVGQRDGEEVVQVYICRPDDKEG-P 796
Query: 716 IKQLVGFQSVILNAKEKAEIVFEL 739
K L GFQ V + + + EL
Sbjct: 797 QKTLRGFQRVSIAKGKTQNVQIEL 820
>gi|224535242|ref|ZP_03675781.1| hypothetical protein BACCELL_00103 [Bacteroides cellulosilyticus
DSM 14838]
gi|224523140|gb|EEF92245.1| hypothetical protein BACCELL_00103 [Bacteroides cellulosilyticus
DSM 14838]
Length = 864
Score = 300 bits (768), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 177/447 (39%), Positives = 255/447 (57%), Gaps = 33/447 (7%)
Query: 37 NPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHG 96
N + + P+ L S+RA DL+ R+TL+EK+SQ+ N +PAI RLGIPAY+WW+EALHG
Sbjct: 16 NVTAQNEPYKNPELSPSERAWDLLKRMTLEEKVSQMKNGSPAIERLGIPAYDWWNEALHG 75
Query: 97 VAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA----- 151
VA GK AT FPQ I AA+FD+ Y + EARA Y+ Q
Sbjct: 76 VARAGK----------ATVFPQAIGLAATFDNQAVYETFDIVSDEARAKYHDFQRKGERD 125
Query: 152 --IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQ 209
G+TFW PNINI+RDPRWGRG ET GEDP +T ++ V+G+QG GG K +
Sbjct: 126 GYKGLTFWTPNINIYRDPRWGRGMETYGEDPYLTSLMGLAVVKGLQG----GGTGKYD-K 180
Query: 210 ASACCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
A AC KH+ + W R+ FDA+ ++ +DL +TY F++ VK+G+ +MCAYNR
Sbjct: 181 AHACAKHYAVHSGPEW---NRHSFDAKNISQRDLWETYLSAFKTLVKEGKVKEVMCAYNR 237
Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAG 326
G P C+++ LL + R WG+ + SDC A+ Y + P A D + +G
Sbjct: 238 FEGEPCCSNKQLLIRILREDWGYDDIVVSDCGAIGDFYYPNHHETHPTAAAASADAVVSG 297
Query: 327 MDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADV 386
D+ CG + AV++ + E +I+ ++ L R +LG+F+ + + + +I V
Sbjct: 298 TDLECGGSYSSLNE-AVRKGLISEEKINESVFRLLRARFQLGMFD-DDALVSWSEIPYSV 355
Query: 387 VCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSC 446
V S H AL+ A+ +VLL N + LPL KS +A++GPNAN + L NY G
Sbjct: 356 VESKEHVTKALEMARKSMVLLTNKNHTLPLSKSIR-KVAVLGPNANDSVMLWANYNGFPT 414
Query: 447 RSITPLQALQNYV-ENTVYYP-GCDTV 471
+S+T L+ +++ + E TVYY GCD V
Sbjct: 415 KSVTILEGIKSKLPEGTVYYEKGCDYV 441
Score = 113 bits (282), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 81/270 (30%), Positives = 125/270 (46%), Gaps = 51/270 (18%)
Query: 480 KAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAK 529
+ D A AD ++ + GL T E EE+ DR ++ LP Q E++ + + K
Sbjct: 592 EVADKAAEADAIIFVGGLSPTLEGEEMPVDLPGFRKGDRTNIDLPHVQAEMLKALKKTGK 651
Query: 530 KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWY 589
PVI VL G + + + N+ +IL A YPG+ G A+A+V+FGD+NP GRLP+T+Y
Sbjct: 652 -PVIFVLCSGSTLALPWEA--ENLDAILEAWYPGQQGGTAVADVLFGDYNPAGRLPLTFY 708
Query: 590 PQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLY 649
D M RTYR+++GK +FPFG GLSY+ + Y V + +
Sbjct: 709 ASSNDLPDFEDYDM--------SNRTYRYFKGKALFPFGHGLSYTIFDYGKAKVDKQNVR 760
Query: 650 LNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPAR 709
+ + +TI +KN G++ G + ++++
Sbjct: 761 AGEGMT-----------------------------LTIPLKNTGKLDGDEVIQVYLRNPA 791
Query: 710 RGNGRPIKQLVGFQSVILNAKEKAEIVFEL 739
G PIK L F+ V L A + I EL
Sbjct: 792 DKEG-PIKTLRAFRRVSLPAGQTENIRIEL 820
>gi|433679952|ref|ZP_20511614.1| beta-glucosidase [Xanthomonas translucens pv. translucens DSM
18974]
gi|430814928|emb|CCP42243.1| beta-glucosidase [Xanthomonas translucens pv. translucens DSM
18974]
Length = 909
Score = 299 bits (766), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 174/427 (40%), Positives = 238/427 (55%), Gaps = 33/427 (7%)
Query: 62 RLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVIL 121
++T +EK++Q +N+APAIPRLG+PAYEWW+E LHG+A G AT FPQ I
Sbjct: 67 KMTREEKVAQAMNAAPAIPRLGVPAYEWWNEGLHGIARNGY----------ATVFPQAIG 116
Query: 122 TAASFDSYLWYRIGQAIGLEARALYN-AG-------QAIGMTFWAPNINIFRDPRWGRGQ 173
AA++++ L ++G EARA +N AG + G+T W+PNINIFRDPRWGRG
Sbjct: 117 LAATWNTALLEQVGTVTSTEARAKFNLAGGPGKDHPRYAGLTIWSPNINIFRDPRWGRGM 176
Query: 174 ETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKF 233
ET GEDP +TG+ AV ++RG+QGD + A KH + + R+ F
Sbjct: 177 ETYGEDPYLTGQLAVGFIRGLQGDDLT------HPRTIATPKHLAVH---SGPEPGRHGF 227
Query: 234 DARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHG 293
D V+ DL TY P F + + GRA +MCAYN ++G P+CA LL+ R WGF G
Sbjct: 228 DVDVSPHDLEATYTPAFRAAIVDGRAGAVMCAYNSLHGTPACAADWLLNGRLRGDWGFTG 287
Query: 294 YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEI 353
++ SDCDAV + + + LKAG D+NCG + + A+ + E+ +
Sbjct: 288 FVVSDCDAVDDMTQFHYFRADNAGSSAAALKAGHDLNCG-YAYRDLGKAIARGDADEAVL 346
Query: 354 DRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGL 413
D++L LF+ R RLG P+ ++GA V S AH+ LALQAAQ IVLL+N +
Sbjct: 347 DQSLVRLFAARYRLGELQPQ-RKDPYARLGAKDVDSAAHRALALQAAQQSIVLLQNRNAT 405
Query: 414 LPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNY--VENTVYYPGCDTV 471
LPL + LA+IGPNA++ L NY G S +TPL L+ N Y G
Sbjct: 406 LPL--RPGLRLAVIGPNADALAALEANYQGTSAAPVTPLLGLRERFGAANLRYAQGAPLA 463
Query: 472 ACSSASI 478
A S I
Sbjct: 464 AGVSGMI 470
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 97/283 (34%), Positives = 144/283 (50%), Gaps = 56/283 (19%)
Query: 496 GLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
GL E EEL DR DL LP QQ L+ R A+A+ KP+++VL+ G V +
Sbjct: 646 GLSPDVEGEELRIDVPGFDGGDRNDLALPAAQQALLER-AKASGKPLVVVLMSGSAVALN 704
Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWY--PQDYIKVPMTDMKM 603
+AK ++ +I+ A YPG++G A+A+V+ GD NPGGRLP+T+Y +D DMK
Sbjct: 705 WAK--QHADAIVAAWYPGQSGGTAIAQVLAGDVNPGGRLPVTFYRSTKDLPAYVSYDMK- 761
Query: 604 RPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQD 663
GRTYR+++G+ +F FG GLSY++++Y +S L Q+ + V Q
Sbjct: 762 ---------GRTYRYFKGEPLFAFGSGLSYTRFTYAAPQLSATTL---QAGANLQVRTQ- 808
Query: 664 VVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQ 723
V+N G AG V ++++P +G P++ LVGFQ
Sbjct: 809 -------------------------VRNSGTRAGDEVVQVYLQPP-QGAQSPLRTLVGFQ 842
Query: 724 SVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
V L E E+ FEL+P LS G ++ G + + VG
Sbjct: 843 RVTLQPGEAREVGFELTP-RQLSDVDRAGQRAVQPGDYRVFVG 884
>gi|397691073|ref|YP_006528327.1| beta-glucosidase [Melioribacter roseus P3M]
gi|395812565|gb|AFN75314.1| beta-glucosidase [Melioribacter roseus P3M]
Length = 923
Score = 299 bits (765), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 173/426 (40%), Positives = 247/426 (57%), Gaps = 30/426 (7%)
Query: 48 TTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFN 107
TT +R DL+S +T +EKI QL N A +IPRLG+ AY +W+E+LHGV
Sbjct: 109 TTGSYKERLNDLISLMTTEEKIKQLTNQADSIPRLGLRAYNYWNESLHGVLA-------- 160
Query: 108 GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDP 167
GATSFPQ I A++D L R+ A+ EARAL N G+T+W+P INI RDP
Sbjct: 161 ---EGATSFPQAIALGATWDPRLVNRVATAVSDEARAL-NRLYGKGLTYWSPTINIARDP 216
Query: 168 RWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKG 227
RWGR +E+ EDP + + V++++G+QGD L+ A KHF A N +
Sbjct: 217 RWGRNEESYSEDPYLLSRMGVAFIKGMQGDH------PYYLKTVATPKHFIA----NNEE 266
Query: 228 TTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARR 287
R+ + V M++L + Y P F+S + + RA IM AYN +N +PS A+ L++ RR
Sbjct: 267 ERRHTGSSDVDMRNLYEYYLPAFKSAIVEARAYSIMGAYNELNHVPSNANMFLMTDLLRR 326
Query: 288 QWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKK 347
QWGF GY+ SDC A+ + + K+ +AV + AG D+NCG ++ K A+ +
Sbjct: 327 QWGFEGYVVSDCGAIHDMLYGHKFFKTGAEAVARSILAGCDLNCGQAYREFIKDALDEGL 386
Query: 348 LPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLL 407
L E +ID AL + S R RLG F+ P + P+ IG D + S ++ LAL AA+ IVLL
Sbjct: 387 LREKDIDSALFRVLSARFRLGEFDP-PELVPYSSIGKDKLDSKENRRLALDAARKSIVLL 445
Query: 408 KNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN----TV 463
KN+ +LP+ KSK S+A+IGPNA A+ LG Y+G I+PL+ ++N ++
Sbjct: 446 KNND-ILPIDKSKIKSIAVIGPNAREAQ--LGIYSGFPNVLISPLEGIKNKADSLDIRVG 502
Query: 464 YYPGCD 469
Y GCD
Sbjct: 503 YVKGCD 508
Score = 115 bits (289), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 91/317 (28%), Positives = 143/317 (45%), Gaps = 38/317 (11%)
Query: 458 YVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQ 517
Y +N++ T +KA IA D V+L++G+ +EELDR ++ LP Q
Sbjct: 638 YFDNSLGATARLTWDLGQKDFEKAKKIAAENDLVILVLGITPGISQEELDRKEIELPSVQ 697
Query: 518 QELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGD 577
+EL+ + AE +++VL+ GGPV A ++ +I+ Y GE G ALA+V+FGD
Sbjct: 698 RELVKQTAEVNPN-IVIVLVNGGPV--ALAGAEKYAKAIVENWYNGEFGGQALADVLFGD 754
Query: 578 HNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYS 637
+NPGG+LP T+Y PM+D + NP RTY + + +FPFG GLSY+ +
Sbjct: 755 YNPGGKLPQTFYASTEQLPPMSDYDI-----INNP-RTYMYLNEQALFPFGHGLSYTTFK 808
Query: 638 YKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAG 697
Y S K+V N ET + + N G G
Sbjct: 809 Y---------------DSLKIVSNT--------------LNETDTLSLQFRLTNVGNRNG 839
Query: 698 KHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIE 757
V ++ P KQL F+ + L E + F++ E + + V+E
Sbjct: 840 DEVVQIYASCKDAKFKVPRKQLKRFRRLTLQTGESKVLEFKIPVDELAFYSTYENDFVVE 899
Query: 758 EGTHFLVVGDEEYPISI 774
+G +++G I +
Sbjct: 900 KGAWEILIGSSSEDIRL 916
>gi|28199699|ref|NP_780013.1| family 3 glycoside hydrolase [Xylella fastidiosa Temecula1]
gi|182682443|ref|YP_001830603.1| beta-glucosidase [Xylella fastidiosa M23]
gi|417557804|ref|ZP_12208815.1| Beta-glucosidase [Xylella fastidiosa EB92.1]
gi|28057820|gb|AAO29662.1| family 3 glycoside hydrolase [Xylella fastidiosa Temecula1]
gi|182632553|gb|ACB93329.1| Beta-glucosidase [Xylella fastidiosa M23]
gi|338179587|gb|EGO82522.1| Beta-glucosidase [Xylella fastidiosa EB92.1]
Length = 882
Score = 298 bits (764), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 174/411 (42%), Positives = 237/411 (57%), Gaps = 31/411 (7%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
Q A LV+++T EKI+Q +N+APAIPRLGIPAY+WWSE LHG+A G A
Sbjct: 32 QHAAALVAQMTRQEKIAQTMNAAPAIPRLGIPAYDWWSEGLHGIARNGY----------A 81
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGMTFWAPNINIFR 165
T FPQ I AAS+++ L +G EARA +N + G+T W+PNINIFR
Sbjct: 82 TVFPQAIGLAASWNTDLLQHVGTVTSTEARAKFNLTGGPGKDHPRYAGLTLWSPNINIFR 141
Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
DPRWGRG ET GEDP +T + AVS++RG+QGDT + + A KHF + +
Sbjct: 142 DPRWGRGMETYGEDPYLTSQLAVSFIRGLQGDTPD------HPRTIATPKHFAVH---SG 192
Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
R+ FD V+ DL TY P F + + G A +MCAYN ++G P+CA LL+
Sbjct: 193 PEQGRHSFDVDVSAYDLEATYTPAFRAAIVDGHAGSVMCAYNALHGTPACASDWLLNTRL 252
Query: 286 RRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQ 345
R WGF+G++ SDCDA+ + + + A LK+G D+NCG+ + A+ +
Sbjct: 253 RNDWGFNGFVVSDCDAIEDMTRFHFFRQDNASASAAALKSGNDLNCGN-TYRDLNQAIAR 311
Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
+ ES +D+AL LF+ R RLG P+ IG + +PAH+ LALQAA +V
Sbjct: 312 GDIDESTLDQALIRLFTARQRLGTLQPR-EHDPYAAIGIKHIDTPAHRALALQAAAQSLV 370
Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQ 456
LLKNS LPLP +LA++GP+A+S L NY G S +TPL L+
Sbjct: 371 LLKNSGNTLPLP--PETTLAVLGPDADSLTALEANYQGTSSTPVTPLTGLR 419
Score = 130 bits (327), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 98/312 (31%), Positives = 149/312 (47%), Gaps = 64/312 (20%)
Query: 471 VACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQEL 520
+A + + +A AD +V +GL E EEL DR + LP Q+ L
Sbjct: 594 LAPETPQLAEAERAVAHADAIVAFVGLSPEVEGEELHIDTPGFSGGDRTTIDLPATQETL 653
Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
+ V + KP+I+VL+ G V + +A++ + +IL A YPG++G A+A+ + GD NP
Sbjct: 654 LQHV-KTTGKPLIVVLMSGSAVALNWAQHHAD--AILAAWYPGQSGGTAIAQALAGDVNP 710
Query: 581 GGRLPMTWY--PQD---YIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSK 635
GGRLP+T+Y QD YI MT GRTYR+++G+ ++PFG GLSY++
Sbjct: 711 GGRLPVTFYRSTQDLPPYISYDMT-------------GRTYRYFKGQPLYPFGYGLSYTQ 757
Query: 636 YSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFL-VTIGVKNHGE 694
++Y+ P+L T + L VT V+N G
Sbjct: 758 FAYE------------------------------APQLSTATLKAGNTLTVTTHVRNTGT 787
Query: 695 MAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLM 754
AG V L+++P P++ LVGF+ V L E + F L LS ++ G
Sbjct: 788 RAGDEVVQLYLEPPYSPQA-PLRSLVGFKRVTLRPGESRLLTFTLD-ARQLSSVQQTGQR 845
Query: 755 VIEEGTHFLVVG 766
+E G + L VG
Sbjct: 846 SVEAGHYHLFVG 857
>gi|298482082|ref|ZP_07000270.1| beta-glucosidase [Bacteroides sp. D22]
gi|298271639|gb|EFI13212.1| beta-glucosidase [Bacteroides sp. D22]
Length = 863
Score = 298 bits (764), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 172/439 (39%), Positives = 245/439 (55%), Gaps = 38/439 (8%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
+P+ T L QRA DL+ RLTL+EK++ + N++PAIPRLGI YEWW+EALHGVA G
Sbjct: 26 YPYQDTKLTAEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARAGL 85
Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA----LYNAGQAI---GMT 155
AT FPQ I AASF+ L Y + A+ EARA GQ G+T
Sbjct: 86 ----------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNERGQYKRYQGLT 135
Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASACC 214
W PN+NIFRDPRWGRGQET GEDP ++G+ ++ VRG+QG + KL AC
Sbjct: 136 MWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAVVRGLQGPEDAEYDKLH------ACA 189
Query: 215 KHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
KHF + W R+ F+A + +DL +TY P F+ V++ +MCAYNR G P
Sbjct: 190 KHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYNRFEGDP 246
Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE--DAVVDVLKAGMDVNC 331
C LL++ R WGF G + +DC A+ + + + P+ A D + G D+ C
Sbjct: 247 CCGSNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKKHETHPDAVHASADAVLNGTDLEC 306
Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
G + T AVK+ + E +I+ ++ L R LG NPT P+ I V+ P
Sbjct: 307 GGNFKSITD-AVKKGLISEEKINTSVKRLLKARFELGEM--NPT-HPWSNIPYSVIDCPK 362
Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITP 451
H+ LAL+ A + +VLL+N + +LPL ++ + +A+IGPNAN + GNY G ++T
Sbjct: 363 HKELALKMAHESLVLLQNKNNILPL--NRQMKVAVIGPNANDSVMQWGNYNGFPSHTVTL 420
Query: 452 LQALQNYVENT--VYYPGC 468
L+ ++ + + +Y P C
Sbjct: 421 LEGIRAKLPDAQIIYEPVC 439
Score = 100 bits (250), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 81/264 (30%), Positives = 119/264 (45%), Gaps = 51/264 (19%)
Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
K AD V+ G+ E E + DR ++ LP Q+E++ + + KK + V
Sbjct: 598 KNADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGKK-TVFV 656
Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK 595
G + I ++ +IL A YPG+AG A+A+V+FGD+NP GRLP+T+Y
Sbjct: 657 NFSGSAMAIV--PETQSCDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITFYKSIQQL 714
Query: 596 VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSS 655
D M+ GRTYRF ++PFG GLSY+++SY K LNQS
Sbjct: 715 PDYEDYSMK--------GRTYRFMTETPLYPFGYGLSYTRFSY-------GKATLNQSKL 759
Query: 656 TKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRP 715
K K ++TI V N G+ G+ V +++ G P
Sbjct: 760 NK----------------------GEKAILTIPVSNVGQRDGEEVVQVYICRPDDKEG-P 796
Query: 716 IKQLVGFQSVILNAKEKAEIVFEL 739
K L GFQ V + + + EL
Sbjct: 797 QKTLRGFQRVNIAKGKTQNVSIEL 820
>gi|336415490|ref|ZP_08595829.1| hypothetical protein HMPREF1017_02937 [Bacteroides ovatus
3_8_47FAA]
gi|335940369|gb|EGN02236.1| hypothetical protein HMPREF1017_02937 [Bacteroides ovatus
3_8_47FAA]
Length = 863
Score = 298 bits (764), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 172/439 (39%), Positives = 245/439 (55%), Gaps = 38/439 (8%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
+P+ T L QRA DL+ RLTL+EK++ + N++PAIPRLGI YEWW+EALHGVA G
Sbjct: 26 YPYQDTKLTAEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARAGL 85
Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA----LYNAGQAI---GMT 155
AT FPQ I AASF+ L Y + A+ EARA GQ G+T
Sbjct: 86 ----------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNERGQYKRYQGLT 135
Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASACC 214
W PN+NIFRDPRWGRGQET GEDP ++G+ ++ VRG+QG + KL AC
Sbjct: 136 MWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYDKLH------ACA 189
Query: 215 KHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
KHF + W R+ F+A + +DL +TY P F+ V++ +MCAYNR G P
Sbjct: 190 KHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYNRFEGDP 246
Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE--DAVVDVLKAGMDVNC 331
C LL++ R WGF G + +DC A+ + + + P+ A D + G D+ C
Sbjct: 247 CCGSNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKKHETHPDAAHASADAVLNGTDLEC 306
Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
G + T AVK+ + E +I+ ++ L R LG NPT P+ I V+ P
Sbjct: 307 GGNFKSITD-AVKKGLISEEKINTSVKRLLKARFELGEM--NPT-HPWSNIPYSVINCPK 362
Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITP 451
H+ LAL+ A + +VLL+N + +LPL ++ + +A+IGPNAN + GNY G ++T
Sbjct: 363 HKELALKMAHESLVLLQNKNNILPL--NRQMKVAVIGPNANDSVMQWGNYNGFPSHTVTL 420
Query: 452 LQALQNYVENT--VYYPGC 468
L+ ++ + + +Y P C
Sbjct: 421 LEGIRAKLPDAQIIYEPVC 439
Score = 101 bits (251), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 78/265 (29%), Positives = 123/265 (46%), Gaps = 53/265 (20%)
Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
K AD V+ G+ E E + DR ++ LP Q+E++ + + KK + V
Sbjct: 598 KNADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGKK-TVFV 656
Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK 595
G + I ++ +IL A YPG+AG A+A+V+FG++NP GRLP+T+Y
Sbjct: 657 NFSGSAMAIV--PETQSCDAILQAWYPGQAGGTAVADVLFGNYNPAGRLPITFYKSIQQL 714
Query: 596 VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSS 655
D M+ GRTYRF ++PFG GLSY+++SY ++Q+KL
Sbjct: 715 PDYEDYSMK--------GRTYRFMTETPLYPFGYGLSYTRFSYGKATLNQSKL------- 759
Query: 656 TKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFV-KPARRGNGR 714
+ K ++TI V N G+ G+ V +++ +P +G
Sbjct: 760 ----------------------AKGEKAILTIPVSNVGQRDGEEVVQVYICRPDDKGG-- 795
Query: 715 PIKQLVGFQSVILNAKEKAEIVFEL 739
P K L GFQ V + + + EL
Sbjct: 796 PQKTLRGFQRVNIAKGKTQNVNIEL 820
>gi|299148437|ref|ZP_07041499.1| beta-glucosidase [Bacteroides sp. 3_1_23]
gi|298513198|gb|EFI37085.1| beta-glucosidase [Bacteroides sp. 3_1_23]
Length = 863
Score = 298 bits (763), Expect = 7e-78, Method: Compositional matrix adjust.
Identities = 172/439 (39%), Positives = 245/439 (55%), Gaps = 38/439 (8%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
+P+ T L QRA DL+ RLTL+EK++ + N++PAIPRLGI YEWW+EALHGVA G
Sbjct: 26 YPYQDTKLTAEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARAGL 85
Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA----LYNAGQAI---GMT 155
AT FPQ I AASF+ L Y + A+ EARA GQ G+T
Sbjct: 86 ----------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNERGQYKRYQGLT 135
Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASACC 214
W PN+NIFRDPRWGRGQET GEDP ++G+ ++ VRG+QG + KL AC
Sbjct: 136 MWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYDKLH------ACA 189
Query: 215 KHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
KHF + W R+ F+A + +DL +TY P F+ V++ +MCAYNR G P
Sbjct: 190 KHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYNRFEGDP 246
Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE--DAVVDVLKAGMDVNC 331
C LL++ R WGF G + +DC A+ + + + P+ A D + G D+ C
Sbjct: 247 CCGSNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKKHETHPDAAHASADAVLNGTDLEC 306
Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
G + T AVK+ + E +I+ ++ L R LG NPT P+ I V+ P
Sbjct: 307 GGNFKSITD-AVKKGLISEEKINTSVKRLLKARFELGEM--NPT-HPWSNIPYSVINCPK 362
Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITP 451
H+ LAL+ A + +VLL+N + +LPL ++ + +A+IGPNAN + GNY G ++T
Sbjct: 363 HKELALKMAHESLVLLQNKNNILPL--NRQMKVAVIGPNANDSVMQWGNYNGFPSHTVTL 420
Query: 452 LQALQNYVENT--VYYPGC 468
L+ ++ + + +Y P C
Sbjct: 421 LEGIRAKLPDAQIIYEPVC 439
Score = 103 bits (256), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 79/265 (29%), Positives = 123/265 (46%), Gaps = 53/265 (20%)
Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
K AD V+ G+ E E + DR ++ LP Q+E++ + + KK + V
Sbjct: 598 KNADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGKK-TVFV 656
Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK 595
G + I ++ +IL A YPG+AG A+A+V+FGD+NP GRLP+T+Y
Sbjct: 657 NFSGSAMAIV--PETQSCDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITFYKSIQQL 714
Query: 596 VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSS 655
D M+ GRTYRF ++PFG GLSY+++SY ++Q+KL
Sbjct: 715 PDYEDYSMK--------GRTYRFMTETPLYPFGYGLSYTRFSYGKATLNQSKL------- 759
Query: 656 TKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFV-KPARRGNGR 714
+ K ++TI V N G+ G+ V +++ +P +G
Sbjct: 760 ----------------------AKGEKAILTIPVSNVGQRDGEEVVQVYICRPDDKGG-- 795
Query: 715 PIKQLVGFQSVILNAKEKAEIVFEL 739
P K L GFQ V + + + EL
Sbjct: 796 PQKTLRGFQRVNIAKGKTQNVNIEL 820
>gi|167038437|ref|YP_001666015.1| glycoside hydrolase family 3 [Thermoanaerobacter pseudethanolicus
ATCC 33223]
gi|320116830|ref|YP_004186989.1| glycoside hydrolase family 3 domain-containing protein
[Thermoanaerobacter brockii subsp. finnii Ako-1]
gi|166857271|gb|ABY95679.1| glycoside hydrolase, family 3 domain protein [Thermoanaerobacter
pseudethanolicus ATCC 33223]
gi|319929921|gb|ADV80606.1| glycoside hydrolase family 3 domain protein [Thermoanaerobacter
brockii subsp. finnii Ako-1]
Length = 784
Score = 298 bits (763), Expect = 8e-78, Method: Compositional matrix adjust.
Identities = 234/801 (29%), Positives = 385/801 (48%), Gaps = 134/801 (16%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSA------------------------PAIP 80
+ +T + +R DL+ ++T++EK++QL NS I
Sbjct: 5 YLDSTQSVEKRVEDLLQQMTIEEKVAQL-NSIWVYEILDDMKFSFDKAKRLMSYGIGQIT 63
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTI----------------RGATSFPQVILTAA 124
RLG + E + + K + N + +GAT FPQ I A+
Sbjct: 64 RLGGASNLSPRETVRIANQIQKFLIENTRLGIPALIHEESCSGYMAKGATIFPQTIGVAS 123
Query: 125 SFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTG 184
++++ + ++ I + +A+ A QA+ AP ++I RDPRWGR +ET GEDP +
Sbjct: 124 TWNNEIVEKMASVIREQMKAV-GARQAL-----APLLDITRDPRWGRTEETFGEDPYLVM 177
Query: 185 KYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLAD 244
+ VSY+RG+Q ++ G + A KHF Y N +G + A + ++L +
Sbjct: 178 RMGVSYIRGLQTESLKEGIV-------ATGKHFVGYG--NSEGGMNWA-PAHIPERELRE 227
Query: 245 TYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSI 304
+ PFE+ VK+ + S IM Y+ ++G+P + LL+ R+ WGF G + SD A+S
Sbjct: 228 VFLYPFEAAVKEAKLSSIMPGYHELDGVPCHKSKKLLNDILRKDWGFEGIVVSDYFAISQ 287
Query: 305 IYDAEGYAKSPEDAVVDVLKAGMDVNCGS--FLQKHTKAAVKQKKLPESEIDRALHNLFS 362
+Y+ + A L+AG+DV S + + ++ ++ ++ A+ +
Sbjct: 288 LYEYHHVTSDKKGAAKLALEAGVDVELPSTDYYGLPLRELIESGEIDIDFVNEAVKRVLK 347
Query: 363 VRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSV 422
++ LGLF NP + + ++ + + LA + AQ+ IVLLKN + LLPL K
Sbjct: 348 IKFELGLFE-NPYINE--EKAVEIFDTNEQRELAYKIAQESIVLLKNENNLLPLKKDLK- 403
Query: 423 SLALIGPNANSAKTLLGNYAGP-------SCRS----------------------ITPLQ 453
S+A+IGPNA+S + ++G+YA P R +T LQ
Sbjct: 404 SIAVIGPNADSIRNMIGDYAYPCHIESLLEMRETDNVFNTPLPESLEAKDIYVPIVTVLQ 463
Query: 454 ALQNYVENT---VYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG-----LDQTQEKEE 505
++ V + +Y GCD + S +AV+IAK AD V+++G D E
Sbjct: 464 GIKAKVSSNTEVLYAKGCDVLNNSKDGFKEAVEIAKQADVAVVVVGDKSGLTDGCTSGES 523
Query: 506 LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEA 565
DR DL LPG Q+ELI + E PVI+VL+ G P+ I++ I +I+ A PGE
Sbjct: 524 RDRADLNLPGVQEELIKAIYETG-TPVIVVLINGRPMSISWIA--EKIPAIIEAWLPGEE 580
Query: 566 GAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVF 625
G A+A+VIFGD+NPGG+LP++ PQ ++P+ +P + Y K ++
Sbjct: 581 GGRAVADVIFGDYNPGGKLPIS-IPQSVGQLPVY-YYHKPSGGRSHWKGDYVELSTKPLY 638
Query: 626 PFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLV 685
PFG GLSY+++SY +S K+ L +MVE +
Sbjct: 639 PFGYGLSYTEFSYTNLNISNRKVSLRD----RMVE------------------------I 670
Query: 686 TIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESL 745
++ +KN G + G V L++ RP+K+L GF+ + L+A E+ ++F+LS E L
Sbjct: 671 SVDIKNTGTLKGDEVVQLYIHQEALSVTRPVKELKGFKRITLDAGEEKTVIFKLS-IEQL 729
Query: 746 SRAREDGLMVIEEGTHFLVVG 766
E+ V+E G +++G
Sbjct: 730 GFYDENMEYVVEPGRVDVMIG 750
>gi|294675412|ref|YP_003576028.1| family 3 glycosyl hydrolase [Prevotella ruminicola 23]
gi|294472176|gb|ADE81565.1| glycosyl hydrolase, family 3 [Prevotella ruminicola 23]
Length = 875
Score = 298 bits (762), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 169/430 (39%), Positives = 243/430 (56%), Gaps = 32/430 (7%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
P+ L +QRA DL+SRLTLDEK+S +++++PAIPRLGIP ++WW+EALH G+G+
Sbjct: 23 LPYQNANLSAAQRADDLLSRLTLDEKVSLMMDTSPAIPRLGIPQFQWWNEALH---GIGR 79
Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------MT 155
F AT FP + AAS+D L +++ A+ EAR + G ++
Sbjct: 80 NGF-------ATVFPITMAMAASWDDALLHQVFTAVSDEARVKAQQAKCTGDIKRYQSLS 132
Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKL--KGKLQASAC 213
FW PNINIFRDPRWGRGQET GEDP +T K ++ VRG+QG +NG L + AC
Sbjct: 133 FWTPNINIFRDPRWGRGQETYGEDPYLTAKMGLAVVRGLQGVGYNGEDLGVSKYRKLLAC 192
Query: 214 CKHFTAYDLDNWKGTTRYKFD-ARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGI 272
KHF + W R++F+ + +DL +TY P F++ V++G+ + +MCAY R++G
Sbjct: 193 AKHFAVHSGPEW---NRHEFNIENLPERDLWETYLPAFKALVQEGKVAEVMCAYQRIDGQ 249
Query: 273 PSCADRNLLSKTARRQWGFHGYITSDCDAV-SIIYDAEGYAKSPEDAVVDVLKAGMDVNC 331
CA + R +WGF G ITSDC A+ + +K +A + AG DV C
Sbjct: 250 ACCAQTRYEQQILRDEWGFDGLITSDCGAIRDFLPRWHNVSKDGAEASAKAVLAGTDVEC 309
Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
GS KH AV++ + E++IDR+L L R LG + + + + KI VV S A
Sbjct: 310 GSEY-KHLPEAVRRGDVKEADIDRSLRRLLIARFELGDMDSD-DLNAWTKIPETVVASQA 367
Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPK------SKSVSLALIGPNANSAKTLLGNYAGPS 445
H+ LAL+ A IVLL+N +LPL + ++GPNAN + + GNYAG
Sbjct: 368 HKDLALKMALKSIVLLQNKIKVLPLGNPLNAGAGSDKDIVVMGPNANDSVMMWGNYAGYP 427
Query: 446 CRSITPLQAL 455
++T L +
Sbjct: 428 THTVTALDGI 437
Score = 85.1 bits (209), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 72/259 (27%), Positives = 105/259 (40%), Gaps = 59/259 (22%)
Query: 491 VVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGG 540
++ + G+ E EE+ DR + LP Q++L+ + +A KK ++ + C G
Sbjct: 623 IIFVGGISPNLEGEEMRVNEPGFKGGDRTSIELPQAQRDLLAVLHKAGKK--VIFVNCSG 680
Query: 541 PVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTD 600
+ A +IL Y GE G ALA +FG P G+LP+T+Y D
Sbjct: 681 SA-MALAPELETCDAILQWWYGGEQGGAALATTLFGMVAPSGKLPVTFYKSTDELPDFLD 739
Query: 601 MKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVE 660
M+ RTYR+YEG+ +FPFG GL Y+ ++ NK
Sbjct: 740 YTMK--------NRTYRYYEGEPLFPFGFGLGYTTFNIDKPIYKNNK------------- 778
Query: 661 NQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLV 720
V + VKN G AG V ++++ G P K L
Sbjct: 779 ------------------------VQVRVKNLGTTAGTETVQVYIRHLADKEG-PKKSLR 813
Query: 721 GFQSVILNAKEKAEIVFEL 739
+Q V LNA E I EL
Sbjct: 814 AYQQVTLNAAEAKTISIEL 832
>gi|393781488|ref|ZP_10369683.1| hypothetical protein HMPREF1071_00551 [Bacteroides salyersiae
CL02T12C01]
gi|392676551|gb|EIY69983.1| hypothetical protein HMPREF1071_00551 [Bacteroides salyersiae
CL02T12C01]
Length = 850
Score = 297 bits (761), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 178/440 (40%), Positives = 244/440 (55%), Gaps = 41/440 (9%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
P+ L QRA DL+ RLT++EKIS + N++P IPRLGI YEWW+EALHGVA G
Sbjct: 14 LPYQNPDLTPEQRATDLLQRLTVEEKISLMQNNSPGIPRLGIRPYEWWNEALHGVARAGL 73
Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEA----RALYNAGQAI---GMT 155
AT FPQ I AASF+ L ++ A+ EA RA + GQ G+T
Sbjct: 74 ----------ATVFPQTIGMAASFNDSLVQKVFTAVSDEARAKNRAFNDQGQYKRYQGLT 123
Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASACC 214
W PN+NIFRDPRWGRGQET GEDP +T + V+ V+G+QG D+ KL AC
Sbjct: 124 MWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVAVVKGLQGPDSARYDKLH------ACA 177
Query: 215 KHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
KHF + W R+ F+A + +DL +TY P F++ V++ +MCAYNR G P
Sbjct: 178 KHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKTLVQEADVKEVMCAYNRFEGDP 234
Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE--DAVVDVLKAGMDVNC 331
C LL++ R +WGF+G + SDC A+S + A+ + P+ A D + +G D+ C
Sbjct: 235 CCGSNRLLTQILRDEWGFNGIVVSDCGAISDFWGAKKHNTHPDAAHASADAVLSGTDLEC 294
Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLG-LFNGNPTMQPFGKIGADVVCSP 390
GS +K T AVK + E +ID ++ L R LG + +P P+ +V P
Sbjct: 295 GSNYRKLTD-AVKAGIISEEQIDISVKRLLKARFELGEMEESHPWALPYS-----IVDCP 348
Query: 391 AHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSIT 450
H+ LALQ A + + LL+N +LPL K V A+IGPNAN + GNY G + T
Sbjct: 349 EHRHLALQIAHETMTLLQNKENILPLDKHAKV--AVIGPNANDSVMQWGNYNGTPSHTST 406
Query: 451 PLQALQNYV--ENTVYYPGC 468
L AL++ + +Y P C
Sbjct: 407 LLSALRSKLPAAQLIYEPVC 426
Score = 97.1 bits (240), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 80/286 (27%), Positives = 130/286 (45%), Gaps = 53/286 (18%)
Query: 465 YPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLP 514
Y D V +++ ++ K + V+ G+ E EE+ DR D+ LP
Sbjct: 564 YFNFDLVEDIPLNMNATLEKLKDTEIVIFAGGISPLLEGEEMKVSAAGFKGGDRTDIELP 623
Query: 515 GRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVI 574
Q+ ++ + +A KK VI V G + +T N +IL A YPG+ G A+A+V+
Sbjct: 624 AVQRNVLAALKKAGKK-VIFVNFSGSAMALT--PETENCDAILQAWYPGQEGGTAVADVL 680
Query: 575 FGDHNPGGRLPMTWYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSY 633
FGD+NP GRLP+T+Y ++ ++P D M+ GRTYR+ + +FPFG GLSY
Sbjct: 681 FGDYNPAGRLPVTFY-KNMEQLPDFEDYSMQ--------GRTYRYMKEAPLFPFGYGLSY 731
Query: 634 SKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHG 693
+ ++Y + ++ S+ KM +TI V N G
Sbjct: 732 TTFTYGKARADKKRI----STGEKMT-------------------------LTIPVSNIG 762
Query: 694 EMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFEL 739
G+ V ++++ G P K L F+ V + + + EL
Sbjct: 763 SRDGEEVVQVYLRREDDPEG-PTKTLRAFKRVEITKGKSLNVKIEL 807
>gi|424792251|ref|ZP_18218496.1| exported beta-glucosidase [Xanthomonas translucens pv. graminis
ART-Xtg29]
gi|422797157|gb|EKU25539.1| exported beta-glucosidase [Xanthomonas translucens pv. graminis
ART-Xtg29]
Length = 909
Score = 297 bits (761), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 173/427 (40%), Positives = 237/427 (55%), Gaps = 33/427 (7%)
Query: 62 RLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVIL 121
++T +EK++Q +N+APAIPRLG+PAYEWW+E LHG+A G AT FPQ I
Sbjct: 67 KMTREEKVAQAMNAAPAIPRLGVPAYEWWNEGLHGIARNGY----------ATVFPQAIG 116
Query: 122 TAASFDSYLWYRIGQAIGLEARALYN-AG-------QAIGMTFWAPNINIFRDPRWGRGQ 173
AA++++ L ++G EARA +N AG + G+T W+PNINIFRDPRWGRG
Sbjct: 117 LAATWNTALLEQVGTVTSTEARAKFNLAGGPGKDHPRYAGLTIWSPNINIFRDPRWGRGM 176
Query: 174 ETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKF 233
ET GEDP +TG+ AV ++ G+QGD + A KH + + R+ F
Sbjct: 177 ETYGEDPYLTGQLAVGFIHGLQGDDLT------HPRTIATPKHLAVH---SGPEPGRHGF 227
Query: 234 DARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHG 293
D V+ DL TY P F + + GRA +MCAYN ++G P+CA LL+ R WGF G
Sbjct: 228 DVDVSPHDLEATYTPAFRAAIVDGRAGSVMCAYNALHGTPACAADWLLNGRLRGDWGFTG 287
Query: 294 YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEI 353
++ SDCDAV + + + LKAG D+NCG + + A+ + E+ +
Sbjct: 288 FVVSDCDAVDDMTQFHYFRADNAGSSAAALKAGHDLNCG-YAYRDLGKAIARGDADEALL 346
Query: 354 DRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGL 413
D++L LF+ R RLG P+ ++GA V S AH+ LALQAAQ IVLL+N +
Sbjct: 347 DKSLVRLFAARYRLGELQPQ-RKDPYARLGAKDVDSAAHRALALQAAQQSIVLLQNRNAT 405
Query: 414 LPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNY--VENTVYYPGCDTV 471
LPL + LA+IGPNA++ L NY G S +TPL L+ N Y G
Sbjct: 406 LPL--RPGLRLAVIGPNADALAALEANYQGTSAAPVTPLLGLRERFGAANVRYAQGAPLA 463
Query: 472 ACSSASI 478
A S I
Sbjct: 464 AGVSGMI 470
Score = 132 bits (331), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 95/283 (33%), Positives = 143/283 (50%), Gaps = 56/283 (19%)
Query: 496 GLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
GL E EEL DR DL LP QQ L+ R A+A+ KP+++VL+ G V +
Sbjct: 646 GLSPDVEGEELRIDVPGFDGGDRNDLALPAAQQALLER-AKASGKPLVVVLMSGSAVALN 704
Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWY--PQDYIKVPMTDMKM 603
+AK ++ +I+ A YPG++G A+A+V+ GD NPGGRLP+T+Y +D DMK
Sbjct: 705 WAK--QHADAIVAAWYPGQSGGTAIAQVLAGDVNPGGRLPVTFYRSTKDLPAYVSYDMK- 761
Query: 604 RPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQD 663
GRTYR+++G+ +F FG GLSY++++Y +S L Q+ + V Q
Sbjct: 762 ---------GRTYRYFKGEPLFAFGSGLSYTRFTYAAPQLSATTL---QAGAHLQVRTQ- 808
Query: 664 VVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQ 723
V+N G AG V ++++ +R P++ LVGFQ
Sbjct: 809 -------------------------VRNSGTRAGDEVVQVYLEFPQRAQS-PLRTLVGFQ 842
Query: 724 SVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
V L E ++ FEL+P LS G ++ G + + VG
Sbjct: 843 RVTLQPGEARDVSFELAP-RQLSDVDRAGQRAVQPGDYRVFVG 884
>gi|326389315|ref|ZP_08210883.1| glycoside hydrolase family 3 domain protein [Thermoanaerobacter
ethanolicus JW 200]
gi|325994678|gb|EGD53102.1| glycoside hydrolase family 3 domain protein [Thermoanaerobacter
ethanolicus JW 200]
Length = 784
Score = 297 bits (761), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 239/801 (29%), Positives = 387/801 (48%), Gaps = 134/801 (16%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSA------------------------PAIP 80
+ +T + +R DL+ ++T++EK++QL NS I
Sbjct: 5 YLDSTQSVEKRVEDLLQQMTIEEKVAQL-NSIWVYEILDDMKFSFDKAKRLMSYGISQIT 63
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTI----------------RGATSFPQVILTAA 124
RLG + E + + K + N + +GAT FPQ I A+
Sbjct: 64 RLGGASNLSPRETVRIANQIQKFLIENTRLGIPALIHEESCSGYMAKGATIFPQTIGVAS 123
Query: 125 SFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTG 184
++++ + ++ I + +A+ A QA+ AP ++I RDPRWGR +ET GEDP +
Sbjct: 124 TWNNEIVEKMASVIREQMKAV-GARQAL-----APLLDITRDPRWGRTEETFGEDPYLVM 177
Query: 185 KYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLAD 244
+ VSY+RG+Q ++ G + A KHF Y N +G + A + ++L +
Sbjct: 178 RMGVSYIRGLQTESLREGIV-------ATGKHFVGYG--NSEGGMNWA-PAHIPERELRE 227
Query: 245 TYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSI 304
+ PFE+ VK+ + S IM Y+ ++G+P + LL+ R+ WGF G + SD A+S
Sbjct: 228 VFLYPFEAAVKEAKLSSIMPGYHELDGVPCHKSKKLLNDILRKDWGFEGIVVSDYFAISQ 287
Query: 305 IYDAEGYAKSPEDAVVDVLKAGMDVNCGS--FLQKHTKAAVKQKKLPESEIDRALHNLFS 362
+Y+ + A L+AG+DV S + + ++ ++ ++ A+ +
Sbjct: 288 LYEYHHVTSDKKGAAKLALEAGVDVELPSTDYYGLPLRELIESGEIDIDFVNEAVKRVLK 347
Query: 363 VRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSV 422
++ LGLF NP + + ++ + + LA + AQ+ IVLLKN + LLPL K
Sbjct: 348 IKFELGLFE-NPYINE--EKAVEIFDTNEQRELAYKIAQESIVLLKNENNLLPLKKDLK- 403
Query: 423 SLALIGPNANSAKTLLGNYAGP-SCRSI-----------TP----LQALQNYV------- 459
S+A+IGPNA+S + ++G+YA P S+ TP L+A YV
Sbjct: 404 SIAVIGPNADSIRNMIGDYAYPCHIESLLEMRETDNVFNTPLPESLEAKDIYVPIVTVLQ 463
Query: 460 -------ENT--VYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG-----LDQTQEKEE 505
NT +Y GCD + S +AV+IAK AD V+++G D E
Sbjct: 464 GIKAKASSNTEVLYAKGCDVLNNSKDGFKEAVEIAKQADVAVVVVGDKSGLTDGCTSGES 523
Query: 506 LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEA 565
DR DL LPG Q+ELI V E PVI+VL+ G P+ I++ I +I+ A PGE
Sbjct: 524 RDRADLNLPGVQEELIKAVYETG-TPVIVVLINGRPMSISWIA--EKIPAIIEAWLPGEE 580
Query: 566 GAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVF 625
G A+A+VIFGD+NPGG+LP++ PQ ++P+ +P + Y K ++
Sbjct: 581 GGRAVADVIFGDYNPGGKLPIS-IPQSVGQLPVY-YYHKPSGGRSHWKGDYVELSTKPLY 638
Query: 626 PFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLV 685
PFG GLSY+++SY +S K+ L +MVE +
Sbjct: 639 PFGYGLSYTEFSYTNLNISNRKVSLRD----RMVE------------------------I 670
Query: 686 TIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESL 745
++ +KN G + G V L++ RP+K+L GF+ + L+A E+ ++F+LS E L
Sbjct: 671 SVDIKNTGTLKGDEVVQLYIHQEALSVTRPVKELKGFKRITLDAGEEKTVIFKLS-IEQL 729
Query: 746 SRAREDGLMVIEEGTHFLVVG 766
E+ V+E G +++G
Sbjct: 730 GFYDENMEYVVEPGRVDVMIG 750
>gi|1749831|emb|CAA91219.1| beta-xylo-glucosidase [Thermoanaerobacter brockii]
Length = 730
Score = 297 bits (760), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 216/695 (31%), Positives = 351/695 (50%), Gaps = 93/695 (13%)
Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWG 170
+GAT FPQ I A+++++ + ++ I + +A+ A QA+ AP ++I RDPRWG
Sbjct: 56 KGATIFPQTIGVASTWNNEIVEKMASVIREQMKAV-GARQAL-----APLLDITRDPRWG 109
Query: 171 RGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTR 230
R +ET GEDP + + VSY+RG+Q ++ G + A KHF Y N +G
Sbjct: 110 RTEETFGEDPYLVMRMGVSYIRGLQTESLKEGIV-------ATGKHFVGYG--NSEGGMN 160
Query: 231 YKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWG 290
+ A + ++L + + PFE+ VK+ + S IM Y+ ++G+P + LL+ R+ WG
Sbjct: 161 WA-PAHIPERELREVFLYPFEAAVKEAKLSSIMPGYHELDGVPCHKSKKLLNDILRKDWG 219
Query: 291 FHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGS--FLQKHTKAAVKQKKL 348
F G + SD A+S +Y+ + A L+AG+DV S + + ++ ++
Sbjct: 220 FEGIVVSDYFAISQLYEYHHVTSDKKGAAKLALEAGVDVELPSTDYYGLPLRELIESGEI 279
Query: 349 PESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLK 408
++ A+ + ++ LGLF NP + + ++ + + LA + AQ+ IVLLK
Sbjct: 280 DIDFVNEAVKRVLKIKFELGLFE-NPYINE--EKAVEIFDTNEQRELAYKIAQESIVLLK 336
Query: 409 NSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGP-------SCRS------------- 448
N + LLPL K S+A+IGPNA+S + ++G+YA P R
Sbjct: 337 NENNLLPLKKDLK-SIAVIGPNADSIRNMIGDYAYPCHIESLLEMRETDNVFNTPLPESL 395
Query: 449 ---------ITPLQALQNYVENT---VYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG 496
+T LQ ++ V + +Y GCD + S +AV+IAK AD V+++G
Sbjct: 396 EAKDIYVPIVTVLQGIKAKVSSNTEVLYAKGCDVLNNSKDGFKEAVEIAKQADVAVVVVG 455
Query: 497 -----LDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDR 551
D E DR DL LPG Q+ELI + E PVI+VL+ G P+ I++
Sbjct: 456 DKSGLTDGCTSGESRDRADLNLPGVQEELIKAIYETG-TPVIVVLINGRPMSISWIA--E 512
Query: 552 NIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGN 611
I +I+ A PGE G A+A+VIFGD+NPGG+LP++ PQ ++P+ +P +
Sbjct: 513 KIPAIIEAWLPGEEGGRAVADVIFGDYNPGGKLPIS-IPQSVGQLPVY-YYHKPSGGRSH 570
Query: 612 PGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVP 671
Y K ++PFG GLSY+++SY +S K+ L +MVE
Sbjct: 571 WKGDYVELSTKPLYPFGYGLSYTEFSYTNLNISNRKVSLRD----RMVE----------- 615
Query: 672 ELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKE 731
+++ +KN G + G V L++ RP+K+L GF+ + L+A E
Sbjct: 616 -------------ISVDIKNTGTLKGDEVVQLYIHQEALSVTRPVKELKGFKRITLDAGE 662
Query: 732 KAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
+ ++F+LS E L E+ V+E G +++G
Sbjct: 663 EKTVIFKLS-IEQLGFYDENMEYVVEPGRVDVMIG 696
>gi|7259476|gb|AAF43783.1|AF135015_2 xylosidase/arabinosidase [Thermoanaerobacter ethanolicus JW 200]
Length = 784
Score = 297 bits (760), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 239/801 (29%), Positives = 387/801 (48%), Gaps = 134/801 (16%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSA------------------------PAIP 80
+ +T + +R DL+ ++T++EK++QL NS I
Sbjct: 5 YLDSTQSVEKRVEDLLQQMTIEEKVAQL-NSIWVYEILDDMKFSFDKAKRLMSYGISQIT 63
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTI----------------RGATSFPQVILTAA 124
RLG + E + + K + N + +GAT FPQ I A+
Sbjct: 64 RLGGASNLSPRETVRIANQIQKFLIENTRLGIPALIHEESCSGYMPKGATIFPQTIGVAS 123
Query: 125 SFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTG 184
++++ + ++ I + +A+ A QA+ AP ++I RDPRWGR +ET GEDP +
Sbjct: 124 TWNNEIVEKMASVIREQMKAV-GARQAL-----APLLDITRDPRWGRTEETFGEDPYLVM 177
Query: 185 KYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLAD 244
+ VSY+RG+Q ++ G + A KHF Y N +G + A + ++L +
Sbjct: 178 RMGVSYIRGLQTESLREGIV-------ATGKHFVGYG--NSEGGMNWA-PAHIPERELRE 227
Query: 245 TYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSI 304
+ PFE+ VK+ + S IM Y+ ++G+P + LL+ R+ WGF G + SD A+S
Sbjct: 228 VFLYPFEAAVKEPKLSSIMPGYHELDGVPCHKSKKLLNDILRKDWGFEGIVVSDYFAISQ 287
Query: 305 IYDAEGYAKSPEDAVVDVLKAGMDVNCGS--FLQKHTKAAVKQKKLPESEIDRALHNLFS 362
+Y+ + A L+AG+DV S + + ++ ++ ++ A+ +
Sbjct: 288 LYEYHHVTSDKKGAAKLALEAGVDVELPSTDYYGLPLRELIESGEIDIDFVNEAVKRVLK 347
Query: 363 VRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSV 422
++ LGLF NP + + ++ + + LA + AQ+ IVLLKN + LLPL K
Sbjct: 348 IKFELGLFE-NPYINE--EKAVEIFDTNEQRELAYKIAQESIVLLKNENNLLPLKKDLK- 403
Query: 423 SLALIGPNANSAKTLLGNYAGP-SCRSI-----------TP----LQALQNYV------- 459
S+A+IGPNA+S + ++G+YA P S+ TP L+A YV
Sbjct: 404 SIAVIGPNADSIRNMIGDYAYPCHIESLLEMRETDNVFNTPLPESLEAKDIYVPIVTVLQ 463
Query: 460 -------ENT--VYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG-----LDQTQEKEE 505
NT +Y GCD + S +AV+IAK AD V+++G D E
Sbjct: 464 GIKAKASSNTEVLYAKGCDVLNNSKDGFKEAVEIAKQADVAVVVVGDKSGLTDGCTSGES 523
Query: 506 LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEA 565
DR DL LPG Q+ELI V E PVI+VL+ G P+ I++ I +I+ A PGE
Sbjct: 524 RDRADLNLPGVQEELIKAVYETG-TPVIVVLINGRPMSISWIA--EKIPAIIEAWLPGEE 580
Query: 566 GAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVF 625
G A+A+VIFGD+NPGG+LP++ PQ ++P+ +P + Y K ++
Sbjct: 581 GGRAVADVIFGDYNPGGKLPIS-IPQSVGQLPVY-YYHKPSGGRSHWKGDYVELSTKPLY 638
Query: 626 PFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLV 685
PFG GLSY+++SY +S K+ L +MVE +
Sbjct: 639 PFGYGLSYTEFSYTNLNISNRKVSLRD----RMVE------------------------I 670
Query: 686 TIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESL 745
++ +KN G + G V L++ RP+K+L GF+ + L+A E+ ++F+LS E L
Sbjct: 671 SVDIKNTGTLKGDEVVQLYIHQEALSVTRPVKELKGFKRITLDAGEEKTVIFKLS-IEQL 729
Query: 746 SRAREDGLMVIEEGTHFLVVG 766
E+ V+E G +++G
Sbjct: 730 GFYDENMEYVVEPGRVDVMIG 750
>gi|315607027|ref|ZP_07882031.1| beta-glucosidase [Prevotella buccae ATCC 33574]
gi|315251081|gb|EFU31066.1| beta-glucosidase [Prevotella buccae ATCC 33574]
Length = 866
Score = 296 bits (759), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 173/454 (38%), Positives = 250/454 (55%), Gaps = 36/454 (7%)
Query: 31 FSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWW 90
FSC + +P+ L +RA DL SRLTL+EK + NS+PAIPRLGIP +EWW
Sbjct: 14 FSC---VAGAQQYPYQNLQLSSQERAEDLCSRLTLEEKTKLMRNSSPAIPRLGIPQFEWW 70
Query: 91 SEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ 150
SEALHG+A G AT FPQ AAS+D L YR+ A EA A N +
Sbjct: 71 SEALHGIARNGF----------ATVFPQTTAMAASWDDELLYRVFCAASDEAVAKNNLAR 120
Query: 151 AI-------GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGG- 202
G++ W PNINIFRDPRWGRGQET GEDP +T + ++ V G+QG F
Sbjct: 121 KSGDIKRYQGVSIWTPNINIFRDPRWGRGQETYGEDPYLTSRMGLAVVNGLQGQPFRRDM 180
Query: 203 ----KLKGKLQASACCKHFTAYDLDNWKGTTRYKFDA-RVTMQDLADTYQPPFESCVKQG 257
+ + AC KH+ + W R+ FD R+ +DL +TY P F+S V++G
Sbjct: 181 RPFTERPRYYKTLACAKHYAVHSGPEW---NRHVFDVERLPERDLWETYLPAFKSLVQEG 237
Query: 258 RASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGY---AKS 314
+MCAY R++G P C + L + R +WG++G + SDC A+S Y EG+ ++
Sbjct: 238 NVREVMCAYQRIDGSPCCGNTRYLHQILRGEWGYNGLVVSDCGAISDFY-REGHHHVVET 296
Query: 315 PEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNP 374
P +A ++AG DV CG+ + AV+Q + ID ++ L R +G F+ +
Sbjct: 297 PAEASAMGVRAGTDVECGAVYATLPR-AVEQGLISREAIDTSVVRLLKARFEVGDFD-SE 354
Query: 375 TMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSA 434
+ P+ G +V+ S H+ LAL A++ + LL+N + LLPL K+ + +A++GPNAN +
Sbjct: 355 KLVPWKLTGPEVIASETHRRLALDMARESMTLLQNRNRLLPLSKN-GLRIAVMGPNANDS 413
Query: 435 KTLLGNYAGPSCRSITPLQALQNYVENTVYYPGC 468
L GNY G + T L+ +++ V + GC
Sbjct: 414 VMLWGNYTGYPISTTTILKGIRSKVPAARFVEGC 447
Score = 108 bits (269), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 89/298 (29%), Positives = 128/298 (42%), Gaps = 71/298 (23%)
Query: 485 AKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVIL 534
A AD VV + G+ E EE+ DR + LP Q+E+I R+ A K V+
Sbjct: 608 AGDADVVVFVGGISPRLEGEEMKVDAPGFNGGDRTSIELPEAQREVI-RLLRQAGKLVVF 666
Query: 535 VLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI 594
V GG V ++L A Y GEAG A+A+V+FGD+NP G+LP+T+Y D
Sbjct: 667 VNCSGGAV--ALVPEAEACDAVLQAWYAGEAGGQAVADVLFGDYNPSGKLPVTFYKSDAD 724
Query: 595 KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSS 654
D +M GRTYR++ G +FPFG GLSY+ + + KLY+
Sbjct: 725 LPDFLDYRM--------TGRTYRYFRGTPLFPFGFGLSYTSFVFGTPRYENGKLYVE--- 773
Query: 655 STKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGR 714
V N G+ G V ++VK +G
Sbjct: 774 ----------------------------------VTNTGKRDGAEVVQVYVKNPADADG- 798
Query: 715 PIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDG------LMVIEEGTHFLVVG 766
P+K L GF + L A E+ + ++ R R +G M ++ G H L+VG
Sbjct: 799 PVKTLRGFARIDLKAGERRRVEI------AMPRERFEGWDATTNTMRVKPGNHLLMVG 850
>gi|325299987|ref|YP_004259904.1| Beta-glucosidase [Bacteroides salanitronis DSM 18170]
gi|324319540|gb|ADY37431.1| Beta-glucosidase [Bacteroides salanitronis DSM 18170]
Length = 864
Score = 296 bits (758), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 180/458 (39%), Positives = 250/458 (54%), Gaps = 39/458 (8%)
Query: 41 ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
+ P+ L +RA DLV RLTL+EK S + N++PAIPRLGI AY+WW+EALHGV
Sbjct: 23 QQLPYQNPKLTPEERANDLVGRLTLEEKASLMQNTSPAIPRLGIKAYDWWNEALHGVGRA 82
Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY-------NAGQAIG 153
G AT FPQ I AASFD L Y++ A+ EARA Y + + G
Sbjct: 83 GI----------ATVFPQTIGMAASFDDELLYQVFTAVSDEARAKYTQFRKEGDLKRYQG 132
Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
+TFW PN+NIFRDPRWGRGQET GEDP +T + ++ VRG+QG KL AC
Sbjct: 133 LTFWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGMAVVRGLQGPE---DAPYDKLH--AC 187
Query: 214 CKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGI 272
KHF + W R++F+A + +DL +TY P F+ V++ +MCAYNR+ G
Sbjct: 188 AKHFAVHSGPEW---NRHEFNAENIAPRDLWETYMPAFKDLVQKAHVKEVMCAYNRLEGE 244
Query: 273 PSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVD--VLKAGMDVN 330
P C + LL+ R +WG+ G + SDC A+S + + P+ A + +G D+
Sbjct: 245 PCCGNNRLLTHILRDEWGYQGIVVSDCGAISDFWRKGDHETHPDKAHASAGAVLSGTDLE 304
Query: 331 CGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSP 390
CGS K AVK + ES++D ++ L R LG + + + I VV
Sbjct: 305 CGSNY-KSLPEAVKAGLIAESQLDISVKRLLKARFELGEMDKDVC---WDTIPYSVVDCQ 360
Query: 391 AHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSIT 450
AH+ LAL+ A++ IVLL+N + +LPL K + +AL+GPNAN + GNY G + T
Sbjct: 361 AHKDLALRMARESIVLLQNRNNILPL--RKDMKIALVGPNANDSIMHWGNYNGFPSHTET 418
Query: 451 PLQALQNYV--ENTVYYPGCDT---VACSSASIDKAVD 483
+AL+ + +Y GCD VA S A+D
Sbjct: 419 LYEALKKRLPASQLIYEFGCDRTSPVALESVFAQCAID 456
Score = 93.2 bits (230), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 80/301 (26%), Positives = 126/301 (41%), Gaps = 53/301 (17%)
Query: 477 SIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAE 526
++ D K AD ++ G+ T E EE+ DR + LP Q++L+ + +
Sbjct: 590 NLQATADKVKDADVILFAGGISPTLEGEEMPVDAEGFRGGDRTSIELPAIQRQLVGELKK 649
Query: 527 AAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPM 586
K P++ + G + + A ++ A YPG+AG A+A+V+FGD+NP G+LP+
Sbjct: 650 LGK-PIVFINYSGSAMGL--APESEICDGMIQAWYPGQAGGTAIADVLFGDYNPAGKLPV 706
Query: 587 TWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQN 646
T+Y D M+ GRTYR+ +F FG GLSY+ + Y +SQN
Sbjct: 707 TFYRNTEQLPDFEDYAMK--------GRTYRYMTETPLFRFGHGLSYTTFDYGKARLSQN 758
Query: 647 KLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVK 706
F + +TI V N G G+ V ++++
Sbjct: 759 T-----------------------------FSKGETLTLTIPVSNTGTRDGEETVQVYLR 789
Query: 707 PARRGNGR-PIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVV 765
R G+ P L F+ V + EI F LS L M + G + L+
Sbjct: 790 --RPGDADAPSHTLRAFKRVYVPKGGTKEIKFTLSDDNFLWFDTSTNNMNLISGEYELLY 847
Query: 766 G 766
G
Sbjct: 848 G 848
>gi|440733337|ref|ZP_20913088.1| beta-glucosidase [Xanthomonas translucens DAR61454]
gi|440362904|gb|ELQ00083.1| beta-glucosidase [Xanthomonas translucens DAR61454]
Length = 895
Score = 296 bits (758), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 173/427 (40%), Positives = 237/427 (55%), Gaps = 33/427 (7%)
Query: 62 RLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVIL 121
++T +EK++Q +N+APAIPRLG+PAYEWW+E LHG+A G AT FPQ I
Sbjct: 53 KMTREEKVAQAMNAAPAIPRLGVPAYEWWNEGLHGIARNGY----------ATVFPQAIG 102
Query: 122 TAASFDSYLWYRIGQAIGLEARALYN-AG-------QAIGMTFWAPNINIFRDPRWGRGQ 173
AA++++ L ++G EARA +N AG + G+T W+PNINIFRDPRWGRG
Sbjct: 103 LAATWNTALLEQVGTVTSTEARAKFNLAGGPGKDHPRYAGLTIWSPNINIFRDPRWGRGM 162
Query: 174 ETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKF 233
ET GEDP +TG+ AV ++ G+QGD + A KH + + R+ F
Sbjct: 163 ETYGEDPYLTGQLAVGFIHGLQGDDLT------HPRTIATPKHLAVH---SGPEPGRHGF 213
Query: 234 DARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHG 293
D V+ DL TY P F + + GRA +MCAYN ++G P+CA LL+ R WGF G
Sbjct: 214 DVDVSPHDLEATYTPAFRAAIVDGRAGSVMCAYNALHGTPACAADWLLNGRLRGDWGFTG 273
Query: 294 YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEI 353
++ SDCDAV + + + LKAG D+NCG + + A+ + E+ +
Sbjct: 274 FVVSDCDAVDDMTQFHYFRADNAGSSAAALKAGHDLNCG-YAYRDLGKAIARGDADEALL 332
Query: 354 DRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGL 413
D++L LF+ R RLG P+ ++GA V S AH+ LALQAAQ IVLL+N +
Sbjct: 333 DQSLVRLFAARYRLGELQPQ-RKDPYAQLGAKDVDSAAHRALALQAAQQSIVLLQNRNAT 391
Query: 414 LPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNY--VENTVYYPGCDTV 471
LPL + LA+IGPNA++ L NY G S +TPL L+ N Y G
Sbjct: 392 LPL--RPGLRLAVIGPNADALAALEANYQGTSAAPVTPLLGLRERFGAANVRYAQGAPLA 449
Query: 472 ACSSASI 478
A S I
Sbjct: 450 AGVSGMI 456
Score = 135 bits (341), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 97/283 (34%), Positives = 143/283 (50%), Gaps = 56/283 (19%)
Query: 496 GLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
GL E EEL DR DL LP QQ L+ R A+A+ KP+++VL+ G V +
Sbjct: 632 GLSPDVEGEELRIDVPGFDGGDRNDLALPAAQQALLER-AKASGKPLVVVLMSGSAVALN 690
Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWY--PQDYIKVPMTDMKM 603
+AK ++ +I+ A YPG++G A+A+V+ GD NPGGRLP+T+Y +D DMK
Sbjct: 691 WAK--QHADAIVAAWYPGQSGGTAIAQVLAGDVNPGGRLPVTFYRSTKDLPAYVSYDMK- 747
Query: 604 RPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQD 663
GRTYR+++G+ +F FG GLSY++++Y +S L Q+ + V Q
Sbjct: 748 ---------GRTYRYFKGEPLFAFGSGLSYTRFTYAAPQLSATTL---QAGANLQVRTQ- 794
Query: 664 VVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQ 723
V N G AG V ++++P +G P++ LVGFQ
Sbjct: 795 -------------------------VSNSGTRAGDEVVQVYLQPP-QGAQSPLRTLVGFQ 828
Query: 724 SVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
V L E E+ FEL+P LS G ++ G + + VG
Sbjct: 829 RVTLQPGEAREVGFELTP-RQLSDVDRAGQRAVQPGDYRVFVG 870
>gi|402304900|ref|ZP_10823963.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
sp. MSX73]
gi|400380686|gb|EJP33499.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
sp. MSX73]
Length = 866
Score = 296 bits (757), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 173/454 (38%), Positives = 250/454 (55%), Gaps = 36/454 (7%)
Query: 31 FSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWW 90
FSC + +P+ L +RA DL SRLTL+EK + NS+PAIPRLGIP +EWW
Sbjct: 14 FSC---VAGAQQYPYQNPRLSSQERAEDLCSRLTLEEKTKLMRNSSPAIPRLGIPQFEWW 70
Query: 91 SEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ 150
SEALHG+A G AT FPQ AAS+D L YR+ A EA A N +
Sbjct: 71 SEALHGIARNGF----------ATVFPQTTAMAASWDDELLYRVFCAASDEAVAKNNLAR 120
Query: 151 AI-------GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGG- 202
G++ W PNINIFRDPRWGRGQET GEDP +T + ++ V G+QG F
Sbjct: 121 KSGDIKRYQGVSIWTPNINIFRDPRWGRGQETYGEDPYLTSRMGLAVVNGLQGQPFRRDM 180
Query: 203 ----KLKGKLQASACCKHFTAYDLDNWKGTTRYKFDA-RVTMQDLADTYQPPFESCVKQG 257
+ + AC KH+ + W R+ FD R+ +DL +TY P F+S V++G
Sbjct: 181 RPFTERPRYYKTLACAKHYAVHSGPEW---NRHVFDVERLPERDLWETYLPAFKSLVQEG 237
Query: 258 RASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGY---AKS 314
+MCAY R++G P C + L + R +WG++G + SDC A+S Y EG+ ++
Sbjct: 238 NVREVMCAYQRIDGSPCCGNTRYLHQILRGEWGYNGLVVSDCGAISDFY-REGHHHVVET 296
Query: 315 PEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNP 374
P +A ++AG DV CG+ + AV+Q + ID ++ L R +G F+ +
Sbjct: 297 PAEASAMGVRAGTDVECGAVYATLPR-AVEQGLISREAIDTSVVRLLKARFEVGDFD-SE 354
Query: 375 TMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSA 434
+ P+ G +V+ S H+ LAL A++ + LL+N + LLPL K+ + +A++GPNAN +
Sbjct: 355 KLVPWKLTGPEVIASETHRRLALDMARESMTLLQNRNRLLPLSKN-GLRIAVMGPNANDS 413
Query: 435 KTLLGNYAGPSCRSITPLQALQNYVENTVYYPGC 468
L GNY G + T L+ +++ V + GC
Sbjct: 414 VMLWGNYTGYPISTTTILKGIRSKVPAARFVEGC 447
Score = 106 bits (264), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 88/298 (29%), Positives = 128/298 (42%), Gaps = 71/298 (23%)
Query: 485 AKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVIL 534
A AD VV + G+ E EE+ DR + LP Q+E+I R+ A K V+
Sbjct: 608 AGDADVVVFVGGISPRLEGEEMKVDAPGFKGGDRTSIELPEAQREVI-RLLRQAGKLVVF 666
Query: 535 VLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI 594
V GG V ++L A Y GEAG A+A+V+FGD+NP G+LP+T+Y D
Sbjct: 667 VNCSGGAV--ALVPEAEACDAVLQAWYAGEAGGQAVADVLFGDYNPSGKLPVTFYKSDAD 724
Query: 595 KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSS 654
D +M GRTYR++ G +FPFG GLSY+ +++ LY+
Sbjct: 725 LPDFLDYRM--------TGRTYRYFRGTPLFPFGFGLSYTSFAFGKPRYENGMLYVE--- 773
Query: 655 STKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGR 714
V N G+ G V ++VK +G
Sbjct: 774 ----------------------------------VTNTGKRDGAEVVQVYVKNPADADG- 798
Query: 715 PIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDG------LMVIEEGTHFLVVG 766
P+K L GF + L A E+ + ++ R R +G M ++ G H L+VG
Sbjct: 799 PVKTLRGFARIDLKAGERRRVEI------AMPRERFEGWDATANTMRVKPGNHLLMVG 850
>gi|218186207|gb|EEC68634.1| hypothetical protein OsI_37026 [Oryza sativa Indica Group]
Length = 1241
Score = 295 bits (756), Expect = 6e-77, Method: Compositional matrix adjust.
Identities = 153/329 (46%), Positives = 209/329 (63%), Gaps = 25/329 (7%)
Query: 136 QAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQ 195
QA+ EARA+YN G+ G+T+W+PNIN+ RDPRWGR ETPGEDP V G+YAV++VRG+Q
Sbjct: 916 QAVSTEARAMYNMGKG-GLTYWSPNINVVRDPRWGRALETPGEDPYVVGRYAVNFVRGMQ 974
Query: 196 G-----DTFNGGKLKGK-LQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPP 249
GG + L+ SACCKH+ AYDLD+W TR++FDARV +D+ +T+Q P
Sbjct: 975 DIPGHEAVAAGGDPNTRPLKTSACCKHYAAYDLDDWHNHTRFEFDARVDERDMVETFQRP 1034
Query: 250 FESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYD-A 308
FE CV+ G S +MC+YNRVNGIP+CAD LLS+T RR WG HGYI SDCDAV ++ D A
Sbjct: 1035 FEMCVRDGDVSSVMCSYNRVNGIPACADARLLSQTIRRDWGLHGYIVSDCDAVRVMTDNA 1094
Query: 309 EGYAKSPEDAVVDVLKAGMDVNCGS-------------FLQKHTKAAVKQKKLPESEIDR 355
+ +A LKAG+D++CG FL + AV + K+ ES+ID
Sbjct: 1095 TWLGYTGAEASAAALKAGLDLDCGESWKNETDGHPLMDFLTTYGMEAVNKGKMRESDIDN 1154
Query: 356 ALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLP 415
AL N + MRLG F+ + + +G +C+ H+ LAL A+ GIVLLKN + LLP
Sbjct: 1155 ALTNQYMTLMRLGYFD---DIAQYSSLGRQDICTDQHKTLALDGARQGIVLLKNDNKLLP 1211
Query: 416 LPKSKSVSLALIGPNANSAKTLL-GNYAG 443
L +K + + GP+ + + ++ G+Y G
Sbjct: 1212 LDANKVGFVNVRGPHVQAPEKIMDGDYTG 1240
>gi|305663349|ref|YP_003859637.1| glycoside hydrolase family protein [Ignisphaera aggregans DSM
17230]
gi|304377918|gb|ADM27757.1| glycoside hydrolase family 3 domain protein [Ignisphaera aggregans
DSM 17230]
Length = 757
Score = 295 bits (755), Expect = 6e-77, Method: Compositional matrix adjust.
Identities = 234/782 (29%), Positives = 371/782 (47%), Gaps = 126/782 (16%)
Query: 50 LPISQRARDLVSRLTLDEKISQLVN----------------------------------S 75
+ I +R R+L+ R++++EKI+QL++ S
Sbjct: 1 MDIDERVRELIGRMSIEEKIAQLISIPLESVLDGKKFSVEKAREVLKYGVGEILRIGGSS 60
Query: 76 APAIPRLGIPAYEWWSEALHGVAGVG-KGIFFNGTIRG-----ATSFPQVILTAASFDSY 129
A PR + Y L +G I +I G AT FP + A+++D
Sbjct: 61 ARLSPREAVEIYNAIQRFLTRETRLGIPAIVHEESIAGLLAPTATVFPIPLALASTWDPD 120
Query: 130 LWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVS 189
L YR+ AI + A+ + AP +++ R+PRWGR +ET GED + ++
Sbjct: 121 LVYRVAVAIRRQIMAIGSRHT------LAPVLDLCREPRWGRCEETYGEDSYLAASMGIA 174
Query: 190 YVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPP 249
YV+G+QGD G + A KHF + + R V +++L + Y P
Sbjct: 175 YVKGIQGDDIRYGVI-------ATGKHFVGHGVPE---GGRNIASIHVGLRELLEIYMYP 224
Query: 250 FESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAE 309
FE+ VK+ IM AY+ ++ +P A++ LL+ R WGF G SD + V ++
Sbjct: 225 FEATVKEANLLSIMPAYHDIDNVPCHANKWLLTDILRGSWGFKGIAVSDYEGVKQLHTIH 284
Query: 310 GYAKSPEDAVVDVLKAGMDVNCGS-FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLG 368
A+ +A V +KAG+D+ S K AV++ + E I+RA+ + ++ LG
Sbjct: 285 RVARDCMEAAVKAIKAGVDIEYPSGECFKQLVEAVRKGLIDEDTINRAVERVLKLKFMLG 344
Query: 369 LFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIG 428
LF NP + K+ + + A + LA + A+ IVLLKN G+LPL K ++A+IG
Sbjct: 345 LFE-NPFIDE-TKV-PTTLDNEADRELAREVARKAIVLLKND-GILPL-KRDIKTIAVIG 399
Query: 429 PNANSAKTLLGNY---------------AGPSCRSITPLQALQNYVE---NTVYYPGCDT 470
PNAN +LG+Y PS R +T L+A+++ V +Y GCDT
Sbjct: 400 PNANDPWAMLGDYHYDAHIGSFDGTYGKISPSVRIVTVLEAIKSRVSPSTEVLYAKGCDT 459
Query: 471 VACSSASIDKAVDIAKGADHVVLMMG-------LDQTQEKEELDRVDLVLPGRQQELITR 523
+ + +A++IAK AD ++ +MG L E +DR L LPG Q+EL+
Sbjct: 460 IGDDRSGFGEAIEIAKRADIIIAVMGDRSGLFNLKMFTSGEGVDRASLKLPGVQEELLKE 519
Query: 524 VAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGR 583
+A KP+ILVL+ G P + + + +I+ A PGE G A+A+++FGD++PGGR
Sbjct: 520 LASLG-KPIILVLINGRP--LALSSILPYVNAIVEAWRPGEEGGNAIADILFGDYSPGGR 576
Query: 584 LPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAV 643
LP++ P D ++P+ + N R Y Y K +FPFG GLSY++++Y+
Sbjct: 577 LPVSL-PYDVGQLPIY------YSRKPNCFRDYVEYPAKPLFPFGYGLSYTQFAYE---- 625
Query: 644 SQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLL 703
N + V + D V V++ VKN G MAG V L
Sbjct: 626 -------NLVVESTEVRDPDTV-----------------IRVSVDVKNVGSMAGDEVVQL 661
Query: 704 FVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFL 763
++ RP+ +L GF+ + L EK +VFE+ P E L+ D V+E G +
Sbjct: 662 YISRDYASVTRPVAELKGFKRITLEPGEKKTVVFEI-PLELLAYYDMDMNYVVEPGEYTF 720
Query: 764 VV 765
++
Sbjct: 721 MI 722
>gi|206901280|ref|YP_002250567.1| xylosidase/arabinosidase [Dictyoglomus thermophilum H-6-12]
gi|206740383|gb|ACI19441.1| xylosidase/arabinosidase [Dictyoglomus thermophilum H-6-12]
Length = 762
Score = 295 bits (754), Expect = 9e-77, Method: Compositional matrix adjust.
Identities = 231/757 (30%), Positives = 368/757 (48%), Gaps = 105/757 (13%)
Query: 52 ISQRARDLVSRLTLDEKISQLVNSAPAI----PRLGIPAYEWWSEALHGVAGVGKGIFFN 107
I Q +R R EK +L N RLGIPA E L G
Sbjct: 54 IGQISRVAGERGVSPEKAVELANKIQKFLKEKTRLGIPAI-IHEECLSGFMA-------- 104
Query: 108 GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDP 167
+GAT FPQ I A++F+ L R+ I +A N Q + +P ++I RDP
Sbjct: 105 ---QGATVFPQAIGMASTFEPELIRRVSDVIRQHMKAA-NVHQGL-----SPVLDIPRDP 155
Query: 168 RWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKG 227
RWGR +ET GEDP + + A YV+G+QG+ + G + A KHFTAY +
Sbjct: 156 RWGRTEETFGEDPYLVSRMATEYVKGLQGEDWREGIV-------ATVKHFTAYGISE--- 205
Query: 228 TTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARR 287
R A+V ++L + + PFE +K+G+A +M AY+ ++G+P + + LL+K R
Sbjct: 206 GARNLGPAKVGERELREVFLFPFEVAIKEGQAGSLMNAYHEIDGVPCASSKFLLTKILRW 265
Query: 288 QWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSF--LQKHTKAAVKQ 345
+WGF GY+ SD AV ++ + A+ ++A V L+AG+D+ S + AVK+
Sbjct: 266 EWGFKGYVVSDYIAVRMLENFHKVARDAKEAAVLALEAGIDIELPSVDCYGEPLIQAVKE 325
Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
+ E I+ ++ + + LGLF+ N P K +V P + L+ + A+ IV
Sbjct: 326 GLISEEVINASVERVLRAKFMLGLFDDNLEKDP--KKVYEVFDKPEFRDLSREVARRSIV 383
Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG----PS---------------C 446
LLKN G LPL K+ +A+IGPNA++ + L G+Y+ PS
Sbjct: 384 LLKND-GTLPLSKNLK-KVAVIGPNADNPRNLHGDYSYTAHIPSIAEGLEGVKVEEKCVV 441
Query: 447 RSITPLQALQNYVE---NTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG-----LD 498
R+++ L+ ++N V +Y GCD ++ S +A+++AK AD ++ +MG
Sbjct: 442 RTVSILEGIRNKVSPETEVLYAKGCDIISDSKDGFAEAIEMAKEADVIIAVMGEESGLFH 501
Query: 499 QTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILW 558
+ E DR L L G Q++L+ + + KP++LVL+ G P + + N+ +IL
Sbjct: 502 RGISGEGNDRTTLELFGVQRDLLKELHKLG-KPIVLVLINGRPQALKWEH--ENLNAILE 558
Query: 559 AGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRF 618
A YPGE G A+A+VIFGD+NP G+LP++ +P ++P+ +P A S Y
Sbjct: 559 AWYPGEEGGNAVADVIFGDYNPSGKLPIS-FPAVTGQIPVY-YNRKPSAFS-----DYID 611
Query: 619 YEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFC 678
K ++PFG GLSY+ + Y +S K+ +S + VE
Sbjct: 612 ESAKPLYPFGHGLSYTTFEYSDLKISPEKV-----NSLEKVE------------------ 648
Query: 679 ETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFE 738
++ +KN G G+ V L++ RP+K+L GF+ + L E + F
Sbjct: 649 ------ISFTIKNTGNRDGEEVVQLYIHDQVASLERPVKELKGFKKIYLKPGESKRVTFT 702
Query: 739 LSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIF 775
L P E L+ E ++E+G +++G I +
Sbjct: 703 LYP-EQLAFYDEFMRFIVEKGVFEVMIGSSSEDIRLM 738
>gi|393786770|ref|ZP_10374902.1| hypothetical protein HMPREF1068_01182 [Bacteroides nordii
CL02T12C05]
gi|392658005|gb|EIY51635.1| hypothetical protein HMPREF1068_01182 [Bacteroides nordii
CL02T12C05]
Length = 864
Score = 295 bits (754), Expect = 9e-77, Method: Compositional matrix adjust.
Identities = 174/440 (39%), Positives = 241/440 (54%), Gaps = 41/440 (9%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
P+ L QRA DL+ RLT++EK+S + N++P I RLGI YEWW+EALHGVA G
Sbjct: 28 LPYQDPNLTPEQRATDLLQRLTIEEKVSLMQNNSPGILRLGIKPYEWWNEALHGVARAGL 87
Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA-------LYNAGQAIGMT 155
AT FPQ I AASFD L Y + AI EARA L + G+T
Sbjct: 88 ----------ATVFPQTIGMAASFDDTLIYEVFNAISDEARAKNRHFNTLGQYKRYQGLT 137
Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASACC 214
W PNINIFRDPRWGRGQET GEDP +T + V+ V+G+QG D+ KL AC
Sbjct: 138 MWTPNINIFRDPRWGRGQETYGEDPYLTSRMGVAVVKGLQGPDSARYNKLH------ACA 191
Query: 215 KHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
KHF + W R+ F+A + +DL +TY P F++ V++ +MCAYNR G P
Sbjct: 192 KHFAVHSGPEW---NRHSFNAENIIPRDLWETYLPAFKTLVQEADVKEVMCAYNRFEGDP 248
Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE--DAVVDVLKAGMDVNC 331
C LL++ R +WGF G + SDC A+S + + + P+ A + + G D+ C
Sbjct: 249 CCGSNRLLTQILRNEWGFKGIVVSDCGAISDFWGTKKHNTHPDAAHASAEAVLNGTDLEC 308
Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLG-LFNGNPTMQPFGKIGADVVCSP 390
GS +K T+ A+K + E +I+ ++ L R LG + N +P P+ +V SP
Sbjct: 309 GSNYRKLTE-AIKAGIISEKQINVSVKRLLKARFELGEMENIHPWTLPY-----SIVDSP 362
Query: 391 AHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSIT 450
H+ LAL+ A + + LL+N +LPL K +A+IGPNAN + GNY G + T
Sbjct: 363 KHRCLALKMAHETMTLLQNKGKVLPL--DKQARIAIIGPNANDSVMQWGNYNGTPSHTST 420
Query: 451 PLQALQNY--VENTVYYPGC 468
L A + + + +Y P C
Sbjct: 421 LLSAFRKRLPISHLIYEPVC 440
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 80/291 (27%), Positives = 135/291 (46%), Gaps = 52/291 (17%)
Query: 459 VENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DR 508
V++ Y+ D V +I ++ K D ++ G+ + E EE+ DR
Sbjct: 573 VKDETYF-NFDLVEGLPLNISNTLEKLKDIDIIIFAGGISPSLEGEEMNVSATGFKGGDR 631
Query: 509 VDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAV 568
D+ P Q++++ + EA KK VILV G + +T ++ +IL A YPGE G +
Sbjct: 632 TDIEFPAVQRKVLAALKEAGKK-VILVNFSGSAMALT--PETKSCDAILQAWYPGEEGGM 688
Query: 569 ALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFG 628
A+ V+FGD+NP GRLP+T+Y + M+ GRTYR+ + + +FPFG
Sbjct: 689 AIVNVLFGDYNPAGRLPITFYKSIDQLPDFENYSMK--------GRTYRYMQEEPLFPFG 740
Query: 629 CGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIG 688
GLSY+ +++ K+++N++S K + I
Sbjct: 741 YGLSYTTFAF-------GKIHINKNS----------------------LSAGEKVTLHIP 771
Query: 689 VKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFEL 739
+KN G+ G V ++++ G P+K L F+ V + + E+ EL
Sbjct: 772 IKNIGDRDGVEVVQIYIQRQADKEG-PVKTLRAFKRVEIPKGKTQEVKIEL 821
>gi|302872724|ref|YP_003841360.1| glycoside hydrolase family protein [Caldicellulosiruptor
obsidiansis OB47]
gi|302575583|gb|ADL43374.1| glycoside hydrolase family 3 domain protein [Caldicellulosiruptor
obsidiansis OB47]
Length = 771
Score = 294 bits (752), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 218/696 (31%), Positives = 339/696 (48%), Gaps = 103/696 (14%)
Query: 113 ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRG 172
AT FPQ I A +FD+ L + + IGL+ +A+ A QA+ AP I++ RD RWGR
Sbjct: 103 ATVFPQSIGVACTFDNELVKEMAKVIGLQMKAV-GAHQAL-----APLIDVARDARWGRV 156
Query: 173 QETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD----NWKGT 228
+ET GEDP + AVSYV G+QG F + A KHF Y + NW
Sbjct: 157 EETFGEDPYLVANMAVSYVEGIQGKDFEE-------KVIATGKHFVGYAMSEGGMNWAPV 209
Query: 229 TRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQ 288
+ ++L + Y PFE VK IM AY+ ++GIP A+R LL++ AR +
Sbjct: 210 -------HIPERELREVYLYPFEVAVKVAGLKSIMPAYHEIDGIPCHANRKLLTEIARNE 262
Query: 289 WGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVN---CGSFLQKHTKAAVKQ 345
WGF G SD V I D K+ E+A L AG+D+ F +K + A+K+
Sbjct: 263 WGFDGIFVSDYSGVKNILDYHKSVKTYEEAAYISLWAGLDIELPRIECFTEKFIE-ALKE 321
Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
K + +D A+ + ++ RLGLF+ NP ++ I ++ + + LA + AQ+ IV
Sbjct: 322 GKFDMAVVDAAVKRVLEMKFRLGLFD-NPFVKTENII--ELFDNKEQRDLARKVAQESIV 378
Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE----- 460
LLKN G+LPL + +A+IGPNANS + LLG+Y+ P+ S T + ++ V+
Sbjct: 379 LLKND-GILPLKEKDLKKVAVIGPNANSVRNLLGDYSYPAHISTTEMFFMKEEVDLGDED 437
Query: 461 ------------------------NTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG 496
VY GCD + + ++A A+ AD V++++G
Sbjct: 438 AFVKKVVNIKSVYEVIKERIGEKTEVVYAKGCDVNSHDKSGFEEAKKAAQDADVVIVVVG 497
Query: 497 ------LDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYD 550
LD T E DR L LPG Q+ELI +A+ + ++++L+ G PV
Sbjct: 498 DKAGLKLDCTS-GESRDRASLKLPGVQEELIEEIAKVNQN-IVVILVNGRPV--ALENIW 553
Query: 551 RNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSG 610
+ ++L A +PGE GA A+A+++FG +NPGG+L ++ +P+D +VP+ +P
Sbjct: 554 QKSKAVLEAWFPGEEGAEAIADIVFGKYNPGGKLAIS-FPRDVGQVPVY-YGHKPSGGKS 611
Query: 611 NPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSV 670
Y K PFG GLSY+ + YK + + K+ +++S
Sbjct: 612 CWHGDYVEMSSKPFLPFGYGLSYTTFEYKNLTIEKEKITMDESIK--------------- 656
Query: 671 PELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAK 730
+ + ++N G G V L+ + RP+K+L ++ V LN
Sbjct: 657 --------------IFVEIENTGNCEGDEIVQLYTRKEEFLVTRPVKELKAYKRVHLNPG 702
Query: 731 EKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
EK ++VFE+ P + + D VI GT ++VG
Sbjct: 703 EKKKVVFEIFP-DQFAYYDYDMNRVISPGTVEVMVG 737
>gi|312794525|ref|YP_004027448.1| glycoside hydrolase family 3 domain-containing protein
[Caldicellulosiruptor kristjanssonii 177R1B]
gi|312181665|gb|ADQ41835.1| glycoside hydrolase family 3 domain protein [Caldicellulosiruptor
kristjanssonii 177R1B]
Length = 770
Score = 294 bits (752), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 228/773 (29%), Positives = 365/773 (47%), Gaps = 142/773 (18%)
Query: 50 LPISQRARDLVSRLTLDEKISQLV-----------------------NSAPAIPRLGIPA 86
+ I +R DL+ ++T++EK+ QL N I RL +
Sbjct: 1 MSIEKRVNDLLQKMTIEEKVYQLTSILVQDILENDKFSPQKAKEKIPNGIGQITRLAGAS 60
Query: 87 YEWWSEALHGVAGVGKGIFFNGTI----------------RGATSFPQVILTAASFDSYL 130
EA + K + N + +GAT FPQ I A +FD+ +
Sbjct: 61 NLSPEEAAKTANEIQKFLIENTRLGIPAMIHEESCSGFMAKGATVFPQSIGVACTFDNEI 120
Query: 131 WYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSY 190
+ + I + +A+ A QA+ AP I++ RD RWGR +ET GEDP + AVSY
Sbjct: 121 VEELAKVIRTQMKAV-GAHQAL-----APLIDVARDARWGRVEETFGEDPYLVANMAVSY 174
Query: 191 VRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD----NWKGTTRYKFDARVTMQDLADTY 246
V+G+QGD G + A KHF Y + NW + ++L + Y
Sbjct: 175 VKGLQGDDIKDGIV-------ATGKHFVGYAMSEGGMNWAPV-------HIPERELREVY 220
Query: 247 QPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIY 306
PFE VK IM AY+ ++GIP A+R LL+ AR +WGF G SD V +
Sbjct: 221 LYPFEVAVKVAGLKSIMPAYHEIDGIPCHANRKLLTDIARGEWGFDGIYVSDYSGVKNLL 280
Query: 307 DAEGYAKSPEDAVVDVLKAGMDVNCGS---FLQKHTKAAVKQKKLPESEIDRALHNLFSV 363
D K+ E+A L AG+D+ F ++ K A+K+ K + +D A+ + +
Sbjct: 281 DYHKSVKTYEEAAALSLWAGLDIELPKIECFTEEFIK-ALKEGKFDMALVDAAVKRVLEM 339
Query: 364 RMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVS 423
+ RLGLF+ NP ++ G + ++ + + L+ + AQ+ +VLLKN LPL K
Sbjct: 340 KFRLGLFD-NPYIKTEGVV--ELFDNKEQRQLSRKVAQESMVLLKND-SFLPLSKDLK-K 394
Query: 424 LALIGPNANSAKTLLGNYAGPS----------------------CRSITPLQALQNYVEN 461
+A+IGPNANS + LLG+Y+ P+ +++ ++++ +++
Sbjct: 395 IAVIGPNANSVRNLLGDYSYPAHIATLEMFFIKEDRGVGNEEEFVKNVINMKSIFEAIKD 454
Query: 462 -------TVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG------LDQTQEKEELDR 508
VY GCD + + ++A A+GAD V+L++G LD T E DR
Sbjct: 455 KVSSNTEVVYAKGCDVNSQDKSGFEEAKKAAEGADAVILVVGDKAGLRLDCTS-GESRDR 513
Query: 509 VDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAV 568
L LPG Q++L+ + V+ VL+ G PV + + N+ ++L A +PGE GA
Sbjct: 514 ASLRLPGVQEDLVKEIVSVNPNTVV-VLVNGRPVALDWIM--ENVKAVLEAWFPGEEGAD 570
Query: 569 ALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFG 628
A+A+++FGD+NPGG+L ++ +P+D +VP+ +P Y K + PFG
Sbjct: 571 AVADILFGDYNPGGKLAIS-FPRDVGQVPVY-YGHKPSGGKSCWHGDYVEMSTKPLLPFG 628
Query: 629 CGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIG 688
GLSY+ + YK A+ + K+ +++S V++
Sbjct: 629 YGLSYTTFEYKNFAIEKEKIGMDES-----------------------------IKVSVE 659
Query: 689 VKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSP 741
V+N G+ G V L+ + RP+K+L G++ V L EK ++VFEL P
Sbjct: 660 VENTGKYEGDEIVQLYTRKEEYLVTRPVKELKGYKRVHLKPGEKKKVVFELYP 712
>gi|227828570|ref|YP_002830350.1| glycoside hydrolase [Sulfolobus islandicus M.14.25]
gi|229585800|ref|YP_002844302.1| glycoside hydrolase family protein [Sulfolobus islandicus M.16.27]
gi|227460366|gb|ACP39052.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
M.14.25]
gi|228020850|gb|ACP56257.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
M.16.27]
Length = 755
Score = 294 bits (752), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 213/689 (30%), Positives = 347/689 (50%), Gaps = 97/689 (14%)
Query: 110 IRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRW 169
++ AT+FPQ I A+++D L + I +A+ L Q + +P +++ RDPRW
Sbjct: 98 VKTATAFPQAIGLASTWDPDLIREVSSTIRYQAK-LIGTNQCL-----SPVLDVCRDPRW 151
Query: 170 GRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTT 229
GR +ET GED + ++YV+G+QG+ + A KHF A+ +
Sbjct: 152 GRCEETYGEDQYLVASIGLAYVKGLQGEN----------ELIATVKHFAAH---GFPEGG 198
Query: 230 RYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQW 289
R V ++L + + PFE +K G+A +M AY+ ++GIP ++ LL+K R++W
Sbjct: 199 RNIAPVHVGNRELREVFLFPFEVAIKLGKAMSVMPAYHEIDGIPCHSNAELLTKILRQEW 258
Query: 290 GFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDV---NCGSFLQKHTKAAVKQK 346
GF G + SD DA+ + + + ++A + L+AG+D N F + +A VK+
Sbjct: 259 GFEGIVVSDYDAIRQLEAIHKVSLNKKEAAILALEAGVDTEFPNIDCFGEPLLEA-VKEG 317
Query: 347 KLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVL 406
+ ES IDRA+ + ++ +LGLFN + + + + S + + LAL A+ IVL
Sbjct: 318 LISESIIDRAVERVLRIKEKLGLFNNHYINE--NNVPEKLDNSKSRE-LALDVARKSIVL 374
Query: 407 LKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNY-------AGPSCRSITPLQALQNYV 459
LKN + +LPL K+ ++A+IGPNAN + LLG+Y A +T L+ + V
Sbjct: 375 LKNDN-ILPLNKNIG-TIAVIGPNANEPRNLLGDYTYTGHLNADGGIEVVTVLEGIMRKV 432
Query: 460 ENT---VYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLD------------------ 498
N +Y GCD A S +A++IAK D ++ +MG
Sbjct: 433 SNNTNVLYAKGCDIAAESKEGFSEAIEIAKKGDIIIAVMGEKSGLPLSWTDVPGKDEFEK 492
Query: 499 -QTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSIL 557
Q E DR L LPG Q+EL+ + + K P+ILVL+ G P + + + +I+
Sbjct: 493 YQAVTGEGNDRTSLRLPGVQEELLKELHKTGK-PIILVLVNGRP--LALSSIFNEVNAII 549
Query: 558 WAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYR 617
A +PGE G A+A+VIFGD+NP GRLP++ +P D ++P+ +P + R Y
Sbjct: 550 DAWFPGEEGGNAIADVIFGDYNPSGRLPIS-FPIDTGQIPIY-YNRKPSSL-----RPYV 602
Query: 618 FYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEF 677
+ K +FPFG GLSY+++ Y V+ ++ N S K
Sbjct: 603 MMKSKPLFPFGYGLSYTEFKYSNLEVTPKEV--NSSGKIK-------------------- 640
Query: 678 CETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVF 737
+++ V+N G+ G+ V L++ G RPIK+L GF V L EK +I F
Sbjct: 641 -------ISLEVENVGKREGEETVQLYISKQYSGVSRPIKELKGFAKVYLKPNEKRKITF 693
Query: 738 ELSPCESLSRAREDGLMVIEEGTHFLVVG 766
L P E+L+ + ++I+ G + +++G
Sbjct: 694 SL-PLEALAFYDQYMRLIIDTGDYEILIG 721
>gi|393786911|ref|ZP_10375043.1| hypothetical protein HMPREF1068_01323 [Bacteroides nordii
CL02T12C05]
gi|392658146|gb|EIY51776.1| hypothetical protein HMPREF1068_01323 [Bacteroides nordii
CL02T12C05]
Length = 863
Score = 293 bits (751), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 168/440 (38%), Positives = 242/440 (55%), Gaps = 35/440 (7%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
PF LP+ +R DLV RLTL EK+ + + + ++PRLGI Y WW+EALHGV G
Sbjct: 22 LPFNNPDLPVEERVEDLVRRLTLHEKVLLMCDYSSSVPRLGIKQYNWWNEALHGVGRAGL 81
Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-------GMT 155
AT FPQ I AA+FD ++ + + EARA Y+ + G+T
Sbjct: 82 ----------ATVFPQAIGMAATFDDCAVKQVFECVSDEARAKYHHSENKDGSERYRGLT 131
Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCK 215
FW PN+NIFRDPRWGRGQET GEDP +T + ++ VRG+QG + + + AC K
Sbjct: 132 FWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGLAVVRGLQGPSESKYD-----KLHACAK 186
Query: 216 HFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPS 274
H+ + W R++FD ++ +DL +TY P F++ V+QG +MCAYNR G P
Sbjct: 187 HYALHSGPEW---NRHRFDVENISPRDLWETYLPAFKALVQQGGVKEVMCAYNRFEGEPC 243
Query: 275 CADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAK---SPEDAVVDVLKAGMDVNC 331
C LL R +WGF G + SDC A+S Y +G+ + + E AV +KAG D++C
Sbjct: 244 CGSNRLLYNILREEWGFDGLVVSDCGAISDFY-LKGHHETHSTKESAVAAAVKAGTDLDC 302
Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
G Q K AV++ + E +ID +L L R LGL + + + I VV S
Sbjct: 303 GVDYQSLEK-AVEKGIITEKQIDVSLSRLLKARFELGLMD-EEHLVSWSDIPYTVVDSEK 360
Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITP 451
H+ AL+ A+ + LLKN +G LPL K + +IGPNAN + + GNY G ++T
Sbjct: 361 HRAKALEVARKSMTLLKNKNGTLPLSKHCG-KIVVIGPNANDSIMMWGNYNGFPSHTVTI 419
Query: 452 LQALQNYVE--NTVYYPGCD 469
L+ + + ++ +Y GC+
Sbjct: 420 LEGITHKLDAGQVIYDKGCE 439
Score = 111 bits (278), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 76/262 (29%), Positives = 131/262 (50%), Gaps = 51/262 (19%)
Query: 488 ADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLL 537
A+ +V + G+ E EEL DR + LP Q++L+ + + K P+IL+L
Sbjct: 600 AEAIVFVGGISPKVEGEELPVSFPGFKGGDRTVIELPQVQRDLLQELYKTGK-PIILILC 658
Query: 538 CGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVP 597
G + ++ A+ D +I+ A YPG+AG A+A+V+FGD+NP GRLP+T+Y
Sbjct: 659 SGSAIGLS-AEVDL-ADAIIQAWYPGQAGGTAVADVLFGDYNPAGRLPVTFYKTTEQLPD 716
Query: 598 MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTK 657
D M+ GRTYR+++G+ +FPFG GLSY+ + +S+ +++ N+S +
Sbjct: 717 FEDYNMQ--------GRTYRYFKGEALFPFGYGLSYTSFEIGKAQLSKKRIHANESVN-- 766
Query: 658 MVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIK 717
+ + +KN GE G+ + ++++ + G P+K
Sbjct: 767 ---------------------------LDLWIKNTGERDGEEVIQVYIRKLKDKEG-PLK 798
Query: 718 QLVGFQSVILNAKEKAEIVFEL 739
L F+ V + + EK +I L
Sbjct: 799 TLRAFKRVHVKSGEKKQISIHL 820
>gi|238620766|ref|YP_002915592.1| glycoside hydrolase family protein [Sulfolobus islandicus M.16.4]
gi|238381836|gb|ACR42924.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
M.16.4]
Length = 755
Score = 293 bits (751), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 213/689 (30%), Positives = 347/689 (50%), Gaps = 97/689 (14%)
Query: 110 IRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRW 169
++ AT+FPQ I A+++D L + I +A+ L Q + +P +++ RDPRW
Sbjct: 98 VKTATAFPQAIGLASTWDPDLIREVSSTIRYQAK-LIGTNQCL-----SPVLDVCRDPRW 151
Query: 170 GRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTT 229
GR +ET GED + ++YV+G+QG+ + A KHF A+ +
Sbjct: 152 GRCEETYGEDQYLVASIGLAYVKGLQGEN----------ELIATVKHFAAH---GFPEGG 198
Query: 230 RYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQW 289
R V ++L + + PFE +K G+A +M AY+ ++GIP ++ LL+K R++W
Sbjct: 199 RNIAPVHVGNRELREVFLFPFEVAIKLGKAMSVMPAYHEIDGIPCHSNAELLTKILRQEW 258
Query: 290 GFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDV---NCGSFLQKHTKAAVKQK 346
GF G + SD DA+ + + + ++A + L+AG+D N F + +A VK+
Sbjct: 259 GFEGIVVSDYDAIRQLEAIHKVSLNKKEAAILALEAGVDTEFPNIDCFGEPLLEA-VKEG 317
Query: 347 KLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVL 406
+ ES IDRA+ + ++ +LGLFN + + + + S + + LAL A+ IVL
Sbjct: 318 LISESIIDRAVERVLRIKEKLGLFNDHYINE--NNVPEKLDNSKSRE-LALDVARKSIVL 374
Query: 407 LKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNY-------AGPSCRSITPLQALQNYV 459
LKN + +LPL K+ ++A+IGPNAN + LLG+Y A +T L+ + V
Sbjct: 375 LKNDN-ILPLNKNIG-TIAVIGPNANEPRNLLGDYTYTGHLNADVGIEVVTVLEGIMRKV 432
Query: 460 ENT---VYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLD------------------ 498
N +Y GCD A S +A++IAK D ++ +MG
Sbjct: 433 SNNTNVLYAKGCDIAAESKEGFSEAIEIAKKGDIIIAVMGEKSGLPLSWTDVPGKDEFEK 492
Query: 499 -QTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSIL 557
Q E DR L LPG Q+EL+ + + K P+ILVL+ G P + + + +I+
Sbjct: 493 YQAVTGEGNDRTSLRLPGVQEELLKELHKTGK-PIILVLVNGRP--LALSSIFNEVNAII 549
Query: 558 WAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYR 617
A +PGE G A+A+VIFGD+NP GRLP++ +P D ++P+ +P + R Y
Sbjct: 550 DAWFPGEEGGNAIADVIFGDYNPSGRLPIS-FPIDTGQIPIY-YNRKPSSL-----RPYV 602
Query: 618 FYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEF 677
+ K +FPFG GLSY+++ Y V+ ++ N S K
Sbjct: 603 MMKSKPLFPFGYGLSYTEFKYSNLEVTPKEV--NSSGKIK-------------------- 640
Query: 678 CETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVF 737
+++ V+N G+ G+ V L++ G RPIK+L GF V L EK +I F
Sbjct: 641 -------ISLEVENVGKREGEETVQLYISKQYSGVSRPIKELKGFAKVYLKPNEKRKITF 693
Query: 738 ELSPCESLSRAREDGLMVIEEGTHFLVVG 766
L P E+L+ + ++I+ G + +++G
Sbjct: 694 SL-PLEALAFYDQYMRLIIDTGDYEILIG 721
>gi|170731072|ref|YP_001776505.1| beta-glucosidase [Xylella fastidiosa M12]
gi|167965865|gb|ACA12875.1| Beta-glucosidase [Xylella fastidiosa M12]
Length = 882
Score = 293 bits (750), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 175/439 (39%), Positives = 245/439 (55%), Gaps = 39/439 (8%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
Q A LV+++T EKI+Q +N+APAIPRLGIPAY+WWSE LHG+A G A
Sbjct: 32 QHAAALVAQMTRQEKIAQTMNAAPAIPRLGIPAYDWWSEGLHGIARNGY----------A 81
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGMTFWAPNINIFR 165
T FPQ I AAS+++ L +G EARA +N + G+T W+PNINIFR
Sbjct: 82 TVFPQAIGLAASWNTDLLQHVGTVTSTEARAKFNLTGGPGKDHPRYAGLTLWSPNINIFR 141
Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
DPRWGRG ET GEDP +T + AVS++RG+QG+ + + A KHF + +
Sbjct: 142 DPRWGRGMETYGEDPYLTSQLAVSFIRGLQGN------IPDHPRTIATPKHFAVH---SG 192
Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
R+ FD V+ DL TY P F + + G A +MCAYN ++G P+CA LL+
Sbjct: 193 PEPGRHSFDVDVSAYDLEATYTPAFRAAIVDGHAGSVMCAYNALHGTPACASDWLLNTRL 252
Query: 286 RRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQ 345
R WGF+G++ SDCDA+ + + + A LK+G D+NCG+ + + A+ +
Sbjct: 253 RNDWGFNGFVVSDCDAIEDMTRFHFFRQDNASASAAALKSGDDLNCGNTYRDLNQ-AIAR 311
Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
+ ES +D+AL LF+ R RLG P+ IG + +PAH+ LALQAA +V
Sbjct: 312 GDIDESTLDQALIRLFTARQRLGTLQPR-EHDPYAAIGIKHIDTPAHRALALQAAAQSLV 370
Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQ--------N 457
LLKNS LPL + +LA++GP+A+S L NY G S +TPL L+ +
Sbjct: 371 LLKNSGNTLPL--TPGTTLAVLGPDADSLTALEANYQGTSSTPVTPLIGLRTRFGTAKVH 428
Query: 458 YVENTVYYPGCDTVACSSA 476
Y + PG + +A
Sbjct: 429 YAQGASLAPGVPSTITETA 447
Score = 132 bits (333), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 99/312 (31%), Positives = 149/312 (47%), Gaps = 64/312 (20%)
Query: 471 VACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQEL 520
+A + + +A AD +V +GL E EEL DR + LP Q+ L
Sbjct: 594 LAPEAPQLAEAERAVAHADAIVAFVGLSPEVEGEELHIDTPGFSGGDRTTIDLPATQETL 653
Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
+ V + KP+I+VL+ G V + +A++ + +IL A YPG++G A+A+ + GD NP
Sbjct: 654 LQHV-KTTGKPLIVVLMSGSAVALNWAQHHAD--AILAAWYPGQSGGTAIAQALAGDVNP 710
Query: 581 GGRLPMTWY--PQD---YIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSK 635
GGRLPMT+Y QD YI MT GRTYR+++G+ ++PFG GLSY++
Sbjct: 711 GGRLPMTFYRSTQDLPPYISYDMT-------------GRTYRYFKGQPLYPFGYGLSYTQ 757
Query: 636 YSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFL-VTIGVKNHGE 694
++Y+ P+L T + L VT V+N G
Sbjct: 758 FAYE------------------------------APQLSTATLKAGDTLTVTAHVRNTGT 787
Query: 695 MAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLM 754
AG V L+++P P++ LVGF+ V L E + F L LS ++ G
Sbjct: 788 RAGDEVVQLYLEPPHSPQA-PLRNLVGFKRVTLRPGESRLLTFTLD-ARQLSSVQQTGQR 845
Query: 755 VIEEGTHFLVVG 766
+E G + L VG
Sbjct: 846 SVEAGHYHLFVG 857
>gi|334144838|ref|YP_004538047.1| beta-glucosidase [Novosphingobium sp. PP1Y]
gi|333936721|emb|CCA90080.1| beta-glucosidase [Novosphingobium sp. PP1Y]
Length = 889
Score = 292 bits (747), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 168/411 (40%), Positives = 235/411 (57%), Gaps = 41/411 (9%)
Query: 51 PISQRAR----DLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFF 106
P+ +AR DLV+++TLDEK+ QL+N+APAIPRL IPAY WW+E+LHG G
Sbjct: 25 PVRAKARAMAADLVAKMTLDEKLGQLLNTAPAIPRLDIPAYNWWTESLHGALGS------ 78
Query: 107 NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ--------AIGMTFWA 158
T+FP+ I AA+FD+ L + AI E R L+ + G+ W+
Sbjct: 79 ----LPTTNFPEPIGLAATFDASLVKDVAGAISTEVRGLHALARKTGRMGRIGTGLDTWS 134
Query: 159 PNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFT 218
PNINIFRDPRWGRGQET GEDP +T + VS+V G+QG + A KHF
Sbjct: 135 PNINIFRDPRWGRGQETYGEDPYLTARMGVSFVEGMQGPDPD------LPDVIATPKHFA 188
Query: 219 AYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADR 278
+ N +TR+ + V+ DL DTY P F + + +GRA +MCAYNRV+G P+CA +
Sbjct: 189 VH---NGPESTRHHANVFVSRHDLEDTYLPAFRAAIVEGRAGSVMCAYNRVDGQPACASQ 245
Query: 279 NLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSF---- 334
LL + WGF GY+ SDCDAV I D YA AV ++ G+D C ++
Sbjct: 246 ELLQEHLVDAWGFQGYVVSDCDAVKDISDNHKYAPDGAAAVAAAMRMGVDSECHTWTLSD 305
Query: 335 ---LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
L + A+++ + S++DR L LFS R+R G G + F AD V +PA
Sbjct: 306 TDGLTDRYREALERGLITVSDVDRTLIRLFSARLRNGDLPGVRKLSTFTSSAAD-VGTPA 364
Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA 442
H LAL+AA++ +VLLKN G+LP ++ + +A+IGP ++ + L GNY+
Sbjct: 365 HGALALKAAEESLVLLKND-GILPF-QTAGMKVAVIGPFGDATRVLRGNYS 413
Score = 127 bits (318), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 103/308 (33%), Positives = 146/308 (47%), Gaps = 55/308 (17%)
Query: 472 ACSSASIDKAVDI---AKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQ 518
A S D A D+ A+ AD +V ++GL E EE D+ L +P QQ
Sbjct: 599 AWKRISTDPAGDMRRAAQAADVLVAVVGLTSDLEAEESPIEIPGFKGGDKTTLDIPADQQ 658
Query: 519 ELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDH 578
EL+ + A+A KP+I+V + G P+++ +AK N +IL A YPG++G +A+A V+ G
Sbjct: 659 ELLEQ-AKATGKPLIVVAMNGSPINLHWAK--ENADAILEAWYPGQSGGLAIANVLTGKA 715
Query: 579 NPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSY 638
NP G+LP+T+Y P D M+ GRTYR++ GK V+PFG GLSY+ + Y
Sbjct: 716 NPTGKLPLTFYRSVEDLPPFDDYDMK--------GRTYRYFTGKAVYPFGYGLSYTTFGY 767
Query: 639 KFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGK 698
AV P G R VT V N G+ AG
Sbjct: 768 GPVAVE--------------------------PASGGAQDGIR---VTTQVSNTGQRAGG 798
Query: 699 HPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEE 758
V L++ G P L GFQ V L E ++ F LSP + LS DG+ + +
Sbjct: 799 DAVQLYLD-FPDAPGTPNIALRGFQKVSLQPGETRQVTFTLSPRD-LSSVTPDGVRKVLK 856
Query: 759 GTHFLVVG 766
G + + VG
Sbjct: 857 GHYRVTVG 864
>gi|229580225|ref|YP_002838625.1| glycoside hydrolase family protein [Sulfolobus islandicus
Y.G.57.14]
gi|229581131|ref|YP_002839530.1| glycoside hydrolase family protein [Sulfolobus islandicus
Y.N.15.51]
gi|228010941|gb|ACP46703.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
Y.G.57.14]
gi|228011847|gb|ACP47608.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
Y.N.15.51]
Length = 754
Score = 291 bits (745), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 227/755 (30%), Positives = 364/755 (48%), Gaps = 114/755 (15%)
Query: 50 LPISQRARDLVSRLTLDEK-ISQLVNSAPAI----PRLGIPAYEWWSEALHGVAGVGKGI 104
L I Q R SRL L K +++LVN ++ IPA E L G+ G
Sbjct: 45 LGIGQITRVAGSRLGLKPKEVAKLVNKIQKFLVENTKMKIPAI-IHEECLSGLMGYS--- 100
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIF 164
+T+FPQ I A++++ L I I + R L Q + +P +++
Sbjct: 101 --------STAFPQAIGLASTWNPELVMDIASVIRSQGR-LVGVNQCL-----SPVLDVC 146
Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
+DPRWGR +ET GEDP + ++Y+ G+QGD Q A KHF A+
Sbjct: 147 KDPRWGRCEETYGEDPYLVASMGLAYITGLQGDN----------QLVATAKHFAAH---G 193
Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
+ R V ++L +T+ PFE VK G+ IM AY+ ++GIP + LL+
Sbjct: 194 FPEGGRNIAQVHVGNRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGIPCHGNPQLLTNI 253
Query: 285 ARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSF--LQKHTKAA 342
R++WGF G + SD D + + A + +A + L++G+D+ + + A
Sbjct: 254 LRQEWGFDGIVVSDYDGIRQLETIHRVASNKMEAAILALESGVDIEFPTIDCYGEPLVNA 313
Query: 343 VKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGA--DVVCSPAHQVLALQAA 400
+K+ +PES IDRA+ + ++ RLGL + PF + + + + LAL+ A
Sbjct: 314 LKEGLVPESLIDRAVERVLRIKDRLGLLD-----NPFVNENSVPEKLDDHKSRELALKTA 368
Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA-------GPSCRSITPLQ 453
++ IVLLKN + +LPL K+ + +A+IGPNAN + +LG+Y +T LQ
Sbjct: 369 RESIVLLKNENNILPLSKNVN-KIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQ 427
Query: 454 ALQNYV--ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG---------LDQTQE 502
+ V +Y GCD + S +A++IA+ AD ++ +MG +D E
Sbjct: 428 GIVKKVGESKVLYAKGCDIASESKEGFAEAIEIARQADVIIAIMGEKSGLPLSWMDIPSE 487
Query: 503 K----------EELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRN 552
+ E DR L LPG Q+EL+ + + KP+ILVL+ G P + +
Sbjct: 488 EEFKKYQAVTGEGNDRSSLRLPGVQEELLKELYKTG-KPIILVLINGRP--LVLSSIINY 544
Query: 553 IGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNP 612
+ +++ A +PGE G A+A+VIFGD+NPGGRLP+T +P D ++P+ +P +
Sbjct: 545 VKAVIEAWFPGEEGGNAIADVIFGDYNPGGRLPIT-FPMDTGQIPLY-YNRKPSSF---- 598
Query: 613 GRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPE 672
R Y +F FG GLSY+++ Y V+ ++ N + +
Sbjct: 599 -RPYVMLRSSPLFTFGYGLSYTQFEYSNLEVTPKEIGPNSNIA----------------- 640
Query: 673 LGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEK 732
++I VKN G+M G V L+V RP+K+L GF + L EK
Sbjct: 641 ------------ISIDVKNVGKMEGDDVVQLYVSKTFSSVARPVKELKGFAKIHLKPGEK 688
Query: 733 AEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGD 767
+ F L P E+L+ +V+E+G + L++G+
Sbjct: 689 RRVKFIL-PTEALAFYDSFMRLVVEKGEYQLLIGN 722
>gi|288927072|ref|ZP_06420962.1| beta-glucosidase [Prevotella buccae D17]
gi|288336152|gb|EFC74543.1| beta-glucosidase [Prevotella buccae D17]
Length = 866
Score = 291 bits (744), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 171/454 (37%), Positives = 248/454 (54%), Gaps = 36/454 (7%)
Query: 31 FSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWW 90
FSC + +P+ L +RA DL SRLTL+EK + NS+PAIPRLGIP +EWW
Sbjct: 14 FSC---VAGAQQYPYQNPRLSSQERAEDLCSRLTLEEKTKLMRNSSPAIPRLGIPQFEWW 70
Query: 91 SEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ 150
SEALHG+A G AT FPQ AAS+D L Y + A EA A N +
Sbjct: 71 SEALHGIARNGF----------ATVFPQTTAMAASWDDELLYHVFCAASDEAVAKNNLAR 120
Query: 151 AI-------GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGG- 202
G++ W PNINIFRDPRWGRGQET GEDP +T + ++ V G+QG F
Sbjct: 121 KSGDIKRYQGVSIWTPNINIFRDPRWGRGQETYGEDPYLTSRMGLAVVNGLQGQPFRRDM 180
Query: 203 ----KLKGKLQASACCKHFTAYDLDNWKGTTRYKFDA-RVTMQDLADTYQPPFESCVKQG 257
+ + AC KH+ + W R+ FD R+ +DL +TY P F+S V++G
Sbjct: 181 RPFTERPRYYKTLACAKHYAVHSGPEW---NRHVFDVERLPERDLWETYLPAFKSLVQEG 237
Query: 258 RASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGY---AKS 314
+MCAY R++G P C + L + R +W ++G + SDC A+S Y EG+ ++
Sbjct: 238 NVREVMCAYQRIDGSPCCGNTRYLHQILRGEWEYNGLVVSDCGAISDFY-REGHHHVVET 296
Query: 315 PEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNP 374
P +A ++AG DV CG+ + AV+Q + ID ++ L R +G F+ +
Sbjct: 297 PAEASAMGVRAGTDVECGAVYATLPR-AVEQGLISREAIDTSVVRLLKARFEVGDFD-SE 354
Query: 375 TMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSA 434
+ P+ G +V+ S H+ LAL A++ + LL+N + LLPL K+ + +A++GPNAN +
Sbjct: 355 KLVPWKLTGPEVIASETHRRLALDMARESMTLLQNRNRLLPLSKN-GLRIAVMGPNANDS 413
Query: 435 KTLLGNYAGPSCRSITPLQALQNYVENTVYYPGC 468
L GNY G + T L+ +++ V + GC
Sbjct: 414 VMLWGNYTGYPISTTTILKGIRSKVPAARFVEGC 447
Score = 108 bits (269), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 89/298 (29%), Positives = 129/298 (43%), Gaps = 71/298 (23%)
Query: 485 AKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVIL 534
A AD VV + G+ E EE+ DR + LP Q+E+I R+ A K V+
Sbjct: 608 AGDADVVVFVGGISPRLEGEEMKVDAPGFKGGDRTSIELPEAQREVI-RLLRQAGKLVVF 666
Query: 535 VLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI 594
V GG V ++L A Y GEAG A+A+V+FGD+NP G+LP+T+Y D
Sbjct: 667 VNCSGGAV--ALVPETEACDAVLQAWYAGEAGGQAVADVLFGDYNPSGKLPVTFYKSDAD 724
Query: 595 KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSS 654
D +M GRTYR++ G +FPFG GLSY+ +++ KLY+
Sbjct: 725 LPDFLDYRM--------TGRTYRYFRGIPLFPFGFGLSYTSFAFGKPRYENGKLYVE--- 773
Query: 655 STKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGR 714
V N G+ G V ++VK +G
Sbjct: 774 ----------------------------------VTNTGKRDGAEVVQVYVKNPADADG- 798
Query: 715 PIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDG------LMVIEEGTHFLVVG 766
P+K L GF + L A E+ + ++ R R +G M ++ G H L+VG
Sbjct: 799 PVKTLRGFARIDLKAGERRRVEI------AMPRERFEGWDATTNTMRVKPGNHLLMVG 850
>gi|294146655|ref|YP_003559321.1| putative beta-glucosidase [Sphingobium japonicum UT26S]
gi|292677072|dbj|BAI98589.1| putative beta-glucosidase [Sphingobium japonicum UT26S]
Length = 874
Score = 291 bits (744), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 179/439 (40%), Positives = 241/439 (54%), Gaps = 42/439 (9%)
Query: 55 RARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGAT 114
+AR +V RLTLDEK++QL+N APAIPRLGIPAY WW+E+LHG G T
Sbjct: 36 KARAIVDRLTLDEKVAQLLNVAPAIPRLGIPAYNWWTESLHGAIGAVP----------TT 85
Query: 115 SFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ--------AIGMTFWAPNINIFRD 166
+FP+ I AA+FD+ L ++ AIG E RAL+ G+ G+ W+PNINIFRD
Sbjct: 86 NFPEPIGLAATFDAPLIQQVAAAIGTEVRALHTLGRRTGHLGRIGTGLDTWSPNINIFRD 145
Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
PRWGRGQET GEDP +T + V++V+G+QG N L A KHF + +
Sbjct: 146 PRWGRGQETYGEDPYLTARIGVAFVQGMQG---NDPDLP---LVVATPKHFAVH---SGP 196
Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
+R+ + T +DL DTY P F + + +G+A IMCAYNRV+G P+C LL R
Sbjct: 197 EPSRHTDNIFATPRDLEDTYLPAFRAAIVEGKAGSIMCAYNRVDGQPACGSHMLLKDYLR 256
Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSF-------LQKHT 339
WGF GY+ SDCDAV IY+ YA V L+ G+D C + L
Sbjct: 257 GAWGFKGYVVSDCDAVVDIYEHHKYAPDAATGVAVALRHGVDSECNNATLGGRPDLGDRY 316
Query: 340 KAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQA 399
K ++ + + +ID AL LFS R R G G + +P I + +P HQ LAL
Sbjct: 317 KESLARGHIAMGDIDAALVRLFSARYRNGDLPGLSSRKP-NAIPPSAIGTPDHQALALST 375
Query: 400 AQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV 459
A+ +VLLKN G+LPL ++LA GP A++ + L GNY+ S +S P+ +
Sbjct: 376 AEKSLVLLKND-GILPLRPGARIALA--GPLADATRVLRGNYS--SAKSAPPISVAEGLK 430
Query: 460 ENTVYYPGCDTVACSSASI 478
+ V G TV S SI
Sbjct: 431 Q--VMGAGAVTVIPFSPSI 447
Score = 108 bits (271), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 91/289 (31%), Positives = 126/289 (43%), Gaps = 61/289 (21%)
Query: 496 GLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
GL E EE+ DR L LP Q+ + + A A KP+++V + G +D++
Sbjct: 611 GLTSDLEGEEMPVKVEGFDGGDRTTLDLPADQRAFLEK-ARALGKPLVIVAMNGSAIDLS 669
Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRP 605
+AK N +I+ A YPG++G +A+ V+ G +PGGRLP+T+Y P TD M
Sbjct: 670 WAK--ENAAAIVEAWYPGQSGGLAVGNVLSGRADPGGRLPVTFYRNVNDLPPFTDYGME- 726
Query: 606 QATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVV 665
GRTYR++ G V+PFG GLSY+ + Y + VE D
Sbjct: 727 -------GRTYRYFRGTPVYPFGHGLSYTSFRY----------------APLTVEPVDGS 763
Query: 666 HYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSV 725
K + V + N G AG L++ P R G P L GFQ V
Sbjct: 764 VEKGLK-------------VRTAITNTGARAGDDVAQLYITPPRF-EGAPRLALRGFQRV 809
Query: 726 ILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
L E + F LSP D V G L+ GD Y +SI
Sbjct: 810 TLKPGETRNVEFTLSP--------RDLSFVTMAGERGLIPGD--YGLSI 848
>gi|325103214|ref|YP_004272868.1| glycoside hydrolase family protein [Pedobacter saltans DSM 12145]
gi|324972062|gb|ADY51046.1| glycoside hydrolase family 3 domain protein [Pedobacter saltans DSM
12145]
Length = 866
Score = 291 bits (744), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 168/442 (38%), Positives = 242/442 (54%), Gaps = 34/442 (7%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
+PF LP +R DL+ RLT++EK+ + + + I RLGI Y WW+EALHGVA G
Sbjct: 24 YPFQDNRLPFDKRVDDLLQRLTVEEKVLLMQDVSRPIERLGIKQYNWWNEALHGVARAGL 83
Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMT 155
AT FPQ I AASFD + + A+ EARA +N G+ G+T
Sbjct: 84 ----------ATVFPQPIGMAASFDRDALFNVFNAVSDEARAKHNYHLSQGSYGRYEGLT 133
Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCK 215
W P INIFRDPRWGRG ET GEDP +T V V+G+QG + GK + AC K
Sbjct: 134 MWTPTINIFRDPRWGRGIETYGEDPYLTAVMGVQAVKGLQGPS--NGKYD---KLHACAK 188
Query: 216 HFTAYDLDNWKGTTRYKFDA-RVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPS 274
HF + W R+ FDA + +DL +TY P FE+ VK+ + +MCAYNR G P
Sbjct: 189 HFAVHSGPEW---NRHSFDAANIKQRDLYETYLPAFEALVKEAKVQEVMCAYNRFEGDPC 245
Query: 275 CADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE--DAVVDVLKAGMDVNCG 332
C LL + R++WGF G + +DC A++ + + P+ A + +G D++CG
Sbjct: 246 CGSDRLLQQILRKKWGFEGIVVADCGAIADFFKENAHKTHPDAASASAAAVYSGTDLDCG 305
Query: 333 SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAH 392
S + T+ AVK+ + E +ID ++ L R RLG + + ++ P+ KI +VV S AH
Sbjct: 306 SSYKALTE-AVKKGLIEEKDIDVSVRRLLMARFRLGEMD-DQSLVPWSKISYNVVASKAH 363
Query: 393 QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
+AL A+ I LL+N + +LPL KS + +A++GPNA + GNY G +IT L
Sbjct: 364 NQIALDMARKSITLLQNKNNILPL-KSGGLKIAVMGPNAQDSVMQWGNYNGTPANTITIL 422
Query: 453 QALQNYV---ENTVYYPGCDTV 471
+ ++ + + +Y C V
Sbjct: 423 EGIKAALGPKDKLIYEQACGLV 444
Score = 127 bits (320), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 92/315 (29%), Positives = 159/315 (50%), Gaps = 55/315 (17%)
Query: 476 ASIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVA 525
A+I+K++ GAD VV + G+ + E EE+ DR D+ LP Q++ + +
Sbjct: 591 ANINKSIKNIAGADLVVFVGGISPSLEGEEMGVKLPGFRGGDRTDIQLPTIQRQFVKALK 650
Query: 526 EAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLP 585
EA K+ VI + G P+ + A N +I+ A YPG+AG A+A+V+FG +NP GRLP
Sbjct: 651 EAGKR-VIFINCSGSPIGL--ADEMANSEAIVQAWYPGQAGGQAVADVLFGKYNPSGRLP 707
Query: 586 MTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQ 645
+T+Y +D ++P D + A GRTYR+ + K +FPFG GLSY+++ Y ++Q
Sbjct: 708 ITFY-RDTTQLP--DFENYDMA-----GRTYRYMQDKPLFPFGYGLSYTQFQYGNPILNQ 759
Query: 646 NKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFV 705
+++ N + +T+ V N G+ +G V +++
Sbjct: 760 -----------QVITNGQTIQ------------------LTVPVTNTGKRSGDEVVQVYL 790
Query: 706 KPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVV 765
+ G P+K L F+ + NA + ++VF+++P + + M ++ G + L+V
Sbjct: 791 RKKGDATG-PVKTLRDFRRLSFNAGQTQQVVFKITPKQLEWWNEQSKAMQVQSGDYELLV 849
Query: 766 G----DEEYPISIFV 776
G DE+ + F
Sbjct: 850 GKSSADEDLKMVTFT 864
>gi|315499711|ref|YP_004088514.1| beta-glucosidase [Asticcacaulis excentricus CB 48]
gi|315417723|gb|ADU14363.1| Beta-glucosidase [Asticcacaulis excentricus CB 48]
Length = 869
Score = 290 bits (743), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 164/404 (40%), Positives = 241/404 (59%), Gaps = 33/404 (8%)
Query: 60 VSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQV 119
++R+T+++K +Q+ N AP +P G+ AYEWW+E LHGVA G+ AT FPQ
Sbjct: 40 IARMTVEQKAAQMQNRAPDLPSAGLTAYEWWNEGLHGVARAGE----------ATVFPQA 89
Query: 120 ILTAASFDSYLWYRIGQAIGLEARALYNAG-------QAIGMTFWAPNINIFRDPRWGRG 172
I AA+++ L ++G + EARA +N+ + G+T W+PNINIFRDPRWGRG
Sbjct: 90 IGLAATWNPALLKQVGDVVSTEARAKFNSTDPAGDHQRYYGLTLWSPNINIFRDPRWGRG 149
Query: 173 QETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYK 232
QET GEDP +T + A +V G+QG K+ A KH + + R+
Sbjct: 150 QETYGEDPFLTSRLAEGFVTGLQGPDPQHPKV------VASVKHLAVH---SGPEAGRHG 200
Query: 233 FDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFH 292
F A V+ DL TY P F V +A +MCAYN V G+P+CA LL R WGF
Sbjct: 201 FAASVSPYDLEMTYLPAFRYSVMTTKAQSVMCAYNAVGGVPACASDLLLKTYVREAWGFK 260
Query: 293 GYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESE 352
GY+ +DCDA+ + Y + ++ + LKAG+D+NCG+ + AV++ +PES
Sbjct: 261 GYVVTDCDAIYDMTRFHFYRLNDAESSAESLKAGVDLNCGNAYAALPE-AVQKGLIPESL 319
Query: 353 IDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHG 412
+D++L+ L VR RLG+ +G P+ P+ +I + + +P Q LALQAA+ +VLLKN +G
Sbjct: 320 MDQSLNRLLDVRKRLGI-DGAPS--PWARISPEAINTPQAQGLALQAAEQSLVLLKN-NG 375
Query: 413 LLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQ 456
+LPL ++V A+IGPNA++ +TL GNY G + + +TPL L+
Sbjct: 376 VLPLKPGQTV--AVIGPNADTEETLRGNYNGIARQPVTPLTGLR 417
Score = 117 bits (294), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 102/311 (32%), Positives = 145/311 (46%), Gaps = 63/311 (20%)
Query: 471 VACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQEL 520
+A + ID+AV A+ AD +V +GL E EEL DR DL LP R QE
Sbjct: 583 IAPAQPQIDEAVKAAQAADAIVAFVGLSPDIEGEELQILVPGFDRGDRTDLGLP-RTQED 641
Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
+ + +A KP+++VLL G V + +A + W YPGEAG A+A + G+ NP
Sbjct: 642 LLKAVKATGKPLVVVLLSGSAVALNWADAHADAVVAAW--YPGEAGGTAIARTLTGEANP 699
Query: 581 GGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKF 640
GRLP+T+Y P D +M GRTYR+++GK ++PFG GLSY+++SY
Sbjct: 700 SGRLPVTFYRSVQDLPPFIDYRME--------GRTYRYFKGKPLYPFGHGLSYTQFSY-- 749
Query: 641 KAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHP 700
+ L L+ S+ T + V++ V+N+G+ AG
Sbjct: 750 -----SDLKLDTSTLTA----------------------GQPLRVSVRVRNNGQRAGDEV 782
Query: 701 VLLFVKPARRGNGRPIK-----QLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMV 755
V L+VK RP L F V L A E +V + P LS +G
Sbjct: 783 VQLYVK-------RPDTFGLNASLAAFARVSLKAGESRTVVMTIDP-RDLSTVTLEGERA 834
Query: 756 IEEGTHFLVVG 766
I G + L VG
Sbjct: 835 IRAGAYGLSVG 845
>gi|397642422|gb|EJK75223.1| hypothetical protein THAOC_03061, partial [Thalassiosira oceanica]
Length = 534
Score = 290 bits (743), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 192/560 (34%), Positives = 280/560 (50%), Gaps = 91/560 (16%)
Query: 211 SACCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGR----------- 258
+A CKH AY L+ T R+ F A + D TY P F++CV R
Sbjct: 11 AATCKHLAAYSLE----TDRFNFSADGIDRTDWEGTYLPAFDACVHAERFLLEHYNASGG 66
Query: 259 --------ASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEG 310
A G+MC+YN ++G+P+CAD LL RR W F G + SDC AV I+
Sbjct: 67 GGGGQDRGALGVMCSYNAIDGVPACADPALLKDMLRRDWNFTGLVVSDCWAVDNIHSNHR 126
Query: 311 YAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLF 370
+ S E+AV L++G+D++CG+ Q + A + L E +ID AL LF V M LG F
Sbjct: 127 FVASYEEAVGLALRSGVDLDCGNTFQDFGRLAYDESLLDEDDIDEALSRLFRVLMDLGYF 186
Query: 371 NGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKN-----SHGLLPLPKSKSVSLA 425
+ T +P K D + H LAL+AA IVLLKN G LPL +K +A
Sbjct: 187 --DETDEPDAKSSDDEM---EHDQLALEAALQSIVLLKNGINEDEPGPLPLSLAKHKEIA 241
Query: 426 LIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIA 485
L GP A++ LLGNY G +TPL L G + AS+ D
Sbjct: 242 LFGPLADNQTVLLGNYHGLPSTIVTPLMGLAKM--------GVEVAFRQRASV---CDF- 289
Query: 486 KGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKK---PVILVLLCGGPV 542
G +L++GLDQ+ E E+ DR L+LP Q++LI ++ +K PV+LV++ GG V
Sbjct: 290 HGESATILVVGLDQSLEAEDQDRTTLLLPVEQRDLIKTISRCSKVRDLPVVLVVVSGGMV 349
Query: 543 DITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDM 601
D++ K +I +++ YPG+ G ALA+V++G +NP G+L T YP+ Y+ +V + DM
Sbjct: 350 DLSRYKNSSDIDAMIHMSYPGQNGGSALAQVLYGAYNPSGKLVGTMYPESYLNEVSLHDM 409
Query: 602 KMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVEN 661
+MRP PGRT+R+Y G ++PFG GLSY+ + Y + +
Sbjct: 410 RMRPDGKF--PGRTHRYYRGDVIYPFGYGLSYTSFRYAMEFLG----------------- 450
Query: 662 QDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGR-PIKQLV 720
GT V + V N G M G VLLF + GN + P + L+
Sbjct: 451 ------------GT---------VKVTVSNSGSMDGSVAVLLFHSAPQAGNEQEPFRSLI 489
Query: 721 GFQSVILNAKEKAEIVFELS 740
GF+ + ++ + + F++S
Sbjct: 490 GFEKIYVSVGDSQLVSFDVS 509
>gi|323451833|gb|EGB07709.1| hypothetical protein AURANDRAFT_64764 [Aureococcus anophagefferens]
Length = 819
Score = 290 bits (742), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 229/690 (33%), Positives = 337/690 (48%), Gaps = 91/690 (13%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
+ +LP + R L + L++ I QLVN+APA+ + +PAY W ++ HGV G
Sbjct: 71 YLDASLPEADRLAWLADNVPLEDMIGQLVNAAPAVDAVDLPAYNWLNDNEHGVKGTAH-- 128
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA---------------- 148
AT +P AS+ L +R+G AIG E+RA +N
Sbjct: 129 --------ATVYPMGASLGASWSVDLAWRVGAAIGNESRATHNGLADKSGNACGSTSTGE 180
Query: 149 --GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDT--FNGGKL 204
G+T +APN+N+ RDPRWGR +E GEDP +T + AV V G+QG+ G
Sbjct: 181 VVANGCGITLYAPNVNLVRDPRWGRAEEVYGEDPHLTAELAVGMVTGLQGNAEGSTSGPG 240
Query: 205 KGKLQASACCKHFTAY-------DLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQG 257
G L ACCKHF A+ DL R DA V+ +DL +TY P ++CV +
Sbjct: 241 GGPLVTGACCKHFAAHFAVYQNEDLP----ADRMVLDANVSSRDLWETYLPVMKACVVRA 296
Query: 258 RASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPED 317
+A+ VNG P+CA LL+ R WGF G++ SD DA S + Y + E+
Sbjct: 297 KAT-------HVNGKPTCAHPELLNDVLRESWGFDGFVVSDYDAWSNLVTTHKYVSTWEE 349
Query: 318 AVVDVLKAGMDVNCGSFLQKHTKA---AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNP 374
A + AGMD G A AV+ + + + R+ L VR+RLG+F+
Sbjct: 350 AAAAGINAGMDQEGGFGDYSPVDALPDAVRNGTVAAATVRRSFERLMRVRLRLGMFDPPA 409
Query: 375 TMQPFGKI-GADVVCSPAHQV-LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNAN 432
+ +G+ D C A ++ LA +AA++GIVL KN+ G LPL +K +AL+GP +
Sbjct: 410 STAVYGEAYQCDYQCETAAKLALAREAAREGIVLFKNAGGALPL--AKGARIALVGPQVD 467
Query: 433 SAKTLLG--NYA---GPSCRSITPLQALQNYVENTVYYPGCDTVACSS-ASIDKAVDIAK 486
+ LLG NYA GP +T + L+ V N GCD+VAC++ +D A +A
Sbjct: 468 DWRVLLGAVNYAFEDGPDVAPVTIQKGLEA-VANVSVAAGCDSVACAALVDVDGAKRLAA 526
Query: 487 GADHVVLMMG---------------LDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKP 531
AD V+++G D E E DR + LPG Q L+ + A+ +
Sbjct: 527 AADATVVVLGDSFGATDGWPLCRGTRDDGCESESHDRATIELPGEQVALVAALRAASSR- 585
Query: 532 VILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQ 591
++ VL+ GG V + A D + LW PG+ G ALA+V+FGD++P GR P+T Y
Sbjct: 586 LVCVLVHGGAVALGAAADDCDAVLDLWV--PGQMGGAALADVLFGDYSPAGRSPITMYAA 643
Query: 592 DYIKVPMTDMKMRPQATSGNPGRTYRFYEG-KEVFPFGCGLSYSKYSYKFKAVSQN---- 646
PM A + G TYR+Y G + FG GLSY+ +SY + A
Sbjct: 644 TSDLPPMG--VFDEYAGESSNGTTYRYYAGPAPTYAFGDGLSYASFSYAWAAAPPTTVDA 701
Query: 647 ----KLYLNQSSSTKMVENQDVVHYKSVPE 672
+L + +++ + ++ V Y VP+
Sbjct: 702 CGAIRLRVAVTNTGSVASDEVVQVYARVPD 731
>gi|344995394|ref|YP_004797737.1| glycoside hydrolase family protein [Caldicellulosiruptor
lactoaceticus 6A]
gi|343963613|gb|AEM72760.1| glycoside hydrolase family 3 domain protein [Caldicellulosiruptor
lactoaceticus 6A]
Length = 770
Score = 290 bits (742), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 228/728 (31%), Positives = 356/728 (48%), Gaps = 116/728 (15%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIPA E+ G +GAT FPQ I A +FD+ + + + I +
Sbjct: 83 RLGIPAM-IHEESCSGFMA-----------KGATVFPQSIGVACTFDNEIVEELAKVIRI 130
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
+ +A + QA+ AP I++ RD RWGR +ET GEDP + AVSYV+G+QGD
Sbjct: 131 QMKAT-GSHQAL-----APLIDVARDARWGRVEETFGEDPYLVANMAVSYVKGIQGDDIK 184
Query: 201 GGKLKGKLQASACCKHFTAYDLD----NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQ 256
G + A KHF Y + NW + ++L + Y PFE VK
Sbjct: 185 DGIV-------ATGKHFVGYAMSEGGMNWAPV-------HIPERELREVYLYPFEVAVKV 230
Query: 257 GRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE 316
IM AY+ ++GIP A+R LL+ AR +WGF G SD V I D K+
Sbjct: 231 AGLKSIMPAYHEIDGIPCHANRKLLTDIARGEWGFDGIYVSDYSGVRNILDYHKAVKTYA 290
Query: 317 DAVVDVLKAGMDVNCGS---FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGN 373
+A L AG+D+ F ++ K A+K+ K + +D A+ + ++ RLGLF+ N
Sbjct: 291 EAAYISLWAGLDIELPKIECFTEEFIK-ALKEGKFDMAVVDAAVKRVLEMKFRLGLFD-N 348
Query: 374 PTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANS 433
P ++ G + ++ + + L+ + AQ+ +VLLKN + LPL K +A+IGPNA+S
Sbjct: 349 PYIKTEGIL--ELFDNKEQRELSRKVAQESMVLLKNDN-FLPLSKDVK-KIAVIGPNADS 404
Query: 434 AKTLLGNYAGPS----------------------CRSITPL----QALQNYVEN---TVY 464
+ LLG+Y+ P+ R + + +A+++ V+N VY
Sbjct: 405 VRNLLGDYSYPAHIATLEMFFIKEDRGVGNEEEFVRKVINMKSIFEAVKDRVQNKAEVVY 464
Query: 465 YPGCDTVACSSASIDKAVDIAKGADHVVLMMG------LDQTQEKEELDRVDLVLPGRQQ 518
GCD + ++A A+GAD V+L++G LD T E DR L LPG Q+
Sbjct: 465 AKGCDVNTQDESGFEEAKKAAQGADVVILVVGDKAGLRLDCTS-GESRDRASLKLPGVQE 523
Query: 519 ELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDH 578
+LI V++ + +++VL+ G PV + +IL A +PGE GA A+A+V+FGD+
Sbjct: 524 KLIEEVSKVNEN-IVVVLVNGRPV--ALEGIWQKAKAILEAWFPGEEGAEAVADVLFGDY 580
Query: 579 NPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSY 638
NPGG+L ++ +P+D +VP+ +P Y K PFG GLSY+ + Y
Sbjct: 581 NPGGKLAIS-FPRDVGQVPVY-YGHKPSGGKSCWHGDYVEMSTKPFLPFGYGLSYTTFEY 638
Query: 639 KFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGK 698
K A+ + K+ +++S +++ V+N G+ AG
Sbjct: 639 KNFAIEKEKISMDES-----------------------------IKISVEVENTGKYAGD 669
Query: 699 HPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEE 758
V L+ + RP+K+L ++ V L EK ++VFE+ P + + D VI
Sbjct: 670 EIVQLYTRKEEFLVTRPVKELKAYKRVHLKPGEKKKVVFEIFP-DQFAYYDYDMKRVISP 728
Query: 759 GTHFLVVG 766
GT ++VG
Sbjct: 729 GTVEVMVG 736
>gi|206901921|ref|YP_002251428.1| xylosidase/arabinosidase [Dictyoglomus thermophilum H-6-12]
gi|206741024|gb|ACI20082.1| xylosidase/arabinosidase [Dictyoglomus thermophilum H-6-12]
Length = 756
Score = 290 bits (742), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 217/690 (31%), Positives = 351/690 (50%), Gaps = 85/690 (12%)
Query: 92 EALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA 151
EALHG +G+T FPQ I A++++ L Y++ AIG E R+ +
Sbjct: 138 EALHGCMA-----------KGSTIFPQAIGMASTWNPELIYQVATAIGKETRS-----RG 181
Query: 152 IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQAS 211
I +P INI RDPR GR +ET GEDP + + AV+Y++GVQ +
Sbjct: 182 IHQVL-SPTINIARDPRCGRTEETYGEDPYLASRMAVAYIKGVQ-----------EQGVI 229
Query: 212 ACCKHFTA-YDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVN 270
A KHF A + D + + F R+ L + Y P F + +++ A +M AYN ++
Sbjct: 230 ATPKHFVANFVGDGGRDSYPIHFSERL----LREIYFPAFRASIEEAGALSLMAAYNSLD 285
Query: 271 GIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVN 330
GIP +++ LL++ R++WGF GY+ SD +V + A+S +A L+AG+D+
Sbjct: 286 GIPCSSNKWLLTRILRKEWGFKGYVVSDYFSVLHLMTKHKVAESKAEAAKLSLEAGLDME 345
Query: 331 C-GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQP-FGKIGADVVC 388
S + +++ KL + +D A+ + V+ +GLF+ NP + P + + D C
Sbjct: 346 LPDSDCFEEIPGLIRESKLSQDTLDEAVRRVLRVKFWIGLFD-NPFVDPDYAERIND--C 402
Query: 389 SPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS 448
S H+ LAL+ A++ IVLLKN G+LPL K S+A+IGPNA A LG Y+G +
Sbjct: 403 S-EHRELALRVARESIVLLKN-EGILPLNKDIR-SIAVIGPNA--AVPRLGGYSGYGVKV 457
Query: 449 ITPLQALQNYVENTV---YYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQ-EKE 504
+TPL+ ++N + + V + GC S + D+A+ IA+ +D +L MG + E E
Sbjct: 458 VTPLEGIKNKLGDKVKVYFAEGCGLNDTSKSGFDEAIKIAQKSDVAILFMGNSVPETEGE 517
Query: 505 ELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGE 564
+ DR +L LPG Q++LI + PVI+VL+ G IT + + +++ A YPGE
Sbjct: 518 QRDRHNLNLPGVQEDLIKEICN-TNTPVIVVLINGSA--ITMMNWIDKVQAVIEAWYPGE 574
Query: 565 AGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEV 624
G A+A+V+FGD+NPGG+LP++ +P+ ++P+ +P SG + +
Sbjct: 575 EGGNAIADVLFGDYNPGGKLPIS-FPKYSSQLPLY-YNHKP---SGRVDDYVDLRGNQYL 629
Query: 625 FPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFL 684
FPFG GLSY+ + Y + + PE E + +
Sbjct: 630 FPFGYGLSYTDFKYS--------------------------NLRITPE---EIPRDGEVV 660
Query: 685 VTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCES 744
+T ++N G+ G V L++ RPIK+L F+ V L+ E+ + F+L+
Sbjct: 661 ITFDIENIGKYKGDEVVQLYLHDEFASVARPIKELKRFERVTLDVGERKTVSFKLNR-RD 719
Query: 745 LSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
L D +V+E G +++G I +
Sbjct: 720 LEFLSMDMELVVEPGRFEVLIGSSSEDIRL 749
>gi|402307522|ref|ZP_10826545.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
sp. MSX73]
gi|400378572|gb|EJP31427.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
sp. MSX73]
Length = 858
Score = 290 bits (741), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 176/463 (38%), Positives = 252/463 (54%), Gaps = 35/463 (7%)
Query: 39 STETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVA 98
+ + P+C L +RARDL+SRLTL+EK +++ +PAIPRLGI + WWSEALHG A
Sbjct: 18 TAQLLPYCNPDLSARERARDLLSRLTLEEKARLMLDESPAIPRLGIKKFFWWSEALHGAA 77
Query: 99 GVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG----- 153
+G G T FP+ + AASF+ L R+ A E RA YN G
Sbjct: 78 NMG----------GVTVFPEPVGMAASFNDGLLRRVFDAASDEMRAQYNRRMLNGGEDEK 127
Query: 154 ---MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQ 209
++ W PN+NIFRDPRWGRGQET GEDP +T + VRG+QG +T KL
Sbjct: 128 FHSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSVMGTAVVRGLQGPETAKYRKL----- 182
Query: 210 ASACCKHFTAYDLDNWKGTTRYKFD-ARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
AC KH+ + + TR+ + A V+ +DL +TY P F++ V + + +MCAY R
Sbjct: 183 -WACAKHYAVHSGPEY---TRHTANVADVSPRDLWETYLPAFKTLVTEAKVREVMCAYQR 238
Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
++ P C++ LL + R +WGF+ + SDC AV+ IY + A AG D
Sbjct: 239 LDDDPCCSNNRLLQQILRDEWGFNYLVVSDCGAVTDIYANHKTSSDAVHAAAKAAVAGTD 298
Query: 329 VNCG-SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVV 387
V CG + K AV++ + E+E+D+ + L R LG + +P + + KI A V+
Sbjct: 299 VECGFGYAYKTIPEAVRRGLITEAEVDKHVLRLLEGRFDLGEMD-DPKLVEWSKIPASVM 357
Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
S AH+ LAL A+ +VLL+N G+LPL K+ +A+IGPNA+ + GNY G R
Sbjct: 358 DSKAHRQLALDMARQSLVLLQNKGGVLPL-KAGGEPIAVIGPNADDGPMMWGNYNGTPNR 416
Query: 448 SITPLQALQNYVENTVYYPGC---DTVACSSASIDKAVDIAKG 487
++T L ++ + Y GC DT +S A+D KG
Sbjct: 417 TVTILDGIKARHKRVTYLKGCDLTDTKTVNSLLPQCAIDGRKG 459
Score = 76.6 bits (187), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 70/265 (26%), Positives = 110/265 (41%), Gaps = 56/265 (21%)
Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
+G VV + G+ E EE+ DR ++ LP Q++ + + EA K ++
Sbjct: 597 QGIRKVVFVGGISAALEGEEMPVDIDGFKGGDRTNIELPKVQRDFLRALHEAGK--TVVF 654
Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK 595
+ C G I +IL A Y G+ G A+++V+FG NP G+LP+T+Y +
Sbjct: 655 VNCSGSA-IALEPEMETCDAILQAWYAGQEGGTAVSDVLFGTVNPSGKLPVTFYKRTDQL 713
Query: 596 VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSS 655
D MR GRTYR++ +F FG GLSY+ + +
Sbjct: 714 PDYEDYSMR--------GRTYRYFS-DPLFAFGYGLSYTTFRF----------------G 748
Query: 656 TKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRP 715
E + + SVP + N G G+ V ++++ NG P
Sbjct: 749 RAHAEAAEGGYRLSVP-----------------LTNTGTRPGEEVVQVYIRRVADTNG-P 790
Query: 716 IKQLVGFQSVILNAKEKAEIVFELS 740
+K L F+ V L A E + LS
Sbjct: 791 LKSLRAFRRVALKAGESTTVEIPLS 815
>gi|288925400|ref|ZP_06419334.1| beta-glucosidase [Prevotella buccae D17]
gi|288337871|gb|EFC76223.1| beta-glucosidase [Prevotella buccae D17]
Length = 858
Score = 290 bits (741), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 176/463 (38%), Positives = 252/463 (54%), Gaps = 35/463 (7%)
Query: 39 STETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVA 98
+ + P+C L +RARDL+SRLTL+EK +++ +PAIPRLGI + WWSEALHG A
Sbjct: 18 TAQLLPYCNPDLSARERARDLLSRLTLEEKARLMLDESPAIPRLGIKKFFWWSEALHGAA 77
Query: 99 GVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG----- 153
+G G T FP+ + AASF+ L R+ A E RA YN G
Sbjct: 78 NMG----------GVTVFPEPVGMAASFNDGLLRRVFDAASDEMRAQYNRRMLNGGEDEK 127
Query: 154 ---MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQ 209
++ W PN+NIFRDPRWGRGQET GEDP +T + VRG+QG +T KL
Sbjct: 128 FHSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSVMGTAVVRGLQGPETAKYRKL----- 182
Query: 210 ASACCKHFTAYDLDNWKGTTRYKFD-ARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
AC KH+ + + TR+ + A V+ +DL +TY P F++ V + + +MCAY R
Sbjct: 183 -WACAKHYAVHSGPEY---TRHTANVADVSPRDLWETYLPAFKTLVTEAKVREVMCAYQR 238
Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
++ P C++ LL + R +WGF+ + SDC AV+ IY + A AG D
Sbjct: 239 LDDDPCCSNNRLLQQILRDEWGFNYLVVSDCGAVTDIYANHKTSSDAVHAAAKAAVAGTD 298
Query: 329 VNCG-SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVV 387
V CG + K AV++ + E+E+D+ + L R LG + +P + + KI A V+
Sbjct: 299 VECGFGYAYKTIPEAVRRGLITEAEVDKHVLRLLEGRFDLGEMD-DPKLVEWSKIPASVM 357
Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
S AH+ LAL A+ +VLL+N G+LPL K+ +A+IGPNA+ + GNY G R
Sbjct: 358 DSKAHRQLALDMARQSLVLLQNKGGVLPL-KAGGEPIAVIGPNADDGPMMWGNYNGTPNR 416
Query: 448 SITPLQALQNYVENTVYYPGC---DTVACSSASIDKAVDIAKG 487
++T L ++ + Y GC DT +S A+D KG
Sbjct: 417 TVTILNGIKVRHKRVTYLKGCDLTDTKTVNSLLPQCAIDGRKG 459
Score = 76.6 bits (187), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 70/265 (26%), Positives = 110/265 (41%), Gaps = 56/265 (21%)
Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
+G VV + G+ E EE+ DR ++ LP Q++ + + EA K ++
Sbjct: 597 QGIRKVVFVGGISAALEGEEMPVDIDGFKGGDRTNIELPKVQRDFLRALHEAGK--TVVF 654
Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK 595
+ C G I +IL A Y G+ G A+++V+FG NP G+LP+T+Y +
Sbjct: 655 VNCSGSA-IALEPEMETCDAILQAWYAGQEGGTAVSDVLFGTVNPSGKLPVTFYKRTDQL 713
Query: 596 VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSS 655
D MR GRTYR++ +F FG GLSY+ + +
Sbjct: 714 PDYEDYSMR--------GRTYRYFS-DPLFAFGYGLSYTTFRF----------------G 748
Query: 656 TKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRP 715
E + + SVP + N G G+ V ++++ NG P
Sbjct: 749 RARAEAAEGGYRLSVP-----------------LTNTGTRPGEEVVQVYIRRVADTNG-P 790
Query: 716 IKQLVGFQSVILNAKEKAEIVFELS 740
+K L F+ V L A E + LS
Sbjct: 791 LKSLRAFRRVALKAGESTTVEIPLS 815
>gi|393782428|ref|ZP_10370612.1| hypothetical protein HMPREF1071_01480 [Bacteroides salyersiae
CL02T12C01]
gi|392673256|gb|EIY66719.1| hypothetical protein HMPREF1071_01480 [Bacteroides salyersiae
CL02T12C01]
Length = 596
Score = 290 bits (741), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 194/626 (30%), Positives = 310/626 (49%), Gaps = 68/626 (10%)
Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASA 212
MT+W+PN+NIFRDPRWGRGQET GEDP +T + +YVRG+QG D F L+A+A
Sbjct: 1 MTYWSPNVNIFRDPRWGRGQETYGEDPYLTAEIGKAYVRGLQGNDPF-------FLKAAA 53
Query: 213 CCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGI 272
C KH+ + + R++F+A + +DL +TY P FE+ VK+ + +M AYNRV G
Sbjct: 54 CAKHYAVH---SGPEALRHEFNASPSKRDLFETYLPAFEALVKEAKVEAVMGAYNRVYGE 110
Query: 273 PSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG 332
+ LL+ R++WGF G++ SDC AV IY AK +A LK+G+++NCG
Sbjct: 111 SASGSFFLLTDILRKKWGFKGHVVSDCGAVDDIYGGHKIAKDVAEASAIALKSGLNLNCG 170
Query: 333 SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAH 392
K A+++K + E ++D AL L R++LG + P+ I V+ S H
Sbjct: 171 GSFHA-LKEALERKLITEVDLDNALMPLMMTRLKLGNLTDDDE-SPYKNISDSVIASYTH 228
Query: 393 QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
++A + AQ +VLLKN++ LPL K ++ + GP A ++GNY G S RS T L
Sbjct: 229 AMVAREVAQKSMVLLKNNNHTLPLKKDVK-TIFVTGPYAADTYVMMGNYYGVSPRSNTFL 287
Query: 453 QALQNYVENTV---YYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRV 509
Q + V Y G + D V + A+ ++++GL E EE D +
Sbjct: 288 QGIAAKVSGGTSINYKIGILPTTPNMNPADWTVGEVRAAEVAIVVIGLSGIDEGEEGDAI 347
Query: 510 ---------DLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAG 560
+L LP Q + + ++ ++ V+ G P+D+ + +++ A
Sbjct: 348 ASSHRGDKQNLKLPEHQLKFLRDISRNRWNKLVTVITGGSPIDLE--EVSELSDAVIMAW 405
Query: 561 YPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFY 619
YPG+ G +AL +++FGD + GR+P+T +P + +P D M+ GRTY++
Sbjct: 406 YPGQEGGMALGDLLFGDVSFSGRMPVT-FPINSDWLPAFEDYNMQ--------GRTYKYM 456
Query: 620 EGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCE 679
++PFG GL+Y V Y V L ++
Sbjct: 457 TDNIMYPFGYGLTYGD-----------------------------VSYSDVKILNPKYDG 487
Query: 680 TRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFEL 739
++ V ++N+G + V L++ G PI L+GF+ V L + + F +
Sbjct: 488 KQEIHVQATLRNNGNNEVEEVVQLYLSAPGAGVITPISSLIGFKRVTLESHLSQTVEFII 547
Query: 740 SPCESLSRAREDGLMVIEEGTHFLVV 765
P + L EDG + +G + ++V
Sbjct: 548 KP-DQLKMVMEDGSKNLLKGKYTIIV 572
>gi|284998833|ref|YP_003420601.1| glycoside hydrolase family protein [Sulfolobus islandicus L.D.8.5]
gi|284446729|gb|ADB88231.1| glycoside hydrolase, family 3 domain protein [Sulfolobus islandicus
L.D.8.5]
Length = 754
Score = 289 bits (740), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 226/755 (29%), Positives = 363/755 (48%), Gaps = 114/755 (15%)
Query: 50 LPISQRARDLVSRLTLDEK-ISQLVNSAPAI----PRLGIPAYEWWSEALHGVAGVGKGI 104
L I Q R SRL L K +++LVN ++ IPA E L G+ G
Sbjct: 45 LGIGQITRVAGSRLGLKPKEVAKLVNKIQKFLVENTKMKIPAI-IHEECLSGLMGYS--- 100
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIF 164
+T+FPQ I A++++ L I I + R L Q + +P +++
Sbjct: 101 --------STAFPQAIGLASTWNPELVMDIASVIRSQGR-LVGVNQCL-----SPVLDVC 146
Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
+DPRWGR +ET GEDP + ++Y+ G+QGD Q A KHF A+
Sbjct: 147 KDPRWGRCEETYGEDPYLVASMGLAYITGLQGDN----------QLVATAKHFAAH---G 193
Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
+ R V ++L +T+ PFE VK G+ IM AY+ ++GIP + LL+
Sbjct: 194 FPEGGRNIAQVHVGNRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGIPCHGNPQLLTNI 253
Query: 285 ARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSF--LQKHTKAA 342
R++WGF G + SD D + + A + +A + L++G+D+ + + A
Sbjct: 254 LRQEWGFDGIVVSDYDGIRQLETIHRVASNKMEAAILALESGVDIEFPTIDCYGEPLVNA 313
Query: 343 VKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGA--DVVCSPAHQVLALQAA 400
+K+ +PES IDRA+ + ++ RLGL + PF + + + + LAL+ A
Sbjct: 314 LKEGLVPESLIDRAVERVLRIKDRLGLLD-----NPFVNENSVPEKLDDHKSRELALKTA 368
Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA-------GPSCRSITPLQ 453
++ IVLLKN + +LPL K+ + +A+IGPNAN + +LG+Y +T LQ
Sbjct: 369 RESIVLLKNENNILPLSKNVN-KIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQ 427
Query: 454 ALQNYV--ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG---------LDQTQE 502
+ V +Y GCD + S +A++IA+ AD ++ +MG +D E
Sbjct: 428 GIVKKVGESKVLYAKGCDIASESKEGFAEAIEIARQADVIIAIMGEKSGLPLSWMDIPSE 487
Query: 503 KEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRN 552
+E DR L LPG Q+EL+ + + KP+ILVL+ G P + +
Sbjct: 488 EEFKKYQAVTGEGNDRSSLRLPGVQEELLKELYKTG-KPIILVLINGRP--LVLSSIINY 544
Query: 553 IGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNP 612
+ +++ A +PGE G A+A+VIFGD+NP GRLP+T +P D ++P+ +P +
Sbjct: 545 VKAVIEAWFPGEEGGNAIADVIFGDYNPSGRLPIT-FPMDTGQIPLY-YNRKPSSF---- 598
Query: 613 GRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPE 672
R Y +F FG GLSY+++ Y V+ ++ N + +
Sbjct: 599 -RPYVMLRSSPLFTFGYGLSYTQFEYSNLEVTPKEIGPNSNIA----------------- 640
Query: 673 LGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEK 732
++I VKN G+M G V L+V RP+K+L GF + L EK
Sbjct: 641 ------------ISIDVKNVGKMEGDDVVQLYVSKTFSSVARPVKELKGFAKIHLKPGEK 688
Query: 733 AEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGD 767
+ F L P E+L+ +V+E+G + L++G+
Sbjct: 689 RRVKFIL-PTEALAFYDSFMRLVVEKGEYQLLIGN 722
>gi|312621303|ref|YP_004022916.1| glycoside hydrolase family 3 domain-containing protein
[Caldicellulosiruptor kronotskyensis 2002]
gi|312201770|gb|ADQ45097.1| glycoside hydrolase family 3 domain protein [Caldicellulosiruptor
kronotskyensis 2002]
Length = 770
Score = 289 bits (740), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 229/728 (31%), Positives = 356/728 (48%), Gaps = 116/728 (15%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIPA E+ G +GAT FPQ I A +FD+ + + + I
Sbjct: 83 RLGIPAM-IHEESCSGFMA-----------KGATVFPQSIGVACTFDNEIVEELAKVIKT 130
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
+ +A+ A QA+ AP I++ RD RWGR +ET GEDP + AVSYV+G+QGD
Sbjct: 131 QMKAV-GAHQAL-----APLIDVARDARWGRVEETFGEDPYLVANMAVSYVKGIQGDDIK 184
Query: 201 GGKLKGKLQASACCKHFTAYDLD----NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQ 256
G + A KHF Y + NW + ++L + Y PFE VK
Sbjct: 185 DGIV-------ATGKHFVGYAMSEGGMNWAPV-------HIPERELREVYLYPFEVAVKV 230
Query: 257 GRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE 316
IM AY+ ++GIP A+R LL+ AR +WGF G SD V I D K+
Sbjct: 231 AGLKSIMPAYHEIDGIPCHANRKLLTDIARGEWGFDGIFVSDYAGVRNILDYHKAVKTYA 290
Query: 317 DAVVDVLKAGMDVNCGS---FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGN 373
+A L AG+D+ F ++ K A+K+ K + +D A+ + ++ RLGLF+ N
Sbjct: 291 EAAYISLWAGLDIELPKIECFTEEFIK-ALKEGKFDMAVVDAAVKRVLEMKFRLGLFD-N 348
Query: 374 PTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANS 433
P ++ G + ++ + + L+ + AQ+ +VLLKN + LPL +A+IGPNA+S
Sbjct: 349 PYIKTEGIL--ELFDNKEQRELSRKVAQESMVLLKNDN-FLPLSNDVK-KIAVIGPNADS 404
Query: 434 AKTLLGNYAGPS----------------------CRSI----TPLQALQNYVEN---TVY 464
+ LLG+Y+ P+ R + + L+A+++ V+N VY
Sbjct: 405 VRNLLGDYSYPAHIATLEMFFIKEDKGVGNEEEFVRKVINIKSILEAIKDRVQNKAEVVY 464
Query: 465 YPGCDTVACSSASIDKAVDIAKGADHVVLMMG------LDQTQEKEELDRVDLVLPGRQQ 518
GCD + ++A A+GAD V+L++G LD T E DR L LPG Q+
Sbjct: 465 AKGCDVNNQDESGFEEAKKAAQGADVVILVVGDKAGLRLDCTS-GESRDRASLKLPGVQE 523
Query: 519 ELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDH 578
+LI V++ + +++VL+ G PV + +IL A +PGE GA A+A+V+FGD+
Sbjct: 524 KLIEEVSKVNEN-IVVVLVNGRPV--ALEGIWQKAKAILEAWFPGEEGAEAVADVLFGDY 580
Query: 579 NPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSY 638
NPGG+L ++ +P+D +VP+ +P Y K PFG GLSY+ + Y
Sbjct: 581 NPGGKLAIS-FPRDVGQVPVY-YGHKPSGGKSCWHGDYVEMSTKPFLPFGYGLSYTTFEY 638
Query: 639 KFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGK 698
K A+ + K+ +++S +++ V+N G+ AG
Sbjct: 639 KNFAIEKEKISMDES-----------------------------IKISVEVENTGKYAGD 669
Query: 699 HPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEE 758
V L+ + RP+K+L ++ V L EK ++VFE+ P + + D VI
Sbjct: 670 EIVQLYTRKEEFLVTRPVKELKAYKRVHLKPGEKKKVVFEIFP-DQFAYYDYDMNRVISP 728
Query: 759 GTHFLVVG 766
GT ++VG
Sbjct: 729 GTVEVMVG 736
>gi|227831319|ref|YP_002833099.1| glycoside hydrolase family protein [Sulfolobus islandicus L.S.2.15]
gi|227457767|gb|ACP36454.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
L.S.2.15]
Length = 754
Score = 289 bits (739), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 226/755 (29%), Positives = 361/755 (47%), Gaps = 114/755 (15%)
Query: 50 LPISQRARDLVSRLTLDEK-ISQLVNSAPAI----PRLGIPAYEWWSEALHGVAGVGKGI 104
L I Q R SRL L K +++LVN ++ IPA E L G+ G
Sbjct: 45 LGIGQITRVAGSRLGLKPKEVAKLVNKIQKFLVENTKMKIPAI-IHEECLSGLMGYS--- 100
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIF 164
+T+FPQ I A++++ L I I + R L Q + +P +++
Sbjct: 101 --------STAFPQAIGLASTWNPELVMDIASVIRSQGR-LVGVNQCL-----SPVLDVC 146
Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
+DPRWGR +ET GEDP + ++Y+ G+QGD Q A KHF A+
Sbjct: 147 KDPRWGRCEETYGEDPYLVASMGLAYITGLQGDN----------QLVATAKHFAAH---G 193
Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
+ R V ++L +T+ PFE VK G+ IM AY+ ++GIP + LL+
Sbjct: 194 FPEGGRNIAQVHVGNRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGIPCHGNPQLLTNI 253
Query: 285 ARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSF--LQKHTKAA 342
R++WGF G + SD D + + A + +A + L++G+D+ + + A
Sbjct: 254 LRQEWGFDGIVVSDYDGIRQLETIHRVASNKMEAAILALESGVDIEFPTIDCYGEPLVNA 313
Query: 343 VKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGA--DVVCSPAHQVLALQAA 400
+K+ +PES IDRA+ + ++ RLGL + PF + + + + LAL+ A
Sbjct: 314 LKEGLVPESLIDRAVERVLRIKDRLGLLD-----NPFVNENSVPEKLDDHKSRELALKTA 368
Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA-------GPSCRSITPLQ 453
++ IVLLKN + +LPL K+ + +A+IGPNAN + +LG+Y +T LQ
Sbjct: 369 RESIVLLKNENNILPLSKNVN-KIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQ 427
Query: 454 ALQNYV--ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLD--------QTQEK 503
+ V +Y GCD + S +A++IA+ AD ++ +MG K
Sbjct: 428 GIVKKVGESKVLYAKGCDIASESKEGFAEAIEIARQADVIIAIMGEKSGLPLSWMDIPSK 487
Query: 504 EEL-----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRN 552
EE DR L LPG Q+EL+ + + KP+ILVL+ G P + +
Sbjct: 488 EEFKKYQAVTGEGNDRSSLRLPGVQEELLKELYKTG-KPIILVLINGRP--LVLSSIINY 544
Query: 553 IGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNP 612
+ +++ A +PGE G A+A+VIFGD+NP GRLP+T +P D ++P+ +P +
Sbjct: 545 VKAVIEAWFPGEEGGNAIADVIFGDYNPSGRLPIT-FPMDTGQIPLY-YNRKPSSF---- 598
Query: 613 GRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPE 672
R Y +F FG GLSY+++ Y V+ ++ N + +
Sbjct: 599 -RPYVMLRSSPLFTFGYGLSYTQFEYSNLEVTPKEIGPNSNIA----------------- 640
Query: 673 LGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEK 732
++I VKN G+M G V L+V RP+K+L GF + L EK
Sbjct: 641 ------------ISIDVKNVGKMEGDDVVQLYVSKTFSSVARPVKELKGFAKIHLKPGEK 688
Query: 733 AEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGD 767
+ F L P E+L+ +V+E+G + L++G+
Sbjct: 689 RRVKFIL-PTEALAFYDSFMRLVVEKGEYQLLIGN 722
>gi|224025503|ref|ZP_03643869.1| hypothetical protein BACCOPRO_02243 [Bacteroides coprophilus DSM
18228]
gi|224018739|gb|EEF76737.1| hypothetical protein BACCOPRO_02243 [Bacteroides coprophilus DSM
18228]
Length = 787
Score = 289 bits (739), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 215/693 (31%), Positives = 347/693 (50%), Gaps = 103/693 (14%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIP + E HG +G AT FP + A++++ L ++G+ IGL
Sbjct: 125 RLGIPVL-FAEECPHGHMAIG-----------ATVFPTSMGQASTWNESLIRQMGEVIGL 172
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
EAR G IG + P ++I R+PRW R +ET GEDP +TG ++V+G+QG F
Sbjct: 173 EARL---QGANIG---YGPVLDIAREPRWSRVEETFGEDPYLTGILGTAFVQGMQGKDFK 226
Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFD---ARVTMQDLADTYQPPFESCVKQG 257
G+ + KH AY G R + A + ++ L D Y P F+ V+ G
Sbjct: 227 DGR-----HVYSTLKHLAAY------GVPRGGHNGGPADMGLRALLDEYLPGFQRAVEVG 275
Query: 258 RASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPED 317
+A+ +M +YN ++G+P +++ L+ R++WGF G++ SD ++ I A A + ED
Sbjct: 276 KAATVMTSYNSIDGVPCTSNKFLIDSLLRKRWGFDGFVYSDLASIDGIAGAH-VAANLED 334
Query: 318 AVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQ 377
A + ++AG D++ G+ + AV+ K+ ES I+RA+ N+ ++ R+GLF P +
Sbjct: 335 AAIQAVEAGTDMDLGANAYRRLVKAVQTGKVKESAINRAVSNVLRLKFRMGLFE-QPYVS 393
Query: 378 PFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTL 437
P + A +V H++LA + A++G VLLKN +G+LPL K K + A+IGPNA+
Sbjct: 394 P--EEAARLVNCEDHRMLARKIAREGTVLLKN-NGILPLGKVKRI--AVIGPNADVMYNY 448
Query: 438 LGNYAGPSCRS--ITPLQALQNYVENTV--YYPGCDTVACSSASIDKAVDIAKGADHVVL 493
LG+Y P RS +T L AL+N + + Y GC + ++I +AV+ A+ AD V+L
Sbjct: 449 LGDYTAPQERSKVVTLLDALRNRMPDVRIDYVKGCAIRDTTQSNIKEAVEAARKADLVIL 508
Query: 494 MMG--------------------------LDQTQEKEELDRVDLVLPGRQQELITRVAEA 527
+G L + E DR L L G Q++LI +A A
Sbjct: 509 AVGGSSARDFKTKYINTGAATVDSENSGILSDMECGEGFDRATLDLLGDQEKLIRAIA-A 567
Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMT 587
+KP++ V + G P+++ A + ++L A YPGE G + +V+ G++NP GRLPM+
Sbjct: 568 TEKPLVTVYIAGRPLNMNLASEVSD--ALLTAWYPGEQGGNGIVDVLTGEYNPSGRLPMS 625
Query: 588 WYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNK 647
P+ ++P+ S R Y GK ++ FG GLSY+ ++Y S K
Sbjct: 626 -VPRHVGQIPV--------HYSQGTLRDYMDCPGKPLYTFGYGLSYTTFAY-----SNLK 671
Query: 648 LYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKP 707
L ++++ + +V+ +T V N G+ G V L++
Sbjct: 672 LSATAKAASQPAGDNEVMQ-----------------TITCTVTNTGDRDGDEVVQLYLND 714
Query: 708 ARRGNGRPIKQLVGFQSVILNAKEKAEIVFELS 740
P +L GFQ + L E E+ F+L+
Sbjct: 715 EVSSVAVPPIRLKGFQKIFLKKGESREVTFQLT 747
>gi|395492941|ref|ZP_10424520.1| glycoside hydrolase family protein [Sphingomonas sp. PAMC 26617]
Length = 865
Score = 288 bits (738), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 179/415 (43%), Positives = 240/415 (57%), Gaps = 34/415 (8%)
Query: 51 PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
PI R DL+ R+TL+EK +Q+ N APAIPRLGIP Y++W+EALHGVA G+
Sbjct: 20 PIEARVDDLMRRMTLEEKAAQMQNVAPAIPRLGIPPYDYWNEALHGVARAGE-------- 71
Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTFWAPNINI 163
AT FPQ I AA++D + GQ + E RA YN QA G+TFW+PNINI
Sbjct: 72 --ATVFPQAIGMAATWDRDMMLAEGQTVATEGRAKYNQAQAQKNYDRYYGLTFWSPNINI 129
Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
FRDPRWGRGQET GEDP +TG AV +V GVQG N L+A A KHF +
Sbjct: 130 FRDPRWGRGQETLGEDPYLTGTMAVPFVHGVQGTDAN------YLKAIATPKHFAVH--- 180
Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
+ R++F+ + +DL++TY P F + GRA +MCAYN V+ +CA+ LL
Sbjct: 181 SGPEQLRHQFNVDPSPRDLSETYLPAFRRAIVDGRAESLMCAYNAVDTKAACANTMLLKD 240
Query: 284 TARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTK--A 341
T R WGF G++TSDC A+ I + + + +KAG D C F +
Sbjct: 241 TLRGAWGFKGFVTSDCGAIDDITTGHHNSPTNPEGAALAVKAGTDTGC-DFKDEMLDLPR 299
Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQ-PFGKIGADVVCSPAHQVLALQAA 400
AVK L E ++D AL LF+ RM+LG+F +P + PF I SPAH+ LAL+AA
Sbjct: 300 AVKAGYLTEGDMDVALRRLFTARMKLGMF--DPAARVPFSTISIAENHSPAHRALALRAA 357
Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL 455
++ IVLLKN G+LPL + + +A++GP A S L GNY G ++ P+ +
Sbjct: 358 RESIVLLKND-GVLPL-AAGARRIAVVGPTAASLIALEGNYNGTPVGAVLPVDGM 410
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 78/257 (30%), Positives = 116/257 (45%), Gaps = 53/257 (20%)
Query: 496 GLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
GL+ E EE+ DR + LP Q +L+ + A KP+++VL G I
Sbjct: 602 GLNAWLEGEEMPLQVPGFAGGDRTAIALPAAQSQLLDALF-ATGKPLVIVLQSGSA--IA 658
Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRP 605
+ ++L A YPGEAG A+AEV+ G NP GRLP+T+Y D +M
Sbjct: 659 LGAQEAKARAVLEAWYPGEAGGQAIAEVLSGTVNPSGRLPVTFYASTDQLPAFDDYRM-- 716
Query: 606 QATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVV 665
RTYR++ G+ +PFG GLSY++++Y + + + Q +S
Sbjct: 717 ------ANRTYRYFAGRVEYPFGHGLSYTRFAYSALRPATSSVAAGQGTS---------- 760
Query: 666 HYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVK-PARRGNGRPIKQLVGFQS 724
V++ V+N G +AG L++ P R G PI+ L G+Q
Sbjct: 761 -------------------VSVAVRNTGVLAGDEVAQLYLSVPGREGA--PIRSLKGYQR 799
Query: 725 VILNAKEKAEIVFELSP 741
V L A E + F L P
Sbjct: 800 VHLAAGETKTLTFALEP 816
>gi|371777646|ref|ZP_09483968.1| beta-glucosidase [Anaerophaga sp. HS1]
Length = 865
Score = 288 bits (738), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 165/444 (37%), Positives = 245/444 (55%), Gaps = 32/444 (7%)
Query: 41 ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
+ PF L +RA+DL+SRLT+ EK L + + AIPRLGI + WWSEALHG A
Sbjct: 20 QVLPFQNPDLSSEERAKDLISRLTVQEKARLLCDQSEAIPRLGIKKFNWWSEALHGYAN- 78
Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG------- 153
N ++ T FPQ I AASF+ L + I AI EARA Y+ Q G
Sbjct: 79 ------NDSV---TVFPQPIGMAASFNEELVFEIFNAISDEARAKYHQAQRRGEENRRFL 129
Query: 154 -MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASA 212
++ W PN+NIFRDPRWGRGQET GEDP +T + V V+G+QG K + L A
Sbjct: 130 SLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVQVVKGLQGP--EDAKYRKLL---A 184
Query: 213 CCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
C KH+T + W +R++ + V+ ++ +TY P F++ V++ +MCAY+R++
Sbjct: 185 CAKHYTVHSGPEW---SRHELNINDVSPREFYETYMPAFKALVQKADVRQVMCAYHRLDD 241
Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC 331
P C++ +L + R +WG+ + +DC A+S Y G + +P A L AG D+ C
Sbjct: 242 EPCCSNTRILQRILRDEWGYEHMVVADCGAISDFYTTHGISSTPVHAAATGLLAGTDLEC 301
Query: 332 --GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
++ K A+++ + E +IDR+L + R LG + N ++ P+ +I V+
Sbjct: 302 IWDNYHYKMLPEALEKDLITEKDIDRSLMRVLKGRFDLGEMDDN-SLVPWAQIPPSVLNC 360
Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
H+ LA + AQ IVLL+N + +LPL KS +A++GPNA+ L GNY G R+I
Sbjct: 361 EKHRQLAYKMAQQSIVLLQNKNKVLPLDKSSINKIAVVGPNADDEVVLWGNYNGTPIRTI 420
Query: 450 TPLQALQNYV--ENTVYYPGCDTV 471
T L +++ V +N Y D V
Sbjct: 421 TVLDGIKSKVPHDNIFYDKAVDLV 444
Score = 91.3 bits (225), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 78/292 (26%), Positives = 125/292 (42%), Gaps = 52/292 (17%)
Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
+G + V+ + GL E EE+ DR ++ LP Q+ + + EA K ++
Sbjct: 601 EGIETVIFVGGLSGFLEGEEMPVSYPGFKGGDRTNIELPSVQRNCLKALKEAGK--TVIF 658
Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK 595
+ C G I + +I+ A Y GE+G A+A+V+FGD+NP G+LP+T+Y
Sbjct: 659 VNCSGSA-IALEPETESCDAIIQAWYGGESGGQAIADVLFGDYNPSGKLPVTFYRNSDNL 717
Query: 596 VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSS 655
D M GRTYR Y +FPFG GLSY+ + +S++ + +++ S
Sbjct: 718 GDFEDYSME--------GRTYR-YTNNHLFPFGFGLSYTNFEIGKARLSKSTIKADETIS 768
Query: 656 TKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRP 715
K I VKN G+ G V ++V+ +G P
Sbjct: 769 IK-----------------------------IPVKNTGKRDGTEIVQVYVRKVNDIDG-P 798
Query: 716 IKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGD 767
+K L GFQ + + A + + L P M + G + L G+
Sbjct: 799 LKTLKGFQRIAVPAGKTRQANISLPPSTFEFYDWSQRKMAVTPGEYILYYGN 850
>gi|261880245|ref|ZP_06006672.1| beta-glucosidase [Prevotella bergensis DSM 17361]
gi|270333079|gb|EFA43865.1| beta-glucosidase [Prevotella bergensis DSM 17361]
Length = 854
Score = 288 bits (738), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 169/440 (38%), Positives = 245/440 (55%), Gaps = 36/440 (8%)
Query: 41 ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
+ FP+ T L +RA DL SRLTL+EK + N +PAIPRLGIP +EWWSEALH G+
Sbjct: 21 QQFPYQNTDLSPKERAADLCSRLTLEEKSKIMQNGSPAIPRLGIPQFEWWSEALH---GI 77
Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-------- 152
G+ F AT FP + A+S+D L ++ A+ E R A QA
Sbjct: 78 GRNGF-------ATVFPITMGMASSWDDALLQKVFDAVSDEGRV--KAQQAKRSGTIKRY 128
Query: 153 -GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQAS 211
G++FW PNINIFRDPRWGRGQET GEDP +T + ++ VRG+QG + K + L
Sbjct: 129 QGLSFWTPNINIFRDPRWGRGQETYGEDPYLTSRMGLAVVRGLQGPS--DSKYRKLL--- 183
Query: 212 ACCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVN 270
AC KHF + W R+ F+ + +DL +TY P F++ V+QG + +MCAY R++
Sbjct: 184 ACAKHFAVHSGPEW---NRHTFNVEDLPERDLWETYLPAFKALVQQGDVAEVMCAYQRID 240
Query: 271 GIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE--DAVVDVLKAGMD 328
G P C + L R +W + G + SDC AV + + SP+ A + +G D
Sbjct: 241 GQPCCGNNRFLKSILRNEWNYQGMVVSDCWAVPDFWKKGHHEVSPDATHASAKAVLSGTD 300
Query: 329 VNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVC 388
V CGS + AV+ + E+++D ++ L R LG F+ + + P+ KI VV
Sbjct: 301 VECGSDY-SNLPEAVRAGIIKEADVDVSVRRLLEARFALGDFDPD-ELVPWTKISESVVA 358
Query: 389 SPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS 448
S AH+ LAL A+ +VLL+N+ +LPL +S + ++G NA + + GNY+G ++
Sbjct: 359 SKAHKQLALDMARKSMVLLQNND-ILPLKRSGQ-KIVVVGANAIDSTMMWGNYSGYPTQT 416
Query: 449 ITPLQALQNYVENTVYYPGC 468
+T LQ LQ + + PGC
Sbjct: 417 VTILQGLQTKSDQVTFIPGC 436
Score = 111 bits (278), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 88/292 (30%), Positives = 135/292 (46%), Gaps = 63/292 (21%)
Query: 487 GADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVL 536
GAD V+ + G+ E EE+ DR + LP Q+E+I ++EA ++ I+ +
Sbjct: 598 GADVVIFVGGISPRLEGEEMEVSDPGFKGGDRTTIELPQAQREVIKALSEAGRR--IVFV 655
Query: 537 LCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKV 596
C G I + + +IL A YPGE G A+A+V+FGD+NP G+LP+T+Y D
Sbjct: 656 NCSGSA-IALTPESQRVDAILQAWYPGEQGGTAVADVLFGDYNPSGKLPVTFYKNDAQLP 714
Query: 597 PMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSST 656
D +M GRTYR+++ +FPFG GLSY++++ + Q + NQ
Sbjct: 715 DFLDYRM--------AGRTYRYFKETPLFPFGYGLSYTQFT-----IGQPRYINNQ---- 757
Query: 657 KMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPI 716
V + V N G+ G V ++++ G PI
Sbjct: 758 ----------------------------VQVSVSNTGKRDGDEVVQVYIRRTDDAAG-PI 788
Query: 717 KQLVGFQSVILNAKEKAEIVFELSPCESLS--RAREDGLMVIEEGTHFLVVG 766
K L GFQ V L E ++ L P ES A + + VI G + ++VG
Sbjct: 789 KTLRGFQRVSLKVGETKQVSVSL-PRESFEWWDASSNTMRVI-PGNYEVMVG 838
>gi|385776908|ref|YP_005649476.1| glycoside hydrolase family protein [Sulfolobus islandicus REY15A]
gi|323475656|gb|ADX86262.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
REY15A]
Length = 754
Score = 288 bits (738), Expect = 7e-75, Method: Compositional matrix adjust.
Identities = 228/755 (30%), Positives = 363/755 (48%), Gaps = 114/755 (15%)
Query: 50 LPISQRARDLVSRLTLDEK-ISQLVNSAPAI----PRLGIPAYEWWSEALHGVAGVGKGI 104
L I Q R SRL L K +++LVN R+ IPA E L G+ G
Sbjct: 45 LGIGQITRVAGSRLGLKPKEVAKLVNKIQKFLVENTRMKIPAI-IHEECLSGLMGYS--- 100
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIF 164
+T+FPQ I A++++ L I I +AR L Q + +P +++
Sbjct: 101 --------STAFPQAIGLASTWNLELVMDIASVIRSQAR-LVGVNQCL-----SPVLDVC 146
Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
+DPRWGR +ET GEDP + ++Y+ G+QGD Q A KHF A+
Sbjct: 147 KDPRWGRCEETYGEDPYLVASMGLAYITGLQGDN----------QLVATAKHFAAH---G 193
Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
+ R V ++L +T+ PFE VK G+ IM AY+ ++GIP + LL+
Sbjct: 194 FPEGGRNIAQVHVGNRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGIPCHGNPQLLTNI 253
Query: 285 ARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSF--LQKHTKAA 342
R++WGF G + SD D + + A + +A + L++G+D+ + + A
Sbjct: 254 LRQEWGFDGIVVSDYDGIRQLETIHRVASNKMEAAILALESGVDIEFPTIDCYSEPLVNA 313
Query: 343 VKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGA--DVVCSPAHQVLALQAA 400
+ + +PES IDRA+ + ++ RLGL + PF + + + + LAL+ A
Sbjct: 314 LTEGLVPESLIDRAVERVLRIKDRLGLLD-----NPFVNENSVPEKLDDHKSRELALKTA 368
Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA-------GPSCRSITPLQ 453
++ IVLLKN + +LPL K+ + +A+IGPNAN + +LG+Y +T LQ
Sbjct: 369 RESIVLLKNENNILPLSKNVN-KIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQ 427
Query: 454 ALQNYV--ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG------LDQTQ--EK 503
+ V +Y GCD + S +A++IA+ AD ++ +MG L T +
Sbjct: 428 GVVKKVGESKVLYAKGCDIASESKEGFAEAIEIARQADVIIAVMGEKSGLPLSWTDIPSE 487
Query: 504 EEL-----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRN 552
EE DR L LPG Q+EL+ + + KP+ILVL+ G P + +
Sbjct: 488 EEFKKYQAVTGEGNDRSSLRLPGVQEELLKELYKTG-KPIILVLINGRP--LVLSPIINY 544
Query: 553 IGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNP 612
+ +++ A +PGE G A+A+VIFGD+NP GRLP+T +P D ++P+ +P +
Sbjct: 545 VKAVIEAWFPGEEGGNAIADVIFGDYNPSGRLPIT-FPMDTGQIPLY-YNRKPSSF---- 598
Query: 613 GRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPE 672
R Y +F FG GLSY+++ Y V+ ++ N + +
Sbjct: 599 -RPYVMLRSSPLFTFGYGLSYTQFEYSNLEVTPKEIGPNSNIA----------------- 640
Query: 673 LGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEK 732
++I VKN G+M G V L+V RP+K+L GF + L EK
Sbjct: 641 ------------ISIDVKNVGKMEGDDVVQLYVSKTFSSVARPVKELKGFAKIHLKPGEK 688
Query: 733 AEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGD 767
+ F L P E+L+ +V+E+G + L++G+
Sbjct: 689 RRVKFIL-PTEALAFYDSFMRLVVEKGEYQLLIGN 722
>gi|270340289|ref|ZP_06007688.2| periplasmic beta-glucosidase [Prevotella bergensis DSM 17361]
gi|270331970|gb|EFA42756.1| periplasmic beta-glucosidase [Prevotella bergensis DSM 17361]
Length = 867
Score = 288 bits (736), Expect = 9e-75, Method: Compositional matrix adjust.
Identities = 175/467 (37%), Positives = 250/467 (53%), Gaps = 40/467 (8%)
Query: 17 FTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSA 76
F SLL V PF P+ + P+ L +RA DL RLTL+EK S +++ +
Sbjct: 7 FVSLLMAV----LPFCNMPA----QQLPYQNPALSAHERAVDLCGRLTLEEKASLMLDDS 58
Query: 77 PAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQ 136
PAIPRLGI ++WWSEALHGVA +G T FPQ I AASF+ L Y++
Sbjct: 59 PAIPRLGIKRFQWWSEALHGVANMGD----------VTVFPQPIGMAASFNDNLVYQVFN 108
Query: 137 AIGLEARALYNAGQAI--------GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAV 188
A E RA +N QA G++ W PN+NIFRDPRWGRGQET GEDP +T +
Sbjct: 109 ATSDEMRAKWNELQASGKEVTRFHGLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGC 168
Query: 189 SYVRGVQG-DTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQ 247
+ VRG+QG +T KL AC KH+ + W T D V+ +DL +TY
Sbjct: 169 AVVRGLQGPETAKYRKL------WACAKHYAIHSGPEWARHTDNVTD--VSPRDLWETYM 220
Query: 248 PPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYD 307
P F+S V+ +MCAY R + P C + LL + R +WGF + SDC AVS +
Sbjct: 221 PAFKSLVQDANVREVMCAYQRWDDDPCCGNNRLLQRILRDEWGFKYLVVSDCGAVSDFWT 280
Query: 308 AEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMR 366
+ + + +A + AG DV CG +++ K AVK + E+E+D + L R
Sbjct: 281 SHKSSSNARNAATKAVLAGTDVECGYNYVYKSIPEAVKYGAMTEAEMDEHVVRLLEGRFD 340
Query: 367 LGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLAL 426
LG + N ++ P+ I + +CS H+ L+L + + LL+NS+ +LPL K K +A+
Sbjct: 341 LGEMDDN-SIVPWSSIPSSALCSKKHRQLSLDMSLQTMTLLQNSNDVLPLNK-KEKKIAV 398
Query: 427 IGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENT--VYYPGCDTV 471
IGPN ++ + GNY G ++T L +++ + V + GCD V
Sbjct: 399 IGPNVDNEPMMWGNYNGTPRSTVTILDGIKSRLRKNQLVTFRGCDLV 445
Score = 94.7 bits (234), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 77/269 (28%), Positives = 114/269 (42%), Gaps = 52/269 (19%)
Query: 481 AVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKK 530
A+ K + VV + G+ E EE+ DR D+ LP Q+E + + +A K
Sbjct: 597 AIKQLKDCETVVFVGGISPQLEGEEMPINVDGFKGGDRTDIELPKVQREFLKALKQAGKS 656
Query: 531 PVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYP 590
+V +C I + +IL A YPG+ G A+A V+FG++NP G+LP+T+Y
Sbjct: 657 ---VVFVCCSGSAIALTPETKTCDAILQAWYPGQEGGEAVARVLFGEYNPSGKLPITFYK 713
Query: 591 QDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYL 650
D M+ RTYR Y +FPFG GLSY+ +S +S L
Sbjct: 714 NTEQLPDFKDYSMK--------NRTYR-YMNDALFPFGYGLSYTDFSIGDATLSGKTLTP 764
Query: 651 NQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARR 710
+ + K + VKN G+ G + ++VK
Sbjct: 765 GATITMK-----------------------------VPVKNIGKRDGTEVIQIYVKDPTD 795
Query: 711 GNGRPIKQLVGFQSVILNAKEKAEIVFEL 739
G P+K L GFQ V + A + AE L
Sbjct: 796 SEG-PLKSLKGFQRVPVKAGQTAEATITL 823
>gi|385774250|ref|YP_005646817.1| glycoside hydrolase family protein [Sulfolobus islandicus HVE10/4]
gi|323478365|gb|ADX83603.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
HVE10/4]
Length = 754
Score = 288 bits (736), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 225/755 (29%), Positives = 360/755 (47%), Gaps = 114/755 (15%)
Query: 50 LPISQRARDLVSRLTLDEK-ISQLVNSAPAI----PRLGIPAYEWWSEALHGVAGVGKGI 104
L I Q R SRL L K +++LVN ++ IPA E L G+ G
Sbjct: 45 LGIGQITRVAGSRLGLKPKDVAKLVNKIQKFLVENTKMKIPAI-IHEECLSGLMGYS--- 100
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIF 164
+T+FPQ I A++++ L I I + R L Q + +P +++
Sbjct: 101 --------STAFPQAIGLASTWNLELVMDIASVIRSQGR-LVGVNQCL-----SPVLDVC 146
Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
+DPRWGR +ET GEDP + ++Y+ G+QGD Q A KHF A+
Sbjct: 147 KDPRWGRCEETYGEDPYLVASMGLAYITGLQGDN----------QLVATAKHFAAH---G 193
Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
+ R V ++L +T+ PFE VK G+ IM AY+ ++GIP + LL+
Sbjct: 194 FPEGGRNIAQVHVGNRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGIPCHGNPQLLTNI 253
Query: 285 ARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSF--LQKHTKAA 342
R++WGF G + SD D + + A + +A + L++G+D+ + + A
Sbjct: 254 LRQEWGFDGIVVSDYDGIRQLETIHRVASNKMEAAILALESGVDIEFPTIDCYSEPLVNA 313
Query: 343 VKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGA--DVVCSPAHQVLALQAA 400
+ + +PES IDRA+ + ++ RLGL + PF + + + + LAL+ A
Sbjct: 314 LTEGLVPESLIDRAVERVLRIKDRLGLLD-----NPFVNENSVPEKLDDHKSRELALKTA 368
Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA-------GPSCRSITPLQ 453
++ IVLLKN + +LPL K+ + +A+IGPNAN + +LG+Y +T LQ
Sbjct: 369 RESIVLLKNENNILPLSKNVN-KIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQ 427
Query: 454 ALQNYV--ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLD------------- 498
+ V +Y GCD + S +A++IA+ AD ++ +MG
Sbjct: 428 GVVKKVGESKVLYAKGCDIASESKEGFAEAIEIARQADVIIAVMGEKSGLPLSWTDIPSE 487
Query: 499 ------QTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRN 552
Q E DR L LPG Q+EL+ + + KP+ILVL+ G P + +
Sbjct: 488 EEFKKYQAVTGEGNDRSSLRLPGVQEELLKELYKTG-KPIILVLINGRP--LVLSPIINY 544
Query: 553 IGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNP 612
+ +++ A +PGE G A+A+VIFGD+NPGGRLP+T +P D ++P+ +P +
Sbjct: 545 VKAVIEAWFPGEEGGNAIADVIFGDYNPGGRLPIT-FPMDTGQIPLY-YNRKPSSF---- 598
Query: 613 GRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPE 672
R Y +F FG GLSY+++ Y V+ ++ N + +
Sbjct: 599 -RPYVMLRSSPLFTFGYGLSYTQFEYSNLEVTPKEIGPNSNIA----------------- 640
Query: 673 LGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEK 732
++I VKN G+M G V L+V RP+K+L GF + L EK
Sbjct: 641 ------------ISIDVKNVGKMEGDDVVQLYVSKTFSSVARPVKELKGFAKIHLKPGEK 688
Query: 733 AEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGD 767
+ F L P E+L+ +V+E+G + L++G+
Sbjct: 689 RRVKFIL-PTEALAFYDSFMRLVVEKGEYQLLIGN 722
>gi|281422547|ref|ZP_06253546.1| beta-glucosidase [Prevotella copri DSM 18205]
gi|281403371|gb|EFB34051.1| beta-glucosidase [Prevotella copri DSM 18205]
Length = 869
Score = 288 bits (736), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 168/447 (37%), Positives = 241/447 (53%), Gaps = 31/447 (6%)
Query: 37 NPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHG 96
N S + P+ L +RA DL RLTL+EK S +++ +PAIPRLGI ++WWSEALHG
Sbjct: 20 NASAQQLPYQNPALSAHERAVDLCGRLTLEEKASLMLDDSPAIPRLGIKRFQWWSEALHG 79
Query: 97 VAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG--- 153
VA +G T FP+ I AASF+ + YR+ A E RA +N Q G
Sbjct: 80 VANMGD----------VTVFPEPIGMAASFNDRMVYRVFDATSDEMRAKWNELQQKGGDV 129
Query: 154 -----MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKL 208
++ W PN+NIFRDPRWGRGQET GEDP +T + + VRG+QG +
Sbjct: 130 TRFHALSVWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGCAVVRGLQGPEDTKYR----- 184
Query: 209 QASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
+ AC KH+ + W T D VT +DL +TY P F+S V+ + +MCAY R
Sbjct: 185 KLWACAKHYAIHSGPEWARHTDNITD--VTPRDLWETYMPAFKSLVQDAKVREVMCAYQR 242
Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
+ P C + LL + R +WGF + SDC AV+ ++ + + +A + AG D
Sbjct: 243 WDDEPCCGNTRLLQQILRDEWGFKYLVVSDCGAVTDFWENHKVSSNARNAAAKGVLAGTD 302
Query: 329 VNCG-SFLQKHTKAAVKQKKL-PESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADV 386
V CG +++ K AVK L E E+D+ + L R LG + N + + KI V
Sbjct: 303 VECGFNYIYKSVPEAVKYGALTEEEEVDKHVIRLLEGRFDLGEMDDNKIVS-WSKIPVSV 361
Query: 387 VCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSC 446
+CS AH+ L+L A + LL+N + +LPL K K +A IGPN ++ + GNY G
Sbjct: 362 LCSKAHRQLSLDMALQTMTLLQNKNEVLPLDK-KVKKIAFIGPNVDNEPMMWGNYNGTPR 420
Query: 447 RSITPLQALQNYVENT--VYYPGCDTV 471
++IT L +++ ++ V + GCD V
Sbjct: 421 QTITILDGIKSRLKKNQVVTFKGCDLV 447
Score = 89.0 bits (219), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 75/264 (28%), Positives = 113/264 (42%), Gaps = 52/264 (19%)
Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
K + VV + G+ E EE+ DR ++ LP Q+ + + EA KK ++
Sbjct: 604 KDCETVVFVGGISPQLEGEEMPIEISGFKGGDRTNIELPKVQRNFLKALKEAGKK--VVF 661
Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK 595
+ C G I + +IL A YPG+ G A+A V+FG++NP G+LP+T+Y
Sbjct: 662 VNCSGSA-IALTPETESCDAILQAWYPGQEGGEAVARVLFGEYNPAGKLPITFYKNSEQL 720
Query: 596 VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSS 655
D M+ GRTYR Y +FPFG GLSY+ + +S + L +
Sbjct: 721 PDFKDYSMK--------GRTYR-YMNDALFPFGYGLSYTSFRMGDATLSNSILKKGE--- 768
Query: 656 TKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRP 715
K + + V N G+ G V ++VK G P
Sbjct: 769 --------------------------KITLKVPVSNVGKKDGTEIVQVYVKDPADTEG-P 801
Query: 716 IKQLVGFQSVILNAKEKAEIVFEL 739
+K L F+ V + A + AE V L
Sbjct: 802 LKSLKAFERVEVKAGKTAEAVITL 825
>gi|399025517|ref|ZP_10727513.1| beta-glucosidase-like glycosyl hydrolase [Chryseobacterium sp.
CF314]
gi|398077894|gb|EJL68841.1| beta-glucosidase-like glycosyl hydrolase [Chryseobacterium sp.
CF314]
Length = 875
Score = 288 bits (736), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 167/457 (36%), Positives = 243/457 (53%), Gaps = 34/457 (7%)
Query: 31 FSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWW 90
F C S +PF LP+ QR +L+ LT+DEKI +++++ A+PRL IPAY WW
Sbjct: 11 FICAFSFGQNYKYPFRNPNLPVEQRIENLLGLLTVDEKIGMMMDNSKAVPRLEIPAYGWW 70
Query: 91 SEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN--- 147
+EALHGVA G AT FPQ I AA++D + + I EARA YN
Sbjct: 71 NEALHGVARAGT----------ATVFPQAIGMAAAWDVPEHLKTFEMISDEARAKYNKSF 120
Query: 148 -----AGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGG 202
G+ G+TFW PNINIFRDPRWGRGQET GEDP +T V+ V+G+QG+
Sbjct: 121 DEASKTGRYEGLTFWTPNINIFRDPRWGRGQETYGEDPYLTSVLGVAAVKGLQGND---- 176
Query: 203 KLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGI 262
+ AC KHF + W R+ ++A V+ +DL +TY P F+S V +G +
Sbjct: 177 --PKYFKTHACAKHFAVHSGPEW---NRHSYNAEVSKRDLYETYLPAFKSLVLEGNVREV 231
Query: 263 MCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPED--AVV 320
MCAYN +G P CA LL++ R +W + G + SDC A++ Y + + P++
Sbjct: 232 MCAYNAFDGQPCCASNTLLNEILRGKWKYDGMVVSDCWALADFYQEKYHGTHPDEKSTAA 291
Query: 321 DVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFG 380
D LK D+ CG K ++ + E +ID ++ + LG+ + ++ +
Sbjct: 292 DALKHSTDLECGDTYNNLNK-SLAGGLITEKDIDISMRRILKGWFELGMLDPKSSVL-WN 349
Query: 381 KIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGN 440
+I VV S H+ AL+ AQ IVL+KN + +LP K+ +A++GPNA+ LGN
Sbjct: 350 QIPYSVVDSDEHKKQALKMAQKSIVLMKNENNILPFNKNIK-KIAVVGPNADDEMMQLGN 408
Query: 441 YAGPSCRSITPLQALQNYVENT--VYYPGCDTVACSS 475
Y G +T L+ ++ NT +Y G + SS
Sbjct: 409 YNGTPSSIVTILEGIKAKFPNTEIIYEKGSEVADPSS 445
Score = 98.2 bits (243), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 80/279 (28%), Positives = 131/279 (46%), Gaps = 52/279 (18%)
Query: 476 ASIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVA 525
AS+ + V K AD +V GL + E EE+ D+ + LP Q+EL+ +
Sbjct: 593 ASVKEKV---KDADVIVFAGGLSPSLEGEEMLVNAEGFKGGDKTSIELPKVQRELLAELR 649
Query: 526 EAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLP 585
+ K PV+ VL G + + + ++N +L A Y G++G A+A+V+ GD+NP GRLP
Sbjct: 650 KTGK-PVVFVLCTGSSLGL--EQDEKNYDVLLNAWYGGQSGGTAVADVLAGDYNPSGRLP 706
Query: 586 MTWYP--QDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAV 643
+T+Y + K + GRTYR+ ++ FG GLSYSK++Y +
Sbjct: 707 VTFYKNLEQLDNALSKTSKHQGFENYDMQGRTYRYMTENPLYAFGHGLSYSKFNYGNAKL 766
Query: 644 SQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLL 703
S+N + N+D++ +T+ V N + G+ V +
Sbjct: 767 SKNSI----------SPNEDII-------------------ITVPVTNISDRDGEEVVQV 797
Query: 704 FVKPARRGNG--RPIKQLVGFQSVILNAKEKAEIVFELS 740
+VK R N P+K L F+ V++ +KE I +S
Sbjct: 798 YVK---RNNDVLAPVKTLRAFERVLIRSKETKNIQLTIS 833
>gi|315606832|ref|ZP_07881841.1| beta-glucosidase [Prevotella buccae ATCC 33574]
gi|315251497|gb|EFU31477.1| beta-glucosidase [Prevotella buccae ATCC 33574]
Length = 858
Score = 287 bits (735), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 175/463 (37%), Positives = 250/463 (53%), Gaps = 35/463 (7%)
Query: 39 STETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVA 98
+ + P+C L +RARDL+SRLTL+EK +++ +PAIPRLGI + WWSEALHG A
Sbjct: 18 TAQLLPYCNPALSARERARDLLSRLTLEEKARLMLDESPAIPRLGIKKFFWWSEALHGAA 77
Query: 99 GVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG----- 153
+G G T FP+ + AASF+ L R+ A E RA YN G
Sbjct: 78 NMG----------GVTVFPEPVGMAASFNDGLLRRVFDAASDEMRAQYNRRMLNGGEDEK 127
Query: 154 ---MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQ 209
++ W PN+NIFRDPRWGRGQET GEDP +T + VRG+QG +T KL
Sbjct: 128 FHSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSVMGTAVVRGLQGPETAKYRKL----- 182
Query: 210 ASACCKHFTAYDLDNWKGTTRYKFD-ARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
AC KH+ + + TR+ + A V+ +DL +TY P F++ V + + +MCAY R
Sbjct: 183 -WACAKHYAVHSGPEY---TRHTANVADVSPRDLWETYLPAFKTLVTEAKVREVMCAYQR 238
Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
++ P C++ LL + R +WGF+ + SDC AV+ IY + A AG D
Sbjct: 239 LDDDPCCSNNRLLQQILRDEWGFNYLVVSDCGAVTDIYANHKTSSDAVHAAAKAAVAGTD 298
Query: 329 VNCG-SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVV 387
V CG + K AV++ + E+E+D+ + L R LG + +P + + KI A V+
Sbjct: 299 VECGFGYAYKTIPEAVRRGLITEAEVDKHVLRLLEGRFDLGEMD-DPKLVEWSKIPASVM 357
Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
S AH+ LAL A+ +VLL+N G+LPL K+ + +IGPNA+ + GNY G R
Sbjct: 358 DSKAHRQLALDMARQSLVLLQNKGGVLPL-KAGGDPITVIGPNADDGPMMWGNYNGTPNR 416
Query: 448 SITPLQALQNYVENTVYYPGC---DTVACSSASIDKAVDIAKG 487
++T L ++ Y GC DT +S A+D KG
Sbjct: 417 TVTILDGIKARHTRVTYLKGCDLTDTKTVNSLLPQCAIDGRKG 459
Score = 76.6 bits (187), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 70/265 (26%), Positives = 110/265 (41%), Gaps = 56/265 (21%)
Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
+G VV + G+ E EE+ DR ++ LP Q++ + + EA K ++
Sbjct: 597 QGIRKVVFVGGISAALEGEEMPVDIDGFKGGDRTNIELPKVQRDFLRALHEAGK--TVVF 654
Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK 595
+ C G I +IL A Y G+ G A+++V+FG NP G+LP+T+Y +
Sbjct: 655 VNCSGSA-IALEPEMETCDAILQAWYAGQEGGTAVSDVLFGTVNPSGKLPVTFYKRTDQL 713
Query: 596 VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSS 655
D MR GRTYR++ +F FG GLSY+ + +
Sbjct: 714 PDYEDYSMR--------GRTYRYFS-DPLFAFGYGLSYTTFRF----------------G 748
Query: 656 TKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRP 715
E + + SVP + N G G+ V ++++ NG P
Sbjct: 749 RARAEAAEGGYRLSVP-----------------LTNTGTRPGEEVVQVYIRRVADTNG-P 790
Query: 716 IKQLVGFQSVILNAKEKAEIVFELS 740
+K L F+ V L A E + LS
Sbjct: 791 LKSLRAFRRVALKAGESTTVEIPLS 815
>gi|393781363|ref|ZP_10369562.1| hypothetical protein HMPREF1071_00430 [Bacteroides salyersiae
CL02T12C01]
gi|392676856|gb|EIY70278.1| hypothetical protein HMPREF1071_00430 [Bacteroides salyersiae
CL02T12C01]
Length = 863
Score = 287 bits (734), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 173/442 (39%), Positives = 241/442 (54%), Gaps = 39/442 (8%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
PF + LP+ +RA+DL+ RLTL EK+ + + + IPRLGI Y WW+EALHGV G
Sbjct: 22 LPFNNSDLPVEERAQDLLQRLTLQEKVLLMCDYSSPIPRLGIKRYNWWNEALHGVGRAGL 81
Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-------GMT 155
AT FPQ I AA+FD + + + EARA Y+ + G+T
Sbjct: 82 ----------ATVFPQAIGMAATFDDCAVRQAFECVSDEARAKYHHSENKEGSERYQGLT 131
Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCK 215
FW PN+NIFRDPRWGRGQET GEDP +T + ++ VRG+QG + + + AC K
Sbjct: 132 FWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGLAVVRGLQGPSESKYD-----KLHACAK 186
Query: 216 HFTAYDLDNWKGTTRYKFDA-RVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPS 274
H+ + W R+ FD ++ +DL +TY P F++ V+QG +MCAYNR G P
Sbjct: 187 HYALHSGPEW---NRHSFDVDSISPRDLWETYLPAFKALVQQGGVKEVMCAYNRFEGEPC 243
Query: 275 CADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDVNCG 332
C LL R +WGF G + SDC A+S Y + P E AV +KAG D++CG
Sbjct: 244 CGSNRLLYNILREEWGFDGLVVSDCGAISDFYLKGHHETHPTKEAAVAAAVKAGTDLDCG 303
Query: 333 ---SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
LQK AV++ + E +ID +L L R LGL + + + I VV S
Sbjct: 304 VDYYALQK----AVEEGIITEKQIDVSLFRLLKARFELGLMD-EEHLVSWSDIPYTVVDS 358
Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
H+ AL+ A+ + LLKN HG LPL K +A+IGPNAN + + GNY G ++
Sbjct: 359 EKHREKALEMARKSMTLLKNDHGTLPLSKHCG-KIAVIGPNANDSVMMWGNYNGFPSHTV 417
Query: 450 TPLQALQNYV--ENTVYYPGCD 469
T L+ + + + E +Y GC+
Sbjct: 418 TILEGITHKLGAEQIIYDKGCE 439
Score = 111 bits (278), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 79/262 (30%), Positives = 129/262 (49%), Gaps = 51/262 (19%)
Query: 488 ADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLL 537
A+ +V + G+ E EEL DR + LP Q++L+ + + K PVIL+L
Sbjct: 600 AEVIVFVGGISPKVEGEELPVSFPGFKGGDRTVIELPQVQRDLLQELHKTGK-PVILILC 658
Query: 538 CGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVP 597
G + ++ A+ D +I+ A Y G+AG A+A+V+FGD+NP GRLP+T+Y
Sbjct: 659 SGSAIGLS-AEVDL-ADAIIQAWYLGQAGGTAVADVLFGDYNPAGRLPVTFYKATEQLPD 716
Query: 598 MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTK 657
D M+ GRTYR++EG+ +FPFG GLSY+ + +S+ ++ N+S S K
Sbjct: 717 FEDYSMQ--------GRTYRYFEGEALFPFGYGLSYTSFEIGKARLSKKRIRENESVSLK 768
Query: 658 MVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIK 717
+ V+N G++ G + ++++ + G P+K
Sbjct: 769 LT-----------------------------VENTGKLDGDEVIQIYIRKLQDKEG-PLK 798
Query: 718 QLVGFQSVILNAKEKAEIVFEL 739
L F+ L A EK ++ F L
Sbjct: 799 TLRAFKRFHLRAGEKKDVTFHL 820
>gi|399070782|ref|ZP_10749848.1| beta-glucosidase-like glycosyl hydrolase [Caulobacter sp. AP07]
gi|398043993|gb|EJL36849.1| beta-glucosidase-like glycosyl hydrolase [Caulobacter sp. AP07]
Length = 900
Score = 287 bits (734), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 169/416 (40%), Positives = 234/416 (56%), Gaps = 41/416 (9%)
Query: 56 ARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATS 115
A LV ++TLDEK+ QL+N+APAIPRLGIPAY WW+E+LHG G T+
Sbjct: 49 AAQLVGQMTLDEKVEQLLNTAPAIPRLGIPAYNWWTESLHGALGALP----------TTN 98
Query: 116 FPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG--------MTFWAPNINIFRDP 167
FP+ I AA+FD L + + A+G E R L+ ++ G + W+PNINIFRDP
Sbjct: 99 FPEPIGLAATFDPALVHDVAGAVGSEVRGLHTLARSTGRMGRIGGGLDTWSPNINIFRDP 158
Query: 168 RWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKG 227
RWGRGQET GEDP +T + V++V+GVQG + Q A KHF + +
Sbjct: 159 RWGRGQETYGEDPFLTARVGVAFVQGVQGPNPD------LPQVIATPKHFAVH---SGPE 209
Query: 228 TTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARR 287
+TR+ + V+ DL DTY P F + + +GRA +MCAYNR++G P+CA LL R+
Sbjct: 210 STRHSANVFVSRHDLEDTYLPAFRAAITEGRAGSVMCAYNRIDGQPACASDLLLKDHLRQ 269
Query: 288 QWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSF-------LQKHTK 340
W F GY+ SDCDAV I D YA AV ++AG+D C L +
Sbjct: 270 AWRFDGYVVSDCDAVKDISDHHRYAPDAAAAVAAAMRAGVDNECNGATLTDTDGLADRYR 329
Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
A+++ + +++DRAL LFS R+R G G +P + V +P H VLAL AA
Sbjct: 330 EALRRDLISVADVDRALVRLFSARLRTGDLPG--VRKPETPVALASVMTPEHAVLALAAA 387
Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQ 456
+ +VLLKN G+LPL +A+IGP ++ + L GNY+ S S P+ L+
Sbjct: 388 EKSLVLLKND-GVLPL--KADTRIAVIGPLGDATRVLRGNYS--SALSAPPISVLE 438
Score = 110 bits (274), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 93/303 (30%), Positives = 136/303 (44%), Gaps = 55/303 (18%)
Query: 477 SIDKAVDI---AKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITR 523
S D A D+ A AD +V ++GL E EE D+ L +P QQ L+ +
Sbjct: 615 SDDPAGDLKAAAAQADVLVAVVGLTSDLEAEESPVEIPGFKGGDKTTLDIPSDQQALLEQ 674
Query: 524 VAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGR 583
A+A KP+++V + G P+++++AK N +IL A YPG++G +A+A V+ G NP GR
Sbjct: 675 -AKALGKPLVVVTMNGSPLNLSWAK--DNASAILEAWYPGQSGGLAVANVLSGKTNPSGR 731
Query: 584 LPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAV 643
LP+T+Y P D M GRTYR++EG V+ FG GLSY++++Y+ V
Sbjct: 732 LPLTFYRSVDDLPPFGDYAM--------AGRTYRYFEGTPVYSFGYGLSYTRFAYEPLKV 783
Query: 644 SQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLL 703
+ VT V+N G +G L
Sbjct: 784 EAAAG-----------------------------GAGKGLRVTTSVRNLGTRSGDEVAQL 814
Query: 704 FVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFL 763
++ G P L GFQ + L E I F LS LS DG + G + +
Sbjct: 815 YLD-FPDAPGAPRLALRGFQRLTLKPGESRAITFALS-SRDLSSVDPDGQRRVASGRYRV 872
Query: 764 VVG 766
VG
Sbjct: 873 SVG 875
>gi|94495010|ref|ZP_01301591.1| family 3 glycoside hydrolase [Sphingomonas sp. SKA58]
gi|94425276|gb|EAT10296.1| family 3 glycoside hydrolase [Sphingomonas sp. SKA58]
Length = 872
Score = 286 bits (733), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 172/418 (41%), Positives = 227/418 (54%), Gaps = 48/418 (11%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
+AR +V +LTL EK QL+N+APAIPRLGIPAY WW+E+LHG G
Sbjct: 33 DKARAIVRQLTLQEKTEQLLNTAPAIPRLGIPAYNWWTESLHGAIGPVP----------T 82
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ--------AIGMTFWAPNINIFR 165
T+FP+ I AA+FD+ L + + I E +AL+ G+ G+ W+PNINIFR
Sbjct: 83 TNFPEPIGLAATFDAPLIHTVASVISTEVQALHTLGRQTGHLGRIGTGLDTWSPNINIFR 142
Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
DPRWGRGQET GEDP +T + V++++G+QGD + A KHF + +
Sbjct: 143 DPRWGRGQETYGEDPYLTARMGVAFIQGMQGDNPD------LPDVVATPKHFAVH---SG 193
Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
+R+ + T +DL DTY P F + + +G+A IMCAYNRV+G P+C LL+
Sbjct: 194 PEPSRHTDNIFATKRDLEDTYLPAFRAAIVEGKAGSIMCAYNRVDGEPACGSAMLLTDYL 253
Query: 286 RRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSF-------LQKH 338
R WGF GY+ SDCDAV IY YA SP V L+ G+D C + L
Sbjct: 254 RNAWGFRGYVVSDCDAVVDIYAHHKYAPSPATGVSVALRRGVDSECNNATLSGQAGLGDR 313
Query: 339 TKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNG----NPTMQPFGKIGADVVCSPAHQV 394
K A+ + +ID AL LFS R+R G G P P IG +PAH
Sbjct: 314 YKDALAANHISIGDIDSALVRLFSARLRNGDLPGLSARKPNSTPVSAIG-----TPAHGA 368
Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
LAL AA +VLLKN G LPL V + L GP A++ + L GNY+ S RS P+
Sbjct: 369 LALDAAVKSLVLLKN-EGALPL--KPGVRIVLAGPLADATRVLRGNYS--STRSAPPI 421
Score = 124 bits (312), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 100/297 (33%), Positives = 138/297 (46%), Gaps = 55/297 (18%)
Query: 483 DIAKG---ADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAK 529
D+A G AD +V +MGL E EE+ DR L LP Q + + A+A
Sbjct: 593 DLAAGVRDADVIVAVMGLTSDLEGEEMPVKVEGFEGGDRTTLALPADQIAFLEK-AKATG 651
Query: 530 KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWY 589
KP+ILV++ G +D+ +AK N +IL A YPG++G +A+A V+ G +PGGRLP+T+Y
Sbjct: 652 KPLILVMMNGSAIDLGWAK--DNAAAILEAWYPGQSGGLAIANVLSGKADPGGRLPLTFY 709
Query: 590 PQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLY 649
P TD M GRTYR++ G V+PFG GLSY+++ Y
Sbjct: 710 HSVDDLPPFTDYSME--------GRTYRYFRGAPVYPFGHGLSYTRFRY----------- 750
Query: 650 LNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPAR 709
+ +VE D K VT + N G+ G L++ P
Sbjct: 751 -----APLVVEPIDGAVEKG-------------LRVTTRITNVGQRPGDEVAQLYITPPA 792
Query: 710 RGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
G P L GFQ + L A E I F LSP LS G ++ G + L VG
Sbjct: 793 F-EGAPRTALRGFQRLSLKAGESRAISFTLSP-RDLSFVTMAGDRMLIPGNYDLSVG 847
>gi|223936933|ref|ZP_03628842.1| Beta-glucosidase [bacterium Ellin514]
gi|223894502|gb|EEF60954.1| Beta-glucosidase [bacterium Ellin514]
Length = 774
Score = 286 bits (732), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 229/728 (31%), Positives = 361/728 (49%), Gaps = 108/728 (14%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIP + E LHG A R TSFPQ I A+F+ L ++
Sbjct: 112 RLGIPVM-FHEECLHGHAA-----------RDGTSFPQPIGLGATFNPALVEKLYAMTAH 159
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
E R + QA+ P +++ RD RWGR +ET GEDP + + ++ VRG QGD
Sbjct: 160 ETR-VRGGHQAL-----TPVVDVARDARWGRVEETYGEDPFLNTQLGIAAVRGFQGD--- 210
Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
K K A KHF A+ + + V+ + L +T+ PF C+K+G A
Sbjct: 211 -ASFKDKKHVIATLKHFAAHGQPE---SGQNCAPVNVSERLLRETFLHPFRDCLKKGGAI 266
Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAV---SIIYDAEGY--AKSP 315
+M +YN ++G+PS A R LL R++WGF G++ SD A+ S D+ G+ A
Sbjct: 267 SVMASYNEIDGVPSHASRWLLRDVLRKEWGFKGFVVSDYYAIWELSHRPDSHGHHVAADK 326
Query: 316 EDAVVDVLKAGMDV-----NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLF 370
++A V +KAG+++ +C +H V++K L E+E+D + + + ++GLF
Sbjct: 327 KEACVLAVKAGVNIEFPEPDC----YRHLVELVRKKVLHETELDELIAPMLLWKFKMGLF 382
Query: 371 NGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPN 430
+ +P + P + A VV H+ LA +AA++ I LLKN + LLPL +K ++A+IGPN
Sbjct: 383 D-DPYVDP--EEAARVVGCEVHRELASEAARETITLLKNENDLLPLNPAKLKTVAVIGPN 439
Query: 431 ANSAKTLLGNYAGPSCRSITPLQALQNYVE---NTVYYPGC----------DTVACSSAS 477
AN ++LLG Y+G ++T L ++ + V+ GC D V S +
Sbjct: 440 AN--RSLLGGYSGVPAHNVTVLDGIKARLGGAVKVVHAEGCKITVGGSWQQDEVLASDPA 497
Query: 478 -----IDKAVDIAKGADHVVLMMGLDQTQEKEEL------DRVDLVLPGRQQELITRVAE 526
ID+AV +A AD V++ +G ++ +E DR L L G Q ELI R
Sbjct: 498 EDRKQIDEAVKVAWSADVVIVAIGGNEQTSREAWSLKHMGDRTSLDLIGHQDELI-RALL 556
Query: 527 AAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPM 586
A KPV+ ++ G P+ I +N+ +IL Y G+ A+A V+FGDHNPGG+LP+
Sbjct: 557 ATGKPVVALVFNGRPLAINHVA--QNVPAILECWYLGQECGSAVAAVLFGDHNPGGKLPI 614
Query: 587 TWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQN 646
+ P+ ++P+ +P A G + + E +FPFG GLSY+K++ FK V
Sbjct: 615 S-IPRSVGQLPVF-YNHKPSARRG-----FLWDEATPLFPFGFGLSYTKFT--FKNVRLA 665
Query: 647 KLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVK 706
K ++++ ST V++ V N G+ AG V ++V+
Sbjct: 666 KKIISRTGSTH---------------------------VSVDVTNAGKRAGTEVVQVYVR 698
Query: 707 PARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
RP+K+L FQ + L E + +L+P ESL+ + V+E G ++VG
Sbjct: 699 DLISSVTRPVKELKVFQKITLAPGETKTVSLDLTP-ESLAFYDVNMKYVVEPGEFEIMVG 757
Query: 767 DEEYPISI 774
+ + +
Sbjct: 758 NSSRDVDL 765
>gi|167533461|ref|XP_001748410.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163773222|gb|EDQ86865.1| predicted protein [Monosiga brevicollis MX1]
Length = 781
Score = 286 bits (731), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 246/812 (30%), Positives = 366/812 (45%), Gaps = 172/812 (21%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQLVN----SAPAIPRLGIPAYEWWSEALHGVA 98
P+ TLP+S+R LV+ +T+ E QL +APAI RLG+P+Y+W + LHG+
Sbjct: 29 LPWFDPTLPLSERVALLVNNMTIKEVSMQLSTNFDTAAPAIERLGVPSYDWRNNFLHGL- 87
Query: 99 GVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA------- 151
V GI AT FPQ I AA++ + L IG+ +E RA YN +
Sbjct: 88 -VDNGI--------ATMFPQAIGLAATWSTELLTAIGRVGAVEQRAKYNINKTANGDVPM 138
Query: 152 -IGMTFW-----------APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG--D 197
G+ W PN R RGQET GEDP++TG V+ V+G+Q D
Sbjct: 139 NYGLNVWQVAGHESSTLVKPN----RSSNACRGQETYGEDPVLTGTLGVAIVKGLQTLPD 194
Query: 198 TFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQG 257
+ + L A KHF AY LD K R FD ++M DL TY P F++ V++G
Sbjct: 195 SAD------YLALGATPKHFVAYSLD--KTPPRLSFDPTISMVDLRQTYFPAFKAVVQEG 246
Query: 258 RASGIMC-------------------------------------------AYNRVNGIPS 274
RA+ +MC +YN VNG P
Sbjct: 247 RATSLMCRFVILTTHKAVHICWIACLRARATNVCKPLSMTSALASRTLPHSYNGVNGYPM 306
Query: 275 CADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-S 333
CA +L R +WGF G+ TSD DA+ + Y+ + A L AG+D+N G +
Sbjct: 307 CAS-PMLDLIVRNEWGFDGFFTSDSDAIVFFVSEQNYSTNTIHAAAAALNAGVDLNSGPA 365
Query: 334 FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQ 393
+L+ H A + + E + + LF+ R+R G F+ + + PF + SP HQ
Sbjct: 366 YLELHD--AYEHGLVTEQALRTSAERLFTFRLRTGEFDPD-ELVPFSSYDERNISSPVHQ 422
Query: 394 VLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQ 453
L L+ A++ +VLLKNS +LPL + +A+IG + A
Sbjct: 423 ALNLRVAEESLVLLKNSQDVLPLDLASLKHVAIIGLSTTLAT------------------ 464
Query: 454 ALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLD----------QTQEK 503
N V T + GCD + ++ I AV+ A+ AD V L++GL E
Sbjct: 465 ---NNVTFT-FEEGCDIESNDTSRIPAAVEAARQADVVFLVLGLHVCTEHANTPYHNAEC 520
Query: 504 EELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPG 563
E DR + LPG Q+EL + A + VI + + GGPV +R+ ++L A Y G
Sbjct: 521 EGHDRESIGLPGVQKELAESIFSAHTR-VITIYVNGGPVASPIVA-ERS-AALLEAWYGG 577
Query: 564 EAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKE 623
+ A++ +IFG +P GRLP P D + M M S PGRTYR+++
Sbjct: 578 QHAGTAVSNIIFGKVSPSGRLPYLVPPSDEDLPDILSMTM-----SDFPGRTYRYFQNDP 632
Query: 624 VFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKF 683
+ FG GLSY+++ Y A + L N + + K P+L E
Sbjct: 633 LHDFGFGLSYTRFEY--AAATMAPLTFNLTENLK-------------PQLKLE------- 670
Query: 684 LVTIGVKNHGEMAGKHPVLLFVKPAR------RGNGRPIKQLVGFQ---SVILNAKEKAE 734
+ V+N G+ AGK V ++ + + R P++QL+ F + +A+E+ +
Sbjct: 671 ---VTVRNVGKRAGKEVVQVYARLRQTTALQERYPSIPLRQLLAFTKTGELKTSAEERIK 727
Query: 735 IVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
+ LSP L A G + EG + + VG
Sbjct: 728 LEIVLSP---LQLANAHGELEYPEGVYDVFVG 756
>gi|365121645|ref|ZP_09338561.1| hypothetical protein HMPREF1033_01907 [Tannerella sp.
6_1_58FAA_CT1]
gi|363645135|gb|EHL84409.1| hypothetical protein HMPREF1033_01907 [Tannerella sp.
6_1_58FAA_CT1]
Length = 868
Score = 285 bits (730), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 169/454 (37%), Positives = 251/454 (55%), Gaps = 40/454 (8%)
Query: 31 FSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWW 90
FS NP P+ L +RA DL++R+TL EK +Q+ N+ I RLG+ Y+WW
Sbjct: 17 FSFRAENP-----PYKNPELSPDERALDLLNRMTLKEKFAQMHNNTGGIERLGVRPYDWW 71
Query: 91 SEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ 150
+EALHG+A GK AT FPQ I AA+FD Y + + E RA Y+ Q
Sbjct: 72 NEALHGIARAGK----------ATVFPQAIGLAATFDDTAVYEMFDMVSDEGRAKYHDFQ 121
Query: 151 AIGM-------TFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGK 203
GM TFW PNINIFRDPRWGRG ET GEDP +T K ++ V+G+QGD G
Sbjct: 122 RKGMYNGYKGLTFWTPNINIFRDPRWGRGMETYGEDPFLTTKMGLAVVKGLQGD----GT 177
Query: 204 LKGKLQASACCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGI 262
K +A AC KH+ + W R+ ++A ++++DL +TY P F++ V +G+ +
Sbjct: 178 QKYD-KAHACAKHYAVHSGPEW---NRHSYNAENISIRDLRETYLPAFKALVTEGKVKEV 233
Query: 263 MCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDA---EGYAKSPEDAV 319
MCAYNR G P C+++ LL + +WGF I SDC A++ Y E +A + + +
Sbjct: 234 MCAYNRFEGEPCCSNKTLLINILKDEWGFDDVIVSDCGAIADFYTKGRHETHASAADASA 293
Query: 320 VDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
V+ +G D+ CG A+++ + E++I+ ++ L R LG+F+ + ++ +
Sbjct: 294 DAVI-SGTDLECGGSYWA-LDEALEKGLITETKINESVFRLLRARFELGMFD-DDSLVSW 350
Query: 380 GKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLG 439
I VVC H+ AL+ A+ +VLL N + LPL KS +A++GPNAN + L
Sbjct: 351 SSIPYSVVCCDKHKAKALEMARKSMVLLSNKNNTLPLSKSIK-KVAVMGPNANDSVMLWA 409
Query: 440 NYAGPSCRSITPLQALQNYVE--NTVYYPGCDTV 471
NY G RS+T L+ ++ + + +Y GCD V
Sbjct: 410 NYNGTPDRSVTILEGIKAKLPEGSVIYEKGCDYV 443
Score = 116 bits (290), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 82/277 (29%), Positives = 137/277 (49%), Gaps = 50/277 (18%)
Query: 475 SASID-KAV-DIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELIT 522
S ID KAV + K AD ++ + G+ + E EE+ DR ++ LP Q+ ++
Sbjct: 587 SRQIDYKAVAEKVKDADAIIFVGGISSSLEGEEMGVKYPGFRNGDRTNIDLPQVQKNMMK 646
Query: 523 RVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGG 582
+ E K PVI VL G + +++ D+N+ +IL A YPG+ G A+A+V+FGD+NP G
Sbjct: 647 ALKETGK-PVIFVLCSGSTMALSWE--DKNMDAILQAWYPGQEGGTAVADVLFGDYNPAG 703
Query: 583 RLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKA 642
RLP+T+Y + M S GRTYR+++GK ++PFG GLSY+ +SY
Sbjct: 704 RLPLTFYASSDDLPDFENYNM-----SEGQGRTYRYFKGKPLYPFGHGLSYTGFSYSKAK 758
Query: 643 VSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVL 702
+++ + +N S +++ +KN G G V
Sbjct: 759 LNKKSMSVNDS-----------------------------VFLSLNLKNTGLRDGDEVVQ 789
Query: 703 LFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFEL 739
++++ + G P K L G++ V + A + + +L
Sbjct: 790 VYIRNLQDPEG-PSKSLRGYKRVSVKAGQTVPVKIDL 825
>gi|313204104|ref|YP_004042761.1| beta-glucosidase [Paludibacter propionicigenes WB4]
gi|312443420|gb|ADQ79776.1| Beta-glucosidase [Paludibacter propionicigenes WB4]
Length = 871
Score = 285 bits (730), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 168/444 (37%), Positives = 240/444 (54%), Gaps = 34/444 (7%)
Query: 41 ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
+ P+ L +RA+DL+SRLTL EK + + + + A+PRLGI + WWSEALHG A
Sbjct: 28 QQLPYKNPNLSSEERAKDLISRLTLSEKATLMCDISDAVPRLGIKKFNWWSEALHGFAN- 86
Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGM------ 154
NG + T FP+ + AASFD L Y+I A+ EARA YN G+
Sbjct: 87 ------NGNV---TVFPEPVGMAASFDDNLVYKIFDAVSDEARAKYNEATKKGLENARFL 137
Query: 155 --TFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASA 212
+ W PN+NIFRDPRWGRGQET GEDP +T + VS V+G+QG K + L A
Sbjct: 138 SLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVSVVKGLQGPA--DAKYRKLL---A 192
Query: 213 CCKHFTAYDLDNWKGTTRYKFDA-RVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
C KHF + W +R+ + + +DL +TY P F+S V++ +MCAY R++
Sbjct: 193 CAKHFAVHSGPEW---SRHSLNINNLDPRDLWETYLPAFKSLVQKSDVRQVMCAYQRLDD 249
Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC 331
P C + LL + R +WG+ + +DC AVS Y + + A + AG DV C
Sbjct: 250 EPCCGNTQLLQRILRDEWGYKYMVVADCGAVSDFYTSHKVSSDAVHAASKGVWAGTDVEC 309
Query: 332 --GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
+ + K AV + + E+EI++ L N+ R LG + + + P+ KI VV +
Sbjct: 310 QWDNHIYKQLPDAVAKGLITEAEINKHLLNVLIGRFDLGEMDDD-ALVPWSKIPMSVVNN 368
Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
H+ LAL A + LL+N + +LPL KSK + A++GPNAN L GNY G R+I
Sbjct: 369 EEHRKLALDMALKSMTLLQNKNNILPLSKSKKI--AVVGPNANDKPMLWGNYNGTPVRTI 426
Query: 450 TPLQALQNYV--ENTVYYPGCDTV 471
T L + + + +Y GCD V
Sbjct: 427 TILDGITSKLSANKVLYEKGCDLV 450
Score = 95.5 bits (236), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 74/233 (31%), Positives = 111/233 (47%), Gaps = 42/233 (18%)
Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
DR D+ LP Q+ + + EA K+ ++ + C G I + +IL A Y GE+G
Sbjct: 638 DRTDIELPAVQRNCLKALKEAGKQ--VIFVNCSGSA-IALIPETESCDAILQAWYGGESG 694
Query: 567 AVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFP 626
A+A+V+FGD+NP G+LP+T+Y D M+ GRTYR Y +FP
Sbjct: 695 GQAVADVLFGDYNPSGKLPITFYKSVKQLSDFEDYSMK--------GRTYR-YMSDPLFP 745
Query: 627 FGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVT 686
FG GLSY+ ++ +S ++ NQS VE +T
Sbjct: 746 FGFGLSYTTFAIGNAKLSNPEIKSNQS-----VE------------------------LT 776
Query: 687 IGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFEL 739
I V N G+ +G V ++V+ +G P+K L GFQ V + A + A+ V L
Sbjct: 777 IPVSNKGKRSGTEIVQVYVRMVNDTDG-PLKTLKGFQRVEVAAGKTAQAVINL 828
>gi|255690202|ref|ZP_05413877.1| beta-glucosidase [Bacteroides finegoldii DSM 17565]
gi|260624221|gb|EEX47092.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
finegoldii DSM 17565]
Length = 853
Score = 285 bits (730), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 167/424 (39%), Positives = 242/424 (57%), Gaps = 34/424 (8%)
Query: 51 PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
P+ +R DL+SRLT++EKIS L ++P IPRLGI Y +EALHGV G+
Sbjct: 35 PVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPGR-------- 86
Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ---------AIGMTFWAPNI 161
T FPQ I AA+++ L RI I EARA +N + +TFW+P +
Sbjct: 87 --FTVFPQAIGLAATWNPELQKRIATVISDEARARWNELDQGRNQKEQFSDVLTFWSPTV 144
Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
N+ RDPRWGR ET GEDP ++G ++V+G+QGD + L+ + KHF A
Sbjct: 145 NMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQGDDPH------YLKIVSTPKHFAA-- 196
Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
N + R+ + +++ + L + Y P FE CVK+G+A+ IM AYN +N +P + LL
Sbjct: 197 --NNEEHNRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMTAYNALNNVPCTLNSWLL 254
Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTK 340
K RR WGF GY+ SDC S++ +A Y K+ E A +KAG+D+ CG ++
Sbjct: 255 QKVLRRDWGFQGYVVSDCGGPSLLVNAHKYVKTKEAAATLSIKAGLDLECGDDVYDEYLL 314
Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
A KQ E++ID A +++ + RM+LGLF+G P+ KI V+ S HQ +AL AA
Sbjct: 315 NAYKQYMASEADIDSAAYHVLTARMKLGLFDG-VERNPYAKISPSVIGSKEHQTVALNAA 373
Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQALQNYV 459
++ IVLLKN +LPL K S+A++G NA K G+Y+G P ++ LQ ++N V
Sbjct: 374 RECIVLLKNQKNMLPLNVKKLKSIAVVGINA--GKCEFGDYSGAPVVEPVSILQGIKNRV 431
Query: 460 ENTV 463
+ V
Sbjct: 432 GDKV 435
Score = 115 bits (289), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 80/279 (28%), Positives = 140/279 (50%), Gaps = 44/279 (15%)
Query: 489 DHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAK 548
+ VV +MG++++ E+E DR D+ LP Q+E + + + +ILVL+ G + + +
Sbjct: 604 ETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNPN-IILVLVAGSSLAVNWE- 661
Query: 549 YDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQAT 608
+ ++ +I+ A YPGE G A+AEV+FGD+NP GRLP+T+Y + ++P D
Sbjct: 662 -NEHLPAIVNAWYPGEQGGTAVAEVLFGDYNPAGRLPLTYY-KSLEQLPAFD------DY 713
Query: 609 SGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYK 668
GRTY++++ ++PFG GLSY+ + Y S+ K+ + V+
Sbjct: 714 DITKGRTYQYFKKDVLYPFGYGLSYTTFKY---------------SNLKVDDAGKTVN-- 756
Query: 669 SVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVK-PARRGNGRPIKQLVGFQSVIL 727
V+ +KN G+ AG ++V+ P G+ + I+QL GF+ V L
Sbjct: 757 ----------------VSFTLKNTGKRAGDEVAQVYVRLPEIAGSTQAIRQLKGFRRVAL 800
Query: 728 NAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
A E ++ L + + V+ +G+ +VG
Sbjct: 801 KAGESRKVEITLDKEQLRYWDEKQACFVVPQGSFTFMVG 839
>gi|254295141|ref|YP_003061164.1| glycoside hydrolase [Hirschia baltica ATCC 49814]
gi|254043672|gb|ACT60467.1| glycoside hydrolase family 3 domain protein [Hirschia baltica ATCC
49814]
Length = 897
Score = 285 bits (729), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 185/500 (37%), Positives = 265/500 (53%), Gaps = 62/500 (12%)
Query: 20 LLTRVDSTQPPFSCDPSNPSTET-----------FPFCKTTLPISQRARDLVSRLTLDEK 68
LL + S +C S TET F F +L +RA DLVS +TL+EK
Sbjct: 8 LLGTIASLAFATACSSSQTDTETAQTTEEAKSSEFRFMDPSLSPKERALDLVSHMTLEEK 67
Query: 69 ISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDS 128
+Q+ + A AIPRLG+ Y WW+EALHGVA G AT FPQ I AA++D
Sbjct: 68 AAQMYDKAAAIPRLGLHEYNWWNEALHGVARAGH----------ATVFPQAIGMAATWDE 117
Query: 129 YLWYRIGQAIGLEARALYN--AGQAI-----GMTFWAPNINIFRDPRWGRGQETPGEDPL 181
L + I E RA ++ A + + G+TFW+PNINIFRDPRWGRGQET GEDP
Sbjct: 118 DLMLEVANVISDEGRAKHHFYANEDVYAMYGGLTFWSPNINIFRDPRWGRGQETYGEDPY 177
Query: 182 VTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQD 241
+TG+ AV+++ G+QGD ++ A KH+ + + +R++ + T D
Sbjct: 178 LTGRMAVNFINGLQGDD------DKYFKSVATVKHYAVH---SGPEPSRHRDNYIATDAD 228
Query: 242 LADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDA 301
L +TY P F++ + + +MCAYN V G P+C L+ R + GF GY+ SDC A
Sbjct: 229 LYETYLPAFKTAFDETEVASVMCAYNAVWGDPACGSERLMKDLLREELGFDGYVVSDCGA 288
Query: 302 VSIIY-----DAEGYAKSPEDAVVDV--------LKAGMDVNCGSFLQKHTKA---AVKQ 345
+ Y AEG A VD + G D+NCG A AVK+
Sbjct: 289 IGDFYYDEEKKAEGTAPYAAHDHVDTRAQAAALSVNMGTDLNCGDGEGNKMDALPQAVKE 348
Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
+ E ID+++ L+S +LG+++ +P++ P+ I D V SP+H + +AA+ +V
Sbjct: 349 GLITEETIDQSVVRLYSALFKLGMYD-DPSLVPWSNISIDTVASPSHLEKSEEAARASLV 407
Query: 406 LLKNSHGLLPL-PKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV--ENT 462
LLKN G+LPL P +K +A+IGPNA++ TL+ NY G +T L+ ++ + EN
Sbjct: 408 LLKND-GILPLKPDTK---VAVIGPNADNWWTLVANYYGQPTAPVTALKGIKAKIGAENV 463
Query: 463 VYYPGCDTVACSSASIDKAV 482
Y G T+A S KAV
Sbjct: 464 SYSVGS-TIAGDIYSNYKAV 482
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 76/288 (26%), Positives = 129/288 (44%), Gaps = 54/288 (18%)
Query: 491 VVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGG 540
++ G+D E EE+ DR + LP Q++L+ + A KPV+LV G
Sbjct: 633 ILFFGGIDANLEGEEMGVELDGFLGGDRTHINLPAPQEKLLKEL-HATGKPVVLVNFSGS 691
Query: 541 PVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTD 600
+ + + D N+ +I+ A YPGE A+A++++G+ +P GRLP+T+Y D
Sbjct: 692 AMALNWE--DENLPAIVQAFYPGEKSGTAIADLLWGEFSPSGRLPVTFYKSLEGMPAFDD 749
Query: 601 MKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVE 660
M RTY++YEG++++PFG GLSY+ + Y
Sbjct: 750 YSME--------NRTYKYYEGEQLYPFGHGLSYTSFEY---------------------- 779
Query: 661 NQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGN-GRPIKQL 719
S +L T + V++ V N G+ A + V +V N P +L
Sbjct: 780 --------SDLKLETAYAANENLQVSVKVTNSGDKASREIVQAYVTRDTLANVSTPRVEL 831
Query: 720 VGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVI-EEGTHFLVVG 766
F ++ L KE + + P +++ E+G + E+G+ L +G
Sbjct: 832 AAFDAIELAPKESQTVTLSIKP-DAIGYFNENGKLTFPEDGSFTLSIG 878
>gi|393784569|ref|ZP_10372732.1| hypothetical protein HMPREF1071_03600 [Bacteroides salyersiae
CL02T12C01]
gi|392665550|gb|EIY59074.1| hypothetical protein HMPREF1071_03600 [Bacteroides salyersiae
CL02T12C01]
Length = 929
Score = 285 bits (729), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 169/444 (38%), Positives = 248/444 (55%), Gaps = 33/444 (7%)
Query: 41 ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
+T PF +L +RA++LVS LTL+EKI+Q+ + AIPRL I Y +W+EA+HGVA
Sbjct: 38 QTHPFQDESLSFHERAKNLVSLLTLEEKINQVGHQTLAIPRLNIKGYNYWNEAIHGVARS 97
Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPN 160
G ATSFP ++++D L + A EAR +Y+ + G+ +W P
Sbjct: 98 GL----------ATSFPVSKAMSSTWDLPLIFDCAVATSDEAR-VYSNTKDKGLIYWCPT 146
Query: 161 INIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAY 220
IN+ RDPRWGR +E GEDP +TGK AV Y++G+QGD + A KHF A
Sbjct: 147 INMSRDPRWGRDEENYGEDPFLTGKIAVEYIKGMQGDD------PKYYKTIATAKHFAAN 200
Query: 221 DLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNL 280
+ + + +T DAR +L + Y P FE VK+G +M AYN +NGIP A+ L
Sbjct: 201 NYEKGRHSTSSDMDAR----NLREYYLPAFEMAVKEGNVRSVMSAYNALNGIPCGANHEL 256
Query: 281 LSKTARRQWGFHGYITSDCDAVSIIYDA--EGYAKSPEDAVVDVLKAGMDVNCGSFLQKH 338
L R +WGF+G++TSDC AV +Y + + + +A + G D+NCG+ Q +
Sbjct: 257 LIDILRTEWGFNGFVTSDCGAVDDVYQSNRHHFVNTAAEASAVSIVNGEDLNCGNTFQDY 316
Query: 339 TKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQ 398
K A+++ + E+++D AL +F R +G F+ N + P+ I DV+ H+ LA +
Sbjct: 317 CKEAIEKGYMQEADLDTALVRVFEARFSVGEFD-NASNVPWRSISDDVLDCEEHRQLAYK 375
Query: 399 AAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQN- 457
AAQ+ IVLLKN + +LPL K+KSV A+IGP N+ LG Y+G TP +
Sbjct: 376 AAQEAIVLLKNDNNILPLDKTKSV--AVIGPFGNT--ITLGGYSGSPTALTTPFGGIAEK 431
Query: 458 --YVEN--TVYYPGCDTVACSSAS 477
YV N T+ + C+ + S S
Sbjct: 432 IGYVVNDGTIQFEDCEEQSVPSDS 455
Score = 118 bits (296), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 89/303 (29%), Positives = 139/303 (45%), Gaps = 52/303 (17%)
Query: 465 YPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRV 524
Y GC + ++++A +IA AD V+ G D T E DR +L LPG QQ+L+ V
Sbjct: 590 YKGCAVTGTAETNLERAKEIAAKADVVIFAAGTDLTVSDESHDRTNLNLPGDQQKLLEAV 649
Query: 525 AEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRL 584
+A VIL+L V I +AK ++ +I+ A Y G+A A+A+V++GD+NP G+L
Sbjct: 650 Y-SANPNVILLLQTCSSVTINWAK--EHVPAIIEAWYGGQAQGKAIADVLYGDYNPSGKL 706
Query: 585 PMTWY------PQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSY 638
TWY P + + D K TY +++ ++PFG G+SY+ + Y
Sbjct: 707 TSTWYNALSDLPNGMLNYDIRDAKY-----------TYMYHDKTPLYPFGYGMSYTTFEY 755
Query: 639 KFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGK 698
+ +S+++L + + +V+ + N G+ AG
Sbjct: 756 QKLNISKSRLAAGE-----------------------------ELIVSADITNTGKYAGA 786
Query: 699 HPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRARED-GLMVIE 757
V L+ RP+KQLVGF V L E + L E LS E +E
Sbjct: 787 EIVQLYAH-VNSSIERPLKQLVGFARVELEPGETKTVTMPLKH-EQLSYFNEQTNTFDVE 844
Query: 758 EGT 760
EG+
Sbjct: 845 EGS 847
>gi|118489157|gb|ABK96385.1| unknown [Populus trichocarpa x Populus deltoides]
Length = 343
Score = 285 bits (728), Expect = 8e-74, Method: Compositional matrix adjust.
Identities = 152/344 (44%), Positives = 212/344 (61%), Gaps = 16/344 (4%)
Query: 437 LLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIA-KGADHVVLMM 495
++GNYAG +C TPLQ ++ Y + TV+ GC+ V C+ A ++A + AD +L+M
Sbjct: 1 MIGNYAGVACGYTTPLQGIRRYAK-TVHLSGCNDVFCNGNQQFNAAEVAARHADATILVM 59
Query: 496 GLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGS 555
GLDQ+ E E DR L+LPG QQEL++RVA A++ P ILVL+ GGP+D++FAK D IG+
Sbjct: 60 GLDQSIEAEFRDRKGLLLPGYQQELVSRVARASRGPTILVLMSGGPIDVSFAKNDPRIGA 119
Query: 556 ILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGR 614
ILW GYPG+AG A+A+V+FG NPGG+LPMTWYP DY+ KVPMT+M MR + G PGR
Sbjct: 120 ILWVGYPGQAGGAAIADVLFGTANPGGKLPMTWYPHDYLAKVPMTNMGMRADPSRGYPGR 179
Query: 615 TYRFYEGKEVFPFGCGLSYSKYSYKF----KAVSQNKLYLNQSSSTKMVENQDVVHYKSV 670
TYRFY+G VFPFG G+SY+ +++ + VS L+ S +T N V + +
Sbjct: 180 TYRFYKGPVVFPFGHGMSYTTFAHSLVQAPREVSVPLASLHVSRNTTGASNAIRVSHAN- 238
Query: 671 PELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAK 730
CE V I VKN G+M G H +L+F P G KQL+GF+ V L
Sbjct: 239 -------CEALALGVHIDVKNTGDMDGTHTLLVFSSPP-GGKWSTQKQLIGFEKVHLVTG 290
Query: 731 EKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
+ + ++ C+ LS G+ I G H+L +GD ++ IS+
Sbjct: 291 SQKRVKIDIHVCKHLSVVDRFGIRRIPNGEHYLYIGDLKHSISL 334
>gi|300777563|ref|ZP_07087421.1| beta-glucosidase [Chryseobacterium gleum ATCC 35910]
gi|300503073|gb|EFK34213.1| beta-glucosidase [Chryseobacterium gleum ATCC 35910]
Length = 896
Score = 285 bits (728), Expect = 8e-74, Method: Compositional matrix adjust.
Identities = 160/445 (35%), Positives = 242/445 (54%), Gaps = 34/445 (7%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
+PF LP+++R +L++ LT +EKI +++++ A+PRL IPAY WW+EALHGVA G
Sbjct: 44 YPFRNPDLPVNERIENLLTLLTTEEKIGMMMDNSQAVPRLEIPAYGWWNEALHGVARAGI 103
Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN--------AGQAIGM 154
AT FPQ I AA++D ++ + I EARA YN G+ G+
Sbjct: 104 ----------ATVFPQAIGMAATWDVPEHFKTFEMISDEARAKYNRSFDEALKTGRYEGL 153
Query: 155 TFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACC 214
TFW PNINIFRDPRWGRGQET GEDP +T V+ V+G+QG N K + AC
Sbjct: 154 TFWTPNINIFRDPRWGRGQETYGEDPYLTSVLGVAAVKGLQG---NDPKF---FKTHACA 207
Query: 215 KHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPS 274
KHF + W R+ ++A ++ +DL +TY P F++ V++G +MCAYN +G P
Sbjct: 208 KHFAVHSGPEW---NRHSYNAEISKRDLYETYLPAFKALVQEGNVREVMCAYNAFDGQPC 264
Query: 275 CADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPED--AVVDVLKAGMDVNCG 332
CA+ LL++ R +W + G + SDC A++ + + + P++ D LK D+ CG
Sbjct: 265 CANNTLLTEILRGKWKYDGMVVSDCWALADFFQKKYHGTHPDEKTTAADALKHSTDLECG 324
Query: 333 SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAH 392
K ++ + E +ID ++ + LG+ + ++ + I VV S H
Sbjct: 325 DTYNNLNK-SLASGLITEKDIDESMRRILKGWFELGMLDPKSSVH-WNTIPYSVVDSEEH 382
Query: 393 QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
+ AL+ AQ IVL+KN +LPL ++ +A++GPNA+ LGNY G +T L
Sbjct: 383 KKQALKMAQKSIVLMKNEKNILPLNRNIK-KIAVVGPNADDGLMQLGNYNGTPSSIVTIL 441
Query: 453 QALQNYVENT--VYYPGCDTVACSS 475
++ N +Y G + SS
Sbjct: 442 DGIKTKFPNAEIIYEKGSEVTDPSS 466
Score = 101 bits (251), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 77/269 (28%), Positives = 130/269 (48%), Gaps = 49/269 (18%)
Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
K AD +V GL + E EE+ D+ + LP Q++L+ + + K PV+ V
Sbjct: 621 KNADVIVFAGGLSPSLEGEEMMVNAEGFKGGDKTSIALPKVQRDLLAELRKTGK-PVVFV 679
Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK 595
L G + + + ++N ++L A Y G++G A+A+V+ GD+NP G+LP+T+Y ++ +
Sbjct: 680 LCTGSALGL--EQDEKNYDALLNAWYGGQSGGTAVADVLAGDYNPSGKLPITFY-KNLEQ 736
Query: 596 VPMTDMKMRPQATSGN---PGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQ 652
+ K N GRTYR+ K ++PFG GLSYSK+ Y +S+N + +N+
Sbjct: 737 LDNALSKTSKHEGFENYDMQGRTYRYMTEKPLYPFGHGLSYSKFVYGDSKLSKNSISVNE 796
Query: 653 SSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGN 712
+ + +TI V N E G+ V +++K R +
Sbjct: 797 NVT-----------------------------ITIPVTNISEREGEEVVQVYIK--RNND 825
Query: 713 GR-PIKQLVGFQSVILNAKEKAEIVFELS 740
+ P+K L F+ + +KE I LS
Sbjct: 826 AQAPVKTLRAFERTPIKSKETKNIQLILS 854
>gi|297738404|emb|CBI27605.3| unnamed protein product [Vitis vinifera]
Length = 581
Score = 285 bits (728), Expect = 9e-74, Method: Compositional matrix adjust.
Identities = 152/389 (39%), Positives = 224/389 (57%), Gaps = 51/389 (13%)
Query: 208 LQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYN 267
L+ S+CCKH+ YD+D+W V+ QD+ +T+ PFE
Sbjct: 19 LKVSSCCKHYATYDIDSW---------LNVSEQDMKETFFSPFE---------------- 53
Query: 268 RVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYA-KSPEDAVVDVLKAG 326
R +W HGYI SDC + +I D + Y +S DAV L+AG
Sbjct: 54 ------------------RDEWDLHGYIVSDCYGLEVIVDNQNYLNESKVDAVAKTLQAG 95
Query: 327 MDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADV 386
+D+ CG + +V K+ + E+DRAL N++ + MR+G F+G P + +G
Sbjct: 96 LDLECGHYYTDALNESVLTGKVSQYELDRALKNIYVLLMRVGYFDGIPAYE---SLGLKD 152
Query: 387 VCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSC 446
+C+ H LA +AA+ GIVLLKN + +LPL K L L+GP+AN+ + ++GNYAG
Sbjct: 153 ICAADHIELAREAARQGIVLLKNDYEVLPLKPGKK--LVLVGPHANATEVMIGNYAGLPY 210
Query: 447 RSITPLQALQNYVENTVYYPGCDTVACSSAS-IDKAVDIAKGADHVVLMMGLDQTQEKEE 505
+ ++PL+A + + N Y GC +CS+ + +A + AK A+ ++ +G D + E E
Sbjct: 211 KYVSPLEAF-SAIGNVTYATGCLDASCSNDTYFSEAKEAAKFAEVTIIFVGTDLSIEAEF 269
Query: 506 LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEA 565
+DRVD +LPG Q ELI +VAE + PVILV+L G +DITFAK + I +ILW G+PGE
Sbjct: 270 VDRVDFLLPGNQTELIKQVAEVSSGPVILVVLSGSNIDITFAKNNPRISAILWVGFPGEQ 329
Query: 566 GAVALAEVIFGDHNPGGRLPMTWYPQDYI 594
G A+A+V+FG +NPGGRLP+TWY DY+
Sbjct: 330 GGHAIADVVFGKYNPGGRLPVTWYEADYV 358
>gi|333381842|ref|ZP_08473521.1| hypothetical protein HMPREF9455_01687 [Dysgonomonas gadei ATCC
BAA-286]
gi|332829771|gb|EGK02417.1| hypothetical protein HMPREF9455_01687 [Dysgonomonas gadei ATCC
BAA-286]
Length = 861
Score = 284 bits (727), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 161/442 (36%), Positives = 243/442 (54%), Gaps = 32/442 (7%)
Query: 41 ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
+ P+ L +RA+DL+SRLTL EK+ + +++ + RLG+ + WWSEALHGVA
Sbjct: 19 QIMPYKDANLTPEERAQDLLSRLTLKEKVGLMGDNSIEVTRLGVKKFAWWSEALHGVANQ 78
Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN--------AGQAI 152
G G T FP+ I AASF+ L Y + AI EARA ++ Q
Sbjct: 79 G----------GVTVFPEPIGMAASFNDELLYHVFDAISDEARARFHFREKKGDERRQDN 128
Query: 153 GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASA 212
G++ W PN+NIFRDPRWGRGQET GEDP +T + +S V G+QG K K L A
Sbjct: 129 GLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGISVVNGLQGP--KDAKYKKLL---A 183
Query: 213 CCKHFTAYDLDNWKGTTRYKFD-ARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
C KH+ + W R+ + + + L +TY P F+ V++ S +MCAY+R +
Sbjct: 184 CAKHYAVHSGPEW---NRHVLNLNNLDNRHLWETYMPAFQVLVQKADVSQVMCAYHRQDD 240
Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC 331
P C + +LL + R +WGF + SDC A++ Y + + + V + AG DV C
Sbjct: 241 DPCCGNNHLLKRILRDEWGFKRMVVSDCGAIADFYTSHKVSSDALHSAVKGVLAGTDVEC 300
Query: 332 GSFLQKHTKA-AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSP 390
G H AV + + E++ID+++ L + R RLG F+ N ++ P+ I ++
Sbjct: 301 GFGYTYHELVDAVSRGLIYEADIDKSVLRLLTERFRLGDFDDN-SIVPWANIPDTIINCK 359
Query: 391 AHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSIT 450
HQ LAL+ A+ + LL+N + +LPL K + A+IGPNA+ AK + GNY G +++T
Sbjct: 360 KHQALALEMARQSMTLLQNKNNILPLSSKKKI--AVIGPNADDAKLMWGNYNGIPVKTVT 417
Query: 451 PLQALQNYVENTVYYP-GCDTV 471
L+ +++ ++Y GCD V
Sbjct: 418 ILEGIKSIAGKDIFYEKGCDIV 439
Score = 101 bits (252), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 86/273 (31%), Positives = 125/273 (45%), Gaps = 58/273 (21%)
Query: 482 VDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKP 531
VD K D VV G+ E EE+ DR D+ LP Q+ I + +A K+
Sbjct: 593 VDRLKDIDVVVFAGGISGELEGEEMPIEMPGFKGGDRTDIELPASQRNCIKALKKAGKR- 651
Query: 532 VILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQ 591
++++ C G I + +IL A Y G++G A+AEV+FG +NP G+LP+T+Y +
Sbjct: 652 -VIMVNCSGSA-IGLMPESESCEAILQAWYGGQSGGQAIAEVLFGKYNPSGKLPITFY-K 708
Query: 592 DYIKVP---MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKL 648
+ ++P DMK GRTYR+ E K +FPFG GLSY+ + S
Sbjct: 709 NIDQLPDFEEYDMK----------GRTYRYLEDKPLFPFGYGLSYTTFDIGRATAS---- 754
Query: 649 YLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPA 708
S S K E K + I VKN G+ G V ++VK
Sbjct: 755 ----SISAKAGE---------------------KIKLVIPVKNTGKRTGSETVQVYVK-- 787
Query: 709 RRGNGRPIKQLVGFQSVILNAKEKAEIVFELSP 741
+ +G PIK L F+ + L ++ FEL P
Sbjct: 788 KVDSGGPIKTLRSFKRIELPPNVSQDLTFELEP 820
>gi|325918730|ref|ZP_08180824.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
ATCC 35937]
gi|325535054|gb|EGD06956.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
ATCC 35937]
Length = 391
Score = 284 bits (727), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 156/371 (42%), Positives = 217/371 (58%), Gaps = 29/371 (7%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
QRA LV++++ DEK++Q +N APAIPRL IPAYEWWSE LHG+A G A
Sbjct: 34 QRAAALVAQMSRDEKVAQAMNDAPAIPRLDIPAYEWWSEGLHGIARNGY----------A 83
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGMTFWAPNINIFR 165
T FPQ I AAS+++ L ++G + EARA +N + G+T W+PNINIFR
Sbjct: 84 TVFPQAIGLAASWNTALMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFR 143
Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
DPRWGRG ET GEDP +TG+ AV ++RG+QGD N + A KH +
Sbjct: 144 DPRWGRGMETYGEDPFLTGQLAVGFIRGLQGDDLN------HPRTIATPKHIAVHSGPE- 196
Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
R+ FD V+ +D+ TY P F + + G+A +MCAYN ++G P+CA LL+
Sbjct: 197 --PGRHGFDVDVSPRDMEATYTPAFRAALVDGQAWSVMCAYNSLHGTPACAADWLLNGRV 254
Query: 286 RRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQ 345
R WGF G++ SDCDAV + + + LKAG D+NCG + A+++
Sbjct: 255 RGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCG-HAYRELGTAIER 313
Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
++ E+ +D++L LF+ R RLG P P+ ++GA V + AH+ LALQAA + IV
Sbjct: 314 GEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRALALQAAAESIV 372
Query: 406 LLKNSHGLLPL 416
LLKN+ LPL
Sbjct: 373 LLKNTATTLPL 383
>gi|198425902|ref|XP_002120563.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 996
Score = 283 bits (725), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 196/564 (34%), Positives = 294/564 (52%), Gaps = 54/564 (9%)
Query: 54 QRARDLVSRLTLDEKISQLVNS-------APAIPRLGIPAYEWWSEALHGVAGVGKGIFF 106
+R+ VSRLT+ E + Q+ APAI RLGI Y+W +E L G A G
Sbjct: 449 KRSFSEVSRLTVKELVLQISRGGAGDNGPAPAITRLGIGPYQWNTECLRGYAMNGD---- 504
Query: 107 NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTFWAP 159
AT FPQ I AA+FD L Y++ + I LEARA +N G G++ ++P
Sbjct: 505 ------ATCFPQPIGLAATFDQGLIYKLAKTIALEARAKHNNFTKNGNFGDHTGLSCFSP 558
Query: 160 NINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTA 219
INI R P WGR QET GEDP+++ A +YV G+QGD A+A CKHF A
Sbjct: 559 VINILRHPLWGRNQETFGEDPVMSSLMARAYVTGLQGDEI-------YYPATANCKHFAA 611
Query: 220 YDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRN 279
YD ++R F+A V+++DL TY P F CV G A GI+C+YN +NG P+CA +
Sbjct: 612 YDGPENIPSSRLSFNANVSIEDLGRTYFPAFRECVHSG-AFGIVCSYNAINGEPACAS-S 669
Query: 280 LLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT 339
L R ++ F GY++SD A+ Y KS + V AG+D+ S+ + +
Sbjct: 670 YLQTILRDKFNFKGYVSSDESAIEFFDIYFKYTKSNLLSAVAAFDAGVDLELTSYGKNNR 729
Query: 340 ----KAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVL 395
AV+Q + E+ + R+ LF RM LG F+ + + DVV S AH+
Sbjct: 730 YSLLNQAVEQGLVTEAALRRSAKRLFRTRMALGEFDPQ-EFNHWLNVPIDVVQSLAHRKQ 788
Query: 396 ALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR---SITPL 452
A++ A VLLKN G+LPL + ++A++GP N+++ L G+Y P+ +PL
Sbjct: 789 AVEVAAKSFVLLKND-GVLPL-NHRYENVAIVGPFINNSEALTGDYH-PNYNLKYFSSPL 845
Query: 453 QALQNYVENTV--YYPGC------DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKE 504
A + + V + GC + C++ + ++ G+D V++ +G E E
Sbjct: 846 FAANSLSSSGVARFTTGCVGTNNQNLPICATYNSTHVKEVVTGSDIVLVTLGTGTGIEAE 905
Query: 505 ELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGE 564
DR+ + LPG+Q ++I V + A PVI+VL GP+D+++ N +++ + +
Sbjct: 906 GRDRLSMELPGKQLDMIKDVVKYANGPVIVVLFNAGPLDVSWVM--GNTAAVIACHFSAQ 963
Query: 565 AGAVALAEVIFGDHNPGGRLPMTW 588
A+ EV+ G NP GRLP TW
Sbjct: 964 MTGEAMLEVLTGIVNPAGRLPNTW 987
>gi|146295632|ref|YP_001179403.1| glycoside hydrolase family protein [Caldicellulosiruptor
saccharolyticus DSM 8903]
gi|145409208|gb|ABP66212.1| glycoside hydrolase, family 3 domain protein [Caldicellulosiruptor
saccharolyticus DSM 8903]
Length = 770
Score = 283 bits (725), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 219/703 (31%), Positives = 345/703 (49%), Gaps = 115/703 (16%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIPA E+ G +GAT FPQ I A +FD+ + + + I
Sbjct: 83 RLGIPAM-IHEESCSGFMA-----------KGATVFPQSIGVACTFDNEIVEELAKVIRT 130
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
+ +A+ A QA+ AP I++ RD RWGR +ET GEDP + AVSYV+G+QGD
Sbjct: 131 QMKAV-GAHQAL-----APLIDVARDARWGRVEETFGEDPYLVANMAVSYVKGLQGDDIK 184
Query: 201 GGKLKGKLQASACCKHFTAYDLD----NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQ 256
G + A KHF Y + NW + ++L + Y PFE VK
Sbjct: 185 KGIV-------ATGKHFVGYAMSEGGMNWA-------PVHIPERELREVYLYPFEVAVKV 230
Query: 257 GRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE 316
IM AY+ ++GIP A+R LL+ AR +WGF G SD V + D K+ E
Sbjct: 231 AGLKSIMPAYHEIDGIPCHANRKLLTDIARNEWGFDGIYVSDYSGVRNLLDYHKSVKTYE 290
Query: 317 DAVVDVLKAGMDVNCGS---FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGN 373
+A L AG+D+ F ++ K A+K+ K + +D A+ + ++ RLGLF+ N
Sbjct: 291 EAAALSLWAGLDIELPKIECFTEEFIK-ALKEGKFDMTLVDAAVKRVLEMKFRLGLFD-N 348
Query: 374 PTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANS 433
P ++ G + ++ + + L+ + AQ+ +VLLKN + LPL K S +A+IGPNANS
Sbjct: 349 PYIKTDGIV--ELFDNKEQRQLSRRVAQESMVLLKNDN-FLPLSKD-SKKIAVIGPNANS 404
Query: 434 AKTLLGNYAGPS----------------------CRSITPLQALQNYVEN-------TVY 464
+ LLG+Y+ P+ R++ ++++ +++ VY
Sbjct: 405 VRNLLGDYSYPAHIATLEMFFIKEDKGVGNEEEFVRNVINMKSIFEAIKDKVSSNTEVVY 464
Query: 465 YPGCDTVACSSASIDKAVDIAKGADHVVLMMG------LDQTQEKEELDRVDLVLPGRQQ 518
GCD + + ++A A AD V+L++G LD T E DR L LPG Q+
Sbjct: 465 AKGCDVNSQDRSGFEEAKKAAADADAVILVVGDKAGLRLDCTS-GESRDRASLRLPGVQE 523
Query: 519 ELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDH 578
+L+ + V+ VL+ G PV + + N+ ++L A +PGE GA A+A+V+FGD+
Sbjct: 524 DLVKEIVSVNPNTVV-VLVNGRPVALDWIM--ENVKAVLEAWFPGEEGANAVADVLFGDY 580
Query: 579 NPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSY 638
NPGG+L ++ +P+D +VP+ +P Y K + PFG GLSY+ + Y
Sbjct: 581 NPGGKLAIS-FPRDVGQVPVY-YGHKPSGGKSCWHGDYVEMSTKPLLPFGYGLSYTTFEY 638
Query: 639 KFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGK 698
K A+ + K+ +++S +++ ++N G+ G
Sbjct: 639 KNFAIEKEKIGMDES-----------------------------IKISVEIENTGKYEGD 669
Query: 699 HPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSP 741
V L+ + RP+K+L G++ V L EK ++VFEL P
Sbjct: 670 EIVQLYTRKEEYLVTRPVKELKGYKRVHLKPGEKKKVVFELYP 712
>gi|410097219|ref|ZP_11292201.1| hypothetical protein HMPREF1076_01379 [Parabacteroides goldsteinii
CL02T12C30]
gi|409224537|gb|EKN17469.1| hypothetical protein HMPREF1076_01379 [Parabacteroides goldsteinii
CL02T12C30]
Length = 805
Score = 283 bits (725), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 229/788 (29%), Positives = 369/788 (46%), Gaps = 134/788 (17%)
Query: 51 PISQRARDLVSRLTLDEKISQL--VNSAPAIPRLGIPAYEW----WSEAL---------- 94
PI +R DL+ ++T++EK QL + A+ + +P EW W + +
Sbjct: 67 PIDKRVEDLLKQMTVEEKTCQLGTIYGYGAVLKDTLPTDEWKTRIWKDGIGNIDEHLNGE 126
Query: 95 ------------HGVAGVGKGIFF-------------NGTIRG-----ATSFPQVILTAA 124
H A FF N IRG +T FP I
Sbjct: 127 WKRTSLDFPYSNHAEAMNKVQAFFVEETRLGIPADLTNEGIRGLKHEKSTFFPAQIGQGC 186
Query: 125 SFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGEDPLVT 183
++D L Y IG+ G EA+AL G T ++P +++ RDPRWGR E+ GED +
Sbjct: 187 TWDKELIYEIGRITGEEAKAL-------GYTNIYSPILDLSRDPRWGRTVESYGEDSYLA 239
Query: 184 GKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLA 243
G+ V G+Q + + + KHF Y + + D + Q++
Sbjct: 240 GELGRQQVLGIQSN-----------RVVSTPKHFAIYGIPGGGRDCYSRTDPHASPQEVH 288
Query: 244 DTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVS 303
+ + PF ++ A G MC++N NG P A L+++ R QWGF GY+ SD A+
Sbjct: 289 ELHLEPFRIAFQEAGALGTMCSHNDYNGTPVSASHYLMTELLRNQWGFKGYVVSDSWAID 348
Query: 304 IIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT------KAAVKQKKLPESEIDRAL 357
+ E+AV L AG++V +F ++ + A+++ + ES +D+ +
Sbjct: 349 KNVKFYHIVDTEEEAVASELNAGLNVR--TFFEQSEVFIEALRRALQKGLVEESTLDQRV 406
Query: 358 HNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLP 417
+ V+ LGLF+ +P ++ K+ +V S ++ ++L+AA++ IVLLKN + LPL
Sbjct: 407 REVLYVKFWLGLFD-DPYVKD-TKLADKIVNSDKNREVSLRAARESIVLLKNENNTLPLS 464
Query: 418 KSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE---NTVYYPGCDTVACS 474
K+ ++A+IGP A+ K+L Y + IT LQ L+N + N +Y GC+ +
Sbjct: 465 KTLK-NIAVIGPQADEVKSLTSRYGSHNPNVITGLQGLKNLLGENVNLMYAKGCNVRDKN 523
Query: 475 ---------------SASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQE 519
ID+AV+IAK A+ ++ +G D E RV+L L GRQ+E
Sbjct: 524 FPQSDVMYFELSDKEKEEIDEAVEIAKKAEVAIIYVGDDFRTIGESRSRVNLDLSGRQKE 583
Query: 520 LITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHN 579
L+ R +A PV+LVL G PV + + D N+ +I+ A YPGE A+AEV+FGD+N
Sbjct: 584 LV-RAVQATGTPVVLVLFNGRPVTLNWE--DANLPAIVEAWYPGEFSGQAVAEVLFGDYN 640
Query: 580 PGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYK 639
PGG+L T +P+ ++P +P AT G+ + +G E++PFG GLSY+ +
Sbjct: 641 PGGKLSTT-FPKSVGQIPWA-FPFKPNAT----GKGFARVDG-ELYPFGYGLSYTTF--- 690
Query: 640 FKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKH 699
++ Q S+TK+ + + VT VKN G + G
Sbjct: 691 -------EISNLQPSATKIADGDTLT-------------------VTCKVKNTGSVKGDE 724
Query: 700 PVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEG 759
V L++ R K+L GF+ V L E+ + F+++ + + V+E G
Sbjct: 725 VVQLYLNDETSSISRFEKELCGFERVALEPGEEKTVTFKVNR-RAYGMYNDKNEFVVEPG 783
Query: 760 THFLVVGD 767
FL G+
Sbjct: 784 KFFLFAGN 791
>gi|398386387|ref|ZP_10544389.1| beta-glucosidase-like glycosyl hydrolase [Sphingobium sp. AP49]
gi|397718418|gb|EJK79007.1| beta-glucosidase-like glycosyl hydrolase [Sphingobium sp. AP49]
Length = 791
Score = 283 bits (725), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 227/712 (31%), Positives = 342/712 (48%), Gaps = 96/712 (13%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIP + E LHG A VG ATSFPQ I A+S+D + ++ Q IG
Sbjct: 138 RLGIPIL-FHEEGLHGYAAVG-----------ATSFPQSIAMASSWDPTMLRQVNQVIGR 185
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
E RA + + M +P ++I RDPRWGR +ET GEDP + G+ V+ V G+QG+ +
Sbjct: 186 EIRA-----RGVPMVL-SPVVDIARDPRWGRIEETYGEDPYLVGEMGVAAVEGLQGEGRS 239
Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
+L A KH T + GT A V+ ++L + + PPFE VK+
Sbjct: 240 --RLLRPGHVFATLKHLTGHGQPE-SGTNVGP--APVSERELRENFFPPFEQVVKRTGIE 294
Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
+M +YN ++G+PS A+R LL R++WGF G + SD AV + A + E+A +
Sbjct: 295 AVMASYNEIDGVPSHANRWLLDNVLRQEWGFRGAVVSDYSAVDQLMSIHHIAANLEEAAM 354
Query: 321 DVLKAGMDVNCGSFLQKHTKAA-VKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
L AG+D + L T V++ K+ E+++D A+ + ++ R GLF NP
Sbjct: 355 RALDAGVDADLPEGLSYATLGKLVREGKVSEAKVDLAVRRMLELKFRAGLFE-NPYAD-- 411
Query: 380 GKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLG 439
A + + + LA AAQ I LLKN G+LPL ++ A+IGP+A A+ LG
Sbjct: 412 ANAAAAITNNDEARALARTAAQRSITLLKND-GMLPLKPEGTI--AVIGPSAAVAR--LG 466
Query: 440 NYAGPSCRSITPLQALQNYV---ENTVYYPGC----------DTVACSSAS-----IDKA 481
Y G S++ L+ ++ V N V+ G D V S + I +A
Sbjct: 467 GYYGQPPHSVSILEGIKARVGTKANIVFAQGVKITENDDWWEDKVVKSDPAENRKLIAQA 526
Query: 482 VDIAKGADHVVLMMGLDQTQEKEEL------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
V+ A+ D ++L +G + +E DR L L G QQEL + +A KP+ +V
Sbjct: 527 VEAARNVDRIILTLGDTEQSSREGWADNHLGDRPSLDLVGEQQELFDAL-KALGKPITVV 585
Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK 595
L+ G P + K +IL Y GE G A+A+++FGD NPGG+LP+T P+ +
Sbjct: 586 LINGRPA--STVKVSEQANAILEGWYLGEQGGNAVADILFGDVNPGGKLPVT-VPRSVGQ 642
Query: 596 VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSS 655
+PM M+P A G Y F ++PFG GLSY+ +S +S K+ +S
Sbjct: 643 LPMF-YNMKPSARRG-----YLFDTTDPLYPFGFGLSYTNFSLSAPRLSATKIGTGGKTS 696
Query: 656 TKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRP 715
V++ V+N G G V L+++ RP
Sbjct: 697 -----------------------------VSVDVRNTGAREGDEVVQLYIRDKVSSVTRP 727
Query: 716 IKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGD 767
+K+L GFQ V L E + F + P E+L + V+E G ++ G+
Sbjct: 728 VKELKGFQRVTLKPGESRTVTFTVGP-EALQMWNDQMRRVVEPGDFEIMTGN 778
>gi|393724082|ref|ZP_10344009.1| beta-glucosidase [Sphingomonas sp. PAMC 26605]
Length = 900
Score = 283 bits (725), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 173/419 (41%), Positives = 234/419 (55%), Gaps = 47/419 (11%)
Query: 56 ARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATS 115
AR +V+R+T EK+ QL+N APAIPRL IPAY WW+E+LHG G T+
Sbjct: 50 ARAIVARMTPAEKLPQLLNVAPAIPRLNIPAYNWWTESLHGAIGTVP----------TTN 99
Query: 116 FPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ--------AIGMTFWAPNINIFRDP 167
FP+ I AASFD+ L +R+ AI E RAL+ + G+ W+PNINIFRDP
Sbjct: 100 FPEPIGLAASFDAPLVHRVAGAISTEVRALHTLARETGKLGRIGTGLDTWSPNINIFRDP 159
Query: 168 RWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKG 227
RWGRGQET GEDP +T V++V G+QG + + A KHF + +
Sbjct: 160 RWGRGQETYGEDPYLTAHQGVAFVTGMQGPNPD------LPEVIATPKHFAVH---SGPE 210
Query: 228 TTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARR 287
+TR+ + V+ DL DTY P F + + +GRA IMCAYNR++G P+CA+ LL R
Sbjct: 211 STRHAANVYVSPHDLEDTYLPAFRAAIVEGRAGSIMCAYNRIDGQPACANDLLLKDHLRG 270
Query: 288 QWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGS-------FLQKHTK 340
WGF+GY+ SDCDAV I D YA AV LKAG+D C + L
Sbjct: 271 AWGFNGYVVSDCDAVKDIADNHKYAPDQATAVAAALKAGVDNECNTQTIGDIGGLPDRFG 330
Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNG---NPTMQPFGKIGADVVCSPAHQVLAL 397
A+K+ + E ++DR L LFS R+R G G P + P G I +P H LAL
Sbjct: 331 EALKRNLISEGDVDRTLVRLFSARIRNGDLPGIAAAPAVVPVGAI-----LTPDHAALAL 385
Query: 398 QAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQ 456
+AAQ +V+LKN G+LPL +A+IGP ++ + L GNY+ S +S P+ ++
Sbjct: 386 EAAQKSLVMLKND-GVLPL--RPGARIAVIGPLGDATRVLRGNYS--STQSAPPVSVVE 439
Score = 112 bits (279), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 84/260 (32%), Positives = 122/260 (46%), Gaps = 42/260 (16%)
Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
D+ L LP QQ L+ R A A KP+I++ + G P+++ +AK N +I+ A YPG+AG
Sbjct: 658 DKTTLDLPAEQQALLER-AHATGKPLIVIAMNGSPINLAWAK--DNAAAIVEAWYPGQAG 714
Query: 567 AVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFP 626
+A+A V+ G +PGGRLP+T+Y P D M GRTYR++ G V+P
Sbjct: 715 GLAVANVLTGKADPGGRLPLTFYRSVADLPPFDDYAMT--------GRTYRYFTGTPVYP 766
Query: 627 FGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVT 686
FG GLS++ ++Y V+ P G R VT
Sbjct: 767 FGYGLSFTSFAYAPLTVT--------------------------PAAGGAETGLR---VT 797
Query: 687 IGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLS 746
V N G+ G V L++ + G P L G++ V L E + F+LSP LS
Sbjct: 798 TEVSNTGQRVGDEVVQLYLNFPDQ-PGVPRVALRGYRRVTLKPGEHRAVTFDLSP-RDLS 855
Query: 747 RAREDGLMVIEEGTHFLVVG 766
DG + G + + VG
Sbjct: 856 AVGIDGKRQVMAGQYRVSVG 875
>gi|409195436|ref|ZP_11224099.1| glycoside hydrolase family protein [Marinilabilia salmonicolor JCM
21150]
Length = 867
Score = 283 bits (724), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 162/419 (38%), Positives = 232/419 (55%), Gaps = 31/419 (7%)
Query: 53 SQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRG 112
+RA DL+ LTL+EK+S +V+ AI RLGI Y WW+EALHGVA G+
Sbjct: 34 EERADDLLKELTLEEKVSLMVDRNTAIERLGIEEYNWWNEALHGVARAGQ---------- 83
Query: 113 ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN-------AGQAIGMTFWAPNINIFR 165
AT FPQ + AA+FD + + A EARA ++ G+ G+T W PNIN+FR
Sbjct: 84 ATVFPQPVGMAAAFDRDMVLDVFSAASDEARAKHHFFKERGERGRYQGLTMWTPNINVFR 143
Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
DPRWGRG E GEDP + G + V+G+QGD GK + AC KH+ + W
Sbjct: 144 DPRWGRGMEAYGEDPFMNGVLGTAVVKGLQGD--RSGKYD---KLHACAKHYAVHSGPEW 198
Query: 226 KGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
R+ F+A + +DL +TY P F+ V G +MCAYNR G P C + LL
Sbjct: 199 ---NRHSFNAENIRPRDLHETYLPAFKKLVIDGDVRMVMCAYNRFEGEPCCGNNQLLRDI 255
Query: 285 ARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE--DAVVDVLKAGMDVNCGSFLQKHTKAA 342
R +WGF G + SDC A++ ++ + +A P+ A D + AG D+NCG + A
Sbjct: 256 LRNEWGFDGVVVSDCWAINDFFNKDAHAMYPDAKTASTDAVLAGTDLNCGDSYPSLVE-A 314
Query: 343 VKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQD 402
V+Q + E ++D +L L R LG + + ++ + KI VV SP H +AL+AA+
Sbjct: 315 VEQGLITEEQLDISLRRLLIARFELGEMDPDEEVE-WSKIPHSVVSSPTHSEMALEAARK 373
Query: 403 GIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN 461
+ LL N +G LPL K + +++A++GPNAN + GNY G + T LQ ++N + N
Sbjct: 374 SMTLLMNKNGALPL-KKEGLTVAVMGPNANDSLMQWGNYNGTPATTTTILQGIRNALGN 431
Score = 117 bits (294), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 88/282 (31%), Positives = 133/282 (47%), Gaps = 63/282 (22%)
Query: 476 ASIDKAVDIAK--GADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITR 523
A +D +AK AD VV G+ E EE+ DR D+ LP Q+E++
Sbjct: 589 AKVDIPSSVAKVADADVVVFASGISPFLEGEEMGVDLPGFKGGDRTDIALPAIQKEMLKA 648
Query: 524 VAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGR 583
+ +A K+ I+++ C G I F + +IL A YPG+AG A+AEV+FGD+NP GR
Sbjct: 649 LHKAGKE--IILVNCSGSA-IGFEEATDYSSAILQAWYPGQAGGQAVAEVLFGDYNPAGR 705
Query: 584 LPMTWYPQ-----DYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSY 638
LP+T+Y D+ MT+ RTYR++EG+ ++PFG GLSY+ +SY
Sbjct: 706 LPVTFYKSVDQLPDFQDYNMTN-------------RTYRYFEGEPLYPFGYGLSYTTFSY 752
Query: 639 KFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGK 698
+SQ + + +S K + V N G+ G+
Sbjct: 753 DQPELSQTSISTEEEASLK-----------------------------VSVANTGDYDGE 783
Query: 699 HPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELS 740
V L+++ G P L GFQ V + E E+ F+L+
Sbjct: 784 EVVQLYLQKPDDTEG-PSLTLRGFQRVFIPKGETVEVEFQLT 824
>gi|253574420|ref|ZP_04851761.1| glycoside hydrolase [Paenibacillus sp. oral taxon 786 str. D14]
gi|251846125|gb|EES74132.1| glycoside hydrolase [Paenibacillus sp. oral taxon 786 str. D14]
Length = 782
Score = 283 bits (724), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 218/734 (29%), Positives = 346/734 (47%), Gaps = 107/734 (14%)
Query: 67 EKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASF 126
E ++ + A RLGIP E HG +G AT FP + +++
Sbjct: 106 EAVNAIQRYAIENSRLGIPIL-IGEECSHGHMAIG-----------ATVFPVPLSLGSTW 153
Query: 127 DSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKY 186
+ L+ + +A+ E RA G ++P +++ RDPRWGR +E GED + +
Sbjct: 154 NVELYREMCRAVARETRA------QGGAVTYSPVLDVVRDPRWGRTEECFGEDAYLISEM 207
Query: 187 AVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTY 246
AV+ V G+QG++ L G+ +A KHF Y R + ++L +
Sbjct: 208 AVASVEGLQGES-----LDGEDSVAATLKHFVGYGSSE---GGRNAGPVHMGRRELLEVD 259
Query: 247 QPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIY 306
PF V+ G AS IM AYN ++G+P + LL R +WGF G + +DC A+ ++
Sbjct: 260 LLPFRKAVEAGAAS-IMPAYNEIDGVPCTTNEELLDGVLRGEWGFDGMVITDCGAIDMLA 318
Query: 307 DAEGYAKSPEDAVVDVLKAGMDVN-CGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRM 365
A+ DA + ++AG+D+ G KH AV+ +L E +DRA+ + +++
Sbjct: 319 SGHDVAEDGRDAAIQAIRAGIDMEMSGVMFGKHLVEAVRSGQLEEEVLDRAVRRVLTLKF 378
Query: 366 RLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLA 425
RLGLF P P + V+ S H LA Q A +G+VLLKN G+LPL + + ++A
Sbjct: 379 RLGLFE-RPYADP--ERAERVIGSAEHVELARQLASEGVVLLKNKDGVLPL-SADAGTIA 434
Query: 426 LIGPNANSAKTLLGNYAGPSCRS--ITPLQALQNYV----ENTVYYPGCDTVACSSASID 479
+IGPNA++ LG+Y P RS T L +++ + E +Y PGC S D
Sbjct: 435 VIGPNADAGYNQLGDYTSPQPRSKVTTVLGGIRSKLAETPERVLYAPGCRINGNSREGFD 494
Query: 480 KAVDIAKGADHVVLMMGLDQTQE-------------------------KEELDRVDLVLP 514
A+ A+ AD VV+++G ++ E +DR++L L
Sbjct: 495 VALSCAEKADTVVMVVGGSSARDFGEGTIDLRTGASKVTDNAESDMDCGEGIDRMNLSLS 554
Query: 515 GRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVI 574
G Q ELI + + KP+++V + G P+ + D + +IL A YPG+ G A+A+++
Sbjct: 555 GVQLELIQEIHKLG-KPLVVVYINGRPIAEPW--IDEHADAILEAWYPGQEGGHAIADIL 611
Query: 575 FGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEG--KEVFPFGCGLS 632
FGD NP GRL ++ P+ +VP+ G R R+ EG + +PFG GLS
Sbjct: 612 FGDVNPSGRLTIS-IPKHVGQVPVY--------YHGKRSRGKRYLEGDSQPRYPFGYGLS 662
Query: 633 YSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNH 692
Y++++Y + + +N+ STK VT+ V N
Sbjct: 663 YTEFTYNNLKLESDT--INKDGSTK---------------------------VTVEVTNV 693
Query: 693 GEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDG 752
GE AG + L++ RP K+L GF+ + L E + F + P E L ++
Sbjct: 694 GERAGAEVIQLYITDVASKVTRPAKELKGFRKIFLQPGETQTVEFTVGP-EQLQYIGQNY 752
Query: 753 LMVIEEGTHFLVVG 766
V+E G + VG
Sbjct: 753 KPVVEPGEFRVHVG 766
>gi|284174578|ref|ZP_06388547.1| Beta-xylosidase [Sulfolobus solfataricus 98/2]
gi|356934752|gb|AET42953.1| beta-xylosidase-like protein [Sulfolobus solfataricus 98/2]
Length = 754
Score = 283 bits (724), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 226/755 (29%), Positives = 357/755 (47%), Gaps = 114/755 (15%)
Query: 50 LPISQRARDLVSRLTLDEK-ISQLVNSAPAI----PRLGIPAYEWWSEALHGVAGVGKGI 104
L I Q R SRL L K + +LVN RL IPA E L G+ G
Sbjct: 45 LGIGQITRVAGSRLDLKPKEVVKLVNKVQKFLVENTRLKIPAI-IHEECLSGLMGYS--- 100
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIF 164
+T+FPQ I A++++ L + I + R L Q + +P +++
Sbjct: 101 --------STAFPQAIGLASTWNPELLTNVASTIRSQGR-LIGVNQCL-----SPVLDVC 146
Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
RDPRWGR +ET GEDP + ++Y+ G+QG+T Q A KHF A+
Sbjct: 147 RDPRWGRCEETYGEDPYLVASMGLAYITGLQGET----------QLVATAKHFAAH---G 193
Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
+ R V ++L +T+ PFE VK G+ IM AY+ ++G+P + LL+
Sbjct: 194 FPEGGRNIAQVHVGNRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGVPCHGNPQLLTNI 253
Query: 285 ARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSF--LQKHTKAA 342
R++WGF G + SD D + + A + +A + L++G+D+ + + A
Sbjct: 254 LRQEWGFDGIVVSDYDGIRQLEAIHKVASNKMEAAILALESGVDIEFPTIDCYGEPLVTA 313
Query: 343 VKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGA--DVVCSPAHQVLALQAA 400
+K+ + E+ IDRA+ + ++ RLGL + PF A + + + LAL+AA
Sbjct: 314 IKEGLVSEAIIDRAVERVLRIKERLGLLD-----NPFVDESAVPERLDDRKSRELALKAA 368
Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA-------GPSCRSITPLQ 453
++ IVLLKN + +LPL K+ + +A+IGPNAN + +LG+Y +T LQ
Sbjct: 369 RESIVLLKNENNMLPLSKNIN-KIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQ 427
Query: 454 ALQNYVEN--TVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG------LDQTQ--EK 503
+ V +Y GCD S +A++IAK AD ++ +MG L T +
Sbjct: 428 GIAKKVGEGKVLYAKGCDIAGESKEGFSEAIEIAKQADVIIAVMGEKSGLPLSWTDIPSE 487
Query: 504 EEL-----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRN 552
EE DR L L G Q+EL+ + + KP+ILVL+ G P + +
Sbjct: 488 EEFKKYQAVTGEGNDRASLRLLGVQEELLKELYKTG-KPIILVLINGRP--LVLSPIINY 544
Query: 553 IGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNP 612
+ +I+ A +PGE G A+A++IFGD+NP GRLP+T +P D ++P+ + +P +
Sbjct: 545 VKAIIEAWFPGEEGGNAIADIIFGDYNPSGRLPIT-FPMDTGQIPLYYSR-KPSSF---- 598
Query: 613 GRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPE 672
R Y +F FG GLSY+++ Y V+
Sbjct: 599 -RPYVMLHSSPLFTFGYGLSYTQFEYSNLEVTPK-------------------------- 631
Query: 673 LGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEK 732
E + + VKN G M G V L++ + RP+K+L GF V L EK
Sbjct: 632 ---EVGPLSYITILLDVKNVGNMEGDEVVQLYISKSFSSVARPVKELKGFAKVHLKPGEK 688
Query: 733 AEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGD 767
+ F L P E+L+ +V+E+G + +++G+
Sbjct: 689 RRVKFAL-PMEALAFYDNFMRLVVEKGEYQILIGN 722
>gi|15899739|ref|NP_344344.1| Beta-xylosidase [Sulfolobus solfataricus P2]
gi|13816430|gb|AAK43134.1| Beta-xylosidase [Sulfolobus solfataricus P2]
Length = 754
Score = 283 bits (723), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 224/755 (29%), Positives = 354/755 (46%), Gaps = 114/755 (15%)
Query: 50 LPISQRARDLVSRLTLDEK-ISQLVNSAPAI----PRLGIPAYEWWSEALHGVAGVGKGI 104
L I Q R SRL L K + +LVN RL IPA E L G+ G
Sbjct: 45 LGIGQITRVAGSRLGLKPKEVVKLVNKVQKFLVENTRLKIPAI-IHEECLSGLMGYS--- 100
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIF 164
+T+FPQ I A++++ L + I + R L Q + +P +++
Sbjct: 101 --------STAFPQAIGLASTWNPELLTNVASTIRSQGR-LIGVNQCL-----SPVLDVC 146
Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
RDPRWGR +ET GEDP + ++Y+ G+QG+T Q A KHF A+
Sbjct: 147 RDPRWGRCEETYGEDPYLVASMGLAYITGLQGET----------QLVATAKHFAAH---G 193
Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
+ R V ++L +T+ PFE VK G+ IM AY+ ++G+P + LL+
Sbjct: 194 FPEGGRNIAQVHVGNRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGVPCHGNPQLLTNI 253
Query: 285 ARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSF--LQKHTKAA 342
R++WGF G + SD D + + A + +A + L++G+D+ + + A
Sbjct: 254 LRQEWGFDGIVVSDYDGIRQLEAIHKVASNKMEAAILALESGVDIEFPTIDCYGEPLVTA 313
Query: 343 VKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGA--DVVCSPAHQVLALQAA 400
+K+ + E+ IDRA+ + ++ RLGL + PF A + + + LAL+AA
Sbjct: 314 IKEGLVSEAIIDRAVERVLRIKERLGLLD-----NPFVDESAVPERLDDRKSRELALKAA 368
Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA-------GPSCRSITPLQ 453
++ IVLLKN + +LPL K+ + +A+IGPNAN + +LG+Y +T LQ
Sbjct: 369 RESIVLLKNENNMLPLSKNIN-KIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQ 427
Query: 454 ALQNYVEN--TVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLD------------- 498
+ V +Y GCD S +A++IAK AD ++ +MG
Sbjct: 428 GIAKKVGEGKVLYAKGCDIAGESKEGFSEAIEIAKQADVIIAVMGEKSGLPLSWTDIPSE 487
Query: 499 ------QTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRN 552
Q E DR L L G Q+EL+ + + KP+ILVL+ G P + +
Sbjct: 488 EEFKKYQAVTGEGNDRASLRLLGVQEELLKELYKTG-KPIILVLINGRP--LVLSPIINY 544
Query: 553 IGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNP 612
+ +I+ A +PGE G A+A++IFGD+NP GRLP+T +P D ++P+ + +P +
Sbjct: 545 VKAIIEAWFPGEEGGNAIADIIFGDYNPSGRLPIT-FPMDTGQIPLYYSR-KPSSF---- 598
Query: 613 GRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPE 672
R Y +F FG GLSY+++ Y V+
Sbjct: 599 -RPYVMLHSSPLFTFGYGLSYTQFEYSNLEVTPK-------------------------- 631
Query: 673 LGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEK 732
E + + VKN G M G V L++ + RP+K+L GF V L EK
Sbjct: 632 ---EVGPLSYITILLDVKNVGNMEGDEVVQLYISKSFSSVARPVKELKGFAKVHLKPGEK 688
Query: 733 AEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGD 767
+ F L P E+L+ +V+E+G + +++G+
Sbjct: 689 RRVKFAL-PMEALAFYDNFMRLVVEKGEYQILIGN 722
>gi|380692997|ref|ZP_09857856.1| beta-glucosidase [Bacteroides faecis MAJ27]
Length = 837
Score = 282 bits (722), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 164/424 (38%), Positives = 241/424 (56%), Gaps = 34/424 (8%)
Query: 51 PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
PI +R +DL+S+LT++EK+S L ++P I R+GI Y +EALHG+ GK
Sbjct: 20 PIHERVQDLLSKLTIEEKVSLLRATSPGIERMGIDKYYMGNEALHGIIRPGK-------- 71
Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ---------AIGMTFWAPNI 161
T FPQ I A+ ++ L + I I EARA +N + + +TFW+P +
Sbjct: 72 --FTVFPQAIGLASMWNPELHHIIAGVISDEARARWNELERGKKQKDQFSDLLTFWSPTV 129
Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
N+ RDPRWGR ET GEDP ++G ++V+G+QGD L+A A KHF A
Sbjct: 130 NMARDPRWGRTPETYGEDPYLSGVLGTAFVKGLQGDH------PRYLKAVATPKHFAA-- 181
Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
N + R+ DA +T DL + Y P FE C+++G+A IM AYN +NG+P A+ LL
Sbjct: 182 --NNEEHNRFYCDAAITETDLREYYFPAFEKCIREGKAESIMTAYNAINGVPCTANNWLL 239
Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKH-TK 340
+K ++ WGF+GYI SDC A ++ Y K+PE A + +KAG+DV CG ++ +
Sbjct: 240 NKVLKQDWGFNGYIVSDCGAPGLLMTDHRYVKTPEAAAMIAIKAGLDVECGDYVFANPLL 299
Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
A KQ + +EID A + + RMRLG+F+ +P P+ + ++V H LAL+AA
Sbjct: 300 NAYKQYMVSAAEIDSAAYRVLRARMRLGMFD-DPEKNPYNHLSPEIVGCKKHHDLALEAA 358
Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQALQNYV 459
+ IVLLKN LPL K S+A++G NA A G+Y+G P ++ L ++N V
Sbjct: 359 RQSIVLLKNQQNTLPLNAQKIKSIAVVGINA--ANCEFGDYSGTPVNAPVSVLDGIRNRV 416
Query: 460 ENTV 463
N +
Sbjct: 417 GNEI 420
Score = 124 bits (311), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 85/286 (29%), Positives = 136/286 (47%), Gaps = 43/286 (15%)
Query: 481 AVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGG 540
A I + +D V+ +MG++Q+ E+E DR + LP + Q++ R A A I+VL+ G
Sbjct: 581 ASKIIRESDVVIAVMGINQSIEREGQDRNSIELP-KDQQIFIREAYKANPNTIVVLVAGS 639
Query: 541 PVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTD 600
+ I + D++I +I+ A YPGE G A+AEV+FGD+NP GRLP+T+Y + D
Sbjct: 640 SMAIGWM--DQHIPAIIDAWYPGEQGGTAIAEVLFGDYNPAGRLPLTFYNS------IED 691
Query: 601 MKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVE 660
+ N RTY ++EGK ++ FG GLSY+K+ Y+ + Q
Sbjct: 692 LPAFDDYNVKN-NRTYMYFEGKPLYAFGYGLSYTKFDYRNLNIKQ--------------- 735
Query: 661 NQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLV 720
+T+ + +KN G+ G ++VK +G P+KQL
Sbjct: 736 ------------------DTQNVTLNFSIKNSGKYNGDEVAQVYVKFPDQGIKTPLKQLK 777
Query: 721 GFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
GF+ V + +I E+ E + GT+ +VG
Sbjct: 778 GFKRVHIKKGATEQISIEIPKEELRLWDDQKKQFYTPSGTYHFMVG 823
>gi|110737298|dbj|BAF00595.1| xylosidase [Arabidopsis thaliana]
Length = 303
Score = 282 bits (722), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 138/232 (59%), Positives = 168/232 (72%), Gaps = 12/232 (5%)
Query: 8 LVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDE 67
LVF LLCL +S +S +P F+CDP+N T T FC+ +PI R +DL+ RLTL E
Sbjct: 18 LVF-LLCLVHSS-----ESLRPLFACDPANGLTRTLRFCRANVPIHVRVQDLLGRLTLQE 71
Query: 68 KISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFD 127
KI LVN+A A+PRLGI YEWWSEALHG++ VG G F G GATSFPQVI TAASF+
Sbjct: 72 KIRNLVNNAAAVPRLGIGGYEWWSEALHGISDVGPGAKFGGAFPGATSFPQVITTAASFN 131
Query: 128 SYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYA 187
LW IG+ + EARA+YN G A G+T+W+PN+NI RDPRWGRGQETPGEDP+V KYA
Sbjct: 132 QSLWEEIGRVVSDEARAMYNGGVA-GLTYWSPNVNILRDPRWGRGQETPGEDPIVAAKYA 190
Query: 188 VSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTM 239
SYVRG+QG T G +LK +ACCKH+TAYDLDNW G R+ F+A+V +
Sbjct: 191 ASYVRGLQG-TAAGNRLK----VAACCKHYTAYDLDNWNGVDRFHFNAKVNL 237
>gi|218132025|ref|ZP_03460829.1| hypothetical protein BACEGG_03650 [Bacteroides eggerthii DSM 20697]
gi|217985785|gb|EEC52125.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
eggerthii DSM 20697]
Length = 888
Score = 282 bits (721), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 168/434 (38%), Positives = 233/434 (53%), Gaps = 38/434 (8%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHG-VAGVG 101
+PF L + +R +L+S LTL EK+S ++N + AI RLGIPAY WW EA HG +AG
Sbjct: 28 YPFQNPKLKVEERVDNLMSLLTLQEKVSMMINRSLAIERLGIPAYNWWGEACHGLIAG-- 85
Query: 102 KGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------- 153
G T FPQ I AA+FD + EARA YN G
Sbjct: 86 ----------GVTVFPQSIALAATFDDSSQLTTYTMVSDEARARYNTLPLDGDIGPYVSA 135
Query: 154 ---MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQA 210
+TFWAPNINIFRDPRWGRGQET GEDP + + ++ V G+QGD + +
Sbjct: 136 IPNLTFWAPNINIFRDPRWGRGQETYGEDPFLMSRMGLNVVLGMQGDD------EHYYKT 189
Query: 211 SACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVN 270
AC KH Y + + R++F+A V+M+DL +TY P FE+ V +G +MCAY+
Sbjct: 190 HACAKH---YGVHSGPEPLRHEFNAVVSMRDLWETYLPAFETLVVKGNVREVMCAYSAYE 246
Query: 271 GIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMD 328
G P CA LL R +WGF G + SDCDA++ Y + P A D + G D
Sbjct: 247 GEPCCASNRLLVDILRNRWGFDGMVVSDCDAINDFYVKGRHETHPDAAAASADAVLTGTD 306
Query: 329 VNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVC 388
+ CG + AV++ + E ++D +L + + R RLGL + + P+ I V+
Sbjct: 307 LECGRSYNALIE-AVEKGIIKEQDLDVSLRRILTERFRLGLLDPAKYV-PYSTIPGSVID 364
Query: 389 SPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS 448
H+ AL+ A + VLLKN +LPL K+ S+A++GPN N + + GNY+G
Sbjct: 365 CQEHRDHALKMAHESQVLLKNEGNILPLDKNIK-SIAIVGPNINDSIMMRGNYSGSPTHC 423
Query: 449 ITPLQALQNYVENT 462
IT LQ L+N + NT
Sbjct: 424 ITILQGLKNKLPNT 437
Score = 103 bits (257), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 89/299 (29%), Positives = 135/299 (45%), Gaps = 44/299 (14%)
Query: 460 ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQE 519
E +V + D A ++ + K + ++ + G+ TQE E +R + LP Q+
Sbjct: 603 EGSVSHLSVDMYERKLAVFEELKEKIKDVEAIIYVGGITPTQEGEGHERAKIELPDVQKR 662
Query: 520 LITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHN 579
+ + E K PVI V C G I A D ++L A YPG+ G A+A+V+FGD+N
Sbjct: 663 FLKAMHETGK-PVIYVN-CSGSA-IALADIDYAYDALLQAWYPGQEGGTAVADVLFGDYN 719
Query: 580 PGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYK 639
P G+LP+T+Y TD M RTYR+++G+ + FG GLSY+ + +
Sbjct: 720 PSGKLPVTFYKSTEQLPEFTDYSME--------NRTYRYFKGEPQYAFGYGLSYTDFEFG 771
Query: 640 FKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKH 699
+S SSS K E K +TI + N G+M G
Sbjct: 772 EALLS--------SSSIKAGE---------------------KVEITIPLTNVGKMDGAE 802
Query: 700 PVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLS--RAREDGLMVI 756
V ++VK + PIK L G+ + A + ++ L P ES + A DGL V
Sbjct: 803 VVQVYVKSLTNPDA-PIKSLKGYVRQEIKAGKSEKVRITLEP-ESFAYYNADVDGLKVF 859
>gi|373956830|ref|ZP_09616790.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
gi|373893430|gb|EHQ29327.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
Length = 823
Score = 282 bits (721), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 232/795 (29%), Positives = 369/795 (46%), Gaps = 132/795 (16%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----W-------S 91
+ +T PI R DL+ ++TL+EK QL I + +P EW W
Sbjct: 73 YEDSTQPIEARLNDLIGQMTLEEKTCQLATLYGYKRILKDSVPTPEWKNEIWKDGIANID 132
Query: 92 EALHGVAGVGK------------------------------GI---FFNGTIRG-----A 113
E L+G GK GI F N IRG A
Sbjct: 133 EHLNGFITWGKTSDLPLVTDVKKHVWAMNQTQRFFIEQTRLGIPVDFTNEGIRGVEAYQA 192
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRG 172
T+FP + ++D L ++G G+EARAL G T +AP +++ RD RWGR
Sbjct: 193 TAFPTQLNMGMTWDKPLVNQMGNITGMEARAL-------GYTNVYAPILDVARDQRWGRL 245
Query: 173 QETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYK 232
+E GEDP + + V +G+Q + Q +A KHF Y + +
Sbjct: 246 EEVYGEDPYLVARLGVEMAKGMQQNN----------QIAATAKHFAVYSANKGGREGLAR 295
Query: 233 FDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFH 292
D +V +++ + PF+ +K+ G+M +YN +GIP L + R+++GF
Sbjct: 296 TDPQVAPREVENILLYPFKKVIKEAGLMGVMSSYNDYDGIPISGSSYWLIQRLRQEFGFK 355
Query: 293 GYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG----SFLQKHTKAAVKQKKL 348
GY+ SD DA+ +Y+ A +DAV AGM+V + + + VK+ KL
Sbjct: 356 GYVVSDSDALEYLYNKHHVAADLKDAVYQAFMAGMNVRTTFRTPDSIIIYARQLVKEGKL 415
Query: 349 PESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLK 408
P I+ + ++ V+ +LGLF+ +P +Q + A +V A+Q +ALQA+++ IVLLK
Sbjct: 416 PIDTINSRVRDVLRVKFKLGLFD-HPYVQD-AEASAKLVNCAANQAVALQASKESIVLLK 473
Query: 409 NSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV--ENTVYYP 466
N +LPL SK +LA+IGPNA + +Y + +SI L+ +Q V +Y
Sbjct: 474 NKGAILPL--SKQQTLAVIGPNALNDDYAHTHYGPLASKSINILEGIQAKVGAGKVLYAL 531
Query: 467 GCDTV---------------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDL 511
GC+ V A ID AV IA+ AD V+++G + E R L
Sbjct: 532 GCNLVDKHWPESEILPQDPDQAEQAKIDSAVTIARHADVAVVVLGGNTQTAGENKSRTSL 591
Query: 512 VLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALA 571
LPG Q L+ + +A KPV++VL+ P+ I + D++I I++AGYPG G A+A
Sbjct: 592 DLPGYQLRLV-KAVKATGKPVVVVLIGSQPMTINWI--DQHIDGIIYAGYPGTQGGTAVA 648
Query: 572 EVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGL 631
+V+FGD+NPGG+L +T +P+ ++P + +P + + + G + ++PFG GL
Sbjct: 649 DVLFGDYNPGGKLTLT-FPKSVGQLPF-NFPTKPNSET-DEGELAKI--KGLLYPFGFGL 703
Query: 632 SYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKN 691
SY+ ++Y +S + +Q + T V+ V N
Sbjct: 704 SYTTFAYSDLKISP-AIQSDQGNVT----------------------------VSCKVTN 734
Query: 692 HGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRARED 751
G++AG V L+++ K L GF + L E E++F + P + L
Sbjct: 735 TGKVAGDEVVQLYLRDVLSTVTTYEKVLRGFDRLSLKPGETKEVMFTIVP-DDLKLYNRQ 793
Query: 752 GLMVIEEGTHFLVVG 766
V+E G ++VG
Sbjct: 794 MKYVVEPGEFKVMVG 808
>gi|373460605|ref|ZP_09552356.1| hypothetical protein HMPREF9944_00620 [Prevotella maculosa OT 289]
gi|371955223|gb|EHO73027.1| hypothetical protein HMPREF9944_00620 [Prevotella maculosa OT 289]
Length = 858
Score = 281 bits (720), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 167/467 (35%), Positives = 250/467 (53%), Gaps = 36/467 (7%)
Query: 13 LCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQL 72
+ L F +++T + P + + +P+ L RA+DL+SRLTL+EK +
Sbjct: 7 ILLSFVTVMTASSALAQP------SLTGMAYPYQNPNLSALTRAQDLLSRLTLEEKALLM 60
Query: 73 VNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWY 132
++ +PAIPRLGI + WWSEALHG A +G T FP+ I AASF+ L Y
Sbjct: 61 LDESPAIPRLGIKKFFWWSEALHGAANMGN----------VTVFPEPIAMAASFNDALLY 110
Query: 133 RIGQAIGLEARALY-----NAGQA---IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTG 184
++ A E RA Y N G+ ++ W PN+NIFRDPRWGRGQET GEDP +T
Sbjct: 111 KVFSAASDEMRAQYHHRIRNGGEDEKFHSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTA 170
Query: 185 KYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFD-ARVTMQDLA 243
+ VRG+QG + + + AC KH+ + + TR+ + V+ +DL
Sbjct: 171 VMGTAVVRGLQGPEDSKYR-----KLWACAKHYAVHSGPEY---TRHTANLNNVSPRDLW 222
Query: 244 DTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVS 303
+TY P F++ V++ + +MCAY ++ P C + LL + R +WGF + SDC AVS
Sbjct: 223 ETYLPAFKTLVEEAKVREVMCAYQALDDEPCCGNSRLLQQILRDEWGFQYLVVSDCGAVS 282
Query: 304 IIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTKAAVKQKKLPESEIDRALHNLFS 362
I+ + A AG DV CG ++ K AV++ + E E+D+ + L
Sbjct: 283 DIWQNHKTSSDAVHATAKAALAGTDVECGFNYTYKCIPEAVQRGLISEKEVDKHVLRLLE 342
Query: 363 VRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSV 422
R LG + +P + P+ KI V+ S AH+ L+L A+ IVLL+N +LPL K+
Sbjct: 343 GRFDLGEMD-DPALVPWSKIPYSVMDSKAHRQLSLDMARQSIVLLQNKQNMLPLKKNNE- 400
Query: 423 SLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCD 469
+A+IGPNA++ + GNY G R++T L ++ +N Y GCD
Sbjct: 401 RIAVIGPNADNVPMMWGNYNGTPNRTVTILDGIRAKHKNVKYIKGCD 447
Score = 92.8 bits (229), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 82/301 (27%), Positives = 132/301 (43%), Gaps = 60/301 (19%)
Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
KG + VV + G+ E EE+ DR D+ LP Q++ I + A K+ ++
Sbjct: 606 KGIEKVVFVGGISPALEGEEMPVDIPGFKGGDRTDIELPRVQRDFIKALHAAGKQ--LVY 663
Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK 595
+ C G I +I+ A Y G+ G A+A+V+FGD+NP G+LP+T+Y
Sbjct: 664 VNCSGSA-IALEPETTACDAIVQAWYAGQEGGTAVADVLFGDYNPSGKLPVTFYKNSNQL 722
Query: 596 VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSS 655
+ M+ GRTYR++ +F FG GLSY+ ++ +
Sbjct: 723 PDYENYSMK--------GRTYRYF-SDPLFAFGHGLSYTTFNM---------------GT 758
Query: 656 TKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRP 715
++++ D + +V I V+N G G VLL++K + NG P
Sbjct: 759 AEIIKKADSI------------------VVRIPVENVGSKDGTETVLLYIKNHQDPNG-P 799
Query: 716 IKQLVGFQSVILNAKEK--AEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPIS 773
IK L GF V + A K AE++ E + +EG + L+ GD+ I
Sbjct: 800 IKSLRGFSRVFVKAGHKAVAELLLTRKSFEFFD--ENTNTVHFKEGNYDLLYGDKRVEIR 857
Query: 774 I 774
+
Sbjct: 858 M 858
>gi|354582345|ref|ZP_09001247.1| glycoside hydrolase family 3 domain protein [Paenibacillus lactis
154]
gi|353199744|gb|EHB65206.1| glycoside hydrolase family 3 domain protein [Paenibacillus lactis
154]
Length = 765
Score = 281 bits (720), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 211/732 (28%), Positives = 349/732 (47%), Gaps = 103/732 (14%)
Query: 67 EKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASF 126
E ++++ A RLGIP E HG +G AT FP + +++
Sbjct: 89 EAVNEIQRYAVEHSRLGIPIL-IGEECSHGHMAIG-----------ATVFPVPLSLGSTW 136
Query: 127 DSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKY 186
++ L+ + +A+ LE R+ G ++P +++ RDPRWGR +E GEDP + G++
Sbjct: 137 NTELYREMCRAVALETRS------QGGAVTYSPVLDVVRDPRWGRTEECFGEDPYLIGEF 190
Query: 187 AVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTY 246
A + V G+QG++ L G+ +A KHF Y R + ++L +
Sbjct: 191 AAASVEGLQGES-----LDGEASVAATLKHFVGYGSSE---GGRNAGPVHMGTRELMEVD 242
Query: 247 QPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIY 306
PF+ V+ G AS IM AYN ++G+P + LL R++WGF G + +DC A++++
Sbjct: 243 MYPFKKAVEAGAAS-IMPAYNEIDGVPCTVNEELLDGVLRKEWGFDGMVITDCGAINMLA 301
Query: 307 DAEGYAKSPEDAVVDVLKAGMDVN-CGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRM 365
A+ DA V + AG+D+ G + + AV++K+L S +D A+ + +++
Sbjct: 302 AGHDTAEDGMDAAVSAISAGIDMEMSGEMFGMYLERAVQEKRLDVSVLDEAVRRVLTLKF 361
Query: 366 RLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLA 425
+LGLF NP P V+ H+ +A Q A +GIVLLKN LPL K V +A
Sbjct: 362 KLGLFE-NPYADP--ARAEQVIGCSRHREMARQLAAEGIVLLKNEGSTLPLSKEDGV-IA 417
Query: 426 LIGPNANSAKTLLGNYAGPS--CRSITPLQALQNYVEN----TVYYPGCDTVACSSASID 479
+IGPNA+ LG+Y P R +T L+ ++ + +Y PGC S +
Sbjct: 418 VIGPNADQGYNQLGDYTSPQPPSRVVTVLEGIRAKLGGDKGRVLYAPGCRINGDSREGFE 477
Query: 480 KAVDIAKGADHVVLMMGLDQTQE-------------------------KEELDRVDLVLP 514
A+ A AD VVL++G ++ E +DR+ L L
Sbjct: 478 LALSCAGQADTVVLVLGGSSARDFGEGTIDLRTGASKVTGNDWSDMDCGEGIDRMTLQLS 537
Query: 515 GRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVI 574
G Q EL + + K+ +++V + G P+ + DR+ +IL A YPG+ G A+A+++
Sbjct: 538 GVQLELAREIHKLGKR-LVVVYINGRPIAEPW--IDRHADAILEAWYPGQEGGHAVADIL 594
Query: 575 FGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYS 634
FGD NP G+L ++ P+ ++P+ R + G+ Y + + +PFG GLSY+
Sbjct: 595 FGDVNPSGKLTIS-IPKHVGQLPVYYNGKRSR------GKRYLEEDSQPQYPFGYGLSYT 647
Query: 635 KYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGE 694
++ Y V+ + +++ +VT+ V+N G
Sbjct: 648 EFRYSDLQVTPQTIRTGETA-----------------------------VVTVNVENSGS 678
Query: 695 MAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLM 754
+AG V L++ A RP K+L GF+ + L EK I F + P E L ++
Sbjct: 679 VAGAEVVQLYINDAASRFTRPAKELKGFRKIYLEPGEKQRIEFTVGP-EQLQYIGQNYQP 737
Query: 755 VIEEGTHFLVVG 766
V+E G ++VG
Sbjct: 738 VVEPGLFRVMVG 749
>gi|390945417|ref|YP_006409177.1| beta-glucosidase-like glycosyl hydrolase [Alistipes finegoldii DSM
17242]
gi|390421986|gb|AFL76492.1| beta-glucosidase-like glycosyl hydrolase [Alistipes finegoldii DSM
17242]
Length = 771
Score = 281 bits (720), Expect = 8e-73, Method: Compositional matrix adjust.
Identities = 217/719 (30%), Positives = 329/719 (45%), Gaps = 99/719 (13%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIP + EA HG +G AT+FP A++++ L R+G+ I
Sbjct: 120 RLGIPLF-LAEEAPHGHMAIG-----------ATTFPTAPGQASTWNPELIERMGKVIAA 167
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
E R G + P ++I RDPRW R +E+ GED +T + +YVRG
Sbjct: 168 EIRL------QGGHICYGPVLDIVRDPRWSRTEESYGEDCYLTARIGEAYVRGT-----G 216
Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
G L A + KHF AY + ++L +TY PPFE+ VK G A
Sbjct: 217 SGDLSQSRHALSTLKHFIAYGASEGGQNGGSNL---LGERELRETYLPPFEAAVKAG-AR 272
Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
+M AYN V+GIP A+R +L+ R +WGF G++ SD ++ +++ G A S +A V
Sbjct: 273 SVMTAYNSVDGIPCTANRRMLTDILRGEWGFDGFVVSDLLSIEGLHETHGVAGSVREAAV 332
Query: 321 DVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFG 380
L+AG+D + + A + + E+EIDRA+ + +++ +GLF NP +
Sbjct: 333 QALRAGVDADLKGGAFASLREAAEAGDVAEAEIDRAVERVLALKFEMGLFE-NPYID--- 388
Query: 381 KIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGN 440
+ A V AH LAL+AA+ + LL+N G LPL + +A+IGPNA++ LG+
Sbjct: 389 EAAAAEVGCAAHSELALEAARQSVTLLENRSGTLPLDPRRLRRVAVIGPNADNIYNQLGD 448
Query: 441 YAGPSCRSITPLQALQNYV--ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLD 498
Y + T L+ + + VY GC + I AV A+G D V+++G
Sbjct: 449 YTAQQTAANTVRDGLEKLLGRDRVVYSRGCTVRGGDRSEIAAAVSAARGTDAAVVVIGGS 508
Query: 499 QTQE-----------------------KEELDRVDLVLPGRQQELITRVAEAAKKPVILV 535
++ E DR L L G Q+EL+ R+ +A P+I+V
Sbjct: 509 SARDFDTEFLQTGAAKAAHDEVRDMECGEGFDRATLALLGEQEELLRRI-KATGTPLIVV 567
Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK 595
+ G P+D+ A + ++L A YPG G A+AE I G +NP GRLP+T P+ +
Sbjct: 568 CIAGRPLDLRRASEQAD--ALLMAWYPGARGGDAVAETILGHNNPAGRLPIT-IPRAEGQ 624
Query: 596 VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSS 655
+P+ K RP Y ++PFG GLSYS + Y L QS
Sbjct: 625 IPVYYNKKRP------ANHDYTDLTAAPLYPFGYGLSYSTFEY-------GSLEARQS-- 669
Query: 656 TKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRP 715
G E V+ ++N + G V L++ RP
Sbjct: 670 ------------------GDNVLE-----VSCRIRNTSDREGDEVVQLYISDMVASTVRP 706
Query: 716 IKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
+QL GF+ + L E+ ++ F L E+LS G V+E+G + VG I +
Sbjct: 707 PRQLGGFRRIRLAPGEQRQVSFTLGD-EALSLIDPQGRRVVEKGDFVIAVGSSSQDIRL 764
>gi|317474349|ref|ZP_07933623.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
gi|316909030|gb|EFV30710.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
Length = 877
Score = 281 bits (720), Expect = 8e-73, Method: Compositional matrix adjust.
Identities = 168/434 (38%), Positives = 233/434 (53%), Gaps = 38/434 (8%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHG-VAGVG 101
+PF L + +R +L+S LTL EK+S ++N + AI RLGIPAY WW EA HG +AG
Sbjct: 17 YPFQNPKLKVEERVDNLMSLLTLQEKVSMMINRSLAIERLGIPAYNWWGEACHGLIAG-- 74
Query: 102 KGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------- 153
G T FPQ I AA+FD + EARA YN G
Sbjct: 75 ----------GVTVFPQSIALAATFDDSSQLTTYTMVSDEARARYNTLPLDGDIGPYVSA 124
Query: 154 ---MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQA 210
+TFWAPNINIFRDPRWGRGQET GEDP + + ++ V G+QGD + +
Sbjct: 125 IPNLTFWAPNINIFRDPRWGRGQETYGEDPFLMSRMGLNVVLGMQGDD------EHYYKT 178
Query: 211 SACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVN 270
AC KH Y + + R++F+A V+M+DL +TY P FE+ V +G +MCAY+
Sbjct: 179 HACAKH---YGVHSGPEPLRHEFNAVVSMRDLWETYLPAFETLVVKGNVREVMCAYSAYE 235
Query: 271 GIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMD 328
G P CA LL R +WGF G + SDCDA++ Y + P A D + G D
Sbjct: 236 GEPCCASNRLLVDILRNRWGFDGMVVSDCDAINDFYVKGRHETHPDAAAASADAVLTGTD 295
Query: 329 VNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVC 388
+ CG + AV++ + E ++D +L + + R RLGL + + P+ I V+
Sbjct: 296 LECGRSYNALIE-AVEKGIIKEQDLDVSLRRILTERFRLGLLDPAKYV-PYSTIPGSVID 353
Query: 389 SPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS 448
H+ AL+ A + VLLKN +LPL K+ S+A++GPN N + + GNY+G
Sbjct: 354 CQEHRDHALKMAHESQVLLKNEGNILPLDKNIK-SIAVVGPNINDSIMMRGNYSGSPTHC 412
Query: 449 ITPLQALQNYVENT 462
IT LQ L+N + NT
Sbjct: 413 ITILQGLKNKLPNT 426
Score = 103 bits (257), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 89/299 (29%), Positives = 135/299 (45%), Gaps = 44/299 (14%)
Query: 460 ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQE 519
E +V + D A ++ + K + ++ + G+ TQE E +R + LP Q+
Sbjct: 592 EGSVSHLSVDMYERKLAVFEELKEKIKDVEAIIYVGGITPTQEGEGHERAKIELPDVQKR 651
Query: 520 LITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHN 579
+ + E K PVI V C G I A D ++L A YPG+ G A+A+V+FGD+N
Sbjct: 652 FLKAMHETGK-PVIYVN-CSGSA-IALADIDYAYDALLQAWYPGQEGGTAVADVLFGDYN 708
Query: 580 PGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYK 639
P G+LP+T+Y TD M RTYR+++G+ + FG GLSY+ + +
Sbjct: 709 PSGKLPVTFYKSTEQLPEFTDYSME--------NRTYRYFKGEPQYAFGYGLSYTDFEFG 760
Query: 640 FKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKH 699
+S SSS K E K +TI + N G+M G
Sbjct: 761 EALLS--------SSSIKAGE---------------------KVEITIPLTNVGKMDGAE 791
Query: 700 PVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLS--RAREDGLMVI 756
V ++VK + PIK L G+ + A + ++ L P ES + A DGL V
Sbjct: 792 VVQVYVKSLTNPDA-PIKSLKGYVRQEIKAGKSEKVRITLEP-ESFAYYNADVDGLKVF 848
>gi|299141953|ref|ZP_07035087.1| beta-glucosidase [Prevotella oris C735]
gi|298576415|gb|EFI48287.1| beta-glucosidase [Prevotella oris C735]
Length = 858
Score = 281 bits (719), Expect = 9e-73, Method: Compositional matrix adjust.
Identities = 167/467 (35%), Positives = 250/467 (53%), Gaps = 36/467 (7%)
Query: 13 LCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQL 72
+ L F +++T + P + + +P+ L RA+DL+SRLTL+EK +
Sbjct: 7 ILLSFVTVMTASSALAQP------SLTGMAYPYQNPNLSALTRAQDLLSRLTLEEKALLM 60
Query: 73 VNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWY 132
++ +PAIPRLGI + WWSEALHG A +G T FP+ I AASF+ L Y
Sbjct: 61 LDESPAIPRLGIKKFFWWSEALHGAANMGN----------VTVFPEPIAMAASFNDALLY 110
Query: 133 RIGQAIGLEARALY-----NAGQA---IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTG 184
++ A E RA Y N G+ ++ W PN+NIFRDPRWGRGQET GEDP +T
Sbjct: 111 KVFSAASDEMRAQYHHRIRNGGEDEKFHSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTA 170
Query: 185 KYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFD-ARVTMQDLA 243
+ VRG+QG + + + AC KH+ + + TR+ + V+ +DL
Sbjct: 171 VMGTAVVRGLQGPEDSKYR-----KLWACAKHYAVHSGPEY---TRHTANLNNVSPRDLW 222
Query: 244 DTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVS 303
+TY P F++ V++ + +MCAY ++ P C + LL + R +WGF + SDC AVS
Sbjct: 223 ETYLPAFKTLVEEAKVREVMCAYQALDDEPCCGNSRLLQQILRDEWGFQYLVVSDCGAVS 282
Query: 304 IIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTKAAVKQKKLPESEIDRALHNLFS 362
I+ + A AG DV CG ++ K AV++ + E E+D+ + L
Sbjct: 283 DIWQNHKTSSDAVHATAKAALAGTDVECGFNYTYKCIPEAVQRGLISEKEVDKHVLRLLE 342
Query: 363 VRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSV 422
R LG + +P + P+ KI V+ S AH+ L+L A+ IVLL+N +LPL K+
Sbjct: 343 GRFDLGEMD-DPALVPWSKIPYSVMDSKAHRQLSLDMARQSIVLLQNKQNMLPLKKNNE- 400
Query: 423 SLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCD 469
+A+IGPNA++ + GNY G R++T L ++ +N Y GCD
Sbjct: 401 RIAVIGPNADNVPMMWGNYNGTPNRTVTILDGIRAKHKNVKYIKGCD 447
Score = 92.8 bits (229), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 82/301 (27%), Positives = 132/301 (43%), Gaps = 60/301 (19%)
Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
KG + VV + G+ E EE+ DR D+ LP Q++ I + A K+ ++
Sbjct: 606 KGIEKVVFVGGISPALEGEEMPVDIPGFKGGDRTDIELPRVQRDFIKALHAAGKQ--LVY 663
Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK 595
+ C G I +I+ A Y G+ G A+A+V+FGD+NP G+LP+T+Y
Sbjct: 664 VNCSGSA-IALEPETTACDAIVQAWYAGQEGGTAVADVLFGDYNPSGKLPVTFYKNSNQL 722
Query: 596 VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSS 655
+ M+ GRTYR++ +F FG GLSY+ ++ +
Sbjct: 723 PDYENYSMK--------GRTYRYF-SDPLFAFGHGLSYTTFNM---------------GT 758
Query: 656 TKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRP 715
++++ D + +V I V+N G G VLL++K + NG P
Sbjct: 759 AEIIKKADSI------------------VVRIPVENVGSKDGTETVLLYIKNHQDPNG-P 799
Query: 716 IKQLVGFQSVILNAKEK--AEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPIS 773
IK L GF V + A + AE+V E + +EG + L+ GD+ I
Sbjct: 800 IKSLRGFSRVFVKAGHQAVAELVLTRKSFEFFD--ENTNTVHFKEGNYDLLYGDKRVEIR 857
Query: 774 I 774
+
Sbjct: 858 M 858
>gi|380696428|ref|ZP_09861287.1| beta-glucosidase [Bacteroides faecis MAJ27]
Length = 851
Score = 281 bits (718), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 167/424 (39%), Positives = 243/424 (57%), Gaps = 34/424 (8%)
Query: 51 PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
P+ +R DL+SRLT++EKIS L ++P IPRLGI Y +EALHGV G+
Sbjct: 34 PVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPGR-------- 85
Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA---GQAIG------MTFWAPNI 161
T FPQ I AA+++ L R+ I EARA +N G+A +TFW+P +
Sbjct: 86 --FTVFPQAIGLAATWNPELQRRVATVISDEARARWNELDQGRAQKEQFSDVLTFWSPTV 143
Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
N+ RDPRWGR ET GEDP ++G ++V+G+QGD + L+ + KHF A
Sbjct: 144 NMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQGDDPH------YLKIVSTPKHFAA-- 195
Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
N + R+ + +++ + L + Y P FE CVK+G+A+ IM AYN +N +P + LL
Sbjct: 196 --NNEEHNRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMSAYNALNDVPCTLNAWLL 253
Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTK 340
K R+ WGF GY+ SDC +++ +A Y K+ E A LKAG+D+ CG
Sbjct: 254 QKVLRKDWGFQGYVVSDCGGPALLVNAHKYLKTKEAAATLSLKAGLDLECGDDVYDGPLL 313
Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
A KQ + +++ID A +++ + RM+LGLF+G P+ KI V+ S HQ +AL AA
Sbjct: 314 NAYKQYMVSDADIDSAAYHVLTARMKLGLFDG-VERNPYTKISPSVIGSKEHQQIALDAA 372
Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQALQNYV 459
+ IVLLKN +LPL SK S+A++G NA K G+Y+G P ++ LQ ++N V
Sbjct: 373 RQCIVLLKNQKNMLPLNASKLKSIAVVGINA--GKCEFGDYSGAPVVEPVSILQGIRNRV 430
Query: 460 ENTV 463
+ V
Sbjct: 431 GDQV 434
Score = 121 bits (303), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 80/278 (28%), Positives = 137/278 (49%), Gaps = 43/278 (15%)
Query: 489 DHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAK 548
+ V+ +MG++++ E+E DR D+ LP Q+E + + + +I++L+ G + I +
Sbjct: 603 ETVIAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNSN-MIVILVAGSSLAINWM- 660
Query: 549 YDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQAT 608
D ++ +I+ A YPGE G A+AEV+FGD+NP GRLP+T+Y P D +
Sbjct: 661 -DEHVPAIVNAWYPGEQGGTAVAEVLFGDYNPAGRLPLTYYKSLDELPPFDDYDI----- 714
Query: 609 SGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYK 668
GRTY++++G+ ++PFG GLSYS + Y S ++ + D V
Sbjct: 715 --TKGRTYKYFKGEVLYPFGYGLSYSSFKY---------------SDLRVKDEADEV--- 754
Query: 669 SVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILN 728
V+ +KN G+ G ++V+ G P+K+L GF+ V L
Sbjct: 755 ---------------AVSFRLKNTGKRNGDEVTQVYVRIPETGGIVPVKELKGFRRVPLK 799
Query: 729 AKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
+ E + L+ + G V+ +GT ++VG
Sbjct: 800 SGESRRVEIRLNKEQLRYWDVGKGQFVVPKGTFDIMVG 837
>gi|217968103|ref|YP_002353609.1| glycoside hydrolase family 3 [Dictyoglomus turgidum DSM 6724]
gi|217337202|gb|ACK42995.1| glycoside hydrolase family 3 domain protein [Dictyoglomus turgidum
DSM 6724]
Length = 756
Score = 280 bits (717), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 221/674 (32%), Positives = 338/674 (50%), Gaps = 88/674 (13%)
Query: 72 LVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLW 131
L NS IP L EALHG +G+T FPQ I A++++ L
Sbjct: 124 LENSEIKIPVL------IHDEALHGCMA-----------KGSTIFPQAIGMASTWNPELI 166
Query: 132 YRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYV 191
Y++ AIG E R+ + I +P INI RDPR GR +ET GEDP + + AV+Y+
Sbjct: 167 YQVATAIGKETRS-----RGIHQVL-SPTINIARDPRCGRTEETYGEDPYLASRMAVAYI 220
Query: 192 RGVQGDTFNGGKLKGKLQASACCKHFTA-YDLDNWKGTTRYKFDARVTMQDLADTYQPPF 250
+GVQ + A KHF A + D + + F R+ L + Y P F
Sbjct: 221 KGVQ-----------EQGVIATPKHFAANFVGDGGRDSYPIHFSERL----LREVYFPAF 265
Query: 251 ESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEG 310
++ +K+ A +M AYN ++GIP +++ LL+ R++WGF GY+ SD +V +
Sbjct: 266 KASIKEAGALSLMAAYNSLDGIPCSSNKWLLTDVLRKEWGFKGYVVSDYFSVLHLMTKHK 325
Query: 311 YAKSPEDAVVDVLKAGMDVNC-GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGL 369
A+S +A L+AG+D+ S + VK KL E I+ A+ + V+ GL
Sbjct: 326 VAESKAEAARLALEAGLDMELPDSDCFEEMINLVKGGKLSEETINEAVRRILGVKFWAGL 385
Query: 370 FNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGP 429
F+ NP + P V H+ LAL+ A++ IVLLKN G+LPL K S+A+IGP
Sbjct: 386 FD-NPFVDP--DYAERVNDCAEHRELALRVARESIVLLKN-EGILPLSKDIG-SIAVIGP 440
Query: 430 NANSAKTLLGNYAGPSCRSITPLQALQNYVEN--TVYY-PGCDTVACSSASIDKAVDIAK 486
NA A LG Y+G + +TPL+ ++N +EN +Y+ GC S + D+A+ IA+
Sbjct: 441 NA--AVPRLGGYSGYGVKIVTPLEGIKNKMENKAKIYFAEGCGLNDTSKSGFDEAIKIAQ 498
Query: 487 GADHVVLMMGLDQTQ-EKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
+D +L +G + E E+ DR +L LPG Q+ELI + PVI+VL+ G IT
Sbjct: 499 KSDVAILFVGNSVPETEGEQRDRHNLNLPGVQEELIKEICN-TNTPVIVVLINGSA--IT 555
Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRP 605
+ + +++ A YPGE G A+A+V+FGD+NPGG+LP+T +P+ ++P+ +P
Sbjct: 556 MMNWIDKVQAVIEAWYPGEEGGNAIADVLFGDYNPGGKLPIT-FPKYSSQLPLY-YNHKP 613
Query: 606 QATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVV 665
SG + +FPFG GLSY+++ Y
Sbjct: 614 ---SGRVDDYVDLRSPQYLFPFGYGLSYTEFRYS-------------------------- 644
Query: 666 HYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSV 725
+ + PE E + +T V+N G+ G V L++ RP+K+L F+ +
Sbjct: 645 NLRITPE---EIPMDGEITITFEVENIGKYKGDEVVQLYLHDEFASVVRPVKELKRFKRI 701
Query: 726 ILNAKEKAEIVFEL 739
L EK + F+L
Sbjct: 702 TLAVGEKKTVSFKL 715
>gi|409198859|ref|ZP_11227522.1| periplasmic beta-glucosidase , xylosidase/arabinosidase
[Marinilabilia salmonicolor JCM 21150]
Length = 777
Score = 280 bits (717), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 215/716 (30%), Positives = 345/716 (48%), Gaps = 107/716 (14%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIP + EA HG +G AT FP I A+++ L +G+ IG
Sbjct: 124 RLGIPLF-LAEEAPHGHMAIG-----------ATVFPTGIGLASTWAPELIEEVGKVIGR 171
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
E R G IG + P +++ RD RW R +ET GEDP++ G + VRG+ GD
Sbjct: 172 EVRL---QGGHIG---YGPVLDLVRDARWSRVEETYGEDPVLAGIMGAAMVRGMAGDG-- 223
Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTM---QDLADTYQPPFESCVKQG 257
L K A KHF AY GT + +T D+ Y PPF+ + G
Sbjct: 224 ---LHDKHGLIATLKHFLAY------GTPEAGQNGGITTIGPLDVMQNYLPPFKEAINAG 274
Query: 258 RASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPED 317
S +M +YN ++GIPS ++ LL+ R++W F G++ SD ++ +Y + A+S ++
Sbjct: 275 ALS-VMTSYNSIDGIPSTSNHYLLTDVLRKKWNFDGFVVSDLASIEGLYGSHYVAQSVQE 333
Query: 318 AVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQ 377
A L AG+DV+ GS +V ++ ES ID A+ + ++ +GLF+ NP +
Sbjct: 334 AANMALSAGVDVDLGSNAFVKLVNSVNAGEVQESLIDTAVTRVLRLKFEMGLFD-NPYVD 392
Query: 378 PFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTL 437
P +V S H LA + AQ+ +VLLKN H +LPL + + ++GPNA++ +
Sbjct: 393 P--TKAEKMVRSAEHIALAREVAQNTLVLLKNDHEILPLKRDPG-KILVVGPNADNQYNM 449
Query: 438 LGNYAGP--SCRSITPLQALQNYV--ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVL 493
LG+Y P S +T L+ + N + E Y GC S+ +IDKAV+ A+ +D ++
Sbjct: 450 LGDYTAPQDSDNVVTILEGVLNKLSPEQVEYVKGCAIRDTSNVNIDKAVEAAEHSDVILA 509
Query: 494 MMG-----------------------LDQTQEKEELDRVDLVLPGRQQELITRVAEAAKK 530
++G + E DR L L G+Q +L+ + + K
Sbjct: 510 VVGGSSARDFKTEYLETGAAVANNQSVSDMDAGEGFDRATLDLLGKQTDLL-KALKKTGK 568
Query: 531 PVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYP 590
P+++V + G P+ + +A N +++ A YPG+ G A+A+V+FGD+NP GRLPM+ P
Sbjct: 569 PLVVVYIQGRPMKMNWAA--ENADALICAWYPGQEGGNAIADVLFGDYNPAGRLPMS-VP 625
Query: 591 QDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYL 650
+D ++P+ K P+ Y ++ FG GLSYS ++Y
Sbjct: 626 RDVGQLPIYYNKKNPRP------HDYVELSATPLYSFGYGLSYSSFTY------------ 667
Query: 651 NQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARR 710
D+ V +G E +F+V VKN G G V L+++
Sbjct: 668 -----------SDL----EVAPVGQE-----EFVVNFKVKNTGNFDGDEVVQLYLRDEYA 707
Query: 711 GNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
+P++QL F+ + L E+ ++ F L+ + L+ + V+E GT L++G
Sbjct: 708 SVVQPVQQLKHFKRLFLKKGEEKQVSFHLNE-KDLAILNQKKEWVVEPGTFQLMIG 762
>gi|334365132|ref|ZP_08514098.1| glycosyl hydrolase family 3 N-terminal domain protein [Alistipes
sp. HGB5]
gi|313158675|gb|EFR58064.1| glycosyl hydrolase family 3 N-terminal domain protein [Alistipes
sp. HGB5]
Length = 771
Score = 280 bits (716), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 216/719 (30%), Positives = 329/719 (45%), Gaps = 99/719 (13%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIP + EA HG +G AT+FP A++++ L R+G+ I
Sbjct: 120 RLGIPLF-LAEEAPHGHMAIG-----------ATTFPTAPGQASTWNPELIERMGKVIAA 167
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
E R G + P ++I RDPRW R +E+ GED +T + +YVRG
Sbjct: 168 EIRL------QGGHICYGPVLDIVRDPRWSRTEESYGEDCYLTARIGEAYVRGT-----G 216
Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
G L A + KHF AY + ++L +TY PPFE+ VK G A
Sbjct: 217 SGDLSQSRHALSTLKHFIAYGASEGGQNGGSNL---LGERELRETYLPPFEAAVKAG-AR 272
Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
+M AYN V+GIP A+R +L+ R +WGF G++ SD ++ +++ G A S +A V
Sbjct: 273 SVMTAYNSVDGIPCTANRRMLTDILRGEWGFDGFVVSDLLSIEGLHETHGVAGSVREAAV 332
Query: 321 DVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFG 380
L+AG+D + + A + + E+EIDRA+ + +++ +GLF NP +
Sbjct: 333 QALRAGVDADLKGGAFASLREAAEAGDVAEAEIDRAVERVLALKFEMGLFE-NPYID--- 388
Query: 381 KIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGN 440
+ A V AH LAL+AA+ + LL+N G LPL + +A+IGPNA++ LG+
Sbjct: 389 EAAAAEVGCAAHSELALEAARQSVTLLENRSGTLPLDPRRLRRVAVIGPNADNIYNQLGD 448
Query: 441 YAGPSCRSITPLQALQNYV--ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLD 498
Y + T L+ + + VY GC + I AV A+G D V+++G
Sbjct: 449 YTAQQTAANTVRDGLEKLLGRDRVVYSRGCTVRGGDRSEIAAAVSAARGTDAAVVVIGGS 508
Query: 499 QTQE-----------------------KEELDRVDLVLPGRQQELITRVAEAAKKPVILV 535
++ E DR L L G Q+EL+ R+ +A P+I+V
Sbjct: 509 SARDFDTEFLQTGAAKAAHDEVRDMECGEGFDRATLALLGEQEELLRRI-KATGTPLIVV 567
Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK 595
+ G P+D+ A + ++L A YPG G A+AE I G +NP GRLP+T P+ +
Sbjct: 568 CIAGRPLDLRRASEQAD--ALLMAWYPGARGGDAVAETILGRNNPAGRLPIT-IPRAEGQ 624
Query: 596 VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSS 655
+P+ K RP Y ++PFG GLSYS + Y L QS
Sbjct: 625 IPVYYNKKRP------ANHDYTDLTAAPLYPFGYGLSYSTFEY-------GSLEARQS-- 669
Query: 656 TKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRP 715
G E V+ ++N + G V L++ RP
Sbjct: 670 ------------------GDNVLE-----VSCRIRNTSDREGDEVVQLYISDMVASTVRP 706
Query: 716 IKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
+QL GF+ + L E+ ++ F L E+L+ G V+E+G + VG I +
Sbjct: 707 PRQLGGFRRIRLAPGEQRQVSFTLGD-EALALIDPQGRRVVEKGDFVIAVGSSSQDIRL 764
>gi|332184951|ref|ZP_08386700.1| glycosyl hydrolase family 3 N terminal domain protein [Sphingomonas
sp. S17]
gi|332014675|gb|EGI56731.1| glycosyl hydrolase family 3 N terminal domain protein [Sphingomonas
sp. S17]
Length = 886
Score = 280 bits (716), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 163/401 (40%), Positives = 226/401 (56%), Gaps = 38/401 (9%)
Query: 56 ARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATS 115
A LV ++T DEK+ QL+N+APAIPRLG+PAY WW+E+LHG G T+
Sbjct: 34 AATLVGKMTTDEKLYQLLNTAPAIPRLGVPAYNWWTESLHGALGPVP----------TTN 83
Query: 116 FPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ--------AIGMTFWAPNINIFRDP 167
FP+ + AA+FD L R QAI E R L+ + G+ W+PNINIFRDP
Sbjct: 84 FPEPVGLAATFDDALVGRAAQAISREVRGLHALARQTGRLGRIGTGLDSWSPNINIFRDP 143
Query: 168 RWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKG 227
RWGRGQET GEDP +T + V++VRG+QG N ++ A KH+ + +
Sbjct: 144 RWGRGQETYGEDPFLTARMGVAFVRGMQGTDPN------RIDVVATPKHYAVH---SGPE 194
Query: 228 TTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARR 287
+TR++ + V+ DL DTY P F + + +G A IMCAYNR++G P+CA LL R
Sbjct: 195 STRHEANVYVSQHDLVDTYLPAFRAAIVEGGAGSIMCAYNRIDGQPACASDLLLKDYLRG 254
Query: 288 QWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT-------K 340
+WGF GY+ SDCDAV I YA P AV ++AG+D C + T +
Sbjct: 255 RWGFRGYVVSDCDAVKDIDANHHYAPDPATAVAAAMRAGVDNECNNATLTDTDGLTNPYR 314
Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
A+ + + ++IDR+L LF+ R R G G + AD + +PAHQ LAL+ A
Sbjct: 315 DALARGLISVADIDRSLTRLFAARYRTGDLPGVRPLSMQSSSPAD-IGTPAHQALALETA 373
Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNY 441
+ +VLLKN G+LPL +A+IGP ++ + L GNY
Sbjct: 374 EKSLVLLKN-KGILPL--KAQARIAVIGPLGDATRVLRGNY 411
Score = 117 bits (292), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 95/292 (32%), Positives = 139/292 (47%), Gaps = 52/292 (17%)
Query: 485 AKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVIL 534
AK +D +V ++GL E EE D+ L LP QQ L+ + A+A KP+I+
Sbjct: 612 AKDSDALVAVVGLTSDLEAEEAPISVPGFKGGDKTTLDLPADQQALLEQ-AKATGKPLIV 670
Query: 535 VLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI 594
VL+ G PV++ +AK N +I+ A YPG+AG +A+ V+ G NP GRLP+T+Y
Sbjct: 671 VLMNGSPVNLAWAKA--NADAIVEAWYPGQAGGLAVGNVLSGKTNPAGRLPLTFYRSVED 728
Query: 595 KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSS 654
D M+ GRTYR++ G V+PFG GLSY+ ++Y + KL
Sbjct: 729 LPAFGDYAMK--------GRTYRYFTGTPVYPFGYGLSYTSFAY-----APVKL------ 769
Query: 655 STKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGR 714
+ + QD VT V+N G AG L++ + +G
Sbjct: 770 TPAVTGVQD------------------GLTVTTEVRNTGRRAGDEVAQLYLNFPKV-DGA 810
Query: 715 PIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
P L GFQ V L E+ + F L P LS DG ++ +G + + VG
Sbjct: 811 PRVALRGFQRVHLAPGERRMLRFTLDP-RDLSAVDADGHRLVMKGDYRVSVG 861
>gi|298386950|ref|ZP_06996504.1| beta-glucosidase [Bacteroides sp. 1_1_14]
gi|298260100|gb|EFI02970.1| beta-glucosidase [Bacteroides sp. 1_1_14]
Length = 846
Score = 280 bits (715), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 161/424 (37%), Positives = 242/424 (57%), Gaps = 34/424 (8%)
Query: 51 PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
PI +R +DL+S+LT++EKIS L ++P I R+GI Y +EALHG+ GK
Sbjct: 29 PIHERIQDLLSKLTIEEKISLLRATSPGIERMGIDKYYMGNEALHGIIRPGK-------- 80
Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ---------AIGMTFWAPNI 161
T FPQ I A+ ++ L + I I EARA +N + + +TFW+P +
Sbjct: 81 --FTVFPQAIGLASMWNPELHHIIASVISDEARARWNELERGKKQKDQFSDLLTFWSPTV 138
Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
N+ RDPRWGR ET GEDP ++G ++V+G+QGD L++ + KHF A
Sbjct: 139 NMARDPRWGRTPETYGEDPYLSGVLGTAFVKGLQGDH------PRYLKSVSTPKHFAA-- 190
Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
N + R+ DA +T D+ + Y P FE C+++G+A IM AYN +NG+P A+ LL
Sbjct: 191 --NNEEHNRFYCDAAITETDMREYYLPAFEKCIREGKAESIMTAYNAINGVPCTANNWLL 248
Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFL-QKHTK 340
+K ++ WGF+GYI SDC A ++ Y K+PE A + +KAG+D+ CG ++
Sbjct: 249 NKVLKQDWGFNGYIVSDCGAPGLLMTDHRYVKTPEAAAMIAIKAGLDLECGDYVFGAPLL 308
Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
A KQ + +EID A +++ RMRLG+F+ +P P+ + ++V H+ LAL+AA
Sbjct: 309 NAYKQYMVSTAEIDSAAYHVLRARMRLGMFD-DPEKNPYNHLSPEIVGCEKHKELALEAA 367
Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQALQNYV 459
+ IVLLKN LPL K S+A++G NA A G+Y+G P ++ L ++N V
Sbjct: 368 RQSIVLLKNQKNTLPLNAKKIKSIAVVGINA--ANCEFGDYSGTPVNAPVSVLDGIRNRV 425
Query: 460 ENTV 463
N +
Sbjct: 426 GNEI 429
Score = 121 bits (303), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 83/286 (29%), Positives = 135/286 (47%), Gaps = 43/286 (15%)
Query: 481 AVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGG 540
A + + +D V+ +MG++Q+ E+E DR + LP + Q++ R A A I+VL+ G
Sbjct: 590 ASKVIRESDVVIAVMGINQSIEREGQDRSSIELP-KDQQIFIREAYKANPNTIVVLVAGS 648
Query: 541 PVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTD 600
+ + + D+NI +I+ A YPGE G A+AEV+FGD+NP GRLP+T+Y + D
Sbjct: 649 SMAVGWM--DQNIPAIIDAWYPGEQGGTAIAEVLFGDYNPAGRLPLTFYNS------IED 700
Query: 601 MKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVE 660
+ N RTY ++EGK ++ FG GLSY+K+ Y+ + Q
Sbjct: 701 LPAFNDYNVKN-NRTYMYFEGKPLYAFGYGLSYTKFDYRNLNIKQ--------------- 744
Query: 661 NQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLV 720
+++ + VKN G+ G ++V+ G P+KQL
Sbjct: 745 ------------------DSQNITLNFSVKNSGKYNGDEVAQVYVQFPDLGIKTPLKQLK 786
Query: 721 GFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
GF+ V + +I E+ E + GT+ +VG
Sbjct: 787 GFKRVHIKKGATEQISIEIPKEELRLWDDQKKQFYTPSGTYNFMVG 832
>gi|146301613|ref|YP_001196204.1| beta-glucosidase [Flavobacterium johnsoniae UW101]
gi|146156031|gb|ABQ06885.1| Candidate beta-xylosidase; Glycoside hydrolase family 3
[Flavobacterium johnsoniae UW101]
Length = 864
Score = 280 bits (715), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 163/448 (36%), Positives = 247/448 (55%), Gaps = 33/448 (7%)
Query: 37 NPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHG 96
N + + +P+ +L +RA DL+SRLTL+EK + + + + AIPRLGI + WWSEALHG
Sbjct: 16 NLNAQQYPYKDPSLSSKKRAEDLISRLTLEEKAALMCDQSDAIPRLGIKKFNWWSEALHG 75
Query: 97 VAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------- 148
A N + T FP+ I AASFD L +R+ A+ E RA YN
Sbjct: 76 YAN-------NDNV---TVFPEPIGMAASFDDQLVFRVFDAVSDEGRAKYNQWIQNGNEN 125
Query: 149 GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKL 208
+ + ++ W PN+NIFRDPRWGRGQET GEDP +T + VS V+G+QG K + L
Sbjct: 126 KRFLSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVSVVKGLQGPA--DAKYRKLL 183
Query: 209 QASACCKHFTAYDLDNWKGTTRYKFD-ARVTMQDLADTYQPPFESCVKQGRASGIMCAYN 267
AC KHF + W +R++ + V ++L +TY P F++ V++ +MCAY
Sbjct: 184 ---ACAKHFAVHSGPEW---SRHELNLNNVKPRELYETYLPAFKALVQEADVRQVMCAYQ 237
Query: 268 RVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGM 327
R++ P C++ LL + R +WGF + SDC AV+ Y + A + AG
Sbjct: 238 RLDDEPCCSNTRLLQRILRDEWGFQYLVVSDCGAVTDFYTTHKVSSDEVHAASKAVLAGT 297
Query: 328 DVNC--GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD 385
DV C + K AV++ + E EI+++L + R LG + + ++ P+ +I A
Sbjct: 298 DVECVWDKYPFKKLPEAVEKDLIKEEEINKSLLRVLIGRFDLGEMDDD-SIVPWAQIPAS 356
Query: 386 VVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPS 445
V+ S HQ LAL+ AQ + LL+N + +LPL K+ + +A+IGPNA++ L GNY G
Sbjct: 357 VLNSKEHQQLALEMAQKSMTLLQNKNNILPLNKNVN-KVAVIGPNADNEPMLWGNYNGTP 415
Query: 446 CRSITPLQALQNYV--ENTVYYPGCDTV 471
++IT + ++ + +Y CD V
Sbjct: 416 NKTITIKKGIEGKLAANKVLYDKSCDLV 443
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 77/264 (29%), Positives = 120/264 (45%), Gaps = 52/264 (19%)
Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
KG D V+ GL E EE+ DR ++ LP Q++ + + A KK ++
Sbjct: 600 KGIDTVIFAGGLSTLLEGEEMPVSFPGFKGGDRTNIELPAVQRKCLKELKAAGKK--VIF 657
Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK 595
+ C G I + +IL A YPGE+G A+A+V+FGD+NP G+LP+++Y
Sbjct: 658 VNCSGSA-IALTPETESCDAILQAWYPGESGGQAVADVLFGDYNPAGKLPISFYKNSDKL 716
Query: 596 VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSS 655
D ++ GRTYR Y +FPFG GLSYSK+ + +S+NK+ N+ +
Sbjct: 717 GDFEDYSLK--------GRTYR-YTTDVLFPFGFGLSYSKFDIQAGNLSKNKIKPNEDTQ 767
Query: 656 TKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRP 715
K +KN + +G V ++V+ G P
Sbjct: 768 LKF-----------------------------SIKNSSKRSGTEIVQVYVRKLNDTEG-P 797
Query: 716 IKQLVGFQSVILNAKEKAEIVFEL 739
+K L F+ + L A EK + +L
Sbjct: 798 LKTLKAFKRIELKAGEKQNVTIDL 821
>gi|380692929|ref|ZP_09857788.1| glycoside hydrolase family protein [Bacteroides faecis MAJ27]
Length = 777
Score = 280 bits (715), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 237/790 (30%), Positives = 372/790 (47%), Gaps = 136/790 (17%)
Query: 49 TLPISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WSEALHGV----A 98
+ PI +R +DL+S++ +DEK Q+ + + +P +W W + + +
Sbjct: 36 SAPIEERVKDLLSQMNMDEKTCQMATLYGSGRVLADALPTEKWKSEIWKDGIGNIDEEHN 95
Query: 99 GVGK-----------------------------GI---FFNGTIRG-----ATSFPQVIL 121
G+GK GI F N IRG AT FP
Sbjct: 96 GLGKFGSEYAFPYAKHVKAIHDIQRWFVEETRLGIPVDFTNEGIRGVCHEKATFFPAQCG 155
Query: 122 TAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGEDP 180
++++ L RIG+ EA AL G T ++P ++I +DPRWGR E GEDP
Sbjct: 156 QGSTWNKELIARIGEVEAKEAVAL-------GYTNIYSPILDIAQDPRWGRAVECYGEDP 208
Query: 181 LVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL--DNWKGTTRYKFDARVT 238
+ G+ ++ +Q K KL A+ KHF Y + G TR D V
Sbjct: 209 YLVGQLGKQMIQSLQ---------KHKLVATP--KHFAVYSIPVGGRDGGTRT--DPHVA 255
Query: 239 MQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSD 298
+++ Y PF ++ A G+M +YN +G P L++ R++WGF GY+ SD
Sbjct: 256 PREMRTLYLEPFRVAFQEAGALGVMSSYNDYDGEPITGSYRFLTQILRQEWGFKGYVVSD 315
Query: 299 CDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG----SFLQKHTKAAVKQKKLPESEID 354
DAV I A + E+AVV + AG++V + K ++A+ + K+ ++ ID
Sbjct: 316 SDAVEFISSKHKVADNNEEAVVQSVNAGLNVRTNFSSPAGFIKPLRSAIAKGKVSQATID 375
Query: 355 RALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLL 414
+ + + V+ LGLF+ NP + GK+ +V HQ +AL+AA+ IVLLKN LL
Sbjct: 376 QRVSEILYVKFWLGLFD-NP-YRGDGKLADKIVHCKEHQAVALEAARQSIVLLKNQDNLL 433
Query: 415 PLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI-TPLQALQNYV--ENTVYYPGC--- 468
PL K+ S+A+IGPNA+ K L+ Y GPS I T + ++ + VY GC
Sbjct: 434 PLQKTLK-SVAVIGPNADEQKELICRY-GPSNAPIKTVYKGIKEALPGAKVVYKKGCEIV 491
Query: 469 ------------DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGR 516
D +D+A++ AK A+ V++++G + +EE R L LPGR
Sbjct: 492 DPHFPESEVLPFDITPKEQQIMDEAIEAAKSAEVVIMVLGGSEVTVREERSRTSLDLPGR 551
Query: 517 QQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFG 576
Q+EL+ V + KP ILV++ G I +AK + + +IL A +PGE A+AE IFG
Sbjct: 552 QEELLKAVCKLG-KPTILVMIDGRASSINYAK--KYVPAILHAWFPGEFCGQAVAETIFG 608
Query: 577 DHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKY 636
D+NPGG+L +T +P+ ++P +P + SG +FPFG GLSY+ +
Sbjct: 609 DNNPGGKLAVT-FPKSVGQIPFA-FPFKPGSDSGCGTSVT-----GALFPFGHGLSYTTF 661
Query: 637 SYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMA 696
Y N L ++ PE E + V+ VKN G+
Sbjct: 662 EY-------NNLKIS-------------------PEQQGVLGEVK---VSCTVKNTGKRP 692
Query: 697 GKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVI 756
G V L+++ +K L GF+ + L E+ ++ F LSP + L+ ++ +
Sbjct: 693 GDEVVQLYLRDEISSVTTYVKILRGFERITLQPNEEKKVTFTLSP-QDLAIWDKNMKFQV 751
Query: 757 EEGTHFLVVG 766
E GT +++G
Sbjct: 752 EPGTFKVMIG 761
>gi|383115540|ref|ZP_09936296.1| hypothetical protein BSGG_2590 [Bacteroides sp. D2]
gi|313695055|gb|EFS31890.1| hypothetical protein BSGG_2590 [Bacteroides sp. D2]
Length = 770
Score = 280 bits (715), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 218/680 (32%), Positives = 337/680 (49%), Gaps = 77/680 (11%)
Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
I NG RG+T +P I AA+FD L R+ + +E RA + TF PN+
Sbjct: 136 AIHGNGLYRGSTIYPTPIGQAATFDPALVERMSRETAIEMRA-----SGMHWTF-TPNVE 189
Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFT--AY 220
+ RD RWGR ET GEDP + G+ + VRG Q F G + AC KH +
Sbjct: 190 VARDARWGRVGETFGEDPYLVGQMGAATVRGFQTKDFTGND-----KVIACAKHLVGGSQ 244
Query: 221 DLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNL 280
+ G A ++ + L + + PPF+ C++ G + +M A+N +NGIP ++ L
Sbjct: 245 PANGINGAP-----AELSERTLQEVFFPPFKDCLEAGVFT-VMTAHNELNGIPCHGNKYL 298
Query: 281 LSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC-GSFLQKHT 339
+++ R QW F G++ SD + ++D A++ +DA + AGM ++ G +
Sbjct: 299 MTEVLRNQWKFDGFVVSDWMDIERMHDYHNVAETLKDAYRISVDAGMGMHMHGPEFYEAI 358
Query: 340 KAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQA 399
VK+ +PE +ID A+ + V+ RLGLF NP + K ++V + HQ AL+
Sbjct: 359 IECVKEGSIPEKQIDAAVSKILEVKFRLGLFE-NPFIDL--KKKDEIVFNEKHQQTALEG 415
Query: 400 AQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA--GPSCRSITPLQALQN 457
A+ IVLLKN +LPL SK + + G NAN+ +++LG++A P T L+ L+
Sbjct: 416 ARKSIVLLKNEGNMLPLDASKYKKVFVTGHNANN-QSILGDWAMEQPEEHVTTVLKGLKA 474
Query: 458 YVENTVYY---PGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQ----EK---EELD 507
T Y G + S I +AV A+ +D +L++G + + EK E D
Sbjct: 475 ISPETNYNFLDLGWNVRLLSDNQIKEAVQQARNSDLAILVVGENSMRYHWNEKTCGENSD 534
Query: 508 RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGA 567
R +L LPGRQQEL+ VA A P +++L+ G P +T D N+ I+ A PG AG
Sbjct: 535 RYELSLPGRQQELVKAVA-ATGVPTVVILVNGRP--LTTEWIDENMPCIIEAWEPGVAGG 591
Query: 568 VALAEVIFGDHNPGGRLPMTWYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFYEGKEVFP 626
ALAE+++G NP G+LP+T P+ ++ M + K G + YE
Sbjct: 592 QALAEILYGKVNPSGKLPIT-IPRSTGQIQCMYNHKFTNHWFPYATGNSLPLYE------ 644
Query: 627 FGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVT 686
FG GLSY+ Y Y+ L L++++ T P+ + VT
Sbjct: 645 FGYGLSYTTYKYE-------NLKLSEATIT--------------PD--------KSVKVT 675
Query: 687 IGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLS 746
+ V N G+M G+ V L+++ RP+K+L F + L A E E+ F L+P E LS
Sbjct: 676 VDVTNTGKMDGEETVQLYIRDEYSSATRPVKELKDFARIPLKAGETKEVSFTLTP-EMLS 734
Query: 747 RAREDGLMVIEEGTHFLVVG 766
+ +E+GT ++VG
Sbjct: 735 YYDANMHYGVEKGTFKIMVG 754
>gi|255693560|ref|ZP_05417235.1| periplasmic beta-glucosidase [Bacteroides finegoldii DSM 17565]
gi|260620625|gb|EEX43496.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
finegoldii DSM 17565]
Length = 770
Score = 279 bits (714), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 218/680 (32%), Positives = 337/680 (49%), Gaps = 77/680 (11%)
Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
I NG RG+T +P I AA+FD L R+ + +E RA + TF PN+
Sbjct: 136 AIHGNGLYRGSTIYPTPIGQAATFDPALVERMSRETAIEMRA-----SGMHWTF-TPNVE 189
Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFT--AY 220
+ RD RWGR ET GEDP + G+ + VRG Q F G + AC KH +
Sbjct: 190 VARDARWGRVGETFGEDPYLVGQMGAATVRGFQTKDFTGND-----KVIACAKHLVGGSQ 244
Query: 221 DLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNL 280
+ G A ++ + L + + PPF+ C++ G + +M A+N +NGIP ++ L
Sbjct: 245 PANGINGAP-----AELSERTLQEVFFPPFKDCLEAGVFT-VMTAHNELNGIPCHGNKYL 298
Query: 281 LSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC-GSFLQKHT 339
+++ R QW F G++ SD + ++D A++ +DA + AGM ++ G +
Sbjct: 299 MTEVLRNQWKFDGFVVSDWMDIERMHDYHNVAETLKDAYQISVDAGMGMHMHGPEFYEAI 358
Query: 340 KAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQA 399
VK+ +PE +ID A+ + V+ RLGLF NP + K ++V + HQ AL+
Sbjct: 359 IECVKEGSIPEKQIDAAVSKILEVKFRLGLFE-NPFIDL--KKKDEIVFNEKHQQTALEG 415
Query: 400 AQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA--GPSCRSITPLQALQN 457
A+ IVLLKN +LPL SK + + G NAN+ +++LG++A P T L+ L+
Sbjct: 416 ARKSIVLLKNEGNMLPLDASKYKKVFVTGHNANN-QSILGDWAMEQPEEHVTTVLKGLKA 474
Query: 458 YVENTVYY---PGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQ----EK---EELD 507
T Y G + S I +AV A+ +D +L++G + + EK E D
Sbjct: 475 ISPETNYNFLDLGWNVRLLSDNQIKEAVQQARSSDLAILVVGENSMRYHWNEKTCGENSD 534
Query: 508 RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGA 567
R +L LPGRQQEL+ VA A P +++L+ G P +T D N+ I+ A PG AG
Sbjct: 535 RYELSLPGRQQELVEAVA-ATGVPTVVILVNGRP--LTTEWIDENMPCIIEAWEPGVAGG 591
Query: 568 VALAEVIFGDHNPGGRLPMTWYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFYEGKEVFP 626
ALAE+++G NP G+LP+T P+ ++ M + K G + YE
Sbjct: 592 QALAEILYGKVNPSGKLPIT-IPRSTGQIQCMYNHKFTNHWFPYATGNSLPLYE------ 644
Query: 627 FGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVT 686
FG GLSY+ Y Y+ L L++++ T P+ + VT
Sbjct: 645 FGYGLSYTTYKYE-------NLKLSEATIT--------------PD--------KSVKVT 675
Query: 687 IGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLS 746
+ V N G+M G+ V L+++ RP+K+L F + L A E E+ F L+P E LS
Sbjct: 676 VDVTNTGKMDGEETVQLYIRDEYSSATRPVKELKDFARIPLKAGETKEVSFTLTP-EMLS 734
Query: 747 RAREDGLMVIEEGTHFLVVG 766
+ +E+GT ++VG
Sbjct: 735 YYDANMHYGVEKGTFKIMVG 754
>gi|294675359|ref|YP_003575975.1| 1,4-beta-xylosidase [Prevotella ruminicola 23]
gi|225016052|gb|ACN78955.1| xylosidase/arabinofuranosidase [Prevotella ruminicola]
gi|294472720|gb|ADE82109.1| putative 1,4-beta-xylosidase [Prevotella ruminicola 23]
Length = 861
Score = 279 bits (714), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 166/444 (37%), Positives = 241/444 (54%), Gaps = 35/444 (7%)
Query: 39 STETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVA 98
S +T P+ L +RA DL SRLTL+EK +++ +PAIPRLGI + WWSEALHG A
Sbjct: 18 SAQTLPYQNPNLSAKERAVDLCSRLTLEEKAMLMLDESPAIPRLGIKKFFWWSEALHGAA 77
Query: 99 GVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA------GQAI 152
+G T+FP+ + AASF+ +L +++ E RA YN G+ +
Sbjct: 78 NMG----------NVTNFPEPVGMAASFNPHLLFKVFDIASTEFRAQYNHRMYDLNGEDM 127
Query: 153 GM---TFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQ 209
M + W PN+NIFRDPRWGRGQET GEDP +T V V+G+QG + + +
Sbjct: 128 KMRSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSVMGVQVVKGLQGP--EDARYR---K 182
Query: 210 ASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRV 269
AC KH+ + + T D V+ +D +TY P F++ VK + +MCAY R+
Sbjct: 183 LWACAKHYAVHSGPEYTRHTANLTD--VSARDFWETYMPAFKTLVKDAKVREVMCAYQRL 240
Query: 270 NGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLK---AG 326
+ P C LL + R +WGF + SDC AVS Y+ KS DAV K AG
Sbjct: 241 DDDPCCGSTRLLQQILRDEWGFEYLVVSDCGAVSDFYENH---KSSSDAVHGTSKAVLAG 297
Query: 327 MDVNCG-SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD 385
DV CG ++ K AV++ L E E+D+ + L R LG + +P++ + KI
Sbjct: 298 TDVECGFNYAYKSLPEAVRKGLLSEKEVDKHVIRLLEGRFDLGEMD-DPSLVEWSKIPYS 356
Query: 386 VVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPS 445
+ + A +AL A+ IVLL+N + +LPL K+ + +A+IGPNA++ + GNY G
Sbjct: 357 AMSTKASANVALDMARQTIVLLQNKNNILPLKKN-AEKIAIIGPNAHNEPMMWGNYNGTP 415
Query: 446 CRSITPLQALQNYVENTVYYPGCD 469
++T L ++ + VY PGCD
Sbjct: 416 NHTVTILDGVKAKQKKLVYIPGCD 439
Score = 92.0 bits (227), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 84/284 (29%), Positives = 128/284 (45%), Gaps = 63/284 (22%)
Query: 477 SIDKAVDIA--KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRV 524
+ID IA KG + V+ G+ + E EE+ DR + LP Q+E + +
Sbjct: 589 NIDYQETIAQLKGINKVIFCGGIAPSLEGEEMPVNIEGFKGGDRTSIELPKVQREFL-KA 647
Query: 525 AEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRL 584
+AA K VI V C G I + +I+ A YPG+ G A+A+V+FGD+NPGG+L
Sbjct: 648 LKAAGKQVIYVN-CSGSA-IALQPETESCDAIVQAWYPGQEGGTAVADVLFGDYNPGGKL 705
Query: 585 PMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSY---KFK 641
+T+Y D D M+ GRTYR+++ +FPFG GLSY+ + K +
Sbjct: 706 SVTFYKNDQQLPDYEDYSMK--------GRTYRYFD-DALFPFGYGLSYTTFEVGEAKVE 756
Query: 642 AVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPV 701
A + LY V I V N G G +
Sbjct: 757 AATDGALY----------------------------------NVQIPVTNTGTKNGSETI 782
Query: 702 LLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESL 745
L+++ + +G P+K L GF+ + + A + A +L+ ESL
Sbjct: 783 QLYIRNLQDPDG-PLKSLRGFERLDIKAGKTATANLKLTK-ESL 824
>gi|116621797|ref|YP_823953.1| glycoside hydrolase family protein [Candidatus Solibacter usitatus
Ellin6076]
gi|116224959|gb|ABJ83668.1| glycoside hydrolase, family 3 domain protein [Candidatus Solibacter
usitatus Ellin6076]
Length = 765
Score = 279 bits (713), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 223/718 (31%), Positives = 349/718 (48%), Gaps = 104/718 (14%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIP + E LHG A +G TSFPQ I A+FD L +
Sbjct: 104 RLGIPVI-FHEECLHGHAAIG-----------GTSFPQPIGLGATFDPELVESLFAMTAA 151
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
EARA QA+ P +++ R+PRWGR +ET GEDP + + ++ VRG QGD
Sbjct: 152 EARA-RGTHQAL-----TPVVDVAREPRWGRVEETYGEDPFLVSRMGIAAVRGFQGD--- 202
Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
+ K + A KHF A+ GT + V+M+ L +T+ PF+ + +G A
Sbjct: 203 -ATFRDKTRVIATLKHFAAHGQPE-SGTNCAPVN--VSMRVLRETFLFPFKEALDKGCAI 258
Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAV-SIIYDAEGY----AKSP 315
+M +YN ++G+PS A R LL R++WGF G++ SD A+ + Y E + AK
Sbjct: 259 SVMASYNEIDGVPSHASRWLLRDVLRKEWGFKGFVVSDYYAIYELSYRPESHGHFVAKDK 318
Query: 316 EDAVVDVLKAGMDVNC---GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNG 372
+A ++AG+++ +L H V + L ES++D + + + ++GLF+
Sbjct: 319 REACALAVQAGVNIELPEPDCYL--HLVDLVHKGVLQESQLDELVEPMLRWKFQMGLFD- 375
Query: 373 NPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNAN 432
+P + P + AH+ LA+QAA++ I LLKN ++PL S ++A+IGPNAN
Sbjct: 376 DPYVDP--AEAERIAGCDAHRELAMQAARETITLLKNDGPVVPLDLSAIKTIAVIGPNAN 433
Query: 433 SAKTLLGNYAGPSCRSITPLQALQNYV---ENTVYYPGC----------DTVACSSASID 479
++LLG Y+G +T L ++ V VY GC D V S + D
Sbjct: 434 --RSLLGGYSGVPKHDVTVLDGIRERVGSRAKVVYAEGCKITIGGSWVQDEVTPSDPAED 491
Query: 480 K-----AVDIAKGADHVVLMMGLDQTQEKEEL------DRVDLVLPGRQQELITRVAEAA 528
+ AV +AK AD +VL +G ++ +E DR L L GRQ+EL+ R A
Sbjct: 492 RRQIAEAVKVAKRADVIVLAIGGNEQTSREAWSPKHLGDRPSLDLVGRQEELV-RAMVAT 550
Query: 529 KKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTW 588
KPVI L G P+ I + +++ +I Y G+ A+AEV+FGD NPGG+LP+T
Sbjct: 551 GKPVIAFLFNGRPISINYLA--QSVPAIFECWYLGQETGRAVAEVLFGDTNPGGKLPIT- 607
Query: 589 YPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKL 648
P+ +P +P A G Y F E ++ FG GLSY+ ++++ +++ K+
Sbjct: 608 IPRSAGHLPAF-YNHKPSARRG-----YLFDEVGPLYAFGYGLSYTTFAFQNLRLAKKKM 661
Query: 649 YLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPA 708
+ ++ V + V N G G+ V L+++
Sbjct: 662 HRESTAR-----------------------------VLVDVTNTGAREGREVVQLYIRDL 692
Query: 709 RRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
RPIK+L GF+ + L + + FE++P + L+ D V+E G ++VG
Sbjct: 693 VSSVTRPIKELKGFRKITLQPGQTQTVEFEITP-DLLAFYNVDMKFVVEPGDFEIMVG 749
>gi|389794138|ref|ZP_10197297.1| beta-glucosidase [Rhodanobacter fulvus Jip2]
gi|388432924|gb|EIL89908.1| beta-glucosidase [Rhodanobacter fulvus Jip2]
Length = 869
Score = 279 bits (713), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 169/417 (40%), Positives = 234/417 (56%), Gaps = 40/417 (9%)
Query: 59 LVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQ 118
LV+R+TL EK+ QL+N APAIPRLGIP Y WW+E+LHG G T+FP+
Sbjct: 25 LVARMTLAEKLPQLLNVAPAIPRLGIPRYNWWTESLHGALGTLP----------TTNFPE 74
Query: 119 VILTAASFDSYLWYRIGQAIGLEARALYNAGQ--------AIGMTFWAPNINIFRDPRWG 170
I ASFD+ L R+ AI E R L+ + G+ W+PN+NIFRDPRWG
Sbjct: 75 PIGLGASFDAPLVKRVAAAISTEVRGLHTLARQTGRLGRIGTGLDTWSPNLNIFRDPRWG 134
Query: 171 RGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTR 230
RGQET GEDP +T + V++V G+QG + + + A KHF + N +TR
Sbjct: 135 RGQETYGEDPFLTARMGVAFVEGMQGPNAD------RPEVIATPKHFAVH---NGPESTR 185
Query: 231 YKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWG 290
++ + V+ DL DTY P F + + + +A IMCAYNR++G P+CA+ LL R WG
Sbjct: 186 HEANVFVSRHDLVDTYLPAFRAAITEAKAGSIMCAYNRIDGQPACANDLLLKHYLRGAWG 245
Query: 291 FHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-------SFLQKHTKAAV 343
F GY+ SDCDAV I D YA AV ++AG+D C + L+ A+
Sbjct: 246 FTGYVVSDCDAVKDISDHHHYAPDAATAVAAAVRAGVDNECNGATLSDTAGLEDRYSEAL 305
Query: 344 KQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDG 403
+++ + ++IDRAL LF+ R R+G G P P + + S A+ LAL AAQ
Sbjct: 306 QRELISVADIDRALVRLFAARYRVGDLAGLPGTAP--AVPPSAIGSAANSQLALDAAQKT 363
Query: 404 IVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGP-SCRSITPLQALQNYV 459
+VLLKN G+LPL V +A+IGP ++ + L GNY+ P S I+ L LQ +
Sbjct: 364 LVLLKND-GVLPL--KAGVKVAVIGPLGDATRVLRGNYSSPLSGTPISVLDGLQQVL 417
Score = 120 bits (301), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 97/302 (32%), Positives = 146/302 (48%), Gaps = 57/302 (18%)
Query: 476 ASIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVA 525
A +D+A A AD +V ++GL E EE D+ L LP Q ++ + A
Sbjct: 589 ADLDRA---AAQADVLVAVVGLTSDLEAEETSVTVPGFAGGDKTTLDLPADQIAMLKQ-A 644
Query: 526 EAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLP 585
A KP+I+V + G P+D+ ++K +N +IL A YPG++G +A+A V+ G +PGGRLP
Sbjct: 645 RATGKPLIVVAMNGSPIDLAWSK--QNAAAILEAWYPGQSGGLAIANVLSGKTDPGGRLP 702
Query: 586 MTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQ 645
+T+Y P D +M GRTYR+++G+ V+PFG GLSY++++Y
Sbjct: 703 LTFYRSVADLPPFGDYRME--------GRTYRYFKGEPVYPFGFGLSYTEFTYG------ 748
Query: 646 NKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFV 705
LN + + N +H +V V N G+ G L++
Sbjct: 749 ---PLNVTPAAGGAANG--LHVSTV------------------VTNAGQREGGDVAQLYL 785
Query: 706 K-PARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLV 764
+ PA G P L GFQ V L E + F+LSP LS DG + G + +
Sbjct: 786 RFPAI--PGAPQVALRGFQRVALKPGETRTVSFDLSP-RDLSAVTADGQREVMAGDYTVS 842
Query: 765 VG 766
VG
Sbjct: 843 VG 844
>gi|261405721|ref|YP_003241962.1| glycoside hydrolase family protein [Paenibacillus sp. Y412MC10]
gi|261282184|gb|ACX64155.1| glycoside hydrolase family 3 domain protein [Paenibacillus sp.
Y412MC10]
Length = 765
Score = 279 bits (713), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 217/733 (29%), Positives = 347/733 (47%), Gaps = 105/733 (14%)
Query: 67 EKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASF 126
E ++ + A RLGIP E HG +G T FP + +++
Sbjct: 89 EAVNHIQRYAVEQSRLGIPIL-IGEECSHGHMAIG-----------GTVFPVPLSIGSTW 136
Query: 127 DSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKY 186
+ L+ + +A+ LE R+ G ++P +++ RDPRWGR +E GEDP + +Y
Sbjct: 137 NVDLYRDMCRAVALETRS------QGGAVTYSPVLDVVRDPRWGRTEECFGEDPYLISEY 190
Query: 187 AVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTY 246
AV+ V G+QG++ L +A KHF Y R + ++L +
Sbjct: 191 AVASVEGLQGES-----LDSPSSVAATLKHFVGYGSSE---GGRNAGPVHMGTRELMEVD 242
Query: 247 QPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIY 306
PF+ V+ G AS IM AYN ++G+P + LL R++WGF G + +DC A+ ++
Sbjct: 243 MLPFKKAVEAGAAS-IMPAYNEIDGVPCTVNTELLDGILRKEWGFDGMVITDCGAIDMLA 301
Query: 307 DAEGYAKSPEDAVVDVLKAGMDVN-CGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRM 365
A+ DA V ++AG+D+ G KH + AV+ KL S +D A+ + +++
Sbjct: 302 SGHDTAEDGMDAAVQAIRAGIDMEMSGEMFGKHLQKAVESNKLEVSVLDEAVRRVLTLKF 361
Query: 366 RLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLA 425
+LGLF NP + P + +V+ S H LA Q A +GIVLLKN LPL K V +A
Sbjct: 362 KLGLFE-NPYVDP--QTAENVIGSEQHVGLARQLAAEGIVLLKNEAKALPLSKEGGV-IA 417
Query: 426 LIGPNANSAKTLLGNYAGPS-CRSITPL-----QALQNYVENTVYYPGCDTVACSSASID 479
+IGPNA+ LG+Y P ++T + L + +Y PGC S +
Sbjct: 418 VIGPNADQGYNQLGDYTSPQPPAAVTTVLGGIRAKLGEEAQRVLYAPGCRIKDDSREGFE 477
Query: 480 KAVDIAKGADHVVLMMG-------------------------LDQTQEKEELDRVDLVLP 514
A+ A+ AD VV+++G L E +DR+ L L
Sbjct: 478 FALTCAEQADTVVMVLGGSSARDFGEGTIDLRTGASKVTDDALSDMDCGEGIDRMTLQLS 537
Query: 515 GRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVI 574
G Q EL+ + + K+ +I+V + G P+ + D + +IL A YPG+ G A+A+++
Sbjct: 538 GVQLELVQEIHKLGKR-MIVVYINGRPIAEPW--IDEHADAILEAWYPGQEGGHAVADIL 594
Query: 575 FGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYS 634
FGD NP G+L M+ P+ ++P+ R + G+ Y + + +PFG GLSY+
Sbjct: 595 FGDVNPSGKLTMS-IPKHVGQLPVYYNGKRSR------GKRYLEEDSQPRYPFGYGLSYT 647
Query: 635 KYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPE-LGTEFCETRKFLVTIGVKNHG 693
++SY S +M PE +GT+ +V++ V N G
Sbjct: 648 EFSY---------------SDIQM-----------TPEVIGTDGTA----VVSVNVTNSG 677
Query: 694 EMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGL 753
+ G V L+V A RP ++L GFQ + L E+ ++ F + P E L +D
Sbjct: 678 DCEGSEVVQLYVSDAASKYTRPARELKGFQKIFLQPGERRKVEFTIGP-EQLQYIGQDYR 736
Query: 754 MVIEEGTHFLVVG 766
V+E G +++G
Sbjct: 737 QVVEPGLFRVMLG 749
>gi|182413194|ref|YP_001818260.1| glycoside hydrolase family 3 [Opitutus terrae PB90-1]
gi|177840408|gb|ACB74660.1| glycoside hydrolase family 3 domain protein [Opitutus terrae
PB90-1]
Length = 859
Score = 278 bits (712), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 229/801 (28%), Positives = 357/801 (44%), Gaps = 114/801 (14%)
Query: 44 PFCKTTLPISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----W------- 90
P+ ++ PI R DL++R++L+EK +QL P + + P W W
Sbjct: 60 PYEDSSRPIDARIEDLLARMSLEEKTAQLTTLYGFPRVLKDERPTSAWREAMWKDGIGNI 119
Query: 91 SEALHGVAGVGKGI----------------------------------FFNGTIRG---- 112
E L+G G + F N IRG
Sbjct: 120 DEHLNGNTGWTNNLADPVHDLPWSLHARALNEVQRWFIEQTRLGIPVDFTNEGIRGLLHS 179
Query: 113 -ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWG 170
ATSFP + A+++D L IG+ G EARAL G T ++P +++ RDPRWG
Sbjct: 180 KATSFPAELAVASTWDPALVREIGRITGREARAL-------GYTNIYSPVLDLARDPRWG 232
Query: 171 RGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTR 230
R ET GEDP + G V VRG+Q + + KHF Y +
Sbjct: 233 RTIETYGEDPFLVGTLGVEQVRGLQAE-----------HVVSTLKHFAVYSIPKGGRDGE 281
Query: 231 YKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWG 290
+ D + T +++ + PF +++ A G+M +YN +G+P LS+ R QWG
Sbjct: 282 ARTDPQATWREVQTIFLEPFRRAIREAGALGVMASYNDYDGVPVEGSALFLSEILRGQWG 341
Query: 291 FHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG----SFLQKHTKAAVKQK 346
F GY+ SD AV I+ A +P DA+ ++AG+++ + + + V+
Sbjct: 342 FRGYVVSDSAAVEFIHSKHRVAPTPADAIRQAVEAGLNIRTNFTPPAAYAEPLRQLVRDG 401
Query: 347 KLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD-VVCSPAHQVLALQAAQDGIV 405
KL + ID + ++ V+ +LGLF+ P AD VV +P H V+A +A ++ IV
Sbjct: 402 KLAMATIDARVRDVLRVKFQLGLFDRPYVADP---AAADRVVRAPEHLVVAQRAGREAIV 458
Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV-- 463
LLKN LLPL ++K + + GP A+ A Y +TPL L+ + V
Sbjct: 459 LLKNEPALLPLDRAKLQRVLVAGPLADDAHAWWSRYGAQRLDFVTPLPGLRAKLGAAVEV 518
Query: 464 -----------YYPGCDTV-----ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELD 507
+P D + A A I+ AV A+ D ++ ++G +E
Sbjct: 519 RYAKGVEAKDAAWPASDVLKDPPSAEVRAGIEAAVAAAQNVDVIIAVLGETDELCRESSS 578
Query: 508 RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGA 567
R+ L LPG QQEL+ + A KP++LVL G P+ + +A R++ +I+ +PGE G
Sbjct: 579 RISLALPGYQQELLEAL-HATGKPLVLVLSNGRPLSVVWAA--RHVPAIVELWFPGEDGG 635
Query: 568 VALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPF 627
ALA V+ GD NP GRLP+T +PQ ++P A G+ R + EG +FPF
Sbjct: 636 AALAAVLLGDANPSGRLPIT-FPQSVGQLPYNF-----PAHPGSQARDFGQVEGS-LFPF 688
Query: 628 GCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETR--KFLV 685
G GLSY+ + Y ++ ++ ++ + + + T + + +F +
Sbjct: 689 GHGLSYTTFRYSDLRITPERIPVDGFGAAGGGDPG----LRGSASRATPYSVSTVPEFTI 744
Query: 686 TIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESL 745
T V N G AG V L+++ L GF V L E + F L L
Sbjct: 745 TCDVTNTGTRAGDEVVQLYLRDDYSSVTTYDIALRGFARVTLAPGETKPVTFTLHRAH-L 803
Query: 746 SRAREDGLMVIEEGTHFLVVG 766
DG V+E G +++G
Sbjct: 804 ELYNRDGDWVVEPGRFTVMLG 824
>gi|383119099|ref|ZP_09939838.1| hypothetical protein BSHG_1822 [Bacteroides sp. 3_2_5]
gi|251946311|gb|EES86688.1| hypothetical protein BSHG_1822 [Bacteroides sp. 3_2_5]
Length = 859
Score = 278 bits (712), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 226/803 (28%), Positives = 362/803 (45%), Gaps = 142/803 (17%)
Query: 42 TFPFCKTTLPISQRARDLVSRLTLDEKISQ------------------------------ 71
F + +LP+ R +DL+SR+TL+EKI+Q
Sbjct: 23 NFKYKNASLPVEVRVQDLLSRMTLEEKIAQMRHIHAYSIMENGKLNEEKLEKMIGGQNYG 82
Query: 72 --------------LVNSAPAI----PRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
L+N PRLGIP + +E+LHG G+
Sbjct: 83 FIEGITLPGKECLTLMNEVQKYMREKPRLGIPVFTL-TESLHG-----------SVHDGS 130
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQ 173
T FPQ I ++F+ L Y + AI E A Q I + P I++ RD RWGR +
Sbjct: 131 TIFPQAIALGSTFNPILAYEMTSAIAKELSA-----QGITQSL-TPVIDVCRDLRWGRVE 184
Query: 174 ETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKF 233
E GEDP + + VS VRG + Q S KHF A+ GT +
Sbjct: 185 ECFGEDPFLVSRMGVSQVRGYLDN-----------QVSPMIKHFGAH------GTPQGGL 227
Query: 234 D-ARVT--MQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWG 290
+ A V+ ++L Y FE+ VK+ + +M +YN N P+ + L+++ R +W
Sbjct: 228 NLASVSCGQRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNEPNSSSHYLMTELLRDRWD 287
Query: 291 FHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPE 350
F GY+ SD A+ ++ A++ +A + L AG+D + V+ L
Sbjct: 288 FQGYVYSDWGAIGMLNYFHKTAQNSAEAAIQALTAGLDAEASDNSYAELQQLVENGMLDV 347
Query: 351 SEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNS 410
ID+A+ + + + +GLF P P K VV +PAH LA + A++ IVLL+N
Sbjct: 348 KYIDQAVARILTAKFNMGLFE-YPL--PMEKNYDKVVHAPAHVSLARKIAEESIVLLQNE 404
Query: 411 HGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV---YYPG 467
+ +LPL +K S+A+IGPNA+ + ++ + +T L+AL+ V N + Y G
Sbjct: 405 NNILPLQMNKLKSIAVIGPNADQVQFGDYTWSRDNKDGVTLLEALKERVSNQLTLNYAKG 464
Query: 468 CDTVACSSASIDKAVDIAKGADHVVLMMG---------LDQTQEKEELDRVDLVLPGRQQ 518
CD V + +AVD+AK +D ++++G E D DL L G Q+
Sbjct: 465 CDLVTDDCSGFKEAVDVAKKSDVCIVVVGSASASLARDYSNATCGEGFDLSDLTLTGVQE 524
Query: 519 ELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDH 578
+L+ + A KPVI+VLL G P +++ K NI I+ YPGE G +ALA+++ G
Sbjct: 525 DLVEAI-HATGKPVIVVLLSGKPFAMSWIK--ENIPGIVVQWYPGEQGGLALADMLLGKV 581
Query: 579 NPGGRLPMTWYPQD-------YIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGL 631
NP G+L + +PQ Y +P R + PG+ Y F K ++ FG GL
Sbjct: 582 NPSGKLNYS-FPQSVGHLPCYYNYLPTDKGFYRSPGSKNKPGKDYVFSSPKALWAFGHGL 640
Query: 632 SYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKN 691
SY+ + YL+ ++S + +DV+ VTI ++N
Sbjct: 641 SYTDFE-----------YLSATTSKEDYACEDVIE------------------VTIAIRN 671
Query: 692 HGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRARED 751
G+ G ++V+ P+++L GF+ V++ E +++ ++ P L+ ++
Sbjct: 672 TGDYDGLEVPQVYVRDMVSSVVMPVQELKGFEKVLIKKGETKQVIIKI-PVSELALYNKE 730
Query: 752 GLMVIEEGTHFLVVGDEEYPISI 774
V+E G L +G I I
Sbjct: 731 MKKVVEPGAFELQIGRASDDIRI 753
>gi|373952814|ref|ZP_09612774.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
gi|373889414|gb|EHQ25311.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
Length = 862
Score = 278 bits (712), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 161/441 (36%), Positives = 239/441 (54%), Gaps = 30/441 (6%)
Query: 39 STETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVA 98
S + P+ L RA+DLV+RLTL EK+ + + + A+PRLGI + WWSEALHG A
Sbjct: 18 SAQQLPYQNPALSSEARAKDLVTRLTLKEKVGLMKDVSEAVPRLGIKKFNWWSEALHGYA 77
Query: 99 GVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN-------AGQA 151
G T FP+ + AASFD + + A+ EARA N + +
Sbjct: 78 NQGP----------VTVFPEPVGMAASFDDQKLFHVFDAVSDEARAKNNEYRKQVESQRF 127
Query: 152 IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQAS 211
++ W PN+NIFRDPRWGRGQET GEDP +T + VS V+G+QG K + L
Sbjct: 128 HDLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVSVVKGLQGPA--DAKYRKLL--- 182
Query: 212 ACCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVN 270
AC KH+ + W +R++ + VT +DL +TY P F+S V+ +MCAY R++
Sbjct: 183 ACAKHYAVHSGPEW---SRHEMNVTDVTPRDLWETYLPAFKSLVQDADVREVMCAYQRLD 239
Query: 271 GIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVN 330
P C + LL + R WGF + SDC A++ Y++ + A + +G DV
Sbjct: 240 DEPCCGNSRLLGQILREDWGFKYLVVSDCGAITDFYNSHHSSSDATHASAKAVLSGTDVE 299
Query: 331 CGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSP 390
C + AV + + E +I+ ++ L + R LG + + + P+ KI VV S
Sbjct: 300 CVGYAFDKIPDAVYRGLIKEKDINTSVVRLMTQRFELGEMDKD-ELVPWTKIPLSVVNSE 358
Query: 391 AHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSIT 450
HQ LAL A++ + LL+N++ +LPL KS LA+IGPNAN ++ L GNY G R+I
Sbjct: 359 DHQKLALDMARETMTLLQNNNNILPLSKSIG-KLAVIGPNANDSQMLSGNYNGTPLRTIN 417
Query: 451 PLQALQNYV--ENTVYYPGCD 469
L+ ++ + ++ +Y GCD
Sbjct: 418 ILEGIKTKLGADHVIYDAGCD 438
Score = 99.8 bits (247), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 81/306 (26%), Positives = 142/306 (46%), Gaps = 53/306 (17%)
Query: 477 SIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAE 526
+ D ++ K AD VV + G+ E EE+ DR D+ LP Q+ I + +
Sbjct: 588 TYDGIIEKVKDADIVVFVGGISPKLEGEEMPVQLPGFKGGDRTDIELPAVQRNCIEALRK 647
Query: 527 AAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPM 586
A KK I+ + C G I +N +IL A Y GE+G A+A+V+FGD+NP G LP+
Sbjct: 648 AGKK--IVFVNCSGSA-IAMVPETQNCDAILQAWYAGESGGQAVADVLFGDYNPSGHLPV 704
Query: 587 TWYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQ 645
T+Y ++ ++P +D M+ GRTYR+ + +FPFG GLSY+ ++ + +
Sbjct: 705 TFY-RNVQQLPDFSDYSMK--------GRTYRYLKSAPLFPFGFGLSYTTFN-----IGE 750
Query: 646 NKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFV 705
KL N + + ++ + + V N G+ G + +++
Sbjct: 751 AKLTKNNITKGEAIQ------------------------LRVPVANAGKTDGTELLQVYI 786
Query: 706 KPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVV 765
+ +G K L GF+ + ++A + + +L P D ++ + G + L+
Sbjct: 787 RKVDDPDGAS-KTLRGFKRIPVSAGKTEMVTLDLPPKTFEFFDPTDAVVRVSPGNYQLLY 845
Query: 766 GDEEYP 771
G+ P
Sbjct: 846 GESSDP 851
>gi|313204584|ref|YP_004043241.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
gi|312443900|gb|ADQ80256.1| glycoside hydrolase family 3 domain protein [Paludibacter
propionicigenes WB4]
Length = 727
Score = 278 bits (711), Expect = 7e-72, Method: Compositional matrix adjust.
Identities = 226/766 (29%), Positives = 357/766 (46%), Gaps = 100/766 (13%)
Query: 39 STETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVA 98
S TFPF T LP ++R +L+S +TLDEK++ L + +PRLGI SE LHG+A
Sbjct: 21 SQTTFPFQNTGLPDNERLDNLLSLMTLDEKVNALSTNL-GVPRLGI-RNTGHSEGLHGMA 78
Query: 99 GVGKGIFFNGTIRG------ATSFPQVILTAASFDSYLWYRIGQAIGLEARAL-YNAG-Q 150
G G + G+ RG T FPQ ++D+ L ++ E R NA Q
Sbjct: 79 LGGPG-NWGGSERGVAKTYPTTIFPQAYGLGETWDTELIQKVADIEATEIRFYAQNANLQ 137
Query: 151 AIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQA 210
GM APN ++ RDPRWGR +E+ GED + + V++V+G+QG+ ++
Sbjct: 138 KGGMVMRAPNADLARDPRWGRTEESYGEDAFLGSRLTVAFVKGLQGND------PKYWKS 191
Query: 211 SACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVN 270
++ KHF A ++ + +T FD R+ + Y PF + +G + M +YN N
Sbjct: 192 ASLMKHFLANSNEDGRDSTSSNFDERL----FREYYSFPFYKGITEGGSRAFMASYNAWN 247
Query: 271 GIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVN 330
G+P + +L K AR +WG +G I +D A+S++ +A + + V+KA +
Sbjct: 248 GVPMTVNP-ILKKIARDEWGNNGIICTDGGALSLLVNAHHAFPTLTEGAAAVVKA----S 302
Query: 331 CGSFL---QKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVV 387
G FL + + A+K+ L E ID + F V ++LGL + + + P+ IG
Sbjct: 303 VGQFLDNFRSYIYEALKKGLLTEKNIDNVIRGNFYVALKLGLLDADQSKVPYTGIGVTDT 362
Query: 388 CSPAH----QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG 443
SP + + + +VLLKN+ GLLPL KSK S+A+IGP AN + LL Y+G
Sbjct: 363 VSPWNKQDTKAFVRKVTAKSVVLLKNTAGLLPLNKSKIKSIAVIGPRAN--EVLLDWYSG 420
Query: 444 PSCRSITPLQALQNYV--ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLD--- 498
+++ LQ ++N V + V+Y D +DKA A+ AD ++ +G
Sbjct: 421 TPPYAVSILQGIKNAVGKDIEVFYAPSD-------EMDKATLAARKADVAIVCVGNHPYG 473
Query: 499 ----------QTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAK 548
+ +E +DR + L Q++L+ V +A K ++VL+ P I +++
Sbjct: 474 TDARWKISPVPSDGREAVDRKSITL--EQEDLVKLVMQANPK-TVMVLVSNFPFAINWSQ 530
Query: 549 YDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQAT 608
N+ +IL + LA+VIFGD +P GR TW PM D +R
Sbjct: 531 --ENVPAILHVTNNSQELGNGLADVIFGDVSPAGRTTQTWVKSITDLPPMMDYDIR---- 584
Query: 609 SGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYK 668
GRTY++++ K ++PFG GLSY+ + Y S L
Sbjct: 585 ---HGRTYQYFKSKPLYPFGFGLSYTSFEYSGLETSNPTL-------------------- 621
Query: 669 SVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILN 728
T V++ VKN G+ G + L+V RP+KQL GF+ V +
Sbjct: 622 -----------TDSIFVSVKVKNIGKRDGDEVIQLYVSYPDSKVERPMKQLKGFKRVFIP 670
Query: 729 AKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
A + + L + V E+ L++G I +
Sbjct: 671 AGKSKTVEIPLKASDVAYWDIAKHAFVTEKSKLKLMIGSSSADIKL 716
>gi|423281958|ref|ZP_17260843.1| hypothetical protein HMPREF1204_00381 [Bacteroides fragilis HMW
615]
gi|404582445|gb|EKA87139.1| hypothetical protein HMPREF1204_00381 [Bacteroides fragilis HMW
615]
Length = 805
Score = 278 bits (711), Expect = 8e-72, Method: Compositional matrix adjust.
Identities = 224/717 (31%), Positives = 337/717 (47%), Gaps = 104/717 (14%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIP + E HG +G T FP I A++++ L ++G+ I +
Sbjct: 140 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRQMGRVIAI 187
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
EA A G IG + P +++ RDPRW R +ET GEDP + G + VRG QG+T N
Sbjct: 188 EASA---QGAHIG---YGPVLDLARDPRWSRVEETYGEDPYLNGVMGTALVRGFQGETLN 241
Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
GK A KHF +Y W A + ++L + PPF V G A
Sbjct: 242 DGK-----SVIATLKHFASY---GWTEGGHNGGTAHIGERELEEAIFPPFREAVGAG-AL 292
Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
+M +YN ++G P R LL+ + +W F G++ SD AV + + G A + +A +
Sbjct: 293 SVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVSDLYAVGGLRE-HGVAGNDYEAAI 351
Query: 321 DVLKAGMDVNCGS-FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
+ AG+D + G+ + AAVK+ + + ID+A+ + S++ ++GLF+ PF
Sbjct: 352 KAVNAGVDSDLGTNVYAEQLVAAVKRGDVAVATIDKAVRRILSLKFQMGLFD-----DPF 406
Query: 380 --GKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTL 437
K A +V S H LA + A+ IVLLKN LLPL K +LA+IGPNA++ +
Sbjct: 407 VDEKQAAQLVASSEHTGLAREVARQSIVLLKNKDKLLPLKKDIR-TLAVIGPNADNVYNM 465
Query: 438 LGNYAGPSCRS--ITPLQALQNYVEN---TVYYPGCDTVACSSASIDKAVDIAKGADHVV 492
LG+Y P +T L ++ V +Y GC S A++ A+ AD VV
Sbjct: 466 LGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCAVRDSSRTGFKDAIETARNADAVV 525
Query: 493 LMMG----LDQTQEKEE-------------------LDRVDLVLPGRQQELITRVAEAAK 529
++MG D + E EE DR L L GRQ EL+ ++
Sbjct: 526 MVMGGSSARDFSSEYEETGAAKVTINQISDMESGEGYDRATLHLMGRQLELLEEISRLG- 584
Query: 530 KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWY 589
KPV+LVL+ G P+ + A + +I+ A YPG G A+A+V+FGD+NP GRL ++
Sbjct: 585 KPVVLVLIKGRPLLMEGAIQEAE--AIVDAWYPGMQGGNAVADVLFGDYNPAGRLTLS-V 641
Query: 590 PQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLY 649
P+ ++P+ R GN R Y G +PFG GLSY+ +SY V +
Sbjct: 642 PRSVGQLPVYYNTRR----KGNRSR-YIEEPGTPRYPFGYGLSYTTFSYTDMKVQVTE-- 694
Query: 650 LNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPAR 709
G++ C + VT+ ++N G G L+ +
Sbjct: 695 ------------------------GSDDC---RVDVTVTIQNQGTADGDEVAQLYFRDDV 727
Query: 710 RGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
P KQL F + L A E E+ F L +SL+ ++G V+E G ++VG
Sbjct: 728 SSFTTPAKQLRAFSRIHLKAGESREVTFTLDK-KSLALYMQEGEWVVEPGLFTIMVG 783
>gi|325105296|ref|YP_004274950.1| beta-glucosidase [Pedobacter saltans DSM 12145]
gi|324974144|gb|ADY53128.1| Beta-glucosidase [Pedobacter saltans DSM 12145]
Length = 884
Score = 278 bits (711), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 166/450 (36%), Positives = 244/450 (54%), Gaps = 37/450 (8%)
Query: 36 SNPSTETFP--FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEA 93
SN ++ P F LP+++R +L+ LTL+EK+ ++NS+ + RLGIPAY+WW+EA
Sbjct: 21 SNLKSQEIPYKFRNPDLPVNERIENLLGLLTLEEKVGLMMNSSKPVGRLGIPAYDWWNEA 80
Query: 94 LHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA----- 148
LHGVA GK AT FPQ I AA+++ + I EARA YN
Sbjct: 81 LHGVARSGK----------ATVFPQAIGMAATWNESGHKQTFDLISDEARAKYNEAIRNG 130
Query: 149 --GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKG 206
G+ G++FW PNINIFRDPRWGRGQET GEDP +T + V+ VRG+QGD
Sbjct: 131 ERGRYYGLSFWTPNINIFRDPRWGRGQETYGEDPYLTARLGVAAVRGLQGDD------PK 184
Query: 207 KLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAY 266
+ AC KHF + W R+ +DA + +DL +TY P F++ VK+ +MCAY
Sbjct: 185 YFKTHACAKHFAVHSGPEW---NRHSYDATASGRDLWETYLPAFKALVKEANVQEVMCAY 241
Query: 267 NRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIY---DAEGYAKSPEDAVVDVL 323
N G P C LL+ R +W + G + SDC A+ + E + + A V+
Sbjct: 242 NAYEGQPCCGSDRLLTDILRNRWEYKGIVVSDCWAIDDFFRKGHHETHKDAAAAAADAVI 301
Query: 324 KAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIG 383
+ D+ CGS + AV+Q + + +ID +L + LG+ + + P+ ++
Sbjct: 302 HS-TDLECGSAYTNLLE-AVRQGLISQQQIDISLRRVLRGWFELGMLDPAERL-PWSQLP 358
Query: 384 ADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG 443
+V S H AL+ A++ + LLKN+ +LPL KS +A+IGPNA + L GNY G
Sbjct: 359 YQIVASKEHVQQALKVARESMTLLKNNGSILPLSKSIK-KIAVIGPNAADSVMLWGNYNG 417
Query: 444 PSCRSITPLQALQNYVENT--VYYPGCDTV 471
++T LQ ++N + + +Y GCD V
Sbjct: 418 TPNSTVTILQGIKNKLPHAEIIYDKGCDWV 447
Score = 98.6 bits (244), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 79/270 (29%), Positives = 128/270 (47%), Gaps = 49/270 (18%)
Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
K D +V GL E EE+ D++ + LP Q+EL++ + ++ KPV+ V
Sbjct: 606 KEVDAIVYAGGLSPQLEGEEMPVNADGFRGGDKISIDLPKIQRELLSSL-KSTGKPVVFV 664
Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQ---- 591
L G + + ++N ++L A Y G+ A+A+V+FGD+NP GRLP+T+Y
Sbjct: 665 LCTGS--SLALEQDEKNYNALLCAWYGGQEAGTAVADVLFGDYNPAGRLPITFYKSLSQL 722
Query: 592 DYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLN 651
D + +D + GRTYR+ K ++ FG GLSYSK++Y + KL
Sbjct: 723 DNALLKTSDTSRQDFENYSMQGRTYRYMTEKPLYAFGHGLSYSKFNY-----GEAKL--- 774
Query: 652 QSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRG 711
+S T + N ++I + N G+ V ++VK R G
Sbjct: 775 -TSGTVKIGNT--------------------LNISIPLTNISNNKGEEVVQVYVK--RNG 811
Query: 712 N-GRPIKQLVGFQSVILNAKEKAEIVFELS 740
+ P+K L GF+ V + A E + F+L+
Sbjct: 812 DPDAPVKSLKGFKRVAIAAGETKHLDFQLT 841
>gi|237721201|ref|ZP_04551682.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
gi|229448997|gb|EEO54788.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
Length = 863
Score = 278 bits (710), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 170/439 (38%), Positives = 245/439 (55%), Gaps = 38/439 (8%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
+P+ T L QRA DL+ RLTL+EK++ + N++PAIPRLGI YEWW+EALHGVA G
Sbjct: 26 YPYQDTKLTAEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARAGL 85
Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA----LYNAGQAI---GMT 155
AT FPQ I AASF+ L Y + A+ EARA GQ G+T
Sbjct: 86 ----------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNEKGQYKRYQGLT 135
Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASACC 214
W PN+NIFRDPRWGRGQET GEDP ++G+ ++ VRG+QG + KL AC
Sbjct: 136 MWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYDKLH------ACA 189
Query: 215 KHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
KHF + W R+ F+A + +DL +TY P F+ V++ +MCAYNR G P
Sbjct: 190 KHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYNRFEGDP 246
Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE--DAVVDVLKAGMDVNC 331
C LL++ R WGF G + +DC A+ + + + P+ A D + +G D+ C
Sbjct: 247 CCGSNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKKHETHPDAAHASADAVLSGTDLEC 306
Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
G + T AVK+ + E +I+ ++ L R LG N + P+ I V+ P
Sbjct: 307 GGNFKSITD-AVKKGLISEEKINTSVKRLLKARFELGEMN---STHPWSNIPFSVIDCPK 362
Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITP 451
H+ LAL+ A + +VLL+N++ L LP ++ + +A+IGPNAN + GNY G ++T
Sbjct: 363 HKELALKMAHESLVLLQNNNNL--LPLNRQMKVAVIGPNANDSVMQWGNYNGFPSHTVTL 420
Query: 452 LQALQNYVENT--VYYPGC 468
L+ ++ + + +Y P C
Sbjct: 421 LEGIRAKLPDAQIIYEPVC 439
Score = 103 bits (257), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 82/264 (31%), Positives = 120/264 (45%), Gaps = 51/264 (19%)
Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
+ AD V+ G+ E E + DR ++ LP Q+E++ + + KK + V
Sbjct: 598 QSADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGKK-TVFV 656
Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK 595
G + I +N +IL A YPG+AG A+A+V+FGD+NP GRLP+T+Y
Sbjct: 657 NFSGSAMAIV--PETQNCDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITFYKSMQQL 714
Query: 596 VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSS 655
D M+ GRTYRF ++PFG GLSY+++SY K LNQS
Sbjct: 715 PDYEDYSMK--------GRTYRFMTKTPLYPFGYGLSYTRFSY-------GKATLNQSKL 759
Query: 656 TKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRP 715
TK K ++TI V N G+ G+ V +++ G P
Sbjct: 760 TK----------------------GEKAILTIPVSNVGQRDGEEVVQVYICRPDDKEG-P 796
Query: 716 IKQLVGFQSVILNAKEKAEIVFEL 739
K L GFQ V + + + EL
Sbjct: 797 QKTLRGFQRVSIAKGKTQNVQIEL 820
>gi|427411073|ref|ZP_18901275.1| hypothetical protein HMPREF9718_03749 [Sphingobium yanoikuyae ATCC
51230]
gi|425710258|gb|EKU73280.1| hypothetical protein HMPREF9718_03749 [Sphingobium yanoikuyae ATCC
51230]
Length = 791
Score = 278 bits (710), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 224/712 (31%), Positives = 339/712 (47%), Gaps = 96/712 (13%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIP + E LHG A VG ATSFPQ I A+S+D + ++ Q I
Sbjct: 138 RLGIPIL-FHEEGLHGYAAVG-----------ATSFPQSIAMASSWDPAMLRQVNQVIAR 185
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
E RA + + M +P ++I RDPRWGR +ET GEDP + G+ V+ V G+QG
Sbjct: 186 EIRA-----RGVPMVL-SPVVDIARDPRWGRIEETYGEDPYLVGEMGVAAVEGLQG--VG 237
Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
+ A KH T + GT A V+ ++L + + PPFE VK+
Sbjct: 238 RSRTLQPNHVFATLKHLTGHGQPE-SGTNIGP--APVSERELRENFFPPFEQVVKRTGIE 294
Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
+M +YN ++G+PS A+R LL R++WGF G + SD AV + A + E+A +
Sbjct: 295 AVMASYNEIDGVPSHANRWLLDNVLRQEWGFRGAVVSDYSAVDQLMSIHHIAANLEEAAM 354
Query: 321 DVLKAGMDVNCGSFLQKHTKAA-VKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
L AG+D + L T V++ K+ E+++D A+ + ++ R GLF NP
Sbjct: 355 RALDAGVDADLPEGLSYATLGKLVREGKVSEAKVDLAVRRMLELKFRAGLFE-NPYAD-- 411
Query: 380 GKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLG 439
A + + + LA AAQ I LLKN G+LPL ++ A+IGP+A A+ LG
Sbjct: 412 ANAAAAITNNEDARALARTAAQRSITLLKND-GMLPLKPEGTI--AVIGPSAAVAR--LG 466
Query: 440 NYAGPSCRSITPLQALQNYV---ENTVYYPG----------CDTVACSSAS-----IDKA 481
Y G S++ L+ ++ V N V+ G D+V S + I +A
Sbjct: 467 GYYGQPPHSVSILEGIKARVGTKANIVFAQGVKITEDDDWWADSVTKSDPAENRKLIAQA 526
Query: 482 VDIAKGADHVVLMMGLDQTQEKEEL------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
V+ A+ D ++L +G + +E DR L L G QQEL + +A KP+ +V
Sbjct: 527 VEAARNVDRIILTLGDTEQSSREGWADNHLGDRPSLDLVGEQQELFDAL-KALGKPITVV 585
Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK 595
L+ G P + K +IL Y GE G A+A+++FGD NPGG+LP+T P+ +
Sbjct: 586 LINGRPA--STVKVSEQANAILEGWYLGEQGGNAVADILFGDVNPGGKLPVT-VPRSAGQ 642
Query: 596 VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSS 655
+P+ M+P A G Y F ++PFG GLSY+ +S +S ++ +S
Sbjct: 643 LPLF-YNMKPSARRG-----YLFDTTDPLYPFGFGLSYTSFSLSAPRLSATRIGTGGKTS 696
Query: 656 TKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRP 715
V++ V+N G G V L+++ RP
Sbjct: 697 -----------------------------VSVDVRNTGAREGDEVVQLYIRDKVSSVTRP 727
Query: 716 IKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGD 767
+K+L GFQ V L E I F + P E+L + V+E G ++ G+
Sbjct: 728 VKELKGFQRVTLKPGESRTITFTVGP-EALQMWNDQMRRVVEPGDFEIMTGN 778
>gi|317477153|ref|ZP_07936394.1| glycosyl hydrolase family 3 N terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
gi|316906696|gb|EFV28409.1| glycosyl hydrolase family 3 N terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
Length = 863
Score = 277 bits (709), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 158/437 (36%), Positives = 243/437 (55%), Gaps = 29/437 (6%)
Query: 51 PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
PIS R +++ ++TL+EK++QL N + +IPRL +P+Y +W+E LHGVA G+
Sbjct: 55 PISVRIENIIRQMTLEEKVAQLSNESDSIPRLNLPSYNYWNECLHGVARAGE-------- 106
Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY-NAGQAIGMTFWAPNINIFRDPRW 169
T FPQ I A+++D+ L RI AI EAR Y + G+ G+T+WAP IN+ RDPRW
Sbjct: 107 --VTVFPQAINLASTWDTLLVKRIASAISTEARLKYLDIGK--GLTYWAPTINMARDPRW 162
Query: 170 GRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTT 229
GR +ET GEDP +T + V++V+G+QGD N L+ A KHF A + +N
Sbjct: 163 GRNEETYGEDPYLTSRLGVAFVKGLQGDHPN------YLKTVATVKHFVANNQEN----D 212
Query: 230 RYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQW 289
R+ +++ + L + Y P +E+CVK+ IM AYN NGIP LL R++W
Sbjct: 213 RFSSSSQIPTKQLYEYYFPAYEACVKEANVQSIMTAYNAFNGIPPSGSTWLLEDVLRKEW 272
Query: 290 GFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLP 349
GF G++ SDC A+ ++ S E+A + +G D+ CG +++ AAV++ +
Sbjct: 273 GFDGFVVSDCGAIGVMNWQHRIVNSLEEAAALGINSGCDLECGGTYRENLVAAVQRGLVS 332
Query: 350 ESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKN 409
E IDRAL + ++R +LG F+ + P+ ++ + LA +AA I+LLKN
Sbjct: 333 EYAIDRALTRVLTMRFKLGEFDP-IELVPYNHYDKKLLAGEQFRRLAYEAAVKSIILLKN 391
Query: 410 SHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV---YYP 466
LP+ K S+A++GP A++ LG Y+G +I+ LQ ++ V V Y
Sbjct: 392 EDNFLPIDKKDVRSIAIVGPFADN--NYLGGYSGKPVHNISLLQGVKKMVGEEVEISYIE 449
Query: 467 GCDTVACSSASIDKAVD 483
G V+ +S A D
Sbjct: 450 GTSVVSPVDSSYLLASD 466
Score = 99.8 bits (247), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 86/297 (28%), Positives = 135/297 (45%), Gaps = 43/297 (14%)
Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLL 537
IDK I AD V++ +G D +E D + LP Q+ L+ + + + + L+L
Sbjct: 599 IDKVKKIVSRADLVLVALGNDGKLARENRDLPSIYLPMTQELLLKEIYKVNPR-IALILQ 657
Query: 538 CGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVP 597
G P+ +A ++ SIL A YPG+ G ALA ++FG NP G+LPMT Y +
Sbjct: 658 TGNPLTSQWAA--EHVPSILQAWYPGQEGGAALAGILFGLENPSGKLPMTIYESEQQLPN 715
Query: 598 MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTK 657
+ D + GRTY++ K ++ FG GLSYS + Y
Sbjct: 716 ILDYDIW-------KGRTYQYLSSKPLYGFGHGLSYSNFEY------------------A 750
Query: 658 MVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRG-NGRPI 716
++ DVVH GT C +I VKN ++ G+ + ++V + P+
Sbjct: 751 DLQCNDVVHVD-----GTLQC-------SIKVKNISDVVGEEVIQVYVSREKTPVYTFPL 798
Query: 717 KQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPIS 773
K+L+ F V L E + F ++P LS +DG + G + L VG + +S
Sbjct: 799 KKLIAFARVNLKPNESKTVTFTITP-RQLS-VWQDGEWKMLSGKYSLFVGGGQKELS 853
>gi|409196594|ref|ZP_11225257.1| beta-glucosidase [Marinilabilia salmonicolor JCM 21150]
Length = 876
Score = 277 bits (709), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 157/454 (34%), Positives = 247/454 (54%), Gaps = 44/454 (9%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
+PF +L +R +DL+SRLT++EK + L + + AIPRLGI + WWSEALHG A
Sbjct: 33 YPFQNPSLSFEERVKDLISRLTVEEKAALLCDQSDAIPRLGIEKFNWWSEALHGYAN--- 89
Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG--------M 154
N ++ T FPQ + AASF+ L + I A E RA Y+ Q G +
Sbjct: 90 ----NDSV---TVFPQPVGMAASFNDELVFDIFNATSDEGRAKYHQAQRRGEENRRFLSL 142
Query: 155 TFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACC 214
+ W PN+NIFRDPRWGRGQET GEDP +T + V V+G+QG K + L AC
Sbjct: 143 SVWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVQVVKGLQGP--EDAKYRKLL---ACA 197
Query: 215 KHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
KH+T + W +R++ + V+ ++ +TY P F++ V+ +MCAY+R++ P
Sbjct: 198 KHYTVHSGPEW---SRHELNVNDVSPREFYETYMPAFKALVQDADVRQVMCAYHRLDDEP 254
Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC-- 331
C++ +L + R +WG+ + +DC A++ + G + +P A L AG D+ C
Sbjct: 255 CCSNTRILQRILRDEWGYEHLVAADCGAITDFHTTHGISSTPVHAAATGLLAGTDLECIW 314
Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
++ K A+++ + E ID +L + R LG + + + P+ + A V+ +
Sbjct: 315 DNYHYKMLPEALEKDLIKEEHIDSSLFRVLMGRFELGEMDDDAIV-PWAQTPASVLNNKE 373
Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITP 451
H+ LA + AQ + LL+N + +LPL KS + +A+IGPNA+ + L GNY G R+IT
Sbjct: 374 HRELAYKMAQQSMTLLQNKNNVLPLDKSSTQKIAVIGPNADDEEVLWGNYNGTPIRTITI 433
Query: 452 LQALQNYVE-NTVYYPGCDTVACSSASIDKAVDI 484
L +++ + ++Y DKAVD+
Sbjct: 434 LDGMKSKTNSDRIFY-------------DKAVDL 454
Score = 91.3 bits (225), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 77/267 (28%), Positives = 115/267 (43%), Gaps = 52/267 (19%)
Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
+G + VV GL + E EE+ DR + LP Q+ + + EA K VI V
Sbjct: 612 EGIETVVFAGGLSGSLEGEEMPVSYPGFKGGDRTHIELPSVQRNALKALKEAGKT-VIFV 670
Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK 595
G I + +IL A Y GE+G A+A+V+FGD+NP G+LP+T+Y
Sbjct: 671 NFSGSA--IALEPETESCDAILQAWYAGESGGQAIADVLFGDYNPSGKLPLTFYRNSDNL 728
Query: 596 VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSS 655
D M GRTYR Y +FPFG GLSY+ + +S+ + +++ S
Sbjct: 729 GDFEDYSME--------GRTYR-YTNDHLFPFGFGLSYTNFQIGNAKLSKATITPDETIS 779
Query: 656 TKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRP 715
+ I VKN G+ G + ++V+ +G P
Sbjct: 780 -----------------------------INIPVKNTGKRDGTEIIQVYVRKVNDIDG-P 809
Query: 716 IKQLVGFQSVILNAKEKAEIVFELSPC 742
IK L GF+ V + A + + L P
Sbjct: 810 IKTLKGFERVDVPAGKTTQANISLPPS 836
>gi|375143423|ref|YP_005005864.1| Beta-glucosidase [Niastella koreensis GR20-10]
gi|361057469|gb|AEV96460.1| Beta-glucosidase [Niastella koreensis GR20-10]
Length = 793
Score = 277 bits (708), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 229/791 (28%), Positives = 365/791 (46%), Gaps = 138/791 (17%)
Query: 52 ISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----W-------SEALHGVA 98
++ R DL+S++TLDEK Q+ + + +P W W E L+G A
Sbjct: 50 VNARTADLLSKMTLDEKTCQMATLYGWHRVLKDSLPTDSWKNAIWKDGIANIDEHLNGFA 109
Query: 99 GVGK------------------------------GI---FFNGTIRG-----ATSFPQVI 120
G GK GI F N IRG AT FP +
Sbjct: 110 GWGKTAPIDLVKDMEKHVWAMNETQRFFIEQTRLGIPADFTNEGIRGVEAYEATGFPTEL 169
Query: 121 LTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGED 179
+++ L ++ G G EARAL G T +AP +++ RD RWGR +E+ GED
Sbjct: 170 NMGMTWNKELVHQEGIITGREARAL-------GYTNVYAPIMDVARDQRWGRLEESYGED 222
Query: 180 PLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTM 239
P + ++ +G+Q D GK+ ++A KHF Y + + + D +V
Sbjct: 223 PYLVASMGIALAKGIQQD--------GKVASTA--KHFAVYSANKGAREGQARTDPQVAP 272
Query: 240 QDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDC 299
+++ + PF+ +K+ G+M +YN +GIP L + R + GF GY+ SD
Sbjct: 273 REVENLLLYPFKKVIKEAGIMGVMSSYNDYDGIPVSGSNYWLIQRLRVEMGFTGYVVSDS 332
Query: 300 DAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG----SFLQKHTKAAVKQKKLPESEIDR 355
DA+ + A + ++AV AGM+V + + + VK+ ++P I+
Sbjct: 333 DALEYLATKHHVAANLKEAVFQAFMAGMNVRTTFKAPDSIIIYLRQLVKEGRIPMDTINH 392
Query: 356 ALHNLFSVRMRLGLFNGNPTMQPFGKIGAD---VVCSPAHQVLALQAAQDGIVLLKNSHG 412
+ ++ V+ RLGLF+ P+ + A+ VV S A Q +ALQA+++ +VLLKN++
Sbjct: 393 RVADVLRVKFRLGLFD-----HPYVESAAETRKVVNSDASQQIALQASRESVVLLKNNNN 447
Query: 413 LLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE--NTVYYPGCDT 470
+LPL KS +A++GPNA +Y S+ LQ +Q + +Y G D
Sbjct: 448 ILPLVKSLD-KIAVVGPNATDDDYAHTHYGPLGSPSVNVLQGIQAKLGAGKVLYAKGVDL 506
Query: 471 V---------------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPG 515
V A A +D AV+I K A ++++G + E R DL LPG
Sbjct: 507 VDKNWPESEILPEPMDAGEQAMLDSAVNITKQAQMAIVVLGGNTRTAGESKSRTDLDLPG 566
Query: 516 RQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIF 575
Q EL+ + +A KPV++VLL P+ I + D+ I I++AGYPG G +A+A+V+F
Sbjct: 567 HQLELV-KAIKATGKPVVVVLLGTQPMTINW--IDKYIDGIVYAGYPGVKGGIAVADVLF 623
Query: 576 GDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSK 635
GD+NPGG+L +TW P+ ++P+ + +P A S + G + ++PFG GLSY+
Sbjct: 624 GDYNPGGKLTLTW-PKSVGQIPL-NFPSKPGAQS-DEGEHAKI--KGLLYPFGFGLSYTS 678
Query: 636 YSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEM 695
+ Y +S K + + VT+ V N G++
Sbjct: 679 FGYTNLKISTGKTAADPVA------------------------------VTVDVTNTGKL 708
Query: 696 AGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMV 755
AG V +++ K L GF+ V L A E I F + P E L + V
Sbjct: 709 AGDEVVQCYIRDVLSSVTTYEKLLKGFERVHLQAGETKTISFTI-PREELKLYNREMKFV 767
Query: 756 IEEGTHFLVVG 766
+E G +++G
Sbjct: 768 LEPGEFSVMIG 778
>gi|423258860|ref|ZP_17239783.1| hypothetical protein HMPREF1055_02060 [Bacteroides fragilis
CL07T00C01]
gi|423264169|ref|ZP_17243172.1| hypothetical protein HMPREF1056_00859 [Bacteroides fragilis
CL07T12C05]
gi|387776440|gb|EIK38540.1| hypothetical protein HMPREF1055_02060 [Bacteroides fragilis
CL07T00C01]
gi|392706435|gb|EIY99558.1| hypothetical protein HMPREF1056_00859 [Bacteroides fragilis
CL07T12C05]
Length = 805
Score = 277 bits (708), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 223/717 (31%), Positives = 336/717 (46%), Gaps = 104/717 (14%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIP + E HG +G T FP I A++++ L ++G+ I +
Sbjct: 140 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRQMGRVIAI 187
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
EA A G IG + P +++ RDPRW R +ET GEDP + G + VRG QG+T N
Sbjct: 188 EASA---QGAHIG---YGPVLDLARDPRWSRVEETYGEDPYLNGAMGTALVRGFQGETLN 241
Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
GK A KHF +Y W A + ++L + PPF V G A
Sbjct: 242 DGK-----SVIATLKHFASY---GWTEGGHNGGTAHIGERELEEAIFPPFREAVGAG-AL 292
Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
+M +YN ++G P R LL+ + +W F G++ SD AV + + G A + +A +
Sbjct: 293 SVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVSDLYAVGGLRE-HGVAGNDYEAAI 351
Query: 321 DVLKAGMDVNCGS-FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
+ AG+D + G+ + AAVK+ + + ID+A+ + S++ ++GLF+ PF
Sbjct: 352 KAVNAGVDSDLGTNVYAEQLVAAVKRGDVAVATIDKAVRRILSLKFQMGLFD-----DPF 406
Query: 380 --GKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTL 437
K +V S H LA + A+ IVLLKN LLPL K +LA+IGPNA++ +
Sbjct: 407 VDEKQAVQLVASSEHTGLAREVARQSIVLLKNKDKLLPLKKDIR-TLAVIGPNADNVYNM 465
Query: 438 LGNYAGPSCRS--ITPLQALQNYVEN---TVYYPGCDTVACSSASIDKAVDIAKGADHVV 492
LG+Y P +T L ++ V +Y GC S A++ A+ AD VV
Sbjct: 466 LGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCAVRDSSRTGFKDAIETARNADAVV 525
Query: 493 LMMG----LDQTQEKEE-------------------LDRVDLVLPGRQQELITRVAEAAK 529
++MG D + E EE DR L L GRQ EL+ ++
Sbjct: 526 MVMGGSSARDFSSEYEETGAAKVTINQISDMESGEGYDRATLHLMGRQLELLEEISRLG- 584
Query: 530 KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWY 589
KPV+LVL+ G P+ + A + +I+ A YPG G A+A+V+FGD+NP GRL ++
Sbjct: 585 KPVVLVLIKGRPLLMEGAIQEAE--AIVDAWYPGMQGGNAVADVLFGDYNPAGRLTLS-V 641
Query: 590 PQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLY 649
P+ ++P+ R GN R Y G +PFG GLSY+ +SY V +
Sbjct: 642 PRSVGQLPVYYNTRR----KGNRSR-YIEEPGTPRYPFGYGLSYTTFSYTDMKVQVTE-- 694
Query: 650 LNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPAR 709
G++ C + VT+ ++N G G L+ +
Sbjct: 695 ------------------------GSDDC---RVDVTVTIQNQGTADGDEVAQLYFRDDV 727
Query: 710 RGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
P KQL F + L A E E+ F L +SL+ ++G V+E G ++VG
Sbjct: 728 SSFTTPAKQLRAFSRIHLKAGESREVTFTLDK-KSLALYMQEGEWVVEPGRFTIMVG 783
>gi|313203744|ref|YP_004042401.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
gi|312443060|gb|ADQ79416.1| glycoside hydrolase family 3 domain protein [Paludibacter
propionicigenes WB4]
Length = 1286
Score = 277 bits (708), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 174/465 (37%), Positives = 252/465 (54%), Gaps = 43/465 (9%)
Query: 5 KLSLVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLT 64
KLS + L C F + + ST+ P + T+ +RA DL+SRLT
Sbjct: 7 KLSCLI-LSCFAFMAFMPAKVSTKKPI-------------YLNTSYSFEERAADLISRLT 52
Query: 65 LDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAA 124
L+EK S L NS AIPRLGI + WSEALHG+ G G + I G TSFP + +
Sbjct: 53 LEEKESLLGNSMAAIPRLGIKSMNVWSEALHGILG---GANQSVGISGPTSFPNSVALGS 109
Query: 125 SFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTG 184
++D L R AI EARA+ G G+T+W+P + RDPRWGR E+ GEDP +
Sbjct: 110 AWDPALMQREAMAIADEARAINQTGTK-GLTYWSPVVEPIRDPRWGRTGESYGEDPFLAA 168
Query: 185 KYAVSYVRGVQGD--TFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDL 242
+ A +VRG+ G+ T+ L++ C KH+ A + + + + D+R D+
Sbjct: 169 EIAGGFVRGMVGNDPTY--------LKSVPCAKHYFANNSEFDRHVSSSNMDSR----DM 216
Query: 243 ADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAV 302
+ Y P++ ++Q IM +YN VNG+P+ A + L ARR +G GYIT DC A+
Sbjct: 217 REFYLAPYKKLIEQDNLPSIMSSYNAVNGVPTSASQLYLDTIARRTYGLKGYITGDCAAI 276
Query: 303 SIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFS 362
IY Y K+ E+A LKAG+D +CGS Q++ AA+K+ + ++IDRAL N+F
Sbjct: 277 EDIYTGHYYVKTAEEATAKGLKAGVDSDCGSIYQRYAIAALKKGLITMADIDRALLNIFI 336
Query: 363 VRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGL------LPL 416
VRMR G F+ P + + ++V SPA++ LA + A VLLKN+ L LPL
Sbjct: 337 VRMRTGEFD-PPAKVLYAQFQPNIVNSPANKALAKEIATKTPVLLKNNISLKTNRKALPL 395
Query: 417 PKSKSVSLALIGPNANSAKTLLGNYAGPSCRS--ITPLQALQNYV 459
+ +ALIGP+A+ K LG Y+G + ITP ++ Y+
Sbjct: 396 NPADLKKIALIGPHAD--KVELGPYSGRPAQENMITPFAGIKKYI 438
Score = 116 bits (291), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 81/275 (29%), Positives = 131/275 (47%), Gaps = 38/275 (13%)
Query: 496 GLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGS 555
G D+ EE DR+ L+LPG Q ELI VA A I+V+ G V++ K +NI
Sbjct: 619 GTDEKTATEEADRLTLLLPGNQVELIKAVA-AVNPNTIVVMQTLGCVEVEEFKNLQNIPG 677
Query: 556 ILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRT 615
I+W GY G+A A+A V+FG+ NPGG+L TWY +TD +R +G GRT
Sbjct: 678 IIWVGYNGQAQGDAIASVLFGEVNPGGKLNGTWYKSVKDLPEITDYTLR--GGNGKNGRT 735
Query: 616 YRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGT 675
+ +++ + FG G+SY+ + Y +S+N + +P
Sbjct: 736 FWYFDKDVSYEFGFGMSYTTFEYSNFRISKNSI---------------------IPH--- 771
Query: 676 EFCETRKFLVTIGVKNHGEMAGKHPVLLFVK----PARRGNGRPIKQLVGFQSVILNAKE 731
K V++ VKN G++ G + +++K PA RPIK+L GF+ V L A +
Sbjct: 772 -----DKITVSVDVKNTGKVEGDEVIQVYMKTPDSPASLQ--RPIKRLKGFKRVTLPAGQ 824
Query: 732 KAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
+ +++ + + ++G + +G
Sbjct: 825 TKTVNIDINCADLWFWDMDKNTTTFDQGKYVFEIG 859
>gi|265765465|ref|ZP_06093740.1| periplasmic beta-glucosidase [Bacteroides sp. 2_1_16]
gi|263254849|gb|EEZ26283.1| periplasmic beta-glucosidase [Bacteroides sp. 2_1_16]
Length = 814
Score = 276 bits (707), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 223/717 (31%), Positives = 336/717 (46%), Gaps = 104/717 (14%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIP + E HG +G T FP I A++++ L ++G+ I +
Sbjct: 149 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRQMGRVIAI 196
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
EA A G IG + P +++ RDPRW R +ET GEDP + G + VRG QG+T N
Sbjct: 197 EASA---QGAHIG---YGPVLDLARDPRWSRVEETYGEDPYLNGAMGTALVRGFQGETLN 250
Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
GK A KHF +Y W A + ++L + PPF V G A
Sbjct: 251 DGK-----SVIATLKHFASY---GWTEGGHNGGTAHIGERELEEAIFPPFREAVGAG-AL 301
Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
+M +YN ++G P R LL+ + +W F G++ SD AV + + G A + +A +
Sbjct: 302 SVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVSDLYAVGGLRE-HGVAGNDYEAAI 360
Query: 321 DVLKAGMDVNCGS-FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
+ AG+D + G+ + AAVK+ + + ID+A+ + S++ ++GLF+ PF
Sbjct: 361 KAVNAGVDSDLGTNVYAEQLVAAVKRGDVAVATIDKAVRRILSLKFQMGLFD-----DPF 415
Query: 380 --GKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTL 437
K +V S H LA + A+ IVLLKN LLPL K +LA+IGPNA++ +
Sbjct: 416 VDEKQAVQLVASSEHTGLAREVARQSIVLLKNKDKLLPLKKDIR-TLAVIGPNADNVYNM 474
Query: 438 LGNYAGPSCRS--ITPLQALQNYVEN---TVYYPGCDTVACSSASIDKAVDIAKGADHVV 492
LG+Y P +T L ++ V +Y GC S A++ A+ AD VV
Sbjct: 475 LGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCAVRDSSRTGFKDAIETARNADAVV 534
Query: 493 LMMG----LDQTQEKEE-------------------LDRVDLVLPGRQQELITRVAEAAK 529
++MG D + E EE DR L L GRQ EL+ ++
Sbjct: 535 MVMGGSSARDFSSEYEETGAAKVTINQISDMESGEGYDRATLHLMGRQLELLEEISRLG- 593
Query: 530 KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWY 589
KPV+LVL+ G P+ + A + +I+ A YPG G A+A+V+FGD+NP GRL ++
Sbjct: 594 KPVVLVLIKGRPLLMEGAIQEAE--AIVDAWYPGMQGGNAVADVLFGDYNPAGRLTLS-V 650
Query: 590 PQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLY 649
P+ ++P+ R GN R Y G +PFG GLSY+ +SY V +
Sbjct: 651 PRSVGQLPVYYNTRR----KGNRSR-YIEEPGTPRYPFGYGLSYTTFSYTDMKVQVTE-- 703
Query: 650 LNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPAR 709
G++ C + VT+ ++N G G L+ +
Sbjct: 704 ------------------------GSDDC---RVDVTVTIQNQGTADGDEVAQLYFRDDV 736
Query: 710 RGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
P KQL F + L A E E+ F L +SL+ ++G V+E G ++VG
Sbjct: 737 SSFTTPAKQLRAFSRIHLKAGESREVTFTLDK-KSLALYMQEGEWVVEPGRFTIMVG 792
>gi|60680320|ref|YP_210464.1| beta-glucosidase [Bacteroides fragilis NCTC 9343]
gi|60491754|emb|CAH06512.1| putative beta-glucosidase [Bacteroides fragilis NCTC 9343]
Length = 814
Score = 276 bits (707), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 223/717 (31%), Positives = 336/717 (46%), Gaps = 104/717 (14%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIP + E HG +G T FP I A++++ L ++G+ I +
Sbjct: 149 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRQMGRVIAI 196
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
EA A G IG + P +++ RDPRW R +ET GEDP + G + VRG QG+T N
Sbjct: 197 EASA---QGAHIG---YGPVLDLARDPRWSRVEETYGEDPYLNGVMGTALVRGFQGETLN 250
Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
GK A KHF +Y W A + ++L + PPF V G A
Sbjct: 251 DGK-----SVIATLKHFASY---GWTEGGHNGGTAHIGERELEEAIFPPFREAVGAG-AL 301
Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
+M +YN ++G P R LL+ + +W F G++ SD AV + + G A + +A +
Sbjct: 302 SVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVSDLYAVGGLRE-HGVAGNDYEAAI 360
Query: 321 DVLKAGMDVNCGS-FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
+ AG+D + G+ + AAVK+ + + ID+A+ + S++ ++GLF+ PF
Sbjct: 361 KAVNAGVDSDLGTNVYAEQLVAAVKRGDVAVATIDKAVRRILSLKFQMGLFD-----DPF 415
Query: 380 --GKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTL 437
K +V S H LA + A+ IVLLKN LLPL K +LA+IGPNA++ +
Sbjct: 416 VDEKQAVQLVASSEHTGLAREVARQSIVLLKNKDKLLPLKKDIR-TLAVIGPNADNVYNM 474
Query: 438 LGNYAGPSCRS--ITPLQALQNYVEN---TVYYPGCDTVACSSASIDKAVDIAKGADHVV 492
LG+Y P +T L ++ V +Y GC S A++ A+ AD VV
Sbjct: 475 LGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCTVRDSSRTGFKDAIETARNADAVV 534
Query: 493 LMMG----LDQTQEKEE-------------------LDRVDLVLPGRQQELITRVAEAAK 529
++MG D + E EE DR L L GRQ EL+ ++
Sbjct: 535 MVMGGSSARDFSSEYEETGAAKVTINQISDMESGEGYDRATLHLMGRQLELLEEISRLG- 593
Query: 530 KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWY 589
KPV+LVL+ G P+ + A + +I+ A YPG G A+A+V+FGD+NP GRL ++
Sbjct: 594 KPVVLVLIKGRPLLMEGAIQEAE--AIVDAWYPGMQGGNAVADVLFGDYNPAGRLTLS-V 650
Query: 590 PQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLY 649
P+ ++P+ R GN R Y G +PFG GLSY+ +SY V +
Sbjct: 651 PRSVGQLPVYYNTRR----KGNRSR-YIEEPGTPRYPFGYGLSYTTFSYTDMKVQVTE-- 703
Query: 650 LNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPAR 709
G++ C + VT+ ++N G G L+ +
Sbjct: 704 ------------------------GSDDC---RVDVTVTIQNQGTADGDEVAQLYFRDDV 736
Query: 710 RGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
P KQL F + L A E E+ F L +SL+ ++G V+E G ++VG
Sbjct: 737 SSFTTPAKQLRAFSRIHLKAGESREVTFTLDK-KSLALYMQEGEWVVEPGRFTIMVG 792
>gi|423269263|ref|ZP_17248235.1| hypothetical protein HMPREF1079_01317 [Bacteroides fragilis
CL05T00C42]
gi|423273173|ref|ZP_17252120.1| hypothetical protein HMPREF1080_00773 [Bacteroides fragilis
CL05T12C13]
gi|392701685|gb|EIY94842.1| hypothetical protein HMPREF1079_01317 [Bacteroides fragilis
CL05T00C42]
gi|392708205|gb|EIZ01313.1| hypothetical protein HMPREF1080_00773 [Bacteroides fragilis
CL05T12C13]
Length = 805
Score = 276 bits (707), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 223/717 (31%), Positives = 336/717 (46%), Gaps = 104/717 (14%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIP + E HG +G T FP I A++++ L ++G+ I +
Sbjct: 140 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRQMGRVIAI 187
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
EA A G IG + P +++ RDPRW R +ET GEDP + G + VRG QG+T N
Sbjct: 188 EASA---QGAHIG---YGPVLDLARDPRWSRVEETYGEDPYLNGVMGTALVRGFQGETLN 241
Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
GK A KHF +Y W A + ++L + PPF V G A
Sbjct: 242 DGK-----SVIATLKHFASY---GWTEGGHNGGTAHIGERELEEAIFPPFREAVGAG-AL 292
Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
+M +YN ++G P R LL+ + +W F G++ SD AV + + G A + +A +
Sbjct: 293 SVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVSDLYAVGGLRE-HGVAGNDYEAAI 351
Query: 321 DVLKAGMDVNCGS-FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
+ AG+D + G+ + AAVK+ + + ID+A+ + S++ ++GLF+ PF
Sbjct: 352 KAVNAGVDSDLGTNVYAEQLVAAVKRGDVAVATIDKAVRRILSLKFQMGLFD-----DPF 406
Query: 380 --GKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTL 437
K +V S H LA + A+ IVLLKN LLPL K +LA+IGPNA++ +
Sbjct: 407 VDEKQAVQLVASSEHTGLAREVARQSIVLLKNKDKLLPLKKDIR-TLAVIGPNADNVYNM 465
Query: 438 LGNYAGPSCRS--ITPLQALQNYVEN---TVYYPGCDTVACSSASIDKAVDIAKGADHVV 492
LG+Y P +T L ++ V +Y GC S A++ A+ AD VV
Sbjct: 466 LGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCAVRDSSRTGFKDAIETARNADAVV 525
Query: 493 LMMG----LDQTQEKEE-------------------LDRVDLVLPGRQQELITRVAEAAK 529
++MG D + E EE DR L L GRQ EL+ ++
Sbjct: 526 MVMGGSSARDFSSEYEETGAAKVTINQISDMESGEGYDRATLHLMGRQLELLEEISRLG- 584
Query: 530 KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWY 589
KPV+LVL+ G P+ + A + +I+ A YPG G A+A+V+FGD+NP GRL ++
Sbjct: 585 KPVVLVLIKGRPLLMEGAIQEAE--AIVDAWYPGMQGGNAVADVLFGDYNPAGRLTLS-V 641
Query: 590 PQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLY 649
P+ ++P+ R GN R Y G +PFG GLSY+ +SY V +
Sbjct: 642 PRSVGQLPVYYNTRR----KGNRSR-YIEEPGTPRYPFGYGLSYTTFSYTDMKVQVTE-- 694
Query: 650 LNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPAR 709
G++ C + VT+ ++N G G L+ +
Sbjct: 695 ------------------------GSDDC---RVDVTVTIQNQGTADGDEVAQLYFRDDV 727
Query: 710 RGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
P KQL F + L A E E+ F L +SL+ ++G V+E G ++VG
Sbjct: 728 SSFTTPAKQLRAFSRIHLKAGESREVTFTLDK-KSLALYMQEGEWVVEPGRFTIMVG 783
>gi|336408356|ref|ZP_08588849.1| hypothetical protein HMPREF1018_00864 [Bacteroides sp. 2_1_56FAA]
gi|335937834|gb|EGM99730.1| hypothetical protein HMPREF1018_00864 [Bacteroides sp. 2_1_56FAA]
Length = 805
Score = 276 bits (707), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 223/717 (31%), Positives = 336/717 (46%), Gaps = 104/717 (14%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIP + E HG +G T FP I A++++ L ++G+ I +
Sbjct: 140 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRQMGRVIAI 187
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
EA A G IG + P +++ RDPRW R +ET GEDP + G + VRG QG+T N
Sbjct: 188 EASA---QGAHIG---YGPVLDLARDPRWSRVEETYGEDPYLNGVMGTALVRGFQGETLN 241
Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
GK A KHF +Y W A + ++L + PPF V G A
Sbjct: 242 DGK-----SVIATLKHFASY---GWTEGGHNGGTAHIGERELEEAIFPPFREAVGAG-AL 292
Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
+M +YN ++G P R LL+ + +W F G++ SD AV + + G A + +A +
Sbjct: 293 SVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVSDLYAVGGLRE-HGVAGNDYEAAI 351
Query: 321 DVLKAGMDVNCGS-FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
+ AG+D + G+ + AAVK+ + + ID+A+ + S++ ++GLF+ PF
Sbjct: 352 KAVNAGVDSDLGTNVYAEQLVAAVKRGDVAVATIDKAVRRILSLKFQMGLFD-----DPF 406
Query: 380 --GKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTL 437
K +V S H LA + A+ IVLLKN LLPL K +LA+IGPNA++ +
Sbjct: 407 VDEKQAVQLVASSEHTGLAREVARQSIVLLKNKDKLLPLKKDIR-TLAVIGPNADNVYNM 465
Query: 438 LGNYAGPSCRS--ITPLQALQNYVEN---TVYYPGCDTVACSSASIDKAVDIAKGADHVV 492
LG+Y P +T L ++ V +Y GC S A++ A+ AD VV
Sbjct: 466 LGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCAVRDSSRTGFKDAIETARNADAVV 525
Query: 493 LMMG----LDQTQEKEE-------------------LDRVDLVLPGRQQELITRVAEAAK 529
++MG D + E EE DR L L GRQ EL+ ++
Sbjct: 526 MVMGGSSARDFSSEYEETGAAKVTINQISDMESGEGYDRATLHLMGRQLELLEEISRLG- 584
Query: 530 KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWY 589
KPV+LVL+ G P+ + A + +I+ A YPG G A+A+V+FGD+NP GRL ++
Sbjct: 585 KPVVLVLIKGRPLLMEGAIQEAE--AIVDAWYPGMQGGNAVADVLFGDYNPAGRLTLS-V 641
Query: 590 PQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLY 649
P+ ++P+ R GN R Y G +PFG GLSY+ +SY V +
Sbjct: 642 PRSVGQLPVYYNTRR----KGNRSR-YIEEPGTPRYPFGYGLSYTTFSYTDMKVQVTE-- 694
Query: 650 LNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPAR 709
G++ C + VT+ ++N G G L+ +
Sbjct: 695 ------------------------GSDDC---RVDVTVTIQNQGTADGDEVAQLYFRDDV 727
Query: 710 RGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
P KQL F + L A E E+ F L +SL+ ++G V+E G ++VG
Sbjct: 728 SSFTTPAKQLRAFSRIHLKAAESREVTFTLDK-KSLALYMQEGEWVVEPGRFTIMVG 783
>gi|383117091|ref|ZP_09937838.1| hypothetical protein BSHG_0805 [Bacteroides sp. 3_2_5]
gi|382973702|gb|EES87886.2| hypothetical protein BSHG_0805 [Bacteroides sp. 3_2_5]
Length = 805
Score = 276 bits (707), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 224/724 (30%), Positives = 339/724 (46%), Gaps = 104/724 (14%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIP + E HG +G T FP I A++++ L ++G+ I +
Sbjct: 140 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRQMGRVIAI 187
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
EA A G IG + P +++ RDPRW R +ET GEDP + G + VRG QG+T N
Sbjct: 188 EASA---QGAHIG---YGPVLDLARDPRWSRVEETYGEDPYLNGVMGTALVRGFQGETLN 241
Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
GK A KHF +Y W A + ++L + PPF V G A
Sbjct: 242 DGK-----SVIATLKHFASY---GWTEGGHNGGTAHIGERELEEAIFPPFREAVGAG-AL 292
Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
+M +YN ++G P R LL+ + +W F G++ SD AV + + G A + +A +
Sbjct: 293 SVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVSDLYAVGGLRE-HGVAGNDYEAAI 351
Query: 321 DVLKAGMDVNCGS-FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
+ AG+D + G+ + AAVK+ + + ID+A+ + S++ ++GLF+ PF
Sbjct: 352 KAVNAGVDSDLGTNVYAEQLVAAVKRGDVAVATIDKAVRRILSLKFQMGLFD-----DPF 406
Query: 380 --GKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTL 437
K +V S H LA + A+ IVLLKN LLPL K +LA+IGPNA++ +
Sbjct: 407 VDEKQAVQLVASSEHTGLAREVARQSIVLLKNKDKLLPLKKDIR-TLAVIGPNADNVYNM 465
Query: 438 LGNYAGPSCRS--ITPLQALQNYVEN---TVYYPGCDTVACSSASIDKAVDIAKGADHVV 492
LG+Y P +T L ++ V +Y GC S A++ A+ AD VV
Sbjct: 466 LGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCAVRDSSRTGFKDAIETARNADTVV 525
Query: 493 LMMG----LDQTQEKEE-------------------LDRVDLVLPGRQQELITRVAEAAK 529
++MG D + E EE DR L L GRQ EL+ ++
Sbjct: 526 MVMGGSSARDFSSEYEETGAAKVTINQISDMESGEGYDRATLHLMGRQLELLEEISRLG- 584
Query: 530 KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWY 589
KPV+LVL+ G P+ + A + +I+ A YPG G A+A+V+FGD+NP GRL ++
Sbjct: 585 KPVVLVLIKGRPLLMEGAIQEAE--AIVDAWYPGMQGGNAVADVLFGDYNPAGRLTLS-V 641
Query: 590 PQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLY 649
P+ ++P+ R GN R Y G +PFG GLSY+ +SY + K+
Sbjct: 642 PRSVGQLPVYYNTRR----KGNRSR-YVEEPGTPRYPFGYGLSYTTFSY-----TDMKVQ 691
Query: 650 LNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPAR 709
+ + S V+ VT+ ++N G G L+ +
Sbjct: 692 VTEGSDDCWVD------------------------VTVTIQNQGTADGDEVAQLYFRDDV 727
Query: 710 RGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEE 769
P KQL F + L A E E+ F L +SL+ ++G V+E G ++VG
Sbjct: 728 SSFTTPAKQLRAFSRIHLKAGESREVTFTLDK-KSLALYMQEGEWVVEPGRFTIMVGGSS 786
Query: 770 YPIS 773
I+
Sbjct: 787 EDIT 790
>gi|333377782|ref|ZP_08469515.1| hypothetical protein HMPREF9456_01110 [Dysgonomonas mossii DSM
22836]
gi|332883802|gb|EGK04082.1| hypothetical protein HMPREF9456_01110 [Dysgonomonas mossii DSM
22836]
Length = 727
Score = 276 bits (707), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 224/766 (29%), Positives = 354/766 (46%), Gaps = 103/766 (13%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
+PF T+L +R +L+S +T+DEKI+ L ++ +PRLGI SE LHG+A G
Sbjct: 24 YPFQNTSLSDEKRLDNLLSIMTIDEKINAL-STNLGVPRLGIRN-TGHSEGLHGMALGGP 81
Query: 103 GIF--FNGT-------IRGATSFPQVILTAASFDSYLWYRIGQAIGLEARAL-----YNA 148
G + F + T+FPQ ++D+ L ++ E R Y
Sbjct: 82 GNWGGFKMVNYQRVPDVYPTTTFPQAYGLGETWDTELIKKVADIEATEIRYYTQNERYTK 141
Query: 149 GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKL 208
G G+ APN ++ RDPRWGR +E+ GEDP + + AV++++G+QG+ K
Sbjct: 142 G---GLVMRAPNADLARDPRWGRTEESFGEDPFLVSEMAVAFIKGLQGENPRYWK----- 193
Query: 209 QASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
+++ KHF A ++ + +T FD R+ + Y PF +++G + M AYN
Sbjct: 194 -SASLMKHFLANSNEDGRDSTSSNFDNRL----FHEYYSYPFRKGIEKGGSQAFMAAYNS 248
Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
N IP L K R+ W F G I +D A+ ++ A + + ++KAG+
Sbjct: 249 WNEIPMTIHPIL--KKIRKDWNFKGIICTDGGALDLLIKAHKTFPTHTEGSAAIVKAGV- 305
Query: 329 VNCGSFLQK---HTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIG-A 384
G FL + A+++ L E+EID+A+ F + ++LGL +G+ T P+ IG
Sbjct: 306 ---GQFLDNFRPYIYQALEKGMLTEAEIDKAIRGNFYIALKLGLLDGDQTKLPYAHIGVT 362
Query: 385 DVVC---SPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNY 441
D V + Q +VLLKN LLPL K +A+IGP AN + LL Y
Sbjct: 363 DTVSVWRNKEIQDFVRLVTAKSVVLLKNEKKLLPLNKGNIKRIAVIGPRAN--EVLLDWY 420
Query: 442 AGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLD--- 498
+G +++ LQ ++N V N V + + SS IDKA A+ AD ++ +G
Sbjct: 421 SGTPPYTVSILQGIKNAVGNNV-----EVIYESSNEIDKAYLAAQKADIAIVCVGNHVYG 475
Query: 499 ----------QTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAK 548
+ +E +DR L L +QE + ++ A ++VL+ P I +++
Sbjct: 476 TDPKWKYSPVPSDGREAVDRKALSL---EQEDLVKIVHKANPNTVMVLVSSFPFAINWSQ 532
Query: 549 YDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQAT 608
NI +IL + LA+VIFG++NP GR TW PM D +R
Sbjct: 533 --ENIPAILHITNNSQELGNGLADVIFGNYNPAGRTNQTWVKSIADLPPMMDYDIRN--- 587
Query: 609 SGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYK 668
GRTY + + K ++PFG GLSY+ ++Y A+S + L
Sbjct: 588 ----GRTYMYAKEKPLYPFGYGLSYTNFTYSDMALSSSAL-------------------- 623
Query: 669 SVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILN 728
+ + V++ VKN G+M G+ L+V + RPIKQL GF + +
Sbjct: 624 ---------SKGKNLKVSVNVKNTGDMDGEEVAQLYVSFPQSKVVRPIKQLKGFDRISIK 674
Query: 729 AKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
E F LS + + VIE T +++G I +
Sbjct: 675 KGESKTFEFTLSADDLAYWDNDKDSFVIEPETVNILIGSSSEDIRL 720
>gi|189464211|ref|ZP_03012996.1| hypothetical protein BACINT_00548 [Bacteroides intestinalis DSM
17393]
gi|189438001|gb|EDV06986.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 814
Score = 276 bits (707), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 220/717 (30%), Positives = 345/717 (48%), Gaps = 104/717 (14%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIP + E HG +G T FP I A++++ L R+G+AI
Sbjct: 149 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRRMGRAIAT 196
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
EA A G IG + P +++ RDPRW R +ET GED + G + V+G QG+
Sbjct: 197 EASA---QGAHIG---YGPVLDLARDPRWSRVEETYGEDAYLNGVMGAALVKGFQGEF-- 248
Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
+ KGK+ A+ KHF AY W A V +++ + PPF V G A
Sbjct: 249 -PRTKGKVIAT--LKHFAAY---GWTEGGHNGGSAHVGNREMEEAIYPPFREAVAAG-AL 301
Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
+M +YN ++GIP A+ NLL+ + +W F G++ SD A+ + + G A + +A V
Sbjct: 302 SVMSSYNEIDGIPCTANSNLLTGLLKERWQFKGFVVSDLYAIGGLRE-HGVADTDYEAAV 360
Query: 321 DVLKAGMDVNCGSFLQK-HTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
+ AG+D + G+ + AVK+ + E I++A+ + +++ +GLF+ PF
Sbjct: 361 KAVNAGVDSDLGTNVYAGQLVNAVKRGDVQEVVINKAVSRILALKFHMGLFD-----HPF 415
Query: 380 --GKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTL 437
+ VV S H LA + A+ I+LLKN + LLPL K K+ ++A+IGPNA++ +
Sbjct: 416 VDEREPEQVVASTEHLELAREVARQSIILLKNKNELLPLNK-KTKTIAVIGPNADNIYNM 474
Query: 438 LGNYAGPSCRS--ITPLQALQNYVEN---TVYYPGCDTVACSSASIDKAVDIAKGADHVV 492
LG+Y P S +T L ++ V N +Y GC S + +A++ A+ +D VV
Sbjct: 475 LGDYTAPQSESSVVTVLDGIRQKVSNDTHIIYAKGCAVRDSSKSGFQEAIEAARQSDVVV 534
Query: 493 LMMG-----------------------LDQTQEKEELDRVDLVLPGRQQELITRVAEAAK 529
++MG + + E DR L L GRQ+ELI V +
Sbjct: 535 MVMGGSSARDFSSKYEETGAAKVSDSHISDMESGEGYDRSTLELLGRQRELIREVGK-LN 593
Query: 530 KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWY 589
KP++LVL+ G P + + + +I+ A YPG G A+A+V+FGD+NP GRL ++
Sbjct: 594 KPIVLVLIKGRP--LLLEGIEAEVDAIVDAWYPGMQGGNAVADVLFGDYNPAGRLTIS-V 650
Query: 590 PQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLY 649
P+ ++P+ R GN + Y EG +PFG GLSY+ ++Y
Sbjct: 651 PRSVGQLPVYYNTKR----KGNRSK-YIEEEGTPRYPFGYGLSYTSFNY----------- 694
Query: 650 LNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPAR 709
++VE +D C +++ V+N G G V L+++
Sbjct: 695 --SDLKAEVVEAEDS-------------CLVN---ISVKVRNEGSRDGDEVVQLYLRDEV 736
Query: 710 RGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
P KQL GFQ + L E EI F L +SL+ ++ +E G L++G
Sbjct: 737 ASFTTPFKQLCGFQRIHLKVGETKEITFRLDK-KSLALYMQNEEWAVEPGRFTLMLG 792
>gi|365122063|ref|ZP_09338970.1| hypothetical protein HMPREF1033_02316 [Tannerella sp.
6_1_58FAA_CT1]
gi|363643257|gb|EHL82578.1| hypothetical protein HMPREF1033_02316 [Tannerella sp.
6_1_58FAA_CT1]
Length = 819
Score = 276 bits (706), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 224/777 (28%), Positives = 342/777 (44%), Gaps = 137/777 (17%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WSEALHGV- 97
F PI +R +DL+S++ LDEK QL + +P EW W + + +
Sbjct: 53 FENPKQPIEKRVQDLLSQMNLDEKTCQLATLYGYKRVMSDSLPTPEWKNKIWKDGIANID 112
Query: 98 ---AGVGKGI-----------------------------------FFNGTIRG-----AT 114
GVG+G F N TI G AT
Sbjct: 113 EQLNGVGRGAKIAQDLIYPFSKHAEAINKTQKWFIEETRLGIPVDFSNETIHGLNHTKAT 172
Query: 115 SFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQ 173
P I +++++ L Y+ G G EA+AL G T +AP +++ RDPRWGR
Sbjct: 173 PLPAPIGIGSTWNAPLVYKAGSIAGKEAKAL-------GYTNIYAPILDLARDPRWGRVL 225
Query: 174 ETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKF 233
E GEDP + V+G+Q + +A KHF Y + +
Sbjct: 226 ECYGEDPFLVATLGTQMVKGIQ-----------EQGVAATLKHFAVYSVPKGGRDGSVRT 274
Query: 234 DARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHG 293
D V +++ + PF+ ++ G+M +YN +G+P A L++ R+++GF G
Sbjct: 275 DPHVAPREMHQMHLYPFKKVIQDAHPMGVMSSYNDWDGVPVTASYYFLTQLLRQEFGFDG 334
Query: 294 YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG----SFLQKHTKAAVKQKKLP 349
Y+ SD DAV +Y+ A++ E+AV VL+AG++V + VK+ +L
Sbjct: 335 YVVSDSDAVEYVYNKHHVAETYEEAVRMVLEAGLNVRTTFAAPDIFILPARKLVKEGRLS 394
Query: 350 ESEIDRALHNLFSVRMRLGLFNGNPTMQPF---GKIGADVVCSPAHQVLALQAAQDGIVL 406
ID + ++ V+ RLGLF+ QPF K +V + ++ L + +VL
Sbjct: 395 MKVIDERVADVLRVKFRLGLFD-----QPFVADPKAADKIVGADKNKDFVLDIQRQSLVL 449
Query: 407 LKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV--- 463
LKN + LLPL K+K + + GP A ++ Y +IT + ++NY+ N V
Sbjct: 450 LKNENNLLPLDKNKLSRILITGPLAKEENYMVSRYGPQELENITVYEGIKNYLGNKVAVD 509
Query: 464 YYPGCDT---------------VACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDR 508
Y GC I AV+ AK +D V+ ++G D+ E R
Sbjct: 510 YALGCKVKDAKWPESEIIHSPLTTEEQQEIQNAVEKAKLSDIVIAVLGEDEESTGESKSR 569
Query: 509 VDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAV 568
L LPGRQQ+L+ + A KPV+LVL+ G P+ I +A DR I +IL A +PG+ G
Sbjct: 570 SGLDLPGRQQQLLEALY-ATGKPVVLVLINGQPLTINWA--DRYIPAILEAWFPGQMGGT 626
Query: 569 ALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEV---- 624
A+AE +FGD+NPGG+LP+T +P+ ++ + + +P + S P Y V
Sbjct: 627 AIAETLFGDYNPGGKLPVT-FPKTLGQIEL-NFPFKPASQSKQPEAGPNGYGKTRVNGAL 684
Query: 625 FPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFL 684
+PFG GLSY+ + Y VS PE + +
Sbjct: 685 YPFGFGLSYTTFEYSNLKVS--------------------------PERQGPKGDIQ--- 715
Query: 685 VTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSP 741
V+ + N G+ AG V L+VK L GF+ V L E I F L P
Sbjct: 716 VSFDITNTGKRAGDEIVQLYVKDKVSSVISYESLLRGFERVSLQPGETKNIQFTLHP 772
>gi|225872720|ref|YP_002754177.1| xylan 1,4-beta-xylosidase [Acidobacterium capsulatum ATCC 51196]
gi|225793233|gb|ACO33323.1| xylann 1,4-beta-xylosidase [Acidobacterium capsulatum ATCC 51196]
Length = 721
Score = 276 bits (706), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 222/755 (29%), Positives = 339/755 (44%), Gaps = 96/755 (12%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
+PF L QR DL+SR+TL EKI Q + P +PRLGIP E LHG A +G
Sbjct: 24 YPFQNPALSPDQRIDDLLSRMTLQEKI-QALGDDPGVPRLGIPG-ALTEEGLHGAA-IGG 80
Query: 103 GIFFNGTIRG---ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAP 159
+ G R T FPQ ++D L + E R N G+ AP
Sbjct: 81 PAHWEGRGRAVVPTTQFPQNHGLGQTWDPALLQKAANVEAYETRWAVNKYHDGGLIVRAP 140
Query: 160 NINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTA 219
N N+ RDPRWGR +E+ GEDP + G AV++++G+QG+ + +A KHF A
Sbjct: 141 NANLSRDPRWGRTEESYGEDPYLVGTLAVAWIKGLQGNN------PRYWETAALMKHFDA 194
Query: 220 YDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRN 279
Y + + + F R+ + Y PF ++QG + M +YN NGIP A+
Sbjct: 195 YSNEANRDGSSSNFGKRL----FYEYYSVPFRMGIEQGHSDAFMTSYNAWNGIPMTANP- 249
Query: 280 LLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT 339
+L ++WGF+G I +D A+S + Y K+ +A + AG++ + Q+
Sbjct: 250 VLKSVVMKKWGFNGIICTDAGALSNMVTHFHYYKTMPEAAAGAVHAGINQFLDRY-QQPV 308
Query: 340 KAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIG--------ADVVCSPA 391
+ A++QK L E +ID+ L ++ V +RLGL + + +M P+ IG D P+
Sbjct: 309 EEALQQKLLTEQQIDQDLKGVYRVVLRLGLMDPS-SMSPYSMIGLTNDNPAKGDPWDWPS 367
Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITP 451
H L + + IVLLKN + LPL K S+A+IGP AN L Y+G +TP
Sbjct: 368 HIALDRKVTDESIVLLKNQNHALPLDAKKLHSIAVIGPWANIVA--LDWYSGTPPFGVTP 425
Query: 452 LQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQT----------- 500
++ ++ V P +++ A +AK +D ++++G T
Sbjct: 426 VEGIRQRVG-----PDVKVTFNDGSNLQAAAALAKQSDEAIVIIGNHPTCDAGWGKCALP 480
Query: 501 -QEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWA 559
+ KE DR L LP E I + AA ++VL P + + +I +IL
Sbjct: 481 SEGKEAFDRTALNLP---DESIAKAVYAANPHTVVVLQTSFPYTTDWTQ--AHIPAILEM 535
Query: 560 GYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFY 619
+ E ALA+V+FGD++P GRL TW PM D +R GRTY +
Sbjct: 536 AHNSEEQGTALADVLFGDYDPAGRLAQTWVASIGQLPPMMDYNIR-------DGRTYMYL 588
Query: 620 EGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCE 679
+ K ++PFG GLSY+ + Y +S + L P G
Sbjct: 589 KSKPLYPFGFGLSYTTFKYSNLRLSSHTL----------------------PAGG----- 621
Query: 680 TRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFEL 739
+ V++ V N G+ G V ++VK RP++ L GF V + + + L
Sbjct: 622 --QLTVSVDVTNTGKYNGDEVVQMYVKHLDSKVSRPLEALKGFDRVSIPVGQTRTVTLPL 679
Query: 740 SPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
+ L ++ TH V + I I
Sbjct: 680 ---------KASALAYWDKNTHAFRVEPDHVQIRI 705
>gi|423248809|ref|ZP_17229825.1| hypothetical protein HMPREF1066_00835 [Bacteroides fragilis
CL03T00C08]
gi|423253758|ref|ZP_17234689.1| hypothetical protein HMPREF1067_01333 [Bacteroides fragilis
CL03T12C07]
gi|392655387|gb|EIY49030.1| hypothetical protein HMPREF1067_01333 [Bacteroides fragilis
CL03T12C07]
gi|392657750|gb|EIY51381.1| hypothetical protein HMPREF1066_00835 [Bacteroides fragilis
CL03T00C08]
Length = 805
Score = 276 bits (706), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 223/717 (31%), Positives = 336/717 (46%), Gaps = 104/717 (14%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIP + E HG +G T FP I A++++ L ++G+ I +
Sbjct: 140 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRQMGRVIAI 187
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
EA A G IG + P +++ RDPRW R +ET GEDP + G + VRG QG+T N
Sbjct: 188 EASA---QGAHIG---YGPVLDLARDPRWSRVEETYGEDPYLNGVMGTALVRGFQGETLN 241
Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
GK A KHF +Y W A + ++L + PPF V G A
Sbjct: 242 DGK-----SVIATLKHFASY---GWTEGGHNGGTAHIGERELEEAIFPPFREAVGAG-AL 292
Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
+M +YN ++G P R LL+ + +W F G++ SD AV + + G A + +A +
Sbjct: 293 SVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVSDLYAVGGLRE-HGVAGNDYEAAI 351
Query: 321 DVLKAGMDVNCGS-FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
+ AG+D + G+ + AAVK+ + + ID+A+ + S++ ++GLF+ PF
Sbjct: 352 KAVNAGVDSDLGTNVYAEQLVAAVKRGDVAVATIDKAVRRILSLKFQMGLFD-----DPF 406
Query: 380 --GKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTL 437
K +V S H LA + A+ IVLLKN LLPL K +LA+IGPNA++ +
Sbjct: 407 VDEKQAVQLVASSEHTGLAREVARQSIVLLKNKDKLLPLKKDIR-TLAVIGPNADNVYNM 465
Query: 438 LGNYAGPSCRS--ITPLQALQNYVEN---TVYYPGCDTVACSSASIDKAVDIAKGADHVV 492
LG+Y P +T L ++ V +Y GC S A++ A+ AD VV
Sbjct: 466 LGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCAVRDSSRTGFKDAIETARNADAVV 525
Query: 493 LMMG----LDQTQEKEE-------------------LDRVDLVLPGRQQELITRVAEAAK 529
++MG D + E EE DR L L GRQ EL+ ++ K
Sbjct: 526 MVMGGSSARDFSSEYEETGAAKVTINQISDMESGEGYDRATLHLMGRQLELLEEISRLGK 585
Query: 530 KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWY 589
PV+LVL+ G P+ + A + +I+ A YPG G A+A+V+FGD+NP GRL ++
Sbjct: 586 -PVVLVLIKGRPLLMEGAIQEAE--AIVDAWYPGMQGGNAVADVLFGDYNPAGRLTLS-V 641
Query: 590 PQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLY 649
P+ ++P+ R GN R Y G +PFG GLSY+ +SY V +
Sbjct: 642 PRSVGQLPVYYNTRR----KGNRSR-YVEEPGTPRYPFGYGLSYTTFSYTDMKVQVTE-- 694
Query: 650 LNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPAR 709
G++ C + VT+ ++N G G L+ +
Sbjct: 695 ------------------------GSDDC---RVDVTVTIQNQGTADGDEVAQLYFQDDV 727
Query: 710 RGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
P KQL F + L A E E+ F L +SL+ ++G V+E G ++VG
Sbjct: 728 SSFTTPAKQLRAFSRIHLKAGESREVTFTLDK-KSLALYMQEGEWVVEPGRFTIMVG 783
>gi|317477144|ref|ZP_07936385.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
gi|316906687|gb|EFV28400.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
Length = 814
Score = 276 bits (706), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 220/717 (30%), Positives = 345/717 (48%), Gaps = 104/717 (14%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIP + E HG +G T FP I A++++ L R+G+AI
Sbjct: 149 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRRMGRAIAT 196
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
EA A G IG + P +++ RDPRW R +ET GED + G + V+G QG+
Sbjct: 197 EASA---QGAHIG---YGPVLDLARDPRWSRVEETYGEDAYLNGVMGAALVKGFQGEF-- 248
Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
+ KGK+ A+ KHF AY W A V +++ + PPF V G A
Sbjct: 249 -PRTKGKVIAT--LKHFAAY---GWTEGGHNGGSAHVGNREMEEAIYPPFREAVAAG-AL 301
Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
+M +YN ++GIP A+ NLL+ +++W F G++ SD A+ + + G A + +A V
Sbjct: 302 SVMSSYNEIDGIPCTANSNLLTGLLKKRWQFKGFVVSDLYAIGGLRE-HGVADTDYEAAV 360
Query: 321 DVLKAGMDVNCGSFLQK-HTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
+ AG+D + G+ + AVK+ + E I++A+ + +++ +GLF+ PF
Sbjct: 361 KAVNAGVDSDLGTNVYAGQLVNAVKRGDVQEVVINKAVSRILALKFHMGLFD-----HPF 415
Query: 380 --GKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTL 437
+ VV S H LA + A+ I+LLKN + LLPL K K ++A+IGPNA++ +
Sbjct: 416 VDEREPEQVVASTEHLELAREVARQSIILLKNKNELLPLNK-KMKTIAVIGPNADNIYNM 474
Query: 438 LGNYAGPSCRS--ITPLQALQNYVEN---TVYYPGCDTVACSSASIDKAVDIAKGADHVV 492
LG+Y P S +T L ++ V N +Y GC S + +A++ A+ +D VV
Sbjct: 475 LGDYTAPQSESSVVTVLDGIRQKVSNDTHIIYAKGCAVRDSSKSGFQEAIEAARQSDVVV 534
Query: 493 LMMG-----------------------LDQTQEKEELDRVDLVLPGRQQELITRVAEAAK 529
++MG + + E DR L L GRQ+ELI V +
Sbjct: 535 MVMGGSSARDFSSKYEETGAAKVSDSHISDMESGEGYDRSTLELLGRQRELIREVGK-LN 593
Query: 530 KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWY 589
KP++LVL+ G P + + + +I+ A YPG G A+A+V+FGD+NP GRL ++
Sbjct: 594 KPIVLVLIKGRP--LLLEGIEAEVDAIVDAWYPGMQGGNAVADVLFGDYNPAGRLTIS-V 650
Query: 590 PQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLY 649
P+ ++P+ R GN + Y EG +PFG GLSY+ ++Y
Sbjct: 651 PRSVGQLPVYYNTKR----KGNRSK-YIEEEGTPRYPFGYGLSYTSFNY----------- 694
Query: 650 LNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPAR 709
++VE +D C +++ V+N G G V L+++
Sbjct: 695 --SDLKAEVVEAEDS-------------CLVN---ISVKVRNEGSRDGDEVVQLYLRDEV 736
Query: 710 RGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
P KQL GFQ + L E EI F L +SL+ ++ +E G L++G
Sbjct: 737 ASFTTPFKQLCGFQRIHLKVGETKEITFRLDK-KSLALYMQNEEWAVEPGRFTLMLG 792
>gi|375357172|ref|YP_005109944.1| putative beta-glucosidase [Bacteroides fragilis 638R]
gi|301161853|emb|CBW21397.1| putative beta-glucosidase [Bacteroides fragilis 638R]
Length = 814
Score = 276 bits (706), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 223/717 (31%), Positives = 336/717 (46%), Gaps = 104/717 (14%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIP + E HG +G T FP I A++++ L ++G+ I +
Sbjct: 149 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRQMGRVIAI 196
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
EA A G IG + P +++ RDPRW R +ET GEDP + G + VRG QG+T N
Sbjct: 197 EASA---QGAHIG---YGPVLDLARDPRWSRVEETYGEDPYLNGVMGTALVRGFQGETLN 250
Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
GK A KHF +Y W A + ++L + PPF V G A
Sbjct: 251 DGK-----SVIATLKHFASY---GWTEGGHNGGTAHIGERELEEAIFPPFREAVGAG-AL 301
Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
+M +YN ++G P R LL+ + +W F G++ SD AV + + G A + +A +
Sbjct: 302 SVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVSDLYAVGGLRE-HGVAGNDYEAAI 360
Query: 321 DVLKAGMDVNCGS-FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
+ AG+D + G+ + AAVK+ + + ID+A+ + S++ ++GLF+ PF
Sbjct: 361 KAVNAGVDSDLGTNVYAEQLVAAVKRGDVAVATIDKAVRRILSLKFQMGLFD-----DPF 415
Query: 380 --GKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTL 437
K +V S H LA + A+ IVLLKN LLPL K +LA+IGPNA++ +
Sbjct: 416 VDEKQAVQLVASSEHTGLAREVARQSIVLLKNKDKLLPLKKDIR-TLAVIGPNADNVYNM 474
Query: 438 LGNYAGPSCRS--ITPLQALQNYVEN---TVYYPGCDTVACSSASIDKAVDIAKGADHVV 492
LG+Y P +T L ++ V +Y GC S A++ A+ AD VV
Sbjct: 475 LGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCAVRDSSRTGFKDAIETARNADAVV 534
Query: 493 LMMG----LDQTQEKEE-------------------LDRVDLVLPGRQQELITRVAEAAK 529
++MG D + E EE DR L L GRQ EL+ ++
Sbjct: 535 MVMGGSSARDFSSEYEETGAAKVTINQISDMESGEGYDRATLHLMGRQLELLEEISRLG- 593
Query: 530 KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWY 589
KPV+LVL+ G P+ + A + +I+ A YPG G A+A+V+FGD+NP GRL ++
Sbjct: 594 KPVVLVLIKGRPLLMEGAIQEAE--AIVDAWYPGMQGGNAVADVLFGDYNPAGRLTLS-V 650
Query: 590 PQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLY 649
P+ ++P+ R GN R Y G +PFG GLSY+ +SY V +
Sbjct: 651 PRSVGQLPVYYNTRR----KGNRSR-YIEEPGTPRYPFGYGLSYTTFSYTDMKVQVTE-- 703
Query: 650 LNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPAR 709
G++ C + VT+ ++N G G L+ +
Sbjct: 704 ------------------------GSDDC---RVDVTVTIQNQGTADGDEVAQLYFRDDV 736
Query: 710 RGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
P KQL F + L A E E+ F L +SL+ ++G V+E G ++VG
Sbjct: 737 SSFTTPAKQLRAFSRIHLKAGESREVTFTLDK-KSLALYMQEGEWVVEPGRFTIMVG 792
>gi|330996729|ref|ZP_08320604.1| glycosyl hydrolase family 3 protein [Paraprevotella xylaniphila YIT
11841]
gi|329572574|gb|EGG54217.1| glycosyl hydrolase family 3 protein [Paraprevotella xylaniphila YIT
11841]
Length = 852
Score = 276 bits (706), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 172/436 (39%), Positives = 240/436 (55%), Gaps = 36/436 (8%)
Query: 36 SNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALH 95
++ TE F K P +R DL+SRLT++EKIS LVN APAI RLGI Y +EALH
Sbjct: 22 TDGGTELFRDMKA--PQHERIMDLLSRLTVEEKISLLVNDAPAIGRLGIDKYNHGNEALH 79
Query: 96 GVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN-------- 147
GV V G F T FPQ I AA ++ L YRI AI EAR +
Sbjct: 80 GV--VRPGDF--------TVFPQAIGMAAMWNPELLYRISSAISDEARGRWKELEYGKKQ 129
Query: 148 -AGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKG 206
AG + +TFW+P +N+ RDPRWGR ET GEDP ++G V++V+G+QG+
Sbjct: 130 IAGASDLLTFWSPTVNMARDPRWGRTPETYGEDPYLSGVLGVAFVKGLQGNH------PR 183
Query: 207 KLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAY 266
L+ + KHF N + R +A+V+ +DL + Y P FE C+ +G+A IM AY
Sbjct: 184 YLKTVSTPKHFAV----NNEEHNRSSCNAKVSERDLREYYLPSFERCITEGKAQSIMMAY 239
Query: 267 NRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAG 326
N VN +P + L+ R WGF+GYI SDC A + Y K+ E A +KAG
Sbjct: 240 NAVNDVPCTVNTYLIKNVLRGDWGFNGYIVSDCSAPEWMITKHHYVKTREAAATLAVKAG 299
Query: 327 MDVNCGS-FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD 385
+D+ CG+ + A +Q + E++ID A + + RM LGLF+ +P+ P+ +I
Sbjct: 300 LDLECGNQVYGEGLLKAYRQYMVSEADIDSAAYRILRGRMMLGLFD-DPSQNPYNQIEPS 358
Query: 386 VVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-P 444
VV AHQ LAL+AA+ +VLLKN LPL K S+A++G +A + G+Y+G P
Sbjct: 359 VVGCKAHQDLALEAARQSMVLLKNKDNFLPLNPQKVKSIAVVGISAGHCE--FGDYSGTP 416
Query: 445 SCRSITPLQALQNYVE 460
+T L ++ Y E
Sbjct: 417 KNEPVTILDGIKQYAE 432
Score = 121 bits (303), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 86/283 (30%), Positives = 137/283 (48%), Gaps = 44/283 (15%)
Query: 484 IAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVD 543
+A D V ++G++++ E+E DR L LP QQE I + + V+ VL+ G +
Sbjct: 600 VAAECDVTVAVLGINKSIEREGQDRFTLELPIDQQEFIKELYKVNPNTVV-VLVAGSSLA 658
Query: 544 ITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKM 603
+ + D N+ +IL A YPGE G A+AEV+FGD+NPGGRLP+T+Y ++P D
Sbjct: 659 VNWM--DENVPAILNAWYPGEQGGNAVAEVLFGDYNPGGRLPLTYY-NSLDEIPAFD--- 712
Query: 604 RPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQD 663
GRTY+++EG+ ++ FG GLSY+K+ YK K VS + D
Sbjct: 713 ----NYSVKGRTYQYFEGQPLYEFGYGLSYTKFRYKSKGVSVAR---------------D 753
Query: 664 VVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQ 723
V V+ V N G+ G ++VK G P+KQL GF+
Sbjct: 754 TVK------------------VSFEVSNTGKYDGDEVAQVYVKYPETGTYMPLKQLHGFK 795
Query: 724 SVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
V + + +++ + + ++ V +G + +VG
Sbjct: 796 RVHIKKGKTSKVTVGVPKKDLRYWDEQERKFVTPKGEYTFMVG 838
>gi|336399370|ref|ZP_08580170.1| Beta-glucosidase [Prevotella multisaccharivorax DSM 17128]
gi|336069106|gb|EGN57740.1| Beta-glucosidase [Prevotella multisaccharivorax DSM 17128]
Length = 862
Score = 276 bits (706), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 171/471 (36%), Positives = 244/471 (51%), Gaps = 46/471 (9%)
Query: 12 LLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQ 71
LL C + V++ Q P+ DP L RA+DL SRLTL+EK S
Sbjct: 6 LLTGCLLMCVVGVNAQQSPYQ-DPG-------------LSFEARAKDLCSRLTLEEKASL 51
Query: 72 LVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLW 131
+ + +PAIPRLGI + WWSEALHG A NG + T FP+ I AASF+ +
Sbjct: 52 MCDVSPAIPRLGIKPFNWWSEALHGYAN-------NGDV---TVFPEPIGMAASFNPTMV 101
Query: 132 YRIGQAIGLEARALYNAGQAIG--------MTFWAPNINIFRDPRWGRGQETPGEDPLVT 183
Y++ A EAR YN A G ++ W PN+NIFRDPRWGRGQET GEDP +T
Sbjct: 102 YQVFTATSDEARGKYNQSMAEGKEDTRFHSLSVWTPNVNIFRDPRWGRGQETYGEDPYLT 161
Query: 184 GKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFD-ARVTMQDL 242
V V+G+QG + + AC KHF + + TR+ + A ++ +DL
Sbjct: 162 SVMGVEVVKGLQGPESTKYR-----KLYACAKHFAVHSGPEY---TRHTANLADISPRDL 213
Query: 243 ADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAV 302
+TY P F++ V+Q +MCAY R++ P C + LL + R +WGF + SDC A+
Sbjct: 214 WETYLPAFKATVQQAGVREVMCAYQRLDDEPCCGNSRLLQQILRDEWGFRHMVVSDCGAI 273
Query: 303 SIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTKAAVKQKKLPESEIDRALHNLF 361
+ Y + A AG DV CG + AV++ + E+E+D+ + L
Sbjct: 274 ADFYTNHHVSSDAVHAAAKGTLAGTDVECGFGYAYMKLPEAVRRGLVSEAEVDKHVIRLL 333
Query: 362 SVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKS 421
R LG+ + +P + + KI VV S AH+ LAL A+ + LL+N + +LPL K +
Sbjct: 334 KGRFELGVMD-DPKLVSWTKISPKVVDSDAHRQLALNMARQTMTLLQNRNNVLPLAKGEK 392
Query: 422 VSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV-YYPGCDTV 471
+ A++GPNA L GNY G R+ T L+ ++ + Y GCD V
Sbjct: 393 I--AVVGPNAADGPMLWGNYNGTPSRTTTILEGIRAKAGKDIPYLQGCDLV 441
Score = 95.5 bits (236), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 77/270 (28%), Positives = 124/270 (45%), Gaps = 52/270 (19%)
Query: 481 AVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKK 530
A+ +G VV + G+ E EE+ DR + LP Q++ + + +AA K
Sbjct: 593 AIRQLRGVRTVVFVGGISSKLEGEEMPVHVEGFKGGDRTSIELPAVQRDFL-KALKAAGK 651
Query: 531 PVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYP 590
V+ V G + +T + +IL A Y GE G A+A+V++GD+NPGG+LP+T+Y
Sbjct: 652 TVVFVNCSGSAIALT--PEVESCDAILQAWYAGEEGGRAVADVLYGDYNPGGKLPVTFYR 709
Query: 591 QDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYL 650
D M+ GRTYR++ +FPFG GLSY+++ A+ + L
Sbjct: 710 STTQLPAFDDYSMK--------GRTYRYFS-DALFPFGYGLSYTRF-----AIGKGSL-- 753
Query: 651 NQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARR 710
S P + + K +T+ V N G+ G V ++V+
Sbjct: 754 ------------------SAPAMKAD----GKVTLTVPVSNVGKRTGDEVVQVYVRDVND 791
Query: 711 GNGRPIKQLVGFQSVILNAKEKAEIVFELS 740
+G P+K L F+ V L A E ++ L+
Sbjct: 792 ADG-PLKSLKAFRRVSLKAGESRKVTIPLT 820
>gi|329954674|ref|ZP_08295734.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
gi|328527215|gb|EGF54219.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
Length = 855
Score = 276 bits (705), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 236/796 (29%), Positives = 345/796 (43%), Gaps = 145/796 (18%)
Query: 41 ETFPFCKTTLPISQRARDLVSRLTLDEKISQL---------------------------- 72
+ P+ + LP+ R DL+ R+TLDEKI+Q+
Sbjct: 23 QQLPYQQAALPVEVRVDDLLQRMTLDEKIAQIRHVHSWDIFNGQELDENKLAKVVGDKCW 82
Query: 73 --VNSAPAI------------------PRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRG 112
V P RL IP + +EALHG A G
Sbjct: 83 GFVEGFPLTGENCRKNMRKIQEYMLNHTRLRIPCFNV-AEALHGSAH-----------EG 130
Query: 113 ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGR 171
+T +PQ I ++F+ L YR + I E A GM AP +++ RD RWGR
Sbjct: 131 STIYPQNIALGSTFNPDLAYRKAKMIAGEL-------HAQGMKQILAPCVDVVRDLRWGR 183
Query: 172 GQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRY 231
+E+ GEDP + G + ++ V+G Q S KHF + N G
Sbjct: 184 VEESYGEDPFLCGLFGIAEVKGYQEQGI-----------SPMLKHFGPHG--NPLGGLNL 230
Query: 232 KFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGF 291
++DL D Y PFE +K +M YN N IP+ A LL+ R+QWGF
Sbjct: 231 A-SVECGVRDLHDIYLKPFEMIIKSQPLKAVMSTYNSWNRIPNSASHYLLTDVLRKQWGF 289
Query: 292 HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPES 351
GY+ SD A+ ++ A SPE+ + L AG+DV S K AVK+ +LP S
Sbjct: 290 TGYVYSDWGAIEMLQTFHHTANSPEECALQALMAGLDVEASSECYPALKQAVKEGRLPVS 349
Query: 352 EIDRALHNLFSVRMRLGLFNGNPTMQPFG-KIGADVVCSPAHQVLALQAAQDGIVLLKNS 410
ID A+ + + + GLF PFG K A V S LA + A + +VLLKN
Sbjct: 350 YIDEAVRRVLTAKFETGLFE-----DPFGEKYRAGKVHSEESVRLAREIADESVVLLKNE 404
Query: 411 HGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV---YYPG 467
LLPL + + ++A+IGPNA+ + ++ + +TPLQ ++ V N V Y G
Sbjct: 405 GKLLPLDEKRLDAVAVIGPNADQVQFGDYTWSRSNKDGVTPLQGIRRLVGNKVKVHYAKG 464
Query: 468 CDTVACSSASIDKAVDIAKGADHVVLMMG---------LDQTQEKEELDRVDLVLPGRQQ 518
CD ++ ++ I KAV+ A+ +D +L +G T E D DL L G Q
Sbjct: 465 CDMMSPDTSLIAKAVEAARKSDVAILFVGSSSASLARDYSNTNCGEGFDLNDLTLAGAQS 524
Query: 519 ELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDH 578
LI V A PV+LVL+ G P +++ K +I +IL Y GE ++A+++FG
Sbjct: 525 GLIKAVY-ATGTPVVLVLVSGKPFVLSWEK--EHIPAILAQWYAGEQEGNSIADILFGRV 581
Query: 579 NPGGRLPMTWYPQD-------YIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGL 631
NP G L + +PQ Y +P + + PGR Y F ++ FG GL
Sbjct: 582 NPSGHLTFS-FPQSTGHLPVYYNHLPSDKGFYKKPGSYSEPGRDYVFSSPAALWSFGHGL 640
Query: 632 SYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKN 691
SY+ + ++ S D ++ K C R N
Sbjct: 641 SYTDFVFEGIDTSL---------------RNDSIYVK---------CTLR---------N 667
Query: 692 HGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRARED 751
G +GK V L+V+ P++QL F+ V L A E +E+ L P L+ + D
Sbjct: 668 VGNRSGKEVVQLYVRDLVSSVVMPVQQLKAFEKVGLKAGESSEVQLSL-PVSELAISDND 726
Query: 752 GLMVIEEGTHFLVVGD 767
G VIE G +G+
Sbjct: 727 GKRVIEPGAFEFQLGN 742
>gi|224536538|ref|ZP_03677077.1| hypothetical protein BACCELL_01413 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521794|gb|EEF90899.1| hypothetical protein BACCELL_01413 [Bacteroides cellulosilyticus
DSM 14838]
Length = 863
Score = 276 bits (705), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 166/437 (37%), Positives = 233/437 (53%), Gaps = 34/437 (7%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
+ +L +RA LV LTL+EK +++ + ++ RLGI Y WW+EALHGVA G
Sbjct: 23 YKDASLSPERRAELLVKELTLEEKAHLMMDGSRSVERLGIKPYNWWNEALHGVARAGL-- 80
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA--LYNAGQAI-----GMTFW 157
AT FPQ I AASF+ + Y + A+ EARA Y A Q G+T W
Sbjct: 81 --------ATVFPQPIGMAASFNPEMVYEVFNAVSDEARAKNTYYASQDSRERYQGLTMW 132
Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
P +NI+RDPRWGRG ET GEDP +T + V V+G+QG GK + AC KHF
Sbjct: 133 TPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGPA--DGKYD---KLHACAKHF 187
Query: 218 TAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
+ W R+ F+A + +DL +TY PPFE+ VK+G+ +MCAYNR G P C
Sbjct: 188 AVHSGPEW---NRHSFNAENIKPRDLYETYLPPFEALVKEGKVEEVMCAYNRFEGDPCCG 244
Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDVNCGSF 334
LL + R +WGF G + SDC A++ Y+ G+ P E A + +G D+ CGS
Sbjct: 245 SDRLLMQILRGEWGFDGIVVSDCGAIADFYNDRGHHTHPDAESASAAAVISGTDLECGSS 304
Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
K +VK+ + E +D ++ L R LG + P + KI VV S AH
Sbjct: 305 Y-KALIESVKKGLISEETVDTSVKRLMKARFALGEMD-EPEKVSWTKIPFSVVASAAHDS 362
Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
LAL A++ + LL N LPL K +++A++GPNAN + GNY G ++T L
Sbjct: 363 LALNMARESMTLLMNKDNFLPL-KRGGLTVAVMGPNANDSVMQWGNYNGMPAHTVTILDG 421
Query: 455 LQNYV---ENTVYYPGC 468
++N + + +Y GC
Sbjct: 422 VRNLLGTDDKLIYEQGC 438
Score = 115 bits (289), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 88/277 (31%), Positives = 132/277 (47%), Gaps = 53/277 (19%)
Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
I K+V+ K AD V+ G+ + E EE+ DR D+ LP Q+ELI + A
Sbjct: 590 IRKSVERVKDADIVIFASGISPSLEGEEMGVNLPGFKKGDRTDIELPAVQRELIDALHRA 649
Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMT 587
KK +ILV G P I + +IL A YPG+ G A+AEV+FGD+NP G+LP+T
Sbjct: 650 GKK-IILVNCSGSP--IGLEPETQKCEAILQAWYPGQQGGKAVAEVLFGDYNPAGKLPVT 706
Query: 588 WYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQN 646
+Y ++ ++P D M GRTYR+ + +FPFG GLSY+ + Y + +N
Sbjct: 707 FY-RNVSQLPDFEDYNM--------TGRTYRYMQDVPLFPFGYGLSYTTFGYGKTVLDKN 757
Query: 647 KLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVK 706
+L QS +T+ V N G+ G+ V ++++
Sbjct: 758 ELTAGQS-----------------------------LKLTVPVTNTGKRNGEEVVQVYLR 788
Query: 707 PARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCE 743
G PIK L F+ V + A + + F+L E
Sbjct: 789 KQGDAEG-PIKTLRAFKRVSIPAGKTVNVEFDLKDKE 824
>gi|340347926|ref|ZP_08671025.1| beta-glucosidase [Prevotella dentalis DSM 3688]
gi|433652779|ref|YP_007296633.1| beta-glucosidase-like glycosyl hydrolase [Prevotella dentalis DSM
3688]
gi|339608382|gb|EGQ13286.1| beta-glucosidase [Prevotella dentalis DSM 3688]
gi|433303312|gb|AGB29127.1| beta-glucosidase-like glycosyl hydrolase [Prevotella dentalis DSM
3688]
Length = 849
Score = 276 bits (705), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 166/427 (38%), Positives = 241/427 (56%), Gaps = 34/427 (7%)
Query: 53 SQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRG 112
+++A L +LTL EKIS L++ +PAIP+L IPA++WW+EALHGVA G
Sbjct: 26 NRQAEALCRQLTLSEKISLLLDDSPAIPQLNIPAFQWWNEALHGVARNGV---------- 75
Query: 113 ATSFPQVILTAASFDSYLWYRIGQAIG-------LEARALYNAGQAIGMTFWAPNINIFR 165
AT FPQ I AA+FD L ++ A+ A+AL + + G++FW PNINIFR
Sbjct: 76 ATVFPQTIGMAAAFDDELLQQVFTAVSDEARAKAERAKALGHVRRYQGLSFWTPNINIFR 135
Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASACCKHFTAYDLDN 224
DPRWGRGQET GEDP +TG+ ++ VRG+QG DT + AC KHF + +
Sbjct: 136 DPRWGRGQETYGEDPYLTGRMGLAVVRGLQGPDT------ASYYKTLACAKHFAVH---S 186
Query: 225 WKGTTRYKFD-ARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
+ R++ + ++ +DL +TY P F+S V +G+ + +MCAY R +G P C + LL
Sbjct: 187 GPESLRHEMNIENLSPRDLWETYLPAFKSLVTKGQVAEVMCAYQRFDGQPCCGNSRLLQH 246
Query: 284 TARRQWGFHGYITSDCDAVSIIY--DAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA 341
R +W F G + SDC A+S + G AK +A +AG DV CGS +
Sbjct: 247 ILRDEWNFKGLVVSDCGAISDFWIPGRHGVAKDAVEASAQAQRAGTDVECGSNYHS-LEE 305
Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQ 401
AVK L ES+IDR++ + R+ LG + + + P+ I VV PAH+ LAL+ A+
Sbjct: 306 AVKAGVLKESDIDRSVIRVLEARLALGDISPSADV-PWKTIPYAVVDCPAHRQLALRMAR 364
Query: 402 DGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN 461
+ +VLL+N H LPL S + ++G NA + L NY G ++T L ++ E+
Sbjct: 365 ESMVLLQNKHHTLPL--STHDKILVVGENAVDSMMLWANYNGTPSHTMTILDGIRQLAEH 422
Query: 462 TVYYPGC 468
+ PGC
Sbjct: 423 VEFMPGC 429
Score = 120 bits (300), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 89/281 (31%), Positives = 125/281 (44%), Gaps = 59/281 (20%)
Query: 469 DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQ 518
D V + + ++ A+ AD VV + GL E EE+ DR + LP Q+
Sbjct: 574 DVVHKQPFDLHQLIEKARQADAVVFVGGLSPRLEGEEMKVDLPGFSGGDRTSIELPVAQR 633
Query: 519 ELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDH 578
E I ++ A+KP+++V G I +N +IL A YPGE G A+A+V+FG
Sbjct: 634 EAIQALS-VARKPIVMVNCSGSA--IALEPETKNCDAILQAWYPGEEGGRAVADVLFGKV 690
Query: 579 NPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSY 638
NP G+LP+T+Y D P D MR GRTYR++ GK +FPFG GLSY+ +
Sbjct: 691 NPSGKLPVTFYRHDDQLPPFNDYNMR--------GRTYRYFTGKPLFPFGYGLSYTTF-- 740
Query: 639 KFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGK 698
+G C+ R V V N G G
Sbjct: 741 ---------------------------------RIGQPVCKGRTLKVK--VSNTGLREGT 765
Query: 699 HPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFEL 739
V +++K +G P+K L FQ V L A E E+ F L
Sbjct: 766 ETVQVYIKRNADSDG-PVKSLCDFQKVSLKAGESKEVTFTL 805
>gi|381200965|ref|ZP_09908097.1| beta-glucosidase [Sphingobium yanoikuyae XLDN2-5]
Length = 774
Score = 276 bits (705), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 226/715 (31%), Positives = 343/715 (47%), Gaps = 102/715 (14%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIP + E LHG A VG ATSFPQ I A+S+D + ++ Q I
Sbjct: 121 RLGIPIL-FHEEGLHGYAAVG-----------ATSFPQSIAMASSWDPAMLRQVNQVIAR 168
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
E RA + + M +P ++I RDPRWGR +ET GEDP + G+ V+ V G+QG
Sbjct: 169 EIRA-----RGVPMVL-SPVVDIARDPRWGRIEETYGEDPYLVGEMGVAAVEGLQGV--- 219
Query: 201 GGKLKGKLQAS---ACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQG 257
G+ + LQ++ A KH T + GT A V+ ++L + + PPFE VK+
Sbjct: 220 -GRSR-TLQSNHVFATLKHLTGHGQPE-SGTNIGP--APVSERELRENFFPPFEQVVKRT 274
Query: 258 RASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPED 317
+M +YN ++G+PS A+R LL R +WGF G + SD AV + A + E+
Sbjct: 275 GIEAVMASYNEIDGVPSHANRWLLENILREEWGFRGAVVSDYSAVDQLMSIHHIAANLEE 334
Query: 318 AVVDVLKAGMDVNCGSFLQKHTKAA-VKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTM 376
A + L AG+D + L T V++ K+ E+++D A+ + ++ R GLF NP
Sbjct: 335 AAMRALDAGVDADLPEGLSYATLGKLVREGKVSEAKVDLAVRRMLELKFRAGLFE-NPYA 393
Query: 377 QPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKT 436
A + + + LA AAQ I LLKN G+LPL ++ A+IGP+A A+
Sbjct: 394 D--ANAAAAITNNEDARALARTAAQRSITLLKND-GMLPLKPEGTI--AVIGPSAAVAR- 447
Query: 437 LLGNYAGPSCRSITPLQALQNYV---ENTVYYPG----------CDTVACSSAS-----I 478
LG Y G S++ L+ ++ V N V+ G D+V S + I
Sbjct: 448 -LGGYYGQPPHSVSILEGIKARVGTKANIVFAQGVKITEDDDWWADSVTKSDPAENRKLI 506
Query: 479 DKAVDIAKGADHVVLMMGLDQTQEKEEL------DRVDLVLPGRQQELITRVAEAAKKPV 532
+AV+ A+ D ++L +G + +E DR L L QQEL + +A KP+
Sbjct: 507 AQAVEAARNVDRIILTLGDTEQSSREGWADNHLGDRPSLDLVSEQQELFDAL-KALGKPI 565
Query: 533 ILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQD 592
+VL+ G P + K +IL Y GE G A+A+++FGD NPGG+LP+T P+
Sbjct: 566 TVVLINGRPA--STVKVSEQANAILEGWYLGEQGGNAVADILFGDVNPGGKLPVT-VPRS 622
Query: 593 YIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQ 652
++P+ M+P A G Y F ++PFG GLSY+ +S +S K+
Sbjct: 623 AGQLPLF-YNMKPSARRG-----YLFDTTDPLYPFGFGLSYTSFSLSAPRLSATKIGTGG 676
Query: 653 SSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGN 712
+S V++ V+N G G V L+++
Sbjct: 677 KTS-----------------------------VSVDVRNTGAREGDEVVQLYIRDKVSSV 707
Query: 713 GRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGD 767
RP+K+L GFQ V L E + F + P E+L + V+E G ++ G+
Sbjct: 708 TRPVKELKGFQRVTLKPGESRTVTFTVGP-EALQMWNDQMHRVVEPGDFEIMTGN 761
>gi|332881172|ref|ZP_08448831.1| glycosyl hydrolase family 3 protein [Capnocytophaga sp. oral taxon
329 str. F0087]
gi|357047867|ref|ZP_09109460.1| glycosyl hydrolase family 3 protein [Paraprevotella clara YIT
11840]
gi|332680886|gb|EGJ53824.1| glycosyl hydrolase family 3 protein [Capnocytophaga sp. oral taxon
329 str. F0087]
gi|355529206|gb|EHG98645.1| glycosyl hydrolase family 3 protein [Paraprevotella clara YIT
11840]
Length = 851
Score = 275 bits (704), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 169/427 (39%), Positives = 233/427 (54%), Gaps = 34/427 (7%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
F P +R DL+SRLT++EKIS LVN APAI RLGI Y +EALHGV V G
Sbjct: 29 FRDMKAPQHERIMDLLSRLTVEEKISLLVNDAPAIGRLGIDKYNHGNEALHGV--VRPGD 86
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN---------AGQAIGMT 155
F T FPQ I AA ++ L YRI AI EAR + AG + +T
Sbjct: 87 F--------TVFPQAIGMAAMWNPELLYRISSAISDEARGRWKELEYGKKQIAGASDLLT 138
Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCK 215
FW+P +N+ RDPRWGR ET GEDP ++G V++V+G+QGD L+ + K
Sbjct: 139 FWSPTVNMARDPRWGRTPETYGEDPYLSGVLGVAFVKGLQGDH------PRYLKTVSTPK 192
Query: 216 HFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSC 275
HF N + R +A+V+ +DL + Y P FE C+ +G+A IM AYN VN +P
Sbjct: 193 HFAV----NNEEHNRSSCNAKVSERDLREYYLPSFERCITEGKAQSIMMAYNAVNDVPCT 248
Query: 276 ADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGS-F 334
+ L+ R WGF+GYI SDC A + Y K+ E A +K G+D+ CG+
Sbjct: 249 VNTYLIKNVLRGDWGFNGYIVSDCSAPEWMITKHHYVKTREAAATLAVKVGLDLECGNQV 308
Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
+ A +Q + E++ID A + + RM LGLF+ P+ P+ +I VV AHQ
Sbjct: 309 YGEGLLKAYRQYMVSEADIDSAAYRILRGRMMLGLFDA-PSQNPYNQIEPSVVGCKAHQD 367
Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQ 453
LAL+AA+ +VLLKN LPL K S+A++G +A + G+Y+G P +T L
Sbjct: 368 LALEAARQSMVLLKNKDNFLPLNPKKVKSIAVVGISAGHCE--FGDYSGTPKNEPVTILD 425
Query: 454 ALQNYVE 460
++ Y E
Sbjct: 426 GIKQYAE 432
Score = 122 bits (307), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 89/291 (30%), Positives = 141/291 (48%), Gaps = 44/291 (15%)
Query: 476 ASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILV 535
A A +A D V ++G++++ E+E DR L LP QQE I + + V+ V
Sbjct: 591 ARFGDAGKVAAECDVTVAVLGINKSIEREGQDRFSLELPVDQQEFIKELYKVNPNTVV-V 649
Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK 595
L+ G + + + D N+ +IL A YPGE G A+AEV+FGD+NPGGRLP+T+Y +
Sbjct: 650 LVAGSSMAVNWM--DENVPAILNAWYPGEQGGNAVAEVLFGDYNPGGRLPLTYY-NSLDE 706
Query: 596 VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSS 655
+P D GRTY+++EG+ ++ FG GLSY+K+ YK K V+
Sbjct: 707 IPAFD-------NYSVKGRTYQYFEGQPLYEFGYGLSYTKFRYKSKGVN----------- 748
Query: 656 TKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRP 715
VE QD V V+ V N G+ G ++VK G P
Sbjct: 749 ---VE-QDTVK------------------VSFEVSNTGKYDGDEVAQVYVKYPETGTYMP 786
Query: 716 IKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
+KQL GF+ V + + +++ + + ++ + +G + +VG
Sbjct: 787 LKQLHGFKRVHIKKGKTSKVTIGVPRKDLRYWYEQERKFITPKGEYTFMVG 837
>gi|423223593|ref|ZP_17210062.1| hypothetical protein HMPREF1062_02248 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392638218|gb|EIY32065.1| hypothetical protein HMPREF1062_02248 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 863
Score = 275 bits (704), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 166/437 (37%), Positives = 233/437 (53%), Gaps = 34/437 (7%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
+ +L +RA LV LTL+EK +++ + ++ RLGI Y WW+EALHGVA G
Sbjct: 23 YKDASLSPERRAELLVKELTLEEKAHLMMDGSRSVERLGIKPYNWWNEALHGVARAGL-- 80
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA--LYNAGQAI-----GMTFW 157
AT FPQ I AASF+ + Y + A+ EARA Y A Q G+T W
Sbjct: 81 --------ATVFPQPIGMAASFNPEMVYEVFNAVSDEARAKNTYYASQDSRERYQGLTMW 132
Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
P +NI+RDPRWGRG ET GEDP +T + V V+G+QG GK + AC KHF
Sbjct: 133 TPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGPA--DGKYD---KLHACAKHF 187
Query: 218 TAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
+ W R+ F+A + +DL +TY PPFE+ VK+G+ +MCAYNR G P C
Sbjct: 188 AVHSGPEW---NRHSFNAENIKPRDLYETYLPPFEALVKEGKVEEVMCAYNRFEGDPCCG 244
Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDVNCGSF 334
LL + R +WGF G + SDC A++ Y+ G+ P E A + +G D+ CGS
Sbjct: 245 SDRLLMQILRGEWGFDGIVVSDCGAIADFYNDRGHHTHPDAESASAAAVISGTDLECGSS 304
Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
K +VK+ + E +D ++ L R LG + P + KI VV S AH
Sbjct: 305 Y-KALIESVKKGLISEETVDTSVKRLMKARFALGEMD-EPEKVSWTKIPFSVVASAAHDS 362
Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
LAL A++ + LL N LPL K +++A++GPNAN + GNY G ++T L
Sbjct: 363 LALNMARESMTLLMNKDNFLPL-KRGGLTVAVMGPNANDSVMQWGNYNGMPAHTVTILDG 421
Query: 455 LQNYV---ENTVYYPGC 468
++N + + +Y GC
Sbjct: 422 VRNLLGTDDKLIYEQGC 438
Score = 115 bits (288), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 88/277 (31%), Positives = 132/277 (47%), Gaps = 53/277 (19%)
Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
I K+V+ K AD V+ G+ + E EE+ DR D+ LP Q+ELI + A
Sbjct: 590 IRKSVERVKDADIVIFASGISPSLEGEEMGVNLPGFKKGDRTDIELPAVQRELIDALHRA 649
Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMT 587
KK +ILV G P I + +IL A YPG+ G A+AEV+FGD+NP G+LP+T
Sbjct: 650 GKK-IILVNCSGSP--IGLEPETQKCEAILQAWYPGQQGGKAVAEVLFGDYNPAGKLPVT 706
Query: 588 WYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQN 646
+Y ++ ++P D M GRTYR+ + +FPFG GLSY+ + Y + +N
Sbjct: 707 FY-RNVSQLPDFEDYNM--------TGRTYRYMQDVPLFPFGYGLSYTTFGYGKTVLDKN 757
Query: 647 KLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVK 706
+L QS +T+ V N G+ G+ V ++++
Sbjct: 758 ELTAGQS-----------------------------LKLTVPVTNTGKRNGEEVVQVYLR 788
Query: 707 PARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCE 743
G PIK L F+ V + A + + F+L E
Sbjct: 789 KQGDAEG-PIKTLRAFKRVSIPAGKTVNVEFDLKDKE 824
>gi|373461705|ref|ZP_09553443.1| hypothetical protein HMPREF9944_01707 [Prevotella maculosa OT 289]
gi|371951597|gb|EHO69442.1| hypothetical protein HMPREF9944_01707 [Prevotella maculosa OT 289]
Length = 662
Score = 275 bits (703), Expect = 7e-71, Method: Compositional matrix adjust.
Identities = 167/425 (39%), Positives = 237/425 (55%), Gaps = 34/425 (8%)
Query: 55 RARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGAT 114
+A L +LTL EK+S L++ +PAIPRL IPA++WW+EALHGVA NG AT
Sbjct: 19 QAEALCRQLTLSEKVSLLLDDSPAIPRLNIPAFQWWNEALHGVAR-------NGV---AT 68
Query: 115 SFPQVILTAASFDSYLWYRIGQAIG-------LEARALYNAGQAIGMTFWAPNINIFRDP 167
FPQ I AA+FD L ++ + A+AL + + G++FW PNINIFRDP
Sbjct: 69 VFPQTIGMAAAFDDELLQQVFTVVSDEARAKAERAKALGHVRRYQGLSFWTPNINIFRDP 128
Query: 168 RWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
RWGRGQET GEDP +TG+ ++ VRG+QG DT + AC KHF + +
Sbjct: 129 RWGRGQETYGEDPYLTGRMGLAVVRGLQGPDT------ACYYKTLACAKHFAVH---SGP 179
Query: 227 GTTRYKFD-ARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
+ R++ D ++ +DL +TY P F+S V +G+ + +MCAY R +G P C + LL
Sbjct: 180 ESLRHEMDIENLSPRDLWETYLPAFKSLVTKGQVAEVMCAYQRFDGQPCCGNSRLLQHIL 239
Query: 286 RRQWGFHGYITSDCDAVSIIY--DAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAV 343
R +W F G + SDC A+S + G AK +A AG DV CGS + AV
Sbjct: 240 RDEWNFKGLVVSDCGAISDFWIPGRHGVAKDAVEASAQAQSAGTDVECGSNYHS-LEEAV 298
Query: 344 KQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDG 403
K L ES+IDR++ + R+ LG + + + P+ I VV PAH+ LAL+ A +
Sbjct: 299 KAGVLKESDIDRSVIRVLEARLALGDISPSAVV-PWKTIPYAVVDCPAHRQLALRMAHES 357
Query: 404 IVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV 463
+VLL+N H LPL S + ++G NA + L NY G ++T L ++ E+
Sbjct: 358 MVLLQNRHHTLPL--STHDKILVVGENAVDSMMLWANYNGIPSHTVTVLDGIRQLAEHVE 415
Query: 464 YYPGC 468
+ PGC
Sbjct: 416 FMPGC 420
>gi|329922637|ref|ZP_08278189.1| glycosyl hydrolase family 3 N-terminal domain protein
[Paenibacillus sp. HGF5]
gi|328941979|gb|EGG38262.1| glycosyl hydrolase family 3 N-terminal domain protein
[Paenibacillus sp. HGF5]
Length = 765
Score = 275 bits (703), Expect = 7e-71, Method: Compositional matrix adjust.
Identities = 216/733 (29%), Positives = 346/733 (47%), Gaps = 105/733 (14%)
Query: 67 EKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASF 126
E ++ + A RLGIP E HG +G T FP + +++
Sbjct: 89 EAVNHIQRYAIEQSRLGIPIL-IGEECSHGHMAIG-----------GTVFPVPLSIGSTW 136
Query: 127 DSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKY 186
+ L+ + +A+ LE R+ G ++P +++ RDPRWGR +E GEDP + +Y
Sbjct: 137 NLDLYRDMCRAVALETRS------QGGAVTYSPVLDVVRDPRWGRTEECFGEDPYLISEY 190
Query: 187 AVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTY 246
AV+ V G+QG++ L +A KHF Y R + ++L +
Sbjct: 191 AVASVEGLQGES-----LDSPSSVAATLKHFVGYGSSE---GGRNAGPVHMGTRELMEVD 242
Query: 247 QPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIY 306
PF+ V+ G AS IM AYN ++G+P + LL R++WGF G + +DC A+ ++
Sbjct: 243 MLPFKKAVEAGAAS-IMPAYNEIDGVPCTVNTELLDGILRKEWGFDGMVITDCGAIDMLA 301
Query: 307 DAEGYAKSPEDAVVDVLKAGMDVN-CGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRM 365
A+ DA V ++AG+D+ G KH + AV+ KL S +D A+ + +++
Sbjct: 302 SGHDTAEDGMDAAVQAIRAGIDLEMSGEMFGKHLQKAVESNKLEVSVLDEAVRRVLTLKF 361
Query: 366 RLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLA 425
+LGLF NP + P + +V+ S H LA Q A +GIVLLKN LPL K V +A
Sbjct: 362 KLGLFE-NPYVDP--QTAENVIGSGQHIGLARQLAAEGIVLLKNEAKALPLSKEGGV-IA 417
Query: 426 LIGPNANSAKTLLGNYAGPS-CRSITPL-----QALQNYVENTVYYPGCDTVACSSASID 479
+IGPNA+ LG+Y P ++T + L + +Y PGC S +
Sbjct: 418 VIGPNADQGYNQLGDYTSPQPPAAVTTVLGGIRAKLGEEAQRVLYAPGCRIKDDSREGFE 477
Query: 480 KAVDIAKGADHVVLMMG-------------------------LDQTQEKEELDRVDLVLP 514
A+ A+ AD VV+++G L E +DR+ L L
Sbjct: 478 FALSCAEQADTVVMVLGGSSARDFGEGTIDLRTGASKVTDDALSDMDCGEGIDRMTLQLS 537
Query: 515 GRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVI 574
G Q +L + + K+ +I+V + G P+ + D + +IL A YPG+ G A+A+++
Sbjct: 538 GVQLDLAQEIHKLGKR-MIVVYINGRPIAEPW--IDEHADAILEAWYPGQEGGHAIADIL 594
Query: 575 FGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYS 634
FGD NP G+L M+ P+ ++P+ R + G+ Y + + +PFG GLSY+
Sbjct: 595 FGDVNPSGKLTMS-IPKHVGQLPVYYNGKRSR------GKRYLEEDSQPRYPFGYGLSYT 647
Query: 635 KYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPE-LGTEFCETRKFLVTIGVKNHG 693
++SY S +M PE +GT+ +V++ V N G
Sbjct: 648 EFSY---------------SDIQM-----------TPEVIGTDGTA----VVSVNVTNSG 677
Query: 694 EMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGL 753
+ G V L+V A RP ++L GFQ + L E+ ++ F + P E L +D
Sbjct: 678 DCEGSEVVQLYVSDAASKYTRPARELKGFQKISLQPGERRKVEFTIGP-EQLQYIGQDYR 736
Query: 754 MVIEEGTHFLVVG 766
V+E G +++G
Sbjct: 737 QVVEPGLFRVMLG 749
>gi|189461690|ref|ZP_03010475.1| hypothetical protein BACCOP_02354 [Bacteroides coprocola DSM 17136]
gi|189431577|gb|EDV00562.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
coprocola DSM 17136]
Length = 499
Score = 275 bits (702), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 161/420 (38%), Positives = 241/420 (57%), Gaps = 34/420 (8%)
Query: 51 PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
P+ +R DL+SRLT++EK+S L ++P I RL IP Y +EALHGV G+
Sbjct: 34 PLHERIMDLLSRLTVEEKVSLLRATSPGISRLDIPKYYHGNEALHGVVRPGR-------- 85
Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA--------GQ-AIGMTFWAPNI 161
T FPQ I AA+++ L Y++ I EARA +N GQ + +TFW+P +
Sbjct: 86 --FTVFPQAIGLAATWNPELQYQVATVISDEARARWNELDQGKLQKGQFSDLLTFWSPTV 143
Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
N+ RDPRWGR ET GEDP ++G ++VRG+QGD L+ + KHF A
Sbjct: 144 NMARDPRWGRTPETYGEDPYLSGTMGTAFVRGLQGDDAR------YLKVVSTPKHFAA-- 195
Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
N + R++ + +++ + L + Y P FE+C+K G+A+ IM AYN +N +P + LL
Sbjct: 196 --NNEEHNRFECNPQISEKQLREYYLPAFEACIKDGKAASIMSAYNAINNVPCTLNSWLL 253
Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTK 340
+K R WGF GY+ SDC S++ +A Y K+ E A +KAG+D+ CG +
Sbjct: 254 TKVLRHDWGFQGYVVSDCGGPSLLVNAHKYVKTKEAAATLSIKAGLDLECGDDVYYEPLL 313
Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
A KQ + +++ID +++ RMRLGLF+ N P+ KI ++ S HQ +AL+AA
Sbjct: 314 NAYKQYMVSDADIDSTAYHVLKARMRLGLFD-NGKNNPYTKISPSIIGSKLHQRVALEAA 372
Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQALQNYV 459
+ IVLLKN + +LPL K S+A++G NA + + G+Y+G P I+ LQ ++ V
Sbjct: 373 RQCIVLLKNHNWVLPLDTKKLKSIAVVGINAGNCE--FGDYSGSPVIAPISILQGIKERV 430
>gi|375145746|ref|YP_005008187.1| Beta-glucosidase [Niastella koreensis GR20-10]
gi|361059792|gb|AEV98783.1| Beta-glucosidase [Niastella koreensis GR20-10]
Length = 866
Score = 275 bits (702), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 161/443 (36%), Positives = 234/443 (52%), Gaps = 31/443 (6%)
Query: 41 ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
+ P+ L +RA+DL++RLTL+EK S + + +PAIPRLGI + WWSEALHG+A
Sbjct: 22 QKLPYQNPDLSSEERAKDLITRLTLEEKASLMFDQSPAIPRLGIKKFNWWSEALHGLANN 81
Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAI 152
K T FP+ + AASFD L Y+I A E RA Y+ + +
Sbjct: 82 DK----------VTVFPEPVGMAASFDDTLVYKIFDATSDEVRAKYHDAIRNGKENRRFL 131
Query: 153 GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASA 212
++ W PNINIFRDPRWGRGQET GEDP +T + V+ V+G+QG + + L A
Sbjct: 132 SLSVWTPNINIFRDPRWGRGQETYGEDPYLTSRMGVAVVKGLQGPA--DARYRKLL---A 186
Query: 213 CCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGI 272
C KHF + W T + R +DL +TY P F+S V + +MCAY R++
Sbjct: 187 CAKHFAVHSGPEWSRHTLNLYQVRP--RDLYETYLPAFKSLVMEADVRQVMCAYQRLDDE 244
Query: 273 PSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC- 331
P C + LL R WGF + SDC A++ + + + A AG DV C
Sbjct: 245 PCCGNNRLLQTILRDNWGFKHVVVSDCGAITDFFTSHKVSSDAVHASAKGALAGTDVECV 304
Query: 332 -GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSP 390
+ K+ AV + + E EID+ L + + R LG + + + P+ I VV +
Sbjct: 305 WEGYAFKNLPEAVSRGLITEKEIDKHLLRVLAGRFDLGEMDDDAVV-PWAAIPMSVVNNE 363
Query: 391 AHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSIT 450
AH+ LAL AQ+ + LL+N + +LPL K + +A+IGPNA++ L GNY G R+I
Sbjct: 364 AHRKLALDMAQESMTLLQNRNNILPL-KRNAKKIAVIGPNADNDPVLWGNYNGRPVRTIN 422
Query: 451 PLQALQNYV--ENTVYYPGCDTV 471
+ ++ + + +Y CD V
Sbjct: 423 IVSGIRAKMPPDGVLYDKACDLV 445
Score = 103 bits (256), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 83/292 (28%), Positives = 132/292 (45%), Gaps = 52/292 (17%)
Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
KG D VV + GL E EE+ DR D+ LP Q+ + + A KK ++
Sbjct: 602 KGIDEVVFVGGLSTLLEGEEMPVSYPGFKGGDRTDIQLPEVQRNCLKALKAAGKK--VIF 659
Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK 595
+ C G I F + +IL A Y GE+G A+A+V+FGD+NP G+LP+T+Y
Sbjct: 660 VNCSGSA-IGFVPETESCDAILQAWYGGESGGQAVADVLFGDYNPSGKLPVTFYKDTTQL 718
Query: 596 VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSS 655
D ++ GRTYRF +FPFG GLSY+ +S +S+ + S+
Sbjct: 719 AGFEDYSLK--------GRTYRFM-SDPLFPFGFGLSYTTFSIGTAQLSKTNI-----SN 764
Query: 656 TKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRP 715
++ +E ++I V N G+ G V ++V+ +G P
Sbjct: 765 SESIE------------------------LSIPVTNTGKRNGTEIVQVYVRKTGDSDG-P 799
Query: 716 IKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGD 767
+K + GF+ + + A + V L P + G M + G + L+ G+
Sbjct: 800 LKTMRGFKRIAIPAGKTTTAVINLPPAAFEFYDEKFGKMTVAAGEYELLYGN 851
>gi|383113364|ref|ZP_09934136.1| hypothetical protein BSGG_3068 [Bacteroides sp. D2]
gi|382948729|gb|EFS32368.2| hypothetical protein BSGG_3068 [Bacteroides sp. D2]
Length = 850
Score = 275 bits (702), Expect = 9e-71, Method: Compositional matrix adjust.
Identities = 161/424 (37%), Positives = 243/424 (57%), Gaps = 34/424 (8%)
Query: 51 PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
P+ +R DL+SRLT++EKIS L ++P IPRLGI Y +EALHGV G+
Sbjct: 33 PVHERVADLLSRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPGR-------- 84
Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ---------AIGMTFWAPNI 161
T FPQ I AA+++ L R+ I EARA +N + +TFW+P +
Sbjct: 85 --FTVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQFSDVLTFWSPTV 142
Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
N+ RDPRWGR ET GEDP ++G ++V+G+QGD L+ + KHF A
Sbjct: 143 NMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQGDD------PRYLKIVSTPKHFAA-- 194
Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
N + R+ + +++ + L + Y P FE CVK+G+A+ IM AYN +N +P + LL
Sbjct: 195 --NNEEHNRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMTAYNALNDVPCTLNAWLL 252
Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTK 340
K R+ WGF GY+ SDC S++ +A Y K+ E A ++AG+D+ CG ++
Sbjct: 253 KKVLRQDWGFQGYVVSDCGGPSLLVNAHKYVKTKEAAATLSIQAGLDLECGDDVYDEYLL 312
Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
A KQ + +++ID A ++ + RM+LGLF+G P+ +I V+ S HQ +AL AA
Sbjct: 313 NAYKQYMVSDADIDSAACHVLTARMKLGLFDGT-ERNPYTRISPSVIGSKEHQQIALDAA 371
Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQALQNYV 459
++ IVLLKN + +LPL +K S+A++G NA K G+Y+G P ++ LQ +++ V
Sbjct: 372 RECIVLLKNKNNMLPLNVNKVKSIAVVGINA--GKCEFGDYSGAPVVDPVSILQGIKDRV 429
Query: 460 ENTV 463
+ V
Sbjct: 430 GDRV 433
Score = 117 bits (294), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 80/278 (28%), Positives = 138/278 (49%), Gaps = 43/278 (15%)
Query: 489 DHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAK 548
+ VV +MG++++ E+E DR D+ LP Q+E + + + +I+VL+ G + + +
Sbjct: 602 ETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNPN-IIVVLVAGSSLAVNWM- 659
Query: 549 YDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQAT 608
D +I +I+ A YPGE G A+A+V+FGD+NP GRLP+T+Y + ++P D Q
Sbjct: 660 -DEHIPAIVNAWYPGEQGGTAVADVLFGDYNPAGRLPLTYY-KSLDELPAFDDYDITQ-- 715
Query: 609 SGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYK 668
GRTY++++G ++PFG GLSYS + Y S K+ + + V
Sbjct: 716 ----GRTYKYFKGDVLYPFGYGLSYSSFKY---------------SDLKVKDGANTVS-- 754
Query: 669 SVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILN 728
V+ +KN G+ G ++V+ G PIK+L GF+ + L
Sbjct: 755 ----------------VSFRLKNTGKRKGDEVAQVYVRIPETGGVVPIKELKGFRRIPLK 798
Query: 729 AKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
+ E + EL + G ++ +G +++G
Sbjct: 799 SGESRVVEIELDKEQLRYWDAGLGQFIVPQGAFDIMIG 836
>gi|262405113|ref|ZP_06081663.1| periplasmic beta-glucosidase [Bacteroides sp. 2_1_22]
gi|262355988|gb|EEZ05078.1| periplasmic beta-glucosidase [Bacteroides sp. 2_1_22]
Length = 769
Score = 275 bits (702), Expect = 9e-71, Method: Compositional matrix adjust.
Identities = 207/713 (29%), Positives = 333/713 (46%), Gaps = 102/713 (14%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIP + EA HG +G AT FP I AA++ L +G+AIG
Sbjct: 117 RLGIPLF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSPQLIREVGKAIGK 164
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
E R G + P +++ RDPRW R +ET GEDP++TG+ + V G+
Sbjct: 165 EIRL------QGGHISYGPVLDLARDPRWSRVEETFGEDPVLTGEIGKAMVEGL-----G 213
Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
GG L A KHF AY + F +++L + + PPF + G A
Sbjct: 214 GGDLSHPYSTLATLKHFLAYGISESGQNGNPSF---AGIRELHENFLPPFRQAIDAG-AL 269
Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
+M +YN ++G+P A+ +LL++ R +W F G + SD ++ I+ + A + E+A +
Sbjct: 270 SVMTSYNSMDGVPCTANHSLLTELLRNEWKFRGIVVSDLYSIEGIHQSHFVAPTMEEAAI 329
Query: 321 DVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFG 380
L AG+DV+ G + AV ++ ++ +D ++ + ++ +GLF NP + P
Sbjct: 330 LALSAGVDVDLGGDAYMNLMNAVNTGRISKTALDASVARVLRLKFEMGLFE-NPYVDP-- 386
Query: 381 KIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGN 440
+ V S LA + AQ I LLKN H LLPL K++ V ALIGPNA++ +LG+
Sbjct: 387 EKAKKEVRSEESVTLARRVAQASITLLKNEHSLLPLNKNRKV--ALIGPNADNRYNMLGD 444
Query: 441 YAGPSCRS--ITPLQALQNYVENTV--YYPGCDTVACSSASIDKAVDIAKGADHVVLMMG 496
Y P T L ++ + ++ Y GC + I++AV A+ ++ ++ ++G
Sbjct: 445 YTAPQEEENIKTVLDGIRAKLSSSQVEYVKGCSIRDTVTTDIEQAVAAAQRSEVIIAVVG 504
Query: 497 -----------------------LDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVI 533
+ + E DR L L G+QQEL+ + +A KP+I
Sbjct: 505 GSSARDFKTSYKETGAAIADEKTISDMECGEGFDRATLSLLGKQQELL-KALKATGKPLI 563
Query: 534 LVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDY 593
+V + G P+D +A N ++L A YPG+ G +A+A+V+FGD NP GRLP + P+
Sbjct: 564 VVYIEGRPLDKNWA--SENADAVLTAYYPGQEGGIAIADVLFGDFNPAGRLPFS-VPRSV 620
Query: 594 IKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQS 653
++P+ K PQ+ Y ++PFG GLSY+ + Y
Sbjct: 621 GQIPLYYNKKAPQS------HDYVEMSASPLYPFGYGLSYTSFDYS-------------- 660
Query: 654 SSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNG 713
+L R F ++ V+N G+ G+ L+++
Sbjct: 661 ------------------DLHLSALMPRSFEISFKVRNTGKYDGEEVAQLYLRDEYASVV 702
Query: 714 RPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
+P+KQL F L E+ E+ F LS E S + ++E GT +++G
Sbjct: 703 QPLKQLKHFARFYLKRGEEREVKFILSE-EDFSLVDRNLKKIVEPGTFQIMIG 754
>gi|160892207|ref|ZP_02073210.1| hypothetical protein BACUNI_04671 [Bacteroides uniformis ATCC 8492]
gi|156858685|gb|EDO52116.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
uniformis ATCC 8492]
Length = 990
Score = 275 bits (702), Expect = 9e-71, Method: Compositional matrix adjust.
Identities = 228/798 (28%), Positives = 360/798 (45%), Gaps = 139/798 (17%)
Query: 51 PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW------------------- 89
P+ R +L+ ++TLDEK Q+V + + +P EW
Sbjct: 107 PLEARIENLLQQMTLDEKTCQMVTLYGYKRVLKDDLPTPEWKELLWKDGIGAIDEHLNGF 166
Query: 90 -------------WSEALHG----------VAGVGKGI---FFNGTIRG-----ATSFPQ 118
W + H V GI F N IRG AT+FP
Sbjct: 167 QQWGLPPSDNAYVWPASRHAWALNEVQRFFVEDTRLGIPVDFTNEGIRGVESYRATNFPT 226
Query: 119 VILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPG 177
+ +++ L ++G G EAR L G T +AP +++ RD RWGR +E G
Sbjct: 227 QLGLGHTWNRELIRQVGLITGREARML-------GYTNVYAPILDVGRDQRWGRYEEVYG 279
Query: 178 EDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARV 237
E P + + + VRG+Q + Q +A KHF AY + + D ++
Sbjct: 280 ESPYLVAELGIEMVRGLQHNH----------QVAATGKHFAAYSNNKGAREGMARVDPQM 329
Query: 238 TMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITS 297
+ +++ + + PF+ +++ G+M +YN +GIP L+ R + GF GY+ S
Sbjct: 330 SPREVENIHIYPFKRVIREAGMLGVMSSYNDYDGIPVQGSYYWLTTRLRGEMGFRGYVVS 389
Query: 298 DCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC-----GSFLQKHTKAAVKQKKLPESE 352
D DAV +Y G AK ++AV ++AG++V C SF+ + VK+ L E
Sbjct: 390 DSDAVEYLYTKHGTAKDMKEAVRQSVEAGLNVRCTFRSPDSFVLP-LRELVKEGGLSEEV 448
Query: 353 IDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADV---VCSPAHQVLALQAAQDGIVLLKN 409
I+ + ++ V+ +GLF+ P+ AD V ++ +ALQA+++ IVLLKN
Sbjct: 449 INDRVRDILRVKFLIGLFDA-----PYQTDLADADREVEKEENEAIALQASRESIVLLKN 503
Query: 410 SHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE---NTVYYP 466
+ LLPL + + +A+ GPNAN L +Y + T L+ +Q + +Y
Sbjct: 504 AGELLPLDINSTKKIAVCGPNANEEGYALTHYGPLAVEVTTVLEGIQEKTKGKAEVLYTK 563
Query: 467 GCDTVAC---------------SSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDL 511
GCD V A IDKAV+ A+ AD ++++G Q E R L
Sbjct: 564 GCDLVDAHWPESEIIDYPLTDDEQAEIDKAVENARQADVAIVVLGGGQRTCGENKSRTSL 623
Query: 512 VLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALA 571
LPGRQ +L+ + +A KPV+L+L+ G P+ I +A D+ + +IL A YPG G ALA
Sbjct: 624 DLPGRQLQLLQAI-QATGKPVVLILINGRPLSINWA--DKFVPAILEAWYPGSKGGTALA 680
Query: 572 EVIFGDHNPGGRLPMTWYPQDYIKVPMT-DMKMRPQATSG-NPGRTYRFYE-GKEVFPFG 628
+++FGD+NPGG+L +T +P+ ++P K Q G NPG T ++PFG
Sbjct: 681 DILFGDYNPGGKLTVT-FPKTVGQIPFNFPCKPSSQIDGGKNPGPTGNMSRINGALYPFG 739
Query: 629 CGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIG 688
GLSY+ + Y ++ + N+S++ V +
Sbjct: 740 YGLSYTTFEYSDLDITPRVITPNESAT-----------------------------VRLK 770
Query: 689 VKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRA 748
V N G+ AG V L+++ K L GFQ + L E E+ F + + L
Sbjct: 771 VTNTGKRAGDEVVQLYIRDVLSSITTYEKNLAGFQRIHLEPGEAQELSFTID-RKHLELL 829
Query: 749 REDGLMVIEEGTHFLVVG 766
D V+E G L+ G
Sbjct: 830 DADMKWVVEPGDFVLMAG 847
>gi|294647557|ref|ZP_06725134.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CC 2a]
gi|294807095|ref|ZP_06765914.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
xylanisolvens SD CC 1b]
gi|345508184|ref|ZP_08787819.1| periplasmic beta-glucosidase [Bacteroides sp. D1]
gi|292637099|gb|EFF55540.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CC 2a]
gi|294445794|gb|EFG14442.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
xylanisolvens SD CC 1b]
gi|345455214|gb|EEO50370.2| periplasmic beta-glucosidase [Bacteroides sp. D1]
Length = 783
Score = 274 bits (701), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 207/713 (29%), Positives = 333/713 (46%), Gaps = 102/713 (14%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIP + EA HG +G AT FP I AA++ L +G+AIG
Sbjct: 131 RLGIPLF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSPQLIREVGKAIGK 178
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
E R G + P +++ RDPRW R +ET GEDP++TG+ + V G+
Sbjct: 179 EIRL------QGGHISYGPVLDLARDPRWSRVEETFGEDPVLTGEIGKAMVEGL-----G 227
Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
GG L A KHF AY + F +++L + + PPF + G A
Sbjct: 228 GGDLSHPYSTLATLKHFLAYGISESGQNGNPSF---AGIRELHENFLPPFRQAIDAG-AL 283
Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
+M +YN ++G+P A+ +LL++ R +W F G + SD ++ I+ + A + E+A +
Sbjct: 284 SVMTSYNSMDGVPCTANHSLLTELLRNEWKFRGIVVSDLYSIEGIHQSHFVAPTMEEAAI 343
Query: 321 DVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFG 380
L AG+DV+ G + AV ++ ++ +D ++ + ++ +GLF NP + P
Sbjct: 344 LALSAGVDVDLGGDAYMNLMNAVNTGRISKTALDASVARVLRLKFEMGLFE-NPYVDP-- 400
Query: 381 KIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGN 440
+ V S LA + AQ I LLKN H LLPL K++ V ALIGPNA++ +LG+
Sbjct: 401 EKAKKEVRSEESVTLARRVAQASITLLKNEHSLLPLNKNRKV--ALIGPNADNRYNMLGD 458
Query: 441 YAGPSCRS--ITPLQALQNYVENTV--YYPGCDTVACSSASIDKAVDIAKGADHVVLMMG 496
Y P T L ++ + ++ Y GC + I++AV A+ ++ ++ ++G
Sbjct: 459 YTAPQEEENIKTVLDGIRAKLSSSQVEYVKGCSIRDTVTTDIEQAVAAAQRSEVIIAVVG 518
Query: 497 -----------------------LDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVI 533
+ + E DR L L G+QQEL+ + +A KP+I
Sbjct: 519 GSSARDFKTSYKETGAAIADEKTISDMECGEGFDRATLSLLGKQQELL-KALKATGKPLI 577
Query: 534 LVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDY 593
+V + G P+D +A N ++L A YPG+ G +A+A+V+FGD NP GRLP + P+
Sbjct: 578 VVYIEGRPLDKNWA--SENADAVLTAYYPGQEGGIAIADVLFGDFNPAGRLPFS-VPRSV 634
Query: 594 IKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQS 653
++P+ K PQ+ Y ++PFG GLSY+ + Y
Sbjct: 635 GQIPLYYNKKAPQS------HDYVEMSASPLYPFGYGLSYTSFDYS-------------- 674
Query: 654 SSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNG 713
+L R F ++ V+N G+ G+ L+++
Sbjct: 675 ------------------DLHLSALMPRSFEISFKVRNTGKYDGEEVAQLYLRDEYASVV 716
Query: 714 RPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
+P+KQL F L E+ E+ F LS E S + ++E GT +++G
Sbjct: 717 QPLKQLKHFARFYLKRGEEREVKFILSE-EDFSLVDRNLKKIVEPGTFQIMIG 768
>gi|383649536|ref|ZP_09959942.1| beta-glucosidase [Sphingomonas elodea ATCC 31461]
Length = 853
Score = 274 bits (701), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 166/427 (38%), Positives = 238/427 (55%), Gaps = 32/427 (7%)
Query: 60 VSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQV 119
++++T+++K +QL ++APA P +G+PAY+WW+E LHG+A G AT FPQ
Sbjct: 20 IAKMTIEQKAAQLQSTAPADPVIGLPAYDWWNEGLHGLARDGY----------ATVFPQA 69
Query: 120 ILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-------GMTFWAPNINIFRDPRWGRG 172
I AA++D L +++G + EARA +NA G+T W+PNINIFRDPRWGRG
Sbjct: 70 IGLAATWDVPLLHKVGDTVATEARAKFNAKPVTADRKIYEGLTIWSPNINIFRDPRWGRG 129
Query: 173 QETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYK 232
QET GEDP +TG AV +++G+QG K+ A KH + R
Sbjct: 130 QETYGEDPFLTGHLAVGFIQGLQGPDPAHPKVL------ATPKHLAVHSGPE---AGRDG 180
Query: 233 FDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFH 292
FD + QDL TY P F + +G+A +MCAYN ++G P+CA LL+ R+ WGF
Sbjct: 181 FDVDPSPQDLESTYTPAFRLALTEGKAQSVMCAYNSIHGTPACASGGLLNDRLRKDWGFT 240
Query: 293 GYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESE 352
G I SDCDAV+ I+ Y +A +KAGMD+NCG+ + AVK+ + E+E
Sbjct: 241 GLIVSDCDAVANIHLFHHYRLDAAEASAAAIKAGMDLNCGTTYAALPE-AVKRGLVSEAE 299
Query: 353 IDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHG 412
+D AL R LG+ G P+ +I +PA + LAL+AA+ IVLLKN
Sbjct: 300 VDVALKRALDARRALGIAFGG--ANPWSRIKPGERGTPAQRALALEAARKAIVLLKNDGD 357
Query: 413 LLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL-QNYVENTVYYPGCDTV 471
LPL + +A+IG NA+ L GNY G + +TPL + + + + V Y +
Sbjct: 358 RLPL--APGGRIAVIGANADDLGVLEGNYHGTARDPVTPLDGIRRQFGADRVVYAQGAQL 415
Query: 472 ACSSASI 478
A ++A I
Sbjct: 416 AEAAAVI 422
Score = 101 bits (251), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 87/283 (30%), Positives = 123/283 (43%), Gaps = 49/283 (17%)
Query: 489 DHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLC 538
D +V ++GL E E L DR D+ LP Q EL+ + KP++LVL
Sbjct: 580 DTIVAVLGLSPDLEGEALSVSIPGFVGGDRTDIALPRPQLELL-KALRKTGKPLVLVLTS 638
Query: 539 GGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPM 598
G V + + D +IL A YPGE G A+AE + G +NP GRLP+T+Y
Sbjct: 639 GSAVAVDPSLAD----AILEAWYPGEEGGTAIAETLAGKNNPSGRLPLTFYASVDDLPAF 694
Query: 599 TDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKM 658
D M+ RTYRF+ GK ++ FG GLSY+K++Y AV + S +
Sbjct: 695 VDYGMKE--------RTYRFFTGKPLWGFGHGLSYTKFAYGNVAVKAAGIGQPVQVSATL 746
Query: 659 VENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQ 718
+ G E + ++VT G + PVL +Q
Sbjct: 747 TNAGN--------RSGEEVAQV--YVVTPAAGKPGGL--TTPVLQ-------------RQ 781
Query: 719 LVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTH 761
L GFQ L + A + F L P S+S DG + GT+
Sbjct: 782 LAGFQRTALAPGKSASLSFTLDP-RSISSVARDGTRRVLPGTY 823
>gi|300726322|ref|ZP_07059774.1| beta-xylosidase B [Prevotella bryantii B14]
gi|291292284|gb|ADD92014.1| Xyl3A [Prevotella bryantii B14]
gi|299776347|gb|EFI72905.1| beta-xylosidase B [Prevotella bryantii B14]
Length = 885
Score = 274 bits (701), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 160/444 (36%), Positives = 235/444 (52%), Gaps = 29/444 (6%)
Query: 36 SNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALH 95
S+ ++ P+ L +RA DL RLTL+EK + + +PAIPRLGI + WWSEALH
Sbjct: 41 SDKTSTLLPYQNPNLSAYERAIDLCHRLTLEEKALLMQDESPAIPRLGIKKFFWWSEALH 100
Query: 96 GVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-- 153
G A +G T+FP+ I A+SF+ L + A E RA Y+ G
Sbjct: 101 GAANMG----------NVTNFPEPIAMASSFNPTLLKSVFSAASDEMRAQYHHRMDNGGE 150
Query: 154 ------MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGK 207
++ W PN+NIFRDPRWGRGQET GEDP +T + V G+QG + +
Sbjct: 151 DEKFHSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSVMGCAVVEGLQGPESSKYR---- 206
Query: 208 LQASACCKHFTAYDLDNWKGTTRYKFD-ARVTMQDLADTYQPPFESCVKQGRASGIMCAY 266
+ AC KHF + + +TR+ + ++ +DL +TY P F+S V+ G +MCAY
Sbjct: 207 -KLWACAKHFAVH---SGPESTRHTANLNNISPRDLYETYLPAFQSTVQDGHVREVMCAY 262
Query: 267 NRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAG 326
R++ P C++ LL + R +WGF + SDC AVS I+ + + A AG
Sbjct: 263 QRLDDEPCCSNNRLLQQILREEWGFKYLVVSDCGAVSDIWQSHKTSSDAVHASRQATLAG 322
Query: 327 MDVNCG-SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD 385
DV CG + AVK+ L E EID+ + L R LG + + ++ + KI
Sbjct: 323 TDVECGYGYTYAKIPEAVKRGLLTEEEIDKHVIRLLEGRFDLGEMDDSKLVE-WSKIPYS 381
Query: 386 VVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPS 445
++ AH LAL A+ IVLL+N +LPL K+ +A+IGPNA++ + GNY G
Sbjct: 382 IMSCKAHAQLALDMARQSIVLLQNKGNILPLQLKKNERIAVIGPNADNKPMMWGNYNGTP 441
Query: 446 CRSITPLQALQNYVENTVYYPGCD 469
+++ L+ ++ +N VY P CD
Sbjct: 442 NHTVSILEGIRKQYKNVVYLPACD 465
Score = 88.6 bits (218), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 59/188 (31%), Positives = 98/188 (52%), Gaps = 27/188 (14%)
Query: 478 IDKAVDIA--KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVA 525
+D A +IA KG D V+ + G+ + E EE+ DR D+ +P Q++ I +A
Sbjct: 614 VDYAANIAQLKGIDKVIFVGGIAPSLEGEEMPVNIPGFKGGDRTDIEMPQVQRDFIKALA 673
Query: 526 EAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLP 585
EA K+ I+++ C G I + +I+ A YPG+ G A+A+++ G NP G+LP
Sbjct: 674 EAGKQ--IILVNCSGSA-IALTPEAQRCQAIIQAWYPGQEGGTAVADILMGKVNPMGKLP 730
Query: 586 MTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKY---SYKFKA 642
+T+Y D M+ RTYR++E ++PFG GLSY+ + + K +
Sbjct: 731 VTFYKSTQQLPDFEDYSMK--------NRTYRYFE-DALYPFGYGLSYTSFEIGTAKLQT 781
Query: 643 VSQNKLYL 650
++ N + L
Sbjct: 782 LTNNSITL 789
>gi|295132888|ref|YP_003583564.1| beta-glucosidase [Zunongwangia profunda SM-A87]
gi|294980903|gb|ADF51368.1| beta-glucosidase [Zunongwangia profunda SM-A87]
Length = 855
Score = 274 bits (701), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 160/443 (36%), Positives = 237/443 (53%), Gaps = 37/443 (8%)
Query: 44 PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
P+ L +RA DLV+RLTL+EK S + + + AIPRLGI + WWSEALHG A
Sbjct: 13 PYQNPNLSPEERAEDLVNRLTLEEKASLMFDVSEAIPRLGIKKFNWWSEALHGFAN---- 68
Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGMT 155
N + T FP+ + AASFD L Y++ A E RA Y+ + + ++
Sbjct: 69 ---NDDV---TVFPEPVGMAASFDDELVYQVFDATSDEVRAKYHEALRNGEENKRFLSLS 122
Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCK 215
W PN+NIFRDPRWGRGQET GEDP +T + V V+G+QG K K L AC K
Sbjct: 123 VWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVQVVKGLQGP--EDAKYKKLL---ACAK 177
Query: 216 HFTAYDLDNWKGTTRYKFD-ARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPS 274
H+ + W +R++ + V+ +DL +TY P F+ V+ +MCAY R++ P
Sbjct: 178 HYAVHSGPEW---SRHELNLNNVSQRDLWETYLPAFKVLVQDANVRQVMCAYQRLDDEPC 234
Query: 275 CADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSF 334
C LL + R +WGF + SDC A+ Y + + A + AG DV C
Sbjct: 235 CGSDRLLQQILREKWGFEHLVVSDCGAIQDFYTSHNVSSDAVHAAAKAVLAGTDVECQ-- 292
Query: 335 LQKHT----KAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSP 390
KH AV++ + E +IDR++ + R LG + + + P+ +I A V+ +
Sbjct: 293 WDKHNYKLLPEAVEKGLVKEEDIDRSVKRVLIGRFELGEMDPDEIV-PYAQIPASVINNE 351
Query: 391 AHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSIT 450
H+ LAL+ A++ + LL+N + +LPL K + +A+IGPNA+ L GNY G R+I+
Sbjct: 352 EHRQLALKMARESMTLLQNKNNILPLSKGQD-RIAVIGPNADDEPMLWGNYNGTPVRTIS 410
Query: 451 PLQALQNYV--ENTVYYPGCDTV 471
L + + + ++ VY CD V
Sbjct: 411 ILDGITSKIGEKSIVYDKACDLV 433
Score = 101 bits (251), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 82/265 (30%), Positives = 121/265 (45%), Gaps = 53/265 (20%)
Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
KG + V+ + GL E EE+ DR D+ LP Q+ + + +A KK VI V
Sbjct: 590 KGIETVIFVGGLSTKLEGEEMPVSYPGFKGGDRTDIALPSVQRNCLKTLKDAGKK-VIFV 648
Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIK 595
G I + +IL A Y GE+G A+A+V+FGD+NP G+LP+T+Y +D +
Sbjct: 649 NNSGSA--IGLVPETTSCDAILQAWYGGESGGQAVADVLFGDYNPSGKLPVTFY-KDTTQ 705
Query: 596 VP-MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSS 654
+P D M GRTYRF + + +FPFG GLSY+ ++K +K ++ SS
Sbjct: 706 LPDFEDYSMN--------GRTYRFMKAEPLFPFGHGLSYT--NFKIGEAQLDKSEIDTSS 755
Query: 655 STKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGR 714
S +TI + N G+ G + ++V G
Sbjct: 756 SVN---------------------------ITISISNEGKTEGVEIIQVYVHKQGLEEG- 787
Query: 715 PIKQLVGFQSVILNAKEKAEIVFEL 739
PIK L GF+ V L E + L
Sbjct: 788 PIKTLKGFKRVNLKPNEMKNVTINL 812
>gi|423303577|ref|ZP_17281576.1| hypothetical protein HMPREF1072_00516 [Bacteroides uniformis
CL03T00C23]
gi|423307700|ref|ZP_17285690.1| hypothetical protein HMPREF1073_00440 [Bacteroides uniformis
CL03T12C37]
gi|392687941|gb|EIY81232.1| hypothetical protein HMPREF1072_00516 [Bacteroides uniformis
CL03T00C23]
gi|392689569|gb|EIY82846.1| hypothetical protein HMPREF1073_00440 [Bacteroides uniformis
CL03T12C37]
Length = 942
Score = 274 bits (701), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 229/796 (28%), Positives = 360/796 (45%), Gaps = 135/796 (16%)
Query: 51 PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW------------------- 89
P+ R +L+ ++TLDEK Q+V + + +P EW
Sbjct: 59 PLEARIENLLQQMTLDEKTCQMVTLYGYKRVLKDDLPTPEWKELLWKDGIGAIDEHLNGF 118
Query: 90 -------------WSEALHG----------VAGVGKGI---FFNGTIRG-----ATSFPQ 118
W + H V GI F N IRG AT+FP
Sbjct: 119 QQWGLPPSDNAYVWPASRHAWALNEVQRFFVEDTRLGIPVDFTNEGIRGVESYRATNFPT 178
Query: 119 VILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPG 177
+ +++ L ++G G EAR L G T +AP +++ RD RWGR +E G
Sbjct: 179 QLGLGHTWNRELIRQVGLITGREARML-------GYTNVYAPILDVGRDQRWGRYEEVYG 231
Query: 178 EDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARV 237
E P + + + VRG+Q + Q +A KHF AY + + D ++
Sbjct: 232 ESPYLVAELGIEMVRGLQHNH----------QVAATGKHFAAYSNNKGAREGMARVDPQM 281
Query: 238 TMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITS 297
+ +++ + + PF+ +++ G+M +YN +GIP L+ R + GF GY+ S
Sbjct: 282 SPREVENIHIYPFKRVIREAGMLGVMSSYNDYDGIPVQGSYYWLTTRLRGEMGFRGYVVS 341
Query: 298 DCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC-----GSFLQKHTKAAVKQKKLPESE 352
D DAV +Y G AK ++AV ++AG++V C SF+ + VK+ L E
Sbjct: 342 DSDAVEYLYTKHGTAKDMKEAVRQSVEAGLNVRCTFRSPDSFVLP-LRELVKEGGLSEEV 400
Query: 353 IDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD-VVCSPAHQVLALQAAQDGIVLLKNSH 411
I+ + ++ V+ +GLF+ P GAD V ++ +ALQA+ + +VLLKN+
Sbjct: 401 INDRVRDILRVKFLIGLFDA-PYQTDLA--GADREVEKEENEAIALQASHESVVLLKNAD 457
Query: 412 GLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN---TVYYPGC 468
LLPL + + +A+ GPNAN L +Y + T L+ +Q ++ +Y GC
Sbjct: 458 ELLPLDINSTKKIAVCGPNANEEGYALTHYGPLAVEVTTVLEGIQEKTKSKAEVLYTKGC 517
Query: 469 DTVAC---------------SSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVL 513
D V A IDKAV+ A+ AD V+++G Q E R L L
Sbjct: 518 DLVDAHWPESEIIDYPLTDDEQAEIDKAVENARQADVAVVVLGGGQRTCGENKSRTSLDL 577
Query: 514 PGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEV 573
PGRQ +L+ + +A KPV+L+L+ G P+ I +A D+ + +IL A YPG G ALA++
Sbjct: 578 PGRQLQLLQAI-QATGKPVVLILINGRPLSINWA--DKFVPAILEAWYPGSKGGTALADI 634
Query: 574 IFGDHNPGGRLPMTWYPQDYIKVPMT-DMKMRPQATSG-NPGRTYRFYE-GKEVFPFGCG 630
+FGD+NPGG+L +T +P+ ++P K Q G NPG T ++PFG G
Sbjct: 635 LFGDYNPGGKLTVT-FPKTVGQIPFNFPCKPSSQIDGGKNPGPTGNMSRINGALYPFGYG 693
Query: 631 LSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVK 690
LSY+ + Y ++ + N+S++ V + V
Sbjct: 694 LSYTTFEYSDLDITPRVITPNESAT-----------------------------VRLKVT 724
Query: 691 NHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRARE 750
N G+ AG V L+++ K L GFQ + L E E+ F + + L
Sbjct: 725 NTGKRAGDEVVQLYIRDVLSSITTYEKNLAGFQRIHLEPGEAQELSFTID-RKHLELLDA 783
Query: 751 DGLMVIEEGTHFLVVG 766
D V+E G L+ G
Sbjct: 784 DMKWVVEPGDFVLMAG 799
>gi|150002739|ref|YP_001297483.1| glycoside hydrolase family protein [Bacteroides vulgatus ATCC 8482]
gi|294776994|ref|ZP_06742455.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
vulgatus PC510]
gi|149931163|gb|ABR37861.1| glycoside hydrolase family 3, candidate beta-glycosidase
[Bacteroides vulgatus ATCC 8482]
gi|294449242|gb|EFG17781.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
vulgatus PC510]
Length = 788
Score = 274 bits (700), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 219/761 (28%), Positives = 349/761 (45%), Gaps = 131/761 (17%)
Query: 51 PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WSEALHGVAGVGKGI 104
P+ R +DL+S++TL+EK Q+ + + + +P W W + + + G+
Sbjct: 49 PLEDRVQDLLSQMTLEEKTCQMATLYGSGRVLKDALPQNNWKTEVWKDGIGNIDEEHNGL 108
Query: 105 ------------------------------------FFNGTIRG-----ATSFPQVILTA 123
F N IRG AT FP
Sbjct: 109 GAFKSEYSFPYAKHVNAKHTIQRWFVEKTRLGIPVDFTNEGIRGLCHDRATYFPAQCGQG 168
Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGEDPLV 182
A+++ L RIG+ EA AL G T ++P ++I +DPRWGR ET GEDP +
Sbjct: 169 ATWNKKLIARIGEVEAKEAVAL-------GYTNIYSPILDIAQDPRWGRCVETYGEDPYL 221
Query: 183 TGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDL 242
G+ + +Q K A KHF Y + + + D V +++
Sbjct: 222 VGELGKQMITSLQ-----------KYNLVATPKHFAVYSIPIGGRDGKTRTDPHVAPREM 270
Query: 243 ADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAV 302
Y PF ++ A G+M +YN +G P + L++ R++WGF GY+ SD +AV
Sbjct: 271 RTLYIEPFRMAFQEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKGYVVSDSEAV 330
Query: 303 SIIYDAEGYAKSPEDAVVDVLKAGMDVNC-----GSFLQKHTKAAVKQKKLPESEIDRAL 357
I + A + ED + + AG+++ F+ K AV K+ + +D+ +
Sbjct: 331 EFISNKHKVADTYEDGIAQAVNAGLNIRTHFTPPADFILPLRK-AVDNGKISQETLDKRV 389
Query: 358 HNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLP 417
+ ++ RLGLF+ NP + GK +V S HQ ++L+AA+ +VLLKN LLPL
Sbjct: 390 AEILRIKFRLGLFD-NP-YRGNGKQAEQIVHSKEHQAVSLEAARQSLVLLKNETNLLPLS 447
Query: 418 KSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENT--VYYPGCDTV---- 471
KS S+A+IGPNAN L+ Y + T Q ++ + +T +Y GCD +
Sbjct: 448 KSIR-SIAVIGPNANEQTQLICRYGPANAPIKTVYQGIKELLPHTEVIYKKGCDIIDPHF 506
Query: 472 -----------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQEL 520
A +++A+ AK A+ VV+++G ++ +E+ R L LPGRQ+EL
Sbjct: 507 PESEILDFPKTAEEVQLMEEAIRAAKQAEVVVMVLGGNELTVREDRSRTSLNLPGRQEEL 566
Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
+ V A KP+ILV+L G I +A +I +IL A +PGE A+AE +FGD+NP
Sbjct: 567 LKAVC-ATGKPIILVMLDGRASSINYAA--AHIPAILHAWFPGEFCGQAVAEALFGDYNP 623
Query: 581 GGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKF 640
GGRL +T +P+ ++P +P + + Y ++PFG GLSY+ ++Y
Sbjct: 624 GGRLAVT-FPKSVGQIPFA-FPFKPGSDESSSTSVY-----GALYPFGHGLSYTTFTYS- 675
Query: 641 KAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHP 700
L+ S S + V Q +H V+ +KN G++ G
Sbjct: 676 --------DLHISPSHQGV--QGDIH------------------VSCKIKNTGKIKGDEV 707
Query: 701 VLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSP 741
V L+++ K L GF+ + L A E+ + F L P
Sbjct: 708 VQLYLRDEISSVTTYTKVLRGFERISLKAGEEQTVHFRLRP 748
>gi|255690486|ref|ZP_05414161.1| periplasmic beta-glucosidase [Bacteroides finegoldii DSM 17565]
gi|260623937|gb|EEX46808.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
finegoldii DSM 17565]
Length = 1365
Score = 274 bits (700), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 230/795 (28%), Positives = 350/795 (44%), Gaps = 145/795 (18%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQL------------------------------ 72
P+ + LPI +R +DL+ R+T +EK++Q+
Sbjct: 534 LPYQRADLPIEERVKDLLQRMTPEEKLAQIRHIHSWEIFNGQALDERKLEEKAQGMSWGF 593
Query: 73 VNSAPAIP------------------RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGAT 114
V P RLGIP + +E+LHGV GAT
Sbjct: 594 VEGFPLTAENCAKNMLAIQRFMVEKTRLGIPIFTV-AESLHGVVH-----------EGAT 641
Query: 115 SFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGM-TFWAPNINIFRDPRWGRGQ 173
FPQ I ++FD+ L YR I E A+GM +P I++ RD RWGR +
Sbjct: 642 VFPQNIALGSTFDTDLAYRKTSMIADEL-------HAVGMRQVLSPCIDVVRDLRWGRVE 694
Query: 174 ETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKF 233
E+ GEDP + G++ ++ V+G NG S KH+ + + G
Sbjct: 695 ESFGEDPYLCGRFGIAEVKGYMD---NG--------ISPMLKHYGPHG-NPLSGLNLASV 742
Query: 234 DARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHG 293
+ +++DL + Y PFE +KQ +M AYN N IP+ A LL+ R++WGF G
Sbjct: 743 E--TSIRDLHEVYLKPFEMVMKQAPTLAVMSAYNSWNRIPNSASHYLLTDVLRKEWGFKG 800
Query: 294 YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEI 353
Y+ SD A+ ++ + A++ E+A + L AG+DV S +++ +L +
Sbjct: 801 YVYSDWGAIEMLKNFHFTARNSEEAALQALTAGLDVEASSDCYPAIPGLIERGELNREIV 860
Query: 354 DRALHNLFSVRMRLGLFNGNPTMQPFG-KIGADVVCSPAHQVLALQAAQDGIVLLKNSHG 412
D A+ + + R+GLF+ P+G K + S L+ + A + VLLKN
Sbjct: 861 DEAVRRVLYAKFRIGLFD-----DPYGEKFAKGAIHSGKAIALSKKIADESTVLLKNERQ 915
Query: 413 LLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR--SITPLQALQNYVENTV---YYPG 467
LLPL K S+A+IGPNA+ + G+Y +TPLQ ++ + V Y G
Sbjct: 916 LLPLSIGKLKSIAVIGPNADQIQ--FGDYTWTRDNRFGVTPLQGIRKWAGTNVKVNYAKG 973
Query: 468 CDTVACSSASIDKAVDIAKGADHVVLMMG---------LDQTQEKEELDRVDLVLPGRQQ 518
C V+ + I +AV+ A+ +D VL G + E D DL L G Q
Sbjct: 974 CSLVSMDESGIRQAVEAAEQSDVCVLFCGSASAALARDYKSSTCGEGFDLNDLTLTGAQP 1033
Query: 519 ELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDH 578
LI + +A KPVILVL+ G P I + K +NI +IL Y GE ++A+++FG
Sbjct: 1034 ALI-KAVQATGKPVILVLVTGKPFAIPWEK--KNIPAILVQWYAGEQSGNSIADILFGKV 1090
Query: 579 NPGGRLPMTWYPQDYIKVPMTDMKMRPQ-------ATSGNPGRTYRFYEGKEVFPFGCGL 631
+P GRL + +P+ +P+ +R + +PGR Y F ++ FG GL
Sbjct: 1091 SPSGRLTFS-FPESTGHLPVYYNHLRSDRGFYKSPGSYDSPGRDYVFSAPVPLWSFGHGL 1149
Query: 632 SYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKN 691
+Y+ + Y + LN D VH V IG+KN
Sbjct: 1150 TYTTFEYSNLQTDRASYLLN-----------DTVH------------------VRIGLKN 1180
Query: 692 HGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRARED 751
G+ GK V L+V P++QL F+ V L A E +IV P L+ E
Sbjct: 1181 TGKCEGKEVVQLYVSDVCSSVAMPVRQLRDFRKVALQAGE-TQIVRLSIPVSELTILNEK 1239
Query: 752 GLMVIEEGTHFLVVG 766
++E G + VG
Sbjct: 1240 NEAIVEPGEFEIQVG 1254
>gi|365121914|ref|ZP_09338824.1| hypothetical protein HMPREF1033_02170 [Tannerella sp.
6_1_58FAA_CT1]
gi|363643627|gb|EHL82934.1| hypothetical protein HMPREF1033_02170 [Tannerella sp.
6_1_58FAA_CT1]
Length = 1073
Score = 274 bits (700), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 165/430 (38%), Positives = 230/430 (53%), Gaps = 45/430 (10%)
Query: 42 TFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVG 101
+PF TTL +R +DL+SRL + EKIS L ++PAIPRLGI Y +EALHGV G
Sbjct: 26 NYPFRDTTLSHHERIKDLLSRLNVSEKISLLRATSPAIPRLGIDKYYHGNEALHGVVRPG 85
Query: 102 KGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN---------AGQAI 152
K T FPQ I A+ ++ + AI EAR +N AG +
Sbjct: 86 K----------FTVFPQAIGLASMWNPDFLQEVSTAISDEARGRWNELNQGKDQTAGASD 135
Query: 153 GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASA 212
+TFW+P IN+ RDPRWGR ET GEDP +TG ++VRG+QG+ ++ +
Sbjct: 136 LLTFWSPTINMARDPRWGRTPETYGEDPFLTGTLGTAFVRGLQGND------PKYIKVVS 189
Query: 213 CCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGI 272
KHF A N + R +A ++ +DL + Y P FE C+K+G+A +M AYN VNGI
Sbjct: 190 TPKHFAA----NNEEHNRASGNAVISERDLREYYFPAFEKCIKEGQAQSVMSAYNAVNGI 245
Query: 273 PSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG 332
P ++ LL+ R WGF GY+ SDC A I Y + E+A +KAG+D+ CG
Sbjct: 246 PCTLNKWLLTDVLRDDWGFDGYVVSDCSAPEYIVSQHHYVDTYEEAASLCIKAGLDLECG 305
Query: 333 S------FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADV 386
L + + V SEID A + + RMRLGLF+ +P P+ KI +
Sbjct: 306 DNVYITPLLNAYNRGMVTM-----SEIDSAAYRVLRGRMRLGLFD-DPNENPYNKISPSI 359
Query: 387 VCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSC 446
V H+ LAL+AA+ +VLLKN +LP+ S+A++G NA A G+Y+G
Sbjct: 360 VGCEKHRELALEAARQSLVLLKNDKDMLPIQTDNIKSIAVVGINA--ANCEFGDYSGTPV 417
Query: 447 RSITPLQALQ 456
TP+ L+
Sbjct: 418 N--TPISVLE 425
Score = 132 bits (332), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 93/301 (30%), Positives = 149/301 (49%), Gaps = 44/301 (14%)
Query: 466 PGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVA 525
P D+ + A D A +I +G+D + ++G+D+T E+E DR + LP QQ I
Sbjct: 722 PSTDSESLLDAYGD-AGEIIRGSDLTIAVLGIDRTIEREGQDRSTIELPEDQQIFIEEAY 780
Query: 526 EAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLP 585
+A V+ VL+ G + I + D+NI ++L A YPGE G A+AE +FGD+NPGGRLP
Sbjct: 781 KANPNTVV-VLVAGSSLAINWI--DQNIPAVLDAWYPGEQGGTAVAEALFGDYNPGGRLP 837
Query: 586 MTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQ 645
+T+Y ++D+ N RTY ++EGK ++PFG GLSY+ ++Y+ V+Q
Sbjct: 838 LTFYNS------LSDLPAFDDYNVRN-NRTYMYFEGKPLYPFGYGLSYTDFAYRGLDVTQ 890
Query: 646 NKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFV 705
++ EN V KF V+ N G G +++
Sbjct: 891 DE------------ENVTV-----------------KFFVS----NTGNYDGDEVAQVYI 917
Query: 706 KPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVV 765
+ +G P+KQL GF+ V ++ ++ EI + E + + EG + +V
Sbjct: 918 QFPDQGTTLPLKQLKGFKRVHISKGQETEITVRIPKKELRLWSENNSEFYTPEGNYIFLV 977
Query: 766 G 766
G
Sbjct: 978 G 978
>gi|317480750|ref|ZP_07939836.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides sp. 4_1_36]
gi|316903091|gb|EFV24959.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides sp. 4_1_36]
Length = 942
Score = 274 bits (700), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 228/798 (28%), Positives = 360/798 (45%), Gaps = 139/798 (17%)
Query: 51 PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW------------------- 89
P+ R +L+ ++TLDEK Q+V + + +P EW
Sbjct: 59 PLEARIENLLQQMTLDEKTCQMVTLYGYKRVLKDDLPTPEWKELLWKDGIGAIDEHLNGF 118
Query: 90 -------------WSEALHG----------VAGVGKGI---FFNGTIRG-----ATSFPQ 118
W + H V GI F N IRG AT+FP
Sbjct: 119 QQWGLPPSDNAYVWPASRHAWALNEVQRFFVEDTRLGIPVDFTNEGIRGVESYRATNFPT 178
Query: 119 VILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPG 177
+ +++ L ++G G EAR L G T +AP +++ RD RWGR +E G
Sbjct: 179 QLGLGHTWNRELIRQVGLITGREARML-------GYTNVYAPILDVGRDQRWGRYEEVYG 231
Query: 178 EDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARV 237
E P + + + VRG+Q + Q +A KHF AY + + D ++
Sbjct: 232 ESPYLVAELGIEMVRGLQHNH----------QVAATGKHFAAYSNNKGAREGMARVDPQM 281
Query: 238 TMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITS 297
+ +++ + + PF+ +++ G+M +YN +GIP L+ R + GF GY+ S
Sbjct: 282 SPREVENIHIYPFKRVIREAGMLGVMSSYNDYDGIPVQGSYYWLTTRLRGEMGFRGYVVS 341
Query: 298 DCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC-----GSFLQKHTKAAVKQKKLPESE 352
D DAV +Y G AK ++AV ++AG++V C SF+ + VK+ L E
Sbjct: 342 DSDAVEYLYTKHGTAKDMKEAVRQSVEAGLNVRCTFRSPDSFVLP-LRELVKEGGLSEEV 400
Query: 353 IDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADV---VCSPAHQVLALQAAQDGIVLLKN 409
I+ + ++ V+ +GLF+ P+ AD V ++ +ALQA+++ IVLLKN
Sbjct: 401 INDRVRDILRVKFLIGLFDA-----PYQTDLADADREVEKEENEAIALQASRESIVLLKN 455
Query: 410 SHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE---NTVYYP 466
+ LLPL + + +A+ GPNAN L +Y + T L+ +Q + +Y
Sbjct: 456 AGELLPLDINSTKKIAVCGPNANEEGYALTHYGPLAVEVTTVLEGIQEKTKGKAEVLYTK 515
Query: 467 GCDTVAC---------------SSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDL 511
GCD V A IDKAV+ A+ AD ++++G Q E R L
Sbjct: 516 GCDLVDAHWPESEIIDYPLTDDEQAEIDKAVENARQADVAIVVLGGGQRTCGENKSRTSL 575
Query: 512 VLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALA 571
LPGRQ +L+ + +A KPV+L+L+ G P+ I +A D+ + +IL A YPG G ALA
Sbjct: 576 DLPGRQLQLLQAI-QATGKPVVLILINGRPLSINWA--DKFVPAILEAWYPGSKGGTALA 632
Query: 572 EVIFGDHNPGGRLPMTWYPQDYIKVPMT-DMKMRPQATSG-NPGRTYRFYE-GKEVFPFG 628
+++FGD+NPGG+L +T +P+ ++P K Q G NPG T ++PFG
Sbjct: 633 DILFGDYNPGGKLTVT-FPKTVGQIPFNFPCKPSSQIDGGKNPGPTGNMSRINGALYPFG 691
Query: 629 CGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIG 688
GLSY+ + Y ++ + N+S++ V +
Sbjct: 692 YGLSYTTFEYSDLDITPRVITPNESAT-----------------------------VRLK 722
Query: 689 VKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRA 748
V N G+ AG V L+++ K L GFQ + L E E+ F + + L
Sbjct: 723 VTNTGKRAGDEVVQLYIRDVLSSITTYEKNLAGFQRIHLEPGEAQELSFTID-RKHLELL 781
Query: 749 REDGLMVIEEGTHFLVVG 766
D V+E G L+ G
Sbjct: 782 DADMKWVVEPGDFVLMAG 799
>gi|293371041|ref|ZP_06617583.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
gi|292633971|gb|EFF52518.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
Length = 791
Score = 273 bits (699), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 210/713 (29%), Positives = 340/713 (47%), Gaps = 101/713 (14%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIP + EA HG +G T FP I AA++ L +GQ I
Sbjct: 138 RLGIPMF-LAEEAPHGHMAIG-----------ITVFPTGIGMAATWSPELVKEVGQVIAK 185
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
E R+ G + P +++ RDPRW R +ET GEDP+++G + V G+
Sbjct: 186 EIRS------QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGTLGAAMVDGLIN---- 235
Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
G + K A KHF AY + A V M++L + + PPF+ + G A
Sbjct: 236 -GNISRKNSTIATLKHFLAYAVPEGGQNGN---QALVGMRELHENFLPPFKKAIDAG-AL 290
Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
+M +YN ++GIP A+ LL++ R +W F G++ SD ++ IY++ A S EDA +
Sbjct: 291 SVMTSYNSIDGIPCTANSYLLNQLLRNEWKFRGFVVSDLYSIEGIYESHYTASSIEDAAI 350
Query: 321 DVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFG 380
+ AG+DV+ G + AVK+K+L E+ ID + + ++ +GLF NP + P
Sbjct: 351 QAVSAGVDVDLGGEAYTNIYRAVKEKRLSEAIIDEVVCRVLRLKFEMGLFE-NPYVDP-- 407
Query: 381 KIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGN 440
+I + V + H A + AQ + LLKN H +LPL K+ +A+IGPNA++ +LG+
Sbjct: 408 QIAIERVRNANHIANARRMAQASVTLLKNRHDILPLSKNIR-KVAVIGPNADNCYNMLGD 466
Query: 441 YAGP----SCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG 496
Y P + +++ + + Y GC ++ I KAV+ A AD V+ ++G
Sbjct: 467 YTAPQKDENIKTVLDGIISKLSLSRVEYVRGCAIRDTTNNEIAKAVEAANRADVVIAVVG 526
Query: 497 ------------------LDQTQEK-----EELDRVDLVLPGRQQELITRVAEAAKKPVI 533
D++Q E DR L L G+Q EL+ + ++ +KP+I
Sbjct: 527 GSSARDFKTTYKETGAAIADKSQISDMECGEGFDRATLSLLGKQLELLESL-KSTRKPLI 585
Query: 534 LVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDY 593
+V + G P++ +A + ++L A YPG+ G A+A+V+FGD+NP GRLP++ P+
Sbjct: 586 VVYIEGRPLNKNWAA--EHADALLTAYYPGQEGGDAIADVLFGDYNPAGRLPVS-VPRSE 642
Query: 594 IKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQS 653
++P+ K P+ Y ++ FG GLSYS + Y V+Q
Sbjct: 643 GQIPVYYNKKTPKC------HDYVEMSASPLYSFGYGLSYSTFEYSNLKVTQ-------- 688
Query: 654 SSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNG 713
Q +H++ ++ V+N G+ G+ L+++
Sbjct: 689 --------QAPLHFE----------------ISFDVENTGKYDGEEVAQLYIRDEYASVV 724
Query: 714 RPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
R ++QL F+ L EK IVF L E LS + ++E G+ L++G
Sbjct: 725 RALRQLKHFKRFFLKQGEKKTIVFTLVE-EDLSIINQKMERIVEPGSFQLMIG 776
>gi|167646366|ref|YP_001684029.1| beta-glucosidase [Caulobacter sp. K31]
gi|167348796|gb|ABZ71531.1| Beta-glucosidase [Caulobacter sp. K31]
Length = 898
Score = 273 bits (699), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 168/417 (40%), Positives = 235/417 (56%), Gaps = 42/417 (10%)
Query: 56 ARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATS 115
A +LV ++TLDEK+ QL+N+APAIPRLG+PAY WW+E+LHG G T+
Sbjct: 46 AAELVGKMTLDEKLEQLLNTAPAIPRLGVPAYNWWTESLHGALGALP----------TTN 95
Query: 116 FPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG--------MTFWAPNINIFRDP 167
FP+ I AA+FD+ L + + AIG E + L+ +A G + W+PNINIFRDP
Sbjct: 96 FPEPIGLAATFDAPLVHDVAGAIGAEMQDLHALARATGRMGRIGTGLNTWSPNINIFRDP 155
Query: 168 RWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKG 227
RWGRGQET GEDP +T + V++V G+QGD + ++ A KHF + +
Sbjct: 156 RWGRGQETYGEDPFLTARMGVAFVEGIQGDDPDHPRI------IATPKHFAVH---SGPE 206
Query: 228 TTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARR 287
+TR+ + V+ +DL DTY P F + V +GRA IMCAYNR++G P+CA LL + R
Sbjct: 207 STRHGANVFVSRRDLEDTYLPAFRAAVVEGRAGSIMCAYNRIDGQPACASDLLLKEHLRG 266
Query: 288 QWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSF-------LQKHTK 340
W F GY+ SDCDAV I D YA AV L+AG+D C L +
Sbjct: 267 AWKFDGYVVSDCDAVKDISDHHKYAPEAASAVAAALRAGVDNECNGATLTDTDGLAGRYR 326
Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKI-GADVVCSPAHQVLALQA 399
A+ + + ++ID AL LFS R R G G++ G VV + H+ LAL A
Sbjct: 327 EALDRGLISTAQIDTALVRLFSARFRNGDLPAKGGSD--GRLAGPSVVTTHEHEALALAA 384
Query: 400 AQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQ 456
++ +VLLKN G+LPL + +A+IGP ++ + L GNY+ S S P+ +
Sbjct: 385 SEKSLVLLKND-GVLPL--KPGLRIAVIGPLGDATRVLRGNYS--SALSAPPISVVD 436
Score = 112 bits (281), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 90/288 (31%), Positives = 132/288 (45%), Gaps = 52/288 (18%)
Query: 489 DHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLC 538
D +V ++GL E EE D+ L LP Q+ ++ + A+A KP+I+V +
Sbjct: 628 DVLVAVVGLTSDMEAEEAPIEIPGFKGGDKTTLDLPADQRAMLEQ-AKALGKPLIVVAMN 686
Query: 539 GGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPM 598
G P++ +AK N ++L A YPG++G +A+A V+ G NP GRLP+T+Y P
Sbjct: 687 GSPLNFAWAK--DNASALLEAWYPGQSGGLAIANVLTGKTNPAGRLPLTFYRSVDDLPPF 744
Query: 599 TDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKM 658
D M GRTYR++EG V+PFG GLSY+++ Y L +TK
Sbjct: 745 DDYAM--------AGRTYRYFEGTPVYPFGYGLSYTRFDYG---------PLKIEPATK- 786
Query: 659 VENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQ 718
+ VT +KN G G+ L++ + G P
Sbjct: 787 -------------------GAGQGLRVTTTIKNVGTRPGEEVAQLYLDFPKT-PGAPRLA 826
Query: 719 LVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
L GFQ + L E +I F LSP LS DG + G + + VG
Sbjct: 827 LRGFQRIALKPGETRDITFALSP-RDLSSVDLDGEHRVSAGLYRVSVG 873
>gi|393781366|ref|ZP_10369565.1| hypothetical protein HMPREF1071_00433 [Bacteroides salyersiae
CL02T12C01]
gi|392676859|gb|EIY70281.1| hypothetical protein HMPREF1071_00433 [Bacteroides salyersiae
CL02T12C01]
Length = 854
Score = 273 bits (699), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 168/442 (38%), Positives = 245/442 (55%), Gaps = 37/442 (8%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
+ P +R DL+S+LT++EKIS L ++P IPRL I Y +EALHGV V G
Sbjct: 28 YLDMNAPQHERILDLLSKLTIEEKISLLRATSPGIPRLQIDKYYHGNEALHGV--VRPGN 85
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN----AGQAIG-----MT 155
F T FPQ I AA ++ L I AI EARA +N + +G +T
Sbjct: 86 F--------TVFPQAIGLAAMWNPQLLNEISTAISDEARARWNELEQGKKQLGQFSDLLT 137
Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCK 215
FW+P +N+ RDPRWGR ET GEDP ++GK VS+V+G+QGD L+ + K
Sbjct: 138 FWSPTVNMARDPRWGRTPETYGEDPFLSGKLGVSFVKGLQGDD------PRYLKIVSTPK 191
Query: 216 HFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSC 275
HF A N + R++ + ++ +DL + Y P FE C+ +G+A+ IM AYN +N +P
Sbjct: 192 HFAA----NNEEHNRFECNPIISEKDLREYYLPAFEKCIIEGKAASIMTAYNAINDVPCT 247
Query: 276 ADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SF 334
+ LL K R WGF GY+ SDC A + Y K+ E A ++AG+D+ CG +
Sbjct: 248 LNNWLLKKVLRHDWGFDGYVVSDCGAPDFLVTHHKYVKTLEAAATLSIQAGLDLECGDNV 307
Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
+ A KQ + E+EID A +++ RMRLGLF+ +P + P+ KI VV H
Sbjct: 308 YMEPLLNAYKQYMVTEAEIDSAAYHILRARMRLGLFD-DPNLNPYNKISPSVVGCEKHSQ 366
Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQ 453
LAL+AA+ IVLLKN LPL K S+A++G NA + + G+Y+G P + ++ L+
Sbjct: 367 LALEAARQSIVLLKNEKKFLPLDLKKIKSIAVVGINAGNCE--FGDYSGTPVNQPVSILE 424
Query: 454 ALQNYVENTV---YYPGCDTVA 472
++ V + + Y P +V+
Sbjct: 425 GIKKRVGDQIEVMYSPWTSSVS 446
Score = 120 bits (300), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 91/291 (31%), Positives = 138/291 (47%), Gaps = 51/291 (17%)
Query: 481 AVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGG 540
A D + D + ++G++++ E+E DR + LP QQ I + V+ VL+ G
Sbjct: 595 AGDAMRKCDLTIAVVGINKSIEREGQDRYSIELPKDQQIFIEEAYKINPNTVV-VLVAGS 653
Query: 541 PVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVP-MT 599
+ I + D +I +I+ A YPGEAG A+AEV+FGD+NPGG+LP+T+Y + ++P
Sbjct: 654 SLAINWM--DEHIPAIVNAWYPGEAGGTAVAEVLFGDYNPGGKLPLTYY-RSLDELPAFD 710
Query: 600 DMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMV 659
D +R GRTY+F+EG ++ FG GLSY+ +SYK LN S+ V
Sbjct: 711 DYDIR-------KGRTYQFFEGNPLYAFGHGLSYTTFSYK---------KLNIDSTGDAV 754
Query: 660 ENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGR----P 715
+ V+ +KN G+ G L+VK +GN P
Sbjct: 755 K------------------------VSFALKNTGKYDGDEVAQLYVK--YQGNDSLVKLP 788
Query: 716 IKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
+KQL GF+ V L E + + E E G G + +VG
Sbjct: 789 LKQLKGFERVHLKKGESKRVTLTVPKSELRFWDEEKGEFYTPAGDYLFMVG 839
>gi|423240769|ref|ZP_17221883.1| hypothetical protein HMPREF1065_02506 [Bacteroides dorei
CL03T12C01]
gi|392643731|gb|EIY37480.1| hypothetical protein HMPREF1065_02506 [Bacteroides dorei
CL03T12C01]
Length = 864
Score = 273 bits (699), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 163/443 (36%), Positives = 241/443 (54%), Gaps = 33/443 (7%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
+ + L +RA DL+ +LTL+EK++ +++++ + RLGI Y WW+EALHGVA G
Sbjct: 24 YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------MTFW 157
AT FPQ I AASF+ + I A+ EARA A A G +T W
Sbjct: 82 --------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMW 133
Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
P +NI+RDPRWGRG ET GEDP +T V+ V+G+Q N K AC KHF
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCTDANQKYDK----IHACAKHF 189
Query: 218 TAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
+ W R++F+A + +DL +TY PFE+ VK+G+ +MCAYNR+ G P C
Sbjct: 190 AVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEGKVKEVMCAYNRLEGDPCCG 246
Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDVNCGSF 334
LL + R++WG+ G + SDC A+ Y +G+ P E A + +G D+ CGS
Sbjct: 247 SDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGHKTHPNAESASAAAVLSGTDLECGSS 306
Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
+ ++A K+ + E +ID ++ L R LG + +P+ + KI VVCS H
Sbjct: 307 YKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMD-DPSKVEWTKIPYSVVCSAEHDS 364
Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
L+L A+ + LL N + +LPL K ++A++GPNAN + GNY G +IT L+
Sbjct: 365 LSLDIARKSMTLLLNKNNILPL-KRGGQTIAVMGPNANDSVMQWGNYNGTPKHTITLLEG 423
Query: 455 LQNYV---ENTVYYPGCDTVACS 474
+++ + + +Y GC V S
Sbjct: 424 IRSAMGENDKLIYEQGCSWVERS 446
Score = 116 bits (290), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 90/276 (32%), Positives = 133/276 (48%), Gaps = 54/276 (19%)
Query: 477 SIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAE 526
+I V K AD V+ G+ + E EE+ DR D+ LP Q+ELI + +
Sbjct: 591 NIKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCD 650
Query: 527 AAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPM 586
A KK VI V G P I + +IL A YPG++G A AEV+FGD+NP GRLP+
Sbjct: 651 AGKK-VIFVNFSGSP--IAMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYNPAGRLPV 707
Query: 587 TWYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQ 645
T+Y ++ ++P D M GRTYR+++G +FPFG GLSY+ ++Y
Sbjct: 708 TFY-RNIAQLPDFEDYNM--------TGRTYRYFKGDPLFPFGYGLSYTTFNY------- 751
Query: 646 NKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFV 705
+ + L+Q+ + ET K + I V N G G+ V +++
Sbjct: 752 DNIKLDQT---------------------IKVGETAKMV--IPVTNAGNRDGEEVVQVYL 788
Query: 706 KPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSP 741
K G P K L F+ V + A + + EL+P
Sbjct: 789 KKQEDAEG-PAKTLRAFKRVQIPAGKTVNVELELTP 823
>gi|333380553|ref|ZP_08472244.1| hypothetical protein HMPREF9455_00410 [Dysgonomonas gadei ATCC
BAA-286]
gi|332826548|gb|EGJ99377.1| hypothetical protein HMPREF9455_00410 [Dysgonomonas gadei ATCC
BAA-286]
Length = 957
Score = 273 bits (699), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 229/746 (30%), Positives = 353/746 (47%), Gaps = 106/746 (14%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNS--APAIPRLGIPAYEWWSEALHGVAGVGK 102
+ LP+ R DL+S +T+++K+ L P IP LG+PA EA+HG
Sbjct: 170 YMNPNLPLESRVEDLLSVMTVEDKMELLREGWGIPGIPHLGVPAIHK-VEAIHG------ 222
Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
F G+ GAT FPQ I A+++ L AIG E + NA QA W+P ++
Sbjct: 223 --FSYGS--GATIFPQSIGMGATWNKRLIEAAAMAIGDETVSA-NAVQA-----WSPVLD 272
Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL 222
+ +D RWGR +ET GEDP++ + ++++G Q KG + KHF A+
Sbjct: 273 VAQDARWGRCEETYGEDPVLVTEIGGAWIKGYQS--------KGLMTTP---KHFAAHGA 321
Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
R D ++ +++ + + PF K+ + IM +Y+ G+P + LL
Sbjct: 322 ---PLGGRDSHDIGLSEREMREIHLVPFRDIYKKYKYQSIMMSYSDFLGVPVAKSKELLK 378
Query: 283 KTARRQWGFHGYITSDCDAVSIIYDAEGY-AKSPEDAVVDVLKAGMDVNCG-SFLQKHTK 340
R +WGF G+I SDC A+ + + Y A +A L AG+ NCG ++
Sbjct: 379 GILRDEWGFDGFIVSDCGAIGNLTARKHYTAVDKVEAARQALAAGIATNCGDTYNDPDVI 438
Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQP--FGKI--GADVVCSPAHQVLA 396
AA K+ +L ++D L R GLF NP +P + KI G + SP HQ LA
Sbjct: 439 AAAKRGELNMDDLDFTCKTLLRTLFRNGLFENNPC-KPLDWNKIYPGWN---SPEHQALA 494
Query: 397 LQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA-----GPSCRSITP 451
+ AQ+ IVLL+N +LPL KS ++A+IGP A++ + G+Y G +T
Sbjct: 495 RKTAQESIVLLENKGNILPLSKSLK-TIAVIGPGADNLQP--GDYTSKPQPGQLKSVLTG 551
Query: 452 LQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEK-------- 503
++A N +Y GC + I KAV A+ AD VL++G T E
Sbjct: 552 IKAAVNSSTKVLYEEGCRFIGTEGTDIAKAVKAAENADVAVLVLGDCSTSEALKGITNTS 611
Query: 504 -EELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYP 562
E D L+LPG QQ+L+ V + KPV+L+L G P ++++A N ++L P
Sbjct: 612 GENHDLATLILPGEQQKLLEAVCKTG-KPVVLILQAGRPYNLSYAA--ENCQAVLVNWLP 668
Query: 563 GEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGK 622
G+ G A A+V+FGD+NP GRLPMT +P+D ++P+ GR Y + +
Sbjct: 669 GQEGGYATADVLFGDYNPAGRLPMT-FPRDAAQLPL-------YYNFKTSGRVYDYVDMP 720
Query: 623 --EVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCET 680
++ FG GLSY+ ++Y +S K N +V
Sbjct: 721 YYPLYQFGYGLSYTSFNYSDLNISLEK-------------NGNVS--------------- 752
Query: 681 RKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELS 740
V V N G++AG V L++ + +L F V LN E ++ F L+
Sbjct: 753 ----VNATVTNTGKVAGDEVVQLYITDMYASVKTRVMELKDFDRVYLNPGESKKVSFVLT 808
Query: 741 PCESLSRAREDGLMVIEEGTHFLVVG 766
P + LS ++ V+E+G ++VG
Sbjct: 809 PYQ-LSLLNDEMDRVVEKGLFKIMVG 833
>gi|293371677|ref|ZP_06618088.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
gi|292633374|gb|EFF51944.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
Length = 783
Score = 273 bits (699), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 207/713 (29%), Positives = 333/713 (46%), Gaps = 102/713 (14%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIP + EA HG +G AT FP I AA++ L +G+AIG
Sbjct: 131 RLGIPLF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSPQLIREVGKAIGK 178
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
E R G + P +++ RDPRW R +ET GEDP++TG+ + V G+
Sbjct: 179 EIRL------QGGHISYGPVLDLARDPRWSRVEETFGEDPVLTGEIGKAIVEGL-----G 227
Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
GG L A KHF AY + F +++L + + PPF + G A
Sbjct: 228 GGDLSRPYSTLATLKHFLAYGISESGQNGNPSFAG---IRELHENFLPPFRQAIDAG-AL 283
Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
+M +YN ++G+P A+ +LL++ R +W F G + SD ++ I+ + A + E+A V
Sbjct: 284 SVMTSYNSMDGVPCTANHSLLTELLRNEWKFSGIVVSDLYSIEGIHQSHFVAPTMEEAAV 343
Query: 321 DVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFG 380
L AG+DV+ G + AV ++ ++ +D ++ + ++ +GLF NP + P
Sbjct: 344 LALSAGVDVDLGGDAYMNLMNAVNTGRIGKTALDASVARVLRLKFEMGLFE-NPYVDP-- 400
Query: 381 KIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGN 440
+ V S LA + AQ I LLKN H LLPL K++ V ALIGPNA++ +LG+
Sbjct: 401 EKAKKEVRSEESVTLARRVAQASITLLKNEHSLLPLNKNRKV--ALIGPNADNRYNMLGD 458
Query: 441 YAGPSCRS--ITPLQALQNYVENTV--YYPGCDTVACSSASIDKAVDIAKGADHVVLMMG 496
Y P + T L ++ + ++ Y GC + I++AV A+ ++ ++ ++G
Sbjct: 459 YTAPQEEANIKTVLDGIRTKLSSSQVEYVKGCSIRDTVTTDIEQAVAAAQRSEIIIAVVG 518
Query: 497 -----------------------LDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVI 533
+ + E DR L L G+QQEL+ + + KP++
Sbjct: 519 GSSARDFKTSYKETGAAIANEKTISDMECGEGFDRATLSLLGKQQELL-KALKTTGKPLV 577
Query: 534 LVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDY 593
+V + G P+D +A N ++L A YPG+ G +A+A+V+FGD NP GRLP + P+
Sbjct: 578 VVYIEGRPLDKNWAS--ENADAVLTAYYPGQEGGIAIADVLFGDFNPAGRLPFS-VPRSV 634
Query: 594 IKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQS 653
++P+ K PQ+ Y ++PFG GLSY+ + Y
Sbjct: 635 GQIPLYYNKKAPQS------HDYVEMSASPLYPFGYGLSYTSFDYS-------------- 674
Query: 654 SSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNG 713
+L R F V+ V+N G+ G+ L+++
Sbjct: 675 ------------------DLHLSALTPRSFEVSFKVRNTGKYDGEEVAQLYLRDEYASVV 716
Query: 714 RPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
+P+KQL F L E+ E+ F LS E S + ++E GT +++G
Sbjct: 717 QPLKQLKHFARFYLKRGEEREVKFILSE-EDFSLVDRNLKSIVEPGTFQIMIG 768
>gi|212692496|ref|ZP_03300624.1| hypothetical protein BACDOR_01992 [Bacteroides dorei DSM 17855]
gi|212664971|gb|EEB25543.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
dorei DSM 17855]
Length = 864
Score = 273 bits (699), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 163/443 (36%), Positives = 241/443 (54%), Gaps = 33/443 (7%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
+ + L +RA DL+ +LTL+EK++ +++++ + RLGI Y WW+EALHGVA G
Sbjct: 24 YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------MTFW 157
AT FPQ I AASF+ + I A+ EARA A A G +T W
Sbjct: 82 --------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMW 133
Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
P +NI+RDPRWGRG ET GEDP +T V+ V+G+Q N K AC KHF
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCTDANQKYDK----IHACAKHF 189
Query: 218 TAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
+ W R++F+A + +DL +TY PFE+ VK+G+ +MCAYNR+ G P C
Sbjct: 190 AVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEGKVKEVMCAYNRLEGDPCCG 246
Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDVNCGSF 334
LL + R++WG+ G + SDC A+ Y +G+ P E A + +G D+ CGS
Sbjct: 247 SDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGHKTHPDAESASAAAVLSGTDLECGSS 306
Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
+ ++A K+ + E +ID ++ L R LG + +P+ + KI VVCS H
Sbjct: 307 YKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMD-DPSKVEWTKIPYSVVCSAEHDS 364
Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
L+L A+ + LL N + +LPL K ++A++GPNAN + GNY G +IT L+
Sbjct: 365 LSLDIARKSMTLLLNKNNILPL-KRGGQTIAVMGPNANDSVMQWGNYNGTPKHTITLLEG 423
Query: 455 LQNYV---ENTVYYPGCDTVACS 474
+++ + + +Y GC V S
Sbjct: 424 IRSAMGENDKLIYEQGCSWVERS 446
Score = 115 bits (289), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 90/276 (32%), Positives = 133/276 (48%), Gaps = 54/276 (19%)
Query: 477 SIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAE 526
+I V K AD V+ G+ + E EE+ DR D+ LP Q+ELI + +
Sbjct: 591 NIKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCD 650
Query: 527 AAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPM 586
A KK VI V G P I + +IL A YPG++G A AEV+FGD+NP GRLP+
Sbjct: 651 AGKK-VIFVNFSGSP--IAMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYNPAGRLPV 707
Query: 587 TWYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQ 645
T+Y ++ ++P D M GRTYR+++G +FPFG GLSY+ ++Y
Sbjct: 708 TFY-RNIAQLPDFEDYNM--------TGRTYRYFKGDPLFPFGYGLSYTTFNY------- 751
Query: 646 NKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFV 705
+ + L+Q+ + ET K + I V N G G+ V +++
Sbjct: 752 DNIKLDQT---------------------IKVGETAKMV--IPVTNAGNRDGEEVVQVYL 788
Query: 706 KPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSP 741
K G P K L F+ V + A + + EL+P
Sbjct: 789 KKQEDAEG-PAKTLRAFKRVQIPAGKTVNVELELTP 823
>gi|265752711|ref|ZP_06088280.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
3_1_33FAA]
gi|263235897|gb|EEZ21392.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
3_1_33FAA]
Length = 864
Score = 273 bits (699), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 163/443 (36%), Positives = 241/443 (54%), Gaps = 33/443 (7%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
+ + L +RA DL+ +LTL+EK++ +++++ + RLGI Y WW+EALHGVA G
Sbjct: 24 YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------MTFW 157
AT FPQ I AASF+ + I A+ EARA A A G +T W
Sbjct: 82 --------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMW 133
Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
P +NI+RDPRWGRG ET GEDP +T V+ V+G+Q N K AC KHF
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCTDANQKYDK----IHACAKHF 189
Query: 218 TAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
+ W R++F+A + +DL +TY PFE+ VK+G+ +MCAYNR+ G P C
Sbjct: 190 AVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEGKVKEVMCAYNRLEGDPCCG 246
Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDVNCGSF 334
LL + R++WG+ G + SDC A+ Y +G+ P E A + +G D+ CGS
Sbjct: 247 SDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGHKTHPDAESASAAAVLSGTDLECGSS 306
Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
+ ++A K+ + E +ID ++ L R LG + +P+ + KI VVCS H
Sbjct: 307 YKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMD-DPSKVEWTKIPYSVVCSAEHDS 364
Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
L+L A+ + LL N + +LPL K ++A++GPNAN + GNY G +IT L+
Sbjct: 365 LSLDIARKSMTLLLNKNNILPL-KRGGQTIAVMGPNANDSVMQWGNYNGTPKHTITLLEG 423
Query: 455 LQNYV---ENTVYYPGCDTVACS 474
+++ + + +Y GC V S
Sbjct: 424 IRSAMGENDKLIYEQGCSWVERS 446
Score = 114 bits (285), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 89/276 (32%), Positives = 129/276 (46%), Gaps = 54/276 (19%)
Query: 477 SIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAE 526
+I V K AD V+ G+ + E EE+ DR D+ LP Q+ELI + +
Sbjct: 591 NIKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCD 650
Query: 527 AAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPM 586
A KK VI V G P I + +IL A YPG++G A AEV+FGD+NP GRLP+
Sbjct: 651 AGKK-VIFVNFSGSP--IAMEPETQYCQAILQAWYPGQSGGKAAAEVLFGDYNPAGRLPV 707
Query: 587 TWYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQ 645
T+Y ++ ++P D M GRTYR+++G +FPFG GLSY+ ++Y + Q
Sbjct: 708 TFY-RNIAQLPDFEDYNM--------TGRTYRYFKGDPLFPFGYGLSYTTFNYGNIKLEQ 758
Query: 646 NKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFV 705
+ + KMV I V N G G+ V +++
Sbjct: 759 T---IKVGETAKMV---------------------------IPVTNTGNRDGEEVVQVYL 788
Query: 706 KPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSP 741
K G P K L F+ V + A + + EL+P
Sbjct: 789 KKQEDTEG-PAKTLRAFKRVQIPAGKTVNVELELTP 823
>gi|237709184|ref|ZP_04539665.1| glycoside hydrolase family 3 protein [Bacteroides sp. 9_1_42FAA]
gi|229456880|gb|EEO62601.1| glycoside hydrolase family 3 protein [Bacteroides sp. 9_1_42FAA]
Length = 864
Score = 273 bits (699), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 163/443 (36%), Positives = 241/443 (54%), Gaps = 33/443 (7%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
+ + L +RA DL+ +LTL+EK++ +++++ + RLGI Y WW+EALHGVA G
Sbjct: 24 YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------MTFW 157
AT FPQ I AASF+ + I A+ EARA A A G +T W
Sbjct: 82 --------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMW 133
Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
P +NI+RDPRWGRG ET GEDP +T V+ V+G+Q N K AC KHF
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCTDANQKYDK----IHACAKHF 189
Query: 218 TAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
+ W R++F+A + +DL +TY PFE+ VK+G+ +MCAYNR+ G P C
Sbjct: 190 AVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEGKVKEVMCAYNRLEGDPCCG 246
Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDVNCGSF 334
LL + R++WG+ G + SDC A+ Y +G+ P E A + +G D+ CGS
Sbjct: 247 SDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGHKTHPDAESASAAAVLSGTDLECGSS 306
Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
+ ++A K+ + E +ID ++ L R LG + +P+ + KI VVCS H
Sbjct: 307 YKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMD-DPSKVEWTKIPYSVVCSAEHDS 364
Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
L+L A+ + LL N + +LPL K ++A++GPNAN + GNY G +IT L+
Sbjct: 365 LSLDIARKSMTLLLNKNNILPL-KRGGQTIAVMGPNANDSVMQWGNYNGTPKHTITLLEG 423
Query: 455 LQNYV---ENTVYYPGCDTVACS 474
+++ + + +Y GC V S
Sbjct: 424 IRSAMGENDKLIYEQGCSWVERS 446
Score = 115 bits (289), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 90/276 (32%), Positives = 133/276 (48%), Gaps = 54/276 (19%)
Query: 477 SIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAE 526
+I V K AD V+ G+ + E EE+ DR D+ LP Q+ELI + +
Sbjct: 591 NIKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCD 650
Query: 527 AAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPM 586
A KK VI V G P I + +IL A YPG++G A AEV+FGD+NP GRLP+
Sbjct: 651 AGKK-VIFVNFSGSP--IAMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYNPAGRLPV 707
Query: 587 TWYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQ 645
T+Y ++ ++P D M GRTYR+++G +FPFG GLSY+ ++Y
Sbjct: 708 TFY-RNIAQLPDFEDYNM--------TGRTYRYFKGDPLFPFGYGLSYTTFNY------- 751
Query: 646 NKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFV 705
+ + L+Q+ + ET K + I V N G G+ V +++
Sbjct: 752 DNIKLDQT---------------------IKVGETAKMV--IPVTNAGNRDGEEVVQVYL 788
Query: 706 KPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSP 741
K G P K L F+ V + A + + EL+P
Sbjct: 789 KKQEDAEG-PAKTLRAFKRVQIPAGKTVNVELELTP 823
>gi|299149391|ref|ZP_07042448.1| beta-glucosidase [Bacteroides sp. 3_1_23]
gi|298512578|gb|EFI36470.1| beta-glucosidase [Bacteroides sp. 3_1_23]
Length = 853
Score = 273 bits (699), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 159/418 (38%), Positives = 240/418 (57%), Gaps = 34/418 (8%)
Query: 51 PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
P+ +R DL+SRLT++EKIS L ++P IPRLGI Y +EALHGV G+
Sbjct: 36 PVHERVADLLSRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPGR-------- 87
Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ---------AIGMTFWAPNI 161
T FPQ I AA+++ L R+ I EARA +N + +TFW+P +
Sbjct: 88 --FTVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQFSDVLTFWSPTV 145
Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
N+ RDPRWGR ET GEDP ++G ++V+G+QGD L+ + KHF A
Sbjct: 146 NMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQGDD------PRYLKIVSTPKHFAA-- 197
Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
N + R+ + +++ + L + Y P FE CVK+G+A+ IM AYN +N +P + LL
Sbjct: 198 --NNEEHNRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMTAYNALNDVPCTLNAWLL 255
Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTK 340
K R+ WGF GY+ SDC S++ +A Y K+ E A ++AG+D+ CG ++
Sbjct: 256 KKVLRQDWGFQGYVVSDCGGPSLLVNAHKYVKTKEAAATLSIQAGLDLECGDDVYDEYLL 315
Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
A KQ + +++ID A ++ + RM+LGLF+G P+ +I V+ S HQ +AL AA
Sbjct: 316 NAYKQYMVSDADIDSAACHVLTARMKLGLFDGT-ERNPYTRISPSVIGSKEHQQIALDAA 374
Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQALQN 457
++ IVLLKN + +LPL +K S+A++G NA K G+Y+G P ++ LQ +++
Sbjct: 375 RECIVLLKNKNNMLPLNVNKVKSIAVVGINA--GKCEFGDYSGAPVVDPVSILQGIKD 430
Score = 117 bits (292), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 79/278 (28%), Positives = 137/278 (49%), Gaps = 43/278 (15%)
Query: 489 DHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAK 548
+ VV +MG++++ E+E DR D+ LP Q+E + + + +I+VL+ G + + +
Sbjct: 605 ETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNPN-IIVVLVAGSSLAVNWM- 662
Query: 549 YDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQAT 608
D +I +I+ A YPGE G A+A+V+FGD+NP GRLP+T+Y + ++P D
Sbjct: 663 -DEHIPAIVNAWYPGEQGGTAVADVLFGDYNPAGRLPLTYY-KSLDELPAFD------DY 714
Query: 609 SGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYK 668
GRTY++++G ++PFG GLSYS + Y S K+ + + +
Sbjct: 715 DITKGRTYKYFKGDVLYPFGYGLSYSSFKY---------------SDLKVKDGANTIS-- 757
Query: 669 SVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILN 728
V+ +KN G+ G ++V+ G PIK+L GF+ + L
Sbjct: 758 ----------------VSFRLKNTGKRKGDEVAQVYVRIPETGGVVPIKELKGFRRIPLK 801
Query: 729 AKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
+ E + EL + G ++ +G ++VG
Sbjct: 802 SGESRVVDIELDKEQLRYWDAGLGQFIVPQGAFDIMVG 839
>gi|291292288|gb|ADD92016.1| Xyl3C [Prevotella bryantii B14]
Length = 857
Score = 273 bits (699), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 166/438 (37%), Positives = 240/438 (54%), Gaps = 33/438 (7%)
Query: 41 ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
+ P+ +L QRA DL SRLTL+EK + N +PAI RL IPA+EWWSEALHG A
Sbjct: 22 QHLPYQNPSLSAEQRAEDLCSRLTLEEKCKLMQNGSPAIKRLNIPAFEWWSEALHGTARN 81
Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLE-------ARALYNAGQAIG 153
G AT FP AAS++ L +I AIG E AR N + G
Sbjct: 82 G----------FATVFPNTTGMAASWNDQLLLQIFSAIGNESRIKNTLARKSGNIKRYQG 131
Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
++ W PNINIFRDPRWGRGQET GEDP +TGK ++ V G+QG K + AC
Sbjct: 132 LSIWTPNINIFRDPRWGRGQETYGEDPYLTGKMGLAVVEGLQGP-----KNSKYYKLLAC 186
Query: 214 CKHFTAYDLDNWKGTTRYKFD-ARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGI 272
KHF + + R+ F+ + +DL +TY P F++ +++G + +MCAY+ ++G+
Sbjct: 187 AKHFAVHSGPEY---LRHSFNIENLPARDLWETYLPAFKTLIQEGNVAEVMCAYHSMDGL 243
Query: 273 PSCADRNLLSKTARRQWGFHGYITSDCDAVS--IIYDAEGYAKSPEDAVVDVLKAGMDVN 330
P C L + R+ GF G + SDC A+ I A+ A + AG DV
Sbjct: 244 PCCGSNKYLQQILRQDLGFKGMVVSDCGAIGDFWIQGRHEVAQDAAQASAQAVLAGTDVE 303
Query: 331 CGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSP 390
CG+ K AVK+ ++ E +I+ ++ L R +LG F+ + M + ++ ++
Sbjct: 304 CGANYDK-LPEAVKRGEISEEKINVSVMRLLKARFKLGDFDSD-NMVEWTQLPESLIACS 361
Query: 391 AHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSIT 450
H+ LA Q AQ+ + LLKN +G+LPL K+ +A++G NAN + L GNY G ++I+
Sbjct: 362 KHKQLAYQMAQESMTLLKN-NGILPL--QKNARIAVMGANANDSIMLWGNYNGYPTKTIS 418
Query: 451 PLQALQNYVENTVYYPGC 468
L+ LQN ++ Y PGC
Sbjct: 419 ILEGLQNKSKHISYIPGC 436
Score = 103 bits (258), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 88/290 (30%), Positives = 127/290 (43%), Gaps = 59/290 (20%)
Query: 488 ADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLL 537
A V+ + G+ E EE+ DR + LP Q+++I + + KK VI V
Sbjct: 600 AQVVIYVGGISPRLEGEEMKVNELGFKGGDRTTIELPQSQRDMIALLHNSGKK-VIFVNC 658
Query: 538 CGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVP 597
GG I RN +IL A Y GE G A+A+V+FGD+NP G+LP+T+Y D
Sbjct: 659 SGGA--IALEPESRNADAILQAWYGGEMGGQAVADVLFGDYNPNGKLPVTFYKNDSQLPD 716
Query: 598 MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTK 657
D M+ GRTYR+ ++PFG GLSY+ ++Y
Sbjct: 717 YNDYTMK--------GRTYRYLHQAPLYPFGYGLSYTTFAY------------------- 749
Query: 658 MVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIK 717
D Y + RK +++ V N G+ G + ++++ NG PIK
Sbjct: 750 -----DNAKY-----------DRRKGNLSLEVTNTGKCEGTTTIQVYIRRTADING-PIK 792
Query: 718 QLVGFQSVILNAKEKAEIVFELSPCESLSRARE-DGLMVIEEGTHFLVVG 766
L FQ V L A EK + L P E E M I G + ++VG
Sbjct: 793 TLKAFQKVSLQANEKKRVTINL-PRERFEGWDETTNTMRIVPGKYEIMVG 841
>gi|261880507|ref|ZP_06006934.1| xylosidase [Prevotella bergensis DSM 17361]
gi|270332847|gb|EFA43633.1| xylosidase [Prevotella bergensis DSM 17361]
Length = 948
Score = 273 bits (699), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 206/696 (29%), Positives = 330/696 (47%), Gaps = 90/696 (12%)
Query: 105 FFNGTIRG-----ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWA 158
F N IRG AT+FP + +++ L ++G G EAR L G T +A
Sbjct: 168 FTNEGIRGIESYKATNFPTQLGLGTTWNRQLIRQVGYITGREARLL-------GYTNVYA 220
Query: 159 PNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFT 218
P +++ RD RWGR +E GE P + + + RG+Q D Q ++ KHF
Sbjct: 221 PILDVGRDQRWGRYEEIYGESPFLVAELGIQMTRGLQTD----------FQVASTAKHFA 270
Query: 219 AYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADR 278
AY + + D ++ +++ + + P+E V++ G M +YN +GIP
Sbjct: 271 AYSNNKGGREGMSRVDPQMPPREVENIHLYPWERVVQEAGLLGAMSSYNDYDGIPIQGSY 330
Query: 279 NLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC-----GS 333
+ L++ R ++GF GYI SD DA+ ++ A ++AV + AG++V C S
Sbjct: 331 HWLTEVLRHRFGFRGYIVSDSDALEYLFSKHHTAADMKEAVYQAVMAGLNVRCTFRSPDS 390
Query: 334 FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQ 393
F+ + +++ ++P S IDR + ++ V+ G+F+ NP K V S +Q
Sbjct: 391 FVLP-LRELIREGRIPMSVIDRLVGDILRVKFITGIFD-NPYQMNL-KAADQEVNSERNQ 447
Query: 394 VLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQ 453
+ALQA++ IVLLKN LLPL +SK + + GPNA+ A L +Y + T L+
Sbjct: 448 AVALQASRQSIVLLKNQDRLLPLDRSKLRRILVCGPNADDASYALTHYGPLAVDVTTVLE 507
Query: 454 ALQNYVENTV---YYPGCDTV---------------ACSSASIDKAVDIAKGADHVVLMM 495
+++ VEN + Y GCD V + ID AV +AK +D ++++
Sbjct: 508 GIRDKVENNIEVSYAKGCDVVDPHWPESEIIGYPMTSQEQQDIDHAVALAKESDVAIVVL 567
Query: 496 GLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGS 555
G + E R L LPGRQ +L+ + +A KPV+LVL+ G P+ + +A DR I +
Sbjct: 568 GGNSRTCGENKSRSSLDLPGRQLDLL-KAVQATGKPVVLVLINGRPLSVNWA--DRFIPA 624
Query: 556 ILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMT-DMKMRPQATSGNP-- 612
I+ A YPG G A+A+V+FGD+NPGG+L +T +P+ ++P K Q GN
Sbjct: 625 IVEAWYPGSQGGTAVADVLFGDYNPGGKLTVT-FPKSVGQIPFNFPSKPASQVDGGNKLG 683
Query: 613 --GRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSV 670
G R ++ FG GLSY+ + Y +S+ + LN S +
Sbjct: 684 LQGNASRI--NGALYSFGHGLSYTTFKYSNLRLSKETMTLNDSIN--------------- 726
Query: 671 PELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAK 730
++ V N G+ G V L+++ K L GF + L
Sbjct: 727 --------------ISCDVSNTGDREGDEVVQLYIRDVISSVTTYEKNLRGFDRIHLKPG 772
Query: 731 EKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
E + F + P E L +D V+E G +++G
Sbjct: 773 ETKTLTFTIKP-EHLKLVNKDFEKVVEPGEFKIMIG 807
>gi|345514226|ref|ZP_08793739.1| glycoside hydrolase family beta-glycosidase [Bacteroides dorei
5_1_36/D4]
gi|229437207|gb|EEO47284.1| glycoside hydrolase family beta-glycosidase [Bacteroides dorei
5_1_36/D4]
Length = 864
Score = 273 bits (698), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 163/443 (36%), Positives = 241/443 (54%), Gaps = 33/443 (7%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
+ + L +RA DL+ +LTL+EK++ +++++ + RLGI Y WW+EALHGVA G
Sbjct: 24 YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------MTFW 157
AT FPQ I AASF+ + I A+ EARA A A G +T W
Sbjct: 82 --------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMW 133
Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
P +NI+RDPRWGRG ET GEDP +T V+ V+G+Q N K AC KHF
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCTDANQKYDK----IHACAKHF 189
Query: 218 TAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
+ W R++F+A + +DL +TY PFE+ VK+G+ +MCAYNR+ G P C
Sbjct: 190 AVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEGKVKEVMCAYNRLEGDPCCG 246
Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDVNCGSF 334
LL + R++WG+ G + SDC A+ Y +G+ P E A + +G D+ CGS
Sbjct: 247 SDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGHKTHPDAESASAAAVLSGTDLECGSS 306
Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
+ ++A K+ + E +ID ++ L R LG + +P+ + KI VVCS H
Sbjct: 307 YKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMD-DPSKVEWTKIPYSVVCSAEHDS 364
Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
L+L A+ + LL N + +LPL K ++A++GPNAN + GNY G +IT L+
Sbjct: 365 LSLDIARKSMTLLLNKNNILPL-KRGGQTIAVMGPNANDSVMQWGNYNGTPKHTITLLEG 423
Query: 455 LQNYV---ENTVYYPGCDTVACS 474
+++ + + +Y GC V S
Sbjct: 424 IRSAMGENDKLIYEQGCSWVERS 446
Score = 115 bits (289), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 90/276 (32%), Positives = 133/276 (48%), Gaps = 54/276 (19%)
Query: 477 SIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAE 526
+I V K AD V+ G+ + E EE+ DR D+ LP Q+ELI + +
Sbjct: 591 NIKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCD 650
Query: 527 AAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPM 586
A KK VI V G P I + +IL A YPG++G A AEV+FGD+NP GRLP+
Sbjct: 651 AGKK-VIFVNFSGSP--IAMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYNPAGRLPV 707
Query: 587 TWYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQ 645
T+Y ++ ++P D M GRTYR+++G +FPFG GLSY+ ++Y
Sbjct: 708 TFY-RNIAQLPDFEDYNM--------TGRTYRYFKGDPLFPFGYGLSYTTFNY------- 751
Query: 646 NKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFV 705
+ + L+Q+ + ET K + I V N G G+ V +++
Sbjct: 752 DNIKLDQT---------------------IKVGETAKMV--IPVTNAGNRDGEEVVQVYL 788
Query: 706 KPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSP 741
K G P K L F+ V + A + + EL+P
Sbjct: 789 KKQEDAEG-PAKTLRAFKRVQIPAGKTVNVELELTP 823
>gi|423271149|ref|ZP_17250120.1| hypothetical protein HMPREF1079_03202 [Bacteroides fragilis
CL05T00C42]
gi|423274973|ref|ZP_17253919.1| hypothetical protein HMPREF1080_02572 [Bacteroides fragilis
CL05T12C13]
gi|392699073|gb|EIY92255.1| hypothetical protein HMPREF1079_03202 [Bacteroides fragilis
CL05T00C42]
gi|392704252|gb|EIY97391.1| hypothetical protein HMPREF1080_02572 [Bacteroides fragilis
CL05T12C13]
Length = 859
Score = 273 bits (698), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 224/803 (27%), Positives = 360/803 (44%), Gaps = 142/803 (17%)
Query: 42 TFPFCKTTLPISQRARDLVSRLTLDEKISQ------------------------------ 71
F + +LP+ R +DL+SR+TL+EKI+Q
Sbjct: 23 NFKYKNASLPVEVRVQDLLSRMTLEEKIAQMRHIHAYSIMENGKLNEEKLEKMIGGQNYG 82
Query: 72 --------------LVNSAPAI----PRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
L+N RLGIP + +E+LHG G+
Sbjct: 83 FIEGITLPGKECLTLMNEVQKYMREKTRLGIPVFTL-TESLHG-----------SVHDGS 130
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQ 173
T FPQ I ++F+ L Y + AI E A Q I + P I++ RD RWGR +
Sbjct: 131 TIFPQAIALGSTFNPILAYEMTSAIAKELSA-----QGITQSL-TPVIDVCRDLRWGRVE 184
Query: 174 ETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKF 233
E GEDP + + VS VRG + Q S KHF A+ G +
Sbjct: 185 ECFGEDPFLVSRMGVSQVRGYLDN-----------QVSPMIKHFGAH------GAPQGGL 227
Query: 234 D-ARVT--MQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWG 290
+ A V+ ++L Y FE+ VK+ + +M +YN N P+ + L+++ R +W
Sbjct: 228 NLASVSCGQRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNEPNSSSHYLMTELLRDRWD 287
Query: 291 FHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPE 350
F GY+ SD A+ ++ A++ +A + L AG+D + V+ L
Sbjct: 288 FQGYVYSDWGAIGMLNYFHKTAQNSAEAAIQALTAGLDAEASDNSYAELQQLVENGMLDV 347
Query: 351 SEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNS 410
ID+A+ + + + +GLF P P K VV +PAH LA + A++ IVLL+N
Sbjct: 348 KYIDQAVARILTAKFNMGLFE-YPL--PMEKNYDKVVHAPAHVSLARKIAEESIVLLQNE 404
Query: 411 HGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV---YYPG 467
+ +LPL +K S+A+IGPNA+ + ++ + +T L+AL+ V N + Y G
Sbjct: 405 NNILPLQMNKLKSIAVIGPNADQVQFGDYTWSRDNKDGVTLLEALKERVGNQLTLNYAKG 464
Query: 468 CDTVACSSASIDKAVDIAKGADHVVLMMG---------LDQTQEKEELDRVDLVLPGRQQ 518
CD V + +AVD+AK +D ++++G E D DL L G Q+
Sbjct: 465 CDLVTDDRSGFKEAVDVAKKSDVCIVVVGSASASLARDYSNATCGEGFDLSDLTLTGVQE 524
Query: 519 ELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDH 578
+L+ + A KPVI+VLL G P +++ K NI I+ YPGE G +ALA+++ G
Sbjct: 525 DLVEAI-HATGKPVIVVLLSGKPFAMSWIK--ENIPGIVVQWYPGEQGGLALADMLLGKV 581
Query: 579 NPGGRLPMTWYPQD-------YIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGL 631
NP G+L + +PQ Y +P R + PG+ Y F K ++ FG GL
Sbjct: 582 NPSGKLNYS-FPQSVGHLPCYYNYLPTDKGFYRSPGSKNKPGKDYVFSSPKALWAFGHGL 640
Query: 632 SYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKN 691
SY+ + YL+ ++S + +DV+ VTI ++N
Sbjct: 641 SYTDFE-----------YLSATTSKEDYACEDVIE------------------VTIAIRN 671
Query: 692 HGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRARED 751
G+ G ++V+ P+++L GF+ V++ E +++ ++ P L+ ++
Sbjct: 672 TGDYDGLEVPQVYVRDMVSSVVMPVQELKGFEKVLIKKGETKQVIIKI-PVSELALYNKE 730
Query: 752 GLMVIEEGTHFLVVGDEEYPISI 774
V+E G L +G I I
Sbjct: 731 MKKVVEPGAFELQIGRASDDIRI 753
>gi|270296098|ref|ZP_06202298.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D20]
gi|270273502|gb|EFA19364.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D20]
Length = 798
Score = 273 bits (698), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 215/764 (28%), Positives = 357/764 (46%), Gaps = 133/764 (17%)
Query: 51 PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WSEALHGVAGVGKGI 104
P+ R ++L+S++TL+EK Q+ + + +P+ W W + + + G+
Sbjct: 59 PLEARVQNLLSQMTLEEKSCQMATLYGSGRVLNDALPSDNWKNEVWKDGIGNIDEEHNGL 118
Query: 105 ------------------------------------FFNGTIRG-----ATSFPQVILTA 123
F N IRG AT FP
Sbjct: 119 GSFKSAYSFPYAHHVKTKHAIQRWFVENTRLGIPVDFTNEGIRGLCHDRATYFPAQCGQG 178
Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGEDPLV 182
A+++ L +IG+A EA L G T ++P ++I +DPRWGR ET GEDP
Sbjct: 179 ATWNKELIAQIGEAEAREASVL-------GYTNIYSPILDIAQDPRWGRCVETYGEDPYH 231
Query: 183 TGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDL 242
G+ + +Q K KL ++ KHF Y + + + D V +++
Sbjct: 232 AGQMGKQMILSLQ---------KNKLVSTP--KHFAVYSIPVGGRDGKTRTDPHVAPREM 280
Query: 243 ADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAV 302
Y PF + A G+M +YN +G P + L++ R++WGF GY+ SD +AV
Sbjct: 281 RTLYLDPFRVAFHEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKGYVVSDSEAV 340
Query: 303 SIIYDAEGYAKSPEDAVVDVLKAGMDVNC-----GSFLQKHTKAAVKQKKLPESEIDRAL 357
I A EDAV + AG+++ F+ ++AVK+ K+ + +++ +
Sbjct: 341 EFISTKHQVANGYEDAVAQAVNAGLNIRTHFTPPADFILP-LRSAVKKGKISQETLNQRV 399
Query: 358 HNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLP 417
+ V+ LGLF+ NP + G +V SP HQ LAL+AA+ +VLLKN H LPL
Sbjct: 400 AEILRVKFWLGLFD-NPYRGDEKRAG-QIVHSPEHQQLALEAARQSLVLLKNEHQTLPLS 457
Query: 418 KSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL-QALQNYV--ENTVYYPGCDTV--- 471
KS S+A+IGPNA+ + L+ Y GP+ IT + + ++ + + VY GCD +
Sbjct: 458 KSIR-SVAVIGPNADERQQLICRY-GPANAHITTIYEGIKKMLPQADVVYKKGCDIIDPH 515
Query: 472 ------------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQE 519
A + +++A++ AKGA+ V+++G ++ +E+ R L LPGRQ+E
Sbjct: 516 FPESEVLEFPKAAQEAQMMEEAIEAAKGAEVTVMVLGGNELTVREDRSRTSLDLPGRQKE 575
Query: 520 LITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHN 579
L+ ++ + KPV+LV++ G I FA ++ +I+ A +PGE G A+AE +FGD+N
Sbjct: 576 LLKKICQLG-KPVVLVMIDGRASSINFAA--THVPAIIHAWFPGEFGGQAIAEALFGDYN 632
Query: 580 PGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYK 639
PGGRL +T +P+ ++P +P + + Y ++PFG GLSY+ + Y
Sbjct: 633 PGGRLAVT-FPKSVGQIPFA-FPFKPGSDESSETSVY-----GALYPFGHGLSYTTFQYS 685
Query: 640 FKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKH 699
A+S +K + + S ++ +KN G+ G
Sbjct: 686 DLAISPSKQGVQGNIS-----------------------------ISCTIKNIGQREGDE 716
Query: 700 PVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCE 743
V L+++ + L GF+ + L + + FEL+P E
Sbjct: 717 VVQLYLRDEVSSVTTYTQVLRGFERITLKPEASHTVHFELTPQE 760
>gi|423230604|ref|ZP_17217008.1| hypothetical protein HMPREF1063_02828 [Bacteroides dorei
CL02T00C15]
gi|423244313|ref|ZP_17225388.1| hypothetical protein HMPREF1064_01594 [Bacteroides dorei
CL02T12C06]
gi|392630748|gb|EIY24734.1| hypothetical protein HMPREF1063_02828 [Bacteroides dorei
CL02T00C15]
gi|392642494|gb|EIY36260.1| hypothetical protein HMPREF1064_01594 [Bacteroides dorei
CL02T12C06]
Length = 864
Score = 273 bits (698), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 163/443 (36%), Positives = 241/443 (54%), Gaps = 33/443 (7%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
+ + L +RA DL+ +LTL+EK++ +++++ + RLGI Y WW+EALHGVA G
Sbjct: 24 YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------MTFW 157
AT FPQ I AASF+ + I A+ EARA A A G +T W
Sbjct: 82 --------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMW 133
Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
P +NI+RDPRWGRG ET GEDP +T V+ V+G+Q N K AC KHF
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCTDANQKYDK----IHACAKHF 189
Query: 218 TAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
+ W R++F+A + +DL +TY PFE+ VK+G+ +MCAYNR+ G P C
Sbjct: 190 AVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEGKVKEVMCAYNRLEGDPCCG 246
Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDVNCGSF 334
LL + R++WG+ G + SDC A+ Y +G+ P E A + +G D+ CGS
Sbjct: 247 SDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGHKTHPDAESASAAAVLSGTDLECGSS 306
Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
+ ++A K+ + E +ID ++ L R LG + +P+ + KI VVCS H
Sbjct: 307 YKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMD-DPSKVEWTKIPYSVVCSAEHDS 364
Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
L+L A+ + LL N + +LPL K ++A++GPNAN + GNY G +IT L+
Sbjct: 365 LSLDIARKSMTLLLNKNNILPL-KRGGQTIAVMGPNANDSVMQWGNYNGTPKHTITLLEG 423
Query: 455 LQNYV---ENTVYYPGCDTVACS 474
+++ + + +Y GC V S
Sbjct: 424 IRSAMGENDKLIYEQGCSWVERS 446
Score = 115 bits (288), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 89/276 (32%), Positives = 129/276 (46%), Gaps = 54/276 (19%)
Query: 477 SIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAE 526
+I V K AD V+ G+ + E EE+ DR D+ LP Q+ELI + +
Sbjct: 591 NIKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCD 650
Query: 527 AAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPM 586
A KK VI V G P I + +IL A YPG++G A AEV+FGD+NP GRLP+
Sbjct: 651 AGKK-VIFVNFSGSP--IAMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYNPAGRLPV 707
Query: 587 TWYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQ 645
T+Y ++ ++P D M GRTYR+++G +FPFG GLSY+ ++Y + Q
Sbjct: 708 TFY-RNIAQLPDFEDYNM--------TGRTYRYFKGDPLFPFGYGLSYTTFNYDNIKLEQ 758
Query: 646 NKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFV 705
+ + KMV I V N G G+ V +++
Sbjct: 759 T---IKVGETAKMV---------------------------IPVTNTGNRDGEEVVQVYL 788
Query: 706 KPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSP 741
K G P K L F+ V + A + + EL+P
Sbjct: 789 KKQEDTEG-PTKTLRAFKRVQIPAGKTVNVELELTP 823
>gi|198274480|ref|ZP_03207012.1| hypothetical protein BACPLE_00628 [Bacteroides plebeius DSM 17135]
gi|198272682|gb|EDY96951.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
plebeius DSM 17135]
Length = 912
Score = 273 bits (698), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 223/798 (27%), Positives = 364/798 (45%), Gaps = 139/798 (17%)
Query: 51 PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WSEAL-----HGVAG 99
P+++R DL+S++T++EK Q+V + + +P +W W + + H A
Sbjct: 24 PLNERIEDLLSQMTVEEKTCQMVTLYGYQRVLKDSLPTPDWKNQLWKDGIGAIDEHLNAF 83
Query: 100 VGKGI------------------------------------FFNGTIRG-----ATSFPQ 118
G G+ F N IRG AT+FP
Sbjct: 84 RGWGVPPMQNELVWPASNHAWALNEVQRFFVEETRLGIPADFTNEGIRGVENYIATNFPT 143
Query: 119 VILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPG 177
+ +++ L +IG G EAR L G T +AP +++ RD RWGR +E G
Sbjct: 144 QLALGHTWNRELIRQIGYITGREARLL-------GYTNVYAPILDVGRDQRWGRYEEVYG 196
Query: 178 EDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARV 237
E P + + ++ +G+Q D +Q ++ KHF AY + + D ++
Sbjct: 197 ESPYLVAELGIAMGKGLQTD----------MQVASTAKHFIAYSNNKGAREGFARVDPQM 246
Query: 238 TMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITS 297
+ +++ + + PF +++ G+M +YN +G P + L++ R GF GY+ S
Sbjct: 247 SWREVENIHAYPFTRVIQEAGILGVMSSYNDYDGFPIQSSYYWLTQRLRGTMGFRGYVVS 306
Query: 298 DCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT-----KAAVKQKKLPESE 352
D DAV +Y AK ++AV ++AG++V C +F + + +++ L
Sbjct: 307 DSDAVEYLYSKHKTAKDMKEAVRQSVEAGLNVRC-TFRSPESYVLPLRELIQEGGLSMET 365
Query: 353 IDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHG 412
ID + ++ V+ GLF + Q + V S AHQ +ALQA+++G+VLLKN++
Sbjct: 366 IDNRVRDILRVKFLTGLF--DTPYQTDLALADKEVNSEAHQQVALQASREGLVLLKNANN 423
Query: 413 LLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN---TVYYPGCD 469
LLPL KS+ +A+ GPNA+ A L +Y + T L+ ++ V+ Y GCD
Sbjct: 424 LLPLDKSQIKRIAVCGPNADEASFALTHYGPVAVEVTTVLEGIKQQVKEGTKVTYTKGCD 483
Query: 470 TV---------------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLP 514
V A I KAVD K +D V+++G E R L LP
Sbjct: 484 LVDANWPESEIISYPLTAEEKTEIQKAVDNVKESDVAVVVLGGGIRTCGENKSRTSLDLP 543
Query: 515 GRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVI 574
G QQ+L+ + A KPV+LVL+ G P+ I +A D+ + +IL A YPG G A+AE +
Sbjct: 544 GHQQQLLEAIV-ATGKPVVLVLINGRPLSINWA--DKFVPAILEAWYPGSQGGTAIAEAL 600
Query: 575 FGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQA------TSGNPGRTYRFYEGKEVFPFG 628
FGD+NPGG+L +T +P+ ++P + +P + T G G R ++PFG
Sbjct: 601 FGDYNPGGKLTVT-FPKTVGQIPF-NFPAKPASQVDGGQTPGMKGNQSRI--NGPLYPFG 656
Query: 629 CGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIG 688
GLSY+ + Y N S+ ++ +++ V VT
Sbjct: 657 YGLSYTTFEYS-----------NLQLSSPVITDKEPV------------------TVTCK 687
Query: 689 VKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRA 748
+KN G +G V L+ + K L GF+ V L E ++ F+L P
Sbjct: 688 IKNTGTRSGDEVVQLYTRDVISSVTTYEKNLRGFERVHLEPGETKKVSFQLLP-RDFQLL 746
Query: 749 REDGLMVIEEGTHFLVVG 766
+D V+E G +++G
Sbjct: 747 NKDNHWVVEPGMFQIMIG 764
>gi|60682370|ref|YP_212514.1| hydrolase [Bacteroides fragilis NCTC 9343]
gi|60493804|emb|CAH08594.1| putative exported hydrolase [Bacteroides fragilis NCTC 9343]
Length = 859
Score = 273 bits (698), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 222/803 (27%), Positives = 358/803 (44%), Gaps = 142/803 (17%)
Query: 42 TFPFCKTTLPISQRARDLVSRLTLDEKISQ------------------------------ 71
F + +LP+ R +DL+SR+TL+EKI+Q
Sbjct: 23 NFKYKNASLPVEVRVQDLLSRMTLEEKIAQMRHIHAYSIMENGKLNEEKLEKMIGGQNYG 82
Query: 72 --------------LVNSAPAI----PRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
L+N RLGIP + +E+LHG G+
Sbjct: 83 FIEGITLPGKECLTLMNEVQKYMREKTRLGIPVFTL-TESLHG-----------SVHDGS 130
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQ 173
T FPQ I ++F+ L Y + AI E A Q I + P I++ RD RWGR +
Sbjct: 131 TIFPQAIALGSTFNPILAYEMTSAIAKELSA-----QGITQSL-TPVIDVCRDLRWGRVE 184
Query: 174 ETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKF 233
E GEDP + + VS VRG + Q S KHF A+ G +
Sbjct: 185 ECFGEDPFLVSRMGVSQVRGYLDN-----------QVSPMIKHFGAH------GAPQGGL 227
Query: 234 DARVTM---QDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWG 290
+ + ++L Y FE+ VK+ + +M +YN N P+ + L+++ R +W
Sbjct: 228 NLASVLCGQRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNEPNSSSHYLMTELLRDRWD 287
Query: 291 FHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPE 350
F GY+ SD A+ ++ A++ +A + L AG+D + V+ L
Sbjct: 288 FQGYVYSDWGAIGMLNYFHKTAQNSAEAAIQALTAGLDAEASDNSYAELQQLVENGMLDV 347
Query: 351 SEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNS 410
ID+A+ + + + +GLF P P K VV +PAH LA + A++ IVLL+N
Sbjct: 348 KYIDQAVARILTAKFNMGLFE-YPL--PMEKNYDKVVHAPAHVSLARKIAEESIVLLQNK 404
Query: 411 HGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV---YYPG 467
+ +LPL +K S+A+IGPNA+ + ++ + +T L+AL+ V N + Y G
Sbjct: 405 NNILPLQMNKLKSIAVIGPNADQVQFGDYTWSRDNKDGVTLLEALKERVGNQLTLNYAKG 464
Query: 468 CDTVACSSASIDKAVDIAKGADHVVLMMG---------LDQTQEKEELDRVDLVLPGRQQ 518
CD V + +AVD+AK +D ++++G E D DL L G Q+
Sbjct: 465 CDLVTDDRSGFKEAVDVAKKSDVCIVVVGSASASLARDYSNATCGEGFDLSDLTLTGVQE 524
Query: 519 ELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDH 578
+L+ + A KPVI+VLL G P +++ K NI I+ YPGE G +ALA+++ G
Sbjct: 525 DLVEAI-HATGKPVIVVLLSGKPFAMSWIK--ENIPGIVVQWYPGEQGGLALADMLLGKV 581
Query: 579 NPGGRLPMTWYPQD-------YIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGL 631
NP G+L + +PQ Y +P R + PG+ Y F K ++ FG GL
Sbjct: 582 NPSGKLNYS-FPQSVGHLPCYYNYLPTDKGFYRSPGSKNKPGKDYVFSSPKALWAFGHGL 640
Query: 632 SYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKN 691
SY+ + YL+ ++S + +DV+ VTI ++N
Sbjct: 641 SYTDFE-----------YLSATTSKEDYACEDVIE------------------VTIAIRN 671
Query: 692 HGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRARED 751
G+ G ++V+ P+++L GF+ V++ E +++ ++ P L+ ++
Sbjct: 672 TGDYDGLEVPQVYVRDMVSSVVMPVQELKGFEKVLIKKGETKQVIIKI-PVSELALYNKE 730
Query: 752 GLMVIEEGTHFLVVGDEEYPISI 774
V+E G L +G I I
Sbjct: 731 MKKVVEPGAFELQIGRASDDIRI 753
>gi|182415162|ref|YP_001820228.1| glycoside hydrolase family 3 [Opitutus terrae PB90-1]
gi|177842376|gb|ACB76628.1| glycoside hydrolase family 3 domain protein [Opitutus terrae
PB90-1]
Length = 747
Score = 273 bits (698), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 221/753 (29%), Positives = 350/753 (46%), Gaps = 81/753 (10%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
PF LP QR DL+ R+TL+EKI + A A+PRLG+ E HGVA G
Sbjct: 32 LPFQDPELPAEQRIDDLIGRMTLEEKIDCMAMRA-AVPRLGVKGSRH-IEGYHGVAQGGP 89
Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ--AIGMTFWAPN 160
+ T FPQ A++D L ++ EAR L+ + + G+ APN
Sbjct: 90 SNWGRRNPTATTQFPQAYGLGATWDPELIRQVAAQEAEEARYLFQSPRYDRAGLIVRAPN 149
Query: 161 INIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAY 220
++ RDPRWGR +E GEDP G A ++VRG+QGD +A + KHF A
Sbjct: 150 ADLARDPRWGRTEEVYGEDPFHAGTLATAFVRGLQGDDPR------YFKAVSLVKHFLAN 203
Query: 221 DLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNL 280
++ + ++ F R + Y PFE + G A +M AYN VNG P+ +
Sbjct: 204 SNEDGRESSSSNFSER----QWREYYAKPFEMAIVDGGAPALMAAYNAVNGTPAHV-HPM 258
Query: 281 LSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTK 340
L +W +G + +D + ++ + A +KAG++ FL +H
Sbjct: 259 LRDIVMAEWKLNGILCTDGGGLRLLVEKHHAFPDLPSAAAACVKAGIN----HFLDRHKD 314
Query: 341 A---AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIG----ADVVCSPAHQ 393
A AV + + E ++D AL LF V ++LGL + + + P+ IG A+ P Q
Sbjct: 315 AVTEAVARGSITERDLDAALRGLFRVSLKLGLLDPDERV-PYAAIGRNGEAEPWLRPDTQ 373
Query: 394 VLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQ 453
L + Q IVLLKNS LLPL ++K ++AL+GP N+ L Y G ++ P
Sbjct: 374 ALVRKVTQRSIVLLKNSGALLPLDRTKVKTVALVGPLVNT--VLPDWYGGTPPYTVPPSI 431
Query: 454 ALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLD------------QTQ 501
++ V V + D AV++A+ ++ ++ +G D ++
Sbjct: 432 GVEKVAGEGV------KVGWLADMGDAAVELARTSEIAIVCVGNDPISAGGWELVRTPSE 485
Query: 502 EKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGY 561
KE +DR DL LP Q++ I RV AA I+VL+ P + + +++ +I+ +
Sbjct: 486 GKEAVDRKDLALPRDQEKFIRRVL-AANPRTIVVLISNFPYAMPWVV--KHVPAIVHLTH 542
Query: 562 PGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKV-PMTDMKMRPQATSGNPGRTYRFYE 620
+ AL +V++G+ NP G+L TW P+ ++ PM D + GRTY++++
Sbjct: 543 ASQELGHALGDVLWGEVNPDGKLAQTW-PKSLKQLPPMMDYDL-------THGRTYQYFK 594
Query: 621 GKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCET 680
G+ FPFG GLSY+ ++ +S ++ L+ + H + E E
Sbjct: 595 GEPQFPFGFGLSYTTFN-----LSNLRVGLDVAR-----------HVGAGAETPAESPAP 638
Query: 681 RKFL------VTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAE 734
R F + + V N G AG V ++ + RP+KQL GFQ + + A E A
Sbjct: 639 RTFAPNAILSIAVEVTNTGTRAGDEVVQVYARYPHSKVSRPLKQLCGFQRISVAAGETAH 698
Query: 735 IVFELSPCESLSRAREDGLMVIEEGTHFLVVGD 767
+ +L + E V+E G L+VG+
Sbjct: 699 VRLQLPASRFAYWSVEQHAWVVEPGPVELLVGN 731
>gi|393719789|ref|ZP_10339716.1| glycosyl hydrolase family protein [Sphingomonas echinoides ATCC
14820]
Length = 896
Score = 273 bits (698), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 168/419 (40%), Positives = 232/419 (55%), Gaps = 47/419 (11%)
Query: 56 ARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATS 115
A LV+++T +EK+ QL+N APAIPRL IPAY WW+E+LHG G T+
Sbjct: 46 AATLVAQMTTEEKLPQLLNVAPAIPRLKIPAYNWWTESLHGALGPIP----------TTN 95
Query: 116 FPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ--------AIGMTFWAPNINIFRDP 167
FP+ I AASFD+ L +++ AI E RAL+ G+ G+ W+PNINIFRDP
Sbjct: 96 FPEPIGLAASFDAPLVHQVADAISTEVRALHTQGRQTGKLGRIGTGLDTWSPNINIFRDP 155
Query: 168 RWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKG 227
RWGRGQET GEDP + + V++V G+QG + ++ + KHF A+ +
Sbjct: 156 RWGRGQETYGEDPYLAARMGVAFVTGMQGPNPDLPRVI------STPKHFAAH---SGPE 206
Query: 228 TTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARR 287
+TR+ + V+ DL DTY P F + + +GRA IMCAYNR+ G P+CA LL R
Sbjct: 207 STRHAANVYVSPHDLEDTYLPAFRAAIVEGRAGSIMCAYNRIEGQPACASDLLLKDHLRG 266
Query: 288 QWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGS-------FLQKHTK 340
WGF GY+ SDCDAV I D YA AV +KAG+D C + L
Sbjct: 267 AWGFTGYVVSDCDAVKDIADNHKYAPDQATAVAAAMKAGVDNECNTQTIGDIGGLPDRFG 326
Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLF---NGNPTMQPFGKIGADVVCSPAHQVLAL 397
A+K+ + + +IDR L LFS R+R G G P + P I +P H LAL
Sbjct: 327 DALKRNLISQGDIDRTLVRLFSARLRNGDLPGVAGAPAVVPVSAI-----LTPDHIALAL 381
Query: 398 QAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQ 456
AA+ +VLLKN G+LPL +A++GP ++ + L GNY+ S +S P+ ++
Sbjct: 382 DAAEKSLVLLKND-GVLPL--RPGARIAVVGPLGDATRVLRGNYS--STQSAPPVSVVE 435
Score = 112 bits (279), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 89/282 (31%), Positives = 129/282 (45%), Gaps = 54/282 (19%)
Query: 496 GLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
GL E EE D+ L LP QQ L+ A+A KP+I+V + G P+++
Sbjct: 633 GLTSDMEAEETGTDIPGFKGGDKTSLDLPAEQQALLEH-AKATGKPLIVVAMNGSPINLA 691
Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRP 605
+AK N +I+ A YPG++G +A+A V+ G NP GRLP+T+Y P D M
Sbjct: 692 WAK--DNAAAIVEAWYPGQSGGLAVANVLTGKTNPAGRLPLTFYRSVADLPPFDDYAMT- 748
Query: 606 QATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVV 665
GRTYR++ G V+PFG GLS+++++Y V+
Sbjct: 749 -------GRTYRYFTGTPVYPFGYGLSFTRFAYAPLTVT--------------------- 780
Query: 666 HYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVK-PARRGNGRPIKQLVGFQS 724
P G R VT V N G+ +G L++ P + G P L G+Q
Sbjct: 781 -----PAAGGAETGLR---VTTEVSNTGQRSGDEVAQLYLNFPDQPGT--PRIALRGYQR 830
Query: 725 VILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
+ L E I F+LSP + LS DG + G + + VG
Sbjct: 831 ITLKPGEHRVISFDLSPRD-LSAVAVDGTRQVMAGQYRVSVG 871
>gi|265766195|ref|ZP_06094236.1| periplasmic beta-glucosidase [Bacteroides sp. 2_1_16]
gi|263253863|gb|EEZ25328.1| periplasmic beta-glucosidase [Bacteroides sp. 2_1_16]
Length = 859
Score = 273 bits (698), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 224/802 (27%), Positives = 360/802 (44%), Gaps = 142/802 (17%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQ------------------------------- 71
F + +LP+ R +DL+SR+TL+EKI+Q
Sbjct: 24 FKYKNASLPVEVRVQDLLSRMTLEEKIAQMRHIHAYSIMENGKLNEEKLEKMIGGQNYGF 83
Query: 72 -------------LVNSAPAI----PRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGAT 114
L+N RLGIP + +E+LHG G+T
Sbjct: 84 IEGITLPGKECLTLMNEVQKYMREKTRLGIPVFTL-TESLHG-----------SVHDGST 131
Query: 115 SFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQE 174
FPQ I ++F+ L Y + AI E A Q I + P I++ RD RWGR +E
Sbjct: 132 IFPQAIALGSTFNPILAYEMTSAIAKELTA-----QGITQSL-TPVIDVCRDLRWGRVEE 185
Query: 175 TPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFD 234
GEDP + + VS VRG + Q S KHF A+ G + +
Sbjct: 186 CFGEDPYLVSRMGVSQVRGYLDN-----------QVSPMIKHFGAH------GAPQGGLN 228
Query: 235 -ARVT--MQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGF 291
A V+ ++L Y FE+ VK+ + +M +YN N P+ + L+++ R +W F
Sbjct: 229 LASVSCGQRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNEPNSSSHYLMTELLRDRWDF 288
Query: 292 HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPES 351
GY+ SD A+ ++ A++ +A + L AG+D + V+ L
Sbjct: 289 QGYVYSDWGAIGMLNYFHKTAQNSAEAAIQALTAGLDAEASDNSYAELQQLVENGMLDVK 348
Query: 352 EIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSH 411
ID+A+ + + + +GLF P P K VV +PAH LA + A++ IVLL+N +
Sbjct: 349 YIDQAVARILTAKFNMGLFE-YPL--PMEKNYDKVVHAPAHVSLARKIAEESIVLLQNEN 405
Query: 412 GLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV---YYPGC 468
+LPL +K S+A+IGPNA+ + ++ + +T L+AL+ V N + Y GC
Sbjct: 406 NILPLQMNKLKSIAVIGPNADQVQFGDYTWSRDNKDGVTLLEALKERVSNQLTLNYAKGC 465
Query: 469 DTVACSSASIDKAVDIAKGADHVVLMMG---------LDQTQEKEELDRVDLVLPGRQQE 519
D V + +AVD+AK +D ++++G E D DL L G Q++
Sbjct: 466 DLVTDDCSGFKEAVDVAKKSDVCIVVVGSASASLARDYSNATCGEGFDLSDLTLTGVQED 525
Query: 520 LITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHN 579
L+ + A KPVI+VLL G P +++ K NI I+ YPGE G +ALA+++ G N
Sbjct: 526 LVEAI-HATGKPVIVVLLSGKPFAMSWIK--ENIPGIVVQWYPGEQGGLALADMLLGKVN 582
Query: 580 PGGRLPMTWYPQD-------YIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLS 632
P G+L + +PQ Y +P R + PG+ Y F K ++ FG GLS
Sbjct: 583 PSGKLNYS-FPQSVGHLPCYYNYLPTDKGFYRSPGSKNKPGKDYVFSSPKALWAFGHGLS 641
Query: 633 YSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNH 692
Y+ + YL+ ++S + +DV+ VTI ++N
Sbjct: 642 YTDFE-----------YLSATTSKEDYACEDVIE------------------VTIAIRNT 672
Query: 693 GEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDG 752
G+ G ++V+ P+++L GF+ V++ E +++ ++ P L+ ++
Sbjct: 673 GDYDGLEVPQVYVRDMVSSVVMPVQELKGFEKVLIKKGETKQVIIKI-PVSELALYNKEM 731
Query: 753 LMVIEEGTHFLVVGDEEYPISI 774
V+E G L +G I I
Sbjct: 732 KKVVEPGAFELQIGRASDDIRI 753
>gi|336417083|ref|ZP_08597412.1| hypothetical protein HMPREF1017_04520 [Bacteroides ovatus
3_8_47FAA]
gi|335936708|gb|EGM98626.1| hypothetical protein HMPREF1017_04520 [Bacteroides ovatus
3_8_47FAA]
Length = 850
Score = 273 bits (698), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 159/418 (38%), Positives = 240/418 (57%), Gaps = 34/418 (8%)
Query: 51 PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
P+ +R DL+SRLT++EKIS L ++P IPRLGI Y +EALHGV G+
Sbjct: 33 PVHERVADLLSRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPGR-------- 84
Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ---------AIGMTFWAPNI 161
T FPQ I AA+++ L R+ I EARA +N + +TFW+P +
Sbjct: 85 --FTVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQFSDVLTFWSPTV 142
Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
N+ RDPRWGR ET GEDP ++G ++V+G+QGD L+ + KHF A
Sbjct: 143 NMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQGDD------PRYLKIVSTPKHFAA-- 194
Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
N + R+ + +++ + L + Y P FE CVK+G+A+ IM AYN +N +P + LL
Sbjct: 195 --NNEEHNRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMTAYNALNDVPCTLNAWLL 252
Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTK 340
K R+ WGF GY+ SDC S++ +A Y K+ E A ++AG+D+ CG ++
Sbjct: 253 KKVLRQDWGFQGYVVSDCGGPSLLVNAHKYVKTKEAAATLSIQAGLDLECGDDVYDEYLL 312
Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
A KQ + +++ID A ++ + RM+LGLF+G P+ +I V+ S HQ +AL AA
Sbjct: 313 NAYKQYMVSDADIDSAACHVLTARMKLGLFDGT-ERNPYTRISPSVIGSKEHQQIALDAA 371
Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQALQN 457
++ IVLLKN + +LPL +K S+A++G NA K G+Y+G P ++ LQ +++
Sbjct: 372 RECIVLLKNKNNMLPLNVNKVKSIAVVGINA--GKCEFGDYSGAPVVDPVSILQGIKD 427
Score = 117 bits (294), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 80/278 (28%), Positives = 137/278 (49%), Gaps = 43/278 (15%)
Query: 489 DHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAK 548
+ VV +MG++++ E+E DR D+ LP Q+E + + + +I+VL+ G + + +
Sbjct: 602 ETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNPN-IIVVLVAGSSLAVNWM- 659
Query: 549 YDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQAT 608
D +I +I+ A YPGE G A+A+V+FGD+NP GRLP+T+Y + ++P D
Sbjct: 660 -DEHIPAIVNAWYPGEQGGTAVADVLFGDYNPAGRLPLTYY-KSLDELPAFD------DY 711
Query: 609 SGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYK 668
GRTY++++G ++PFG GLSYS + Y S K+ + + V
Sbjct: 712 DITKGRTYKYFKGDVLYPFGYGLSYSSFKY---------------SDLKVKDGANTVS-- 754
Query: 669 SVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILN 728
V+ +KN G+ G ++V+ G PIK+L GF+ + L
Sbjct: 755 ----------------VSFRLKNTGKRKGDEVAQVYVRIPETGGVVPIKELKGFRRIPLK 798
Query: 729 AKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
+ E + EL + G ++ +G ++VG
Sbjct: 799 SGESRVVEIELDKEQLRYWDAGLGRFIVPQGAFDIMVG 836
>gi|270296173|ref|ZP_06202373.1| conserved hypothetical protein [Bacteroides sp. D20]
gi|270273577|gb|EFA19439.1| conserved hypothetical protein [Bacteroides sp. D20]
Length = 942
Score = 273 bits (698), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 229/796 (28%), Positives = 360/796 (45%), Gaps = 135/796 (16%)
Query: 51 PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW------------------- 89
P+ R +L+ ++TLDEK Q+V + + +P EW
Sbjct: 59 PLEARIENLLQQMTLDEKTCQVVTLYGYKRVLKDDLPTPEWKELLWKDGIGAIDEHLNGF 118
Query: 90 -------------WSEALHG----------VAGVGKGI---FFNGTIRG-----ATSFPQ 118
W + H V GI F N IRG AT+FP
Sbjct: 119 QQWGLPPSDNAYVWPASRHAWALNEVQRFFVEDTRLGIPVDFTNEGIRGVESYRATNFPT 178
Query: 119 VILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPG 177
+ +++ L ++G G EAR L G T +AP +++ RD RWGR +E G
Sbjct: 179 QLGLGHTWNRELIRQVGLITGREARML-------GYTNVYAPILDVGRDQRWGRYEEVYG 231
Query: 178 EDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARV 237
E P + + + VRG+Q + Q +A KHF AY + + D ++
Sbjct: 232 ESPYLVAELGIEMVRGLQHNH----------QVAATGKHFAAYSNNKGAREGMARVDPQM 281
Query: 238 TMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITS 297
+ +++ + + PF+ +++ G+M +YN +GIP L+ R + GF GY+ S
Sbjct: 282 SPREVENIHIYPFKRVIREAGMLGVMSSYNDYDGIPVQGSYYWLTTRLRGEMGFRGYVVS 341
Query: 298 DCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC-----GSFLQKHTKAAVKQKKLPESE 352
D DAV +Y G AK ++AV ++AG++V C SF+ + VK+ L E
Sbjct: 342 DSDAVEYLYTKHGTAKDMKEAVRQSVEAGLNVRCTFRSPDSFVLP-LRELVKEGGLSEEV 400
Query: 353 IDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD-VVCSPAHQVLALQAAQDGIVLLKNSH 411
I+ + ++ V+ +GLF+ P GAD V ++ +ALQA+++ IVLLKN+
Sbjct: 401 INDRVRDILRVKFLIGLFDA-PYQTDLA--GADREVEKEENEAIALQASRESIVLLKNAG 457
Query: 412 GLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE---NTVYYPGC 468
LLPL + + +A+ GPNAN L +Y + T L+ +Q + +Y GC
Sbjct: 458 ELLPLDINSTKKIAVCGPNANEEGYALTHYGPLAVEVTTVLEGIQEKTKGKAEVLYTKGC 517
Query: 469 DTVAC---------------SSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVL 513
D V A IDKAV+ A+ AD ++++G Q E R L L
Sbjct: 518 DLVDAHWPESEIIDYPLTDDEQAEIDKAVENARQADVAIVVLGGGQRTCGENKSRTSLDL 577
Query: 514 PGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEV 573
PGRQ +L+ + +A KPV+L+L+ G P+ I +A D+ + +IL A YPG G ALA++
Sbjct: 578 PGRQLQLLQAI-QATGKPVVLILINGRPLSINWA--DKFVPAILEAWYPGSKGGTALADI 634
Query: 574 IFGDHNPGGRLPMTWYPQDYIKVPMT-DMKMRPQATSG-NPGRTYRFYE-GKEVFPFGCG 630
+FGD+NPGG+L +T +P+ ++P K Q G NPG T ++PFG G
Sbjct: 635 LFGDYNPGGKLTVT-FPKTVGQIPFNFPCKPSSQIDGGKNPGPTGNMSRINGALYPFGYG 693
Query: 631 LSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVK 690
LSY+ + Y ++ + N+S++ V + V
Sbjct: 694 LSYTTFEYSDLDITPRVITPNESAT-----------------------------VRLKVT 724
Query: 691 NHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRARE 750
N G+ AG V L+++ K L GFQ + L E E+ F + + L
Sbjct: 725 NTGKRAGDEVVQLYIRDVLSSITTYEKNLAGFQRIHLEPGEAQELSFTID-RKHLELLDA 783
Query: 751 DGLMVIEEGTHFLVVG 766
D V+E G L+ G
Sbjct: 784 DMKWVVEPGDFVLMAG 799
>gi|300727409|ref|ZP_07060818.1| beta-xylosidase B [Prevotella bryantii B14]
gi|299775289|gb|EFI71888.1| beta-xylosidase B [Prevotella bryantii B14]
Length = 841
Score = 273 bits (698), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 166/438 (37%), Positives = 240/438 (54%), Gaps = 33/438 (7%)
Query: 41 ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
+ P+ +L QRA DL SRLTL+EK + N +PAI RL IPA+EWWSEALHG A
Sbjct: 6 QHLPYQNPSLSAEQRAEDLCSRLTLEEKCKLMQNGSPAIKRLNIPAFEWWSEALHGTARN 65
Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLE-------ARALYNAGQAIG 153
G AT FP AAS++ L +I AIG E AR N + G
Sbjct: 66 G----------FATVFPNTTGMAASWNDQLLLQIFSAIGNESRIKNTLARKSGNIKRYQG 115
Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
++ W PNINIFRDPRWGRGQET GEDP +TGK ++ V G+QG K + AC
Sbjct: 116 LSIWTPNINIFRDPRWGRGQETYGEDPYLTGKMGLAVVEGLQGP-----KNSKYYKLLAC 170
Query: 214 CKHFTAYDLDNWKGTTRYKFD-ARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGI 272
KHF + + R+ F+ + +DL +TY P F++ +++G + +MCAY+ ++G+
Sbjct: 171 AKHFAVHSGPEY---LRHSFNIENLPARDLWETYLPAFKTLIQEGNVAEVMCAYHSMDGL 227
Query: 273 PSCADRNLLSKTARRQWGFHGYITSDCDAVS--IIYDAEGYAKSPEDAVVDVLKAGMDVN 330
P C L + R+ GF G + SDC A+ I A+ A + AG DV
Sbjct: 228 PCCGSNKYLQQILRQDLGFKGMVVSDCGAIGDFWIQGRHEVAQDAAQASAQAVLAGTDVE 287
Query: 331 CGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSP 390
CG+ K AVK+ ++ E +I+ ++ L R +LG F+ + M + ++ ++
Sbjct: 288 CGANYDK-LPEAVKRGEISEEKINVSVMRLLKARFKLGDFDSD-NMVEWTQLPESLIACS 345
Query: 391 AHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSIT 450
H+ LA Q AQ+ + LLKN +G+LPL K+ +A++G NAN + L GNY G ++I+
Sbjct: 346 KHKQLAYQMAQESMTLLKN-NGILPL--QKNARIAVMGANANDSIMLWGNYNGYPTKTIS 402
Query: 451 PLQALQNYVENTVYYPGC 468
L+ LQN ++ Y PGC
Sbjct: 403 ILEGLQNKSKHISYIPGC 420
Score = 103 bits (257), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 88/290 (30%), Positives = 127/290 (43%), Gaps = 59/290 (20%)
Query: 488 ADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLL 537
A V+ + G+ E EE+ DR + LP Q+++I + + KK VI V
Sbjct: 584 AQVVIYVGGISPRLEGEEMKVNELGFKGGDRTTIELPQSQRDMIALLHNSGKK-VIFVNC 642
Query: 538 CGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVP 597
GG I RN +IL A Y GE G A+A+V+FGD+NP G+LP+T+Y D
Sbjct: 643 SGGA--IALEPESRNADAILQAWYGGEMGGQAVADVLFGDYNPNGKLPVTFYKNDSQLPD 700
Query: 598 MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTK 657
D M+ GRTYR+ ++PFG GLSY+ ++Y
Sbjct: 701 YNDYTMK--------GRTYRYLHQAPLYPFGYGLSYTTFAY------------------- 733
Query: 658 MVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIK 717
D Y + RK +++ V N G+ G + ++++ NG PIK
Sbjct: 734 -----DNAKY-----------DRRKGNLSLEVTNTGKCEGTTTIQVYIRRTADING-PIK 776
Query: 718 QLVGFQSVILNAKEKAEIVFELSPCESLSRARE-DGLMVIEEGTHFLVVG 766
L FQ V L A EK + L P E E M I G + ++VG
Sbjct: 777 TLKAFQKVSLQANEKKRVTINL-PRERFEGWDETTNTMRIVPGKYEIMVG 825
>gi|354580734|ref|ZP_08999639.1| glycoside hydrolase family 3 domain protein [Paenibacillus lactis
154]
gi|353203165|gb|EHB68614.1| glycoside hydrolase family 3 domain protein [Paenibacillus lactis
154]
Length = 766
Score = 273 bits (697), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 211/719 (29%), Positives = 335/719 (46%), Gaps = 104/719 (14%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIP + E HG +G AT FP + ++++ L+ + +A+
Sbjct: 104 RLGIPIL-FGEECSHGHMAIG-----------ATVFPVPLTIGSTWNPELFRSMCRAVAA 151
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
E R+ G ++P +++ RDPRWGR +ET GEDP + ++AV+ V+G+QGD
Sbjct: 152 ETRSQG------GAATYSPVLDVVRDPRWGRTEETFGEDPHLVAEFAVAAVQGLQGD--- 202
Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
+L + A KHF Y R + +++L + PF V+ G A
Sbjct: 203 --RLDAEDSLLATLKHFAGYGASEGG---RNGAPVHMGLRELHEIDLLPFRKAVEAG-AQ 256
Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
+M AYN ++G+P + R LL R WGF G++ +DC A+ ++ A S E+A
Sbjct: 257 SVMTAYNEIDGVPCTSSRYLLHDVLREAWGFDGFVITDCGAIDMLKSGHNTAASGEEAAA 316
Query: 321 DVLKAGMDVN-CGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
L AG+D+ GS + + + A++Q + E +++ A+ + +++ RLGLF+ P P
Sbjct: 317 QALTAGVDMEMSGSMFRVYLRQALEQGHITEDDLNTAVGRVLAMKFRLGLFD-RPYTDP- 374
Query: 380 GKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLG 439
+ V+ H LA + A +GIVLLKN +LPL K+ +A+IGPNAN+ LG
Sbjct: 375 -ERAEKVIGCEEHIELARRVAAEGIVLLKNEGNVLPL-NPKTGKIAVIGPNANAPYNQLG 432
Query: 440 NYAGPS--CRSITPLQALQNYV-----ENTVYYPGCDTVACSSASIDKAVDIAKGADHVV 492
+Y P + IT L+ ++ ++ +Y PGC S + A+ A AD +V
Sbjct: 433 DYTSPQPPGQIITVLEGIRRHIGEDADTRVLYAPGCRIQGDSREGLSHALACAAEADVIV 492
Query: 493 LMMGLDQTQE-------------------------KEELDRVDLVLPGRQQELITRVAEA 527
+ +G ++ E +DR L L G Q EL+ + +
Sbjct: 493 MAIGGSSARDFGEGTIDLRTGASVVTGLAQSDMECGEGIDRSTLHLMGVQLELLQEIHKL 552
Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMT 587
KPV++V + G P IT D +I +IL A YPG+ G A+A+++FGD NP GRL +T
Sbjct: 553 G-KPVVVVYINGRP--ITEPWIDEHIPAILEAWYPGQEGGSAIADILFGDVNPSGRLTLT 609
Query: 588 WYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNK 647
P++ ++P+ R + G+ Y + + +PFG GLSY+ + Y +V
Sbjct: 610 -IPKEVGQLPINYNAKRTR------GKRYLETDLEPRYPFGYGLSYTDFHYGNLSVEPAV 662
Query: 648 LYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKP 707
+ + S++ ++V V N G G V L+V
Sbjct: 663 IPADGSAAVRIV-----------------------------VTNTGPRDGAEVVQLYVSD 693
Query: 708 ARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
RP K L F V L A E E+ F + P E L D V+E G + VG
Sbjct: 694 LAASVTRPEKALKAFSKVFLKAGESREVTFTVGP-EQLELIGPDMKAVVEPGEFRIRVG 751
>gi|374311316|ref|YP_005057746.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
gi|358753326|gb|AEU36716.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
Length = 773
Score = 273 bits (697), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 239/813 (29%), Positives = 364/813 (44%), Gaps = 115/813 (14%)
Query: 3 FHKLSLVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSR 62
K LV +CL +L S S + + ++ LP+ R DL+ R
Sbjct: 12 LKKRFLVLTGVCLSSVTLFPAYSQ-----SIASSGKTKTVLIYEQSNLPLETRLADLLGR 66
Query: 63 LTLDEKISQL-----------VNSAPAIPRLGIPAYEWWSEALHGVAGVGK--------- 102
+TL+EK+ QL S ++P P ++AL G G G
Sbjct: 67 MTLEEKVRQLDFYSGTDSLLDRGSKNSLPSKQSPFSTAKADALFGSLGAGAIHDLDPTPE 126
Query: 103 ------------------GIFFNGTIRG---ATSFPQVILTAASFDSYLWYRIGQAIGLE 141
+F + G T FP + A+++D + + G AI E
Sbjct: 127 QYNTIQRWVIEHNRLHIPALFIEEGLHGFDTGTVFPAPLNLASTWDPSVAEKTGSAIAAE 186
Query: 142 ARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNG 201
ARA +GM AP +++ RDPRWGR +E GEDP +TG+ ++YVRG QG++ N
Sbjct: 187 ARA-----TGVGMIL-APVLDLARDPRWGRIEEDFGEDPYLTGQMGLAYVRGAQGESLNT 240
Query: 202 GKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASG 261
A KHF A+ T + ++L FE +QG A
Sbjct: 241 DH-----NVVAEPKHFAAHGSPEGGTNTS---PVHIGERELRSVMLKSFEPAFRQGHAMA 292
Query: 262 IMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVD 321
M AY+ ++GIP AD LL R++WGF G + SD A+ +Y A SP+ A
Sbjct: 293 TMAAYHEIDGIPVTADPYLLKTILRQEWGFQGMVLSDLGAIRRLYQLHQVASSPKAASCL 352
Query: 322 VLKAGMDVNCGSF----LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQ 377
+K+G+D+ F QK V + LP++++DRA + ++ LGLF+ P +
Sbjct: 353 AIKSGVDMQFYDFDHDVFQKALIDCVHEGSLPQADVDRAASAVLRLKFTLGLFD-RPYVD 411
Query: 378 PFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTL 437
P + A S H ++LQ+A++ +VLLKN +GLLP KS +A+IGPNA+ A+
Sbjct: 412 P--TLNAKAYRSKPHLDVSLQSARESLVLLKNENGLLPFSKSIQ-RIAVIGPNADVAR-- 466
Query: 438 LGNYAGPSC-RSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG 496
G+Y + I+ LQ ++ P S I AV AK AD V+L +G
Sbjct: 467 YGDYEEEANGLHISILQGVKAEA------PHAQVEFDSGKDIAAAVAKAKSADVVILGLG 520
Query: 497 LDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSI 556
+ E DR L LPG Q++L+ + A KPV+LVL G P+ I +AK ++G+I
Sbjct: 521 EWRGISGEAFDRTSLDLPGEQEKLLEAIT-ATNKPVVLVLENGRPLTIGWAK--AHVGAI 577
Query: 557 LWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVP---MTDMKMRPQATSGNPG 613
+ A YPGE G A+AE +FGD+NP GRL +T +P+ ++P TD R +
Sbjct: 578 VEAWYPGEFGGQAIAETLFGDNNPAGRLTIT-FPKTVGQIPDYYNTD-PSRAYDSDLTRR 635
Query: 614 RTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPEL 673
+ Y + + +FPFG GLSY+ + Y V+ N+ S
Sbjct: 636 KVYVDNDSQPLFPFGYGLSYTTFHYCDLQVTPPAAKSNEDVS------------------ 677
Query: 674 GTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKA 733
VT V N G AG ++++ P++ L F + L +E
Sbjct: 678 -----------VTFTVTNTGTKAGDEVSQVYLREQFSSVETPVRSLKAFTRMPLQPQESR 726
Query: 734 EIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
+ ++ P L+ D +E G + + VG
Sbjct: 727 TVTLKI-PRSELAVWNADEKWAVEGGKYTVWVG 758
>gi|53714352|ref|YP_100344.1| beta-glucosidase [Bacteroides fragilis YCH46]
gi|52217217|dbj|BAD49810.1| periplasmic beta-glucosidase precursor [Bacteroides fragilis YCH46]
Length = 859
Score = 273 bits (697), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 224/802 (27%), Positives = 361/802 (45%), Gaps = 142/802 (17%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQ------------------------------- 71
F + +LP+ R +DL+SR+TL+EKI+Q
Sbjct: 24 FKYKNASLPVEVRVQDLLSRMTLEEKIAQMRHIHAYSIMENGKLNEEKLEKMIGGQNYGF 83
Query: 72 -------------LVNSAPAI----PRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGAT 114
L+N RLGIP + +E+LHG G+T
Sbjct: 84 IEGITLPGKECLTLMNEVQKYMREKTRLGIPVFTL-TESLHG-----------SVHDGST 131
Query: 115 SFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQE 174
FPQ I ++F+ L Y + AI E A Q I + P I++ RD RWGR +E
Sbjct: 132 IFPQAIALGSTFNPILAYEMTSAIAKELTA-----QGITQSL-TPVIDVCRDLRWGRVEE 185
Query: 175 TPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFD 234
GEDP + + VS VRG + Q S KHF A+ G + +
Sbjct: 186 CFGEDPYLVSRMGVSQVRGYLDN-----------QVSPMIKHFGAH------GAPQGGLN 228
Query: 235 -ARVT--MQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGF 291
A V+ ++L Y FE+ VK+ + +M +YN N P+ + L+++ R +W F
Sbjct: 229 LASVSCGQRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNEPNSSSHYLMTELLRDRWDF 288
Query: 292 HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPES 351
GY+ SD A+ ++ A++ +A + L AG+D + V+ L
Sbjct: 289 QGYVYSDWGAIGMLNYFHKTAQNSAEAAIQALTAGLDAEASDNSYAELQQLVENGMLDVK 348
Query: 352 EIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSH 411
ID+A+ + + + +GLF P P K VV +PAH LA + A++ IVLL+N +
Sbjct: 349 YIDQAVARILTAKFNMGLFE-YPL--PMEKNYDKVVHAPAHVSLARKIAEESIVLLQNEN 405
Query: 412 GLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV---YYPGC 468
+LPL +K S+A+IGPNA+ + ++ + +T L+AL+ V N + Y GC
Sbjct: 406 NILPLQMNKLKSIAVIGPNADQVQFGDYTWSRDNKDGVTLLEALKERVGNQLTLNYAKGC 465
Query: 469 DTVACSSASIDKAVDIAKGADHVVLMMG---------LDQTQEKEELDRVDLVLPGRQQE 519
D V + +AVD+AK +D ++++G E D DL L G Q++
Sbjct: 466 DLVTDDRSGFKEAVDVAKKSDVCIVVVGSASASLARDYSNATCGEGFDLSDLTLTGVQED 525
Query: 520 LITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHN 579
L+ + A KPVI+VLL G P+ +++ K NI I+ YPGE G +ALA+++ G N
Sbjct: 526 LVEAI-HATGKPVIVVLLSGKPLAMSWIK--ENIPGIVVQWYPGEQGGLALADMLLGKVN 582
Query: 580 PGGRLPMTWYPQD-------YIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLS 632
P G+L + +PQ Y +P R + PG+ Y F K ++ FG GLS
Sbjct: 583 PSGKLNYS-FPQSVGHLPCYYNYLPTDKGFYRSPGSKNKPGKDYVFSSPKALWAFGHGLS 641
Query: 633 YSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNH 692
Y+ + YL+ ++S + +DV+ VTI ++N
Sbjct: 642 YTDFE-----------YLSATTSKEDYACEDVIE------------------VTIAIRNT 672
Query: 693 GEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDG 752
G+ G ++V+ P+++L GF+ V++ E +++ ++ P L+ ++
Sbjct: 673 GDYDGLEVPQVYVRDMVSSVVMPVQELKGFEKVLIKKGETKQVIIKI-PVSELALYNKEM 731
Query: 753 LMVIEEGTHFLVVGDEEYPISI 774
V+E G L +G I I
Sbjct: 732 KKVVEPGAFELQIGRASDDIRI 753
>gi|404254492|ref|ZP_10958460.1| glycosyl hydrolase family protein [Sphingomonas sp. PAMC 26621]
Length = 898
Score = 273 bits (697), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 170/422 (40%), Positives = 227/422 (53%), Gaps = 59/422 (13%)
Query: 59 LVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQ 118
+V+R+T +EK+ QL+N APAIPR+ IPAY WW+E+LHG G T+FP+
Sbjct: 51 IVARMTTEEKLPQLLNVAPAIPRIKIPAYNWWTESLHGALGAVP----------TTNFPE 100
Query: 119 VILTAASFDSYLWYRIGQAIGLEARALYNAGQ--------AIGMTFWAPNINIFRDPRWG 170
I AASFD+ L +++ AI E RAL+ G+ G+ W+PNINIFRDPRWG
Sbjct: 101 PIGLAASFDAPLVHQVAGAISTEVRALHTMGRQTGKLGRIGTGLDTWSPNINIFRDPRWG 160
Query: 171 RGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTR 230
RGQET GEDP + V++V G+QG + A KHF + + +TR
Sbjct: 161 RGQETYGEDPYLAAHMGVAFVTGMQGPNPD------LPDVIATPKHFAVH---SGPESTR 211
Query: 231 YKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWG 290
+ + V+ DL DTY P F + + +GRA IMCAYNR++G P+CA+ LL R WG
Sbjct: 212 HGANVFVSAHDLEDTYLPAFRAAIVEGRAGSIMCAYNRIDGQPACANDLLLKDHLRGAWG 271
Query: 291 FHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPE 350
F GY+ SDCDAV I D YA A+ +KAG D C HT+ LPE
Sbjct: 272 FTGYVVSDCDAVKDIADGHKYAPDQATAIAAAMKAGTDNEC------HTQTIGDMGGLPE 325
Query: 351 -------------SEIDRALHNLFSVRMRLGLF---NGNPTMQPFGKIGADVVCSPAHQV 394
+IDR+L LFS R+R G G P P I SP H
Sbjct: 326 RFGDSLRRGLISQDDIDRSLVRLFSARLRNGDLPGIAGAPAAVPVRAI-----LSPDHMA 380
Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
LALQAA+ +VLLKN G+LPL +A+IGP ++ + L GNY+ S +S P+
Sbjct: 381 LALQAAEKSLVLLKND-GVLPL--RPGARIAVIGPLGDATRVLRGNYS--STQSAPPVPI 435
Query: 455 LQ 456
++
Sbjct: 436 VE 437
Score = 122 bits (306), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 90/274 (32%), Positives = 135/274 (49%), Gaps = 54/274 (19%)
Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
D+ L LP Q ++ R A+A KPVI+V + G P+++ +AK N +I+ A YPG++G
Sbjct: 656 DKTSLDLPAEQMAMLER-AKATGKPVIVVAMNGSPINLAWAK--DNAAAIVEAWYPGQSG 712
Query: 567 AVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFP 626
+A+A V+ G +PGGRLP+T+Y P + M GRTYR++ GK V+P
Sbjct: 713 GLAVANVLTGKTDPGGRLPLTFYRSVADLPPFDNYAMT--------GRTYRYFTGKPVYP 764
Query: 627 FGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVT 686
FG GLSY+ ++Y L LN +S G E + VT
Sbjct: 765 FGHGLSYTSFAY-------TPLVLNPASG------------------GAE----QGLRVT 795
Query: 687 IGVKNHGEMAGKHPVLLFVK----PARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPC 742
V N G+ AG V L++ P G P L G+Q + L E+ + F+LSP
Sbjct: 796 TDVSNTGKRAGDEVVQLYLSFPDVP-----GAPRIALRGYQRITLAPGERRSVTFDLSP- 849
Query: 743 ESLSRAREDGLMVIEEGTHFLVVG----DEEYPI 772
LS R +G+ + +G + + VG D + P+
Sbjct: 850 RDLSAVRANGVREVMQGQYRVSVGAGQPDTDVPV 883
>gi|423301682|ref|ZP_17279705.1| hypothetical protein HMPREF1057_02846 [Bacteroides finegoldii
CL09T03C10]
gi|408471675|gb|EKJ90206.1| hypothetical protein HMPREF1057_02846 [Bacteroides finegoldii
CL09T03C10]
Length = 1365
Score = 273 bits (697), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 231/812 (28%), Positives = 354/812 (43%), Gaps = 148/812 (18%)
Query: 26 STQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQL------------- 72
S+ FS ++P P+ + LPI +R +DL+ R+T +EK++Q+
Sbjct: 520 SSAISFSSTTASPK---LPYQRADLPIEERVKDLLQRMTPEEKLAQIRHIHSWEIFNGQA 576
Query: 73 -----------------VNSAPAIP------------------RLGIPAYEWWSEALHGV 97
V P RLGIP + +E+LHGV
Sbjct: 577 LDERKLEEKAQGMSWGFVEGFPLTAENCAKNMLAIQRFMVEKTRLGIPIFTV-AESLHGV 635
Query: 98 AGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGM-TF 156
GAT FPQ I ++FD+ L YR I E A+GM
Sbjct: 636 VH-----------EGATVFPQNIALGSTFDTDLAYRKTSMIADEL-------HAVGMRQV 677
Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKH 216
+P I++ RD RWGR +E+ GEDP + G++ ++ V+G NG S KH
Sbjct: 678 LSPCIDVVRDLRWGRVEESFGEDPYLCGRFGIAEVKGYMD---NG--------ISPMLKH 726
Query: 217 FTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
+ + + G + +++DL + Y PFE +KQ +M AYN N IP+ A
Sbjct: 727 YGPHG-NPLSGLNLASVET--SIRDLHEVYLKPFEMVMKQAPTLAVMSAYNSWNRIPNSA 783
Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQ 336
LL+ R++WGF GY+ SD A+ ++ + A++ E+A + L AG+DV S
Sbjct: 784 SHYLLTDVLRKEWGFKGYVYSDWGAIEMLKNFHFTARNSEEAALQALTAGLDVEASSDCY 843
Query: 337 KHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFG-KIGADVVCSPAHQVL 395
+++ +L +D A+ + + R+GLF+ P+G K + S L
Sbjct: 844 PAIPGLIERGELNREIVDEAVRRVLYAKFRIGLFD-----DPYGEKFAKGAIHSGKAIAL 898
Query: 396 ALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR--SITPLQ 453
+ + A + VLLKN LLPL K S+A+IGPNA+ + G+Y +TPLQ
Sbjct: 899 SKKIADESTVLLKNDRQLLPLSIGKLKSIAVIGPNADQIQ--FGDYTWTRDNRFGVTPLQ 956
Query: 454 ALQNYVENTV---YYPGCDTVACSSASIDKAVDIAKGADHVVLMMG---------LDQTQ 501
++ + V Y GC V+ + I +AV+ A+ +D VL G +
Sbjct: 957 GIRKWAGTNVKVNYVKGCSLVSMDESGIRQAVEAAEQSDVCVLFCGSASAALARDYKSST 1016
Query: 502 EKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGY 561
E D DL L G Q LI + +A KPVILVL+ G P I + K +NI +IL Y
Sbjct: 1017 CGEGFDLNDLTLTGAQPALI-KAVQATGKPVILVLVTGKPFAIPWEK--KNIPAILVQWY 1073
Query: 562 PGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQ-------ATSGNPGR 614
GE ++A+++FG +P GRL + +P+ +P+ +R + +PGR
Sbjct: 1074 AGEQSGNSIADILFGKVSPSGRLTFS-FPESTGHLPVFYNHLRSDRGFYKSPGSYDSPGR 1132
Query: 615 TYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELG 674
Y F ++ FG GL+Y+ + Y + LN D VH
Sbjct: 1133 DYVFSAPVPLWSFGHGLTYTTFEYSNLQTDRTSYLLN-----------DTVH-------- 1173
Query: 675 TEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAE 734
V I +KN G+ GK V L+V P+ QL F+ V L A E
Sbjct: 1174 ----------VRIDLKNTGKREGKEVVQLYVSDVYSSVAMPVHQLRDFRKVALQAGETQT 1223
Query: 735 IVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
+ + P L+ E ++E G + VG
Sbjct: 1224 VRLSI-PVSELTILNEKNEAIVEPGEFEIQVG 1254
>gi|261408260|ref|YP_003244501.1| glycoside hydrolase family protein [Paenibacillus sp. Y412MC10]
gi|261284723|gb|ACX66694.1| glycoside hydrolase family 3 domain protein [Paenibacillus sp.
Y412MC10]
Length = 763
Score = 273 bits (697), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 219/716 (30%), Positives = 337/716 (47%), Gaps = 101/716 (14%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIP + E HG +G AT FP + +++++ L+ I +A+
Sbjct: 104 RLGIPIL-FGEECSHGHMAIG-----------ATVFPVPLTIGSTWNTELFRSISRAVAA 151
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
E RA G A ++P +++ RDPRWGR +ET GEDP + ++AV+ V+G+QG+
Sbjct: 152 ETRA--QGGSAT----YSPVLDVVRDPRWGRTEETFGEDPHLVTEFAVAAVQGLQGE--- 202
Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
+L A KHF Y R + +++L + PF V+ G S
Sbjct: 203 --RLDSHTSLLATLKHFAGYGASE---GGRNGAPVHMGLRELHEVDLLPFRKAVEAGALS 257
Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
+M AYN ++G+P + LL R WGF G++ +DC A+ ++ A S +A
Sbjct: 258 -VMTAYNEIDGVPCTSSGYLLQDVLREAWGFDGFVITDCGAIHMLACGHNTAGSGVEAAA 316
Query: 321 DVLKAGMDVN-CGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
LKAG+D+ G+ + H A++Q + E +++RA + ++ RLGLF+ P + P
Sbjct: 317 QSLKAGVDMEMSGTMFRAHLHQALEQGLITEEDLNRAAGRVLELKFRLGLFD-RPYVDP- 374
Query: 380 GKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLG 439
V+ H LA QAA +GIVLLKN LLPL S S ++A+IGPNA++ LG
Sbjct: 375 -AWAEQVIGCKEHIALAYQAAAEGIVLLKNEGNLLPL-DSSSGTIAVIGPNAHAPYHQLG 432
Query: 440 NYAGPS--CRSITPLQALQNYVENT--VYYPGCDTVACSSASIDKAVDIAKGADHVVLMM 495
+Y P + +T L ++ + ++ +Y PGC S +A+ A+ AD +V+++
Sbjct: 433 DYTSPQPPGQIVTVLDGIRRRLGDSRVLYAPGCRIQGDSREGFPRALACAEQADVIVMVL 492
Query: 496 GLDQTQE-------------------------KEELDRVDLVLPGRQQELITRVAEAAKK 530
G ++ E +DR L L G Q EL+ + + K
Sbjct: 493 GGSSARDFGEGTIDLRTGASVVTGHAESDMECGEGIDRSTLTLMGVQLELLQELHKLG-K 551
Query: 531 PVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYP 590
PVI+V + G P IT D +I SI+ A YPG+ G A+A+++FGD NP GRLP++ P
Sbjct: 552 PVIVVYINGRP--ITEPWIDEHIPSIVEAWYPGQEGGSAIADMLFGDINPSGRLPLS-IP 608
Query: 591 QDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYL 650
++ ++P + R + G+ Y + +PFG GLSY+++ Y V
Sbjct: 609 KEVGQLPNSYNARRTR------GKRYLETDLAPRYPFGFGLSYTEFRYGRLTVE------ 656
Query: 651 NQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARR 710
+V +G E V I V N G G V L+V
Sbjct: 657 -----------------PAVVPIGGEAT------VRIDVTNAGARDGAEVVQLYVSDLAA 693
Query: 711 GNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
RP K L GF+ V L A E E+ F + E L D V+E G + VG
Sbjct: 694 SVTRPEKALKGFRKVFLKAGETQEVTFTIG-SEQLELIGLDLKPVVEPGEFRIQVG 748
>gi|395490413|ref|ZP_10421992.1| glycosyl hydrolase family protein [Sphingomonas sp. PAMC 26617]
Length = 898
Score = 272 bits (696), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 170/422 (40%), Positives = 226/422 (53%), Gaps = 59/422 (13%)
Query: 59 LVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQ 118
+V+R+T +EK+ QL+N APAIPRL IPAY WW+E+LHG G T+FP+
Sbjct: 51 IVARMTTEEKLPQLLNVAPAIPRLKIPAYNWWTESLHGALGAVP----------TTNFPE 100
Query: 119 VILTAASFDSYLWYRIGQAIGLEARALYNAGQ--------AIGMTFWAPNINIFRDPRWG 170
I AASFD+ L + + AI E RAL+ G+ G+ W+PNINIFRDPRWG
Sbjct: 101 PIGLAASFDAPLVHEVAGAISTEVRALHTMGRQTGKLGRIGTGLDTWSPNINIFRDPRWG 160
Query: 171 RGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTR 230
RGQET GEDP ++ V++V G+QG + A KHF + + +TR
Sbjct: 161 RGQETYGEDPYLSAHMGVAFVTGMQGPNPD------LPDVIATPKHFAVH---SGPESTR 211
Query: 231 YKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWG 290
+ + V+ DL DTY P F + + +GRA IMCAYNR++G P+CA+ LL R WG
Sbjct: 212 HGANVFVSAHDLEDTYLPAFRAAIVEGRAGSIMCAYNRIDGQPACANDLLLKDHLRGAWG 271
Query: 291 FHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPE 350
F GY+ SDCDAV I D YA A+ +KAG D C HT+ LPE
Sbjct: 272 FTGYVVSDCDAVKDIADGHKYAPDQATAIAAAMKAGTDNEC------HTQTIGDMGGLPE 325
Query: 351 -------------SEIDRALHNLFSVRMRLGLF---NGNPTMQPFGKIGADVVCSPAHQV 394
+IDR+L LFS R+R G G P P I SP H
Sbjct: 326 RFGDSLRRGLISQDDIDRSLVRLFSARLRNGDLPGIAGAPAAVPVRAI-----LSPDHMA 380
Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
LALQA + +VLLKN G+LPL +A+IGP ++ + L GNY+ S +S P+
Sbjct: 381 LALQATEKSLVLLKND-GVLPL--RPGARIAVIGPLGDATRVLRGNYS--STQSAPPVSI 435
Query: 455 LQ 456
++
Sbjct: 436 VE 437
Score = 120 bits (300), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 90/274 (32%), Positives = 134/274 (48%), Gaps = 54/274 (19%)
Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
D+ L LP Q ++ R A+A KPVI+V + G P+++ +AK N +I+ A YPG++G
Sbjct: 656 DKTSLDLPAEQMAMLER-AKATGKPVIVVAMNGSPINLAWAK--DNAAAIVEAWYPGQSG 712
Query: 567 AVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFP 626
VA+A V+ G +PGGRLP+T+Y P + M GRTYR++ GK V+P
Sbjct: 713 GVAVANVLTGKTDPGGRLPLTFYRSVADLPPFDNYTMT--------GRTYRYFTGKPVYP 764
Query: 627 FGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVT 686
FG GLSY+ ++Y L LN +S G E + VT
Sbjct: 765 FGHGLSYTSFAY-------TPLVLNPASG------------------GAE----QGLRVT 795
Query: 687 IGVKNHGEMAGKHPVLLFVK----PARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPC 742
V N + AG V L++ P G P L G+Q + L E+ + F+LSP
Sbjct: 796 TDVSNTVKRAGDEVVQLYLSFPDVP-----GAPRIALRGYQRITLAPGERRSVTFDLSP- 849
Query: 743 ESLSRAREDGLMVIEEGTHFLVVG----DEEYPI 772
LS R +G+ + +G + + VG D + P+
Sbjct: 850 RDLSAVRANGVREVMQGQYRVSVGAGQPDTDVPV 883
>gi|393789623|ref|ZP_10377743.1| hypothetical protein HMPREF1068_04023 [Bacteroides nordii
CL02T12C05]
gi|392650339|gb|EIY44008.1| hypothetical protein HMPREF1068_04023 [Bacteroides nordii
CL02T12C05]
Length = 863
Score = 272 bits (696), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 163/437 (37%), Positives = 236/437 (54%), Gaps = 33/437 (7%)
Query: 44 PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
P+ L +RA DL+ RLTL EK++ + N + I RLGI Y+WW+EALHGVA G
Sbjct: 24 PYRNPNLSPEERAEDLLGRLTLKEKVTLMQNESFPIERLGIAHYDWWNEALHGVARAGI- 82
Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGMT 155
AT FP + A+SFD I A+ E RA Y+ + G+T
Sbjct: 83 ---------ATVFPITMGMASSFDDKAIEDIFTAVSDEVRAKYHDAHRQGRRGVRCEGLT 133
Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCK 215
FW PNINIFRDPRWGRGQET GEDP +T + V+ V+G+QG KL AC K
Sbjct: 134 FWTPNINIFRDPRWGRGQETYGEDPYLTSRMGVAVVKGLQGP---ADAKYDKLH--ACAK 188
Query: 216 HFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPS 274
H+ + K R+ F+A ++ +DL +TY P F++ V++ +MCAYNR G P
Sbjct: 189 HYAVHSGPEAK---RHSFNAENISPRDLWETYLPAFKTLVQEADVKEVMCAYNRFEGDPC 245
Query: 275 CADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYA--KSPEDAVVDVLKAGMDVNCG 332
C LL++ R WG+ + SDC A+S + ++ +A K+ DA + +G D+ CG
Sbjct: 246 CGSNRLLTQILRDDWGYKHVVVSDCGAISDFFYSDRHATHKNAADASAAAVLSGTDLECG 305
Query: 333 SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAH 392
H AVK+ + E I+ +L L R LG + + + P+ KI D V H
Sbjct: 306 -IEYAHLDKAVKEGLISEERINTSLFRLLKARFELGEMD-DDALVPWSKISIDTVDCQMH 363
Query: 393 QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
+ +AL + +VLL N +G+LPL K+ +A++GPNAN + GNY G ++T L
Sbjct: 364 KQMALDITRKSMVLLHN-NGVLPLGKT-GARIAVMGPNANDSVMQWGNYEGTPSHTVTVL 421
Query: 453 QALQNYVENTVYYPGCD 469
+ ++N + N +Y GC+
Sbjct: 422 EGIRNKIGNVIYEKGCE 438
Score = 96.3 bits (238), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 82/293 (27%), Positives = 131/293 (44%), Gaps = 52/293 (17%)
Query: 486 KGADHVVLMMGLDQTQEKEEL-----------DRVDLVLPGRQQELITRVAEAAKKPVIL 534
K AD V+ + G+ E EE DR + LP Q+ ++ + +A KK ++
Sbjct: 597 KEADVVIFVGGISPNLEGEEKNFVNCPGFVGGDRTSIELPEVQRNILKALKKAGKK--VI 654
Query: 535 VLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI 594
+ C G + ++ +IL A YPG+AG A+A++IFGD+NP G+LP+T+Y
Sbjct: 655 FVNCSGSA-MALVPETQSCDAILQAWYPGQAGGTAVADIIFGDYNPSGKLPVTFYKNTEQ 713
Query: 595 KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSS 654
D M+ GRTYR+ +FPFG GLSY+ + + +++ +NQ
Sbjct: 714 LPDFEDYSMK--------GRTYRYMTESPLFPFGYGLSYTTFQFSKAGLNKRVATINQP- 764
Query: 655 STKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGR 714
V +K + VKN G+ G V ++++ A G
Sbjct: 765 ----------VQFK------------------VNVKNTGKRDGTEVVQIYIRKADDEEG- 795
Query: 715 PIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGD 767
P+K L F+ V L A + A + LSP E M I G + ++ G+
Sbjct: 796 PVKSLRAFRPVTLKAGKSATVNITLSPDAFEFFDPETNTMRIVPGDYEIMYGN 848
>gi|393786908|ref|ZP_10375040.1| hypothetical protein HMPREF1068_01320 [Bacteroides nordii
CL02T12C05]
gi|392658143|gb|EIY51773.1| hypothetical protein HMPREF1068_01320 [Bacteroides nordii
CL02T12C05]
Length = 854
Score = 272 bits (696), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 164/430 (38%), Positives = 239/430 (55%), Gaps = 34/430 (7%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
+ P +R DL+S+LT++EKIS L ++P IPRL I Y +EALHGV V G
Sbjct: 28 YLDMNAPRHERILDLLSKLTIEEKISLLRATSPGIPRLHIDKYYHGNEALHGV--VRPGN 85
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN----AGQAIG-----MT 155
F T FPQ I AA ++ L I I EARA +N + +G +T
Sbjct: 86 F--------TVFPQAIGLAAMWNPQLLNEISTVISDEARARWNELEQGKKQLGQFSDLLT 137
Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCK 215
FW+P +N+ RDPRWGR ET GEDP ++GK VS+V+G+QGD L+ + K
Sbjct: 138 FWSPTVNMARDPRWGRTPETYGEDPFLSGKLGVSFVKGLQGDD------PRYLKIVSTPK 191
Query: 216 HFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSC 275
HF A N + R++ + ++ +DL + Y P FE C+ +G+A+ IM AYN +N +P
Sbjct: 192 HFAA----NNEEHNRFECNPIISEKDLREYYLPAFEKCIIEGKAASIMTAYNAINDVPCT 247
Query: 276 ADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SF 334
+ LL K R WGF GY+ SDC S + Y K+ E A ++AG+D+ CG
Sbjct: 248 LNNWLLKKVLRHDWGFDGYVVSDCGGPSFLVTHHKYVKTLEAAAALSIQAGLDLECGDEV 307
Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
+ A KQ + E+EID A +++ RMRLGLF+ +P + P+ KI +V H
Sbjct: 308 YMEPLLNAYKQYMVSEAEIDSAAYHVLRARMRLGLFD-DPALNPYNKISPSIVGCEKHSK 366
Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQ 453
LAL+AA+ IVLLKN LPL K S+A++G NA +++ G+Y+G P + ++ L+
Sbjct: 367 LALEAARQSIVLLKNEKKFLPLDSKKIKSIAVVGINAGNSE--FGDYSGTPVNQPVSILE 424
Query: 454 ALQNYVENTV 463
++ V + V
Sbjct: 425 GIKKKVGDQV 434
Score = 120 bits (301), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 93/292 (31%), Positives = 143/292 (48%), Gaps = 51/292 (17%)
Query: 480 KAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCG 539
+A DI + D V ++G++++ E+E DR + LP QQ I + V+ VL+ G
Sbjct: 594 EAGDIMRKCDLTVAVLGINKSIEREGQDRYSIELPKDQQIFIEEAYKINPNTVV-VLVAG 652
Query: 540 GPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVP-M 598
+ I + D +I +I+ A YPGEAG A+AEV+FGD+NPGG+LP+T+Y + ++P
Sbjct: 653 SSLAINWM--DEHIPAIVNAWYPGEAGGTAVAEVLFGDYNPGGKLPLTYY-RSLDELPAF 709
Query: 599 TDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKM 658
D +R GRTY+F+EG ++ FG GLSY+ +SYK KL ++ +
Sbjct: 710 DDYDIR-------KGRTYQFFEGDPLYAFGHGLSYTTFSYK-------KLSIDAAG---- 751
Query: 659 VENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGR---- 714
DVV V+ +KN G+ G L+VK +G+
Sbjct: 752 ----DVVS------------------VSFTLKNTGKYEGDEVAQLYVK--YQGSDSQVKL 787
Query: 715 PIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
P+KQL GF+ + L E +I + E E G G + +VG
Sbjct: 788 PLKQLKGFERIHLKKGESKQINLTVPKSELRFWNEEKGEFYTPAGDYLFMVG 839
>gi|375359159|ref|YP_005111931.1| putative exported hydrolase [Bacteroides fragilis 638R]
gi|423283738|ref|ZP_17262622.1| hypothetical protein HMPREF1204_02160 [Bacteroides fragilis HMW
615]
gi|301163840|emb|CBW23395.1| putative exported hydrolase [Bacteroides fragilis 638R]
gi|404580776|gb|EKA85484.1| hypothetical protein HMPREF1204_02160 [Bacteroides fragilis HMW
615]
Length = 859
Score = 272 bits (696), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 224/802 (27%), Positives = 360/802 (44%), Gaps = 142/802 (17%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQ------------------------------- 71
F + +LP+ R +DL+SR+TL+EKI+Q
Sbjct: 24 FKYKNASLPVEVRVQDLLSRMTLEEKIAQMRHIHAYSIMENGKLNEEKLEKMIGGQNYGF 83
Query: 72 -------------LVNSAPAI----PRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGAT 114
L+N RLGIP + +E+LHG G+T
Sbjct: 84 IEGITLPGKECLTLMNEVQKYMREKTRLGIPVFTL-TESLHG-----------SVHDGST 131
Query: 115 SFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQE 174
FPQ I ++F+ L Y + AI E A Q I + P I++ RD RWGR +E
Sbjct: 132 IFPQAIALGSTFNPILAYEMTSAIAKELTA-----QGITQSL-TPVIDVCRDLRWGRVEE 185
Query: 175 TPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFD 234
GEDP + + VS VRG + Q S KHF A+ G + +
Sbjct: 186 CFGEDPYLVSRMGVSQVRGYLDN-----------QVSPMIKHFGAH------GAPQGGLN 228
Query: 235 -ARVT--MQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGF 291
A V+ ++L Y FE+ VK+ + +M +YN N P+ + L+++ R +W F
Sbjct: 229 LASVSCGQRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNEPNSSSHYLMTELLRDRWDF 288
Query: 292 HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPES 351
GY+ SD A+ ++ A++ +A + L AG+D + V+ L
Sbjct: 289 QGYVYSDWGAIGMLNYFHKTAQNSAEAAIQALTAGLDAEASDNSYAELQQLVENGMLDVK 348
Query: 352 EIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSH 411
ID+A+ + + + +GLF P P K VV +PAH LA + A++ IVLL+N +
Sbjct: 349 YIDQAVARILTAKFNMGLFE-YPL--PMEKNYDKVVHAPAHVSLARKIAEESIVLLQNEN 405
Query: 412 GLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV---YYPGC 468
+LPL +K S+A+IGPNA+ + ++ + +T L+AL+ V N + Y GC
Sbjct: 406 NILPLQMNKLKSIAVIGPNADQVQFGDYTWSRDNKDGVTLLEALKERVGNQLTLNYAKGC 465
Query: 469 DTVACSSASIDKAVDIAKGADHVVLMMG---------LDQTQEKEELDRVDLVLPGRQQE 519
D V + +AVD+AK +D ++++G E D DL L G Q++
Sbjct: 466 DLVTDDCSGFKEAVDVAKKSDVCIVVVGSASASLARDYSNATCGEGFDLSDLTLTGVQED 525
Query: 520 LITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHN 579
L+ + A KPVI+VLL G P +++ K NI I+ YPGE G +ALA+++ G N
Sbjct: 526 LVEAI-HATGKPVIVVLLSGKPFAMSWIK--ENIPGIVVQWYPGEQGGLALADMLLGKVN 582
Query: 580 PGGRLPMTWYPQD-------YIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLS 632
P G+L + +PQ Y +P R + PG+ Y F K ++ FG GLS
Sbjct: 583 PSGKLNYS-FPQSVGHLPCYYNYLPTDKGFYRSPGSKNKPGKDYVFSSPKALWAFGHGLS 641
Query: 633 YSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNH 692
Y+ + YL+ ++S + +DV+ VTI ++N
Sbjct: 642 YTDFE-----------YLSATTSKEDYACEDVIE------------------VTIAIRNT 672
Query: 693 GEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDG 752
G+ G ++V+ P+++L GF+ V++ E +++ ++ P L+ ++
Sbjct: 673 GDYDGLEVPQVYVRDMVSSVVMPVQELKGFEKVLIKKGETKQVIIKI-PVSELALYNKEM 731
Query: 753 LMVIEEGTHFLVVGDEEYPISI 774
V+E G L +G I I
Sbjct: 732 KKVVEPGAFELQIGRASDDIRI 753
>gi|423313768|ref|ZP_17291703.1| hypothetical protein HMPREF1058_02315 [Bacteroides vulgatus
CL09T03C04]
gi|392684303|gb|EIY77631.1| hypothetical protein HMPREF1058_02315 [Bacteroides vulgatus
CL09T03C04]
Length = 788
Score = 272 bits (695), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 218/761 (28%), Positives = 348/761 (45%), Gaps = 131/761 (17%)
Query: 51 PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WSEALHGVAGVGKGI 104
P+ R +DL+S++TL+EK Q+ + + + +P W W + + + G+
Sbjct: 49 PLEDRVQDLLSQMTLEEKTCQMATLYGSGRVLKDALPQNNWKTEVWKDGIGNIDEEHNGL 108
Query: 105 ------------------------------------FFNGTIRG-----ATSFPQVILTA 123
F N IRG AT FP
Sbjct: 109 GAFKSEYSFPYAKHVNAKHTIQRWFVEKTRLGIPVDFTNEGIRGLCHDRATYFPAQCGQG 168
Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGEDPLV 182
A+++ L RIG+ EA AL G T ++P ++I +DPRWGR ET GEDP +
Sbjct: 169 ATWNKKLIARIGEVEAKEAVAL-------GYTNIYSPILDIAQDPRWGRCVETYGEDPYL 221
Query: 183 TGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDL 242
G+ + +Q K A KHF Y + + + D V +++
Sbjct: 222 VGELGKQMITSLQ-----------KYNLVATPKHFAVYSIPIGGRDGKTRTDPHVAPREM 270
Query: 243 ADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAV 302
Y PF ++ A G+M +YN +G P + L++ R++WGF GY+ SD +AV
Sbjct: 271 RTLYIEPFRMAFQEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKGYVVSDSEAV 330
Query: 303 SIIYDAEGYAKSPEDAVVDVLKAGMDVNC-----GSFLQKHTKAAVKQKKLPESEIDRAL 357
I + A + ED + + AG+++ F+ K AV K+ + +D+ +
Sbjct: 331 EFISNKHKVADTYEDGIAQAVNAGLNIRTHFTPPADFILPLRK-AVDNGKISQETLDKRV 389
Query: 358 HNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLP 417
+ ++ RLGLF+ NP + GK +V S HQ ++L+AA+ +VLLKN LLPL
Sbjct: 390 AEILRIKFRLGLFD-NP-YRGNGKQAEQIVHSKEHQAVSLEAARQSLVLLKNETNLLPLS 447
Query: 418 KSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN--TVYYPGCDTV---- 471
KS S+A+IGPNAN L+ Y + T Q ++ + + +Y GCD +
Sbjct: 448 KSIR-SIAVIGPNANEQTQLICRYGPANAPIKTVYQGIKELLPHAEVIYKKGCDIIDPHF 506
Query: 472 -----------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQEL 520
A +++A+ AK A+ VV+++G ++ +E+ R L LPGRQ+EL
Sbjct: 507 PESEILDFPKTAEEVQLMEEAIRAAKQAEVVVMVLGGNELTVREDRSRTSLNLPGRQEEL 566
Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
+ V A KPVILV+L G I +A ++ +IL A +PGE A+AE +FGD+NP
Sbjct: 567 LKAVC-ATGKPVILVMLDGRASSINYAA--AHVPAILHAWFPGEFCGQAVAEALFGDYNP 623
Query: 581 GGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKF 640
GGRL +T +P+ ++P +P + + Y ++PFG GLSY+ ++Y
Sbjct: 624 GGRLAVT-FPKSVGQIPFA-FPFKPGSDESSSTSVY-----GALYPFGHGLSYTTFTYS- 675
Query: 641 KAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHP 700
L+ S S + V Q +H V+ +KN G++ G
Sbjct: 676 --------DLHISPSHQGV--QGDIH------------------VSCKIKNTGKIKGDEV 707
Query: 701 VLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSP 741
V L+++ K L GF+ + L A E+ + F L P
Sbjct: 708 VQLYLRDEISSVTTYTKVLRGFERISLKAGEEQTVHFRLRP 748
>gi|336411808|ref|ZP_08592268.1| hypothetical protein HMPREF1018_04286 [Bacteroides sp. 2_1_56FAA]
gi|335940152|gb|EGN02020.1| hypothetical protein HMPREF1018_04286 [Bacteroides sp. 2_1_56FAA]
Length = 859
Score = 272 bits (695), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 223/803 (27%), Positives = 360/803 (44%), Gaps = 142/803 (17%)
Query: 42 TFPFCKTTLPISQRARDLVSRLTLDEKISQ------------------------------ 71
F + +LP+ R +DL+SR+TL+EKI+Q
Sbjct: 23 NFKYKNASLPVEVRVQDLLSRMTLEEKIAQMRHIHAYSIMENGKLNEEKLEKMIGGQNYG 82
Query: 72 --------------LVNSAPAI----PRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
L+N RLGIP + +E+LHG G+
Sbjct: 83 FIEGITLPGKECLTLMNEVQKYMREKTRLGIPVFTL-TESLHG-----------SVHDGS 130
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQ 173
T FPQ I ++F+ L Y + AI E A Q I + P I++ RD RWGR +
Sbjct: 131 TIFPQAIALGSTFNPILAYEMTSAIAKELSA-----QGITQSL-TPVIDVCRDLRWGRVE 184
Query: 174 ETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKF 233
E GEDP + + VS VRG + Q S KHF A+ G +
Sbjct: 185 ECFGEDPFLVSRMGVSQVRGYLDN-----------QVSPMIKHFGAH------GAPQGGL 227
Query: 234 D-ARVT--MQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWG 290
+ A V+ ++L Y FE+ VK+ + +M +YN N P+ + L+++ R +W
Sbjct: 228 NLASVSCGQRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNEPNSSSHYLMTELLRDRWD 287
Query: 291 FHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPE 350
F GY+ SD A+ ++ A++ +A + L AG+D + V+ L
Sbjct: 288 FQGYVYSDWGAIGMLNYFHKTAQNSAEAAIQALTAGLDAEASDNSYAELQQLVENGMLDV 347
Query: 351 SEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNS 410
ID+A+ + + + +GLF P P K VV +PAH LA + A++ IVLL+N
Sbjct: 348 KYIDQAVARILTAKFNMGLFE-YPL--PMEKNYDKVVHAPAHVSLARKIAEESIVLLQNE 404
Query: 411 HGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV---YYPG 467
+ +LPL +K S+A+IGPNA+ + ++ + +T L+AL+ N + Y G
Sbjct: 405 NNILPLQMNKLKSIAVIGPNADQVQFGDYTWSRDNKDGVTLLEALKERAGNQLTLNYAKG 464
Query: 468 CDTVACSSASIDKAVDIAKGADHVVLMMG---------LDQTQEKEELDRVDLVLPGRQQ 518
CD V + +AVD+AK +D ++++G E D DL L G Q+
Sbjct: 465 CDLVTDDRSGFKEAVDVAKKSDVCIVVVGSASASLARDYSNATCGEGFDLSDLTLTGVQE 524
Query: 519 ELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDH 578
+L+ + A KPVI+VLL G P+ +++ K NI I+ YPGE G +ALA+++ G
Sbjct: 525 DLVEAI-HATGKPVIVVLLSGKPLAMSWIK--ENIPGIVVQWYPGEQGGLALADMLLGKV 581
Query: 579 NPGGRLPMTWYPQD-------YIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGL 631
NP G+L + +PQ Y +P R + PG+ Y F K ++ FG GL
Sbjct: 582 NPSGKLNYS-FPQSVGHLPCYYNYLPTDKGFYRSPGSKNKPGKDYVFSSPKALWAFGHGL 640
Query: 632 SYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKN 691
SY+ + YL+ ++S + +DV+ VTI ++N
Sbjct: 641 SYTDFE-----------YLSATTSKEDYACEDVIE------------------VTIAIRN 671
Query: 692 HGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRARED 751
G+ G ++V+ P+++L GF+ V++ E +++ ++ P L+ ++
Sbjct: 672 TGDYDGLEVPQVYVRDMVSSVVMPVQELKGFEKVLIKKGETKQVIIKI-PVSELALYNKE 730
Query: 752 GLMVIEEGTHFLVVGDEEYPISI 774
V+E G L +G I I
Sbjct: 731 MKKVVEPGAFELQIGRASDDIRI 753
>gi|423250669|ref|ZP_17231684.1| hypothetical protein HMPREF1066_02694 [Bacteroides fragilis
CL03T00C08]
gi|423253995|ref|ZP_17234925.1| hypothetical protein HMPREF1067_01569 [Bacteroides fragilis
CL03T12C07]
gi|392651626|gb|EIY45288.1| hypothetical protein HMPREF1066_02694 [Bacteroides fragilis
CL03T00C08]
gi|392654553|gb|EIY48200.1| hypothetical protein HMPREF1067_01569 [Bacteroides fragilis
CL03T12C07]
Length = 859
Score = 272 bits (695), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 224/803 (27%), Positives = 360/803 (44%), Gaps = 142/803 (17%)
Query: 42 TFPFCKTTLPISQRARDLVSRLTLDEKISQ------------------------------ 71
F + +LP+ R +DL+SR+TL+EKI+Q
Sbjct: 23 NFKYKNASLPVEVRVQDLLSRMTLEEKIAQMRHIHAYSIMENGKLNEEKLEKMIGGQNYG 82
Query: 72 --------------LVNSAPAI----PRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
L+N RLGIP + +E+LHG G+
Sbjct: 83 FIEGITLPGKECLTLMNEVQKYMREKTRLGIPVFTL-TESLHG-----------SVHDGS 130
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQ 173
T FPQ I ++F+ L Y + AI E A Q I + P I++ RD RWGR +
Sbjct: 131 TIFPQAIALGSTFNPILAYEMTSAIAKELSA-----QGITQSL-TPVIDVCRDLRWGRVE 184
Query: 174 ETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKF 233
E GEDP + + VS VRG + Q S KHF A+ G +
Sbjct: 185 ECFGEDPYLVSRMGVSQVRGYLDN-----------QVSPMIKHFGAH------GAPQGGL 227
Query: 234 D-ARVT--MQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWG 290
+ A V+ ++L Y FE+ VK+ + +M +YN N P+ + L+++ R +W
Sbjct: 228 NLASVSCGQRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNEPNSSSHYLMTELLRDRWD 287
Query: 291 FHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPE 350
F GY+ SD A+ ++ A++ +A + L AG+D + V+ L
Sbjct: 288 FQGYVYSDWGAIGMLNYFHKTAQNSAEAAIQALTAGLDAEASDNSYAELQQLVENGMLDV 347
Query: 351 SEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNS 410
ID+A+ + + + +GLF P P K VV +PAH LA + A++ IVLL+N
Sbjct: 348 KYIDQAVARILTAKFNMGLFE-YPL--PMEKNYDKVVHAPAHVSLARKIAEESIVLLQNE 404
Query: 411 HGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV---YYPG 467
+ +LPL +K S+A+IGPNA+ + ++ + +T L+AL+ V N + Y G
Sbjct: 405 NNILPLQMNKLKSIAVIGPNADQVQFGDYTWSRDNKDGVTLLEALKERVGNQLTLNYAKG 464
Query: 468 CDTVACSSASIDKAVDIAKGADHVVLMMG---------LDQTQEKEELDRVDLVLPGRQQ 518
CD V + +AVD+AK +D ++++G E D DL L G Q+
Sbjct: 465 CDLVTDDRSGFKEAVDVAKKSDVCIVVVGSASASLARDYSNATCGEGFDLSDLTLTGVQE 524
Query: 519 ELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDH 578
+L+ + A KPVI+VLL G P +++ K NI I+ YPGE G +ALA+++ G
Sbjct: 525 DLVEAI-HATGKPVIVVLLSGKPFAMSWIK--ENIPGIVVQWYPGEQGGLALADMLLGKV 581
Query: 579 NPGGRLPMTWYPQD-------YIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGL 631
NP G+L + +PQ Y +P R + PG+ Y F K ++ FG GL
Sbjct: 582 NPSGKLNYS-FPQSVGHLPCYYNYLPTDKGFYRSPGSKNKPGKDYVFSSPKALWAFGHGL 640
Query: 632 SYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKN 691
SY+ + YL+ ++S + +DV+ VTI ++N
Sbjct: 641 SYTDFE-----------YLSATTSKEDYACEDVIE------------------VTIAIRN 671
Query: 692 HGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRARED 751
G+ G ++V+ P+++L GF+ V++ E +++ ++ P L+ ++
Sbjct: 672 TGDYDGLEVPQVYVRDMVSSVVIPVQELKGFEKVLIKKGETKQVIIKI-PVSELALYNKE 730
Query: 752 GLMVIEEGTHFLVVGDEEYPISI 774
V+E G L +G I I
Sbjct: 731 MKKVVEPGAFELQIGRASDDIRI 753
>gi|304406707|ref|ZP_07388362.1| glycoside hydrolase family 3 domain protein [Paenibacillus
curdlanolyticus YK9]
gi|304344240|gb|EFM10079.1| glycoside hydrolase family 3 domain protein [Paenibacillus
curdlanolyticus YK9]
Length = 733
Score = 272 bits (695), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 226/742 (30%), Positives = 366/742 (49%), Gaps = 110/742 (14%)
Query: 49 TLPISQRARDLVSRLTLDEKISQLVN----SAPAIPRLGIPAYEWWSE--------ALHG 96
T I ++A L+S++TL++K+ Q+ P P G ++ E ++
Sbjct: 17 TTEIREQAEQLLSKMTLEDKVGQMTQFDWGYNPINPETGESEHDLIIELIRQGKVGSIFN 76
Query: 97 VAGVGKGIFFNGTIRGATSF--PQVILTAASFDSYLWYRIGQAIGLEARALYN------- 147
++G + G I T P VI D YR I L A +N
Sbjct: 77 LSGAAEANELQGLIEQHTELKIPMVI----GRDVIHGYRTVFPIPLAMAAAWNPEVARQT 132
Query: 148 ----AGQAI--GMTF-WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
+ +A+ G+T+ +AP I++ RDPRWGR E+ GEDP +T Y ++V G Q D N
Sbjct: 133 SAAASTEALTDGVTWVFAPMIDVSRDPRWGRIAESIGEDPYLTAAYGRAWVEGSQID--N 190
Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
G G+ AS C KHF Y + G D ++ ++L D PPF+ V+ G S
Sbjct: 191 G---PGRATAS-CPKHFAGYGMAE-AGRDYNTVD--LSDRELRDIILPPFQDAVEAGALS 243
Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDA-VSIIYDAEGYAKSPEDAV 319
IM ++N +NGIP+CA+ LL R +WGF G + SD +A V +I G A + E+A
Sbjct: 244 -IMASFNEINGIPACANEYLLKTILRDEWGFEGVVASDYNALVELI--VHGVAANEEEAC 300
Query: 320 VDVLKAG--MDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQ 377
+ AG MD++ G F ++ K V+ ++PES +D ++ + +++++LGL + +
Sbjct: 301 EMTVLAGCDMDMHSGIFTRQLPK-LVRAGRVPESVVDDSVRRILAMKIKLGLLEQSKS-- 357
Query: 378 PFGKIGADVVCSPAHQ-------VLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPN 430
DV S A Q LA +AA+ IVLL+N +LPL K+ + S+A+IGP
Sbjct: 358 -------DVSQSAATQPLKSEYVELAREAARQSIVLLQNKEQVLPLSKAGA-SIAVIGPL 409
Query: 431 ANSAKTLLGNYA--GPSCRSITPLQAL-QNYVENTV--YYPGCDTVACSSASIDKAVDIA 485
A++A LG +A G S +T L+ + Q E T Y GCD + S + A++ A
Sbjct: 410 ADNATDPLGCWALDGRSDEVVTALEGIRQAAAEGTSIRYAQGCDIDSDSEEGFEAALEAA 469
Query: 486 KGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
+ +D VV+++G T E R L LPG+Q+ L+ VA+ KP++ V+L G P +T
Sbjct: 470 RSSDVVVMLLGESATMSGESRSRAALDLPGKQRALVEAVAKLG-KPIVAVILSGRP--LT 526
Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRP 605
FA +I+ A + G A+A+V+FGD NP GRLP+T +PQ+ ++P+ + +
Sbjct: 527 FAWLPEQASAIVQAWHLGVQSGNAIADVLFGDFNPSGRLPVT-FPQNVGQIPIYHYRKK- 584
Query: 606 QATSGNPGRTYRFY----EGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVEN 661
T P Y Y + ++PFG GL+Y+++ Y S++ + ++
Sbjct: 585 --TGRPPAGAYSSYYIDSTTEPLYPFGYGLTYTEFEYGAIQTSKSSIGADE--------- 633
Query: 662 QDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVG 721
+ VT+ ++N G +AG+ V +V+ +P+K+LV
Sbjct: 634 --------------------QLDVTVSIRNVGNLAGEEVVQCYVRDEVASVTQPLKRLVA 673
Query: 722 FQSVILNAKEKAEIVFELSPCE 743
F+ V + A E ++ F + E
Sbjct: 674 FRKVKVAAGESVDVTFTIGAAE 695
>gi|329963634|ref|ZP_08301109.1| glycosyl hydrolase family 3 protein [Bacteroides fluxus YIT 12057]
gi|328528179|gb|EGF55158.1| glycosyl hydrolase family 3 protein [Bacteroides fluxus YIT 12057]
Length = 863
Score = 272 bits (695), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 181/471 (38%), Positives = 252/471 (53%), Gaps = 43/471 (9%)
Query: 32 SCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWS 91
SC S P+ +T T+LP +RA L+ LTL+EK+S +++ A + RLGI Y WW+
Sbjct: 17 SC--SQPTYKT-----TSLPPEERAELLLKELTLEEKVSLMMDGAKPVERLGIKPYNWWN 69
Query: 92 EALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARAL--YNAG 149
EALHGVA G AT FPQ I AASF Y + A+ EARA Y A
Sbjct: 70 EALHGVARAGL----------ATVFPQPIGMAASFSPEAVYGVFDAVSDEARAKNSYYAS 119
Query: 150 QAI-----GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKL 204
Q G+T W P +NI+RDPRWGRG ET GEDP +T + V V+G+QG N G+
Sbjct: 120 QGSYERYQGLTMWTPTVNIYRDPRWGRGIETYGEDPYLTSRMGVEVVKGLQGG--NDGRY 177
Query: 205 KGKLQASACCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIM 263
+ AC KHF + W R++F+A + +DL +TY PPFE+ VK+G+ +M
Sbjct: 178 D---KLHACAKHFAVHSGPEW---NRHEFNAENIKPRDLYETYLPPFEALVKEGKVKEVM 231
Query: 264 CAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVD 321
CAYNR G P C LL + R +WGF G + SDC A++ Y+ G+ P E A
Sbjct: 232 CAYNRFEGDPCCGSNRLLMQILRDEWGFDGIVLSDCGAIADFYNDRGHKTHPDAESASAA 291
Query: 322 VLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGK 381
+ +G D+ CGS K AVK K+ E +D A+ L + R LG + P + K
Sbjct: 292 AVLSGTDLECGSSY-KALVEAVKLGKIEEKAVDVAVKRLLTARFALGEMD-EPDKVSWTK 349
Query: 382 IGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNY 441
I VV S H LAL A+ + LL N + +LPL K +++A++GPNAN + GNY
Sbjct: 350 IPFSVVASAKHDSLALDIARKSMTLLMNKNDILPL-KQGGLTVAVMGPNANDSVMQWGNY 408
Query: 442 AGPSCRSITPLQALQNYV---ENTVYYPGCDTVACSSASIDKAVDIAKGAD 489
G ++T L+ ++ + + +Y GC V + I A ++ K AD
Sbjct: 409 NGMPSHTVTILEGVRKALGADDKLIYEQGCGWV--DRSLIQSAFNLCKSAD 457
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 90/304 (29%), Positives = 137/304 (45%), Gaps = 59/304 (19%)
Query: 480 KAVDIAKGADHV------VLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITR 523
K VDI K V + + G+ + E EE+ DR D+ LP Q+E+I
Sbjct: 586 KEVDIKKSVAVVKDADVVIFVGGISPSLEGEEMGVNLPGFKKGDRTDIELPAVQREMIAA 645
Query: 524 VAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGR 583
+ + KK VI V G P I + +IL A YPG+AG A+AEV+FG +NP GR
Sbjct: 646 LHRSGKK-VIFVNCSGSP--IGLEPETKECEAILQAWYPGQAGGTAVAEVLFGKYNPAGR 702
Query: 584 LPMTWYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKA 642
LP+T+Y ++ ++P D M GRTYR+ + +FPFG GLSY+ + Y
Sbjct: 703 LPVTFY-RNVSQLPDFEDYDMS--------GRTYRYMTQEPLFPFGYGLSYTAFEYGQMV 753
Query: 643 VSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVL 702
+ QNK+ Q+ +T+ V N G+ G+ V
Sbjct: 754 LDQNKINAGQT-----------------------------LELTVPVTNAGKRDGEEVVQ 784
Query: 703 LFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHF 762
++++ +G P K L F+ V + A + + FEL E M + G +
Sbjct: 785 VYLRKRDDTDG-PNKTLRAFKRVNIPAGKTVNVDFELKDKELEWWDTATNTMRVCAGDYD 843
Query: 763 LVVG 766
++VG
Sbjct: 844 IMVG 847
>gi|423303655|ref|ZP_17281654.1| hypothetical protein HMPREF1072_00594 [Bacteroides uniformis
CL03T00C23]
gi|423307623|ref|ZP_17285613.1| hypothetical protein HMPREF1073_00363 [Bacteroides uniformis
CL03T12C37]
gi|392688019|gb|EIY81310.1| hypothetical protein HMPREF1072_00594 [Bacteroides uniformis
CL03T00C23]
gi|392689492|gb|EIY82769.1| hypothetical protein HMPREF1073_00363 [Bacteroides uniformis
CL03T12C37]
Length = 801
Score = 272 bits (695), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 214/764 (28%), Positives = 356/764 (46%), Gaps = 133/764 (17%)
Query: 51 PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WSEALHGVAGVGKGI 104
P+ R ++L+S++TL+EK Q+ + + +P+ W W + + + G+
Sbjct: 62 PLEVRVQNLLSQMTLEEKSCQMATLYGSGRVLNDALPSDNWKNEVWKDGIGNIDEEHNGL 121
Query: 105 ------------------------------------FFNGTIRG-----ATSFPQVILTA 123
F N IRG AT FP
Sbjct: 122 GSFKSAYSFPYAHHVKTKHAIQRWFVENTRLGIPVDFTNEGIRGLCHDRATYFPAQCGQG 181
Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGEDPLV 182
A+++ L +IG+A EA L G T ++P ++I +DPRWGR ET GEDP
Sbjct: 182 ATWNKELIAQIGEAEAREASVL-------GYTNIYSPILDIAQDPRWGRCVETYGEDPYH 234
Query: 183 TGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDL 242
G+ + +Q K KL ++ KHF Y + + + D V +++
Sbjct: 235 AGQMGKQMILSLQ---------KNKLVSTP--KHFAVYSIPVGGRDGKTRTDPHVAPREM 283
Query: 243 ADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAV 302
Y PF + A G+M +YN +G P + L++ R++WGF GY+ SD +AV
Sbjct: 284 RTLYLDPFRVAFHEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKGYVVSDSEAV 343
Query: 303 SIIYDAEGYAKSPEDAVVDVLKAGMDVNC-----GSFLQKHTKAAVKQKKLPESEIDRAL 357
I A EDAV + AG+++ F+ ++AVK+ K+ + +++ +
Sbjct: 344 EFISTKHQVANGYEDAVAQAVNAGLNIRTHFTPPADFILP-LRSAVKKGKISQETLNQRV 402
Query: 358 HNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLP 417
+ V+ LGLF+ NP + G +V SP HQ LAL+AA+ +VLLKN H LPL
Sbjct: 403 AEILRVKFWLGLFD-NPYRGDEKRAG-QIVHSPEHQQLALEAARQSLVLLKNEHQTLPLS 460
Query: 418 KSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL-QALQNYV--ENTVYYPGCDTV--- 471
KS S+A+IGPNA+ + L+ Y GP+ IT + + ++ + + VY GCD +
Sbjct: 461 KSIR-SVAVIGPNADERQQLICRY-GPANAHITTIYEGIKKMLPQADVVYKKGCDIIDPH 518
Query: 472 ------------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQE 519
A + +++A++ AKGA+ V+++G ++ +E+ R L LPGRQ+E
Sbjct: 519 FPESEVLEFPKAAQEAQMMEEAIEAAKGAEVTVMVLGGNELTVREDRSRTSLDLPGRQEE 578
Query: 520 LITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHN 579
L+ ++ + KPV+LV++ G I FA ++ +I+ A +PGE G A+AE +FGD+N
Sbjct: 579 LLKKICQLG-KPVVLVMIDGRASSINFAA--THVPAIIHAWFPGEFGGQAIAEALFGDYN 635
Query: 580 PGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYK 639
PGGRL +T +P+ ++P +P + + Y ++PFG GLSY+ + Y
Sbjct: 636 PGGRLAVT-FPKSVGQIPFA-FPFKPGSDESSETSVY-----GALYPFGHGLSYTTFQYS 688
Query: 640 FKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKH 699
+S +K + + S ++ +KN G+ G
Sbjct: 689 DLVISPSKQGVQGNIS-----------------------------ISCTIKNIGQREGDE 719
Query: 700 PVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCE 743
V L+++ + L GF+ + L + + FEL+P E
Sbjct: 720 VVQLYLRDEVSSVTTYTQVLRGFERITLKPEASHTVHFELTPQE 763
>gi|295689400|ref|YP_003593093.1| glycosyl hydrolase family protein [Caulobacter segnis ATCC 21756]
gi|295431303|gb|ADG10475.1| glycoside hydrolase family 3 domain protein [Caulobacter segnis
ATCC 21756]
Length = 895
Score = 272 bits (695), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 167/432 (38%), Positives = 240/432 (55%), Gaps = 48/432 (11%)
Query: 46 CKTTLPI----SQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVG 101
+TT P Q A DLV+++T +EK+ QL+N+APAIPRLG+PAY WW+E+LHG G
Sbjct: 29 AQTTSPTRQMADQMAADLVAKMTTEEKLDQLLNTAPAIPRLGVPAYNWWTESLHGALGS- 87
Query: 102 KGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------- 153
T+FP+ + AA+FD+ L +G AIG E R L+ +A G
Sbjct: 88 ---------LPTTNFPEPVGLAATFDTDLVKDVGGAIGAEVRGLHTLARATGRMGRIGTA 138
Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
+ W+PNINIFRDPRWGRGQET GEDP + + V++V GVQG + Q A
Sbjct: 139 LNTWSPNINIFRDPRWGRGQETYGEDPHLAARMGVAFVEGVQGPDPD------LPQVIAT 192
Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
KH+ + + +TR+ + V+ +DL DTY P F + + + +A IMCAYNR++G P
Sbjct: 193 PKHYAVH---SGPESTRHHANVYVSRRDLEDTYLPAFRAAIVEAKAGSIMCAYNRIDGQP 249
Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG- 332
+CA LL R W F GY+ SDCDAV I D YA AV ++AG+D C
Sbjct: 250 ACASDMLLKDYLRTAWKFDGYVVSDCDAVKDINDNHKYAPDAATAVAAAVRAGVDNECNG 309
Query: 333 ------SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKI--GA 384
+ L + A++Q + ++D+AL L+S R+R+G P ++P
Sbjct: 310 ATLTDTAGLSNRYRDALQQGLITIGDVDQALVRLYSARLRVGDL---PGVRPVDTAPKAP 366
Query: 385 DVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGP 444
V +P H+ LAL A++ +VLLKN GLLPL + +A+IGP ++ + L GNY+
Sbjct: 367 SAVMTPEHEALALAASEKSLVLLKND-GLLPL--KPGLRVAVIGPLGDATRVLRGNYS-- 421
Query: 445 SCRSITPLQALQ 456
S S P+ ++
Sbjct: 422 SALSAPPISVVE 433
Score = 116 bits (290), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 90/288 (31%), Positives = 132/288 (45%), Gaps = 52/288 (18%)
Query: 489 DHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLC 538
D +V ++GL E EE D+ L +P QQ L+ + A+A KP+++V +
Sbjct: 625 DVLVAVVGLTSDLEAEESPVEIPGFKGGDKTTLDIPADQQALLEQ-AKALGKPLVVVAMN 683
Query: 539 GGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPM 598
G P+++++AK N +IL A YPG++G +A+A V+ G NP GRLP+T+Y P
Sbjct: 684 GSPLNLSWAK--DNAAAILEAWYPGQSGGLAIANVLTGKTNPSGRLPLTFYKSVEDLPPF 741
Query: 599 TDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKM 658
D +M GRTYR++ G+ V+PFG GLSY+++ Y V K Q
Sbjct: 742 GDYRMD--------GRTYRYFTGQPVYPFGYGLSYTRFDYAPLKVEPIKGDAGQG----- 788
Query: 659 VENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQ 718
VT V+N G+ AG L++ G P
Sbjct: 789 ------------------------LRVTTTVRNVGQRAGDEVAQLYLN-FPNSPGAPRVA 823
Query: 719 LVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
L GFQ V L E + F LS LS DG+ + G + + VG
Sbjct: 824 LRGFQRVSLKPGEAKSVTFSLS-SRDLSSVDPDGVRQVMTGRYKVSVG 870
>gi|410097652|ref|ZP_11292633.1| hypothetical protein HMPREF1076_01811 [Parabacteroides goldsteinii
CL02T12C30]
gi|409223742|gb|EKN16677.1| hypothetical protein HMPREF1076_01811 [Parabacteroides goldsteinii
CL02T12C30]
Length = 780
Score = 272 bits (695), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 219/721 (30%), Positives = 354/721 (49%), Gaps = 114/721 (15%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIP + E HG +G T FP + A+++++ L +R+G+AI L
Sbjct: 125 RLGIPVL-FAEECPHGHMAIG-----------TTVFPTSLSQASTWNAELMHRMGEAIAL 172
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
EAR+ G IG + P ++I R+PRW R +ET GEDP++T V++++G+QG + N
Sbjct: 173 EARS---QGANIG---YGPVLDIAREPRWSRMEETFGEDPVLTTHLGVAFMKGMQGKSQN 226
Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
GK + KHF AY + A V M+ L Y PPF+ V++G A+
Sbjct: 227 DGK-----HLYSTLKHFAAYGIPE---AGHNGARANVGMRQLFSDYLPPFKKAVEEGVAT 278
Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
IM +YN ++G+P +++ LL+ R QWGF G++ SD ++ I A AK ++A V
Sbjct: 279 -IMTSYNTIDGVPCTSNKYLLTDVLRDQWGFKGFVYSDLTSIEGIVGAR-VAKDNKEAAV 336
Query: 321 DVLKAGMDVNC-GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
LKAG+D++ G+ K+ + A+++ + +++RA+ N+ ++ R+GLF NP + P
Sbjct: 337 LALKAGLDMDLGGNAYGKNLQKALEEGAITMDDLNRAVANVLRLKFRMGLFE-NPYVSP- 394
Query: 380 GKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLG 439
+ VV S AH+ LA + A++GIVLLKN G+LPL K+ ++A+IGPNA+ LG
Sbjct: 395 -EQAKQVVRSKAHKELAREVAREGIVLLKN-EGVLPLKKNIG-NIAVIGPNADMMYNQLG 451
Query: 440 NYAGPSCRS--ITPLQALQNYVENTV---YYPGCDTVACSSASID--------------- 479
+Y P R +T L ++ V + Y GC ++++I
Sbjct: 452 DYTAPQEREEIVTVLDGIRKAVSPSTKVNYVKGCAIRDITTSNITAAVEAARAADAVVLV 511
Query: 480 ---------KAVDIAKGADHVV-----LMMGLDQTQEKEELDRVDLVLPGRQQELITRVA 525
K I GA V L+ +D E DR L L G Q++L+ VA
Sbjct: 512 VGGSSARDFKTKYIGTGAADVSNDGNQLLSDMDC---GEGYDRSTLRLLGDQEKLLKAVA 568
Query: 526 EAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLP 585
A KP++++ + G +++ A ++L A YPGE G A+A+V+FGD+NP GRLP
Sbjct: 569 -ATGKPLVVIYIQGRTLNMNLA--SEKAQALLTAWYPGEQGGTAIADVLFGDYNPAGRLP 625
Query: 586 MTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQ 645
+ VP ++ ++ P S R Y EG ++ FG GLSY+K+ Y
Sbjct: 626 --------VSVPRSEGQL-PLFYSQGKQRAYVEEEGTPLYAFGYGLSYTKFDY------- 669
Query: 646 NKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFV 705
++L + + + +++ V+ V N G+ G+ V L++
Sbjct: 670 SQLEMQKGNGKDVLQT-----------------------VSCTVTNTGDCDGEEVVQLYI 706
Query: 706 KPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVV 765
+ L F+ + L E ++ F L E LS + V+E G ++V
Sbjct: 707 CDKVASVSQSPILLKAFERISLKKGESKKVTFTLGE-EELSLYNMEMKQVVEPGDFKVMV 765
Query: 766 G 766
G
Sbjct: 766 G 766
>gi|347736808|ref|ZP_08869356.1| putative exo-1,4-beta-xylosidase xlnD [Azospirillum amazonense Y2]
gi|346919574|gb|EGY01048.1| putative exo-1,4-beta-xylosidase xlnD [Azospirillum amazonense Y2]
Length = 903
Score = 272 bits (695), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 171/418 (40%), Positives = 230/418 (55%), Gaps = 42/418 (10%)
Query: 54 QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
Q+A+ LV+R+TLDEK+ QLVN APAIPRL IPAY WW+E+LHG G
Sbjct: 43 QKAQSLVARMTLDEKLGQLVNVAPAIPRLDIPAYNWWTESLHGAIGAVP----------T 92
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG--------MTFWAPNINIFR 165
T+FP+ I AA+FD+ L + + + +E RAL+ G+A G + W+PNINI+R
Sbjct: 93 TNFPEPIGLAATFDAPLVHDVAGTVSVEVRALHTLGRATGHLGRIGTGLDTWSPNINIYR 152
Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
DPRWGRGQET GEDP +T + AV++V G+QG + A KHF + +
Sbjct: 153 DPRWGRGQETYGEDPYLTARLAVAFVGGMQGPNPD------LPDVVATPKHFAVH---SG 203
Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
+TR+ + V+ DL DTY P F + + + RA IMCAYNR++G P+CA LL
Sbjct: 204 PESTRHAANVYVSPHDLEDTYLPAFRAAIVEARAGSIMCAYNRIDGQPACASDLLLKDHL 263
Query: 286 RRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG--------SFLQK 337
R W F GY+ SDCDAV I D YA AV LKAG+D C L
Sbjct: 264 RGAWHFTGYVVSDCDAVKDIADNHKYAPDQAAAVAAALKAGVDNECNVATLFGGGGGLAA 323
Query: 338 HTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLAL 397
K A + + ++ID AL LFS R R G G +P + + + HQ LA
Sbjct: 324 RYKEAYTRGLIGAADIDLALVRLFSARYRNGDLPG--LDRPKQDVPVQAIGTAEHQQLAR 381
Query: 398 QAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL 455
+ A + +VLLKN G+LPL V +A+IGP+ +S + L GNY+ S S P+ L
Sbjct: 382 KVAGESMVLLKND-GVLPL--KPGVKVAVIGPHGDSTRVLRGNYS--SALSAPPVSVL 434
Score = 125 bits (313), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 97/281 (34%), Positives = 133/281 (47%), Gaps = 52/281 (18%)
Query: 496 GLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
GL E EE+ D+ L LP Q+ L+ + A+A KPV++VLL G P+D++
Sbjct: 640 GLSADLEGEEMKRQIEGFAGGDKTSLDLPSEQRRLL-QAAKATGKPVVVVLLNGSPLDLS 698
Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRP 605
+AK N +I+ A YPG+AG +A+A+V+ G NP GRLP+T+Y P D M
Sbjct: 699 WAK--DNAAAIVEAWYPGQAGGLAVADVLSGKTNPAGRLPLTFYRSVADLPPFDDYAMT- 755
Query: 606 QATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVV 665
GRTYR++ G+ V+PFG GLSY+++ Y V
Sbjct: 756 -------GRTYRYFTGQAVYPFGYGLSYTRFDYGPVTVE--------------------- 787
Query: 666 HYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSV 725
P G R VT V N G+ AG L++KP G P L GFQ +
Sbjct: 788 -----PTHGGAENGVR---VTATVTNTGDRAGDEVAQLYLKPPAF-EGAPRLALRGFQRL 838
Query: 726 ILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
L E + FELSP LS DG + GT+ + VG
Sbjct: 839 SLAPGESRTVSFELSP-RDLSFVTRDGDRQVMVGTYQVSVG 878
>gi|393786524|ref|ZP_10374660.1| hypothetical protein HMPREF1068_00940 [Bacteroides nordii
CL02T12C05]
gi|392660153|gb|EIY53770.1| hypothetical protein HMPREF1068_00940 [Bacteroides nordii
CL02T12C05]
Length = 841
Score = 271 bits (694), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 226/802 (28%), Positives = 357/802 (44%), Gaps = 138/802 (17%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WSEALHGV- 97
F + P+ +R +DL+S++T++EK QL + + +P W W + + +
Sbjct: 82 FEDPSQPVEKRVKDLLSQMTIEEKSCQLATLYGFGRVLKDSLPTPAWKEAIWKDGIANID 141
Query: 98 ---AGVGKGI-----------------------------------FFNGTIRG-----AT 114
GVG+G F N I G AT
Sbjct: 142 EQLNGVGRGAKRVPHLIVPFSNHVKAINETQRWFIEETRLGIPVDFSNEGIHGLNHTKAT 201
Query: 115 SFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQ 173
P I +++++ L G+ +G EAR L G T +AP +++ RDPRWGR
Sbjct: 202 PLPAPIAIGSTWNTELVREAGEIVGKEARVL-------GYTNVYAPILDVVRDPRWGRTL 254
Query: 174 ETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKF 233
E GEDP + G+ V V G+Q +A KHF Y +
Sbjct: 255 ECYGEDPYLIGELGVQMVDGIQSQG-----------VAATLKHFAVYSSPKGGRDGNCRT 303
Query: 234 DARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHG 293
D VT ++L + Y PF+ ++Q G+M +YN NG P + L+K R ++GF G
Sbjct: 304 DPHVTPRELHEIYLYPFKHVIQQSHPMGVMSSYNDWNGEPVTSSYYFLTKLLREEYGFDG 363
Query: 294 YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC-----GSFLQKHTKAAVKQKKL 348
Y+ SD AV ++ A+ ++AV VL+AG++V F+ + + + K+
Sbjct: 364 YVVSDSQAVEFVHTKHQVAEDYDEAVRQVLEAGLNVRTHFTPPADFILP-IRRLLAENKI 422
Query: 349 PESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLK 408
+ ID+ + + +V+ RLGLF+ P K +V + H + + +VLLK
Sbjct: 423 SMATIDKRVSEVLAVKFRLGLFDAPYRDNP--KEADEVAGADKHSEFVKEMQRQSLVLLK 480
Query: 409 NSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE---NTVYY 465
N LLPL K + + + GP A+ ++ Y +IT LQ +++Y++ VY
Sbjct: 481 NDGQLLPLNKKEIKKVLVTGPLADEDNFMISRYGPNGLPTITVLQGIKDYLKGDVEVVYS 540
Query: 466 PGCDTV---------------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVD 510
GC+ + A A +DKAV A+ AD ++ +MG D+ + E R
Sbjct: 541 KGCNIIDKEWPASEVLPAVLTAEEVADMDKAVSEAQSADVIIAVMGEDEYRVGESRSRTS 600
Query: 511 LVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVAL 570
L LPGRQ+EL+ + A KPV+LVL+ G P+ I + D+N+ +IL A +P G +
Sbjct: 601 LELPGRQRELL-QALHATGKPVVLVLINGQPLTINWE--DQNLPAILEAWFPSFQGGKII 657
Query: 571 AEVIFGDHNPGGRLPMTWYP----QDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFP 626
AE +FGD+NPGG+L +T +P Q + P Q +SG G G ++P
Sbjct: 658 AETLFGDYNPGGKLTVT-FPKSVGQIELNFPFKKGSHGTQPSSGPNGSGSTRVLGA-LYP 715
Query: 627 FGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVT 686
FG GLSY+ ++Y N +V + P GT+ + ++
Sbjct: 716 FGYGLSYTTFAYS---------------------NLEV----TAPAKGTQ----GEVQIS 746
Query: 687 IGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPC--ES 744
+ N G+ AG+ L+V+ +L GFQ V+L E + F L P E
Sbjct: 747 FDITNTGKYAGEEVAQLYVRDLVSSVVTYDSRLRGFQRVLLQPNETKRMHFTLKPADLEL 806
Query: 745 LSRAREDGLMVIEEGTHFLVVG 766
L R E +E GT + VG
Sbjct: 807 LDRNME---WTVESGTFEVRVG 825
>gi|393789624|ref|ZP_10377744.1| hypothetical protein HMPREF1068_04024 [Bacteroides nordii
CL02T12C05]
gi|392650340|gb|EIY44009.1| hypothetical protein HMPREF1068_04024 [Bacteroides nordii
CL02T12C05]
Length = 855
Score = 271 bits (694), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 163/428 (38%), Positives = 232/428 (54%), Gaps = 34/428 (7%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
F T P +R DL++RLT++EK+S LVN A IPRL I Y +EALHGV G+
Sbjct: 29 FRDMTAPQHERILDLLNRLTVEEKVSLLVNDAREIPRLNIDKYNHGNEALHGVVRPGE-- 86
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN---------AGQAIGMT 155
T FPQ I AA+++ L +R+ AI EAR + AG + +T
Sbjct: 87 --------FTVFPQAIGLAATWNPNLIFRVSTAISDEARGRWKELDYGKKQIAGGSDLLT 138
Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCK 215
FW+P +N+ RDPRWGR ET GEDP ++G+ +V+G+QGD L+ + K
Sbjct: 139 FWSPTVNMARDPRWGRTPETYGEDPFLSGRIGCEFVKGLQGDN------PRYLKTVSTPK 192
Query: 216 HFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSC 275
HF A N + R +AR++ +DL + Y P FE C+ G+A IM AYN VN +P
Sbjct: 193 HFAA----NNEEHNRSSCNARMSERDLREYYLPAFERCIVDGKAQSIMMAYNAVNDVPCT 248
Query: 276 ADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFL 335
+ L+ K R W F+GYI SDC A + Y K+ E A LKAG+D+ CG +
Sbjct: 249 VNIYLIKKVLRGDWNFNGYIVSDCSAPEWMVTKHKYVKNLEAAATLALKAGLDLECGDRV 308
Query: 336 QKH-TKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
A + + E+EID A +++ RM LGLF+ +P+ P+ KI V+ HQ
Sbjct: 309 YTAPLLKAYNEYMVSEAEIDSAAYHILRGRMLLGLFD-DPSQNPYNKIEPSVIGCKEHQE 367
Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQ 453
LAL+ A+ +VLLKN LPL + K S+A++G +A A G+Y+G P ++ L
Sbjct: 368 LALETARQSMVLLKNQKNFLPLNRKKIRSIAVVGISA--AHCEFGDYSGNPKNTPVSVLD 425
Query: 454 ALQNYVEN 461
++ Y EN
Sbjct: 426 GIKKYAEN 433
Score = 126 bits (317), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 91/291 (31%), Positives = 136/291 (46%), Gaps = 44/291 (15%)
Query: 484 IAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVD 543
+AK D V ++G++++ E+E DR L LP QQE I + + V+ VL+ G +
Sbjct: 599 VAKECDVTVAVLGINKSIEREGQDRYSLELPIDQQEFIKELYKVNPNTVV-VLVAGSSMA 657
Query: 544 ITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKM 603
I + D N+ +IL A YPGE G A+AEV+FGD+NPGGRLP+T+Y ++P D
Sbjct: 658 INWM--DENVPAILNAWYPGEQGGNAVAEVLFGDYNPGGRLPLTYY-NSLDELPAFD--- 711
Query: 604 RPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQD 663
RTY+++EGK ++ FG GLSY+ + YK K++ Q + D
Sbjct: 712 ----DYSVKNRTYQYFEGKPLYEFGYGLSYTNFKYKKKSIMQ---------------SND 752
Query: 664 VVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQ 723
V +T + N G+ G ++V+ G P+KQL GF
Sbjct: 753 TVD------------------ITFNLSNVGKYDGDEVAQVYVRYPETGTYMPLKQLKGFS 794
Query: 724 SVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
V L + A+I + E + V G + VG ISI
Sbjct: 795 RVHLKKGKSADITISIPKKELRYWDEKTRQFVTPTGEYVFQVGGSSENISI 845
>gi|427384989|ref|ZP_18881494.1| hypothetical protein HMPREF9447_02527 [Bacteroides oleiciplenus YIT
12058]
gi|425728250|gb|EKU91109.1| hypothetical protein HMPREF9447_02527 [Bacteroides oleiciplenus YIT
12058]
Length = 862
Score = 271 bits (694), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 164/446 (36%), Positives = 242/446 (54%), Gaps = 38/446 (8%)
Query: 41 ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
+T P+ L ++RA+DLVSRLTL+EK + + + AIPRLGI + WWSEALHGVA
Sbjct: 19 QTLPYQNPELSPAERAKDLVSRLTLEEKALLMCDDSEAIPRLGIKKFNWWSEALHGVANQ 78
Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG------- 153
G T FP+ + AASF+ L + I A E RA +N G
Sbjct: 79 G----------NVTVFPEPVGMAASFNEKLVFEIFNATSDEMRAKHNERVRNGLEDTRFH 128
Query: 154 -MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASA 212
++ W PN+NIFRDPRWGRGQET GEDP +T + ++ V+G+QG K + L A
Sbjct: 129 SLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGIAVVKGLQGP--EDEKYRKLL---A 183
Query: 213 CCKHFTAYDLDNWKGTTRYKFD-ARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
C KH+ + W +R+ + ++ +DL +TY P F++ V++ +MCAY R++
Sbjct: 184 CAKHYAVHSGPEW---SRHSANLNNISPRDLWETYLPAFKALVQKADVREVMCAYQRLDD 240
Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLK---AGMD 328
P C LL + R +WGF + SDC A++ + + KS DAV +K AG D
Sbjct: 241 DPCCGSTRLLQQILRDEWGFKYLVVSDCGAIADFWTSH---KSSSDAVHAAVKGTMAGTD 297
Query: 329 VNCG-SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVV 387
V CG + + AV + + E EID+ + L R LG + +P++ + +I VV
Sbjct: 298 VECGYGYAYQKLPEAVSRGLITEEEIDKHVLRLLEGRFELGEMD-DPSLVKWSQIPMSVV 356
Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
S AH+ L+L ++ + LL+N + +LPL KS +A+IGPNA+ L GNY G +
Sbjct: 357 NSKAHKDLSLNMSRQTMTLLQNKNNVLPLSKSIR-KIAVIGPNADDKPMLWGNYNGTPNQ 415
Query: 448 SITPLQALQNYVENT--VYYPGCDTV 471
+IT L + ++ +Y GCD V
Sbjct: 416 TITILDGFKTKLKKNQIIYMKGCDLV 441
Score = 90.9 bits (224), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 73/277 (26%), Positives = 121/277 (43%), Gaps = 53/277 (19%)
Query: 479 DKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAA 528
D+++ K D VV + G+ E EE+ DR D+ LP Q+ + + EA+
Sbjct: 591 DESISKLKNVDMVVFVGGISPQLEGEEMPLNLPGFKNGDRTDIELPAVQRNFLKALKEAS 650
Query: 529 KKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTW 588
K+ ++ + C G + + +IL A Y GE G A+A+V+FGD+NP G+LP+T+
Sbjct: 651 KQ--VVFVNCSGS-SMALLPETESCDAILQAWYGGELGGQAVADVLFGDYNPSGKLPVTF 707
Query: 589 YPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKL 648
Y D M+ GRTYR Y +FPFG GLSY+ ++ S+ +L
Sbjct: 708 YKSTKQLPDYEDYSMK--------GRTYR-YMSDPLFPFGFGLSYTDFTVGTAQCSKTQL 758
Query: 649 YLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPA 708
++ +T+ + N G+ +G + ++++
Sbjct: 759 RTEEA-----------------------------LTLTVPISNTGKRSGTEVIQVYIRKT 789
Query: 709 RRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESL 745
G P+K L + L A +I +L P ES
Sbjct: 790 DDTGG-PLKSLKAYARAELAAGATQDIEIQL-PAESF 824
>gi|397691065|ref|YP_006528319.1| glycoside hydrolase family 3 protein [Melioribacter roseus P3M]
gi|395812557|gb|AFN75306.1| glycoside hydrolase family 3 protein [Melioribacter roseus P3M]
Length = 769
Score = 271 bits (694), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 222/722 (30%), Positives = 352/722 (48%), Gaps = 112/722 (15%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIP + E LHG+A + ATS+P I AA+F+ L +I AI
Sbjct: 110 RLGIPVI-FHEECLHGLAA-----------KDATSYPVPIGLAATFNPELIEKIFSAIAE 157
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
+AR+ A QA+ P +++ RDPRWGR +ET GED + + ++ V+G+QGD
Sbjct: 158 DARS-RGAHQAL-----TPVVDVVRDPRWGRVEETFGEDTYLVSQMGIASVKGLQGD--- 208
Query: 201 GGKLKGKLQASACCKHFTAYDL-DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRA 259
G L + A KHF A+ ++ F R L DT+ PF+ + +
Sbjct: 209 -GSLNNNNKVIATLKHFAAHGQPESGTNCAPANFSERF----LRDTFLMPFKEAIDKAGV 263
Query: 260 SGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAE-----GYAKS 314
+M +YN ++GIPS A++ LL K R +W F G++ SD A++ ++ E G A +
Sbjct: 264 ISVMASYNEIDGIPSHANKWLLRKVLRDEWNFKGFVVSDYYAITELFHKEETVSHGVAAN 323
Query: 315 PEDAVVDVLKAGMDV-----NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGL 369
+A L+AG+++ +C L + K + ES+ID + + + LGL
Sbjct: 324 KVEAAKLALEAGVNIEFPNPDCYPNLTEMVKGGLAD----ESDIDALVLPMLKYKFELGL 379
Query: 370 FNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGP 429
F+ NP ++ + G + LALQAA++ I LLKN LLPL K + A+IGP
Sbjct: 380 FD-NPYVE--AEPGQFENKLEQDRELALQAARETITLLKNEGNLLPLKDFKKI--AVIGP 434
Query: 430 NANSAKTLLGNYAGPSCRSITPLQALQNYV--ENTVYY-PGCDTVACSSAS--------- 477
NA+ +TLLG Y G + Q +++ V V+Y GC S +
Sbjct: 435 NAD--RTLLGGYHGTPKYYTSVYQGIKDKVGKNGEVFYSEGCKITVGGSWNDDEVILPDP 492
Query: 478 ------IDKAVDIAKGADHVVLMMGLDQTQEKEEL------DRVDLVLPGRQQELITRVA 525
I++AV +A+ +D VL++G ++ +E DR L L GRQ +L+ +
Sbjct: 493 AEDEKLINEAVAVAQKSDVAVLVLGGNEQTSREAWNKKHLGDRPSLELVGRQNKLVEEIL 552
Query: 526 EAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLP 585
+ K PV+++L G P I F K N+ +IL Y G+ A+A+V+FGD+NP G+LP
Sbjct: 553 KTGK-PVVVLLFNGRPNSIGFIK--DNVPAILECWYLGQETGRAVADVLFGDYNPSGKLP 609
Query: 586 MTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQ 645
++ P+ +P +P A G Y F + +F FG GLSY+K+S+ +S+
Sbjct: 610 VS-IPRSAGHIP-AHYSHKPSARRG-----YLFDDVSPLFAFGYGLSYTKFSFDNLRLSK 662
Query: 646 NKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFV 705
+ + ++ S V+I VKN G +AG+ V L++
Sbjct: 663 DTISADEKVS-----------------------------VSIEVKNEGAIAGEEVVQLYI 693
Query: 706 KPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVV 765
+ RP+K+L GF+ + L + + +VFEL P E L+ D +E G ++V
Sbjct: 694 RDKVSSVTRPVKELKGFRKITLAPGQTSTVVFELLP-EHLAFTNVDMKFTVEPGEFEIMV 752
Query: 766 GD 767
G+
Sbjct: 753 GN 754
>gi|383125190|ref|ZP_09945844.1| hypothetical protein BSIG_4346 [Bacteroides sp. 1_1_6]
gi|251838523|gb|EES66609.1| hypothetical protein BSIG_4346 [Bacteroides sp. 1_1_6]
Length = 853
Score = 271 bits (693), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 162/424 (38%), Positives = 237/424 (55%), Gaps = 34/424 (8%)
Query: 51 PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
P+ +R DL+SRLT++EKIS L ++P IPRLGI Y +EALHGV G+
Sbjct: 36 PVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPGR-------- 87
Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ---------AIGMTFWAPNI 161
T FPQ I AA+++ L R+ I EARA +N + +TFW+P +
Sbjct: 88 --FTVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQFSDVLTFWSPTV 145
Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
N+ RDPRWGR ET GEDP ++G ++V G+QGD + L+ + KHF A
Sbjct: 146 NMARDPRWGRTPETYGEDPFLSGIMGTAFVNGLQGDDPH------YLKIVSTPKHFAA-- 197
Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
N + R+ + +++ + L + Y P FE CVK+G+A+ IM AYN +N +P + LL
Sbjct: 198 --NNEEHNRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMSAYNALNDVPCTLNPWLL 255
Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTK 340
K R+ WGF GY+ SDC +++ +A Y K+ E A +KAG+D+ CG
Sbjct: 256 QKVLRQDWGFQGYVVSDCGGPALLVNAHKYVKTKEAAATLSIKAGLDLECGDDVYDGPLL 315
Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
A KQ + +++ID A +++ + RM+LGLF+ P+ KI V+ S HQ +AL AA
Sbjct: 316 NAYKQYMVSDADIDSAAYHVLTARMKLGLFDSG-ERNPYTKISPSVIGSKEHQQIALDAA 374
Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQALQNYV 459
+ IVLLKN LPL K S+A++G NA K G+Y+G P ++ LQ ++N V
Sbjct: 375 RQCIVLLKNQKNRLPLNADKLKSIAVVGINA--GKCEFGDYSGAPVVEPVSILQGIRNRV 432
Query: 460 ENTV 463
+ V
Sbjct: 433 GDRV 436
Score = 121 bits (304), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 83/278 (29%), Positives = 136/278 (48%), Gaps = 43/278 (15%)
Query: 489 DHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAK 548
+ VV +MG++++ E+E DR D+ LP Q+E + + + +I+VL+ G + I +
Sbjct: 605 ETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNPN-IIVVLVAGSSLAINWM- 662
Query: 549 YDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQAT 608
D +I +I+ A YPGE G A+AEV+FGD+NP GRLP+T+Y P D +
Sbjct: 663 -DEHIPAIVNAWYPGEQGGTAVAEVLFGDYNPAGRLPLTYYKSLDELPPFDDYDI----- 716
Query: 609 SGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYK 668
GRTY++++G ++PFG GLSYS ++Y D+
Sbjct: 717 --TKGRTYKYFKGDVLYPFGYGLSYSSFTY-----------------------SDLQVKD 751
Query: 669 SVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILN 728
V E+ F +KN G+ G ++V+ G P+K+L GF+ V L
Sbjct: 752 GVGEVTVSF----------RLKNTGKRNGDEVAQVYVRIPETGGIVPLKELKGFRRVPLK 801
Query: 729 AKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
+ E + +L+ + E G V+ +G ++VG
Sbjct: 802 SGESRRVEIKLNKEQLRYWDVEKGQFVVPKGAFDVMVG 839
>gi|423260853|ref|ZP_17241755.1| hypothetical protein HMPREF1055_04032 [Bacteroides fragilis
CL07T00C01]
gi|423266988|ref|ZP_17245970.1| hypothetical protein HMPREF1056_03657 [Bacteroides fragilis
CL07T12C05]
gi|387774614|gb|EIK36724.1| hypothetical protein HMPREF1055_04032 [Bacteroides fragilis
CL07T00C01]
gi|392697691|gb|EIY90874.1| hypothetical protein HMPREF1056_03657 [Bacteroides fragilis
CL07T12C05]
Length = 859
Score = 271 bits (693), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 224/803 (27%), Positives = 359/803 (44%), Gaps = 142/803 (17%)
Query: 42 TFPFCKTTLPISQRARDLVSRLTLDEKISQ------------------------------ 71
F + +LP+ R +DL+SR+TL+EKI+Q
Sbjct: 23 NFKYKNASLPVEVRVQDLLSRMTLEEKIAQMRHIHAYSIMENGKLNEEKLEKMIGGQNYG 82
Query: 72 --------------LVNSAPAI----PRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
L+N RLGIP + +E+LHG G+
Sbjct: 83 FIEGITLPGKECLTLMNEVQKYMREKTRLGIPVFTL-TESLHG-----------SVHDGS 130
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQ 173
T FPQ I ++F+ L Y + AI E A Q I + P I++ RD RWGR +
Sbjct: 131 TIFPQAIALGSTFNPILAYEMTSAIAKELSA-----QGITQSL-TPVIDVCRDLRWGRVE 184
Query: 174 ETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKF 233
E GEDP + + VS VRG + Q S KHF A+ G +
Sbjct: 185 ECFGEDPYLVSRMGVSQVRGYLDN-----------QVSPMIKHFGAH------GAPQGGL 227
Query: 234 D-ARVT--MQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWG 290
+ A V+ ++L Y FE+ VK+ + +M +YN N P+ + L+++ R +W
Sbjct: 228 NLASVSCGQRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNEPNSSSHYLMTELLRDRWD 287
Query: 291 FHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPE 350
F GY+ SD A+ ++ A++ +A + L AG+D + V+ L
Sbjct: 288 FQGYVYSDWGAIGMLNYFHKTAQNSAEAAIQALTAGLDAEASDNSYAELQQLVENGMLDV 347
Query: 351 SEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNS 410
ID+A+ + + + +GLF P P K VV +PAH LA + A++ IVLL+N
Sbjct: 348 KYIDQAVARILTAKFNMGLFE-YPL--PMEKNYDKVVHAPAHVSLARKIAEESIVLLQNE 404
Query: 411 HGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV---YYPG 467
+ +LPL +K S+A+IGPNA+ + ++ + +T L+AL+ V N + Y G
Sbjct: 405 NNILPLQMNKLKSIAVIGPNADQVQFGDYTWSRDNKDGVTLLEALKERVGNQLTLNYAKG 464
Query: 468 CDTVACSSASIDKAVDIAKGADHVVLMMG---------LDQTQEKEELDRVDLVLPGRQQ 518
CD V + +AVD+AK +D ++++G E D DL L G Q+
Sbjct: 465 CDLVTDDRSGFKEAVDVAKKSDVCIVVVGSASASLARDYSNATCGEGFDLSDLTLTGVQE 524
Query: 519 ELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDH 578
+L+ + A KPVI+VLL G P +++ K NI I+ YPGE G +ALA+++ G
Sbjct: 525 DLVEAI-HATGKPVIVVLLSGKPFAMSWIK--ENIPGIVVQWYPGEQGGLALADMLLGKV 581
Query: 579 NPGGRLPMTWYPQD-------YIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGL 631
NP G+L + +PQ Y +P R + PG+ Y F K ++ FG GL
Sbjct: 582 NPSGKLNYS-FPQSVGHLPCYYNYLPTDKGFYRSPGSKNKPGKDYVFSSPKALWAFGHGL 640
Query: 632 SYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKN 691
SY+ + Y L+ + S + +DV+ VTI ++N
Sbjct: 641 SYTDFEY-----------LSATISKEDYACEDVIE------------------VTIAIRN 671
Query: 692 HGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRARED 751
G+ G ++V+ P+++L GF+ V++ E +++ ++ P L+ ++
Sbjct: 672 TGDYDGLEVPQVYVRDMVSSVVMPVQELKGFEKVLIKKGETKQVIIKI-PVSELALYNKE 730
Query: 752 GLMVIEEGTHFLVVGDEEYPISI 774
V+E G L +G I I
Sbjct: 731 MKKVVEPGAFELQIGRASDDIRI 753
>gi|393720890|ref|ZP_10340817.1| beta-glucosidase [Sphingomonas echinoides ATCC 14820]
Length = 563
Score = 271 bits (693), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 162/414 (39%), Positives = 223/414 (53%), Gaps = 33/414 (7%)
Query: 63 LTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILT 122
LT +K +QL +SAPA P G+PAY+WW+E LHG+A G AT FPQ I
Sbjct: 20 LTTAQKAAQLQSSAPADPAAGLPAYDWWNEGLHGLARNGY----------ATVFPQAIGM 69
Query: 123 AASFDSYLWYRIGQAIGLEARALYNAGQAI-------GMTFWAPNINIFRDPRWGRGQET 175
AA++D+ L RIG+ + EARA +NA G+T W+PNINIFRDPRWGRGQET
Sbjct: 70 AATWDAPLLNRIGEVVATEARAKFNAQPVAADRKIYQGLTIWSPNINIFRDPRWGRGQET 129
Query: 176 PGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDA 235
GEDP +TG V+++ G+QG K+ A KHF + R FD
Sbjct: 130 YGEDPYLTGHLGVAFITGLQGPDPKHPKV------IATPKHFVVHSGPE---AGRDGFDV 180
Query: 236 RVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYI 295
+ QDL TY P F V +G+A +MCAYN ++G+P+CA+ L+ R WGF G+
Sbjct: 181 DPSPQDLEATYLPAFRLAVTEGKAQSLMCAYNAIHGVPACANAPLMVDRLRHDWGFTGFT 240
Query: 296 TSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDR 355
SDCDAV+ ++ Y A LK G D+NCG+ + A+++ + +ID
Sbjct: 241 VSDCDAVANVHLFHHYRLDAAGAAAASLKGGTDLNCGTTYAALPE-ALQRGLVTSQDIDT 299
Query: 356 ALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLP 415
A + R LG+ G P+ +I + +P H +AL+AA+ IVLLKN LP
Sbjct: 300 AFGRAMAARKALGIAYG--ATSPWARISPRDIGTPEHHAVALEAARKAIVLLKNEGDRLP 357
Query: 416 LPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNY--VENTVYYPG 467
L +ALIG +A+ L GNY G + +TPL+ L+ EN +Y G
Sbjct: 358 L--KTGTRIALIGADADDIGVLQGNYHGTAIAPVTPLEGLRQQFGAENVLYAQG 409
>gi|189464219|ref|ZP_03013004.1| hypothetical protein BACINT_00556 [Bacteroides intestinalis DSM
17393]
gi|189438009|gb|EDV06994.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 865
Score = 271 bits (693), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 152/427 (35%), Positives = 238/427 (55%), Gaps = 31/427 (7%)
Query: 51 PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
PIS R +L+S++TL+EK++QL N +IPRL +P+Y +W+E LHGVA G+
Sbjct: 55 PISARVENLISKMTLEEKVAQLSNETDSIPRLNLPSYNYWNECLHGVARAGE-------- 106
Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY-NAGQAIGMTFWAPNINIFRDPRW 169
T FPQ I A+++D+ L ++ AI EAR Y G+ G+T+W+P IN+ RDPRW
Sbjct: 107 --VTVFPQAINLASTWDTLLIKKVASAISTEARLKYLEIGK--GLTYWSPTINMARDPRW 162
Query: 170 GRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTT 229
GR +ET GEDP +T + V++V+G+QGD + L+ A KHF A + +N
Sbjct: 163 GRNEETYGEDPYLTSRLGVAFVKGLQGDHPD------YLKTVATIKHFVANNQEN----D 212
Query: 230 RYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQW 289
R+ +++ + L + Y P +E+CVK+ A +M AYN NG+ LL R++W
Sbjct: 213 RFSSSSQIPTKQLYEYYFPAYEACVKEADAQSVMTAYNAFNGVAPSGSTWLLGDVLRKEW 272
Query: 290 GFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLP 349
GF G++ SDC A+ ++ S E+A + +G D+ CG ++ AAVK +
Sbjct: 273 GFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGINSGCDLECGGTYREKLVAAVKMGLVS 332
Query: 350 ESEIDRALHNLFSVRMRLGLFNGNPT-MQPFGKIGADVVCSPAHQVLALQAAQDGIVLLK 408
E ID+AL + + R +LG F +P + P+ ++ LA +AA IVLLK
Sbjct: 333 EQAIDKALTRVLTARFKLGEF--DPIELVPYNHYDKKLLAGEKFGKLAYEAAVKSIVLLK 390
Query: 409 NSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV---YY 465
N + LP+ K K S+A++GP A++ LG Y+G +++ LQ +++ V V Y
Sbjct: 391 NDNDFLPVDKKKIRSVAIVGPFADN--NYLGGYSGKPVHNVSLLQGVKDLVGKKVKVSYM 448
Query: 466 PGCDTVA 472
G ++
Sbjct: 449 EGTSVIS 455
Score = 103 bits (258), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 90/304 (29%), Positives = 143/304 (47%), Gaps = 49/304 (16%)
Query: 474 SSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVI 533
+S IDK + GAD V++ +G D+ +E D + LP Q+ L+ + + +
Sbjct: 595 NSDQIDKVKEFVSGADLVLVALGNDEKLARENRDLPSIYLPMTQELLLKEIYKVNPRTA- 653
Query: 534 LVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDY 593
L+L G P+ +A N+ +IL A YPG+ G ALA ++FG NP G+LPMT Y +
Sbjct: 654 LILHTGNPLTSKWAA--ENVPAILQAWYPGQEGGKALAGILFGSENPSGKLPMTIYESEE 711
Query: 594 IKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQS 653
+ D + GRTY++ K ++ FG GLSYS + Y
Sbjct: 712 QLPDILDYDIW-------KGRTYQYLSSKPLYGFGHGLSYSNFEYTH------------- 751
Query: 654 SSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGN- 712
+++ DVV P+ GT C +I +KN ++AG+ V +++ R N
Sbjct: 752 -----LQSDDVVR----PD-GTLQC-------SIEIKNISDVAGEEVVQVYIS---RENT 791
Query: 713 ---GRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEE 769
P+K+LV F V L E + F ++P LS +E G+ + G + L VG +
Sbjct: 792 PVYTFPLKKLVAFARVDLKPGESKTVTFTIAP-RQLSIWQE-GIWKMLPGKYSLFVGSGQ 849
Query: 770 YPIS 773
+S
Sbjct: 850 EGLS 853
>gi|333380551|ref|ZP_08472242.1| hypothetical protein HMPREF9455_00408 [Dysgonomonas gadei ATCC
BAA-286]
gi|332826546|gb|EGJ99375.1| hypothetical protein HMPREF9455_00408 [Dysgonomonas gadei ATCC
BAA-286]
Length = 854
Score = 271 bits (693), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 161/421 (38%), Positives = 240/421 (57%), Gaps = 36/421 (8%)
Query: 51 PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
P R DL+SRLT++EKIS L ++P IPRL IP Y +E+LHGV G+
Sbjct: 36 PTHDRIMDLLSRLTIEEKISLLRATSPGIPRLQIPKYYHGNESLHGVVRPGR-------- 87
Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGM---------TFWAPNI 161
T FPQ I A+ ++ L ++I AI EAR +N + + TFW+P +
Sbjct: 88 --FTVFPQAIGLASMWNPELHHKIATAISDEARGRWNELEQGKLQTQRFTDLLTFWSPTV 145
Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
N+ RDPRWGR ET GEDP ++G ++VRG+QGD L+ + KHF A
Sbjct: 146 NMARDPRWGRTPETYGEDPYLSGILGTAFVRGLQGDD------PRYLKIVSTPKHFAA-- 197
Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
N + R+ + +++ + L + Y P FE CVK G+++ IM AYN +N +P A+ LL
Sbjct: 198 --NNEEHNRFVCNPQISERQLREYYFPAFEMCVKDGKSASIMSAYNAINDVPCTANPWLL 255
Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGS--FLQKHT 339
+K R WGF+GY+ SDC S++ A Y K+ E A +KAG+D+ CG ++Q
Sbjct: 256 TKVLRHDWGFNGYVVSDCGGPSLLVSAMKYVKTKEAAATLSIKAGLDLECGDDVYMQPLL 315
Query: 340 KAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQA 399
A Q + ++ID A + + RM LGLF+ +P + P+ KI VV S H+ LAL+A
Sbjct: 316 N-AYNQYMVSRADIDTAAYRVLRARMHLGLFD-DPDLNPYNKISPSVVGSAEHKQLALEA 373
Query: 400 AQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQALQNY 458
A+ IVLLKN++ LPL K S+A++G NA +++ G+Y+G P+ ++ LQ +++
Sbjct: 374 ARQSIVLLKNNNRTLPLNPKKVKSIAVVGINAGNSE--FGDYSGIPANAPVSILQGIKDK 431
Query: 459 V 459
V
Sbjct: 432 V 432
Score = 122 bits (305), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 84/288 (29%), Positives = 141/288 (48%), Gaps = 47/288 (16%)
Query: 489 DHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAK 548
+ V+ ++G+++T E+E DR D+ LP Q+E I + + +++VL+ G + I +
Sbjct: 605 EQVIAVLGINKTIEREGQDRYDIHLPADQEEFIREIYKVNPN-IVVVLVAGSSLAINWM- 662
Query: 549 YDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQAT 608
D ++ +I+ A YPGE G A+AEV+FG++NPGGRLP+T+Y ++P D
Sbjct: 663 -DEHVPAIVNAWYPGEQGGTAVAEVLFGEYNPGGRLPVTYY-NSLEEIPSFD------DY 714
Query: 609 SGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYK 668
GRTY++++GK ++PFG GLSY+ ++YK ++ N
Sbjct: 715 DITKGRTYQYFKGKPLYPFGYGLSYTTFAYKNLQINDN---------------------- 752
Query: 669 SVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILN 728
V+ +KN G M G ++VK G PIK+L GFQ L
Sbjct: 753 -----------GNNIKVSFELKNTGRMDGDEVSQVYVKIPSSGIFMPIKELKGFQRSTL- 800
Query: 729 AKEKAEIVFELSPCESLSRAREDG--LMVIEEGTHFLVVGDEEYPISI 774
K+ A E++ + L R +D + +G + ++G I +
Sbjct: 801 -KKGATKNVEINIRKDLLRYWDDATETFITPKGEYEFMIGTSSQDIQL 847
>gi|319787180|ref|YP_004146655.1| glycoside hydrolase [Pseudoxanthomonas suwonensis 11-1]
gi|317465692|gb|ADV27424.1| glycoside hydrolase family 3 domain protein [Pseudoxanthomonas
suwonensis 11-1]
Length = 903
Score = 271 bits (693), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 166/417 (39%), Positives = 237/417 (56%), Gaps = 48/417 (11%)
Query: 59 LVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQ 118
LV++LT +EK+ QL+N+APAIPRLG+P Y WW+E+LHG G T+F +
Sbjct: 54 LVAKLTTEEKLDQLLNTAPAIPRLGVPHYNWWTESLHGAMGTLP----------TTNFAE 103
Query: 119 VILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT--------FWAPNINIFRDPRWG 170
I AA+FD L R+GQ I E R L+ + G T W+PNINIFRDPRWG
Sbjct: 104 PIGLAATFDDGLVKRVGQVIATEVRGLHALARQTGRTGRMGTGLNTWSPNINIFRDPRWG 163
Query: 171 RGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTR 230
RGQET GEDP + + V+YV+GVQG+ + A KH+ + + +TR
Sbjct: 164 RGQETYGEDPFLAARMGVAYVKGVQGEDPRWYDI------IATPKHYAVH---SGPESTR 214
Query: 231 YKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWG 290
+ + V+ DL DTY P F + + +G A +MCAYNRV+G P+CA+ LL R W
Sbjct: 215 HSANVYVSRHDLEDTYLPAFRAAIVEGGAGSVMCAYNRVDGQPACANDLLLKDYLRGAWD 274
Query: 291 FHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-------SFLQKHTKAAV 343
F GY+ SDCDAV+ I YA AV +++G+D C + L + + A+
Sbjct: 275 FKGYVVSDCDAVTDINRHHHYAPDAASAVAAAMRSGVDNECNGATLVDMAGLARPYQEAL 334
Query: 344 KQKKLPESEIDRALHNLFSVRMRLGLFNGN-PTMQPFGKIGADV--VCSPAHQVLALQAA 400
++ + +++DRAL LF+ R R NG+ P ++P A V P H LAL+ A
Sbjct: 335 ERGLISMADVDRALVRLFAARYR----NGDLPGLRPLSTETASPADVGKPEHAALALEVA 390
Query: 401 QDGIVLLKNSHGLLPL-PKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQ 456
+ +VLLKNS G+LPL P++K LA++GP A++ + L GNY+ P +S P+ L+
Sbjct: 391 EKSLVLLKNS-GVLPLRPQAK---LAVVGPLADATRVLRGNYSSP--QSGPPVSVLE 441
Score = 108 bits (270), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 89/292 (30%), Positives = 134/292 (45%), Gaps = 52/292 (17%)
Query: 485 AKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVIL 534
A +D VV ++GL E EE D+ L L QQ L+ A A KP+++
Sbjct: 629 AAQSDVVVAVVGLTSDLEAEEAPVEVPGFKGGDKTSLDLLADQQALL-EAARATGKPLVV 687
Query: 535 VLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI 594
V++ G PV++ +A+ ++ +I+ A YPG++G +A+ V+ G NP GRLP+T+Y
Sbjct: 688 VVMNGSPVNLAWAR--QHAAAIIEAWYPGQSGGLAIGNVLAGHANPAGRLPLTFYRSVDD 745
Query: 595 KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSS 654
P D MR GRTYR++EGK V+PFG GLSY+ + Y L L ++
Sbjct: 746 LPPFDDYDMR--------GRTYRYFEGKPVYPFGHGLSYTSFGY-------GPLKLEPAA 790
Query: 655 STKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGR 714
VT ++N G+ AG L++ +G
Sbjct: 791 GGAH----------------------EGLKVTTELRNTGKRAGDEVAQLYLDFPDV-DGL 827
Query: 715 PIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
P L GFQ V L E+ + F+L P LS +G + G + + VG
Sbjct: 828 PRIALRGFQRVHLAPGERRVLEFQLDP-RDLSAVTPEGQRQVFAGNYRVFVG 878
>gi|423222970|ref|ZP_17209439.1| hypothetical protein HMPREF1062_01625 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392640546|gb|EIY34345.1| hypothetical protein HMPREF1062_01625 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 862
Score = 271 bits (692), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 164/446 (36%), Positives = 243/446 (54%), Gaps = 38/446 (8%)
Query: 41 ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
+T P+ L ++RA+DLV RLTL+EK + + + AIPRLGI + WWSEALHGVA
Sbjct: 19 QTLPYQNPELSPAERAKDLVKRLTLEEKALLMCDDSEAIPRLGIKKFNWWSEALHGVANQ 78
Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG------- 153
G T FP+ + AASF+ L + I A+ E RA +N G
Sbjct: 79 G----------NVTVFPEPVGMAASFNDKLVFEIFNAVSDEMRAKHNERVRNGLEDVRFH 128
Query: 154 -MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASA 212
++ W PN+NIFRDPRWGRGQET GEDP +T + ++ V+G+QG K + L A
Sbjct: 129 SLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGIAVVKGLQGP--ENEKYRKLL---A 183
Query: 213 CCKHFTAYDLDNWKGTTRYKFD-ARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
C KH+ + W +R+ + V+ +DL +TY P F++ V++ +MCAY R++
Sbjct: 184 CAKHYAVHSGPEW---SRHTANLNNVSPRDLWETYLPAFKALVQKADVREVMCAYQRLDD 240
Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLK---AGMD 328
P C + LL + R +WGF + SDC A++ + + KS DAV +K AG D
Sbjct: 241 DPCCGNTRLLQQILRDEWGFKYLVVSDCGAIADFWTSH---KSSSDAVHAAVKGTMAGTD 297
Query: 329 VNCG-SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVV 387
V CG + + AV + + E E+D+ + L R LG + +P++ + KI VV
Sbjct: 298 VECGYGYAYQKLPEAVSRGLITEEEVDKHVLRLMEGRFELGEMD-DPSLVNWTKIPMSVV 356
Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
AH+ L+L ++ + LL+N + +LPL KS +A+IGPNA+ L GNY G +
Sbjct: 357 NCKAHKDLSLNMSRQTMTLLQNKNNVLPLSKSIR-KIAVIGPNADDKPMLWGNYNGTPNQ 415
Query: 448 SITPLQALQNYVENT--VYYPGCDTV 471
+IT L ++ ++ VY GCD V
Sbjct: 416 TITILDGFKSKLKKNQIVYMKGCDLV 441
Score = 89.7 bits (221), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 67/239 (28%), Positives = 110/239 (46%), Gaps = 43/239 (17%)
Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
DR D+ LP Q+ + + +A K+ ++ + C G + + +IL A Y GE G
Sbjct: 629 DRTDIELPAVQRNFLKALKDAGKQ--VVFVNCSGS-SMALLPETESCDAILQAWYGGELG 685
Query: 567 AVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFP 626
A+A+V+FGD+NP G+LP+T+Y D M+ GRTYR Y +FP
Sbjct: 686 GYAVADVLFGDYNPSGKLPVTFYKSTKQLPDYEDYSMK--------GRTYR-YMSDPLFP 736
Query: 627 FGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVT 686
FG GLSY+ ++ + ++ +L ++S +T
Sbjct: 737 FGFGLSYTDFAVGTASCNKTQLRTDES-----------------------------LTLT 767
Query: 687 IGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESL 745
+ V N G+ +G V ++++ +G P+K L + V L A K ++ EL P ES
Sbjct: 768 VPVSNTGKRSGTEVVQVYIRKTDDADG-PLKSLKAYARVELAAGAKQDVKIEL-PSESF 824
>gi|423289663|ref|ZP_17268513.1| hypothetical protein HMPREF1069_03556 [Bacteroides ovatus
CL02T12C04]
gi|423298156|ref|ZP_17276215.1| hypothetical protein HMPREF1070_04880 [Bacteroides ovatus
CL03T12C18]
gi|392663697|gb|EIY57244.1| hypothetical protein HMPREF1070_04880 [Bacteroides ovatus
CL03T12C18]
gi|392667374|gb|EIY60884.1| hypothetical protein HMPREF1069_03556 [Bacteroides ovatus
CL02T12C04]
Length = 850
Score = 271 bits (692), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 158/424 (37%), Positives = 240/424 (56%), Gaps = 34/424 (8%)
Query: 51 PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
P+ +R DL+SRLT++EKIS L ++P IPRLGI Y +EALHGV G+
Sbjct: 33 PVHERVADLLSRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPGR-------- 84
Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ---------AIGMTFWAPNI 161
T FPQ I AA+++ L ++ I EARA +N + +TFW+P +
Sbjct: 85 --FTVFPQAIGLAATWNPVLQQKVATVISDEARARWNELDQGRNQKEQFSDVLTFWSPTV 142
Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
N+ RDPRWGR ET GEDP ++G ++V+G+QG+ L+ + KHF A
Sbjct: 143 NMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQGED------PRYLKIVSTPKHFVA-- 194
Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
N + R+ + +++ + L + Y P FE CVK+G+A+ IM AYN +N +P + LL
Sbjct: 195 --NNEEHNRFICNPQISEKQLREYYFPAFEMCVKKGKAASIMTAYNALNDVPCTLNAWLL 252
Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTK 340
K R+ WGF GY+ SDC S++ +A Y K+ E A +KAG+D+ CG ++
Sbjct: 253 QKVLRQDWGFRGYVVSDCGGPSLLVNAHKYVKTKETAATLSIKAGLDLECGDDVYDEYLL 312
Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
A KQ + +++ID A ++ + RM+LG+F+ P+ +I V+ S HQ +AL AA
Sbjct: 313 NAYKQYMVSDADIDSAACHVLAARMKLGMFDSK-ERNPYARISPSVIGSKDHQQVALDAA 371
Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQALQNYV 459
++ IVLLKN +LPL K S+A++G NA + + G+Y+G P ++ LQ ++N V
Sbjct: 372 RECIVLLKNQKNMLPLNVDKLKSIAVVGINAGTCE--FGDYSGAPVIEPVSVLQGIKNRV 429
Query: 460 ENTV 463
V
Sbjct: 430 GEKV 433
Score = 120 bits (301), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 83/278 (29%), Positives = 133/278 (47%), Gaps = 43/278 (15%)
Query: 489 DHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAK 548
+ VV +MG++++ E+E DR D+ LP Q+E + + + +I+VL+ G + + +
Sbjct: 602 ETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNPN-IIVVLVAGSSLAVNWM- 659
Query: 549 YDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQAT 608
D +I +I+ A YPGE G A+A+V+FGD+NP GRLP+T+Y P D +
Sbjct: 660 -DEHIPAIVNAWYPGEQGGTAVADVLFGDYNPAGRLPLTYYKSLDELPPFDDYDI----- 713
Query: 609 SGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYK 668
GRTY++++G ++PFG GLSYS + Y S K+ ++ D
Sbjct: 714 --TKGRTYKYFKGDVLYPFGYGLSYSSFKY---------------SDLKVKDSTD----- 751
Query: 669 SVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILN 728
K V+ +KN G G ++V+ G PIK+L GF+ V L
Sbjct: 752 -------------KVTVSFRLKNTGRRKGDEVAQVYVRIPETGGIVPIKELKGFRRVPLE 798
Query: 729 AKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
E I EL + ++ GT ++VG
Sbjct: 799 PGESRAIDIELDKEQLRYWDTTKEQFILPAGTFDVMVG 836
>gi|332185848|ref|ZP_08387595.1| glycosyl hydrolase family 3 N terminal domain protein [Sphingomonas
sp. S17]
gi|332014206|gb|EGI56264.1| glycosyl hydrolase family 3 N terminal domain protein [Sphingomonas
sp. S17]
Length = 838
Score = 271 bits (692), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 160/411 (38%), Positives = 226/411 (54%), Gaps = 36/411 (8%)
Query: 62 RLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVIL 121
+LT+++K +QLVN APA P L +PAY+WWSE LHG+A G AT FPQ +
Sbjct: 24 KLTVEQKAAQLVNEAPAEPALDLPAYDWWSEGLHGIARNGP----------ATVFPQAMG 73
Query: 122 TAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTFWAPNINIFRDPRWGRGQE 174
AA++D L ++G + +EARA +N G G+T W+PNINIFRDPRWGRGQE
Sbjct: 74 MAATWDPALIEKVGDVVAVEARAKFNTKPINADRGLFEGLTIWSPNINIFRDPRWGRGQE 133
Query: 175 TPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFD 234
T GEDP +T + +++VRG+QG K+ A KH + R FD
Sbjct: 134 TYGEDPFLTSRIGIAFVRGLQGPDLKHPKV------IATVKHLAVHSGPE---GGRDSFD 184
Query: 235 ARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGY 294
V+ +DL TY P F + V +G+A +MCAYN ++G P CA+ L+++ R WGF G
Sbjct: 185 VMVSPRDLEATYLPAFRATVTEGKALSLMCAYNAIHGTPVCANPMLMTERLRTDWGFKGL 244
Query: 295 ITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEID 354
SDCDAV I+ A L+AG D+NCG+ + A+K+ + E EID
Sbjct: 245 TVSDCDAVGNIWMFHHAQPDAPAAAAAALRAGTDLNCGN-TYRALPEALKRGLITEGEID 303
Query: 355 RALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLL 414
AL +VR L + + P+ +I + +PA++ LAL+ A+ GI+LL N + LL
Sbjct: 304 TALARALAVRRMLTVDS------PWNRIKPSQLGTPANRALALETARKGIILLDNPNDLL 357
Query: 415 PLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYY 465
PL K LA+ G +A+ L GNY G + +TPL ++ V Y
Sbjct: 358 PL---KGKKLAVFGADADDLGVLEGNYHGSAIDPVTPLDGIRAKFGRDVRY 405
Score = 97.4 bits (241), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 91/299 (30%), Positives = 136/299 (45%), Gaps = 54/299 (18%)
Query: 480 KAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAK 529
+A+ +A AD +V ++GL E E L DR ++VLP Q+ L+ + +A
Sbjct: 555 EALKVAGTADVLVAIVGLSPDLEGEALGVSVPGFAGGDRTEVVLPEPQRNLLAAL-QATG 613
Query: 530 KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWY 589
KPV+ V++ G V + D + L A YPG G ALAE++ GD NP GRLP+T Y
Sbjct: 614 KPVVAVIVSGSAVSLG----DIKPAATLAAFYPGAEGGTALAEILSGDVNPSGRLPVTIY 669
Query: 590 PQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLY 649
D M+ RTYR++ GK + FG GLSY+K+ Y +V
Sbjct: 670 HSAADLPAFADYGMKE--------RTYRYFTGKPAWGFGHGLSYTKFDYGQPSV------ 715
Query: 650 LNQSSSTKMVENQDV-VHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPA 708
++S + + DV V +++ + G E + +LV + PVL
Sbjct: 716 ---TASVAVGQPVDVAVQLRNIGQRGGEEV-AQAYLVPPATD-------EKPVLT----- 759
Query: 709 RRGNGRPI--KQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVV 765
PI +QLVGF V L + F + P +S+ DG I GT+ L +
Sbjct: 760 -----DPILQRQLVGFTRVALKPGQTGTTRFTIDP-RMMSQVWRDGTRRILPGTYKLYI 812
>gi|423217451|ref|ZP_17203947.1| hypothetical protein HMPREF1061_00720 [Bacteroides caccae
CL03T12C61]
gi|392628610|gb|EIY22636.1| hypothetical protein HMPREF1061_00720 [Bacteroides caccae
CL03T12C61]
Length = 946
Score = 271 bits (692), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 235/798 (29%), Positives = 364/798 (45%), Gaps = 135/798 (16%)
Query: 49 TLPISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----W-------SEALH 95
T PI R DL+S++TL+EK Q+V + + +P EW W E L+
Sbjct: 57 TAPIDARIEDLLSQMTLEEKTCQMVTLYGYKRVLKDDLPTSEWKNQLWKDGIGAIDEHLN 116
Query: 96 GVAGVGK-------------------------------GI---FFNGTIRG-----ATSF 116
G G GI F N IRG AT+F
Sbjct: 117 GFQQWGLPPSDNEYVWPASKHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVESYKATNF 176
Query: 117 PQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQET 175
P + +++ L +++G G EAR L G T +AP +++ RD RWGR +E
Sbjct: 177 PTQLGLGHTWNRQLIHQVGLITGREARML-------GYTNVYAPILDVGRDQRWGRYEEV 229
Query: 176 PGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDA 235
GE P + + + VRG+Q + Q +A KHF AY + + D
Sbjct: 230 YGESPYLVAELGIEMVRGMQHNH----------QVAATGKHFIAYSNNKGAREGMARVDP 279
Query: 236 RVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYI 295
+++ +++ + PF+ +++ G+M +YN +G P + L+ R + GF GY+
Sbjct: 280 QMSPREVEMLHAYPFKRVIREAGLLGVMSSYNDYDGFPIQSSYYWLTTRLRGEMGFRGYV 339
Query: 296 TSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT-----KAAVKQKKLPE 350
SD DAV +Y G AK ++AV ++AG++V C +F + + VK+ L E
Sbjct: 340 VSDSDAVEYLYTKHGTAKDMKEAVRQSVEAGLNVRC-TFRSPDSYVLPLRELVKEGGLSE 398
Query: 351 SEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD-VVCSPAHQVLALQAAQDGIVLLKN 409
I+ + ++ V+ +GLF+ P GAD V ++ +ALQA+++ IVLLKN
Sbjct: 399 EVINDRVRDILRVKFLVGLFD-TPYQTDLK--GADEEVEKKENEEVALQASRESIVLLKN 455
Query: 410 SHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE---NTVYYP 466
+LPL SK +A+ GPNA+ L +Y + + L+ +Q ++ + +Y
Sbjct: 456 EKNVLPLDPSKIRKIAVCGPNADEHSYALTHYGPLAVEVTSVLKGIQEKMKDKADVLYTK 515
Query: 467 GCDTVACS---------------SASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDL 511
GCD V + IDKAV AK AD ++++G Q E R L
Sbjct: 516 GCDLVDANWPESELIDYPLTDEEQKEIDKAVSQAKQADVAIVVLGGGQRTCGENKSRSSL 575
Query: 512 VLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALA 571
LPGRQ +L+ V A KPV+LVL+ G P+ I +A D+ + +IL A YPG G +A+A
Sbjct: 576 DLPGRQLDLLKAVV-ATGKPVVLVLINGRPLSINWA--DKFVPAILEAWYPGSKGGIAVA 632
Query: 572 EVIFGDHNPGGRLPMTWYPQDYIKVPMT-DMKMRPQATSG-NPGRTYRFYEGK-EVFPFG 628
+++FGD+NPGG+L +T +P+ ++P K Q G NPG ++PFG
Sbjct: 633 DILFGDYNPGGKLTVT-FPKTVGQIPFNFPCKPSSQIDGGKNPGPDGNMSRANGALYPFG 691
Query: 629 CGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIG 688
GLSY+ + Y +S + NQ K VT
Sbjct: 692 YGLSYTTFEYSDLKISPAIITPNQ-----------------------------KAYVTCK 722
Query: 689 VKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRA 748
V N G+ +G + L+V+ K LVGF+ V L E EI F + ++L
Sbjct: 723 VTNTGKRSGDEVIQLYVRDVLSSVTTYEKNLVGFERVHLKPGETKEITFPID-RKALELL 781
Query: 749 REDGLMVIEEGTHFLVVG 766
D V+E G L++G
Sbjct: 782 NADMHWVVEPGDFTLMLG 799
>gi|358342292|dbj|GAA27551.2| probable beta-D-xylosidase 7 [Clonorchis sinensis]
Length = 826
Score = 271 bits (692), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 225/798 (28%), Positives = 349/798 (43%), Gaps = 137/798 (17%)
Query: 44 PFCKTTLPISQRARDLVSRLTLDEKISQLVNS--------APAIPRLGIPAYEWWSEALH 95
PF +LP + R DL++RLT +E I Q+ N AP I RL I AY+W +
Sbjct: 28 PFRNPSLPANFRVDDLLARLTNEELIQQVSNGGAGPQHGPAPGIARLNISAYQWRTNP-- 85
Query: 96 GVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI--- 152
G G+ T FPQ + A+FD + YR+ +A GLE RA +N +A
Sbjct: 86 ---GDGR----------ITPFPQPVNLGATFDVHTVYRVARATGLEMRARWNRAKAKKTY 132
Query: 153 ----GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG---------DTF 199
G+ +AP +N+ R P WGR QET GEDP + GK A ++VRG+ G D
Sbjct: 133 RDGNGIHLFAPVVNLLRHPLWGRNQETFGEDPFMIGKLARTFVRGLGGWKNAEPQSLDEQ 192
Query: 200 NGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRA 259
N L A CKHF + +R F+A VT DL TY P F +C++ G
Sbjct: 193 NLSSQPDVLLVGANCKHFAVHTGPEDFPVSRLSFEANVTDVDLWQTYLPAFRACLEAGAV 252
Query: 260 SGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVS-IIYDAEGYAKSPEDA 318
S +MCAY+ +NG P C + LL++ R++W F G++ +DC A+ +I+ + + E A
Sbjct: 253 S-VMCAYSGINGTPDCINHWLLTELLRQKWKFKGFVVTDCGALQFVIWKHQIFNHYNETA 311
Query: 319 VVDVLKAGMDVNCGSFLQKHTKAAVKQ----KKLPESEIDRALHNLFSVRMRLGLFNGNP 374
+ V +AG+++ + + L ++ LF R+ G F NP
Sbjct: 312 MAAV-RAGVNLENSVVYATEVFSTLPHLLASGSLSRDQLIEMARPLFLTRLMQGEF--NP 368
Query: 375 T-MQPFGKIG-ADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVS------LAL 426
M P+ + + + + H+ +AL IVLL+N LPL + S S +A+
Sbjct: 369 VEMDPYRLLAPEEAILNEDHRRVALATTARSIVLLQNRDRFLPLKNNMSDSGGPLRHIAI 428
Query: 427 IGPNANSAKTLLGNY-AGPSCRSITPLQALQNYVENTVYYPG--CDTVACSSASIDKAVD 483
+GP A S L G+Y P PL + + ++ D CSS + D A+
Sbjct: 429 VGPFATSVTELYGHYRTAPEPEIEVPLSKGLSQLSRRMHASDICTDGGRCSSLN-DDALH 487
Query: 484 IAKGADH---VVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKK---------- 530
G D +VL +G E E +DR ++ LPG+Q EL+ + +
Sbjct: 488 STLGYDDLDLIVLSLGTGSEVEGENVDRQNITLPGKQPELLEETLKLSSGLGNSGLSKRT 547
Query: 531 -PVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHN---------- 579
P+IL++ GP++I+ A + N+ +I W G+PG A+ ++ G
Sbjct: 548 VPIILLVFSAGPINISRAVENENVKAIFWCGFPGPLVGDAMRHLLLGSSGELFGPSKPIS 607
Query: 580 -----------------------PGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTY 616
P RLP TWY +T +M Q P + +
Sbjct: 608 VGFHSFQEAYRWDVTPDDGYWWIPAARLPFTWYESIDQLANITVYEMTNQTYRYLPTQCH 667
Query: 617 RFYEGKEV---FPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPEL 673
E ++ +PFG GLSY+ N S ++ V Y +
Sbjct: 668 MSSEDCKIPVLYPFGYGLSYN---------------FNLSGASGFV-------YSDLIAP 705
Query: 674 GTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARR---GNGR--PIKQLVGFQSVILN 728
+ ++ + + V+N G +A + V ++ K R N R P+ QL GF+ V L+
Sbjct: 706 SSAVSSNQRIVFYVTVQNEGPIACEEVVQVYTKWLNRTENDNSRNGPLIQLAGFERVRLD 765
Query: 729 AKEKAEIVFELSPCESLS 746
E ++ F L P E L+
Sbjct: 766 VGEYKQLKFTLIPSEHLA 783
>gi|393773722|ref|ZP_10362116.1| glycosyl hydrolase family 3 N terminal domain protein
[Novosphingobium sp. Rr 2-17]
gi|392720897|gb|EIZ78368.1| glycosyl hydrolase family 3 N terminal domain protein
[Novosphingobium sp. Rr 2-17]
Length = 846
Score = 271 bits (692), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 183/489 (37%), Positives = 259/489 (52%), Gaps = 55/489 (11%)
Query: 60 VSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQV 119
+++ L EK+ QL ++APA +G+PAY+WW+E LHG+A G+ AT FPQ
Sbjct: 17 IAKADLAEKVGQLQSTAPAAKAMGLPAYDWWNEGLHGLARNGE----------ATVFPQA 66
Query: 120 ILTAASFDSYLWYRIGQAIGLEARALYN-----AGQAI--GMTFWAPNINIFRDPRWGRG 172
I AA+FD L ++G I EARA +N A + I G+T W+PNINIFRDPRWGRG
Sbjct: 67 IGLAATFDPVLMEKVGTVISTEARAKFNTRPVNADRRIYEGLTIWSPNINIFRDPRWGRG 126
Query: 173 QETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYK 232
QET GEDP ++G AV +++G+QG + A KHF + R
Sbjct: 127 QETYGEDPFLSGTLAVGFIKGLQGTD------PAHPRVIATPKHFAVHSGPE---AGRDS 177
Query: 233 FDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFH 292
FD V+ QD TY P F V +GR +MCAYN +G P CAD L+++ R+ WGF
Sbjct: 178 FDVDVSPQDREATYLPAFRKAVTEGRPLSVMCAYNSTHGTPVCADDALINQRLRKDWGFK 237
Query: 293 GYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESE 352
G++ SDCDA++ ++ YA +A +KAG D +CG+ + AV++ + E+
Sbjct: 238 GFVVSDCDAIANVWMFHHYAADAAEASAASIKAGTDFDCGNTYAALPQ-AVERGLVDEAT 296
Query: 353 IDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHG 412
IDRAL R LG+ G P+G+I V + A + LAL+AA+ IVLL+N +
Sbjct: 297 IDRALARSLQARHSLGIAFG--AANPWGRIKPSQVHTLASRALALEAARKSIVLLQNDNA 354
Query: 413 LLPL-PKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTV 471
LPL P +K LA++G NA+ + +Y G + ITPLQ ++ G V
Sbjct: 355 RLPLKPGTK---LAVVGANADDLSVIEASYHGTAADPITPLQGIRRRF-------GAANV 404
Query: 472 ACSSASIDKAVDIAKGADHVVLMMGLDQTQEK-----EELDRVDLVLPGRQQELI----T 522
A + S+ +A+GA +V L +Q K E LD V+ RQ + T
Sbjct: 405 AYAQGSV-----LAEGAAVIVPETAL-ASQGKPGLKGEYLDATGKVVLTRQDRRVDFNYT 458
Query: 523 RVAEAAKKP 531
R A A P
Sbjct: 459 RYAPAGIDP 467
Score = 92.0 bits (227), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 89/305 (29%), Positives = 128/305 (41%), Gaps = 65/305 (21%)
Query: 480 KAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAK 529
+A+ + AD +V ++GL E E L DR D+ LP Q +L+ + A
Sbjct: 564 EAIAATRNADAIVAVLGLSPDLEGEALQVQVPGFVGGDRSDIALPRPQAQLLEAL-HATG 622
Query: 530 KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWY 589
KPVI+VL G + I D ++L A YPGE G ALAE + G +NP GRLP+T+Y
Sbjct: 623 KPVIVVLTSGSAIAIDPKLAD----AVLAAWYPGEEGGTALAETLGGINNPSGRLPVTFY 678
Query: 590 PQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKF-KAVSQNKL 648
D M+ R+YR++ G ++ FG GLSY+ + Y KAV+
Sbjct: 679 RSTSDLPAYVDYSMKE--------RSYRYFTGTPLWGFGHGLSYTNFGYDAPKAVATG-- 728
Query: 649 YLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPA 708
+G VT+ V N G AG+ V +V P
Sbjct: 729 ------------------------IGDPVS------VTVTVHNTGVRAGEDVVQAYVVPP 758
Query: 709 RRGNGRPI-------KQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTH 761
PI +QL FQ + L + I L P E +S G+ + G +
Sbjct: 759 TI-EPEPIMTQAVLQRQLAAFQRIALAPGQARAITLNLDPRE-ISVVDRRGIRRLVPGEY 816
Query: 762 FLVVG 766
+ VG
Sbjct: 817 RVWVG 821
>gi|296081549|emb|CBI20072.3| unnamed protein product [Vitis vinifera]
Length = 333
Score = 271 bits (692), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 154/343 (44%), Positives = 209/343 (60%), Gaps = 17/343 (4%)
Query: 437 LLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG 496
++GNY G + TPLQ L V T Y PGC VAC +A ID+A IA AD VL++G
Sbjct: 1 MIGNYEGTPGKYTTPLQGLTALVA-TTYLPGCSNVACGTAQIDEAKKIAAAADATVLIVG 59
Query: 497 LDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSI 556
+DQ+ E E DRV++ LPG+Q LIT VA+A+K VILV++ GG DI+FAK D I SI
Sbjct: 60 IDQSIEAEGRDRVNIQLPGQQPLLITEVAKASKGNVILVVMSGGGFDISFAKNDDKITSI 119
Query: 557 LWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPGRT 615
LW GYPGEAG A+A+VIFG +NP GRLP TWYPQ Y+ KVPMT+M MRP SG PGRT
Sbjct: 120 LWVGYPGEAGGAAIADVIFGFYNPSGRLPTTWYPQSYVDKVPMTNMNMRPDPASGYPGRT 179
Query: 616 YRFYEGKEVFPFGCGLSYSKYSYKF----KAVSQNKLYLNQSSSTKMVENQDVVHYKSVP 671
YRFY G+ ++ FG GLSY+++++ K+VS + S + ++ D V
Sbjct: 180 YRFYTGETIYTFGDGLSYTQFNHHLIQAPKSVSI-PIEEGHSCHSSKCKSVDAVQ----- 233
Query: 672 ELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKE 731
E C+ F + + V N G ++G H V LF P N P K L+GF+ V + AK
Sbjct: 234 ----ESCQNLAFDIHLRVNNAGNISGSHTVFLFSSPPSVHN-SPQKHLLGFEKVFVTAKA 288
Query: 732 KAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
+A + F++ C+ LS E G + G H L VG+ ++ +++
Sbjct: 289 EALVRFKVDVCKDLSIVDELGTRKVALGLHVLHVGNLKHSLNV 331
>gi|29347188|ref|NP_810691.1| beta-glucosidase [Bacteroides thetaiotaomicron VPI-5482]
gi|29339087|gb|AAO76885.1| beta-glucosidase (gentiobiase) [Bacteroides thetaiotaomicron
VPI-5482]
Length = 853
Score = 270 bits (691), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 161/424 (37%), Positives = 237/424 (55%), Gaps = 34/424 (8%)
Query: 51 PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
P+ +R DL+SRLT++EKIS L ++P IPRLGI Y +EALHGV G+
Sbjct: 36 PVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPGR-------- 87
Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ---------AIGMTFWAPNI 161
T FPQ I AA+++ L R+ I EARA +N + +TFW+P +
Sbjct: 88 --FTVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQFSDVLTFWSPTV 145
Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
N+ RDPRWGR ET GEDP ++G ++V G+QGD + L+ + KHF A
Sbjct: 146 NMARDPRWGRTPETYGEDPFLSGIMGTAFVNGLQGDDPH------YLKIVSTPKHFAA-- 197
Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
N + R+ + +++ + L + Y P FE CVK+G+A+ IM AYN +N +P + LL
Sbjct: 198 --NNEEHNRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMSAYNALNDVPCTLNPWLL 255
Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTK 340
K R+ WGF GY+ SDC +++ +A Y K+ E A +KAG+D+ CG
Sbjct: 256 QKVLRQDWGFQGYVVSDCGGPALLVNAHKYVKTKEAAATLSIKAGLDLECGDDVYDGPLL 315
Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
A KQ + +++ID A +++ + RM+LGLF+ P+ KI V+ S HQ +AL AA
Sbjct: 316 NAYKQYMVSDADIDSAAYHVLTARMKLGLFDSG-ERNPYTKISPSVIGSKEHQQIALDAA 374
Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQALQNYV 459
+ +VLLKN LPL K S+A++G NA K G+Y+G P ++ LQ ++N V
Sbjct: 375 RQCVVLLKNQKNRLPLNADKLKSIAVVGINA--GKCEFGDYSGAPVVEPVSILQGIRNRV 432
Query: 460 ENTV 463
+ V
Sbjct: 433 GDRV 436
Score = 121 bits (303), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 84/278 (30%), Positives = 137/278 (49%), Gaps = 43/278 (15%)
Query: 489 DHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAK 548
+ VV +MG++++ E+E DR D+ LP Q+E + + + +I+VL+ G + I +
Sbjct: 605 ETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNPN-IIVVLVAGSSLAINWM- 662
Query: 549 YDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQAT 608
D +I +I+ A YPGE G A+AEV+FGD+NP GRLP+T+Y P D +
Sbjct: 663 -DEHIPAIVNAWYPGEQGGTAVAEVLFGDYNPAGRLPLTYYKSLDELPPFDDYDI----- 716
Query: 609 SGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYK 668
GRTY++++G ++PFG GLSYS ++Y D+
Sbjct: 717 --TKGRTYKYFKGDVLYPFGYGLSYSSFTY-----------------------SDL---- 747
Query: 669 SVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILN 728
V + G E V+ +KN G+ G ++V+ G P+K+L GF+ V L
Sbjct: 748 QVKDGGGEVT------VSFRLKNTGKRNGDEVAQVYVRIPETGGIVPLKELKGFRRVPLK 801
Query: 729 AKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
+ E + +L + E G V+ +G ++VG
Sbjct: 802 SGESRRVEIKLDKEQLRYWDVEKGQFVVPKGAFDVMVG 839
>gi|365122193|ref|ZP_09339098.1| hypothetical protein HMPREF1033_02444 [Tannerella sp.
6_1_58FAA_CT1]
gi|363642907|gb|EHL82241.1| hypothetical protein HMPREF1033_02444 [Tannerella sp.
6_1_58FAA_CT1]
Length = 853
Score = 270 bits (691), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 162/436 (37%), Positives = 245/436 (56%), Gaps = 37/436 (8%)
Query: 51 PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
P +R DL+SRLT++EKIS L ++P IPRLGI Y +EALHGV V G F
Sbjct: 36 PQHERIMDLLSRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGV--VRPGNF----- 88
Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ---------AIGMTFWAPNI 161
T FPQ I A+ ++ L Y I AI EAR +N + +TFW+P +
Sbjct: 89 ---TVFPQAIGLASMWNPELLYEISTAISDEARGRWNELNRGKDQKGFFSDLLTFWSPTV 145
Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
N+ RDPRWGR ET GEDP ++GK V++V+G+QG+ L+ + KHF A
Sbjct: 146 NMARDPRWGRTPETYGEDPFLSGKLGVAFVKGLQGND------PRYLKIVSTPKHFAA-- 197
Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
N + R++ + ++ ++L + Y P FESC+K+G+A IM AYN +N +P + LL
Sbjct: 198 --NNEEHNRFECNPHISERNLREYYLPAFESCIKEGKAQSIMSAYNAINDVPCTLNPWLL 255
Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQ-KHTK 340
++ R++WGF+GY+ SDC + Y K+PE A +KAG+D+ CG + +
Sbjct: 256 TQVLRKEWGFNGYVVSDCGGPGFLVTHHKYVKTPEAAATLSIKAGLDLECGDNVYIEPLM 315
Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
A KQ + +++ID A + + RM LGLF+ +P P+ I +V H+ LAL+AA
Sbjct: 316 NAYKQCMVTDADIDTAAYRILRARMMLGLFD-DPEKNPYNAISPSIVGCEKHRQLALEAA 374
Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQALQNYV 459
+ +VLLKN LPL K S+A++G NA + + G+Y+G P ++ L+ ++N V
Sbjct: 375 RQSLVLLKNEKNFLPLNPKKVKSIAVVGINAGNCE--FGDYSGTPVNAPVSVLEGIKNRV 432
Query: 460 EN---TVYYPGCDTVA 472
+ VY P +++
Sbjct: 433 GDDVKIVYAPWSSSIS 448
Score = 125 bits (313), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 87/278 (31%), Positives = 134/278 (48%), Gaps = 43/278 (15%)
Query: 489 DHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAK 548
D V ++G++++ E+E DR + LP QQ L + A A ++VL+ G + I +
Sbjct: 605 DVTVAVLGINKSIEREGQDRYTIELPADQQ-LFIKEAYKANPNTVVVLVAGSSLAINWI- 662
Query: 549 YDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQAT 608
D NI +IL A YPGE G A+AE +FGD+NPGGRLP+T+Y + ++P D
Sbjct: 663 -DENIPAILNAWYPGEQGGTAVAEALFGDYNPGGRLPLTYY-RSLDELPAFD------DY 714
Query: 609 SGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYK 668
GRTY ++E K ++PFG GLSY+++ YK + K + D V+ K
Sbjct: 715 DIQKGRTYMYFENKPLYPFGYGLSYTRFDYK---------------NLKSEVSDDAVNLK 759
Query: 669 SVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILN 728
VKN G+ AG ++V+ G P+KQL GF+ V +
Sbjct: 760 ------------------FTVKNTGKYAGDEVAQVYVRFPESGIKVPLKQLKGFERVHIG 801
Query: 729 AKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
+ A++ + E +DG G + +VG
Sbjct: 802 KGKSAQVSVSIPKKELRLWDEKDGKFYTPSGNYIFMVG 839
>gi|380694149|ref|ZP_09859008.1| glycoside hydrolase 3 [Bacteroides faecis MAJ27]
Length = 946
Score = 270 bits (691), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 244/855 (28%), Positives = 381/855 (44%), Gaps = 155/855 (18%)
Query: 8 LVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFP--------------FCKTTLPIS 53
L+F L L ++LL+ +T+ P P TE + + T I
Sbjct: 4 LLFTALFLGSSTLLSAQKATKTPAVYKPVR--TEMYKKGWIDLNKNGIKDVYEDPTATID 61
Query: 54 QRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----W-------SEALHGVAGV 100
R DL+S++TL+EK Q+V + + +P EW W E L+G
Sbjct: 62 ARIEDLLSQMTLEEKTCQMVTLYGYKRVLKDDLPTPEWKNQLWKDGIGAIDEHLNGFQQW 121
Query: 101 GK-------------------------------GI---FFNGTIRG-----ATSFPQVIL 121
G GI F N IRG AT+FP +
Sbjct: 122 GLPPSDNENIWPASRHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVESYKATNFPTQLG 181
Query: 122 TAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGEDP 180
+++ L +++G G EAR L G T +AP +++ RD RWGR +E GE P
Sbjct: 182 LGHTWNRRLIHQVGLITGREARML-------GYTNVYAPILDVGRDQRWGRYEEVYGESP 234
Query: 181 LVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQ 240
+ + + VRG+Q + Q +A KHF AY + + D +++ +
Sbjct: 235 YLVAELGIEMVRGMQHNH----------QIAATGKHFIAYSNNKGAREGMARVDPQMSPR 284
Query: 241 DLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCD 300
++ T+ PF+ +++ G+M +YN +G P + L+ R + GF GY+ SD D
Sbjct: 285 EVEMTHVYPFKRVIREAGLLGVMSSYNDYDGFPIQSSYYWLTTRLRGEMGFRGYVVSDSD 344
Query: 301 AVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT-----KAAVKQKKLPESEIDR 355
AV +Y G AK ++AV ++AG++V C +F + + VK+ L E I+
Sbjct: 345 AVEYLYTKHGTAKDMKEAVRQSVEAGLNVRC-TFRSPDSYVLPLRELVKEGGLSEEVIND 403
Query: 356 ALHNLFSVRMRLGLFNGNPTMQPFG---KIGADVVCSPAHQVLALQAAQDGIVLLKNSHG 412
+ ++ V+ +GLF+ P+ K + V A++ +ALQA+++ IVLLKN
Sbjct: 404 RVRDILRVKFLVGLFD-----HPYQIDLKGADEEVEKAANEEIALQASRESIVLLKNDKN 458
Query: 413 LLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE---NTVYYPGCD 469
+LPL S +A+ GPNA+ L +Y + + L+ +Q ++ +Y GCD
Sbjct: 459 ILPLDASGIQKIAVCGPNADEHSYALTHYGPLAVEVTSVLKGIQEKMKGKAEVLYTKGCD 518
Query: 470 TVACS---------------SASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLP 514
V + I+KAVD K AD V+++G Q E R L LP
Sbjct: 519 LVDANWPESELIDYPLTDEEQKEIEKAVDQTKQADVAVVVLGGGQRTCGENKSRSSLDLP 578
Query: 515 GRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVI 574
GRQ +L+ VA A KPV+LVL+ G P+ I +A D+ + +I+ A YPG G A+A+V+
Sbjct: 579 GRQLDLLKAVA-ATGKPVVLVLINGRPLSINWA--DKFVPAIVEAWYPGSKGGKAVADVL 635
Query: 575 FGDHNPGGRLPMTWYPQDYIKVPMT-DMKMRPQATSG-NPGRTYRFYEGK-EVFPFGCGL 631
FG++NPGG+L +T +P+ ++P K Q G NPG ++PFG GL
Sbjct: 636 FGEYNPGGKLTVT-FPKTVGQIPFNFPCKPSSQIDGGKNPGMEGNMSRANGALYPFGYGL 694
Query: 632 SYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKN 691
SY+ + Y +S + NQ + VT V N
Sbjct: 695 SYTTFEYSDLKISPAIITPNQQT-----------------------------FVTCKVTN 725
Query: 692 HGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRARED 751
G+ AG V L+V+ K L GF+ V L E E+ F + ++L D
Sbjct: 726 TGKRAGDEVVQLYVRDVLSSVTTYEKNLAGFERVHLQPGETKEVTFPID-RKALELLNAD 784
Query: 752 GLMVIEEGTHFLVVG 766
V+E G L+VG
Sbjct: 785 MHWVVEPGDFTLMVG 799
>gi|427383551|ref|ZP_18880271.1| hypothetical protein HMPREF9447_01304 [Bacteroides oleiciplenus YIT
12058]
gi|425728735|gb|EKU91590.1| hypothetical protein HMPREF9447_01304 [Bacteroides oleiciplenus YIT
12058]
Length = 939
Score = 270 bits (691), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 233/797 (29%), Positives = 364/797 (45%), Gaps = 139/797 (17%)
Query: 52 ISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----W-------SEALHGVA 98
+ R DL+S++TLDEK Q+V + + +P EW W E L+G
Sbjct: 57 LDARIEDLLSQMTLDEKTCQMVTLYGYKRVLKDDLPTPEWKQMLWKDGIGAIDEHLNGFQ 116
Query: 99 GVGK-------------------------------GI---FFNGTIRG-----ATSFPQV 119
G GI F N IRG AT+FP
Sbjct: 117 QWGLPPSDNPNVWPASRHAWALNEVQRFFIEETRLGIPVDFTNEGIRGVESYRATNFPTQ 176
Query: 120 ILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGE 178
+ +++ L +++G G EAR L G T +AP +++ RD RWGR +E GE
Sbjct: 177 LGLGHTWNRKLIHQVGLITGREARML-------GYTNVYAPILDVGRDQRWGRYEEVYGE 229
Query: 179 DPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVT 238
P + + + VRG+Q + Q +A KHF AY + + D +++
Sbjct: 230 SPYLVAELGIEMVRGMQHNH----------QVAATGKHFVAYSNNKGAREGMARVDPQMS 279
Query: 239 MQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSD 298
+++ + PF+ +K+ G+M +YN +GIP L+K R + GF GY+ SD
Sbjct: 280 PREVEMIHVYPFKRVIKEAGMLGVMSSYNDYDGIPIQGSYYWLTKRLRGEMGFRGYVVSD 339
Query: 299 CDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT-----KAAVKQKKLPESEI 353
DAV +Y AK ++AV ++AG++V C +F + + VK+ L E I
Sbjct: 340 SDAVEYLYTKHSTAKDMKEAVRQSVEAGLNVRC-TFRSPDSYVLPLRELVKEGGLSEDII 398
Query: 354 DRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA-HQVLALQAAQDGIVLLKNSHG 412
+ + ++ V+ +GLF+ P GAD A ++ +ALQA+++ ++LLKN +
Sbjct: 399 NDRVRDILRVKFLIGLFDA-PYQTDLA--GADKEVEKAENEAVALQASRESLILLKNENN 455
Query: 413 LLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE---NTVYYPGCD 469
+LPL + ++A+ GPNAN L +Y + IT L+ ++ E +Y GCD
Sbjct: 456 VLPLDINNIKTIAVCGPNANEEGYALTHYGPLAVEVITVLEGIRQKAEGKAEVLYAKGCD 515
Query: 470 TVACS---------------SASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLP 514
V + A I+KAV+ A+ AD V+++G Q E R L LP
Sbjct: 516 LVDANWPESELIEYPMTNEEQAEINKAVENARKADVAVVVLGGGQRTCGENKSRSSLDLP 575
Query: 515 GRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVI 574
GRQ +L+ V +A KPV+LVL+ G P+ I +A D+ + +IL YPG G A+A+V+
Sbjct: 576 GRQLKLLQAV-QATGKPVVLVLINGRPLSINWA--DKFVPAILETWYPGSKGGTAVADVL 632
Query: 575 FGDHNPGGRLPMTWYPQDYIKVPMT-DMKMRPQATSG-NPGRTYRFYE-GKEVFPFGCGL 631
FGD+NPGG+L +T +P+ ++P K Q G NPG ++PFG GL
Sbjct: 633 FGDYNPGGKLTVT-FPKSVGQIPFNFPCKPSSQIDGGKNPGPDGNMSRVNGSLYPFGYGL 691
Query: 632 SYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKN 691
SY+ + Y +S + NQ ++ + C+ V N
Sbjct: 692 SYTTFEYSNIEISPKMMTANQKATVR--------------------CK---------VTN 722
Query: 692 HGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELS--PCESLSRAR 749
G+ AG V L+++ K L GF+ V L E E+ F L E L +
Sbjct: 723 TGKRAGDEVVQLYIRDMLSSVTTYEKNLAGFERVHLQPGETKEVTFILDRKHLELLDKHM 782
Query: 750 EDGLMVIEEGTHFLVVG 766
E V+E G ++VG
Sbjct: 783 E---WVVEPGDFSIMVG 796
>gi|224537265|ref|ZP_03677804.1| hypothetical protein BACCELL_02142 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521119|gb|EEF90224.1| hypothetical protein BACCELL_02142 [Bacteroides cellulosilyticus
DSM 14838]
Length = 885
Score = 270 bits (691), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 164/446 (36%), Positives = 243/446 (54%), Gaps = 38/446 (8%)
Query: 41 ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
+T P+ L ++RA+DLV RLTL+EK + + + AIPRLGI + WWSEALHGVA
Sbjct: 19 QTLPYQNPELSPAERAKDLVKRLTLEEKALLMCDDSEAIPRLGIKKFNWWSEALHGVANQ 78
Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG------- 153
G T FP+ + AASF+ L + I A+ E RA +N G
Sbjct: 79 GN----------VTVFPEPVGMAASFNDKLVFDIFNAVSDEMRAKHNERVRNGLEDVRFH 128
Query: 154 -MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASA 212
++ W PN+NIFRDPRWGRGQET GEDP +T + ++ V+G+QG K + L A
Sbjct: 129 SLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGIAVVKGLQGP--ENEKYRKLL---A 183
Query: 213 CCKHFTAYDLDNWKGTTRYKFD-ARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
C KH+ + W +R+ + V+ +DL +TY P F++ V++ +MCAY R++
Sbjct: 184 CAKHYAVHSGPEW---SRHTANLNNVSPRDLWETYLPAFKALVQKADVREVMCAYQRLDD 240
Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLK---AGMD 328
P C + LL + R +WGF + SDC A++ + + KS DAV +K AG D
Sbjct: 241 DPCCGNTRLLQQILRDEWGFKYLVVSDCGAIADFWTSH---KSSSDAVHAAVKGTMAGTD 297
Query: 329 VNCG-SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVV 387
V CG + + AV + + E E+D+ + L R LG + +P++ + KI VV
Sbjct: 298 VECGYGYAYQKLPEAVSKGLITEEEVDKHVLRLMEGRFELGEMD-DPSLVNWTKIPMSVV 356
Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
AH+ L+L ++ + LL+N + +LPL KS +A+IGPNA+ L GNY G +
Sbjct: 357 NCKAHKDLSLNMSRQTMTLLQNKNNVLPLSKSIR-KIAVIGPNADDKPMLWGNYNGTPNQ 415
Query: 448 SITPLQALQNYVENT--VYYPGCDTV 471
+IT L ++ ++ VY GCD V
Sbjct: 416 TITILDGFKSKLKKNQIVYMKGCDLV 441
Score = 99.4 bits (246), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 83/301 (27%), Positives = 135/301 (44%), Gaps = 57/301 (18%)
Query: 455 LQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL-------- 506
++N+ N + G + S SI K KG D VV + G+ E EE+
Sbjct: 571 VENWAANLKFNLGKEFPINYSESISKL----KGIDVVVFVGGISPQLEGEEMPVNIPGFK 626
Query: 507 --DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGE 564
DR D+ LP Q+ + + +A K+ ++ + C G + + +IL A Y GE
Sbjct: 627 GGDRTDIELPAVQRNFLKALKDAGKQ--VVFVNCSGS-SMALLPETESCDAILQAWYGGE 683
Query: 565 AGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEV 624
G A+A+V+FGD+NP G+LP+T+Y D M+ GRTYR Y +
Sbjct: 684 LGGYAVADVLFGDYNPSGKLPVTFYKSTKQLPDYEDYSMK--------GRTYR-YMSDPL 734
Query: 625 FPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFL 684
FPFG GLSY+ ++ + ++ +L+ ++S
Sbjct: 735 FPFGFGLSYTDFAVGTASCNKTQLHTDES-----------------------------LT 765
Query: 685 VTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCES 744
+T+ V N G+ +G V ++++ +G P+K L + V L A K ++ EL P ES
Sbjct: 766 LTVPVSNTGKRSGTEVVQVYIRKTDDADG-PLKSLKAYARVELAAGAKQDVKIEL-PSES 823
Query: 745 L 745
Sbjct: 824 F 824
>gi|298387490|ref|ZP_06997042.1| beta-glucosidase [Bacteroides sp. 1_1_14]
gi|298259697|gb|EFI02569.1| beta-glucosidase [Bacteroides sp. 1_1_14]
Length = 853
Score = 270 bits (690), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 162/424 (38%), Positives = 236/424 (55%), Gaps = 34/424 (8%)
Query: 51 PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
P+ +R DL+SRLT++EKIS L ++P IPRLGI Y +EALHGV G+
Sbjct: 36 PVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPGR-------- 87
Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ---------AIGMTFWAPNI 161
T FPQ I AA+++ L R+ I EARA +N + +TFW+P +
Sbjct: 88 --FTVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQFSDVLTFWSPTV 145
Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
N+ RDPRWGR ET GEDP ++G ++V G+QGD + L+ + KHF A
Sbjct: 146 NMARDPRWGRTPETYGEDPFLSGIMGTAFVNGLQGDDPH------YLKIVSTPKHFAA-- 197
Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
N + R+ + +++ + L + Y P FE CVK+G+A+ IM AYN +N +P + LL
Sbjct: 198 --NNEEHNRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMSAYNALNDVPCTLNSWLL 255
Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTK 340
K R+ WGF GY+ SDC +++ +A Y K+ E A +KAG+D+ CG
Sbjct: 256 QKVLRQDWGFQGYVVSDCGGPALLVNAHKYVKTKEAAATLSIKAGLDLECGDDVYDGPLL 315
Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
A KQ + +++ID A ++ + RM+LGLF+ P+ KI V+ S HQ +AL AA
Sbjct: 316 NAYKQYMVSDADIDSAACHVLTARMKLGLFDSG-ERNPYTKISPSVIGSKEHQQIALDAA 374
Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQALQNYV 459
+ IVLLKN LPL K S+A++G NA K G+Y+G P ++ LQ ++N V
Sbjct: 375 RQCIVLLKNQKNRLPLNADKLKSIAVVGINA--GKCEFGDYSGAPVVEPVSILQGIRNRV 432
Query: 460 ENTV 463
+ V
Sbjct: 433 GDRV 436
Score = 121 bits (303), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 81/278 (29%), Positives = 137/278 (49%), Gaps = 43/278 (15%)
Query: 489 DHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAK 548
+ VV +MG++++ E+E DR D+ LP Q+E + + + +I+VL+ G + + +
Sbjct: 605 ETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNPN-IIVVLVAGSSLAVNWM- 662
Query: 549 YDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQAT 608
D ++ +I+ A YPGE G A+AEV+FGD+NP GRLP+T+Y P D +
Sbjct: 663 -DEHVPAIVNAWYPGEQGGTAVAEVLFGDYNPAGRLPLTYYKSLDELPPFDDYDI----- 716
Query: 609 SGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYK 668
GRTY++++G ++PFG GLSYS ++Y S ++ + D V
Sbjct: 717 --TKGRTYKYFKGDVLYPFGYGLSYSSFTY---------------SDLQVKDGGDEV--- 756
Query: 669 SVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILN 728
V+ +KN G+ G ++V+ G P+K+L GF+ V L
Sbjct: 757 ---------------TVSFRLKNTGKRNGDEVAQVYVRIPETGGIVPLKELKGFRRVPLK 801
Query: 729 AKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
+ E + +L + E G V+ +G ++VG
Sbjct: 802 SGESRRVEIKLDKEQLRYWDVEKGQFVVPKGAFDVMVG 839
>gi|427384502|ref|ZP_18881007.1| hypothetical protein HMPREF9447_02040 [Bacteroides oleiciplenus YIT
12058]
gi|425727763|gb|EKU90622.1| hypothetical protein HMPREF9447_02040 [Bacteroides oleiciplenus YIT
12058]
Length = 862
Score = 270 bits (689), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 163/437 (37%), Positives = 233/437 (53%), Gaps = 34/437 (7%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
+ T+L +RA LV LTL+EK +++++ ++ RLGI Y WW+EALHGVA G
Sbjct: 22 YKNTSLSPEERAELLVKELTLEEKARLMMDASQSVERLGIKPYNWWNEALHGVARAGL-- 79
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA---LYNAGQA----IGMTFW 157
AT FPQ I AASFD + Y + A+ E RA Y + + G+T W
Sbjct: 80 --------ATVFPQPIGMAASFDPEMVYEVFNAVSDEVRAKNTYYTSRDSRERYQGLTMW 131
Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
P +NI+RDPRWGRG ET GEDP +T + V V+G+QG GK + AC KHF
Sbjct: 132 TPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGPA--DGKYD---KLHACAKHF 186
Query: 218 TAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
+ W R+ F+A + +DL +TY PPFE+ VK+G +MCAYNR G P C
Sbjct: 187 AVHSGPEW---NRHSFNAENINPRDLYETYLPPFEALVKEGEVKEVMCAYNRFEGEPCCG 243
Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE--DAVVDVLKAGMDVNCGSF 334
LL + R +WGF G + +DC A++ Y+ G+ P A + +G D++CGS
Sbjct: 244 SDRLLMQILRGEWGFDGIVVADCGAIADFYNDRGHHTHPNAESASAAAVVSGTDLDCGSS 303
Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
K +VK+ + E +D ++ L R LG + P + KI VV S AH
Sbjct: 304 Y-KALVESVKKGLIAEETVDVSVKRLLKARFELGEMD-EPEKVSWTKIPFSVVASAAHDS 361
Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
LAL+ A+ + LL N LPL K +++A++GPNAN + GNY G ++T L
Sbjct: 362 LALEIARKSMTLLMNKDNFLPL-KRGGLTVAVMGPNANDSVMQWGNYNGMPPHTVTILDG 420
Query: 455 LQNYV---ENTVYYPGC 468
++N + + +Y GC
Sbjct: 421 VRNILGAEDKLIYEQGC 437
Score = 118 bits (296), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 91/299 (30%), Positives = 137/299 (45%), Gaps = 51/299 (17%)
Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
I K+V+ K AD V+ G+ + E EE+ DR D+ LP Q+ELI + A
Sbjct: 589 IRKSVERVKDADVVIFASGISPSLEGEEMGVNLPGFKKGDRTDIELPAVQRELIDALHRA 648
Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMT 587
KK +ILV G P I + ++L A YPG+ G A+AEV+FG +NP GRLP+T
Sbjct: 649 GKK-IILVNCSGSP--IGLEPETKKCEALLQAWYPGQQGGTAVAEVLFGTYNPAGRLPVT 705
Query: 588 WYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNK 647
+Y Y D M GRTYR+ + +FPFG GLSY+ ++Y VS NK
Sbjct: 706 FYRNIYQLPDFEDYNM--------TGRTYRYMQDIPLFPFGYGLSYTTFNY--GKVSLNK 755
Query: 648 LYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKP 707
+ + K+ T+ V N GE G+ V +++K
Sbjct: 756 GEITDGQTVKL---------------------------TVSVTNTGERGGEEVVQVYLKK 788
Query: 708 ARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
G P+K L F+ + + + + + FEL E + + + G + ++VG
Sbjct: 789 QGDTKG-PVKTLRAFKRISIPSGQTVNVEFELKDKELEWWDEQSNTVRVCPGNYDIMVG 846
>gi|254445290|ref|ZP_05058766.1| Glycosyl hydrolase family 3 C terminal domain protein
[Verrucomicrobiae bacterium DG1235]
gi|198259598|gb|EDY83906.1| Glycosyl hydrolase family 3 C terminal domain protein
[Verrucomicrobiae bacterium DG1235]
Length = 730
Score = 270 bits (689), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 216/714 (30%), Positives = 337/714 (47%), Gaps = 92/714 (12%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
+PF LP +R DL++ +TL+EK+ L+ P IPRL + Y SE HGVA G
Sbjct: 27 YPFQDPDLPNEERIDDLITCMTLEEKV-DLMGFVPGIPRLDV-KYTRISEGYHGVAQGGP 84
Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--QAIGMTFWAPN 160
+ T FPQ AA++D L R+ E R LY + Q G+ APN
Sbjct: 85 SNWGKRNPTPTTQFPQAYGLAATWDPALISRVSANQATEVRYLYQSPKYQRSGLVVMAPN 144
Query: 161 INIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAY 220
++ RDPRWGR +E GEDP +TG A ++ G+ GD L+A++ KHF A
Sbjct: 145 ADLARDPRWGRTEEVYGEDPFLTGTLAAAFASGLAGDH------PRYLKATSLLKHFLAN 198
Query: 221 DLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNL 280
++ + + FD R+ + Y PFE ++ G A +M AYN +NG P+ +
Sbjct: 199 SNEDDRFFSSSDFDERL----WREYYAKPFEMAIRDGGARSMMAAYNAINGTPAHV-HPM 253
Query: 281 LSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTK 340
L +WG G I +D ++ + + A +KAG+++ FL HT+
Sbjct: 254 LRDIVMGEWGLDGTICTDGGGLAHLVNQHKTYPDLPTATAACIKAGINL----FLDNHTQ 309
Query: 341 A---AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAH----Q 393
A AV+Q + E+EID + + + LGL + P + P+ IG + P
Sbjct: 310 AALDAVEQSLVTEAEIDDVIRGRIRLFLDLGLLD-PPELVPYSNIGHEPGLEPWELPETH 368
Query: 394 VLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQ 453
+ + IVLLKN + +LPL SK S+A++GP AN+ TLL Y+G +I P
Sbjct: 369 AFVREVTRKSIVLLKNENNILPLDPSKINSVAIVGPLANT--TLLDWYSGTPPYAIPPRD 426
Query: 454 ALQNYVENTVYYP-----GCDTVACSSASIDKAVDIAKGADHVVLMMG------------ 496
++ Y N+ +P G + VA S D A+++A D ++++G
Sbjct: 427 GIEGYA-NSGPFPSPAKFGSNWVADMS---DTALEVAASRDVAIVVVGNHPESNAGWGVV 482
Query: 497 LDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSI 556
++ KE +DR +++L Q+E I +V AA I+VL+ P + +A N +I
Sbjct: 483 TSPSEGKEAVDRQEIILQPDQEEFIQKVY-AANPNTIVVLVSNFPYAMPWAA--ENAPAI 539
Query: 557 LWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKV-PMTDMKMRPQATSGNPGRT 615
+ + + ALA+V+FGD+NPGG+ TW P+ ++ PM D +R GRT
Sbjct: 540 VHITHASQEQGNALADVLFGDYNPGGKTVQTW-PKSLDQLPPMMDYDIR-------RGRT 591
Query: 616 YRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGT 675
Y + + + +PFG GLSY+ +++ + K +++T
Sbjct: 592 YMYSQHEPQYPFGYGLSYT--TFELSKLKAPKKLKADATAT------------------- 630
Query: 676 EFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNA 729
+ + V N GE G V L+V+ RP KQL GFQ V + A
Sbjct: 631 ---------IKVRVANTGERDGDEVVQLYVRYPNSKVERPSKQLKGFQRVTVPA 675
>gi|435848436|ref|YP_007310686.1| beta-glucosidase-like glycosyl hydrolase [Natronococcus occultus
SP4]
gi|433674704|gb|AGB38896.1| beta-glucosidase-like glycosyl hydrolase [Natronococcus occultus
SP4]
Length = 771
Score = 270 bits (689), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 218/715 (30%), Positives = 349/715 (48%), Gaps = 103/715 (14%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIPA E L G G AT+FPQ+I A+++D L + + I
Sbjct: 95 RLGIPAIPH-EECLSGYMG-----------PEATTFPQMIGMASTWDPELLEEVTETIRG 142
Query: 141 EARALYNAGQAIGMTF-WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTF 199
E AL G T +P +++ RD RWGR +ET GEDPL+ A YV G+QGD
Sbjct: 143 ELEAL-------GTTHALSPVLDVARDLRWGRVEETFGEDPLLVAAMACGYVSGLQGD-- 193
Query: 200 NGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRA 259
G+ G SA KHF + + G R + V ++L + + P+E+ ++ A
Sbjct: 194 --GRADG---VSATLKHFVGHGATDG-GKNRSSLN--VGPRELREVHLFPYEAAIRTADA 245
Query: 260 SGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAV 319
+M AY+ ++GIP + LL+ R ++GF G + SD +V + G A + +A
Sbjct: 246 ESVMNAYHDIDGIPCASSEWLLTDLLRGEFGFDGTVVSDYYSVRHLVTEHGTANTKPEAA 305
Query: 320 VDVLKAGMDVNC--GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQ 377
L+AG+DV + +H AV+ +L E +D ++ + + R GL + +P++
Sbjct: 306 TAALEAGLDVELPYTDYYGEHLITAVENGELSEKTLDESVRRVLREKARKGLLD-DPSVD 364
Query: 378 PFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTL 437
+ AD + L +AA+ + LLKN + LLPL S+A+IGP A++ K L
Sbjct: 365 --AEAAADAFRTDEAAALNRRAARRSMTLLKNENELLPLTAD---SVAVIGPKADAKKEL 419
Query: 438 LGNYAGPS--------CRSITPLQALQNYVENTVYYP-GCDTVACSSASIDKAVDIAKGA 488
LG+YA + + TPL AL++ V Y GC S+ + A +A+ A
Sbjct: 420 LGDYAYAAHYPEEEYASDATTPLAALESRDGLEVSYEQGCTVSGPSTDGFEPAAQVAEDA 479
Query: 489 DHVVLMMGL-------DQTQEKEE----------LDRVDLVLPGRQQELITRVAEAAKKP 531
D + +G D KEE D DL LPG Q+ELI R+ E P
Sbjct: 480 DVALAFVGARSAVDFSDGDASKEEKPSVPTSGEGCDVTDLGLPGVQEELIDRLQETGT-P 538
Query: 532 VILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQ 591
+ +V++ G P I + ++ ++L+A PG+ G A+ +V+FG+HNP GRLP++ P+
Sbjct: 539 LAVVIVSGRPHSIE--RITADVPAVLYAWLPGDEGGSAIVDVLFGEHNPSGRLPVSL-PK 595
Query: 592 DYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLN 651
++P+ + +A + N ++Y + +G+ V+PFG GLSY+++ Y ++S+
Sbjct: 596 SVGQLPVYYNR---KANTAN--KSYVYTDGEPVYPFGHGLSYTEFEYGTLSLSE------ 644
Query: 652 QSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRG 711
K V L T + ++ V N G+ +G V L+ A
Sbjct: 645 ----------------KRVSPLET-------VVASVPVTNEGDRSGAEVVQLYAHAANPS 681
Query: 712 NGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
RP+++L+GF+ V L A E + FELSP + L+ E + +EEG + + VG
Sbjct: 682 QARPVQELIGFERVPLEAGETKRVSFELSPTQ-LAFHDESMTLTVEEGPYEIRVG 735
>gi|189464310|ref|ZP_03013095.1| hypothetical protein BACINT_00651 [Bacteroides intestinalis DSM
17393]
gi|189438100|gb|EDV07085.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 864
Score = 269 bits (688), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 160/424 (37%), Positives = 240/424 (56%), Gaps = 34/424 (8%)
Query: 51 PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
P+ +R DL+SRLT++EKIS L ++P IPRL IP Y +EALHGV G+
Sbjct: 34 PMHERIMDLLSRLTVEEKISLLRATSPGIPRLDIPKYYHGNEALHGVVRPGR-------- 85
Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN---------AGQAIGMTFWAPNI 161
T FPQ I AA+++ L ++ I EARA +N + + +TFW+P +
Sbjct: 86 --FTVFPQAIGLAATWNPELQLQVATVISDEARARWNELDQGREQKSQFSDLLTFWSPTV 143
Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
N+ RDPRWGR ET GEDP ++G ++V+G+QGD L+ + KHF A
Sbjct: 144 NMARDPRWGRTPETYGEDPYLSGVMGTAFVKGLQGDD------DRYLKIVSTPKHFAA-- 195
Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
N + R+ + +++ + L + Y P FE+CVK G+++ IM AYN +N +P + LL
Sbjct: 196 --NNEEHNRFVCNPQISEKQLREYYLPAFEACVKDGKSASIMSAYNALNDVPCTLNAWLL 253
Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTK 340
+K R WGF GY+ SDC S++ +A Y K+ E A +KAG+D+ CG +
Sbjct: 254 TKVLREDWGFKGYVVSDCGGPSLLVNAHKYVKTKEAAATLSIKAGLDLECGDDVFDEPLL 313
Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
+A +Q + ++ID A + + RM+LGLF+ P+ KI VV S HQ +AL AA
Sbjct: 314 SAYRQYMVTNADIDSAAYRVLRARMQLGLFDSGEK-NPYTKISPAVVGSAKHQEVALNAA 372
Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQALQNYV 459
++ IVLLKN +LPL K S+A++G NA + + G+Y+G P I+ LQ +++ V
Sbjct: 373 RECIVLLKNQKKMLPLNAKKVKSIAVVGINAGNCE--FGDYSGSPVIAPISVLQGIKDRV 430
Query: 460 ENTV 463
+ V
Sbjct: 431 GDGV 434
Score = 123 bits (309), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 81/280 (28%), Positives = 140/280 (50%), Gaps = 47/280 (16%)
Query: 489 DHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAK 548
+ VV ++G++++ E+E DR D+ LP Q E + + + +++VL+ G + + +
Sbjct: 603 ETVVAVLGINKSIEREGQDRYDIQLPADQMEFLQEIYKVNPN-IVVVLVAGSSLAVNWM- 660
Query: 549 YDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQAT 608
D ++ +I+ A YPGE+G A+AEV+FGD+NPGGRLP+T+Y P D +
Sbjct: 661 -DEHVPAIVNAWYPGESGGKAVAEVLFGDYNPGGRLPLTYYRSLDELPPFDDYDI----- 714
Query: 609 SGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYK 668
GRTY++++G ++PFG GLSY+ + Y V+ + +N
Sbjct: 715 --TKGRTYKYFKGDVLYPFGYGLSYTTFKYSNLQVADGEEEIN----------------- 755
Query: 669 SVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILN 728
V+ +KN G+ AG ++VK R P+K+L GF+ V L
Sbjct: 756 ----------------VSFQLKNAGKYAGDEVAQVYVKLPERDEVMPVKELKGFERVALK 799
Query: 729 AKEKAEIVFELSPCESLSRARED--GLMVIEEGTHFLVVG 766
+ E ++ +L + L R ++ G V G + ++VG
Sbjct: 800 SGENKKMTLKLR--KDLLRYWDEAKGKFVYPSGDYTIMVG 837
>gi|153809292|ref|ZP_01961960.1| hypothetical protein BACCAC_03604 [Bacteroides caccae ATCC 43185]
gi|149128062|gb|EDM19283.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
caccae ATCC 43185]
Length = 946
Score = 269 bits (687), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 234/798 (29%), Positives = 363/798 (45%), Gaps = 135/798 (16%)
Query: 49 TLPISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----W-------SEALH 95
T PI R DL+S++TL+EK Q+V + + +P EW W E L+
Sbjct: 57 TAPIDARIEDLLSQMTLEEKTCQMVTLYGYKRVLKDDLPTSEWKNQLWKDGIGAIDEHLN 116
Query: 96 GVAGVGK-------------------------------GI---FFNGTIRG-----ATSF 116
G G GI F N IRG AT+F
Sbjct: 117 GFQQWGLPPSDNEYVWPASKHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVESYKATNF 176
Query: 117 PQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQET 175
P + +++ L +++G G EAR L G T +AP +++ RD RWGR +E
Sbjct: 177 PTQLGLGHTWNRQLIHQVGLITGREARML-------GYTNVYAPILDVGRDQRWGRYEEV 229
Query: 176 PGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDA 235
GE P + + + VRG+Q + Q +A KHF AY + + D
Sbjct: 230 YGESPYLVAELGIEMVRGMQHNH----------QVAATGKHFIAYSNNKGAREGMARVDP 279
Query: 236 RVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYI 295
+++ +++ + PF+ +++ G+M +YN +G P + L+ R + GF GY+
Sbjct: 280 QMSPREVEMLHAYPFKRVIREAGLLGVMSSYNDYDGFPIQSSYYWLTTRLRGEMGFRGYV 339
Query: 296 TSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT-----KAAVKQKKLPE 350
SD DAV +Y G AK ++AV ++AG++V C +F + + VK+ L E
Sbjct: 340 VSDSDAVEYLYTKHGTAKDMKEAVRQSVEAGLNVRC-TFRSPDSYVLPLRELVKEGGLSE 398
Query: 351 SEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD-VVCSPAHQVLALQAAQDGIVLLKN 409
I+ + ++ V+ +GLF+ P GAD V ++ +ALQA+++ IVLLKN
Sbjct: 399 EVINDRVRDILRVKFLVGLFD-TPYQTDLK--GADEEVEKKENEEVALQASRESIVLLKN 455
Query: 410 SHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE---NTVYYP 466
+LPL SK +A+ GPNA+ L +Y + + L+ +Q ++ + +Y
Sbjct: 456 EKNVLPLDPSKIRKIAVCGPNADEHSYALTHYGPLAVEVTSVLKGIQEKMKDKADVLYTK 515
Query: 467 GCDTVACS---------------SASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDL 511
GCD V + IDKAV AK AD ++++G Q E R L
Sbjct: 516 GCDLVDANWPESELIDYPLTDEEQKEIDKAVSQAKQADVAIVVLGGGQRTCGENKSRSSL 575
Query: 512 VLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALA 571
LPGRQ +L+ V A KPV+LVL+ G P+ I +A D+ + +IL A YPG G +A+A
Sbjct: 576 DLPGRQLDLLKAVV-ATGKPVVLVLINGRPLSINWA--DKFVPAILEAWYPGSKGGIAVA 632
Query: 572 EVIFGDHNPGGRLPMTWYPQDYIKVPMT-DMKMRPQATSG-NPGRTYRFYEGK-EVFPFG 628
+++FGD+NPGG+L +T +P+ ++P K Q G NPG ++PFG
Sbjct: 633 DILFGDYNPGGKLTVT-FPKTVGQIPFNFPCKPSSQIDGGKNPGPDGNMSRANGALYPFG 691
Query: 629 CGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIG 688
GLSY+ + Y +S + NQ K VT
Sbjct: 692 YGLSYTTFEYSDLKISPAIITPNQ-----------------------------KAYVTCK 722
Query: 689 VKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRA 748
V N G+ +G + L+V+ K L GF+ V L E EI F + ++L
Sbjct: 723 VTNTGKRSGDEVIQLYVRDVLSSVTTYEKNLAGFERVHLKPGETKEITFPID-RKALELL 781
Query: 749 REDGLMVIEEGTHFLVVG 766
D V+E G L++G
Sbjct: 782 NADMHWVVEPGDFTLMLG 799
>gi|329923020|ref|ZP_08278536.1| glycosyl hydrolase family 3 N-terminal domain protein
[Paenibacillus sp. HGF5]
gi|328941793|gb|EGG38078.1| glycosyl hydrolase family 3 N-terminal domain protein
[Paenibacillus sp. HGF5]
Length = 763
Score = 269 bits (687), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 221/730 (30%), Positives = 343/730 (46%), Gaps = 101/730 (13%)
Query: 67 EKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASF 126
E ++ + A RLGIP + E HG +G AT FP + +++
Sbjct: 90 EAVNAIQRYAMEHSRLGIPIL-FGEECSHGHMAIG-----------ATVFPVPLTIGSTW 137
Query: 127 DSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKY 186
++ L+ I +A+ E RA A ++P +++ RDPRWGR +ET GEDP + ++
Sbjct: 138 NTELFRSISRAVAAETRAQGGAAT------YSPVLDVVRDPRWGRTEETFGEDPHLVAEF 191
Query: 187 AVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTY 246
AV+ V+G+QG+ +L A KHF Y R + +++L +
Sbjct: 192 AVAAVQGLQGE-----RLDSHTSLLATLKHFAGYGASE---GGRNGAPVHMGLRELHEVD 243
Query: 247 QPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIY 306
PF V+ G S IM AYN ++G+P + R LL R WGF G++ +DC A+ ++
Sbjct: 244 LLPFRKAVESGALS-IMTAYNEIDGVPCTSSRYLLQNVLREAWGFDGFVITDCGAIHMLA 302
Query: 307 DAEGYAKSPEDAVVDVLKAGMDVN-CGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRM 365
A S +A LKAG+D+ G+ + H + A++Q + E +++RA + ++
Sbjct: 303 CGHNTAGSGVEAATQSLKAGVDMEMSGTMFRAHLQQALEQGLITEDDLNRAAGRVLELKF 362
Query: 366 RLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLA 425
RLGLF+ P + P V+ H LA QAA +GIVLLKN LLPL S S ++A
Sbjct: 363 RLGLFD-RPYVDP--AWAEQVIGCKEHIALAYQAAAEGIVLLKNEGNLLPL-DSSSGTIA 418
Query: 426 LIGPNANSAKTLLGNYAGPS--CRSITPLQALQNYVENT--VYYPGCDTVACSSASIDKA 481
+IGPNA++ LG+Y P + +T L ++ + ++ +Y PGC S +A
Sbjct: 419 VIGPNAHTPYHQLGDYTSPQPPGQIVTVLDGIRRRLGDSRVLYAPGCRIQGDSREGFPRA 478
Query: 482 VDIAKGADHVVLMMGLDQTQE-------------------------KEELDRVDLVLPGR 516
+ A+ AD +V+++G ++ E +DR L L G
Sbjct: 479 LACAEQADVIVMVLGGSSARDFGEGTIDLRTGASVVTGDAKSDMECGEGIDRSTLTLMGV 538
Query: 517 QQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFG 576
Q EL+ + + KPVI+V + G P IT D I +I+ A YPG+ G A+A+++FG
Sbjct: 539 QLELLQELQKLG-KPVIVVYINGRP--ITEPWIDEFIPAIIEAWYPGQEGGGAIADMLFG 595
Query: 577 DHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKY 636
D NP GRLP++ P++ ++P++ R + G+ Y + +PFG GLSY+++
Sbjct: 596 DINPSGRLPLS-IPKEVGQLPISYNARRTR------GKRYLETDLAPRYPFGFGLSYTEF 648
Query: 637 SYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMA 696
Y V +V +G E V I V N G
Sbjct: 649 RYGRLTVE-----------------------PAVVPIGGEAT------VRIDVTNAGARD 679
Query: 697 GKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVI 756
G V L+V RP K L GF+ V L A E E+ F + E L D V+
Sbjct: 680 GAEVVQLYVSDLAASVTRPEKALKGFRKVFLKAGETQEVTFTIG-SEQLELIGLDLKPVV 738
Query: 757 EEGTHFLVVG 766
E G + VG
Sbjct: 739 EPGEFRIQVG 748
>gi|189464698|ref|ZP_03013483.1| hypothetical protein BACINT_01042 [Bacteroides intestinalis DSM
17393]
gi|189436972|gb|EDV05957.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 862
Score = 269 bits (687), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 163/446 (36%), Positives = 243/446 (54%), Gaps = 38/446 (8%)
Query: 41 ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
+T P+ L ++RA+DLV RLTL+EK + + + AIPRLGI + WWSEALHGVA
Sbjct: 19 QTLPYQNPELSPAERAKDLVKRLTLEEKALLMCDDSEAIPRLGIKKFNWWSEALHGVANQ 78
Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG------- 153
G T FP+ + AASF+ L + I A+ E RA +N G
Sbjct: 79 G----------NVTVFPEPVGMAASFNDKLVFEIFNAVSDEMRAKHNERVRNGLEDVRFH 128
Query: 154 -MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASA 212
++ W PN+NIFRDPRWGRGQET GEDP +T + ++ V+G+QG K + L A
Sbjct: 129 SLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGIAVVKGLQGP--ENEKYRKLL---A 183
Query: 213 CCKHFTAYDLDNWKGTTRYKFD-ARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
C KH+ + W +R+ + V+ +DL +TY P F++ V++ +MCAY R++
Sbjct: 184 CAKHYAVHSGPEW---SRHTANLNNVSPRDLWETYLPAFKALVQKADVREVMCAYQRLDD 240
Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLK---AGMD 328
P C + LL + R +WGF + SDC A++ + + KS DAV +K AG D
Sbjct: 241 DPCCGNTRLLQQILRDEWGFKYLVVSDCGAIADFWTSH---KSSSDAVHAAVKGTMAGTD 297
Query: 329 VNCG-SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVV 387
V CG + + AV + + E E+++ + L R LG + +P++ + KI VV
Sbjct: 298 VECGYGYAYQKLPEAVSRGLITEEEVNKHVLRLMEGRFELGEMD-DPSLVNWTKIPMSVV 356
Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
AH+ L+L ++ + LL+N + +LPL KS +A+IGPNA+ L GNY G +
Sbjct: 357 NCKAHKDLSLNMSRQTMTLLQNKNNVLPLSKSIR-KIAVIGPNADDKPMLWGNYNGTPNQ 415
Query: 448 SITPLQALQNYVENT--VYYPGCDTV 471
+IT L ++ ++ VY GCD V
Sbjct: 416 TITILDGFKSKLKKNQIVYMKGCDLV 441
Score = 94.7 bits (234), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 81/301 (26%), Positives = 133/301 (44%), Gaps = 57/301 (18%)
Query: 455 LQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL-------- 506
++N+ N + G + S SI K KG D VV + G+ E EE+
Sbjct: 571 VENWAANLKFNLGKEFPINYSESISKL----KGIDVVVFVGGISPQLEGEEMPVNIPGFK 626
Query: 507 --DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGE 564
DR D+ LP Q+ + + +A K+ ++ + C G + + +IL A Y GE
Sbjct: 627 GGDRTDIELPAVQRNFLKALKDAGKQ--VVFVNCSGS-SMALLPETESCDAILQAWYGGE 683
Query: 565 AGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEV 624
G A+A+V+FGD+NP G+LP+T+Y D M+ GRTYR Y +
Sbjct: 684 LGGYAVADVLFGDYNPSGKLPVTFYKNTKQLPDYEDYSMK--------GRTYR-YMSDPL 734
Query: 625 FPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFL 684
FPFG GLSY+ ++ + ++ +L ++S
Sbjct: 735 FPFGFGLSYTDFAIGTASCNKTQLRTDES-----------------------------LT 765
Query: 685 VTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCES 744
+T+ V N G+ +G V ++++ +G P+K L + V L + ++ EL P ES
Sbjct: 766 LTVPVSNTGKRSGTEIVQVYIRKTDDADG-PLKSLKAYARVELATGAQQDVKIEL-PSES 823
Query: 745 L 745
Sbjct: 824 F 824
>gi|313204103|ref|YP_004042760.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
gi|312443419|gb|ADQ79775.1| glycoside hydrolase family 3 domain protein [Paludibacter
propionicigenes WB4]
Length = 1278
Score = 269 bits (687), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 156/425 (36%), Positives = 235/425 (55%), Gaps = 31/425 (7%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
+ T +RA DLVSR+TL+EK SQL N+ P IPRLG+ Y+ W EALHGV G
Sbjct: 39 YLNTAYSFKERAADLVSRMTLEEKQSQLGNTMPPIPRLGVNKYDVWGEALHGVVGRNNN- 97
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIF 164
+G I ATSFP + +++D L R + EAR +N +T+W+P I
Sbjct: 98 --SGMI--ATSFPNSVAVGSTWDPALIKRETSVVADEARG-FNHDLIFTLTYWSPVIEPA 152
Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGD--TFNGGKLKGKLQASACCKHFTAYDL 222
RDPRWGR ET GEDP + + +V+G+ GD T+ L+ C KH+ A
Sbjct: 153 RDPRWGRTAETFGEDPFLVSQIGSGFVQGLMGDDPTY--------LKTVPCGKHYFA--- 201
Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
N R+ A + +D+ + Y P+ + +++ + IM AY+ VNG+P A + L+
Sbjct: 202 -NNSEFNRHNGSANMDDRDMREFYLTPYRTLIQKDKLPSIMTAYSAVNGVPMSASKFLVD 260
Query: 283 KTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAA 342
A+R +G GY+T DCDAV+ + ++ YAKS +A LK G+D +CG Q A
Sbjct: 261 TIAKRTYGLDGYVTGDCDAVADVVNSHRYAKSKAEAAAMGLKTGVDSDCGGIYQTSALEA 320
Query: 343 VKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQD 402
+KQ + E+++D+AL N++++RMRLG F+ + P+ I ++ P+H LAL+ A
Sbjct: 321 LKQGLISEADMDKALVNIYTIRMRLGEFDPQ-NIVPYAGIKPSIINDPSHNDLALEIATK 379
Query: 403 GIVLLKNS------HGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPS--CRSITPLQA 454
VLLKN+ LPL +A++GP A+ K LG+Y+G + ITPL+
Sbjct: 380 SPVLLKNNLVGKSGKKALPLNAGTIKKIAVLGPQAD--KVELGDYSGEADPKYKITPLEG 437
Query: 455 LQNYV 459
++NY+
Sbjct: 438 IKNYI 442
Score = 133 bits (335), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 91/275 (33%), Positives = 136/275 (49%), Gaps = 41/275 (14%)
Query: 476 ASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILV 535
A+ + +D+A AD V+ +G DQT +EE DR + LPG Q ELI +A A P +V
Sbjct: 603 ATDKETLDMAASADVAVVFVGTDQTTGREESDRFAITLPGNQNELIKSIA--AVNPNTIV 660
Query: 536 LLCG-GPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI 594
++ G G V++ K + N+ I++ GY G+A A+A+V+FGD NPGG+ +TWY
Sbjct: 661 VIQGMGMVEVEQFKNNPNVAGIIFTGYNGQAQGTAMAKVLFGDVNPGGKTSLTWYKSIND 720
Query: 595 KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSS 654
+TD +R A G GRTY ++ + FG GLSY+ ++Y +S+ + N
Sbjct: 721 LPALTDYTLRGGA--GKNGRTYMYFNKDVSYEFGYGLSYTTFAYSNFNISKTSITPND-- 776
Query: 655 STKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVK----PARR 710
K VT+ VKN G + G V ++VK PA
Sbjct: 777 ---------------------------KVTVTVDVKNTGTVDGDEVVQIYVKTPDSPASL 809
Query: 711 GNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESL 745
RPIK+L GF+ V + A + + E+ C L
Sbjct: 810 --ERPIKRLKGFKRVAIPAGQTKTVSIEVD-CADL 841
>gi|423223731|ref|ZP_17210200.1| hypothetical protein HMPREF1062_02386 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392638106|gb|EIY31959.1| hypothetical protein HMPREF1062_02386 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 854
Score = 269 bits (687), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 159/424 (37%), Positives = 241/424 (56%), Gaps = 34/424 (8%)
Query: 51 PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
P+ +R DL+SRLT++EKIS L ++P I RL IP Y +EALHGV G+
Sbjct: 34 PMHERIMDLLSRLTVEEKISLLRATSPGISRLDIPKYYHGNEALHGVVRPGR-------- 85
Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN---------AGQAIGMTFWAPNI 161
T FPQ I AA+++ L ++ I EARA +N + + +TFW+P +
Sbjct: 86 --FTVFPQAIGLAATWNPELQLQVATVISDEARARWNELDQGREQKSQFSDLLTFWSPTV 143
Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
N+ RDPRWGR ET GEDP ++G ++V+G+QGD L+ + KHF A
Sbjct: 144 NMARDPRWGRTPETYGEDPYLSGIMGTAFVKGLQGDD------DRYLKIVSTPKHFAA-- 195
Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
N + R+ + +++ + L + Y P FE+CVK G+++ IM AYN +N +P + LL
Sbjct: 196 --NNEEHNRFVCNPQISEKQLREYYLPAFEACVKDGKSASIMSAYNALNDVPCTLNAWLL 253
Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTK 340
+K R+ WGF GY+ SDC S++ +A Y K+ E A +KAG+D+ CG +
Sbjct: 254 TKVLRKDWGFKGYVVSDCGGPSLLVNAHKYVKTKEAAAALSIKAGLDLECGDDVYDQPLL 313
Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
+A +Q + +++ID A + + RM LGLF+ P+ KI V+ S HQ +AL AA
Sbjct: 314 SAYRQYMVTDADIDSAAYRVLRARMELGLFDSG-EQNPYTKISPAVIGSAEHQEVALNAA 372
Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQALQNYV 459
++ IVLLKN +LPL K S+A++G NA S++ G+Y+G P I+ LQ +++ V
Sbjct: 373 RECIVLLKNQKKMLPLNAKKVKSIAVVGINAGSSE--FGDYSGLPVIAPISVLQGIKDRV 430
Query: 460 ENTV 463
+ V
Sbjct: 431 GDDV 434
Score = 129 bits (323), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 85/286 (29%), Positives = 139/286 (48%), Gaps = 52/286 (18%)
Query: 489 DHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAK 548
+ VV ++G++++ E+E DR D+ LP QQE + + + +++VL+ G + I +
Sbjct: 603 ETVVAVLGINKSIEREGQDRYDIQLPADQQEFLQEIYKVNPN-IVVVLVAGSSLAINWM- 660
Query: 549 YDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQAT 608
D +I +I+ A YPGE+G A+AEV+FGD+NPGGRLP+T+Y P D +
Sbjct: 661 -DEHIPAIVNAWYPGESGGKAVAEVLFGDYNPGGRLPLTYYRSLDELPPFDDYDI----- 714
Query: 609 SGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYK 668
GRTY++++G ++PFG GLSY+ + Y V+ + +N
Sbjct: 715 --TKGRTYKYFKGDVLYPFGYGLSYTTFKYSNLQVADGEEEIN----------------- 755
Query: 669 SVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILN 728
V+ +KN G+ AG ++VK R PIK+L GF+ V L
Sbjct: 756 ----------------VSFQLKNSGKYAGDEVAQVYVKLPERDEVMPIKELKGFERVTLK 799
Query: 729 AKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
+ E ++ +L R+D L +E V +Y I +
Sbjct: 800 SGENKKVTLKL---------RKDLLRYWDEAKDKFVCPSGDYTIMV 836
>gi|423313129|ref|ZP_17291065.1| hypothetical protein HMPREF1058_01677 [Bacteroides vulgatus
CL09T03C04]
gi|392686343|gb|EIY79649.1| hypothetical protein HMPREF1058_01677 [Bacteroides vulgatus
CL09T03C04]
Length = 864
Score = 269 bits (687), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 161/443 (36%), Positives = 241/443 (54%), Gaps = 33/443 (7%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
+ ++L +RA DL+ +LTL+EK++ +++++ + RLGI Y WW+EALHGVA G
Sbjct: 24 YKDSSLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------MTFW 157
AT FPQ I AASF+ + I A+ EARA A A G +T W
Sbjct: 82 --------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNAAYSAAGSYERYQGLTMW 133
Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
P +NI+RDPRWGRG ET GEDP +T V+ V+G+Q + + K+ A C KHF
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQ--CMDANQKYDKIHA--CAKHF 189
Query: 218 TAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
+ W R++F+A + +DL +TY PFE+ VK+ + +MCAYNR+ G P C
Sbjct: 190 AVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEAKVKEVMCAYNRLEGDPCCG 246
Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDVNCGSF 334
LL + R+ WG+ G + SDC A+ Y +G+ P E A + +G D+ CGS
Sbjct: 247 SDRLLMQILRQDWGYDGIVLSDCGAIDDFYREKGHKTHPDAESASAAAVLSGTDLECGSS 306
Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
+ ++A K+ + E +ID ++ L R LG + +P + KI VVCS H
Sbjct: 307 YKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMD-DPDKVEWTKIPYSVVCSAEHDS 364
Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
L+L A+ + LL N + +LPL K ++A++GPNAN + GNY G +IT L+
Sbjct: 365 LSLDIARKSMTLLLNKNNILPL-KRGGQTIAVMGPNANDSVMQWGNYNGTPKHTITLLEG 423
Query: 455 LQNYV---ENTVYYPGCDTVACS 474
+++ + + +Y GC V S
Sbjct: 424 IRSAMGENDKLIYEQGCSWVERS 446
Score = 117 bits (294), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 90/276 (32%), Positives = 133/276 (48%), Gaps = 54/276 (19%)
Query: 477 SIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAE 526
+I V K AD V+ G+ + E EE+ DR D+ LP Q+ELI + +
Sbjct: 591 NIKNTVAKVKDADVVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCD 650
Query: 527 AAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPM 586
A KK VI V G P I + +IL A YPG++G A+AEV+FGD+NP GRLP+
Sbjct: 651 AGKK-VIFVNFSGSP--IAMEPETKYCQAILQAWYPGQSGGKAVAEVLFGDYNPAGRLPV 707
Query: 587 TWYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQ 645
T+Y ++ ++P D M GRTYR+++G +FPFG GLSY+ ++Y
Sbjct: 708 TFY-RNITQLPNFEDYNM--------TGRTYRYFKGDPLFPFGYGLSYTTFNY------- 751
Query: 646 NKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFV 705
+ L Q+ + ET K +V V N G G+ V +++
Sbjct: 752 GNIKLEQT---------------------IKVGETAKIIVP--VTNTGNRDGEEVVQVYL 788
Query: 706 KPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSP 741
K G P+K L F+ V + A + + EL+P
Sbjct: 789 KKQEDAEG-PVKTLRAFKRVQIPAGKTVNVELELTP 823
>gi|53712134|ref|YP_098126.1| beta-glucosidase [Bacteroides fragilis YCH46]
gi|52214999|dbj|BAD47592.1| periplasmic beta-glucosidase precursor [Bacteroides fragilis YCH46]
Length = 812
Score = 269 bits (687), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 222/728 (30%), Positives = 336/728 (46%), Gaps = 114/728 (15%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIP + E HG +G T FP I A++++ L ++G+ I +
Sbjct: 149 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRQMGRVIAI 196
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
EA A G IG + P +++ RDPRW R +ET GEDP + G + VRG QG+T N
Sbjct: 197 EASA---QGAHIG---YGPVLDLARDPRWSRVEETYGEDPYLNGVMGTALVRGFQGETLN 250
Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
GK A KHF +Y W A + ++L + PPF V G A
Sbjct: 251 DGK-----SVIATLKHFASY---GWTEGGHNGGTAHIGERELEEAIFPPFREAVGAG-AL 301
Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
+M +YN ++G P R LL+ + +W F G++ SD AV + + G A + +A +
Sbjct: 302 SVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVSDLYAVGGLRE-HGVAGNDYEAAI 360
Query: 321 DVLKAGMDVNCGS-FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
+ AG+D + G+ + AAVK+ + + ID+A+ + S++ ++GLF+ PF
Sbjct: 361 KAVNAGVDSDLGTNVYAEQLVAAVKRGDVAVATIDKAVRRILSLKFQMGLFD-----DPF 415
Query: 380 --GKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTL 437
K +V S H LA + A+ IVLLKN LLPL K +LA+IGPNA++ +
Sbjct: 416 VDEKQAVQLVASSEHTGLAREVARQSIVLLKNKDKLLPLKKDIR-TLAVIGPNADNVYNM 474
Query: 438 LGNYAGPSCRS--ITPLQALQNYVEN---TVYYPGCDTVACSSASIDKAVDIAKGADHVV 492
LG+Y P +T L ++ V +Y GC S A++ A+ AD VV
Sbjct: 475 LGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCAVRDSSRTGFKDAIETARNADAVV 534
Query: 493 LMMG----LDQTQEKEE-------------------LDRVDLVLPGRQQELITRVAEAAK 529
++MG D + E EE DR L L GRQ EL+ ++
Sbjct: 535 MVMGGSSARDFSSEYEETGAAKVTINQISDMESGEGYDRATLHLMGRQLELLEEISRLG- 593
Query: 530 KPVILV----LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLP 585
KPV+L+ LL G + + +I+ A YPG G A+A+V+FGD+NP GRL
Sbjct: 594 KPVVLIKGRPLLMEGAI--------QEAEAIVDAWYPGMQGGNAVADVLFGDYNPAGRLT 645
Query: 586 MTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQ 645
++ P+ ++P+ R GN R Y G +PFG GLSY+ +SY +
Sbjct: 646 LS-VPRSVGQLPVYYNTRR----KGNRSR-YVEEPGTPRYPFGYGLSYTTFSY-----TD 694
Query: 646 NKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFV 705
K+ + + S V+ VT+ ++N G G L+
Sbjct: 695 MKVQVTEGSDDCWVD------------------------VTVTIQNQGTADGDEVAQLYF 730
Query: 706 KPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVV 765
+ P KQL F + L A E E+ F L +SL+ ++G V+E G ++V
Sbjct: 731 RDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTLDK-KSLALYMQEGEWVVEPGRFTIMV 789
Query: 766 GDEEYPIS 773
G I+
Sbjct: 790 GGSSEDIT 797
>gi|294777452|ref|ZP_06742903.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
vulgatus PC510]
gi|294448520|gb|EFG17069.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
vulgatus PC510]
Length = 864
Score = 268 bits (686), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 161/443 (36%), Positives = 241/443 (54%), Gaps = 33/443 (7%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
+ ++L +RA DL+ +LTL+EK++ +++++ + RLGI Y WW+EALHGVA G
Sbjct: 24 YKDSSLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------MTFW 157
AT FPQ I AASF+ + I A+ EARA A A G +T W
Sbjct: 82 --------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNAAYSAAGSYERYQGLTMW 133
Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
P +NI+RDPRWGRG ET GEDP +T V+ V+G+Q + + K+ A C KHF
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQ--CMDANQKYDKIHA--CAKHF 189
Query: 218 TAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
+ W R++F+A + +DL +TY PFE+ VK+ + +MCAYNR+ G P C
Sbjct: 190 AVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEAKVKEVMCAYNRLEGDPCCG 246
Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDVNCGSF 334
LL + R+ WG+ G + SDC A+ Y +G+ P E A + +G D+ CGS
Sbjct: 247 SDRLLMQILRQDWGYDGIVLSDCGAIDDFYREKGHKTHPDAESASAAAVLSGTDLECGSS 306
Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
+ ++A K+ + E +ID ++ L R LG + +P + KI VVCS H
Sbjct: 307 YKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMD-DPDKVEWTKIPYSVVCSAEHDS 364
Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
L+L A+ + LL N + +LPL K ++A++GPNAN + GNY G +IT L+
Sbjct: 365 LSLDIARKSMTLLLNKNNILPL-KRGGQTIAVMGPNANDSVMQWGNYNGTPKHTITLLEG 423
Query: 455 LQNYV---ENTVYYPGCDTVACS 474
+++ + + +Y GC V S
Sbjct: 424 IRSAMGENDKLIYEQGCSWVERS 446
Score = 116 bits (291), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 89/275 (32%), Positives = 127/275 (46%), Gaps = 52/275 (18%)
Query: 477 SIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAE 526
+I V K AD V+ G+ + E EE+ DR D+ LP Q+ELI + +
Sbjct: 591 NIKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCD 650
Query: 527 AAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPM 586
A KK VI V G P I + +IL A YPG++G A AEV+FGD+NP GRLP+
Sbjct: 651 AGKK-VIFVNFSGSP--IAMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYNPAGRLPV 707
Query: 587 TWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQN 646
T+Y D M GRTYR+++G +FPFG GLSY+ ++Y
Sbjct: 708 TFYRNTAQLPDFEDYNM--------TGRTYRYFKGDPLFPFGYGLSYTTFNY-------G 752
Query: 647 KLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVK 706
+ L Q+ + ET K +V V N G G+ V +++K
Sbjct: 753 NIKLEQT---------------------IKVGETAKIIVP--VTNTGNRDGEEVVQVYLK 789
Query: 707 PARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSP 741
G P+K L F+ V + A + + EL+P
Sbjct: 790 KQEDAEG-PVKTLRAFKRVQIPAGKTVNVELELTP 823
>gi|329956868|ref|ZP_08297436.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
gi|328523625|gb|EGF50717.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
Length = 864
Score = 268 bits (686), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 161/432 (37%), Positives = 234/432 (54%), Gaps = 33/432 (7%)
Query: 53 SQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRG 112
++RA DLV +LTL+EK++ +++++ + RLGI Y WW+EALHGVA G
Sbjct: 32 AERAEDLVKQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSG----------W 81
Query: 113 ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTFWAPNINIFR 165
AT FPQ I AASF + A+ EARA A A G+T W P +NI+R
Sbjct: 82 ATVFPQPIGMAASFSPEALHTAFVAVSDEARAKNAAYSAEGSYKRYQGLTIWTPTVNIYR 141
Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
DPRWGRG ET GEDP + VS V+G+Q N K AC KHF + W
Sbjct: 142 DPRWGRGIETYGEDPYLASVMGVSVVKGLQCLDENEKYDK----VHACAKHFAVHSGPEW 197
Query: 226 KGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
R+ F+A ++ +DL +TY PPFE+ VK+G+ +MCAYNR G P C LL+
Sbjct: 198 ---NRHSFNAENISPRDLYETYLPPFEALVKEGKVKEVMCAYNRFEGEPCCGSNRLLNHI 254
Query: 285 ARRQWGFHGYITSDCDAVSIIYDAEGYA--KSPEDAVVDVLKAGMDVNCGSFLQKHTKAA 342
RR+WG+ G + +DC A+S ++ +G+ A + +G D+ CGS + T+
Sbjct: 255 LRREWGYDGIVVADCSAISDFHNDKGHKTHADAASASSAAVLSGTDLECGSNYRSLTE-G 313
Query: 343 VKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQD 402
VK+ + E++IDR++ L R LG + P + +I VVCS H L+L A+
Sbjct: 314 VKKGFIDEADIDRSVKRLLQARFELGEMD-EPDQVRWAQIPYSVVCSDKHDSLSLDMARK 372
Query: 403 GIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV--- 459
+ LL N + LPL + ++A++GPNAN + GNY G R+IT L +++ +
Sbjct: 373 SMTLLLNKNNALPLERG-GTTIAVMGPNANDSVMQWGNYNGLPKRTITILDGIRSAMGKD 431
Query: 460 ENTVYYPGCDTV 471
+ +Y GC V
Sbjct: 432 DKLIYEQGCSWV 443
Score = 120 bits (301), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 92/277 (33%), Positives = 134/277 (48%), Gaps = 54/277 (19%)
Query: 476 ASIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVA 525
A I ++V K AD V+ G+ E EE+ DR D+ LP Q+E+I +
Sbjct: 590 ADIQRSVAKVKDADVVIFAGGISPQLEGEEMGVKLPGFRGGDRTDIELPAVQREMIKALH 649
Query: 526 EAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLP 585
+A KK VI V G P I +IL A YPG++G A+AEV+FGD+NP GRLP
Sbjct: 650 DAGKK-VIFVNCSGSP--IAMEPETEYCQAILQAWYPGQSGGKAVAEVLFGDYNPAGRLP 706
Query: 586 MTWYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVS 644
T+Y ++ ++P D M G TYRF+ G+ +FPFG GLSY+ + Y
Sbjct: 707 ATFY-RNLAQLPDFEDYNM--------AGHTYRFFNGEPLFPFGYGLSYTTFKY------ 751
Query: 645 QNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLF 704
K+ L S+ T ET K +T+ V N G G+ V ++
Sbjct: 752 -GKIQLKSSAQTD---------------------ETVK--ITVPVTNTGSRNGEEVVQVY 787
Query: 705 VKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSP 741
+K +G P+K L F+ V + A + ++ EL+P
Sbjct: 788 LKKQGETDG-PVKTLRAFKRVYIPAGKTVKVELELTP 823
>gi|383110724|ref|ZP_09931543.1| hypothetical protein BSGG_1833 [Bacteroides sp. D2]
gi|382949470|gb|EFS31133.2| hypothetical protein BSGG_1833 [Bacteroides sp. D2]
Length = 783
Score = 268 bits (686), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 209/713 (29%), Positives = 329/713 (46%), Gaps = 102/713 (14%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIP + EA HG +G AT FP I AA++ L +G+AIG
Sbjct: 131 RLGIPLF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSPQLIREVGKAIGK 178
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
E R G + P +++ RDPRW R +ET GEDP++TG+ + V G+
Sbjct: 179 EIRL------QGGHISYGPVLDLARDPRWSRVEETFGEDPVLTGEIGKAMVEGL-----G 227
Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
G L A KHF AY + F +++L + + PPF + G A
Sbjct: 228 SGDLSHPYSTLATLKHFLAYGISESGQNGNPSFAG---IRELHENFLPPFRQAIDAG-AL 283
Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
+M +YN ++GIP A+ +LL++ R +W F G + SD ++ I+ + A + E A +
Sbjct: 284 SVMTSYNSMDGIPCTANHSLLTELLRNEWKFSGIVVSDLYSIEGIHQSHFVAPTMEAAAI 343
Query: 321 DVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFG 380
L AG+DV+ G + AV ++ ++ +D ++ + ++ +GLF NP + P
Sbjct: 344 LALSAGVDVDLGGDAYMNLMNAVNTGRISKTALDASVARVLRLKFEMGLFE-NPYVDP-- 400
Query: 381 KIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGN 440
+ V S LA + AQ I LLKN H LLPL K++ V ALIGPNA++ +LG+
Sbjct: 401 EKAKKEVRSEESVTLARRVAQASITLLKNEHSLLPLNKNRKV--ALIGPNADNRYNMLGD 458
Query: 441 YAGPSCRS--ITPLQALQNYVENTV--YYPGCDTVACSSASIDKAVDIAKGADHVVLMMG 496
Y P T L ++ + ++ Y GC + I++AV A+ ++ ++ ++G
Sbjct: 459 YTAPQEEENIKTVLDGIRTKLSSSQVEYVKGCSIRDTVTTDIEQAVAAAQRSEVIIAVVG 518
Query: 497 -----------------------LDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVI 533
+ + E DR L L G+QQEL+ + +A KP+I
Sbjct: 519 GSSARDFKTSYKETGAAIADEKTISDMECGEGFDRATLSLLGKQQELL-KALKATGKPLI 577
Query: 534 LVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDY 593
+V + G P+D T+A N ++L A YPG+ G A+A+V+FGD+NP GRLP+T P+
Sbjct: 578 VVYIEGRPLDKTWA--SENADAVLTAYYPGQEGGNAIADVLFGDYNPAGRLPLT-VPRSV 634
Query: 594 IKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQS 653
++P+ K PQ Y ++ FG GLSY+ + Y
Sbjct: 635 GQIPIYYNKKAPQ------NHDYVELSASPLYAFGYGLSYTTFEYS-------------- 674
Query: 654 SSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNG 713
+L F V+ VKN G G+ L+++
Sbjct: 675 ------------------DLRVSAISPHSFEVSFKVKNTGRYDGEEVSQLYLRDEYASVV 716
Query: 714 RPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
+P+KQL F+ L E E+ F LS + + + V+E GT ++VG
Sbjct: 717 QPLKQLKHFERFCLKRGEVKEVKFVLSESD-FTIIDRNLKTVVESGTFQVMVG 768
>gi|150003731|ref|YP_001298475.1| glycoside hydrolase family protein [Bacteroides vulgatus ATCC 8482]
gi|319640047|ref|ZP_07994774.1| glycoside hydrolase family 3 [Bacteroides sp. 3_1_40A]
gi|345517061|ref|ZP_08796539.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
4_3_47FAA]
gi|149932155|gb|ABR38853.1| glycoside hydrolase family 3, candidate beta-glycosidase
[Bacteroides vulgatus ATCC 8482]
gi|254833833|gb|EET14142.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
4_3_47FAA]
gi|317388325|gb|EFV69177.1| glycoside hydrolase family 3 [Bacteroides sp. 3_1_40A]
Length = 864
Score = 268 bits (686), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 161/443 (36%), Positives = 241/443 (54%), Gaps = 33/443 (7%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
+ ++L +RA DL+ +LTL+EK++ +++++ + RLGI Y WW+EALHGVA G
Sbjct: 24 YKDSSLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------MTFW 157
AT FPQ I AASF+ + I A+ EARA A A G +T W
Sbjct: 82 --------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNAAYSAAGSYERYQGLTMW 133
Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
P +NI+RDPRWGRG ET GEDP +T V+ V+G+Q + + K+ A C KHF
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQ--CMDANQKYDKIHA--CAKHF 189
Query: 218 TAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
+ W R++F+A + +DL +TY PFE+ VK+ + +MCAYNR+ G P C
Sbjct: 190 AVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEAKVKEVMCAYNRLEGDPCCG 246
Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDVNCGSF 334
LL + R+ WG+ G + SDC A+ Y +G+ P E A + +G D+ CGS
Sbjct: 247 SDRLLMQILRQDWGYDGIVLSDCGAIDDFYREKGHKTHPDAESASAAAVLSGTDLECGSS 306
Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
+ ++A K+ + E +ID ++ L R LG + +P + KI VVCS H
Sbjct: 307 YKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMD-DPDKVEWTKIPYSVVCSAEHDS 364
Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
L+L A+ + LL N + +LPL K ++A++GPNAN + GNY G +IT L+
Sbjct: 365 LSLDIARKSMTLLLNKNNILPL-KRGGQTIAVMGPNANDSVMQWGNYNGTPKHTITLLEG 423
Query: 455 LQNYV---ENTVYYPGCDTVACS 474
+++ + + +Y GC V S
Sbjct: 424 IRSAMGENDKLIYEQGCSWVERS 446
Score = 116 bits (291), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 90/276 (32%), Positives = 132/276 (47%), Gaps = 54/276 (19%)
Query: 477 SIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAE 526
+I V K AD V+ G+ + E EE+ DR D+ LP Q+ELI + +
Sbjct: 591 NIKNTVAKVKDADVVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCD 650
Query: 527 AAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPM 586
A KK VI V G P I + +IL A YPG++G A AEV+FGD+NP GRLP+
Sbjct: 651 AGKK-VIFVNFSGSP--IAMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYNPAGRLPV 707
Query: 587 TWYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQ 645
T+Y ++ ++P D M GRTYR+++G +FPFG GLSY+ ++Y
Sbjct: 708 TFY-RNITQLPDFEDYNM--------TGRTYRYFKGDPLFPFGYGLSYTTFNY------- 751
Query: 646 NKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFV 705
+ L Q+ + ET K +V V N G G+ V +++
Sbjct: 752 GNIKLEQT---------------------IKVGETAKIIVP--VTNTGNRDGEEVVQVYL 788
Query: 706 KPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSP 741
K G P+K L F+ V + A + + EL+P
Sbjct: 789 KKQEDAEG-PVKTLRAFKRVQIPAGKTVNVELELTP 823
>gi|423346097|ref|ZP_17323785.1| hypothetical protein HMPREF1060_01457 [Parabacteroides merdae
CL03T12C32]
gi|409220895|gb|EKN13848.1| hypothetical protein HMPREF1060_01457 [Parabacteroides merdae
CL03T12C32]
Length = 955
Score = 268 bits (686), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 221/800 (27%), Positives = 363/800 (45%), Gaps = 139/800 (17%)
Query: 49 TLPISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----W-------SEALH 95
T+PI R DL+S++ ++EK Q+V + + +P +W W E L+
Sbjct: 65 TVPIDARVEDLLSQMNVEEKTCQMVTLYGYKRVLKDDLPTSDWKKQLWKDGIGAIDEHLN 124
Query: 96 GVAGVGK-------------------------------GI---FFNGTIRG-----ATSF 116
G G GI F N IRG AT+F
Sbjct: 125 GFQQWGLPPSDNPYVWPASRHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVESYIATNF 184
Query: 117 PQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQET 175
P + +++ L +++G G E R L G T +AP +++ RD RWGR +E
Sbjct: 185 PTQLGLGHTWNRNLVHKVGYITGREGRLL-------GYTNVYAPILDVGRDQRWGRYEEV 237
Query: 176 PGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDA 235
GE P + + + +G+Q D Q +A KH+ AY + + D
Sbjct: 238 YGESPYLVAELGIEMAKGMQTDH----------QVAATSKHYIAYSNNKGGREGMARVDP 287
Query: 236 RVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYI 295
+++ +++ + P++ +K+ G+M +YN +G P + L+ R ++GF GY+
Sbjct: 288 QMSPREVEMIHVYPWKRVIKEAGILGVMSSYNDYDGFPIQSSYYWLTTRLRGEFGFRGYV 347
Query: 296 TSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT-----KAAVKQKKLPE 350
SD DAV ++ G A +++V+ + AG+++ C +F + + + + +P
Sbjct: 348 VSDSDAVEYLFSKHGTAADMKESVLQSVLAGLNIRC-TFRSPDSYVLPLRELIAEGAIPM 406
Query: 351 SEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNS 410
S ID + ++ V+ +GLF+ +P + +V C+ QV ALQA+++ +VLLKN
Sbjct: 407 STIDDRVRDILRVKFLVGLFD-HPYQIDLKETDKEVNCAENQQV-ALQASKESLVLLKNQ 464
Query: 411 HGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE---NTVYYPG 467
+LPL +K +A+ GPNA+ L +Y + T L+ ++N V+ N ++ G
Sbjct: 465 DAVLPLDVNKISKIAVCGPNADEEAYALTHYGPLAVEVTTVLEGIRNKVKPGTNVLFTKG 524
Query: 468 CDTV---------------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLV 512
CD V A + IDKAV+ AK +D V+++G E R L
Sbjct: 525 CDLVDANWPESELIRYPLTAEEQSEIDKAVENAKKSDVTVVVLGGSDRTCGENKSRSSLD 584
Query: 513 LPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAE 572
LPGRQ +L+ V A KPV+L+L+ G P+ I +A D+ + +IL A YPG G A+A+
Sbjct: 585 LPGRQLDLLQAVV-ATGKPVVLILINGRPLSINWA--DKYVPAILEAWYPGSQGGTAIAD 641
Query: 573 VIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQA------TSGNPGRTYRFYEGKEVFP 626
+FGD+NPGG+L +T +P+ ++P + +P A G G R ++P
Sbjct: 642 ALFGDYNPGGKLTVT-FPKTVGQIPF-NFPTKPNAQVDGGRNKGLDGNMSRV--NGPLYP 697
Query: 627 FGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVT 686
FG GLSY+ + Y ++ P + T + + V
Sbjct: 698 FGYGLSYTTFEYSDISIQ--------------------------PAIVT---QVQPVTVR 728
Query: 687 IGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLS 746
V N G+ AG V L+V+ K LVGF + LN E E+ F + P L
Sbjct: 729 CKVTNTGKRAGDEVVQLYVRDILSSVTTYEKNLVGFDRIHLNPGETKELTFTIEP-RDLQ 787
Query: 747 RAREDGLMVIEEGTHFLVVG 766
D V+E G ++VG
Sbjct: 788 LLNSDNHWVVEPGDFKVMVG 807
>gi|294674604|ref|YP_003575220.1| family 3 glycosyl hydrolase [Prevotella ruminicola 23]
gi|294474051|gb|ADE83440.1| glycosyl hydrolase, family 3 [Prevotella ruminicola 23]
Length = 869
Score = 268 bits (685), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 157/436 (36%), Positives = 232/436 (53%), Gaps = 51/436 (11%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
+P+ T L +R +L+S LT +EK+ ++N + +I RLGIP+Y WWSEA HGV
Sbjct: 23 YPYLDTKLSNKERVENLLSLLTPEEKVGLMMNKSISIDRLGIPSYNWWSEACHGVRQ--- 79
Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA-----------LYNAGQA 151
G T +PQ I AA+F++ L+Y + + EARA L+N
Sbjct: 80 --------DGYTVYPQPIGMAAAFNAQLFYDVFSQVSDEARANWNRTDHNDPKLFNVPMG 131
Query: 152 IG-------MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKL 204
+ +TFW PN+NIFRDPRWGRGQET GEDP + V V G+QG+
Sbjct: 132 VTYYPGNPELTFWCPNVNIFRDPRWGRGQETCGEDPYLNAVLGVQTVLGMQGNN------ 185
Query: 205 KGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMC 264
+ AC KH+ + + R+ + T +DL +TY P F++ VK+G +MC
Sbjct: 186 DKYFKTHACAKHYAVH---SGPEPLRHSMNVEPTNRDLWETYLPAFKALVKKGNVREVMC 242
Query: 265 AYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPED---AVVD 321
AY R G P C LL R +WG+ + +DCDA++ ++ G ++ +D A VD
Sbjct: 243 AYQRFEGKPCCTSDRLLIDILRNKWGYDAIVLTDCDAINNFFN-RGQHETHKDGLSASVD 301
Query: 322 VLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGK 381
+ G D+ CG + +K+ + ES++D L R LG+F+ M P+ K
Sbjct: 302 AVLNGTDLECGKVFMSLVE-GLKKGLIKESDLDNHLRKTLMGRFELGMFDP-ADMLPWAK 359
Query: 382 IGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNY 441
+GADV+ S + +A+QAA++ +VLL+N G+LPL KS +LA++GPNA+ L GNY
Sbjct: 360 LGADVISSEKNDAMAVQAARESMVLLENKAGILPLSKSIK-TLAVLGPNADDVNMLNGNY 418
Query: 442 AGPSCRSITPLQALQN 457
G TP A Q+
Sbjct: 419 GG------TPTAAHQH 428
Score = 107 bits (268), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 86/255 (33%), Positives = 122/255 (47%), Gaps = 43/255 (16%)
Query: 486 KGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
K AD +V++ G+ E E D+ D+ LP QQ L+ + KPVI V C G I
Sbjct: 614 KEADAIVIIGGISAQMEGEGGDKQDIELPKVQQMLV-KAMHKTGKPVIFVN-CSGSA-IA 670
Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRP 605
F + ++L A Y G+ GA ALAEV+FGD+NPGG+LP+T+Y + D M+
Sbjct: 671 FGSVEGEYDALLQAWYAGQGGAKALAEVLFGDYNPGGKLPVTFYRSNNDLPDFLDYSMK- 729
Query: 606 QATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVV 665
RTYR++ G + FG GLSY+ + A+ K+ S+ +M +N V
Sbjct: 730 -------NRTYRYFTGVPQYAFGYGLSYTTF-----ALGDAKI-----SAKQMKKNGKVT 772
Query: 666 HYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSV 725
+T+ V N G+ G V ++VK G PIK L GFQ +
Sbjct: 773 -------------------LTVPVTNTGKREGTETVQVYVKRLDDA-GAPIKALKGFQKL 812
Query: 726 ILNAKE--KAEIVFE 738
L A E KA I +
Sbjct: 813 NLKAGETQKATITLD 827
>gi|423722678|ref|ZP_17696831.1| hypothetical protein HMPREF1078_00891 [Parabacteroides merdae
CL09T00C40]
gi|409241951|gb|EKN34716.1| hypothetical protein HMPREF1078_00891 [Parabacteroides merdae
CL09T00C40]
Length = 955
Score = 268 bits (684), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 221/800 (27%), Positives = 365/800 (45%), Gaps = 139/800 (17%)
Query: 49 TLPISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----W-------SEALH 95
T+PI R DL+S++ ++EK Q+V + + +P +W W E L+
Sbjct: 65 TVPIDARVEDLLSQMNVEEKTCQMVTLYGYKRVLKDDLPTSDWKKQLWKDGIGAIDEHLN 124
Query: 96 GVAGVGK-------------------------------GI---FFNGTIRG-----ATSF 116
G G GI F N IRG AT+F
Sbjct: 125 GFQQWGLPPSDNPYVWPASRHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVESYIATNF 184
Query: 117 PQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQET 175
P + +++ L +++G G E R L G T +AP +++ RD RWGR +E
Sbjct: 185 PTQLGLGHTWNRNLVHKVGYITGREGRLL-------GYTNVYAPILDVGRDQRWGRYEEV 237
Query: 176 PGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDA 235
GE P + + V +G+Q D Q +A KH+ AY + + D
Sbjct: 238 YGESPYLVAELGVEMAKGMQTD----------YQVAATSKHYIAYSNNKGGREGMARVDP 287
Query: 236 RVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYI 295
+++ +++ + P++ +K+ G+M +YN +G P + L+ R ++GF GY+
Sbjct: 288 QMSPREVEMIHVYPWKRVIKEAGILGVMSSYNDYDGFPIQSSYYWLTTRLRGEFGFRGYV 347
Query: 296 TSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT-----KAAVKQKKLPE 350
SD DAV ++ G A +++V+ + AG+++ C +F + + + + +P
Sbjct: 348 VSDSDAVEYLFSKHGTAADMKESVLQSVLAGLNIRC-TFRSPDSYVLPLRELIAEGAIPM 406
Query: 351 SEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNS 410
S ID + ++ V+ +GLF+ +P + +V C+ +Q++ALQA+++ +VLLKN
Sbjct: 407 STIDDRVRDILRVKFLVGLFD-HPYQIDLKETDKEVNCA-ENQLVALQASKESLVLLKNQ 464
Query: 411 HGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE---NTVYYPG 467
+LPL +K +A+ GPNA+ L +Y + T L+ ++N V+ + ++ G
Sbjct: 465 DAVLPLDVNKISKIAVCGPNADEEAYALTHYGPLAVEVTTVLEGIRNKVKPGTDVLFTKG 524
Query: 468 CDTV---------------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLV 512
CD V A + IDKAV+ AK +D V+++G E R L
Sbjct: 525 CDLVDANWPESELIRYPLTAEEQSEIDKAVENAKKSDVTVVVLGGSNRTCGENKSRSSLD 584
Query: 513 LPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAE 572
LPGRQ +L+ V A KPV+LVL+ G P+ I +A D+ + +IL A YPG G A+A+
Sbjct: 585 LPGRQLDLLQAVV-ATGKPVVLVLINGRPLSINWA--DKYVPAILEAWYPGSQGGTAIAD 641
Query: 573 VIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQA------TSGNPGRTYRFYEGKEVFP 626
+FGD+NPGG+L +T +P+ ++P + +P A G G R ++P
Sbjct: 642 ALFGDYNPGGKLTVT-FPKTVGQIPF-NFPTKPNAQVDGGRNKGLDGNMSRV--NGPLYP 697
Query: 627 FGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVT 686
FG GLSY+ + Y ++ P + T + + V
Sbjct: 698 FGYGLSYTTFEYSDISIQ--------------------------PAIVT---QVQPVTVR 728
Query: 687 IGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLS 746
V N G+ AG V L+V+ K LVGF + LN E E+ F + P L
Sbjct: 729 CKVTNTGKRAGDEVVQLYVRDILSSVTTYEKNLVGFDRIHLNPGETKELTFTIEP-RDLQ 787
Query: 747 RAREDGLMVIEEGTHFLVVG 766
D V+E G ++VG
Sbjct: 788 LLNSDNHWVVEPGDFKVMVG 807
>gi|414082523|ref|YP_006991223.1| glycosyl hydrolase family 3 N terminal domain-containing protein
[Carnobacterium maltaromaticum LMA28]
gi|412996099|emb|CCO09908.1| glycosyl hydrolase family 3 N terminal domain protein
[Carnobacterium maltaromaticum LMA28]
Length = 716
Score = 268 bits (684), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 203/676 (30%), Positives = 339/676 (50%), Gaps = 70/676 (10%)
Query: 104 IFFNGTIRG-ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
IF I G T FP + +S+++ L + LE+ A + +TF +P ++
Sbjct: 83 IFMADVIHGYKTIFPIPLALGSSWNTELAEETARIAALES-----AVSGLHVTF-SPMVD 136
Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQ-ASACCKHFTAYD 221
+ RDPRWGR E+ GED + + A ++VRG QG LK + +AC KHF AY
Sbjct: 137 LVRDPRWGRVMESTGEDAFLNAELAKAFVRGYQGT-----DLKNDFERVAACVKHFAAY- 190
Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
R ++ + L ++Y P +++ + +G A +M A+N V+G+P+ A++ L+
Sbjct: 191 --GAPIAGREYNTVNMSERQLRESYLPGYKAALDEG-AKLVMTAFNTVDGVPATANKWLM 247
Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT-K 340
R +W F+G + SD AV + G A ++A + L A +D+ + T +
Sbjct: 248 RDLLRNEWDFNGVLISDWGAVKELV-PHGVAADEKEAAILALNASVDIEMMTMCYMDTLE 306
Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
A+++ ++ E+ +D A+ + +++ LGLF NP + VV S H+ LA +AA
Sbjct: 307 ASIEAGEIEEALVDEAVLRILNLKNDLGLFE-NPYRGADAEKEKAVVLSEEHRSLARKAA 365
Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA--GPSCRSITPLQALQNY 458
++ IVLLKN G+LPL +++ + ALIGP A S K +LG +A G S ++I+ + L +
Sbjct: 366 EESIVLLKN-EGVLPLKQNQKI--ALIGPGAES-KDILGAWAWQGESEKAISLAEGLAQH 421
Query: 459 V--ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGR 516
V E + GCD ++ + A ID AV +AK AD +VL +G + E R D+ LP
Sbjct: 422 VPAEQLLIAKGCDILSGTQAEIDAAVAVAKEADVIVLALGENSDMSGEAASRSDIRLPKI 481
Query: 517 QQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFG 576
Q EL+ ++ ++ KP+I+ L G P+DI ++ + I+ A +PG G ALA ++ G
Sbjct: 482 QLELLAQL-KSVGKPIIVTLFNGRPLDI----HELEVAGIVEAWFPGTEGGAALANILMG 536
Query: 577 DHNPGGRLPMTWYPQDYIKVPMT---DMKMRPQATSGNPGRTYRFY---EGKEVFPFGCG 630
NP +L M+ +P + +VP+ D RP+ + + Y + ++PFG G
Sbjct: 537 AVNPSAKLSMS-FPYNVGQVPVYYNPDNTGRPELSRSVDEKYVSKYLDSPNEALYPFGFG 595
Query: 631 LSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVK 690
LSY+ +SY +S TE ++ V + VK
Sbjct: 596 LSYTSFSYSDMTLS-----------------------------ATELKDSETLTVEVTVK 626
Query: 691 NHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRARE 750
N G+ AGK V L+++ RP+K+L G+Q + L A + + F ++ E L +
Sbjct: 627 NSGKYAGKEIVQLYLRDIVGEVVRPVKELKGYQKIELAAGAEKTVSFTVTE-EQLRYVHQ 685
Query: 751 DGLMVIEEGTHFLVVG 766
D + G ++VG
Sbjct: 686 DCSFSSDSGVFEVMVG 701
>gi|365121873|ref|ZP_09338785.1| hypothetical protein HMPREF1033_02131 [Tannerella sp.
6_1_58FAA_CT1]
gi|363644185|gb|EHL83481.1| hypothetical protein HMPREF1033_02131 [Tannerella sp.
6_1_58FAA_CT1]
Length = 850
Score = 267 bits (683), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 156/424 (36%), Positives = 234/424 (55%), Gaps = 34/424 (8%)
Query: 51 PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
P +R DL+SRLT++EKIS L ++P IPRL I Y +EALHG+ G
Sbjct: 33 PQHERIMDLLSRLTIEEKISLLRATSPGIPRLEIEKYYHGNEALHGIVRPG--------- 83
Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ---------AIGMTFWAPNI 161
T FPQ I A+ ++ Y I I EARA +N + +TFW+P +
Sbjct: 84 -NFTVFPQAIGLASMWNPDFLYEISTVISDEARARWNELNRGKDQKRLFSDLLTFWSPTV 142
Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
N+ RDPRWGR ET GEDP ++GK V++V+G+QG+ L+ + KHF A
Sbjct: 143 NMARDPRWGRTPETYGEDPFLSGKLGVAFVKGLQGND------PRYLKVVSTPKHFAA-- 194
Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
N + R++ + +++ +DL + Y P FE C+ G+A IM AYN +N +P + LL
Sbjct: 195 --NNEEHNRFECNPQISERDLREYYLPAFERCIIDGKAQSIMTAYNAINDVPCTLNTWLL 252
Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQ-KHTK 340
K R WGF+GY+ SDC A S++ Y K+PE A LKAG+D+ CG + +
Sbjct: 253 KKVLRTDWGFNGYVVSDCGAPSLLVTHHKYVKTPEAAATLALKAGLDLECGDNVYIEPLM 312
Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
A KQ + E+EID A + + RM LGLF+ +P P+ + +V H+ +AL+AA
Sbjct: 313 NAYKQYMVSEAEIDTAAYRILRARMMLGLFD-DPAKNPYNALSPSIVGCEKHKNMALEAA 371
Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQALQNYV 459
+ +VLLKN + LP+ K S+A++G NA + + G+Y+G P ++ L ++N V
Sbjct: 372 RQSLVLLKNENNFLPINPKKIKSIAVVGINAGNCE--FGDYSGKPVNVPVSVLDGIRNRV 429
Query: 460 ENTV 463
+ +
Sbjct: 430 GDDI 433
Score = 107 bits (266), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 82/279 (29%), Positives = 130/279 (46%), Gaps = 45/279 (16%)
Query: 489 DHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAK 548
D + +MG++++ E+E DR + LP + QEL A + +VL+ G + + +
Sbjct: 602 DMTIAVMGINKSIEREGRDRDHIELP-KDQELFIEEAYKLNPKMAVVLVAGSSLAVNWM- 659
Query: 549 YDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQAT 608
D ++ +IL A YPGE G A+AE +FGD+NP GRLP+T+Y P D ++
Sbjct: 660 -DEHVPAILNAWYPGEQGGTAVAEALFGDYNPAGRLPLTYYRSLDDLPPFDDYAVQKN-- 716
Query: 609 SGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYK 668
RTY ++ GK ++ FG GLSY+K+ Y+ K+ +QD + +
Sbjct: 717 -----RTYMYFTGKPLYAFGYGLSYTKFDYR-----------------KLSVDQDAENVR 754
Query: 669 SVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILN 728
++ +KN G+ G ++V+ G PIKQL GF+ V +
Sbjct: 755 ----------------LSFTIKNSGKYNGDEVAQVYVQFPEIGVKVPIKQLKGFERVHI- 797
Query: 729 AKEKAEIVFELSPCESLSRARE-DGLMVIEEGTHFLVVG 766
AK K V P + L E G G + +VG
Sbjct: 798 AKGKTLPVTITVPKKELRIWNERKGEFFTPSGNYVFMVG 836
>gi|332982620|ref|YP_004464061.1| glycoside hydrolase [Mahella australiensis 50-1 BON]
gi|332700298|gb|AEE97239.1| glycoside hydrolase family 3 domain protein [Mahella australiensis
50-1 BON]
Length = 753
Score = 267 bits (683), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 222/712 (31%), Positives = 351/712 (49%), Gaps = 107/712 (15%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIPA E L G GAT FPQ I A+++D+ + I
Sbjct: 89 RLGIPAI-VHEECLSGFMA-----------DGATVFPQAIGLASTWDAEAIEAMAGVIRQ 136
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
+ +A AG G++ P +++ RDPRWGR +ET GEDP + AVSYVRG+QG
Sbjct: 137 QMKA---AGAHQGLS---PVLDVARDPRWGRVEETFGEDPYLVASMAVSYVRGLQGQDLT 190
Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
G A KHF + ++ R V ++L D + PFE+ V++ A
Sbjct: 191 KGIF-------ATLKHFAGH---SFSEGGRNCAPVHVGERELWDIFLFPFEAAVREANAK 240
Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
+M AY+ ++G+P A R LL+ R +GF G + SD DA+ + A A + ++A V
Sbjct: 241 SVMNAYHDIDGVPCAASRELLTDILRGHFGFDGIVVSDYDAIDRLRKAHFTAGNKKEAAV 300
Query: 321 DVLKAGMDVNCGSF--LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQP 378
L+AG+D+ + AVK+ + E+ I+ ++ + + + LGLF+G
Sbjct: 301 QALEAGIDIELPKMDCYGQPLMDAVKEGMISEATINESVERVLTAKFELGLFDG------ 354
Query: 379 FGKIGADVVC------SPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNAN 432
+ DV +P + ++ A+ IVLLKN + +LPL K S+A+IGPNA+
Sbjct: 355 ---VYVDVDSVPGLFETPEQREMSRDIARKSIVLLKNDN-VLPLSKDIK-SIAVIGPNAD 409
Query: 433 SAKTLLGNYAGPSCRS--------ITPLQALQNYVENT---VYYPGCDTVACSSASIDKA 481
+A+ +LG+YA + RS +T L+ ++N V ++ Y GCD + S+ +A
Sbjct: 410 NARNMLGDYAFMAHRSYDKTSVHIVTVLEGIKNKVLDSCRITYAKGCDIIDPSTDGFVEA 469
Query: 482 VDIAKGADHVVLMMG-----LDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVL 536
V+ A+ AD ++++G + E DR D+ LPG Q +L+ + + KPVI+VL
Sbjct: 470 VNAARAADAAIVVVGDNSGIFGKGTSGENDDRTDITLPGVQMQLVKAIKDTG-KPVIVVL 528
Query: 537 LCGGPVDITFAKYD--RNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI 594
+ G FA + N +++ A YPGE G A+A+V+FGD+NP GRLP++ P +
Sbjct: 529 INGR----AFAAKELADNASALMEAWYPGEEGGNAVADVLFGDYNPAGRLPISL-PCEVG 583
Query: 595 KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSS 654
++P+ + ++P + Y E K F FG G+SY+ + Y + S
Sbjct: 584 QIPI-NYNLKPASYI-----NYLSTETKPAFAFGYGMSYTTFGY------------SDLS 625
Query: 655 STKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGR 714
T V P G K ++ V N G++AG V L+++ R
Sbjct: 626 ITPAV----------APSAG-------KVDISFKVTNAGQLAGDEVVQLYIRDEVSSIVR 668
Query: 715 PIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
P+K+L GF+ V L E EI F L + L+ +D +V+E GT ++VG
Sbjct: 669 PVKELKGFKRVNLQPGETKEITFTLY-ADQLAFHDKDMRLVVEPGTFKIMVG 719
>gi|319901412|ref|YP_004161140.1| glycoside hydrolase 3 [Bacteroides helcogenes P 36-108]
gi|319416443|gb|ADV43554.1| glycoside hydrolase family 3 domain protein [Bacteroides helcogenes
P 36-108]
Length = 944
Score = 267 bits (682), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 206/668 (30%), Positives = 322/668 (48%), Gaps = 87/668 (13%)
Query: 105 FFNGTIRG-----ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWA 158
F N IRG AT+FP + +++ L ++IG G EAR L G T +A
Sbjct: 160 FTNEGIRGVESYKATNFPTQLGLGHTWNRELIHKIGFITGREARML-------GYTNVYA 212
Query: 159 PNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFT 218
P +++ RD RWGR +E GE P + + + VRG+Q +N Q +A KHF
Sbjct: 213 PILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGMQ---YNH-------QVAATGKHFA 262
Query: 219 AYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADR 278
AY + + D +++ +++ + + PF +++ G+M +YN +GIP
Sbjct: 263 AYSNNKGAREGMSRVDPQISPREVENIHIYPFRRVIREAGLLGVMSSYNDYDGIPIQGSH 322
Query: 279 NLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC-----GS 333
L+ R + GF GY+ SD DAV +Y G AK ++A+ ++AG+++ C S
Sbjct: 323 YWLTTRLRGEIGFRGYVVSDSDAVEYLYTKHGTAKDMKEAIRQSVEAGLNIRCTFRSPDS 382
Query: 334 FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD-VVCSPAH 392
F+ + VK+ L E I+ + ++ V+ GLF+ P GAD V +
Sbjct: 383 FVLP-LRELVKEGGLSEEIINDRVRDILRVKFLTGLFD-TPYQSDLA--GADREVEKEEN 438
Query: 393 QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
+ALQA+++ IVLLKN + +LPL S +A+ GPNA+ L +Y + IT L
Sbjct: 439 GSIALQASRESIVLLKNENNMLPLDLSTVKRIAVCGPNADEKNYALTHYGPLAVEVITVL 498
Query: 453 QALQNYVE---NTVYYPGCDTV---------------ACSSASIDKAVDIAKGADHVVLM 494
+ +Q+ V +Y GCD V A A I+KA + A+ +D V++
Sbjct: 499 KGIQDKVSGKAEVLYTKGCDLVDANWPESEIINHPLTADEQAEINKAAENARQSDVAVVV 558
Query: 495 MGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIG 554
+G Q E R L LPGRQ +L+ + +A KPVILVL+ G P+ + +A D+ +
Sbjct: 559 LGGGQRTCGENKSRSSLDLPGRQLQLLQAI-QATGKPVILVLINGRPLSVNWA--DKYVP 615
Query: 555 SILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMT-DMKMRPQATSG-NP 612
+IL A YPG G +ALA+V+FGD+NPGG+L +T +P+ ++P K Q G NP
Sbjct: 616 AILEAWYPGAKGGIALADVLFGDYNPGGKLTVT-FPKTVGQIPFNFPYKPASQIDGGKNP 674
Query: 613 GRTYRFYE-GKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVP 671
G ++PFG GLSY+ + Y ++ + N+ ++
Sbjct: 675 GPEGNMSRINGALYPFGYGLSYTTFEYSDLEITPKVITPNEEAT---------------- 718
Query: 672 ELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKE 731
V + V N G+ AG V L+++ K L GF+ V L E
Sbjct: 719 -------------VRLKVTNTGKRAGDEVVQLYIRDVVSSVITYEKNLAGFERVHLEPGE 765
Query: 732 KAEIVFEL 739
E+VF L
Sbjct: 766 TKEVVFTL 773
>gi|242206820|ref|XP_002469265.1| hypothetical protein POSPLDRAFT_51213 [Postia placenta Mad-698-R]
gi|220731725|gb|EED85567.1| hypothetical protein POSPLDRAFT_51213 [Postia placenta Mad-698-R]
Length = 312
Score = 267 bits (682), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 138/298 (46%), Positives = 177/298 (59%), Gaps = 12/298 (4%)
Query: 34 DPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEA 93
D +N T C T+ +RA L+S TL+EKI+ N+AP +PRLG+PAY+WW EA
Sbjct: 22 DCANGPLTTNTVCDTSATPLERATALISLFTLEEKINNTGNTAPGVPRLGLPAYQWWQEA 81
Query: 94 LHGVAGVGKGIFF--NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA 151
LHGVA G+ F +G ATSFPQ IL A+FD L + + EARA NA ++
Sbjct: 82 LHGVA-ESPGVIFAPSGEYSYATSFPQPILMGAAFDDALINHVATIVSTEARAFNNANRS 140
Query: 152 IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQAS 211
G+ FW PNIN F+DPRWGRGQETPGEDP Y + + G+QG G +
Sbjct: 141 -GIDFWTPNINPFKDPRWGRGQETPGEDPFHLQSYVYNLITGLQG-----GLDPEYKRIV 194
Query: 212 ACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
A CKHF AYDL+NW+G RY FDA V++QDL++ Y F +C + MC+YN VNG
Sbjct: 195 ATCKHFAAYDLENWEGNVRYGFDALVSLQDLSEFYTRSFRTCARDANVGSFMCSYNAVNG 254
Query: 272 IPSCADRNLLSKTARRQWGFHG---YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAG 326
+PSCA+ LL R WG+ YITSDCDA+ IY+ Y + + V D L AG
Sbjct: 255 VPSCANSYLLQDILRDHWGWTNEDQYITSDCDAIQNIYEPHYYTATRAETVADALNAG 312
>gi|317474225|ref|ZP_07933501.1| glycosyl hydrolase family 3 N terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
gi|316909535|gb|EFV31213.1| glycosyl hydrolase family 3 N terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
Length = 858
Score = 267 bits (682), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 158/424 (37%), Positives = 241/424 (56%), Gaps = 34/424 (8%)
Query: 51 PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
PI +R DL+SRLT++EKIS L ++P I RL IP Y +EALHGV G+
Sbjct: 35 PIHERIMDLLSRLTVEEKISLLRATSPGISRLDIPKYYHGNEALHGVVRPGR-------- 86
Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA---GQAIG------MTFWAPNI 161
T FPQ I AA+++ L ++ I EARA +N G+ +TFW+P +
Sbjct: 87 --FTVFPQAIGLAATWNPVLQKQVATVISDEARARWNELDQGREQNSQFSDLLTFWSPTV 144
Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
N+ RDPRWGR ET GEDP ++G ++V+G+QG+ L+ + KHF A
Sbjct: 145 NMARDPRWGRTPETYGEDPYLSGIMGTAFVKGLQGNDSR------YLKIVSTPKHFAA-- 196
Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
N + R+ + +++ + L + Y P FE+CVK+G+++ IM AYN +N +P + LL
Sbjct: 197 --NNEEHNRFVCNPQISEKQLREYYLPAFEACVKEGKSASIMSAYNALNDVPCTLNAWLL 254
Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTK 340
+K R WGF GY+ SDC +++ +A Y K+ E A +KAG+D+ CG
Sbjct: 255 TKVLREDWGFKGYVVSDCGGPALLVNAHKYVKTKEAAATLSIKAGLDLECGDDVYDAPLL 314
Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
A +Q + +++ID A + + RM+LGLF+ P+ KI V+ S HQ +AL AA
Sbjct: 315 NAYRQYMVTDADIDSAAYRVLRARMQLGLFDSGEN-NPYTKISPKVIGSKEHQKVALDAA 373
Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQALQNYV 459
++ IVLLKN + +LPL K S+A++G NA ++ G+Y+G P ++ LQ +++ V
Sbjct: 374 RECIVLLKNQNKMLPLDAKKIKSIAVVGINA--GRSEFGDYSGLPVIAPVSILQGIKDRV 431
Query: 460 ENTV 463
+TV
Sbjct: 432 GDTV 435
Score = 127 bits (318), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 97/348 (27%), Positives = 159/348 (45%), Gaps = 59/348 (16%)
Query: 435 KTLLGNYAGPSCRSITP---LQALQNYVENTVYYPGCDTVACS----------SASID-- 479
K L+ + G + R+ T L+A ++Y YY D +D
Sbjct: 534 KMLIDAWQGHAIRTDTATIYLEAGRDYRLKAEYYDNRDYAVAKLQWKVPQIGKKTRLDLY 593
Query: 480 -KAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLC 538
+A + + + VV ++G+++ E+E DR D+ LP Q+E + + + +++VL+
Sbjct: 594 GEAGRVVRECEKVVAVLGINKAIEREGQDRSDIQLPADQREFLKEIYKVNPN-IVVVLVA 652
Query: 539 GGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPM 598
G + I + D +I +I+ A YPGE+G A+AEV+FGD+NPGGRLP+T+Y P
Sbjct: 653 GSSLSINWM--DEHIPAIINAWYPGESGGKAVAEVLFGDYNPGGRLPLTYYRSLDELPPF 710
Query: 599 TDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKM 658
D + GRTY++++G ++PFG GLSY+ + Y V++
Sbjct: 711 DDYDI-------TKGRTYQYFKGNVLYPFGYGLSYTSFKYSDLQVTEG------------ 751
Query: 659 VENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQ 718
NQ E+ FC +KN G+ AG ++VK R PIK+
Sbjct: 752 --NQ---------EVNVSFC----------LKNVGKYAGDEVAQIYVKLPERDKIMPIKE 790
Query: 719 LVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
L GF+ + L ++ L E G V G + ++VG
Sbjct: 791 LKGFERISLKRGGSRKVTIRLKKDLLRYWDEEKGCFVHPSGDYTIMVG 838
>gi|160890986|ref|ZP_02071989.1| hypothetical protein BACUNI_03433 [Bacteroides uniformis ATCC 8492]
gi|156859207|gb|EDO52638.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
uniformis ATCC 8492]
Length = 869
Score = 267 bits (682), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 210/711 (29%), Positives = 326/711 (45%), Gaps = 100/711 (14%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIP + +E+LHG G+T FPQ + ++F+ L Y++ +AI
Sbjct: 117 RLGIPVFTV-TESLHG-----------SVHDGSTIFPQSVAVGSTFNLDLAYQMTKAIAT 164
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
E R+ Q + T +P +++ RD RWGR +E+ GEDP + G+ ++ V+G +
Sbjct: 165 ELRS-----QGVIQTL-SPGLDVVRDLRWGRVEESFGEDPWLVGQMGIAQVKGY----ID 214
Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
GG S K F +D+ + + P+E V+
Sbjct: 215 GG-------ISPMLKPFGPGGAPLGGLNLA---SVESGERDIRNIHIKPYEMAVRNTEVK 264
Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
+M +YN NGIP+ A LL+ R +WGF GY+ SD AV+++ D + AK +A +
Sbjct: 265 AVMTSYNSWNGIPNSASSYLLTNILRNEWGFKGYVYSDWGAVAMLKDFQHTAKDDSEAAI 324
Query: 321 DVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFG 380
L AG+D+ S + ++Q + E +D A+ + V+ LGLF NP
Sbjct: 325 QALTAGVDLEASSNCYWALEQLIEQGRFDEKYVDLAVGRILRVKFELGLFE-NPYQ---- 379
Query: 381 KIGADV--VCSPAHQVLAL--QAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKT 436
GAD+ V + + L + A + IVLLKN + LLPL +K SLA+IGPNAN +
Sbjct: 380 --GADMPGVAMRTKEAVELSRRVADESIVLLKNENTLLPLNLNKIKSLAVIGPNANQVQ- 436
Query: 437 LLGNYAGPSCRS--ITPLQALQNYVENTV---YYPGCDTVACSSASIDKAVDIAKGADHV 491
G+Y +TPL+ L+ V N + Y GCD + + + D+AV K +D
Sbjct: 437 -FGDYTWSRSNKDGVTPLEGLKKRVGNKIKINYAAGCDLITDNKSGFDEAVAAVKASDMA 495
Query: 492 VLMMGLDQTQEK---------EELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPV 542
V+ +G E D L L G Q+EL+ + A KPVI+VL+ G P
Sbjct: 496 VVFVGSSSASLARDYSDATCGEGFDLSSLDLTGVQEELVEEIY-AIGKPVIVVLVTGKPF 554
Query: 543 DITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQD-------YIK 595
I++ K +I +I+ Y GE A+A+++ G+ NP +LP + +PQ Y
Sbjct: 555 SISWIK--EHIPAIVVQWYGGEKAGDAIADMLLGNINPSAKLPFS-FPQSVGHLPVFYNH 611
Query: 596 VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSS 655
+P R PGR Y F ++ FG GLSY+ + YLN S
Sbjct: 612 LPTDKGFYRRPGRPNEPGRDYVFSSPAPLWSFGHGLSYTTFE-----------YLNAHYS 660
Query: 656 TKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRP 715
+++ D +V++ +KN G +AGK V L+V+ P
Sbjct: 661 AELLHPSDT------------------LIVSVSLKNTGSVAGKEVVQLYVRDVVSSVVTP 702
Query: 716 IKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
+KQL F L E +V +L P + L+ V+EEG + + +G
Sbjct: 703 VKQLKAFSKPFLQPGEMQTVVLKL-PIQELALYDLSMKKVVEEGEYEIQIG 752
>gi|427387354|ref|ZP_18883410.1| hypothetical protein HMPREF9447_04443 [Bacteroides oleiciplenus YIT
12058]
gi|425725515|gb|EKU88386.1| hypothetical protein HMPREF9447_04443 [Bacteroides oleiciplenus YIT
12058]
Length = 786
Score = 267 bits (682), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 225/794 (28%), Positives = 368/794 (46%), Gaps = 132/794 (16%)
Query: 51 PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WSEALHGV----AGV 100
P+ R +DL+S++T++EK Q+ + + + +P +W W + + + G+
Sbjct: 48 PLEARVKDLLSQMTMEEKTCQMATLYGSGRVLKDSLPTEQWKNEIWKDGIANIDEQANGL 107
Query: 101 GK-----------------------------GI---FFNGTIRG-----ATSFPQVILTA 123
GK GI F N IRG AT FP
Sbjct: 108 GKFGSSLSYPYVNSVENRQAIQRWFVEQTRLGIPVDFTNEGIRGLCHDRATMFPAQCGQG 167
Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGEDPLV 182
A+++ L I + EA+AL G T ++P ++I +DPRWGR E GEDP +
Sbjct: 168 ATWNKELISEIAKVTAEEAKAL-------GYTNIYSPILDIAQDPRWGRVVECYGEDPFL 220
Query: 183 TGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDL 242
G+ ++G+Q + + KHF Y + + D V +++
Sbjct: 221 VGELGKRMIKGLQAEGL-----------VSTPKHFAVYSIPVGGRDAGTRTDPHVAPREM 269
Query: 243 ADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAV 302
Y PF + A G+M +YN +G P + L++ R +WGF GY+ SD +AV
Sbjct: 270 RTLYIEPFRKAFCEAGALGVMSSYNDYDGEPITGSYHFLTEILRHEWGFKGYVVSDSEAV 329
Query: 303 SIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKH----TKAAVKQKKLPESEIDRALH 358
+Y A + D V+ AG++V L ++ + A+ + K+ E ID +
Sbjct: 330 EFLYSKHNVAANAVDGAAQVINAGLNVRTNFTLPENFIRPLRQAISEGKVSEQTIDSRVA 389
Query: 359 NLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPK 418
++ V+ +GLF+ NP + K VV S HQ ++++AA + IVLLKN + +LPL K
Sbjct: 390 DVLRVKFMMGLFD-NP-YKGDAKKPEKVVHSKEHQAVSMRAALESIVLLKNENNILPLSK 447
Query: 419 SKSVSLALIGPNANSAKTLLGNYAGPSCRSI-TPLQALQNYV--ENTVYYPGCDTV---- 471
S + +A+IGPNA L+ Y GP+ I T Q +++Y+ + Y G D +
Sbjct: 448 S-TKKVAVIGPNAAEVDNLICRY-GPANAPIKTVYQGIKDYLPDADVRYAKGADIIDKYF 505
Query: 472 -----------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQEL 520
A ID+AV +AK +D ++++G ++ +EE R +L L GRQ++L
Sbjct: 506 PESELYDVPLDKDEQAMIDEAVALAKESDVAIMVLGGNEKTVREEYSRTNLDLCGRQEKL 565
Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
+ V A KPV+L+L+ G I +A++ I I+ A +PGE A+A+V+FGD+NP
Sbjct: 566 LQAVY-ATGKPVVLLLVDGRAATINWAEH--YIPGIVHAWFPGEFMGDAVAKVLFGDYNP 622
Query: 581 GGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKF 640
GG+L +T +P+ ++P +P + S R ++PFG GLSY+ ++Y
Sbjct: 623 GGKLAVT-FPRSVGQIPFA-FPFKPGSDSKGFVRV-----TGTLYPFGYGLSYTTFAY-- 673
Query: 641 KAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHP 700
S +EN P +G + ++ VKN G++AG
Sbjct: 674 --------------SDLKIEN---------PVIGVQ----GSVKLSCKVKNTGKVAGDEV 706
Query: 701 VLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGT 760
V L++ +K L GF+ V L E+ + F L+P E L +D V+E GT
Sbjct: 707 VQLYLHDEMSSVTTYVKVLRGFERVHLEPGEEKTVNFVLTPQE-LGLWNKDNHFVVEPGT 765
Query: 761 HFLVVGDEEYPISI 774
++VG I +
Sbjct: 766 FAVMVGSSSQDIRL 779
>gi|154493932|ref|ZP_02033252.1| hypothetical protein PARMER_03276 [Parabacteroides merdae ATCC
43184]
gi|154086192|gb|EDN85237.1| glycosyl hydrolase family 3 C-terminal domain protein
[Parabacteroides merdae ATCC 43184]
Length = 955
Score = 267 bits (682), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 220/800 (27%), Positives = 365/800 (45%), Gaps = 139/800 (17%)
Query: 49 TLPISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----W-------SEALH 95
T+PI R DL+S++ ++EK Q+V + + +P +W W E L+
Sbjct: 65 TVPIDARVEDLLSQMNVEEKTCQMVTLYGYKRVLKDDLPTSDWKKQLWKDGIGAIDEHLN 124
Query: 96 GVAGVGK-------------------------------GI---FFNGTIRG-----ATSF 116
G G GI F N IRG AT+F
Sbjct: 125 GFQQWGLPPSDNPYVWPASRHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVESYIATNF 184
Query: 117 PQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQET 175
P + +++ L +++G G E R L G T +AP +++ RD RWGR +E
Sbjct: 185 PTQLGLGHTWNRNLVHKVGYITGREGRLL-------GYTNVYAPILDVGRDQRWGRYEEV 237
Query: 176 PGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDA 235
GE P + + + +G+Q D Q +A KH+ AY + + D
Sbjct: 238 YGESPYLVAELGIEMAKGMQTDH----------QVAATSKHYIAYSNNKGGREGMARVDP 287
Query: 236 RVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYI 295
+++ +++ + P++ +K+ G+M +YN +G P + L+ R ++GF GY+
Sbjct: 288 QMSPREVEMIHVYPWKRVIKEAGILGVMSSYNDYDGFPIQSSYYWLTTRLRGEFGFRGYV 347
Query: 296 TSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT-----KAAVKQKKLPE 350
SD DAV ++ G A +++V+ + AG+++ C +F + + + + +P
Sbjct: 348 VSDSDAVEYLFSKHGTAADMKESVLQSVLAGLNIRC-TFRSPDSYVLPLRELIAEGAIPM 406
Query: 351 SEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNS 410
S ID + ++ V+ +GLF+ +P + +V C+ +Q++ALQA+++ +VLLKN
Sbjct: 407 STIDDRVRDILRVKFLVGLFD-HPYQIDLKETDKEVNCA-ENQLVALQASKESLVLLKNQ 464
Query: 411 HGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE---NTVYYPG 467
+LPL +K +A+ GPNA+ L +Y + T L+ ++N V+ + ++ G
Sbjct: 465 DAVLPLDVNKISKIAVCGPNADEEAYALTHYGPLAVEVTTVLEGIRNKVKPGTDVLFTKG 524
Query: 468 CDTV---------------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLV 512
CD V A + IDKAV+ AK +D V+++G E R L
Sbjct: 525 CDLVDANWPESELIRYPLTAEEQSEIDKAVENAKKSDVTVVVLGGSNRTCGENKSRSSLD 584
Query: 513 LPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAE 572
LPGRQ +L+ V A KPV+LVL+ G P+ I +A D+ + +IL A YPG G A+A+
Sbjct: 585 LPGRQLDLLQAVV-ATGKPVVLVLINGRPLSINWA--DKYVPAILEAWYPGSQGGTAIAD 641
Query: 573 VIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQA------TSGNPGRTYRFYEGKEVFP 626
+FGD+NPGG+L +T +P+ ++P + +P A G G R ++P
Sbjct: 642 ALFGDYNPGGKLTVT-FPKTVGQIPF-NFPTKPNAQVDGGRNKGLDGNMSRV--NGPLYP 697
Query: 627 FGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVT 686
FG GLSY+ + Y ++ P + T + + V
Sbjct: 698 FGYGLSYTTFEYSDISIQ--------------------------PAIVT---QVQPVTVR 728
Query: 687 IGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLS 746
V N G+ AG V L+V+ K LVGF + LN E E+ F + P L
Sbjct: 729 CKVTNTGKRAGDEVVQLYVRDILSSVTTYEKNLVGFDRIHLNPGETKELTFTIEP-RDLQ 787
Query: 747 RAREDGLMVIEEGTHFLVVG 766
D V+E G ++VG
Sbjct: 788 LLNSDNHWVVEPGDFKVMVG 807
>gi|296081550|emb|CBI20073.3| unnamed protein product [Vitis vinifera]
Length = 309
Score = 266 bits (681), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 126/249 (50%), Positives = 174/249 (69%), Gaps = 21/249 (8%)
Query: 136 QAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQ 195
Q + EARA+YN G A G+TFW+PN+NIF+DPRWGRGQETPGEDPL++ KYA YVRG+Q
Sbjct: 77 QVVSTEARAMYNVGLA-GLTFWSPNVNIFQDPRWGRGQETPGEDPLLSSKYASGYVRGLQ 135
Query: 196 GDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVK 255
+ +G +L+ +ACCKH+TAYDLDNWKG + F+A VT QD+ DT+QPPF+SCV
Sbjct: 136 -QSDDGSP--DRLKVAACCKHYTAYDLDNWKGVDCFHFNAVVTNQDMDDTFQPPFKSCVI 192
Query: 256 QGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFH-GYITSDCDAVSIIYDAEGYAKS 314
G + +MC+YN+ I + +H Y+ SDCD+V + Y+++ Y K+
Sbjct: 193 DGNVASVMCSYNQFYLI----------------YKYHLQYLVSDCDSVDVFYNSQHYTKT 236
Query: 315 PEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNP 374
PE+AV + AG+D+NCGSFL +HT+AAVK + ES +D+A+ N F+ MRLG F+GNP
Sbjct: 237 PEEAVAKAILAGLDLNCGSFLGQHTEAAVKGGLVDESAVDKAVSNNFATLMRLGFFDGNP 296
Query: 375 TMQPFGKIG 383
+ +GK+G
Sbjct: 297 SKAIYGKLG 305
>gi|218130696|ref|ZP_03459500.1| hypothetical protein BACEGG_02285 [Bacteroides eggerthii DSM 20697]
gi|217987040|gb|EEC53371.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
eggerthii DSM 20697]
Length = 858
Score = 266 bits (681), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 158/424 (37%), Positives = 241/424 (56%), Gaps = 34/424 (8%)
Query: 51 PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
PI +R DL+SRLT++EKIS L ++P I RL IP Y +EALHGV G+
Sbjct: 35 PIHERIMDLLSRLTVEEKISLLRATSPGISRLDIPKYYHGNEALHGVVRPGR-------- 86
Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA---GQAIG------MTFWAPNI 161
T FPQ I AA+++ L ++ I EARA +N G+ +TFW+P +
Sbjct: 87 --FTVFPQAIGLAATWNPVLQKQVATVISDEARARWNELDQGREQNSQFSDLLTFWSPTV 144
Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
N+ RDPRWGR ET GEDP ++G ++V+G+QG+ L+ + KHF A
Sbjct: 145 NMARDPRWGRTPETYGEDPYLSGIMGTAFVKGLQGNDSR------YLKIVSTPKHFAA-- 196
Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
N + R+ + +++ + L + Y P FE+CVK+G+++ IM AYN +N +P + LL
Sbjct: 197 --NNEEHNRFVCNPQISEKQLREYYLPAFEACVKEGKSASIMSAYNALNDVPCTLNAWLL 254
Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTK 340
+K R WGF GY+ SDC +++ +A Y K+ E A +KAG+D+ CG
Sbjct: 255 TKVLREDWGFKGYVVSDCGGPALLVNAHKYVKTKEAAATLSIKAGLDLECGDDVYDAPLL 314
Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
A +Q + +++ID A + + RM+LGLF+ P+ KI V+ S HQ +AL AA
Sbjct: 315 NAYRQYMVTDADIDSAAYRVLRARMQLGLFDSGEN-NPYTKISPKVIGSKEHQKVALDAA 373
Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQALQNYV 459
++ IVLLKN + +LPL K S+A++G NA ++ G+Y+G P ++ LQ +++ V
Sbjct: 374 RECIVLLKNQNKMLPLDAKKIKSIAVVGINA--GRSEFGDYSGLPVIAPVSILQGIKDRV 431
Query: 460 ENTV 463
+TV
Sbjct: 432 GDTV 435
Score = 125 bits (315), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 92/321 (28%), Positives = 150/321 (46%), Gaps = 59/321 (18%)
Query: 435 KTLLGNYAGPSCRSITP---LQALQNYVENTVYYPGCDTVACS----------SASID-- 479
K L+ + G + R+ T L+A ++Y YY D +D
Sbjct: 534 KMLIDAWQGHAIRTDTATIYLEAGRDYRLKAEYYDNRDYAVAKLQWKVPQIGKKTRLDLY 593
Query: 480 -KAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLC 538
+A + + + VV ++G+++ E+E DR D+ LP Q+E + + + +++VL+
Sbjct: 594 GEAGRVVRECEKVVAVLGINKAIEREGQDRSDIQLPADQREFLKEIYKVNPN-IVVVLVA 652
Query: 539 GGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPM 598
G + I + D +I +I+ A YPGE+G A+AEV+FGD+NPGGRLP+T+Y P
Sbjct: 653 GSSLSINWM--DEHIPAIINAWYPGESGGKAVAEVLFGDYNPGGRLPLTYYRSLDELPPF 710
Query: 599 TDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKM 658
D + GRTY++++G ++PFG GLSY+ + Y V+
Sbjct: 711 DDYDI-------TKGRTYQYFKGNVLYPFGYGLSYTSFKYSDLQVTDG------------ 751
Query: 659 VENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQ 718
NQ E+ FC +KN G+ AG ++VK R PIK+
Sbjct: 752 --NQ---------EVNVSFC----------LKNVGKYAGDEVAQIYVKLPERDKIMPIKE 790
Query: 719 LVGFQSVILNAKEKAEIVFEL 739
L GF+ + L E ++ L
Sbjct: 791 LKGFERISLKRGESRKVTIRL 811
>gi|449527525|ref|XP_004170761.1| PREDICTED: beta-D-xylosidase 1-like [Cucumis sativus]
Length = 241
Score = 266 bits (681), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 126/205 (61%), Positives = 153/205 (74%), Gaps = 6/205 (2%)
Query: 33 CDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSE 92
CD N T FC+ +L I +R +DL+ RLTL EKI LVN+A A+PRLGI YEWWSE
Sbjct: 34 CDKRNVGTRNMGFCQESLGIEERVKDLIGRLTLGEKIRLLVNNAIAVPRLGIRGYEWWSE 93
Query: 93 ALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI 152
ALHGV+ VG G F GT GATSFPQVI TAASF+ LW IG+ + EARA+YN G A
Sbjct: 94 ALHGVSNVGPGTKFGGTFPGATSFPQVITTAASFNQSLWLLIGRVVSDEARAMYNGGTA- 152
Query: 153 GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASA 212
G+T+W+PN+NIFRDPRWGRGQETPGEDP++ KYA +YV+G+QG N GK +L+ +A
Sbjct: 153 GLTYWSPNVNIFRDPRWGRGQETPGEDPILAAKYAANYVQGLQG---NDGKK--RLKVAA 207
Query: 213 CCKHFTAYDLDNWKGTTRYKFDARV 237
CCKH+TAYDLDNW G RY F+A+V
Sbjct: 208 CCKHYTAYDLDNWNGVDRYHFNAKV 232
>gi|392531731|ref|ZP_10278868.1| glycoside hydrolase [Carnobacterium maltaromaticum ATCC 35586]
Length = 716
Score = 266 bits (681), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 203/676 (30%), Positives = 341/676 (50%), Gaps = 70/676 (10%)
Query: 104 IFFNGTIRG-ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
IF I G T FP + +S+++ L + LE+ A + +TF +P ++
Sbjct: 83 IFMADVIHGYKTIFPIPLALGSSWNTELAEETARIAALES-----AVSGLHVTF-SPMVD 136
Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQ-ASACCKHFTAYD 221
+ RDPRWGR E+ GED + + A ++VRG QG LK + +AC KHF AY
Sbjct: 137 LVRDPRWGRVMESTGEDAFLNAELAKAFVRGYQGT-----DLKNDFERVAACVKHFAAY- 190
Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
R ++ + L ++Y P +++ + +G A +M A+N V+G+P+ A++ L+
Sbjct: 191 --GAPIAGREYNTVNMSERQLRESYLPGYKAALDEG-AKLVMTAFNTVDGVPATANKWLM 247
Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT-K 340
R +W F+G + SD AV + G A ++A + L A +D+ + T +
Sbjct: 248 RDLLRNEWDFNGVLISDWGAVKELV-PHGVAADEKEAAILALNASVDIEMMTMCYMDTLE 306
Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
A+++ ++ E+ +D A+ + +++ LGLF NP + VV S H+ LA +AA
Sbjct: 307 ASIEAGEIEEALVDEAVLRILNLKNDLGLFE-NPYRGADAEKEKAVVLSEEHRSLARKAA 365
Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA--GPSCRSITPLQALQNY 458
++ IVLLKN G+LPL +++ + ALIGP A S K +LG +A G S ++I+ + L +
Sbjct: 366 EESIVLLKN-EGVLPLKQNQKI--ALIGPGAES-KDILGAWAWQGESEKAISLAEGLAQH 421
Query: 459 V--ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGR 516
V E + GCD ++ + A ID AV +AK AD +VL +G + E R D+ LP
Sbjct: 422 VPAEQLLIAKGCDILSGTQAEIDAAVAVAKEADVIVLALGENSDMSGEAASRSDIRLPKI 481
Query: 517 QQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFG 576
Q EL+ ++ ++ KP+I+ L G P+DI ++ + I+ A +PG G ALA ++ G
Sbjct: 482 QLELLAQL-KSVGKPIIVTLFNGRPLDI----HELEVAGIVEAWFPGTEGGAALANILMG 536
Query: 577 DHNPGGRLPMTWYPQDYIKVPMT---DMKMRPQ-ATSGNPGRTYRFYE--GKEVFPFGCG 630
NP +L M+ +P + +VP+ D RP+ S + ++ + + ++PFG G
Sbjct: 537 AVNPSAKLSMS-FPYNVGQVPVYYNPDNTGRPELGRSVDEKYVSKYLDSPNEALYPFGFG 595
Query: 631 LSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVK 690
LSY+ +SY +S TE ++ V + VK
Sbjct: 596 LSYTSFSYSDMTLS-----------------------------ATELKDSETLTVEVTVK 626
Query: 691 NHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRARE 750
N G+ AGK V L+++ RP+K+L G+Q + L A + + F ++ E L +
Sbjct: 627 NSGKYAGKEIVQLYLRDIVGEVVRPVKELKGYQKIELAAGAEKTVSFTVTE-EQLRYVHQ 685
Query: 751 DGLMVIEEGTHFLVVG 766
D + G ++VG
Sbjct: 686 DCSFSSDSGVFEVMVG 701
>gi|424661946|ref|ZP_18098983.1| hypothetical protein HMPREF1205_02332 [Bacteroides fragilis HMW
616]
gi|404578257|gb|EKA82992.1| hypothetical protein HMPREF1205_02332 [Bacteroides fragilis HMW
616]
Length = 814
Score = 266 bits (681), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 224/717 (31%), Positives = 339/717 (47%), Gaps = 104/717 (14%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIP + E HG +G T FP I A++++ L ++G+ I +
Sbjct: 149 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRQMGRVIAI 196
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
EA A G IG + P +++ RDPRW R +ET GEDP + G + VRG QGDT
Sbjct: 197 EASA---QGAHIG---YGPVLDLARDPRWSRVEETYGEDPYLNGVMGAALVRGFQGDT-- 248
Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
L+G+ A KHF +Y W A + ++L + PPF V G A
Sbjct: 249 ---LRGRKSVIATLKHFASY---GWTEGGHNGGTAHLGERELEEAIFPPFREAVGAG-AL 301
Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
+M +YN ++G P R LL+ + +W F G++ SD A+ + + G A S +A V
Sbjct: 302 SVMSSYNEIDGNPCTGSRYLLTDILKDRWLFKGFVVSDLYAIGGLRE-HGVAGSDYEAAV 360
Query: 321 DVLKAGMDVNCGS-FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
+ AG+D + G+ + AAV++ + +D+A+ + S++ +GLF+ PF
Sbjct: 361 KAVNAGVDSDLGTNVYAEQLVAAVRKGDVAMETVDKAVRRILSLKFHMGLFDA-----PF 415
Query: 380 --GKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTL 437
K A +V SP H LA + A+ IVLLKN LLPL K +LA+IGPNA++ +
Sbjct: 416 VDDKRPAQLVASPEHIGLAREVARQSIVLLKNEDKLLPLKKDIR-TLAVIGPNADNGYNM 474
Query: 438 LGNYAGPSCRS--ITPLQALQNYVEN---TVYYPGCDTVACSSASIDKAVDIAKGADHVV 492
LG+Y P +T L+ ++ V +Y GC S A++ A+ AD VV
Sbjct: 475 LGDYTAPQADGSVVTVLEGIRQKVSKDTRVLYAKGCAVRDSSRTGFADAIEAARSADVVV 534
Query: 493 LMMG----LDQTQEKEE-------------------LDRVDLVLPGRQQELITRVAEAAK 529
+++G D + E EE DR L L GRQ EL+ V + K
Sbjct: 535 MVVGGSSARDFSSEYEETGAAKVSANRVSDMESGEGYDRATLHLMGRQLELLEEVRKLGK 594
Query: 530 KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWY 589
P++LVL+ G P + + +IL A YPG G A+A+V+FGD+NP GRL ++
Sbjct: 595 -PMVLVLIKGRP--LLMEGVIQEADAILDAWYPGMQGGNAVADVLFGDYNPAGRLTLS-V 650
Query: 590 PQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLY 649
P+ ++P+ R GN R Y G +PFG GLSY+ +SY + K+
Sbjct: 651 PRSVGQLPVYYNTKR----KGNRSR-YIEEAGTPRYPFGYGLSYTMFSY-----TGMKVR 700
Query: 650 LNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPAR 709
+++ S+ V+ V++ V+N G + G V L+++
Sbjct: 701 VSEESNHCRVD------------------------VSVTVRNQGTVDGDEVVQLYLRDEV 736
Query: 710 RGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
P +QL F V L A E EI F L +SL+ DG +E G ++ G
Sbjct: 737 GSFTTPDRQLRAFSRVRLKAGETREITFTLDK-KSLALYMRDGEWAVEPGRFTVMAG 792
>gi|380694609|ref|ZP_09859468.1| periplasmic beta-glucosidase , xylosidase/arabinosidase
[Bacteroides faecis MAJ27]
Length = 804
Score = 266 bits (680), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 206/713 (28%), Positives = 337/713 (47%), Gaps = 101/713 (14%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIP + EA HG +G T FP I +A++ L +G+AI
Sbjct: 142 RLGIPVF-LAEEAPHGHMAIG-----------TTVFPTGIGMSATWSPTLIEEVGKAIAK 189
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
E R+ G I + P +++ RDPRW R +ET GEDP+++G+ + V G+
Sbjct: 190 EIRS---QGAHIS---YGPVLDLSRDPRWSRVEETFGEDPVLSGRLGAAMVTGL-----G 238
Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
G L + A KHF AY + Y A V +DL + + PPF ++ G A
Sbjct: 239 SGDLSREHATIATLKHFLAYAVPEGGQNGNY---ASVGARDLHENFLPPFREAIEAG-AL 294
Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
+M +YN ++GIP A+ LL++ R +W F G++ SD ++ I+++ A + E+A V
Sbjct: 295 SVMTSYNSIDGIPCTANHYLLTQLLRNEWKFRGFVVSDLYSIEGIHESHFVASTMEEAAV 354
Query: 321 DVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFG 380
L AG+D++ G + AV+ KL E++I+ A+ + ++ +GLF +P + P
Sbjct: 355 QALSAGVDIDLGGDAFMNLLQAVRSGKLDETQINAAVDRILRMKFEMGLFE-HPYVNP-- 411
Query: 381 KIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGN 440
K +V + H LA + AQ +VLL+N + +LPL K K +A++GPNA++ +LG+
Sbjct: 412 KTTTKMVRNKEHVKLARKVAQSSVVLLENKNSILPLSK-KIKRVAVVGPNADNRYNMLGD 470
Query: 441 YAGP----SCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG 496
Y P R++ + Y GC + I +AV+ A ++ ++ ++G
Sbjct: 471 YTAPQEDKDIRTVLDGVISKLSPSRVEYVRGCAIRDTTVNEIAEAVEAAHRSEVIIAVVG 530
Query: 497 -----------------------LDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVI 533
+ + E DR L L G+QQ+L+ + + KP+I
Sbjct: 531 GSSARDFKTSYQETGAAIADEKSISDMECGEGFDRATLTLLGKQQDLLNAL-KTTGKPLI 589
Query: 534 LVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDY 593
+V + G P+D +A ++L A YPG+AG A+A+V+FGD+NP GRLP++ P+
Sbjct: 590 VVYIEGRPLDKVWAS--ECADALLTASYPGQAGGDAIADVLFGDYNPAGRLPVS-VPRSV 646
Query: 594 IKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQS 653
++P+ K P+ Y ++ FG GLSY+ + Y + L + Q
Sbjct: 647 GQIPVYYNKKAPR------NHDYVEMAASPLYGFGYGLSYTTFEY-------SDLQITQK 693
Query: 654 SSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNG 713
S C F V+ VKN G G+ L++K
Sbjct: 694 SP----------------------CH---FEVSFKVKNTGNYDGEEVAQLYLKDEYASVV 728
Query: 714 RPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
+P+KQL F+ L E+ EI+F L+ + LS V+E G +++G
Sbjct: 729 QPLKQLKHFERFFLRKGEEKEILFTLTE-KDLSIIDRSMKRVVETGDFRIMIG 780
>gi|270294308|ref|ZP_06200510.1| periplasmic beta-glucosidase [Bacteroides sp. D20]
gi|270275775|gb|EFA21635.1| periplasmic beta-glucosidase [Bacteroides sp. D20]
Length = 862
Score = 266 bits (680), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 210/711 (29%), Positives = 326/711 (45%), Gaps = 100/711 (14%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIP + +E+LHG G+T FPQ + ++F+ L Y++ +AI
Sbjct: 110 RLGIPVFTV-TESLHG-----------SVHDGSTIFPQSVAVGSTFNLDLAYQMTKAIAT 157
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
E R+ Q + T +P +++ RD RWGR +E+ GEDP + G+ ++ V+G +
Sbjct: 158 ELRS-----QGVIQTL-SPGLDVVRDLRWGRVEESFGEDPWLVGQMGIAQVKGY----ID 207
Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
GG S K F +D+ + + P+E V+
Sbjct: 208 GG-------ISPMLKPFGPGGAPLGGLNLA---SVESGERDIRNIHIKPYEMAVRNTEVK 257
Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
+M +YN NGIP+ A LL+ R +WGF GY+ SD AV+++ D + AK +A +
Sbjct: 258 AVMTSYNSWNGIPNSASSYLLTNILRNEWGFKGYVYSDWGAVAMLKDFQHTAKDDSEAAI 317
Query: 321 DVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFG 380
L AG+D+ S + ++Q + E +D A+ + V+ LGLF NP
Sbjct: 318 QALTAGVDLEASSNCYWALEQLIEQGRFDEKYVDLAVGRILRVKFELGLFE-NPYQ---- 372
Query: 381 KIGADV--VCSPAHQVLAL--QAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKT 436
GAD+ V + + L + A + IVLLKN + LLPL +K SLA+IGPNAN +
Sbjct: 373 --GADMPGVAMRTKEAVELSRRVADESIVLLKNENTLLPLNLNKIKSLAVIGPNANQVQ- 429
Query: 437 LLGNYAGPSCRS--ITPLQALQNYVENTV---YYPGCDTVACSSASIDKAVDIAKGADHV 491
G+Y +TPL+ L+ V N + Y GCD + + + D+AV K +D
Sbjct: 430 -FGDYTWSRSNKDGVTPLEGLKKRVGNKIKINYAAGCDLITDNKSGFDEAVAAVKASDMA 488
Query: 492 VLMMGLDQTQEK---------EELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPV 542
V+ +G E D L L G Q+EL+ + A KPVI+VL+ G P
Sbjct: 489 VVFVGSSSASLARDYSDATCGEGFDLSSLDLTGVQEELVEEIY-AIGKPVIVVLVTGKPF 547
Query: 543 DITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQD-------YIK 595
I++ K +I +I+ Y GE A+A+++ G+ NP +LP + +PQ Y
Sbjct: 548 SISWIK--EHIPAIVVQWYGGEKAGDAIADMLLGNINPSAKLPFS-FPQSVGHLPVFYNH 604
Query: 596 VPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSS 655
+P R PGR Y F ++ FG GLSY+ + YLN S
Sbjct: 605 LPTDKGFYRRPGRPNEPGRDYVFSSPAPLWSFGHGLSYTTFE-----------YLNAHYS 653
Query: 656 TKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRP 715
+++ D +V++ +KN G +AGK V L+V+ P
Sbjct: 654 AELLHPSDT------------------LIVSVSLKNTGSVAGKEVVQLYVRDVVSSVVTP 695
Query: 716 IKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
+KQL F L E +V +L P + L+ V+EEG + + +G
Sbjct: 696 VKQLKAFSKPFLQPGEMQTVVLKL-PIQELALYDLSMKKVVEEGEYEIQIG 745
>gi|423342899|ref|ZP_17320613.1| hypothetical protein HMPREF1077_02043 [Parabacteroides johnsonii
CL02T12C29]
gi|409217154|gb|EKN10133.1| hypothetical protein HMPREF1077_02043 [Parabacteroides johnsonii
CL02T12C29]
Length = 955
Score = 266 bits (680), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 224/803 (27%), Positives = 361/803 (44%), Gaps = 145/803 (18%)
Query: 49 TLPISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----W-------SEALH 95
T PI R DL+S++ ++EK Q+V + + +P +W W E L+
Sbjct: 65 TAPIDARVEDLLSQMNVEEKTCQMVTLYGYKRVLKDDLPTPDWKNQLWKDGMGAIDEHLN 124
Query: 96 GVAGVGK-------------------------------GI---FFNGTIRG-----ATSF 116
G G GI F N IRG AT+F
Sbjct: 125 GFQQWGLPPSDNPYVWPASRHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVESYIATNF 184
Query: 117 PQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQET 175
P + +++ L +++G G E R L G T +AP +++ RD RWGR +E
Sbjct: 185 PTQLGLGHTWNRDLVHKVGYITGREGRLL-------GYTNVYAPILDVGRDQRWGRYEEV 237
Query: 176 PGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDA 235
GE P + + V +G+Q D Q +A KH+ AY + + D
Sbjct: 238 YGESPYLVAELGVEMAKGMQTD----------YQVAATSKHYIAYSNNKGGREGMARVDP 287
Query: 236 RVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYI 295
+++ +++ + P++ +K+ G+M +YN +G P + L+ R ++GF GY+
Sbjct: 288 QMSPREVEMLHVYPWKRVIKEAGILGVMSSYNDYDGFPIQSSYYWLTTRLRGEFGFRGYV 347
Query: 296 TSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT-----KAAVKQKKLPE 350
SD DAV ++ G A +++V+ + AG+++ C +F + + + + LP
Sbjct: 348 VSDSDAVEYLFSKHGTAADMKESVLQSVLAGLNIRC-TFRSPDSYVLPLRELIAEGALPM 406
Query: 351 SEIDRALHNLFSVRMRLGLFNGNPTMQPFG---KIGADVVCSPAHQVLALQAAQDGIVLL 407
S ID + ++ V+ +GLF+ QP+ K V S +Q +ALQA+++ +VLL
Sbjct: 407 STIDDRVRDILRVKFLVGLFD-----QPYQIDLKQADKEVNSAENQQVALQASKESLVLL 461
Query: 408 KNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE---NTVY 464
KN +LPL +K +A+ GPNA+ L +Y + T L+ +QN V+ ++
Sbjct: 462 KNQDAVLPLDVNKISKIAVCGPNADEEAYALTHYGPLAVEVTTVLEGIQNKVKPGTEVLF 521
Query: 465 YPGCDTVACS---------------SASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRV 509
GCD V + + IDKAV+ AK +D V+++G E R
Sbjct: 522 TKGCDLVDANWPESELIRYPLTSEEQSEIDKAVENAKKSDVAVVVLGGSNRTCGENKSRS 581
Query: 510 DLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVA 569
L LPGRQ +L+ V A KPV+LVL+ G P+ I +A D+ + +IL A YPG G A
Sbjct: 582 SLELPGRQLDLLQAVV-ATGKPVVLVLINGRPISINWA--DKYVPAILEAWYPGSQGGTA 638
Query: 570 LAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQA------TSGNPGRTYRFYEGKE 623
+A+ +FGD+NPGG+L +T +P+ ++P + +P A G G R
Sbjct: 639 IADALFGDYNPGGKLTVT-FPKTVGQIPF-NFPTKPNAQVDGGRNKGLDGNMSRV--NGP 694
Query: 624 VFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKF 683
++PFG GLSY+ + Y ++ P + T + +
Sbjct: 695 LYPFGYGLSYTTFEYSDISIQ--------------------------PAIVT---QVQPV 725
Query: 684 LVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCE 743
V V N G+ AG V L+V+ K LVGF + LN E E+ F + P
Sbjct: 726 TVRCKVTNTGKRAGDEVVQLYVRDILSSVTTYEKNLVGFDRIHLNPGETKELTFTIEP-R 784
Query: 744 SLSRAREDGLMVIEEGTHFLVVG 766
L D V+E G ++VG
Sbjct: 785 DLQLLNSDNHWVVEPGDFKVMVG 807
>gi|317503000|ref|ZP_07961085.1| beta-glucosidase, partial [Prevotella salivae DSM 15606]
gi|315665888|gb|EFV05470.1| beta-glucosidase [Prevotella salivae DSM 15606]
Length = 770
Score = 266 bits (679), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 219/788 (27%), Positives = 363/788 (46%), Gaps = 139/788 (17%)
Query: 52 ISQRARDLVSRLTLDEKISQL----------------------VNSAPAI-PRLGIPAYE 88
++QR DL+ R+TL+EK+ Q+ N+A A P + + E
Sbjct: 33 VAQRVDDLLRRMTLEEKVGQMNQLVGIEHFKTNSITMSAEELATNTATAFYPGVTVSEIE 92
Query: 89 WW------SEALHG----------------------VAGVGKGIFFNGTIRGATSFPQVI 120
+W S LH + G+ I N + T +P I
Sbjct: 93 YWVRRGWVSSFLHVLTLEEANYLQKLSMQSRLQIPLIIGI-DAIHGNAKCKNNTVYPTNI 151
Query: 121 LTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDP 180
A+SFD L Y+I + E RA+ + F PN+ + RD RWGR ET GEDP
Sbjct: 152 GLASSFDVDLAYKIARQTAEEMRAM-----NMHWNF-NPNVEVARDGRWGRCGETFGEDP 205
Query: 181 LVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFT--AYDLDNWKGTTRYKFDARVT 238
+ + V+ +G Q + N + G C KHF +Y ++ G V+
Sbjct: 206 YLVMQMGVATNKGYQRNLDNTSDVLG------CVKHFVGGSYSINGTNGAP-----CDVS 254
Query: 239 MQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSD 298
+ L + + PPF++ ++QG +M ++N +NGIP +R L++ R++WGF G+I SD
Sbjct: 255 ERTLREVFFPPFKATLQQGGDWNVMMSHNELNGIPCHTNRWLMTDVLRKEWGFQGFIVSD 314
Query: 299 CDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC-GSFLQKHTKAAVKQKKLPESEIDRAL 357
+ D AK ++A + AGMD++ G QK V++ ++PES ID ++
Sbjct: 315 WMDIEHCVDQHHTAKDNKEAFYQSIMAGMDMHMHGPEWQKDVVELVREGRIPESRIDESV 374
Query: 358 HNLFSVRMRLGLFNGNPTMQPFGKIGA--DVVCSPAHQVLALQAAQDGIVLLKNSHGLLP 415
+ +V+ RLGLF P+ + V+ P H+ AL A+++ IVLLKN LLP
Sbjct: 375 RRILTVKFRLGLFE-----HPYSDVKTRDRVINDPVHKQTALDASRESIVLLKNEKQLLP 429
Query: 416 LPKSKSVSLALIGPNANSAKTLLGNYA--GPSCRSITPLQALQNYVENTVYY---PGCDT 470
L + K + + G NAN + ++G+++ P + T L+ L+ +T + G D
Sbjct: 430 LDEQKYKKVLVTGINAND-QNIMGDWSELQPEDKVWTVLKGLKLVSPHTDFRFVDQGWDP 488
Query: 471 VACSSASIDKAVDIAKGADHVVLMMG-------LDQTQEKEELDRVDLVLPGRQQELITR 523
S + +D AV+ AK +D ++ G ++ E+ DR +L L G Q++LI R
Sbjct: 489 RNMSQSQVDAAVEAAKESDLNIVCCGEYMMRFRWNERTSGEDTDRDNLELVGLQEQLIRR 548
Query: 524 VAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGR 583
+ E KP IL+++ G P+ + +A ++ +I+ A PG+ G A+AE+++G NP +
Sbjct: 549 LNETG-KPTILIIISGRPLSVRYAA--DHVPAIVNAWEPGQYGGQAIAEILYGKINPSAK 605
Query: 584 LPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFY-----EGKEVFPFGCGLSYSKYSY 638
L MT P+ ++ ++ N R+ F+ + ++PFG GLSY+K+ Y
Sbjct: 606 LAMT-IPRHVGQI----------SSWYNHKRSAYFHPAVCADNTPLYPFGYGLSYTKFKY 654
Query: 639 KFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGK 698
N S ++EN K+ I ++N G G
Sbjct: 655 S-----------NLVLSDTVIENDGKSAIKA----------------QITIENIGNREGT 687
Query: 699 HPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEE 758
L++ RP+K+L F+ V L A EK I F ++P + L+ D + IE
Sbjct: 688 EVCQLYINDIVSSVARPVKELKDFRRVTLKAGEKQTIEFIITP-DKLAFYDVDMKLKIEP 746
Query: 759 GTHFLVVG 766
G +++G
Sbjct: 747 GEFKVMIG 754
>gi|410096880|ref|ZP_11291865.1| hypothetical protein HMPREF1076_01043 [Parabacteroides goldsteinii
CL02T12C30]
gi|409225497|gb|EKN18416.1| hypothetical protein HMPREF1076_01043 [Parabacteroides goldsteinii
CL02T12C30]
Length = 799
Score = 266 bits (679), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 186/596 (31%), Positives = 297/596 (49%), Gaps = 64/596 (10%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIP ++ +E +HG+ N T AT P I ++++ L ++ G G
Sbjct: 139 RLGIPV-DFSNEGIHGL---------NHT--KATPLPAPINIGSTWNRDLVHQAGDIAGK 186
Query: 141 EARAL-YNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTF 199
EA+AL YN +AP +++ RDPRWGR ET GEDP + G+ + V+G+Q
Sbjct: 187 EAKALGYN-------NVYAPILDVARDPRWGRVLETYGEDPYLVGELGIQMVKGIQQ--- 236
Query: 200 NGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRA 259
NG ++ KHF Y + + D V ++L + + PF+ V++
Sbjct: 237 NG--------VASTLKHFAVYSIPKGGRDAAVRTDPHVAPRELHEIHLYPFKRVVQKAHP 288
Query: 260 SGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAV 319
G+M +YN +G+P A L++ R+++GF GYI SD +AV + A S E+AV
Sbjct: 289 KGVMSSYNDWDGVPVTASYYFLTQLLRQEYGFKGYIVSDSEAVEFVQTKHHVADSYEEAV 348
Query: 320 VDVLKAGMDVNCGSFLQKH----TKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPT 375
V++AG++V K + VK+ KL +DR + ++ V+ LGLF+
Sbjct: 349 RQVVEAGLNVRTNFTHPKDYILPVRKLVKEGKLSMKSVDRMVADVLRVKFELGLFDSPYV 408
Query: 376 MQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAK 435
P K +V + H+ L + +VLLKN + LLPL K+++ + + GP A
Sbjct: 409 KDP--KAADKIVGADKHRDFVLDMQKQSLVLLKNENNLLPLDKNQTKKVLIAGPLAKETN 466
Query: 436 TLLGNYAGPSCRSITPLQALQNYVEN---TVYYPGCDTVACS---------------SAS 477
++ Y +IT +++Y+ N VY GC+ +
Sbjct: 467 YMISRYGPQGLDNITVYDGIKDYLGNQTEVVYAKGCEVKDANWPDSEIVPTPLTDEEKKG 526
Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLL 537
I +A A D ++ ++G D++ E R L LPGRQQ+L+ + A KPV+LVL+
Sbjct: 527 IAEAATAAADCDVIIAVLGEDESCTGESKSRTGLDLPGRQQQLLEAL-HATGKPVVLVLI 585
Query: 538 CGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVP 597
G P+ I +A DRNI SIL A +PG+ G A+A+ +FGD+NPGGRL +T +P+ ++
Sbjct: 586 NGQPLTINWA--DRNIPSILEAWFPGQLGGEAIAQTLFGDYNPGGRLSVT-FPRSIGQIE 642
Query: 598 MTDMKMRPQATSGN----PGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLY 649
+ +P + G P + R ++PFG GLSY+ ++Y +V Q Y
Sbjct: 643 F-NFPFKPGSQDGQYFEGPNGSGRTRVNGALYPFGYGLSYTTFAYSNLSVKQETPY 697
>gi|386821036|ref|ZP_10108252.1| beta-glucosidase-like glycosyl hydrolase [Joostella marina DSM
19592]
gi|386426142|gb|EIJ39972.1| beta-glucosidase-like glycosyl hydrolase [Joostella marina DSM
19592]
Length = 725
Score = 266 bits (679), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 220/749 (29%), Positives = 352/749 (46%), Gaps = 93/749 (12%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
+PF + +R +L+S +T+DEK++ L ++ P +PRLG+ E LHG+A G
Sbjct: 30 YPFQNPKIATEKRVDNLLSLMTIDEKVNAL-STNPEVPRLGVKG-TGHVEGLHGLALGGP 87
Query: 103 GIFFNGTIRG--ATSFPQVILTAASFDSYLWYRIGQAIGLEAR-ALYNAGQAIGMTFWAP 159
+ T+FPQ ++D+ L I + G EAR AL G+ G+ AP
Sbjct: 88 AGWGGKGKEPLPTTTFPQAYGLGETWDTELLKEIAKIEGYEARYALQKYGRG-GLVIRAP 146
Query: 160 NINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTA 219
N ++ RDPRWGR +E+ GED GK V++V+G+QG K Q ++ KHF A
Sbjct: 147 NADLARDPRWGRTEESYGEDAFFNGKMTVAFVKGLQGSD------KTYWQTASLMKHFLA 200
Query: 220 YDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRN 279
++ + T FD R+ + Y PF+ V +G + M AYN+VNGIP+
Sbjct: 201 NSNEDGRTYTSSDFDERL----WREYYALPFKMGVVEGGSRAYMAAYNKVNGIPAMV-HP 255
Query: 280 LLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT 339
+L +WG +G I +D A ++ Y K +KAG++ FL T
Sbjct: 256 MLKDITVDEWGQNGIICTDGGAYKLLLSDHKYYKDKYLGAAATIKAGIN----QFLDDFT 311
Query: 340 K---AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS-----PA 391
+ A+ L E+++D L + V ++LG+ + + P+ KIGA+ A
Sbjct: 312 EGVYGALANGYLTEADLDEVLRGNYRVMIKLGMLDSSAN-NPYAKIGAEADSMDPWELEA 370
Query: 392 HQVLALQAAQDGIVLLKN--SHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
H+ LAL+A + IVLLKN + LLPL K K +A+IG A++ LL Y+G +I
Sbjct: 371 HKKLALEATEKSIVLLKNDPAKRLLPLQKKKVKKIAIIGEYADA--VLLDWYSGTPPYTI 428
Query: 450 TPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQT--------- 500
+PLQ ++N V V + + + + KAV+IAK AD ++ +G T
Sbjct: 429 SPLQGIKNKVGENV-----EVLFAKNNADGKAVEIAKNADVAIVFIGNHPTCNAGWAQCP 483
Query: 501 ---QEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSIL 557
KE +DR L + E + ++ A ++ L+ P I + + NI +I
Sbjct: 484 VPSNGKEAVDRQAL---NSEYEDLVKLVYKANPNTVVGLISSFPYTINWTQ--ENIPAIF 538
Query: 558 WAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYR 617
+ A+A V+FG +NP GRL TW P+ D +R GRTY
Sbjct: 539 HVTQNSQELGTAIANVLFGAYNPAGRLTQTWVKDISDLPPLMDYNIR-------NGRTYM 591
Query: 618 FYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEF 677
+++GK ++ FG GLSY+ + YK + + ++ EN++V
Sbjct: 592 YFKGKPLYAFGHGLSYTTFKYKDMEIPK-----------QIKENEEVS------------ 628
Query: 678 CETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVF 737
V + + N GE+ G V L+VK RPIK+L F+ + + A E +
Sbjct: 629 -------VKVNITNAGEVDGDEVVQLYVKHINSTVERPIKELKSFKRIHIKAGETKTVSL 681
Query: 738 ELSPCESLSRAREDGLMVIEEGTHFLVVG 766
L+P + +E ++E+ T + +G
Sbjct: 682 LLNPKDMKYWNKEKQDFILEKDTIEVQIG 710
>gi|448410571|ref|ZP_21575276.1| beta-glucosidase [Halosimplex carlsbadense 2-9-1]
gi|445671607|gb|ELZ24194.1| beta-glucosidase [Halosimplex carlsbadense 2-9-1]
Length = 760
Score = 266 bits (679), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 214/721 (29%), Positives = 339/721 (47%), Gaps = 115/721 (15%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLG+PA E L G G T+FPQ I A+++D L + IG
Sbjct: 84 RLGVPAIPH-EECLSGYMG-----------PEGTTFPQGIGMASTWDPDLMAAVTDTIGD 131
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
+ A+ G A ++ P +++ RD RWGR +ET GEDP + + A +YV G+QGD+
Sbjct: 132 QLEAI---GTAHALS---PVLDVARDLRWGRVEETYGEDPYLVAEMATAYVDGLQGDSPA 185
Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
G SA KHF + + G R D V+ + L + + PFE+ +++G A
Sbjct: 186 DG-------ISATLKHFVGHAV-GAGGKNRSSVD--VSRRTLREVHMFPFEAAIQEGNAE 235
Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
+M AY+ ++G+P D LL+ R +WGF G + SD +V + + G A + ++A V
Sbjct: 236 SVMNAYHDIDGVPCAKDEWLLTDVLRGEWGFDGTVVSDYFSVDFLKEEHGVAATQQEAAV 295
Query: 321 DVLKAGMDV-----NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPT 375
++AG+DV +C +L + AV+ L E +D ++ + + GLF
Sbjct: 296 SAVEAGVDVELPNTDCYEYLAE----AVRDGDLAEESLDESVRRVLRAKFEKGLFEEY-- 349
Query: 376 MQPFGKIGADVVCSP----AHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNA 431
+ D P A LA +AA+D +V+LKN LLPL + SV A++GP A
Sbjct: 350 -----TVDVDAATDPYEDEAAVGLAREAARDSLVVLKNESDLLPLDDADSV--AVVGPKA 402
Query: 432 NSAKTLLGNYAGPS--------CRSITPLQALQNYVENTVYYP-GCDTVACSSASIDKAV 482
+ K +LG+YA + + TPL A++N V V Y GC S+ I +AV
Sbjct: 403 DDKKGMLGDYAYAAHYPEEEYEFEADTPLSAIENRVGADVNYAQGCTATGNSTDKIGRAV 462
Query: 483 DIAKGADHVVLMMGLDQTQEKEELDRV-----------------DLVLPGRQQELITRVA 525
+ A+ AD + +G + + D V DL LPG Q EL+ +V
Sbjct: 463 EAAENADVALAFVGARSAVDFSDADGVKAEQPMVPTSGEGCDVTDLGLPGVQNELVAQV- 521
Query: 526 EAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLP 585
E PV++VL+ G P I + D +++ A PGE A+ +V+F H+ GG LP
Sbjct: 522 EETDTPVVIVLVSGKPHAIP--EIDAGADAVVQAWLPGEEAGNAIVDVVFEGHDSGGHLP 579
Query: 586 MTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQ 645
++ P+ ++P+ +P S Y + + + V+PFG GLSY+++ Y +S
Sbjct: 580 VSM-PKSVGQLPVH-YSRKPNTYS----EDYVYDDAQPVYPFGHGLSYAEFEYSDLDLSD 633
Query: 646 NKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFV 705
+ + GT F ++ V+N E G V L+V
Sbjct: 634 VDVDPS----------------------GT-------FSASVTVENTAERDGSDVVQLYV 664
Query: 706 KPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVV 765
RP+++LVGF+ V L+A E EI F+L+ + L+ + + +E G + L V
Sbjct: 665 SAENPDLARPVQELVGFRRVELDAGESTEITFDLAASQ-LAYHDRNANLAVEAGDYELRV 723
Query: 766 G 766
G
Sbjct: 724 G 724
>gi|298480647|ref|ZP_06998843.1| beta-glucosidase [Bacteroides sp. D22]
gi|298273081|gb|EFI14646.1| beta-glucosidase [Bacteroides sp. D22]
Length = 862
Score = 265 bits (678), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 175/460 (38%), Positives = 245/460 (53%), Gaps = 37/460 (8%)
Query: 44 PFCKT-TLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
P KT +L +RA L+ LTL+EK+S +++ + A+ RLGI Y WW+EALHGVA G
Sbjct: 20 PTYKTISLSPEKRAELLLKELTLEEKVSLMMDGSKAVERLGIKPYNWWNEALHGVARAGL 79
Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA--LYNAGQAI-----GMT 155
AT FPQ I AASF Y + A+ EARA Y A Q G+T
Sbjct: 80 ----------ATVFPQPIGMAASFSPQTVYGVFDAVSDEARAKNTYYASQGSYERYQGLT 129
Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCK 215
W P +N++RDPRWGRG ET GEDP +T + V V+G+QG N GK + AC K
Sbjct: 130 MWTPTVNVYRDPRWGRGIETYGEDPYLTSRMGVEVVKGLQGG--NDGKYD---KLHACAK 184
Query: 216 HFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPS 274
HF + W R++F+ + +DL +TY PPFE+ VK+ + +MCAYNR G P
Sbjct: 185 HFAVHSGPEW---NRHEFNVENIKPRDLYETYLPPFEALVKEAKVKEVMCAYNRFEGDPC 241
Query: 275 CADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDVNCG 332
C LL + R +WGF G + SDC A++ Y+ G+ P E A + +G D+ CG
Sbjct: 242 CGSDRLLMQILRNEWGFDGIVLSDCGAIADFYNDRGHKTHPDAESASAAAVLSGTDLECG 301
Query: 333 SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAH 392
S K AVK K+ E ++D A+ L + R LG + P + KI VV S H
Sbjct: 302 SSY-KALIEAVKLGKIDEKDVDVAVKRLLAARFALGEMD-EPDKVSWTKIPFSVVASEKH 359
Query: 393 QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
LAL A+ + LL N + +LPL K +++A++GPNAN + GNY G ++T L
Sbjct: 360 DSLALDIARKSMTLLMNKNDILPL-KRGGLTIAVMGPNANDSVMQWGNYNGMPSHTVTVL 418
Query: 453 QALQNYV---ENTVYYPGCDTVACSSASIDKAVDIAKGAD 489
++ + + +Y GC V + I A ++ K AD
Sbjct: 419 DGVRKALDANDKLIYEQGCGWV--DRSLIQSAFNLCKSAD 456
Score = 118 bits (295), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 91/277 (32%), Positives = 134/277 (48%), Gaps = 53/277 (19%)
Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
I K+V+ KGAD V+ + G+ + E EE+ DR D+ LP Q+ELI + +
Sbjct: 589 IRKSVEKVKGADVVIFVGGISPSLEGEEMGVNLPGFKRGDRTDIELPAVQRELIAALHHS 648
Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMT 587
KK VILV G P I + +IL A YPG+AG A+AEV+FG +NP GRLP+T
Sbjct: 649 GKK-VILVNCSGSP--IGLEPETKKCEAILQAWYPGQAGGTAVAEVLFGKYNPAGRLPVT 705
Query: 588 WYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQN 646
+Y ++ ++P D M GRTYRF + +FPFG GLSY+ + Y + +N
Sbjct: 706 FY-RNVSQLPDFEDYDMS--------GRTYRFMTQEPLFPFGYGLSYTTFEYGEMTLDKN 756
Query: 647 KLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVK 706
K +N K+ T+ + N G+ G+ V ++++
Sbjct: 757 K--INAGEVLKL---------------------------TVPITNTGKRDGEEVVQVYLR 787
Query: 707 PARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCE 743
+G P K L F+ V + A E + FEL E
Sbjct: 788 KHGDTDG-PQKTLRAFERVNIPAGETVNVDFELKDRE 823
>gi|374374543|ref|ZP_09632202.1| Beta-glucosidase [Niabella soli DSM 19437]
gi|373233985|gb|EHP53779.1| Beta-glucosidase [Niabella soli DSM 19437]
Length = 799
Score = 265 bits (678), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 204/721 (28%), Positives = 337/721 (46%), Gaps = 94/721 (13%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIP ++ +E +HG+ AT+FP I ++++ L +++GQ IG
Sbjct: 138 RLGIPV-DFTNEGIHGL-----------NQDHATAFPAPIGIGSTWNKELVHQMGQIIGR 185
Query: 141 EARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTF 199
EA+AL G T +AP +++ RD RWGR ET GEDP + + G+Q
Sbjct: 186 EAKAL-------GYTNVYAPILDVARDQRWGRVVETYGEDPFLVAGLGTALAGGIQE--- 235
Query: 200 NGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRA 259
NG ++ KHF Y + + D V +++ + PF ++
Sbjct: 236 NG--------VASTLKHFAVYSVPKGGRDGNARTDPHVAPREMQQLFLYPFRKVIQNVHP 287
Query: 260 SGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAV 319
G+M +YN +G+P A L++ R+Q+GF GY+ SD AV +Y+ AK ++AV
Sbjct: 288 LGVMSSYNDWDGMPVTASNYFLTQLLRQQFGFDGYVVSDSRAVEFVYEKHHVAKDYKEAV 347
Query: 320 VDVLKAGMDV----NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPT 375
V++AG++V N S + +K+ L +++ + + SV+ RLGLF+
Sbjct: 348 KMVMEAGLNVRTEFNAPSNFILPLRQLIKEGGLSMETLNQRVGEVLSVKFRLGLFDAPYV 407
Query: 376 MQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAK 435
P K +V + A + +ALQ ++ +VLLKN +LPL + ++ + GP A+ +
Sbjct: 408 KDP--KAADKIVATEASEAVALQMNRESLVLLKNDKNILPLSLGQYRNILVTGPLADEKE 465
Query: 436 TLLGNYAGPSCRSITPLQALQNYVENTV---YYPGCDTVACS---------------SAS 477
+ Y + + I+ L+ ++++ Y GC+ + A
Sbjct: 466 HAISRYGPSNKKVISVLEGIRHFAAKKATINYIKGCEAADATWPESEIIDTPPTPQEIAE 525
Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLL 537
++KAV+ AK D ++ +MG + Q E L R L LPGRQ L+ + + KP++L+L+
Sbjct: 526 MNKAVEAAKQNDIIIAVMGENDKQVGESLSRTGLNLPGRQLRLLEELKKTG-KPMVLILI 584
Query: 538 CGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVP 597
G P+ I + +R + +IL +PG AG A+AE IFG +NPGG+L T +P+ ++
Sbjct: 585 NGQPLTINWE--NRYLDAILETWFPGPAGGTAVAEAIFGAYNPGGKLTTT-FPKTTGQIE 641
Query: 598 MTDMKMRPQATSGNPGRTYRFYEGKEV----FPFGCGLSYSKYSYKFKAVSQNKLYLNQS 653
M + +P + +G PG Y V +PFG GLSY+ + Y V K
Sbjct: 642 M-NFPFKPASHAGQPGDGPNGYGKTAVVGPLYPFGYGLSYTTFEYANLKVDPEKARTQAD 700
Query: 654 SSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNG 713
S V + VKN G++ G V L+VK
Sbjct: 701 IS-----------------------------VAVDVKNTGKVKGDEVVQLYVKQLVSSVT 731
Query: 714 RPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPIS 773
L GF+ V L+ E + F+L+P + LS ++ V+E G ++VG I
Sbjct: 732 TYESILRGFERVSLSPGETKTVHFKLTP-DDLSILDKNMNFVVEPGAFDIMVGSSSVDIR 790
Query: 774 I 774
+
Sbjct: 791 L 791
>gi|317480996|ref|ZP_07940076.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides sp. 4_1_36]
gi|316902889|gb|EFV24763.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides sp. 4_1_36]
Length = 864
Score = 265 bits (678), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 175/458 (38%), Positives = 241/458 (52%), Gaps = 36/458 (7%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
+ +LP +RA L+ LTL+EK+S +++S+ + RLGI Y WW+EALHGVA G
Sbjct: 22 YKNASLPAEERAGLLLKELTLEEKVSLMMDSSKPVERLGIKPYNWWNEALHGVARAGL-- 79
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA--LYNAGQAI-----GMTFW 157
AT FPQ I AASF Y + A+ EARA Y A Q G+T W
Sbjct: 80 --------ATVFPQPIGMAASFSPETVYEVFTAVSDEARAKNAYYASQGSRERYQGLTMW 131
Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
P +NI+RDPRWGRG ET GEDP +T + V V+G+QG N GK + AC KHF
Sbjct: 132 TPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGT--NDGKYD---KLHACAKHF 186
Query: 218 TAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
+ W R++F+A + +DL +TY PPFE+ VK+G+ +MCAYNR G P C
Sbjct: 187 AVHSGPEW---NRHEFNAENIKPRDLYETYLPPFEALVKEGKVKEVMCAYNRFEGDPCCG 243
Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDVNCGSF 334
LL + R +WGF G + SDC A++ Y G+ P E A + +G D+ CGS
Sbjct: 244 SDRLLMQILRDEWGFDGIVLSDCGAIADFYRDYGHKTHPDAESASAAAVLSGTDLECGSS 303
Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
+ + AVKQ K+ E +D A+ L + R LG + P + I VV S H
Sbjct: 304 YEALVE-AVKQGKIDEKAVDVAVKRLLTARFALGEMD-EPEKVSWTGIPFSVVASAGHDS 361
Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
LAL A+ + LL N LPL K +++A++GPNAN + GNY G ++T L
Sbjct: 362 LALDMARKSMTLLMNKDNTLPL-KRGGLTVAVMGPNANDSVMQWGNYNGMPPHTVTILDG 420
Query: 455 LQNYV---ENTVYYPGCDTVACSSASIDKAVDIAKGAD 489
++ + + +Y GC V A I + K AD
Sbjct: 421 IRKALGTDDRLIYEQGCGWV--ERAQIQSVFNRCKTAD 456
Score = 119 bits (299), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 93/304 (30%), Positives = 143/304 (47%), Gaps = 53/304 (17%)
Query: 474 SSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITR 523
+ + K+V+ K AD VV + G+ E EE+ DR D+ LP Q+ELI
Sbjct: 586 NEVDVRKSVERVKDADVVVFVGGVSPNLEGEEMGVELPGFRGGDRTDIELPAVQRELIAA 645
Query: 524 VAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGR 583
+ A KK V+LV G P I G+IL A YPG+AG A+AEV+FGD+NP GR
Sbjct: 646 LHHAGKK-VVLVNCSGSP--IGLEPETGRCGAILQAWYPGQAGGTAVAEVLFGDYNPAGR 702
Query: 584 LPMTWYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKA 642
LP+T+Y ++ ++P D M GRTYR+ + +FPFG GLSY+ + Y
Sbjct: 703 LPVTFY-RNVSQLPDFEDYNM--------TGRTYRYMTQEPLFPFGHGLSYTSFCYGAVV 753
Query: 643 VSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVL 702
+ + + + K + + V N G+ G+ V
Sbjct: 754 LGSDNIKSGE-----------------------------KLRLNVPVTNTGKCDGEEVVQ 784
Query: 703 LFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHF 762
+++K G P K L F+ V + A + ++ F+L E + + M + EG++
Sbjct: 785 VYLKKNDDVEG-PSKALRAFKRVHIPAGKTVDVEFDLGDKELVWWNPQSNTMCVSEGSYE 843
Query: 763 LVVG 766
L+VG
Sbjct: 844 LMVG 847
>gi|270294390|ref|ZP_06200592.1| conserved hypothetical protein [Bacteroides sp. D20]
gi|270275857|gb|EFA21717.1| conserved hypothetical protein [Bacteroides sp. D20]
Length = 864
Score = 265 bits (678), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 175/458 (38%), Positives = 241/458 (52%), Gaps = 36/458 (7%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
+ +LP +RA L+ LTL+EK+S +++S+ + RLGI Y WW+EALHGVA G
Sbjct: 22 YKNASLPAEERAGLLLKELTLEEKVSLMMDSSKPVERLGIKPYNWWNEALHGVARAGL-- 79
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA--LYNAGQAI-----GMTFW 157
AT FPQ I AASF Y + A+ EARA Y A Q G+T W
Sbjct: 80 --------ATVFPQPIGMAASFSPETVYEVFTAVSDEARAKNAYYASQGSRERYQGLTMW 131
Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
P +NI+RDPRWGRG ET GEDP +T + V V+G+QG N GK + AC KHF
Sbjct: 132 TPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGT--NDGKYD---KLHACAKHF 186
Query: 218 TAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
+ W R++F+A + +DL +TY PPFE+ VK+G+ +MCAYNR G P C
Sbjct: 187 AVHSGPEW---NRHEFNAENIKPRDLYETYLPPFEALVKEGKVKEVMCAYNRFEGDPCCG 243
Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDVNCGSF 334
LL + R +WGF G + SDC A++ Y G+ P E A + +G D+ CGS
Sbjct: 244 SDRLLMQILRDEWGFDGIVLSDCGAIADFYRDYGHKTHPDAESASAVAVLSGTDLECGSS 303
Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
+ + AVKQ K+ E +D A+ L + R LG + P + I VV S H
Sbjct: 304 YEALVE-AVKQGKIDEKAVDVAVKRLLTARFALGEMD-EPEKVSWTGIPFSVVASAGHDS 361
Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
LAL A+ + LL N LPL K +++A++GPNAN + GNY G ++T L
Sbjct: 362 LALDMARKSMTLLMNKDNTLPL-KRGGLTVAVMGPNANDSVMQWGNYNGMPPHTVTILDG 420
Query: 455 LQNYV---ENTVYYPGCDTVACSSASIDKAVDIAKGAD 489
++ + + +Y GC V A I + K AD
Sbjct: 421 IRKALGTDDRLIYEQGCGWV--ERAQIQSVFNRCKTAD 456
Score = 122 bits (306), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 94/304 (30%), Positives = 145/304 (47%), Gaps = 53/304 (17%)
Query: 474 SSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITR 523
+ + K+V+ K AD VV + G+ E EE+ DR D+ LP Q+ELI
Sbjct: 586 NEVDVRKSVERVKDADIVVFVGGVSPNLEGEEMGVELPGFRGGDRTDIELPAVQRELIAA 645
Query: 524 VAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGR 583
+ A KK V+LV G P I G+IL A YPG+AG A+AEV+FGD+NP GR
Sbjct: 646 LHRAGKK-VVLVNCSGSP--IGLEPETGRCGAILQAWYPGQAGGTAVAEVLFGDYNPAGR 702
Query: 584 LPMTWYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKA 642
LP+T+Y ++ ++P D M GRTYR+ + +FPFG GLSY+ +SY
Sbjct: 703 LPVTFY-RNVSQLPDFEDYNM--------TGRTYRYMTQEPLFPFGHGLSYTSFSYGAVV 753
Query: 643 VSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVL 702
+ + + + K +++ V N G+ G+ V
Sbjct: 754 LGSDNIKSGE-----------------------------KLRLSVPVTNTGKCDGEEVVQ 784
Query: 703 LFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHF 762
+++K G P K L F+ V + A + ++ F+L E + + M + EG++
Sbjct: 785 VYLKKNDDVEG-PSKALRAFKRVHIPAGKTVDVEFDLGDKELVWWNPQSNTMCVSEGSYE 843
Query: 763 LVVG 766
L+VG
Sbjct: 844 LMVG 847
>gi|429745624|ref|ZP_19279029.1| glycosyl hydrolase family 3 protein [Capnocytophaga sp. oral taxon
380 str. F0488]
gi|429168470|gb|EKY10301.1| glycosyl hydrolase family 3 protein [Capnocytophaga sp. oral taxon
380 str. F0488]
Length = 770
Score = 265 bits (678), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 233/818 (28%), Positives = 374/818 (45%), Gaps = 131/818 (16%)
Query: 7 SLVFPLLCLCFTSLLTRVDS---TQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRL 63
+L+ L+ C T + + D+ TQ FS D I QR ++ +
Sbjct: 10 TLISLLMIGCATPAILKSDNEKFTQKSFSGDKH---------------IEQRVDSVLRLM 54
Query: 64 TLDEKISQLVN-------SAPAIPRLGIPAYEWWSEALHG-------VAGVGK------- 102
TL+EKI Q+ + P + P E + L G VAG+ K
Sbjct: 55 TLEEKIGQMTQFSADWSVTGPVMADKYQPYLE---KGLVGSIFNATSVAGIRKLQKIAVE 111
Query: 103 ----GI---FFNGTIRG-ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGM 154
GI F I G T FP + + S+D L + + EA A I
Sbjct: 112 QTRLGIPILFGQDVIHGYKTIFPIPLAESCSWDLALMRKTAELAAREASA-----DGINW 166
Query: 155 TFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACC 214
TF AP ++I RD RWGR E GEDP + A + V+G QG N L AC
Sbjct: 167 TF-APMVDITRDARWGRAMEGAGEDPYLGSLIAEARVKGFQGGD-NWQMLSSPHTLLACG 224
Query: 215 KHFTAYDLDNWKGTTRYKFD---ARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
KHF Y G D A ++M L + Y PP+E+ + G S IM + N +NG
Sbjct: 225 KHFAGY------GAAESGKDYNTAELSMHTLRNVYLPPYEATLNAGVGS-IMASLNEING 277
Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAG--MDV 329
+P+ AD+ LL++ R++WGF+G + SD ++ + G AK + A AG MD+
Sbjct: 278 VPATADKWLLTEVLRKEWGFNGLLVSDYTGINELV-RHGVAKDDKQAANLSANAGIEMDM 336
Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
N +F+ K+ A VK+ K+ E++ID+A+ ++ ++ LGLF+ +P + +
Sbjct: 337 NGATFI-KYLSALVKEGKVTEAQIDKAVRHILEMKFLLGLFD-DPYRYLDETRAKENTFT 394
Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA--GPSCR 447
+ +A QA +VLLKN +LP+ K ++A+IGP N+ + G++ G +
Sbjct: 395 EEYLKVARQAVASSVVLLKNEAEVLPIKKDSGKTIAVIGPMMNNTSDINGSWTCLGDGKQ 454
Query: 448 SITPLQALQNYVENT----VYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEK 503
S++ L L + T +Y GC S+ + +AV IA+ AD V++ +G +
Sbjct: 455 SVSLLTGLTEKYKGTNVKLLYAEGCGFTTISTEQLKEAVAIARKADRVLVAVGEQSSWAG 514
Query: 504 EELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPG 563
E R D+ LP Q++L+ + +A KP+ +V G P+D+++ + N+ +IL A +PG
Sbjct: 515 ESAVRTDIRLPQAQRQLLEAL-KAINKPIAIVTFSGRPLDLSWE--NENVQAILQAWFPG 571
Query: 564 EAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPM---------------TDMKMRPQAT 608
G +A+VI GD NP G L M+ +P+ ++P+ ++ RP
Sbjct: 572 TQGGNGIADVIAGDVNPSGHLTMS-FPRSVGQIPIYYNYKSTGRPVYTNNEEVDHRPHYN 630
Query: 609 SGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYK 668
+G Y ++PFG GLSY+ ++ + ++LN+ S + ++
Sbjct: 631 AG-----YLDSSITPLYPFGYGLSYTTFAI-------SNVHLNKKSIKRYNDS------- 671
Query: 669 SVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILN 728
+V V+N G G+ V L+ + RP+K+L GFQ + L
Sbjct: 672 --------------IIVNASVQNTGTTEGEIVVQLYTRQLVASVSRPVKELKGFQKISLK 717
Query: 729 AKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
A E ++ FEL P E+L+ +G E L VG
Sbjct: 718 AGESKQVRFEL-PSEALAFYSINGKKDTEPSECLLWVG 754
>gi|237712573|ref|ZP_04543054.1| glycoside hydrolase family 3 protein [Bacteroides sp. 9_1_42FAA]
gi|345512524|ref|ZP_08792050.1| glycoside hydrolase family beta-glycosidase [Bacteroides dorei
5_1_36/D4]
gi|423239901|ref|ZP_17221016.1| hypothetical protein HMPREF1065_01639 [Bacteroides dorei
CL03T12C01]
gi|229435409|gb|EEO45486.1| glycoside hydrolase family beta-glycosidase [Bacteroides dorei
5_1_36/D4]
gi|229453894|gb|EEO59615.1| glycoside hydrolase family 3 protein [Bacteroides sp. 9_1_42FAA]
gi|392644890|gb|EIY38624.1| hypothetical protein HMPREF1065_01639 [Bacteroides dorei
CL03T12C01]
Length = 788
Score = 265 bits (678), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 214/761 (28%), Positives = 344/761 (45%), Gaps = 131/761 (17%)
Query: 51 PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WSEALHGVAGVGKGI 104
P+ +R +DL+S++TL+EK Q+ + + + +P W W + + + G+
Sbjct: 49 PLEERVQDLLSQMTLEEKSCQMATLYGSGRVLKDALPQDNWKTEVWKDGIGNIDEEHNGL 108
Query: 105 ------------------------------------FFNGTIRG-----ATSFPQVILTA 123
F N IRG AT FP
Sbjct: 109 GTFKSEYSFPYTKHVDAKHAIQRWFVEETRLGIPVDFTNEGIRGLCHDRATYFPAQCGQG 168
Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGEDPLV 182
A+++ L RIG+ EA AL G T ++P ++I +DPRWGR ET GEDP +
Sbjct: 169 ATWNKELIARIGEVEAKEAVAL-------GYTNIYSPILDIAQDPRWGRCVETYGEDPYL 221
Query: 183 TGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDL 242
G+ + +Q K A KHF Y + + + D V +++
Sbjct: 222 VGELGKQMITSLQ-----------KHNLVATPKHFAVYSIPVGGRDGKTRTDPHVAPREM 270
Query: 243 ADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAV 302
Y PF ++ A G+M +YN +G P + L++ R++WGF GY+ SD +AV
Sbjct: 271 RTLYIEPFRMAFQEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKGYVVSDSEAV 330
Query: 303 SIIYDAEGYAKSPEDAVVDVLKAGMDVNC-----GSFLQKHTKAAVKQKKLPESEIDRAL 357
I A + ED + + AG+++ F+ K AV K+ + +D+ +
Sbjct: 331 EFISSKHKVANTYEDGIAQAVNAGLNIRTHFTPPADFILPLRK-AVADGKISQETLDKRV 389
Query: 358 HNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLP 417
+ V+ LGLF+ NP + GK +V S HQ ++L+AA+ +VLLKN LLPL
Sbjct: 390 AEILRVKFWLGLFD-NP-YRGNGKQAEQIVHSKEHQAVSLEAARQSLVLLKNEMNLLPLS 447
Query: 418 KSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENT--VYYPGCDTV---- 471
KS S+A+IGPNA+ L+ Y + T Q ++ + +T +Y GCD +
Sbjct: 448 KSLR-SIAVIGPNADERTQLICRYGPANAPIKTVYQGIKERLPHTEVIYRKGCDIIDPHF 506
Query: 472 -----------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQEL 520
+ +++A+ AK A+ VV+++G ++ +E+ R L LPGRQ+EL
Sbjct: 507 PESEVLDFPKTTEEARLMEEAIHAAKQAEVVVMVLGGNELTVREDRSRTSLNLPGRQEEL 566
Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
+ V A KPV+LVLL G I +A ++ +IL A +PGE A+AE +FGD+NP
Sbjct: 567 LKAVC-ATGKPVVLVLLDGRASSINYAA--AHVPAILHAWFPGEFCGQAVAEALFGDYNP 623
Query: 581 GGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKF 640
GGRL +T +P+ ++P +P + + Y ++PFG GLSY+ +SY
Sbjct: 624 GGRLAVT-FPKSVGQIPFA-FPFKPGSDESSSTSVYGV-----LYPFGHGLSYTTFSYGD 676
Query: 641 KAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHP 700
+S L Q + ++ +KN G++ G
Sbjct: 677 LKISP----LRQGVQGDIN-------------------------ISCKIKNTGKIKGDEV 707
Query: 701 VLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSP 741
V L+++ K L GF+ + L A E+ + F L P
Sbjct: 708 VQLYLRDEVSSVTTYTKVLRGFERISLEAGEEQMVHFRLRP 748
>gi|423279982|ref|ZP_17258895.1| hypothetical protein HMPREF1203_03112 [Bacteroides fragilis HMW
610]
gi|404584318|gb|EKA88983.1| hypothetical protein HMPREF1203_03112 [Bacteroides fragilis HMW
610]
Length = 812
Score = 265 bits (678), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 238/815 (29%), Positives = 369/815 (45%), Gaps = 148/815 (18%)
Query: 37 NPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSA--PAIPRLG----------- 83
N + + + + P+ +R L+S++TL+EK+ Q++ S P R+G
Sbjct: 39 NKNGQMDTYENPSAPVEERVEHLLSQMTLEEKVGQMLTSLGWPMYERVGEEIRLTARLEK 98
Query: 84 -IPAYE------------WWSEALHGVAGVGKGIFFNGTIR------------------- 111
I Y W LH G+ + + R
Sbjct: 99 EISEYHIGALWGFMRADPWTQRTLH--TGLNPSLAARASNRLQAFVMEHSRLGIPLFLAE 156
Query: 112 ---------GATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
G T FP I A++++ L ++G+ I +EA A G IG + P ++
Sbjct: 157 ECPHGHMAIGTTVFPTSIGQASTWNPELIRQMGRVIAIEASA---QGAHIG---YGPVLD 210
Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL 222
+ RDPRW R +ET GEDP + G + VRG QGDT L+G+ A KHF +Y
Sbjct: 211 LARDPRWSRVEETYGEDPYLNGVMGAALVRGFQGDT-----LRGRKSVIATLKHFASY-- 263
Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
W A + ++L + PPF V G A +M +YN ++G P R LL+
Sbjct: 264 -GWTEGGHNGGTAHLGERELEEAIFPPFREAVGAG-ALSVMSSYNEIDGNPCTGSRYLLT 321
Query: 283 KTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGS-FLQKHTKA 341
+W F G++ SD A+ + + G A S +A V + AG+D + G+ + A
Sbjct: 322 DILEDRWLFKGFVVSDLYAIGGLRE-HGVAGSDYEAAVKAVNAGVDSDLGTNVYAEQLVA 380
Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF--GKIGADVVCSPAHQVLALQA 399
AV++ + +D+A+ + S++ +GLF+ PF K A +V SP H LA +
Sbjct: 381 AVRKGDVAMETVDKAVRRILSLKFHMGLFDA-----PFVDDKRPAQLVASPEHIGLAREV 435
Query: 400 AQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS--ITPLQALQN 457
A+ IVLLKN LLPL K +LA+IGPNA++ +LG+Y P +T L+ ++
Sbjct: 436 ARQSIVLLKNEDKLLPLKKDIR-TLAVIGPNADNGYNMLGDYTAPQADGSVVTVLEGIRQ 494
Query: 458 YV--ENTVYYP-GCDTVACSSASIDKAVDIAKGADHVVLMMG----LDQTQEKEE----- 505
V + V+Y GC S A++ A+ AD VV+++G D + E EE
Sbjct: 495 KVSKDTRVFYAKGCAVRDSSRTGFADAIESARSADVVVMVVGGSSARDFSSEYEETGAAK 554
Query: 506 --------------LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDR 551
DR L L GRQ EL+ V + K P++LVL+ G P + +
Sbjct: 555 VSANRVSDMESGEGYDRATLHLMGRQLELLEEVRKLGK-PMVLVLIKGRP--LLMEGVIQ 611
Query: 552 NIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGN 611
+IL A YPG G A+A+V+FGD+NP GRL ++ P+ ++P+ R GN
Sbjct: 612 EADAILDAWYPGMQGGNAVADVLFGDYNPAGRLTLS-VPRSVGQLPVYYNTKR----KGN 666
Query: 612 PGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVP 671
R Y G +PFG GLSY+ +SY + K+ +++ S+ V+
Sbjct: 667 RSR-YIEEAGTPRYPFGYGLSYTTFSY-----TGMKVRVSEESNHCRVD----------- 709
Query: 672 ELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKE 731
V++ V+N G + G V L+++ P +QL F+ V L A E
Sbjct: 710 -------------VSVTVRNQGTVDGDEVVQLYLRDEVGSFTTPDRQLRAFRRVRLKAGE 756
Query: 732 KAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
EI F L +SL+ DG +E G ++ G
Sbjct: 757 TWEITFTLDK-KSLALYMRDGEWAVEPGRFTVMAG 790
>gi|313145353|ref|ZP_07807546.1| periplasmic beta-glucosidase [Bacteroides fragilis 3_1_12]
gi|313134120|gb|EFR51480.1| periplasmic beta-glucosidase [Bacteroides fragilis 3_1_12]
Length = 802
Score = 265 bits (677), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 236/815 (28%), Positives = 366/815 (44%), Gaps = 148/815 (18%)
Query: 37 NPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSA--PAIPRLG----------- 83
N + + + ++P+ +R L+S++TL+EK+ Q++ S P R+G
Sbjct: 29 NKNGQMDTYENPSVPVEERVEHLLSQMTLEEKVGQMLTSLGWPMYERVGEEIRLTARLEK 88
Query: 84 -IPAYE------------WWSEALHGVAGVGKGIFFNGTIR------------------- 111
I Y W LH G+ + + R
Sbjct: 89 EISEYHIGALWGFMRADPWTQRTLH--TGLNPSLAARASNRLQAFVMEHSRLGIPLFLAE 146
Query: 112 ---------GATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
G T FP I A++++ L ++G+ I EA A G IG + P ++
Sbjct: 147 ECPHGHMAIGTTVFPTSIGQASTWNPELIRQMGRVIATEASA---QGAHIG---YGPVLD 200
Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL 222
+ RDPRW R +ET GEDP + G + VRG QGDT L+G+ A KHF +Y
Sbjct: 201 LARDPRWSRVEETYGEDPYLNGVMGAALVRGFQGDT-----LRGRKSVIATLKHFASY-- 253
Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
W A + ++L + PPF V G A +M +YN ++G P R LL+
Sbjct: 254 -GWTEGGHNGGTAHLGERELEEAIFPPFREAVGAG-ALSVMSSYNEIDGNPCTGSRYLLT 311
Query: 283 KTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGS-FLQKHTKA 341
+ +W F G++ SD A+ + + G A S +A V + AG+D + G+ + A
Sbjct: 312 DILKDRWQFKGFVVSDLYAIGGLRE-HGVAGSDYEAAVKAVNAGVDSDLGTNVYAEQLVA 370
Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF--GKIGADVVCSPAHQVLALQA 399
AV++ + +D+A+ + ++ +GLF+ PF K A +V SP H LA +
Sbjct: 371 AVRKGDVAMETVDKAVRRILFLKFHMGLFDA-----PFVDDKRPAQLVASPEHIGLAREV 425
Query: 400 AQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS--ITPLQALQN 457
A+ IVLLKN LLPL K +LA+IGPNA++ +LG+Y P +T L+ ++
Sbjct: 426 ARQSIVLLKNEDKLLPLKKDIR-TLAVIGPNADNGYNMLGDYTAPQADGSVVTVLEGIRQ 484
Query: 458 YVEN---TVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG----LDQTQEKEE----- 505
V +Y GC S A++ A+ AD VV+++G D + E EE
Sbjct: 485 KVSKDTRVLYAKGCAVRDSSRTGFADAIEAARSADVVVMVVGGSSARDFSSEYEETGAAK 544
Query: 506 --------------LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDR 551
DR L L GRQ EL+ V + K P++LVL+ G P + +
Sbjct: 545 VSANRVSDMESGEGYDRATLHLMGRQLELLEEVRKLGK-PMVLVLIKGRP--LLMEGVIQ 601
Query: 552 NIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGN 611
+IL A YPG G A+A+V+FGD+NP GRL ++ P+ ++P+ R GN
Sbjct: 602 EADAILDAWYPGMQGGNAVADVLFGDYNPAGRLTLS-VPRSVGQLPVYYNTKR----KGN 656
Query: 612 PGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVP 671
R Y G +PFG GLSY+ +SY + K+ +++ S+ V+
Sbjct: 657 RSR-YIEEAGTPRYPFGYGLSYTTFSY-----TGMKVRVSEESNHCRVD----------- 699
Query: 672 ELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKE 731
V++ V+N G + G V L+++ P +QL F V L A E
Sbjct: 700 -------------VSVTVRNQGTVDGDEVVQLYLRDEVGSFTTPDRQLRAFSRVRLKAGE 746
Query: 732 KAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
EI F L +SL+ DG +E G ++ G
Sbjct: 747 TREITFTLDK-KSLALYMRDGEWAVEPGRFTVMAG 780
>gi|423229063|ref|ZP_17215468.1| hypothetical protein HMPREF1063_01288 [Bacteroides dorei
CL02T00C15]
gi|423244903|ref|ZP_17225977.1| hypothetical protein HMPREF1064_02183 [Bacteroides dorei
CL02T12C06]
gi|392634816|gb|EIY28728.1| hypothetical protein HMPREF1063_01288 [Bacteroides dorei
CL02T00C15]
gi|392640944|gb|EIY34735.1| hypothetical protein HMPREF1064_02183 [Bacteroides dorei
CL02T12C06]
Length = 788
Score = 265 bits (677), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 212/760 (27%), Positives = 342/760 (45%), Gaps = 129/760 (16%)
Query: 51 PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WSEALHGVAGVGKGI 104
P+ +R +DL+S++TL+EK Q+ + + + +P W W + + + G+
Sbjct: 49 PLEERVQDLLSQMTLEEKSCQMATLYGSGRVLKDALPQDNWKTEVWKDGIGNIDEEHNGL 108
Query: 105 ------------------------------------FFNGTIRG-----ATSFPQVILTA 123
F N IRG AT FP
Sbjct: 109 GTFKSEYSFPYTKHVDAKHAIQRWFVEETRLGIPVDFTNEGIRGLCHDRATYFPAQCGQG 168
Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVT 183
A+++ L RIG+ EA AL ++P ++I +DPRWGR ET GEDP +
Sbjct: 169 ATWNKELIARIGEVEAKEAVALEYT------NIYSPILDIAQDPRWGRCVETYGEDPYLV 222
Query: 184 GKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLA 243
G+ + +Q K A KHF Y + + + D V +++
Sbjct: 223 GELGKQMITSLQ-----------KHNLVATPKHFAVYSIPVGGRDGKTRTDPHVAPREMR 271
Query: 244 DTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVS 303
Y PF ++ A G+M +YN +G P + L++ R++WGF GY+ SD +AV
Sbjct: 272 TLYIEPFRMAFQEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKGYVVSDSEAVE 331
Query: 304 IIYDAEGYAKSPEDAVVDVLKAGMDVNC-----GSFLQKHTKAAVKQKKLPESEIDRALH 358
I A + ED + + AG+++ F+ K AV K+ + +D+ +
Sbjct: 332 FISSKHKVANTYEDGIAQAVNAGLNIRTHFTPPADFILPLRK-AVADGKISQETLDKRVA 390
Query: 359 NLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPK 418
+ V+ LGLF+ NP + GK +V S HQ ++L+AA+ +VLLKN LLPL K
Sbjct: 391 EILRVKFWLGLFD-NP-YRGNGKQAEQIVHSKEHQAVSLEAARQSLVLLKNEMNLLPLSK 448
Query: 419 SKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENT--VYYPGCDTV----- 471
S S+A+IGPNA+ L+ Y + T Q ++ + +T +Y GCD +
Sbjct: 449 SLR-SIAVIGPNADERTQLICRYGPANAPIKTVYQGIKERLPHTEVIYRKGCDIIDPHFP 507
Query: 472 ----------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELI 521
+ +++A+ AK A+ VV+++G ++ +E+ R L LPGRQ+EL+
Sbjct: 508 ESEVLDFPKTTEEARLMEEAIHAAKQAEVVVMVLGGNELTVREDRSRTSLNLPGRQEELL 567
Query: 522 TRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
V A KPV+LVLL G I +A ++ +IL A +PGE A+AE +FGD+NPG
Sbjct: 568 KAVC-ATGKPVVLVLLDGRASSINYAA--AHVPAILHAWFPGEFCGQAVAEALFGDYNPG 624
Query: 582 GRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFK 641
GRL +T +P+ ++P +P + + Y ++PFG GLSY+ +SY
Sbjct: 625 GRLAVT-FPKSVGQIPFA-FPFKPGSDESSSTSVYGV-----LYPFGHGLSYTTFSYGDL 677
Query: 642 AVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPV 701
+S L Q + ++ +KN G++ G V
Sbjct: 678 KISP----LRQGVQGDIN-------------------------ISCKIKNTGKIKGDEVV 708
Query: 702 LLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSP 741
L+++ K L GF+ + L A E+ + F L P
Sbjct: 709 QLYLRDEVSSVTTYTKVLRGFERISLEAGEEQMVHFRLRP 748
>gi|346226088|ref|ZP_08847230.1| glycoside hydrolase family 3 domain protein [Anaerophaga
thermohalophila DSM 12881]
Length = 749
Score = 265 bits (676), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 220/793 (27%), Positives = 368/793 (46%), Gaps = 112/793 (14%)
Query: 2 IFHKLSLVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVS 61
IFH +P L L +L ++++ + E++PF L R DL+S
Sbjct: 26 IFHG----YPALYLFMVFILAGLNAS-----------AQESYPFQNPELDSEARIDDLLS 70
Query: 62 RLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRG-----ATSF 116
R+TLDEK+S L ++ P++PRLG+ + + G GV G N +G T+F
Sbjct: 71 RMTLDEKVSAL-STDPSVPRLGVKG----APHIEGYHGVAMGGPANWAPKGDEAVPTTTF 125
Query: 117 PQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI--GMTFWAPNINIFRDPRWGRGQE 174
PQ A+++ L G+ +EAR ++ + G+ APN ++ RDPRWGR +E
Sbjct: 126 PQAYGMGATWNPELIRLAGEIESIEARYIFQNPEIAKGGLVVRAPNADLGRDPRWGRTEE 185
Query: 175 TPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFD 234
GEDP + G A ++ +G+QGD + ++ KHF A +N + ++ FD
Sbjct: 186 CFGEDPFLVGTSATAFTKGLQGDD------DQYWRTASLLKHFLANSNENGRESSSSDFD 239
Query: 235 ARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGY 294
MQ + Y F +G ++ M AYN +NG+P+ ++ + R WG G
Sbjct: 240 ----MQLYHEYYGASFRRAFIEGGSNAYMAAYNAINGVPAHV-HDMHKEITERMWGVDGI 294
Query: 295 ITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEID 354
+D ++ Y A V+KAG++ ++ ++ A+ + E++ID
Sbjct: 295 KCTDGGGYQLLVYGHKYYDDLYLAAEGVIKAGLNQFLDNY-REGVYGALAHGYITEADID 353
Query: 355 RALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSP----AHQVLALQAAQDGIVLLKNS 410
L ++ V ++LG + + P+ IG D +P H+ AL+ A++ IVLLKN+
Sbjct: 354 EVLRGVYRVMIKLGQLDPQEKV-PYSAIGRDGKPAPWTTQKHKDAALRMARESIVLLKNN 412
Query: 411 HGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN---TVYYPG 467
+ LPL K +A+IG A++ LL Y+G ITPL+ ++ + N +Y P
Sbjct: 413 NKTLPLNADKLNKVAVIGYLADT--VLLDWYSGLPPYRITPLEGIREKLGNDSKVLYAPD 470
Query: 468 CDTVACSSASIDKAVDIAKGADHVVLMMGL-------------DQTQEKEELDRVDLVLP 514
D A AV+ A AD ++++G D +E +DR L L
Sbjct: 471 NDYNA--------AVEAASEADVAIVILGNYPTCNSEIWADCPDPGMGREAIDRKTLRL- 521
Query: 515 GRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVI 574
E + ++ A I VL P I +++ +N+ +IL + G+ ALA+V+
Sbjct: 522 --TDEYLVKLVMEANPNTIFVLQSSFPYAINWSQ--QNVPAILHLTHNGQETGSALADVL 577
Query: 575 FGDHNPGGRLPMTWYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSY 633
FGD+NPGG+L TW P+ ++P M + +R G TY ++E K ++PFG GLSY
Sbjct: 578 FGDYNPGGKLTQTW-PKSEDQLPDMMEYDIR-------KGHTYMYFEDKPLYPFGHGLSY 629
Query: 634 SKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHG 693
+ ++++ ++ N+ VV + + ++T+ +KN G
Sbjct: 630 TTFAWEDISI-----------------NKPVVS-----------ADDEEVIITVKLKNTG 661
Query: 694 EMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGL 753
++ G V L+ RP K L GF+ V L EK +I + + R
Sbjct: 662 DVKGDEVVQLYASFPESTVRRPAKALKGFKRVTLEPGEKKKIEIPIKLQDIAYRDINKER 721
Query: 754 MVIEEGTHFLVVG 766
V+E GT ++ G
Sbjct: 722 FVVEPGTIKIMAG 734
>gi|393787408|ref|ZP_10375540.1| hypothetical protein HMPREF1068_01820 [Bacteroides nordii
CL02T12C05]
gi|392658643|gb|EIY52273.1| hypothetical protein HMPREF1068_01820 [Bacteroides nordii
CL02T12C05]
Length = 764
Score = 265 bits (676), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 207/657 (31%), Positives = 321/657 (48%), Gaps = 80/657 (12%)
Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
I N RG T +P I A+SF+ L IG+ +E R+ + TF PNI
Sbjct: 130 AIHGNCMHRGCTVYPTSIGMASSFNPVLLKEIGRQTAVEMRS-----SGVHWTF-NPNIE 183
Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL 222
+ RD RWGR ET GED + + + + G+QG+ NG G L AC KHF
Sbjct: 184 LARDARWGRVGETFGEDTYLVTQMGTALILGLQGE--NGFDGSGVL---ACAKHFVGGG- 237
Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
+ G D ++ Q L D Y PF + + + +M A+N +NG+P A+ LL
Sbjct: 238 EPAGGINAAPMD--MSEQKLRDLYLSPFAEAINKAYVATVMPAHNELNGVPCHANHYLLQ 295
Query: 283 KTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC-GSFLQKHTKA 341
+ R + GF G++ SD + +++ YA S E+A +KAG+D++ G +
Sbjct: 296 EILRNELGFQGFVISDWMDIERLHEMHHYAPSQEEAFRMAVKAGVDMHMQGDGFLEAIVE 355
Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQ-PFGKIGADVVCSPAHQVLALQAA 400
AV+ K +PE+ ID A++ + + RLGLF NP + P + ++ + HQ AL+AA
Sbjct: 356 AVRNKYIPETRIDLAVYKILEAKFRLGLFE-NPLVDIPASR---SLIYTEDHQATALEAA 411
Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG--PSCRSITPLQALQNY 458
+ IVLLKN + LLPL + + + + GPNANS T++G++ P IT L +Q
Sbjct: 412 RQSIVLLKNDNYLLPLKQGRYKKILVTGPNANSP-TIMGDWTTRQPEENVITVLAGIQQQ 470
Query: 459 VENTVYYPGCDTVACSSASIDK--------AVDIAKGADHVVLMMGLDQTQEK------E 504
V + V DTV C S I K A A AD ++++G + + E
Sbjct: 471 VPDAV----IDTV-CFSNKIRKMDRSLIKTAAQKAVEADINIVVVGENSERYNSDRTCGE 525
Query: 505 ELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGE 564
DR +L LP QQEL+ V A+ KPVILVLL G P+ +T+A+ ++I +I+ A PG
Sbjct: 526 NCDRDNLELPTHQQELLEAVY-ASGKPVILVLLNGRPLSVTWAQ--QHIPAIVEAWEPGG 582
Query: 565 AGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEV 624
G A+AE++FG NP G+LP+T +P+ ++ Q + R + +
Sbjct: 583 MGGRAIAEILFGKVNPSGKLPIT-FPRSVGQIQTVYNHKASQYS-----RKFALTTTGPL 636
Query: 625 FPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFL 684
+ FG GLSY+ + Y +S++ ++ N++ S
Sbjct: 637 YHFGYGLSYTTFEYGNPVLSKDTIHTNEAVS----------------------------- 667
Query: 685 VTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSP 741
V+ + N G G L+++ RP+K+L GFQ + LN EK + F ++P
Sbjct: 668 VSFELANTGLCQGTEIAQLYIQDEYGTVTRPVKELKGFQRITLNPGEKQRVSFLITP 724
>gi|393782347|ref|ZP_10370532.1| hypothetical protein HMPREF1071_01400 [Bacteroides salyersiae
CL02T12C01]
gi|392673618|gb|EIY67077.1| hypothetical protein HMPREF1071_01400 [Bacteroides salyersiae
CL02T12C01]
Length = 862
Score = 265 bits (676), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 160/437 (36%), Positives = 230/437 (52%), Gaps = 33/437 (7%)
Query: 44 PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
P+ L QRA DL+ RLTL+EK+ + N + + RLGI Y+WW+EALHGVA G
Sbjct: 23 PYKNPDLSPEQRAEDLLGRLTLEEKVRLMQNESFPVERLGIAHYDWWNEALHGVARAGI- 81
Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGMT 155
AT FP + A++FD L + + EARA Y+ + G+T
Sbjct: 82 ---------ATVFPITMGMASTFDDKLVEDVFTVVSDEARAKYHNAHREGRRGVRCEGLT 132
Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCK 215
FW PN+NIFRDPRWGRGQET GEDP +T + V+ V G+QG KL AC K
Sbjct: 133 FWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVAVVNGLQGP---ADAKYDKLH--ACAK 187
Query: 216 HFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPS 274
H+ + K R+ F+A ++ +DL +TY P F+ V++ +MCAYNR G P
Sbjct: 188 HYAVHSGPEAK---RHYFNAENISPRDLWETYLPAFKDLVQEADVKEVMCAYNRFEGDPC 244
Query: 275 CADRNLLSKTARRQWGFHGYITSDCDAVSIIY--DAEGYAKSPEDAVVDVLKAGMDVNCG 332
C LL++ R +WG+ + SDC A+S + D K DA + +G D+ CG
Sbjct: 245 CGSNRLLTQILRDEWGYKHMVVSDCGAISDFFYKDRHATHKDAADASAAAVLSGTDLECG 304
Query: 333 SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAH 392
H + AVK+ + E I+ +L L R LG + + + P+ KI D V H
Sbjct: 305 -IEYAHLEEAVKKGLISEERINTSLRRLLKARFELGEMD-DDALVPWSKISIDTVDCETH 362
Query: 393 QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
+ +AL + +VLL N +G+LPL K+ +A++GPNA + GNY G + T L
Sbjct: 363 KQMALDVTRKSMVLLHN-NGVLPLAKT-GTRIAVMGPNAVDSVMQWGNYKGTPSHTSTIL 420
Query: 453 QALQNYVENTVYYPGCD 469
+ ++N + N Y GC+
Sbjct: 421 EGIRNKIGNVPYEKGCE 437
Score = 96.3 bits (238), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 72/265 (27%), Positives = 120/265 (45%), Gaps = 52/265 (19%)
Query: 488 ADHVVLMMGLDQTQEKEEL-----------DRVDLVLPGRQQELITRVAEAAKKPVILVL 536
AD V+ + G+ E E+ DR + LP Q+ ++ + +A KK ++ +
Sbjct: 598 ADVVIFVGGISPDLEGEDKYFVNCPGFSGGDRTTIELPEVQRNILKALKQAGKK--VIFV 655
Query: 537 LCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKV 596
C G + ++ +IL A YPG+AG A+A+V+FGD+NP G+LP+T+Y
Sbjct: 656 NCSGSA-VALVPETKSCDAILQAWYPGQAGGTAVADVLFGDYNPSGKLPVTFYKNTDQLP 714
Query: 597 PMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSST 656
+ M+ GRTYR+ ++PFG GLSY+ + +S+N + NQ+ +
Sbjct: 715 DFENYDMK--------GRTYRYMTETPLYPFGYGLSYTTFDISKGRLSKNVISTNQTVTF 766
Query: 657 KMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPI 716
K VKN G+ G V ++V+ G P+
Sbjct: 767 KA-----------------------------NVKNTGKREGTEVVQVYVRKVGDKEG-PV 796
Query: 717 KQLVGFQSVILNAKEKAEIVFELSP 741
K L F+S+ L A + + + +L P
Sbjct: 797 KTLRAFRSIPLKAGKSSVVSIDLPP 821
>gi|160891087|ref|ZP_02072090.1| hypothetical protein BACUNI_03534 [Bacteroides uniformis ATCC 8492]
gi|156859308|gb|EDO52739.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
uniformis ATCC 8492]
Length = 865
Score = 264 bits (675), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 173/458 (37%), Positives = 242/458 (52%), Gaps = 36/458 (7%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
+ +LP +RA L+ LTL+EK+S +++S+ + RLGI Y WW+EALHGVA G
Sbjct: 23 YKNASLPAEERAGLLLKELTLEEKVSLMMDSSKPVERLGIKPYNWWNEALHGVARAGL-- 80
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA---LYNAGQA----IGMTFW 157
AT FPQ I AASF Y + A+ EARA Y + ++ G+T W
Sbjct: 81 --------ATVFPQPIGMAASFSPETVYEVFTAVSDEARAKNAYYTSQESHERYQGLTMW 132
Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
P +NI+RDPRWGRG ET GEDP +T + V V+G+QG N GK + AC KHF
Sbjct: 133 TPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGT--NDGKYD---KLHACAKHF 187
Query: 218 TAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
+ W R++F+A + +DL +TY PPFE+ VK+G+ +MCAYNR G P C
Sbjct: 188 AVHSGPEW---NRHEFNAENIKPRDLYETYLPPFEALVKEGKVKEVMCAYNRFEGDPCCG 244
Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDVNCGSF 334
LL + R +WGF G + SDC A++ Y G+ P E A + +G D+ CGS
Sbjct: 245 SDRLLMQILRDEWGFDGIVLSDCGAIADFYRDYGHKTHPDAESASAAAVLSGTDLECGSS 304
Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
+ + AVKQ K+ E +D A+ L + R LG + P + I VV S H
Sbjct: 305 YEALVE-AVKQGKIDEKAVDVAVKRLLTARFALGEMD-EPEKVSWTGIPFSVVASAGHDS 362
Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
LAL A+ + LL N LPL K +++A++GPNAN + GNY G ++T L
Sbjct: 363 LALDMARKSMTLLMNKDNTLPL-KRGGLTIAVMGPNANDSVMQWGNYNGMPPHTVTILDG 421
Query: 455 LQNYV---ENTVYYPGCDTVACSSASIDKAVDIAKGAD 489
++ + + +Y GC V A I + K AD
Sbjct: 422 IRKALGSDDRLIYEQGCGWV--ERAQIQSVFNRCKTAD 457
Score = 122 bits (305), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 94/304 (30%), Positives = 145/304 (47%), Gaps = 53/304 (17%)
Query: 474 SSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITR 523
+ + K+V+ K AD VV + G+ E EE+ DR D+ LP Q+ELI
Sbjct: 587 NEVDVRKSVERVKDADIVVFVGGVSPNLEGEEMGVELPGFRGGDRTDIELPAVQRELIAA 646
Query: 524 VAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGR 583
+ A KK V+LV G P I G+IL A YPG+AG A+AEV+FGD+NP GR
Sbjct: 647 LHHAGKK-VVLVNCSGSP--IGLEPETGRCGAILQAWYPGQAGGTAVAEVLFGDYNPAGR 703
Query: 584 LPMTWYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKA 642
LP+T+Y ++ ++P D M GRTYR+ + +FPFG GLSY+ +SY
Sbjct: 704 LPVTFY-RNVSQLPDFEDYNM--------TGRTYRYMTQEPLFPFGHGLSYTSFSYGAVV 754
Query: 643 VSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVL 702
+ + + + K +++ V N G+ G+ V
Sbjct: 755 LGSDNIKSGE-----------------------------KLRLSVPVTNTGKCDGEEVVQ 785
Query: 703 LFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHF 762
+++K G P K L F+ V + A + ++ F+L E + + M + EG++
Sbjct: 786 VYLKKNDDVEG-PSKALRAFKRVHIPAGKTVDVEFDLGDKELVWWNPQSNAMCVSEGSYE 844
Query: 763 LVVG 766
L+VG
Sbjct: 845 LMVG 848
>gi|218258058|ref|ZP_03474485.1| hypothetical protein PRABACTJOHN_00138 [Parabacteroides johnsonii
DSM 18315]
gi|218225777|gb|EEC98427.1| hypothetical protein PRABACTJOHN_00138 [Parabacteroides johnsonii
DSM 18315]
Length = 955
Score = 264 bits (675), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 223/803 (27%), Positives = 361/803 (44%), Gaps = 145/803 (18%)
Query: 49 TLPISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----W-------SEALH 95
T PI R DL+S++ ++EK Q+V + + +P +W W E L+
Sbjct: 65 TAPIDARVEDLLSQMNVEEKTCQMVTLYGYKRVLKDDLPTPDWKNQLWKDGMGAIDEHLN 124
Query: 96 GVAGVGK-------------------------------GI---FFNGTIRG-----ATSF 116
G G GI F N IRG AT+F
Sbjct: 125 GFQQWGLPPSDNPYVWPASRHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVESYIATNF 184
Query: 117 PQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQET 175
P + +++ L +++G G E R L G T +AP +++ RD RWGR +E
Sbjct: 185 PTQLGLGHTWNRDLVHKVGYITGREGRLL-------GYTNVYAPILDVGRDQRWGRYEEV 237
Query: 176 PGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDA 235
GE P + + V +G+Q D Q +A KH+ AY + + D
Sbjct: 238 YGESPYLVAELGVEMAKGMQTD----------YQVAATSKHYIAYSNNKGGREGMARVDP 287
Query: 236 RVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYI 295
+++ +++ + P++ +K+ G+M +YN +G P + L+ R ++GF GY+
Sbjct: 288 QMSPREVEMLHVYPWKRVIKEAGILGVMSSYNDYDGFPIQSSYYWLTTRLRGEFGFRGYV 347
Query: 296 TSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT-----KAAVKQKKLPE 350
SD DAV ++ G A +++V+ + AG+++ C +F + + + + LP
Sbjct: 348 VSDSDAVEYLFSKHGTAADMKESVLQSVLAGLNIRC-TFRSPDSYVLPLRELIAEGALPM 406
Query: 351 SEIDRALHNLFSVRMRLGLFNGNPTMQPFG---KIGADVVCSPAHQVLALQAAQDGIVLL 407
S ID + ++ V+ +GLF+ QP+ K V S +Q +ALQA+++ +VLL
Sbjct: 407 STIDDRVRDILRVKFLVGLFD-----QPYQIDLKQADKEVNSAENQQVALQASKESLVLL 461
Query: 408 KNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE---NTVY 464
KN +LPL +K +A+ GPNA+ L +Y + T L+ +QN V+ ++
Sbjct: 462 KNQDAVLPLDVNKISKIAVCGPNADEEAYALTHYGPLAVEVTTVLEGIQNKVKPGTEVLF 521
Query: 465 YPGCDTVACS---------------SASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRV 509
GCD V + + I+KAV+ AK +D V+++G E R
Sbjct: 522 TKGCDLVDANWPESELIRYPLTSEEQSEINKAVENAKKSDVAVVVLGGSNRTCGENKSRS 581
Query: 510 DLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVA 569
L LPGRQ +L+ V A KPV+LVL+ G P+ I +A D+ + +IL A YPG G A
Sbjct: 582 SLELPGRQLDLLQAVV-ATGKPVVLVLINGRPISINWA--DKYVPAILEAWYPGSQGGTA 638
Query: 570 LAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQA------TSGNPGRTYRFYEGKE 623
+A+ +FGD+NPGG+L +T +P+ ++P + +P A G G R
Sbjct: 639 IADALFGDYNPGGKLTVT-FPKTVGQIPF-NFPTKPNAQVDGGRNKGLDGNMSRV--NGP 694
Query: 624 VFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKF 683
++PFG GLSY+ + Y ++ P + T + +
Sbjct: 695 LYPFGYGLSYTTFEYSDISIQ--------------------------PAIVT---QVQPV 725
Query: 684 LVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCE 743
V V N G+ AG V L+V+ K LVGF + LN E E+ F + P
Sbjct: 726 TVRCKVTNTGKRAGDEVVQLYVRDILSSVTTYEKNLVGFDRIHLNPGETKELTFTIEP-R 784
Query: 744 SLSRAREDGLMVIEEGTHFLVVG 766
L D V+E G ++VG
Sbjct: 785 DLQLLNSDNHWVVEPGDFKVMVG 807
>gi|427387416|ref|ZP_18883472.1| hypothetical protein HMPREF9447_04505 [Bacteroides oleiciplenus YIT
12058]
gi|425725577|gb|EKU88448.1| hypothetical protein HMPREF9447_04505 [Bacteroides oleiciplenus YIT
12058]
Length = 733
Score = 264 bits (675), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 224/752 (29%), Positives = 359/752 (47%), Gaps = 95/752 (12%)
Query: 51 PISQRARDLVSRLTLDEKISQL-----------------VNSAPAIPRLGIPAYEWWSEA 93
P+ R +DL+ R+TL EK+ QL V + PA +G Y
Sbjct: 31 PVETRVKDLLKRMTLHEKVLQLNQYTFGENDNPNNIGTEVKNLPA--EIGSLIYLHTDPK 88
Query: 94 LHGV--------AGVGKGIFFN-GTIRGA-TSFPQVILTAASFDSYLWYRIGQAIGLEAR 143
L + +G I F I G T +P + A SF+ L + QA G+ A+
Sbjct: 89 LRNQIQRKAMEESRLGIPILFGFDVIHGLRTVYPISLAQACSFNPDL---VTQACGMAAK 145
Query: 144 ALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGK 203
+G I TF +P I++ RDPRWGR E GEDP + + V+ V+G QG+ K
Sbjct: 146 ESVLSG--IDWTF-SPMIDVARDPRWGRISECYGEDPYLNTVFGVASVQGYQGE-----K 197
Query: 204 LKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIM 263
L +AC KH+ Y +G Y++ ++ Q L +TY PP+E+CVK G A+ +M
Sbjct: 198 LSDPYSIAACLKHYVGYGAS--EGGRDYRY-TDISPQALWETYLPPYEACVKAGAAT-LM 253
Query: 264 CAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVS-IIYDAEGYAKSPEDAVVDV 322
++N ++G+P+ ++ +L++ + +W G++ SD +A+ +IY +G AK ++A
Sbjct: 254 SSFNDISGVPATSNHYILTEILKNKWRHDGFVVSDWNAIEQLIY--QGVAKDRKEAAYKA 311
Query: 323 LKAGMDVNC-GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGK 381
AG++++ + ++ + V +KK+ S+ID A+ + V+ RLGLF+ +P+ K
Sbjct: 312 FHAGVEMDMRDNIYYEYLEQLVAEKKIQMSQIDDAVARILRVKFRLGLFD-----EPYTK 366
Query: 382 --IGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLG 439
+ LA + A++ +VLLKN + LLPL S +ALIGP A + LLG
Sbjct: 367 ELTEQERYLQKEDIALAARLAEESMVLLKNENNLLPL-SSTVKRVALIGPMAKDSANLLG 425
Query: 440 NYA--GPSCRSITPLQALQNYVENTV---YYPGCDTVACSSASIDKAVDIAKGADHVVLM 494
+A G + T + +Q + V Y GC + A+ A+ +D VV+
Sbjct: 426 AWAFKGHAEDVETIYEGMQKEFGDKVQLDYEQGCALDGNDESGFSAALKTAEASDVVVVC 485
Query: 495 MGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIG 554
+G + E R + LP Q++L+ + +A KP++LVL G P+++ + + +
Sbjct: 486 LGESKQWSGENASRSTIALPDIQEKLLLHLKQA-NKPIVLVLSSGRPLELI--RLEPQVE 542
Query: 555 SILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGR 614
+I+ PG AG LA ++ G NP G+L +T +P ++P+ MR A +
Sbjct: 543 AIIEMWQPGVAGGTPLAGILSGRVNPSGKLSVT-FPLSTGQIPVY-YNMRQSARPFDAMG 600
Query: 615 TYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELG 674
Y+ K ++PFG GLSY+ + Y S KL SS K+ +NQ
Sbjct: 601 DYQDIPTKPLYPFGHGLSYTTFVY-----SDAKL-----SSLKIRKNQ------------ 638
Query: 675 TEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAE 734
K + V N G+M GK VL +V RP+K+L F+ LNA E
Sbjct: 639 -------KITAEVTVTNAGKMEGKETVLWYVSDPFCSISRPMKELKFFEKHSLNAGESRV 691
Query: 735 IVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
FE+ P LS G +E G + VG
Sbjct: 692 FRFEIDPMRDLSYTDATGKRFLEAGEFIVSVG 723
>gi|329962030|ref|ZP_08300041.1| glycosyl hydrolase family 3 protein [Bacteroides fluxus YIT 12057]
gi|328530678|gb|EGF57536.1| glycosyl hydrolase family 3 protein [Bacteroides fluxus YIT 12057]
Length = 941
Score = 264 bits (675), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 231/854 (27%), Positives = 374/854 (43%), Gaps = 155/854 (18%)
Query: 8 LVFPLLCLCFTSLLTRVDSTQPPFSCDPS------------NPSTETFPFCKTTLPISQR 55
L+ +L L ++LLT + + P + P+ N + + + R
Sbjct: 4 LIAAVLLLSNSALLTAQKTMKVPATYKPTKSEMYHKGWIDFNKNGVMDVYEDPAATVDAR 63
Query: 56 ARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW------------------------ 89
DL+ ++TLDEK Q+V + + +P EW
Sbjct: 64 VEDLLKQMTLDEKTCQMVTLYGYKRVLKDALPTPEWKQMLWKDGIGAIDEHLNGFQQWGL 123
Query: 90 --------WSEALHG----------VAGVGKGI---FFNGTIRG-----ATSFPQVILTA 123
W + H V GI F N IRG AT+FP +
Sbjct: 124 PPSDNENVWPASRHAWALNEVQRFFVEETRLGIPVDFTNEGIRGVESYKATNFPTQLGLG 183
Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGEDPLV 182
+++ L +++G G EAR L G T +AP +++ RD RWGR +E GE P +
Sbjct: 184 HTWNRALIHKVGLITGREARML-------GYTNVYAPILDVGRDQRWGRYEEVYGESPYL 236
Query: 183 TGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDL 242
+ + VRG+Q +A KHF AY + + D + + ++
Sbjct: 237 VAELGIEMVRGLQQ------------HVAATGKHFAAYSNNKGAREGMARVDPQTSPHEV 284
Query: 243 ADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAV 302
+ + PF +K+ G+M +YN +GIP L+ R + GF GY+ SD DAV
Sbjct: 285 ENIHIYPFRRVIKEAGLLGVMSSYNDYDGIPIQGSYYWLTTRLRDEMGFRGYVVSDSDAV 344
Query: 303 SIIYDAEGYAKSPEDAVVDVLKAGMDVNC-----GSFLQKHTKAAVKQKKLPESEIDRAL 357
+Y G AK ++AV ++AG++V C SF+ + VK+ L E ++ +
Sbjct: 345 EYLYTKHGTAKDMKEAVRQSVEAGLNVRCTFRSPDSFVLP-LRELVKEGGLDEETVNDRV 403
Query: 358 HNLFSVRMRLGLFNGNPTMQPFGKIGADV-VCSPAHQVLALQAAQDGIVLLKNSHGLLPL 416
++ V+ +GLF+ P GAD V ++ +ALQA+++ +VLLKN + LPL
Sbjct: 404 RDILRVKFLIGLFDA-PYQTDLA--GADKEVEKEENEAVALQASRESVVLLKNENSTLPL 460
Query: 417 PKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE---NTVYYPGCDTVAC 473
+ +A+ GPNA+ L +Y + T L+ +Q+ V +Y GCD V
Sbjct: 461 NINTVKKIAVCGPNADEDGYALTHYGPLAVEVTTVLKGIQDKVNGKAEVLYTKGCDLVDA 520
Query: 474 S---------------SASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQ 518
+ A I+KAV+ A+ AD V+++G Q E R L LPGRQ
Sbjct: 521 NWPESEIIDYPLTPDEQAEINKAVENARRADVAVVVLGGGQRTCGENKSRSSLDLPGRQL 580
Query: 519 ELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDH 578
+L+ V +A KPV+L+L+ G P+ + +A D+ + +IL A YPG G VALA+++FGD+
Sbjct: 581 QLLQAV-QATGKPVVLILINGRPLSVNWA--DKYVPAILEAWYPGSKGGVALADILFGDY 637
Query: 579 NPGGRLPMTWYPQDYIKVPMTDMKMRPQA------TSGNPGRTYRFYEGKEVFPFGCGLS 632
NPGG+L +T +P+ ++P + +P + +G G R ++PFG GLS
Sbjct: 638 NPGGKLTVT-FPKTVGQIPF-NFPCKPASQIDGGKNAGPDGNMSRI--NGALYPFGYGLS 693
Query: 633 YSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNH 692
Y+ + Y ++ + N+ ++ V + V N
Sbjct: 694 YTTFEYSNLEITPKVITPNEKAT-----------------------------VRLKVTNT 724
Query: 693 GEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDG 752
G+ AG V L+ + K L GF+ + L E E+ F L + L D
Sbjct: 725 GKYAGDEVVQLYTRDVLSSVTTYEKNLAGFERIHLEPGETKEVTFILD-RKHLELLDADM 783
Query: 753 LMVIEEGTHFLVVG 766
V+E G ++ G
Sbjct: 784 KRVVEPGDFAIMAG 797
>gi|427384377|ref|ZP_18880882.1| hypothetical protein HMPREF9447_01915 [Bacteroides oleiciplenus YIT
12058]
gi|425727638|gb|EKU90497.1| hypothetical protein HMPREF9447_01915 [Bacteroides oleiciplenus YIT
12058]
Length = 1050
Score = 264 bits (674), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 157/424 (37%), Positives = 239/424 (56%), Gaps = 34/424 (8%)
Query: 51 PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
P +R DL+SRLT++EKIS L ++P I RL IP Y +EALHGV G+
Sbjct: 35 PTHERIMDLLSRLTVEEKISLLRATSPGISRLDIPKYYHGNEALHGVVRPGR-------- 86
Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN---------AGQAIGMTFWAPNI 161
T FPQ I AA+++ L ++ I EARA +N + + +TFW+P +
Sbjct: 87 --FTVFPQAIGLAATWNPVLQEQVATVISDEARARWNELDQGREQKSQFSDLLTFWSPTV 144
Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
N+ RDPRWGR ET GEDP ++G ++V+G+QG+ L+ + KHF A
Sbjct: 145 NMARDPRWGRTPETYGEDPYLSGIMGTAFVKGLQGNDSR------YLKIVSTPKHFAA-- 196
Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
N + R+ + +++ + L + Y P FE+CVK G+++ IM AYN +N +P + LL
Sbjct: 197 --NNEEHNRFVCNPQISEKQLREYYLPAFEACVKDGKSASIMSAYNALNDVPCTLNAWLL 254
Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTK 340
+K R WGF GY+ SDC S++ +A Y K+ E A +KAG+D+ CG +
Sbjct: 255 TKVLRNDWGFKGYVVSDCGGPSLLVNAHKYVKTKEAAATLSIKAGLDLECGDDVYDEPLL 314
Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
+A +Q + +++ID A + + RM+LGLF+ P+ KI V+ S HQ +AL AA
Sbjct: 315 SAYRQYMVTDADIDSAAYRVLRARMQLGLFDSGEK-NPYTKISPAVIGSKEHQEVALNAA 373
Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQALQNYV 459
++ IVLLKN +LPL K S+A++G NA S++ G+Y+G P ++ LQ +++ V
Sbjct: 374 RECIVLLKNQKKMLPLNAKKIKSIAVVGINAGSSE--FGDYSGLPVIAPVSVLQGIKDRV 431
Query: 460 ENTV 463
V
Sbjct: 432 GEDV 435
Score = 122 bits (307), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 80/280 (28%), Positives = 143/280 (51%), Gaps = 47/280 (16%)
Query: 489 DHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAK 548
+ VV ++G++++ E+E DR D+ LP Q+E + + + +++VL+ G + + +
Sbjct: 604 ETVVAVLGINKSIEREGQDRYDIQLPADQREFLQEIYKVNPN-IVVVLVAGSSLAVNWM- 661
Query: 549 YDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQAT 608
D ++ +I+ A YPGE+G A+AEV+FGD+NPGGRLP+T+Y P D +
Sbjct: 662 -DEHVPAIVNAWYPGESGGKAVAEVLFGDYNPGGRLPLTYYRSLDELPPFDDYDI----- 715
Query: 609 SGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYK 668
GRTY++++G ++PFG GLSY+ + Y S+ ++ + ++ V
Sbjct: 716 --TKGRTYKYFKGDVLYPFGYGLSYTSFKY---------------SNLQVADGEEEVS-- 756
Query: 669 SVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILN 728
V+ +KN G AG ++VK R P+K+L GF+ V L
Sbjct: 757 ----------------VSFQLKNTGRYAGDEVAQVYVKLPEREEVMPVKELKGFERVSLK 800
Query: 729 AKEKAEIVFELSPCESLSRARED--GLMVIEEGTHFLVVG 766
+ E ++ +L + L R ++ G + G + ++VG
Sbjct: 801 SGESKKVTIKLR--KDLLRYWDEAKGKFIYPSGNYNIMVG 838
>gi|189468349|ref|ZP_03017134.1| hypothetical protein BACINT_04746 [Bacteroides intestinalis DSM
17393]
gi|189436613|gb|EDV05598.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 786
Score = 264 bits (674), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 224/794 (28%), Positives = 361/794 (45%), Gaps = 132/794 (16%)
Query: 51 PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WSEALHGVAGVGKGI 104
PI R +DL+S++TL+EK Q+ + + + +P +W W + + + G+
Sbjct: 48 PIEARVQDLLSQMTLEEKTCQMATLYGSGRVLKDSLPTEKWKDEIWKDGIANIDEQANGL 107
Query: 105 ------------------------------------FFNGTIRG-----ATSFPQVILTA 123
F N IRG AT FP
Sbjct: 108 GRFGSSLSYPYVNSVENRQTIQRWFVEQTRLGIPVDFTNEGIRGLCHDRATMFPAQCGQG 167
Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGEDPLV 182
A+++ L I Q EA+AL G T ++P ++I +DPRWGR E GEDP +
Sbjct: 168 ATWNKELISEIAQVTAEEAKAL-------GYTNIYSPILDIAQDPRWGRVVECYGEDPFL 220
Query: 183 TGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDL 242
G+ ++G+Q + A KHF Y + + D V +++
Sbjct: 221 VGELGKRMIKGLQQEGL-----------VATPKHFAVYSIPVGGRDAGTRTDPHVAPREM 269
Query: 243 ADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAV 302
Y PF + A G+M +YN +G P + L++ R +WGF GY+ SD +AV
Sbjct: 270 RTLYIEPFRKAFCEAGALGVMSSYNDYDGEPITGSYHFLTEILRHEWGFKGYVVSDSEAV 329
Query: 303 SIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKH----TKAAVKQKKLPESEIDRALH 358
+Y A D V+ AG++V L ++ + A+ + K+ ID +
Sbjct: 330 EFLYSKHQVAVDAVDGAAQVVNAGLNVRTNFTLPENFIRPLRQAISEGKVSMQTIDSRVA 389
Query: 359 NLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPK 418
++ V+ +GLF+ NP + K VV S HQ ++++AA + IVLLKN + +LPL K
Sbjct: 390 DVLRVKFGMGLFD-NP-YKGDAKHPEKVVHSKEHQAVSMRAALESIVLLKNENNILPLSK 447
Query: 419 SKSVSLALIGPNANSAKTLLGNYAGPSCRSI-TPLQALQNYVEN--TVYYPGCDTV---- 471
+A+IGPNAN + L+ Y GP+ I T Q ++ Y+ + Y G D +
Sbjct: 448 DLK-KIAVIGPNANEVQNLICRY-GPANAPIKTVYQGIKEYLPDAEVRYAKGTDIIDKYF 505
Query: 472 -----------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQEL 520
A +D+AV +A+ +D ++++G ++ +EE R +L L GRQ++L
Sbjct: 506 PESELYEVPLDQEEQAMMDEAVTLAEESDVAIMVLGGNEKTVREEYSRTNLDLCGRQEKL 565
Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
+ V A KPVIL+L+ G I +A +R I I+ A +PGE A+A+V+FGD+NP
Sbjct: 566 LQAVY-ATGKPVILLLVDGRVATINWA--ERYIPGIVHAWFPGEFMGDAVAQVLFGDYNP 622
Query: 581 GGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKF 640
GG+L +T +P+ ++P +P + S R ++PFG GLSY+ ++Y
Sbjct: 623 GGKLAVT-FPRSVGQIPFA-FPFKPGSDSKGFVRV-----TGTLYPFGYGLSYTTFAY-- 673
Query: 641 KAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHP 700
S +EN P +G + ++ VKN G++AG
Sbjct: 674 --------------SDLKIEN---------PVIGVQ----GSVKLSCKVKNTGKVAGDEV 706
Query: 701 VLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGT 760
V L++ +K L GF+ + L E+ I F L+P E L +D V+E GT
Sbjct: 707 VQLYLHDEMSSVTTYVKVLRGFERIHLEPGEEKVIDFVLTPQE-LGLWNKDNHFVVEPGT 765
Query: 761 HFLVVGDEEYPISI 774
++VG I +
Sbjct: 766 FAVMVGSSSQDIRL 779
>gi|390167927|ref|ZP_10219905.1| beta-glucosidase, partial [Sphingobium indicum B90A]
gi|389589522|gb|EIM67539.1| beta-glucosidase, partial [Sphingobium indicum B90A]
Length = 771
Score = 264 bits (674), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 228/714 (31%), Positives = 339/714 (47%), Gaps = 100/714 (14%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIP + E LHG A VG ATSFPQ I A+S+D L + I
Sbjct: 117 RLGIPIL-FHEEGLHGYAAVG-----------ATSFPQSIAMASSWDPDLLREVNAVIAR 164
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
E R+ + + + +P ++I RDPRWGR +ET GEDP + G+ V+ V G+QG +
Sbjct: 165 EIRS-----RGVSLVL-SPVVDIARDPRWGRIEETYGEDPYLVGEMGVAAVEGLQGKGRS 218
Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
GK+ A KH T + GT A V+ ++L + + PPFE VK+
Sbjct: 219 RLLPPGKV--FATLKHLTGHGQPE-SGTNVGP--APVSERELRENFFPPFEQVVKRTGIE 273
Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
+M +YN ++G+PS A+R LL R +WGF G + SD AV + + A E A
Sbjct: 274 AVMASYNEIDGVPSHANRWLLRDVLRGEWGFRGAVVSDYSAVDQLMNIHHVAADLEQAAG 333
Query: 321 DVLKAGMDVNCGSFLQKHTKA-AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
L AG+D + L T V++ K+ E+ +DRA+ ++ ++ R GLF P+
Sbjct: 334 RALDAGVDADLPDGLSYATLGRQVREGKIGEALVDRAVRHMLELKFRAGLFE-----NPY 388
Query: 380 GKIGAD--VVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTL 437
A + + LAL+AAQ I+LLKN G+LPL S+ A+IGP+A A+
Sbjct: 389 ADAAASEKITNDGRARALALKAAQRSIILLKND-GMLPLKPEGSI--AVIGPSAAVAR-- 443
Query: 438 LGNYAGPSCRSITPLQALQNYVEN---TVYYPG----------CDTVACSSAS-----ID 479
LG Y G S++ L+ ++ V N V+ G D V S + I
Sbjct: 444 LGGYYGQPPHSVSILEGIRAKVGNRAKIVFAQGVRITENDDWWADKVTRSDPAENRRLIA 503
Query: 480 KAVDIAKGADHVVLMMGLDQTQEKEEL------DRVDLVLPGRQQELITRVAEAAKKPVI 533
+AV+ A+ D +VL +G + +E DR L L G QQEL + +A KP+
Sbjct: 504 QAVEAARHVDRIVLTLGDTEQSSREGWADNHLGDRPSLDLMGEQQELFDAL-KALGKPIA 562
Query: 534 LVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDY 593
+VL+ G P + K +IL Y GE G A+A+V+FGD NPGG+LP+T P+
Sbjct: 563 VVLINGRPA--STVKVSEQADAILEGWYLGEQGGHAVADVLFGDVNPGGKLPVT-IPRSA 619
Query: 594 IKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQS 653
++PM ++P A G Y F ++PFG GLSY+ + +S K+ + +
Sbjct: 620 GQLPMF-YNVKPSARRG-----YLFDTTDPLYPFGFGLSYTSFDLSAPRLSAAKIGVGGT 673
Query: 654 SSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNG 713
+ V++ V+N G G V L+V+
Sbjct: 674 TR-----------------------------VSVDVRNSGRREGDEVVQLYVRDKVGSVT 704
Query: 714 RPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGD 767
RPIK+L GFQ V L E + F + P E+L + V+E G ++ G+
Sbjct: 705 RPIKELKGFQRVTLKPGEVRTVTFTVGP-EALQMWNDHMDRVVEPGDFEIMTGN 757
>gi|338729946|ref|YP_004659338.1| glycoside hydrolase family 3 domain-containing protein [Thermotoga
thermarum DSM 5069]
gi|335364297|gb|AEH50242.1| glycoside hydrolase family 3 domain protein [Thermotoga thermarum
DSM 5069]
Length = 774
Score = 264 bits (674), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 238/812 (29%), Positives = 367/812 (45%), Gaps = 173/812 (21%)
Query: 49 TLPISQRARDLVSRLTLDEKISQL----------------VNSAPAI------------- 79
+P RA+DL+S++TL+EKI+QL V A I
Sbjct: 8 NVPAKVRAKDLLSKMTLEEKIAQLGSVWSYELLTEDGKFSVEKAKEILKHGIGQITRPGG 67
Query: 80 -----------------------PRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSF 116
RLGIPA E L G G+G AT F
Sbjct: 68 ATNFEPPEVAKLVNEIQKFLVENTRLGIPAI-MHEECLAGYMGLG-----------ATIF 115
Query: 117 PQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETP 176
PQ I A+++D L +I AI + R L G G+ AP +++ RDPRWGR +ET
Sbjct: 116 PQPIGMASTWDPELVEKITSAIREDLRKL---GITQGL---APVLDVARDPRWGRTEETF 169
Query: 177 GEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD----NWKGTTRYK 232
GE P + K V+YV+G+QG G + A KHF Y NW T
Sbjct: 170 GESPYLVAKMGVAYVKGLQGSDITKGVV-------ATGKHFAGYSASEGGKNWAPTN--- 219
Query: 233 FDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFH 292
+ ++L + + PFE+ VK+ IM +Y+ ++GIP ++R LL+ RR WGF
Sbjct: 220 ----IPPRELREVFLYPFEAAVKEAGLLSIMNSYSEIDGIPCASNRELLTDILRRTWGFE 275
Query: 293 GYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC-GSFLQKHTKAAVKQKKLPES 351
G + SD AV ++ AK+ +A L+AG+D+ + H K+ V+ + E
Sbjct: 276 GIVVSDYFAVDMLAAYHRMAKNKAEAAKYALEAGIDIELPKTDCYLHLKSLVENGIISEK 335
Query: 352 EIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSH 411
+D A+ + +++ LGLF P+ + G+ H +AL+A++ IVLLKN +
Sbjct: 336 LLDEAVLRVLTLKFLLGLFE-----NPYAEGGS----LNDHNDIALEASRKSIVLLKN-N 385
Query: 412 GLLPLPKSKSVSLALIGPNANSAKTLLGNYAG--------PSCRSIT------PLQALQN 457
G+LPL V +AL+GP AN + LLG+Y+ + + T L+ ++
Sbjct: 386 GILPL--KNDVKIALVGPTANDVRNLLGDYSYLVHIKTLLENVNATTFNAPKFNLKKVEE 443
Query: 458 YVE-----------------NTVYYP-GCDTVACSSASIDKAVDIAKGADHVVLMMG--- 496
VE +YY GCD S +A++ AK AD VV ++G
Sbjct: 444 LVEAHLKKIPSILAEFAKRAKEIYYAKGCDITDPSREGFAEALEAAKKADVVVAVVGDRS 503
Query: 497 --LDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIG 554
+ E D +L LPG Q+E I + + K P+++VL+ G P + + +
Sbjct: 504 GLTIECTSGESRDMANLKLPGVQEEFILELTKVGK-PIVVVLVTGRPY--SLKAFVNKVN 560
Query: 555 SILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGR 614
+I+ PGE GA ALAEVIFG NP G+LP++ +P ++P+ ++P
Sbjct: 561 AIVQLWLPGETGAQALAEVIFGQVNPSGKLPIS-FPASAGQIPVFHY-VKPSGGRSCWHG 618
Query: 615 TYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELG 674
Y + K +FPFG GLSY+ +SY + Q ++ + S K
Sbjct: 619 NYVDEDVKPLFPFGHGLSYTTFSYTNLRIEQQEIPIAGSVKLK----------------- 661
Query: 675 TEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAE 734
+ V+N GE+ G+ V L++ RP+K+L GF V L KE
Sbjct: 662 ------------VDVQNTGEIYGEEVVQLYISREHASVTRPVKELKGFARVKLQPKETKT 709
Query: 735 IVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
+VFE+ + L+ D +V+E G + +++G
Sbjct: 710 VVFEIH-TDVLAYYDRDMKLVVEPGEYKVLIG 740
>gi|340616359|ref|YP_004734812.1| xylosidase/arabinosidase [Zobellia galactanivorans]
gi|339731156|emb|CAZ94420.1| Xylosidase/arabinosidase, family GH3 [Zobellia galactanivorans]
Length = 801
Score = 264 bits (674), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 229/810 (28%), Positives = 358/810 (44%), Gaps = 146/810 (18%)
Query: 49 TLPISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WSE---------- 92
T P+ R DL+S++TL+EK Q+ + + +P +W W +
Sbjct: 48 TRPVDLRIEDLLSQMTLEEKSCQMATLYGFGRVLKDELPTPDWKNQIWKDGIGNIDEQLN 107
Query: 93 --ALHGVAGVGK------------------------GI---FFNGTIRG-----ATSFPQ 118
A H A K GI F N IRG ATSFP
Sbjct: 108 NLAYHPSAVTDKAWPPSNHIKALNTIQEFFVEDTRLGIPVDFTNEGIRGLCHEKATSFPS 167
Query: 119 VILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPG 177
+ A+++ L +IG G EAR L G T ++P ++I RDPRWGR E G
Sbjct: 168 QLGVGATWNKNLVGKIGHITGKEARLL-------GYTNVYSPILDIARDPRWGRVVECYG 220
Query: 178 EDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARV 237
EDP + G+ V+G+Q + + + KHF Y + DA +
Sbjct: 221 EDPYLVGELGYQMVKGIQQE-----------KVVSTPKHFAIYSAPKGGRDGDARTDAHI 269
Query: 238 TMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITS 297
T ++L Y PF+ +K A G+M +YN NG+P + + L+ R WGF GY+ S
Sbjct: 270 TERELFSLYLHPFKRAIKDAGAMGVMSSYNDYNGVPVSSSKYFLNDILREDWGFKGYVVS 329
Query: 298 DCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKH----TKAAVKQKKLPESEI 353
D AV I D AK +DAV + AG++V + + + VK+ L + I
Sbjct: 330 DSRAVEFIADKHHVAKDRKDAVRQAVLAGLNVRTDFTMPEDFILPVRELVKEGGLDMATI 389
Query: 354 DRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD---VVCSPAHQVLALQAAQDGIVLLKNS 410
D + ++ V+ GLF+ P+GK + V P +Q +A QA+ + IVLLKN
Sbjct: 390 DDRVRDILRVKFWQGLFDA-----PYGKQMKEADKTVGKPEYQEVAYQASLESIVLLKNE 444
Query: 411 HGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSIT----------PLQALQNYVE 460
+LPL SK S+ + GPNA + + Y GPS + P Y +
Sbjct: 445 ENILPLDFSKYKSVLVTGPNAKAINHSVSRY-GPSHIDVVSVFDGIKEKFPKDVEIKYTK 503
Query: 461 NTVYY----PGCDTV-----ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDL 511
V++ P + + + IDKAV +AK ++++G D+ E R L
Sbjct: 504 GCVFFDENWPDSELMNTPPTEAEQSEIDKAVAMAKTVGLAIVVLGDDEETVGESRSRTSL 563
Query: 512 VLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALA 571
LPG QQ+L+ + + PVI+VL+ G P+ I + D+ + I+ + G+ G A+A
Sbjct: 564 DLPGNQQKLVEEIYKTG-TPVIVVLINGRPMTINWV--DKYVPGIVEGWFQGKFGGSAIA 620
Query: 572 EVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGR----TYRFYEGKEVFPF 627
+V+ G +NPGG+LP++ +P+ ++PM + +P A + P + + + G ++PF
Sbjct: 621 DVLVGSYNPGGKLPVS-FPKTVGQLPM-NFPSKPGAQADQPAKGPNGSGKTRVGGFLYPF 678
Query: 628 GCGLSYSKYSY---KFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFL 684
G GLSY+ + Y K ++ +N L DVV
Sbjct: 679 GYGLSYTTFEYTNLKIRSNIKNGL-------------GDVV------------------- 706
Query: 685 VTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCES 744
V++ + N G+ G V L+ KQL GF+ + L A E + F LSP E
Sbjct: 707 VSVDITNSGKRKGDEIVQLYFSDETSSVTVYEKQLRGFERISLEAGETKTVNFTLSP-ED 765
Query: 745 LSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
LS V+E G+ +++G I +
Sbjct: 766 LSLYNRQMEFVLEPGSFTIMIGSSAEDIHV 795
>gi|423301451|ref|ZP_17279475.1| hypothetical protein HMPREF1057_02616 [Bacteroides finegoldii
CL09T03C10]
gi|408472052|gb|EKJ90581.1| hypothetical protein HMPREF1057_02616 [Bacteroides finegoldii
CL09T03C10]
Length = 781
Score = 263 bits (673), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 208/713 (29%), Positives = 332/713 (46%), Gaps = 102/713 (14%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIP + EA HG +G +F G AT PQ+I +G+AIG
Sbjct: 129 RLGIPLF-LAEEAPHGHMAIGTTVFPTGIGMAATWSPQLI-----------NEVGKAIGK 176
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
E R G + P +++ RDPRW R +ET GEDP++TG+ + V G+
Sbjct: 177 EIRL------QGGHISYGPVLDLARDPRWSRVEETFGEDPVLTGEIGKAMVAGL-----G 225
Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
G L A KHF AY + F M++L + + PPF + G A
Sbjct: 226 SGDLSRPYSTLATLKHFLAYGISESGQNGNPSF---AGMRELHENFLPPFGQAINAG-AL 281
Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
+M +YN ++G P A+ LL++ R W F G + SD ++ I+ + A + ++A V
Sbjct: 282 SVMTSYNSMDGTPCTANHYLLTELLRDDWKFKGVVVSDLYSIEGIHQSHFVASTMKEAAV 341
Query: 321 DVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFG 380
L AG+D++ G + AV +K++ + +D A+ + ++ +GLF NP + P G
Sbjct: 342 MALSAGVDIDLGGDAYMNLMDAVNRKEISKEILDAAVSRVLRLKFEMGLFE-NPYVDP-G 399
Query: 381 KIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGN 440
K + V S + LA Q AQ I LLKN H LLPL +S+ +ALIGPNA++ +LG+
Sbjct: 400 KAKKE-VRSKEYVALARQVAQASITLLKNEHSLLPL--DRSMKVALIGPNADNRYNMLGD 456
Query: 441 YAGPSCRS--ITPLQALQNYVENTV--YYPGCDTVACSSASIDKAVDIAKGADHVVLMMG 496
Y P T L ++ + ++ Y GC ++ I++AV A+ ++ V+ ++G
Sbjct: 457 YTAPQEEENVKTVLDGIRAKLSSSQVEYVKGCSIRDTVTSDIEQAVAAARRSEVVIAVVG 516
Query: 497 -----------------------LDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVI 533
+ + E DR L L G+QQEL+ + +A KP+I
Sbjct: 517 GSSARDFKTSYKETGAAIADEKTISDMECGEGFDRATLSLLGKQQELL-KALKATGKPLI 575
Query: 534 LVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDY 593
+V + G P+D +A N ++L A YPG+ G A+A+V+FG+ NP GRLP + P+
Sbjct: 576 VVYIEGRPLDKNWA--SENADALLTAYYPGQEGGNAIADVLFGEFNPAGRLPFS-VPRSV 632
Query: 594 IKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQS 653
+VP+ K PQ+ Y ++ FG GLSY+ + Y
Sbjct: 633 GQVPVYYNKKAPQS------HDYVEVSASPLYSFGYGLSYTTFEYS-------------- 672
Query: 654 SSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNG 713
+L F V+ ++N G+ G+ V L+++
Sbjct: 673 ------------------DLHLSALTPHSFEVSCKIRNTGKYDGEEVVQLYLRDEYASVV 714
Query: 714 RPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
+P+KQL F + L E+ ++ F LS E + + V+E GT +++G
Sbjct: 715 QPLKQLKHFARLFLKCGEEQKVKFILSE-EDFALVDRNLKRVVEPGTFQVMIG 766
Score = 40.4 bits (93), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 16/31 (51%), Positives = 23/31 (74%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQLV 73
FP+ TLP+ R DL+SR+TL+EK+ QL+
Sbjct: 27 FPYKNPTLPVETRVADLLSRMTLEEKVGQLL 57
>gi|256819849|ref|YP_003141128.1| glycoside hydrolase [Capnocytophaga ochracea DSM 7271]
gi|256581432|gb|ACU92567.1| glycoside hydrolase family 3 domain protein [Capnocytophaga
ochracea DSM 7271]
Length = 804
Score = 263 bits (673), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 229/798 (28%), Positives = 369/798 (46%), Gaps = 131/798 (16%)
Query: 7 SLVFPLLCLCFTSLLTRVDS---TQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRL 63
+L+ L+ C T + + D+ TQ FS D I QR ++ +
Sbjct: 44 TLISLLMIGCATPAILKSDNEKLTQKSFSGDKH---------------IEQRVDSVLRLM 88
Query: 64 TLDEKISQLVN-------SAPAIPRLGIPAYEWWSEALHG-------VAGVGK------- 102
TL+EKI Q+ + P + P E + L G VAG+ K
Sbjct: 89 TLEEKIGQMTQFSADWSVTGPVMADKYQPYLE---KGLVGSIFNATSVAGIRKLQKIAVE 145
Query: 103 ----GI---FFNGTIRG-ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGM 154
GI F I G T FP + + S+D L + + EA A I
Sbjct: 146 QTRLGIPILFGQDVIHGYKTIFPIPLAESCSWDLALMRKTAELAAREASA-----DGINW 200
Query: 155 TFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACC 214
TF AP ++I RD RWGR E GEDP + A + V+G QG N L AC
Sbjct: 201 TF-APMVDITRDARWGRAMEGAGEDPYLGSLIAEARVKGFQGGD-NWQTLSSPHTLLACG 258
Query: 215 KHFTAYDLDNWKGTTRYKFD---ARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
KHF Y G D A ++M L + Y PP+E+ +K G S IM + N +NG
Sbjct: 259 KHFAGY------GAAESGKDYNTAELSMHTLRNVYLPPYEATLKAGVGS-IMASLNEING 311
Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAG--MDV 329
+P+ AD+ LL++ R++WGF+G + SD ++ + G AK + AG MD+
Sbjct: 312 VPATADKWLLTEVLRKEWGFNGLLVSDYTGINELV-RHGVAKDDKQVANLSANAGIEMDM 370
Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
N +F+ K+ A VK+ K+ E++ID+A+ ++ ++ LGLF+ +P + +
Sbjct: 371 NGATFI-KYLSALVKEGKVTENQIDKAVRHILEMKFLLGLFD-DPYRYLDETRAKENTFT 428
Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA--GPSCR 447
+ +A QA +VLLKN LP+ K+ ++A+IGP N+ + G++ G +
Sbjct: 429 EEYLKVARQAVASSVVLLKNEAEALPIKKNSDKTIAVIGPMMNNTSDINGSWTCLGDGKQ 488
Query: 448 SITPLQALQNYVENT----VYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEK 503
S++ L L + T +Y GC S+ + +AV IA+ AD V++ +G +
Sbjct: 489 SVSLLTGLTEKYKGTNVKLLYAEGCGFTTISTEQLKEAVAIARKADRVLVAVGEQSSWAG 548
Query: 504 EELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPG 563
E R D+ LP Q++L+ + +A KP+ ++ G P+D+++ + N+ +IL A +PG
Sbjct: 549 ESAVRTDIRLPQAQRQLLEAL-KAINKPIAIITFSGRPLDLSWE--NENVQAILQAWFPG 605
Query: 564 EAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPM---------------TDMKMRPQAT 608
G +A+VI GD NP G L M+ +P+ ++P+ ++ RP
Sbjct: 606 TQGGNGIADVIAGDVNPSGHLTMS-FPRSVGQIPIYYNYKSTGRPVHTNNEEVDHRPHYN 664
Query: 609 SGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYK 668
+G Y ++PFG GLSY+ + A+S ++LN+ S + ++
Sbjct: 665 AG-----YLDSSITPLYPFGYGLSYTTF-----AISN--VHLNKKSIKRYNDS------- 705
Query: 669 SVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILN 728
+V V+N G G+ V L+ + RP+K+L GFQ + L
Sbjct: 706 --------------IIVNASVQNTGRTEGEIVVQLYTRQLVASVSRPVKELKGFQKIPLK 751
Query: 729 AKEKAEIVFELSPCESLS 746
A E ++ FEL P E+L+
Sbjct: 752 AGESKQVRFEL-PSEALA 768
>gi|423305378|ref|ZP_17283377.1| hypothetical protein HMPREF1072_02317 [Bacteroides uniformis
CL03T00C23]
gi|423311197|ref|ZP_17289166.1| hypothetical protein HMPREF1073_03916 [Bacteroides uniformis
CL03T12C37]
gi|392679729|gb|EIY73108.1| hypothetical protein HMPREF1073_03916 [Bacteroides uniformis
CL03T12C37]
gi|392681368|gb|EIY74727.1| hypothetical protein HMPREF1072_02317 [Bacteroides uniformis
CL03T00C23]
Length = 864
Score = 263 bits (673), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 174/458 (37%), Positives = 240/458 (52%), Gaps = 36/458 (7%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
+ +LP +RA L+ LTL+EK+S +++S+ + RLGI Y WW+EALHGVA G
Sbjct: 22 YKNASLPAEERAGLLLKELTLEEKVSLMMDSSKPVERLGIKPYNWWNEALHGVARAGL-- 79
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA--LYNAGQAI-----GMTFW 157
AT FPQ I AASF Y + A+ EARA Y A Q G+T W
Sbjct: 80 --------ATVFPQPIGMAASFSPETVYEVFTAVSDEARAKNAYYASQGSRERYQGLTMW 131
Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
P +NI+RDPRWGRG ET GEDP +T + V V+G+QG N GK + AC KHF
Sbjct: 132 TPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGT--NDGKYD---KLHACAKHF 186
Query: 218 TAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
+ W R++F+A + +DL +TY PPFE+ VK+G+ +MCAYNR G P C
Sbjct: 187 AVHSGPEW---NRHEFNAENIKPRDLYETYLPPFEALVKEGKVKEVMCAYNRFEGDPCCG 243
Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAK--SPEDAVVDVLKAGMDVNCGSF 334
LL + R +WGF G + SDC A++ Y G+ E A + +G D+ CGS
Sbjct: 244 SDRLLMQILRDEWGFDGIVLSDCGAIADFYRDYGHKTHLDAESASAAAVLSGTDLECGSS 303
Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
+ + AVKQ K+ E +D A+ L + R LG + P + I VV S H
Sbjct: 304 YEALVE-AVKQGKIDEKAVDVAVKRLLTARFALGEMD-EPEKVSWTGIPFSVVASAGHDS 361
Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
LAL A+ + LL N LPL K +++A++GPNAN + GNY G ++T L
Sbjct: 362 LALDMARKSMTLLMNKDNTLPL-KRGGLTIAVMGPNANDSVMQWGNYNGMPPHTVTILDG 420
Query: 455 LQNYV---ENTVYYPGCDTVACSSASIDKAVDIAKGAD 489
++ + + +Y GC V A I + K AD
Sbjct: 421 IRKALGSDDRLIYEQGCGWV--ERAQIQSVFNRCKTAD 456
Score = 122 bits (305), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 94/304 (30%), Positives = 145/304 (47%), Gaps = 53/304 (17%)
Query: 474 SSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITR 523
+ + K+V+ K AD VV + G+ E EE+ DR D+ LP Q+ELI
Sbjct: 586 NEVDVRKSVERVKDADIVVFVGGVSPNLEGEEMGVELPGFRGGDRTDIELPAVQRELIAA 645
Query: 524 VAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGR 583
+ A KK V+LV G P I G+IL A YPG+AG A+AEV+FGD+NP GR
Sbjct: 646 LHRAGKK-VVLVNCSGSP--IGLEPETGRCGAILQAWYPGQAGGTAVAEVLFGDYNPAGR 702
Query: 584 LPMTWYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKA 642
LP+T+Y ++ ++P D M GRTYR+ + +FPFG GLSY+ +SY
Sbjct: 703 LPVTFY-RNVSQLPDFEDYNM--------TGRTYRYMTQEPLFPFGHGLSYTSFSYGAVV 753
Query: 643 VSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVL 702
+ + + + K +++ V N G+ G+ V
Sbjct: 754 LGSDNIKSGE-----------------------------KLRLSVPVTNTGKCDGEEVVQ 784
Query: 703 LFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHF 762
+++K G P K L F+ V + A + ++ F+L E + + M + EG++
Sbjct: 785 VYLKKNDDVEG-PSKALRAFKRVHIPAGKTVDVEFDLGDKELVWWNPQSNTMCVSEGSYE 843
Query: 763 LVVG 766
L+VG
Sbjct: 844 LMVG 847
>gi|94442924|emb|CAJ91136.1| beta-xylosidase [Platanus x acerifolia]
Length = 231
Score = 263 bits (673), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 119/234 (50%), Positives = 166/234 (70%), Gaps = 5/234 (2%)
Query: 187 AVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTY 246
A SYVRG+Q +LK +ACCKH+TAYDLDNWKG R+ F+ARV+ QDL DT+
Sbjct: 1 AASYVRGLQQPYGRDSRLK----VAACCKHYTAYDLDNWKGIDRFHFNARVSKQDLEDTF 56
Query: 247 QPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIY 306
PF CV +G+ + +MC+YN+VNGIP+CAD NLL T R +W +GYI +DCD++ ++Y
Sbjct: 57 NVPFRECVVEGKVASVMCSYNQVNGIPTCADPNLLRNTIRGEWRLNGYIVTDCDSIGVLY 116
Query: 307 DAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMR 366
D + Y +PE++ D +KAG+D++CG FL HT+ A+ ++ L E +D AL N +V+MR
Sbjct: 117 DTQHYTSTPEESAADAIKAGVDLDCGPFLAVHTQEAITRRMLSEVYVDGALANTLAVQMR 176
Query: 367 LGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSK 420
LG+F+G P+ QPFG +G VC+ AHQ LAL+AA+ GIVL+KN G LPL ++
Sbjct: 177 LGMFDGEPSAQPFGHLGPRDVCTSAHQQLALEAARQGIVLMKN-QGSLPLSTAR 229
>gi|319643197|ref|ZP_07997825.1| glycoside hydrolase family 3 [Bacteroides sp. 3_1_40A]
gi|345520511|ref|ZP_08799899.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
4_3_47FAA]
gi|254835034|gb|EET15343.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
4_3_47FAA]
gi|317385101|gb|EFV66052.1| glycoside hydrolase family 3 [Bacteroides sp. 3_1_40A]
Length = 788
Score = 263 bits (673), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 214/761 (28%), Positives = 345/761 (45%), Gaps = 131/761 (17%)
Query: 51 PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WSEALHGVAGVGKGI 104
P+ R +DL+S++TL+EK Q+ + + + +P W W + + + G+
Sbjct: 49 PLEDRVQDLLSQMTLEEKTCQMATLYGSGRVLKDALPQDNWKTEVWKDGIGNIDEEHNGL 108
Query: 105 ------------------------------------FFNGTIRG-----ATSFPQVILTA 123
F N IRG AT FP
Sbjct: 109 GAFKSEYSFPYAKHVDAKHTIQRWFVEKTRLGIPVDFTNEGIRGLCHDRATYFPAQCGQG 168
Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGEDPLV 182
A+++ L RIG+ EA AL G T ++P ++I +DPRWGR ET GEDP +
Sbjct: 169 ATWNKKLIARIGEVEAKEAVAL-------GYTNIYSPILDIAQDPRWGRCVETYGEDPYL 221
Query: 183 TGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDL 242
G+ + +Q K A KHF Y + + + D V +++
Sbjct: 222 VGELGKQMITSLQ-----------KYNLVATPKHFAVYSIPIGGRDGKTRTDPHVAPREM 270
Query: 243 ADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAV 302
Y PF ++ A G+M +YN +G P + L++ R++WGF GY+ SD +AV
Sbjct: 271 RTLYIEPFRMAFQEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKGYVVSDSEAV 330
Query: 303 SIIYDAEGYAKSPEDAVVDVLKAGMDVNC-----GSFLQKHTKAAVKQKKLPESEIDRAL 357
I + A + ED + + AG+++ F+ K AV K+ + +D+ +
Sbjct: 331 EFISNKHKVADTYEDGIAQAVNAGLNIRTHFTPPADFILPLRK-AVDDGKISQETLDKRV 389
Query: 358 HNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLP 417
+ ++ LGLF+ NP + GK +V S HQ ++L+AA+ +VLLKN LLPL
Sbjct: 390 AEILRIKFWLGLFD-NP-YRGNGKQAEQIVHSKEHQAVSLEAARQSLVLLKNETHLLPLS 447
Query: 418 KSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN--TVYYPGCDTV---- 471
KS S+A+IGPNA+ L+ Y + T Q ++ + + +Y GCD +
Sbjct: 448 KSIR-SIAVIGPNADEQTQLICRYGPANAPIKTVYQGIKELLPHAEVIYKKGCDIIDPHF 506
Query: 472 -----------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQEL 520
A + + + AK A+ VV+++G ++ +E+ R L LPGRQ+EL
Sbjct: 507 PESEILDFPKTAEEVRLMQEVIRAAKQAEVVVMVLGGNELTVREDRSRTSLNLPGRQEEL 566
Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
+ V A KPVILV+L G I +A ++ +IL A +PGE A+AE +FGD+NP
Sbjct: 567 LKAVC-ATGKPVILVMLDGRASSINYAA--AHVPAILHAWFPGEFCGQAVAEALFGDYNP 623
Query: 581 GGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKF 640
GGRL +T +P+ ++P +P + + Y ++PFG GLSY+ ++Y
Sbjct: 624 GGRLAVT-FPKSVGQIPFA-FPFKPGSDESSSTSVY-----GALYPFGHGLSYTTFTYS- 675
Query: 641 KAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHP 700
L+ S S + V+ +H V+ +KN G++ G
Sbjct: 676 --------DLHISPSHQGVQGD--IH------------------VSCKIKNTGKIKGDEV 707
Query: 701 VLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSP 741
V L+++ K L GF+ + L A E+ + F L P
Sbjct: 708 VQLYLRDEISSVTTYTKVLRGFERISLKAGEEQTVHFRLRP 748
>gi|431797765|ref|YP_007224669.1| beta-glucosidase-like glycosyl hydrolase [Echinicola vietnamensis
DSM 17526]
gi|430788530|gb|AGA78659.1| beta-glucosidase-like glycosyl hydrolase [Echinicola vietnamensis
DSM 17526]
Length = 799
Score = 263 bits (672), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 213/723 (29%), Positives = 350/723 (48%), Gaps = 112/723 (15%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIP E HG +G T FP I A++++ L + AI L
Sbjct: 137 RLGIPMM-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPALIQEMAAAIAL 184
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
EAR G IG + P +++ R+PRW R +ET GEDP + + + V G QG++
Sbjct: 185 EARL---QGGHIG---YGPVLDLAREPRWSRVEETYGEDPYINSQMGRAMVSGFQGESIA 238
Query: 201 GGKLKGKLQASACCKHFTAYDL--DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGR 258
GK + KHFTAY + GT+ V ++L ++Y PPF++ V +G
Sbjct: 239 SGK-----NVISTLKHFTAYGVPEGGHNGTS-----VSVGQRELHESYLPPFKAAVAEGA 288
Query: 259 ASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDA 318
S +M AYN ++G+P ++ +LL+ R WGF+G++ SD ++S + + ++ E A
Sbjct: 289 LS-VMTAYNSIDGVPCTSNGHLLNDVLRDDWGFNGFVVSDLGSISGLRGSHHVTETAEGA 347
Query: 319 VVDVLKAGMDVNCGSF-LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQ 377
+ AG+D + G + K+ AAV+ + + +D A+ + V+ +GLF NP +
Sbjct: 348 AQLAINAGVDSDLGGYGFGKNLLAAVQAGGVSQEVLDEAVRRVLKVKFDMGLFE-NPYVD 406
Query: 378 PFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTL 437
P +V S H LA + A++ +VLLKN + LLPL K K S+A+IGPNA++
Sbjct: 407 P--SKAESLVRSAKHIALARKVARESVVLLKNENDLLPLRK-KVNSIAVIGPNADNTYNQ 463
Query: 438 LGNYAGPSCRS--ITPLQALQNYVENTV---YYPGCDTVACSSASIDKAVDIAKGADHVV 492
LG+Y P +T L+ ++N V V Y GC + + I KA +A +D V
Sbjct: 464 LGDYTAPQPNENVVTVLEGIKNKVGKDVRVNYVKGCAIRDTTQSEIGKAASLAARSDVAV 523
Query: 493 LMMG---------------------------LDQTQEKEELDRVDLVLPGRQQELITRVA 525
+++G + + E DR+ L L G Q +L+ V
Sbjct: 524 VVLGGSSARDFDTEYEETAAAKVSEAEEGQVISDMESGEGFDRMTLDLLGDQLKLVQAV- 582
Query: 526 EAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLP 585
+A PV++VL+ G P+++ + D ++ +I+ A YPG+ G A+A+V+FGD+NP GRL
Sbjct: 583 QATGTPVVVVLIKGRPLNLNW--IDEHVPAIVDAWYPGQEGGNAIADVLFGDYNPSGRLT 640
Query: 586 MTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEG--KEVFPFGCGLSYSKYSYKFKAV 643
I VP + ++ NP R + + EG + ++ FG GLSY+ + Y V
Sbjct: 641 --------ISVPRSVGQLPVFYNYRNPKR-HDYVEGSAEPLYAFGHGLSYADFEYDNLEV 691
Query: 644 SQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLL 703
+ + + + + V VH++ V N + G+ V L
Sbjct: 692 TASGM-----AGSPTVR----VHFQ--------------------VSNISNVDGEEVVQL 722
Query: 704 FVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFL 763
+V+ RP+ +L F+ V++ A E ++I F L+ E L +D ++E G+ +
Sbjct: 723 YVRDEAGSTVRPLLELKRFEKVMVPAGESSKITFMLT-AEDLQVLGQDMNWLVEPGSFQV 781
Query: 764 VVG 766
+VG
Sbjct: 782 LVG 784
>gi|423302093|ref|ZP_17280116.1| hypothetical protein HMPREF1057_03257 [Bacteroides finegoldii
CL09T03C10]
gi|408471184|gb|EKJ89716.1| hypothetical protein HMPREF1057_03257 [Bacteroides finegoldii
CL09T03C10]
Length = 1039
Score = 263 bits (672), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 232/796 (29%), Positives = 358/796 (44%), Gaps = 135/796 (16%)
Query: 51 PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----W-------SEALHGV 97
PI R DL+S++TL+EK Q+V + + +P EW W E L+G
Sbjct: 151 PIDARIEDLLSQMTLEEKTCQMVTLYGYKRVLKDDLPTPEWKNQLWKDGIGAIDEHLNGF 210
Query: 98 AGVGK-------------------------------GI---FFNGTIRG-----ATSFPQ 118
G GI F N IRG AT+FP
Sbjct: 211 QQWGLPPSDNEYVWPASKHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVESYKATNFPT 270
Query: 119 VILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPG 177
+ +++ L +IG G EAR L G T +AP +++ RD RWGR +E G
Sbjct: 271 QLGLGHTWNRQLLRQIGLITGREARML-------GYTNVYAPILDVGRDQRWGRYEEVYG 323
Query: 178 EDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARV 237
E P + + + V+G+Q + Q +A KHF AY + + D ++
Sbjct: 324 ESPYLVAELGIEMVKGMQHNH----------QVAATGKHFIAYSNNKGAREGMARVDPQM 373
Query: 238 TMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITS 297
+ +++ + PF+ +++ G+M +YN +G P + L+ R GF GY+ S
Sbjct: 374 SPREVEMIHVYPFKRVIREAGLLGVMSSYNDYDGFPIQSSYYWLTTRLRGDMGFRGYVVS 433
Query: 298 DCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT-----KAAVKQKKLPESE 352
D DAV +Y G AK ++AV ++AG+++ C +F + + VK+ +L E
Sbjct: 434 DSDAVEYLYTKHGTAKDMKEAVRQSVEAGLNIRC-TFRSPDSYVLPLRELVKEGELSEEI 492
Query: 353 IDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD-VVCSPAHQVLALQAAQDGIVLLKNSH 411
I+ + ++ V+ +GLF+ +P GAD V +++ +ALQA+++ IVLLKN
Sbjct: 493 INDRVRDILRVKFLVGLFD-HPYQTDLK--GADEEVEKASNEEIALQASRESIVLLKNDK 549
Query: 412 GLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV---ENTVYYPGC 468
+LPL S +A+ GPNA+ L +Y + + L+ +Q + +Y GC
Sbjct: 550 NVLPLNASTIKKIAVCGPNADEHSYALTHYGPLAVEVTSVLKGIQEKLGGKAEVLYTKGC 609
Query: 469 DTVACS---------------SASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVL 513
+ V + I+KAV K AD V+++G Q E R L L
Sbjct: 610 ELVDANWPESELMEYPLSENEQEEIEKAVSQTKQADVAVVVLGGGQRTCGENKSRSSLAL 669
Query: 514 PGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEV 573
PGRQ +L+ V A KPV+LVL+ G P+ I +A D+ + +IL A YPG G A+A+V
Sbjct: 670 PGRQLDLLKAVV-ATGKPVVLVLINGRPLSINWA--DKFVPAILEAWYPGSKGGKAVADV 726
Query: 574 IFGDHNPGGRLPMTWYPQDYIKVPMT-DMKMRPQATSG-NPGRTYRFYE-GKEVFPFGCG 630
+FGD+NPGG+L +T +P+ ++P K Q G NPG ++PFG G
Sbjct: 727 LFGDYNPGGKLTVT-FPKTVGQIPFNFPCKPSSQIDGGKNPGLNGNMSRVNGALYPFGFG 785
Query: 631 LSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVK 690
LSY+ + Y +S + NQ K VT V
Sbjct: 786 LSYTTFEYSDLKISPAIITPNQ-----------------------------KTYVTCKVT 816
Query: 691 NHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRARE 750
N G+ AG V L+V+ K L GF+ V L E EI F + ++L
Sbjct: 817 NTGKRAGDEVVQLYVRDVLSSVTTYEKNLAGFERVHLKPGETKEITFPIDR-KALELLNA 875
Query: 751 DGLMVIEEGTHFLVVG 766
D V+E G L++G
Sbjct: 876 DMHWVVEPGEFTLMIG 891
>gi|167517889|ref|XP_001743285.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163778384|gb|EDQ91999.1| predicted protein [Monosiga brevicollis MX1]
Length = 521
Score = 263 bits (672), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 191/541 (35%), Positives = 266/541 (49%), Gaps = 37/541 (6%)
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG-------QAIGMTFWAPNINIFRD 166
++FP + AASF++ LW+ G + E RA N I T + PNINI RD
Sbjct: 1 STFPGPLGMAASFNASLWFAKGDVLATELRAFSNTNWHRSNVDNQIQYTGFGPNINIARD 60
Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGV-QGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
PR+GR E PGEDP ++G YA VRG+ Q D G + A KHFTAY +
Sbjct: 61 PRFGRTSELPGEDPYLSGTYATHMVRGMMQADA------AGHPKMLAYLKHFTAYSTE-- 112
Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
T R D V+ DL DTY P +E + + +G MC+YN VNG PSCA+ +L
Sbjct: 113 --TNRQHSDFNVSSHDLWDTYLPQYEMAFRSAQPAGAMCSYNAVNGRPSCANGYILRDVL 170
Query: 286 RRQWGF-HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVK 344
R QW + +ITSDC A+S + A +A A L G D+ GS + AV
Sbjct: 171 RNQWQQPNAHITSDCGAISSLRGAPVFAPDDATAAAVALNNGTDLEMGSQVYASLAEAVA 230
Query: 345 QKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGI 404
+ + ++ A + R G F+ T++ + G + S A Q L +A +
Sbjct: 231 RNLTSSTLVEEAFRRAARILFRGGRFDPPATVE-WNAYGVQDINSSATQALVHEATAQSL 289
Query: 405 VLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSIT----PLQALQNYVE 460
VLL+N HG+LPL + V A++GP LL +YA C T P
Sbjct: 290 VLLQNRHGILPLAPGQRV--AVVGPLVERGDALLSDYASLVCYDGTYDCIPTLGASVTAA 347
Query: 461 N----TVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGR 516
N T PG D + +S+ + AV A+ AD VV +G D+T E+E LDRV+L LP
Sbjct: 348 NKGGATTVVPGVDVNSNNSSGLAAAVAAAQAADVVVAFLGTDKTIEREGLDRVNLTLP-G 406
Query: 517 QQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFG 576
Q L+ A PV+L+L GGP+ I Y +++ G+ GA +A+ +FG
Sbjct: 407 LQGLLLDQLLATGTPVVLLLNNGGPLAIE--SYLNRTAAVMETFNAGQFGATVMAKALFG 464
Query: 577 DHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKY 636
N G+LP T YP Y+ T+ M + PGRTYR+ VFPFG GLSY+ +
Sbjct: 465 QVNNFGKLPYTVYPAGYV----TEQAMNNYDMALYPGRTYRYLVQAPVFPFGFGLSYTTF 520
Query: 637 S 637
+
Sbjct: 521 N 521
>gi|294146775|ref|YP_003559441.1| beta-glucosidase [Sphingobium japonicum UT26S]
gi|292677192|dbj|BAI98709.1| beta-glucosidase [Sphingobium japonicum UT26S]
Length = 791
Score = 263 bits (672), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 228/714 (31%), Positives = 337/714 (47%), Gaps = 100/714 (14%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIP + E LHG A VG ATSFPQ I A+S+D L + I
Sbjct: 137 RLGIPIL-FHEEGLHGYAAVG-----------ATSFPQSIAMASSWDPDLLREVNAVIAR 184
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
E R+ + + + +P ++I RDPRWGR +ET GEDP + G+ V+ V G+QG +
Sbjct: 185 EIRS-----RGVSLVL-SPVVDIARDPRWGRIEETYGEDPYLVGEMGVAAVEGLQGKGRS 238
Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
GK+ A KH T + GT A V+ ++L + + PPFE VK+
Sbjct: 239 RLLPPGKV--FATLKHLTGHGQPE-SGTNVGP--APVSERELRENFFPPFEQVVKRTGIE 293
Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
+M +YN ++G+PS A+R LL R +WGF G + SD AV + A E A
Sbjct: 294 AVMASYNEIDGVPSHANRWLLRDVLRGEWGFRGAVVSDYSAVDQLMSIHHVAADLEQAAG 353
Query: 321 DVLKAGMDVNCGSFLQKHTKA-AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
L AG+D + L T V++ K+ E+ +DRA+ ++ ++ R GLF P+
Sbjct: 354 RALDAGVDADLPDGLSYATLGRQVREGKIGEALVDRAVRHMLELKFRAGLFE-----NPY 408
Query: 380 GKIGAD--VVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTL 437
A + + LAL+AAQ I+LLKN G+LPL S+ A+IGP+A A+
Sbjct: 409 ADAAASEKITNDARARALALKAAQRSIILLKND-GMLPLKPEGSI--AVIGPSAAVAR-- 463
Query: 438 LGNYAGPSCRSITPLQALQNYVEN---TVYYPG----------CDTVACSSAS-----ID 479
LG Y G S++ L+ ++ V N V+ G D V S + I
Sbjct: 464 LGGYYGQPPHSVSILEGIRAKVGNRAKIVFAQGVRITENDDWWADKVTRSDPAENRRLIA 523
Query: 480 KAVDIAKGADHVVLMMGLDQTQEKEEL------DRVDLVLPGRQQELITRVAEAAKKPVI 533
+AV+ A+ D +VL +G + +E DR L L G QQEL + +A KP+
Sbjct: 524 QAVEAARHVDRIVLTLGDTEQSSREGWADNHLGDRPSLDLVGEQQELFDAL-KALGKPIA 582
Query: 534 LVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDY 593
+VL+ G P + K +IL Y GE G A+A+V+FGD NPGG+LP+T P+
Sbjct: 583 VVLINGRPA--STVKVSEQADAILEGWYLGEQGGHAVADVLFGDVNPGGKLPVT-IPRSA 639
Query: 594 IKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQS 653
++PM ++P A G Y F ++PFG GLSY+ + +S K+ +
Sbjct: 640 GQLPMF-YNVKPSARRG-----YLFDTTDPLYPFGFGLSYTSFDLSAPRLSAAKISVGGM 693
Query: 654 SSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNG 713
+ V++ V+N G G V L+V+
Sbjct: 694 TR-----------------------------VSVDVRNSGRREGDEVVQLYVRDKVGSVT 724
Query: 714 RPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGD 767
RPIK+L GFQ V L E + F + P E+L + V+E G ++ G+
Sbjct: 725 RPIKELKGFQRVTLKPGEVRTVTFTIGP-EALQMWNDHMDRVVEPGDFEIMTGN 777
>gi|224538725|ref|ZP_03679264.1| hypothetical protein BACCELL_03619 [Bacteroides cellulosilyticus
DSM 14838]
gi|224519667|gb|EEF88772.1| hypothetical protein BACCELL_03619 [Bacteroides cellulosilyticus
DSM 14838]
Length = 942
Score = 263 bits (672), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 212/697 (30%), Positives = 331/697 (47%), Gaps = 92/697 (13%)
Query: 105 FFNGTIRG-----ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWA 158
F N IRG AT+FP + +++ L +IG G EAR L G T +A
Sbjct: 160 FTNEGIRGVESYRATNFPTQLGLGHTWNRELIRQIGLITGREARML-------GYTNVYA 212
Query: 159 PNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFT 218
P +++ RD RWGR +E GE P + + + VRG+Q + Q +A KHF
Sbjct: 213 PILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGMQHNH----------QVAATGKHFV 262
Query: 219 AYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADR 278
AY + + D +++ +++ + PF+ +K+ G+M +YN +G+P
Sbjct: 263 AYSNNKGAREGMARVDPQMSPREVEMIHVYPFKRVIKEAGLLGVMSSYNDYDGVPIQGSY 322
Query: 279 NLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKH 338
L+ R + GF GY+ SD DAV +Y AK ++AV ++AG++V C +F
Sbjct: 323 YWLTTRLRGEMGFRGYVVSDSDAVEYLYTKHSTAKDMKEAVRQSVEAGLNVRC-TFRSPD 381
Query: 339 T-----KAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA-H 392
+ + VK+ L E I+ + ++ V+ +GLF+ P GAD A +
Sbjct: 382 SYVLPLRELVKEGGLSEEVINDRVRDILRVKFLVGLFD-TPYQTDLA--GADKEVEKAEN 438
Query: 393 QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
+ LALQA+++ +VLLKN + +LPL + +A+ GPNA+ L +Y + T L
Sbjct: 439 ESLALQASRESLVLLKNENNVLPLDINNVKKIAVCGPNADEEGYALTHYGPLAVEVTTVL 498
Query: 453 QALQNYVE---NTVYYPGCDTVACS---------------SASIDKAVDIAKGADHVVLM 494
+ ++ E +Y GCD V + A IDKAV+ A+ AD V++
Sbjct: 499 EGIRQKAEGKAEVLYTKGCDLVDANWPESELIDYPMTDSEQAEIDKAVENARQADVAVVV 558
Query: 495 MGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIG 554
+G Q E R L LPGRQ +L+ V +A KPV+LVL+ G P+ I +A D+ +
Sbjct: 559 LGGGQRTCGENKSRSSLELPGRQLKLLQAV-QATGKPVVLVLINGRPLSINWA--DKFVP 615
Query: 555 SILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMT-DMKMRPQATSG-NP 612
+IL A YPG G A+A+V+FGD+NPGG+L +T +P+ ++P K Q G NP
Sbjct: 616 AILEAWYPGSKGGTAVADVLFGDYNPGGKLTVT-FPKSVGQIPFNFPCKPSSQIDGGKNP 674
Query: 613 GRTYRFYE-GKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVP 671
G ++ FG GLSY+ + Y +S + NQ ++ +
Sbjct: 675 GLDGNMSRVNGALYSFGYGLSYTTFEYSDIEISPKVITPNQKATVR-------------- 720
Query: 672 ELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKE 731
C+ V N G+ AG V L+V+ K L GF+ + L E
Sbjct: 721 ------CK---------VTNTGKRAGDEVVQLYVRDILSSVTTYEKNLAGFERIHLQPGE 765
Query: 732 KAEIVFELS--PCESLSRAREDGLMVIEEGTHFLVVG 766
E+VF L E L + E V+E G +++G
Sbjct: 766 TKEVVFTLDRKQLELLDKHME---WVVEPGDFSIMIG 799
>gi|375254464|ref|YP_005013631.1| glycosyl hydrolase family 3, C-terminal domain-containing protein
[Tannerella forsythia ATCC 43037]
gi|363407375|gb|AEW21061.1| glycosyl hydrolase family 3, C-terminal domain protein [Tannerella
forsythia ATCC 43037]
Length = 775
Score = 263 bits (672), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 214/739 (28%), Positives = 343/739 (46%), Gaps = 123/739 (16%)
Query: 67 EKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASF 126
E ++ L A RLGIP + + E +HG +G T FP I A+++
Sbjct: 107 EALNALQKYAMENTRLGIPIF-FAEECMHGHMAIG-----------TTVFPTSIGQASTW 154
Query: 127 DSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKY 186
+ L ++G AI E R+ G I + P +++ R+PRW R +ET GEDP+++G
Sbjct: 155 NRTLIEKMGAAIAHETRS---QGAHIA---YGPVLDLAREPRWSRVEETFGEDPVLSGIL 208
Query: 187 AVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTY 246
++VRG+QG F G+ + KH AY + R A++ ++L +
Sbjct: 209 GSAFVRGLQGKDFADGR-----HTYSTLKHLAAYGIPVGGHNGR---QAQIGARELIAEH 260
Query: 247 QPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIY 306
PFE VK G A +M +YN V+G+P ++ +L K R +W F+G++ SD ++ I
Sbjct: 261 LLPFEMAVKAG-AQSVMTSYNAVDGVPCTSNTYILKKILRGEWDFNGFVVSDLGSIEGIA 319
Query: 307 DAEGYAKSPEDAVVDVLKAGMDVNCG------SFLQKHTKAAVKQKKLPESEIDRALHNL 360
A + A L AG++++ G + Q HT + + SEID A+ +
Sbjct: 320 TTHRVAPDIKHAAAMALNAGVEMDLGGVAYTRNMEQAHTDSLISM-----SEIDDAVSRI 374
Query: 361 FSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSK 420
++ +GLF +P +QP +++ S H LA + A++ IVLLKN+ LLPL K+
Sbjct: 375 LRLKFEMGLFE-SPYVQP--SRTTEIIRSKEHNRLARKVAEESIVLLKNNANLLPLSKNI 431
Query: 421 SVSLALIGPNANSAKTLLGNYAGPSCRS--ITPLQALQNYVENTV---YYPGCDTVACSS 475
S+A+IGPNA++ LG+Y P +T L+ ++N V T Y GC +
Sbjct: 432 G-SIAVIGPNADNLYNQLGDYTAPQPEEHIVTILEGIRNAVSPTTVIRYVKGCAVRDTTQ 490
Query: 476 ASIDKAVDIAKGADHVVLMMG--------------------------LDQTQEKEELDRV 509
++ID+AV A ++ VVL++G + + E DR
Sbjct: 491 SNIDEAVRAANASNAVVLVVGGSSARDFHTKYIETGAATVSSRENELIPDMESGEGYDRK 550
Query: 510 DLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVA 569
L L G Q++LI +A A KP+I+V + G P+++ A D+ ++L A YPGE G A
Sbjct: 551 SLTLLGHQEKLIESIA-ATGKPLIMVYIQGRPLNMNLA--DKKASALLTAWYPGEEGGNA 607
Query: 570 LAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKE--VFPF 627
+A VIFGD NP GRLP++ P+ ++P+ + G++ + EG ++ F
Sbjct: 608 VANVIFGDVNPSGRLPIS-VPRSTGQLPVYY----------SLGKSNDYVEGTSTPLYAF 656
Query: 628 GCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTI 687
G GLSY+ + Y +S+ E V+
Sbjct: 657 GYGLSYTAFEYGNLTISR---------------------------------EGGNITVSC 683
Query: 688 GVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSR 747
V N G G V L+++ P L F + L E A + F L+P E L+
Sbjct: 684 TVTNTGNTDGDEVVQLYLRDHVASVSVPPVLLKDFAKISLKKGESARVNFVLTP-EQLAF 742
Query: 748 AREDGLMVIEEGTHFLVVG 766
D V+E G +++G
Sbjct: 743 FNTDLKRVVEPGEFTVMIG 761
>gi|332671963|ref|YP_004454971.1| glycoside hydrolase family 3 domain-containing protein
[Cellulomonas fimi ATCC 484]
gi|332341001|gb|AEE47584.1| glycoside hydrolase family 3 domain protein [Cellulomonas fimi ATCC
484]
Length = 771
Score = 263 bits (672), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 228/758 (30%), Positives = 331/758 (43%), Gaps = 124/758 (16%)
Query: 52 ISQRARDLVSRLTLDEKISQLV-----NSAPAIPRLG-IPAYEWWSEALHGVAGVGKGIF 105
I R DL+SR+TLDEK QL + P +G +P + G G +F
Sbjct: 34 IRARVEDLLSRMTLDEKAGQLTQYFAFDLPPGEAAVGDLPQKPGEVDDALASGGAGSLLF 93
Query: 106 FNGTIRG---------------------------ATSFPQVILTAASFDSYLWYRIGQAI 138
N R T P I AAS+D R
Sbjct: 94 VNDAARTNRLQRQAVDGSRHGIPVLFGYDVIHGLRTILPVPIALAASWDPETIER----- 148
Query: 139 GLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDT 198
G A I TF AP ++I RDPRWGR E GEDP + A + VRG
Sbjct: 149 GQAVAAREARAVGIHWTF-APMVDIARDPRWGRIVEGAGEDPYLGSAVAAAQVRG----- 202
Query: 199 FNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGR 258
F GG++ + A KHF Y G Y + ++ Q L + Y PPF++ V G
Sbjct: 203 FQGGRIGTPERVIAGPKHFAGYGAA--IGGRDYD-EVDLSDQQLRNVYLPPFKAAVDAG- 258
Query: 259 ASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDA 318
A IM AY +NG+P+ A+ LL R +WGF G++ SD +AV + GYA DA
Sbjct: 259 AGNIMTAYMPLNGVPATANAWLLKDVLRDEWGFDGFLVSDANAVRNLV-THGYAADLPDA 317
Query: 319 VVDVLKAGMDVNCG--SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTM 376
V ++ G+D+ H AV+ + +D + + V++RLGLF
Sbjct: 318 AVRAVEVGLDLEMAISDPAYAHLPEAVESGAVSIEAVDACVRRILEVKVRLGLFE----- 372
Query: 377 QPF--GKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSA 434
+P+ A V+ PAH+ +A +AAQ VLL+N LLPL S+A+IGP A+S
Sbjct: 373 EPYVDEDRAAAVLADPAHRQVAREAAQRSAVLLRNDGRLLPLDPDALGSVAVIGPLADSK 432
Query: 435 KTLLGNYAGPS--CRSITPLQALQNYVENTV---YYPGCDTVACS--------------- 474
+ LG + ++T L L+ + +T Y PG +
Sbjct: 433 RDTLGPWVFDDDLDETVTILDGLRARLGDTTDVRYAPGVRPAQRTFPSMFEMFPGNATPD 492
Query: 475 ------SASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAA 528
A + +AV++A+ +D VL++G Q E R L LPGRQ EL+ VA A
Sbjct: 493 PEDFDDEAELARAVELARTSDVAVLVVGEWQGMIGEAASRSSLELPGRQLELLQAVA-AT 551
Query: 529 KKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTW 588
PV+L+++ G P+D+ +A +++ +IL YPG G A+A+++ GD PGG+LP TW
Sbjct: 552 GTPVVLLVMNGRPLDLRWAA--QHVPAILDVWYPGTQGGTAVADLLVGDVAPGGKLPFTW 609
Query: 589 YPQDYIKVPMT---DMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQ 645
P+ +VPM PQ N GR Y E +FPFG GL Y +++Y +
Sbjct: 610 -PRTVGQVPMIYSHTTSFEPQ----NQGRRYWDEESTPLFPFGFGLGYGEFTYSDLRLDA 664
Query: 646 NKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFV 705
+ L+ S ++ V N G+ G V L+V
Sbjct: 665 ETIPLDGS-----------------------------LTASVTVTNAGQHDGDEVVQLYV 695
Query: 706 KPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCE 743
RP+++L GFQ + L A E + + P E
Sbjct: 696 HQRHGSAARPVRELKGFQRIHLAAGESRTVELTIGPDE 733
>gi|94497563|ref|ZP_01304132.1| xylosidase/arabinosidase [Sphingomonas sp. SKA58]
gi|94422980|gb|EAT08012.1| xylosidase/arabinosidase [Sphingomonas sp. SKA58]
Length = 774
Score = 263 bits (671), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 224/744 (30%), Positives = 345/744 (46%), Gaps = 103/744 (13%)
Query: 51 PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
P R RD + L ++ L A RLGIP + E LHG A VG
Sbjct: 93 PRVARGRDPRQTVAL---VNALQKWAMTQTRLGIPIL-FHEEGLHGYAAVG--------- 139
Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWG 170
ATSFPQ I A+S+D +L ++ I E R + + M +P ++I RDPRWG
Sbjct: 140 --ATSFPQSIALASSWDPHLVQQVNSVIAREIRV-----RGVPMVL-SPVVDIARDPRWG 191
Query: 171 RGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTR 230
R +ET GEDP + G+ V+ V G+QG+ + GK+ A KH T + GT
Sbjct: 192 RIEETYGEDPYLVGEMGVAAVEGLQGEGRSHDLRPGKV--FATLKHLTGHGQPE-SGTNV 248
Query: 231 YKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWG 290
A ++ ++L + + PPFE VK+ + +M +YN ++G+PS +R LL R +WG
Sbjct: 249 GP--APISERELRENFFPPFEQVVKRTGINAVMASYNEIDGVPSHMNRWLLDDVLRGEWG 306
Query: 291 FHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA-AVKQKKLP 349
F G + SD V + + A S ++A L AG+D + L T V+ K+
Sbjct: 307 FRGAVVSDYSGVDQLMNIHHVAGSLDEAARRALDAGVDADLPEGLSYATLGDQVRAGKVS 366
Query: 350 ESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS--PAHQVLALQAAQDGIVLL 407
E+++D+A+ + ++ R GLF P+ V + + LA AAQ I LL
Sbjct: 367 EAQVDKAVRRMLELKFRAGLFE-----HPYADAAQAVALTNDAEARALARTAAQRSITLL 421
Query: 408 KNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPG 467
KN G+LPL S+ A+IGP+A A+ LG Y G ++ L ++ V + V
Sbjct: 422 KND-GMLPLKVEGSI--AVIGPSAAVAR--LGGYYGQPPHVVSILDGIKARVGDRVRIVF 476
Query: 468 CDTVACS------SASIDK------------AVDIAKGADHVVLMMGLDQTQEKEEL--- 506
V + + +DK AV+ A+ D +VL +G + +E
Sbjct: 477 AQGVKITQDDDWWADKVDKADPAENRRLIAQAVEAARNVDRIVLTLGDTEQSSREGWAAN 536
Query: 507 ---DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPG 563
DR L L G QQEL + + KP+ +VL+ G P + K ++L Y G
Sbjct: 537 HLGDRPSLDLVGEQQELFDAL-KTLGKPITVVLINGRPA--STVKVSEEANALLEGWYLG 593
Query: 564 EAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKE 623
E G A+A+++FGD NPGG+LP+T P+ ++P ++P A GR Y F
Sbjct: 594 EQGGHAVADILFGDVNPGGKLPVT-VPRSVGQLPAF-YNVKPSA-----GRGYLFDTNAP 646
Query: 624 VFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKF 683
++PFG GLSY+ ++ ++Q+ + ++S
Sbjct: 647 LYPFGFGLSYTNFTLSPPRLAQSSIGPGGTTS---------------------------- 678
Query: 684 LVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCE 743
VT+ V+N G G V L++ RPIK+L GF+ V L E + F ++P E
Sbjct: 679 -VTVDVRNDGARDGDEVVQLYIHDKVSSVTRPIKELKGFERVSLKPGEVRTVRFTITP-E 736
Query: 744 SLSRAREDGLMVIEEGTHFLVVGD 767
SL + V+E G ++ G+
Sbjct: 737 SLQMWNDKMHRVVEPGEFEIMTGN 760
>gi|121308314|dbj|BAF43576.1| arabinofuranosidase/xylosidase homolog [Prunus persica]
Length = 349
Score = 263 bits (671), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 146/345 (42%), Positives = 201/345 (58%), Gaps = 15/345 (4%)
Query: 436 TLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASI-DKAVDIAKGADHVVLM 494
T++GNYAG +C TPLQ + Y T++ GC V C+ + A A+ AD VL+
Sbjct: 5 TMIGNYAGVACGYTTPLQGIGRYTR-TIHQAGCTDVHCNGNQLFGAAEAAARQADATVLV 63
Query: 495 MGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIG 554
MGLDQ+ E E +DR L+LPG QQEL++RVA A++ P ILVL+ GGP+D+TFAK D I
Sbjct: 64 MGLDQSIEAEFVDRAGLLLPGHQQELVSRVARASRGPTILVLMSGGPIDVTFAKNDPRIS 123
Query: 555 SILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI-KVPMTDMKMRPQATSGNPG 613
+I+W GYPG+AG A+A+V+FG NPGG+LPMTWYPQ+Y+ +PMTDM MR G PG
Sbjct: 124 AIIWVGYPGQAGGTAIADVLFGTTNPGGKLPMTWYPQNYVTHLPMTDMAMRADPARGYPG 183
Query: 614 RTYRFYEGKEVFPFGCGLSYSKYSYKF----KAVSQNKLYLNQSSSTKMVENQDVVHYKS 669
RTYRFY G VFPFG GLSY+ +++ VS L ++++ M+ V +
Sbjct: 184 RTYRFYRGPVVFPFGLGLSYTTFAHNLAHGPTLVSVPLTSLKATANSTMLSKAVRVSHAD 243
Query: 670 VPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNA 729
L V + VKN G M G H +L+F P G KQL+GF + + A
Sbjct: 244 CNAL-------SPLDVHVDVKNTGSMDGTHTLLVFTSPP-DGKWASSKQLMGFHKIHIAA 295
Query: 730 KEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
+ + + C+ LS G+ I G H L +GD + +S+
Sbjct: 296 GSEKRVRIAVHVCKHLSVVDRFGIRRIPLGEHKLQIGDLSHHVSL 340
>gi|225871719|ref|YP_002753173.1| glycosyl hydrolase family, 3 [Acidobacterium capsulatum ATCC 51196]
gi|225793416|gb|ACO33506.1| glycosyl hydrolase family, 3 [Acidobacterium capsulatum ATCC 51196]
Length = 776
Score = 263 bits (671), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 226/791 (28%), Positives = 355/791 (44%), Gaps = 133/791 (16%)
Query: 13 LCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQL 72
LC + +L T + P N S + QRA L++ ++ EKI QL
Sbjct: 13 LCFAISLMLGLSGITFSAHAQAPGNSSAA----------VKQRADHLLAEMSTQEKIGQL 62
Query: 73 VNSAPAIPRLGIPAYEWWSEALHGVAG--VGKGIFFNG---------------------- 108
P + IP ++ +A VG +F
Sbjct: 63 SQLFYFGPNVKIPGFDSGPSIETQIAKGEVGSLLFITDPATINRMQKIAVQDSPHHIPLI 122
Query: 109 ----TIRG-ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
I G T FP + AAS+D + R +EAR++ I F AP ++I
Sbjct: 123 FGFDVIHGLRTIFPVPLAQAASWDPVMVSRDQSIAAMEARSV-----GIDWAF-APMVDI 176
Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
RDPRWGR E G DP + A + VRG QG + G AC KHF Y
Sbjct: 177 ARDPRWGRMVEGAGSDPYLGAAMAAAQVRGFQG-AYPGAPN----HILACAKHFAGYG-- 229
Query: 224 NWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
+DA ++ L + Y PPF + VK G A+ +M AY +N +P+ ++ LL
Sbjct: 230 --AAEGGRDYDASYISDSQLWNVYLPPFHAAVKAGVAT-LMSAYMDLNDVPATGNQWLLQ 286
Query: 283 KTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG---SFLQKHT 339
RR W F GY+ SD +AV + G+A+ EDA V KAG+++ +
Sbjct: 287 DVLRRDWKFDGYVVSDANAVRNL-QTHGFAQDQEDAAVRAFKAGVNMEMAIGQTAYDSEL 345
Query: 340 KAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD--VVCSPAHQVLAL 397
A++Q + ++D A+ + ++MRLGLF P+ + ++ PAH+ A
Sbjct: 346 SKALQQGVITGQQLDDAVRPILEMKMRLGLFE-----HPYVDVARSQRILDDPAHRTAAR 400
Query: 398 QAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPS--CRSITPLQAL 455
AA+ VLL+N GLLPL K++ ++A+IGP A+S + LG + ++T LQ L
Sbjct: 401 IAAERSAVLLRNEGGLLPLNKTRYHNIAVIGPLADSQRDTLGPWTFDENLSETVTVLQGL 460
Query: 456 QNYV---ENTVYYPGCD---------------------TVACSSASIDKAVDIAKGADHV 491
+N Y PG T A + + +A+D+A+ +D V
Sbjct: 461 RNAFGASAKITYAPGAQMHRKFPSMFDALDRGKKPPVWTPAQARQQMQQAIDLARKSDLV 520
Query: 492 VLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDR 551
V+++G Q E L LPG Q++L+ VA A KP++LVL+ G P++I +A
Sbjct: 521 VMVLGEHQNMSGEAASSDSLKLPGDQEQLLQSVA-ATGKPLVLVLMNGRPLNIKWAAL-- 577
Query: 552 NIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPM---TDMKMRPQAT 608
++ +IL YPG G A+A ++ G PGG+LP W P+D +VP+ ++ PQ
Sbjct: 578 HVPAILDVWYPGSQGGNAVANLLLGKSVPGGKLPFDW-PRDVGQVPIPYAHNLTHEPQ-- 634
Query: 609 SGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYK 668
N R Y ++PFG GLSY+ +++ + L +++SS +K ++ VH
Sbjct: 635 --NQARRYWDEASTPLYPFGYGLSYTAFAF-------SHLQIDKSSVSK----KEDVH-- 679
Query: 669 SVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILN 728
V++ V N G++AG L++ RP+++L GF+ + L
Sbjct: 680 ----------------VSVDVTNTGKLAGDEVAQLYIHQEYGNASRPVRELKGFERITLQ 723
Query: 729 AKEKAEIVFEL 739
+ + F L
Sbjct: 724 PGQTKTLQFTL 734
>gi|323344052|ref|ZP_08084278.1| beta-glucosidase [Prevotella oralis ATCC 33269]
gi|323094781|gb|EFZ37356.1| beta-glucosidase [Prevotella oralis ATCC 33269]
Length = 779
Score = 263 bits (671), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 204/721 (28%), Positives = 336/721 (46%), Gaps = 102/721 (14%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIP + EA HG +G AT FP + AA++ + + + G I
Sbjct: 128 RLGIPLF-LAEEAPHGHMAIG-----------ATVFPTGLGMAATWSTDVIEQAGVIIAK 175
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
E R G + P +++ +PRW R +ET GEDP+++G AV+ V+G
Sbjct: 176 EIRL------QGGHISYGPVLDLAHEPRWSRVEETMGEDPVLSGTIAVAQVKG-----LG 224
Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
G + A KHF AY + + + + + +DL D + PPF + G A
Sbjct: 225 AGDITKPFATIATLKHFIAYGIPE---SGQNGAPSIIGTRDLLDNFLPPFRRAIDAG-AL 280
Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
+M +YN ++GIP ++ +LL++ R QWGF G++ SD ++ IY S ++A +
Sbjct: 281 SVMTSYNSMDGIPCTSNGHLLTEILRNQWGFKGFVVSDLYSIDGIYGTHHTVSSLQEAGI 340
Query: 321 DVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFG 380
+ L+AG+DV+ G+ AV+Q ++ E+ ID A+ + +++ +GLF +P + P
Sbjct: 341 EALRAGVDVDLGANAFALLCDAVRQGRVSEAAIDEAVLRILRMKIEMGLFE-HPYVNP-- 397
Query: 381 KIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGN 440
K V + + +A + A++ I LLKNS+ LLPL SK++ +A+IGPNA++ +LG+
Sbjct: 398 KTAKTGVRTAENIQVAKRVAEESITLLKNSNKLLPL--SKNIKIAVIGPNADNRYNMLGD 455
Query: 441 YAGPSCRS--ITPLQALQNYVE--NTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG 496
Y P S T L +++ + Y GC I +AV A+ AD +V+ +G
Sbjct: 456 YTAPQQDSNVKTILDGIRSKLSPSQITYVKGCSIRDTVFNEIGEAVRAAREADVIVVAVG 515
Query: 497 -----------------------LDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVI 533
+ + E DR L L G Q L+ + E K P++
Sbjct: 516 GSSARDFKTSYQETGAAITSSKVVSDMESGEGFDRASLSLMGIQSRLLQSLKETGK-PMV 574
Query: 534 LVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDY 593
++ + G P+D T+A + ++L A YPG+ G A+A V+FGD+NP GRLP+T P+
Sbjct: 575 VIYIEGRPLDKTWASEQAD--ALLTAYYPGQEGGNAIANVLFGDYNPAGRLPIT-VPRSV 631
Query: 594 IKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQS 653
++P+ K RP Y ++PFG GLSY+ + Y +LN +
Sbjct: 632 GQLPVYYNKKRPVV------HNYVEMASTPLYPFGYGLSYTSFDYS---------HLNIT 676
Query: 654 SSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNG 713
++ ++ V+ ++N GE G L++
Sbjct: 677 KKSE-----------------------EEYEVSFDIRNSGERDGDEVAQLYISDKVASVV 713
Query: 714 RPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPIS 773
+P+KQL GF + L E I L + LS + V+E G + +G I
Sbjct: 714 QPVKQLKGFARIHLKKGETKRITLILKK-DDLSITDRNMERVVEAGDFEIQIGSSSEDIR 772
Query: 774 I 774
+
Sbjct: 773 L 773
>gi|404405497|ref|ZP_10997081.1| glycoside hydrolase family protein [Alistipes sp. JC136]
Length = 804
Score = 262 bits (670), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 202/693 (29%), Positives = 313/693 (45%), Gaps = 102/693 (14%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIP E+ +E +HG+ AT P I ++++ L +R G+ G
Sbjct: 145 RLGIPV-EFTNEGIHGL-----------NHSRATPLPAPIAIGSTWNRALVHRAGEIAGH 192
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
EAR L +AP +++ RDPRWGR E GEDP + + V VRG+Q
Sbjct: 193 EARVLGYK------NVYAPILDVARDPRWGRVVECYGEDPFLIAELGVEMVRGIQSQG-- 244
Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
++ KH+ AY + + D + ++L Y PF +++
Sbjct: 245 ---------VASTLKHYAAYSVPKGGRDGNCRTDPHIAPRELHQMYLYPFRRVIRESGPM 295
Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
G+M +YN +G+P A R L+ R ++GF GY+ SD +AV ++ A++ EDAV
Sbjct: 296 GVMSSYNDWDGVPVTASRYFLTDLLRHEYGFDGYVVSDSEAVEYVHTKHAVAETYEDAVR 355
Query: 321 DVLKAGMDVNC-----GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPT 375
VL+AG++V F+ + V++ +L +D+ + + V+ RLGLF+ NP
Sbjct: 356 QVLEAGLNVRTNFSPPARFILP-VRKLVREGRLSMEVVDQRVREVLRVKFRLGLFD-NPY 413
Query: 376 MQP---FGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNAN 432
P + GAD H+ L + +VLLKN LPL K K+ + + GP A+
Sbjct: 414 NDPREAVAEAGAD-----KHRDFVLDIQRQSLVLLKNEDKTLPLDKKKTARVLVAGPLAD 468
Query: 433 SAKTLLGNYAGPSCRSITPLQALQNYVENTV---YYPGCDTV---------------ACS 474
++ Y ++T L ++NY+ + Y GCD V A
Sbjct: 469 EDNFMISRYGPNDLPTVTVLDGIRNYLGDGAEVRYAKGCDVVDAGFPDSELTATPLTAAE 528
Query: 475 SASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVIL 534
A I++AV A G D +V ++G D + E R L LPGRQQ+L+ + A PV+L
Sbjct: 529 RAGINEAVKQAAGCDVIVAVLGEDDERVGESHSRTSLELPGRQQQLLEAL-HATGVPVVL 587
Query: 535 VLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYP---- 590
VL+ G P+ + +A +N+ +IL +P G A+AE +FGD+NPGG+L +T +P
Sbjct: 588 VLINGQPLTVNWAA--QNVPAILEGWFPSVEGGTAIAETLFGDYNPGGKLTIT-FPRSTG 644
Query: 591 QDYIKVPMTDMKMRPQATSG-NPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLY 649
Q + P Q G N G R ++PFG GLSY+ ++YK N
Sbjct: 645 QIELNFPYKKGSHGAQPRKGPNGGGVTRVL--GSIYPFGYGLSYTTFAYK------NLRI 696
Query: 650 LNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPAR 709
+ S T+ F V+ V N G+ G V L++
Sbjct: 697 APEPSRTQG-----------------------SFRVSCEVTNTGDRRGDEVVQLYISDKF 733
Query: 710 RGNGRPIKQLVGFQSVILNAKEKAEIVFELSPC 742
L GF+ V L E + FE++P
Sbjct: 734 SSVVTYESVLRGFERVTLEPGETKTVSFEVTPS 766
>gi|255532174|ref|YP_003092546.1| glycoside hydrolase family protein [Pedobacter heparinus DSM 2366]
gi|255345158|gb|ACU04484.1| glycoside hydrolase family 3 domain protein [Pedobacter heparinus
DSM 2366]
Length = 799
Score = 262 bits (670), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 214/795 (26%), Positives = 363/795 (45%), Gaps = 143/795 (17%)
Query: 51 PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----W-------SEALHG- 96
P++ R +L+S++TL+EK Q+ + + +P EW W E L+G
Sbjct: 54 PLNARIDNLLSQMTLEEKTCQMATLYGWKRVLKDSLPTKEWKTAIWKDGIANIDEHLNGF 113
Query: 97 -----------VAGVGKGI---------------------FFNGTIRG-----ATSFPQV 119
V + K + F N IRG AT FP
Sbjct: 114 LTWGVTSTSELVTDIKKHVWAMNETQRFFIEQTRLGIPVDFTNEGIRGVEAYEATGFPTQ 173
Query: 120 ILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGE 178
+ +++ L ++G+ G EARAL G T +AP +++ RD RWGR +E GE
Sbjct: 174 LNMGMTWNRNLIRKMGRITGQEARAL-------GYTNVYAPILDVARDQRWGRLEEVYGE 226
Query: 179 DPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVT 238
DP + + V G+Q + Q ++ KHF Y + + D +V+
Sbjct: 227 DPYLVARLGVEMTLGMQENN----------QIASTAKHFAVYSANKGAREGLARTDPQVS 276
Query: 239 MQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSD 298
+++ D PF+ +++ G+M +YN NGIP L++ R+ +GF GY+ SD
Sbjct: 277 PREVEDIMLYPFKKVIQEAGIMGVMSSYNDYNGIPITGSEYWLTQRLRKDFGFGGYVVSD 336
Query: 299 CDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG----SFLQKHTKAAVKQKKLPESEID 354
DA+ +Y+ A + ++AV AG++V + + + V + ++P I+
Sbjct: 337 SDALEYLYNKHHVAANLKEAVFQAFMAGLNVRTTFRPPDSIIIYARQLVNEGRIPIETIN 396
Query: 355 RALHNLFSVRMRLGLFNGNPTMQPFGKIGA---DVVCSPAHQVLALQAAQDGIVLLKNSH 411
+ ++ V+ +LGLF+ QP+ K A +V S AHQ +ALQA+++ IVLLKN++
Sbjct: 397 SRVKDVLRVKFKLGLFD-----QPYVKDAAASEKLVNSIAHQAVALQASKESIVLLKNNN 451
Query: 412 GLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV--ENTVYYPGCD 469
+LPL +S +A+IGPNA +Y +S L+ ++N + + Y GC+
Sbjct: 452 QILPLSRSLK-KIAVIGPNAADNDYAHTHYGPLQSKSTNILEGIRNKIGADKVWYAKGCE 510
Query: 470 TV---------------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLP 514
V A + A I+ AV+ A AD ++++G + E R L LP
Sbjct: 511 LVDKNWPESEIFPEDPDATAIALIEDAVNTAMKADVAIVVLGGNTKTAGENKSRTTLELP 570
Query: 515 GRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVI 574
G Q LI + + KPV+ V++ P+ I + D+ I I++AGYPG G +A+A+V+
Sbjct: 571 GFQLNLIKAIQKTG-KPVVAVMIGTQPMGINWI--DKYIDGIVYAGYPGVKGGIAVADVL 627
Query: 575 FGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYS 634
FGD+NPGG+L +T +P+ ++P+ + +P A + + G + ++PFG GLSY+
Sbjct: 628 FGDYNPGGKLTLT-FPKSVGQLPL-NFPSKPNAQT-DEGELAKI--KGLLYPFGFGLSYT 682
Query: 635 KYSY---KFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKN 691
++Y K + Q K +++ + N
Sbjct: 683 TFAYSNLKISPIEQEK--------------------------------DGNISISVDITN 710
Query: 692 HGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRARED 751
++ G V L+++ K L GF+ + L E + F L P + L +
Sbjct: 711 TAKLEGDEIVQLYIRDVLSTVTTYEKILRGFERISLKPNETKTLKFTLFP-DDLKLWNRE 769
Query: 752 GLMVIEEGTHFLVVG 766
VIE GT +++G
Sbjct: 770 MQHVIEPGTFKVMIG 784
>gi|157363220|ref|YP_001469987.1| glycoside hydrolase family protein [Thermotoga lettingae TMO]
gi|157313824|gb|ABV32923.1| glycoside hydrolase family 3 domain protein [Thermotoga lettingae
TMO]
Length = 779
Score = 262 bits (670), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 218/829 (26%), Positives = 364/829 (43%), Gaps = 179/829 (21%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKI----------------------------------- 69
+ +LP+ R +DL+SR+TLDEK+
Sbjct: 7 YKNASLPVDIRVKDLLSRMTLDEKVAQLGSVWSYELLDDQGNFSNEKAEALLKNGIGQIT 66
Query: 70 -------------SQLVNSAPAI----PRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRG 112
++L+N RLGIPA E L G G+G
Sbjct: 67 RPGGATNLSAKEVARLINQIQKYLIEQTRLGIPAI-MHEECLTGYMGLG----------- 114
Query: 113 ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRG 172
AT+FPQ I A+++D L ++ I + R + G G+ AP +++ RDPRWGR
Sbjct: 115 ATNFPQAIAMASTWDPELIEKMTSTIREDMRQM---GIHQGL---APVLDVVRDPRWGRT 168
Query: 173 QETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD----NWKGT 228
+E+ GE + K VSY+ G+QG G + A KHF Y NW T
Sbjct: 169 EESFGESAYLVAKMGVSYIIGLQGKDIKNGVI-------ATAKHFVGYGASEGGKNWAPT 221
Query: 229 TRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQ 288
+ ++L + + PFE+ VK+ +M +Y+ ++GIP + + L + R+
Sbjct: 222 N-------IPERELREIFMFPFEAAVKEASVMSVMNSYSEIDGIPCASSKELFTGVLRKN 274
Query: 289 WGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDV-----NCGSFLQKHTKAAV 343
WGF G + SD A+ ++ + AK ++A L+AG+DV +C + +++ V
Sbjct: 275 WGFSGIVVSDYFAIDMLREYHRLAKDKKEAAKYALQAGIDVELPKADCYTTIRE----LV 330
Query: 344 KQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDG 403
+Q + ES +++A + ++ LGLF+ +P+ + + + H +A + A+
Sbjct: 331 EQGLISESTVNQATSRVLQIKFMLGLFD-----KPY--VDVEKIELKKHYSIATEIARKS 383
Query: 404 IVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA------------------GPS 445
IVLLKN G+LPL K +AL+GPNA+ + LLG+YA P
Sbjct: 384 IVLLKND-GILPL--KKDAKIALVGPNASEVRNLLGDYAYLAHIKVLLDSVNQTTFNAPK 440
Query: 446 CRSITPLQALQNYVENT--------------VYYPGCDTVACSSASIDKAVDIAKGADHV 491
+ + +E + GCD + S+ +A+ K AD
Sbjct: 441 FNLKNVEKVINESIEKIPSILDSMKAEGVIFTHAIGCDILNSSTEGFSEALHAVKNADIA 500
Query: 492 VLMMG-----LDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITF 546
V+++G + E D +L LPG Q+EL+ +A+ K P++LVL+ G P +
Sbjct: 501 VVVVGDRSGLTEDCTSGESRDSANLKLPGVQEELVLEIAKCGK-PIVLVLVTGRPY--SL 557
Query: 547 AKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQ 606
+ +I+ PGE G +AL +V+FG NPGG+LP++ +P+ ++P+ ++P
Sbjct: 558 KNIVSRVNAIIEMWLPGEVGGMALVDVLFGKVNPGGKLPIS-FPRSAGQIPVYH-DVKPS 615
Query: 607 ATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVH 666
+ + Y + +F FG GLSY+K+ + S ++E Q
Sbjct: 616 GGRSHWHKDYVDELVEPLFSFGHGLSYTKFEF----------------SNLVIEPQ---- 655
Query: 667 YKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVI 726
+P G + + + VKN GE+ G V L++ RPIK+L GF+ +
Sbjct: 656 --KIPSDG-------QVTIKVDVKNSGEVEGDEVVQLYLTREHASVTRPIKELKGFKRIT 706
Query: 727 LNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISIF 775
L E VF++ + L+ +V+E G ++G I F
Sbjct: 707 LKPGESRTTVFKIH-TDVLAYYDRGMELVVEPGVFKAMIGSSSTDIRCF 754
>gi|383115541|ref|ZP_09936297.1| hypothetical protein BSGG_2589 [Bacteroides sp. D2]
gi|313695054|gb|EFS31889.1| hypothetical protein BSGG_2589 [Bacteroides sp. D2]
Length = 800
Score = 262 bits (670), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 228/794 (28%), Positives = 366/794 (46%), Gaps = 148/794 (18%)
Query: 51 PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WSEALHGV----AGV 100
PI R DL+S++TL+EK Q+ + + + P W W + + + G+
Sbjct: 62 PIEARIADLLSQMTLEEKTCQMATLYGSGRVLKDAWPTDGWSTEIWKDGIGNIDEQANGL 121
Query: 101 GK-----------------------------GI---FFNGTIRG-----ATSFPQVILTA 123
GK GI F N IRG AT FP
Sbjct: 122 GKFGSEISYPYANSVKNRHTVQRWFVEQTRLGIPVDFTNEGIRGLCHDRATMFPAQCGQG 181
Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGEDPLV 182
A+++ L I + EA+AL G T +AP ++I +DPRWGR E+ GEDP +
Sbjct: 182 ATWNKKLIREIAKVTADEAKAL-------GYTNIYAPILDIAQDPRWGRVVESYGEDPYL 234
Query: 183 TGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL--DNWKGTTRYKFDARVTMQ 240
G+ + G+Q + A KHF Y + G TR D V +
Sbjct: 235 VGELGKQMILGLQSEGI-----------VATPKHFAVYSIPVGGRDGGTRT--DPHVAPR 281
Query: 241 DLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCD 300
++ Y PF +++ A G+M +YN +G P + L++ R+QWGF GY+ SD +
Sbjct: 282 EMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYVVSDSE 341
Query: 301 AVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-----SFLQKHTKAAVKQKKLPESEIDR 355
AV ++ + E+ V+ AG+++ F+ + A+ + K+ +D+
Sbjct: 342 AVEFLHTKHRITPTEEEMAAQVVNAGLNIRTNFTPPQDFILP-LRRAISEGKVSLHTLDQ 400
Query: 356 ALHNLFSVRMRLGLFNGNPTMQPFGKIGAD-----VVCSPAHQVLALQAAQDGIVLLKNS 410
+ + V+ +GLF+ NP G D VV + AHQ ++++AA + IVLLKN
Sbjct: 401 RVGEILRVKFMMGLFD-NPYP------GDDRRPEVVVHNAAHQDVSMRAALESIVLLKNE 453
Query: 411 HGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI-TPLQALQNYVENTV--YYPG 467
+LPL KS S +A+IGPNA K L Y GP+ SI T Q ++ Y+ N Y G
Sbjct: 454 KEMLPLSKSFS-KIAVIGPNAEEVKELTCRY-GPANASIKTVYQGIKEYLPNAEVRYAKG 511
Query: 468 CDTV---------------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLV 512
CD + A I++AV++AK +D +L++G ++ +EE R +L
Sbjct: 512 CDIIDKYFPESELYNVPLDTQEQAMINEAVELAKASDVAILVLGGNEKTVREEFSRTNLD 571
Query: 513 LPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAE 572
L GRQQ+L+ V A KPV+LV++ G I +A ++ + +I+ A +PGE A+A+
Sbjct: 572 LCGRQQQLLEAVY-ATGKPVVLVMVDGRAATINWA--NKYVPAIIHAWFPGEFMGDAIAK 628
Query: 573 VIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLS 632
V+FGD+NPGGRL +T +P+ ++P +P + S R ++PFG GLS
Sbjct: 629 VLFGDYNPGGRLAVT-FPKSVGQIPFA-FPFKPGSDSKGKVRV-----AGVLYPFGYGLS 681
Query: 633 YSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNH 692
Y+ ++Y +S+ P +G + ++ VKN
Sbjct: 682 YTTFNYSNLKISK-------------------------PVIGAQ----ENITLSCTVKNT 712
Query: 693 GEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDG 752
G+ AG V L+++ K L GF+ + L E+ I F L+P + L ++
Sbjct: 713 GKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTISFTLTP-QDLGLWDKNN 771
Query: 753 LMVIEEGTHFLVVG 766
+E G+ ++VG
Sbjct: 772 QFTVEPGSFSVMVG 785
>gi|357489449|ref|XP_003615012.1| Auxin-induced beta-glucosidase, partial [Medicago truncatula]
gi|355516347|gb|AES97970.1| Auxin-induced beta-glucosidase, partial [Medicago truncatula]
Length = 190
Score = 262 bits (670), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 124/177 (70%), Positives = 141/177 (79%), Gaps = 2/177 (1%)
Query: 8 LVFPLLCLCFTSLLTRVDSTQ--PPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTL 65
+ F L L L DS PP+SCD +NP T++ PFC L I+QRA+D+VSRLTL
Sbjct: 11 ISFIFLFLTRYHRLVHADSPTHVPPYSCDTTNPLTKSLPFCNLNLTITQRAKDIVSRLTL 70
Query: 66 DEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAAS 125
DEKISQLVN+AP+IPRLGIP+Y+WW EALHGVA GKGI NG++ GATSFPQVILTAAS
Sbjct: 71 DEKISQLVNTAPSIPRLGIPSYQWWDEALHGVANAGKGIRLNGSVAGATSFPQVILTAAS 130
Query: 126 FDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLV 182
FDS LWY+I + IG EAR +YNAGQA GMTFWAPNINIFRDPRWGRGQET GEDPLV
Sbjct: 131 FDSKLWYQISKVIGTEARGVYNAGQAQGMTFWAPNINIFRDPRWGRGQETAGEDPLV 187
>gi|189464498|ref|ZP_03013283.1| hypothetical protein BACINT_00840 [Bacteroides intestinalis DSM
17393]
gi|189438288|gb|EDV07273.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 862
Score = 262 bits (669), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 167/457 (36%), Positives = 241/457 (52%), Gaps = 40/457 (8%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
+ T+L +RA LV LTL+EK+ +++ + + RL I Y WW+EALHGVA G
Sbjct: 22 YKNTSLSPEERADLLVKELTLEEKVQLMMDGSRPVERLDIKPYNWWNEALHGVARAGL-- 79
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA---LYNAGQA----IGMTFW 157
AT FPQ I AASF Y + A+ EARA Y++ + G+T W
Sbjct: 80 --------ATVFPQPIGMAASFSPETVYDVFTAVSDEARAKNTYYSSKDSRERYQGLTMW 131
Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTF-NGGKLKGKLQASACCKH 216
P +NI+RDPRWGRG ET GEDP +T + V V+G+QG N KL AC KH
Sbjct: 132 TPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGPADGNYDKLH------ACAKH 185
Query: 217 FTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSC 275
F + W R+ F+A + +DL +TY PPFE+ VK+G +MCAYNR G P C
Sbjct: 186 FAVHSGPEW---NRHSFNAENIKPRDLYETYLPPFEALVKEGEVKEVMCAYNRFEGEPCC 242
Query: 276 ADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE--DAVVDVLKAGMDVNCGS 333
LL + R +WGF G + SDC A++ Y+ G+ P+ A + +G D+ CGS
Sbjct: 243 GSNRLLMQILRDEWGFDGIVVSDCGAIADFYNDRGHHTHPDAESASAAAVVSGTDLECGS 302
Query: 334 FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQ 393
K +VK+ + E +++ ++ L R LG + + + KI VV S AH
Sbjct: 303 SY-KALVESVKKGLISEEKVNTSVRRLLKARFELGEMDDLEKVS-WAKIPFSVVASAAHD 360
Query: 394 VLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQ 453
LAL+ A+ + LL N LPL + +++A++GPNAN + GNY G ++T L
Sbjct: 361 SLALKIARKSMTLLMNKDNFLPL-RRGGLTVAVMGPNANDSVMQWGNYNGMPPHTVTILD 419
Query: 454 ALQNYV---ENTVYYPGC----DTVACSSASIDKAVD 483
++N + + +Y GC T+ S+ S K+VD
Sbjct: 420 GVRNLLGADDKLIYEQGCPWVERTLIHSAFSQCKSVD 456
Score = 122 bits (306), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 94/300 (31%), Positives = 145/300 (48%), Gaps = 53/300 (17%)
Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
I K+V+ K AD V+ G+ + E EE+ DR D+ LP Q+ELI + A
Sbjct: 589 IQKSVEQVKEADIVIFASGISPSLEGEEMGVNLPGFKKGDRTDIELPAVQRELIDALYRA 648
Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMT 587
KK ++LV G P I + +IL A YPG+ G A+AEV+FGD+NP GRLP+T
Sbjct: 649 GKK-IVLVNCSGSP--IGLEPETKKCEAILQAWYPGQQGGTAVAEVLFGDYNPAGRLPVT 705
Query: 588 WYPQDYIKVP-MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQN 646
+Y ++ ++P D M GRTYR+ + +FPFG GLSY+ +SY +
Sbjct: 706 FY-RNVSQLPDFEDYNM--------AGRTYRYMQDTPLFPFGYGLSYTTFSY-------D 749
Query: 647 KLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVK 706
K+ L++S E + +T+ V N G+ G+ V +++K
Sbjct: 750 KVVLDKS----------------------EVTAGQTLKLTVPVTNTGKRDGEEVVQVYLK 787
Query: 707 PARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
G P+K L F+ V + A + I F+L E + + + GT+ ++VG
Sbjct: 788 KQGDAEG-PVKTLRAFKRVFIPAGQTTNIEFDLKDKELEWWDEQTNTVRVCSGTYDIMVG 846
>gi|224537102|ref|ZP_03677641.1| hypothetical protein BACCELL_01979 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521279|gb|EEF90384.1| hypothetical protein BACCELL_01979 [Bacteroides cellulosilyticus
DSM 14838]
Length = 769
Score = 262 bits (669), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 199/679 (29%), Positives = 325/679 (47%), Gaps = 84/679 (12%)
Query: 107 NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWA--PNINIF 164
NG G T +P I AA+F+ L + + LE RA WA PN+ +
Sbjct: 140 NGLCSGTTIYPTPIGQAATFEPALIEQASRETALEMRAT--------GAHWAFTPNVEVA 191
Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFT--AYDL 222
RDPRWGR ET GEDP + G + VRG+QG+ F+ + AC KHF + +
Sbjct: 192 RDPRWGRTGETFGEDPHLVGVMGAATVRGLQGNDFSNPE-----NVIACPKHFIGGSQSI 246
Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
+ G V+ + + + + PPF++C+ A M A+N VNGIPS +++ L++
Sbjct: 247 NGINGAP-----CDVSERTIREIFLPPFKACL-DANAYTFMMAHNEVNGIPSHSNKYLMT 300
Query: 283 KTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC-GSFLQKHTKA 341
R +W F GYI SD + ++D +S DA V +++GMD++ G +
Sbjct: 301 DLLRDEWKFDGYIVSDWMDIERLHDYHRITESYTDAFVLSVQSGMDMHMHGPDFMEALLE 360
Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQ 401
AVK +L E ID+++ + + + +LGLF NP D++ + AHQ AL+ A
Sbjct: 361 AVKDGRLTEKRIDQSVRRILTAKFKLGLFE-NPYFDE--AKSKDLLFNKAHQQTALEIAH 417
Query: 402 DGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA--GPSCRSITPLQALQNYV 459
IVLLKN G+LPL SK + + GPNA++ +LG++A P +T L+ L++
Sbjct: 418 KSIVLLKND-GILPLDVSKYKKIFVTGPNADT-HVILGDWAVPQPEGNVVTVLKGLKDAA 475
Query: 460 ENTVYYP---GCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEK-------EELDRV 509
NT + G + A + +A +A+ AD ++++G + +E E DR
Sbjct: 476 PNTTFSFLDFGWNIRTMDPAKVKQAAGMARQADLAIVVVGENSMREHWSEKTCGENTDRS 535
Query: 510 DLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVA 569
D+ LPG QQEL+ + P I++L+ G P+ + + ++ +++ A PG G A
Sbjct: 536 DINLPGLQQELVETIQNTGV-PTIVILVNGRPLGVEWIA--DHVAALIEAWEPGSFGGQA 592
Query: 570 LAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKE--VFPF 627
+A++++G NP ++P+T VP +++ + + GK +F F
Sbjct: 593 IADILYGKVNPSAKMPVT--------VPRNVGQIQSVYNHKLTSNWFPYAIGKNGPLFHF 644
Query: 628 GCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTI 687
G GLSY+ Y Y L L++S E +I
Sbjct: 645 GYGLSYTTYQY-------TNLRLSKS----------------------EISTDETLTASI 675
Query: 688 GVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSR 747
+ N G+M G V L++ RP+K+L GF+ + L E+ + F+++P LS
Sbjct: 676 DITNTGQMDGDEIVQLYIGDDFSSVTRPLKELKGFKRISLKKGEQKTVTFDITP-HMLSF 734
Query: 748 AREDGLMVIEEGTHFLVVG 766
D +E+GT ++VG
Sbjct: 735 YDTDMNFKVEKGTFKIMVG 753
>gi|336415919|ref|ZP_08596257.1| hypothetical protein HMPREF1017_03365 [Bacteroides ovatus
3_8_47FAA]
gi|335939822|gb|EGN01694.1| hypothetical protein HMPREF1017_03365 [Bacteroides ovatus
3_8_47FAA]
Length = 782
Score = 262 bits (669), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 206/713 (28%), Positives = 332/713 (46%), Gaps = 101/713 (14%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIP + EA HG +G AT FP I AA++ L +GQ I
Sbjct: 129 RLGIPMF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSPELVKEVGQVIAK 176
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
E R+ G + P +++ RDPRW R +ET GEDP+++G S V D
Sbjct: 177 EIRS------QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGILGASMV-----DGLG 225
Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
GG L K A KHF AY + Y A V ++DL + PPF + G A
Sbjct: 226 GGNLSQKYATIATLKHFLAYAVPEGGQNGNY---ASVGIRDLHQNFLPPFRKAIDAG-AL 281
Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
+M +YN ++GIP ++ NLL++ R +W F G++ SD ++ I+++ A + E+A +
Sbjct: 282 SVMTSYNSIDGIPCTSNHNLLTQLLRNEWKFRGFVVSDLYSIEGIHESHFVAPTKENAAI 341
Query: 321 DVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFG 380
+ AG+DV+ G + AV+ ++ ++ ID A+ + ++ +GLF +P + P
Sbjct: 342 QSVTAGVDVDLGGDAYTNLCHAVQSGQMDKAVIDTAVCRVLRMKFEMGLFE-HPYVDP-- 398
Query: 381 KIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGN 440
KI A V H LA + AQ I LLKN + +LPL K+ + +A+IGPNA++ +LG+
Sbjct: 399 KIAAKTVRRKEHIELARKIAQSSITLLKNENSILPLSKTIN-KVAVIGPNADNRYNMLGD 457
Query: 441 YAGPSCRS--ITPLQALQNYV--ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG 496
Y P S T L + + Y GC + I++A++ A+ ++ V++++G
Sbjct: 458 YTAPQEDSNVKTVLDGIITKLSPSRVEYVRGCAIRDTTVNEIEQAIEAARRSEVVIVVVG 517
Query: 497 LDQTQE-----------------------KEELDRVDLVLPGRQQELITRVAEAAKKPVI 533
++ E DR L L GRQQEL+ + + KP+I
Sbjct: 518 GSSARDFKTSYKETGAAVAEEGSVSDMECGEGFDRASLSLLGRQQELLESLQKTG-KPLI 576
Query: 534 LVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDY 593
+V + G P++ +A ++L A YPG+ G A+A+V+FGD+NP GRLP++ P+
Sbjct: 577 VVYIEGRPLEKNWA--SEYADALLTAYYPGQEGGNAIADVLFGDYNPSGRLPIS-VPRSV 633
Query: 594 IKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQS 653
++P+ + P+ Y ++ FG G+SY+ + Y V Q
Sbjct: 634 GQIPVYYNQKAPR------NHDYVEVSSSPLYSFGYGMSYTTFEYSDLQVVQKS------ 681
Query: 654 SSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNG 713
R F V+ VKN G+ G+ L+++
Sbjct: 682 --------------------------ARCFEVSFKVKNTGKYDGEEVSQLYMRDEYASVV 715
Query: 714 RPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
+P+KQL F+ L E+ ++ F L+ E V+E G L++G
Sbjct: 716 QPMKQLKHFERFHLKKGEEKKVTFVLTE-EDFFLVNYTLKKVVESGNFHLMIG 767
>gi|255693561|ref|ZP_05417236.1| periplasmic beta-glucosidase(Cellobiase) [Bacteroides finegoldii
DSM 17565]
gi|260620626|gb|EEX43497.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
finegoldii DSM 17565]
Length = 800
Score = 261 bits (668), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 228/794 (28%), Positives = 365/794 (45%), Gaps = 148/794 (18%)
Query: 51 PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WSEALHGV----AGV 100
PI R DL+S++TL+EK Q+ + + + P W W + + + G+
Sbjct: 62 PIEARIADLLSQMTLEEKTCQMATLYGSGRVLKDAWPTDGWSTEIWKDGIGNIDEQANGL 121
Query: 101 GK-----------------------------GI---FFNGTIRG-----ATSFPQVILTA 123
GK GI F N IRG AT FP
Sbjct: 122 GKFGSEISYPYANSVKNRHTVQRWFVEQTRLGIPVDFTNEGIRGLCHDRATMFPAQCGQG 181
Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGEDPLV 182
A+++ L I + EA+AL G T +AP ++I +DPRWGR E+ GEDP +
Sbjct: 182 ATWNKKLIREIAKVTADEAKAL-------GYTNIYAPILDIAQDPRWGRVVESYGEDPYL 234
Query: 183 TGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL--DNWKGTTRYKFDARVTMQ 240
G+ + G+Q + A KHF Y + G TR D V +
Sbjct: 235 VGELGKQMILGLQSEGI-----------VATPKHFAVYSIPVGGRDGGTRT--DPHVAPR 281
Query: 241 DLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCD 300
++ Y PF +++ A G+M +YN +G P + L++ R+QWGF GY+ SD +
Sbjct: 282 EMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYVVSDSE 341
Query: 301 AVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-----SFLQKHTKAAVKQKKLPESEIDR 355
AV ++ + E+ V+ AG+++ F+ + A+ + K+ +D+
Sbjct: 342 AVEFLHTKHRITPTEEEMAAQVVNAGLNIRTNFTPPQDFILP-LRRAISEGKVSLHTLDQ 400
Query: 356 ALHNLFSVRMRLGLFNGNPTMQPFGKIGAD-----VVCSPAHQVLALQAAQDGIVLLKNS 410
+ + V+ +GLF+ NP G D VV + AHQ ++++AA + IVLLKN
Sbjct: 401 RVGEILRVKFMMGLFD-NPYP------GDDRRPEVVVHNAAHQDVSMRAALESIVLLKNE 453
Query: 411 HGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI-TPLQALQNYVENTV--YYPG 467
+LPL KS S +A+IGPNA K L Y GP+ SI T Q ++ Y+ N Y G
Sbjct: 454 KEMLPLSKSFS-KIAVIGPNAEEVKELTCRY-GPANASIKTVYQGIKEYLPNAEVRYAKG 511
Query: 468 CDTV---------------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLV 512
CD + A I++AV++AK +D +L++G ++ +EE R +L
Sbjct: 512 CDIIDKYFPESELYNVPLDTQEQAMINEAVELAKASDVAILVLGGNEKTVREEFSRTNLD 571
Query: 513 LPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAE 572
L GRQQ+L+ V A KPV+LV++ G I +A ++ + +I+ A +PGE A+A+
Sbjct: 572 LCGRQQQLLEAVY-ATGKPVVLVMVDGRAATINWA--NKYVPAIIHAWFPGEFMGDAIAK 628
Query: 573 VIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLS 632
V+FGD+NPGGRL +T +P+ ++P +P + S R ++PFG GLS
Sbjct: 629 VLFGDYNPGGRLAVT-FPKSVGQIPFA-FPFKPGSDSKGKVRV-----AGVLYPFGYGLS 681
Query: 633 YSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNH 692
Y+ + Y +S+ P +G + ++ VKN
Sbjct: 682 YTTFGYSDLKISK-------------------------PVIGAQ----ENITLSCTVKNT 712
Query: 693 GEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDG 752
G+ AG V L+++ K L GF+ + L E+ I F L+P + L ++
Sbjct: 713 GKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTISFTLTP-QDLGLWDKNN 771
Query: 753 LMVIEEGTHFLVVG 766
+E G+ ++VG
Sbjct: 772 QFTVEPGSFSVMVG 785
>gi|153807033|ref|ZP_01959701.1| hypothetical protein BACCAC_01310 [Bacteroides caccae ATCC 43185]
gi|423219984|ref|ZP_17206480.1| hypothetical protein HMPREF1061_03253 [Bacteroides caccae
CL03T12C61]
gi|149130153|gb|EDM21363.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
caccae ATCC 43185]
gi|392624247|gb|EIY18340.1| hypothetical protein HMPREF1061_03253 [Bacteroides caccae
CL03T12C61]
Length = 786
Score = 261 bits (668), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 225/789 (28%), Positives = 365/789 (46%), Gaps = 138/789 (17%)
Query: 51 PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WSEALHGV----AGV 100
PI R DL+S++TL+EK Q+ + + + P EW W + + + G+
Sbjct: 48 PIEARVADLLSQMTLEEKTCQMATLYGSGRVLKDAWPTAEWSKEIWKDGIGNIDEQANGL 107
Query: 101 GK-----------------------------GI---FFNGTIRG-----ATSFPQVILTA 123
GK GI F N IRG AT FP
Sbjct: 108 GKFGSELSYPYANSVKNRHEIQRWFVEQTRLGIPVDFTNEGIRGLCHNRATMFPAQCGQG 167
Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGEDPLV 182
A+++ L I + EA+AL G T +AP ++I +DPRWGR E+ GEDP +
Sbjct: 168 ATWNKKLIREIAKVTADEAKAL-------GYTNIYAPILDIAQDPRWGRVVESYGEDPYL 220
Query: 183 TGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL--DNWKGTTRYKFDARVTMQ 240
G+ + G+Q + +A KHF Y + G TR D V +
Sbjct: 221 AGELGKQMILGLQAEGL-----------AATPKHFAVYSIPVGGRDGGTRT--DPHVAPR 267
Query: 241 DLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCD 300
++ Y PF +++ A G+M +YN +G P + L++ R+QWGF GY+ SD +
Sbjct: 268 EMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYVVSDSE 327
Query: 301 AVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-----SFLQKHTKAAVKQKKLPESEIDR 355
AV ++ + E+ V+ AG+++ F+ + A+ + K+ +D+
Sbjct: 328 AVEFLHTKHRITPTEEEMAAQVVNAGLNIRTNFTPPQDFILP-LRRAISEGKISLHTLDQ 386
Query: 356 ALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLP 415
+ + V+ LGLF+ NP + VV + AHQ ++++AA + IVLLKN + +LP
Sbjct: 387 RVGEILRVKFMLGLFD-NPYPGD-DRHPETVVHNAAHQEVSMKAALESIVLLKNENQMLP 444
Query: 416 LPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI-TPLQALQNYVEN--TVYYPGCDTV- 471
L KS + +A+IGPNA K L Y GP+ I T Q ++ Y+ N Y GC+ +
Sbjct: 445 LSKSLN-KIAVIGPNAEEVKELTCRY-GPAHAPIKTVYQGIKEYLPNAEVSYAKGCNIID 502
Query: 472 --------------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQ 517
A I++AV++AK +D +L++G ++ +EE R L L GRQ
Sbjct: 503 KYFPESELYNVPLDTQEQAMINEAVELAKVSDIAILVLGGNEKTVREEFSRTSLDLCGRQ 562
Query: 518 QELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGD 577
Q+L+ V A KPV+LV++ G I +A ++ + +I+ A +PGE A+A+V+FGD
Sbjct: 563 QQLLEAVY-ATGKPVVLVMVDGRAATINWA--NKYVPAIVHAWFPGEFMGNAIAKVLFGD 619
Query: 578 HNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYS 637
+NPGGRL +T +P+ +VP +P + S R ++PFG GLSY+ +
Sbjct: 620 YNPGGRLAVT-FPKSVGQVPFA-FPFKPGSDSKG-----RVRVDGVLYPFGYGLSYTTFE 672
Query: 638 YKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAG 697
Y +S+ P +G + ++ VKN G+ AG
Sbjct: 673 YSALKISK-------------------------PVIGPQ----ENMTLSCIVKNTGKRAG 703
Query: 698 KHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIE 757
V L+++ K L GF+ + L E+ I F L+P + L ++ +E
Sbjct: 704 DEVVQLYIRDDFSSVTTYDKMLRGFERIHLQPGEEQTISFTLTP-QDLGLWDKNNQFTVE 762
Query: 758 EGTHFLVVG 766
G+ +++G
Sbjct: 763 PGSFSIMIG 771
>gi|423227452|ref|ZP_17213913.1| hypothetical protein HMPREF1062_06099 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392623082|gb|EIY17188.1| hypothetical protein HMPREF1062_06099 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 786
Score = 261 bits (668), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 222/793 (27%), Positives = 359/793 (45%), Gaps = 130/793 (16%)
Query: 51 PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WSEALHGVAGVGKGI 104
PI R +DL+S++TL+EK Q+ + + + +P +W W + + + G+
Sbjct: 48 PIEARVQDLLSQMTLEEKTCQMATLYGSGRVLKDSLPTEKWKDEIWKDGIANIDEQANGL 107
Query: 105 ------------------------------------FFNGTIRG-----ATSFPQVILTA 123
F N IRG AT FP
Sbjct: 108 GRFGSSLSYPYVNSVENRQTIQRWFVEQTRLGIPVDFTNEGIRGLCHDRATMFPAQCGQG 167
Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGEDPLV 182
A+++ L I Q EA+AL G T ++P ++I +DPRWGR E GEDP +
Sbjct: 168 ATWNKELISEIAQVTAEEAKAL-------GYTNIYSPILDIAQDPRWGRVVECYGEDPFL 220
Query: 183 TGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDL 242
G+ ++G+Q + A KHF Y + + D V +++
Sbjct: 221 VGELGKRMIKGLQQEGL-----------VATPKHFAVYSIPVGGRDAGTRTDPHVAPREM 269
Query: 243 ADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAV 302
Y PF + A G+M +YN +G P + L++ R +WGF GY+ SD +AV
Sbjct: 270 RTLYIEPFRKAFCEAGALGVMSSYNDYDGEPITGSYHFLTEILRHEWGFKGYVVSDSEAV 329
Query: 303 SIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKH----TKAAVKQKKLPESEIDRALH 358
+Y A D V+ AG++V L ++ + A+ + K+ ID +
Sbjct: 330 EFLYSKHQVAADAVDGAAQVVNAGLNVRTNFTLPENFIRPLRQAISEGKVSMQTIDSRVA 389
Query: 359 NLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPK 418
++ V+ +GLF+ NP + K VV S HQ ++++AA + IVLLKN + +LPL K
Sbjct: 390 DVLRVKFGMGLFD-NP-YKGDAKHPEKVVHSKEHQAVSMRAALESIVLLKNENNILPLSK 447
Query: 419 SKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN--TVYYPGCDTV----- 471
+A+IGPNAN + L+ Y + T Q ++ Y+ + Y G D +
Sbjct: 448 DLK-KVAVIGPNANEVQNLICRYGPANAPIKTVYQGIKEYLPDAEVRYAKGTDIIDKYFP 506
Query: 472 ----------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELI 521
A +D+AV +AK +D ++++G ++ +EE R +L L GRQ++L+
Sbjct: 507 ESELYEVPLDQEEQAMMDEAVALAKESDVAIMVLGGNEKTVREEYSRTNLDLCGRQEKLL 566
Query: 522 TRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
V A KPVIL+L+ G I +A +R I I+ A +PGE A+A+V+FGD+NPG
Sbjct: 567 QAVY-ATGKPVILLLVDGRAATINWA--ERYIPGIVHAWFPGEFMGDAVAQVLFGDYNPG 623
Query: 582 GRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFK 641
G+L +T +P+ ++P +P + S R ++PFG GLSY+ ++Y
Sbjct: 624 GKLAVT-FPRSVGQIPFA-FPFKPGSDSKGFVRV-----TGTLYPFGYGLSYTTFAY--- 673
Query: 642 AVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPV 701
S +EN P +G + ++ VKN G++AG V
Sbjct: 674 -------------SDLKIEN---------PVIGVQ----GSVKLSCKVKNTGKVAGDEVV 707
Query: 702 LLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTH 761
L++ +K L GF+ + L + E+ I F L+P E L +D V+E GT
Sbjct: 708 QLYLHDEMSSVTTYVKVLRGFERIHLESGEEKVIDFVLTPQE-LGLWNKDNHFVVEPGTF 766
Query: 762 FLVVGDEEYPISI 774
++VG I +
Sbjct: 767 AVMVGSSSQDIKL 779
>gi|380692851|ref|ZP_09857710.1| beta-glucosidase [Bacteroides faecis MAJ27]
Length = 854
Score = 261 bits (668), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 162/437 (37%), Positives = 229/437 (52%), Gaps = 33/437 (7%)
Query: 44 PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
P+ L +RA DL+ RLTL EKI + NS+ A+ RLGI Y WWSE LHGVA G
Sbjct: 15 PYKNHQLSPKERADDLLGRLTLKEKIGLMQNSSNAVERLGIAPYNWWSEGLHGVARNGL- 73
Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI--------GMT 155
AT FP + A++FD +I A+ E RA ++ G+T
Sbjct: 74 ---------ATVFPITMGMASTFDDEAIEKIYTAVSDEGRAKFHDAHRRNRYGCGNEGLT 124
Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCK 215
FW PN+NIFRDPRWGRGQET GEDP +T + VS V+G+QG + AC K
Sbjct: 125 FWNPNVNIFRDPRWGRGQETFGEDPYLTTRMGVSVVKGMQGPADAAYD-----KTHACAK 179
Query: 216 HFTAYDLDNWKGTTRYKFDA-RVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPS 274
H+ + K R+ FD ++ +DL +TY P F++ V++ +MCAY R G P
Sbjct: 180 HYAVHSGPEAK---RHSFDVEHLSPRDLWETYLPAFKALVQEADVKEVMCAYQRFEGEPC 236
Query: 275 CADRNLLSKTARRQWGFHGYITSDCDAVSIIY--DAEGYAKSPEDAVVDVLKAGMDVNCG 332
C LL++ R +WG+ + SDC A+S + D G K DA + G D+ CG
Sbjct: 237 CDSNRLLTQILRDEWGYKHLVVSDCGAISDFFYKDRHGTHKDAADASASAVINGTDLECG 296
Query: 333 SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAH 392
H + AVK+ + E I+ +L L R LG + + ++ + KIG D V H
Sbjct: 297 -VEYAHLEEAVKRGLITEERINTSLRRLLEARFALGEMDPD-SIVAWSKIGIDTVDCDLH 354
Query: 393 QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
+ +AL + +VLL N+ G+LPL K+ S +A++GPNA + GNY G + T L
Sbjct: 355 RQMALDITRKSMVLLHNN-GILPLAKT-STRIAVMGPNAVDSVMQWGNYKGVPSHTYTIL 412
Query: 453 QALQNYVENTVYYPGCD 469
+ ++N V N Y GC+
Sbjct: 413 EGIRNKVGNVPYEKGCE 429
Score = 86.3 bits (212), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 78/272 (28%), Positives = 118/272 (43%), Gaps = 52/272 (19%)
Query: 482 VDIAKGADHVVLMMGLDQTQEKEEL-----------DRVDLVLPGRQQELITRVAEAAKK 530
VD K AD V+ + G+ E EE DR + LP Q++ I + + A K
Sbjct: 584 VDKVKDADIVIFLGGISPDLEGEEKHFVNCPGFSGGDRTSIELPQVQRD-ILKALKKAGK 642
Query: 531 PVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYP 590
V+ V G V + + +IL A YPG+AG +A+A+V+FGD NP G+LP+T+Y
Sbjct: 643 KVVFVNCSGSAVALVPEM--ESCDAILQAWYPGQAGGLAVADVLFGDFNPSGKLPVTFYK 700
Query: 591 QDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYL 650
D M+ RTYR+ +FPFG GLSY+ + +S+ +L
Sbjct: 701 STDQLPDFEDYSMK--------NRTYRYMVEAPLFPFGYGLSYTTFD-----ISKGRLNK 747
Query: 651 NQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARR 710
S+ K + + VKN G+ G + ++V+
Sbjct: 748 KSISAGKDLN------------------------FKVNVKNTGKCDGAEVIQVYVRKVDD 783
Query: 711 GNGRPIKQLVGFQSVILNAKEKAEIVFELSPC 742
G P+K L F+ V L A E + +L P
Sbjct: 784 MEG-PVKSLRAFRRVPLKAGESCVVSIDLLPA 814
>gi|393779898|ref|ZP_10368130.1| glycosyl hydrolase family 3, N-terminal domain protein
[Capnocytophaga sp. oral taxon 412 str. F0487]
gi|392609318|gb|EIW92128.1| glycosyl hydrolase family 3, N-terminal domain protein
[Capnocytophaga sp. oral taxon 412 str. F0487]
Length = 770
Score = 261 bits (667), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 221/795 (27%), Positives = 368/795 (46%), Gaps = 125/795 (15%)
Query: 7 SLVFPLLCLCFTSLLTRVDS---TQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRL 63
+L+ L+ C T + + D+ TQ FS D I QR ++ +
Sbjct: 10 TLISLLMIGCATPAILKSDNEKLTQKSFSGDKH---------------IEQRVDSVLRLM 54
Query: 64 TLDEKISQLVN-------SAPAI-----PRL--GIPAYEWWSEALHGVAGVGK------- 102
TL+EKI Q+ + P + P L G+ + + ++ G+ + K
Sbjct: 55 TLEEKIGQMTQFSADWSVTGPVMADKYQPYLEKGLVGSIFNATSVAGIRNLQKIAVEQTR 114
Query: 103 -GI---FFNGTIRG-ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFW 157
GI F I G T FP + + S+D L + + EA A I TF
Sbjct: 115 LGIPILFGQDVIHGYKTIFPIPLAESCSWDLTLMRKTAELAAREASA-----DGINWTF- 168
Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
AP ++I RD RWGR E GEDP + A + V+G QG N L AC KHF
Sbjct: 169 APMVDITRDARWGRAMEGAGEDPYLGSLIAEARVKGFQGGD-NWQMLSSPHTLLACGKHF 227
Query: 218 TAYDLDNWKGTTRYKFD---ARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPS 274
Y G D A ++M L + Y PP+E+ + R IM + N +NG+P+
Sbjct: 228 AGY------GAAESGKDYNTAELSMHTLRNVYLPPYEATL-NARVGSIMASLNEINGVPA 280
Query: 275 CADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAG--MDVNCG 332
AD+ LL++ R++WGF+G + SD ++ + G AK + A AG MD+N
Sbjct: 281 TADKWLLTEVLRKEWGFNGLLVSDYTGINELV-RHGVAKDDKQAANLSANAGIEMDMNGA 339
Query: 333 SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAH 392
+F+ K+ A VK+ K+ E++ID+A+ ++ ++ LGLF+ +P + + +
Sbjct: 340 TFI-KYLSALVKEGKVTEAQIDKAVRHILEIKFLLGLFD-DPYRYLDETRAKENTFTEKY 397
Query: 393 QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA--GPSCRSIT 450
+A QA +VLLKN +LP+ K ++A+IGP N+ + G++ G +S++
Sbjct: 398 LKVARQAVASSVVLLKNEAEVLPIKKDSGKTIAVIGPMMNNTSDINGSWTCLGDGKQSVS 457
Query: 451 PLQALQNYVENT----VYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL 506
L L + T +Y GC S+ + +AV +A+ AD V++ +G + E
Sbjct: 458 LLTGLTEKYKATNVKLLYAEGCGFTTISTEQLKEAVAMARKADRVLVAVGEQSSWSGESA 517
Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
R D+ LP Q++L+ + + KP+ ++ G P+D+++ + N+ +IL A +PG G
Sbjct: 518 VRTDIRLPQAQRQLLEAL-KTINKPIAIITFSGRPLDLSWE--NENVQAILQAWFPGTQG 574
Query: 567 AVALAEVIFGDHNPGGRLPMTWYPQDYIKVPM---------------TDMKMRPQATSGN 611
+A+VI GD NP G L M+ +P+ ++P+ ++ RP +G
Sbjct: 575 GYGIADVIAGDVNPSGHLTMS-FPRSVGQIPIYYNYKSTGRPVHTNNEEVDHRPHYNAG- 632
Query: 612 PGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVP 671
Y ++PFG GLSY+ ++ + ++LN+ S + ++
Sbjct: 633 ----YLDSSITPLYPFGYGLSYTTFAI-------SNVHLNKKSLKRYNDS---------- 671
Query: 672 ELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKE 731
+V V+N G G+ V L+ + RP+K+L GFQ + L A E
Sbjct: 672 -----------IIVNASVQNTGTTEGEIVVQLYTRQLVASVSRPVKELKGFQKISLKAGE 720
Query: 732 KAEIVFELSPCESLS 746
++ FEL P E+L+
Sbjct: 721 SKQVRFEL-PSEALA 734
>gi|189467715|ref|ZP_03016500.1| hypothetical protein BACINT_04107 [Bacteroides intestinalis DSM
17393]
gi|189435979|gb|EDV04964.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 943
Score = 261 bits (667), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 211/697 (30%), Positives = 331/697 (47%), Gaps = 92/697 (13%)
Query: 105 FFNGTIRG-----ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWA 158
F N IRG AT+FP + +++ L ++G G EAR L G T +A
Sbjct: 160 FTNEGIRGIESYRATNFPTQLGLGHTWNRELIRQVGLITGREARIL-------GYTNVYA 212
Query: 159 PNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFT 218
P +++ RD RWGR +E GE P + + + VRG+Q + Q +A KHF
Sbjct: 213 PILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGMQHNH----------QVAATGKHFV 262
Query: 219 AYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADR 278
AY + + D +++ +++ + PF+ +K+ G+M +YN +G+P
Sbjct: 263 AYSNNKGAREGMARVDPQMSPREVEMIHVYPFKRVIKEAGLLGVMSSYNDYDGVPIQGSY 322
Query: 279 NLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKH 338
L+ R + GF GY+ SD DAV +Y AK ++AV ++AG++V C +F
Sbjct: 323 YWLTTRLRGEMGFRGYVVSDSDAVEYLYTKHSTAKDMKEAVRQSVEAGLNVRC-TFRSPD 381
Query: 339 T-----KAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA-H 392
+ + VK+ L E I+ + ++ V+ +GLF+ P GAD A +
Sbjct: 382 SYVLPLRELVKEGGLSEEVINDRVRDILRVKFLIGLFDA-PYQTDLA--GADREVEKAEN 438
Query: 393 QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
+ LALQA+++ +VLLKN + +LPL + +A+ GPNA+ L +Y + T L
Sbjct: 439 ESLALQASRESLVLLKNENNVLPLDINNVKKIAVCGPNADEEGYALTHYGPLAVEVTTVL 498
Query: 453 QALQNYVE---NTVYYPGCDTVACS---------------SASIDKAVDIAKGADHVVLM 494
+ ++ E +Y GCD V + A IDKAV+ A+ AD V++
Sbjct: 499 EGIRQKSEGKAEVLYTKGCDLVDANWPESELIDYPMTDNEQAEIDKAVENARQADVAVVV 558
Query: 495 MGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIG 554
+G Q E R L LPGRQ +L+ V +A KPV+LVL+ G P+ I +A D+ +
Sbjct: 559 LGGGQRTCGENKSRSSLDLPGRQLKLLQAV-QATGKPVVLVLINGRPLSINWA--DKFVP 615
Query: 555 SILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMT-DMKMRPQATSG-NP 612
+IL A YPG G A+A+V+FGD+NPGG++ +T +P+ ++P K Q G NP
Sbjct: 616 AILEAWYPGSKGGTAVADVLFGDYNPGGKMTVT-FPKSVGQIPFNFPCKPSSQIDGGKNP 674
Query: 613 GRTYRFYE-GKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVP 671
G ++ FG GLSY+ + Y +S + NQ ++ +
Sbjct: 675 GLDGNMSRVNGALYSFGYGLSYTTFEYSGIEISPKVITPNQKATVR-------------- 720
Query: 672 ELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKE 731
C+ V N G+ AG V L+V+ K L GF+ + L E
Sbjct: 721 ------CK---------VTNTGKRAGDEVVQLYVRDILSSVTTYEKNLAGFERIHLQPGE 765
Query: 732 KAEIVFELS--PCESLSRAREDGLMVIEEGTHFLVVG 766
E+VF L E L + E V+E G ++VG
Sbjct: 766 TKEVVFTLDRKQLELLDKHME---WVVEPGDFSIMVG 799
>gi|393781221|ref|ZP_10369422.1| hypothetical protein HMPREF1071_00290 [Bacteroides salyersiae
CL02T12C01]
gi|392677556|gb|EIY70973.1| hypothetical protein HMPREF1071_00290 [Bacteroides salyersiae
CL02T12C01]
Length = 946
Score = 261 bits (667), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 230/798 (28%), Positives = 353/798 (44%), Gaps = 135/798 (16%)
Query: 49 TLPISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----------------- 89
T PI R DL+S++ L+EK Q+V + + +P EW
Sbjct: 57 TAPIDARIEDLLSQMNLNEKTCQMVTLYGYKRVLKDDLPTPEWKQMLWKDGMGAIDEHLN 116
Query: 90 ---------------WSEALHG----------VAGVGKGI---FFNGTIRG-----ATSF 116
W + H V GI F N IRG AT+F
Sbjct: 117 GFQQWGLPPSDNEYVWPASRHAWALNEVQRFFVEETRLGIPVDFTNEGIRGVESYKATNF 176
Query: 117 PQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQET 175
P + +++ L ++IG G EAR L G T +AP +++ RD RWGR +E
Sbjct: 177 PTQLGLGHTWNRKLIHQIGLITGREARML-------GYTNVYAPILDVGRDQRWGRYEEV 229
Query: 176 PGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDA 235
GE P + + + VRG+Q + Q +A KHF AY + + D
Sbjct: 230 YGESPYLVAELGIEMVRGMQHNH----------QVAATGKHFIAYSNNKGAREGMARVDP 279
Query: 236 RVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYI 295
+++ +++ + PF+ +++ G+M +YN +G P + L+ R Q GF GY+
Sbjct: 280 QMSPREVEMIHVYPFKRVIQEAGLLGVMSSYNDYDGFPIQSSYYWLTTRLRGQMGFRGYV 339
Query: 296 TSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT-----KAAVKQKKLPE 350
SD DAV +Y G AK ++AV ++AG++V C +F + + V++ L E
Sbjct: 340 VSDSDAVEYLYTKHGTAKDMKEAVRQSVEAGLNVRC-TFRSPDSYVLPLRELVQEGGLSE 398
Query: 351 SEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNS 410
I+ + ++ V+ +GLF+ Q K D V ++ +ALQA+++ IVLLKN
Sbjct: 399 EVINDRVRDILRVKFLVGLFDA--PYQTDLKGADDEVEKEENEAVALQASRESIVLLKNE 456
Query: 411 HGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQ---NYVENTVYYPG 467
+ LPL + +A+ GPNA L +Y + T + L+ N +Y G
Sbjct: 457 NNTLPLDITSVKKIAVCGPNAAEKAYALTHYGPLAVEVTTVVDGLREKLNGKAEVLYTKG 516
Query: 468 CDTVAC---------------SSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLV 512
CD V + IDKAV A+ AD V+++G Q E R L
Sbjct: 517 CDLVDAHWPESEIIDYPLSKDEQSEIDKAVAQAQEADVAVVVLGGGQRTCGENKSRSSLD 576
Query: 513 LPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAE 572
LPGRQ +L+ + +A KPVILVL+ G P+ + +A D+ + +IL A YPG G A+A+
Sbjct: 577 LPGRQLDLL-KAVQATGKPVILVLINGRPLSVNWA--DKFVPAILEAWYPGSKGGTAIAD 633
Query: 573 VIFGDHNPGGRLPMTWYPQDYIKVPMT-DMKMRPQATSG-NPGRTYRFYE-GKEVFPFGC 629
V+FGD+NPGG+L +T +P+ ++P K Q G NPG ++PFG
Sbjct: 634 VLFGDYNPGGKLTVT-FPKSVGQIPFNFPHKPSSQIDGGKNPGTKGDMSRVNGALYPFGY 692
Query: 630 GLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGV 689
GLSY+ + Y +S + NQ K V V
Sbjct: 693 GLSYTTFEYSDINISPKVITPNQ-----------------------------KVQVRCKV 723
Query: 690 KNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELS-PCESLSRA 748
N G+ AG V L+V+ K L GF+ + L E E+ F L L A
Sbjct: 724 TNTGKHAGDEVVQLYVRDLISSVTTYEKNLEGFERIHLQPGETKEVSFTLDRKALELLNA 783
Query: 749 REDGLMVIEEGTHFLVVG 766
+ D V+E G +++G
Sbjct: 784 KND--WVVEPGDFSIMLG 799
>gi|383302737|gb|AFH08276.1| hypothetical protein [uncultured bacterium]
Length = 768
Score = 261 bits (667), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 232/790 (29%), Positives = 359/790 (45%), Gaps = 146/790 (18%)
Query: 52 ISQRARDLVSRLTLDEKISQL----------VNSA----------------PAIPRLGIP 85
+S+R DL+SR+TL+EK+ Q+ NSA P I G
Sbjct: 34 VSERVEDLLSRMTLEEKVGQMNQFVGIEHIKANSAVLTEGDLFNNTAQAFYPGIT--GDT 91
Query: 86 AYEWWSEALHG---------------------------VAGVGKGIFFNGTIRGATSFPQ 118
W E L G + G+ I N T +P
Sbjct: 92 VIRWTREGLVGSFLHVLTIEEANMLQRHAMSSRLAIPILFGI-DAIHGNANAPDNTVYPT 150
Query: 119 VILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGE 178
I A+SFD + Y+I + E RA+ + TF PN+++ RDPRWGR ET GE
Sbjct: 151 NIGLASSFDPEMAYKIARQTAAEMRAM-----NLHWTF-NPNVDVVRDPRWGRVGETFGE 204
Query: 179 DPLVTGKYAVSYVRGVQG--DTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDAR 236
DP + V+G QG DT N AC KHF GT D
Sbjct: 205 DPYLISVLGAESVKGYQGTLDTPN--------DVLACIKHFVGGGFPA-NGTNGSPTD-- 253
Query: 237 VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYIT 296
V+ + L + PPFE+ V+ G A +M ++N VNGIP+ ++ L+ R +WGF G++
Sbjct: 254 VSERTLREVLLPPFEAGVEAG-AGSLMTSHNEVNGIPAHSNEWLMRDVLRGEWGFKGFVV 312
Query: 297 SDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC-GSFLQKHTKAAVKQKKLPESEIDR 355
SD + IYD A++ ++A + AGMD++ G + + V++ ++PES ID
Sbjct: 313 SDWMDIEHIYDLHRTAENLKEAFYQSIMAGMDMHMHGIYWNELVCELVREGRIPESRIDE 372
Query: 356 ALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLP 415
++ + V+ RLG+F NP + +V SP H+ AL+AA++ IVLLKN G+LP
Sbjct: 373 SVRRILDVKFRLGIFE-NPYADEARTM--EVRLSPGHRATALEAARNSIVLLKND-GVLP 428
Query: 416 LPKSKSVSLALIGPNANSAKTLLGNYAGPSC--RSITPLQALQNYVENTVYY---PGCDT 470
L SK + + G NA+ + +LG+++ T L+ L+ +T + G +
Sbjct: 429 LDASKYKRVMVTGINADD-ENILGDWSASQRPENVTTILEGLREVAPDTHFEFVDQGWNP 487
Query: 471 VACSSASIDKAVDIAKGADHVVLMMGLDQTQEK-------EELDRVDLVLPGRQQELITR 523
S A ++KA + A+ AD +++ G + + E+ DR D+ L G Q ELI +
Sbjct: 488 QTMSPAQVEKAAEHARHADLNIVVAGEYMMRHRWALRTGGEDTDRSDIDLVGLQNELIEK 547
Query: 524 VAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGR 583
VA A+ KP IL+L+ G + + +A N+ +I+ A PG G A+AE+++G NP +
Sbjct: 548 VA-ASGKPTILILVNGRQLGVEWAA--ENLPAIVEAWEPGMYGGQAVAEILYGTVNPSAK 604
Query: 584 LPMTWYPQDYIKVPM-----TDMKMRPQAT--SGNPGRTYRFYEGKEVFPFGCGLSYSKY 636
LP+T P+ ++ M + P A S +P ++PFG GLSY+ Y
Sbjct: 605 LPVT-IPRSVGQIQMYYNHKPSLYFHPYAAGKSSSP-----------LWPFGFGLSYTTY 652
Query: 637 SYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMA 696
Y +S + E VT+ VKN G
Sbjct: 653 EYSDLRLSSD-----------------------------EIAADGTLDVTVRVKNTGSRD 683
Query: 697 GKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVI 756
G + L+++ RP+K+L F V L A E +I F ++P + L +D V+
Sbjct: 684 GVEIIQLYIRDLYSSVTRPVKELKDFGRVALKAGETKDITFTITP-DKLQFLDKDLRPVV 742
Query: 757 EEGTHFLVVG 766
E G ++VG
Sbjct: 743 EPGEFVVMVG 752
>gi|423223135|ref|ZP_17209604.1| hypothetical protein HMPREF1062_01790 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392639742|gb|EIY33554.1| hypothetical protein HMPREF1062_01790 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 769
Score = 261 bits (666), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 198/679 (29%), Positives = 325/679 (47%), Gaps = 84/679 (12%)
Query: 107 NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWA--PNINIF 164
NG G T +P I AA+F+ L + + LE RA WA PN+ +
Sbjct: 140 NGLCSGTTIYPTPIGQAATFEPALIEQASRETALEMRATG--------AHWAFTPNVEVA 191
Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFT--AYDL 222
RDPRWGR ET GEDP + G + VRG+QG+ F+ + AC KHF + +
Sbjct: 192 RDPRWGRTGETFGEDPHLVGVMGAATVRGLQGNDFSNPE-----NVIACPKHFIGGSQSI 246
Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
+ G V+ + + + + PPF++C+ A M A+N VNGIPS +++ L++
Sbjct: 247 NGINGAP-----CDVSERTIREIFLPPFKACL-DANAYTFMMAHNEVNGIPSHSNKYLMT 300
Query: 283 KTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC-GSFLQKHTKA 341
R +W F GYI SD + ++D +S +A V +++GMD++ G +
Sbjct: 301 DLLRDEWKFDGYIVSDWMDIERLHDYHRVTESYANAFVLSVQSGMDMHMHGPDFMEALLE 360
Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQ 401
AVK +L E ID+++ + + + +LGLF NP D++ + AHQ AL+ A
Sbjct: 361 AVKDGRLTEKRIDQSVRRILTAKFKLGLFE-NPYFDE--AKSKDLLFNKAHQQTALEIAH 417
Query: 402 DGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA--GPSCRSITPLQALQNYV 459
IVLLKN G+LPL SK + + GPNA++ +LG++A P +T L+ L++
Sbjct: 418 KSIVLLKND-GILPLDASKYKKIFVTGPNADT-HVILGDWAVPQPEGNVVTVLKGLKDAA 475
Query: 460 ENTVYYP---GCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEK-------EELDRV 509
NT + G + A + +A +A+ AD ++++G + +E E DR
Sbjct: 476 PNTTFSFLDFGWNIRTMDPAKVKQAAGMARQADLAIVVVGENSMREHWSEKTCGENTDRS 535
Query: 510 DLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVA 569
D+ LPG QQEL+ + P I++L+ G P+ + + ++ +++ A PG G A
Sbjct: 536 DINLPGLQQELVETIQNTGV-PTIVILVNGRPLGVEWIA--DHVAALIEAWEPGSFGGQA 592
Query: 570 LAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKE--VFPF 627
+A++++G NP ++P+T VP +++ + + GK +F F
Sbjct: 593 IADILYGKVNPSAKMPVT--------VPRNVGQIQSVYNHKLTSNWFPYAIGKNGPLFHF 644
Query: 628 GCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTI 687
G GLSY+ Y Y L L++S E +I
Sbjct: 645 GYGLSYTTYQY-------TNLRLSKS----------------------EISTDETLTASI 675
Query: 688 GVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSR 747
+ N G+M G V L++ RP+K+L GF+ + L E+ + F+++P LS
Sbjct: 676 DITNTGQMDGDEIVQLYIGDDFSSVTRPLKELKGFKRISLKKGEQKTVTFDITP-HMLSF 734
Query: 748 AREDGLMVIEEGTHFLVVG 766
D +E+GT ++VG
Sbjct: 735 YDMDMNFKVEKGTFKIMVG 753
>gi|448415866|ref|ZP_21578437.1| beta-glucosidase [Halosarcina pallida JCM 14848]
gi|445680029|gb|ELZ32480.1| beta-glucosidase [Halosarcina pallida JCM 14848]
Length = 765
Score = 261 bits (666), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 218/727 (29%), Positives = 337/727 (46%), Gaps = 123/727 (16%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIPA E L G G T+FPQ I A+++D L I I
Sbjct: 84 RLGIPAIPH-EECLSGYMG-----------PEGTTFPQSIGLASTWDPSLVEEITGTIRT 131
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
+ A+ A +P +++ RD RWGR +ET GEDP + A YV G+QGD
Sbjct: 132 QLEAIGTA------HALSPVLDVARDLRWGRVEETFGEDPYLVASMACGYVDGLQGDGDG 185
Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
SA KHF + + G R + + ++L +T+ PFE+ V+ A
Sbjct: 186 ---------ISATLKHFAGHSV-GEGGKNRSSVN--LGRRELRETHLFPFEAAVRTSDAE 233
Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
+M AY+ ++GIP +D LL+ R +WGF G + SD +V + G A E+A
Sbjct: 234 SVMNAYHDIDGIPCASDEWLLTDVLRGEWGFDGTVVSDYYSVEFLRSEHGVAADEEEAGA 293
Query: 321 DVLKAGMDV-----NC-GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNP 374
L+AG+DV +C G L K V+ L E +D A+ + ++R GLF+ +P
Sbjct: 294 MALEAGIDVELPYTDCYGDSLVK----GVESGHLSEETVDHAVRRVLRAKVRKGLFD-DP 348
Query: 375 TM------QPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIG 428
T+ +PFG AD L +AA++ + LLKN LLPL S++ S+A+IG
Sbjct: 349 TVDPDAASEPFGTDAAD--------ELTTRAARESMTLLKNEGDLLPLAGSETDSVAVIG 400
Query: 429 PNANSAKTLLGNYAGPS--------CRSITPLQALQNYVE----NTVYYPGCDTVACSSA 476
P A+ + L+G+YA + + TPL A+++ + + GC +
Sbjct: 401 PKADDGQELMGDYAYAAHYPEEEVELDATTPLDAIRSRGDEFGFEVSHEQGCTMTGPGTG 460
Query: 477 SIDKAVDIAK---------GADHVVLMMGLDQTQEK--------EELDRVDLVLPGRQQE 519
D A A GA V + +D+ QE E D VDL LPG QQE
Sbjct: 461 GFDAAASAAAEADVAVAFVGARSAVDLSDMDKEQENRSTVPTSGEGCDVVDLDLPGVQQE 520
Query: 520 LITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHN 579
L+ RV + P+++V++ G P I + +++ A PGE G +A +FG+HN
Sbjct: 521 LVERVDQTG-TPLVVVVVSGKPHSI--EAISEAVPAVVQAWLPGERGGEGIAATLFGEHN 577
Query: 580 PGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYK 639
PGG LP++ P+ ++P+ +P + + + + + + ++PFG GLSY+ + Y
Sbjct: 578 PGGHLPVS-IPRTVGQIPV-HYSRKPNSANED----HVYVDSDPLYPFGHGLSYTDFEYG 631
Query: 640 FKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKH 699
A+S ++ +P GT + V+N GE AG
Sbjct: 632 DLALSDDE----------------------IPPAGT-------ITAAVTVENAGERAGHD 662
Query: 700 PVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEG 759
V L+V+ RP+++LVGF+ V L+A + + FE+ + R L V EEG
Sbjct: 663 VVQLYVRAENPSQARPVQELVGFERVSLDAGDARRVSFEIDASQLAYHDRNFDLTV-EEG 721
Query: 760 THFLVVG 766
+ L VG
Sbjct: 722 PYQLRVG 728
>gi|224536087|ref|ZP_03676626.1| hypothetical protein BACCELL_00952 [Bacteroides cellulosilyticus
DSM 14838]
gi|224522306|gb|EEF91411.1| hypothetical protein BACCELL_00952 [Bacteroides cellulosilyticus
DSM 14838]
Length = 791
Score = 261 bits (666), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 216/786 (27%), Positives = 355/786 (45%), Gaps = 132/786 (16%)
Query: 51 PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WSEALHGVAGVGKGI 104
P+ +R DL+S++TL+EKI Q+ + + +P W W + + + G+
Sbjct: 53 PMEERVNDLLSQMTLEEKICQMATLYGSGRVLEDALPEEHWKQALWKDGIGNIDEEHNGL 112
Query: 105 ------------------------------------FFNGTIRG-----ATSFPQVILTA 123
F N IRG AT FP
Sbjct: 113 GTFGSEYSFPYNKHVKAKHEIQRWFVEETRLGIPVDFTNEGIRGLCHDRATFFPSQSGQG 172
Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGEDPLV 182
++++ L RIG+ EA AL G T ++P ++I +DPRWGR E GEDP +
Sbjct: 173 STWNKELIARIGEVEAKEAIAL-------GYTNIYSPILDICQDPRWGRSVECYGEDPYL 225
Query: 183 TGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDL 242
G+ ++ +Q K + + KHF Y + + + D V+ +++
Sbjct: 226 VGQLGKQMIQSLQ-----------KHRLVSTVKHFAVYSIPVGGRDGKTRTDPHVSPREM 274
Query: 243 ADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAV 302
Y PF + A G+M +YN +G P + + L++ R+++GF GY+ SD +AV
Sbjct: 275 RTLYLEPFRRAFCEAGALGVMSSYNDYDGEPITSSHHFLTEILRQEYGFKGYVVSDSEAV 334
Query: 303 SIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT-----KAAVKQKKLPESEIDRAL 357
I + + V + AG+++ F + + A+K+ K+ I+ +
Sbjct: 335 EFITTKHHVVSNEVEGVAQAVNAGLNIRT-HFTKPEDFVLPLRQAIKEGKVSPETINSRV 393
Query: 358 HNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLP 417
++ ++ LGLF+ NP + K +V HQ +AL+AA+ +VLLKN + LLPL
Sbjct: 394 ADILRIKFWLGLFD-NP-YRGDEKQEEKIVHCKEHQQVALEAARQSLVLLKNENQLLPLK 451
Query: 418 KSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENT--VYYPGCDTVAC-- 473
K+ S+A+IGPNAN L+ Y + T Q ++ + T VY GC+ +
Sbjct: 452 KTVK-SVAVIGPNANEQTQLICRYGPANAPIKTVYQGIKELLPETEVVYRKGCEIIDSHF 510
Query: 474 -------------SSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQEL 520
+D+AV A+ A+ VVL++G + +E+ R L LPG QQEL
Sbjct: 511 PESEILPFEKTTEEQQMLDEAVAAARNAEVVVLVLGGSELTVREDRSRTSLDLPGHQQEL 570
Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
+ + A KP +LVLL G I +A ++ I +IL A +PGE A+AE +FGD+NP
Sbjct: 571 MQAI-HATGKPTVLVLLDGRAATINYA--NQYIPAILHAWFPGEFAGTAVAEALFGDYNP 627
Query: 581 GGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKF 640
GGRL +T +P+ ++P +P S P T + ++PFG GLSY+K+SYK
Sbjct: 628 GGRLAVT-FPKSVGQIPFA-FPFKP--GSDEPCETAVY---GALYPFGYGLSYTKFSYK- 679
Query: 641 KAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHP 700
N + + P+ + V+ V N G+ G
Sbjct: 680 --------------------NLQITPEEQGPQ--------GEITVSCEVTNIGDRTGDEV 711
Query: 701 VLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGT 760
V L+++ +K L GF+ + LN E ++ F L+P + L ++ V+E G
Sbjct: 712 VQLYLRDEVSSVTTYMKVLRGFERITLNPGETKKVTFILTP-QDLGLWDKNNKFVVEPGM 770
Query: 761 HFLVVG 766
+++G
Sbjct: 771 FKVMIG 776
>gi|420148909|ref|ZP_14656095.1| glycosyl hydrolase family 3, N-terminal domain protein
[Capnocytophaga sp. oral taxon 335 str. F0486]
gi|394754508|gb|EJF37885.1| glycosyl hydrolase family 3, N-terminal domain protein
[Capnocytophaga sp. oral taxon 335 str. F0486]
Length = 770
Score = 260 bits (665), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 226/798 (28%), Positives = 367/798 (45%), Gaps = 131/798 (16%)
Query: 7 SLVFPLLCLCFTSLLTRVDS---TQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRL 63
+L+ L+ C T + + D+ TQ FS D I QR ++ +
Sbjct: 10 TLISLLMIGCATPAILKSDNEKLTQKSFSGDKH---------------IEQRVDSVLRLM 54
Query: 64 TLDEKISQLVN-------SAPAIPRLGIPAYEWWSEALHG-------VAGVGK------- 102
TL+EKI Q+ + P + P E + L G V G+ K
Sbjct: 55 TLEEKIGQMTQFSADWSVTGPVMADKYQPYLE---KGLVGSIFNATSVVGMRKLQKIAVE 111
Query: 103 ----GI---FFNGTIRG-ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGM 154
GI F I G T FP + + S+D L + + EA A I
Sbjct: 112 QTRLGIPILFGQDVIHGYKTIFPIPLAESCSWDLALMRKTAELAAREATA-----DGINW 166
Query: 155 TFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACC 214
TF AP ++I RD RWGR E GEDP + A + V+G QG N L AC
Sbjct: 167 TF-APMVDITRDARWGRAMEGAGEDPYLGSLIAEARVKGFQGGD-NWQTLSSPHTLLACG 224
Query: 215 KHFTAYDLDNWKGTTRYKFD---ARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
KHF Y G D A ++M L + Y PP+E+ + G S IM + N +NG
Sbjct: 225 KHFAGY------GAAESGKDYNTAELSMHTLRNVYLPPYEATLNAGVGS-IMASLNEING 277
Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAG--MDV 329
+P+ A + LL++ R++WGF+G + SD ++ + G AK + A AG MD+
Sbjct: 278 VPATAYKWLLTEVLRKEWGFNGLLVSDYTGINELV-RHGVAKDDKQAANLSANAGIEMDM 336
Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
N +F+ K+ A VK+ K+ E++ID+A+ ++ ++ LGLF+ +P + +
Sbjct: 337 NGATFI-KYLSALVKEGKVTEAQIDKAVRHILEMKFLLGLFD-DPYRYLDETRAKENTFT 394
Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA--GPSCR 447
+ +A QA +VLLKN +LP+ K ++A+IGP N+ + G++ G +
Sbjct: 395 EEYLKVARQAVASSVVLLKNEAEVLPIKKDSGKTIAVIGPMMNNTSDINGSWTCLGDGKQ 454
Query: 448 SITPLQALQNYVENT----VYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEK 503
S++ L L + T +Y GC S+ + +AV IA+ AD V++ +G +
Sbjct: 455 SVSLLTGLTEKYKGTNVKLLYAEGCGFTTISTEQLKEAVAIARKADRVLVAVGEQSSWSG 514
Query: 504 EELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPG 563
E R D+ LP Q++L+ + +A KP+ ++ G P+D+++ + N+ +IL A +PG
Sbjct: 515 ESAVRTDIRLPQAQRQLLEAL-KAINKPIAIITFSGRPLDLSWE--NENVQAILQAWFPG 571
Query: 564 EAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPM---------------TDMKMRPQAT 608
G +A+VI GD NP G+L M+ +P+ ++P+ ++ RP
Sbjct: 572 TQGGYGIADVIAGDVNPSGQLTMS-FPRSVGQIPIYYNYKSTGRPVYTNNEEVDHRPHYN 630
Query: 609 SGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYK 668
+G Y ++PFG GLSY+ ++ N ++LN+ S + ++
Sbjct: 631 AG-----YLDSSITPLYPFGYGLSYTTFAI-------NNVHLNKKSIKRYNDS------- 671
Query: 669 SVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILN 728
+V V+N G G+ V L+ + RP+K+L GFQ + L
Sbjct: 672 --------------IIVNASVQNTGTTEGEIVVQLYTRQLVASVSRPVKELKGFQKIPLK 717
Query: 729 AKEKAEIVFELSPCESLS 746
A E ++ FEL P E+L+
Sbjct: 718 AGESKQVHFEL-PSEALA 734
>gi|365836152|ref|ZP_09377549.1| glycosyl hydrolase family 3 protein [Hafnia alvei ATCC 51873]
gi|364564272|gb|EHM42040.1| glycosyl hydrolase family 3 protein [Hafnia alvei ATCC 51873]
Length = 810
Score = 260 bits (665), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 233/798 (29%), Positives = 369/798 (46%), Gaps = 142/798 (17%)
Query: 15 LCFTSLLT-RVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARD-----LVSRLTLDEK 68
LC SL+ + +T P D + P+ P K P S + RD L+S++TLDEK
Sbjct: 42 LCSLSLVAGLIATTLSPVYADATQPA---HPLVK---PQSTQQRDAFVTDLLSKMTLDEK 95
Query: 69 ISQ--LVNSAPAIPRLGIPAYEWWSEALHGVAGVGK-GIFFNGTIRG------------- 112
I Q L++ P P+ EA+ + G+ G FN R
Sbjct: 96 IGQMRLISVGPDNPK----------EAIREMIKQGQVGAIFNTVTRQDIRKMQDQAMQLS 145
Query: 113 ----------------ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTF 156
T FP + A+S++ +G+ EA + + MT
Sbjct: 146 RLKIPLFFAYDVVHGQRTVFPISLGLASSWNRDAVTTVGRISAYEA-----SEDGLNMT- 199
Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKH 216
WAP +++ RDPRWGR E GED +T + V +QG+ + KH
Sbjct: 200 WAPMVDVTRDPRWGRTSEGFGEDTYLTSEMGRLMVEAMQGNN-----PADRHSVMTSVKH 254
Query: 217 FTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
F AY +G Y ++ Q L Y PP+++ + G + G+M + N +NG P+ +
Sbjct: 255 FAAYGA--VEGGRDYN-TVDMSPQRLFQDYMPPYKAALDAG-SGGVMVSLNSINGTPATS 310
Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFL 335
+ LL R +W F G SD A+ + G A PEDAV +K+G+D++ +
Sbjct: 311 NSWLLKDVLRDEWNFKGITISDHGAIKELI-KHGVASDPEDAVRVAVKSGIDMSMSDEYY 369
Query: 336 QKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA---- 391
K+ + VK ++ E E+D A ++ +V+ +GLF G +G+D V + A
Sbjct: 370 SKYLPSLVKSGRVSEKEVDDAARHVLNVKYDMGLFTD--AYSHLGPVGSDPVDTNAESRL 427
Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNY--AGPSCRSI 449
H+ A A++ +VLLKN LPL KS ++ ALIGP A+S + ++G++ AG +S+
Sbjct: 428 HRPEARSVARESMVLLKNRLNTLPLAKSGTI--ALIGPLADSKRDVMGSWSAAGVVDQSV 485
Query: 450 TPLQALQN---YVENTVYYPGCD------------------TVACSS--ASIDKAVDIAK 486
T LQ L+N Y VY G + TV S A ID+AV AK
Sbjct: 486 TVLQGLRNAVGYKAQIVYAKGANVSNDPGITDFLNLYEKAVTVDPRSPQAMIDEAVAAAK 545
Query: 487 GADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITF 546
+D +V ++G Q E R D+ +P Q++LI + + K P++LVL+ G P +
Sbjct: 546 KSDVIVAVVGEAQGMAHEASSRSDITIPQSQRDLIAALKQTGK-PLVLVLMNGRP--LAL 602
Query: 547 AKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKM--- 603
K D+ +IL + + G G A+A+V+FGD+NP G+LPM+ +P+ ++P+ +
Sbjct: 603 EKEDQQADAILESWFSGTEGGNAIADVLFGDYNPSGKLPMS-FPRSVGQIPIYYSHLNTG 661
Query: 604 RPQATSGNPGRTYRFYEGKE--VFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVEN 661
RP T +Y+ ++PFG GLSY+ +S VS K+ SS M +
Sbjct: 662 RPYNPEKPEKYTSHYYDAANGPLYPFGYGLSYTTFS-----VSDVKM-----SSPIMKRD 711
Query: 662 QDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVG 721
V ++ VKN G+ G V L+++ RP+K+L G
Sbjct: 712 GSVT-------------------ASVTVKNTGKRDGATVVQLYIQDKTATMSRPVKELKG 752
Query: 722 FQSVILNAKEKAEIVFEL 739
F+ V+L + E+ I F++
Sbjct: 753 FEKVMLKSGEQKTISFKI 770
>gi|423222018|ref|ZP_17208488.1| hypothetical protein HMPREF1062_00674 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392644204|gb|EIY37946.1| hypothetical protein HMPREF1062_00674 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 942
Score = 260 bits (665), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 212/697 (30%), Positives = 329/697 (47%), Gaps = 92/697 (13%)
Query: 105 FFNGTIRG-----ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWA 158
F N IRG AT+FP + +++ L ++G G EAR L G T +A
Sbjct: 160 FTNEGIRGVESYRATNFPTQLGLGHTWNRELIRQVGLITGREARML-------GYTNVYA 212
Query: 159 PNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFT 218
P +++ RD RWGR +E GE P + + + VRG+Q Q +A KHF
Sbjct: 213 PILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGMQHSH----------QVAATGKHFV 262
Query: 219 AYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADR 278
AY + + D +++ +++ + PF+ +K+ G+M +YN +G+P
Sbjct: 263 AYSNNKGAREGMARVDPQMSPREVEMIHVYPFKRVIKEAGLLGVMSSYNDYDGVPIQGSY 322
Query: 279 NLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKH 338
L+ R + GF GY+ SD DAV +Y AK ++AV ++AG++V C +F
Sbjct: 323 YWLTTRLRGEMGFRGYVVSDSDAVEYLYTKHSTAKDMKEAVRQSVEAGLNVRC-TFRSPD 381
Query: 339 T-----KAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA-H 392
+ + VK+ L E I+ + ++ V+ +GLF+ P GAD A +
Sbjct: 382 SYVLPLRELVKEGGLSEEVINDRVRDILRVKFLVGLFD-TPYQTDLA--GADKEVEKAEN 438
Query: 393 QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
+ LALQA+++ +VLLKN + +LPL + +A+ GPNA+ L +Y + T L
Sbjct: 439 ESLALQASRESLVLLKNENNVLPLDINNVKKIAVCGPNADEEGYALTHYGPLAVEVTTVL 498
Query: 453 QALQNYVE---NTVYYPGCDTVACS---------------SASIDKAVDIAKGADHVVLM 494
+ ++ E +Y GCD V + A IDKAV+ A+ AD V++
Sbjct: 499 EGIRQKAEGKAEVLYTKGCDLVDANWPESELIDYPMTDSEQAEIDKAVENARQADVAVVV 558
Query: 495 MGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIG 554
+G Q E R L LPGRQ +L+ V +A KPV+LVL+ G P+ I +A D+ +
Sbjct: 559 LGGGQRTCGENKSRSSLDLPGRQLKLLQAV-QATGKPVVLVLINGRPLSINWA--DKFVP 615
Query: 555 SILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMT-DMKMRPQATSG-NP 612
IL A YPG G A+A+V+FGD+NPGG+L +T +P+ ++P K Q G NP
Sbjct: 616 VILEAWYPGSKGGTAVADVLFGDYNPGGKLTVT-FPKSVGQIPFNFPCKPSSQIDGGKNP 674
Query: 613 GRTYRFYE-GKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVP 671
G ++ FG GLSY+ + Y +S + NQ ++ +
Sbjct: 675 GLDGNMSRVNGALYSFGYGLSYTTFEYSDIEISPKVITPNQKATVR-------------- 720
Query: 672 ELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKE 731
C+ V N G+ AG V L+V+ K L GF+ + L E
Sbjct: 721 ------CK---------VTNTGKRAGDEVVQLYVRDILSSVTTYEKNLAGFERIHLQPGE 765
Query: 732 KAEIVFELS--PCESLSRAREDGLMVIEEGTHFLVVG 766
E+VF L E L + E V+E G ++VG
Sbjct: 766 TKEVVFTLDRKQLELLDKHME---WVVEPGDFSIMVG 799
>gi|87200432|ref|YP_497689.1| beta-glucosidase [Novosphingobium aromaticivorans DSM 12444]
gi|87136113|gb|ABD26855.1| Beta-glucosidase [Novosphingobium aromaticivorans DSM 12444]
Length = 849
Score = 260 bits (665), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 158/395 (40%), Positives = 223/395 (56%), Gaps = 32/395 (8%)
Query: 67 EKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASF 126
EK +QL ++ A +PAY++W+EALHG+A NG AT FPQ I AA++
Sbjct: 30 EKAAQLQSTVTA-SSPDLPAYDYWNEALHGLA-------RNGV---ATVFPQAIGLAATW 78
Query: 127 DSYLWYRIGQAIGLEARALYNA------GQAIGMTFWAPNINIFRDPRWGRGQETPGEDP 180
D+ L RIG + EARA YNA + G+T W+PNINIFRDPRWGRGQET GEDP
Sbjct: 79 DAPLLERIGTVVSTEARAKYNALPGKDRRRYQGLTIWSPNINIFRDPRWGRGQETYGEDP 138
Query: 181 LVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQ 240
++TG A +YVRG+QG + ++ A KH A+ R F+ + +
Sbjct: 139 VLTGTLATAYVRGLQGPDLDHPRVI------ATPKHLVAHSGPE---AGRDSFNVQSSAY 189
Query: 241 DLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCD 300
D+ TY P F + +G+A +MC+YN V+G+P C LL++ R+ WGF G + SDCD
Sbjct: 190 DMEATYLPAFRRALTEGKALSVMCSYNSVHGVPVCGADWLLNQRVRKDWGFDGLVVSDCD 249
Query: 301 AVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNL 360
A+ I + Y ++ A + AGMD+NCG + A+ Q +DRAL
Sbjct: 250 AIGNINHYQRYRQTNAAASAAAINAGMDLNCGRTYAALPE-ALAQGLTTREVVDRALART 308
Query: 361 FSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSK 420
F+ R +LG G P+ I A+ V + H+ LAL+AA+ +VLLKN +G+LPL K
Sbjct: 309 FAARRKLGDAFG--ATSPWATIPANTVDTAEHRALALEAARKSLVLLKN-NGVLPL--RK 363
Query: 421 SVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL 455
+A++GPNA+S TL NY G + + +TPL
Sbjct: 364 GARIAVVGPNADSLDTLEANYHGTAAQPVTPLDGF 398
Score = 103 bits (256), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 78/261 (29%), Positives = 128/261 (49%), Gaps = 37/261 (14%)
Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
DR D+ LP QQ+L+ + +A KP+++VLL G V + + K N +++ A YPG++G
Sbjct: 601 DRTDIALPRAQQDLLETL-KATGKPLVVVLLSGSAVAMPWVK--ENADAVVAAWYPGQSG 657
Query: 567 AVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFP 626
A+A+++ G NP GRLP+T+Y + D MR RTYR++ G ++
Sbjct: 658 GTAIADLVDGTLNPSGRLPVTFYARTRDLPAFVDYNMRE--------RTYRYFHGTPLWS 709
Query: 627 FGCGLSYSKYSYKFKAVSQNKLYLNQS-SSTKMVENQDVVHYKSVPELGTEFCETRKFLV 685
FG GLSY+ ++Y KA + ++ ++ ++T + N V ++ E+ + + L
Sbjct: 710 FGEGLSYTSFAYG-KASAPARIKAGETLTATVSLAN---VGARNGEEVAQAYLVPPEHLR 765
Query: 686 TIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESL 745
TIG N PVL LV ++ L E + F L P SL
Sbjct: 766 TIGEFN-------DPVLR-------------HSLVAYRRAALAKGETTRLSFTLDP-RSL 804
Query: 746 SRAREDGLMVIEEGTHFLVVG 766
S +G+ + GT+ L +G
Sbjct: 805 STVDRNGVRAVRPGTYRLFIG 825
>gi|374375578|ref|ZP_09633236.1| Beta-glucosidase [Niabella soli DSM 19437]
gi|373232418|gb|EHP52213.1| Beta-glucosidase [Niabella soli DSM 19437]
Length = 768
Score = 260 bits (664), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 236/808 (29%), Positives = 357/808 (44%), Gaps = 160/808 (19%)
Query: 40 TETFPFCKTTL-PISQRARDLVSRLTLDEKISQLVN-SAPAI------------------ 79
T P K L + R +DL+SR+TL EK++Q+ + SA I
Sbjct: 22 TRKGPLYKDPLQSVEVRVKDLLSRMTLQEKLAQMQDLSASEIMVSDAIDPVKMDQKLKGK 81
Query: 80 -----------------------------PRLGIPAYEWWSEALHG-VAGVGKGIFFNGT 109
RLGIP SEALHG V G
Sbjct: 82 SFGVFEGMGLTVAQYAKVINTLQEYMVKKTRLGIPVLTS-SEALHGCVHG---------- 130
Query: 110 IRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRW 169
GAT +PQ I ++F+ L ++ Q I E +A Q I +P++++ R+ RW
Sbjct: 131 --GATVYPQAIALGSTFNPALVNQMTQTITKELKA-----QGITQVL-SPDLDLARELRW 182
Query: 170 GRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACC-KHFTAYDLDNWKGT 228
GR +ET GEDP +T + V++ +G + C KHF A+ GT
Sbjct: 183 GRVEETYGEDPYLTSRMGVAFTKGFTENNI------------ICTPKHFAAH------GT 224
Query: 229 TRYKFD-ARVT--MQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
R + A V ++L Y PFE+ +K+ + IM AY+ +G+P A +L+
Sbjct: 225 PRGGLNLASVAGGERELRSIYLKPFEAVIKEAQPLSIMNAYSSYDGVPMAASHQVLTDIL 284
Query: 286 RRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQ 345
R++ GF GY+ SD AV ++Y + AK P +A + +KAG+D+ S + + VK
Sbjct: 285 RKELGFRGYVYSDWGAVEMLYSFQRTAKGPAEAALQAVKAGLDLEVWSDCFEKLDSLVKA 344
Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGA--DVVCSPAHQVLALQAAQDG 403
LP ID A+ + S + +GLF P+ + + +P LAL A++
Sbjct: 345 GTLPVRYIDTAVSRILSAKFAIGLFE-----HPYPNLKGLTSDIHTPQSVQLALDIARES 399
Query: 404 IVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA--GPSCRSITPLQALQNYVEN 461
IVLLKN LLPL K S+A+IGPNA+ + G+Y+ + ITPLQ +Q N
Sbjct: 400 IVLLKNEDHLLPL-KGSIRSIAVIGPNADHVQ--FGDYSWTNDNRHGITPLQGIQALAGN 456
Query: 462 TV---YYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQ---------EKEELDRV 509
+ + GCDT + + AV A +D V+ +G E D
Sbjct: 457 KIQINHAKGCDTWSQNREGFSAAVAAASKSDVAVVFVGSSSASPGYPHPDATSGEGYDLS 516
Query: 510 DLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVA 569
DL LPG Q++L+ + + KPV++VL+ G P I + K NI +I+ YPGE G A
Sbjct: 517 DLKLPGVQEDLVKAIKQTG-KPVVVVLVSGKPFAIPWIK--ENIPAIVGQWYPGEQGGTA 573
Query: 570 LAEVIFGDH----NPGGRLPMTWYPQD-------YIKVPMTDMKMRPQATSGNPGRTYRF 618
+AEV+FG NP G+L ++ +PQ Y P + + + +PG+ Y F
Sbjct: 574 IAEVLFGPANGGINPSGKLNVS-FPQSVGHLPVFYNYYPSDNGYYNKRGSLNSPGKDYVF 632
Query: 619 YEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFC 678
++ FG GLSY+ + Y+ +S+ T F
Sbjct: 633 SSPDPLWAFGTGLSYTSFEYQEMELSK-----------------------------TSFS 663
Query: 679 ETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFE 738
+ + VKN G M GK V L+V+ P+K+L F+ V + A + F+
Sbjct: 664 AGETCHIKLRVKNTGAMDGKEVVQLYVRDKVSSVATPVKELKRFEKVFIKKGATATVQFD 723
Query: 739 LSPCESLSRAREDGLMVIEEGTHFLVVG 766
L P + L+ D V+E G L G
Sbjct: 724 L-PMKELALYNADMKRVVEAGEFELQAG 750
>gi|429756169|ref|ZP_19288778.1| glycosyl hydrolase family 3 protein [Capnocytophaga sp. oral taxon
324 str. F0483]
gi|429171889|gb|EKY13478.1| glycosyl hydrolase family 3 protein [Capnocytophaga sp. oral taxon
324 str. F0483]
Length = 755
Score = 260 bits (664), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 215/750 (28%), Positives = 352/750 (46%), Gaps = 113/750 (15%)
Query: 52 ISQRARDLVSRLTLDEKISQLVN-------SAPAIPRLGIPAYEWWSEALHG-------V 97
I QR ++ +TL+EKI Q+ + P + P E + L G V
Sbjct: 28 IEQRVDSVLRLMTLEEKIGQMTQFSADWSVTGPVMADKYQPYLE---KGLVGSIFNATSV 84
Query: 98 AGVGK-----------GI---FFNGTIRG-ATSFPQVILTAASFDSYLWYRIGQAIGLEA 142
G+ K GI F I G T FP + + S+D L + + EA
Sbjct: 85 VGIRKLQKIAVEQTRLGIPILFGQDVIHGYKTIFPIPLAESCSWDLALMRKTAELAAREA 144
Query: 143 RALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGG 202
A I TF AP ++I RD RWGR E GEDP + A + V+G QG N
Sbjct: 145 TA-----DGINWTF-APMVDITRDARWGRAMEGAGEDPYLGSLIAEARVKGFQGGD-NWQ 197
Query: 203 KLKGKLQASACCKHFTAYDLDNWKGTTRYKFD---ARVTMQDLADTYQPPFESCVKQGRA 259
L AC KHF Y G D A ++M L + Y PP+E+ + G
Sbjct: 198 TLSSPHTLLACGKHFAGY------GAAESGKDYNTAELSMHTLRNVYLPPYEATLNAGVG 251
Query: 260 SGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAV 319
S IM + N +NG+P+ AD+ LL++ R++WGF+G + SD ++ + G AK + A
Sbjct: 252 S-IMASLNEINGVPATADKWLLTEELRKEWGFNGLLVSDYTGINELV-RHGVAKDDKQAA 309
Query: 320 VDVLKAG--MDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQ 377
AG MD+N +F+ K+ A VK+ K E++ID+A+ ++ ++ LGLF+ +P
Sbjct: 310 NLSANAGIEMDMNGATFI-KYLSALVKEGKATEAQIDKAVRHILEMKFLLGLFD-DPYRY 367
Query: 378 PFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTL 437
+ + + +A QA +VLLKN +LP+ K+ ++A+IGP N+ +
Sbjct: 368 LDETRAKENTFTEEYLKVARQAVASSVVLLKNEAEVLPIKKNSGKTIAVIGPMMNNTSDI 427
Query: 438 LGNYA--GPSCRSITPLQALQNYVENT----VYYPGCDTVACSSASIDKAVDIAKGADHV 491
G++ G +S++ L L + T +Y GC S+ + +AV IA+ AD V
Sbjct: 428 NGSWTCLGDGKQSVSLLSGLTQKYKGTNVKLLYAEGCGFTTISTEQLKEAVAIARKADRV 487
Query: 492 VLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDR 551
++ +G + E R D+ LP Q++L+ + +A KP+ ++ G P+D+++ +
Sbjct: 488 LVAVGEQSSWAGESAVRTDIRLPQAQRQLLEAL-KAINKPITIITFSGRPLDLSWE--NE 544
Query: 552 NIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPM------------- 598
N+ +IL A +PG G +A+VI GD NP G L M+ +P+ ++P+
Sbjct: 545 NVQAILQAWFPGTQGGNGIADVIAGDVNPSGHLTMS-FPRSVGQIPIYYNYKNTGRPVYT 603
Query: 599 --TDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSST 656
++ +RP +G Y ++PFG GLSY+ ++ + ++LN+ S
Sbjct: 604 NNEEVDLRPHYNAG-----YLDSSITPLYPFGYGLSYTTFAI-------SNVHLNKKSMK 651
Query: 657 KMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPI 716
+ ++ +V V+N G G+ + L+ + RP+
Sbjct: 652 RYNDS---------------------IIVNASVQNTGTTEGEIVLQLYTRQLVASVSRPV 690
Query: 717 KQLVGFQSVILNAKEKAEIVFELSPCESLS 746
K+L GFQ + L A E ++ FEL P E+L+
Sbjct: 691 KELKGFQKISLKAGESKQVRFEL-PSEALA 719
>gi|383114908|ref|ZP_09935668.1| hypothetical protein BSGG_5166 [Bacteroides sp. D2]
gi|382948422|gb|EIC71783.1| hypothetical protein BSGG_5166 [Bacteroides sp. D2]
Length = 782
Score = 260 bits (664), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 206/713 (28%), Positives = 331/713 (46%), Gaps = 101/713 (14%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIP + EA HG +G AT FP I AA++ L +GQ I
Sbjct: 129 RLGIPMF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSPELVKEVGQVIAK 176
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
E R+ G + P +++ RDPRW R +ET GEDP+++G S V D
Sbjct: 177 EIRS------QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGILGASMV-----DGLG 225
Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
GG L K A KHF AY + Y A V ++DL + PPF + G A
Sbjct: 226 GGNLSQKYATIATLKHFLAYAVPEGGQNGNY---ASVGIRDLHQNFLPPFRKAIDSG-AL 281
Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
+M +YN ++GIP ++ LL++ R +W F G++ SD ++ I+++ A + E+A +
Sbjct: 282 SVMTSYNSIDGIPCTSNHYLLTQLLRNEWKFRGFVVSDLYSIEGIHESHFVALTKENAAI 341
Query: 321 DVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFG 380
+ AG+DV+ G + AV+ ++ ++ ID A+ + ++ +GLF +P + P
Sbjct: 342 QSVTAGVDVDLGGDAYTNLCHAVQSGQMDKAVIDTAVCRVLRMKFEMGLFE-HPYVDP-- 398
Query: 381 KIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGN 440
KI A V H LA + AQ I LLKN + +LPL K+ + +A+IGPNA++ +LG+
Sbjct: 399 KIAAKTVRRKEHIELARKIAQSSITLLKNENSILPLSKTIN-KVAVIGPNADNRYNMLGD 457
Query: 441 YAGPSCRS--ITPLQALQNYV--ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG 496
Y P S T L + + Y GC + I++A++ A+ ++ V++++G
Sbjct: 458 YTAPQEDSNVKTVLDGIITKLSPSRVEYVRGCAIRDTTVNEIEQAIEAARRSEVVIVVVG 517
Query: 497 LDQTQE-----------------------KEELDRVDLVLPGRQQELITRVAEAAKKPVI 533
++ E DR L L GRQQEL+ + + KP+I
Sbjct: 518 GSSARDFKTSYKETGAAVAEEGSVSDMECGEGFDRASLSLLGRQQELLESLQKTG-KPLI 576
Query: 534 LVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDY 593
+V + G P++ +A ++L A YPG+ G A+A+V+FGD+NP GRLP++ P+
Sbjct: 577 VVYIEGRPLEKNWA--SEYADALLTAYYPGQEGGNAIADVLFGDYNPSGRLPIS-VPRSV 633
Query: 594 IKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQS 653
++P+ K P+ Y ++ FG G+SY+ + Y V Q
Sbjct: 634 GQIPVYYNKKAPR------NHDYVEVSSSPLYSFGYGMSYTTFEYSALQVVQKS------ 681
Query: 654 SSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNG 713
R F V+ VKN G+ G+ L+++
Sbjct: 682 --------------------------ARCFEVSFKVKNTGKYDGEEVSQLYMRDEYASVV 715
Query: 714 RPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
+P+KQL F+ L E+ ++ F L+ E V+E G L++G
Sbjct: 716 QPMKQLKHFERFHLKKGEEKKVTFVLTE-EDFFLVNYTLKKVVESGNFHLMIG 767
>gi|423300893|ref|ZP_17278917.1| hypothetical protein HMPREF1057_02058 [Bacteroides finegoldii
CL09T03C10]
gi|408472228|gb|EKJ90756.1| hypothetical protein HMPREF1057_02058 [Bacteroides finegoldii
CL09T03C10]
Length = 798
Score = 260 bits (664), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 228/794 (28%), Positives = 365/794 (45%), Gaps = 148/794 (18%)
Query: 51 PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WSEALHGV----AGV 100
PI R DL+S++TL+EK Q+ + + + P W W + + + G+
Sbjct: 60 PIEARIADLLSQMTLEEKTCQMATLYGSGRVLKDAWPTDGWSTEIWKDGIGNIDEQANGL 119
Query: 101 GK-----------------------------GI---FFNGTIRG-----ATSFPQVILTA 123
GK GI F N IRG AT FP
Sbjct: 120 GKFGSEISYPYANSAKNRHTVQRWFVEKTRLGIPVDFTNEGIRGLCHDRATMFPAQCGQG 179
Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGEDPLV 182
A+++ L I + EA+AL G T +AP ++I +DPRWGR E+ GEDP +
Sbjct: 180 ATWNKKLIREIAKVTADEAKAL-------GYTNIYAPILDIAQDPRWGRVVESYGEDPYL 232
Query: 183 TGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL--DNWKGTTRYKFDARVTMQ 240
G+ + G+Q + A KHF Y + G TR D V +
Sbjct: 233 VGELGKQMILGLQNEGI-----------VATPKHFAVYSIPVGGRDGGTRT--DPHVAPR 279
Query: 241 DLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCD 300
++ Y PF +++ A G+M +YN +G P + L++ R+QWGF GY+ SD +
Sbjct: 280 EMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYVVSDSE 339
Query: 301 AVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-----SFLQKHTKAAVKQKKLPESEIDR 355
AV ++ + E+ V+ AG+++ F+ + A+ + K+ +D+
Sbjct: 340 AVEFLHTKHRITPTEEEMAAQVVNAGLNIRTNFTPPQDFILP-LRRAISEGKVSLHTLDQ 398
Query: 356 ALHNLFSVRMRLGLFNGNPTMQPFGKIGAD-----VVCSPAHQVLALQAAQDGIVLLKNS 410
+ + V+ +GLF+ NP G D VV + AHQ ++++AA + IVLLKN
Sbjct: 399 RVGEILRVKFMMGLFD-NPYP------GDDRRPEVVVHNAAHQDVSMRAALESIVLLKNE 451
Query: 411 HGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI-TPLQALQNYVEN--TVYYPG 467
+LPL KS + +A+IGPNA K L Y GP+ SI T Q ++ Y+ N Y G
Sbjct: 452 KEMLPLSKSFN-KIAVIGPNAEEVKELTCRY-GPANASIKTVYQGIKEYLPNAEVRYAKG 509
Query: 468 CDTV---------------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLV 512
CD + A I++AV++AK +D +L++G ++ +EE R +L
Sbjct: 510 CDIIDKYFPESELYNVPLDTQEKAMINEAVELAKASDVAILVLGGNEKTVREEFSRTNLD 569
Query: 513 LPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAE 572
L GRQQ+L+ V A KPV+LV++ G I +A ++ + +I+ A +PGE A+A+
Sbjct: 570 LCGRQQQLLEAVY-ATGKPVVLVMVDGRAATINWA--NKYVPAIIHAWFPGEFMGDAIAK 626
Query: 573 VIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLS 632
V+FGD+NPGGRL +T +P+ ++P +P + S R ++PFG GLS
Sbjct: 627 VLFGDYNPGGRLAVT-FPKSVGQIPFA-FPFKPGSDSKGKVRV-----AGVLYPFGYGLS 679
Query: 633 YSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNH 692
Y+ + Y VS+ P +G + ++ VKN
Sbjct: 680 YTTFGYSDLKVSK-------------------------PVIGAQ----ENITLSCTVKNT 710
Query: 693 GEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDG 752
G+ AG V L+++ K L GF+ + L E+ I F L+P + L ++
Sbjct: 711 GKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEERTISFTLTP-QDLGLWDKNN 769
Query: 753 LMVIEEGTHFLVVG 766
+E G+ ++VG
Sbjct: 770 HFTVEPGSFSVMVG 783
>gi|423212854|ref|ZP_17199383.1| hypothetical protein HMPREF1074_00915 [Bacteroides xylanisolvens
CL03T12C04]
gi|392694712|gb|EIY87939.1| hypothetical protein HMPREF1074_00915 [Bacteroides xylanisolvens
CL03T12C04]
Length = 782
Score = 260 bits (664), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 206/713 (28%), Positives = 332/713 (46%), Gaps = 101/713 (14%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIP + EA HG +G AT FP I AA++ L +GQ I
Sbjct: 129 RLGIPMF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSPELVKEVGQVIAK 176
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
E R+ G + P +++ RDPRW R +ET GEDP+++G S V D
Sbjct: 177 EIRS------QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGILGASMV-----DGLG 225
Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
GG L K A KHF AY + Y A V ++DL + PPF + G A
Sbjct: 226 GGNLSQKYATIATLKHFLAYAVPEGGQNGNY---ASVGIRDLHQNFLPPFRKAIDAG-AL 281
Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
+M +YN ++GIP ++ LL++ R +W F G++ SD ++ I+++ A + E+A +
Sbjct: 282 SVMTSYNSIDGIPCTSNHYLLTQLLRNEWKFRGFVVSDLYSIEGIHESHFVAPTKENAAI 341
Query: 321 DVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFG 380
+ AG+DV+ G + AV+ ++ ++ ID A+ + ++ +GLF +P + P
Sbjct: 342 QSVTAGVDVDLGGDAYTNLCHAVQSGQMDKTVIDTAVCRVLRMKFEMGLFE-HPYVDP-- 398
Query: 381 KIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGN 440
KI A V H LA + AQ I LLKN + +LPL K+ + +A+IGPNA++ +LG+
Sbjct: 399 KIAAKTVRRKEHIELARKIAQSSITLLKNENSILPLSKTIN-KVAVIGPNADNRYNMLGD 457
Query: 441 YAGPSCRS--ITPLQALQNYVE--NTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG 496
Y P S T L + + Y GC + I++A++ A+ ++ V++++G
Sbjct: 458 YTAPQEDSNVKTVLDGILTKLSPFRVEYVRGCAIRDTTVNEIEQAIEAARRSEVVIVVVG 517
Query: 497 LDQTQE-----------------------KEELDRVDLVLPGRQQELITRVAEAAKKPVI 533
++ E DR L L GRQQEL+ + + KP+I
Sbjct: 518 GSSARDFKTSYKETGAAVAEEGSVSDMECGEGFDRASLSLLGRQQELLESLQKTG-KPLI 576
Query: 534 LVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDY 593
+V + G P++ +A ++L A YPG+ G A+A+V+FGD+NP GRLP++ P+
Sbjct: 577 VVYIEGRPLEKNWA--SEYADALLTAYYPGQEGGNAIADVLFGDYNPSGRLPIS-VPRSV 633
Query: 594 IKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQS 653
++P+ K P+ Y ++ FG G+SY+ + Y V Q
Sbjct: 634 GQIPVYYNKKAPR------NHDYVEVSSSPLYSFGYGMSYTTFEYSDLQVVQKS------ 681
Query: 654 SSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNG 713
R F V+ VKN G+ G+ L+++
Sbjct: 682 --------------------------ARCFEVSFKVKNTGKYDGEEVSQLYMRDEYASVV 715
Query: 714 RPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
+P+KQL F+ L E+ ++ F L+ E V+E GT +++G
Sbjct: 716 QPMKQLKHFERFHLKKGEEKKVTFVLTE-EDFFLVNYTLKKVVESGTFQVMIG 767
>gi|224535250|ref|ZP_03675789.1| hypothetical protein BACCELL_00111 [Bacteroides cellulosilyticus
DSM 14838]
gi|224523135|gb|EEF92240.1| hypothetical protein BACCELL_00111 [Bacteroides cellulosilyticus
DSM 14838]
Length = 786
Score = 260 bits (664), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 220/793 (27%), Positives = 355/793 (44%), Gaps = 130/793 (16%)
Query: 51 PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WSEALHGVAGVGKGI 104
PI R +DL+S++TL+EK Q+ + + + +P +W W + + + G+
Sbjct: 48 PIEARVQDLLSQMTLEEKTCQMATLYGSGRVLKDSLPTEKWKDEIWKDGIANIDEQANGL 107
Query: 105 ------------------------------------FFNGTIRG-----ATSFPQVILTA 123
F N IRG AT FP
Sbjct: 108 GRFGSSLSYPYVNSVENRQTIQRWFVEQTRLGIPVDFTNEGIRGLCHDRATMFPAQCGQG 167
Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGEDPLV 182
A+++ L I Q EA+AL G T ++P ++I +DPRWGR E GEDP +
Sbjct: 168 ATWNKELISEIAQVTAEEAKAL-------GYTNIYSPILDIAQDPRWGRVVECYGEDPFL 220
Query: 183 TGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDL 242
G+ ++G+Q + A KHF Y + + D V +++
Sbjct: 221 VGELGKRMIKGLQQEGL-----------VATPKHFAVYSIPVGGRDAGTRTDPHVAPREM 269
Query: 243 ADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAV 302
Y PF + A G+M +YN +G P + L++ R +WGF GY+ SD +AV
Sbjct: 270 RTLYIEPFRKAFCEAGALGVMSSYNDYDGEPITGSYHFLTEILRHEWGFKGYVVSDSEAV 329
Query: 303 SIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKH----TKAAVKQKKLPESEIDRALH 358
+Y A D V+ AG++V L ++ + A+ + K+ ID +
Sbjct: 330 EFLYSKHQVAADAVDGAAQVVNAGLNVRTNFTLPENFIRPLRQAISEGKVSMQTIDSRVA 389
Query: 359 NLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPK 418
++ V+ +GLF+ NP + K VV S HQ ++++AA + IVLLKN + +LPL K
Sbjct: 390 DVLRVKFGMGLFD-NP-YKGDAKHPEKVVHSKEHQAVSMRAALESIVLLKNENNILPLSK 447
Query: 419 SKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN--TVYYPGCDTV----- 471
+A+IGPNAN + L+ Y + T Q ++ Y+ + Y G D +
Sbjct: 448 DLK-KVAVIGPNANEVQNLICRYGPANAPIKTVYQGIKEYLPDAEVRYAKGTDIIDKYFP 506
Query: 472 ----------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELI 521
A +D+AV +AK +D ++++G ++ +EE R +L L GRQ++L+
Sbjct: 507 ESELYEVPLDQEEQAMMDEAVALAKESDVAIMVLGGNEKTVREEYSRTNLDLCGRQEKLL 566
Query: 522 TRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
V A KPVIL+L+ G I +A +R I I+ A +PGE A+A+V+FGD+NPG
Sbjct: 567 QAVY-ATGKPVILLLVDGRAATINWA--ERYIPGIVHAWFPGEFMGDAVAQVLFGDYNPG 623
Query: 582 GRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFK 641
G+L +T +P+ ++P +P + S R ++PFG GLSY+ ++Y
Sbjct: 624 GKLAVT-FPRSVGQIPFA-FPFKPGSDSKGFVRV-----TGTLYPFGYGLSYTTFAYSDL 676
Query: 642 AVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPV 701
+ L + S K+ + VKN G++AG V
Sbjct: 677 KIEN--LVIGVQGSVKL---------------------------SCKVKNTGKVAGDEVV 707
Query: 702 LLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTH 761
L++ +K L GF+ + L E+ I F L+P E L +D V+E GT
Sbjct: 708 QLYLHDEMSSVTTYVKVLRGFERIHLEPGEEKVIDFVLTPQE-LGLWNKDNHFVVEPGTF 766
Query: 762 FLVVGDEEYPISI 774
++VG I +
Sbjct: 767 AVMVGSSSQDIKL 779
>gi|332881173|ref|ZP_08448832.1| glycosyl hydrolase family 3 protein, partial [Capnocytophaga sp.
oral taxon 329 str. F0087]
gi|332680887|gb|EGJ53825.1| glycosyl hydrolase family 3 protein [Capnocytophaga sp. oral taxon
329 str. F0087]
Length = 675
Score = 260 bits (664), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 160/446 (35%), Positives = 231/446 (51%), Gaps = 35/446 (7%)
Query: 36 SNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALH 95
S+P+ + P+ L +RA DL+ RLTL EKIS + N +P + RLGI Y WWSEALH
Sbjct: 17 SSPAQDE-PYKNPDLSPQERADDLLKRLTLKEKISLMQNQSPGVERLGIKPYNWWSEALH 75
Query: 96 GVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-- 153
GVA G AT +P + A+ FD L I + E RA ++ + G
Sbjct: 76 GVARNGL----------ATVYPITMGMASVFDDKLIEDIYVTVSDEGRAKFHDARRHGRY 125
Query: 154 ------MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGK 207
+TFW PN+NIFRDPRWGRGQET GEDP +T + V+ VRG+QG K
Sbjct: 126 GRGNEGLTFWNPNVNIFRDPRWGRGQETWGEDPYLTTRMGVAVVRGMQGPA--DAKYD-- 181
Query: 208 LQASACCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAY 266
+ AC KH+ + K R+ FD + +DL +TY P F++ V++ +MCAY
Sbjct: 182 -KTHACAKHYAVHSGPEAK---RHSFDVENLEPRDLWETYLPAFKALVQEADVKEVMCAY 237
Query: 267 NRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKA- 325
R G P C LL++ R +WG+ + SDC A+S + +G ++ DA A
Sbjct: 238 QRFEGEPCCGSNRLLTQILRDEWGYKHLVVSDCGAISDFF-YQGRHETHPDAATSSASAV 296
Query: 326 --GMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIG 383
G D+ CG H AV++ + E ID +L L R LG + + + P+ +I
Sbjct: 297 INGTDLECG-VEYAHLDEAVERGLITEHRIDTSLRRLLEARFALGEMD-DDALVPWSRIS 354
Query: 384 ADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG 443
D V H+ +AL + +VLL N+ G+LPL K + +A++GPNA + GNY G
Sbjct: 355 IDTVDCGTHRRMALDVTRKSMVLLHNN-GILPLDKGDAGKIAVMGPNAVDSVMQWGNYKG 413
Query: 444 PSCRSITPLQALQNYVENTVYYPGCD 469
+ T L+ ++ + N Y GC+
Sbjct: 414 VPAHTYTILEGIRGAIGNVPYEKGCE 439
>gi|357047866|ref|ZP_09109459.1| glycosyl hydrolase family 3 protein, partial [Paraprevotella clara
YIT 11840]
gi|355529205|gb|EHG98644.1| glycosyl hydrolase family 3 protein, partial [Paraprevotella clara
YIT 11840]
Length = 676
Score = 260 bits (664), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 160/446 (35%), Positives = 231/446 (51%), Gaps = 35/446 (7%)
Query: 36 SNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALH 95
S+P+ + P+ L +RA DL+ RLTL EKIS + N +P + RLGI Y WWSEALH
Sbjct: 17 SSPAQDE-PYKNPDLSPQERADDLLKRLTLKEKISLMQNQSPGVERLGIKPYNWWSEALH 75
Query: 96 GVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-- 153
GVA G AT +P + A+ FD L I + E RA ++ + G
Sbjct: 76 GVARNGL----------ATVYPITMGMASVFDDKLIEDIYVTVSDEGRAKFHDARRHGRY 125
Query: 154 ------MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGK 207
+TFW PN+NIFRDPRWGRGQET GEDP +T + V+ VRG+QG K
Sbjct: 126 GRGNEGLTFWNPNVNIFRDPRWGRGQETWGEDPYLTTRMGVAVVRGMQGPA--DAKYD-- 181
Query: 208 LQASACCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAY 266
+ AC KH+ + K R+ FD + +DL +TY P F++ V++ +MCAY
Sbjct: 182 -KTHACAKHYAVHSGPEAK---RHSFDVENLEPRDLWETYLPAFKALVQEADVKEVMCAY 237
Query: 267 NRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKA- 325
R G P C LL++ R +WG+ + SDC A+S + +G ++ DA A
Sbjct: 238 QRFEGEPCCGSNRLLTQILRDEWGYKHLVVSDCGAISDFF-YQGRHETHPDAATSSASAV 296
Query: 326 --GMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIG 383
G D+ CG H AV++ + E ID +L L R LG + + + P+ +I
Sbjct: 297 INGTDLECG-VEYAHLDEAVERGLITEHRIDTSLRRLLEARFALGEMD-DDALVPWSRIS 354
Query: 384 ADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG 443
D V H+ +AL + +VLL N+ G+LPL K + +A++GPNA + GNY G
Sbjct: 355 IDTVDCGTHRRMALDVTRKSMVLLHNN-GILPLDKGDAGKIAVMGPNAVDSVMQWGNYKG 413
Query: 444 PSCRSITPLQALQNYVENTVYYPGCD 469
+ T L+ ++ + N Y GC+
Sbjct: 414 VPAHTYTILEGIRGAIGNVPYEKGCE 439
>gi|333379224|ref|ZP_08470948.1| hypothetical protein HMPREF9456_02543 [Dysgonomonas mossii DSM
22836]
gi|332885492|gb|EGK05741.1| hypothetical protein HMPREF9456_02543 [Dysgonomonas mossii DSM
22836]
Length = 745
Score = 259 bits (663), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 214/758 (28%), Positives = 338/758 (44%), Gaps = 100/758 (13%)
Query: 52 ISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAY-----EWWSEALHG--VAGVGKG- 103
+ + DL+ R+TL+EKI Q V + P E+ + + G VG
Sbjct: 31 VDKFVDDLLRRMTLEEKIGQTVLYTSGYDVITGPTVDPNYKEYLKKGMVGGIFNAVGADY 90
Query: 104 ------------------IFFNGTIRGA-TSFPQVILTAASFDSYLWYRIGQAIGLEARA 144
IF I G T FP + + S+D +A+ AR
Sbjct: 91 TRSLQKIAVEETRLGIPLIFGYDVIHGQRTIFPIPLAESCSWDL-------EAMERSARI 143
Query: 145 LYNAGQAIGMTF-WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGK 203
+ A G+ + +AP ++I RDPRWGR E GED + A + V+G QGD
Sbjct: 144 AASEATAEGINWIYAPMVDISRDPRWGRVAEGAGEDVYLGSLIAAARVKGFQGDN----- 198
Query: 204 LKGKLQASACCKHFTAYDLDNWKGTTRYKFD---ARVTMQDLADTYQPPFESCVKQGRAS 260
L AC KH+ AY G T D +++ +L +TY PPF++ + G +
Sbjct: 199 LSAVNTVVACVKHYAAY------GATMAGRDYNTVDMSLNELWNTYLPPFKAALDAGCGT 252
Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
IM ++N +NGIP+ ++ LL R +W F+G++ +D +++ + GYA + +
Sbjct: 253 -IMTSFNDLNGIPATGNKYLLKDILRDKWNFNGFVVTDYTSINEMI-PHGYANDEKHSAE 310
Query: 321 DVLKAGMDVNC-GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
+ AG+D++ G H K +++ K+ E ++ A + ++ +LGLF
Sbjct: 311 IAMNAGVDMDMQGGVYMNHLKTLIEEGKVSEKDVTEAARAILKIKYKLGLFEDPYRYCDA 370
Query: 380 GKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLG 439
+ D++ +PA++ A A+ +VLLKN LPL ++K V ALIGP +LG
Sbjct: 371 NREKTDIL-TPANKEAARDMARKSMVLLKNDKQTLPLKENKRV--ALIGPLVKDKYEILG 427
Query: 440 NYAGPSCRSITPLQALQNYVE-----NTVYYPGCDTVACSSASIDKAVDIAKGADHVVLM 494
++ R P+ VE Y GCD + + +AV +A +D VV++
Sbjct: 428 CWSAMGNRDTIPVSVYDGLVEAIGKDKISYAKGCDIQSEDTKGFAEAVRVASASDVVVMV 487
Query: 495 MGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIG 554
MG E R +L LPG Q +L+ + + K PV+LVL+ G P+ I + K N+
Sbjct: 488 MGEFHNMSGENNSRTNLSLPGVQVDLLKAIKKTGK-PVVLVLMNGRPLTINWEK--DNLD 544
Query: 555 SILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPM------TDMKMRPQAT 608
+IL A +PG G A+A+V+ G +NP G+L MT +PQ+ ++P+ T P
Sbjct: 545 AILEAWFPGTMGGAAIADVLTGKYNPSGKLTMT-FPQNVGQIPLFYNHKNTGRPYDPNVP 603
Query: 609 SGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYK 668
G Y + ++PFG GLSY+ ++Y +S
Sbjct: 604 QFAYGSRYWDVSNEPLYPFGYGLSYTTFTYSDLTLSSK---------------------- 641
Query: 669 SVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILN 728
E + V++ + N GE G+ V L+ + RP+K+L GF+ V L
Sbjct: 642 -------EITKENPLKVSVKLTNSGEYDGEEVVQLYTRDLVGSVTRPVKELKGFKKVFLK 694
Query: 729 AKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
A E I F LS L V E G L VG
Sbjct: 695 AGESKVIDFTLS-VNDLRFYNSQLEYVYEPGDFHLFVG 731
>gi|160890630|ref|ZP_02071633.1| hypothetical protein BACUNI_03075 [Bacteroides uniformis ATCC 8492]
gi|317479833|ref|ZP_07938953.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides sp. 4_1_36]
gi|156859629|gb|EDO53060.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
uniformis ATCC 8492]
gi|316904039|gb|EFV25873.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides sp. 4_1_36]
Length = 860
Score = 259 bits (663), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 212/718 (29%), Positives = 316/718 (44%), Gaps = 116/718 (16%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIP + +EALHG G+T +PQ I A++F+ L YR I
Sbjct: 113 RLGIPVFTV-AEALHG-----------SVHEGSTIYPQNIALASTFNPELAYRRAAEISK 160
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTF- 199
E Y + I AP I++ RD RWGR +E+ GEDP + G +A +G +
Sbjct: 161 ELH--YQGIRQI----LAPCIDVVRDLRWGRVEESYGEDPFLNGIFAYEEAKGYLDNGIS 214
Query: 200 --------NGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFE 251
+G L G AS C + +L D Y PF+
Sbjct: 215 PMLKHYGPHGNPLGGLNLASVHCG-----------------------VGELHDVYLQPFK 251
Query: 252 SCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGY 311
V +M YN N +P+ + LL++ R +WGF GYI SD A+ +++ +
Sbjct: 252 RVVTSLPIHAVMSTYNSWNRVPNSSSHYLLTEVLRNRWGFQGYIYSDWGAIDMLHTFQRT 311
Query: 312 AKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFN 371
A + +A V + AG+DV S H A VK+KK+ E ID+A+ + + R+GLF
Sbjct: 312 ASNQAEAAVQAIVAGLDVEASSECFPHLAALVKEKKVDEGIIDKAVSRVLLAKFRMGLFE 371
Query: 372 GNPTMQPFG-KIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPN 430
P+G + + S + +A Q A + VLLKN LLPL S+ S+A+IGPN
Sbjct: 372 -----DPYGDRFAGHSLHSQENIQVARQIADESTVLLKNDKDLLPLNLSQLKSIAVIGPN 426
Query: 431 ANSAKTLLGNYAGPSCRS--ITPLQALQNYVE----NTVYYPGCDTVACSSASIDKAVDI 484
A+ + G+Y ITPL+ ++ VE Y GC+ ++ + I AV+
Sbjct: 427 ADQVQ--FGDYTWSRTNQDGITPLEGIRKQVEPVGIKIRYAKGCNMMSMDTTQIAAAVEA 484
Query: 485 AKGADHVVLMMG---------LDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILV 535
A+ +D +L G +T E D DL L G Q +LI V A KPV+LV
Sbjct: 485 ARQSDAAILFCGSASASLARDYHETNCGEGFDLTDLSLTGAQGKLIQAV-HATGKPVVLV 543
Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTW------Y 589
L+ G P I + K +I +IL Y GE ++A+++FG NP G L +++
Sbjct: 544 LVTGKPFAIAWEK--EHIPAILVQWYAGEQEGSSIADILFGKTNPSGHLTVSFPKSSGHL 601
Query: 590 PQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLY 649
P Y +P + PGR Y F ++ FG GL+Y+ + Y + Q
Sbjct: 602 PAYYNHLPTDRGFYHKPGSYEQPGRDYVFSSPGPLWAFGHGLTYTTFEYADLQIEQT--- 658
Query: 650 LNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPAR 709
++ K LVT VKN G AGK L+V+
Sbjct: 659 ----------------------------TDSVKVLVT--VKNTGTRAGKAVPQLYVRDVF 688
Query: 710 RGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGD 767
P++QL FQ V L E+ ++ + E L+ E+G +E G + +GD
Sbjct: 689 SSISTPVRQLKAFQKVELKPDEEVQVPLHFA-IEDLAFTNENGQTAVEPGNFEIQMGD 745
>gi|374310554|ref|YP_005056984.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
gi|358752564|gb|AEU35954.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
Length = 739
Score = 259 bits (663), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 226/782 (28%), Positives = 351/782 (44%), Gaps = 111/782 (14%)
Query: 10 FPLLCLC-FTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEK 68
F LLCL S++ PP D S+ E+ R L++ +TLDEK
Sbjct: 22 FLLLCLVPVASVMAAQKVAAPPVYLD-SHADPES------------RVTALLAAMTLDEK 68
Query: 69 ISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRG--------ATSFPQVI 120
I L ++ P++PRLG+ E LHG+A G G + G G T FPQ
Sbjct: 69 IHAL-STDPSVPRLGVAGTNH-VEGLHGLALGGPG-HWEGHSEGRTMLNVIPTTQFPQSR 125
Query: 121 LTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDP 180
++D L + E R + G+ APN ++ RDPRWGRG+E+ GEDP
Sbjct: 126 GLGQTWDPALLQKAAAQEAYETRFAFGKYHRGGLVVRAPNADLSRDPRWGRGEESYGEDP 185
Query: 181 LVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQ 240
+ G A ++ G+QGD + ++ KHF A ++ + + FDAR+
Sbjct: 186 FLVGTLATAFAHGLQGDDPH------VWMTASLLKHFLANSNEDGRDGSSSNFDARL--- 236
Query: 241 DLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCD 300
+ Y PF +++G A +M +YN N +P A+ ++ QWG G + +D
Sbjct: 237 -FHEYYAVPFRMAIEEGHADAMMTSYNAWNSVPMTAN-PVVRDVVMAQWGLDGIVCTDAG 294
Query: 301 AVS-IIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHN 359
A++ ++ YA PE A + AG++ + Q+ + A++QK + E +IDR L
Sbjct: 295 ALTNMVKQHHTYATMPEAAAAAI-HAGINQFLDDY-QQPVRDALQQKLITEQDIDRNLRG 352
Query: 360 LFSVRMRLGLFNGNPTMQ-PFGKIGA------DVVCSPAHQVLALQAAQDGIVLLKNSHG 412
++ V + LGL +PT P+ IGA D + A + L +A + IVLLKN+ G
Sbjct: 353 VYRVMLHLGLL--DPTANSPYSHIGAFDQAQSDPWNTEAPRALVRRATDESIVLLKNTGG 410
Query: 413 LLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV--ENTVYYPGCDT 470
LPL +K S+A+IG ++ L Y+G S+TP++ ++ + V+ G D
Sbjct: 411 ALPLDAAKLKSIAVIGQWGDTVA--LDWYSGTPLLSVTPVEGIRRRAAGASVVFNDGKDE 468
Query: 471 VACSSASIDKAVDIAKGADHVVLMMGLDQ----TQEKEELDRVDLVLPGRQQELITRVAE 526
A ++ + I +H G ++ ++ KE +DR L LP E + +
Sbjct: 469 AAAAALAARSEAVIVIVGNHPTCDAGWNKCALPSEGKEAIDRKSLTLP---DESLVKAVL 525
Query: 527 AAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPM 586
AA ++VL P + + + +IL + E ALA+V+FGD+NP GRL
Sbjct: 526 AANPHAVVVLQTSFPYTTNWTQ--EHAPAILEITHNSEEQGTALADVLFGDYNPAGRLTQ 583
Query: 587 TWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQN 646
TW PM D +R GRTY + E ++PFG GLSY+ ++Y V+Q
Sbjct: 584 TWPASLEQLPPMMDYDLR-------HGRTYLYAEKAPLYPFGFGLSYTSFAYSDLTVTQR 636
Query: 647 KLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVK 706
+ V + V N G AG V ++
Sbjct: 637 G---------------------------------KSIAVQVTVANTGSRAGDEVVQIYAA 663
Query: 707 PARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTH-FLVV 765
RPI++L F+ V L A EK + FE+ P SL+ +E TH F+V
Sbjct: 664 HQGSTVPRPIEELKAFRRVALRAGEKQVVRFEM-PVTSLA--------YWDEATHRFIVE 714
Query: 766 GD 767
GD
Sbjct: 715 GD 716
>gi|167765093|ref|ZP_02437206.1| hypothetical protein BACSTE_03479 [Bacteroides stercoris ATCC
43183]
gi|167696721|gb|EDS13300.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
stercoris ATCC 43183]
Length = 944
Score = 259 bits (662), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 203/694 (29%), Positives = 322/694 (46%), Gaps = 86/694 (12%)
Query: 105 FFNGTIRG-----ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWA 158
F N IRG AT+FP + +++ L ++G G EAR L G T +A
Sbjct: 160 FTNEGIRGVESYKATNFPTQLGLGHTWNRELIRQVGLITGREARML-------GYTNVYA 212
Query: 159 PNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFT 218
P +++ RD RWGR +E GE P + + + VRG+Q + Q +A KHF
Sbjct: 213 PILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGLQHNH----------QVAATAKHFA 262
Query: 219 AYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADR 278
AY + + D ++ +++ + + PF+ +++ G+M +YN +GIP
Sbjct: 263 AYSNNKGAREGMARVDPQMPPREVENIHIYPFKRVIREAGLLGVMSSYNDYDGIPIQGSY 322
Query: 279 NLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC-----GS 333
L+ R++ GF GY+ SD DAV +Y AK ++AV ++AG++V C S
Sbjct: 323 YWLTTRLRKEMGFRGYVVSDSDAVEYLYTKHNTAKDMKEAVRQSVEAGLNVRCTFRSPDS 382
Query: 334 FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQ 393
F+ + VK+ L E I+ + ++ V+ +GLF+ G D V A++
Sbjct: 383 FVLP-LRELVKEGGLSEEVINDRVRDILRVKFLIGLFDAPYQTDLAG--ADDEVEKEANE 439
Query: 394 VLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQ 453
+ALQA+++ IVLLKN+ LPL K +A+ GPNA+ L +Y + T L+
Sbjct: 440 AVALQASRESIVLLKNTDNTLPLNIDKIKKIAVCGPNADEEGYALTHYGPLAVEVTTVLE 499
Query: 454 ALQNYVE---NTVYYPGCDTVAC---------------SSASIDKAVDIAKGADHVVLMM 495
++ + +Y GCD V A ID+AV A+ AD V+++
Sbjct: 500 GIREKAQGKAEVLYTKGCDLVDAHWPESEIMEYPLTPDEQAEIDRAVANARQADVAVVVL 559
Query: 496 GLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGS 555
G Q E R L LPG Q +L+ V +A KPVIL+L+ G P+ + +A D+ + +
Sbjct: 560 GGGQRTCGENKSRTSLELPGHQLKLLQAV-QATGKPVILILINGRPLSVNWA--DKFVPA 616
Query: 556 ILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMT-DMKMRPQATSG-NPG 613
IL A YPG G +A+++FGD+NPGG+L +T +P+ ++P K Q G NPG
Sbjct: 617 ILEAWYPGSKGGTVVADILFGDYNPGGKLTVT-FPKTVGQIPFNFPYKPASQIDGGKNPG 675
Query: 614 RTYRFYE-GKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPE 672
++PFG GLSY+ + Y ++ + NQ ++ ++
Sbjct: 676 PDGNMSRINGALYPFGYGLSYTTFEYSDLEITPKVITPNQKATIRL-------------- 721
Query: 673 LGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEK 732
V N G+ AG V L+ + K L GF+ + L E
Sbjct: 722 ---------------KVTNTGKRAGDEVVQLYTRDILSSVTTYEKNLAGFERIHLKPGES 766
Query: 733 AEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
EIVF L + L D +E G ++ G
Sbjct: 767 KEIVFTLD-RKHLELLNADMKWTVEPGEFAIMAG 799
>gi|317492602|ref|ZP_07951029.1| glycosyl hydrolase family 3 N terminal domain-containing protein
[Enterobacteriaceae bacterium 9_2_54FAA]
gi|316919352|gb|EFV40684.1| glycosyl hydrolase family 3 N terminal domain-containing protein
[Enterobacteriaceae bacterium 9_2_54FAA]
Length = 810
Score = 259 bits (662), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 235/798 (29%), Positives = 372/798 (46%), Gaps = 142/798 (17%)
Query: 15 LCFTSLLT-RVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARD-----LVSRLTLDEK 68
LC SL+ + +T P D + P+ P K P S + RD L+S++TLDEK
Sbjct: 42 LCSLSLVAGLIATTLSPVYADATQPA---HPLVK---PQSTQQRDAFVTDLLSKMTLDEK 95
Query: 69 ISQ--LVNSAPAIPRLGIPAYEWWSEALHGVAGVGK-GIFFNGTIRG------------- 112
I Q L++ P P+ EA+ + G+ G FN R
Sbjct: 96 IGQMRLISVGPDNPK----------EAIREMIKQGQVGAIFNTVTRQDIRKMQDQAMQLS 145
Query: 113 ----------------ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTF 156
T FP + A+S++ +G+ EA + + MT
Sbjct: 146 RLKIPLFFAYDVVHGQRTVFPISLGLASSWNRDAVTTVGRISAYEA-----SEDGLNMT- 199
Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKH 216
WAP +++ RDPRWGR E GED +T + V +QG N + + S KH
Sbjct: 200 WAPMVDVTRDPRWGRTSEGFGEDTYLTSEMGRLMVEAMQG---NNPADRNSVMTS--VKH 254
Query: 217 FTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
F AY +G Y ++ Q L Y PP+++ + G + G+M + N +NG P+ +
Sbjct: 255 FAAYGA--VEGGRDYN-TVDMSPQRLFQDYMPPYKAALDAG-SGGVMVSLNSINGTPATS 310
Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGS-FL 335
+ LL R +W F G SD A+ + G A PEDAV +K+G+D++ +
Sbjct: 311 NSWLLKDVLRDEWNFKGITISDHGAIKELI-KHGVASDPEDAVRVAVKSGIDMSMSDEYY 369
Query: 336 QKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA---- 391
K+ + VK ++ E E+D A ++ +V+ +GLF G +G+D V + A
Sbjct: 370 SKYLPSLVKSGRVSEKEVDDAARHVLNVKYDMGLFTD--AYSHLGPVGSDPVDTNAESRL 427
Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNY--AGPSCRSI 449
H+ A A++ +VLLKN LPL KS ++ ALIGP A+S + ++G++ AG +S+
Sbjct: 428 HRPEARSVARESMVLLKNRLNTLPLAKSGTI--ALIGPLADSKRDVMGSWSAAGVVDQSV 485
Query: 450 TPLQALQNYVENT---VYYPGCD------------------TVACSS--ASIDKAVDIAK 486
T LQ L+N V + VY G + TV S A ID+AV AK
Sbjct: 486 TVLQGLRNAVGDKAQIVYAKGANVSNDPGITDFLNLYEKAVTVDPRSPQAMIDEAVATAK 545
Query: 487 GADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITF 546
+D +V ++G Q E R D+ +P Q++LI + + K P++LVL+ G P +
Sbjct: 546 KSDVIVAVVGEAQGMAHEASSRSDITIPQSQRDLIAALKQTGK-PLVLVLMNGRP--LAL 602
Query: 547 AKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKM--- 603
K D+ +IL + + G G A+A+V+FGD+NP G+LPM+ +P+ ++P+ +
Sbjct: 603 EKEDQQADAILESWFSGTEGGNAIADVLFGDYNPSGKLPMS-FPRSVGQIPIYYSHLNTG 661
Query: 604 RPQATSGNPGRTYRFYEGKE--VFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVEN 661
RP T +Y+ ++PFG GLSY+ +S VS K+ SS M +
Sbjct: 662 RPYNPEKPEKYTSHYYDAANGPLYPFGYGLSYTTFS-----VSDVKM-----SSPIMKRD 711
Query: 662 QDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVG 721
V ++ VKN G+ G V L+++ RP+K+L G
Sbjct: 712 GSVT-------------------ASVTVKNTGKRDGATVVQLYIQDKTATMSRPVKELKG 752
Query: 722 FQSVILNAKEKAEIVFEL 739
F+ V+L + E+ I F++
Sbjct: 753 FEKVMLKSGEQKTIHFKI 770
>gi|218131278|ref|ZP_03460082.1| hypothetical protein BACEGG_02889 [Bacteroides eggerthii DSM 20697]
gi|217986495|gb|EEC52831.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
eggerthii DSM 20697]
Length = 862
Score = 259 bits (661), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 167/441 (37%), Positives = 237/441 (53%), Gaps = 36/441 (8%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
+ T+L +RA L+ LTL+EK++ +++++ A+ RLGI Y WW+EALHGVA G
Sbjct: 22 YKTTSLAPEKRAELLLKELTLEEKVALMMDNSQAVERLGIKPYNWWNEALHGVARAGL-- 79
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI--------GMTF 156
AT FPQ I AASF Y + A+ EARA NA A G+T
Sbjct: 80 --------ATVFPQPIGMAASFSPESVYEVFSAVSDEARA-KNANYASQGSYARYQGLTM 130
Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKH 216
W P +NI+RDPRWGRG ET GEDP +T + V V+G+QG N GK + AC KH
Sbjct: 131 WTPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGT--NDGKYD---KLHACAKH 185
Query: 217 FTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSC 275
F + W R+ F+ ++ +DL +TY PPFE+ VK+ + +MCAYN G P C
Sbjct: 186 FAVHSGPEW---NRHSFNVEDLSARDLYETYLPPFEALVKEAKVKEVMCAYNSFEGEPCC 242
Query: 276 ADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDVNCGS 333
LL + R WGF G I SDC A++ Y+ G+ P E A + G D+ CGS
Sbjct: 243 GSNRLLMQILRNDWGFDGIILSDCGAIADFYNEHGHKAYPDAESASAAAVLNGTDLECGS 302
Query: 334 FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQ 393
+ K AV+ ++ E +ID+A+ L R LG + +P + KI VV S H
Sbjct: 303 SYKALVK-AVRAGQINEKDIDKAVMRLLEARFALGEMD-DPDNVSWTKIPFSVVASAEHD 360
Query: 394 VLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQ 453
LAL A+ + LL+N+ +LPL K +++A++GPNAN + GNY G ++T L
Sbjct: 361 SLALNMARKSMTLLQNTGNILPL-KRGGLTVAVMGPNANDSVMQWGNYNGMPPHTVTILG 419
Query: 454 ALQNYV---ENTVYYPGCDTV 471
++ + + +Y GC V
Sbjct: 420 GIRKALGADDKLIYEQGCSWV 440
Score = 118 bits (295), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 94/300 (31%), Positives = 137/300 (45%), Gaps = 53/300 (17%)
Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
I+ +V K AD VV G+ E EE+ DR D+ LP Q+ELI+ + A
Sbjct: 589 INASVARVKDADIVVFASGVSPVLEGEEMGVNLPGFKGGDRTDIELPAIQRELISALHRA 648
Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMT 587
KK +ILV G P I +N +IL A YPG+ G A+AEV+FGD+NPGGRLP+T
Sbjct: 649 GKK-IILVNCSGSP--IALEPETKNCEAILQAWYPGQQGGTAVAEVLFGDYNPGGRLPVT 705
Query: 588 WYPQDYIKVPMTDMKMRPQATSGN-PGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQN 646
+Y +M P N GRTYR+ + +FPFG GLSY+ + Y +S +
Sbjct: 706 FY---------RNMSQLPDFEDYNMTGRTYRYMTQQPLFPFGYGLSYTTFDY--GKLSLD 754
Query: 647 KLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVK 706
K + Q K+ + V N G+ G+ V +++K
Sbjct: 755 KEQIKQGEPLKLA---------------------------VSVTNSGQRDGEEVVQVYLK 787
Query: 707 PARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
G P K L F+ V + A + + F+L E + M + G + ++VG
Sbjct: 788 KQDDAEG-PGKTLRAFKRVYIPAGKSVNVEFDLKDKELEWWDAQSNTMRVCAGKYDIMVG 846
>gi|347736643|ref|ZP_08869226.1| xylosidase/arabinosidase [Azospirillum amazonense Y2]
gi|346919803|gb|EGY01181.1| xylosidase/arabinosidase [Azospirillum amazonense Y2]
Length = 775
Score = 259 bits (661), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 218/723 (30%), Positives = 341/723 (47%), Gaps = 100/723 (13%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIP + E LHG +G TSFPQ I A+S+D L + +
Sbjct: 121 RLGIPVL-FHEEGLHGYPAIGP-----------TSFPQAIAQASSWDPDLIREVDSVVAR 168
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
E R + + + +P +++ RDPRWGR +ET GEDP + G+ V+ V+G+QGD+
Sbjct: 169 EIRV-----RGVSLVL-SPVVDVARDPRWGRIEETFGEDPYLAGEMGVAAVQGLQGDSLP 222
Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
GK+ A KH T + GT A V + L + + PPFE + +
Sbjct: 223 --LADGKV--FATLKHLTGHGQPE-SGTNVGP--ASVGERTLREMFFPPFEQVIHRTNVR 275
Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
+M +YN ++G+PS + LL R +WG+ G I SD A+ + A +
Sbjct: 276 AVMASYNEIDGVPSHVNTWLLHDILRGEWGYKGSIISDYSAIDQLVSIHHVVPDLPSAAI 335
Query: 321 DVLKAGMDVNCGSFLQKHTKA-AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
++AG+D + + A +V+ K+ E IDRA+ + ++ + GLF P+
Sbjct: 336 RAIQAGVDADLPDGESYASLADSVRAGKIKEEVIDRAVRRILELKFQAGLFE-----HPY 390
Query: 380 GKI--GADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTL 437
+ + + +AL+AAQ +VLLKN G+LPL +K +LA+IGPNA AK
Sbjct: 391 ADADKAEALTANGEARAVALKAAQKSVVLLKND-GVLPLDMAKVKTLAVIGPNA--AKAH 447
Query: 438 LGNYAGPSCRSITPLQALQNYVENTV---YYPGC----------DTVACSSAS-----ID 479
LG Y+G ++++ L ++ V V Y G DTV + + I
Sbjct: 448 LGGYSGEPKQTVSILDGIKAKVGARVKVTYAEGVRITKDDDWYGDTVELADPAENARLIQ 507
Query: 480 KAVDIAKGADHVVLMMGLDQTQEKEEL------DRVDLVLPGRQQELITRVAEAAKKPVI 533
+AV +AK ADH+VL++G ++ +E DR L L G+Q +L + A KPV+
Sbjct: 508 QAVAVAKTADHIVLVIGDNEQTSREGWANNHLGDRDSLDLVGQQNDL-AKALFALGKPVV 566
Query: 534 LVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDY 593
+VL G P+ + N +++ Y G+ G A+A+V+FGD NPGG+LP+T +
Sbjct: 567 VVLQNGRPLSVVDVAARAN--ALVEGWYLGQEGGTAMADVLFGDVNPGGKLPVT-VARSV 623
Query: 594 IKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQS 653
++PM K +P A G Y F +FPFG GLSY+ + V +L
Sbjct: 624 GQLPMFYNK-KPSARRG-----YLFDTTDPLFPFGYGLSYTTFD-----VGSPRL----- 667
Query: 654 SSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNG 713
S P + + T V + V+N G+ AG V L++
Sbjct: 668 ---------------STPTIAKDGAIT----VAVDVRNTGKRAGDEVVQLYLHQQVASVT 708
Query: 714 RPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPIS 773
RP+K+L GFQ + L E + F + ++L+ +D V+E G ++VGD +
Sbjct: 709 RPVKELKGFQRITLAPGESRTVTFTVD-GKALALWNQDMKRVVEPGAFDIMVGDNSVDLK 767
Query: 774 IFV 776
V
Sbjct: 768 TAV 770
>gi|317476310|ref|ZP_07935559.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
gi|316907336|gb|EFV29041.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
Length = 862
Score = 259 bits (661), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 167/441 (37%), Positives = 237/441 (53%), Gaps = 36/441 (8%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
+ T+L +RA L+ LTL+EK++ +++++ A+ RLGI Y WW+EALHGVA G
Sbjct: 22 YKTTSLAPEKRAELLLKELTLEEKVALMMDNSQAVERLGIKPYNWWNEALHGVARAGL-- 79
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI--------GMTF 156
AT FPQ I AASF Y + A+ EARA NA A G+T
Sbjct: 80 --------ATVFPQPIGMAASFSPESVYEVFSAVSDEARA-KNANYASQGSYARYQGLTM 130
Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKH 216
W P +NI+RDPRWGRG ET GEDP +T + V V+G+QG N GK + AC KH
Sbjct: 131 WTPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGT--NDGKYD---KLHACAKH 185
Query: 217 FTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSC 275
F + W R+ F+ ++ +DL +TY PPFE+ VK+ + +MCAYN G P C
Sbjct: 186 FAVHSGPEW---NRHSFNVEDLSARDLYETYLPPFEALVKEAKVKEVMCAYNSFEGEPCC 242
Query: 276 ADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDVNCGS 333
LL + R WGF G I SDC A++ Y+ G+ P E A + G D+ CGS
Sbjct: 243 GSNRLLMQILRNDWGFDGIILSDCGAIADFYNEHGHKAYPDAESASAAAVLNGTDLECGS 302
Query: 334 FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQ 393
+ K AV+ ++ E +ID+A+ L R LG + +P + KI VV S H
Sbjct: 303 SYKALVK-AVRAGQINEKDIDKAVMRLLEARFALGEMD-DPDNVSWTKIPFSVVASAEHD 360
Query: 394 VLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQ 453
LAL A+ + LL+N+ +LPL K +++A++GPNAN + GNY G ++T L
Sbjct: 361 SLALDMARKSMTLLQNTGNILPL-KRGGLTVAVMGPNANDSVMQWGNYNGMPPHTVTILG 419
Query: 454 ALQNYV---ENTVYYPGCDTV 471
++ + + +Y GC V
Sbjct: 420 GIRKALGADDKLIYEQGCSWV 440
Score = 118 bits (295), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 94/300 (31%), Positives = 137/300 (45%), Gaps = 53/300 (17%)
Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
I+ +V K AD VV G+ E EE+ DR D+ LP Q+ELI+ + A
Sbjct: 589 INASVARVKDADIVVFASGVSPVLEGEEMGVNLPGFKGGDRTDIELPAIQRELISALHRA 648
Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMT 587
KK +ILV G P I +N +IL A YPG+ G A+AEV+FGD+NPGGRLP+T
Sbjct: 649 GKK-IILVNCSGSP--IALEPETKNCEAILQAWYPGQQGGTAVAEVLFGDYNPGGRLPVT 705
Query: 588 WYPQDYIKVPMTDMKMRPQATSGN-PGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQN 646
+Y +M P N GRTYR+ + +FPFG GLSY+ + Y +S
Sbjct: 706 FY---------RNMSQLPDFEDYNMTGRTYRYMTQQPLFPFGYGLSYTTFDY--GKLSLG 754
Query: 647 KLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVK 706
K + Q K+ + V N G+ G+ V +++K
Sbjct: 755 KEQIKQGEPLKLA---------------------------VSVTNSGQRDGEEVVQVYLK 787
Query: 707 PARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
G P K L F+ V + A + ++ F+L E + M + G + ++VG
Sbjct: 788 KQDDAEG-PGKTLRAFKRVYIPAGKSVDVEFDLKDKELEWWDAQSNTMRVCAGKYDIMVG 846
>gi|146299327|ref|YP_001193918.1| glycoside hydrolase family 3 protein [Flavobacterium johnsoniae
UW101]
gi|146153745|gb|ABQ04599.1| Candidate beta-glucosidase; Glycoside hydrolase family 3
[Flavobacterium johnsoniae UW101]
Length = 743
Score = 259 bits (661), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 229/769 (29%), Positives = 364/769 (47%), Gaps = 103/769 (13%)
Query: 52 ISQRARDLVSRLTLDEKISQLV----NSAPAIPRLGIPAYEWWSEALHGVAG-----VGK 102
I Q+ DL+ ++T++EKI QL ++ P P + SE G+ G +G
Sbjct: 23 IDQKVNDLLKKMTIEEKIGQLNQYTGDNQATGPITINPNKQ--SEIKAGLIGSMLNIIGT 80
Query: 103 G------------------IFFNGTIRG-ATSFPQVILTAASFDSYLWYRIGQAIGLEAR 143
+F I G T+FP + AAS+D QAI L AR
Sbjct: 81 KYTRQYQELAMQSRLKIPLLFGQDVIHGYKTTFPLPLAEAASWDL-------QAIELAAR 133
Query: 144 ALYNAGQAIGMTF-WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGG 202
A G+ + +AP ++I RDPRWGR E GED + K A + V+G QG+ G
Sbjct: 134 VAATEASASGIHWTFAPMVDISRDPRWGRVMEGAGEDTYLGSKIAYARVKGFQGNKL--G 191
Query: 203 KLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQD--LADTYQPPFESCVKQGRAS 260
L + AC KHF AY G +++ V M + L +TY PPF++ + G A+
Sbjct: 192 DLNSVM---ACVKHFAAYG----AGVGGRDYNS-VDMSERMLWETYLPPFKAALDAGAAT 243
Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
M ++N +NGIP+ + +L + +W F G++ SD ++ + A GY+K+ ++A
Sbjct: 244 -FMNSFNDINGIPATGNAHLQRDILKGKWNFQGFVVSDWGSIGEMV-AHGYSKNLKEAAY 301
Query: 321 DVLKAGMDVNCGSFLQKHTKAA-VKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
+ AG D++ S ++ A VK+ ++ ID A+ + + LGLF+ +P
Sbjct: 302 SAITAGSDMDMESNAYRYNLAQLVKEGRVSVDLIDDAVKRILRKKFELGLFD-DPYRYSD 360
Query: 380 GKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLG 439
K + +P H+ AL AQ IVLLKN + LP+ KS ++A IGP K +G
Sbjct: 361 EKRAEKALNNPEHRKAALDVAQKSIVLLKNENQTLPISKSVK-TIAFIGPMVKEYKENMG 419
Query: 440 NYAG--PSCRS----ITPLQALQNYV-ENT--VYYPGCDTVACSSASIDKAVDIAKGADH 490
++ P ++ LQN V +NT +Y GC+ + +AV+ AK AD
Sbjct: 420 FWSVELPEVDYNKWIVSQWDGLQNKVGKNTKLLYAKGCEIEGTNKDGFAEAVETAKQADV 479
Query: 491 VVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYD 550
V+L +G + E R D+ LPG Q++L+ + +A KPV++++ G P + F
Sbjct: 480 VILSIGERRDMSGEAKSRSDIHLPGVQEDLV-KAIQATGKPVVVLINAGRP--LVFNWTA 536
Query: 551 RNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKM---RPQA 607
N+ ++++ + G A+A V+FGD+NP G+LPMT +P++ ++P+ RP
Sbjct: 537 DNVPAVVYTWWLGTEAGNAIANVLFGDYNPSGKLPMT-FPREVGQIPIYYNHFSTGRPAK 595
Query: 608 TSGNPGRTYRFYEGKEV--FPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVV 665
T + + K FPFG GLSY+++SY S KL SSTK+ N+ +
Sbjct: 596 TENETNYVSAYIDLKNSPKFPFGYGLSYTQFSY-----SDLKL-----SSTKIKSNETIK 645
Query: 666 HYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSV 725
V+ + N G++AG+ L++K RP+ +L F+ V
Sbjct: 646 -------------------VSFKLSNVGKVAGEEVAQLYLKDKFGSVVRPVLELRDFEKV 686
Query: 726 ILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
LNA E I F + E LS + E G L++G I +
Sbjct: 687 KLNAGESKTIEFTIDK-EKLSFYNDKLEWTTEPGDFELMIGSSSADIKL 734
>gi|402826673|ref|ZP_10875843.1| glycoside hydrolase family protein [Sphingomonas sp. LH128]
gi|402259784|gb|EJU09977.1| glycoside hydrolase family protein [Sphingomonas sp. LH128]
Length = 737
Score = 259 bits (661), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 210/658 (31%), Positives = 313/658 (47%), Gaps = 89/658 (13%)
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGL-EARALYNAGQAIGMTFWAPNINIFRDPRWGRG 172
T P I AAS+D R GQA+ EARA+ I TF AP ++I DPRWGR
Sbjct: 97 TIMPVPIAMAASWDPATAER-GQAVAASEARAV-----GINWTF-APMVDITMDPRWGRM 149
Query: 173 QETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYK 232
E GEDP++ A + VRG F GG + + KHF Y ++ G +
Sbjct: 150 VEGAGEDPVLGAAMAAAQVRG-----FQGGAIGTPGHVLSGPKHFVGYGA-SFGGRDYDE 203
Query: 233 FDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFH 292
D ++ L +TY PPF++ + G A IM AY +NG+P+ A+ LL+ R++WGF
Sbjct: 204 VD--LSESQLRNTYLPPFKAALDAG-AGNIMSAYMGLNGVPAAANTWLLTDVLRKEWGFK 260
Query: 293 GYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDV------NCGSFLQKHTKAAVKQK 346
G++ SD + V + + +G A SP+ A V L+AGMD+ N L AVK
Sbjct: 261 GFVVSDANGVDSL-EKQGMAGSPKQAAVRALEAGMDLAMTVPSNPSPMLA--LVDAVKGG 317
Query: 347 KLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVL 406
++ ES ++ + L + RLGLF+ P + P K PAH+ +A AA+ VL
Sbjct: 318 EVAESALEGPVLRLLEAKYRLGLFD-KPYVDP--KAVPRTFDDPAHREVARIAAERSAVL 374
Query: 407 LKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR--SITPLQALQ-------- 456
L N+ GLLPL ++K S+A+IGP ++ +LG + S + ++ L L+
Sbjct: 375 LANAGGLLPLDRAKLKSVAVIGPLGDAGHDMLGPWVFSSNKPEGVSVLAGLRAKLGAGVK 434
Query: 457 -NYVENTVY-------------YPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQE 502
Y T + PG A + KA+ +A +D V+++G Q
Sbjct: 435 VEYATGTAWPTRKNPSFFDAMNKPGEHPAIDEKAELAKAITLANQSDVAVMVLGEAQNMA 494
Query: 503 KEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYP 562
E R DL LPGRQQEL+ V A KPV++VL+ G P+ + AK G++L A YP
Sbjct: 495 GEFASRSDLKLPGRQQELLDAVI-ATGKPVVVVLVNGRPLSLGDAK----PGAVLEAWYP 549
Query: 563 GEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGK 622
G G A+A ++ GD NPGG+LP +W + + P T + P G+ + Y +
Sbjct: 550 GSEGGNAVANLLLGDVNPGGKLPFSWI-RSAAQAPYTYAYL-PSHQPGSADKRYWNEDNS 607
Query: 623 EVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRK 682
+PFG GLSY+ +SY AV + + L + +
Sbjct: 608 PTWPFGHGLSYTTFSYGKLAVDRASVKLGEPVT--------------------------- 640
Query: 683 FLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELS 740
V+ + N G+ AG L++ + RP++QL F V L E + F L+
Sbjct: 641 --VSFDLTNTGKRAGDEVAQLYIHQRVGTSSRPVRQLKKFARVALAPGETKHMQFTLT 696
>gi|393782348|ref|ZP_10370533.1| hypothetical protein HMPREF1071_01401 [Bacteroides salyersiae
CL02T12C01]
gi|392673619|gb|EIY67078.1| hypothetical protein HMPREF1071_01401 [Bacteroides salyersiae
CL02T12C01]
Length = 852
Score = 259 bits (661), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 160/441 (36%), Positives = 229/441 (51%), Gaps = 38/441 (8%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
F P +R DL+SRLT++EKIS LVN A IPRL I Y +EALHG+ G+
Sbjct: 30 FRDMNAPQHERLLDLLSRLTIEEKISLLVNDAREIPRLNIDKYYHGNEALHGIVRPGE-- 87
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN---------AGQAIGMT 155
T FPQ I AA+++ L + + AI EAR + AG + +T
Sbjct: 88 --------FTVFPQAIGLAATWNPGLIFEVSSAISDEARGRWKELDYGKKQIAGASDLLT 139
Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCK 215
FW+P +N+ RDPRWGR ET GEDP +TG +V+G+QGD L+ + K
Sbjct: 140 FWSPTVNMARDPRWGRTPETYGEDPFLTGVIGCEFVKGLQGDH------PRYLKTVSTPK 193
Query: 216 HFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSC 275
HF A N + R +AR++ +DL + Y P FE C+ +A IM AYN VNG+P
Sbjct: 194 HFAA----NNEEHNRSSCNARMSERDLREFYLPSFERCIVDAKAQSIMMAYNAVNGVPCT 249
Query: 276 ADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFL 335
+ L+ R WGF+GYI SDC A + Y + + A +KAG+D+ CG +
Sbjct: 250 VNTYLIKNVLRGDWGFNGYIVSDCSAPEWMVTKHKYVRDLDAAATLAIKAGLDLECGDRV 309
Query: 336 QKH-TKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
A + + +++ID A + + RM LGLF+ +P+ P+ +I V+ HQ
Sbjct: 310 YTAPLLKAYNESMVSKADIDSAAYRVLRGRMLLGLFD-DPSQNPYNQIEPSVIGCKKHQE 368
Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQ 453
LAL+ A+ +VLLKN LPL K S+A++G NA + G+Y+G P ++ L
Sbjct: 369 LALETARQSMVLLKNQKNFLPLNLKKVKSIAVVGINAGHCE--FGDYSGIPKNAPVSVLD 426
Query: 454 ALQNYVE----NTVYYPGCDT 470
++ Y E VY P T
Sbjct: 427 GIRKYAEKANVEVVYAPWVST 447
Score = 125 bits (315), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 88/290 (30%), Positives = 137/290 (47%), Gaps = 44/290 (15%)
Query: 485 AKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDI 544
AK D V ++G++++ E+E DR L LP QQE I + + V+ VL+ G + I
Sbjct: 601 AKECDVTVAVLGINKSIEREGQDRYSLELPTDQQEFIRELYKVNPNTVV-VLVAGSSLAI 659
Query: 545 TFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMR 604
+ D N+ +IL A YPGE G A+AEV+FGD+NPGGRLP+T+Y ++P D
Sbjct: 660 NWI--DENVPAILNAWYPGEQGGTAIAEVLFGDYNPGGRLPLTYY-NSLDELPSFD---- 712
Query: 605 PQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDV 664
RTY++++GK ++ FG GLSY+K++YK K VS + N +
Sbjct: 713 ---NYSVQNRTYQYFKGKPLYEFGYGLSYTKFNYKKKNVS--------------IANDTI 755
Query: 665 VHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQS 724
+T V N G+ G ++V+ G P+KQL GF
Sbjct: 756 D-------------------ITFKVSNAGKYDGDEVAQVYVQYPETGTYMPLKQLRGFSR 796
Query: 725 VILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
V + + A++ + E + V EG + ++G I +
Sbjct: 797 VHIKKGKSADVTISVPKKELRYWDEKTRQFVTPEGKYVFLIGSSSEDIKL 846
>gi|404484440|ref|ZP_11019644.1| hypothetical protein HMPREF9448_00046 [Barnesiella intestinihominis
YIT 11860]
gi|404339445|gb|EJZ65876.1| hypothetical protein HMPREF9448_00046 [Barnesiella intestinihominis
YIT 11860]
Length = 742
Score = 259 bits (661), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 224/750 (29%), Positives = 359/750 (47%), Gaps = 115/750 (15%)
Query: 45 FCKTTLPISQRAR--DLVSRLTLDEK-ISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVG 101
F K+ I + R +L S L +D K I+++ +A RLGIP +G
Sbjct: 51 FDKSLDSIKAQVRNGELGSMLNIDPKLINEIQKTAVEESRLGIPLI------------IG 98
Query: 102 KGIFFNGTIRG-ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPN 160
+ I + G T P + AASFD L + EAR Q I TF AP
Sbjct: 99 RDI-----VHGYKTVLPIPLGMAASFDPQLVEKGTHMAATEARE-----QGITWTF-APM 147
Query: 161 INIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAY 220
++I RD RWGR E+ GEDP +T + V+ VRG QGD L +AC KHF Y
Sbjct: 148 LDISRDARWGRIAESLGEDPYLTSELGVAMVRGFQGD-----NLSDNDAIAACVKHFVGY 202
Query: 221 DLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNL 280
+G Y + + L + Y PPF+ V+ G A+ +M ++N +G+P+ + L
Sbjct: 203 GAS--EGGQDYN-STNIPERLLRNVYLPPFQKTVEAGAAT-LMTSFNDNDGVPASGNDFL 258
Query: 281 LSKTARRQWGFHGYITSD-CDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGS-FLQKH 338
L R +WGF G++ SD C V +I G+A +D AG+D+ S +
Sbjct: 259 LRTVLRDEWGFDGFVVSDWCSMVEMI--NHGFAADRKDVARLSANAGLDMEMVSQTYVDY 316
Query: 339 TKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQ 398
+ + K+ ID A+ N+ ++ RLGLF NP + ++ + S H A Q
Sbjct: 317 LPELIAENKVSIDVIDNAVRNILRIKYRLGLFE-NPYVD---EVETSTIYSDEHLQTARQ 372
Query: 399 AAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA--GPSCRSITPLQALQ 456
AA + +LLKN +G+LPL ++K+V A+IGP A++ LG ++ G ++TPL+ALQ
Sbjct: 373 AATESAILLKN-NGVLPLKENKTV--AIIGPMAHAPYDQLGTWSFDGDKNHTVTPLKALQ 429
Query: 457 N--YVENTVYYPGC--DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLV 512
+ Y YY + S+ + ++A IA+ AD VV+ +G + E D+
Sbjct: 430 SDEYKHIKYYYEAGLGHSRDESTRNFERAKSIARQADVVVVFVGEEAILSGEAHSLSDIN 489
Query: 513 LPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNI---GSILWAGYPGEAGAVA 569
L G+Q +L+ + ++ KPV++V++ G P+ I +R++ ++L+ +PG G +A
Sbjct: 490 LIGKQSDLL-KAIKSTGKPVVMVVMAGRPLTI-----ERDLPYADAVLYNFHPGTMGGLA 543
Query: 570 LAEVIFGDHNPGGRLPMTWY------------------PQDYIKVPMTDMKMR-PQATSG 610
+ ++++G NP G+LP+T+ QD+I P+ D+ + PQ + G
Sbjct: 544 IMDLLYGKANPSGKLPVTFVREVGQIPMYYNHNNTGRPAQDWI-TPINDIPLEAPQTSLG 602
Query: 611 NPGRTYRFYEGKE-VFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKS 669
N ++ GK+ +F FG GLSYS + Y +S N++ N +
Sbjct: 603 NT--SFYLDSGKDPLFAFGYGLSYSTFEYSDLNLSSNEVNANDT---------------- 644
Query: 670 VPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNA 729
VT +KN ++ G V L+V+ RP+K+L GFQ + L A
Sbjct: 645 -------------LTVTATIKNTSDIDGTEVVQLYVRDLVGSITRPVKELKGFQRLALKA 691
Query: 730 KEKAEIVFELSPCESLSRAREDGLMVIEEG 759
E + F+L P L+ +D + +E G
Sbjct: 692 GEAQTVSFKL-PISELAFYGKDLIKKVEAG 720
>gi|399028199|ref|ZP_10729502.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
gi|398074276|gb|EJL65427.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
Length = 764
Score = 258 bits (660), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 205/637 (32%), Positives = 309/637 (48%), Gaps = 65/637 (10%)
Query: 151 AIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQA 210
I TF PN+++ D RWGR E GEDP + K A + V+G QG+T L
Sbjct: 158 GINWTF-GPNVDVANDARWGRVMEGAGEDPYLGSKIATARVKGFQGETI--ADLTKINTI 214
Query: 211 SACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVN 270
+AC KHF AY + Y ++ L ++ PPF++ V G + M ++N +N
Sbjct: 215 AACAKHFAAYGY--VEAGLEYNI-VDISNSKLYNSVLPPFKAAVDAGIRT-FMNSFNTLN 270
Query: 271 GIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVN 330
G+P+ + L + +W F G++ SD ++ + A GYAK EDA + + AG D++
Sbjct: 271 GVPATGNAFLQRDILKGKWKFDGFVISDYASIREMI-AHGYAKDEEDAALKAVVAGSDMD 329
Query: 331 CGSFLQ-KHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
S+L VK+ K+ E+ +D A+ + V+ LGLF+ +P K + + S
Sbjct: 330 MESYLYVAKLVGLVKEGKVKEAMVDDAVRRILRVKFELGLFD-DPYRYCDEKREKETIGS 388
Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNY--AGPSCR 447
A+ L A+ IVLLKN LLPL KS +ALIG AN + LG++ A
Sbjct: 389 KANNDGVLDMAKKSIVLLKNDKNLLPLKKSGQ-KIALIGALANDKTSPLGSWRIAADDNT 447
Query: 448 SITPLQALQNYVENTVYYP-GCDTVACSSASIDKAV-------------DIAKGADHVVL 493
+++ L+ +Q Y +N + + G D + +D+ V AK AD VV+
Sbjct: 448 AVSVLEGMQQYKDNKLTFEKGADLTVGKVSFLDEVVFNTTDKSGFEAAKTAAKNADVVVM 507
Query: 494 MMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNI 553
++G Q E R DL LPG QQEL+ + + V+LVL G P+ I +A N+
Sbjct: 508 VLGEHGFQSGEGRSRTDLNLPGLQQELLEEIYKVNPN-VVLVLNNGRPLSIPWAA--ENV 564
Query: 554 GSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKM---RPQATSG 610
+I+ A + G A+A+V++GD+NP G+LPM+ +P++ +VP+ K RP T
Sbjct: 565 PAIVEAWHLGTQNGNAVAQVLYGDYNPSGKLPMS-FPRNVGQVPIYYNKYNTGRPVNTDK 623
Query: 611 NPGRT-YRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKS 669
N T Y E FPFG GLSY+K+ YK L LN+
Sbjct: 624 NVFWTHYTDVEKTPQFPFGFGLSYTKFDYK-------NLKLNK----------------- 659
Query: 670 VPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNA 729
T+F + V++ V N G GK V L++ RPIK+L GF+ V L
Sbjct: 660 -----TDFAKGETVKVSVEVTNSGNYDGKEVVQLYIHDEFASIIRPIKELKGFELVNLKK 714
Query: 730 KEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
E I F L+ E L +G ++E GT ++VG
Sbjct: 715 GETKTISFTLTEKE-LGFYDNEGNYLVEPGTFKIMVG 750
>gi|255533985|ref|YP_003094357.1| glycoside hydrolase family protein [Pedobacter heparinus DSM 2366]
gi|255346969|gb|ACU06295.1| glycoside hydrolase family 3 domain protein [Pedobacter heparinus
DSM 2366]
Length = 738
Score = 258 bits (660), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 235/771 (30%), Positives = 366/771 (47%), Gaps = 129/771 (16%)
Query: 50 LPISQRARD----LVSRLTLDEKISQLVNSAP-AIPRLGIPAYEWWSEALHGVAGVGKGI 104
+P++++ R L+S++TL+EK QL + P A RL IP + E LHGV G
Sbjct: 28 IPLTKQERQKVELLLSKMTLEEKAHQLASFYPNANKRLNIPHMQA-GECLHGVVAAG--- 83
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTF-WAPNINI 163
TSFPQ I A+S+D L R+ I EARAL G+ + P + +
Sbjct: 84 --------TTSFPQAISIASSWDPSLVERVSTVIAKEARAL-------GIHHCYTPMLGV 128
Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
RD RWGR +E GED + K V+++ G+QG G K A KHF A D +
Sbjct: 129 LRDARWGRFEEGYGEDAYLVSKIGVAFINGLQG---RGKNRFDKDHVVATAKHFVA-DSE 184
Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
G + ++++ L + + PPF + V++ + +M A++ +NG+P + L+
Sbjct: 185 PLLGANGAAVE--ISLRSLHEVHLPPFRAAVEEAQVGSVMPAHHTLNGVPCHINTYTLND 242
Query: 284 TARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTK--- 340
R+++GF G + SD + + + + +S E+ + L+AG+ +F Q
Sbjct: 243 VFRKEYGFDGLVVSDNNDLRWVQERLFATESQEETIRKALEAGVHTEL-AFKQTWADKRM 301
Query: 341 ------AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGK------------- 381
AAVK K+P +D A+ + ++ L L + P GK
Sbjct: 302 YGPPLVAAVKNGKVPVKLLDDAVRKVLEFKIALHL---DEEENPLGKEMTELQKGTKDAD 358
Query: 382 IGADVVCS-------------------PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSV 422
+ ADV S P H LAL+AA+ ++LLKN++ LLP KS+
Sbjct: 359 VNADVFFSQIDGSLSSPRSNYKTVLNNPVHDALALEAARKSLILLKNNN-LLPFKKSQFK 417
Query: 423 SLALIGPNANSAKTLLGNYAGPSCRS-ITPLQALQNYV---ENTVYYPGCDTVACSSASI 478
+A+IGPNA++ + LG Y+ + IT Q ++ V +Y G D +
Sbjct: 418 KIAVIGPNADTIR--LGTYSTQQPKHFITVKQGIETAVGKNAQVLYAKGTDIQHPKDTQL 475
Query: 479 DKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLC 538
+AV IAK AD +L++G D E +DR D+ LPG Q +L+ + A KPV+LVLL
Sbjct: 476 AEAVAIAKEADVCILVLGDDDKTVMENVDRDDITLPGDQDKLMQAIV-ATGKPVVLVLLH 534
Query: 539 GGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPM 598
G P I +AK ++ +IL + G+ A+AE IFGD NP G+L +T YP++ +VP
Sbjct: 535 GRPAAIQWAK--DHVPAILDGWFLGQETGTAIAEAIFGDLNPSGKLTVT-YPRNVGQVPA 591
Query: 599 TDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKM 658
+ P G P + EG +PFG G+SY+++ Y +S+ +S K
Sbjct: 592 FYNTLIP----GRPRMMWGTTEGA-TYPFGYGISYTQFKYGVPKLSK--------ASMKA 638
Query: 659 VENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQ 718
E T F E I V N G++AG V L+++ RPIK+
Sbjct: 639 SE--------------TVFAE-------IEVTNTGKVAGDEIVQLYLRDDISSLARPIKE 677
Query: 719 LVGFQSVILNAKEKAEIVFELSPCESLSRARE---DGLMVIEEGTHFLVVG 766
L GF+ + L E +I +S SR+ E DG + E G+ +++G
Sbjct: 678 LKGFKRISLRPGETQKISLPIS-----SRSLEFWKDGKWITEPGSFTVMMG 723
>gi|345881765|ref|ZP_08833275.1| hypothetical protein HMPREF9431_01939 [Prevotella oulorum F0390]
gi|343918424|gb|EGV29187.1| hypothetical protein HMPREF9431_01939 [Prevotella oulorum F0390]
Length = 1552
Score = 258 bits (660), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 222/806 (27%), Positives = 346/806 (42%), Gaps = 165/806 (20%)
Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQLVN---------------------------- 74
P+ LP + R DL+ R+TLDEK++Q+ +
Sbjct: 719 LPYQNAALPSAIRVHDLLQRMTLDEKLAQMRHIHFKHYNTDGHVDLTKLRNNYTHSMSFG 778
Query: 75 ---------------------SAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
+A R GIP + G+ G+ + G
Sbjct: 779 CFEAFPYSSTQYRQAVSTIQQNAADSTRFGIPVIP----VIEGIHGIVQD--------GC 826
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQ 173
T FPQ I A+F+ L +R+ Q IG E RA+ A Q + AP+++I R+ RWGR +
Sbjct: 827 TIFPQAIAQGATFNPQLVFRMAQHIGTEMRAI-GARQVL-----APDLDIAREQRWGRVE 880
Query: 174 ETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAY-------DLDNWK 226
ET GEDP + + +YV+G+Q GG KHF A+ +L + K
Sbjct: 881 ETFGEDPYLISRMGYNYVKGIQS---RGG--------IPTLKHFVAHGTPQGGLNLASVK 929
Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
G R +L D Y PFE ++ +A +M Y+ + + L R
Sbjct: 930 GGQR----------ELFDVYVKPFEYVIRHTKAGSVMNCYSAYDNEAITSSPFFLRTLLR 979
Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQK 346
F GYI SD ++ ++ A S +A + AG+D+ GS + + Q
Sbjct: 980 DSLHFKGYIYSDWGSIPMLRYFHHTADSETEAAQQAINAGVDLEAGSDYYRTAPTLIAQG 1039
Query: 347 KLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADV------VCSPAHQVLALQAA 400
L ++ ID A ++ + GLF+ ++ +D + +P +A Q A
Sbjct: 1040 LLDKARIDSAAAHVLYTKFEAGLFD---------ELASDTLHWRQQIHTPEAVAVAKQLA 1090
Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA--GPSCRSITPLQALQNY 458
+ +VLL+N + LPL ++ S+A++GPNA A+ G+Y+ + ITPL +Q
Sbjct: 1091 DESLVLLENRNHFLPLDLNRLHSIAVVGPNA--AQVQFGDYSWTADNRHGITPLAGIQQV 1148
Query: 459 V---ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQ---------TQEKEEL 506
Y GCD + ++ SID+AV +AK +D V+++G + E
Sbjct: 1149 AGMRTKVRYVKGCDYYSQNTDSIDEAVALAKQSDVTVVVVGTQSMLLARPSQPSTSGEGY 1208
Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
D DL+LPG QQ+LI R+A A KP I+V++ G P+ +T A + ++L Y GE
Sbjct: 1209 DLSDLILPGVQQQLIERIA-ATGKPFIVVMVTGRPL-LTEA-FKNKADALLVQWYGGEQA 1265
Query: 567 AVALAEVIFGDHNPGGRLPMTW------YPQDYIKVPMTDMKMRPQATSGNPGRTYRFYE 620
++LA+ +FG NP GRLP+++ P Y +P + T PGR Y F +
Sbjct: 1266 GLSLAQALFGQLNPSGRLPISFPKATGQLPVYYNHLPTDKGYYNKKGTPDKPGRDYVFAD 1325
Query: 621 GKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCET 680
+PFG GLSY+ + Y A+S+ K D +
Sbjct: 1326 PYPAYPFGYGLSYTTFKYSQLALSK-----------KQTNENDTI--------------- 1359
Query: 681 RKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELS 740
VT V+N G+ AGK L+++ + PIKQL GF+ L E I +L
Sbjct: 1360 ---AVTFRVQNTGKRAGKEVAQLYIRDMKSSVATPIKQLFGFEKCALQPGETKTITQQL- 1415
Query: 741 PCESLSRAREDGLMVIEEGTHFLVVG 766
P L V+E G + +G
Sbjct: 1416 PIADLYLHNAVMQRVVEPGDFEVQIG 1441
>gi|119476117|ref|ZP_01616469.1| periplasmic beta-glucosidase [marine gamma proteobacterium
HTCC2143]
gi|119450744|gb|EAW31978.1| periplasmic beta-glucosidase [marine gamma proteobacterium
HTCC2143]
Length = 748
Score = 258 bits (660), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 197/652 (30%), Positives = 324/652 (49%), Gaps = 79/652 (12%)
Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQ 173
T FP + AAS++ L + + LEA A + TF AP I+I RDPRWGR
Sbjct: 101 TIFPIPLGQAASWNPELIEQGARVAALEA-----ATVGVNWTF-APMIDITRDPRWGRIA 154
Query: 174 ETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKF 233
E+ GEDP + G+ + VRG F G L +AC KHF Y +G Y
Sbjct: 155 ESLGEDPYLCGELGAAMVRG-----FQGKDLSAIGSIAACAKHFAGYGAA--EGGVDYN- 206
Query: 234 DARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHG 293
A + +L + Y PPF++ + G AS M A+N +NG+P+ + LL + R +W + G
Sbjct: 207 TAIIAENELRNVYLPPFKAALDSGVAS-FMTAFNDLNGVPASGNEFLLKQILREEWCYQG 265
Query: 294 YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVN-CGSFLQKHTKAAVKQKKLPESE 352
+ SD +++ + + G+ + ++A + AG+D+ + +H ++ + + ++ ++
Sbjct: 266 MVVSDWESIVQLTE-HGFTANDKEAAFEAANAGIDMEMVSNTYSQHLESLIIEGRISLAQ 324
Query: 353 IDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHG 412
+D + N+ ++ RLGLF NP QP K+ A + + H+ A + A + +VLLKNSH
Sbjct: 325 VDEMVKNILRLKFRLGLFE-NPYPQP-DKLPA--LVNHDHRQAAKKLALESVVLLKNSHQ 380
Query: 413 LLPLPKSKSVSLALIGPNANSAKTLLGN--YAGPSCRSITPLQALQNYVENTVYYPGCDT 470
LPL S S+ALIGP A+ A LG + G + S T LQA+ + +++
Sbjct: 381 SLPLRLSALSSIALIGPLADDAYEQLGTWIFDGDADDSETVLQAINAFAGDSLTVNVDRA 440
Query: 471 VACSSAS----IDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAE 526
+ + ++ ID+ + A+ +D +VL +G + E R D+ LPG Q++LI +A+
Sbjct: 441 LETTRSNTFIDIDRTMAAAQSSDAIVLCLGEESILSGEAHSRADISLPGAQEQLIHLLAK 500
Query: 527 AAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPM 586
A KP+IL+++ G P +T ++ +IL+A +PG AL +++FG+ +P G+LP+
Sbjct: 501 TA-KPMILIVMAGRP--LTLEPIIDHVDAILYAWHPGTMAGTALTDLLFGEVSPSGKLPI 557
Query: 587 TWYPQDYIKVP-------------------MTDMKMRPQATSGNPGRTYRFYEGKEVFPF 627
T +P+ +VP M D+ R TS + +FPF
Sbjct: 558 T-FPRMVGQVPIYYGKKNTGKPPSAESVVHMNDIAPRAAQTSLGMSAFHLDAGFTPLFPF 616
Query: 628 GCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTI 687
G GLSY+ ++Y+ L+ SSST ++ VT+
Sbjct: 617 GFGLSYTSFTYE---------NLHLSSSTMNIDG--------------------VITVTV 647
Query: 688 GVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFEL 739
V N GE G+ V L+ + RP+K+L FQ V L+A E+ ++ F L
Sbjct: 648 DVINCGEREGQEVVQLYTRDLAANVTRPVKELKQFQKVHLSAGERQQVKFLL 699
>gi|423683845|ref|ZP_17658684.1| glycoside hydrolase family 3 [Bacillus licheniformis WX-02]
gi|383440619|gb|EID48394.1| glycoside hydrolase family 3 [Bacillus licheniformis WX-02]
Length = 981
Score = 258 bits (660), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 173/508 (34%), Positives = 257/508 (50%), Gaps = 36/508 (7%)
Query: 12 LLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQ 71
L +C D QP + S +PF +LP+ +R DL+SRLTL+EK+S
Sbjct: 10 LGAMCTILSAAGFDPPQP--AAAKSQADEYEYPFQDPSLPVQKRVSDLLSRLTLEEKVSL 67
Query: 72 LVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLW 131
+ PA+PRLGIPA++ +EALHGVA +G+ AT FPQ A ++D YL
Sbjct: 68 MHQYQPAVPRLGIPAFKTGTEALHGVAWLGE----------ATVFPQAFGLAHTWDRYLI 117
Query: 132 YRIGQAIGLEARALYNAGQAI-GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSY 190
+IG A+G E R ++ A G+ WAP +++ RDPRWGR +E EDP +TG+ A +Y
Sbjct: 118 KQIGSAVGDEVRGFHHLDPAANGVNVWAPVVDLLRDPRWGRNEEGYSEDPFLTGEIATAY 177
Query: 191 VRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPF 250
G++GD L+ KHF AY+ + +G + D R ++ + Y PF
Sbjct: 178 ASGLRGDH------PFYLKTVPTLKHFLAYNNETDRGFSSSSIDPR----NMHEYYLKPF 227
Query: 251 ESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQW-GFHGYITSDCDAVSIIYDAE 309
E+ + + A G+M AYN VN P+ LL T +R+W G +I SD S I +
Sbjct: 228 ETAISKKAAYGLMPAYNSVNDKPAILS-PLLDSTVKRRWAGDDFFIVSDAFDPSGIVNDH 286
Query: 310 GYAKSPEDAVVDVLKAGMD--VNCG---SFLQKHTKAAVKQKKLPESEIDRALHNLFSVR 364
Y S E A +KAG+D + G + A+KQ + E ++D+AL N FS+R
Sbjct: 287 KYYDSHEKAHAHAVKAGIDNFTDQGENPELTRNALTGALKQGLISEKDLDQALANTFSIR 346
Query: 365 MRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSL 424
R G F+ + + P+ ++ DV+ SP HQ+LA +AA+ IVLLKN LLP K+ S+
Sbjct: 347 FRTGEFDPD-ELNPYSRLTDDVINSPKHQLLAKKAAEKAIVLLKNDQDLLPFNARKNESI 405
Query: 425 ALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV--ENTVYYPGCDTVACSSASIDKAV 482
A+IGP N+ Y+G TPL + + + + + G + SA K V
Sbjct: 406 AVIGPFGNALYEDW--YSGTMPYRKTPLDEIADKIGKDRVSFAEGLEQSGFKSAPTGKFV 463
Query: 483 DIAKGADHVVLMMGLDQTQEKEELDRVD 510
+ L D+ ++ D D
Sbjct: 464 TAGRDGKQ-PLTAAADKLEKSTAFDAAD 490
Score = 98.2 bits (243), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 81/290 (27%), Positives = 134/290 (46%), Gaps = 40/290 (13%)
Query: 479 DKAVDIAKGADHVVLMMGLD-QTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLL 537
D+AV +AK AD ++ +G + +E DR D+ LP Q+ L+ V++A +LV+
Sbjct: 584 DEAVQLAKSADKAIVFVGNNPYINGRETEDRKDITLPPAQENLLKAVSKANTN-TVLVVT 642
Query: 538 CGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVP 597
P + +AK +I +I+++ + G+ ALA++++GD NPGGRL TW+ +K
Sbjct: 643 SSYPFALNWAK--AHIPAIIYSAHGGQEAGSALADILYGDVNPGGRLTQTWHTS--VK-E 697
Query: 598 MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTK 657
+ DM M G RTY+++ GK +FPFG GLSY+ + Y ++S + + + S +
Sbjct: 698 LPDM-MNYDIIKGK--RTYKYFVGKPLFPFGHGLSYTAFRYSDLSLSSDSIKKDGSVT-- 752
Query: 658 MVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGN-GRPI 716
+ + V N G G V L+ P + +P
Sbjct: 753 ---------------------------IRVNVTNTGGRKGDEVVQLYTSPLFKTRVKQPN 785
Query: 717 KQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
L FQ V L KE + F+L + +E G + + VG
Sbjct: 786 LDLKDFQRVELAPKETKTVTFKLKQTDLAFWDVTRETFAVERGPYRISVG 835
>gi|365121891|ref|ZP_09338802.1| hypothetical protein HMPREF1033_02148 [Tannerella sp.
6_1_58FAA_CT1]
gi|363644131|gb|EHL83433.1| hypothetical protein HMPREF1033_02148 [Tannerella sp.
6_1_58FAA_CT1]
Length = 855
Score = 258 bits (659), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 163/424 (38%), Positives = 237/424 (55%), Gaps = 34/424 (8%)
Query: 51 PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
PI R DL+SR+T++EK+S ++++AP IPRL I Y +EALHG+ GK
Sbjct: 38 PIHDRVMDLLSRMTVEEKVSLMIHNAPGIPRLEIDKYYHGNEALHGIVRPGK-------- 89
Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA---------GQAIGMTFWAPNI 161
T FPQ I AAS++ L Y+I AI EAR +NA G + ++FW+P +
Sbjct: 90 --FTVFPQAIGMAASWNPELIYKISTAISDEARGKWNALGLGKKQLDGSSDLLSFWSPTV 147
Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
N+ RDPRWGR ET GEDP +TG ++V+G+QG N K L+A A KHF A
Sbjct: 148 NMARDPRWGRTPETYGEDPHLTGTLGCAFVKGLQG---NHPKY---LKAVATPKHFAA-- 199
Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
N + R +A ++ +DL + Y P FE C+ +G+A IM AYN VNGIP + L+
Sbjct: 200 --NNEEHNRAHCNAVISERDLREYYLPSFEKCIVEGKAQSIMTAYNAVNGIPCTVNTYLI 257
Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTK 340
K R WGF GY+ +DC A + + Y K E A V + KAG D+ C + +
Sbjct: 258 KKVLREDWGFQGYVVTDCSAPAWMVTQHKYVKDYETAAVLMAKAGSDMECADNVYTQPLL 317
Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
A ++ +++ID ++L RM LGLF+ +P P+ KI + V HQ LAL+ A
Sbjct: 318 NAYYNYRVSDADIDSIAYHLLRGRMLLGLFD-DPEKNPYNKISPEKVGCKEHQELALETA 376
Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQALQNYV 459
+ +VLLKN + LP+ K S+A++G NA+ + G+Y+G P S+T L+ ++ V
Sbjct: 377 RQSLVLLKNENNFLPINPKKIKSIAVVGINADRCE--FGDYSGTPVNESVTVLEGIKRLV 434
Query: 460 ENTV 463
+ V
Sbjct: 435 GDQV 438
Score = 123 bits (309), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 87/290 (30%), Positives = 135/290 (46%), Gaps = 43/290 (14%)
Query: 485 AKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDI 544
AK D V ++G+D++ E+E DR L LP QQE I + + K V+ VL+ G + I
Sbjct: 603 AKECDITVAVLGIDKSIEREGQDRYTLELPADQQEFIREIYKINPKTVV-VLVAGSSIAI 661
Query: 545 TFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMR 604
+ D NI +I+ A YPGE G A+AE +FG +NPGGRLP+T+Y P D ++
Sbjct: 662 NWI--DENIPAIIDAWYPGEQGGTAVAEALFGKYNPGGRLPLTFYNSMDELPPFDDYAVK 719
Query: 605 PQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDV 664
GRTY+++ GK ++ FG GLSY+K++Y+ LN +S QD
Sbjct: 720 -------KGRTYQYFTGKPLYEFGYGLSYTKFNYR---------KLNIASK------QDT 757
Query: 665 VHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQS 724
++ + + N G+ G ++V+ G PIKQL GF+
Sbjct: 758 IN------------------IQFSISNTGKYDGDEVAQVYVQYPETGTYMPIKQLKGFKR 799
Query: 725 VILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVGDEEYPISI 774
V + + + + E + V G + VG I++
Sbjct: 800 VHIKKGQTQNVSISIPKKELRYWDEKTRKFVTPSGNYIFQVGSSSQRINL 849
>gi|295135996|ref|YP_003586672.1| beta-glucosidase [Zunongwangia profunda SM-A87]
gi|294984011|gb|ADF54476.1| putative beta-glucosidase [Zunongwangia profunda SM-A87]
Length = 796
Score = 258 bits (659), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 205/697 (29%), Positives = 311/697 (44%), Gaps = 104/697 (14%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIP EA+HG VG T FP I A++++ L ++ I
Sbjct: 126 RLGIPLL-LEEEAMHGHMAVG-----------TTVFPTAIGQASTWNPDLIKKMAHVIAK 173
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
E RA + T + P I+I R+PRW R +ET GEDP + + S V G QG +
Sbjct: 174 EIRAQGSN------TAYGPIIDIAREPRWSRVEETFGEDPYLIAEMGKSMVTGFQGS--H 225
Query: 201 GGKLKGKLQASACCKHFTAYDLDN--WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGR 258
LK +A KHF AY + G + +DL Y P + V G
Sbjct: 226 ESDLKSNEHVAATLKHFAAYGVSEGGHNGAA-----VHIGQRDLFQNYMYPVKEAVDNGV 280
Query: 259 ASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDA 318
S +M AY+ ++G+PS A +NLL+ + +WGF G++ SD ++ + + EDA
Sbjct: 281 MS-VMTAYSSIDGVPSTAHKNLLTNILKEKWGFKGFVISDLASIEGLLGDHHIVDTEEDA 339
Query: 319 VVDVLKAGMDVNC-GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQ 377
+ AG+DV+ G+ AV K+ E ID A+ + +V+ +LGLF NP
Sbjct: 340 AAMAMNAGVDVDLGGNGYDDALIDAVNAGKVAEERIDEAVRRILTVKFKLGLFE-NPYAN 398
Query: 378 PFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTL 437
K +V + H LA + A+ I +LKN +LPL K ++A+IG NA+
Sbjct: 399 E--KQAEKIVRNSEHIELAREVARQSITMLKNEDNILPLNKELQ-NIAVIGSNADMQYNQ 455
Query: 438 LGNYAGPSCRS--ITPLQALQNYV--ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVL 493
LG+Y P IT L+ +Q+ + N Y G + +I AV+ AK A+ ++
Sbjct: 456 LGDYTAPQSEENIITVLEGIQHKMPNANIEYVKGTAVRDTTQTNIPAAVEAAKNAEVAIV 515
Query: 494 MMG--------------------------LDQTQEKEELDRVDLVLPGRQQELITRVAEA 527
++G L + E DR L L G+Q EL+ V A
Sbjct: 516 VLGGSSARDFKTEYLETGAATISSKEDQVLSDMESGEGYDRSTLNLMGKQLELLQAVV-A 574
Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMT 587
P +LVL+ G P+ + + N+ IL A YPG+ G A+A+VIFGD NP GRLP++
Sbjct: 575 TGTPTVLVLIKGRPLLLNWPA--ENVPVILDAWYPGQEGGSAIADVIFGDFNPAGRLPVS 632
Query: 588 WYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYK-FKAVSQN 646
P+ ++P+ P R Y + K ++PFG GLSYS++ Y K +
Sbjct: 633 -VPKSLGQIPVYYNYWFPNR------RDYVETDAKPLYPFGYGLSYSEFKYSDLKVATSG 685
Query: 647 KLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVK 706
K K +++ + N ++ G + L+++
Sbjct: 686 K------------------------------GRNTKIEISLKISNTSKVDGDEVIQLYIR 715
Query: 707 PARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCE 743
P+KQL F+ V + A E + FEL P E
Sbjct: 716 DMVSTVLSPVKQLRAFERVSIKAGETKTVQFELLPKE 752
>gi|218130692|ref|ZP_03459496.1| hypothetical protein BACEGG_02281 [Bacteroides eggerthii DSM 20697]
gi|217987036|gb|EEC53367.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
eggerthii DSM 20697]
Length = 786
Score = 258 bits (659), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 216/793 (27%), Positives = 356/793 (44%), Gaps = 130/793 (16%)
Query: 51 PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WSEALHGVAGVGKGI 104
P+ R DL+S+++++EK Q+ + + R +P +W W + + + G+
Sbjct: 48 PLDARVHDLLSQMSIEEKTCQMATLYGSGRVLRDSLPTEQWKNEIWKDGIANIDEQANGL 107
Query: 105 ------------------------------------FFNGTIRG-----ATSFPQVILTA 123
F N IRG AT FP
Sbjct: 108 GTFGSTLSYPYVNSVENRQAIQRWFVEETRLGIPVDFTNEGIRGLCHDRATMFPAQCGQG 167
Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGEDPLV 182
A+++ L I + EA+AL G T ++P ++I +DPRWGR E GEDP +
Sbjct: 168 ATWNKELISEIARVTAEEAKAL-------GYTNIYSPILDIAQDPRWGRVVECYGEDPFL 220
Query: 183 TGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDL 242
G+ ++G+Q + A KHF Y + + D V +++
Sbjct: 221 VGELGKRMIKGLQAEGL-----------VATPKHFAVYSVPVGGRDAGTRTDPHVAPREM 269
Query: 243 ADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAV 302
Y PF + A G+M +YN +G P + L++ R +WGF GY+ SD +AV
Sbjct: 270 RTLYLEPFRKAFCEAGALGVMSSYNDYDGEPITGSYHFLTEILRHEWGFKGYVVSDSEAV 329
Query: 303 SIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKH----TKAAVKQKKLPESEIDRALH 358
+Y A D V+ AG++V L ++ + A+ + K+ ID +
Sbjct: 330 EFLYSKHKVAADAVDGAAQVINAGLNVRTNFSLPENFIYPLRHAISEGKISMQTIDSRVA 389
Query: 359 NLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPK 418
++ V+ +GLF+ NP K+ V C HQ ++++AA + IVLLKN + +LPL K
Sbjct: 390 DVLRVKFMMGLFD-NPYKGDTKKLEKVVHCK-EHQNVSMRAALESIVLLKNENNILPLSK 447
Query: 419 SKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV--YYPGCDTV----- 471
S +A+IGPNA L+ Y + T Q +++Y+ ++ Y G D +
Sbjct: 448 SIK-KVAVIGPNAAEVDNLICRYGPANAPIKTVYQGIKDYLPDSEVRYAKGADIIDEYFP 506
Query: 472 ----------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELI 521
ID+AV +AK +D ++++G ++ +EE R +L L GRQ++L+
Sbjct: 507 ESELYDVPLNKTEETMIDEAVALAKESDVAIMVLGGNEKTVREEYSRTNLDLCGRQEKLL 566
Query: 522 TRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
V A KPVILV++ G I +A +R I I+ A +PGE A+A+VIFGD+NPG
Sbjct: 567 QAVY-ATGKPVILVMIDGRAATINWA--ERYIPGIVHAWFPGEFMGDAIAKVIFGDYNPG 623
Query: 582 GRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFK 641
G+L +T +P+ ++P +P + S R ++PFG GLSY+ + Y
Sbjct: 624 GKLAVT-FPRSVGQIPFA-FPFKPGSDSKGFVRV-----TGSLYPFGYGLSYTTFDY--- 673
Query: 642 AVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPV 701
S +EN P +GT+ ++ VKN G++AG V
Sbjct: 674 -------------SALKIEN---------PVIGTQ----GSVKLSCRVKNTGKIAGDEVV 707
Query: 702 LLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTH 761
L++ +K L GF+ + L E+ + F L+P + + +D ++E GT
Sbjct: 708 QLYLHDEVSSVTTYVKVLRGFERIHLQPGEEKVVDFVLTP-QDMGLWNKDNHFIVEPGTF 766
Query: 762 FLVVGDEEYPISI 774
++VG I +
Sbjct: 767 AVMVGSSSQDIRL 779
>gi|383123909|ref|ZP_09944579.1| hypothetical protein BSIG_4072 [Bacteroides sp. 1_1_6]
gi|382983834|gb|EES66944.2| hypothetical protein BSIG_4072 [Bacteroides sp. 1_1_6]
Length = 815
Score = 258 bits (659), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 201/714 (28%), Positives = 333/714 (46%), Gaps = 103/714 (14%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIP + EA HG +G T FP I AA++ L +G I
Sbjct: 160 RLGIPLF-LAEEAPHGHMAIG-----------TTVFPTGIGMAATWSPVLIEEVGNVIAK 207
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
E R+ G I + P +++ RDPRW R +ET GEDP+++G+ + V G+
Sbjct: 208 EIRS---QGAHIS---YGPVLDLSRDPRWSRVEETFGEDPVLSGRLGAAMVIGL-----G 256
Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
G L + A KHF AY + Y A V +DL + + PPF+ + G A
Sbjct: 257 SGDLSREYATIATLKHFLAYAVPEGGQNGNY---ASVGTRDLHENFLPPFQEAIDAG-AL 312
Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
+M +YN ++GIP A+ LL++ R +W F G++ SD ++ ++++ A + E+A +
Sbjct: 313 SVMTSYNSIDGIPCTANYYLLTQLLRNEWRFRGFVVSDLYSIEGVHESHFVAPTIEEAAM 372
Query: 321 DVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFG 380
V+ AG+D++ G + AV+ K+ E+ ID A+ + ++ +GLF +P + P
Sbjct: 373 QVVSAGVDIDLGGNAFMNLTHAVQSGKISEAVIDTAVCRVLRMKFEMGLFE-HPYVNP-- 429
Query: 381 KIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGN 440
K VV S H LA + AQ IVLLKN + +LPL K K +A++GPNA++ +LG+
Sbjct: 430 KSATKVVRSEEHIRLAHKVAQSSIVLLKNKNSILPLNK-KIKKVAVVGPNADNRYNMLGD 488
Query: 441 YAGP----SCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG 496
Y P + +++ + Y GC + I + V+ A ++ ++ ++G
Sbjct: 489 YTAPQEDENIKTVLDGVISKLSPSKVEYVRGCAIRDTTVNEIAEVVEAASRSEVIIAVVG 548
Query: 497 -----------------------LDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVI 533
+ + E DR L L G+QQ+L+ + +A KP+I
Sbjct: 549 GSSARDFKTSYQETGAAIADEKSISDMECGEGFDRATLTLLGKQQDLLNAL-KATGKPLI 607
Query: 534 LVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDY 593
+V + G P+D +A ++L A YPG+ G A+A+V+FGD+NP GRLP++ P+
Sbjct: 608 VVYIEGRPLDKVWAS--EYADALLTASYPGQEGGYAIADVLFGDYNPAGRLPVS-VPRSV 664
Query: 594 IKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYK-FKAVSQNKLYLNQ 652
++P+ K P Y ++ FG GLSY+ + Y + + ++ Y
Sbjct: 665 GQIPVYYNKKAP------CNHDYVEQAASPLYTFGYGLSYTTFEYSDLQVIRKSPCY--- 715
Query: 653 SSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGN 712
F V+ VKN G G+ L+++
Sbjct: 716 ------------------------------FEVSFKVKNTGSYDGEEVAQLYLRDEYASV 745
Query: 713 GRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
+P++QL F+ L E+ EI F L+ + LS + V+E G +++G
Sbjct: 746 VQPLRQLKCFERFFLKRGEEKEIFFTLTE-KDLSIIDRNMARVVETGDFRIMIG 798
>gi|315225249|ref|ZP_07867066.1| periplasmic beta-glucosidase [Capnocytophaga ochracea F0287]
gi|420158631|ref|ZP_14665447.1| glycosyl hydrolase family 3, N-terminal domain protein
[Capnocytophaga ochracea str. Holt 25]
gi|314944932|gb|EFS96964.1| periplasmic beta-glucosidase [Capnocytophaga ochracea F0287]
gi|394763447|gb|EJF45542.1| glycosyl hydrolase family 3, N-terminal domain protein
[Capnocytophaga ochracea str. Holt 25]
Length = 770
Score = 258 bits (658), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 226/798 (28%), Positives = 367/798 (45%), Gaps = 131/798 (16%)
Query: 7 SLVFPLLCLCFTSLLTRVDS---TQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRL 63
+L+ L+ C T + + ++ TQ FS D I QR ++ +
Sbjct: 10 TLISLLMIGCATPAILKSNNEKFTQKSFSGDRH---------------IEQRVDSVLRLM 54
Query: 64 TLDEKISQLVN-------SAPAIPRLGIPAYEWWSEALHG-------VAGVGK------- 102
TL+EKI Q+ + P + P E + L G VAG+ K
Sbjct: 55 TLEEKIGQMTQFSADWSVTGPVMADKYQPYLE---KGLVGSIFNATSVAGIRKLQKIAVE 111
Query: 103 ----GI---FFNGTIRG-ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGM 154
GI F I G T FP + + S+D L + + EA A I
Sbjct: 112 QTRLGIPILFGQDVIHGYKTIFPIPLAESCSWDLALMRKTTELAAREASA-----DGINW 166
Query: 155 TFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACC 214
TF AP ++I RD RWGR E GEDP + A + V+G QG N L AC
Sbjct: 167 TF-APMVDITRDARWGRAMEGAGEDPYLGSLIAEARVKGFQGGD-NWQTLSSPHTLLACG 224
Query: 215 KHFTAYDLDNWKGTTRYKFD---ARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
KHF Y G D A ++M + Y PP+E+ + G S IM + N +NG
Sbjct: 225 KHFAGY------GAAESGKDYNTAELSMHTFRNVYLPPYEATLNAGVGS-IMASLNEING 277
Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAG--MDV 329
+P+ AD+ LL++ R++WGF+G + SD ++ + G AK + A AG MD+
Sbjct: 278 VPATADKWLLTEVLRKEWGFNGLLVSDYTGINELV-RHGVAKDDKQAANLSANAGIEMDM 336
Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
N +F+ K+ A VK+ K+ E++ID+A+ ++ ++ LGLF+ +P + +
Sbjct: 337 NGATFI-KYLSALVKEGKVTEAQIDKAVRHILEMKFLLGLFD-DPYRYLDETRAKENTFT 394
Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA--GPSCR 447
+ +A QA +VLLKN +LP+ K ++A+IGP N+ + G++ G +
Sbjct: 395 EEYLKVARQAVASSVVLLKNEAEVLPIKKDSGKTIAVIGPMMNNTSDINGSWTCLGDGKQ 454
Query: 448 SITPLQALQNYVENT----VYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEK 503
S++ L + T +Y GC S+ + +AV IA+ AD V++ +G
Sbjct: 455 SVSLFTGLTEKYKGTNVKLLYAEGCGFTTISTEQLKEAVAIARKADRVLVAVGEQSNWAG 514
Query: 504 EELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPG 563
E R D+ LP Q++L+ + +A KP+ +V G P+D+++ + N+ +IL A +PG
Sbjct: 515 ESAVRTDIRLPQAQRQLLEAL-KAINKPIAIVTFSGRPLDLSWE--NENVQAILQAWFPG 571
Query: 564 EAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVPM---------------TDMKMRPQAT 608
G +A+VI GD NP G L M+ +P++ ++P+ ++ RP
Sbjct: 572 TQGGNGIADVIAGDVNPSGHLTMS-FPRNVGQIPIYYNYKSTGRPVYTNNEEVDHRPHYN 630
Query: 609 SGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYK 668
+G Y ++PFG GLSY+ + A+S ++LN+ S + ++
Sbjct: 631 AG-----YLDSSITPLYPFGYGLSYTTF-----AISN--VHLNKKSLKRYNDS------- 671
Query: 669 SVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILN 728
+V V+N G G+ V L+ + RP+K+L GF+ + L
Sbjct: 672 --------------IIVNASVQNTGTTEGEIVVQLYTRQLVASVSRPVKELKGFEKISLK 717
Query: 729 AKEKAEIVFELSPCESLS 746
A E ++ FEL P E+L+
Sbjct: 718 AGESKQVCFEL-PSEALA 734
>gi|160884749|ref|ZP_02065752.1| hypothetical protein BACOVA_02738 [Bacteroides ovatus ATCC 8483]
gi|156109784|gb|EDO11529.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus ATCC 8483]
Length = 800
Score = 258 bits (658), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 225/793 (28%), Positives = 365/793 (46%), Gaps = 146/793 (18%)
Query: 51 PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WSEALHGV----AGV 100
PI R DL+S++TL+EK Q+ + + + P W W + + + G+
Sbjct: 62 PIEARIADLLSQMTLEEKTCQMATLYGSGRVLKDAWPTTGWSTEIWKDGIGNIDEQANGL 121
Query: 101 GK-----------------------------GI---FFNGTIRG-----ATSFPQVILTA 123
GK GI F N IRG AT FP
Sbjct: 122 GKFGSEISYPYANSVKNRHTIQRWFVEQTRLGIPVDFTNEGIRGLCHDRATMFPAQCGQG 181
Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGEDPLV 182
A+++ L I + EA+AL G T ++P ++I +DPRWGR E+ GEDP +
Sbjct: 182 ATWNKKLIGEIAKVTADEAKAL-------GYTNIYSPILDIAQDPRWGRVVESYGEDPYL 234
Query: 183 TGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL--DNWKGTTRYKFDARVTMQ 240
G+ + G+Q + A KHF Y + G TR D V +
Sbjct: 235 VGELGKQMILGLQNEGI-----------VATPKHFAVYSIPVGGRDGGTRT--DPHVAPR 281
Query: 241 DLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCD 300
++ Y PF +++ A G+M +YN +G P + L++ R+QWGF GYI SD +
Sbjct: 282 EMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYIVSDSE 341
Query: 301 AVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-----SFLQKHTKAAVKQKKLPESEIDR 355
AV ++ + E+ V+ AG+++ F+ + A+ + K+ +D+
Sbjct: 342 AVEFLHTKHRITPTEEEMAAQVVNAGLNIRTNFTPPQDFILP-LRRAINEGKVSLHTLDQ 400
Query: 356 ALHNLFSVRMRLGLFNGNP----TMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSH 411
+ + V+ +GLF+ NP +P VV + AH+ ++++AA + IVLLKN +
Sbjct: 401 RVGEILRVKFMMGLFD-NPYPGDDRRP-----ETVVHNDAHKAVSMKAALESIVLLKNEN 454
Query: 412 GLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI-TPLQALQNYVENTV--YYPGC 468
+LPL K+ S +A+IGPN K L Y GP+ SI T Q ++ Y+ N+ Y GC
Sbjct: 455 QMLPLSKNFS-KIAVIGPNGEEVKELTCRY-GPANASIKTVYQGIKEYLPNSEVRYAKGC 512
Query: 469 DTV---------------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVL 513
D + A I +AV++AK +D +L++G ++ +EE R +L L
Sbjct: 513 DIIDKYFPESELYNVPLDTQEQAMIQEAVELAKASDIAILVLGGNEKTVREEFSRTNLDL 572
Query: 514 PGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEV 573
GRQQ+L+ V A KPVILV++ G I +A ++ I +I+ A +PGE A+A+V
Sbjct: 573 CGRQQQLLEAVY-ATGKPVILVMVDGRAATINWA--NKYIPAIIHAWFPGEFMGDAIAKV 629
Query: 574 IFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSY 633
+FGD+NPGGRL +T +P+ ++P +P + S R ++PFG GLSY
Sbjct: 630 LFGDYNPGGRLAVT-FPKSVGQIPFA-FPFKPGSDSKGKVRV-----DGVLYPFGYGLSY 682
Query: 634 SKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHG 693
+ + Y +S+ P +G + ++ VKN G
Sbjct: 683 TTFGYSDLKISK-------------------------PVIGPQ----ENITLSCTVKNTG 713
Query: 694 EMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGL 753
+ AG V L+++ K L GF+ + L E+ + F L+P + L ++
Sbjct: 714 KKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVNFTLTP-QDLGLWDKNNQ 772
Query: 754 MVIEEGTHFLVVG 766
+E G+ ++VG
Sbjct: 773 FTVEPGSFSVMVG 785
>gi|52081845|ref|YP_080636.1| glycoside hydrolase family 3 [Bacillus licheniformis DSM 13 = ATCC
14580]
gi|404490730|ref|YP_006714836.1| glycoside hydrolase [Bacillus licheniformis DSM 13 = ATCC 14580]
gi|52005056|gb|AAU24998.1| putative Glycoside Hydrolase Family 3 [Bacillus licheniformis DSM
13 = ATCC 14580]
gi|52349736|gb|AAU42370.1| putative glycoside hydrolase [Bacillus licheniformis DSM 13 = ATCC
14580]
Length = 981
Score = 258 bits (658), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 173/508 (34%), Positives = 257/508 (50%), Gaps = 36/508 (7%)
Query: 12 LLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQ 71
L +C D QP + S +PF +LP+ +R DL+SRLTL+EK+S
Sbjct: 10 LGAMCTILSAAGFDPPQP--AAAKSQADEYEYPFQDPSLPVQKRVSDLLSRLTLEEKVSL 67
Query: 72 LVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLW 131
+ PA+PRLGIPA++ +EALHGVA +G+ AT FPQ A ++D YL
Sbjct: 68 MHQYQPAVPRLGIPAFKTGTEALHGVAWLGE----------ATVFPQAFGLAHTWDRYLI 117
Query: 132 YRIGQAIGLEARALYNAGQAI-GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSY 190
+IG A+G E R ++ A G+ WAP +++ RDPRWGR +E EDP +TG+ A +Y
Sbjct: 118 KQIGSAVGDEVRGFHHLDPAANGVNVWAPVVDLLRDPRWGRNEEGYSEDPFLTGEIATAY 177
Query: 191 VRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPF 250
G++GD L+ KHF AY+ + +G + D R ++ + Y PF
Sbjct: 178 ASGLRGDH------PFYLKTVPTLKHFLAYNNETDRGFSSSSIDPR----NMHEYYLKPF 227
Query: 251 ESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQW-GFHGYITSDCDAVSIIYDAE 309
E+ + + A G+M AYN VN P+ LL T +R+W G +I SD S I +
Sbjct: 228 ETAISKKAAYGLMPAYNSVNDKPAILS-PLLDSTVKRRWAGDDFFIVSDAFDPSGIVNDH 286
Query: 310 GYAKSPEDAVVDVLKAGMD--VNCG---SFLQKHTKAAVKQKKLPESEIDRALHNLFSVR 364
Y S E A +KAG+D + G + A+KQ + E ++D+AL N FS+R
Sbjct: 287 KYYDSHEKAHAHAVKAGIDNFTDQGENPELTRNALTGALKQGLISEKDLDQALANTFSIR 346
Query: 365 MRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSL 424
R G F+ + + P+ ++ DV+ SP HQ+LA +AA+ IVLLKN LLP K+ S+
Sbjct: 347 FRTGEFDPD-ELNPYSRLTDDVINSPKHQLLAKKAAEKAIVLLKNDQDLLPFNARKNESI 405
Query: 425 ALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV--ENTVYYPGCDTVACSSASIDKAV 482
A+IGP N+ Y+G TPL + + + + + G + SA K V
Sbjct: 406 AVIGPFGNALYEDW--YSGTMPYRKTPLDEIADKIGKDRVSFAEGLEQSGFKSALTGKFV 463
Query: 483 DIAKGADHVVLMMGLDQTQEKEELDRVD 510
+ L D+ ++ D D
Sbjct: 464 TAGRDGKQ-PLTAAADKLEKSAAFDAAD 490
Score = 98.2 bits (243), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 81/290 (27%), Positives = 134/290 (46%), Gaps = 40/290 (13%)
Query: 479 DKAVDIAKGADHVVLMMGLD-QTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLL 537
D+AV +AK AD ++ +G + +E DR D+ LP Q+ L+ V++A +LV+
Sbjct: 584 DEAVQLAKSADKAIVFVGNNPYINGRETEDRKDITLPPAQENLLKAVSKANTN-TVLVVT 642
Query: 538 CGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVP 597
P + +AK +I +I+++ + G+ ALA++++GD NPGGRL TW+ +K
Sbjct: 643 SSYPFALNWAK--AHIPAIIYSAHGGQEAGSALADILYGDVNPGGRLTQTWHTS--VK-E 697
Query: 598 MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTK 657
+ DM M G RTY+++ GK +FPFG GLSY+ + Y ++S + + + S +
Sbjct: 698 LPDM-MNYDIIKGK--RTYKYFVGKPLFPFGHGLSYTAFRYSDLSLSSDSIKKDGSVT-- 752
Query: 658 MVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGN-GRPI 716
+ + V N G G V L+ P + +P
Sbjct: 753 ---------------------------IRVNVTNTGGRKGDEVVQLYTSPLFKTRVKQPN 785
Query: 717 KQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
L FQ V L KE + F+L + +E G + + VG
Sbjct: 786 LDLKDFQRVELAPKETKTVTFKLKQTDLAFWDVTRETFAVERGPYRISVG 835
>gi|383115617|ref|ZP_09936373.1| hypothetical protein BSGG_2514 [Bacteroides sp. D2]
gi|313694979|gb|EFS31814.1| hypothetical protein BSGG_2514 [Bacteroides sp. D2]
Length = 946
Score = 258 bits (658), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 231/796 (29%), Positives = 356/796 (44%), Gaps = 135/796 (16%)
Query: 51 PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----W-------SEALHGV 97
P+ R DL+ ++TL+EK Q+V + + +P EW W E L+G
Sbjct: 59 PVDARIEDLLKQMTLEEKTCQMVTLYGYKRVLKDDLPTPEWKNQLWKDGIGAIDEHLNGF 118
Query: 98 AGVGK-------------------------------GI---FFNGTIRG-----ATSFPQ 118
G GI F N IRG AT+FP
Sbjct: 119 QQWGLPPSDNEYVWPASRHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVESYKATNFPT 178
Query: 119 VILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPG 177
+ +++ L ++G G EAR L G T +AP +++ RD RWGR +E G
Sbjct: 179 QLGLGHTWNRELIRQVGVITGREARML-------GYTNVYAPILDVGRDQRWGRYEEVYG 231
Query: 178 EDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARV 237
E P + + + VRG+Q D Q +A KHF AY + + D ++
Sbjct: 232 ESPYLVAELGIEMVRGMQQD----------YQVAATGKHFIAYSNNKGGREGMSRVDPQM 281
Query: 238 TMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITS 297
+ +++ + PF+ +++ G+M +YN +G P + L+ R + GF GY+ S
Sbjct: 282 SPREVEMVHVYPFKRVIREAGLLGVMSSYNDYDGFPIQSSYYWLTTRLRGEMGFRGYVVS 341
Query: 298 DCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT-----KAAVKQKKLPESE 352
D DAV +Y AK ++AV ++AG++V C +F + + VK+ L E
Sbjct: 342 DSDAVEYLYTKHNTAKDMKEAVRQSVEAGLNVRC-TFRSPDSYVLPLRELVKEGGLSEEV 400
Query: 353 IDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD-VVCSPAHQVLALQAAQDGIVLLKNSH 411
I+ + ++ V+ +GLF+ +P GAD V ++ +ALQA+++ IVLLKN
Sbjct: 401 INDRVRDILRVKFLVGLFD-HPYQTDLK--GADEEVEKAENEEVALQASRESIVLLKNDQ 457
Query: 412 GLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV---YYPGC 468
+LPL S +A+ GPNA+ LG+Y + + L+ +Q + V Y GC
Sbjct: 458 DVLPLDISGIKKIAVCGPNADECSYALGHYGPLAVEVTSVLKGIQEKTDGKVEVLYSKGC 517
Query: 469 DTVACS---------------SASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVL 513
+ V + ID+AV AK AD V+++G Q E R L L
Sbjct: 518 ELVDANWPESELIDFPLTEEEQKEIDRAVSQAKEADVAVVVLGGGQRTCGENKSRSSLDL 577
Query: 514 PGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEV 573
PGRQ +L+ V A KPV+LVL+ G P+ I +A D+ + +IL A YPG G A+A+V
Sbjct: 578 PGRQLDLLKAVV-ATGKPVVLVLINGRPLSINWA--DKFVPAILEAWYPGAKGGKAVADV 634
Query: 574 IFGDHNPGGRLPMTWYPQDYIKVPMT-DMKMRPQATSG-NPGRTYRFYEGK-EVFPFGCG 630
+FGD+NPGG+L +T +P+ ++P K Q G NPG ++ FG G
Sbjct: 635 LFGDYNPGGKLTVT-FPKTVGQIPFNFPCKPSSQIDGGKNPGMDGNMSRANGALYAFGHG 693
Query: 631 LSYSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVK 690
LSY+ + Y ++ + NQ K VT V
Sbjct: 694 LSYTSFEYSDLKITPAVITPNQ-----------------------------KTYVTCKVT 724
Query: 691 NHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRARE 750
N G+ AG V L+V+ K L GF+ + L E E+ F + ++L
Sbjct: 725 NTGKRAGDEVVQLYVRDVLSSVTTYEKNLAGFERIHLKPGETKEVFFPIDR-KALELLNA 783
Query: 751 DGLMVIEEGTHFLVVG 766
D V+E G L+VG
Sbjct: 784 DMHWVVEPGDFTLMVG 799
>gi|319647763|ref|ZP_08001981.1| hypothetical protein HMPREF1012_03020 [Bacillus sp. BT1B_CT2]
gi|317390104|gb|EFV70913.1| hypothetical protein HMPREF1012_03020 [Bacillus sp. BT1B_CT2]
Length = 981
Score = 258 bits (658), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 173/508 (34%), Positives = 257/508 (50%), Gaps = 36/508 (7%)
Query: 12 LLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQ 71
L +C D QP + S +PF +LP+ +R DL+SRLTL+EK+S
Sbjct: 10 LGAMCTILSAAGFDPPQP--AAAKSQADEYEYPFQDPSLPVQKRVSDLLSRLTLEEKVSL 67
Query: 72 LVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLW 131
+ PA+PRLGIPA++ +EALHGVA +G+ AT FPQ A ++D YL
Sbjct: 68 MHQYQPAVPRLGIPAFKTGTEALHGVAWLGE----------ATVFPQAFGLAHTWDRYLI 117
Query: 132 YRIGQAIGLEARALYNAGQAI-GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSY 190
+IG A+G E R ++ A G+ WAP +++ RDPRWGR +E EDP +TG+ A +Y
Sbjct: 118 KQIGSAVGDEVRGFHHLDPAANGVNVWAPVVDLLRDPRWGRNEEGYSEDPFLTGEIATAY 177
Query: 191 VRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPF 250
G++GD L+ KHF AY+ + +G + D R ++ + Y PF
Sbjct: 178 ASGLRGDH------PFYLKTVPTLKHFLAYNNETDRGFSSSSIDPR----NMHEYYLKPF 227
Query: 251 ESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQW-GFHGYITSDCDAVSIIYDAE 309
E+ + + A G+M AYN VN P+ LL T +R+W G +I SD S I +
Sbjct: 228 ETAISKKAAYGLMPAYNSVNDKPAILS-PLLDSTVKRRWAGDDFFIVSDAFDPSGIVNDH 286
Query: 310 GYAKSPEDAVVDVLKAGMD--VNCG---SFLQKHTKAAVKQKKLPESEIDRALHNLFSVR 364
Y S E A +KAG+D + G + A+KQ + E ++D+AL N FS+R
Sbjct: 287 KYYDSHEKAHAHAVKAGIDNFTDQGENPELTRNALTGALKQGLISEKDLDQALANTFSIR 346
Query: 365 MRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSL 424
R G F+ + + P+ ++ DV+ SP HQ+LA +AA+ IVLLKN LLP K+ S+
Sbjct: 347 FRTGEFDPD-ELNPYSRLTDDVINSPKHQLLAKKAAEKAIVLLKNDQDLLPFNARKNESI 405
Query: 425 ALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV--ENTVYYPGCDTVACSSASIDKAV 482
A+IGP N+ Y+G TPL + + + + + G + SA K V
Sbjct: 406 AVIGPFGNALYEDW--YSGTMPYRKTPLDEIADKIGKDRVSFAEGLEQSGFKSALTGKFV 463
Query: 483 DIAKGADHVVLMMGLDQTQEKEELDRVD 510
+ L D+ ++ D D
Sbjct: 464 TAGRDGKQ-PLTAAADKLEKSAAFDAAD 490
Score = 98.2 bits (243), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 81/290 (27%), Positives = 134/290 (46%), Gaps = 40/290 (13%)
Query: 479 DKAVDIAKGADHVVLMMGLD-QTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLL 537
D+AV +AK AD ++ +G + +E DR D+ LP Q+ L+ V++A +LV+
Sbjct: 584 DEAVQLAKSADKAIVFVGNNPYINGRETEDRKDITLPPAQENLLKAVSKANTN-TVLVVT 642
Query: 538 CGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYIKVP 597
P + +AK +I +I+++ + G+ ALA++++GD NPGGRL TW+ +K
Sbjct: 643 SSYPFALNWAK--AHIPAIIYSAHGGQEAGSALADILYGDVNPGGRLTQTWHTS--VK-E 697
Query: 598 MTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSSSTK 657
+ DM M G RTY+++ GK +FPFG GLSY+ + Y ++S + + + S +
Sbjct: 698 LPDM-MNYDIIKGK--RTYKYFVGKPLFPFGHGLSYTAFRYSDLSLSSDSIKKDGSVT-- 752
Query: 658 MVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGN-GRPI 716
+ + V N G G V L+ P + +P
Sbjct: 753 ---------------------------IRVNVTNTGGRKGDEVVQLYTSPMFKTRVKQPN 785
Query: 717 KQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
L FQ V L KE + F+L + +E G + + VG
Sbjct: 786 LDLKDFQRVELAPKETKTVTFKLKQTDLAFWDVTRETFAVERGPYRISVG 835
>gi|317474221|ref|ZP_07933497.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
gi|316909531|gb|EFV31209.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
Length = 786
Score = 258 bits (658), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 216/793 (27%), Positives = 356/793 (44%), Gaps = 130/793 (16%)
Query: 51 PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WSEALHGVAGVGKGI 104
P+ R DL+S+++++EK Q+ + + R +P +W W + + + G+
Sbjct: 48 PLDARVHDLLSQMSIEEKTCQMATLYGSGRVLRDSLPTEQWKNEIWKDGIANIDEQANGL 107
Query: 105 ------------------------------------FFNGTIRG-----ATSFPQVILTA 123
F N IRG AT FP
Sbjct: 108 GTFGSTLSYPYVNSVENRQAIQRWFVEETRLGIPVDFTNEGIRGLCHDRATMFPAQCGQG 167
Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGEDPLV 182
A+++ L I + EA+AL G T ++P ++I +DPRWGR E GEDP +
Sbjct: 168 ATWNKELISEIARVTAEEAKAL-------GYTNIYSPILDIAQDPRWGRVVECYGEDPFL 220
Query: 183 TGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDL 242
G+ ++G+Q + A KHF Y + + D V +++
Sbjct: 221 VGELGKRMIKGLQAEGL-----------VATPKHFAVYSVPVGGRDAGTRTDPHVAPREM 269
Query: 243 ADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAV 302
Y PF + A G+M +YN +G P + L++ R +WGF GY+ SD +AV
Sbjct: 270 RTLYLEPFRKAFCEAGALGVMSSYNDYDGEPITGSYHFLTEILRHEWGFKGYVVSDSEAV 329
Query: 303 SIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKH----TKAAVKQKKLPESEIDRALH 358
+Y A D V+ AG++V L ++ + A+ + K+ ID +
Sbjct: 330 EFLYSKHKVAADAVDGAAQVINAGLNVRTNFSLPENFIYPLRHAISEGKISMQTIDSRVA 389
Query: 359 NLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPK 418
++ V+ +GLF+ NP K+ V C HQ ++++AA + IVLLKN + +LPL K
Sbjct: 390 DVLRVKFMMGLFD-NPYKGDTKKLEKVVHCK-EHQNVSMRAALESIVLLKNENNILPLSK 447
Query: 419 SKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV--YYPGCDTV----- 471
S +A+IGPNA L+ Y + T Q +++Y+ ++ Y G D +
Sbjct: 448 SIK-KVAVIGPNAAEVDKLICRYGPANAPIKTVYQGIKDYLPDSEVRYAKGADIIDEYFP 506
Query: 472 ----------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELI 521
ID+AV +AK +D ++++G ++ +EE R +L L GRQ++L+
Sbjct: 507 ESELYDVPLNKTEETMIDEAVALAKESDVAIMVLGGNEKTVREEYSRTNLDLCGRQEKLL 566
Query: 522 TRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
V A KPVILV++ G I +A +R I I+ A +PGE A+A+VIFGD+NPG
Sbjct: 567 QAVY-ATGKPVILVMIDGRAATINWA--ERYIPGIVHAWFPGEFMGDAIAKVIFGDYNPG 623
Query: 582 GRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFK 641
G+L +T +P+ ++P +P + S R ++PFG GLSY+ + Y
Sbjct: 624 GKLAVT-FPRSVGQIPFA-FPFKPGSDSKGFVRV-----TGSLYPFGYGLSYTTFDY--- 673
Query: 642 AVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPV 701
S +EN P +GT+ ++ VKN G++AG V
Sbjct: 674 -------------SALKIEN---------PVIGTQ----GSVKLSCRVKNTGKIAGDEVV 707
Query: 702 LLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTH 761
L++ +K L GF+ + L E+ + F L+P + + +D ++E GT
Sbjct: 708 QLYLHDEVSSVTTYVKVLRGFERIHLQPGEEKVVDFVLTP-QDMGLWNKDNHFIVEPGTF 766
Query: 762 FLVVGDEEYPISI 774
++VG I +
Sbjct: 767 AVMVGSSSQDIRL 779
>gi|423293673|ref|ZP_17271800.1| hypothetical protein HMPREF1070_00465 [Bacteroides ovatus
CL03T12C18]
gi|392677631|gb|EIY71047.1| hypothetical protein HMPREF1070_00465 [Bacteroides ovatus
CL03T12C18]
Length = 800
Score = 258 bits (658), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 225/793 (28%), Positives = 365/793 (46%), Gaps = 146/793 (18%)
Query: 51 PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WSEALHGV----AGV 100
PI R DL+S++TL+EK Q+ + + + P W W + + + G+
Sbjct: 62 PIEARIADLLSQMTLEEKTCQMATLYGSGRVLKDAWPTTGWSTEIWKDGIGNIDEQANGL 121
Query: 101 GK-----------------------------GI---FFNGTIRG-----ATSFPQVILTA 123
GK GI F N IRG AT FP
Sbjct: 122 GKFGSEISYPYANSVKNRHTIQRWFVEQTRLGIPVDFTNEGIRGLCHDRATMFPAQCGQG 181
Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGEDPLV 182
A+++ L I + EA+AL G T ++P ++I +DPRWGR E+ GEDP +
Sbjct: 182 ATWNKKLIGEIAKVTADEAKAL-------GYTNIYSPILDIAQDPRWGRVVESYGEDPYL 234
Query: 183 TGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL--DNWKGTTRYKFDARVTMQ 240
G+ + G+Q + A KHF Y + G TR D V +
Sbjct: 235 VGELGKQMILGLQNEGI-----------VATPKHFAVYSIPVGGRDGGTRT--DPHVAPR 281
Query: 241 DLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCD 300
++ Y PF +++ A G+M +YN +G P + L++ R+QWGF GYI SD +
Sbjct: 282 EMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYIVSDSE 341
Query: 301 AVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-----SFLQKHTKAAVKQKKLPESEIDR 355
AV ++ + E+ V+ AG+++ F+ + A+ + K+ +D+
Sbjct: 342 AVEFLHTKHRITPTEEEMAAQVVNAGLNIRTNFTPPQDFILP-LRRAINEGKVSLHTLDQ 400
Query: 356 ALHNLFSVRMRLGLFNGNP----TMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSH 411
+ + V+ +GLF+ NP +P VV + AH+ ++++AA + IVLLKN +
Sbjct: 401 RVGEILRVKFMMGLFD-NPYPGDDRRP-----ETVVHNDAHKAVSMKAALESIVLLKNEN 454
Query: 412 GLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI-TPLQALQNYVENTV--YYPGC 468
+LPL K+ S +A+IGPN K L Y GP+ SI T Q ++ Y+ N+ Y GC
Sbjct: 455 QMLPLSKNFS-KIAVIGPNGEEVKELTCRY-GPANASIKTVYQGIKEYLPNSEVRYAKGC 512
Query: 469 DTV---------------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVL 513
D + A I +AV++AK +D +L++G ++ +EE R +L L
Sbjct: 513 DIIDKYFPESELNNVPLDTQEQAMIQEAVELAKASDIAILVLGGNEKTVREEFSRTNLDL 572
Query: 514 PGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEV 573
GRQQ+L+ V A KPVILV++ G I +A ++ I +I+ A +PGE A+A+V
Sbjct: 573 CGRQQQLLEAVY-ATGKPVILVMVDGRAATINWA--NKYIPAIIHAWFPGEFMGDAIAKV 629
Query: 574 IFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSY 633
+FGD+NPGGRL +T +P+ ++P +P + S R ++PFG GLSY
Sbjct: 630 LFGDYNPGGRLAVT-FPKSVGQIPFA-FPFKPGSDSKGKVRV-----DGVLYPFGYGLSY 682
Query: 634 SKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHG 693
+ + Y +S+ P +G + ++ VKN G
Sbjct: 683 TTFGYSDLKISK-------------------------PVIGPQ----ENITLSCTVKNTG 713
Query: 694 EMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGL 753
+ AG V L+++ K L GF+ + L E+ + F L+P + L ++
Sbjct: 714 KKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVNFTLTP-QDLGLWDKNNR 772
Query: 754 MVIEEGTHFLVVG 766
+E G+ ++VG
Sbjct: 773 FTVEPGSFSVMVG 785
>gi|323452325|gb|EGB08199.1| hypothetical protein AURANDRAFT_4612, partial [Aureococcus
anophagefferens]
Length = 262
Score = 258 bits (658), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 134/266 (50%), Positives = 171/266 (64%), Gaps = 10/266 (3%)
Query: 42 TFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGV-AGV 100
+ PFC+ LP ++R DLV R+ DE LVN A +PRL + + WWSEALHGV AG
Sbjct: 2 SLPFCRGDLPRAKRVADLVDRVEPDEVQKLLVNGAAGLPRLWLQPHNWWSEALHGVQAGC 61
Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPN 160
+ +G+ R T FP I TAASF++ L+ +G IG EARAL N G G TFW+PN
Sbjct: 62 AQAD--DGSSRCPTGFPAAISTAASFNATLFRAVGSVIGAEARALANEGVTNGFTFWSPN 119
Query: 161 INIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAY 220
+NI RDPRWGRGQETPGEDPL+ GKY ++V G + + G + ASAC KHF AY
Sbjct: 120 LNILRDPRWGRGQETPGEDPLLNGKYGENFVLGFEHPDGSSGD---AIAASACPKHFFAY 176
Query: 221 DLDNW---KGTTRYKFD-ARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
+L+N K R+ FD A ++ +L TY PPFE + G+ASG+MC+YN VNG PSCA
Sbjct: 177 NLENCFKVKDNCRHTFDMANLSQGELEATYLPPFEQAISSGKASGLMCSYNAVNGTPSCA 236
Query: 277 DRNLLSKTARRQWGFHGYITSDCDAV 302
+ + AR +WGF GY+TSDC AV
Sbjct: 237 NAWGIETLARGKWGFEGYVTSDCHAV 262
>gi|256419370|ref|YP_003120023.1| glycoside hydrolase [Chitinophaga pinensis DSM 2588]
gi|256034278|gb|ACU57822.1| glycoside hydrolase family 3 domain protein [Chitinophaga pinensis
DSM 2588]
Length = 807
Score = 258 bits (658), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 220/722 (30%), Positives = 345/722 (47%), Gaps = 112/722 (15%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIP E HG +G +F +TS Q A+++D L + AI
Sbjct: 140 RLGIPLL-LAEECPHGHMAIGTTVF-------STSIGQ----ASTWDPALIQEMAGAIAK 187
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
EAR G IG + P +++ R+PRW R +ET GEDP + + +S V+G QG++
Sbjct: 188 EARV---QGAHIG---YGPVLDLVREPRWSRLEETYGEDPYLISQMGISMVKGFQGNSIG 241
Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTM---QDLADTYQPPFESCVKQG 257
G + KHFTAY G+ + + + +DL +Y PPF++ VK G
Sbjct: 242 SGS-----NVISTLKHFTAY------GSPEGGHNGGIALTGLRDLYSSYLPPFQAAVKAG 290
Query: 258 RASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPED 317
S IM +YN ++GIP ++ LL +QWGF G+ SD + + A + E+
Sbjct: 291 ALS-IMASYNSIDGIPCSSNSFLLKDVLVKQWGFSGFSVSDLGGIPGVRSTHHIAATMEE 349
Query: 318 AVVDVLKAGMDVNCGSFLQKHTKA---AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNP 374
A + AG+D + G + + A AV KK+ + +D A+ ++ ++ +GLF NP
Sbjct: 350 AATLAINAGLDADLGG--EAYGDALIKAVNNKKVTMTTLDTAVAHVLRLKFTMGLFE-NP 406
Query: 375 TMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSA 434
+ + V + A++ L+ + A + IVL+KN +GLLPL K+ +LA+IGPNA++
Sbjct: 407 YVDV--DVAEKTVGTAANRALSKRVAAESIVLMKNENGLLPLQKTIK-NLAVIGPNADNI 463
Query: 435 KTLLGNYAGPSCRS--ITPLQALQNYVE---NTVYYPGCDTVACSSASIDKAVDIAKGAD 489
LG+Y P + +T L+ ++ V Y GC + A+I AV A+ AD
Sbjct: 464 YNQLGDYTAPQPQEKIVTVLEGIKAKVSADTKVTYVKGCAIRDTAHANISAAVAAAQQAD 523
Query: 490 HVVLMMG-------------------------LDQTQEKEELDRVDLVLPGRQQELITRV 524
VV+++G + + E DRV L L G Q L+ +
Sbjct: 524 AVVIVLGGSSARDFETTFQSTGAAEVKAAEVAVSDMESGEGYDRVSLDLMGLQSRLLESI 583
Query: 525 AEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRL 584
A KPV+LVL+ G P++I +A +N+ +I+ A YPG+ G A+A+V+FGD+NP GRL
Sbjct: 584 V-ATGKPVVLVLIEGRPLNINWAA--KNVPAIVNAWYPGQEGGNAVADVLFGDYNPAGRL 640
Query: 585 PMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVS 644
P++ P+ ++P+ + A Y + K ++ FG GLSY+ + YK
Sbjct: 641 PVS-IPKSVGQLPVY-YNYKSAAR-----HDYVEMDAKPLYSFGHGLSYASFEYK----- 688
Query: 645 QNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLF 704
D+ + V G K V+ +KN +AG L+
Sbjct: 689 ------------------DL--HSDVQATGNNL----KVTVSFKLKNTSTVAGDEVAQLY 724
Query: 705 VKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLV 764
++ +KQL FQ V L E+ EI FELSP + L D V+E GT ++
Sbjct: 725 IRDDASSVVTAVKQLKKFQRVHLAPGEQKEIKFELSP-DDLRLLNVDMKWVVEPGTFTMM 783
Query: 765 VG 766
+G
Sbjct: 784 IG 785
>gi|299144988|ref|ZP_07038056.1| xylosidase [Bacteroides sp. 3_1_23]
gi|298515479|gb|EFI39360.1| xylosidase [Bacteroides sp. 3_1_23]
Length = 800
Score = 258 bits (658), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 225/790 (28%), Positives = 365/790 (46%), Gaps = 140/790 (17%)
Query: 51 PISQRARDLVSRLTLDEKISQLVNS-------APAIPRLGIPAYEWWSEALHGV----AG 99
PI R DL+S++TL+EK Q+ A P G A E W + + + G
Sbjct: 62 PIEARIADLLSQMTLEEKTCQMATLYGSGRVLKDAWPTAGWSA-EIWKDGIGNIDEQANG 120
Query: 100 VGK-----------------------------GI---FFNGTIRG-----ATSFPQVILT 122
+GK GI F N IRG AT FP
Sbjct: 121 LGKFGSEISYPYANSVKNRHTIQRWFMEQTRLGIPVDFTNEGIRGLCHDRATMFPAQCGQ 180
Query: 123 AASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGEDPL 181
A+++ L I + EA+AL G T ++P ++I +DPRWGR E+ GEDP
Sbjct: 181 GATWNKKLIREIAKVTADEAKAL-------GYTNIYSPILDIAQDPRWGRVVESYGEDPY 233
Query: 182 VTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL--DNWKGTTRYKFDARVTM 239
+ G+ + G+Q + A KHF Y + G TR D V
Sbjct: 234 LVGELGKQMILGLQNEGI-----------VATPKHFAVYSIPVGGRDGGTRT--DPHVAP 280
Query: 240 QDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDC 299
+++ Y PF +++ A G+M +YN +G P + L++ R+QWGF GY+ SD
Sbjct: 281 REMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYVVSDS 340
Query: 300 DAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-----SFLQKHTKAAVKQKKLPESEID 354
+AV ++ + E+ V+ AG+++ F+ + A+ + K+ +D
Sbjct: 341 EAVEFLHTKHRITPTEEEMAAQVVNAGLNIRTNFTPPQDFILP-LRRAINEGKVSLHTLD 399
Query: 355 RALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLL 414
+ + + V+ +GLF+ NP + A VV + AH+ ++++AA + IVLLKN + +L
Sbjct: 400 QRVGEILRVKFMMGLFD-NPYPGDDRRPEA-VVHNDAHKAVSMKAALESIVLLKNENQML 457
Query: 415 PLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI-TPLQALQNYVENTV--YYPGCDTV 471
PL K+ S +A+IGPN K L Y GP+ SI T Q ++ Y+ N+ Y GCD +
Sbjct: 458 PLSKNFS-KIAVIGPNGEEVKELTCRY-GPANASIKTVYQGIKEYLPNSEVRYVKGCDII 515
Query: 472 ---------------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGR 516
A I +AV++AK +D +L++G ++ +EE R +L L GR
Sbjct: 516 DKYFPESELYNVPLDTQEQAMIHEAVELAKASDVAILVLGGNEKTVREEFSRTNLDLCGR 575
Query: 517 QQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFG 576
QQ+L+ V A KPV+LV++ G I +A ++ + +I+ A +PGE A+A+V+FG
Sbjct: 576 QQQLLEAVY-ATGKPVVLVMVDGRAATINWA--NKYVPAIIHAWFPGEFMGDAIAKVLFG 632
Query: 577 DHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKY 636
D+NPGGRL +T +P+ ++P +P + S R ++PFG GLSY+ +
Sbjct: 633 DYNPGGRLAVT-FPKSVGQIPFA-FPFKPGSDSKGKVRV-----DGVLYPFGYGLSYTTF 685
Query: 637 SYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMA 696
Y +S+ P +G + ++ VKN G+ A
Sbjct: 686 GYSDLKISK-------------------------PVIGPQ----ENITLSCTVKNTGKKA 716
Query: 697 GKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVI 756
G V L+++ K L GF+ + L E+ + F L+P + L ++ +
Sbjct: 717 GDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVNFTLTP-QDLGLWDKNNRFTV 775
Query: 757 EEGTHFLVVG 766
E G+ ++VG
Sbjct: 776 EPGSFSVMVG 785
>gi|330995911|ref|ZP_08319806.1| glycosyl hydrolase family 3 protein [Paraprevotella xylaniphila YIT
11841]
gi|329574250|gb|EGG55825.1| glycosyl hydrolase family 3 protein [Paraprevotella xylaniphila YIT
11841]
Length = 865
Score = 257 bits (657), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 159/441 (36%), Positives = 230/441 (52%), Gaps = 33/441 (7%)
Query: 44 PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
P+ L +RA DL+ RLTL EK+S ++NS+PA+ RLGI Y WWSEALHGVA G
Sbjct: 24 PYKNPDLSPRERAEDLLKRLTLKEKVSLMINSSPAVERLGIKPYNWWSEALHGVARSGI- 82
Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG--------MT 155
AT +P + A+ FD + + EARA Y+ G +T
Sbjct: 83 ---------ATVYPITMGMASVFDDEAVEAVYTTVSDEARAKYHDAHKKGRYGQMNEGLT 133
Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCK 215
FW PN+NIFRDPRWGRGQET GEDP +T + V+ VRG+QG KL AC K
Sbjct: 134 FWTPNVNIFRDPRWGRGQETWGEDPYLTSRMGVAVVRGLQGPA---DAKYDKLH--ACAK 188
Query: 216 HFTAYDLDNWKGTTRYKFDA-RVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPS 274
H+ + K R+ FD ++ +DL +TY P F+ V++ +MCAY R G P
Sbjct: 189 HYAVHSGPEAK---RHYFDVEQLDPRDLWETYLPAFKVLVQEADVKEVMCAYQRFEGEPC 245
Query: 275 CADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE--DAVVDVLKAGMDVNCG 332
C + LL + R +WGF + SDC A+ ++ + P+ A + +G D+ CG
Sbjct: 246 CGSKRLLHQILRDEWGFKYVVVSDCGAIGDFFNPGLHETHPDAATASASAVTSGTDLECG 305
Query: 333 SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAH 392
+AAV + + E ID +L L R LG + + ++ P+ +IG D V H
Sbjct: 306 WGDYMQLEAAVDRGLITEHRIDTSLCRLLEARFALGEMD-DDSLVPWSRIGIDTVDCQTH 364
Query: 393 QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
+ +AL A+ +VLL N G+LPL K++ + ++GPNA + GNY G + T L
Sbjct: 365 KQMALDIARKSLVLLHND-GVLPLDKTRG-DVVVMGPNAVDSVMQWGNYEGTPSHTYTVL 422
Query: 453 QALQNYVENTVYYP-GCDTVA 472
++ + V Y GC+ ++
Sbjct: 423 DGIRERLGRDVRYEKGCNLLS 443
Score = 102 bits (254), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 87/267 (32%), Positives = 124/267 (46%), Gaps = 52/267 (19%)
Query: 486 KGADHVVLMMGLDQTQEKEEL-----------DRVDLVLPGRQQELITRVAEAAKKPVIL 534
K A VV + G+ E EE DR + LP Q++++ + A KK ++
Sbjct: 599 KDAGTVVFVGGISPDLEGEEKNHVNCPGFAGGDRTSIELPQVQRDILKALKAAGKK--VV 656
Query: 535 VLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI 594
+ C G + + +IL A YPG+AG +A+A+V+FGD NP G+LP+T+Y
Sbjct: 657 FVNCSGSA-MALVPELESCDAILQAWYPGQAGGLAVADVLFGDFNPSGKLPVTFYKNTEQ 715
Query: 595 KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSS 654
D M+ GRTYR+ K +FPFG GLSY+ + A+S+ +L
Sbjct: 716 LPDFEDYSMK--------GRTYRYMTDKPLFPFGYGLSYTTF-----AISKGRL------ 756
Query: 655 STKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNGR 714
KS E G E KF T VKN G+ G V ++V+ G
Sbjct: 757 ------------SKSSVEAG----EGVKF--TAQVKNTGKRDGAEVVQVYVRKVGDTGG- 797
Query: 715 PIKQLVGFQSVILNAKEKAEIVFELSP 741
P+K L GF+ V L A E + ELSP
Sbjct: 798 PLKSLRGFRRVELKAGESRRVSIELSP 824
>gi|237721943|ref|ZP_04552424.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
gi|229448812|gb|EEO54603.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
Length = 792
Score = 257 bits (657), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 222/793 (27%), Positives = 365/793 (46%), Gaps = 146/793 (18%)
Query: 51 PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WSEALHGV----AGV 100
PI R DL+S++TL+EK Q+ + + + P W W + + + G+
Sbjct: 54 PIEARIADLLSQMTLEEKTCQMATLYGSGRVLKDACPTAGWLAEIWKDGIGNIDEQANGL 113
Query: 101 GK-----------------------------GI---FFNGTIRG-----ATSFPQVILTA 123
GK GI F N IRG AT FP
Sbjct: 114 GKFGSEISYPYANSVKNRHTIQRWFVEQTRLGIPVDFTNEGIRGLCHDRATMFPAQCGQG 173
Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGEDPLV 182
A+++ L I + EA+AL G T ++P ++I +DPRWGR E+ GEDP +
Sbjct: 174 ATWNKKLIREIAKVTANEAKAL-------GYTNIYSPILDIAQDPRWGRVVESYGEDPYL 226
Query: 183 TGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL--DNWKGTTRYKFDARVTMQ 240
G+ + G+Q + A KHF Y + G TR D V +
Sbjct: 227 AGELGKQMILGLQNEGI-----------VATPKHFAVYSIPVGGRDGGTRT--DPHVAPR 273
Query: 241 DLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCD 300
++ Y PF +++ A G+M +YN +G P + L++ R+QWGF GY+ SD +
Sbjct: 274 EMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYVVSDSE 333
Query: 301 AVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-----SFLQKHTKAAVKQKKLPESEIDR 355
AV ++ + E+ V+ AG+++ F+ + A+ + K+ +D+
Sbjct: 334 AVEFLHTKHRITPTEEEMAAQVVNAGLNIRTNFTPPQDFILP-LRRAIDEGKVSLHTLDQ 392
Query: 356 ALHNLFSVRMRLGLFNGNP----TMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSH 411
+ + V+ +GLF+ NP +P VV + AH+ ++++AA + +VLLKN +
Sbjct: 393 RVSEILRVKFMMGLFD-NPYPGDDRRP-----ETVVHNDAHKAVSMKAALESVVLLKNEN 446
Query: 412 GLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI-TPLQALQNYVENTV--YYPGC 468
+LPL K+ +A+IGPNA K L Y GP+ SI T Q ++ Y+ N+ Y GC
Sbjct: 447 QMLPLSKNFK-KIAVIGPNAEEVKELTCRY-GPANASIKTVYQGIKEYLPNSEVRYAKGC 504
Query: 469 DTV---------------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVL 513
D + A I +AV++AK +D +L++G ++ +EE R +L L
Sbjct: 505 DIIDKYFPESELYNVPLDTQEQAMIHEAVELAKASDIAILVLGGNEKTVREEFSRTNLDL 564
Query: 514 PGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEV 573
GRQQ+L+ V A KPV+LV++ G I +A ++ I +I+ A +PGE A+A+V
Sbjct: 565 CGRQQQLLEAVY-ATGKPVVLVMVDGRAATINWA--NKYIPAIIHAWFPGEFMGDAIAKV 621
Query: 574 IFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSY 633
+FGD+NPGGRL +T +P+ ++P +P + S R ++PFG GLSY
Sbjct: 622 LFGDYNPGGRLAVT-FPKSVGQIPFA-FPFKPGSDSKGKVRV-----DGALYPFGYGLSY 674
Query: 634 SKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHG 693
+ + Y +S+ P +G + ++ VKN G
Sbjct: 675 TTFGYSDLKISK-------------------------PVIGPQ----ENITLSCTVKNTG 705
Query: 694 EMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGL 753
+ AG V L+++ K L GF+ + L E+ + F L+P + L ++
Sbjct: 706 KKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVSFTLTP-QDLGLWDKNNR 764
Query: 754 MVIEEGTHFLVVG 766
+E G+ ++VG
Sbjct: 765 FTVEPGSFSVMVG 777
>gi|218132023|ref|ZP_03460827.1| hypothetical protein BACEGG_03648 [Bacteroides eggerthii DSM 20697]
gi|217985783|gb|EEC52123.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
eggerthii DSM 20697]
Length = 762
Score = 257 bits (657), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 224/780 (28%), Positives = 346/780 (44%), Gaps = 130/780 (16%)
Query: 51 PISQRARDLVSRLTLDEKISQL-------------VNSAPAIPRLGIPAY---------- 87
P+ R DL+ R+TL+EKI+Q+ V++ L +Y
Sbjct: 34 PVEVRVADLLKRMTLEEKIAQMQDLKFKDFSVDGKVDTVKMDSVLKGMSYASVFGSRLSV 93
Query: 88 EWWSEALH------------GVAGVGKGIFFNGTIR-GATSFPQVILTAASFDSYLWYRI 134
E E++ G+ +G+ +G I GAT FPQ I +++F+ + +R+
Sbjct: 94 EQMQESMFAINKYMAEHNRLGIPVLGEAESLHGLIHDGATIFPQSIALSSTFNPDITHRV 153
Query: 135 GQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGV 194
I EA+A Q + +P +++ R+ RWGR +ET GEDP + G+ V+YV
Sbjct: 154 ATVIAQEAKAT-GVDQVL-----SPVLDLARELRWGRVEETYGEDPYLVGRMGVAYVSA- 206
Query: 195 QGDTFNGGKLKGKLQASACCKHFTAY-------DLDNWKGTTRYKFDARVTMQDLADTYQ 247
FN K KHF A+ +L + G R DL Y
Sbjct: 207 ----FN------KEGVMTTLKHFLAHGSPTGGLNLASVTGCER----------DLRSLYL 246
Query: 248 PPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYD 307
PF+ +++ +M +YN +P A +L R + GF GYI+SD +V ++
Sbjct: 247 KPFQDVMREAMPYSVMNSYNSYESVPVAASHWILDDILRGEMGFKGYISSDWGSVEMLRS 306
Query: 308 AEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRL 367
AK DA + AG+DV + + V+ LPE EID+ + + + + +
Sbjct: 307 LHHTAKDKADAACQAVIAGVDVEVDGDCYETLDSLVRSGVLPEKEIDKCVSRVLTAKFAM 366
Query: 368 GLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALI 427
GLF+ + T + + V +P LAL AA++ +L+KN + LLPL +K S+A+I
Sbjct: 367 GLFDKDYTKRAN---LSQTVHTPEAVELALVAARESAILVKNENSLLPLDANKLRSVAVI 423
Query: 428 GPNANSAKTLLGNYAGPSCRS--ITPLQALQNYVENTV---YYPGCDTVACSSASIDKAV 482
GPNA A+ G+Y + ITPLQ ++ + V Y GC+ + +AV
Sbjct: 424 GPNA--AQVQFGDYMWTNSNEYGITPLQGIEAVTQGKVKINYAKGCEIHTQDRSGFSQAV 481
Query: 483 DIAKGADHVVLMMG---------LDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVI 533
A+ +D +L +G + E D D+ LPG Q+ LI R +A KP I
Sbjct: 482 TAARNSDVALLFVGAMSGSPGRPWPNSVSGESFDLSDISLPGCQEALI-RAVKATGKPTI 540
Query: 534 LVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQD- 592
+VL+ G P I + K N +++ Y GE A+AE++FG+ NP GRL ++ +PQ
Sbjct: 541 VVLVAGKPFAIPWVK--DNCEAVIVQWYGGEQEGRAIAEILFGEVNPSGRLNVS-FPQST 597
Query: 593 ------YIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQN 646
Y P T PGR Y F V+ FG GLSY+ + YK +S N
Sbjct: 598 GHLPVFYNYYPSDKGFYHDHGTLEKPGRDYVFSSPDPVWAFGHGLSYTTFKYKSMQIS-N 656
Query: 647 KLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVK 706
K + + + CE +T+ V N G+ GK V L+V
Sbjct: 657 KEFTDDDT-----------------------CE-----ITVEVANTGKRDGKEVVQLYVN 688
Query: 707 PARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
P+K+L F+ V + A E + F L P + L+ D V+E G L VG
Sbjct: 689 DIVSSVVTPVKELRRFEKVFIPAGETRTVKFNL-PIKELALWNTDMKEVVEPGDFELQVG 747
>gi|383643328|ref|ZP_09955734.1| glycoside hydrolase family 3 [Sphingomonas elodea ATCC 31461]
Length = 799
Score = 257 bits (657), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 224/713 (31%), Positives = 328/713 (46%), Gaps = 99/713 (13%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIP EALHG+ G ATSFPQ I A+SFD L I
Sbjct: 145 RLGIPML-MHEEALHGLVAPG-----------ATSFPQSIALASSFDPKLVENIFSMAAK 192
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
EARA + + AP +++ RDPRWGR +ET GEDP + + ++ +RG QG T
Sbjct: 193 EARA-----RGANLVL-APVVDVARDPRWGRIEETYGEDPYLVTQMGLAAIRGFQGTTM- 245
Query: 201 GGKLKGKLQASACCKHFTAY-DLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRA 259
LK + KH T + +N R +D + PPFE+ VK
Sbjct: 246 --PLKSD-KVFITLKHMTGHGQPENGTNVGPASLGERTLRED----FFPPFEAAVKTLPV 298
Query: 260 SGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAV 319
+M +YN ++GIPS A++ LL+ R +WGF G + SD A+ + K P+DA
Sbjct: 299 MSVMASYNEIDGIPSHANKWLLTDVLRGEWGFQGAVVSDYFAIRELITRHHLFKDPKDAA 358
Query: 320 VDVLKAGMDVNC-GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQP 378
L AG+DV H VKQ ++ + EID A+ + ++ GLF NP P
Sbjct: 359 QRALDAGVDVETPDGEAYTHLVQLVKQGRVSQGEIDNAVRRVLRMKFEGGLFE-NP--YP 415
Query: 379 FGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLL 438
K+ A +P L+ QAA++ IVLLKN+ GLLPL +A+IG +A T +
Sbjct: 416 EVKLAAARTNTPEAIALSRQAARESIVLLKNAQGLLPLDARGIKRMAVIGTHAKD--TPI 473
Query: 439 GNYAGPSCRSITPLQALQ---------NYVENTVYYP----GCDTVACSSASID-----K 480
G Y+ ++ L+ +Q +Y E D VA AS++ +
Sbjct: 474 GGYSDLPNHVVSVLEGMQAEGKGKFAVDYAEGIRITNHREWSKDAVAQVPASVNDQLRAQ 533
Query: 481 AVDIAKGADHVVLMMGLDQTQEKEEL------DRVDLVLPGRQQELITRVAEAAKKPVIL 534
A++ AK AD VVL++G ++ +E D L LPG Q +L + A KPV++
Sbjct: 534 ALETAKNADVVVLVLGGNEAVSREAWADNHLGDSETLDLPGPQDQLAKELI-ALGKPVVV 592
Query: 535 VLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDYI 594
+LL G P + + +++ Y GE A+A+V+FG +NPGG+LP++ +
Sbjct: 593 ILLNGRPYAVNYLA--EKAPALIEGWYLGEQTGNAIADVVFGRYNPGGKLPVS-VARSVG 649
Query: 595 KVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQSS 654
++P+ K +P A G Y F + ++PFG GLSY+ +
Sbjct: 650 QLPIYYNK-KPSARRG-----YLFGDTSPLYPFGYGLSYTTFDI---------------- 687
Query: 655 STKMVENQDVVHYKSVPELGT-EFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNG 713
S P LGT K V + V N G++AG V LFV
Sbjct: 688 --------------SAPRLGTPTIGIADKASVEVDVTNTGKVAGDEVVQLFVHDDEASVT 733
Query: 714 RPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
RP+ +L F+ V L EK + FEL+P + L+ V+E GT + G
Sbjct: 734 RPVIELKRFERVTLKPGEKKTVRFELTP-DDLALWNSQMRHVVEPGTFTISSG 785
>gi|330996730|ref|ZP_08320605.1| glycosyl hydrolase family 3 protein, partial [Paraprevotella
xylaniphila YIT 11841]
gi|329572575|gb|EGG54218.1| glycosyl hydrolase family 3 protein [Paraprevotella xylaniphila YIT
11841]
Length = 725
Score = 257 bits (657), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 160/446 (35%), Positives = 230/446 (51%), Gaps = 35/446 (7%)
Query: 36 SNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALH 95
S+P+ + P+ L +RA DL+ RLTL EKIS + N +P + RLGI Y WWSEALH
Sbjct: 17 SSPAQDE-PYKNPDLSPQERADDLLKRLTLKEKISLMQNQSPGVERLGIKPYNWWSEALH 75
Query: 96 GVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-- 153
GVA G AT +P + A+ FD L I + E RA ++ + G
Sbjct: 76 GVARNGL----------ATVYPITMGMASVFDDKLIEDIYVTVSDEGRAKFHDARRHGRY 125
Query: 154 ------MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGK 207
+TFW PN+NIFRDPRWGRGQET GEDP +T + V+ V+G+QG K
Sbjct: 126 GRGNEGLTFWNPNVNIFRDPRWGRGQETWGEDPYLTTRMGVAVVQGMQGPA--DAKYD-- 181
Query: 208 LQASACCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAY 266
+ AC KH+ + K R+ FD + +DL +TY P F++ V++ +MCAY
Sbjct: 182 -KTHACAKHYAVHSGPEAK---RHSFDVENLEPRDLWETYLPAFKALVQEADVKEVMCAY 237
Query: 267 NRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKA- 325
R G P C LL++ R +WG+ + SDC A+S + +G ++ DA A
Sbjct: 238 QRFEGEPCCGSNRLLTQILRDEWGYKHLVVSDCGAISDFF-YQGRHETHPDAATSSASAV 296
Query: 326 --GMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIG 383
G D+ CG H AV++ + E ID +L L R LG + + + P+ +I
Sbjct: 297 INGTDLECG-VEYAHLDEAVERGLITEHRIDTSLRRLLEARFALGEMD-DDALVPWSRIS 354
Query: 384 ADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG 443
D V H+ +AL + +VLL N+ G+LPL K +A++GPNA + GNY G
Sbjct: 355 IDTVDCDMHRQMALDVTRKSMVLLHNN-GILPLDKGDVGKIAVMGPNAVDSVMQWGNYKG 413
Query: 444 PSCRSITPLQALQNYVENTVYYPGCD 469
+ T L+ ++ V N Y GC+
Sbjct: 414 VPAHTYTILEGIRMEVGNVPYEKGCE 439
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 37/121 (30%), Positives = 65/121 (53%), Gaps = 14/121 (11%)
Query: 480 KAVDIAKGADHVVLMMGLDQTQEKEEL-----------DRVDLVLPGRQQELITRVAEAA 528
+ V+ K A+ ++ + G+ E E+ DR + LP Q++++ + A
Sbjct: 592 EVVERVKDAETIIFVGGISPNLEGEDKYFVYCPGFAGGDRTSIELPQVQRDILKALKAAG 651
Query: 529 KKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTW 588
KK ++ + C G + + +IL A YPG+AG +A+A+V+FGD NP G+LP+T+
Sbjct: 652 KK--VVFVNCSGSA-VALVPELESCDAILQAWYPGQAGGLAVADVLFGDFNPSGKLPVTF 708
Query: 589 Y 589
Y
Sbjct: 709 Y 709
>gi|156933295|ref|YP_001437211.1| hypothetical protein ESA_01107 [Cronobacter sakazakii ATCC BAA-894]
gi|156531549|gb|ABU76375.1| hypothetical protein ESA_01107 [Cronobacter sakazakii ATCC BAA-894]
Length = 757
Score = 257 bits (657), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 218/767 (28%), Positives = 359/767 (46%), Gaps = 123/767 (16%)
Query: 38 PSTETFPFCKTTLPISQRARD-----LVSRLTLDEKISQL--VNSAPAIPRLGI------ 84
P+ + PF + P++ ARD L+ ++T+DEKI QL ++ P P+ I
Sbjct: 9 PALASEPFGEH--PLTPEARDAFVTDLLKKMTVDEKIGQLRLISVGPDNPKEAIRDMIKE 66
Query: 85 ------------PAYEWWSEALHGVAGVGKGIFFN-GTIRGA-TSFPQVILTAASFDSYL 130
P + + ++ + +FF + G T FP + A+SF+
Sbjct: 67 GQVGAIFNTVTRPDIRAMQDQVMQLSRLKIPLFFAYDVLHGQRTIFPNSLGLASSFNLDA 126
Query: 131 WYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSY 190
+G+ EA A + MT WAP +++ RDPRWGRG E GED +T +
Sbjct: 127 VKTVGRVSAYEA-----ADDGLNMT-WAPMVDVTRDPRWGRGSEGFGEDTYLTSMMGKTM 180
Query: 191 VRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPF 250
V +QG + + KHF AY +G Y ++ Q L + Y PP+
Sbjct: 181 VESMQGKS-----PADRYSVMTSVKHFAAYGA--VEGGKEYN-TVDMSSQRLFNDYMPPY 232
Query: 251 ESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEG 310
+ + G + G+M A N +NG P+ +D LL R WGF G SD A+ + G
Sbjct: 233 KEALDAG-SGGVMIALNSLNGTPAASDGWLLKDLLRGDWGFKGITISDHGAIKELI-KHG 290
Query: 311 YAKSPEDAVVDVLKAGMDVNCGS-FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGL 369
A PEDAV +K+G+D++ + K+ +K K+ +E+D A ++ +V+ +GL
Sbjct: 291 TASDPEDAVRVAIKSGVDMSMADEYYSKYLPNLIKSGKVTMAELDDATRHVLNVKYDMGL 350
Query: 370 FNGNPTMQPFGKIG------ADVVC-SPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSV 422
FN P+ +G D S H+ A + A++ +VLLKN LPL KS ++
Sbjct: 351 FN-----DPYSHLGPKDSDPKDTNAESRLHRDDARKVARESLVLLKNRLETLPLKKSGTI 405
Query: 423 SLALIGPNANSAKTLLGNY--AGPSCRSITPLQALQN---YVENTVYYPGCDTV------ 471
A++GP A+S + ++G++ AG + +S+T LQ ++N + +Y G +
Sbjct: 406 --AVVGPLADSKRDMMGSWSAAGVADQSVTLLQGMKNVAGHKAKILYAKGANVTDDKGIV 463
Query: 472 --------------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQ 517
A ID+AV++AK +D VV ++G Q E R DL +P Q
Sbjct: 464 DFLNLYEPAVVVDKRTPKAMIDEAVNVAKQSDVVVAVVGEAQGMAHEASSRTDLTIPQSQ 523
Query: 518 QELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGD 577
++LI+ + +A KP++LVL+ G P + K D+ +IL + G G A+A+V+FGD
Sbjct: 524 RDLISAL-KATGKPLVLVLMNGRP--LALVKEDQQADAILETWFAGTEGGNAIADVLFGD 580
Query: 578 HNPGGRLPMTWYPQDYIKVPMTDMKM---RPQATSGNPGRTYRFYEGKE--VFPFGCGLS 632
+NP G+LP++ +P+ ++P+ + RP T R+++ ++PFG GLS
Sbjct: 581 YNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYNPEKPNKYTSRYFDEANGPLYPFGYGLS 639
Query: 633 YSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNH 692
Y+ +S VS KL S P + + K ++ V N
Sbjct: 640 YTTFS-----VSDVKL--------------------SAPTMKRD----GKVTASVTVTNT 670
Query: 693 GEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFEL 739
G+ G V ++++ RP+KQL GF+ + L E + F +
Sbjct: 671 GKREGATAVQMYLQDVTASMSRPVKQLRGFEKITLKPGESQTVSFPI 717
>gi|423295566|ref|ZP_17273693.1| hypothetical protein HMPREF1070_02358 [Bacteroides ovatus
CL03T12C18]
gi|392672275|gb|EIY65744.1| hypothetical protein HMPREF1070_02358 [Bacteroides ovatus
CL03T12C18]
Length = 782
Score = 257 bits (657), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 205/713 (28%), Positives = 330/713 (46%), Gaps = 101/713 (14%)
Query: 81 RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
RLGIP + EA HG +G AT FP I AA++ L +GQ I
Sbjct: 129 RLGIPMF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSPELVKEVGQVIAK 176
Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
E R+ G + P +++ RDPRW R +ET GEDP+++G S V D
Sbjct: 177 EIRS------QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGTLGASMV-----DGLG 225
Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
GG L K A KHF AY + Y A V ++DL + PPF + G A
Sbjct: 226 GGNLSQKYATIATLKHFLAYAVPEGGQNGNY---ASVGIRDLHQNFLPPFRKAIDAG-AL 281
Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
+M +YN ++GIP ++ LL++ R +W F G++ SD ++ I+++ A + E+A +
Sbjct: 282 SVMTSYNSIDGIPCTSNHYLLTQLLRNEWKFRGFVVSDLYSIEGIHESHFVAPTKENAAI 341
Query: 321 DVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFG 380
+ AG+DV+ G + AV+ ++ ++ ID A+ + ++ +GLF +P + P
Sbjct: 342 QSVMAGVDVDLGGDAYTNLCHAVQSGQMDKAVIDTAVCRVLRMKFEMGLFE-HPYVDP-- 398
Query: 381 KIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGN 440
KI A V H LA + AQ I LLKN + +LPL K + +A+IGPNA++ +LG+
Sbjct: 399 KIAAKTVRRKEHIELARKIAQSSITLLKNENSILPLSKMIN-KVAVIGPNADNRYNMLGD 457
Query: 441 YAGPSCRS--ITPLQALQNYV--ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG 496
Y P S T L + + Y GC + I++A++ A+ ++ V++++G
Sbjct: 458 YTAPQEDSNVKTVLDGIITKLSPSRVEYVRGCAIRDTTVNEIEQAIEAARRSEVVIVVVG 517
Query: 497 LDQTQE-----------------------KEELDRVDLVLPGRQQELITRVAEAAKKPVI 533
++ E DR L L GRQQEL+ + + KP+I
Sbjct: 518 GSSARDFKTSYKETGAAVAEEGSVSDMECGEGFDRASLSLLGRQQELLESLQKTG-KPLI 576
Query: 534 LVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPMTWYPQDY 593
+V + G P++ +A ++L A YPG+ G A+A+V+FGD+NP GRLP++ P+
Sbjct: 577 VVYIEGRPLEKNWA--SEYADALLTAYYPGQEGGNAIADVLFGDYNPSGRLPIS-VPRSV 633
Query: 594 IKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQNKLYLNQS 653
++P+ + P+ Y ++ FG G+SY+ + Y V Q
Sbjct: 634 GQIPVYYNQKAPR------NHDYVEVSSSPLYSFGYGMSYTTFEYSDLQVVQKS------ 681
Query: 654 SSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKHPVLLFVKPARRGNG 713
R F V+ VKN G+ G+ L+++
Sbjct: 682 --------------------------ARCFEVSFKVKNTGKYDGEEVSQLYMRDEYASVV 715
Query: 714 RPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEGTHFLVVG 766
+P+KQL F+ L E+ ++ F L+ E V+E G L++G
Sbjct: 716 QPMKQLKHFERFHLKKGEEKKVTFVLTE-EDFFLVNYTLKKVVESGNFHLMIG 767
>gi|282878201|ref|ZP_06286997.1| glycosyl hydrolase family 3 C-terminal domain protein [Prevotella
buccalis ATCC 35310]
gi|281299619|gb|EFA91992.1| glycosyl hydrolase family 3 C-terminal domain protein [Prevotella
buccalis ATCC 35310]
Length = 947
Score = 257 bits (657), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 223/804 (27%), Positives = 362/804 (45%), Gaps = 147/804 (18%)
Query: 49 TLPISQRARDLVSRLTLDEKISQLV----------NSAP------AIPRLGIPAY----- 87
T I +R DL+S++TL+EK Q+V + P + + GI A
Sbjct: 63 TAGIDERINDLISQMTLEEKTCQMVTLYGYKRVLQDDLPNESWKQKLWKDGIGAIDEHLN 122
Query: 88 ---EW----------WSEALHG----------VAGVGKGI---FFNGTIRG-----ATSF 116
EW W + H V GI F N IRG AT+F
Sbjct: 123 GFQEWGLPPSDNPWVWPASKHAWAINEVQRFFVEQTRLGIPVDFTNEGIRGVESFKATNF 182
Query: 117 PQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQET 175
P + +++ L +++G G EAR L G T +AP +++ RD RWGR +E
Sbjct: 183 PTQLGLGTTWNRKLIHQVGYITGREARLL-------GYTNVYAPILDVGRDQRWGRYEEV 235
Query: 176 PGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDA 235
GE P + + + RG+Q + Q ++ KHF AY + + D
Sbjct: 236 YGESPFLVAELGIQMTRGLQTN----------YQVASTGKHFAAYSNNKGAREGMARVDP 285
Query: 236 RVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYI 295
+++ +++ + + P+ V++ G M +YN +G+P + L++ R+Q+GF GY+
Sbjct: 286 QMSPREVQNIHLYPWGRVVREAGLLGAMSSYNDYDGVPIQGSFHWLTEVLRQQFGFKGYV 345
Query: 296 TSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC-----GSFLQKHTKAAVKQKKLPE 350
SD DA+ ++ A + ++AV + AG++V C SF+ + VK+ ++P
Sbjct: 346 VSDSDALEYLFSKHRTAANMKEAVYKAVMAGLNVRCTFRSPDSFVLP-LRELVKEGRIPM 404
Query: 351 SEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNS 410
ID L ++ V+ +G+F+ M + V +HQ +ALQA+++ IVLLKN
Sbjct: 405 KVIDERLRDILRVKFMVGIFDRPYQMNL--QAADKEVDGKSHQQVALQASRESIVLLKNQ 462
Query: 411 HGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV---YYPG 467
+ LPL K+ +A+ GPNAN A L +Y + T + ++N V + V Y G
Sbjct: 463 NNTLPLNKASIKKIAVCGPNANDAAYALTHYGPLAVEVTTVFEGIRNKVGSDVEVTYTKG 522
Query: 468 CDTV---------------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLV 512
CD V A IDKAV+ + +D V+++G + E R L
Sbjct: 523 CDLVDAHWPESELVDYPMTADEQNEIDKAVEQVRQSDVAVVVLGGNSRTCGENKSRSSLE 582
Query: 513 LPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAE 572
LPGRQ +L+ + +A K VILVL+ G P+ + +A D+ + +I+ A YPG G A+A+
Sbjct: 583 LPGRQLQLL-KAVQATGKTVILVLINGRPLSVNWA--DKFVPAIVEAWYPGSQGGTAVAD 639
Query: 573 VIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQA---------TSGNPGRTYRFYEGKE 623
V+FGD+NPGG+L +T +P+ ++P + +P A GN R
Sbjct: 640 VLFGDYNPGGKLTVT-FPKTVGQIPF-NFPSKPAALVDGGNKLGLHGNASRA-----NGA 692
Query: 624 VFPFGCGLSYSKYSY-KFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRK 682
++ FG GLSY+ + Y + +QN T
Sbjct: 693 LYYFGHGLSYTTFKYSNLRLSAQN------------------------------ISPTDS 722
Query: 683 FLVTIGVKNHGEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPC 742
+V+ + N G+ AG V L+++ K L GF+ V L E + F + P
Sbjct: 723 VVVSCDITNTGQRAGDEVVQLYIQDVLSTVTTYEKNLRGFERVHLKPGETRTLSFVIKP- 781
Query: 743 ESLSRAREDGLMVIEEGTHFLVVG 766
E L E V+E G +++G
Sbjct: 782 EHLQLINEQYQHVVEPGDFKVMMG 805
>gi|62733930|gb|AAX96039.1| Similar to F28K19.27 [Oryza sativa Japonica Group]
Length = 244
Score = 257 bits (657), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 120/193 (62%), Positives = 146/193 (75%), Gaps = 9/193 (4%)
Query: 45 FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
FC L QRA DLV+ LTL EK+SQL + A + RLG+PAYEWWSE LHG++ G+GI
Sbjct: 50 FCDAWLTAEQRAADLVANLTLAEKVSQLGDRAAGVARLGVPAYEWWSEGLHGLSIWGRGI 109
Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIF 164
FNGT+R TSFPQVILTAA+FD+ LW R+G+A+G EARALYN GQA G+T W+PN+NIF
Sbjct: 110 RFNGTVRAVTSFPQVILTAAAFDAGLWRRVGEAVGAEARALYNLGQANGLTIWSPNVNIF 169
Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
RDPRWGRGQETPGEDP+ +YAV++V G+QG GG +ASACCKH TAYDLD
Sbjct: 170 RDPRWGRGQETPGEDPVTASRYAVAFVTGLQGI---GG------EASACCKHATAYDLDY 220
Query: 225 WKGTTRYKFDARV 237
W RY +D++V
Sbjct: 221 WNNVVRYNYDSKV 233
>gi|293373755|ref|ZP_06620101.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
gi|292631245|gb|EFF49877.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
Length = 800
Score = 257 bits (656), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 225/794 (28%), Positives = 365/794 (45%), Gaps = 148/794 (18%)
Query: 51 PISQRARDLVSRLTLDEKISQLVNS-------APAIPRLGIPAYEWWSEALHGV----AG 99
PI R DL+S++TL+EK Q+ A P G A E W + + + G
Sbjct: 62 PIEARIADLLSQMTLEEKTCQMATLYGSGRVLKDAWPTAGWSA-EIWKDGIGNIDEQANG 120
Query: 100 VGK-----------------------------GI---FFNGTIRG-----ATSFPQVILT 122
+GK GI F N IRG AT FP
Sbjct: 121 LGKFGSEISYPYANSVKNRHTIQRWFMEQTRLGIPVDFTNEGIRGLCHDRATMFPAQCGQ 180
Query: 123 AASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGEDPL 181
A+++ L I + EA+AL G T ++P ++I +DPRWGR E+ GEDP
Sbjct: 181 GATWNKKLIREIAKVTADEAKAL-------GYTNIYSPILDIAQDPRWGRVVESYGEDPY 233
Query: 182 VTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL--DNWKGTTRYKFDARVTM 239
+ G+ + G+Q + A KHF Y + G TR D V
Sbjct: 234 LVGELGKQMILGLQNEGI-----------VATPKHFAVYSIPVGGRDGGTRT--DPHVAP 280
Query: 240 QDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDC 299
+++ Y PF +++ A G+M +YN +G P + L++ R+QWGF GY+ SD
Sbjct: 281 REMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYVVSDS 340
Query: 300 DAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-----SFLQKHTKAAVKQKKLPESEID 354
+AV ++ + E+ V+ AG+++ F+ + A+ + K+ +D
Sbjct: 341 EAVEFLHTKHRITPTEEEMAAQVVNAGLNIRTNFTPPQDFILP-LRRAINEGKVSLHTLD 399
Query: 355 RALHNLFSVRMRLGLFNGNP----TMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNS 410
+ + + V+ +GLF+ NP +P VV + AH+ ++++AA + IVLLKN
Sbjct: 400 QRVGEILRVKFMMGLFD-NPYPGDDRRP-----ETVVHNDAHKAVSMKAALESIVLLKNE 453
Query: 411 HGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI-TPLQALQNYVENTV--YYPG 467
+ +LPL K+ S +A+IGPN K L Y GP+ SI T Q ++ Y+ N+ Y G
Sbjct: 454 NQMLPLSKNFS-KIAVIGPNGEEVKELTCRY-GPANASIKTVYQGIKEYLPNSEVRYVKG 511
Query: 468 CDTV---------------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLV 512
CD + A I +AV++AK +D +L++G ++ +EE R +L
Sbjct: 512 CDIIDKYFPESELYNVPLDTQEQAMIHEAVELAKASDVAILVLGGNEKTVREEFSRTNLD 571
Query: 513 LPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAE 572
L GRQQ+L+ V A KPV+LV++ G I +A ++ + +I+ A +PGE A+A+
Sbjct: 572 LCGRQQQLLEAVY-ATGKPVVLVMVDGRAATINWA--NKYVPAIIHAWFPGEFMGDAIAK 628
Query: 573 VIFGDHNPGGRLPMTWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLS 632
V+FGD+NPGGRL +T +P+ ++P +P + S R ++PFG GLS
Sbjct: 629 VLFGDYNPGGRLAVT-FPKSVGQIPFA-FPFKPGSDSKGKVRV-----DGVLYPFGYGLS 681
Query: 633 YSKYSYKFKAVSQNKLYLNQSSSTKMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNH 692
Y+ + Y +S+ P +G + ++ VKN
Sbjct: 682 YTTFGYSDLKISK-------------------------PVIGPQ----ENITLSCTVKNT 712
Query: 693 GEMAGKHPVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDG 752
G+ AG V L+++ K L GF+ + L E+ + F L+P + L ++
Sbjct: 713 GKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVNFTLTP-QDLGLWDKNN 771
Query: 753 LMVIEEGTHFLVVG 766
+E G+ ++VG
Sbjct: 772 RFTVEPGSFSVMVG 785
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.320 0.136 0.416
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 12,501,646,230
Number of Sequences: 23463169
Number of extensions: 545759960
Number of successful extensions: 1281309
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 5739
Number of HSP's successfully gapped in prelim test: 1979
Number of HSP's that attempted gapping in prelim test: 1228513
Number of HSP's gapped (non-prelim): 16538
length of query: 776
length of database: 8,064,228,071
effective HSP length: 151
effective length of query: 625
effective length of database: 8,816,256,848
effective search space: 5510160530000
effective search space used: 5510160530000
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 81 (35.8 bits)