BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 003606
(808 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|225469218|ref|XP_002264031.1| PREDICTED: probable beta-D-xylosidase 6-like [Vitis vinifera]
Length = 789
Score = 1197 bits (3097), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 569/797 (71%), Positives = 670/797 (84%), Gaps = 17/797 (2%)
Query: 13 RILFRVLQLIVVVNVIAFSNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLL 72
R LF L + V+ + + S S P FPC PP YPFCNTSL ISTRA+SL+SLL
Sbjct: 6 RFLFICL-FLQVLPLFSISES----THPQFPCMPPTNSDYPFCNTSLPISTRAQSLVSLL 60
Query: 73 TLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSA 132
TL EKIQQLSD A+AIPRL IPAYEWWSESLHGIA+NGPGV+FNGTVS+ TSFPQVL++A
Sbjct: 61 TLSEKIQQLSDEAAAIPRLYIPAYEWWSESLHGIATNGPGVSFNGTVSAATSFPQVLLTA 120
Query: 133 ASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVS 192
ASFNRSLW +IGSA+AVEARAMYN+GQAGLTFWAPNINIFRDPRWGRGQETPGEDPMV S
Sbjct: 121 ASFNRSLWFSIGSAIAVEARAMYNVGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVAS 180
Query: 193 AYAVEFVKSFQGENWKSDD---GGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAY 249
AYAVEFV+ FQG NWK D G +G +KRVL+G +SD GD LMLSACCKHL AY
Sbjct: 181 AYAVEFVRGFQGGNWKGGDEIRGAVG----KKRVLRG---DSD-GDGLMLSACCKHLTAY 232
Query: 250 DLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDL 309
DLEKWGNFSRYSF+A+++ QD EDT+QPPFRSC++QGKASC+MCSYN+VNGVPAC R DL
Sbjct: 233 DLEKWGNFSRYSFDAVVSNQDLEDTYQPPFRSCVQQGKASCLMCSYNRVNGVPACARQDL 292
Query: 310 FQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQS 369
FQKA+ EWGFKGYITSDCDAVAT++EYQ+Y + ED+ A VLKAG DINCG+ MLRHTQS
Sbjct: 293 FQKAKTEWGFKGYITSDCDAVATVYEYQHYANSPEDAVADVLKAGTDINCGSYMLRHTQS 352
Query: 370 AIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAAR 429
AID+GKV+E+DIDRAL NLFSVQ+RLGLF+GDP G YG LGP DVCT EH+ LAL+AAR
Sbjct: 353 AIDQGKVKEEDIDRALFNLFSVQMRLGLFDGDPANGLYGNLGPKDVCTKEHRTLALEAAR 412
Query: 430 QGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSK 489
QGIVLLKNDKKFLPL+K+ +SSLAIIGP + +GGGYTGIPC P+SL+ GL+ YV K
Sbjct: 413 QGIVLLKNDKKFLPLDKSRISSLAIIGPQADQ-PFLGGGYTGIPCKPESLVEGLKTYVEK 471
Query: 490 THYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSL 549
T +A+GC DVPC SD GF EAV IA+KAD V+VVAGLDL+QETED DRVSLLLPG+QM+L
Sbjct: 472 TSFAAGCVDVPCLSDTGFDEAVSIARKADIVVVVAGLDLSQETEDHDRVSLLLPGKQMAL 531
Query: 550 VTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNP 609
++SVA ++P++LVLTGGGPLDVSFAE D +I+SILWIGYPGEAGAKALAEIIFGDFNP
Sbjct: 532 ISSVASAIQKPLVLVLTGGGPLDVSFAEQDPRIASILWIGYPGEAGAKALAEIIFGDFNP 591
Query: 610 GGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKF 669
GGRLPMTWYPESFT+VPMNDMNMRAD R YPGR+YRFY G +VYGFG GLSYT ++Y+F
Sbjct: 592 GGRLPMTWYPESFTRVPMNDMNMRADPYRGYPGRTYRFYIGHRVYGFGQGLSYTKFAYQF 651
Query: 670 LSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGS 729
+SAP++L + S S KN+ +Q ++Y HI+E+ +C SLRFHV+ISVTN GD+DGS
Sbjct: 652 VSAPNKLNLLRSSDTVSSKNLPRQRREEVNYFHIEELDTCDSLRFHVEISVTNVGDMDGS 711
Query: 730 HVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPL 789
HVVMLF+RVPK+ +GTPEKQLIGF RVHTV++ S E S VDPCE SIAN+ G+RI+PL
Sbjct: 712 HVVMLFSRVPKIVKGTPEKQLIGFSRVHTVSRRSTETSIMVDPCEHFSIANEQGKRIMPL 771
Query: 790 GNHVLMVGELRHSLTIE 806
G+H +M+G++ HS+++E
Sbjct: 772 GDHTIMLGDVVHSVSVE 788
>gi|296084630|emb|CBI25718.3| unnamed protein product [Vitis vinifera]
Length = 768
Score = 1176 bits (3041), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 556/794 (70%), Positives = 655/794 (82%), Gaps = 32/794 (4%)
Query: 13 RILFRVLQLIVVVNVIAFSNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLL 72
R LF L + V+ + + S S P FPC PP YPFCNTSL ISTRA+SL+SLL
Sbjct: 6 RFLFICL-FLQVLPLFSISES----THPQFPCMPPTNSDYPFCNTSLPISTRAQSLVSLL 60
Query: 73 TLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSA 132
TL EKIQQLSD A+AIPRL IPAYEWWSESLHGIA+NGPGV+FNGTVS+ TSFPQVL++A
Sbjct: 61 TLSEKIQQLSDEAAAIPRLYIPAYEWWSESLHGIATNGPGVSFNGTVSAATSFPQVLLTA 120
Query: 133 ASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVS 192
ASFNRSLW +IGSA+AVEARAMYN+GQAGLTFWAPNINIFRDPRWGRGQETPGEDPMV S
Sbjct: 121 ASFNRSLWFSIGSAIAVEARAMYNVGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVAS 180
Query: 193 AYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLE 252
AYAVEFV+ FQG++ GD LMLSACCKHL AYDLE
Sbjct: 181 AYAVEFVRGFQGDS--------------------------DGDGLMLSACCKHLTAYDLE 214
Query: 253 KWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
KWGNFSRYSF+A+++ QD EDT+QPPFRSC++QGKASC+MCSYN+VNGVPAC R DLFQK
Sbjct: 215 KWGNFSRYSFDAVVSNQDLEDTYQPPFRSCVQQGKASCLMCSYNRVNGVPACARQDLFQK 274
Query: 313 ARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAID 372
A+ EWGFKGYITSDCDAVAT++EYQ+Y + ED+ A VLKAG DINCG+ MLRHTQSAID
Sbjct: 275 AKTEWGFKGYITSDCDAVATVYEYQHYANSPEDAVADVLKAGTDINCGSYMLRHTQSAID 334
Query: 373 KGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGI 432
+GKV+E+DIDRAL NLFSVQ+RLGLF+GDP G YG LGP DVCT EH+ LAL+AARQGI
Sbjct: 335 QGKVKEEDIDRALFNLFSVQMRLGLFDGDPANGLYGNLGPKDVCTKEHRTLALEAARQGI 394
Query: 433 VLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHY 492
VLLKNDKKFLPL+K+ +SSLAIIGP + +GGGYTGIPC P+SL+ GL+ YV KT +
Sbjct: 395 VLLKNDKKFLPLDKSRISSLAIIGPQADQ-PFLGGGYTGIPCKPESLVEGLKTYVEKTSF 453
Query: 493 ASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTS 552
A+GC DVPC SD GF EAV IA+KAD V+VVAGLDL+QETED DRVSLLLPG+QM+L++S
Sbjct: 454 AAGCVDVPCLSDTGFDEAVSIARKADIVVVVAGLDLSQETEDHDRVSLLLPGKQMALISS 513
Query: 553 VARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGR 612
VA ++P++LVLTGGGPLDVSFAE D +I+SILWIGYPGEAGAKALAEIIFGDFNPGGR
Sbjct: 514 VASAIQKPLVLVLTGGGPLDVSFAEQDPRIASILWIGYPGEAGAKALAEIIFGDFNPGGR 573
Query: 613 LPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSA 672
LPMTWYPESFT+VPMNDMNMRAD R YPGR+YRFY G +VYGFG GLSYT ++Y+F+SA
Sbjct: 574 LPMTWYPESFTRVPMNDMNMRADPYRGYPGRTYRFYIGHRVYGFGQGLSYTKFAYQFVSA 633
Query: 673 PSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVV 732
P++L + S S KN+ +Q ++Y HI+E+ +C SLRFHV+ISVTN GD+DGSHVV
Sbjct: 634 PNKLNLLRSSDTVSSKNLPRQRREEVNYFHIEELDTCDSLRFHVEISVTNVGDMDGSHVV 693
Query: 733 MLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNH 792
MLF+RVPK+ +GTPEKQLIGF RVHTV++ S E S VDPCE SIAN+ G+RI+PLG+H
Sbjct: 694 MLFSRVPKIVKGTPEKQLIGFSRVHTVSRRSTETSIMVDPCEHFSIANEQGKRIMPLGDH 753
Query: 793 VLMVGELRHSLTIE 806
+M+G++ HS+++E
Sbjct: 754 TIMLGDVVHSVSVE 767
>gi|224058158|ref|XP_002299457.1| predicted protein [Populus trichocarpa]
gi|222846715|gb|EEE84262.1| predicted protein [Populus trichocarpa]
Length = 780
Score = 1155 bits (2989), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 564/780 (72%), Positives = 654/780 (83%), Gaps = 21/780 (2%)
Query: 31 SNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPR 90
SNSK V N P FPCKPP ++Y FCN SL I+ RA+SLIS LTLQEKIQQLSDNAS IPR
Sbjct: 20 SNSKSVAN-PQFPCKPPTHNTYSFCNKSLPITRRAQSLISHLTLQEKIQQLSDNASGIPR 78
Query: 91 LGIPAYEWWSESLHGIASNGPGVNFN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVA 148
LGIP YEWWSESLHGI+ NGPGV+F G V+S T FPQV+VSAASFNR+LW IGSA+A
Sbjct: 79 LGIPHYEWWSESLHGISINGPGVSFKNGGPVTSATGFPQVIVSAASFNRTLWFLIGSAIA 138
Query: 149 VEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
+EARAMYN+GQAGLTFWAPNINIFRDPRWGRGQETPGEDPMV SAYA+EFVK FQG +WK
Sbjct: 139 IEARAMYNVGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVASAYAIEFVKGFQGGHWK 198
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
++DG I D+LMLSACCKH AYDLEKWGNFSRYSFNA++TE
Sbjct: 199 NEDGEI------------------NDDKLMLSACCKHSTAYDLEKWGNFSRYSFNAVVTE 240
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCD 328
QD EDT+QPPFRSCI++GKASC+MCSYN+VNGVPAC R DL QK R EWGFKGYITSDCD
Sbjct: 241 QDMEDTYQPPFRSCIQKGKASCLMCSYNEVNGVPACAREDLLQKPRTEWGFKGYITSDCD 300
Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
AVATIFEYQNY+K+ ED+ A LKAGMDINCGT +LR+ QSA++KGK+QE+DIDRAL NL
Sbjct: 301 AVATIFEYQNYSKSPEDAVAIALKAGMDINCGTYVLRNAQSAVEKGKLQEEDIDRALHNL 360
Query: 389 FSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNA 448
FSVQLRLGLF+GDPRKG++GKLGP +VCT EHK LAL+AARQGIVLLKNDKK LPLNK A
Sbjct: 361 FSVQLRLGLFDGDPRKGQFGKLGPKNVCTKEHKTLALEAARQGIVLLKNDKKLLPLNKKA 420
Query: 449 VSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFH 508
VSSLAIIGPL N + +GG YTG PC P+SL GL+AYV KT YA GC DV C SD FH
Sbjct: 421 VSSLAIIGPLANMANSLGGDYTGYPCDPQSLFEGLKAYVKKTSYAIGCLDVACVSDTQFH 480
Query: 509 EAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGG 568
+A+ +AK+ADFVI+VAGLDL+QETE+ DRVSLLLPG+QMSLV+SVA SK+PVILVLTGG
Sbjct: 481 KAIIVAKRADFVIIVAGLDLSQETEEHDRVSLLLPGKQMSLVSSVAAASKKPVILVLTGG 540
Query: 569 GPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMN 628
GPLDVSFA+ D +I+SILWIGYPGEAGAKALAEIIFG++NPGGRLPMTWYPESFT+V M
Sbjct: 541 GPLDVSFAKGDPRIASILWIGYPGEAGAKALAEIIFGEYNPGGRLPMTWYPESFTEVSMT 600
Query: 629 DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDK 688
DMNMR + SR YPGR+YRFYTG +VYGFG GLSYTN++YK LSAPS+L++S SL + S K
Sbjct: 601 DMNMRPNPSRGYPGRTYRFYTGNRVYGFGGGLSYTNFTYKILSAPSKLSLSGSLSSNSRK 660
Query: 689 NILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEK 748
ILQQ G RL Y++I+E+TSC SLRF++QI V N G++DG HVVMLF+RVP V +G PEK
Sbjct: 661 RILQQGGERLSYININEITSCDSLRFYMQILVENVGNMDGGHVVMLFSRVPTVFRGAPEK 720
Query: 749 QLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIETY 808
QL+GFDRVHT++ S E+S VDPCE LS+AN+ G++I+ LG H LM+G+L H +TI+ Y
Sbjct: 721 QLVGFDRVHTISHRSTEMSILVDPCEHLSVANEQGKKIMLLGGHGLMLGDLEHFVTIQIY 780
>gi|357485313|ref|XP_003612944.1| Beta-D-xylosidase [Medicago truncatula]
gi|355514279|gb|AES95902.1| Beta-D-xylosidase [Medicago truncatula]
Length = 783
Score = 1107 bits (2863), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 528/807 (65%), Positives = 634/807 (78%), Gaps = 27/807 (3%)
Query: 5 FMSIQLRERILFRVLQLIVVVNVIAFSNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTR 64
M +Q R I+ + L+++ + PD+PCKPPH YPFCN SL ISTR
Sbjct: 1 MMFLQQRSTIIIFLFSLLLI-------HLPKFFTTPDYPCKPPH-SHYPFCNISLPISTR 52
Query: 65 AKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTS 124
SLISLLTL +KI QLS+ AS+I LGIP+Y+WWSE+LHGIA+NGPGVNFNG+V S T+
Sbjct: 53 TTSLISLLTLSDKINQLSNTASSISHLGIPSYQWWSEALHGIATNGPGVNFNGSVKSATN 112
Query: 125 FPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETP 184
FPQV+VSAA+FNRSLW IG AV VE RAM+N+GQAGL+FWAPN+N+FRDPRWGRGQETP
Sbjct: 113 FPQVIVSAAAFNRSLWFLIGYAVGVEGRAMFNVGQAGLSFWAPNVNVFRDPRWGRGQETP 172
Query: 185 GEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCK 244
GEDPMV SAYAVEFV+ QG + GI K+VL + D LM+SACCK
Sbjct: 173 GEDPMVGSAYAVEFVRGIQGVD------GI------KKVLN---DHDSDDDGLMVSACCK 217
Query: 245 HLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC 304
H AYDLEKWG FSRY+FNA++T+QD EDT+QPPFR C++QGKASC+MCSYN+VNGVPAC
Sbjct: 218 HFTAYDLEKWGEFSRYNFNAVVTQQDLEDTYQPPFRGCVQQGKASCLMCSYNEVNGVPAC 277
Query: 305 LRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML 364
DL RN+WGF+GYI SDCDAVAT+FEYQ Y K+ ED+ A VLKAGMDINCGT ML
Sbjct: 278 ASKDLLGLVRNKWGFEGYIASDCDAVATVFEYQKYAKSAEDAVADVLKAGMDINCGTFML 337
Query: 365 RHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLA 424
RHT+SAI++G V+E+D+DRAL NLFSVQ+RLGLFNGDP KGK+GKLGP DVCT EHKKLA
Sbjct: 338 RHTESAIEQGLVKEEDLDRALFNLFSVQMRLGLFNGDPEKGKFGKLGPQDVCTPEHKKLA 397
Query: 425 LDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLE 484
L+AARQGIVLLKND KFLPL+K SLAIIGP+ S++GGGY+GIPCSP+SL GL+
Sbjct: 398 LEAARQGIVLLKNDNKFLPLDKKDRVSLAIIGPMATT-SELGGGYSGIPCSPRSLYDGLK 456
Query: 485 AYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPG 544
YV YA GC DV C+SD GF A+ IAK+ADFV++VAGLD T ETED DRVSLLLPG
Sbjct: 457 EYVKTISYAFGCSDVKCDSDDGFAVAIDIAKQADFVVIVAGLDTTLETEDLDRVSLLLPG 516
Query: 545 QQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIF 604
+QM LV+ VA SKRPVILVLTGGGPLDVSFAE++ I+SILWIGYPGEAG KALAEIIF
Sbjct: 517 KQMDLVSRVAAASKRPVILVLTGGGPLDVSFAESNQLITSILWIGYPGEAGGKALAEIIF 576
Query: 605 GDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTN 664
G+FNP GRLPMTWYPESFT VPMNDM MRAD SR YPGR+YRFYTG+++YGFGHGLSY++
Sbjct: 577 GEFNPAGRLPMTWYPESFTNVPMNDMGMRADPSRGYPGRTYRFYTGSRIYGFGHGLSYSD 636
Query: 665 YSYKFLSAPSELTISASLKAGSDKNILQQTGS---RLDYVHIDEVTSCTSLRFHVQISVT 721
+SY+ LSAPS+L++S + G +++L + +D+VH+DE+ +C SL F V ISV
Sbjct: 637 FSYRVLSAPSKLSLSKTTNGGLRRSLLNKVEKDVFEVDHVHVDELQNCNSLSFSVHISVM 696
Query: 722 NAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANK 781
N GD+DGSHVVMLF++ PK QG+PE QL+G R+HTV+ S E S DPCE S A++
Sbjct: 697 NVGDMDGSHVVMLFSKWPKNIQGSPESQLVGPSRLHTVSNKSIETSILADPCEHFSFADE 756
Query: 782 HGRRILPLGNHVLMVGELRHSLTIETY 808
G+RILPLGNH+L VG++ H ++IE Y
Sbjct: 757 QGKRILPLGNHILNVGDVEHIVSIEIY 783
>gi|297811163|ref|XP_002873465.1| glycosyl hydrolase family 3 protein [Arabidopsis lyrata subsp.
lyrata]
gi|297319302|gb|EFH49724.1| glycosyl hydrolase family 3 protein [Arabidopsis lyrata subsp.
lyrata]
Length = 796
Score = 1106 bits (2860), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 531/803 (66%), Positives = 636/803 (79%), Gaps = 28/803 (3%)
Query: 18 VLQLIVVVNVIAFSNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEK 77
++ L+ + IA K + + P FPCKPPHF SYPFCN SLSI RA SL+SLLTL EK
Sbjct: 7 LISLVFFTSAIA-ETFKNLDSHPQFPCKPPHFSSYPFCNVSLSIKQRAISLVSLLTLPEK 65
Query: 78 IQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNR 137
I QLS A+++PRLGIP YEWWSESLHG+A NGPGV+FNG++S+ TSFPQV+VSAASFNR
Sbjct: 66 IGQLSTTAASVPRLGIPPYEWWSESLHGLADNGPGVSFNGSISAATSFPQVIVSAASFNR 125
Query: 138 SLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVE 197
+LW IGSAVAVEARAMYN GQAGLTFWAPNIN+FRDPRWGRGQETPGEDP VVS Y VE
Sbjct: 126 TLWYEIGSAVAVEARAMYNGGQAGLTFWAPNINLFRDPRWGRGQETPGEDPKVVSEYGVE 185
Query: 198 FVKSFQGENWKSDDGGIGFGFREKRVLKG-FGEES---------DRGDELMLSACCKHLI 247
FV+ FQ + ++++VLK FG ++ D +LMLSACCKH
Sbjct: 186 FVRGFQEK-------------KKRKVLKTRFGSDNVDDDARYDDDADGKLMLSACCKHFT 232
Query: 248 AYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRG 307
AYDLEKWGNF+RY FNA++TEQD EDT+QPPF +CI+ GKASC+MCSYN VNGVPAC +G
Sbjct: 233 AYDLEKWGNFTRYDFNAVVTEQDMEDTYQPPFETCIKDGKASCLMCSYNAVNGVPACAQG 292
Query: 308 DLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHT 367
DL QKAR EWGF GYITSDCDAVATIFEYQ YTK+ E++ A +KAG+DINCGT MLR+T
Sbjct: 293 DLLQKARVEWGFDGYITSDCDAVATIFEYQGYTKSPEEAVADAIKAGVDINCGTYMLRNT 352
Query: 368 QSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDA 427
QSAI++GKV E+ +DRALLNLF+VQLRLGLF+GDPR G YGKLG +D+C+S+H+KLAL+A
Sbjct: 353 QSAIEQGKVSEELVDRALLNLFAVQLRLGLFDGDPRGGHYGKLGSNDICSSDHRKLALEA 412
Query: 428 ARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV 487
ARQGIVLLKND K LPLNKN VSSLAI+GP+ NNIS MGG YTG PC K+L L YV
Sbjct: 413 ARQGIVLLKNDYKLLPLNKNHVSSLAIVGPMANNISNMGGTYTGKPCQRKTLFTELLEYV 472
Query: 488 SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQM 547
KT YASGC DV C SD GF EAV IAK ADFVIVVAGLDL+QETED+DR SL LPG+Q
Sbjct: 473 KKTSYASGCSDVSCVSDTGFGEAVAIAKGADFVIVVAGLDLSQETEDKDRFSLSLPGKQK 532
Query: 548 SLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDF 607
LV+SVA SK+PVILVLTGGGP+DV+FA+ D +I SI+WIGYPGE G +ALAEIIFGDF
Sbjct: 533 DLVSSVAAVSKKPVILVLTGGGPVDVTFAKTDPRIGSIIWIGYPGETGGQALAEIIFGDF 592
Query: 608 NPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSY 667
NPGGRLP+TWYPESF VPM+DM+MRADSSR YPGR+YRFYTG QVY FG GLSYT + Y
Sbjct: 593 NPGGRLPITWYPESFADVPMSDMHMRADSSRGYPGRTYRFYTGPQVYSFGTGLSYTKFDY 652
Query: 668 KFLSAPSELTISASL--KAGSDKNILQQTGSRLDYVHIDE--VTSCTSLRFHVQISVTNA 723
K +SAP L++S L ++ K +LQ +L Y+ +D+ V SC SLRF+V+++V N
Sbjct: 653 KIISAPIRLSLSELLPQQSSHKKQLLQHGEEQLQYIQLDDVMVNSCESLRFNVRVNVRNT 712
Query: 724 GDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHG 783
G++DGSHV+MLF+++ +V G PEKQLIGFDRVH + E F +DPC+ LS+AN G
Sbjct: 713 GEIDGSHVLMLFSKMARVLSGVPEKQLIGFDRVHIRSNEMMETVFVIDPCKYLSVANDVG 772
Query: 784 RRILPLGNHVLMVGELRHSLTIE 806
+R++PLG H L +G+L+HSL++E
Sbjct: 773 KRVIPLGIHALFLGDLQHSLSVE 795
>gi|15238197|ref|NP_196618.1| putative beta-D-xylosidase 6 [Arabidopsis thaliana]
gi|75264319|sp|Q9LXA8.1|BXL6_ARATH RecName: Full=Probable beta-D-xylosidase 6; Short=AtBXL6; Flags:
Precursor
gi|7671447|emb|CAB89387.1| beta-xylosidase-like protein [Arabidopsis thaliana]
gi|15982753|gb|AAL09717.1| AT5g10560/F12B17_90 [Arabidopsis thaliana]
gi|332004180|gb|AED91563.1| putative beta-D-xylosidase 6 [Arabidopsis thaliana]
Length = 792
Score = 1104 bits (2856), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 532/803 (66%), Positives = 631/803 (78%), Gaps = 14/803 (1%)
Query: 6 MSIQLRERILFRVLQLIVVVNVIAFSNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTRA 65
M++QL ++ L+ + IA K + + P FPCKPPHF SYPFCN SLSI RA
Sbjct: 1 MNLQLT------LISLLFFTSAIA-ETFKNLDSHPQFPCKPPHFSSYPFCNVSLSIKQRA 53
Query: 66 KSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSF 125
SL+SLL L EKI QLS+ A+++PRLGIP YEWWSESLHG+A NGPGV+FNG++S+ TSF
Sbjct: 54 ISLVSLLMLPEKIGQLSNTAASVPRLGIPPYEWWSESLHGLADNGPGVSFNGSISAATSF 113
Query: 126 PQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPG 185
PQV+VSAASFNR+LW IGSAVAVE RAMYN GQAGLTFWAPNIN+FRDPRWGRGQETPG
Sbjct: 114 PQVIVSAASFNRTLWYEIGSAVAVEGRAMYNGGQAGLTFWAPNINVFRDPRWGRGQETPG 173
Query: 186 EDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKH 245
EDP VVS Y VEFV+ FQ E K F + D +LMLSACCKH
Sbjct: 174 EDPKVVSEYGVEFVRGFQ-EKKKRKVLKRRFSDDVDDDRH----DDDADGKLMLSACCKH 228
Query: 246 LIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACL 305
AYDLEKWGNF+RY FNA++TEQD EDT+QPPF +CI GKASC+MCSYN VNGVPAC
Sbjct: 229 FTAYDLEKWGNFTRYDFNAVVTEQDMEDTYQPPFETCIRDGKASCLMCSYNAVNGVPACA 288
Query: 306 RGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLR 365
+GDL QKAR EWGF+GYITSDCDAVATIF YQ YTK+ E++ A +KAG+DINCGT MLR
Sbjct: 289 QGDLLQKARVEWGFEGYITSDCDAVATIFAYQGYTKSPEEAVADAIKAGVDINCGTYMLR 348
Query: 366 HTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLAL 425
HTQSAI++GKV E+ +DRALLNLF+VQLRLGLF+GDPR+G+YGKLG +D+C+S+H+KLAL
Sbjct: 349 HTQSAIEQGKVSEELVDRALLNLFAVQLRLGLFDGDPRRGQYGKLGSNDICSSDHRKLAL 408
Query: 426 DAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEA 485
+A RQGIVLLKND K LPLNKN VSSLAI+GP+ NNIS MGG YTG PC K+L L
Sbjct: 409 EATRQGIVLLKNDHKLLPLNKNHVSSLAIVGPMANNISNMGGTYTGKPCQRKTLFTELLE 468
Query: 486 YVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQ 545
YV KT YASGC DV C+SD GF EAV IAK ADFVIVVAGLDL+QETED+DRVSL LPG+
Sbjct: 469 YVKKTSYASGCSDVSCDSDTGFGEAVAIAKGADFVIVVAGLDLSQETEDKDRVSLSLPGK 528
Query: 546 QMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFG 605
Q LV+ VA SK+PVILVLTGGGP+DV+FA+ D +I SI+WIGYPGE G +ALAEIIFG
Sbjct: 529 QKDLVSHVAAVSKKPVILVLTGGGPVDVTFAKNDPRIGSIIWIGYPGETGGQALAEIIFG 588
Query: 606 DFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNY 665
DFNPGGRLP TWYPESFT V M+DM+MRA+SSR YPGR+YRFYTG QVY FG GLSYT +
Sbjct: 589 DFNPGGRLPTTWYPESFTDVAMSDMHMRANSSRGYPGRTYRFYTGPQVYSFGTGLSYTKF 648
Query: 666 SYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDE--VTSCTSLRFHVQISVTNA 723
YK LSAP L++S L S Q G L Y+ +D+ V SC SLRF+V++ V+N
Sbjct: 649 EYKILSAPIRLSLSELLPQQSSHKKQLQHGEELRYLQLDDVIVNSCESLRFNVRVHVSNT 708
Query: 724 GDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHG 783
G++DGSHVVMLF+++P V G PEKQLIG+DRVH + E F +DPC+QLS+AN G
Sbjct: 709 GEIDGSHVVMLFSKMPPVLSGVPEKQLIGYDRVHVRSNEMMETVFVIDPCKQLSVANDVG 768
Query: 784 RRILPLGNHVLMVGELRHSLTIE 806
+R++PLG+HVL +G+L+HSL++E
Sbjct: 769 KRVIPLGSHVLFLGDLQHSLSVE 791
>gi|449451581|ref|XP_004143540.1| PREDICTED: probable beta-D-xylosidase 6-like [Cucumis sativus]
Length = 777
Score = 1073 bits (2774), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 495/785 (63%), Positives = 623/785 (79%), Gaps = 14/785 (1%)
Query: 28 IAFSNSKPVLNKP---DFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDN 84
+A+ N K + N FPC SYPFCN SLS + RA+SL+SLLTL EKIQQLS+N
Sbjct: 3 VAWYNLKQLCNSSSDFQFPCNSSLHPSYPFCNRSLSFTARAQSLVSLLTLDEKIQQLSNN 62
Query: 85 ASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIG 144
AS+IPRLGIP+Y+WWSE LHGIA+NGPGV+FNG+++S TSFPQVLV+AASFNR+LW IG
Sbjct: 63 ASSIPRLGIPSYQWWSEGLHGIATNGPGVSFNGSITSATSFPQVLVTAASFNRTLWFLIG 122
Query: 145 SAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQG 204
SA+AVEARAM+N+GQ GLT WAPNINIFRDPRWGRGQETPGEDPMV SAY+++FV+ Q
Sbjct: 123 SAIAVEARAMFNVGQCGLTIWAPNINIFRDPRWGRGQETPGEDPMVASAYSIQFVRGLQS 182
Query: 205 ENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNA 264
NW +E + EE + LM+SACCKH AYDLEKW NF+RY+F++
Sbjct: 183 GNW----------MKEHEIRNEVLEEDNGMGSLMVSACCKHFTAYDLEKWNNFTRYTFDS 232
Query: 265 MITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYIT 324
++TEQD DT+QPPFRSCI+QGKASC+MCSYN VNGVPAC DL +KARN+WG KGYIT
Sbjct: 233 VVTEQDLGDTYQPPFRSCIQQGKASCLMCSYNAVNGVPACANPDLLKKARNDWGLKGYIT 292
Query: 325 SDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRA 384
SDCDAVAT++EYQ YT T ED+ A VLKAGMDINCGT MLR T+SAID+GKV+E+++D A
Sbjct: 293 SDCDAVATVYEYQKYTDTPEDAIADVLKAGMDINCGTFMLRGTKSAIDQGKVREEELDSA 352
Query: 385 LLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPL 444
L+NLFSVQ RLG F+G+PR+GK+G+LG DVCT++HK LAL+AARQGIVLLKN+ KFLPL
Sbjct: 353 LINLFSVQARLGFFDGNPREGKFGELGAQDVCTAQHKTLALEAARQGIVLLKNENKFLPL 412
Query: 445 NKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSD 504
+KNA+SSL +IG L N+ S++ GGY G+PCSP SL+ G + Y +ASGC DVPC SD
Sbjct: 413 DKNAISSLTVIGSLANDSSKLLGGYAGVPCSPMSLVEGFQEYAETIFFASGCLDVPCASD 472
Query: 505 AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILV 564
F +A+ IAKKADFVI VAGLD +QETED DRVSLLLPG+QM LV+SVA SK+P+ILV
Sbjct: 473 NRFEDAILIAKKADFVIAVAGLDASQETEDLDRVSLLLPGKQMDLVSSVASVSKKPIILV 532
Query: 565 LTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK 624
L GGGPLD+SFA+ DS+++SILWIG PGEAG KALAE+IFGD+NPGGRLP+TWYP+SFT
Sbjct: 533 LIGGGPLDISFAKKDSRVASILWIGNPGEAGGKALAEVIFGDYNPGGRLPVTWYPQSFTN 592
Query: 625 VPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKA 684
VPMNDM+MR + SR YPGR+YRFYTG ++YGFG GLSYT++ Y+ LSAP ++ + +
Sbjct: 593 VPMNDMHMRPNPSRGYPGRTYRFYTGDRIYGFGEGLSYTSFKYRLLSAPKKVNLLGKAET 652
Query: 685 GSDKNILQ-QTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ 743
+ I Q + G + Y+ ++EV SC LRF V++SV+N G+ DGSHVVM+F+ PKV
Sbjct: 653 SRRRIIPQVRDGVNMSYMEVEEVESCDLLRFEVKLSVSNIGEFDGSHVVMMFSEFPKVLT 712
Query: 744 GTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSL 803
GTP++QLIGFDR++ S E S VDPC +S+A+++G+R++PLG+H + +G+L H +
Sbjct: 713 GTPQRQLIGFDRLYVKRNQSAESSIMVDPCNHVSLADEYGKRVIPLGDHTISLGDLEHVI 772
Query: 804 TIETY 808
+I+ +
Sbjct: 773 SIQVF 777
>gi|449496501|ref|XP_004160150.1| PREDICTED: probable beta-D-xylosidase 6-like, partial [Cucumis
sativus]
Length = 767
Score = 1072 bits (2772), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 491/768 (63%), Positives = 616/768 (80%), Gaps = 11/768 (1%)
Query: 42 FPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSE 101
FPC SYPFCN SLS + RA+SL+SLLTL EKIQQLS+NAS+IPRLGIP+Y+WWSE
Sbjct: 10 FPCNSSLHPSYPFCNRSLSFTARAQSLVSLLTLDEKIQQLSNNASSIPRLGIPSYQWWSE 69
Query: 102 SLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG 161
LHGIA+NGPGV+FNG+++S TSFPQVLV+AASFNR+LW IGSA+AVEARAM+N+GQ G
Sbjct: 70 GLHGIATNGPGVSFNGSITSATSFPQVLVTAASFNRTLWFLIGSAIAVEARAMFNVGQCG 129
Query: 162 LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREK 221
LT WAPNINIFRDPRWGRGQETPGEDPMV SAY+++FV+ Q NW +E
Sbjct: 130 LTIWAPNINIFRDPRWGRGQETPGEDPMVASAYSIQFVRGLQSGNW----------MKEH 179
Query: 222 RVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRS 281
+ EE + LM+SACCKH AYDLEKW NF+RY+F++++TEQD DT+QPPFRS
Sbjct: 180 EIRNEVLEEDNGMGSLMVSACCKHFTAYDLEKWNNFTRYTFDSVVTEQDLGDTYQPPFRS 239
Query: 282 CIEQGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTK 341
CI+QGKASC+MCSYN VNGVPAC DL +KARN+WG KGYITSDCDAVAT++EYQ YT
Sbjct: 240 CIQQGKASCLMCSYNAVNGVPACANPDLLKKARNDWGLKGYITSDCDAVATVYEYQKYTD 299
Query: 342 THEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
T ED+ A VLKAGMDINCGT MLR T+SAID+GKV+E+++D AL+NLFSVQ RLG F+G+
Sbjct: 300 TPEDAIADVLKAGMDINCGTFMLRGTKSAIDQGKVREEELDSALINLFSVQARLGFFDGN 359
Query: 402 PRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
PR+GK+G+LG DVCT++HK LAL+AARQGIVLLKN+ KFLPL+KNA+SSL +IG L N+
Sbjct: 360 PREGKFGELGAQDVCTAQHKTLALEAARQGIVLLKNENKFLPLDKNAISSLTVIGSLAND 419
Query: 462 ISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVI 521
S++ GGY G+PCSP SL+ G + Y +ASGC DVPC SD F +A+ IAKKADFVI
Sbjct: 420 SSKLLGGYAGVPCSPMSLVEGFQEYAETIFFASGCLDVPCASDNRFEDAILIAKKADFVI 479
Query: 522 VVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQ 581
VAGLD +QETED DRVSLLLPG+QM LV+SVA SK+P+ILVL GGGPLD+SFA+ DS+
Sbjct: 480 AVAGLDASQETEDLDRVSLLLPGKQMDLVSSVASVSKKPIILVLIGGGPLDISFAKKDSR 539
Query: 582 ISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYP 641
++SILWIG PGEAG KALAE+IFGD+NPGGRLP+TWYP+SFT VPMNDM+MR + SR YP
Sbjct: 540 VASILWIGNPGEAGGKALAEVIFGDYNPGGRLPVTWYPQSFTNVPMNDMHMRPNPSRGYP 599
Query: 642 GRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQ-QTGSRLDY 700
GR+YRFYTG ++YGFG GLSYT++ Y+ LSAP ++ + + + I Q + G + Y
Sbjct: 600 GRTYRFYTGDRIYGFGEGLSYTSFKYRLLSAPKKVNLLGKAETSRRRIIPQVRDGVNMSY 659
Query: 701 VHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVA 760
+ ++EV SC LRF V++SV+N G+ DGSHVVM+F+ PKV GTP++QLIGFDR++
Sbjct: 660 MEVEEVESCDLLRFEVKLSVSNIGEFDGSHVVMMFSEFPKVLTGTPQRQLIGFDRLYVKR 719
Query: 761 KGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIETY 808
S E S VDPC +S+A+++G+R++PLG+H + +G+L H ++I+ +
Sbjct: 720 NQSAESSIMVDPCNHVSLADEYGKRVIPLGDHTISLGDLEHVISIQVF 767
>gi|356531391|ref|XP_003534261.1| PREDICTED: probable beta-D-xylosidase 6-like [Glycine max]
Length = 780
Score = 1056 bits (2731), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 509/773 (65%), Positives = 618/773 (79%), Gaps = 27/773 (3%)
Query: 41 DFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWS 100
D+ CK S+PFC+TSL TRA+SL+SLLTL EKI LS+NAS+IPRLGIPAY+WWS
Sbjct: 30 DYACK--FSQSHPFCDTSLPTLTRARSLVSLLTLPEKILLLSNNASSIPRLGIPAYQWWS 87
Query: 101 ESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA 160
ESLHG+A NGPGV+F G V S TSFPQV++SAASFNRSLW +A+A EARAM+N+GQA
Sbjct: 88 ESLHGLALNGPGVSFAGAVPSATSFPQVILSAASFNRSLWLRTAAAIAREARAMFNVGQA 147
Query: 161 GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFRE 220
GLTFWAPNIN+FRDPRWGRGQETPGEDPM+ SAYAVE+V+ QG
Sbjct: 148 GLTFWAPNINLFRDPRWGRGQETPGEDPMLASAYAVEYVRGLQG---------------- 191
Query: 221 KRVLKGF--GEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
L G D D LM+SACCKH AYDL+ WG FSRY+FNA++++QD EDT+QPP
Sbjct: 192 ---LSGIQDAVVVDDDDTLMVSACCKHFTAYDLDMWGQFSRYNFNAVVSQQDLEDTYQPP 248
Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQN 338
FRSCI+QGKASC+MCSYN+VNGVPAC +L AR++WGFKGYITSDCDAVAT++EYQ
Sbjct: 249 FRSCIQQGKASCLMCSYNEVNGVPACASEELLGLARDKWGFKGYITSDCDAVATVYEYQK 308
Query: 339 YTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
Y K+ ED+ A VLKAGMDINCGT MLRHT+SAI++GKV+E+D+DRALLNLFSVQLRLGLF
Sbjct: 309 YAKSQEDAVADVLKAGMDINCGTFMLRHTESAIEQGKVKEEDLDRALLNLFSVQLRLGLF 368
Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
+GDP +G++GKLGP DVCT EHK LALDAARQGIVLLKNDKKFLPL+++ +SLA+IGPL
Sbjct: 369 DGDPIRGRFGKLGPKDVCTQEHKTLALDAARQGIVLLKNDKKFLPLDRDIGASLAVIGPL 428
Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKAD 518
+++GGGY+GIPCS SL GL + + YA GC+DVPC+SD GF EA+ AK+AD
Sbjct: 429 ATT-TKLGGGYSGIPCSSSSLYEGLGEFAERISYAFGCYDVPCDSDDGFAEAIDTAKQAD 487
Query: 519 FVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEA 578
FV++VAGLD TQETED DRVSLLLPG+QM+LV+SVA SK PVILVL GGGPLDVSFAE
Sbjct: 488 FVVIVAGLDATQETEDHDRVSLLLPGKQMNLVSSVADASKNPVILVLIGGGPLDVSFAEK 547
Query: 579 DSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSR 638
+ QI+SI+W+GYPGEAG KALAEIIFG+FNP GRLPMTWYPE+FT VPMN+M+MRAD SR
Sbjct: 548 NPQIASIIWLGYPGEAGGKALAEIIFGEFNPAGRLPMTWYPEAFTNVPMNEMSMRADPSR 607
Query: 639 QYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSR- 697
YPGR+YRFYTG +VYGFGHGLS++++SY FLSAPS++++S ++K GS K +L Q +
Sbjct: 608 GYPGRTYRFYTGGRVYGFGHGLSFSDFSYNFLSAPSKISLSRTIKDGSRKRLLYQVENEV 667
Query: 698 --LDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDR 755
+DYV ++++ +C L F V ISV N G +DGSHVVMLF++ PKV G+PE QL+GF R
Sbjct: 668 YGVDYVPVNQLQNCNKLSFSVHISVMNLGGLDGSHVVMLFSKGPKVVDGSPETQLVGFSR 727
Query: 756 VHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIETY 808
+HT++ E S V PCE LS A+K G+RILPLG H L VG+L H ++IE Y
Sbjct: 728 LHTISSKPTETSILVHPCEHLSFADKQGKRILPLGPHTLSVGDLEHVVSIEIY 780
>gi|26449574|dbj|BAC41913.1| putative beta-xylosidase [Arabidopsis thaliana]
Length = 732
Score = 1046 bits (2705), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 499/735 (67%), Positives = 589/735 (80%), Gaps = 7/735 (0%)
Query: 74 LQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAA 133
L EKI QLS+ A+++PRLGIP YEWWSESLHG+A NGPGV+FNG++S+ TSFPQV+VSAA
Sbjct: 2 LPEKIGQLSNTAASVPRLGIPPYEWWSESLHGLADNGPGVSFNGSISAATSFPQVIVSAA 61
Query: 134 SFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSA 193
SFNR+LW IGSAVAVE RAMYN GQAGLTFWAPNIN+FRDPRWGRGQETPGEDP VVS
Sbjct: 62 SFNRTLWYEIGSAVAVEGRAMYNGGQAGLTFWAPNINVFRDPRWGRGQETPGEDPKVVSE 121
Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
Y VEFV+ FQ E K F + D +LMLSACCKH AYDLEK
Sbjct: 122 YGVEFVRGFQ-EKKKRKVLKRRFSDDVDDDRH----DDDADGKLMLSACCKHFTAYDLEK 176
Query: 254 WGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA 313
WGNF+RY FNA++TEQD EDT+QPPF +CI GKASC+MCSYN VNGVPAC +GDL QKA
Sbjct: 177 WGNFTRYDFNAVVTEQDMEDTYQPPFETCIRDGKASCLMCSYNAVNGVPACAQGDLLQKA 236
Query: 314 RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDK 373
R EWGF+GYITSDCDAVATIF YQ YTK+ E++ A +KAG+DINCGT MLRHTQSAI++
Sbjct: 237 RVEWGFEGYITSDCDAVATIFAYQGYTKSPEEAVADAIKAGVDINCGTYMLRHTQSAIEQ 296
Query: 374 GKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIV 433
GKV E+ +DRALLNLF+VQLRLGLF+GDPR+G+YGKLG +D+C+S+H+KLAL+A RQGIV
Sbjct: 297 GKVSEELVDRALLNLFAVQLRLGLFDGDPRRGQYGKLGSNDICSSDHRKLALEATRQGIV 356
Query: 434 LLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYA 493
LLKND K LPLNKN VSSLAI+GP+ NNIS MGG YTG PC K+L L YV KT YA
Sbjct: 357 LLKNDHKLLPLNKNHVSSLAIVGPMANNISNMGGTYTGKPCQRKTLFTELLEYVKKTSYA 416
Query: 494 SGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSV 553
SGC DV C+SD GF EAV IAK ADFVIVVAGLDL+QETED+DRVSL LPG+Q LV+ V
Sbjct: 417 SGCSDVSCDSDTGFGEAVAIAKGADFVIVVAGLDLSQETEDKDRVSLSLPGKQKDLVSHV 476
Query: 554 ARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRL 613
A SK+PVILVLTGGGP+DV+FA+ D +I SI+WIGYPGE G +ALAEIIFGDFNPGGRL
Sbjct: 477 AAVSKKPVILVLTGGGPVDVTFAKNDPRIGSIIWIGYPGETGGQALAEIIFGDFNPGGRL 536
Query: 614 PMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAP 673
P TWYPESFT V M+DM+MRA+SSR YPGR+YRFYTG QVY FG GLSYT + YK LSAP
Sbjct: 537 PTTWYPESFTDVAMSDMHMRANSSRGYPGRTYRFYTGPQVYSFGTGLSYTKFEYKILSAP 596
Query: 674 SELTISASLKAGSDKNILQQTGSRLDYVHIDE--VTSCTSLRFHVQISVTNAGDVDGSHV 731
L++S L S Q G L Y+ +D+ V SC SLRF+V++ V+N G++DGSHV
Sbjct: 597 IRLSLSELLPQQSSHKKQLQHGEELRYLQLDDVIVNSCESLRFNVRVHVSNTGEIDGSHV 656
Query: 732 VMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGN 791
VMLF+++P V G PEKQLIG+DRVH + E F +DPC+QLS+AN G+R++PLG+
Sbjct: 657 VMLFSKMPPVLSGVPEKQLIGYDRVHVRSNEMMETVFVIDPCKQLSVANDVGKRVIPLGS 716
Query: 792 HVLMVGELRHSLTIE 806
HVL +G+L+HSL++E
Sbjct: 717 HVLFLGDLQHSLSVE 731
>gi|384872601|gb|AFI25186.1| putative beta-D-xylosidase [Nicotiana tabacum]
Length = 791
Score = 1043 bits (2697), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 494/756 (65%), Positives = 600/756 (79%), Gaps = 7/756 (0%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
Y FCN +L ISTR +SLISLLT+ EKI LSDN ++IPRLG+PAYEWWSESLHGIA+NGP
Sbjct: 41 YTFCNKNLPISTRVQSLISLLTIDEKILHLSDNTTSIPRLGLPAYEWWSESLHGIATNGP 100
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
VNFNG + VTSFPQV+++AA+FNR+LW +I +A+AVEARAMYNLGQAGLTFWAPNINI
Sbjct: 101 AVNFNGQIKGVTSFPQVILTAAAFNRTLWHSIATAIAVEARAMYNLGQAGLTFWAPNINI 160
Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGI-GFGFREKRVLKGFGEE 230
RDPRWGRGQETPGEDPMVVSAYA+E+V FQG N K+ G G+G +++RVLK E+
Sbjct: 161 LRDPRWGRGQETPGEDPMVVSAYAIEYVTGFQGLNPKAKKGNRNGYG-KKRRVLK---ED 216
Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
+ G+ LMLSACCKH AYDLEKWG+ +RY FNA++T+QD EDTFQ PFRSCI+QGKASC
Sbjct: 217 DNDGERLMLSACCKHFTAYDLEKWGDATRYDFNAVVTKQDMEDTFQAPFRSCIQQGKASC 276
Query: 291 IMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGV 350
+MCSYN VNGVPAC +L K R +WGF GYITSDCDAVATI+E Q YTKT ED+ A
Sbjct: 277 LMCSYNSVNGVPACADKELLDKVRTDWGFDGYITSDCDAVATIYENQKYTKTPEDAVAVA 336
Query: 351 LKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKL 410
LKAG +INCGT MLRH +SA +G V E+D+DRAL LFSVQ RLGLF+G+P G++
Sbjct: 337 LKAGTNINCGTYMLRHMKSAFQQGSVLEEDLDRALQYLFSVQFRLGLFDGNPADGQFANF 396
Query: 411 GPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT 470
G DVCTS H LALDAARQGIVLLKND+KFLPL+K +VS+LAI+GP+ N+S GG Y+
Sbjct: 397 GAQDVCTSNHLNLALDAARQGIVLLKNDQKFLPLDKTSVSTLAIVGPMA-NVSSPGGTYS 455
Query: 471 GIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQ 530
G+PC KS+ G ++++T YA+GC DV CNS AGF +A+ I K+AD+VIVVAG DL++
Sbjct: 456 GVPCKLKSIREGFHRHINRTLYAAGCLDVGCNSTAGFQDAISIVKEADYVIVVAGSDLSE 515
Query: 531 ETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGY 590
ETED DR SLLLPGQQ +LVT++A SK+P+ILVLTGGGP+DVSFAE D +I+SILW+ Y
Sbjct: 516 ETEDHDRYSLLLPGQQTNLVTTLAAASKKPIILVLTGGGPVDVSFAEKDPRIASILWVAY 575
Query: 591 PGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTG 650
PGE G KAL+EIIFG NPGG+LPMTWY ESFTKVPM DMNMRAD S YPGR+YRFYTG
Sbjct: 576 PGETGGKALSEIIFGYQNPGGKLPMTWYLESFTKVPMTDMNMRADPSNGYPGRTYRFYTG 635
Query: 651 TQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCT 710
+YGFGHGLSYT++S + LSAPS L++S + K+ ++IL + SRL Y+H+DEV SC
Sbjct: 636 DVLYGFGHGLSYTSFSSQLLSAPSRLSLSLA-KSNRKRSILAKGRSRLGYIHVDEVESCH 694
Query: 711 SLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGV 770
S +F V ISVTN GD+DGSHV+MLF+RV + QG P+KQL+GFDRVH A+ E S V
Sbjct: 695 SSKFFVHISVTNDGDMDGSHVLMLFSRVLQNFQGAPQKQLVGFDRVHVPARKYVETSLLV 754
Query: 771 DPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
DPCE S AN G RIL LG H ++ ++ H + +E
Sbjct: 755 DPCELFSFANDQGNRILALGEHTFILDDIEHVVFVE 790
>gi|163889365|gb|ABY48135.1| beta-D-xylosidase [Medicago truncatula]
Length = 776
Score = 983 bits (2540), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 489/824 (59%), Positives = 598/824 (72%), Gaps = 69/824 (8%)
Query: 6 MSIQLRERILFRVLQLIVVVNVIAFSNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTRA 65
M +Q R I+ + L+++ + PD+PCKPPH YPFCN SL ISTR
Sbjct: 1 MFLQQRSTIIIFLFSLLLI-------HLPKFFTTPDYPCKPPH-SHYPFCNISLPISTRT 52
Query: 66 KSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSF 125
SLISLLTL +KI QLS+ AS+I LGIP+Y+WWSE+LHGIA+NGPGVNFNG+V S T+F
Sbjct: 53 TSLISLLTLSDKINQLSNTASSISHLGIPSYQWWSEALHGIATNGPGVNFNGSVKSATNF 112
Query: 126 PQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPG 185
PQV+VSAA+FNRSLW IG AV VE RAM+N+GQAGL+FWAPN+N+FRDPRWGRGQETPG
Sbjct: 113 PQVIVSAAAFNRSLWFLIGYAVGVEGRAMFNVGQAGLSFWAPNVNVFRDPRWGRGQETPG 172
Query: 186 EDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKH 245
EDPMV SAYAVEFV+ QG + GI K+VL + D LM+SACCKH
Sbjct: 173 EDPMVGSAYAVEFVRGIQGVD------GI------KKVLN---DHDSDDDGLMVSACCKH 217
Query: 246 LIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACL 305
AYDLEKWG FSRY+FNA++ +T+QPPFR C++QGKASC+MCSYN+VNGVPAC
Sbjct: 218 FTAYDLEKWGEFSRYNFNAVV------NTYQPPFRGCVQQGKASCLMCSYNEVNGVPACA 271
Query: 306 RGDLFQKARNEWGFKGY------------------ITSDCDAVATIFEYQNYTKTHEDSA 347
DL RN+WGF+G + + + +F Q + E+
Sbjct: 272 SKDLLGLVRNKWGFEGVGILPQTVMLWLLFLSIKSMQNLPKMLLLMFLKQVFFYVFEN-- 329
Query: 348 AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKY 407
L MDINCGT MLRHT+SAI++G V+E+D+DRAL NLFSVQ+RLGLFNGDP KGK+
Sbjct: 330 ---LWFCMDINCGTFMLRHTESAIEQGLVKEEDLDRALFNLFSVQMRLGLFNGDPEKGKF 386
Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
GKLGP DVCT EHKKLAL+AARQGIVLLKND KFLPL+K SLAIIGP+ S++GG
Sbjct: 387 GKLGPQDVCTPEHKKLALEAARQGIVLLKNDNKFLPLDKKDRVSLAIIGPMATT-SELGG 445
Query: 468 GYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLD 527
GY+GIPCSP+SL GL+ YV YA GC DV C+SD GF A+ IAK+ADFV++VAGLD
Sbjct: 446 GYSGIPCSPRSLYDGLKEYVKTISYAFGCSDVKCDSDDGFAVAIDIAKQADFVVIVAGLD 505
Query: 528 LTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILW 587
T ETED DRVSLLLPG+QM LV+ VA SKRPVILVLTGGGPLDVSFAE++ I+SILW
Sbjct: 506 TTLETEDLDRVSLLLPGKQMDLVSRVAAASKRPVILVLTGGGPLDVSFAESNQLITSILW 565
Query: 588 IGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRF 647
IGYP DF+ GRLPMTWYPESFT VPMNDM MRAD SR YPGR+YRF
Sbjct: 566 IGYP-------------VDFDAAGRLPMTWYPESFTNVPMNDMGMRADPSRGYPGRTYRF 612
Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGS---RLDYVHID 704
YTG+++YGFGHGLSY+++SY+ LSAPS+L++S + G +++L + +D+VH+D
Sbjct: 613 YTGSRIYGFGHGLSYSDFSYRVLSAPSKLSLSKTTNGGLRRSLLNKVEKDVFEVDHVHVD 672
Query: 705 EVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSK 764
E+ +C SL F V ISV N GD+DGSHVVMLF++ PK QG+PE QL+G R+HTV+ S
Sbjct: 673 ELQNCNSLSFSVHISVMNVGDMDGSHVVMLFSKWPKNIQGSPESQLVGPSRLHTVSNKSI 732
Query: 765 EISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIETY 808
E S DPCE S A++ G+RILPLGNH+L VG++ H ++IE Y
Sbjct: 733 ETSILADPCEHFSFADEQGKRILPLGNHILNVGDVEHIVSIEIY 776
>gi|357164885|ref|XP_003580200.1| PREDICTED: probable beta-D-xylosidase 6-like [Brachypodium
distachyon]
Length = 771
Score = 968 bits (2503), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 453/759 (59%), Positives = 580/759 (76%), Gaps = 29/759 (3%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
YPFC+ SL RA++L+SLLTL EKI QLS+ A+ +PRLGIP YEWWSESLHG+A NGP
Sbjct: 37 YPFCDASLPFPVRARALVSLLTLDEKIAQLSNTAAGVPRLGIPPYEWWSESLHGLADNGP 96
Query: 112 GVNFN-GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNIN 170
GVNF+ G V + T FPQV++SAASFNRSLW + AVAVEARAM+N GQAGLT+WAPNIN
Sbjct: 97 GVNFSSGPVGAATIFPQVILSAASFNRSLWRAVAEAVAVEARAMHNAGQAGLTYWAPNIN 156
Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
+FRDPRWGRGQETPGEDP V++AY+VE+VK FQGE +G
Sbjct: 157 VFRDPRWGRGQETPGEDPAVIAAYSVEYVKGFQGEYGDGKEG------------------ 198
Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
+MLSACCKH +AYDLEKWGNF+RY+FNA + EQD EDT++PPF+SCI++G+ASC
Sbjct: 199 -----RMMLSACCKHYVAYDLEKWGNFTRYTFNAKVNEQDFEDTYEPPFKSCIQEGRASC 253
Query: 291 IMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGV 350
+MCSYNQVNGVPAC R DL QK R+EWGF+GY+ SDCDAV I+ YQNYT + EDS A V
Sbjct: 254 LMCSYNQVNGVPACARKDLLQKVRDEWGFQGYVVSDCDAVGIIYGYQNYTNSDEDSIAIV 313
Query: 351 LKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKL 410
LKAGMDINCG+ ++RHT+SAI KGK+ E+DI+ AL NLFSVQLRLGLF+ + +L
Sbjct: 314 LKAGMDINCGSFLIRHTKSAIQKGKITEEDINHALFNLFSVQLRLGLFDKTSGNQWFTQL 373
Query: 411 GPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT 470
GP ++CT EH++LA +AARQG VLLKND FLPL ++ VS +AIIGP+ N+ MGG YT
Sbjct: 374 GPSNICTKEHRELAAEAARQGTVLLKNDNSFLPLKRSEVSHIAIIGPVANDAYIMGGDYT 433
Query: 471 GIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQ 530
G+PC+P + L+G++A V +T A+GC D+ CNS GF EA+ +AK+AD V+++AGL+LTQ
Sbjct: 434 GVPCNPTTFLKGMQAVVPQTTIAAGCKDISCNSTDGFGEAIEVAKRADIVVLIAGLNLTQ 493
Query: 531 ETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGY 590
ETED DRVSLLLPG+QM L+ S+A +K+P++LV+TGGGP+DVSFA+ D +I+S+LWIGY
Sbjct: 494 ETEDLDRVSLLLPGKQMDLINSIASVTKKPLVLVITGGGPVDVSFAKQDKRIASVLWIGY 553
Query: 591 PGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTG 650
PGE G + L EI+FG++NPGG+LP+TWYPESFT VPMNDMNMRAD SR YPGR+YRFYTG
Sbjct: 554 PGEVGGQVLPEILFGEYNPGGKLPITWYPESFTAVPMNDMNMRADPSRSYPGRTYRFYTG 613
Query: 651 TQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSR---LDYVHIDEVT 707
VYGFG+GLSY+ YSY + AP+++++S S + D ++ +R LDYV ++++
Sbjct: 614 DVVYGFGYGLSYSKYSYNIIQAPTKISLSRS--SAVDFISTKRAHTRRDGLDYVQVEDIA 671
Query: 708 SCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEIS 767
SC S++F V ISV N G +DGSH V+LF R G P KQL+GF+R++ A + +
Sbjct: 672 SCESIKFSVHISVANDGAMDGSHAVLLFTRSKSSVPGFPLKQLVGFERLYAAAGKATNVE 731
Query: 768 FGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
VDPC+ +S AN GRR+L LG+H+LMVG+ H +E
Sbjct: 732 ITVDPCKLMSSANTEGRRVLLLGSHLLMVGDEEHEFFME 770
>gi|115459584|ref|NP_001053392.1| Os04g0530700 [Oryza sativa Japonica Group]
gi|38346629|emb|CAD41212.2| OSJNBa0074L08.23 [Oryza sativa Japonica Group]
gi|38346760|emb|CAE03865.2| OSJNBa0081C01.11 [Oryza sativa Japonica Group]
gi|113564963|dbj|BAF15306.1| Os04g0530700 [Oryza sativa Japonica Group]
gi|218195263|gb|EEC77690.1| hypothetical protein OsI_16749 [Oryza sativa Indica Group]
Length = 770
Score = 966 bits (2497), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 459/766 (59%), Positives = 577/766 (75%), Gaps = 27/766 (3%)
Query: 43 PCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSES 102
PC P +YPFCN +L RA++L+SLLTL EKI QLS+ A+ PRLG+P +EWWSES
Sbjct: 29 PCASPAASAYPFCNATLPFPARARALVSLLTLDEKIAQLSNTAAGAPRLGVPPFEWWSES 88
Query: 103 LHGIASNGPGVNFN-GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG 161
LHG+ NGPGVNF+ G V S T FPQV++SAA+FNRSLW A+AVEARAM+N GQAG
Sbjct: 89 LHGVCDNGPGVNFSSGPVRSATIFPQVILSAAAFNRSLWRAAARAIAVEARAMHNAGQAG 148
Query: 162 LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREK 221
LTFWAPNIN+FRDPRWGRGQETPGEDP VVSAY+VE+VK FQ
Sbjct: 149 LTFWAPNINVFRDPRWGRGQETPGEDPAVVSAYSVEYVKGFQ------------------ 190
Query: 222 RVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRS 281
+ +GEE +MLSACCKH IAYDLEKW F+RY+FNA + QD EDT+QPPF+S
Sbjct: 191 ---RDYGEEG----RMMLSACCKHYIAYDLEKWRGFTRYTFNAKVNAQDMEDTYQPPFKS 243
Query: 282 CIEQGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTK 341
CI++G+ASC+MCSYNQVNGVPAC R D+ Q+AR+EWGF+GYITSDCDAVA I E Q YT
Sbjct: 244 CIQEGRASCLMCSYNQVNGVPACARKDILQRARDEWGFQGYITSDCDAVAIIHENQTYTA 303
Query: 342 THEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
+ EDS A VLKAGMDINCG+ ++RHT+SAI+KGKVQE+DI+ AL NLFSVQLRLG F+
Sbjct: 304 SDEDSIAVVLKAGMDINCGSFLIRHTKSAIEKGKVQEEDINHALFNLFSVQLRLGFFDKT 363
Query: 402 PRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
+ +LGP++VCT+EH++LA +A RQG VLLKND FLPL ++ V +A+IGP N+
Sbjct: 364 NENQWFTQLGPNNVCTTEHRELAAEAVRQGTVLLKNDNGFLPLKRSEVGHIALIGPAAND 423
Query: 462 ISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVI 521
+GG YTG+PC + ++G++AYV KT +A+GC DVPCNS GF EA+ AK+AD V+
Sbjct: 424 PYILGGDYTGVPCHSTTFVKGMQAYVPKTTFAAGCKDVPCNSTDGFGEAIEAAKRADVVV 483
Query: 522 VVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQ 581
++AGL+LT+ETED DRVSLLLPG+QM L+ +VA +K+PV+LVL GGGP+DVSFA+ D +
Sbjct: 484 LIAGLNLTEETEDHDRVSLLLPGRQMDLIHTVASVTKKPVVLVLMGGGPVDVSFAKHDPR 543
Query: 582 ISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYP 641
I+SILWIGYPGE G L EI+FG +NPGG+LP+TWYPESFT VPM+DMNMRAD+SR YP
Sbjct: 544 IASILWIGYPGEVGGNVLPEILFGKYNPGGKLPITWYPESFTAVPMDDMNMRADASRGYP 603
Query: 642 GRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS-ASLKAGSDKNILQQTGSRLDY 700
GR+YRFYTG VYGFG+GLSY+ YSY L AP ++++S +S+ + +DY
Sbjct: 604 GRTYRFYTGDVVYGFGYGLSYSKYSYSILQAPKKISLSRSSVPDLISRKPAYTRRDGVDY 663
Query: 701 VHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVA 760
V ++++ SC +L+F V ISV+N G +DGSH V+LFA G+P KQL+GF+RVHT A
Sbjct: 664 VQVEDIASCEALQFPVHISVSNDGAMDGSHAVLLFASSKPSFPGSPIKQLVGFERVHTAA 723
Query: 761 KGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
S ++ VDPC+ +S AN G R+L LG HVLMVG+ H L IE
Sbjct: 724 GRSTDVEITVDPCKLMSFANTEGTRVLFLGTHVLMVGDEEHELLIE 769
>gi|326491679|dbj|BAJ94317.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 772
Score = 956 bits (2471), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 460/767 (59%), Positives = 576/767 (75%), Gaps = 27/767 (3%)
Query: 44 CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESL 103
C +SY FC+ SL RA++L+SLLTL EKI QLS+ A+ +PRLG+P YEWWSESL
Sbjct: 30 CASAEANSYAFCDGSLPFPVRARALVSLLTLDEKIAQLSNTAAGVPRLGVPPYEWWSESL 89
Query: 104 HGIASNGPGVNFN-GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGL 162
HG+A NGPGVNF+ G V++ T FPQV++SAA+FNRSLW + AVAVEARAM+N GQAGL
Sbjct: 90 HGLADNGPGVNFSSGPVAAATIFPQVILSAAAFNRSLWRAVAEAVAVEARAMHNAGQAGL 149
Query: 163 TFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKR 222
T+WAPNIN+FRDPRWGRGQETPGEDP +++AY+VE+VK FQGE DG RE R
Sbjct: 150 TYWAPNINVFRDPRWGRGQETPGEDPAMIAAYSVEYVKGFQGE---YGDG------REGR 200
Query: 223 VLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSC 282
+MLSACCKH IAYDLEKWG F+RY+FNA + QD EDT++PPF+SC
Sbjct: 201 --------------MMLSACCKHYIAYDLEKWGKFARYTFNAEVNAQDFEDTYEPPFKSC 246
Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
I++G+ASC+MCSYNQVNGVPAC R DL QK R+EWGFKGYI SDCDAVA I E Q YT +
Sbjct: 247 IQEGRASCLMCSYNQVNGVPACARKDLLQKIRDEWGFKGYIVSDCDAVAIIHENQTYTSS 306
Query: 343 HEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDP 402
EDS A VLKAGMD+NCG+ ++RHT+SAI+KGK+QE+DI+ AL NLFSVQLRLGLF
Sbjct: 307 DEDSVAIVLKAGMDVNCGSFLIRHTKSAIEKGKIQEEDINHALYNLFSVQLRLGLFEKAN 366
Query: 403 RKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNI 462
+ +LGP +VCT EH++LA +A RQG VLLKND FLPL ++ VS +A+IG N+
Sbjct: 367 ENQWFTRLGPSNVCTKEHRELAAEAVRQGTVLLKNDNSFLPLKRSKVSHIALIGAAANDA 426
Query: 463 SQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIV 522
MGG YTG+PC P + L+G++A+V +T A+GC DV C+S GF EA+ AK+AD V+V
Sbjct: 427 YIMGGDYTGVPCDPITFLKGMQAFVPQTTVAAGCKDVSCDSPDGFGEAIEAAKRADIVVV 486
Query: 523 VAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQI 582
+AGL+LTQE+ED DRV+LLLPG+Q LV +A +K+P++LV+TGGGP+DV+FA+ D +I
Sbjct: 487 IAGLNLTQESEDLDRVTLLLPGRQQDLVNIIASVTKKPIVLVITGGGPVDVAFAKQDPRI 546
Query: 583 SSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPG 642
+S+LWIGYPGE G + L EI+FG++NPGG+LPMTWYPESFT VPMNDMNMRAD SR YPG
Sbjct: 547 ASVLWIGYPGEVGGQVLPEILFGEYNPGGKLPMTWYPESFTAVPMNDMNMRADPSRGYPG 606
Query: 643 RSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAG--SDKNILQQTGSRLDY 700
R+YRFYTG VYGFG+GLSY+ YSY + AP +++S S G S K + LDY
Sbjct: 607 RTYRFYTGEVVYGFGYGLSYSKYSYNIVQAPQRISLSHSPVPGLISRKPAYTRRDG-LDY 665
Query: 701 VHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVA 760
V ++++ SC SL F V ISV N G +DGSH V+LFAR G P KQL+GF+RV+T A
Sbjct: 666 VQVEDIASCESLVFSVHISVANDGAMDGSHAVLLFARSKSSVPGFPLKQLVGFERVYTAA 725
Query: 761 KGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIET 807
SK ++ VDPC+ +S AN GRR+L LG+H LMVG+ H IE
Sbjct: 726 GSSKNVAITVDPCKYMSAANTEGRRVLLLGSHHLMVGDEVHEFVIEA 772
>gi|242076578|ref|XP_002448225.1| hypothetical protein SORBIDRAFT_06g023450 [Sorghum bicolor]
gi|241939408|gb|EES12553.1| hypothetical protein SORBIDRAFT_06g023450 [Sorghum bicolor]
Length = 766
Score = 946 bits (2444), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 454/770 (58%), Positives = 574/770 (74%), Gaps = 33/770 (4%)
Query: 43 PCKPPHFDS--YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWS 100
PC P+ S YPFC+ SLSI RA++L+SLLTL EKI QLS+ A +PRLGIP Y+WWS
Sbjct: 23 PCASPYSGSSAYPFCDASLSIPARARALVSLLTLDEKIAQLSNTAGGVPRLGIPPYQWWS 82
Query: 101 ESLHGIASNGPGVNFN-GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ 159
ESLHG+A NGPGVNF+ G V + T+FPQV++S A+FNRSLW + AVA EA M+N GQ
Sbjct: 83 ESLHGLADNGPGVNFSSGPVRAATTFPQVILSTAAFNRSLWRAVAEAVATEALGMHNAGQ 142
Query: 160 AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFR 219
AGLT+WAPNINIFRDPRWGRGQET GEDP V +AY++E+VK FQGE
Sbjct: 143 AGLTYWAPNINIFRDPRWGRGQETSGEDPAVAAAYSLEYVKGFQGEQ------------- 189
Query: 220 EKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPF 279
GEE + LSACCKH AYD+EKW FSRY+FNA + QD EDT+QPPF
Sbjct: 190 --------GEEG----RIRLSACCKHYTAYDMEKWEGFSRYTFNAKVNAQDLEDTYQPPF 237
Query: 280 RSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNY 339
++CI++ +ASC+MC+YNQVNGVP C DL QK R+EWGF+GYITSDCDAVA I E Q Y
Sbjct: 238 KTCIQEARASCLMCAYNQVNGVPMCANKDLLQKTRDEWGFQGYITSDCDAVAIIHENQTY 297
Query: 340 TKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
TK+ EDS A VLKAGMDINCG+ ++RHT+SA++KGKVQE+DIDRAL NLFSVQLRLG+F+
Sbjct: 298 TKSDEDSIAIVLKAGMDINCGSFLVRHTKSAVEKGKVQEQDIDRALFNLFSVQLRLGIFD 357
Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
+LGP++VCT EH++LA +A RQG VLLKND FLPL ++ V +AIIGP
Sbjct: 358 KPNNNQWSTQLGPNNVCTKEHRELAAEAVRQGAVLLKNDHSFLPLKRSEVRHVAIIGPSA 417
Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADF 519
N++ MGG YTG+ C+P + L+G++AY ++T +A+GC DV CNS F EA+ AK+AD
Sbjct: 418 NDVYAMGGDYTGVACNPTTFLKGIQAYATQTTFAAGCKDVSCNSTELFGEAIAAAKRADI 477
Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
V+VVAGL+LT+E ED DRVSLLLPG+QMSL+ +VA +K+P++LVL GGGP+DVSFA+ D
Sbjct: 478 VVVVAGLNLTEEREDFDRVSLLLPGKQMSLIHAVASVAKKPLVLVLLGGGPVDVSFAKQD 537
Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQ 639
+I+SILW+GYPGE G + L EI+FG++NPGG+L MTWYPESFT +PM DMNMRAD SR
Sbjct: 538 PRIASILWLGYPGEVGGQVLPEILFGEYNPGGKLAMTWYPESFTAIPMTDMNMRADPSRG 597
Query: 640 YPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISAS--LKAGSDK-NILQQTGS 696
YPGR+YRFYTG VYGFG+GLSY+ YSY LSAP ++T+S S L S K + +++ G
Sbjct: 598 YPGRTYRFYTGDVVYGFGYGLSYSKYSYSILSAPKKITMSRSSVLDIISRKPSYIRRDG- 656
Query: 697 RLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRV 756
LD+V +++ SC +L F V ++V+N G +DGSH V+LFAR G P KQL+GF+RV
Sbjct: 657 -LDFVKTEDIASCEALAFSVHVAVSNHGSMDGSHAVLLFARSKSSVPGFPIKQLVGFERV 715
Query: 757 HTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
HT A + + VDPC+ +S AN G+R+L LG+HVL VG+ L IE
Sbjct: 716 HTAAGSASNVEISVDPCKHMSAANPEGKRVLLLGDHVLTVGDEEFELFIE 765
>gi|414586138|tpg|DAA36709.1| TPA: putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 769
Score = 913 bits (2360), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 445/771 (57%), Positives = 562/771 (72%), Gaps = 36/771 (4%)
Query: 43 PCKPPHFDS-YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSE 101
PC P+ S YPFC+ SLSI RA++L+SLLTL EKI QLS+ A +PRLGIP Y+WWSE
Sbjct: 27 PCASPYSTSAYPFCDASLSIPARARALVSLLTLDEKIAQLSNTAGGVPRLGIPPYQWWSE 86
Query: 102 SLHGIASNGPGVNFN-GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA 160
SLHG+A NGPGVNF+ G V + T FPQV++S A+FNRSLW + AVA EA M+N GQA
Sbjct: 87 SLHGLADNGPGVNFSSGPVRAATDFPQVILSTAAFNRSLWRAVAEAVATEALGMHNAGQA 146
Query: 161 GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFRE 220
GLT+WAPNINIFRDPRWGRGQET GEDP V +AY++E+VK FQ
Sbjct: 147 GLTYWAPNINIFRDPRWGRGQETSGEDPAVAAAYSLEYVKGFQ----------------- 189
Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFR 280
E + LSACCKH AYD+EKW FSRY+FNA + QD EDT+QPPF+
Sbjct: 190 --------GEEGEEGRIRLSACCKHYTAYDMEKWEGFSRYTFNAKVNAQDLEDTYQPPFK 241
Query: 281 SCIEQGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYT 340
+CI++ +ASC+MC+YNQVNGVP C DL QK R+EWGF+GYITSDCDAVA I E Q YT
Sbjct: 242 TCIQEARASCLMCAYNQVNGVPMCAHKDLLQKTRDEWGFQGYITSDCDAVAIIHENQTYT 301
Query: 341 KTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNG 400
K+ EDS A VLKAGMDINCG+ ++RHT+SAI+KGK+QE+DIDRAL NLFSVQLRLG+F+
Sbjct: 302 KSGEDSIAIVLKAGMDINCGSFLVRHTKSAIEKGKIQEEDIDRALFNLFSVQLRLGIFDK 361
Query: 401 DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVN 460
+ +LGP+ VCT EH++LA +A RQG VLLKND FLPL ++ V +AIIGP N
Sbjct: 362 PSNNQWFSQLGPNSVCTKEHRELAAEAVRQGAVLLKNDHNFLPLKRSEVRHVAIIGPSAN 421
Query: 461 NISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFV 520
+ MGG YTG+PC+P + L+G++AY ++T +A GC D CNS F EAV AK+AD V
Sbjct: 422 DAYAMGGDYTGVPCNPTTFLKGIQAYATQTSFAPGCKDASCNSTDLFGEAVEAAKRADIV 481
Query: 521 IVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADS 580
+V+AGL+LT+E ED DRVSLLLPG+QM L+ ++A +K+P++LVL GGGP+DVSFA+ D
Sbjct: 482 VVIAGLNLTEEREDFDRVSLLLPGKQMGLIHAIASVAKKPLVLVLLGGGPVDVSFAKQDP 541
Query: 581 QISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQY 640
+I+SILW+GYPGE G + L EI+FG++NPGG+LP+TWYPESFT +PM DMNMRAD SR Y
Sbjct: 542 RIASILWLGYPGEVGGQVLPEILFGEYNPGGKLPITWYPESFTAIPMTDMNMRADPSRGY 601
Query: 641 PGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSR--- 697
PGR+YRFYTG VYGFG+GLSY+ YSY SAP ++T+S S SD I+ + +
Sbjct: 602 PGRTYRFYTGDVVYGFGYGLSYSKYSYSISSAPKKITVSRS----SDLGIISRKPAYTRR 657
Query: 698 --LDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDR 755
L V +++ SC +L F V ++V+N G +DGSH V+LFAR G P KQL+GF+
Sbjct: 658 DGLGSVKTEDIASCEALVFSVHVAVSNHGSMDGSHAVLLFARSKSSVPGFPIKQLVGFES 717
Query: 756 VHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
VHT A + + VDPC+Q+S AN G+R+L LG HVL VG+ L+IE
Sbjct: 718 VHTAAGSASNVEITVDPCKQMSAANPEGKRVLLLGAHVLTVGDEEFELSIE 768
>gi|222629257|gb|EEE61389.1| hypothetical protein OsJ_15562 [Oryza sativa Japonica Group]
Length = 771
Score = 907 bits (2343), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 443/775 (57%), Positives = 561/775 (72%), Gaps = 44/775 (5%)
Query: 43 PCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPR---------LGI 93
PC P +YPFCN +L RA++L+SLLTL EKI QL + P +G+
Sbjct: 29 PCASPAASAYPFCNATLPFPARARALVSLLTLDEKIAQLLQHRRGRPPPRRPALRVVVGV 88
Query: 94 PAYEWWSESLHGIASNGPG-VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEAR 152
P+ A+ GPG + G V S T FPQV++SAA+FNRSLW A+AVEAR
Sbjct: 89 PS--------TASATTGPGSTSPRGPVRSATIFPQVILSAAAFNRSLWRAAARAIAVEAR 140
Query: 153 AMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDG 212
AM+N GQAGLTFWAPNIN+FRDPRWGRGQETPGEDP VVSAY+VE+VK FQ
Sbjct: 141 AMHNAGQAGLTFWAPNINVFRDPRWGRGQETPGEDPAVVSAYSVEYVKGFQ--------- 191
Query: 213 GIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTE 272
+ +GEE +MLSACCKH IAYDLEKW F+RY+FNA + QD E
Sbjct: 192 ------------RDYGEEG----RMMLSACCKHYIAYDLEKWRGFTRYTFNAKVNAQDME 235
Query: 273 DTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVAT 332
DT+QPPF+SCI++G+ASC+MCSYNQVNGVPAC R D+ Q+AR+EWGF+GYITSDCDAVA
Sbjct: 236 DTYQPPFKSCIQEGRASCLMCSYNQVNGVPACARKDILQRARDEWGFQGYITSDCDAVAI 295
Query: 333 IFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQ 392
I E Q YT + EDS A VLKAGMDINCG+ ++RHT+SAI+KGKVQE+DI+ AL NLFSVQ
Sbjct: 296 IHENQTYTASDEDSIAVVLKAGMDINCGSFLIRHTKSAIEKGKVQEEDINHALFNLFSVQ 355
Query: 393 LRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSL 452
LRLG F+ + +LGP++VCT+EH++LA +A RQG VLLKND FLPL ++ V +
Sbjct: 356 LRLGFFDKTNENQWFTQLGPNNVCTTEHRELAAEAVRQGTVLLKNDNGFLPLKRSEVGHI 415
Query: 453 AIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVR 512
A+IGP N+ +GG YTG+PC + ++G++AYV KT +A+GC DVPCNS GF EA+
Sbjct: 416 ALIGPAANDPYILGGDYTGVPCHSTTFVKGMQAYVPKTTFAAGCKDVPCNSTDGFGEAIE 475
Query: 513 IAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLD 572
AK+AD V+++AGL+LT+ETED DRVSLLLPG+QM L+ +VA +K+PV+LVL GGGP+D
Sbjct: 476 AAKRADVVVLIAGLNLTEETEDHDRVSLLLPGRQMDLIHTVASVTKKPVVLVLMGGGPVD 535
Query: 573 VSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNM 632
VSFA+ D +I+SILWIGYPGE G L EI+FG +NPGG+LP+TWYPESFT VPM+DMNM
Sbjct: 536 VSFAKHDPRIASILWIGYPGEVGGNVLPEILFGKYNPGGKLPITWYPESFTAVPMDDMNM 595
Query: 633 RADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS-ASLKAGSDKNIL 691
RAD+SR YPGR+YRFYTG VYGFG+GLSY+ YSY L AP ++++S +S+ +
Sbjct: 596 RADASRGYPGRTYRFYTGDVVYGFGYGLSYSKYSYSILQAPKKISLSRSSVPDLISRKPA 655
Query: 692 QQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLI 751
+DYV ++++ SC +L+F V ISV+N G +DGSH V+LFA G+P KQL+
Sbjct: 656 YTRRDGVDYVQVEDIASCEALQFPVHISVSNDGAMDGSHAVLLFASSKPSFPGSPIKQLV 715
Query: 752 GFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
GF+RVHT A S ++ VDPC+ +S AN G R+L LG HVLMVG+ H L IE
Sbjct: 716 GFERVHTAAGRSTDVEITVDPCKLMSFANTEGTRVLFLGTHVLMVGDEEHELLIE 770
>gi|90399376|emb|CAJ86207.1| B1011H02.4 [Oryza sativa Indica Group]
Length = 738
Score = 894 bits (2311), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 437/766 (57%), Positives = 549/766 (71%), Gaps = 59/766 (7%)
Query: 43 PCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSES 102
PC P +YPFCN +L RA++L+SLLTL EKI QLS+ A+ PRLG+P +EWWSES
Sbjct: 29 PCASPAASAYPFCNATLPFPARARALVSLLTLDEKIAQLSNTAAGAPRLGVPPFEWWSES 88
Query: 103 LHGIASNGPGVNFN-GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG 161
LHG+ NGPGVNF+ G V S T FPQV++SAA+FNRSLW A+AVEARAM+N GQAG
Sbjct: 89 LHGVCDNGPGVNFSSGPVRSATIFPQVILSAAAFNRSLWRAAARAIAVEARAMHNAGQAG 148
Query: 162 LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREK 221
LTFWAPNIN+FRDPRWGRGQETPGEDP VVSAY+VE+VK FQ
Sbjct: 149 LTFWAPNINVFRDPRWGRGQETPGEDPAVVSAYSVEYVKGFQ------------------ 190
Query: 222 RVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRS 281
+ +GEE +MLSACCKH IAYDLEKW F+RY+FNA
Sbjct: 191 ---RDYGEEG----RMMLSACCKHYIAYDLEKWRGFTRYTFNA----------------- 226
Query: 282 CIEQGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTK 341
+VNGVPAC R D+ Q+AR+EWGF+GYITSDCDAVA I E Q YT
Sbjct: 227 ---------------KVNGVPACARKDILQRARDEWGFQGYITSDCDAVAIIHENQTYTA 271
Query: 342 THEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
+ EDS A VLKAGMDINCG+ ++RHT+SAI+KGKVQE+DI+ AL NLFSVQLRLG F+
Sbjct: 272 SDEDSIAVVLKAGMDINCGSFLIRHTKSAIEKGKVQEEDINHALFNLFSVQLRLGFFDKT 331
Query: 402 PRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
+ +LGP++VCT+EH++LA +A RQG VLLKND FLPL ++ V +A+IGP N+
Sbjct: 332 NENQWFTQLGPNNVCTTEHRELAAEAVRQGTVLLKNDNGFLPLKRSEVGHIALIGPAAND 391
Query: 462 ISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVI 521
+GG YTG+PC + ++G++AYV KT +A+GC DVPCNS GF EA+ AK+AD V+
Sbjct: 392 PYILGGDYTGVPCHSTTFVKGMQAYVPKTTFAAGCKDVPCNSTDGFGEAIEAAKRADVVV 451
Query: 522 VVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQ 581
++AGL+LT+ETED DRVSLLLPG+QM L+ +VA +K+PV+LVL GGGP+DVSFA+ D +
Sbjct: 452 LIAGLNLTEETEDHDRVSLLLPGRQMDLIHTVASVTKKPVVLVLMGGGPVDVSFAKHDPR 511
Query: 582 ISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYP 641
I+SILWIGYPGE G L EI+FG +NPGG+LP+TWYPESFT VPM+DMNMRAD+SR YP
Sbjct: 512 IASILWIGYPGEVGGNVLPEILFGKYNPGGKLPITWYPESFTAVPMDDMNMRADASRGYP 571
Query: 642 GRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS-ASLKAGSDKNILQQTGSRLDY 700
GR+YRFYTG VYGFG+GLSY+ YSY L AP ++++S +S+ + +DY
Sbjct: 572 GRTYRFYTGDVVYGFGYGLSYSKYSYSILQAPKKISLSRSSVPDLISRKPAYTRRDGVDY 631
Query: 701 VHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVA 760
V ++++ SC +L+F V ISV+N G +DGSH V+LFA G+P KQL+GF+RVHT A
Sbjct: 632 VQVEDIASCEALQFPVHISVSNDGAMDGSHAVLLFASSKPSFPGSPIKQLVGFERVHTAA 691
Query: 761 KGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
S ++ VDPC+ +S AN G R+L LG HVLMVG+ H L IE
Sbjct: 692 GRSTDVEITVDPCKLMSFANTEGTRVLFLGTHVLMVGDEEHELLIE 737
>gi|371917286|dbj|BAL44719.1| SlArf/Xyl4 [Solanum lycopersicum]
Length = 775
Score = 810 bits (2093), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 398/774 (51%), Positives = 520/774 (67%), Gaps = 33/774 (4%)
Query: 39 KPDFPCKP--PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAY 96
+P F C P S FC T L IS R L+S LTL EKI QL ++A AIPRLGIPAY
Sbjct: 26 QPPFSCDSSNPQTKSLKFCQTGLPISVRVLDLVSRLTLDEKISQLVNSAPAIPRLGIPAY 85
Query: 97 EWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN 156
EWWSESLHG+ S G G+ FNG+++ TSFPQV+++AA+F+ +LW IG + VEAR +YN
Sbjct: 86 EWWSESLHGVGSAGKGIFFNGSIAGATSFPQVILTAATFDENLWYRIGQVIGVEARGVYN 145
Query: 157 LGQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
GQA G+TFWAPNINIFRDPRWGRGQETPGEDP++ YA+ +V+ QG+++ +GG
Sbjct: 146 AGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPIMTGKYAIRYVRGVQGDSF---NGG-- 200
Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTF 275
++ KG L SACCKH AYDL++W N R+SFNA++T QD DTF
Sbjct: 201 ------QLKKG---------HLQASACCKHFTAYDLDQWKNLDRFSFNAIVTPQDMADTF 245
Query: 276 QPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIF 334
QPPF+ CI++ +AS IMCSYN VNG+P+C +L K AR +WGF GYITSDCDAV +
Sbjct: 246 QPPFQDCIQKAQASGIMCSYNSVNGIPSCANYNLLTKTARQQWGFHGYITSDCDAVQVMH 305
Query: 335 EYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
+ Y T EDS A LKAGMDI+CG + ++T+SA+ K KV + IDRAL NLFS+++R
Sbjct: 306 DNHRYGNTPEDSTAFALKAGMDIDCGDYLKKYTKSAVMKKKVSQVHIDRALHNLFSIRMR 365
Query: 395 LGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
LGLFNGDPRK YG + P VC +H++LAL+AAR GIVLLKN K LPL+K +SLA+
Sbjct: 366 LGLFNGDPRKQLYGNISPSQVCAPQHQQLALEAARNGIVLLKNTGKLLPLSKAKTNSLAV 425
Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIA 514
IG NN + G Y G PC +L+ L Y Y GC+ C S A +AV IA
Sbjct: 426 IGHNANNAYILRGNYDGPPCKYIEILKALVGYAKSVQYQQGCNAANCTS-ANIDQAVNIA 484
Query: 515 KKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
+ AD+V+++ GLD TQE E DR L+LPGQQ +L+ SVA+ +K+PVILV+ GGP+D+S
Sbjct: 485 RNADYVVLIMGLDQTQEREQFDRDDLVLPGQQENLINSVAKAAKKPVILVILSGGPVDIS 544
Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA 634
FA+ + +I SILW GYPGEAG ALAEIIFG+ NPGG+LP+TWYP++F K+PM DM MR
Sbjct: 545 FAKYNPKIGSILWAGYPGEAGGIALAEIIFGEHNPGGKLPVTWYPQAFVKIPMTDMRMRP 604
Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSA-PSELTISASLKAGSDKNILQQ 693
D YPGR+YRFY G +VY FG+GLSYT YSY F SA P+ + ++ L + +N
Sbjct: 605 DPKTGYPGRTYRFYKGPKVYEFGYGLSYTTYSYGFHSATPNTIQLNQLLSVKTVEN---- 660
Query: 694 TGSRLDYVHIDEVTS--CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLI 751
+ Y +DE+ S C +F +SV N+G++DG H V+LF + K G+P KQL+
Sbjct: 661 -SDSIRYTFVDEIGSDNCEKAKFSAHVSVENSGEMDGKHPVLLFVKQDKARNGSPIKQLV 719
Query: 752 GFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
GF V A + ++ F + PCE LS AN+ G ++ G+ L+VG+ H + I
Sbjct: 720 GFQSVSLKAGENSQLVFEISPCEHLSSANEDGLMMIEEGSRYLVVGDAEHPINI 773
>gi|302141935|emb|CBI19138.3| unnamed protein product [Vitis vinifera]
Length = 1411
Score = 805 bits (2078), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 407/793 (51%), Positives = 523/793 (65%), Gaps = 62/793 (7%)
Query: 18 VLQLIVVVNVIAFSNSKPVLNKPDFPCKP--PHFDSYPFCNTSLSISTRAKSLISLLTLQ 75
VL + + A S K + + P F C P SY FCNT+L IS RA LIS LTL
Sbjct: 674 VLLFVRRLKASAGSPIKQLSSSPPFACDSSDPLTKSYAFCNTTLRISQRASDLISRLTLD 733
Query: 76 EKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASF 135
EKI QL +A++IPRLGIPAYEWWSE+LHGI G+ FNGT+ S TSFPQV+++AASF
Sbjct: 734 EKISQLISSAASIPRLGIPAYEWWSEALHGIRDRH-GIRFNGTIRSATSFPQVILTAASF 792
Query: 136 NRSLWSNIGSAVAVEARAMYNLGQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAY 194
+ LW IG A+ +E RAMYN GQA G+TFWAPNINIFRDPRWGRGQETPGEDP+V Y
Sbjct: 793 DAHLWYRIGQAIGIETRAMYNAGQAMGMTFWAPNINIFRDPRWGRGQETPGEDPVVAGKY 852
Query: 195 AVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKW 254
AV +V+ QG+ + E + D L SACCKH AYDL+ W
Sbjct: 853 AVSYVRGLQGDTF----------------------EGGKVDVLQASACCKHFTAYDLDNW 890
Query: 255 GNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-A 313
+ RY+F+A +T QD DT+QPPFRSCIE+G+AS +MC+YN VNGVP C +L K A
Sbjct: 891 TSIDRYTFDARVTMQDLADTYQPPFRSCIEEGRASGLMCAYNLVNGVPNCADFNLLSKTA 950
Query: 314 RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDK 373
R +WGF GYI SDCDAV+ + + Q Y K+ ED+ A VL AGMD+ CG + +H +SA+ +
Sbjct: 951 RGQWGFDGYIVSDCDAVSLVHDVQGYAKSPEDAVAIVLTAGMDVACGGYLQKHAKSAVSQ 1010
Query: 374 GKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIV 433
K+ E +IDRALLNLF+V++RLGLFNG+PRK +G +GPD VC++EH+ LAL+AAR GIV
Sbjct: 1011 KKLTESEIDRALLNLFTVRMRLGLFNGNPRKLPFGNIGPDQVCSTEHQTLALEAARSGIV 1070
Query: 434 LLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYA 493
LLKN + LPL+K SLA+IGP N + G Y G PC S L+GL++YV+ T Y
Sbjct: 1071 LLKNSDRLLPLSKGETLSLAVIGPNANATDTLLGNYAGPPCKFISPLQGLQSYVNNTMYH 1130
Query: 494 SGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSV 553
+GC+DV C+S A AV +AK+AD+V++V GLD TQE E DR+ L+LPG+Q L+T V
Sbjct: 1131 AGCNDVACSS-ASIENAVDVAKQADYVVLVMGLDQTQEREKYDRLDLVLPGKQEQLITGV 1189
Query: 554 ARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRL 613
A+ +K+PV+LVL GGP+D+SFA+ S I SILW GYPGEAG A+AE IFGD NPGGRL
Sbjct: 1190 AKAAKKPVVLVLLCGGPVDISFAKGSSNIGSILWAGYPGEAGGAAIAETIFGDHNPGGRL 1249
Query: 614 PMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLS-A 672
P+TWYP+ F K+PM DM MR + YPGR++RFYTG V+ FG+GLSY+ YSY+FLS
Sbjct: 1250 PVTWYPKDFIKIPMTDMRMRPEPQSGYPGRTHRFYTGKTVFEFGNGLSYSPYSYEFLSVT 1309
Query: 673 PSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVV 732
P++L Y++ T HV V N+G + G H V
Sbjct: 1310 PNKL-----------------------YLNQPSTT-------HV---VENSGKMAGKHPV 1336
Query: 733 MLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNH 792
+LF + K G+P KQL+GF V A S + F + PCE LS ANK G ++ G H
Sbjct: 1337 LLFVKQAKAGNGSPMKQLVGFQNVFLDAGESSNVEFILSPCEHLSRANKDGLMVMEQGIH 1396
Query: 793 VLMVGELRHSLTI 805
+L+VG+ + + I
Sbjct: 1397 LLVVGDKEYPIAI 1409
Score = 729 bits (1881), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/747 (50%), Positives = 492/747 (65%), Gaps = 59/747 (7%)
Query: 8 IQLRERILFRVLQLIVVVNVIAFSNSKPVLNKPDFPCKP--PHFDSYPFCNTSLSISTRA 65
++L +L ++ + V++ + + S P F C P SY FC T+L I R
Sbjct: 1 MKLHSLLLINLIYVTVILVGVESTQSPP------FSCDSSNPSTKSYHFCKTTLPIPDRV 54
Query: 66 KSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSF 125
+ L+S LTL EKI QL ++A AIPRLGIPAYEWWSE+LHG+A GPG+ FNGT+ S TSF
Sbjct: 55 RDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVADAGPGIRFNGTIRSATSF 114
Query: 126 PQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA-GLTFWAPNINIFRDPRWGRGQETP 184
PQV+++AASF+ LW IG A+ VEARA+YN GQ G+TFWAPNINIFRDPRWGRGQETP
Sbjct: 115 PQVILTAASFDVHLWYRIGRAIGVEARAVYNAGQTKGMTFWAPNINIFRDPRWGRGQETP 174
Query: 185 GEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCK 244
GEDP+V +YAV +V+ QG+ L+G R EL SACCK
Sbjct: 175 GEDPLVTGSYAVSYVRGVQGD-----------------CLRGL----KRCGELQASACCK 213
Query: 245 HLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC 304
H AYDL+ W R+ F+A +T QD DT+QPPF CIE+G+AS IMC+YN+VNGVP+C
Sbjct: 214 HFTAYDLDDWKGIDRFKFDARVTMQDLADTYQPPFHRCIEEGRASGIMCAYNRVNGVPSC 273
Query: 305 LRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCM 363
+L AR W F+GYITSDCDAV+ I + + KT ED+ VLKAGMD+NCGT +
Sbjct: 274 ADFNLLTNTARKRWNFQGYITSDCDAVSLIHDSYGFAKTPEDAVVDVLKAGMDVNCGTYL 333
Query: 364 LRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKL 423
L HT+SA+ + K+ E ++DRAL NLF+V++RLGLFNG+P+ YG +GP+ VC+ EH+ L
Sbjct: 334 LNHTKSAVMQKKLPESELDRALENLFAVRMRLGLFNGNPKGQPYGDIGPNQVCSVEHQTL 393
Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGL 483
ALDAAR GIVLLKN ++ LPL K SLA+IGP N+ + G Y G PC + L+ L
Sbjct: 394 ALDAARDGIVLLKNSQRLLPLPKGKTMSLAVIGPNANSPKTLIGNYAGPPCKFITPLQAL 453
Query: 484 EAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLP 543
++YV T Y GC V C+S + +AV IA+KAD+V++V GLD TQE E DR+ L+LP
Sbjct: 454 QSYVKSTMYHPGCDAVACSSPS-IEKAVEIAQKADYVVLVMGLDQTQEREAHDRLDLVLP 512
Query: 544 GQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEII 603
G+Q L+ VA +K+PV+LVL GGP+D+SFA+ + I SILW GYPG AG A+AE I
Sbjct: 513 GKQQQLIICVANAAKKPVVLVLLSGGPVDISFAKYSNNIGSILWAGYPGGAGGAAIAETI 572
Query: 604 FGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYT 663
FGD NPGGRLP+TWYP+ FTK+PM DM MR +S+ YPGR+YRFYTG +V+ FG+GLSY+
Sbjct: 573 FGDHNPGGRLPVTWYPQDFTKIPMTDMRMRPESNSGYPGRTYRFYTGEKVFEFGYGLSYS 632
Query: 664 NYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNA 723
YS + ++ +K Q+ + H+ E T S+R+
Sbjct: 633 TYSCE------------TIPVTRNKLYFNQSST----AHVYENTD--SIRYT-------- 666
Query: 724 GDVDGSHVVMLFARVPKVSQGTPEKQL 750
+ G H V+LF R K S G+P KQL
Sbjct: 667 -SMAGKHSVLLFVRRLKASAGSPIKQL 692
>gi|224082152|ref|XP_002306583.1| predicted protein [Populus trichocarpa]
gi|222856032|gb|EEE93579.1| predicted protein [Populus trichocarpa]
Length = 745
Score = 803 bits (2075), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 393/771 (50%), Positives = 513/771 (66%), Gaps = 56/771 (7%)
Query: 39 KPDFPCKP--PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAY 96
+P F C P ++PFC T+L IS RA L+S LTL+EKI QL ++A IPRLGIP Y
Sbjct: 25 QPPFSCDSSNPSTKTFPFCKTTLPISQRANDLVSRLTLEEKISQLVNSAQPIPRLGIPGY 84
Query: 97 EWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN 156
+WWSE+LHG+A GPG+ FNGT+ TSFPQV++SAASF+ + W I A+ EARA+YN
Sbjct: 85 QWWSEALHGVAYAGPGIRFNGTIKRATSFPQVILSAASFDANQWYRISQAIGKEARALYN 144
Query: 157 LGQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
GQA G+TFWAPNINIFRDPRWGRGQETPGEDP++ YAV +V+ QG+++K +
Sbjct: 145 AGQATGMTFWAPNINIFRDPRWGRGQETPGEDPLMTGKYAVSYVRGLQGDSFKGGE---- 200
Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTF 275
+KG L SACCKH AYDLE W SRY F+A +T QD DT+
Sbjct: 201 --------IKG---------PLQASACCKHFTAYDLENWNGTSRYVFDAYVTAQDLADTY 243
Query: 276 QPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIF 334
QPPF+SC+E+G+AS IMC+YN+VNG+P C + + AR +WGF GYI SDCDAV+ I
Sbjct: 244 QPPFKSCVEEGRASGIMCAYNRVNGIPNCADSNFLSRTARAQWGFDGYIASDCDAVSIIH 303
Query: 335 EYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
+ Q Y KT ED+ VLKAGMD+NCG+ + +HT++A+D+ K+ +IDRAL NLFSV++R
Sbjct: 304 DAQGYAKTPEDAVVAVLKAGMDVNCGSYLQQHTKAAVDQKKLTISEIDRALHNLFSVRMR 363
Query: 395 LGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
LGLFNG+P ++G +GPD VC+ E++ LALDAAR GIVLLKN LPL+K+ SLA+
Sbjct: 364 LGLFNGNPTGQQFGNIGPDQVCSQENQILALDAARNGIVLLKNSAGLLPLSKSKTMSLAV 423
Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIA 514
IGP N++ + G Y G PC + L+ L++Y+ T GC V C+S A AV +A
Sbjct: 424 IGPNANSVQTLLGNYAGPPCKLVTPLQALQSYIKHTIPYPGCDSVQCSS-ASIVGAVNVA 482
Query: 515 KKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
K AD V+++ GLD TQE E DR L+LPG+Q L+ SVA+ +K PV+LVL GGP+D+S
Sbjct: 483 KGADHVVLIMGLDDTQEKEGLDRRDLVLPGKQQELIISVAKAAKNPVVLVLLSGGPVDIS 542
Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA 634
FA+ D I SILW GYPGEAGA ALAEIIFGD NPGG+LPMTWYP+ F KVPM DM MR
Sbjct: 543 FAKNDKNIGSILWAGYPGEAGAIALAEIIFGDHNPGGKLPMTWYPQEFVKVPMTDMRMRP 602
Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
++S YPGR+YRFY G V+ FG+GLSY+ Y+Y+
Sbjct: 603 ETSSGYPGRTYRFYKGPTVFEFGYGLSYSKYTYE-------------------------- 636
Query: 695 GSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFD 754
L ++I E C +++F V +SV N G + G H V+LFAR K +G P K+L+GF
Sbjct: 637 ---LRAIYIGE-EQCENIKFKVTVSVKNEGQMAGKHPVLLFARHAKPGKGRPIKKLVGFQ 692
Query: 755 RVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
V A EI + + PCE LS AN+ G ++ G+ +L+VG+ H +TI
Sbjct: 693 TVKLGAGEKTEIEYELSPCEHLSSANEDGVMVMEEGSQILLVGDKEHPVTI 743
>gi|255545664|ref|XP_002513892.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
gi|223546978|gb|EEF48475.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
Length = 774
Score = 802 bits (2072), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 398/774 (51%), Positives = 524/774 (67%), Gaps = 34/774 (4%)
Query: 39 KPDFPCKP--PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAY 96
+P F C P P S+ FC TSL IS R + L+S LTL EKI QL +A +IPRLGIPAY
Sbjct: 26 EPPFSCDPSNPSTSSFLFCKTSLPISQRVRDLVSRLTLDEKISQLVSSAPSIPRLGIPAY 85
Query: 97 EWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN 156
EWWSE+LHG+A+ G G++F G + + TSFPQV+++AASF+ W IG + EARA+YN
Sbjct: 86 EWWSEALHGVANVGRGIHFEGAIKAATSFPQVILTAASFDAYQWYRIGQVIGREARAVYN 145
Query: 157 LGQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
GQA G+TFWAPNINIFRDPRWGRGQETPGEDP+V YAV +V+ QG++++ GG
Sbjct: 146 AGQATGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDSFQ---GG-- 200
Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTF 275
LKG L SACCKH AYDL+ W +R+ F+A +T QD DT+
Sbjct: 201 -------KLKG---------HLQASACCKHFTAYDLDNWKGVNRFVFDARVTMQDLADTY 244
Query: 276 QPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIF 334
QPPF+SC++QGKAS IMC+YN+VNG+P+C +L + AR +W F GYI SDCDAV+ I+
Sbjct: 245 QPPFQSCVQQGKASGIMCAYNRVNGIPSCADFNLLSRTARGQWDFHGYIASDCDAVSIIY 304
Query: 335 EYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
+ Q Y K+ ED+ VLKAGMD+NCG+ + +HT++A+++ K+ E IDRAL NLFSV++R
Sbjct: 305 DNQGYAKSPEDAVVDVLKAGMDVNCGSYLQKHTKAAVEQKKLPEASIDRALHNLFSVRMR 364
Query: 395 LGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
LGLFNG+P + + +GPD VC+ EH+ LAL+AAR GIVLLKN + LPL K+ SLA+
Sbjct: 365 LGLFNGNPTEQPFSNIGPDQVCSQEHQILALEAARNGIVLLKNSARLLPLQKSKTVSLAV 424
Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIA 514
IGP N++ + G Y G PC + L+ L+ YV T Y SGC V C+S A +AV IA
Sbjct: 425 IGPNANSVQTLLGNYAGPPCKTVTPLQALQYYVKNTIYYSGCDTVKCSS-ASIDKAVDIA 483
Query: 515 KKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
K D V+++ GLD TQE E+ DR+ L+LPG+Q L+T+VA+++K P++LVL GGP+D+S
Sbjct: 484 KGVDRVVMIMGLDQTQEREELDRLDLVLPGKQQELITNVAKSAKNPIVLVLLSGGPVDIS 543
Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA 634
FA+ D I SILW GYPGEAG ALAEIIFGD NPGG+LPMTWYP+ F KVPM DM MR
Sbjct: 544 FAKYDENIGSILWAGYPGEAGGIALAEIIFGDHNPGGKLPMTWYPQEFVKVPMTDMRMRP 603
Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSY--KFLSAPSELTISASLKAGSDKNILQ 692
D S YPGR+YRFY G V+ FG+GLSY+ YSY K++S ++L ++ S S I+
Sbjct: 604 DPSSGYPGRTYRFYKGRNVFEFGYGLSYSKYSYELKYVSQ-TKLYLNQS----STMRIID 658
Query: 693 QTGS-RLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLI 751
+ R V C +F V++ V N G++ G H V+LFAR + G P +QLI
Sbjct: 659 NSDPVRATLVAQLGAEFCKESKFSVKVGVENQGEMAGKHPVLLFARHARHGNGRPRRQLI 718
Query: 752 GFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
GF V A EI F + PCE S AN+ G R++ G H LMVG ++ +++
Sbjct: 719 GFKSVILNAGEKAEIEFELSPCEHFSRANEDGLRVMEEGTHFLMVGGDKYPISV 772
>gi|224066931|ref|XP_002302285.1| predicted protein [Populus trichocarpa]
gi|222844011|gb|EEE81558.1| predicted protein [Populus trichocarpa]
Length = 773
Score = 799 bits (2063), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 399/764 (52%), Positives = 518/764 (67%), Gaps = 32/764 (4%)
Query: 39 KPDFPCKP--PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAY 96
+P F C P ++PFC T+L IS RA+ L+S LTL EKI QL ++A IPRLGIP Y
Sbjct: 25 QPPFSCDSSNPSTKAFPFCETTLPISQRARDLVSRLTLDEKISQLVNSAPPIPRLGIPGY 84
Query: 97 EWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN 156
EWWSE+LHG+++ GPG++FN + TSFPQV+++AASF+ W IG A+ EARA+YN
Sbjct: 85 EWWSEALHGVSNAGPGIHFNDNIKGATSFPQVILTAASFDAYQWYRIGQAIGKEARALYN 144
Query: 157 LGQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
GQA G+TFWAPNINIFRDPRWGRGQETPGEDP+V YA +VK QG+++ +GG
Sbjct: 145 AGQATGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGLYAASYVKGVQGDSF---EGG-- 199
Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTF 275
+KG L SACCKH AYDL+ W +R+ F+A +T QD DT+
Sbjct: 200 -------KIKG---------HLQASACCKHFTAYDLDNWKGMNRFVFDARVTMQDLADTY 243
Query: 276 QPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIF 334
QPPF+SC+EQG+AS IMC+YN+VNGVP+C +L K AR +WGF+GYITSDCDAV+ I
Sbjct: 244 QPPFKSCVEQGRASGIMCAYNKVNGVPSCADSNLLSKTARAQWGFRGYITSDCDAVSIIH 303
Query: 335 EYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
+ Q Y K+ ED+ VLKAGMD+NCG+ +L+H + A+++ K+ E DID+AL NLFSV++R
Sbjct: 304 DDQGYAKSPEDAVVDVLKAGMDVNCGSYLLKHAKVAVEQKKLSESDIDKALHNLFSVRMR 363
Query: 395 LGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
LGLFNG P +G +GPD VC+ EH+ LAL+AAR GIVLLKN + LPL+K+ SLA+
Sbjct: 364 LGLFNGRPEGQLFGNIGPDQVCSQEHQILALEAARNGIVLLKNSARLLPLSKSKTKSLAV 423
Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIA 514
IGP N+ + G Y G PC + L+ L++Y+ +T Y C V C+S A AV +A
Sbjct: 424 IGPNANSGQMLLGNYAGPPCRFVTPLQALQSYIKQTVYHPACDTVQCSS-ASVDRAVDVA 482
Query: 515 KKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
K AD V+++ GLD TQE E+ DR LLLPG+Q L+ +VA+ +K PV+LVL GGP+D+S
Sbjct: 483 KGADNVVLMMGLDQTQEREELDRTDLLLPGKQQELIIAVAKAAKNPVVLVLFSGGPVDIS 542
Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA 634
FA+ D I SILW GYPGE GA ALAEI+FGD NPGGRLPMTWYP+ F KVPM DM MR
Sbjct: 543 FAKNDKNIGSILWAGYPGEGGAIALAEIVFGDHNPGGRLPMTWYPQEFVKVPMTDMGMRP 602
Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
++S YPGR+YRFY G V+ FG+G+SY+ YSY+ L+A S+ T+ L S +I+
Sbjct: 603 EASSGYPGRTYRFYRGRSVFEFGYGISYSKYSYE-LTAVSQNTL--YLNQSSTMHIINDF 659
Query: 695 GSRLDYVHIDEVTS--CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIG 752
S + I E+ + C + +I V N G++ G H V+LFAR K G P KQLIG
Sbjct: 660 DS-VRSTLISELGTEFCEQNKCRARIGVKNHGEMAGKHPVLLFARQEKHGNGRPRKQLIG 718
Query: 753 FDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
F V A EI F V PCE LS AN+ G ++ G H L+V
Sbjct: 719 FQSVVLGAGERAEIEFEVSPCEHLSRANEDGLMVMEEGRHFLVV 762
>gi|242062502|ref|XP_002452540.1| hypothetical protein SORBIDRAFT_04g027700 [Sorghum bicolor]
gi|241932371|gb|EES05516.1| hypothetical protein SORBIDRAFT_04g027700 [Sorghum bicolor]
Length = 784
Score = 795 bits (2053), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 391/768 (50%), Positives = 518/768 (67%), Gaps = 35/768 (4%)
Query: 38 NKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYE 97
++P + C + PFC+T+L I R L+S LT+ EKI QL D + AIPRLG+PAY+
Sbjct: 36 SEPPYTCGAGAPPNIPFCDTALPIDRRVDDLVSRLTVAEKISQLGDESPAIPRLGVPAYK 95
Query: 98 WWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL 157
WWSE+LHG+A+ G G++ +G + + TSFPQV+++AASFN LW IG + VEARA+YN
Sbjct: 96 WWSEALHGVANAGRGIHLDGPLRAATSFPQVILTAASFNPHLWYRIGQVIGVEARAVYNN 155
Query: 158 GQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGF 216
GQA GLTFWAPNIN+FRDPRWGRGQETPGEDP + YA FV+ QG +
Sbjct: 156 GQAEGLTFWAPNINVFRDPRWGRGQETPGEDPTMTGKYAAVFVRGVQG-----------Y 204
Query: 217 GFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQ 276
G + G +D L SACCKH AYDLE W +RY ++A +T QD EDT+
Sbjct: 205 G------VAGPVNSTD----LEASACCKHFTAYDLENWKGITRYVYDAKVTAQDLEDTYN 254
Query: 277 PPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFE 335
PPF+SC+E G AS IMCSYN+VNGVP C +L K AR WGF GYITSDCDAV+ I +
Sbjct: 255 PPFKSCVEDGHASGIMCSYNRVNGVPTCADYNLLSKTARQSWGFYGYITSDCDAVSIIHD 314
Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
Q Y KT ED+ A VLKAGMD+NCG + ++ SA+ +GK+ E+DI+RAL NLF+V++RL
Sbjct: 315 AQGYAKTSEDAVADVLKAGMDVNCGGYVQKYGASALQQGKITEQDINRALHNLFTVRMRL 374
Query: 396 GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAII 455
GLFNGDPR+ +YG +GPD VCT EH+ LAL+AA+ GIVLLKND LPL+K+ V+SLA+I
Sbjct: 375 GLFNGDPRRNRYGNIGPDQVCTQEHQDLALEAAQDGIVLLKNDGGALPLSKSGVASLAVI 434
Query: 456 GPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAK 515
G NN + + G Y G PC + L+ L+ YV T + +GC+ CN EAV+ A
Sbjct: 435 GFNANNATSLLGNYFGPPCVTVTPLQVLQGYVKDTSFVAGCNSAACNVTT-IPEAVQAAS 493
Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
AD V++ GLD QE E+ DR+ L LPGQQ +L+ SVA +K+PVILVL GGP+DVSF
Sbjct: 494 SADSVVLFMGLDQNQEREEVDRLDLTLPGQQQTLIESVANAAKKPVILVLLCGGPVDVSF 553
Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRAD 635
A+ + +I +ILW GYPGEAG A+A+++FG+ NPGGRLP+TWYP+ FTKVPM DM MRAD
Sbjct: 554 AKTNPKIGAILWAGYPGEAGGIAIAQVLFGEHNPGGRLPVTWYPQDFTKVPMTDMRMRAD 613
Query: 636 SSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLS-APSELTISASLKAGSDKNILQQT 694
+ YPGR+YRFY G V+ FG+GLSY+ YS++F++ P ++ A LKA L T
Sbjct: 614 PATGYPGRTYRFYRGPTVFNFGYGLSYSKYSHRFVTKPPPSMSNVAGLKA------LATT 667
Query: 695 GSRLDYVHIDEVTS--CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT--PEKQL 750
+ ++ + S C L+F + V N G +DG H V++F R P + G+ P +QL
Sbjct: 668 AGGVATYDVEAIGSETCDRLKFPAVVRVQNHGPMDGKHPVLVFLRWPNATDGSGRPARQL 727
Query: 751 IGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
IGF +H A + + F V PC+ S A + GR+++ G+H +MVG+
Sbjct: 728 IGFQSLHLRATQTAHVEFEVSPCKHFSRATEDGRKVIDQGSHFVMVGD 775
>gi|449465962|ref|XP_004150696.1| PREDICTED: probable beta-D-xylosidase 7-like [Cucumis sativus]
Length = 783
Score = 795 bits (2052), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 393/775 (50%), Positives = 520/775 (67%), Gaps = 34/775 (4%)
Query: 38 NKPDFPCKP--PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPA 95
++P + C P + PFC T L I RA+ L+S LTL EK+ QL + IPRLGIPA
Sbjct: 34 SQPPYACDSSNPLTKTLPFCKTYLPIKLRARDLVSRLTLDEKVLQLVNTVPPIPRLGIPA 93
Query: 96 YEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY 155
YEWWSE+LHG+A+ G G+ NGT+++ TSFPQV+++AASF+ +LW IG A+ EARA+Y
Sbjct: 94 YEWWSEALHGVANVGYGIRLNGTITAATSFPQVILTAASFDENLWYQIGQAIGTEARAVY 153
Query: 156 NLGQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGI 214
N GQA G+TFW PNINIFRDPRWGRGQETPGEDP++ Y+V +V+ QG+ + +GG
Sbjct: 154 NAGQAKGMTFWTPNINIFRDPRWGRGQETPGEDPLMTGKYSVAYVRGIQGD---AIEGG- 209
Query: 215 GFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDT 274
G++L SACCKH AYDL++W +RY F+A +T QD DT
Sbjct: 210 -----------------KLGNQLKASACCKHFTAYDLDRWNGMTRYVFDAKVTMQDMADT 252
Query: 275 FQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATI 333
+QPPF SC+E+GKAS IMC+YN+VNGVP+C L AR +W F GYITSDCDAV+ I
Sbjct: 253 YQPPFESCVEEGKASGIMCAYNRVNGVPSCADHHLLTATARKQWKFNGYITSDCDAVSII 312
Query: 334 FEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQL 393
+ Q Y K ED+ A VL+AGMD+NCGT + HT+SA++ KV IDRAL NLFSV++
Sbjct: 313 HDAQGYAKIPEDAVADVLRAGMDVNCGTYLKEHTKSAVEMKKVPMLHIDRALRNLFSVRM 372
Query: 394 RLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
RLGLF+G+P K +G++G D VC+ +H+ LAL AAR+GIVLLKN K LPL+K+ SLA
Sbjct: 373 RLGLFDGNPTKLPFGQIGRDQVCSQQHQNLALQAAREGIVLLKNSAKLLPLSKSNTHSLA 432
Query: 454 IIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRI 513
+IG N+ + G Y GIPC + +GL YV T Y GC+ C ++A ++AV+I
Sbjct: 433 VIGHNGNDPKTLRGNYAGIPCKSATPFQGLNNYVKNTVYHRGCNYANC-TEATIYQAVKI 491
Query: 514 AKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
AK D+V++V GLD TQE ED DR L LPG+Q L+ VA+ +KRPVILV+ GGP+D+
Sbjct: 492 AKSVDYVVLVMGLDQTQEREDFDRTELGLPGKQDKLIAEVAKAAKRPVILVILSGGPVDI 551
Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMR 633
S A+ + +I SILW GYPG+AG A+AEIIFGD NPGGRLP+TWYP F K PM DM MR
Sbjct: 552 SSAKYNEKIGSILWAGYPGQAGGTAIAEIIFGDHNPGGRLPLTWYPHDFIKFPMTDMRMR 611
Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLS-APSELTISASLKAGSDKNILQ 692
ADSS YPGR+YRFY G +VY FG+GLSY+N+ Y+F S + S+L +S + KN
Sbjct: 612 ADSSTGYPGRTYRFYNGPKVYEFGYGLSYSNHIYEFTSVSESKLLLSHPKASQPAKN--- 668
Query: 693 QTGSRLDYVHIDEVTS--CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQL 750
+ Y + E+ C S +V + V N G++ G H V+LF + K G+P KQL
Sbjct: 669 --SDLVSYRLVSELDKKFCESKTVNVTVGVRNEGEMGGKHSVLLFIKPSKPINGSPVKQL 726
Query: 751 IGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
+GF +V A +EI F V PC+ +S A++ G I+ G++ L+VG++ H L I
Sbjct: 727 VGFKKVEINAGERREIEFLVSPCDHISKASEEGLMIIEEGSYSLVVGDVEHPLDI 781
>gi|449508468|ref|XP_004163321.1| PREDICTED: LOW QUALITY PROTEIN: probable beta-D-xylosidase 7-like
[Cucumis sativus]
Length = 783
Score = 792 bits (2045), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 392/775 (50%), Positives = 519/775 (66%), Gaps = 34/775 (4%)
Query: 38 NKPDFPCKP--PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPA 95
++P + C P + PFC T L I RA+ L+S LTL EK+ QL + IPRLGIPA
Sbjct: 34 SQPPYACDSSNPLTKTLPFCKTYLPIKLRARDLVSRLTLDEKVLQLVNTVPPIPRLGIPA 93
Query: 96 YEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY 155
YEWWSE+LHG+A+ G G+ NGT+++ TSFPQV+++AASF+ +LW IG A+ EARA+Y
Sbjct: 94 YEWWSEALHGVANVGYGIRLNGTITAATSFPQVILTAASFDENLWYQIGQAIGTEARAVY 153
Query: 156 NLGQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGI 214
N GQA G+TFW PNINIFRDPRWGRGQETPGEDP++ Y+V +V+ QG+ + +GG
Sbjct: 154 NAGQAKGMTFWTPNINIFRDPRWGRGQETPGEDPLMTGKYSVAYVRGIQGD---AIEGG- 209
Query: 215 GFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDT 274
G++L SACCKH AYDL++W +RY F+A +T QD DT
Sbjct: 210 -----------------KLGNQLKASACCKHFTAYDLDRWNGMTRYVFDAKVTMQDMADT 252
Query: 275 FQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATI 333
+QPPF SC+E+GKAS IMC+YN+VNGVP+C L AR +W F GYITSDCDAV+ I
Sbjct: 253 YQPPFESCVEEGKASGIMCAYNRVNGVPSCADHHLLTATARKQWKFNGYITSDCDAVSII 312
Query: 334 FEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQL 393
+ Q Y K ED+ A VL+AGMD+NCGT + HT+SA++ KV IDRAL NLFSV++
Sbjct: 313 HDAQGYAKIPEDAVADVLRAGMDVNCGTYLKEHTKSAVEMKKVPMLHIDRALRNLFSVRM 372
Query: 394 RLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
RLGLF+G+P K +G++G D VC+ +H+ LAL AAR+GIVLLKN K LPL+K+ SLA
Sbjct: 373 RLGLFDGNPTKLPFGQIGRDQVCSQQHQNLALQAAREGIVLLKNSAKLLPLSKSNTHSLA 432
Query: 454 IIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRI 513
+IG N+ + G Y GIPC + +GL YV T Y GC+ C ++A ++AV+I
Sbjct: 433 VIGHNGNDPKTLRGNYAGIPCKSATPFQGLNNYVKNTVYHRGCNYANC-TEATIYQAVKI 491
Query: 514 AKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
AK D+V++V GLD TQE ED DR L LPG+Q L+ VA+ +K PVILV+ GGP+D+
Sbjct: 492 AKSVDYVVLVMGLDQTQEREDFDRTELGLPGKQDKLIAEVAKAAKXPVILVILSGGPVDI 551
Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMR 633
S A+ + +I SILW GYPG+AG A+AEIIFGD NPGGRLP+TWYP F K PM DM MR
Sbjct: 552 SSAKYNEKIGSILWAGYPGQAGGTAIAEIIFGDHNPGGRLPLTWYPHDFIKFPMTDMRMR 611
Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLS-APSELTISASLKAGSDKNILQ 692
ADSS YPGR+YRFY G +VY FG+GLSY+N+ Y+F S + S+L +S + KN
Sbjct: 612 ADSSTGYPGRTYRFYNGPKVYEFGYGLSYSNHIYEFTSVSESKLLLSHPKASQPAKN--- 668
Query: 693 QTGSRLDYVHIDEVTS--CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQL 750
+ Y + E+ C S +V + V N G++ G H V+LF + K G+P KQL
Sbjct: 669 --SDLVSYRLVSELDKKFCESKTVNVTVGVRNEGEMGGKHSVLLFIKPSKPINGSPVKQL 726
Query: 751 IGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
+GF +V A +EI F V PC+ +S A++ G I+ G++ L+VG++ H L I
Sbjct: 727 VGFKKVEINAGERREIEFLVSPCDHISKASEEGLMIIEEGSYSLVVGDVEHPLDI 781
>gi|212275712|ref|NP_001130324.1| uncharacterized protein LOC100191418 precursor [Zea mays]
gi|194688848|gb|ACF78508.1| unknown [Zea mays]
gi|413938927|gb|AFW73478.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 780
Score = 785 bits (2028), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 387/771 (50%), Positives = 507/771 (65%), Gaps = 36/771 (4%)
Query: 36 VLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPA 95
V ++P + C + PFC+ L I R L+S +T+ EKI QL D + AIPRLG+PA
Sbjct: 29 VASEPPYTCGAGAPPNIPFCDAGLPIDRRVDDLVSRMTVAEKISQLGDQSPAIPRLGVPA 88
Query: 96 YEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY 155
Y+WWSE+LHGI++ G G++ +G + + TSFPQV+++AASFN LW IG + VEARA+Y
Sbjct: 89 YKWWSEALHGISNQGRGIHLDGPLRAATSFPQVILTAASFNPHLWYRIGQVIGVEARAVY 148
Query: 156 NLGQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGI 214
N GQA GLTFWAPNIN+FRDPRWGRGQETPGEDP + YA FV+ QG
Sbjct: 149 NNGQAEGLTFWAPNINVFRDPRWGRGQETPGEDPTMTGKYAAVFVRGVQGYGLAGPVNST 208
Query: 215 GFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDT 274
G L SACCKH AYDLE W +RY F+A +T QD DT
Sbjct: 209 G---------------------LEASACCKHFTAYDLENWKGVTRYVFDAKVTAQDLADT 247
Query: 275 FQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATI 333
+ PPF+SC+E G AS IMCSYN+VNGVP C +L AR +WGF GYITSDCDAVA I
Sbjct: 248 YNPPFKSCVEDGHASGIMCSYNRVNGVPTCADYNLLSTTARQDWGFYGYITSDCDAVAII 307
Query: 334 FEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQL 393
+ Q Y KT ED+ A VLKAGMD+NCG+ + H SA+ +GK+ E+DI+RAL NLF+V++
Sbjct: 308 HDAQGYAKTAEDAVADVLKAGMDVNCGSYVQDHGASALQQGKITEQDINRALHNLFAVRM 367
Query: 394 RLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKND--KKFLPLNKNAVSS 451
RLGLFNGDPR+ YG +GPD VCT EH+ LAL+AA+ GIVLLKND LPL+K V+S
Sbjct: 368 RLGLFNGDPRRNLYGDIGPDQVCTQEHQDLALEAAQDGIVLLKNDGGAGALPLSKPNVAS 427
Query: 452 LAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAV 511
LA+IG N+ ++ G Y G PC + L+ L+ YV T + +GC+ CN EAV
Sbjct: 428 LAVIGFNANDAIRLRGNYFGPPCVTVTPLQVLQGYVKDTSFVAGCNSAACNVTT-IPEAV 486
Query: 512 RIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPL 571
+ A AD V++ GLD QE E+ DR+ L LPGQQ +L+ SVA +K+PVILVL GGP+
Sbjct: 487 QAASSADSVVLFMGLDQDQEREEVDRLDLTLPGQQQTLIESVANAAKKPVILVLLCGGPV 546
Query: 572 DVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMN 631
DVSFA+ + +I +ILW GYPGEAG A+A+++FG+ NPGGRLP+TWYP+ FT+VPM DM
Sbjct: 547 DVSFAKTNPKIGAILWAGYPGEAGGIAIAQVLFGEHNPGGRLPVTWYPQDFTRVPMTDMR 606
Query: 632 MRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNIL 691
MRAD + YPGR+YRFY G V+ FG+GLSY+ YS++F + P + A LKA +
Sbjct: 607 MRADPATGYPGRTYRFYRGPTVFNFGYGLSYSKYSHRFATKPPPTSNVAGLKA------V 660
Query: 692 QQTGSRLDYVHIDEVTS--CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT--PE 747
+ T + ++ + S C L+F + V N G +DG H V++F R P + G+ P
Sbjct: 661 EATAGGMASYDVEAIGSETCDRLKFPAVVRVQNHGPMDGKHSVLVFMRWPNATDGSGRPA 720
Query: 748 KQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
QLIGF +H A + + F V PC+ S A + GR+++ G+H +MVGE
Sbjct: 721 SQLIGFQSLHLRATQTAHVEFEVSPCKHFSRATEDGRKVIDQGSHFVMVGE 771
>gi|225459350|ref|XP_002285805.1| PREDICTED: probable beta-D-xylosidase 7-like [Vitis vinifera]
Length = 774
Score = 783 bits (2022), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 400/804 (49%), Positives = 527/804 (65%), Gaps = 38/804 (4%)
Query: 8 IQLRERILFRVLQLIVVVNVIAFSNSKPVLNKPDFPCKP--PHFDSYPFCNTSLSISTRA 65
++L +L ++ + V++ + + S P F C P SY FC T+L I R
Sbjct: 1 MKLHSLLLINLIYVTVILVGVESTQSPP------FSCDSSNPSTKSYHFCKTTLPIPDRV 54
Query: 66 KSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSF 125
+ L+S LTL EKI QL ++A AIPRLGIPAYEWWSE+LHG+A GPG+ FNGT+ S TSF
Sbjct: 55 RDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVADAGPGIRFNGTIRSATSF 114
Query: 126 PQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA-GLTFWAPNINIFRDPRWGRGQETP 184
PQV+++AASF+ LW IG A+ VEARA+YN GQ G+TFWAPNINIFRDPRWGRGQETP
Sbjct: 115 PQVILTAASFDVHLWYRIGRAIGVEARAVYNAGQTKGMTFWAPNINIFRDPRWGRGQETP 174
Query: 185 GEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCK 244
GEDP+V +YAV +V+ QG+ L+G R EL SACCK
Sbjct: 175 GEDPLVTGSYAVSYVRGVQGD-----------------CLRGL----KRCGELQASACCK 213
Query: 245 HLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC 304
H AYDL+ W R+ F+A +T QD DT+QPPF CIE+G+AS IMC+YN+VNGVP+C
Sbjct: 214 HFTAYDLDDWKGIDRFKFDARVTMQDLADTYQPPFHRCIEEGRASGIMCAYNRVNGVPSC 273
Query: 305 LRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCM 363
+L AR W F+GYITSDCDAV+ I + + KT ED+ VLKAGMD+NCGT +
Sbjct: 274 ADFNLLTNTARKRWNFQGYITSDCDAVSLIHDSYGFAKTPEDAVVDVLKAGMDVNCGTYL 333
Query: 364 LRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKL 423
L HT+SA+ + K+ E ++DRAL NLF+V++RLGLFNG+P+ YG +GP+ VC+ EH+ L
Sbjct: 334 LNHTKSAVMQKKLPESELDRALENLFAVRMRLGLFNGNPKGQPYGDIGPNQVCSVEHQTL 393
Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGL 483
ALDAAR GIVLLKN ++ LPL K SLA+IGP N+ + G Y G PC + L+ L
Sbjct: 394 ALDAARDGIVLLKNSQRLLPLPKGKTMSLAVIGPNANSPKTLIGNYAGPPCKFITPLQAL 453
Query: 484 EAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLP 543
++YV T Y GC V C+S + +AV IA+KAD+V++V GLD TQE E DR+ L+LP
Sbjct: 454 QSYVKSTMYHPGCDAVACSSPS-IEKAVEIAQKADYVVLVMGLDQTQEREAHDRLDLVLP 512
Query: 544 GQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEII 603
G+Q L+ VA +K+PV+LVL GGP+D+SFA+ + I SILW GYPG AG A+AE I
Sbjct: 513 GKQQQLIICVANAAKKPVVLVLLSGGPVDISFAKYSNNIGSILWAGYPGGAGGAAIAETI 572
Query: 604 FGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYT 663
FGD NPGGRLP+TWYP+ FTK+PM DM MR +S+ YPGR+YRFYTG +V+ FG+GLSY+
Sbjct: 573 FGDHNPGGRLPVTWYPQDFTKIPMTDMRMRPESNSGYPGRTYRFYTGEKVFEFGYGLSYS 632
Query: 664 NYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTS--CTSLRFHVQISVT 721
YS + + S ++ + T S + Y + E+ C S + I V
Sbjct: 633 TYSCETIPVTRN---KLYFNQSSTAHVYENTDS-IRYTSVAELGKELCDSNNISISIRVR 688
Query: 722 NAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANK 781
N G++ G H V+LF R K S G+P KQL+ F VH S ++ F ++PCE S NK
Sbjct: 689 NDGEMAGKHSVLLFVRRLKASAGSPIKQLVAFQSVHLNGGESADVGFLLNPCEHFSGPNK 748
Query: 782 HGRRILPLGNHVLMVGELRHSLTI 805
G ++ G H L+VG+ H +T+
Sbjct: 749 DGLMVIEEGTHFLVVGDQEHPVTV 772
>gi|293336530|ref|NP_001167905.1| uncharacterized protein LOC100381616 [Zea mays]
gi|223944757|gb|ACN26462.1| unknown [Zea mays]
Length = 630
Score = 783 bits (2022), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 375/658 (56%), Positives = 477/658 (72%), Gaps = 34/658 (5%)
Query: 154 MYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGG 213
M+N GQAGLT+WAPNINIFRDPRWGRGQET GEDP V +AY++E+VK FQ
Sbjct: 1 MHNAGQAGLTYWAPNINIFRDPRWGRGQETSGEDPAVAAAYSLEYVKGFQ---------- 50
Query: 214 IGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTED 273
E + LSACCKH AYD+EKW FSRY+FNA + QD ED
Sbjct: 51 ---------------GEEGEEGRIRLSACCKHYTAYDMEKWEGFSRYTFNAKVNAQDLED 95
Query: 274 TFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATI 333
T+QPPF++CI++ +ASC+MC+YNQVNGVP C DL QK R+EWGF+GYITSDCDAVA I
Sbjct: 96 TYQPPFKTCIQEARASCLMCAYNQVNGVPMCAHKDLLQKTRDEWGFQGYITSDCDAVAII 155
Query: 334 FEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQL 393
E Q YTK+ EDS A VLKAGMDINCG+ ++RHT+SAI+KGK+QE+DIDRAL NLFSVQL
Sbjct: 156 HENQTYTKSGEDSIAIVLKAGMDINCGSFLVRHTKSAIEKGKIQEEDIDRALFNLFSVQL 215
Query: 394 RLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
RLG+F+ + +LGP+ VCT EH++LA +A RQG VLLKND FLPL ++ V +A
Sbjct: 216 RLGIFDKPSNNQWFSQLGPNSVCTKEHRELAAEAVRQGAVLLKNDHNFLPLKRSEVRHVA 275
Query: 454 IIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRI 513
IIGP N+ MGG YTG+PC+P + L+G++AY ++T +A GC D CNS F EAV
Sbjct: 276 IIGPSANDAYAMGGDYTGVPCNPTTFLKGIQAYATQTSFAPGCKDASCNSTDLFGEAVEA 335
Query: 514 AKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
AK+AD V+V+AGL+LT+E ED DRVSLLLPG+QM L+ ++A +K+P++LVL GGGP+DV
Sbjct: 336 AKRADIVVVIAGLNLTEEREDFDRVSLLLPGKQMGLIHAIASVAKKPLVLVLLGGGPVDV 395
Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMR 633
SFA+ D +I+SILW+GYPGE G + L EI+FG++NPGG+LP+TWYPESFT +PM DMNMR
Sbjct: 396 SFAKQDPRIASILWLGYPGEVGGQVLPEILFGEYNPGGKLPITWYPESFTAIPMTDMNMR 455
Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
AD SR YPGR+YRFYTG VYGFG+GLSY+ YSY SAP ++T+S S SD I+ +
Sbjct: 456 ADPSRGYPGRTYRFYTGDVVYGFGYGLSYSKYSYSISSAPKKITVSRS----SDLGIISR 511
Query: 694 TGSR-----LDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEK 748
+ L V +++ SC +L F V ++V+N G +DGSH V+LFAR G P K
Sbjct: 512 KPAYTRRDGLGSVKTEDIASCEALVFSVHVAVSNHGSMDGSHAVLLFARSKSSVPGFPIK 571
Query: 749 QLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
QL+GF+ VHT A + + VDPC+Q+S AN G+R+L LG HVL VG+ L+IE
Sbjct: 572 QLVGFESVHTAAGSASNVEITVDPCKQMSAANPEGKRVLLLGAHVLTVGDEEFELSIE 629
>gi|371917284|dbj|BAL44718.1| SlArf/Xyl3 [Solanum lycopersicum]
Length = 777
Score = 782 bits (2019), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 385/794 (48%), Positives = 518/794 (65%), Gaps = 34/794 (4%)
Query: 18 VLQLIVVVNVIAFSNSKPVLNKPDFPCKP--PHFDSYPFCNTSLSISTRAKSLISLLTLQ 75
+ I V+ V+ F ++ KP F C P+ SYPFCN +L I R L+S LT+
Sbjct: 10 TVMFIFVILVLLFRRTEST--KPPFSCDSSNPNTSSYPFCNAALPIPQRVNDLVSRLTVD 67
Query: 76 EKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASF 135
EKI QL + A IPRLGI AYEWWSE LHGI+ +G G FNGT+ + T FPQ++++A+SF
Sbjct: 68 EKILQLVNGAPEIPRLGISAYEWWSEGLHGISRHGKGTLFNGTIKAATQFPQIILTASSF 127
Query: 136 NRSLWSNIGSAVAVEARAMYNLGQ-AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAY 194
+ +LW I A+ EARA+YN GQ G+T WAPNINI RDPRWGRGQETPGEDPM+V Y
Sbjct: 128 DENLWYRIAQAIGREARAVYNAGQLKGITLWAPNINILRDPRWGRGQETPGEDPMMVGKY 187
Query: 195 AVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKW 254
V +V+ QG+++ +GG + L SACCKH IA D++ W
Sbjct: 188 GVAYVRGLQGDSF---EGG-----------------KLKDGHLQTSACCKHFIAQDMDNW 227
Query: 255 GNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-A 313
NFSRY+F+A + +QD D+++PPF+ C+EQGKAS +MC+YN VNG+P C DL A
Sbjct: 228 HNFSRYTFDAQVLKQDLADSYEPPFKDCVEQGKASSVMCAYNLVNGIPNCANFDLLTTTA 287
Query: 314 RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDK 373
R +WG +GYI SDCDAV ++ Q+Y K ED+ A LKAGMD+NCG+ + +T+SA++K
Sbjct: 288 RGKWGLQGYIVSDCDAVDKMYSEQHYAKEPEDAVAATLKAGMDVNCGSHLKTYTKSALEK 347
Query: 374 GKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIV 433
KV+E DIDRAL NLFSV++RLGLFNGDP K +YG + +VC+ EH+ LA++AAR G V
Sbjct: 348 QKVKESDIDRALHNLFSVRMRLGLFNGDPSKLEYGDISAAEVCSEEHRALAVEAARSGSV 407
Query: 434 LLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYA 493
LLKN + LPL+K +SLA+IGP N+ + G Y G C +L +GL+ YV+ T Y
Sbjct: 408 LLKNSNRLLPLSKMKTASLAVIGPKANDSEVLLGNYEGFSCKNVTLFQGLQGYVANTMYH 467
Query: 494 SGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSV 553
GC + C S A EAV IAKKAD+V++V GLD T E E DR L LPG Q L+TS+
Sbjct: 468 PGCDFINCTSPA-IDEAVNIAKKADYVVLVMGLDQTLEREKFDRTELGLPGMQEKLITSI 526
Query: 554 ARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRL 613
A + +PVILVL GGP+DV+FA+ + +I ILW+GYPGE GA ALA+I+FG+ NPGGR
Sbjct: 527 AEAASKPVILVLMCGGPVDVTFAKDNPKIGGILWVGYPGEGGAAALAQILFGEHNPGGRS 586
Query: 614 PMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAP 673
P+TWYP+ F KV MNDM MR +SS YPGR+YRFY G +V+ FG+GLSYTNYSY F S
Sbjct: 587 PVTWYPKEFNKVAMNDMRMRPESSSGYPGRTYRFYNGPKVFEFGYGLSYTNYSYTFASVS 646
Query: 674 SELTISASLKAGSDKNILQQTGSRLDYVHIDEV--TSCTSLRFHVQISVTNAGDVDGSHV 731
+ + K N + GS L+ + + +V C S V+++V N G++ G H
Sbjct: 647 KNQLLFKNPKI----NQSTEKGSVLN-IAVSDVGPEVCNSAMITVKVAVKNQGEMAGKHP 701
Query: 732 VMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGN 791
V+LF + P+K LIGF V+ A + +++F V PCE + AN+ G ++ G
Sbjct: 702 VLLFLKHSSTVDEVPKKTLIGFKSVNLEAGANTQVTFDVKPCEHFTRANRDGTLVIDEGK 761
Query: 792 HVLMVGELRHSLTI 805
H L++G+ + + +
Sbjct: 762 HFLLLGDQEYPIPV 775
>gi|18025342|gb|AAK38482.1| beta-D-xylosidase [Hordeum vulgare]
Length = 777
Score = 780 bits (2014), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 391/775 (50%), Positives = 505/775 (65%), Gaps = 32/775 (4%)
Query: 36 VLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPA 95
V P F C P S FC+ L I RA L+S LTL+EKI QL D + A+ RLG+PA
Sbjct: 28 VAADPPFSCGAP--SSAAFCDRRLPIEQRAADLVSKLTLEEKISQLGDESPAVDRLGVPA 85
Query: 96 YEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY 155
Y+WWSE+LHG+A+ G GV+ +G + + TSFPQV+++AASFN LW IG + EAR +Y
Sbjct: 86 YKWWSEALHGVANAGRGVHLDGPLRAATSFPQVILTAASFNPHLWYRIGQVIGTEARGVY 145
Query: 156 NLGQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGI 214
N GQA GLTFWAPNIN+FRDPRWGRGQETPGEDP + YA FV+ QG
Sbjct: 146 NNGQAEGLTFWAPNINVFRDPRWGRGQETPGEDPTMTGKYAAVFVRGVQG---------- 195
Query: 215 GFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDT 274
+G + G SD L SACCKH AYDLE W +R++F+A +TEQD DT
Sbjct: 196 -YG------MSGAINSSD----LEASACCKHFTAYDLENWKGVTRFAFDAKVTEQDLADT 244
Query: 275 FQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATI 333
+ PPF+SC+E G AS IMCSYN+VNGVP C +L K AR +W F GYITSDCDAVA I
Sbjct: 245 YNPPFKSCVEDGGASGIMCSYNRVNGVPTCADHNLLSKTARGDWSFNGYITSDCDAVAII 304
Query: 334 FEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQL 393
+ Q Y K ED+ A VLKAGMD+NCG + H SA +GK+ +DIDRAL NLF++++
Sbjct: 305 HDVQGYAKAPEDAVADVLKAGMDVNCGGYIQTHGVSAYQQGKITGEDIDRALRNLFAIRM 364
Query: 394 RLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
RLGLF+G+P+ +YG +G D VC+ EH+ LAL AAR GIVLLKND LPL+K+ VSSLA
Sbjct: 365 RLGLFDGNPKYNRYGNIGADQVCSKEHQDLALQAARDGIVLLKNDGAALPLSKSKVSSLA 424
Query: 454 IIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRI 513
+IGP NN S + G Y G PC + L+ L+ YV + GC+ CN + EAV
Sbjct: 425 VIGPNGNNASLLLGNYFGPPCISVTPLQALQGYVKDARFVQGCNAAVCNV-SNIGEAVHA 483
Query: 514 AKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
A AD+V++ GLD QE E+ DR+ L LPG Q SLV SVA +K+PVILVL GGP+DV
Sbjct: 484 AGSADYVVLFMGLDQNQEREEVDRLELGLPGMQESLVNSVADAAKKPVILVLLCGGPVDV 543
Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMR 633
+FA+ + +I +I+W GYPG+AG A+A+++FGD NPGGRLP+TWYP+ FT VPM DM MR
Sbjct: 544 TFAKNNPKIGAIVWAGYPGQAGGIAIAQVLFGDHNPGGRLPVTWYPKEFTAVPMTDMRMR 603
Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLS---APSELTISASLKAGSDKNI 690
AD S YPGR+YRFY G VY FG+GLSY+ YS++F S P ++ LKA + +
Sbjct: 604 ADPSTGYPGRTYRFYKGKTVYNFGYGLSYSKYSHRFASKGTKPPSMSGIEGLKATARASA 663
Query: 691 LQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQL 750
++ + + +C LRF + V N G +DG H+V+LF R P + G P QL
Sbjct: 664 AGTVSYDVEEMGAE---ACDRLRFPAVVRVQNHGPMDGGHLVLLFLRWPNATDGRPASQL 720
Query: 751 IGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
IGF VH A + + F V PC+ LS A + GR+++ G+H + VG+ L+
Sbjct: 721 IGFQSVHLRADEAAHVEFEVSPCKHLSRAAEDGRKVIDQGSHFVRVGDDEFELSF 775
>gi|413925162|gb|AFW65094.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 774
Score = 777 bits (2007), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/776 (48%), Positives = 504/776 (64%), Gaps = 32/776 (4%)
Query: 36 VLNKPDFPCKPPHFDS---YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLG 92
V P F C P ++ FC+ +L+ + RA L+S LT EKI QL D A +PRLG
Sbjct: 25 VAGDPPFSCGPSSAEASEGLAFCDVTLAPAQRAADLVSRLTAAEKIAQLGDQAPGVPRLG 84
Query: 93 IPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEAR 152
+P Y+WW+E+LHG+A++G G++F+ V + TSFPQVL++AA+F+ LW IG A+ EAR
Sbjct: 85 VPGYKWWNEALHGLATSGKGLHFDAAVRAATSFPQVLLTAAAFDDDLWLRIGQAIGREAR 144
Query: 153 AMYNLGQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDD 211
A++N+GQA GLT W+PN+NIFRDPRWGRGQETPGEDP V S YAV FV+ QG + S
Sbjct: 145 ALFNVGQAEGLTIWSPNVNIFRDPRWGRGQETPGEDPAVASRYAVAFVRGIQGNSSSS-- 202
Query: 212 GGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDT 271
L SACCKH AYDLE W +RYSF A +TEQD
Sbjct: 203 ------------------------LLQTSACCKHATAYDLEDWNGVARYSFVARVTEQDL 238
Query: 272 EDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAV 330
EDTF PPFRSC+ + KASC+MC+Y +NGVPAC DL R +WG GY+ SDCDAV
Sbjct: 239 EDTFNPPFRSCVVEAKASCVMCAYTAINGVPACANSDLLTGTVRGDWGLDGYVASDCDAV 298
Query: 331 ATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFS 390
A + + Q Y T ED+ A LKAG+DI+CG+ + +H +AI +GK+ E+DID+AL NL++
Sbjct: 299 AIMRDAQRYAPTPEDAVAVSLKAGLDIDCGSYVQQHAAAAIQQGKLTEQDIDKALTNLYA 358
Query: 391 VQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVS 450
V++RLG F+GDPRK YG LG D+CT EH+ LAL+AA+ GIVLLKND LPL+++ V+
Sbjct: 359 VRMRLGHFDGDPRKNMYGVLGAADICTPEHRNLALEAAQDGIVLLKNDGGILPLDRSTVT 418
Query: 451 SLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEA 510
S A+IGP N+ + Y G PC + L+GL++YV+ + +GC+ C+ A +A
Sbjct: 419 SAAVIGPNANDGMALIANYFGPPCESTTPLKGLQSYVNDVRFLAGCNSAACDV-AATDQA 477
Query: 511 VRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGP 570
V +A D+V + GL QE+E +DR SLLLPG Q SL+T+VA SKRPVILVL GGP
Sbjct: 478 VALAGSEDYVFLFMGLSQKQESEGKDRTSLLLPGMQQSLITAVADASKRPVILVLLSGGP 537
Query: 571 LDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDM 630
+D++FA+++ +I +ILW GYPG+AG A+A+++FGD NP GRLP+TWYPE FTKVPM DM
Sbjct: 538 VDITFAQSNPKIGAILWAGYPGQAGGLAIAKVLFGDHNPSGRLPVTWYPEEFTKVPMTDM 597
Query: 631 NMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNI 690
MRAD + YPGRSYRFY G VY FG+GLSY+ +S + + S +S++L G + +
Sbjct: 598 RMRADPTSGYPGRSYRFYQGNTVYKFGYGLSYSTFSRRLVHGTSVPALSSTLLTGLRETM 657
Query: 691 LQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQL 750
Q G R +V C L+F + V N G +DG H V++F R P QG P QL
Sbjct: 658 TPQDGDRSYHVDAIGTEGCEQLKFPAMVEVQNHGPMDGKHSVLMFLRWPNTKQGRPASQL 717
Query: 751 IGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
IGF H A + ++ F + PC+ S GR+++ +G+H LMV + E
Sbjct: 718 IGFRSQHLKAGETAKLRFDISPCKHFSRVRADGRKVIDIGSHFLMVDNHEMEIRFE 773
>gi|357489431|ref|XP_003615003.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
gi|355516338|gb|AES97961.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
Length = 780
Score = 775 bits (2002), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 386/776 (49%), Positives = 519/776 (66%), Gaps = 35/776 (4%)
Query: 38 NKPDFPCKP--PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPA 95
N P + C P S+PFCN +L+I+ RAK ++S LTL EKI QL + A AIPRLGIP+
Sbjct: 30 NVPPYSCDTSNPLTKSFPFCNLNLTITQRAKDIVSRLTLDEKISQLVNTAPAIPRLGIPS 89
Query: 96 YEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY 155
Y+WW+E+LHG++ G G+ NG++++ TSFPQ+++ AASF+ LW I + EAR +Y
Sbjct: 90 YQWWNEALHGVSYVGKGIRLNGSITAATSFPQIILIAASFDPKLWYRISKVIGTEARGVY 149
Query: 156 NLGQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGI 214
N GQA G+TFWAPNINIFRDPRWGRGQET GEDP+V S Y V +V+ QG+++ +GG
Sbjct: 150 NAGQAQGMTFWAPNINIFRDPRWGRGQETAGEDPLVNSKYGVSYVRGLQGDSF---EGGK 206
Query: 215 GFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDT 274
G R L SACCKH AYDLE W +RY F+A +T QD DT
Sbjct: 207 LIGGR-----------------LKASACCKHFTAYDLENWKGVNRYVFDAKVTLQDLADT 249
Query: 275 FQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATI 333
+QP F SC+ QG++S IMC+YN+VNGVP C +L AR +W F GYI SDCDAV I
Sbjct: 250 YQPSFHSCVVQGRSSGIMCAYNRVNGVPNCADYNLLTNTARKKWNFNGYIASDCDAVRFI 309
Query: 334 FEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQL 393
+E Q Y KT ED A VL+AGMD+ CG M +H +SA+ + K+ IDRAL NLF++++
Sbjct: 310 YEKQGYAKTPEDVVADVLRAGMDVECGNYMTKHAKSAVLQKKIPISQIDRALHNLFTIRI 369
Query: 394 RLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
RLGLF+G+P K +YG++GP+ VC+ E+ LAL+AAR GIVLLKN LPL + V++L
Sbjct: 370 RLGLFDGNPTKLQYGRIGPNQVCSKENLDLALEAARSGIVLLKNTASILPLPR--VNTLG 427
Query: 454 IIGPLVNNIS-QMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHD-VPCNSDAGFHEAV 511
+IGP N S + G Y G PC S+L+G Y S+THY SGC D V C S A AV
Sbjct: 428 VIGPNANKSSIVLLGNYFGQPCKQVSILKGFYTYASQTHYRSGCTDGVKCAS-AEIDRAV 486
Query: 512 RIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPL 571
+AK +D+VI+V GLD +QETE DR L LPG+Q L+ SVA+ SK+PVILV+ GGP+
Sbjct: 487 EVAKISDYVILVMGLDQSQETETLDRDHLELPGKQQKLINSVAKASKKPVILVILCGGPV 546
Query: 572 DVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMN 631
D++FA+ + +I I+W GYPGE G +ALA+++FGD+NPGGRLPMTWYP+ F K+PM DM
Sbjct: 547 DITFAKNNDKIGGIIWAGYPGELGGRALAQVVFGDYNPGGRLPMTWYPKDFIKIPMTDMR 606
Query: 632 MRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNIL 691
MRAD S YPGR+YRFYTG +VY FG+GLSY+NYSY F+S + + + + +IL
Sbjct: 607 MRADPSSGYPGRTYRFYTGPKVYEFGYGLSYSNYSYNFISVKNN---NIHINQSTTHSIL 663
Query: 692 QQTGSRLDYVHIDEV--TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQ 749
+ + + + Y + E+ +C ++ V + +TN G + G H V+LF + K G P KQ
Sbjct: 664 ENSET-IRYKLVSELGKKACKTMSISVTLGITNTGSMAGKHPVLLFVKPKKGRNGNPVKQ 722
Query: 750 LIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
L+GF+ V G E+ F V CE LS AN+ G +++ G ++ +VGEL +S+ I
Sbjct: 723 LVGFESVTVEGGGKGEVGFEVSVCEHLSRANESGVKVIEEGGYLFLVGELEYSINI 778
>gi|218191593|gb|EEC74020.1| hypothetical protein OsI_08964 [Oryza sativa Indica Group]
Length = 774
Score = 775 bits (2001), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/765 (49%), Positives = 508/765 (66%), Gaps = 33/765 (4%)
Query: 40 PDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWW 99
P F C P S FCN L I RA L+S LTL+EKI QL D + A+ RLG+PAY+WW
Sbjct: 28 PPFSCGAP--SSAAFCNPRLPIEQRADDLVSRLTLEEKISQLGDQSPAVDRLGVPAYKWW 85
Query: 100 SESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ 159
SE+LHG+++ G G++ +G + + TSFPQV+++AASFN LW IG + EARA+YN GQ
Sbjct: 86 SEALHGVSNAGRGIHLDGPLRAATSFPQVILTAASFNPHLWYRIGQVIGTEARAVYNNGQ 145
Query: 160 A-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
A GLTFWAPNIN+FRDPRWGRGQETPGEDP V YA FV+ QG
Sbjct: 146 AEGLTFWAPNINVFRDPRWGRGQETPGEDPTVTGKYAAVFVRGVQG-------------- 191
Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
L G +D L SACCKH AYDLE W +RY+F+A +T QD DT+ PP
Sbjct: 192 ---YALAGAINSTD----LEASACCKHFTAYDLENWKGVTRYAFDAKVTAQDLADTYNPP 244
Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFEYQ 337
FRSC+E G AS IMCSYN+VNGVP C +L K AR +W F GYITSDCDAV+ I + Q
Sbjct: 245 FRSCVEDGGASGIMCSYNRVNGVPTCADYNLLSKTARGDWRFYGYITSDCDAVSIIHDVQ 304
Query: 338 NYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
Y KT ED+ A VLKAGMD+NCG+ + H SAI +GK+ E+DI+RAL NLF+V++RLGL
Sbjct: 305 GYAKTAEDAVADVLKAGMDVNCGSYVQEHGLSAIQQGKITEQDINRALHNLFAVRMRLGL 364
Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
FNG+P+ +YG +GPD VCT EH+ LAL+AA+ G+VLLKND LPL+K+ VSS+A+IG
Sbjct: 365 FNGNPKYNRYGNIGPDQVCTQEHQNLALEAAQHGVVLLKNDANALPLSKSQVSSIAVIGH 424
Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKA 517
N+ +++ G Y G PC + L+ L+ YV T + +GC+ CN + EA ++A
Sbjct: 425 NANDATRLLGNYFGPPCISVTPLQVLQGYVKDTRFLAGCNSAACNVSS-IGEAAQLASSV 483
Query: 518 DFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAE 577
D+V++ GLD QE E+ DR+ L LPG Q +L+ +VA +K+PVILVL GGP+DV+FA+
Sbjct: 484 DYVVLFMGLDQDQEREEVDRLELSLPGMQENLINTVANAAKKPVILVLLCGGPVDVTFAK 543
Query: 578 ADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSS 637
+ +I +ILW GYPGEAG A+A+++FG+ NPGGRLP+TWYP+ FT VPM DM MRAD S
Sbjct: 544 YNPKIGAILWAGYPGEAGGIAIAQVLFGEHNPGGRLPVTWYPKEFTSVPMTDMRMRADPS 603
Query: 638 RQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSR 697
YPGR+YRFY G VY FG+GLSY+ YS+ F++ ++L +S+ K +
Sbjct: 604 TGYPGRTYRFYRGNTVYKFGYGLSYSKYSHHFVANGTKLPSLSSIDG--LKAMATAAAGT 661
Query: 698 LDYVHIDEV--TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPK--VSQGTPEKQLIGF 753
+ Y ++E+ +C L+F + V N G +DG H V+LF R P G P QLIGF
Sbjct: 662 VSY-DVEEIGTETCDKLKFPALVRVQNHGPMDGRHPVLLFLRWPNGAADGGRPASQLIGF 720
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
+H + + + F V PC+ S A + G++++ G+H +MVG+
Sbjct: 721 QSLHLKSMQTVHVEFEVSPCKHFSRATEDGKKVIDHGSHFMMVGD 765
>gi|115448721|ref|NP_001048140.1| Os02g0752200 [Oryza sativa Japonica Group]
gi|46390122|dbj|BAD15557.1| putative beta-D-xylosidase [Oryza sativa Japonica Group]
gi|46390225|dbj|BAD15656.1| putative beta-D-xylosidase [Oryza sativa Japonica Group]
gi|113537671|dbj|BAF10054.1| Os02g0752200 [Oryza sativa Japonica Group]
gi|125583710|gb|EAZ24641.1| hypothetical protein OsJ_08409 [Oryza sativa Japonica Group]
Length = 780
Score = 775 bits (2001), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/765 (49%), Positives = 508/765 (66%), Gaps = 33/765 (4%)
Query: 40 PDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWW 99
P F C P S FCN L I RA L+S LTL+EKI QL D + A+ RLG+PAY+WW
Sbjct: 34 PPFSCGAP--SSAAFCNPRLPIEQRADDLVSRLTLEEKISQLGDQSPAVDRLGVPAYKWW 91
Query: 100 SESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ 159
SE+LHG+++ G G++ +G + + TSFPQV+++AASFN LW IG + EARA+YN GQ
Sbjct: 92 SEALHGVSNAGRGIHLDGPLRAATSFPQVILTAASFNPHLWYRIGQVIGTEARAVYNNGQ 151
Query: 160 A-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
A GLTFWAPNIN+FRDPRWGRGQETPGEDP V YA FV+ QG
Sbjct: 152 AEGLTFWAPNINVFRDPRWGRGQETPGEDPTVTGKYAAVFVRGVQG-------------- 197
Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
L G +D L SACCKH AYDLE W +RY+F+A +T QD DT+ PP
Sbjct: 198 ---YALAGAINSTD----LEASACCKHFTAYDLENWKGVTRYAFDAKVTAQDLADTYNPP 250
Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFEYQ 337
FRSC+E G AS IMCSYN+VNGVP C +L K AR +W F GYITSDCDAV+ I + Q
Sbjct: 251 FRSCVEDGGASGIMCSYNRVNGVPTCADYNLLSKTARGDWRFYGYITSDCDAVSIIHDVQ 310
Query: 338 NYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
Y KT ED+ A VLKAGMD+NCG+ + H SAI +GK+ E+DI+RAL NLF+V++RLGL
Sbjct: 311 GYAKTAEDAVADVLKAGMDVNCGSYVQEHGLSAIQQGKITEQDINRALHNLFAVRMRLGL 370
Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
FNG+P+ +YG +GPD VCT EH+ LAL+AA+ G+VLLKND LPL+K+ VSS+A+IG
Sbjct: 371 FNGNPKYNRYGNIGPDQVCTQEHQNLALEAAQHGVVLLKNDANALPLSKSQVSSIAVIGH 430
Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKA 517
N+ +++ G Y G PC + L+ L+ YV T + +GC+ CN + EA ++A
Sbjct: 431 NANDATRLLGNYFGPPCISVTPLQVLQGYVKDTRFLAGCNSAACNVSS-IGEAAQLASSV 489
Query: 518 DFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAE 577
D+V++ GLD QE E+ DR+ L LPG Q +L+ +VA +K+PVILVL GGP+DV+FA+
Sbjct: 490 DYVVLFMGLDQDQEREEVDRLELSLPGMQENLINTVANAAKKPVILVLLCGGPVDVTFAK 549
Query: 578 ADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSS 637
+ +I +ILW GYPGEAG A+A+++FG+ NPGGRLP+TWYP+ FT VPM DM MRAD S
Sbjct: 550 YNPKIGAILWAGYPGEAGGIAIAQVLFGEHNPGGRLPVTWYPKEFTSVPMTDMRMRADPS 609
Query: 638 RQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSR 697
YPGR+YRFY G VY FG+GLSY+ YS+ F++ ++L +S+ K +
Sbjct: 610 TGYPGRTYRFYRGNTVYKFGYGLSYSKYSHHFVANGTKLPSLSSIDG--LKAMATAAAGT 667
Query: 698 LDYVHIDEV--TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPK--VSQGTPEKQLIGF 753
+ Y ++E+ +C L+F + V N G +DG H V+LF R P G P QLIGF
Sbjct: 668 VSY-DVEEIGPETCDKLKFPALVRVQNHGPMDGRHPVLLFLRWPNGAADGGRPASQLIGF 726
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
+H + + + F V PC+ S A + G++++ G+H +MVG+
Sbjct: 727 QSLHLKSMQTVHVEFEVSPCKHFSRATEDGKKVIDHGSHFMMVGD 771
>gi|356515806|ref|XP_003526589.1| PREDICTED: probable beta-D-xylosidase 7-like [Glycine max]
Length = 772
Score = 773 bits (1995), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 392/789 (49%), Positives = 513/789 (65%), Gaps = 42/789 (5%)
Query: 27 VIAFSNSKPVLNKPDFPC--KPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDN 84
+ +SKP + F C P SYPFCN L I R K L+S LTL EK+ QL +
Sbjct: 14 TVTVQSSKP---EAPFACDFSNPSSRSYPFCNPKLPIPQRTKDLLSRLTLDEKLSQLVNT 70
Query: 85 ASAIPRLGIPAYEWWSESLHGIASNGPGVNF--NGTVSSVTSFPQVLVSAASFNRSLWSN 142
A IPRLGIPAY+WWSE+LHG++ GPG+ F N T+SS TSFPQV+++AASF+ LW
Sbjct: 71 APPIPRLGIPAYQWWSEALHGVSGVGPGILFDNNSTISSATSFPQVILTAASFDSRLWYR 130
Query: 143 IGSAVAVEARAMYNLGQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKS 201
IG A+ +EARA++N GQA GLTFWAPNINIFRDPRWGRGQET GEDP++ S YAV FV+
Sbjct: 131 IGHAIGIEARAIFNAGQANGLTFWAPNINIFRDPRWGRGQETAGEDPLLTSRYAVSFVRG 190
Query: 202 FQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYS 261
QG+++K G L+ SACCKH AYDL+ W R+
Sbjct: 191 LQGDSFK-------------------------GAHLLASACCKHFTAYDLDNWKGVDRFV 225
Query: 262 FNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACL-RGDLFQKARNEWGFK 320
F+A ++ QD DT+QPPF+SC++QG+AS IMC+YN+VNGVP C G L Q ARN+W F
Sbjct: 226 FDARVSLQDLADTYQPPFQSCVQQGRASGIMCAYNRVNGVPNCADYGLLTQTARNQWDFN 285
Query: 321 GYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKD 380
GYITSDC AV I + Q Y K+ ED A VL+AGMD+ CG+ + H +SA+ + K+ +
Sbjct: 286 GYITSDCGAVGFIHDRQRYAKSPEDVVADVLRAGMDLECGSYLTYHAKSAVLQKKLGMSE 345
Query: 381 IDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKK 440
IDRAL NLFS+++RLGLF+G+P + +G +G + VC+ EH+ LAL+AAR GIVLLKN
Sbjct: 346 IDRALQNLFSIRMRLGLFDGNPTRLSFGLIGSNHVCSKEHQYLALEAARNGIVLLKNSPT 405
Query: 441 FLPLNKNAVS-SLAIIGPLVNNIS-QMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHD 498
LPL K + S SLA+IGP N+ + G Y G PC ++L+G YV Y GC
Sbjct: 406 LLPLPKTSPSISLAVIGPNANSSPLTLLGNYAGPPCKYVTILQGFRHYVKNAFYHPGCDG 465
Query: 499 VPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSK 558
P S A +AV +AKK D+V++V GLD ++E E+RDRV L LPG+Q+ L+ VA SK
Sbjct: 466 GPKCSSAQIDQAVEVAKKVDYVVLVMGLDQSEEREERDRVHLDLPGKQLELINGVAEASK 525
Query: 559 RPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWY 618
+PVILVL GGPLD++ A+ + +I ILW GYPGE G ALA+IIFGD NPGGRLP TWY
Sbjct: 526 KPVILVLLSGGPLDITSAKYNHKIGGILWAGYPGELGGIALAQIIFGDHNPGGRLPTTWY 585
Query: 619 PESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTI 678
P+ + KVPM DM MRAD S YPGR+YRFY G +VY FG+GLSY+ YSY+F+S +T
Sbjct: 586 PKDYIKVPMTDMRMRADPSTGYPGRTYRFYKGPKVYEFGYGLSYSKYSYEFVS----VTH 641
Query: 679 SASLKAGSDKNILQQTGSRLDYVHIDEV--TSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
S +++ + + Y + E+ +C S+ V + V N G + G H V+LF
Sbjct: 642 DKLHFNQSSTHLMVENSETISYKLVSELDEQTCQSMSLSVTVRVQNHGSMVGKHPVLLFI 701
Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
R + G+P KQL+GF+ V A + F V PCE LS AN+ G I+ G+H+L+V
Sbjct: 702 RPKRQKSGSPVKQLVGFESVMLDAGEMAHVEFEVSPCEHLSRANEAGAMIIEEGSHMLLV 761
Query: 797 GELRHSLTI 805
+L H + I
Sbjct: 762 DDLEHPIDI 770
>gi|302786474|ref|XP_002975008.1| hypothetical protein SELMODRAFT_103038 [Selaginella moellendorffii]
gi|300157167|gb|EFJ23793.1| hypothetical protein SELMODRAFT_103038 [Selaginella moellendorffii]
Length = 772
Score = 771 bits (1990), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/769 (48%), Positives = 508/769 (66%), Gaps = 32/769 (4%)
Query: 44 CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESL 103
C S+PFC+ SL + R L+ + L EKI Q+ NAS IPRLGIP Y+WW E+L
Sbjct: 25 CTDSRSSSFPFCDVSLPVPDRVADLVGRMNLSEKIAQIVSNASGIPRLGIPGYQWWEEAL 84
Query: 104 HGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLT 163
HG+A + PGV F V S TSFPQV+++ ASFN SLW+ I A+++EA AMYN G++GLT
Sbjct: 85 HGVAES-PGVKFAAPVPSATSFPQVILTVASFNSSLWNKIAQAISIEAIAMYNAGRSGLT 143
Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
FW+PNINIFRDPRWGRGQETPGEDP++ S YA FV+ Q ++ D G ++R
Sbjct: 144 FWSPNINIFRDPRWGRGQETPGEDPLLSSKYAAYFVRGLQEGDY---DEGTAISTMQRRP 200
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
+ L +S+CCKH AYD+EK + FNA +T QD +DTF PPFRSCI
Sbjct: 201 TR-----------LKVSSCCKHFTAYDMEKSEGTDCFHFNAQVTVQDLQDTFDPPFRSCI 249
Query: 284 EQGKASCIMCSYNQVNGVPACLRGD-LFQKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
G+AS +MCSYN+VNGVP+C L + RN WGF+GYI SDCDAVA ++EY NYT T
Sbjct: 250 VDGQASGLMCSYNRVNGVPSCADYTFLTETVRNSWGFEGYIVSDCDAVALLYEYINYTTT 309
Query: 343 HEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDP 402
ED+ A VL AGMD+NCGT +LRHT +AI++GKV E +DRAL N+ +V++RLGLF+G+
Sbjct: 310 AEDAVADVLSAGMDLNCGTFLLRHTAAAIEQGKVTEAAVDRALSNVMTVRMRLGLFDGNS 369
Query: 403 RKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNI 462
+ Y +GPD VCT EH++L+L+AA QGIVLLKN LP +N + ++A+IGP N
Sbjct: 370 GE-TYNSIGPDAVCTREHRQLSLEAAEQGIVLLKNSGNVLPFPRNDLMTIAVIGPSGNAT 428
Query: 463 SQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIV 522
M G Y G+PC + +GL+ Y + GC D+ CN F AVR A+ +D V++
Sbjct: 429 ETMLGNYAGVPCQYITPFQGLQEYTKGVVFEPGCKDIMCNDTTLFLAAVRAAENSDAVVI 488
Query: 523 VAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQI 582
V GLD QE E DR SLLLPG Q LV V++ +K PVILV+ GGP+DV+FA+ + +I
Sbjct: 489 VVGLDKDQEREGLDRTSLLLPGYQQDLVLEVSKVAKGPVILVVMSGGPIDVTFAKGNCKI 548
Query: 583 SSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK-VPMNDMNMRADSSRQYP 641
SS+LW+GYPGEAG KA+A +IFGD NP GRLPMTWYP++F + V + +M++R ++S +P
Sbjct: 549 SSVLWVGYPGEAGGKAIARVIFGDHNPAGRLPMTWYPQAFAEHVSILNMHLRPNTSTGFP 608
Query: 642 GRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSR---L 698
GR+YRFYTG VY FGHGLSYTN++Y SAPS +T ++ + L++ G+R +
Sbjct: 609 GRTYRFYTGENVYEFGHGLSYTNFTYTNFSAPSNITARNTVAI---RTPLREDGARHFPI 665
Query: 699 DYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT--PEKQLIGFDRV 756
DY T C +L F V ++N G D H+ +L+A P S P KQLI F R
Sbjct: 666 DY------TGCEALAFKVVAYISNTGTRDSDHISLLYAIPPAASSSLSPPRKQLISFKRQ 719
Query: 757 HTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
H +A ++ F VD C+ L + N+ G ++L G++ L +G++ H +++
Sbjct: 720 HLIAGRCAKVEFDVDTCKDLGLTNEAGTKVLVHGDYKLSLGDIEHVISL 768
>gi|253761872|ref|XP_002489310.1| hypothetical protein SORBIDRAFT_0010s010920 [Sorghum bicolor]
gi|241946958|gb|EES20103.1| hypothetical protein SORBIDRAFT_0010s010920 [Sorghum bicolor]
Length = 772
Score = 770 bits (1988), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 378/776 (48%), Positives = 507/776 (65%), Gaps = 41/776 (5%)
Query: 40 PDFPCKPPHFDS---YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAY 96
P F C P ++ FC+ +LS + RA L+S LT EKI QL D A+ +PRLG+P Y
Sbjct: 26 PPFSCGPTSAEASEGLAFCDVTLSPAQRAADLVSRLTPAEKIAQLGDQATGVPRLGVPGY 85
Query: 97 EWWSESLHGIASNGPGVNFN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAM 154
+WW+E+LHG+A++G G++F+ G V + TSFPQVL++AA+F+ LW IG A+ EARA+
Sbjct: 86 KWWNEALHGLATSGKGLHFDVVGGVRAATSFPQVLLTAAAFDDDLWFRIGQAIGREARAL 145
Query: 155 YNLGQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGG 213
+N+GQA GLT W+PN+NIFRDPRWGRGQETPGEDP V S YAV FV+ QG + S
Sbjct: 146 FNVGQAEGLTIWSPNVNIFRDPRWGRGQETPGEDPAVASRYAVAFVRGIQGNSSSS---- 201
Query: 214 IGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTED 273
L SACCKH AYDLE W +RYSF A +T QD ED
Sbjct: 202 ----------------------LLQTSACCKHATAYDLEDWNGVARYSFVARVTAQDLED 239
Query: 274 TFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVAT 332
TF PPFRSC+ +GKASCIMC+Y +NGVPAC DL R +WG GY+ SDCDAVA
Sbjct: 240 TFNPPFRSCVVEGKASCIMCAYTAINGVPACANTDLLTGTVRGDWGLDGYVASDCDAVAI 299
Query: 333 IFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQ 392
+ + Q Y T ED+ A LKAG+DI+CG+ + +H +AI +GK+ E DID+AL+NLF+V+
Sbjct: 300 MRDAQRYAPTPEDAVAVSLKAGLDIDCGSYIQQHATAAIQQGKLTELDIDKALVNLFAVR 359
Query: 393 LRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSL 452
+RLG F+GDPRK YG L D+CT EH+ LAL+AA+ GIVLLKND LPL+++ V+S
Sbjct: 360 MRLGHFDGDPRKNMYGALSAADICTPEHRSLALEAAQDGIVLLKNDGGILPLDRSTVTSA 419
Query: 453 AIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVR 512
A+IGP N+ + Y G PC + L+GL++YV+ + +GC C+ A +AV
Sbjct: 420 AVIGPNSNDGMALIANYFGPPCESTTPLQGLQSYVNNVRFLAGCSSAACDV-AVTDQAVV 478
Query: 513 IAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLD 572
++ D+V + GL QE+E +DR SLLLPG Q SL+T+VA SKRPVILVL GGP+D
Sbjct: 479 LSGSEDYVFLFMGLSQQQESEGKDRTSLLLPGMQQSLITAVADASKRPVILVLLSGGPVD 538
Query: 573 VSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNM 632
++FA+++ +I +ILW GYPG+AG A+A+++FGD NP GRLPMTWYPE FTKVPM DM M
Sbjct: 539 ITFAQSNPKIGAILWAGYPGQAGGLAIAKVLFGDHNPSGRLPMTWYPEDFTKVPMTDMRM 598
Query: 633 RADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQ 692
RAD + YPGRSYRFY G VY FG+GLSY+ +S + L S +S+++ AG + + +
Sbjct: 599 RADPTSGYPGRSYRFYQGNAVYKFGYGLSYSTFSSRLLYGTSMPALSSTVLAGLRETVTE 658
Query: 693 QTGSRLDYVHIDEVTS--CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQL 750
+ G R HID++ + C L+F + V N G +DG H ++F R P + G P QL
Sbjct: 659 E-GDR--SYHIDDIGTDGCEQLKFPAMVEVQNHGPMDGKHSALMFLRWPNTNGGRPASQL 715
Query: 751 IGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
IGF H A + + F + PCE S G +++ +G+H L V H++ I
Sbjct: 716 IGFMSQHLKAGETANLRFDISPCEHFSRVRADGMKVIDIGSHFLTVDN--HAIEIR 769
>gi|15218202|ref|NP_177929.1| putative beta-D-xylosidase 7 [Arabidopsis thaliana]
gi|259585708|sp|Q9SGZ5.2|BXL7_ARATH RecName: Full=Probable beta-D-xylosidase 7; Short=AtBXL7; Flags:
Precursor
gi|18086336|gb|AAL57631.1| At1g78060/F28K19_32 [Arabidopsis thaliana]
gi|332197942|gb|AEE36063.1| putative beta-D-xylosidase 7 [Arabidopsis thaliana]
Length = 767
Score = 770 bits (1987), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 393/776 (50%), Positives = 516/776 (66%), Gaps = 42/776 (5%)
Query: 40 PDFPCKP--PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYE 97
P C P P Y FC T L I RA+ L+S LT+ EKI QL + A IPRLG+PAYE
Sbjct: 22 PPHSCDPSNPTTKLYQFCRTDLPIGKRARDLVSRLTIDEKISQLVNTAPGIPRLGVPAYE 81
Query: 98 WWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL 157
WWSE+LHG+A GPG+ FNGTV + TSFPQV+++AASF+ W I + EAR +YN
Sbjct: 82 WWSEALHGVAYAGPGIRFNGTVKAATSFPQVILTAASFDSYEWFRIAQVIGKEARGVYNA 141
Query: 158 GQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGF 216
GQA G+TFWAPNINIFRDPRWGRGQETPGEDPM+ YAV +V+ QG++
Sbjct: 142 GQANGMTFWAPNINIFRDPRWGRGQETPGEDPMMTGTYAVAYVRGLQGDS---------- 191
Query: 217 GFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQ 276
F ++ L + L SACCKH AYDL++W +RY FNA ++ D +T+Q
Sbjct: 192 -FDGRKTLS---------NHLQASACCKHFTAYDLDRWKGITRYVFNAQVSLADLAETYQ 241
Query: 277 PPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFE 335
PPF+ CIE+G+AS IMC+YN+VNG+P+C +L + AR +W F+GYITSDCDAV+ I++
Sbjct: 242 PPFKKCIEEGRASGIMCAYNRVNGIPSCADPNLLTRTARGQWAFRGYITSDCDAVSIIYD 301
Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
Q Y K+ ED+ A VLKAGMD+NCG+ + +HT+SA+ + KV E DIDRALLNLFSV++RL
Sbjct: 302 AQGYAKSPEDAVADVLKAGMDVNCGSYLQKHTKSALQQKKVSETDIDRALLNLFSVRIRL 361
Query: 396 GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAII 455
GLFNGDP K YG + P++VC+ H+ LALDAAR GIVLLKN+ K LP +K +VSSLA+I
Sbjct: 362 GLFNGDPTKLPYGNISPNEVCSPAHQALALDAARNGIVLLKNNLKLLPFSKRSVSSLAVI 421
Query: 456 GPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAK 515
GP + + + G Y G PC + L L +YV Y GC V C S+A +AV IAK
Sbjct: 422 GPNAHVVKTLLGNYAGPPCKTVTPLDALRSYVKNAVYHQGCDSVAC-SNAAIDQAVAIAK 480
Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
AD V+++ GLD TQE ED DRV L LPG+Q L+TSVA +K+PV+LVL GGP+D+SF
Sbjct: 481 NADHVVLIMGLDQTQEKEDFDRVDLSLPGKQQELITSVANAAKKPVVLVLICGGPVDISF 540
Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRAD 635
A +++I SI+W GYPGEAG A++EIIFGD NPGGRLP+TWYP+SF + M DM MR
Sbjct: 541 AANNNKIGSIIWAGYPGEAGGIAISEIIFGDHNPGGRLPVTWYPQSFVNIQMTDMRMR-- 598
Query: 636 SSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLS-APSELTISASLKAGSDKNILQQT 694
S+ YPGR+Y+FY G +VY FGHGLSY+ YSY+F + A + L ++ S KA Q
Sbjct: 599 SATGYPGRTYKFYKGPKVYEFGHGLSYSAYSYRFKTLAETNLYLNQS-KA-------QTN 650
Query: 695 GSRLDYVHIDEV--TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQG--TPEKQL 750
+ Y + E+ C + V + V N G++ G H V++FAR + + EKQL
Sbjct: 651 SDSVRYTLVSEMGKEGCDVAKTKVTVEVENQGEMAGKHPVLMFARHERGGEDGKRAEKQL 710
Query: 751 IGFDRVHTVAKGSK-EISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
+GF + ++ G K E+ F + CE LS AN+ G +L G + L VG+ L +
Sbjct: 711 VGFKSI-VLSNGEKAEMEFEIGLCEHLSRANEFGVMVLEEGKYFLTVGDSELPLIV 765
>gi|356552866|ref|XP_003544783.1| PREDICTED: probable beta-D-xylosidase 7-like [Glycine max]
Length = 776
Score = 769 bits (1985), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 380/761 (49%), Positives = 508/761 (66%), Gaps = 34/761 (4%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
YPFCNT L IS RA+ L+S LTL EK+ QL + A AIPRLGIP+Y+WWSE+LHG+A G
Sbjct: 41 YPFCNTRLPISKRAQDLVSRLTLDEKLAQLVNTAPAIPRLGIPSYQWWSEALHGVADAGF 100
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA-GLTFWAPNIN 170
G+ FNGT+ S TSFPQV+++AASF+ +LW I + EARA+YN GQA G+TFWAPNIN
Sbjct: 101 GIRFNGTIKSATSFPQVILTAASFDPNLWYQISKTIGKEARAVYNAGQATGMTFWAPNIN 160
Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
+FRDPRWGRGQET GEDP++ + Y V +V+ QG+++ E
Sbjct: 161 VFRDPRWGRGQETAGEDPLMNAKYGVAYVRGLQGDSF---------------------EG 199
Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
G+ L SACCKH AYDL+ W R+ ++A +T QD DT+QPPF+SCIEQG+AS
Sbjct: 200 GKLGERLQASACCKHFTAYDLDHWKGLDRFVYDARVTSQDLADTYQPPFQSCIEQGRASG 259
Query: 291 IMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
IMC+YN+VNGVP C +L K AR +W F GYITSDC AV+ I + Q Y KT ED+ A
Sbjct: 260 IMCAYNRVNGVPNCANFNLLTKTARQQWKFDGYITSDCGAVSIIHDEQGYAKTAEDAIAD 319
Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGK 409
V +AGMD+ CG + +H +SA+ + K+ IDRAL NLFS+++RLGL +G+P K +G
Sbjct: 320 VFRAGMDVECGDYITKHGKSAVSQKKLPISQIDRALQNLFSIRIRLGLLDGNPTKLPFGT 379
Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMG-GG 468
+GPD VC+ + +LAL+AAR GIVLLKN LPL K ++A+IGP N S++ G
Sbjct: 380 IGPDQVCSKQSLQLALEAARDGIVLLKNTNSLLPLPKTN-PTIALIGPNANASSKVFLGN 438
Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDL 528
Y G PC+ +LL+G E Y T Y GC D P + A AV +AKK D+V++V GLD
Sbjct: 439 YYGRPCNLVTLLQGFEGYAKDTVYHPGCDDGPQCAYAQIEGAVEVAKKVDYVVLVMGLDQ 498
Query: 529 TQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWI 588
+QE E DR L LPG+Q L+ SVAR SKRPV+LVL GGP+D++ A+ D ++ ILW
Sbjct: 499 SQERESHDREYLGLPGKQEELIKSVARASKRPVVLVLLCGGPVDITSAKFDDKVGGILWA 558
Query: 589 GYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFY 648
GYPGE G ALA+++FGD NPGG+LP+TWYP+ F KVPM DM MRAD + YPGR+YRFY
Sbjct: 559 GYPGELGGVALAQVVFGDHNPGGKLPITWYPKDFIKVPMTDMRMRADPASGYPGRTYRFY 618
Query: 649 TGTQVYGFGHGLSYTNYSYKFLS-APSELTISASLKAGSDKNILQQTGSRLDYVHIDEVT 707
TG +VY FG+GLSYT YSYK LS + + L I+ S ++ Q + Y + E+
Sbjct: 619 TGPKVYEFGYGLSYTKYSYKLLSLSHNTLHINQ-----SSTHLTTQNSETIRYKLVSELA 673
Query: 708 --SCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKV-SQGTPEKQLIGFDRVHTVAKGSK 764
+C ++ + + VTN G++ G H V+LF R KV + G P KQL+GF V A +
Sbjct: 674 EETCQTMLLSIALGVTNHGNMAGKHPVLLFVRQGKVRNNGNPVKQLVGFQSVKLNAGETV 733
Query: 765 EISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
++ F + PCE LS+AN+ G ++ G+++L+VG+ + + I
Sbjct: 734 QVGFELSPCEHLSVANEAGSMVIEEGSYLLLVGDQEYPIEI 774
>gi|357489441|ref|XP_003615008.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
gi|355516343|gb|AES97966.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
Length = 798
Score = 768 bits (1982), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 385/789 (48%), Positives = 520/789 (65%), Gaps = 49/789 (6%)
Query: 40 PDFPCKP--PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYE 97
P + C P S PFCN +L+I+ RAK ++S LTL EKI QL + A +IPRLGIP+Y+
Sbjct: 34 PPYSCDTTNPLTKSLPFCNLNLTITQRAKDIVSRLTLDEKISQLVNTAPSIPRLGIPSYQ 93
Query: 98 WWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL 157
WW E+LHG+A+ G G+ NG+V+ TSFPQV+++AASF+ LW I + EAR +YN
Sbjct: 94 WWDEALHGVANAGKGIRLNGSVAGATSFPQVILTAASFDSKLWYQISKVIGTEARGVYNA 153
Query: 158 GQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGF 216
GQA G+TFWAPNINIFRDPRWGRGQET GEDP+V S Y V +V+ QG+++ +GG
Sbjct: 154 GQAQGMTFWAPNINIFRDPRWGRGQETAGEDPLVNSKYGVSYVRGLQGDSF---EGG--- 207
Query: 217 GFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNA------------ 264
K + GD L SACCKH AYDL+ W R+ F+A
Sbjct: 208 ----KLI----------GDRLKASACCKHFTAYDLDNWKGLDRFDFDAKVSFLFSMAYSP 253
Query: 265 ----MITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGF 319
+T QD DT+QPPF SCI QG++S IMC+YN+VNGVP C +L K AR +W F
Sbjct: 254 WMINYVTLQDLADTYQPPFHSCIVQGRSSGIMCAYNRVNGVPNCADYNLLTKTARQKWNF 313
Query: 320 KGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEK 379
GYITSDC+AV I++ Q Y KT ED+ A VL+AGMD+ CG + +H ++A+ + KV
Sbjct: 314 NGYITSDCEAVRIIYDNQGYAKTPEDAVADVLQAGMDVECGDYLTKHAKAAVLQKKVPIS 373
Query: 380 DIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDK 439
IDRAL NLF++++RLGLF+G+P K +YG++GP+ VC+ E+ LAL+AAR GIVLLKN
Sbjct: 374 QIDRALHNLFTIRIRLGLFDGNPTKLQYGRIGPNQVCSKENLDLALEAARSGIVLLKNTA 433
Query: 440 KFLPLNKNAVSSLAIIGPLVNNISQMG-GGYTGIPCSPKSLLRGLEAYVSKTHYASGCHD 498
LPL + V++L +IGP N S++ G Y G PC +L+G Y S+THY SGC D
Sbjct: 434 SILPLPR--VNTLGVIGPNANKSSKVVLGNYFGRPCRLVPILKGFYTYASQTHYRSGCLD 491
Query: 499 VPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSK 558
+ A AV +AK +D+VI+V GLD +QE E RDR L LPG+Q L+ SVA+ SK
Sbjct: 492 GTKCASAEIDRAVEVAKISDYVILVMGLDQSQERESRDRDDLELPGKQQELINSVAKASK 551
Query: 559 RPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWY 618
+PVILVL GGP+D++FA+ + +I I+W GYPGE G +ALA+++FGD+NPGGRLPMTWY
Sbjct: 552 KPVILVLLCGGPVDITFAKNNDKIGGIIWAGYPGELGGRALAQVVFGDYNPGGRLPMTWY 611
Query: 619 PESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTI 678
P+ F K+PM DM MRAD S YPGR+YRFYTG +VY FG+GLSY+NYSY F+S +
Sbjct: 612 PKDFIKIPMTDMRMRADPSSGYPGRTYRFYTGPKVYEFGYGLSYSNYSYNFISVKNN--- 668
Query: 679 SASLKAGSDKNILQQTGSRLDYVHIDEV--TSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
+ + + +IL+ + + + Y + E+ +C ++ V + +TN G + G H V+LF
Sbjct: 669 NLHINQSTTHSILENSET-IYYKLVSELGEETCKTMSISVTLGITNTGSMAGKHPVLLFV 727
Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
+ K G P KQL+GF+ V G E+ F V CE LS AN+ G +++ G H+L+V
Sbjct: 728 KPKKGRNGNPVKQLVGFESVTVEGGGKGEVGFEVSVCEHLSRANESGVKVIEEGGHLLVV 787
Query: 797 GELRHSLTI 805
GE +S+ I
Sbjct: 788 GEEEYSINI 796
>gi|414588273|tpg|DAA38844.1| TPA: putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 775
Score = 767 bits (1981), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 383/777 (49%), Positives = 500/777 (64%), Gaps = 32/777 (4%)
Query: 36 VLNKPDFPCKPPHFD-SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIP 94
V + P F C P +YPFC+ SL + RA L+S LT+ EK+ QL D A+ +PRLG+P
Sbjct: 24 VASDPMFSCGPSSASRAYPFCDRSLPAARRAADLVSRLTVAEKVSQLGDEAAGVPRLGVP 83
Query: 95 AYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAM 154
Y+WWSE LHG+A G G+ FNGTVS+VTSFPQVL++ ASF+ SLW IG A+ EARA+
Sbjct: 84 PYKWWSEGLHGLAFWGHGMRFNGTVSAVTSFPQVLLTTASFDESLWFRIGQAIGREARAL 143
Query: 155 YNLGQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGG 213
YNLGQA GLT W+PN+NIFRDPRWGRGQETPGEDP V S YAV FV+ QG N
Sbjct: 144 YNLGQAEGLTIWSPNVNIFRDPRWGRGQETPGEDPAVASKYAVAFVRGIQGSN------- 196
Query: 214 IGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTED 273
L SACCKH AYDLE W +RY+F+A +T QD D
Sbjct: 197 ---------------PAGAAAAPLQASACCKHATAYDLEDWNGVARYNFDARVTLQDLAD 241
Query: 274 TFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVAT 332
TF PPF+SC+ GKASC+MC+Y +NGVPAC DL K R WG GY++SDCDAVA
Sbjct: 242 TFNPPFQSCVVDGKASCVMCAYTVINGVPACASSDLLTKTFRGAWGLDGYVSSDCDAVAI 301
Query: 333 IFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQ 392
+ + Q Y T ED+ A LKAG+D+NCGT +H +AI +GK+ EKD+D+AL NLF+V+
Sbjct: 302 MRDAQRYEPTPEDTVAVALKAGLDLNCGTYTQQHGMAAIQQGKMTEKDVDKALTNLFAVR 361
Query: 393 LRLGLFNGDPR-KGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSS 451
+RLG F+GDPR YG+LG DVCT++HK LAL+AA+ GIVLLKND LPL+++AV S
Sbjct: 362 MRLGHFDGDPRGNALYGRLGAADVCTADHKNLALEAAQDGIVLLKNDAGILPLDRSAVGS 421
Query: 452 LAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAV 511
A+IG N+ + G Y G C + L GL++YV + +GC C A +A
Sbjct: 422 AAVIGHNANDPLVLSGNYFGPACETTTPLEGLQSYVRNVRFLAGCSSAACGYAA-TGQAA 480
Query: 512 RIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPL 571
+A A++V + GL QE E DR SLLLPG+Q SLVT+VA +KRPV+LVL GGP+
Sbjct: 481 ALASSAEYVFLFMGLSQDQEKEGLDRTSLLLPGKQQSLVTAVASAAKRPVVLVLLTGGPV 540
Query: 572 DVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMN 631
D++FA+++ +I +ILW GYPG+AG A+A ++FGD NP GRLP+TWY E FTKVPM DM
Sbjct: 541 DITFAQSNPKIGAILWAGYPGQAGGLAIARVLFGDHNPSGRLPVTWYTEDFTKVPMTDMR 600
Query: 632 MRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNIL 691
MRAD + YPGR+YRFY G +Y FG+GLSY+ +S + ++ L + SL A
Sbjct: 601 MRADPATGYPGRTYRFYRGKTIYKFGYGLSYSKFSRQLVTGDKNLAPNTSLLAHLSAKTQ 660
Query: 692 QQTGSRLDYVHIDEVTS--CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQ 749
S Y H+D++ + C L+F ++ V N G +DG H V++F R P + G P +Q
Sbjct: 661 HAATS---YYHVDDIGTVGCEQLKFPAEVEVLNHGPMDGKHSVLMFLRWPNATDGRPVRQ 717
Query: 750 LIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
LIGF H A + F V PCE S G++++ G+H LMVG+ ++ E
Sbjct: 718 LIGFRSQHIKAGEKANVRFHVSPCEHFSRTRADGKKVIDRGSHFLMVGKEELEISFE 774
>gi|356548162|ref|XP_003542472.1| PREDICTED: probable beta-D-xylosidase 7-like [Glycine max]
Length = 778
Score = 765 bits (1975), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 378/763 (49%), Positives = 516/763 (67%), Gaps = 37/763 (4%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
Y FCNT L I+ RA+ L+S LTL EK+ QL + A AIPRLGIP+Y+WWSE+LHG+A G
Sbjct: 42 YSFCNTKLPITKRAQDLVSRLTLDEKLAQLVNTAPAIPRLGIPSYQWWSEALHGVADAGF 101
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA-GLTFWAPNIN 170
G+ FNGT+ S TSFPQV+++AASF+ +LW I + EARA+YN GQA G+TFWAPNIN
Sbjct: 102 GIRFNGTIKSATSFPQVILTAASFDPNLWYQISKTIGREARAVYNAGQATGMTFWAPNIN 161
Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
+FRDPRWGRGQET GEDP++ + Y V +V+ QG++++ G+
Sbjct: 162 VFRDPRWGRGQETAGEDPLMNAKYGVAYVRGLQGDSFEG------------------GKL 203
Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
++R L SACCKH AYDL++W R+ F+A +T QD DT+QPPF+SCIEQG+AS
Sbjct: 204 AER---LQASACCKHFTAYDLDQWKGLDRFVFDARVTSQDLADTYQPPFQSCIEQGRASG 260
Query: 291 IMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
IMC+YN+VNGVP C +L K AR +W F GYITSDC AV+ I E Q Y KT ED+ A
Sbjct: 261 IMCAYNRVNGVPNCADFNLLTKTARQQWKFDGYITSDCGAVSIIHEKQGYAKTAEDAIAD 320
Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGK 409
V +AGMD+ CG + +H +SA+ + K+ IDRAL NLFS+++RLGLF+G+P K +G
Sbjct: 321 VFRAGMDVECGDYITKHAKSAVFQKKLPISQIDRALQNLFSIRIRLGLFDGNPTKLPFGT 380
Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMG-GG 468
+GP++VC+ + +LAL+AAR GIVLLKN LPL K ++A+IGP N S++ G
Sbjct: 381 IGPNEVCSKQSLQLALEAARDGIVLLKNTNSLLPLPKTN-PTIALIGPNANASSKVFLGN 439
Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDL 528
Y G PC+ +LL+G E Y +KT Y GC D P + A EAV +AKK D+V++V GLD
Sbjct: 440 YYGRPCNLVTLLQGFEGY-AKTVYHPGCDDGPQCAYAQIEEAVEVAKKVDYVVLVMGLDQ 498
Query: 529 TQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWI 588
+QE E DR L LPG+Q L+ SVAR +KRPV++VL GGP+D++ A+ D ++ ILW
Sbjct: 499 SQERESHDREYLGLPGKQEELIKSVARAAKRPVVVVLLCGGPVDITSAKFDDKVGGILWA 558
Query: 589 GYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFY 648
GYPGE G ALA+++FGD NPGG+LP+TWYP+ F KVPM DM MRAD + YPGR+YRFY
Sbjct: 559 GYPGELGGVALAQVVFGDHNPGGKLPITWYPKDFIKVPMTDMRMRADPASGYPGRTYRFY 618
Query: 649 TGTQVYGFGHGLSYTNYSYKFLS-APSELTISASLKAGSDKNILQQTGSRLDYVHIDEVT 707
TG +VY FG+GLSYT YSYK LS + S L I+ S +++ Q + Y + E+
Sbjct: 619 TGPKVYEFGYGLSYTKYSYKLLSLSHSTLHINQ-----SSTHLMTQNSETIRYKLVSELA 673
Query: 708 --SCTSLRFHVQISVTNAGDVDGSHVVMLFAR---VPKVSQGTPEKQLIGFDRVHTVAKG 762
+C ++ + + VTN G++ G H V+LF R V ++ G P KQL+GF V A
Sbjct: 674 EETCQTMLLSIALGVTNRGNLAGKHPVLLFVRQGKVRNINNGNPVKQLVGFQSVKVNAGE 733
Query: 763 SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
+ ++ F + PCE LS+AN+ G ++ G+++ +VG+ + + +
Sbjct: 734 TVQVGFELSPCEHLSVANEAGSMVIEEGSYLFIVGDQEYPIEV 776
>gi|358349509|ref|XP_003638778.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
gi|355504713|gb|AES85916.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
Length = 776
Score = 764 bits (1974), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 386/773 (49%), Positives = 510/773 (65%), Gaps = 33/773 (4%)
Query: 40 PDFPC--KPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYE 97
P F C P SYPFCN L I+ R K L+S LTL EK+ QL ++A IPRLGIPAYE
Sbjct: 28 PPFACDYSNPSTRSYPFCNPKLPITQRTKDLVSRLTLDEKLAQLVNSAPPIPRLGIPAYE 87
Query: 98 WWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL 157
WWSE+LHGI + G G+ FNG+++S TSFPQV+++AASF+ LW IG A+ VEARA+YN
Sbjct: 88 WWSEALHGIGNVGRGIFFNGSITSATSFPQVILTAASFDSHLWYRIGQAIGVEARAIYNG 147
Query: 158 GQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGF 216
GQA G+TFWAPNINIFRDPRWGRGQET GEDPM+ S YAV +V+ QG++++ GG
Sbjct: 148 GQAMGMTFWAPNINIFRDPRWGRGQETAGEDPMMTSNYAVSYVRGLQGDSFQ---GG--- 201
Query: 217 GFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQ 276
L+G L SACCKH AYDL+ W +R+ F+A ++ QD DT+Q
Sbjct: 202 ------KLRG---------HLQASACCKHFTAYDLDNWKGVNRFHFDARVSLQDLADTYQ 246
Query: 277 PPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFE 335
PPFRSCIEQG+AS IMC+YN+VNG+P+C +L R +W F GYI SDC AV I +
Sbjct: 247 PPFRSCIEQGRASGIMCAYNRVNGIPSCADFNLLTNTVRKQWEFHGYIVSDCGAVGIIHD 306
Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
Q Y K+ ED+ A VL AGMD+ CG+ + H +SA+ + K+ IDRAL NLFS+++RL
Sbjct: 307 EQGYAKSAEDAVADVLHAGMDLECGSYLTDHAKSAVQQKKLPIVRIDRALHNLFSIRIRL 366
Query: 396 GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAII 455
G F+G+P K +G +GP+ VC+ H LAL+AAR GIVLLKN LPL K ++S LA+I
Sbjct: 367 GQFDGNPAKLPFGMIGPNHVCSENHLYLALEAARNGIVLLKNTASLLPLPKTSIS-LAVI 425
Query: 456 GPLVN-NISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIA 514
GP N + + G Y G PC ++L+G + YV + GC P + A +AV++A
Sbjct: 426 GPNANASPLTLLGNYAGPPCKSITILQGFQHYVKNAVFHPGCDGGPKCASAPIDKAVKVA 485
Query: 515 KKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
K AD+V++V GLD + E E+RDRV L LPG+Q+ L+ SVA+ SKRPVILVL GGP+D+S
Sbjct: 486 KNADYVVLVMGLDQSVEREERDRVHLDLPGKQLELINSVAKASKRPVILVLLCGGPIDIS 545
Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA 634
A+ + +I I+W GYPGE G ALA+IIFGD NPGGRLP+TWYP+ + KVPM DM MRA
Sbjct: 546 SAKNNDKIGGIIWAGYPGELGGIALAQIIFGDHNPGGRLPITWYPKDYIKVPMTDMRMRA 605
Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
D + YPGR+YRFY G VY FGHGLSYT YSY+F+S +T S +++ +
Sbjct: 606 DPTTGYPGRTYRFYKGPTVYEFGHGLSYTKYSYEFVS----VTHDKLHFNQSSTHLMTEN 661
Query: 695 GSRLDYVHIDEV--TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIG 752
+ Y + E+ +C S+ V + V N G++ G H ++LF R K +P KQL+G
Sbjct: 662 SETIRYKLVSELDEETCKSMSVSVTVGVKNHGNIVGRHPILLFMRPQKHRTRSPMKQLVG 721
Query: 753 FDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
F + A + F + PCE LS AN+ G +I+ G+H+L VGE + + I
Sbjct: 722 FHSLLLDAGEMSHVGFELSPCEHLSRANEAGLKIIEEGSHLLHVGEEEYLIDI 774
>gi|224066929|ref|XP_002302284.1| predicted protein [Populus trichocarpa]
gi|222844010|gb|EEE81557.1| predicted protein [Populus trichocarpa]
Length = 742
Score = 764 bits (1972), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 394/806 (48%), Positives = 517/806 (64%), Gaps = 81/806 (10%)
Query: 12 ERILFRVLQLIVV--VNVIAFSNSKPVLNKPDFPCKP--PHFDSYPFCNTSLSISTRAKS 67
+ + R+L LI + ++ + S +P + C P YPFC T L IS R +
Sbjct: 4 QNLCLRILILIAIHTTSLHLYVES----TQPPYSCDSSDPSTKLYPFCQTKLPISQRVED 59
Query: 68 LISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIA---SNGPGVNFNGTVSSVTS 124
L+S LTL EK+ QL D A AIPRLGIPAYEWWSE+LHG+A + G+ FNGT+ TS
Sbjct: 60 LVSRLTLDEKVSQLVDTAPAIPRLGIPAYEWWSEALHGVALQTTVRQGIRFNGTIRFATS 119
Query: 125 FPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA-GLTFWAPNINIFRDPRWGRGQET 183
FPQV+++AASF+ LW IG + EAR +YN GQA G+TFWAPNINIFRDPRWGRGQET
Sbjct: 120 FPQVILTAASFDAHLWYRIGQVIGKEARGIYNAGQATGMTFWAPNINIFRDPRWGRGQET 179
Query: 184 PGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACC 243
PGEDP+V YAV +V+ QG+++ GG G++L SACC
Sbjct: 180 PGEDPLVAGKYAVSYVRGVQGDSF----GG-----------------GTLGEQLQASACC 218
Query: 244 KHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPA 303
KH AYDL+KW +R+ F+A QD DT+QPPF+SCI++GKAS IMC+YN+VNGVP
Sbjct: 219 KHFTAYDLDKWKGMNRFVFDA----QDLADTYQPPFQSCIQEGKASGIMCAYNRVNGVPN 274
Query: 304 CLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTC 362
C +L +KAR +WGF GYITSDCDAVA I + Q Y K+ ED+ A VLKAGMD+NCG
Sbjct: 275 CADYNLLSKKARGQWGFYGYITSDCDAVAIIHDDQGYAKSPEDAVADVLKAGMDVNCGDY 334
Query: 363 MLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKK 422
+ +T+SA+ K K+ E +IDRAL NLFS+++RLGLFNG+P K YG + PD VC+ EH+
Sbjct: 335 LKNYTKSAVKKKKLPESEIDRALHNLFSIRMRLGLFNGNPTKQPYGNIAPDQVCSQEHQA 394
Query: 423 LALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRG 482
LAL AA+ GIVLLKN K LPL+K SLA+IGP NN +++ G Y G PC + L+G
Sbjct: 395 LALKAAQDGIVLLKNPDKLLPLSKLETKSLAVIGPNANNSTKLLGNYFGPPCKTVTPLQG 454
Query: 483 LEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLL 542
L+ Y+ T Y GC V C+S A ++AV+IAK AD VI+V GLD TQE E++DRV L+L
Sbjct: 455 LQNYIKNTRYHPGCSRVACSS-ASINQAVKIAKGADQVILVMGLDQTQEKEEQDRVDLVL 513
Query: 543 PGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEI 602
PG+Q L+T+VA+ +K+PV+LVL GGP+DVSFA+ D I SI+W GYPGEAG ALA+I
Sbjct: 514 PGKQRELITAVAKAAKKPVVLVLFCGGPVDVSFAKYDQNIGSIIWAGYPGEAGGTALAQI 573
Query: 603 IFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
IFGD NPGGRLPMTWYP+ FTKVPM DM MR S YPGR+YRFY G +V+ FG+GLSY
Sbjct: 574 IFGDHNPGGRLPMTWYPQDFTKVPMTDMRMRPQLSSGYPGRTYRFYNGKKVFEFGYGLSY 633
Query: 663 TNYSYKFLS-APSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTS--CTSLRFHVQIS 719
+NYSY+ S ++L + AS N + + + + + I + C +F V +
Sbjct: 634 SNYSYELASDTQNKLYLRAS------SNQITKNSNTIRHKLISNIGKELCEKTKFTVTVR 687
Query: 720 VTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIA 779
V N G++ G + EI + + PCE LS
Sbjct: 688 VKNHGEMAGENA---------------------------------EIQYELSPCEHLSSP 714
Query: 780 NKHGRRILPLGNHVLMVGELRHSLTI 805
+ G ++ G+ L++G+ + +TI
Sbjct: 715 DDRGMMVMEEGSQFLLIGDKEYPITI 740
>gi|302791321|ref|XP_002977427.1| hypothetical protein SELMODRAFT_106899 [Selaginella moellendorffii]
gi|300154797|gb|EFJ21431.1| hypothetical protein SELMODRAFT_106899 [Selaginella moellendorffii]
Length = 772
Score = 763 bits (1970), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/769 (48%), Positives = 505/769 (65%), Gaps = 32/769 (4%)
Query: 44 CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESL 103
C S+PFC+ SL + R L+ + L EKI Q+ NAS IPRLGIP Y+WW E+L
Sbjct: 25 CTDSRSSSFPFCDVSLPVPDRVADLVGRMNLSEKIAQIVSNASGIPRLGIPGYQWWEEAL 84
Query: 104 HGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLT 163
HG+A + PGV F V S TSFPQV+++ ASFN SLW+ I A+++EA AMYN G++GLT
Sbjct: 85 HGVAES-PGVKFAAPVPSATSFPQVILTVASFNSSLWNKIAQAISIEAIAMYNAGRSGLT 143
Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
FW+PNINIFRDPRWGRGQETPGEDP++ S YA FV+ Q ++ D G +
Sbjct: 144 FWSPNINIFRDPRWGRGQETPGEDPLLSSKYAAYFVRGLQEGDY---DEGTAISTMQGSP 200
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
+ L +S+CCKH AYD+EK + FNA +T QD +DTF PPFRSCI
Sbjct: 201 TR-----------LKVSSCCKHFTAYDMEKSEGTDCFHFNAQVTVQDLQDTFDPPFRSCI 249
Query: 284 EQGKASCIMCSYNQVNGVPACLRGD-LFQKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
G+AS +MCSYN+VNGVP+C L + RN WGF+GYI SDCDAVA ++EY NYT T
Sbjct: 250 VDGQASGLMCSYNRVNGVPSCADYTFLTETVRNSWGFEGYIVSDCDAVALLYEYINYTTT 309
Query: 343 HEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDP 402
ED+ A VL AGMD+NCGT +LRHT +AI++GKV E +DRAL N+ +V++RLGLF+G+
Sbjct: 310 AEDAVADVLSAGMDLNCGTFLLRHTAAAIEQGKVTEAAVDRALSNVMTVRMRLGLFDGNS 369
Query: 403 RKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNI 462
+ Y +GPD VCT EH++L+L+AA QGIVLLKN LP +N + ++A+IGP N
Sbjct: 370 GE-TYNSIGPDAVCTPEHRQLSLEAAEQGIVLLKNSGNVLPFPRNDLMTIAVIGPSGNAT 428
Query: 463 SQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIV 522
M G Y G+PC + +GL+ Y + GC D+ CN F AVR A+ +D V++
Sbjct: 429 ETMLGNYAGVPCQYITPFQGLQEYTKCVVFEPGCKDIMCNDTTLFLAAVRAAENSDAVVI 488
Query: 523 VAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQI 582
V GLD QE E DR SLLLPG Q LV V++ +K PVILV+ GGP+DV+FA+ + +I
Sbjct: 489 VVGLDKDQEREGLDRTSLLLPGNQQGLVLEVSKVAKGPVILVVMSGGPIDVTFAKENCKI 548
Query: 583 SSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK-VPMNDMNMRADSSRQYP 641
S++LW+GYPGEAG KA+A +IFGD NP GRLPMTWYP++F + V + +M++R ++S +P
Sbjct: 549 SNVLWVGYPGEAGGKAIARVIFGDHNPAGRLPMTWYPQAFAEHVSILNMHLRPNTSTGFP 608
Query: 642 GRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSR---L 698
GR+YRFYTG VY FGHGLSYTN++Y APS +T ++ + L++ G+R +
Sbjct: 609 GRTYRFYTGENVYEFGHGLSYTNFTYTNFCAPSNITARNTVAI---RTPLREDGARQFPI 665
Query: 699 DYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT--PEKQLIGFDRV 756
DY T C +L F V ++N G D H+ +L+A P S P KQLI F R
Sbjct: 666 DY------TGCEALAFKVVAYISNTGTRDSDHISLLYAIPPAASSSLSPPRKQLISFKRQ 719
Query: 757 HTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
H +A ++ F VD C+ L + N+ G ++L G++ L +G++ H +++
Sbjct: 720 HLIAGRCAKVEFDVDTCKDLGLTNEAGTKVLVHGDYKLSLGDIEHVISL 768
>gi|297842585|ref|XP_002889174.1| glycosyl hydrolase family 3 protein [Arabidopsis lyrata subsp.
lyrata]
gi|297335015|gb|EFH65433.1| glycosyl hydrolase family 3 protein [Arabidopsis lyrata subsp.
lyrata]
Length = 766
Score = 762 bits (1968), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 393/780 (50%), Positives = 513/780 (65%), Gaps = 50/780 (6%)
Query: 40 PDFPCKP--PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYE 97
P C P P Y FC T L IS RA+ L+S L + EKI QL + A IPRLG+PAYE
Sbjct: 21 PPHSCDPSNPTTKLYQFCRTDLPISQRARDLVSRLNIDEKISQLGNTAPGIPRLGVPAYE 80
Query: 98 WWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL 157
WWSE+LHG+A GPG+ FNGTV + TSFPQV+++AASF+ W I + EAR +YN
Sbjct: 81 WWSEALHGVAYAGPGIRFNGTVKAATSFPQVILTAASFDSYEWFRIAQVIGKEARGVYNA 140
Query: 158 GQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGF 216
GQA G+TFWAPNINIFRDPRWGRGQETPGEDP++ YAV +V+ QG++
Sbjct: 141 GQAQGMTFWAPNINIFRDPRWGRGQETPGEDPIMTGTYAVAYVRGLQGDS---------- 190
Query: 217 GFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQ 276
F ++ L L SACCKH AYDL++W +RY FNA ++ D +T+Q
Sbjct: 191 -FDGRKTLS---------IHLQASACCKHFTAYDLDRWKGITRYVFNAQVSLADLAETYQ 240
Query: 277 PPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFE 335
PPF+ CIE+G+AS IMC+YN+VNG+P+C +L + AR W F+GYITSDCDAV+ I +
Sbjct: 241 PPFKKCIEEGRASGIMCAYNRVNGIPSCADPNLLTRTARGLWRFRGYITSDCDAVSIIHD 300
Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
Q Y KT ED+ A VLKAGMD+NCG+ + +HT+SA+ + KV E DIDRALLNLFSV++RL
Sbjct: 301 AQGYAKTPEDAVADVLKAGMDVNCGSYLQKHTKSALQQKKVSETDIDRALLNLFSVRIRL 360
Query: 396 GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAII 455
GLFNGDP K YG + P+DVC+ H+ LAL+AAR GIVLLKN+ K LP +K +VSSLA+I
Sbjct: 361 GLFNGDPTKLPYGNISPNDVCSPAHQALALEAARNGIVLLKNNLKLLPFSKRSVSSLAVI 420
Query: 456 GPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAK 515
GP + + G Y G PC + L L +YV Y +GC V C S+A +AV IA+
Sbjct: 421 GPNAHVAKTLLGNYAGPPCKTVTPLDALRSYVKNAVYHNGCDSVAC-SNAAIDQAVAIAR 479
Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
AD V+++ GLD TQE ED DRV L LPG+Q L+TSVA +K+PV+LVL GGP+D+SF
Sbjct: 480 NADHVVLIMGLDQTQEKEDMDRVDLSLPGKQQELITSVANAAKKPVVLVLICGGPVDISF 539
Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRAD 635
A + +I SI+W GYPGEAG ALAEIIFGD NPGGRLP+TWYP+SF V M DM MR
Sbjct: 540 ATNNDKIGSIMWAGYPGEAGGIALAEIIFGDHNPGGRLPVTWYPQSFVNVQMTDMRMR-- 597
Query: 636 SSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTG 695
S+ YPGR+Y+FY G +V+ FGHGLSY+ YSY+F + G+ L Q+
Sbjct: 598 SATGYPGRTYKFYKGPKVFEFGHGLSYSTYSYRFKT------------LGATNLYLNQSK 645
Query: 696 SRLD-----YVHIDEV--TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQG--TP 746
++L+ Y + E+ C + V ++V N G++ G H V++FAR + +
Sbjct: 646 AQLNSDSVRYTLVSEMGEEGCNIAKTKVIVTVENQGEMAGKHPVLMFARHERGGENGKRA 705
Query: 747 EKQLIGFDRVHTVAKGSK-EISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
EKQL+GF + ++ G K E+ F + CE LS AN+ G ++ G + L VG+ LTI
Sbjct: 706 EKQLVGFKSI-VLSNGEKAEMEFEIGLCEHLSRANEVGVMVVEEGKYFLTVGDSELPLTI 764
>gi|356525896|ref|XP_003531557.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 1-like
[Glycine max]
Length = 776
Score = 761 bits (1964), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/769 (48%), Positives = 507/769 (65%), Gaps = 40/769 (5%)
Query: 42 FPC---KPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEW 98
F C K P Y FC+ SLS+ R L+ LTLQEKI L ++A+++ RLGIP YEW
Sbjct: 41 FACDVAKNPALAGYGFCDKSLSLEDRVADLVKRLTLQEKIGSLVNSATSVSRLGIPKYEW 100
Query: 99 WSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
WSE+LHG+++ GPG +F+ V TSFP +++AASFN SL+ IG V+ EARAMYN+G
Sbjct: 101 WSEALHGVSNVGPGTHFSSLVPGATSFPMPILTAASFNASLFEAIGRVVSTEARAMYNVG 160
Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
AGLT+W+PNINIFRDPRWGRGQETPGEDP++ S YA +VK Q ++DDG
Sbjct: 161 LAGLTYWSPNINIFRDPRWGRGQETPGEDPLLSSKYATGYVKGLQ----QTDDG------ 210
Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
++L ++ACCKH AYDL+ W RY+FNA++T+QD +DTFQPP
Sbjct: 211 --------------DSNKLKVAACCKHYTAYDLDNWKGIQRYTFNAVVTQQDMDDTFQPP 256
Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQ 337
F+SC+ G + +MCSYNQVNG P C DL + R EW GYI SDCD+V +F+ Q
Sbjct: 257 FKSCVIDGNVASVMCSYNQVNGKPTCADPDLLKGVIRGEWKLNGYIVSDCDSVEVLFKDQ 316
Query: 338 NYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
+YTKT E++AA + AG+D+NCG + ++T+ A+ +G + E I+ A+ N F+ +RLG
Sbjct: 317 HYTKTPEEAAAETILAGLDLNCGNYLGQYTEGAVKQGLLDEASINNAVSNNFATLMRLGF 376
Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
F+GDP K YG LGP+DVCTSE+++LA +AARQGIVLLKN LPLN A+ SLA+IGP
Sbjct: 377 FDGDPSKQTYGNLGPNDVCTSENRELAREAARQGIVLLKNSLGSLPLNAKAIKSLAVIGP 436
Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKA 517
N M G Y GIPC+ S L+ L A V T YA+GC +V C ++A +A +IA A
Sbjct: 437 NANATRVMIGNYEGIPCNYISPLQALTALV-PTSYAAGCPNVQC-ANAELDDATQIAASA 494
Query: 518 DFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAE 577
D ++V G L E E DR+++LLPGQQ LV+ VA SK PVILV+ GG +DVSFA+
Sbjct: 495 DATVIVVGASLAIEAESLDRINILLPGQQQLLVSEVANASKGPVILVIMSGGGMDVSFAK 554
Query: 578 ADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADS 636
++ +I+SILW+GYPGEAG A+A++IFG +NP GRLPMTWYP+S+ KVPM +MNMRAD
Sbjct: 555 SNDKITSILWVGYPGEAGGAAIADVIFGFYNPSGRLPMTWYPQSYVNKVPMTNMNMRADP 614
Query: 637 SRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGS 696
+ YPGR+YRFY G V+ FG G+S++N +K + AP +++ + D
Sbjct: 615 ATGYPGRTYRFYKGETVFSFGDGISFSNIEHKIVKAPQLVSVPLA----EDHECRSSECM 670
Query: 697 RLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRV 756
LD DE C +L F + + V N G + SHVV+LF P V P+K L+GF++V
Sbjct: 671 SLDVA--DE--HCQNLAFDIHLGVKNMGKMSSSHVVLLFFTPPDV-HNAPQKHLLGFEKV 725
Query: 757 HTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
H K ++ F VD C+ LS+ ++ G R +PLG H+L VG L+H L++
Sbjct: 726 HLPGKSEAQVRFKVDICKDLSVVDELGNRKVPLGQHLLHVGNLKHQLSV 774
>gi|357152329|ref|XP_003576084.1| PREDICTED: probable beta-D-xylosidase 7-like [Brachypodium
distachyon]
Length = 779
Score = 761 bits (1964), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/776 (48%), Positives = 497/776 (64%), Gaps = 38/776 (4%)
Query: 40 PDFPCKP-PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEW 98
P F C P Y FC+ +L + RA L+S LTL EK+ QL D A A+PRLG+PAY+W
Sbjct: 34 PPFSCGPGSATQGYAFCDKALPVERRAADLVSRLTLAEKVSQLGDEADAVPRLGVPAYKW 93
Query: 99 WSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
WSE LHG++ G G++F+G V ++TSFPQVL++AASF++ +W IG A+ EARA+YNLG
Sbjct: 94 WSEGLHGLSFWGHGMHFDGAVRAITSFPQVLLTAASFDQDIWYRIGQAIGTEARALYNLG 153
Query: 159 QA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFG 217
QA GLT W+PN+NI+RDPRWGRGQETPGEDP S YAV FVK QG + +
Sbjct: 154 QAQGLTIWSPNVNIYRDPRWGRGQETPGEDPTTASKYAVAFVKGLQGTSATT-------- 205
Query: 218 FREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQP 277
L SACCKH AYDLE W RY+FNA +T QD DTF P
Sbjct: 206 -------------------LQTSACCKHATAYDLEDWNGVVRYNFNAKVTLQDLADTFNP 246
Query: 278 PFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEY 336
PF+SC+E+GKA+C+MC+Y +NGVPAC DL K + +WG GY++SDCDAVA + +
Sbjct: 247 PFKSCVEEGKATCVMCAYTNINGVPACASSDLITKTFKGDWGLNGYVSSDCDAVALLRDA 306
Query: 337 QNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
Q Y T ED+ A LKAG+D+NCG H SA+ +GK+ E+D+D AL NLF+V++RLG
Sbjct: 307 QRYRATPEDTVAVALKAGLDLNCGNYTQVHGMSALQQGKMTEQDVDNALKNLFAVRMRLG 366
Query: 397 LFNGDPRKGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAII 455
F+GDPR YG LG DVC+ HK LAL+AA+ GIVLLKND LPL+ +AV+S A I
Sbjct: 367 HFDGDPRTSALYGSLGAADVCSPAHKNLALEAAQSGIVLLKNDAGILPLDPSAVASAAAI 426
Query: 456 GPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAK 515
G N+ + + G Y G PC + L+GL+ YV + +GC C A +AV +A
Sbjct: 427 GHNANDPAALNGNYFGPPCETTTPLQGLQGYVKNVKFLAGCDSAACGF-AATGQAVTLAS 485
Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
+D+VI+ GL +E E DR SLLLPG+Q +L+T+VA SKRPVILVL GG +D++F
Sbjct: 486 SSDYVILFMGLSQKEEQEGIDRTSLLLPGKQQNLITAVASASKRPVILVLLTGGSVDITF 545
Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRAD 635
A+++ +I +ILW GYPG+AG A+A ++FGD NP GRLP+TWYPE FTKVPM DM MRAD
Sbjct: 546 AKSNPKIGAILWAGYPGQAGGLAIARVLFGDHNPSGRLPVTWYPEEFTKVPMTDMRMRAD 605
Query: 636 SSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTI-SASLKAGSDKNILQQT 694
+ YPGRSYRFY G VY FG GLSY+ +S + +S+ + + + +L G
Sbjct: 606 PATGYPGRSYRFYQGKTVYKFGDGLSYSKFSRQLVSSTNTHQVPNTNLLTGLTARTATDG 665
Query: 695 GSRLDYVHIDE--VTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPK-VSQGTPEKQLI 751
G + Y H++E V C L+F + V N G +DG H VM+F R P G P QL+
Sbjct: 666 G--MSYYHVEEIGVEGCDKLKFPAVVEVQNHGPMDGKHSVMMFLRWPNSTGTGRPVSQLV 723
Query: 752 GFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIET 807
GF H A ++F V PCE + A + G++++ G+H L+VG+ ++ +
Sbjct: 724 GFRSQHLKAGEKASLTFDVSPCEHFARAREDGKKVIDRGSHFLVVGKDEREISFHS 779
>gi|359485890|ref|XP_002264183.2| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
[Vitis vinifera]
Length = 774
Score = 759 bits (1961), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 383/772 (49%), Positives = 505/772 (65%), Gaps = 40/772 (5%)
Query: 40 PDFPC---KPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAY 96
P F C P + FCNTSL + R L+ LTL+EKI L ++A+++ RLGIP Y
Sbjct: 37 PVFACDVENNPTLGQFGFCNTSLETAARVADLVKRLTLEEKIGFLVNSAASVSRLGIPKY 96
Query: 97 EWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN 156
EWWSE+LHG++ GPG +FN V TSFPQV+++AASFN SL+ IG AV+ EARAMYN
Sbjct: 97 EWWSEALHGVSYVGPGTHFNSVVPGATSFPQVILTAASFNASLFEAIGKAVSTEARAMYN 156
Query: 157 LGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGF 216
+G AGLTFW+PN+NIFRDPRWGRGQETPGEDP++ S YA +V+ Q +SDDG
Sbjct: 157 VGLAGLTFWSPNVNIFRDPRWGRGQETPGEDPLLSSKYASGYVRGLQ----QSDDG---- 208
Query: 217 GFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQ 276
D L ++ACCKH AYDL+ W R+ FNA++T+QD +DTFQ
Sbjct: 209 ----------------SPDRLKVAACCKHYTAYDLDNWKGVDRFHFNAVVTKQDMDDTFQ 252
Query: 277 PPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFE 335
PPF+SC+ G + +MCSYNQVNG PAC DL R EW GYI SDCD+V +
Sbjct: 253 PPFKSCVIDGNVASVMCSYNQVNGKPACADPDLLSGIVRGEWKLNGYIVSDCDSVDVFYN 312
Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
Q+YTKT E++AA + AG+D+NCG+ + +HT++A+ G V E +D+A+ N F+ +RL
Sbjct: 313 SQHYTKTPEEAAAKAILAGLDLNCGSFLGQHTEAAVKGGLVDESAVDKAVSNNFATLMRL 372
Query: 396 GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAII 455
G F+G+P K YGKLGP DVCTSEH++LA +AARQGIVLLKN K LPL+ A+ +LA+I
Sbjct: 373 GFFDGNPSKAIYGKLGPKDVCTSEHQELAREAARQGIVLLKNSKGSLPLSPTAIKTLAVI 432
Query: 456 GPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAK 515
GP N M G Y G PC + L+GL A V+ T Y GC +V C + A EA +IA
Sbjct: 433 GPNANVTKTMIGNYEGTPCKYTTPLQGLTALVATT-YLPGCSNVACGT-AQIDEAKKIAA 490
Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
AD +++ G+D + E E RDRV++ LPGQQ L+T VA+ SK VILV+ GG D+SF
Sbjct: 491 AADATVLIVGIDQSIEAEGRDRVNIQLPGQQPLLITEVAKASKGNVILVVMSGGGFDISF 550
Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRA 634
A+ D +I+SILW+GYPGEAG A+A++IFG +NP GRLPMTWYP+S+ KVPM +MNMR
Sbjct: 551 AKNDDKITSILWVGYPGEAGGAAIADVIFGFYNPSGRLPMTWYPQSYVDKVPMTNMNMRP 610
Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
D + YPGR+YRFYTG +Y FG GLSYT +++ + AP S S+ +
Sbjct: 611 DPASGYPGRTYRFYTGETIYTFGDGLSYTQFNHHLVQAPK----SVSIPIEEGHSCHSSK 666
Query: 695 GSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFD 754
+D V SC +L F + + V NAG++ GSH V LF+ P V +P+K L+GF+
Sbjct: 667 CKSVDAVQ----ESCQNLVFDIHLRVNNAGNISGSHTVFLFSSPPSV-HNSPQKHLLGFE 721
Query: 755 RVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
+V AK + F VD C+ LSI ++ G R + LG HVL VG L+HSL +
Sbjct: 722 KVFVTAKAKALVRFKVDVCKDLSIVDELGTRKVALGLHVLHVGNLKHSLNVR 773
>gi|115485165|ref|NP_001067726.1| Os11g0297800 [Oryza sativa Japonica Group]
gi|62734696|gb|AAX96805.1| beta-D-xylosidase [Oryza sativa Japonica Group]
gi|77549999|gb|ABA92796.1| Glycosyl hydrolase family 3 C terminal domain containing protein,
expressed [Oryza sativa Japonica Group]
gi|113644948|dbj|BAF28089.1| Os11g0297800 [Oryza sativa Japonica Group]
gi|125534139|gb|EAY80687.1| hypothetical protein OsI_35869 [Oryza sativa Indica Group]
gi|215766717|dbj|BAG98945.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 782
Score = 758 bits (1956), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 375/755 (49%), Positives = 491/755 (65%), Gaps = 47/755 (6%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
FC+ +L RA L++ LT EK+ QL D A+ +PRLG+PAY+WWSE+LHG+A++G G+
Sbjct: 52 FCDATLPAEQRAADLVARLTAAEKVAQLGDQAAGVPRLGVPAYKWWSEALHGLATSGRGL 111
Query: 114 NFNGTVSSV---TSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA-GLTFWAPNI 169
+F+ S+ TSFPQVL++AA+F+ LW IG A+ EARA+YN+GQA GLT W+PN+
Sbjct: 112 HFDAPGSAARAATSFPQVLLTAAAFDDDLWFRIGQAIGTEARALYNIGQAEGLTMWSPNV 171
Query: 170 NIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGE 229
NIFRDPRWGRGQETPGEDP + S YAV FVK QG +
Sbjct: 172 NIFRDPRWGRGQETPGEDPTMASKYAVAFVKGMQGNS----------------------- 208
Query: 230 ESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKAS 289
L SACCKH+ AYDLE W RY+FNA +T QD EDT+ PPFRSC+ KA+
Sbjct: 209 ----SAILQTSACCKHVTAYDLEDWNGVQRYNFNAKVTAQDLEDTYNPPFRSCVVDAKAT 264
Query: 290 CIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAA 348
CIMC+Y +NGVPAC DL K R +WG GYI SDCDAVA + + Q YT+T ED+ A
Sbjct: 265 CIMCAYTGINGVPACANADLLTKTVRGDWGLDGYIASDCDAVAIMRDAQRYTQTPEDAVA 324
Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-Y 407
LKAG+D+NCGT M +H +AI +GK+ E+DID+AL NLF++++RLG F+GDPR Y
Sbjct: 325 VALKAGLDMNCGTYMQQHATAAIQQGKLTEEDIDKALKNLFAIRMRLGHFDGDPRSNSVY 384
Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
G LG D+CT EH+ LAL+AA GIVLLKND LPL++ AV+S A+IGP N+ + G
Sbjct: 385 GGLGAADICTPEHRSLALEAAMDGIVLLKNDAGILPLDRTAVASAAVIGPNANDGLALIG 444
Query: 468 GYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLD 527
Y G PC + L G+ Y+ + +GC+ C+ A +A +A +D+V + GL
Sbjct: 445 NYFGPPCESTTPLNGILGYIKNVRFLAGCNSAACDVAA-TDQAAAVASSSDYVFLFMGLS 503
Query: 528 LTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILW 587
QE+E RDR SLLLPG+Q SL+T+VA +KRPVILVL GGP+DV+FA+ + +I +ILW
Sbjct: 504 QKQESEGRDRTSLLLPGEQQSLITAVADAAKRPVILVLLTGGPVDVTFAQTNPKIGAILW 563
Query: 588 IGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRF 647
GYPG+AG A+A ++FGD NPGGRLP+TWYPE FTKVPM DM MRAD + YPGRSYRF
Sbjct: 564 AGYPGQAGGLAIARVLFGDHNPGGRLPVTWYPEEFTKVPMTDMRMRADPATGYPGRSYRF 623
Query: 648 YTGTQVYGFGHGLSYTNYSYKFLS----APSELTISASLKAGSDKNILQQTGSRLDYVHI 703
Y G VY FG+GLSY++YS + +S A S + ASL+ + T + HI
Sbjct: 624 YQGKTVYKFGYGLSYSSYSRQLVSGGKPAESYTNLLASLRTTT-------TSEGDESYHI 676
Query: 704 DEVTS--CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAK 761
+E+ + C L+F + V N G +DG H V+++ R P G P QLIGF H
Sbjct: 677 EEIGTDGCEQLKFPAVVEVQNHGPMDGKHSVLMYLRWPNAKGGRPTTQLIGFRSQHLKVG 736
Query: 762 GSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
I F + PCE S K G++++ G+H LMV
Sbjct: 737 EKANIRFDISPCEHFSRVRKDGKKVIDRGSHYLMV 771
>gi|302786124|ref|XP_002974833.1| hypothetical protein SELMODRAFT_101733 [Selaginella moellendorffii]
gi|300157728|gb|EFJ24353.1| hypothetical protein SELMODRAFT_101733 [Selaginella moellendorffii]
Length = 784
Score = 755 bits (1950), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 367/760 (48%), Positives = 502/760 (66%), Gaps = 31/760 (4%)
Query: 51 SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
S+PFC+T L I R + L+S LTL EK+ ++ + A IPRLG+P+Y+WW E+LHG+AS+
Sbjct: 48 SFPFCDTKLGIDVRVQDLVSRLTLDEKVDEMVNAAQGIPRLGVPSYQWWQEALHGVASS- 106
Query: 111 PGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNIN 170
PGV F G + TSFP + +AASFN +L+ +IG AV+ EARA++NLG+AGLTFW+PN+N
Sbjct: 107 PGVQFGGLAPAATSFPMPIATAASFNSTLFYSIGEAVSSEARALHNLGRAGLTFWSPNVN 166
Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
IFRDPRWGRGQETPGEDP++ S +A +V+ QG G E GF
Sbjct: 167 IFRDPRWGRGQETPGEDPLLASKFASLYVRGLQG------------GAYEGSASDGF--- 211
Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
L +SACCKHL AYD++ W RY FNA ++EQD DT+ PPF+SCIE G+ S
Sbjct: 212 ------LKVSACCKHLTAYDVDNWKGMDRYHFNAEVSEQDLVDTYNPPFQSCIEDGRVSS 265
Query: 291 IMCSYNQVNGVPACL-RGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
+MCSYN+VNGVP C R L + RN WGF GYI SDCDA+ +FE Y + ED+ A
Sbjct: 266 VMCSYNRVNGVPTCADRNLLTETVRNSWGFNGYIVSDCDALQVLFEDTTYAPSAEDAVAD 325
Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGK 409
+ AG+D+NCGT + +H +SA+ GK+ E D+D A+ NL ++RLGLF+GDP Y
Sbjct: 326 SILAGLDLNCGTFLGKHAKSALQAGKITEADLDHAVSNLMRTRMRLGLFDGDPNSQPYSS 385
Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY 469
LG D+C+++H++LALDAA QG+VLLKND LPL+ A+ ++A+IGP N M G Y
Sbjct: 386 LGATDICSNDHQQLALDAALQGVVLLKNDGS-LPLS-TALKTVALIGPNANATYTMLGNY 443
Query: 470 TGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLT 529
GIPC S L+G++ Y S Y+ GC +V CN AV +A KAD V++V GLD +
Sbjct: 444 EGIPCKYISPLQGMQIYSSNILYSPGCRNVACNEGDLVASAVEVATKADAVVLVVGLDQS 503
Query: 530 QETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIG 589
QE E DR SLLLPG Q LV+++A P++LV+ GP+D+S + +S+ISS++W+G
Sbjct: 504 QERETFDRTSLLLPGMQSQLVSNIANAVTSPIVLVIMSAGPVDISTFKDNSRISSVIWLG 563
Query: 590 YPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYT 649
YPG++G ALA ++FG +NPGGRLP TWY E FT V M DM MR + YPGRSYRFYT
Sbjct: 564 YPGQSGGAALAHVVFGAYNPGGRLPNTWYHEEFTNVSMLDMQMRPNPLSGYPGRSYRFYT 623
Query: 650 GTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNI--LQQTGSRLDYVHI--DE 705
GT +Y FG GLSY+ Y YKFL AP++L+ S G+ + + ++ ++ H+ D+
Sbjct: 624 GTPLYNFGDGLSYSTYFYKFLLAPTKLSFFKS-NTGNSRGCPAVNRSKAKSGCFHLPADD 682
Query: 706 VTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKE 765
+ +C S+ F V + V+N G GSH V++F+ P V +G P KQLI F +VH + ++
Sbjct: 683 LETCNSILFQVSVEVSNLGPRSGSHSVLIFSAPPPV-EGAPLKQLIAFQKVHLESDTTQR 741
Query: 766 ISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
+ FG+DPC+ LS ++G+R L G H L++G H L I
Sbjct: 742 LIFGIDPCKHLSSVRRNGKRFLHSGRHKLLIGNAVHILDI 781
>gi|255545293|ref|XP_002513707.1| Beta-glucosidase, putative [Ricinus communis]
gi|223547158|gb|EEF48654.1| Beta-glucosidase, putative [Ricinus communis]
Length = 777
Score = 754 bits (1948), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 381/785 (48%), Positives = 513/785 (65%), Gaps = 39/785 (4%)
Query: 27 VIAFSNSKPVLNKPDFPC---KPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSD 83
V++ N + P F C P S+ FCN SL IS R L++ LTLQEKI L +
Sbjct: 26 VLSSRNQVLAQSSPVFACDVKSNPSLASFGFCNVSLGISDRVTDLVNRLTLQEKIGFLVN 85
Query: 84 NASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNI 143
+A ++ RLGIP YEWWSE+LHG++ GPG +F+ V TSFPQV+++AASFN SL+ I
Sbjct: 86 SAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNIVPGATSFPQVILTAASFNASLFEAI 145
Query: 144 GSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQ 203
G V+ EARAMYN+G AGLTFW+PNINIFRDPRWGRGQETPGEDP++ S Y +V+ Q
Sbjct: 146 GKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLSSKYGSCYVRGLQ 205
Query: 204 GENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFN 263
++D+G +S+R L ++ACCKH AYDL+ W RY FN
Sbjct: 206 ----QTDNG-----------------DSER---LKVAACCKHYTAYDLDNWKGTDRYHFN 241
Query: 264 AMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGY 322
A++T+QD +DTFQPPF+SC+ G + +MCSYNQVNG P C DL R EW GY
Sbjct: 242 AVVTKQDLDDTFQPPFKSCVIDGNVASVMCSYNQVNGKPTCADPDLLAGIIRGEWKLNGY 301
Query: 323 ITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDID 382
I SDCD+V I+ Q+YTKT E++AA + AG+D+NCG+ + +HT++A++ G + +D
Sbjct: 302 IVSDCDSVDVIYNSQHYTKTPEEAAAITILAGLDLNCGSFLGKHTEAAVNAGLLNVSAVD 361
Query: 383 RALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFL 442
+A+ N F+ +RLG F+GDP K YGKLGP DVCT+ +++LA +AARQGIVLLKN L
Sbjct: 362 KAVSNNFATLMRLGFFDGDPSKQLYGKLGPKDVCTAVNQELAREAARQGIVLLKNSPGSL 421
Query: 443 PLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCN 502
PL+ A+ +LA+IGP N M G Y G PC + L+GL A V+ T Y +GC +V C
Sbjct: 422 PLSPTAIKTLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLTASVATT-YLAGCSNVACA 480
Query: 503 SDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVI 562
+ A +A ++A AD ++V G D + E E RDRV +LLPGQQ L+T VA SK PVI
Sbjct: 481 A-AQVDDAKKLAASADATVLVMGADQSIEAESRDRVDVLLPGQQQLLITQVANVSKGPVI 539
Query: 563 LVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF 622
LV+ GG +DVSFA+ + +I+SILW+GYPGEAG A+A++IFG +NP GRLPMTWYP+++
Sbjct: 540 LVIMSGGGMDVSFAKTNDKITSILWVGYPGEAGGAAIADVIFGYYNPSGRLPMTWYPQAY 599
Query: 623 T-KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISAS 681
KVPM +MNMR D S YPGR+YRFYTG VY FG GLSY+ Y ++ + AP ++I
Sbjct: 600 VDKVPMTNMNMRPDPSSGYPGRTYRFYTGETVYSFGDGLSYSEYKHQLVQAPQLVSIPL- 658
Query: 682 LKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKV 741
+ + + ++ S+ V E +C L F++ + V N G V G+H V LF P V
Sbjct: 659 -----EDDHVCRSSSKCISVDAGE-QNCQGLAFNIDLKVRNIGKVRGTHTVFLFFTPPSV 712
Query: 742 SQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRH 801
+P+K L+ F++V AK +SF VD C+ LS+ ++ G R + LG HVL VG L H
Sbjct: 713 -HNSPQKHLVDFEKVSLDAKTYGMVSFKVDVCKHLSVVDEFGSRKVALGGHVLHVGNLEH 771
Query: 802 SLTIE 806
SLT+
Sbjct: 772 SLTVR 776
>gi|226531269|ref|NP_001145980.1| uncharacterized protein LOC100279508 precursor [Zea mays]
gi|219885199|gb|ACL52974.1| unknown [Zea mays]
gi|413920228|gb|AFW60160.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 794
Score = 754 bits (1946), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 370/789 (46%), Positives = 507/789 (64%), Gaps = 53/789 (6%)
Query: 42 FPCKPPH-FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWS 100
F C P S PFC SL + RA+ L+S LT EK++ L +NA+ +PRLG+ YEWWS
Sbjct: 27 FACAPGGPAASLPFCRQSLPLRARARDLVSRLTRAEKVRLLVNNAAGVPRLGVAGYEWWS 86
Query: 101 ESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA 160
E+LHG++ GPGV F G T+FPQV+ +AAS N +LW +G AV+ EARAMYN G+A
Sbjct: 87 EALHGVSDTGPGVRFGGAFPGATAFPQVIGTAASLNATLWELVGRAVSDEARAMYNGGRA 146
Query: 161 GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFRE 220
GLTFW+PN+NIFRDPRWGRGQETPGEDP V + YA +V+ Q + + + +GG
Sbjct: 147 GLTFWSPNVNIFRDPRWGRGQETPGEDPAVSARYAAAYVRGLQ-QPYAAPNGG------- 198
Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFR 280
+ L L+ACCKH AYDL+KWG R+ FNA++ QD EDTF PFR
Sbjct: 199 ------------HRNRLKLAACCKHFTAYDLDKWGGTDRFHFNAVVAAQDLEDTFNVPFR 246
Query: 281 SCIEQGKASCIMCSYNQVNGVPAC-----LRGDLFQKARNEWGFKGYITSDCDAVATIFE 335
+C+E G+A+ +MCSYNQVNGVP C LRG + R WG GYI SDCD+V F
Sbjct: 247 ACVEDGRAASVMCSYNQVNGVPTCADAAFLRGTI----RGRWGLDGYIVSDCDSVDVFFR 302
Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
Q+YT+T ED+AA L+AG+D++CG + + SA+ GKV + D+D ALLN +VQ+RL
Sbjct: 303 DQHYTRTPEDAAAATLRAGLDLDCGPFLALYAGSAVAAGKVADADVDAALLNTVTVQMRL 362
Query: 396 GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKN------DKKFLPLNKNAV 449
G+F+GDP G +G+LGP DVCT EH+ LALDAARQG+VLLKN ++ LPL A
Sbjct: 363 GMFDGDPAAGPFGRLGPADVCTREHQDLALDAARQGVVLLKNRRGARHNRDVLPLRPAAH 422
Query: 450 SSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHE 509
+A++GP + M G Y G PC + L+G+ AY ++ + +GC DV C +
Sbjct: 423 RVVAVVGPHADATVAMIGNYAGKPCRYTTPLQGVAAYAARVAHQAGCTDVACRGNQPIAA 482
Query: 510 AVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGG 569
AV A++AD +VVAGLD E E DR +LLLPG+Q L+++VA+ SK PVILVL GG
Sbjct: 483 AVEAARQADATVVVAGLDQRVEAEGLDRTTLLLPGRQAELISAVAKASKGPVILVLMSGG 542
Query: 570 PLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMN 628
P+D++FA+ D +I ILW+GYPG+AG +A+A++IFG NPG +LP+TWY + + KVPM
Sbjct: 543 PIDIAFAQNDPRIDGILWVGYPGQAGGQAIADVIFGHHNPGAKLPVTWYHQDYLQKVPMT 602
Query: 629 DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTI-------SAS 681
+M MRA+ +R YPGR+YRFYTG +Y FGHGLSYT +++ AP++LT+ SA+
Sbjct: 603 NMAMRANPARGYPGRTYRFYTGPTIYPFGHGLSYTQFTHTLAHAPTQLTVRLSGSGHSAA 662
Query: 682 LKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKV 741
A L + + H C L V + V+N GD DG+H V+++ P
Sbjct: 663 SAASLLNATLARPVRAVRVAH----ARCEGLTVPVHVDVSNVGDRDGAHAVLVYHAAPSP 718
Query: 742 SQG-----TPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
S P +QL+ F++VH A G + + C++LS+A+++G R +P+G H LM+
Sbjct: 719 SHAAPGADAPARQLVAFEKVHVPAGGVARVEMRIGVCDRLSVADRNGVRRVPVGEHRLMI 778
Query: 797 GELRHSLTI 805
GEL HS+++
Sbjct: 779 GELTHSVSL 787
>gi|356558612|ref|XP_003547598.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 1-like
[Glycine max]
Length = 776
Score = 754 bits (1946), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 372/769 (48%), Positives = 505/769 (65%), Gaps = 40/769 (5%)
Query: 42 FPC---KPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEW 98
F C K P Y FC+ SLS+ R L+ LTLQEKI L ++A+++ RLGIP YEW
Sbjct: 41 FACDVAKNPALAGYGFCDKSLSVEDRVADLVKRLTLQEKIGSLVNSATSVSRLGIPKYEW 100
Query: 99 WSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
WSE+LHG+++ GPG +F+ V TSFP +++AASFN SL+ IG V+ EARAMYN+G
Sbjct: 101 WSEALHGVSNVGPGTHFSSLVPGATSFPMPILTAASFNASLFEAIGRVVSTEARAMYNVG 160
Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
AGLT+W+PNINIFRDPRWGRGQETPGEDP++ S YA +VK Q ++DDG
Sbjct: 161 LAGLTYWSPNINIFRDPRWGRGQETPGEDPLLSSKYATGYVKGLQ----QTDDG------ 210
Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
++L ++ACCKH AYDL+ W RY+FNA++T+QD +DTFQPP
Sbjct: 211 --------------DSNKLKVAACCKHYTAYDLDNWKGIQRYTFNAVVTQQDMDDTFQPP 256
Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFEYQ 337
F+SC+ G + +MCSYNQVNG P C DL + R EW GYI SDCD+V +F+ Q
Sbjct: 257 FKSCVIDGNVASVMCSYNQVNGKPTCADPDLLKGIIRGEWKLNGYIVSDCDSVEVLFKDQ 316
Query: 338 NYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
+YTKT E++AA + AG+D+NCG + ++T+ A+ +G + E I+ A+ N F+ +RLG
Sbjct: 317 HYTKTPEEAAAQTILAGLDLNCGNYLGQYTEGAVKQGLLDEASINNAVSNNFATLMRLGF 376
Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
F+GDP K YG LGP DVCTSE+++LA +AARQGIVLLKN LPLN + SLA+IGP
Sbjct: 377 FDGDPSKQPYGNLGPKDVCTSENRELAREAARQGIVLLKNSPGSLPLNAKTIKSLAVIGP 436
Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKA 517
N M G Y GIPC+ S L+ L A V T YA+GC +V C ++A +A +IA A
Sbjct: 437 NANATRVMIGNYEGIPCNYISPLQTLTALV-PTSYAAGCPNVQC-ANAELDDATQIAASA 494
Query: 518 DFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAE 577
D +++ G L E E DR+++LLPGQQ LV+ VA SK PVILV+ GG +DVSFA+
Sbjct: 495 DATVIIVGASLAIEAESLDRINILLPGQQQLLVSEVANASKGPVILVIMSGGGMDVSFAK 554
Query: 578 ADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADS 636
++ +I+SILW+GYPGEAG A+A++IFG +NP GRLPMTWYP+++ KVPM +MNMRAD
Sbjct: 555 SNDKITSILWVGYPGEAGGAAIADVIFGFYNPSGRLPMTWYPQAYVNKVPMTNMNMRADP 614
Query: 637 SRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGS 696
+ YPGR+YRFY G V+ FG G+S+++ +K + AP +++ + D
Sbjct: 615 ATGYPGRTYRFYKGETVFSFGDGISFSSIEHKIVKAPQLVSVPLA----EDHECRSSECM 670
Query: 697 RLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRV 756
LD DE C +L F + + V N G + SHVV+LF P V P+K L+GF++V
Sbjct: 671 SLDIA--DE--HCQNLAFDIHLGVKNTGKMSTSHVVLLFFTPPDV-HNAPQKHLLGFEKV 725
Query: 757 HTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
H K ++ F VD C+ LS+ ++ G R +PLG H+L VG L+H L++
Sbjct: 726 HLPGKSEAQVRFKVDVCKDLSVVDELGNRKVPLGQHLLHVGNLKHPLSL 774
>gi|326517420|dbj|BAK00077.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 781
Score = 753 bits (1945), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 379/775 (48%), Positives = 502/775 (64%), Gaps = 38/775 (4%)
Query: 40 PDFPCKPPHFDS---YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAY 96
P F C P + Y FC+ +L ++ RA L++ LT EK+ QL D A+ +PRLG+PAY
Sbjct: 36 PPFSCGPSSTAATQGYAFCDATLPVAQRAADLVARLTTAEKVAQLGDEAAGVPRLGVPAY 95
Query: 97 EWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN 156
+WW+E+LHG+A++G G++FNG V S TSFPQV ++AA+F+ LW IG A+ EARA+YN
Sbjct: 96 KWWNEALHGLATSGKGLHFNGAVRSATSFPQVSLTAAAFDDDLWLRIGQAIGREARALYN 155
Query: 157 LGQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
+GQA GLT W+PN+NI+RDPRWGRGQETPGEDP S Y V FVK QG + S
Sbjct: 156 VGQAEGLTMWSPNVNIYRDPRWGRGQETPGEDPTTASRYGVAFVKGLQGNSTSSS----- 210
Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTF 275
L SACCKH AYDLE WG +RY+F+A +T QD EDT+
Sbjct: 211 --------------------LLQTSACCKHATAYDLEDWGGVARYNFDARVTAQDLEDTY 250
Query: 276 QPPFRSCIEQGKASCIMCSYNQVNGVPACLR-GDLFQKARNEWGFKGYITSDCDAVATIF 334
PPFRSC+ GKASC+MC+Y +NGVPAC G L R +WG GY+ SDCDAVA +
Sbjct: 251 NPPFRSCVVDGKASCVMCAYTAINGVPACANSGLLTNTVRADWGLDGYVASDCDAVAIMR 310
Query: 335 EYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
+ Q Y T ED+ A LKAG+DI+CGT M +H +A+ +GK+ E D+D+AL NLF++++R
Sbjct: 311 DAQRYAPTPEDAVALALKAGLDIDCGTYMQQHAPAALQQGKITEDDVDKALKNLFAIRMR 370
Query: 395 LGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
LG F+GDPR YG L +CT EH+ LAL+AA+ GIVLLKND LPL++ A++S A+
Sbjct: 371 LGHFDGDPRANIYGGLNAAHICTPEHRSLALEAAQDGIVLLKNDAGILPLDRAAIASAAV 430
Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIA 514
IGP NN + G Y G PC + L+G++ YV + +GC C+ A +A +A
Sbjct: 431 IGPNANNPGLLIGNYFGPPCESVTPLKGVQGYVKDVRFMAGCGSAACDV-ADTDQAATLA 489
Query: 515 KKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
+D+V++ GL QE+E RDR SLLLPGQQ SL+T+VA +KRPVILVL GGP+DV+
Sbjct: 490 GSSDYVLLFMGLSQQQESEGRDRTSLLLPGQQQSLITAVADAAKRPVILVLLTGGPVDVT 549
Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA 634
FA+ + +I +ILW GYPG+AG A+A ++FGD NPGGRLP+TWYPE FTKVPM DM MRA
Sbjct: 550 FAKNNPKIGAILWAGYPGQAGGLAIARVLFGDHNPGGRLPVTWYPEEFTKVPMTDMRMRA 609
Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAG-SDKNILQQ 693
D + YPGRSYRFY G VY FG+GLSY++YS + LS+ T + L AG S +
Sbjct: 610 DPATGYPGRSYRFYQGETVYKFGYGLSYSSYSRRLLSSG---TPNTDLLAGLSTMPTPAE 666
Query: 694 TGSRLDYVHIDEVTS--CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLI 751
G Y H++ + + C L+F + V N G +DG H V+++ R + G P KQLI
Sbjct: 667 EGGVASY-HVEHIGARGCEQLKFPAVVEVENHGPMDGKHSVLMYLRWANATAGRPAKQLI 725
Query: 752 GFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
GF R H A ++F + PCE S K G +++ G+H LMV +T E
Sbjct: 726 GFRRQHLKAGEKASLTFDISPCEHFSRVRKDGNKVVDRGSHFLMVDMHEMEITFE 780
>gi|357489463|ref|XP_003615019.1| hypothetical protein MTR_5g062650 [Medicago truncatula]
gi|355516354|gb|AES97977.1| hypothetical protein MTR_5g062650 [Medicago truncatula]
Length = 785
Score = 753 bits (1944), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 381/775 (49%), Positives = 505/775 (65%), Gaps = 34/775 (4%)
Query: 40 PDFPCK--PPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYE 97
P + C P SY FCN +L+ RAK ++S LTL EK+ QL + A AIPRLGI +Y+
Sbjct: 34 PPYSCDITNPLTKSYTFCNLNLTTIQRAKDIVSRLTLDEKLAQLVNTAPAIPRLGIHSYQ 93
Query: 98 WWSESLHGIASNGPGVNFNG--TVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY 155
WWSE+LHG+A G G+ NG T+ + T FPQV+++AASF+ LW I + EARA+Y
Sbjct: 94 WWSEALHGVADYGKGIRLNGNVTIKAATIFPQVILTAASFDSKLWYRISKVIGTEARAVY 153
Query: 156 NLGQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGI 214
N GQA G+TFWAPNINIFRDPRWGRGQET GEDP+V + YAV FV+ QG++++
Sbjct: 154 NAGQAEGMTFWAPNINIFRDPRWGRGQETAGEDPLVSAKYAVSFVRGLQGDSFE------ 207
Query: 215 GFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDT 274
G E D L SACCKH AYDL+ W R+ F+A +T QD DT
Sbjct: 208 GGKLNE--------------DRLKASACCKHFTAYDLDNWKGVDRFDFDANVTLQDLADT 253
Query: 275 FQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATI 333
+QPPF SCI QG++S IMC+YN+VNG+P C +L AR +W F GYITSDC AV I
Sbjct: 254 YQPPFHSCIVQGRSSGIMCAYNRVNGIPNCADYNLLTNTARKKWNFNGYITSDCSAVDII 313
Query: 334 FEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQL 393
+ Q Y K ED+ A VL+AGMD+ CG H++SA+ + KV IDRAL NLFS+++
Sbjct: 314 HDRQGYAKAPEDAVADVLQAGMDVECGDYFTSHSKSAVLQKKVPISQIDRALHNLFSIRI 373
Query: 394 RLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
RLGLF+G P K KYGK+GP+ VC+ ++ +AL+AAR GIVLLKN LPL K + S+
Sbjct: 374 RLGLFDGHPTKLKYGKIGPNRVCSKQNLNIALEAARSGIVLLKNAASILPLPK-STDSIV 432
Query: 454 IIGPLVNNISQMG-GGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVR 512
+IGP N+ SQ+ G Y G PC+ ++L+G E Y Y GC D A AV
Sbjct: 433 VIGPNANSSSQVVLGNYFGRPCNLVTILQGFENYSDNLLYHPGCSDGTKCVSAEIDRAVE 492
Query: 513 IAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLD 572
+AK D+V++V GLD +QE+E DR L LPG+Q L+ SVA+ SKRPVILVL GGP+D
Sbjct: 493 VAKVVDYVVLVMGLDQSQESEGHDRDDLELPGKQQELINSVAKASKRPVILVLFCGGPVD 552
Query: 573 VSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNM 632
+SFA+ D +I ILW GYPGE G ALA+++FGD+NPGGRLPMTWYP+ F K+PM DM M
Sbjct: 553 ISFAKVDDKIGGILWAGYPGELGGMALAQVVFGDYNPGGRLPMTWYPKDFIKIPMTDMRM 612
Query: 633 RADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQ 692
RAD S YPGR+YRFYTG +VY FG+GLSY+NYSY F+S + + + + +IL+
Sbjct: 613 RADPSSGYPGRTYRFYTGPKVYEFGYGLSYSNYSYNFISVKNN---NLHINQSTTYSILE 669
Query: 693 QTGSRLDYVHIDEV--TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQL 750
++ + + Y + E+ +C ++ V + +TN G + G H V+LF + K G P KQL
Sbjct: 670 KSQT-IHYKLVSELGKKACKTMSISVTLGITNTGSMAGKHPVLLFVKPKKGRNGNPVKQL 728
Query: 751 IGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
+GF+ V G E+ F V CE LS AN+ G +++ G ++ +VGEL +S+ I
Sbjct: 729 VGFESVTVEGGGKGEVGFEVSVCEHLSRANESGVKVIEEGGYLFLVGELEYSINI 783
>gi|359481045|ref|XP_002268626.2| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
[Vitis vinifera]
gi|296089342|emb|CBI39114.3| unnamed protein product [Vitis vinifera]
Length = 774
Score = 752 bits (1942), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 383/783 (48%), Positives = 507/783 (64%), Gaps = 42/783 (5%)
Query: 30 FSNSKPVLNK--PDFPC---KPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDN 84
S+ K VL + P F C P + FCNTSL + R L+ LTL+EKI L ++
Sbjct: 25 LSSPKWVLGQSSPVFACDVENNPTLGQFGFCNTSLETAARVADLVKRLTLEEKIGFLVNS 84
Query: 85 ASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIG 144
A+++ RLGIP YEWWSE+LHG++ GPG +FN V TSFPQV+++AASFN SL+ IG
Sbjct: 85 AASVSRLGIPKYEWWSEALHGVSYVGPGTHFNSIVPGATSFPQVILTAASFNASLFEAIG 144
Query: 145 SAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQG 204
V+ EARAMYN+G AGLTFW+PN+NIFRDPRWGRGQETPGEDP++ S YA +V+ Q
Sbjct: 145 KVVSTEARAMYNVGLAGLTFWSPNVNIFRDPRWGRGQETPGEDPLLSSKYASAYVRGLQ- 203
Query: 205 ENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNA 264
+ DDG D L ++ACCKH AYDL+ W R FNA
Sbjct: 204 ---QGDDG--------------------SPDRLKVAACCKHYTAYDLDNWKGVDRLHFNA 240
Query: 265 MITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYI 323
++T+QD +DTFQPPF+SC+ G + +MCS+NQVNG P C DL R EW GYI
Sbjct: 241 VVTKQDMDDTFQPPFKSCVIDGNVASVMCSFNQVNGKPTCADPDLLSGIVRGEWKLNGYI 300
Query: 324 TSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDR 383
SDCD+V + Q+YTKT E++AA + AG+D+NCG+ + +HT++A+ G V E +D+
Sbjct: 301 VSDCDSVDVFYNSQHYTKTPEEAAAKAILAGLDLNCGSFLGQHTEAAVKGGLVDESAVDK 360
Query: 384 ALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLP 443
A+ N F+ +RLG F+G+P K YGKLGP DVCTSEH+++A +AARQGIVLLKN K LP
Sbjct: 361 AVSNNFATLMRLGFFDGNPSKAIYGKLGPKDVCTSEHQEMAREAARQGIVLLKNSKGSLP 420
Query: 444 LNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNS 503
L+ A+ +LAIIGP N M G Y G PC + L+GL A V+ T Y GC +V C +
Sbjct: 421 LSPTAIKTLAIIGPNANVTKTMIGNYEGTPCKYTTPLQGLTALVATT-YLPGCSNVACGT 479
Query: 504 DAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVIL 563
A EA +IA AD +++ G+D + E E RDRVS+ LPGQQ L+T VA+ SK VIL
Sbjct: 480 -AQIDEAKKIAAAADATVLIVGIDQSIEAEGRDRVSIQLPGQQPLLITEVAKASKGNVIL 538
Query: 564 VLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT 623
V+ GG D+SFA+ D +I+SILW+GYPGEAG A+A++IFG +NP GRLPMTWYP+S+
Sbjct: 539 VVMSGGGFDISFAKNDDKIASILWVGYPGEAGGAAIADVIFGFYNPSGRLPMTWYPQSYV 598
Query: 624 -KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASL 682
KVPM +MNMR D + YPGR+YRFYTG +Y FG GLSYT +++ + AP S S+
Sbjct: 599 DKVPMTNMNMRPDPASGYPGRTYRFYTGETIYTFGDGLSYTQFNHHLVQAPK----SVSI 654
Query: 683 KAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVS 742
+ +D V SC +L F + + V NAG++ GSH V LF+ P V
Sbjct: 655 PIEEGHSCHSSKCKSVDAVQ----ESCQNLAFDIHLRVNNAGNISGSHTVFLFSSPPSV- 709
Query: 743 QGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHS 802
+P+K L+GF++V AK + F VD C+ LSI ++ G + + LG HVL VG L+HS
Sbjct: 710 HNSPQKHLLGFEKVFVTAKAEALVRFKVDVCKDLSIVDELGTQKVALGLHVLHVGSLKHS 769
Query: 803 LTI 805
L +
Sbjct: 770 LNV 772
>gi|357156904|ref|XP_003577615.1| PREDICTED: probable beta-D-xylosidase 7-like [Brachypodium
distachyon]
Length = 767
Score = 752 bits (1941), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 380/777 (48%), Positives = 504/777 (64%), Gaps = 49/777 (6%)
Query: 40 PDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWW 99
P F C SY FC+ +L ++ RA L+S LT EK+ QL D A+ +PRLG+P Y+WW
Sbjct: 25 PPFSCGQAS-SSYAFCDAALPVAQRAADLVSRLTAAEKVAQLGDEAAGVPRLGVPGYKWW 83
Query: 100 SESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ 159
+E+LHG+A++G G++F+G V S TSFPQV ++AA+F+ LW IG A+ EARA+YNLGQ
Sbjct: 84 NEALHGLATSGKGLHFDGAVRSATSFPQVCLTAAAFDDDLWFRIGQAIGREARALYNLGQ 143
Query: 160 A-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
A GLT W+PN+NI+RDPRWGRGQETPGEDP S YAV FV+ QG +
Sbjct: 144 AEGLTMWSPNVNIYRDPRWGRGQETPGEDPTTASRYAVAFVRGMQGNSTSL--------- 194
Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
L SACCKH AYDLE W +RY+F+A +T QD EDTF PP
Sbjct: 195 ------------------LQASACCKHATAYDLEDWNGVARYNFDAKVTAQDLEDTFNPP 236
Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQ 337
FRSC+ GKASC+MC+Y +NGVPAC DL K R +WG GY SDCDAVA + + Q
Sbjct: 237 FRSCVVDGKASCVMCAYTGINGVPACANADLLTKTVRGDWGLDGYTASDCDAVAIMRDAQ 296
Query: 338 NYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
Y ++ ED+ A LKAG+DI+CGT M +H +AI +GK+ E+DID+AL NLF++++RLG
Sbjct: 297 RYAQSPEDAVALALKAGLDIDCGTYMQQHAAAAIQQGKITEEDIDKALKNLFAIRMRLGH 356
Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
F+GDPR YG LG D+CT+EH+ LALDAA+ GIVLLKND LPL++ AV+S A+IGP
Sbjct: 357 FDGDPRTNMYGGLGAADICTAEHRSLALDAAQDGIVLLKNDAGILPLDRAAVASTAVIGP 416
Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKA 517
NN + Y G PC + L+G++ YV + +GC C+ A +A +A +
Sbjct: 417 NANNPGALIANYFGPPCESTTPLKGIQGYVKDARFLAGCSSTACDV-ATTDQAAALASTS 475
Query: 518 DFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAE 577
D+V + GL QE+E RDR SLLLPG+Q SL+T+VA ++RPVILVL GGP+DV+FA+
Sbjct: 476 DYVFLFMGLGQRQESEGRDRTSLLLPGKQQSLITAVADAAQRPVILVLLSGGPVDVTFAQ 535
Query: 578 ADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSS 637
+ +I +ILW GYPG+AG A+A ++FGD NP GRLP+TWYPE FT VPM DM MRAD +
Sbjct: 536 TNPKIGAILWAGYPGQAGGLAIARVLFGDHNPSGRLPVTWYPEEFTNVPMTDMRMRADPA 595
Query: 638 RQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKF----LSAP---SELTISASLKAGSDKNI 690
YPGRSYRFY G VY FG+GLSY++YS + S P ++L S + S +NI
Sbjct: 596 NGYPGRSYRFYQGKTVYKFGYGLSYSSYSRRLLSSGTSTPAPNADLLASLTTTMPSAENI 655
Query: 691 LQQTGSRLDYVHIDEVTS--CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEK 748
L GS H++++ + C L+F + V N G +DG V+++ R P + G PE+
Sbjct: 656 L---GS----YHVEQIGAQGCEMLKFPAVVEVQNHGPMDGKQSVLMYLRWPNATAGRPER 708
Query: 749 QLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
QLIGF + H A I F + PCE LS + G +++ G+H L V +H L I
Sbjct: 709 QLIGFKKEHLKAGEKAHIKFEIRPCEHLSRVREDGNKVIDRGSHFLRVD--KHELEI 763
>gi|302760655|ref|XP_002963750.1| hypothetical protein SELMODRAFT_80102 [Selaginella moellendorffii]
gi|300169018|gb|EFJ35621.1| hypothetical protein SELMODRAFT_80102 [Selaginella moellendorffii]
Length = 785
Score = 750 bits (1936), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 367/783 (46%), Positives = 506/783 (64%), Gaps = 32/783 (4%)
Query: 30 FSNSKPVLNKPDFPC---KPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNAS 86
SN++ +P + C S+PFC+T L + R + L+S LTL EK+ ++ + A
Sbjct: 25 LSNARVSTAQPRYACDVSSNASLGSFPFCDTKLGVDVRVQDLVSRLTLDEKVDEMVNAAQ 84
Query: 87 AIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSA 146
IPRLG+P+Y+WW E+LHG+AS+ PGV F G + TSFP + AASFN +L+ +IG A
Sbjct: 85 GIPRLGVPSYQWWQEALHGVASS-PGVQFGGLAPAATSFPMPIAMAASFNSTLFYSIGEA 143
Query: 147 VAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGEN 206
V+ EARA++NLG+AGLTFW+PN+NIFRDPRWGRGQETPGEDP++ S +A +V+ QG
Sbjct: 144 VSSEARALHNLGRAGLTFWSPNVNIFRDPRWGRGQETPGEDPLLASKFASLYVRGLQGGA 203
Query: 207 WKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMI 266
+ G GF L +SACCKHL AYD++ W RY FNA +
Sbjct: 204 YG---GSASDGF------------------LKVSACCKHLTAYDMDNWKGMDRYHFNAEV 242
Query: 267 TEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACL-RGDLFQKARNEWGFKGYITS 325
+EQD DT+ PPF+SCIE G+ S +MCSYN+VNGVP C R L + RN WGF GYI S
Sbjct: 243 SEQDLVDTYNPPFQSCIEDGRVSSVMCSYNRVNGVPTCADRSLLTETVRNSWGFNGYIVS 302
Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRAL 385
DCDA+ +FE Y + ED+ A + AG+D+NCGT + +H +SA+ GKV E D+D A+
Sbjct: 303 DCDALQVLFEDTTYAPSAEDAVADSILAGLDLNCGTFLGKHAKSALQAGKVTEADLDHAI 362
Query: 386 LNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
NL ++RLGLF+GD Y LG D+C+++H++LALDAA QG+VLLKND LPL+
Sbjct: 363 SNLMRTRMRLGLFDGDLNTRPYSSLGATDICSNDHQQLALDAALQGVVLLKNDGS-LPLS 421
Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDA 505
A+ ++A+IGP N M G Y GIPC S L+G++ Y + Y+ GC DV C+
Sbjct: 422 -TALKTVALIGPNANATYTMLGNYEGIPCKYVSPLQGMQIYNNNILYSPGCRDVACSEGD 480
Query: 506 GFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVL 565
AV +A KAD V++V GLD +QE E DR SLLLPG Q LV+++A P++LV+
Sbjct: 481 LVASAVEVATKADAVVLVVGLDQSQERETFDRTSLLLPGMQSQLVSNIANAVTCPIVLVI 540
Query: 566 TGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV 625
GP+D+S + +S+ISS++WIGYPG++G ALA ++FG +NPGGRLP TWY E FT V
Sbjct: 541 MSAGPVDISTFKDNSRISSVIWIGYPGQSGGAALAHVVFGAYNPGGRLPNTWYHEEFTNV 600
Query: 626 PMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELT-ISASLKA 684
M DM MR + YPGRSYRFYTGT +Y FG GLSY+ Y YKFL AP+ L+ ++ +
Sbjct: 601 SMLDMRMRPNPPSGYPGRSYRFYTGTPLYNFGDGLSYSTYLYKFLLAPTRLSFFKSNTRN 660
Query: 685 GSDKNILQQTGSRLDYVHI--DEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVS 742
D + ++ + H+ D++ +C S+ F V + V+N G GSH V++F+ P V
Sbjct: 661 SRDCPTVNRSEAEFGCFHLPADDLETCNSILFQVSVEVSNLGPRSGSHSVLIFSAPPPV- 719
Query: 743 QGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHS 802
+G P KQLI F +VH + ++ + FG+DPC+ LS ++G+R L G H L++G H
Sbjct: 720 EGAPLKQLIAFQKVHLESDTTQRLIFGIDPCKHLSSVRRNGKRFLHSGRHKLLIGNAVHI 779
Query: 803 LTI 805
L +
Sbjct: 780 LDV 782
>gi|125534137|gb|EAY80685.1| hypothetical protein OsI_35867 [Oryza sativa Indica Group]
Length = 779
Score = 750 bits (1936), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 368/767 (47%), Positives = 485/767 (63%), Gaps = 39/767 (5%)
Query: 40 PDFPCKPPHFDS-YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEW 98
P F C P + FCN +L RA L++ LT EK+ QL D A +PRLGIP Y+W
Sbjct: 35 PGFTCGPASAQKGFAFCNAALPAEQRAADLVARLTTAEKVGQLGDQAPGVPRLGIPVYKW 94
Query: 99 WSESLHGIASNGPGVNF-NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL 157
WSE+LHG+A +G G++F NG + TSFPQV+ +AA+F+ LW IG A+ E RA YNL
Sbjct: 95 WSEALHGLAISGKGIHFGNGPARTATSFPQVIHTAAAFDDGLWFRIGQAIGKEGRAFYNL 154
Query: 158 GQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGF 216
GQA GL W+PN+NIFRDPRWGRGQETPGEDP S Y FVK QG + +
Sbjct: 155 GQAEGLAMWSPNVNIFRDPRWGRGQETPGEDPATASKYGAAFVKGLQGSSLTN------- 207
Query: 217 GFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQ 276
L SACCKH+ AYD+E+W SRY+FNA +T QD DT+
Sbjct: 208 --------------------LQTSACCKHITAYDIEEWKGVSRYNFNAKVTPQDLADTYN 247
Query: 277 PPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFE 335
PPFRSC+ GKASCIMC+Y +NGVPAC DL K R EW GY SDCDAVA + +
Sbjct: 248 PPFRSCVVDGKASCIMCAYTLINGVPACASSDLLTKTVRGEWKLDGYTASDCDAVAILHK 307
Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
+++T+T E++ A LKAG+DINCG M ++ SA+ +GK+ EKD+D+AL NLF++++RL
Sbjct: 308 SEHFTRTAEEAVAVALKAGLDINCGVYMQQNAASALQQGKMTEKDVDKALKNLFAIRMRL 367
Query: 396 GLFNGDPRKGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
G F+GDPR K YG+LG DVCT HK LAL+AAR+G+VLLKND + LPL VSS A+
Sbjct: 368 GHFDGDPRGNKLYGRLGAADVCTPVHKALALEAARRGVVLLKNDARLLPLRAPTVSSAAV 427
Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIA 514
IG N+I + G Y G+PC + G++ YV + GC C+ A +A +A
Sbjct: 428 IGHNANDILALLGNYYGLPCETTTPFGGIQKYVKSAKFLPGCSSAACDV-AATDQATALA 486
Query: 515 KKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
K +D+V +V GL QE E DR SLLLPG+Q +L+T+VA SKRPVIL+L GGP+D++
Sbjct: 487 KSSDYVFLVMGLSQKQEQEGLDRTSLLLPGKQQALITAVATASKRPVILILLTGGPVDIT 546
Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA 634
FA+ + +I +ILW GYPG+AG +A+A+++FG+FNP G+LP+TWYPE FTK M DM MR
Sbjct: 547 FAQTNPKIGAILWAGYPGQAGGQAIADVLFGEFNPSGKLPVTWYPEEFTKFTMTDMRMRP 606
Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
D + YPGRSYRFY G VY FG+GLSY+ ++ + +S S+S + + T
Sbjct: 607 DPATGYPGRSYRFYKGKTVYKFGYGLSYSKFACRIVSGAGN---SSSYGKAALAGLRAAT 663
Query: 695 GSRLDYVH-IDEV--TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLI 751
D V+ +DE+ C LRF V + V N G +DG H V++F R G P +QLI
Sbjct: 664 TPEGDAVYRVDEIGDDRCERLRFPVMVEVQNHGPMDGKHTVLMFVRWSSTDGGRPVRQLI 723
Query: 752 GFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
GF H K++ + PCE LS A G +++ G+H LMV E
Sbjct: 724 GFRNQHLKVGEKKKLKMEISPCEHLSRARVDGEKVIDRGSHFLMVEE 770
>gi|62701898|gb|AAX92971.1| beta-D-xylosidase [Oryza sativa Japonica Group]
gi|62733926|gb|AAX96035.1| beta-D-xylosidase [Oryza sativa Japonica Group]
gi|77550045|gb|ABA92842.1| Glycosyl hydrolase family 3 C terminal domain containing protein,
expressed [Oryza sativa Japonica Group]
gi|125576900|gb|EAZ18122.1| hypothetical protein OsJ_33667 [Oryza sativa Japonica Group]
Length = 771
Score = 750 bits (1936), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 370/768 (48%), Positives = 494/768 (64%), Gaps = 39/768 (5%)
Query: 39 KPDFPCKPPHFDS-YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYE 97
+P + C P S Y FC+ L + RA L+S LT EK+ QL D A +PRLG+P Y+
Sbjct: 24 EPPYSCGPRSPSSGYAFCDARLPPARRAADLVSRLTAAEKVAQLGDEAGGVPRLGVPPYK 83
Query: 98 WWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL 157
WWSE LHG++ G G++FNG V+++TSFPQVL++AA+F+ LW IG A+ EARA+YNL
Sbjct: 84 WWSEGLHGLSYWGHGMHFNGAVTAITSFPQVLLTAAAFDDRLWFRIGQAIGTEARALYNL 143
Query: 158 GQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGF 216
GQA GLT W+PN+NI+RDPRWGRGQETPGEDP S YAV FVK QG +
Sbjct: 144 GQAEGLTIWSPNVNIYRDPRWGRGQETPGEDPTTASKYAVAFVKGLQGSTPGT------- 196
Query: 217 GFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQ 276
L SACCKH AYDLE+W +RY+FNA +T QD DTF
Sbjct: 197 --------------------LQTSACCKHATAYDLEEWNGVARYNFNAKVTAQDLADTFN 236
Query: 277 PPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFE 335
PPF+SC+ KASC+MC+Y +NGVPAC DL K R +WG GY++SDCDAVA + +
Sbjct: 237 PPFKSCVVDAKASCVMCAYTDINGVPACASSDLLSKTFRGQWGLDGYVSSDCDAVALLRD 296
Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
Q Y T ED+ A +KAG+D+NCG H +A+ +GK++E D+DRAL NLF+V++RL
Sbjct: 297 AQRYAPTPEDTVAVAIKAGLDLNCGNYTQVHGMAALQQGKMRESDVDRALTNLFAVRMRL 356
Query: 396 GLFNGDPR-KGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
G F+GDPR YG LG DVCT H+ LAL+AA+ GIVLLKND LPL++ V S A+
Sbjct: 357 GHFDGDPRSNAAYGHLGAADVCTQAHRDLALEAAQDGIVLLKNDAGALPLDRATVRSAAV 416
Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIA 514
IGP N+ + + G Y G PC + L+G++ Y+S + +GC D P A +A +A
Sbjct: 417 IGPNANDPAALNGNYFGPPCETTTPLQGVQRYISSVRFLAGC-DSPACGFAATGQAAALA 475
Query: 515 KKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
+D VI+ GL QE E DR SLLLPG+Q SL+T+VA ++RPVILVL GGP+DV+
Sbjct: 476 SSSDQVIMFMGLSQDQEKEGLDRTSLLLPGKQQSLITAVASAARRPVILVLLTGGPVDVT 535
Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA 634
FA+ + +I +ILW GYPG+AG A+A+++FGD NP GRLP+TWYPE FT++PM DM MRA
Sbjct: 536 FAKNNPKIGAILWAGYPGQAGGLAIAKVLFGDHNPSGRLPVTWYPEEFTRIPMTDMRMRA 595
Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
D + YPGRSYRFY G VY FG+GLSY+ +S + ++A + +L AG I +
Sbjct: 596 DPATGYPGRSYRFYQGNPVYKFGYGLSYSKFSRRLVAAAKPRRPNRNLLAGV---IPKPA 652
Query: 695 GSRLDYVHIDEV--TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT--PEKQL 750
G + H++E+ C L+F + V N G +DG H V++F R P + G P +QL
Sbjct: 653 GDGGESYHVEEIGEEGCERLKFPATVEVHNHGPMDGKHSVLVFVRWPNATAGASRPARQL 712
Query: 751 IGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
+GF H A ++ ++PCE LS A + G +++ G+H L VGE
Sbjct: 713 VGFSSQHVRAGEKARLTMEINPCEHLSRAREDGTKVIDRGSHFLKVGE 760
>gi|297745522|emb|CBI40687.3| unnamed protein product [Vitis vinifera]
Length = 751
Score = 750 bits (1936), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 379/773 (49%), Positives = 500/773 (64%), Gaps = 65/773 (8%)
Query: 40 PDFPC---KPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAY 96
P F C P + FCNTSL + R L+ LTL+EKI L ++A+++ RLGIP Y
Sbjct: 37 PVFACDVENNPTLGQFGFCNTSLETAARVADLVKRLTLEEKIGFLVNSAASVSRLGIPKY 96
Query: 97 EWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN 156
EWWSE+LHG++ GPG +FN V TSFPQV+++AASFN SL+ IG AV+ EARAMYN
Sbjct: 97 EWWSEALHGVSYVGPGTHFNSVVPGATSFPQVILTAASFNASLFEAIGKAVSTEARAMYN 156
Query: 157 LGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGF 216
+G AGLTFW+PN+NIFRDPRWGRGQETPGEDP++ S YA +V+ Q +SDDG
Sbjct: 157 VGLAGLTFWSPNVNIFRDPRWGRGQETPGEDPLLSSKYASGYVRGLQ----QSDDG---- 208
Query: 217 GFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQ 276
D L ++ACCKH AYDL+ W R+ FNA++T+QD +DTFQ
Sbjct: 209 ----------------SPDRLKVAACCKHYTAYDLDNWKGVDRFHFNAVVTKQDMDDTFQ 252
Query: 277 PPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFE 335
PPF+SC+ G + +MCSYNQVNG PAC DL R EW GYI SDCD+V +
Sbjct: 253 PPFKSCVIDGNVASVMCSYNQVNGKPACADPDLLSGIVRGEWKLNGYIVSDCDSVDVFYN 312
Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
Q+YTKT E++AA + AG+D+NCG+ + +HT++A+ G V E +D+A+ N F+ +RL
Sbjct: 313 SQHYTKTPEEAAAKAILAGLDLNCGSFLGQHTEAAVKGGLVDESAVDKAVSNNFATLMRL 372
Query: 396 GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAII 455
G F+G+P K YGKLGP DVCTSEH++LA +AARQGIVLLKN K LPL+ A+ +LA+I
Sbjct: 373 GFFDGNPSKAIYGKLGPKDVCTSEHQELAREAARQGIVLLKNSKGSLPLSPTAIKTLAVI 432
Query: 456 GPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAK 515
GP N M G Y G PC + L+GL A V+ T Y GC +V C + A EA +IA
Sbjct: 433 GPNANVTKTMIGNYEGTPCKYTTPLQGLTALVATT-YLPGCSNVACGT-AQIDEAKKIAA 490
Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
AD +++ G+D + E E RDRV++ LPGQQ L+T VA+ SK VILV+ GG D+SF
Sbjct: 491 AADATVLIVGIDQSIEAEGRDRVNIQLPGQQPLLITEVAKASKGNVILVVMSGGGFDISF 550
Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRA 634
A+ D +I+SILW+GYPGEAG A+A++IFG +NP GRLPMTWYP+S+ KVPM +MNMR
Sbjct: 551 AKNDDKITSILWVGYPGEAGGAAIADVIFGFYNPSGRLPMTWYPQSYVDKVPMTNMNMRP 610
Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
D + YPGR+YRFYTG +Y FG GLSYT +++
Sbjct: 611 DPASGYPGRTYRFYTGETIYTFGDGLSYTQFNH--------------------------- 643
Query: 695 GSRLDYVHIDEVT-SCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
++ +D V SC +L F + + V NAG++ GSH V LF+ P V +P+K L+GF
Sbjct: 644 -----HLSVDAVQESCQNLVFDIHLRVNNAGNISGSHTVFLFSSPPSV-HNSPQKHLLGF 697
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
++V AK + F VD C+ LSI ++ G R + LG HVL VG L+HSL +
Sbjct: 698 EKVFVTAKAKALVRFKVDVCKDLSIVDELGTRKVALGLHVLHVGNLKHSLNVR 750
>gi|224111912|ref|XP_002316021.1| predicted protein [Populus trichocarpa]
gi|222865061|gb|EEF02192.1| predicted protein [Populus trichocarpa]
Length = 768
Score = 750 bits (1936), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 362/776 (46%), Positives = 503/776 (64%), Gaps = 42/776 (5%)
Query: 36 VLNKPDFPCKPPH--FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGI 93
V ++ F C P S FC +L I R + LI LTLQEKI+ L +NA+A+PRLGI
Sbjct: 22 VESREPFACDPKLGLTRSLKFCRVNLPIHVRVRDLIGRLTLQEKIRLLVNNAAAVPRLGI 81
Query: 94 PAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARA 153
YEWWSE+LHG+++ GPG F G T+FPQV+ +AASFN SLW IG V+ EARA
Sbjct: 82 QGYEWWSEALHGVSNVGPGTKFGGAFPGATAFPQVITTAASFNESLWEEIGRVVSDEARA 141
Query: 154 MYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGG 213
MYN G AGLT+W+PN+N+FRDPRWGRGQETPGEDP+V YA +V+ QG N
Sbjct: 142 MYNGGMAGLTYWSPNVNVFRDPRWGRGQETPGEDPVVAGKYAASYVRGLQGNN------- 194
Query: 214 IGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTED 273
G L ++ACCKH AYDL+ W RY FNA +++QD ED
Sbjct: 195 --------------------GLRLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLED 234
Query: 274 TFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVAT 332
T+ PF+SC+ GK + +MCSYNQVNG P C L + R EWG GYI SDCD+V
Sbjct: 235 TYNVPFKSCVVAGKVASVMCSYNQVNGKPTCADPYLLKNTIRGEWGLNGYIVSDCDSVGV 294
Query: 333 IFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQ 392
+F+ Q+YT T E++AA ++AG+D++CG + HT++A+ G ++E+D++ AL N +VQ
Sbjct: 295 LFDTQHYTATPEEAAASTIRAGLDLDCGPFLAIHTENAVKGGLLKEEDVNMALANTITVQ 354
Query: 393 LRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSL 452
+RLG+F+G+P +G LGP DVCT H++LAL AARQGIVLL+N + LPL++ + ++
Sbjct: 355 MRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQAARQGIVLLQNRGRTLPLSRT-LQTV 413
Query: 453 AIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVR 512
A+IGP + M G Y G+ C + L+G+ Y H+ GC+DV CN + F+ A
Sbjct: 414 AVIGPNSDVTVTMIGNYAGVACGYTTPLQGIRRYAKTVHH-PGCNDVFCNGNQQFNAAEV 472
Query: 513 IAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLD 572
A+ AD I+V GLD + E E RDR LLLPG Q LV+ VAR S+ P ILVL GGP+D
Sbjct: 473 AARHADATILVMGLDQSIEAEFRDRKGLLLPGYQQELVSIVARASRGPTILVLMSGGPID 532
Query: 573 VSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMN 631
VSFA+ D +I +ILW+GYPG+AG A+A+++FG NPGG+LPMTWYP ++ KVPM +M
Sbjct: 533 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGTANPGGKLPMTWYPHNYLAKVPMTNMG 592
Query: 632 MRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS-ASLKAGSDKNI 690
MRAD SR YPGR+YRFY G V+ FGHG+SYT +++ + AP E+++ ASL ++
Sbjct: 593 MRADPSRGYPGRTYRFYKGPVVFPFGHGMSYTTFAHSLVQAPREVSVPLASL------HV 646
Query: 691 LQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQL 750
+ T + + + +C +L V I V N GD+DG+H +++F+ P T +KQL
Sbjct: 647 SRNTTGASNAIRVSH-ANCEALALGVHIDVKNTGDMDGTHTLLVFSSPPGGKWST-QKQL 704
Query: 751 IGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
IGF++VH V K + + C+ LS+ ++ G R +P+G H L +G+L+HS++++
Sbjct: 705 IGFEKVHLVTGSQKRVKIDIHVCKHLSVVDRFGIRRIPIGEHDLYIGDLKHSISLQ 760
>gi|224054312|ref|XP_002298197.1| predicted protein [Populus trichocarpa]
gi|222845455|gb|EEE83002.1| predicted protein [Populus trichocarpa]
Length = 741
Score = 749 bits (1935), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/774 (49%), Positives = 505/774 (65%), Gaps = 42/774 (5%)
Query: 40 PDFPC---KPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAY 96
P F C P S+ FCNTSL +S R L+ LTLQEKI L ++A ++ RLGIP Y
Sbjct: 2 PVFACDVVSNPSLASFGFCNTSLGVSDRVVDLVKRLTLQEKILFLVNSAGSVSRLGIPKY 61
Query: 97 EWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN 156
EWWSE+LHG++ GPG +F+ V TSFPQV+++AASFN SL+ IG V+ EARAMYN
Sbjct: 62 EWWSEALHGVSYVGPGTHFSSVVPGATSFPQVILTAASFNTSLFVAIGKVVSTEARAMYN 121
Query: 157 LGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGF 216
+G AGLTFW+PNINIFRDPRWGRGQETPGEDP++ S Y +VK Q + DDG
Sbjct: 122 VGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLSSKYGSGYVKGLQ----QRDDG---- 173
Query: 217 GFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQ 276
D L ++ACCKH AYDL+ W RY FNA++T+QD +DTFQ
Sbjct: 174 ----------------NPDGLKVAACCKHYTAYDLDNWKGVDRYHFNAVVTKQDMDDTFQ 217
Query: 277 PPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFE 335
PPF+SC+ G + +MCSYN+VNG+P C DL R EW GYI +DCD++ +
Sbjct: 218 PPFKSCVVDGNVASVMCSYNKVNGIPTCADPDLLSGVIRGEWKLNGYIVTDCDSIDVFYN 277
Query: 336 YQNYTKTHEDSAAGVLKAG--MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQL 393
Q+YTKT E++AA + AG +D+NCG+ + +HT++A+ G V E IDRA+ N F+ +
Sbjct: 278 SQHYTKTPEEAAAKAILAGIRLDLNCGSFLGKHTEAAVTAGLVNESAIDRAVSNNFATLM 337
Query: 394 RLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
RLG F+GDP K YGKLGP DVCT+E+++LA +AARQGIVLLKN LPL+ A+ +LA
Sbjct: 338 RLGFFDGDPSKQLYGKLGPKDVCTAENQELAREAARQGIVLLKNTAGSLPLSPTAIKNLA 397
Query: 454 IIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRI 513
+IGP N M G Y G PC + L+GL A V+ T Y GC +V C S A +A +I
Sbjct: 398 VIGPNANVTKTMIGNYEGTPCKYTTPLQGLAALVATT-YLPGCSNVAC-STAQVDDAKKI 455
Query: 514 AKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
A AD ++V G DL+ E E RDRV +LLPGQQ L+T+VA S PVILV+ GG +DV
Sbjct: 456 AAAADATVLVMGADLSIEAESRDRVDILLPGQQQLLITAVANASTGPVILVIMSGGGMDV 515
Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNM 632
SFA+ + +I+SILW+GYPGEAG A+A+IIFG +NP GRLPMTWYP+S+ KVPM +MNM
Sbjct: 516 SFAKTNDKITSILWVGYPGEAGGAAIADIIFGSYNPSGRLPMTWYPQSYVDKVPMTNMNM 575
Query: 633 RADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQ 692
R D S YPGR+YRFYTG VY FG GLSY+ +S++ AP +++ ++N +
Sbjct: 576 RPDPSNGYPGRTYRFYTGETVYSFGDGLSYSEFSHELTQAPGLVSVPL------EENHVC 629
Query: 693 QTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIG 752
+ S V E T C +L F V + + N G GSH V LF+ P V +P+K L+G
Sbjct: 630 YS-SECKSVAAAEQT-CQNLTFDVHLRIKNTGTTSGSHTVFLFSTPPSV-HNSPQKHLVG 686
Query: 753 FDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
F++V A+ + F VD C+ LS+ ++ G + + LG HVL +G L+HS+T+
Sbjct: 687 FEKVFLHAQTDSHVGFKVDVCKDLSVVDELGSKKVALGEHVLHIGSLKHSMTVR 740
>gi|297611657|ref|NP_001067709.2| Os11g0291000 [Oryza sativa Japonica Group]
gi|255680005|dbj|BAF28072.2| Os11g0291000 [Oryza sativa Japonica Group]
Length = 764
Score = 748 bits (1930), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 375/752 (49%), Positives = 488/752 (64%), Gaps = 38/752 (5%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
FC+ L+ RA L++ LTL EK+ QL D A+ + RLG+PAYEWWSE LHG++ G G+
Sbjct: 31 FCDAWLTAEQRAADLVANLTLAEKVSQLGDRAAGVARLGVPAYEWWSEGLHGLSIWGRGI 90
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA-GLTFWAPNINIF 172
FNGTV +VTSFPQV+++AA+F+ LW +G AV EARA+YNLGQA GLT W+PN+NIF
Sbjct: 91 RFNGTVRAVTSFPQVILTAAAFDAGLWRRVGEAVGAEARALYNLGQANGLTIWSPNVNIF 150
Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
RDPRWGRGQETPGEDP+ S YAV FV QG G E+
Sbjct: 151 RDPRWGRGQETPGEDPVTASRYAVAFVTGLQG----------------------IGGEA- 187
Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
SACCKH AYDL+ W N RY++++ +T QD EDT+ PPF+SC+ +GKA+CIM
Sbjct: 188 -------SACCKHATAYDLDYWNNVVRYNYDSKVTLQDLEDTYNPPFKSCVAEGKATCIM 240
Query: 293 CSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVL 351
C YN +NGVPAC DL +K R EWG GY+ SDCDAVATI + +YT + ED+ A +
Sbjct: 241 CGYNSINGVPACASSDLLTKKVRQEWGMNGYVASDCDAVATIRDAHHYTLSPEDTVAVSI 300
Query: 352 KAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-YGKL 410
K GMD+NCG H +A+ KG + EKDIDRAL+NLF+V++RLG F+GDPR YG L
Sbjct: 301 KVGMDVNCGNYTQVHAMAAVQKGNLTEKDIDRALVNLFAVRMRLGHFDGDPRSNAVYGHL 360
Query: 411 GPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT 470
G DVC+ HK LAL+AA+ GIVLLKND LPL +AV+SLA+IGP +N+ + G Y
Sbjct: 361 GAADVCSPAHKSLALEAAQDGIVLLKNDAGALPLQPSAVTSLAVIGPNADNLGALHGNYF 420
Query: 471 GIPCSPKSLLRGLEAYVS-KTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLT 529
G PC + L+G++ Y+ + + +GC D P + A +EA +A +D V++ GL
Sbjct: 421 GPPCETTTPLQGIKGYLGDRARFLAGC-DSPACAVAATNEAAALASSSDHVVLFMGLSQK 479
Query: 530 QETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIG 589
QE + DR SLLLPG+Q L+T+VA ++RPVILVL GGP+DV+FA+ + +I +ILW G
Sbjct: 480 QEQDGLDRTSLLLPGEQQGLITAVANAARRPVILVLLTGGPVDVTFAKDNPKIGAILWAG 539
Query: 590 YPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYT 649
YPG+AG A+A+++FGD NP GRLP+TWYPE FTKVPM DM MRAD + YPGRSYRFY
Sbjct: 540 YPGQAGGLAIAKVLFGDHNPSGRLPVTWYPEEFTKVPMTDMRMRADPATGYPGRSYRFYQ 599
Query: 650 GTQVYGFGHGLSYTNYSYKFLSAPSELTI-SASLKAGSDKNILQQTGSRLDYVHIDE--V 706
G VY FG+GLSY+ +S + S+ S + SL AG G + + E V
Sbjct: 600 GNTVYNFGYGLSYSKFSRRMFSSFSTSNAGNLSLLAGVMARRAGDDGGGMSSYLVKEIGV 659
Query: 707 TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEI 766
C+ L F + V N G +DG H V+++ R P S G P +QLIGF H +
Sbjct: 660 ERCSRLVFPAVVEVQNHGPMDGKHSVLMYLRWPTTSGGRPARQLIGFRSQHVKVGEKAMV 719
Query: 767 SFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
SF V PCE S + G R++ G H LMVG+
Sbjct: 720 SFEVSPCEHFSWVGEDGERVIDGGAHFLMVGD 751
>gi|62734691|gb|AAX96800.1| Glycosyl hydrolase family 3 C terminal domain, putative [Oryza
sativa Japonica Group]
gi|77549994|gb|ABA92791.1| beta-D-xylosidase, putative, expressed [Oryza sativa Japonica
Group]
Length = 853
Score = 747 bits (1929), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 366/767 (47%), Positives = 484/767 (63%), Gaps = 39/767 (5%)
Query: 40 PDFPCKPPHFDS-YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEW 98
P F C P + FCN +L RA L++ LT EK+ QL D A +PRLGIP Y+W
Sbjct: 109 PGFTCGPASAQKGFAFCNAALPAEQRAADLVARLTTAEKVGQLGDQAPGVPRLGIPVYKW 168
Query: 99 WSESLHGIASNGPGVNF-NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL 157
WSE+LHG+A +G G++F NG + TSFPQV+ +AA+F+ LW IG A+ E RA YNL
Sbjct: 169 WSEALHGLAISGKGIHFGNGPARTATSFPQVIHTAAAFDDGLWFRIGQAIGKEGRAFYNL 228
Query: 158 GQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGF 216
GQA GL W+PN+NIFRDPRWGRGQETPGEDP S Y FVK QG + +
Sbjct: 229 GQAEGLAMWSPNVNIFRDPRWGRGQETPGEDPATASKYGAAFVKGLQGSSLTN------- 281
Query: 217 GFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQ 276
L SACCKH+ AYD+E+W SRY+FNA +T QD DT+
Sbjct: 282 --------------------LQTSACCKHITAYDIEEWKGVSRYNFNAKVTPQDLADTYN 321
Query: 277 PPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFE 335
PPFRSC+ GKASCIMC+Y +NGVPAC DL K R EW GY SDCDAVA + +
Sbjct: 322 PPFRSCVVDGKASCIMCAYTLINGVPACASSDLLTKTVRGEWKLDGYTASDCDAVAILHK 381
Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
+++T+T E++ A LKAG+DINCG M ++ SA+ +GK+ EKD+D+AL NLF++++RL
Sbjct: 382 SEHFTRTAEEAVAVALKAGLDINCGVYMQQNAASALQQGKMTEKDVDKALKNLFAIRMRL 441
Query: 396 GLFNGDPRKGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
G F+GDPR K YG+L DVCT HK LAL+AAR+G+VLLKND + LPL V+S A+
Sbjct: 442 GHFDGDPRGNKLYGRLSAADVCTPVHKALALEAARRGVVLLKNDARLLPLRAPTVASAAV 501
Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIA 514
IG N+I + G Y G+PC + G++ YV + GC C+ A +A +A
Sbjct: 502 IGHNANDILALLGNYYGLPCETTTPFGGIQKYVKSAKFLPGCSSAACDV-AATDQATALA 560
Query: 515 KKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
K +D+V +V GL QE E DR SLLLPG+Q +L+T+VA SKRPVIL+L GGP+D++
Sbjct: 561 KSSDYVFLVMGLSQKQEQEGLDRTSLLLPGKQQALITAVATASKRPVILILLTGGPVDIT 620
Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA 634
FA+ + +I +ILW GYPG+AG +A+A+++FG+FNP G+LP+TWYPE FTK M DM MR
Sbjct: 621 FAQTNPKIGAILWAGYPGQAGGQAIADVLFGEFNPSGKLPVTWYPEEFTKFTMTDMRMRP 680
Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
D + YPGRSYRFY G VY FG+GLSY+ ++ + +S S+S + + T
Sbjct: 681 DPATGYPGRSYRFYKGKTVYKFGYGLSYSKFACRIVSGAGN---SSSYGKAALAGLRAAT 737
Query: 695 GSRLDYVH-IDEV--TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLI 751
D V+ +DE+ C LRF V + V N G +DG H V++F R G P +QLI
Sbjct: 738 TPEGDAVYRVDEIGDDRCERLRFPVMVEVQNHGPMDGKHTVLMFVRWSSTDGGRPVRQLI 797
Query: 752 GFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
GF H K++ + PCE LS A G +++ G+H LMV E
Sbjct: 798 GFRNQHLKVGEKKKLKMEISPCEHLSRARVDGEKVIDRGSHFLMVEE 844
>gi|225437531|ref|XP_002270249.1| PREDICTED: probable beta-D-xylosidase 2 [Vitis vinifera]
gi|297743965|emb|CBI36935.3| unnamed protein product [Vitis vinifera]
Length = 768
Score = 747 bits (1929), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 360/793 (45%), Positives = 503/793 (63%), Gaps = 48/793 (6%)
Query: 18 VLQLIVVVNVIAFSNSKPVLNKPDFPCKPPHFDS--YPFCNTSLSISTRAKSLISLLTLQ 75
+L +VV+ V++ P F C P + +PFC S+ I R K LI LTL+
Sbjct: 11 LLIFLVVLAVVSGEARDP------FACDPKDGANAGFPFCRKSIGIGERVKDLIGRLTLE 64
Query: 76 EKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASF 135
EK++ L +NA+ +PRLGI YEWWSE+LHG+++ GPG F+G TSFPQV+ +AASF
Sbjct: 65 EKVRLLVNNAAGVPRLGIKGYEWWSEALHGVSNVGPGTKFSGDFPGATSFPQVITTAASF 124
Query: 136 NRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYA 195
N SLW IG V+ EARAMYN G AGLTFW+PN+NIFRDPRWGRGQETPGEDP++ YA
Sbjct: 125 NSSLWEAIGQVVSDEARAMYNGGAAGLTFWSPNVNIFRDPRWGRGQETPGEDPVLAGKYA 184
Query: 196 VEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG 255
+V+ QG + GD L ++ACCKH AYDL+ W
Sbjct: 185 ARYVRGLQG---------------------------NAGDRLKVAACCKHFTAYDLDNWN 217
Query: 256 NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-R 314
R+ F+A +++Q+ EDTF PFRSC+ +GK + +MCSYNQVNGVP C +L + R
Sbjct: 218 GVDRFHFDARVSKQEMEDTFDVPFRSCVVEGKVASVMCSYNQVNGVPTCADPNLLRNTVR 277
Query: 315 NEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKG 374
+W GY+ SDCD+V ++ Q+YT T E++AA +KAG+D++CG + HTQ AI KG
Sbjct: 278 KQWHLNGYVVSDCDSVGVFYDNQHYTNTPEEAAADAIKAGLDLDCGPFLAVHTQDAIKKG 337
Query: 375 KVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVL 434
V E D+D AL+N +VQ+RLG+F+G+P +G LGP DVC+ H++LA++AARQGIVL
Sbjct: 338 LVSEADVDSALVNTVTVQMRLGMFDGEPSAQPFGDLGPKDVCSPAHQELAIEAARQGIVL 397
Query: 435 LKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYAS 494
LKN LPL+ + S+A+IGP + M G Y GIPC + L+G+ Y S+T +
Sbjct: 398 LKNHGHSLPLSTRSHRSIAVIGPNSDANVTMIGNYAGIPCEYTTPLQGIGRY-SRTIHQK 456
Query: 495 GCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVA 554
GC DV C+ D F A+ A +AD ++V GLD + E E +DR LLLPG+Q LV+ VA
Sbjct: 457 GCADVACSEDQLFAGAIDAASQADATVLVMGLDQSIEAEAKDRADLLLPGRQQELVSKVA 516
Query: 555 RTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLP 614
S+ P +LVL GGP+DVSFA+ D +I++I+W GYPG+AG A+A+I+FG NPGG+LP
Sbjct: 517 MASRGPTVLVLMSGGPVDVSFAKKDPRIAAIVWAGYPGQAGGAAIADILFGVANPGGKLP 576
Query: 615 MTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAP 673
MTWYP+ + +KVPM M MRA S+ YPGR+YRFY G VY FGHGLSYTN+ + AP
Sbjct: 577 MTWYPQEYLSKVPMTTMAMRAIPSKAYPGRTYRFYKGPVVYRFGHGLSYTNFVHTIAQAP 636
Query: 674 SELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVM 733
+ + I L + + +G + H C L + + V N G+ DGSH ++
Sbjct: 637 TAVAI--PLHGHHNTTV---SGKAIRVTH----AKCNRLSIALHLDVKNVGNKDGSHTLL 687
Query: 734 LFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHV 793
+F++ P P KQL+ F++VH A+ + + + C+ LS+ ++ G R +P+G H
Sbjct: 688 VFSK-PPAGHWAPHKQLVAFEKVHVAARTQQRVQINIHVCKYLSVVDRSGIRRIPMGQHG 746
Query: 794 LMVGELRHSLTIE 806
L +G+ +H ++++
Sbjct: 747 LHIGDTKHLVSLQ 759
>gi|350534908|ref|NP_001233910.1| beta-D-xylosidase 1 precursor [Solanum lycopersicum]
gi|37359706|dbj|BAC98298.1| LEXYL1 [Solanum lycopersicum]
Length = 770
Score = 747 bits (1929), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 372/774 (48%), Positives = 496/774 (64%), Gaps = 40/774 (5%)
Query: 38 NKPDFPC---KPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIP 94
N P F C P + FC+ SL++ R L++ LTL EKI L A + RLGIP
Sbjct: 31 NSPVFACDVTSNPALGNLTFCDASLAVENRVNDLVNRLTLGEKIGFLVSGAGGVSRLGIP 90
Query: 95 AYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAM 154
YEWWSE+LHG+A GPGV+F V TSFPQV+++AASFN +L+ IG V+ EARAM
Sbjct: 91 KYEWWSEALHGVAYTGPGVHFTSLVPGATSFPQVILTAASFNVTLFQTIGKVVSTEARAM 150
Query: 155 YNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGI 214
YN+G AGLT+W+PN+NIFRDPRWGRGQETPGEDP + S Y V +V+ Q ++DDG
Sbjct: 151 YNVGLAGLTYWSPNVNIFRDPRWGRGQETPGEDPTLTSKYGVAYVEGLQ----QTDDGST 206
Query: 215 GFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDT 274
++L ++ACCKH AYD++ W RYSFNA++ +QD +DT
Sbjct: 207 --------------------NKLKVAACCKHYTAYDVDNWKGIERYSFNAVVRQQDLDDT 246
Query: 275 FQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATI 333
FQPPFRSC+ +G + +MCSYNQVNG P C +L R EW GYI +DCD++ I
Sbjct: 247 FQPPFRSCVLEGAVASVMCSYNQVNGKPTCGDPNLLAGIVRGEWKLNGYIVTDCDSLQVI 306
Query: 334 FEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQL 393
F+ QNYTKT E++AA L +G+D+NCG+ + +TQ A+++ V E IDRA+ N F+ +
Sbjct: 307 FKSQNYTKTPEEAAALGLNSGVDLNCGSWLSTYTQGAVNQKLVNESVIDRAISNNFATLM 366
Query: 394 RLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
RLG F+G+P+ YG LGP DVCT E+++LA +AARQGIVLLKN LPL A+ SLA
Sbjct: 367 RLGFFDGNPKSRIYGNLGPKDVCTPENQELAREAARQGIVLLKNTAGSLPLTPTAIKSLA 426
Query: 454 IIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRI 513
+IGP N M G Y GIPC + L+GL A V+ T Y GC DV CN+ A +A +I
Sbjct: 427 VIGPNANVTKTMIGNYEGIPCKYTTPLQGLTASVA-TIYKPGCADVSCNT-AQIDDAKQI 484
Query: 514 AKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
A AD V++V G D + E E DR S+ LPGQQ LV VA+ +K PVILV+ GG +DV
Sbjct: 485 ATTADAVVLVMGSDQSIEKESLDRTSITLPGQQSILVAEVAKVAKGPVILVIMSGGGMDV 544
Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK-VPMNDMNM 632
FA + +I+SILW+G+PGEAG ALA++IFG +NP GRLPMTWYP+S+ VPM DMNM
Sbjct: 545 QFAVDNPKITSILWVGFPGEAGGAALADVIFGYYNPSGRLPMTWYPQSYADVVPMTDMNM 604
Query: 633 RADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQ 692
R + + YPGR+YRFYTG V+ FGHGLSY+ + + AP + SL G
Sbjct: 605 RPNPATNYPGRTYRFYTGPTVFTFGHGLSYSQFKHHLDKAPQFV----SLPLGEKHTCRL 660
Query: 693 QTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIG 752
+D V SC+++ F + + V N G + GSH++ LF P V P+K L+G
Sbjct: 661 SKCKTVDAVG----QSCSNMGFDIHLRVKNVGKISGSHIIFLFTSPPSV-HNAPKKHLLG 715
Query: 753 FDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
F++VH +G + F V+ C+ LS+ ++ G R + LG HVL +G+L+HSLT+
Sbjct: 716 FEKVHLTPQGEGVVKFNVNVCKHLSVHDELGNRKVALGPHVLHIGDLKHSLTVR 769
>gi|115485163|ref|NP_001067725.1| Os11g0297300 [Oryza sativa Japonica Group]
gi|113644947|dbj|BAF28088.1| Os11g0297300 [Oryza sativa Japonica Group]
Length = 779
Score = 746 bits (1926), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 366/767 (47%), Positives = 484/767 (63%), Gaps = 39/767 (5%)
Query: 40 PDFPCKPPHFDS-YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEW 98
P F C P + FCN +L RA L++ LT EK+ QL D A +PRLGIP Y+W
Sbjct: 35 PGFTCGPASAQKGFAFCNAALPAEQRAADLVARLTTAEKVGQLGDQAPGVPRLGIPVYKW 94
Query: 99 WSESLHGIASNGPGVNF-NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL 157
WSE+LHG+A +G G++F NG + TSFPQV+ +AA+F+ LW IG A+ E RA YNL
Sbjct: 95 WSEALHGLAISGKGIHFGNGPARTATSFPQVIHTAAAFDDGLWFRIGQAIGKEGRAFYNL 154
Query: 158 GQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGF 216
GQA GL W+PN+NIFRDPRWGRGQETPGEDP S Y FVK QG + +
Sbjct: 155 GQAEGLAMWSPNVNIFRDPRWGRGQETPGEDPATASKYGAAFVKGLQGSSLTN------- 207
Query: 217 GFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQ 276
L SACCKH+ AYD+E+W SRY+FNA +T QD DT+
Sbjct: 208 --------------------LQTSACCKHITAYDIEEWKGVSRYNFNAKVTPQDLADTYN 247
Query: 277 PPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFE 335
PPFRSC+ GKASCIMC+Y +NGVPAC DL K R EW GY SDCDAVA + +
Sbjct: 248 PPFRSCVVDGKASCIMCAYTLINGVPACASSDLLTKTVRGEWKLDGYTASDCDAVAILHK 307
Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
+++T+T E++ A LKAG+DINCG M ++ SA+ +GK+ EKD+D+AL NLF++++RL
Sbjct: 308 SEHFTRTAEEAVAVALKAGLDINCGVYMQQNAASALQQGKMTEKDVDKALKNLFAIRMRL 367
Query: 396 GLFNGDPRKGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
G F+GDPR K YG+L DVCT HK LAL+AAR+G+VLLKND + LPL V+S A+
Sbjct: 368 GHFDGDPRGNKLYGRLSAADVCTPVHKALALEAARRGVVLLKNDARLLPLRAPTVASAAV 427
Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIA 514
IG N+I + G Y G+PC + G++ YV + GC C+ A +A +A
Sbjct: 428 IGHNANDILALLGNYYGLPCETTTPFGGIQKYVKSAKFLPGCSSAACDV-AATDQATALA 486
Query: 515 KKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
K +D+V +V GL QE E DR SLLLPG+Q +L+T+VA SKRPVIL+L GGP+D++
Sbjct: 487 KSSDYVFLVMGLSQKQEQEGLDRTSLLLPGKQQALITAVATASKRPVILILLTGGPVDIT 546
Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA 634
FA+ + +I +ILW GYPG+AG +A+A+++FG+FNP G+LP+TWYPE FTK M DM MR
Sbjct: 547 FAQTNPKIGAILWAGYPGQAGGQAIADVLFGEFNPSGKLPVTWYPEEFTKFTMTDMRMRP 606
Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
D + YPGRSYRFY G VY FG+GLSY+ ++ + +S S+S + + T
Sbjct: 607 DPATGYPGRSYRFYKGKTVYKFGYGLSYSKFACRIVSGAGN---SSSYGKAALAGLRAAT 663
Query: 695 GSRLDYVH-IDEVTS--CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLI 751
D V+ +DE+ C LRF V + V N G +DG H V++F R G P +QLI
Sbjct: 664 TPEGDAVYRVDEIGDDRCERLRFPVMVEVQNHGPMDGKHTVLMFVRWSSTDGGRPVRQLI 723
Query: 752 GFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
GF H K++ + PCE LS A G +++ G+H LMV E
Sbjct: 724 GFRNQHLKVGEKKKLKMEISPCEHLSRARVDGEKVIDRGSHFLMVEE 770
>gi|356524862|ref|XP_003531047.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
[Glycine max]
Length = 765
Score = 744 bits (1921), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 370/771 (47%), Positives = 497/771 (64%), Gaps = 44/771 (5%)
Query: 42 FPC---KPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEW 98
F C K P Y FC+ SL + R K L+ LTLQEKI L ++A + RLGIP YEW
Sbjct: 30 FACDVGKSPAVAGYGFCDKSLGVEARVKDLVGRLTLQEKIGNLVNSAVDVSRLGIPKYEW 89
Query: 99 WSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
WSE+LHG+++ GPG F+ + TSFP +++AASFN SL+ IG V+ EARAMYN+G
Sbjct: 90 WSEALHGVSNVGPGTRFSNVIPGATSFPMPILTAASFNTSLFEVIGRVVSTEARAMYNVG 149
Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
AGLT+W+PNINIFRDPRWGRG ETPGEDP++ S YA +VK Q
Sbjct: 150 LAGLTYWSPNINIFRDPRWGRGLETPGEDPVLTSKYAAGYVKGLQ--------------- 194
Query: 219 REKRVLKGFGEESDRGD--ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQ 276
++D GD +L ++ACCKH AYD++ W RY+FNA++T+QD EDTFQ
Sbjct: 195 -----------QTDGGDPNKLKVAACCKHYTAYDVDNWKGIQRYTFNAVVTKQDMEDTFQ 243
Query: 277 PPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFE 335
PPF+SC+ G + +MCSYN+VNG P C DL + R EW GYI SDCD+V +++
Sbjct: 244 PPFKSCVIDGNVASVMCSYNKVNGKPTCADPDLLKGVVRGEWKLNGYIVSDCDSVEVLYK 303
Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
Q+YTKT E++AA + AG+D+NCG + ++T+ A+ +G + E I+ A+ N F+ +RL
Sbjct: 304 DQHYTKTPEEAAAISILAGLDLNCGRFLGQYTEGAVKQGLIDEASINNAVTNNFATLMRL 363
Query: 396 GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAII 455
G F+GDPRK YG LGP DVCT E+++LA +AARQGIVLLKN LPLN A+ SLA+I
Sbjct: 364 GFFDGDPRKQPYGNLGPKDVCTQENQELAREAARQGIVLLKNSPASLPLNAKAIKSLAVI 423
Query: 456 GPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAK 515
GP N M G Y GIPC S L+GL A+ + T YA+GC DV C + +A +IA
Sbjct: 424 GPNANATRVMIGNYEGIPCKYISPLQGLTAF-APTSYAAGCLDVRCPNPV-LDDAKKIAA 481
Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
AD ++V G L E E DRV++LLPGQQ LV+ VA SK PVILV+ GG +DVSF
Sbjct: 482 SADATVIVVGASLAIEAESLDRVNILLPGQQQLLVSEVANASKGPVILVIMSGGGMDVSF 541
Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRA 634
A+ +++I+SILW+GYPGEAG A+A++IFG NP GRLPMTWYP+S+ KVPM +MNMR
Sbjct: 542 AKNNNKITSILWVGYPGEAGGAAIADVIFGFHNPSGRLPMTWYPQSYVDKVPMTNMNMRP 601
Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
D + YPGR+YRFY G V+ FG GLSY++ +K + AP + S++ D
Sbjct: 602 DPATGYPGRTYRFYKGETVFAFGDGLSYSSIVHKLVKAPQLV----SVQLAEDHVCRSSE 657
Query: 695 GSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFD 754
+D V C +L F + + + N G + +H V LF+ P V P+K L+GF+
Sbjct: 658 CKSIDVVG----EHCQNLVFDIHLRIKNKGKMSSAHTVFLFSTPPAV-HNAPQKHLLGFE 712
Query: 755 RVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
+VH + K +SF VD C+ LSI ++ G R + LG H+L VG+L+H L++
Sbjct: 713 KVHLIGKSEALVSFKVDVCKDLSIVDELGNRKVALGQHLLHVGDLKHPLSV 763
>gi|253761874|ref|XP_002489311.1| hypothetical protein SORBIDRAFT_0010s012040 [Sorghum bicolor]
gi|241946959|gb|EES20104.1| hypothetical protein SORBIDRAFT_0010s012040 [Sorghum bicolor]
Length = 791
Score = 744 bits (1921), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 368/775 (47%), Positives = 483/775 (62%), Gaps = 35/775 (4%)
Query: 36 VLNKPDFPCKPPH-FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIP 94
V P F C P PFCN L S RA L+S +T EK QL D A+ +PRLG+P
Sbjct: 43 VAGAPPFSCGPSSPSKGLPFCNMKLPASQRAADLVSRMTPAEKASQLGDIANGVPRLGVP 102
Query: 95 AYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAM 154
+Y+WW+E+LHG+A +G G++ N V S TSFPQVL +AASFN +LW IG A EARA
Sbjct: 103 SYKWWNEALHGVAISGKGIHMNQGVRSATSFPQVLHTAASFNDNLWFRIGQATGKEARAF 162
Query: 155 YNLGQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGG 213
YN+GQA GLT W+PN+NIFRDPRWGRGQETPGEDP V S Y FV+ QG +
Sbjct: 163 YNIGQAEGLTMWSPNVNIFRDPRWGRGQETPGEDPAVASRYGAAFVRGLQGSS------- 215
Query: 214 IGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTED 273
L SACCKH AYDLE W SRYSF A +T QD D
Sbjct: 216 --------------SNTKSVPPVLQTSACCKHATAYDLEDWKGVSRYSFKATVTIQDLAD 261
Query: 274 TFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVAT 332
TF PPFRSC+ GKASC+MC+Y VNGVP+C GDL K R WG GY+ +DCDAVA
Sbjct: 262 TFNPPFRSCVVDGKASCVMCAYTIVNGVPSCANGDLLTKTFRGSWGLDGYVAADCDAVAI 321
Query: 333 IFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQ 392
+ Q Y T ED+ A LKAG+DI+CG + ++ +AI KGK+ ++D+D+A+ NL + +
Sbjct: 322 MRNSQFYRPTAEDTVAATLKAGLDIDCGPYIQQYAMAAIQKGKLTQQDVDKAVKNLLTTR 381
Query: 393 LRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSL 452
+RLG F+GDP+ YG LG +CT+EHK LAL+AA GIVLLKN LPL + V+S
Sbjct: 382 MRLGHFDGDPKTNVYGNLGAGHICTAEHKNLALEAALDGIVLLKNSAGVLPLKRGTVNSA 441
Query: 453 AIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVR 512
A+IG N++ + G Y G PC+P + L+G++ YV + +GC+ CN A +A
Sbjct: 442 AVIGHNANDVLALLGNYWGPPCAPTTPLQGIQGYVKNVKFLAGCNKAACNV-AATPQATA 500
Query: 513 IAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLD 572
+A +D VI+ GL QE+E +DR +LLLPG Q SL+ +VA +KRPVILVL GGP+D
Sbjct: 501 LASSSDAVILFMGLSQEQESEGKDRTTLLLPGNQQSLINAVANAAKRPVILVLLTGGPVD 560
Query: 573 VSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNM 632
++FA+A+ +I +ILW GYPG+AG A+A+++FG+ NP G+LP TWYPE FT++PM DM M
Sbjct: 561 ITFAQANPKIGAILWAGYPGQAGGLAIAKVLFGEKNPSGKLPNTWYPEEFTRIPMTDMRM 620
Query: 633 RADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQ 692
RA S YPGR+YRFY G +Y FG+GLSY+ +S++ ++ + SL A +
Sbjct: 621 RAAGS--YPGRTYRFYNGKTIYKFGYGLSYSKFSHRVVTGRKNPAHNTSLLAAG---LAA 675
Query: 693 QTGSRLDY--VHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQL 750
T L Y HI +V C L+F + V N G +DG H ++F R P + G P +QL
Sbjct: 676 MTEDNLSYHVEHIGDVV-CDQLKFLAVVKVQNHGPIDGKHTALMFLRWPSATDGRPTRQL 734
Query: 751 IGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
IGF H A + F V PCE S + GR+++ G+H L VG +H L I
Sbjct: 735 IGFQSQHIKAGEKANLRFEVSPCEHFSRVRQDGRKVIDKGSHFLKVG--KHELEI 787
>gi|255548487|ref|XP_002515300.1| Beta-glucosidase, putative [Ricinus communis]
gi|223545780|gb|EEF47284.1| Beta-glucosidase, putative [Ricinus communis]
Length = 768
Score = 743 bits (1919), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 349/756 (46%), Positives = 487/756 (64%), Gaps = 39/756 (5%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
PFC L I R K LI LTL EK+ L +NA A+ RLGI YEWWSE+LHG+++ GPG
Sbjct: 41 PFCQVKLPIQDRVKDLIGRLTLAEKVGLLVNNAGAVSRLGIKGYEWWSEALHGVSNVGPG 100
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
F G+ TSFPQV+ +AASFN +LW IG V+ EARAMYN G AGLT+W+PN+NI
Sbjct: 101 TKFGGSFPGATSFPQVITTAASFNSTLWEAIGRVVSDEARAMYNGGAAGLTYWSPNVNIL 160
Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
RDPRWGRGQETPGEDP++V YA +VK QG +
Sbjct: 161 RDPRWGRGQETPGEDPLLVGKYAASYVKGLQGND-------------------------- 194
Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
G+ L ++ACCKH AYDL+ W R+ FNA +++QD +DTF PFR C+++GK + +M
Sbjct: 195 -GERLKVAACCKHFTAYDLDNWNGVDRFHFNAKVSKQDMKDTFDVPFRMCVKEGKVASVM 253
Query: 293 CSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVL 351
CSYNQVNG+P C +L +K R +WG GYI SDCD+V ++ Q+YT T E++AA +
Sbjct: 254 CSYNQVNGIPTCADPNLLRKTVRTQWGLNGYIVSDCDSVGVFYDKQHYTSTPEEAAADAI 313
Query: 352 KAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLG 411
KAG+D++CG + HTQ A+ +G + E D++ AL N +VQ+RLG+F+G+P YG LG
Sbjct: 314 KAGLDLDCGPFLAVHTQDAVKRGLISEADVNGALFNTLTVQMRLGMFDGEPSAQPYGNLG 373
Query: 412 PDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTG 471
P DVCT H++LAL+A RQGIVLLKN LPL+ ++AIIGP N M G Y G
Sbjct: 374 PKDVCTPAHQELALEAGRQGIVLLKNHGPSLPLSPRRHRTVAIIGPNSNVTVTMIGNYAG 433
Query: 472 IPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQE 531
+ C + L+G+ +Y +KT + GC DV C +D F A+ A++AD ++V GLD + E
Sbjct: 434 VACQYTTPLQGIGSY-AKTIHQQGCADVGCVTDQLFSGAIDAARQADATVLVMGLDQSIE 492
Query: 532 TEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYP 591
E RDR LLLPG+Q LV+ VA SK P ILVL GGP+DVSFA+ D +I++ILW GYP
Sbjct: 493 AEFRDRTGLLLPGRQQELVSKVAMASKGPTILVLMSGGPIDVSFAKKDPKIAAILWAGYP 552
Query: 592 GEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRFYTG 650
G+AG A+A+++FG NPGG+LPMTWYP+ + T +PM +M MR+ S+ YPGR+YRFY G
Sbjct: 553 GQAGGAAIADVLFGTINPGGKLPMTWYPQEYITNLPMTEMAMRSSQSKGYPGRTYRFYQG 612
Query: 651 TQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCT 710
VY FGHG+SYT++ + SAP+ +++ G+ +G + H T C
Sbjct: 613 KVVYPFGHGMSYTHFVHNIASAPTMVSVPLDGHRGNT----SISGKAIRVTH----TKCN 664
Query: 711 SLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGV 770
L +Q+ V N G DG+H +++++ P + +P KQL+ F+RVH A + + +
Sbjct: 665 KLSLGIQVDVKNVGSKDGTHTLLVYS-APPAGRWSPHKQLVAFERVHVSAGTQERVGISI 723
Query: 771 DPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
C+ LS+ ++ G R +P+G H + +G ++HS++++
Sbjct: 724 HVCKLLSVVDRSGIRRIPIGEHSIHIGNVKHSVSLQ 759
>gi|125534112|gb|EAY80660.1| hypothetical protein OsI_35838 [Oryza sativa Indica Group]
Length = 771
Score = 743 bits (1918), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 367/769 (47%), Positives = 492/769 (63%), Gaps = 41/769 (5%)
Query: 39 KPDFPCKP--PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAY 96
+P + C P P Y FC+ L + RA L+S LT EK+ QL D A + RLG+P Y
Sbjct: 24 EPPYSCGPRSPSL-GYAFCDARLPPARRAADLVSRLTAAEKVAQLGDEAGGVARLGVPPY 82
Query: 97 EWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN 156
+WWSE LHG++ G G++FNG V+++TSFPQVL++AA+F+ LW IG A+ EARA+YN
Sbjct: 83 KWWSEGLHGLSYWGHGMHFNGAVTAITSFPQVLLTAAAFDDRLWFRIGQAIGTEARALYN 142
Query: 157 LGQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
LGQA GLT W+PN+NI+RDPRWGRGQETPGEDP S YAV FVK QG +
Sbjct: 143 LGQAEGLTIWSPNVNIYRDPRWGRGQETPGEDPTTASKYAVAFVKGLQGSTPGT------ 196
Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTF 275
L SACCKH AYDLE+W +RY+FNA +T QD DTF
Sbjct: 197 ---------------------LQTSACCKHATAYDLEEWNGVARYNFNAKVTAQDLADTF 235
Query: 276 QPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIF 334
PPF+SC+ KASC+MC+Y +NGVPAC DL K R +WG GY++SDCDAVA +
Sbjct: 236 NPPFKSCVVDAKASCVMCAYTDINGVPACASSDLLSKTFRGQWGLDGYVSSDCDAVALLR 295
Query: 335 EYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
+ Q Y T ED+ A +KAG+D+NCG H +A+ +GK++E D+DRAL NLF+V++R
Sbjct: 296 DAQRYAPTPEDTVAVAIKAGLDLNCGNYTQVHGMAALQQGKMRESDVDRALTNLFAVRMR 355
Query: 395 LGLFNGDPR-KGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
LG F+GDPR YG LG DVCT H+ LAL+AA+ GIVLLKND LPL++ V S A
Sbjct: 356 LGHFDGDPRSNAAYGHLGAADVCTQAHRDLALEAAQNGIVLLKNDAGALPLDRATVRSAA 415
Query: 454 IIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRI 513
+IGP N+ + + G Y G PC + L+G++ Y+S + +GC D P A +A +
Sbjct: 416 VIGPNANDPAALNGNYFGPPCETTTPLQGVQRYISSVRFLAGC-DSPACGFAATGQAAAL 474
Query: 514 AKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
A +D VI+ GL QE E DR SLLLPG+Q SL+T+VA ++RPVILVL GGP+DV
Sbjct: 475 ASSSDQVIMFMGLSQDQEKEGLDRTSLLLPGKQQSLITAVASAARRPVILVLLTGGPVDV 534
Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMR 633
+FA+ + +I +ILW GYPG+AG A+A+++FGD NP GRLP+TWYPE FT++PM DM MR
Sbjct: 535 TFAKNNPKIGAILWAGYPGQAGGLAIAKVLFGDHNPSGRLPVTWYPEEFTRIPMTDMRMR 594
Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
AD + YPGRSYRFY G VY FG+GLSY+ ++ + ++A + +L AG I +
Sbjct: 595 ADPATGYPGRSYRFYQGNPVYKFGYGLSYSKFTRRLVAAAKPRRPNRNLLAGV---IPKP 651
Query: 694 TGSRLDYVHIDEV--TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT--PEKQ 749
G + H++E+ C L+F + V N G +DG H V++F + P + G P +Q
Sbjct: 652 AGDGGESYHVEEIGEEGCERLKFPATVEVHNHGPMDGKHSVLVFVQWPNATAGASRPARQ 711
Query: 750 LIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
L+GF H A ++ ++PCE LS A G +++ G+H L VGE
Sbjct: 712 LVGFSSQHVRAGEKARLTMEINPCEHLSRARDDGTKVIDRGSHFLKVGE 760
>gi|413925164|gb|AFW65096.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 829
Score = 742 bits (1916), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/803 (46%), Positives = 501/803 (62%), Gaps = 46/803 (5%)
Query: 14 ILFRVLQLIVVVNVIAFSNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLT 73
+L ++ L+V ++ + P F C PFCNT L + RA L+S +T
Sbjct: 58 LLAILMHLLVPALLLMRVVAVVAAGAPPFSCGGGPSLGLPFCNTKLPAAQRAADLVSRMT 117
Query: 74 LQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFN-GTVSSVTSFPQVLVSA 132
EK QL D A+ +PRLG+P+Y+WW+E+LHG+A +G G++ + G V S TSFPQVL++A
Sbjct: 118 PAEKASQLGDVANGVPRLGVPSYKWWNEALHGVAISGKGIHMDRGAVRSATSFPQVLLTA 177
Query: 133 ASFNRSLWSNIGSAVAVEARAMYNLGQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVV 191
ASFN +LW IG A EARA YN+GQA GLT W+PN+NIFRDPRWGRGQETPGEDP V
Sbjct: 178 ASFNDNLWFRIGQATGKEARAFYNIGQAEGLTMWSPNVNIFRDPRWGRGQETPGEDPAVA 237
Query: 192 SAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDL 251
S YA FV+ QG + L+ SACCKH AYDL
Sbjct: 238 SRYAAAFVRGLQGSS---------------------SNTKSVPPVLLTSACCKHATAYDL 276
Query: 252 EKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQ 311
E W +RYSF A +T QD DTF PPFRSC+ GKASC+MC+Y VNGVP+C DL
Sbjct: 277 EDWKGVTRYSFRATVTVQDLADTFNPPFRSCVVDGKASCVMCAYTSVNGVPSCANADLLT 336
Query: 312 KA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSA 370
K R WG GY+ +DCDAV+ + Q Y T ED+ A LKAG+DI+CG + +H +A
Sbjct: 337 KTFRGSWGLDGYVAADCDAVSIMRNSQFYRPTAEDTVATTLKAGLDIDCGPYVQQHAMAA 396
Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQ 430
I KGK+ ++D+D+A+ NLF+ ++RLG F+GDP+ YG LG +CT EHK LAL+AA
Sbjct: 397 IQKGKLTQQDVDKAVKNLFTTRMRLGHFDGDPKAHVYGNLGAAHICTQEHKNLALEAALD 456
Query: 431 GIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKT 490
GIVLLKN LPL + +V+S A+IG N++ + G Y G PC+P + L+G++ YV
Sbjct: 457 GIVLLKNSAGVLPLKRGSVASAAVIGHNANDVLALLGNYWGPPCAPTTPLQGIQGYVKNV 516
Query: 491 HYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLV 550
+ +GCH CN A +A +A +D VI+ GL QE+E +DR +LLLPG Q SL+
Sbjct: 517 RFLAGCHKAACNVAA-TPQAAALASTSDSVILFMGLSQEQESEGKDRTTLLLPGNQQSLI 575
Query: 551 TSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPG 610
T+VA +KRPVILVL GGP+D++FA+A+ +I +ILW GYPG+AG A+A+++FG+ NP
Sbjct: 576 TAVANAAKRPVILVLLTGGPVDITFAQANPKIGAILWAGYPGQAGGLAIAKVLFGEKNPS 635
Query: 611 GRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFL 670
GRLP+TWYPE FTKVPM DM MR+ S YPGRSYRFY G +Y FG+GLSY+ +S++ +
Sbjct: 636 GRLPVTWYPEEFTKVPMTDMRMRSAGS--YPGRSYRFYKGKTIYKFGYGLSYSKFSHRVV 693
Query: 671 SA------PSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTS--CTSLRFHVQISVTN 722
+A + L ++A A ++ N L Y H+D + C L+F + V N
Sbjct: 694 TARNNPAHNTTLLLAAGHAATTEDN--------LSY-HVDHIGDELCRQLKFLAVVKVQN 744
Query: 723 AGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
G +DG H ++F R P + G P +QL+GF H A + F V PCE S
Sbjct: 745 HGPMDGKHTALMFLRWPNATDGRPARQLVGFQSQHIKAGEKAHLRFEVSPCEDFSRVRDD 804
Query: 783 GRRILPLGNHVLMVGELRHSLTI 805
GR+++ G+H L VG +H L I
Sbjct: 805 GRKVIDKGSHFLKVG--KHELEI 825
>gi|297797477|ref|XP_002866623.1| beta-xylosidase 4 [Arabidopsis lyrata subsp. lyrata]
gi|297312458|gb|EFH42882.1| beta-xylosidase 4 [Arabidopsis lyrata subsp. lyrata]
Length = 784
Score = 742 bits (1915), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/779 (47%), Positives = 499/779 (64%), Gaps = 45/779 (5%)
Query: 38 NKPDFPC---KPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIP 94
+ P F C P +Y FCNT L I R L++ LTLQEKI L A+ + RLGIP
Sbjct: 40 SSPVFACDVAANPSLAAYGFCNTVLKIEYRVADLVARLTLQEKIGFLVSKANGVTRLGIP 99
Query: 95 AYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAM 154
YEWWSE+LHG++ GPG +F+ V TSFPQV+++AASFN SL+ IG V+ EARAM
Sbjct: 100 TYEWWSEALHGVSYIGPGTHFSSQVPGATSFPQVILTAASFNVSLFQAIGKVVSTEARAM 159
Query: 155 YNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGI 214
YN+G AGLT+W+PN+NIFRDPRWGRGQETPGEDP++ S YA +VK Q
Sbjct: 160 YNVGLAGLTYWSPNVNIFRDPRWGRGQETPGEDPLLASKYASGYVKGLQ----------- 208
Query: 215 GFGFREKRVLKGFGEESDRGD--ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTE 272
E+D GD L ++ACCKH AYD++ W RYSFNA++T+QD +
Sbjct: 209 ---------------ETDGGDSNRLKVAACCKHYTAYDVDNWKGVERYSFNAVVTQQDMD 253
Query: 273 DTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVA 331
DT+QPPF+SC+ G + +MCSYNQVNG P C DL R EW GYI SDCD+V
Sbjct: 254 DTYQPPFKSCVVDGNVASVMCSYNQVNGKPTCADPDLLSGVIRGEWKLNGYIVSDCDSVD 313
Query: 332 TIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSV 391
+++ Q+YTKT ++AA + AG+D+NCG+ + +HT+ A+ G V E ID+A+ N F
Sbjct: 314 VLYKNQHYTKTPAEAAAISILAGLDLNCGSFLGQHTEEAVKSGLVNEAAIDKAISNNFLT 373
Query: 392 QLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSS 451
+RLG F+G+P+ YG LGP DVCTS +++LA DAARQGIVLLKN FLPL+ ++ +
Sbjct: 374 LMRLGFFDGNPKNQIYGGLGPTDVCTSANQELAADAARQGIVLLKN-TGFLPLSPKSIKT 432
Query: 452 LAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAV 511
LA+IGP N M G Y G PC + L+GL VS T Y GC +V C + A A
Sbjct: 433 LAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAGAVSTT-YLPGCSNVAC-AVADVAGAT 490
Query: 512 RIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPL 571
++A AD +++ G D + E E RDRV L LPGQQ LV VA+ +K PV+LV+ GG
Sbjct: 491 KLAATADVTVLLIGADQSIEAESRDRVDLNLPGQQQELVIQVAKAAKGPVLLVIMSGGGF 550
Query: 572 DVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDM 630
D++FA+ D +I+ ILW+GYPGEAG A+A+IIFG +NP GRLPMTWYP+S+ KVPM M
Sbjct: 551 DITFAKNDPKIAGILWVGYPGEAGGIAIADIIFGRYNPSGRLPMTWYPQSYVEKVPMTIM 610
Query: 631 NMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNI 690
NMR D S+ YPGR+YRFYTG VY FG GLSYT +S+ + APS +++S + ++
Sbjct: 611 NMRPDKSKGYPGRTYRFYTGETVYAFGDGLSYTKFSHSLVKAPSLVSLSL-----EENHV 665
Query: 691 LQQTGSR-LDYV--HIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPE 747
+ + + LD + H + S F VQI V N GD +G H V LF P + G+P
Sbjct: 666 CRSSECQSLDAIGPHCENAVSGGGSAFEVQIKVRNGGDREGIHTVFLFTTPPAI-HGSPR 724
Query: 748 KQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
K L+GF+++ + F V+ C+ LS+ ++ G+R + LG H+L VG+L+HSL+I
Sbjct: 725 KHLLGFEKIRLGKMEEAVVRFKVEVCKDLSVVDEIGKRKIGLGKHLLHVGDLKHSLSIR 783
>gi|292630923|sp|A5JTQ3.1|XYL2_MEDVA RecName: Full=Beta-xylosidase/alpha-L-arabinofuranosidase 2;
AltName: Full=Xylan
1,4-beta-xylosidase/Alpha-N-arabinofuranosidase 2;
Short=MsXyl2; Includes: RecName: Full=Beta-xylosidase;
AltName: Full=1,4-beta-D-xylan xylohydrolase; AltName:
Full=Xylan 1,4-beta-xylosidase; Includes: RecName:
Full=Alpha-N-arabinofuranosidase; AltName:
Full=Alpha-L-arabinofuranosidase; Short=Arabinosidase;
Flags: Precursor
gi|146762263|gb|ABQ45228.1| beta-xylosidase/alpha-L-arabinosidase [Medicago sativa subsp. x
varia]
Length = 774
Score = 741 bits (1913), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 363/769 (47%), Positives = 495/769 (64%), Gaps = 40/769 (5%)
Query: 42 FPC---KPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEW 98
F C K P +Y FCN LS+ R K L+ LTLQEK+ L ++A + RLGIP YEW
Sbjct: 39 FACDVAKNPALANYGFCNKKLSVDARVKDLVRRLTLQEKVGNLVNSAVDVSRLGIPKYEW 98
Query: 99 WSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
WSE+LHG+++ GPG +F+ + TSFP ++ AASFN SL+ IG V+ EARAM+N+G
Sbjct: 99 WSEALHGVSNIGPGTHFSNVIPGATSFPMPILIAASFNASLFQTIGKVVSTEARAMHNVG 158
Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
AGLT+W+PNINIFRDPRWGRGQETPGEDP++ S YA +VK Q ++DDG
Sbjct: 159 LAGLTYWSPNINIFRDPRWGRGQETPGEDPLLASKYAAGYVKGLQ----QTDDG------ 208
Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
++L ++ACCKH AYD++ W RY+FNA++T+QD +DT+QPP
Sbjct: 209 --------------DSNKLKVAACCKHYTAYDVDDWKGVQRYTFNAVVTQQDLDDTYQPP 254
Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQ 337
F+SC+ G + +MCSYNQVNG P C DL + R +W GYI SDCD+V +F+ Q
Sbjct: 255 FKSCVIDGNVASVMCSYNQVNGKPTCADPDLLKGVIRGKWKLNGYIVSDCDSVDVLFKNQ 314
Query: 338 NYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
+YTKT E++AA + AG+D+NCG+ + R+T+ A+ +G + E I+ A+ N F+ +RLG
Sbjct: 315 HYTKTPEEAAAKSILAGLDLNCGSFLGRYTEGAVKQGLIGEASINNAVYNNFATLMRLGF 374
Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
F+GDP K YG LGP DVCTS +++LA +AARQGIVLLKN LPLN A+ SLA+IGP
Sbjct: 375 FDGDPSKQPYGNLGPKDVCTSANQELAREAARQGIVLLKNCAGSLPLNAKAIKSLAVIGP 434
Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKA 517
N M G Y GIPC S L+GL A V T +A+GC DV C ++A +A +IA A
Sbjct: 435 NANATRAMIGNYEGIPCKYTSPLQGLTALVP-TSFAAGCPDVQC-TNAALDDAKKIAASA 492
Query: 518 DFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAE 577
D ++V G +L E E DR+++LLPGQQ LVT VA +K PVIL + GG +DVSFA+
Sbjct: 493 DATVIVVGANLAIEAESHDRINILLPGQQQQLVTEVANVAKGPVILAIMSGGGMDVSFAK 552
Query: 578 ADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADS 636
+ +I+SILW+GYPGEAG A+A++IFG NP GRLPMTWYP+S+ KVPM +MNMR D
Sbjct: 553 TNKKITSILWVGYPGEAGGAAIADVIFGYHNPSGRLPMTWYPQSYVDKVPMTNMNMRPDP 612
Query: 637 SRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGS 696
+ YPGR+YRFY G V+ FG G+SY+ + +K + AP +++ + D
Sbjct: 613 ATGYPGRTYRFYKGETVFSFGDGISYSTFEHKLVKAPQLVSVPLA----EDHVCRSSKCK 668
Query: 697 RLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRV 756
LD V C +L F + + + N G + S V LF+ P V P+K L+ F++V
Sbjct: 669 SLDVVG----EHCQNLAFDIHLRIKNKGKMSSSQTVFLFSTPPAV-HNAPQKHLLAFEKV 723
Query: 757 HTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
K +SF VD C+ L + ++ G R + LG H+L VG+L+H L++
Sbjct: 724 LLTGKSEALVSFKVDVCKDLGLVDELGNRKVALGKHMLHVGDLKHPLSV 772
>gi|253761860|ref|XP_002489304.1| hypothetical protein SORBIDRAFT_0010s007570 [Sorghum bicolor]
gi|241946952|gb|EES20097.1| hypothetical protein SORBIDRAFT_0010s007570 [Sorghum bicolor]
Length = 750
Score = 740 bits (1910), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/792 (47%), Positives = 499/792 (63%), Gaps = 48/792 (6%)
Query: 22 IVVVNVIAFSNSKPVLNKPDFPCKPPHFD-SYPFCNTSLSISTRAKSLISLLTLQEKIQQ 80
++++ + FS++ V + P F C P +YPFC+ SL + RA L+S LT+ EK+ Q
Sbjct: 1 MLLLAAVFFSSA--VASDPLFSCGPSSPSRAYPFCDRSLPAARRAADLVSRLTVAEKVSQ 58
Query: 81 LSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLW 140
L D A+ +PRLG+P Y+WWSE LHG+A G G+ FNGTV+ VTSFPQVL++ ASF+ LW
Sbjct: 59 LGDEAAGVPRLGVPPYKWWSEGLHGLAFWGHGMRFNGTVTGVTSFPQVLLTTASFDDGLW 118
Query: 141 SNIGSAVAVEARAMYNLGQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFV 199
IG A+ EARA+YNLGQA GLT W+PN+NIFRDPRWGRGQETPGEDP V S YAV FV
Sbjct: 119 FRIGQAIGREARALYNLGQAEGLTIWSPNVNIFRDPRWGRGQETPGEDPAVASKYAVAFV 178
Query: 200 KSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSR 259
+ QG + L SACCKH AYDLE W +R
Sbjct: 179 RGIQGS-----------------------SAAGAAAPLQASACCKHATAYDLEDWNGVAR 215
Query: 260 YSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWG 318
Y+F+A +T QD DTF PPF+SC+ GKA+C+MC+Y +NGVPAC DL K R WG
Sbjct: 216 YNFDARVTAQDLADTFNPPFQSCVVDGKATCVMCAYTGINGVPACASSDLLTKTFRGAWG 275
Query: 319 FKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQE 378
GY++SDCDAVA + + Q Y T ED+ A LK H +AI +GK+ E
Sbjct: 276 HDGYVSSDCDAVAIMHDAQRYVPTPEDTVAVALK------------EHGMAAIQQGKMTE 323
Query: 379 KDIDRALLNLFSVQLRLGLFNGDPR-KGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKN 437
KD+D+AL NLF+V++RLG F+GDPR YG LG DVCT++HK LAL+AA+ GIVLLKN
Sbjct: 324 KDVDKALTNLFAVRMRLGHFDGDPRGNALYGHLGAADVCTADHKNLALEAAQDGIVLLKN 383
Query: 438 DKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCH 497
D LPL+++A+ S A+IG N+ + G Y G C + L+G+++YVS + +GC
Sbjct: 384 DAGILPLDRSAMGSAAVIGHNANDALVLRGNYFGPACETTTPLQGVQSYVSNVRFLAGCS 443
Query: 498 DVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTS 557
C A +A +A +++V + GL QE E DR SLLLPG+Q SL+T+VA +
Sbjct: 444 SAACGYAA-TGQAAALASSSEYVFLFMGLSQDQEKEGLDRTSLLLPGKQQSLITAVASAA 502
Query: 558 KRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTW 617
KRPVILVL GGP+D++FA+++ +I +ILW GYPG+AG A+A ++FGD NP GRLP+TW
Sbjct: 503 KRPVILVLLTGGPVDITFAQSNPKIGAILWAGYPGQAGGLAIARVLFGDHNPSGRLPVTW 562
Query: 618 YPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELT 677
YPE FTKVPM DM MRAD + YPGRSYRFY G +Y FG+GLSY+ +S + ++
Sbjct: 563 YPEEFTKVPMTDMRMRADPANGYPGRSYRFYRGNTIYKFGYGLSYSKFSRQLVTGGKNQL 622
Query: 678 ISASLKAGSDKNILQQTGSRLDYVHIDEVTS--CTSLRFHVQISVTNAGDVDGSHVVMLF 735
ASL AG + Y H+D++ + C LRF ++ V N G +DG H V++F
Sbjct: 623 --ASLLAGLSATTKDDDAT--SYYHVDDIGADGCEQLRFPAEVEVQNHGPMDGKHSVLMF 678
Query: 736 ARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLM 795
R P + G P QLIGF H A + F V PCE S A G++++ G+H LM
Sbjct: 679 LRWPNATDGRPVSQLIGFTSQHIKAGEKANVRFDVRPCEHFSRARADGKKVIDRGSHFLM 738
Query: 796 VGELRHSLTIET 807
VG+ ++ E
Sbjct: 739 VGKEEVEVSFEA 750
>gi|242077366|ref|XP_002448619.1| hypothetical protein SORBIDRAFT_06g030270 [Sorghum bicolor]
gi|241939802|gb|EES12947.1| hypothetical protein SORBIDRAFT_06g030270 [Sorghum bicolor]
Length = 767
Score = 739 bits (1908), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/770 (48%), Positives = 503/770 (65%), Gaps = 40/770 (5%)
Query: 40 PDFPCKPPH--FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYE 97
P F C + SY FCN S S S RA L+S LTL EK+ L D +A+PRLGIP YE
Sbjct: 32 PVFACDASNATLASYGFCNRSASASARAADLVSRLTLAEKVGFLVDKQAALPRLGIPLYE 91
Query: 98 WWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL 157
WWSE+LHG++ GPG F+ V + TSFPQ +++AASFN +L+ IG V+ EARAM+N+
Sbjct: 92 WWSEALHGVSYVGPGTRFSSLVPAATSFPQPILTAASFNATLFRAIGEVVSNEARAMHNV 151
Query: 158 GQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFG 217
G AGLTFW+PNINIFRDPRWGRGQETPGEDP++ S YAV +V Q D G G G
Sbjct: 152 GLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLTSKYAVGYVTGLQ-------DAGSGSG 204
Query: 218 FREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQP 277
L ++ACCKH AYD++ W RY+FNA++++QD +DTFQP
Sbjct: 205 ------------------SLKVAACCKHYTAYDVDNWKGVERYTFNAVVSQQDLDDTFQP 246
Query: 278 PFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEY 336
PF+SC+ G + +MCSYNQVNG P C DL R +W GYI+SDCD+V ++
Sbjct: 247 PFKSCVVDGNVASVMCSYNQVNGKPTCADKDLLSGVIRGDWKLNGYISSDCDSVDVLYNN 306
Query: 337 QNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
Q+YTKT ED+AA +KAG+D+NCG + +HT +A+ GK+ E D+DRA+ N F +RLG
Sbjct: 307 QHYTKTPEDAAAISIKAGLDLNCGNFLAQHTVAAVQAGKLSESDVDRAITNNFITLMRLG 366
Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIG 456
F+GDPRK +G LGP DVCTS +++LA +AARQGIVLLKN LPL+ +++ SLA+IG
Sbjct: 367 FFDGDPRKLPFGNLGPSDVCTSSNQELAREAARQGIVLLKNSGA-LPLSASSIKSLAVIG 425
Query: 457 PLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDA-GFHEAVRIAK 515
P N M G Y G PC + L+GL A V+ T Y GC +V C+ ++ A + A
Sbjct: 426 PNANASFTMIGNYEGTPCKYTTPLQGLGANVA-TVYQPGCTNVGCSGNSLQLDAATKAAA 484
Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
AD ++V G D + E E DR SLLLPGQQ LV++VA S+ P ILV+ GGP D+SF
Sbjct: 485 SADVTVLVVGADQSIERESLDRTSLLLPGQQPQLVSAVANASRGPCILVIMSGGPFDISF 544
Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRAD 635
A++ +I++ILW+GYPGEAG A+A+++FG NP GRLP+TWYPESFTKVPM DM MR D
Sbjct: 545 AKSSDKIAAILWVGYPGEAGGAAIADVLFGHHNPSGRLPVTWYPESFTKVPMIDMRMRPD 604
Query: 636 SSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTG 695
+S YPGR+YRFYTG VY FG GLSYT++++ +SAP ++ +L+ L +
Sbjct: 605 ASTGYPGRTYRFYTGDTVYAFGDGLSYTSFAHHLVSAPKQV----ALQLAEGHTCLTEQC 660
Query: 696 SRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDR 755
++ E C L F V + V NAGD+ G+H V LF+ P V P K L+GF++
Sbjct: 661 PSVEA----EGAHCEGLAFDVHLRVRNAGDMSGAHTVFLFSSPPAV-HNAPAKHLLGFEK 715
Query: 756 VHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
V + ++F VD C+ LS+ ++ G R + LGNH L VG+L+H+L++
Sbjct: 716 VSLEPGQAGVVAFKVDVCKDLSVVDELGNRKVALGNHTLHVGDLKHTLSL 765
>gi|356501877|ref|XP_003519750.1| PREDICTED: probable beta-D-xylosidase 2-like [Glycine max]
Length = 772
Score = 738 bits (1906), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 357/772 (46%), Positives = 490/772 (63%), Gaps = 43/772 (5%)
Query: 42 FPCKPPHF--DSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWW 99
F C P + + PFC SL+ R K LI LTLQEK+ L +NA+A+PRLGI YEWW
Sbjct: 28 FACDPKNTATKNLPFCKASLATGARVKDLIGRLTLQEKVNLLVNNAAAVPRLGIKGYEWW 87
Query: 100 SESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ 159
SE+LHG+++ GPG F G + TSFPQV+ +AASFN SLW IG + EARAMYN G
Sbjct: 88 SEALHGVSNVGPGTKFGGQFPAATSFPQVITTAASFNASLWEAIGRVASDEARAMYNGGT 147
Query: 160 AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFR 219
AGLT+W+PN+NIFRDPRWGRGQETPGEDP++ YA +V+ QG +
Sbjct: 148 AGLTYWSPNVNIFRDPRWGRGQETPGEDPILAGKYAASYVRGLQGTD------------- 194
Query: 220 EKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPF 279
G+ L ++A CKH AYDL+ W R+ FNA +++QD EDTF PF
Sbjct: 195 --------------GNRLKVAASCKHFTAYDLDNWNGVDRFHFNAQVSKQDIEDTFNVPF 240
Query: 280 RSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQN 338
R C+++GK + +MCSYNQVNGVP C L ++ R +WG GYI SDCD+V + Q+
Sbjct: 241 RMCVKEGKVASVMCSYNQVNGVPTCADPILLKRTVRGQWGLNGYIVSDCDSVGVFYNSQH 300
Query: 339 YTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
YT T E++AA +KAG+D++CG + +HTQ+A+ KG + E D++ ALLN +VQ+RLG++
Sbjct: 301 YTSTPEEAAADAIKAGLDLDCGPFLGQHTQNAVKKGLISEADVNGALLNTLTVQMRLGMY 360
Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
+G+P Y LGP DVCT H++LAL+AARQGIVLLKN LPL+ ++A+IGP
Sbjct: 361 DGEPSSHPYNNLGPRDVCTQSHQELALEAARQGIVLLKNKGPSLPLSTRRGRTVAVIGPN 420
Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKAD 518
N M G Y GI C S L+G+ Y +KT Y GC +V C D F A+ A++AD
Sbjct: 421 SNVTFTMIGNYAGIACGYTSPLQGIGTY-TKTIYEHGCANVACTDDKQFGRAINAAQQAD 479
Query: 519 FVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEA 578
++V GLD + E E DR SLLLPG Q LV+ VA SK P ILV+ GGP+D++FA+
Sbjct: 480 ATVLVMGLDQSIEAETVDRASLLLPGHQQDLVSKVAAASKGPTILVIMSGGPVDITFAKN 539
Query: 579 DSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK-VPMNDMNMRADSS 637
D +I ILW GYPG+AG A+A+I+FG NPGG+LPMTWYP+ + K +PM +M MRA S
Sbjct: 540 DPRIQGILWAGYPGQAGGAAIADILFGTSNPGGKLPMTWYPQGYIKNLPMTNMAMRASRS 599
Query: 638 RQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISA-SLKAGSDKNILQQTGS 696
+ YPGR+YRFY G VY FG+GLSYT++ + SAP ++I + G+ NI +
Sbjct: 600 KGYPGRTYRFYNGPVVYPFGYGLSYTHFVHTLTSAPKLVSIPVDGHRHGNSSNIANKA-- 657
Query: 697 RLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQG--TPEKQLIGFD 754
+ H C L ++ + V N G DG H +++F+ P G P KQL+ F+
Sbjct: 658 -IKVTH----ARCGKLSINLHVDVKNVGSKDGIHTLLVFS-APPAGNGHWAPHKQLVAFE 711
Query: 755 RVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
+VH AK + + + C+ LS+ ++ G R +P+G H L +G+++HS++++
Sbjct: 712 KVHIPAKAQQRVRVKIHVCKLLSVVDRSGTRRIPMGLHSLHIGDVKHSVSLQ 763
>gi|356556038|ref|XP_003546334.1| PREDICTED: beta-D-xylosidase 1-like [Glycine max]
Length = 775
Score = 738 bits (1905), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 354/770 (45%), Positives = 500/770 (64%), Gaps = 37/770 (4%)
Query: 42 FPCKPPH--FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWW 99
F C P + + FCNT + I R + LI+ LTL EKI+ + +NA A+PRLGI YEWW
Sbjct: 37 FACDPRNGLTRGFKFCNTHVPIHVRVQDLIARLTLPEKIRLVVNNAIAVPRLGIQGYEWW 96
Query: 100 SESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ 159
SE+LHG+++ GPG F G T FPQV+ +AASFN+SLW IG V+ EARAMYN GQ
Sbjct: 97 SEALHGVSNVGPGTKFGGAFPGATMFPQVISTAASFNQSLWQEIGRVVSDEARAMYNGGQ 156
Query: 160 AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFR 219
AGLT+W+PN+NIFRDPRWGRGQETPGEDP + + YA +VK QG++
Sbjct: 157 AGLTYWSPNVNIFRDPRWGRGQETPGEDPTLAAKYAASYVKGLQGDS------------- 203
Query: 220 EKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPF 279
G+ L ++ACCKH AYDL+ W R+ FNA +++QD EDT+ PF
Sbjct: 204 -------------AGNHLKVAACCKHYTAYDLDNWNGVDRFHFNAKVSKQDLEDTYDVPF 250
Query: 280 RSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQN 338
++C+ +G+ + +MCSYNQVNG P C DL + R +W GYI SDCD+V F+ Q+
Sbjct: 251 KACVLEGQVASVMCSYNQVNGKPTCADPDLLRNTIRGQWRLNGYIVSDCDSVGVFFDNQH 310
Query: 339 YTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
YTKT E++AA +KAG+D++CG + HT SAI KG + E D++ AL NL SVQ+RLG+F
Sbjct: 311 YTKTPEEAAAEAIKAGLDLDCGPFLAIHTDSAIRKGLISENDLNLALANLISVQMRLGMF 370
Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
+G+P YG LGP DVCTS H++LAL+AAR+ IVLL+N LPL+ + + ++ ++GP
Sbjct: 371 DGEPSTQPYGNLGPRDVCTSAHQQLALEAARESIVLLQNKGNSLPLSPSRLRTIGVVGPN 430
Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKAD 518
+ M G Y G+ C + L+G+ YV KT + GC V C + F A IA++AD
Sbjct: 431 ADATVTMIGNYAGVACGYTTPLQGIARYV-KTAHQVGCRGVACRGNELFGAAETIARQAD 489
Query: 519 FVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEA 578
+++V GLD T E E RDRV LLLPG Q LVT VAR +K PVIL++ GGP+D+SFA+
Sbjct: 490 AIVLVMGLDQTVEAETRDRVGLLLPGLQQELVTRVARAAKGPVILLIMSGGPVDISFAKN 549
Query: 579 DSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSS 637
D +IS+ILW+GYPG+AG A+A++IFG NPGGRLPMTWYP+ + KVPM +M+MR + +
Sbjct: 550 DPKISAILWVGYPGQAGGTAIADVIFGTTNPGGRLPMTWYPQGYLAKVPMTNMDMRPNPT 609
Query: 638 RQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSR 697
YPGR+YRFY G V+ FGHGLSY+ +S+ AP ++++ + L +
Sbjct: 610 TGYPGRTYRFYKGPVVFPFGHGLSYSRFSHSLALAPKQVSVPIMSLQALTNSTLSSKAVK 669
Query: 698 LDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVH 757
+ + + D+ SL + V N G +DG+H +++F++ P + + KQL+GF + H
Sbjct: 670 VSHANCDD-----SLEMEFHVDVKNEGSMDGTHTLLIFSQPPH-GKWSQIKQLVGFHKTH 723
Query: 758 TVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIET 807
+A + + GV C+ LS+ ++ G R +P G H L +G+++HS++++T
Sbjct: 724 VLAGSKQRVKVGVHVCKHLSVVDQFGVRRIPTGEHELHIGDVKHSISVQT 773
>gi|413925166|gb|AFW65098.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 830
Score = 738 bits (1904), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/804 (46%), Positives = 501/804 (62%), Gaps = 47/804 (5%)
Query: 14 ILFRVLQLIVVVNVIAFSNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLT 73
+L ++ L+V ++ + P F C PFCNT L + RA L+S +T
Sbjct: 58 LLAILMHLLVPALLLMRVVAVVAAGAPPFSCGGGPSLGLPFCNTKLPAAQRAADLVSRMT 117
Query: 74 LQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFN-GTVSSVTSFPQVLVSA 132
EK QL D A+ +PRLG+P+Y+WW+E+LHG+A +G G++ + G V S TSFPQVL++A
Sbjct: 118 PAEKASQLGDVANGVPRLGVPSYKWWNEALHGVAISGKGIHMDRGAVRSATSFPQVLLTA 177
Query: 133 ASFNRSLWSNIGSAVAVEARAMYNLGQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVV 191
ASFN +LW IG A EARA YN+GQA GLT W+PN+NIFRDPRWGRGQETPGEDP V
Sbjct: 178 ASFNDNLWFRIGQATGKEARAFYNIGQAEGLTMWSPNVNIFRDPRWGRGQETPGEDPAVA 237
Query: 192 SAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDL 251
S YA FV+ QG + L+ SACCKH AYDL
Sbjct: 238 SRYAAAFVRGLQGSS---------------------SNTKSVPPVLLTSACCKHATAYDL 276
Query: 252 EKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQ 311
E W +RYSF A +T QD DTF PPFRSC+ GKASC+MC+Y VNGVP+C DL
Sbjct: 277 EDWKGVTRYSFRATVTVQDLADTFNPPFRSCVVDGKASCVMCAYTSVNGVPSCANADLLT 336
Query: 312 KA-RNEWGFKG-YITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQS 369
K R WG G Y+ +DCDAV+ + Q Y T ED+ A LKAG+DI+CG + +H +
Sbjct: 337 KTFRGSWGLDGRYVAADCDAVSIMRNSQFYRPTAEDTVATTLKAGLDIDCGPYVQQHAMA 396
Query: 370 AIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAAR 429
AI KGK+ ++D+D+A+ NLF+ ++RLG F+GDP+ YG LG +CT EHK LAL+AA
Sbjct: 397 AIQKGKLTQQDVDKAVKNLFTTRMRLGHFDGDPKAHVYGNLGAAHICTQEHKNLALEAAL 456
Query: 430 QGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSK 489
GIVLLKN LPL + +V+S A+IG N++ + G Y G PC+P + L+G++ YV
Sbjct: 457 DGIVLLKNSAGVLPLKRGSVASAAVIGHNANDVLALLGNYWGPPCAPTTPLQGIQGYVKN 516
Query: 490 THYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSL 549
+ +GCH CN A +A +A +D VI+ GL QE+E +DR +LLLPG Q SL
Sbjct: 517 VRFLAGCHKAACNVAA-TPQAAALASTSDSVILFMGLSQEQESEGKDRTTLLLPGNQQSL 575
Query: 550 VTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNP 609
+T+VA +KRPVILVL GGP+D++FA+A+ +I +ILW GYPG+AG A+A+++FG+ NP
Sbjct: 576 ITAVANAAKRPVILVLLTGGPVDITFAQANPKIGAILWAGYPGQAGGLAIAKVLFGEKNP 635
Query: 610 GGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKF 669
GRLP+TWYPE FTKVPM DM MR+ S YPGRSYRFY G +Y FG+GLSY+ +S++
Sbjct: 636 SGRLPVTWYPEEFTKVPMTDMRMRSAGS--YPGRSYRFYKGKTIYKFGYGLSYSKFSHRV 693
Query: 670 LSA------PSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTS--CTSLRFHVQISVT 721
++A + L ++A A ++ N L Y H+D + C L+F + V
Sbjct: 694 VTARNNPAHNTTLLLAAGHAATTEDN--------LSY-HVDHIGDELCRQLKFLAVVKVQ 744
Query: 722 NAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANK 781
N G +DG H ++F R P + G P +QL+GF H A + F V PCE S
Sbjct: 745 NHGPMDGKHTALMFLRWPNATDGRPARQLVGFQSQHIKAGEKAHLRFEVSPCEDFSRVRD 804
Query: 782 HGRRILPLGNHVLMVGELRHSLTI 805
GR+++ G+H L VG +H L I
Sbjct: 805 DGRKVIDKGSHFLKVG--KHELEI 826
>gi|15237736|ref|NP_201262.1| beta-D-xylosidase 4 [Arabidopsis thaliana]
gi|75262663|sp|Q9FLG1.1|BXL4_ARATH RecName: Full=Beta-D-xylosidase 4; Short=AtBXL4; Flags: Precursor
gi|10178060|dbj|BAB11424.1| beta-xylosidase [Arabidopsis thaliana]
gi|332010539|gb|AED97922.1| beta-D-xylosidase 4 [Arabidopsis thaliana]
Length = 784
Score = 736 bits (1900), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 375/797 (47%), Positives = 503/797 (63%), Gaps = 47/797 (5%)
Query: 21 LIVVVNVIAFSNSKPVLNKPDFPC---KPPHFDSYPFCNTSLSISTRAKSLISLLTLQEK 77
LI + + F N + P F C P +Y FCNT L I R L++ LTLQEK
Sbjct: 23 LIFLCFFLYFLNFSNAQSSPVFACDVAANPSLAAYGFCNTVLKIEYRVADLVARLTLQEK 82
Query: 78 IQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNR 137
I L A+ + RLGIP YEWWSE+LHG++ GPG +F+ V TSFPQV+++AASFN
Sbjct: 83 IGFLVSKANGVTRLGIPTYEWWSEALHGVSYIGPGTHFSSQVPGATSFPQVILTAASFNV 142
Query: 138 SLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVE 197
SL+ IG V+ EARAMYN+G AGLT+W+PN+NIFRDPRWGRGQETPGEDP++ S YA
Sbjct: 143 SLFQAIGKVVSTEARAMYNVGLAGLTYWSPNVNIFRDPRWGRGQETPGEDPLLASKYASG 202
Query: 198 FVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD--ELMLSACCKHLIAYDLEKWG 255
+VK Q E+D GD L ++ACCKH AYD++ W
Sbjct: 203 YVKGLQ--------------------------ETDGGDSNRLKVAACCKHYTAYDVDNWK 236
Query: 256 NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-R 314
RYSFNA++T+QD +DT+QPPF+SC+ G + +MCSYNQVNG P C DL R
Sbjct: 237 GVERYSFNAVVTQQDMDDTYQPPFKSCVVDGNVASVMCSYNQVNGKPTCADPDLLSGVIR 296
Query: 315 NEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKG 374
EW GYI SDCD+V +++ Q+YTKT ++AA + AG+D+NCG+ + +HT+ A+ G
Sbjct: 297 GEWKLNGYIVSDCDSVDVLYKNQHYTKTPAEAAAISILAGLDLNCGSFLGQHTEEAVKSG 356
Query: 375 KVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVL 434
V E ID+A+ N F +RLG F+G+P+ YG LGP DVCTS +++LA DAARQGIVL
Sbjct: 357 LVNEAAIDKAISNNFLTLMRLGFFDGNPKNQIYGGLGPTDVCTSANQELAADAARQGIVL 416
Query: 435 LKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYAS 494
LKN LPL+ ++ +LA+IGP N M G Y G PC + L+GL VS T Y
Sbjct: 417 LKN-TGCLPLSPKSIKTLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAGTVSTT-YLP 474
Query: 495 GCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVA 554
GC +V C + A A ++A AD ++V G D + E E RDRV L LPGQQ LV VA
Sbjct: 475 GCSNVAC-AVADVAGATKLAATADVSVLVIGADQSIEAESRDRVDLHLPGQQQELVIQVA 533
Query: 555 RTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLP 614
+ +K PV+LV+ GG D++FA+ D +I+ ILW+GYPGEAG A+A+IIFG +NP G+LP
Sbjct: 534 KAAKGPVLLVIMSGGGFDITFAKNDPKIAGILWVGYPGEAGGIAIADIIFGRYNPSGKLP 593
Query: 615 MTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAP 673
MTWYP+S+ KVPM MNMR D + YPGR+YRFYTG VY FG GLSYT +S+ + AP
Sbjct: 594 MTWYPQSYVEKVPMTIMNMRPDKASGYPGRTYRFYTGETVYAFGDGLSYTKFSHTLVKAP 653
Query: 674 SELTISASLKAGSDKNILQQTGS--RLDYV--HIDEVTSCTSLRFHVQISVTNAGDVDGS 729
S +++ G ++N + ++ LD + H + S F V I V N GD +G
Sbjct: 654 SLVSL------GLEENHVCRSSECQSLDAIGPHCENAVSGGGSAFEVHIKVRNGGDREGI 707
Query: 730 HVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPL 789
H V LF P + G+P K L+GF+++ + + F V+ C+ LS+ ++ G+R + L
Sbjct: 708 HTVFLFTTPPAI-HGSPRKHLVGFEKIRLGKREEAVVRFKVEICKDLSVVDEIGKRKIGL 766
Query: 790 GNHVLMVGELRHSLTIE 806
G H+L VG+L+HSL+I
Sbjct: 767 GKHLLHVGDLKHSLSIR 783
>gi|356503923|ref|XP_003520749.1| PREDICTED: probable beta-D-xylosidase 2-like [Glycine max]
Length = 775
Score = 734 bits (1896), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 353/796 (44%), Positives = 500/796 (62%), Gaps = 39/796 (4%)
Query: 16 FRVLQLIVVVNVIAFSNSKPVLNKPDFPCKPPH--FDSYPFCNTSLSISTRAKSLISLLT 73
F L ++ V ++ F + F C P + ++ PFC SL+I R K L+ LT
Sbjct: 5 FSPLLNLIAVFLLLFLVRHTCEARDPFACDPKNGATENMPFCKASLAIPERVKDLVGRLT 64
Query: 74 LQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAA 133
LQEK++ L +NA+A+PRLG+ YEWWSE+LHG+++ GPGV FN TSFPQV+ +AA
Sbjct: 65 LQEKVRLLVNNAAAVPRLGMKGYEWWSEALHGVSNVGPGVKFNAQFPGATSFPQVITTAA 124
Query: 134 SFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSA 193
SFN SLW IG V+ EARAMYN G AGLT+W+PN+NIFRDPRWGRGQETPGEDP++
Sbjct: 125 SFNASLWEAIGQVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLAGT 184
Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
YA +V+ QG + G+ L ++ACCKH AYDL+
Sbjct: 185 YAASYVRGLQGTD---------------------------GNRLKVAACCKHFTAYDLDN 217
Query: 254 WGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK- 312
W R+ FNA +++QD E+TF PFR C+ +GK + +MCSYNQVNGVP C +L +K
Sbjct: 218 WNGMDRFHFNAQVSKQDIEETFDVPFRMCVSEGKVASVMCSYNQVNGVPTCADPNLLKKT 277
Query: 313 ARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAID 372
R W GYI SDCD+V ++ Q+YT T E++AA +KAG+D++CG + HTQ+A++
Sbjct: 278 VRGLWQLDGYIVSDCDSVGVFYDNQHYTPTPEEAAADAIKAGLDLDCGPFLAVHTQNAVE 337
Query: 373 KGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGI 432
KG + E D++ AL+N +VQ+RLG+F+G+P YGKLGP DVC H++LAL+AARQGI
Sbjct: 338 KGLLSEADVNGALVNTLTVQMRLGMFDGEPSAHAYGKLGPKDVCKPAHQELALEAARQGI 397
Query: 433 VLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHY 492
VLLKN LPL+ ++A+IGP M G Y G+ C + L+G+ Y +KT +
Sbjct: 398 VLLKNTGPVLPLSPQRHHTVAVIGPNSKATVTMIGNYAGVACGYTNPLQGIGRY-AKTIH 456
Query: 493 ASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTS 552
GC +V C +D F A+ A++AD ++V GLD + E E DR LLLPG+Q LV+
Sbjct: 457 QLGCENVACKNDKLFGSAINAARQADATVLVMGLDQSIEAETVDRTGLLLPGRQQDLVSK 516
Query: 553 VARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGR 612
VA SK P ILV+ GG +D++FA+ + +I ILW GYPG+AG A+A+I+FG NPGG+
Sbjct: 517 VAAASKGPTILVIMSGGSVDITFAKNNPRIVGILWAGYPGQAGGAAIADILFGTTNPGGK 576
Query: 613 LPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLS 671
LP+TWYP+ + TK+PM +M MR S YPGR+YRFY G VY FGHGL+YT++ + S
Sbjct: 577 LPVTWYPQEYLTKLPMTNMAMRGSKSAGYPGRTYRFYNGPVVYPFGHGLTYTHFVHTLAS 636
Query: 672 APSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHV 731
AP+ +S L N+ + + H C L +++ + N G DG+H
Sbjct: 637 APT--VVSVPLNGHRRANVTNISNRAIRVTH----ARCDKLSISLEVDIKNVGSRDGTHT 690
Query: 732 VMLFARVPK-VSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLG 790
+++F+ P EKQL+ F+++H AKG + + + C+ LS+ +K G R +PLG
Sbjct: 691 LLVFSAPPAGFGHWALEKQLVAFEKIHVPAKGLQRVGVNIHVCKLLSVVDKSGIRRIPLG 750
Query: 791 NHVLMVGELRHSLTIE 806
H +G+++HS++++
Sbjct: 751 EHSFNIGDVKHSVSLQ 766
>gi|9294427|dbj|BAB02547.1| beta-1,4-xylosidase [Arabidopsis thaliana]
Length = 876
Score = 734 bits (1896), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 367/806 (45%), Positives = 507/806 (62%), Gaps = 54/806 (6%)
Query: 16 FRVLQLIVVVNVIAFSNSKPVLNKPDFPC--KPPHFDSYPFCNTSLSISTRAKSLISLLT 73
F L L+++ V + S+ +F C P Y FCN SLS RAK L+S L+
Sbjct: 6 FVRLSLLIIALVSSLCESQK-----NFACDISAPATAKYGFCNVSLSYEARAKDLVSRLS 60
Query: 74 LQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAA 133
L+EK+QQL + A+ +PRLG+P YEWWSE+LHG++ GPGV+FNGTV TSFP +++AA
Sbjct: 61 LKEKVQQLVNKATGVPRLGVPPYEWWSEALHGVSDVGPGVHFNGTVPGATSFPATILTAA 120
Query: 134 SFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSA 193
SFN SLW +G V+ EARAM+N+G AGLT+W+PN+N+FRDPRWGRGQETPGEDP+VVS
Sbjct: 121 SFNTSLWLKMGEVVSTEARAMHNVGLAGLTYWSPNVNVFRDPRWGRGQETPGEDPLVVSK 180
Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
YAV +VK Q + ++ + L +S+CCKH AYDL+
Sbjct: 181 YAVNYVKGLQDVH-----------------------DAGKSRRLKVSSCCKHYTAYDLDN 217
Query: 254 WGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA 313
W R+ F+A +T+QD EDT+Q PF+SC+E+G S +MCSYN+VNG+P C +L +
Sbjct: 218 WKGIDRFHFDAKVTKQDLEDTYQTPFKSCVEEGDVSSVMCSYNRVNGIPTCADPNLLRGV 277
Query: 314 -RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAID 372
R +W GYI SDCD++ F +YTKT ED+ A LKAG+++NCG + ++T++A+
Sbjct: 278 IRGQWRLDGYIVSDCDSIQVYFNDIHYTKTREDAVALALKAGLNMNCGDFLGKYTENAVK 337
Query: 373 KGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGI 432
K+ D+D AL+ + V +RLG F+GDP+ +G LGP DVC+ +H+ LAL+AA+QGI
Sbjct: 338 LKKLNGSDVDEALIYNYIVLMRLGFFDGDPKSLPFGNLGPSDVCSKDHQMLALEAAKQGI 397
Query: 433 VLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV-SKTH 491
VLL+N + LPL K V LA+IGP N M Y G+PC S ++GL+ YV K
Sbjct: 398 VLLEN-RGDLPLPKTTVKKLAVIGPNANATKVMISNYAGVPCKYTSPIQGLQKYVPEKIV 456
Query: 492 YASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVT 551
Y GC DV C AV+ +AD ++V GLD T E E DRV+L LPG Q LV
Sbjct: 457 YEPGCKDVKCGDQTLISAAVKAVSEADVTVLVVGLDQTVEAEGLDRVNLTLPGYQEKLVR 516
Query: 552 SVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGG 611
VA +K+ V+LV+ GP+D+SFA+ S I ++LW+GYPGEAG A+A++IFGD+NP G
Sbjct: 517 DVANAAKKTVVLVIMSAGPIDISFAKNLSTIRAVLWVGYPGEAGGDAIAQVIFGDYNPSG 576
Query: 612 RLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFL 670
RLP TWYP+ F KV M DMNMR +S+ +PGRSYRFYTG +Y FG+GLSY+++S L
Sbjct: 577 RLPETWYPQEFADKVAMTDMNMRPNSTSGFPGRSYRFYTGKPIYKFGYGLSYSSFSTFVL 636
Query: 671 SAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSH 730
SAPS + I K N+ + T + V+ C L+ + I V N G GSH
Sbjct: 637 SAPSIIHI----KTNPIMNLNKTTSVDISTVN------CHDLKIRIVIGVKNHGLRSGSH 686
Query: 731 VVMLFARVPKVSQ-----GTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRR 785
VV++F + PK S+ G P QL+GF+RV +++ + D C+ LS+ + HG+R
Sbjct: 687 VVLVFWKPPKCSKSLVGGGVPLTQLVGFERVEVGRSMTEKFTVDFDVCKALSLVDTHGKR 746
Query: 786 ILPLGNHVLMVG-----ELRHSLTIE 806
L G+H L++G ++ H L +
Sbjct: 747 KLVTGHHKLVIGSNSDQQIYHHLNVR 772
>gi|15230897|ref|NP_188596.1| putative beta-D-xylosidase 5 [Arabidopsis thaliana]
gi|259585724|sp|Q9LJN4.2|BXL5_ARATH RecName: Full=Probable beta-D-xylosidase 5; Short=AtBXL5; Flags:
Precursor
gi|332642747|gb|AEE76268.1| putative beta-D-xylosidase 5 [Arabidopsis thaliana]
Length = 781
Score = 734 bits (1896), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 367/806 (45%), Positives = 507/806 (62%), Gaps = 54/806 (6%)
Query: 16 FRVLQLIVVVNVIAFSNSKPVLNKPDFPC--KPPHFDSYPFCNTSLSISTRAKSLISLLT 73
F L L+++ V + S+ +F C P Y FCN SLS RAK L+S L+
Sbjct: 6 FVRLSLLIIALVSSLCESQK-----NFACDISAPATAKYGFCNVSLSYEARAKDLVSRLS 60
Query: 74 LQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAA 133
L+EK+QQL + A+ +PRLG+P YEWWSE+LHG++ GPGV+FNGTV TSFP +++AA
Sbjct: 61 LKEKVQQLVNKATGVPRLGVPPYEWWSEALHGVSDVGPGVHFNGTVPGATSFPATILTAA 120
Query: 134 SFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSA 193
SFN SLW +G V+ EARAM+N+G AGLT+W+PN+N+FRDPRWGRGQETPGEDP+VVS
Sbjct: 121 SFNTSLWLKMGEVVSTEARAMHNVGLAGLTYWSPNVNVFRDPRWGRGQETPGEDPLVVSK 180
Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
YAV +VK Q + ++ + L +S+CCKH AYDL+
Sbjct: 181 YAVNYVKGLQDVH-----------------------DAGKSRRLKVSSCCKHYTAYDLDN 217
Query: 254 WGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA 313
W R+ F+A +T+QD EDT+Q PF+SC+E+G S +MCSYN+VNG+P C +L +
Sbjct: 218 WKGIDRFHFDAKVTKQDLEDTYQTPFKSCVEEGDVSSVMCSYNRVNGIPTCADPNLLRGV 277
Query: 314 -RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAID 372
R +W GYI SDCD++ F +YTKT ED+ A LKAG+++NCG + ++T++A+
Sbjct: 278 IRGQWRLDGYIVSDCDSIQVYFNDIHYTKTREDAVALALKAGLNMNCGDFLGKYTENAVK 337
Query: 373 KGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGI 432
K+ D+D AL+ + V +RLG F+GDP+ +G LGP DVC+ +H+ LAL+AA+QGI
Sbjct: 338 LKKLNGSDVDEALIYNYIVLMRLGFFDGDPKSLPFGNLGPSDVCSKDHQMLALEAAKQGI 397
Query: 433 VLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV-SKTH 491
VLL+N + LPL K V LA+IGP N M Y G+PC S ++GL+ YV K
Sbjct: 398 VLLEN-RGDLPLPKTTVKKLAVIGPNANATKVMISNYAGVPCKYTSPIQGLQKYVPEKIV 456
Query: 492 YASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVT 551
Y GC DV C AV+ +AD ++V GLD T E E DRV+L LPG Q LV
Sbjct: 457 YEPGCKDVKCGDQTLISAAVKAVSEADVTVLVVGLDQTVEAEGLDRVNLTLPGYQEKLVR 516
Query: 552 SVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGG 611
VA +K+ V+LV+ GP+D+SFA+ S I ++LW+GYPGEAG A+A++IFGD+NP G
Sbjct: 517 DVANAAKKTVVLVIMSAGPIDISFAKNLSTIRAVLWVGYPGEAGGDAIAQVIFGDYNPSG 576
Query: 612 RLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFL 670
RLP TWYP+ F KV M DMNMR +S+ +PGRSYRFYTG +Y FG+GLSY+++S L
Sbjct: 577 RLPETWYPQEFADKVAMTDMNMRPNSTSGFPGRSYRFYTGKPIYKFGYGLSYSSFSTFVL 636
Query: 671 SAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSH 730
SAPS + I K N+ + T + V+ C L+ + I V N G GSH
Sbjct: 637 SAPSIIHI----KTNPIMNLNKTTSVDISTVN------CHDLKIRIVIGVKNHGLRSGSH 686
Query: 731 VVMLFARVPKVSQ-----GTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRR 785
VV++F + PK S+ G P QL+GF+RV +++ + D C+ LS+ + HG+R
Sbjct: 687 VVLVFWKPPKCSKSLVGGGVPLTQLVGFERVEVGRSMTEKFTVDFDVCKALSLVDTHGKR 746
Query: 786 ILPLGNHVLMVG-----ELRHSLTIE 806
L G+H L++G ++ H L +
Sbjct: 747 KLVTGHHKLVIGSNSDQQIYHHLNVR 772
>gi|371917282|dbj|BAL44717.1| SlArf/Xyl2 [Solanum lycopersicum]
Length = 774
Score = 734 bits (1896), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 351/772 (45%), Positives = 495/772 (64%), Gaps = 38/772 (4%)
Query: 39 KPDFPC--KPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAY 96
+P F C K F ++PFC T+L I R + LI LTLQEK++ L +NA+A+PRLGI Y
Sbjct: 28 RPPFACDQKNRAFRNFPFCQTNLPIGDRVRDLIGRLTLQEKVKLLGNNAAAVPRLGIKGY 87
Query: 97 EWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN 156
EWWSE+LHG+++ GPG F G TSFPQV+ +AASFN SLW IG V+ EARAMYN
Sbjct: 88 EWWSEALHGVSNVGPGTKFGGEFPGATSFPQVITTAASFNASLWEEIGRVVSDEARAMYN 147
Query: 157 LGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGF 216
GLT+W+PN+NIFRDPRWGRGQETPGEDP+V + YA +V+ QG
Sbjct: 148 GEMGGLTYWSPNVNIFRDPRWGRGQETPGEDPVVAALYAERYVRGLQG------------ 195
Query: 217 GFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQ 276
++ GD L ++ACCKH AYDL+ WG R+ FNA +T+QD EDTF
Sbjct: 196 --------------NEDGDSLKVAACCKHYTAYDLDNWGGVDRFHFNAKVTKQDIEDTFD 241
Query: 277 PPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFE 335
PFRSC++QGK + IMCSYNQVNG+P C L +K R WG GYI SDCD+V ++
Sbjct: 242 VPFRSCVKQGKVASIMCSYNQVNGIPTCADPQLLRKTIRGGWGLNGYIVSDCDSVGVFYD 301
Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
Q+YT T E++AA +KAG+D++CG + +HT++A+ G ++E ID L N +VQ+RL
Sbjct: 302 TQHYTSTPEEAAAAAIKAGLDLDCGPFLSQHTENAVHIGILKEAAIDTNLANTVAVQMRL 361
Query: 396 GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAII 455
G+F+G+P +YG LGP DVC+ H++LA++AARQGIVLLKN LPL+ ++A+I
Sbjct: 362 GMFDGEPSAQQYGHLGPRDVCSPAHQELAVEAARQGIVLLKNHGPALPLSPRRHRTVAVI 421
Query: 456 GPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAK 515
GP + M G Y G+ C S L+G+ Y +KT + GC DV C+ D F AV A+
Sbjct: 422 GPNSDVTVTMIGNYAGVACGYTSPLQGISKY-AKTIHEKGCGDVACSDDKLFAGAVNAAR 480
Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
+AD ++V GLD + E E RDR LLLPG Q L++ V++ S+ PV+LVL GGP+DV+F
Sbjct: 481 QADATVLVMGLDQSIEAEFRDRTGLLLPGFQQELISEVSKASRGPVVLVLMSGGPVDVTF 540
Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRA 634
A D +I +I+W GYPG+ G A+A+++FG NPGG+LPMTWYP+ + +PM M+MR+
Sbjct: 541 ANNDPRIGAIVWAGYPGQGGGAAIADVLFGAHNPGGKLPMTWYPQEYLNNLPMTTMDMRS 600
Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
+ ++ YPGR+YRFY G VY FGHGLSYT + AP L I + + + +
Sbjct: 601 NLAKGYPGRTYRFYKGPLVYPFGHGLSYTKFITTIFEAPKTLAIPIDGRHTYNSSTISNK 660
Query: 695 GSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFD 754
R+ + C+ + + + V N G DGSH +++F++ P V P KQL+ F
Sbjct: 661 SIRVTH------AKCSKISVQIHVDVKNVGPKDGSHTLLVFSK-PPVDIWVPHKQLVAFQ 713
Query: 755 RVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
+V+ A+ + ++ + C+ LS+ ++ G R +P+G H + +G+ +HSL+++
Sbjct: 714 KVYVPARSKQRVAINIHVCKYLSVVDRAGVRRIPIGEHSIHIGDAKHSLSLQ 765
>gi|449438167|ref|XP_004136861.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
[Cucumis sativus]
Length = 782
Score = 734 bits (1894), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/764 (48%), Positives = 494/764 (64%), Gaps = 41/764 (5%)
Query: 47 PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGI 106
P + FC++SL R + L+ LTLQEKI L +NA + RLGIP YEWWSE+LHG+
Sbjct: 55 PSVSGFAFCDSSLGFEARVEDLVKRLTLQEKIGFLINNARNVTRLGIPKYEWWSEALHGV 114
Query: 107 ASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWA 166
+ GPG F+ V TSFPQV+++AASFN SL+ IG V+ EARAMYN+G AGLT+W+
Sbjct: 115 SYVGPGTKFSNVVPGATSFPQVILTAASFNASLFEAIGKVVSTEARAMYNVGLAGLTYWS 174
Query: 167 PNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKG 226
PN+NIFRDPRWGRGQETPGEDP++ S YA +V+ Q + DDG
Sbjct: 175 PNVNIFRDPRWGRGQETPGEDPLLSSKYAAGYVRGLQ----QRDDG-------------- 216
Query: 227 FGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQG 286
D L ++ACCKH AYDL+ W RY FNA+++ QD EDTFQPPF+SC+ G
Sbjct: 217 ------DPDRLKVAACCKHYTAYDLDNWKGTDRYHFNAVVSPQDLEDTFQPPFKSCVIDG 270
Query: 287 KASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHED 345
+ +MCSYNQVNG P C DL R +W GYI SDCD+V ++ Q+YTK+ E+
Sbjct: 271 NVASVMCSYNQVNGKPTCADPDLLAGVIRGQWKLNGYIVSDCDSVDVLYNSQHYTKSPEE 330
Query: 346 SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG 405
+AA + AG+D++CG + +HT++A+ G V E I +A+ N +RLG F+G+P K
Sbjct: 331 AAAKTILAGLDLDCGDFLGKHTEAAVTGGLVNEAAISKAVFNNLLTLMRLGFFDGNPSKQ 390
Query: 406 KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQM 465
YGKLGP DVCT EH++LA +AARQGIVLLKN K LPL+ +A+ SLA+IGP N M
Sbjct: 391 LYGKLGPKDVCTPEHQELAREAARQGIVLLKNSPKSLPLSSSAIKSLAVIGPNANVTKTM 450
Query: 466 GGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG 525
G Y G PC + L+GL A VS T + GC +V C S A EA +IA AD ++V G
Sbjct: 451 IGNYEGTPCKYTTPLQGLSAVVS-TSFQPGCANVACTS-AQLDEAKKIAASADATVLVVG 508
Query: 526 LDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSI 585
D + E E RDRV L LPGQQ L+T VA+ SK PVILV+ GG +D++FA+ D +I+SI
Sbjct: 509 SDQSIEAESRDRVDLNLPGQQALLITEVAKASKGPVILVIMTGGGMDITFAKKDDKITSI 568
Query: 586 LWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRS 644
LW+G+PGEAG A+A++IFG FNP GRLPMTWYP+S+ KVPM DM MR +S +PGR+
Sbjct: 569 LWVGFPGEAGGAAIADVIFGSFNPSGRLPMTWYPQSYVEKVPMTDMRMRPSASNGFPGRT 628
Query: 645 YRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHID 704
YRFYTG +Y FG GLSY+++ + + AP ++I L+ G +I + H
Sbjct: 629 YRFYTGETIYSFGDGLSYSDFKHHLVKAPKLVSI--PLEEG---HICHSS-----KCHSL 678
Query: 705 EVT--SCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG 762
EV SC +L F V + V N G GSH V L++ P V +P+K L+GF++V G
Sbjct: 679 EVVQESCQNLGFDVHLRVKNVGQRSGSHTVFLYSTPPSV-HNSPQKHLLGFEKVSLGRGG 737
Query: 763 SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
+ F VD C+ LS+A++ G R + LG H+L VG L+HSL ++
Sbjct: 738 ETVVRFKVDVCKDLSVADEVGSRKVALGLHILHVGTLKHSLNVK 781
>gi|357445735|ref|XP_003593145.1| Beta-xylosidase/alpha-L-arabinofuranosidase [Medicago truncatula]
gi|355482193|gb|AES63396.1| Beta-xylosidase/alpha-L-arabinofuranosidase [Medicago truncatula]
Length = 775
Score = 733 bits (1893), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 371/769 (48%), Positives = 501/769 (65%), Gaps = 40/769 (5%)
Query: 42 FPC---KPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEW 98
F C K + SY FC+ SLS+ R L+ LTLQEKI L ++A + RLGIP YEW
Sbjct: 40 FACDVAKNTNVSSYGFCDKSLSVEDRVSDLVKRLTLQEKIGNLGNSAVEVSRLGIPKYEW 99
Query: 99 WSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
WSE+LHG+++ GPG +F+ V TSFP +++AASFN SL+ IGS V+ EARAMYN+G
Sbjct: 100 WSEALHGVSNIGPGTHFSSLVPGATSFPMPILTAASFNTSLFQAIGSVVSNEARAMYNVG 159
Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
AGLT+W+PNINIFRDPRWGRGQETPGEDP++ S YA +VK Q ++DDG
Sbjct: 160 LAGLTYWSPNINIFRDPRWGRGQETPGEDPLLSSKYAAGYVKGLQ----QTDDG------ 209
Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
D+L ++ACCKH AYD++ W RY+F+A++++QD +DTFQPP
Sbjct: 210 --------------DSDKLKVAACCKHYTAYDVDNWKGVQRYTFDAVVSQQDLDDTFQPP 255
Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQ 337
F+SC+ G + +MCSYN+VNG P C DL + R +W GYI SDCD+V +F+ Q
Sbjct: 256 FKSCVIDGNVASVMCSYNKVNGKPTCADPDLLKGVIRGKWKLNGYIVSDCDSVEVLFKDQ 315
Query: 338 NYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
+YTKT E++AA + +G+D++CG+ + ++T A+ +G V E I+ A+ N F+ +RLG
Sbjct: 316 HYTKTPEEAAAKTILSGLDLDCGSYLGQYTGGAVKQGLVDEASINNAVSNNFATLMRLGF 375
Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
F+GDP K YG LGP DVCT E+++LA +AARQGIVLLKN LPL+ A+ SLA+IGP
Sbjct: 376 FDGDPSKQPYGNLGPKDVCTPENQELAREAARQGIVLLKNSPGSLPLSSKAIKSLAVIGP 435
Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKA 517
N M G Y GIPC S L+GL A+V T YA GC DV C ++A +A +IA A
Sbjct: 436 NANATRVMIGNYEGIPCKYTSPLQGLTAFV-PTSYAPGCPDVQC-ANAQIDDAAKIAASA 493
Query: 518 DFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAE 577
D I+V G +L E E DRV++LLPGQQ LV VA SK PVILV+ GG +DVSFA+
Sbjct: 494 DATIIVVGANLAIEAESLDRVNILLPGQQQQLVNEVANVSKGPVILVIMSGGGMDVSFAK 553
Query: 578 ADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADS 636
+ +I+SILW+GYPGEAG A+A++IFG +NP GRLPMTWYP+S+ K+PM +MNMR+D
Sbjct: 554 TNDKITSILWVGYPGEAGGAAIADVIFGSYNPSGRLPMTWYPQSYVEKIPMTNMNMRSDP 613
Query: 637 SRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGS 696
+ YPGR+YRFY G V+ FG G+S+ +K + AP +++ + D
Sbjct: 614 ATGYPGRTYRFYKGETVFSFGDGMSFGTVEHKIVKAPQLVSVPLA----EDHECRSLECK 669
Query: 697 RLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRV 756
LD DE C +L F + +SV N G + SH V+LF P V P+K L+GF++V
Sbjct: 670 SLDVA--DE--HCQNLAFDIHLSVKNMGKMSSSHSVLLFFTPPNV-HNAPQKHLLGFEKV 724
Query: 757 HTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
K + F VD C LS+ ++ G R +PLG+H+L VG L+HSL++
Sbjct: 725 QLAGKSEGMVRFKVDVCNDLSVVDELGNRKVPLGDHMLHVGNLKHSLSV 773
>gi|62701894|gb|AAX92967.1| beta-xylosidase, putative [Oryza sativa Japonica Group]
gi|77550041|gb|ABA92838.1| Glycosyl hydrolase family 3 C terminal domain containing protein
[Oryza sativa Japonica Group]
Length = 793
Score = 733 bits (1893), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 375/780 (48%), Positives = 488/780 (62%), Gaps = 66/780 (8%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
FC+ L+ RA L++ LTL EK+ QL D A+ + RLG+PAYEWWSE LHG++ G G+
Sbjct: 32 FCDAWLTAEQRAADLVANLTLAEKVSQLGDRAAGVARLGVPAYEWWSEGLHGLSIWGRGI 91
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA-GLTFWAPNINIF 172
FNGTV +VTSFPQV+++AA+F+ LW +G AV EARA+YNLGQA GLT W+PN+NIF
Sbjct: 92 RFNGTVRAVTSFPQVILTAAAFDAGLWRRVGEAVGAEARALYNLGQANGLTIWSPNVNIF 151
Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
RDPRWGRGQETPGEDP+ S YAV FV QG G E+
Sbjct: 152 RDPRWGRGQETPGEDPVTASRYAVAFVTGLQG----------------------IGGEA- 188
Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNA---------------------------- 264
SACCKH AYDL+ W N RY++++
Sbjct: 189 -------SACCKHATAYDLDYWNNVVRYNYDSKDGASTGKSGETSSQVEKKHGPYEKGYF 241
Query: 265 MITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYI 323
+T QD EDT+ PPF+SC+ +GKA+CIMC YN +NGVPAC DL +K R EWG GY+
Sbjct: 242 AVTLQDLEDTYNPPFKSCVAEGKATCIMCGYNSINGVPACASSDLLTKKVRQEWGMNGYV 301
Query: 324 TSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDR 383
SDCDAVATI + +YT + ED+ A +K GMD+NCG H +A+ KG + EKDIDR
Sbjct: 302 ASDCDAVATIRDAHHYTLSPEDTVAVSIKVGMDVNCGNYTQVHAMAAVQKGNLTEKDIDR 361
Query: 384 ALLNLFSVQLRLGLFNGDPRKGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFL 442
AL+NLF+V++RLG F+GDPR YG LG DVC+ HK LAL+AA+ GIVLLKND L
Sbjct: 362 ALVNLFAVRMRLGHFDGDPRSNAVYGHLGAADVCSPAHKSLALEAAQDGIVLLKNDAGAL 421
Query: 443 PLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVS-KTHYASGCHDVPC 501
PL +AV+SLA+IGP +N+ + G Y G PC + L+G++ Y+ + + +GC D P
Sbjct: 422 PLQPSAVTSLAVIGPNADNLGALHGNYFGPPCETTTPLQGIKGYLGDRARFLAGC-DSPA 480
Query: 502 NSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPV 561
+ A +EA +A +D V++ GL QE + DR SLLLPG+Q L+T+VA ++RPV
Sbjct: 481 CAVAATNEAAALASSSDHVVLFMGLSQKQEQDGLDRTSLLLPGEQQGLITAVANAARRPV 540
Query: 562 ILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPES 621
ILVL GGP+DV+FA+ + +I +ILW GYPG+AG A+A+++FGD NP GRLP+TWYPE
Sbjct: 541 ILVLLTGGPVDVTFAKDNPKIGAILWAGYPGQAGGLAIAKVLFGDHNPSGRLPVTWYPEE 600
Query: 622 FTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTI-SA 680
FTKVPM DM MRAD + YPGRSYRFY G VY FG+GLSY+ +S + S+ S +
Sbjct: 601 FTKVPMTDMRMRADPATGYPGRSYRFYQGNTVYNFGYGLSYSKFSRRMFSSFSTSNAGNL 660
Query: 681 SLKAGSDKNILQQTGSRLDYVHIDE--VTSCTSLRFHVQISVTNAGDVDGSHVVMLFARV 738
SL AG G + + E V C+ L F + V N G +DG H V+++ R
Sbjct: 661 SLLAGVMARRAGDDGGGMSSYLVKEIGVERCSRLVFPAVVEVQNHGPMDGKHSVLMYLRW 720
Query: 739 PKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
P S G P +QLIGF H +SF V PCE S + G R++ G H LMVG+
Sbjct: 721 PTTSGGRPARQLIGFRSQHVKVGEKAMVSFEVSPCEHFSWVGEDGERVIDGGAHFLMVGD 780
>gi|292630922|sp|A5JTQ2.1|XYL1_MEDVA RecName: Full=Beta-xylosidase/alpha-L-arabinofuranosidase 1;
AltName: Full=Xylan
1,4-beta-xylosidase/Alpha-N-arabinofuranosidase 1;
Short=MsXyl1; Includes: RecName: Full=Beta-xylosidase;
AltName: Full=1,4-beta-D-xylan xylohydrolase; AltName:
Full=Xylan 1,4-beta-xylosidase; Includes: RecName:
Full=Alpha-N-arabinofuranosidase; AltName:
Full=Alpha-L-arabinofuranosidase; Short=Arabinosidase;
Flags: Precursor
gi|146762261|gb|ABQ45227.1| beta-xylosidase/alpha-L-arabinosidase [Medicago sativa subsp. x
varia]
Length = 774
Score = 733 bits (1892), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 369/769 (47%), Positives = 499/769 (64%), Gaps = 40/769 (5%)
Query: 42 FPC---KPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEW 98
F C K + SY FC+ SLS+ R L+ LTLQEKI L ++A + RLGIP YEW
Sbjct: 39 FACDVAKNTNVSSYGFCDNSLSVEDRVSDLVKRLTLQEKIGNLGNSAVEVSRLGIPKYEW 98
Query: 99 WSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
WSE+LHG+++ GPG +F+ V T+FP +++AASFN SL+ IGS V+ EARAMYN+G
Sbjct: 99 WSEALHGVSNIGPGTHFSSLVPGATNFPMPILTAASFNTSLFQAIGSVVSNEARAMYNVG 158
Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
AGLT+W+PNINIFRDPRWGRGQETPGEDP++ S YA +VK Q ++DDG
Sbjct: 159 LAGLTYWSPNINIFRDPRWGRGQETPGEDPLLSSKYAAGYVKGLQ----QTDDG------ 208
Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
D+L ++ACCKH AYD++ W RY+F+A++++QD +DTFQPP
Sbjct: 209 --------------DSDKLKVAACCKHYTAYDVDNWKGVQRYTFDAVVSQQDLDDTFQPP 254
Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQ 337
F+SC+ G + +MCSYN+VNG P C DL + R +W GYI SDCD+V +++ Q
Sbjct: 255 FKSCVIDGNVASVMCSYNKVNGKPTCADPDLLKGVIRGKWKLNGYIVSDCDSVEVLYKDQ 314
Query: 338 NYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
+YTKT E++AA + +G+D++CG+ + ++T A+ +G V E I A+ N F+ +RLG
Sbjct: 315 HYTKTPEEAAAKTILSGLDLDCGSYLGQYTGGAVKQGLVDEASITNAVSNNFATLMRLGF 374
Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
F+GDP K YG LGP DVCT E+++LA +AARQGIVLLKN + LPL+ A+ SLA+IGP
Sbjct: 375 FDGDPSKQPYGNLGPKDVCTPENQELAREAARQGIVLLKNSPRSLPLSSKAIKSLAVIGP 434
Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKA 517
N M G Y GIPC S L+GL A+V T YA GC DV C ++A +A +IA A
Sbjct: 435 NANATRVMIGNYEGIPCKYTSPLQGLTAFV-PTSYAPGCPDVQC-ANAQIDDAAKIAASA 492
Query: 518 DFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAE 577
D I+V G +L E E DRV++LLPGQQ LV VA SK PVILV+ GG +DVSFA+
Sbjct: 493 DATIIVVGANLAIEAESLDRVNILLPGQQQQLVNEVANVSKGPVILVIMSGGGMDVSFAK 552
Query: 578 ADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADS 636
+ +I+SILW+GYPGEAG A+A++IFG +NP GRLPMTWYP+S+ KVPM +MNMRAD
Sbjct: 553 TNDKITSILWVGYPGEAGGAAIADVIFGSYNPSGRLPMTWYPQSYVEKVPMTNMNMRADP 612
Query: 637 SRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGS 696
+ YPGR+YRFY G V+ FG G+S+ +K + AP +++ + D
Sbjct: 613 ATGYPGRTYRFYKGETVFSFGDGMSFGTVEHKIVKAPQLVSVPLA----EDHECRSLECK 668
Query: 697 RLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRV 756
LD C +L F + +SV N G + SH V+LF P V P+K L+GF++V
Sbjct: 669 SLDVAD----KHCQNLAFDIHLSVKNMGKMSSSHSVLLFFTPPNV-HNAPQKHLLGFEKV 723
Query: 757 HTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
K + F VD C LS+ ++ G R +PLG+H+L VG L+HSL++
Sbjct: 724 QLAGKSEGMVRFKVDVCNDLSVVDELGNRKVPLGDHMLHVGNLKHSLSV 772
>gi|357156390|ref|XP_003577440.1| PREDICTED: probable beta-D-xylosidase 7-like [Brachypodium
distachyon]
Length = 755
Score = 732 bits (1890), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 361/784 (46%), Positives = 492/784 (62%), Gaps = 56/784 (7%)
Query: 40 PDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWW 99
P F C PP Y FCN +L RA L++ LTL+EK+ QL D A +PR G+P Y WW
Sbjct: 12 PAFSCGPPQQAQYAFCNRALPAEQRAADLVAKLTLEEKVSQLGDQAPGVPRFGVPGYNWW 71
Query: 100 SESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ 159
SE LHG++ G G++FNG V VT+FPQVL++ ASF+ S+W IG A+ EARAM+NLGQ
Sbjct: 72 SEGLHGVSMWGHGMHFNGAVRGVTTFPQVLLTTASFDDSIWYRIGQAIGTEARAMFNLGQ 131
Query: 160 A-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
A GLT W+PN+NI+RDPRWGRGQETPGEDP S YAV FV+ QG + +
Sbjct: 132 ADGLTIWSPNVNIYRDPRWGRGQETPGEDPATASKYAVAFVRGLQGTSTTT--------- 182
Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
L SACCKH AYDL+ W RY+FNA +T QD E+TF PP
Sbjct: 183 ------------------LQTSACCKHATAYDLDDWNRIGRYNFNAKVTAQDLEETFNPP 224
Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQ 337
F+SC+ +GKA+C+MC+Y VNG+PAC L K + EWG GYI+SDCDAVA ++ +
Sbjct: 225 FKSCVVEGKATCVMCAYTSVNGIPACADSGLLTKTIKGEWGMNGYISSDCDAVALLYGTR 284
Query: 338 NYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
Y+ T ED+ A +KAG+D+NCG H +A+ + K+ E+D+D+AL NLF++++RLG
Sbjct: 285 -YSGTPEDAVAAAIKAGLDMNCGNFSQVHGMAALQQRKMSEQDVDKALRNLFAIRMRLGH 343
Query: 398 FNGDPRKGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNK--NAVSSLAI 454
F+GDP + YG+LG DVC+ HK LAL+AA+ GIVLLKND LPL++ A +S A+
Sbjct: 344 FDGDPLQSPLYGRLGAQDVCSPAHKDLALEAAQNGIVLLKNDAATLPLSRPTAASASFAV 403
Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSK-THYASGCHDVPCNSDAGFHEAVRI 513
IGP N + G Y G PC + L+ L+ + SK + GC CN A ++A +
Sbjct: 404 IGPNANEPGALLGNYFGPPCETTTPLQALQKFYSKNVRFVPGCDSAACNV-ADTYQASGL 462
Query: 514 AKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
A +D+ I+ GL QE E DR SLLLPG+Q SL+T+VA +KRP+ILVL GGP+D+
Sbjct: 463 AATSDYTILFMGLSQKQEQEGLDRTSLLLPGKQESLITAVAAAAKRPIILVLLTGGPVDI 522
Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMR 633
+FA+ + +I +ILW GYPG+AG A+A+++FG+ NP GRLP+TWYPE +TKVPM+DM MR
Sbjct: 523 TFAKFNPKIGAILWAGYPGQAGGLAIAKVLFGEHNPSGRLPVTWYPEEYTKVPMDDMRMR 582
Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFL-------SAP-SELTISASLKAG 685
AD + YPGRSYRFY G VY FG+GLSY+ +S + + AP +EL +A++ G
Sbjct: 583 ADPATGYPGRSYRFYKGNAVYKFGYGLSYSKFSRQLVRNSSSNNRAPNTELLAAAAVDCG 642
Query: 686 SDKNILQQTGSRLDYVHIDEVTS--CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ 743
+ + Y ++E+ C L+F + V N G +DG V+LF R P ++
Sbjct: 643 ASR-----------YYLVEEIGGEVCERLKFPAVVEVENHGPMDGKQSVLLFLRWPTATE 691
Query: 744 GTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSL 803
G P QL+GF A +SF + PCE S G +++ G+H LMV E +
Sbjct: 692 GRPASQLVGFRSQDLRAGEKASVSFDISPCEHFSRTTVDGTKVIDRGSHFLMVDEDEMEI 751
Query: 804 TIET 807
+ ++
Sbjct: 752 SFDS 755
>gi|449479116|ref|XP_004155509.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
[Cucumis sativus]
Length = 809
Score = 731 bits (1888), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/764 (48%), Positives = 494/764 (64%), Gaps = 41/764 (5%)
Query: 47 PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGI 106
P + FC++SL R + L+ LTLQEKI L +NA + RLGIP YEWWSE+LHG+
Sbjct: 82 PSVSGFAFCDSSLGFEARVEDLVKRLTLQEKIGFLINNARNVTRLGIPKYEWWSEALHGV 141
Query: 107 ASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWA 166
+ GPG F+ V TSFPQV+++AASFN SL+ IG V+ EARAMYN+G AGLT+W+
Sbjct: 142 SYVGPGTKFSNVVPGATSFPQVILTAASFNASLFEAIGKVVSTEARAMYNVGLAGLTYWS 201
Query: 167 PNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKG 226
PN+NIFRDPRWGRGQETPGEDP++ S YA +V+ Q + DDG
Sbjct: 202 PNVNIFRDPRWGRGQETPGEDPLLSSKYAAGYVRGLQ----QRDDG-------------- 243
Query: 227 FGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQG 286
D L ++ACCKH AYDL+ W RY FNA+++ QD EDTFQPPF+SC+ G
Sbjct: 244 ------DPDRLKVAACCKHYTAYDLDNWKGTDRYHFNAVVSPQDLEDTFQPPFKSCVIDG 297
Query: 287 KASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHED 345
+ +MCSYNQVNG P C DL R +W GYI SDCD+V ++ Q+YTK+ E+
Sbjct: 298 NVASVMCSYNQVNGKPTCADPDLLAGVIRGQWKLNGYIVSDCDSVDVLYNSQHYTKSPEE 357
Query: 346 SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG 405
+AA + AG+D++CG + +HT++A+ G V E I +A+ N +RLG F+G+P K
Sbjct: 358 AAAKTILAGLDLDCGDFLGKHTEAAVTGGLVNEAAISKAVFNNLLTLMRLGFFDGNPSKQ 417
Query: 406 KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQM 465
YGKLGP DVCT EH++LA +AARQGIVLLKN K LPL+ +A+ SLA+IGP N M
Sbjct: 418 LYGKLGPKDVCTPEHQELAREAARQGIVLLKNSPKSLPLSSSAIKSLAVIGPNANVTKTM 477
Query: 466 GGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG 525
G Y G PC + L+GL A VS T + GC +V C S A EA +IA AD ++V G
Sbjct: 478 IGNYEGTPCKYTTPLQGLSAVVS-TSFQPGCANVACTS-AQLDEAKKIAASADATVLVVG 535
Query: 526 LDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSI 585
D + E E RDRV L LPGQQ L+T VA+ SK PVILV+ GG +D++FA+ D +I+SI
Sbjct: 536 SDQSIEAESRDRVDLNLPGQQALLITEVAKASKGPVILVIMTGGGMDITFAKKDDKITSI 595
Query: 586 LWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRS 644
LW+G+PGEAG A+A++IFG FNP GRLPMTWYP+S+ KVPM DM MR +S +PGR+
Sbjct: 596 LWVGFPGEAGGAAIADVIFGSFNPSGRLPMTWYPQSYVEKVPMTDMRMRPSASNGFPGRT 655
Query: 645 YRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHID 704
YRFYTG +Y FG GLSY+++ + + AP ++I L+ G +I + H
Sbjct: 656 YRFYTGETIYSFGDGLSYSDFKHHLVKAPKLVSI--PLEEG---HICHSSK-----CHSL 705
Query: 705 EVT--SCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG 762
EV SC +L F V + V N G GSH V L++ P V +P+K L+GF++V G
Sbjct: 706 EVVQESCQNLGFDVHLRVKNVGQRSGSHTVFLYSTPPSV-HNSPQKHLLGFEKVSLGRGG 764
Query: 763 SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
+ F VD C+ LS+A++ G R + LG H+L VG L+HSL ++
Sbjct: 765 ETVVRFKVDVCKDLSVADEVGSRKVALGLHILHVGTLKHSLNVK 808
>gi|74355968|dbj|BAE44362.1| alpha-L-arabinofuranosidase [Raphanus sativus]
Length = 780
Score = 731 bits (1887), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/807 (46%), Positives = 500/807 (61%), Gaps = 48/807 (5%)
Query: 11 RERILFRVLQLIVVVNVIAFSNSKPVLNKPDFPCK---PPHFDSYPFCNTSLSISTRAKS 67
R R V L ++ + +S + P F C P +Y FCNT++ I R
Sbjct: 10 RNRAPSSVFSLSLIF--LCLLDSSNAQSTPVFACDVAGNPSLAAYGFCNTAIKIEYRVAD 67
Query: 68 LISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQ 127
L++ LTLQEKI L+ + RLGIP YEWWSE+LHG++ GPG F+G V TSFPQ
Sbjct: 68 LVARLTLQEKIGVLTSKLHGVARLGIPTYEWWSEALHGVSYVGPGTRFSGQVPGATSFPQ 127
Query: 128 VLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGED 187
V+++AASFN SL+ IG V+ EARAMYN+G AGLT+W+PN+NIFRDPRWGRGQETPGED
Sbjct: 128 VILTAASFNVSLFQAIGKVVSTEARAMYNVGLAGLTYWSPNVNIFRDPRWGRGQETPGED 187
Query: 188 PMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD--ELMLSACCKH 245
P++ S YA +VK Q E+D D L ++ACCKH
Sbjct: 188 PLLSSKYASGYVKGLQ--------------------------ETDSSDANRLKVAACCKH 221
Query: 246 LIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACL 305
AYD++ W RYSFNA++ +QD +DT+QPPF+SC+ G + +MCSYN+VNG P C
Sbjct: 222 YTAYDVDNWKGVERYSFNAVVNQQDLDDTYQPPFKSCVVDGNVASVMCSYNKVNGKPTCA 281
Query: 306 RGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML 364
DL R EW GYI SDCD+V +++ Q+YTKT E++AA + AG+D+NCG +
Sbjct: 282 DPDLLSGVIRGEWKLNGYIVSDCDSVDVLYKNQHYTKTPEEAAAISINAGLDLNCGYFLG 341
Query: 365 RHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLA 424
HT++A+ G V+E ID+A+ N F +RLG F+GDP+K YG LGP DVCT +++LA
Sbjct: 342 DHTEAAVKAGLVKEAAIDKAITNNFLTLMRLGFFDGDPKKQIYGGLGPKDVCTPANQELA 401
Query: 425 LDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLE 484
+AARQGIVLLKN LPL+ + +LA+IGP N M G Y G PC + L+GL
Sbjct: 402 AEAARQGIVLLKNTGA-LPLSPKTIKTLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLA 460
Query: 485 AYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPG 544
V T Y GC +V C + A + ++A +D ++V G D + E E RDRV L LPG
Sbjct: 461 GTVHTT-YLPGCSNVAC-AVADVAGSTKLAAASDATVLVIGADQSIEAESRDRVDLNLPG 518
Query: 545 QQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIF 604
QQ LVT VA+ +K PV LV+ GG D++FA+ D++I+ ILW+GYPGEAG A A++IF
Sbjct: 519 QQQELVTQVAKAAKGPVFLVIMSGGGFDITFAKNDAKIAGILWVGYPGEAGGIATADVIF 578
Query: 605 GDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYT 663
G +NP GRLPMTWYP+S+ KVPM +MNMR D S YPGR+YRFYTG VY FG GLSYT
Sbjct: 579 GRYNPSGRLPMTWYPQSYVEKVPMTNMNMRPDKSNGYPGRTYRFYTGETVYAFGDGLSYT 638
Query: 664 NYSYKFLSAPSELTISAS---LKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR-FHVQIS 719
+S+ + AP +++S + S+ L G H D S T + F V I
Sbjct: 639 KFSHSLVKAPRLVSLSLEENHVCRSSECQSLNAIGP-----HCDNAVSGTGGKAFEVHIK 693
Query: 720 VTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIA 779
V N GD +G H V LF P V G+P K L+GF+++ + F VD C+ LS+
Sbjct: 694 VQNGGDREGIHTVFLFTTPPAV-HGSPRKHLLGFEKIRLGKMEEAVVKFKVDVCKDLSVV 752
Query: 780 NKHGRRILPLGNHVLMVGELRHSLTIE 806
++ G+R + LG H+L VG+++HSL+I
Sbjct: 753 DEVGKRKIGLGQHLLHVGDVKHSLSIR 779
>gi|356534827|ref|XP_003535953.1| PREDICTED: probable beta-D-xylosidase 2-like [Glycine max]
Length = 771
Score = 731 bits (1886), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 349/770 (45%), Positives = 492/770 (63%), Gaps = 39/770 (5%)
Query: 42 FPCKPPHF--DSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWW 99
F C P + + PFC L+ R K LI LTLQEK+ L +NA+A+PRLGI YEWW
Sbjct: 27 FACDPKNTATKNLPFCKAWLATGARVKDLIGRLTLQEKVNLLVNNAAAVPRLGIKGYEWW 86
Query: 100 SESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ 159
SE+LHG+++ GPG F G + TSFPQV+ +AASFN SLW IG + EARAMYN G
Sbjct: 87 SEALHGVSNVGPGTKFGGQFPAATSFPQVITTAASFNASLWEAIGRVASDEARAMYNGGT 146
Query: 160 AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFR 219
AGLT+W+PN+NIFRDPRWGRGQETPGEDP++ YA +V+ Q
Sbjct: 147 AGLTYWSPNVNIFRDPRWGRGQETPGEDPILAGKYAASYVRGLQ---------------- 190
Query: 220 EKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPF 279
E+D G+ L ++A CKH AYDL+ W R+ FNA +++QD EDTF PF
Sbjct: 191 ----------ETD-GNRLKVAASCKHFTAYDLDNWNGVDRFHFNAQVSKQDIEDTFNVPF 239
Query: 280 RSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQN 338
R C+++GK + +MCSYNQVNGVP C L ++ R +WG GYI SDCD+V + Q+
Sbjct: 240 RMCVKEGKVASVMCSYNQVNGVPTCADPILLKRTVRGQWGLNGYIVSDCDSVGVFYNSQH 299
Query: 339 YTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
YT T E++AA +KAG+D++CG + +HTQ+A+ KG + E D++ ALLN +VQ+RLG++
Sbjct: 300 YTSTPEEAAADAIKAGLDLDCGPFLGQHTQNAVKKGLISETDVNGALLNTLTVQMRLGMY 359
Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
+G+P YGKLGP DVCT H++LAL+AARQGIVLLKN LPL+ ++A+IGP
Sbjct: 360 DGEPSSHPYGKLGPRDVCTPSHQELALEAARQGIVLLKNKGPSLPLSTRRHPTVAVIGPN 419
Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKAD 518
N M G Y GI C S L G+ Y +KT + GC +V C +D F A+ +A++AD
Sbjct: 420 SNVTVTMIGNYAGIACGYTSPLEGIGRY-TKTIHELGCANVACTNDKQFGRAINVAQQAD 478
Query: 519 FVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEA 578
++V GLD + E E DR LLLPG+Q LV+ VA SK P ILV+ GGP+D++FA+
Sbjct: 479 ATVLVMGLDQSIEAETVDRAGLLLPGRQQDLVSKVAAASKGPTILVIMSGGPVDITFAKN 538
Query: 579 DSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK-VPMNDMNMRADSS 637
+ +I +ILW GYPG+AG A+A+I+FG NPGG+LPMTWYP+ + K +PM +M MRA S
Sbjct: 539 NPRIQAILWAGYPGQAGGAAIADILFGTSNPGGKLPMTWYPQGYIKNLPMTNMAMRASRS 598
Query: 638 RQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSR 697
+ YPGR+YRFY G VY FG+GLSYT++ + SAP ++I + + + +
Sbjct: 599 KGYPGRTYRFYNGPVVYPFGYGLSYTHFVHTLASAPKLVSIPVDGHRHGNSSSIANKAIK 658
Query: 698 LDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVS-QGTPEKQLIGFDRV 756
+ + C L +Q+ V N G DG+H +++F+ P + P KQL+ F ++
Sbjct: 659 VTHAR------CGKLSISLQVDVKNVGSKDGTHTLLVFSAPPAGNGHWAPHKQLVAFQKL 712
Query: 757 HTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
H +K + ++ + C+ LS+ ++ G R +P+G H L +G+++H ++++
Sbjct: 713 HIPSKAQQRVNVNIHVCKLLSVVDRSGTRRVPMGLHSLHIGDVKHYVSLQ 762
>gi|255573163|ref|XP_002527511.1| Beta-glucosidase, putative [Ricinus communis]
gi|223533151|gb|EEF34909.1| Beta-glucosidase, putative [Ricinus communis]
Length = 810
Score = 731 bits (1886), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 371/787 (47%), Positives = 504/787 (64%), Gaps = 48/787 (6%)
Query: 41 DFPC--KPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEW 98
+F C P + Y FCNTSLS RAK LIS LTLQEK+QQ+ ++A+ IPRLGIPAYEW
Sbjct: 24 NFACDKNSPQTNDYSFCNTSLSYQDRAKDLISRLTLQEKVQQVVNHAAGIPRLGIPAYEW 83
Query: 99 WSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
WSE+LHG+++ G GV FNGTV TSFP +++SAASFN +LW +G V+ EAR M+++G
Sbjct: 84 WSEALHGVSNVGFGVRFNGTVPGATSFPAMILSAASFNETLWLKMGQVVSTEARTMHSVG 143
Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
AGLT+W+PN+N+FRDPRWGRGQETPGEDP+VVS YAV +V
Sbjct: 144 LAGLTYWSPNVNVFRDPRWGRGQETPGEDPLVVSRYAVNYV------------------- 184
Query: 219 REKRVLKGFGEESDR-GDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQP 277
R L+ G+E + D+L +S+CCKH AYDL+KW R+ F+A +T+QD EDT+QP
Sbjct: 185 ---RGLQEVGDEGNSTADKLKVSSCCKHYTAYDLDKWKGVDRFHFDAKVTKQDLEDTYQP 241
Query: 278 PFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFEY 336
PFRSC+E+ S +MCSYN+VNG+P C DL + R EW GYI SDCD++ ++
Sbjct: 242 PFRSCVEEAHVSSVMCSYNRVNGIPTCADPDLLKGIIRGEWNLDGYIVSDCDSIEVYYDS 301
Query: 337 QNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
NYT T ED+ A LKAG+++NCG + ++T A+ KV+E +D+AL+ F V +RLG
Sbjct: 302 INYTATPEDAVALALKAGLNMNCGEFLGKYTVDAVKLNKVEESVVDQALIYNFIVLMRLG 361
Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIG 456
F+GDP+ +G LGP DVC+ H+KLALDAARQGIVLL N K LPL+KN +LA+IG
Sbjct: 362 FFDGDPKSLLFGNLGPSDVCSDGHQKLALDAARQGIVLLYN-KGALPLSKNNTRNLAVIG 420
Query: 457 PLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKK 516
P N + M Y GIPC + L+GL+ YVS YA+GC V C+ D A + A
Sbjct: 421 PNANVTTTMISNYAGIPCKYTTPLQGLQKYVSTVTYAAGCKSVSCSDDTLIDAATQAAAA 480
Query: 517 ADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFA 576
AD V+++ GLD + E E DR +L LPG Q LV V + V+LV+ P+DVSFA
Sbjct: 481 ADAVVLLVGLDQSIEREGLDRENLTLPGFQEKLVVDVVNATNGTVVLVVMSSSPIDVSFA 540
Query: 577 EADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRAD 635
S+I ILW+GYPG+AG A+A+++FGD+NP GR P TWYP+ + +VPM DMNMRA+
Sbjct: 541 VNKSKIKGILWVGYPGQAGGDAVAQVMFGDYNPAGRSPFTWYPQEYAHQVPMTDMNMRAN 600
Query: 636 SSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKN---ILQ 692
S+ +PGR+YRFY G +Y FGHGLSY+ +S +S PS L LK SD IL
Sbjct: 601 STANFPGRTYRFYAGNTLYKFGHGLSYSTFSNFIISGPSTLL----LKTNSDLKPDIILS 656
Query: 693 QTGSRLDYVHID----EVTS--CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ--G 744
S ++ I+ ++T+ CT+ + + V N G V G HVV++F + P S+ G
Sbjct: 657 THNSTEEHPFINSQAMDITTLNCTNSLLSLILGVRNNGPVSGDHVVLVFWKPPNSSEVTG 716
Query: 745 TPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG-----EL 799
QL+GF RV ++ ++ +D C++LS+ + G+R L G H+ +G ++
Sbjct: 717 AANVQLVGFSRVEVNRGKTQNVTLEIDVCKRLSLVDSEGKRKLVTGQHIFTIGSDTEHQV 776
Query: 800 RHSLTIE 806
RH L +
Sbjct: 777 RHHLDVR 783
>gi|255556320|ref|XP_002519194.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
gi|223541509|gb|EEF43058.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
Length = 782
Score = 730 bits (1885), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 355/775 (45%), Positives = 505/775 (65%), Gaps = 40/775 (5%)
Query: 36 VLNKPDFPCKPPH--FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGI 93
V ++ F C P + + FC +L I R + LIS LTLQEKI+ L +NA+A+PRLGI
Sbjct: 36 VESRAPFACDPRNGVTRNLKFCRANLPIHVRVRDLISRLTLQEKIRLLVNNAAAVPRLGI 95
Query: 94 PAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARA 153
YEWWSE+LHG+++ GPGV F G TSFPQV+ +AASFN+SLW IG V+ EARA
Sbjct: 96 QGYEWWSEALHGVSNVGPGVKFGGAFPGATSFPQVITTAASFNQSLWEQIGRVVSDEARA 155
Query: 154 MYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGG 213
MYN G AGLT+W+PN+N+FRDPRWGRGQETPGEDP++ YA +V+ Q
Sbjct: 156 MYNGGLAGLTYWSPNVNVFRDPRWGRGQETPGEDPVLAGKYAASYVRGLQ---------- 205
Query: 214 IGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTED 273
S G +L ++ACCKH AYDL+ W RY FNA +++QD ED
Sbjct: 206 -----------------SSTGLKLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLED 248
Query: 274 TFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVAT 332
T+ PF++C+ +GK + +MCSYNQVNG P C L + R +WG GYI SDCD+V
Sbjct: 249 TYDVPFKACVVEGKVASVMCSYNQVNGKPTCADPILLKNTIRGQWGLNGYIVSDCDSVGV 308
Query: 333 IFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQ 392
+++ Q+YT T E++AA +KAG+D++CG + HT++A+ KG + E+D++ AL N +VQ
Sbjct: 309 LYDNQHYTSTPEEAAAATIKAGLDLDCGPFLAIHTENAVKKGLLVEEDVNLALANTITVQ 368
Query: 393 LRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSL 452
+RLG+F+G+P YG LGP DVCT H++LAL+AARQGIVLL+N + LPL+ + ++
Sbjct: 369 MRLGMFDGEPSAHPYGNLGPRDVCTPAHQELALEAARQGIVLLENRGQALPLSSSRHHTI 428
Query: 453 AIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVR 512
A+IGP + M G Y GI C S L+G+ Y +KT + +GC DV C+S+ F A
Sbjct: 429 AVIGPNSDVTVTMIGNYAGIACKYTSPLQGISRY-AKTLHQNGCGDVACHSNQQFGAAEA 487
Query: 513 IAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLD 572
A++AD ++V GLD + E E RDRV LLLPG Q LV+ VAR S+ P ILVL GGP+D
Sbjct: 488 AARQADATVLVMGLDQSIEAEFRDRVGLLLPGHQQELVSRVARASRGPTILVLMSGGPID 547
Query: 573 VSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMN 631
VSFA+ D ++ +ILW GYPG+AG A+A+++FG NPGG+LPMTWYP+ + KVPM +M
Sbjct: 548 VSFAKNDPRVGAILWAGYPGQAGGAAIADVLFGTTNPGGKLPMTWYPQGYLAKVPMTNMG 607
Query: 632 MRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNIL 691
MR D + YPGR+YRFY G V+ FGHG+SYT++S+ AP E+++ + + I
Sbjct: 608 MRPDPATGYPGRTYRFYKGNVVFPFGHGMSYTSFSHSLTQAPKEVSLPITNLYALNTTIS 667
Query: 692 QQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLI 751
+ + + + TSL + I+V N G +DG+H +++F+ P + + KQLI
Sbjct: 668 SKA------IRVSHINCQTSLG--IDINVKNTGTMDGTHTLLVFSSPPSGEKESSNKQLI 719
Query: 752 GFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
GF++V VA ++ + C+ LS ++ G R +P+G+H + +G+L+HS++++
Sbjct: 720 GFEKVDLVAGSQIQVKIDIHVCKHLSAVDRFGIRRIPIGDHHIYIGDLKHSISLQ 774
>gi|225428983|ref|XP_002264114.1| PREDICTED: probable beta-D-xylosidase 5-like [Vitis vinifera]
Length = 818
Score = 730 bits (1884), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 358/766 (46%), Positives = 498/766 (65%), Gaps = 43/766 (5%)
Query: 47 PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGI 106
P+ +PFCNTSL RA L+S LTLQEK +QL ++A+ I RLG+P YEWWSE+LHG+
Sbjct: 58 PNSSQFPFCNTSLPYQDRASDLVSRLTLQEKAKQLINSATGISRLGVPDYEWWSEALHGV 117
Query: 107 ASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWA 166
+++G GV+F+ + +VT FP V++SAASFN SLW +G V+ E RAMYN+GQAGLT+W+
Sbjct: 118 SNSGIGVHFHDPIPAVTIFPAVILSAASFNESLWYTMGQVVSTEGRAMYNVGQAGLTYWS 177
Query: 167 PNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKG 226
PN+NIFRDPRWGRGQETPGEDP+VVS YAV +V R L+
Sbjct: 178 PNVNIFRDPRWGRGQETPGEDPLVVSRYAVNYV----------------------RGLQE 215
Query: 227 FGEESD-RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
G+E + D L +S+CCKH AYD++KW R+ F+A +T QD EDT+QPPF+SC+E+
Sbjct: 216 VGKEGNFAADRLKVSSCCKHYTAYDVDKWKGVDRFHFDAKVTLQDLEDTYQPPFKSCVEE 275
Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
G S +MCSYN+VNGVP C +L + R++WG GYI SDCD++ E NYT+T E
Sbjct: 276 GHVSSVMCSYNRVNGVPTCANPELLKGVIRDQWGLDGYIVSDCDSIMVYHERMNYTETPE 335
Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
D+ A LKAG+++NCG+ + +T++A++ GKV+E +++AL+ + V +RLG F+GDP
Sbjct: 336 DAVALALKAGLNLNCGSYLGDYTKNAVNLGKVKESIVNQALIYNYIVLMRLGFFDGDPTM 395
Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
+GK+GP DVCT +H+ LALDAA+QGIVLL N+ LPL+ N +LA+IGP + +
Sbjct: 396 LPFGKMGPSDVCTVDHQLLALDAAKQGIVLLHNNGA-LPLSPNTTKTLAVIGPNADATNT 454
Query: 465 MGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVA 524
M Y G+PC S L+GL+ YVS Y GC +V C+ + A IA AD +VV
Sbjct: 455 MLSNYAGVPCRYTSPLQGLQKYVSAVSYEKGCANVSCSEETLIEGAASIASMADATVVVV 514
Query: 525 GLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISS 584
GLDL E ED DRV+L LPG Q LV A+ + VILV+ GP+D+SF + S+I
Sbjct: 515 GLDLFIEAEDLDRVNLTLPGFQEKLVMEAAKAANGTVILVVMSAGPVDISFVKNVSKIGG 574
Query: 585 ILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGR 643
ILW+GYPG+AG A++++IFGD+NPGGR P TWYP+ + +VPM DMNMR +++ +PGR
Sbjct: 575 ILWVGYPGQAGGDAISQVIFGDYNPGGRSPFTWYPQEYVDQVPMTDMNMRPNATSNFPGR 634
Query: 644 SYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNIL----------QQ 693
+YRFYTG +Y FGHGLSY+ + YKF+ + + L NI
Sbjct: 635 TYRFYTGKSLYQFGHGLSYSTF-YKFIKSAPTTVLVHLLPQMDMPNIFSSNYPTMPNPNT 693
Query: 694 TGSRLDYVHIDEVTSCTSL-RFHVQISVTNAGDVDGSHVVMLFARVPKVS-QGTPEKQLI 751
G +D ID C +L + I V NAG++DG+HVV+ F + P+ +G P +L+
Sbjct: 694 NGQAIDISAID----CRNLSNIDIVIGVKNAGEIDGTHVVLAFWKPPRSGVRGAPGVELV 749
Query: 752 GFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
GF+RV ++ + +D C ++S ++ G+R L +G H L+VG
Sbjct: 750 GFERVEVKRGKTEMVGMRLDVCGKISNVDEEGKRKLVMGMHTLVVG 795
>gi|356572781|ref|XP_003554544.1| PREDICTED: probable beta-D-xylosidase 2-like [Glycine max]
Length = 771
Score = 729 bits (1883), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 349/770 (45%), Positives = 487/770 (63%), Gaps = 39/770 (5%)
Query: 42 FPCKPPH--FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWW 99
F C P + FC SL+I+ R K LI LTL+EK++ L +NA+A+PRLG+ YEWW
Sbjct: 27 FACDPKNGGTKKMAFCKVSLAIAERVKDLIGRLTLEEKVRLLVNNAAAVPRLGMKGYEWW 86
Query: 100 SESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ 159
SE+LHG+++ GP V FN + TSFPQV+ +AASFN SLW IG V+ EARAMYN G
Sbjct: 87 SEALHGVSNLGPAVKFNAQFPAATSFPQVITTAASFNASLWEAIGQVVSDEARAMYNGGT 146
Query: 160 AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFR 219
AGLT+W+PN+NIFRDPRWGRGQETPGEDP++ YA +V+ QG +
Sbjct: 147 AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLAGTYAATYVRGLQGTH------------- 193
Query: 220 EKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPF 279
+ L ++ACCKH AYDL+ W R+ FNA +++QD EDTF PF
Sbjct: 194 --------------ANRLKVAACCKHFTAYDLDNWNGMDRFHFNAQVSKQDIEDTFDVPF 239
Query: 280 RSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFEYQN 338
+ C+ +GK + +MCSYNQVNGVP C +L +K R W GYI SDCD+V ++ Q+
Sbjct: 240 KMCVSEGKVASVMCSYNQVNGVPTCADPNLLKKTVRGLWQLDGYIVSDCDSVGVFYDNQH 299
Query: 339 YTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
YT T E++AA +KAG+D++CG + HTQ+A+ KG + E D++ AL+N +VQ+RLG+F
Sbjct: 300 YTPTPEEAAADAIKAGLDLDCGPFLAVHTQNAVKKGLLSEADVNGALVNTLTVQMRLGMF 359
Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
+G+P YG LGP DVC H++LAL+AARQGIVLLKN LPL+ ++A+IGP
Sbjct: 360 DGEPTAHPYGHLGPKDVCKPAHQELALEAARQGIVLLKNTGPVLPLSSQLHRTVAVIGPN 419
Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKAD 518
M G Y G+ C + L+G+ Y ++T + GC +V C +D F A+ A++AD
Sbjct: 420 SKATITMIGNYAGVACGYTNPLQGIGRY-ARTVHQLGCQNVACKNDKLFGPAINAARQAD 478
Query: 519 FVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEA 578
++V GLD + E E DR LLLPG+Q LV+ VA SK P ILVL GGP+D++FA+
Sbjct: 479 ATVLVMGLDQSIEAETVDRTGLLLPGRQPDLVSKVAAASKGPTILVLMSGGPVDITFAKN 538
Query: 579 DSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSS 637
+ +I ILW GYPG+AG A+A+I+FG NPGG+LP+TWYPE + TK+PM +M MRA S
Sbjct: 539 NPRIVGILWAGYPGQAGGAAIADILFGTANPGGKLPVTWYPEEYLTKLPMTNMAMRATKS 598
Query: 638 RQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSR 697
YPGR+YRFY G VY FGHGL+YT++ + SAP+ +S L N+ +
Sbjct: 599 AGYPGRTYRFYNGPVVYPFGHGLTYTHFVHTLASAPT--VVSVPLNGHRRANVTNISNRA 656
Query: 698 LDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPK-VSQGTPEKQLIGFDRV 756
+ H C L +Q+ + N G DG+H +++F+ P EKQL+ F++V
Sbjct: 657 IRVTH----ARCDKLSITLQVDIKNVGSRDGTHTLLVFSAPPAGFGHWALEKQLVAFEKV 712
Query: 757 HTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
H AKG + + C+ LS+ ++ G R +PLG H +G+++HS++++
Sbjct: 713 HVPAKGQHRVGVNIHVCKLLSVVDRSGIRRIPLGEHSFNIGDVKHSVSLQ 762
>gi|298364130|gb|ADI79208.1| alpha-L-arabinofuranosidase/beta-D-xylosidase [Malus x domestica]
Length = 774
Score = 729 bits (1882), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 359/773 (46%), Positives = 490/773 (63%), Gaps = 40/773 (5%)
Query: 39 KPDFPCKP--PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAY 96
+P F C P P + FC + I R + LI LTLQEKI L +NA A+PRLGI Y
Sbjct: 29 RPPFACDPRNPITRTLKFCRVRVPIHVRVQDLIGRLTLQEKIGLLVNNAIAVPRLGIQGY 88
Query: 97 EWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN 156
EWWSE+LHG+++ GPG F GT TSFPQV+ +AASFN SLW IG V+ EARAMYN
Sbjct: 89 EWWSEALHGVSNVGPGTKF-GTFLGATSFPQVITTAASFNESLWEEIGRVVSDEARAMYN 147
Query: 157 LGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGF 216
G AGLTFW+PN+NIFRDPRWGRGQETPGEDP++ + Y +VK QG DG
Sbjct: 148 GGAAGLTFWSPNVNIFRDPRWGRGQETPGEDPILAAKYGARYVKGLQG------DGA--- 198
Query: 217 GFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQ 276
G+ L ++ACCKH AYDL+ W R+ FNA +++QD EDT+
Sbjct: 199 -----------------GNRLKVAACCKHYTAYDLDNWNGVDRFHFNARVSKQDLEDTYN 241
Query: 277 PPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFE 335
PFR+C+ G + +MCSYNQVNG P C +L + R +W GYI SDCD+V ++
Sbjct: 242 VPFRACVVDGNVASVMCSYNQVNGKPTCADPELLKGTIRGQWKLNGYIVSDCDSVGVYYD 301
Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
Q+YTKT E++AA +KAG+D++CG + HT++A+ G+V E DI+ AL N +VQ+RL
Sbjct: 302 NQHYTKTPEEAAAYAIKAGLDLDCGPFLGIHTEAAVRFGQVNEIDINYALANTITVQMRL 361
Query: 396 GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAII 455
G+F+G+P +YG LG DVC +LAL+AARQGIVLL+N LPL+ ++A+I
Sbjct: 362 GMFDGEPSAQRYGNLGLADVCKPSSNELALEAARQGIVLLENRGNSLPLSTMRHRTVAVI 421
Query: 456 GPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAK 515
GP + M G Y GI C + L+G+ Y H A GC DV CN + A A+
Sbjct: 422 GPNSDVTETMIGNYAGIACGYTTPLQGIARYTRTIHQA-GCTDVHCNGNQLIGAAEVAAR 480
Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
+AD ++V GLD + E E RDR LLLPG Q LV+ VAR S+ P ILV+ GGP+DV+F
Sbjct: 481 QADATVLVIGLDQSIEAEFRDRTDLLLPGHQQELVSRVARASRGPTILVIMSGGPIDVTF 540
Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRA 634
A+ D +I +I+W+GYPG+AG A+A+++FG NP G+LPMTWYP+++ +PM DM MRA
Sbjct: 541 AKNDPRIGAIIWVGYPGQAGGTAIADVLFGTTNPSGKLPMTWYPQNYVANLPMTDMAMRA 600
Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS-ASLKAGSDKNILQQ 693
D +R YPGR+YRFY G V+ FG GLSYT +S+ P+ +++ SL A + +L
Sbjct: 601 DPARGYPGRTYRFYKGPVVFPFGLGLSYTRFSHSLAQGPTLVSVPFTSLVASKNTTMLGN 660
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
R+ + T+C SL V I + N+G +DG+H +++FA P + P KQL+GF
Sbjct: 661 HDIRVSH------TNCDSLSLDVHIDIKNSGTMDGTHTLLVFA-TPPTGKWAPNKQLVGF 713
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
+VH VA + + GV C+ LS+ ++ G R +PLG H L +G+L+H +++E
Sbjct: 714 HKVHIVAGSERRVRVGVQVCKHLSVVDELGIRRIPLGQHKLEIGDLQHHVSVE 766
>gi|357444469|ref|XP_003592512.1| Xylosidase [Medicago truncatula]
gi|355481560|gb|AES62763.1| Xylosidase [Medicago truncatula]
Length = 781
Score = 729 bits (1881), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 365/755 (48%), Positives = 495/755 (65%), Gaps = 30/755 (3%)
Query: 47 PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGI 106
P ++PFCNTSLS TRAK L+S LTLQEK QQL + ++ I RLG+PAYEWWSE+LHG+
Sbjct: 31 PKTSNFPFCNTSLSYETRAKDLVSRLTLQEKAQQLVNPSTGISRLGVPAYEWWSEALHGV 90
Query: 107 ASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWA 166
++ GPG F+ V TSFP V++SAASFN +LW +G V+ EARAMYN+ AGLTFW+
Sbjct: 91 SNVGPGTRFDSRVPGATSFPAVILSAASFNETLWYTMGQVVSNEARAMYNVDLAGLTFWS 150
Query: 167 PNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKG 226
PN+N+FRDPRWGRGQETPGEDP+VVS YAV +V+ Q +G
Sbjct: 151 PNVNVFRDPRWGRGQETPGEDPLVVSRYAVNYVRGLQE---------VG----------- 190
Query: 227 FGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQG 286
E S +GD L +S+CCKH AYD++ W R+ F+A +T+QD EDT+QPPF+SC+ +G
Sbjct: 191 -DEASAKGDRLKVSSCCKHYTAYDVDNWKGVDRFHFDAKVTKQDLEDTYQPPFKSCVLEG 249
Query: 287 KASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHED 345
S +MCSYN+VNG+P C DL Q R +WG GYI SDCD+V + +YTKT ED
Sbjct: 250 HVSSVMCSYNRVNGIPTCADPDLLQGVIRGQWGLDGYIVSDCDSVEVYYNSIHYTKTPED 309
Query: 346 SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG 405
+ A LKAG+++NCG + ++T +A++ KV +D+AL+ + V +RLG F +P+
Sbjct: 310 AVALALKAGLNMNCGDFLKKYTANAVNLKKVDVSIVDQALVYNYIVLMRLGFFE-NPKSL 368
Query: 406 KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQM 465
+ LGP DVCT E+++LAL+AA+QGIVLL+N+K LPL+K + +LA+IGP N + M
Sbjct: 369 PFANLGPSDVCTKENQQLALEAAKQGIVLLENNKGALPLSKTKIKNLAVIGPNANATTVM 428
Query: 466 GGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG 525
Y GIPC S L+GL+ Y+S YA GC DV C++ F AV+ A AD V++V G
Sbjct: 429 ISNYAGIPCRYSSPLQGLQKYISSVTYARGCSDVKCSNQNLFAAAVKAAASADAVVLVVG 488
Query: 526 LDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSI 585
LD + E E DRV+L LPG Q LV VA +K +ILV+ GP+D+SF ++ S I I
Sbjct: 489 LDQSIEAEGLDRVNLTLPGFQEKLVKDVAAATKGTLILVIMAAGPIDISFTKSVSNIGGI 548
Query: 586 LWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRS 644
LW+GYPG+ G A+A++IFGD+NPGGR P TWYP+S+ +VPM DMNMRA+SSR +PGR+
Sbjct: 549 LWVGYPGQDGGNAIAQVIFGDYNPGGRSPFTWYPQSYVDQVPMTDMNMRANSSRNFPGRT 608
Query: 645 YRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHID 704
YRFY G +Y FG+GLSY+ +S SAPS + + + N + +D I
Sbjct: 609 YRFYNGKSLYEFGYGLSYSTFSTHIASAPSTIMLQKNTSISKPLNNIFLDDQVIDISTI- 667
Query: 705 EVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVP--KVSQGTPEKQLIGFDRVHTVAKG 762
SC +L F + I V N G DGSHVV++F P + G P KQLIGF+R
Sbjct: 668 ---SCFNLTFSLVIGVKNNGPFDGSHVVLVFLEPPSSEAVSGVPLKQLIGFERAQVKVGK 724
Query: 763 SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
++ ++ +D C+ LS + G+R L +G H ++VG
Sbjct: 725 TEFVTVKIDICKMLSNVDSDGKRKLVIGQHNILVG 759
>gi|357130854|ref|XP_003567059.1| PREDICTED: probable beta-D-xylosidase 2-like [Brachypodium
distachyon]
Length = 779
Score = 729 bits (1881), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 357/778 (45%), Positives = 491/778 (63%), Gaps = 38/778 (4%)
Query: 38 NKPDFPCKPPHFDS-YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAY 96
+P F C P + PFC +L RA+ L++ LT EK++ L +NA+ +PRLG+ Y
Sbjct: 23 TRPPFACAPGGPSTRLPFCRQALPPRARARDLVARLTRAEKVRLLVNNAAGVPRLGVEGY 82
Query: 97 EWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN 156
EWWSE+LHG++ GPGV F G T+FPQV+ +AASFN SLW IG AV+ E RA+YN
Sbjct: 83 EWWSEALHGVSDTGPGVRFGGAFPGATAFPQVIGTAASFNASLWELIGRAVSDEGRAIYN 142
Query: 157 LGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGF 216
QAGLTFW+PN+NIFRDPRWGRGQETPGEDP V YA +V+ Q ++
Sbjct: 143 GRQAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVSGRYAAAYVRGLQQQH---------- 192
Query: 217 GFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQ 276
L +ACCKH AYDL++W R+ FNA++T QD EDTF
Sbjct: 193 -----------------AGRLKTAACCKHFTAYDLDRWSGADRFHFNAIVTPQDLEDTFN 235
Query: 277 PPFRSCIEQGKASCIMCSYNQVNGVPACL-RGDLFQKARNEWGFKGYITSDCDAVATIFE 335
PFR+C+ +G+A+ +MCSYNQVNGVP C +G L R +W GYI SDCD+V +
Sbjct: 236 APFRACVVEGRAAAVMCSYNQVNGVPTCADQGFLRGTIRGKWKLDGYIVSDCDSVDVFYR 295
Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
Q+YT+T ED+ A L+AG+D++CG + ++T++A+ +GKV+E DID A++N +VQ+RL
Sbjct: 296 EQHYTRTREDAVAATLRAGLDLDCGPFLAQYTEAAVAQGKVKEADIDAAVVNTVTVQMRL 355
Query: 396 GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKN---DKKFLPLN-KNAVSS 451
G+F+GD +G LGP VCT H++LAL+AA Q IVLLKN + LPL+ + +
Sbjct: 356 GMFDGDVAAQPFGHLGPQHVCTPAHRELALEAACQSIVLLKNGGGNNMRLPLSSHHRRGT 415
Query: 452 LAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCN-SDAGFHEA 510
+A++GP M G Y G PC+ + L+G+ Y T + +GC DV C S A
Sbjct: 416 VAVVGPHSEATVAMIGNYAGKPCAYTTPLQGVGRYARATVHQAGCTDVACQGSGQPIDAA 475
Query: 511 VRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGP 570
V A+ AD +VV GLD + E E DR +LLLPG+Q LV++VAR SK PVILVL GGP
Sbjct: 476 VDAARHADATVVVVGLDQSVEAEGLDRTTLLLPGRQAELVSAVARASKGPVILVLMSGGP 535
Query: 571 LDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMND 629
+D++FA+ D +++ILW GYPG+AG +A+A++IFG NPGG+LP+TWYPE + K PM +
Sbjct: 536 VDIAFAQNDRNVAAILWAGYPGQAGGQAIADVIFGHHNPGGKLPVTWYPEDYLRKAPMTN 595
Query: 630 MNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKN 689
M MRAD +R YPGR+YRFY G ++ FGHGLSYT +++ AP+ LT+ + +
Sbjct: 596 MAMRADPARGYPGRTYRFYAGPTIHPFGHGLSYTKFAHTLAHAPAHLTVRRAAGHRTTAA 655
Query: 690 ILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVS-QGTPEK 748
I T S L+ V + C L V + V N G DG+H V ++A P + G P +
Sbjct: 656 INTTTASHLNDVRVAH-AQCEGLSVSVHVDVKNVGSRDGAHTVFVYASPPIAAIHGAPVR 714
Query: 749 QLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG-ELRHSLTI 805
QL+ F++VH A + GVD C LSIA++ G R +P+G H LM+G EL H++ +
Sbjct: 715 QLVAFEKVHVAAGAVARVKMGVDVCGSLSIADQEGVRRIPIGEHRLMIGEELTHAVML 772
>gi|224070626|ref|XP_002303181.1| predicted protein [Populus trichocarpa]
gi|222840613|gb|EEE78160.1| predicted protein [Populus trichocarpa]
Length = 773
Score = 728 bits (1880), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/791 (47%), Positives = 505/791 (63%), Gaps = 40/791 (5%)
Query: 21 LIVVVNVIAFSNSKPVLNKPDFPC---KPPHFDSYPFCNTSLSISTRAKSLISLLTLQEK 77
L+ + ++ S + P F C P S FCNTS+ I+ R L+ LTLQEK
Sbjct: 17 LLFCMFLVFLSTHVSAQSSPVFACDVVSNPSLASLGFCNTSIGINDRVVDLVKRLTLQEK 76
Query: 78 IQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNR 137
I L ++A + RLGIP YEWWSE+LHG++ GPG +F+ V+ TSFPQV+++AASFN
Sbjct: 77 IVFLVNSAGNVSRLGIPKYEWWSEALHGVSYVGPGTHFSDDVAGATSFPQVILTAASFNT 136
Query: 138 SLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVE 197
SL+ IG V+ EARAMYN+G AGLTFW+PNINIFRDPRWGRGQETPGEDP++ S Y
Sbjct: 137 SLFEAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLSSKYGSC 196
Query: 198 FVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNF 257
+VK Q + DDG D+L ++ACCKH AYDL+ W
Sbjct: 197 YVKGLQ----QRDDG--------------------DPDKLKVAACCKHYTAYDLDNWKGS 232
Query: 258 SRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNE 316
RY FNA++T+QD +DTFQPPF+SC+ G + +MCSYNQVNG P C DL R E
Sbjct: 233 DRYHFNAVVTKQDMDDTFQPPFKSCVIDGNVASVMCSYNQVNGKPTCADPDLLSGVIRGE 292
Query: 317 WGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKV 376
W GYI +DCD++ ++ QNYTKT E++AA + AG+D+NCG+ + +HT++A+ G V
Sbjct: 293 WNLNGYIVTDCDSLDVFYKSQNYTKTPEEAAAAAILAGVDLNCGSFLGQHTEAAVKGGLV 352
Query: 377 QEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLK 436
E ID A+ N F+ +RLG F+GDP K YGKLGP DVCT+E+++LA +AARQGIVLLK
Sbjct: 353 NEHAIDIAVSNNFATLMRLGFFDGDPSKQLYGKLGPKDVCTAENQELAREAARQGIVLLK 412
Query: 437 NDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGC 496
N LPL+ A+ +LA+IGP N M G Y G PC + L+GL A V+ T Y GC
Sbjct: 413 NTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAASVATT-YLPGC 471
Query: 497 HDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVART 556
+V C S A +A ++A AD ++V G DL+ E E RDRV +LLPGQQ L+T+VA
Sbjct: 472 SNVAC-STAQVDDAKKLAAAADATVLVMGADLSIEAESRDRVDVLLPGQQQLLITAVANV 530
Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
S PVILV+ GG +DVSFA + +I+SILW+GYPGEAG A+A+IIFG +NP GRLPMT
Sbjct: 531 SCGPVILVIMSGGGMDVSFARTNDKITSILWVGYPGEAGGAAIADIIFGYYNPSGRLPMT 590
Query: 617 WYPESFT-KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSE 675
WYP+S+ KVPM +MNMR D S YPGR+YRFYTG VY FG GLSY+ ++++ + AP
Sbjct: 591 WYPQSYVDKVPMTNMNMRPDPSNGYPGRTYRFYTGETVYSFGDGLSYSQFTHELIQAPQL 650
Query: 676 LTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLF 735
+ + + Q + +C + F + + V N G + GSH V LF
Sbjct: 651 VYVPLEESHVCHSSECQSV--------VASEQTCQNSTFDMLLRVKNEGTISGSHTVFLF 702
Query: 736 ARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLM 795
+ P V +P+K L+GF++V A+ + + F VD C+ LS+ ++ G + + LG HVL
Sbjct: 703 SSPPAV-HNSPQKHLVGFEKVFLNAQTGRHVRFKVDICKDLSVVDELGSKKVALGEHVLH 761
Query: 796 VGELRHSLTIE 806
VG L+H L++
Sbjct: 762 VGSLKHFLSVR 772
>gi|147844622|emb|CAN82161.1| hypothetical protein VITISV_035506 [Vitis vinifera]
Length = 925
Score = 728 bits (1879), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 357/766 (46%), Positives = 497/766 (64%), Gaps = 43/766 (5%)
Query: 47 PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGI 106
P+ +PFCNTSL RA L+S LTLQEK +QL ++A+ I RLG+P YEWWSE+LHG+
Sbjct: 34 PNSSQFPFCNTSLPYQDRASDLVSRLTLQEKAKQLINSATGISRLGVPDYEWWSEALHGV 93
Query: 107 ASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWA 166
+++G GV+F+ + +VT FP V++SAASFN SLW +G V+ E RAMYN+GQAGLT+W+
Sbjct: 94 SNSGIGVHFHDPIPAVTIFPAVILSAASFNESLWYTMGQVVSTEGRAMYNVGQAGLTYWS 153
Query: 167 PNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKG 226
PN+NIFRDPRWGRGQETPGEDP+VVS YAV +V R L+
Sbjct: 154 PNVNIFRDPRWGRGQETPGEDPLVVSRYAVNYV----------------------RGLQE 191
Query: 227 FGEESD-RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
G+E + D L +S+CCKH AYD++KW R+ F+A +T QD EDT+QPPF+ C+E+
Sbjct: 192 VGKEGNFAADRLKVSSCCKHYTAYDVDKWKGVDRFHFDAKVTLQDLEDTYQPPFKXCVEE 251
Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
G S +MCSYN+VNGVP C +L + R++WG GYI SDCD++ E NYT+T E
Sbjct: 252 GHVSSVMCSYNRVNGVPTCANPELLKGVIRDQWGLDGYIVSDCDSIMVYHERMNYTETPE 311
Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
D+ A LKAG+++NCG+ + +T++A++ GKV+E +B+AL+ + V +RLG F+GDP
Sbjct: 312 DAVALALKAGLNLNCGSYLGDYTKNAVNLGKVKESIVBQALIYNYIVLMRLGFFDGDPTM 371
Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
+GK+GP DVCT +H+ LALDAA+QGIVLL N+ LPL+ N +LA+IGP + +
Sbjct: 372 LPFGKMGPSDVCTVDHQLLALDAAKQGIVLLHNNGA-LPLSPNTTKTLAVIGPNADATNT 430
Query: 465 MGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVA 524
M Y G+PC S L+GL+ YVS Y GC +V C+ + A IA AD +VV
Sbjct: 431 MLSNYAGVPCRYTSPLQGLQKYVSAVSYEKGCANVSCSEETLIEGAASIASMADATVVVV 490
Query: 525 GLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISS 584
GLDL E ED DRV+L LPG Q LV A+ + VILV+ GP+D+SF + S+I
Sbjct: 491 GLDLFIEAEDLDRVNLTLPGFQEKLVMEAAKAANGTVILVVMSAGPVDISFVKNVSKIGG 550
Query: 585 ILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGR 643
ILW+GYPG+AG A++++IFGD+NPGGR P TWYP+ + +VPM DMNMR +++ +PGR
Sbjct: 551 ILWVGYPGQAGGDAISQVIFGDYNPGGRSPFTWYPQEYVDQVPMTDMNMRPNATXNFPGR 610
Query: 644 SYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNIL----------QQ 693
+YRFYTG +Y FGHGLSY+ + YKF+ + + L NI
Sbjct: 611 TYRFYTGKSLYQFGHGLSYSTF-YKFIKSAPXTVLVHLLPQMDMPNIFSSNYPTMPNPNT 669
Query: 694 TGSRLDYVHIDEVTSCTSL-RFHVQISVTNAGDVDGSHVVMLFARVPKVS-QGTPEKQLI 751
G +D ID C +L + I V NAG++DG+HVV+ F + P+ +G P +L+
Sbjct: 670 NGQAIDISAID----CRNLSNIDIVIGVKNAGEIDGTHVVLAFWKPPRSGVRGAPGVELV 725
Query: 752 GFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
GF+RV ++ + +D C ++S ++ G+R L +G H L+VG
Sbjct: 726 GFERVEVKRGKTEMVGMRLDVCGKISNVDEEGKRKLVMGMHTLVVG 771
>gi|449436749|ref|XP_004136155.1| PREDICTED: probable beta-D-xylosidase 2-like [Cucumis sativus]
Length = 772
Score = 727 bits (1877), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 356/796 (44%), Positives = 505/796 (63%), Gaps = 45/796 (5%)
Query: 17 RVLQLIVVVNVIAFSNSKPVLNKPDFPCKPPH--FDSYPFCNTSLSISTRAKSLISLLTL 74
R++ ++++++ I F + + F C P YPFC +L I R K LI LTL
Sbjct: 7 RIIPILIILSAI-FRHGGGA--REPFACDPKDAALSRYPFCRVALPIPERVKDLIGRLTL 63
Query: 75 QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
QEK++ L +NA+A+PRLGI YEWWSE+LHG+++ GPG F G TSFPQV+ + AS
Sbjct: 64 QEKVRLLVNNAAAVPRLGIKGYEWWSEALHGVSNVGPGTEFGGDFPGATSFPQVITTVAS 123
Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAY 194
FN SLW IG V+ EARAMYN G AGLT+W+PN+NIFRDPRWGRGQETPGEDP+V Y
Sbjct: 124 FNVSLWEAIGRVVSDEARAMYNGGAAGLTYWSPNVNIFRDPRWGRGQETPGEDPVVAGEY 183
Query: 195 AVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKW 254
A ++K QG + GD L ++ACCKH AYDL+ W
Sbjct: 184 AARYIKGLQGND---------------------------GDRLKVAACCKHFTAYDLDNW 216
Query: 255 GNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA- 313
R+ FNA +T QD DTF+ PFR C+++GK + +MCSYNQVNGVP C +L +
Sbjct: 217 NGTDRFHFNAKVTRQDMVDTFEVPFRKCVKEGKVASVMCSYNQVNGVPTCADPNLLKGTI 276
Query: 314 RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDK 373
RN+WG GYI SDCD+V ++ Q+YT T E++AA +KAG+D++CG + HT+ A+ K
Sbjct: 277 RNQWGLNGYIVSDCDSVGVFYDNQHYTSTAEEAAADAIKAGLDLDCGPFLAVHTEDAVKK 336
Query: 374 GKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIV 433
G + + I+ AL N +VQ+RLG+F+G P YGKLGP +VC+ H++LALDAARQGIV
Sbjct: 337 GLLTQTHINNALANTITVQMRLGMFDGAPSSHAYGKLGPKNVCSPSHQQLALDAARQGIV 396
Query: 434 LLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYA 493
LLKN LPL+ + ++A+IGP + M G Y G+ C + L G++ Y + H
Sbjct: 397 LLKNRLPGLPLSADHHRTVAVIGPNSDVNVTMIGNYAGVACGYVTPLEGIKRYTTVVH-R 455
Query: 494 SGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSV 553
GC +V C +D F +A+ A AD ++V GLD + E E +DR LLLPG+Q LV V
Sbjct: 456 KGCDNVACATDYSFTDALAAASTADATVLVMGLDQSVEAETKDRDGLLLPGRQQELVLKV 515
Query: 554 ARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRL 613
A S+ P +++L GGP+DVSFA+ D +IS+ILW+GYPG+AG A+A+++FG NPGG+L
Sbjct: 516 AAASRGPTVVILMSGGPIDVSFADNDPRISAILWVGYPGQAGGAAIADVLFGTTNPGGKL 575
Query: 614 PMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSA 672
PMTWYP+S+ + +PM +M MR+ SS YPGR+YRFY G VY FGHGLSYTN+ + + A
Sbjct: 576 PMTWYPQSYLSNLPMTNMAMRSTSS--YPGRTYRFYAGPVVYEFGHGLSYTNFIHTIVKA 633
Query: 673 PSELTISAS-LKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHV 731
P+ ++IS S + + L R+ + C L + + V N GD DG H
Sbjct: 634 PTIVSISLSGHRQTHSASTLSSKAIRVTH------AKCQKLSLVIHVDVENKGDRDGFHT 687
Query: 732 VMLFARVP-KVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLG 790
+++F+ P + P KQL+ F+++H ++ + + V C+ LS+ +K G R +PLG
Sbjct: 688 MLVFSTPPANGATWVPRKQLVAFEKLHLASREKRRLQVHVHVCKYLSVVDKLGVRRIPLG 747
Query: 791 NHVLMVGELRHSLTIE 806
+H + +G ++H+++++
Sbjct: 748 DHYIHIGNVKHTVSLQ 763
>gi|413919688|gb|AFW59620.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 773
Score = 725 bits (1872), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 369/770 (47%), Positives = 496/770 (64%), Gaps = 40/770 (5%)
Query: 40 PDFPCKPPH--FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYE 97
P F C + SY FCN S + + RA L+S LTL EK+ L D +A+PRLG+P YE
Sbjct: 38 PAFACDASNATLASYGFCNRSAAAAARAADLVSRLTLAEKVGFLVDKQAALPRLGVPLYE 97
Query: 98 WWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL 157
WWSE+LHG++ GPG F+ V TSFPQ +++AASFN +L+ IG V+ EARAM+N+
Sbjct: 98 WWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNATLFRAIGEVVSNEARAMHNV 157
Query: 158 GQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFG 217
G AGLTFW+PNINIFRDPRWGRGQETPGEDP++ S YAV +V QG
Sbjct: 158 GLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLTSKYAVGYVTGLQG------------- 204
Query: 218 FREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQP 277
+ G G L ++ACCKH AYD++ W RY+F+A++++QD +DTFQP
Sbjct: 205 -----AVSGAGA-------LKVAACCKHYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQP 252
Query: 278 PFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEY 336
PF+SC+ G + +MCSYNQVNG P C DL R +W GYI+SDCD+V ++
Sbjct: 253 PFKSCVVDGNVASVMCSYNQVNGKPTCADKDLLSGVIRGDWKLNGYISSDCDSVDVLYNN 312
Query: 337 QNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
Q+YTKT ED+AA +KAG+D+NCGT + +HT +A+ GK+ E D+DRA+ N +RLG
Sbjct: 313 QHYTKTPEDAAAISIKAGLDLNCGTFLAQHTVAAVQAGKLSESDVDRAVTNNLVTLMRLG 372
Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIG 456
F+GDPR+ +G LGP DVCT +++LA +AARQGIVLLKN K LPL+ ++ S+A+IG
Sbjct: 373 FFDGDPRELPFGNLGPSDVCTPSNQELAREAARQGIVLLKNTGK-LPLSATSIKSMAVIG 431
Query: 457 PLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDA-GFHEAVRIAK 515
P N M G Y G PC + L+GL A V+ T Y GC +V C+ ++ A + A
Sbjct: 432 PNANASFTMIGNYEGTPCKYTTPLQGLGANVA-TVYQPGCTNVGCSGNSLQLDAATKAAA 490
Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
AD ++V G D + E E DR SLLLPGQQ LV++VA S P ILV+ GGP D+SF
Sbjct: 491 SADVTVLVVGADQSIERESLDRTSLLLPGQQPQLVSAVANASSGPCILVVMSGGPFDISF 550
Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRAD 635
A++ +I++ILW+GYPGEAG A+A+++FG NP GRLP+TWYPESFTKVPM DM MR D
Sbjct: 551 AKSSDKIAAILWVGYPGEAGGAAIADVLFGYHNPSGRLPVTWYPESFTKVPMTDMRMRPD 610
Query: 636 SSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTG 695
S YPGR+YRFYTG VY FG GLSYT++++ +SAP +L +L+ L +
Sbjct: 611 PSTGYPGRTYRFYTGDTVYAFGDGLSYTSFAHHLVSAPKQL----ALQLAEGHACLTEQC 666
Query: 696 SRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDR 755
++ E C L F V + V NAG+ G H V LF+ P V P K L+GF++
Sbjct: 667 PSVEA----EGAHCEGLAFDVHLRVRNAGERSGGHTVFLFSSPPAV-HNAPAKHLLGFEK 721
Query: 756 VHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
V + ++F VD C+ LS+ ++ G R + LG+H L VG+L+H+L +
Sbjct: 722 VSLEPGQAGVVAFKVDVCKDLSVVDELGNRKVALGSHTLHVGDLKHTLNL 771
>gi|15242492|ref|NP_196535.1| beta-xylosidase 3 [Arabidopsis thaliana]
gi|75264323|sp|Q9LXD6.1|BXL3_ARATH RecName: Full=Beta-D-xylosidase 3; Short=AtBXL3; AltName:
Full=Alpha-L-arabinofuranosidase; Flags: Precursor
gi|7671416|emb|CAB89357.1| beta-xylosidase-like protein [Arabidopsis thaliana]
gi|9759004|dbj|BAB09531.1| beta-xylosidase [Arabidopsis thaliana]
gi|15450735|gb|AAK96639.1| AT5g09730/F17I14_80 [Arabidopsis thaliana]
gi|332004056|gb|AED91439.1| beta-xylosidase 3 [Arabidopsis thaliana]
Length = 773
Score = 725 bits (1872), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/804 (46%), Positives = 496/804 (61%), Gaps = 45/804 (5%)
Query: 11 RERILFRVLQLIVVVNVIAFSNSKPVLNKPDFPCK---PPHFDSYPFCNTSLSISTRAKS 67
R R LF V L + ++ S + P F C P FCN LSI R
Sbjct: 4 RNRALFSVSTLFLCF-IVCISEQSNNQSSPVFACDVTGNPSLAGLRFCNAGLSIKARVTD 62
Query: 68 LISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQ 127
L+ LTL+EKI L+ A + RLGIP+Y+WWSE+LHG+++ G G F G V TSFPQ
Sbjct: 63 LVGRLTLEEKIGFLTSKAIGVSRLGIPSYKWWSEALHGVSNVGGGSRFTGQVPGATSFPQ 122
Query: 128 VLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGED 187
V+++AASFN SL+ IG V+ EARAMYN+G AGLTFW+PN+NIFRDPRWGRGQETPGED
Sbjct: 123 VILTAASFNVSLFQAIGKVVSTEARAMYNVGSAGLTFWSPNVNIFRDPRWGRGQETPGED 182
Query: 188 PMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD--ELMLSACCKH 245
P + S YAV +VK Q E+D GD L ++ACCKH
Sbjct: 183 PTLSSKYAVAYVKGLQ--------------------------ETDGGDPNRLKVAACCKH 216
Query: 246 LIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACL 305
AYD++ W N +R +FNA++ +QD DTFQPPF+SC+ G + +MCSYNQVNG P C
Sbjct: 217 YTAYDIDNWRNVNRLTFNAVVNQQDLADTFQPPFKSCVVDGHVASVMCSYNQVNGKPTCA 276
Query: 306 RGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML 364
DL R +W GYI SDCD+V +F Q+Y KT E++ A L AG+D+NC
Sbjct: 277 DPDLLSGVIRGQWQLNGYIVSDCDSVDVLFRKQHYAKTPEEAVAKSLLAGLDLNCDHFNG 336
Query: 365 RHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLA 424
+H A+ G V E ID+A+ N F+ +RLG F+GDP+K YG LGP DVCT+++++LA
Sbjct: 337 QHAMGAVKAGLVNETAIDKAISNNFATLMRLGFFDGDPKKQLYGGLGPKDVCTADNQELA 396
Query: 425 LDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLE 484
D ARQGIVLLKN LPL+ +A+ +LA+IGP N M G Y G+PC + L+GL
Sbjct: 397 RDGARQGIVLLKNSAGSLPLSPSAIKTLAVIGPNANATETMIGNYHGVPCKYTTPLQGLA 456
Query: 485 AYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPG 544
VS T Y GC +V C DA AV +A AD V++V G D + E E DRV L LPG
Sbjct: 457 ETVSST-YQLGC-NVAC-VDADIGSAVDLAASADAVVLVVGADQSIEREGHDRVDLYLPG 513
Query: 545 QQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIF 604
+Q LVT VA ++ PV+LV+ GG D++FA+ D +I+SI+W+GYPGEAG A+A++IF
Sbjct: 514 KQQELVTRVAMAARGPVVLVIMSGGGFDITFAKNDKKITSIMWVGYPGEAGGLAIADVIF 573
Query: 605 GDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYT 663
G NP G LPMTWYP+S+ KVPM++MNMR D S+ YPGRSYRFYTG VY F L+YT
Sbjct: 574 GRHNPSGNLPMTWYPQSYVEKVPMSNMNMRPDKSKGYPGRSYRFYTGETVYAFADALTYT 633
Query: 664 NYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYV--HIDEVTSCTSLRFHVQISVT 721
+ ++ + AP +++S + Q LD + H + S F V ++V
Sbjct: 634 KFDHQLIKAPRLVSLSLDENHPCRSSECQS----LDAIGPHCENAVEGGS-DFEVHLNVK 688
Query: 722 NAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANK 781
N GD GSH V LF P+V G+P KQL+GF+++ + F V+ C+ LS+ ++
Sbjct: 689 NTGDRAGSHTVFLFTTSPQV-HGSPIKQLLGFEKIRLGKSEEAVVRFNVNVCKDLSVVDE 747
Query: 782 HGRRILPLGNHVLMVGELRHSLTI 805
G+R + LG+H+L VG L+HSL I
Sbjct: 748 TGKRKIALGHHLLHVGSLKHSLNI 771
>gi|357511337|ref|XP_003625957.1| Beta-xylosidase [Medicago truncatula]
gi|355500972|gb|AES82175.1| Beta-xylosidase [Medicago truncatula]
Length = 771
Score = 725 bits (1871), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 348/758 (45%), Positives = 489/758 (64%), Gaps = 40/758 (5%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
PFCN L+I R K LI LT+QEK+ L +NA A+PR+G+ +YEWWSE+LHG+++ GPG
Sbjct: 41 PFCNVKLAIPERVKDLIGRLTMQEKVNLLVNNAPAVPRVGMKSYEWWSEALHGVSNVGPG 100
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
F G + TSFPQV+ +AASFN SLW IG V+ EARAMYN G AGLT+W+PN+NIF
Sbjct: 101 TRFGGVFPAATSFPQVITTAASFNASLWEAIGRVVSDEARAMYNGGAAGLTYWSPNVNIF 160
Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
RDPRWGRGQETPGEDP++ YA +VK QG +
Sbjct: 161 RDPRWGRGQETPGEDPVLAGRYAASYVKGLQGTD-------------------------- 194
Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
G++L ++ACCKH AYD++ W R+ FNA++++QD EDTF PFR C+++GK + +M
Sbjct: 195 -GNKLKVAACCKHFTAYDVDNWNGVDRFHFNALVSKQDIEDTFDVPFRMCVKEGKVASVM 253
Query: 293 CSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVL 351
CSYNQVNGVP C +L +K R WG GYI SDCD+V ++ Q+YT T E++AA +
Sbjct: 254 CSYNQVNGVPTCADPNLLKKTVRGVWGLDGYIVSDCDSVGVLYNSQHYTSTPEEAAADAI 313
Query: 352 KAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLG 411
KAG+D++CG + HTQ A+ KG + E D++ AL+N VQ+RLG+F+G+P YG+LG
Sbjct: 314 KAGLDLDCGPFLGVHTQDAVKKGLLTEADVNNALVNTLKVQMRLGMFDGEPSAQAYGRLG 373
Query: 412 PDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTG 471
P DVC H++LAL+AARQGIVLLKN LPL+ ++A+IGP + M G Y G
Sbjct: 374 PKDVCKPAHQELALEAARQGIVLLKNTGPTLPLSPQRHRTVAVIGPNSDVTVTMIGNYAG 433
Query: 472 IPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQE 531
I C S L+G+ Y +KT + GC +V C D F A+ A+ AD I+V GLD + E
Sbjct: 434 IACGYTSPLQGIGRY-AKTIHQQGCSNVACRDDKQFGPALDAARHADATILVIGLDQSIE 492
Query: 532 TEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYP 591
E DR SLLLPG Q LV+ VA SK P ILVL GGP+D++FA+ D +++ ILW GYP
Sbjct: 493 AETVDRTSLLLPGHQQDLVSKVAAASKGPTILVLMSGGPVDITFAKNDPKVAGILWAGYP 552
Query: 592 GEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK-VPMNDMNMRADSSRQYPGRSYRFYTG 650
G+AG A+A+I+FG +PGG+LP+TWYP+ + K + M +M MR S YPGR+YRFY G
Sbjct: 553 GQAGGAAIADILFGTASPGGKLPVTWYPQEYLKNLAMTNMAMRP-SKIGYPGRTYRFYKG 611
Query: 651 TQVYGFGHGLSYTNYSYKFLSAPSELTISA-SLKAGSDKNILQQTGSRLDYVHIDEVTSC 709
VY FGHGL+YT++ ++ SAP+ +++ + G++ NI + + H C
Sbjct: 612 PVVYPFGHGLTYTHFVHELSSAPTVVSVPVHGHRHGNNTNISNKA---IRVTH----ARC 664
Query: 710 TSLRFHVQISVTNAGDVDGSHVVMLFARVPK-VSQGTPEKQLIGFDRVHTVAKGSKEISF 768
L + + V N G DG+H +++F+ P + P+K L+ F++VH AK + +
Sbjct: 665 GKLSIALHVDVKNVGSRDGTHTLLVFSAPPNGGNHWVPQKSLVAFEKVHVPAKTKQRVRV 724
Query: 769 GVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
+ C+ LS+ +K G R +P+G H L +G+++HS++++
Sbjct: 725 NIHVCKLLSVVDKSGIRRIPMGEHSLHIGDVKHSVSLQ 762
>gi|408354266|gb|AFU54452.1| alpha-L-arabinofuranosidase/beta-D-xylosidase [Prunus salicina]
Length = 775
Score = 724 bits (1870), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 358/778 (46%), Positives = 497/778 (63%), Gaps = 41/778 (5%)
Query: 36 VLNKPDFPCKP--PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGI 93
V +P F C P P FC ++ I R + LI LTLQEKI+ L +NA A+PRLGI
Sbjct: 26 VHARPPFACDPHNPITRGLKFCRVTVPIHVRVQDLIGRLTLQEKIRLLVNNAIAVPRLGI 85
Query: 94 PAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARA 153
YEWWSE+LHG+++ GPG F G TSFPQV+ +AASFN SLW IG V EARA
Sbjct: 86 QGYEWWSEALHGVSNVGPGTKFGGAFPGATSFPQVITTAASFNESLWQEIGRVVPDEARA 145
Query: 154 MYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGG 213
MYN G AGLT+W+PN+NIFRDPRWGRGQETPGEDP++ S YA +VK QG DG
Sbjct: 146 MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLASKYAARYVKGLQG------DGA 199
Query: 214 IGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTED 273
G+ L ++ACCKH AYDL+ W +R+ FNA +++QD D
Sbjct: 200 --------------------GNRLKVAACCKHYTAYDLDNWNGVNRFHFNARVSKQDLAD 239
Query: 274 TFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVAT 332
T+ PF++C+ +G + +MCSYNQVNG P C DL + R +W GYI SDCD+V
Sbjct: 240 TYNVPFKACVVEGHVASVMCSYNQVNGKPTCADPDLLKGTIRGQWRLNGYIVSDCDSVGV 299
Query: 333 IFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQ 392
++E Q+YT+T E++AA +KAG+D++CG + HT++A+ +G V + +I+ AL N +VQ
Sbjct: 300 LYEEQHYTRTPEEAAADAIKAGLDLDCGPFLAIHTEAAVRRGLVSQLEINWALANTMTVQ 359
Query: 393 LRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSL 452
+RLG+F+G+P +YG LGP DVCT H++LAL+AARQGIVLL+N + LPL+ ++
Sbjct: 360 MRLGMFDGEPSAHQYGNLGPRDVCTPAHQQLALEAARQGIVLLENRGRSLPLSIRRHRTV 419
Query: 453 AIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVR 512
A+IGP + M G Y G+ C + L+G+ Y H A GC DV CN + F A
Sbjct: 420 AVIGPNSDVTVTMIGNYAGVACGYTTPLQGIGRYTRTIHQA-GCTDVHCNGNQLFGAAEA 478
Query: 513 IAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLD 572
A++AD ++V GLD + E E DRV LLLPG Q LV+ VAR S+ P ILVL GGP+D
Sbjct: 479 AARQADATVLVMGLDQSIEAEFVDRVGLLLPGHQQELVSRVARASRGPTILVLMSGGPID 538
Query: 573 VSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMN 631
V+FA+ D +IS+I+W+GYPG+AG A+A+++FG NPGG+LPMTWYP+++ T +PM DM
Sbjct: 539 VTFAKNDPRISAIIWVGYPGQAGGTAIADVLFGTTNPGGKLPMTWYPQNYVTHLPMTDMA 598
Query: 632 MRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS-ASLKAGSDKNI 690
MRAD +R YPGR+YRFY G V+ FG GLSYT +++ P+ +++ SLKA ++ +
Sbjct: 599 MRADPARGYPGRTYRFYRGPVVFPFGLGLSYTTFAHNLAHGPTSVSVPLTSLKATANSTM 658
Query: 691 LQQTGSRLDYVHIDEVTSCTSLR-FHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQ 749
L + + H D C +L V + V N G +DG+H +++F P + KQ
Sbjct: 659 LSKA---VRVSHAD----CNALSPLDVHVDVKNTGSMDGTHTLLVFTSPPD-GKWAASKQ 710
Query: 750 LIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIET 807
L+GF ++H A + V C+ LS+ ++ G R +PLG H L +G+L H ++++T
Sbjct: 711 LVGFHKIHIAAGSETRVRIAVHVCKHLSVVDRFGIRRIPLGEHKLQIGDLSHHVSLQT 768
>gi|408354264|gb|AFU54451.1| alpha-L-arabinofuranosidase/beta-D-xylosidase [Prunus salicina]
Length = 775
Score = 724 bits (1870), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 358/778 (46%), Positives = 497/778 (63%), Gaps = 41/778 (5%)
Query: 36 VLNKPDFPCKP--PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGI 93
V +P F C P P FC ++ I R + LI LTLQEKI+ L +NA A+PRLGI
Sbjct: 26 VHARPPFACDPHNPITRGLKFCRVTVPIHVRVQDLIGRLTLQEKIRLLVNNAIAVPRLGI 85
Query: 94 PAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARA 153
YEWWSE+LHG+++ GPG F G TSFPQV+ +AASFN SLW IG V EARA
Sbjct: 86 QGYEWWSEALHGVSNVGPGTKFGGAFPGATSFPQVITTAASFNESLWQEIGRGVPDEARA 145
Query: 154 MYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGG 213
MYN G AGLT+W+PN+NIFRDPRWGRGQETPGEDP++ S YA +VK QG DG
Sbjct: 146 MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLASKYAARYVKGLQG------DGA 199
Query: 214 IGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTED 273
G+ L ++ACCKH AYDL+ W +R+ FNA +++QD D
Sbjct: 200 --------------------GNRLKVAACCKHYTAYDLDNWNGVNRFHFNARVSKQDLAD 239
Query: 274 TFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVAT 332
T+ PF++C+ +G + +MCSYNQVNG P C DL + R +W GYI SDCD+V
Sbjct: 240 TYNVPFKACVVEGHVASVMCSYNQVNGKPTCADPDLLKGTIRGQWRLNGYIVSDCDSVGV 299
Query: 333 IFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQ 392
++E Q+YT+T E++AA +KAG+D++CG + HT++A+ +G V + +I+ AL N +VQ
Sbjct: 300 LYEEQHYTRTPEEAAADAIKAGLDLDCGPFLAIHTEAAVRRGLVSQLEINWALANTMTVQ 359
Query: 393 LRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSL 452
+RLG+F+G+P +YG LGP DVCT H++LAL+AARQGIVLL+N + LPL+ ++
Sbjct: 360 MRLGMFDGEPSAHQYGNLGPRDVCTPAHQQLALEAARQGIVLLENRGRSLPLSIRRHRTV 419
Query: 453 AIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVR 512
A+IGP + M G Y G+ C + L+G+ Y H A GC DV CN + F A
Sbjct: 420 AVIGPNSDVTVTMIGNYAGVACGYTTPLQGIGRYTRTIHQA-GCTDVHCNGNQLFGAAEA 478
Query: 513 IAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLD 572
A++AD ++V GLD + E E DRV LLLPG Q LV+ VAR S+ P ILVL GGP+D
Sbjct: 479 AARQADATVLVMGLDQSIEAEFVDRVGLLLPGHQQELVSRVARASRGPTILVLMSGGPID 538
Query: 573 VSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMN 631
V+FA+ D +IS+I+W+GYPG+AG A+A+++FG NPGG+LPMTWYP+++ T +PM DM
Sbjct: 539 VTFAKNDPRISAIIWVGYPGQAGGTAIADVLFGTTNPGGKLPMTWYPQNYVTHLPMTDMA 598
Query: 632 MRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS-ASLKAGSDKNI 690
MRAD +R YPGR+YRFY G V+ FG GLSYT +++ P+ +++ SLKA ++ +
Sbjct: 599 MRADPARGYPGRTYRFYRGPVVFPFGLGLSYTTFAHNLAHGPTSVSVPLTSLKATANSTM 658
Query: 691 LQQTGSRLDYVHIDEVTSCTSLR-FHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQ 749
L + + H D C +L V + V N G +DG+H +++F P + KQ
Sbjct: 659 LSKA---VRVSHAD----CNALSPLDVHVDVKNTGSMDGTHTLLVFTSPPD-GKWAASKQ 710
Query: 750 LIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIET 807
L+GF ++H A + V C+ LS+ ++ G R +PLG H L +G+L H ++++T
Sbjct: 711 LVGFHKIHIAAGSETRVRIAVHVCKHLSVVDRFGIRRIPLGEHKLQIGDLSHHVSLQT 768
>gi|18378991|ref|NP_563659.1| beta-glucosidase [Arabidopsis thaliana]
gi|75250279|sp|Q94KD8.1|BXL2_ARATH RecName: Full=Probable beta-D-xylosidase 2; Short=AtBXL2; Flags:
Precursor
gi|14194121|gb|AAK56255.1|AF367266_1 At1g02640/T14P4_11 [Arabidopsis thaliana]
gi|23506063|gb|AAN28891.1| At1g02640/T14P4_11 [Arabidopsis thaliana]
gi|332189332|gb|AEE27453.1| beta-glucosidase [Arabidopsis thaliana]
Length = 768
Score = 724 bits (1869), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 357/797 (44%), Positives = 499/797 (62%), Gaps = 43/797 (5%)
Query: 14 ILFRVLQLIVVVNVIAFSNSKPVLNKPDFPC--KPPHFDSYPFCNTSLSISTRAKSLISL 71
IL ++ L V++ + S+S V ++ F C K + FC S+ I R + LI
Sbjct: 2 ILHKMAFLAVILFFLISSSSVCVHSRETFACDTKDAATATLRFCQLSVPIPERVRDLIGR 61
Query: 72 LTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVS 131
LTL EK+ L + A+AIPRLGI YEWWSE+LHG+++ GPG F G + TSFPQV+ +
Sbjct: 62 LTLAEKVSLLGNTAAAIPRLGIKGYEWWSEALHGVSNVGPGTKFGGVYPAATSFPQVITT 121
Query: 132 AASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVV 191
ASFN SLW +IG V+ EARAMYN G GLT+W+PN+NI RDPRWGRGQETPGEDP+V
Sbjct: 122 VASFNASLWESIGRVVSNEARAMYNGGVGGLTYWSPNVNILRDPRWGRGQETPGEDPVVA 181
Query: 192 SAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDL 251
YA +V+ QG +DR L ++ACCKH AYDL
Sbjct: 182 GKYAASYVRGLQG--------------------------NDR-SRLKVAACCKHFTAYDL 214
Query: 252 EKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQ 311
+ W R+ FNA +++QD EDTF PFR C+++G + IMCSYNQVNGVP C +L +
Sbjct: 215 DNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVKEGNVASIMCSYNQVNGVPTCADPNLLK 274
Query: 312 KA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSA 370
K RN+WG GYI SDCD+V +++ Q+YT T E++AA +KAG+D++CG + HT A
Sbjct: 275 KTIRNQWGLNGYIVSDCDSVGVLYDTQHYTGTPEEAAADSIKAGLDLDCGPFLGAHTIDA 334
Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQ 430
+ K ++E D+D AL+N +VQ+RLG+F+GD YG LGP VCT HK LAL+AA+Q
Sbjct: 335 VKKNLLRESDVDNALINTLTVQMRLGMFDGDIAAQPYGHLGPAHVCTPVHKGLALEAAQQ 394
Query: 431 GIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKT 490
GIVLLKN LPL+ ++A+IGP + M G Y G+ C S ++G+ Y ++T
Sbjct: 395 GIVLLKNHGSSLPLSSQRHRTVAVIGPNSDATVTMIGNYAGVACGYTSPVQGITGY-ART 453
Query: 491 HYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLV 550
+ GC DV C D F AV A+ AD ++V GLD + E E +DR SLLLPG+Q LV
Sbjct: 454 IHQKGCVDVHCMDDRLFDAAVEAARGADATVLVMGLDQSIEAEFKDRNSLLLPGKQQELV 513
Query: 551 TSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPG 610
+ VA+ +K PVILVL GGP+D+SFAE D +I +I+W GYPG+ G A+A+I+FG NPG
Sbjct: 514 SRVAKAAKGPVILVLMSGGPIDISFAEKDRKIPAIVWAGYPGQEGGTAIADILFGSANPG 573
Query: 611 GRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKF 669
G+LPMTWYP+ + T +PM +M+MR S++ PGR+YRFY G VY FGHGLSYT +++
Sbjct: 574 GKLPMTWYPQDYLTNLPMTEMSMRPVHSKRIPGRTYRFYDGPVVYPFGHGLSYTRFTHNI 633
Query: 670 LSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGS 729
AP + I+ + G+ +G + H C L V + VTN G DG+
Sbjct: 634 ADAPKVIPIAVRGRNGT------VSGKSIRVTH----ARCDRLSLGVHVEVTNVGSRDGT 683
Query: 730 HVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPL 789
H +++F+ P + P+KQL+ F+RVH K + + C+ LS+ ++ G R +P+
Sbjct: 684 HTMLVFS-APPGGEWAPKKQLVAFERVHVAVGEKKRVQVNIHVCKYLSVVDRAGNRRIPI 742
Query: 790 GNHVLMVGELRHSLTIE 806
G+H + +G+ H+++++
Sbjct: 743 GDHGIHIGDESHTVSLQ 759
>gi|449505346|ref|XP_004162442.1| PREDICTED: LOW QUALITY PROTEIN: probable beta-D-xylosidase 2-like
[Cucumis sativus]
Length = 772
Score = 724 bits (1868), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 355/796 (44%), Positives = 503/796 (63%), Gaps = 45/796 (5%)
Query: 17 RVLQLIVVVNVIAFSNSKPVLNKPDFPCKPPH--FDSYPFCNTSLSISTRAKSLISLLTL 74
R++ ++++++ I F + + F C P YPFC +L I R K LI LTL
Sbjct: 7 RIIPILIILSAI-FRHGGGA--REPFACDPKDAALSRYPFCRVALPIPERVKDLIGRLTL 63
Query: 75 QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
QEK++ L +NA+A+PRLGI YEWWSE+LHG+++ GPG F G TSFPQV+ + AS
Sbjct: 64 QEKVRLLVNNAAAVPRLGIKGYEWWSEALHGVSNVGPGTEFGGDFPGATSFPQVITTVAS 123
Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAY 194
FN SLW IG V+ EARAMYN G AGLT+W+PN+NIFRDPRWGRGQETPGEDP+V Y
Sbjct: 124 FNVSLWEAIGRVVSDEARAMYNGGAAGLTYWSPNVNIFRDPRWGRGQETPGEDPVVAGEY 183
Query: 195 AVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKW 254
A ++K QG + GD L ++ACCKH AYDL+ W
Sbjct: 184 AARYIKGLQGND---------------------------GDRLKVAACCKHFTAYDLDNW 216
Query: 255 GNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA- 313
R+ FNA +T QD DTF+ PFR C+++GK + +MCSYNQVNGVP C +L +
Sbjct: 217 NGTDRFHFNAKVTRQDMVDTFEVPFRKCVKEGKVASVMCSYNQVNGVPTCADPNLLKGTI 276
Query: 314 RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDK 373
RN+WG GYI SDCD+V ++ Q+YT T E++AA +KAG+D++CG + HT+ A+ K
Sbjct: 277 RNQWGLNGYIVSDCDSVGVFYDNQHYTSTAEEAAADAIKAGLDLDCGPFLAVHTEDAVKK 336
Query: 374 GKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIV 433
+ + I+ AL N +VQ+RLG+F+G P YGKLGP +VC+ H++LALDAARQGIV
Sbjct: 337 XLLTQTHINNALANTITVQMRLGMFDGAPSSHAYGKLGPKNVCSPSHQQLALDAARQGIV 396
Query: 434 LLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYA 493
LLKN LPL+ ++A+IGP + M G Y G+ C + L G++ Y + H
Sbjct: 397 LLKNRLPGLPLSAXHHRTVAVIGPNSDVNVTMIGNYAGVACGYVTPLEGIKRYTTVVH-R 455
Query: 494 SGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSV 553
GC +V C +D F +A+ A AD ++V GLD + E E +DR LLLPG+Q LV V
Sbjct: 456 KGCDNVACATDYSFTDALAAASTADATVLVMGLDQSVEAETKDRDGLLLPGRQQELVLKV 515
Query: 554 ARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRL 613
A S+ P +++L GGP+DVSFA+ D +IS+ILW+GYPG+AG A+A+++FG NPGG+L
Sbjct: 516 AAASRGPTVVILMSGGPIDVSFADNDPRISAILWVGYPGQAGGAAIADVLFGTTNPGGKL 575
Query: 614 PMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSA 672
PMTWYP+S+ + +PM +M MR+ SS YPGR+YRFY G VY FGHGLSYTN+ + + A
Sbjct: 576 PMTWYPQSYLSNLPMTNMAMRSTSS--YPGRTYRFYAGPVVYEFGHGLSYTNFIHTIVKA 633
Query: 673 PSELTISAS-LKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHV 731
P+ ++IS S + + L R+ + C L + + V N GD DG H
Sbjct: 634 PTIVSISLSGHRQTHSASTLSSKAIRVTH------AKCQKLSLVIHVDVENKGDRDGFHT 687
Query: 732 VMLFARVP-KVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLG 790
+++F+ P + P KQL+ F+++H ++ + + V C+ LS+ +K G R +PLG
Sbjct: 688 MLVFSTPPANGATWVPRKQLVAFEKLHLASREKRRLQVHVHVCKYLSVVDKLGVRRIPLG 747
Query: 791 NHVLMVGELRHSLTIE 806
+H + +G ++H+++++
Sbjct: 748 DHYIHIGNVKHTVSLQ 763
>gi|357442285|ref|XP_003591420.1| Beta xylosidase [Medicago truncatula]
gi|355480468|gb|AES61671.1| Beta xylosidase [Medicago truncatula]
Length = 765
Score = 724 bits (1868), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 355/792 (44%), Positives = 495/792 (62%), Gaps = 46/792 (5%)
Query: 21 LIVVVNVIAFSNSKPVLNKPDFPCKPPHF--DSYPFCNTSLSISTRAKSLISLLTLQEKI 78
LI +V ++ +S+ + F C P + +++PFC SL I TR LI LTLQEK+
Sbjct: 5 LITIVFLLLLMSSEA---RDPFACDPKNTSTNNFPFCKASLPIPTRVNDLIGRLTLQEKV 61
Query: 79 QQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRS 138
L +NA+A+PR+GI YEWWSE+LHG+++ GPG F G + TSFPQV+ + ASFN S
Sbjct: 62 SMLVNNAAAVPRVGIKGYEWWSEALHGVSNVGPGTKFAGQFPAATSFPQVITTVASFNAS 121
Query: 139 LWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEF 198
LW IG + EARAMYN G AGLT+W+PN+NIFRDPRWGRGQETPGEDP++ YA +
Sbjct: 122 LWEAIGRVASDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPILAGKYAASY 181
Query: 199 VKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFS 258
V+ Q G +S R L ++A CKH AYDL+ W
Sbjct: 182 VRGLQ------------------------GTDSSR---LKVAASCKHFTAYDLDNWNGVD 214
Query: 259 RYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEW 317
R+ FNA +++QD EDTF PFR C+++G + +MCSYNQVNGVP C +L ++ R +W
Sbjct: 215 RFHFNAKVSKQDMEDTFNVPFRMCVKEGNVASVMCSYNQVNGVPTCADPNLLKRTIRGQW 274
Query: 318 GFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQ 377
GYI SDCD+V + Q+YT T E++AA +KAG+D++CG + +HTQ+A+ KG +
Sbjct: 275 HLDGYIVSDCDSVGVFYTNQHYTSTPEEAAADAIKAGLDLDCGPFLAQHTQNAVKKGLLT 334
Query: 378 EKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKN 437
E D++ AL N +VQ+RLG+F+G+P YG LGP DVCT H++LALDAARQGIVLLKN
Sbjct: 335 ETDVNGALANTLTVQMRLGMFDGEPSAQPYGNLGPTDVCTPTHQELALDAARQGIVLLKN 394
Query: 438 DKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCH 497
LPL+ ++A+IGP N M G Y GI C S L+G+ Y ++T + GC
Sbjct: 395 TGPSLPLSTKNHQTVAVIGPNSNATVTMIGNYAGIACGYTSPLQGIGKY-ARTIHEPGCA 453
Query: 498 DVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTS 557
+V CN D F A+ A++AD ++V GLD + E E DR LLLPG Q LV+ VA S
Sbjct: 454 NVACNDDKQFGSALNAARQADATVLVMGLDQSIEAEMVDRTGLLLPGHQQDLVSKVAAAS 513
Query: 558 KRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTW 617
+ P ILVL GGP+D++FA+ D +I ILW GYPG+AG A+A+I+FG NPG +LPMTW
Sbjct: 514 RGPTILVLMSGGPIDITFAKNDPRIMGILWAGYPGQAGGAAIADILFGTTNPGAKLPMTW 573
Query: 618 YPESFTK-VPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
YP+ + K + M +M MR SS YPGR+YRFY G VY FG+GLSYTN+ + SAP
Sbjct: 574 YPQGYLKNLAMTNMAMRPSSSTGYPGRTYRFYNGPVVYPFGYGLSYTNFVHTLASAPK-- 631
Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
+S + N + R+ + C L + I V N G DG++ +++F+
Sbjct: 632 VVSVPVDGHRRGNSSNKAAIRVTHAR------CGKLSIRLDIDVKNVGSKDGTNTLLVFS 685
Query: 737 RVPKVSQG--TPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVL 794
VP G P+KQL+ F++V+ AK + + + C+ LS+ +K G R +P+G H +
Sbjct: 686 -VPPTGNGHWAPQKQLVAFEKVYVPAKAQQRVRINIHVCKLLSVVDKSGTRRIPMGAHSI 744
Query: 795 MVGELRHSLTIE 806
+G+++H ++++
Sbjct: 745 HIGDVKHFVSLQ 756
>gi|224099193|ref|XP_002311398.1| predicted protein [Populus trichocarpa]
gi|222851218|gb|EEE88765.1| predicted protein [Populus trichocarpa]
Length = 755
Score = 723 bits (1867), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 359/759 (47%), Positives = 500/759 (65%), Gaps = 40/759 (5%)
Query: 51 SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
S FC ++ + R + LI LTLQEKI+ L +NA+A+PRLGI YEWWSE+LHG+++ G
Sbjct: 31 SLKFCRVNMPLHVRVRDLIGRLTLQEKIRLLVNNAAAVPRLGIQGYEWWSEALHGVSNVG 90
Query: 111 PGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNIN 170
PG F G TSFPQV+ +AASFN+SLW IG V+ EARAM+N G AGLT+W+PN+N
Sbjct: 91 PGTKFGGAFPGATSFPQVITTAASFNKSLWEEIGRVVSDEARAMFNGGMAGLTYWSPNVN 150
Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
+FRDPRWGRGQETPGEDP+V YA +V+ QG + GFR
Sbjct: 151 VFRDPRWGRGQETPGEDPVVAGKYAASYVRGLQGNS----------GFR----------- 189
Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
L ++ACCKH AYDL+ W RY FNA +++QD EDT+ PF+SC+ +GK +
Sbjct: 190 ------LKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYDVPFKSCVVEGKVAS 243
Query: 291 IMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
+MCSYNQVNG P C +L + R EW GYI SDCD+V ++E Q+YT T E++AA
Sbjct: 244 VMCSYNQVNGKPTCADPNLLKNTIRGEWRLNGYIVSDCDSVGVLYENQHYTATPEEAAAA 303
Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGK 409
+KAG+D++CG + HT++A+ G + E+D++ AL N +VQ+RLGLF+G+P +GK
Sbjct: 304 TIKAGLDLDCGPFLAIHTENAVKGGLLNEEDVNMALANTITVQMRLGLFDGEPSAQPFGK 363
Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY 469
LGP DVCT H++LAL AA+QGIVLL+N + LPL++ + ++A+IGP+ + M G Y
Sbjct: 364 LGPRDVCTPAHQQLALHAAQQGIVLLQNSGRTLPLSRPNL-TVAVIGPIADVTVTMIGNY 422
Query: 470 TGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLT 529
G+ C + L+G+ Y +KT + SGC DV CN + F A A +AD ++V GLD +
Sbjct: 423 AGVACGYTTPLQGISRY-AKTIHQSGCIDVACNGNQQFGMAEAAASQADATVLVMGLDQS 481
Query: 530 QETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIG 589
E E RDR LLLPG Q L++ VAR S+ P ILVL GGP+DVSFA+ D +I +ILW G
Sbjct: 482 IEAEFRDRKDLLLPGYQQELISRVARASRGPTILVLMSGGPIDVSFAKNDPRIGAILWAG 541
Query: 590 YPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRFY 648
YPG+AG A+A+++FG NPGG+LPMTWYP+ + KVPM +M MRAD SR YPGR+YRFY
Sbjct: 542 YPGQAGGAAIADVLFGTTNPGGKLPMTWYPQDYLAKVPMTNMGMRADPSRGYPGRTYRFY 601
Query: 649 TGTQVYGFGHGLSYTNYSYKFLSAPSELTIS-ASLKAGSDKNILQQTGSRLDYVHIDEVT 707
G V+ FGHG+SYT +++ + AP E+ + SL A LQ T + + + +
Sbjct: 602 KGPVVFPFGHGMSYTTFAHSLVQAPQEVAVPFTSLYA------LQNTTAARNSIRVSH-A 654
Query: 708 SCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEIS 767
+C L V I V N GD+DG +++F+ P+ + + K+LIGF++VH VA K +
Sbjct: 655 NCEPLVLGVHIDVKNTGDMDGIQTLLVFSSPPE-GKWSANKKLIGFEKVHIVAGSKKRVK 713
Query: 768 FGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
+ C+ LS+ ++ G R LP+G H L +G+L+HS++++
Sbjct: 714 IDIPVCKHLSVVDRFGIRRLPIGKHDLHIGDLKHSISLQ 752
>gi|115460876|ref|NP_001054038.1| Os04g0640700 [Oryza sativa Japonica Group]
gi|38344900|emb|CAE02971.2| OSJNBb0079B02.3 [Oryza sativa Japonica Group]
gi|113565609|dbj|BAF15952.1| Os04g0640700 [Oryza sativa Japonica Group]
gi|116310882|emb|CAH67823.1| OSIGBa0138H21-OSIGBa0138E01.14 [Oryza sativa Indica Group]
gi|218195682|gb|EEC78109.1| hypothetical protein OsI_17615 [Oryza sativa Indica Group]
Length = 765
Score = 723 bits (1867), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 371/776 (47%), Positives = 493/776 (63%), Gaps = 49/776 (6%)
Query: 40 PDFPCKPPH--FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYE 97
P F C + Y FC+ + S + RA L+ LTL EK+ L + +A+PRLGIPAYE
Sbjct: 29 PVFACDASNATVSGYGFCDRTKSSAARAADLLGRLTLAEKVGFLVNKQAALPRLGIPAYE 88
Query: 98 WWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL 157
WWSE+LHG++ GPG F+ V TSFPQ +++AASFN SL+ IG V+ EARAM+N+
Sbjct: 89 WWSEALHGVSYVGPGTRFSTLVPGATSFPQPILTAASFNASLFRAIGEVVSTEARAMHNV 148
Query: 158 GQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFG 217
G AGLTFW+PNINIFRDPRWGRGQETPGEDP++ S YAV +V Q D G G
Sbjct: 149 GLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQ-------DAGGG-- 199
Query: 218 FREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQP 277
D L ++ACCKH AYD++ W RY+F+A++++QD +DTFQP
Sbjct: 200 ----------------SDALKVAACCKHYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQP 243
Query: 278 PFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEY 336
PF+SC+ G + +MCSYN+VNG P C DL R +W GYI SDCD+V ++
Sbjct: 244 PFKSCVIDGNVASVMCSYNKVNGKPTCADKDLLSGVIRGDWKLNGYIVSDCDSVDVLYNN 303
Query: 337 QNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
Q+YTK ED+AA +K+G+D+NCG + +HT +A+ GK+ E D+DRA+ N F V +RLG
Sbjct: 304 QHYTKNPEDAAAITIKSGLDLNCGNFLAQHTVAAVQAGKLSESDVDRAITNNFIVLMRLG 363
Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIG 456
F+GDPRK +G LGP DVCTS +++LA +AARQGIVLLKN LPL+ ++ S+A+IG
Sbjct: 364 FFDGDPRKLPFGSLGPKDVCTSSNQELAREAARQGIVLLKNTGA-LPLSAKSIKSMAVIG 422
Query: 457 PLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDA-GFHEAVRIAK 515
P N M G Y G PC + L+GL A V+ T Y GC +V C+ ++ A + A
Sbjct: 423 PNANASFTMIGNYEGTPCKYTTPLQGLGANVA-TVYQPGCTNVGCSGNSLQLSAATQAAA 481
Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
AD ++V G D + E E DR SLLLPGQQ LV++VA S+ PVILV+ GGP D+SF
Sbjct: 482 SADVTVLVVGADQSVERESLDRTSLLLPGQQPQLVSAVANASRGPVILVVMSGGPFDISF 541
Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRA 634
A++ +IS+ILW+GYPGEAG ALA+I+FG NPGGRLP+TWYP SF KV M DM MR
Sbjct: 542 AKSSDKISAILWVGYPGEAGGAALADILFGYHNPGGRLPVTWYPASFADKVSMTDMRMRP 601
Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
DSS YPGR+YRFYTG VY FG GLSYT +++ +SAP ++ + +
Sbjct: 602 DSSTGYPGRTYRFYTGDTVYAFGDGLSYTKFAHSLVSAPEQVAVQLA------------E 649
Query: 695 GSRLDYVHIDEVTS----CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQL 750
G H V + C SL F V + V NAG + G H V LF+ P V P K L
Sbjct: 650 GHACHTEHCFSVEAAGEHCGSLSFDVHLRVRNAGGMAGGHTVFLFSSPPSV-HSAPAKHL 708
Query: 751 IGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
+GF++V + ++F VD C+ LS+ ++ G R + LG+H L VG+L+H+L +
Sbjct: 709 LGFEKVSLEPGQAGVVAFKVDVCKDLSVVDELGNRKVALGSHTLHVGDLKHTLNLR 764
>gi|302811514|ref|XP_002987446.1| hypothetical protein SELMODRAFT_426206 [Selaginella moellendorffii]
gi|300144852|gb|EFJ11533.1| hypothetical protein SELMODRAFT_426206 [Selaginella moellendorffii]
Length = 772
Score = 723 bits (1866), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 371/758 (48%), Positives = 498/758 (65%), Gaps = 33/758 (4%)
Query: 51 SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
++PFCNTSL I+ R + ++ LTL+EKI QL + A+ IPRLG+P Y+WW E+LHG+AS+
Sbjct: 41 AFPFCNTSLPITDRVEDYVARLTLEEKISQLINTATGIPRLGVPKYQWWQEALHGVASS- 99
Query: 111 PGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNIN 170
PGV F G+V + TSFP + +AASFN SL+ IG AV+ EARAM+NLGQ+GLTFW+PNIN
Sbjct: 100 PGVQFGGSVPAATSFPMPITTAASFNTSLFYGIGQAVSTEARAMHNLGQSGLTFWSPNIN 159
Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
I+RDPRWGRGQETPGEDP++ S +A +V+ Q E
Sbjct: 160 IYRDPRWGRGQETPGEDPLLSSNFATYYVRGLQ-------------------------ES 194
Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
D+L +SACCKH+ AYD++ W RY FNA++TEQD EDT+ PF+SC+E G S
Sbjct: 195 QAGSDKLKVSACCKHMTAYDVDNWLGTDRYHFNAIVTEQDLEDTYNAPFKSCVEDGGVSS 254
Query: 291 IMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
+MCSYN++NGVP C +L R W GYI SDCD++ F+ NY T ED+AA
Sbjct: 255 VMCSYNRLNGVPTCADHELLTTTVRETWKLNGYIVSDCDSLQVFFDNTNYAATAEDAAAD 314
Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGK 409
L AG+++NCGT + +HT SAI + KV E I++AL L +VQ+RLGL++GDP+ YG
Sbjct: 315 ALLAGLNLNCGTFLAKHTLSAIQQKKVTEATINQALTYLVTVQMRLGLYDGDPKSQTYGS 374
Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY 469
LG DVCTSEH+ LAL+AARQG+VLLKN LPL+ + + SLA++GP N M G Y
Sbjct: 375 LGASDVCTSEHQTLALEAARQGMVLLKNLGA-LPLSTSKIKSLAVVGPHANATRAMIGNY 433
Query: 470 TGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLT 529
GIPC S L+ + Y ++ YA GC +V C+SD+ AV A AD V+V GLDLT
Sbjct: 434 AGIPCKYTSPLQAFQKY-AQVSYAPGCANVACSSDSLISGAVSAAAAADAVVVAVGLDLT 492
Query: 530 QETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIG 589
E E DR SLLLPG+Q LV+ V + +K PV++V+ G +D+ FA +DS+I+ ILW G
Sbjct: 493 IEAESLDRTSLLLPGKQQELVSQVMQAAKGPVVIVILSAGAIDIPFALSDSRIAGILWAG 552
Query: 590 YPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYT 649
YPG+AG A+AE+IFGD NP G+LP TWYP++FT + M DMNMR ++S YPGR+YRFYT
Sbjct: 553 YPGQAGGAAIAEVIFGDHNPSGKLPATWYPQNFTSISMLDMNMRPNASTGYPGRTYRFYT 612
Query: 650 GTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSC 709
G ++ FG GLSYT+ S KF+ APS L+I ++ L+++ S DE SC
Sbjct: 613 GPTIFKFGDGLSYTSLSAKFIKAPSFLSIPSTAPMQPCTG-LKKSSSCFHLDATDE-KSC 670
Query: 710 TSLRFHVQISVTNAGDVDGSHVVMLFARVPKV-SQGTPEKQLIGFDRVHTVAKG-SKEIS 767
SL+ V ISV N G + SH +MLF+ P S G P++QL+GF+++ S +
Sbjct: 671 ESLKSQVAISVRNKGAMAISHTLMLFSTPPSAGSDGVPQRQLVGFNKIQIAGDSISNPVI 730
Query: 768 FGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
F +DPC A++ G+++L G HVL G +HSL +
Sbjct: 731 FDLDPCRHFVHADRDGKKLLRSGTHVLTAGNEQHSLRL 768
>gi|157041199|dbj|BAF79669.1| beta-D-xylosidase [Pyrus pyrifolia]
Length = 774
Score = 723 bits (1866), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 363/773 (46%), Positives = 488/773 (63%), Gaps = 40/773 (5%)
Query: 39 KPDFPCKP--PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAY 96
+P F C P P + FC + I R + LI LTLQEKI L +NA A+PRLGI Y
Sbjct: 29 RPPFACDPRNPITRTLKFCRVRVPIHVRVQDLIGRLTLQEKIGLLVNNAIAVPRLGIQGY 88
Query: 97 EWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN 156
EWWSE+LHG+++ GPG F GT TSFPQV+ +AASFN SLW IG V+ EARAMYN
Sbjct: 89 EWWSEALHGVSNVGPGTKF-GTFLGATSFPQVITTAASFNESLWEEIGRVVSDEARAMYN 147
Query: 157 LGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGF 216
G AGLTFW+PN+NIFRDPRWGRGQETPGEDP++ + Y +VK QG DG
Sbjct: 148 GGAAGLTFWSPNVNIFRDPRWGRGQETPGEDPVLAAKYGARYVKGLQG------DGA--- 198
Query: 217 GFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQ 276
G+ L ++ACCKH AYDL+ W R+ FNA +++QD EDT+
Sbjct: 199 -----------------GNRLKVAACCKHYTAYDLDNWNGVDRFHFNARVSKQDLEDTYN 241
Query: 277 PPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFE 335
PF++C+ G + +MCSYNQVNG P C DL + R +W GYI SDCD+V ++
Sbjct: 242 VPFKACVVDGNVASVMCSYNQVNGKPTCADPDLLKGTIRGQWKLNGYIVSDCDSVGVYYD 301
Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
Q+YTKT E +AA +KAG+D++CG + HT++AI G+V E DI+ AL N +VQ+RL
Sbjct: 302 NQHYTKTPEAAAAYAIKAGLDLDCGPFLGIHTEAAIRTGQVNEIDINYALANTITVQMRL 361
Query: 396 GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAII 455
G+F+G+P +YG LG DVC +LAL+AARQGIVLL+N LPL+ ++A+I
Sbjct: 362 GMFDGEPSTQRYGNLGLADVCKPSSNELALEAARQGIVLLENRGNSLPLSTIRHRTVAVI 421
Query: 456 GPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAK 515
GP + M G Y GI C + L+G+ Y H A GC DV CN + A A+
Sbjct: 422 GPNSDVTETMIGNYAGIACGYTTPLQGIARYTRTIHQA-GCTDVHCNGNQLIGAAEVAAR 480
Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
+AD ++V GLD + E E RDR LLLPG Q LV+ VAR S+ P ILV+ GGP+DV+F
Sbjct: 481 QADATVLVIGLDQSIEAEFRDRTGLLLPGHQQELVSRVARASRGPTILVIMSGGPIDVTF 540
Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRA 634
A+ D +I +I+W+GYPG+AG A+A+++FG NP G+LPMTWYP+++ +PM DM MRA
Sbjct: 541 AKNDPRIGAIIWVGYPGQAGGTAIADVLFGTTNPSGKLPMTWYPQNYVANLPMTDMAMRA 600
Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS-ASLKAGSDKNILQQ 693
D +R YPGR+YRFY G V+ FG GLSYT +S+ P+ +++ SL A + +L
Sbjct: 601 DPARGYPGRTYRFYKGPVVFPFGMGLSYTRFSHSLAQGPTLVSVPLTSLVAAKNTTMLSN 660
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
G R+ + T+C SL I + N G +DG+H +++FA P + P KQL+GF
Sbjct: 661 HGVRVSH------TNCDSLSLDFHIDIKNTGTMDGTHTLLVFATQP-AGKWAPNKQLVGF 713
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
+VH VA + + GV C+ LSI +K G R +PLG H L +G+L+H ++IE
Sbjct: 714 HKVHIVAGSERRVRVGVHVCKHLSIVDKLGIRRIPLGQHKLEIGDLKHYVSIE 766
>gi|302811516|ref|XP_002987447.1| hypothetical protein SELMODRAFT_426207 [Selaginella moellendorffii]
gi|300144853|gb|EFJ11534.1| hypothetical protein SELMODRAFT_426207 [Selaginella moellendorffii]
Length = 779
Score = 723 bits (1865), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 367/774 (47%), Positives = 492/774 (63%), Gaps = 62/774 (8%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
+ FCNT L STR + LIS +TLQEKI QL +NA+ IPRLG+P YEWW E+LHG+A + P
Sbjct: 44 FGFCNTRLPTSTRVEDLISRMTLQEKIIQLVNNAAGIPRLGLPRYEWWQEALHGVAVS-P 102
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
GV F G TSFP +++AASF+ AV+ EARAM+N +AGLT+W+PN+NI
Sbjct: 103 GVKFGGKFPGATSFPMPILTAASFD---------AVSTEARAMHNYQRAGLTYWSPNVNI 153
Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
+RDPRWGRGQETPGEDP++ S YA +V+ Q N
Sbjct: 154 YRDPRWGRGQETPGEDPLLSSKYATFYVRGLQDTNLG----------------------- 190
Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
GD+L +SACCKH+ AYD++ W +R+ FNA++T+QD DT+ PPF+SC+E K S +
Sbjct: 191 --GDKLKVSACCKHMTAYDVDNWKGTTRFKFNAIVTQQDLSDTYNPPFQSCVEDAKVSSV 248
Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGFKG----------------YITSDCDAVATIF 334
MCSYN+VNGVP C +L R+ W G YI SDCD++ T F
Sbjct: 249 MCSYNRVNGVPTCADYNLLSATVRSSWNLNGSILLTCEVLLLYLPCSYIVSDCDSLQTFF 308
Query: 335 EYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
+ NY KT ED A L AG++++CG + HTQSAI GK+ E ++++AL L++VQ+R
Sbjct: 309 DNTNYAKTAEDVVADALLAGLNLDCGPFLAIHTQSAITNGKITEANVNQALRYLYNVQMR 368
Query: 395 LGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
LGL++G+PR YG LGP VCT E+++LALDAA++GIVLLKN+ LP +K+ + ++A
Sbjct: 369 LGLYDGNPRSQPYGNLGPQSVCTGENQQLALDAAKEGIVLLKNNGNVLPFSKSNIRTVAA 428
Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIA 514
IGP M G Y GIPC + GL AY ++ Y++GC DV C SD+ AV A
Sbjct: 429 IGPHAKATRAMIGNYQGIPCKYTTPHDGLSAY-ARVVYSAGCSDVACYSDSLIGSAVSTA 487
Query: 515 KKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
+AD V++ GLDL QE E +DR SLLLPG+Q LVT V + +K P +LV+ GG +DVS
Sbjct: 488 SQADAVVLFVGLDLNQEAEGKDRTSLLLPGKQQELVTEVTKAAKGPAVLVIFSGGSVDVS 547
Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA 634
FA+ ++++ ILW GYPGEAG A+A+++FGD NPGGRLP+TWYPESFT + M DMNMR
Sbjct: 548 FAKYNNKVQGILWAGYPGEAGGAAIAQVLFGDHNPGGRLPVTWYPESFTGITMLDMNMRP 607
Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTI--SASLKAGSDKNILQ 692
D+SR YPGR+YRFYTG VY FG+G +Y+ S+KF AP L +A++K D N+
Sbjct: 608 DASRGYPGRTYRFYTGQSVYNFGYGKTYSKLSHKFKEAPLSLGFPEAAAVKRSCDGNL-- 665
Query: 693 QTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ-GTPEKQLI 751
T L+ H DE+T C++L V+I V N GD + V+L++ P + G P +QL
Sbjct: 666 -TCFHLN-AH-DEIT-CSTLTSKVRILVHNKGDRPSNRAVLLYSSPPNAGRDGAPIRQLA 721
Query: 752 GFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
GF +V + + +DPC+ LS A +G RIL G H L VG RH L I
Sbjct: 722 GFGKVSVAPGAVENVEIEIDPCKHLSHAGANGVRILHGGIHTLAVGNARHPLPI 775
>gi|297811069|ref|XP_002873418.1| beta-xylosidase 3 [Arabidopsis lyrata subsp. lyrata]
gi|297319255|gb|EFH49677.1| beta-xylosidase 3 [Arabidopsis lyrata subsp. lyrata]
Length = 780
Score = 723 bits (1865), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/809 (46%), Positives = 507/809 (62%), Gaps = 46/809 (5%)
Query: 7 SIQLRERILFRVLQLIVVVNVIAFSNSKPVLNKPDFPCK---PPHFDSYPFCNTSLSIST 63
S+ R R LF V L++ ++ S + P F C P FCNT L+I +
Sbjct: 5 SLTERNRALFSVSTLLLCF-LLCISEQSNAQSSPVFACDVTGNPSLAGLRFCNTGLNIKS 63
Query: 64 RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
R L+ LTL+EKI L NA + RLGIPAY+WWSE+LHG+++ G G +F+G V T
Sbjct: 64 RVTDLVGRLTLEEKIGFLGSNAIGVSRLGIPAYKWWSEALHGVSNVGGGSSFSGQVPGAT 123
Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQET 183
SFPQV+++AASFN SL+ IG V+ EARAMYN+G AGLTFW+PN+NIFRDPRWGRGQET
Sbjct: 124 SFPQVILTAASFNVSLFQAIGKVVSTEARAMYNVGSAGLTFWSPNVNIFRDPRWGRGQET 183
Query: 184 PGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD--ELMLSA 241
PGEDP + S YAV +V+ Q E+D GD L ++A
Sbjct: 184 PGEDPELSSKYAVAYVRGLQ--------------------------ETDGGDPNRLKVAA 217
Query: 242 CCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGV 301
CCKH AYD++ W + R++FNA++ +QD DTFQPPF+SC+ G + +MCSYNQVNG
Sbjct: 218 CCKHYTAYDVDNWKDVHRFTFNAVVNQQDMADTFQPPFKSCVVDGNVASVMCSYNQVNGK 277
Query: 302 PACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG 360
P C DL R +W GYI SDCD+V ++ Q+YTKT E++ A + AG+D+NC
Sbjct: 278 PTCADPDLLSGVIRGQWKLNGYIVSDCDSVDVLYTKQHYTKTPEEAVAKSILAGLDLNCD 337
Query: 361 TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-YGKLGPDDVCTSE 419
++ A+ G V E ID+A+ N F+ +RLG F+GDP+K + YG LGP+DVCT+
Sbjct: 338 HFTGQYAMKAVKVGLVNETAIDKAISNNFATLMRLGFFDGDPKKQQLYGGLGPNDVCTAN 397
Query: 420 HKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSL 479
+++LA DAARQGIVLLKN LPL+ +A+ +LA+IGP N M G Y GIPC +
Sbjct: 398 NQELARDAARQGIVLLKNSAGSLPLSPSAIKTLAVIGPNANATETMIGNYNGIPCKYTTP 457
Query: 480 LRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVS 539
L+GL VS T Y GC +V C ++ A +A AD V++V G D + E E+ DR+
Sbjct: 458 LQGLAETVSST-YQLGC-NVAC-AEPDLGSAAALAASADAVVLVMGADQSIEQENLDRLD 514
Query: 540 LLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKAL 599
L LPG+Q LVT VA+ +K PV+LV+ GG D++FA+ + +I+ I+W+GYPGEAG A+
Sbjct: 515 LYLPGKQQELVTQVAKVAKGPVVLVIMSGGAFDITFAKNEEKITGIMWVGYPGEAGGLAI 574
Query: 600 AEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGH 658
A++IFG NP G LPMTWYP+S+ KVPM +MNMR D S YPGR+YRFYTG VY FG
Sbjct: 575 ADVIFGRHNPSGNLPMTWYPQSYVEKVPMTNMNMRPDKSNGYPGRTYRFYTGETVYAFGD 634
Query: 659 GLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYV--HIDEVTSCTSLRFHV 716
GLSYTN++++ L AP + SL + +D + H D L F V
Sbjct: 635 GLSYTNFNHQILKAPKLV----SLDLDENHACRSSECQSVDAIGPHCDNAVG-GGLNFEV 689
Query: 717 QISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQL 776
Q+ V N GD +GSH V LF P+V G+P K L+GF+++ K I F VD C+ L
Sbjct: 690 QLKVRNVGDREGSHTVFLFTTPPEV-HGSPRKHLLGFEKIRLGEKEETVIRFNVDVCKDL 748
Query: 777 SIANKHGRRILPLGNHVLMVGELRHSLTI 805
S+ ++ G+R + LG+++L VG +HSLTI
Sbjct: 749 SVVDEIGKRKIALGHYLLHVGSFKHSLTI 777
>gi|302796583|ref|XP_002980053.1| hypothetical protein SELMODRAFT_112087 [Selaginella moellendorffii]
gi|300152280|gb|EFJ18923.1| hypothetical protein SELMODRAFT_112087 [Selaginella moellendorffii]
Length = 772
Score = 722 bits (1864), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 371/758 (48%), Positives = 498/758 (65%), Gaps = 33/758 (4%)
Query: 51 SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
++PFCNTSL+I+ R + ++ LTL+EKI QL + A+ IPRLG+P Y+WW E+LHG+AS+
Sbjct: 41 AFPFCNTSLAITDRVEDYVARLTLEEKISQLINTATGIPRLGVPKYQWWQEALHGVASS- 99
Query: 111 PGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNIN 170
PGV F G+V + TSFP + +AASFN SL+ IG AV+ EARAM+NLGQ+GLTFW+PNIN
Sbjct: 100 PGVQFGGSVPAATSFPMPITTAASFNTSLFYGIGQAVSTEARAMHNLGQSGLTFWSPNIN 159
Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
I+RDPRWGRGQETPGEDP++ S +A +V+ Q E
Sbjct: 160 IYRDPRWGRGQETPGEDPLLSSNFATYYVRGLQ-------------------------ES 194
Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
D+L +SACCKH+ AYD++ W RY FNA++TEQD EDT+ PF+SC+E G S
Sbjct: 195 QAGSDKLKVSACCKHMTAYDVDNWLGTDRYHFNAIVTEQDLEDTYNAPFKSCVEDGGVSS 254
Query: 291 IMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
+MCSYN++NGVP C +L R W GYI SDCD++ F+ NY T ED+AA
Sbjct: 255 VMCSYNRLNGVPTCADHELLTTTVRETWKLNGYIVSDCDSLQVFFDNTNYAATAEDAAAD 314
Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGK 409
L AG+++NCGT + +HT SAI + KV E I++AL L +VQ+RLGL++GDP+ YG
Sbjct: 315 ALLAGLNLNCGTFLAKHTLSAIQQKKVTEATINQALTYLVTVQMRLGLYDGDPKSQTYGS 374
Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY 469
LG DVCTSEH+ LAL+AARQG+VLLKN LPL+ + + SLA++GP N M G Y
Sbjct: 375 LGASDVCTSEHQTLALEAARQGMVLLKNLGA-LPLSTSKIKSLAVVGPHANATRAMIGNY 433
Query: 470 TGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLT 529
GIPC S L+ + Y ++ YA GC +V C+SD+ AV A AD V+V GLDLT
Sbjct: 434 AGIPCKYTSPLQAFQKY-AQVSYAPGCANVACSSDSLISGAVSAAAAADAVVVAVGLDLT 492
Query: 530 QETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIG 589
E E DR SLLLPG+Q LV+ V + +K PV++V+ G +D+ FA +DS+I+ ILW G
Sbjct: 493 IEAESLDRTSLLLPGKQQELVSQVMQAAKGPVVIVILSAGAIDIPFALSDSRIAGILWAG 552
Query: 590 YPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYT 649
YPG+AG A+AE+IFGD NP G+LP TWYP++FT + M DMNMR ++S YPGR+YRFYT
Sbjct: 553 YPGQAGGAAIAEVIFGDHNPSGKLPATWYPQNFTSISMLDMNMRPNASTGYPGRTYRFYT 612
Query: 650 GTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSC 709
G ++ FG GLSYT+ S KF+ APS L+I ++ L+++ S DE SC
Sbjct: 613 GPTIFKFGDGLSYTSLSAKFIKAPSFLSIPSTAPMQPCTG-LKKSSSCFHLDATDE-KSC 670
Query: 710 TSLRFHVQISVTNAGDVDGSHVVMLFARVPKV-SQGTPEKQLIGFDRVHTVAKG-SKEIS 767
SL+ V ISV N G + SH +MLF+ P S G P++QL+GF+++ S +
Sbjct: 671 ESLKSQVAISVRNKGAMAISHTLMLFSTPPNAGSDGVPQRQLVGFNKIQIAGDSISNPVI 730
Query: 768 FGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
F +DPC A+ G+++L G HVL G +HSL +
Sbjct: 731 FDLDPCRHFVHADPDGKKLLRSGTHVLTAGNEQHSLRL 768
>gi|356529243|ref|XP_003533205.1| PREDICTED: beta-D-xylosidase 1-like [Glycine max]
Length = 774
Score = 722 bits (1864), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 356/771 (46%), Positives = 498/771 (64%), Gaps = 39/771 (5%)
Query: 42 FPCKPPH--FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWW 99
F C P + + FCNT + I R + LI+ LTL EKI+ + +NA A+PRLGI YEWW
Sbjct: 36 FACDPRNGLTRGFKFCNTHVPIHVRVQDLIARLTLPEKIRLVVNNAIAVPRLGIQGYEWW 95
Query: 100 SESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ 159
SE+LHG+++ GPG F G T FPQV+ +AASFN+SLW IG V+ EARAMYN GQ
Sbjct: 96 SEALHGVSNVGPGTKFGGAFPGATMFPQVISTAASFNQSLWQEIGRVVSDEARAMYNGGQ 155
Query: 160 AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFR 219
AGLT+W+PN+NIFRDPRWGRGQETPGEDP + + YA +VK QG DG
Sbjct: 156 AGLTYWSPNVNIFRDPRWGRGQETPGEDPTLAAKYAASYVKGLQG------DGA------ 203
Query: 220 EKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPF 279
G+ L ++ACCKH AYDL+ W R+ FNA +++QD EDT+ PF
Sbjct: 204 --------------GNRLKVAACCKHYTAYDLDNWNGVDRFHFNAKVSKQDLEDTYDVPF 249
Query: 280 RSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQN 338
++C+ +G+ + +MCSYNQVNG P C DL + R +WG GYI SDCD+V F+ Q+
Sbjct: 250 KACVLEGQVASVMCSYNQVNGKPTCADPDLLRNTIRGQWGLNGYIVSDCDSVGVFFDNQH 309
Query: 339 YTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
YT+T E++AA +KAG+D++CG + HT SAI KG + E D++ AL NL +VQ+RLG+F
Sbjct: 310 YTRTPEEAAAEAIKAGLDLDCGPFLAIHTDSAIRKGLISENDLNLALANLITVQMRLGMF 369
Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
+G+P +G LGP DVCT H++LAL+AAR+ IVLL+N LPL+ + + + +IGP
Sbjct: 370 DGEPSTQPFGNLGPRDVCTPAHQQLALEAARESIVLLQNKGNSLPLSPSRLRIVGVIGPN 429
Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKAD 518
+ M G Y G+ C + L+G+ YV KT + GC V C + F A IA++ D
Sbjct: 430 TDATVTMIGNYAGVACGYTTPLQGIARYV-KTAHQVGCRGVACRGNELFGAAEIIARQVD 488
Query: 519 FVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEA 578
++V GLD T E E RDRV LLLPG Q LVT VAR +K PVILV+ GGP+DVSFA+
Sbjct: 489 ATVLVMGLDQTIEAETRDRVGLLLPGLQQELVTRVARAAKGPVILVIMSGGPVDVSFAKN 548
Query: 579 DSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSS 637
+ +IS+ILW+GYPG+AG A+A++IFG NPGGRLPMTWYP+ + KVPM +M+MR + +
Sbjct: 549 NPKISAILWVGYPGQAGGTAIADVIFGATNPGGRLPMTWYPQGYLAKVPMTNMDMRPNPA 608
Query: 638 RQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISA-SLKAGSDKNILQQTGS 696
YPGR+YRFY G V+ FGHGLSY+ +S AP ++++ SL+A ++ + L
Sbjct: 609 TGYPGRTYRFYKGPVVFPFGHGLSYSRFSQSLALAPKQVSVQILSLQALTN-STLSSKAV 667
Query: 697 RLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRV 756
++ + + D+ SL + V N G +DG+H +++F++ P + + KQL+ F +
Sbjct: 668 KVSHANCDD-----SLETEFHVDVKNEGSMDGTHTLLIFSK-PPPGKWSQIKQLVTFHKT 721
Query: 757 HTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIET 807
H A + + V C+ LS+ ++ G R +P G H L +G+L+HS+ ++T
Sbjct: 722 HVPAGSKQRLKVNVHSCKHLSVVDQFGVRRIPTGEHELHIGDLKHSINVQT 772
>gi|297843058|ref|XP_002889410.1| hypothetical protein ARALYDRAFT_470222 [Arabidopsis lyrata subsp.
lyrata]
gi|297335252|gb|EFH65669.1| hypothetical protein ARALYDRAFT_470222 [Arabidopsis lyrata subsp.
lyrata]
Length = 763
Score = 722 bits (1864), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 355/790 (44%), Positives = 495/790 (62%), Gaps = 43/790 (5%)
Query: 21 LIVVVNVIAFSNSKPVLNKPDFPC--KPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKI 78
L ++ + S+S V ++ F C K + FC S+ I+ R K LI LTL EK+
Sbjct: 4 LAAILFFLISSSSVCVQSRETFACDIKDAATATLRFCQLSVPITERVKDLIGRLTLVEKV 63
Query: 79 QQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRS 138
L + A+AIPRLGI YEWWSE+LHG+++ GPG F G + TSFPQV+ + ASFN S
Sbjct: 64 SLLGNTAAAIPRLGIKGYEWWSEALHGVSNVGPGTKFGGVYPAATSFPQVITTVASFNAS 123
Query: 139 LWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEF 198
LW +IG V+ EARAMYN G GLT+W+PN+NI RDPRWGRGQETPGEDP+V YA +
Sbjct: 124 LWESIGRVVSNEARAMYNGGVGGLTYWSPNVNILRDPRWGRGQETPGEDPVVAGKYAASY 183
Query: 199 VKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFS 258
V+ QG +DR L ++ACCKH AYDL+ W
Sbjct: 184 VRGLQG--------------------------NDR-SRLKVAACCKHFTAYDLDNWNGVD 216
Query: 259 RYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEW 317
R+ FNA +++QD EDTF PFR C+++G + IMCSYN+VNGVP C +L +K RNEW
Sbjct: 217 RFHFNAKVSKQDIEDTFDVPFRMCVKEGNVASIMCSYNEVNGVPTCADPNLLKKTIRNEW 276
Query: 318 GFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQ 377
G GYI SDCD+V +++ Q+YT T E++AA +KAG+D++CG + HT A+ K ++
Sbjct: 277 GLNGYIVSDCDSVGVLYDTQHYTGTPEEAAADSIKAGLDLDCGPFLGAHTIDAVKKNLLR 336
Query: 378 EKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKN 437
E D+D AL+N +VQ+RLG+F+GD YG LGP VCT HK LAL+AA+QGIVLLKN
Sbjct: 337 ESDVDNALINTLTVQMRLGMFDGDIAAQPYGHLGPAHVCTPVHKGLALEAAQQGIVLLKN 396
Query: 438 DKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCH 497
LPL+ ++A+IGP + M G Y GI C S ++G+ Y ++T + GC
Sbjct: 397 HGSSLPLSSQRHRTVAVIGPNSDATVAMIGNYAGIACGYTSPVQGITGY-ARTVHQKGCV 455
Query: 498 DVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTS 557
DV C D F AV A+ AD ++V GLD + E E +DR SLLLPG+Q L++ VA+ +
Sbjct: 456 DVHCMDDRLFDAAVEAARGADATVLVMGLDQSIEAEFKDRNSLLLPGKQQELISRVAKAA 515
Query: 558 KRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTW 617
K PVILVL GGP+D+SFAE D +I +I+W GYPG+ G A+A+I+FG NPGG+LPMTW
Sbjct: 516 KGPVILVLMSGGPIDISFAEKDRKIPAIVWAGYPGQEGGTAIADILFGSANPGGKLPMTW 575
Query: 618 YPESF-TKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
YP+ + T +PM +M+MR S++ PGR+YRFY G VY FGHGLSYT +++ AP +
Sbjct: 576 YPQDYLTNLPMTEMSMRPIHSKRIPGRTYRFYDGPVVYPFGHGLSYTRFTHSIADAPKVI 635
Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
I+ + G+ +G + H C L V + VTN G DG+H +++F+
Sbjct: 636 PIAVRGRNGT------VSGKSIRVTH----ARCNRLSLGVHVDVTNVGSRDGTHTMLVFS 685
Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
P + P+KQL+ F+RVH K + + C+ LS+ ++ G R +P+G+H + +
Sbjct: 686 -APPGGEWAPKKQLVAFERVHVAVGEKKRVQVNIHVCKYLSVVDRAGNRRIPIGDHGIHI 744
Query: 797 GELRHSLTIE 806
G+ H+++++
Sbjct: 745 GDESHTVSLQ 754
>gi|9972374|gb|AAG10624.1|AC022521_2 Similar to xylosidase [Arabidopsis thaliana]
Length = 763
Score = 722 bits (1863), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 355/790 (44%), Positives = 495/790 (62%), Gaps = 43/790 (5%)
Query: 21 LIVVVNVIAFSNSKPVLNKPDFPC--KPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKI 78
L V++ + S+S V ++ F C K + FC S+ I R + LI LTL EK+
Sbjct: 4 LAVILFFLISSSSVCVHSRETFACDTKDAATATLRFCQLSVPIPERVRDLIGRLTLAEKV 63
Query: 79 QQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRS 138
L + A+AIPRLGI YEWWSE+LHG+++ GPG F G + TSFPQV+ + ASFN S
Sbjct: 64 SLLGNTAAAIPRLGIKGYEWWSEALHGVSNVGPGTKFGGVYPAATSFPQVITTVASFNAS 123
Query: 139 LWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEF 198
LW +IG V+ EARAMYN G GLT+W+PN+NI RDPRWGRGQETPGEDP+V YA +
Sbjct: 124 LWESIGRVVSNEARAMYNGGVGGLTYWSPNVNILRDPRWGRGQETPGEDPVVAGKYAASY 183
Query: 199 VKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFS 258
V+ QG +DR L ++ACCKH AYDL+ W
Sbjct: 184 VRGLQG--------------------------NDR-SRLKVAACCKHFTAYDLDNWNGVD 216
Query: 259 RYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEW 317
R+ FNA +++QD EDTF PFR C+++G + IMCSYNQVNGVP C +L +K RN+W
Sbjct: 217 RFHFNAKVSKQDIEDTFDVPFRMCVKEGNVASIMCSYNQVNGVPTCADPNLLKKTIRNQW 276
Query: 318 GFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQ 377
G GYI SDCD+V +++ Q+YT T E++AA +KAG+D++CG + HT A+ K ++
Sbjct: 277 GLNGYIVSDCDSVGVLYDTQHYTGTPEEAAADSIKAGLDLDCGPFLGAHTIDAVKKNLLR 336
Query: 378 EKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKN 437
E D+D AL+N +VQ+RLG+F+GD YG LGP VCT HK LAL+AA+QGIVLLKN
Sbjct: 337 ESDVDNALINTLTVQMRLGMFDGDIAAQPYGHLGPAHVCTPVHKGLALEAAQQGIVLLKN 396
Query: 438 DKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCH 497
LPL+ ++A+IGP + M G Y G+ C S ++G+ Y ++T + GC
Sbjct: 397 HGSSLPLSSQRHRTVAVIGPNSDATVTMIGNYAGVACGYTSPVQGITGY-ARTIHQKGCV 455
Query: 498 DVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTS 557
DV C D F AV A+ AD ++V GLD + E E +DR SLLLPG+Q LV+ VA+ +
Sbjct: 456 DVHCMDDRLFDAAVEAARGADATVLVMGLDQSIEAEFKDRNSLLLPGKQQELVSRVAKAA 515
Query: 558 KRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTW 617
K PVILVL GGP+D+SFAE D +I +I+W GYPG+ G A+A+I+FG NPGG+LPMTW
Sbjct: 516 KGPVILVLMSGGPIDISFAEKDRKIPAIVWAGYPGQEGGTAIADILFGSANPGGKLPMTW 575
Query: 618 YPESF-TKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
YP+ + T +PM +M+MR S++ PGR+YRFY G VY FGHGLSYT +++ AP +
Sbjct: 576 YPQDYLTNLPMTEMSMRPVHSKRIPGRTYRFYDGPVVYPFGHGLSYTRFTHNIADAPKVI 635
Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
I+ + G+ +G + H C L V + VTN G DG+H +++F+
Sbjct: 636 PIAVRGRNGT------VSGKSIRVTH----ARCDRLSLGVHVEVTNVGSRDGTHTMLVFS 685
Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
P + P+KQL+ F+RVH K + + C+ LS+ ++ G R +P+G+H + +
Sbjct: 686 -APPGGEWAPKKQLVAFERVHVAVGEKKRVQVNIHVCKYLSVVDRAGNRRIPIGDHGIHI 744
Query: 797 GELRHSLTIE 806
G+ H+++++
Sbjct: 745 GDESHTVSLQ 754
>gi|302796585|ref|XP_002980054.1| hypothetical protein SELMODRAFT_419541 [Selaginella moellendorffii]
gi|300152281|gb|EFJ18924.1| hypothetical protein SELMODRAFT_419541 [Selaginella moellendorffii]
Length = 779
Score = 721 bits (1862), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 366/774 (47%), Positives = 491/774 (63%), Gaps = 62/774 (8%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
+ FCNT L STR + LIS +TLQEKI QL +NA+ IPRLG+P YEWW E+LHG+A + P
Sbjct: 44 FGFCNTRLPTSTRVEDLISRMTLQEKIIQLVNNAAGIPRLGLPRYEWWQEALHGVAVS-P 102
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
GV F G TSFP +++AASF+ AV+ EARAM+N +AGLT+W+PN+NI
Sbjct: 103 GVKFGGKFPGATSFPMPILTAASFD---------AVSTEARAMHNYQRAGLTYWSPNVNI 153
Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
+RDPRWGRGQETPGEDP++ S YA +V+ Q N
Sbjct: 154 YRDPRWGRGQETPGEDPLLSSKYATFYVRGLQDTNLG----------------------- 190
Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
GD+L +SACCKH+ AYD++ W +R+ FNA++T+QD DT+ PPF+SC+E K S +
Sbjct: 191 --GDKLKVSACCKHMTAYDVDNWKGTTRFKFNAIVTQQDLSDTYNPPFQSCVEDAKVSSV 248
Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGFKG----------------YITSDCDAVATIF 334
MCSYN+VNGVP C +L R+ W G YI SDCD++ T F
Sbjct: 249 MCSYNRVNGVPTCADYNLLSATVRSSWNLNGSILLTCEVLLLYLPCSYIVSDCDSLQTFF 308
Query: 335 EYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
+ NY KT ED A L AG++++CG + HTQSAI GK+ E ++++AL L++VQ+R
Sbjct: 309 DNTNYAKTAEDVVADALLAGLNLDCGPFLAIHTQSAITNGKITEANVNQALRYLYNVQMR 368
Query: 395 LGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
LGL++G+PR YG LGP VCT E+++LALDAA++GIVLLKN+ LP +K+ + ++A
Sbjct: 369 LGLYDGNPRSQPYGNLGPQSVCTGENQQLALDAAKEGIVLLKNNGNVLPFSKSNIRTVAA 428
Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIA 514
IGP M G Y GIPC + GL AY ++ Y++GC DV C S++ A A
Sbjct: 429 IGPHAKATRAMIGNYQGIPCKYTTPHDGLSAY-ARVVYSAGCSDVACYSNSLIGSAASTA 487
Query: 515 KKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
+AD V++ GLDL QE E +DR SLLLPG+Q LVT V + +K PV+LV+ GG +DVS
Sbjct: 488 SQADAVVLFVGLDLNQEAEGKDRTSLLLPGKQQELVTEVTKAAKGPVVLVIFSGGSVDVS 547
Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA 634
FA+ D ++ +LW GYPGEAG A+A+++FGD NPGGRLP+TWYPESFT + M DMNMR
Sbjct: 548 FAKYDKKVQGMLWAGYPGEAGGAAIAQVLFGDHNPGGRLPVTWYPESFTGITMLDMNMRP 607
Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTI--SASLKAGSDKNILQ 692
D+SR YPGR+YRFYTG VY FG+G +Y+ S+KF AP L +A++K D N+
Sbjct: 608 DASRGYPGRTYRFYTGQSVYNFGYGKTYSKLSHKFKEAPLSLGFPEAAAVKRSCDGNL-- 665
Query: 693 QTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ-GTPEKQLI 751
T L+ H DE+T C++L V+I V N GD + V+L++ P + G P +QL
Sbjct: 666 -TCFHLN-AH-DEIT-CSTLTSKVRILVHNEGDRPSNRAVLLYSSPPNAGRDGAPIRQLA 721
Query: 752 GFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
GF +V + + +DPC+ LS A +G RIL G H L VG RH L I
Sbjct: 722 GFGKVSVAPGAVENVEIEIDPCKHLSHAGANGVRILHGGIHTLAVGNARHPLPI 775
>gi|65736613|dbj|BAD98523.1| alpha-L-arabinofuranosidase / beta-D-xylosidase [Pyrus pyrifolia]
Length = 774
Score = 721 bits (1862), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 363/773 (46%), Positives = 487/773 (63%), Gaps = 40/773 (5%)
Query: 39 KPDFPCKP--PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAY 96
+P F C P P + FC + I R + LI LTLQEKI L +NA A+PRLGI Y
Sbjct: 29 RPPFACDPRNPITRTLKFCRVRVPIHVRVQDLIGRLTLQEKIGLLVNNAIAVPRLGIQGY 88
Query: 97 EWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN 156
EWWSE+LHG+++ GPG F GT TSFPQV+ +AASFN SLW IG V+ EARAMYN
Sbjct: 89 EWWSEALHGVSNVGPGTKF-GTFLGATSFPQVITTAASFNESLWEEIGRVVSDEARAMYN 147
Query: 157 LGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGF 216
G AGLTFW+PN+NIFRDPRWGRGQETPGEDP++ + Y +VK QG DG
Sbjct: 148 GGAAGLTFWSPNVNIFRDPRWGRGQETPGEDPVLAAKYGARYVKGLQG------DGA--- 198
Query: 217 GFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQ 276
G+ L ++ACCKH AYDL+ W R+ FNA +++QD EDT+
Sbjct: 199 -----------------GNRLKVAACCKHYTAYDLDNWNGVDRFHFNARVSKQDLEDTYN 241
Query: 277 PPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFE 335
PF++C+ G + +MCSYNQVNG P C DL + R +W GYI SDCD+V ++
Sbjct: 242 VPFKACVVDGNVASVMCSYNQVNGKPTCADPDLLKGTIRGQWKLNGYIVSDCDSVGVYYD 301
Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
Q+YTKT E +AA +KAG+D++CG + HT++AI G+V E DI+ AL N +VQ+RL
Sbjct: 302 NQHYTKTPEAAAAYAIKAGLDLDCGPFLGIHTEAAIRTGQVNEIDINYALANTITVQMRL 361
Query: 396 GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAII 455
G+F+G+P +YG LG DVC +LAL+AARQGIVLL+N LPL+ ++A+I
Sbjct: 362 GMFDGEPSTQRYGNLGLADVCKPSSNELALEAARQGIVLLENRGNSLPLSTIRHRTVAVI 421
Query: 456 GPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAK 515
GP + M G Y GI C + L+G+ Y H A GC DV CN + A A+
Sbjct: 422 GPNSDVTETMIGNYAGIACGYTTPLQGIARYTRTIHQA-GCTDVHCNGNQLIGAAEVAAR 480
Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
+AD ++V GLD + E E RDR LLLPG Q LV+ VAR S+ P ILV+ GGP+DV+F
Sbjct: 481 QADATVLVIGLDQSIEAEFRDRTGLLLPGHQQELVSRVARASRGPTILVIMSGGPIDVTF 540
Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRA 634
A+ D I +I+W+GYPG+AG A+A+++FG NP G+LPMTWYP+++ +PM DM MRA
Sbjct: 541 AKNDPCIGAIIWVGYPGQAGGTAIADVLFGTTNPSGKLPMTWYPQNYVANLPMTDMAMRA 600
Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS-ASLKAGSDKNILQQ 693
D +R YPGR+YRFY G V+ FG GLSYT +S+ P+ +++ SL A + +L
Sbjct: 601 DPARGYPGRTYRFYKGPVVFPFGMGLSYTRFSHSLAQGPTLVSVPLTSLVAAKNTTMLSN 660
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
G R+ + T+C SL I + N G +DG+H +++FA P + P KQL+GF
Sbjct: 661 HGVRVSH------TNCDSLSLDFHIDIKNTGTMDGTHTLLVFATQP-AGKWAPNKQLVGF 713
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
+VH VA + + GV C+ LSI +K G R +PLG H L +G+L+H ++IE
Sbjct: 714 HKVHIVAGSERRVRVGVHVCKHLSIVDKLGIRRIPLGQHKLEIGDLKHYVSIE 766
>gi|297834874|ref|XP_002885319.1| beta-1,4-xylosidase [Arabidopsis lyrata subsp. lyrata]
gi|297331159|gb|EFH61578.1| beta-1,4-xylosidase [Arabidopsis lyrata subsp. lyrata]
Length = 865
Score = 719 bits (1855), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 356/784 (45%), Positives = 495/784 (63%), Gaps = 60/784 (7%)
Query: 38 NKPDFPC--KPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPA 95
++ +F C P Y FCN SLS RAK L+S L+L+EK+QQL + A+ + RLG+P
Sbjct: 23 SQKNFACDRNDPATAKYGFCNVSLSYEARAKDLVSRLSLKEKVQQLVNKATGVSRLGVPP 82
Query: 96 YEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY 155
YEWWSE+LHG++ GPGV FNGTV TSFP +++AASFN SLW +G V+ EARAM+
Sbjct: 83 YEWWSEALHGVSDVGPGVRFNGTVPGATSFPATILTAASFNTSLWLKMGEVVSTEARAMH 142
Query: 156 NLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
N+G AGLT+W+PN+NIFRDPRWGRGQETPGEDP+VVS YAV +VK Q
Sbjct: 143 NVGLAGLTYWSPNVNIFRDPRWGRGQETPGEDPLVVSKYAVNYVKGLQDV---------- 192
Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTF 275
+++ + L +S+CCKH AYDL+ W R+ F+A +T+QD EDT+
Sbjct: 193 -------------QDAGKSRRLKVSSCCKHYTAYDLDNWKGIDRFHFDAKVTKQDLEDTY 239
Query: 276 QPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIF 334
QPPF+SC+E+G S +MCSYN+VNG+P C +L + R +W GYI SDCD++ F
Sbjct: 240 QPPFKSCVEEGDVSSVMCSYNRVNGIPTCADPNLLRGVIRGQWRLDGYIVSDCDSIQVYF 299
Query: 335 EYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
+ +YTKT +++NCG + ++T++A+ K+ ++D AL+ + V +R
Sbjct: 300 DDIHYTKTR-----------LNMNCGDFLGKYTENAVKLKKLNGSEVDEALIYNYIVLMR 348
Query: 395 LGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
LG F+GDP+ +G+LGP DVC+ +H+ LAL+AA+QGIVLL+N + LPL+K AV +A+
Sbjct: 349 LGFFDGDPKSLPFGQLGPSDVCSKDHQMLALEAAKQGIVLLEN-RGDLPLSKTAVKKIAV 407
Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV-SKTHYASGCHDVPCNSDAGFHEAVRI 513
IGP N M Y G+PC S L+GL+ YV K Y GC DV C AV+
Sbjct: 408 IGPNANATKVMISNYAGVPCKYTSPLQGLQKYVPEKVVYEPGCKDVNCGEQTLISAAVKA 467
Query: 514 AKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
+AD ++V GLD T E E DRV+L LPG Q LV VA +K+ V+LV+ GP+D+
Sbjct: 468 VSEADVTVLVVGLDQTVEAEGLDRVNLTLPGYQEKLVRDVANAAKKTVVLVIMSAGPIDI 527
Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNM 632
SFA+ S IS++LW+GYPGEAG A+A++IFGD+NP GRLP TWY + F KV M DMNM
Sbjct: 528 SFAKNLSTISAVLWVGYPGEAGGDAIAQVIFGDYNPSGRLPETWYSQEFADKVAMTDMNM 587
Query: 633 RADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQ 692
R +S+ +PGRSYRFYTG +Y FG+GLSY+ +S LSAPS + I K N+ +
Sbjct: 588 RPNSTSGFPGRSYRFYTGKPIYKFGYGLSYSAFSTFVLSAPSIIHI----KTNPILNLNK 643
Query: 693 QTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ-----GTPE 747
T + V+ C L+ + I V N G GSHVV++F + PK S+ G P+
Sbjct: 644 TTSIDISTVN------CHDLKIRIVIGVKNRGQRSGSHVVLVFWKPPKCSKTLVGAGVPQ 697
Query: 748 KQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG-----ELRHS 802
QL+GF+RV +++++ D C+ LS+ + HG+R L G+H L++G ++ H
Sbjct: 698 TQLVGFERVEVGRSMTEKVTVEFDVCKALSLVDTHGKRKLVTGHHTLVIGSNSDQQIYHH 757
Query: 803 LTIE 806
L +
Sbjct: 758 LNVR 761
>gi|371917280|dbj|BAL44716.1| SlArf/Xyl1 [Solanum lycopersicum]
Length = 771
Score = 719 bits (1855), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 359/794 (45%), Positives = 507/794 (63%), Gaps = 44/794 (5%)
Query: 21 LIVVVNVIAFSNSKPVLNKPDFPCKPPH--FDSYPFCNTSLSISTRAKSLISLLTLQEKI 78
I+++ V+AF+ S ++ F C P + + FC TSL I R + LI+ LTLQEKI
Sbjct: 6 FILIIFVLAFAYSGE--SRQPFACDPANAGIRNLRFCKTSLPIHVRVQDLIARLTLQEKI 63
Query: 79 QQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRS 138
+ L +NA+ + RLGI YEWWSE+LHG+++ G GV F G TSFPQV+ +AASFN S
Sbjct: 64 RLLVNNAAPVQRLGISGYEWWSEALHGVSNTGYGVKFGGAFPGATSFPQVITTAASFNAS 123
Query: 139 LWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEF 198
LW IG V+ E RAMYN G AGLTFW+PN+NIFRDPRWGRGQETPGEDP +V+ Y V +
Sbjct: 124 LWEEIGRVVSEEGRAMYNGGAAGLTFWSPNVNIFRDPRWGRGQETPGEDPHLVAQYGVSY 183
Query: 199 VKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFS 258
VK QG + + L ++ACCKH AYDL+ W +
Sbjct: 184 VKGLQGGGGRGN------------------------TRLKVAACCKHYTAYDLDDWNGYD 219
Query: 259 RYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEW 317
RY FNA ++ QD EDT+ PF++C+ +G + +MCSYNQ+NG P+C L + RN+W
Sbjct: 220 RYHFNAKVSMQDLEDTYNAPFKACVVEGNVASVMCSYNQINGKPSCADPTLLRDTIRNQW 279
Query: 318 GFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQ 377
GYI SDCD+V +FE Q+YT+ ED+AA +KAG+D++CG + HT A+ GKV
Sbjct: 280 HLNGYIVSDCDSVGVLFEKQHYTRYPEDAAAITIKAGLDLDCGPFLAIHTDKAVHTGKVS 339
Query: 378 EKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKN 437
+ +I+ AL N +VQ+RLG+F+G G Y LGP DVC+ H++LAL AAR+GIVLLKN
Sbjct: 340 QVEINNALANTITVQMRLGMFDGP--NGPYANLGPKDVCSPAHQQLALQAAREGIVLLKN 397
Query: 438 DKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCH 497
+ LPL+ ++A+IGP + M G Y G+PC S L+G+ Y ++T + GC
Sbjct: 398 IGQALPLSTKRHRTVAVIGPNSDATLAMIGNYAGVPCGYISPLQGISRY-ARTIHQQGCM 456
Query: 498 DVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTS 557
V C + F A A+ AD ++V GLD + E E +DRV+LLLPG Q L++ VA S
Sbjct: 457 GVACPGNQNFGLAEVAARHADATVLVMGLDQSIEAEAKDRVTLLLPGHQQDLISRVAMAS 516
Query: 558 KRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTW 617
K PV+LVL GGP+DV+FA+ D ++SSI+W+GYPG+AG A+A+++FG NPGG+LPMTW
Sbjct: 517 KGPVVLVLMSGGPIDVTFAKNDPRVSSIVWVGYPGQAGGAAIADVLFGATNPGGKLPMTW 576
Query: 618 YPESFT-KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAP--- 673
YP+ + KV M +M+MRA+ S+ YPGR+YRFY G V+ FG G+SYT +S +SAP
Sbjct: 577 YPQDYVAKVSMANMDMRANPSKGYPGRTYRFYKGPTVFPFGAGISYTTFSQHLVSAPITV 636
Query: 674 SELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVM 733
S T+ + ++ L + + + +H T+C SL + I V N GD+DG+H V+
Sbjct: 637 SVPTLHSHDLVSNNTTTLMKAKATVRTIH----TNCESLDIDMHIDVKNTGDMDGTHAVL 692
Query: 734 LFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHV 793
+F+ P T KQL+ F++VH VA + + ++ C+ LS+A+++G R + +G H
Sbjct: 693 IFSTPP---DPTETKQLVAFEKVHVVAGAKQRVKINMNACKHLSVADEYGVRRIYMGEHK 749
Query: 794 LMVGE-LRHSLTIE 806
+ VG+ L+HS+T +
Sbjct: 750 IHVGDHLKHSITFQ 763
>gi|356574315|ref|XP_003555294.1| PREDICTED: LOW QUALITY PROTEIN: probable beta-D-xylosidase 5-like
[Glycine max]
Length = 901
Score = 716 bits (1847), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 365/779 (46%), Positives = 504/779 (64%), Gaps = 39/779 (5%)
Query: 44 CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESL 103
C ++PFC+TSLS RAK L+S LTLQEK QQL + ++ I RLG+PAYEWWSE+L
Sbjct: 26 CGSAKTSNFPFCDTSLSYEDRAKDLVSRLTLQEKTQQLVNPSAGISRLGVPAYEWWSEAL 85
Query: 104 HGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLT 163
HG+++ GPG F+ V TSFP V++SAASFN SLW +G V+ EARAMYN+ AGLT
Sbjct: 86 HGVSNLGPGTRFDKKVPGATSFPAVILSAASFNASLWQKMGQVVSTEARAMYNVDLAGLT 145
Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
FW+PN+N+FRDPRWGRGQETPGEDP+VVS YAV +++ Q + +D
Sbjct: 146 FWSPNVNVFRDPRWGRGQETPGEDPLVVSRYAVMYLRGLQ----EVED------------ 189
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
E S + D L +S+CCKH AYDL+ W R+ F+A +T+QD ED++QPPF+SC+
Sbjct: 190 -----EASAKADRLKVSSCCKHYTAYDLDNWKGIDRFHFDAKVTKQDLEDSYQPPFKSCV 244
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
+G S +MCSYN+VNG+P C DL + R +WG GYI SDCD+V + +YT T
Sbjct: 245 VEGHVSSVMCSYNRVNGIPTCADPDLLKGIIRGQWGLDGYIVSDCDSVEVYYNAIHYTAT 304
Query: 343 HEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDP 402
ED+ A LKAG+++NCG + ++T +A++ KV +D+AL+ + V +RLG F+ DP
Sbjct: 305 PEDAVALALKAGLNMNCGDFLKKYTANAVNLKKVDVATVDQALVYNYIVLMRLGFFD-DP 363
Query: 403 RKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNI 462
+ + LGP DVCT ++++LALDAA+QGIVLL+N+ LPL++ + LA+IGP N
Sbjct: 364 KSLPFANLGPSDVCTKDNQQLALDAAKQGIVLLENNNGALPLSQTNIKKLAVIGPNANAT 423
Query: 463 SQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIV 522
+ M Y GIPC S L+GL+ Y+S +YA GC +V C++ + AV+ A AD V++
Sbjct: 424 TVMISNYAGIPCRYTSPLQGLQKYISSVNYAPGCSNVKCDNQSLIAAAVKAAASADAVVL 483
Query: 523 VAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQI 582
V GLD + E E DR +L LPG Q V VA +K VILV+ GP+D+S ++ S I
Sbjct: 484 VVGLDQSIEAEGLDRENLTLPGFQEKFVKDVAGATKGKVILVIMAAGPIDISSTKSVSNI 543
Query: 583 SSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYP 641
ILW+GYPG+AG A+A++IFGD+NPGGR P TWYP+S+ +VPM DMNMRA+ SR +P
Sbjct: 544 GGILWVGYPGQAGGDAIAQVIFGDYNPGGRSPFTWYPQSYVDQVPMTDMNMRANKSRNFP 603
Query: 642 GRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL-----TISASLKAGSDKNILQQTGS 696
GR+YRFY G +Y FGHGLSY+ +S SAPS + +IS S N Q S
Sbjct: 604 GRTYRFYNGNSLYEFGHGLSYSTFSMYVASAPSSIMIENTSISEPHNMLSSNNSGTQVES 663
Query: 697 RLDYVHIDEVT-SCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ---GTPEKQLIG 752
D ID T +C L F + I V N G ++GSHVV++F P S+ G P KQLIG
Sbjct: 664 LSDGQAIDISTINCQDLTFLLVIGVKNNGPLNGSHVVLVFWE-PATSEFVIGAPIKQLIG 722
Query: 753 FDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG-----ELRHSLTIE 806
F+RV V ++ ++ +D C+ +S + G+R L +G H ++VG ++RH + ++
Sbjct: 723 FERVQVVVGVTEFVTVKIDICQLISNVDSDGKRKLVIGQHTILVGSSSETQVRHHIDVK 781
>gi|449466797|ref|XP_004151112.1| PREDICTED: beta-D-xylosidase 1-like [Cucumis sativus]
Length = 770
Score = 716 bits (1847), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 356/756 (47%), Positives = 488/756 (64%), Gaps = 42/756 (5%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
FC SL I R K LI LTL EKI+ L +NA A+PRLGI YEWWSE+LHG+++ GPG
Sbjct: 46 FCQESLGIEERVKDLIGRLTLGEKIRLLVNNAIAVPRLGIRGYEWWSEALHGVSNVGPGT 105
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFR 173
F GT TSFPQV+ +AASFN+SLW IG V+ EARAMYN G AGLT+W+PN+NIFR
Sbjct: 106 KFGGTFPGATSFPQVITTAASFNQSLWLLIGRVVSDEARAMYNGGTAGLTYWSPNVNIFR 165
Query: 174 DPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDR 233
DPRWGRGQETPGEDP++ + YA +V+ QG + K
Sbjct: 166 DPRWGRGQETPGEDPILAAKYAANYVQGLQGNDGKK------------------------ 201
Query: 234 GDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMC 293
L ++ACCKH AYDL+ W RY FNA +++QD EDT+ PF++C+ +GK + +MC
Sbjct: 202 --RLKVAACCKHYTAYDLDNWNGVDRYHFNAKVSKQDLEDTYNVPFKACVVEGKVASVMC 259
Query: 294 SYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLK 352
SYNQVNG P C DL + R WG GYI SDCD+V +++ Q++T T E++AA +K
Sbjct: 260 SYNQVNGKPTCADPDLLKNTIRGAWGLDGYIVSDCDSVGVLYDSQHFTPTPEEAAASTIK 319
Query: 353 AGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP 412
AG+D++CG + HT +A+ +G ++E D++ AL NL SVQ+RLG+F+G+P YG LGP
Sbjct: 320 AGLDLDCGPFLAVHTATAVGRGLLKEVDLNNALANLLSVQMRLGMFDGEPAAQPYGNLGP 379
Query: 413 DDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGI 472
DVCT HK LAL+AARQGIVLL+N LPL+ ++A+IGP + M G Y G+
Sbjct: 380 KDVCTPAHKHLALEAARQGIVLLQNRAGALPLSPTRHRTVAVIGPNSDATVTMIGNYAGV 439
Query: 473 PCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQET 532
C + ++G+ YV KT +A GC +V C D EA A+ AD +VV GLD + E
Sbjct: 440 ACEYTTPVQGISKYV-KTIHAKGCANVACVGDQLIGEAEAAARVADAAVVVVGLDQSIEA 498
Query: 533 EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPG 592
E RDR +LLPG+Q LV + K P ++VL GGP+DVSFA+ D +IS ILW+GYPG
Sbjct: 499 ESRDRNGVLLPGKQEELVRRIGLACKGPTVVVLMSGGPIDVSFAKNDGKISGILWVGYPG 558
Query: 593 EAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRFYTGT 651
+AG A+A+++FG NPGG+LPMTWYP+S+ KVPM +M +R D S YPGR+YRFY G
Sbjct: 559 QAGGAAIADVLFGATNPGGKLPMTWYPQSYLAKVPMTNMGLRPDPSTGYPGRTYRFYKGP 618
Query: 652 QVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTS 711
V+ FG GLSY+ +S F AP+++++ S + + ++ + H D C S
Sbjct: 619 VVFPFGFGLSYSKFSQSFAEAPTKISLPLSSLSPNSSATVKVS-------HTD----CAS 667
Query: 712 LR-FHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGV 770
+ + I V N G VDGSH +++F+ VP + +PEK LIGF++VH +A K + G+
Sbjct: 668 VSDLPIMIDVKNTGTVDGSHTILVFSTVPNQTW-SPEKHLIGFEKVHLIAGSQKRVRIGI 726
Query: 771 DPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
C+ LS ++ G R +P+G H L +G+L HS++++
Sbjct: 727 HVCDHLSRVDEFGTRRIPMGEHKLHIGDLTHSISLQ 762
>gi|296083056|emb|CBI22460.3| unnamed protein product [Vitis vinifera]
Length = 896
Score = 714 bits (1843), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 353/755 (46%), Positives = 488/755 (64%), Gaps = 69/755 (9%)
Query: 47 PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGI 106
P+ +PFCNTSL RA L+S LTLQEK +QL ++A+ I RLG+P YEWWSE+LHG+
Sbjct: 58 PNSSQFPFCNTSLPYQDRASDLVSRLTLQEKAKQLINSATGISRLGVPDYEWWSEALHGV 117
Query: 107 ASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWA 166
+++G GV+F+ + +VT FP V++SAASFN SLW +G V+ E RAMYN+GQAGLT+W+
Sbjct: 118 SNSGIGVHFHDPIPAVTIFPAVILSAASFNESLWYTMGQVVSTEGRAMYNVGQAGLTYWS 177
Query: 167 PNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKG 226
PN+NIFRDPRWGRGQETPGEDP+VVS YAV +V R L+
Sbjct: 178 PNVNIFRDPRWGRGQETPGEDPLVVSRYAVNYV----------------------RGLQE 215
Query: 227 FGEESD-RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
G+E + D L +S+CCKH AYD++KW R+ F+A +T QD EDT+QPPF+SC+E+
Sbjct: 216 VGKEGNFAADRLKVSSCCKHYTAYDVDKWKGVDRFHFDAKVTLQDLEDTYQPPFKSCVEE 275
Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
G S +MCSYN+VNGVP C +L + R++WG GYI SDCD++ E NYT+T E
Sbjct: 276 GHVSSVMCSYNRVNGVPTCANPELLKGVIRDQWGLDGYIVSDCDSIMVYHERMNYTETPE 335
Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
D+ A LKAG+++NCG+ + +T++A++ GKV+E +++AL+ + V +RLG F+GDP
Sbjct: 336 DAVALALKAGLNLNCGSYLGDYTKNAVNLGKVKESIVNQALIYNYIVLMRLGFFDGDPTM 395
Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
+GK+GP DVCT +H+ LALDAA+QGIVLL N+ LPL+ N +LA+IGP + +
Sbjct: 396 LPFGKMGPSDVCTVDHQLLALDAAKQGIVLLHNNGA-LPLSPNTTKTLAVIGPNADATNT 454
Query: 465 MGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVA 524
M Y G+PC S L+GL+ YVS Y GC +V C+ + A IA AD +VV
Sbjct: 455 MLSNYAGVPCRYTSPLQGLQKYVSAVSYEKGCANVSCSEETLIEGAASIASMADATVVVV 514
Query: 525 GLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISS 584
GLDL E ED DRV+L LPG Q LV A+ + VILV+ GP+D+SF + S+I
Sbjct: 515 GLDLFIEAEDLDRVNLTLPGFQEKLVMEAAKAANGTVILVVMSAGPVDISFVKNVSKIGG 574
Query: 585 ILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGR 643
ILW+GYPG+AG A++++IFGD+NPGGR P TWYP+ + +VPM DMNMR +++ +PGR
Sbjct: 575 ILWVGYPGQAGGDAISQVIFGDYNPGGRSPFTWYPQEYVDQVPMTDMNMRPNATSNFPGR 634
Query: 644 SYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHI 703
+YRFYTG +Y FGHGLSY+ + YK LS +I
Sbjct: 635 TYRFYTGKSLYQFGHGLSYSTF-YKNLS------------------------------NI 663
Query: 704 DEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVS-QGTPEKQLIGFDRVHTVAKG 762
D V I V NAG++DG+HVV+ F + P+ +G P +L+GF+RV
Sbjct: 664 DIV-----------IGVKNAGEIDGTHVVLAFWKPPRSGVRGAPGVELVGFERVEVKRGK 712
Query: 763 SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
++ + +D C ++S ++ G+R L +G H L+VG
Sbjct: 713 TEMVGMRLDVCGKISNVDEEGKRKLVMGMHTLVVG 747
>gi|86553064|gb|AAS17751.2| beta xylosidase [Fragaria x ananassa]
Length = 772
Score = 714 bits (1843), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 352/793 (44%), Positives = 504/793 (63%), Gaps = 43/793 (5%)
Query: 18 VLQLIVVVNVIAFSNSKPVLNKPDFPCKP--PHFDSYPFCNTSLSISTRAKSLISLLTLQ 75
++ L++ V+ + F+ V +P F C P P + FC T + + R + LI LTLQ
Sbjct: 11 LIALVLCVSALLFN---LVHARPPFACDPRNPLTRGFKFCRTRVPVHVRVQDLIGRLTLQ 67
Query: 76 EKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASF 135
EKI+ L +NA A+PRLGI YEWWSE+LHG+++ GPG F G TSFPQV+ +AASF
Sbjct: 68 EKIRLLVNNAIAVPRLGIQGYEWWSEALHGVSNVGPGTKFGGAFPGATSFPQVITTAASF 127
Query: 136 NRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYA 195
N+SLW IG V+ EARAMYN GQAGLT+W+PN+NIFRDPRWGRGQETPGEDP++ + YA
Sbjct: 128 NQSLWQEIGQVVSDEARAMYNGGQAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSAKYA 187
Query: 196 VEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG 255
+VK QG DG G+ L ++ACCKH AYDL+ W
Sbjct: 188 ASYVKGLQG------DGA--------------------GNRLKVAACCKHYTAYDLDNWN 221
Query: 256 NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-R 314
R+ FNA +++QD DT+ PFR C+ +GK + +MCSYNQVNG P C DL + R
Sbjct: 222 GVDRFHFNARVSKQDLADTYDVPFRGCVLEGKVASVMCSYNQVNGKPTCADPDLLKNTIR 281
Query: 315 NEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKG 374
EW GYI SDCD+V ++ Q+YT+T E++AA +KAG+D++CG + HT+ AI G
Sbjct: 282 GEWKLNGYIVSDCDSVGVFYDQQHYTRTPEEAAAEAIKAGLDLDCGPFLAIHTEGAIKAG 341
Query: 375 KVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVL 434
+ E D+D AL N +VQ+RLG+F+G+P +YG LGP DVCT H++LAL+A+RQGIVL
Sbjct: 342 LLPEIDVDYALANTLTVQMRLGMFDGEPSAQQYGNLGPRDVCTPAHQELALEASRQGIVL 401
Query: 435 LKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYAS 494
L+N+ LPL+ ++A++GP + M G Y G+ C + L+G+ Y +KT +
Sbjct: 402 LQNNGHTLPLSTVRHRTVAVVGPNSDVTETMIGNYAGVACGYTTPLQGIGRY-TKTIHQQ 460
Query: 495 GCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVA 554
GC +V C ++ F A A++AD ++V GLD + E E RDR L++PG Q LV+ VA
Sbjct: 461 GCTNVACTTNQLFGAAEAAARQADATVLVMGLDQSIEAEFRDRTDLVMPGHQQELVSRVA 520
Query: 555 RTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLP 614
R S+ P +LVL GGP+DVSFA+ D +I +I+W+GYPG+AG A+A+++FG NP G+LP
Sbjct: 521 RASRGPTVLVLMSGGPIDVSFAKNDPKIGAIIWVGYPGQAGGTAMADVLFGTTNPSGKLP 580
Query: 615 MTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAP 673
MTWYP+ + +KVPM +M MRA R YPGR+YRFY G V+ FG GLSYT +++ P
Sbjct: 581 MTWYPQDYVSKVPMTNMAMRA--GRGYPGRTYRFYKGPVVFPFGLGLSYTTFAHSLAQVP 638
Query: 674 SELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVM 733
+ +++ + + + + + + R+ + T+C L + + V N G DG+H ++
Sbjct: 639 TSVSVPLTSLSATTNSTMLSSAVRVSH------TNCNPLSLALHVVVKNTGARDGTHTLL 692
Query: 734 LFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHV 793
+F+ P + KQL+GF +VH VA K + V C+ LS+ ++ G R +P+G H
Sbjct: 693 VFSSPPS-GKWAANKQLVGFHKVHIVAGSHKRVKVDVHVCKHLSVVDQFGIRRIPIGEHK 751
Query: 794 LMVGELRHSLTIE 806
L +G+L H +++E
Sbjct: 752 LQIGDLEHHISVE 764
>gi|449469042|ref|XP_004152230.1| PREDICTED: probable beta-D-xylosidase 2-like [Cucumis sativus]
Length = 769
Score = 713 bits (1841), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 351/774 (45%), Positives = 494/774 (63%), Gaps = 41/774 (5%)
Query: 38 NKPDFPCKPPH--FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPA 95
++ F C P + YPFC SL + R K LI LTL+EK++ L NA +PRLGI A
Sbjct: 23 SRAPFACDPNNSVTTDYPFCRRSLVVGERVKDLIGRLTLEEKVKLLVSNAGGVPRLGIKA 82
Query: 96 YEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY 155
Y+WWSE+LHG+++ GPG F G + TSFPQV+ +AASFN SLW IG V+ EARAMY
Sbjct: 83 YQWWSEALHGVSNVGPGTRFGGEFPAATSFPQVISTAASFNASLWEAIGRVVSDEARAMY 142
Query: 156 NLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
N G GLT+W+PN+NIFRDPRWGRGQETPGEDP++ YAV +V+ QG
Sbjct: 143 NGGVGGLTYWSPNVNIFRDPRWGRGQETPGEDPILAGTYAVNYVRGLQGT---------- 192
Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTF 275
G+ L ++ACCKH AYDL+ W R+ FNA +++QD EDTF
Sbjct: 193 -----------------EGNRLKVAACCKHFTAYDLDNWNGVDRFHFNAQVSKQDIEDTF 235
Query: 276 QPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIF 334
+ PFR C++ GK S +MCSYNQVNGVP C +L R++W GYI SDCD+V +
Sbjct: 236 EVPFRMCVKGGKVSSVMCSYNQVNGVPTCADPNLLTNTLRSQWHLDGYIVSDCDSVGVFY 295
Query: 335 EYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
Q+YT T E++AA +KAG+D++CG+ + HT++A+ +G + E I+ AL N SVQ+R
Sbjct: 296 NSQHYTSTPEEAAAMAIKAGLDLDCGSFLETHTENAVKRGLLNESHINGALSNTLSVQMR 355
Query: 395 LGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
LG+F+GD + Y LG VC+ +++LA+DAARQGIVLL+N + LPL+ N +A+
Sbjct: 356 LGMFDGDLKTQPYAHLGAKHVCSDHNRQLAVDAARQGIVLLENRRGSLPLSTNRHRIVAV 415
Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIA 514
+GP N M G Y GI C + L+G+ Y ++T + GC V C S+ F A+ A
Sbjct: 416 VGPNSNATLTMIGNYAGIACEYITPLQGISKY-TRTIHQEGCRGVACRSNKFFGGAIEAA 474
Query: 515 KKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
+ AD V++V GLD + E E RDR LLLPG Q LV VA +K PVILVL GGP+DVS
Sbjct: 475 RVADAVVLVMGLDQSIEAEFRDRAGLLLPGLQPDLVLKVASVAKGPVILVLMSGGPIDVS 534
Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMR 633
FA+ +IS I+W GYPG+AG A+A+++FG NPGG+LPMTWYP+ + +K+PM M++R
Sbjct: 535 FAKDHPKISGIIWGGYPGQAGGLAIADVLFGQTNPGGKLPMTWYPQDYVSKLPMTTMSLR 594
Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
+S YPGR+YRFY G VY FGHGLSYT +++K LSAP+ LT+ + N +
Sbjct: 595 PGTS--YPGRTYRFYKGPVVYPFGHGLSYTAFTHKILSAPTTLTVPVTGHR-HPHNGSEF 651
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
G + H C L ++++V N G DG+H +++++ +P + P+KQL+ F
Sbjct: 652 WGKAVRVTH----AKCDRLSLVIKVAVRNIGARDGAHTLLVYS-IPPMGVWVPQKQLVAF 706
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE-LRHSLTIE 806
++VH A+ KE+ + C+ LS+ +K+G R +P+G H + +G+ +RH ++++
Sbjct: 707 EKVHIDAQALKEVQINIHVCKLLSVVDKYGIRRVPMGEHGIDIGDNVRHIVSLQ 760
>gi|449484229|ref|XP_004156823.1| PREDICTED: LOW QUALITY PROTEIN: probable beta-D-xylosidase 2-like
[Cucumis sativus]
Length = 769
Score = 713 bits (1840), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 351/774 (45%), Positives = 494/774 (63%), Gaps = 41/774 (5%)
Query: 38 NKPDFPCKPPH--FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPA 95
++ F C P + YPFC SL + R K LI LTL+EK++ L NA +PRLGI A
Sbjct: 23 SRAPFACDPNNSVTTDYPFCRRSLVVEERVKDLIGRLTLEEKVKLLVSNAGGVPRLGIKA 82
Query: 96 YEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY 155
Y+WWSE+LHG+++ GPG F G + TSFPQV+ +AASFN SLW IG V+ EARAMY
Sbjct: 83 YQWWSEALHGVSNVGPGTRFGGEFPAATSFPQVISTAASFNASLWEAIGRVVSDEARAMY 142
Query: 156 NLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
N G GLT+W+PN+NIFRDPRWGRGQETPGEDP++ YAV +V+ QG
Sbjct: 143 NGGVGGLTYWSPNVNIFRDPRWGRGQETPGEDPILAGTYAVNYVRGLQGT---------- 192
Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTF 275
G+ L ++ACCKH AYDL+ W R+ FNA +++QD EDTF
Sbjct: 193 -----------------EGNRLKVAACCKHFTAYDLDNWNGVDRFHFNAQVSKQDIEDTF 235
Query: 276 QPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIF 334
+ PFR C++ GK S +MCSYNQVNGVP C +L R++W GYI SDCD+V +
Sbjct: 236 EVPFRMCVKGGKVSSVMCSYNQVNGVPTCADPNLLTNTLRSQWHLDGYIVSDCDSVGVFY 295
Query: 335 EYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
Q+YT T E++AA +KAG+D++CG+ + HT++A+ +G + E I+ AL N SVQ+R
Sbjct: 296 NSQHYTSTPEEAAAMAIKAGLDLDCGSFLETHTENAVKRGLLNESHINGALSNTLSVQMR 355
Query: 395 LGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
LG+F+GD + Y LG VC+ +++LA+DAARQGIVLL+N + LPL+ N +A+
Sbjct: 356 LGMFDGDLKTQPYAHLGAKHVCSDHNRQLAVDAARQGIVLLENRRGSLPLSTNRHRIVAV 415
Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIA 514
+GP N M G Y GI C + L+G+ Y ++T + GC V C S+ F A+ A
Sbjct: 416 VGPNSNATLTMIGNYAGIACEYITPLQGISKY-TRTIHQEGCRGVACRSNKFFGGAIEAA 474
Query: 515 KKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
+ AD V++V GLD + E E RDR LLLPG Q LV VA +K PVILVL GGP+DVS
Sbjct: 475 RVADAVVLVMGLDQSIEAEFRDRAGLLLPGLQPDLVLKVASVAKGPVILVLMSGGPIDVS 534
Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMR 633
FA+ +IS I+W GYPG+AG A+A+++FG NPGG+LPMTWYP+ + +K+PM M++R
Sbjct: 535 FAKDHPKISGIIWGGYPGQAGGLAIADVLFGQTNPGGKLPMTWYPQDYVSKLPMTTMSLR 594
Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
+S YPGR+YRFY G VY FGHGLSYT +++K LSAP+ LT+ + N +
Sbjct: 595 PGTS--YPGRTYRFYKGPVVYPFGHGLSYTAFTHKILSAPTTLTVPVTGHR-HPHNGSEF 651
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
G + H C L ++++V N G DG+H +++++ +P + P+KQL+ F
Sbjct: 652 WGKAVRVTH----AKCDRLSLVIKVAVRNIGARDGAHTLLVYS-IPPMGVWVPQKQLVAF 706
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE-LRHSLTIE 806
++VH A+ KE+ + C+ LS+ +K+G R +P+G H + +G+ +RH ++++
Sbjct: 707 EKVHIDAQALKEVQINIHVCKLLSVVDKYGIRRVPMGEHGIDIGDNVRHIVSLQ 760
>gi|168046596|ref|XP_001775759.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162672911|gb|EDQ59442.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 784
Score = 711 bits (1834), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 364/791 (46%), Positives = 502/791 (63%), Gaps = 33/791 (4%)
Query: 30 FSNSKPVLNKPDFPCKP--PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASA 87
F + + K + C P P +PFCNTS+S R + LIS LT+QEKI+QL + A+
Sbjct: 8 FEKPRIKVYKLQYACDPDGPADLLFPFCNTSISDDDRVEDLISRLTIQEKIEQLVNTAAN 67
Query: 88 IPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAV 147
+ RLGIP Y+WW E LHG+A + P V F G + TSFP +S S+NR+LW+ IG V
Sbjct: 68 VSRLGIPPYQWWGEGLHGVAIS-PSVYFGGATPAATSFPLPCLSVCSYNRTLWNKIGQVV 126
Query: 148 AVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENW 207
+ E RAMYN G++GLT+W+PNINI RDPRWGR QETPGEDP + S YAV FVK Q ++
Sbjct: 127 STEGRAMYNQGRSGLTYWSPNINIARDPRWGRTQETPGEDPKLSSGYAVHFVKGLQEGDY 186
Query: 208 KSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMIT 267
+ + + V +G L +SACCKH A+DL++W ++ R F++ +T
Sbjct: 187 DQN--------QPQAVSRG-------PRRLKISACCKHFTAHDLDRWKDYDRDHFDSKVT 231
Query: 268 EQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSD 326
+QD EDT+ P F+SC+++G++S +MCSYN++NG+P C +L RN+WGF GYI SD
Sbjct: 232 QQDLEDTYNPSFKSCVKEGQSSSVMCSYNRLNGIPMCTHYELLTLTVRNQWGFDGYIVSD 291
Query: 327 CDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALL 386
CDAVA I +Y NY T ED+ + V+ AGMD+NCG+ L H +A+DK + E ID L
Sbjct: 292 CDAVALIHDYINYAPTSEDAVSYVMLAGMDLNCGSTTLVHGLAALDKKLIWEGLIDMHLR 351
Query: 387 NLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNK 446
NLF V++RLG+F+G+P YG LGP+D+CT +++ LAL+AARQ +VLLKN+K LP K
Sbjct: 352 NLFRVRMRLGMFDGNPSTLPYGSLGPEDMCTEDNQHLALEAARQSLVLLKNEKNALPWKK 411
Query: 447 NAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVS----KTHYASGCHDVPCN 502
LA+IG + +M G Y G PC S L+G +S + + GC D C
Sbjct: 412 THGLKLAVIGHHADATREMLGNYEGYPCKFVSPLQGFAKVLSDHSPRISHERGCSDAACE 471
Query: 503 SDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSK-RPV 561
+ A A +AD V++V G+ QE E RDR SLLLPG+QM LV+SV S RPV
Sbjct: 472 DQFYIYAAKEAAAQADAVVLVLGISQAQEKEGRDRDSLLLPGRQMELVSSVVEASAGRPV 531
Query: 562 ILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPES 621
+LVL G PLDVSFA D +I SI+W GYPG++G +A+AE IFG NPGGRL +WY E+
Sbjct: 532 VLVLLSGSPLDVSFANDDPRIQSIIWAGYPGQSGGEAIAEAIFGLVNPGGRLAQSWYYEN 591
Query: 622 FTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISAS 681
+T + M++MNMR ++S YPGR+YRF+T T ++ FGHGLSY+++ Y +SAP + ++
Sbjct: 592 YTNIDMSNMNMRPNASTGYPGRTYRFFTDTPLWEFGHGLSYSDFKYTMVSAPQSI-MAPH 650
Query: 682 LK---AGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARV 738
L+ SD+ ++ + L Y E +C FHV++ V N G + G H V+LF++
Sbjct: 651 LRYQLCSSDRAVMTSDLNCLHY----EKEACKESSFHVRVWVINHGPLSGDHSVLLFSKP 706
Query: 739 P-KVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
P + G P KQL+ F+RVH A +EI F V+PCE L G R + LG H LMVG
Sbjct: 707 PSRGIDGIPLKQLVSFERVHLEAGAGQEILFKVNPCEDLGTVGDDGIRTVELGEHTLMVG 766
Query: 798 ELRHSLTIETY 808
++H LT+E +
Sbjct: 767 MVQHVLTVENW 777
>gi|225431898|ref|XP_002276351.1| PREDICTED: beta-D-xylosidase 1-like [Vitis vinifera]
Length = 770
Score = 710 bits (1832), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 353/770 (45%), Positives = 493/770 (64%), Gaps = 40/770 (5%)
Query: 42 FPCKPPH--FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWW 99
F C P + + PFC SL I RA+ L+ LTLQEKI+ L +NA +PRLGI YEWW
Sbjct: 27 FACDPRNGVTRNLPFCRVSLPIQERARDLVGRLTLQEKIRLLVNNAIDVPRLGIKGYEWW 86
Query: 100 SESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ 159
SE+LHG+++ GPG F G+ TSFPQV+ +AASFN SLW IG V+ EARAMYN G
Sbjct: 87 SEALHGVSNVGPGTKFGGSFPGATSFPQVITTAASFNASLWEEIGRVVSDEARAMYNGGM 146
Query: 160 AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFR 219
AGLT+W+PN+NIFRDPRWGRGQETPGEDP V + YA +V+ QG
Sbjct: 147 AGLTYWSPNVNIFRDPRWGRGQETPGEDPAVAAKYAAAYVRGLQG--------------- 191
Query: 220 EKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPF 279
+ D L ++ACCKH AYDL+ WG R+ FNA +++QD EDT+ PF
Sbjct: 192 ------------NARDRLKVAACCKHYTAYDLDHWGGIDRFHFNARVSKQDLEDTYDVPF 239
Query: 280 RSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQN 338
++C+ +G + +MCSYNQVNG P C L + R EW GYI SDCD+V ++ Q+
Sbjct: 240 KACVVEGNVASVMCSYNQVNGKPTCADPHLLRDTIRGEWKLNGYIVSDCDSVGVFYDEQH 299
Query: 339 YTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
YT T E++AA +KAG+D++CG + HT++AI GK+ E D++ AL+N SVQ+RLG+F
Sbjct: 300 YTATPEEAAAVAIKAGLDLDCGPFLAIHTEAAIRGGKLTEADVNGALMNTISVQMRLGMF 359
Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
+G+P YG LGP DVCT H++LAL+AARQGIVL++N LPL+ + ++A+IGP
Sbjct: 360 DGEPSAQPYGNLGPRDVCTPAHQQLALEAARQGIVLVQNRGPALPLSTSRHRTIAVIGPN 419
Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKAD 518
+ M G Y G+ C + L+G+ Y H A GC V C D F AV A++AD
Sbjct: 420 SDVTETMIGNYAGVACGYTTPLQGIGRYARTIHQA-GCSGVACRDDQQFGAAVAAARQAD 478
Query: 519 FVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEA 578
++V GLD + E E RDRV +LLPG+Q LV+ VA S+ P +LVL GGP+DVSFA+
Sbjct: 479 ATVLVMGLDQSIEAEFRDRVDILLPGRQQELVSKVAVASRGPTVLVLMSGGPIDVSFAKN 538
Query: 579 DSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSS 637
D +I++I+W+GYPG+AG A+A+++FG NPGG+LP+TWYP+S+ K PM +M MRA S
Sbjct: 539 DPRIAAIIWVGYPGQAGGTAIADVLFGRTNPGGKLPVTWYPQSYLRKAPMTNMAMRAIPS 598
Query: 638 RQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS-ASLKAGSDKNILQQTGS 696
R YPGR+YRFY G V+ FGHGLSY+ +++ AP+ +++S ASL+ + I+
Sbjct: 599 RGYPGRTYRFYNGPVVFPFGHGLSYSTFAHSLAQAPTTVSVSLASLQTIKNSTIVSSGAI 658
Query: 697 RLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRV 756
R+ + +C + I V N G +DGSH ++LF+ P +P K+L+ F++V
Sbjct: 659 RISHA------NCNTQPLGFHIDVKNTGTMDGSHTLLLFS-TPPPGTWSPNKRLLAFEKV 711
Query: 757 HTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
H A + + F V C+ LS+ + G +P+G H +G+L+HS++++
Sbjct: 712 HVGAGSQERVRFDVHVCKHLSVVDHFGIHRIPMGEHHFHIGDLKHSISLQ 761
>gi|15239867|ref|NP_199747.1| beta-xylosidase 1 [Arabidopsis thaliana]
gi|75262458|sp|Q9FGY1.1|BXL1_ARATH RecName: Full=Beta-D-xylosidase 1; Short=AtBXL1; AltName:
Full=Alpha-L-arabinofuranosidase; Flags: Precursor
gi|9759419|dbj|BAB09906.1| xylosidase [Arabidopsis thaliana]
gi|21539545|gb|AAM53325.1| xylosidase [Arabidopsis thaliana]
gi|332008419|gb|AED95802.1| beta-xylosidase 1 [Arabidopsis thaliana]
Length = 774
Score = 710 bits (1832), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 360/794 (45%), Positives = 498/794 (62%), Gaps = 45/794 (5%)
Query: 21 LIVVVNVIAFSNSKPVLNKPDFPCKPPH--FDSYPFCNTSLSISTRAKSLISLLTLQEKI 78
++++V ++ +S L +P F C P + + FC ++ I R + L+ LTLQEKI
Sbjct: 15 VVILVFLLCLVHSSESL-RPLFACDPANGLTRTLRFCRANVPIHVRVQDLLGRLTLQEKI 73
Query: 79 QQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRS 138
+ L +NA+A+PRLGI YEWWSE+LHGI+ GPG F G TSFPQV+ +AASFN+S
Sbjct: 74 RNLVNNAAAVPRLGIGGYEWWSEALHGISDVGPGAKFGGAFPGATSFPQVITTAASFNQS 133
Query: 139 LWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEF 198
LW IG V+ EARAMYN G AGLT+W+PN+NI RDPRWGRGQETPGEDP+V + YA +
Sbjct: 134 LWEEIGRVVSDEARAMYNGGVAGLTYWSPNVNILRDPRWGRGQETPGEDPIVAAKYAASY 193
Query: 199 VKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFS 258
V+ QG + G+ L ++ACCKH AYDL+ W
Sbjct: 194 VRGLQG--------------------------TAAGNRLKVAACCKHYTAYDLDNWNGVD 227
Query: 259 RYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEW 317
R+ FNA +T+QD EDT+ PF+SC+ +GK + +MCSYNQVNG P C +L + R +W
Sbjct: 228 RFHFNAKVTQQDLEDTYNVPFKSCVYEGKVASVMCSYNQVNGKPTCADENLLKNTIRGQW 287
Query: 318 GFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQ 377
GYI SDCD+V F Q+YT T E++AA +KAG+D++CG + T+ A+ KG +
Sbjct: 288 RLNGYIVSDCDSVDVFFNQQHYTSTPEEAAARSIKAGLDLDCGPFLAIFTEGAVKKGLLT 347
Query: 378 EKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKN 437
E DI+ AL N +VQ+RLG+F+G+ G Y LGP DVCT HK LAL+AA QGIVLLKN
Sbjct: 348 ENDINLALANTLTVQMRLGMFDGN--LGPYANLGPRDVCTPAHKHLALEAAHQGIVLLKN 405
Query: 438 DKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCH 497
+ LPL+ ++A+IGP + M G Y G C+ S L+G+ Y H A GC
Sbjct: 406 SARSLPLSPRRHRTVAVIGPNSDVTETMIGNYAGKACAYTSPLQGISRYARTLHQA-GCA 464
Query: 498 DVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTS 557
V C + GF A A++AD ++V GLD + E E RDR LLLPG Q LVT VA+ S
Sbjct: 465 GVACKGNQGFGAAEAAAREADATVLVMGLDQSIEAETRDRTGLLLPGYQQDLVTRVAQAS 524
Query: 558 KRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTW 617
+ PVILVL GGP+DV+FA+ D ++++I+W GYPG+AG A+A IIFG NPGG+LPMTW
Sbjct: 525 RGPVILVLMSGGPIDVTFAKNDPRVAAIIWAGYPGQAGGAAIANIIFGAANPGGKLPMTW 584
Query: 618 YPESFT-KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAP-SE 675
YP+ + KVPM M MRA S YPGR+YRFY G V+ FG GLSYT +++ +P ++
Sbjct: 585 YPQDYVAKVPMTVMAMRA--SGNYPGRTYRFYKGPVVFPFGFGLSYTTFTHSLAKSPLAQ 642
Query: 676 LTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSL-RFHVQISVTNAGDVDGSHVVML 734
L++S S S IL + + H T+C S + + + V+N G+ DG+H V +
Sbjct: 643 LSVSLS-NLNSANTILNSSSHSIKVSH----TNCNSFPKMPLHVEVSNTGEFDGTHTVFV 697
Query: 735 FARVP--KVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNH 792
FA P + KQLI F++VH +A + + VD C+ L + +++G+R +P+G H
Sbjct: 698 FAEPPINGIKGLGVNKQLIAFEKVHVMAGAKQTVQVDVDACKHLGVVDEYGKRRIPMGEH 757
Query: 793 VLMVGELRHSLTIE 806
L +G+L+H++ ++
Sbjct: 758 KLHIGDLKHTILVQ 771
>gi|32481073|gb|AAP83934.1| auxin-induced beta-glucosidase [Chenopodium rubrum]
Length = 767
Score = 710 bits (1832), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 347/756 (45%), Positives = 487/756 (64%), Gaps = 36/756 (4%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
FC +L I R + LI L LQEK++ L +NA+ +PRLGI YEWWSE+LHG+++ GPG
Sbjct: 40 FCRVNLPIRARVQDLIGRLNLQEKVKLLVNNAAPVPRLGISGYEWWSEALHGVSNVGPGT 99
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFR 173
F G + TSFPQV+ +AASFN SLW IG V+ EARAMYN G AGLT+W+PN+NIFR
Sbjct: 100 KFRGAFPAATSFPQVITTAASFNASLWEAIGQVVSDEARAMYNGGTAGLTYWSPNVNIFR 159
Query: 174 DPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDR 233
DPRWGRGQETPGEDP + S YA +V+ QG K
Sbjct: 160 DPRWGRGQETPGEDPTLASQYAASYVRGLQGIYNK------------------------- 194
Query: 234 GDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMC 293
+ L ++ACCKH AYDL+ W R+ FNA +++QD EDT+ PF+ C+++G+ + +MC
Sbjct: 195 -NRLKVAACCKHYTAYDLDNWNAVDRFHFNAKVSKQDLEDTYNVPFKGCVQEGRVASVMC 253
Query: 294 SYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLK 352
SYNQVNG P C DL + R +W GYI SDCD+V +++ Q+YT+T E++AA +K
Sbjct: 254 SYNQVNGKPTCADPDLLRNTIRGQWRLNGYIVSDCDSVGVLYDDQHYTRTPEEAAADTIK 313
Query: 353 AGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP 412
AG+D++CG + HT++A+ +G + E D+++AL N F+VQ+RLG+F+G+ +G LGP
Sbjct: 314 AGLDLDCGPFLAVHTEAAVKRGLLTEADVNQALTNTFTVQMRLGMFDGEAAAQPFGHLGP 373
Query: 413 DDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGI 472
DVC+ H+ LAL AARQGIVLL+N + LPL+ ++A+IGP + M G Y G+
Sbjct: 374 KDVCSPAHQDLALQAARQGIVLLQNRGRSLPLSTARHRNIAVIGPNADATVTMIGNYAGV 433
Query: 473 PCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQET 532
C S L+G+ Y +KT + +GC V C S+ F A A AD ++V GLD + E
Sbjct: 434 ACGYTSPLQGIARY-AKTVHQAGCIGVACTSNQQFGAATAAAAHADATVLVMGLDQSIEA 492
Query: 533 EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPG 592
E RDR S+LLPG Q LV+ VA S+ P ILVL GGP+DV+FA+ D +IS+ILW+GYPG
Sbjct: 493 EFRDRASVLLPGHQQELVSKVALASRGPTILVLMCGGPVDVTFAKNDPKISAILWVGYPG 552
Query: 593 EAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRFYTGT 651
+AG A+A+++FG NPGG+LP TWYP+S+ KVPM D+ MRA+ S YPGR+YRFY G
Sbjct: 553 QAGGTAIADVLFGTTNPGGKLPNTWYPQSYVAKVPMTDLAMRANPSNGYPGRTYRFYKGP 612
Query: 652 QVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTS 711
V+ FG GLSYT ++ AP+++ + + + ++ NI L +H T+C +
Sbjct: 613 VVFPFGFGLSYTRFTQSLAHAPTKVMVPLANQF-TNSNITSFNKDALKVLH----TNCDN 667
Query: 712 LRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVD 771
+ + I V N G VDGSH +++F+ PK ++ + EKQLIGF RVH A + + +
Sbjct: 668 IPLSLHIDVKNKGKVDGSHTILVFSTPPKGTKSS-EKQLIGFKRVHVFAGSKQRVRMNIH 726
Query: 772 PCEQLSIANKHGRRILPLGNHVLMVG-ELRHSLTIE 806
C LS A++ G R +P+G H L +G + +H L++
Sbjct: 727 VCNHLSRADEFGVRRIPIGEHTLHIGDDHKHKLSLH 762
>gi|242071935|ref|XP_002451244.1| hypothetical protein SORBIDRAFT_05g026400 [Sorghum bicolor]
gi|241937087|gb|EES10232.1| hypothetical protein SORBIDRAFT_05g026400 [Sorghum bicolor]
Length = 790
Score = 709 bits (1830), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 349/776 (44%), Positives = 483/776 (62%), Gaps = 57/776 (7%)
Query: 51 SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
+ PFC SL + RA+ L+S LT EK++ L +NA+ + RLG+ YEWWSE+LHG++ G
Sbjct: 44 TLPFCRQSLPLHARARDLVSRLTRAEKVRLLVNNAAGVARLGVGGYEWWSEALHGVSDTG 103
Query: 111 PGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNIN 170
PGV F G T+FPQV+ +AA+ N +LW IG AV+ EARAMYN G+AGLTFW+PN+N
Sbjct: 104 PGVKFGGAFPGATAFPQVIGAAAALNATLWELIGRAVSDEARAMYNGGRAGLTFWSPNVN 163
Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
IFRDPRWGRGQETPGEDP + S YA +V+ Q +
Sbjct: 164 IFRDPRWGRGQETPGEDPAISSRYAAAYVRGLQ-------------------------QP 198
Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
D + L L+ACCKH AYDL+ WG R+ FNA+++ QD EDTF PFR+C+ G+A+
Sbjct: 199 YDH-NRLKLAACCKHFTAYDLDSWGGTDRFHFNAVVSPQDLEDTFNVPFRACVAGGRAAS 257
Query: 291 IMCSYNQVNGVPACL-RGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
+MCSYNQVNGVP C +G L R WG GYI SDCD+V F Q+YT+T ED+ A
Sbjct: 258 VMCSYNQVNGVPTCADQGFLRGTIRKAWGLDGYIVSDCDSVDVFFRDQHYTRTAEDAVAA 317
Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGK 409
L+AG+D++CG + +T++A+ + KV + D+D ALLN +VQ+RLG+F+GDP G +G
Sbjct: 318 TLRAGLDLDCGPFLALYTENAVARKKVSDADVDAALLNTVTVQMRLGMFDGDPASGPFGH 377
Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKN-------DKKFLPLNKNAVSSLAIIGPLVNNI 462
LG DVCT H+ LALDAARQ +VLLKN D+ LPL A +A++GP +
Sbjct: 378 LGAADVCTKAHQDLALDAARQSVVLLKNQRGRKHRDRDVLPLRPAAHRVVAVVGPHADAT 437
Query: 463 SQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCN-SDAGFHEAVRIAKKADFVI 521
M G Y G PC + L+G+ AY ++ + +GC DV C + AV A++
Sbjct: 438 VAMIGNYAGKPCRYTTPLQGVAAYAARVVHQAGCADVACQGKNQPIAAAVDAARRLTPPS 497
Query: 522 VVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQ 581
GL SLLLPG+Q L+++VA+ +K PVILVL GGP+D++FA+ D +
Sbjct: 498 SSPGLTR----------SLLLPGRQAELISAVAKAAKGPVILVLMSGGPIDIAFAQNDPR 547
Query: 582 ISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQY 640
I ILW+GYPG+AG +A+A++IFG NPGG+LP+TWYP+ + KVPM +M MRA+ +R Y
Sbjct: 548 IDGILWVGYPGQAGGQAIADVIFGQHNPGGKLPVTWYPQDYLEKVPMTNMAMRANPARGY 607
Query: 641 PGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ----TGS 696
PGR+YRFYTG ++ FGHGLSYT +++ AP++LT+ S + S +
Sbjct: 608 PGRTYRFYTGPTIHAFGHGLSYTQFTHTLAHAPAQLTVRLSTSSASASASASAASLLNAT 667
Query: 697 RLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLF-----ARVPKVSQGT--PEKQ 749
R C L V + V N GD DG+H V+++ + GT P +Q
Sbjct: 668 RPSRAVRVAHARCEGLTVPVHVDVRNVGDRDGAHAVLVYHVAPSSSSSSAPAGTDAPARQ 727
Query: 750 LIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
L+ F++VH A G + G+D C++LS+A++ G R +P+G H LM+GEL HS+T+
Sbjct: 728 LVAFEKVHVPAGGVARVEMGIDVCDRLSVADRDGVRRIPVGEHRLMIGELTHSVTL 783
>gi|222615852|gb|EEE51984.1| hypothetical protein OsJ_33664 [Oryza sativa Japonica Group]
Length = 753
Score = 708 bits (1827), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 359/752 (47%), Positives = 472/752 (62%), Gaps = 49/752 (6%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
FC+ L+ RA L++ LTL EK+ QL D A+ + RLG+PAYEWWSE LHG++ G G+
Sbjct: 31 FCDAWLTAEQRAADLVANLTLAEKVSQLGDRAAGVARLGVPAYEWWSEGLHGLSIWGRGI 90
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA-GLTFWAPNINIF 172
FNGTV +VTSFPQV+++AA+F+ LW +G AV EARA+YNLGQA GLT W+PN+NIF
Sbjct: 91 RFNGTVRAVTSFPQVILTAAAFDAGLWRRVGEAVGAEARALYNLGQANGLTIWSPNVNIF 150
Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
RDP R PG+ R GE+
Sbjct: 151 RDPSGTR----PGD----------------------------------ARRGPRHGEQGI 172
Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
G+ SACCKH AYDL+ W N RY++++ +T QD EDT+ PPF+SC+ +GKA+CIM
Sbjct: 173 GGEA---SACCKHATAYDLDYWNNVVRYNYDSKVTLQDLEDTYNPPFKSCVAEGKATCIM 229
Query: 293 CSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVL 351
C YN +NGVPAC DL +K R EWG GY+ SDCDAVATI + +YT + ED+ A +
Sbjct: 230 CGYNSINGVPACASSDLLTKKVRQEWGMNGYVASDCDAVATIRDAHHYTLSPEDTVAVSI 289
Query: 352 KAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-YGKL 410
K GMD+NCG H +A+ KG + EKDIDRAL+NLF+V++RLG F+GDPR YG L
Sbjct: 290 KVGMDVNCGNYTQVHAMAAVQKGNLTEKDIDRALVNLFAVRMRLGHFDGDPRSNAVYGHL 349
Query: 411 GPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT 470
G DVC+ HK LAL+AA+ GIVLLKND LPL +AV+SLA+IGP +N+ + G Y
Sbjct: 350 GAADVCSPAHKSLALEAAQDGIVLLKNDAGALPLQPSAVTSLAVIGPNADNLGALHGNYF 409
Query: 471 GIPCSPKSLLRGLEAYVS-KTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLT 529
G PC + L+G++ Y+ + + +GC C DA +EA +A +D V++ GL
Sbjct: 410 GPPCETTTPLQGIKGYLGDRARFLAGCDSPACAVDA-TNEAAALASSSDHVVLFMGLSQK 468
Query: 530 QETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIG 589
QE + DR SLLLPG+Q L+T+VA ++RPVILVL GGP+DV+FA+ + +I +ILW G
Sbjct: 469 QEQDGLDRTSLLLPGEQQGLITAVANAARRPVILVLLTGGPVDVTFAKDNPKIGAILWAG 528
Query: 590 YPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYT 649
YPG+AG A+A+++FGD NP GRLP+TWYPE FTKVPM DM MRAD + YPGRSYRFY
Sbjct: 529 YPGQAGGLAIAKVLFGDHNPSGRLPVTWYPEEFTKVPMTDMRMRADPATGYPGRSYRFYQ 588
Query: 650 GTQVYGFGHGLSYTNYSYKFLSAPSELTI-SASLKAGSDKNILQQTGSRLDYVHIDE--V 706
G VY FG+GLSY+ +S + S+ S + SL AG G + + E V
Sbjct: 589 GNTVYNFGYGLSYSKFSRRMFSSFSTSNAGNLSLLAGVMARRAGDDGGGMSSYLVKEIGV 648
Query: 707 TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEI 766
C+ L F + V N G +DG H V+++ R P S G P +QLIGF H +
Sbjct: 649 ERCSRLVFPAVVEVQNHGPMDGKHSVLMYLRWPTTSGGRPARQLIGFRSQHVKVGEKAMV 708
Query: 767 SFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
SF V PCE S + G R++ G H LMVG+
Sbjct: 709 SFEVSPCEHFSWVGEDGERVIDGGAHFLMVGD 740
>gi|297795695|ref|XP_002865732.1| beta-xylosidase 1 [Arabidopsis lyrata subsp. lyrata]
gi|297311567|gb|EFH41991.1| beta-xylosidase 1 [Arabidopsis lyrata subsp. lyrata]
Length = 774
Score = 708 bits (1827), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 360/794 (45%), Positives = 498/794 (62%), Gaps = 45/794 (5%)
Query: 21 LIVVVNVIAFSNSKPVLNKPDFPCKPPH--FDSYPFCNTSLSISTRAKSLISLLTLQEKI 78
++++V ++ +S L +P F C P + + FC ++ I R + LI LTLQEKI
Sbjct: 15 VVILVFLLCLVHSSESL-RPLFACDPANGLTRTLRFCRVNVPIHVRVQDLIGRLTLQEKI 73
Query: 79 QQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRS 138
+ L +NA+A+PRLGI YEWWSE+LHG++ GPG F G TSFPQV+ +AASFN+S
Sbjct: 74 RNLVNNAAAVPRLGIGGYEWWSEALHGVSDVGPGSKFGGAFPGATSFPQVITTAASFNQS 133
Query: 139 LWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEF 198
LW IG V+ EARAMYN G AGLT+W+PN+NI RDPRWGRGQETPGEDP+V + YA +
Sbjct: 134 LWEEIGRVVSDEARAMYNGGVAGLTYWSPNVNILRDPRWGRGQETPGEDPIVAAKYAASY 193
Query: 199 VKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFS 258
V+ QG + G+ L ++ACCKH AYDL+ W
Sbjct: 194 VRGLQG--------------------------TAAGNRLKVAACCKHYTAYDLDNWNGVD 227
Query: 259 RYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEW 317
R+ FNA +T+QD EDT+ PF+SC+ +GK + +MCSYNQVNG P C +L + R +W
Sbjct: 228 RFHFNAKVTQQDLEDTYNVPFKSCVYEGKVASVMCSYNQVNGKPTCADENLLKNTIRGKW 287
Query: 318 GFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQ 377
GYI SDCD+V F Q+YT T E++AA +KAG+D++CG + T+ A+ KG +
Sbjct: 288 RLNGYIVSDCDSVDVFFNQQHYTSTPEEAAAASIKAGLDLDCGPFLAIFTEGAVKKGLLT 347
Query: 378 EKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKN 437
E DI+ AL N +VQ+RLG+F+G+ G Y LGP DVC+ HK LAL+AA QGIVLLKN
Sbjct: 348 ENDINLALANTLTVQMRLGMFDGN--LGPYANLGPRDVCSLAHKHLALEAAHQGIVLLKN 405
Query: 438 DKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCH 497
+ LPL+ ++A+IGP + M G Y G C+ + L+G+ Y H A GC
Sbjct: 406 SGRSLPLSPRRHRTVAVIGPNSDVTETMIGNYAGKACAYTTPLQGISRYARTLHQA-GCA 464
Query: 498 DVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTS 557
V C + GF A A++AD ++V GLD + E E RDR LLLPG Q LVT VA+ S
Sbjct: 465 GVACKGNQGFGAAEAAAREADATVLVMGLDQSIEAETRDRTGLLLPGYQQDLVTRVAQAS 524
Query: 558 KRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTW 617
+ PVILVL GGP+DV+FA+ D ++++I+W GYPG+AG A+A IIFG NPGG+LPMTW
Sbjct: 525 RGPVILVLMSGGPIDVTFAKNDPRVAAIIWAGYPGQAGGAAIANIIFGAANPGGKLPMTW 584
Query: 618 YPESFT-KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAP-SE 675
YP+ + KVPM M MRA S YPGR+YRFY G V+ FG GLSYT ++ +P ++
Sbjct: 585 YPQDYVAKVPMTVMAMRA--SGNYPGRTYRFYKGPVVFPFGFGLSYTTFTNSLAKSPLAQ 642
Query: 676 LTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSL-RFHVQISVTNAGDVDGSHVVML 734
L++S S S IL T + H T+C S + + + V+N G+ DG+H V +
Sbjct: 643 LSVSLS-NLNSANAILNSTSHSIKVSH----TNCNSFPKMPLHVEVSNTGEFDGTHTVFV 697
Query: 735 FARVPK--VSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNH 792
FA PK + KQLI F++VH +A + + VD C+ L + +++G+R +P+G H
Sbjct: 698 FAEPPKNGIKGLGVNKQLIAFEKVHVMAGAKQTVRVDVDACKHLGVVDEYGKRRIPMGKH 757
Query: 793 VLMVGELRHSLTIE 806
L +G+L+H++ ++
Sbjct: 758 KLHIGDLKHTILVQ 771
>gi|357449039|ref|XP_003594795.1| Beta xylosidase [Medicago truncatula]
gi|355483843|gb|AES65046.1| Beta xylosidase [Medicago truncatula]
Length = 762
Score = 707 bits (1824), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 346/770 (44%), Positives = 482/770 (62%), Gaps = 44/770 (5%)
Query: 42 FPCKPPH--FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWW 99
F C P + SY FCNT + I R + LI L L EKI+ + +NA A+PRLGI YEWW
Sbjct: 27 FACDPKNGLTRSYKFCNTRVPIHARVQDLIGRLALPEKIRLVVNNAIAVPRLGIQGYEWW 86
Query: 100 SESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ 159
SE+LHG+++ GPG F G S+ TSFPQV+ +AASFN+SLW IG V+ EARAMYN G
Sbjct: 87 SEALHGVSNVGPGTKFGGAFSAATSFPQVITTAASFNQSLWLEIGRIVSDEARAMYNGGA 146
Query: 160 AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFR 219
AGLTFW+PN+NIFRDPRWGRGQETPGEDP V YA +V+ QG
Sbjct: 147 AGLTFWSPNVNIFRDPRWGRGQETPGEDPTVAGKYAASYVQGLQGNG------------- 193
Query: 220 EKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPF 279
G+ L ++ACCKH AYDL+ W R+ FNA +++QD DT+ PF
Sbjct: 194 -------------AGNRLKVAACCKHYTAYDLDNWNGVDRFHFNAKVSKQDLADTYDVPF 240
Query: 280 RSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQN 338
++C+ GK + +MCSYNQVNG P C +L + R EWG GYI SDCD+V +++ Q+
Sbjct: 241 KACVRDGKVASVMCSYNQVNGKPTCADPELLRNTIRGEWGLNGYIVSDCDSVGVLYDNQH 300
Query: 339 YTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
YT+T E +AA +KAG+D++CG + HT AI +G + E D++ AL NL +VQ+RLG+F
Sbjct: 301 YTRTPEQAAAAAIKAGLDLDCGPFLALHTDGAIKQGLISENDLNLALANLITVQMRLGMF 360
Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
+GD + YG LG DVC H +AL+AARQGIVLL+N LPL+ ++ +IGP
Sbjct: 361 DGDAQ--PYGNLGTRDVCLPSHNDVALEAARQGIVLLQNKGNALPLSPTRYRTVGVIGPN 418
Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKAD 518
+ M G Y GI C + L+G+ YV H A GC DV C + F + ++A++AD
Sbjct: 419 SDVTVTMIGNYAGIACGYTTPLQGIARYVKTIHQA-GCKDVGCGGNQLFGLSEQVARQAD 477
Query: 519 FVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEA 578
++V GLD + E E RDR LLLPG Q LV+ VAR ++ PVILVL GGP+DV+FA+
Sbjct: 478 ATVLVMGLDQSIEAEFRDRTGLLLPGHQQELVSRVARAARGPVILVLMSGGPIDVTFAKN 537
Query: 579 DSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSS 637
D +IS+ILW+GYPG++G A+A++IFG NP GRLP TWYP+ + KVPM +M+MRA+ +
Sbjct: 538 DPKISAILWVGYPGQSGGTAIADVIFGRTNPSGRLPNTWYPQDYVRKVPMTNMDMRANPA 597
Query: 638 RQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSR 697
YPGR+YRFY G V+ FGHGLSY+ +++ AP ++++ + +
Sbjct: 598 TGYPGRTYRFYKGPVVFPFGHGLSYSRFTHSLALAPKQVSVQFTTPLTQAFTNSSNKAMK 657
Query: 698 LDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVH 757
+ + + DE+ + FHV V N G +DG+H ++++++ P KQL+ F + +
Sbjct: 658 VSHANCDEL----EVGFHV--DVKNEGSMDGAHTLLVYSKAPN-----GVKQLVNFHKTY 706
Query: 758 TVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIET 807
A + GV C LS ++ G R +P+G H L +G+L+HS+ ++T
Sbjct: 707 VPAGSKTRVKVGVHVCNHLSAVDEFGVRRIPMGEHELQIGDLKHSILVQT 756
>gi|326494302|dbj|BAJ90420.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326521150|dbj|BAJ96778.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326527851|dbj|BAK08165.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 775
Score = 704 bits (1818), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 361/772 (46%), Positives = 492/772 (63%), Gaps = 37/772 (4%)
Query: 40 PDFPCKPPH--FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYE 97
P F C + +Y FCN + S RA+ L+S LTL EK+ L + A+ RLGIPAYE
Sbjct: 35 PVFACDASNATLAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPALGRLGIPAYE 94
Query: 98 WWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL 157
WWSE+LHG++ GPG F+ V TSFPQ +++AASFN SL+ IG V+ EARAM+N+
Sbjct: 95 WWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVSTEARAMHNV 154
Query: 158 GQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFG 217
G AGLTFW+PNINIFRDPRWGRGQETPGEDP++ S YAV +V Q + GG+ G
Sbjct: 155 GLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQ----DAGAGGVTDG 210
Query: 218 FREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQP 277
L ++ACCKH AYD++ W RY+F+A +++QD +DTFQP
Sbjct: 211 ------------------ALKVAACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQP 252
Query: 278 PFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEY 336
PF+SC+ G + +MCSYN+VNG P C DL + R +W GYI SDCD+V ++
Sbjct: 253 PFKSCVLDGNVASVMCSYNKVNGKPTCADKDLLEGVIRGDWKLNGYIVSDCDSVDVLYTQ 312
Query: 337 QNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
Q+YTKT E++AA +K+G+D+NCG + +HT +A+ G++ E+D+DRA+ N F + +RLG
Sbjct: 313 QHYTKTPEEAAAITIKSGLDLNCGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLG 372
Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIG 456
F+GDPR+ +G LGP DVCTS +++LA + ARQGIVLLKN LPL+ ++ S+A+IG
Sbjct: 373 FFDGDPRQLAFGSLGPKDVCTSSNRELARETARQGIVLLKNSGA-LPLSAKSIKSMAVIG 431
Query: 457 PLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDA-GFHEAVRIAK 515
P N M G Y G PC + L+GL A V+ T Y GC +V C+ ++ AV A
Sbjct: 432 PNANASFTMIGNYEGTPCKYTTPLQGLGAKVN-TVYQPGCTNVGCSGNSLQLSTAVAAAA 490
Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
AD ++V G D + E E DR SLLLPGQQ LV++VA S PVILV+ GGP D+SF
Sbjct: 491 SADVTVLVVGADQSIERESLDRTSLLLPGQQTQLVSAVANASSGPVILVVMSGGPFDISF 550
Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRA 634
A+A +I++ILW+GYPGEAG ALA+I+FG NP GRLP+TWYP S+ V M DM MR
Sbjct: 551 AKASDKIAAILWVGYPGEAGGAALADILFGSHNPSGRLPVTWYPASYADTVTMTDMRMRP 610
Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
D+S YPGR+YRFYTG V+ FG GLSYT S+ +SAP S++ D +
Sbjct: 611 DTSTGYPGRTYRFYTGDTVFAFGDGLSYTKMSHSLVSAPPSYV---SMRLAEDHPCRAEE 667
Query: 695 GSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFD 754
+ ++ C L F V++ V NAG+V G+H V+LF+ P + P K L+GF+
Sbjct: 668 CASVEAAG----DHCDDLAFDVKLQVRNAGEVAGAHSVLLFSS-PPPAHNAPAKHLLGFE 722
Query: 755 RVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
+V + ++F VD C LS+ ++ G R + LG H L VG+L+H++ +
Sbjct: 723 KVSLAPGEAGTVAFRVDVCRDLSVVDELGGRKVALGGHTLHVGDLKHTVELR 774
>gi|357166259|ref|XP_003580652.1| PREDICTED: beta-D-xylosidase 4-like [Brachypodium distachyon]
Length = 774
Score = 703 bits (1815), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 364/771 (47%), Positives = 488/771 (63%), Gaps = 39/771 (5%)
Query: 40 PDFPCKPPH--FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYE 97
P F C + Y FC+ + S S RA L+S LTL +K+ L + A+ RLGIPAYE
Sbjct: 36 PVFACDAANSTVAGYAFCDRAKSASARAADLVSRLTLADKVGFLVNKQPALARLGIPAYE 95
Query: 98 WWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL 157
WWSE+LHG++ GPG F+ V TSFPQ +++AASFN SL+ IG V+ EARAM+N+
Sbjct: 96 WWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVSNEARAMHNV 155
Query: 158 GQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFG 217
G AGLTFW+PNINIFRDPRWGRGQETPGEDP++ S YAV +V Q D G
Sbjct: 156 GLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASRYAVGYVSGLQ-------DAG---- 204
Query: 218 FREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQP 277
+D L ++ACCKH AYD++ W RY+F+A +++QD +DTFQP
Sbjct: 205 -------------ADADGPLKVAACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQP 251
Query: 278 PFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEY 336
PF+SC+ GK + +MCSYN+VNG P C DL R +W GYI SDCD+V ++
Sbjct: 252 PFKSCVIDGKVASVMCSYNKVNGKPTCADKDLLSGVIRGDWKLNGYIVSDCDSVDVLYSQ 311
Query: 337 QNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
Q+YTKT E++AA +K+G+D+NCG + +HT +A+ G + E D+DRA+ N F + +RLG
Sbjct: 312 QHYTKTPEEAAAITIKSGLDLNCGDFLAKHTVAAVQAGNLSESDVDRAITNNFIMLMRLG 371
Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIG 456
F+GDPRK YG LGP DVCTS +++LA + ARQGIVLLKND LPL+ ++ S+A+IG
Sbjct: 372 FFDGDPRKLAYGSLGPKDVCTSSNQELARETARQGIVLLKNDGA-LPLSAKSIKSMAVIG 430
Query: 457 PLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDA-GFHEAVRIAK 515
P N M G Y G PC + L GL V+ T Y GC +V C+ ++ A A
Sbjct: 431 PNANASFTMIGNYEGTPCKYTTPLHGLGNNVA-TVYQPGCSNVGCSGNSLQLSAATAAAA 489
Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
AD ++V G D + E E DR SLLLPGQQ L+++VA SK VILV+ GGP D+SF
Sbjct: 490 SADVTVLVVGADQSIEREALDRTSLLLPGQQPDLISAVANASKGHVILVVMSGGPFDISF 549
Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRA 634
A+A +IS+ILW+GYPGEAG A+A+IIFG +NP GRLP+TWYP SF KVPM DM MR
Sbjct: 550 AKASDKISAILWVGYPGEAGGAAIADIIFGKYNPSGRLPVTWYPASFADKVPMTDMRMRP 609
Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
D+S YPGR+YRFYTG V+ FG GLSYT S+ ++AP +S L G + +
Sbjct: 610 DNSTGYPGRTYRFYTGETVFAFGDGLSYTTMSHNLVAAPPS-EVSMQLAEGHACHTKECA 668
Query: 695 GSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFD 754
H C + F V++ V N G++ G+H V+LF+ P V P K L+GF+
Sbjct: 669 SVEAAGDH------CEGMAFEVRLRVHNTGEMAGAHTVLLFSSPPAV-HNAPAKHLLGFE 721
Query: 755 RVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
+++ + +F VD C+ LS+ ++ G R + LG H L VG+L+H+L +
Sbjct: 722 KLNLEPGQAGVAAFKVDVCKDLSVVDELGNRKVALGGHTLHVGDLKHTLNL 772
>gi|326492918|dbj|BAJ90315.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 775
Score = 703 bits (1814), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 360/772 (46%), Positives = 492/772 (63%), Gaps = 37/772 (4%)
Query: 40 PDFPCKPPH--FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYE 97
P F C + +Y FCN + S RA+ L+S LTL EK+ L + A+ RLGIPAYE
Sbjct: 35 PVFACDASNATLAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPALGRLGIPAYE 94
Query: 98 WWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL 157
WWSE+LHG++ GPG F+ V TSFPQ +++AASFN SL+ IG V+ EARAM+N+
Sbjct: 95 WWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVSTEARAMHNV 154
Query: 158 GQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFG 217
G AGLTFW+PNINIFRDPRWGRGQETPGEDP++ S YAV +V Q + GG+ G
Sbjct: 155 GLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQ----DAGAGGVTDG 210
Query: 218 FREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQP 277
L ++ACCKH AYD++ W RY+F+A +++QD +DTFQP
Sbjct: 211 ------------------ALKVAACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQP 252
Query: 278 PFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEY 336
PF+SC+ G + +MCSYN+VNG P C DL + R +W GYI SDCD+V ++
Sbjct: 253 PFKSCVLDGNVASVMCSYNKVNGKPTCADKDLLEGVIRGDWKLNGYIVSDCDSVDVLYTQ 312
Query: 337 QNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
Q+YTKT E++AA +K+G+D+NCG + +HT +A+ G++ E+D+DRA+ N F + +RLG
Sbjct: 313 QHYTKTPEEAAAITIKSGLDLNCGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLG 372
Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIG 456
F+GDPR+ +G LGP DVCTS +++LA + ARQGIVLLKN LPL+ ++ S+A+IG
Sbjct: 373 FFDGDPRQLAFGSLGPKDVCTSSNRELARETARQGIVLLKNSGA-LPLSAKSIKSMAVIG 431
Query: 457 PLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDA-GFHEAVRIAK 515
P N M G Y G PC + L+GL A V+ T Y GC +V C+ ++ AV A
Sbjct: 432 PNANASFTMIGNYEGTPCKYTTPLQGLGAKVN-TVYQPGCTNVGCSGNSLQLSTAVAAAA 490
Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
AD ++V G D + E E DR SLLLPGQQ LV++VA S PVILV+ GGP D+SF
Sbjct: 491 SADVTVLVVGADQSIERESLDRTSLLLPGQQTQLVSAVANASSGPVILVVMSGGPFDISF 550
Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRA 634
A+A +I++ILW+GYPGEAG ALA+I+FG NP G+LP+TWYP S+ V M DM MR
Sbjct: 551 AKASDKIAAILWVGYPGEAGGAALADILFGSHNPSGKLPVTWYPASYADTVTMTDMRMRP 610
Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
D+S YPGR+YRFYTG V+ FG GLSYT S+ +SAP S++ D +
Sbjct: 611 DTSTGYPGRTYRFYTGDTVFAFGDGLSYTKMSHSLVSAPPSYV---SMRLAEDHPCRAEE 667
Query: 695 GSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFD 754
+ ++ C L F V++ V NAG+V G+H V+LF+ P + P K L+GF+
Sbjct: 668 CASVEAAG----DHCDDLAFDVKLQVRNAGEVAGAHSVLLFSS-PPPAHNAPAKHLLGFE 722
Query: 755 RVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
+V + ++F VD C LS+ ++ G R + LG H L VG+L+H++ +
Sbjct: 723 KVSLAPGEAGTVAFRVDVCRDLSVVDELGGRKVALGGHTLHVGDLKHTVELR 774
>gi|115486595|ref|NP_001068441.1| Os11g0673200 [Oryza sativa Japonica Group]
gi|113645663|dbj|BAF28804.1| Os11g0673200 [Oryza sativa Japonica Group]
Length = 822
Score = 701 bits (1809), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 367/806 (45%), Positives = 499/806 (61%), Gaps = 81/806 (10%)
Query: 51 SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
+ PFC SL RA+ L++ LT EK++ L +NA+ +PRLG+ YEWWSE+LHG++ G
Sbjct: 40 TLPFCRRSLPARARARDLVARLTRAEKVRLLVNNAAGVPRLGVAGYEWWSEALHGVSDTG 99
Query: 111 PGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIG-------------------------- 144
PGV F G T+FPQV+ +AASFN +LW IG
Sbjct: 100 PGVRFGGAFPGATAFPQVIGTAASFNATLWELIGQVMPILKGGHARCNQRPSCIRISVFM 159
Query: 145 ------SAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEF 198
AV+ E RAMYN GQAGLTFW+PN+NIFRDPRWGRGQETPGEDP V + YA +
Sbjct: 160 YVYVCAQAVSDEGRAMYNGGQAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVAARYAAAY 219
Query: 199 VKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFS 258
V+ Q ++ L L+ACCKH AYDL+ W
Sbjct: 220 VRGLQ-------------------------QQQPSSGRLKLAACCKHFTAYDLDNWSGTD 254
Query: 259 RYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC-----LRGDLFQKA 313
R+ FNA++T QD EDTF PFRSC+ G+A+ +MCSYNQVNGVP C LRG +
Sbjct: 255 RFHFNAVVTRQDLEDTFNVPFRSCVVDGRAASVMCSYNQVNGVPTCADAAFLRGTI---- 310
Query: 314 RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDK 373
R WG GYI SDCD+V + Q+YT+T ED+ A L+AG+D++CG + ++T+ A+ +
Sbjct: 311 RRRWGLAGYIVSDCDSVDVFYSDQHYTRTREDAVAATLRAGLDLDCGPFLAQYTEGAVAQ 370
Query: 374 GKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIV 433
GKV + DID A+ N +VQ+RLG+F+GDP +G LGP VCT+ H++LA++AARQGIV
Sbjct: 371 GKVGDGDIDAAVTNTVTVQMRLGMFDGDPAAQPFGHLGPQHVCTAAHQELAVEAARQGIV 430
Query: 434 LLKNDKKFLPLN-KNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHY 492
LLKND + LPL+ A ++A++GP M G Y G PC + L+G+ Y ++ +
Sbjct: 431 LLKNDGRALPLSPATARRAVAVVGPHAEATVAMIGNYAGKPCRYTTPLQGVARYAARAAH 490
Query: 493 ASGCHDVPC-NSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVT 551
GC DV C S AV A++AD IVVAGLD E E DR SLLLPG+Q L++
Sbjct: 491 QPGCTDVACAGSGQPIAAAVDAARRADATIVVAGLDQKIEAEGLDRASLLLPGRQAELIS 550
Query: 552 SVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGG 611
SVA+ SK PVILVL GGP+D+ FA+ D +I+ ILW GYPG+AG +A+A++IFG NPGG
Sbjct: 551 SVAKASKGPVILVLMSGGPIDIGFAQNDPKIAGILWAGYPGQAGGQAIADVIFGHHNPGG 610
Query: 612 RLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFL 670
+LP+TWYP+ + KVPM +M MRA+ ++ YPGR+YRFYTG ++ FGHGLSYT++++
Sbjct: 611 KLPVTWYPQDYLQKVPMTNMAMRANPAKGYPGRTYRFYTGPTIHPFGHGLSYTSFTHSIA 670
Query: 671 SAPSELTI--SASLKAGSDKNILQQTGSRLD--YVHIDEVTSCTSLRFHVQISVTNAGDV 726
APS+LT+ SA A S L T +RL C LR V + V N G+
Sbjct: 671 HAPSQLTVRLSAHHAAASASASLNAT-ARLSRAAAVRVAHARCEELRMPVHVDVRNVGER 729
Query: 727 DGSHVVMLFARVPKVSQGTPE-------KQLIGFDRVHTVAKGSKEISFGVDPCEQLSIA 779
DG+H V+++A P S +QL+ F++VH A G+ + G+D C+ LS+A
Sbjct: 730 DGAHTVLVYAAAPASSAAEAAAGHGAPVRQLVAFEKVHVGAGGTARVEMGIDVCDGLSVA 789
Query: 780 NKHGRRILPLGNHVLMVGELRHSLTI 805
+++G R +P+G H L++GEL H++TI
Sbjct: 790 DRNGVRRIPVGEHRLIIGELTHTVTI 815
>gi|183579871|dbj|BAG28345.1| arabinofuranosidase [Citrus unshiu]
Length = 769
Score = 700 bits (1807), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 345/770 (44%), Positives = 488/770 (63%), Gaps = 41/770 (5%)
Query: 33 SKPVLNKPDFPCKPPH--FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPR 90
S+ V ++ F C P + S FC TS+ I R + LI LTLQEKI+ L +NA+A+PR
Sbjct: 19 SRFVESRVPFACDPRNGLTRSLRFCRTSVPIHVRVQDLIGRLTLQEKIRLLVNNAAAVPR 78
Query: 91 LGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVE 150
LGI YEWWSE+LHG+++ GPG F G TSFPQV+ +AA+FN SLW IG V+ E
Sbjct: 79 LGIQGYEWWSEALHGVSNVGPGTKFGGAFPGATSFPQVITTAAAFNESLWEEIGRVVSDE 138
Query: 151 ARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSD 210
ARAMYN G AGLT+W+PN+NIFRDPRWGRGQETPGEDP++ YA +V+ QG
Sbjct: 139 ARAMYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLAGKYAASYVRRLQG------ 192
Query: 211 DGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQD 270
+ G L ++ACCKH AYDL+ W RY FNA +++QD
Sbjct: 193 ---------------------NTGSRLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQD 231
Query: 271 TEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDA 329
EDT+ PF++C+ +GK + +MCSYNQVNG P C D+ + R +W GYI SDCD+
Sbjct: 232 LEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDILKNTIRGQWRLDGYIVSDCDS 291
Query: 330 VATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLF 389
V ++ Q+YT+T E++AA +KAG+D++CG + HT+ A+ G ++E+D++ A
Sbjct: 292 VGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLASAYTI 351
Query: 390 SVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAV 449
+VQ+RLG+F+G+P +G LGP DVCT H++LAL AA QGIVLLKN + LPL+
Sbjct: 352 TVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRH 411
Query: 450 SSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHE 509
++A+IGP + M G Y G+ C + L+G+ Y +KT + +GC V CN +
Sbjct: 412 HTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGISRY-AKTIHQAGCLGVACNGNQLIGA 470
Query: 510 AVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGG 569
A A++AD ++V GLD + E E DR LLLPG+Q LV+ VA+ S+ PV+LVL GG
Sbjct: 471 AEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGG 530
Query: 570 PLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMN 628
P+DVSFA+ D +I +ILW+GYPG+AG A+A+++FG NPGG+LPMTWYP+ + ++PM
Sbjct: 531 PVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPGGKLPMTWYPQDYVARLPMT 590
Query: 629 DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDK 688
DM MRA R YPGR+YRFY G V+ FGHG+SYT +++ AP++ ++ + + K
Sbjct: 591 DMRMRA--GRGYPGRTYRFYKGPVVFPFGHGMSYTTFAHTLSKAPNQFSVPIATSLYAFK 648
Query: 689 N-ILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPE 747
N + R+ + + ++ SL HV V N GD+ G+H +++FA+ P +P
Sbjct: 649 NTTISSNAIRVAHTNCNDA---MSLGLHVD--VKNTGDMAGTHTLLVFAK-PPAGNWSPN 702
Query: 748 KQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
KQLIGF +VH A + + + C+ LS+ +K G R +P N V ++G
Sbjct: 703 KQLIGFKKVHVTAGALQSVRLDIHVCKHLSVVDKFGIRRIPWVNIVSILG 752
>gi|255572559|ref|XP_002527213.1| Thermostable beta-glucosidase B, putative [Ricinus communis]
gi|223533389|gb|EEF35139.1| Thermostable beta-glucosidase B, putative [Ricinus communis]
Length = 454
Score = 695 bits (1794), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 325/454 (71%), Positives = 392/454 (86%)
Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDD 414
MDINCG+ +R+ QSA+DKGK++E+DIDRALLNLFSVQLRLGLF+GD G + KLGP+D
Sbjct: 1 MDINCGSYAIRNAQSAVDKGKLREEDIDRALLNLFSVQLRLGLFDGDRINGHFSKLGPED 60
Query: 415 VCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPC 474
VCT EHKKLAL+AARQGIVLLKN+KKFLPLNK AVSSLAIIGPL NN +GG YTG C
Sbjct: 61 VCTEEHKKLALEAARQGIVLLKNEKKFLPLNKKAVSSLAIIGPLANNGGSLGGDYTGYSC 120
Query: 475 SPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETED 534
+P+SL G++AY+ +T YA GC +V C+SD F EA+ IAK ADFVIVVAG+DL+QETED
Sbjct: 121 NPQSLFDGVQAYIKRTSYAVGCSNVSCDSDDQFPEAIHIAKTADFVIVVAGIDLSQETED 180
Query: 535 RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEA 594
RDR+SLLLPG+QM+LV+ VA SK+PVILVLTGGGP+DVSFA+ DS+I+SILWIGYPGEA
Sbjct: 181 RDRISLLLPGKQMALVSYVAAASKKPVILVLTGGGPVDVSFAKRDSRIASILWIGYPGEA 240
Query: 595 GAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVY 654
GAKALA+IIFG++NPGGRLPMTWYPESFT VPMNDMNMRA+ +R YPGR+YRFYTG +VY
Sbjct: 241 GAKALADIIFGEYNPGGRLPMTWYPESFTNVPMNDMNMRANPNRGYPGRTYRFYTGERVY 300
Query: 655 GFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRF 714
GFG GLSYTNY+YKFLSAPS+L++S SL A S K IL Q G RLDY+ IDE++SC SLRF
Sbjct: 301 GFGEGLSYTNYAYKFLSAPSKLSLSGSLTATSRKRILHQRGDRLDYIFIDEISSCNSLRF 360
Query: 715 HVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCE 774
VQISV N GD+DGSHVVMLF+RVP+VS+GTPEKQL+GF+R++TV+ S E S +DPC+
Sbjct: 361 TVQISVMNVGDMDGSHVVMLFSRVPQVSEGTPEKQLVGFERINTVSHKSTETSILLDPCK 420
Query: 775 QLSIANKHGRRILPLGNHVLMVGELRHSLTIETY 808
LSIAN G+RI+P+G+HVL++G+L+H +TIE Y
Sbjct: 421 HLSIANGQGKRIMPVGSHVLLLGDLQHFVTIEAY 454
>gi|18025340|gb|AAK38481.1| alpha-L-arabinofuranosidase/beta-D-xylosidase isoenzyme ARA-I
[Hordeum vulgare]
Length = 777
Score = 693 bits (1788), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 356/772 (46%), Positives = 487/772 (63%), Gaps = 37/772 (4%)
Query: 40 PDFPCKPPH--FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYE 97
P F C + +Y FCN + S RA+ L+S LTL EK+ L + A+ RLGIPAYE
Sbjct: 37 PVFACDASNATLAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPALGRLGIPAYE 96
Query: 98 WWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL 157
WWSE+LHG++ GPG F+ V TSFPQ +++AASFN SL+ IG V+ EARAM+N+
Sbjct: 97 WWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVSTEARAMHNV 156
Query: 158 GQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFG 217
G AGLTFW+PNINIFRDPRWGRGQETPGEDP++ S YAV +V Q + GG+ G
Sbjct: 157 GLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQ----DAGAGGVTDG 212
Query: 218 FREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQP 277
L ++ACCKH AYD++ W RY+F+A +++QD +DTFQP
Sbjct: 213 ------------------ALKVAACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQP 254
Query: 278 PFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEY 336
PF+SC+ G + +MCSYN+VNG P C DL + R +W GYI SDCD+V ++
Sbjct: 255 PFKSCVLDGNVASVMCSYNKVNGKPTCADKDLLEGVIRGDWKLNGYIVSDCDSVDVLYTQ 314
Query: 337 QNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
Q+YTKT E++AA +K+G+D+NCG + +HT +A+ G++ E+D+DRA+ N F + +RLG
Sbjct: 315 QHYTKTPEEAAAITIKSGVDLNCGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLG 374
Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIG 456
F+GDPR+ +G LGP DVCTS +++LA + ARQGIVLLKN LPL+ ++ S+A+IG
Sbjct: 375 FFDGDPRQLAFGSLGPKDVCTSSNRELARETARQGIVLLKNSGA-LPLSAKSIKSMAVIG 433
Query: 457 PLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDA-GFHEAVRIAK 515
P N M G Y G PC + L+GL A V+ T Y GC +V C+ ++ AV A
Sbjct: 434 PNANASFTMIGNYEGTPCKYTTPLQGLGAKVN-TVYQPGCTNVGCSGNSLQLSTAVAAAA 492
Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
AD ++V G D + E E DR SLLLPGQQ LV++VA S PVILV+ GGP D+SF
Sbjct: 493 SADVTVLVVGADQSIERESLDRTSLLLPGQQTQLVSAVANASSGPVILVVMSGGPFDISF 552
Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRA 634
A+A +I++ LW+GYPGEAG AL + +FG NP GRLP+TWYP S+ V M DM MR
Sbjct: 553 AKASDKIAATLWVGYPGEAGGAALDDTLFGSHNPSGRLPVTWYPASYADTVTMTDMRMRP 612
Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
D+S YPGR+YRFYTG V+ FG GLSYT S+ +SAP S++ D +
Sbjct: 613 DTSTGYPGRTYRFYTGDTVFAFGDGLSYTKMSHSLVSAPPSYV---SMRLAEDHLCRAEE 669
Query: 695 GSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFD 754
+ ++ C L V++ V NAG+V G+H V+LF+ P + P K L+GF+
Sbjct: 670 CASVEAAG----DHCDDLALDVKLQVRNAGEVAGAHSVLLFSS-PPPAHNAPAKHLVGFE 724
Query: 755 RVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
+V + ++F VD C LS+ ++ G R + LG H L G+L+H++ +
Sbjct: 725 KVSLAPGEAGTVAFRVDVCRDLSVVDELGGRKVALGGHTLHDGDLKHTVELR 776
>gi|189380221|gb|ACD93208.1| beta xylosidase [Camellia sinensis]
Length = 767
Score = 692 bits (1787), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 341/773 (44%), Positives = 487/773 (63%), Gaps = 44/773 (5%)
Query: 38 NKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYE 97
++P F C + PFC SL I R + LI LTLQEKI+ L +NA+A+PRLGI YE
Sbjct: 27 SRPAFACDGA-TRNLPFCRVSLPIQDRVRDLIGRLTLQEKIRLLVNNAAAVPRLGIKGYE 85
Query: 98 WWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL 157
WWSE+LHG+++ PGV F G TSFPQV+ +AASFN SLW +IG V+ EARAMYN
Sbjct: 86 WWSEALHGVSNADPGVKFGGAFPGATSFPQVISTAASFNASLWEHIGRVVSDEARAMYNG 145
Query: 158 GQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFG 217
G AGLT+W+PN+NIFRDPRWGRGQETPGEDP++ YA +V+ QG
Sbjct: 146 GMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLAGKYAASYVRGLQG------------- 192
Query: 218 FREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQP 277
+ G++L ++ACCKH AYDL+ W + RY FNA +++QD DT+
Sbjct: 193 --------------NSGNQLKVAACCKHYTAYDLDNWNSVDRYRFNARVSKQDLADTYDV 238
Query: 278 PFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQ 337
PF++C+ +GK ++ P L L + W + ++ C + + +
Sbjct: 239 PFKACVVEGKYQVYCAHTIKLMANPLVL--TLISPQHHPWSWHSWL--HCFRLYRCWGFI 294
Query: 338 NYTKTH---EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
++ H ED+AA +KAG+D+ CG + HT+ A+ +GK+ E D++ AL+N SVQ+R
Sbjct: 295 CHSTLHSTPEDAAAATIKAGLDLECGPFLAIHTEQAVRQGKLGEADVNGALINTLSVQMR 354
Query: 395 LGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
LG+F+G+P YG LGP DVCT H++LAL+AARQGIVLL+N + LPL+ ++A+
Sbjct: 355 LGMFDGEPSSQPYGNLGPRDVCTPAHQQLALEAARQGIVLLQNRGRSLPLSTQLHRTVAV 414
Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIA 514
IGP + M G Y G+ C + L+G+E YV H SGC V C+++ F A A
Sbjct: 415 IGPNSDVTVTMLGNYAGVACGFTTPLQGIERYVRTIH-QSGCDSVACSNNQLFGVAETAA 473
Query: 515 KKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
++AD ++V GLD + ETE +DRV LLLPG Q LV+ VA S+ PV+LVL GGP+DVS
Sbjct: 474 RQADATVLVMGLDQSIETEFKDRVGLLLPGPQQELVSRVAMASRGPVVLVLMSGGPIDVS 533
Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMR 633
FA+ D +I +ILW+GYPG+AG A+A+++FG NPGGRLPMTWYP+ + K PM +M MR
Sbjct: 534 FAKNDPRIGAILWVGYPGQAGGTAIADVLFGRTNPGGRLPMTWYPQDYLAKAPMTNMAMR 593
Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
A+ S YPGR+YRFY G V+ FGHG+SYT ++++ AP+ +++ + G +
Sbjct: 594 ANPSSGYPGRTYRFYKGPVVFPFGHGMSYTTFAHELAHAPTTVSVPLTSLYGLQNSTTFN 653
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
G R+ + T+C +L + I V N GD+DG+H V++F+ P V + KQLIGF
Sbjct: 654 NGIRVTH------TNCDTLILGIHIDVKNTGDMDGTHTVLVFS-TPPVGKWGANKQLIGF 706
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
+VH VA+G + + V C QLS+ ++ G R +P+G H L +G+++HS++++
Sbjct: 707 KKVHVVARGRQRVKIHVHVCNQLSVVDQFGIRRIPIGEHSLHIGDIKHSISLQ 759
>gi|296083274|emb|CBI22910.3| unnamed protein product [Vitis vinifera]
Length = 738
Score = 692 bits (1786), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 350/769 (45%), Positives = 480/769 (62%), Gaps = 70/769 (9%)
Query: 42 FPCKPPH--FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWW 99
F C P + + PFC SL I RA+ L+ LTLQEKI+ L +NA +PRLGI YEWW
Sbjct: 27 FACDPRNGVTRNLPFCRVSLPIQERARDLVGRLTLQEKIRLLVNNAIDVPRLGIKGYEWW 86
Query: 100 SESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ 159
SE+LHG+++ GPG F G+ TSFPQV+ +AASFN SLW IG V+ EARAMYN G
Sbjct: 87 SEALHGVSNVGPGTKFGGSFPGATSFPQVITTAASFNASLWEEIGRVVSDEARAMYNGGM 146
Query: 160 AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFR 219
AGLT+W+PN+NIFRDPRWGRGQETPGEDP V + YA +V+ QG
Sbjct: 147 AGLTYWSPNVNIFRDPRWGRGQETPGEDPAVAAKYAAAYVRGLQG--------------- 191
Query: 220 EKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPF 279
+ D L ++ACCKH AYDL+ WG R+ FNA +++QD EDT+ PF
Sbjct: 192 ------------NARDRLKVAACCKHYTAYDLDHWGGIDRFHFNARVSKQDLEDTYDVPF 239
Query: 280 RSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQN 338
++C+ +G + +MCSYNQVNG P C L + R EW GYI SDCD+V ++ Q+
Sbjct: 240 KACVVEGNVASVMCSYNQVNGKPTCADPHLLRDTIRGEWKLNGYIVSDCDSVGVFYDEQH 299
Query: 339 YTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
YT T E++AA +KAG+D++CG + HT++AI GK+ E D++ AL+N SVQ+RLG+F
Sbjct: 300 YTATPEEAAAVAIKAGLDLDCGPFLAIHTEAAIRGGKLTEADVNGALMNTISVQMRLGMF 359
Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
+G+P YG LGP DVCT H++LAL+AARQGIVL++N LPL+ + ++A+IGP
Sbjct: 360 DGEPSAQPYGNLGPRDVCTPAHQQLALEAARQGIVLVQNRGPALPLSTSRHRTIAVIGPN 419
Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKAD 518
+ M G Y G+ C + L+G+ Y H A GC V C D F AV A++AD
Sbjct: 420 SDVTETMIGNYAGVACGYTTPLQGIGRYARTIHQA-GCSGVACRDDQQFGAAVAAARQAD 478
Query: 519 FVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEA 578
++V GLD + E E RDRV +LLPG+Q LV+ VA S+ P +LVL GGP+DVSFA+
Sbjct: 479 ATVLVMGLDQSIEAEFRDRVDILLPGRQQELVSKVAVASRGPTVLVLMSGGPIDVSFAKN 538
Query: 579 DSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSS 637
D +I++I+W+GYPG+AG A+A+++FG NPGG+LP+TWYP+S+ K PM +M MRA S
Sbjct: 539 DPRIAAIIWVGYPGQAGGTAIADVLFGRTNPGGKLPVTWYPQSYLRKAPMTNMAMRAIPS 598
Query: 638 RQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSR 697
R YPGR+YRFY G V+ FGHGLSY+ +++ AP+
Sbjct: 599 RGYPGRTYRFYNGPVVFPFGHGLSYSTFAHSLAQAPT----------------------- 635
Query: 698 LDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVH 757
T L FH I V N G +DGSH ++LF+ P +P K+L+ F++VH
Sbjct: 636 ------------TPLGFH--IDVKNTGTMDGSHTLLLFS-TPPPGTWSPNKRLLAFEKVH 680
Query: 758 TVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
A + + F V C+ LS+ + G +P+G H +G+L+HS++++
Sbjct: 681 VGAGSQERVRFDVHVCKHLSVVDHFGIHRIPMGEHHFHIGDLKHSISLQ 729
>gi|85813772|emb|CAJ65922.1| xylan 1,4-beta-xylosidase [Populus tremula x Populus alba]
Length = 757
Score = 692 bits (1786), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 375/819 (45%), Positives = 497/819 (60%), Gaps = 107/819 (13%)
Query: 18 VLQLIVVVNVIAFSNSKPVL--NKPDFPC---KPPHFDSYPFCNTSLSISTRAKSLISLL 72
V +V + FS+S VL + P F C P S+ FCNTSL +S R L+ L
Sbjct: 15 VFLFFSLVCFLLFSSSHVVLAQSSPVFACDVVSNPSLASFGFCNTSLGVSDRVVDLVKRL 74
Query: 73 TLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSA 132
TLQEKI L ++A ++ RLGIP YEWWSE+LHG++ GPG +F+ V TSFPQV+++A
Sbjct: 75 TLQEKILFLVNSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSSVVPGATSFPQVILTA 134
Query: 133 ASFNRSLWSNIG----SAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDP 188
ASFN SL+ IG V+ EARAMYN+G AGLTFW+PNINIFRDPRWGRGQETPGEDP
Sbjct: 135 ASFNTSLFVAIGKVISQVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDP 194
Query: 189 MVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIA 248
++ S Y +VK Q + DDG D L ++ACCKH A
Sbjct: 195 LLSSKYGSGYVKGLQ----QRDDG--------------------NPDGLKVAACCKHYTA 230
Query: 249 YDLEKWGNFSRYSFNA-MITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRG 307
YDL+ W RY FNA ++T+QD +DTFQPPF+SC+ G + +MCSYN+VNG+P C
Sbjct: 231 YDLDNWKGVDRYHFNAVVVTKQDMDDTFQPPFKSCVVDGNVASVMCSYNKVNGIPTCADP 290
Query: 308 DLFQKA-RNEWGFKG--YITSDCDAVATIFEYQNYTKTHEDSAAGVLKA--GMDINCGTC 362
DL R EW G YI +DCD++ + Q+YTKT E++AA + A G+D+NCG+
Sbjct: 291 DLLSGVIRGEWKLNGYVYIVTDCDSIDVFYNSQHYTKTPEEAAAKAILAGIGLDLNCGSF 350
Query: 363 MLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKK 422
+ +HT++A+ G V E IDRA+ N F+ +RLG F+GDP K YGKLGP DVCT+E+++
Sbjct: 351 LGKHTEAAVTAGLVNESAIDRAVSNNFATLMRLGFFDGDPSKQLYGKLGPKDVCTAENQE 410
Query: 423 LALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRG 482
LA +AARQGIVLLKN TG PC + L+G
Sbjct: 411 LAREAARQGIVLLKN--------------------------------TGTPCKYTTPLQG 438
Query: 483 LEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLL 542
L A V+ T Y GC +V C S A +A +IA AD ++V G DL+ E E RDRV +LL
Sbjct: 439 LAALVATT-YLPGCSNVAC-STAQVDDAKKIAAAADATVLVMGADLSIEAESRDRVDILL 496
Query: 543 PGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEI 602
PGQQ L+T+VA S PVILV+ GG +DVSFA+ + +I+SILW+GYPGEAG A+A+I
Sbjct: 497 PGQQQLLITAVANASTGPVILVIMSGGGMDVSFAKTNDKITSILWVGYPGEAGGAAIADI 556
Query: 603 IFGDFN------PGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
IFG +N PGGRLPMTWYP+S+ KVPM +MNMR D S YPGR+YRFYTG VY
Sbjct: 557 IFGSYNPSTHQPPGGRLPMTWYPQSYVDKVPMTNMNMRPDPSNGYPGRTYRFYTGETVYS 616
Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSL--- 712
FG GLSY+ +S++ AP +++ L+ H+ + C S+
Sbjct: 617 FGDGLSYSEFSHELTQAPGLVSVP------------------LEENHVCYSSECKSVAAA 658
Query: 713 -----RFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEIS 767
F V + + N G GSH V LF+ P V +P+K L+GF++V A+ +
Sbjct: 659 EQTCQNFDVHLRIKNTGTTSGSHTVFLFSTPPSV-HNSPQKHLVGFEKVFLHAQTDSHVG 717
Query: 768 FGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
F VD C+ LS+ ++ G + + LG HVL +G L+HS+T+
Sbjct: 718 FKVDVCKDLSVVDELGSKKVALGEHVLHIGSLKHSMTVR 756
>gi|357138088|ref|XP_003570630.1| PREDICTED: probable beta-D-xylosidase 7-like [Brachypodium
distachyon]
Length = 1026
Score = 690 bits (1780), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 350/682 (51%), Positives = 452/682 (66%), Gaps = 40/682 (5%)
Query: 11 RERILFRVLQLIVVVNVIAFSNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLIS 70
R I VL L +V+ ++ P F C P SYPFC+ L I RA L S
Sbjct: 4 RSIIFHVVLPLCLVLQATMATD-------PPFSCGSP--SSYPFCDRKLPIGQRAADLAS 54
Query: 71 LLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP----GVNFN-GTVSSVTSF 125
LT++EK+ L D + +PRLG+PAY+WWSE+LHG+A N P GV F+ G V + TSF
Sbjct: 55 RLTVEEKVSLLGDVSPGVPRLGVPAYKWWSEALHGVA-NAPADRAGVRFDDGPVRAATSF 113
Query: 126 PQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA-GLTFWAPNINIFRDPRWGRGQETP 184
PQVLV+AASFN LW IG + EAR +YN GQA GLTFWAPNIN+FRDPRWGRGQETP
Sbjct: 114 PQVLVTAASFNPHLWYRIGQVIGREARGIYNSGQAEGLTFWAPNINVFRDPRWGRGQETP 173
Query: 185 GEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCK 244
GEDP + YA FV+ QG +G G + G L SACCK
Sbjct: 174 GEDPTMTGKYAAVFVRGVQG-----------YG--------ASGAVNSSG--LEASACCK 212
Query: 245 HLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC 304
H AYDLE W +R++FNA ++EQD DT+ PPFRSC+E G AS IMCSYN+VNGVP C
Sbjct: 213 HFTAYDLENWNGVTRFAFNAKVSEQDLADTYNPPFRSCVEDGGASGIMCSYNRVNGVPTC 272
Query: 305 LRGDLFQK-ARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCM 363
+L K AR +W F GYITSDCDAVA I + Q Y K ED+ A VLKAGMD+NCG +
Sbjct: 273 ADHNLLSKTARGDWRFNGYITSDCDAVAIIHDVQGYAKEPEDAVADVLKAGMDVNCGDYV 332
Query: 364 LRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKL 423
+H SA +GK+ E+DIDRAL NLF++++RLGLF+G+P+ +YG +G D VC EH+ L
Sbjct: 333 QKHGVSAFHQGKITEQDIDRALQNLFAIRMRLGLFDGNPKYNRYGNIGADQVCKKEHQDL 392
Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGL 483
AL+AA+ GIVLLKND LPL K +SSLA+IG N+ ++ G Y G PC S L+ L
Sbjct: 393 ALEAAQDGIVLLKNDAGTLPLPKQKISSLAVIGHNANDAQRLQGNYFGPPCISVSPLQAL 452
Query: 484 EAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLP 543
+ YV +T + +GC+ CN + A + A +A++V++ GLD QE ED DR+ L LP
Sbjct: 453 QGYVRETKFVAGCNAAVCNV-SDIAGAAKAASEAEYVVLFMGLDQDQEREDLDRIELGLP 511
Query: 544 GQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEII 603
G Q SLV +VA +K+PV+LVL GGP+DV+FA+ + +I +I+W GYPG+AG A+A+++
Sbjct: 512 GMQESLVNAVADAAKKPVVLVLLCGGPVDVTFAKGNPKIGAIIWAGYPGQAGGIAIAQVL 571
Query: 604 FGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
FG+ NPGGRLP+TWYP+ + T V M DM MRAD+S YPGR+YRFY G VY FG+GLSY
Sbjct: 572 FGEHNPGGRLPVTWYPKEYATAVAMTDMRMRADASTGYPGRTYRFYKGKTVYNFGYGLSY 631
Query: 663 TNYSYKFLSAPSELTISASLKA 684
+ YS+ F+S P + ++A
Sbjct: 632 SKYSHSFVSKPPASMMIGEVRA 653
>gi|357489437|ref|XP_003615006.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
gi|355516341|gb|AES97964.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
Length = 685
Score = 687 bits (1772), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 342/699 (48%), Positives = 462/699 (66%), Gaps = 31/699 (4%)
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA-GLTFWAPNIN 170
G+ NG++ + TSFPQV+++AASF+ LW I + EAR +YN GQA G+ FWAPNIN
Sbjct: 2 GIILNGSIPAATSFPQVILTAASFDPKLWYQISKVIGTEARGVYNAGQAQGMNFWAPNIN 61
Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
IFRDPRWGRGQET GEDP+V S Y V +V+ QG+++ +GG G R
Sbjct: 62 IFRDPRWGRGQETAGEDPLVNSKYGVSYVRGLQGDSF---EGGKLIGGR----------- 107
Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
L SACCKH AYDLE W +RY F+A +T QD DT+QP F SC+ QG++S
Sbjct: 108 ------LKASACCKHFTAYDLENWKGVNRYVFDAKVTLQDLADTYQPSFHSCVVQGRSSG 161
Query: 291 IMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
IMC+YN+VNGVP C +L AR +W F GYI SDCDAV I+E Q Y KT ED A
Sbjct: 162 IMCAYNRVNGVPNCADYNLLTNTARKKWNFNGYIASDCDAVRFIYEKQGYAKTPEDVVAD 221
Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGK 409
VL+AGMD+ CG M +H +SA+ + K+ IDRAL NLF++++RLGLF+G+P K +YG+
Sbjct: 222 VLRAGMDLECGNYMTKHAKSAVLQKKIPISQIDRALHNLFTIRIRLGLFDGNPTKLQYGR 281
Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNIS-QMGGG 468
+GP+ VC+ E+ LAL+AAR GIVLLKN LPL + V++L +IGP N S + G
Sbjct: 282 IGPNQVCSKENLDLALEAARSGIVLLKNTASILPLPR--VNTLGVIGPNANKSSIVLLGN 339
Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDL 528
Y G PC S+L+G Y S+THY SGC D + A AV +AK +D+VI+V GLD
Sbjct: 340 YIGPPCKNVSILKGFYTYASQTHYHSGCTDGTKCASAEIDRAVEVAKISDYVILVMGLDQ 399
Query: 529 TQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWI 588
+QETE DR L LPG+Q L+ SVA+ SK+PVILVL GGP+D++FA+ + +I I+W
Sbjct: 400 SQETETLDRDHLELPGKQQKLINSVAKASKKPVILVLLCGGPVDITFAKNNDKIGGIIWA 459
Query: 589 GYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFY 648
GYPGE G +ALA+++FGD+NPGGRLPMTWYP+ F K+PM DM MRAD S YPGR+YRFY
Sbjct: 460 GYPGELGGRALAQVVFGDYNPGGRLPMTWYPKDFIKIPMTDMRMRADPSSGYPGRTYRFY 519
Query: 649 TGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEV-- 706
TG +VY FG+GLSY+NYSY F+S + + + + +IL+ + + ++Y + E+
Sbjct: 520 TGPKVYEFGYGLSYSNYSYNFISVKNN---NLHINQSTTYSILENSET-INYKLVSELGE 575
Query: 707 TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEI 766
+C ++ V + +TN G + G H V+LF + K G P KQL+GF+ V G E+
Sbjct: 576 ETCKTMSISVTLGITNTGSMAGKHPVLLFVKPKKGRNGNPVKQLVGFESVTVEGGGKGEV 635
Query: 767 SFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
F V CE LS AN+ G +++ G ++ +VG+ +S+ I
Sbjct: 636 GFEVSVCEHLSRANESGVKVIEEGGYLFLVGQEEYSINI 674
>gi|168065036|ref|XP_001784462.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162663987|gb|EDQ50724.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 726
Score = 680 bits (1754), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 348/759 (45%), Positives = 473/759 (62%), Gaps = 52/759 (6%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
FC+TSLS R L+S LTL+EK+ QL + ASAIPRL IPAYEWW E LHG+A V
Sbjct: 3 FCDTSLSDEIRVFDLVSRLTLEEKVTQLVNTASAIPRLSIPAYEWWQEGLHGVAH----V 58
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFR 173
+F G++ TSFP +++ ASFN+ LW+ IG A + EARA YN G AGLT+W+P INI R
Sbjct: 59 SFGGSLPRATSFPLPILTTASFNKDLWNQIGQAFSTEARAFYNDGIAGLTYWSPVINIAR 118
Query: 174 DPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDR 233
DPRWGR QET GEDP SAYA FV+ Q E
Sbjct: 119 DPRWGRIQETSGEDPYTTSAYATHFVQGMQ-------------------------EGDAN 153
Query: 234 GDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMC 293
L LSACCKH AYD++ W RY F+A + + DT+ PPF+SC+++G+++ +MC
Sbjct: 154 SKRLKLSACCKHFTAYDVDNWEGIDRYHFDA---KANLADTYNPPFQSCVQEGRSASLMC 210
Query: 294 SYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLK 352
SYN+VNGVP C D + R WG GYI SDCD+V + E NY T ED+AA L
Sbjct: 211 SYNKVNGVPTCANYDFLENTVRRAWGLNGYIVSDCDSVLVMHESTNYAPTTEDAAADALN 270
Query: 353 AGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP 412
AG+D+NCG + +T+ A+ GKV +D A+ N+F V++RLG+F+G+P ++G +G
Sbjct: 271 AGLDLNCGDYLASYTEGAVAMGKVNASRVDNAVYNVFLVRMRLGMFDGNPANQEFGNIGV 330
Query: 413 DDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGI 472
DVCT H++LA++AARQGIVLLKND LPL+KN + A+IGP N M G Y GI
Sbjct: 331 ADVCTPAHQELAVEAARQGIVLLKNDGNILPLSKNI--NTAVIGPNANATHTMLGNYEGI 388
Query: 473 PCSPKSLLRGLEAYVS----KTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDL 528
PC + L+GL + S K ++ GC + C D AV A AD V++V GL
Sbjct: 389 PCQYITPLQGLVKFGSGDYHKVWFSEGCVNTACQQDDQISSAVSTAAVADAVVLVVGLSQ 448
Query: 529 TQETEDRDRVSLLLPGQQMSLVTSVA-RTSKRPVILVLTGGGPLDVSFAEADSQISSILW 587
QE+E DR SLLLPG Q +L+ VA + RPV+LVL GP+D++FA+ D +I SILW
Sbjct: 449 VQESEALDRTSLLLPGYQQTLIDEVAGAAAGRPVVLVLMCAGPVDINFAKNDKRIQSILW 508
Query: 588 IGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRF 647
+GYPG++G +A+AE+IFG NPGG+LPM+WYPE +TK+ M +MNMR DS YPGR+YRF
Sbjct: 509 VGYPGQSGGQAIAEVIFGAHNPGGKLPMSWYPEDYTKISMTNMNMRPDSRSNYPGRTYRF 568
Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVT 707
YTG ++Y FG+GLSYT Y + F AP+ + ++ S+ + GS+
Sbjct: 569 YTGEKIYDFGYGLSYTEYKHSFALAPTTV-MTPSIHSQLCDPHQTSAGSK---------- 617
Query: 708 SCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ-GTPEKQLIGFDRVHTVAKGSKEI 766
+C+S F V I+V N G + G+H ++LF P + GTP KQL FD V+ + +++
Sbjct: 618 TCSSSNFDVHINVENIGAMAGNHTLLLFFTAPSAGKNGTPLKQLAAFDSVYIRSGSQEKV 677
Query: 767 SFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
++PC+ L + G R+L GNH+L VG+ +HSL++
Sbjct: 678 VLTLNPCQHLGTVAEDGTRMLEAGNHILSVGDAKHSLSV 716
>gi|326489197|dbj|BAK01582.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 709
Score = 678 bits (1749), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 345/733 (47%), Positives = 472/733 (64%), Gaps = 35/733 (4%)
Query: 76 EKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASF 135
+K+ L + A+ RLGIPAYEWWSE+LHG++ GPG F+ V TSFPQ +++AASF
Sbjct: 7 QKVGFLVNKQPALGRLGIPAYEWWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASF 66
Query: 136 NRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYA 195
N SL+ IG V+ EARAM+N+G AGLTFW+PNINIFRDPRWGRGQETPGEDP++ S YA
Sbjct: 67 NASLFRAIGEVVSTEARAMHNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYA 126
Query: 196 VEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG 255
V +V Q + GG+ G L ++ACCKH AYD++ W
Sbjct: 127 VGYVTGLQ----DAGAGGVTDG------------------ALKVAACCKHYTAYDVDNWK 164
Query: 256 NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-R 314
RY+F+A +++QD +DTFQPPF+SC+ G + +MCSYN+VNG P C DL + R
Sbjct: 165 GVERYTFDAKVSQQDLDDTFQPPFKSCVLDGNVASVMCSYNKVNGKPTCADKDLLEGVIR 224
Query: 315 NEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKG 374
+W GYI SDCD+V ++ Q+YTKT E++AA +K+G+D+NCG + +HT +A+ G
Sbjct: 225 GDWKLNGYIVSDCDSVDVLYTQQHYTKTPEEAAAITIKSGLDLNCGNFLAQHTVAAVQAG 284
Query: 375 KVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVL 434
++ E+D+DRA+ N F + +RLG F+GDPR+ +G LGP DVCTS +++LA + ARQGIVL
Sbjct: 285 ELSEEDVDRAITNNFIMLMRLGFFDGDPRQLAFGSLGPKDVCTSSNRELARETARQGIVL 344
Query: 435 LKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYAS 494
LKN LPL+ ++ S+A+IGP N M G Y G PC + L+GL A V+ T Y
Sbjct: 345 LKNSGA-LPLSAKSIKSMAVIGPNANASFTMIGNYEGTPCKYTTPLQGLGAKVN-TVYQP 402
Query: 495 GCHDVPCNSDA-GFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSV 553
GC +V C+ ++ AV A AD ++V G D + E E DR SLLLPGQQ LV++V
Sbjct: 403 GCTNVGCSGNSLQLSTAVAAAASADVTVLVVGADQSIERESLDRTSLLLPGQQTQLVSAV 462
Query: 554 ARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRL 613
A S PVILV+ GGP D+SFA+A +I++ILW+GYPGEAG ALA+I+FG NP GRL
Sbjct: 463 ANASSGPVILVVMSGGPFDISFAKASDKIAAILWVGYPGEAGGAALADILFGSHNPSGRL 522
Query: 614 PMTWYPESFTK-VPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSA 672
P+TWYP S+ V M DM MR D+S YPGR+YRFYTG V+ FG GLSYT S+ +SA
Sbjct: 523 PVTWYPASYADTVTMTDMRMRPDTSTGYPGRTYRFYTGDTVFAFGDGLSYTKMSHSLVSA 582
Query: 673 PSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVV 732
P S++ D + + ++ C L F V++ V NAG+V G+H V
Sbjct: 583 PPSYV---SMRLAEDHPCRAEECASVEAAG----DHCDDLAFDVKLQVRNAGEVAGAHSV 635
Query: 733 MLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNH 792
+LF+ P + P K L+GF++V + ++F VD C LS+ ++ G R + LG H
Sbjct: 636 LLFSS-PPPAHNAPAKHLLGFEKVSLAPGEAGTVAFRVDVCRDLSVVDELGGRKVALGGH 694
Query: 793 VLMVGELRHSLTI 805
L VG+L+H++ +
Sbjct: 695 TLHVGDLKHTVEL 707
>gi|449433577|ref|XP_004134574.1| PREDICTED: probable beta-D-xylosidase 2-like [Cucumis sativus]
gi|449530107|ref|XP_004172038.1| PREDICTED: probable beta-D-xylosidase 2-like [Cucumis sativus]
Length = 812
Score = 666 bits (1718), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 340/789 (43%), Positives = 472/789 (59%), Gaps = 40/789 (5%)
Query: 32 NSKPVLNKPDFPCKPPHFD-------SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDN 84
+ P +N F C P +D S+ FC++SLS RAK LI +TL EK QL
Sbjct: 46 DDPPAVNNFTFVCDPSRYDKLGLDFSSFGFCDSSLSFPERAKDLIDRMTLSEKAAQLGHV 105
Query: 85 ASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIG 144
AS + RLG+P Y WWSE+LHG+++ GPG F+ V TSFP V+ +A+SFN LW IG
Sbjct: 106 ASGVDRLGLPPYNWWSEALHGVSNVGPGTQFDKVVPGATSFPNVITTASSFNEDLWKTIG 165
Query: 145 SAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQG 204
AV+ EARAMYNLG+AGLT+W+P IN+ RDPRWGR ETPGEDP VV YA +V+ Q
Sbjct: 166 QAVSTEARAMYNLGRAGLTYWSPTINVIRDPRWGRTVETPGEDPFVVGKYAKNYVRGLQD 225
Query: 205 ENWKSDDGGIGFGFREKRVLKGFGEESDRGDE-LMLSACCKHLIAYDLEKWGNFSRYSFN 263
++G +D L +S+CCKH AYD++ W RYSF+
Sbjct: 226 -------------------VEGSENVTDLNSRPLKVSSCCKHYAAYDVDNWLGVERYSFD 266
Query: 264 AMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGY 322
A +TEQD +TF PF C+++G S +MCSYN+VNG+P C L + R WG GY
Sbjct: 267 ARVTEQDMLETFNKPFEMCVKEGDVSSVMCSYNRVNGIPTCADPVLLKDTIRGNWGLHGY 326
Query: 323 ITSDCDAVATIFEYQNYTK-THEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDI 381
I SDCD+V + E +Y + T+ED+ A LKAG+D++CG +T+S + +GKV ++I
Sbjct: 327 IVSDCDSVKVMVEDAHYLQDTNEDAVAQTLKAGLDLDCGQIYPNYTESTVRQGKVGMRNI 386
Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF 441
D AL NL+ V +RLG F+G+ + LG D+C+ EH +LA +AARQG VLLKND
Sbjct: 387 DNALNNLYVVLMRLGYFDGNT---GFESLGKPDICSDEHIELATEAARQGTVLLKNDNDT 443
Query: 442 LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPC 501
LP + + +LA++GP N S M G Y G+PC S + GL Y +K Y GC V C
Sbjct: 444 LPFDPSNYKTLAVVGPHANATSAMLGNYAGVPCRMNSPMDGLSEY-AKVKYQMGCDSVAC 502
Query: 502 NSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPV 561
+D A+ A+ +D ++ G+DL+ E E DRV LLLPG Q LV VA SK PV
Sbjct: 503 KNDTFIFGAMEAARTSDATVIFVGIDLSIEAESLDRVDLLLPGYQTQLVQQVATVSKGPV 562
Query: 562 ILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPES 621
+LV+ G +DVSFA+ +S I +I+W GYPGE G +A+A++IFG FNPGGRLP+TWY
Sbjct: 563 VLVILSAGGIDVSFAKNNSNIKAIIWAGYPGEEGGRAIADVIFGKFNPGGRLPLTWYEND 622
Query: 622 FT-KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISA 680
+ ++PM M +R S YPGR+Y+FY G VY FGHGLSYT + + SA + I
Sbjct: 623 YVYQLPMTSMPLRPVKSLGYPGRTYKFYDGPVVYPFGHGLSYTFFLHNLTSAKRSIAIDL 682
Query: 681 SLKAGSDKNILQQTGS---RLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFAR 737
S + ++I G+ V +D++T + F Q+ V N G+ DGS V+++++
Sbjct: 683 SNRTQC-RDIAYTNGTFKPECPAVLVDDLTCTEEIEF--QMEVENTGERDGSQVLLVYSV 739
Query: 738 VPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
P T KQ++GF RV A S+ ++F ++ C+ L + + G +LP G H ++VG
Sbjct: 740 PPGGISSTHIKQVVGFQRVFLKAGDSETVTFKLNACKSLGLVDFTGYNLLPAGGHTIVVG 799
Query: 798 ELRHSLTIE 806
+ S +E
Sbjct: 800 DGEVSFPVE 808
>gi|255557375|ref|XP_002519718.1| Beta-glucosidase, putative [Ricinus communis]
gi|223541135|gb|EEF42691.1| Beta-glucosidase, putative [Ricinus communis]
Length = 802
Score = 657 bits (1694), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 340/790 (43%), Positives = 479/790 (60%), Gaps = 49/790 (6%)
Query: 26 NVIAFSNSKPVLNKPDFPCKPPHFD-------SYPFCNTSLSISTRAKSLISLLTLQEKI 78
N + ++ P + + C +D ++ FC++SLS RAK L++ +TL+EK+
Sbjct: 33 NTLNHDDANPRGSSFTYVCDSSRYDNLGLDMTTFGFCDSSLSYEVRAKDLVNQMTLKEKV 92
Query: 79 QQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRS 138
QQL D A +PRLGIP YEWWSE+LHG++ GPG F+ V TSFP +++ ASFN S
Sbjct: 93 QQLGDLAYGVPRLGIPKYEWWSEALHGVSDVGPGTFFDDLVPGATSFPTTILTTASFNES 152
Query: 139 LWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEF 198
LW NIG A A +ARAMYNLG+AGLT+W+PN+N+ RDPRWGR ETPGEDP VV YAV +
Sbjct: 153 LWKNIGQASA-KARAMYNLGRAGLTYWSPNVNVVRDPRWGRTVETPGEDPYVVGRYAVNY 211
Query: 199 VKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFS 258
V+ Q +G + R LK +S+CCKH AYD+EKW
Sbjct: 212 VRGLQDV-----EGTENYTDLNTRPLK-------------VSSCCKHYAAYDVEKWQGVE 253
Query: 259 RYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEW 317
R +F+A +TEQD +TF PF C+++G S +MCS+N+VNG+P C L Q R +W
Sbjct: 254 RLTFDARVTEQDMVETFLRPFEMCVKEGDVSSVMCSFNRVNGIPTCADPKLLNQTIRGDW 313
Query: 318 GFKGYITSDCDAVATIFEYQNYT-KTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKV 376
GYI SDCD++ + + + T+ED+ A VLKAG+D++CG T++++ +GK
Sbjct: 314 DLHGYIVSDCDSIEVMVDNHKFLGDTNEDAVAQVLKAGLDLDCGGYYTNFTETSVKQGKA 373
Query: 377 QEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLK 436
+E+ IDR+L L+ V +RLG F+G P +Y KLG D+CT E+ +LA AAR+GIVLLK
Sbjct: 374 REEYIDRSLKYLYVVLMRLGFFDGTP---QYQKLGKKDICTKENVELAKQAAREGIVLLK 430
Query: 437 NDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGC 496
N+ LPL+ + V +LA++GP N M G Y G+PC S + G Y S Y GC
Sbjct: 431 NNDT-LPLSMDKVKNLAVVGPHANATRVMIGNYAGVPCRYVSPIDGFSIY-SNVTYEIGC 488
Query: 497 HDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVART 556
DVPC +++ AV AK AD I+VAGLDLT E E DR LLLPG Q L+ VA
Sbjct: 489 -DVPCKNESLVFPAVHAAKNADATIIVAGLDLTIEAEGLDRNDLLLPGYQTQLINQVAGA 547
Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
+ PVILV+ G +D+SFA + +I +ILW+GYPG+ G A+A+++FG +NPGGRLP+T
Sbjct: 548 ANGPVILVIMAAGGVDISFARDNEKIKAILWVGYPGQEGGHAIADVVFGKYNPGGRLPIT 607
Query: 617 WYPESFT-KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSE 675
WY F +VPM M +R D YPG++Y+FY G+ VY FG+GLSYT +SY SA
Sbjct: 608 WYEADFVEQVPMTYMQLRPDEELGYPGKTYKFYDGSTVYPFGYGLSYTTFSYNITSAKRS 667
Query: 676 LTISAS-------LKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDG 728
I+ + L+ G++ D++ ++ F +++ V N G DG
Sbjct: 668 KHIALNKFQHCRDLRYGNETFKPSCPAVLTDHLPCND-------DFELEVEVENTGSRDG 720
Query: 729 SHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILP 788
S VVM++++ P+ G+ KQ+IGF RV A ++++F + C+ I + + ILP
Sbjct: 721 SEVVMVYSKTPEGIVGSYIKQVIGFKRVFVQAGSVEKVNFRFNVCKSFRIIDYNAYSILP 780
Query: 789 LGNHVLMVGE 798
G H +MVG+
Sbjct: 781 SGGHTIMVGD 790
>gi|195614824|gb|ACG29242.1| auxin-induced beta-glucosidase [Zea mays]
gi|413920229|gb|AFW60161.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 655
Score = 634 bits (1635), Expect = e-179, Method: Compositional matrix adjust.
Identities = 313/672 (46%), Positives = 431/672 (64%), Gaps = 44/672 (6%)
Query: 154 MYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGG 213
MYN G+AGLTFW+PN+NIFRDPRWGRGQETPGEDP V + YA +V+ Q + + + +GG
Sbjct: 1 MYNGGRAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVSARYAAAYVRGLQ-QPYAAPNGG 59
Query: 214 IGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTED 273
+ L L+ACCKH AYDL+KWG R+ FNA++ QD ED
Sbjct: 60 -------------------HRNRLKLAACCKHFTAYDLDKWGGTDRFHFNAVVAAQDLED 100
Query: 274 TFQPPFRSCIEQGKASCIMCSYNQVNGVPAC-----LRGDLFQKARNEWGFKGYITSDCD 328
TF PFR+C+E G+A+ +MCSYNQVNGVP C LRG + R WG GYI SDCD
Sbjct: 101 TFNVPFRACVEDGRAASVMCSYNQVNGVPTCADAAFLRGTI----RGRWGLDGYIVSDCD 156
Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
+V F Q+YT+T ED+AA L+AG+D++CG + + SA+ GKV + D+D ALLN
Sbjct: 157 SVDVFFRDQHYTRTPEDAAAATLRAGLDLDCGPFLALYAGSAVAAGKVADADVDAALLNT 216
Query: 389 FSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKN------DKKFL 442
+VQ+RLG+F+GDP G +G+LGP DVCT EH+ LALDAARQG+VLLKN ++ L
Sbjct: 217 VTVQMRLGMFDGDPAAGPFGRLGPADVCTREHQDLALDAARQGVVLLKNRRGARHNRDVL 276
Query: 443 PLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCN 502
PL A +A++GP + M G Y G PC + L+G+ AY ++ + +GC DV C
Sbjct: 277 PLRPAAHRVVAVVGPHADATVAMIGNYAGKPCRYTTPLQGVAAYAARVAHQAGCTDVACR 336
Query: 503 SDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVI 562
+ AV A++AD +VVAGLD E E DR +LLLPG+Q L+++VA+ SK PVI
Sbjct: 337 GNQPIAAAVEAARQADATVVVAGLDQRVEAEGLDRTTLLLPGRQAELISAVAKASKGPVI 396
Query: 563 LVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF 622
LVL GGP+D++FA+ D +I ILW+GYPG+AG +A+A++IFG NPG +LP+TWY + +
Sbjct: 397 LVLMSGGPIDIAFAQNDPRIDGILWVGYPGQAGGQAIADVIFGHHNPGAKLPVTWYHQDY 456
Query: 623 -TKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISAS 681
KVPM +M MRA+ +R YPGR+YRFYTG +Y FGHGLSYT +++ AP++LT+ S
Sbjct: 457 LQKVPMTNMAMRANPARGYPGRTYRFYTGPTIYPFGHGLSYTQFTHTLAHAPTQLTVRLS 516
Query: 682 LKAGSDKNILQQTGSRLDY-VHIDEVTS--CTSLRFHVQISVTNAGDVDGSHVVMLFARV 738
S + + L V V C L V + V+N GD DG+H V+++
Sbjct: 517 GSGHSAASAASLLNATLARPVRAVRVAHARCEGLTVPVHVDVSNVGDRDGAHAVLVYHAA 576
Query: 739 PKVSQG-----TPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHV 793
P S P +QL+ F++VH A G + + C++LS+A+++G R +P+G H
Sbjct: 577 PSPSHAAPGADAPARQLVAFEKVHVPAGGVARVEMRIGVCDRLSVADRNGVRRVPVGEHR 636
Query: 794 LMVGELRHSLTI 805
LM+GEL HS+++
Sbjct: 637 LMIGELTHSVSL 648
>gi|85813770|emb|CAJ65921.1| xylan 1,4-beta-xylosidase [Populus tremula x Populus alba]
Length = 704
Score = 634 bits (1634), Expect = e-179, Method: Compositional matrix adjust.
Identities = 339/686 (49%), Positives = 441/686 (64%), Gaps = 55/686 (8%)
Query: 17 RVLQLIVVVNVIAFSNSKPVLNKPDFPC---KPPHFDSYPFCNTSLSISTRAKSLISLLT 73
+V+ L+ + ++ S + P F C P S FCNTS+ I+ R L+ LT
Sbjct: 14 KVVFLLFCMFLVFLSTHVSAQSSPVFACDVVSNPSLASLGFCNTSIGINDRVVDLVKRLT 73
Query: 74 LQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAA 133
LQEKI L ++A + RLGIP YEWWSE+LHG++ GPG +F+ V+ TSFPQV+++AA
Sbjct: 74 LQEKIVFLVNSAGNVSRLGIPKYEWWSEALHGVSYVGPGTHFSDDVAGATSFPQVILTAA 133
Query: 134 SFNRSLWSNIG-----SAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDP 188
SFN SL+ IG V+ EARAMYN+G AGLTFW+PNINIFRDPRWGRGQETPGEDP
Sbjct: 134 SFNTSLFEAIGKVYYTQVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDP 193
Query: 189 MVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIA 248
++ S Y +VK Q + DDG D+L ++ACCKH A
Sbjct: 194 LLSSKYGSCYVKGLQ----QRDDG--------------------DPDKLKVAACCKHYTA 229
Query: 249 YDLEKWGNFSRYSFNAMI-TEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRG 307
YDL+ W RY FNA++ T+QD +DTFQPPF+SC+ G + +MCSYNQVNG P C
Sbjct: 230 YDLDNWKGSDRYHFNAVVVTKQDMDDTFQPPFKSCVIDGNVASVMCSYNQVNGKPTCADP 289
Query: 308 DLFQKA-RNEWGFKGY-------ITSDCDAVATIFEYQNYTKTHEDS------AAGVLKA 353
DL R EW GY I +DCD++ ++ QNYTKT E++ A L
Sbjct: 290 DLLSGVIRGEWNLNGYQWGCCRYIVTDCDSLDVFYKSQNYTKTPEEAAAAAILAGNSLVT 349
Query: 354 GMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPD 413
G+D+NCG+ + +HT++A+ G V E ID A+ N F+ +RLG F+GDP K YGKLGP
Sbjct: 350 GVDLNCGSFLGQHTEAAVKGGLVNEHAIDIAVSNNFATLMRLGFFDGDPSKQLYGKLGPK 409
Query: 414 DVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT-GI 472
DVCT+E+++LA +AARQGIVLLKN LPL+ A+ +LA+IGP N M G Y G
Sbjct: 410 DVCTAENQELAREAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGGT 469
Query: 473 PCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQET 532
PC + L+GL A V+ T Y GC +V C S A +A ++A AD ++V G DL+ E
Sbjct: 470 PCKYTTPLQGLAASVATT-YLPGCSNVAC-STAQVDDAKKLAAAADATVLVMGADLSIEA 527
Query: 533 EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPG 592
E RDRV +LLPGQQ L+T+VA S PVILV+ GG +DVSFA + +I+SILW+GYPG
Sbjct: 528 ESRDRVDVLLPGQQQLLITAVANVSCGPVILVIMSGGGMDVSFARTNDKITSILWVGYPG 587
Query: 593 EAGAKALAEIIFGDFNPG----GRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRF 647
EAG A+A+IIFG +NP GRLPMTWYP+S+ KVPM +MNMR D S YPGR+YRF
Sbjct: 588 EAGGAAIADIIFGYYNPSTHQPGRLPMTWYPQSYVDKVPMTNMNMRPDPSNGYPGRTYRF 647
Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAP 673
YTG VY FG GLSY+ ++++ + AP
Sbjct: 648 YTGETVYSFGDGLSYSQFTHELIQAP 673
>gi|222629651|gb|EEE61783.1| hypothetical protein OsJ_16354 [Oryza sativa Japonica Group]
Length = 771
Score = 632 bits (1631), Expect = e-178, Method: Compositional matrix adjust.
Identities = 340/769 (44%), Positives = 455/769 (59%), Gaps = 91/769 (11%)
Query: 87 AIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSA 146
A+PRLGIPAYEWWSE+LHG++ GPG F+ V TSFPQ +++AASFN SL+ IG +
Sbjct: 44 ALPRLGIPAYEWWSEALHGVSYVGPGTRFSTLVPGATSFPQPILTAASFNASLFRAIGES 103
Query: 147 ------------------------------------------VAVEARAMYNLGQAGLTF 164
V+ EARAM+N+G AGLTF
Sbjct: 104 ACNNTSQFFFSSKSPFSICIAMENLHCDFRSRLVRFYRGARVVSTEARAMHNVGLAGLTF 163
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
W+PNINIFRDPRWGRGQETPGEDP++ S YAV +V Q D G G
Sbjct: 164 WSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQ-------DAGGG--------- 207
Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIE 284
D L ++ACCKH AYD++ W RY+F+A++++QD +DTFQPPF+SC+
Sbjct: 208 ---------SDALKVAACCKHYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQPPFKSCVI 258
Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTH 343
G + +MCSYN+VNG P C DL R +W GYI SDCD+V ++ Q+YTK
Sbjct: 259 DGNVASVMCSYNKVNGKPTCADKDLLSGVIRGDWKLNGYIVSDCDSVDVLYNNQHYTKNP 318
Query: 344 EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPR 403
ED+AA +K+G+D+NCG + +HT +A+ GK+ E D+DRA+ N F V +RLG F+GDPR
Sbjct: 319 EDAAAITIKSGLDLNCGNFLAQHTVAAVQAGKLSESDVDRAITNNFIVLMRLGFFDGDPR 378
Query: 404 KGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNIS 463
K +G LGP DVCTS +++LA +AARQGIVLLKN LPL+ ++ S+A+IGP N
Sbjct: 379 KLPFGSLGPKDVCTSSNQELAREAARQGIVLLKNTGA-LPLSAKSIKSMAVIGPNANASF 437
Query: 464 QMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDA-GFHEAVRIAKKADFVIV 522
M G Y G PC + L+GL A V+ T Y GC +V C+ ++ A + A AD ++
Sbjct: 438 TMIGNYEGTPCKYTTPLQGLGANVA-TVYQPGCTNVGCSGNSLQLSAATQAAASADVTVL 496
Query: 523 VAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQI 582
V G D + E E DR SLLLPGQQ LV++VA S+ PVILV+ GGP D+SFA++ +I
Sbjct: 497 VVGADQSVERESLDRTSLLLPGQQPQLVSAVANASRGPVILVVMSGGPFDISFAKSSDKI 556
Query: 583 SSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYP 641
S+ILW+GYP + + LP+TWYP SF KV M DM MR DSS YP
Sbjct: 557 SAILWVGYPRRSRWRRPRRHPLRIPQ--SWLPVTWYPASFADKVSMTDMRMRPDSSTGYP 614
Query: 642 GRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYV 701
GR+YRFYTG VY FG GLSYT +++ +SAP ++ + + G
Sbjct: 615 GRTYRFYTGDTVYAFGDGLSYTKFAHSLVSAPEQVAVQLA------------EGHACHTE 662
Query: 702 HIDEVTS----CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVH 757
H V + C SL F V + V NAG + G H V LF+ P V P K L+GF++V
Sbjct: 663 HCFSVEAAGEHCGSLSFDVHLRVRNAGGMAGGHTVFLFSSPPSV-HSAPAKHLLGFEKVS 721
Query: 758 TVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
+ ++F VD C+ LS+ ++ G R + LG+H L VG+L+H+L +
Sbjct: 722 LEPGQAGVVAFKVDVCKDLSVVDELGNRKVALGSHTLHVGDLKHTLNLR 770
>gi|225432132|ref|XP_002274591.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 1-like
[Vitis vinifera]
Length = 805
Score = 630 bits (1626), Expect = e-178, Method: Compositional matrix adjust.
Identities = 317/760 (41%), Positives = 460/760 (60%), Gaps = 37/760 (4%)
Query: 51 SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
S+ FC+ SLS RAK L+S +TLQEK+ Q AS + RLG+P Y WWSE+LHGI++ G
Sbjct: 63 SFAFCDKSLSYKERAKDLVSRMTLQEKVMQSVHTASGVRRLGLPEYSWWSEALHGISNLG 122
Query: 111 PGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNIN 170
PGV F+ T+ TS P V++S A+FN++LW +G V+ E RAMYNLG AGLTFW+PNIN
Sbjct: 123 PGVFFDETIPGATSLPTVILSTAAFNQTLWKTLGRVVSTEGRAMYNLGHAGLTFWSPNIN 182
Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
+ RD RWGR QET GEDP +V +AV +V+ Q ++G
Sbjct: 183 VVRDTRWGRTQETSGEDPFIVGEFAVNYVRGLQD-------------------VEGTENV 223
Query: 231 SDRGDE-LMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKAS 289
+D L +S+CCKH AYD++ W N R++F+A ++EQD ++TF PF C+ +G S
Sbjct: 224 TDLNSRPLKVSSCCKHYAAYDIDSWLNVDRHTFDARVSEQDMKETFVSPFERCVREGDVS 283
Query: 290 CIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHE-DSA 347
+MCS+N++NG+P C L + R+EW GYI SDC + I + QNY + D+
Sbjct: 284 SVMCSFNKINGIPPCSDPRLLKGVIRDEWDLHGYIVSDCYGLEVIVDNQNYLNDSKVDAV 343
Query: 348 AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKY 407
A L+AG+D+ CG ++ GKV + ++DRAL N++ + +R+G F+G P Y
Sbjct: 344 AKTLQAGLDLECGHYYTDALNESVLTGKVSQYELDRALKNIYVLLMRVGYFDGIP---AY 400
Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
LG D+C ++H +LA +AARQGIVLLKND + LPL +A++GP N M G
Sbjct: 401 ESLGLKDICAADHIELAREAARQGIVLLKNDYEVLPLKPG--KKIALVGPHANATEVMIG 458
Query: 468 GYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLD 527
Y G+PC S L A + YA+GC D C++D F EA AK A+ I+ G D
Sbjct: 459 NYAGLPCKYVSPLEAFSA-IGNVTYATGCLDASCSNDTYFSEAKEAAKSAEVTIIFVGTD 517
Query: 528 LTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILW 587
L+ E E DRV LLPG Q L+ VA S PVILV+ G +D++FA+ + +IS+ILW
Sbjct: 518 LSIEAEFVDRVDFLLPGNQTELIKQVAEVSSGPVILVVLSGSNIDITFAKNNPRISAILW 577
Query: 588 IGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK-VPMNDMNMRADSSRQYPGRSYR 646
+G+PGE G A+A+++FG +NPGGRLP+TWY + +PM+ M++R YPGR+Y+
Sbjct: 578 VGFPGEQGGHAIADVVFGKYNPGGRLPVTWYEADYVDMLPMSSMSLRPVDELGYPGRTYK 637
Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASL----KAGSDKNILQQTGSRLDYVH 702
F+ G+ VY FG+G+SYT +SY A S+++I L K + Q V
Sbjct: 638 FFDGSTVYPFGYGMSYTKFSYSL--ATSKISIDIDLNKFQKCRTVAYTEDQKVPSCPAVL 695
Query: 703 IDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG 762
+D+++ ++ F +++VTN G VDGS V+M+++ P GT KQ+IGF +V A
Sbjct: 696 LDDMSCDDTIEF--EVAVTNVGMVDGSEVLMVYSIPPSGIVGTHIKQVIGFQKVFVAAGD 753
Query: 763 SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHS 802
++ + F ++ C+ L I + G +LP G+H + VG+ +S
Sbjct: 754 TERVKFSMNACKSLRIVDSTGYSLLPSGSHTIRVGDYSNS 793
>gi|32488698|emb|CAE03635.1| OSJNBb0003B01.27 [Oryza sativa Japonica Group]
Length = 839
Score = 624 bits (1608), Expect = e-176, Method: Compositional matrix adjust.
Identities = 319/668 (47%), Positives = 424/668 (63%), Gaps = 47/668 (7%)
Query: 147 VAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGEN 206
V+ EARAM+N+G AGLTFW+PNINIFRDPRWGRGQETPGEDP++ S YAV +V Q
Sbjct: 212 VSTEARAMHNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQ--- 268
Query: 207 WKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMI 266
D G G D L ++ACCKH AYD++ W RY+F+A++
Sbjct: 269 ----DAGGG------------------SDALKVAACCKHYTAYDVDNWKGVERYTFDAVV 306
Query: 267 TEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITS 325
++QD +DTFQPPF+SC+ G + +MCSYN+VNG P C DL R +W GYI S
Sbjct: 307 SQQDLDDTFQPPFKSCVIDGNVASVMCSYNKVNGKPTCADKDLLSGVIRGDWKLNGYIVS 366
Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRAL 385
DCD+V ++ Q+YTK ED+AA +K+G+D+NCG + +HT +A+ GK+ E D+DRA+
Sbjct: 367 DCDSVDVLYNNQHYTKNPEDAAAITIKSGLDLNCGNFLAQHTVAAVQAGKLSESDVDRAI 426
Query: 386 LNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
N F V +RLG F+GDPRK +G LGP DVCTS +++LA +AARQGIVLLKN LPL+
Sbjct: 427 TNNFIVLMRLGFFDGDPRKLPFGSLGPKDVCTSSNQELAREAARQGIVLLKNTGA-LPLS 485
Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDA 505
++ S+A+IGP N M G Y G PC + L+GL A V+ T Y GC +V C+ ++
Sbjct: 486 AKSIKSMAVIGPNANASFTMIGNYEGTPCKYTTPLQGLGANVA-TVYQPGCTNVGCSGNS 544
Query: 506 -GFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILV 564
A + A AD ++V G D + E E DR SLLLPGQQ LV++VA S+ PVILV
Sbjct: 545 LQLSAATQAAASADVTVLVVGADQSVERESLDRTSLLLPGQQPQLVSAVANASRGPVILV 604
Query: 565 LTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT- 623
+ GGP D+SFA++ +IS+ILW+GYPGEAG ALA+I+FG NPGGRLP+TWYP SF
Sbjct: 605 VMSGGPFDISFAKSSDKISAILWVGYPGEAGGAALADILFGYHNPGGRLPVTWYPASFAD 664
Query: 624 KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLK 683
KV M DM MR DSS YPGR+YRFYTG VY FG GLSYT +++ +SAP ++ + +
Sbjct: 665 KVSMTDMRMRPDSSTGYPGRTYRFYTGDTVYAFGDGLSYTKFAHSLVSAPEQVAVQLA-- 722
Query: 684 AGSDKNILQQTGSRLDYVHIDEVTS----CTSLRFHVQISVTNAGDVDGSHVVMLFARVP 739
G H V + C SL F V + V NAG + G H V LF+ P
Sbjct: 723 ----------EGHACHTEHCFSVEAAGEHCGSLSFDVHLRVRNAGGMAGGHTVFLFSSPP 772
Query: 740 KVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGEL 799
V P K L+GF++V + ++F VD C+ LS+ ++ G R + LG+H L VG+L
Sbjct: 773 SV-HSAPAKHLLGFEKVSLEPGQAGVVAFKVDVCKDLSVVDELGNRKVALGSHTLHVGDL 831
Query: 800 RHSLTIET 807
+H+L +
Sbjct: 832 KHTLNLRV 839
>gi|125576920|gb|EAZ18142.1| hypothetical protein OsJ_33692 [Oryza sativa Japonica Group]
Length = 618
Score = 620 bits (1600), Expect = e-175, Method: Compositional matrix adjust.
Identities = 302/640 (47%), Positives = 402/640 (62%), Gaps = 36/640 (5%)
Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
W+PN+NIFRDPRWGRGQETPGEDP S Y FVK QG + +
Sbjct: 1 MWSPNVNIFRDPRWGRGQETPGEDPATASKYGAAFVKGLQGSSLTN-------------- 46
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
L SACCKH+ AYD+E+W SRY+FNA +T QD DT+ PPFRSC+
Sbjct: 47 -------------LQTSACCKHITAYDIEEWKGVSRYNFNAKVTPQDLADTYNPPFRSCV 93
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
GKASCIMC+Y +NGVPAC DL K R EW GY SDCDAVA + + +++T+T
Sbjct: 94 VDGKASCIMCAYTLINGVPACASSDLLTKTVRGEWKLDGYTASDCDAVAILHKSEHFTRT 153
Query: 343 HEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDP 402
E++ A LKAG+DINCG M ++ SA+ +GK+ EKD+D+AL NLF++++RLG F+GDP
Sbjct: 154 AEEAVAVALKAGLDINCGVYMQQNAASALQQGKMTEKDVDKALKNLFAIRMRLGHFDGDP 213
Query: 403 RKGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
R K YG+L DVCT HK LAL+AAR+G+VLLKND + LPL V+S A+IG N+
Sbjct: 214 RGNKLYGRLSAADVCTPVHKALALEAARRGVVLLKNDARLLPLRAPTVASAAVIGHNAND 273
Query: 462 ISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVI 521
I + G Y G+PC + G++ YV + GC C+ A +A +AK +D+V
Sbjct: 274 ILALLGNYYGLPCETTTPFGGIQKYVKSAKFLPGCSSAACDV-AATDQATALAKSSDYVF 332
Query: 522 VVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQ 581
+V GL QE E DR SLLLPG+Q +L+T+VA SKRPVIL+L GGP+D++FA+ + +
Sbjct: 333 LVMGLSQKQEQEGLDRTSLLLPGKQQALITAVATASKRPVILILLTGGPVDITFAQTNPK 392
Query: 582 ISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYP 641
I +ILW GYPG+AG +A+A+++FG+FNP G+LP+TWYPE FTK M DM MR D + YP
Sbjct: 393 IGAILWAGYPGQAGGQAIADVLFGEFNPSGKLPVTWYPEEFTKFTMTDMRMRPDPATGYP 452
Query: 642 GRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYV 701
GRSYRFY G VY FG+GLSY+ ++ + +S S+S + + T D V
Sbjct: 453 GRSYRFYKGKTVYKFGYGLSYSKFACRIVSGAGN---SSSYGKAALAGLRAATTPEGDAV 509
Query: 702 H-IDEV--TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHT 758
+ +DE+ C LRF V + V N G +DG H V++F R G P +QLIGF H
Sbjct: 510 YRVDEIGDDRCERLRFPVMVEVQNHGPMDGKHTVLMFVRWSSTDGGRPVRQLIGFRNQHL 569
Query: 759 VAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
K++ + PCE LS A G +++ G+H LMV E
Sbjct: 570 KVGEKKKLKMEISPCEHLSRARVDGEKVIDRGSHFLMVEE 609
>gi|225432136|ref|XP_002274651.1| PREDICTED: probable beta-D-xylosidase 5-like [Vitis vinifera]
Length = 809
Score = 620 bits (1599), Expect = e-175, Method: Compositional matrix adjust.
Identities = 316/752 (42%), Positives = 454/752 (60%), Gaps = 33/752 (4%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
+ +C++S RAK L+ +TL EK+ Q D AS + R+G+P Y WWSE+LHG+++ G
Sbjct: 68 FHYCDSSSPYEVRAKDLVDRMTLSEKVMQTGDQASGVERIGLPKYNWWSEALHGVSNFGR 127
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
V F+ V TSFP V++SAASFN+SLW +G AV+ EARAMYN G AGLTFW+PNIN+
Sbjct: 128 CVFFDEVVPGATSFPTVILSAASFNQSLWKTLGQAVSTEARAMYNSGNAGLTFWSPNINV 187
Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
RDPRWGR ETPGEDP +V YAV +V+ Q + G +
Sbjct: 188 VRDPRWGRILETPGEDPHLVGLYAVNYVRGLQD-------------------VVGAENTT 228
Query: 232 DRGDE-LMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
D L +S+CCKH AYDL+ W R F+A ++ QD +TF PF C+++G S
Sbjct: 229 DLNSRPLKVSSCCKHYAAYDLDNWKGADRVHFDARVSVQDMAETFVLPFEMCVKEGDVSS 288
Query: 291 IMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNY-TKTHEDSAA 348
+MCSYN++NG+P+C L Q R EW GYI SDCD+V + Q + + DSAA
Sbjct: 289 VMCSYNKINGIPSCADSRLLKQTIRGEWDLHGYIVSDCDSVEVMAVDQKWLDSSFSDSAA 348
Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
L AGM+++CGT R A+++GK + D+D +L L+ + +R+G F+G P +
Sbjct: 349 QALNAGMNLDCGTFNNRSLTEAVNQGKANQADLDHSLRYLYVLLMRVGFFDGIP---AFA 405
Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
LG DD+C++EH +LA +AARQGIVLLKND LPL +V ++A++GP N M G
Sbjct: 406 SLGKDDICSAEHIELAREAARQGIVLLKNDNATLPL--KSVKNIALVGPHANATDAMIGN 463
Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDL 528
Y GIPC S L + + + Y GC DV C ++ A+ AK+AD I+ AG DL
Sbjct: 464 YAGIPCYYVSPLDAFSS-MGEVRYEKGCADVQCLNETYIFNAMEAAKRADATIIFAGTDL 522
Query: 529 TQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWI 588
+ E E DRV LLLPG Q L+ VA S PV+LV+ GG +D+SFA + +I++ILW
Sbjct: 523 SIEAEALDRVDLLLPGYQTQLINQVADLSTGPVVLVIMSGGGVDISFARDNPKIAAILWA 582
Query: 589 GYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK-VPMNDMNMRADSSRQYPGRSYRF 647
GYPGE G A+A++I G +NPGGRLP+TWY + +PM M +R S YPGR+Y+F
Sbjct: 583 GYPGEQGGNAIADVILGKYNPGGRLPITWYEADYVDMLPMTSMALRPVDSLGYPGRTYKF 642
Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAP--SELTISASLKAGSDKNILQQTGSRLDYVHIDE 705
+ G+ VY FG+G+SYTN+SY ++ + + + + S I V +D+
Sbjct: 643 FNGSTVYPFGYGMSYTNFSYSLSTSQRWTNINLRKLQRCRSMVYINDTFVPDCPAVLVDD 702
Query: 706 VTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKE 765
++ S+ F +++V N G +DGS VV++++ P GT K+++GF+RV G+++
Sbjct: 703 LSCKESIEF--EVAVKNVGRMDGSEVVVVYSSPPLGIAGTHIKKVVGFERVFVKVGGTEK 760
Query: 766 ISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
+ F ++ C+ L I + G +LP G+H + VG
Sbjct: 761 VKFSMNVCKSLGIVDSTGYALLPSGSHTIKVG 792
>gi|224093292|ref|XP_002309869.1| predicted protein [Populus trichocarpa]
gi|222852772|gb|EEE90319.1| predicted protein [Populus trichocarpa]
Length = 694
Score = 619 bits (1595), Expect = e-174, Method: Compositional matrix adjust.
Identities = 314/746 (42%), Positives = 442/746 (59%), Gaps = 65/746 (8%)
Query: 68 LISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQ 127
L++ +TL EK+ QL + A +PRLG+ Y+WWSE+LHG+++ GPG F+ + TSFP
Sbjct: 3 LVNQMTLNEKVLQLGNKAYGVPRLGLAEYQWWSEALHGVSNVGPGTFFDDLIPGSTSFPT 62
Query: 128 VLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGED 187
V+ +AA+FN SLW IG AV+ EARAMYNLG+AGLT+W+PNIN+ RDPRWGR ETPGED
Sbjct: 63 VITTAAAFNESLWKVIGQAVSTEARAMYNLGRAGLTYWSPNINVVRDPRWGRAIETPGED 122
Query: 188 PMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLI 247
P +V YAV +V+ Q +G + R LK +S+CCKH
Sbjct: 123 PYLVGRYAVNYVRGLQ-----DVEGSENYTDPNSRPLK-------------VSSCCKHYA 164
Query: 248 AYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRG 307
AYD++ W RY+F+A ++EQD +TF PF C++ G S +MCSYN+VNG+P C
Sbjct: 165 AYDVDNWKGVERYTFDARVSEQDMVETFLRPFEMCVKDGDVSSVMCSYNRVNGIPTCADP 224
Query: 308 DLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRH 366
L Q R +W GYI SDCD++ + E + G+D++CG +
Sbjct: 225 KLLNQTIRGDWDLHGYIVSDCDSLQVMVENHKWL-------------GLDLDCGAYYTEN 271
Query: 367 TQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALD 426
++A+ +GKV+E DID++L L+ V +RLG F+G P +Y G +DVC+ E+ +LA +
Sbjct: 272 VEAAVRQGKVREADIDKSLNFLYVVLMRLGFFDGIP---QYNSFGKNDVCSKENIELATE 328
Query: 427 AARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAY 486
AAR+G VLLKN+ LPL+ V +LA+IGP N S M G Y GIPC + + GL Y
Sbjct: 329 AAREGAVLLKNENDSLPLSIEKVKTLAVIGPHSNATSAMIGNYAGIPCQIITPIEGLSKY 388
Query: 487 VSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQ 546
+K Y GC D+ C ++ A+ AKKAD I++AG+DL+ E E DR LLLPG Q
Sbjct: 389 -AKVDYQMGCSDIACKDESFIFPAMESAKKADATIILAGIDLSIEAESLDRDDLLLPGYQ 447
Query: 547 MSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD 606
L+ VA S PV+LVL G +D+SFA+++ I SILW+GYPGE G A+A++IFG
Sbjct: 448 TQLINQVASVSNGPVVLVLMSAGGVDISFAKSNGDIKSILWVGYPGEEGGNAIADVIFGK 507
Query: 607 FNPGGRLPMTWYPESFTK-VPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNY 665
+NPGGRLP+TW+ + +PM M +R S YPGR+Y+F+ G+ VY FGHGLSYT +
Sbjct: 508 YNPGGRLPLTWHEADYVDMLPMTSMPLRPIDSLGYPGRTYKFFNGSTVYPFGHGLSYTQF 567
Query: 666 SYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSL-----RFHVQISV 720
+YK S L I +D+ C L F V
Sbjct: 568 TYKLTSTIRSLDI-----------------------KLDKYQYCHDLGYKNDSFKPSFEV 604
Query: 721 TNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIAN 780
NAG DGS VV+++A+ P+ T KQ+IGF RV A GS+++ F + + L + +
Sbjct: 605 LNAGAKDGSEVVIVYAKPPEGIDATYIKQVIGFKRVFVPAGGSEKVKFEFNASKSLQVVD 664
Query: 781 KHGRRILPLGNHVLMVGELRHSLTIE 806
+ +LP G H +M+G+ S +++
Sbjct: 665 FNAYSVLPSGGHTIMLGDDIISFSVQ 690
>gi|225432134|ref|XP_002274619.1| PREDICTED: probable beta-D-xylosidase 5-like [Vitis vinifera]
Length = 805
Score = 614 bits (1584), Expect = e-173, Method: Compositional matrix adjust.
Identities = 320/766 (41%), Positives = 454/766 (59%), Gaps = 45/766 (5%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
+ +C++SL R K L+ +TL+EK + + D AS +PR+G+P Y+WWSE+LHG+A+ G
Sbjct: 65 FVYCDSSLPYDVRVKDLVDRITLEEKARNVIDVASGVPRIGLPPYKWWSEALHGVANVGS 124
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
F+ V TSFP V++SAASFN+SLW +G V+ EARAMYNLG AGLTFW+PNIN+
Sbjct: 125 ATFFDEVVPGATSFPNVILSAASFNQSLWKTLGQVVSTEARAMYNLGHAGLTFWSPNINV 184
Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
RDPRWGR ETPGEDP+ V Y V +V+ Q ++G +
Sbjct: 185 ARDPRWGRILETPGEDPLTVGVYGVNYVRGLQD-------------------IEGTENTT 225
Query: 232 DRGDE-LMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
D L +++ CKH AYDL++W N R F+A ++EQD +TF PF C+++G S
Sbjct: 226 DLNSRPLKIASSCKHFAAYDLDQWFNVDRRHFDAKVSEQDMTETFLRPFEMCVKEGDTSS 285
Query: 291 IMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTK-THEDSAA 348
+MCS+N +NG+P C + R +W GYI SDC A+ TI + Q + T E+ A
Sbjct: 286 VMCSFNNINGIPPCADPRFLKGVIREQWNLHGYIVSDCWAIDTIVQDQKFLDVTSEEGVA 345
Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
+KAG+D+ CG +A+ +G+V E D+D++L L+ V +R+G F+G P
Sbjct: 346 LSMKAGLDLECGHYYNDSLATAVREGRVSEHDVDKSLSYLYVVLMRVGFFDGIP---SLA 402
Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
LG D+C EH +LA +AARQGIVLLKND LPL V LA++GP N M G
Sbjct: 403 SLGKKDICNDEHIELAREAARQGIVLLKNDNATLPLKP--VKKLALVGPHANATVAMIGN 460
Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDL 528
Y GIPC S L + Y GC DV C++D ++A AK AD I++ G DL
Sbjct: 461 YAGIPCHYVSPLDAFSE-LGDVTYEVGCADVKCHNDTHVYKAAEAAKNADATIILVGTDL 519
Query: 529 TQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWI 588
+ E E+RDR LLLPG Q +V V S PVILV+ GGP+D+SFA+ + +I++ILW
Sbjct: 520 SIEAEERDREDLLLPGYQTEMVNQVTDLSTGPVILVVMCGGPIDISFAKNNPKIAAILWA 579
Query: 589 GYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRF 647
G+PGE G A+A+I+FG +NPGGR P+TWY + +PM M +R S YPGR+Y+F
Sbjct: 580 GFPGEQGGNAIADIVFGKYNPGGRSPITWYENGYVGMLPMTSMALRPIESLGYPGRTYKF 639
Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAP--------SELTISASLKAGSDKNILQQTGSRLD 699
+ G+ VY FG+GLSYTN+SY L+AP + L S+ SD + + +D
Sbjct: 640 FNGSTVYPFGYGLSYTNFSYS-LTAPTRSVHISLTRLQQCRSMAYSSDSFQPECSAVLVD 698
Query: 700 YVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTV 759
+ DE F Q++V N G +DGS VVM+++ P GT KQ+IGF+RV
Sbjct: 699 DLSCDE-------SFEFQVAVKNVGSMDGSEVVMVYSSPPSGIVGTHIKQVIGFERVFVK 751
Query: 760 AKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
++++ F ++ C+ L + + G +LP G+H +M G+ S++
Sbjct: 752 VGNTEKVKFSMNVCKSLGLVDSSGYILLPSGSHTIMAGDNSTSVSF 797
>gi|297736787|emb|CBI25988.3| unnamed protein product [Vitis vinifera]
Length = 774
Score = 610 bits (1573), Expect = e-171, Method: Compositional matrix adjust.
Identities = 319/757 (42%), Positives = 448/757 (59%), Gaps = 60/757 (7%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
+ +C++SL R K L+ +TL+EK + + D AS +PR+G+P Y+WWSE+LHG+A+ G
Sbjct: 65 FVYCDSSLPYDVRVKDLVDRITLEEKARNVIDVASGVPRIGLPPYKWWSEALHGVANVGS 124
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
F+ V TSFP V++SAASFN+SLW +G V+ EARAMYNLG AGLTFW+PNIN+
Sbjct: 125 ATFFDEVVPGATSFPNVILSAASFNQSLWKTLGQVVSTEARAMYNLGHAGLTFWSPNINV 184
Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
RDPRWGR ETPGEDP+ V Y V +V+ Q ++G +
Sbjct: 185 ARDPRWGRILETPGEDPLTVGVYGVNYVRGLQD-------------------IEGTENTT 225
Query: 232 DRGDE-LMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
D L +++ CKH AYDL++W N R F+A ++EQD +TF PF C+++G S
Sbjct: 226 DLNSRPLKIASSCKHFAAYDLDQWFNVDRRHFDAKVSEQDMTETFLRPFEMCVKEGDTSS 285
Query: 291 IMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTK-THEDSAA 348
+MCS+N +NG+P C + R +W GYI SDC A+ TI + Q + T E+ A
Sbjct: 286 VMCSFNNINGIPPCADPRFLKGVIREQWNLHGYIVSDCWAIDTIVQDQKFLDVTSEEGVA 345
Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
+KAG+D+ CG +A+ +G+V E D+D++L L+ V +R+G F+G P
Sbjct: 346 LSMKAGLDLECGHYYNDSLATAVREGRVSEHDVDKSLSYLYVVLMRVGFFDGIP---SLA 402
Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
LG D+C EH +LA +AARQGIVLLKND LPL V LA++GP N M G
Sbjct: 403 SLGKKDICNDEHIELAREAARQGIVLLKNDNATLPLKP--VKKLALVGPHANATVAMIGN 460
Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDL 528
Y GIPC S L + Y GC DV C++D ++A AK AD I++ G DL
Sbjct: 461 YAGIPCHYVSPLDAFSE-LGDVTYEVGCADVKCHNDTHVYKAAEAAKNADATIILVGTDL 519
Query: 529 TQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWI 588
+ E E+RDR LLLPG Q +V V S PVILV+ GGP+D+SFA+ + +I++ILW
Sbjct: 520 SIEAEERDREDLLLPGYQTEMVNQVTDLSTGPVILVVMCGGPIDISFAKNNPKIAAILWA 579
Query: 589 GYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRF 647
G+PGE G A+A+I+FG +NPGGR P+TWY + +PM M +R S YPGR+Y+F
Sbjct: 580 GFPGEQGGNAIADIVFGKYNPGGRSPITWYENGYVGMLPMTSMALRPIESLGYPGRTYKF 639
Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVT 707
+ G+ VY FG+GLSYTN+SY L+AP+ VHI
Sbjct: 640 FNGSTVYPFGYGLSYTNFSYS-LTAPTR------------------------SVHI---- 670
Query: 708 SCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEIS 767
S TS F Q++V N G +DGS VVM+++ P GT KQ+IGF+RV ++++
Sbjct: 671 SLTSFEF--QVAVKNVGSMDGSEVVMVYSSPPSGIVGTHIKQVIGFERVFVKVGNTEKVK 728
Query: 768 FGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLT 804
F ++ C+ L + + G +LP G+H +M G+ S++
Sbjct: 729 FSMNVCKSLGLVDSSGYILLPSGSHTIMAGDNSTSVS 765
>gi|77552476|gb|ABA95273.1| Beta-D-xylosidase, putative, expressed [Oryza sativa Japonica
Group]
Length = 883
Score = 609 bits (1570), Expect = e-171, Method: Compositional matrix adjust.
Identities = 319/679 (46%), Positives = 434/679 (63%), Gaps = 49/679 (7%)
Query: 146 AVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGE 205
AV+ E RAMYN GQAGLTFW+PN+NIFRDPRWGRGQETPGEDP V + YA +V+ Q
Sbjct: 228 AVSDEGRAMYNGGQAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVAARYAAAYVRGLQ-- 285
Query: 206 NWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM 265
++ L L+ACCKH AYDL+ W R+ FNA+
Sbjct: 286 -----------------------QQQPSSGRLKLAACCKHFTAYDLDNWSGTDRFHFNAV 322
Query: 266 ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC-----LRGDLFQKARNEWGFK 320
+T QD EDTF PFRSC+ G+A+ +MCSYNQVNGVP C LRG + R WG
Sbjct: 323 VTRQDLEDTFNVPFRSCVVDGRAASVMCSYNQVNGVPTCADAAFLRGTI----RRRWGLA 378
Query: 321 GYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKD 380
GYI SDCD+V + Q+YT+T ED+ A L+AG+D++CG + ++T+ A+ +GKV + D
Sbjct: 379 GYIVSDCDSVDVFYSDQHYTRTREDAVAATLRAGLDLDCGPFLAQYTEGAVAQGKVGDGD 438
Query: 381 IDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKK 440
ID A+ N +VQ+RLG+F+GDP +G LGP VCT+ H++LA++AARQGIVLLKND +
Sbjct: 439 IDAAVTNTVTVQMRLGMFDGDPAAQPFGHLGPQHVCTAAHQELAVEAARQGIVLLKNDGR 498
Query: 441 FLPLN-KNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDV 499
LPL+ A ++A++GP M G Y G PC + L+G+ Y ++ + GC DV
Sbjct: 499 ALPLSPATARRAVAVVGPHAEATVAMIGNYAGKPCRYTTPLQGVARYAARAAHQPGCTDV 558
Query: 500 PC-NSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSK 558
C S AV A++AD IVVAGLD E E DR SLLLPG+Q L++SVA+ SK
Sbjct: 559 ACAGSGQPIAAAVDAARRADATIVVAGLDQKIEAEGLDRASLLLPGRQAELISSVAKASK 618
Query: 559 RPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWY 618
PVILVL GGP+D+ FA+ D +I+ ILW GYPG+AG +A+A++IFG NPGG+LP+TWY
Sbjct: 619 GPVILVLMSGGPIDIGFAQNDPKIAGILWAGYPGQAGGQAIADVIFGHHNPGGKLPVTWY 678
Query: 619 PESF-TKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELT 677
P+ + KVPM +M MRA+ ++ YPGR+YRFYTG ++ FGHGLSYT++++ APS+LT
Sbjct: 679 PQDYLQKVPMTNMAMRANPAKGYPGRTYRFYTGPTIHPFGHGLSYTSFTHSIAHAPSQLT 738
Query: 678 I--SASLKAGSDKNILQQTGSRLD--YVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVM 733
+ SA A S L T +RL C LR V + V N G+ DG+H V+
Sbjct: 739 VRLSAHHAAASASASLNAT-ARLSRAAAVRVAHARCEELRMPVHVDVRNVGERDGAHTVL 797
Query: 734 LFARVPKVSQGTPE-------KQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRI 786
++A P S +QL+ F++VH A G+ + G+D C+ LS+A+++G R
Sbjct: 798 VYAAAPASSAAEAAAGHGAPVRQLVAFEKVHVGAGGTARVEMGIDVCDGLSVADRNGVRR 857
Query: 787 LPLGNHVLMVGELRHSLTI 805
+P+G H L++GEL H++TI
Sbjct: 858 IPVGEHRLIIGELTHTVTI 876
Score = 113 bits (283), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 48/94 (51%), Positives = 65/94 (69%)
Query: 51 SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
+ PFC SL RA+ L++ LT EK++ L +NA+ +PRLG+ YEWWSE+LHG++ G
Sbjct: 40 TLPFCRRSLPARARARDLVARLTRAEKVRLLVNNAAGVPRLGVAGYEWWSEALHGVSDTG 99
Query: 111 PGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIG 144
PGV F G T+FPQV+ +AASFN +LW IG
Sbjct: 100 PGVRFGGAFPGATAFPQVIGTAASFNATLWELIG 133
>gi|125535275|gb|EAY81823.1| hypothetical protein OsI_36995 [Oryza sativa Indica Group]
Length = 885
Score = 608 bits (1568), Expect = e-171, Method: Compositional matrix adjust.
Identities = 316/681 (46%), Positives = 433/681 (63%), Gaps = 51/681 (7%)
Query: 146 AVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGE 205
AV+ E RAMYN GQAGLTFW+PN+NIFRDPRWGRGQETPGEDP V + YA +V+ Q
Sbjct: 228 AVSDEGRAMYNGGQAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVAARYAAAYVRGLQ-- 285
Query: 206 NWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM 265
++ L L+ACCKH AYDL+ W R+ FNA+
Sbjct: 286 -----------------------QQQPSSGRLKLAACCKHFTAYDLDNWSGTDRFHFNAV 322
Query: 266 ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC-----LRGDLFQKARNEWGFK 320
+T QD EDTF PFRSC+ G+A+ +MCSYNQVNGVP C LRG + R WG
Sbjct: 323 VTRQDLEDTFNVPFRSCVVDGRAASVMCSYNQVNGVPTCADAAFLRGTI----RRRWGLA 378
Query: 321 GYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKD 380
GYI SDCD+V + Q+YT+T ED+ A L+AG+D++CG + ++T+ A+ +GKV + D
Sbjct: 379 GYIVSDCDSVDVFYSDQHYTRTREDAVAATLRAGLDLDCGPFLAQYTEGAVAQGKVGDGD 438
Query: 381 IDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKK 440
ID A+ N +VQ+RLG+F+GDP +G LGP VCT+ H++LA++AARQGIVLLKND +
Sbjct: 439 IDAAVTNTVTVQMRLGMFDGDPAAQPFGHLGPQHVCTAAHQELAVEAARQGIVLLKNDGR 498
Query: 441 FLPLN-KNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDV 499
LPL+ A ++A++GP M G Y G PC + L+G+ Y ++ + GC DV
Sbjct: 499 ALPLSPATARRAVAVVGPHAEATVAMIGNYAGKPCRYTTPLQGVARYAARAAHQPGCTDV 558
Query: 500 PC-NSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSK 558
C S AV A++AD IVVAGLD E E DR SLLLPG+Q L++SVA+ SK
Sbjct: 559 ACAGSGQPIAAAVDAARRADATIVVAGLDQKIEAEGLDRASLLLPGRQAELISSVAKASK 618
Query: 559 RPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWY 618
PVILVL GGP+D+ FA+ D +I+ ILW GYPG+AG +A+A++IFG NPGG+LP+TWY
Sbjct: 619 GPVILVLMSGGPIDIGFAQNDPKIAGILWAGYPGQAGGQAIADVIFGHHNPGGKLPVTWY 678
Query: 619 PESF-TKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELT 677
P+ + KVPM +M MRA+ ++ YPGR+YRFYTG ++ FGHGLSYT++++ APS+LT
Sbjct: 679 PQDYLQKVPMTNMAMRANPAKGYPGRTYRFYTGPTIHPFGHGLSYTSFTHSIAHAPSQLT 738
Query: 678 I----SASLKAGSDKNILQQTGSRLD--YVHIDEVTSCTSLRFHVQISVTNAGDVDGSHV 731
+ + + S L T +RL C LR V + V N G+ DG+H
Sbjct: 739 VRLAAHHAAASASASASLNAT-ARLSRAAAVRVAHARCEELRMPVHVDVRNVGERDGAHT 797
Query: 732 VMLFARVPKVSQGTPE-------KQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGR 784
V+++A P S +QL+ F++VH A G+ + G+D C+ LS+A+++G
Sbjct: 798 VLVYAAAPASSAAEAAAGHGAPVRQLVAFEKVHVGAGGTARVEMGIDVCDGLSVADRNGV 857
Query: 785 RILPLGNHVLMVGELRHSLTI 805
R +P+G H L++GEL H++TI
Sbjct: 858 RRIPVGEHRLIIGELTHTVTI 878
Score = 112 bits (281), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 47/94 (50%), Positives = 65/94 (69%)
Query: 51 SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
+ PFC SL RA+ L++ +T EK++ L +NA+ +PRLG+ YEWWSE+LHG++ G
Sbjct: 40 TLPFCRRSLPARARARDLVARMTRAEKVRLLVNNAAGVPRLGVAGYEWWSEALHGVSDTG 99
Query: 111 PGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIG 144
PGV F G T+FPQV+ +AASFN +LW IG
Sbjct: 100 PGVRFGGAFPGATAFPQVIGTAASFNATLWELIG 133
>gi|318136853|gb|ADV41671.1| alpha-L-arabinofuranosidase/beta-D-xylosidase [Actinidia deliciosa
var. deliciosa]
Length = 634
Score = 599 bits (1544), Expect = e-168, Method: Compositional matrix adjust.
Identities = 295/662 (44%), Positives = 415/662 (62%), Gaps = 40/662 (6%)
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
EARAMYN G AGLTFW+PN+NIFRDPRWGRGQETPGEDPM+ YA +V+ QG +
Sbjct: 2 EARAMYNGGMAGLTFWSPNVNIFRDPRWGRGQETPGEDPMLAGNYAASYVRGLQGND--- 58
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
G+ L ++ACCKH AYDL+ W R+ FNA +++Q
Sbjct: 59 ------------------------GERLKVAACCKHYTAYDLDNWRGVDRFHFNARVSKQ 94
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCD 328
D +DTF+ PFR C+ GK + +MCSYNQVNG+P C L + R W GYI SDCD
Sbjct: 95 DIKDTFEIPFRECVLGGKVASVMCSYNQVNGIPTCANPKLLKGTIRGSWRLNGYIVSDCD 154
Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
+V FE Q+YT E++ A +KAG+D++CG + HT++A+ +G V + +I+ AL N
Sbjct: 155 SVGVFFENQHYTSKPEEAVAAAIKAGLDLDCGPFLAIHTEAAVRRGLVSQLEINWALANT 214
Query: 389 FSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNA 448
+ Q+RLG+F+G+P +YG LGP DVCT H++LAL+AARQGIVLL+N + LPL+
Sbjct: 215 MTAQMRLGMFDGEPSAHQYGNLGPRDVCTPAHQQLALEAARQGIVLLENRGRSLPLSIRR 274
Query: 449 VSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFH 508
++A+IGP + M G Y G+ C + L+G+ Y H A GC DV CN + F
Sbjct: 275 HRTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGIGRYTRTIHQA-GCTDVHCNGNQLFG 333
Query: 509 EAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGG 568
A A++AD ++V GLD + E E DR LLPG Q LV+ VAR S+ P ILVL G
Sbjct: 334 AAEAAARQADATVLVMGLDQSIEAEFVDRAGPLLPGHQQELVSRVARASRGPTILVLMSG 393
Query: 569 GPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPM 627
GP+DV+FA+ D +IS+I+W+GYPG+AG A+A+++FG NPGG+LPMTWYP+++ T +PM
Sbjct: 394 GPIDVTFAKNDPRISAIIWVGYPGQAGGTAIADVLFGTTNPGGKLPMTWYPQNYVTHLPM 453
Query: 628 NDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS-ASLKAGS 686
DM MRAD +R YPGR+YRFY G V+ FG GLSYT +++ P+ +++ SLKA +
Sbjct: 454 TDMAMRADPARGYPGRTYRFYRGPVVFPFGLGLSYTTFAHNLAHGPTLVSVPLTSLKATA 513
Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLR-FHVQISVTNAGDVDGSHVVMLFARVPKVSQGT 745
+ +L + + H D C +L V + V N G +DG+H +++F P +
Sbjct: 514 NSTMLSKA---VRVSHAD----CNALSPLDVHVDVKNTGSMDGTHTLLVFTSPPD-GKWA 565
Query: 746 PEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
KQL+GF ++H A + V C+ LS+ ++ G R +PLG H L +G+L H +++
Sbjct: 566 ASKQLVGFHKIHIAAGSETRVRIAVHVCKHLSVVDRFGIRRIPLGEHKLQIGDLSHHVSL 625
Query: 806 ET 807
+T
Sbjct: 626 QT 627
>gi|359477633|ref|XP_003632006.1| PREDICTED: LOW QUALITY PROTEIN: beta-D-xylosidase 3-like [Vitis
vinifera]
Length = 781
Score = 593 bits (1529), Expect = e-166, Method: Compositional matrix adjust.
Identities = 331/781 (42%), Positives = 464/781 (59%), Gaps = 61/781 (7%)
Query: 44 CKPPHFDS-------YPFCNTSLSI-STRAKSLISLLTLQEKIQQLSDNASAIPRLGIPA 95
C P F + + +CN+SL I R K L+ +TL+EK + A+ + R+G+P
Sbjct: 17 CDPARFAALGFDMKDFVYCNSSLPIYDVRVKDLVDRMTLEEKATNVIYKAAGVERIGLPP 76
Query: 96 YEWWSESLHGIAS---NGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEAR 152
Y+WWSE+LHG++S NGP F+ TV TSFP V++SAASFN+SLW I V+ EAR
Sbjct: 77 YQWWSEALHGVSSVSINGPTF-FDETVPGATSFPNVILSAASFNQSLWKTIRQVVSKEAR 135
Query: 153 AMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDG 212
A YNLG AGLTFW PN+N+ RDPRWGR QET GEDP VS YAV +V+ Q
Sbjct: 136 ATYNLGHAGLTFWCPNVNVARDPRWGRTQETXGEDPFTVSVYAVSYVRGLQD-------- 187
Query: 213 GIGFGFREKRVLKGFGEESDRGDE-LMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDT 271
++G +D L +S+ KH AYDL+ W N R FNA ++EQD
Sbjct: 188 -----------VEGTENTTDLNSRPLKVSSSGKHFAAYDLDNWLNVDRNHFNARVSEQDM 236
Query: 272 EDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAV 330
+TF PF +C+ +G S +MCS+N +NG+P C LF+ R+EW GYI SDC ++
Sbjct: 237 AETFLRPFEACVREGDVSGVMCSFNNINGIPPCADPRLFKGTIRDEWNLHGYIVSDCWSI 296
Query: 331 ATIFEYQNYTK-THEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLF 389
TI E Q + T E++ A LKAG+D+ CG SA+ G+V + D+D++L NL+
Sbjct: 297 ETIVEDQKFLDVTGEEAVALNLKAGLDLECGHYYNDSPASAVMAGRVGQHDLDQSLSNLY 356
Query: 390 SVQLRLGLFNGDPRKGKYGKLGPDDVCTS-EHKKLALDAARQGIVLLKNDKKFLPLNKNA 448
V +RLG F+G P LG DD+C S EH +LA +AARQGIVLLKND LPL +
Sbjct: 357 VVLMRLGFFDGIP---ALASLGKDDICLSAEHIELAREAARQGIVLLKNDNATLPL--KS 411
Query: 449 VSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFH 508
V +LA++GP + M G Y G PC S A + Y GC DV C++D +
Sbjct: 412 VKNLALVGPNADAYGAMMGNYAGPPCRSVSPRDAFSA-IGNVTYEMGCGDVLCHNDTYVY 470
Query: 509 EAVRIAKKADFVIVVAGL-DLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTG 567
+AV AK AD I+V G+ D++ TED+DRV LLLPG Q LV +A+ + P+ILV+ G
Sbjct: 471 KAVEAAKHADTTIIVVGITDVSIGTEDKDRVDLLLPGYQTHLVNQIAKATTAPIILVVCG 530
Query: 568 --GGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-K 624
GGP+D+SFA + I ILW G+PGE G A+A++++G +NPGGRLP+TWY +
Sbjct: 531 HCGGPIDISFARDNPGIEPILWAGFPGEEGGNAIADVVYGKYNPGGRLPVTWYENGYVGM 590
Query: 625 VPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLK- 683
+PM M +R+ S YPGR Y+F++G+ VY FG GLSYTN+SY L+AP+ +I LK
Sbjct: 591 LPMTSMALRSVESLGYPGRKYKFFSGSTVYPFGCGLSYTNFSYS-LTAPTR-SIHTHLKK 648
Query: 684 -------AGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
A S +++ Q + L +D++ SC F +++V G +DGS VV++++
Sbjct: 649 LQPCRSMAYSICSVIPQCPAVL----VDDL-SCNE-TFEFEVAVKTVGSMDGSEVVIVYS 702
Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
P GT KQ+IGF+RV +++ F ++ C+ L I + G +LP G+ ++
Sbjct: 703 SPPSGIVGTHIKQVIGFERVFVKVGXVEKVKFSMNVCKSLGIVHSSGHTLLPSGSDIIKA 762
Query: 797 G 797
G
Sbjct: 763 G 763
>gi|37359708|dbj|BAC98299.1| LEXYL2 [Solanum lycopersicum]
Length = 633
Score = 592 bits (1526), Expect = e-166, Method: Compositional matrix adjust.
Identities = 310/668 (46%), Positives = 422/668 (63%), Gaps = 41/668 (6%)
Query: 143 IGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSF 202
IG V+ E RAMYN+GQAGLT+W+PN+NI+RDPRWGRGQET GEDP + S Y V +VK
Sbjct: 2 IGKVVSTEGRAMYNVGQAGLTYWSPNVNIYRDPRWGRGQETAGEDPTLSSRYGVAYVKGL 61
Query: 203 QGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSF 262
Q + DDG + D L +++CCKH AYD++ W RY+F
Sbjct: 62 Q----QRDDG--------------------KKDMLKVASCCKHYTAYDVDDWKGIQRYNF 97
Query: 263 NAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKG 321
NA +T+QD +DTF PPF+SC+ G + +MCSYNQV+G P C DL R +W G
Sbjct: 98 NAKVTQQDLDDTFNPPFKSCVLDGNVASVMCSYNQVDGKPTCGDYDLLAGVIRGQWKLNG 157
Query: 322 YITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDI 381
YI +DCD++ ++ Q+YTKT E++AA L AG+ +NCG+ + ++TQ A+++G V E I
Sbjct: 158 YIVTDCDSLNEMYWAQHYTKTPEETAALSLNAGLGLNCGSWLGKYTQGAVNQGLVNESVI 217
Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF 441
DRA+ N F+ +RLG F+G+P+ YG LGP D+CT +H++LA +AARQGIVLLKN
Sbjct: 218 DRAVTNNFATLMRLGFFDGNPKNQLYGNLGPKDICTEDHQELAREAARQGIVLLKNTAGS 277
Query: 442 LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPC 501
LPL+ ++ SLA+IGP N M G Y G PC + L GL A VS T Y GC D+ C
Sbjct: 278 LPLSPKSIKSLAVIGPNANLAYTMVGSYEGSPCKYTTPLDGLGASVS-TVYQQGC-DIAC 335
Query: 502 NSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPV 561
+ A A ++A AD V++V G D T E E +DR ++ LPGQQ LVT VA SK PV
Sbjct: 336 -ATAQVDNAKKVAAAADAVVLVMGSDQTIERESKDRFNITLPGQQSLLVTEVASVSKGPV 394
Query: 562 ILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPES 621
ILV+ GG +DV FA + +++SILW+G+PGEAG ALA+++FG NPGGRLPMTWYP+S
Sbjct: 395 ILVIMSGGGMDVKFAVDNPKVTSILWVGFPGEAGGAALADVVFGYHNPGGRLPMTWYPQS 454
Query: 622 FT-KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISA 680
+ KV M +MNMRAD +PGRSYRFY G V+ FG GLSYT Y + + AP ++I
Sbjct: 455 YVDKVDMTNMNMRADPKTGFPGRSYRFYKGPTVFNFGDGLSYTQYKHHLVKAPKFVSI-- 512
Query: 681 SLKAGSDKNILQQTGSRLDYVHIDEVTS--CTSLRFHVQISVTNAGDVDGSHVVMLFARV 738
L+ G + + T + ID V C +L + + V N G + GSH V+LF
Sbjct: 513 PLEEG---HACRSTKCK----SIDAVNEQGCNNLGLDIHLKVQNVGKMRGSHTVLLFTSP 565
Query: 739 PKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
P V P+K L+ F ++H + + F +D C+ LS+ ++ G R + LG HVL +G+
Sbjct: 566 PSV-HNAPQKHLLDFQKIHLTPQSEGVVKFNLDVCKHLSVVDEVGNRKVALGLHVLHIGD 624
Query: 799 LRHSLTIE 806
L+HSLT+
Sbjct: 625 LKHSLTLR 632
>gi|297736786|emb|CBI25987.3| unnamed protein product [Vitis vinifera]
Length = 745
Score = 592 bits (1526), Expect = e-166, Method: Compositional matrix adjust.
Identities = 302/755 (40%), Positives = 438/755 (58%), Gaps = 87/755 (11%)
Query: 51 SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
S+ FC+ SLS RAK L+S +TLQEK+ Q AS + RLG+P Y WWSE+LHGI++ G
Sbjct: 63 SFAFCDKSLSYKERAKDLVSRMTLQEKVMQSVHTASGVRRLGLPEYSWWSEALHGISNLG 122
Query: 111 PGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNIN 170
PGV F+ T+ TS P V++S A+FN++LW +G V+ E RAMYNLG AGLTFW+PNIN
Sbjct: 123 PGVFFDETIPGATSLPTVILSTAAFNQTLWKTLGRVVSTEGRAMYNLGHAGLTFWSPNIN 182
Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
+ RD RWGR QET GEDP +V +AV +V+ Q G E
Sbjct: 183 VVRDTRWGRTQETSGEDPFIVGEFAVNYVRGLQDVE---------------------GTE 221
Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
+ +S+CCKH AYD++ W N R++F+A ++EQD ++TF PF C+ +G S
Sbjct: 222 N-------VSSCCKHYAAYDIDSWLNVDRHTFDARVSEQDMKETFVSPFERCVREGDVSS 274
Query: 291 IMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNY-TKTHEDSAA 348
+MCS+N++NG+P C L + R+EW GYI SDC + I + QNY + D+ A
Sbjct: 275 VMCSFNKINGIPPCSDPRLLKGVIRDEWDLHGYIVSDCYGLEVIVDNQNYLNDSKVDAVA 334
Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
L+AG+D+ CG ++ GKV + ++DRAL N++ + +R+G F+G P Y
Sbjct: 335 KTLQAGLDLECGHYYTDALNESVLTGKVSQYELDRALKNIYVLLMRVGYFDGIP---AYE 391
Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
LG D+C ++H +LA +AARQGIVLLKND + LPL +A++GP N M G
Sbjct: 392 SLGLKDICAADHIELAREAARQGIVLLKNDYEVLPLKPG--KKIALVGPHANATEVMIGN 449
Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDL 528
Y G+PC S L A + YA+G F I+ G DL
Sbjct: 450 YAGLPCKYVSPLEAFSA-IGNVTYATG-----------------------FTIIFVGTDL 485
Query: 529 TQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWI 588
+ E E DRV LLPG Q L+ VA S PVILV+ G +D++FA+ + +IS+ILW+
Sbjct: 486 SIEAEFVDRVDFLLPGNQTELIKQVAEVSSGPVILVVLSGSNIDITFAKNNPRISAILWV 545
Query: 589 GYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK-VPMNDMNMRADSSRQYPGRSYRF 647
G+PGE G A+A+++FG +NPGGRLP+TWY + +PM+ M++R YPGR+Y+F
Sbjct: 546 GFPGEQGGHAIADVVFGKYNPGGRLPVTWYEADYVDMLPMSSMSLRPVDELGYPGRTYKF 605
Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVT 707
+ G+ VY FG+G+SYT +SY A S+++I + +++
Sbjct: 606 FDGSTVYPFGYGMSYTKFSYSL--ATSKISID---------------------IDLNKFQ 642
Query: 708 SCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEIS 767
C + +++VTN G VDGS V+M+++ P GT KQ+IGF +V A ++ +
Sbjct: 643 KCRTF----EVAVTNVGMVDGSEVLMVYSIPPSGIVGTHIKQVIGFQKVFVAAGDTERVK 698
Query: 768 FGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHS 802
F ++ C+ L I + G +LP G+H + VG+ +S
Sbjct: 699 FSMNACKSLRIVDSTGYSLLPSGSHTIRVGDYSNS 733
>gi|242052713|ref|XP_002455502.1| hypothetical protein SORBIDRAFT_03g012290 [Sorghum bicolor]
gi|241927477|gb|EES00622.1| hypothetical protein SORBIDRAFT_03g012290 [Sorghum bicolor]
Length = 825
Score = 590 bits (1520), Expect = e-165, Method: Compositional matrix adjust.
Identities = 313/790 (39%), Positives = 453/790 (57%), Gaps = 65/790 (8%)
Query: 44 CKPPHFDS-------YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAY 96
C P F + + +C+ SL + R + L+ L+L+EK++ L D A PR+G+P Y
Sbjct: 51 CDPVRFAALGLDMSRFRYCDASLPYAERVRDLVGRLSLEEKVRNLGDQAEGAPRVGLPPY 110
Query: 97 EWWSESLHGIASNGPGVN-FNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY 155
+WW E+LHG++ GPG F V TSFP V+ SAA+FN SLW IG V+ E RAMY
Sbjct: 111 KWWGEALHGVSDVGPGGTWFGDVVPGATSFPLVINSAAAFNESLWRAIGGVVSTEIRAMY 170
Query: 156 NLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
NLG A LT+W+PNIN+ RDPRWGR ETPGEDP VV YAV FV+ Q +
Sbjct: 171 NLGHAELTYWSPNINVVRDPRWGRASETPGEDPFVVGRYAVNFVRGMQD---------VV 221
Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTF 275
F + +S+CCKH AYD++ W R +F+A + E+D +TF
Sbjct: 222 IAAGAAATADPFSRP------IKVSSCCKHFAAYDVDAWFKADRLTFDAQVEERDMVETF 275
Query: 276 QPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIF 334
+ PF CI G ASC+MCSYN++NG+PAC L + R++W GYI SDCD+V +
Sbjct: 276 ERPFEMCIRDGDASCVMCSYNRINGIPACADARLLSETVRSQWQLHGYIVSDCDSVRVMV 335
Query: 335 ---EYQNYTKTHEDSAAGVLKAGMDINCGTC-------MLRHTQSAIDKGKVQEKDIDRA 384
++ NYT +AA +KAG+D++CG + A+ +GK++E D+D A
Sbjct: 336 RDAKWLNYTGVEATAAA--MKAGLDLDCGMFWEGARDFFTTYGVDAVRQGKIKEADVDNA 393
Query: 385 LLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPL 444
L N+++ +RLG F+G P ++ LG DDVCT +HK+LA DAARQG+VLLKND + LPL
Sbjct: 394 LGNVYTTLMRLGFFDGMP---EFESLGADDVCTRDHKELAADAARQGMVLLKNDARRLPL 450
Query: 445 NKNAVSSLAIIGPL--VNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCN 502
+ + ++S++++G L +N M G Y G PC + + V+ T Y C C+
Sbjct: 451 DPSKINSVSLVGLLEHINATDVMLGDYRGKPCRIVTPYDAIRQVVNAT-YVHACDSGACS 509
Query: 503 SDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVI 562
+ G A R AK AD IV+AGL+++ E E DR LLLP Q S + +VA S P++
Sbjct: 510 TAEGMGRASRTAKIADATIVIAGLNMSVERESNDREDLLLPWNQSSWINAVAEASTTPIV 569
Query: 563 LVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF 622
LV+ G +DVSFA+ +++I +I+W GYPGE G A+A+++FG +NPGGRLP+TW+ +
Sbjct: 570 LVIMSAGGVDVSFAQNNTKIGAIVWAGYPGEEGGTAIADVLFGKYNPGGRLPLTWFKNEY 629
Query: 623 T-KVPMNDMNMRADSSRQYPGRSYRFYTGTQV-YGFGHGLSYTNYSYKFLSAPSELTISA 680
++PM M +R D++ YPGR+Y+FY G V Y FGHGLSYT+++Y S + T++
Sbjct: 630 VNQIPMTSMALRPDAAHGYPGRTYKFYGGPAVLYPFGHGLSYTSFTYA--SGTTGATVTI 687
Query: 681 SLKAGSDKNILQQTGSR------------LDYVHIDEVTSCTSLRFHVQISVTNAGDVDG 728
+ A +L + + DEV S SLR V N G V G
Sbjct: 688 PIGAWEHCKMLTYKSGKAPSPSPACPALNVASHRCDEVVS-FSLR------VANTGGVGG 740
Query: 729 SHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILP 788
HVV ++ P P KQL+ F RV A + ++ F ++ C+ +I + ++P
Sbjct: 741 DHVVPVYTAPPPEVGDAPRKQLVEFRRVFVPAGAAVDVPFALNVCKTFAIVEETAYTVVP 800
Query: 789 LGNHVLMVGE 798
G ++VG+
Sbjct: 801 SGVSTVIVGD 810
>gi|297736788|emb|CBI25989.3| unnamed protein product [Vitis vinifera]
Length = 746
Score = 590 bits (1520), Expect = e-165, Method: Compositional matrix adjust.
Identities = 307/749 (40%), Positives = 431/749 (57%), Gaps = 90/749 (12%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
+ +C++S RAK L+ +TL EK+ Q D AS + R+G+P Y WWSE+LHG+++ G
Sbjct: 68 FHYCDSSSPYEVRAKDLVDRMTLSEKVMQTGDQASGVERIGLPKYNWWSEALHGVSNFGR 127
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
V F+ V TSFP V++SAASFN+SLW +G AV+ EARAMYN G AGLTFW+PNIN+
Sbjct: 128 CVFFDEVVPGATSFPTVILSAASFNQSLWKTLGQAVSTEARAMYNSGNAGLTFWSPNINV 187
Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
RDPRWGR ETPGEDP +V YAV +
Sbjct: 188 VRDPRWGRILETPGEDPHLVGLYAVNY--------------------------------- 214
Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
H AYDL+ W R F+A ++ QD +TF PF C+++G S +
Sbjct: 215 -------------HYAAYDLDNWKGADRVHFDARVSVQDMAETFVLPFEMCVKEGDVSSV 261
Query: 292 MCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYT-KTHEDSAAG 349
MCSYN++NG+P+C L Q R EW GYI SDCD+V + Q + + DSAA
Sbjct: 262 MCSYNKINGIPSCADSRLLKQTIRGEWDLHGYIVSDCDSVEVMAVDQKWLDSSFSDSAAQ 321
Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGK 409
L AGM+++CGT R A+++GK + D+D +L L+ + +R+G F+G P +
Sbjct: 322 ALNAGMNLDCGTFNNRSLTEAVNQGKANQADLDHSLRYLYVLLMRVGFFDGIP---AFAS 378
Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY 469
LG DD+C++EH +LA +AARQGIVLLKND LPL +V ++A++GP N M G Y
Sbjct: 379 LGKDDICSAEHIELAREAARQGIVLLKNDNATLPLK--SVKNIALVGPHANATDAMIGNY 436
Query: 470 TGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLT 529
GIPC S L + + + Y GC DV C ++ A+ AK+AD I+ AG DL+
Sbjct: 437 AGIPCYYVSPLDAFSS-MGEVRYEKGCADVQCLNETYIFNAMEAAKRADATIIFAGTDLS 495
Query: 530 QETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIG 589
E E DRV LLLPG Q L+ VA S PV+LV+ GG +D+SFA + +I++ILW G
Sbjct: 496 IEAEALDRVDLLLPGYQTQLINQVADLSTGPVVLVIMSGGGVDISFARDNPKIAAILWAG 555
Query: 590 YPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK-VPMNDMNMRADSSRQYPGRSYRFY 648
YPGE G A+A++I G +NPGGRLP+TWY + +PM M +R S YPGR+Y+F+
Sbjct: 556 YPGEQGGNAIADVILGKYNPGGRLPITWYEADYVDMLPMTSMALRPVDSLGYPGRTYKFF 615
Query: 649 TGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTS 708
G+ VY FG+G+SYTN+SY L+ S S K
Sbjct: 616 NGSTVYPFGYGMSYTNFSYS-------LSTSQSCK------------------------- 643
Query: 709 CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
S+ F +++V N G +DGS VV++++ P GT K+++GF+RV G++++ F
Sbjct: 644 -ESIEF--EVAVKNVGRMDGSEVVVVYSSPPLGIAGTHIKKVVGFERVFVKVGGTEKVKF 700
Query: 769 GVDPCEQLSIANKHGRRILPLGNHVLMVG 797
++ C+ L I + G +LP G+H + VG
Sbjct: 701 SMNVCKSLGIVDSTGYALLPSGSHTIKVG 729
>gi|226506870|ref|NP_001146482.1| uncharacterized protein LOC100280070 precursor [Zea mays]
gi|219887469|gb|ACL54109.1| unknown [Zea mays]
gi|413947917|gb|AFW80566.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 835
Score = 588 bits (1516), Expect = e-165, Method: Compositional matrix adjust.
Identities = 312/782 (39%), Positives = 449/782 (57%), Gaps = 51/782 (6%)
Query: 44 CKPPHF-------DSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAY 96
C P F + +C+ SL + R + L+ L L+EK++ L D A PR+G+P Y
Sbjct: 63 CDPARFVALGLDMSRFRYCDASLPYADRVRDLVGRLALEEKVRNLGDQAEGAPRVGLPPY 122
Query: 97 EWWSESLHGIASNGPGVN-FNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY 155
+WW E+LHG++ GPG F V TSFP V+ SAA+FN SLW IG V+ E RAMY
Sbjct: 123 KWWGEALHGVSDVGPGGTWFGDVVPGATSFPLVINSAAAFNESLWRAIGGVVSTEIRAMY 182
Query: 156 NLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
NLG A LT+W+PNIN+ RDPRWGR ETPGEDP VV YAV FV+ Q DD
Sbjct: 183 NLGHAELTYWSPNINVVRDPRWGRASETPGEDPFVVGRYAVNFVRGMQ----DVDD---- 234
Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTF 275
R + R + +S+CCKH AYD++ W R +F+A + E+D +TF
Sbjct: 235 ------RPYAAAADPFSR--PIKVSSCCKHFAAYDVDAWFKADRLTFDAQVEERDMVETF 286
Query: 276 QPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIF 334
+ PF CI G ASC+MCSYN++NG+PAC L + R++W GYI SDCD+V +
Sbjct: 287 ERPFEMCIRDGDASCVMCSYNRINGIPACADARLLSETVRSQWQLHGYIVSDCDSVRVMV 346
Query: 335 ---EYQNYTKTHEDSAAGVLKAGMDINCGTC-------MLRHTQSAIDKGKVQEKDIDRA 384
++ NYT +AA +KAG+D++CG + A+ +GK++E D+D A
Sbjct: 347 RDAKWLNYTGVEATAAA--MKAGLDLDCGMFWEGARDFFTTYGVDAVRQGKIKEGDVDNA 404
Query: 385 LLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPL 444
L N+++ +RLG F+G P ++ LG +VCT HK+LA DAARQG+VLLKND + LPL
Sbjct: 405 LSNVYTTLMRLGFFDGMP---EFESLGASNVCTDGHKELAADAARQGMVLLKNDARRLPL 461
Query: 445 NKNAVSSLAIIGPL--VNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCN 502
+ N ++S++++G L +N M G Y G PC + + V+ T Y C CN
Sbjct: 462 DPNKINSVSLVGLLEHINATDVMLGDYRGKPCRIVTPYNAIRNMVNAT-YVHACDSGACN 520
Query: 503 SDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVI 562
+ G A AK AD IV+AGL+++ E E DR LLLP Q S + +VA S P++
Sbjct: 521 TAEGMGRASSTAKIADATIVIAGLNMSVERESNDREDLLLPWNQSSWINAVAMASPTPIV 580
Query: 563 LVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF 622
LV+ G +DVSFA +++I +I+W GYPGE G A+A+++FG +NPGGRLP+TW+ +
Sbjct: 581 LVIMSAGGVDVSFAHNNTKIGAIVWAGYPGEEGGTAIADVLFGKYNPGGRLPLTWFKNEY 640
Query: 623 T-KVPMNDMNMRADSSRQYPGRSYRFYTGTQV-YGFGHGLSYTNYSYKFLSAPSELTISA 680
++PM M +R D++ YPGR+Y+FY G V Y FGHGLSYTN+SY S + T++
Sbjct: 641 VNQIPMTSMALRPDAALGYPGRTYKFYGGPAVLYPFGHGLSYTNFSYA--SGTTGATVTI 698
Query: 681 SLKAGSDKNILQ-QTGSRLDYVHIDEVTSCTSLRFHV---QISVTNAGDVDGSHVVMLFA 736
+ A +L + G+ + + + V + V N G V G HVV ++
Sbjct: 699 HIGAWEHCKMLTYKMGAPSPSPACPALNVASHMCSEVVSFSLRVANTGGVGGDHVVPVYT 758
Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
P P KQL+ F RV A + ++ F ++ C+ +I + ++P G ++V
Sbjct: 759 APPPEVGDAPLKQLVAFRRVFVPAGAAVDVPFALNVCKTFAIVEETAYTVVPSGVSTVVV 818
Query: 797 GE 798
G+
Sbjct: 819 GD 820
>gi|224128360|ref|XP_002320310.1| predicted protein [Populus trichocarpa]
gi|222861083|gb|EEE98625.1| predicted protein [Populus trichocarpa]
Length = 635
Score = 588 bits (1515), Expect = e-165, Method: Compositional matrix adjust.
Identities = 286/663 (43%), Positives = 408/663 (61%), Gaps = 43/663 (6%)
Query: 147 VAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGEN 206
V+ EARAM+N G AGLT+W+PN+NIFRDPRWGRGQETPGEDP+VV YA +V+ QG +
Sbjct: 4 VSDEARAMFNGGVAGLTYWSPNVNIFRDPRWGRGQETPGEDPVVVGKYAASYVRGLQGSD 63
Query: 207 WKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMI 266
G+ L ++ACCKH AYDL+ W R+ FNA +
Sbjct: 64 ---------------------------GNRLKVAACCKHFTAYDLDNWNGVDRFHFNAEV 96
Query: 267 TEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSD 326
++QD EDTF PFR C+++GK + +MCSYNQVNG+P C +L +K +G +
Sbjct: 97 SKQDMEDTFDVPFRMCVKEGKVASVMCSYNQVNGIPTCADPNLLKKT-----VRGTLFQT 151
Query: 327 CDAVATIFEYQNYTKTHEDSAAGVLK-AGMDINCGTCMLRHTQSAIDKGKVQEKDIDRAL 385
+ I + +LK A +D++CG + +HT+ A+ KG + E +I+ AL
Sbjct: 152 VTLLEFIMGSNTILQPRRKQPRMLLKQASLDLDCGPFLGQHTEDAVKKGLLNEAEINNAL 211
Query: 386 LNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
LN +VQ+RLG+F+G+P YG LGP+DVCT H++LAL+AARQGIVLLKN LPL+
Sbjct: 212 LNTLTVQMRLGMFDGEPSSQLYGNLGPNDVCTPAHQELALEAARQGIVLLKNHGPSLPLS 271
Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDA 505
S+AI+GP N + M G Y G+ C + L+G++ Y H GC DV C SD
Sbjct: 272 TRRHLSVAIVGPNSNVTATMIGNYAGLACGYTTPLQGIQRYAQTIHR-QGCADVACVSDQ 330
Query: 506 GFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVL 565
F A+ A++AD ++V GLD + E E RDR LLLPG+Q LV+ VA SK P ILVL
Sbjct: 331 QFSAAIDAARQADATVLVMGLDQSIEAEFRDRTGLLLPGRQQELVSKVAAASKGPTILVL 390
Query: 566 TGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TK 624
GGP+DVSFAE D +I SI+W GYPG+AG A+++++FG NPGG+LPMTWYP+ + T
Sbjct: 391 MSGGPIDVSFAENDPKIGSIVWAGYPGQAGGAAISDVLFGITNPGGKLPMTWYPQDYITN 450
Query: 625 VPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISA-SLK 683
+PM +M MR+ S+ YPGR+YRFY G VY FGHG+SYTN+ + SAP+ +++ +
Sbjct: 451 LPMTNMAMRSSKSKGYPGRTYRFYKGKVVYPFGHGISYTNFVHTIASAPTMVSVPLDGHR 510
Query: 684 AGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ 743
GS + R+ + C L +Q+ V N G +DG+H +++++R P
Sbjct: 511 HGSGNATISGKAIRVTHAR------CNRLSLGMQVDVKNTGSMDGTHTLLVYSR-PPARH 563
Query: 744 GTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSL 803
P KQL+ F++VH A + + + C+ LS+ + G R +P+G H L +G+++HS+
Sbjct: 564 WAPHKQLVAFEKVHVAAGTQQRVGINIHVCKSLSVVDGSGIRRIPMGEHSLHIGDVKHSV 623
Query: 804 TIE 806
+++
Sbjct: 624 SLQ 626
>gi|40363751|dbj|BAD06320.1| putative beta-xylosidase [Triticum aestivum]
Length = 573
Score = 579 bits (1492), Expect = e-162, Method: Compositional matrix adjust.
Identities = 285/575 (49%), Positives = 374/575 (65%), Gaps = 12/575 (2%)
Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
+L SACCKH AYDLE W +R++F+A +TEQD DT+ PPF+SC+E G AS IMCSY
Sbjct: 4 DLEASACCKHFTAYDLENWKGVTRFAFDAKVTEQDLADTYNPPFKSCVEDGGASGIMCSY 63
Query: 296 NQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
N+VNGVP C +L K AR +W F GYITSDCDAVA I + Q Y K ED+ A VLKAG
Sbjct: 64 NRVNGVPTCADHNLLSKTARGDWSFNGYITSDCDAVAIIHDVQGYAKAPEDAVADVLKAG 123
Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDD 414
MD+NCG + H SA +GK+ +DIDRAL NLF++++RLGLFNG+P+ +YG +G D
Sbjct: 124 MDVNCGGYIQTHGVSAYQQGKITGEDIDRALRNLFAIRMRLGLFNGNPKYNRYGNIGADQ 183
Query: 415 VCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPC 474
VC EH+ LAL AA+ GIVLLKND LPL+K+ VSS+A+IGP NN S + G Y G PC
Sbjct: 184 VCKKEHQDLALQAAQDGIVLLKNDAGALPLSKSKVSSVAVIGPNGNNASLLLGNYFGPPC 243
Query: 475 SPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETED 534
+ + L+ YV + GC+ CN + EAV A AD+V++ GLD QE E+
Sbjct: 244 ISVTPFQALQGYVKDATFVQGCNAAVCNV-SNIGEAVHAASSADYVVLFMGLDQNQEREE 302
Query: 535 RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEA 594
DR+ L LPG Q SLV VA +K+PVILVL GGP+DV+FA+ + +I +I+W GYPG+A
Sbjct: 303 VDRLELGLPGMQESLVNKVADAAKKPVILVLLCGGPVDVTFAKNNPKIGAIVWAGYPGQA 362
Query: 595 GAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVY 654
G A+A+++FG+ NPGGRLP+TWYP+ FT VPM DM MRAD S YPGR+YRFY G VY
Sbjct: 363 GGIAIAQVLFGEHNPGGRLPVTWYPKEFTAVPMTDMRMRADPSTGYPGRTYRFYKGKTVY 422
Query: 655 GFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHID----EVTSCT 710
FG+GLSY+ YS++F S ++ + ++ L+ T S V D +C
Sbjct: 423 NFGYGLSYSKYSHRFASEGTKPPSMSGIEG------LKATASAAGTVSYDVEEMGAEACD 476
Query: 711 SLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGV 770
LRF + V N G +DG H V+LF R P + G P QLIGF VH A + + F V
Sbjct: 477 RLRFPAVVRVQNHGPMDGRHPVLLFLRWPNATDGRPASQLIGFQSVHLRADEAAHVEFEV 536
Query: 771 DPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
PC+ S A + GR+++ G+H + VG+ L+
Sbjct: 537 SPCKHFSRAAEDGRKVIDQGSHFVKVGDDEFELSF 571
>gi|326523729|dbj|BAJ93035.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 810
Score = 578 bits (1491), Expect = e-162, Method: Compositional matrix adjust.
Identities = 310/779 (39%), Positives = 445/779 (57%), Gaps = 53/779 (6%)
Query: 44 CKPPHFDS-------YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAY 96
C P F + + +C+ SL + R + L+ LTL+EK++ L D A R+G+P Y
Sbjct: 46 CDPARFAALGLEMAGFRYCDASLPYADRVRDLVGRLTLEEKVRNLGDRAEGAARVGLPPY 105
Query: 97 EWWSESLHGIASNGPG-VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY 155
WW E+LHG++ GPG F V TSFP V+ SAA+FN +LW IG AV+ E RAMY
Sbjct: 106 LWWGEALHGVSDTGPGGTRFGDVVPGATSFPLVINSAAAFNETLWGAIGGAVSTEIRAMY 165
Query: 156 NLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
NLG A LT+W+PNIN+ RDPRWGR ETPGEDP VV YAV FV++ Q D G G
Sbjct: 166 NLGHAELTYWSPNINVVRDPRWGRASETPGEDPFVVGRYAVSFVRAMQ----DIDGAGPG 221
Query: 216 FGFRE-KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDT 274
G R +K +S+CCKH AYD++ W R +F+A + E+D +T
Sbjct: 222 AGADPFARPIK-------------VSSCCKHYAAYDVDAWLTADRLTFDAQVEERDMIET 268
Query: 275 FQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATI 333
F+ PF C+ G ASC+MCSYN++NGVPAC L + R EW GYI SDCD+V +
Sbjct: 269 FERPFEMCVRDGDASCVMCSYNRINGVPACANARLLSETVRGEWQLHGYIVSDCDSVRVM 328
Query: 334 FEYQNYTKTHE-DSAAGVLKAGMDINCGT-------CMLRHTQSAIDKGKVQEKDIDRAL 385
+ + ++ A +KAG+D++CG A+ +GK++E ++D AL
Sbjct: 329 VRDAKWLGYNGVEATAAAMKAGLDLDCGMFWEGAQDFFTAFGLDAVRQGKLRESEVDNAL 388
Query: 386 LNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
NL+ +RLG F+G P + LG +DVCT EHK+LA DAARQG+VL+KND LPL+
Sbjct: 389 RNLYLTLMRLGFFDGIP---ELESLGANDVCTEEHKELAADAARQGMVLIKNDHGRLPLD 445
Query: 446 KNAVSSLAIIGPL--VNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNS 503
+ V+SL+++G L +N M G Y G PC + + VS T C C++
Sbjct: 446 TSKVNSLSLVGLLQHINATDVMLGDYRGKPCRVVTPYDAIRKVVSATSMQV-CDHGACST 504
Query: 504 DAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVIL 563
A K D IV+AGL+++ E E DR LLLP Q + + +VA S P+IL
Sbjct: 505 AAN-------GKTVDATIVIAGLNMSVEKEGNDREDLLLPWNQTNWINAVAEASPYPIIL 557
Query: 564 VLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWY-PESF 622
V+ G +DVSFA+ + +I +I+W GYPGE G A+A+++FG +NPGGRLP+TWY E
Sbjct: 558 VIISAGGVDVSFAQNNPKIGAIVWAGYPGEEGGTAIADVLFGKYNPGGRLPLTWYKSEYI 617
Query: 623 TKVPMNDMNMRADSSRQYPGRSYRFYTGTQV-YGFGHGLSYTNYSYKFLSAPSELTISAS 681
+K+PM M +R + + YPGR+Y+FY G +V Y FGHGLSY+N+SY + + +T+
Sbjct: 618 SKIPMTSMALRPVADKGYPGRTYKFYGGPEVLYPFGHGLSYSNFSYASDTTGASVTVRVG 677
Query: 682 LKAGSDKNILQQTGSR--LDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVP 739
S K + ++ G+ L ++ ++V N G DG+HVVM++ P
Sbjct: 678 AWE-SCKQLTRKPGTTAPLACPAVNVAGHGCKEEVSFSLTVANRGSRDGAHVVMVYTVPP 736
Query: 740 KVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
P KQL+ F RV A + ++ F ++ C+ +I + ++P G ++VG+
Sbjct: 737 AEVDDAPLKQLVAFRRVFVPAGAAVQVPFTLNVCKAFAIVEETAYTVVPSGVSTVLVGD 795
>gi|14164501|dbj|BAB55751.1| putative alpha-L-arabinofuranosidase/beta-D- xylosidase isoenzyme
ARA-I [Oryza sativa Japonica Group]
Length = 818
Score = 578 bits (1489), Expect = e-162, Method: Compositional matrix adjust.
Identities = 320/790 (40%), Positives = 446/790 (56%), Gaps = 56/790 (7%)
Query: 44 CKPPHFDS-------YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAY 96
C P F + +P+C+ SL + R + L+ +TL+EK+ L D A PR+G+P Y
Sbjct: 47 CDPARFAAAGLDMAGFPYCDASLPYADRVRDLVGRMTLEEKVANLGDRAGGAPRVGLPRY 106
Query: 97 EWWSESLHGIASNGPG-VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY 155
WW E+LHG++ GPG F V TSFP V+ SAASFN +LW IG V+ E RAMY
Sbjct: 107 LWWGEALHGVSDVGPGGTWFGDAVPGATSFPLVINSAASFNETLWRAIGGVVSTEIRAMY 166
Query: 156 NLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
NLG A LT+W+PNIN+ RDPRWGR ETPGEDP VV YAV FV+ Q D G
Sbjct: 167 NLGHAELTYWSPNINVVRDPRWGRASETPGEDPFVVGRYAVNFVRGMQ------DIDGAT 220
Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTF 275
F + +S+CCKH AYD++ W R +F+A + E+D +TF
Sbjct: 221 TAASAAAATDAFSR------PIKVSSCCKHYAAYDVDAWNGTDRLTFDARVQERDMVETF 274
Query: 276 QPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIF 334
+ PF CI G ASC+MCSYN++NGVPAC L + R +W GYI SDCD+V +
Sbjct: 275 ERPFEMCIRDGDASCVMCSYNRINGVPACADARLLTETVRRDWQLHGYIVSDCDSVRVMV 334
Query: 335 ---EYQNYTKTHEDSAAGVLKAGMDINCGT-------CMLRHTQSAIDKGKVQEKDIDRA 384
++ YT +AA +KAG+D++CG + A+ +GK++E +D A
Sbjct: 335 RDAKWLGYTGVEATAAA--MKAGLDLDCGMFWEGVHDFFTTYGVDAVRQGKLKESAVDNA 392
Query: 385 LLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPL 444
L NL+ +RLG F+G P + LG DVCT EHK+LA DAARQG+VLLKND LPL
Sbjct: 393 LTNLYLTLMRLGFFDGIP---ELESLGAADVCTEEHKELAADAARQGMVLLKNDAALLPL 449
Query: 445 NKNAVSSLAIIGPL--VNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCN 502
+ V+S+A+ G L +N M G Y G PC + G+ VS T C C+
Sbjct: 450 SPEKVNSVALFGQLQHINATDVMLGDYRGKPCRVVTPYDGVRKVVSSTS-VHACDKGSCD 508
Query: 503 SDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVI 562
+ A AK D IVVAGL+++ E E DR LLLP Q S + +VA S P++
Sbjct: 509 T------AAAAAKTVDATIVVAGLNMSVERESNDREDLLLPWSQASWINAVAEASPSPIV 562
Query: 563 LVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYP-ES 621
LV+ G +DVSFA+ + +I +++W GYPGE G A+A+++FG +NPGGRLP+TWY E
Sbjct: 563 LVIMSAGGVDVSFAQDNPKIGAVVWAGYPGEEGGTAIADVLFGKYNPGGRLPLTWYKNEY 622
Query: 622 FTKVPMNDMNMRADSSRQYPGRSYRFYTGTQV-YGFGHGLSYTNYSYKFLSAPSELTISA 680
+K+PM M +R D+ YPGR+Y+FY G V Y FGHGLSYTN++Y +A + +T+
Sbjct: 623 VSKIPMTSMALRPDAEHGYPGRTYKFYGGADVLYPFGHGLSYTNFTYASATAAAPVTV-- 680
Query: 681 SLKAGSDKNILQQT-----GSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLF 735
K G+ + Q T S ++ + ++V N G DG+HVV ++
Sbjct: 681 --KVGAWEYCKQLTYKAGVSSPPACPAVNVASHACQEEVSFAVTVANTGGRDGTHVVPMY 738
Query: 736 ARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLM 795
P G P KQL+ F RV A + E++F ++ C+ +I + ++P G ++
Sbjct: 739 TAPPAEVDGAPRKQLVAFRRVRVAAGAAVEVAFALNVCKAFAIVEETAYTVVPSGVSRVL 798
Query: 796 VGELRHSLTI 805
VG+ SL+
Sbjct: 799 VGDDALSLSF 808
>gi|125576923|gb|EAZ18145.1| hypothetical protein OsJ_33695 [Oryza sativa Japonica Group]
Length = 591
Score = 577 bits (1487), Expect = e-162, Method: Compositional matrix adjust.
Identities = 293/615 (47%), Positives = 383/615 (62%), Gaps = 43/615 (6%)
Query: 190 VVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAY 249
+ S YAV FVK QG + L SACCKH+ AY
Sbjct: 1 MASKYAVAFVKGMQGNSSAI---------------------------LQTSACCKHVTAY 33
Query: 250 DLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDL 309
DLE W RY+FNA +T QD EDT+ PPFRSC+ KA+CIMC+Y +NGVPAC DL
Sbjct: 34 DLEDWNGVQRYNFNAKVTAQDLEDTYNPPFRSCVVDAKATCIMCAYTGINGVPACANADL 93
Query: 310 FQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQ 368
K R +WG GYI SDCDAVA + + Q YT+T ED+ A LKAG+D+NCGT M +H
Sbjct: 94 LTKTVRGDWGLDGYIASDCDAVAIMRDAQRYTQTPEDAVAVALKAGLDMNCGTYMQQHAT 153
Query: 369 SAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-YGKLGPDDVCTSEHKKLALDA 427
+AI +GK+ E+DID+AL NLF++++RLG F+GDPR YG LG D+CT EH+ LAL+A
Sbjct: 154 AAIQQGKLTEEDIDKALKNLFAIRMRLGHFDGDPRSNSVYGGLGAADICTPEHRSLALEA 213
Query: 428 ARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV 487
A GIVLLKND LPL++ AV+S A+IGP N+ + G Y G PC + L G+ Y+
Sbjct: 214 AMDGIVLLKNDAGILPLDRTAVASAAVIGPNANDGLALIGNYFGPPCESTTPLNGILGYI 273
Query: 488 SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQM 547
+ +GC+ C+ A +A +A +D+V + GL QE+E RDR SLLLPG+Q
Sbjct: 274 KNVRFLAGCNSAACDVAA-TDQAAAVASSSDYVFLFMGLSQKQESEGRDRTSLLLPGEQQ 332
Query: 548 SLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDF 607
SL+T+VA +KRPVILVL GGP+DV+FA+ + +I +ILW GYPG+AG A+A ++FGD
Sbjct: 333 SLITAVADAAKRPVILVLLTGGPVDVTFAQTNPKIGAILWAGYPGQAGGLAIARVLFGDH 392
Query: 608 NPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSY 667
NPGGRLP+TWYPE FTKVPM DM MRAD + YPGRSYRFY G VY FG+GLSY++YS
Sbjct: 393 NPGGRLPVTWYPEEFTKVPMTDMRMRADPATGYPGRSYRFYQGKTVYKFGYGLSYSSYSR 452
Query: 668 KFLS----APSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTS--CTSLRFHVQISVT 721
+ +S A S + ASL+ + T + HI+E+ + C L+F + V
Sbjct: 453 QLVSGGKPAESYTNLLASLRTTT-------TSEGDESYHIEEIGTDGCEQLKFPAVVEVQ 505
Query: 722 NAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANK 781
N G +DG H V+++ R P G P QLIGF H I F + PCE S K
Sbjct: 506 NHGPMDGKHSVLMYLRWPNAKGGRPTTQLIGFRSQHLKVGEKANIRFDISPCEHFSRVRK 565
Query: 782 HGRRILPLGNHVLMV 796
G++++ G+H LMV
Sbjct: 566 DGKKVIDRGSHYLMV 580
>gi|356510699|ref|XP_003524073.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
[Glycine max]
Length = 613
Score = 574 bits (1479), Expect = e-161, Method: Compositional matrix adjust.
Identities = 289/580 (49%), Positives = 380/580 (65%), Gaps = 35/580 (6%)
Query: 36 VLNKPDFPC---KPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLG 92
V+ F C K P Y FC+ SL + R K L+ LTLQEKI L ++A + RLG
Sbjct: 24 VVRGQTFACDVGKSPAVAGYGFCDKSLGVEARVKDLVGRLTLQEKIGNLVNSAGDVSRLG 83
Query: 93 IPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEAR 152
IP YEWWSE+LHG+++ G G F+ V TSFP +++AASFN SL+ IG V+ EA
Sbjct: 84 IPRYEWWSEALHGVSNVGLGTRFSNVVPGATSFPMPILTAASFNTSLFEVIGRVVSTEAG 143
Query: 153 AMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDG 212
AMYN+G AGLT+W+PNINIFRDPRWGRG ETPGEDP++ S YA +VK Q
Sbjct: 144 AMYNVGLAGLTYWSPNINIFRDPRWGRGLETPGEDPVLTSKYAAGYVKGLQ--------- 194
Query: 213 GIGFGFREKRVLKGFGEESDRGD--ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQD 270
++D GD +L ++ACCKH AYD++KW RY+FNA++T+QD
Sbjct: 195 -----------------QTDGGDPNKLKVAACCKHYTAYDVDKWKGIQRYTFNAVLTKQD 237
Query: 271 TEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDA 329
EDTFQPPF+SC+ G + +MCSYN+VNG P C DL + R EW GY+ SDCD+
Sbjct: 238 LEDTFQPPFKSCVIDGNVASVMCSYNKVNGKPTCADPDLLKGVVRGEWKLNGYMVSDCDS 297
Query: 330 VATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLF 389
V +++YQ+YTKT E++AA + AG+D+NCG + ++T+ A+ +G + E I+ A+ N F
Sbjct: 298 VEVLYKYQHYTKTPEEAAAISILAGLDLNCGRFLGQYTEGAVKQGLIDES-INNAVSNNF 356
Query: 390 SVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAV 449
+ +RLG F+GDPRK YG LGP DVCT +++LA +AARQGIV LKN LPLN A+
Sbjct: 357 ATLMRLGFFDGDPRKQPYGNLGPKDVCTPANQELAREAARQGIVSLKNSPASLPLNAKAI 416
Query: 450 SSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHE 509
SLA+IGP N M G Y GIPC S L+GL A+V T YA+GC DV C + +
Sbjct: 417 KSLAVIGPNANATRVMIGNYEGIPCKYISPLQGLTAFVP-TSYAAGCLDVRCPNPV-LDD 474
Query: 510 AVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGG 569
A +I+ D ++V G L E E DRV++LLPGQQ LVT VA SK PVILV+ GG
Sbjct: 475 AKKISASGDATVIVVGASLAIEAESLDRVNILLPGQQQLLVTEVANASKGPVILVIMSGG 534
Query: 570 PLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNP 609
+DVSFA+ +++I+SILW+GYPGEAG A+A++IFG NP
Sbjct: 535 GMDVSFAKDNNKITSILWVGYPGEAGGAAIADVIFGFHNP 574
>gi|340370206|ref|XP_003383637.1| PREDICTED: probable beta-D-xylosidase 5-like [Amphimedon
queenslandica]
Length = 728
Score = 572 bits (1474), Expect = e-160, Method: Compositional matrix adjust.
Identities = 311/771 (40%), Positives = 449/771 (58%), Gaps = 74/771 (9%)
Query: 44 CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESL 103
C+ F++Y +C+ + SI R L+S +T+ +KI QL +A AIP L IPAY+WWSE L
Sbjct: 21 CEKAPFNTYKYCDYTQSIPERVNDLLSRMTILDKIPQLITSAPAIPSLDIPAYQWWSEGL 80
Query: 104 HGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLT 163
HG+A + PGV+F G + TSFPQV+ A+FN SL + ++ EARA N GQAGLT
Sbjct: 81 HGVAGS-PGVHFGGNFPNATSFPQVIGLGATFNMSLVLAMAQVISTEARAFANGGQAGLT 139
Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
++APNINIFRDPRWGRGQETPGEDP + S YA FVK Q
Sbjct: 140 YFAPNINIFRDPRWGRGQETPGEDPYLSSQYAANFVKGMQ-------------------- 179
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
E +D L A CKH AYDLE + N SR++FNA++++QD E+T+ P FRSC+
Sbjct: 180 -----EGADDTRYLKTIATCKHYAAYDLENYLNLSRHTFNAIVSDQDFEETYFPAFRSCV 234
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
E+GK IMCSYN VNGVP+C + + AR +WGF+GY+ SDC A++ I YT
Sbjct: 235 EEGKVGSIMCSYNAVNGVPSCANDFINNEVARGKWGFEGYVVSDCGAISDIINSHKYTSN 294
Query: 343 HEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDP 402
+D+ A L+ G D+NCG H Q+A D G + + DIDRA+ LF+ ++RLG+F+ P
Sbjct: 295 TDDTVAAGLRGGCDLNCGHFYSDHAQAAYDNGAITDDDIDRAMTRLFTYRMRLGMFD-PP 353
Query: 403 RKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNI 462
+ D V T +H+ LALDA+R+ IVLL+N+K LPL+ +A++GP
Sbjct: 354 SMQPFRDYTNDKVDTKQHEALALDASRESIVLLQNNKDILPLSLTTHRKIALVGPHGQAQ 413
Query: 463 SQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKK--ADFV 520
M G Y G S ++GL+ +A+GC V C + AGF E ++ ++ + +
Sbjct: 414 GAMQGNYKGTAPYLISPMQGLQDLGLSVTFAAGCTQVACPTIAGFSEVTKLVEEHSIEAI 473
Query: 521 IVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKR--PVILVLTGGGPLDVSFAEA 578
I V GLD +QE+E DR SL LPGQQ+ L+ + + + P I+V+ GGP+D+S +
Sbjct: 474 IAVIGLDESQESEGHDRTSLTLPGQQVQLLEDIKKKAVPGIPFIVVVMSGGPVDLSGVKD 533
Query: 579 DSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSS 637
+ +ILW GYPG++G +A+AE+I+G NP GRLP+T+YP S+ ++P +M+MR
Sbjct: 534 IAD--AILWAGYPGQSGGQAIAEVIYGKVNPSGRLPVTFYPASYINEIPYTNMSMRVP-- 589
Query: 638 RQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSR 697
PGRSY+FYTGT V+ FG GLSYT + K+ + P+ LK D ++
Sbjct: 590 ---PGRSYKFYTGTPVFPFGFGLSYTTFEMKWKNPPN----VTHLKTTHDVDV------- 635
Query: 698 LDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVH 757
+ ++ VTNAG GS V + A + G P K+L GF +++
Sbjct: 636 -----------------NYEVVVTNAGKRSGS--VSVLAYITSTVPGAPMKELFGFQKIY 676
Query: 758 TVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG---ELRHSLTI 805
+ S +SF +P + + +KHG R + G + + +G +L+H++ I
Sbjct: 677 LKPEQSMTLSFVAEP-KVFTTVDKHGERKIRPGTYKITIGDTSDLKHTVFI 726
>gi|357128056|ref|XP_003565692.1| PREDICTED: beta-D-xylosidase 3-like [Brachypodium distachyon]
Length = 821
Score = 566 bits (1458), Expect = e-158, Method: Compositional matrix adjust.
Identities = 308/783 (39%), Positives = 441/783 (56%), Gaps = 50/783 (6%)
Query: 44 CKPPHFDS-------YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNAS-AIPRLGIPA 95
C P F S + +C+ SL + R + L+ LTL+EK+ L D A A R+G+P
Sbjct: 46 CDPARFASLGLDMAGFRYCDASLPYAERVRDLVGRLTLEEKVANLGDQAKGAEQRVGLPR 105
Query: 96 YEWWSESLHGIASNGPG-VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAM 154
Y WW E+LHG++ PG F V TSFP VL SAA+FN +LW IG A + E RAM
Sbjct: 106 YMWWGEALHGVSDTNPGGTRFGDVVPGATSFPLVLNSAAAFNETLWRAIGGATSTEIRAM 165
Query: 155 YNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGI 214
YNLG A LT+W+PNIN+ RDPRWGR ETPGEDP +V +AV FV++ Q + ++ G
Sbjct: 166 YNLGHAELTYWSPNINVVRDPRWGRASETPGEDPFLVGRFAVSFVRAMQDIDDGANAGAG 225
Query: 215 GFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDT 274
+R L +S+CCKH AYD++KW R SF+A + E+D +T
Sbjct: 226 AADPFARR--------------LKVSSCCKHYAAYDVDKWFGADRLSFDANVQERDMVET 271
Query: 275 FQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATI 333
F+ PF C+ G ASC+MCSYN++NGVPAC G L R +W GYI SDCD+V +
Sbjct: 272 FERPFEMCVRDGDASCVMCSYNRINGVPACANGRLLTGTVRRDWQLHGYIVSDCDSVRVM 331
Query: 334 F---EYQNYTKTHEDSAAGVLKAGMDINCGT-------CMLRHTQSAIDKGKVQEKDIDR 383
++ Y +AA +KAG+D++CG + A+ +GK++E ++D
Sbjct: 332 VRDAKWLGYDGVQATAAA--MKAGLDLDCGMFWEGAKDFFTAYGLQAVRQGKLKEAEVDE 389
Query: 384 ALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLP 443
AL +L+ +RLG F+G P ++ LG DVCT EHK++A +AARQG+VLLKND LP
Sbjct: 390 ALGHLYLTLMRLGFFDGSP---EFQSLGASDVCTEEHKEMAAEAARQGMVLLKNDHDRLP 446
Query: 444 LNKNAVSSLAIIGPL--VNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPC 501
L+ N V+SLA++G L +N M G Y G PC + + VS T + C C
Sbjct: 447 LDANKVNSLALVGLLQHINATDVMLGDYRGKPCRVVTPYEAIRKVVSGTSMQA-CDKGAC 505
Query: 502 NSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPV 561
+ A A AK D IV+ GL+++ E E DR LLLP Q + +VA S+ P+
Sbjct: 506 GTTA--LGAAIAAKTVDATIVITGLNMSVEREGNDREDLLLPWDQTQWINAVAEASRDPI 563
Query: 562 ILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPES 621
LV+ G +D+SFA+ + +I +ILW GYPGE G +A+++FG +NPGGRLP+TWY
Sbjct: 564 TLVIISAGGVDISFAQNNPKIGAILWAGYPGEEGGTGIADVLFGKYNPGGRLPLTWYKNE 623
Query: 622 FT-KVPMNDMNMRADSSRQYPGRSYRFYTGTQV-YGFGHGLSYTNYSYKFLSAPSELTIS 679
+ K+PM M +R + + YPGR+Y+FY+G V Y FGHGLSYTN++Y + + +T+
Sbjct: 624 YIGKLPMTSMALRPVADKGYPGRTYKFYSGPDVLYPFGHGLSYTNFTYDSYTTGASVTVK 683
Query: 680 -ASLKAGSDKNILQQTGSRLDYV---HIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLF 735
+ S KN+ + G+ I+ + V+N G + GSHVV ++
Sbjct: 684 IGTAWEDSCKNLTYKPGTTASTAPCPAINVAGHGCQEEVSFTLKVSNTGGIGGSHVVPVY 743
Query: 736 ARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLM 795
P P KQL+ F R+ A + E+ F + C+ +I ++P G ++
Sbjct: 744 TAPPAEVDDAPLKQLVAFRRMFVPAGDAVEVPFTLSVCKAFAIVEGTAYTVVPAGVSRVL 803
Query: 796 VGE 798
VG+
Sbjct: 804 VGD 806
>gi|348667575|gb|EGZ07400.1| xylosidase [Phytophthora sojae]
Length = 751
Score = 565 bits (1457), Expect = e-158, Method: Compositional matrix adjust.
Identities = 303/772 (39%), Positives = 439/772 (56%), Gaps = 72/772 (9%)
Query: 41 DFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWS 100
D C P S PFC+ SL I R L++ + L++ + L + ASA P + +P+YEWW+
Sbjct: 21 DNVCDDPKVSSLPFCDGSLPIDARVSDLVNRIPLEQAVGLLVNKASAAPSVNVPSYEWWN 80
Query: 101 ESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA 160
E+LHG+A + PGV F G +++ TSFPQVL +AASFNR+L+ I A++ EARA YN A
Sbjct: 81 EALHGVALS-PGVTFKGPLTAATSFPQVLSTAASFNRTLFYQIAEAISTEARAFYNEKNA 139
Query: 161 GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFRE 220
GLTFW PN+NIFRDPRWGRGQETPGEDP + YAV FV+ QGE
Sbjct: 140 GLTFWTPNVNIFRDPRWGRGQETPGEDPYLTGEYAVAFVRGLQGE--------------- 184
Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFR 280
++G E D L +S+CCKH AY E R+ +A++T+QD DT+ P F
Sbjct: 185 --AMEGH-ENKDDNKFLKISSCCKHFSAYSQE----VPRHRNDAIVTKQDQADTYFPAFE 237
Query: 281 SCIEQGKASCIMCSYNQVNGVPACL-RGDLFQKARNEWGFKGYITSDCDAVATIFEYQNY 339
C+++G S IMCSYN VNG+P+C +G L RN+W F GYITSDC+AVA + ++
Sbjct: 238 DCVKRGHVSSIMCSYNAVNGIPSCADKGLLTDLVRNQWKFDGYITSDCEAVADVIYRHHF 297
Query: 340 TKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
T++ E + A L AGMD+NCG + +H SAI++G V + + AL N F V +RLG+F
Sbjct: 298 TQSPEQTCATTLDAGMDLNCGEFLRQHLSSAIEQGIVSTEMVHNALKNQFRVMMRLGMFE 357
Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVS---SLAIIG 456
+ + + D V T+ H++LAL+AARQ +VLLKN+ LPL + S SLA+IG
Sbjct: 358 KGTQ--PFSNITKDAVDTAAHRQLALEAARQSVVLLKNEDNTLPLATDVFSKDGSLALIG 415
Query: 457 PLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKK 516
P N + + G Y GIP + L+G+ +YV Y+ GC V F EA+ + KK
Sbjct: 416 PHFNASTALLGNYFGIPSHIVTPLKGVSSYVPNVAYSLGC-KVSGEVLPDFDEAIEVVKK 474
Query: 517 ADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFA 576
AD V+V GLD +QE E+ DR L LPG Q++L+ + + P++LVL GG +D+S
Sbjct: 475 ADRVVVFMGLDQSQEREEIDRYHLKLPGFQIALLNRILAAASHPIVLVLISGGSVDLSLY 534
Query: 577 EADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK-VPMNDMNMRAD 635
+ ++ +I++ GY G+AG +ALA+++FG ++P GRL T+Y + +P+ DM+MR
Sbjct: 535 KNHPKVGAIVFGGYLGQAGGQALADMLFGKYSPAGRLTQTFYDSDYVNTMPIYDMHMRPT 594
Query: 636 SSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTG 695
PGR+YRF++G VY FG GLSYT +
Sbjct: 595 FVTGNPGRTYRFFSGAPVYEFGFGLSYTTFH----------------------------- 625
Query: 696 SRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ-GTPEKQLIGFD 754
+C S +I+VTN GDV+G ++++A P + G P + L+ F+
Sbjct: 626 -----------KACRSCVASFEITVTNLGDVEGEDAILIYAEPPHAGEGGRPLRSLVAFE 674
Query: 755 RVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
R V G + + ++AN G ++ GN + V L+H + ++
Sbjct: 675 RTALVTTGKTATADFCLEAKAFALANAEGSWVVEQGNWTIHVDTLQHRVNVQ 726
>gi|167525174|ref|XP_001746922.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774702|gb|EDQ88329.1| predicted protein [Monosiga brevicollis MX1]
Length = 1620
Score = 563 bits (1452), Expect = e-157, Method: Compositional matrix adjust.
Identities = 327/770 (42%), Positives = 439/770 (57%), Gaps = 74/770 (9%)
Query: 43 PCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSES 102
PC ++PFCN SL + TR + +IS L++Q+K+ ++ A A G+PAY+WWSE+
Sbjct: 915 PCDELPAKNFPFCNASLDLDTRIRDVISRLSIQDKVALTANTAGAAADAGLPAYQWWSEA 974
Query: 103 LHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGL 162
LHG+ + PGV F G V + TSFPQV+ ++ASFN++LW +IG ++ EARAM N+ QAGL
Sbjct: 975 LHGVGFS-PGVTFMGKVQAATSFPQVIHTSASFNKTLWHHIGMTISTEARAMNNVNQAGL 1033
Query: 163 TFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQ-GENWKSDDGGIGFGFREK 221
TFWAPNINI RDPRWGRGQETPGEDP YA FV Q GE +
Sbjct: 1034 TFWAPNINIIRDPRWGRGQETPGEDPYATGLYAANFVPGMQEGE--------------DT 1079
Query: 222 RVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRS 281
R +K S+CCKH Y+LE W N R+ FNA+ T+QD DT+ P F S
Sbjct: 1080 RYIKA-------------SSCCKHFFDYNLEDWHNVDRHHFNAIATDQDIADTYLPAFES 1126
Query: 282 CIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFEYQNYT 340
C+ G+AS +MCSYN VNGVP+C D+ AR WGF GYITSDC AV ++ Y
Sbjct: 1127 CVRFGRASSLMCSYNAVNGVPSCANADIMTTLAREAWGFDGYITSDCGAVEDVYSNHKYY 1186
Query: 341 KTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNG 400
T + GVL AGMD++CG+ + +H AID G V +D+AL NLF VQ RLG+F
Sbjct: 1187 NTTGATVNGVLSAGMDVDCGSFLSQHLADAIDSGDVTNATVDQALYNLFRVQFRLGMF-- 1244
Query: 401 DPRKGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
DP + + Y L D V T EH++LAL+AARQG+ LL+N LPL+ +++ LA+IGP
Sbjct: 1245 DPAEDQPYLNLTTDAVNTPEHQQLALEAARQGMTLLENRDSRLPLDASSIKQLALIGPNA 1304
Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADF 519
N M G Y G S +G++ YVS G AV AK AD
Sbjct: 1305 NATGVMQGNYNGKAPFLISPQQGVQQYVSNVALELG--------------AVTAAKAADT 1350
Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
V++V GLD TQE+E DR + LPG Q LV VA S P+++V+ GG +D++ +
Sbjct: 1351 VVMVIGLDQTQESEGHDREIIALPGMQAELVAQVANASSSPIVVVVMTGGAVDLTPVKDL 1410
Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSR 638
+ G+AG +ALAE +FGD NPGGRLP T YP +V M D MR +++
Sbjct: 1411 DNV---------GQAGGQALAETLFGDNNPGGRLPYTLYPADLVNQVSMFDDGMRPNATS 1461
Query: 639 QYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRL 698
PGR+YRFYTGT VY +G GLSYT++SY+ T + SL+ +++ + +R
Sbjct: 1462 GNPGRTYRFYTGTPVYAYGTGLSYTSFSYE--------TSTPSLRVSAER-VRAWVAARG 1512
Query: 699 DYVHI-DEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKV-SQGTPEKQLIGFDRV 756
I DEV + ++ ++V N G V G+ VV +F + + G P K L GF+RV
Sbjct: 1513 QTSFIRDEVDA----EDYITVTVQNNGTVAGADVVQVFIKTTTPGADGNPIKSLCGFERV 1568
Query: 757 HTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMV-GELRHSLTI 805
+ I F V P LS+ N G R+ G + V E R S+ I
Sbjct: 1569 FLKPGETTSIQFPVTP-HDLSVVNSRGERVAVPGTWTVEVHHEARLSIPI 1617
>gi|357153280|ref|XP_003576399.1| PREDICTED: probable beta-D-xylosidase 2-like [Brachypodium
distachyon]
Length = 807
Score = 563 bits (1451), Expect = e-157, Method: Compositional matrix adjust.
Identities = 302/775 (38%), Positives = 436/775 (56%), Gaps = 75/775 (9%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
Y +C+ L R + LI +T++EK+ L D A+ PR+G+P Y+WWSE+LHG++S GP
Sbjct: 60 YRYCDAKLPYGDRVRDLIGWMTVEEKVSNLGDWAAGAPRVGLPPYKWWSEALHGLSSTGP 119
Query: 112 GVNFN-----------GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA 160
F+ V + T F V+ SAASFN SLW +IG A++ EARAMYNLG+
Sbjct: 120 TTKFDDLKKPRLHSGRAAVFNGTVFANVINSAASFNESLWRSIGQAISTEARAMYNLGKG 179
Query: 161 GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFRE 220
GLT+W+PNIN+ RDPRWGR ETPGEDP VV YAV FV+ Q DD GF
Sbjct: 180 GLTYWSPNINVVRDPRWGRALETPGEDPFVVGRYAVNFVRGMQ----DVDDAAAGFN--- 232
Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFR 280
G+ R L SACCKH AYD++ W +R+ F+A +TE+D +TFQ PF
Sbjct: 233 -------GDPLSR--PLKTSACCKHYAAYDVDDWYGHTRFKFDARVTERDMVETFQRPFE 283
Query: 281 SCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNY 339
C+ G AS +MCSYN+VNG+PAC L R +WG GYI SDCDAV + + +
Sbjct: 284 MCVRDGDASAVMCSYNRVNGIPACADARLLAGTLRRDWGLHGYIVSDCDAVRVMTDNATW 343
Query: 340 TK-THEDSAAGVLKAGMDINCGTCML------------RHTQSAIDKGKVQEKDIDRALL 386
T +++A LKAG+D++CG + + +A+ +GK++E DID AL+
Sbjct: 344 LGYTPAEASAASLKAGLDLDCGESWIVQKGKPVMDFLSTYGMAAVRQGKMRESDIDNALV 403
Query: 387 NLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNK 446
NL++ +RLG F+G PR Y L D+C+ H+ LALD ARQ +VLLKN LPL+
Sbjct: 404 NLYTTLMRLGYFDGMPR---YESLDEKDICSEAHRSLALDGARQSMVLLKNLDGLLPLDA 460
Query: 447 NAVSSLAIIGPLVNNISQ-MGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDA 505
+ ++S+A+ GP + M G YTG PC Y++
Sbjct: 461 SKLASVAVRGPHAEAPEKVMDGDYTGPPCR----------YITPRE-------------- 496
Query: 506 GFHEAVRIAKK-ADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILV 564
G + V I+++ D I + G+++ E E DR LLLP Q + VA S P++LV
Sbjct: 497 GISKDVNISQQGGDVTIYMGGINMHIEREGNDREDLLLPKNQTEEILRVAAASPSPIVLV 556
Query: 565 LTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT- 623
+ GG +DVSFA++ +I +ILW GYPG G A+A++IFG +NPGGRLP+TW+ +
Sbjct: 557 ILSGGGIDVSFAQSHPKIGAILWAGYPGGEGGHAIADVIFGRYNPGGRLPLTWFKNKYIH 616
Query: 624 KVPMNDMNMRADSSRQYPGRSYRFYTGTQV-YGFGHGLSYTNYSYKFLSAPSELTISASL 682
++PM M +R YPGR+Y+FY G V Y FG+GLSYT + Y+ L+ + +T++
Sbjct: 617 QLPMTSMALRPRPEHGYPGRTYKFYDGPDVLYPFGYGLSYTKFRYELLNKETAVTLAPGR 676
Query: 683 KAGSDKNILQQTGS-RLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKV 741
+ + + +TGS D +D + + +SV NAG DG++ V+++ P
Sbjct: 677 R--HCRQLSYKTGSVGPDCPAVDVASHACAETVSFNVSVVNAGKADGANAVLVYTAPPAE 734
Query: 742 SQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
G P KQ+ F RV A ++ + F ++ C+ I K ++P G ++V
Sbjct: 735 LAGAPIKQVAAFRRVAVKAGAAETVVFTLNVCKAFGIVEKTAYTVVPSGVSTVIV 789
>gi|413954831|gb|AFW87480.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 814
Score = 561 bits (1446), Expect = e-157, Method: Compositional matrix adjust.
Identities = 305/782 (39%), Positives = 436/782 (55%), Gaps = 83/782 (10%)
Query: 48 HFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIA 107
+ ++P+C+ SL + R + LI +T++EK+ L D + PR+G+P Y+WWSE+LHG++
Sbjct: 64 NMSAFPYCDASLPYADRVRDLIGWMTVEEKVGNLGDISHGAPRVGLPPYKWWSEALHGVS 123
Query: 108 SNGPGVNFN------------GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY 155
S GP + F+ TV++ T F V+ SAASFN +LW++IG AV+ EARAMY
Sbjct: 124 STGPTMLFDDLHSKPGNHSGRATVNNATVFANVINSAASFNETLWNSIGQAVSTEARAMY 183
Query: 156 NLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
NLG+ GLT+W+PNIN+ RDPRWGR ETPGEDP V YAV FV+ Q D G
Sbjct: 184 NLGKGGLTYWSPNINVVRDPRWGRALETPGEDPYVAGRYAVNFVRGMQ------DIPGHY 237
Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTF 275
G R +K SACCKH AYD++ W N +R++++A ++E+D +TF
Sbjct: 238 SGDPSARPIK-------------TSACCKHHAAYDVDNWHNQTRFTYDARVSERDMAETF 284
Query: 276 QPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIF 334
PF C+ +G S +MCSYN+VNGVPAC L R EW GYI SDCDAV +
Sbjct: 285 LRPFEMCVREGDVSSVMCSYNRVNGVPACADARLLSGTVRGEWHLNGYIVSDCDAVRVMT 344
Query: 335 EYQNYTK-THEDSAAGVLKAGMDINCGTCML------------RHTQSAIDKGKVQEKDI 381
+ + T +S+A L+AGMD++C + + +A+ +GK++E DI
Sbjct: 345 DNATWLNFTAAESSAVSLRAGMDLDCAESWIEEEGRPLRDYLSEYGMAAVAQGKMRESDI 404
Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF 441
D AL NL+ +RLG F+ PR Y L DVCT EHK LALD ARQGIVLLKND
Sbjct: 405 DNALTNLYMTLMRLGYFDNIPR---YASLNETDVCTDEHKSLALDGARQGIVLLKNDHGL 461
Query: 442 LPLNKNAVSSLAIIGPLVNNISQ-MGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVP 500
LPL+ ++A+ GP + M G YTG PC YV+
Sbjct: 462 LPLDPKKTLAVAVHGPHARAPEKIMDGDYTGPPCR----------YVTPRQ--------- 502
Query: 501 CNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRP 560
G V+I+ KA I + G++L E E DR LLLP Q + A+ S P
Sbjct: 503 -----GISRDVKISHKAKMTIYLGGINLYIEREGNDREDLLLPKNQTEEILHFAQASPTP 557
Query: 561 VILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPE 620
+ILV+ GG +D+SFA+ +I +ILW GYPG G A+A++IFG +NPGGRLP+TW+
Sbjct: 558 IILVILSGGGIDISFAQKHPKIGAILWAGYPGGEGGNAIADVIFGRYNPGGRLPLTWFKN 617
Query: 621 SFT-KVPMNDMNMRADSSRQYPGRSYRFYTGTQV-YGFGHGLSYTNYSYKFLSAPSELTI 678
+ ++PM M R + YPGR+Y+FY G +V Y FG+GLSYT + Y+ ++ +++
Sbjct: 618 KYIEQIPMTSMEFRPVPEKGYPGRTYKFYDGPEVLYPFGYGLSYTKFQYE--TSTDGVSV 675
Query: 679 SASLKAGSDKNILQQ----TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVML 734
S G K + + T V++ + ++ F+V SVTNAG G+HVV++
Sbjct: 676 SLPAPGGHCKGLSYKPSVATVPACQAVNVADHACTETVSFNV--SVTNAGGRGGAHVVLV 733
Query: 735 FARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVL 794
+ P P KQ+ F RV A+ + + F ++ C+ I + ++P G +
Sbjct: 734 YTAPPPEVAEAPIKQVAAFRRVFVAARSTATVPFALNVCKAFGIVERTAYTVVPSGVSKV 793
Query: 795 MV 796
+V
Sbjct: 794 LV 795
>gi|125535311|gb|EAY81859.1| hypothetical protein OsI_37025 [Oryza sativa Indica Group]
Length = 816
Score = 556 bits (1434), Expect = e-155, Method: Compositional matrix adjust.
Identities = 309/786 (39%), Positives = 438/786 (55%), Gaps = 76/786 (9%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
+ +C+ SL + R + LI +T++EK+ L D R+G+PAY WWSE+LHG++S GP
Sbjct: 67 FRYCDASLPYADRVRDLIGRMTVEEKVGALGDWTDGAARIGLPAYRWWSEALHGLSSTGP 126
Query: 112 GVNFN-----------GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA 160
F+ V + T F V+ SAASFN +LW +IG AV+ EARAMYN+G+
Sbjct: 127 TTKFDDLATPHLHSGVSAVYNATVFANVINSAASFNETLWKSIGQAVSTEARAMYNMGKG 186
Query: 161 GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFRE 220
GLT+W+PNIN+ RDPRWGR ETPGEDP VV YAV FV+ Q D G
Sbjct: 187 GLTYWSPNINVVRDPRWGRALETPGEDPYVVGRYAVNFVRGMQ------DIPG------- 233
Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFR 280
+ G+ + R L SACCKH AYDL+ W N +R+ F+A + E+D +TFQ PF
Sbjct: 234 HEAVAAGGDPNTR--PLKTSACCKHYAAYDLDDWHNHTRFEFDARVDERDMVETFQRPFE 291
Query: 281 SCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNY 339
C+ G S +MCSYN+VNG+PAC L Q R +WG GYI SDCDAV + + +
Sbjct: 292 MCVRDGDVSSVMCSYNRVNGIPACADARLLSQTIRRDWGLHGYIVSDCDAVRVMTDNATW 351
Query: 340 TK-THEDSAAGVLKAGMDINCGTCMLRHTQ-------------SAIDKGKVQEKDIDRAL 385
T +++A LKAG+D++CG T+ A++KGK++E DID AL
Sbjct: 352 LGYTGAEASAAALKAGLDLDCGESWKNDTEGHPLMDFLTTYGMEAVNKGKMRESDIDNAL 411
Query: 386 LNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
N + +RLG F+ +Y LG D+CT +HK LALD ARQGIVLLKND K LPL+
Sbjct: 412 TNQYMTLMRLGYFD---DITQYSSLGRQDICTDQHKTLALDGARQGIVLLKNDNKLLPLD 468
Query: 446 KNAVSSLAIIGPLVNNISQ-MGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSD 504
N V + + GP V + M G YTG PC + +G+ Y
Sbjct: 469 ANKVGFVNVRGPHVQAPEKIMDGDYTGPPCRYVTPRQGVSKY------------------ 510
Query: 505 AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILV 564
VR + +A+ I GL+L E E DR +LLP Q + VA+ S P+ILV
Sbjct: 511 ------VRFSHRANTTIYFGGLNLNIEREGNDREDILLPKNQTEEIIRVAKASPNPIILV 564
Query: 565 LTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT- 623
+ GG +DVSFA+ + +I +ILW GYPG G A+A++IFG NP GRLP+TW+ +
Sbjct: 565 ILSGGGIDVSFAQNNPKIGAILWAGYPGGEGGNAIADVIFGKHNPSGRLPLTWFKNKYIY 624
Query: 624 KVPMNDMNMRADSSRQYPGRSYRFYTGTQV-YGFGHGLSYTNYSYKFLSAPSELTISAS- 681
++PM M++R + YPGR+Y+FY G V Y FG+GLSYT + Y+ + + LT+ +
Sbjct: 625 QLPMTSMDLRPVAKHGYPGRTYKFYNGPDVLYPFGYGLSYTKFLYEMGTNGTALTVPVAG 684
Query: 682 --LKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVP 739
K S K+ + ++++ ++ F+V SVTN GD GSH V++F++ P
Sbjct: 685 GHCKKLSYKSGVSSAAPACPAINVNGHACTETVSFNV--SVTNGGDTGGSHPVIVFSKPP 742
Query: 740 KVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGEL 799
P KQ++ F V A + +SF ++ C+ I K ++P G ++V +
Sbjct: 743 AEVDDAPIKQVVAFRSVFVPAWSTVSVSFELNVCKAFGIVEKTAYTVVPSGVSTVLVENV 802
Query: 800 RHSLTI 805
S++
Sbjct: 803 DSSVSF 808
>gi|115486735|ref|NP_001068511.1| Os11g0696400 [Oryza sativa Japonica Group]
gi|77552754|gb|ABA95551.1| Glycosyl hydrolase family 3 C terminal domain containing protein
[Oryza sativa Japonica Group]
gi|113645733|dbj|BAF28874.1| Os11g0696400 [Oryza sativa Japonica Group]
Length = 816
Score = 555 bits (1429), Expect = e-155, Method: Compositional matrix adjust.
Identities = 305/787 (38%), Positives = 436/787 (55%), Gaps = 79/787 (10%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
+ +C+ SL + R + LI +T++EK+ L D R+G+PAY WWSE+LHG++S GP
Sbjct: 68 FRYCDASLPYADRVRDLIGRMTVEEKVGALGDWTDGAARIGLPAYRWWSEALHGLSSTGP 127
Query: 112 GVNFN-----------GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA 160
F+ V + T F V+ SAASFN +LW +IG AV+ EARAMYN+G+
Sbjct: 128 TTKFDDLATPHLHSGVSAVYNATVFANVINSAASFNETLWKSIGQAVSTEARAMYNMGKG 187
Query: 161 GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFRE 220
GLT+W+PNIN+ RDPRWGR ETPGEDP VV YAV FV+ Q
Sbjct: 188 GLTYWSPNINVVRDPRWGRALETPGEDPYVVGRYAVNFVRGMQ-------------DIPG 234
Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFR 280
+ G+ + R L SACCKH AYDL+ W N +R+ F+A + E+D +TFQ PF
Sbjct: 235 HEAVAAGGDPNTR--PLKTSACCKHYAAYDLDDWHNHTRFEFDARVDERDMVETFQRPFE 292
Query: 281 SCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNY 339
C+ G S +MCSYN+VNG+PAC L Q R +WG GYI SDCDAV + + +
Sbjct: 293 MCVRDGDVSSVMCSYNRVNGIPACADARLLSQTIRRDWGLHGYIVSDCDAVRVMTDNATW 352
Query: 340 TK-THEDSAAGVLKAGMDINCGTC-------------MLRHTQSAIDKGKVQEKDIDRAL 385
T +++A LKAG+D++CG + + A++KGK++E DID AL
Sbjct: 353 LGYTGAEASAAALKAGLDLDCGESWKNDTDGHPLMDFLTTYGMEAVNKGKMRESDIDNAL 412
Query: 386 LNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
N + +RLG F+ +Y LG D+CT +HK LALD ARQGIVLLKND K LPL+
Sbjct: 413 TNQYMTLMRLGYFD---DIAQYSSLGRQDICTDQHKTLALDGARQGIVLLKNDNKLLPLD 469
Query: 446 KNAVSSLAIIGPLVNNISQ-MGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSD 504
N V + + GP V + M G YTG PC + +G+ Y
Sbjct: 470 ANKVGFVNVRGPHVQAPEKIMDGDYTGPPCRYVTPRQGVSKY------------------ 511
Query: 505 AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILV 564
VR + +A+ I GL+L E E DR +LLP Q + VA+ S P+ILV
Sbjct: 512 ------VRFSHRANTTIYFGGLNLNIEREGNDREDILLPKNQTEEIIRVAKASPNPIILV 565
Query: 565 LTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT- 623
+ GG +DVSFA+ + +I +ILW GYPG G A+A++IFG NP GRLP+TW+ +
Sbjct: 566 ILSGGGIDVSFAQNNPKIGAILWAGYPGGEGGNAIADVIFGKHNPSGRLPLTWFKNKYIY 625
Query: 624 KVPMNDMNMRADSSRQYPGRSYRFYTGTQV-YGFGHGLSYTNYSYKFLSAPSELTISASL 682
++PM M++R + YPGR+Y+FY G V Y FG+GLSYT + Y+ + + L + +
Sbjct: 626 QLPMTSMDLRPVAKHGYPGRTYKFYDGPDVLYPFGYGLSYTKFLYEMGTNGTALIVPVA- 684
Query: 683 KAGSDKNILQQTG----SRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARV 738
G K + ++G ++++ ++ F+V SVTN GD GSH V++F++
Sbjct: 685 -GGHCKKLSYKSGVSTAPACPAINVNGHVCTETVSFNV--SVTNGGDTGGSHPVIVFSKP 741
Query: 739 PKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
P P KQ++ F V A + +SF ++ C+ I K ++P G ++V
Sbjct: 742 PAEVDDAPMKQVVAFKSVFVPAWSTVSVSFELNVCKAFGIVEKTAYTVVPSGVSTILVEN 801
Query: 799 LRHSLTI 805
+ S++
Sbjct: 802 VDSSVSF 808
>gi|242093144|ref|XP_002437062.1| hypothetical protein SORBIDRAFT_10g020500 [Sorghum bicolor]
gi|241915285|gb|EER88429.1| hypothetical protein SORBIDRAFT_10g020500 [Sorghum bicolor]
Length = 809
Score = 553 bits (1424), Expect = e-154, Method: Compositional matrix adjust.
Identities = 300/804 (37%), Positives = 440/804 (54%), Gaps = 74/804 (9%)
Query: 23 VVVNVIAFSNS-KPVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQL 81
VNV +F + V + F + ++P+C+ SL + R + LI +T++EK+ L
Sbjct: 31 TTVNVTSFGKAYTKVCDADRFAEMGLNMSAFPYCDASLPYADRVRDLIGWMTVEEKVGNL 90
Query: 82 SDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFN------------GTVSSVTSFPQVL 129
D + PR+G+P Y+WWSE+LHG++S GP + F+ TV++ T F V+
Sbjct: 91 GDVSHGAPRVGLPPYKWWSEALHGVSSTGPTMLFDDLHSKPGNHSGRATVNNATVFANVI 150
Query: 130 VSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPM 189
SAASFN +LW +IG AV+ EARAMYNLG+ GLT+W+PNIN+ RDPRWGR ETPGEDP
Sbjct: 151 NSAASFNETLWKSIGQAVSTEARAMYNLGKGGLTYWSPNINVVRDPRWGRALETPGEDPF 210
Query: 190 VVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAY 249
V YAV FV+ Q + DGG G++ + SACCKH AY
Sbjct: 211 VAGRYAVNFVRGMQ--DIPGHDGG--------------GDDPST-RPIKTSACCKHYAAY 253
Query: 250 DLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDL 309
D++ W N +R++F+A ++E+D +TF PF C+ G AS +MCSYN+VNG+PAC L
Sbjct: 254 DVDDWHNHTRFTFDARVSERDMAETFLRPFEMCVRDGDASGVMCSYNRVNGIPACADARL 313
Query: 310 FQKA-RNEWGFKGYITSDCDAVATIFEYQNYTK-THEDSAAGVLKAGMDINCGTCML--- 364
R +W GYI SDCDAV + + + T +S+A ++AG+D++C +
Sbjct: 314 LSGTIRGDWQLHGYIVSDCDAVRVMTDNATWLHFTGAESSAASIRAGLDLDCAESWIEEK 373
Query: 365 ---------RHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDV 415
+ ++A+ +GK++E DID AL N + +RLG F+ PR Y L D+
Sbjct: 374 GRPLRDFLSEYGKAAVAQGKMRESDIDSALRNQYMTLMRLGYFDNIPR---YASLNETDI 430
Query: 416 CTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ-MGGGYTGIPC 474
CT EHK LA D ARQG+VLLKND LPL+ + ++A+ GP + M G YTG PC
Sbjct: 431 CTDEHKSLAHDGARQGMVLLKNDDGLLPLDPEKILAVAVHGPHARAPEKIMDGDYTGPPC 490
Query: 475 SPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETED 534
YV+ G + V+I+ +A+ I + G++L E E
Sbjct: 491 R----------YVTPRQ--------------GISKDVKISHRANTTIYLGGINLHIEREG 526
Query: 535 RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEA 594
DR LLLP Q + A+ S P+ILV+ GG +D+SFA +I +ILW GYPG
Sbjct: 527 NDREDLLLPKNQTEEILHFAKASPNPIILVILSGGGIDISFAHKHPKIGAILWAGYPGGE 586
Query: 595 GAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRFYTGTQV 653
G A+A++IFG +NPGGRLP+TW+ + ++PM M R + YPGR+Y+FY G +V
Sbjct: 587 GGNAIADVIFGRYNPGGRLPLTWFKNKYIQQIPMTSMEFRPVPEKGYPGRTYKFYDGPEV 646
Query: 654 -YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSL 712
Y FG+GLSYT + Y+ + + +T+ A+ + + +D +
Sbjct: 647 LYPFGYGLSYTKFLYETSTNGTAVTLPATGGHCKGLSYKPSVATTPACQAVDVAGHACTE 706
Query: 713 RFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDP 772
ISVTNAG G+HVV+++ P P KQ+ F RV A+ + + F ++
Sbjct: 707 TVSFNISVTNAGGRGGAHVVLVYTAPPPEVAQAPIKQVAAFRRVFVPARSTATVPFTLNV 766
Query: 773 CEQLSIANKHGRRILPLGNHVLMV 796
C+ I + ++P G ++V
Sbjct: 767 CKAFGIVERTAYTVVPSGVSKVLV 790
>gi|340370204|ref|XP_003383636.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
queenslandica]
Length = 755
Score = 550 bits (1417), Expect = e-153, Method: Compositional matrix adjust.
Identities = 313/774 (40%), Positives = 449/774 (58%), Gaps = 73/774 (9%)
Query: 36 VLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPA 95
V ++ D+ C+ P F++Y +CN S SI+ R K L+S LT+ EK+ Q + NASAI RL IPA
Sbjct: 41 VASRADY-CEKPPFNAYLYCNYSASITERVKDLLSRLTVLEKMSQTATNASAIERLDIPA 99
Query: 96 YEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY 155
Y+WWSE LHG+A + PGV F ++S TSFPQV+ A+FN SL +G ++ EARA
Sbjct: 100 YDWWSECLHGLAQS-PGVFFENDLTSATSFPQVIGLGATFNMSLVLAMGQVISTEARAFA 158
Query: 156 NLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
N GQ+GLTF+APNINI+RDPRWGRGQETPGEDP + S YA FVK GI
Sbjct: 159 NNGQSGLTFFAPNINIYRDPRWGRGQETPGEDPYLTSQYAANFVK------------GIQ 206
Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTF 275
G ++R LK A CKH AY+LE++ + R +FNA++++QD E+T+
Sbjct: 207 EGSEDRRYLKAI-------------ATCKHYAAYNLERYLDVRRVNFNAIVSDQDLEETY 253
Query: 276 QPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIF 334
P F++C+++G+ IMCSYN +NGVP C + K AR+ WGF+GYI SDC A+ I
Sbjct: 254 LPAFKACVQEGQVGSIMCSYNAINGVPNCANDFINNKIARDTWGFEGYIVSDCGAILDIQ 313
Query: 335 EYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
NYT + A LK G D+NCG ++ + A D + E+DID++L LF+ ++R
Sbjct: 314 YKHNYTSDTNITVADALKGGCDLNCGHFYEKYMEDAFDNSTITEEDIDKSLTRLFTSRMR 373
Query: 395 LGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
LG+F+ P + + DV T E + LAL+AAR+GIVLL+N LPL+ S++A
Sbjct: 374 LGMFD-PPEIQPFRQYSVKDVNTPEAQDLALNAAREGIVLLQNKGSVLPLDIVKHSNIAA 432
Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIA 514
IGP + M G Y GI S L+G Y GC V CN GF +AV+
Sbjct: 433 IGPNADATHIMQGNYHGIAPYLISPLQGFSNLGINATYQIGC-PVACNDTEGFPDAVKAV 491
Query: 515 KKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKR--PVILVLTGGGPLD 572
+ D VI V GL+ TQE E DR S+ LPG Q L+ + + + + P+I+V+ GG +D
Sbjct: 492 QGVDAVIAVIGLNNTQEGESHDRTSIALPGHQEDLLLELKKNAAKGTPLIVVVMSGGSVD 551
Query: 573 VSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMN 631
++ + + +ILW GYPG++G +A+AE+I+G NP GRLP+T+YP S+ ++P +M+
Sbjct: 552 LTGVKDIAD--AILWAGYPGQSGGQAIAEVIYGKVNPSGRLPVTFYPASYINEIPYTNMS 609
Query: 632 MRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNIL 691
MR PGRSY+FYTGT V+ FG GLSYT + K+ K+
Sbjct: 610 MRVP-----PGRSYKFYTGTPVFPFGFGLSYTTFEIKW------------------KDTS 646
Query: 692 QQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLF--ARVPKVSQGTPEKQ 749
L H DEV + + +VTN+G GS V+ F + VP G P K+
Sbjct: 647 TAKDYYLKTTH-DEVV-------NYEATVTNSGSRPGSVSVLAFITSSVP----GAPMKE 694
Query: 750 LIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSL 803
L F +++ S ++SF +P + + + +G R + G + +++G+ H +
Sbjct: 695 LFAFKKIYLEPTESVDVSFVAEP-KVFTTVDIYGIRKIRPGAYKIIIGDDDHHI 747
>gi|326431595|gb|EGD77165.1| beta-glucosidase [Salpingoeca sp. ATCC 50818]
Length = 900
Score = 545 bits (1404), Expect = e-152, Method: Compositional matrix adjust.
Identities = 313/755 (41%), Positives = 436/755 (57%), Gaps = 53/755 (7%)
Query: 51 SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
S PFCNT+LS R + LIS + + L ++A+ + L +PAY+WWSE+LHG+ +
Sbjct: 181 SLPFCNTALSYDDRIRDLISRINDSDLPGLLVNSATGVEHLNLPAYQWWSEALHGVG-HS 239
Query: 111 PGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNIN 170
PGV+F G V + TSFPQV+ + A+FN++L+ IG+ ++ EARAM N+ +AG TFWAPNIN
Sbjct: 240 PGVHFGGDVPAATSFPQVIHTGATFNKTLYRKIGTVISTEARAMNNVQRAGNTFWAPNIN 299
Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
I RDPRWGRGQETPGEDP YA FV FQ GE+
Sbjct: 300 IIRDPRWGRGQETPGEDPFATGEYAANFVSGFQD-----------------------GED 336
Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
+ + S+CCKH Y+LE W R+ +NA+ T+QD DT+ P F +C+ G+AS
Sbjct: 337 MNY---IKASSCCKHFFDYNLENWHGVDRHHYNAIATDQDIADTYLPSFEACVRYGRASG 393
Query: 291 IMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
+MCSYN VNGVP+C GD+ AR WGF GYITSDC AVA + +T+ ++
Sbjct: 394 LMCSYNAVNGVPSCANGDIMTVMARESWGFDGYITSDCGAVADVLNSHKFTRNTSETIRA 453
Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDP-RKGKYG 408
VL+AGMD +CG+ + ++ A+ +G V + ++ AL LF VQ RLGLF DP K Y
Sbjct: 454 VLEAGMDTDCGSFVQQYLAKAMQEGVVPRELVNTALHRLFMVQFRLGLF--DPVSKQPYT 511
Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
V T +++LAL+AA+QGIVLLKN LPL +A+IGP + + M G
Sbjct: 512 NYSVARVNTPANQQLALEAAQQGIVLLKNTNARLPLKTGL--HVALIGPNADATTVMQGN 569
Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDL 528
Y G S +RG + Y + YA GC DV C +GF AV AK+AD V+VV GLD
Sbjct: 570 YQGTAPFLISPVRGFKNYSAAVTYAKGC-DVACKDTSGFDAAVAAAKEADAVVVVVGLDQ 628
Query: 529 TQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWI 588
QE+E DR S+ LPG Q LV VA +K P+++ + GG +D+S +A+ ++ ILW
Sbjct: 629 GQESEGHDRTSITLPGHQEDLVAQVAAAAKSPIVVFVMTGGAVDLSTIKANKNVAGILWC 688
Query: 589 GYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV-PMNDMNMRADSSRQYPGRSYRF 647
GYPG++G +A+A+++FG +PGGRLP T YP S+ M D MR + + PGR+YRF
Sbjct: 689 GYPGQSGGQAMADVVFGAVSPGGRLPYTIYPGSYVDACSMLDNGMRPNKTSGNPGRTYRF 748
Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHID--E 705
YTG VY +G GLSYT++SY + T A+++ +Q ++ D E
Sbjct: 749 YTGKPVYEYGTGLSYTSFSYHIHYLNTMDTSLATVQT-----YVQDAKQNHKFIRYDAPE 803
Query: 706 VTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ--GTPEKQLIGFDRVHTVAKGS 763
T V+++VTN G V G+ VV +F PK G P K LIGF+RV
Sbjct: 804 FT-------RVEVNVTNVGRVAGADVVQVFVE-PKTPAELGAPIKTLIGFERVFLNPGQW 855
Query: 764 KEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
+ F V+ L+ + G+R+ G ++ +G
Sbjct: 856 TIVQFSVN-AHDLTFVDASGKRVARAGEWLVHIGH 889
>gi|326513064|dbj|BAK03439.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 694
Score = 536 bits (1382), Expect = e-149, Method: Compositional matrix adjust.
Identities = 281/642 (43%), Positives = 394/642 (61%), Gaps = 35/642 (5%)
Query: 167 PNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKG 226
P + P + ETPGEDP++ S YAV +V Q + GG+ G
Sbjct: 83 PALGRLGIPAYEWWSETPGEDPLLASKYAVGYVTGLQ----DAGAGGVTDG--------- 129
Query: 227 FGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQG 286
L ++ACCKH AYD++ W RY+F+A +++QD +DTFQPPF+SC+ G
Sbjct: 130 ---------ALKVAACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVLDG 180
Query: 287 KASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHED 345
+ +MCSYN+VNG P C DL + R +W GYI SDCD+V ++ Q+YTKT E+
Sbjct: 181 NVASVMCSYNKVNGKPTCADKDLLEGVIRGDWKLNGYIVSDCDSVDVLYTQQHYTKTPEE 240
Query: 346 SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG 405
+AA +K+G+D+NCG + +HT +A+ G++ E+D+DRA+ N F + +RLG F+GDPR+
Sbjct: 241 AAAITIKSGLDLNCGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLGFFDGDPRQL 300
Query: 406 KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQM 465
+G LGP DVCTS +++LA + ARQGIVLLKN LPL+ ++ S+A+IGP N M
Sbjct: 301 AFGSLGPKDVCTSSNRELARETARQGIVLLKNSGA-LPLSAKSIKSMAVIGPNANASFTM 359
Query: 466 GGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDA-GFHEAVRIAKKADFVIVVA 524
G Y G PC + L+GL A V+ T Y GC +V C+ ++ AV A AD ++V
Sbjct: 360 IGNYEGTPCKYTTPLQGLGAKVN-TVYQPGCTNVGCSGNSLQLSTAVAAAASADVTVLVV 418
Query: 525 GLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISS 584
G D + E E DR SLLLPGQQ LV++VA S PVILV+ GGP D+SFA+A +I++
Sbjct: 419 GADQSIERESLDRTSLLLPGQQTQLVSAVANASSGPVILVVMSGGPFDISFAKASDKIAA 478
Query: 585 ILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK-VPMNDMNMRADSSRQYPGR 643
ILW+GYPGEAG ALA+I+FG NP GRLP+TWYP S+ V M DM MR D+S YPGR
Sbjct: 479 ILWVGYPGEAGGAALADILFGSHNPSGRLPVTWYPASYADTVTMTDMRMRPDTSTGYPGR 538
Query: 644 SYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHI 703
+YRFYTG V+ FG GLSYT S+ +SAP S++ D + + ++
Sbjct: 539 TYRFYTGDTVFAFGDGLSYTKMSHSLVSAPPSYV---SMRLAEDHPCRAEECASVEAAG- 594
Query: 704 DEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGS 763
C L F V++ V NAG+V G+H V+LF+ P + P K L+GF++V +
Sbjct: 595 ---DHCDDLAFDVKLQVRNAGEVAGAHSVLLFSS-PPPAHNAPAKHLLGFEKVSLAPGEA 650
Query: 764 KEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
++F VD C LS+ ++ G R + LG H L VG+L+H++ +
Sbjct: 651 GTVAFRVDVCRDLSVVDELGGRKVALGGHTLHVGDLKHTVEL 692
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 31/65 (47%), Positives = 40/65 (61%), Gaps = 2/65 (3%)
Query: 40 PDFPCKPPH--FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYE 97
P F C + +Y FCN + S RA+ L+S LTL EK+ L + A+ RLGIPAYE
Sbjct: 35 PVFACDASNATLAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPALGRLGIPAYE 94
Query: 98 WWSES 102
WWSE+
Sbjct: 95 WWSET 99
>gi|147857580|emb|CAN78858.1| hypothetical protein VITISV_030325 [Vitis vinifera]
Length = 699
Score = 533 bits (1372), Expect = e-148, Method: Compositional matrix adjust.
Identities = 284/662 (42%), Positives = 390/662 (58%), Gaps = 102/662 (15%)
Query: 147 VAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGEN 206
V+ EARAMYN+G AGLTFW+PN+NIF+DPRWGRGQETPGEDP++ S YA +V+ Q
Sbjct: 137 VSTEARAMYNVGLAGLTFWSPNVNIFQDPRWGRGQETPGEDPLLSSKYASGYVRGLQ--- 193
Query: 207 WKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMI 266
+SDDG D L ++ACCKH AYDL+ W + FNA++
Sbjct: 194 -QSDDGS--------------------PDRLKVAACCKHYTAYDLDNWKGVDCFHFNAVV 232
Query: 267 TEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSD 326
T QD +DTFQPPF+SC+ G + ++ YI SD
Sbjct: 233 TNQDMDDTFQPPFKSCVIDGNVASVI-----------------------------YIVSD 263
Query: 327 CDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALL 386
CD+V + Q+YTKT E++AA + AG+D+NCG+ + +HT++A+ G V E +D+A+
Sbjct: 264 CDSVDVFYNSQHYTKTPEEAAAKAILAGLDLNCGSFLGQHTEAAVKGGLVDESAVDKAVS 323
Query: 387 NLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNK 446
N F+ +RLG F+G+P K YGKLGP DVCTSEH++ A +A RQGIV
Sbjct: 324 NNFATLMRLGFFDGNPSKAIYGKLGPKDVCTSEHQERAREAPRQGIV------------- 370
Query: 447 NAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAG 506
+ G PC + L+GL A V+ T Y GC +V C + A
Sbjct: 371 ----------------------FAGTPCKYTTPLQGLTALVATT-YLPGCSNVACGT-AQ 406
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
EA +IA AD +++ G+D + E E RDRV++ LPGQQ L+T VA+ SK VILV+
Sbjct: 407 IDEAKKIAAAADATVLIVGIDQSIEAEGRDRVNIQLPGQQPLLITEVAKXSKGNVILVVM 466
Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KV 625
GG D+SFA+ D +I+SI W+GYPGEAG A+A++IFG +NP G+LPMTWYP+S+ KV
Sbjct: 467 SGGGFDISFAKNDDKITSIQWVGYPGEAGGAAIADVIFGFYNPSGKLPMTWYPQSYVDKV 526
Query: 626 PMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAG 685
PM +MNMR D + YPGR+YRFYTG +Y FG GLSYT +++ + AP ++I
Sbjct: 527 PMTNMNMRPDPASGYPGRTYRFYTGETIYTFGDGLSYTQFNHHLVQAPKSVSIPIEEAHS 586
Query: 686 SDKNILQQTGSRLDYVHIDEVT-SCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQG 744
+ + +D V SC +L F + + V NAG++ GSH V LF+ P V
Sbjct: 587 CHSSKCKS---------VDAVQESCQNLAFDIHLRVNNAGNISGSHTVFLFSSPPSV-HN 636
Query: 745 TPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLT 804
+P+K L+GF++V AK + F VD C+ LSI ++ G R + LG HVL VG L+HSL
Sbjct: 637 SPQKHLLGFEKVFVTAKAKALVRFKVDVCKDLSIVDELGTRKVALGLHVLHVGNLKHSLN 696
Query: 805 IE 806
+
Sbjct: 697 VR 698
>gi|301110280|ref|XP_002904220.1| beta-D-xylosidase, putative [Phytophthora infestans T30-4]
gi|262096346|gb|EEY54398.1| beta-D-xylosidase, putative [Phytophthora infestans T30-4]
Length = 709
Score = 525 bits (1352), Expect = e-146, Method: Compositional matrix adjust.
Identities = 287/753 (38%), Positives = 421/753 (55%), Gaps = 74/753 (9%)
Query: 60 SISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTV 119
+ + R+ ++ + L + + L + A+ P + IP+YEWW+E+LHG+A + PGV F G++
Sbjct: 3 ATTPRSLHCLTRIPLDQAVGLLVNKAAPAPSVNIPSYEWWNEALHGVALS-PGVTFKGSI 61
Query: 120 SSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGR 179
++ TSFPQVL +AASFNRSL+ I ++ EARA +N AGLTFW PN+NIFRDPRWGR
Sbjct: 62 TAATSFPQVLSTAASFNRSLFYQIADVISTEARAFHNAKDAGLTFWTPNVNIFRDPRWGR 121
Query: 180 GQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELML 239
GQETPGEDP + YAV FV+ QGE + G E + L +
Sbjct: 122 GQETPGEDPYLTGEYAVAFVRGLQGEGME-------------------GREVENSKFLKI 162
Query: 240 SACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVN 299
S+CCKH AY E R+ NAM+T+QD DT+ P F C+++G S IMCSYN VN
Sbjct: 163 SSCCKHFSAYSQE----VPRHRNNAMVTKQDQADTYFPAFEDCVKRGHVSSIMCSYNAVN 218
Query: 300 GVPACL-RGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDIN 358
G+P+C +G L R +W F GYI SDC+AVA + ++ +YT++ E + A L AGMD+N
Sbjct: 219 GIPSCADKGLLTDLVRGQWKFDGYIASDCEAVADVIDHHHYTQSPEQTCATTLDAGMDLN 278
Query: 359 CGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTS 418
CG + +H A+++G V + I AL N F V +RLG+F + + + D V T+
Sbjct: 279 CGEFLRQHLPKALEQGIVTTEMIHNALKNQFRVLMRLGMFE---KVEPFANITKDSVDTT 335
Query: 419 EHKKLALDAARQGIVLLKNDKKFLPLNKNAVS---SLAIIGPLVNNISQMGGGYTGIPCS 475
H++LAL+AARQ IVLLKND LPL + SLA+IGP N + + G Y GIP
Sbjct: 336 MHRQLALEAARQSIVLLKNDGNTLPLATKDFTRDRSLALIGPHFNASAALLGNYFGIPSH 395
Query: 476 PKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR 535
+ L G+ +V ++ GC V F +A+ +AKKAD +IV GLD +QE E+
Sbjct: 396 IVTPLEGISQFVPNVAHSLGC-KVSGEVLPDFDDAIAVAKKADRLIVFVGLDQSQEREEI 454
Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
DR + LP Q +L+ V + P++ V+ GG +D+S + ++ +I++ GY G+AG
Sbjct: 455 DRYHIGLPAFQSTLLKRVLEVASHPIVFVVISGGCVDLSAYKNHPKVGAIVFGGYLGQAG 514
Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTK-VPMNDMNMRADSSRQYPGRSYRFYTGTQVY 654
+ALA+++FG +NP G+LP T+Y + + + DM+MR GR+YRF+TG VY
Sbjct: 515 GQALADVLFGKYNPSGKLPQTFYDSEYVNAMSIYDMHMRPTPVTGNSGRTYRFFTGVPVY 574
Query: 655 GFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRF 714
FG GLSYT + KN C +
Sbjct: 575 EFGFGLSYTTFH---------------------KN-------------------CHACVA 594
Query: 715 HVQISVTNAGDVDGSHVVMLFARVPKVSQ-GTPEKQLIGFDRVHTVAKGSKEISFGVDPC 773
I+VTNAG + G V++ + P + G P K L+ F+R +A G + +
Sbjct: 595 TFNITVTNAGAISGEDVILTYVEPPLAGEGGRPLKSLVAFERTPLIAAGQRATAKICLEA 654
Query: 774 EQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
+ ++AN+ G ++ GN + V L+H + I+
Sbjct: 655 KAFALANEAGNWVVEPGNWTIHVDTLQHKVDIQ 687
>gi|340377241|ref|XP_003387138.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
queenslandica]
Length = 733
Score = 519 bits (1337), Expect = e-144, Method: Compositional matrix adjust.
Identities = 308/782 (39%), Positives = 434/782 (55%), Gaps = 91/782 (11%)
Query: 44 CKPPHFDSYP-----FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEW 98
C P + + P +CN LS R K L+S LTL+EKI QL ++ASAI RLGIP Y+W
Sbjct: 22 CTEPPYKTRPNFTPAYCNYRLSFKDRVKDLLSRLTLEEKISQLGNSASAIDRLGIPGYQW 81
Query: 99 WSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
WSE LHG+A + PG++ G ++ TSFPQ++ +A+SFN+SL+ IG AV+ EAR + G
Sbjct: 82 WSEGLHGVAVS-PGLHLGGNLTCTTSFPQIITTASSFNKSLFYEIGEAVSTEARGFADNG 140
Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
Q GLT++ PNINI RDPRWGRGQET GEDP + S YAV V+ QG + +
Sbjct: 141 QGGLTYFTPNINIVRDPRWGRGQETAGEDPYLTSQYAVNLVRGAQGNDSEY--------- 191
Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFS-RYSFNAMITEQDTEDTFQP 277
K+++ A CKH AYDLE + N R SFNA +T+QD E+T+ P
Sbjct: 192 --KKII----------------ATCKHFAAYDLESYINGDVRDSFNAEVTKQDLEETYFP 233
Query: 278 PFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFEY 336
FRSC+ G IMCSYN VNGVP+C+ G K ARN+W F GY+ SDC A+ +
Sbjct: 234 AFRSCVTAGGVGSIMCSYNSVNGVPSCVDGVFNNKIARNKWKFDGYLVSDCGAIDDVMNK 293
Query: 337 QNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
+YT T D+ A LK G D+NCG+ H A G + E DIDRA+ LF+ ++RLG
Sbjct: 294 HHYTSTPTDTVAAGLKGGTDLNCGSFYQTHAMDAFLNGSITEVDIDRAVGRLFTARMRLG 353
Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIG 456
LF+ P+ Y D V T +H+ LAL AAR+ IVLL+N+ K LPL+ +A++G
Sbjct: 354 LFDL-PKYQPYSYFNTDVVNTKQHQDLALQAARESIVLLQNNGK-LPLSYEDHHKIAVVG 411
Query: 457 P-LVNNISQMGGGYTGIPCSPKSLLRGLEAYVSK---THYASGCHDVPCNSDAGFHEAVR 512
P ++ N++ G P L+ ++ + SK Y+ GC DV C GFH+A +
Sbjct: 412 PNILANVTMQGISQVIAP----YLISPVDGFKSKGLHVTYSLGC-DVKCIVTDGFHDAFK 466
Query: 513 IAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVART-----SKRPVILVLTG 567
+ K A V+ V GLD E E DR + LPG Q + + T S P+I+V+
Sbjct: 467 LVKDAKAVVAVMGLDQGIERETVDREDIFLPGLQDKFLLGLRDTLTNLQSPVPLIVVIMS 526
Query: 568 GGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM 627
G +D+S E+ S +ILW+GYPG++G +A+AE+I+G+ NP GRLP+T+YP + +
Sbjct: 527 GSSVDLS--ESKSLADAILWVGYPGQSGGQAIAEVIYGEVNPSGRLPLTFYPGEYIDL-- 582
Query: 628 NDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKF---LSAPSELTISASLKA 684
+ R S R+ PGR+YRFYT V+ FGHGLSYT + + ++ +E+ IS S+
Sbjct: 583 --VAYRHMSMREPPGRTYRFYTENPVFPFGHGLSYTTFELSWTNKMNNVTEIVISDSVD- 639
Query: 685 GSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQG 744
++ID I+V N G + G+ V + V
Sbjct: 640 ----------------INID-----------FDITVVNTGYLSGA--VSVLGYVSSNIPD 670
Query: 745 TPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLT 804
P ++L FD+V SK+IS ++ K R ILP G + + + L H +
Sbjct: 671 APLRELFDFDKVFIDKYESKKISLFATNDAFTTVDEKGRRNILP-GEYDIAIENLSHKII 729
Query: 805 IE 806
I+
Sbjct: 730 IK 731
>gi|223945397|gb|ACN26782.1| unknown [Zea mays]
Length = 516
Score = 518 bits (1335), Expect = e-144, Method: Compositional matrix adjust.
Identities = 258/514 (50%), Positives = 345/514 (67%), Gaps = 14/514 (2%)
Query: 292 MCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGV 350
MCSYN+VNGVP C +L AR +WGF GYITSDCDAVA I + Q Y KT ED+ A V
Sbjct: 1 MCSYNRVNGVPTCADYNLLSTTARQDWGFYGYITSDCDAVAIIHDAQGYAKTAEDAVADV 60
Query: 351 LKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKL 410
LKAGMD+NCG+ + H SA+ +GK+ E+DI+RAL NLF+V++RLGLFNGDPR+ YG +
Sbjct: 61 LKAGMDVNCGSYVQDHGASALQQGKITEQDINRALHNLFAVRMRLGLFNGDPRRNLYGDI 120
Query: 411 GPDDVCTSEHKKLALDAARQGIVLLKND--KKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
GPD VCT EH+ LAL+AA+ GIVLLKND LPL+K V+SLA+IG N+ ++ G
Sbjct: 121 GPDQVCTQEHQDLALEAAQDGIVLLKNDGGAGALPLSKPNVASLAVIGFNANDAIRLRGN 180
Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDL 528
Y G PC + L+ L+ YV T + +GC+ CN EAV+ A AD V++ GLD
Sbjct: 181 YFGPPCVTVTPLQVLQGYVKDTSFVAGCNSAACNVTT-IPEAVQAASSADSVVLFMGLDQ 239
Query: 529 TQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWI 588
QE E+ DR+ L LPGQQ +L+ SVA +K+PVILVL GGP+DVSFA+ + +I +ILW
Sbjct: 240 DQEREEVDRLDLTLPGQQQTLIESVANAAKKPVILVLLCGGPVDVSFAKTNPKIGAILWA 299
Query: 589 GYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFY 648
GYPGEAG A+A+++FG+ NPGGRLP+TWYP+ FT+VPM DM MRAD + YPGR+YRFY
Sbjct: 300 GYPGEAGGIAIAQVLFGEHNPGGRLPVTWYPQDFTRVPMTDMRMRADPATGYPGRTYRFY 359
Query: 649 TGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTS 708
G V+ FG+GLSY+ YS++F + P + A LKA ++ T + ++ + S
Sbjct: 360 RGPTVFNFGYGLSYSKYSHRFATKPPPTSNVAGLKA------VEATAGGMASYDVEAIGS 413
Query: 709 --CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT--PEKQLIGFDRVHTVAKGSK 764
C L+F + V N G +DG H V++F R P + G+ P QLIGF +H A +
Sbjct: 414 ETCDRLKFPAVVRVQNHGPMDGKHSVLVFMRWPNATDGSGRPASQLIGFQSLHLRATQTA 473
Query: 765 EISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
+ F V PC+ S A + GR+++ G+H +MVGE
Sbjct: 474 HVEFEVSPCKHFSRATEDGRKVIDQGSHFVMVGE 507
>gi|340370208|ref|XP_003383638.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
queenslandica]
Length = 732
Score = 516 bits (1329), Expect = e-143, Method: Compositional matrix adjust.
Identities = 297/779 (38%), Positives = 431/779 (55%), Gaps = 88/779 (11%)
Query: 44 CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESL 103
C+ F S+ +CN SL IS R K L+S +TL EKI QL + A +I RL IPAY+WWSE L
Sbjct: 24 CEKTKFQSFSYCNYSLPISDRVKDLLSRMTLAEKITQLGNTAGSIDRLDIPAYQWWSEGL 83
Query: 104 HGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLT 163
HG+A + PGV+FNG + TSFPQV+ +A+SFN++L+ I + ++ EARA N G+
Sbjct: 84 HGVA-DSPGVHFNGMFHNATSFPQVITTASSFNKTLYHEIAAVMSTEARAFAN---QGIV 139
Query: 164 FWAPNINIF--------RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
++ + + RDPRWGR QETPGEDP + S YA++FV QG++
Sbjct: 140 YFKQHQQLLSNYLLFYCRDPRWGRAQETPGEDPYLNSQYAIQFVTGAQGDS--------- 190
Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGN-FSRYSFNAMITEQDTEDT 274
L + CKH YDLE + + +R+SFNA IT QD E+T
Sbjct: 191 -------------------KYLKVVTTCKHFAGYDLEDYVDGETRHSFNAKITPQDFEET 231
Query: 275 FQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATI 333
+ P F++C+E+ + IMCSYN+VNGVP+C G + K AR+ WGF G+I SDC A+ I
Sbjct: 232 YYPAFKACVEEANVASIMCSYNEVNGVPSCADGQINNKLARDTWGFDGFIASDCGAIDDI 291
Query: 334 FEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQL 393
+YT +D+ A LK G D+NCG+ H QSA G + +I+ AL LF+ ++
Sbjct: 292 QNKHHYTNNTDDTVAAALKGGCDLNCGSYYQSHAQSAFLNGTITIGEINLALTRLFTARM 351
Query: 394 RLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
+LG+F+ P Y + PD V + EH+ LAL+AAR+ IVLL+N+ LPLN S++A
Sbjct: 352 KLGMFD-PPELQPYNAISPDVVNSLEHQALALNAARESIVLLQNNNDVLPLNFEKHSTIA 410
Query: 454 IIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAY-VSKTHYASGCHDVPCNSDAGFHEAVR 512
++GP M G Y G+ S + G E + ASGC DV C GF +A
Sbjct: 411 VVGPHAMATDVMQGNYNGVAPYLISPVEGFENLGIDSVLTASGC-DVNCEVTDGFQDAFD 469
Query: 513 IAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSK-----RPVILVLTG 567
IA KAD VI V GLD + E+E DR L LP Q V + T K P+I+V+
Sbjct: 470 IAVKADAVIAVLGLDQSHESEGHDREDLFLPNLQDKFVQDLKNTLKAAGTNAPLIVVVMS 529
Query: 568 GGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM 627
G +D++ + + +ILW GYPG++G +A+AEII+G NP GRLP+T+YP S+ +
Sbjct: 530 GSSVDLTVTKKHAD--AILWAGYPGQSGGQAIAEIIYGKVNPSGRLPVTFYPGSYIDL-- 585
Query: 628 NDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSD 687
+ R S R+YPGR+Y+FY T + FG GLSYT + Y S P
Sbjct: 586 --VAFRHMSMREYPGRTYKFYNDTPDFSFGDGLSYTTF-YLEWSKP-------------- 628
Query: 688 KNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPE 747
V++ V S + ++VTN G + G+ + + A + + G P+
Sbjct: 629 -------------VNMSGVRSVSYPTVVYNVTVTNTGKMPGA--ISVLAYISYNNSGAPK 673
Query: 748 KQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG-ELRHSLTI 805
K+L GF++V S ++F D + S +K G+R + G++ + +G +L H +++
Sbjct: 674 KKLFGFEKVFLNPLQSVSVTFPAD-SKAFSTVDKSGKRSVNPGDYHVTIGDQLIHKISL 731
>gi|300121549|emb|CBK22068.2| unnamed protein product [Blastocystis hominis]
Length = 690
Score = 504 bits (1298), Expect = e-140, Method: Compositional matrix adjust.
Identities = 285/740 (38%), Positives = 410/740 (55%), Gaps = 77/740 (10%)
Query: 64 RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
RA++L++ LTL EK+ + AS + RL IP Y+WWSE+LHG+A++ PGV F T
Sbjct: 4 RARALVAELTLAEKMSLMGHTASEVKRLNIPKYQWWSEALHGVAAS-PGVVFQEPTPFAT 62
Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQET 183
+FPQV ++A SF++ L+ +I S ++ EAR M N +A LT+W+PN+N++RDPRWGRGQET
Sbjct: 63 AFPQVALTAQSFDKPLFHDIASIISTEARVMNNAERANLTYWSPNVNVYRDPRWGRGQET 122
Query: 184 PGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACC 243
PGEDP +V+ YAVEFV+ Q E + L +SACC
Sbjct: 123 PGEDPFLVATYAVEFVRGLQ--------------------------EGEDPRYLKVSACC 156
Query: 244 KHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPA 303
KH AYDLE W R+ F+A+++++D DTFQ PF C+++G S +MCSYN +NG+PA
Sbjct: 157 KHYSAYDLENWHGVERFEFDAIVSDRDMTDTFQVPFEQCVKKGHVSSLMCSYNAINGIPA 216
Query: 304 CL-RGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTC 362
C R L+ AR WGF+GYITSDC A+ TI +YT + +A ++A D++CG
Sbjct: 217 CADRELLYGTARGGWGFEGYITSDCGAIDTIIYNHHYTNDTDTTAMLGVRATCDLDCGGF 276
Query: 363 MLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-YGKLGPDDVCTSEHK 421
+H +++ G+++E ++D AL NLF VQ+RLGLF DP + + Y G D + T EH+
Sbjct: 277 YQQHILHSVESGRLKEAEVDDALANLFKVQMRLGLF--DPVEQQVYTHYGLDKLNTKEHQ 334
Query: 422 KLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLR 481
+AL AAR+GI LLKN FLPL+ + ++GP + M G Y GIP ++ +
Sbjct: 335 AMALRAAREGIALLKNQNDFLPLSLKD-KHVVVMGPYAEDAGVMLGNYNGIPEFIVTVAQ 393
Query: 482 GLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLL 541
GL + C V EA+ + D ++V GL+ E E DR LL
Sbjct: 394 GLR---------NVCDHVDVVKSL---EALSKLEGVDLIVVTVGLNQEIEREGLDREDLL 441
Query: 542 LPGQQMSLVTSVARTSKRPVIL-VLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALA 600
LP Q +L+ + + PV+L +L+GGG +D+S E + + +L +GY G G +A+A
Sbjct: 442 LPASQRALLDGLLAQTDVPVVLTLLSGGGSVDISAYEQNEHVVGVLAVGYGGMFGGQAIA 501
Query: 601 EIIFGDFNPGGRLPMT-WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHG 659
E+I GD NP GRL T +Y + T + DMNMR +PGR+YRF+ G ++ FG G
Sbjct: 502 EVIVGDVNPSGRLVNTMYYNDYVTNLDYFDMNMRPKEETGFPGRTYRFFAGPVIHPFGFG 561
Query: 660 LSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQIS 719
LSYT +++ I Q RL ++L V +
Sbjct: 562 LSYTTFAHAV-------------------EIGQMRNHRLR----------SALAIDVYVK 592
Query: 720 VTNAGDVDGSHVVMLFARVPKV-SQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
VTN G G V+LF + P QG P K L F RV ++A G + V EQL +
Sbjct: 593 VTNTGSRQGDESVLLFVKSPLAGKQGYPLKSLADFSRV-SLAPGETQTVHFVLGEEQLHL 651
Query: 779 ANKHGRRILPLGNHVLMVGE 798
AN+ + +L G + V E
Sbjct: 652 ANEQAKYVLLRGEWKVEVEE 671
>gi|320170454|gb|EFW47353.1| beta-xylosidase [Capsaspora owczarzaki ATCC 30864]
Length = 779
Score = 503 bits (1294), Expect = e-139, Method: Compositional matrix adjust.
Identities = 300/788 (38%), Positives = 426/788 (54%), Gaps = 80/788 (10%)
Query: 44 CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESL 103
C+ + PFCN +L+ RA L+ LTLQEKI Q A + RLG+ AYEWWSE+L
Sbjct: 27 CEDAALRNLPFCNPNLAWEQRADDLVGRLTLQEKISQFGTTAPGVARLGVNAYEWWSEAL 86
Query: 104 HGIASNGPGVNFNGTVSSVTSFPQVLVS--------AASFNRSLWSNIGSAVAVEARAMY 155
HG+A + PGVNF G T FPQ++ + A+FN + + ++ EARA
Sbjct: 87 HGVAES-PGVNFTGNTPVSTCFPQIIGNNCSSLSRVGATFNLDSVAAMAQVISTEARAFA 145
Query: 156 NLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQ-GENWKSDDGGI 214
N G AGLT++ PNINIFRDPRWGRGQETPGEDP + S Y V++ Q GE
Sbjct: 146 NAGHAGLTYFTPNINIFRDPRWGRGQETPGEDPYLTSRYVETLVQNLQNGE--------- 196
Query: 215 GFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDT 274
+ R LK + A CKH AYD+E WG R+ FNA++++QD +T
Sbjct: 197 -----DARYLK-------------VVATCKHYTAYDMEDWGGIDRFHFNAVVSDQDLVET 238
Query: 275 FQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF---QKARNEWGFKGYITSDCDAVA 331
F PPF +C+ GK + +MCSYN VNG+P+C D F + AR +WGF GYI SDC A+
Sbjct: 239 FMPPFEACVRVGKGASLMCSYNAVNGIPSC--ADDFINNEIAREQWGFDGYIVSDCGAID 296
Query: 332 TIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSV 391
I NYT T + + A ++ G D++CG H AI + E D+D +L LF
Sbjct: 297 CIQYTHNYTNTTQATCAAGIQGGCDLDCGDFYQSHLMDAIGNATLHEADLDFSLRRLFGH 356
Query: 392 QLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSS 451
++RLG F+ + Y ++ + + EH++LAL AR+ IVLL ND LP + V
Sbjct: 357 RIRLGEFDAASIQ-PYRQIPVSAINSQEHQELALQIARESIVLLGNDNNTLPFSLATVRK 415
Query: 452 LAIIGPLVNNISQMGGGYTG---IPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFH 508
LAIIGP ++ + G Y G +P + L+ +S T + GC DV +GF
Sbjct: 416 LAIIGPNADDAETLLGNYYGDAPYLITPLKGFQQLDPTLSIT-FVKGC-DVNSTDTSGFV 473
Query: 509 EAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGG 568
A AK AD IVV GL+ T E+E+ DR +L+LPG Q L+ ++ ++ PVILV+ G
Sbjct: 474 AAAAAAKAADATIVVVGLNQTVESENLDRTTLVLPGVQAELILALTAAARGPVILVVMSG 533
Query: 569 GPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPM 627
P+D+S + + LWIGYPG+AG +ALAE +FG F+P GRLP T YP + ++PM
Sbjct: 534 SPIDLS--NVIHPVRAALWIGYPGQAGGRALAEAVFGVFSPAGRLPFTVYPADYVNQLPM 591
Query: 628 NDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSD 687
+M+MRA PGR+YRFYTGT ++ FGHGLSY+ + Y + + + S+S + +
Sbjct: 592 TNMDMRAG-----PGRTYRFYTGTPLFEFGHGLSYSTFQYTWSN-----SSSSSSSSATS 641
Query: 688 KNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVS----- 742
++ L ++ S R VQ N G + VV+ FA S
Sbjct: 642 QHSLSTAALAAQHLAARAPVEAVSFRVLVQ----NTGKMASDDVVLAFASFNASSIIDQS 697
Query: 743 ----QGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILP------LGNH 792
P + L+GF R+H S+EI F V + + + + ++P G+
Sbjct: 698 SSQFASPPIRSLVGFRRIHLAPGASQEIFFAVTSSQLAQVDSTGAQTLVPSRLQVAFGSD 757
Query: 793 VLMVGELR 800
+V E++
Sbjct: 758 ARLVAEIQ 765
>gi|389748500|gb|EIM89677.1| glycoside hydrolase family 3 protein [Stereum hirsutum FP-91666
SS1]
Length = 770
Score = 487 bits (1254), Expect = e-135, Method: Compositional matrix adjust.
Identities = 291/750 (38%), Positives = 417/750 (55%), Gaps = 57/750 (7%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
CNTS + RAK+L++ +TL+E + + + +PRLG+P YEWWSE+LHG+AS+ PGV
Sbjct: 36 CNTSANFLDRAKALVNAMTLEEMVNNTVNTSPGVPRLGLPPYEWWSEALHGVASS-PGVT 94
Query: 115 F--NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
F +G S TSFP+ ++ +A+F+ L ++ S ++ EARA N +GL F+ PNIN F
Sbjct: 95 FETSGDFSGATSFPEPILMSAAFDDDLIFSVASTISTEARAFGNTNHSGLDFFTPNINPF 154
Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
+DPRWGRGQETPGEDP+ S Y + + QG G+G K +
Sbjct: 155 KDPRWGRGQETPGEDPLHTSRYVYQLITGLQG--------GVGPSPYYKII--------- 197
Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
A CKH AYDLE W +R +FNA+++ QD + + P F+SC+ K +M
Sbjct: 198 --------ADCKHFAAYDLENWEGNNRMAFNAIVSTQDLAEFYTPSFQSCVRDAKVGSVM 249
Query: 293 CSYNQVNGVPACLRGDLFQ---KARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
CSYN VNGVPAC L Q + E G +ITSDCDAV IF+ NYT T +++A
Sbjct: 250 CSYNAVNGVPACGSPYLLQDLVRDYFELGNDTWITSDCDAVGNIFDPHNYTTTLTNASAV 309
Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG-KYG 408
L AG D++CGT A+ +G V + D++RAL+ L+ +RLG F DP Y
Sbjct: 310 ALLAGTDVDCGTSYSETLGEAVSEGLVSKSDVERALVRLYGSLVRLGYF--DPEDSVPYR 367
Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
LG DV T + LA AA +GIVLLKND LPL+ N VS +A+IGP N +QM G
Sbjct: 368 ALGASDVNTPAAQTLAYTAAVEGIVLLKND-GLLPLSSN-VSHIALIGPWANATTQMQGN 425
Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDL 528
Y GI S L G + + +G + NS +GF +A+ +A AD ++ + G+D
Sbjct: 426 YEGIAPLLISPLDGFTSAGFNVSFTNGTT-ISGNSTSGFADALSMASAADVIVYIGGIDD 484
Query: 529 TQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWI 588
T E E +DR S+ PG Q+ L+ + K P +++ GGG +D + +A+S ++++LW
Sbjct: 485 TVEAEGQDRTSITWPGNQLELIGELGAFGK-PFVVIQMGGGQVDDTELKANSSVNALLWG 543
Query: 589 GYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRF 647
GYPG+AG KALA+II G P GRL T YP S+ +V M DM++R +S PGR+Y++
Sbjct: 544 GYPGQAGGKALADIITGVQAPAGRLTTTQYPASYVDQVAMTDMSVRPSNSTGSPGRTYKW 603
Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVT 707
YTGT V+ FG GL YT + ++ +E + +AS + S H+D
Sbjct: 604 YTGTPVFEFGFGLHYTTFDVEW----AEGSPAASYSI--QDLVASANSSSSAVAHVDSAI 657
Query: 708 SCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEIS 767
T F VQ VTN G+V +V +LF+ P ++L+ + RV K I+
Sbjct: 658 LDT---FTVQ--VTNTGNVTSDYVALLFSNTTAGPSPAPLQELVSYARV-------KGIT 705
Query: 768 FGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
GV L++ R+ GN ++ G
Sbjct: 706 PGVSATASLNVTLGTIARVDEDGNSIIYPG 735
>gi|326488213|dbj|BAJ89945.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 525
Score = 483 bits (1242), Expect = e-133, Method: Compositional matrix adjust.
Identities = 248/515 (48%), Positives = 333/515 (64%), Gaps = 28/515 (5%)
Query: 40 PDFPCKPPH--FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYE 97
P F C + +Y FCN + S RA+ L+S LTL EK+ L + A+ RLGIPAYE
Sbjct: 35 PVFACDASNATLAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPALGRLGIPAYE 94
Query: 98 WWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL 157
WWSE+LHG++ GPG F+ V TSFPQ +++AASFN SL+ IG V+ EARAM+N+
Sbjct: 95 WWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVSTEARAMHNV 154
Query: 158 GQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFG 217
G AGLTFW+PNINIFRDPRWGRGQETPGEDP++ S YAV +V Q + GG+ G
Sbjct: 155 GLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLTSKYAVGYVTGLQ----DAGAGGVTDG 210
Query: 218 FREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQP 277
L ++ACCKH AYD++ W RY+F+A +++QD +DTFQP
Sbjct: 211 ------------------ALKVAACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQP 252
Query: 278 PFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEY 336
PF+SC+ G + +MCSYN+VNG P C DL + R +W GYI SDCD+V ++
Sbjct: 253 PFKSCVLDGNVASVMCSYNKVNGKPTCADKDLLEGVIRGDWKLNGYIVSDCDSVDVLYTQ 312
Query: 337 QNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
Q+YTKT E++AA +K+G+D+NCG + +HT +A+ G++ E+D+DRA+ N F + +RLG
Sbjct: 313 QHYTKTPEEAAAITIKSGLDLNCGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLG 372
Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIG 456
F+GDPR+ +G LGP DVCTS +++LA + ARQGIVLLKN LPL+ ++ S+A+IG
Sbjct: 373 FFDGDPRQLAFGSLGPKDVCTSSNRELARETARQGIVLLKNSGA-LPLSAKSIKSMAVIG 431
Query: 457 PLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDA-GFHEAVRIAK 515
P N M G Y G PC + L+GL A V+ T Y GC +V C+ ++ AV A
Sbjct: 432 PNANASFTMIGNYEGTPCKYTTPLQGLGAKVN-TVYQPGCTNVGCSGNSLQLSTAVAAAA 490
Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLV 550
AD ++V G D + E E DR SLLLPGQQ LV
Sbjct: 491 SADVTVLVVGADQSIERESLDRTSLLLPGQQTQLV 525
>gi|336365124|gb|EGN93476.1| glycoside hydrolase family 3 protein [Serpula lacrymans var.
lacrymans S7.3]
Length = 732
Score = 481 bits (1239), Expect = e-133, Method: Compositional matrix adjust.
Identities = 289/764 (37%), Positives = 427/764 (55%), Gaps = 63/764 (8%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
C+TSL +RA +++ L T+ E I + +PRLG+P Y+WWSE LHG+A + PGVN
Sbjct: 22 CDTSLDPISRATAVVDLFTIDELINNTVSTSPGVPRLGLPPYQWWSEGLHGVA-DSPGVN 80
Query: 115 FN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
F+ G S TSFPQ ++ A+F+ L ++G+ V +E R+ N G+AGL FW PNIN F
Sbjct: 81 FSASGEFSYATSFPQPIIMGAAFDDELIKSVGAIVGMEGRSFNNYGRAGLDFWTPNINPF 140
Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
+DPRWGRGQETPGEDP ++ Y V+ QG G K +
Sbjct: 141 KDPRWGRGQETPGEDPYHLAQYVYNLVQGLQG------------GLDPKPYYQ------- 181
Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
+ + CKH AYDLE W RY F+A++T QD + + P F+SC K M
Sbjct: 182 ------VISTCKHFAAYDLEDWDGNYRYGFDAIVTTQDLSEYYLPSFQSCYRDAKVGAAM 235
Query: 293 CSYNQVNGVPACLRGDLFQKA-RNEWGFKG--YITSDCDAVATIFEYQNYTKTHEDSAAG 349
CSYN VNG+P+C L Q R+ WGF ++TSDCDAV I++ NYTKT E++ A
Sbjct: 236 CSYNAVNGIPSCANTYLLQSILRDFWGFAEDRWVTSDCDAVDNIYDPHNYTKTPEEAVAD 295
Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-YG 408
LKAG DI+CGT + A ++ + E ++ +AL+ ++ +RLG F DP + Y
Sbjct: 296 ALKAGTDIDCGTFYSEYLPGAYNQSLITETELRQALIRQYASLVRLGYF--DPTDIQPYR 353
Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
+ ++V T + ++LA AA +GIVLLKND LPL+ + + ++A+IGP N +M G
Sbjct: 354 QYNWNNVDTPQAQQLAYQAAAEGIVLLKNDGT-LPLSSD-IKNIALIGPWGNATGEMQGN 411
Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDL 528
Y G+ S L G A Y G + + N +GF A+ A+ AD VI G+D
Sbjct: 412 YYGVAPYLISPLMGAVATGYNVTYVFGTN-ITSNDTSGFAAAIAAAQGADVVIYAGGIDE 470
Query: 529 TQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWI 588
T E+E DR + PG Q+ LV +A K P+++V GGG +D + +A+S ++++LW
Sbjct: 471 TVESEGNDRNYITWPGNQLDLVGELAAVGK-PLVVVQFGGGQVDDTSLKANSTVNALLWA 529
Query: 589 GYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRF 647
GYPG++G AL +II G P GRLP+T YP + ++PM DM++R +++ PGR+Y++
Sbjct: 530 GYPGQSGGSALFDIISGKVAPAGRLPVTQYPADYVYEIPMTDMDLRPNATS--PGRTYKW 587
Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNI--LQQTGSRLDYVHIDE 705
YTGT +Y FG+GL YT +SYK+ APS S NI L Q+G+ Y+ +
Sbjct: 588 YTGTPIYDFGYGLHYTTFSYKWAKAPS-----------STYNIQTLVQSGNLYSYLDLAP 636
Query: 706 VTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG-SK 764
+ T ++VTN G+V +LF P K LI + R+H +A G +
Sbjct: 637 FDTFT-------VNVTNTGNVTSDFASLLFVNGTYGPSPYPNKSLITYARLHDIASGDTA 689
Query: 765 EISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIETY 808
++ GV ++ A+ +G L G + + + L +S+ T+
Sbjct: 690 SVALGVT-LGSIARADTYGNMWLYPGTYQVTLDTLGNSVGANTF 732
>gi|255572557|ref|XP_002527212.1| beta-glucosidase, putative [Ricinus communis]
gi|223533388|gb|EEF35138.1| beta-glucosidase, putative [Ricinus communis]
Length = 349
Score = 480 bits (1236), Expect = e-132, Method: Compositional matrix adjust.
Identities = 239/334 (71%), Positives = 276/334 (82%), Gaps = 4/334 (1%)
Query: 23 VVVNVIAFSNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLS 82
+V+ ++ SNS+ +K +PC+PP +SY FCN SLS+ TRA SLISLLTL+EKI+QLS
Sbjct: 13 IVLFELSVSNSES--HKLQYPCQPPLHNSYTFCNQSLSVPTRAHSLISLLTLEEKIKQLS 70
Query: 83 DNASAIPRLGIPAYEWWSESLHGIASNGPGVNFN-GTVSSVTSFPQVLVSAASFNRSLWS 141
DNAS IPR GIP YEWWSESLHGIA NGPGV+F G VS+ T FPQV++SAA+FNR+LW
Sbjct: 71 DNASGIPRFGIPPYEWWSESLHGIAINGPGVSFTIGPVSAATGFPQVIISAAAFNRTLWF 130
Query: 142 NIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKS 201
IGSA+A+EARAM+N+GQ+GLTFWAPN+NIFRDPRWGRGQETPGEDPM+ SAYA+EFVK
Sbjct: 131 LIGSAIAIEARAMHNVGQSGLTFWAPNVNIFRDPRWGRGQETPGEDPMLTSAYAIEFVKG 190
Query: 202 FQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYS 261
FQG NWKS G G G R K + D D LMLSACCKHL AYDLEKWGNFSRYS
Sbjct: 191 FQGGNWKSGVSGSGSG-RYGFGEKRMLRDDDGDDGLMLSACCKHLTAYDLEKWGNFSRYS 249
Query: 262 FNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKG 321
FNA++TEQD EDT+QPPFRSCIE+GKASC+MCSYN+VNGVPAC R DL QKAR EWGF+G
Sbjct: 250 FNAVVTEQDLEDTYQPPFRSCIEEGKASCLMCSYNEVNGVPACAREDLLQKAREEWGFEG 309
Query: 322 YITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGM 355
YI SDCDAVATIFEYQNY+K+ ED+ A LKAGM
Sbjct: 310 YIVSDCDAVATIFEYQNYSKSAEDAVAIALKAGM 343
>gi|396473219|ref|XP_003839293.1| similar to beta-1,4-xylosidase [Leptosphaeria maculans JN3]
gi|312215862|emb|CBX95814.1| similar to beta-1,4-xylosidase [Leptosphaeria maculans JN3]
Length = 789
Score = 480 bits (1235), Expect = e-132, Method: Compositional matrix adjust.
Identities = 299/794 (37%), Positives = 416/794 (52%), Gaps = 76/794 (9%)
Query: 17 RVLQ-LIVVVNVIAFSNSKPVLNKPDFP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTL 74
R+LQ LI V V+A FP C+ + CNTS S RAKSL++L TL
Sbjct: 6 RLLQPLIWVTGVLAV-----------FPDCQNGPLQTETICNTSASPLDRAKSLVTLYTL 54
Query: 75 QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGT---VSSVTSFPQVLVS 131
+EKI S + +PRLGIP Y+WWSE LHGIA GP NF+ + S TSFPQ ++
Sbjct: 55 EEKINATSSGSPGVPRLGIPPYQWWSEGLHGIA--GPYTNFSTSGIEYSYSTSFPQPILM 112
Query: 132 AASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVV 191
A+F+ L +++ ++ EARA N + GL FW PNIN FRDPRWGRGQETPGED +
Sbjct: 113 GAAFDDHLITDVAKVISTEARAFNNANRTGLDFWTPNINPFRDPRWGRGQETPGEDAFHL 172
Query: 192 SAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDL 251
S+Y + QGE KRV+ A CKH YD+
Sbjct: 173 SSYVKALIAGLQGETTDP----------YKRVV----------------ATCKHFAGYDI 206
Query: 252 EKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQ 311
E W RY F+A I++QD + + PF++C+ Q MCSYN VNGVP C L Q
Sbjct: 207 EDWNGNLRYQFDAQISQQDLVEYYLQPFQACV-QANVGAFMCSYNAVNGVPTCADPYLLQ 265
Query: 312 KA-RNEWGF---KGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHT 367
R WG+ + ++TSDCDAV I+ ++ T E + A L AG D++CGT M H
Sbjct: 266 TILREHWGWTNEEQWVTSDCDAVQNIYLPHQWSATREQAVADALIAGTDLDCGTYMQEHL 325
Query: 368 QSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDA 427
A +G V E +D+AL+ +S +RLG F+ D Y + G D V T + LA A
Sbjct: 326 PGAFAQGLVNENVLDQALVRQYSSLVRLGWFD-DAADQPYRQFGWDSVATDASQALARRA 384
Query: 428 ARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV 487
A +GIVLLKND LPL+ ++ SL + G N SQ+ G Y G+P S L L+
Sbjct: 385 AVEGIVLLKNDG-VLPLSIDSSVSLGVFGDWANATSQLLGNYAGVPTYLHSPLWALQQEN 443
Query: 488 SKTHYASGC----HDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLP 543
+YA G D N + A+ +D +I + G+D + E E DR SL
Sbjct: 444 LTINYAGGNPGGQGDPTTNRWSSLSGAI---ATSDILIYIGGIDNSIEEEGHDRTSLAWT 500
Query: 544 GQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEII 603
G Q+ ++ +A T K P I+V+ GGG +D + ++ IS+ILW GYPG+ G A+ +I+
Sbjct: 501 GAQLDVIFQLAATGK-PTIVVVMGGGQIDSAPLANNANISAILWAGYPGQDGGPAIVDIL 559
Query: 604 FGDFNPGGRLPMTWYPESFTK-VPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
G P GRLP T YP S+T VPM DM +R S PGR+Y++Y GT Y FGHGL Y
Sbjct: 560 TGKSPPAGRLPQTQYPASYTSLVPMTDMGLR--PSENNPGRTYKWYNGTATYEFGHGLHY 617
Query: 663 TNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTN 722
TN+S +++P + + + + KN T R + +D ISVTN
Sbjct: 618 TNFS-ATVTSPMQQSYRIADLMSTCKNATSITLERCAFTSVD-------------ISVTN 663
Query: 723 AGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
G V +V + + P+K L+G+ R+ +A G+ + + E L+ ++
Sbjct: 664 TGAVASDYVTLCYISGSHGPAPHPKKSLVGYQRLFGIAAGASDTARIDLTLESLARVDEV 723
Query: 783 GRRILPLGNHVLMV 796
G ++L G + LMV
Sbjct: 724 GNKVLYPGEYSLMV 737
>gi|449531013|ref|XP_004172482.1| PREDICTED: beta-D-xylosidase 1-like, partial [Cucumis sativus]
Length = 534
Score = 479 bits (1234), Expect = e-132, Method: Compositional matrix adjust.
Identities = 242/538 (44%), Positives = 348/538 (64%), Gaps = 16/538 (2%)
Query: 272 EDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAV 330
EDT+ PF++C+ +GK + +MCSYNQVNG P C DL + R WG GYI SDCD+V
Sbjct: 2 EDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDLLKNTIRGAWGLDGYIVSDCDSV 61
Query: 331 ATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFS 390
+++ Q++T T E++AA +KAG+D++CG + HT +A+ +G ++E D++ AL NL S
Sbjct: 62 GVLYDSQHFTPTPEEAAASTIKAGLDLDCGPFLAVHTATAVGRGLLKEVDLNNALANLLS 121
Query: 391 VQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVS 450
VQ+RLG+F+G+P YG LGP DVCT HK LAL+AARQGIVLL+N LPL+
Sbjct: 122 VQMRLGMFDGEPAAQPYGNLGPKDVCTPAHKHLALEAARQGIVLLQNRAGALPLSPTRHR 181
Query: 451 SLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEA 510
++A+IGP + M G Y G+ C + ++G+ YV KT +A GC +V C D EA
Sbjct: 182 TVAVIGPNSDATVTMIGNYAGVACEYTTPVQGISKYV-KTIHAKGCANVACVGDQLIGEA 240
Query: 511 VRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGP 570
A+ AD +VV GLD + E E RDR +LLPG+Q LV + K P ++VL GGP
Sbjct: 241 EAAARVADAAVVVVGLDQSIEAESRDRNGVLLPGKQEELVRRIGLACKGPTVVVLMSGGP 300
Query: 571 LDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMND 629
+DVSFA+ D +IS ILW+GYPG+AG A+A+++FG NPGG+LPMTWYP+S+ KVPM +
Sbjct: 301 IDVSFAKNDGKISGILWVGYPGQAGGAAIADVLFGATNPGGKLPMTWYPQSYLAKVPMTN 360
Query: 630 MNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKN 689
M +R D S YPGR+YRFY G V+ FG GLSY+ +S F AP+++++ S + +
Sbjct: 361 MGLRPDPSTGYPGRTYRFYKGPVVFPFGFGLSYSKFSQSFAEAPTKISLPLSSLSPNSSA 420
Query: 690 ILQQTGSRLDYVHIDEVTSCTSLR-FHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEK 748
++ + H D C S+ + I V N G VDGSH +++F+ VP + +PEK
Sbjct: 421 TVKVS-------HTD----CASVSDLPIMIDVKNTGTVDGSHTILVFSTVPNQTW-SPEK 468
Query: 749 QLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
LIGF++VH +A K + G+ C+ LS ++ G R +P+G H L +G+L HS++++
Sbjct: 469 HLIGFEKVHLIAGSQKRVRIGIHVCDHLSRVDEFGTRRIPMGEHKLHIGDLTHSISLQ 526
>gi|336377735|gb|EGO18896.1| glycoside hydrolase family 3 protein [Serpula lacrymans var.
lacrymans S7.9]
Length = 766
Score = 478 bits (1231), Expect = e-132, Method: Compositional matrix adjust.
Identities = 287/755 (38%), Positives = 422/755 (55%), Gaps = 63/755 (8%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
C+TSL +RA +++ L T+ E I + +PRLG+P Y+WWSE LHG+A + PGVN
Sbjct: 37 CDTSLDPISRATAVVDLFTIDELINNTVSTSPGVPRLGLPPYQWWSEGLHGVA-DSPGVN 95
Query: 115 FN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
F+ G S TSFPQ ++ A+F+ L ++G+ V +E R+ N G+AGL FW PNIN F
Sbjct: 96 FSASGEFSYATSFPQPIIMGAAFDDELIKSVGAIVGMEGRSFNNYGRAGLDFWTPNINPF 155
Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
+DPRWGRGQETPGEDP ++ Y V+ QG G K +
Sbjct: 156 KDPRWGRGQETPGEDPYHLAQYVYNLVQGLQG------------GLDPKPYYQ------- 196
Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
+ + CKH AYDLE W RY F+A++T QD + + P F+SC K M
Sbjct: 197 ------VISTCKHFAAYDLEDWDGNYRYGFDAIVTTQDLSEYYLPSFQSCYRDAKVGAAM 250
Query: 293 CSYNQVNGVPACLRGDLFQKA-RNEWGFKG--YITSDCDAVATIFEYQNYTKTHEDSAAG 349
CSYN VNG+P+C L Q R+ WGF ++TSDCDAV I++ NYTKT E++ A
Sbjct: 251 CSYNAVNGIPSCANTYLLQSILRDFWGFAEDRWVTSDCDAVDNIYDPHNYTKTPEEAVAD 310
Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-YG 408
LKAG DI+CGT + A ++ + E ++ +AL+ ++ +RLG F DP + Y
Sbjct: 311 ALKAGTDIDCGTFYSEYLPGAYNQSLITETELRQALIRQYASLVRLGYF--DPTDIQPYR 368
Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
+ ++V T + ++LA AA +GIVLLKND LPL+ + + ++A+IGP N +M G
Sbjct: 369 QYNWNNVDTPQAQQLAYQAAAEGIVLLKNDGT-LPLSSD-IKNIALIGPWGNATGEMQGN 426
Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDL 528
Y G+ S L G A Y G + + N +GF A+ A+ AD VI G+D
Sbjct: 427 YYGVAPYLISPLMGAVATGYNVTYVFGTN-ITSNDTSGFAAAIAAAQGADVVIYAGGIDE 485
Query: 529 TQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWI 588
T E+E DR + PG Q+ LV +A K P+++V GGG +D + +A+S ++++LW
Sbjct: 486 TVESEGNDRNYITWPGNQLDLVGELAAVGK-PLVVVQFGGGQVDDTSLKANSTVNALLWA 544
Query: 589 GYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRF 647
GYPG++G AL +II G P GRLP+T YP + ++PM DM++R +++ PGR+Y++
Sbjct: 545 GYPGQSGGSALFDIISGKVAPAGRLPVTQYPADYVYEIPMTDMDLRPNATS--PGRTYKW 602
Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNI--LQQTGSRLDYVHIDE 705
YTGT +Y FG+GL YT +SYK+ APS S NI L Q+G+ Y+ +
Sbjct: 603 YTGTPIYDFGYGLHYTTFSYKWAKAPS-----------STYNIQTLVQSGNLYSYLDLAP 651
Query: 706 VTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG-SK 764
+ T ++VTN G+V +LF P K LI + R+H +A G +
Sbjct: 652 FDTFT-------VNVTNTGNVTSDFASLLFVNGTYGPSPYPNKSLITYARLHDIASGDTA 704
Query: 765 EISFGVDPCEQLSIANKHGRRILPLGNHVLMVGEL 799
++ GV ++ A+ +G L G + + + L
Sbjct: 705 SVALGVT-LGSIARADTYGNMWLYPGTYQVTLDTL 738
>gi|393247584|gb|EJD55091.1| beta-xylosidase [Auricularia delicata TFB-10046 SS5]
Length = 763
Score = 478 bits (1229), Expect = e-132, Method: Compositional matrix adjust.
Identities = 272/760 (35%), Positives = 405/760 (53%), Gaps = 58/760 (7%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
CNT+ + RAK+LI T +E + + + +PRLG+P Y+WWSE+LHG+A PGV+
Sbjct: 37 CNTTANFMDRAKALIDEFTTEELVNNTVNGSPGVPRLGLPPYQWWSEALHGVAGANPGVH 96
Query: 115 F---NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
F TSFPQ ++ A+F+ L + + ++ EARA N G +G+ F+ PNIN
Sbjct: 97 FAPAGEDFDHATSFPQPILMGAAFDDELIHEVATVISTEARAFNNFGFSGIDFFTPNINP 156
Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
FRDPRWGRGQETPGEDP+ +S Y + V + QG G+G K V
Sbjct: 157 FRDPRWGRGQETPGEDPLHISRYVFQLVTALQG--------GLGPSPYYKIV-------- 200
Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
A CKH YDLE W R+ F+A+IT QD + + P F+SC+ K +
Sbjct: 201 ---------ADCKHFAGYDLESWEGIDRFHFDAVITTQDLAEFYTPSFQSCVRDAKVGSV 251
Query: 292 MCSYNQVNGVPACLRGDLFQK-ARNEWGF-KGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
MCSYN VNGVPAC L Q R+ +G G+ITSDCDAV +F N+T T +++A
Sbjct: 252 MCSYNSVNGVPACASSYLLQDIVRDFYGLGDGWITSDCDAVQNVFTTHNFTTTQANASAI 311
Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGK 409
LKAG D++CG + A+D+G V+E D+ +AL+ L+ +R G F+ P + + +
Sbjct: 312 SLKAGTDVDCGNVYAQSLGDALDQGLVEEDDLKQALVRLYGSLVRTGYFD-SPEEQPFRQ 370
Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY 469
LG DV T ++LAL AA +GIVLLKND LPL+ V ++ ++GP N + M G Y
Sbjct: 371 LGWADVDTPASRRLALLAAEEGIVLLKND-GLLPLSSRDVPNVIMVGPWGNATTMMQGNY 429
Query: 470 TGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLT 529
G S +G + +G +GF EAV A D ++ V G D
Sbjct: 430 FGNAPYLVSPRQGFVDAGFNVTFFNGTVGTNGTDTSGFDEAVAAAGDTDLIVFVGGPDNV 489
Query: 530 QETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIG 589
E E RDR+++ PG Q+ L+ +A K P+I++ G G +D ++ + I++++W G
Sbjct: 490 VERESRDRINITWPGVQLDLIKELAGVGK-PMIVLQMGAGQVDDTWLKESDAINALIWGG 548
Query: 590 YPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYT 649
YPG++G ALA I+ G P RLP+T YPE + +PM DMN+R +S PGR+Y+++T
Sbjct: 549 YPGQSGGTALANIVTGKTAPAARLPITQYPEDYISLPMTDMNVRPSNSS--PGRTYKWFT 606
Query: 650 GTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSC 709
G ++ FG GL Y+ + + + P AS G +++ S +D
Sbjct: 607 GEPIFEFGFGLHYSKFDFAWAEEP-----PASFAIG---DLVANASSPVDLA-------- 650
Query: 710 TSLRFHV-QISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
FH Q++VTN G V V MLF P K+L+G+ R+ + G+ +
Sbjct: 651 ---TFHTFQVNVTNLGPVASDFVAMLFGNTTAGPSPAPLKELVGYTRLTNIPVGATVTAS 707
Query: 769 GVDPCEQLSIANKHGRRILPLGNHVLMV---GELRHSLTI 805
++ A++ G +L G + + + GE+ H +
Sbjct: 708 VPVTLGTIARADEDGNSVLFPGQYSVWLDTTGEILHDFEL 747
>gi|395334835|gb|EJF67211.1| beta-xylosidase [Dichomitus squalens LYAD-421 SS1]
Length = 774
Score = 477 bits (1228), Expect = e-131, Method: Compositional matrix adjust.
Identities = 291/761 (38%), Positives = 412/761 (54%), Gaps = 51/761 (6%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
C+TS TRA +LI L T +E + + +PRLG+PAY WWSE LHG+A + PGV
Sbjct: 41 CDTSKDPITRATALIDLWTDEELTNNTVNASPGVPRLGLPAYNWWSEGLHGVAQS-PGVT 99
Query: 115 F--NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
F +G S TSFPQ ++ A+F+ L + S V+ E RA N+G+AGL +W PNIN F
Sbjct: 100 FAPSGNFSYATSFPQPILMGAAFDDPLIQAVASVVSTEGRAFNNVGRAGLDYWTPNINPF 159
Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
+DPRWGRGQETPGEDP + Y + QG G K
Sbjct: 160 KDPRWGRGQETPGEDPFHLQGYVYNLILGLQG------------GLDPTPYFK------- 200
Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
+ A CKH AYD++ W RY FNA++T+QD + + P F++C+ K + +M
Sbjct: 201 ------VVADCKHFAAYDMDNWEGNVRYGFNAVVTQQDLSEYYLPSFQTCVRDAKVASVM 254
Query: 293 CSYNQVNGVPACLRGDLFQKA-RNEWGFKG--YITSDCDAVATIFEYQNYTKTHEDSAAG 349
CSYN VNG+P+C L Q R+ WGF ++TSDCDAV I+ NYT +AA
Sbjct: 255 CSYNAVNGIPSCANSFLLQDILRDYWGFDDTRWVTSDCDAVQNIYTPHNYTDNPAQAAAD 314
Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGK 409
L AG DI+CGT + A+ +G V D+ RA + ++ +RLG F+ P Y +
Sbjct: 315 ALLAGTDIDCGTFSSTYLPDALSQGLVNATDLKRAAIRQYASLVRLGYFD-PPESQPYRQ 373
Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY 469
LG DV T E ++LA AA +G+VLLKND LPL+K+ V LA+IGP N + M G Y
Sbjct: 374 LGWSDVNTPEAQQLAHTAAVEGMVLLKNDGT-LPLSKH-VRKLALIGPWANATTLMQGNY 431
Query: 470 TGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLT 529
GI S L G + Y G + N +GF AV AK+AD VI GLD T
Sbjct: 432 AGIAPYLISPLLGAQQAGFDVEYVFGTNVTTTNDTSGFAAAVAAAKRADAVIFAGGLDET 491
Query: 530 QETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIG 589
E E+ DR+++ PG Q+ LV +A K P+I+ GGG LD S ++ +++I+W G
Sbjct: 492 VEREEVDRLNVTWPGNQLDLVAELASVGK-PLIVAQFGGGQLDDSALKSKRSVNAIIWGG 550
Query: 590 YPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRFY 648
YPG++G AL +I+ G P GRLP+T YP + +VPM DM +R ++ PGR+Y++Y
Sbjct: 551 YPGQSGGTALFDILTGKAAPAGRLPITQYPAEYANQVPMTDMTLRPSATN--PGRTYKWY 608
Query: 649 TGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTS 708
TGT V+ FG GL YT +S+ + S T +AS + ++ + LD +D
Sbjct: 609 TGTPVFEFGFGLHYTTFSFAWASNAHANTPAASYSIDALMASGNKSAAFLDLAPLDTFA- 667
Query: 709 CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
+ VTN G + +V +LFA P KQL+ + RVH VA I+
Sbjct: 668 ---------VRVTNTGKMTSDYVALLFASGTFGPAPHPNKQLVAYTRVHGVAPKQSTIAE 718
Query: 769 GVDPCEQLSIANKHGRRILPLGNHVLMVG---ELRHSLTIE 806
++ A++ G + + G + L + +L+H+ T+E
Sbjct: 719 LTVTLGAIARADESGAKWVYPGTYTLALDTTEQLKHTFTLE 759
>gi|409079878|gb|EKM80239.1| hypothetical protein AGABI1DRAFT_120267 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 786
Score = 477 bits (1227), Expect = e-131, Method: Compositional matrix adjust.
Identities = 289/751 (38%), Positives = 407/751 (54%), Gaps = 66/751 (8%)
Query: 22 IVVVNVIAFSNSKPVLNKPDFP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQ 80
I + V+ F P FP C S P C+++ +TRA+SLI + T E IQ
Sbjct: 8 IFFIAVVIF----PAHGTFSFPDCTNGPLKSTPVCDSAKDPATRAQSLIQMFTDDELIQN 63
Query: 81 LSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF--NGTVSSVTSFPQVLVSAASFNRS 138
+ + +PRLG+P YEWWSE+LHG+ + PGV F +G SS TSFPQ +V A+F+
Sbjct: 64 GDNASPGVPRLGLPPYEWWSEALHGVG-HSPGVVFAPSGDFSSATSFPQPIVIGAAFDDD 122
Query: 139 LWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEF 198
L + + V+ EARA N G+AGL ++ PNIN F+DPRWGRGQETPGEDP +S Y
Sbjct: 123 LVKAVANVVSTEARAFNNFGRAGLNYFTPNINPFKDPRWGRGQETPGEDPFHLSQYVYHL 182
Query: 199 VKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFS 258
V Q GGI D + ++A CKH AYDLE W
Sbjct: 183 VDGLQ--------GGI-----------------DPWPYIKVAADCKHFAAYDLENWEGID 217
Query: 259 RYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEW 317
R+ F+A +++QD + + PPF+SC+ KA+ +MCSYN VNGVPAC L Q R+ W
Sbjct: 218 RFHFDAQVSQQDLSEYYLPPFQSCVRDAKAASVMCSYNSVNGVPACASTYLLQDILRDAW 277
Query: 318 GFKG--YITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGK 375
GF ++TSDC A+ IF+ N+T++ ++AA LKAG DI+CG+ H +A+++
Sbjct: 278 GFDDDRWVTSDCWALDKIFDSHNFTRSFAEAAAISLKAGTDIDCGSTFADHLPAALNQSL 337
Query: 376 VQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-YGKLGPDDVCTSEHKKLALDAARQGIVL 434
+ D+ RA + ++ +RLG F DP + Y + DV T E + L+ AA +G+VL
Sbjct: 338 ISRDDLTRAFIRQYTSLIRLGYF--DPSDSQTYRQFDWSDVNTPEAQALSRRAAVEGLVL 395
Query: 435 LKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYAS 494
LKND LPL + ++AIIGP N S M G Y G S +G + K A+
Sbjct: 396 LKND-GLLPLAPDG-KTIAIIGPYTNATSSMQGNYFGNAPIITSPFQGAQDVGFKVVSAA 453
Query: 495 GCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVA 554
G V S AGF EA+ AK AD V+ V G+D T E E DR S+ PG Q+ LV +A
Sbjct: 454 GT-TVNGTSSAGFAEAINTAKAADVVVFVGGIDNTLEREGLDRSSISWPGNQLDLVKDLA 512
Query: 555 RTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLP 614
K P+I+V GGG +D + A+ ++ +I+W GYPG++G A+ +II G P GRLP
Sbjct: 513 SLGK-PLIVVQFGGGQVDDTEILANKKVQAIIWAGYPGQSGGTAIFDIIVGSTAPAGRLP 571
Query: 615 MTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAP 673
+T YP +T +V M DM++R S PGR+Y++Y T V +GHGL +T + + + P
Sbjct: 572 VTQYPADYTHQVRMTDMSLRPSSHN--PGRTYKWYK-TPVLEYGHGLHFTTFDFSWQRQP 628
Query: 674 -SELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVV 732
+E I ++A K LD H D C V N G++ +V
Sbjct: 629 AAEYDIQELIRASHSK--------FLDLAHFDTFEIC----------VRNTGNITSDYVG 670
Query: 733 MLFARVPKVSQGTPEKQLIGFDRVHTVAKGS 763
+LF P K L+ + RVH + G+
Sbjct: 671 LLFLSGNTGPGPHPIKSLVAYSRVHDIQGGT 701
>gi|426198356|gb|EKV48282.1| hypothetical protein AGABI2DRAFT_67675 [Agaricus bisporus var.
bisporus H97]
Length = 763
Score = 474 bits (1221), Expect = e-131, Method: Compositional matrix adjust.
Identities = 284/731 (38%), Positives = 401/731 (54%), Gaps = 62/731 (8%)
Query: 42 FP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWS 100
FP C S P C+++ +TRA+SLI + T E IQ + + +PRLG+P YEWWS
Sbjct: 24 FPDCTNGPLKSTPVCDSTKDPATRAQSLIQMFTDDELIQNGDNASPGVPRLGLPPYEWWS 83
Query: 101 ESLHGIASNGPGVNF--NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
E+LHG+ + PGV F +G SS TSFPQ +V A+F+ L + + V+ EARA N G
Sbjct: 84 EALHGVG-HSPGVVFAPSGDFSSATSFPQPIVIGAAFDDDLVKAVANVVSTEARAFNNFG 142
Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
+AGL ++ PNIN F+DPRWGRGQETPGEDP +S Y V Q GGI
Sbjct: 143 RAGLNYFTPNINPFKDPRWGRGQETPGEDPFHLSQYVYHLVDGLQ--------GGI---- 190
Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
D + ++A CKH AYDLE W R+ F+A +++QD + + PP
Sbjct: 191 -------------DPWPYIKVAADCKHFAAYDLENWEGIDRFHFDAQVSQQDLSEYYLPP 237
Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKG--YITSDCDAVATIFE 335
F+SC+ KA+ +MCSYN VNGVPAC L Q R+ WGF ++TSDC A+ IF+
Sbjct: 238 FQSCVRDAKAASVMCSYNSVNGVPACASTYLLQDILRDAWGFDDDRWVTSDCWALDKIFD 297
Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
N+T++ ++AA LKAG DI+CG+ H +A+++ + D+ RA + ++ +RL
Sbjct: 298 SHNFTRSFAEAAAISLKAGTDIDCGSTFADHLPAALNQSLISRDDLTRAFIRQYTSLIRL 357
Query: 396 GLFNGDPRKGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
G F DP + Y + DV T E + L+ AA +G+VLLKND LPL + ++AI
Sbjct: 358 GYF--DPSHSQTYRQFDWSDVNTPEAQALSRRAAVEGLVLLKND-GLLPLAPDG-KTIAI 413
Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIA 514
IGP N S M G Y G S +G + K A+G V S AGF EA+ A
Sbjct: 414 IGPYTNATSSMQGNYFGNAPFITSPFQGAQDVGFKVVSAAGTI-VNGTSSAGFAEAINTA 472
Query: 515 KKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
+ AD V+ V G+D T E E DR S+ PG Q+ LV +A K P+I+V GGG +D +
Sbjct: 473 RAADVVVFVGGIDNTLEREGLDRSSISWPGNQLDLVKDLASLGK-PLIVVQFGGGQVDDT 531
Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMR 633
A+ ++ +I+W GYPG++G A+ +II G P GRLP+T YP +T +V M DM++R
Sbjct: 532 EILANEKVQAIIWAGYPGQSGGTAIFDIIVGATAPAGRLPVTQYPADYTHQVRMTDMSLR 591
Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAP-SELTISASLKAGSDKNILQ 692
S PGR+Y++Y T V +GHGL +T + + + P +E I ++A K
Sbjct: 592 PSSHN--PGRTYKWYK-TPVLEYGHGLHFTTFDFSWQRQPAAEYDIQELIRASHSK---- 644
Query: 693 QTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIG 752
LD H D C V N G++ +V +LF P K L+
Sbjct: 645 ----FLDLAHFDTFEIC----------VRNTGNITSDYVGLLFLSGNSGPGPHPIKSLVA 690
Query: 753 FDRVHTVAKGS 763
+ RVH + G+
Sbjct: 691 YSRVHDIQGGT 701
>gi|226491558|ref|NP_001146416.1| uncharacterized protein LOC100279996 [Zea mays]
gi|223975771|gb|ACN32073.1| unknown [Zea mays]
Length = 507
Score = 474 bits (1220), Expect = e-131, Method: Compositional matrix adjust.
Identities = 245/516 (47%), Positives = 332/516 (64%), Gaps = 13/516 (2%)
Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGV 350
MCSYNQVNG P C DL R +W GYI+SDCD+V ++ Q+YTKT ED+AA
Sbjct: 1 MCSYNQVNGKPTCADKDLLSGVIRGDWKLNGYISSDCDSVDVLYNNQHYTKTPEDAAAIS 60
Query: 351 LKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKL 410
+KAG+D+NCGT + +HT +A+ GK+ E D+DRA+ N +RLG F+GDPR+ +G L
Sbjct: 61 IKAGLDLNCGTFLAQHTVAAVQAGKLSESDVDRAVTNNLVTLMRLGFFDGDPRELPFGNL 120
Query: 411 GPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT 470
GP DVCT +++LA +AARQGIVLLKN K LPL+ ++ S+A+IGP N M G Y
Sbjct: 121 GPSDVCTPSNQELAREAARQGIVLLKNTGK-LPLSATSIKSMAVIGPNANASFTMIGNYE 179
Query: 471 GIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDA-GFHEAVRIAKKADFVIVVAGLDLT 529
G PC + L+GL A V+ T Y GC +V C+ ++ A + A AD ++V G D +
Sbjct: 180 GTPCKYTTPLQGLGANVA-TVYQPGCTNVGCSGNSLQLDAATKAAASADVTVLVVGADQS 238
Query: 530 QETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIG 589
E E DR SLLLPGQQ LV++VA S P ILV+ GGP D+SFA++ +I++ILW+G
Sbjct: 239 IERESLDRTSLLLPGQQPQLVSAVANASSGPCILVVMSGGPFDISFAKSSDKIAAILWVG 298
Query: 590 YPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYT 649
YPGEAG A+A+++FG NP GRLP+TWYPESFTKVPM DM MR D S YPGR+YRFYT
Sbjct: 299 YPGEAGGAAIADVLFGYHNPSGRLPVTWYPESFTKVPMTDMRMRPDPSTGYPGRTYRFYT 358
Query: 650 GTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSC 709
G VY FG GLSYT++++ +SAP +L +L+ L + ++ E C
Sbjct: 359 GDTVYAFGDGLSYTSFAHHLVSAPKQL----ALQLAEGHACLTEQCPSVEA----EGAHC 410
Query: 710 TSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFG 769
L F V + V NAG+ G H V LF+ P V P K L+GF++V + ++F
Sbjct: 411 EGLAFDVHLRVRNAGERSGGHTVFLFSSPPAV-HNAPAKHLLGFEKVSLEPGQAGVVAFK 469
Query: 770 VDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
VD C+ LS+ ++ G R + LG+H L VG+L+H+L +
Sbjct: 470 VDVCKDLSVVDELGNRKVALGSHTLHVGDLKHTLNL 505
>gi|409041356|gb|EKM50841.1| glycoside hydrolase family 3 protein [Phanerochaete carnosa
HHB-10118-sp]
Length = 764
Score = 474 bits (1220), Expect = e-131, Method: Compositional matrix adjust.
Identities = 278/761 (36%), Positives = 417/761 (54%), Gaps = 58/761 (7%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
CN S ++RA +L+ LTL+E + + + +PRLG+P Y WWSE+LHG+A + PG N
Sbjct: 38 CNPSADPTSRANALVDALTLEELVNNTVNASPGVPRLGLPPYNWWSEALHGVALS-PGTN 96
Query: 115 FN---GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
F+ SS TSFPQ ++ A+F+ L ++I + ++ EARA N G+AGL F+ PNIN
Sbjct: 97 FSVPGSPFSSATSFPQPIILGATFDDDLVTSIATVISTEARAFNNAGRAGLDFFTPNINP 156
Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
F+DPRWGRGQETPGEDP ++ Y + V QG G K
Sbjct: 157 FKDPRWGRGQETPGEDPFHIAQYVYQLVTGLQG------------GLSPDPYYK------ 198
Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
+ A CKH YDLE W SR +FNA+I+ QD + + P F+SC+ +
Sbjct: 199 -------VIADCKHFAGYDLENWEGNSRMAFNAIISTQDLAEYYTPSFQSCVRDAHVGSV 251
Query: 292 MCSYNQVNGVPACLRGDLFQK-ARNEWGF-KGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
MCSYN VNG+P+C L Q R +G G+ITSDCDAVA IF YT T +++A
Sbjct: 252 MCSYNAVNGIPSCANSYLLQDIIRGHFGLGDGWITSDCDAVANIFSPHQYTTTLVNASAV 311
Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGK 409
LKAG D++CGT + A+D+ V E DI +++ L+ +RLG F+ P + + +
Sbjct: 312 ALKAGTDVDCGTTYSQTLVDAVDQNLVTEDDIKNSMIRLYRSLVRLGYFD-SPAEQPFRQ 370
Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY 469
LG DV T + LAL AA +G+ LLKND LPL+ +A+ +A++GP N +QM G Y
Sbjct: 371 LGWSDVNTPSSQALALTAAEEGVTLLKNDGT-LPLS-SAIKRIALVGPWANATTQMQGNY 428
Query: 470 TGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLT 529
GI S L+ L+ + +A+G + D+GF AV + AD VI G+D T
Sbjct: 429 QGIAPFLVSPLQALQDAGFQVTFANGTA-INSTDDSGFAAAVSAVQVADAVIYAGGIDET 487
Query: 530 QETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIG 589
E+E DR + PG Q+ LV+ +A K P +++ GGG +D S +++ +++++W G
Sbjct: 488 IESEGNDREIITWPGNQLDLVSQLAAVGK-PFVVLQMGGGQVDSSSLKSNKAVNALIWGG 546
Query: 590 YPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRFY 648
YPG++G A+ I+ G P GRLP+T YP + ++PM DM +R + + PGR+Y+++
Sbjct: 547 YPGQSGGAAIVNILTGKIAPAGRLPITQYPADYVNEIPMTDMALRPNGTS--PGRTYKWF 604
Query: 649 TGTQVYGFGHGLSYTNYSYKFL-SAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVT 707
TGT ++GFG GL YT +S + + PS IS + + + T
Sbjct: 605 TGTPIFGFGFGLHYTTFSLDWAPTPPSSFAISTLVSEANTAGV--------------SFT 650
Query: 708 SCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEIS 767
+ L F +++V N G V +V +LF+ Q P KQL+ + RV +A G E +
Sbjct: 651 NLAPL-FTFRVNVKNTGKVGSDYVALLFSNTTAGPQPAPLKQLVSYTRVKGIAPGQTETA 709
Query: 768 FGVDPCEQLSIANKHGRRILPLGNHVLMV---GELRHSLTI 805
++ +++G L G + + V G++ HS +
Sbjct: 710 ELKVTLGSIARIDENGDSALYPGRYNIWVDTTGDIVHSFEL 750
>gi|392590128|gb|EIW79457.1| glycoside hydrolase family 3 protein [Coniophora puteana RWD-64-598
SS2]
Length = 770
Score = 473 bits (1216), Expect = e-130, Method: Compositional matrix adjust.
Identities = 283/755 (37%), Positives = 412/755 (54%), Gaps = 56/755 (7%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
C+TSL+ + RA +L+ L T++E I + + +PRLG+PAY+WWSE LHG+A + PGVN
Sbjct: 37 CDTSLNATQRAAALVELFTVEELINNTVNGSPGVPRLGLPAYQWWSEGLHGVA-DSPGVN 95
Query: 115 FN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
F+ G S TSFPQ +V +A+F+ +L +G V +E R+ N G AGL FW PNIN F
Sbjct: 96 FSTSGPFSYATSFPQPIVMSAAFDDALIKAVGGVVGMEGRSFNNYGHAGLDFWTPNINPF 155
Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
+DPRWGRGQETPGEDP ++ Y ++ QG G E
Sbjct: 156 KDPRWGRGQETPGEDPYHIAQYVYNLIQGLQG---------------------GVNPEP- 193
Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
+ A CKH YDLE W N RY F+A+IT QD + + P F+SC +A M
Sbjct: 194 ---YFQVVATCKHFAGYDLEDWENNFRYGFDALITTQDLSEFYLPSFQSCYRDAQAGASM 250
Query: 293 CSYNQVNGVPACLRGDLFQKA-RNEWGFKG--YITSDCDAVATIFEYQNYTKTHEDSAAG 349
CSYN VNG+P C L Q R+ W F ++TSDCDAV I+ NYT + +AA
Sbjct: 251 CSYNAVNGIPTCADTYLLQDILRDYWNFDETRWVTSDCDAVENIYNPHNYTALPQQAAAD 310
Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-YG 408
L+AG D++CGT + A ++ + E ++ AL ++ +RLG F DP + Y
Sbjct: 311 ALRAGTDLDCGTFYTEYLPLAYNQSLITETELRAALTRQYASLVRLGYF--DPAAQQPYR 368
Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
+ G +V T ++LA AA +GI LLKND LPL + + ++A+IGP N +QM G
Sbjct: 369 QYGWSNVDTPYAQQLAYTAATEGITLLKNDGT-LPL-PSTLKNIALIGPWANATNQMQGN 426
Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDL 528
Y G+ S L+G A Y G ++ NS AGF A+ A++AD V+ G+D+
Sbjct: 427 YFGVAPYLVSPLQGALAAGYNVTYVFGT-NITSNSTAGFAAAIAAAREADAVVYAGGIDV 485
Query: 529 TQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWI 588
T E E DR ++ PG Q+ L+ +A K P ++ GGG +D + +A++ ++S++W
Sbjct: 486 TVEAEAMDRYNVTWPGNQLQLIGELAALGK-PFVVAQFGGGQVDDTEIKANASVNSLIWA 544
Query: 589 GYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSS-RQYPGRSYR 646
GYPG++G +AL +II G P GRL T YP + ++PM DMN+R +++ PGR+Y+
Sbjct: 545 GYPGQSGGQALFDIISGKVAPAGRLVTTQYPADYVYEIPMTDMNLRPNANGTTSPGRTYK 604
Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEV 706
+YTG VY FG+GL YTN++Y + AP+ S NI + HID
Sbjct: 605 WYTGAPVYEFGYGLHYTNFTYTWTKAPA-----------STYNIQTLVSAASGAAHIDLA 653
Query: 707 TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEI 766
T + ++VTNAG V + +LF P K L + R+H+VA G+ +
Sbjct: 654 PFDT-----LSVAVTNAGAVTSDYSALLFVNGTYGPAPYPNKALAAYTRLHSVAAGAAQT 708
Query: 767 SFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRH 801
+ Q++ A+ +G L G + L + R
Sbjct: 709 ATFDVVLNQIARADAYGNFWLYPGAYELALDTTRE 743
>gi|452989371|gb|EME89126.1| glycoside hydrolase family 3 protein [Pseudocercospora fijiensis
CIRAD86]
Length = 790
Score = 472 bits (1215), Expect = e-130, Method: Compositional matrix adjust.
Identities = 282/750 (37%), Positives = 394/750 (52%), Gaps = 55/750 (7%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
C+T+ TRAK+LI+ TL EKI + +PRLG+ YEWW E+LHG+AS+ PGVN
Sbjct: 39 CDTAADPLTRAKALIAEFTLAEKINNTGSTSPGVPRLGLLPYEWWQEALHGVASS-PGVN 97
Query: 115 FN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
F+ G TSFPQ ++ A+F+ L ++ S ++ EARA N +AGL FW PNIN F
Sbjct: 98 FSVSGEFRYATSFPQPILMGAAFDDQLIHDVASVISTEARAFSNDDRAGLDFWTPNINPF 157
Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
+DPRWGRGQETPGEDP +S+Y ++ QG+N K+V+
Sbjct: 158 KDPRWGRGQETPGEDPYHLSSYVHSLIRGLQGDN-----------PSYKKVV-------- 198
Query: 233 RGDELMLSACCKHLIAYDLEKW-GNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
A CKH +AYD+E W GNF RY +A I QD + + PPFRSC
Sbjct: 199 --------ATCKHFVAYDVENWNGNF-RYQLDAHINSQDLVEYYMPPFRSCARDSNVGAF 249
Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGFKG---YITSDCDAVATIFEYQNYTKTHEDSA 347
MCSYN +NGVP C L Q R W + ++TSDCD+V +F Y NY + E++A
Sbjct: 250 MCSYNSLNGVPTCADPYLLQTVLREHWNWTAEEQWVTSDCDSVQNVFLYHNYASSREEAA 309
Query: 348 AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKY 407
A LKAG DINCGT H A ++G + E D+D +L+ + +RLG F+GD + Y
Sbjct: 310 AISLKAGTDINCGTYYQEHLPRAYEQGLINETDVDTSLIRQYGSLIRLGYFDGD--RVPY 367
Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
L +DV T + LAL AA GI LLKND LPL + +A+IG N QM G
Sbjct: 368 RNLTWNDVSTPYAQDLALKAATSGITLLKNDG-ILPLQITNGTKIALIGDWANATDQMLG 426
Query: 468 GYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLD 527
Y GIP S L + ++ Y G + + A K+D +I + G+D
Sbjct: 427 NYHGIPPYFHSPLWAAQQTGAEVTYVQGPGGQSDPTTYTWRPIWSAANKSDVIIYIGGMD 486
Query: 528 LTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILW 587
E E++DRVS+ G Q+ ++ +A +P I+V GGG LD S + I ++LW
Sbjct: 487 ERVEAEEKDRVSIAWSGPQLDVIGQLADYYDKPTIVVQMGGGSLDSSPLVKNPNIRALLW 546
Query: 588 IGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYR 646
GYPG+ G KA+ +I+ G P GRLP+T Y + +KVPM D ++R +++ PGR+Y
Sbjct: 547 GGYPGQDGGKAIFDILQGISAPAGRLPITQYRADYISKVPMTDTSLRPNATSGSPGRTYI 606
Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEV 706
+ V+ FG+GL YTN++ A S T + SD T S LD
Sbjct: 607 WLNEEPVFEFGYGLHYTNFTATIPDAESSDTTYSIDSLASDC-----TESYLD------- 654
Query: 707 TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEI 766
C F I VTN G V +V + F + P K+L+ + R+H + GS +
Sbjct: 655 -RCPFKTF--SIDVTNTGSVTSDYVTLGFLTGAHGPEPCPNKRLVSYQRLHNITAGSTQT 711
Query: 767 SFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
+ LS + G +L G++ L+V
Sbjct: 712 AALNLTLGSLSRVDDKGNTVLFPGSYALLV 741
>gi|62321271|dbj|BAD94481.1| beta-xylosidase [Arabidopsis thaliana]
Length = 523
Score = 470 bits (1209), Expect = e-129, Method: Compositional matrix adjust.
Identities = 244/531 (45%), Positives = 336/531 (63%), Gaps = 16/531 (3%)
Query: 281 SCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNY 339
S + G + +MCSYNQVNG P C DL R EW GYI SDCD+V +++ Q+Y
Sbjct: 1 SYVVDGNVASVMCSYNQVNGKPTCADPDLLSGVIRGEWKLNGYIVSDCDSVDVLYKNQHY 60
Query: 340 TKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
TKT ++AA + AG+D+NCG+ + +HT+ A+ G V E ID+A+ N F +RLG F+
Sbjct: 61 TKTPAEAAAISILAGLDLNCGSFLGQHTEEAVKSGLVNEAAIDKAISNNFLTLMRLGFFD 120
Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
G+P+ YG LGP DVCTS +++LA DAARQGIVLLKN LPL+ ++ +LA+IGP
Sbjct: 121 GNPKNQIYGGLGPTDVCTSANQELAADAARQGIVLLKN-TGCLPLSPKSIKTLAVIGPNA 179
Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADF 519
N M G Y G PC + L+GL VS T Y GC +V C + A A ++A AD
Sbjct: 180 NVTKTMIGNYEGTPCKYTTPLQGLAGTVSTT-YLPGCSNVAC-AVADVAGATKLAATADV 237
Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
++V G D + E E RDRV L LPGQQ LV VA+ +K PV+LV+ GG D++FA+ D
Sbjct: 238 SVLVIGADQSIEAESRDRVDLRLPGQQQELVIQVAKAAKGPVLLVIMSGGGFDITFAKND 297
Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSR 638
+I+ ILW+GYPGEAG A+A+IIFG +NP G+LPMTWYP+S+ KVPM MNMR D +
Sbjct: 298 PKIAGILWVGYPGEAGGIAIADIIFGRYNPSGKLPMTWYPQSYVEKVPMTIMNMRPDKAS 357
Query: 639 QYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGS-- 696
YPGR+YRFYTG VY FG GLSYT +S+ + APS +++ G ++N + ++
Sbjct: 358 GYPGRTYRFYTGETVYAFGDGLSYTKFSHTLVKAPSLVSL------GLEENHVCRSSECQ 411
Query: 697 RLDYV--HIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFD 754
LD + H + S F V I V N GD +G H V LF P + G+P K L+GF+
Sbjct: 412 SLDAIGPHCENAVSGGGSAFEVHIKVRNGGDREGIHTVFLFTTPPAI-HGSPRKHLVGFE 470
Query: 755 RVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
++ + + F V+ C+ LS+ ++ G+R + LG H+L VG+L+HSL+I
Sbjct: 471 KIRLGKREEAVVRFKVEICKDLSVVDEIGKRKIGLGKHLLHVGDLKHSLSI 521
>gi|452846807|gb|EME48739.1| glycoside hydrolase family 3 protein [Dothistroma septosporum
NZE10]
Length = 802
Score = 464 bits (1193), Expect = e-127, Method: Compositional matrix adjust.
Identities = 278/757 (36%), Positives = 408/757 (53%), Gaps = 59/757 (7%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
C+T+ TRA +LI+ TLQEK+ + +PRLG+PAY WW E+LHG+AS+ PGVN
Sbjct: 39 CDTTADPLTRATALINAFTLQEKLNNTGSTSPGVPRLGLPAYTWWQEALHGVASS-PGVN 97
Query: 115 FN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
F+ G TSFPQ ++ A+F+ L ++ + ++ EARA N +AGL FW PNIN F
Sbjct: 98 FSDSGPFRYATSFPQPILMGAAFDDDLIRDVATVISTEARAFNNDKRAGLDFWTPNINPF 157
Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
+D RWGRGQETPGEDP +S+Y ++ QG DD + KRV+
Sbjct: 158 KDSRWGRGQETPGEDPYHLSSYVAALIEGLQG---SPDD-------KYKRVV-------- 199
Query: 233 RGDELMLSACCKHLIAYDLEKW-GNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
A CKH +AYD+E W GNF RY F+A ++ QD + + PPF+ C
Sbjct: 200 --------ATCKHFVAYDMESWNGNF-RYQFDAQVSSQDLVEYYMPPFQQCARDSNVGAF 250
Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGF---KGYITSDCDAVATIFEYQNYTKTHEDSA 347
MCSYN +NGVP C L Q R +W + + ++TSDCDAV +F +Y T E++A
Sbjct: 251 MCSYNALNGVPTCADPWLLQTVLREKWNWTSEQQWVTSDCDAVQNVFLPHDYASTREEAA 310
Query: 348 AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKY 407
A LKAG DINCGT H +A D+G + D+D +L+ +S +RLG F+G Y
Sbjct: 311 ALSLKAGTDINCGTYYQDHLPAAYDQGLINTTDLDISLIRQYSSLVRLGYFDG--LAVPY 368
Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
L +DV T ++LA AA +GI LLKND LPL + +S+A+IG N QM G
Sbjct: 369 RNLTWNDVSTPHAQQLAYKAAAEGITLLKND-GVLPLTISNGTSIALIGDWANATDQMLG 427
Query: 468 GYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLD 527
Y GIP S L + + ++A+G + + A K+D +I G+D
Sbjct: 428 NYDGIPPFFHSPLYAAQQTGATVNFATGPGGQGDPTTDHWLPVWAAANKSDVIIYAGGID 487
Query: 528 LTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILW 587
+ E+E DRVSL G Q+ ++ +A K PVI++ GGG +D S + +S+++W
Sbjct: 488 NSVESEGMDRVSLTWTGAQLDMIGQLAMYGK-PVIVLQMGGGQIDSSPLVNNPNVSALIW 546
Query: 588 IGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYR 646
GYPG+ G AL +II G P GRLP T YP + ++VPM DM +R +S+ PGR+Y
Sbjct: 547 GGYPGQDGGVALFDIIRGITAPAGRLPTTQYPAKYISQVPMTDMTLRPNSTTGSPGRTYI 606
Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEV 706
+Y V+ +G GL YTN++ I S + D + + D I +
Sbjct: 607 WYNENAVFPYGLGLHYTNFT---------AAIKPSFPSTYDSSSSNSGSASYD---ISTL 654
Query: 707 TS-CTSLRFHV------QISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTV 759
TS CT+ + +S+TN G++ +V + F P K+L+ + R+H +
Sbjct: 655 TSNCTATYKDLCPFTSFSVSITNTGEIMSDYVTLGFLAGIHGPAPHPNKRLVSYQRLHNI 714
Query: 760 AKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
GS + ++ L+ ++ G ++L G++ L+V
Sbjct: 715 TAGSSQTAWLNLTLGSLARVDEMGNKVLYPGDYALLV 751
>gi|119473971|ref|XP_001258861.1| beta-xylosidase [Neosartorya fischeri NRRL 181]
gi|292495290|sp|A1DJS5.1|XYND_NEOFI RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|119407014|gb|EAW16964.1| beta-xylosidase [Neosartorya fischeri NRRL 181]
Length = 771
Score = 463 bits (1191), Expect = e-127, Method: Compositional matrix adjust.
Identities = 290/756 (38%), Positives = 409/756 (54%), Gaps = 72/756 (9%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
C+TSL ++TRA+SL++ +T +EK+ N+ +PRLG+PAY WWSE+LHG+A + PGV
Sbjct: 43 CDTSLDVTTRARSLVNAMTFEEKVNNTQYNSPGVPRLGLPAYNWWSEALHGVAGS-PGVE 101
Query: 115 F--NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
F +G S TSFPQ ++ A+F+ L + + V+ E RA N G+AGL FW PNIN F
Sbjct: 102 FADSGPFSYATSFPQPILLGATFDDDLIKQVATVVSTEGRAFGNAGRAGLDFWTPNINPF 161
Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
RD RWGRGQETPGEDP+ VS Y V Q G G +V+
Sbjct: 162 RDARWGRGQETPGEDPLHVSRYVYHLVDGLQN----------GIGPANPKVV-------- 203
Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
A CKH AYDLE W R+SFNA ++ QD + + PPF+SC K +M
Sbjct: 204 --------ATCKHFAAYDLEDWNGVVRHSFNAEVSTQDLSEFYLPPFKSCARDAKVDAVM 255
Query: 293 CSYNQVNGVPACLRGDLFQKA-RNEWGFK---GYITSDCDAVATIFEYQNYTKTHEDSAA 348
CSYN +NGVPAC L Q R W + +IT DC A+ I+ NYTKT ++AA
Sbjct: 256 CSYNALNGVPACADSYLLQTILREHWKWDEPGHWITGDCGAIDDIYNGHNYTKTPAEAAA 315
Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-Y 407
L AG D++CGT ++ A D+G K +D+AL+ L+S ++LG F DP + + Y
Sbjct: 316 TALNAGTDLDCGTVFPKYLGQAADEGLYTNKTLDKALVRLYSSLVKLGYF--DPAEDQPY 373
Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
+G DV + + LA AA +GIVLLKNDK LPL A +LA+IGP N QM G
Sbjct: 374 RSIGWKDVDSPAAEALAHKAAVEGIVLLKNDKT-LPL--KAKGTLALIGPYANATKQMQG 430
Query: 468 GYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLD 527
Y G P ++LL Y +G + NS AGF A+ AK+AD V+ G+D
Sbjct: 431 NYEGPPKYIRTLLWAATQAGYDVKYVAGTA-INANSTAGFDAALSAAKQADVVVYAGGID 489
Query: 528 LTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILW 587
T E E DR +++ PG Q+ L+ +++ K P+++V GGG +D S ++ ++++LW
Sbjct: 490 NTIEAEGHDRTTIVWPGNQLDLIDQLSKIGK-PLVVVQFGGGQVDDSSLLSNPHVNALLW 548
Query: 588 IGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYR 646
GYP + G A+ +I+ G P GRLP+T YP + +VP+ DM +R S+ PGR+YR
Sbjct: 549 TGYPSQEGGSAIFDILTGKTAPAGRLPVTQYPADYVNQVPLTDMALRPGSNT--PGRTYR 606
Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEV 706
+Y V FG GL YT + + +A+L + S KN V ID
Sbjct: 607 WYD-KAVLPFGFGLHYTTFKISWPRRALGPYDTAALVSRSPKN-----------VPIDRA 654
Query: 707 TSCTSLRFHVQISVTNAGDVDGSHVVMLFAR-VPKVSQGTPEKQLIGFDRVHTVAKGSK- 764
T FH+Q VTN G +V +LF + + + P K L+G+ R + G K
Sbjct: 655 AFDT---FHIQ--VTNTGKTTSDYVALLFLKTIDAGPKPYPLKTLVGYTRAKQIKPGEKR 709
Query: 765 ----EISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
++S G L+ ++G +L G + L V
Sbjct: 710 SVDIKVSLG-----SLARTAENGDLVLYPGRYTLEV 740
>gi|407922988|gb|EKG16078.1| Glycoside hydrolase family 3 [Macrophomina phaseolina MS6]
Length = 800
Score = 462 bits (1190), Expect = e-127, Method: Compositional matrix adjust.
Identities = 272/751 (36%), Positives = 409/751 (54%), Gaps = 57/751 (7%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG-- 112
C++S + RA +L+ LTL+EK+ + + +PRLGIP Y+WW+E+LHG+A PG
Sbjct: 41 CDSSATPLARATALVKELTLEEKLNNTGNTSPGVPRLGIPEYQWWNEALHGVAFTYPGQP 100
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
+ +G SS TSFPQ ++ A+F+ L + S V+ EARA N G++GL +W PNIN +
Sbjct: 101 MTESGNFSSATSFPQPILMGAAFDDELIYEVASVVSTEARAYSNGGRSGLDYWTPNINPY 160
Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
+DPRWGRGQETPGEDP +++Y ++ +G ++D
Sbjct: 161 KDPRWGRGQETPGEDPFHLASYVQNLIRGLEGN------------------------QND 196
Query: 233 RGDELMLSACCKHLIAYDLEKW-GNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
+++ A CKH YD+E W GNF RY F+A I +D + + PPF++C + K
Sbjct: 197 PYKKIV--ATCKHFTGYDMENWNGNF-RYQFDAQINMRDMVEYYMPPFQACAREAKVGAF 253
Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGFK---GYITSDCDAVATIFEYQNYTKTHEDSA 347
MCSYN VNGVP C L Q R WG+ ++ SDCDA+ ++ + ++ E +
Sbjct: 254 MCSYNAVNGVPTCADPWLLQTVLREHWGWNQEDQWVVSDCDAIQNVYLPHEWAESREQAV 313
Query: 348 AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF-NGDPRKGK 406
A L AG D+NCGT R+ A ++G + + +DRAL +S ++LG F N D +
Sbjct: 314 ADTLNAGTDLNCGTYYQRYLPGAYEQGLINDTTLDRALTRTYSSLIKLGYFDNADSQ--P 371
Query: 407 YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMG 466
Y ++G DV + ++LAL AA++GIVLLKND LPL+ + VSS+A+IG N QM
Sbjct: 372 YRQIGWQDVNSQHAQELALKAAQEGIVLLKND-GLLPLSLDGVSSIALIGSWANATEQMQ 430
Query: 467 GGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGL 526
G Y G+ S L E K +YA G +D E A+ +D +IVV G+
Sbjct: 431 GNYAGVAPYLHSPLYAAEQLGVKVNYAEGASQSNPTTDQWGAE-YTAAENSDVIIVVGGI 489
Query: 527 DLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSIL 586
D E+E+ DRV++ G Q+ ++T +A T +PVI+V G G LD + +++ IS++L
Sbjct: 490 DNDIESEELDRVAIAWSGPQLDMITKLA-TYGKPVIVVQMGAGQLDSTPLVSNANISALL 548
Query: 587 WIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK-VPMNDMNMRADSSRQYPGRSY 645
W GYPG+ G AL +II G P GRLP+T YP +TK V M DM++R S+ GR+Y
Sbjct: 549 WGGYPGQDGGTALFDIITGAVAPAGRLPITQYPARYTKEVAMTDMSLRPSSTSA--GRTY 606
Query: 646 RFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDE 705
++Y GT V+ FG GL YTN+S S P+ + L A N S+LD
Sbjct: 607 KWYNGTAVFPFGFGLHYTNFSAAIPSPPASSFAISDLVASCSAN----DTSKLD------ 656
Query: 706 VTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKE 765
+ TSL + + N G V + F P+ L+ + R+H +A G +
Sbjct: 657 LCPFTSL----AVDIANDGTRASDFVALAFLTGEFGPSPHPKSSLVAYQRLHAIAAGETQ 712
Query: 766 ISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
+ L +++G ++L G++ +++
Sbjct: 713 TARLNLTLGSLVRVDENGDKLLYPGDYSVLI 743
>gi|391865040|gb|EIT74331.1| beta-glucosidase-related glycosidase [Aspergillus oryzae 3.042]
Length = 822
Score = 462 bits (1190), Expect = e-127, Method: Compositional matrix adjust.
Identities = 288/781 (36%), Positives = 425/781 (54%), Gaps = 78/781 (9%)
Query: 44 CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESL 103
C+ S+P C+TSLSI+ R SL+ LTL+EKI L D ++ RLG+P+YEWWSE+
Sbjct: 69 CQTQPLCSHPVCDTSLSIAERVDSLVKSLTLEEKILNLVDASAGSTRLGLPSYEWWSEAT 128
Query: 104 HGIASNGPGVNFN---GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA 160
HG+ S PGV F S TSFP +++AASF+ +L I + E RA N G +
Sbjct: 129 HGVGS-APGVQFTSKPANFSYATSFPAPILTAASFDDTLIRKIAEVIGREGRAFGNNGFS 187
Query: 161 GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFRE 220
G FWAPNIN FRDPRWGRGQETPGEDP+V Y FV QG++ K+
Sbjct: 188 GFDFWAPNINGFRDPRWGRGQETPGEDPLVAQNYIRNFVPGLQGDDPKN----------- 236
Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFR 280
K+V+ A CKH YDLE RY N T+QD D F PF+
Sbjct: 237 KQVI----------------ATCKHYAVYDLET----GRYGNNYNPTQQDLSDYFLAPFK 276
Query: 281 SCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKG---YITSDCDAVATIFEY 336
+C+ IMCSYN V+G+PAC L + R W F Y+ SDC AV I++Y
Sbjct: 277 TCVRDTDVGSIMCSYNSVSGIPACANEYLLDEVLRKHWNFNSDYYYVVSDCGAVTDIWQY 336
Query: 337 QNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
N+T T E +A+ L AG+D+ CG+ L+ +S + + K +DR+L L+S +G
Sbjct: 337 HNFTDTEEAAASVALNAGVDLECGSSYLKLNES-LAANQTSVKVMDRSLARLYSALFTVG 395
Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN-KNAVSSLAII 455
F+G GKY KL DV T + + LA +AA +G+ LLKND LPL+ + S+A+I
Sbjct: 396 FFDG----GKYDKLDFSDVSTPDAQALAYEAAVEGMTLLKND-DLLPLDFPHKYKSVAVI 450
Query: 456 GPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCHDVPCNSDAGFHEAVR 512
GP N +QM G Y+G L+ LEA+ K +YA G + + +GF EA+
Sbjct: 451 GPFANATTQMQGDYSG---DAPYLISPLEAFGDSRWKVNYALGTA-INNQNTSGFEEALA 506
Query: 513 IAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLD 572
A K+D +I + G+D + E+E DR SL PG Q+ L+TS+++ SK P+++V GGG +D
Sbjct: 507 AANKSDLIIYLGGIDNSLESETLDRTSLAWPGNQLDLITSLSKLSK-PLVVVQFGGGQVD 565
Query: 573 VSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMN 631
S + I +++W GYP ++G AL +++ G +P GRLP+T YP S+ +V + D+N
Sbjct: 566 DSAILKNKDIQALVWAGYPSQSGGTALLDVLVGKRSPAGRLPVTQYPASYADQVNIFDIN 625
Query: 632 MRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNIL 691
+R S YPGR+Y++YTG V FG+GL YT + + + E T++ + NI
Sbjct: 626 LRPTDS--YPGRTYKWYTGKPVLPFGYGLHYTKFMFDW-----EKTLN------REYNIQ 672
Query: 692 QQTGSRLDYV--HIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEK 748
S + I++ T T+++ V+ N G +V +LF P K
Sbjct: 673 DLVASCRNSSGGPINDNTPLTTVKARVK----NVGHKTSDYVSLLFLSSKNAGPAPRPNK 728
Query: 749 QLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG---ELRHSLTI 805
L+ + R+ +A+GS +++ L+ A+++G ++ G + + + EL T+
Sbjct: 729 SLVSYVRLLNIARGSDQVAELPLTLGSLARADENGSLVIFPGRYKIALDNSEELTFEFTL 788
Query: 806 E 806
+
Sbjct: 789 K 789
>gi|297039776|gb|ADH95739.1| beta-xylosidase [Aspergillus fumigatus]
Length = 771
Score = 462 bits (1188), Expect = e-127, Method: Compositional matrix adjust.
Identities = 294/770 (38%), Positives = 413/770 (53%), Gaps = 73/770 (9%)
Query: 42 FP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWS 100
FP C C+TSL ++TRA+SL++ +T +EK+ N+ +PRLG+PAY WWS
Sbjct: 29 FPDCSSGPLSKLAVCDTSLDVTTRAQSLVNAMTFEEKVNNTQYNSPGVPRLGLPAYNWWS 88
Query: 101 ESLHGIASNGPGVNF--NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
E+LHG+A + PGV F +G S TSFPQ ++ A+F+ L + + V+ E RA N G
Sbjct: 89 EALHGVAGS-PGVEFADSGPFSYATSFPQPILLGATFDDDLIKQVATVVSTEGRAFGNAG 147
Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
++GL FW PNIN FRD RWGRGQETPGEDP+ VS Y V Q G G
Sbjct: 148 RSGLDFWTPNINPFRDARWGRGQETPGEDPLHVSRYVYHLVDGLQN----------GIGP 197
Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
+V+ A CKH AY LE W R+SFNA ++ QD + + PP
Sbjct: 198 ANPKVV----------------ATCKHFAAYGLEDWNGVVRHSFNAEVSTQDLSEFYLPP 241
Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKARNE---WGFKG-YITSDCDAVATIF 334
F+SC + +MCSYN +NGVPAC L Q E W G +ITSDC A+ I+
Sbjct: 242 FKSCARDARVDAVMCSYNALNGVPACADSYLLQTILREHWKWDEPGRWITSDCGAIDDIY 301
Query: 335 EYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
N+T T ++AA L AG D++CGT ++ A D+G + +DRAL+ L+S ++
Sbjct: 302 NGHNFTTTPAEAAATALNAGTDLDCGTVFPKYLGQAADEGLYSNQTLDRALVRLYSSFVK 361
Query: 395 LGLFNGDPRKGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
LG F DP + + Y +G DV T + LA AA +GIVLLKNDK LPL A +LA
Sbjct: 362 LGYF--DPAEDQPYRSIGWTDVDTPAVEALAHKAAGEGIVLLKNDKT-LPL--KAKGTLA 416
Query: 454 IIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRI 513
+IGP N QM G Y G ++LL YA+G + NS AGF A+
Sbjct: 417 LIGPYANATKQMQGNYEGPAKYIRTLLWAATQAGYDVKYAAGTA-INTNSTAGFDAALSA 475
Query: 514 AKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
AK+AD V+ G+D T E E RDR ++ PG Q++L+ +++ K P+++V GGG +D
Sbjct: 476 AKQADVVVYAGGIDNTIEAEGRDRTTIAWPGNQVNLIDQLSKIGK-PLVVVQFGGGQVDD 534
Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNM 632
S ++ +++++LW GYP + G A+ +I+ G P GRLP+T YP + +VPM DM +
Sbjct: 535 SSLLSNPRVNALLWAGYPSQEGGSAIFDILTGKTAPAGRLPVTQYPADYVNQVPMTDMAL 594
Query: 633 RADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQ 692
R S+ PGR+YR+Y V FG GL YT + + +A+L + S KN
Sbjct: 595 RPGSNT--PGRTYRWYD-KAVLPFGFGLHYTTFKISWPRRALGPYNTAALVSRSPKN--- 648
Query: 693 QTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFAR-VPKVSQGTPEKQLI 751
V ID T FH+Q VTN G +V +LF + + + P K L+
Sbjct: 649 --------VPIDRAAFDT---FHIQ--VTNTGKTTSDYVALLFLKTIDAGPKPYPLKTLV 695
Query: 752 GFDRVHTVAKGSK-----EISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
G+ R + G K E+S G L+ ++G +L G + L V
Sbjct: 696 GYTRAKQIKPGEKRSVDIEVSLG-----SLARTAENGDLVLYPGRYTLEV 740
>gi|83774566|dbj|BAE64689.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 822
Score = 461 bits (1187), Expect = e-127, Method: Compositional matrix adjust.
Identities = 292/780 (37%), Positives = 425/780 (54%), Gaps = 80/780 (10%)
Query: 44 CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESL 103
C+ S+P C+TSLSI+ R SL+ LTL+EKI L D ++ RLG+P+YEWWSE+
Sbjct: 69 CQTQPLCSHPVCDTSLSIAERVDSLVKSLTLEEKILNLVDASAGSTRLGLPSYEWWSEAT 128
Query: 104 HGIASNGPGVNFN---GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA 160
HG+ S PGV F S TSFP +++AASF+ +L I + E RA N G +
Sbjct: 129 HGVGS-APGVQFTSKPANFSYATSFPAPILTAASFDDTLIRKIAEVIGREGRAFGNNGFS 187
Query: 161 GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFRE 220
G FWAPNIN FRDPRWGRGQETPGEDP+V Y FV QG++ K+
Sbjct: 188 GFDFWAPNINGFRDPRWGRGQETPGEDPLVAQNYIRNFVPGLQGDDPKN----------- 236
Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFR 280
K+V+ A CKH YDLE RY N T+QD D F PF+
Sbjct: 237 KQVI----------------ATCKHYAVYDLET----GRYGNNYNPTQQDLSDYFLAPFK 276
Query: 281 SCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKG---YITSDCDAVATIFEY 336
+C+ IMCSYN V+G+PAC L + R W F Y+ SDC AV I++Y
Sbjct: 277 TCVRDTDVGSIMCSYNSVSGIPACANEYLLSEVLRKHWNFNSDYHYVVSDCGAVTDIWQY 336
Query: 337 QNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
N+T T E +A+ L AG+D+ CG+ L+ +S + + K +D++L L+S +G
Sbjct: 337 HNFTDTEEAAASVALNAGVDLECGSSYLKLNES-LAANQTSVKVMDQSLARLYSALFTVG 395
Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNK-NAVSSLAII 455
F+G GKY KL DV T + + LA +AA +G+ LLKND LPL+ + S+A+I
Sbjct: 396 FFDG----GKYDKLDFSDVSTPDAQALAYEAAVEGMTLLKND-DLLPLDSPHKYKSVAVI 450
Query: 456 GPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCHDVPCNSDAGFHEAVR 512
GP N +QM G Y+G L+ LEA+ K +YA G N+ +GF EA+
Sbjct: 451 GPFANATTQMQGDYSG---DAPYLISPLEAFGDSRWKVNYALGTAMNNQNT-SGFEEALA 506
Query: 513 IAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLD 572
A K+D +I + G+D + E+E DR SL PG Q+ L+TS+++ SK P+++V GGG +D
Sbjct: 507 AANKSDLIIYLGGIDNSLESETLDRTSLTWPGNQLDLITSLSKLSK-PLVVVQFGGGQVD 565
Query: 573 VSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMN 631
S + I +++W GYP ++G AL +++ G +P GRLP+T YP S+ +V + D+N
Sbjct: 566 DSDILKNKDIQALVWAGYPSQSGGTALLDVLVGKRSPAGRLPVTQYPASYADQVNIFDIN 625
Query: 632 MRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNIL 691
+R S YPGR+Y++YTG V FG+GL YT + + + E T++ + NI
Sbjct: 626 LRPTDS--YPGRTYKWYTGKPVLPFGYGLHYTKFMFDW-----EKTLN------REYNIQ 672
Query: 692 QQTGSRLDYV--HIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEK 748
S + I++ T T+ V++ V N G +V +LF P K
Sbjct: 673 DLVASCRNSSGGPINDNTPLTT----VKVRVKNVGHKTSDYVSLLFLSSKNAGPAPRPNK 728
Query: 749 QLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHS--LTIE 806
L+ + R+ +A+GS +++ L+ A+++G ++ G + + L HS LT E
Sbjct: 729 SLVSYVRLLNIARGSDQVAELPLTLGSLARADENGSLVIFPGRYKIA---LDHSEELTFE 785
>gi|317156541|ref|XP_001825822.2| exo-1,4-beta-xylosidase xlnD [Aspergillus oryzae RIB40]
Length = 882
Score = 461 bits (1185), Expect = e-127, Method: Compositional matrix adjust.
Identities = 292/780 (37%), Positives = 425/780 (54%), Gaps = 80/780 (10%)
Query: 44 CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESL 103
C+ S+P C+TSLSI+ R SL+ LTL+EKI L D ++ RLG+P+YEWWSE+
Sbjct: 129 CQTQPLCSHPVCDTSLSIAERVDSLVKSLTLEEKILNLVDASAGSTRLGLPSYEWWSEAT 188
Query: 104 HGIASNGPGVNFN---GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA 160
HG+ S PGV F S TSFP +++AASF+ +L I + E RA N G +
Sbjct: 189 HGVGS-APGVQFTSKPANFSYATSFPAPILTAASFDDTLIRKIAEVIGREGRAFGNNGFS 247
Query: 161 GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFRE 220
G FWAPNIN FRDPRWGRGQETPGEDP+V Y FV QG++ K+
Sbjct: 248 GFDFWAPNINGFRDPRWGRGQETPGEDPLVAQNYIRNFVPGLQGDDPKN----------- 296
Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFR 280
K+V+ A CKH YDLE RY N T+QD D F PF+
Sbjct: 297 KQVI----------------ATCKHYAVYDLET----GRYGNNYNPTQQDLSDYFLAPFK 336
Query: 281 SCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKG---YITSDCDAVATIFEY 336
+C+ IMCSYN V+G+PAC L + R W F Y+ SDC AV I++Y
Sbjct: 337 TCVRDTDVGSIMCSYNSVSGIPACANEYLLSEVLRKHWNFNSDYHYVVSDCGAVTDIWQY 396
Query: 337 QNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
N+T T E +A+ L AG+D+ CG+ L+ +S + + K +D++L L+S +G
Sbjct: 397 HNFTDTEEAAASVALNAGVDLECGSSYLKLNES-LAANQTSVKVMDQSLARLYSALFTVG 455
Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNK-NAVSSLAII 455
F+G GKY KL DV T + + LA +AA +G+ LLKND LPL+ + S+A+I
Sbjct: 456 FFDG----GKYDKLDFSDVSTPDAQALAYEAAVEGMTLLKND-DLLPLDSPHKYKSVAVI 510
Query: 456 GPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCHDVPCNSDAGFHEAVR 512
GP N +QM G Y+G L+ LEA+ K +YA G N+ +GF EA+
Sbjct: 511 GPFANATTQMQGDYSG---DAPYLISPLEAFGDSRWKVNYALGTAMNNQNT-SGFEEALA 566
Query: 513 IAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLD 572
A K+D +I + G+D + E+E DR SL PG Q+ L+TS+++ SK P+++V GGG +D
Sbjct: 567 AANKSDLIIYLGGIDNSLESETLDRTSLTWPGNQLDLITSLSKLSK-PLVVVQFGGGQVD 625
Query: 573 VSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMN 631
S + I +++W GYP ++G AL +++ G +P GRLP+T YP S+ +V + D+N
Sbjct: 626 DSDILKNKDIQALVWAGYPSQSGGTALLDVLVGKRSPAGRLPVTQYPASYADQVNIFDIN 685
Query: 632 MRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNIL 691
+R S YPGR+Y++YTG V FG+GL YT + + + E T++ + NI
Sbjct: 686 LRPTDS--YPGRTYKWYTGKPVLPFGYGLHYTKFMFDW-----EKTLN------REYNIQ 732
Query: 692 QQTGSRLDYV--HIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEK 748
S + I++ T T+ V++ V N G +V +LF P K
Sbjct: 733 DLVASCRNSSGGPINDNTPLTT----VKVRVKNVGHKTSDYVSLLFLSSKNAGPAPRPNK 788
Query: 749 QLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHS--LTIE 806
L+ + R+ +A+GS +++ L+ A+++G ++ G + + L HS LT E
Sbjct: 789 SLVSYVRLLNIARGSDQVAELPLTLGSLARADENGSLVIFPGRYKIA---LDHSEELTFE 845
>gi|392560759|gb|EIW53941.1| glycoside hydrolase family 3 protein [Trametes versicolor FP-101664
SS1]
Length = 783
Score = 460 bits (1183), Expect = e-126, Method: Compositional matrix adjust.
Identities = 285/770 (37%), Positives = 414/770 (53%), Gaps = 62/770 (8%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
C+ + TRA +LI L T +E + + +PRLG+PAY WWSE LHG+A + PGV
Sbjct: 41 CDITKDPITRATALIGLWTDEELTSNTVNASPGVPRLGLPAYNWWSEGLHGVAQS-PGVT 99
Query: 115 F--NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
F +G S TSFPQ ++ A+F+ +L I + V+ E RA N G+AGL +W PNIN F
Sbjct: 100 FAPSGNFSHATSFPQPILMGAAFDDTLIQAIATIVSTEGRAFNNAGRAGLDYWTPNINPF 159
Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
+DPRWGRGQETPGEDP +S Y + QG G K K
Sbjct: 160 KDPRWGRGQETPGEDPFHLSQYVYNLILGLQG------------GLDPKPYFK------- 200
Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
+ A CKH AYDLE W R F+A++++QD + + PPF++C+ K + +M
Sbjct: 201 ------VVADCKHFAAYDLENWEGIVRNGFDAIVSQQDLSEFYLPPFQTCVRDAKVASVM 254
Query: 293 CSYNQVNGVPACLRGDLFQKA-RNEWGFKG--YITSDCDAVATIFEYQNYTKTHEDSAAG 349
CSYN VNG+P+C L Q R+ WGF ++TSDCDAV I YT +AA
Sbjct: 255 CSYNAVNGIPSCANSFLLQDVLRDHWGFTDDRWVTSDCDAVENILTPHKYTTDPAQAAAD 314
Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGK 409
L AG DI+CGT + A+ +G V D+ RA + ++ +RLG F+ DP Y +
Sbjct: 315 ALLAGTDIDCGTFSSTYLPEALQRGLVNSTDLRRAAIRQYASLVRLGYFD-DPAAQPYRQ 373
Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY 469
LG DV T + ++LA AA +GIVLLKND LP +K+ V LA+IGP N S + G Y
Sbjct: 374 LGWSDVNTPQAQQLAHTAAVEGIVLLKND-GVLPFSKH-VRKLALIGPWANATSLLQGSY 431
Query: 470 TGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLT 529
G+ S L+G + + Y G + N +GF AV ++AD V+ GLD T
Sbjct: 432 IGVAPYLVSPLQGAQEAGFEVEYVLGTNVTTQNDMSGFAAAVAAVRRADAVVFAGGLDET 491
Query: 530 QETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIG 589
E E DR+++ PG Q+ LV + R K P+I+ GGG LD + + +++I+W G
Sbjct: 492 VECEGTDRLNVTWPGNQLDLVAELERVGK-PLIVAQFGGGQLDDTALKHSKAVNAIIWGG 550
Query: 590 YPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK-VPMNDMNMRADSSRQYPGRSYRFY 648
YPG++G AL +I+ G P GRLP+T YP ++TK VPM DM++R ++ PGR+Y++Y
Sbjct: 551 YPGQSGGTALFDILTGKAAPAGRLPITQYPAAYTKQVPMTDMSLRPSATN--PGRTYKWY 608
Query: 649 TGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNIL---------QQTGSRLD 699
+GT V+ FG GL YT + + + + + + ++ GS Q++ + LD
Sbjct: 609 SGTPVFEFGFGLHYTTFVFSWAAPSAAAAVDSTASFGSLAKSYSISQLVAHGQESTAFLD 668
Query: 700 YVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTV 759
+D + VTN G V +V +LF P+KQL+ + RVH +
Sbjct: 669 LAPLDTFA----------VRVTNTGRVASDYVALLFVSGAFGPAPHPKKQLVAYTRVHGL 718
Query: 760 A-KGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMV---GELRHSLTI 805
A +GS V ++ A+K+G + + G + L + L H+ T+
Sbjct: 719 APRGSTVAQLPVT-LGAIARADKNGEKWVHPGTYTLALDTDAVLTHTFTL 767
>gi|70986056|ref|XP_748529.1| beta-xylosidase [Aspergillus fumigatus Af293]
gi|74668295|sp|Q4WFI6.1|BXLB_ASPFU RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
Flags: Precursor
gi|296439536|sp|B0Y0I4.1|BXLB_ASPFC RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
Flags: Precursor
gi|66846158|gb|EAL86491.1| beta-xylosidase, putative [Aspergillus fumigatus Af293]
gi|159128339|gb|EDP53454.1| beta-xylosidase [Aspergillus fumigatus A1163]
Length = 771
Score = 459 bits (1182), Expect = e-126, Method: Compositional matrix adjust.
Identities = 295/770 (38%), Positives = 413/770 (53%), Gaps = 73/770 (9%)
Query: 42 FP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWS 100
FP C C+TSL ++TRA+SL++ +T +EK+ N+ +PRLG+PAY WWS
Sbjct: 29 FPDCSSGPLSKLAVCDTSLDVTTRAQSLVNAMTFEEKVNNTQYNSPGVPRLGLPAYNWWS 88
Query: 101 ESLHGIASNGPGVNF--NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
E+LHG+A + PGV F +G S TSFPQ ++ A+F+ L + + V+ E RA N G
Sbjct: 89 EALHGVAGS-PGVEFADSGPFSYATSFPQPILLGATFDDDLIKQVATVVSTEGRAFGNAG 147
Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
++GL FW PNIN FRD RWGRGQETPGEDP+ VS Y V Q G G
Sbjct: 148 RSGLDFWTPNINPFRDARWGRGQETPGEDPLHVSRYVYHLVDGLQN----------GIGP 197
Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
+V+ A CKH AYDLE W R+SFNA ++ QD + + PP
Sbjct: 198 ANPKVV----------------ATCKHFAAYDLEDWNGVVRHSFNAEVSTQDLSEFYLPP 241
Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKARNE---WGFKG-YITSDCDAVATIF 334
F+SC + +MCSYN +NGVPAC L Q E W G +ITSDC A+ I+
Sbjct: 242 FKSCARDARVDAVMCSYNALNGVPACADSYLLQTILREHWKWDEPGRWITSDCGAIDDIY 301
Query: 335 EYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
N+T T ++AA L AG D++CGT ++ A D+G + +DRAL+ L+S ++
Sbjct: 302 NGHNFTTTPAEAAATALNAGTDLDCGTVFPKYLGQAADEGLYSNQTLDRALVRLYSSLVK 361
Query: 395 LGLFNGDPRKGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
LG F DP + + Y +G DV T + LA AA +GIVLLKNDK LPL A +LA
Sbjct: 362 LGYF--DPAEDQPYRSIGWTDVDTPAAEALAHKAAGEGIVLLKNDKT-LPL--KAKGTLA 416
Query: 454 IIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRI 513
+IGP N QM G Y G ++LL YA+G + NS AGF A+
Sbjct: 417 LIGPYANATKQMQGNYEGPAKYIRTLLWAATQAGYDVKYAAGTA-INTNSTAGFDAALSA 475
Query: 514 AKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
AK+AD V+ G+D T E E RDR ++ PG Q++L+ +++ K P+++V GGG +D
Sbjct: 476 AKQADVVVYAGGIDNTIEAEGRDRTTIAWPGNQVNLIDQLSKIGK-PLVVVQFGGGQVDD 534
Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNM 632
S ++ +++++LW GYP + G A+ +I+ G P GRLP+T YP + +VPM DM +
Sbjct: 535 SSLLSNPRVNALLWAGYPSQEGGSAIFDILTGKTAPAGRLPVTQYPADYVNQVPMTDMAL 594
Query: 633 RADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQ 692
R S+ PGR+YR+Y V FG GL YT + + +A+L + S KN
Sbjct: 595 RPGSNT--PGRTYRWYD-KAVLPFGFGLHYTTFKISWPRRALGPYNTAALVSRSPKN--- 648
Query: 693 QTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVS-QGTPEKQLI 751
V ID T FH+Q VTN G +V +LF + + P K L+
Sbjct: 649 --------VPIDRAAFDT---FHIQ--VTNTGKTTSDYVALLFLKTTDAGPKPYPLKTLV 695
Query: 752 GFDRVHTVAKGSK-----EISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
G+ R + G K E+S G L+ ++G +L G + L V
Sbjct: 696 GYTRAKQIKPGEKRSVDIEVSLG-----SLARTAENGDLVLYPGRYTLEV 740
>gi|344303941|gb|EGW34190.1| hypothetical protein SPAPADRAFT_65353 [Spathaspora passalidarum
NRRL Y-27907]
Length = 788
Score = 459 bits (1180), Expect = e-126, Method: Compositional matrix adjust.
Identities = 272/810 (33%), Positives = 424/810 (52%), Gaps = 83/810 (10%)
Query: 19 LQLIVVVNVIAFSNSKPVLNKPDF-----PCKPP------HFDSYP-----------FCN 56
++L+ + + F+N + + PD+ P PP H+ ++P CN
Sbjct: 4 IKLLATLALAIFANGETPPSSPDYNTEANPQLPPITFEAVHY-TFPDCQNGPLKHNAVCN 62
Query: 57 TSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF- 115
L RAK+++ L T+ E I + + + + RLG+P Y+WWSE+LHGIA + NF
Sbjct: 63 PHLPTEQRAKAVVDLFTVDELIANMGNTSPGVERLGLPPYQWWSEALHGIARS----NFT 118
Query: 116 -NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRD 174
+G S TSFPQ ++ +FN L+ +G+ + EARA N+G+AGL F++PNIN FRD
Sbjct: 119 ASGEYSHATSFPQPILMGGAFNNDLYKQVGNVIGTEARAFNNVGRAGLDFYSPNINPFRD 178
Query: 175 PRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRG 234
RWGRGQE E P++V YA+ +V+ QG G +S++
Sbjct: 179 ARWGRGQEVASESPVLVGNYALNYVQGLQG-----------------------GLDSNQN 215
Query: 235 DE-LMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMC 293
D+ L ++A CKH + YD+E W SR +NA+I++QD D + P F+SC+ KA+ MC
Sbjct: 216 DDTLQVAATCKHFVGYDMESWNQHSRLGYNAIISDQDLADFYLPTFQSCVRDAKAAGAMC 275
Query: 294 SYNQVNGVPACLRGDLFQKA-RNEWGFK-GYITSDCDAVATIFEYQNYTKTHEDSAAGVL 351
SYN VNGVPAC R+ + F+ G I SDCDA+ ++ Y + +AA +
Sbjct: 276 SYNAVNGVPACASEFFLNTVLRDGFDFQNGVIHSDCDAIYNVWNPHLYAQDLGGAAADAI 335
Query: 352 KAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLG 411
KAG+D+NCG + A+ + E I ++ +S +RLG F+ P+ KY K
Sbjct: 336 KAGVDVNCGDTYQNNLGYALGNKTINENQIRTSVTRQYSNLIRLGYFD-SPQTNKYRKYD 394
Query: 412 PDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTG 471
+DV T + +LA AA +GI LLKND LP NK V +A+IGP N +QM G Y G
Sbjct: 395 WNDVSTPQANQLAYQAAVEGIALLKNDGT-LPFNKQKVRKVAVIGPWANATTQMLGDYAG 453
Query: 472 IPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQE 531
P S L+G ++ + YA G + +G+ A+ AK AD ++ G+D + E
Sbjct: 454 TPPYMISPLQGAQSEGFQVEYALGTQ-INTTDTSGYTAALNAAKGADAIVYFGGIDNSVE 512
Query: 532 TEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYP 591
E DR SL PG Q+ LV+ ++ K+P++++ GGG +D + + + +++I++ GYP
Sbjct: 513 NEALDRESLAWPGNQLDLVSKLSGL-KKPLVVLQFGGGQIDDTEIKNNKNVNAIVYAGYP 571
Query: 592 GEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQ-YPGRSYRFYT 649
G++G A+ +I+ G + P GRL T YP S+ +VPM DM +R RQ YPGR++ +Y
Sbjct: 572 GQSGGTAIWDILSGKYAPAGRLTTTQYPASYADQVPMTDMTLR---PRQGYPGRTFMWYN 628
Query: 650 GTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ---TGSRLDYVHIDEV 706
G VY FG+GL YT +S +AP + G ++Q R YV +
Sbjct: 629 GEPVYEFGYGLHYTTFSASLANAP---------RGGHQSFNIEQVVAAAKRSQYVDTGLI 679
Query: 707 TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEI 766
T+ +++ N G + +L+++ P K L+ FD++H + G +
Sbjct: 680 TT-------FDVNIKNTGKTTSDYAALLYSKTTAGPGPHPNKILVSFDKLHQIHAGQTQT 732
Query: 767 SFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
+ L + +G + L G + V
Sbjct: 733 AKLPVTIGSLLQTDTNGNKWLYPGTYTFFV 762
>gi|440799679|gb|ELR20723.1| betaxylosidase [Acanthamoeba castellanii str. Neff]
Length = 748
Score = 458 bits (1178), Expect = e-126, Method: Compositional matrix adjust.
Identities = 284/781 (36%), Positives = 403/781 (51%), Gaps = 114/781 (14%)
Query: 43 PCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSES 102
P + PFCNTSL+ R L+S LTL + I Q+ A A+P LGIPAY WW+E
Sbjct: 4 PLQDAPLKDLPFCNTSLTAGQRTDDLVSRLTLDQLIGQMGHQAPAVPSLGIPAYNWWTEC 63
Query: 103 LHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGL 162
LHG+ + G N TSFP A+FN L + A++ EARA+ N G GL
Sbjct: 64 LHGVLTKC-GTN------CPTSFPAPCALGAAFNMKLIHKMARAISNEARALNNEGIGGL 116
Query: 163 TFWAPNI-----------------------NIFRDPRWGRGQETPGEDPMVVSAYAVEFV 199
FWAPNI +I RDPRWGR E PGEDP + + Y F+
Sbjct: 117 DFWAPNIKYSTQPTNKTRQESQLRNAMVCISINRDPRWGRNMEVPGEDPFMTAQYVAHFM 176
Query: 200 KSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSR 259
+ Q GE+S R +++ CKH AY LE W ++ R
Sbjct: 177 RGLQE-----------------------GEDS-RYPQVV--GTCKHFAAYSLEAWKDYDR 210
Query: 260 YSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWG 318
+ F+A++++ D +T+ P F+ CI +G+A IMCSYN VNGVP+C L + R+ W
Sbjct: 211 FMFDAIVSDYDFVETYLPAFKGCIVEGRARSIMCSYNSVNGVPSCANDFLLRTILRDSWS 270
Query: 319 FKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQE 378
F GY+ SDCDAV TI+ ++TKT E + A L AG D+NCG +H A +G+V E
Sbjct: 271 FDGYVVSDCDAVDTIYNNHHFTKTPEGACAVALHAGTDLNCGDFYQKHLGKAHSEGRVTE 330
Query: 379 KDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKND 438
++ A+ LF ++ LG+++ P + Y + P V + EH LAL AAR+ +VLL+N
Sbjct: 331 DEVRLAVKRLFRQRMELGMWD-PPAEQPYKQYPPSVVGSREHSDLALQAARESMVLLQNR 389
Query: 439 KKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV--------SKT 490
+ LPL K +V +A+IGP N M G Y G C + + Y+ +
Sbjct: 390 RGVLPLRK-SVRRVAVIGPNANATETMLGNYYGSRCHDGTYDCIVSPYLAIKAKLPQALV 448
Query: 491 HYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLV 550
Y GC DV + G EAV+ A+ AD IVV GL+ + E+E +DRV++ LPG Q L+
Sbjct: 449 TYNLGC-DVDSTNTTGIPEAVKAAQAADVAIVVLGLNTSVESEGKDRVAITLPGMQDHLI 507
Query: 551 TSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPG 610
S+ T+ V++++ GG V+ Q+ I+ YPGE G +A+A+++FGD+NPG
Sbjct: 508 KSIVATNTPTVVVMMHGGA---VAIEWIKDQVDGIVDAFYPGENGGQAIADVLFGDYNPG 564
Query: 611 ----------GRLPMTWYPESFTK-VPMNDMNMRADSSRQYPGRSYRFYTG-TQVYGFGH 658
GRLP+T P ++ VP+ +M+MRA S PGR+YR+YTG ++ FG
Sbjct: 565 DNKTDGTTLLGRLPVTVLPANYVDMVPLTNMSMRA--SGNNPGRTYRYYTGPAPLWEFGF 622
Query: 659 GLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQI 718
GLSYT + ++LS P + + + DE S ++
Sbjct: 623 GLSYTTFKTEWLSTPQPSALKSYAR--------------------DEAVS-------FRV 655
Query: 719 SVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
VTN G V G VV+ F +G P KQL F+RVH SKEI F P + L++
Sbjct: 656 RVTNVGPVAGDEVVLAFVTRDNADRG-PLKQLFAFERVHLNPGESKEIFFNTGP-DTLAV 713
Query: 779 A 779
A
Sbjct: 714 A 714
>gi|242786966|ref|XP_002480909.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
gi|218721056|gb|EED20475.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
Length = 757
Score = 457 bits (1176), Expect = e-125, Method: Compositional matrix adjust.
Identities = 284/769 (36%), Positives = 417/769 (54%), Gaps = 72/769 (9%)
Query: 52 YPFCNTSLSI---STRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIAS 108
YP LS + R KSLI LTL+EKI L D ++ RLG+P+YEWW+E+ HG+ S
Sbjct: 10 YPPAKQRLSAVINTQRVKSLIDSLTLEEKILNLVDASAGSERLGLPSYEWWNEATHGVGS 69
Query: 109 NGPGVNFNG---TVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFW 165
PGV F S TSFP +++AASF+ +L I S + E RA N G +G FW
Sbjct: 70 -APGVQFTEKPVNFSYATSFPAPILTAASFDDALVREIASVIGREGRAFGNNGFSGFDFW 128
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
APNIN FRDPRWGRGQETPGED VV +Y F+ QG++ + +K+V+
Sbjct: 129 APNINPFRDPRWGRGQETPGEDSFVVQSYIRNFIPGLQGDDPE-----------DKQVI- 176
Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
A CKH AYDLE RY + T+QD D F PF++C+
Sbjct: 177 ---------------ATCKHYAAYDLET----GRYGNDYNPTQQDLADYFLAPFKTCVRD 217
Query: 286 GKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKG---YITSDCDAVATIFEYQNYTK 341
IMC+YN V+G+P C L Q R W F Y+ SDC AV I++Y N+T
Sbjct: 218 TGVGSIMCAYNAVDGIPTCASEYLLDQVLRKHWNFTADYNYVVSDCGAVTDIWQYHNFTD 277
Query: 342 THEDSAAGVLKAGMDINCGTCMLRHTQS-AIDKGKVQEKDIDRALLNLFSVQLRLGLFNG 400
T E +A+ L AG+D+ CG+ L+ +S A ++ VQ +D+AL L+S +G F+G
Sbjct: 278 TEEAAASVSLNAGVDLECGSSYLKLNESLAANQTTVQA--LDQALTRLYSALFTVGFFDG 335
Query: 401 DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPL-NKNAVSSLAIIGPLV 459
GKY LG DV T E + LA +AA +G+ LLKNDK+ LP+ + + S+A+IGP
Sbjct: 336 ----GKYTALGFADVSTPEAQSLAYEAAVEGMTLLKNDKRLLPIRSSHKYKSVALIGPFA 391
Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADF 519
N +QM G Y+GIP S L + + + +YA G + + GF A+ A+K+D
Sbjct: 392 NATTQMQGDYSGIPPFLISPLEAFKGHDWEVNYAMGT-GINNQTTTGFASALAAAEKSDL 450
Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
VI + G+D + E E DR SL PG Q+ LVT +++ K P+I+V GGG LD S +
Sbjct: 451 VIYLGGIDNSIEAETLDRTSLTWPGNQLDLVTQLSKLHK-PLIVVQFGGGQLDDSALLQN 509
Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSR 638
+ +++W GYP ++G AL +++ G + GRLP+T YP S+ +V + D+N+R + S
Sbjct: 510 EGVQALVWAGYPSQSGGSALLDVLLGKRSIAGRLPVTQYPASYADQVSIFDINIRPNDS- 568
Query: 639 QYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRL 698
YPGR+Y++YTG V FG+GL YT + +++ + + NI Q S
Sbjct: 569 -YPGRTYKWYTGMPVVPFGYGLHYTKFEFEW-----------AQTLNHEYNIQQLVASCQ 616
Query: 699 DYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEKQLIGFDRVH 757
I + T T+++ HV+ N G +V +LF P P K L+ + R+H
Sbjct: 617 STGPISDNTPFTTVKAHVK----NIGPEASDYVGLLFLSSPDAGPAPRPNKSLVSYLRLH 672
Query: 758 TVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
+ GS+ ++ A+++G ++ G++ + + ++ SLT E
Sbjct: 673 NITSGSQGTLDLPLTLGSMARADENGNLVIFPGHYKIAL-DVSDSLTFE 720
>gi|392570764|gb|EIW63936.1| glycoside hydrolase family 3 protein [Trametes versicolor FP-101664
SS1]
Length = 781
Score = 456 bits (1174), Expect = e-125, Method: Compositional matrix adjust.
Identities = 288/796 (36%), Positives = 426/796 (53%), Gaps = 59/796 (7%)
Query: 27 VIAFSNSKPVLNKPDFP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNA 85
+ AFS + FP C + C+ + TRA +LIS+ T +E + +
Sbjct: 12 LTAFSVGAATVRAYGFPDCVNGPLKNNAVCDVTKDPITRATALISIWTDEELTNNTVNAS 71
Query: 86 SAIPRLGIPAYEWWSESLHGIASNGPGVNF--NGTVSSVTSFPQVLVSAASFNRSLWSNI 143
+PRLG+PAY WWSE LHG+A + PGV F +G S TSFPQ ++ A+F+ L I
Sbjct: 72 PGVPRLGLPAYNWWSEGLHGVAQS-PGVTFAPSGNFSYATSFPQPILMGAAFDDPLIQAI 130
Query: 144 GSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQ 203
+ V+ E RA N G+AGL +W PNIN F+DPRWGRGQETPGEDP +S Y + Q
Sbjct: 131 ATIVSTEGRAFNNAGRAGLDYWTPNINPFKDPRWGRGQETPGEDPFHLSQYVYNLILGLQ 190
Query: 204 GENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFN 263
G G K K + A CKH AYD++ W RY FN
Sbjct: 191 G------------GLDPKPYFK-------------VVADCKHFAAYDMDNWEGVVRYGFN 225
Query: 264 AMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKG- 321
A++++QD + + PPF++C+ K + +MCSYN VNG+P+C L Q R+ WGF
Sbjct: 226 AVVSQQDLSEFYLPPFQTCVRDAKVASVMCSYNAVNGIPSCANSFLLQDVLRDHWGFTDD 285
Query: 322 -YITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKD 380
++TSDCDAV IF NYT +AA L AG DI+CGT + A+ +G V D
Sbjct: 286 RWVTSDCDAVQNIFTPHNYTTDPAQAAADALLAGTDIDCGTFSSTYLPEALQRGLVNSTD 345
Query: 381 IDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKK 440
+ RA + ++ +RLG F+ DP Y +LG DV T + ++LA AA +G+VLLKND
Sbjct: 346 LRRAAIRQYASLVRLGYFD-DPAAQPYRQLGWSDVNTLQAQQLAHTAAVEGMVLLKND-G 403
Query: 441 FLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVP 500
LPL+K V LA+IGP N + G Y GI S ++G + + Y G +
Sbjct: 404 LLPLSKR-VRKLALIGPWANATRLLQGNYFGIAPYLVSPVQGAQQAGFEVEYVFGTNVTT 462
Query: 501 CNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRP 560
N +GF AV AK+AD V+ GLD T E E+ DR+++ PG Q+ LV + R K P
Sbjct: 463 RNDTSGFAAAVAAAKRADAVVFAGGLDETVEREEIDRLNVTWPGNQLDLVAELERVGK-P 521
Query: 561 VILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPE 620
+I+ GGG LD + + +++I+W GYPG++G AL +I+ G P GRLP+T YP
Sbjct: 522 LIVAQFGGGQLDNTALKRSKAVNAIIWGGYPGQSGGTALFDILTGKAAPAGRLPITQYPA 581
Query: 621 SFT-KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS 679
++ +VPM DM +R ++ PGR+Y++Y+GT V+ FG GL YT +++ + + + +
Sbjct: 582 AYAEQVPMTDMTLRPSATN--PGRTYKWYSGTPVFEFGFGLHYTTFAFAWAAPGAAADST 639
Query: 680 ASL----KAGSDKNIL---QQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVV 732
AS K+ S ++ Q++ + LD +D + VTN G V +V
Sbjct: 640 ASFGGPAKSYSISQLVAHGQESAAFLDLAPLDTFA----------VRVTNTGKVASDYVA 689
Query: 733 MLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNH 792
+LF P+K L+ + R+H +A + ++ A+++G + + G +
Sbjct: 690 LLFVSGSFGPAPHPKKTLVAYTRIHGLAPRGSTVGQLPVTLGAIARADENGEKWVHPGTY 749
Query: 793 VLMV---GELRHSLTI 805
L + +L H+ T+
Sbjct: 750 TLALDTDAKLTHTFTL 765
>gi|60729621|pir||JC7966 xylan 1,4-beta-xylosidase (EC 3.2.1.37) - Talaromyces emersonii
gi|21326570|gb|AAL32053.2|AF439746_1 beta-xylosidase [Rasamsonia emersonii]
Length = 796
Score = 456 bits (1174), Expect = e-125, Method: Compositional matrix adjust.
Identities = 281/755 (37%), Positives = 407/755 (53%), Gaps = 62/755 (8%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
CNTS RA++L+SL TL+E I + A +PRLG+P Y+ W+E+LHG+ N
Sbjct: 64 CNTSADPWARAEALVSLFTLEELINNTQNTAPGVPRLGLPQYQVWNEALHGLDR----AN 119
Query: 115 FN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
F+ G S TSFP ++S ASFNR+L + I S +A +ARA N G+ GL +APNIN F
Sbjct: 120 FSDSGEYSWATSFPMPILSMASFNRTLINQIASIIATQARAFNNAGRYGLDSYAPNINGF 179
Query: 173 RDPRWGRGQETPGEDPMVVS-AYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
R P WGRGQETPGED +S AYA E++ QG G E
Sbjct: 180 RSPLWGRGQETPGEDAFFLSSAYAYEYITGLQG---------------------GVDPE- 217
Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
+ + A KH YDLE WGN SR NA+IT+QD + + P F + K +
Sbjct: 218 ----HVKIVATAKHFAGYDLENWGNVSRLGSNAIITQQDLSEYYTPQFLASARYAKTRSL 273
Query: 292 MCSYNQVNGVPACLRGDLFQK-ARNEWGF--KGYITSDCDAVATIFEYQNYTKTHEDSAA 348
MCSYN VNGVP+C Q R + F GY++SDCDAV +F Y +AA
Sbjct: 274 MCSYNAVNGVPSCSNSFFLQTLLRESFNFVDDGYVSSDCDAVYNVFNPHGYALNQSGAAA 333
Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
L AG DI+CG M H + + V DI+++L L++ +RLG F+G+ Y
Sbjct: 334 DSLLAGTDIDCGQTMPWHLNESFYERYVSRGDIEKSLTRLYANLVRLGYFDGN--NSVYR 391
Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
L +DV T++ ++ +AA +GI LLKND LPL+K V S+A+IGP N QM G
Sbjct: 392 NLNWNDVVTTDAWNISYEAAVEGITLLKNDGT-LPLSKK-VRSIALIGPWANATVQMQGN 449
Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDL 528
Y G P S L +A +YA G ++ +S F EA+ AKK+D +I G+D
Sbjct: 450 YYGTPPYLISPLEAAKASGFTVNYAFGT-NISTDSTQWFAEAISAAKKSDVIIYAGGIDN 508
Query: 529 TQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWI 588
T E E +DR L PG Q+ L+ +++ K P++++ GGG +D S +A+ +++++W
Sbjct: 509 TIEAEGQDRTDLKWPGNQLDLIEQLSKVGK-PLVVLQMGGGQVDSSSLKANKNVNALVWG 567
Query: 589 GYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRF 647
GYPG++G AL +I+ G P GRL T YP + T+ P NDMN+R + S PG++Y +
Sbjct: 568 GYPGQSGGAALFDILTGKRAPAGRLVSTQYPAEYATQFPANDMNLRPNGSN--PGQTYIW 625
Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGS-RLDYVHIDEV 706
YTGT VY FGHGL YT + + + +A S +IL + Y +I+ V
Sbjct: 626 YTGTPVYEFGHGLFYTEF---------QESAAAGTNKTSTLDILDLVPTPHPGYEYIELV 676
Query: 707 TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG-SKE 765
+V + V N G + +LFA + P K L+GFDR+ T+ + +
Sbjct: 677 P-----FLNVTVDVKNVGHTPSPYTGLLFANTTAGPKPYPNKWLVGFDRLATIHPAKTAQ 731
Query: 766 ISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELR 800
++F V P ++ A+++G +++ G + L + R
Sbjct: 732 VTFPV-PLGAIARADENGNKVIFPGEYELALNNER 765
>gi|255590044|ref|XP_002535159.1| Thermostable beta-glucosidase B, putative [Ricinus communis]
gi|223523880|gb|EEF27223.1| Thermostable beta-glucosidase B, putative [Ricinus communis]
Length = 449
Score = 456 bits (1173), Expect = e-125, Method: Compositional matrix adjust.
Identities = 226/453 (49%), Positives = 303/453 (66%), Gaps = 8/453 (1%)
Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDD 414
MD+NCG + +T+SA++K KV E +IDRAL NLFS+++RLGLFNG+P K YG + D
Sbjct: 1 MDVNCGNYLKNYTKSAVEKKKVSESEIDRALHNLFSIRMRLGLFNGNPTKLPYGDISADQ 60
Query: 415 VCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPC 474
VC+ EH+ +AL+AAR GIVLLKN + LPL+K+ +SLAIIGP +N + + G Y G PC
Sbjct: 61 VCSQEHQAVALEAARDGIVLLKNSNQLLPLSKSKTTSLAIIGPNADNSTILVGNYAGPPC 120
Query: 475 SPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETED 534
+ +GL+ Y+ T Y GC V C+S A +A++IAK+AD V++V GLD TQE E+
Sbjct: 121 KTVTPFQGLQNYIKTTKYHPGCSTVACSS-AAIDQAIKIAKEADQVVLVMGLDQTQEREE 179
Query: 535 RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEA 594
DRV L+LPG+Q L+ SVAR +K+PV+LVL GGP+D+SFA+ D I ILW GYPGEA
Sbjct: 180 HDRVDLVLPGKQQELIISVARAAKKPVVLVLLCGGPVDISFAKYDRNIGGILWAGYPGEA 239
Query: 595 GAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVY 654
G ALAEIIFG+ NPGGRLP+TWYP+ FTKVPM DM MR S YPGR+YRFY G +V+
Sbjct: 240 GGIALAEIIFGNHNPGGRLPVTWYPQDFTKVPMTDMRMRPQPSSGYPGRTYRFYKGKKVF 299
Query: 655 GFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTS--CTSL 712
FG+GLSY+NYSY+ +S SL++ D+ + S + Y I E+ C
Sbjct: 300 EFGYGLSYSNYSYELVSVTQN---KISLRSSIDQK--AENSSPIGYKTISEIEEELCERS 354
Query: 713 RFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDP 772
+F V + V N G++ G H V+LFAR K G P K+LI F V A + EI + V+P
Sbjct: 355 KFSVTVRVKNQGEMTGKHPVLLFARQDKPGSGGPIKKLIAFQSVKLNAGENAEIEYKVNP 414
Query: 773 CEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
CE LS AN+ G ++ G+ L+VG+ + + I
Sbjct: 415 CEHLSRANEDGLMVMEEGSQYLLVGDKEYPINI 447
>gi|238492365|ref|XP_002377419.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
gi|220695913|gb|EED52255.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
Length = 775
Score = 456 bits (1172), Expect = e-125, Method: Compositional matrix adjust.
Identities = 284/781 (36%), Positives = 424/781 (54%), Gaps = 78/781 (9%)
Query: 44 CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESL 103
C+ S+P C+TSLSI+ R SL+ LTL+EKI L D ++ RLG+P+YEWWSE+
Sbjct: 22 CQTQPLCSHPVCDTSLSIAERVDSLVKSLTLEEKILNLVDASAGSTRLGLPSYEWWSEAT 81
Query: 104 HGIASNGPGVNFN---GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA 160
HG+ S PGV F S TSFP +++AASF+ +L I + E R N G +
Sbjct: 82 HGVGS-APGVQFTSKPANFSYATSFPAPILTAASFDDTLIRKIAEVIGREGRVFGNNGFS 140
Query: 161 GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFRE 220
G FWAPNIN FRDPRWGRGQETPGEDP+V Y FV QG++ K+
Sbjct: 141 GFDFWAPNINGFRDPRWGRGQETPGEDPLVAQNYIRNFVPGLQGDDPKN----------- 189
Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFR 280
K+V+ A CKH YDLE RY N T+QD + F PF+
Sbjct: 190 KQVI----------------ATCKHYAVYDLET----GRYGNNYNPTQQDLSEYFLAPFK 229
Query: 281 SCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKG---YITSDCDAVATIFEY 336
+C+ IMCSYN V+G+PAC L + R W F Y+ SDC AV I++Y
Sbjct: 230 TCVRDTDVGSIMCSYNSVSGIPACANEYLLDEVLRKHWNFNSDYHYVVSDCGAVTDIWQY 289
Query: 337 QNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
N+T T E +A+ L AG+D+ CG+ L+ +S + + K +D++L L+S +G
Sbjct: 290 HNFTDTEEAAASVALNAGVDLECGSSYLKLNES-LAANQTSVKVMDQSLARLYSALFTVG 348
Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNK-NAVSSLAII 455
F+G GKY KL DV T + + LA +AA +G+ LLKND LPL+ + S+A+I
Sbjct: 349 FFDG----GKYDKLDFSDVSTPDAQALAYEAAVEGMTLLKND-DLLPLDSPHKYKSVAVI 403
Query: 456 GPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCHDVPCNSDAGFHEAVR 512
GP N +QM G Y+G L+ LEA+ K +YA G + + +GF EA+
Sbjct: 404 GPFANATTQMQGDYSG---DAPYLISPLEAFGDSRWKVNYALGTA-INNQNTSGFEEALA 459
Query: 513 IAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLD 572
A K+D +I + G+D + E+E DR SL PG Q+ L+TS+++ SK P+++V GGG +D
Sbjct: 460 AANKSDLIIYLGGIDNSLESETLDRTSLAWPGNQLDLITSLSKLSK-PLVVVQFGGGQVD 518
Query: 573 VSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMN 631
S + I +++W GYP ++G AL +++ G +P GRLP+T YP S+ +V + D+N
Sbjct: 519 DSAILKNKDIQALVWAGYPSQSGGTALLDVLVGKRSPAGRLPVTQYPASYADQVNIFDIN 578
Query: 632 MRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNIL 691
+R + YPGR+Y++YTG V FG+GL YT + + + E T++ + NI
Sbjct: 579 LRP--TDLYPGRTYKWYTGKPVLPFGYGLHYTKFMFDW-----EKTLN------REYNIQ 625
Query: 692 QQTGSRLDYV--HIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEK 748
S + I++ T T+++ V+ N G +V +LF P K
Sbjct: 626 DLVASCRNSSGGPINDNTPLTTVKARVK----NVGHKTSDYVSLLFLSSKNAGPAPRPNK 681
Query: 749 QLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG---ELRHSLTI 805
L+ + R+ +A+GS +++ L+ A+++G ++ G + + + EL T+
Sbjct: 682 SLVSYVRLLNIARGSDQVAELPLTLGSLARADENGSLVIFPGRYKIALDNSEELTFEFTL 741
Query: 806 E 806
+
Sbjct: 742 K 742
>gi|403412992|emb|CCL99692.1| predicted protein [Fibroporia radiculosa]
Length = 760
Score = 454 bits (1169), Expect = e-125, Method: Compositional matrix adjust.
Identities = 281/758 (37%), Positives = 407/758 (53%), Gaps = 62/758 (8%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
C+TS S RA +LI L TL+EKI + + +PRLG+PAY+WW E+LHG+A + PGV
Sbjct: 34 CDTSASPVARATALIGLFTLEEKINNTGNTSPGVPRLGLPAYQWWQEALHGVAES-PGVI 92
Query: 115 F--NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
F G S TSFPQ ++ A+F+ L + + + V+ EARA N ++GL FW PNIN F
Sbjct: 93 FAETGEYSYATSFPQPILMGAAFDDELINQVATIVSTEARAFNNANRSGLDFWTPNINPF 152
Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
+DPRWGRGQETPGEDP + +Y + QG G KR++
Sbjct: 153 KDPRWGRGQETPGEDPFHLQSYVYNLITGLQG----------GLDPEYKRIV-------- 194
Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
A CKH YDLE W RY F+A+I+ QD + + F +C M
Sbjct: 195 --------ATCKHYAGYDLENWEGNVRYGFDALISIQDLSEFYTRSFETCARDANVGAFM 246
Query: 293 CSYNQVNGVPACLRGDLFQKA-RNEWGFKG---YITSDCDAVATIFEYQNYTKTHEDSAA 348
CSYN VNGVP+C L Q R W + +ITSDCDA+ I+E Y T E + A
Sbjct: 247 CSYNAVNGVPSCANSYLLQDILRGHWNWTSDDQWITSDCDAIQNIYEPHYYAPTRELTVA 306
Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-Y 407
L AG D++CGT + +A D+G E +DRAL+ ++ ++LG F DP + + Y
Sbjct: 307 DALNAGADLDCGTYYPENLGAAYDEGLFAESTLDRALIRQYASLVKLGYF--DPAENQPY 364
Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
++G +V T E ++LA AA +GI L+KND LPL+ ++ SLA+IGP N +QM G
Sbjct: 365 RQIGWANVSTPEAEELAYRAAVEGITLIKNDGT-LPLSP-SIKSLALIGPWANATTQMQG 422
Query: 468 GYTGIPCSPKSLLRGLEAYVSKTHYASGCH-DVPCNSDAGFHEAVRIAKKADFVIVVAGL 526
Y G P S L EA +Y+ G D P S F A A+ AD +I + G+
Sbjct: 423 NYYGQPPYLISPLMAAEALNYTVYYSPGPGVDDPTTSS--FPAAFAAAQAADAIIYIGGI 480
Query: 527 DLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSIL 586
D T E E DR +L PG Q + +++ K P++++ GGG +D S ++ +++++
Sbjct: 481 DTTVEAEAMDRYTLDWPGVQPDFIDQLSQFGK-PLVVLQMGGGQVDDSCLLPNTNVNALI 539
Query: 587 WIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSY 645
W GYPG++G AL +II G+ P GRLP T YP + +V M DM++R ++ PGR+Y
Sbjct: 540 WGGYPGQSGGTALMDIIVGNAAPAGRLPTTQYPLDYVYQVAMTDMSLRPSATN--PGRTY 597
Query: 646 RFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDE 705
+YTGT + FG GL YTN+S + LS PS + + G+ + + + LD +
Sbjct: 598 MWYTGTPIVEFGFGLHYTNFSAE-LSQPSAPSYDIASLVGACEGV-----AHLDLCAFES 651
Query: 706 VTSCTSLRFHVQISVTNAGD-VDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSK 764
T ++VTN G V +V +LF P K L +DR+HT+A S
Sbjct: 652 YT----------VNVTNIGSKVTSDYVALLFVAGEHGPAPIPNKVLAAYDRLHTIAPLSS 701
Query: 765 EISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHS 802
+ + LS +++G R+L G + L++ L +
Sbjct: 702 QQATLNLTLGSLSRVDEYGNRVLYPGEYTLILDVLPQA 739
>gi|451992719|gb|EMD85198.1| glycoside hydrolase family 3 protein [Cochliobolus heterostrophus
C5]
Length = 781
Score = 454 bits (1167), Expect = e-124, Method: Compositional matrix adjust.
Identities = 277/770 (35%), Positives = 401/770 (52%), Gaps = 59/770 (7%)
Query: 36 VLNKPDFP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIP 94
V + +FP CK + C+ S S RAKSL++L TL+EKI S++A + RLG+P
Sbjct: 17 VFAQGNFPDCKNGPLKNETICDPSASTLARAKSLVALYTLEEKINATSNSAPGVARLGVP 76
Query: 95 AYEWWSESLHGIASNGPGVNF--NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEAR 152
Y+WW+E LHGIA GP +F G S TSFPQ ++ A+F+ L + + ++ EAR
Sbjct: 77 PYQWWNEGLHGIA--GPFTSFAKQGDYSYSTSFPQPILMGAAFDDDLITEVAKVISTEAR 134
Query: 153 AMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDG 212
A N + GL FW PNIN FRDPRWGRGQETPGED +S+Y + QG +D
Sbjct: 135 AFNNANRTGLDFWTPNINPFRDPRWGRGQETPGEDSYHLSSYVKALIHGLQGN--ATDP- 191
Query: 213 GIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTE 272
+RV+ A CKH YD+E W RY + I++QD
Sbjct: 192 -------YRRVV----------------ATCKHYAGYDIENWNGNLRYQNDVQISQQDLV 228
Query: 273 DTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKG---YITSDCD 328
+ + PF +C+ Q MCSYN VNG P C L Q R WG+ ++TSDCD
Sbjct: 229 EYYLAPFEACV-QANVGAFMCSYNAVNGAPPCADPYLLQTVLREHWGWSSDDHWVTSDCD 287
Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
A+ ++ ++ T E +AA L AG D++CGT + H A+ +G E +D+AL+
Sbjct: 288 AIQNVYLPHQWSSTREGAAADSLNAGTDLDCGTYLQTHLPGAVKQGLTDETTLDKALIRQ 347
Query: 389 FSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNA 448
+S ++LG F+ P Y +LG D V TS + LAL AA +GIVLLKND LP+N +
Sbjct: 348 YSSLIKLGYFDA-PENQPYRQLGFDAVATSASQALALKAAEEGIVLLKNDG-VLPINLGS 405
Query: 449 VSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFH 508
+ I G N SQ+ G Y G+ S L L+ YA + G
Sbjct: 406 -KQVGIYGDWANATSQLQGNYFGVAKFLTSPLMALQNLGVDVKYAGNLPGGQGDPTTGAW 464
Query: 509 EAVR-IAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTG 567
++ + +D I V G+D E+EDRDR L L G Q+ ++ +A T K PVI+V+ G
Sbjct: 465 SSLSGVITTSDVHIWVGGIDNGVESEDRDRSWLTLTGGQLDVIGQLADTGK-PVIVVIMG 523
Query: 568 GGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVP 626
GG +D S + +IS++LW GYPG+ G A+ I+ G P GRLP T YP + ++VP
Sbjct: 524 GGQIDTSPLIRNPKISAVLWAGYPGQDGGTAIVNILTGKAAPAGRLPQTQYPSKYVSEVP 583
Query: 627 MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGS 686
M DM MR S + PGR+Y++YTG ++ FG+GL YTN+S + P + + L G
Sbjct: 584 MTDMAMRP--SDKNPGRTYKWYTGEPIFEFGYGLHYTNFSASITNQPKQSYAISDLVKGC 641
Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
+ TG L+ +T +SV N G + +V + F + P
Sbjct: 642 NS-----TGGFLERCPFTGIT----------VSVQNTGKISSDYVTLGFLTGSFGPKPYP 686
Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
+K L+ +DR+ +A GS + L+ ++ G ++L G++ L +
Sbjct: 687 KKSLVAYDRLFNIAAGSSSTATLNLTLASLARVDESGNKVLYPGDYELQI 736
>gi|238508313|ref|XP_002385353.1| beta-xylosidase, putative [Aspergillus flavus NRRL3357]
gi|296439537|sp|B8NYD8.1|BXLB_ASPFN RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
Flags: Precursor
gi|220688872|gb|EED45224.1| beta-xylosidase, putative [Aspergillus flavus NRRL3357]
Length = 776
Score = 454 bits (1167), Expect = e-124, Method: Compositional matrix adjust.
Identities = 275/752 (36%), Positives = 413/752 (54%), Gaps = 65/752 (8%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
C+TSL +RAKSL++ +TL+EKI ++S PRLG+PAY WW+E+LHG+A G GV+
Sbjct: 41 CDTSLDPVSRAKSLVAAMTLEEKINNTKYDSSGAPRLGLPAYNWWNEALHGVAE-GHGVS 99
Query: 115 FN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
F+ G S TSFP ++ A+F+ L + + ++ EARA N G AGL +W PNIN F
Sbjct: 100 FSDSGNFSYATSFPMPILLGAAFDDDLVKQVATVISTEARAFANGGHAGLDYWTPNINPF 159
Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
RDPRWGRGQETPGEDP+ +S Y V Q G G E
Sbjct: 160 RDPRWGRGQETPGEDPLHLSRYVYHLVDGLQ---------------------DGIGPERP 198
Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
+ + A CKH AYDLE W RY+F+A+++ QD + + P F++C K +M
Sbjct: 199 K-----VVATCKHFAAYDLENWEGIERYAFDAVVSPQDLSEYYLPSFKTCTRDAKVDAVM 253
Query: 293 CSYNQVNGVPACLRGDLFQK-ARNEWGFK---GYITSDCDAVATIFEYQNYTKTHEDSAA 348
CSYN +NG+P C L Q R WG++ ++T DC A+ I+ +Y +AA
Sbjct: 254 CSYNSLNGIPTCADRWLLQTLLREHWGWEQTGHWVTGDCGAIDNIYADHHYVADGAHAAA 313
Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-Y 407
L AG D++CG+ + +SA+ +G + ++ AL+ L+S ++LG F DP + Y
Sbjct: 314 AALNAGTDLDCGSVFPEYLRSALQQGLYNNQTLNNALIRLYSSLVKLGYF--DPADDQPY 371
Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
+G ++V T ++LA A +GIV+LKND LPL N ++AIIGP N +Q+ G
Sbjct: 372 RSIGWNEVFTPAAEELAHKATVEGIVMLKNDGT-LPLKSNG--TVAIIGPFANATTQLQG 428
Query: 468 GYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLD 527
Y G P ++L+ K ++ G D+ NS AGF EA+ AK+AD VI G+D
Sbjct: 429 NYEGPPKYIRTLIWAAVHNGYKVKFSQGT-DINSNSSAGFAEAISAAKEADTVIYAGGID 487
Query: 528 LTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILW 587
T E E +DR +++ PG Q+ L+ ++ K P+I+V GGG +D S A++ + ++LW
Sbjct: 488 NTIEKESQDRTTIVWPGNQLDLIEQLSDLEK-PLIVVQFGGGQVDDSSLLANAGVGALLW 546
Query: 588 IGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYR 646
GYP +AG A+ +I+ G P GRLP+T YP S+ +VPM DM +R S+ PGR+YR
Sbjct: 547 AGYPSQAGGAAVFDILTGKSAPAGRLPVTQYPASYVDEVPMTDMTLRPGSNN--PGRTYR 604
Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEV 706
+Y V FG GL YT ++ + A + S+ +G+ T + +D
Sbjct: 605 WYD-KAVLPFGFGLHYTTFNVSWNHAEYGPYNTDSVASGT-------TNAPVD------- 649
Query: 707 TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVS-QGTPEKQLIGFDRVHTVAKG-SK 764
T L I+VTN G+V ++ +LF +V + P K L+G+ R + G S+
Sbjct: 650 ---TELFDTFSITVTNTGNVASDYIALLFLTADRVGPEPYPIKTLVGYSRAKGIEPGQSQ 706
Query: 765 EISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
++ V ++ ++G +L G++ L V
Sbjct: 707 QVKLDVS-VGSVARTAENGDLVLYPGSYKLEV 737
>gi|115436902|ref|XP_001217674.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|121734342|sp|Q0CB82.1|BXLB_ASPTN RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
Flags: Precursor
gi|114188489|gb|EAU30189.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 765
Score = 451 bits (1161), Expect = e-124, Method: Compositional matrix adjust.
Identities = 271/759 (35%), Positives = 400/759 (52%), Gaps = 78/759 (10%)
Query: 18 VLQLIVVVNVIAFSNSKPVLNKPDFP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQE 76
+L IV + + F+ S FP C+ C+T+L TRA++L++ +TL+E
Sbjct: 12 ILACIVSLTQLGFAQSP-------FPDCENGPLSKNAVCDTTLDPVTRAQALLAAMTLEE 64
Query: 77 KIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF--NGTVSSVTSFPQVLVSAAS 134
KI N+ +PRLG+PAY WWSE+LHG+A + PGV+F +G S TSFP + A+
Sbjct: 65 KINNTQYNSPGVPRLGLPAYNWWSEALHGVAGS-PGVHFADSGNFSYATSFPSPITLGAA 123
Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAY 194
F+ L I + + E RA N G AGL +W PNIN +RDPRWGRGQETPGEDP S Y
Sbjct: 124 FDDDLVKQIATVIGTEGRAFGNAGHAGLDYWTPNINPYRDPRWGRGQETPGEDPFHTSRY 183
Query: 195 AVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKW 254
+ Q G G E + + A CKH YD+E W
Sbjct: 184 VYHLIDGLQ---------------------DGIGPEKPK-----IVATCKHFAGYDIEDW 217
Query: 255 GNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA- 313
RY+F+A+I++QD + + PPF++C K +MCSYN VNG+P C L Q
Sbjct: 218 EGNERYAFDAVISDQDMAEYYFPPFKTCTRDAKVDAVMCSYNSVNGIPTCADPWLLQTVL 277
Query: 314 RNEWGFKG---YITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSA 370
R W ++G ++TSDC A+ I++ Y +AA + AG D++CG+ + SA
Sbjct: 278 REHWEWEGVGHWVTSDCGAIDNIYKDHKYVADGAHAAAVAVNAGTDLDCGSVYPQFLGSA 337
Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-YGKLGPDDVCTSEHKKLALDAAR 429
I +G + + +DRAL L+S ++LG F DP + Y +G DV T + ++LA AA
Sbjct: 338 ISQGLLGNRTLDRALTRLYSSLVKLGYF--DPAADQPYRSIGWSDVATPDAEQLAHTAAV 395
Query: 430 QGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSK 489
+G VLLKND LPL KN ++AI+GP N +Q+ G Y G ++L K
Sbjct: 396 EGTVLLKNDGT-LPLKKNG--TVAIVGPYANATTQLQGNYEGTAKYIHTMLSAAAQQGYK 452
Query: 490 THYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSL 549
YA G + NS +GF +A+ AK +D VI G+D E E DR S+ PG Q+ L
Sbjct: 453 VKYAPGT-GINSNSTSGFEQALNAAKGSDLVIYFGGIDHEVEAEALDRTSIAWPGNQLDL 511
Query: 550 VTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNP 609
+ ++ K+P+++V GGG +D S +++ ++ +LW GYP +AG A+ +I+ G P
Sbjct: 512 IQQLSDL-KKPLVVVQFGGGQVDDSSLLSNAGVNGLLWAGYPSQAGGAAVFDILTGKTAP 570
Query: 610 GGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYK 668
GRLP+T YPE + +VPM DMN+R S PGR+YR+Y V FG+G+ YT +
Sbjct: 571 AGRLPVTQYPEEYVDQVPMTDMNLRPGPSN--PGRTYRWYD-KAVIPFGYGMHYTTFDVS 627
Query: 669 FLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDG 728
+ +A++KA +N + +T S + V N G V
Sbjct: 628 WKRKNYGPYNTAAVKA---ENAVLETFS---------------------LQVKNTGKVTS 663
Query: 729 SHVVMLFARVPKVS-QGTPEKQLIGFDRVHTVAKGSKEI 766
+V ++F + P K L+G+ RV + G +++
Sbjct: 664 DYVALVFLTTTDAGPKPYPIKTLVGYQRVKAIRPGERKV 702
>gi|121797681|sp|Q2TYT2.1|BXLB_ASPOR RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
Flags: Precursor
gi|83775471|dbj|BAE65591.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 797
Score = 451 bits (1161), Expect = e-124, Method: Compositional matrix adjust.
Identities = 275/752 (36%), Positives = 411/752 (54%), Gaps = 65/752 (8%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
C+TSL +RAKSL++ +TL+EKI ++S PRLG+PAY WW+E+LHG+A G GV+
Sbjct: 62 CDTSLDPVSRAKSLVAAMTLEEKINNTKYDSSGAPRLGLPAYNWWNEALHGVAE-GHGVS 120
Query: 115 FN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
F+ G S TSFP ++ A+F+ L + + ++ EARA N G AGL +W PNIN F
Sbjct: 121 FSDSGNFSYATSFPMPILLGAAFDDDLVKQVATVISTEARAFANGGHAGLDYWTPNINPF 180
Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
RDPRWGRGQETPGEDP+ +S Y V Q G G E
Sbjct: 181 RDPRWGRGQETPGEDPLHLSRYVYHLVDGLQ---------------------DGIGPERP 219
Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
+ + A CKH AYDLE W RY+F+A+++ QD + + P F++C K +M
Sbjct: 220 K-----VVATCKHFAAYDLENWEGIERYAFDAVVSPQDLSEYYLPSFKTCTRDAKVDAVM 274
Query: 293 CSYNQVNGVPACLRGDLFQK-ARNEWGFK---GYITSDCDAVATIFEYQNYTKTHEDSAA 348
CSYN +NG+P C L Q R WG++ ++T DC A+ I+ +Y +AA
Sbjct: 275 CSYNSLNGIPTCADRWLLQTLLREHWGWEQTGHWVTGDCGAIDNIYADHHYVADGAHAAA 334
Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-Y 407
L AG D++CG+ + SA+ +G + ++ AL+ L+S ++LG F DP + Y
Sbjct: 335 AALNAGTDLDCGSVFPEYLGSALQQGLYNNQTLNNALIRLYSSLVKLGYF--DPADDQPY 392
Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
+G ++V T ++LA A +GIV+LKND LPL N ++AIIGP N +Q+ G
Sbjct: 393 RSIGWNEVFTPAAEELAHKATVEGIVMLKNDGT-LPLKSNG--TVAIIGPFANATTQLQG 449
Query: 468 GYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLD 527
Y G P ++L+ K ++ G D+ NS AGF EA+ AK+AD VI G+D
Sbjct: 450 NYEGPPKYIRTLIWAAVHNGYKVKFSQGT-DINSNSSAGFAEAISAAKEADTVIYAGGID 508
Query: 528 LTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILW 587
T E E +DR +++ PG Q+ L+ ++ K P+I+V GGG +D S A++ + ++LW
Sbjct: 509 NTIEKESQDRTTIVWPGNQLDLIEQLSDLEK-PLIVVQFGGGQVDDSSLLANAGVGALLW 567
Query: 588 IGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYR 646
GYP +AG A+ +I+ G P GRLP+T YP S+ +VPM DM +R S+ PGR+YR
Sbjct: 568 AGYPSQAGGAAVFDILTGKSAPAGRLPVTQYPASYVDEVPMTDMTLRPGSNN--PGRTYR 625
Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEV 706
+Y V FG GL YT ++ + A + S+ +G+ T + +D
Sbjct: 626 WYD-KAVLPFGFGLHYTTFNVSWNHAEYGPYNTDSVASGT-------TNAPVD------- 670
Query: 707 TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVS-QGTPEKQLIGFDRVHTVAKG-SK 764
T L I+VTN G+V ++ +LF V + P K L+G+ R + G S+
Sbjct: 671 ---TELFDTFSITVTNTGNVASDYIALLFLTADGVGPEPYPIKTLVGYSRAKGIEPGQSQ 727
Query: 765 EISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
++ V ++ ++G +L G++ L V
Sbjct: 728 QVKLDVS-VGSVARTAENGDLVLYPGSYKLEV 758
>gi|317158006|ref|XP_001826724.2| exo-1,4-beta-xylosidase xlnD [Aspergillus oryzae RIB40]
Length = 776
Score = 451 bits (1160), Expect = e-124, Method: Compositional matrix adjust.
Identities = 275/752 (36%), Positives = 411/752 (54%), Gaps = 65/752 (8%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
C+TSL +RAKSL++ +TL+EKI ++S PRLG+PAY WW+E+LHG+A G GV+
Sbjct: 41 CDTSLDPVSRAKSLVAAMTLEEKINNTKYDSSGAPRLGLPAYNWWNEALHGVAE-GHGVS 99
Query: 115 FN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
F+ G S TSFP ++ A+F+ L + + ++ EARA N G AGL +W PNIN F
Sbjct: 100 FSDSGNFSYATSFPMPILLGAAFDDDLVKQVATVISTEARAFANGGHAGLDYWTPNINPF 159
Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
RDPRWGRGQETPGEDP+ +S Y V Q G G E
Sbjct: 160 RDPRWGRGQETPGEDPLHLSRYVYHLVDGLQ---------------------DGIGPERP 198
Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
+ + A CKH AYDLE W RY+F+A+++ QD + + P F++C K +M
Sbjct: 199 K-----VVATCKHFAAYDLENWEGIERYAFDAVVSPQDLSEYYLPSFKTCTRDAKVDAVM 253
Query: 293 CSYNQVNGVPACLRGDLFQK-ARNEWGFK---GYITSDCDAVATIFEYQNYTKTHEDSAA 348
CSYN +NG+P C L Q R WG++ ++T DC A+ I+ +Y +AA
Sbjct: 254 CSYNSLNGIPTCADRWLLQTLLREHWGWEQTGHWVTGDCGAIDNIYADHHYVADGAHAAA 313
Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-Y 407
L AG D++CG+ + SA+ +G + ++ AL+ L+S ++LG F DP + Y
Sbjct: 314 AALNAGTDLDCGSVFPEYLGSALQQGLYNNQTLNNALIRLYSSLVKLGYF--DPADDQPY 371
Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
+G ++V T ++LA A +GIV+LKND LPL N ++AIIGP N +Q+ G
Sbjct: 372 RSIGWNEVFTPAAEELAHKATVEGIVMLKNDGT-LPLKSNG--TVAIIGPFANATTQLQG 428
Query: 468 GYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLD 527
Y G P ++L+ K ++ G D+ NS AGF EA+ AK+AD VI G+D
Sbjct: 429 NYEGPPKYIRTLIWAAVHNGYKVKFSQGT-DINSNSSAGFAEAISAAKEADTVIYAGGID 487
Query: 528 LTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILW 587
T E E +DR +++ PG Q+ L+ ++ K P+I+V GGG +D S A++ + ++LW
Sbjct: 488 NTIEKESQDRTTIVWPGNQLDLIEQLSDLEK-PLIVVQFGGGQVDDSSLLANAGVGALLW 546
Query: 588 IGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYR 646
GYP +AG A+ +I+ G P GRLP+T YP S+ +VPM DM +R S+ PGR+YR
Sbjct: 547 AGYPSQAGGAAVFDILTGKSAPAGRLPVTQYPASYVDEVPMTDMTLRPGSNN--PGRTYR 604
Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEV 706
+Y V FG GL YT ++ + A + S+ +G+ T + +D
Sbjct: 605 WYD-KAVLPFGFGLHYTTFNVSWNHAEYGPYNTDSVASGT-------TNAPVD------- 649
Query: 707 TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVS-QGTPEKQLIGFDRVHTVAKG-SK 764
T L I+VTN G+V ++ +LF V + P K L+G+ R + G S+
Sbjct: 650 ---TELFDTFSITVTNTGNVASDYIALLFLTADGVGPEPYPIKTLVGYSRAKGIEPGQSQ 706
Query: 765 EISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
++ V ++ ++G +L G++ L V
Sbjct: 707 QVKLDVS-VGSVARTAENGDLVLYPGSYKLEV 737
>gi|242813865|ref|XP_002486253.1| beta-xylosidase, putative [Talaromyces stipitatus ATCC 10500]
gi|218714592|gb|EED14015.1| beta-xylosidase, putative [Talaromyces stipitatus ATCC 10500]
Length = 893
Score = 450 bits (1157), Expect = e-123, Method: Compositional matrix adjust.
Identities = 280/766 (36%), Positives = 409/766 (53%), Gaps = 62/766 (8%)
Query: 51 SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
S P C+TSL TRAK L+ +T +EK+Q + + RLG+PAY+WW+E+LHG+A +
Sbjct: 161 SNPICDTSLDPLTRAKGLVDAMTFEEKVQNTQNGSPGAARLGLPAYQWWNEALHGVAGS- 219
Query: 111 PGVNF--NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPN 168
PGV F +G S TSFPQ ++ +A+F+ +L +G+ V++E RA N G AGL FW PN
Sbjct: 220 PGVTFQPSGNFSYATSFPQPILMSAAFDDALIKEVGTVVSIEGRAFNNYGNAGLDFWTPN 279
Query: 169 INIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFG 228
IN FRDPRWGRGQETPGEDP ++ Y V Q G RV+
Sbjct: 280 INPFRDPRWGRGQETPGEDPYHIARYVYNLVDGLQN----------GIAPANPRVV---- 325
Query: 229 EESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKA 288
A CKH YD+E W SRY FNA+I+ QD + + PPF+SC +
Sbjct: 326 ------------ATCKHFAGYDIEDWEGNSRYGFNAIISTQDLSEYYLPPFKSCARDAQV 373
Query: 289 SCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKG---YITSDCDAVATIFEYQNYTKTHE 344
IMCSYN VNG+P C L R+ W + ++TSDCDAV I+ YT +
Sbjct: 374 DAIMCSYNAVNGIPTCADSYLLDTILRDHWNWNQTGHWVTSDCDAVDNIYSDHRYTSSLA 433
Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
+AA L AG +++CGT M + +A + + ++ AL+ L+S +RLG F D
Sbjct: 434 AAAADALNAGTNLDCGTTMSNNLAAAAAQDLFKNATLNSALVYLYSSLVRLGWF--DSED 491
Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKND-KKFLPLNKNAVSSLAIIGPLVNNIS 463
+Y LG DV T+ ++LA AA +GIVLLKND KK LPL+++ ++A+IGP N +
Sbjct: 492 SQYSSLGWSDVGTTASQQLANRAAVEGIVLLKNDHKKVLPLSQHG-QTIALIGPYANATT 550
Query: 464 QMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVV 523
Q+ G Y G P ++L+ G E Y +G + +GF AV AK AD VI
Sbjct: 551 QLQGNYYGTPAYIRTLVWGAEQMGYTVQYEAGT-GINSTDTSGFAAAVAAAKTADIVIYA 609
Query: 524 AGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQIS 583
G+D + E E DR ++ G Q+ L+ +++ K P++++ GGG LD S + ++
Sbjct: 610 GGIDNSIEAEAMDRNTIAWTGNQLQLIDQLSQVGK-PLVVLQFGGGQLDDSALLQNENVN 668
Query: 584 SILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK-VPMNDMNMRADSSRQYPG 642
++LW GYP + G +A+ +I+ G P GRLP+T YP ++T +PM DM++R + S PG
Sbjct: 669 ALLWCGYPSQTGGQAVFDILTGQSAPAGRLPVTQYPANYTNAIPMTDMSLRPNGST--PG 726
Query: 643 RSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVH 702
R+YR+Y V FG GL YT + + +ASL A + K+ Q T + D H
Sbjct: 727 RTYRWYDDA-VIPFGFGLHYTTFDASWADKKFGPYNTASLVAKASKSKYQDT-APFDSFH 784
Query: 703 IDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVS-QGTPEKQLIGFDRVHTVAK 761
++ V N G V V +LFA + P K LI + R ++
Sbjct: 785 VN---------------VKNTGKVTSDFVALLFASTDNAGPKPYPIKTLISYARASSIKP 829
Query: 762 G-SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
G ++ +S V ++ +G +L G++ L + +H T E
Sbjct: 830 GETRTVSIDVT-IGSIARTATNGDLVLYPGSYTLQLDVGQHYPTAE 874
>gi|85813774|emb|CAJ65923.1| xylan 1,4-beta-xylosidase [Populus tremula x Populus alba]
Length = 704
Score = 449 bits (1156), Expect = e-123, Method: Compositional matrix adjust.
Identities = 232/486 (47%), Positives = 317/486 (65%), Gaps = 18/486 (3%)
Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRAL 385
DCDAV + Q Y KT ED+ A LK+G+ + + +T+SA++K KV +IDRAL
Sbjct: 229 DCDAVNVLHVEQKYAKTPEDAVADALKSGI-----SYLRNYTKSAVEKKKVTVSEIDRAL 283
Query: 386 LNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
NLFS ++RLGLFNGDP K Y +GPD VC+ EH+ LAL+AA GIVLLKN + LPL+
Sbjct: 284 HNLFSTRMRLGLFNGDPTKQLYSDIGPDQVCSQEHQALALEAALDGIVLLKNADRLLPLS 343
Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDA 505
K+ +SSLA+IGP +N + + G Y G C ++L GL YVS Y GC++V C S A
Sbjct: 344 KSGISSLAVIGPNAHNSTNLLGNYFGPACKNVTILEGLRNYVSSASYEKGCNNVSCTS-A 402
Query: 506 GFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVL 565
+ V +A+ D VI+V GLD +QE E DR+ L+LPG+Q +L+T+VA+ +KRP++LVL
Sbjct: 403 AKKKPVEMAQTEDQVILVMGLDQSQEKERLDRMDLVLPGKQPTLITAVAKAAKRPIVLVL 462
Query: 566 TGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNP---GGRLPMTWYPESF 622
GG P+DV+FA+ + +I SILW GYPG+AGA ALA+IIFG+ NP GGRLPMTWYP+ F
Sbjct: 463 LGGSPMDVTFAKNNRKIGSILWAGYPGQAGATALAQIIFGEHNPGNAGGRLPMTWYPQDF 522
Query: 623 TKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLS-APSELTISAS 681
TKVPM DM MR S PGR+YRFY G +V+ FG+GLSY++YSY F S A ++L + S
Sbjct: 523 TKVPMTDMRMRPQPSTGNPGRTYRFYEGEKVFEFGYGLSYSDYSYTFASVAQNQLNVKDS 582
Query: 682 LKAGSDKNILQQTGSRLDYVHIDEV--TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVP 739
N + Y + ++ C +++F V +SV N G + G H V+LFAR
Sbjct: 583 ------SNQQPENSETPGYKLVSDIGEEQCENIKFKVTVSVKNEGQMAGKHPVLLFARHA 636
Query: 740 KVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGEL 799
K +G P K+L+GF V A EI + + PCE LS AN+ G ++ G+ +L+VG+
Sbjct: 637 KPGKGRPIKKLVGFQTVKLGAGEKTEIEYELSPCEHLSSANEDGVMVMEEGSQILLVGDK 696
Query: 800 RHSLTI 805
H +TI
Sbjct: 697 EHPVTI 702
Score = 223 bits (569), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 115/224 (51%), Positives = 140/224 (62%), Gaps = 28/224 (12%)
Query: 38 NKPDFPCKP--PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPA 95
++P + C P +Y FC T+L IS RA+ L+S LT +EK QL D + AIPRLGIPA
Sbjct: 26 SQPPYSCDSSDPSTKTYDFCKTTLPISRRAEDLVSRLTFEEKATQLVDTSPAIPRLGIPA 85
Query: 96 YEWWSESLHGIA---SNGPGVNF-NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEA 151
YEWWSE LHGI G++F N T+ TSFPQV+++AASF+ +W IG V EA
Sbjct: 86 YEWWSEGLHGIGFLTRVQQGISFFNRTIQHATSFPQVILTAASFDAHIWYRIGQ-VGKEA 144
Query: 152 RAMYNLGQ-AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSD 210
RA+YN GQ GL FWAPN+NIFRDPRWGRGQETPGEDP+VV Y FV+ QG++
Sbjct: 145 RALYNAGQVTGLGFWAPNVNIFRDPRWGRGQETPGEDPLVVGKYGASFVRGVQGDS---- 200
Query: 211 DGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKW 254
F ES GD L SACCKH A+DL+ W
Sbjct: 201 ----------------FEGESTLGDHLQASACCKHYTAHDLDNW 228
>gi|391864313|gb|EIT73609.1| beta-glucosidase-related glycosidase [Aspergillus oryzae 3.042]
Length = 797
Score = 449 bits (1156), Expect = e-123, Method: Compositional matrix adjust.
Identities = 275/752 (36%), Positives = 410/752 (54%), Gaps = 65/752 (8%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
C+TSL +RAKSL++ +TL+EKI ++S PRLG+PAY WW+E+LHG+A G GV+
Sbjct: 62 CDTSLDPVSRAKSLVAAMTLEEKINNTKYDSSGAPRLGLPAYNWWNEALHGVAE-GHGVS 120
Query: 115 FN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
F+ G S TSFP ++ A+F+ L + + ++ EARA N G AGL +W PNIN F
Sbjct: 121 FSDSGNFSYATSFPMPILLGAAFDDDLVKQVATVISTEARAFANGGHAGLDYWTPNINPF 180
Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
RDPRWGRGQETPGEDP+ +S Y V Q G G E
Sbjct: 181 RDPRWGRGQETPGEDPLHLSRYVYHLVDGLQ---------------------DGIGPERP 219
Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
+ + A CKH AYDLE W RY+F+A+++ QD + + P F++C K +M
Sbjct: 220 K-----VVATCKHFAAYDLENWEGIERYAFDAVVSPQDLSEYYLPSFKTCTRDAKVDAVM 274
Query: 293 CSYNQVNGVPACLRGDLFQK-ARNEWGFK---GYITSDCDAVATIFEYQNYTKTHEDSAA 348
CSYN +NG+P C L Q R WG++ ++T DC A+ I+ +Y +AA
Sbjct: 275 CSYNSLNGIPTCADRWLLQTLLREHWGWEQTGHWVTGDCGAIDNIYADHHYVADGAHAAA 334
Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-Y 407
L AG D++CG+ + SA+ +G + + AL+ L+S ++LG F DP + Y
Sbjct: 335 AALNAGTDLDCGSVFPEYLGSALQQGLYNNQTLYNALIRLYSSLVKLGYF--DPADDQPY 392
Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
+G ++V T ++LA A +GIV+LKND LPL N ++AIIGP N +Q+ G
Sbjct: 393 RSIGWNEVFTPAAEELAHKATVEGIVMLKNDGT-LPLKSNG--TVAIIGPFANATTQLQG 449
Query: 468 GYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLD 527
Y G P ++L+ K ++ G D+ NS AGF EA+ AK+AD VI G+D
Sbjct: 450 NYEGPPKYIRTLIWAAVHNGYKVKFSQGT-DINSNSSAGFAEAISAAKEADTVIYAGGID 508
Query: 528 LTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILW 587
T E E +DR +++ PG Q+ L+ ++ K P+I+V GGG +D S A++ + ++LW
Sbjct: 509 NTIEKESQDRTTIVWPGNQLDLIEQLSDLEK-PLIVVQFGGGQVDDSSLLANAGVGALLW 567
Query: 588 IGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYR 646
GYP +AG A+ +I+ G P GRLP+T YP S+ +VPM DM +R S+ PGR+YR
Sbjct: 568 AGYPSQAGGAAVFDILTGKSAPAGRLPVTQYPASYVDEVPMTDMTLRPGSNN--PGRTYR 625
Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEV 706
+Y V FG GL YT ++ + A + S+ +G+ T + +D
Sbjct: 626 WYD-KAVLPFGFGLHYTTFNVSWNHAEYGPYNTDSVASGT-------TNAPVD------- 670
Query: 707 TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVS-QGTPEKQLIGFDRVHTVAKG-SK 764
T L I+VTN G+V ++ +LF V + P K L+G+ R + G S+
Sbjct: 671 ---TELFDTFSITVTNTGNVASDYIALLFLTADGVGPEPYPIKTLVGYSRAKGIEPGQSQ 727
Query: 765 EISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
++ V ++ ++G +L G++ L V
Sbjct: 728 QVKLDVS-VGSVARTAENGDLVLYPGSYKLEV 758
>gi|332982588|ref|YP_004464029.1| glycoside hydrolase [Mahella australiensis 50-1 BON]
gi|332700266|gb|AEE97207.1| glycoside hydrolase family 3 domain protein [Mahella australiensis
50-1 BON]
Length = 714
Score = 449 bits (1154), Expect = e-123, Method: Compositional matrix adjust.
Identities = 282/774 (36%), Positives = 413/774 (53%), Gaps = 113/774 (14%)
Query: 48 HFDSYP-FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGI 106
H + P + + SLS RAK L+S +TL EKI Q+ +A AIPRL IPAY WW+E LHG+
Sbjct: 6 HEEKLPAYKDVSLSFEDRAKDLVSRMTLPEKISQMIYDAPAIPRLDIPAYNWWNECLHGV 65
Query: 107 ASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG-------- 158
A G T FPQ + AA+FN L + A++ EARA ++
Sbjct: 66 ARAG----------IATVFPQAIAMAATFNPELIHKVAEAISDEARAKHHEAVRNGDRGI 115
Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
GLTFW+PNINIFRDPRWGRG ET GEDP + S V FVK QG++ K
Sbjct: 116 YKGLTFWSPNINIFRDPRWGRGHETYGEDPYLTSRMGVAFVKGLQGDDPKY--------- 166
Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
L + A KH Y + R+SF+A ++++D +T+ P
Sbjct: 167 ------------------LKVVATPKH---YAVHSGPESQRHSFDARVSQKDLRETYLPA 205
Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQ 337
F C+++GKA IM +YN+ NG P C L + R+EWGF GY+ SDC A+ I +
Sbjct: 206 FEECVKEGKAVSIMGAYNRTNGEPCCASKTLLKDILRDEWGFDGYVVSDCGAIDDIHMHH 265
Query: 338 NYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
TKT +SAA + G ++NCG + A+++G + E+ ID+A++ LF+ ++RLG+
Sbjct: 266 KVTKTAAESAALAVNNGCELNCGKTY-EYLCQAVEQGLISEETIDQAVIKLFTARMRLGM 324
Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
F+ P +Y + D + EH++LAL+ ARQ IVLLKND+ LPL+K + ++A+IGP
Sbjct: 325 FD-PPEMVRYAHIPYDVNDSPEHRELALETARQSIVLLKNDENILPLSKK-LKTIAVIGP 382
Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCHDVPCNSDAGFHEAVRIA 514
+++ + Y G P + L G++ VS K YA GC +V NS GF EAV IA
Sbjct: 383 NADDLDVLLANYFGTPSKYVTPLEGIKNKVSPDTKVLYAKGC-EVTGNSVDGFDEAVNIA 441
Query: 515 KKADFVIVVAGLDLTQETEDRD---------RVSLLLPGQQMSLVTSVARTSKRPVILVL 565
+ AD VI+ GL E E+ D R+ + LPG Q L+ ++ T K P++LVL
Sbjct: 442 EMADIVIMCLGLSPRIEGEEGDVADSDGGGDRLHIDLPGMQEQLLETIYGTGK-PIVLVL 500
Query: 566 TGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV 625
G + +++A + +I+ YPGE G A+A+++FGD+NP GRLP+T+
Sbjct: 501 LNGSAIAINWAH--EHVPAIIEAWYPGEEGGTAIADVLFGDYNPAGRLPITFVRSLDDLP 558
Query: 626 PMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISA-SLKA 684
P D NM+ GR+YR++ +Y FG+GLSYT++ Y S L +SA L A
Sbjct: 559 PFTDYNMK--------GRTYRYFEKEPLYPFGYGLSYTSFKY------SNLRLSAMRLPA 604
Query: 685 GSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQG 744
G++ +I + V N G + G VV L+ + S
Sbjct: 605 GNNLDI--------------------------NVDVENTGKLAGREVVQLYISDVEASVE 638
Query: 745 TPEKQLIGFDRVHTVAKGSKE-ISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
P +QL G + T+ G K+ +SF V+P + +S+ + G+RIL G ++ VG
Sbjct: 639 VPMRQLCGIQCI-TLEPGQKQTVSFTVEP-QHMSLFDYDGKRILEPGQFIIAVG 690
>gi|212531051|ref|XP_002145682.1| beta-xylosidase XylA [Talaromyces marneffei ATCC 18224]
gi|210071046|gb|EEA25135.1| beta-xylosidase XylA [Talaromyces marneffei ATCC 18224]
Length = 799
Score = 447 bits (1151), Expect = e-123, Method: Compositional matrix adjust.
Identities = 272/759 (35%), Positives = 409/759 (53%), Gaps = 74/759 (9%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
C+TS + RA+ LI+L TL+E I ++ +PRLG+P YE WSE LHG+ V
Sbjct: 64 CDTSANYVDRAEGLIALFTLEELINNTQNSGPGVPRLGLPPYEVWSEGLHGL-DRAHFVK 122
Query: 115 FNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRD 174
+ TSFP ++S A+ NR+L + I S +A +ARA N+G+ GL +APNIN FR
Sbjct: 123 SGDEWTWATSFPMPILSMAALNRTLINQIASIIATQARAFNNVGRYGLDAYAPNINGFRS 182
Query: 175 PRWGRGQETPGEDP-MVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDR 233
P WGRGQETPGED + S+YA E++ Q GGI
Sbjct: 183 PLWGRGQETPGEDANFLTSSYAYEYITGLQ--------GGI------------------D 216
Query: 234 GDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMC 293
D L ++A KH YDLE WG SR F+A IT+QD + + P F + KA MC
Sbjct: 217 PDNLKIAATAKHFAGYDLENWGGNSRLGFDARITQQDLAEYYTPQFLAASRYAKARSFMC 276
Query: 294 SYNQVNGVPACLRGDLFQK-ARNEWGFK--GYITSDCDAVATIFEYQNYTKTHEDSAAGV 350
SYN VN +P+C L Q R +W F GY++SDCDAV +F Y +AA
Sbjct: 277 SYNSVNAIPSCSSSFLLQTLLREQWDFPEYGYVSSDCDAVYNVFNPHGYASNQSSAAAES 336
Query: 351 LKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKL 410
L+AG DI+CG H + +G V +I+R++L L+S ++LG F+GD K +Y +L
Sbjct: 337 LRAGTDIDCGQTYSWHLNQSFIEGSVTRGEIERSILRLYSNLVKLGYFDGD--KNEYRQL 394
Query: 411 GPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT 470
G +DV T++ ++ +AA +GIVLLKND LPL+KN V S+A++GP N Q+ G Y
Sbjct: 395 GWNDVVTTDAWNISYEAAVEGIVLLKND-GVLPLSKN-VKSVALVGPWANATKQLQGNYF 452
Query: 471 GIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQ 530
G + L+G K +YA G + + N+ GF A+ AKK+D ++ + G+D T
Sbjct: 453 GTAPYLITPLQGASDAGYKVNYALGTN-ISGNTTDGFANALSAAKKSDVIVYLGGIDNTI 511
Query: 531 ETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGY 590
E E DR+++ P Q+ L+ +++T K P++++ GGG +D S +++S++++++W GY
Sbjct: 512 EAEGTDRMNVTWPRNQLDLIQQLSQTGK-PLVVLQMGGGQVDSSSIKSNSKVNALIWGGY 570
Query: 591 PGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRFYT 649
PG++G KA+ +I+ G P GRL T YP + T+ P DM++R D + PG++Y +Y
Sbjct: 571 PGQSGGKAIFDILKGKRAPAGRLVSTQYPAEYATQFPATDMSLRPD-GKSNPGQTYMWYI 629
Query: 650 GTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTS- 708
G VY FG+GL YT + K ++ GS I E+ S
Sbjct: 630 GKPVYEFGYGLFYTTF----------------------KETAKKLGSSSSSFDISEIVSS 667
Query: 709 --CTSLRF-------HVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEKQLIGFDRVHT 758
S + +V ++ N G + MLFA P K L+G+DR+ +
Sbjct: 668 PRSPSYEYSELVPFLNVTATIKNTGKTASPYTAMLFANTTNAGPAPYPNKWLVGYDRLPS 727
Query: 759 VAKG-SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
+ G S ++ V P ++ +K+G RI+ G++ L +
Sbjct: 728 IEPGKSADLVIPV-PIGAIARVDKNGNRIVYPGDYQLTL 765
>gi|348604625|dbj|BAK96214.1| beta-xylosidase [Acremonium cellulolyticus]
Length = 797
Score = 447 bits (1150), Expect = e-122, Method: Compositional matrix adjust.
Identities = 273/764 (35%), Positives = 411/764 (53%), Gaps = 78/764 (10%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
C+TS + RA+ LI+L TL+E I + A +PRLG+P Y+ WSE+LHG+ N
Sbjct: 64 CDTSANYVDRAEGLIALFTLEELINNTQNTAPGVPRLGLPPYQVWSEALHGLDR----AN 119
Query: 115 FNGT---VSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
F + + TSFP ++S A+ NR+L + I + +ARA N G+ GL +APNIN
Sbjct: 120 FATSGDEWTWATSFPMPILSMAALNRTLINQIAGIIGTQARAFNNAGRYGLDAYAPNING 179
Query: 172 FRDPRWGRGQETPGEDP-MVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
FR P WGRGQETPGED + S+YA E++ Q GG+
Sbjct: 180 FRSPLWGRGQETPGEDANFLSSSYAYEYITGLQ--------GGV---------------- 215
Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
D L + A KH YDLE WG SR F+A IT+QD + + P F + KA
Sbjct: 216 --DPDHLKVVATAKHFAGYDLENWGGNSRLGFDASITQQDLAEYYTPQFLAASRYAKARS 273
Query: 291 IMCSYNQVNGVPACLRGDLFQK-ARNEWGFK--GYITSDCDAVATIFEYQNYTKTHEDSA 347
MCSYN VNGVP+C L Q R+ W F GY++SDCDAV +F Y +A
Sbjct: 274 FMCSYNSVNGVPSCSSSFLLQTLLRDNWDFPEYGYVSSDCDAVYNVFNPHGYASNQSAAA 333
Query: 348 AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKY 407
A L+AG DI+CG + + +G V +I+R+++ L+S ++LG F+GD K +Y
Sbjct: 334 ADSLRAGTDIDCGQTYPWNLNQSFIEGSVTRGEIERSIVRLYSNLVKLGYFDGD--KSEY 391
Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
+LG +DV T++ ++ +AA +GIVLLKND LPL+K+ V S+A+IGP N Q+ G
Sbjct: 392 RQLGWNDVVTTDAWNISYEAAVEGIVLLKND-GILPLSKH-VKSIALIGPWANATEQLQG 449
Query: 468 GYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLD 527
Y G + L+G K +YA G + + N+ GF +A+ AKK+D ++ + G+D
Sbjct: 450 NYYGTAPYLITPLQGASDAGYKVNYALGTN-ILGNTTEGFADALSAAKKSDVIVYLGGID 508
Query: 528 LTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILW 587
T E E DR+++ PG Q+ L+ +++T K P++++ GGG +D S +A+S++++++W
Sbjct: 509 NTIEAEGTDRMNVTWPGNQLDLIQQLSQTGK-PLVVLQMGGGQVDSSSIKANSKVNALVW 567
Query: 588 IGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYR 646
GYPG++G A+ +I+ G P GRL T YP + T+ P DMN+R D + PG++Y
Sbjct: 568 GGYPGQSGGTAIFDILSGKRVPAGRLVTTQYPAEYATQFPATDMNLRPDGASN-PGQTYM 626
Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTG-SRLDYVHIDE 705
+YTGT VY FG+GL YT + K Q+ G S D I
Sbjct: 627 WYTGTPVYDFGYGLFYTTF----------------------KETAQKLGSSSFDISEIVA 664
Query: 706 VTSCTSLRF-------HVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEKQLIGFDRVH 757
S + ++ ++ N G + MLFA P K L+G+DR+
Sbjct: 665 APRSPSYEYSELVPFVNITATIKNTGKTASPYTAMLFANTTNAGPAPYPNKWLVGYDRLA 724
Query: 758 TVAKG-SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELR 800
++ G S ++ V P ++ +++G RI+ G++ L + R
Sbjct: 725 SIEPGKSADLVIPV-PIGAIARVDENGNRIVYPGDYQLALNVER 767
>gi|443893988|dbj|GAC71176.1| hypothetical protein PANT_1d00031 [Pseudozyma antarctica T-34]
Length = 759
Score = 447 bits (1150), Expect = e-122, Method: Compositional matrix adjust.
Identities = 269/769 (34%), Positives = 414/769 (53%), Gaps = 78/769 (10%)
Query: 42 FP-CKP-PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWW 99
FP C P + C+TSL TRA SL++ T QE I + A +PRLGIP Y+WW
Sbjct: 21 FPDCSPGTPLSANAVCDTSLDYWTRATSLVAEFTTQELINNTINTAPGVPRLGIPPYQWW 80
Query: 100 SESLHGIASNGPGVNFNGTVS----SVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY 155
+E+LHG+A + PGVNF V S T+FPQ++ A+F+ +L+ + + +A E RA
Sbjct: 81 TEALHGVAGS-PGVNFADDVEAPYGSATNFPQIINLGATFDDALYEQVATHIANETRAFN 139
Query: 156 NLGQAGLTFWAP-NINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGI 214
N G+AGL ++P NIN FRDPRWGRGQET GEDP+ +S YAV+ V+ QG N
Sbjct: 140 NAGKAGLNMYSPLNINCFRDPRWGRGQETTGEDPLHMSRYAVKMVQGLQGPN-------- 191
Query: 215 GFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDT 274
DEL L+A CKH +AYDLEKW RY F+A ++ Q+ +
Sbjct: 192 -------------------QDELRLAATCKHYLAYDLEKWDGVERYQFDAQVSRQELAEF 232
Query: 275 FQPPFRSCIEQGKASCIMCSYNQVNGV-PACLRGDLFQKARNEWGF---KGYITSDCDAV 330
+ P FR+C+ GKA +M SYN VN V P+ R L AR EWG Y+TSDCDAV
Sbjct: 233 YLPQFRACVRDGKAVTLMTSYNAVNNVPPSASRYYLETLARKEWGLDKKHNYVTSDCDAV 292
Query: 331 ATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFS 390
A +F+ +Y ++ +AA + AG D+NCG + A+++ + I A+ +++
Sbjct: 293 ANVFDGHHYADSYVQAAADSINAGTDLNCGATYSDNLGQALEQNLTDVETIRTAVARMYA 352
Query: 391 VQLRLGLFNGDPRKGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAV 449
Q+RLGLF DP++G+ +LG + V T + LA +A + LLKN+ LP+ +
Sbjct: 353 SQVRLGLF--DPKQGQPLRELGWEHVNTKAAQDLAYSSAAASVTLLKNNGT-LPV--DGA 407
Query: 450 SSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKT--HYASGCHDVPCNSDAGF 507
+ +A+IGP N + G Y G ++ + S+ A+G +
Sbjct: 408 TKVAVIGPYSNATFALRGNYAGPGPFAITMTEAAQRVFSQATISSANGTTISGTYNHTDA 467
Query: 508 HEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTG 567
A+++AK+AD VI G+D T E+E+ DR ++ P Q+ L+ ++ +K+ + +V G
Sbjct: 468 EAAMQLAKEADLVIFAGGIDPTIESEELDRATIAWPPNQLQLIHALGGMAKK-MAVVQFG 526
Query: 568 GGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK-VP 626
GG +D + +AD I ++LW GYPG++GA A+ ++I G+ P GRLP+T YP + +
Sbjct: 527 GGQIDGASIKADGNIGALLWAGYPGQSGALAVMDVIAGNTAPAGRLPITQYPAEYIDGLA 586
Query: 627 MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGS 686
M +R +++ YPGR+Y++Y+GT Y + HGL YT + + L+ P+ TI+ +
Sbjct: 587 ETTMALRPNAT--YPGRTYKWYSGTPTYPYAHGLHYTEFKAE-LAQPAPYTIATA----- 638
Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
Y + V + VQ ++TNAG + ++FAR P
Sbjct: 639 ------------GYAEFERVAT-------VQATITNAGQRTSDYAALVFARHTNGPAPHP 679
Query: 747 EKQLIGFDRVHTVAKG-SKEISFGVDPCEQLSIANKHGRRILPLGNHVL 794
K L+G+ +V +A G S+ + + L+ ++ G +L G + L
Sbjct: 680 NKTLVGYKKVKAIAPGESRSVEVEITQAA-LARGDEEGNLVLYPGKYEL 727
>gi|426198365|gb|EKV48291.1| hypothetical protein AGABI2DRAFT_219902 [Agaricus bisporus var.
bisporus H97]
Length = 767
Score = 447 bits (1149), Expect = e-122, Method: Compositional matrix adjust.
Identities = 276/777 (35%), Positives = 418/777 (53%), Gaps = 64/777 (8%)
Query: 42 FP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWS 100
FP C S C+ + + + RAK+LI + T +E +Q + + +PRLG+P+Y+WWS
Sbjct: 24 FPDCVNGPLSSTAVCDPTKAPAARAKTLIQMFTDEELMQNTDNVSPGVPRLGVPSYQWWS 83
Query: 101 ESLHGIASNGPGVNF--NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
E+LHG+A + PGV+F +G SS TSFPQ +V ++F+ L + + ++ EARA N
Sbjct: 84 EALHGVAGS-PGVSFAPSGEFSSATSFPQSIVLGSTFDIDLVKAVATVISTEARAFNNFH 142
Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
+AGL ++ PNIN F+DPRWGRGQETPGEDP VS Y + QG G
Sbjct: 143 RAGLDYFTPNINPFKDPRWGRGQETPGEDPFHVSQYVYSLIDGLQG------------GI 190
Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
+ K ++A CKH AYDL+ W R+ F+A ++ QD + + P
Sbjct: 191 DPRPYFK-------------VAADCKHYAAYDLDSWEGIDRFHFDAKVSLQDLSEYYLPS 237
Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKG--YITSDCDAVATIFE 335
F+SC+ K + +MCSYN VNG+PAC L Q R+ WGF ++TSDCDA+ IF
Sbjct: 238 FQSCVRDAKVASVMCSYNSVNGIPACANPYLLQDILRDFWGFDDDRWVTSDCDAIGNIFT 297
Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
N+T T ++ A LKAG D++CGT H A+++ + D++RAL ++ +RL
Sbjct: 298 THNFTDTFAEAVADALKAGTDVDCGTSYSTHLPDALNQSLITRDDLERALTRQYTSLMRL 357
Query: 396 GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAII 455
G F+ P +L DV + + LA AA +G+VLLKND FLP++ + ++AII
Sbjct: 358 GYFD-PPESQPLRQLAWSDVNKPDAQALAHTAAVEGLVLLKND-GFLPVSASG-KTIAII 414
Query: 456 GPLVNNISQMGGGYTGI-PCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIA 514
GP N M G Y G P ++A ++ A+G + S+A F A+ +A
Sbjct: 415 GPYANATKDMQGNYFGTAPFIVTPFQGAVDAGFNEVVSAAGT-SINGTSEADFAAAIAVA 473
Query: 515 KKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
+D +I G++ + E+E +DR+++ G Q+SLV +A K PV++V GGG LD S
Sbjct: 474 NSSDIIIFAGGINNSIESEAKDRLTIAWTGNQLSLVKQLASLGK-PVVVVQFGGGQLDDS 532
Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMR 633
+ + +++W GYPG++G A+ ++I G P GRL +T YPE F +V M DM +R
Sbjct: 533 DLLDNDAVRAVIWAGYPGQSGGTAIFDVITGAVAPAGRLSVTQYPEDFVNQVGMTDMALR 592
Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
S+ PGR+Y++YTG V FGHGL +T + + + P L +DK
Sbjct: 593 PGSAN--PGRTYKWYTGRPVLEFGHGLHFTTFDFSWRGRPGRKYNIQHLLHTADKKF--- 647
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
D + +D FHV I N G++ +V +LF R P+K L+ F
Sbjct: 648 ----PDLIPLDT--------FHVNIR--NTGNITSDYVALLFLRSNAGFAPHPKKSLVSF 693
Query: 754 DRVHTVAKGSK-EISFGVDPCEQLSIANKHGRRILPLGNHVLMV----GELRHSLTI 805
R H + GS + GV+ ++ ++HG L G++ L++ G L HS ++
Sbjct: 694 ARAHRIDAGSSATVDLGVN-LGSIARVDEHGDSWLFAGDYQLVLDIGDGVLSHSFSL 749
>gi|413919687|gb|AFW59619.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 451
Score = 447 bits (1149), Expect = e-122, Method: Compositional matrix adjust.
Identities = 219/439 (49%), Positives = 290/439 (66%), Gaps = 29/439 (6%)
Query: 40 PDFPCKPPH--FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYE 97
P F C + SY FCN S + + RA L+S LTL EK+ L D +A+PRLG+P YE
Sbjct: 38 PAFACDASNATLASYGFCNRSAAAAARAADLVSRLTLAEKVGFLVDKQAALPRLGVPLYE 97
Query: 98 WWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL 157
WWSE+LHG++ GPG F+ V TSFPQ +++AASFN +L+ IG V+ EARAM+N+
Sbjct: 98 WWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNATLFRAIGEVVSNEARAMHNV 157
Query: 158 GQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFG 217
G AGLTFW+PNINIFRDPRWGRGQETPGEDP++ S YAV +V QG
Sbjct: 158 GLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLTSKYAVGYVTGLQG------------- 204
Query: 218 FREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQP 277
+ G G L ++ACCKH AYD++ W RY+F+A++++QD +DTFQP
Sbjct: 205 -----AVSGAG-------ALKVAACCKHYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQP 252
Query: 278 PFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEY 336
PF+SC+ G + +MCSYNQVNG P C DL R +W GYI+SDCD+V ++
Sbjct: 253 PFKSCVVDGNVASVMCSYNQVNGKPTCADKDLLSGVIRGDWKLNGYISSDCDSVDVLYNN 312
Query: 337 QNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
Q+YTKT ED+AA +KAG+D+NCGT + +HT +A+ GK+ E D+DRA+ N +RLG
Sbjct: 313 QHYTKTPEDAAAISIKAGLDLNCGTFLAQHTVAAVQAGKLSESDVDRAVTNNLVTLMRLG 372
Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIG 456
F+GDPR+ +G LGP DVCT +++LA +AARQGIVLLKN K LPL+ ++ S+A+IG
Sbjct: 373 FFDGDPRELPFGNLGPSDVCTPSNQELAREAARQGIVLLKNTGK-LPLSATSIKSMAVIG 431
Query: 457 PLVNNISQMGGGYTGIPCS 475
P N M G Y G C+
Sbjct: 432 PNANASFTMIGNYEGTSCN 450
>gi|6573772|gb|AAF17692.1|AC009243_19 F28K19.27 [Arabidopsis thaliana]
Length = 696
Score = 447 bits (1149), Expect = e-122, Method: Compositional matrix adjust.
Identities = 241/486 (49%), Positives = 322/486 (66%), Gaps = 18/486 (3%)
Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRAL 385
DCDAV+ I++ Q Y K+ ED+ A VLKAGMD+NCG+ + +HT+SA+ + KV E DIDRAL
Sbjct: 221 DCDAVSIIYDAQGYAKSPEDAVADVLKAGMDVNCGSYLQKHTKSALQQKKVSETDIDRAL 280
Query: 386 LNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
LNLFSV++RLGLFNGDP K YG + P++VC+ H+ LALDAAR GIVLLKN+ K LP +
Sbjct: 281 LNLFSVRIRLGLFNGDPTKLPYGNISPNEVCSPAHQALALDAARNGIVLLKNNLKLLPFS 340
Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDA 505
K +VSSLA+IGP + + + G Y G PC + L L +YV Y GC V C S+A
Sbjct: 341 KRSVSSLAVIGPNAHVVKTLLGNYAGPPCKTVTPLDALRSYVKNAVYHQGCDSVAC-SNA 399
Query: 506 GFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVL 565
+AV IAK AD V+++ GLD TQE ED DRV L LPG+Q L+TSVA +K+PV+LVL
Sbjct: 400 AIDQAVAIAKNADHVVLIMGLDQTQEKEDFDRVDLSLPGKQQELITSVANAAKKPVVLVL 459
Query: 566 TGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV 625
GGP+D+SFA +++I SI+W GYPGEAG A++EIIFGD NPGGRLP+TWYP+SF +
Sbjct: 460 ICGGPVDISFAANNNKIGSIIWAGYPGEAGGIAISEIIFGDHNPGGRLPVTWYPQSFVNI 519
Query: 626 PMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLS-APSELTISASLKA 684
M DM MR S+ YPGR+Y+FY G +VY FGHGLSY+ YSY+F + A + L ++ S KA
Sbjct: 520 QMTDMRMR--SATGYPGRTYKFYKGPKVYEFGHGLSYSAYSYRFKTLAETNLYLNQS-KA 576
Query: 685 GSDKNILQQTGSRLDYVHIDEV--TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVS 742
Q + Y + E+ C + V + V N G++ G H V++FAR +
Sbjct: 577 -------QTNSDSVRYTLVSEMGKEGCDVAKTKVTVEVENQGEMAGKHPVLMFARHERGG 629
Query: 743 QG--TPEKQLIGFDRVHTVAKGSK-EISFGVDPCEQLSIANKHGRRILPLGNHVLMVGEL 799
+ EKQL+GF + ++ G K E+ F + CE LS AN+ G +L G + L VG+
Sbjct: 630 EDGKRAEKQLVGFKSI-VLSNGEKAEMEFEIGLCEHLSRANEFGVMVLEEGKYFLTVGDS 688
Query: 800 RHSLTI 805
L +
Sbjct: 689 ELPLIV 694
Score = 241 bits (615), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 115/218 (52%), Positives = 141/218 (64%), Gaps = 23/218 (10%)
Query: 40 PDFPCKP--PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYE 97
P C P P Y FC T L I RA+ L+S LT+ EKI QL + A IPRLG+PAYE
Sbjct: 22 PPHSCDPSNPTTKLYQFCRTDLPIGKRARDLVSRLTIDEKISQLVNTAPGIPRLGVPAYE 81
Query: 98 WWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL 157
WWSE+LHG+A GPG+ FNGTV + TSFPQV+++AASF+ W I + EAR +YN
Sbjct: 82 WWSEALHGVAYAGPGIRFNGTVKAATSFPQVILTAASFDSYEWFRIAQVIGKEARGVYNA 141
Query: 158 GQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGF 216
GQA G+TFWAPNINIFRDPRWGRGQETPGEDPM+ YAV +V+ QG++
Sbjct: 142 GQANGMTFWAPNINIFRDPRWGRGQETPGEDPMMTGTYAVAYVRGLQGDS---------- 191
Query: 217 GFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKW 254
F ++ L + L SACCKH AYDL++W
Sbjct: 192 -FDGRKTLS---------NHLQASACCKHFTAYDLDRW 219
>gi|392596548|gb|EIW85871.1| hypothetical protein CONPUDRAFT_80240 [Coniophora puteana
RWD-64-598 SS2]
Length = 770
Score = 447 bits (1149), Expect = e-122, Method: Compositional matrix adjust.
Identities = 274/761 (36%), Positives = 403/761 (52%), Gaps = 70/761 (9%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
C+TSL+ + RA +LI L T+ E I + A +PRLG+PAYEWWSE LHG+A N GV
Sbjct: 37 CDTSLNATQRAAALIDLFTVDELIVNTVNWAPGVPRLGLPAYEWWSEGLHGVA-NSAGVT 95
Query: 115 FN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
++ G S TSFPQ ++ +A+F+ +L +G + +E RA N G AGL FW PNIN F
Sbjct: 96 WSITGPFSYATSFPQPILMSAAFDDALIKAVGGVIGMEGRAFNNYGHAGLDFWTPNINPF 155
Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
+DPRWGRGQETPGEDP ++ Y ++ QG G E
Sbjct: 156 KDPRWGRGQETPGEDPYHIAQYVYNLIQGLQG---------------------GLDPEP- 193
Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
+ A CKH YDLE W RY +NA+I+ QD + + P F+SC A M
Sbjct: 194 ---YFQVVATCKHFAGYDLEDWDFNYRYGYNAIISTQDLSEYYLPSFQSCYRDAFAGASM 250
Query: 293 CSYNQVNGVPACLRGDLFQKA-RNEWGFKG--YITSDCDAVATIFEYQNYTKTHEDSAAG 349
CSYN +NG+P C L Q R WGF ++T DCD+V I+++ +YT + +AA
Sbjct: 251 CSYNAINGIPTCADTYLLQDILRGFWGFDQTRWVTGDCDSVEDIYDFHHYTALPQQAAAD 310
Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-YG 408
LKAG DI+CG A + + E+D+ AL ++ +RLG F DP + Y
Sbjct: 311 ALKAGSDIDCGIFYTTWLPLAYTESLITEQDLRAALTRQYASLVRLGYF--DPASEQPYR 368
Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
+ +V TS ++LA AA +GI LLKND LP + +A+ ++A+IGP +QM G
Sbjct: 369 QYNWSNVDTSYAQELAYTAAVEGITLLKNDGT-LPFS-SAIKNIALIGPWTFATTQMQGN 426
Query: 469 YTGIPCSPKSLLRGLE------AYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIV 522
Y G S +G + +YV +T +V N+ G+ A A+ AD ++
Sbjct: 427 YYGNAPYLISPYQGAQLAGYNISYVLET-------NVTSNTTDGYAAAFTAAQGADAIVF 479
Query: 523 VAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQI 582
V G+D T E E DR + P Q+ L+ + + K P+++V GGG +D + A+ +
Sbjct: 480 VGGIDNTVEAEAMDRNDITWPAFQLWLIGELGKLGK-PLVVVQFGGGQVDDTEINANPDV 538
Query: 583 SSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSS-RQY 640
+++LW GYPG++G +AL +II G P GRL T YP + ++PM +MN+R D++
Sbjct: 539 NALLWGGYPGQSGGQALFDIISGKVAPAGRLVSTQYPADYVNEIPMTNMNLRPDANGTTS 598
Query: 641 PGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSE-LTISASLKAGSDKNILQQTGSRLD 699
PGR+Y++YTGT VY FG+GL YTN++Y + AP+ +I A + AG Q + +D
Sbjct: 599 PGRTYKWYTGTPVYEFGYGLHYTNFTYAWTKAPAATYSIEALVAAG-------QGSAHID 651
Query: 700 YVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTV 759
D ++ + VTNAG V + +LF P K L + R+H V
Sbjct: 652 LAPFDTLS----------VEVTNAGAVTSDYSALLFVNGTYGPAPYPNKSLAAYTRLHNV 701
Query: 760 AKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELR 800
G+ + + Q++ A+ G L G + + + R
Sbjct: 702 TAGASQTATFEVVLNQIARADVQGNFWLYPGAYEVALDTTR 742
>gi|398403795|ref|XP_003853364.1| putative xylan 1,4-beta-Xylosidase [Zymoseptoria tritici IPO323]
gi|339473246|gb|EGP88340.1| putative xylan 1,4-beta-Xylosidase [Zymoseptoria tritici IPO323]
Length = 785
Score = 446 bits (1147), Expect = e-122, Method: Compositional matrix adjust.
Identities = 279/778 (35%), Positives = 407/778 (52%), Gaps = 75/778 (9%)
Query: 42 FP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWS 100
FP C+ + C+ + TRA +LI+ T++EKI A +PRLG+PAY WW
Sbjct: 25 FPDCQNGPLKNNTVCDFTADPLTRATALIAAFTIEEKINNTGSTAPGVPRLGLPAYTWWQ 84
Query: 101 ESLHGIASNGPGVNFN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
E+LHG+A + PGVNF+ G TSFPQ ++ A+F+ L ++ + ++ EARA N
Sbjct: 85 EALHGVAQS-PGVNFSDSGDFRYATSFPQPILMGAAFDDDLIKDVATVISTEARAFNNDA 143
Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
++GL +W PNIN F+D RWGRGQETPGEDP +S+Y + QG+
Sbjct: 144 RSGLDYWTPNINPFKDSRWGRGQETPGEDPYHLSSYVKSLIAGLQGDG------------ 191
Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKW-GNFSRYSFNAMITEQDTEDTFQP 277
+ K+V+ A CKH +AYDLE W GNF RY F+ + Q+ + + P
Sbjct: 192 KYKKVV----------------ATCKHFVAYDLETWNGNF-RYQFDPHVGSQELVEYYMP 234
Query: 278 PFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGF---KGYITSDCDAVATI 333
PF++C MCSYN +NG+P C L Q R W + + ++TSDCD++ +
Sbjct: 235 PFQACARDANVGAFMCSYNSLNGIPTCADPYLLQTILREHWNWTSEEQWVTSDCDSIQNV 294
Query: 334 FEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQL 393
+ YT T E++ A LKAG D+NCGT A+ G V EKDID AL+ +S +
Sbjct: 295 YLPHEYTSTREEAVAVSLKAGTDVNCGTYYQEFLPGALSLGLVTEKDIDMALIRQYSSLV 354
Query: 394 RLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
RLG F+G +Y L DV T ++LAL AA +GI LLKND LPL + +A
Sbjct: 355 RLGYFDG--TAVEYRSLSWKDVSTPYAQQLALKAAVEGITLLKND-GILPLAITKDTKIA 411
Query: 454 IIGPLVNNISQMGGGYTGIPC---SPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEA 510
+IG N QM G Y GIP SP + A V+ + G D N+ A
Sbjct: 412 VIGDWANATEQMLGNYDGIPPYLHSPLWAAQQTGANVTYSGNPGGQGDPTTNNWLHIWTA 471
Query: 511 VRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGP 570
V +AD ++ G+D E E DRVS+ G Q+ ++ +A K PVI+ G
Sbjct: 472 V---DEADVILFAGGIDNGVEAEGMDRVSIAWTGAQLDVIGQLASRGK-PVIVAQMGTNG 527
Query: 571 LDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMND 629
+D + + IS++LW GYPG+ G AL +II G P GRLP T YP S+ +KVPM D
Sbjct: 528 VDSTPLLNNQNISALLWGGYPGQDGGVALLDIIQGKSAPAGRLPTTQYPASYISKVPMTD 587
Query: 630 MNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKN 689
M++R +S+ +PGR+Y +Y V+ FG+GL YTN+S +P++ T S S+ A K+
Sbjct: 588 MHLRPNSTTGFPGRTYMWYNEKPVFEFGYGLHYTNFSATI--SPTD-TTSFSI-ADLTKD 643
Query: 690 ILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQ 749
+ R + + +I+VTN G+V +V + F P K+
Sbjct: 644 CTEHYMDRCPFADM-------------KIAVTNTGNVTSDYVTLGFLAGEHGPAPCPNKR 690
Query: 750 LIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIET 807
L+ + R+H + G+ + + L++ R+ +GN VL G ++L I+T
Sbjct: 691 LVNYQRLHNITAGASQTT-------SLNLTLASLARVDDMGNTVLYPGS--YALLIDT 739
>gi|302683060|ref|XP_003031211.1| glycoside hydrolase family 3 protein [Schizophyllum commune H4-8]
gi|300104903|gb|EFI96308.1| glycoside hydrolase family 3 protein [Schizophyllum commune H4-8]
Length = 761
Score = 445 bits (1144), Expect = e-122, Method: Compositional matrix adjust.
Identities = 275/753 (36%), Positives = 404/753 (53%), Gaps = 67/753 (8%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
C+TSL RA++L+ LT+ E I A +PRLG+P Y WW+E+LHG+A++ PGV
Sbjct: 35 CDTSLGHVERARALVEELTVAEMINNTVHTAPGVPRLGLPPYNWWNEALHGVAAS-PGVV 93
Query: 115 FNG---TVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
F SS TSFP + ++F+ +L +G+ + EARA N G AGL +W PNIN
Sbjct: 94 FTSPGEEFSSATSFPMPINMGSAFDDALMLAVGNVTSTEARAFNNAGLAGLDYWTPNINP 153
Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
F+DPRWGRG ETPGEDP+ + Y V+ QG G S
Sbjct: 154 FKDPRWGRGAETPGEDPLHAARYVRTLVEGLQG---------------------GIDPPS 192
Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
L ++A CKH AYDLE WG +RY+F+A++T QD + + PPF+SC+ +A+ +
Sbjct: 193 -----LKVAADCKHWAAYDLEDWGGVARYAFDAVVTPQDLAEYYSPPFKSCVRDARAASV 247
Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGFKG--YITSDCDAVATIFEYQNYTKTHEDSAA 348
MCSYN VNGVPAC L + R+ WG ++TSDCDAV +++ YT+ + +A
Sbjct: 248 MCSYNAVNGVPACASPYLLKTVLRDAWGLAEDRWVTSDCDAVGNVYDPHGYTEDFVNGSA 307
Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
LKAG D++CGT ++ A D+G + E D+ AL L++ + LG F+ P Y
Sbjct: 308 VSLKAGSDLDCGTTYSQYLPEAYDRGLIDEDDLKAALTRLYASLVWLGYFDA-PEDQPYR 366
Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVN-NISQMGG 467
++ DV T + LA AA + VLLKND LPL +++ S+A+IGP+ N + Q+ G
Sbjct: 367 QISWADVNTPAAQALAYTAAIESFVLLKNDGT-LPLTDSSL-SIALIGPMANASAVQLQG 424
Query: 468 GYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLD 527
Y GIP + L+G Y G + V N AV A+ AD VI V G+D
Sbjct: 425 NYNGIPPFAIAPLQGFLDAGFNVTYVLGTN-VTGNDADDIDGAVAAAEAADVVIYVGGID 483
Query: 528 LTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILW 587
T E E +DR + P Q++L++++ K P+++V GGG LD + + +++ILW
Sbjct: 484 STVEEEAKDRTEISWPDNQLALLSALEEAGK-PLVVVQMGGGQLDDTPLKESDAVNAILW 542
Query: 588 IGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK-VPMNDMNMRADSSRQYPGRSYR 646
GYPG++G A+A+ + G P GRL +T YP S+ V M DM +R D+S PGR+Y+
Sbjct: 543 AGYPGQSGGTAIADTVMGKVAPAGRLSITQYPASYVDAVAMTDMTLRPDNSTGNPGRTYK 602
Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEV 706
+YTGT VY +G+GL YTN+S + S E S +++ +D +D
Sbjct: 603 WYTGTPVYPYGYGLHYTNFSVAWASDAPEACYSI-------QDLTSSADGFVDLAPLDT- 654
Query: 707 TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSK-- 764
+++VTN GDV V +LF P K+L+ + R V G
Sbjct: 655 ---------FRVTVTNDGDVASDFVALLFVSTQAGPAPAPMKELVAYARASDVQPGDSTD 705
Query: 765 ---EISFGVDPCEQLSIANKHGRRILPLGNHVL 794
E++ G L+ +++ G L G++ L
Sbjct: 706 VDLEVTLGA-----LARSDESGDASLYPGDYEL 733
>gi|409079872|gb|EKM80233.1| hypothetical protein AGABI1DRAFT_57801 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 767
Score = 443 bits (1140), Expect = e-121, Method: Compositional matrix adjust.
Identities = 274/777 (35%), Positives = 417/777 (53%), Gaps = 64/777 (8%)
Query: 42 FP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWS 100
FP C S C+ + + + RA +LI + T +E +Q + + +PRLG+P+Y+WWS
Sbjct: 24 FPDCVNGPLSSTAVCDPTKAPAARATTLIQMFTDEELMQNTDNVSPGVPRLGVPSYQWWS 83
Query: 101 ESLHGIASNGPGVNF--NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
E+LHG+A + PGV+F +G SS TSFPQ +V ++F+ L + + ++ EARA N
Sbjct: 84 EALHGVAGS-PGVSFAPSGEFSSATSFPQSIVLGSTFDIDLVKAVATVISTEARAFNNFH 142
Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
+AGL ++ PNIN F+DPRWGRGQETPGEDP VS Y + QG G
Sbjct: 143 RAGLDYFTPNINPFKDPRWGRGQETPGEDPFHVSQYVYSLIDGLQG------------GI 190
Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
+ K ++A CKH AYDL+ W R+ F+A ++ QD + + P
Sbjct: 191 DPRPYFK-------------VAADCKHYAAYDLDSWEGIDRFHFDAKVSLQDLSEYYLPS 237
Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKG--YITSDCDAVATIFE 335
F+SC+ K + +MCSYN VNG+PAC L Q R+ WGF ++TSDCDA+ IF
Sbjct: 238 FQSCVRDAKVASVMCSYNSVNGIPACANPYLLQDILRDFWGFDDDRWVTSDCDAIGNIFT 297
Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
N+T T ++ A LKAG D++CGT H A+++ + D++RAL ++ +RL
Sbjct: 298 THNFTDTFAEAVADALKAGTDVDCGTSYSTHLPDALNQSLITRDDLERALTRQYTSLMRL 357
Query: 396 GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAII 455
G F+ P +L DV + + LA AA +G+VLLKND FLP++ + ++AII
Sbjct: 358 GYFD-PPESQPLRQLAWSDVNKPDAQALAHTAAVEGLVLLKND-GFLPVSASG-KTIAII 414
Query: 456 GPLVNNISQMGGGYTGI-PCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIA 514
GP N M G Y G P ++A ++ A+G + S+A F A+ +A
Sbjct: 415 GPYANATKDMQGNYFGTAPFIVTPFQGAVDAGFNEVVSAAGT-SINGTSEADFAAAIAVA 473
Query: 515 KKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
+D +I G++ + E+E +DR+++ G Q+SLV +A K PV++V GGG LD S
Sbjct: 474 NSSDIIIFAGGINNSIESEAKDRLTIAWTGNQLSLVKQLASLGK-PVVVVQFGGGQLDDS 532
Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMR 633
+ + +++W GYPG++G A+ ++I G P GRL +T YPE F +V M DM +R
Sbjct: 533 DLLDNDAVRAVIWAGYPGQSGGTAIFDVITGAVAPAGRLSVTQYPEDFVNQVGMTDMALR 592
Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
S+ PGR+Y++YTG V FGHGL +T + + + P L +DK
Sbjct: 593 PGSAN--PGRTYKWYTGRPVLEFGHGLHFTTFDFSWRGRPGRKYNIQHLLHTADKKF--- 647
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
D + +D FHV I N G++ +V +LF + P+K L+ F
Sbjct: 648 ----PDLIPLDT--------FHVNIR--NTGNITSDYVALLFLKSNAGFAPHPKKSLVSF 693
Query: 754 DRVHTVAKGSKE-ISFGVDPCEQLSIANKHGRRILPLGNHVLMV----GELRHSLTI 805
R H + GS + GV+ ++ ++HG L G++ L++ G L HS ++
Sbjct: 694 ARAHRIDAGSSATVDLGVN-LGSIARVDEHGDSWLFAGDYQLVLDIGDGVLSHSFSL 749
>gi|344302281|gb|EGW32586.1| hypothetical protein SPAPADRAFT_51129 [Spathaspora passalidarum
NRRL Y-27907]
Length = 788
Score = 442 bits (1138), Expect = e-121, Method: Compositional matrix adjust.
Identities = 253/714 (35%), Positives = 384/714 (53%), Gaps = 52/714 (7%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
CN L + RAK+++ L T+ E I + + + + RLG+P Y+WWSE LHGIA + N
Sbjct: 61 CNPYLPNNQRAKAVVDLFTVDELIANMGNTSPGVERLGLPPYQWWSEGLHGIARS----N 116
Query: 115 F--NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
F +G S TSFPQ ++ +FN L+ +G+ + EARA N+G+AGL +++PNIN F
Sbjct: 117 FTASGEYSHATSFPQPILMGGAFNSDLYKQVGNVIGTEARAFNNVGRAGLDYYSPNINPF 176
Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
+DPRWGRGQE E P++V YA+ +V+ Q GGI + +
Sbjct: 177 KDPRWGRGQEVASESPVLVGNYALNYVQGLQ--------GGI--------------DSNP 214
Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
D L ++A CKH YD+E W SR +NA+I++QD D + P F+SC+ KA+ M
Sbjct: 215 NDDTLQVAATCKHFAGYDMESWKQHSRLGYNAIISDQDLADYYFPTFQSCVRDAKAAGAM 274
Query: 293 CSYNQVNGVPACLRGDLFQKARNEWGF---KGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
CSYN +NG+P C + F GF G I SDCD++ +I+ Y + +AA
Sbjct: 275 CSYNAINGIPVCAS-EFFLGTVIREGFDFQNGVIHSDCDSLYSIWNPHLYVQDLGAAAAD 333
Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGK 409
+KAG+D+NCG + A+ + E I ++ +S +RLG F+ P+ KY
Sbjct: 334 GIKAGVDVNCGDTYQNNLGYALGNKTINEDQIRASVTRQYSNLIRLGYFD-SPQTNKYRT 392
Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY 469
DV TS+ +LA AA +GI LLKND LP NK+ V ++A+IGP N + M G Y
Sbjct: 393 YNWSDVSTSQANQLAYQAAVEGITLLKNDGT-LPFNKDKVKNVAVIGPWANATTDMLGDY 451
Query: 470 TGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLT 529
G P S L+G + K YA G + + A+ AK AD ++ G+D +
Sbjct: 452 AGTPPYLISPLQGAQDSGFKVQYAYGTQ-INTTLTTNYTAALNAAKGADAIVYFGGIDNS 510
Query: 530 QETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIG 589
E E DR SL PG Q+ LV+ ++ +K P+++V G G +D + + ++ ++SI++ G
Sbjct: 511 IENEALDRESLAWPGNQLDLVSKLSGLNK-PLVVVQFGAGQVDDTEIKNNNNVNSIVYAG 569
Query: 590 YPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRFY 648
YPG++G A+ +++ G + P GRL T YP S+ +VPM DM +R YPGR++ +Y
Sbjct: 570 YPGQSGGTAIWDVLNGIYAPAGRLSTTQYPASYADQVPMTDMTLRPRDG--YPGRTFMWY 627
Query: 649 TGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTS 708
G VY FG+GL YT +S +AP + A D+ I ++ +D
Sbjct: 628 NGEPVYEFGYGLHYTTFSVSLANAPPK---GAPQSFNIDQFIAAKSSQYVD--------- 675
Query: 709 CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG 762
TSL +++ N G V + +L++ P K L+ FD++H + G
Sbjct: 676 -TSLITTFDVNIKNTGKVTSDYAALLYSNTTSGPGPHPNKILVSFDKLHQIHPG 728
>gi|189203341|ref|XP_001938006.1| beta-xylosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
gi|187985105|gb|EDU50593.1| beta-xylosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
Length = 761
Score = 442 bits (1138), Expect = e-121, Method: Compositional matrix adjust.
Identities = 271/744 (36%), Positives = 395/744 (53%), Gaps = 63/744 (8%)
Query: 64 RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFN--GTVSS 121
RA+SL++L TL+EKI S A +PRLG+P Y+WWSE LHGIA GP NF+ G S
Sbjct: 10 RAQSLVALYTLEEKINATSSGAPGVPRLGVPPYQWWSEGLHGIA--GPYTNFSDSGEWSY 67
Query: 122 VTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQ 181
TSFPQ ++ A+F+ L +++ ++ EARA N + GL FW PNIN FRDPRWGRGQ
Sbjct: 68 STSFPQPILMGAAFDDDLITDVAKVISTEARAFNNANRTGLDFWTPNINPFRDPRWGRGQ 127
Query: 182 ETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSA 241
ETPGED +S+Y + QGE S D KRV+ A
Sbjct: 128 ETPGEDAYHLSSYVQALIHGLQGE---STD-------PYKRVV----------------A 161
Query: 242 CCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGV 301
CKH YD+E W RY + IT+Q+ + + PF++C+ Q MCSYN VNG
Sbjct: 162 TCKHFAGYDVEDWNGNLRYQNDVQITQQELVEYYLAPFQACV-QANVGAFMCSYNAVNGA 220
Query: 302 PACLRGDLFQKA-RNEWGF---KGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDI 357
P C L Q R WG+ + ++T DCDAV ++ ++ T +AA L AG D+
Sbjct: 221 PPCADPYLLQTILREHWGWTNEEQWVTGDCDAVQNVYLPHQWSPTRAGAAADSLVAGTDV 280
Query: 358 NCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCT 417
CGT M H +A + + E +D+AL+ +S +RLG F+ + Y +LG D V T
Sbjct: 281 TCGTYMQEHLPAAFQQKLLNESSLDQALIRQYSSLVRLGYFDASENQ-PYRQLGFDAVAT 339
Query: 418 SEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPK 477
+ + LA AA +GIVLLKND LPL+ ++ ++ + G N SQ+ G Y G+
Sbjct: 340 NASQALARRAAAEGIVLLKNDGT-LPLSLDSSVTVGLFGDWANATSQLLGNYAGVATYLH 398
Query: 478 SLLRGLEAYVSKTHYASGC----HDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETE 533
S L LE K +YA G D N + + A +D +I V G+D + E E
Sbjct: 399 SPLYALEQTGVKINYAGGNPGGQGDPTTNRWSNLYGAY---STSDVLIYVGGIDNSVEEE 455
Query: 534 DRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGE 593
RDR L G Q+ ++ +A T K PVI+V+TGGG +D S + IS+I+W GYPG+
Sbjct: 456 GRDRGYLTWTGAQLDVIGQLADTGK-PVIVVVTGGGQIDSSPLVNNPNISAIMWAGYPGQ 514
Query: 594 AGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRFYTGTQ 652
G A+ +II G P GRLP T YP ++T V M +MN+R + PGR+Y++Y G+
Sbjct: 515 DGGSAIIDIIGGKTAPAGRLPQTQYPANYTAAVSMMNMNLRPGENS--PGRTYKWYNGSA 572
Query: 653 VYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSL 712
+ FG+G+ YTN+S + + + +SL +G + TG L+ C
Sbjct: 573 TFEFGYGMHYTNFSAEITTQMQQSYAISSLASGCNS-----TGGFLE--------RCPFA 619
Query: 713 RFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDP 772
+VQ V N G+V ++ + + P K L+ + R+H++A G+ +
Sbjct: 620 SVNVQ--VHNTGNVTSDYITLGYMAGTFGPAPHPRKTLVSYKRLHSIAGGATSTATLNLT 677
Query: 773 CEQLSIANKHGRRILPLGNHVLMV 796
L+ ++HG ++L G++ L +
Sbjct: 678 LASLARVDEHGNKVLYPGDYSLQI 701
>gi|125534110|gb|EAY80658.1| hypothetical protein OsI_35835 [Oryza sativa Indica Group]
Length = 511
Score = 442 bits (1137), Expect = e-121, Method: Compositional matrix adjust.
Identities = 236/493 (47%), Positives = 313/493 (63%), Gaps = 8/493 (1%)
Query: 320 KGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEK 379
+ Y+ SDCDAVATI + +YT + ED+ A +KAGMD+NCG H +A+ KG + EK
Sbjct: 14 RKYVASDCDAVATIRDAHHYTLSPEDTVAVSIKAGMDVNCGNYTQVHAMAAVQKGNLTEK 73
Query: 380 DIDRALLNLFSVQLRLGLFNGDPRKGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKND 438
DIDRAL+NLF+V++RLG F+GDPR YG LG DVC+ HK LAL+AA+ GIVLLKND
Sbjct: 74 DIDRALVNLFAVRMRLGHFDGDPRSNAVYGHLGAADVCSPAHKSLALEAAQDGIVLLKND 133
Query: 439 KKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVS-KTHYASGCH 497
LPL +AV+SLA+IGP +N+ + G Y G PC + L+G++ Y+ + + +GC
Sbjct: 134 AGALPLQPSAVTSLAVIGPNADNLGALHGNYFGPPCETTTPLQGIKGYLGDRARFLAGC- 192
Query: 498 DVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTS 557
D P + A +EA +A +D V++ GL QE E DR SLLLPG+Q L+T+VA +
Sbjct: 193 DSPACAVAATNEAAALASSSDHVVLFMGLSQKQEQEGLDRTSLLLPGEQQGLITAVANAA 252
Query: 558 KRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTW 617
+RPVILVL GGP+DV+FA+ + +I +IL GYPG+AG A+A+++FGD NP GRLP+TW
Sbjct: 253 RRPVILVLLTGGPVDVTFAKDNPKIGAILLAGYPGQAGGLAIAKVLFGDHNPSGRLPVTW 312
Query: 618 YPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELT 677
YPE FTKVPM DM MRAD + YPGRSYRFY G VY FG+GLSY+ +S + S+ S
Sbjct: 313 YPEEFTKVPMTDMRMRADPATGYPGRSYRFYQGNTVYNFGYGLSYSKFSRRMFSSFSTSN 372
Query: 678 I-SASLKAGSDKNILQQTGSRLDYVHIDE--VTSCTSLRFHVQISVTNAGDVDGSHVVML 734
+ SL AG G + + E V C+ L F + V N G +DG H V++
Sbjct: 373 AGNLSLLAGVMARRAGDDGGGMSSYLVKEIGVERCSRLVFPAVVEVQNHGPMDGKHSVLM 432
Query: 735 FARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVL 794
+ R P S G P +QLIGF H +SF V PCE S + G R++ G H L
Sbjct: 433 YLRWPTKSGGRPARQLIGFRSQHVKVGEKAMVSFEVSPCEHFSWVGEDGERVIDGGAHFL 492
Query: 795 MVG--ELRHSLTI 805
MVG EL SL +
Sbjct: 493 MVGDEELETSLAL 505
>gi|302683012|ref|XP_003031187.1| glycoside hydrolase family 3 protein [Schizophyllum commune H4-8]
gi|300104879|gb|EFI96284.1| glycoside hydrolase family 3 protein [Schizophyllum commune H4-8]
Length = 752
Score = 440 bits (1131), Expect = e-120, Method: Compositional matrix adjust.
Identities = 269/735 (36%), Positives = 389/735 (52%), Gaps = 61/735 (8%)
Query: 42 FP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWS 100
FP C S P C+ SL RA++L+ T+ E I + A +PRLG+P YEWW+
Sbjct: 22 FPDCVNGPLASNPVCDASLGHVERARALVEEFTVPEMINNTVNAAFGVPRLGLPPYEWWN 81
Query: 101 ESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA 160
E+LHG+ + V F + TSFP + ++F+ +L +G ++ EARA N G+A
Sbjct: 82 EALHGVGLSPGVVFFEPEPAVATSFPMPINMGSAFDDALMLAMGDVISTEARAFSNAGRA 141
Query: 161 GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFRE 220
GL +W PNIN F+DPRWGRG ETPGEDP+ + Y V+ QG
Sbjct: 142 GLDYWTPNINPFKDPRWGRGAETPGEDPLHAARYVRSLVEGLQG---------------- 185
Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFR 280
G S L ++A CKH AYDLE WG +RY+F+A++T QD + + PPFR
Sbjct: 186 -----GIDPPS-----LKVAAACKHWAAYDLENWGGVTRYAFDAVVTPQDLAEYYAPPFR 235
Query: 281 SCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKG--YITSDCDAVATIFEYQ 337
SC+ +A+ MCSYN VNGVPAC L + R+ WG ++TSDC AV +++
Sbjct: 236 SCVRDARAASAMCSYNAVNGVPACASPYLLKTVLRDAWGLAEDRWVTSDCGAVGNVYDPH 295
Query: 338 NYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
YT+ +++ LKAG D+NCGT ++ A D+G + E D+ AL L++ + LG
Sbjct: 296 GYTEDLVNASTVSLKAGTDLNCGTNYTQYLPEAYDRGLIDEDDLKAALTRLYASLVWLGY 355
Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
F+ P Y ++ DV T E + LA AA + VLLKND LPL + + SLA+IGP
Sbjct: 356 FDA-PEDQPYRQITWADVNTPEAQALAYTAAIKSFVLLKNDGT-LPLTDSTL-SLALIGP 412
Query: 458 LVNNIS-QMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKK 516
+ N + QM G Y GIP + L+G Y G + V N F AV A+
Sbjct: 413 MANASALQMLGNYFGIPPFVIAPLQGFLDAGFNVTYVLGTN-VTGNDAGSFDAAVAAAEA 471
Query: 517 ADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFA 576
AD VI V G+D T E E++DR + P Q++L++++ K P+++V GGG LD +
Sbjct: 472 ADVVIYVGGIDNTLEMEEKDRTEISWPDNQLALLSALEGVGK-PLVVVQMGGGQLDDTPL 530
Query: 577 EADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADS 636
+ +++ILW GYPG++G A+A+ + G P GRL +V M DM +R D+
Sbjct: 531 KESDAVNAILWAGYPGQSGGTAIADTVTGKVAPAGRL-------YVDEVAMTDMTLRPDN 583
Query: 637 SRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGS 696
+ PGR+Y++YTGT VY +G+GL YTN S + S E S I TG
Sbjct: 584 ATGNPGRTYKWYTGTPVYPYGYGLHYTNISVAWASDAPEACYS----------IQDLTGE 633
Query: 697 RLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRV 756
+V + + + +++VTN GD+ V +LF P K+++ + R
Sbjct: 634 ASGFVDLAPLDT-------FRVTVTNEGDIASDFVALLFVSTQAGPAPAPIKEMVAYARA 686
Query: 757 HTVAKG-SKEISFGV 770
V G S E+ V
Sbjct: 687 SDVQPGNSTEVELEV 701
>gi|67902828|ref|XP_681670.1| hypothetical protein AN8401.2 [Aspergillus nidulans FGSC A4]
gi|74592887|sp|Q5ATH9.1|BXLB_EMENI RecName: Full=Exo-1,4-beta-xylosidase bxlB; AltName:
Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
Flags: Precursor
gi|40747867|gb|EAA67023.1| hypothetical protein AN8401.2 [Aspergillus nidulans FGSC A4]
gi|259484335|tpe|CBF80465.1| TPA: beta-1,4-xylosidase (Eurofung) [Aspergillus nidulans FGSC A4]
Length = 763
Score = 439 bits (1129), Expect = e-120, Method: Compositional matrix adjust.
Identities = 282/775 (36%), Positives = 412/775 (53%), Gaps = 73/775 (9%)
Query: 35 PVLNKPDFP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGI 93
P L + ++P C P C+TSLS RAKSL+S LTL+EKI A+ RLG+
Sbjct: 18 PALCQSNYPDCTTGPLSELPICDTSLSPLERAKSLVSALTLEEKINNTGHEAAGSSRLGL 77
Query: 94 PAYEWWSESLHGIASNGPGVNF--NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEA 151
PAY WW+E+LHG+A GV+F +G S TSFP +V A+FN +L + ++ EA
Sbjct: 78 PAYNWWNEALHGVAEKH-GVSFEESGDFSYATSFPAPIVLGAAFNDALIRRVAEIISTEA 136
Query: 152 RAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDD 211
RA N AG+ +W PN+N F+DPRWGRGQETPGEDP+ S Y EFV QG
Sbjct: 137 RAFSNSDHAGIDYWTPNVNPFKDPRWGRGQETPGEDPLHCSRYVKEFVGGLQG------- 189
Query: 212 GGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDT 271
D ++ + A CKHL AYDLE+WG SR+ F+A ++ D
Sbjct: 190 --------------------DDPEKPKVVATCKHLAAYDLEEWGGVSRFEFDAKVSAVDL 229
Query: 272 EDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKG---YITSDC 327
+ + PPF++C MCSYN +NGVPAC L Q R WG++G ++T DC
Sbjct: 230 LEYYLPPFKTCAVDASVGAFMCSYNALNGVPACADRYLLQTVLREHWGWEGPGHWVTGDC 289
Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLN 387
AV I Y +Y ++ ++AA L AG+D++CGT + + A +G + + +D AL
Sbjct: 290 GAVERIQTYHHYVESGPEAAAAALNAGVDLDCGTWLPSYLGEAERQGLISNETLDAALTR 349
Query: 388 LFSVQLRLGLFNGDPRKGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNK 446
L++ ++LG F DP +G+ LG DDV TSE ++LA A QG VLLKN LPL
Sbjct: 350 LYTSLVQLGYF--DPAEGQPLRSLGWDDVATSEAEELAKTVAIQGTVLLKNIDWTLPLKA 407
Query: 447 NAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAG 506
N +LA+IGP +N +++ Y G +++ E A G +V S G
Sbjct: 408 NG--TLALIGPFINFTTELQSNYAGPAKHIPTMIEAAERLGYNVLTAPGT-EVNSTSTDG 464
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
F +A+ IA +AD +I G+D T E E DR + PG Q L+ +A RP+ +V
Sbjct: 465 FDDALAIAAEADALIFFGGIDNTVEEESLDRTRIDWPGNQEELILELAELG-RPLTVVQF 523
Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KV 625
GGG +D S A + + +I+W GYP +AG + +++ G P GRLP+T YP+S+ +V
Sbjct: 524 GGGQVDDSALLASAGVGAIVWAGYPSQAGGAGVFDVLTGKAAPAGRLPITQYPKSYVDEV 583
Query: 626 PMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAG 685
PM DMN++ + PGR+YR+Y V FG GL YT ++ + +A+L G
Sbjct: 584 PMTDMNLQPGTDN--PGRTYRWYE-DAVLPFGFGLHYTTFNVSWAKKAFGPYDAATLARG 640
Query: 686 SDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKV-SQG 744
+ + +++ ++VTN GDV +V ++FA P++ +Q
Sbjct: 641 KNPS--------------------SNIVDTFSLAVTNTGDVASDYVALVFASAPELGAQP 680
Query: 745 TPEKQLIGFDRVHTVAKGSK---EISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
P K L+G+ R + G ++ V P L+ A + GR +L G + L+V
Sbjct: 681 APIKTLVGYSRASLIKPGETRKVDVEVTVAP---LTRATEDGRVVLYPGEYTLLV 732
>gi|389748262|gb|EIM89440.1| hypothetical protein STEHIDRAFT_182874, partial [Stereum hirsutum
FP-91666 SS1]
Length = 772
Score = 439 bits (1129), Expect = e-120, Method: Compositional matrix adjust.
Identities = 266/755 (35%), Positives = 405/755 (53%), Gaps = 61/755 (8%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
CNT+ RA SLI L + + + + + RLG+P Y+WW+E+LHG+ S+ PGVN
Sbjct: 39 CNTTAHFVDRATSLIEEFNLTDLVNNTVNGSPGVDRLGLPPYQWWNEALHGVGSS-PGVN 97
Query: 115 F----NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNIN 170
+ + +S TSFP ++ A+FN SL ++I ++ EARA N AGLTF+ PNIN
Sbjct: 98 WGSGPDANFTSATSFPAPILLGATFNDSLIASIADVISTEARAFNNFNYAGLTFFTPNIN 157
Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
FRDPRWGRGQETPGEDP +S Y ++V QG G K
Sbjct: 158 PFRDPRWGRGQETPGEDPYHLSRYVYQYVVGLQG------------GLSPDPYYK----- 200
Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
+ A CKH++AYD+E W R FNA++T QD + + P F+ C+ + +
Sbjct: 201 --------VLANCKHVLAYDVENWEGNDRTGFNAVVTTQDLSEFYTPSFQGCLRDAQGAS 252
Query: 291 IMCSYNQVNGVPACLRGDLFQK-ARNEWGF---KGYITSDCDAVATIFEYQNYTKTHEDS 346
MCSYN VNGVP+C + + R+ WG +G+IT DC AV I++ YT T ++
Sbjct: 253 AMCSYNAVNGVPSCASSYILKDLVRDFWGLGEREGWITGDCGAVQNIYQPHGYTDTLVNA 312
Query: 347 AAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK 406
A + AG D++CG + +A+ +G + I AL+ L+ +RLG F DP + +
Sbjct: 313 TAVAMDAGTDLDCGDVYSPNLWTAVVEGLITAGQIQTALIRLYGSLIRLGYF--DPAEQQ 370
Query: 407 -YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQM 465
Y +V T + LA +AA QGIVLL+ND LPL+ N V ++A+IGP+ N +
Sbjct: 371 PYRSFDWSNVNTPSSQDLAYNAAVQGIVLLEND-GLLPLSTN-VKNIALIGPMANATLSL 428
Query: 466 GGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG 525
G Y GI S + E +A G + + ++G+ EA+ A+ AD V+ V G
Sbjct: 429 QGNYAGIAPFVISPQQAFETAGYNVTFAFGT-GISNSDNSGYSEALEAAQGADVVVFVGG 487
Query: 526 LDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSI 585
+D + E E +DR S+ PG Q+ L+ + K P+++V GGG D S +A++ ++++
Sbjct: 488 IDNSIEAEGQDRTSIEWPGSQLDLIGQLGELGK-PLVVVRMGGGQCDDSTLKANATVNAL 546
Query: 586 LWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRS 644
LW GYPG++G AL +II G +P GRLP+T YP S+ +++ M DM +R +SS PGR+
Sbjct: 547 LWAGYPGQSGGTALVDIISGKQSPSGRLPVTQYPSSYVSEIDMTDMAIRPNSSGS-PGRT 605
Query: 645 YRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHID 704
Y++YTG +Y FG+G+ YT + + + S + + ++K +G D +D
Sbjct: 606 YKWYTGAPIYPFGYGIHYTTFRLAWSDSSSTTYNIQDIVSSANK-----SGGFADTEILD 660
Query: 705 EVTSCTSLRFHVQISVTNAG-DVDGSHVVMLFARVPKVSQGTPEKQLIGFDRV-HTVAKG 762
+ + VTN G + +V +LFA P ++L+G+ RV H G
Sbjct: 661 TFS----------LLVTNTGSNYTSDYVALLFANSTSGPSPAPLQELVGYTRVPHITPGG 710
Query: 763 SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
+ V +S +++G IL G + L VG
Sbjct: 711 TATAELNVT-LGSISRVDENGNWILYPGTYNLWVG 744
>gi|222618262|gb|EEE54394.1| hypothetical protein OsJ_01415 [Oryza sativa Japonica Group]
Length = 776
Score = 438 bits (1126), Expect = e-120, Method: Compositional matrix adjust.
Identities = 281/796 (35%), Positives = 398/796 (50%), Gaps = 110/796 (13%)
Query: 44 CKPPHFDS-------YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAY 96
C P F + +P+C+ SL + R + L+ +TL+EK+ L D A PR+G+P Y
Sbjct: 47 CDPARFAAAGLDMAGFPYCDASLPYADRVRDLVGRMTLEEKVANLGDRAGGAPRVGLPRY 106
Query: 97 EWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEAR---- 152
P V + L + R + + AVA R
Sbjct: 107 CGGGRRCTAC----PTSARRDVVWRRRARRHQLPARHQQRRVVQRD---AVARHRRRGVD 159
Query: 153 ---AMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
MYNLG A LT+W+PNIN+ RDPRWGR ETPGEDP VV YAV FV+ Q
Sbjct: 160 GDQGMYNLGHAELTYWSPNINVVRDPRWGRASETPGEDPFVVGRYAVNFVRGMQ------ 213
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
D G F + +S+CCKH
Sbjct: 214 DIDGATTAASAAAATDAFSRP------IKVSSCCKHY----------------------- 244
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCD 328
A+C+MCSYN++NGVPAC L + R +W GYI SDCD
Sbjct: 245 ------------------AACVMCSYNRINGVPACADARLLTETVRRDWQLHGYIVSDCD 286
Query: 329 AVATIF---EYQNYTKTHEDSAAGVLKAGMDINCGTC-------MLRHTQSAIDKGKVQE 378
+V + ++ YT +AA +KAG+D++CG + A+ +GK++E
Sbjct: 287 SVRVMVRDAKWLGYTGVEATAAA--MKAGLDLDCGMFWEGVHDFFTTYGVDAVRQGKLKE 344
Query: 379 KDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKND 438
+D AL NL+ +RLG F+G P + LG DVCT EHK+LA DAARQG+VLLKND
Sbjct: 345 SAVDNALTNLYLTLMRLGFFDGIP---ELESLGAADVCTEEHKELAADAARQGMVLLKND 401
Query: 439 KKFLPLNKNAVSSLAIIGPL--VNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGC 496
LPL+ V+S+A+ G L +N M G Y G PC + G+ VS T C
Sbjct: 402 AALLPLSPEKVNSVALFGQLQHINATDVMLGDYRGKPCRVVTPYDGVRKVVSSTS-VHAC 460
Query: 497 HDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVART 556
C++ A AK D IVVAGL+++ E E DR LLLP Q S + +VA
Sbjct: 461 DKGSCDT------AAAAAKTVDATIVVAGLNMSVERESNDREDLLLPWSQASWINAVAEA 514
Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
S P++LV+ G +DVSFA+ + +I +++W GYPGE G A+A+++FG +NPGGRLP+T
Sbjct: 515 SPSPIVLVIMSAGGVDVSFAQDNPKIGAVVWAGYPGEEGGTAIADVLFGKYNPGGRLPLT 574
Query: 617 WYP-ESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQV-YGFGHGLSYTNYSYKFLSAPS 674
WY E +K+PM M +R D+ YPGR+Y+FY G V Y FGHGLSYTN++Y +A +
Sbjct: 575 WYKNEYVSKIPMTSMALRPDAEHGYPGRTYKFYGGADVLYPFGHGLSYTNFTYASATAAA 634
Query: 675 ELTISASLKAGSDKNILQQT-----GSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGS 729
+T+ K G+ + Q T S ++ + ++V N G DG+
Sbjct: 635 PVTV----KVGAWEYCKQLTYKAGVSSPPACPAVNVASHACQEEVSFAVTVANTGGRDGT 690
Query: 730 HVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPL 789
HVV ++ P G P KQL+ F RV A + E++F ++ C+ +I + ++P
Sbjct: 691 HVVPMYTAPPAEVDGAPRKQLVAFRRVRVAAGAAVEVAFALNVCKAFAIVEETAYTVVPS 750
Query: 790 GNHVLMVGELRHSLTI 805
G ++VG+ SL+
Sbjct: 751 GVSRVLVGDDALSLSF 766
>gi|242216161|ref|XP_002473890.1| beta-xylosidase [Postia placenta Mad-698-R]
gi|220726990|gb|EED80923.1| beta-xylosidase [Postia placenta Mad-698-R]
Length = 741
Score = 438 bits (1126), Expect = e-120, Method: Compositional matrix adjust.
Identities = 276/751 (36%), Positives = 403/751 (53%), Gaps = 62/751 (8%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
C+TS + RA +LISL TL+EKI + A +PRLG+PAY+WW E+LHG+A + PGV
Sbjct: 34 CDTSATPLERATALISLFTLEEKINNTGNTAPGVPRLGLPAYQWWQEALHGVAES-PGVI 92
Query: 115 F--NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
F +G S TSFPQ ++ A+F+ +L +++ + V+ EARA N ++G+ FW PNIN F
Sbjct: 93 FAPSGEYSYATSFPQPILMGAAFDDALINHVATIVSTEARAFNNANRSGIDFWTPNINPF 152
Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
+DPRWGRGQETPGEDP + +Y + QG G KR++
Sbjct: 153 KDPRWGRGQETPGEDPFHLQSYVYNLITGLQG----------GLDPEYKRIV-------- 194
Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
A CKH AYDLE W RY F+A+++ QD + + FR+C M
Sbjct: 195 --------ATCKHFAAYDLENWEGNVRYGFDALVSLQDLSEFYTRSFRTCARDANVGSFM 246
Query: 293 CSYNQVNGVPACLRGDLFQKA-RNEWGFKG---YITSDCDAVATIFEYQNYTKTHEDSAA 348
CSYN VNGVP+C L Q R+ WG+ YITSDCDA+ I+E YT T ++ A
Sbjct: 247 CSYNAVNGVPSCANSYLLQDILRDHWGWTNEDQYITSDCDAIQNIYEPHYYTATRAETVA 306
Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-Y 407
L AG D++CG + +A D+G E ++RAL+ ++ ++LG F DP + Y
Sbjct: 307 DALNAGTDLDCGEYYPENLGAAYDQGLFTESTLNRALIRQYAALVKLGYF--DPADIQPY 364
Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
++G +V T E ++LA AA +GI LLKND LPL+ ++ ++A+IGP N +QM G
Sbjct: 365 RQIGWANVSTPEAEELAYTAAVEGITLLKNDGT-LPLSP-SIKTIALIGPWANATTQMQG 422
Query: 468 GYTGIPCSPKSLLRGLEAYVSKTHYASGCH-DVPCNSDAGFHEAVRIAKKADFVIVVAGL 526
Y G+ S L E +Y++G D P S F A A+ AD +I G+
Sbjct: 423 NYYGVAPYLISPLMAAEELGFTVYYSAGPGVDDPTTSS--FPAAFAAAEAADAIIYAGGI 480
Query: 527 DLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSIL 586
D+T E E DR +L PG Q + ++ K P+I++ GGG +D S + +++++
Sbjct: 481 DITVEAEAMDRYTLDWPGVQPDFIDQLSLLGK-PLIVLQFGGGQIDDSALLPNPGVNALV 539
Query: 587 WIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSY 645
W GYPG++G KA+ +II G+ P GRLP+T YP + +V M DM++R + PGR+Y
Sbjct: 540 WGGYPGQSGGKAIMDIIVGNAAPAGRLPITQYPLDYVYQVAMTDMSLRPSPTN--PGRTY 597
Query: 646 RFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDE 705
+YTGT + FG GL YT ++ LS PS + D L S + + +
Sbjct: 598 MWYTGTPIVEFGFGLHYTTFTAS-LSQPSAPSY--------DIATLVSLCSGVAHPDLCP 648
Query: 706 VTSCTSLRFHVQISVTNAG-DVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSK 764
S T+ +VTN G V V +LF P K L+ +DR+H +A +
Sbjct: 649 FASYTA-------NVTNTGSSVTSDFVSLLFLAGEHGPAPYPNKVLVAYDRLHAIAPLAS 701
Query: 765 EISFGVDPCEQLSIANKHGRRILPLGNHVLM 795
+ + LS + +G IL G + L+
Sbjct: 702 QTTTLNLTLGSLSRVDDYGNTILYPGEYTLI 732
>gi|194700280|gb|ACF84224.1| unknown [Zea mays]
Length = 452
Score = 437 bits (1124), Expect = e-119, Method: Compositional matrix adjust.
Identities = 217/450 (48%), Positives = 299/450 (66%), Gaps = 13/450 (2%)
Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDD 414
MD+NCG+ + H SA+ +GK+ E+DI+RAL NLF+V++RLGLFNGDPR+ YG +GPD
Sbjct: 1 MDVNCGSYVQDHGASALQQGKITEQDINRALHNLFAVRMRLGLFNGDPRRNLYGDIGPDQ 60
Query: 415 VCTSEHKKLALDAARQGIVLLKND--KKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGI 472
VCT EH+ LAL+AA+ GIVLLKND LPL+K V+SLA+IG N+ ++ G Y G
Sbjct: 61 VCTQEHQDLALEAAQDGIVLLKNDGGAGALPLSKPNVASLAVIGFNANDAIRLRGNYFGP 120
Query: 473 PCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQET 532
PC + L+ L+ YV T + +GC+ CN EAV+ A AD V++ GLD QE
Sbjct: 121 PCVTVTPLQVLQGYVKDTSFVAGCNSAACNVTT-IPEAVQAASSADSVVLFMGLDQDQER 179
Query: 533 EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPG 592
E+ DR+ L LPGQQ +L+ SVA +K+PVILVL GGP+DVSFA+ + +I +ILW GYPG
Sbjct: 180 EEVDRLDLTLPGQQQTLIESVANAAKKPVILVLLCGGPVDVSFAKTNPKIGAILWAGYPG 239
Query: 593 EAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQ 652
EAG A+A+++FG+ NPGGRLP+TWYP+ FT+VPM DM MRAD + YPGR+YRFY G
Sbjct: 240 EAGGIAIAQVLFGEHNPGGRLPVTWYPQDFTRVPMTDMRMRADPATGYPGRTYRFYRGPT 299
Query: 653 VYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTS--CT 710
V+ FG+GLSY+ YS++F + P + A LKA ++ T + ++ + S C
Sbjct: 300 VFNFGYGLSYSKYSHRFATKPPPTSNVAGLKA------VEATAGGMASYDVEAIGSETCD 353
Query: 711 SLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT--PEKQLIGFDRVHTVAKGSKEISF 768
L+F + V N G +DG H V++F R P + G+ P QLIGF +H A + + F
Sbjct: 354 RLKFPAVVRVQNHGPMDGKHSVLVFMRWPNATDGSGRPASQLIGFQSLHLRATQTAHVEF 413
Query: 769 GVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
V PC+ S A + GR+++ G+H +MVGE
Sbjct: 414 EVSPCKHFSRATEDGRKVIDQGSHFVMVGE 443
>gi|402225863|gb|EJU05924.1| hypothetical protein DACRYDRAFT_113532 [Dacryopinax sp. DJM-731
SS1]
Length = 778
Score = 437 bits (1124), Expect = e-119, Method: Compositional matrix adjust.
Identities = 260/726 (35%), Positives = 388/726 (53%), Gaps = 61/726 (8%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
C+++L TRA++L+ +LT+ EK + + +PRLG+P Y WWSE LHG+AS+ PGV
Sbjct: 42 CDSALDPLTRARALVGMLTMAEKFNNTVNASPGVPRLGLPPYNWWSEGLHGVASS-PGVT 100
Query: 115 F---NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
F S TSFP+ ++ A+F+ +L +I + ++ EARA N +GL FW PNIN
Sbjct: 101 FAPAGQNFSYATSFPEPILMGAAFDDNLIYDIATIISTEARAFNNFNHSGLDFWTPNINP 160
Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
RDPRWGR ETPGEDP +++Y + V G+ FG + + K
Sbjct: 161 VRDPRWGRSLETPGEDPFHLASYVAKLVT------------GLQFGGDDPKYQK------ 202
Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
L A CKH YDLE WG ++RY F+A+I+ QD + F PPF++C + +
Sbjct: 203 -------LVATCKHYAGYDLENWGGYARYGFDAVISNQDLVEYFLPPFQTCARDVNVTSV 255
Query: 292 MCSYNQVNGVPACLRGDLFQK-ARNEWGFK--------GYITSDCDAVATIFEYQNYTKT 342
MCSYN VNG+P+C L Q R WG++ Y+TSDCDAV+ I+ NYT T
Sbjct: 256 MCSYNAVNGIPSCANDYLLQSLLRTYWGWEPDSESLNAHYVTSDCDAVSNIYYPHNYTIT 315
Query: 343 HEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDP 402
E + A LKAG D++CGT S+ ++G + DIDRAL+ ++ LG F DP
Sbjct: 316 PEQAVAVSLKAGTDLDCGTFYAEWLPSSYEQGLFHQTDIDRALIRSYAALFLLGYF--DP 373
Query: 403 RKGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
+G+ Y + ++ T ++LA AA +GI LLKN LPL + ++++A+IGP N
Sbjct: 374 AEGQIYRQYNWANINTDYAQQLAYTAAWEGITLLKNIDDMLPL-PSTMTNIALIGPWANA 432
Query: 462 ISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVI 521
+QM G Y GI S L L+ Y G ++ NS AGF A+ A+ AD +
Sbjct: 433 TTQMQGNYQGIAPFLHSPLYALQQRGINVTYVLGT-NITSNSTAGFAAALAAAQTADLTL 491
Query: 522 VVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQ 581
+ G+D+T E E DRV++ PG Q+ L+ +A S +I+ GGG +D + + +
Sbjct: 492 YIGGIDITVEAEAMDRVNITWPGNQLDLIAQLANVSTH-LIVYQMGGGQIDDTVLLENPK 550
Query: 582 ISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQY 640
+ +LW GYPG+ G A+ +I++G P GRLP++ YP +F +VPM D MR +
Sbjct: 551 VHGLLWGGYPGQDGGTAMIDILYGSRAPAGRLPLSQYPANFINEVPMTD--MRLHPALGT 608
Query: 641 PGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDY 700
PGR+Y++Y+G V FG+GL YT ++ L S S + N +Q+ + LD
Sbjct: 609 PGRTYKWYSGDLVLPFGYGLHYTTFAKAALKDHSP----RSSDIATLVNEAKQSSAWLDK 664
Query: 701 VHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVA 760
D + VTN G + +V + + P+ L+ + R+ V
Sbjct: 665 AFFDVFAA----------EVTNTGSLTSDYVALGYLTGEFGPAPYPKSSLVSYTRLSQVT 714
Query: 761 KGSKEI 766
G ++
Sbjct: 715 PGETQV 720
>gi|296439595|sp|A1CCL9.2|BXLB_ASPCL RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
Flags: Precursor
Length = 771
Score = 437 bits (1124), Expect = e-119, Method: Compositional matrix adjust.
Identities = 284/770 (36%), Positives = 407/770 (52%), Gaps = 73/770 (9%)
Query: 42 FP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWS 100
FP C C+TS ++TRA+SL+ ++ EK+ A +PRLG+PAY WWS
Sbjct: 29 FPDCTSGPLSKLAVCDTSRDVTTRAQSLVDAMSFAEKVNNTQYEAPGVPRLGLPAYNWWS 88
Query: 101 ESLHGIASNGPGVNF--NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
E+LHG+A PGV+F +G S TSF Q ++ ASF+ L + + V E RA N G
Sbjct: 89 EALHGVAG-APGVHFADSGPFSYATSFAQPILLGASFDDELVKQVATVVGTEGRAFGNAG 147
Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
+AGL +W PNIN FRDPRWGRGQETPGEDP+ VS Y V Q GGIG
Sbjct: 148 RAGLDYWTPNINPFRDPRWGRGQETPGEDPLHVSRYVYHLVDGLQ--------GGIGPAR 199
Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
+ ++A CKH AYD+E W SR+ F+A ++ QD + + P
Sbjct: 200 PQ------------------IAATCKHFAAYDMEDWNGVSRHEFDARVSTQDLAEFYLPS 241
Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFK---GYITSDCDAVATIF 334
F+SC+ + +MCSYN +NGVP C L Q R W + ++ SDC A+ I+
Sbjct: 242 FKSCVRDAQVDAVMCSYNALNGVPTCADPYLLQTLLREHWDWDQPGHWVVSDCGAIDDIY 301
Query: 335 EYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
NYTKT ++AA L AG D++CGT +H A ++G + +DRAL+ L+S ++
Sbjct: 302 IGHNYTKTGAEAAAVALNAGTDLDCGTVFPKHLGEAAEQGLYTNQTLDRALVRLYSSLVK 361
Query: 395 LGLFNGDP-RKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
LG F DP K YG +G DV T ++LA AA +GIVLLKND+ LPL A +LA
Sbjct: 362 LGYF--DPAEKQPYGSIGWKDVDTPAAEQLAHKAAVEGIVLLKNDQT-LPL--KAKGTLA 416
Query: 454 IIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRI 513
+IGP N QM G Y G P ++L + + Y+ G + +S AGF A+
Sbjct: 417 LIGPYANATKQMQGNYQGPPKYIRTLEWAATQHGYQVQYSPGTA-INNSSTAGFAAALAA 475
Query: 514 AKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
AK AD V+ G+D T E+E DR ++ PG Q+SL++ ++ K P+I++ GGG +D
Sbjct: 476 AKDADVVLYAGGIDNTIESETLDRTTITWPGNQLSLISELSNLHK-PLIVIQFGGGQVDD 534
Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNM 632
+ + ++++LW GYP + G A+ +I+ G P GRLP+T YP ++T +VPM +M +
Sbjct: 535 TPLLTNPHVNALLWAGYPSQEGGAAIFDILTGKAAPAGRLPITQYPAAYTAQVPMTEMGL 594
Query: 633 RADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQ 692
RA PGR+YR+Y V FG GL YT++ + +A+L +
Sbjct: 595 RAGGDN--PGRTYRWYD-KAVVPFGFGLHYTSFEVSWDRGRLGPYNTAALVNRAPG---- 647
Query: 693 QTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVS-QGTPEKQLI 751
GS +D D F VQ V N G V +V +LF + + P K L+
Sbjct: 648 --GSHVDRALFDT--------FRVQ--VQNTGTVTSDYVALLFVKTEDAGPEPYPLKTLV 695
Query: 752 GFDRVHTVAKGSK-----EISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
G+ RV V G + E++ G ++ +G +L G + L V
Sbjct: 696 GYTRVQQVKPGERRSVEIEVTLGA-----MARTAANGDLVLYPGKYTLQV 740
>gi|347832625|emb|CCD48322.1| glycoside hydrolase family 3 protein [Botryotinia fuckeliana]
Length = 772
Score = 437 bits (1124), Expect = e-119, Method: Compositional matrix adjust.
Identities = 275/753 (36%), Positives = 397/753 (52%), Gaps = 65/753 (8%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
C+TS TRA +LISL TL EK+ + + +PR+G+P+YEWW+E+LHGIA + PG
Sbjct: 34 CDTSSDPYTRAAALISLFTLAEKVNNTGNTSPGVPRIGLPSYEWWNEALHGIARS-PGTT 92
Query: 115 FNGTVSSV---TSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
F T S+ TSFPQ ++ A+F+ L + + V+ EARA N+ + GL FW PNIN
Sbjct: 93 FAATGSNYSYSTSFPQPILMGATFDDELIHKVATQVSTEARAFNNVNRFGLNFWTPNINP 152
Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
++DPRWGRGQETPGEDP S+Y + QG G + KG
Sbjct: 153 YKDPRWGRGQETPGEDPFHTSSYVNALITGLQG------------GLDDLPYKKGV---- 196
Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
A CKH YDLE RY F+A+I QD D + PPF+ C +
Sbjct: 197 ---------ATCKHFAGYDLESSDGAIRYGFDAIIKSQDLRDYYLPPFQQCARDSNVQSV 247
Query: 292 MCSYNQVNGVPACLRGDLFQK-ARNEWGF---KGYITSDCDAVATIFEYQNYTKTHEDSA 347
MCSYN +NGVP C L Q R WG+ ++TSDCDAV I++Y NYT T E SA
Sbjct: 248 MCSYNAMNGVPTCADDWLLQTLLREHWGWTEEDQWVTSDCDAVKNIWDYHNYTLTPEQSA 307
Query: 348 AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKY 407
A L AG D++CGT + SA D+G +DR+L ++ +RLG F+ P Y
Sbjct: 308 ADALNAGTDLDCGTFWPTYLGSAYDQGLYDISTLDRSLARRYASLVRLGYFD-PPSVQPY 366
Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
+L D+V T ++LAL AA GIVLLKND LPL+ N ++++A+IGPL N QM G
Sbjct: 367 RQLNWDNVSTPAAQQLALQAAEDGIVLLKND-GILPLSSN-ITNVALIGPLANATKQMQG 424
Query: 468 GYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLD 527
Y G +S L + K Y G D+ + F A+ A+ AD VI V G+D
Sbjct: 425 NYYGTAPYLRSPLIAAQNAGFKVTYVQGA-DIDSQNTTDFSAAISAAQSADLVIYVGGID 483
Query: 528 LTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILW 587
+ E E+ DR S+ P Q+SL+ +A S +I + G +D S +++ ++++LW
Sbjct: 484 NSIEAEEIDRTSISWPSSQLSLINQLANLSTPLIISQM--GCMIDSSSLLSNTGVNALLW 541
Query: 588 IGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYR 646
GYPG+ G A+ I+ G P GRLP+T YP ++ +V M DMN++ SR PGR+Y+
Sbjct: 542 AGYPGQDGGTAIFNILTGKTAPAGRLPITQYPSNYVNQVTMTDMNLQP--SRFNPGRTYK 599
Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEV 706
+Y G V+ +G+GL YT T A + S N + + + + ++
Sbjct: 600 WYNGEPVFEYGYGLQYT-------------TFDAKITPSSPNNTFEISELLANASNYKDL 646
Query: 707 TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSK-- 764
T + I+V+N G +V + F P+K L+ + R+H + G+
Sbjct: 647 TPFV----KIPITVSNTGTTTSDYVALFFLSGTFGPAPHPKKSLVAYTRLHDITGGANAT 702
Query: 765 -EISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
E+S + L+ N +G IL G++ ++V
Sbjct: 703 AEVSLNL---ASLARGNWNGDLILYPGDYKVVV 732
>gi|291167620|dbj|BAI82526.1| 1,4-beta-D-xylosidase [Aureobasidium pullulans var. melanogenum]
Length = 805
Score = 437 bits (1123), Expect = e-119, Method: Compositional matrix adjust.
Identities = 254/752 (33%), Positives = 389/752 (51%), Gaps = 56/752 (7%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
C+ S RAK+L++ T+ EK+ +N+ +PRLG+P Y+WW E+LHG+AS+ PGV
Sbjct: 44 CDKSADPVARAKALVAAFTVAEKLNLTGNNSPGVPRLGLPVYQWWQEALHGVASS-PGVT 102
Query: 115 FN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
FN G S TSFPQ ++ A+F+ +L ++ V+ EARA N G+AGL FW PNIN +
Sbjct: 103 FNATGQFDSATSFPQPILMGAAFDDALIQSVAEVVSTEARAFNNYGRAGLDFWTPNINPY 162
Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
RDPRWGRGQETPGEDP +S+Y + QG GE+ +
Sbjct: 163 RDPRWGRGQETPGEDPYHLSSYVHSLIMGLQG-----------------------GEDPE 199
Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
++A CKH YD+E W RY + I ++D + + P FRSC M
Sbjct: 200 I---RKITATCKHFAGYDIESWNGNLRYQNDVQIPQRDLVEYYLPSFRSCARDSNVGAFM 256
Query: 293 CSYNQVNGVPACLRGDLFQKA-RNEWGFKG---YITSDCDAVATIFEYQNYTKTHEDSAA 348
C+Y+ +NGVP C L R WG+ ++TSDCD++ IF N++ T + +AA
Sbjct: 257 CTYSALNGVPTCADPWLLNDVLREHWGWTNEEQWVTSDCDSIQNIFLPHNFSDTRQGAAA 316
Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
L AG D++CGT H A +G + + +D+AL+ L++ +R G F+G Y
Sbjct: 317 AALNAGTDLDCGTYYQHHLPLAYSQGLINQTTVDQALVRLYTSLVRTGYFDG--PNAMYR 374
Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
L DV T+ ++LAL AA +G+VLLKND LPL+ + + +A+IG N +QM G
Sbjct: 375 NLTWSDVGTTHAQQLALQAAEEGMVLLKND-GLLPLSISNGTKIALIGSWANATTQMQGN 433
Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDL 528
Y G+P S L + ++ YA G + + A+KAD +I + G+D+
Sbjct: 434 YYGVPTYLHSPLYAAQQTGAQVFYAQGPGGQGDPTTDHWLPVWTAAEKADIIIYIGGVDI 493
Query: 529 TQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWI 588
+ E E DR + G Q+ ++ +A K P++L G LD + ++ IS+++W
Sbjct: 494 SVEAEGMDREDINWTGAQLDIIGELAMYGK-PMVLAQM-GDQLDNTPIVNNANISALIWG 551
Query: 589 GYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRF 647
GYPG+ G AL II G P GRLP+T YP + +PM DM +R +++ PGR+Y++
Sbjct: 552 GYPGQDGGVALFNIITGKTAPAGRLPVTQYPAHYIADIPMTDMTLRPNATTGSPGRTYKW 611
Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVT 707
Y GT V+ FG+G+ YT +S +SL +G ++ +
Sbjct: 612 YNGTAVFEFGYGMHYTKFSADISPMSKSSYDISSLLSGCNETYKDRCAFE---------- 661
Query: 708 SCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEIS 767
+ ++V N G+V + + F P+K L+ + R+H +A GS + +
Sbjct: 662 -------SISVNVHNTGNVTSDYAALGFIAGQFGPSPYPKKSLVNYQRLHNIAGGSSQTA 714
Query: 768 FGVDPCEQLSIANKHGRRILPLGNHVLMVGEL 799
LS + HG L G++ LM+ +
Sbjct: 715 TLNLTLGSLSRVDDHGNTYLYPGDYALMIDTM 746
>gi|121712174|ref|XP_001273702.1| beta-xylosidase [Aspergillus clavatus NRRL 1]
gi|119401854|gb|EAW12276.1| beta-xylosidase [Aspergillus clavatus NRRL 1]
Length = 803
Score = 437 bits (1123), Expect = e-119, Method: Compositional matrix adjust.
Identities = 284/770 (36%), Positives = 407/770 (52%), Gaps = 73/770 (9%)
Query: 42 FP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWS 100
FP C C+TS ++TRA+SL+ ++ EK+ A +PRLG+PAY WWS
Sbjct: 61 FPDCTSGPLSKLAVCDTSRDVTTRAQSLVDAMSFAEKVNNTQYEAPGVPRLGLPAYNWWS 120
Query: 101 ESLHGIASNGPGVNF--NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
E+LHG+A PGV+F +G S TSF Q ++ ASF+ L + + V E RA N G
Sbjct: 121 EALHGVAG-APGVHFADSGPFSYATSFAQPILLGASFDDELVKQVATVVGTEGRAFGNAG 179
Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
+AGL +W PNIN FRDPRWGRGQETPGEDP+ VS Y V Q GGIG
Sbjct: 180 RAGLDYWTPNINPFRDPRWGRGQETPGEDPLHVSRYVYHLVDGLQ--------GGIGPAR 231
Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
+ ++A CKH AYD+E W SR+ F+A ++ QD + + P
Sbjct: 232 PQ------------------IAATCKHFAAYDMEDWNGVSRHEFDARVSTQDLAEFYLPS 273
Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFK---GYITSDCDAVATIF 334
F+SC+ + +MCSYN +NGVP C L Q R W + ++ SDC A+ I+
Sbjct: 274 FKSCVRDAQVDAVMCSYNALNGVPTCADPYLLQTLLREHWDWDQPGHWVVSDCGAIDDIY 333
Query: 335 EYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
NYTKT ++AA L AG D++CGT +H A ++G + +DRAL+ L+S ++
Sbjct: 334 IGHNYTKTGAEAAAVALNAGTDLDCGTVFPKHLGEAAEQGLYTNQTLDRALVRLYSSLVK 393
Query: 395 LGLFNGDP-RKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
LG F DP K YG +G DV T ++LA AA +GIVLLKND+ LPL A +LA
Sbjct: 394 LGYF--DPAEKQPYGSIGWKDVDTPAAEQLAHKAAVEGIVLLKNDQT-LPL--KAKGTLA 448
Query: 454 IIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRI 513
+IGP N QM G Y G P ++L + + Y+ G + +S AGF A+
Sbjct: 449 LIGPYANATKQMQGNYQGPPKYIRTLEWAATQHGYQVQYSPGTA-INNSSTAGFAAALAA 507
Query: 514 AKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
AK AD V+ G+D T E+E DR ++ PG Q+SL++ ++ K P+I++ GGG +D
Sbjct: 508 AKDADVVLYAGGIDNTIESETLDRTTITWPGNQLSLISELSNLHK-PLIVIQFGGGQVDD 566
Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNM 632
+ + ++++LW GYP + G A+ +I+ G P GRLP+T YP ++T +VPM +M +
Sbjct: 567 TPLLTNPHVNALLWAGYPSQEGGAAIFDILTGKAAPAGRLPITQYPAAYTAQVPMTEMGL 626
Query: 633 RADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQ 692
RA PGR+YR+Y V FG GL YT++ + +A+L +
Sbjct: 627 RAGGDN--PGRTYRWYD-KAVVPFGFGLHYTSFEVSWDRGRLGPYNTAALVNRAPG---- 679
Query: 693 QTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVS-QGTPEKQLI 751
GS +D D F VQ V N G V +V +LF + + P K L+
Sbjct: 680 --GSHVDRALFDT--------FRVQ--VQNTGTVTSDYVALLFVKTEDAGPEPYPLKTLV 727
Query: 752 GFDRVHTVAKGSK-----EISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
G+ RV V G + E++ G ++ +G +L G + L V
Sbjct: 728 GYTRVQQVKPGERRSVEIEVTLGA-----MARTAANGDLVLYPGKYTLQV 772
>gi|334187562|ref|NP_196532.2| Glycosyl hydrolase family protein [Arabidopsis thaliana]
gi|332004052|gb|AED91435.1| Glycosyl hydrolase family protein [Arabidopsis thaliana]
Length = 526
Score = 436 bits (1122), Expect = e-119, Method: Compositional matrix adjust.
Identities = 220/487 (45%), Positives = 313/487 (64%), Gaps = 10/487 (2%)
Query: 322 YITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDI 381
YI SDCD++ ++ Q+YTKT E++AA + AG+D+NCG+ + HT++A+ KG + E I
Sbjct: 45 YIVSDCDSLGILYGSQHYTKTPEEAAAKSILAGLDLNCGSFLGNHTENAVKKGLIDEAAI 104
Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF 441
++A+ N F+ +RLG F+G+P+ YG LGP DVCT E+++LA++ ARQGIVLLKN
Sbjct: 105 NKAISNNFATLMRLGFFDGNPKNQPYGGLGPKDVCTVENRELAVETARQGIVLLKNSAGS 164
Query: 442 LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPC 501
LPL+ +A+ +LA+IGP N M G Y G+ C + L+GLE V T Y GC +V C
Sbjct: 165 LPLSPSAIKTLAVIGPNANVTKTMIGNYEGVACKYTTPLQGLERTVLTTKYHRGCFNVTC 224
Query: 502 NSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPV 561
++A A +A AD ++V G D T E E DR+ L LPG+Q LVT VA+ ++ PV
Sbjct: 225 -TEADLDSAKTLAASADATVLVMGADQTIEKETLDRIDLNLPGKQQELVTQVAKAARGPV 283
Query: 562 ILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPES 621
+LV+ GG D++FA+ D +I+SI+W+GYPGEAG A+A++IFG NP G+LPMTWYP+S
Sbjct: 284 VLVIMSGGGFDITFAKNDEKITSIMWVGYPGEAGGIAIADVIFGRHNPSGKLPMTWYPQS 343
Query: 622 FT-KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISA 680
+ KVPM +MNMR D S Y GR+YRFY G VY FG GLSYTN+S++ + AP +
Sbjct: 344 YVEKVPMTNMNMRPDKSNGYLGRTYRFYIGETVYAFGDGLSYTNFSHQLIKAPKFV---- 399
Query: 681 SLKAGSDKNILQQTGSRLDYV--HIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARV 738
SL ++ LD + H ++ S F VQ+ V N GD +G+ V LF
Sbjct: 400 SLNLDESQSCRSPECQSLDAIGPHCEKAVGERS-DFEVQLKVRNVGDREGTETVFLFTTP 458
Query: 739 PKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
P+V G+P KQL+GF+++ K + F VD C+ L + ++ G+R L LG+H+L VG
Sbjct: 459 PEV-HGSPRKQLLGFEKIRLGKKEETVVRFKVDVCKDLGVVDEIGKRKLALGHHLLHVGS 517
Query: 799 LRHSLTI 805
L+HS I
Sbjct: 518 LKHSFNI 524
>gi|115387056|ref|XP_001210069.1| predicted protein [Aspergillus terreus NIH2624]
gi|114191067|gb|EAU32767.1| predicted protein [Aspergillus terreus NIH2624]
Length = 908
Score = 436 bits (1121), Expect = e-119, Method: Compositional matrix adjust.
Identities = 275/738 (37%), Positives = 396/738 (53%), Gaps = 70/738 (9%)
Query: 44 CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESL 103
C+ S+ C+TSLSI+ R SL+ LTL+EKI L D A+ RLG+P YEWW+E+
Sbjct: 152 CQTQPLCSHRVCDTSLSIAERVNSLVKSLTLEEKILNLVDAAAGSTRLGLPFYEWWNEAT 211
Query: 104 HGIASNGPGVNFN---GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA 160
HG+ S PGV F S TSFP ++ AASF+ +L I + E RA N G +
Sbjct: 212 HGVGS-APGVQFTSKPANFSYATSFPAPILIAASFDNALIRKIAEVIGKEGRAFANNGFS 270
Query: 161 GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFRE 220
G FWAPNIN FRDPRWGRGQETPGED V Y F+ QG++ K+
Sbjct: 271 GFDFWAPNINGFRDPRWGRGQETPGEDTFVAQNYIRNFIPGLQGDDPKN----------- 319
Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFR 280
K+V+ A CKH YDLE RY N T+QD D F PF+
Sbjct: 320 KQVI----------------ATCKHYAVYDLET----GRYGNNYNPTQQDLSDYFLAPFK 359
Query: 281 SCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKG---YITSDCDAVATIFEY 336
+C+ IMCSYN V+G+PAC L + R WGF Y+ SDC+AV I++Y
Sbjct: 360 TCVRDTDVGSIMCSYNSVSGIPACANEYLLDEVLRKHWGFNADYHYVVSDCNAVTDIWQY 419
Query: 337 QNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
N+T T E +AA L AG+D+ CG+ L+ +S + + K +D++L L+S +G
Sbjct: 420 HNFTDTEEAAAAVALNAGVDLECGSSYLKLNES-LAANQTSVKAMDQSLARLYSALFTIG 478
Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN-KNAVSSLAII 455
F+G GKY L DV + LA +AA +G+ LLKND LPL+ ++ S+A+I
Sbjct: 479 FFDG----GKYDHLDFSDVSIPAAQALAYEAAVEGMTLLKND-GLLPLHSQHKYKSVAVI 533
Query: 456 GPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVS----KTHYASGCHDVPCNSDAGFHEAV 511
GP N +QM GGY+G + L+ L A+ S K +YA G N+ GF ++
Sbjct: 534 GPFANATTQMQGGYSG---NAPYLISPLVAFESDHRWKVNYAVGTAINDQNT-TGFEASL 589
Query: 512 RIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPL 571
AKK+D ++ + G+D + E+E DR SL PG Q+ L+ S++ SK P+++V GGG +
Sbjct: 590 AAAKKSDLIVYLGGIDNSIESETIDRTSLAWPGNQLDLIKSLSNLSK-PMVVVQFGGGQV 648
Query: 572 DVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDM 630
D S + I +++W GYP ++G AL +I+ G +P GRLP+T YP S+ ++ + D+
Sbjct: 649 DDSALLENKDIQALIWAGYPSQSGGTALLDILVGKRSPAGRLPVTQYPASYADQINIFDI 708
Query: 631 NMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNI 690
N+R +S +PGR+Y++YTG V FGHGL YT + + + +L
Sbjct: 709 NLRPNSKDSHPGRTYKWYTGKPVIPFGHGLHYTKFKFGW---------EETLNREYSIQE 759
Query: 691 LQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEKQ 749
L + R I + T T+++ V+ N G +V +LF P K
Sbjct: 760 LVASCQRSSGGPIKDNTPFTTVKARVR----NVGHETSDYVSLLFLSSKNAGPAPRPNKS 815
Query: 750 LIGFDRVHTVAKGSKEIS 767
L+ + R+H +A GS ++
Sbjct: 816 LVSYKRLHNIAPGSDRVA 833
>gi|451849522|gb|EMD62825.1| glycoside hydrolase family 3 protein [Cochliobolus sativus ND90Pr]
Length = 849
Score = 434 bits (1117), Expect = e-119, Method: Compositional matrix adjust.
Identities = 264/741 (35%), Positives = 387/741 (52%), Gaps = 58/741 (7%)
Query: 64 RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF--NGTVSS 121
RAKSL++L TL+EKI S++A + RLGIP Y+WW+E LHGIA GP +F G S
Sbjct: 114 RAKSLVALYTLEEKINATSNSAPGVARLGIPPYQWWNEGLHGIA--GPFTSFAKQGDYSY 171
Query: 122 VTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQ 181
TSFPQ ++ A+F+ +L + + + ++ EARA N+ + GL FW PNIN FRDPRWGRGQ
Sbjct: 172 STSFPQPILMGAAFDDNLITEVANVISTEARAFNNVNRTGLDFWTPNINPFRDPRWGRGQ 231
Query: 182 ETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSA 241
ETPGED +S+Y + QG ++D +RV+ A
Sbjct: 232 ETPGEDSYHLSSYVKALIHGLQGN--ETDP--------YRRVV----------------A 265
Query: 242 CCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGV 301
CKH YD+E W RY + I++QD + + PF +C+ Q MCSYN VNG
Sbjct: 266 TCKHYAGYDIENWNGNLRYQNDVQISQQDLVEYYLAPFEACV-QANVGAFMCSYNAVNGA 324
Query: 302 PACLRGDLFQKA-RNEWGFKG---YITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDI 357
P C + Q R WG+ ++TSDCD++ ++ ++ T E +AA L AG D+
Sbjct: 325 PPCADPYMLQTVLREHWGWSSDEHWVTSDCDSIQNVYLPHQWSSTREGAAADSLNAGTDL 384
Query: 358 NCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCT 417
+CGT + H A+ +G E +D AL+ +S ++LG F+ P Y +LG D V T
Sbjct: 385 DCGTYLQSHLPGAVKQGLTNETTLDNALIRQYSSLIKLGYFD-IPENQPYRQLGFDAVAT 443
Query: 418 SEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPK 477
S + LAL AA +GIVLLKND LP+N + ++ I G N SQ+ G Y G+
Sbjct: 444 SASQALALKAAEEGIVLLKND-GVLPINFGS-KNVGIYGDWANATSQLQGNYFGVAKFLT 501
Query: 478 SLLRGLEAYVSKTHYASGCHDVPCNSDAG-FHEAVRIAKKADFVIVVAGLDLTQETEDRD 536
S LE YA + G + + +D I V G+D E+EDRD
Sbjct: 502 SPYMALEKLGVNVRYAGNLPGGQGDPTTGSWPRLSGVITTSDVHIWVGGMDNGIESEDRD 561
Query: 537 RVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGA 596
R L L G Q+ ++ +A T K PVI+++ GGG +D S + +IS++LW GYPG+ G
Sbjct: 562 RSWLTLTGSQLDVIGQLADTGK-PVIVIIMGGGQIDTSPLIKNPKISAVLWAGYPGQDGG 620
Query: 597 KALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
A+ I+ G P GRLP T Y + ++VPM DM MR S + PGR+Y++YTG ++
Sbjct: 621 TAIVNILTGKAAPAGRLPQTQYLYKYVSEVPMTDMAMRP--SNKNPGRTYKWYTGKPIFE 678
Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
FG+GL YTN+S + P + + L G + TG L+
Sbjct: 679 FGYGLHYTNFSASITNQPKQSYAISDLVKGCNS-----TGGFLERCPFT----------G 723
Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
+ +SV N G +V + F + P+K L+ +DR+ +A S +
Sbjct: 724 INVSVQNTGKTSSDYVTLGFLTGSFGPKPYPKKSLVAYDRLFNIAASSSSTATLNLTLAS 783
Query: 776 LSIANKHGRRILPLGNHVLMV 796
L+ ++ G ++L G++ L +
Sbjct: 784 LARVDESGNKVLYPGDYELQI 804
>gi|330934749|ref|XP_003304687.1| hypothetical protein PTT_17336 [Pyrenophora teres f. teres 0-1]
gi|311318569|gb|EFQ87188.1| hypothetical protein PTT_17336 [Pyrenophora teres f. teres 0-1]
Length = 798
Score = 434 bits (1115), Expect = e-118, Method: Compositional matrix adjust.
Identities = 273/768 (35%), Positives = 394/768 (51%), Gaps = 65/768 (8%)
Query: 42 FP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWS 100
FP C + C+ S S RAKSL++L TL+EKI S A +PRLG+P Y+WW+
Sbjct: 23 FPDCTNGPLKNVTICDPSASPLARAKSLVALYTLEEKINATSSGAPGVPRLGVPPYQWWN 82
Query: 101 ESLHGIASNGPGVNFNGT---VSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL 157
E LHGIA GP NF+ + S TSFPQ ++ A+F+ L + + ++ EARA N
Sbjct: 83 EGLHGIA--GPYTNFSHSGVEWSYSTSFPQPILMGAAFDDDLITEVAKVISTEARAFNNA 140
Query: 158 GQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFG 217
+ GL FW PNIN FRDPRWGRGQETPGED +S+Y + QGE +D
Sbjct: 141 NRTGLDFWTPNINPFRDPRWGRGQETPGEDAYHLSSYVQALIHGLQGE--ATDP------ 192
Query: 218 FREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQP 277
KRV+ A CKH YD+E W RY + IT+QD + +
Sbjct: 193 --YKRVV----------------ATCKHFAGYDVEDWNGNLRYQNDVQITQQDLVEYYLA 234
Query: 278 PFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGF---KGYITSDCDAVATI 333
PF++C+ Q MCSYN VNG P C L Q R WG+ + ++T DCDAV +
Sbjct: 235 PFQACV-QANVGAFMCSYNAVNGAPPCADPYLLQTILREHWGWNKEEQWVTGDCDAVQNV 293
Query: 334 FEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQL 393
+ ++ T +AA L AG DI CGT M H +A + + E +D AL+ +S +
Sbjct: 294 YFPHQWSSTRAGAAADSLVAGTDITCGTYMQEHLPAAFRQKLLNESSLDLALIRQYSSLV 353
Query: 394 RLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
RLG F+ P Y +LG D V T+ + LA AA +GIVLLKND LPL+ ++ ++
Sbjct: 354 RLGYFDA-PENQPYRQLGFDAVATNASQALARRAAAEGIVLLKNDGT-LPLSLDSSMTVG 411
Query: 454 IIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASG----CHDVPCNSDAGFHE 509
+ G N +Q+ G Y G+ S L L+ K +YA G D N + +
Sbjct: 412 LFGDWANATTQLLGNYAGVATYLHSPLYALKQTGVKINYAGGKPGGQGDPTTNRWSNLYG 471
Query: 510 AVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGG 569
A +D +I V G+D E E DR L G Q+ ++ +A T K PVI+V+TGGG
Sbjct: 472 AY---STSDVLIYVGGIDNGVEEEGHDRGYLTWTGPQLDVIGQLAETGK-PVIVVVTGGG 527
Query: 570 PLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMN 628
+D S + IS+I+W GYPG+ G A+ +II G P GRLP T YP S+ V M
Sbjct: 528 QIDSSPLVNNPNISAIMWAGYPGQDGGSAIIDIISGKTAPAGRLPQTQYPASYAAAVSMM 587
Query: 629 DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDK 688
+MN+R + PGR+Y++Y G+ V+ FG+G+ YTN+S + + +SL +G +
Sbjct: 588 NMNLRPGENN--PGRTYKWYNGSAVFEFGYGMHYTNFSAAISTQMQQSYAISSLASGCNS 645
Query: 689 NILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEK 748
TG L+ V + V N G V +V + + P K
Sbjct: 646 -----TGGFLERCPFAS----------VDVQVHNTGKVTSDYVTLGYMAGTFGPAPHPRK 690
Query: 749 QLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
L+ + R+H +A G+ + ++ +++G ++L G++ L +
Sbjct: 691 TLVSYKRLHNIAGGATSTAKLNLTLASVARVDEYGNKVLYPGHYSLQI 738
>gi|449303062|gb|EMC99070.1| glycoside hydrolase family 3 protein [Baudoinia compniacensis UAMH
10762]
Length = 786
Score = 432 bits (1110), Expect = e-118, Method: Compositional matrix adjust.
Identities = 274/772 (35%), Positives = 400/772 (51%), Gaps = 59/772 (7%)
Query: 42 FP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWS 100
FP C + P CN SLS RA +L+ L TL+E + A +PRLG+PAYE W+
Sbjct: 46 FPSCVNSTLSTTPVCNRSLSAWDRAHALVQLFTLEELANNTGNTAPGVPRLGLPAYEVWN 105
Query: 101 ESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA 160
E+LHGI+ N GT S TSFP ++S AS NR+L + IG ++ + RA N G+
Sbjct: 106 EALHGISHGHFATN--GTWSWATSFPSPILSMASMNRTLINQIGDIISTQGRAFSNAGRY 163
Query: 161 GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSA-YAVEFVKSFQGENWKSDDGGIGFGFR 219
GL +APNIN FR P WGRGQETPGED +S+ YA E++ QG +
Sbjct: 164 GLDSYAPNINGFRSPVWGRGQETPGEDAFFLSSLYAYEYITGMQGG-------------K 210
Query: 220 EKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPF 279
V K L A KH YD+E W N SR + IT+QD + P F
Sbjct: 211 APAVPK-------------LVAVPKHFAGYDIENWNNNSRLGLDVNITQQDLAGYYTPQF 257
Query: 280 RSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGF-KGYITSDCDAVATIFEYQ 337
RS I+ KA +MCSYN VNGVP+C Q AR+ WGF G+++SDCDAV ++
Sbjct: 258 RSAIQNAKALGLMCSYNAVNGVPSCSNSFFLQTLARDTWGFGNGFVSSDCDAVYNVYNPH 317
Query: 338 NYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
Y + A L+AG DI+CGT + A + G V DI+ AL +S + G
Sbjct: 318 GYAANTTGAVADSLRAGTDIDCGTSYPFYLVPAFNAGLVSRNDIELALTRYYSGLVMQGY 377
Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
F+G+ Y LG +DV T++ ++ +AA +GI LLKND LPL+K + S+A+IGP
Sbjct: 378 FDGN--SSLYRNLGWNDVLTTDAWNISYEAAVEGITLLKNDGT-LPLSK-STRSVALIGP 433
Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKA 517
N Q+ G Y S L+ A ++ +G + + +GF EA+ +A+++
Sbjct: 434 WANATLQLQGNYYAAAPYLISPLQAFRASGMTVNFVNGT-TISSTNTSGFAEAITLAQQS 492
Query: 518 DFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAE 577
D +I G+D + E E DR ++ PG Q+ L+ +++ K P++++ GGG +D S +
Sbjct: 493 DVIIYAGGIDNSIEAEGLDRQNITWPGNQLDLIYQLSQVGK-PLVVLQMGGGQVDSSALK 551
Query: 578 ADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSS 637
+S++++++W GYPG++G +AL +II G+ P GRL T YP S+ N +NM
Sbjct: 552 NNSKVNALVWGGYPGQSGGQALFDIIMGNRAPAGRLVTTQYPASY-ATSFNQLNMNMAPV 610
Query: 638 RQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSR 697
G++Y +YTGT VY FGHGL YTN++ P S+ A
Sbjct: 611 NGSLGQTYMWYTGTPVYPFGHGLFYTNFTTTSTMGPVTTYNLTSIFAAPHPG-------- 662
Query: 698 LDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVH 757
Y ++EV F V + A D G MLFA P K L+G DR
Sbjct: 663 --YEFVEEV-PIMDFNFIVNNTGRTASDWSG----MLFASTTSGPTPRPIKWLVGIDREA 715
Query: 758 TVAKGS-KEISFGVDPCEQLSIANKHGRRILPLGNHVLMV---GELRHSLTI 805
+ G ++ V P L+ A+ +G ++ G++ LM+ +R++ T+
Sbjct: 716 IIVPGGLASVTIKV-PVGALARADANGNLVVYPGSYSLMLNNEASIRYNFTL 766
>gi|292495285|sp|B6EY09.1|XYND_ASPJA RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|211970990|dbj|BAG82824.1| 1,4-beta-D-xylosidase [Aspergillus japonicus]
Length = 804
Score = 432 bits (1110), Expect = e-118, Method: Compositional matrix adjust.
Identities = 273/771 (35%), Positives = 401/771 (52%), Gaps = 81/771 (10%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
C+++ S RA +L+SL TL+E I + + +PRLG+P Y+ WSE+LHG+A N
Sbjct: 60 CDSTASPYDRAAALVSLFTLEELIANTGNTSPGVPRLGLPPYQVWSEALHGLAR----AN 115
Query: 115 F--NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
F NG S TSFP ++SAA+FNR+L + I S ++ + RA N G+ GL ++PNIN F
Sbjct: 116 FTDNGAYSWATSFPSPILSAAAFNRTLINQIASIISTQGRAFNNAGRFGLDVYSPNINTF 175
Query: 173 RDPRWGRGQETPGEDPMVVSA-YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
R P WGRGQETPGED ++A YA E++ QG G E
Sbjct: 176 RHPVWGRGQETPGEDAYTLTAAYAYEYITGIQG---------------------GVNPE- 213
Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
L L+A KH YD+E W N SR + IT+QD + + P F
Sbjct: 214 ----HLKLAATAKHFAGYDIENWDNHSRLGNDVNITQQDLAEYYTPQFLVAARDAHVHSF 269
Query: 292 MCSYNQVNGVPACLRGDLFQK-ARNEWGF--KGYITSDCDAVATIFEYQNYTKTHEDSAA 348
MCSYN VNGVP+C Q R+ + F GY++ DC AV +F Y +AA
Sbjct: 270 MCSYNAVNGVPSCSNTFFLQTLLRDTFSFVDHGYVSGDCGAVYGVFNPHGYAANEPSAAA 329
Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-Y 407
+ AG DI+CGT H +I G V DI+R + L++ + LG F+G+ Y
Sbjct: 330 DAILAGTDIDCGTSYQYHFNESITTGAVARDDIERGFIRLYANLVELGYFDGNSSSSNPY 389
Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPL---NKNAVSSLAIIGPLVNNISQ 464
LG DV ++ ++ +AA +GIVLLKND LPL ++ S+A+IGP N +Q
Sbjct: 390 RSLGWPDVQKTDAWNISYEAAVEGIVLLKNDGT-LPLASPSEGKNKSIALIGPWANATTQ 448
Query: 465 MGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVA 524
+ G Y G S + A HYA G ++ NS A F A+ A+ AD ++ +
Sbjct: 449 LQGNYYGDAPYLISPVDAFTAAGYTVHYAPGT-EISTNSTANFSAALSAARAADTIVFLG 507
Query: 525 GLDLTQETEDRDRVSLLLPGQQMSLVTSVA--RTSKRPVILVLTGGGPLDVSFAEADSQI 582
G+D T E E +DR S+ PG Q+ L++ +A ++ +P+++ GGG +D S ++++++
Sbjct: 508 GIDNTIEAEAQDRSSIAWPGNQLELISQLAAQKSDDQPLVVYQMGGGQVDSSALKSNAKV 567
Query: 583 SSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYP----ESFTKVPMNDMNMRADSSR 638
+++LW GYPG++G AL +I+ G P GRL T YP ESF+ + DMN+R + +
Sbjct: 568 NALLWGGYPGQSGGLALRDILTGARAPAGRLTTTQYPAAYAESFSAL---DMNLRPNETT 624
Query: 639 QYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDK---NILQQTG 695
Q PG++Y +YTG VY FGHGL YT ++ ++S +A K NI T
Sbjct: 625 QNPGQTYMWYTGEPVYAFGHGLFYTTFN------------ASSAQAAKTKYTFNITDLTS 672
Query: 696 SRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEKQLIGFD 754
+ H D T F+ S+TN+G D + +++A P K L+GFD
Sbjct: 673 A----AHPDTTTVGQRTLFNFTASITNSGQRDSDYTALVYANTSTAGPSPYPNKWLVGFD 728
Query: 755 RVHTVAK--GSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSL 803
R+ VAK G+ E++ + +A R+ GN VL G +L
Sbjct: 729 RLAAVAKEGGTAELN--------VPVAVDRLARVDEAGNTVLFPGRYEVAL 771
>gi|156062754|ref|XP_001597299.1| hypothetical protein SS1G_01493 [Sclerotinia sclerotiorum 1980]
gi|154696829|gb|EDN96567.1| hypothetical protein SS1G_01493 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 758
Score = 431 bits (1108), Expect = e-118, Method: Compositional matrix adjust.
Identities = 267/748 (35%), Positives = 386/748 (51%), Gaps = 58/748 (7%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
C+T+ TRA +L+SL TL EKI + + +PR+G+PAY+WW+E+LHGIA
Sbjct: 34 CDTTADPYTRATALVSLFTLAEKINNTGNTSPGVPRIGLPAYQWWNEALHGIAYGTHFAA 93
Query: 115 FNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRD 174
S TSFPQ ++ A+F+ +L ++ S ++ EARA N + GL FW PNIN ++D
Sbjct: 94 AGSNYSYATSFPQPILMGAAFDDALIHDVASQISTEARAFSNANRYGLNFWTPNINPYKD 153
Query: 175 PRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRG 234
PRWGRGQETPGEDP VS+Y V QG G + KG
Sbjct: 154 PRWGRGQETPGEDPFHVSSYVNALVTGLQG------------GLDDLPYKKGV------- 194
Query: 235 DELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCS 294
A CKH YDLE G RY+F+A+I QD D + P F+ C IMCS
Sbjct: 195 ------ATCKHYAGYDLENGGGIQRYAFDAIINSQDLRDYYLPSFQQCARDSNVQSIMCS 248
Query: 295 YNQVNGVPACLRGDLFQK-ARNEWGF---KGYITSDCDAVATIFEYQNYTKTHEDSAAGV 350
YN VNGVP C L Q R WG+ ++TSDCDAV I++ NYT T E +AA
Sbjct: 249 YNAVNGVPTCADDWLLQSLLREHWGWVEEDQWVTSDCDAVQNIWDSHNYTSTPEQAAADA 308
Query: 351 LKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-YGK 409
L AG D++CG + SA ++ +DR+L ++ +RLG F DP + Y +
Sbjct: 309 LNAGTDLDCGGFWPTYLGSAYNQSLYNISTLDRSLTRRYASLVRLGYF--DPASIQPYRQ 366
Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY 469
LG DV T ++LAL AA GIVLLKND LPL N ++++A+IGP N +QM G Y
Sbjct: 367 LGWSDVSTPSAEQLALQAAEDGIVLLKND-GILPLPSN-ITNVALIGPWANATTQMQGNY 424
Query: 470 TGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLT 529
G S L + Y G D+ + F A+ AKKAD +I + G+D +
Sbjct: 425 YGQAPYLHSPLIAAQNAGFHVTYVQGA-DIDSTNTTEFTAAIAAAKKADVIIYIGGIDNS 483
Query: 530 QETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIG 589
E E +DR ++ P Q+SLV +A S +I + G +D S + ++ I+W G
Sbjct: 484 IEAEAKDRKTIAWPSSQISLVNQLANLSIPLIISQM--GTMIDSSSLLTNRGVNGIIWAG 541
Query: 590 YPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRFY 648
YPG+ G A+ I+ G P GRLP+T YP + +V MN+MN+ ++ PGR+Y+++
Sbjct: 542 YPGQDGGTAIFNILTGKTAPAGRLPITQYPSDYVNEVSMNNMNLHPGANN--PGRTYKWF 599
Query: 649 TGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTS 708
GT ++ FG GL YT ++ K ++ PS T S ++ T + D
Sbjct: 600 NGTSIFDFGFGLHYTTFNAK-ITPPSSNTFEIS-------HLTSNTSTHKDLTPF----- 646
Query: 709 CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
+ IS++N G +V +LF P+K L+ + R+H + G+ +
Sbjct: 647 -----LTLPISISNTGTTTSDYVALLFLTGSFGPTPYPKKSLVAYTRLHDIKGGASSTAQ 701
Query: 769 GVDPCEQLSIANKHGRRILPLGNHVLMV 796
L+ N+ G +L G++ ++V
Sbjct: 702 LKLNLASLARGNEKGDLVLYPGDYKVVV 729
>gi|2791278|emb|CAA93248.1| beta-xylosidase [Trichoderma reesei]
gi|340519464|gb|EGR49702.1| glycoside hydrolase family 3 [Trichoderma reesei QM6a]
Length = 797
Score = 426 bits (1095), Expect = e-116, Method: Compositional matrix adjust.
Identities = 268/752 (35%), Positives = 395/752 (52%), Gaps = 63/752 (8%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
C++S RA++LISL TL+E I ++ +PRLG+P Y+ W+E+LHG+ N
Sbjct: 63 CDSSAGYVERAQALISLFTLEELILNTQNSGPGVPRLGLPNYQVWNEALHGLDR----AN 118
Query: 115 F---NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
F G TSFP +++ A+ NR+L I ++ +ARA N G+ GL +APN+N
Sbjct: 119 FATKGGQFEWATSFPMPILTTAALNRTLIHQIADIISTQARAFSNSGRYGLDVYAPNVNG 178
Query: 172 FRDPRWGRGQETPGEDPMVVS-AYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
FR P WGRGQETPGED +S AY E++ QG G E
Sbjct: 179 FRSPLWGRGQETPGEDAFFLSSAYTYEYITGIQG---------------------GVDPE 217
Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
L ++A KH YDLE W N SR F+A+IT+QD + + P F + K+
Sbjct: 218 -----HLKVAATVKHFAGYDLENWNNQSRLGFDAIITQQDLSEYYTPQFLAAARYAKSRS 272
Query: 291 IMCSYNQVNGVPACLRGDLFQK-ARNEWGFK--GYITSDCDAVATIFEYQNYTKTHEDSA 347
+MC+YN VNGVP+C Q R WGF GY++SDCDAV +F +Y +A
Sbjct: 273 LMCAYNSVNGVPSCANSFFLQTLLRESWGFPEWGYVSSDCDAVYNVFNPHDYASNQSSAA 332
Query: 348 AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKY 407
A L+AG DI+CG H + G+V +I+R++ L++ +RLG F+ +K +Y
Sbjct: 333 ASSLRAGTDIDCGQTYPWHLNESFVAGEVSRGEIERSVTRLYANLVRLGYFD---KKNQY 389
Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
LG DV ++ ++ +AA +GIVLLKND LPL+K V S+A+IGP N +QM G
Sbjct: 390 RSLGWKDVVKTDAWNISYEAAVEGIVLLKNDGT-LPLSKK-VRSIALIGPWANATTQMQG 447
Query: 468 GYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLD 527
Y G S L + ++ G ++ NS GF +A+ AKK+D +I + G+D
Sbjct: 448 NYYGPAPYLISPLEAAKKAGYHVNFELGT-EIAGNSTTGFAKAIAAAKKSDAIIYLGGID 506
Query: 528 LTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILW 587
T E E DR + PG Q+ L+ ++ K P++++ GGG +D S +++ +++S++W
Sbjct: 507 NTIEQEGADRTDIAWPGNQLDLIKQLSEVGK-PLVVLQMGGGQVDSSSLKSNKKVNSLVW 565
Query: 588 IGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYR 646
GYPG++G AL +I+ G P GRL T YP + + P NDMN+R D + PG++Y
Sbjct: 566 GGYPGQSGGVALFDILSGKRAPAGRLVTTQYPAEYVHQFPQNDMNLRPD-GKSNPGQTYI 624
Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEV 706
+YTG VY FG GL YT + S P L + T S L H
Sbjct: 625 WYTGKPVYEFGSGLFYTTFKETLASHPKSLKFN--------------TSSILSAPHPGYT 670
Query: 707 TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEKQLIGFDRVHTVAKG-SK 764
S F + ++ N+G + + MLF R P K L+GFDR+ + G S
Sbjct: 671 YSEQIPVFTFEANIKNSGKTESPYTAMLFVRTSNAGPAPYPNKWLVGFDRLADIKPGHSS 730
Query: 765 EISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
++S + P L+ + HG RI+ G + L +
Sbjct: 731 KLSIPI-PVSALARVDSHGNRIVYPGKYELAL 761
>gi|340519849|gb|EGR50086.1| glycoside hydrolase family 3 [Trichoderma reesei QM6a]
Length = 796
Score = 426 bits (1094), Expect = e-116, Method: Compositional matrix adjust.
Identities = 275/801 (34%), Positives = 425/801 (53%), Gaps = 81/801 (10%)
Query: 25 VNVIAFSNSKPVLN-KPDFP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLS 82
+N+ + SN P P FP C S C+T+ SI+ RA +++ +TL EK+ +
Sbjct: 43 INIYSISNPSPNPPLTPSFPDCTRDPLCSNDVCDTTKSIAERAAAIVKPMTLNEKVANVG 102
Query: 83 DNASAIPRLGIPAYEWWSESLHGIASNGPGVNFN----GTVSSVTSFPQVLVSAASFNRS 138
+AS RLG+PAY+W +E+LHG+A + GV F S+ TSFP ++ +A+F+ +
Sbjct: 103 SSASGSARLGLPAYQWQNEALHGVAGS-TGVQFQSPLGANFSAATSFPMPILLSAAFDDA 161
Query: 139 LWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEF 198
L ++ +A++ EARA N G AGL FW PNIN FRDPRWGRG ETPGED + Y +
Sbjct: 162 LVKSVATAISTEARAFANYGFAGLDFWTPNINPFRDPRWGRGMETPGEDAFRIQGYVLAL 221
Query: 199 VKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFS 258
V QG GI F R L + CKH AYD+E
Sbjct: 222 VDGLQG--------GIDPDFY--RTL----------------STCKHFAAYDIEN----G 251
Query: 259 RYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEW 317
R + N T+QD D + P F +C+ K + IMC+YN V+GVPAC L Q R+ +
Sbjct: 252 RTANNLSPTQQDMADYYLPMFETCVRDAKVASIMCAYNAVDGVPACADSYLLQDVLRDTY 311
Query: 318 GFK---GYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKG 374
GF Y+ SDCDAV +F+ +Y +AA + AG D++CG+ +++ G
Sbjct: 312 GFTEDFNYVVSDCDAVENVFDPHHYAANLTQAAAMSINAGTDLDCGSSY-NVLNASVQAG 370
Query: 375 KVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVL 434
E +D++L+ L+S +++G F+ + +Y LG +V T++ + LA DAA +G+ L
Sbjct: 371 LTTEATLDKSLIRLYSALVKVGYFD---QPAEYNSLGWGNVNTTQSQALAHDAATEGMTL 427
Query: 435 LKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYAS 494
LKND LPL++ +S++A+IGP N +QM G Y G + L + YA
Sbjct: 428 LKNDGT-LPLSR-TLSNVAVIGPWANVTTQMQGNYAGTAPLLVNPLSVFQQKWRNVKYAQ 485
Query: 495 GCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVA 554
G + +GF+ A+ A +D ++ + G+D++ E E DR S+ PG Q++L++ +A
Sbjct: 486 GTA-INSQDTSGFNAALSAASSSDVIVYLGGIDISVENEGFDRSSITWPGNQLNLISQLA 544
Query: 555 RTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLP 614
K P+++V GGG +D S ++S+++SILW GYPG+ G A+ +++ G P GRLP
Sbjct: 545 NLGK-PLVIVQFGGGQIDDSALLSNSKVNSILWAGYPGQDGGNAIFDVLTGANPPAGRLP 603
Query: 615 MTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAP 673
+T YP ++ + DMN+R S PGR+Y +YTGT V FG+GL YTN+S F S
Sbjct: 604 VTQYPANYVNNNNIQDMNLR--PSNGIPGRTYAWYTGTPVLPFGYGLHYTNFSLSFQSTK 661
Query: 674 SELTISASLKAGSD-KNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAG---DVDGS 729
+ AGSD ++ GS D + ++V N G ++
Sbjct: 662 T---------AGSDIATLVNNAGSNKDLATFATIV----------VNVKNTGGKANLASD 702
Query: 730 HVVMLFARVPKVSQGT-PEKQLIGFDRVHTVAKG-SKEISFGVDPCEQLSIANKHGRRIL 787
+V +LF + P KQL + RV V G +++++ V+ L+ A+ +G R +
Sbjct: 703 YVGLLFLKSTNAGPAPHPNKQLAAYGRVRNVGVGATQQLTLTVN-LGSLARADTNGDRWI 761
Query: 788 PLGNHVLMV---GELRHSLTI 805
G + L++ G L + T+
Sbjct: 762 YPGAYTLILDVNGPLTFNFTL 782
>gi|242771939|ref|XP_002477942.1| beta-xylosidase XylA [Talaromyces stipitatus ATCC 10500]
gi|218721561|gb|EED20979.1| beta-xylosidase XylA [Talaromyces stipitatus ATCC 10500]
Length = 797
Score = 425 bits (1092), Expect = e-116, Method: Compositional matrix adjust.
Identities = 272/758 (35%), Positives = 413/758 (54%), Gaps = 74/758 (9%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGI-----ASN 109
CNTS++ RA+ LISL TL+E I ++A +PRLG+P Y+ WSE LHG+ A +
Sbjct: 64 CNTSVNYVERAEGLISLFTLEELINNTQNSAPGVPRLGLPPYQVWSEGLHGLDRANWAKS 123
Query: 110 GPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNI 169
G + TSFP ++S A+ NR+L + I S +A +ARA N+G+ GL +APNI
Sbjct: 124 GEEWKW------ATSFPMPILSMAALNRTLINQIASIIATQARAFNNVGRYGLDAYAPNI 177
Query: 170 NIFRDPRWGRGQETPGEDP-MVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFG 228
N FR P WGRGQETPGED + S+YA E++ QG G
Sbjct: 178 NGFRSPLWGRGQETPGEDAGFLSSSYAYEYITGLQG---------------------GVD 216
Query: 229 EESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKA 288
E L + A KH YDLE W N SR F+A IT+QD + + P F + KA
Sbjct: 217 PE-----HLKIVATAKHFAGYDLENWNNNSRLGFDASITQQDLAEYYTPQFLAASRYAKA 271
Query: 289 SCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFK--GYITSDCDAVATIFEYQNYTKTHED 345
MCSYN VNGVP+C L Q R W F GY++SDCDA +F Y
Sbjct: 272 RSFMCSYNSVNGVPSCSSSFLLQTLLRENWDFPDYGYVSSDCDAAYNVFNPHGYAINISA 331
Query: 346 SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG 405
+AA L+AG DI+CG + + +G V +I+R+L+ L+S ++LG F+G+ +
Sbjct: 332 AAADSLRAGTDIDCGQTYPWYLNQSFIEGSVTRGEIERSLIRLYSNLVKLGYFDGN--QS 389
Query: 406 KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQM 465
+Y +LG +DV ++ ++ +AA +GIVLLKND LPL++ + S+A+IGP N Q+
Sbjct: 390 EYRQLGWNDVVATDAWNISYEAAVEGIVLLKND-GVLPLSEK-LKSVAVIGPWANATQQL 447
Query: 466 GGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG 525
G Y G + L+ K +YA G ++ N+ GF A+ AKK+D +I + G
Sbjct: 448 QGNYFGPAPYLITPLQAARDAGYKVNYAFGT-NILGNTTDGFAAALSAAKKSDVIIYLGG 506
Query: 526 LDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSI 585
+D T E E DR+++ PG Q+ L+ +++T K P++++ GGG +D S ++++ ++++
Sbjct: 507 IDNTIEAEGTDRMNVTWPGNQLDLIQQLSQTGK-PLVVLQMGGGQVDSSSLKSNNNVNAL 565
Query: 586 LWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRS 644
+W GYPG++G KA+ +I+ G P GRL T YP + T+ P DMN+R D + PG++
Sbjct: 566 VWGGYPGQSGGKAIFDILSGKRAPAGRLVTTQYPAEYATQFPATDMNLRPD-GKSNPGQT 624
Query: 645 YRFYTGTQVYGFGHGLSYTNY---SYKFLSAPSELT-ISASLKAGSDKNILQQTGSRLDY 700
Y +YTG VY FG+ L YT + + K S+ +++ I AS ++ S Y
Sbjct: 625 YIWYTGKPVYEFGYALFYTTFKETAEKLASSSFDISDIIASPRSSS-------------Y 671
Query: 701 VHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEKQLIGFDRVHTV 759
+ + V +V ++ N G + MLFA P K L+G+DR+ ++
Sbjct: 672 AYSELVPFV-----NVTATIKNTGKTASPYTAMLFANTTNAGPTPYPNKWLVGYDRLPSI 726
Query: 760 AKG-SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
G S E+ V P +S +++G RI+ G++ L +
Sbjct: 727 EPGKSTELVIPV-PIGAISRVDENGNRIVYPGDYQLAL 763
>gi|388857998|emb|CCF48443.1| related to Beta-xylosidase [Ustilago hordei]
Length = 782
Score = 423 bits (1087), Expect = e-115, Method: Compositional matrix adjust.
Identities = 252/772 (32%), Positives = 401/772 (51%), Gaps = 62/772 (8%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
C+ ++ TRA SL++ T +E + + A +PRLGIP Y+WW+E+LHG+A + PGV
Sbjct: 36 ICDPTIPFYTRATSLVNQFTTEELLNNTINYAPGVPRLGIPNYQWWTEALHGVAKS-PGV 94
Query: 114 NFN-----GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAP- 167
NF+ +S T FPQ + A+F+ L+ I S +A E RA N G+AGL ++P
Sbjct: 95 NFDLSDPHAEFTSATQFPQTINLGATFDDDLYQQIASVIASEVRAYNNAGKAGLNLYSPL 154
Query: 168 NINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGF 227
NIN FRDPRWGRGQET GEDP+ +S +AV V QG +
Sbjct: 155 NINCFRDPRWGRGQETVGEDPLHMSRFAVSIVHGLQGPH--------------------- 193
Query: 228 GEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGK 287
+ G++L ++A CKH +AYDLE++ RY F+A++++QD D P FR+C+ G
Sbjct: 194 AQNEAEGNKLTVAATCKHFLAYDLEQYDRGERYQFDAIVSKQDLSDFHLPQFRACVRDGG 253
Query: 288 ASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGF---KGYITSDCDAVATIFEYQNYTKTH 343
A+ +M SYN VN VP Q AR WG Y+TSDCDAVA +++ Y + +
Sbjct: 254 ATTLMTSYNAVNNVPPSASKYYLQTLARQAWGLDKTHNYVTSDCDAVANVYDGHRYAQNY 313
Query: 344 EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPR 403
++AA + AG D++CG + +A+ + I RA++ +++ +RLG F+ DP
Sbjct: 314 VEAAAKSINAGTDLDCGATYSENLGAALKQKLTDIATIRRAVIRMYASLVRLGYFD-DPA 372
Query: 404 KGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNIS 463
+L DV + ++LA +A I LLKN LP+ K + +AIIGP N +
Sbjct: 373 SQPLRQLTWKDVNSPSSQRLAYTSALSSITLLKNLDSTLPI-KQKPTKIAIIGPYTNVST 431
Query: 464 QMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVI 521
G Y G +++ +K + +G + +AV++ AD V+
Sbjct: 432 SFSGNYAGPAAFNMTMVHAASQVFPDAKIVWVNGTDISGPYIPSDAQDAVKLTSDADSVV 491
Query: 522 VVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKR----PVILVLTGGGPLDVSFAE 577
G+D + E E DR + P Q+ L+ ++++ K+ +++V GGG LD + +
Sbjct: 492 FAGGIDASIERESHDRKDIAWPPNQLRLIHELSQSRKKDKKSKLVVVQFGGGQLDGASLK 551
Query: 578 ADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK-VPMNDMNMRADS 636
+D + +++W GYPG++ + A+ +I+ G P GRLP+T YP S+ +P + M++R +
Sbjct: 552 SDDAVGALVWAGYPGQSASLAVWDILAGKAVPAGRLPVTQYPASYIDGLPESAMSLRPKA 611
Query: 637 SRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGS 696
YPGR+Y++Y G Y FGHGL YT +S L+ P I + A +
Sbjct: 612 G--YPGRTYKWYKGVPTYPFGHGLHYTTFSAS-LAKPQPYAIPTTPAAKGPEG------- 661
Query: 697 RLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRV 756
VH + ++ VQ ++ N G V + +LFAR P K L+G+ +V
Sbjct: 662 ----VHAEHISVA-----DVQANIKNTGKVASDYTALLFARHSNGPAPYPRKTLVGYTKV 712
Query: 757 HTVAKGSKEISFGVDPCE-QLSIANKHGRRILPLGNHVLMVGELRHSLTIET 807
++ G +E S + + L+ A++ G + L G++ L + H L T
Sbjct: 713 KNLSAG-EESSVTIKITQAALARADEEGNQFLYPGSYQLELDTEEHRLASTT 763
>gi|367046937|ref|XP_003653848.1| glycoside hydrolase family 3 protein [Thielavia terrestris NRRL
8126]
gi|347001111|gb|AEO67512.1| glycoside hydrolase family 3 protein [Thielavia terrestris NRRL
8126]
Length = 923
Score = 422 bits (1085), Expect = e-115, Method: Compositional matrix adjust.
Identities = 289/798 (36%), Positives = 410/798 (51%), Gaps = 81/798 (10%)
Query: 25 VNVIAFSNSKPV---LNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQL 81
+N+ SN P LNK C+ S P CNTSL I+ R + L+ LTLQEKI L
Sbjct: 132 MNLYNISNPAPTITPLNKFTPVCQTSPLCSSPACNTSLPIADRVRWLVGQLTLQEKITNL 191
Query: 82 SDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF---NGTV-SSVTSFPQVLVSAASFNR 137
D AS R+G+P YEWWSE+LHG+A++ PGV F NGT S TSFP + +A+F+
Sbjct: 192 VDGASGSARVGLPPYEWWSEALHGVAAS-PGVTFAGPNGTAFSYATSFPMPITISAAFDD 250
Query: 138 SLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVE 197
L S I + V E RA N G +G FW PNIN FRDPRWGRG ETPGED + Y
Sbjct: 251 DLVSQIAAVVGREGRAFANHGLSGFDFWTPNINPFRDPRWGRGPETPGEDAFRIQQYIRH 310
Query: 198 FVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNF 257
+ QG SD D+ ++ A CKH YD+E
Sbjct: 311 LIPGLQG--------------------------SDPLDKQII-ATCKHYAVYDVET---- 339
Query: 258 SRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNE 316
RY ++ D + + PF++C+ +MCSYN V+G+PAC L Q R+
Sbjct: 340 GRYEYDYDPQPHDLAEYYLAPFKTCVRDVGIGSVMCSYNAVDGIPACASEYLLQSVLRDH 399
Query: 317 WGFK---GYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDK 373
WGF Y+ SDCDAV I+ N+T + +AA L AG D+ CG+ L QS +
Sbjct: 400 WGFTEPYQYVVSDCDAVRFIYSPHNFTDSPAAAAAVALNAGTDLECGSTYLNLNQS-LAS 458
Query: 374 GKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIV 433
E +DRAL L++ +G F+G R YG LG D V T + + LA AA G V
Sbjct: 459 NMTTEAALDRALTRLYTALHTIGFFDGSAR---YGGLGWDAVGTGDAQVLAYQAAVDGAV 515
Query: 434 LLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVS----- 488
LLKN+K LPL+ + LA+IGP N +QM G Y G L+ L A+ S
Sbjct: 516 LLKNEKSLLPLDSKRLRKLAVIGPWANATTQMQGNYFG---QAAYLVSPLAAFQSAWGAD 572
Query: 489 KTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMS 548
+A+G + NS AGF A+ AK AD V+ + G+D + E+E DR ++ PG Q+
Sbjct: 573 NVLFANGT-GIAGNSTAGFAAALAAAKAADAVVFLGGVDNSVESESLDRTAISWPGNQLD 631
Query: 549 LVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFN 608
L+ +A K P+++V GGG LD S A+ ++ ++LW GYPG+AG A+A+++ G
Sbjct: 632 LIAQLAAVGK-PLVVVQCGGGQLDDSALLANPRVGALLWAGYPGQAGGAAIADLLTGKQA 690
Query: 609 PGGRLPMTWYPESFT-KVPMNDMNMR------ADSSRQYPGRSYRFYTGTQVYGFGHGLS 661
P GRLP+T Y S+T +V + D ++R + S +PGR+Y++YTG V FG+GL
Sbjct: 691 PAGRLPVTQYAASYTSEVSLFDPSLRPRRSGGSKSHSTFPGRTYKWYTGKPVLPFGYGLH 750
Query: 662 YTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVT 721
YT + + P + ++ S D + +V ++VT
Sbjct: 751 YTTFRTAWADEPRGRAYDIAGLFPANTTTTSSAFSAADTYPV----------LNVSVTVT 800
Query: 722 NAGDVDGSHVVMLFARVPKVSQGT-PEKQLIGFDRVHTVAKGSK---EISFGVDPCEQLS 777
N G +V +LF R P K L+G+ R +A GS E++ + L+
Sbjct: 801 NTGRGASDYVGLLFLRTRNAGPAPYPNKWLVGYARARGLAPGSSARLELAVALG---SLA 857
Query: 778 IANKHGRRILPLGNHVLM 795
A++ GRR++ G++ L+
Sbjct: 858 RADEDGRRVVYPGDYELL 875
>gi|429850127|gb|ELA25427.1| glycoside hydrolase family 3 protein [Colletotrichum
gloeosporioides Nara gc5]
Length = 918
Score = 422 bits (1084), Expect = e-115, Method: Compositional matrix adjust.
Identities = 270/791 (34%), Positives = 415/791 (52%), Gaps = 61/791 (7%)
Query: 18 VLQLIVVVNVIAFSNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEK 77
++++ ++N I + K N PD C S C+ SLS RA +L++ LT+ EK
Sbjct: 137 MIEVYEIINPII--DEKAPGNFPD--CTRDPLCSNDVCDESLSDKQRAAALVAELTIWEK 192
Query: 78 IQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFN--GTVSSVTSFPQVLVSAASF 135
+ L + A IPRL +P YEWWSE LHG+A + PG F G S TSFPQ ++ ++F
Sbjct: 193 LDNLVNEAPGIPRLRVPPYEWWSEGLHGVARS-PGTKFTSKGNFSYATSFPQPILLGSAF 251
Query: 136 NRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYA 195
+ L +G V+ EARA N G++GL ++PNIN F+DPRWGRGQETPGED + Y
Sbjct: 252 DDELVRAVGEVVSREARAFSNAGRSGLDLYSPNINAFKDPRWGRGQETPGEDTFHLQKYV 311
Query: 196 VEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG 255
+ +G++ +K+++ A CKH A D E +
Sbjct: 312 SAMLSGLEGDD------------PDKKLI----------------ATCKHYAANDFENYK 343
Query: 256 NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-R 314
R FNA+I+ QD + + PPF++C + MCSYN +NG P C L + R
Sbjct: 344 GVDRSGFNAVISTQDLSEYYLPPFKTCAVEKNVGSFMCSYNGINGTPLCANSYLIEDILR 403
Query: 315 NEWGFKG---YITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHT-QSA 370
WG+ G Y+++DCD VA + Y +Y +AA ++AG D+ C QSA
Sbjct: 404 KHWGWNGDGQYVSTDCDCVALMVSYHHYAPDLGHAAAWSMQAGTDLECNAFPGSEALQSA 463
Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQ 430
++ + EKD+D+AL +++ + +GLF+ D RK LG D+V T E + LA AA +
Sbjct: 464 WNQSLISEKDVDKALTRMYTSLVSVGLFDLD-RKDPLRSLGWDEVNTKEAQDLAYRAAVE 522
Query: 431 GIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIP---CSPKSLLRGLEAYV 487
G VL+KND LPL+ ++ A+IGP V+ +QM G Y G SP+ + L +
Sbjct: 523 GAVLMKND-GILPLSPDSSKKYALIGPWVSATTQMQGNYFGPAPYLISPRKAAKDLG--L 579
Query: 488 SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQM 547
T++ + SD+ F +A++ A+ AD VI + G+D T E E DR +L P Q+
Sbjct: 580 DFTYFLGSRTN---KSDSSFAQAIKAAQAADVVIFMGGVDNTLEQETLDRNTLAWPEPQL 636
Query: 548 SLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDF 607
L+ +++ K P++++ GGG +D + A+ +++ILW GYPG++G KA+ +I+FG
Sbjct: 637 QLLRALSEVGK-PLVVLQFGGGQVDDTELLANDSVNAILWGGYPGQSGGKAILDIVFGRA 695
Query: 608 NPGGRLPMTWYPESFTK-VPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYS 666
P GRL +T YP S+ VP DMN+R GR+YR+YTG +G GL YT +S
Sbjct: 696 APAGRLSVTQYPASYNDAVPATDMNLRPGPGNSGLGRTYRWYTGETPVPYGFGLHYTKFS 755
Query: 667 YKFLSAPSELTIS-ASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGD 725
A + I A + A ++ + + S + VT V +S N G+
Sbjct: 756 VDMKPASNVHNIDIAQMAAEANDDAASEIPSWQRGLERRMVT--------VTVSAKNEGN 807
Query: 726 VDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRR 785
V +V ++F R + P+K L+G+ R+ + G + + EQL ++ G R
Sbjct: 808 VISDYVALVFLRSEAGPKPWPQKTLVGYTRLRNIKPGEERKEEIIIKMEQLVRVDEVGNR 867
Query: 786 ILPLGNHVLMV 796
+L G + L +
Sbjct: 868 VLYEGLYSLFL 878
>gi|358382857|gb|EHK20527.1| hypothetical protein TRIVIDRAFT_192759 [Trichoderma virens Gv29-8]
Length = 860
Score = 422 bits (1084), Expect = e-115, Method: Compositional matrix adjust.
Identities = 275/803 (34%), Positives = 424/803 (52%), Gaps = 83/803 (10%)
Query: 24 VVNVIAFSNSKPVLN-KPDFP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQL 81
++N+ + +N P P FP C S C+T+ SI+ RA +++ +TL EK+ +
Sbjct: 106 MINIYSITNPSPNPPLTPSFPDCTRDPLCSNDVCDTTKSIAARAAAIVKPMTLNEKVANV 165
Query: 82 SDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTV----SSVTSFPQVLVSAASFNR 137
+AS RLG+PAY+W +E+LHG+A + GV F + S+ TSFP ++ +A+F+
Sbjct: 166 GSSASGSGRLGLPAYQWQNEALHGVAGS-TGVQFQSPLGANFSAATSFPMPILLSAAFDD 224
Query: 138 SLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVE 197
+L ++ +A++ EARA N G AGL FW PNIN FRDPRWGRG ETPGED + Y +
Sbjct: 225 ALVQSVATAISTEARAFANYGFAGLDFWTPNINPFRDPRWGRGMETPGEDAFRIQGYVLS 284
Query: 198 FVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNF 257
+ QG GI F + CKH AYD+E
Sbjct: 285 LINGLQG--------GIDPDF------------------FRTISTCKHFAAYDIEN---- 314
Query: 258 SRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNE 316
R + N T+QD D + P F +C+ K IMC+YN VNGVPAC L Q R+
Sbjct: 315 GRTANNLSPTQQDMADYYLPMFETCVRDAKVGSIMCAYNSVNGVPACADSYLLQSVLRDG 374
Query: 317 WGFK---GYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDK 373
+GF Y+ SDCDAV +++ +Y +AA L AG D++CG+ +++
Sbjct: 375 YGFTEDFNYVVSDCDAVENVYDPHHYAANLTQAAAMSLNAGTDLDCGSSY-NVLNASVQA 433
Query: 374 GKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIV 433
G E +D++L+ L+S +++G F+ + KY LG +V T++ + LA DAA G+
Sbjct: 434 GMTTEATLDKSLIRLYSALIKVGWFD---QPAKYSSLGWGNVNTTQTRALAHDAATGGMT 490
Query: 434 LLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYA 493
LLKND LPL+ + ++A+IGP VN +Q+ G Y G + L + YA
Sbjct: 491 LLKNDGT-LPLSP-TLQNVAVIGPWVNATTQLQGNYAGTAPVLVNPLTVFQQKWRNVKYA 548
Query: 494 SGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSV 553
G + +GF+ A+ A +D ++ + G+D++ E E DR ++ PG Q+SL++ +
Sbjct: 549 QGTA-INSQDTSGFNAAISAASSSDVIVYLGGIDISVENEGFDRTAITWPGNQLSLISQL 607
Query: 554 ARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRL 613
A K P+++V GGG +D S ++S+++SILW GYPG+ G AL +++ G P GRL
Sbjct: 608 ANLGK-PLVIVQFGGGQIDDSSLLSNSKVNSILWAGYPGQEGGNALFDVLTGANPPAGRL 666
Query: 614 PMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSA 672
P+T YP ++ + DMN+R S PGR+Y +YTGT V FG+GL YTN+S F S
Sbjct: 667 PITQYPANYVNNNNIQDMNLRPSGS--IPGRTYAWYTGTPVLPFGYGLHYTNFSVSFQST 724
Query: 673 PSELTISASL--KAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAG---DVD 727
+ T A++ AGS+K D T T + ++V N G ++
Sbjct: 725 KTSGTDVATIVNNAGSNK---------------DRATFAT-----LVVNVKNTGGKANLA 764
Query: 728 GSHVVMLFARVPKVSQGT-PEKQLIGFDRVHTVAKG-SKEISFGVDPCEQLSIANKHGRR 785
+V +LF + P KQL + RV V G +++++ V+ L+ A+ +G R
Sbjct: 765 SDYVGLLFLKSTNAGPAPHPNKQLAAYGRVKKVGVGATQQLTLTVN-LGSLARADTNGDR 823
Query: 786 ILPLGNHVLMV---GELRHSLTI 805
+ G + L + G L + T+
Sbjct: 824 WVYPGAYTLTLDVNGPLTFNFTL 846
>gi|292495281|sp|C0STH4.1|XYND_ASPAC RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|225878711|dbj|BAH30675.1| beta-xylosidase [Aspergillus aculeatus]
Length = 805
Score = 421 bits (1082), Expect = e-115, Method: Compositional matrix adjust.
Identities = 270/772 (34%), Positives = 399/772 (51%), Gaps = 82/772 (10%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
C+++ S RA +L+SL TL+E I + + +PRLG+P Y+ WSE+LHG+ N
Sbjct: 60 CDSTASPYDRAAALVSLFTLEELIANTGNTSPGVPRLGLPPYQVWSEALHGLGR----AN 115
Query: 115 F--NGTV-SSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
F NG + + SFP ++SAA+FNR+L + I S ++ + RA N G+ GL ++PNIN
Sbjct: 116 FTDNGALHAGRPSFPSPILSAAAFNRTLINQIASIISTQGRAFNNAGRFGLDVYSPNINT 175
Query: 172 FRDPRWGRGQETPGEDPMVVSA-YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
FR P WGRGQETPGED ++A YA E++ QG G E
Sbjct: 176 FRHPVWGRGQETPGEDAYTLTAAYAYEYITGIQG---------------------GVNPE 214
Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
L L+A KH YD+E W N SR + IT+QD + + P F
Sbjct: 215 -----HLKLAATAKHFAGYDIENWDNHSRLGNDVNITQQDLAEYYTPQFLVAARDAHVHS 269
Query: 291 IMCSYNQVNGVPACLRGDLFQK-ARNEWGF--KGYITSDCDAVATIFEYQNYTKTHEDSA 347
MCSYN VNGVP+C Q R+ + F GY++ DC AV +F Y +A
Sbjct: 270 FMCSYNAVNGVPSCSNTFFLQTLLRDTFSFVDHGYVSGDCGAVYGVFNPHGYAANEPSAA 329
Query: 348 AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK- 406
A + AG DI+CGT H +I G V DI+R + L++ + LG F+G+
Sbjct: 330 ADAILAGTDIDCGTSYQYHFNESITTGAVARDDIERGFIRLYANLVELGYFDGNSSSSNP 389
Query: 407 YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPL---NKNAVSSLAIIGPLVNNIS 463
Y LG DV ++ ++ +AA +GIVLLKND LPL ++ S+A+IGP N +
Sbjct: 390 YRSLGWPDVQKTDAWNISYEAAVEGIVLLKNDGT-LPLASPSEGKNKSIALIGPWANATT 448
Query: 464 QMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVV 523
Q+ G Y G S + A HYA G ++ NS A F A+ A+ AD ++ +
Sbjct: 449 QLQGNYYGDAPYLISPVDAFTAAGYTVHYAPGT-EISTNSTANFSAALSAARAADTIVFL 507
Query: 524 AGLDLTQETEDRDRVSLLLPGQQMSLVTSVA--RTSKRPVILVLTGGGPLDVSFAEADSQ 581
G+D T E E +DR S+ PG Q+ L++ +A ++ +P+++ GGG +D S + +++
Sbjct: 508 GGIDNTIEAEAQDRSSIAWPGNQLELISQLAAQKSDDQPLVVYQMGGGQVDSSSLKFNAK 567
Query: 582 ISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYP----ESFTKVPMNDMNMRADSS 637
++++LW GYPG++G AL +I+ G P GRL T YP ESF+ + DMN+R + +
Sbjct: 568 VNALLWGGYPGQSGGLALRDILTGARAPAGRLTTTQYPAAYAESFSAL---DMNLRPNET 624
Query: 638 RQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDK---NILQQT 694
Q PG++Y +YTG VY FGHGL YT ++ ++S +A K NI T
Sbjct: 625 TQNPGQTYMWYTGEPVYAFGHGLFYTTFN------------ASSAQAAKTKYTFNITDLT 672
Query: 695 GSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEKQLIGF 753
+ H D T F+ S+TN+G D + +++A P K L+GF
Sbjct: 673 SA----AHPDTTTVGQRTLFNFTASITNSGQRDSDYTALVYANTSTAGPSPYPNKWLVGF 728
Query: 754 DRVHTVAK--GSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSL 803
DR+ VAK G+ E++ + +A R+ GN VL G +L
Sbjct: 729 DRLAAVAKEGGTAELN--------VPVAVDRLARVDEAGNTVLFPGRYEVAL 772
>gi|70996610|ref|XP_753060.1| beta-xylosidase XylA [Aspergillus fumigatus Af293]
gi|74672055|sp|Q4WRB0.1|XYND_ASPFU RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|66850695|gb|EAL91022.1| beta-xylosidase XylA [Aspergillus fumigatus Af293]
Length = 792
Score = 420 bits (1079), Expect = e-114, Method: Compositional matrix adjust.
Identities = 260/758 (34%), Positives = 397/758 (52%), Gaps = 68/758 (8%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
C+TS RA +L+S+ T +E + + + +PRLG+P Y+ WSE+LHG+ N
Sbjct: 63 CDTSARPHDRAAALVSMFTFEELVNNTGNTSPGVPRLGLPPYQVWSEALHGLDR----AN 118
Query: 115 FN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
F G S TSFP +++ ++ NR+L + I + +A + RA N+G+ GL +APNIN F
Sbjct: 119 FTDEGEYSWATSFPMPILTMSALNRTLINQIATIIATQGRAFNNVGRYGLDVYAPNINAF 178
Query: 173 RDPRWGRGQETPGEDPM-VVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
R WGRGQETPGED + SAYA E++ QG G E
Sbjct: 179 RSAMWGRGQETPGEDAYCLASAYAYEYITGIQG---------------------GVDPE- 216
Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
L L A KH YDLE W SR + IT+Q+ + + P F K +
Sbjct: 217 ----HLKLVATAKHYAGYDLENWDGHSRLGNDMNITQQELSEYYTPQFLVAARDAKVHSV 272
Query: 292 MCSYNQVNGVPACLRGDLFQK-ARNEWGF--KGYITSDCDAVATIFEYQNYTKTHEDSAA 348
MCSYN VNGVP+C Q R+ +GF GY++SDCD+ ++ + +AA
Sbjct: 273 MCSYNAVNGVPSCANSFFLQTLLRDTFGFVEDGYVSSDCDSAYNVWNPHEFAANITGAAA 332
Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
++AG DI+CGT + A D+ +V +I+R ++ L+S +RLG F+G+ Y
Sbjct: 333 DSIRAGTDIDCGTTYQYYFGEAFDEQEVTRAEIERGVIRLYSNLVRLGYFDGN--GSVYR 390
Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
L +DV T++ ++ +AA +GIVLLKND LPL K +V S+A+IGP +N +Q+ G
Sbjct: 391 DLTWNDVVTTDAWNISYEAAVEGIVLLKNDGT-LPLAK-SVRSVALIGPWMNVTTQLQGN 448
Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDL 528
Y G S L + +YA G ++ +S GF EA+ AKK+D +I G+D
Sbjct: 449 YFGPAPYLISPLNAFQNSDFDVNYAFGT-NISSHSTDGFSEALSAAKKSDVIIFAGGIDN 507
Query: 529 TQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWI 588
T E E DR+++ PG Q+ L+ +++ K P+I++ GGG +D S +++ ++S++W
Sbjct: 508 TLEAEAMDRMNITWPGNQLQLIDQLSQLGK-PLIVLQMGGGQVDSSSLKSNKNVNSLIWG 566
Query: 589 GYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRF 647
GYPG++G +AL +II G P GRL +T YP + T+ P DM++R + PG++Y +
Sbjct: 567 GYPGQSGGQALLDIITGKRAPAGRLVVTQYPAEYATQFPATDMSLRPHGNN--PGQTYMW 624
Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKA-GSDKNILQ----QTGSRLDYVH 702
YTGT VY FGHGL YT T ASL G DK T + +
Sbjct: 625 YTGTPVYEFGHGLFYT-------------TFHASLPGTGKDKTSFNIQDLLTQPHPGFAN 671
Query: 703 IDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG 762
++++ + +++TN G V + MLFA P K L+GFDR+ ++
Sbjct: 672 VEQMP-----LLNFTVTITNTGKVASDYTAMLFANTTAGPAPYPNKWLVGFDRLASLEPH 726
Query: 763 SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELR 800
+ + ++ ++ G R+L G + L + R
Sbjct: 727 RSQTMTIPVTIDSVARTDEAGNRVLYPGKYELALNNER 764
>gi|76160898|gb|ABA40420.1| Xld [Aspergillus fumigatus]
Length = 792
Score = 420 bits (1079), Expect = e-114, Method: Compositional matrix adjust.
Identities = 260/758 (34%), Positives = 397/758 (52%), Gaps = 68/758 (8%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
C+TS RA +L+S+ T +E + + + +PRLG+P Y+ WSE+LHG+ N
Sbjct: 63 CDTSARPHDRAAALVSMFTFEELVNNTGNTSPGVPRLGLPPYQVWSEALHGLDR----AN 118
Query: 115 FN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
F G S TSFP +++ ++ NR+L + I + +A + RA N+G+ GL +APNIN F
Sbjct: 119 FTDEGEYSWATSFPMPILTMSALNRTLINQIATIIATQGRAFNNVGRYGLDVYAPNINAF 178
Query: 173 RDPRWGRGQETPGEDPM-VVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
R WGRGQETPGED + SAYA E++ QG G E
Sbjct: 179 RSAMWGRGQETPGEDAYCLASAYAYEYITGIQG---------------------GVDPE- 216
Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
L L A KH YDLE W SR + IT+Q+ + + P F K +
Sbjct: 217 ----HLKLVATAKHYAGYDLENWDGHSRLGNDMNITQQELSEYYTPQFLVAARDAKVHSV 272
Query: 292 MCSYNQVNGVPACLRGDLFQK-ARNEWGF--KGYITSDCDAVATIFEYQNYTKTHEDSAA 348
MCSYN VNGVP+C Q R+ +GF GY++SDCD+ ++ + +AA
Sbjct: 273 MCSYNAVNGVPSCANSFFLQTLLRDTFGFVEDGYVSSDCDSAYNVWNPHEFAANITGAAA 332
Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
++AG DI+CGT + A D+ +V +I+R ++ L+S +RLG F+G+ Y
Sbjct: 333 DSIRAGTDIDCGTTYQYYFGEAFDEQEVTRAEIERGVIRLYSNLVRLGYFDGN--GSVYR 390
Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
L +DV T++ ++ +AA +GIVLLKND LPL K +V S+A+IGP +N +Q+ G
Sbjct: 391 DLTWNDVVTTDAWNISYEAAVEGIVLLKNDGT-LPLAK-SVRSVALIGPWMNVTTQLQGN 448
Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDL 528
Y G S L + +YA G ++ +S GF EA+ AKK+D +I G+D
Sbjct: 449 YFGPAPYLISPLNAFQNSDFDVNYAFGT-NISSHSTDGFSEALSAAKKSDVIIFAGGIDN 507
Query: 529 TQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWI 588
T E E DR+++ PG Q+ L+ +++ K P+I++ GGG +D S +++ ++S++W
Sbjct: 508 TLEAEAMDRMNITWPGNQLQLIDQLSQLGK-PLIVLQMGGGQVDSSSLKSNKNVNSLIWG 566
Query: 589 GYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRF 647
GYPG++G +AL +II G P GRL +T YP + T+ P DM++R + PG++Y +
Sbjct: 567 GYPGQSGGQALLDIITGKRAPAGRLVVTQYPAEYATQFPATDMSLRPHGNN--PGQTYMW 624
Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKA-GSDKNILQ----QTGSRLDYVH 702
YTGT VY FGHGL YT T ASL G DK T + +
Sbjct: 625 YTGTPVYEFGHGLFYT-------------TFHASLPGTGKDKTSFNIQDLLTQPHPGFAN 671
Query: 703 IDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG 762
++++ + +++TN G V + MLFA P K L+GFDR+ ++
Sbjct: 672 VEQMP-----LLNFTVTITNTGKVASDYTAMLFANTTAGPAPYPNKWLVGFDRLASLEPH 726
Query: 763 SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELR 800
+ + ++ ++ G R+L G + L + R
Sbjct: 727 RSQTMTIPVTIDSVARTDEAGNRVLYPGKYELALNNER 764
>gi|292495282|sp|B0XP71.1|XYND_ASPFC RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|159131796|gb|EDP56909.1| beta-xylosidase XylA [Aspergillus fumigatus A1163]
Length = 792
Score = 420 bits (1079), Expect = e-114, Method: Compositional matrix adjust.
Identities = 260/758 (34%), Positives = 397/758 (52%), Gaps = 68/758 (8%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
C+TS RA +L+S+ T +E + + + +PRLG+P Y+ WSE+LHG+ N
Sbjct: 63 CDTSARPHDRAAALVSMFTFEELVNNTGNTSPGVPRLGLPPYQVWSEALHGLDR----AN 118
Query: 115 FN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
F G S TSFP +++ ++ NR+L + I + +A + RA N+G+ GL +APNIN F
Sbjct: 119 FTDEGEYSWATSFPMPILTMSALNRTLINQIATIIATQGRAFNNVGRYGLDVYAPNINAF 178
Query: 173 RDPRWGRGQETPGEDPM-VVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
R WGRGQETPGED + SAYA E++ QG G E
Sbjct: 179 RSAMWGRGQETPGEDAYCLASAYAYEYITGIQG---------------------GVDPE- 216
Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
L L A KH YDLE W SR + IT+Q+ + + P F K +
Sbjct: 217 ----HLKLVATAKHYAGYDLENWDGHSRLGNDMNITQQELSEYYTPQFLVAARDAKVHSV 272
Query: 292 MCSYNQVNGVPACLRGDLFQK-ARNEWGF--KGYITSDCDAVATIFEYQNYTKTHEDSAA 348
MCSYN VNGVP+C Q R+ +GF GY++SDCD+ ++ + +AA
Sbjct: 273 MCSYNAVNGVPSCANSFFLQTLLRDTFGFVEDGYVSSDCDSAYNVWNPHEFAANITGAAA 332
Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
++AG DI+CGT + A D+ +V +I+R ++ L+S +RLG F+G+ Y
Sbjct: 333 DSIRAGTDIDCGTTYQYYFGEAFDEQEVTRAEIERGVIRLYSNLVRLGYFDGN--GSVYR 390
Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
L +DV T++ ++ +AA +GIVLLKND LPL K +V S+A+IGP +N +Q+ G
Sbjct: 391 DLTWNDVVTTDAWNISYEAAVEGIVLLKNDGT-LPLAK-SVRSVALIGPWMNVTTQLQGN 448
Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDL 528
Y G S L + +YA G ++ +S GF EA+ AKK+D +I G+D
Sbjct: 449 YFGPAPYLISPLNAFQNSDFDVNYAFGT-NISSHSTDGFSEALSAAKKSDVIIFAGGIDN 507
Query: 529 TQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWI 588
T E E DR+++ PG Q+ L+ +++ K P+I++ GGG +D S +++ ++S++W
Sbjct: 508 TLEAEAMDRMNITWPGNQLQLIDQLSQLGK-PLIVLQMGGGQVDSSSLKSNKNVNSLIWG 566
Query: 589 GYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRF 647
GYPG++G +AL +II G P GRL +T YP + T+ P DM++R + PG++Y +
Sbjct: 567 GYPGQSGGQALLDIITGKRAPAGRLVVTQYPAEYATQFPATDMSLRPHGNN--PGQTYMW 624
Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKA-GSDKNILQ----QTGSRLDYVH 702
YTGT VY FGHGL YT T ASL G DK T + +
Sbjct: 625 YTGTPVYEFGHGLFYT-------------TFHASLPGTGKDKTSFNIQDLLTQPHPGFAN 671
Query: 703 IDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG 762
++++ + +++TN G V + MLFA P K L+GFDR+ ++
Sbjct: 672 VEQMP-----LLNFTVTITNTGKVASDYTAMLFANTTAGPAPYPNKWLVGFDRLASLEPH 726
Query: 763 SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELR 800
+ + ++ ++ G R+L G + L + R
Sbjct: 727 RSQTMTIPVTIDSVARTDEAGNRVLYPGKYELALNNER 764
>gi|442803736|ref|YP_007371885.1| beta-xylosidase BxlB [Clostridium stercorarium subsp. stercorarium
DSM 8532]
gi|442739586|gb|AGC67275.1| beta-xylosidase BxlB [Clostridium stercorarium subsp. stercorarium
DSM 8532]
Length = 715
Score = 420 bits (1079), Expect = e-114, Method: Compositional matrix adjust.
Identities = 268/774 (34%), Positives = 397/774 (51%), Gaps = 119/774 (15%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ + S S RAK L+S +T++EK+ Q+ N+ AI RLGIPAY WW+E+LHG+A G
Sbjct: 7 YLDPSYSFEERAKDLVSRMTIEEKVSQMLYNSPAIERLGIPAYNWWNEALHGVARAG--- 63
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
+ T FPQ + AA+F+ L + ++ E RA Y+ GLTFW
Sbjct: 64 -------TATMFPQAIGMAATFDEELIYKVADVISTEGRAKYHASSKKGDRGIYKGLTFW 116
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
+PNINIFRDPRWGRGQET GEDP + + V FVK QG + K
Sbjct: 117 SPNINIFRDPRWGRGQETYGEDPYLTARLGVAFVKGLQGNHPKY---------------- 160
Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFS-RYSFNAMITEQDTEDTFQPPFRSCIE 284
L +AC KH + G S R+ FNA+++++D +T+ P F++ ++
Sbjct: 161 -----------LKAAACAKHFAVHS----GPESLRHEFNAVVSKKDLYETYLPAFKALVQ 205
Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTH 343
+ K +M +YN+ NG P C L R EWGFKG++ SDC A+ + + T T
Sbjct: 206 EAKVESVMGAYNRTNGEPCCGSKTLLSDILRGEWGFKGHVVSDCWAIRDFHMHHHVTATA 265
Query: 344 EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPR 403
+SAA ++ G D+NCG M + A+ +G + E++IDRA+ L +++LG+F DP
Sbjct: 266 PESAALAVRNGCDLNCGN-MFGNLLIALKEGLITEEEIDRAVTRLMITRMKLGMF--DPE 322
Query: 404 -KGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNI 462
+ Y + D V EH++LALD A++ IVLLKND LPL++ + S+A+IGP ++
Sbjct: 323 DQVPYASISYDFVDCKEHRELALDVAKKSIVLLKND-GLLPLDRKKIRSIAVIGPNADSR 381
Query: 463 SQMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCH----DVPCNSDAG--FHEAVRI 513
+ G Y G ++L G+ + +Y+ GCH V + G EAV
Sbjct: 382 QALIGNYEGTASEYVTVLDGIREMAGDDVRIYYSVGCHLYKDRVENLGEPGDRIAEAVTC 441
Query: 514 AKKADFVIVVAGLDLTQETEDR---------DRVSLLLPGQQMSLVTSVARTSKRPVILV 564
A+ AD VI+ GLD T E E+ D+ L LPGQQ L+ +V T K P++LV
Sbjct: 442 AEHADVVIMCLGLDSTIEGEEMHESNIYGSGDKPDLNLPGQQQELLEAVYATGK-PIVLV 500
Query: 565 LTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK 624
L G L V++ AD I +IL YPG G +A+A ++FG+ NP G+LP+T+Y +
Sbjct: 501 LLTGSALAVTW--ADEHIPAILNAWYPGALGGRAIASVLFGETNPSGKLPVTFYRTTEEL 558
Query: 625 VPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISA-SLK 683
D +M R+YRF +Y FG GLSYT + Y S+L +S +++
Sbjct: 559 PDFTDYSME--------NRTYRFMKNEALYPFGFGLSYTTFDY------SDLKLSKDTIR 604
Query: 684 AGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ 743
AG F+V + VTN G + G VV ++ + + S
Sbjct: 605 AGEG--------------------------FNVSVKVTNTGKMAGEEVVQVYIKDLEASW 638
Query: 744 GTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
P QL G RV + + EI+F + P EQL++ G+ ++ G + VG
Sbjct: 639 RVPNWQLSGMKRVRLESGETAEITFEIRP-EQLAVVTDEGKSVIEPGEFEIYVG 691
>gi|292495634|sp|A1CND4.2|XYND_ASPCL RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
Length = 792
Score = 419 bits (1077), Expect = e-114, Method: Compositional matrix adjust.
Identities = 257/751 (34%), Positives = 399/751 (53%), Gaps = 56/751 (7%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
C+T S RA +LISL TL+E + + + +PRLG+P Y+ W+E+LHG+ +
Sbjct: 63 CDTLTSPYDRAAALISLFTLEELVNATGNTSPGVPRLGLPPYQVWNEALHGL--DRAYFT 120
Query: 115 FNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRD 174
G S TSFP +++ ++ NR+L + + S ++ + RA N G+ GL ++PNIN FR
Sbjct: 121 DEGQFSWSTSFPMPILTMSALNRTLINQVASIISTQGRAFSNAGRYGLDVYSPNINSFRH 180
Query: 175 PRWGRGQETPGEDPMVVS-AYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDR 233
P WGRGQETPGED +S AYA E++ QG G K
Sbjct: 181 PVWGRGQETPGEDAYCLSSAYAYEYITGIQG------------GVDPK------------ 216
Query: 234 GDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMC 293
L L A KH YD+E W SR + IT+QD + + P F K +MC
Sbjct: 217 --SLKLVATAKHYAGYDIENWDGHSRLGNDMNITQQDLSEYYTPQFLVAARDAKVRSVMC 274
Query: 294 SYNQVNGVPACLRGDLFQKA-RNEWGF--KGYITSDCDAVATIFEYQNYTKTHEDSAAGV 350
SYN VNGVP+C Q R+ +GF GYI+SDCD+ +F Y +AA
Sbjct: 275 SYNAVNGVPSCANSFFLQTLLRDTFGFVEDGYISSDCDSAYNVFNPHEYAANVSSAAADS 334
Query: 351 LKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKL 410
++AG DI+CGT + A+D+ + DI+R ++ L+S +RLG F+G+ Y L
Sbjct: 335 IRAGTDIDCGTTYQYYFDEAVDQNLLSRADIERGVIRLYSNLMRLGYFDGN--SSAYRNL 392
Query: 411 GPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT 470
+DV T+ ++ + +G VLLKND LPL++ ++ S+A++GP +N +Q+ G Y
Sbjct: 393 TWNDVVTTNSWNISYEV--EGTVLLKNDGT-LPLSE-SIRSIALVGPWMNVSTQLQGNYF 448
Query: 471 GIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQ 530
G S L +YA G ++ NS GF +A+ AKK+D +I G+D +
Sbjct: 449 GPAPYLISPLDAFRDSHLDVNYAFGT-NISSNSTDGFSKALSAAKKSDAIIFAGGIDNSL 507
Query: 531 ETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGY 590
E E DR+++ PG+Q+ L+ +++ K P+I++ GGG +D S +++ ++S++W GY
Sbjct: 508 EAETLDRMNITWPGKQLELIDQLSQLGK-PLIVLQMGGGQVDSSLLKSNKNVNSLIWGGY 566
Query: 591 PGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRFYT 649
PG++G +AL +II G P GRL +T YP + T+ P DM++R + PG++Y +YT
Sbjct: 567 PGQSGGQALLDIITGKRAPAGRLVVTQYPAEYATQFPATDMSLRPHGNN--PGQTYMWYT 624
Query: 650 GTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSC 709
GT VY FGHGL YT + +S + I + +++L Q Y+H++++
Sbjct: 625 GTPVYEFGHGLFYTTFR---VSHARAVKIKPTYNI---QDLLAQ--PHPGYIHVEQM--- 673
Query: 710 TSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFG 769
L F V I TN G + MLFA P+K L+GFDR+ T+ + ++
Sbjct: 674 PFLNFTVDI--TNTGKASSDYTAMLFANTTAGPAPYPKKWLVGFDRLPTLGPSTSKLMTI 731
Query: 770 VDPCEQLSIANKHGRRILPLGNHVLMVGELR 800
++ ++ G R+L G + L + R
Sbjct: 732 PVTINSMARTDELGNRVLYPGKYELALNNER 762
>gi|121809149|sp|Q4AEG8.1|XYND_ASPAW RecName: Full=Exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|73486695|dbj|BAE19756.1| beta-xylosidase [Aspergillus awamori]
Length = 804
Score = 419 bits (1076), Expect = e-114, Method: Compositional matrix adjust.
Identities = 273/781 (34%), Positives = 403/781 (51%), Gaps = 70/781 (8%)
Query: 42 FP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWS 100
FP C+ S+ C+ + + RA SLISL TL E I + + RLG+PAY+ WS
Sbjct: 55 FPDCQNGPLRSHLICDETATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWS 114
Query: 101 ESLHGIASNGPGVNFN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
E+LHG+ NF+ G + TSFPQ +++ A+ NR+L I S ++ + RA N G
Sbjct: 115 EALHGLDR----ANFSDSGAYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAG 170
Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
+ GL +APNIN FR P WGRGQETPGED + + YA E++ QG + +S+
Sbjct: 171 RYGLDVYAPNINTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQGPDPESN-------- 222
Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
L L+A KH YD+E W N SR + IT+QD + + P
Sbjct: 223 ------------------LKLAATAKHYAGYDIENWHNHSRLGNDMNITQQDLSEYYTPQ 264
Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGF--KGYITSDCDAVATIFE 335
F K +MC+YN VNGVPAC Q R+ +GF GY++SDCDA I+
Sbjct: 265 FHVAARDAKVQSVMCAYNAVNGVPACADSYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYN 324
Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
Y + +AA + AG DI+CGT H +I G + DI++ ++ L++ ++
Sbjct: 325 PHGYASSQAAAAAEAILAGTDIDCGTTYQWHLNESITAGDLSRDDIEQGVIRLYTTLVQA 384
Query: 396 GLFNGDPRKGK--YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAV---- 449
G F+ + K Y L DV ++ ++ AA QGIVLLKN LPL + A
Sbjct: 385 GYFDSNTTKANNPYRDLSWSDVLETDAWNISYQAATQGIVLLKNSNNVLPLTEKAYPPSN 444
Query: 450 SSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHE 509
+++A+IGP N +Q+ G Y G S E K ++A G + S +GF
Sbjct: 445 TTVALIGPWANATTQLLGNYYGNAPYMISPRAAFEEAGYKVNFAEGT-GISSTSTSGFAA 503
Query: 510 AVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTS-KRPVILVLTGG 568
A+ A+ AD +I G+D T E E DR S+ PG Q+ L+ +A + K+P+I++ GG
Sbjct: 504 ALSAAQSADVIIYAGGIDNTLEAEALDRESIAWPGNQLDLIQKLASAAGKKPLIVLQMGG 563
Query: 569 GPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPM 627
G +D S + ++++S++LW GYPG++G AL +II G NP GRL T YP S+ + P
Sbjct: 564 GQVDSSSLKNNTKVSALLWGGYPGQSGGFALRDIITGKKNPAGRLVTTQYPASYAEEFPA 623
Query: 628 NDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSD 687
DMN+R + PG++Y++YTG VY FGHGL YT + + S T + +K +
Sbjct: 624 TDMNLRPEGDN--PGQTYKWYTGEAVYEFGHGLFYTTF-----AESSSNTTTKEVKL-NI 675
Query: 688 KNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-P 746
++IL +T L +T L F I N G ++ + M+FA P
Sbjct: 676 QDILSRTHEEL-----ASITQLPVLNFTANIR--NTGKLESDYTAMVFANTSDAGPAPYP 728
Query: 747 EKQLIGFDRVHTVAKG-SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
+K L+G+DR+ V G ++E+ P E S A R+ G+ VL G +L +
Sbjct: 729 KKWLVGWDRLGEVKVGETRELRV---PVEVGSFA-----RVNEDGDWVLFPGTFELALNL 780
Query: 806 E 806
E
Sbjct: 781 E 781
>gi|67523807|ref|XP_659963.1| hypothetical protein AN2359.2 [Aspergillus nidulans FGSC A4]
gi|74597492|sp|Q5BAS1.1|XYND_EMENI RecName: Full=Exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|40745314|gb|EAA64470.1| hypothetical protein AN2359.2 [Aspergillus nidulans FGSC A4]
gi|259487761|tpe|CBF86686.1| TPA: Beta-xylosidase (EC 3.2.1.37)
[Source:UniProtKB/TrEMBL;Acc:O42810] [Aspergillus
nidulans FGSC A4]
Length = 803
Score = 419 bits (1076), Expect = e-114, Method: Compositional matrix adjust.
Identities = 255/766 (33%), Positives = 390/766 (50%), Gaps = 58/766 (7%)
Query: 42 FP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWS 100
FP C P C+ SLS RA +L+SL T E + + + RLG+P Y+ W
Sbjct: 47 FPDCASGPLSLTPVCDRSLSPKDRATALVSLFTFDELVNNTGNTGLGVSRLGLPNYQVWG 106
Query: 101 ESLHGIASNGPGVNF--NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
E+LHG+ NF +G S TSFP + A+ N++L IG+ V+ + RA N G
Sbjct: 107 EALHGVGR----ANFVESGNFSWATSFPMPITMMAALNKTLIHQIGTIVSTQLRAFSNAG 162
Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
G+ ++PNIN FR P WGRGQETPGED + S Y E++ + QG
Sbjct: 163 LGGVDVYSPNINTFRHPVWGRGQETPGEDAFLTSVYGYEYITALQG-------------- 208
Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
G E+ L + A KH YD+E W N SR + IT+Q+ + + PP
Sbjct: 209 -------GVDPET-----LKIIATAKHYAGYDIESWNNHSRLGNDMQITQQELSEYYTPP 256
Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFK--GYITSDCDAVATIFE 335
F K +MCSYN VNGVP+C Q R+ + F GY++ DC AV ++
Sbjct: 257 FIVASRDAKVRSVMCSYNAVNGVPSCANKFFLQTLLRDTFEFSEDGYVSGDCGAVYNVWN 316
Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
Y ++A + AG DI+CGT H++ A + V DI+R ++ L+S ++
Sbjct: 317 PHGYASNEAAASADSILAGTDIDCGTSYQWHSEDAFEDSLVSRSDIERGVIRLYSNLVQA 376
Query: 396 GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAII 455
G F+G+ Y + DDV +++ +A +AA +GIVLLKND+ LPL+K+ + S+A+I
Sbjct: 377 GYFDGE--DAPYRDITWDDVLSTDAWNIAYEAAVEGIVLLKNDET-LPLSKD-IKSVAVI 432
Query: 456 GPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAK 515
GP N ++ G Y G S L G HYA G ++ +S +GF EA+ AK
Sbjct: 433 GPWANVTEELQGNYFGPAPYLISPLTGFRDSGLDVHYALGT-NLTSHSTSGFEEALTAAK 491
Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
+AD +I G+D T E E DR ++ PG Q+ L++ ++ K P++++ GGG +D S
Sbjct: 492 QADAIIFAGGIDNTIEAEAMDRENITWPGNQLDLISKLSELGK-PLVVLQMGGGQVDSSS 550
Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV-PMNDMNMRA 634
+ + +++++W GYPG++G ALA+II G P GRL T YP + +V P DMN+R
Sbjct: 551 LKDNDNVNALIWGGYPGQSGGHALADIITGKRAPAGRLVTTQYPAEYAEVFPAIDMNLRP 610
Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
+ + PG++Y +YTGT VY FGHGL YT + + + AGS T
Sbjct: 611 NETSGNPGQTYMWYTGTPVYEFGHGLFYT----------TFEESTETTDAGSFNIQTVLT 660
Query: 695 GSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFD 754
Y H + T + +V N G+ + + +++ P+K ++GFD
Sbjct: 661 TPHSGYEHAQQKT-----LLNFTATVKNTGERESDYTALVYVNTTAGPAPYPKKWVVGFD 715
Query: 755 RVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELR 800
R+ + G + E ++ ++ G R+L G++ L + R
Sbjct: 716 RLGGLEPGDSQTLTVPVTVESVARTDEQGNRVLYPGSYELALNNER 761
>gi|164429277|ref|XP_958209.2| hypothetical protein NCU09923 [Neurospora crassa OR74A]
gi|16945419|emb|CAB91343.2| related to xylan 1, 4-beta-xylosidase [Neurospora crassa]
gi|157073010|gb|EAA28973.2| hypothetical protein NCU09923 [Neurospora crassa OR74A]
Length = 774
Score = 418 bits (1075), Expect = e-114, Method: Compositional matrix adjust.
Identities = 261/774 (33%), Positives = 393/774 (50%), Gaps = 62/774 (8%)
Query: 35 PVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIP 94
P PD C S C+ +LS RA +L++ +T +EK+Q L + PR+G+P
Sbjct: 24 PFQTYPD--CVNGPLASLKVCDATLSPPQRAAALVAAMTTEEKLQNLVSKSKGAPRIGLP 81
Query: 95 AYEWWSESLHGIASNGPGVNF---NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEA 151
AY WWSE+LHG+A PG F +G +S TSFP L+ AA+F+ L +G + E
Sbjct: 82 AYNWWSEALHGVAY-APGTQFRSGDGPFNSSTSFPMPLLMAATFDDELIEKVGEVIGTEG 140
Query: 152 RAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDD 211
RA N G +G +W PN+N F+DPRWGRG ETPGED + + YA ++ QG
Sbjct: 141 RAFGNAGFSGFDYWTPNVNPFKDPRWGRGSETPGEDILRIKRYAASMIRGLQGP------ 194
Query: 212 GGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDT 271
E+RV+ A CKH A D E W +R+ F+A +T QD
Sbjct: 195 ------LPERRVV----------------ATCKHYAANDFEDWNGSTRHDFDAKVTLQDL 232
Query: 272 EDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKG---YITSDC 327
+ + PF+ C K IMCSYN VNGVPAC L Q R W + YITSDC
Sbjct: 233 AEYYLSPFQQCARDSKVGSIMCSYNAVNGVPACANTYLMQTILREHWNWTAPGNYITSDC 292
Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLN 387
+AV IF +Y KT+ + A +AG D +C A +G +++ +DRAL
Sbjct: 293 EAVLDIFANHHYAKTNAEGTALAFEAGTDSSCEYESSSDIPGAWTQGLLEQSTVDRALTR 352
Query: 388 LFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF-LPLNK 446
L+ +R+G F+G+ +Y LG DV + + +++AL A +GIVLLKND+ L L
Sbjct: 353 LYEGLVRVGYFDGN--HSEYASLGWKDVNSPKSQEVALQTAVEGIVLLKNDQTLPLGLKT 410
Query: 447 NAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASG-CHDVPCNSDA 505
+ S LA+IG N+ + GGY+G P S + EA A G ++D
Sbjct: 411 DPKSKLAMIGFWANDPKTLSGGYSGKPAFEHSPVYAAEAMGFNVTTAGGPVLQNSTSNDT 470
Query: 506 GFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVL 565
A+ A+ A++++ GLD + E +DR ++ P Q+ L+ ++ + K V++ +
Sbjct: 471 WTQAALEAAQDANYILYFGGLDTSAAGETKDRTTINWPEAQLQLIKTLTKLGKPLVVVQM 530
Query: 566 TGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-K 624
G LD + A ++SILW +PG+ G A+ +I+ G +P GRLP+T YP ++T
Sbjct: 531 --GDQLDNTPLLATKTVNSILWANWPGQDGGTAVMQILTGLKSPAGRLPVTQYPANYTAA 588
Query: 625 VPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKA 684
VPM DMN+R S + PGR+YR+Y T V FG GL YT + K I+A L
Sbjct: 589 VPMTDMNLR--PSDRLPGRTYRWYP-TAVQPFGFGLHYTTFQAK---------IAAPLPR 636
Query: 685 GSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQG 744
+ +++L + G + D T +++ VTN+G+ +VV+ F +
Sbjct: 637 LAIQDLLSRCGGDNANAYPD-----TCALPPLKVEVTNSGNRSSDYVVLAFLAGDAGPRP 691
Query: 745 TPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
P K L+ + R+ V+ G K + ++ ++ G +L G + + V E
Sbjct: 692 YPIKTLVSYTRLRDVSPGHKTTAHLEWTLGDIARYDEQGNTVLYPGTYTVTVDE 745
>gi|238483831|ref|XP_002373154.1| beta-xylosidase XylA [Aspergillus flavus NRRL3357]
gi|292495283|sp|B8MYV0.1|XYND_ASPFN RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|220701204|gb|EED57542.1| beta-xylosidase XylA [Aspergillus flavus NRRL3357]
Length = 797
Score = 418 bits (1074), Expect = e-114, Method: Compositional matrix adjust.
Identities = 263/760 (34%), Positives = 385/760 (50%), Gaps = 65/760 (8%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
C+TS RA +L+SLLT +E + ++ PR+G+PAY+ W+E+LHG+A +
Sbjct: 63 CDTSAKPHDRAAALVSLLTFEELVNNTANTGHGAPRIGLPAYQVWNEALHGVAH----AD 118
Query: 115 FN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
F+ G S TSFPQ + + A+ NR+L I + ++ + RA N G+ GL ++PNIN F
Sbjct: 119 FSDAGDFSWSTSFPQPISTMAALNRTLIHQIATIISTQGRAFMNAGRYGLDVYSPNINTF 178
Query: 173 RDPRWGRGQETPGEDPM-VVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
R P WGRGQETPGED + S YA E++ QG G+
Sbjct: 179 RHPVWGRGQETPGEDAYCLASTYAYEYITGIQG--------GV----------------- 213
Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
+ L L A KH YD+E W N SR + IT+QD + + P F K +
Sbjct: 214 -DANPLKLIATAKHYAGYDIENWDNHSRLGNDMQITQQDLAEYYTPQFLVASRDAKVHSV 272
Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGF--KGYITSDCDAVATIFEYQNYTKTHEDSAA 348
MCSYN VNGVP+C Q R+ + F GY++ DC AV +F Y +AA
Sbjct: 273 MCSYNAVNGVPSCSNSFFLQTLLRDTFDFVEDGYVSGDCGAVYNVFNPHGYATNESSAAA 332
Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
++AG DI+CG RH Q + +V +D++R + L++ +R G F+G + Y
Sbjct: 333 DSIRAGTDIDCGVSYPRHFQESFHDQEVSRQDLERGVTRLYASLIRAGYFDG--KTSPYR 390
Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPL-NKNAVSSLAIIGPLVNNISQMGG 467
+ DV ++ + L+ +AA Q IVLLKND LPL ++ ++A+IGP N +QM G
Sbjct: 391 NITWSDVVSTNAQNLSYEAAAQSIVLLKND-GILPLTTSSSTKTIALIGPWANATTQMLG 449
Query: 468 GYTGIP---CSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVA 524
Y G SP + E ++ T + D S + A+ AK+AD +I
Sbjct: 450 NYYGPAPYLISPLQAFQDSEYKITYTIGTNTTTDPDSTSQS---TALTTAKEADLIIFAG 506
Query: 525 GLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISS 584
G+D T ETE +DR ++ P Q+SL+T +A K P+I++ GGG +D S + + +++
Sbjct: 507 GIDNTLETEAQDRSNITWPSNQLSLITKLADLGK-PLIVLQMGGGQVDSSALKNNKNVNA 565
Query: 585 ILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV-PMNDMNMRADSSRQYPGR 643
++W GYPG++G +ALA+II G P RL T YP + +V P DMN+R + S PG+
Sbjct: 566 LIWGGYPGQSGGQALADIITGKRAPAARLVTTQYPAEYAEVFPAIDMNLRPNGSN--PGQ 623
Query: 644 SYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVH- 702
+Y +YTGT VY FGHGL YTN+ T SAS +G+ L H
Sbjct: 624 TYMWYTGTPVYEFGHGLFYTNF-----------TASASAGSGTKNRTSFNIDEVLGRPHP 672
Query: 703 -IDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAK 761
V L F V V N GD + M F P K L+GFDR+ V
Sbjct: 673 GYKLVEQMPLLNFTVD--VKNTGDRVSDYTAMAFVNTTAGPAPHPNKWLVGFDRLSAVEP 730
Query: 762 GSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRH 801
GS + + L+ ++ G R+L G + + + R
Sbjct: 731 GSAKTMVIPVTVDSLARTDEEGNRVLYPGRYEVALNNERE 770
>gi|145230215|ref|XP_001389416.1| exo-1,4-beta-xylosidase xlnD [Aspergillus niger CBS 513.88]
gi|74626559|sp|O00089.2|XYND_ASPNG RecName: Full=Exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|292495287|sp|A2QA27.1|XYND_ASPNC RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|2181180|emb|CAB06417.1| xylosidase [Aspergillus niger]
gi|134055533|emb|CAK37179.1| xylosidase xlnD-Aspergillus niger
gi|350638468|gb|EHA26824.1| hypothetical protein ASPNIDRAFT_205670 [Aspergillus niger ATCC
1015]
Length = 804
Score = 418 bits (1074), Expect = e-114, Method: Compositional matrix adjust.
Identities = 273/781 (34%), Positives = 402/781 (51%), Gaps = 70/781 (8%)
Query: 42 FP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWS 100
FP C+ S+ C+ + + RA SLISL TL E I + + RLG+PAY+ WS
Sbjct: 55 FPDCQNGPLRSHLICDETATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWS 114
Query: 101 ESLHGIASNGPGVNFN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
E+LHG+ NF+ G + TSFPQ +++ A+ NR+L I S ++ + RA N G
Sbjct: 115 EALHGLDR----ANFSDSGAYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAG 170
Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
+ GL +APNIN FR P WGRGQETPGED + + YA E++ QG + +S+
Sbjct: 171 RYGLDVYAPNINTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQGPDPESN-------- 222
Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
L L+A KH YD+E W N SR + IT+QD + + P
Sbjct: 223 ------------------LKLAATAKHYAGYDIENWHNHSRLGNDMNITQQDLSEYYTPQ 264
Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGF--KGYITSDCDAVATIFE 335
F K +MC+YN VNGVPAC Q R+ +GF GY++SDCDA I+
Sbjct: 265 FHVAARDAKVQSVMCAYNAVNGVPACADSYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYN 324
Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
Y + +AA + AG DI+CGT H +I G + DI++ ++ L++ ++
Sbjct: 325 PHGYASSQAAAAAEAILAGTDIDCGTTYQWHLNESIAAGDLSRDDIEQGVIRLYTTLVQA 384
Query: 396 GLFNGDPRKGK--YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAV---- 449
G F+ + K Y L DV ++ ++ AA QGIVLLKN LPL + A
Sbjct: 385 GYFDSNTTKANNPYRDLSWSDVLETDAWNISYQAATQGIVLLKNSNNVLPLTEKAYPPSN 444
Query: 450 SSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHE 509
+++A+IGP N +Q+ G Y G S E K ++A G + S +GF
Sbjct: 445 TTVALIGPWANATTQLLGNYYGNAPYMISPRAAFEEAGYKVNFAEGT-GISSTSTSGFAA 503
Query: 510 AVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTS-KRPVILVLTGG 568
A+ A+ AD +I G+D T E E DR S+ PG Q+ L+ +A + K+P+I++ GG
Sbjct: 504 ALSAAQSADVIIYAGGIDNTLEAEALDRESIAWPGNQLDLIQKLASAAGKKPLIVLQMGG 563
Query: 569 GPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPM 627
G +D S + ++ +S++LW GYPG++G AL +II G NP GRL T YP S+ + P
Sbjct: 564 GQVDSSSLKNNTNVSALLWGGYPGQSGGFALRDIITGKKNPAGRLVTTQYPASYAEEFPA 623
Query: 628 NDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSD 687
DMN+R + PG++Y++YTG VY FGHGL YT + + S T + +K +
Sbjct: 624 TDMNLRPEGDN--PGQTYKWYTGEAVYEFGHGLFYTTF-----AESSSNTTTKEVKL-NI 675
Query: 688 KNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-P 746
++IL QT L +T L F I N G ++ + M+FA P
Sbjct: 676 QDILSQTHEDL-----ASITQLPVLNFTANIR--NTGKLESDYTAMVFANTSDAGPAPYP 728
Query: 747 EKQLIGFDRVHTVAKG-SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
+K L+G+DR+ V G ++E+ P E S A R+ G+ V+ G +L +
Sbjct: 729 KKWLVGWDRLGEVKVGETRELRV---PVEVGSFA-----RVNEDGDWVVFPGTFELALNL 780
Query: 806 E 806
E
Sbjct: 781 E 781
>gi|290889355|gb|ADD69953.1| xylosidase HistTag [synthetic construct]
Length = 810
Score = 418 bits (1074), Expect = e-114, Method: Compositional matrix adjust.
Identities = 273/781 (34%), Positives = 402/781 (51%), Gaps = 70/781 (8%)
Query: 42 FP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWS 100
FP C+ S+ C+ + + RA SLISL TL E I + + RLG+PAY+ WS
Sbjct: 55 FPDCQNGPLRSHLICDETATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWS 114
Query: 101 ESLHGIASNGPGVNFN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
E+LHG+ NF+ G + TSFPQ +++ A+ NR+L I S ++ + RA N G
Sbjct: 115 EALHGLDR----ANFSDSGAYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAG 170
Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
+ GL +APNIN FR P WGRGQETPGED + + YA E++ QG + +S+
Sbjct: 171 RYGLDVYAPNINTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQGPDPESN-------- 222
Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
L L+A KH YD+E W N SR + IT+QD + + P
Sbjct: 223 ------------------LKLAATAKHYAGYDIENWHNHSRLGNDMNITQQDLSEYYTPQ 264
Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGF--KGYITSDCDAVATIFE 335
F K +MC+YN VNGVPAC Q R+ +GF GY++SDCDA I+
Sbjct: 265 FHVAARDAKVQSVMCAYNAVNGVPACADSYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYN 324
Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
Y + +AA + AG DI+CGT H +I G + DI++ ++ L++ ++
Sbjct: 325 PHGYASSQAAAAAEAILAGTDIDCGTTYQWHLNESIAAGDLSRDDIEQGVIRLYTTLVQA 384
Query: 396 GLFNGDPRKGK--YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAV---- 449
G F+ + K Y L DV ++ ++ AA QGIVLLKN LPL + A
Sbjct: 385 GYFDSNTTKANNPYRDLSWSDVLETDAWNISYQAATQGIVLLKNSNNVLPLTEKAYPPSN 444
Query: 450 SSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHE 509
+++A+IGP N +Q+ G Y G S E K ++A G + S +GF
Sbjct: 445 TTVALIGPWANATTQLLGNYYGNAPYMISPRAAFEEAGYKVNFAEGT-GISSTSTSGFAA 503
Query: 510 AVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTS-KRPVILVLTGG 568
A+ A+ AD +I G+D T E E DR S+ PG Q+ L+ +A + K+P+I++ GG
Sbjct: 504 ALSAAQSADVIIYAGGIDNTLEAEALDRESIAWPGNQLDLIQKLASAAGKKPLIVLQMGG 563
Query: 569 GPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPM 627
G +D S + ++ +S++LW GYPG++G AL +II G NP GRL T YP S+ + P
Sbjct: 564 GQVDSSSLKNNTNVSALLWGGYPGQSGGFALRDIITGKKNPAGRLVTTQYPASYAEEFPA 623
Query: 628 NDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSD 687
DMN+R + PG++Y++YTG VY FGHGL YT + + S T + +K +
Sbjct: 624 TDMNLRPEGDN--PGQTYKWYTGEAVYEFGHGLFYTTF-----AESSSNTTTKEVKL-NI 675
Query: 688 KNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-P 746
++IL QT L +T L F I N G ++ + M+FA P
Sbjct: 676 QDILSQTHEDL-----ASITQLPVLNFTANIR--NTGKLESDYTAMVFANTSDAGPAPYP 728
Query: 747 EKQLIGFDRVHTVAKG-SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
+K L+G+DR+ V G ++E+ P E S A R+ G+ V+ G +L +
Sbjct: 729 KKWLVGWDRLGEVKVGETRELRV---PVEVGSFA-----RVNEDGDWVVFPGTFELALNL 780
Query: 806 E 806
E
Sbjct: 781 E 781
>gi|194400335|gb|ACF61038.1| beta-xylosidase [Aspergillus awamori]
Length = 804
Score = 417 bits (1072), Expect = e-113, Method: Compositional matrix adjust.
Identities = 273/781 (34%), Positives = 399/781 (51%), Gaps = 70/781 (8%)
Query: 42 FP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWS 100
FP C+ S+ C+ S + RA SLISL TL E I + + RLG+PAY+ WS
Sbjct: 55 FPDCQNGPLRSHLICDESATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWS 114
Query: 101 ESLHGIASNGPGVNFN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
E+LHG+ NF+ G+ + TSFPQ +++ A+ NR+L I S ++ + RA N G
Sbjct: 115 EALHGLDR----ANFSDSGSYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAG 170
Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
+ GL +APNIN FR P WGRGQETPGED + + YA E++ QG
Sbjct: 171 RYGLDVYAPNINTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQG-------------- 216
Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
D L L+A KH YD+E W N SR + IT+QD + + P
Sbjct: 217 ------------PDPDSNLKLAATAKHYAGYDIENWHNHSRLGNDMNITQQDLSEYYTPQ 264
Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGF--KGYITSDCDAVATIFE 335
F K +MC+YN VNGVPAC Q R+ +GF GY++SDCDA I+
Sbjct: 265 FHVAARDAKVHSVMCAYNAVNGVPACADSYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYN 324
Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
Y + +AA + AG DI+CGT H +I G + DI++ ++ L++ ++
Sbjct: 325 PHGYASSQAAAAAEAILAGTDIDCGTTYQWHLNESITAGDLSRDDIEKGVIRLYTTLVQA 384
Query: 396 GLFNGDPRKGK--YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAV---- 449
G F+ + K Y L DV ++ ++ AA QGIVLLKN LPL + A
Sbjct: 385 GYFDSNTTKANNPYRDLTWSDVLETDAWNISYQAATQGIVLLKNSNNVLPLTEKAYPPSN 444
Query: 450 SSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHE 509
+++A+IGP N +Q+ G Y G S E K ++A G + S +GF
Sbjct: 445 TTVALIGPWANATTQLLGNYYGNAPYMISPRAAFEEAGYKVNFAEGT-GISSTSTSGFAA 503
Query: 510 AVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTS-KRPVILVLTGG 568
A+ A+ AD +I G+D T E E DR S+ PG Q+ L+ +A ++ +P+I++ GG
Sbjct: 504 ALSAARSADVIIYAGGIDNTLEAEALDRESIAWPGNQLDLIQKLASSAGSKPLIVLQMGG 563
Query: 569 GPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPM 627
G +D S + ++ ++++LW GYPG++G AL +II G NP GRL T YP S+ + P
Sbjct: 564 GQVDSSSLKNNTNVTALLWGGYPGQSGGFALRDIITGKKNPAGRLVTTQYPASYAEEFPA 623
Query: 628 NDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSD 687
DMN+R + PG++Y++YTG VY FGHGL YT + + S T + +K +
Sbjct: 624 TDMNLRPEGDN--PGQTYKWYTGEAVYEFGHGLFYTTF-----AESSSNTTTKEVKL-NI 675
Query: 688 KNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-P 746
++IL QT L +T L F I N G ++ + M+FA P
Sbjct: 676 QDILSQTHEEL-----ASITQLPVLNFTANIK--NTGKLESDYTAMVFANTSDAGPAPYP 728
Query: 747 EKQLIGFDRVHTVAKG-SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
K L+G+DR+ V G ++E+ P E S A R+ G+ VL G +L +
Sbjct: 729 VKWLVGWDRLGDVKVGETRELRV---PVEVGSFA-----RVNEDGDWVLFPGTFELALNL 780
Query: 806 E 806
E
Sbjct: 781 E 781
>gi|169767016|ref|XP_001817979.1| exo-1,4-beta-xylosidase xlnD [Aspergillus oryzae RIB40]
gi|121805502|sp|Q2UR38.1|XYND_ASPOR RecName: Full=Exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|83765834|dbj|BAE55977.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 798
Score = 417 bits (1072), Expect = e-113, Method: Compositional matrix adjust.
Identities = 264/768 (34%), Positives = 391/768 (50%), Gaps = 80/768 (10%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
C+TS RA +L+SLLT +E + ++ PR+G+PAY+ W+E+LHG+A +
Sbjct: 63 CDTSAKPHDRAAALVSLLTFEELVNNTANTGHGAPRIGLPAYQVWNEALHGVAH----AD 118
Query: 115 FN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
F+ G S TSFPQ + + A+ NR+L I + ++ + RA N G+ GL ++PNIN F
Sbjct: 119 FSDAGGFSWSTSFPQPISTMAALNRTLIHQIATIISTQGRAFMNAGRYGLDVYSPNINTF 178
Query: 173 RDPRWGRGQETPGEDPM-VVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
R P WGRGQETPGED + S YA E++ QG G+
Sbjct: 179 RHPVWGRGQETPGEDAYCLASTYAYEYITGIQG--------GV----------------- 213
Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
+ L L A KH YD+E W N SR + IT+QD + + P F K +
Sbjct: 214 -DANPLKLIATAKHYAGYDIENWDNHSRLGNDMQITQQDLAEYYTPQFLVASRDAKVHSV 272
Query: 292 MCSYNQVNGVPACLRGDLFQK-ARNEWGF--KGYITSDCDAVATIFEYQNYTKTHEDSAA 348
MCSYN VNGVP+C Q R+ + F GY++ DC AV +F Y +AA
Sbjct: 273 MCSYNAVNGVPSCSNSFFLQTLLRDTFDFVEDGYVSGDCGAVYNVFNPHGYATNESSAAA 332
Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
++AG DI+CG RH Q + +V +D++R + L++ +R G F+G + Y
Sbjct: 333 DSIRAGTDIDCGVSYPRHFQESFHDQEVSRQDLERGVTRLYASLIRAGYFDG--KTSPYR 390
Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPL--NKNAVSSLAIIGPLVNNISQMG 466
+ DV ++ + L+ +AA Q IVLLKND LPL ++ ++A+IGP N +QM
Sbjct: 391 NITWSDVVSTNAQNLSYEAAAQSIVLLKND-GILPLTSTSSSTKTIALIGPWANATTQML 449
Query: 467 GGYTGIP---CSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVV 523
G Y G SP + E ++ T + D S + A+ AK+AD +I
Sbjct: 450 GNYYGPAPYLISPLQAFQDSEYKITYTIGTNTTTDPDSTSQS---TALTTAKEADLIIFA 506
Query: 524 AGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQIS 583
G+D T ETE +DR ++ P Q+SL+T +A K P+I++ GGG +D S + + ++
Sbjct: 507 GGIDNTLETEAQDRSNITWPSNQLSLITKLADLGK-PLIVLQMGGGQVDSSALKNNKNVN 565
Query: 584 SILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV-PMNDMNMRADSSRQYPG 642
+++W GYPG++G +ALA+II G P RL T YP + +V P DMN+R + S PG
Sbjct: 566 ALIWGGYPGQSGGQALADIITGKRAPAARLVTTQYPAEYAEVFPAIDMNLRPNGSN--PG 623
Query: 643 RSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVH 702
++Y +YTGT VY FGHGL YTN+ T SAS +G T +R + +
Sbjct: 624 QTYMWYTGTPVYEFGHGLFYTNF-----------TASASASSG--------TKNRTSF-N 663
Query: 703 IDEVTSCTSLRFHV---------QISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
IDEV L + + + V N GD + M F P K L+GF
Sbjct: 664 IDEVLGRPHLGYKLVEQMPLLNFTVDVKNTGDRVSDYTAMAFVNTTAGPAPHPNKWLVGF 723
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRH 801
DR+ V GS + + L+ ++ G R+L G + + + R
Sbjct: 724 DRLSAVEPGSAKTMVIPVTVDSLARTDEEGNRVLYPGRYEVALNNERE 771
>gi|378730020|gb|EHY56479.1| beta-glucosidase, variant [Exophiala dermatitidis NIH/UT8656]
gi|378730021|gb|EHY56480.1| beta-glucosidase [Exophiala dermatitidis NIH/UT8656]
Length = 783
Score = 417 bits (1072), Expect = e-113, Method: Compositional matrix adjust.
Identities = 267/763 (34%), Positives = 399/763 (52%), Gaps = 64/763 (8%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
CNT+ S++ RAK+L++ LT +EK + + +PRLG+ +Y+WW E+LHG+AS+ PGVN
Sbjct: 35 CNTNASVADRAKALVAALTNEEKFNLTGNTSPGVPRLGLYSYQWWQEALHGVASS-PGVN 93
Query: 115 FN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
F+ G S TSFPQ ++ +A+F+ +L + + + V+ EARA N+ ++GL FW PNIN +
Sbjct: 94 FSTSGDFSHATSFPQPILMSAAFDDALINAVATVVSTEARAFNNVNRSGLDFWTPNINPY 153
Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
+DPRWGRGQETPGED + +Y + QG G K+V+
Sbjct: 154 KDPRWGRGQETPGEDTFHLKSYVAALIDGLQG----------GLNPPIKKVI-------- 195
Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
A CKH +AYDLE W RY+F+A+++ QD + + PF++C + IM
Sbjct: 196 --------ATCKHFVAYDLEDWITTDRYNFDAIVSTQDLAEYYMQPFQTCARDARVGSIM 247
Query: 293 CSYNQVNGVPACLRGDLFQKA-RNEWGFKG---YITSDCDAVATIFEYQNYTKTHEDSAA 348
CSYN +NGVP C + Q R W + Y+TSDCDA+ I+ Y T E + A
Sbjct: 248 CSYNAMNGVPTCADPYILQTVLREHWNWTDDGQYVTSDCDAIQNIYAPHYYEPTREQAVA 307
Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
L AG D+NCGT H +A +G + ID+ + L+S ++LG F+ P Y
Sbjct: 308 DALTAGTDLNCGTYYQTHLPAAFSEGLFNQTVIDQTITRLYSALIKLGYFD-PPSATPYR 366
Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF---LPLNKNAVSSLAIIGPLVNNISQM 465
L DV T + LAL AA +GIVLLKND P +KN +++AIIG N + M
Sbjct: 367 SLNWSDVSTPAAEALALKAAEEGIVLLKNDGLLPLSFPTDKN--TTVAIIGGWANATTTM 424
Query: 466 GGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG 525
G Y GI S L L+ + G VP G+ E + A +AD +I+ G
Sbjct: 425 QGNYFGIAPYLHSPLYALQQLPNINAVYGGGFGVPTTD--GWDELLGAAGEADLIIIADG 482
Query: 526 LDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSI 585
L + E+E DR ++ + ++ ++ K V L + G LD + + IS++
Sbjct: 483 LTTSDESESNDRYTIGWQPAAIDIINQLSGMGKPTVFLQM--GDQLDNTPLLNNPNISAL 540
Query: 586 LWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRS 644
+W GYPG AG AL I+ G P GRLP+T YP + +V M DM +R +++ PGR+
Sbjct: 541 IWGGYPGMAGGDALINILTGKAAPAGRLPVTQYPADYVNQVNMTDMELRPNATSGNPGRT 600
Query: 645 YRFYTGTQVYGFGHGLSYTNYSY--------KFLSAPSELTISASLKAGSDKNI--LQQT 694
Y++Y V FG+GL YTN+S + S PS ++S G+ NI L +
Sbjct: 601 YKWYNNA-VLPFGYGLHYTNFSVAASAQGQAQTQSGPSS---NSSQGQGTSYNISSLVSS 656
Query: 695 GSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVML-FARVPKVSQGTPEKQLIGF 753
R Y ++D C F+V +VTN G S V L F Q P KQL+ +
Sbjct: 657 CDRSQYAYLDL---CPFESFNV--NVTNTGSKLASDFVALGFISGSYGPQPYPIKQLVAY 711
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
R+ ++ G+ + L+ +++G +L G++ L++
Sbjct: 712 QRLFNISAGASATATLNLTLGSLARHDENGNAVLYPGDYGLLI 754
>gi|354508473|gb|AER26905.1| beta-xylosidase 3 [synthetic construct]
Length = 778
Score = 416 bits (1070), Expect = e-113, Method: Compositional matrix adjust.
Identities = 272/781 (34%), Positives = 399/781 (51%), Gaps = 70/781 (8%)
Query: 42 FP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWS 100
FP C+ S+ C+ S + RA SLISL TL E I + + RLG+PAY+ WS
Sbjct: 29 FPDCQNGPLRSHLICDESATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWS 88
Query: 101 ESLHGIASNGPGVNFN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
E+LHG+ NF+ G+ + TSFPQ +++ A+ NR+L I S ++ + RA N G
Sbjct: 89 EALHGLDR----ANFSDSGSYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAG 144
Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
+ GL +APNIN FR P WGRGQETPGED + + YA E++ QG
Sbjct: 145 RYGLDVYAPNINTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQG-------------- 190
Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
D L L+A KH YD+E W N SR + IT+QD + + P
Sbjct: 191 ------------PDPDSNLKLAATAKHYAGYDIENWHNHSRLGNDMNITQQDLSEYYTPQ 238
Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGF--KGYITSDCDAVATIFE 335
F K +MC+YN V+GVPAC Q R+ +GF GY++SDCDA I+
Sbjct: 239 FHVAARDAKVHSVMCAYNAVDGVPACADSYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYN 298
Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
Y + +AA + AG DI+CGT H +I G + DI++ ++ L++ ++
Sbjct: 299 PHGYASSQAAAAAEAILAGTDIDCGTTYQWHLNESITAGDLSRDDIEKGVIRLYTTLVQA 358
Query: 396 GLFNGDPRKGK--YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAV---- 449
G F+ + K Y L DV ++ ++ AA QGIVLLKN LPL + A
Sbjct: 359 GYFDSNTTKANNPYRDLTWSDVLETDAWNISYQAATQGIVLLKNSNNVLPLTEKAYPPSN 418
Query: 450 SSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHE 509
+++A+IGP N +Q+ G Y G S E K ++A G + S +GF
Sbjct: 419 TTVALIGPWANATTQLLGNYYGNAPYMISPRAAFEEAGYKVNFAEGT-GISSTSTSGFAA 477
Query: 510 AVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTS-KRPVILVLTGG 568
A+ A+ AD +I G+D T E E DR S+ PG Q+ L+ +A ++ +P+I++ GG
Sbjct: 478 ALSAARSADVIIYAGGIDNTLEAEALDRESIAWPGNQLDLIQKLASSAGSKPLIVLQMGG 537
Query: 569 GPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPM 627
G +D S + ++ ++++LW GYPG++G AL +II G NP GRL T YP S+ + P
Sbjct: 538 GQVDSSSLKNNTNVTALLWGGYPGQSGGFALRDIITGKKNPAGRLVTTQYPASYAEEFPA 597
Query: 628 NDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSD 687
DMN+R + PG++Y++YTG VY FGHGL YT + + S T + +K +
Sbjct: 598 TDMNLRPEGDN--PGQTYKWYTGEAVYEFGHGLFYTTF-----AESSSNTTTKEVKL-NI 649
Query: 688 KNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-P 746
++IL QT L +T L F I N G ++ + M+FA P
Sbjct: 650 QDILSQTHEEL-----ASITQLPVLNFTANIK--NTGKLESDYTAMVFANTSDAGPAPYP 702
Query: 747 EKQLIGFDRVHTVAKG-SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
K L+G+DR+ V G ++E+ P E S A R+ G+ VL G +L +
Sbjct: 703 VKWLVGWDRLGDVKVGETRELRV---PVEVGSFA-----RVNEDGDWVLFPGTFELALNL 754
Query: 806 E 806
E
Sbjct: 755 E 755
>gi|436410475|gb|AGB57183.1| beta-xylosidase [Aspergillus sp. BCC125]
Length = 804
Score = 416 bits (1069), Expect = e-113, Method: Compositional matrix adjust.
Identities = 274/781 (35%), Positives = 397/781 (50%), Gaps = 70/781 (8%)
Query: 42 FP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWS 100
FP C+ S+ C+ S + RA SLISL TL E I + + RLG+PAY+ WS
Sbjct: 55 FPDCQNGPLRSHLICDESATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWS 114
Query: 101 ESLHGIASNGPGVNFN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
E+LHG+ NF+ G+ + TSFPQ +++ A+ NR+L I S ++ + RA N G
Sbjct: 115 EALHGLDR----ANFSDLGSYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAG 170
Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
+ GL +APNIN FR P WGRGQETPGED + + YA E++ QG
Sbjct: 171 RYGLDVYAPNINTFRHPVWGRGQETPGEDVSLAAIYAYEYITGIQG-------------- 216
Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
D L L+A KH YD+E W N SR + IT+QD + + P
Sbjct: 217 ------------PDPDSNLKLAATAKHYAGYDIENWHNHSRLGNDMNITQQDLSEYYTPQ 264
Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGF--KGYITSDCDAVATIFE 335
F K +MC+YN VNGVPAC Q R+ +GF GY++SDCDA I+
Sbjct: 265 FHVAARDAKVHSVMCAYNAVNGVPACADSYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYN 324
Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
Y + +AA + AG DI+CGT H +I G + DI++ ++ L++ ++
Sbjct: 325 PHGYASSQAAAAAEAILAGTDIDCGTTYQWHLNESITAGDLSRDDIEKGVIRLYTTLVQA 384
Query: 396 GLFNGDPRKGK--YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAV---- 449
G F+ + K Y L DV ++ ++ AA QGIVLLKN LPL + A
Sbjct: 385 GYFDSNTTKANNPYRDLTWSDVLETDAWNISYQAATQGIVLLKNSNNVLPLTEKAYPPSN 444
Query: 450 SSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHE 509
+++A+IGP N +Q+ G Y G S E ++A G + S +GF
Sbjct: 445 TTVALIGPWANATTQLLGNYYGNAPYMISPRAAFEEAGYNVNFAEGT-GISSTSTSGFAA 503
Query: 510 AVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTS-KRPVILVLTGG 568
A+ A+ AD +I G+D T E E DR S+ PG Q+ L+ +A ++ +P+I++ GG
Sbjct: 504 ALSAAQSADVIIYAGGIDNTLEAEALDRESIAWPGNQLDLIQKLASSAGSKPLIVLQMGG 563
Query: 569 GPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPM 627
G +D S + ++ +S++LW GYPG++G AL +II G NP GRL T YP S+ + P
Sbjct: 564 GQVDSSSLKNNTNVSALLWGGYPGQSGGFALRDIITGKKNPAGRLVTTQYPASYAEEFPA 623
Query: 628 NDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSD 687
DMN+R + PG++Y++YTG VY FGHGL YT + + S T + +K +
Sbjct: 624 TDMNLRPEGDN--PGQTYKWYTGEAVYEFGHGLFYTTF-----AESSSNTTTREIKL-NI 675
Query: 688 KNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-P 746
++IL QT L +T L F I N G V+ + M+FA P
Sbjct: 676 QDILSQTHEDL-----ASITQLPVLNFTANIK--NTGKVESDYTAMVFANTSDAGPAPYP 728
Query: 747 EKQLIGFDRVHTVAKG-SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
K L+G+DR+ V G ++E+ P E S A R+ G+ VL G L +
Sbjct: 729 VKWLVGWDRLGDVKVGETRELRV---PIEVGSFA-----RVNEDGDWVLFPGTFELGLNL 780
Query: 806 E 806
E
Sbjct: 781 E 781
>gi|310792973|gb|EFQ28434.1| glycosyl hydrolase family 3 N terminal domain-containing protein
[Glomerella graminicola M1.001]
Length = 728
Score = 416 bits (1069), Expect = e-113, Method: Compositional matrix adjust.
Identities = 250/710 (35%), Positives = 384/710 (54%), Gaps = 77/710 (10%)
Query: 72 LTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF---NGTVSSVTSFPQV 128
++++EK++ L D ++ + LG+P + WW+E LHG+ + PGV F + TSFP
Sbjct: 1 MSVEEKVRNLVDASAGVKSLGLPPHGWWNEGLHGVGFS-PGVLFAQDSEPFGYATSFPLP 59
Query: 129 LVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDP 188
+++AASF+ L++ IG + E RA N G AG FW PN+N FRDPRWGRGQETPGED
Sbjct: 60 ILTAASFDDDLFNAIGQVIGREGRAFSNYGYAGFNFWTPNMNAFRDPRWGRGQETPGEDV 119
Query: 189 MVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIA 248
+VVS Y +V QG SD D+++++A CKH A
Sbjct: 120 LVVSNYVQSYVTGLQG--------------------------SDPTDKVIIAA-CKHFAA 152
Query: 249 YDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGD 308
YD+E +R + N T+QD +D + P FR C+ +MCSYN V+G+PAC
Sbjct: 153 YDIET----ARRANNYNPTQQDLQDYYLPAFRRCVRDSHVGTVMCSYNSVDGIPACSSEY 208
Query: 309 LFQKA-RNEWGFKG---YITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML 364
L ++ R+ WGF ++ SDC AV ++ N+T T +D+A+ + AG D+ CG+ L
Sbjct: 209 LLKEVLRDTWGFTNDYQFVVSDCGAVTDVWLLHNFTNTEQDAASVSMAAGTDLECGSSYL 268
Query: 365 RHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLA 424
S DK QE+ +D AL L+ +G F+G + LG DV T + +++A
Sbjct: 269 HLNGSLADKQVTQER-VDEALTRLYKALFTVGYFDG----SSHSSLGWSDVSTIDAQQIA 323
Query: 425 LDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLE 484
+AAR G+ LLKND LPL S+A+IGP N +QM G Y G +S L
Sbjct: 324 CEAARAGMTLLKND-GVLPLADGKYKSVALIGPFANATTQMQGNYFGRAPFVRSPLWAFT 382
Query: 485 AYVS-KTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLP 543
S + +YA+G D+ SD+GF +A+ AK +D VI G+D T E E DRVS+ P
Sbjct: 383 QQSSLQVNYAAGT-DINSTSDSGFADALAAAKNSDIVIFCGGIDTTIEAETLDRVSITWP 441
Query: 544 GQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEII 603
G Q+ L++ ++ K P+++ GGG +D + ++ ++++ W G PG+AG A+ +++
Sbjct: 442 GNQLDLISQLSMLGK-PLVVAQFGGGQVDDTALVDNANVNALFWAGLPGQAGGLAMYDLV 500
Query: 604 FGDFNPGGRLPMTWYPESFTK-VPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
G + GRLP T YP S+ V + ++N+R + + +PGR+Y++Y G V+ FG GL Y
Sbjct: 501 VGKASFAGRLPTTQYPASYADLVSIFNINLRPNGT--FPGRTYKWYIGEPVFPFGFGLHY 558
Query: 663 TNYSYKFLSAPSELTISASLKAGSD-KNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVT 721
T +++ T +L+ D NI+ S+ + H+ + T TS V ++V
Sbjct: 559 TKFNF---------TWKDTLEPTYDISNIISWARSQ-NNGHVTDTTPFTS----VNVTVK 604
Query: 722 NAGDVDGSHVVMLF------ARVPKVSQGTPEKQLIGFDRVHTVAKGSKE 765
N G+V +V +LF VP+ P K L + R H + G+ +
Sbjct: 605 NVGNVRSDYVGLLFLSSKNAGPVPR-----PNKSLASYSRAHDIETGASD 649
>gi|2723496|dbj|BAA24107.1| beta-1,4-xylosidase [Aspergillus oryzae]
Length = 798
Score = 416 bits (1068), Expect = e-113, Method: Compositional matrix adjust.
Identities = 259/759 (34%), Positives = 384/759 (50%), Gaps = 62/759 (8%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
C+TS RA +L+SLLT +E + ++ PR+G+PAY+ W+E+LHG+A +
Sbjct: 63 CDTSAKPHDRAAALVSLLTFEELVNNTANTGHGAPRIGLPAYQVWNEALHGVAH----AD 118
Query: 115 FN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
F+ G S TSFPQ + + A+ NR+L I + ++ + RA N G+ GL ++PNIN F
Sbjct: 119 FSDAGDFSWSTSFPQPISTMAALNRTLIHQIATIISTQGRAFMNAGRYGLDVYSPNINTF 178
Query: 173 RDPRWGRGQETPGEDPM-VVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
R P WGRGQETPGED + S YA E++ QG G+
Sbjct: 179 RHPVWGRGQETPGEDAYCLASTYAYEYITGIQG--------GV----------------- 213
Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
+ L L A KH YD+E W N SR + IT+QD + + P F K +
Sbjct: 214 -DANPLKLIATAKHYAGYDIENWDNHSRLGNDMQITQQDLAEYYTPQFLVASRDAKVHSV 272
Query: 292 MCSYNQVNGVPACLRGDLFQK-ARNEWGF--KGYITSDCDAVATIFEYQNYTKTHEDSAA 348
MCSYN VNGVP+C Q R+ + F GY++ DC AV +F Y +AA
Sbjct: 273 MCSYNAVNGVPSCSNSFFLQTLLRDTFDFVEDGYVSGDCGAVYNVFNPHGYATNESSAAA 332
Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
++AG DI+CG RH Q + +V +D++R ++ L++ +R G F+G + Y
Sbjct: 333 DSIRAGTDIDCGVSYPRHFQESFHDQEVSRQDLERGVIRLYASLIRAGYFDG--KTSPYR 390
Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPL--NKNAVSSLAIIGPLVNNISQMG 466
+ DV ++ + L+ +AA Q IVLLKND LPL ++ ++A+IGP N +QM
Sbjct: 391 NITWSDVVSTNAQNLSYEAAAQSIVLLKND-GILPLTSTSSSTKTIALIGPWANATTQML 449
Query: 467 GGYTGIP---CSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVV 523
G Y G SP + E ++ T + D S + A+ AK+AD +I
Sbjct: 450 GNYYGPAPYLISPLQAFQDSEYKITYTIGTNTTTDPDSTSQS---TALTTAKEADLIIFA 506
Query: 524 AGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQIS 583
G+D T ETE +DR ++ P Q+SL+T +A K P+I++ GGG +D S + + ++
Sbjct: 507 GGIDNTLETEAQDRSNITWPSNQLSLITKLADLGK-PLIVLQMGGGQVDSSALKNNKNVN 565
Query: 584 SILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV-PMNDMNMRADSSRQYPG 642
+++W GYPG++G +ALA+II G P RL T YP + +V P DMN+R + S PG
Sbjct: 566 ALIWGGYPGQSGGQALADIITGKRAPAARLVTTQYPAEYAEVFPAIDMNLRPNGSN--PG 623
Query: 643 RSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVH 702
++Y +YTGT VY FGHGL YTN+ T SAS +G+ L H
Sbjct: 624 QTYMWYTGTPVYEFGHGLFYTNF-----------TASASAGSGTKNRTSFNIDEVLGRPH 672
Query: 703 IDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG 762
+ + V N GD + M F P K L+GFDR+ V G
Sbjct: 673 PGYKLVEQMPLLNFTVDVKNTGDRVSDYTAMAFVNTTAGPAPHPNKWLVGFDRLSAVEPG 732
Query: 763 SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRH 801
S + + L+ ++ G R+L G + + + R
Sbjct: 733 SAKTMVIPVTVDSLARTDEEGNRVLYPGRYEVALNNERE 771
>gi|224068498|ref|XP_002302758.1| predicted protein [Populus trichocarpa]
gi|222844484|gb|EEE82031.1| predicted protein [Populus trichocarpa]
Length = 462
Score = 416 bits (1068), Expect = e-113, Method: Compositional matrix adjust.
Identities = 203/456 (44%), Positives = 292/456 (64%), Gaps = 8/456 (1%)
Query: 352 KAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLG 411
+A +D++CG + +HT+ A+ KG + E +I+ ALLN +VQ+RLG+F+G+P YG LG
Sbjct: 5 QASLDLDCGPFLGQHTEDAVRKGLLTEAEINNALLNTLTVQMRLGMFDGEPSSKPYGNLG 64
Query: 412 PDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTG 471
P DVCT H++LAL+AARQGIVLLKN LPL+ S+AIIGP N M G Y G
Sbjct: 65 PTDVCTPAHQELALEAARQGIVLLKNHGPPLPLSTRHHQSVAIIGPNSNVTVTMIGNYAG 124
Query: 472 IPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQE 531
+ C + L+G+ Y +KT Y GC DV C SD F A+ A++AD ++V GLD + E
Sbjct: 125 VACGYTTPLQGIGRY-AKTIYQQGCADVACVSDQQFVAAMDAARQADATVLVMGLDQSIE 183
Query: 532 TEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYP 591
E RDR LLLPG+Q L++ VA SK P ILVL GGP+DVSFAE D +I I+W GYP
Sbjct: 184 AESRDRTELLLPGRQQELISKVAAASKGPTILVLMSGGPIDVSFAENDPKIGGIVWAGYP 243
Query: 592 GEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRFYTG 650
G+AG A+++++FG NPGG+LPMTWYP+ + T +PM +M MR S YPGR+YRFY G
Sbjct: 244 GQAGGAAISDVLFGTTNPGGKLPMTWYPQDYVTNLPMTNMAMRPSKSNGYPGRTYRFYKG 303
Query: 651 TQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCT 710
VY FGHG+SYTN+ + SAP+ +++ + +N +G + H C
Sbjct: 304 KVVYPFGHGISYTNFVHTIASAPTMVSVPLDGHRQASRNA-TISGKAIRVTH----ARCN 358
Query: 711 SLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGV 770
L F VQ+ V N G +DG+H ++++++ P P KQL+ F++VH A + + V
Sbjct: 359 RLSFGVQVDVKNTGSMDGTHTLLVYSK-PPAGHWAPLKQLVAFEKVHVAAGTQQRVGINV 417
Query: 771 DPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
C+ LS+ ++ G R +P+G H L +G+++HS++++
Sbjct: 418 HVCKFLSVVDRSGIRRIPMGAHSLHIGDVKHSVSLQ 453
>gi|358397360|gb|EHK46735.1| glycoside hydrolase family 3 protein [Trichoderma atroviride IMI
206040]
Length = 865
Score = 415 bits (1067), Expect = e-113, Method: Compositional matrix adjust.
Identities = 271/794 (34%), Positives = 425/794 (53%), Gaps = 86/794 (10%)
Query: 24 VVNVIAFSN--SKPVLNKPDFP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQ 80
++N+ + +N + P L P FP C S C+T+LS++ RA +++ +TL EK+
Sbjct: 111 MINIYSVTNPSANPPL-TPSFPDCTRDPLCSNAICDTTLSMAERAAAIVKPMTLDEKVAN 169
Query: 81 LSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFN----GTVSSVTSFPQVLVSAASFN 136
+ +AS RLG+PAY+W +E+LHG+A + GV F S+ TSFP ++ +A+F+
Sbjct: 170 VGSSASGSARLGLPAYQWQNEALHGVAGS-TGVQFQSPLGANFSAATSFPMPILLSAAFD 228
Query: 137 RSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAV 196
+L N+ +A++ EARA N G AGL FW PNIN FRDPRWGRG ETPGED + Y +
Sbjct: 229 DALVQNVATAISTEARAFANYGFAGLDFWTPNINPFRDPRWGRGMETPGEDAFRIQGYVL 288
Query: 197 EFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGN 256
+ Q GGI D + A CKH AYD+E
Sbjct: 289 ALISGLQ--------GGI------------------NPDFFRIIATCKHFAAYDIEN--- 319
Query: 257 FSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RN 315
R N T+QD D + P F +C+ K +MC+YN V+G+PAC L Q R+
Sbjct: 320 -GRTGNNLNPTQQDMADYYLPMFETCVRDAKVGSVMCAYNAVDGIPACASEYLLQDVLRD 378
Query: 316 EWGFK---GYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAID 372
+GF Y+ SDCDAV +F+ +Y ++AA L AG D++CG+ ++++
Sbjct: 379 GFGFTEDFNYVVSDCDAVDNVFDPHHYASNLTEAAALSLNAGTDLDCGSSY-NVLNASVE 437
Query: 373 KGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGI 432
E ++++L+ L+S +++G F+ + +Y L +V T++++ LA DAA G+
Sbjct: 438 AALTSEAALNQSLVRLYSALIKVGYFD---QPSEYKSLSWANVNTTQNQALAHDAATGGM 494
Query: 433 VLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSK--- 489
LLKND LPL++ +S++AIIGP VN +QM G Y G + L+ L+ + K
Sbjct: 495 TLLKNDGT-LPLSR-TLSNVAIIGPWVNATTQMQGNYAG---TAPFLVNPLDVFQQKWGN 549
Query: 490 THYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSL 549
YA G + +GF A+ A +D ++ + G+D+T E E DR S++ PG Q+ L
Sbjct: 550 VKYAQGTA-INSQDTSGFSAALSAASSSDVIVYLGGIDITVENEGFDRGSIVWPGNQLDL 608
Query: 550 VTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNP 609
++ +A K P+++V GGG +D S ++ + SILW GYPG+ G A+ +++ G P
Sbjct: 609 ISQLANLGK-PLVIVQFGGGQIDDSSLLSNPNVRSILWAGYPGQDGGNAVFDVLTGANPP 667
Query: 610 GGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYK 668
GRLP+T YP S+ + DMN+R S PGR+Y +YTGT V FG+GL YTN+S
Sbjct: 668 AGRLPITQYPASYINNNNIQDMNLR--PSNGIPGRTYAWYTGTPVLPFGYGLHYTNFSVS 725
Query: 669 FLSAPSELTISASLKAGSD-KNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAG--- 724
F S + AG+D I+ G+ +D TS+ + +SV N G
Sbjct: 726 FQSINT---------AGTDVATIVNNAGAVID----------TSVFATLVVSVHNTGGKA 766
Query: 725 DVDGSHVVMLFARVPKVSQGT-PEKQLIGFDRVHTVAKG-SKEISFGVDPCEQLSIANKH 782
++ +V ++F P KQL + R +V G +++++ ++ L+ A+ +
Sbjct: 767 NLASDYVGLVFLSSTNAGPSPYPNKQLAAYGRAKSVGVGATQQLTLKIN-LGSLARADTN 825
Query: 783 GRRILPLGNHVLMV 796
G R + G++ L +
Sbjct: 826 GDRWIYPGDYKLTL 839
>gi|4235093|gb|AAD13106.1| beta-xylosidase [Aspergillus niger]
Length = 804
Score = 415 bits (1067), Expect = e-113, Method: Compositional matrix adjust.
Identities = 272/781 (34%), Positives = 399/781 (51%), Gaps = 70/781 (8%)
Query: 42 FP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWS 100
FP C+ S+ C+ S + RA SLISL TL E I + + RLG+PAY+ WS
Sbjct: 55 FPDCQNGPLRSHLICDESATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWS 114
Query: 101 ESLHGIASNGPGVNFN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
E+LHG+ NF+ G+ + TSFPQ +++ A+ NR+L I S ++ + RA N G
Sbjct: 115 EALHGLDR----ANFSDSGSYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAG 170
Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
+ GL +APNIN FR P WGRGQETPGED + + YA E++ QG
Sbjct: 171 RYGLDVYAPNINTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQG-------------- 216
Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
D L L+A KH YD+E W N SR + IT+QD + + P
Sbjct: 217 ------------PDPDSNLKLAATAKHYAGYDIENWHNHSRLGNDMNITQQDLSEYYTPQ 264
Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGF--KGYITSDCDAVATIFE 335
F K +MC+YN V+GVPAC Q R+ +GF GY++SDCDA I+
Sbjct: 265 FHVAARDAKVHSVMCAYNAVDGVPACADSYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYN 324
Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
Y + +AA + AG DI+CGT H +I G + DI++ ++ L++ ++
Sbjct: 325 PHGYASSQAAAAAEAILAGTDIDCGTTYQWHLNESITAGDLSRDDIEKGVIRLYTTLVQA 384
Query: 396 GLFNGDPRKGK--YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAV---- 449
G F+ + K Y L DV ++ ++ AA QGIVLLKN LPL + A
Sbjct: 385 GYFDSNTTKANNPYRDLTWSDVLETDAWNISYQAATQGIVLLKNSNNVLPLTEKAYPPSN 444
Query: 450 SSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHE 509
+++A+IGP N +Q+ G Y G S E K ++A G + S +GF
Sbjct: 445 TTVALIGPWANATTQLLGNYYGNAPYMISPRAAFEEAGYKVNFAEGT-GISSTSTSGFAA 503
Query: 510 AVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTS-KRPVILVLTGG 568
A+ A+ AD +I G+D T E E DR S+ PG Q+ L+ +A ++ +P+I++ GG
Sbjct: 504 ALSAARSADVIIYAGGIDNTLEAEALDRESIAWPGNQLDLIQKLASSAGSKPLIVLQMGG 563
Query: 569 GPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPM 627
G +D S + ++ ++++LW GYPG++G AL +II G NP GRL T YP S+ + P
Sbjct: 564 GQVDSSSLKNNTNVTALLWGGYPGQSGGFALRDIITGKKNPAGRLVTTQYPASYAEEFPA 623
Query: 628 NDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSD 687
DMN+R + PG++Y++YTG VY FGHGL YT + + S T + +K +
Sbjct: 624 TDMNLRPEGDN--PGQTYKWYTGEAVYEFGHGLFYTTF-----AESSSNTTTKEVKL-NI 675
Query: 688 KNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-P 746
++IL QT L +T L F I N G ++ + M+FA P
Sbjct: 676 QDILSQTHEEL-----ASITQLPVLNFTANIK--NTGKLESDYTAMVFANTSDAGPAPYP 728
Query: 747 EKQLIGFDRVHTVAKG-SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
K L+G+DR+ V G ++E+ P E S A R+ G+ VL G +L +
Sbjct: 729 VKWLVGWDRLGDVKVGETRELRV---PVEVGSFA-----RVNEDGDWVLFPGTFELALNL 780
Query: 806 E 806
E
Sbjct: 781 E 781
>gi|3135209|dbj|BAA28267.1| beta-xylosidase A [Aspergillus oryzae]
Length = 798
Score = 415 bits (1067), Expect = e-113, Method: Compositional matrix adjust.
Identities = 259/759 (34%), Positives = 384/759 (50%), Gaps = 62/759 (8%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
C+TS RA +L+SLLT +E + ++ PR+G+PAY+ W+E+LHG+A +
Sbjct: 63 CDTSAKPHDRAAALVSLLTFEELVNNTANTGHGAPRIGLPAYQVWNEALHGVAH----AD 118
Query: 115 FN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
F+ G S TSFPQ + + A+ NR+L I + ++ + RA N G+ GL ++PNIN F
Sbjct: 119 FSDAGDFSWSTSFPQPISTMAALNRTLIHQIATIISTQGRAFMNAGRYGLDVYSPNINTF 178
Query: 173 RDPRWGRGQETPGEDPM-VVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
R P WGRGQETPGED + S YA E++ QG G+
Sbjct: 179 RHPVWGRGQETPGEDAYCLASTYAYEYITGIQG--------GV----------------- 213
Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
+ L L A KH YD+E W N SR + IT+QD + + P F K +
Sbjct: 214 -DANPLKLIATAKHYAGYDIENWDNHSRLGNDMQITQQDLAEYYTPQFLVASRDAKVHSV 272
Query: 292 MCSYNQVNGVPACLRGDLFQK-ARNEWGF--KGYITSDCDAVATIFEYQNYTKTHEDSAA 348
MCSYN VNGVP+C Q R+ + F GY++ DC AV +F Y +AA
Sbjct: 273 MCSYNAVNGVPSCSNSFFLQTLLRDTFDFVEDGYVSGDCGAVYNVFNPHGYATNESSAAA 332
Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
++AG DI+CG RH Q + +V +D++R ++ L++ +R G F+G + Y
Sbjct: 333 DSIRAGTDIDCGVSYPRHFQESFHDQEVSRQDLERGVIRLYASLIRAGYFDG--KTSPYR 390
Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPL--NKNAVSSLAIIGPLVNNISQMG 466
+ DV ++ + L+ +AA Q IVLLKND LPL ++ ++A+IGP N +QM
Sbjct: 391 NITWSDVVSTNAQNLSYEAAAQSIVLLKND-GILPLTSTSSSTKTIALIGPWANATTQML 449
Query: 467 GGYTGIP---CSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVV 523
G Y G SP + E ++ T + D S + A+ AK+AD +I
Sbjct: 450 GNYYGPAPYLISPLQAFQDSEYKITYTIGTNTTTDPDSTSQS---TALTTAKEADLIIFA 506
Query: 524 AGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQIS 583
G+D T ETE +DR ++ P Q+SL+T +A K P+I++ GGG +D S + + ++
Sbjct: 507 GGIDNTLETEAQDRSNITWPSNQLSLITKLADLGK-PLIVLQMGGGQVDSSALKNNKNVN 565
Query: 584 SILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV-PMNDMNMRADSSRQYPG 642
+++W GYPG++G +ALA+II G P RL T YP + +V P DMN+R + S PG
Sbjct: 566 ALIWGGYPGQSGGQALADIITGKRAPAARLVTTQYPAEYAEVFPAIDMNLRPNGSN--PG 623
Query: 643 RSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVH 702
++Y +YTGT VY FGHGL YTN+ T SAS +G+ L H
Sbjct: 624 QTYMWYTGTPVYEFGHGLFYTNF-----------TASASAGSGTKNRTSFNIDEVLGRPH 672
Query: 703 IDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG 762
+ + V N GD + M F P K L+GFDR+ V G
Sbjct: 673 PGYKLVEQMPLLNFTVDVKNTGDRVSDYTAMAFVNTTAGPAPHPNKWLVGFDRLSAVEPG 732
Query: 763 SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRH 801
S + + L+ ++ G R+L G + + + R
Sbjct: 733 SAKTMVIPVTVDSLARTDEEGNRVLYPGRYEVALNNERE 771
>gi|391872736|gb|EIT81831.1| beta-glucosidase-related glycosidase [Aspergillus oryzae 3.042]
Length = 798
Score = 414 bits (1065), Expect = e-113, Method: Compositional matrix adjust.
Identities = 259/759 (34%), Positives = 383/759 (50%), Gaps = 62/759 (8%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
C+TS RA +L+SLLT +E + ++ PR+G+PAY+ W+E+LHG+A +
Sbjct: 63 CDTSAKPHDRAAALVSLLTFEELVNNTANTGHGAPRIGLPAYQVWNEALHGVAH----AD 118
Query: 115 FN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
F+ G S TSFPQ + + A+ NR+L I + ++ + RA N G+ GL ++PNIN F
Sbjct: 119 FSDAGDFSWSTSFPQPISTMAALNRTLIHQIATIISTQGRAFMNAGRYGLDVYSPNINTF 178
Query: 173 RDPRWGRGQETPGEDPM-VVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
R P WGRGQETPGED + S YA E++ QG G+
Sbjct: 179 RHPVWGRGQETPGEDAYCLASTYAYEYITGIQG--------GV----------------- 213
Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
+ L L A KH YD+E W N SR + IT+QD + + P F K +
Sbjct: 214 -DANPLKLIATAKHYAGYDIENWDNHSRLGNDMQITQQDLAEYYTPQFLVASRDAKVHSV 272
Query: 292 MCSYNQVNGVPACLRGDLFQK-ARNEWGF--KGYITSDCDAVATIFEYQNYTKTHEDSAA 348
MCSYN VNGVP+C Q R+ + F GY++ DC AV +F Y +AA
Sbjct: 273 MCSYNAVNGVPSCSNSFFLQTLLRDTFDFVEDGYVSGDCGAVYNVFNPHGYATNESSAAA 332
Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
++AG DI+CG RH Q + +V +D++R + L++ +R G F+G + Y
Sbjct: 333 DSIRAGTDIDCGVSYPRHFQESFHDQEVSRQDLERGVTRLYASLIRAGYFDG--KTSPYR 390
Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPL--NKNAVSSLAIIGPLVNNISQMG 466
+ DV ++ + L+ +AA Q IVLLKND LPL ++ ++A+IGP N +QM
Sbjct: 391 NITWSDVVSTNAQNLSYEAAAQSIVLLKND-GILPLTSTSSSTKTIALIGPWANATTQML 449
Query: 467 GGYTGIP---CSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVV 523
G Y G SP + E ++ T + D S + A+ AK+AD +I
Sbjct: 450 GNYYGPAPYLISPLQAFQDSEYKITYTIGTNTTTDPDSTSQS---TALTTAKEADLIIFA 506
Query: 524 AGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQIS 583
G+D T ETE +DR ++ P Q+SL+T +A K P+I++ GGG +D S + + ++
Sbjct: 507 GGIDNTLETEAQDRSNITWPSNQLSLITKLADLGK-PLIVLQMGGGQVDSSALKNNKNVN 565
Query: 584 SILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV-PMNDMNMRADSSRQYPG 642
+++W GYPG++G +ALA+II G P RL T YP + +V P DMN+R + S PG
Sbjct: 566 ALIWGGYPGQSGGQALADIITGKRAPAARLVTTQYPAEYAEVFPAIDMNLRPNGSN--PG 623
Query: 643 RSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVH 702
++Y +YTGT VY FGHGL YTN+ T SAS +G+ L H
Sbjct: 624 QTYMWYTGTPVYEFGHGLFYTNF-----------TASASAGSGTKNRTSFNIDEVLGRPH 672
Query: 703 IDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG 762
+ + V N GD + M F P K L+GFDR+ V G
Sbjct: 673 PGYKLVEQMPLLNFTVDVKNTGDRVSDYTAMAFVNTTAGPAPHPNKWLVGFDRLSAVEPG 732
Query: 763 SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRH 801
S + + L+ ++ G R+L G + + + R
Sbjct: 733 SAKTMVIPVTVDSLARTDEEGNRVLYPGRYEVALNNERE 771
>gi|380293100|gb|AFD50200.1| beta-xylosidase [Hypocrea orientalis]
Length = 797
Score = 414 bits (1063), Expect = e-112, Method: Compositional matrix adjust.
Identities = 268/752 (35%), Positives = 397/752 (52%), Gaps = 63/752 (8%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
C++S RA++LISL TL+E I ++ +PRLG+P Y+ W+E+LHG+ N
Sbjct: 63 CDSSAGYVERAQALISLFTLEELILNTQNSGPGVPRLGLPNYQVWNEALHGLDR----AN 118
Query: 115 F---NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
F G TSFP +++ A+ NR+L I ++ +ARA N G+ GL +APN+N
Sbjct: 119 FATKGGQFEWATSFPMPILTTAALNRTLIHQIADIISTQARAFSNSGRYGLDVYAPNVNG 178
Query: 172 FRDPRWGRGQETPGEDPMVVS-AYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
FR P WGRGQETPGED +S AY E++ QG G E
Sbjct: 179 FRSPLWGRGQETPGEDAFFLSSAYTYEYITGIQG---------------------GVDPE 217
Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
+L ++A KH YDLE W N SR F+A+IT+QD + + P F + K+
Sbjct: 218 -----QLKVAATVKHFAGYDLENWNNQSRLGFDAIITQQDLSEYYTPQFLAAARYAKSRS 272
Query: 291 IMCSYNQVNGVPACLRGDLFQK-ARNEWGFK--GYITSDCDAVATIFEYQNYTKTHEDSA 347
+MCSYN VNGVP+C Q R WGF GY++SDCDAV +F +Y +A
Sbjct: 273 LMCSYNSVNGVPSCANSFFLQTLLRESWGFPEWGYVSSDCDAVYNVFNPHDYASNQSSAA 332
Query: 348 AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKY 407
A L+AG DI+CG H + G+V +I+R++ L++ +RLG F+ +K +Y
Sbjct: 333 ASSLRAGTDIDCGQTYPWHLNESFVAGEVTRGEIERSVTRLYANLVRLGYFD---KKNQY 389
Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
LG DV ++ ++ +AA +GIVLLKND LPL+K V S+A+IGP N +QM G
Sbjct: 390 RSLGWKDVVKTDAWNISYEAAVEGIVLLKNDGT-LPLSKK-VRSIALIGPWANATTQMQG 447
Query: 468 GYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLD 527
Y G S L + ++ G ++ NS AGF +A+ AKK+D ++ + G+D
Sbjct: 448 NYFGPAPYLISPLEAAKKAGYHVNFELGT-EIAGNSTAGFAKAIAAAKKSDAIVYLGGID 506
Query: 528 LTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILW 587
T E E DR + PG Q+ L+ ++ K P++++ GGG +D S +++ +++S++W
Sbjct: 507 NTIEQEGADRTDIAWPGNQLDLIKQLSEVGK-PLVVLQMGGGQVDSSSLKSNKKVNSLVW 565
Query: 588 IGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYR 646
GYPG++G AL +I+ G P GRL T YP + + P NDMN+R D + PG++Y
Sbjct: 566 GGYPGQSGGVALFDILSGKRAPAGRLITTQYPAEYVHQFPQNDMNLRPD-GKSNPGQTYI 624
Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEV 706
+YTG VY FG GL YT + S P L + T S L H
Sbjct: 625 WYTGKPVYEFGSGLFYTTFKETLASHPKCLKFN--------------TSSILSAPHPGYT 670
Query: 707 TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEKQLIGFDRVHTVAKG-SK 764
S F + ++ N+G + + MLF R P K L+GFDR+ + G S
Sbjct: 671 YSEQIPVFTFEANIKNSGKTESPYTAMLFVRTSNAGPAPYPNKWLVGFDRLADIKPGHSS 730
Query: 765 EISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
++S + P L+ + +G RI+ G + L +
Sbjct: 731 KLSIPI-PVSALARVDSYGNRIVYPGKYELAL 761
>gi|2920706|emb|CAA73902.1| beta-xylosidase [Emericella nidulans]
Length = 802
Score = 414 bits (1063), Expect = e-112, Method: Compositional matrix adjust.
Identities = 252/766 (32%), Positives = 389/766 (50%), Gaps = 58/766 (7%)
Query: 42 FP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWS 100
FP C P C+ SLS RA +L+SL T E + + + RLG+P Y+ W
Sbjct: 46 FPDCATGPLSLTPVCDRSLSPKDRATALVSLFTFDELVNNTGNTGLGVSRLGLPNYQVWG 105
Query: 101 ESLHGIASNGPGVNF--NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
E+LHG+ NF +G S TSFP + A+ N++L IG+ V+ + RA N G
Sbjct: 106 EALHGVGR----ANFVESGNFSWATSFPMPITMMAALNKTLIHQIGTIVSTQLRAFSNAG 161
Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
G+ ++PNIN FR P WGRGQETPGED + S Y E++ + QG
Sbjct: 162 LGGVDVYSPNINTFRHPVWGRGQETPGEDAFLTSVYGYEYITALQG-------------- 207
Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
E+ + + A KH YD+E W N SR + IT+Q+ + + PP
Sbjct: 208 -------AVDPETSK-----IIATAKHYAGYDIESWNNHSRLGNDMQITQQELSEYYTPP 255
Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFK--GYITSDCDAVATIFE 335
F K +MCSYN VNGVP+C Q R+ + F GY++ DC AV ++
Sbjct: 256 FIVASRDAKVRSVMCSYNAVNGVPSCANKFFLQTLLRDTFEFSEDGYVSGDCGAVYNVWN 315
Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
Y ++A + AG DI+CGT H++ A + V DI+R ++ L+S ++
Sbjct: 316 PHGYASNEAAASADSILAGTDIDCGTSYQWHSEDAFEDSLVSRSDIERGVIRLYSNLVQA 375
Query: 396 GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAII 455
G F+G+ Y + DDV +++ +A +AA +GIVLLKND+ LPL+K+ + S+A+I
Sbjct: 376 GYFDGE--DAPYRDITWDDVLSTDAWNIAYEAAVEGIVLLKNDET-LPLSKD-IKSVAVI 431
Query: 456 GPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAK 515
GP N ++ G Y G S L G HYA G ++ +S +GF EA+ AK
Sbjct: 432 GPWANVTEELQGNYFGPAPYLISPLTGFRDSGLDVHYALGT-NLTSHSTSGFEEALTAAK 490
Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
+AD +I G+D T E E DR ++ PG Q+ L++ ++ K P++++ GGG +D S
Sbjct: 491 QADAIIFAGGIDNTIEAEAMDRENITWPGNQLDLISKLSELGK-PLVVLQMGGGQVDSSS 549
Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV-PMNDMNMRA 634
+ + +++++W GYPG++G ALA+II G P GRL T YP + +V P DMN+R
Sbjct: 550 LKDNDNVNALIWGGYPGQSGGHALADIITGKRAPAGRLVTTQYPAEYAEVFPAIDMNLRP 609
Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
+ + PG++Y +YTGT VY FGHGL YT + + + AGS T
Sbjct: 610 NETSGNPGQTYMWYTGTPVYEFGHGLFYT----------TFEESTETTDAGSFNIQTVLT 659
Query: 695 GSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFD 754
Y H + T + +V N G+ + + +++ P+K ++GFD
Sbjct: 660 TPHSGYEHAQQKT-----LLNFTATVKNTGERESDYTALVYVNTTAGPAPYPKKWVVGFD 714
Query: 755 RVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELR 800
R+ + G + E ++ ++ G R+L G++ + + R
Sbjct: 715 RLGGLEPGDSQTLTVPVTVESVARTDEQGNRVLYPGSYDVALNNER 760
>gi|292495632|sp|Q0CMH8.2|XYND_ASPTN RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
Length = 793
Score = 413 bits (1061), Expect = e-112, Method: Compositional matrix adjust.
Identities = 272/768 (35%), Positives = 399/768 (51%), Gaps = 71/768 (9%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
C+ S RA +L+S+ TL+E + + + +PRLG+P Y+ WSESLHG+ N
Sbjct: 63 CDKSARPHDRAAALVSMFTLEELVNNTGNTGTGVPRLGLPKYQVWSESLHGVYR----AN 118
Query: 115 F--NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
+ G S TSFPQ +++ A+ NR+L IG ++ +ARA N+G+ GL +APNIN F
Sbjct: 119 WASEGDYSWATSFPQPILTMAALNRTLIHQIGDILSTQARAFSNVGRYGLDTYAPNINSF 178
Query: 173 RDPRWGRGQETPGEDPM-VVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
R P WGRGQETPGED + S YA E++ QG G E+
Sbjct: 179 RHPVWGRGQETPGEDAYYLASTYAYEYITGIQG---------------------GVDPET 217
Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
L L A KH YD+E W SR + IT+QD + + P F K +
Sbjct: 218 -----LKLVATAKHYAGYDIENWDGHSRLGNDMQITQQDLSEYYTPQFLVSARDAKVHSV 272
Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGF--KGYITSDCDAVATIFEYQNYTKTHEDSAA 348
MCSYN VNGVP+C Q R +GF GY++ DC AV F Y ++A
Sbjct: 273 MCSYNAVNGVPSCSNSFFLQTLLRETFGFVEDGYVSGDCGAVYNAFNPHEYAANESSASA 332
Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
++AG DI+CGT H +A D+G++ +DI+R ++ L++ +RLG F+G+ +Y
Sbjct: 333 DSIRAGTDIDCGTSYQYHFTNAFDEGEISRQDIERGVIRLYTNLVRLGYFDGN--SSQYR 390
Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
L DV T++ ++ +AA +G VLLKND LPL +++ S+A+IGP N +QM G
Sbjct: 391 DLTWSDVQTTDAWNISHEAAVEGTVLLKNDGT-LPL-ADSIRSVALIGPWANATTQMQGN 448
Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDL 528
Y G S L LEA HYA G ++ + AGF +A+ A+KAD +I G+D
Sbjct: 449 YYGPAPYLTSPLAALEASDLDVHYAFGT-NISSTTTAGFADALAAARKADAIIFAGGIDN 507
Query: 529 TQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWI 588
T E E DR+++ PG Q+ L+ ++ K P++++ GGG +D S + ++ +S++LW
Sbjct: 508 TIEGEALDRMNITWPGNQLDLINQLSALGK-PLVVLQMGGGQVDSSALKHNTNVSALLWG 566
Query: 589 GYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRF 647
GYPG++G AL +II G P GRL T YP + T+ P DM +R + + PG++Y +
Sbjct: 567 GYPGQSGGTALLDIIRGVRAPAGRLVTTQYPAGYATQFPAIDMGLRPNGTN--PGQTYMW 624
Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAP---SELTISASLKAGSDKNILQQTGSRLDYVHID 704
YTGT VY FGHGL YT + K S S I L A Q L++
Sbjct: 625 YTGTPVYEFGHGLFYTTFEAKRASTATNHSSFNIEDLLTAPHPGYAYPQLRPFLNFT--- 681
Query: 705 EVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG-S 763
+TN G + MLFA P K L+GFDR+ + G S
Sbjct: 682 -------------AHITNTGRTTSDYTAMLFANTTAGPAPHPNKWLVGFDRLGALEPGAS 728
Query: 764 KEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE-----LRHSLTIE 806
+ ++F + + ++ ++ G R+L G + L + LR +LT E
Sbjct: 729 QTMTFPIT-IDNVARTDELGNRVLYPGRYELALNNERSVVLRFTLTGE 775
>gi|336471692|gb|EGO59853.1| hypothetical protein NEUTE1DRAFT_99999 [Neurospora tetrasperma FGSC
2508]
gi|350292807|gb|EGZ74002.1| glycoside hydrolase [Neurospora tetrasperma FGSC 2509]
Length = 770
Score = 412 bits (1060), Expect = e-112, Method: Compositional matrix adjust.
Identities = 260/788 (32%), Positives = 398/788 (50%), Gaps = 62/788 (7%)
Query: 21 LIVVVNVIAFSNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQ 80
L+ +V+ + P PD C S C+ +LS RA +L++ +T +EK+Q
Sbjct: 10 LLSCSSVLVSAIDLPFQTYPD--CVNGPLASLKVCDVTLSPPQRAAALVAAMTTEEKLQN 67
Query: 81 LSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF---NGTVSSVTSFPQVLVSAASFNR 137
L + PR+G+PAY WWSE+LHG+A PG F +G ++ TSFP L+ AA+F+
Sbjct: 68 LVSKSKGAPRIGLPAYNWWSEALHGVAY-APGTQFWSGDGPFNASTSFPMPLLMAATFDD 126
Query: 138 SLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVE 197
L +G + E RA N G +G +W PN+N F+DPRWGRG ETPGED + + YA
Sbjct: 127 ELIEKVGEVIGTEGRAFGNAGFSGFDYWTPNVNPFKDPRWGRGSETPGEDILRIKRYAAS 186
Query: 198 FVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNF 257
++ QG RE+RV+ A CKH A D E W
Sbjct: 187 MIRGLQGPA------------RERRVV----------------ATCKHYAANDFEDWNGS 218
Query: 258 SRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNE 316
+R+ FNA +T QD + + PF+ C K IMCSYN VNGVPAC L Q R
Sbjct: 219 TRHDFNAKVTLQDLAEYYLSPFQQCARDSKVGSIMCSYNAVNGVPACANTYLMQTILREH 278
Query: 317 WGFKG---YITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDK 373
W + YITSDC+AV I +Y +T+ + A +AG+D +C A +
Sbjct: 279 WNWTAPGNYITSDCEAVLDISANHHYAETNAEGTALAFEAGIDSSCEYESSSDIPGAWTQ 338
Query: 374 GKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIV 433
G +++ +DRAL ++ +R+G F+G+ +Y LG DV + + +++AL AA +GIV
Sbjct: 339 GLLEQSTVDRALKRIYEGLVRVGYFDGN--HSEYASLGWKDVNSPKSQEVALQAAVEGIV 396
Query: 434 LLKNDKKF-LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHY 492
LLKNDK L L + S LA+IG N+ + GGY+G P S + +A
Sbjct: 397 LLKNDKTLPLDLRTDPKSKLAMIGFWANDPKTLSGGYSGKPAFEHSPVYAAQAMGFSVTT 456
Query: 493 ASG-CHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVT 551
A G ++D A+ AK A++++ G D + E +DR ++ P Q+ L+T
Sbjct: 457 AGGPVLQNSTSNDTWTQAALEAAKDANYILYFGGQDTSAAGETKDRTTINWPEAQLQLIT 516
Query: 552 SVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGG 611
++++ K V++ + G LD + A +++ILW + G+ G A+ +I+ G NP G
Sbjct: 517 TLSKLGKPLVVVQM--GDQLDNTPLLAAKAVNAILWANWLGQDGGTAVMQILTGLKNPAG 574
Query: 612 RLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFL 670
RLP+T YP ++T VPM DMN+R S + PGR+YR+Y T V FG GL YT + K
Sbjct: 575 RLPVTQYPANYTAAVPMTDMNLR--PSDKLPGRTYRWYP-TAVQPFGFGLHYTTFQTKIA 631
Query: 671 SAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSH 730
L I +++L + G + D T +++ VTN+G+ +
Sbjct: 632 VPLPRLAI---------QDLLSRCGGDNANAYPD-----TCALPPLKVEVTNSGNRSSDY 677
Query: 731 VVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLG 790
VV+ F + P K L+ + R+ ++ G K + ++ ++ G +L G
Sbjct: 678 VVLAFLAGDVGPKPYPIKTLVSYTRLRDLSPGHKTTAHLKWTLGDIARYDEQGNTVLYPG 737
Query: 791 NHVLMVGE 798
+ + V E
Sbjct: 738 TYTVTVDE 745
>gi|154313073|ref|XP_001555863.1| hypothetical protein BC1G_05538 [Botryotinia fuckeliana B05.10]
Length = 755
Score = 412 bits (1060), Expect = e-112, Method: Compositional matrix adjust.
Identities = 268/753 (35%), Positives = 386/753 (51%), Gaps = 82/753 (10%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
C+TS TRA +LISL TL EK+ + + +PR+G+P+YEWW+E+LHGIA + PG
Sbjct: 34 CDTSSDPYTRAAALISLFTLAEKVNNTGNTSPGVPRIGLPSYEWWNEALHGIARS-PGTT 92
Query: 115 FNGTVSSV---TSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
F T S+ TSFPQ ++ A+F+ L + + V+ EARA N+ + GL FW PNIN
Sbjct: 93 FAATGSNYSYSTSFPQPILMGATFDDELIHKVATQVSTEARAFNNVNRFGLNFWTPNINP 152
Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
++DPRWGRGQETPGEDP S+Y + QG G + KG
Sbjct: 153 YKDPRWGRGQETPGEDPFHTSSYVNALITGLQG------------GLDDLPYKKGV---- 196
Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
A CKH YDLE RY F+A+I QD D + PPF+ C +
Sbjct: 197 ---------ATCKHFAGYDLENSDGAIRYGFDAIIKSQDLRDYYLPPFQQCARDSNVQSV 247
Query: 292 MCSYNQVNGVPACLRGDLFQK-ARNEWGF---KGYITSDCDAVATIFEYQNYTKTHEDSA 347
MCSYN +NGVP C L Q R WG+ ++TSDCDAV I++Y NYT T E SA
Sbjct: 248 MCSYNAMNGVPTCADDWLLQTLLREHWGWTEEDQWVTSDCDAVKNIWDYHNYTLTPEQSA 307
Query: 348 AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKY 407
A L AG D++CGT + SA D+G +DR+L ++ +RLG F+ P Y
Sbjct: 308 ADALNAGTDLDCGTFWPTYLGSAYDQGLYDISTLDRSLARRYASLVRLGYFD-PPSVQPY 366
Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
+L D+V T ++LAL AA GIVLLKND LPL+ N ++++A+IGPL N QM G
Sbjct: 367 RQLNWDNVSTPAAQQLALQAAEDGIVLLKND-GILPLSSN-ITNVALIGPLANATKQMQG 424
Query: 468 GYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLD 527
Y G +S L + K Y G D+ + F A+ A+ AD VI V G+D
Sbjct: 425 NYYGTAPYLRSPLIAAQNAGFKVTYVQGA-DIDSQNTTDFSAAISAAQSADLVIYVGGID 483
Query: 528 LTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILW 587
+ E E+ +L L+ S G +D S +++ ++++LW
Sbjct: 484 NSIEAEE------ILANLSTPLIISQM-------------GCMIDSSSLLSNTGVNALLW 524
Query: 588 IGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYR 646
GYPG+ G A+ I+ G P GRLP+T YP ++ +V M DMN++ SR PGR+Y+
Sbjct: 525 AGYPGQDGGTAIFNILTGKTAPAGRLPITQYPSNYVNQVTMTDMNLQP--SRFNPGRTYK 582
Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEV 706
+Y G V+ +G+GL YT T A + S N + + + + ++
Sbjct: 583 WYNGEPVFEYGYGLQYT-------------TFDAKITPSSPNNTFEISELLANASNYKDL 629
Query: 707 TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSK-- 764
T + I+V+N G +V + F P+K L+ + R+H + G+
Sbjct: 630 TPFV----KIPITVSNTGTTTSDYVALFFLSGTFGPAPHPKKSLVAYTRLHDITGGANAT 685
Query: 765 -EISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
E+S + L+ N +G IL G++ ++V
Sbjct: 686 AEVSLNL---ASLARGNWNGDLILYPGDYKVVV 715
>gi|343172466|gb|AEL98937.1| beta-xylosidase, partial [Silene latifolia]
gi|343172468|gb|AEL98938.1| beta-xylosidase, partial [Silene latifolia]
Length = 374
Score = 412 bits (1058), Expect = e-112, Method: Compositional matrix adjust.
Identities = 191/399 (47%), Positives = 258/399 (64%), Gaps = 28/399 (7%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLG+ YEWWSE+LHG+++ GPG F G + TSFPQV+ +AASFN SLW IG AV+
Sbjct: 1 RLGLQGYEWWSEALHGVSNVGPGTKFQGAFPAATSFPQVITTAASFNASLWQAIGQAVSD 60
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
EARAMYN G AGLT+W+PN+NIFRDPRWGRGQETPGEDP + + YA +V QG
Sbjct: 61 EARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPTLSAQYAASYVTGLQG----- 115
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
+ G+ L ++ACCKH AYDL+ W R+ FNA +++Q
Sbjct: 116 ----------------------NYGNRLKVAACCKHYTAYDLDNWNGMDRFHFNAKVSKQ 153
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCD 328
D EDT+ PF++C+ +GK + +MCSYNQVNG P C D+ + R +W GYI SDCD
Sbjct: 154 DLEDTYNVPFKACVLEGKVASVMCSYNQVNGKPTCADPDILRNTIRGQWHLNGYIVSDCD 213
Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
+V +++ Q+YT+T E++AA + AG+D++CG + HT+ AI +G V E +++AL N
Sbjct: 214 SVGVLYDDQHYTRTPEEAAADTINAGLDLDCGPFLAVHTEGAIRQGLVTEAAVNQALANT 273
Query: 389 FSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNA 448
+VQ+RLG+F+G+P +G LGP DVCT H+ LAL AAR+GIVLLKN LPL+
Sbjct: 274 ITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQDLALQAAREGIVLLKNQVGSLPLSTVR 333
Query: 449 VSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV 487
++A+IGP + M G Y GI C S L+G+ Y
Sbjct: 334 HRNIAVIGPNAQATTTMIGNYAGIACGYTSPLQGISRYA 372
>gi|421077748|ref|ZP_15538711.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
JBW45]
gi|392524151|gb|EIW47314.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
JBW45]
Length = 750
Score = 412 bits (1058), Expect = e-112, Method: Compositional matrix adjust.
Identities = 258/773 (33%), Positives = 393/773 (50%), Gaps = 112/773 (14%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
+ + + +LS RAK L+S +TL+EK+ Q+ + AIPRLG+PAY WWSE+LHG+A G
Sbjct: 30 FDYQDETLSFEQRAKDLVSRMTLEEKVTQMVYISPAIPRLGVPAYNWWSEALHGVARAG- 88
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ--------AGLT 163
T FPQ + AA+F+ L ++ +++E RA ++ Q GLT
Sbjct: 89 ---------VATVFPQAIGLAATFDEKLIHDVAEVISIEGRAKFHEFQRKGDHGIYKGLT 139
Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
FW+PN+NIFRDPRWGRGQET GEDP + V F+K QG++ K+
Sbjct: 140 FWSPNVNIFRDPRWGRGQETFGEDPYLTGRLGVSFIKGLQGQD--------------KKY 185
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
L+ +AC KH + + R+SF+A+++ +D +T+ P F+ C+
Sbjct: 186 LRA-------------AACAKH---FAVHSGPESERHSFDAVVSPKDLRETYLPAFKECV 229
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
++ +M +YN+VNG P C L ++ R EWGF G++ SDC A+ E T +
Sbjct: 230 KEANVEAVMGAYNRVNGEPCCGSNMLLKETLRQEWGFTGHVVSDCWAIKDFHENHRVTSS 289
Query: 343 HEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDP 402
+S A L G D+NCG M + A +G V E+ I+ A+ L +++LGLF+
Sbjct: 290 APESVALALNNGCDLNCGN-MYLNLLIAYQEGLVTEEAINTAVTRLMLTRMKLGLFD-TA 347
Query: 403 RKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNI 462
Y +G EH++ AL+ +++ +VLLKN+ LPL++N +SS+A+IGP N+
Sbjct: 348 ENVPYTNIGFHQNDCQEHREFALEVSKKTLVLLKNENNLLPLDRNTISSIAVIGPNANSR 407
Query: 463 SQMGGGYTGIPCSPKSLLRGLEAYVSK---THYASGCH---DVPCNSDAG---FHEAVRI 513
+ G Y G + ++L G+ V K YA GCH D N F EAV
Sbjct: 408 EALTGNYCGTASNYITVLEGIREAVGKDTIVSYAQGCHLYRDKAENLGEARDRFAEAVST 467
Query: 514 AKKADFVIVVAGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILV 564
A++AD V++ GLD + E E+ D++ L LPG Q L+ + +T K P+ILV
Sbjct: 468 AERADIVVMCMGLDASIEGEEGDVSNEYASGDKLGLNLPGLQQELLEVIYQTGK-PIILV 526
Query: 565 LTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK 624
L G L V++A ++ +I+ YPG G KALA IFG+++P G+LP+T+Y +
Sbjct: 527 LLAGSALAVTWAA--EKVPAIIQAWYPGAEGGKALASAIFGEYSPVGKLPITFYRTTEEL 584
Query: 625 VPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKA 684
D +M+ R+YR+ T +Y FG+GL YT ++Y+ L + ISA
Sbjct: 585 PEFTDYSMK--------NRTYRYMTKEALYPFGYGLGYTTFAYRQLQL-NRTQISAG--- 632
Query: 685 GSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQG 744
E C+ L V N G+ V L+ + K S
Sbjct: 633 --------------------ENVQCSVL-------VKNTGNFASDETVQLYIKDVKASVE 665
Query: 745 TPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
P +L G +VH + +E+ F + P QL++ N+ G IL G + VG
Sbjct: 666 VPILELQGIQKVHLLPGTEQEVFFTLTP-RQLALINEEGNCILEPGAFEIYVG 717
>gi|421060771|ref|ZP_15523202.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
B3]
gi|421065248|ref|ZP_15527033.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
A12]
gi|421073214|ref|ZP_15534285.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
A11]
gi|392444242|gb|EIW21677.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
A11]
gi|392454445|gb|EIW31278.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
B3]
gi|392459366|gb|EIW35779.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
A12]
Length = 724
Score = 411 bits (1057), Expect = e-112, Method: Compositional matrix adjust.
Identities = 258/777 (33%), Positives = 397/777 (51%), Gaps = 114/777 (14%)
Query: 49 FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIAS 108
+ + + + +LS RAK L+S +TL+EK+ Q+ + AIPRLG+PAY WWSE+LHG+A
Sbjct: 1 MEIFAYQDETLSFEQRAKDLVSRMTLEEKVTQMVYISPAIPRLGVPAYNWWSEALHGVAR 60
Query: 109 NGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ--------A 160
G T FPQ + AA+F+ L N+ +++E RA ++ Q
Sbjct: 61 AG----------VATVFPQAIGLAATFDEKLIFNVAEVISIEGRAKFHEFQRKGDHGIYK 110
Query: 161 GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFRE 220
GLTFW+PN+NIFRDPRWGRGQET GEDP + V F+K QG++
Sbjct: 111 GLTFWSPNVNIFRDPRWGRGQETFGEDPYLTGRLGVSFIKGLQGQD-------------- 156
Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFR 280
K+ L+ +AC KH + + R+SF+A+++ +D +T+ P F+
Sbjct: 157 KKYLRA-------------AACAKH---FAVHSGPESERHSFDAVVSPKDLRETYLPAFK 200
Query: 281 SCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNY 339
C+++ +M +YN+VNG P C L ++ R EWGF G++ SDC A+ E
Sbjct: 201 ECVKEANVEAVMGAYNRVNGEPCCGSNMLLKETLRREWGFTGHVVSDCWAIKDFHENHRV 260
Query: 340 TKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
T + +S A L G D+NCG M + A +G V E+ I+ A+ L +++LGLF+
Sbjct: 261 TSSAPESVAMALNNGCDLNCGN-MYLNLLIAYQEGLVTEEAINTAVTRLMLTRMKLGLFD 319
Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
Y K+G EH++ AL+ +++ +VLLKN+ LPL++N +SS+A+IGP
Sbjct: 320 -TAENVPYTKIGFHQNDCQEHREFALEVSKKTLVLLKNENNLLPLDRNTISSIAVIGPNA 378
Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSK---THYASGCH---DVPCNSDAG---FHEA 510
N+ + G Y G + ++L G+ V K YA GCH D N F EA
Sbjct: 379 NSREALTGNYCGTASNYITVLEGIREAVGKDTMVSYAQGCHLYRDKAENLGEARDRFAEA 438
Query: 511 VRIAKKADFVIVVAGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPV 561
V A++AD V++ GLD + E E+ D++ L LPG Q L+ + +T K P+
Sbjct: 439 VSTAERADIVVMCMGLDASIEGEEGDVSNEYASGDKLGLNLPGLQQELLEVIYQTGK-PI 497
Query: 562 ILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPES 621
ILVL G L V++A +I +I+ YPG G KALA IFG+++P G+LP+T+Y +
Sbjct: 498 ILVLLAGSALAVTWAA--EKIPAIIQAWYPGAEGGKALASAIFGEYSPVGKLPITFYRTT 555
Query: 622 FTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSA-PSELTISA 680
D +M+ R+YR+ T +Y FG+GL YT ++Y+ L +++++
Sbjct: 556 EELPEFTDYSMK--------NRTYRYMTKEALYPFGYGLGYTTFAYRQLQLNRTQISVGE 607
Query: 681 SLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPK 740
+++ GS L V N G+ V L+ + K
Sbjct: 608 NVQ-----------GSVL---------------------VKNTGNFASDETVQLYIKDVK 635
Query: 741 VSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
S P L G +VH + +E+ F + P QL++ N+ G IL G + VG
Sbjct: 636 ASVEVPIWALQGIQKVHLLPGTEQEVFFTLTP-RQLALINEEGNCILEPGVFEIYVG 691
>gi|23304843|emb|CAD48309.1| beta-xylosidase B [Clostridium stercorarium]
Length = 715
Score = 411 bits (1056), Expect = e-112, Method: Compositional matrix adjust.
Identities = 264/773 (34%), Positives = 397/773 (51%), Gaps = 117/773 (15%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ + S S RAK L+S +T++EK+ Q+ N+ AI RLGIPAY WW+E+LHG+A G
Sbjct: 7 YLDPSYSFEERAKDLVSRMTIEEKVSQMLYNSPAIERLGIPAYNWWNEALHGVARAG--- 63
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
+ T FPQ + AA+F+ L + ++ E RA Y+ GLTFW
Sbjct: 64 -------TATMFPQAIGMAATFDEELIYKVADVISTEGRAKYHASSKKGDRGIYKGLTFW 116
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
+PNINIFRDPRWGRGQET GEDP + + V FVK QG + + LK
Sbjct: 117 SPNINIFRDPRWGRGQETYGEDPYLTARLGVAFVKGLQGNH--------------PKYLK 162
Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
G CK+++ + + R+ FNA+++++D +T+ P F++ +++
Sbjct: 163 AGG-------------MCKNILPFTVVP--ESLRHEFNAVVSKKDLYETYLPAFKALVQE 207
Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
K +M +YN+ NG P C L R EWGFKG++ SDC A+ + + T T
Sbjct: 208 AKVESVMGAYNRTNGEPCCGSKTLLSDILRGEWGFKGHVVSDCWAIRDFHMHHHVTATAP 267
Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPR- 403
+SAA ++ G D+NCG M + A+ +G + E++IDRA+ L +++LG+F DP
Sbjct: 268 ESAALAVRNGCDLNCGN-MFGNLLIALKEGLITEEEIDRAVTRLMITRMKLGMF--DPED 324
Query: 404 KGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNIS 463
+ Y + C EH++LALD A++ IVLLKND LPL++ + S+A+IGP ++
Sbjct: 325 QVPYASISSFVDC-KEHRELALDVAKKSIVLLKND-GLLPLDRKKIRSIAVIGPNADSRQ 382
Query: 464 QMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCH----DVPCNSDAG--FHEAVRIA 514
+ G Y G ++L G+ + +Y+ GCH V + G EAV A
Sbjct: 383 ALIGNYEGTASEYVTVLDGIREMAGDDVRIYYSVGCHLYKDRVENLGEPGDRIAEAVTCA 442
Query: 515 KKADFVIVVAGLDLTQETEDR---------DRVSLLLPGQQMSLVTSVARTSKRPVILVL 565
+ AD VI+ GLD T E E+ D+ L LPGQQ L+ +V T K P++LVL
Sbjct: 443 EHADVVIMCLGLDSTIEGEEMHESNIYGSGDKPDLNLPGQQQELLEAVYATGK-PIVLVL 501
Query: 566 TGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV 625
G L V++ AD I +IL YPG G +A+A ++FG+ NP G+LP+T+Y +
Sbjct: 502 LTGSALAVTW--ADEHIPAILNAWYPGALGGRAIASVLFGETNPSGKLPVTFYRTTEELP 559
Query: 626 PMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISA-SLKA 684
D +M R+YRF +Y FG GLSYT + Y S+L +S +++A
Sbjct: 560 DFTDYSME--------NRTYRFMKNEALYPFGFGLSYTTFDY------SDLKLSKDTIRA 605
Query: 685 GSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQG 744
G F+V + VTN G + G VV ++ + + S
Sbjct: 606 GEG--------------------------FNVSVKVTNTGKMAGEEVVQVYIKDLEASWR 639
Query: 745 TPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
P QL G RV + + EI+F + P EQL++ G+ ++ G + VG
Sbjct: 640 VPNWQLSGMKRVRLESGETAEITFEIRP-EQLAVVTDEGKSVIEPGEFEIYVG 691
>gi|115397385|ref|XP_001214284.1| hypothetical protein ATEG_05106 [Aspergillus terreus NIH2624]
gi|114192475|gb|EAU34175.1| hypothetical protein ATEG_05106 [Aspergillus terreus NIH2624]
Length = 776
Score = 410 bits (1055), Expect = e-111, Method: Compositional matrix adjust.
Identities = 267/751 (35%), Positives = 391/751 (52%), Gaps = 66/751 (8%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
C+ S RA +L+S+ TL+E + + + +PRLG+P Y+ WSESLHG+ N
Sbjct: 81 CDKSARPHDRAAALVSMFTLEELVNNTGNTGTGVPRLGLPKYQVWSESLHGVYR----AN 136
Query: 115 F--NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
+ G S TSFPQ +++ A+ NR+L IG ++ +ARA N+G+ GL +APNIN F
Sbjct: 137 WASEGDYSWATSFPQPILTMAALNRTLIHQIGDILSTQARAFSNVGRYGLDTYAPNINSF 196
Query: 173 RDPRWGRGQETPGEDPM-VVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
R P WGRGQETPGED + S YA E++ QG G E+
Sbjct: 197 RHPVWGRGQETPGEDAYYLASTYAYEYITGIQG---------------------GVDPET 235
Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
L L A KH YD+E W SR + IT+QD + + P F K +
Sbjct: 236 -----LKLVATAKHYAGYDIENWDGHSRLGNDMQITQQDLSEYYTPQFLVSARDAKVHSV 290
Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGF--KGYITSDCDAVATIFEYQNYTKTHEDSAA 348
MCSYN VNGVP+C Q R +GF GY++ DC AV F Y ++A
Sbjct: 291 MCSYNAVNGVPSCSNSFFLQTLLRETFGFVEDGYVSGDCGAVYNAFNPHEYAANESSASA 350
Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
++AG DI+CGT H +A D+G++ +DI+R ++ L++ +RLG F+G+ +Y
Sbjct: 351 DSIRAGTDIDCGTSYQYHFTNAFDEGEISRQDIERGVIRLYTNLVRLGYFDGN--SSQYR 408
Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
L DV T++ ++ +AA +G VLLKND LPL +++ S+A+IGP N +QM G
Sbjct: 409 DLTWSDVQTTDAWNISHEAAVEGTVLLKNDGT-LPL-ADSIRSVALIGPWANATTQMQGN 466
Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDL 528
Y G S L LEA HYA G ++ + AGF +A+ A+KAD +I G+D
Sbjct: 467 YYGPAPYLTSPLAALEASDLDVHYAFGT-NISSTTTAGFADALAAARKADAIIFAGGIDN 525
Query: 529 TQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWI 588
T E E DR+++ PG Q+ L+ ++ K P++++ GGG +D S + ++ +S++LW
Sbjct: 526 TIEGEALDRMNITWPGNQLDLINQLSALGK-PLVVLQMGGGQVDSSALKHNTNVSALLWG 584
Query: 589 GYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRF 647
GYPG++G AL +II G P GRL T YP + T+ P DM +R + + PG++Y +
Sbjct: 585 GYPGQSGGTALLDIIRGVRAPAGRLVTTQYPAGYATQFPAIDMGLRPNGTN--PGQTYMW 642
Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAP---SELTISASLKAGSDKNILQQTGSRLDYVHID 704
YTGT VY FGHGL YT + K S S I L A Q L++
Sbjct: 643 YTGTPVYEFGHGLFYTTFEAKRASTATNHSSFNIEDLLTAPHPGYAYPQLRPFLNFT--- 699
Query: 705 EVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG-S 763
+TN G + MLFA P K L+GFDR+ + G S
Sbjct: 700 -------------AHITNTGRTTSDYTAMLFANTTAGPAPHPNKWLVGFDRLGALEPGAS 746
Query: 764 KEISFGVDPCEQLSIANKHGRRILPLGNHVL 794
+ ++F + + ++ ++ G R+L G L
Sbjct: 747 QTMTFPIT-IDNVARTDELGNRVLYPGRRCL 776
>gi|358365439|dbj|GAA82061.1| beta-xylosidase [Aspergillus kawachii IFO 4308]
Length = 788
Score = 410 bits (1054), Expect = e-111, Method: Compositional matrix adjust.
Identities = 266/759 (35%), Positives = 388/759 (51%), Gaps = 69/759 (9%)
Query: 63 TRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFN--GTVS 120
T SLISL TL E I + + RLG+PAY+ WSE+LHG+ NF+ G+ +
Sbjct: 61 TEQHSLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHGLDR----ANFSDSGSYN 116
Query: 121 SVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRG 180
TSFPQ +++ A+ NR+L I S ++ + RA N G+ GL +APNIN FR P WGRG
Sbjct: 117 WATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGLDVYAPNINTFRHPVWGRG 176
Query: 181 QETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLS 240
QETPGED + + YA E++ QG D L L+
Sbjct: 177 QETPGEDVSLAAVYAYEYITGIQG--------------------------PDPDSNLKLA 210
Query: 241 ACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNG 300
A KH YD+E W N SR + IT+QD + + P F K +MC+YN VNG
Sbjct: 211 ATAKHYAGYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVHSVMCAYNAVNG 270
Query: 301 VPACLRGDLFQK-ARNEWGF--KGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDI 357
VPAC Q R+ +GF GY++SDCDA I+ Y + +AA + AG DI
Sbjct: 271 VPACADSYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYASSQAAAAAEAILAGTDI 330
Query: 358 NCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK--YGKLGPDDV 415
+CGT H +I G + DI++ ++ L++ ++ G F+ + K Y L DV
Sbjct: 331 DCGTTYQWHLNESITAGDLSRDDIEKGVIRLYTTLVQAGYFDSNTTKANNPYRDLTWSDV 390
Query: 416 CTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAV----SSLAIIGPLVNNISQMGGGYTG 471
++ ++ AA QGIVLLKN LPL + A +++A+IGP N +Q+ G Y G
Sbjct: 391 LETDAWNISYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYG 450
Query: 472 IPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQE 531
S E K ++A G + S +GF A+ A+ AD +I G+D T E
Sbjct: 451 NAPYMISPRAAFEEAGYKVNFAEGT-GISSTSTSGFAAALSAARSADVIIYAGGIDNTLE 509
Query: 532 TEDRDRVSLLLPGQQMSLVTSVARTS-KRPVILVLTGGGPLDVSFAEADSQISSILWIGY 590
E DR S+ PG Q+ L+ +A ++ +P+I++ GGG +D S + ++ ++++LW GY
Sbjct: 510 AEALDRESIAWPGNQLDLIQKLASSAGSKPLIVLQMGGGQVDSSSLKNNTNVTALLWGGY 569
Query: 591 PGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRFYT 649
PG++G AL +II G NP GRL T YP S+ + P DMN+R + PG++Y++YT
Sbjct: 570 PGQSGGFALRDIITGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGDN--PGQTYKWYT 627
Query: 650 GTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSC 709
G VY FGHGL YT + + S T + +K + ++IL QT L +T
Sbjct: 628 GEAVYEFGHGLFYTTF-----AESSSNTTTKEVKL-NIQDILSQTHEEL-----ASITQL 676
Query: 710 TSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEKQLIGFDRVHTVAKG-SKEIS 767
L F I N G ++ + M+FA P K L+G+DR+ V G ++E+
Sbjct: 677 PVLNFTANIK--NTGKLESDYTAMVFANTSDAGPAPYPVKWLVGWDRLGDVKVGETRELR 734
Query: 768 FGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
P E S A R+ G+ VL G +L +E
Sbjct: 735 V---PVEVGSFA-----RVNEDGDWVLFPGTFELALNLE 765
>gi|358393086|gb|EHK42487.1| glycoside hydrolase family 3 protein [Trichoderma atroviride IMI
206040]
Length = 794
Score = 410 bits (1053), Expect = e-111, Method: Compositional matrix adjust.
Identities = 267/752 (35%), Positives = 398/752 (52%), Gaps = 64/752 (8%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
C+++ RA++LISL TL+E I ++ +PRLG+P Y+ W+E+LHG+ N
Sbjct: 64 CDSTAGYVERAQALISLFTLEELILNTQNSGPGVPRLGLPNYQVWNEALHGLDR----AN 119
Query: 115 F---NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
F G TSFP ++S A+ NR+L I ++ +ARA N G+ GL +APNIN
Sbjct: 120 FATKGGEFEWGTSFPMPILSMAALNRTLIHQIADIISTQARAFSNNGRYGLDVYAPNING 179
Query: 172 FRDPRWGRGQETPGEDPMVV-SAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
FR P WGRGQETPGED V+ SAY E++ QG G E
Sbjct: 180 FRSPLWGRGQETPGEDANVLTSAYTYEYITGMQG---------------------GVDPE 218
Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
+ L ++A KH YDLE + N SR F+A+IT+QD + + P F + K+
Sbjct: 219 N-----LKIAATAKHFAGYDLENYNNQSRLGFDAIITQQDLSEYYTPQFLAASRYAKSHS 273
Query: 291 IMCSYNQVNGVPACLRGDLFQK-ARNEWGFK--GYITSDCDAVATIFEYQNYTKTHEDSA 347
MC+YN VNGVP+C Q R WGF GY++SDCDA+ ++ NY + +A
Sbjct: 274 FMCAYNSVNGVPSCSNSFFLQTLLRESWGFPEYGYVSSDCDAIYNVWNPHNYANSQSSAA 333
Query: 348 AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKY 407
A LKAG DI+CG H + G V +I+R++ L++ +RLG F+ +K +Y
Sbjct: 334 ADSLKAGTDIDCGQTYPWHLNESFVAGTVSRGEIERSVTRLYANLVRLGYFD---KKNEY 390
Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
LG DV ++ ++ +AA +GIVLLKND LPL+K V S+A+IGP VN Q+ G
Sbjct: 391 RSLGWKDVVKTDAWNISYEAAVEGIVLLKNDGT-LPLSKK-VRSIALIGPWVNATEQLQG 448
Query: 468 GYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLD 527
Y G S L+ + + +Y G + + AGF +A+ AKK+D +I + G+D
Sbjct: 449 NYFGTAPYLISPLQAAKKAGYEVNYELGT-GINNQTTAGFAKAIAAAKKSDAIIFIGGID 507
Query: 528 LTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILW 587
T E E DR + PG Q+ L+ ++ K P++++ GGG +D S +++ +++S++W
Sbjct: 508 NTIEQEGADRTDIAWPGNQLDLIKQLSEVGK-PLVVLQMGGGQVDSSSIKSNKKVNSLVW 566
Query: 588 IGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYR 646
GYPG++G AL +I+ G P GRL T YP + + NDMN+R D + PG++Y
Sbjct: 567 GGYPGQSGGYALFDILSGKRAPAGRLVSTQYPAEYVHQFAQNDMNLRPDGKKN-PGQTYI 625
Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEV 706
+YTG VY FG GL YT + + L S L +AS G+ H
Sbjct: 626 WYTGKPVYQFGDGLFYTTFK-ETLGKQSTLKFNASQILGAG--------------HPGYT 670
Query: 707 TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVS-QGTPEKQLIGFDRVHTVAKG-SK 764
S + F ++ N+G + M F R + P K L+GFDR+ T+ G S
Sbjct: 671 YSEQTPVFTFTANIQNSGKTASPYSAMAFVRTSNAGPKPYPNKWLVGFDRLATIKPGHSS 730
Query: 765 EISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
+S + P LS + +G +I+ G + L++
Sbjct: 731 TLSIPI-PLNALSRVDSNGNKIVYPGKYELVL 761
>gi|323447708|gb|EGB03620.1| hypothetical protein AURANDRAFT_72703 [Aureococcus anophagefferens]
Length = 744
Score = 409 bits (1051), Expect = e-111, Method: Compositional matrix adjust.
Identities = 253/756 (33%), Positives = 376/756 (49%), Gaps = 119/756 (15%)
Query: 43 PCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSES 102
PC F++ PFC+ +L+I RA +S +T+ EKI L I LG+PAY WWSE+
Sbjct: 28 PCLNATFEALPFCDATLAIDLRAADAVSRMTIPEKIDALDTKTGPIASLGLPAYNWWSEA 87
Query: 103 LHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGL 162
G+ + P T F + +A SFNR+LW G+A+ EARA+ N G A
Sbjct: 88 SSGVMGSRP----------TTKFAYPVTTAMSFNRTLWRATGAAIGREARALMNAGAAYS 137
Query: 163 TFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKR 222
T+WAP +N+ R+PRWGR E PGEDP + YA EFV FQ
Sbjct: 138 TYWAPVVNLAREPRWGRNIEVPGEDPYLTGEYATEFVGGFQAA----------------- 180
Query: 223 VLKGFGEESDRGDELMLSACCKHLIAYDLEKW-----GNFSRYSFNAMITEQDTEDTFQP 277
+ L SACCKH +A +LE + R ++ +T++D D++
Sbjct: 181 --------PEDPYHLQASACCKHYVANELENTRQPDGEQWDRQHVDSNVTQRDLVDSYMV 232
Query: 278 PFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFEY 336
PF++C+E+GK S +MCSYN VNGVP+C L + AR+ W F GYITSDCDA + +++
Sbjct: 233 PFQACVEKGKVSSLMCSYNAVNGVPSCANDWLLRTVARDAWHFDGYITSDCDADSNVYDA 292
Query: 337 QNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
+Y T E++ A VLKAG D++C + + +H +SA+DKG + E D+D L+NLF V+LRLG
Sbjct: 293 HHYAATPEEAVADVLKAGTDVDCQSFVGQHARSALDKGLITEADMDARLVNLFKVRLRLG 352
Query: 397 LFN-----GDPRKGKYGKLGPDD-VCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVS 450
F+ PR G ++ D VC+ H +++ Q LLKND LPL + +
Sbjct: 353 HFDLSFDAAKPR-GPLDEIDADAVVCSDAHLDASMEGLAQSATLLKNDGA-LPLKPSGTA 410
Query: 451 SLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEA 510
A++GP A +SK +DAG++
Sbjct: 411 --AVVGP--------------------------NALLSK-------------ADAGYYGP 429
Query: 511 VRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGP 570
AD V++ G DLT E +D S++ Q+ L+ +VA S PV++V+ P
Sbjct: 430 T---DAADAVVLAVGTDLTWAAEGKDATSIVFTAAQLELIDAVATASATPVVVVVFSATP 486
Query: 571 LDVS--FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPM 627
LD++ A +D ++ +++ +G P K L ++++G + GR T YP ++ ++ +
Sbjct: 487 LDLTPLLARSDGKVGAVVHVGQP-SVTVKGLGDLLYGRRSFAGRAVQTVYPAAYADQISI 545
Query: 628 NDMNMRADSSR----------------QYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLS 671
D NMR S PGR+YRFY V FG GLSYT ++Y S
Sbjct: 546 FDFNMRPGPSAFARPDCATNESACPRGTNPGRTYRFYVDEPVVPFGFGLSYTTFAYAVRS 605
Query: 672 APSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHV 731
AP+ + ++ A + + G D+ + T + VTN GD+D V
Sbjct: 606 APTTVDLAPLRAAYAGVAAARGDGGPAFLSLHDDAAAAT-----YAVDVTNTGDIDADDV 660
Query: 732 VMLFARVPKVS-QGTPEKQLIGFDRVHTVAKGSKEI 766
V+ F P G P K+L GF+RVH A +K +
Sbjct: 661 VLGFVTPPGAGVDGVPLKELFGFERVHVKAGETKTV 696
>gi|343428088|emb|CBQ71612.1| related to Beta-xylosidase [Sporisorium reilianum SRZ2]
Length = 698
Score = 409 bits (1050), Expect = e-111, Method: Compositional matrix adjust.
Identities = 244/659 (37%), Positives = 353/659 (53%), Gaps = 51/659 (7%)
Query: 42 FP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWS 100
FP C + P C+TSL TRA SL++ T E I ++A +PRLGIP Y+WW+
Sbjct: 21 FPNCSTLPLSTLPVCDTSLDFYTRATSLVAQFTTAELINNTVNHAPGVPRLGIPQYQWWT 80
Query: 101 ESLHGIASNGPGVNFN----GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN 156
E+LHG+A + PGVNFN G TSFPQV+ A+F+ +L+ + + +A E RA N
Sbjct: 81 EALHGVARS-PGVNFNPDAAGEFGCATSFPQVINLGATFDDALYEAVAAHIANETRAFSN 139
Query: 157 LGQAGLTFWAP-NINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
G+AGL ++P NIN FRDPRWGRGQET GEDP+ +S YAV V+ QG
Sbjct: 140 AGRAGLNMYSPLNINAFRDPRWGRGQETVGEDPLHLSRYAVRVVRGLQGP---------- 189
Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTF 275
+ + L L+A CKH +AYDLE RY F+A+++ QD D
Sbjct: 190 ------------AAQDEANPRLTLAATCKHYLAYDLEASAGVERYQFDALVSNQDLADLH 237
Query: 276 QPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGF---KGYITSDCDAVA 331
P FR+C+ G A+ +M SYN VNGVP + AR+ WG Y+TSDCDAVA
Sbjct: 238 LPQFRACVRDGGATTLMTSYNAVNGVPPSASKYYLETLARDTWGLDKHHNYVTSDCDAVA 297
Query: 332 TIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSV 391
+++ +Y + +AA L AG D++CG +A+ + I RA+ ++
Sbjct: 298 NVYDAHHYAADYVHAAAASLNAGTDLDCGATYRDSLAAALAQNLTDVATIRRAVTRMYGS 357
Query: 392 QLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSS 451
+RLG F+ + +LG DV +KLA +AA I LLKN + LPL + A +
Sbjct: 358 LVRLGYFDAAEAQ-PLRQLGWKDVNAPAAQKLAYEAAAASITLLKNRQSTLPLRETAGKT 416
Query: 452 LAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCN--SDAGFHE 509
+A+IGP N + G Y G P L+ + + + S H V N S AG ++
Sbjct: 417 IALIGPYTNATFALRGNYAG----PSPLV--ITPFDAARRTFSDAHIVSANGTSIAGPYD 470
Query: 510 ------AVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVIL 563
A+ AK AD ++ G+D T E E DR + P Q+ L+ +A K +++
Sbjct: 471 TATASAALATAKSADIIVYAGGIDPTVEGESLDRRDIAWPANQLRLIQELAALGKV-LVV 529
Query: 564 VLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT 623
V GGG +D + + D + +++W GYPG++GA AL +I+ G P GRLP+T YP ++T
Sbjct: 530 VQFGGGQVDGALLKGDDGVGALVWAGYPGQSGALALMDILAGKRAPAGRLPITQYPANYT 589
Query: 624 KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASL 682
+ + M + YPGR+Y++YTGT + FG GL YT + ++ P+ TI L
Sbjct: 590 HA-LRETTMALRPTATYPGRTYKWYTGTPTFPFGFGLHYTTFRAS-IAPPATYTIPPPL 646
>gi|358385386|gb|EHK22983.1| glycoside hydrolase family 3 protein [Trichoderma virens Gv29-8]
Length = 795
Score = 409 bits (1050), Expect = e-111, Method: Compositional matrix adjust.
Identities = 268/752 (35%), Positives = 395/752 (52%), Gaps = 63/752 (8%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
C++S + RA++LISL TL+E I ++ +PRLG+P Y+ W+E+LHG+ N
Sbjct: 63 CDSSAGYAERAQALISLFTLEELILNTQNSGPGVPRLGLPNYQVWNEALHGLDR----AN 118
Query: 115 F---NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
F G TSFP ++S A+ NR+L I ++ +ARA N G+ GL +APNIN
Sbjct: 119 FATKGGQFQWATSFPMPILSMAALNRTLIHQIADIISTQARAFSNSGRYGLDVYAPNING 178
Query: 172 FRDPRWGRGQETPGEDPMVV-SAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
FR P WGRGQETPGED V+ SAY E++ QG G E
Sbjct: 179 FRSPLWGRGQETPGEDANVLTSAYTYEYITGMQG---------------------GVDPE 217
Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
+ L ++A KH YDLE W N SR F+A+IT+QD + + P F + K+
Sbjct: 218 N-----LKIAATAKHFAGYDLENWNNQSRLGFDAIITQQDLSEYYTPQFLAASRYAKSHS 272
Query: 291 IMCSYNQVNGVPACLRGDLFQK-ARNEWGFK--GYITSDCDAVATIFEYQNYTKTHEDSA 347
MC+YN VNGVP+C Q R WGF GY++SDCDAV ++ +Y +A
Sbjct: 273 FMCAYNSVNGVPSCANSFFLQTLLRESWGFPEWGYVSSDCDAVYNVWNPHDYASNQSSAA 332
Query: 348 AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKY 407
A L+AG DI+CG H + G+V +I+R++ L++ +RLG F+ +K +Y
Sbjct: 333 ASSLRAGTDIDCGQTYPWHLNESFVAGEVSRGEIERSVTRLYANLVRLGYFD---KKNEY 389
Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
LG DV ++ ++ +AA +GIVLLKND LPL+K V S+A+IGP N +QM G
Sbjct: 390 RSLGWKDVVKTDAWNISYEAAVEGIVLLKNDGT-LPLSKK-VRSIALIGPWANATTQMQG 447
Query: 468 GYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLD 527
Y G S L + + ++ G + S AGF +A+ AKK+D +I G+D
Sbjct: 448 NYFGAAPYLISPLEAAKKAGYQVNFELGT-ETASTSTAGFAKAIAAAKKSDAIIFAGGID 506
Query: 528 LTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILW 587
T E E DR + PG Q+ L+ ++ K P++++ GGG +D S +++ +++S++W
Sbjct: 507 NTVEQEGADRTDIAWPGNQLDLIKQLSELGK-PLVVLQMGGGQVDSSSLKSNKKVNSLVW 565
Query: 588 IGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYR 646
GYPG++G AL +I+ G P GRL T YP + + P NDMN+R D + PG++Y
Sbjct: 566 GGYPGQSGGVALFDILSGKRAPAGRLVSTQYPADYVHQFPQNDMNLRPD-GKSNPGQTYI 624
Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEV 706
+YTG VY FG G+ YT + +GS K + S L H
Sbjct: 625 WYTGKPVYQFGDGIFYTTFKETL--------------SGSSKGLKFNVSSVLAAPHPGYT 670
Query: 707 TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEKQLIGFDRVHTVAKG-SK 764
S + ++ N+G D + MLF R P K L+GFDR+ T+ G S
Sbjct: 671 YSEQTPVLTFTANIENSGKTDSPYSAMLFVRTANAGPAPYPNKWLVGFDRLATIKPGHSS 730
Query: 765 EISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
++S + P L+ + G RI+ G + L +
Sbjct: 731 KLSIPI-PVSALARVDSLGNRIVYPGKYELAL 761
>gi|413919686|gb|AFW59618.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 475
Score = 408 bits (1049), Expect = e-111, Method: Compositional matrix adjust.
Identities = 213/457 (46%), Positives = 292/457 (63%), Gaps = 12/457 (2%)
Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGK 409
V AG+D+NCGT + +HT +A+ GK+ E D+DRA+ N +RLG F+GDPR+ +G
Sbjct: 28 VAAAGLDLNCGTFLAQHTVAAVQAGKLSESDVDRAVTNNLVTLMRLGFFDGDPRELPFGN 87
Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY 469
LGP DVCT +++LA +AARQGIVLLKN K LPL+ ++ S+A+IGP N M G Y
Sbjct: 88 LGPSDVCTPSNQELAREAARQGIVLLKNTGK-LPLSATSIKSMAVIGPNANASFTMIGNY 146
Query: 470 TGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDA-GFHEAVRIAKKADFVIVVAGLDL 528
G PC + L+GL A V+ T Y GC +V C+ ++ A + A AD ++V G D
Sbjct: 147 EGTPCKYTTPLQGLGANVA-TVYQPGCTNVGCSGNSLQLDAATKAAASADVTVLVVGADQ 205
Query: 529 TQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWI 588
+ E E DR SLLLPGQQ LV++VA S P ILV+ GGP D+SFA++ +I++ILW+
Sbjct: 206 SIERESLDRTSLLLPGQQPQLVSAVANASSGPCILVVMSGGPFDISFAKSSDKIAAILWV 265
Query: 589 GYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFY 648
GYPGEAG A+A+++FG NP GRLP+TWYPESFTKVPM DM MR D S YPGR+YRFY
Sbjct: 266 GYPGEAGGAAIADVLFGYHNPSGRLPVTWYPESFTKVPMTDMRMRPDPSTGYPGRTYRFY 325
Query: 649 TGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTS 708
TG VY FG GLSYT++++ +SAP +L +L+ L + ++ E
Sbjct: 326 TGDTVYAFGDGLSYTSFAHHLVSAPKQL----ALQLAEGHACLTEQCPSVEA----EGAH 377
Query: 709 CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
C L F V + V NAG+ G H V LF+ P V P K L+GF++V + ++F
Sbjct: 378 CEGLAFDVHLRVRNAGERSGGHTVFLFSSPPAV-HNAPAKHLLGFEKVSLEPGQAGVVAF 436
Query: 769 GVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
VD C+ LS+ ++ G R + LG+H L VG+L+H+L +
Sbjct: 437 KVDVCKDLSVVDELGNRKVALGSHTLHVGDLKHTLNL 473
>gi|329745495|gb|AEB98984.1| xylosidase precursor [synthetic construct]
Length = 804
Score = 408 bits (1049), Expect = e-111, Method: Compositional matrix adjust.
Identities = 272/785 (34%), Positives = 398/785 (50%), Gaps = 71/785 (9%)
Query: 37 LNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAY 96
L+ PD C+ S+ C+ S + RA SLISL TL E I + + RLG+P Y
Sbjct: 53 LSSPD--CQNGPLRSHLICDESATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPVY 110
Query: 97 EWWSESLHGIASNGPGVNFN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAM 154
+ WSE+LHG+ NF+ G+ + TSFPQ +++ A+ NR+L I S ++ + RA
Sbjct: 111 QVWSEALHGLDR----ANFSDSGSYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAF 166
Query: 155 YNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGI 214
N G+ GL +APNIN FR P GRGQETPGED + + YA E++ QG
Sbjct: 167 NNAGRYGLDVYAPNINTFRHPVRGRGQETPGEDVSLAAVYAYEYITGIQG---------- 216
Query: 215 GFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDT 274
D L L+A KH YD+E W N SR + IT+QD +
Sbjct: 217 ----------------PDPDSNLKLAATAKHYAGYDIENWHNHSRLGNDMNITQQDLSEY 260
Query: 275 FQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGF--KGYITSDCDAVA 331
+ P F K +MC+YN VNGVPAC Q R+ +GF GY++SDCDA
Sbjct: 261 YTPQFHVAARDAKVHSVMCAYNAVNGVPACADSYFLQTLLRDTFGFVDHGYVSSDCDAAY 320
Query: 332 TIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSV 391
I+ Y + +AA + AG DI+CGT H +I G + DI++ ++ L++
Sbjct: 321 NIYNPHGYASSQAAAAAEAILAGTDIDCGTTYQWHLNESITAGDLSRDDIEKGVIRLYTT 380
Query: 392 QLRLGLFNGDPRKGK--YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAV 449
++ G F+ + K Y L DV ++ ++ AA QGIVLLKN K LPL + A
Sbjct: 381 LVQAGYFDSNTTKANNPYRDLTWSDVLETDAWNISYQAATQGIVLLKNSNKVLPLTEKAY 440
Query: 450 ----SSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDA 505
+++A+IGP N +Q+ G Y G S E ++A + + +
Sbjct: 441 PPSNTTVALIGPWANATTQLLGNYYGNAPYMISPRVAFEEAGYNVNFAERT-GISSTNTS 499
Query: 506 GFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTS-KRPVILV 564
GF A+ A+ AD +I G+D T E E DR S+ PG Q+ L+ +A ++ +P+I++
Sbjct: 500 GFAAALSAAQSADVIIYAGGIDNTLEAEALDRESIAWPGNQLDLIQKLASSAGSKPLIVL 559
Query: 565 LTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT- 623
GGG +D S + ++ +S++LW GYPG++G AL +II G NP GRL T YP S+
Sbjct: 560 QMGGGQVDSSSLKNNTNVSALLWGGYPGQSGGFALRDIITGKKNPAGRLVTTQYPASYAE 619
Query: 624 KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLK 683
+ P DMN+R + PG++Y++YTG VY FGHGL YT + + S T + +K
Sbjct: 620 EFPATDMNLRPEGDN--PGQTYKWYTGEAVYEFGHGLFYTTF-----AESSSNTTTREIK 672
Query: 684 AGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ 743
+ ++IL QT L +T L F I N G V+ + M+FA
Sbjct: 673 L-NIQDILSQTHEDL-----ASITQLPVLNFTANIK--NTGKVESDYTAMVFANTSDAGP 724
Query: 744 GT-PEKQLIGFDRVHTVAKG-SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRH 801
P K L+G+DR+ V G ++E+ P E S A R+ G+ VL G
Sbjct: 725 APYPVKWLVGWDRLGEVKVGETRELRV---PVEVGSFA-----RVNEDGDWVLFPGTFEL 776
Query: 802 SLTIE 806
+L +E
Sbjct: 777 ALNLE 781
>gi|392962219|ref|ZP_10327666.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
DSM 17108]
gi|392452977|gb|EIW29882.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
DSM 17108]
Length = 724
Score = 408 bits (1048), Expect = e-111, Method: Compositional matrix adjust.
Identities = 253/776 (32%), Positives = 389/776 (50%), Gaps = 112/776 (14%)
Query: 49 FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIAS 108
+ + + + +LS RAK L+S +T++EK+ Q+ ++ AI RLGIPAY WWSE+LHG+A
Sbjct: 1 MEIFDYQDETLSFEQRAKDLVSRMTIEEKVTQMVYSSPAISRLGIPAYNWWSEALHGVAR 60
Query: 109 NGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ--------A 160
G T FPQ + AA+F+ L ++ +++EARA ++ Q
Sbjct: 61 AG----------VATVFPQAIGLAATFDEKLIYDVAEIISIEARAKFHEFQRKGDHGIYK 110
Query: 161 GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFRE 220
GLTFW+PN+NIFRDPRWGRGQET GEDP + V F+K QG++
Sbjct: 111 GLTFWSPNVNIFRDPRWGRGQETFGEDPYLTGRLGVSFIKGLQGQD-------------- 156
Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFR 280
K+ L+ +AC KH + + R+ F+A+++ +D +T+ P F+
Sbjct: 157 KKYLRA-------------AACAKH---FAVHSGPESERHRFDAVVSPKDLRETYLPAFK 200
Query: 281 SCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNY 339
C+++ +M +YN+VNG P C L ++ R EWGF G++ SDC A+ E
Sbjct: 201 ECVKEANVEAVMGAYNRVNGEPCCGSNILLKETLRQEWGFTGHVVSDCWAIKDFHENHRV 260
Query: 340 TKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
T + +S A L G D+NCG M + A +G V E+ I+ A+ L +++LGLF+
Sbjct: 261 TSSAPESVALALNNGCDLNCGN-MYLNLLIAYQEGLVTEEAINTAVTRLMLTRMKLGLFD 319
Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
Y +G EH++ AL+ +++ +VLLKN+ LPL++N +SS+A+IGP
Sbjct: 320 A-AENVPYTNIGFHQNDCQEHREFALEVSKKTLVLLKNENHLLPLDRNTISSIAVIGPNA 378
Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSK---THYASGCH---DVPCN---SDAGFHEA 510
N+ + G Y G + ++L G+ V K YA GCH D N F EA
Sbjct: 379 NSREALTGNYFGTASNYITVLEGIREAVGKDTMVSYAQGCHLYRDKAENLGEERDRFAEA 438
Query: 511 VRIAKKADFVIVVAGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPV 561
V A++AD V++ GLD + E E+ D++ L LPG Q L+ + +T K P+
Sbjct: 439 VSTAERADLVVMCMGLDASIEGEEGDVSNEYASGDKLGLNLPGLQQELLEVIYKTGK-PI 497
Query: 562 ILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPES 621
ILVL G L V++A ++ +I+ YPG G KALA IFG+++P G+LP+T+Y +
Sbjct: 498 ILVLLAGSALAVTWAA--EKVPAIIQAWYPGAEGGKALASAIFGEYSPVGKLPITFYRTT 555
Query: 622 FTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISAS 681
D +M+ R+YR+ T +Y FG+GL YT ++Y+ L
Sbjct: 556 EELPEFTDYSMK--------NRTYRYMTKEALYPFGYGLGYTTFAYRQL----------- 596
Query: 682 LKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKV 741
++ C I V N G+ V L+ + K
Sbjct: 597 --------------------QLNRTKICAGENVQCSILVKNTGNFASDETVQLYIKDVKA 636
Query: 742 SQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
S P L G ++H + +EISF + QL++ N+ G IL G + VG
Sbjct: 637 SVEVPIWALQGIQKIHLLPGAEQEISFTLTS-RQLALINEKGNCILEPGIFEIYVG 691
>gi|255957137|ref|XP_002569321.1| Pc21g23540 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211591032|emb|CAP97251.1| Pc21g23540 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 791
Score = 407 bits (1045), Expect = e-110, Method: Compositional matrix adjust.
Identities = 251/754 (33%), Positives = 388/754 (51%), Gaps = 53/754 (7%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
C+T+ RA +LI++ T +E + + AIPRLG+P Y+ W+E+LHG+ + +
Sbjct: 61 CDTTAKPHDRAAALIAMFTFEELVNSTGNVMPAIPRLGLPPYQVWNEALHGL--DRANLT 118
Query: 115 FNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRD 174
G S TSFP +++ A+ NR+L + IG V+ + RA N G+ GL ++PNIN FR
Sbjct: 119 EFGDYSWATSFPSPILTMAALNRTLINQIGGIVSTQGRAFNNGGRYGLDVYSPNINSFRH 178
Query: 175 PRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRG 234
P WGRGQETPGED + S Y +E++ QG G K
Sbjct: 179 PVWGRGQETPGEDIQLCSVYGLEYITGLQG------------GLDPK------------- 213
Query: 235 DELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCS 294
EL L+A KH YD+E WGN SR + I+ D + P F + + + +M S
Sbjct: 214 -ELKLAATAKHFAGYDIENWGNHSRLGNDMSISAFDFASYYAPQFVTAVRDARVHSVMAS 272
Query: 295 YNQVNGVPACLRGDLFQK-ARNEWGF--KGYITSDCDAVATIFEYQNYTKTHEDSAAGVL 351
YN VNGVPA L Q R+ W F GY++SDCD+V +F Y + +AA +
Sbjct: 273 YNAVNGVPASANSFLLQTLLRDTWNFVEDGYVSSDCDSVYNVFNPHGYASSASLAAAKSI 332
Query: 352 KAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLG 411
+AG DI+CG + + +G++ +I+RA +S + LG F+GD KY L
Sbjct: 333 QAGTDIDCGATYQLYLNQSFTQGEISRSEIERAATRFYSNLVSLGYFDGD--NSKYRDLD 390
Query: 412 PDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTG 471
DV ++ ++ +AA +GIVLLKND LPL+K+ S+A+IGP N + M G Y G
Sbjct: 391 WSDVVATDAWNISYEAAVEGIVLLKNDGT-LPLSKD-THSVALIGPWANVTTTMQGNYYG 448
Query: 472 IPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQE 531
L L+A +YA G ++ + +GF A+ A+K+D VI G+D + E
Sbjct: 449 AAPYLTGPLAALQASDLDVNYAFGT-NISSETTSGFEAALSAARKSDVVIFAGGIDNSVE 507
Query: 532 TEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYP 591
E DR ++ PG Q+ L+ ++ K P++++ GGG +D S +A+ ++S++W GYP
Sbjct: 508 AEGVDRETITWPGNQLQLIEQLSELGK-PLVVLQMGGGQVDSSSLKANKNVNSLVWGGYP 566
Query: 592 GEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRFYTG 650
G++G A+ +I+ G P GRL +T YP + + P DM++R S PG++Y +YTG
Sbjct: 567 GQSGGPAILDILTGKRAPAGRLTVTQYPAEYALQFPATDMSLRPKGSN--PGQTYMWYTG 624
Query: 651 TQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQ-QTGSRLDYVHIDEVTSC 709
VY FGHGL YT + + L S G+ +I++ + S Y I++V
Sbjct: 625 KPVYEFGHGLFYTTFE-------TSLANSHGANNGASFDIVKLLSRSNAGYNVIEQVP-- 675
Query: 710 TSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFG 769
+ I V N G V + M F P K L+GFDR+ + + +
Sbjct: 676 ---FMNYTIEVENTGTVTSDYTAMAFVNTKAGPSPHPNKWLVGFDRLGGIEPHATQTMTI 732
Query: 770 VDPCEQLSIANKHGRRILPLGNHVLMVGELRHSL 803
+ ++ ++ G RI+ G + L + R ++
Sbjct: 733 PVSLDNVARTDEDGNRIVYPGKYELALNNERSAV 766
>gi|376259588|ref|YP_005146308.1| beta-glucosidase-like glycosyl hydrolase [Clostridium sp. BNL1100]
gi|373943582|gb|AEY64503.1| beta-glucosidase-like glycosyl hydrolase [Clostridium sp. BNL1100]
Length = 712
Score = 407 bits (1045), Expect = e-110, Method: Compositional matrix adjust.
Identities = 250/773 (32%), Positives = 382/773 (49%), Gaps = 119/773 (15%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ + SLS RA L+S +TL+EK QL +A + RLGIP Y WW+E+LHG+A G
Sbjct: 6 YLDKSLSFKERAADLVSRMTLEEKASQLRYDAQPVERLGIPRYNWWNEALHGVARAG--- 62
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
T FPQ + AA F+ I +A E RA YN G+TFW
Sbjct: 63 -------VATVFPQAIGMAAIFDDEFLEKIADVIATEGRAKYNENAKKGDRDIYKGITFW 115
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
+PN+NIFRDPRWGRG ET GEDP + S V FVK QG+
Sbjct: 116 SPNVNIFRDPRWGRGHETYGEDPYLTSRLGVAFVKGLQGD-------------------- 155
Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
G L +AC KH + + R+ F+A+++++D +T+ P F + +++
Sbjct: 156 --------GKYLKTAACAKH---FAVHSGPEDDRHHFDAVVSQKDLYETYLPAFEALVKE 204
Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
K +M +YN+ NG P L + R+ WGF G++ SDC A+ E TKT
Sbjct: 205 AKVESVMGAYNRTNGEPCNGSKTLLKDILRDGWGFDGHVVSDCWAIKDFHEGHGVTKTPT 264
Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
+S A LK+G D+NCG L A+ +G++ E+DIDRA + L + ++RLG+F+ D
Sbjct: 265 ESVALALKSGCDLNCGNMYLL-ILLALKEGRITEEDIDRAAIRLMTTRMRLGMFDDD--- 320
Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
++ K+ + + EH KL+L+AA++ +VLLKND LPL+ + ++A+IGP ++
Sbjct: 321 CEFDKIPYELNDSVEHNKLSLEAAKKSMVLLKNDG-LLPLDSKKIKNIAVIGPNADSSLA 379
Query: 465 MGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCH------DVPCNSDAGFHEAVRIAK 515
+ Y+G P ++L G+ VS + Y+ G H + D EAV +A+
Sbjct: 380 LRANYSGTPSQNITILDGIRKRVSEDTRVWYSVGSHLFMNREEDLAQPDDRLKEAVSVAE 439
Query: 516 KADFVIVVAGLDLTQETEDRD-----------RVSLLLPGQQMSLVTSVARTSKRPVILV 564
++D V++ GLD + E E D + L LP Q +L+ +V T K ++ +
Sbjct: 440 RSDVVVLCLGLDASVEGEQNDQGTVILDAGGDKADLNLPESQRNLLNAVLATGKPTIVAL 499
Query: 565 LTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK 624
L+G +S +A + ++I+ YPG G A AE+IFGD++P GRLP+T+Y +
Sbjct: 500 LSGSA---LSIGDAADKAAAIVQCWYPGSRGGLAFAEMIFGDYSPAGRLPVTFYKSTEEL 556
Query: 625 VPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKA 684
P D +M R+Y+F G +Y FG GLSYTN+ Y + P + +L
Sbjct: 557 PPFADYSME--------NRTYKFMKGEALYPFGFGLSYTNFEYSNIVCPQNVNNGENLS- 607
Query: 685 GSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQG 744
V + V NAG VD VV ++ + S
Sbjct: 608 -------------------------------VSVDVQNAGSVDSDEVVQVYIKDMDASVR 636
Query: 745 TPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
P+ L GF R+H + K ++F +D ++I ++ G+R + G L VG
Sbjct: 637 VPKYSLCGFKRIHLKSGEKKTVTFEID-SNAMTIVDEAGKRYIENGEFTLYVG 688
>gi|310795958|gb|EFQ31419.1| glycosyl hydrolase family 3 N terminal domain-containing protein
[Glomerella graminicola M1.001]
Length = 824
Score = 406 bits (1044), Expect = e-110, Method: Compositional matrix adjust.
Identities = 275/802 (34%), Positives = 399/802 (49%), Gaps = 85/802 (10%)
Query: 31 SNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPR 90
+N+ V + PD C S C+ +LS RA +L++ LT+ EK+ L + A +PR
Sbjct: 43 ANNNAVNSFPD--CSRDPLCSNDVCDETLSPKERAAALVAELTIWEKLDNLVNEAPGVPR 100
Query: 91 LGIPAYEWWSESLHGIASNGPGVNF--NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVA 148
L IP YEWWSE LHG+AS+ PG F +G S TSFPQ +V ++F+ L IG V+
Sbjct: 101 LAIPPYEWWSEGLHGVASS-PGTKFAKSGNFSYATSFPQPIVLGSAFDDDLVKAIGEVVS 159
Query: 149 VEARAMYNLGQAGLTFW---------------------APNINIFRDPRWGRGQETPGED 187
EARA N G++GL + +PNIN F+DPRWGRGQETPGED
Sbjct: 160 KEARAFSNRGRSGLDLYVSSISRHIEPEVRDDMLTEPESPNINAFKDPRWGRGQETPGED 219
Query: 188 PMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLI 247
P + Y + +G G+ S + L A CKH
Sbjct: 220 PFHLQNYVAAMLTGLEG-----------------------GDPSKK-----LIATCKHYA 251
Query: 248 AYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRG 307
A D E + R F+A IT QD + + PPF++C K MCSYN +NG P C
Sbjct: 252 ANDFENYKGVDRAGFDANITTQDLSEYYLPPFKTCAVDKKVGSFMCSYNAINGEPLCANP 311
Query: 308 DLFQKA-RNEWGFKG---YITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCM 363
L + R WG+ G Y+++DCD VA + + +Y +AA +KAG D+ C
Sbjct: 312 YLLEDILRQHWGWNGDGQYVSTDCDCVALMVSHHHYAPDLGHAAAWAMKAGTDLECNAFP 371
Query: 364 LRHT-QSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKK 422
Q A ++ + EK++D++L +++ + +G F+ R L DDV T E +K
Sbjct: 372 GSEALQLAWNQSLISEKEVDKSLTRMYTALVSVGQFDS-ARGQPLRSLSWDDVNTKEAQK 430
Query: 423 LALDAARQGIVLLKNDKKFLPLNKN-AVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLR 481
LA A +G VLLKND LPL+ A+IGP +N +QM G Y G SL +
Sbjct: 431 LAYQAVIEGAVLLKND-GILPLSAAWREKKYALIGPWINATTQMQGNYFGPAPYLISLYQ 489
Query: 482 GLEAYVSKTHYASGCHDVPCNS-DAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSL 540
+ + Y+ G NS D F +A+ A A ++ G+D T E E RDR +L
Sbjct: 490 AAKEFGLDFTYSLGSR---INSTDDSFKQALDSAHAAALIVFAGGVDNTLEAETRDRKTL 546
Query: 541 LLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALA 600
P Q+ L+ +V+ K PVI++ GGG +D + A+ I+++LW GYPG++G KA+
Sbjct: 547 AWPESQLDLLRAVSALGK-PVIVLQFGGGQVDDTELLANHSINALLWGGYPGQSGGKAVI 605
Query: 601 EIIFGDFNPGGRLPMTWYPESFTK-VPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHG 659
+++FG P GRL +T YP S+ + VP DMN+R GR+Y +Y G V +G G
Sbjct: 606 DLLFGRAAPAGRLSVTQYPASYNEDVPSTDMNLRPGPGNSGLGRTYMWYNGDAVVPYGFG 665
Query: 660 LSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTS--CTSLRFHVQ 717
L YT + K + + SA +K ++L DYV V T V
Sbjct: 666 LHYTTFDAKLKARQA----SALIKTEEVSSLLSN-----DYVSGTLVWQQILTKPVVSVL 716
Query: 718 ISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG---SKEISFGVDPCE 774
I+V+N G+V +V +LF R P K L G+ R + G +E+S + E
Sbjct: 717 ITVSNTGNVASDYVALLFLRSNAGPTPQPTKTLAGYHRFRNIQPGDRSEREVSITI---E 773
Query: 775 QLSIANKHGRRILPLGNHVLMV 796
+L ++ G R+L G++ L V
Sbjct: 774 RLVRVDELGNRVLHPGSYELFV 795
>gi|425780840|gb|EKV18836.1| Beta-xylosidase XylA [Penicillium digitatum PHI26]
gi|425783077|gb|EKV20946.1| Beta-xylosidase XylA [Penicillium digitatum Pd1]
Length = 792
Score = 405 bits (1042), Expect = e-110, Method: Compositional matrix adjust.
Identities = 248/754 (32%), Positives = 387/754 (51%), Gaps = 53/754 (7%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
C+T+ RA +L S+ TL+E + + A+PRLG+P Y+ WSE+LHG+ + +
Sbjct: 62 CDTTAKPHDRAAALTSMFTLEELVNSTGNVIPAVPRLGLPPYQVWSEALHGL--DRANLT 119
Query: 115 FNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRD 174
+G S TSFP ++ A+ NR+L + IG ++ + RA N G+ GL +APNIN FR
Sbjct: 120 ESGDYSWATSFPSPILIMAALNRTLINQIGEIISTQGRAFNNGGRYGLDVYAPNINSFRH 179
Query: 175 PRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRG 234
P WGRGQETPGED + S Y VE++ QG G R
Sbjct: 180 PVWGRGQETPGEDVQLCSIYGVEYITGIQG----------GLNPR--------------- 214
Query: 235 DELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCS 294
+L L+A KH YDLE WGN SR N I+ D + P F + + + +M S
Sbjct: 215 -DLKLAATAKHFAGYDLENWGNHSRLGNNVAISSFDLASYYTPQFITAVRDARVHSVMSS 273
Query: 295 YNQVNGVPACLRGDLFQK-ARNEWGF--KGYITSDCDAVATIFEYQNYTKTHEDSAAGVL 351
YN VNGVP+ L Q R W F GY++SDCDAV +F Y + +AA +
Sbjct: 274 YNAVNGVPSSANSFLLQTLLRETWNFVEDGYVSSDCDAVFNVFNPHGYASSASLAAAKSI 333
Query: 352 KAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLG 411
+AG DI+CG + ++ ++ +I+RA+ +S + LG F+GD KY L
Sbjct: 334 QAGTDIDCGATYQLYLNESLSHDEISRSEIERAVTRFYSTLVSLGYFDGD--NSKYRHLH 391
Query: 412 PDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTG 471
DV ++ ++ +AA +GIVLLKND LPL+ N S+A+IGP N + + G Y G
Sbjct: 392 WPDVVATDAWNISYEAAVEGIVLLKNDGT-LPLSNN-TRSVALIGPWANVTTTLQGNYYG 449
Query: 472 IPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQE 531
L L+A +YA G ++ +S +GF A+ A K++ +I G+D T E
Sbjct: 450 AAPYLTGPLAALQASNLDVNYAFGT-NISSDSTSGFEAALSAAGKSEVIIFAGGIDNTVE 508
Query: 532 TEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYP 591
E DR S+ PG Q+ L+ +++ K P++++ GGG +D S +A+ ++S++W GYP
Sbjct: 509 AEGVDRESITWPGNQLQLIEQLSKLGK-PLVVLQMGGGQVDSSSLKANKNVNSLVWGGYP 567
Query: 592 GEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRFYTG 650
G++G A+ +I+ G P GRL +T YP + + P DM++R + PG++Y +YTG
Sbjct: 568 GQSGGPAILDILTGKRAPAGRLTVTQYPAEYALQFPATDMSLRPKGNN--PGQTYMWYTG 625
Query: 651 TQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQ-QTGSRLDYVHIDEVTSC 709
VY FGHGL YT + L + G+ +I+Q + Y ++++
Sbjct: 626 KPVYEFGHGLFYTTFKVS-------LAHFHGAENGTSFDIVQLLSRPNAGYSVVEQIP-- 676
Query: 710 TSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFG 769
+ + V N G+V + M F P K L+GFDR+ ++ + +
Sbjct: 677 ---FINYTVEVMNTGNVTSDYTAMAFVNTKAGPSPHPNKWLVGFDRLGGISPRTTQTMTI 733
Query: 770 VDPCEQLSIANKHGRRILPLGNHVLMVGELRHSL 803
+ ++ ++ G RI+ G + L + R ++
Sbjct: 734 PITLDNVARTDERGNRIVYPGKYELTLNNERSAV 767
>gi|171678585|ref|XP_001904242.1| hypothetical protein [Podospora anserina S mat+]
gi|170937362|emb|CAP62020.1| unnamed protein product [Podospora anserina S mat+]
Length = 800
Score = 405 bits (1042), Expect = e-110, Method: Compositional matrix adjust.
Identities = 259/737 (35%), Positives = 376/737 (51%), Gaps = 89/737 (12%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
CNT+LS RA +L++ LT +EK+Q + + PR+G+PAY WWSE+LHG+A PG
Sbjct: 40 CNTTLSPPERAAALVAALTPEEKLQNIVSKSLGAPRIGLPAYNWWSEALHGVAY-APGTQ 98
Query: 115 F---NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
F +G +S TSFP L+ AA+F+ L I + +E RA N G +GL +W PN+N
Sbjct: 99 FWQGDGPFNSSTSFPMPLLMAATFDDELLEKIAEVIGIEGRAFGNAGFSGLDYWTPNVNP 158
Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
F+DPRWGRG ETPGED ++V YA +K +G +E+RV+
Sbjct: 159 FKDPRWGRGSETPGEDVLLVKRYAAAMIKGLEGP----------VPEKERRVV------- 201
Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
A CKH A D E W +R++FNA I+ QD + + PF+ C+ + I
Sbjct: 202 ---------ATCKHYAANDFEDWNGATRHNFNAKISLQDMAEYYFMPFQQCVRDSRVGSI 252
Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGF---KGYITSDCDAVATIFEYQNYTKTHEDSA 347
MC+YN VNGVP+C L Q R W + YITSDC+AV + Y T+ +
Sbjct: 253 MCAYNAVNGVPSCASPYLLQTILREHWNWTEHNNYITSDCEAVLDVSLNHKYAATNAEGT 312
Query: 348 AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKY 407
A +AGMD +C A +G ++E +DRALL L+ +R G F+G ++ Y
Sbjct: 313 AISFEAGMDTSCEYEGSSDIPGAWSQGLLKESTVDRALLRLYEGIVRAGYFDG--KQSLY 370
Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLP----LNKNAVSSLAIIGPLVNNIS 463
LG DV +KL+L AA G VLLKND LP L+K+ +A+IG +
Sbjct: 371 SSLGWADVNKPSAQKLSLQAAVDGTVLLKNDGT-LPLSDLLDKSRPKKVAMIGFWSDAKD 429
Query: 464 QMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAG--FH-----------EA 510
++ GGY+G AY+ YA+ +P ++ +G H A
Sbjct: 430 KLRGGYSGT-----------AAYLHTPAYAASQLGIPFSTASGPILHSDLASNQSWTDNA 478
Query: 511 VRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGP 570
+ AK AD+++ G+D + E +DR L PG Q+SL+ + SK ++VL G
Sbjct: 479 MAAAKDADYILYFGGIDTSAAGETKDRYDLDWPGAQLSLINLLTTLSKP--LIVLQMGDQ 536
Query: 571 LDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK-VPMND 629
LD + ++ +I++ILW +PG+ G A+ E++ G +P GRLP+T YP +FT+ VPM D
Sbjct: 537 LDNTPLLSNPKINAILWANWPGQDGGTAVMELVTGLKSPAGRLPVTQYPSNFTELVPMTD 596
Query: 630 MNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSA-PSELTISASLKAGSDK 688
M +R + GR+YR+Y T V FG GL YT +S KF P+ + + L+ DK
Sbjct: 597 MALRPSAGNSQLGRTYRWYK-TPVQAFGFGLHYTTFSPKFGKKFPAVIDVDEVLEGCDDK 655
Query: 689 NILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PE 747
LD + ++ + V N G+ +V + F P V G P
Sbjct: 656 --------YLDTCPLPDLP----------VVVENRGNRTSDYVALAFVSAPGVGPGPWPI 697
Query: 748 KQLIGFDRVHTVAKGSK 764
K L F R+ V G K
Sbjct: 698 KTLGAFTRLRGVKGGEK 714
>gi|220927661|ref|YP_002504570.1| glycoside hydrolase [Clostridium cellulolyticum H10]
gi|219997989|gb|ACL74590.1| glycoside hydrolase family 3 domain protein [Clostridium
cellulolyticum H10]
Length = 712
Score = 405 bits (1040), Expect = e-110, Method: Compositional matrix adjust.
Identities = 254/774 (32%), Positives = 381/774 (49%), Gaps = 121/774 (15%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ + SLS RA L+S +TL+EK QL +A + RLGIP Y WW+E+LHG+A G
Sbjct: 6 YLDKSLSFKERAVDLVSRMTLEEKASQLRYDAQPVERLGIPRYNWWNEALHGVARAG--- 62
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
T FPQ + AA F+ I +A E RA YN G+TFW
Sbjct: 63 -------VATVFPQAIGLAAIFDDEFLEKIADVIATEGRAKYNESSKKGDRDIYKGITFW 115
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
+PN+NIFRDPRWGRG ET GEDP + S V FVK QG+
Sbjct: 116 SPNVNIFRDPRWGRGHETYGEDPYLTSRLGVAFVKGLQGD-------------------- 155
Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
G L +AC KH + + R+ FNA+ +++D +T+ P F + +++
Sbjct: 156 --------GKYLKSAACAKH---FAVHSGPEDDRHHFNAVASQKDMYETYLPAFEALVKE 204
Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
K +M +YN+ NG P L + R++WGF G++ SDC A+ E TKT
Sbjct: 205 AKVESVMGAYNRTNGEPCNGSKTLLKDILRDDWGFDGHVVSDCWAIKDFHEGHGVTKTPT 264
Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
+S A LK G D+NCG L A+ +GK+ E+DIDRA + L + +++LG+F+ D
Sbjct: 265 ESVALALKNGCDLNCGNMYLL-ILLALKEGKITEEDIDRAAIRLMTTRMKLGMFDDDCEF 323
Query: 405 GKYGKLGPDDVCTS-EHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNIS 463
K P +V S EH KL+L+AAR+ +VLLKN+ LPL+ + ++A+IGP ++
Sbjct: 324 DKI----PYEVNDSIEHNKLSLEAARKSMVLLKNNG-LLPLDSKKIKNIAVIGPNADSSL 378
Query: 464 QMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCH------DVPCNSDAGFHEAVRIA 514
+ Y+G P ++L G+ + VS + Y+ G H + D EAV +A
Sbjct: 379 ALRANYSGTPSHNITILDGVRSRVSEDTRVWYSLGSHLFMNREEDLAQPDDRLKEAVSMA 438
Query: 515 KKADFVIVVAGLDLTQETEDRD-----------RVSLLLPGQQMSLVTSVARTSKRPVIL 563
+++D V++ GLD + E E D + L LP Q +L+ +V T K ++
Sbjct: 439 ERSDVVVLCLGLDASVEGEQNDQGTVILDAGGDKADLNLPESQRNLLNAVLATGKPTIVA 498
Query: 564 VLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT 623
+L+G +S +A + ++I+ YPG G A AE+IFGD++P GRLP+T+Y +
Sbjct: 499 LLSGSA---LSIGDAADKAAAIVQCWYPGSKGGLAFAEMIFGDYSPAGRLPVTFYKSTEE 555
Query: 624 KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLK 683
P D +M R+Y+F G +Y FG GLSYTN+ Y + P + SL
Sbjct: 556 LPPFEDYSME--------NRTYKFMKGEALYPFGFGLSYTNFEYSNIVCPQAVNNGESLS 607
Query: 684 AGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ 743
V + V NAG VD VV ++ + + S
Sbjct: 608 --------------------------------VSVDVQNAGSVDSDEVVQVYIKDMEASV 635
Query: 744 GTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
P L GF R+ + K ++F +D ++I ++ G+R + G+ L VG
Sbjct: 636 RVPNHSLCGFKRIFLKSGEKKTVTFEID-SRAMTIVDEEGKRYIENGDFTLYVG 688
>gi|333379783|ref|ZP_08471502.1| hypothetical protein HMPREF9456_03097 [Dysgonomonas mossii DSM
22836]
gi|332884929|gb|EGK05184.1| hypothetical protein HMPREF9456_03097 [Dysgonomonas mossii DSM
22836]
Length = 737
Score = 405 bits (1040), Expect = e-110, Method: Compositional matrix adjust.
Identities = 262/780 (33%), Positives = 379/780 (48%), Gaps = 128/780 (16%)
Query: 51 SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
+YPF NT+LSI R L+S LTL+EK+ Q+ +N AI RL IPAY WW+E LHGI
Sbjct: 24 NYPFQNTNLSIDERVNDLVSKLTLEEKVAQMLNNTPAIERLNIPAYNWWNECLHGIGR-- 81
Query: 111 PGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GL 162
T VT FPQ + AA++N+ L + SA++ E RA+YN + GL
Sbjct: 82 -------TDYKVTVFPQAIGMAAAWNKELMKEVASAISDEGRAIYNDATSKGNREIYYGL 134
Query: 163 TFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKR 222
T+W PNINIFRDPRWGRGQET GEDP + FV QG++ K
Sbjct: 135 TYWTPNINIFRDPRWGRGQETYGEDPFLTGVLGKSFVAGLQGDDTKY------------- 181
Query: 223 VLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSC 282
L +AC KH Y + +R++FN +T+ D DT+ P FR+
Sbjct: 182 --------------LKAAACAKH---YAVHSGPENTRHTFNTFVTDYDLWDTYLPAFRNL 224
Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTK 341
+ + K + +MC+YN NG P C L Q+ R +W F GY+TSDC A+ +++ K
Sbjct: 225 VVEAKVAGVMCAYNAYNGEPCCGNNFLMQEILREKWNFTGYVTSDCGAIDDFYQHH---K 281
Query: 342 THEDS---AAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
TH D+ AA + G DI+CG + A+ G + EK ID +L LF+++ RLG+F
Sbjct: 282 THPDAKYAAADAVYNGTDIDCGNEAYKALVDAVKTGIITEKQIDISLKRLFTIRFRLGMF 341
Query: 399 NGDPRKG-KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
DP + KY ++ + + +HK LAL R+ IVLLKN+ LPL+K + +A++GP
Sbjct: 342 --DPAENVKYSQISTSVLESQKHKDLALKITRESIVLLKNENNTLPLSKK-LKKVAVVGP 398
Query: 458 LVNNISQMGGGYTGIPC-------SPKSLLRGLEAYVSKTHYASGCHDVP--CNSDAGFH 508
NN + G Y G P + K L+G E Y G V NS
Sbjct: 399 NANNEVSVLGNYNGFPTEIVTPYEAVKQKLKGAEVI-----YEKGIDFVTPSTNSKEEVS 453
Query: 509 EAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSK 558
V+ K D VI V G+ E E+ DR S+ LP Q + ++ K
Sbjct: 454 ALVKRLKDVDVVIFVGGISPELEGEEMPVKIEGFTGGDRTSIKLPKIQTDFMKALV-AEK 512
Query: 559 RPVILVLTGGGPLDVSFAEADSQ-ISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTW 617
P + V+ G + E +SQ I +I+ Y G+ A+A+++FGD+NP G+LP+T+
Sbjct: 513 IPTVFVMMTGSAIA---TEWESQNIPAIVNAWYGGQDAGTAIADVLFGDYNPSGKLPVTF 569
Query: 618 YPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELT 677
Y + D ++ A +S + R+YR++ G +Y FG+GLSYT + Y + PS +
Sbjct: 570 YAK--------DSDLPAFNSYEMKNRTYRYFNGEVLYPFGYGLSYTKFEYSPIQVPSTID 621
Query: 678 ISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFAR 737
+ K V +S+ N G V+G VV L+
Sbjct: 622 TGNNAK--------------------------------VSVSIKNTGKVEGEEVVQLYIS 649
Query: 738 VPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
P P L GF+RV A SK + F + P E L + + G + G + +G
Sbjct: 650 YPDTKGQKPLYALKGFNRVSLKAGESKTVEFNLSPRE-LGLVDDAGILKVSAGKRKIFIG 708
>gi|78482949|emb|CAJ41429.1| beta (1,4)-xylosidase [Populus tremula x Populus alba]
Length = 732
Score = 403 bits (1036), Expect = e-109, Method: Compositional matrix adjust.
Identities = 225/539 (41%), Positives = 310/539 (57%), Gaps = 53/539 (9%)
Query: 42 FPCKPPHFDS--YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWW 99
F C P + PFC +L I TR LI +TLQEK+ L +NA+A+PRLGI YEWW
Sbjct: 27 FACDPKDGTNRDLPFCQVNLPIHTRVNDLIGRMTLQEKVGLLVNNAAAVPRLGIKGYEWW 86
Query: 100 SESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ 159
SE+LHG+++ GPG F G TSFPQV+ +AASFN +LW IG V+ EARAM+N G
Sbjct: 87 SEALHGVSNVGPGTKFGGAFPVATSFPQVITTAASFNATLWEAIGRVVSDEARAMFNGGV 146
Query: 160 AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFR 219
AGLT+W+PN+ PRWGRGQETPGEDP+VV YA +V+ QG +
Sbjct: 147 AGLTYWSPNVTYSVYPRWGRGQETPGEDPVVVGKYAASYVRGLQGSD------------- 193
Query: 220 EKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPF 279
G L ++ACCKH AYDL+ W R+ FNA +++QD DTF PF
Sbjct: 194 --------------GIRLKVAACCKHFTAYDLDNWNGVDRFHFNAKVSKQDMVDTFDVPF 239
Query: 280 RSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFEYQN 338
R C+++GK + +MCSYNQVNG+P C +L +K R +W GYI SDCD+ + Q+
Sbjct: 240 RMCVKEGKVASVMCSYNQVNGIPTCADPNLLKKTVRGQWRLNGYIVSDCDSFGVYYGQQH 299
Query: 339 YTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
+T S+ G KAG+D++CG ++ H + K +E +I+ A L + Q+ LG+F
Sbjct: 300 FTSPRR-SSLGCYKAGLDLDCGPFLVTHRDAV--KKAAEEAEINNAWLKTLTFQISLGIF 356
Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP- 457
+G P + G + P + ++ LA++A ++ + + KN + FL S I GP
Sbjct: 357 DGSPLQA-VGDVVP-TMGPPTNQDLAVNAPKR-LFIFKN-RAFL-----LYSPRHIFGPV 407
Query: 458 -LVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCN-SDAGFHEAVRIAK 515
L ++ M G Y G+PC L+GL +VS Y GC +V C +D G AV +A
Sbjct: 408 ALFKSLPFMLGNYEGLPCKYLFPLQGLAGFVSLL-YLPGCSNVICAVADVG--SAVDLAA 464
Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILV-----LTGGG 569
AD V++V G D + E E DRV LPG+Q LVT VA +K PV+LV ++GGG
Sbjct: 465 SADAVVLVVGADQSIEREGHDRVDFYLPGKQQELVTRVAMAAKGPVLLVIMDLAISGGG 523
Score = 74.7 bits (182), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 29/91 (31%), Positives = 56/91 (61%), Gaps = 1/91 (1%)
Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
+Q+ V N G +DG+H ++++ R P P KQL+ F++VH A + + + C+
Sbjct: 634 MQVDVKNTGSMDGTHTLLVYFR-PPARHWAPHKQLVAFEKVHVAAGTQQRVGINIHVCKS 692
Query: 776 LSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
LS+ + G R +P+G H L +G+++HS++++
Sbjct: 693 LSVVDGSGIRRIPMGEHSLHIGDVKHSVSLQ 723
>gi|410628680|ref|ZP_11339398.1| beta-glucosidase [Glaciecola mesophila KMM 241]
gi|410151684|dbj|GAC26167.1| beta-glucosidase [Glaciecola mesophila KMM 241]
Length = 732
Score = 400 bits (1029), Expect = e-108, Method: Compositional matrix adjust.
Identities = 269/783 (34%), Positives = 391/783 (49%), Gaps = 128/783 (16%)
Query: 48 HFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIA 107
H + N LS TRA++L++ +T+ EKI QLS + AIPRL +P Y WW+E+LHGIA
Sbjct: 24 HVKQQIWFNPELSFETRAQALVNAMTIDEKITQLSHSTPAIPRLEVPQYNWWNEALHGIA 83
Query: 108 SNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ-------- 159
NG T FPQ + A+F+ L + +A++ EARA Y + Q
Sbjct: 84 RNG----------KATIFPQAIGLGATFDPELAQEVANAISDEARAKYAIAQSIGNQGQY 133
Query: 160 AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFR 219
AGLTFW PN+NIFRDPRWGRGQET GEDP++ S FVK QG++ K +
Sbjct: 134 AGLTFWTPNVNIFRDPRWGRGQETYGEDPLLTSQMGTAFVKGLQGDDPKY--------LK 185
Query: 220 EKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPF 279
V K F S G E + R+ F+ +++D +T+ P F
Sbjct: 186 SAGVAKHFAVHS--GPESL--------------------RHQFDVEPSKKDLYETYLPAF 223
Query: 280 RSCIEQGKASCIMCSYNQVNGVPAC----LRGDLFQKARNEWGFKGYITSDCDAVATIFE 335
+ + Q K + +MC+YN V G P+C L G++ +K +W F GY+ SDC A+
Sbjct: 224 EALVTQAKVAGVMCAYNGVYGQPSCASEFLLGEMLKK---KWQFNGYVVSDCGALHDFHS 280
Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
T +SAA L+AG+D+NCG + ++A ++G + + ID+ L NL ++ RL
Sbjct: 281 GHKVTHNRVESAALALRAGVDLNCGFTYEKSLKAAFEEGLITQSLIDQRLKNLLMIRFRL 340
Query: 396 GLFNGDPRK-GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
GLF DP + + +G + + + EH +LA A + IVLLKN+K+ LPL+K+ + +
Sbjct: 341 GLF--DPSELNPHNAIGQEVIHSLEHIELARKVAAKSIVLLKNEKQVLPLSKD-IKVPYV 397
Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCHDVPCNSDAG-FHEA 510
GP + + G Y GI S ++L G+ V S +Y +G +P +S+ + A
Sbjct: 398 TGPFAASSDMLMGNYYGISDSLVTVLEGIAGKVSLGSSLNYRAGA--LPFHSNINPLNWA 455
Query: 511 VRIAKKADFVIVVAGLDLTQETE--------DR-DRVSLLLPGQQMSLVTSVARTSKRPV 561
+AK AD VI V G+ E E DR DRV++ LP Q+ V +A K P+
Sbjct: 456 PEVAKTADAVIAVVGISADMEGEEVDAIASADRGDRVAITLPQNQVDYVKQLAENKKGPL 515
Query: 562 ILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPES 621
ILV+ G P+D+S E D +ILWI YPGE G A+A++IFGD NP G LP+T+
Sbjct: 516 ILVVAAGSPVDIS--ELDPLADAILWIWYPGEQGGNAVADVIFGDTNPSGHLPLTFVKTI 573
Query: 622 FTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLS----APSELT 677
P +D M GR+Y+F +Y FG GLSYT + + LS AP E
Sbjct: 574 DDLPPFDDYTMT--------GRTYKFLKKLPLYPFGFGLSYTQFKFGKLSLSKRAPQE-- 623
Query: 678 ISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLF-- 735
G + NI + V N+ +DG VV ++
Sbjct: 624 -------GENINI--------------------------SVEVENSTALDGETVVQVYLS 650
Query: 736 ARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLM 795
+VP ++ L F RVH A + I F ++ + L N G + P G + L
Sbjct: 651 PQVPLKNEAI--TNLKAFKRVHIGAYEKRLIEFTIE-GKNLYRVNDAGENVWPSGAYTLA 707
Query: 796 VGE 798
VG+
Sbjct: 708 VGD 710
>gi|398406144|ref|XP_003854538.1| hypothetical protein MYCGRDRAFT_38178 [Zymoseptoria tritici IPO323]
gi|339474421|gb|EGP89514.1| hypothetical protein MYCGRDRAFT_38178 [Zymoseptoria tritici IPO323]
Length = 884
Score = 400 bits (1027), Expect = e-108, Method: Compositional matrix adjust.
Identities = 271/790 (34%), Positives = 399/790 (50%), Gaps = 72/790 (9%)
Query: 24 VVNVIAFSNSKP------VLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEK 77
V+N+ + SN P V P C F + C+TSLS R +LIS +T++EK
Sbjct: 111 VMNLYSISNPNPHPPPKGVKRSP--VCLTDPFCANKACDTSLSQDDRIAALISQMTVEEK 168
Query: 78 IQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF---NGT-VSSVTSFPQVLVSAA 133
L D A +PR+G+P YEWW+E+LHG+A + GV+F NG+ S TSFP ++ A
Sbjct: 169 ATNLVDGALGLPRIGLPPYEWWNEALHGVAGSR-GVSFDSPNGSDFSYATSFPLPILMGA 227
Query: 134 SFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSA 193
+F+ L ++ S + EARA N +G FW PN+N F DPRWGRG E P ED
Sbjct: 228 AFDDPLIYDVASIIGKEARAFANYAHSGYDFWTPNMNTFLDPRWGRGLEVPTEDSFHAQR 287
Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
Y V QG K+D K+++ A CKH YD+E
Sbjct: 288 YVASLVPGLQGGKEKTD---------HKQII----------------ATCKHFAVYDVET 322
Query: 254 WGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA 313
+R++ N T QD + + P F++C+ IMCSYN V GVPAC Q
Sbjct: 323 ----NRHAQNYEPTPQDLGEYYLPAFKTCVRDVNVGSIMCSYNAVYGVPACASEYFLQDV 378
Query: 314 -RNEWGFK---GYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQS 369
R++W F Y+TSDC+AV I+ N+T T +AA L AG D NCGT L+ +
Sbjct: 379 LRDQWNFNEPYHYVTSDCEAVKDIWTPHNFTDTEPAAAAVALNAGTDTNCGTSYLQ-LNT 437
Query: 370 AIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAAR 429
++ E +D +L L++ +G F+G P +Y L DV T + A AA
Sbjct: 438 SVANNWTTEAQMDISLTRLYNALFTVGYFDGQP---EYDGLSFADVSTPFAQATAYRAAS 494
Query: 430 QGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSK 489
+GI LLKND LPL K + +S+A+IGP N +QM G Y GI S L +A
Sbjct: 495 EGITLLKND-GLLPL-KKSYNSVALIGPWANATTQMQGIYQGIAPYLVSPLAAAQAQWGH 552
Query: 490 THYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSL 549
+ +G N+ GF A+ A+ AD +I G+D + E E RDR S+ PG Q+ L
Sbjct: 553 ISFTNGTAINSTNT-TGFASALSAARDADVIIYAGGIDSSIEKESRDRTSISWPGNQLDL 611
Query: 550 VTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNP 609
V ++ K P+++V GGG +D S + ++S++W GYPG+ G AL +++ G +P
Sbjct: 612 VQQLSELGK-PLVVVQFGGGQVDDSALLRNKNVNSLVWAGYPGQDGGSALIDVLVGKQSP 670
Query: 610 GGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYK 668
GRL +T YP + ++ + D N+R S PGR+Y++Y V FG+GL YT + +
Sbjct: 671 AGRLTITQYPADYINQISLFDPNLRPSDSS--PGRTYKWYNKEPVLPFGYGLHYTTFEFD 728
Query: 669 FLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDG 728
+ AP ASL + T S ++ + T L I V N+G +
Sbjct: 729 WAKAPQASYDIASL--------VDSTASYTTSPKKNDASPWTEL----SIKVHNSGSLGS 776
Query: 729 SHVVMLFARVPKVSQGT-PEKQLIGFDRVHTVAKG-SKEISFGVDPCEQLSIANKHGRRI 786
+V ++F R P P K L + R+H ++ G S E+SF + L+ ++HG I
Sbjct: 777 DYVGLVFLRTPNAGPAPYPNKWLASYARLHGLSAGASAELSFSLS-LGALARGDEHGDLI 835
Query: 787 LPLGNHVLMV 796
+ G++ + +
Sbjct: 836 IYPGDYEVQI 845
>gi|359473427|ref|XP_002265788.2| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 1-like
[Vitis vinifera]
Length = 464
Score = 399 bits (1026), Expect = e-108, Method: Compositional matrix adjust.
Identities = 199/473 (42%), Positives = 279/473 (58%), Gaps = 28/473 (5%)
Query: 154 MYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGG 213
MYNLG AGLTFW+PNIN+ RD RWGR QET EDP +V +AV +V+ Q
Sbjct: 1 MYNLGHAGLTFWSPNINVVRDTRWGRTQETSREDPFMVGEFAVNYVRGLQD--------- 51
Query: 214 IGFGFREKRVLKGFGEESDRGDE-LMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTE 272
++G +D L +S+CCKH AYD++ W N R++F+A ++EQD +
Sbjct: 52 ----------VEGTENVTDLNSRPLKVSSCCKHYAAYDIDSWLNIDRHTFDARVSEQDMK 101
Query: 273 DTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVA 331
+TF PF C+ +G S +MCS+N++NG+P C L + R+EW GYI SDC +
Sbjct: 102 ETFVSPFERCVREGDVSSVMCSFNKINGIPPCSDPRLLKGVIRDEWDLHGYIVSDCYGLE 161
Query: 332 TIFEYQNYTKTHE-DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFS 390
I + QNY + D+ A L+AG+D+ CG + GKV + ++DRAL N++
Sbjct: 162 VIVDNQNYLNDSKVDAVAKTLQAGLDLECGHYYTDALNELVLTGKVSQYELDRALKNIYV 221
Query: 391 VQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVS 450
+ +R+G F+G P Y LG D+C ++H +LA +AARQGIVLLKND + PL
Sbjct: 222 LLMRVGYFDGIP---AYESLGLKDICAADHIELAREAARQGIVLLKNDYEVFPLKPG--K 276
Query: 451 SLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEA 510
LA++GP N M G Y G+P S L A + Y +GC D C++D F EA
Sbjct: 277 KLALVGPHANATEVMIGNYAGLPRKYVSPLEAFSA-IGNVTYTTGCLDASCSNDTYFSEA 335
Query: 511 VRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGP 570
AK A+ I+ G DL+ E E DRV LLPG Q L+ VA S PVILV+ G
Sbjct: 336 KEAAKSAEVTIIFVGTDLSIEAEFVDRVDFLLPGNQTELIKQVAEVSSGPVILVVLSGSN 395
Query: 571 LDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT 623
+D++FA+ + +IS+ILW+G+PGE G A+A+++FG +NPGGRLP+TWY +
Sbjct: 396 IDITFAKNNPRISAILWVGFPGEQGGHAIADVVFGKYNPGGRLPVTWYEADYV 448
>gi|225878709|dbj|BAH30674.1| beta-xylosidase [Aspergillus aculeatus]
Length = 785
Score = 398 bits (1023), Expect = e-108, Method: Compositional matrix adjust.
Identities = 258/756 (34%), Positives = 382/756 (50%), Gaps = 65/756 (8%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
C+ + S RA +L S++TL+E + + AIPRLG+P Y+ W+E+LHG+ N
Sbjct: 62 CDRTASAHDRAAALTSMMTLEELMNSTGNRIPAIPRLGLPPYQIWNEALHGLYL----AN 117
Query: 115 F--NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
F +G S TSFP +++ A+ NR+L I +A + RA N G+ GL ++PNIN F
Sbjct: 118 FTESGPFSWSTSFPSPILTMATLNRTLIHQIAQIIATQGRAFNNAGRYGLNAFSPNINAF 177
Query: 173 RDPRWGRGQETPGEDP-MVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
R P WGRGQETPGED + SAYA E++ QG
Sbjct: 178 RHPVWGRGQETPGEDANCLCSAYAYEYITGLQGN-------------------------- 211
Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
+ A KH YD+E W SR+ + IT+QD + F P F + + +
Sbjct: 212 --ATNPKIIATAKHYAGYDIENWRQRSRFGNDLNITQQDLAEYFTPQFVVAVRDAQVRSV 269
Query: 292 MCSYNQVNGVPACLRGDLFQK-ARNEWGF--KGYITSDCDAVATIFEYQNYTKTHEDSAA 348
M SYN VNGVP+ L Q R+ WGF GY+ SDCDAV +F Y ++A
Sbjct: 270 MPSYNAVNGVPSSANTFLLQTLVRDSWGFIQDGYMASDCDAVYNVFNPHGYAANLSSASA 329
Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
L+AG DI+CG L ++ +G++ +I+RA+ +S + G F+G Y
Sbjct: 330 MSLRAGTDIDCGISYLTTLNESLTQGQISRSEIERAVTRFYSNLVSAGYFDGP--DAPYR 387
Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
L DV + +A +AA G+VLLKND LPL+K +V +A+IGP N QM G
Sbjct: 388 DLSWSDVVRTNRWNVAYEAAVAGVVLLKNDG-VLPLSK-SVQRVALIGPWANATEQMQGN 445
Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDL 528
Y G+ S L ++A + +YA G ++ N F A+ A+K+D +I G+D
Sbjct: 446 YHGVAPYLTSPLAAVQASGLEVNYAFGT-NITSNVTNCFAAALAAAEKSDIIIFAGGIDN 504
Query: 529 TQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWI 588
T E E+ DR ++ PG Q+ L+ + K P++++ GGG +D S +A ++ ++LW
Sbjct: 505 TLEAEELDRANITWPGNQLELIHRLGELGK-PLVVLQMGGGQVDSSALKASEKVGALLWG 563
Query: 589 GYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRF 647
GYPG+AG +AL +I+ G P GRL T YP + + P DM++R PG++Y +
Sbjct: 564 GYPGQAGGQALWDILTGQRAPAGRLTTTQYPAEYALQFPATDMSLRPRGDN--PGQTYMW 621
Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVH--IDE 705
YTG VY FGHGL YT ++ L+ P G + G+ L H +
Sbjct: 622 YTGEPVYAFGHGLFYTTFATA-LAGP-----------GQEPERSFDIGALLARPHAGYNL 669
Query: 706 VTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTV-AKGSK 764
V L F V+ VTN G+V + M FA + P K L+GFDR+ + + S
Sbjct: 670 VEQLPFLNFTVK--VTNTGEVISDYTAMAFANTTAGPRPHPNKWLVGFDRIGPLDPRVSA 727
Query: 765 EISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELR 800
+S V + L+ + G R++ G + L + R
Sbjct: 728 RMSVPVS-LDSLARTDAQGNRVIYPGPYELALNNER 762
>gi|367028614|ref|XP_003663591.1| glycoside hydrolase family 3 protein [Myceliophthora thermophila
ATCC 42464]
gi|347010860|gb|AEO58346.1| glycoside hydrolase family 3 protein [Myceliophthora thermophila
ATCC 42464]
Length = 760
Score = 398 bits (1022), Expect = e-108, Method: Compositional matrix adjust.
Identities = 265/770 (34%), Positives = 380/770 (49%), Gaps = 69/770 (8%)
Query: 40 PDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWW 99
PD C S C+TS S RA +L+S++ EK+ L +N+ + RLG+ AY+WW
Sbjct: 26 PD--CANGPLKSNTVCDTSASPGARAAALVSVMNNNEKLANLVNNSPGVSRLGLSAYQWW 83
Query: 100 SESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ 159
+E+LHG+A N G+ + G S+ T FPQ + ++A+F+ +L IG+ ++ EARA N G+
Sbjct: 84 NEALHGVAHNR-GITWGGEFSAATQFPQAITTSATFDDALIEQIGTIISTEARAFANNGR 142
Query: 160 AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFR 219
A L FW PN+N FRDPRWGRG ETPGED +A FVK QG
Sbjct: 143 AHLDFWTPNVNPFRDPRWGRGHETPGEDAFKNKKWAEAFVKGMQGP-------------- 188
Query: 220 EKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPF 279
G + A CKH AYDLE G+ +R++F+A ++ QD + + PPF
Sbjct: 189 --------------GPTHRVIATCKHYAAYDLENSGSTTRFNFDAKVSTQDLAEYYLPPF 234
Query: 280 RSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKG---YITSDCDAVATIFE 335
+ C K IMCSYN VN +PAC L R W + YI SDCDAV +
Sbjct: 235 QQCARDSKVGSIMCSYNAVNEIPACANPYLMDTILRKHWNWTDEHQYIVSDCDAVYYLGN 294
Query: 336 Y---QNYTKTHEDSAAGVLKAGMDINCGTC--MLRHTQSAIDKGKVQEKDIDRALLNLFS 390
Y ++ + L+AG D C SA + G+ + +D A+L
Sbjct: 295 ANGGHRYKPSYAAAIGASLEAGCDNMCWATGGTAPDPASAFNSGQFSQTTLDTAILRQMQ 354
Query: 391 VQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVS 450
+ G F+G G Y L DV T + AL AA GIVLLKND LPL+ N +
Sbjct: 355 GLVLAGYFDGP--GGMYRNLSVADVNTQTAQDTALKAAEGGIVLLKND-GILPLSVNGSN 411
Query: 451 -SLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHE 509
+A+IG N +M GGY+G P + + +Y +G P N D
Sbjct: 412 FQVAMIGFWANAADKMLGGYSGSPPFNHDPVTAARSMGITVNYVNGPLTQP-NGDT--SA 468
Query: 510 AVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGG 569
A+ A+K++ V+ G+D T E E +DR S+ P Q++L+ +A T K PVI+V G
Sbjct: 469 ALNAAQKSNAVVFFGGIDNTVEKESQDRTSIEWPSGQLALIRRLAETGK-PVIVVRLGTH 527
Query: 570 PLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMN 628
D + +ILW GYPG+ G A+ +II G +P GRLP T YP S+T + P
Sbjct: 528 VDDTPLLSI-PNVRAILWAGYPGQDGGTAVVKIITGLASPAGRLPATVYPSSYTSQAPFT 586
Query: 629 DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDK 688
+M +R SS YPGR+YR+Y+ V+ FGHGL YTN+S P+ I+
Sbjct: 587 NMALRPSSS--YPGRTYRWYS-NAVFPFGHGLHYTNFSVSVRDFPASFAIA--------- 634
Query: 689 NILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEK 748
++L G + Y+ + S V ++VTN G +V + F P K
Sbjct: 635 DLLASCGDSVAYLDLCPFPS-------VSLNVTNTGTRVSDYVALGFLSGDFGPSPHPIK 687
Query: 749 QLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
L + RV + G +++ E L ++ G R+L G + L+V +
Sbjct: 688 TLATYKRVFNIEPGETQVAELDWKLESLVRVDEKGNRVLYPGTYTLLVDQ 737
>gi|375150455|ref|YP_005012896.1| Beta-glucosidase [Niastella koreensis GR20-10]
gi|361064501|gb|AEW03493.1| Beta-glucosidase [Niastella koreensis GR20-10]
Length = 711
Score = 395 bits (1016), Expect = e-107, Method: Compositional matrix adjust.
Identities = 242/738 (32%), Positives = 363/738 (49%), Gaps = 109/738 (14%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
F N + R L+ LTL EKI L + + RLGIPAY WW+E+LHG+A G
Sbjct: 17 FRNPQQPMEARVNDLLHQLTLPEKISLLGYRSKEVERLGIPAYNWWNEALHGVARAG--- 73
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
T FPQ + AA+FN L + ++ EARA YNL A GLTFW
Sbjct: 74 -------VATVFPQAIGMAATFNDDLLKEAATVISTEARAKYNLSLAQGRHLQYMGLTFW 126
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
+PNINIFRDPRWGRGQET GEDP + + FVK QG + R LK
Sbjct: 127 SPNINIFRDPRWGRGQETYGEDPFLTAHMGTAFVKGLQG--------------NDPRYLK 172
Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
SAC KH + + G R++FNA++ E+D +T+ F + ++
Sbjct: 173 A-------------SACAKHFAVHSGPENG---RHTFNAIVDEKDLRETYLYAFHALVDA 216
Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
G S +MC+YN+VN P C L RNEW FKG++ +DC A+ IF +
Sbjct: 217 GVES-VMCAYNRVNDQPCCSGNFLLNSILRNEWKFKGHVVTDCGALDDIFMRHKVMPSGV 275
Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
+ AA +KAG++++C + + + A+++ + EKDID +L +L Q++LG ++ DP
Sbjct: 276 EVAAAAIKAGVNLDCSNVLQKDVEKAVEQKLLNEKDIDSSLAHLLRTQIKLGFYD-DPTA 334
Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
+ K G D V + H LA A+Q +VLLKN + LPL+K ++ ++G ++
Sbjct: 335 NPFYKYGADSVANTAHATLARAMAQQSMVLLKNSNQLLPLDKKKYPAIMVVGTNSASMDA 394
Query: 465 MGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVI 521
+ G Y G+ S + G+ V ++ Y G +D + A AD +
Sbjct: 395 LLGNYHGVSNRAVSFVEGITNAVDAGTRVEYDQGSD----YNDTTHFGGIWAAGNADITV 450
Query: 522 VVAGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLD 572
V GL E E+ D+ + LP ++ + ++ + +K+P+I V+T G +D
Sbjct: 451 AVIGLTPVYEGEEGDAFLAAKGGDKPDMSLPAAHIAFMKALRKANKKPIIAVITAGSAVD 510
Query: 573 VSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNM 632
+S E + + W YPGE G ALA+I+FG +P GRLP+T+Y +SF VP D
Sbjct: 511 ISAIEPYADAILLAW--YPGEQGGNALADILFGKVSPAGRLPVTFY-QSFADVPAYD--- 564
Query: 633 RADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQ 692
+ GR+YR++ G Y FG+GLSYT+++Y++ P+ +
Sbjct: 565 ----NYAMKGRTYRYFNGKVQYPFGYGLSYTSFAYEWQQMPANIR--------------- 605
Query: 693 QTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIG 752
T+ S+ F I V N G +DG VV ++ P V + P K+L
Sbjct: 606 --------------TAKDSVSF--SIKVKNTGSMDGDEVVQVYVEYPAVER-MPLKELKA 648
Query: 753 FDRVHTVAKGSKEISFGV 770
F RVH A G + + +
Sbjct: 649 FKRVHVKAGGEETVQLTI 666
>gi|326202986|ref|ZP_08192853.1| glycoside hydrolase family 3 domain protein [Clostridium
papyrosolvens DSM 2782]
gi|325987063|gb|EGD47892.1| glycoside hydrolase family 3 domain protein [Clostridium
papyrosolvens DSM 2782]
Length = 712
Score = 395 bits (1016), Expect = e-107, Method: Compositional matrix adjust.
Identities = 248/778 (31%), Positives = 378/778 (48%), Gaps = 119/778 (15%)
Query: 49 FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIAS 108
D + + SLS RA L+S +TL+EK QL +A + RLGIP Y WW+E+LHG+A
Sbjct: 1 MDKPKYLDKSLSFKERAADLVSKMTLEEKASQLRYDAQPVERLGIPRYNWWNEALHGVAR 60
Query: 109 NGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA-------- 160
G T FPQ + AA F+ I +A E RA YN
Sbjct: 61 AG----------VATVFPQAIGMAAMFDDEFLEKIADVIATEGRAKYNESAKKGDRDIYK 110
Query: 161 GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFRE 220
G+TFW+PN+NIFRDPRWGRG ET GEDP + S V FVK QG+
Sbjct: 111 GITFWSPNVNIFRDPRWGRGHETYGEDPYLTSRLGVAFVKGLQGD--------------- 155
Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFR 280
G L +AC KH Y + R+ F+A+++++D +T+ P F
Sbjct: 156 -------------GKYLKTAACAKH---YAVHSGPEDDRHFFDAIVSQKDLYETYLPAFE 199
Query: 281 SCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNY 339
+ +++ K IM +YN+ NG P L + R+ WGF G++ SDC A+ E
Sbjct: 200 ALVKEAKVESIMGAYNRTNGEPCNGSKTLLKDILRDGWGFDGHVVSDCWAIKDFHEGHGV 259
Query: 340 TKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
TKT +S A LK+G D+NCG L A+ +G + E+DIDRA + L + +++LG+F+
Sbjct: 260 TKTPTESVALALKSGCDLNCGNMYLL-ILLALKEGLITEEDIDRAAIRLMTTRMKLGMFD 318
Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
D ++ + + ++EH K++L+AA++ +VLLKND LPL+ + ++A+IGP
Sbjct: 319 DD---CEFDNIPYELNDSAEHNKISLEAAKKSMVLLKNDG-LLPLDSKKIKNVAVIGPNA 374
Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCH------DVPCNSDAGFHEA 510
++ + Y+G P +++ G+ VS + YA G H + D EA
Sbjct: 375 DSSLALRANYSGTPSQNVTIIEGIRKRVSENTRVWYAMGSHLFLNRDEDLAQPDDRLKEA 434
Query: 511 VRIAKKADFVIVVAGLDLTQETEDRD-----------RVSLLLPGQQMSLVTSVARTSKR 559
V A+++D V++ GLD + E E D + L LP Q +L+ +V T K
Sbjct: 435 VSAAERSDVVVLCLGLDASVEGEQNDQGTVILDAGGDKADLNLPESQRNLLNAVLATGKP 494
Query: 560 PVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYP 619
++ +L+G +S +A + ++I+ YPG G A AE+IFGD++P GRLP+T+Y
Sbjct: 495 TIVALLSGSA---LSIGDAADKAAAIVQCWYPGAIGGLAFAEMIFGDYSPAGRLPVTFYK 551
Query: 620 ESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS 679
+ P D +M R+Y+F G +Y FG GLSYT++ Y + P +
Sbjct: 552 STEELPPFADYSME--------NRTYKFMKGDALYPFGFGLSYTSFEYSNMVCPQTVNNG 603
Query: 680 ASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVP 739
+L V + V N G VD VV ++ +
Sbjct: 604 ENLS--------------------------------VSVDVQNTGSVDSDEVVQVYIKDM 631
Query: 740 KVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
S P+ L GF R+H + K ++F V +SI ++ G+R + G L G
Sbjct: 632 DASVRVPKYSLCGFKRIHLKSGEKKTVTFEV-ASNAMSIVDEAGKRHIENGEFTLYAG 688
>gi|240146254|ref|ZP_04744855.1| beta-glucosidase [Roseburia intestinalis L1-82]
gi|257201613|gb|EEU99897.1| beta-glucosidase [Roseburia intestinalis L1-82]
gi|291539969|emb|CBL13080.1| Beta-glucosidase-related glycosidases [Roseburia intestinalis
XB6B4]
Length = 710
Score = 395 bits (1015), Expect = e-107, Method: Compositional matrix adjust.
Identities = 255/769 (33%), Positives = 386/769 (50%), Gaps = 124/769 (16%)
Query: 64 RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
RA L+ +TL+EK+ Q A A+ RL I AY WW+E+LHG+A G + T
Sbjct: 16 RAAELVGKMTLEEKVAQTLYQAPAVERLNIKAYNWWNEALHGVARAG----------TAT 65
Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG--------LTFWAPNINIFRDP 175
FPQ + AA+F+ L +G AV+ EARA +N+ Q G LTFWAPN+NIFRDP
Sbjct: 66 VFPQAIGLAATFDEDLLEQVGDAVSTEARAKFNMQQEGKDTDIYKGLTFWAPNVNIFRDP 125
Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQG--ENWKSDDGGIGFGFREKRVLKGFGEESDR 233
RWGRG ET GEDP + S V +++ QG EN+
Sbjct: 126 RWGRGHETFGEDPYLTSRLGVRYIEGLQGHDENY-------------------------- 159
Query: 234 GDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMC 293
L +AC KH + + R+ F+A +TEQD +T+ P F +C+++GK +M
Sbjct: 160 ---LKAAACAKHFAVHSGPE---AVRHEFDAEVTEQDLRETYLPAFEACVKEGKVEAVMG 213
Query: 294 SYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLK 352
+YN+ NGVP C L R EWGF G++TSDC A+ E + T T +S A +
Sbjct: 214 AYNRTNGVPCCGNKRLLIDILRKEWGFSGHVTSDCWAIRDFHEGHHVTGTAIESVAMAMN 273
Query: 353 AGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP 412
G D+NCGT + A+ +G V+E+ +D A+ NLF +++LG+F+ + Y K+
Sbjct: 274 NGCDLNCGT-LFGFLVQAVRQGLVKEERLDEAVTNLFMARMKLGVFD-KKEENPYDKIPY 331
Query: 413 DDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGI 472
+ E KKL AR+ +VLLKN + LPL+KN + ++ +IGP ++ + G Y G
Sbjct: 332 LAADSREMKKLNEAVARRTVVLLKNKEHILPLDKNKIKTVGVIGPNADSRRALVGNYEGT 391
Query: 473 PCSPKSLLRGLEAYVS---KTHYASGCH---DVPCN---SDAGFHEAVRIAKKADFVIVV 523
++L G+E YV + Y+ GCH D N + E + + K++D V+ V
Sbjct: 392 ASRYITVLEGIEDYVGDDVRVLYSEGCHLYKDRTSNLAQENDRMSEVLGVCKESDVVVAV 451
Query: 524 AGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
GLD E E+ D+ L LPG Q ++ + A + +PVILVL G L V+
Sbjct: 452 LGLDAGIEGEEGDAGNEYGSGDKPDLNLPGLQEEILEA-AVSCGKPVILVLLSGSALAVN 510
Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA 634
+ AD + +I+ YPG G A+A+I+FG+ NP G+LP+T+Y + D +M+
Sbjct: 511 W--ADEHVDAIVQGWYPGARGGAAIADILFGEANPEGKLPVTFYRTTEELPDFEDYSMQ- 567
Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSY---KFLSAPSELTISASLKAGSDKNIL 691
GR+YR+ +Y FG+GLSYT Y+Y +FL E +S + G
Sbjct: 568 -------GRTYRYMEQEALYPFGYGLSYTEYAYQNVRFLE--QEPVVSEGVTIG------ 612
Query: 692 QQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLI 751
+SV N G +DG+ V ++ + P QL
Sbjct: 613 --------------------------LSVKNTGKMDGTETVQVYVKAEHSKM--PHGQLK 644
Query: 752 GFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELR 800
++ A KEI+ ++ E + +++G +ILP G+ + VG ++
Sbjct: 645 KIVKLPLCAGEEKEINIRLE-SEAFMLYDENGEKILPSGHFEIFVGGMQ 692
>gi|297745533|emb|CBI40698.3| unnamed protein product [Vitis vinifera]
Length = 461
Score = 395 bits (1015), Expect = e-107, Method: Compositional matrix adjust.
Identities = 202/399 (50%), Positives = 263/399 (65%), Gaps = 27/399 (6%)
Query: 154 MYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGG 213
MYN+G AGLTFW+PN+NIFRDPRWGRGQETPGEDP++ S YA +V+ Q +SDDG
Sbjct: 1 MYNVGLAGLTFWSPNVNIFRDPRWGRGQETPGEDPLLSSKYASGYVRGLQ----QSDDGS 56
Query: 214 IGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTED 273
D L ++ACCKH AYDL+ W R+ FNA++T+QD +D
Sbjct: 57 --------------------PDRLKIAACCKHYTAYDLDNWKGVDRFHFNAVVTKQDMDD 96
Query: 274 TFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVAT 332
TFQPPF+SC+ G + +MCSYNQVNG PAC DL R EW GYI SDCD+V
Sbjct: 97 TFQPPFKSCVIDGNVASVMCSYNQVNGKPACADPDLLSGIVRGEWKLNGYIVSDCDSVDV 156
Query: 333 IFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQ 392
+ Q+YTKT E++AA + AG+D+NCG+ + +HT++A+ G V E +D+A+ N F+
Sbjct: 157 FYNSQHYTKTPEEAAAKAILAGLDLNCGSFLGQHTEAAVKGGLVDESAVDKAVSNNFATL 216
Query: 393 LRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSL 452
+RLG F+G+P K YGKLGP DVCT EH++LA +AARQGI+LLKN K LPL+ A+ +L
Sbjct: 217 MRLGFFDGNPSKAIYGKLGPKDVCTLEHQELAREAARQGIMLLKNSKGSLPLSPTAIKTL 276
Query: 453 AIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVR 512
AIIGP N M G Y G PC + L+GL A V+ T Y SGC +V C S A EA +
Sbjct: 277 AIIGPNANVTKTMIGNYEGTPCKYTTPLQGLMALVATT-YLSGCSNVAC-STAQIDEAKK 334
Query: 513 IAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVT 551
IA AD +++ G+D + E E RDRV++ LPGQQ L+T
Sbjct: 335 IAAAADATVLIVGIDQSIEAEGRDRVNIQLPGQQPLLIT 373
>gi|291537442|emb|CBL10554.1| Beta-glucosidase-related glycosidases [Roseburia intestinalis
M50/1]
Length = 710
Score = 395 bits (1014), Expect = e-107, Method: Compositional matrix adjust.
Identities = 255/769 (33%), Positives = 386/769 (50%), Gaps = 124/769 (16%)
Query: 64 RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
RA L+ +TL+EK+ Q A A+ RL I AY WW+E+LHG+A G + T
Sbjct: 16 RAAELVGKMTLEEKVAQTLYQAPAVERLNIKAYNWWNEALHGVARAG----------TAT 65
Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG--------LTFWAPNINIFRDP 175
FPQ + AA+F+ L +G AV+ EARA +N+ Q G LTFWAPN+NIFRDP
Sbjct: 66 VFPQAIGLAATFDEDLLEQVGDAVSTEARAKFNMQQEGKDTDIYKGLTFWAPNVNIFRDP 125
Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQG--ENWKSDDGGIGFGFREKRVLKGFGEESDR 233
RWGRG ET GEDP + S V +++ QG EN+
Sbjct: 126 RWGRGHETFGEDPYLTSRLGVRYIEGLQGHDENY-------------------------- 159
Query: 234 GDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMC 293
L +AC KH + + R+ F+A +TEQD +T+ P F +C+++GK +M
Sbjct: 160 ---LKAAACAKHFAVHSGPE---AVRHEFDAEVTEQDLRETYLPAFEACVKEGKVEAVMG 213
Query: 294 SYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLK 352
+YN+ NGVP C L R EWGF G++TSDC A+ E + T T +S A +
Sbjct: 214 AYNRTNGVPCCGNKRLLIDILRKEWGFSGHVTSDCWAIRDFHEGHHVTGTAIESVAMAMN 273
Query: 353 AGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP 412
G D+NCGT + A+ +G V+E+ +D A+ NLF +++LG+F+ + Y K+
Sbjct: 274 NGCDLNCGT-LFGFLVQAVRQGLVKEERLDEAVTNLFMARMKLGVFD-KKEENPYDKIPY 331
Query: 413 DDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGI 472
+ E KKL AR+ +VLLKN + LPL+KN + ++ +IGP ++ + G Y G
Sbjct: 332 LAADSREMKKLNEAVARRTVVLLKNKEHILPLDKNKIKTIGVIGPNADSRRALVGNYEGT 391
Query: 473 PCSPKSLLRGLEAYVS---KTHYASGCH---DVPCN---SDAGFHEAVRIAKKADFVIVV 523
++L G+E YV + Y+ GCH D N + E + + K++D V+ V
Sbjct: 392 ASRYITVLEGIEDYVGDDVRVLYSEGCHLYKDRTSNLAQENDRMSEVLGVCKESDVVVAV 451
Query: 524 AGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
GLD E E+ D+ L LPG Q ++ + A + +PVILVL G L V+
Sbjct: 452 LGLDAGIEGEEGDAGNEYGSGDKPDLNLPGLQEEILEA-AVSCGKPVILVLLSGSALAVN 510
Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA 634
+ AD + +I+ YPG G A+A+I+FG+ NP G+LP+T+Y + D +M+
Sbjct: 511 W--ADEHVDAIVQGWYPGARGGAAIADILFGEANPEGKLPVTFYRTTEELPDFEDYSMQ- 567
Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSY---KFLSAPSELTISASLKAGSDKNIL 691
GR+YR+ +Y FG+GLSYT Y+Y +FL E +S + G
Sbjct: 568 -------GRTYRYMEQEALYPFGYGLSYTEYAYQNVRFLE--QEPVVSEGVTIG------ 612
Query: 692 QQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLI 751
+SV N G +DG+ V ++ + P QL
Sbjct: 613 --------------------------LSVKNTGKMDGTETVQVYVKAEHSKM--PHGQLK 644
Query: 752 GFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELR 800
++ A KEI+ ++ E + +++G +ILP G+ + VG ++
Sbjct: 645 KIVKLPLCAGEEKEINIRLE-SEAFMLYDENGEKILPSGHFEIFVGGMQ 692
>gi|255690205|ref|ZP_05413880.1| xylosidase/arabinosidase [Bacteroides finegoldii DSM 17565]
gi|260624224|gb|EEX47095.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
finegoldii DSM 17565]
Length = 1425
Score = 395 bits (1014), Expect = e-107, Method: Compositional matrix adjust.
Identities = 260/761 (34%), Positives = 377/761 (49%), Gaps = 109/761 (14%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
YPF N LSI R L+S LTL+EK++Q+ +NA AI RLGIPAY WW+E LHG+
Sbjct: 712 YPFRNPQLSIEQRVDDLVSRLTLEEKVRQMLNNAPAIKRLGIPAYNWWNECLHGVGR--- 768
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLT 163
T VT FPQ + AAS+N L + S++A E RA+YN Q LT
Sbjct: 769 ------TKYHVTVFPQAIGMAASWNDVLMKEVASSIADEGRAIYNDAQKRGDYSQYHALT 822
Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
+W PNINIFRDPRWGRGQET GEDP + S FV QG+ + R
Sbjct: 823 YWTPNINIFRDPRWGRGQETYGEDPYLTSKIGKAFVLGLQGD--------------DPRY 868
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
LK SAC KH Y + +R+SFN+ ++ D DT+ P FR+ +
Sbjct: 869 LKA-------------SACAKH---YAVHSGPEKNRHSFNSDVSTYDLWDTYLPAFRTLV 912
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
S +MC+YN G P C L Q R++W FKGY+TSDC A+ IF +
Sbjct: 913 VDANVSGVMCAYNAFKGQPCCGNDLLMQSILRDKWNFKGYVTSDCGAIDDIFNHHKAHPD 972
Query: 343 HEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDP 402
+AA + G D++CG A+ G + EK +D ++ LF+++ RLGLF DP
Sbjct: 973 AATAAADAVFHGTDLDCGQSAYLALVKAVKNGIITEKQLDVSVKRLFTIRFRLGLF--DP 1030
Query: 403 -RKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
+ Y + + +H+ LA AR+ +VLLKND + LPL KN + + ++GP +
Sbjct: 1031 AEQVDYAHIPISVLECKKHQDLAKQLARESMVLLKND-RLLPLQKNKLKKVVVMGPNADC 1089
Query: 462 ISQMGGGYTGIPC---SPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKAD 518
+ G Y G P +P +R V++ Y SG + S+ V AK AD
Sbjct: 1090 KDALLGNYNGHPSRMLTPLQAIRERLKGVAEVVYVSGIDYINTVSEDELKRYVNQAKGAD 1149
Query: 519 FVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGG 568
VI + G+ E E+ DR S+ LP Q L+ ++ + P + V+ G
Sbjct: 1150 AVIFIGGISPRLEGEEMSVNKDGFDGGDRTSIALPTVQTQLMKALV-AGRIPTVFVMMTG 1208
Query: 569 GPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMN 628
L + + EA + +IL Y G+ G +A+A+++FGD+NP G+LP+T+Y +
Sbjct: 1209 SALAIPW-EA-KHVPAILNAWYGGQYGGEAIADVLFGDYNPSGKLPVTFYAK-------- 1258
Query: 629 DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDK 688
D ++ S GR+YR++ G +Y FG+GLSYT++ Y L P+ +DK
Sbjct: 1259 DSDLPDFESYDMQGRTYRYFKGKALYPFGYGLSYTDFRYSSLKMPTACNT-------TDK 1311
Query: 689 NILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEK 748
I V ++V N G +DG VV L+ P P
Sbjct: 1312 EI------------------------PVTVTVKNTGKMDGEEVVQLYVSHPDKKILVPVT 1347
Query: 749 QLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHG-RRILP 788
L GF R++ A +K+I+F + E LS +++G R++LP
Sbjct: 1348 ALKGFKRIYLKAGEAKQITFSLS-SEDLSCVDENGIRKVLP 1387
>gi|30316196|sp|P83344.1|XYNB_PRUPE RecName: Full=Putative beta-D-xylosidase; AltName: Full=PpAz152
gi|19879972|gb|AAM00218.1|AF362990_1 beta-D-xylosidase, partial [Prunus persica]
Length = 461
Score = 394 bits (1013), Expect = e-107, Method: Compositional matrix adjust.
Identities = 202/463 (43%), Positives = 296/463 (63%), Gaps = 12/463 (2%)
Query: 348 AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKY 407
A +KAG+D++CG + HT++A+ +G V + +I+ AL N +VQ+RLG+F+G+P +Y
Sbjct: 1 ADAIKAGLDLDCGPFLAIHTEAAVRRGLVSQLEINWALANTMTVQMRLGMFDGEPSAHQY 60
Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
G LGP DVCT H++LAL+AARQGIVLL+N + LPL+ ++A+IGP + M G
Sbjct: 61 GNLGPRDVCTPAHQQLALEAARQGIVLLENRGRSLPLSTRRHRTVAVIGPNSDVTVTMIG 120
Query: 468 GYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLD 527
Y G+ C + L+G+ Y H A GC DV CN + F A A++AD ++V GLD
Sbjct: 121 NYAGVACGYTTPLQGIGRYTRTIHQA-GCTDVHCNGNQLFGAAEAAARQADATVLVMGLD 179
Query: 528 LTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILW 587
+ E E DR LLLPG Q LV+ VAR S+ P ILVL GGP+DV+FA+ D +IS+I+W
Sbjct: 180 QSIEAEFVDRAGLLLPGHQQELVSRVARASRGPTILVLMSGGPIDVTFAKNDPRISAIIW 239
Query: 588 IGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYR 646
+GYPG+AG A+A ++FG NPGG+LPMTWYP+++ T +PM DM MRAD +R YPGR+YR
Sbjct: 240 VGYPGQAGGTAIANVLFGTANPGGKLPMTWYPQNYVTHLPMTDMAMRADPARGYPGRTYR 299
Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS-ASLKAGSDKNILQQTGSRLDYVHIDE 705
FY G V+ FG GLSYT +++ P+ +++ SLKA ++ +L +T + H D
Sbjct: 300 FYIGPVVFPFGLGLSYTTFAHNLAHGPTLVSVPLTSLKATANSTMLSKT---VRVSHPD- 355
Query: 706 VTSCTSLR-FHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSK 764
C +L V + V N G +DG+H +++F P + KQL+GF ++H K
Sbjct: 356 ---CNALSPLDVHVDVKNTGSMDGTHTLLVFTSPPD-GKWASSKQLMGFHKIHIATGSEK 411
Query: 765 EISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIET 807
+ V C+ LS+ ++ G R +PLG H L +G+L H ++++T
Sbjct: 412 RVRIAVHVCKHLSVVDRFGIRRIPLGEHKLQIGDLSHHVSLQT 454
>gi|333381510|ref|ZP_08473192.1| hypothetical protein HMPREF9455_01358 [Dysgonomonas gadei ATCC
BAA-286]
gi|332830480|gb|EGK03108.1| hypothetical protein HMPREF9455_01358 [Dysgonomonas gadei ATCC
BAA-286]
Length = 738
Score = 390 bits (1003), Expect = e-105, Method: Compositional matrix adjust.
Identities = 260/779 (33%), Positives = 385/779 (49%), Gaps = 126/779 (16%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
YPF +T LS R L+S LTL+EK+ Q+ +N AI RL IPAY WW+E LHGI
Sbjct: 24 YPFRDTKLSTDKRVSDLVSRLTLEEKVLQMLNNTPAIERLNIPAYNWWNECLHGIGR--- 80
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLT 163
T VT FPQ + AA+++ L ++ +A++ E RA+YN A GLT
Sbjct: 81 ------TEYKVTVFPQAIGMAAAWDARLLKDVANAISDEGRAIYNDASAKGNYSIYHGLT 134
Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
+W PN+NIFRDPRWGRGQET GEDP + A FV QG
Sbjct: 135 YWTPNVNIFRDPRWGRGQETYGEDPYLTGALGKSFVAGLQG------------------- 175
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
D L +AC KH Y + +R++FN +T D DT+ P FR +
Sbjct: 176 --------DDSQYLKAAACAKH---YAVHSGPENTRHTFNTFVTTFDLWDTYLPAFRDLV 224
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
K + +MC+YN +G P C L Q+ R++WGF GY+TSDC A+ + + KT
Sbjct: 225 VDAKVAGVMCAYNAFSGEPCCGNNLLMQEILRDKWGFTGYVTSDCGAIDDFYRHH---KT 281
Query: 343 HED---SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
H D +AA + +G DI+CG + A+ G + E+ ID +L LF ++ RLG+F
Sbjct: 282 HPDAKYAAADAVYSGTDIDCGNEAYKALVDAVKTGLITEEQIDISLKRLFEIRFRLGMF- 340
Query: 400 GDPRKG-KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
DP + K+ K+ + + HK LAL R+ IVLLKN+ FLPL+K + +A+IGP
Sbjct: 341 -DPAEDVKFSKIPLSVLESQPHKDLALKITRESIVLLKNENNFLPLSKK-LKKVAVIGPN 398
Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGCHDVPCNSDAGFHEAVRIAKK 516
+N + G Y G P + + ++ + T Y G D S+ E +AK+
Sbjct: 399 ADNEVSVLGNYNGFPTQIITPYKAIKNKLKNTEVIYEKGI-DFVKPSENSKEEIAALAKR 457
Query: 517 ---ADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVIL 563
D VI G+ E E+ DR S+ LP Q L+ ++ + + P +
Sbjct: 458 LKGMDVVIFAGGISPELEGEEMPVKIEGFTGGDRTSIKLPKIQTELMQAL-KAERIPTVF 516
Query: 564 VLTGGGPLDVSFAEADSQ-ISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF 622
V+ G + AE +SQ + +IL Y G+ A+A+++FGD+NP G+LP+T+Y +
Sbjct: 517 VMMTGSAIA---AEWESQNVPAILNAWYGGQDAGTAIADVLFGDYNPSGKLPVTFYTK-- 571
Query: 623 TKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASL 682
D ++ A +S + R+YR++ G +Y FG+GLSYT + Y + P AS+
Sbjct: 572 ------DSDLPAFNSYEMKNRTYRYFDGQVLYPFGYGLSYTKFEYSPIQMP------ASI 619
Query: 683 KAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVS 742
KAG + V I+V N G DG VV L+ + +
Sbjct: 620 KAGEN--------------------------MEVSITVKNTGKTDGEEVVQLY--ISHDN 651
Query: 743 QGT----PEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
GT P L F+R+ A SK ++F + P E +++A++ G + G L +G
Sbjct: 652 NGTNRQLPLYALKSFERISLKAGESKSVTFKLSPRE-MALADEDGVLKMTKGKSKLYIG 709
>gi|347531439|ref|YP_004838202.1| beta-glucosidase [Roseburia hominis A2-183]
gi|345501587|gb|AEN96270.1| beta-glucosidase [Roseburia hominis A2-183]
Length = 716
Score = 390 bits (1001), Expect = e-105, Method: Compositional matrix adjust.
Identities = 249/767 (32%), Positives = 380/767 (49%), Gaps = 111/767 (14%)
Query: 58 SLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNG 117
SL AK L+ +TL+EKI Q+ + AI RL IPAY WW+E+LHG+A +G
Sbjct: 2 SLETKEYAKRLVEQMTLEEKISQMRYESPAIERLHIPAYNWWNEALHGVARSG------- 54
Query: 118 TVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFWAPNI 169
T FPQ + AA+F+ L IG V+ E RA + GLTFWAPNI
Sbjct: 55 ---VATMFPQAIALAATFDEELIEKIGDVVSTEGRAKFEAYSGRGDRGIYKGLTFWAPNI 111
Query: 170 NIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGE 229
NIFRDPRWGRG ET GEDP + + +++ QG++
Sbjct: 112 NIFRDPRWGRGHETYGEDPCLTAKLGCAYIRGIQGKD----------------------- 148
Query: 230 ESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKAS 289
D L +AC KH + + R+ F+A ++ D DT+ F+ C++
Sbjct: 149 ----PDHLKAAACAKH---FAVHSGPEALRHEFDAKVSLHDLYDTYLYAFKRCVKDAGVE 201
Query: 290 CIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAA 348
+M +YN+VNG PAC L Q R ++GF+G++ SDC A+ E+ + TKT E+SAA
Sbjct: 202 AVMGAYNRVNGEPACGSKTLLQDILREQFGFEGHVVSDCWAILDFHEHHHVTKTVEESAA 261
Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
+ G D+NCG L + A ++G V+EK I A+ L V++RLG+ P Y
Sbjct: 262 MAVNHGCDLNCGKAFL-YLSRACEQGLVEEKTITEAVERLMDVRIRLGMMEDYP--SPYA 318
Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
+ D V EH L+L+A+++ +VLLKND FLPL + V ++A+IGP N+ + + G
Sbjct: 319 NIPYDVVECPEHIALSLEASKRSMVLLKNDNHFLPLKQEQVHTIAVIGPNANSRAALVGN 378
Query: 469 YTGIPCSPKSLLRGLEAYV---SKTHYASGCH------DVPCNSDAGFHEAVRIAKKADF 519
Y G + L G++ Y ++ YA GCH + F EA+ A++AD
Sbjct: 379 YEGTSSRYITPLEGIQEYTGEKTRVLYAQGCHLYKDQVEFLGEPKDRFKEALIAAERADV 438
Query: 520 VIVVAGLDLTQETEDRD---------RVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGP 570
+++ GLD E E+ D ++ L LPG Q L+ +VA K P++L + G
Sbjct: 439 IVMCLGLDAGIEGEEGDAGNEYASGDKLGLKLPGLQQELLEAVAAVGK-PIVLTVLAGSA 497
Query: 571 LDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDM 630
LD+S+A+ +QI +IL YPG G KA+AE +FG+F+P G+LP+T+Y + D
Sbjct: 498 LDLSWAQEHAQIRAILDCWYPGARGGKAIAEALFGEFSPCGKLPVTFYEGTEFLPDFTDY 557
Query: 631 NMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNI 690
+M GR+YR+ +Y FG+GL+Y+ Y A +++T
Sbjct: 558 SM--------AGRTYRYTDRHVLYPFGYGLTYSQIRYS--DAHADVT------------- 594
Query: 691 LQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQL 750
D+ ++ VT V ++V N G V ++ R + P QL
Sbjct: 595 --------DFGILEPVT--------VHVTVENTGTYPVQEAVQVYVRFSEREAYDPGYQL 638
Query: 751 IGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
G V KE+ + P ++ ++ G+ ++ G++ + VG
Sbjct: 639 KGIRSVALECGEKKEVCITLSP-RDFALISEEGKCLVHPGSYEIAVG 684
>gi|373955483|ref|ZP_09615443.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
gi|373892083|gb|EHQ27980.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
Length = 738
Score = 389 bits (998), Expect = e-105, Method: Compositional matrix adjust.
Identities = 259/779 (33%), Positives = 388/779 (49%), Gaps = 118/779 (15%)
Query: 48 HFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIA 107
H YPF N +LS+ R L+ +TL+EK+ Q+ ++A AI RLG+PAY WW+E LHG+A
Sbjct: 27 HTYKYPFNNPALSMDERVADLVGRMTLEEKVSQMLNSAPAIERLGVPAYNWWNECLHGVA 86
Query: 108 SNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG--------Q 159
T VT +PQ + AA+++++ +G A E RA+YN
Sbjct: 87 R---------TPFKVTVYPQAIAMAATWDKTSMHVMGDYTAEEGRAVYNESIKNDKHDIY 137
Query: 160 AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFR 219
GLT+W PNINIFRDPRWGRGQET GEDP + FVK QG+
Sbjct: 138 LGLTYWTPNINIFRDPRWGRGQETYGEDPFLTGEMGSAFVKGLQGD-------------- 183
Query: 220 EKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPF 279
+ R LK G C KH Y + R+ FN I++ D DT+ P F
Sbjct: 184 DPRYLKAAG-------------CAKH---YAVHSGPEDLRHKFNTDISDYDLWDTYLPAF 227
Query: 280 RSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA--RNEWGFKGYITSDCDAVATIFEYQ 337
R + K + +MC+YN G P C DL + ++W F GY+TSDC + + +
Sbjct: 228 RKLVVDAKVTGVMCAYNAFKGQPCC-GSDLLMNSILHDKWKFTGYVTSDCGGIDDFYR-E 285
Query: 338 NYTKTHED---SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
N +T D +AA + G D+ CG + A+ GK+ EK ID++L LFSV+ +
Sbjct: 286 NTHQTQPDAESAAADAVLHGTDVECGNVTYKSLVKAVKDGKLSEKQIDQSLKRLFSVRFK 345
Query: 395 LGLFNGDPRKG-KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
LG+F DP KY ++G D + H AL A Q IVLLKN+ LPL+KN + +A
Sbjct: 346 LGMF--DPADAVKYNQIGKDALEAPAHGAQALKMAHQSIVLLKNEGNLLPLSKN-LKKIA 402
Query: 454 IIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCHDVPCNSDAGFHEA 510
++GP +N + G Y G P + L+G++ + ++ Y D +S A ++ A
Sbjct: 403 VLGPNADNAVSVLGNYNGTPSRIVTALQGIKNKLPAGTEVIYDKAV-DYVADSAARYNYA 461
Query: 511 VRIA--KKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSK 558
A K AD +I + G+ E E+ DR ++LLPG Q L+ ++ T K
Sbjct: 462 AMAAKVKDADAIIYIGGISPELEGEEMPVSKPGFHGGDRSTILLPGVQTELLKALKATGK 521
Query: 559 RPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWY 618
PV+ V+ G + + EA++ + +I+ Y G+A A+A+++FGD+NP GRLP+T+Y
Sbjct: 522 -PVVFVMMTGSAIATPW-EAEN-LPAIVNAWYGGQAAGTAIADVLFGDYNPAGRLPVTFY 578
Query: 619 PESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTI 678
+D ++ + + R+YR++ G +Y FG+GLSY+ + Y L AP
Sbjct: 579 G--------SDKDLPSFTDYSMDNRTYRYFKGKPLYAFGYGLSYSKFEYAPLDAP----- 625
Query: 679 SASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARV 738
+LKAG +L HV+ VTN +DG V L+
Sbjct: 626 -LTLKAGE------------------------ALTVHVK--VTNKSKMDGEEVTELYLSH 658
Query: 739 PKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
+ Q T + L GF+R A +K+I+F + + LSI + +G + G + VG
Sbjct: 659 IGIKQKTAIRALKGFERTLIKAGETKDITFKLSSAD-LSITDLNGNLVKASGKIAISVG 716
>gi|313202830|ref|YP_004041487.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
gi|312442146|gb|ADQ78502.1| glycoside hydrolase family 3 domain protein [Paludibacter
propionicigenes WB4]
Length = 742
Score = 389 bits (998), Expect = e-105, Method: Compositional matrix adjust.
Identities = 257/780 (32%), Positives = 387/780 (49%), Gaps = 114/780 (14%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
YPF +TS +I R K L+S LTL EK Q+ NA AI RLGI Y WW+E+LHG+A G
Sbjct: 38 YPFQDTSKTIDERVKDLVSRLTLDEKAGQMLHNAPAIKRLGILPYSWWNEALHGVARTG- 96
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ--------AGLT 163
T FP+ + AA+F+ L IG A++ EA A YN+ Q +G+T
Sbjct: 97 ---------RATVFPENVGLAATFDEDLVYRIGQAISDEAWAKYNIAQRLENYGQYSGIT 147
Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
F+APN+NIFRDPRWGRGQET GEDP + S V +VK QG + K
Sbjct: 148 FYAPNVNIFRDPRWGRGQETYGEDPFLTSRMGVAYVKGMQGNDPKY-------------- 193
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
L +AC KH Y + R+S++A +D +T+ P F + +
Sbjct: 194 -------------LKTAACAKH---YVVHSGPEALRHSYDAEPPMKDFMETYVPAFETLV 237
Query: 284 EQGKASCIMCSYNQVNGVPAC----LRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNY 339
++GK +MC+YN+ G P C L DL R +WGF GY+T+DC A+ + +
Sbjct: 238 KEGKVESVMCAYNRTFGKPCCGSSFLLHDLL---REKWGFTGYVTTDCWAIQNFYLHHGA 294
Query: 340 TKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
K ++ A +K+G+++NCG + +A+ KG V EK++D AL L + RLGLF+
Sbjct: 295 AKDSLEACALAIKSGVNLNCGN-EFNYLPAAVRKGLVTEKEVDEALSQLLRTRFRLGLFD 353
Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
P + Y K+ + + + ++ LA +AA + +VLL+N LPL K+ + SL ++GP
Sbjct: 354 -SPNENPYAKIKEEVIGSQQNIDLAYEAAAKSLVLLQNKNNTLPLKKD-MKSLYVVGPYA 411
Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCHDVPCNSDAGFHE--AVRIA 514
N + G Y G+ ++++ + VS +Y G N ++ + A
Sbjct: 412 ANQDILLGNYNGVNSRLTTIMQAIVGKVSAGTSVNYRIGVEPSAPNKNSMNYSIGEAADA 471
Query: 515 KKADFVIVVAGL------DLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGG 568
V ++G+ + T T DR+ L LP Q+ + + + K+P+ILVLTGG
Sbjct: 472 DAVVAVFGISGVFEGEEGESTASTSRGDRLDLNLPQNQLDYLRELKKKCKKPIILVLTGG 531
Query: 569 GPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-M 627
P + E + +IL++ YPG+ G A+A++IFGD NP GRL +T +P+S +++P
Sbjct: 532 SP--ICTPELADMVDAILFVWYPGQEGGHAVADVIFGDVNPSGRLCIT-FPKSVSQLPAF 588
Query: 628 NDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSD 687
D +M+ GR+YR+ T +Y FG GLSYTNYSY + +
Sbjct: 589 EDYSMK--------GRTYRYMTEEPLYPFGFGLSYTNYSYSNIKTDKDKIKKGQ------ 634
Query: 688 KNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPE 747
HV +V+N G G V L+ K S TP
Sbjct: 635 -------------------------SVHVTATVSNTGKTAGEEVAQLYITDVKASAPTPL 669
Query: 748 KQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGN-HVLMVGELRHSLTIE 806
L G RV A SKE+SF V P + + + G +++ G+ V + G +L+I+
Sbjct: 670 YALKGTKRVKLAAGESKEVSFEVTP-QMMELVTVTGEKVIEPGDFKVYIAGSTPSALSIK 728
>gi|310797011|gb|EFQ32472.1| glycosyl hydrolase family 3 N terminal domain-containing protein
[Glomerella graminicola M1.001]
Length = 767
Score = 388 bits (996), Expect = e-105, Method: Compositional matrix adjust.
Identities = 268/765 (35%), Positives = 388/765 (50%), Gaps = 79/765 (10%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
C+ +LS RA +L+ LT++EK+Q L A PR+G+PAY WWSE+LHG+A PG
Sbjct: 43 CDRTLSPPERAAALVKALTVEEKLQNLVSKAQGAPRIGLPAYNWWSEALHGVAY-APGTY 101
Query: 115 F-NGTV--SSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
F G V +S TS+P L+ AA+F+ L IG+A+ +EARA N G AGL +W PN+N
Sbjct: 102 FPEGDVEFNSSTSYPMPLLMAAAFDDELIEQIGAAIGIEARAWGNAGWAGLDYWTPNVNP 161
Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
F+DPRWGRG ETPGED + V YA + G ++RV+
Sbjct: 162 FKDPRWGRGSETPGEDVLRVKRYAEYITRGLDGP----------VPGEQRRVI------- 204
Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
+ CKH D E W SR+ F+A IT QD + + PF+ C K I
Sbjct: 205 ---------STCKHYAGNDFEDWNGTSRHDFDAKITAQDLAEYYLMPFQQCARDSKVGSI 255
Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGF---KGYITSDCDAVATIFEYQNYTKTHEDSA 347
MC+YN VNGVP+C L Q R W + Y+TSDC+AV + Y T+
Sbjct: 256 MCAYNAVNGVPSCANEYLLQNILREHWNWTEHNNYVTSDCEAVLDVSANHKYAPTNAAGT 315
Query: 348 AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKY 407
A +AGMD +C A +G ++E+ +DRALL L+ +R G F+G + Y
Sbjct: 316 AICFEAGMDTSCEYTGSSDIPGAWSQGLLKEETVDRALLRLYEGLVRAGYFDG--HEAIY 373
Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
KLG DV ++E + LAL AA +GIVLLKN+ LPL+ +A+IG + ++ G
Sbjct: 374 AKLGWKDVNSAEAQSLALQAAVEGIVLLKNNGT-LPLDLKPSHKVAMIGFWADAPDKLQG 432
Query: 468 GYTGIPC---SPKSLLRGLEAYVSKTHYASGCHDVPCN-SDAGFHEAVRIAKKADFVIVV 523
GY+G +P R L ++ ASG N SD A+ A+ AD+++
Sbjct: 433 GYSGRAAHLHTPAYAARQLGLDIT---LASGPVLQRNNASDNWTAAALEAAEGADYILYF 489
Query: 524 AGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQIS 583
GLD + E DR L P Q+ L+ ++ K P+++ L G D + D ++S
Sbjct: 490 GGLDTSAAGETLDRTDLEWPEAQLMLIKKLSALGK-PLVVNLLGDQLDDTPLLQLD-EVS 547
Query: 584 SILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK-VPMNDMNMRADSSRQYPG 642
SILW +PG+ G A+ ++I G+ +P GRLP+T YP ++T +PM M++R S QYPG
Sbjct: 548 SILWANWPGQDGGVAIMKLITGEKSPAGRLPVTQYPSNYTDLIPMTSMDLRPTS--QYPG 605
Query: 643 RSYRFYTGTQVYGFGHGLSYTNYSYKFLSA-PSELTISASLKAGSDKNILQQTGSRLDYV 701
R+YR+Y + FG GL YT + + A P L I+ + G++
Sbjct: 606 RTYRWYD-KPIKRFGFGLHYTTFKAEVGGAFPKTLRIADLVGCGNE-------------- 650
Query: 702 HIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAK 761
H D +C + V S+TN G+ +V + + + P K L + R+ VA
Sbjct: 651 HPD---TCPAPPLPV--SITNTGNRTSDYVALAYLSGEYGPRPYPIKTLSAYKRLRDVAP 705
Query: 762 GSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
G VD L +H + GN VL GE +++TI+
Sbjct: 706 GETAT---VDLAWTLGDIARHDEQ----GNTVLYPGE--YTITID 741
>gi|366163035|ref|ZP_09462790.1| glycoside hydrolase family 3 [Acetivibrio cellulolyticus CD2]
Length = 705
Score = 388 bits (996), Expect = e-105, Method: Compositional matrix adjust.
Identities = 249/765 (32%), Positives = 376/765 (49%), Gaps = 126/765 (16%)
Query: 64 RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
+A+ L++ +TL+EK QL+ N+ AI RLGIPAY WW+E+LHG+A G + T
Sbjct: 10 KAEELVAQMTLEEKASQLTYNSPAIERLGIPAYNWWNEALHGVARAG----------TAT 59
Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFWAPNINIFRDP 175
FPQ + AA F+ I +A+A+EARA YN GLT W+PNINIFRDP
Sbjct: 60 VFPQAIGLAAMFDDEFLMKIANAIAIEARAKYNESSKHGDRDIYKGLTIWSPNINIFRDP 119
Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
RWGRG ET GEDP + V F+K QG+ D
Sbjct: 120 RWGRGHETYGEDPFLSGKLGVAFIKGLQGDK----------------------------D 151
Query: 236 ELMLSACCKHLIAY----DLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
+M +AC KH AY DL R+ FNA +T++D +T+ P F +C++ K +
Sbjct: 152 VMMTAACVKHFAAYSGPEDL-------RHGFNAEVTKKDLWETYLPAFETCVKDAKVEAV 204
Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGV 350
M YN+ NG P C L + R +WGF+G++ SDC A+ TKT E+S A
Sbjct: 205 MGGYNRTNGEPCCGSYTLLRDILREKWGFEGHVVSDCWAIKDFHTDHMVTKTPEESVALA 264
Query: 351 LKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKL 410
+ AG D+NCG M A+ +G + E+ I RA + +F+ + +LGLF G ++ +
Sbjct: 265 IDAGCDLNCGN-MYLMLLIALQEGLITEEHITRAAVRIFTTRFKLGLFEG----SEFDNI 319
Query: 411 GPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT 470
+ V SEHK++A++AAR+ VLLKND LP+NK A+ ++ +IGP N+ + G Y
Sbjct: 320 PYEVVECSEHKEMAIEAARKSAVLLKND-GILPINKGAIKTIGVIGPNANSRIALKGNYH 378
Query: 471 GIPCSPKSLLRGLEAYVS---KTHYASGCH------DVPCNSDAGFHEAVRIAKKADFVI 521
G +LL G++ V + Y++GC +V ++ EAV +A+ +D V+
Sbjct: 379 GTSSRYITLLEGIQDEVGDEVRVLYSNGCELVKDRTEVLAYANDRLAEAVTVAEHSDLVV 438
Query: 522 VVAGLDLTQETEDR---------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLD 572
+ GLD T E E D+ L LP Q SL+ + T K P +L L G ++
Sbjct: 439 LCLGLDETIEGEQSDEGNNGGSGDKKDLDLPEVQKSLLEKIVATGK-PTVLCLMAGSAIN 497
Query: 573 VSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNM 632
+S+A + IL YPG G KA+A+I+FG+ +P G+LP+T+Y P+ D +M
Sbjct: 498 LSYAH--EHCNGILLTWYPGARGGKAVADILFGNASPSGKLPVTFYRSLDNLPPITDYSM 555
Query: 633 RADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQ 692
+ R+YR+ +Y FG+GL+Y + K + + I +K+I
Sbjct: 556 K--------NRTYRYIEEAPLYPFGYGLTYGDVELKHVEIKGTVEI--------EKDI-- 597
Query: 693 QTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIG 752
++ +++ N G V VV + + + L
Sbjct: 598 ----------------------YITVTLQNRGSVAVEEVVQAYIKDEQSMYAVTNTSLCA 635
Query: 753 FDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
F RV A K++S + P + L + N G ++L L G
Sbjct: 636 FMRVGLGANEEKQVSMRI-PFDSLKVVNLDGEKVLDSKKFTLFAG 679
>gi|410617070|ref|ZP_11328046.1| beta-glucosidase [Glaciecola polaris LMG 21857]
gi|410163339|dbj|GAC32184.1| beta-glucosidase [Glaciecola polaris LMG 21857]
Length = 731
Score = 388 bits (996), Expect = e-105, Method: Compositional matrix adjust.
Identities = 255/768 (33%), Positives = 377/768 (49%), Gaps = 110/768 (14%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ + +S + RA L++ +T+ EKI QLS AI RL +P Y WW+E+LHGIA NG
Sbjct: 29 WFDPDISFAQRANLLVNAMTVDEKIAQLSHATPAIARLNVPQYNWWNEALHGIARNG--- 85
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ--------AGLTFW 165
T FPQ + AA+F+ L + SA++ EARA Y + Q AGLTFW
Sbjct: 86 -------KATIFPQAIGLAATFDPDLAHQVASAISDEARAKYAIAQSIGNQGQYAGLTFW 138
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
PN+NIFRDPRWGRGQET GEDP + + FVK QG++ K + V K
Sbjct: 139 TPNVNIFRDPRWGRGQETYGEDPFLTAQMGTAFVKGLQGDDPKY--------LKSAGVAK 190
Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
F S G E + R+ F+ +++D +T+ P F + + Q
Sbjct: 191 HFAVHS--GPESL--------------------RHHFDVEPSQKDLYETYLPAFEALVTQ 228
Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
K + +MC+YN VNG PAC L + +WGF GYI SDC A+ TK+
Sbjct: 229 AKVAGVMCAYNAVNGEPACASAQLLDGILKKQWGFHGYIVSDCGALNDFQAGHKVTKSGP 288
Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
+SAA L++G+++NCG+ ++A+++ V + ID+ L L ++ +LG F DP
Sbjct: 289 ESAALALQSGVNLNCGSTYEHFLKAALEQNLVPLELIDQRLTQLLMIRFQLGFF--DPAG 346
Query: 405 -GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNIS 463
Y ++ PD + + EH L+ D AR+ IVLLKND LPL+K+ + + GP +
Sbjct: 347 LNPYNEVTPDVIHSPEHINLSRDVARKSIVLLKNDNHVLPLSKD-IKVPYVTGPFAASSD 405
Query: 464 QMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCHDVPCNSDAG-FHEAVRIAKKADF 519
+ G Y GI S S+L G+ V S +Y SG +P +++ + A ++AK AD
Sbjct: 406 MLIGNYYGISDSLVSVLEGIAGKVSLGSSLNYRSGS--LPFHNNINPLNWAPQVAKTADA 463
Query: 520 VIVVAGLDLTQETE--------DR-DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGP 570
VI V G+ E E DR DRV++ LP Q+ V +A K P+ILV+ G P
Sbjct: 464 VIAVVGVSADMEGEEVDAIASADRGDRVAITLPQNQVDYVKQLAAHKKGPLILVVAAGSP 523
Query: 571 LDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDM 630
+D+S + + +ILWI YPGE G A+A+++FGD NP G LP+T+ P +D
Sbjct: 524 VDIS--DLEPLADAILWIWYPGEQGGNAVADVLFGDTNPSGHLPLTFVKSIDDLPPFDDY 581
Query: 631 NMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNI 690
M GR+Y+F +Y FG G SYT +S+ ++LT+S
Sbjct: 582 AMT--------GRTYKFLEKAPLYPFGFGRSYTEFSF------NDLTVS----------- 616
Query: 691 LQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQL 750
+ + + + V N GD+ G VV + L
Sbjct: 617 --------------QGKAIEGEALTLSVEVENRGDIAGETVVQAYLSPIARMNNEAISSL 662
Query: 751 IGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
F R+H K ++ + + + L N G + P G + L VG+
Sbjct: 663 KSFKRIHLAPKETRWVELTIQ-GKDLYQVNNAGETVWPQGRYSLAVGD 709
>gi|389632743|ref|XP_003714024.1| beta-xylosidase [Magnaporthe oryzae 70-15]
gi|351646357|gb|EHA54217.1| beta-xylosidase [Magnaporthe oryzae 70-15]
Length = 847
Score = 387 bits (994), Expect = e-104, Method: Compositional matrix adjust.
Identities = 260/782 (33%), Positives = 389/782 (49%), Gaps = 98/782 (12%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
C+ + + + RA L+ ++ L EK++ L + + PR+G+PAYEWWSE+LHG+A + PGV
Sbjct: 96 CDQAATPAERAAGLVDIMELDEKLENLVNKSPGAPRIGLPAYEWWSEALHGVAKS-PGVT 154
Query: 115 FN----GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNIN 170
FN SS TSF +V +A+F+ L + + ++ EARA N G AGL +W PNIN
Sbjct: 155 FNKSSGAAFSSATSFSNPIVLSAAFDDELVEAVATQISTEARAFSNAGLAGLDWWTPNIN 214
Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
++DPRWGRG ETPGED + +S Y ++ +G
Sbjct: 215 PYKDPRWGRGMETPGEDALRISKYVKALLRGLEG-------------------------- 248
Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
SD M+ A CKH A DLE+W +RY+F+A +T QD + + P F+ C
Sbjct: 249 SDPTTRKMV-ANCKHYAANDLERWNGVTRYNFDAPVTLQDLSEYYLPAFKQCARDSNVGS 307
Query: 291 IMCSYNQV---------NGVPACLRGDLFQKA-RNEWGFK---GYITSDCDAVATIFEYQ 337
MC+YN + NG P C L R WG+K +ITSDC+AV ++
Sbjct: 308 FMCAYNAMSIKGKDLSWNGTPVCASKYLMNDILREHWGWKEHNNWITSDCNAVLHMWNQH 367
Query: 338 NYTKTHEDSAAGVLKAGMDINCGTCMLRHT--QSAIDKGKVQEKDIDRALLNLFSVQLRL 395
+++ T E++A AG D C T + A D+G + E +DRAL L+ +R
Sbjct: 368 HWSDTREEAAGSAYTAGTDTVCEVSNYDKTAVKGAFDRGLLDEDVVDRALKRLYEGLVRA 427
Query: 396 GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDK----KFLPLNKNAVSS 451
G F+G Y + DV T E +KLA +A +G+VL KN+ K L K +
Sbjct: 428 GYFDGP--DAPYRNITWADVNTPEARKLAHRSAVEGMVLTKNNGVLPIKLEELQKKG-KT 484
Query: 452 LAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHE-- 509
+A+IG V+N QM G Y+GI + L +A K A G P N G +
Sbjct: 485 VALIGNWVDNGEQMLGTYSGIAPFRNTPLAAAKALNLKMVTAGG----PVNQSTGSRDSW 540
Query: 510 ---AVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
A+ A +AD V+ G+DL+ E EDRDR SL P Q L++ ++ K V++ L
Sbjct: 541 TRPALNAAIQADVVLYFGGIDLSVEAEDRDRYSLAWPSAQAKLLSDISALGKPTVVVQL- 599
Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KV 625
G LD + + IS+I+W GYPG+ G A +II G P GRLP+T YP + +V
Sbjct: 600 -GTMLDDTALLDNKNISAIIWAGYPGQDGGTAAFDIITGKTAPSGRLPVTQYPAKYANQV 658
Query: 626 PMNDMNMRADSSRQ-----YPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISA 680
PM DM +R + PGR+YR+Y V+ FG GL +TN++ + S ++
Sbjct: 659 PMTDMEVRPSKDTKGGAASNPGRTYRWYD-EAVHPFGFGLHFTNFTTSVAVSSSSAISTS 717
Query: 681 SLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAG-DVDGSHVVMLFARVP 739
L++G HID+ + +SL ++SVTN G S+ + F R
Sbjct: 718 DLESGCKSE-----------KHIDKCSFPSSL----EVSVTNDGKSTTSSYAALAFVRGE 762
Query: 740 KVSQGTPEKQLIGFDRVHTVAKGSK-----EISFGVDPCEQLSIANKHGRRILPLGNHVL 794
+ P K L+ + ++H +A G E++ G L+ ++G +L G + +
Sbjct: 763 YGPKPYPLKTLVAYGKLHDIAPGQTKKVKLELTLG-----DLARTAENGDLVLYPGKYEV 817
Query: 795 MV 796
+V
Sbjct: 818 LV 819
>gi|440472411|gb|ELQ41274.1| beta-xylosidase [Magnaporthe oryzae Y34]
gi|440484691|gb|ELQ64724.1| beta-xylosidase [Magnaporthe oryzae P131]
Length = 792
Score = 387 bits (994), Expect = e-104, Method: Compositional matrix adjust.
Identities = 262/782 (33%), Positives = 392/782 (50%), Gaps = 98/782 (12%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
C+ + + + RA L+ ++ L EK++ L + + PR+G+PAYEWWSE+LHG+A + PGV
Sbjct: 41 CDQAATPAERAAGLVDIMELDEKLENLVNKSPGAPRIGLPAYEWWSEALHGVAKS-PGVT 99
Query: 115 FN----GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNIN 170
FN SS TSF +V +A+F+ L + + ++ EARA N G AGL +W PNIN
Sbjct: 100 FNKSSGAAFSSATSFSNPIVLSAAFDDELVEAVATQISTEARAFSNAGLAGLDWWTPNIN 159
Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
++DPRWGRG ETPGED + +S Y K +L+G E
Sbjct: 160 PYKDPRWGRGMETPGEDALRISKYV-------------------------KALLRGL-EG 193
Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
SD M+ A CKH A DLE+W +RY+F+A +T QD + + P F+ C
Sbjct: 194 SDPTTRKMV-ANCKHYAANDLERWNGVTRYNFDAPVTLQDLSEYYLPAFKQCARDSNVGS 252
Query: 291 IMCSYNQV---------NGVPACLRGDLFQKA-RNEWGFK---GYITSDCDAVATIFEYQ 337
MC+YN + NG P C L R WG+K +ITSDC+AV ++
Sbjct: 253 FMCAYNAMSIKGKDLSWNGTPVCASKYLMNDILREHWGWKEHNNWITSDCNAVLHMWNQH 312
Query: 338 NYTKTHEDSAAGVLKAGMDINCGTCMLRHT--QSAIDKGKVQEKDIDRALLNLFSVQLRL 395
+++ T E++A AG D C T + A D+G + E +DRAL L+ +R
Sbjct: 313 HWSDTREEAAGSAYTAGTDTVCEVSNYDKTAVKGAFDRGLLDEDVVDRALKRLYEGLVRA 372
Query: 396 GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAV----SS 451
G F+G Y + DV T E +KLA +A +G+VL KN+ LP+ + +
Sbjct: 373 GYFDGP--DAPYRNITWADVNTPEARKLAHRSAVEGMVLTKNN-GVLPIKLEELQKKGKT 429
Query: 452 LAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHE-- 509
+A+IG V+N QM G Y+GI + L +A K A G P N G +
Sbjct: 430 VALIGNWVDNGEQMLGTYSGIAPFRNTPLAAAKALNLKMVTAGG----PVNQSTGSRDSW 485
Query: 510 ---AVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
A+ A +AD V+ G+DL+ E EDRDR SL P Q L++ ++ K V++ L
Sbjct: 486 TRPALNAAIQADVVLYFGGIDLSVEAEDRDRYSLAWPSAQAKLLSDISALGKPTVVVQL- 544
Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KV 625
G LD + + IS+I+W GYPG+ G A +II G P GRLP+T YP + +V
Sbjct: 545 -GTMLDDTALLDNKNISAIIWAGYPGQDGGTAAFDIITGKTAPSGRLPVTQYPAKYANQV 603
Query: 626 PMNDMNMRADSSRQ-----YPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISA 680
PM DM +R + PGR+YR+Y V+ FG GL +TN++ + S ++
Sbjct: 604 PMTDMEVRPSKDTKGGAASNPGRTYRWYD-EAVHPFGFGLHFTNFTTSVAVSSSSAISTS 662
Query: 681 SLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAG-DVDGSHVVMLFARVP 739
L++G HID+ + +SL ++SVTN G S+ + F R
Sbjct: 663 DLESGCKSE-----------KHIDKCSFPSSL----EVSVTNDGKSTTSSYAALAFVRGE 707
Query: 740 KVSQGTPEKQLIGFDRVHTVAKGSK-----EISFGVDPCEQLSIANKHGRRILPLGNHVL 794
+ P K L+ + ++H +A G E++ G L+ ++G +L G + +
Sbjct: 708 YGPKPYPLKTLVAYGKLHDIAPGQTKKVKLELTLG-----DLARTAENGDLVLYPGKYEV 762
Query: 795 MV 796
+V
Sbjct: 763 LV 764
>gi|238578959|ref|XP_002388893.1| hypothetical protein MPER_12044 [Moniliophthora perniciosa FA553]
gi|215450599|gb|EEB89823.1| hypothetical protein MPER_12044 [Moniliophthora perniciosa FA553]
Length = 658
Score = 387 bits (994), Expect = e-104, Method: Compositional matrix adjust.
Identities = 249/725 (34%), Positives = 373/725 (51%), Gaps = 95/725 (13%)
Query: 96 YEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY 155
Y WWSE+L NF SS TSFP + A+F+ L I + ++ EARA
Sbjct: 1 YNWWSEAL----------NF----SSATSFPAPITMGATFDDGLIHAIATVISTEARAFN 46
Query: 156 NLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
N+ + GL F+ PNIN F+DPRWGRGQETPGEDP +S Y + V Q GG+G
Sbjct: 47 NVNRGGLDFFTPNINPFKDPRWGRGQETPGEDPFHISQYVYQLVTGLQ--------GGVG 98
Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTF 275
L ++A CKH AYDLE G SR+ F+A +T QD + +
Sbjct: 99 ------------------PTNLKIAADCKHWAAYDLENLG-VSRFEFDAKVTMQDLAEFY 139
Query: 276 QPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGF--KGYITSDCDAVAT 332
P F+SCI K + IMCSYN VNG+P+C L Q AR+ WG + +IT DC AV
Sbjct: 140 SPSFQSCIRDAKVASIMCSYNAVNGIPSCANRYLLQTLARDFWGLGEEQWITGDCGAVGN 199
Query: 333 IFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQS---AIDKGKVQEKDIDRALLNLF 389
IF +YT + A L AG DI+C + ++Q+ A+++ V E + A+ +
Sbjct: 200 IFARHHYTDDPANGTAVALNAGTDIDCDSGAAAYSQNLGQALNRSLVSEDQLRTAVTRQY 259
Query: 390 SVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAV 449
+ +RL DDV T ++LA AA +GIVLLKND LPL ++V
Sbjct: 260 NSLVRLSW---------------DDVNTEPAQQLAYQAAVEGIVLLKND-GILPL-ASSV 302
Query: 450 SSLAIIGPLVNNISQMGGGYTGIP---CSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAG 506
+A++GP+ N +QM Y GI SP+ R V+ +A+G + + +G
Sbjct: 303 KKVAVVGPMANATTQMQSNYNGIAPFLVSPQQAFRNAGFNVT---FANGT-GLNSSDTSG 358
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
F A+ A AD V V G+D T E EDRDR + G Q++LV +A K P+I++
Sbjct: 359 FSAAIAAADDADVVFYVGGIDTTIEREDRDRPEISWTGNQLALVQQLASLGK-PLIVLQM 417
Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK-V 625
GGG +D S ++ +++++W GYPG++G AL ++I G P GRLP+T YP S+
Sbjct: 418 GGGQVDSSSLRDNTSVNALIWGGYPGQSGGTALVDLITGKQAPAGRLPITQYPASYVDGF 477
Query: 626 PMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAG 685
PM DM +R SS PGR+Y++YTG ++ FG GL YT + ++ S ++ + +
Sbjct: 478 PMTDMTLRPSSSN--PGRTYKWYTGAPIFEFGFGLHYTTFDAEWASGGDSFSVQDLVSSA 535
Query: 686 SDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT 745
+ + H+D T ++VTN+G V +V +LF+R
Sbjct: 536 KNSGV----------AHVDLGVLDT-----FNVTVTNSGTVASDYVALLFSRTTAGPSPA 580
Query: 746 PEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMV-----GELR 800
P K+L+ + RV + G+ + ++ ++ G R+L G +VL++ G+++
Sbjct: 581 PNKELVSYTRVKGIEPGASSAASLKVTLGAVARTDEQGNRVLYPGEYVLLLDTGAEGKIQ 640
Query: 801 HSLTI 805
+T+
Sbjct: 641 KKITL 645
>gi|374372635|ref|ZP_09630297.1| Beta-glucosidase [Niabella soli DSM 19437]
gi|373235166|gb|EHP54957.1| Beta-glucosidase [Niabella soli DSM 19437]
Length = 734
Score = 387 bits (993), Expect = e-104, Method: Compositional matrix adjust.
Identities = 252/780 (32%), Positives = 376/780 (48%), Gaps = 129/780 (16%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
PF N LS R L+S LTL+EK++Q+ ++A AIPRLGIPAY+WWSE LHG+A
Sbjct: 27 PFWNYKLSFEARVNDLVSRLTLEEKVKQMLNHAPAIPRLGIPAYDWWSEVLHGVAR---- 82
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ---------AGLT 163
T T +PQ + AA+++ + A E RA++N GLT
Sbjct: 83 -----TPYHTTVYPQAIAMAATWDTVALYTMADQSAREGRAIHNKATEEGKNGDRYVGLT 137
Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
+W PNINIFRDPRWGRGQET GEDP + + FV+ QGE+ K
Sbjct: 138 YWTPNINIFRDPRWGRGQETYGEDPFLTAMLGRAFVRGLQGEDPKY-------------- 183
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
L +AC KH Y + R+SF+ +++ D +T+ P F+ +
Sbjct: 184 -------------LKAAACAKH---YAIHSGPEAVRHSFDVDVSDYDLWNTYLPAFKELV 227
Query: 284 EQGKASCIMCSYNQVNGVPAC----LRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNY 339
K + +MC+YN P C L D+ R +WGF GY+TSDC A+ F NY
Sbjct: 228 THAKVAGVMCAYNAFRKKPCCGSDLLMTDIL---RRQWGFTGYVTSDCGAIDDFF---NY 281
Query: 340 TKTHEDSAAGVLKA---GMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
KTH ++ A + A G D+ CG A+ G++ EK+IDR++ LF +++RLG
Sbjct: 282 HKTHPNAEAAAIDAVTNGTDVECGNRAYLTLTDAVKTGRIAEKEIDRSVKRLFMIRMRLG 341
Query: 397 LFNGDP-RKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAII 455
+F DP Y + P + ++ HK AL A++ IVLLKN+ LPL+K ++ +A++
Sbjct: 342 MF--DPVSMVSYAQTSPAVLESAPHKAQALKMAQESIVLLKNENHLLPLSK-SIKKIAVV 398
Query: 456 GPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCN--------SDAGF 507
GP +N + G Y G P + L G++A + S ++ N F
Sbjct: 399 GPNADNSIAVLGNYNGTPSKIVTALDGIKAKLGTN--GSVVYEKAVNFTNAMLPEGKTDF 456
Query: 508 HEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTS 557
K AD +I V G+ E E+ DR ++LLP Q + ++ T
Sbjct: 457 AALTSRVKDADAIIFVGGISPQLEGEEMKVNEPGFNSGDRTTILLPTVQTEAMKALKATG 516
Query: 558 KRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTW 617
K PV+ V+ G L + + + + I +I+ Y G+A A+A+++FGD+NP GRLP+T+
Sbjct: 517 K-PVVFVMMTGSALAIPWEQEN--IPAIVNAWYGGQAAGTAIADVLFGDYNPSGRLPVTF 573
Query: 618 YPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELT 677
Y +D ++ A + R+YR+++G +Y FG+GLSYT + Y+ L P+
Sbjct: 574 Y--------KSDADLPAFDDYRMENRTYRYFSGQALYPFGYGLSYTTFRYEGLKVPT--- 622
Query: 678 ISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFAR 737
T +R V I +TN G G VV L+
Sbjct: 623 -----------------------------TVKNKVRIPVSIQLTNTGAKGGEEVVQLYIS 653
Query: 738 VPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
P K L GF RV +K I F + P + L+IA ++G+ + P G + VG
Sbjct: 654 YQGQPIKKPLKALKGFQRVWLNRGQTKTIKFLLTP-DALAIAGENGKLLNPKGKLRISVG 712
>gi|367053033|ref|XP_003656895.1| glycoside hydrolase family 3 protein [Thielavia terrestris NRRL
8126]
gi|347004160|gb|AEO70559.1| glycoside hydrolase family 3 protein [Thielavia terrestris NRRL
8126]
Length = 758
Score = 386 bits (992), Expect = e-104, Method: Compositional matrix adjust.
Identities = 266/772 (34%), Positives = 384/772 (49%), Gaps = 92/772 (11%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSD------NASAIPRLGIPAYEWWSESLHGIAS 108
C+T+ S RA +L+ + + EK+ L + ++ PRLG+P YEWWSE+LHG+A+
Sbjct: 11 CDTTASPPKRAAALVEAMNITEKLANLVEYVMARSSSKGAPRLGLPPYEWWSEALHGVAA 70
Query: 109 NGPGVNFN---GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFW 165
+ PGV+FN G S TSF + +A+F+ L + ++ EARA N G AGL FW
Sbjct: 71 S-PGVSFNWSGGPFSYATSFANPITLSAAFDDELVQKVADVISTEARAFANAGSAGLDFW 129
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
PNIN +RDPRWGRG ETPGEDP+ + Y ++ +GE K+V+
Sbjct: 130 TPNINPWRDPRWGRGSETPGEDPVRIKGYVRSLLRGLEGEE------------SIKKVI- 176
Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
A CKH AYDLE+W N +RY F+A+++ QD + + PPF+ C
Sbjct: 177 ---------------ATCKHYAAYDLERWHNITRYEFDAIVSLQDLSEYYLPPFQQCARD 221
Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGF---KGYITSDCDAVATIF-EYQNYT 340
K IMCSYN +NG PAC L R W + YITSDC+A+ + N+T
Sbjct: 222 SKVGSIMCSYNSLNGTPACANTYLMDDILRKHWRWTEDNNYITSDCNAIKDFLPDEHNFT 281
Query: 341 KTHEDSAAGVLKAGMDINC---GTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
+T ++AA AG D C G+ A D+ + E+ IDRAL L+ +R G
Sbjct: 282 QTAAEAAAAAYTAGTDTVCEVAGSPPYTDVVGAYDQKLLSEEVIDRALRRLYEGLVRAGY 341
Query: 398 FNGDPRKGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIG 456
F DP Y +G DV T+E + LAL +A G+VLLKND LP+ K ++A+IG
Sbjct: 342 F--DPASASPYRDIGWSDVNTAEAQALALQSASDGLVLLKNDGT-LPI-KLEGKTVALIG 397
Query: 457 PLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASG-CHDVPCNSDAGFHEAVRIAK 515
+ M GGY+GIP S + YASG D +A+ A
Sbjct: 398 HWASGTRSMLGGYSGIPPYYHSPVYAAGQLNLTYKYASGPVAPASAARDTWTADALSAAN 457
Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
K+D ++ GLD + +ED+DR S+ P Q++L+ ++A K V++ L G +D +
Sbjct: 458 KSDVILYFGGLDQSVASEDKDRDSIAWPPAQLTLIQTLAGLGKPLVVIQL--GDQVDDTP 515
Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRA 634
+ +S+ILW GYPG++G A+ I G P GRLP+T YP S+T ++P+ DM++R
Sbjct: 516 LLTNPNVSAILWAGYPGQSGGTAVLNAITGVSPPAGRLPVTQYPSSYTSQLPLTDMSLRP 575
Query: 635 DSSRQYPGRSYRFYT-GTQVYGFGHGLSYTNYSYK------FLSAPSELTISASLKAGSD 687
D + PGR+YR+ V FG+GL YTN++ + F PS L L
Sbjct: 576 DPASGRPGRTYRWLPRNATVLPFGYGLHYTNFTARPNPAQNFTLTPSALLAPCKL----- 630
Query: 688 KNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-P 746
H D L + V + VTN G +V ++FA P
Sbjct: 631 -------------AHRD----LCPLPYPVTVEVTNTGARTSDYVGLVFATTRDAGPPPHP 673
Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
K L+ + R+ +A G + Q+ +A R+ GN VL G
Sbjct: 674 LKTLVAYARLRGIAPGRTARA-------QVQVALGDLARVDAAGNRVLYPGR 718
>gi|371776901|ref|ZP_09483223.1| beta-glucosidase [Anaerophaga sp. HS1]
Length = 720
Score = 386 bits (991), Expect = e-104, Method: Compositional matrix adjust.
Identities = 259/767 (33%), Positives = 382/767 (49%), Gaps = 111/767 (14%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
F + +L I TRAK+L+S LTL+EKI L N + RL IPAY WW+E+LHG+A G
Sbjct: 29 FRDEALDIETRAKALLSELTLKEKISLLGYNNPPVERLQIPAYNWWNEALHGVARAG--- 85
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
T FPQ + AA+F+ +L I A++ EAR+ YN+ ++ G+TFW
Sbjct: 86 -------EATVFPQAIALAATFDTTLVYRIADAISTEARSKYNINRSKGFQNQYLGITFW 138
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
PNINIFRDPRWGRGQET GEDP + ++ FVK Q
Sbjct: 139 TPNINIFRDPRWGRGQETYGEDPFLTASMGKAFVKGLQ---------------------- 176
Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
G E +R L +A KH + + R+ FNA++ E+D +T+ P F++ +E
Sbjct: 177 --GSEPER--RLKTAAGAKHFAVHSGPE---ADRHHFNAVVDEKDLRETYLPAFKALVEN 229
Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
G + IMC+YN+VNG P C L Q R+EWGFKG + +DC A+ I+ T
Sbjct: 230 G-VTTIMCAYNRVNGEPCCTGKTLLQDILRDEWGFKGQVVTDCWALDDIWLRHKTIPTRV 288
Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
+ AA +KAG++++C + Q AI+K + + +D ALL QL+LG ++ DP
Sbjct: 289 EVAAAAVKAGVNLDCANILQEDVQDAIEKRLLTLEQVDSALLPTLQTQLKLGFYD-DPSH 347
Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
Y G D V S H LA +AA + +VLLKND LPL K+ +SS+ ++G +IS
Sbjct: 348 SPYRHYGIDSVNNSYHISLAKEAAEKSMVLLKND-GILPLKKDTISSIMVVGENAASISA 406
Query: 465 MGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCHDVPCNSDAGFHEAVRIAKKADFVI 521
+ G Y G+ + + + GL Y GC +D + A D I
Sbjct: 407 LTGNYHGLSGNMVTFVEGLVKAGGPGMSVQYDYGC----SFADTSHFGGIWAAGFTDVTI 462
Query: 522 VVAGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLD 572
V GL E E D+ L +P + + + PVI V+TGG LD
Sbjct: 463 AVIGLSPLLEGEHGDAFLSNWGGDKKDLRMPRSHEIYLKKLRESHNHPVIAVVTGGSALD 522
Query: 573 VSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNM 632
+S E + +I++ YPGE G ALA++IFG+ +P GRLP+T+Y + P +D NM
Sbjct: 523 ISAIEPYA--DAIIYAWYPGEQGGTALADLIFGEVSPSGRLPITFYKDIKDLPPYHDYNM 580
Query: 633 RADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQ 692
R+YR++ G +Y FG+GLSYT++ Y++LS PS
Sbjct: 581 T--------NRTYRYFQGDVLYPFGYGLSYTSFHYEWLSKPST----------------- 615
Query: 693 QTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIG 752
V D++ S V I+VTN G +D V+ ++ P + + P ++L G
Sbjct: 616 -------KVSEDDIIS-------VNIAVTNTGTMDADEVIQVYIVYPDIER-MPLRELKG 660
Query: 753 FDRVHTVAKGSKEISFGVDPCEQLSIAN-KHGRRILPLGNHVLMVGE 798
F R+H A ++ + P + L + K+ R L G + + V +
Sbjct: 661 FSRIHIKAGQTQNTDIQI-PVKNLKKWDSKNNRWKLYKGKYKIQVSQ 706
>gi|182415033|ref|YP_001820099.1| Beta-glucosidase [Opitutus terrae PB90-1]
gi|177842247|gb|ACB76499.1| Beta-glucosidase [Opitutus terrae PB90-1]
Length = 905
Score = 386 bits (991), Expect = e-104, Method: Compositional matrix adjust.
Identities = 260/792 (32%), Positives = 391/792 (49%), Gaps = 135/792 (17%)
Query: 56 NTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF 115
++S + RA LI ++L EK+ QL + A IPRLG+PAY++W+E+ HGIA+NG
Sbjct: 207 DSSKPLRVRADDLIRRMSLAEKVSQLKNAAPGIPRLGLPAYDYWNEAAHGIANNG----- 261
Query: 116 NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN----------LGQAGLTFW 165
T FPQ + +AA++N +L G+ + +E RA +N GLT+W
Sbjct: 262 -----IATVFPQAIGAAAAWNPALLHQEGTVIGIEGRAKFNDYANRHNGDSKWWTGLTYW 316
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
APNIN+FRDPRWGRGQET GEDP + + +EFVK QG++ +
Sbjct: 317 APNINLFRDPRWGRGQETYGEDPFLTAEIGIEFVKGVQGDDPRY---------------- 360
Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
++ AC KH Y + +R+SFNA I E+D DT+ P F + +
Sbjct: 361 -----------MLAMACAKH---YAVHSGPERTRHSFNAEIPERDLFDTYLPHFERVVRE 406
Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIF--EYQNYTKT 342
GK + +M +YN VNGVPA L + R WGF+GY+ SDCDA+ I+ + +Y KT
Sbjct: 407 GKVAGVMSAYNAVNGVPASANSFLLTELLRKRWGFEGYVPSDCDAIRDIYGEKQHHYVKT 466
Query: 343 HEDSAAGVLKAGMDINCG---TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
E++AA +KAG ++ CG ++R A+ +G V EKD+D AL + + RLGLF
Sbjct: 467 AEEAAALAVKAGCNLCCGGDYNALVR----AVQQGLVTEKDLDGALYHTLWTRFRLGLF- 521
Query: 400 GDP-RKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
DP + + D H ++AL+ ARQ IVLLKND LPL++ + +A+IGP
Sbjct: 522 -DPAEQVPFSGYTLKDNDLPAHSQVALELARQAIVLLKNDGT-LPLDRTKLKQIAVIGPN 579
Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAG------------ 506
+ S + G Y G S+L + V + P + G
Sbjct: 580 AASKSMLEGNYHGSASRSISILDDIRNLVGSEIKITHAMGSPVTTKPGTAPWSGQDNTTD 639
Query: 507 ------FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRP 560
EA+++A +AD +I V G+ QE E DR S+ LP +Q L+ ++ T K P
Sbjct: 640 RPVAELKAEALKLAAEADAIIYVGGITPAQEGESFDRESIELPSEQEDLIRALHATGK-P 698
Query: 561 VILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPE 620
V++V G + +++ D + +I+ YPG+ G +A+AE++FG+ NP G LP+T+Y
Sbjct: 699 VVMVNCSGSAMALTWQ--DENLPAIVQAWYPGQEGGRAVAEVLFGETNPSGHLPITFYRS 756
Query: 621 SFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISA 680
+ +D +M+ R+YR++TG +Y FGHGLSY+ + Y L
Sbjct: 757 TADLPDFSDYSMK--------NRTYRYFTGRPLYAFGHGLSYSTFEYANL---------- 798
Query: 681 SLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPK 740
V + V + +TN+G DG VV L+A P
Sbjct: 799 ------------------------RVAPAANGALTVTLDLTNSGKRDGDDVVQLYATPPA 834
Query: 741 VSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPC--EQLSIANKHGRRILPLGNHVLMVG- 797
SQ + L GF R H A ++ ++ V + IA K +P G+ + G
Sbjct: 835 SSQPQELRALCGFRRTHVKAGETRTVTVTVPAVALRRWDIAKKD--YAIPSGDWTIAAGA 892
Query: 798 ---ELRHSLTIE 806
+LR TI+
Sbjct: 893 SSADLRQKATIK 904
>gi|238923424|ref|YP_002936940.1| beta-glucosidase [Eubacterium rectale ATCC 33656]
gi|238875099|gb|ACR74806.1| beta-glucosidase [Eubacterium rectale ATCC 33656]
Length = 714
Score = 385 bits (988), Expect = e-104, Method: Compositional matrix adjust.
Identities = 244/735 (33%), Positives = 365/735 (49%), Gaps = 112/735 (15%)
Query: 65 AKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTS 124
AK L+S +T+ EKI Q+ + AI RLGIP Y WW+E+LHG+A G T
Sbjct: 9 AKKLVSQMTIDEKISQMLYESPAIERLGIPEYNWWNEALHGVARAG----------VATV 58
Query: 125 FPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFWAPNINIFRDPR 176
FPQ + AA+F+ L IG V+ E R +N GLTFWAPN+NIFRDPR
Sbjct: 59 FPQAIGLAATFDTDLIEKIGDVVSTEGRGKFNEFSKKGDHGIYKGLTFWAPNVNIFRDPR 118
Query: 177 WGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDE 236
WGRG ET GEDP + +++ QG D D
Sbjct: 119 WGRGHETYGEDPYLTGKLGCAYIRGLQG---------------------------DDPDH 151
Query: 237 LMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYN 296
L +AC KH + + R+ F+A ++ D DT+ F+ C++ K +M +YN
Sbjct: 152 LKSAACAKH---FAVHSGPEAIRHEFDAKASKHDMYDTYLYAFKRCVKDAKVEAVMGAYN 208
Query: 297 QVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGM 355
+VNG PAC L + R+E+GF+G++ SDC A+ E+ + T T E+SAA + G
Sbjct: 209 RVNGEPACGSRTLLKDILRDEFGFEGHVVSDCWAILDFHEHHHVTDTVEESAAMAVNNGC 268
Query: 356 DINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDV 415
D+NCG+ L H + A DKG V ++ I A+ L V++RLG+ P Y + + V
Sbjct: 269 DLNCGSAFL-HLKDAYDKGMVSDEAITAAVERLMEVRIRLGMMKDYP--SPYEDISYEVV 325
Query: 416 CTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCS 475
EH +L+++AAR+ +VLLKN FLPL++ V ++A+IGP N+ + G Y G
Sbjct: 326 ECKEHVELSVEAARRSLVLLKNKDNFLPLDRKNVKTIAVIGPNANSRDALIGNYYGTSSR 385
Query: 476 PKSLLRGLEAYV---SKTHYASGCHDVP------CNSDAGFHEAVRIAKKADFVIVVAGL 526
+ L GL+ Y+ ++ YA GCH F EA+ +A+++D V++ GL
Sbjct: 386 YITPLEGLQQYLGEDTRVLYAEGCHLYKDKVQGLAEEKDRFKEALIMAEQSDVVVMCLGL 445
Query: 527 DLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAE 577
D T E E+ D++ L+LPG Q L+ +VA K PVILVL+ G +D+S+AE
Sbjct: 446 DATIEGEEGDAGNEYASGDKLGLMLPGLQEELLEAVAAVGK-PVILVLSAGSAIDLSWAE 504
Query: 578 ADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSS 637
+ +I+ YPG G KA+AE IFG+++P G+LP+T+Y + D +M
Sbjct: 505 --EHVDAIIDSWYPGARGGKAVAEAIFGEYSPNGKLPVTFYQGTENLPEFTDYSM----- 557
Query: 638 RQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSR 697
R+YR+ +Y FG+GL Y +Y LS
Sbjct: 558 ---AHRTYRYTNENVLYPFGYGLHYGETNYDGLS-------------------------- 588
Query: 698 LDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVH 757
+D+ S + V ++VTN + +V L+ R ++ P QL G + V
Sbjct: 589 -----VDKAESDVNEPVEVFVNVTNDSRYTVNEIVQLYIRHVDAAEYEPGYQLKGIEVVK 643
Query: 758 TVAKGSKEISFGVDP 772
+K++ + P
Sbjct: 644 LEPHETKKVKLTLSP 658
>gi|291528382|emb|CBK93968.1| Beta-glucosidase-related glycosidases [Eubacterium rectale M104/1]
Length = 714
Score = 384 bits (986), Expect = e-103, Method: Compositional matrix adjust.
Identities = 244/735 (33%), Positives = 365/735 (49%), Gaps = 112/735 (15%)
Query: 65 AKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTS 124
AK L+S +T+ EKI Q+ + AI RLGIP Y WW+E+LHG+A G T
Sbjct: 9 AKKLVSQMTIDEKISQMLYESPAIERLGIPEYNWWNEALHGVARAG----------VATV 58
Query: 125 FPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFWAPNINIFRDPR 176
FPQ + AA+F+ L IG V+ E R +N GLTFWAPN+NIFRDPR
Sbjct: 59 FPQAIGLAAAFDADLIEKIGDVVSTEGRGKFNEFSKKGDHGIYKGLTFWAPNVNIFRDPR 118
Query: 177 WGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDE 236
WGRG ET GEDP + +++ QG D D
Sbjct: 119 WGRGHETYGEDPYLTGKLGCAYIRGLQG---------------------------DDPDH 151
Query: 237 LMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYN 296
L +AC KH + + R+ F+A ++ D DT+ F+ C++ K +M +YN
Sbjct: 152 LKSAACAKH---FAVHSGPEAIRHEFDAKASKHDMYDTYLYAFKRCVKDAKVEAVMGAYN 208
Query: 297 QVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGM 355
+VNG PAC L + R+E+GF+G++ SDC A+ E+ + T T E+SAA + G
Sbjct: 209 RVNGEPACGSRTLLKDILRDEFGFEGHVVSDCWAILDFHEHHHVTDTVEESAAMAVNNGC 268
Query: 356 DINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDV 415
D+NCG+ L H + A DKG V ++ I A+ L V++RLG+ P Y + + V
Sbjct: 269 DLNCGSAFL-HLKDAYDKGLVSDEAITAAVERLMEVRIRLGMMKDYP--SPYEDISYEVV 325
Query: 416 CTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCS 475
EH +L+++AAR+ +VLLKN FLPL++ V ++A+IGP N+ + G Y G
Sbjct: 326 ECKEHVELSVEAARRSLVLLKNKDNFLPLDRKNVKTIAVIGPNANSRDALIGNYYGTSSR 385
Query: 476 PKSLLRGLEAYV---SKTHYASGCHDVP------CNSDAGFHEAVRIAKKADFVIVVAGL 526
+ L GL+ Y+ ++ YA GCH F EA+ +A+++D V++ GL
Sbjct: 386 YITPLEGLQQYLGDDTRVLYAEGCHLYKDKVQGLAEEKDRFKEALIMAEQSDVVVMCLGL 445
Query: 527 DLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAE 577
D T E E+ D++ L+LPG Q L+ +VA K PVILVL+ G +D+S+AE
Sbjct: 446 DATIEGEEGDAGNEYASGDKLGLMLPGLQEELLEAVAAVGK-PVILVLSAGSAIDLSWAE 504
Query: 578 ADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSS 637
+ +I+ YPG G KA+AE IFG+++P G+LP+T+Y + D +M
Sbjct: 505 --EHVDAIIDSWYPGARGGKAVAEAIFGEYSPSGKLPVTFYQGTENLPEFTDYSM----- 557
Query: 638 RQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSR 697
R+YR+ +Y FG+GL Y +Y LS
Sbjct: 558 ---AHRTYRYTNENVLYPFGYGLHYGETNYDGLS-------------------------- 588
Query: 698 LDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVH 757
+D+ S + V ++VTN + +V L+ R ++ P QL G + V
Sbjct: 589 -----VDKAESDVNEPVEVFVNVTNDSRYTVNEIVQLYIRHVDAAEYEPGYQLKGIEVVK 643
Query: 758 TVAKGSKEISFGVDP 772
+K++ + P
Sbjct: 644 LEPHETKKVKLTLSP 658
>gi|266619450|ref|ZP_06112385.1| beta-glucosidase [Clostridium hathewayi DSM 13479]
gi|288869013|gb|EFD01312.1| beta-glucosidase [Clostridium hathewayi DSM 13479]
Length = 714
Score = 384 bits (985), Expect = e-103, Method: Compositional matrix adjust.
Identities = 249/771 (32%), Positives = 377/771 (48%), Gaps = 116/771 (15%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ + S + R + L+S +TL+EK+ QL +A A+ RLGIP+Y WW+E+LHG+A G
Sbjct: 5 YLDESRTDEERVRDLVSQMTLEEKVSQLRYDAPAVERLGIPSYNWWNEALHGVARAG--- 61
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG--------QAGLTFW 165
+ T FPQ + AA F+ +L IG A+E RA Y+ G+TFW
Sbjct: 62 -------AATVFPQAIGLAAMFDEALLEKIGDVTALEGRAKYHEAVRNGDRGLYKGITFW 114
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
+PNINIFRDPRWGRG ET GEDP + ++K QG
Sbjct: 115 SPNINIFRDPRWGRGHETYGEDPCLTGRMGTAYIKGMQG--------------------- 153
Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
G L +AC KH A+ + G R+SFN++++++D +T+ P F C+++
Sbjct: 154 -------NGKRLKAAACVKHFAAHSGPEKG---RHSFNSVVSKKDLTETYFPAFERCVKE 203
Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
+M YN++NG AC L + R +WGF GY SDC A+ + T T +
Sbjct: 204 AGVEGVMGGYNRLNGEAACGSHHLITEILREKWGFDGYYVSDCGAIKDFHMHHGLTDTPQ 263
Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
+SAA LK+G D+NCG L H SA ++G V +DIDRA+ +L ++RLG+F+ +
Sbjct: 264 ESAALALKSGCDLNCGAVYL-HVMSAYNQGLVSAEDIDRAVTHLMMTRMRLGMFD---QH 319
Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
++ ++ + +EH LAL AA + +VLLKND LPL+K A+ ++A+IGP ++
Sbjct: 320 TEFDEIPYEINDCAEHHGLALKAAEESMVLLKND-GILPLDKTALKTVAVIGPNGDSEEI 378
Query: 465 MGGGYTGIPCSPKSLLRGLEAYVSKTHY---ASGCHDVPCN------SDAGFHEAVRIAK 515
+ G Y G ++L G+ A + K + G H N +D EAV +A
Sbjct: 379 LKGNYNGTATEKYTILEGIRAVLGKETRIFCSEGSHLYRDNVENLAEADDRLKEAVSMAV 438
Query: 516 KADFVIVVAGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
++D V + GL+ T E E+ D+ L LP QM L+ +V T PVIL+L
Sbjct: 439 RSDVVFLCLGLNGTLEGEEGDANNSYAGADKADLNLPESQMRLLKAVCGTGT-PVILLLA 497
Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
G + +++A S+IL I YPG+ G A A ++ G+ P GRLP+T+Y +
Sbjct: 498 AGSAMAINYAA--EHCSAILHIWYPGQMGGLAAARLLTGEAVPSGRLPVTFYQTTEELPE 555
Query: 627 MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGS 686
D +M+ GR+YR+ +Y FG+GLSY ++ Y A +AG
Sbjct: 556 FTDYSMK--------GRTYRYMEREALYPFGYGLSYGDFEYSNFKAE-------QTEAGP 600
Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
D + RF V+I TN + + ++ R+ P
Sbjct: 601 DGQV----------------------RFSVKI--TNRSKAECDEIAEVYVRIADSELAAP 636
Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
L F R+H A S + F + P + + N+ G IL V+ G
Sbjct: 637 GGSLADFRRIHMKAGESVTVPFTL-PVKAFMVVNEEGEYILDGSTAVVTCG 686
>gi|291525508|emb|CBK91095.1| Beta-glucosidase-related glycosidases [Eubacterium rectale DSM
17629]
Length = 714
Score = 383 bits (984), Expect = e-103, Method: Compositional matrix adjust.
Identities = 243/735 (33%), Positives = 365/735 (49%), Gaps = 112/735 (15%)
Query: 65 AKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTS 124
AK L+S +T+ EKI Q+ + AI RLGIP Y WW+E+LHG+A G T
Sbjct: 9 AKKLVSQMTIDEKISQMLYESPAIERLGIPEYNWWNEALHGVARAG----------VATV 58
Query: 125 FPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFWAPNINIFRDPR 176
FPQ + AA+F+ L IG V+ E R +N GLTFWAPN+NIFRDPR
Sbjct: 59 FPQAIGLAATFDTDLIEKIGDVVSTEGRGKFNEFSKKGDHGIYKGLTFWAPNVNIFRDPR 118
Query: 177 WGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDE 236
WGRG ET GEDP + +++ QG D D
Sbjct: 119 WGRGHETYGEDPYLTGKLGCAYIRGLQG---------------------------DDPDH 151
Query: 237 LMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYN 296
L +AC KH + + R+ F+A ++ D DT+ F+ C++ K +M +YN
Sbjct: 152 LKSAACAKH---FAVHSGPEAIRHEFDAKASKHDMYDTYLYAFKRCVKDAKVEAVMGAYN 208
Query: 297 QVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGM 355
+VNG PAC L + R+E+GF+G++ SDC A+ E+ + T T E+SAA + G
Sbjct: 209 RVNGEPACGSRTLLKDILRDEFGFEGHVVSDCWAILDFHEHHHVTDTVEESAAMAVNNGC 268
Query: 356 DINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDV 415
D+NCG+ L H + A DKG V ++ I A+ L V++RLG+ P Y + + V
Sbjct: 269 DLNCGSAFL-HLKDAYDKGLVSDEAITAAVERLMEVRIRLGMMKDYP--SPYEDISYEVV 325
Query: 416 CTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCS 475
EH +L+++AAR+ +VLLKN FLPL++ V ++A+IGP N+ + G Y G
Sbjct: 326 ECKEHVELSVEAARRSLVLLKNKDNFLPLDRKNVKTIAVIGPNANSRDALIGNYYGTSSR 385
Query: 476 PKSLLRGLEAYV---SKTHYASGCHDVP------CNSDAGFHEAVRIAKKADFVIVVAGL 526
+ L GL+ Y+ ++ YA GCH F EA+ +A+++D V++ GL
Sbjct: 386 YITPLEGLQQYLGEDTRVLYAEGCHLYKDKVQGLAEEKDRFKEALIMAEQSDVVVMCLGL 445
Query: 527 DLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAE 577
D T E E+ D++ L+LPG Q L+ +VA K PVILVL+ G +D+S+AE
Sbjct: 446 DATIEGEEGDAGNEYASGDKLGLMLPGLQEELLEAVAAVGK-PVILVLSAGSAIDLSWAE 504
Query: 578 ADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSS 637
+ +I+ YPG G KA+AE IFG+++P G+LP+T+Y + D +M
Sbjct: 505 --EHVDAIIDSWYPGARGGKAVAEAIFGEYSPSGKLPVTFYQGTENLPEFTDYSM----- 557
Query: 638 RQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSR 697
R+YR+ +Y FG+GL Y +Y +S
Sbjct: 558 ---AHRTYRYTNENVLYPFGYGLHYGETNYDGMS-------------------------- 588
Query: 698 LDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVH 757
+D+ S + V ++VTN + +V L+ R ++ P QL G + V
Sbjct: 589 -----VDKAESDVNEPVEVFVNVTNDSRYTVNEIVQLYIRHVDAAEYEPGYQLKGIEVVK 643
Query: 758 TVAKGSKEISFGVDP 772
+K++ + P
Sbjct: 644 LEPYETKKVKLTLSP 658
>gi|336435507|ref|ZP_08615222.1| hypothetical protein HMPREF0988_00807 [Lachnospiraceae bacterium
1_4_56FAA]
gi|336000960|gb|EGN31106.1| hypothetical protein HMPREF0988_00807 [Lachnospiraceae bacterium
1_4_56FAA]
Length = 717
Score = 383 bits (984), Expect = e-103, Method: Compositional matrix adjust.
Identities = 257/764 (33%), Positives = 375/764 (49%), Gaps = 114/764 (14%)
Query: 64 RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
+A+ L+ +TL EK QL +A AIPRL IPAY WW+ESLHG+A G + T
Sbjct: 13 QAEELVDQMTLMEKASQLRYDAPAIPRLHIPAYNWWNESLHGVARGG----------TAT 62
Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG--------QAGLTFWAPNINIFRDP 175
FPQ + AASF+R + IG A+A+E RA YN GLTFWAPN+NIFRDP
Sbjct: 63 VFPQAIGLAASFDREMLEEIGEAIALEGRAKYNAAVKLDDRDIYKGLTFWAPNVNIFRDP 122
Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
RWGRG ET GEDP + S V +++ QG+ G+
Sbjct: 123 RWGRGHETYGEDPYLSSRLGVSYIRGLQGD----------------------------GE 154
Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
+ +AC KH + + R+ F+A ++E+D +T+ P F++C+++G +M +Y
Sbjct: 155 TMKAAACAKH---FAVHSGPEALRHEFDAEVSEKDLRETYLPAFQACVQEGHVEAVMGAY 211
Query: 296 NQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
N VNG P C L +K R EWGF G++ SDC A+ E T T SAA ++AG
Sbjct: 212 NCVNGEPCCGSETLLKKILREEWGFDGHVVSDCWAIKDFHENHLVTGTPVQSAALAMEAG 271
Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDD 414
D+NCG L H A +G V E I A + LF+ + LG+F+G +Y +
Sbjct: 272 CDLNCGVTYL-HLVHACQEGLVTEAQITEAAIRLFTTRFLLGMFDG----SEYDSVPYTV 326
Query: 415 VCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPC 474
V EH+ L+ AAR+ IVLLKN+ LPL++ + ++ IIGP ++ + G Y G
Sbjct: 327 VECKEHRDLSERAARESIVLLKNN-GILPLDREKLKTIGIIGPNADSRKALIGNYHGTSS 385
Query: 475 SPKSLLRGLEAYVS---KTHYASGCH------DVPCNSDAGFHEAVRIAKKADFVIVVAG 525
++L G+ V + Y+ GCH + EA +A+++D VI+ G
Sbjct: 386 EYITVLEGVRRLVGDEVRILYSDGCHLYENKTENLAREQDRLSEARIVARESDVVILCLG 445
Query: 526 LDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFA 576
LD T E E+ D+V L LP Q L+ +VA K+P +L L G +D+SFA
Sbjct: 446 LDETLEGEEGDTGNSYASGDKVDLRLPKSQRMLMEAVA-MEKKPTVLCLMAGSDIDLSFA 504
Query: 577 EADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRAD 635
E LW YPG G A A+I+FG +P G+LP+T+Y ES +P D +MR
Sbjct: 505 EKHFDAIVDLW--YPGAYGGAAAADILFGKCSPSGKLPITFY-ESLEVLPSFEDYSMR-- 559
Query: 636 SSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTG 695
GR+YR+ Y FG+GL+YT +N+ +
Sbjct: 560 ------GRTYRYLEQKAQYPFGYGLTYTKMKI--------------------RNVWLENA 593
Query: 696 SRLDYVHIDEVTSCTSLRFHVQI--SVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
+ + EVT + V + V N G +D V+ ++ R + TP L GF
Sbjct: 594 EK----DMKEVTDGENAEAAVIVCAEVENCGGMDSQEVLQIYIRDTESEHETPHPHLAGF 649
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
+R+ V KG K++ ++ ++ GRR G + + G
Sbjct: 650 ERIF-VEKGVKKLVKIPVNRSAFTVVDESGRRFTDSGKYEIFAG 692
>gi|372209074|ref|ZP_09496876.1| glycoside hydrolase family protein [Flavobacteriaceae bacterium
S85]
Length = 727
Score = 383 bits (983), Expect = e-103, Method: Compositional matrix adjust.
Identities = 255/770 (33%), Positives = 377/770 (48%), Gaps = 113/770 (14%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
F +T SI RA+ L+S +TL+EKI QL + A AI RL +P Y+WW+E+LHG+A NG
Sbjct: 21 FLDTDKSIEERAEILVSQMTLKEKIAQLKNTAPAISRLKVPDYDWWNEALHGVARNG--- 77
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ--------AGLTFW 165
T FPQ + A+F+ L + SA++ EARA Y + Q AGLTFW
Sbjct: 78 -------KATIFPQGIGIGATFDPDLALRVASAISTEARAKYTISQQMGNHSRYAGLTFW 130
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
PN+NIFRDPRWGRGQET GEDP +++ V FVK QG
Sbjct: 131 TPNVNIFRDPRWGRGQETFGEDPYLMTQMGVAFVKGLQG--------------------- 169
Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
D + L +AC KH Y + R FNA+ T+QD +T+ P F + ++
Sbjct: 170 ------DDPNYLKSAACAKH---YAVHSGPESLRLEFNAVPTQQDLYETYLPAFEALVKD 220
Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
+M ++N V G P L R+ WGF GY+ +DC A+ I Y +
Sbjct: 221 ANVEGVMPAHNAVFGAPMAANKFLLTDVLRDRWGFDGYVVTDCGAIKQIKVGHKYVDSEV 280
Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
+AA LKAG ++NCG + + AID+G V E+ + LF + RLG+F+ D K
Sbjct: 281 AAAAVALKAGTNLNCGATY-KELKKAIDQGLVTEELVHERTKQLFKTRFRLGMFDKDLSK 339
Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
Y K+GP+ + + EH +LA +AA++ IV+LKN LPL + + + GP N+
Sbjct: 340 NPYSKIGPELIHSKEHIELAREAAQKSIVMLKNKNNLLPLPTD-IKVPYVTGPFANSSDM 398
Query: 465 MGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCHDVPCNSDAGFHEAVRIAKKADFVI 521
+ G Y G+ ++L G+ VS +Y SG N + + A +A +D I
Sbjct: 399 LMGSYYGVSPGVVTILAGITDAVSLGTSLNYRSGALPFQKNINPK-NWAPNVAGMSDVTI 457
Query: 522 VVAGLDLTQETE---------DRDRVSLLLPGQQMSLVTSVARTSK-RPVILVLTGGGPL 571
V GL +E E DR+ L LP Q++ V +A K +P++LV+ G P
Sbjct: 458 CVVGLTADREGEGVDAIASNHKGDRLDLKLPENQINYVKQLAAKKKDKPLVLVIASGSP- 516
Query: 572 DVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDM 630
VS + +IL I YPGE G A+A+++FG +P G LPMT +P+S ++P D
Sbjct: 517 -VSLEGIEEHCDAILQIWYPGEQGGNAVADVLFGKVSPTGHLPMT-FPKSVAQLPDYKDY 574
Query: 631 NMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAP-SELTISASLKAGSDKN 689
+M+ GR+Y++ T ++ FG GL+Y+ +K L ++L SLK
Sbjct: 575 SMK--------GRTYKYMTEEPMFPFGFGLTYSKTEFKNLVVEDAKLRKKESLK------ 620
Query: 690 ILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLF--ARVPKVSQGTPE 747
V + VTN GD D +V L+ + K +G P
Sbjct: 621 --------------------------VSVEVTNVGDFDIDEIVQLYISPKSQKEGEGLPF 654
Query: 748 KQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
L F RV ++++ F + P E L + N G+++ G + + VG
Sbjct: 655 TTLKAFKRVALKKGETQKVEFTIHP-ESLKVINVKGQKVWRKGAYKVTVG 703
>gi|336425135|ref|ZP_08605165.1| hypothetical protein HMPREF0994_01171 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336013044|gb|EGN42933.1| hypothetical protein HMPREF0994_01171 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 705
Score = 383 bits (983), Expect = e-103, Method: Compositional matrix adjust.
Identities = 255/763 (33%), Positives = 378/763 (49%), Gaps = 120/763 (15%)
Query: 64 RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
+A L+S +TL+EK QL +A AIPRLG+P Y WW+E+LHG+A G T
Sbjct: 10 KAHELVSQMTLEEKASQLRYDAPAIPRLGVPTYNWWNEALHGVARAG----------VAT 59
Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ--------AGLTFWAPNINIFRDP 175
SFPQ + AA+F+ L +G AVA E RA YN GLTFW+PN+NIFRDP
Sbjct: 60 SFPQAIAMAAAFDDELLKTVGDAVAAEGRAKYNEYSRHDDRDIYKGLTFWSPNVNIFRDP 119
Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
RWGRG ET GEDP + S V +V+ QG S D
Sbjct: 120 RWGRGHETYGEDPYLTSRLGVAYVEGLQG--------------------------SQDDD 153
Query: 236 ELMLSACCKHLIAYDLEKWGNFS-RYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCS 294
+ +AC KH + G S R+ F+A +++D +T+ P F +C+++ +M +
Sbjct: 154 FMKTAACAKHFAVHS----GPESVRHEFDAQASKKDMYETYLPAFEACVKEAGVEAVMGA 209
Query: 295 YNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKA 353
YN+ NG P C L Q R EW F+G+ SDC A+A + TKT E+SAA LK+
Sbjct: 210 YNRTNGEPCCGSPTLIQNILREEWDFQGHYVSDCWAIADFHMHHMVTKTPEESAALALKS 269
Query: 354 GMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPD 413
G D+NCG L H A +G V E++I +A LF+ + LG F+ K +Y + +
Sbjct: 270 GCDVNCGVTYL-HLLKAYQQGLVTEEEITQAAERLFTTRFLLGCFD----KNEYDDIPYE 324
Query: 414 DVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIP 473
V EH +LA A++ +VLLKND LPLNK+ + ++ +IGP ++ + + G Y G
Sbjct: 325 VVECKEHLELAQKMAKESMVLLKND-GILPLNKDGLKTIGVIGPNADSRTPLVGNYHGTS 383
Query: 474 CSPKSLLRGLEAYVS---KTHYASGCHDVPCNSDA------GFHEAVRIAKKADFVIVVA 524
+LL G++ +V + +Y+ GCH + EA+ +A+ +D V++
Sbjct: 384 SRYITLLEGIQDFVGEDVRVYYSEGCHIYKDRVEGLGWKQDRISEALTVAEHSDVVVLCL 443
Query: 525 GLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
GLD E E+ D+ L LP Q L+ +VA K PV+L + G +D+ F
Sbjct: 444 GLDENLEGEEGDTGNSYASGDKKDLELPESQRELLEAVAGCGK-PVVLCMMSGSAIDMQF 502
Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRAD 635
A +++IL + YPG G KA AEI+FG +P G+LP+T+Y + D +M+
Sbjct: 503 AA--EHVNAILQVWYPGARGGKAAAEILFGACSPSGKLPVTFYKDLEGFPAFEDYSMK-- 558
Query: 636 SSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTG 695
GR+YR+ +Y FG+GL+Y K +ELT +++ G + I
Sbjct: 559 ------GRTYRYLEKEPLYPFGYGLTYGQVCVK----AAELT--GAVEEGKELTI----- 601
Query: 696 SRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDR 755
+ V N+G D V+ ++ + P L F R
Sbjct: 602 ---------------------KAMVENSGKYDTDDVIQVYIKDLDSKNAVPNHSLCAFKR 640
Query: 756 VHTVAKGSK-EISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
V ++ KG K EI V P E L ++ G++ + + VL VG
Sbjct: 641 V-SLKKGEKAEILLKV-PYEALMAVDEEGKKYVDSSHFVLSVG 681
>gi|410098444|ref|ZP_11293422.1| hypothetical protein HMPREF1076_02600 [Parabacteroides goldsteinii
CL02T12C30]
gi|409222318|gb|EKN15263.1| hypothetical protein HMPREF1076_02600 [Parabacteroides goldsteinii
CL02T12C30]
Length = 738
Score = 382 bits (982), Expect = e-103, Method: Compositional matrix adjust.
Identities = 249/777 (32%), Positives = 370/777 (47%), Gaps = 119/777 (15%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
YPF N L + R + +IS LTL+EK+Q + A A+PRLGIPAY WW+E+LHG+A
Sbjct: 26 YPFRNPDLPLDVRVQDIISRLTLEEKVQLMKHAAPAVPRLGIPAYNWWNEALHGVAR--- 82
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN----LGQAG-----L 162
T VT FPQ + AA+F+ +G + E RA++N G+ G L
Sbjct: 83 ------TKEKVTVFPQAIGMAATFDTEALQKMGDMTSSEGRALFNEDLKAGKTGEIYRGL 136
Query: 163 TFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKR 222
T+W PNINIFRDPRWGRGQET GEDP + + V +G N
Sbjct: 137 TYWTPNINIFRDPRWGRGQETYGEDPYLTAKMGSAIVHGLEGNN---------------- 180
Query: 223 VLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSC 282
+ L AC KH Y + +R+S++A ++ D DT+ P FR
Sbjct: 181 -----------PEYLKSVACAKH---YAVHSGPEHNRHSYDARVSMYDLWDTYLPAFREL 226
Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTK 341
+ + K +MC+YN+ G P C +L Q RN+W F GY+TSDC AV+ ++ Y K
Sbjct: 227 VTKAKVHGVMCAYNRFEGTPCCGHNELLQDILRNQWKFDGYVTSDCWAVS---DFAKYHK 283
Query: 342 THEDSAAGVLKA---GMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
TH + V A G D+ CG + + Q ++KG + EKDI+ +L LF +Q +LG++
Sbjct: 284 THSNDTEAVADAVLNGTDLECGN-LYQKLQQGVEKGLISEKDINVSLARLFEIQFKLGMY 342
Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
+ R Y +G + + HKK A + A++ +VLLKN+K LPLN + + +A+IGP
Sbjct: 343 DPADRV-PYASIGREVIECDAHKKHAYEMAQKSMVLLKNNKNILPLNASKIKRIALIGPN 401
Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDV----PCNSDAGFHEAVRIA 514
++N S + Y G P + + L+ + V F + A
Sbjct: 402 MDNGSTLLANYFGTPSEIITPYKSLQKRFGNSIQIDTLTGVGIVQKLEGAPSFAQVAAQA 461
Query: 515 KKADFVIVVAGLDLTQE-------------TEDRDRVSLLLPGQQMSLVTSVARTSKRPV 561
KKAD +I V G+ E DR ++ LP Q L+ + +T RP+
Sbjct: 462 KKADIIIFVGGISADYEGEAGDAGAAGYGGFASGDRTTMKLPPVQTELMKELKKTG-RPL 520
Query: 562 ILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPES 621
ILV G + SF +IL Y G+A A+ +++FGD+NP GR+P+T Y
Sbjct: 521 ILVNMSGSVM--SFDWESRNADAILQAWYGGQAAGDAITDVLFGDYNPAGRMPLTTY--- 575
Query: 622 FTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISAS 681
MND ++ R+YR++ G Y FG+GLSYT + Y L S + S
Sbjct: 576 -----MNDEDLPDFEDYSMANRTYRYFKGDVRYPFGYGLSYTTFGYAPLQNASTVKTGES 630
Query: 682 LKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKV 741
++ V +VTN G G VV L+ P+
Sbjct: 631 IQ--------------------------------VTTTVTNTGKRAGDEVVQLYISHPQN 658
Query: 742 SQG-TPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
P + L GF R+H S++++F + P E+LS+ ++ G ++ G L +G
Sbjct: 659 GNTRVPLRALKGFKRIHLDTGESRQVTFTLSP-EELSLVDEKGNQVEKEGTVELYIG 714
>gi|171695518|ref|XP_001912683.1| hypothetical protein [Podospora anserina S mat+]
gi|170948001|emb|CAP60165.1| unnamed protein product [Podospora anserina S mat+]
Length = 805
Score = 380 bits (977), Expect = e-102, Method: Compositional matrix adjust.
Identities = 265/806 (32%), Positives = 397/806 (49%), Gaps = 117/806 (14%)
Query: 55 CNTSLSISTRAKSLISLLTLQEK------------------IQQLSDNASAI--PRLGIP 94
C+T+ S RA +L+ L + EK IQ ++ ++ ++ R+G+P
Sbjct: 36 CDTTASPPARAAALVQALNITEKLVNLVEYVKSREAPLGISIQLITPHSMSLGAERIGLP 95
Query: 95 AYEWWSESLHGIASNGPGVNFNGT---VSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEA 151
AY WW+E+LHG+A++ PGV+FN S TSF + AA+F+ L + ++ EA
Sbjct: 96 AYAWWNEALHGVAAS-PGVSFNQAGQEFSHATSFANTITLAAAFDNDLVYEVADTISTEA 154
Query: 152 RAMYNLGQAGLTFWAPNINIFRDPRWGRG------------------QETPGEDPMVVSA 193
RA N AGL +W PNIN ++DPRWGRG Q+TPGEDP+ +
Sbjct: 155 RAFSNAELAGLDYWTPNINPYKDPRWGRGHEVCYLSLLFRAVQLLRTQKTPGEDPVHIKG 214
Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
Y ++ +G + + ++V+ A CKH AYDLE+
Sbjct: 215 YVQALLEGLEGRD------------KIRKVI----------------ATCKHFAAYDLER 246
Query: 254 WGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC----LRGDL 309
W RY FNA++T QD + + PF+ C K MCSYN +NG PAC L D+
Sbjct: 247 WQGALRYRFNAVVTSQDLSEYYLQPFQQCARDSKVGSFMCSYNALNGTPACASTYLMDDI 306
Query: 310 FQKARNEWGFKGYITSDCDAVATIF-EYQNYTKTHEDSAAGVLKAGMDINC---GTCMLR 365
+K N YITSDC+A+ + N+++T +AA AG D C G L
Sbjct: 307 LRKHWNWTEHNNYITSDCNAIQDFLPNFHNFSQTPAQAAADAYNAGTDTVCEVPGYPPLT 366
Query: 366 HTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLAL 425
A ++ + E+ IDRAL L+ +R G + Y K+ V T + + LAL
Sbjct: 367 DVIGAYNQSLLSEEIIDRALRRLYEGLIRAGYLDSA-SPHPYTKISWSQVNTPKAQALAL 425
Query: 426 DAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEA 485
+A GIVLLKN+ LPL+ ++A+IG N QM GGY+GIP + +
Sbjct: 426 QSATDGIVLLKNN-GLLPLDLTN-KTIALIGHWANATRQMLGGYSGIPPYYANPIYAATQ 483
Query: 486 YVSKTHYASGC--HDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLP 543
H+A G P +D A+ A K+D ++ + G DL+ EDRDR S+ P
Sbjct: 484 LNVTFHHAPGPVNQSSPSTNDTWTSPALSAASKSDIILYLGGTDLSIAAEDRDRDSIAWP 543
Query: 544 GQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEII 603
Q+SL+TS+A+ K ++ L G +D + ++ ISSILW+GYPG++G AL II
Sbjct: 544 SAQLSLLTSLAQMGKPTIVARL--GDQVDDTPLLSNPNISSILWVGYPGQSGGTALLNII 601
Query: 604 FGDFNPGGRLPMTWYPESFTK-VPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
G +P RLP+T YPE++T +P+ M++R S+R PGR+YR+Y + V FGHGL Y
Sbjct: 602 TGVSSPAARLPVTVYPETYTSLIPLTAMSLRPTSAR--PGRTYRWYP-SPVLPFGHGLHY 658
Query: 663 TNYSYKFLSAPSELTIS-ASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRF-HVQISV 720
T ++ KF LTI+ A L + ++ L RF V + V
Sbjct: 659 TTFTAKF-GVFESLTINIAELVSNCNERYLDLC------------------RFPQVSVWV 699
Query: 721 TNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIAN 780
+N G++ +V ++F R + P K L+G+ R+ + G+ G P + +
Sbjct: 700 SNTGELKSDYVALVFVRGEYGPEPYPIKTLVGYKRIRDIEPGTT----GAAP---VGVVV 752
Query: 781 KHGRRILPLGNHVLMVGELRHSLTIE 806
R+ GN VL G+ L +E
Sbjct: 753 GDLARVDLGGNRVLFPGKYEFLLDVE 778
>gi|336261464|ref|XP_003345521.1| hypothetical protein SMAC_07509 [Sordaria macrospora k-hell]
gi|380088197|emb|CCC13872.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 762
Score = 380 bits (975), Expect = e-102, Method: Compositional matrix adjust.
Identities = 257/783 (32%), Positives = 389/783 (49%), Gaps = 82/783 (10%)
Query: 27 VIAFSNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNAS 86
+ ++ P PD C S C+ +LS RA +L++ +T +EK+Q L +
Sbjct: 23 ALVYAIDLPFQTYPD--CVNGPLASLKVCDATLSPPQRAAALVAAMTTEEKLQNLVSKSK 80
Query: 87 AIPRLGIPAYEWWSESLHGIASNGPGVNF---NGTVSSVTSFPQVLVSAASFNRSLWSNI 143
PR+G+PAY WWSE+LHG+A PG F NGT +S TSFP L+ AA+F+ L +
Sbjct: 81 GAPRIGLPAYNWWSEALHGVAY-APGTQFRSGNGTFNSSTSFPMPLLMAATFDDELIERV 139
Query: 144 GSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQ 203
G + +E RA N G +G +W PN+N F+DPRWGRG ETPGED + + YA ++ +
Sbjct: 140 GEVIGIEGRAFGNAGFSGFDYWTPNVNPFKDPRWGRGSETPGEDILRIKRYAASMIRGLE 199
Query: 204 GENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFN 263
G RE+R++ A CKH A D E W +R+ FN
Sbjct: 200 GP----------VRERERRIV----------------ATCKHYAANDFEDWNGSTRHDFN 233
Query: 264 AMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKG- 321
A +T QD + + PF+ C K IMCSYN VNGVPAC L Q R+ W +
Sbjct: 234 AKVTLQDLAEYYLSPFQQCARDSKVGSIMCSYNAVNGVPACANTYLMQTILRDHWNWTAP 293
Query: 322 --YITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEK 379
YITSDC+AV I +Y KT+ + A +AG+D +C A +G +++
Sbjct: 294 GNYITSDCEAVLDISANHHYAKTNAEGTALAFEAGIDSSCEYEGSSDILGAWTQGLLKQS 353
Query: 380 DIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDK 439
+DRAL L+ +++G F+G+ + +Y LG + V + +++AL AA +GIVLLKNDK
Sbjct: 354 TVDRALRRLYEGLVQVGYFDGN--RSEYASLGWNHVNRPKSQEVALQAAVEGIVLLKNDK 411
Query: 440 KF-LPLNKNAVS-SLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASG-C 496
L + KN LA+IG N+ + GGY+G P S + +A K A G
Sbjct: 412 TLPLGVKKNGPKLKLAMIGFWANDPKTLSGGYSGTPAFEHSPVYATQAMGFKVTTAGGPV 471
Query: 497 HDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVART 556
+ D A+ AK A++++ G D + E +DR ++ P Q+ L+T +++
Sbjct: 472 LQNSTSKDTWTQAALAAAKDANYILYFGGQDTSAAGETKDRTTINWPEAQLQLITDLSKL 531
Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
K P+++V G LD + A I+SILW +P P GRLP+T
Sbjct: 532 GK-PLVVVQMGDQ-LDNTPLLASKAINSILWANWP----------------VPAGRLPVT 573
Query: 617 WYPESFT-KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSE 675
Y ++T VPM DM +R S + PGR+YR+Y T V FG GL YT + K + P
Sbjct: 574 QYHANYTAAVPMTDMTLR--PSDKLPGRTYRWYP-TPVQPFGFGLHYTTFKTKIVRLP-R 629
Query: 676 LTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLF 735
I K++L + G+ + D T +++ VTN G +VV+ F
Sbjct: 630 FAI---------KDLLSRCGN----AYPD-----TCGLPPLKVEVTNTGKRSSDYVVLAF 671
Query: 736 ARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLM 795
+ + P K L+ + R+ ++ G K + ++ ++ G +L G + ++
Sbjct: 672 LKGDVGPKPYPIKTLVSYTRLRDLSPGRKTTAHLDWTLGDIARYDEQGNTVLYPGTYTVI 731
Query: 796 VGE 798
V E
Sbjct: 732 VDE 734
>gi|280977785|gb|ACZ98610.1| glucosidase [Cellulosilyticum ruminicola]
Length = 711
Score = 379 bits (974), Expect = e-102, Method: Compositional matrix adjust.
Identities = 248/771 (32%), Positives = 383/771 (49%), Gaps = 118/771 (15%)
Query: 60 SISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTV 119
+ AK L+ + L EK QL +A AI RLGIP Y WW+E+LHG+A G
Sbjct: 3 NFKNEAKELVRQMDLLEKASQLRYDAPAIKRLGIPTYNWWNEALHGVARAGVA------- 55
Query: 120 SSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN-LGQA-------GLTFWAPNINI 171
T FPQ + AA F+ I +A+E RA YN Q G+TFWAPNINI
Sbjct: 56 ---TVFPQAIGLAAMFDEEKLGEIADIIAIEGRAKYNQFSQKEDRDIYKGMTFWAPNINI 112
Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
FRDPRWGRG ET GEDP + + V F+K QG+
Sbjct: 113 FRDPRWGRGHETYGEDPYLTARLGVAFIKGLQGD-------------------------- 146
Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
+ D L +AC KH + + R+ F+A+++++D +T+ P F + +++ +
Sbjct: 147 ENEDYLKAAACAKHFAVHSGPEE---DRHHFDAIVSKKDLYETYLPAFEAAVKEANVIGV 203
Query: 292 MCSYNQVNGVPAC----LRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSA 347
M +YN+VNG PAC L D+ +K +WGF GYI SDC A+ T T +SA
Sbjct: 204 MGAYNRVNGEPACGSKTLLVDILKK---DWGFDGYIVSDCWAIRDFHTEHMVTHTAAESA 260
Query: 348 AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKY 407
A + G ++NCG L H A +G V+E+ I A L ++++LGLF+ + KY
Sbjct: 261 ALAINNGCELNCGNTYL-HMLEAHQEGLVKEEIITEAAEKLMRIRMQLGLFD---KNCKY 316
Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
++ H+++AL+A+R+ +V+LKND LPLNK+ + S+ IIGP NN + + G
Sbjct: 317 NEIPYAVNDCKVHREVALEASRRSMVMLKND-GILPLNKDKLKSIGIIGPTANNRTVLEG 375
Query: 468 GYTGIPCSPKSLLRGLEAYVS---KTHYASGCH-------DVPCNSDAGFHEAVRIAKKA 517
Y G + + G++ YV + +Y+ GCH ++ +D EA+ +A+++
Sbjct: 376 NYNGTASRYTTFVEGIQDYVGDDVRVYYSEGCHLFANGMSNLAWENDRE-AEALIVAEQS 434
Query: 518 DFVIVVAGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGG 568
D V++ GLD T E E D++SL L G+Q L+ V K PVILVL+ G
Sbjct: 435 DVVVLCLGLDSTIEGEQGDTGNAFAGGDKLSLNLIGRQQQLLEKVVAVGK-PVILVLSTG 493
Query: 569 GPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMN 628
+ +++ AD ++I YPG G KALA+++FG+++P G+LP+T+Y +
Sbjct: 494 SAMAINY--ADEHCNAIFQTWYPGAQGGKALAQLLFGEYSPSGKLPVTFYKTTEELPAFE 551
Query: 629 DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDK 688
D +M+ R+YR+ +Y FG+GLSY
Sbjct: 552 DYSMK--------DRTYRYMPNEALYPFGYGLSYA------------------------- 578
Query: 689 NILQQTGSRLDYVHIDEVT--SCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
+I Q+ LD +E+T S ++ V++ + N +VD VV ++ + + P
Sbjct: 579 DIKVQSVKVLDGAKGEEITNFSAGQTKYKVKVELENKSNVDSYDVVQIYIKDMESQYAVP 638
Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
L F V A SKE++ V + ++ N+ G+RI+ L +G
Sbjct: 639 NFSLCSFKSVFLKAGESKEVTLNVGE-KAFTVINEEGKRIVDSKKFKLFIG 688
>gi|373954937|ref|ZP_09614897.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
gi|373891537|gb|EHQ27434.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
Length = 723
Score = 379 bits (973), Expect = e-102, Method: Compositional matrix adjust.
Identities = 247/773 (31%), Positives = 391/773 (50%), Gaps = 128/773 (16%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
PF T + R + LIS LTL+EK+ Q+ D + ++PRL +P Y WW+E+LHG+A +G
Sbjct: 27 PFNPTDV----RVRDLISKLTLEEKVHQMMDVSPSVPRLNLPKYNWWNEALHGVARSG-- 80
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ--------AGLTF 164
T FPQ + A+F++ L +A++ EARAMYN GLTF
Sbjct: 81 --------VATIFPQAIALGATFDQDLAKRESTAISDEARAMYNAAMVNGYNEKYGGLTF 132
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
W PNINIFRDPRWGRGQET GEDP + S V F++ QG
Sbjct: 133 WTPNINIFRDPRWGRGQETYGEDPFLTSQIGVAFIQGLQG-------------------- 172
Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIE 284
D + L ++AC KH + + R+SFNA+ + +D +T+ P F++ +
Sbjct: 173 -------DDPEHLKVAACAKH---FAVHSGPERLRHSFNAIASPKDLRETYLPAFKALV- 221
Query: 285 QGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKTH 343
+ +MC+YN+ N C L Q R+EW F G++ SDC A+ +
Sbjct: 222 NARVEAVMCAYNRTNSEVCCGSNLLLDQILRDEWHFTGHVVSDCGAIVDFYMGHKVVPGQ 281
Query: 344 EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPR 403
++ A +K G+D+NCG + A+ +G + EK+ID+AL L + +LGLF DP+
Sbjct: 282 PEAVALAVKHGVDLNCGDEYPALIE-AVKRGLITEKEIDKALATLLKTRFKLGLF--DPK 338
Query: 404 KGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNI 462
+ Y + + +++H+ LA + A + IVLLKN+K LPL KN +S I GP ++
Sbjct: 339 QNSPYNNIPVSVINSTDHRALAKEVALKSIVLLKNEK-CLPL-KNNLSKYYITGPNAASV 396
Query: 463 SQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGC-----HDVPCNSDAGFHEAVRIA 514
+ G Y G+ ++L G+ + S+ Y G ++ P + G A
Sbjct: 397 DALMGNYYGVNPHMSTILEGIAGAIQPGSQMQYKPGILLDRDNNNPIDWTTG------DA 450
Query: 515 KKADFVIVVAGLDLTQETEDR---------DRVSLLLPGQQMSLVTSVARTSKRPVILVL 565
K +D VV G+ E E+ DR+ LP Q+ + + + +K V+ ++
Sbjct: 451 KASDVTFVVMGITGLLEGEEGEAIASPNYGDRLDYNLPKNQIDFLRKIRKGNKNKVVAII 510
Query: 566 TGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV 625
TGG P+++S E ++L YPGE G A+A+I+FG +P GRLP+T +P+SF ++
Sbjct: 511 TGGSPMNLS--EVHELADAVLLAWYPGEEGGNAVADILFGKVSPSGRLPVT-FPKSFAQL 567
Query: 626 P-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKA 684
P D +M+ GR+YR+ T +Y FG+GLSY+ Y+Y S LT+S
Sbjct: 568 PPYEDYSMK--------GRTYRYMTAEPMYTFGYGLSYSTYTY------SSLTLS----- 608
Query: 685 GSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQG 744
+K I + ++ + VTN G ++G VV L+ VP+ ++
Sbjct: 609 --EKQIKK------------------NMTIIAETMVTNTGKMEGEEVVQLYITVPQ-TEK 647
Query: 745 TPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
P+ L GF RV+ A S+++ F + P + + + +G +L G++V+ +G
Sbjct: 648 NPQYSLKGFKRVNLKAGESRKVQFQITP-DLMKSVDANGSEVLLSGSYVVRIG 699
>gi|330836687|ref|YP_004411328.1| Beta-glucosidase [Sphaerochaeta coccoides DSM 17374]
gi|329748590|gb|AEC01946.1| Beta-glucosidase [Sphaerochaeta coccoides DSM 17374]
Length = 709
Score = 379 bits (973), Expect = e-102, Method: Compositional matrix adjust.
Identities = 219/626 (34%), Positives = 335/626 (53%), Gaps = 82/626 (13%)
Query: 65 AKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTS 124
A+ ++S +TL EKI Q+ ASAIPRL IP Y WW+E+LHG+A G T
Sbjct: 14 ARRIVSRMTLDEKISQIDYRASAIPRLDIPEYNWWNEALHGVARAGIA----------TV 63
Query: 125 FPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG--------QAGLTFWAPNINIFRDPR 176
FPQ + AA F+ + IG+ ++ E RA YN GLTFW+PN+NIFRDPR
Sbjct: 64 FPQAIGLAAMFDSDMMERIGAVISTEGRAKYNEAVRHGDRDIYKGLTFWSPNVNIFRDPR 123
Query: 177 WGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDE 236
WGRGQET GEDP + + AV F++ QG+ G
Sbjct: 124 WGRGQETYGEDPYLTARLAVAFIRGIQGD----------------------------GKY 155
Query: 237 LMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYN 296
L +AC KH + + R+ F+A ++++D +T+ F++ +++ + +M +YN
Sbjct: 156 LKAAACAKH---FAVHSGPEALRHEFDARVSQKDLHETYLSAFKAAVKEAQVEIVMGAYN 212
Query: 297 QVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGM 355
+VNGVPAC +L R+EWGF+G++ SD +A+ IF++ +Y + A LKAG
Sbjct: 213 RVNGVPACASHELLSDILRSEWGFEGHVVSDYEALEDIFKHHHYVADEAHTMAVALKAGC 272
Query: 356 DINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDV 415
++ C + RH +S++D+G + E +I A+ LF+ ++ +G+ D Y +G ++
Sbjct: 273 NL-CAGKIARHLRSSVDEGLISEDEITEAVERLFTTRIMMGMMADDC---PYDSIGYEEN 328
Query: 416 CTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCS 475
T EH +LA++AA + VLLKND LPL +SS+A+IGP N+ + G Y G
Sbjct: 329 DTPEHHQLAVEAASRSFVLLKNDG-LLPLEMEKISSIAVIGPNANSRKMLEGNYNGTASR 387
Query: 476 PKSLLRGLEAYVS---KTHYASGCH------DVPCNSDAGFHEAVRIAKKADFVIVVAGL 526
++L G++ V + Y+ GCH + EAV A+ AD V++ GL
Sbjct: 388 YVTVLEGIQDLVGDSVRVWYSEGCHLYKNFHSSLSGRNDRLAEAVSAAQHADVVVLCLGL 447
Query: 527 DLTQETEDRD---------RVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAE 577
D T E E+ D + +L LPG+Q L+ ++ T +PVIL+L G L + E
Sbjct: 448 DATLEGEEGDVEVGFGSGDKPNLSLPGRQQLLLDTML-TVGKPVILLLASGSALTLGGRE 506
Query: 578 ADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSS 637
D + +IL I YPG G KA+A+++FG P G+LP+T+Y + D +M
Sbjct: 507 NDENLKAILQIWYPGAMGGKAVADVLFGRRAPAGKLPVTFYASADELPAFEDYSM----- 561
Query: 638 RQYPGRSYRFYTGTQVYGFGHGLSYT 663
GR+YR+ G +Y FG+GL+Y+
Sbjct: 562 ---AGRTYRYMKGNALYPFGYGLTYS 584
>gi|372208556|ref|ZP_09496358.1| beta-glucosidase [Flavobacteriaceae bacterium S85]
Length = 729
Score = 379 bits (972), Expect = e-102, Method: Compositional matrix adjust.
Identities = 254/768 (33%), Positives = 372/768 (48%), Gaps = 112/768 (14%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ +TSL+ R L+ +TL+EKI QL + + RL IP Y WW+E+LHG+A NG
Sbjct: 26 WLDTSLTFEERIHHLVKAMTLKEKIAQLDSGSPEVKRLDIPEYNWWNEALHGVARNGKS- 84
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ--------AGLTFW 165
T FPQ + AA+F+ L + SA++ EARA +N+ Q AGLTFW
Sbjct: 85 ---------TVFPQAIGLAATFDPVLAKQVASAISDEARAKFNISQSIGNRGQYAGLTFW 135
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
PN+NIFRDPRWGRGQET GEDP + S V FVK QG + K
Sbjct: 136 TPNVNIFRDPRWGRGQETYGEDPYLTSQMGVAFVKGLQGNHPKY---------------- 179
Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
L +AC KH + + R+ FNA +++D +T+ P F + ++Q
Sbjct: 180 -----------LKSAACAKHFAVHSGPEE---LRHHFNANPSKKDLYETYLPAFEALVKQ 225
Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
+M +YN V GVPA L ++ R WGF GYI SDC A+ IF+ KT
Sbjct: 226 ANVEGVMSAYNAVYGVPAGSSEFLLKETLRKSWGFDGYIVSDCGALGDIFKGHKQVKTMP 285
Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
++AA LKAG+++NCG + A+ +G V E+ ID L L + +LG F DP++
Sbjct: 286 EAAAVALKAGVNLNCGYVYNGALEKAVQQGLVSEELIDTRLKQLLKTRFKLGFF--DPKE 343
Query: 405 GK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNIS 463
Y + + + +H LA A++ IVLLKN LPL+KN + + GP ++
Sbjct: 344 ANPYNAIPTSVIHSDDHIALARKTAQKSIVLLKNKNHTLPLDKN-IKVPYVTGPFASSSD 402
Query: 464 QMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCHDVPCNSDAGFHE-AVRIAKKADF 519
+ Y G+ + S+L G+ VS +Y G +P N + A +AK AD
Sbjct: 403 VLLANYYGMTTNLVSVLEGIADKVSLGTSLNYRMGA--LPFNKNLNPKNWAPNVAKTADA 460
Query: 520 VIVVAGLDLTQETEDRDRVS---------LLLPGQQMSLVTSVARTSKRPVILVLTGGGP 570
VI V GL E E+ D ++ L LP Q+ V +A K P+ILV+ G
Sbjct: 461 VIAVVGLSADFEGEEVDAIASPNKGDKKDLKLPQNQIDYVKEMAAKKKGPLILVVASGSA 520
Query: 571 LDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV-PMND 629
V+ E +I+ + YPGE G A+A+++FGD +P G LP+T +P+S ++ P D
Sbjct: 521 --VALGELYDLADAIVLMWYPGEQGGNAVADVLFGDVSPSGHLPVT-FPKSVAQLPPFED 577
Query: 630 MNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKN 689
+M+ GR+Y++ ++ FG GLSYT++
Sbjct: 578 YSMQ--------GRTYKYMEEEPLFPFGFGLSYTDF------------------------ 605
Query: 690 ILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQ 749
+ V I E F V SV N G VDG VV L+ ++ P+ Q
Sbjct: 606 -------KFSNVQISEEKIKKKDSFTVSCSVANNGKVDGEEVVQLYLVPLNSNKDLPKYQ 658
Query: 750 LIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
L+ F R+ SK +SF ++ + L NK G++ G + L+V
Sbjct: 659 LLKFKRIEIQKNTSKTVSFNLE-AKDLFQVNKEGKKTWIKGKYKLVVA 705
>gi|261368518|ref|ZP_05981401.1| beta-glucosidase [Subdoligranulum variabile DSM 15176]
gi|282569400|gb|EFB74935.1| glycosyl hydrolase family 3 C-terminal domain protein
[Subdoligranulum variabile DSM 15176]
Length = 717
Score = 378 bits (971), Expect = e-102, Method: Compositional matrix adjust.
Identities = 240/762 (31%), Positives = 382/762 (50%), Gaps = 117/762 (15%)
Query: 64 RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
RA++L++ +TL+EKI Q+ A AIPRLGIPAY WW+E +HG+ G + T
Sbjct: 14 RARALVAQMTLKEKISQMLSWAPAIPRLGIPAYNWWNEGIHGV----------GRAGTAT 63
Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFWAPNINIFRDP 175
FPQ + AASF+ L +G AV VEAR YN+ ++ GLT WAPN+NIFRDP
Sbjct: 64 VFPQAIGLAASFDEDLLGQVGEAVGVEARGKYNMYRSYQDRDIYKGLTIWAPNVNIFRDP 123
Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
RWGRG ET GEDP + S V FV+ QG D D
Sbjct: 124 RWGRGHETYGEDPYLTSRLGVRFVEGMQG---------------------------DDPD 156
Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
L +AC KH + + R+ F+A +++QD +T+ P FR+ +++ +M +Y
Sbjct: 157 YLRAAACAKHFAVHSGPE---DQRHYFDAKVSQQDLWETYLPAFRALVKEAGVEAVMGAY 213
Query: 296 NQVNGVPAC-LRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
N+ NG P C + L R +W F+G++TSDC A+ E T DS A + G
Sbjct: 214 NRTNGEPCCGSKTLLVDILRGKWNFQGHVTSDCWAIKDFHEGHMVTSGPVDSVALAVNNG 273
Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDD 414
D+NCG + + + A+ +GKV+E+ IDR+L+ LF+ +++LG+F+ + K Y K+G D
Sbjct: 274 CDLNCGD-LYAYLEEAVAEGKVKEETIDRSLVRLFTTRMKLGMFDAE-EKVPYNKIGYDA 331
Query: 415 VCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPC 474
V + E + L L+ A + +VLLKN+ LPL+K+ + +A++GP +N + G Y G
Sbjct: 332 VDSREMQALNLEVAEKILVLLKNENHTLPLDKSKLHRVAVVGPNADNRKALVGNYEGTAS 391
Query: 475 SPKSLLRGLEAYVS---KTHYASGCHDVP------CNSDAGFHEAVRIAKKADFVIVVAG 525
++L G++ Y+ + Y+ GCH S+ E + + D VI G
Sbjct: 392 RYVTVLDGIQEYLGEDVQVRYSEGCHLYADKIQGLAKSNELISEVRGVCAECDVVICCLG 451
Query: 526 LDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFA 576
LD E E+ D+ SL LPG Q S++ + + K V++VL+G ++
Sbjct: 452 LDAGLEGEEGDQGNQFASGDKQSLSLPGNQESVLKACIESGKPVVVVVLSGSA---LALG 508
Query: 577 EADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADS 636
A +++L YPG G +A+A +FG+ NP G+LP+T+Y +D ++ A +
Sbjct: 509 TAQEGAAAVLQAWYPGAQGGRAVARALFGECNPQGKLPVTFY--------HSDEDLPAFT 560
Query: 637 SRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGS 696
GR+YR+ +Y FG+GLSY++++++ A + + G D
Sbjct: 561 DYAMKGRTYRYMEKEPLYPFGYGLSYSHFTFRDAKADAA-------QIGPDG-------- 605
Query: 697 RLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRV 756
V+++V N G G V ++ + + GTP QL +V
Sbjct: 606 -----------------VDVRVTVVNDGQYRGRETVEVYVKAER--PGTPNAQLKALAKV 646
Query: 757 HTVAKGSKEISFGVDPCEQLSIANKHG-RRILPLGNHVLMVG 797
+ K ++ + C ++ N+ G +LP G + + +G
Sbjct: 647 DLMPGEEKCVTLHLPQC-AFALCNEEGISEVLP-GEYTVWLG 686
>gi|322512556|gb|ADX05682.1| putative carbohydrate-active enzyme [uncultured organism]
Length = 717
Score = 378 bits (970), Expect = e-102, Method: Compositional matrix adjust.
Identities = 244/774 (31%), Positives = 389/774 (50%), Gaps = 123/774 (15%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ + + + RA++L+ +TL+EK+ Q NA AI RLG+PAY +W+E+LHG+A G
Sbjct: 6 WLDETKTFEERAQALVCEMTLEEKVFQTLFNAPAIERLGVPAYNYWNEALHGVARAG--- 62
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
T FPQ + AASF+ L + ++ EARA +N+ Q GLTFW
Sbjct: 63 -------VATVFPQAIGLAASFDEELLGQVADTISTEARAKFNMQQKFGDRDIYKGLTFW 115
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
+PN+NIFRDPRWGRG ET GEDP + V F++ QG++ +R +K
Sbjct: 116 SPNVNIFRDPRWGRGHETFGEDPFLSGRLGVSFIRGMQGDD--------------ERYMK 161
Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
++AC KH + + R+SFNA+++EQD +T+ P F +C+ +
Sbjct: 162 -------------VAACAKH---FAVHSGPEDQRHSFNAVVSEQDLRETYLPAFHACVTE 205
Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
+M +YN+ NG C L R EWGF+G++TSDC A+ E+ TK E
Sbjct: 206 AGVEAVMGAYNRTNGEACCGSKKLLVDILRGEWGFRGHVTSDCWALKDFHEFHMVTKNQE 265
Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
++ A + +G D+NCG + H A+ G V+E IDRA+ LF+ +++LGLF+ +
Sbjct: 266 ETVALAMNSGCDLNCGNLYV-HLLQAVRDGLVEESVIDRAVTRLFTTRMKLGLFD-RSEE 323
Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
Y +G D V T ++KL +A+R+ + LLKN LPL+ + + ++ ++GP +N
Sbjct: 324 VPYNGIGYDRVDTEANRKLNREASRRTVCLLKNADGLLPLDISKLRTIGVVGPNADNRKA 383
Query: 465 MGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCH---------DVPCNSDAGFHEAVR 512
+ G Y G ++L G+ + Y+ GCH P + A EA
Sbjct: 384 LVGNYEGTASEYVTVLDGIRELAGDDVRVVYSEGCHLFRDRVQGLGQPNDRIA---EARA 440
Query: 513 IAKKADFVIVVAGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVIL 563
+A+ +D VI V GLD E E+ D+ +L LPG Q ++ ++ + K PV+L
Sbjct: 441 VAELSDVVIAVMGLDPGLEGEEGDQGNEFASGDKPNLELPGLQGEVLKALVESGK-PVVL 499
Query: 564 VLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT 623
VL GG L + +AE + +IL YPG G +A+A+++FG P G+LP+T+Y S
Sbjct: 500 VLLGGSALAIPWAE--EHVPAILDAWYPGAQGGRAVADVLFGRACPEGKLPVTFYRTSEE 557
Query: 624 KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLK 683
D +M+ R+YR+ +Y FG+GLSYT++ ELT + +
Sbjct: 558 LPAFTDYSMK--------NRTYRYMKQPALYPFGYGLSYTSW---------ELTNTTA-- 598
Query: 684 AGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ 743
GS +D+ C ++ + N G + G+ V ++ + P +
Sbjct: 599 ----------EGS------VDDGVVCRAV-------LRNTGAMAGAQTVQVYVKAPLAT- 634
Query: 744 GTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
P QL G ++ S E++ +D E + N+ G R+L G + + +G
Sbjct: 635 -GPNAQLKGLRKIRLQPGESAEVAISLDK-EAFGVYNEKGLRVLLPGEYKIYIG 686
>gi|346225847|ref|ZP_08846989.1| beta-glucosidase [Anaerophaga thermohalophila DSM 12881]
gi|346227016|ref|ZP_08848158.1| beta-glucosidase [Anaerophaga thermohalophila DSM 12881]
Length = 718
Score = 378 bits (970), Expect = e-102, Method: Compositional matrix adjust.
Identities = 247/768 (32%), Positives = 380/768 (49%), Gaps = 114/768 (14%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
F N +S+ RA+ ++ LT++EKI QL + A A+ RL IP Y+WW+E LHG+A G
Sbjct: 17 FRNPDISLDERAECIVKQLTVEEKINQLMNAAPAVDRLEIPEYDWWNECLHGVARAG--- 73
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL-------GQ-AGLTFW 165
T FPQ + AA+++ +L +G A++ EARA YN+ GQ GLTFW
Sbjct: 74 -------RATVFPQAIGMAATWDTTLVYRVGDAISTEARAKYNVFSKHGYRGQYKGLTFW 126
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
PN+NIFRDPRWGRGQET GEDP + S V FVK QG + K
Sbjct: 127 TPNVNIFRDPRWGRGQETYGEDPFLTSRIGVSFVKGLQGNHPKY---------------- 170
Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
L ++A KH Y + R+ F+A ++ +D +T+ P F + +++
Sbjct: 171 -----------LKVAALAKH---YAVHNGPEALRHEFDAKVSMKDLWETYLPAFEALVKE 216
Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
+M +YN+ NG P C L Q+ R +WGF GY SDC A+ + T E
Sbjct: 217 AGVEGVMGAYNRTNGDPCCAHPYLMQEVLREKWGFDGYYVSDCGAIMDFYTGHKIVDTPE 276
Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
++AA L AG ++NCG +S ++KG E++IDR++ LF +LRLGLF P
Sbjct: 277 EAAAMALNAGCNLNCGDTYASLLKS-LEKGLTTEEEIDRSVKQLFKTRLRLGLFA--PEG 333
Query: 405 G-KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNIS 463
Y + D + + EH+KLAL+AAR+ +VLLKN+ LP+ ++ V + + GP ++
Sbjct: 334 AVPYDTISTDVIRSKEHQKLALEAARKSVVLLKNEANTLPVARD-VKKVYVTGPTATHVQ 392
Query: 464 QMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGC--HDVPCNSDAGFHEAVRIAKKAD 518
+ Y G+ ++L G+ VS Y G ++ N+ F A A AD
Sbjct: 393 ALLANYYGVSEDMTTILEGIVGKVSPQTSVQYRQGALLYEANRNTMDWFSGA---AASAD 449
Query: 519 FVIVVAGLDLTQETEDR---------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGG 569
+ G+ E E+ DR LP Q+ + + ++K+ V+++ +G
Sbjct: 450 VTVACLGISQLIEGEEGEAIASEHRGDRERTRLPQNQIDFLKRIRASAKKLVVVITSGSA 509
Query: 570 PLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMND 629
+S E ++L++ YPGE G KA+A+++FGD P GRLP+T P +
Sbjct: 510 ---ISLPEIYDMADALLYVWYPGEQGGKAVADVLFGDAVPSGRLPVTVVKSVDDLPPYEN 566
Query: 630 MNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKN 689
+M+ GR+YR+ + + FG GLSYT+++Y S LT+ +
Sbjct: 567 YDMK--------GRTYRYMEVSPQFPFGFGLSYTDFTY------SNLTLES--------- 603
Query: 690 ILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQ 749
++V S S+R + +TN G+ D VV + + S P++
Sbjct: 604 --------------NKVKSGESVR--LSFDLTNEGEYDADEVVQFYITDVEASVNVPKQS 647
Query: 750 LIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
LIGF RV A S +I F V P + + I + +G +IL G + +G
Sbjct: 648 LIGFKRVGLAAGESTKIEFTVTP-DMMKIVDNNGEKILESGEFKIYIG 694
>gi|330947691|ref|XP_003306937.1| hypothetical protein PTT_20252 [Pyrenophora teres f. teres 0-1]
gi|311315273|gb|EFQ84970.1| hypothetical protein PTT_20252 [Pyrenophora teres f. teres 0-1]
Length = 756
Score = 377 bits (968), Expect = e-101, Method: Compositional matrix adjust.
Identities = 242/764 (31%), Positives = 377/764 (49%), Gaps = 61/764 (7%)
Query: 40 PDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWW 99
PD CK S C+ + S + RA +L++ + QEK++ L + + RLG+PAY WW
Sbjct: 22 PD--CKNGPLKSNAICDVTASPAKRAAALVAAMQTQEKLENLVSKSKGVARLGLPAYNWW 79
Query: 100 SESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ 159
E+LHG+A PG+NF G+ + TSFP L+ +A+F+ L I + EARA N G
Sbjct: 80 GEALHGVAG-APGINFTGSYRTATSFPMPLLMSAAFDDDLIHQIAIVIGNEARAFGNGGI 138
Query: 160 AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFR 219
A + FW P+IN FRDPRWGRG ETPGED + + Y + +G+
Sbjct: 139 APVDFWTPDINPFRDPRWGRGSETPGEDILRIKGYTKSLLSGLEGDK------------A 186
Query: 220 EKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPF 279
+++++ A CKH + YD+E W R+ F+A IT QD + F PPF
Sbjct: 187 QRKII----------------ATCKHYVGYDVENWNGTDRHHFDAKITTQDLAEYFMPPF 230
Query: 280 RSCIEQGKASCIMCSYNQVNGVPAC----LRGDLFQKARNEWGFKGYITSDCDAVATIFE 335
+ C K MCSYN VNGVP C + D+ +K N YITSDC+AV I
Sbjct: 231 QQCARDSKVGSFMCSYNAVNGVPTCADTYVLEDILRKHWNWTDSNNYITSDCEAVKDISL 290
Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
Y T +++ A GMD++C A +G + IDRAL + +
Sbjct: 291 RHKYVATLQEATAIAFNNGMDLSCEYSGTSDIPGAFSQGLLNVSVIDRALTRQYEGLVHA 350
Query: 396 GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAII 455
G F+G Y LG D+ T E +KL L A +G+ LLKND LPL+ + S +A++
Sbjct: 351 GYFDG--AAATYAHLGVQDINTPEAQKLVLQVAAEGLTLLKNDDT-LPLSLKSGSKVAMV 407
Query: 456 GPLVNNISQMGGGYTG-IPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIA 514
G N S++ G Y+G P + G + + +D A+ A
Sbjct: 408 GFWANTTSKLSGIYSGPAPYLHTPVYAGNKLGLDMAVATGPILQTSGAADNWTTTALNAA 467
Query: 515 KKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
KK+DF++ GLD + E DR + P Q+ L+T +A K V++ L G +D +
Sbjct: 468 KKSDFILYFGGLDPSAAAEGSDRTDISWPSAQIDLITKLAALGKPLVVIAL--GDMVDHT 525
Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA 634
++S++W +PG+ G A+ ++I G+ GRLP+T YP +T++ M DMNMR
Sbjct: 526 PILKMKGVNSLIWANWPGQDGGTAVMQVITGEHAIAGRLPITQYPAEYTQLSMLDMNMRP 585
Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
+ PGR+YR+Y V FG GL YT ++ KF S+ S LT++ ++I++
Sbjct: 586 GGNN--PGRTYRWYN-ESVQPFGFGLHYTKFAAKFGSS-SGLTVNI-------QDIMKSC 634
Query: 695 GSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFD 754
D+ + +V ++++VTN G+ + + F + + P K L+ +
Sbjct: 635 TK--DHPDLCDVPP-------IEVAVTNEGNRTSDFIALAFIKGEVGPKPYPLKTLVSYA 685
Query: 755 RVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
R+ ++ +++ LS ++ G + G + L++ E
Sbjct: 686 RLRDISGSQTKMASLALTLGALSRVDQSGNLVAYPGEYTLLLDE 729
>gi|169611757|ref|XP_001799296.1| hypothetical protein SNOG_08993 [Phaeosphaeria nodorum SN15]
gi|160702362|gb|EAT83185.2| hypothetical protein SNOG_08993 [Phaeosphaeria nodorum SN15]
Length = 755
Score = 377 bits (968), Expect = e-101, Method: Compositional matrix adjust.
Identities = 253/753 (33%), Positives = 374/753 (49%), Gaps = 73/753 (9%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
C+ + + + RA +L+ + EK+ L + RLG+P Y WW E+LHG+A PG+
Sbjct: 33 ICDVTAAPAERAAALVEAMQTNEKLDNL---MRGVTRLGLPKYNWWGEALHGVAG-APGI 88
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFR 173
NF G + TSFP L+ +A+F+ L I + + EARA N G A + FW P+IN FR
Sbjct: 89 NFTGAYKTATSFPMPLLMSAAFDDDLIFKIANIIGNEARAFGNGGVAPVDFWTPDINPFR 148
Query: 174 DPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDR 233
DPRWGRG ETPGED + + Y + +G+ +++++
Sbjct: 149 DPRWGRGSETPGEDIVRIKGYTKHLLAGLEGDK------------PQRKII--------- 187
Query: 234 GDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMC 293
A CKH + YD+E WG R+SFNA I QD + + PPF+ C K MC
Sbjct: 188 -------ATCKHYVGYDMEAWGGIDRHSFNAKINMQDLAEYYMPPFQQCARDSKVGSFMC 240
Query: 294 SYNQVNGVPACLRGDLFQKA-RNEWGF---KGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
SYN VNGVP C + Q R+ W + YITSDC+AV I Y KT+ +
Sbjct: 241 SYNAVNGVPTCADTYVLQTILRDHWNWTESNNYITSDCEAVKDISLKHKYAKTNAEGTGL 300
Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGK 409
AGMD +C A ++ + IDRAL + +R G F+G Y
Sbjct: 301 AFTAGMDNSCEYTGSSDIPGAFNQSYLSIPTIDRALKRQYEGLVRAGYFDG--AAATYAN 358
Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY 469
LG D+ T E ++L+L A +G+VLLKND LPL+ S +A++G N+ S++ G Y
Sbjct: 359 LGVKDINTPEAQQLSLQVASEGLVLLKNDDT-LPLSLTNGSKVAMLGFWANDTSKLSGIY 417
Query: 470 TGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNS---DAGFHEAVRIAKKADFVIVVAGL 526
+G +S + + ASG NS D A+ A+K+D+++ GL
Sbjct: 418 SGPAPYLRSPVWAGQKLGLDMAIASGPILQQSNSSTRDNWTTNALAAAEKSDYILYFGGL 477
Query: 527 DLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSIL 586
D + E DR S+ P Q+ L+ +A K V+LVL G +D S ++S++
Sbjct: 478 DPSAAAEGFDRNSIAWPTAQVDLIKKLAAIGKPLVVLVL--GDLMDNSPLLELDGVNSVI 535
Query: 587 WIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYR 646
W +PG+ G A+ +++ G GRLP+T YP ++T++ M DMNMR SS PGR+YR
Sbjct: 536 WANWPGQDGGSAVMQVVTGAVAVAGRLPITQYPANYTELSMLDMNMRPSSSS--PGRTYR 593
Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLSAPS-ELTISASLKAGSDKNILQQTGSRLDYVHIDE 705
++ G V FG GL YT + KF + + E IS NI ++ ++ Y
Sbjct: 594 WFNGA-VQPFGTGLHYTTFDAKFAANSTIEYDIS---------NITKECTNQ--YPDTCS 641
Query: 706 VTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKE 765
V S + ++VTN+G+ + + F + P K LI + RV V KG +
Sbjct: 642 VPS-------IPVAVTNSGNRTSDFIALAFIKGENGPAPYPLKTLISYTRVRDV-KGGQT 693
Query: 766 ISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
S + QL++ N R+ +GN VL GE
Sbjct: 694 KSAEM----QLTLGNL--ARVDQMGNTVLYPGE 720
>gi|380696433|ref|ZP_09861292.1| glycoside hydrolase [Bacteroides faecis MAJ27]
Length = 739
Score = 377 bits (968), Expect = e-101, Method: Compositional matrix adjust.
Identities = 249/774 (32%), Positives = 376/774 (48%), Gaps = 114/774 (14%)
Query: 50 DSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASN 109
+ +PF + L + R + L+S LTL+EK++Q+ ++ + RLGIPAY WW+E LHGI
Sbjct: 23 EKFPFRDPQLPVEQRVEDLVSRLTLEEKVKQMLNSTPPVERLGIPAYNWWNECLHGIGR- 81
Query: 110 GPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------G 161
T VT FPQ + AA++N +L + S++A E RA+YN Q
Sbjct: 82 --------TKYHVTVFPQAIGMAAAWNDALIKEVASSIADEGRAIYNDTQRKEDYSQYHA 133
Query: 162 LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREK 221
LT+W PNINIFRDPRWGRGQET GEDP + + FV+ QG+N
Sbjct: 134 LTYWTPNINIFRDPRWGRGQETYGEDPYLTARIGEAFVQGLQGDN--------------P 179
Query: 222 RVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRS 281
R LK SAC KH Y + +R+SFN+ ++ D DT+ P FR+
Sbjct: 180 RYLKA-------------SACAKH---YAVHSGPEKNRHSFNSDVSTYDLWDTYLPAFRT 223
Query: 282 CIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYT 340
+ K S +MC+YN G P C L Q R++W F GY+TSDC A+ IF N+
Sbjct: 224 LVVDAKVSGVMCAYNAFQGQPCCGNDLLMQSILRDKWNFTGYVTSDCGAIDDIF---NHH 280
Query: 341 KTHEDSAAGVLKA---GMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
KTH D+A A G D++CG A+ G + EK +D ++ LF+++ RLGL
Sbjct: 281 KTHPDAATAAADAVFHGTDLDCGHSAYLALVKAVKDGIITEKQLDVSVKRLFTIRFRLGL 340
Query: 398 FNGDPRK-GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIG 456
F DP + Y ++ + +H+ LA AR+ +VLLKND + LPL KN + + ++G
Sbjct: 341 F--DPVELVDYARIPISILECRKHQDLAKQLARESMVLLKND-QLLPLQKNKLKKVVVMG 397
Query: 457 PLVNNISQMGGGYTGIP---CSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRI 513
P ++ + G Y G P +P +R ++ Y G V S + V
Sbjct: 398 PNADSRESLLGNYNGNPSRMLTPLQAIRERLGGWTEVEYIEGVDHVNTISADDLKQYVNR 457
Query: 514 AKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVIL 563
AK AD VI + G+ E E+ DR ++ LP Q ++ + P +
Sbjct: 458 AKGADAVIFIGGISPRLEGEEMPVSKDGFDGGDRTTIALPAVQTQMMKAWV-AEHIPTVF 516
Query: 564 VLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT 623
V+ G L + + + + +IL Y G+ G +A+A+++FGD+NP G+LP+T+Y +
Sbjct: 517 VMMTGSALAIPWEAQN--VPAILNAWYGGQYGGEAIADVLFGDYNPSGKLPVTFYAK--- 571
Query: 624 KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLK 683
D ++ S GR+YR++ G +Y FG+GLSYT+++Y L P
Sbjct: 572 -----DSDLPDFESYDMQGRTYRYFNGKALYPFGYGLSYTSFAYSSLKLP---------- 616
Query: 684 AGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ 743
+V T V ++V N G +G VV L+ P
Sbjct: 617 ---------------------KVCRTTDKEIEVTVTVKNTGHTEGEEVVQLYVSHPDKKI 655
Query: 744 GTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
P L GF R+ A ++ ++F + E LS +++G R + G + VG
Sbjct: 656 LVPLTALKGFKRIQLKAGEAQRVTFSLS-SEDLSCVDENGIRKVWAGTVKIQVG 708
>gi|449299051|gb|EMC95065.1| glycoside hydrolase family 3 protein [Baudoinia compniacensis UAMH
10762]
Length = 849
Score = 377 bits (968), Expect = e-101, Method: Compositional matrix adjust.
Identities = 261/804 (32%), Positives = 385/804 (47%), Gaps = 80/804 (9%)
Query: 40 PDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWW 99
PD P S C+T+ + RA ++I+ + + EK+ L D + RLG+P YEWW
Sbjct: 28 PDCVNGPALLTSNLVCDTNATPYQRASAIINAMNITEKLANLLDVSYGSARLGLPPYEWW 87
Query: 100 SESLHGIASNGPGVNF--NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL 157
SE+LHG+A + PGVNF +G S TSFP + +++F+ NI S ++ EARA N
Sbjct: 88 SEALHGVAGS-PGVNFTSSGNYSYATSFPMPITFSSAFDDPSVQNIASVISTEARAYSNA 146
Query: 158 GQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFG 217
+ GL ++ PNIN F+DPRWGRG ETPGEDP+ + Y + +G +DD
Sbjct: 147 ARGGLDYFTPNINPFKDPRWGRGSETPGEDPLRIQGYVKNLLIGLEG----TDD------ 196
Query: 218 FREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQP 277
G+ S G + M+ A CKH YDLE W + RY ++A IT QD + + P
Sbjct: 197 --------GYFNTSHSGYKKMI-ATCKHFAGYDLEDWDGYIRYGYDAEITTQDLAEYYLP 247
Query: 278 PFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGF---KGYITSDCDAVATI 333
PF++C + IMCSYN VN VPAC L + R WG+ YITSDC+A++ I
Sbjct: 248 PFQTCARDQNVASIMCSYNSVNSVPACANSYLQETILREHWGWTIDNNYITSDCNAISDI 307
Query: 334 FEYQNYTKTHEDSAAGVLKAGMDINC---GTCMLRHTQSAIDKGKVQEKDIDRALLNLFS 390
+ NY+ + +A L GMD C T ++ + G V E I AL+ +
Sbjct: 308 YYNHNYSVNNAAAAGLSLSNGMDTACIVANTGVMTDVNGSYYGGYVTEATITTALIRQYE 367
Query: 391 VQLRLGLFNGDPRKGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAV 449
+ G F DP Y +G V T + LA AA +G LLKN LP +
Sbjct: 368 ALVIAGYF--DPASSNPYRSIGWSSVNTPAAQTLARQAATEGTTLLKN-TGLLPYKFTSQ 424
Query: 450 SSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHE 509
+ +A+IG N SQM GGY+G S L +YA+G + +
Sbjct: 425 TKVAMIGMWANGTSQMQGGYSGPAPYLHSPLYAASQLGLSYNYANGPINQTTLTSNYSQN 484
Query: 510 AVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGG 569
A A+ AD ++ G+D + E E DR + PG Q +L+ +A K ++VL G
Sbjct: 485 ATAAAQNADVILFFGGIDWSVEAEAMDRYQIAWPGAQQALIAQLAALGKP--MIVLQMGS 542
Query: 570 PLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMN 628
LD + +++ IS+++W+GYPG+ G A +I+ G P GRLP+T YP + +VPM
Sbjct: 543 MLDATPILSNNNISALVWVGYPGQDGGVAAFDILTGAVAPAGRLPVTMYPADYVNQVPMT 602
Query: 629 DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKF--------------LSAPS 674
+M++R PGR+Y++Y V F +GL YT + F +AP
Sbjct: 603 NMSLRPGPGN--PGRTYKWYNNA-VLPFAYGLHYTTFKATFNGGPPGPGSPWSPPWNAPW 659
Query: 675 ELTISASLKAGS----------DKNILQQTGSRLDYVHIDE-VTSCTSLR------FHVQ 717
+ G+ + G +I ++SCT+ V
Sbjct: 660 SAKVRRGWGWGNWGPPNWGWTQPSQVAPGNGGLSSSYNIQSLLSSCTAAHPDLCAFPSVA 719
Query: 718 ISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG-----SKEISFGVDP 772
ISV NAG V ++F+ P K L + R+H+VA G S ++ GV
Sbjct: 720 ISVQNAGQTTSDFVALVFSNTTAGPAPYPYKSLASYTRLHSVAAGQTVTASLNMTLGV-- 777
Query: 773 CEQLSIANKHGRRILPLGNHVLMV 796
L+ + G +IL G + L++
Sbjct: 778 ---LARRDDQGNQILYPGTYNLLL 798
>gi|167519969|ref|XP_001744324.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163777410|gb|EDQ91027.1| predicted protein [Monosiga brevicollis MX1]
Length = 721
Score = 376 bits (966), Expect = e-101, Method: Compositional matrix adjust.
Identities = 244/727 (33%), Positives = 361/727 (49%), Gaps = 83/727 (11%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDN-------ASAIPRLGIPAYEWWSESLHG 105
PFC+ SL RA L LTL E QQL+ A +PRLG+ Y + +E LHG
Sbjct: 43 PFCDLSLDFRDRAWDLAQRLTLDELAQQLNTYSFTPQAYAPGVPRLGLRNYSYHAEGLHG 102
Query: 106 IASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ------ 159
I + VN+ T+ +PQV AA+ N SL + + + E RA+ N Q
Sbjct: 103 I-RDANVVNYPATL-----YPQVTAMAATANASLIHEMSTIMGTELRAVNNRAQELGEIF 156
Query: 160 ---AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGF 216
L+ + P +NI RD RWGR QE+ EDP + YAV FV
Sbjct: 157 GRGGALSIYGPTMNIIRDGRWGRSQESVSEDPWLNGLYAVNFVLGL-------------- 202
Query: 217 GFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGN-FSRYSFNAMITEQDTEDTF 275
E+ + L + CKHL AY E + N +R+SFNA+I E D DT+
Sbjct: 203 ------------EQRNSSKYLQAATSCKHLFAYSFEGYNNTLTRHSFNAVIDELDIHDTY 250
Query: 276 QPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIF 334
P FR+C+E G IMCSYN VNG+PAC RGD+ + R WGF+G I SDCDAVA I+
Sbjct: 251 LPAFRACVELGHVQQIMCSYNSVNGIPACARGDVQNDRVRKAWGFEGLIVSDCDAVADIY 310
Query: 335 EYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
NYT+T ED+ L+ G D++CG +H SA+ + + +++ + ++
Sbjct: 311 NTHNYTRTPEDAVTVALQGGCDLDCGDFYSQHLASAVQQNLTTLAALQQSMTRVLEMRFL 370
Query: 395 LGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
LG F+ D Y +LG + + T + +L A+R+ +VLL+N K LP+ +A +A+
Sbjct: 371 LGEFDPD-TSVPYRQLGREAIDTPFARDSSLRASRESVVLLENRIKLLPVTLSADIKVAL 429
Query: 455 IGPLVNNIS-QMGGGYTGIPCSPKSLLRGLEAYVSKTHYAS--GCHDVPCNSDAGFHEAV 511
IGP VN + MGG P + +G +A + TH S GC ++ +AV
Sbjct: 430 IGPYVNLTTIMMGGKLDYTPSFITTYFQGFQA-IGITHLTSSPGC-NITAPLPGALDKAV 487
Query: 512 RIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSV-ARTSKRPVILVLTGGGP 570
+IA +AD V++ GL E E DR +L LP Q L ++ A +++VL GGP
Sbjct: 488 QIATQADLVVLTLGLSSDIEHEGGDRETLGLPTPQQDLYDAISAAIPSSKLVVVLVNGGP 547
Query: 571 LDVS-FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMN 628
+ V ++ +I+ Y G++ ALAE IFG NP G LP T + + T VP
Sbjct: 548 VSVDRIKYGIARTPTIIEAFYGGQSAGTALAETIFGQNNPSGTLPYTVFFSNITAHVPFT 607
Query: 629 DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDK 688
DM++R D++ +PGR++RF+ ++ FGHGLSY+ +S A D+
Sbjct: 608 DMHLRPDAATGFPGRTHRFFDAPVMWPFGHGLSYSTFSL----------------AWQDE 651
Query: 689 NILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEK 748
+ T + T T + + ++VTN G + G + L+ VP + P +
Sbjct: 652 TVPSIT--------TGDFTQPTLMHQLLSVNVTNHGPLPGRRALHLYVTVPVTNVSVPLR 703
Query: 749 QLIGFDR 755
L+G +
Sbjct: 704 NLVGLQK 710
>gi|150019484|ref|YP_001311738.1| glycoside hydrolase family protein [Clostridium beijerinckii NCIMB
8052]
gi|149905949|gb|ABR36782.1| glycoside hydrolase, family 3 domain protein [Clostridium
beijerinckii NCIMB 8052]
Length = 709
Score = 376 bits (965), Expect = e-101, Method: Compositional matrix adjust.
Identities = 244/761 (32%), Positives = 369/761 (48%), Gaps = 117/761 (15%)
Query: 64 RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
+AK L+ +TL+EK +QL+ +SA+ RL +P Y WW+E LHG+A G + T
Sbjct: 15 KAKELVGKMTLEEKAEQLTYKSSAVKRLNVPRYNWWNEGLHGVARAG----------TAT 64
Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFWAPNINIFRDP 175
FPQ + AA F+ L + I ++ E RA YN G+TFW+PN+NIFRDP
Sbjct: 65 VFPQAIGLAAMFDDELLNYIAKVISTEGRAKYNENSKKDDRDIYKGITFWSPNVNIFRDP 124
Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
RWGRG ET GEDP + S V FVK QGE G
Sbjct: 125 RWGRGHETYGEDPYLTSRLGVAFVKGLQGE----------------------------GK 156
Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
L +AC KH + + R+ F+A+++++D +T+ P F +C+++G +M +Y
Sbjct: 157 YLKAAACAKHFAVHSGPEG---LRHEFDAVVSKKDLYETYLPAFEACVKEGDVEAVMGAY 213
Query: 296 NQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
N+ NG P C L + R +W FKG++ SDC A+A + T T +SAA +K G
Sbjct: 214 NRTNGEPCCGSKTLLRDILRGKWNFKGHVVSDCWAIADFHLHHRVTSTATESAALAMKNG 273
Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDD 414
D+NCG L+ A +G V E+DI A L + ++RLG+F+ + +Y K+ +
Sbjct: 274 CDLNCGNVYLQ-LLLAYKEGLVTEEDITTAAERLMATRIRLGMFD---EECEYNKIPYEL 329
Query: 415 VCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPC 474
EH +L+L AAR +VLLKN+ LPLNKN + S+A+IGP ++ + G Y+G
Sbjct: 330 NDCKEHNELSLKAARNSMVLLKNN-GILPLNKNNLKSIAVIGPNADSQIMLKGNYSGTAS 388
Query: 475 SPKSLLRGLEAYVS---KTHYASGCH------DVPCNSDAGFHEAVRIAKKADFVIVVAG 525
++L G+ V + +Y+ GCH + + EA+ IA+++D I+ G
Sbjct: 389 RYITVLEGIHEAVGEDVRVYYSEGCHLFRDRVEELAEPNDRLKEAISIAERSDVAILCLG 448
Query: 526 LDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFA 576
LD T E E D+ SL LPG+Q L+ + T PVILV+ G L +F
Sbjct: 449 LDSTIEGEQGDAGNSEGAGDKASLNLPGRQQELLEKIIETGT-PVILVIGAGSAL--TFN 505
Query: 577 EADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADS 636
A+ + S+IL YPG G +A+A++IFG +P G+LP+T+Y + D +M+
Sbjct: 506 NAEDKCSAILDAWYPGSRGGRAVADLIFGKCSPSGKLPITFYRNTKDLPEFIDYSMK--- 562
Query: 637 SRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGS 696
R+YR+ + +Y FG+GL+Y+
Sbjct: 563 -----DRTYRYMSCESLYPFGYGLTYSTV------------------------------- 586
Query: 697 RLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRV 756
+L +H+ +V S V + +TN G+ D V+ + + + L GF RV
Sbjct: 587 KLSELHVPDVKSDFE-DVEVSVKITNTGNFDIEEVIQCYIKDLESKYAVRNHSLAGFKRV 645
Query: 757 HTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
SK + + N G RIL L VG
Sbjct: 646 RLKIGESKIAKMKIKK-SSFEVVNDDGERILDSKRFKLFVG 685
>gi|295134875|ref|YP_003585551.1| beta-glucosidase [Zunongwangia profunda SM-A87]
gi|294982890|gb|ADF53355.1| beta-glucosidase [Zunongwangia profunda SM-A87]
Length = 735
Score = 376 bits (965), Expect = e-101, Method: Compositional matrix adjust.
Identities = 256/807 (31%), Positives = 397/807 (49%), Gaps = 122/807 (15%)
Query: 13 RILFRVLQLIVVVNVIAFSNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLL 72
R LF ++ ++ + ++ +K ++K +F F +T LS+ R LIS L
Sbjct: 5 RQLFAIMLMLFLTAIMHAQQTK--IDKSEFD----------FYDTDLSMDERIDDLISRL 52
Query: 73 TLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSA 132
TL+EK QQ+ + + AI RLGIPAY+WW+E+LHG+ +G T FPQ +
Sbjct: 53 TLEEKAQQMLNASPAIERLGIPAYDWWNEALHGLGRSG----------VATVFPQAIGMG 102
Query: 133 ASFNRSLWSNIGSAVAVEARAMYNLG--------QAGLTFWAPNINIFRDPRWGRGQETP 184
A+F+ L + +A++ EARA +N GLTFW PN+NIFRDPRWGRGQET
Sbjct: 103 ATFDDDLILKVSTAISDEARANFNNAVKHGYHRKYGGLTFWTPNVNIFRDPRWGRGQETY 162
Query: 185 GEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCK 244
GEDP + S FVK QG+N K L +A K
Sbjct: 163 GEDPYLTSKLGEAFVKGLQGDNDKY---------------------------LKTAAAAK 195
Query: 245 HLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC 304
H Y + R+ FNA ++E+D +T+ P F++ ++ IMC+YN NG P C
Sbjct: 196 H---YAVHSGPEKLRHEFNADVSEKDLWETYLPAFKTLVD-ANVETIMCAYNSTNGEPCC 251
Query: 305 LRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCM 363
L R++WGF G++ SDC A+ + ++ E +AA ++ G+++NCG
Sbjct: 252 ANNRLINDILRDKWGFNGHVVSDCWALQDFVSGHDIVESPEAAAALAVEVGIELNCGDTY 311
Query: 364 LRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-YGKLGPDDVCTSEHKK 422
A++ G V E+ +D+ L L + +LGLF DP + Y K+G + + + EH+
Sbjct: 312 -NFLAKAVEDGLVSEELVDKRLHKLLETRFKLGLF--DPEESNPYNKIGVEVMNSDEHRA 368
Query: 423 LALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRG 482
LA + AR+ IVLLKND LPL KN +S I GP NI + G Y G+ ++L G
Sbjct: 369 LARETARKSIVLLKNDG-VLPL-KNNLSKYFITGPNATNIEVLLGNYHGVNPDMVTVLEG 426
Query: 483 LEAYV---SKTHYASGCH-DVPCNSDAGFHEAVRIAKKADFVIV-VAGLDLTQETED--- 534
+ + S+ Y G ++P + + A FV++ ++GL +E E
Sbjct: 427 IAKAIKPESQLQYRMGTRLNLPNENPQDWASPNAGNSDATFVVMGISGLLEGEEGESIAS 486
Query: 535 ---RDRVSLLLPGQQMSLVTSVARTSK-RPVILVLTGGGPLDVSFAEADSQISSILWIGY 590
DR+ LP Q+ + V+ ++ RPV+ ++TGG P++++ E ++L + Y
Sbjct: 487 PTFGDRMDYNLPQNQIDYLQKVSEAAEDRPVVAIVTGGSPMNLT--EVHKLADAVLLVWY 544
Query: 591 PGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTG 650
PGE G A+A+IIFG +P GRLP+T+ PM ++ A GR+Y++
Sbjct: 545 PGEEGGNAVADIIFGKNSPSGRLPITF--------PMTIEDLPAYEDYTMEGRTYKYMDV 596
Query: 651 TQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCT 710
+Y FG+GLSYT++ Y SE+ +S +
Sbjct: 597 VPMYPFGYGLSYTDFEY------SEIKLSKDKIKKKES---------------------- 628
Query: 711 SLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGV 770
+ISVTN GD + VV ++ + K S P +L+ F +H SKE++F +
Sbjct: 629 ---VEARISVTNTGDFEADEVVQVYLKDVKASSRVPNFELVAFKNIHLKRGESKELTFEI 685
Query: 771 DPCEQLSIANKHGRRILPLGNHVLMVG 797
P E LS + +G+ L G + +G
Sbjct: 686 TP-EMLSFIDDNGKEKLEKGAFEIYIG 711
>gi|373952439|ref|ZP_09612399.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
gi|373889039|gb|EHQ24936.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
Length = 721
Score = 372 bits (956), Expect = e-100, Method: Compositional matrix adjust.
Identities = 254/808 (31%), Positives = 384/808 (47%), Gaps = 133/808 (16%)
Query: 14 ILFRVLQLIVVVNVIAFSNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLT 73
I+F VL + ++ AF P+ PD +STR + LIS LT
Sbjct: 8 IIFAVLTSLGLIKT-AFCQQIPIYRNPD-----------------KKLSTRVQDLISRLT 49
Query: 74 LQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAA 133
L EK+ L + A+PRL IPAY WW+E LHG+A G T FPQ + AA
Sbjct: 50 LAEKVSLLGYRSQAVPRLNIPAYNWWNEGLHGVARAG----------EATIFPQAIAMAA 99
Query: 134 SFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFWAPNINIFRDPRWGRGQETPG 185
+F+ +L + + V+ EARA YNL A GLTFW+PNINIFRDPRWGRGQET G
Sbjct: 100 TFDDNLVKQVANVVSTEARAKYNLSTAMGRHLQYMGLTFWSPNINIFRDPRWGRGQETYG 159
Query: 186 EDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELML--SACC 243
EDP + S +V QG D L L SA
Sbjct: 160 EDPFLTSKMGNAYVHGLQGT-----------------------------DPLHLKTSATA 190
Query: 244 KHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPA 303
KH +A+ + R F+A++ E+D DT+ F+S ++ G S IM +YN+VNGVP
Sbjct: 191 KHFVAHSGPEG---ERDYFDALVDEKDLRDTYLYAFKSLVDGGVES-IMTAYNRVNGVPN 246
Query: 304 CLRGDLFQK-ARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTC 362
+ L EWGFKG++ +DC A+ +++ + AA +KAG+D++C +
Sbjct: 247 SINKTLVNDIVIKEWGFKGHVVTDCGALDDVYKTHKVLPNRMEVAAAAIKAGVDLDCSSI 306
Query: 363 MLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKK 422
+AI+ + EK +D AL + S Q +LG F+ P + G D + H
Sbjct: 307 FQTDIINAINNKLLTEKQVDAALAAVLSTQFKLGFFDA-PSSSPFYSFGADSIHNDSHVM 365
Query: 423 LALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRG 482
LA A++ +VLLKNDK+ LPL SS+ ++GP ++ + Y G+ + + G
Sbjct: 366 LARQMAQKSMVLLKNDKQILPLKMQNYSSIMVVGPNAASLDALVASYHGVSSKAVNFVEG 425
Query: 483 LEAYVSKTHYASGCHDVPCN-SDAGFHEAVRIAKKADFVIVVAGLDLTQETE-------- 533
+ A V K +D+ + D + A AD + V GL E E
Sbjct: 426 ITAAVDKGTRVE--YDLGADYRDTTHFGGIWGAGNADVTVAVIGLTPVLEGEAGDAFLSQ 483
Query: 534 -DRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPG 592
D+ L LP ++ + ++ ++ K+P+I V+T G +D++ + + W YPG
Sbjct: 484 TGGDKKDLSLPAGDIAFMKALRKSVKKPIIAVVTSGSDVDIAAIAPYADAVILAW--YPG 541
Query: 593 EAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGT 651
E G ALA+I+FG +P G LP+T+Y S +P N+ +M+ GR+YR++ G
Sbjct: 542 EQGGNALADILFGKISPSGHLPLTFY-NSVNDLPAYNNYSMK--------GRTYRYFAGA 592
Query: 652 QVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTS 711
Y FG GLSYT ++Y++ P + S K+ +Q
Sbjct: 593 VQYPFGFGLSYTTFNYQWQQQPKT--------SYSAKDTIQ------------------- 625
Query: 712 LRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVD 771
+ + V N G++ VV + P +++ P K+L GF R+ T+ KGS ++
Sbjct: 626 ----LSVVVKNTGNISADEVVQAYIGYPTLNR-MPLKELKGFKRI-TLNKGSTSLASISI 679
Query: 772 PCEQLSIAN--KHGRRILPLGNHVLMVG 797
P +L N KH + P GN+ + +G
Sbjct: 680 PVTELQKWNSSKHQFELYP-GNYTVYLG 706
>gi|116197206|ref|XP_001224415.1| hypothetical protein CHGG_05201 [Chaetomium globosum CBS 148.51]
gi|88181114|gb|EAQ88582.1| hypothetical protein CHGG_05201 [Chaetomium globosum CBS 148.51]
Length = 735
Score = 372 bits (956), Expect = e-100, Method: Compositional matrix adjust.
Identities = 249/735 (33%), Positives = 358/735 (48%), Gaps = 82/735 (11%)
Query: 82 SDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWS 141
+ N+ + RLG+ AY+WW+E+LHG+A N G+ + G S+ T FPQ + S+A+F+ L
Sbjct: 42 ASNSPGVSRLGLSAYQWWNEALHGVAHNR-GITWGGQFSAATQFPQAITSSAAFDDHLIE 100
Query: 142 NIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKS 201
IG ++ EARA N G+A L FW PN+N FRDPRWGRG ETPGED +A FV+
Sbjct: 101 RIGVIISTEARAFANNGRAHLDFWTPNVNPFRDPRWGRGHETPGEDAFRNKKWAEAFVQG 160
Query: 202 FQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYS 261
QG E + R + A CKH AYDLE G+ +R++
Sbjct: 161 MQGT-----------------------ESTHR-----VIATCKHYAAYDLENSGSTTRFN 192
Query: 262 FNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFK 320
F+A ++ QD + + PPF+ C K IMCSYN VNGVPAC L R W +
Sbjct: 193 FDAKVSTQDLAEYYLPPFQQCARDSKVGSIMCSYNAVNGVPACASPYLMDTILRKHWNWT 252
Query: 321 G---YITSDCDAVATIFEY---QNYTKTHEDSAAGVLKAGMDINCGTC--MLRHTQSAID 372
Y+ SDCDAV + Y ++ + L+AG D C SA +
Sbjct: 253 DQNQYVVSDCDAVYYLGNANGGHRYKSSYAAAIGASLEAGCDNMCWATGGTTPDPASAFN 312
Query: 373 KGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGI 432
+ + +D+A+L ++ G F+G Y L DV T + AL AA +GI
Sbjct: 313 SRQFTQATLDKAMLRQMQGLVKAGYFDGP--NSLYRNLTAADVNTQVARDTALKAAEEGI 370
Query: 433 VLLKNDKKFLPLNKNAVSS-LAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH 491
VLLKND LPL ++ +A+IG N +M GGY+G P + + +
Sbjct: 371 VLLKND-NILPLTLGGSNTQVAMIGFWANAADKMLGGYSGSPPFSHDPVTAARSMGITVN 429
Query: 492 YASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVT 551
Y +G ++A AV A+K+ VI G+D T E E +DR S+ P Q++++
Sbjct: 430 YVNGPL---TQTNADTSAAVNAAQKSSVVIFFGGIDNTVEKESQDRTSIAWPSGQLTMIQ 486
Query: 552 SVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGG 611
+A+T K PVI+V G D + +ILW GYPG+ G A+ +I G +P G
Sbjct: 487 RLAQTGK-PVIVVRMGTHVDDTPLLSI-PNVKAILWAGYPGQDGGTAVMNLITGLASPAG 544
Query: 612 RLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFL 670
RLP+T YP S+T + P +M +R SS YPGR+YR+Y V+ FGHGL YTN+S L
Sbjct: 545 RLPVTVYPSSYTNQAPYTNMALRPSSS--YPGRTYRWYK-DPVFPFGHGLHYTNFSVAPL 601
Query: 671 SAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRF-------HVQISVTNA 723
P+ +I+ D + SC + + V +SVTN
Sbjct: 602 DFPATFSIA------------------------DLLASCKGVTYLELCPFPSVSVSVTNT 637
Query: 724 GDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHG 783
G +VV+ F P K L + RV V G + + E L+ + G
Sbjct: 638 GSRASDYVVLGFLAGDFGPTPRPIKSLATYKRVFDVQPGKTQSAELDWKLESLARVDGKG 697
Query: 784 RRILPLGNHVLMVGE 798
R+L G + L++ +
Sbjct: 698 NRVLYPGTYTLLLDQ 712
>gi|291240563|ref|XP_002740191.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 747
Score = 372 bits (955), Expect = e-100, Method: Compositional matrix adjust.
Identities = 252/748 (33%), Positives = 358/748 (47%), Gaps = 109/748 (14%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDN-------ASAIPRLGIPAYEWWSESLH 104
+PF NTSL S R L+ LTL+E + Q+S A I RLGI Y W +E LH
Sbjct: 25 FPFRNTSLPWSERVDDLVGRLTLEEIVLQMSRGGTGSNGPAPPIDRLGIGPYSWNTECLH 84
Query: 105 GIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL------- 157
G + GP TSFPQ AA+F+ L I +A A E RA YN
Sbjct: 85 GDVAAGPA----------TSFPQAFGLAATFDAVLIEQIANATAYEVRAKYNNYAKHKEY 134
Query: 158 -GQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGF 216
GL+ ++P INI R P WGR QET GEDP + A +V QG + +
Sbjct: 135 GDHKGLSCFSPVINIARHPLWGRIQETYGEDPYLSGTLAASYVNGLQGNHPRY------- 187
Query: 217 GFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQ 276
+ +A CKH AY + SR +F+A ++++D TF
Sbjct: 188 --------------------VTANAGCKHFDAYAGPEDIPSSRSTFDAKVSDRDLRMTFL 227
Query: 277 PPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFE 335
P F CI+ G S +MCSYN +NGVPAC L R EW F GY+ SD AV +++
Sbjct: 228 PAFHECIQAGTHS-LMCSYNSINGVPACANKKLLTDILRTEWNFTGYVISDQSAVEKVYD 286
Query: 336 YQNYTKTHEDSAAGVLKAGMDI----NCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSV 391
+YTK D+A + +G+++ N ++ T A+ +G V K + + LF
Sbjct: 287 AHHYTKDMLDTAIACVNSGLNLELSSNLEDNVMMQTTKAVKQGNVTMKTVKARVSPLFYT 346
Query: 392 QLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSS 451
++RLG F+ P Y KL + + EH++L+L AA + VLLKN+ +FLPL K +
Sbjct: 347 RMRLGEFD-PPEMNPYSKLDLSIIQSQEHQELSLKAAAKSFVLLKNENRFLPL-KEKIDK 404
Query: 452 LAIIGPLVNNISQMGGGYTGIPCSPKSLLR-GLEAYVSKTHYASGCHDVPCNS-DAGFHE 509
LA++GPL +N+ + G Y+ P + R GL T YASGC + C D+G +
Sbjct: 405 LAVVGPLADNVDALYGDYSATPNNYTVTPRNGLARLAGNTSYASGCDNPKCRKYDSG--Q 462
Query: 510 AVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGG 569
AD V+V G E+E DR L LPG+Q+SL+ + +PVIL+L G
Sbjct: 463 VKSAVSGADMVVVCVGTGTDIESEGNDRHELALPGKQLSLLQDAVKFGTKPVILLLFNAG 522
Query: 570 PLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFG---DFNPGGRLPMTWYPESFTKV- 625
PLDVS+A + + +I+ +P +A AL + + NP GRLPMTW P S +V
Sbjct: 523 PLDVSWAVENPAVQTIVACFFPAQATGDALYRMFMNTSPESNPAGRLPMTW-PRSMEQVP 581
Query: 626 PMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAG 685
PM D M+ GR+YR+ ++ FG GLSYT + Y SA +
Sbjct: 582 PMTDYTMK--------GRTYRYSDADPLFPFGFGLSYTLFKYYNTSASPTV--------- 624
Query: 686 SDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT 745
+ SC ++ + ++VTN GD G V+ ++ S
Sbjct: 625 --------------------IKSCDTVT--IPLTVTNVGDFPGDEVMQVYISWSNASVTV 662
Query: 746 PEKQLIGFDRVHTV-AKGSKEISFGVDP 772
P+ QL+GF RV + S + F V P
Sbjct: 663 PKLQLVGFRRVREIEPSASATVHFAVLP 690
>gi|7671419|emb|CAB89360.1| beta-glucosidase-like protein [Arabidopsis thaliana]
gi|9758998|dbj|BAB09525.1| unnamed protein product [Arabidopsis thaliana]
Length = 411
Score = 371 bits (953), Expect = e-100, Method: Compositional matrix adjust.
Identities = 190/416 (45%), Positives = 263/416 (63%), Gaps = 10/416 (2%)
Query: 393 LRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSL 452
+RLG F+G+P+ YG LGP DVCT E+++LA++ ARQGIVLLKN LPL+ +A+ +L
Sbjct: 1 MRLGFFDGNPKNQPYGGLGPKDVCTVENRELAVETARQGIVLLKNSAGSLPLSPSAIKTL 60
Query: 453 AIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVR 512
A+IGP N M G Y G+ C + L+GLE V T Y GC +V C ++A A
Sbjct: 61 AVIGPNANVTKTMIGNYEGVACKYTTPLQGLERTVLTTKYHRGCFNVTC-TEADLDSAKT 119
Query: 513 IAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLD 572
+A AD ++V G D T E E DR+ L LPG+Q LVT VA+ ++ PV+LV+ GG D
Sbjct: 120 LAASADATVLVMGADQTIEKETLDRIDLNLPGKQQELVTQVAKAARGPVVLVIMSGGGFD 179
Query: 573 VSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMN 631
++FA+ D +I+SI+W+GYPGEAG A+A++IFG NP G+LPMTWYP+S+ KVPM +MN
Sbjct: 180 ITFAKNDEKITSIMWVGYPGEAGGIAIADVIFGRHNPSGKLPMTWYPQSYVEKVPMTNMN 239
Query: 632 MRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNIL 691
MR D S Y GR+YRFY G VY FG GLSYTN+S++ + AP + SL ++
Sbjct: 240 MRPDKSNGYLGRTYRFYIGETVYAFGDGLSYTNFSHQLIKAPKFV----SLNLDESQSCR 295
Query: 692 QQTGSRLDYV--HIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQ 749
LD + H ++ S F VQ+ V N GD +G+ V LF P+V G+P KQ
Sbjct: 296 SPECQSLDAIGPHCEKAVGERS-DFEVQLKVRNVGDREGTETVFLFTTPPEV-HGSPRKQ 353
Query: 750 LIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
L+GF+++ K + F VD C+ L + ++ G+R L LG+H+L VG L+HS I
Sbjct: 354 LLGFEKIRLGKKEETVVRFKVDVCKDLGVVDEIGKRKLALGHHLLHVGSLKHSFNI 409
>gi|302669556|ref|YP_003829516.1| beta-xylosidase [Butyrivibrio proteoclasticus B316]
gi|302394029|gb|ADL32934.1| beta-xylosidase Xyl3A [Butyrivibrio proteoclasticus B316]
Length = 709
Score = 371 bits (953), Expect = e-100, Method: Compositional matrix adjust.
Identities = 245/764 (32%), Positives = 380/764 (49%), Gaps = 118/764 (15%)
Query: 64 RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
RAK L++ +T++EK QL +A AI RLGIPAY WW+E+LHG+A G + T
Sbjct: 9 RAKELVAKMTVEEKASQLRYDAPAIDRLGIPAYNWWNEALHGVARAG----------TAT 58
Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFWAPNINIFRDP 175
FPQ + AA+F+ L S +G +A EARA YN GLTFWAPN+NIFRDP
Sbjct: 59 MFPQAIGLAAAFDEELMSEVGEVIAEEARAKYNEQSKREDRDIYKGLTFWAPNVNIFRDP 118
Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
RWGRG ET GEDP + S AV FVK+ QG+ G+
Sbjct: 119 RWGRGHETYGEDPFLTSRLAVPFVKAMQGD----------------------------GE 150
Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
+ +AC KH + + R+ F+A +++D E+T+ P F + +++ + +M +Y
Sbjct: 151 YMKAAACAKHFAVHSGPEG---ERHFFDAKASKKDLEETYLPAFEALVKEAEVEAVMGAY 207
Query: 296 NQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
N+ NG P C L R +WGF+G+ SDC A+ E T + E+SA L+ G
Sbjct: 208 NRTNGEPCCANKPLMVDTLRGKWGFQGHFVSDCWAIKDFHENHKVTSSPEESAKLALEMG 267
Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDD 414
D+NCG C + + + G + EK I + LF+ + LG+F+ K ++ ++ +
Sbjct: 268 CDLNCG-CTYQSIMNGVRAGLIDEKLITESCERLFTTRFLLGMFD----KTEFDEIPYEK 322
Query: 415 VCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPC 474
V EH +A AAR+ +VLLKND LPLNK+++ ++ ++GP N+ + G Y G
Sbjct: 323 VECKEHLAVAKRAARESVVLLKNDG-LLPLNKDSIKTIGVVGPNANSRLSLIGNYHGTSS 381
Query: 475 SPKSLLRGLEAYVS---KTHYASGCHDVPCNSDAGF---------HEAVRIAKKADFVIV 522
++L G++ V + Y+ GC D+ N+ + EA +A +D V+V
Sbjct: 382 RYITVLEGIQDKVGDDVRVLYSEGC-DIFQNNISNLADPNLPDRLSEAQAVADHSDVVVV 440
Query: 523 VAGLDLTQETEDRD---------RVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
V GLD E E+ D +++L LP Q L+ +V K P I++ G +D+
Sbjct: 441 VVGLDENLEGEEGDAGNQFASGDKINLNLPLSQRQLLNAVLDCGK-PTIVIDMAGSAIDL 499
Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMR 633
S +A + +++L YPG G +A+I+FGD +P G+LP+T+Y + D +M+
Sbjct: 500 S--KAQDEANAVLQAFYPGARGGADVADILFGDVSPSGKLPVTFYKSADDLPDFKDYSMK 557
Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
R+Y+++TGT +Y FG+GL+Y + +K D N+
Sbjct: 558 --------NRTYKYFTGTPLYPFGYGLTYGD---------------CYVKPDYDFNV--- 591
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
Y D+V+ + ++V N G +D VV L+ + T L+GF
Sbjct: 592 -----KYADADKVSGA-----EITVTVVNDGKLDTDEVVQLYIKDMDSYFATTNPSLVGF 641
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
RVH A G ++ V + + N+ G R + N L G
Sbjct: 642 KRVHVPAGGETRVTLTVSE-KAFTSVNEEGERAVFGKNFRLYAG 684
>gi|365135698|ref|ZP_09343911.1| hypothetical protein HMPREF1032_03710 [Subdoligranulum sp.
4_3_54A2FAA]
gi|363612160|gb|EHL63713.1| hypothetical protein HMPREF1032_03710 [Subdoligranulum sp.
4_3_54A2FAA]
Length = 643
Score = 371 bits (953), Expect = e-100, Method: Compositional matrix adjust.
Identities = 232/671 (34%), Positives = 356/671 (53%), Gaps = 90/671 (13%)
Query: 61 ISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVS 120
+ RA++L++ +TL+EK+ Q+ +A AI RLGIPAY WW+E LHG+ +G
Sbjct: 4 FAQRARALVAQMTLEEKVSQMRYDAPAIERLGIPAYNWWNECLHGVGRSG---------- 53
Query: 121 SVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN----LGQA----GLTFWAPNINIF 172
+ T FPQ + AASF+ SL ++ A++ EARA YN G+ GLTFW+PNIN+F
Sbjct: 54 TATVFPQPIGMAASFDESLLEHVAQAISDEARAKYNQYKTFGETGIYQGLTFWSPNINLF 113
Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
RDPRWGRG ET GEDP++ F++ Q E +
Sbjct: 114 RDPRWGRGHETYGEDPLLTGRMGTAFIRGLQ--------------------------EGE 147
Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
L A KH A+ + G R+SFNA ++ +D D++ FR CIE K + +M
Sbjct: 148 DSQYRKLDATVKHFAAHSGPEAG---RHSFNAEVSAEDMADSYLWAFRYCIEHAKPAAVM 204
Query: 293 CSYNQVNGVPAC-----LRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSA 347
+YN++NG PAC L+G L++ EW F GY+ SDC A+ I E + TK ++SA
Sbjct: 205 GAYNRINGEPACASSTYLKGVLYE----EWKFDGYVVSDCGAIQDINENHHVTKNEKESA 260
Query: 348 AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKY 407
A + G +NCG ++A++ G + E + A+ LF + RLG+F+ D Y
Sbjct: 261 ALAVNNGCQLNCGKAY-HWVKAAVEDGLISEDTVTCAVERLFEARFRLGMFDSD---CVY 316
Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
+ + + +H++L A++ IVLLKN+ LPLN ++A+IGP ++ + + G
Sbjct: 317 DSIPMNVIECRKHRELNRKMAQESIVLLKNN-GILPLNPE--KTIAVIGPNADDKTVLLG 373
Query: 468 GYTGIPCSPKSLLRGLEAYV-SKTHYASGCHDVPCN----SDAGFHEAVRIAKKADFVIV 522
Y G P +LLRG++ + +YA G V ++ HEA+ AK AD V++
Sbjct: 374 NYNGTPSHWTTLLRGIQDQARGEVYYARGSVLVEKEALPWAEKPLHEAIYTAKAADVVVL 433
Query: 523 VAGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
GL E E+ DR + LP Q L+ ++ T K PV+LV GG +D+
Sbjct: 434 CLGLSPLLEGEEGDAYNGADSGDRKDISLPDIQQQLLCAILDTEK-PVVLVNVSGGCVDL 492
Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMR 633
+AD + ++IL YPG G ALA+I+FG +P GRLP+T+Y P D +M+
Sbjct: 493 R--QADERCAAILQCFYPGAEGGNALADILFGRVSPSGRLPVTFYRTVEDLPPFTDYSMK 550
Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQ- 692
GR+YRF+ G +Y FGHGL+Y + ++ + P + + + + + +IL+
Sbjct: 551 --------GRTYRFFDGKPLYPFGHGLTYADIKEQW-TDPYTVRVKNASETATGYSILRY 601
Query: 693 QTGSRLDYVHI 703
+ G+ D+ HI
Sbjct: 602 ENGALADFKHI 612
>gi|402493386|ref|ZP_10840139.1| beta-glucosidase [Aquimarina agarilytica ZC1]
Length = 734
Score = 371 bits (952), Expect = e-100, Method: Compositional matrix adjust.
Identities = 255/773 (32%), Positives = 386/773 (49%), Gaps = 124/773 (16%)
Query: 51 SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
++ + +T+ S RAK+L++ LTL+EKI + D ++ I RL IP Y WW+E LHG+A NG
Sbjct: 38 NFEWFDTNKSFEKRAKALVASLTLEEKISLMVDQSAPIDRLNIPEYNWWNECLHGVARNG 97
Query: 111 PGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN----LGQ----AGL 162
T FPQ + AA+F++ L + A++ EARA +N +G AGL
Sbjct: 98 ----------RATVFPQAIGLAATFDQDLIFKVADAISTEARAKFNASIAIGNRGKYAGL 147
Query: 163 TFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKR 222
TFW PNINIFRDPRWGRGQET GEDP + S V FVK QG + K
Sbjct: 148 TFWTPNINIFRDPRWGRGQETYGEDPYLTSQIGVNFVKGLQGNHPKY------------- 194
Query: 223 VLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSC 282
L +AC KH Y + R+ F+A+ +++D +T+ P F +
Sbjct: 195 --------------LKSAACAKH---YAVHSGPEELRHEFDAIASKKDMAETYLPAFEAL 237
Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTK 341
+++ K +M +YN+VNG AC L +K ++ WGFKGYI SDC A++ + ++ T+
Sbjct: 238 VKEAKVEGVMGAYNRVNGEGACASPYLLEKLLKDTWGFKGYIVSDCWALSDLHKFHKVTQ 297
Query: 342 THEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
T E+SAA L G+++NCG + AI +G EK +D L + + +LG F D
Sbjct: 298 TAEESAAAALNVGLNVNCGN-VYPALDGAIKQGLTSEKQLDNVLQHQLLTRFKLGFF--D 354
Query: 402 PRKGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVN 460
P Y K+ D V + H+ +AL+AA++ IVLLKN+ LPL K+ + S+ + GP
Sbjct: 355 PSNNNPYNKITTDVVDSEAHRAIALEAAQKSIVLLKNNNNLLPLKKD-LKSVYVAGPNAA 413
Query: 461 NISQMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCHDVPCNSDAGFHEAVRIAKKA 517
+ G Y G+ +++L G+ + VS +Y G N + I+ +A
Sbjct: 414 REDVLLGNYYGVTSKTQTILDGIVSKVSAGTSINYKQGLLPFQKNVNPIDWSTGEIS-RA 472
Query: 518 DFVIVVAGLDLTQETED---------RDRVSLLLPGQQMSLVTSV-ARTSKRPVILVLTG 567
D I+V GL E E+ DRV + LP Q+ + + A+ + P++LVLTG
Sbjct: 473 DVGIIVMGLSGNYEGEEGEAIASESKGDRVDIRLPQNQIDYIKKIKAKNTGNPLVLVLTG 532
Query: 568 GGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV-P 626
G P ++ E + +I++ YPGE G +A+A+I+FGD P G+LP+T +P+S + P
Sbjct: 533 GSP--IAMPEVYDLVDAIVFAWYPGEEGGQAVADILFGDVVPSGKLPIT-FPKSVDDLPP 589
Query: 627 MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGS 686
ND M+ GR+Y++ T T + FG GLSYT++ Y L E
Sbjct: 590 YNDYAMK--------GRTYKYMTKTPQFPFGFGLSYTSFKYDNLKVYKE----------- 630
Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
S+TN G+VD V ++ P +G P
Sbjct: 631 ----------------------------KASFSITNNGNVDAEEVAQVYVSSPNAGKGDP 662
Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIA--NKHGRRILPLGNHVLMVG 797
L+GF RV A +K++S P + + + G+ I G + + VG
Sbjct: 663 LNTLVGFTRVSLKAGATKQVSI---PFSKKAFVQFDSDGKEITRKGTYTIHVG 712
>gi|291518645|emb|CBK73866.1| Beta-glucosidase-related glycosidases [Butyrivibrio fibrisolvens
16/4]
Length = 713
Score = 370 bits (951), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 238/761 (31%), Positives = 369/761 (48%), Gaps = 114/761 (14%)
Query: 64 RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
RAK L+S +T++EK Q+ +A AI RLGIP Y WW+E+LHG+A G T
Sbjct: 8 RAKELVSQMTIEEKCSQMLHHAEAIDRLGIPKYCWWNEALHGVARAG----------DAT 57
Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFWAPNINIFRDP 175
FPQ + A+F+ L + + E RA YN GLT+WAPN+NIFRDP
Sbjct: 58 VFPQAIGLGATFDEELVEKVADVTSTEGRAKYNEFTKHGDRDIYKGLTYWAPNVNIFRDP 117
Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
RWGRG ET GEDP + + +V+ QG++ D
Sbjct: 118 RWGRGHETYGEDPYLTGQLGMAYVRGLQGDDL---------------------------D 150
Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
+AC KH + + R+ F+A + +QD DT+ F+ ++ K +M +Y
Sbjct: 151 NPKSAACAKHFAVHSGPE---AERHHFDAKVNDQDLYDTYLYAFKRLVKDAKVEAVMGAY 207
Query: 296 NQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
N+VNG PAC L + R +WGF+G++ SDC A+ E T +SAA + G
Sbjct: 208 NRVNGEPACGSKRLLKDILRGDWGFEGHVVSDCWAIRDFHENHKVTGCEVESAALAVNNG 267
Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDD 414
D+NCG C+ A V E+ I ++ L ++LRLG R+ KY + +
Sbjct: 268 CDLNCG-CVYEKLLYAYKANLVTEETITESVERLIELRLRLGTL--PERRSKYDDIPYEV 324
Query: 415 VCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPC 474
V EHK+LA++AA++ +VLLKND LPL K+ + ++ +IGP N+ + G Y GI
Sbjct: 325 VECKEHKELAIEAAKRSMVLLKNDG-LLPLKKDEIKTIGVIGPNSNSRMALVGNYEGISS 383
Query: 475 SPKSLLRGLEAYVS---KTHYASGC------HDVPCNSDAGFHEAVRIAKKADFVIVVAG 525
++L G++ YV + ++ G V + F EA+ +A+ +D V++ G
Sbjct: 384 EYITVLEGIQQYVGDDVRVFHSDGTPLWKDRMHVLSEARDTFAEAMAVAEHSDVVVLAMG 443
Query: 526 LDLTQETEDRD---------RVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFA 576
LD T E E+ D + L LPG Q L+ + K PV+L++ G +D+S+A
Sbjct: 444 LDSTIEGEEGDAGNEFGSGDKKGLKLPGLQQELLEKITAIGK-PVVLLVLAGSAMDLSWA 502
Query: 577 EADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADS 636
+ +++I+ YPG G KA+A+++FG+ +P G+LP+T+Y P D +M
Sbjct: 503 --NENVNAIMHCWYPGARGGKAIAQVLFGEDSPSGKLPLTFYKSDADLPPFEDYSME--- 557
Query: 637 SRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGS 696
GR+YR++ GT +Y FG+GLSY++ Y AG
Sbjct: 558 -----GRTYRYFKGTPLYPFGYGLSYSDIQYS--------------NAG----------- 587
Query: 697 RLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRV 756
ID+ +F V+++V NAGD V ++ + + S L +V
Sbjct: 588 ------IDKTEGAIGDKFTVKVTVKNAGDYKAHETVQVYVKDVEASTRVANCSLRKIAKV 641
Query: 757 HTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
+ SKE+S + +I ++ G I+ G + VG
Sbjct: 642 ELLPGESKEVSLEL-SARDFAIIDEKGHCIVEPGKFKVFVG 681
>gi|150019782|ref|YP_001312036.1| glycoside hydrolase family protein [Clostridium beijerinckii NCIMB
8052]
gi|149906247|gb|ABR37080.1| glycoside hydrolase, family 3 domain protein [Clostridium
beijerinckii NCIMB 8052]
Length = 709
Score = 370 bits (950), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 242/760 (31%), Positives = 370/760 (48%), Gaps = 117/760 (15%)
Query: 65 AKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTS 124
AK L+S +TLQEK +QL+ + AI L +P Y WW+E LHG+A G + T
Sbjct: 16 AKELVSKMTLQEKAEQLTYQSPAIKHLNVPEYNWWNEGLHGVARAG----------TATV 65
Query: 125 FPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFWAPNINIFRDPR 176
FPQ + AA F+ + + +A E RA YN GLT+W+PNINIFRDPR
Sbjct: 66 FPQAIGLAAIFDDEFLGKVANIIATEGRAKYNEYSKKDDRGIYKGLTYWSPNINIFRDPR 125
Query: 177 WGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDE 236
WGRG ET GEDP + S V F+K QGE G
Sbjct: 126 WGRGHETYGEDPYLTSRLGVAFIKGLQGE----------------------------GKY 157
Query: 237 LMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYN 296
L L+AC KH + + R+ FNA++ ++D +T+ P F +C+++ +M +YN
Sbjct: 158 LKLAACAKHFAVHSGPEG---LRHEFNAVVNKKDLYETYLPAFEACVKEANVESVMGAYN 214
Query: 297 QVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGM 355
+ NG P C L + R +WGFKG++ SDC A+A + T T +S A ++ G
Sbjct: 215 RTNGEPCCGSKTLLKDILRGKWGFKGHVVSDCWALADFHLHHMVTSTATESVALAIENGC 274
Query: 356 DINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDV 415
D+NCG M + A +G V E+ I A L + + +LG+F+ + +Y K+ +
Sbjct: 275 DLNCGN-MYLNLLLAYKEGLVTEEQITTAAERLMTTRFKLGMFD---EECEYNKIPYEVN 330
Query: 416 CTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCS 475
+ EH ++AL A+R+ +VLLKN+ LPL+K+ + S+A+IGP N+ + G Y+G
Sbjct: 331 DSREHNEVALIASRKSMVLLKNNGT-LPLDKSNLKSIAVIGPNANSEIMLKGNYSGTASK 389
Query: 476 PKSLLRGLEAYVS---KTHYASGCH------DVPCNSDAGFHEAVRIAKKADFVIVVAGL 526
++L G+ V + +Y+ GCH + D EA+ +A+++D V++ GL
Sbjct: 390 YTTILEGIHDAVGNDVRVYYSEGCHLFKDKVEDLARPDDRLSEAISVAERSDVVVLCLGL 449
Query: 527 DLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAE 577
D T E E D+ +L LPG+Q +L+ V K PVI+VL G L ++ AE
Sbjct: 450 DSTIEGEQGDAGNSYGAGDKENLNLPGRQQNLLEKVLEVGK-PVIVVLGAGSALTLNGAE 508
Query: 578 ADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSS 637
+ ++IL YPG G A+A+I+FG +P G+LP+T+Y ++ D +M+
Sbjct: 509 --EKCAAILNAWYPGSHGGTAVADILFGKCSPSGKLPVTFYKDTAKLPDFTDYSMK---- 562
Query: 638 RQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSR 697
GR+YR+ +Y FG+GL+Y+ L PS + Q GS
Sbjct: 563 ----GRTYRYLGHESLYPFGYGLTYSTVELSNLQVPS---------------VKQGFGS- 602
Query: 698 LDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVH 757
F + I + N G+ D VV + + + L GF RV
Sbjct: 603 ----------------FDISIEIKNTGEYDIEEVVQCYVKDIESKYAVLNHSLAGFKRVS 646
Query: 758 TVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
SK ++ ++ + + N G R+L L VG
Sbjct: 647 LKKGESKIVTIKLNK-KSFEVVNDDGERLLDSKKFKLFVG 685
>gi|333995841|ref|YP_004528454.1| beta-glucosidase [Treponema azotonutricium ZAS-9]
gi|333737309|gb|AEF83258.1| periplasmic beta-glucosidase (Gentiobiase)(Cellobiase)
(Beta-D-glucoside glucohydrolase) [Treponema
azotonutricium ZAS-9]
Length = 706
Score = 370 bits (950), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 257/769 (33%), Positives = 384/769 (49%), Gaps = 123/769 (15%)
Query: 63 TRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSV 122
+R K +IS +TL+EK+ QLS +A A+ GIP Y WW+E LHG+A G
Sbjct: 5 SRIKEMISKMTLEEKVSQLSYDAPAVESAGIPKYNWWNECLHGVARAG----------LA 54
Query: 123 TSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFWAPNINIFRD 174
T FPQ + AA+F+ + ++ A++ E RA YN GLTFW PN+NIFRD
Sbjct: 55 TVFPQAIALAATFDEAFIRSVADAISDEGRAKYNEAVKRGNRSQYYGLTFWTPNVNIFRD 114
Query: 175 PRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRG 234
PRWGRGQET GEDP + + F+K QG D
Sbjct: 115 PRWGRGQETYGEDPYLTGRIGLAFMKGLQG---------------------------DDT 147
Query: 235 DELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCS 294
+ L ++AC KH Y + R++F+A+++++D +T+ P F+ +E G +M +
Sbjct: 148 EHLKVAACAKH---YAVHSGPEKLRHTFDAVVSKKDLFETYLPAFKLLVENG-VEAVMGA 203
Query: 295 YNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKA 353
YN+ G P L ++ R WGFKG++TSDC A+ E TK+ E+SAA L A
Sbjct: 204 YNRTLGEPCGGSTYLLKEILRGRWGFKGHVTSDCWAIRDFHENHKVTKSPEESAAMALNA 263
Query: 354 GMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPD 413
G D+NCG C + + KG V ++ ID AL L + +LGLF+ P + Y LG D
Sbjct: 264 GCDLNCG-CTYPYLTVSHKKGLVTDETIDTALTRLLRTRFKLGLFD-PPEQDPYRNLGND 321
Query: 414 DVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIP 473
V +H+ LAL+AA++ IVLLKND LPL+ +A L ++GP NI + Y G+
Sbjct: 322 IVGCEKHRNLALEAAQKSIVLLKNDSNILPLDDSARKIL-LMGPGAANILTLLANYYGMS 380
Query: 474 CSPKSLLRGL-EAYVSKT----HYASGC--------HDVPCNSDAGFHEA-VRIAKKADF 519
++L GL E +KT Y G +VP S EA + + D
Sbjct: 381 SRLVTILEGLAEKIKTKTAISFEYRQGSLMYEPNHLSNVPFGSTGVDAEAPIYGLDEIDL 440
Query: 520 VIVVAGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGP 570
VI V GLD + E E+ DR ++ LP Q++ + + + K+ V+L+LTGG P
Sbjct: 441 VIAVYGLDGSMEGEEGDSIASDANGDRDTIELPSWQLNFLRRIRKAGKK-VVLILTGGSP 499
Query: 571 LDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV-PMND 629
+ AD ++L+ YPGE G A+A+I+FGD +P G+LP+T +P+S ++ P +D
Sbjct: 500 IAFPEDLAD----AVLFAWYPGEQGGNAVADILFGDVSPSGKLPIT-FPQSTAQLPPYDD 554
Query: 630 MNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKN 689
++ GR+YR+ T +Y FG GLSYT S++F S EL+ S
Sbjct: 555 YALK--------GRTYRYMKETPLYPFGFGLSYT--SFRFDSV--ELSSS---------- 592
Query: 690 ILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQ 749
++++ S++ VQ+S N G D VV L+ S+ P
Sbjct: 593 ---------------KISAGNSVKAKVQVS--NTGKRDAEEVVQLYIAKDNRSEDEPASS 635
Query: 750 LIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
L GF R+ +A S + + P N G +L G++ ++ +
Sbjct: 636 LRGFRRLKILAGKSASVEIEL-PASAFETINAEGASVLIPGSYTVIAAD 683
>gi|320161274|ref|YP_004174498.1| beta-D-xylosidase [Anaerolinea thermophila UNI-1]
gi|319995127|dbj|BAJ63898.1| beta-D-xylosidase [Anaerolinea thermophila UNI-1]
Length = 712
Score = 369 bits (947), Expect = 4e-99, Method: Compositional matrix adjust.
Identities = 246/766 (32%), Positives = 379/766 (49%), Gaps = 108/766 (14%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ N + R LIS +TL+EKI Q+ ++ +AIPRLGIPAY++WSE+LHG+A NG
Sbjct: 8 YLNPDAPLEERVNDLISRMTLEEKISQMCNSCAAIPRLGIPAYDYWSEALHGVARNG--- 64
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN-----LGQA----GLTF 164
T FPQ + AA+++ L + A+A EARA ++ G+ GLT
Sbjct: 65 -------KATVFPQAIGMAATWDTELIERVADAIASEARAKFHETLRKFGKTDIYQGLTM 117
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
W+PNINIFRDPRWGRGQET GEDP + FV+ QG ++ L
Sbjct: 118 WSPNINIFRDPRWGRGQETWGEDPYLTGEMGAAFVRGLQG--------------KDPHYL 163
Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIE 284
K +AC KH Y + R++FNA++T ++ DT+ P F+ +
Sbjct: 164 K-------------TAACAKH---YTVHSGPEKERHTFNAIVTRRELFDTYLPAFKKLVT 207
Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTH 343
+ K +M +YN+ G P C L ++ RN+WGFKG++ SDC A+ + TK
Sbjct: 208 EAKVEAVMGAYNRTLGEPCCGSPYLLKEILRNQWGFKGHVVSDCGAINDFHLHHQVTKDG 267
Query: 344 EDSAAGVLKAGMDINC-GTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDP 402
+SAA +K G D+ C T + A+++G + E+DID AL N + +LGLF DP
Sbjct: 268 AESAALGIKNGCDMACICTYSYENLTEALNRGLITEEDIDHALRNTLRTRFKLGLF--DP 325
Query: 403 R-KGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
+ K Y + V H+KLA + A + VLLKN LP+ K V S+ I+GP N
Sbjct: 326 QEKVPYAHISMSVVGCEAHRKLAYETAVKSAVLLKNHNHILPV-KPDVKSILIVGPNAGN 384
Query: 462 ISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVI 521
+ + G Y G+ S + + GL + + +A A F +
Sbjct: 385 VHVLLGNYYGLSDSMTTFMEGLVGRLPEGVRMEFMPGSLLTDSKKIKNDWSVASAASFDL 444
Query: 522 VVAGLDLTQETE----------DRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPL 571
V+A + L+ E + DR + LP Q + +A T + ++LVLTGG +
Sbjct: 445 VIAFMGLSPLLEGEEGEAILSDNGDREDIALPKAQQEYIRDLAATGAK-IVLVLTGGSAI 503
Query: 572 DVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMN 631
++ E + +ILW+GYPG+ G +A+A++IFGD +P G+LP+T +P S ++P
Sbjct: 504 ALNGIE--DLVEAILWVGYPGQEGGRAIADLIFGDHSPSGKLPIT-FPVSTDQLP----P 556
Query: 632 MRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNIL 691
R S ++ R+YR+ T + ++ FG GLSYT + YK L + L AG
Sbjct: 557 FREYSMKE---RTYRYMTSSPLFPFGFGLSYTQFEYKNLQLEHPV-----LSAGE----- 603
Query: 692 QQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLI 751
+LR +++ N G+ +G VV ++ + S P ++LI
Sbjct: 604 -------------------ALRGTFELA--NVGEYEGEEVVQVYLSDLEASTIVPLQKLI 642
Query: 752 GFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
F RV + ++SF + P E + + + G ++L G L +G
Sbjct: 643 SFQRVRLKPGETVQLSFAIQP-EAMMMIDDEGNQVLEPGKFKLTIG 687
>gi|410648100|ref|ZP_11358515.1| beta-glucosidase [Glaciecola agarilytica NO2]
gi|410132388|dbj|GAC06914.1| beta-glucosidase [Glaciecola agarilytica NO2]
Length = 733
Score = 369 bits (946), Expect = 5e-99, Method: Compositional matrix adjust.
Identities = 242/776 (31%), Positives = 379/776 (48%), Gaps = 103/776 (13%)
Query: 44 CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESL 103
P + P+ +T L R LI +TL+EK QL + AI RLG+P Y++W+E+L
Sbjct: 16 ATPALANDQPWFDTQLPTQKRIDLLIDAMTLKEKTSQLVNGNVAIERLGLPEYDFWNEAL 75
Query: 104 HGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ---- 159
HG+A NG T FPQ + AA+F++ L S ++ EARA +N+
Sbjct: 76 HGVARNG----------RATVFPQAIGMAATFDQHLLLKAASVISDEARAKFNVSSEIGN 125
Query: 160 ----AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
+GLTFW PNINIFRDPRWGRGQET GEDP + + V QG++ K
Sbjct: 126 RSKYSGLTFWTPNINIFRDPRWGRGQETYGEDPYLTAQMGKAMVNGLQGDHPKY------ 179
Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTF 275
L +A KH + + R+ F+A+ + +D +T+
Sbjct: 180 ---------------------LKTAAAAKH---FAVHSGPEALRHEFDAIASPKDMYETY 215
Query: 276 QPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIF 334
P F + + + +M +YN+VNG PA L R++WGF G++ SDC +A
Sbjct: 216 FPAFEALVTEANVETVMAAYNRVNGHPAGGSDFLLNTVLRDKWGFSGHVVSDCWGLADFH 275
Query: 335 EYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
+Y T +SAA + G D+NCG + A++ G V EK ID+ L + + + +
Sbjct: 276 QYHKVTANAVESAALAINTGTDLNCG-AVYNALPDAVEAGLVDEKTIDKRLSKVLATKFK 334
Query: 395 LGLFNGDPRKGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
LG F DP+ Y + D V + H ++A + A + IVLL+N LPL++N + +L
Sbjct: 335 LGFF--DPKDDNPYNNISADVVNSEAHAQVAYEMAVKSIVLLQNKNNILPLDRN-IRNLY 391
Query: 454 IIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCHDVPCNSDAGFHEA 510
+ GP ++ + G Y G+ ++L G+ A VS +Y G N +
Sbjct: 392 VTGPFASSSEVLLGNYYGLSGKTTNILDGITANVSVGTTINYKQGILPYQANVNPIDWTT 451
Query: 511 VRIAKKADFVIVVAGLDLTQETEDR---------DRVSLLLPGQQMSLVTSVARTSKRPV 561
+ D +I V GL E E+ DR+SL LP Q++ + + + + +PV
Sbjct: 452 GEAKQMGDVIIAVMGLSGAYEGEEGEAIASPHKGDRLSLDLPEHQIAFLRKLRKDNDKPV 511
Query: 562 ILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPES 621
I+VLT G P V+ E +I++ YPG+ G KA+A+I+FG+ +P GRLP+T +P+S
Sbjct: 512 IVVLTAGTP--VNLTEIAELADAIVFAWYPGQEGGKAVADILFGERSPSGRLPIT-FPKS 568
Query: 622 FTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISAS 681
++P D D S Q GR+YR+ T +Y FG GLSY + ++ +
Sbjct: 569 EAQLPPYD-----DYSMQ--GRTYRYMTQEPMYPFGFGLSYAQVKFDNITL-------GN 614
Query: 682 LKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKV 741
+A + KN LQ+ + V ++VTN G+ + VV L+ + P
Sbjct: 615 TQALASKNELQENMT-------------------VTVNVTNTGEREFEEVVQLYLKTPDA 655
Query: 742 SQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
P L GF R+ A ++++ F + P + L N+ G+ +L G + ++VG
Sbjct: 656 GVSQPLHSLKGFTRIKLAAGQTEQVLFNI-PKKHLYSINEQGKPVLLKGQYSVIVG 710
>gi|410723195|ref|ZP_11362440.1| beta-glucosidase-like glycosyl hydrolase [Clostridium sp.
Maddingley MBC34-26]
gi|410603399|gb|EKQ57833.1| beta-glucosidase-like glycosyl hydrolase [Clostridium sp.
Maddingley MBC34-26]
Length = 709
Score = 368 bits (944), Expect = 9e-99, Method: Compositional matrix adjust.
Identities = 240/760 (31%), Positives = 365/760 (48%), Gaps = 117/760 (15%)
Query: 65 AKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTS 124
AK L+S +TLQE+ +QL+ + AI L +P Y WW+E LHG+A G + T
Sbjct: 16 AKELVSKMTLQERAEQLTYQSPAIKHLNVPEYNWWNEGLHGVARAG----------TATV 65
Query: 125 FPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFWAPNINIFRDPR 176
FPQ + AA F+ I ++ E RA YN GLT+W+PN+NIFRDPR
Sbjct: 66 FPQAIGLAAIFDEEFLGEIADIISTEGRAKYNEYSKKDDRGIYKGLTYWSPNVNIFRDPR 125
Query: 177 WGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDE 236
WGRG ET GEDP + S V F+K QGE G
Sbjct: 126 WGRGHETYGEDPYLTSRLGVAFIKGLQGE----------------------------GKY 157
Query: 237 LMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYN 296
L L+AC KH + + R+ FNA++ ++D +T+ P F +C+++ +M +YN
Sbjct: 158 LKLAACAKHFAVHSGPEG---LRHEFNAVVEKKDLYETYLPAFEACVKEANVESVMGAYN 214
Query: 297 QVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGM 355
+ NG P C L + R +WGFKG++ SDC A+A + T T +S A ++ G
Sbjct: 215 RTNGEPCCGSKTLLKDILRGKWGFKGHVVSDCWALADFHLHHMITSTATESVALAIENGC 274
Query: 356 DINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDV 415
D+NCG M + A +G V E+ I A L + + +LG+F+ D +Y ++ +
Sbjct: 275 DLNCGN-MYLNLLLAYKEGLVTEEQITTAAERLMTTRFKLGMFDEDC---EYNRIPYEVN 330
Query: 416 CTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCS 475
EH ++AL A+R+ +VLLKND LPL+K+++ S+A+IGP N+ + G Y+G
Sbjct: 331 DCKEHNEIALIASRKSMVLLKNDGT-LPLDKSSLKSIAVIGPNANSEIMLKGNYSGTASK 389
Query: 476 PKSLLRGLEAYVS---KTHYASGCH------DVPCNSDAGFHEAVRIAKKADFVIVVAGL 526
++L G+ V + +Y+ GCH + D EA+ +A+++D VI+ GL
Sbjct: 390 YTTILEGIHNAVGDNIRVYYSEGCHLFKDKVEDLAGPDDRLSEAISVAERSDVVILCLGL 449
Query: 527 DLTQETEDRD---------RVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAE 577
D T E E D + SL LPG+Q +L+ V K PVI+VL G L +F
Sbjct: 450 DSTIEGEQGDAGNSYGAGDKESLNLPGRQQNLLEKVLEVGK-PVIVVLGAGSAL--TFNG 506
Query: 578 ADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSS 637
A+ + ++IL YPG G A+A+I+FG +P G+LP+T+Y ++ D +M+
Sbjct: 507 AEEKCAAILNAWYPGSHGGTAVADILFGKCSPSGKLPVTFYKDTANLPEFTDYSMK---- 562
Query: 638 RQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSR 697
GR+YR+ +Y FG+GL+Y+ L P + A ++
Sbjct: 563 ----GRTYRYLEHESLYPFGYGLTYSKVELSNLQVP---FVKADFES------------- 602
Query: 698 LDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVH 757
F + I + N G+ VV + + K L GF RV
Sbjct: 603 ----------------FDISIDIRNTGNYGIEEVVQCYVKDLKSKYAVLNHSLAGFKRVS 646
Query: 758 TVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
SK ++ + N G R+L + L VG
Sbjct: 647 LKKGESKTVTIELSK-RSFEAVNNDGERLLDSKSFKLFVG 685
>gi|109897152|ref|YP_660407.1| beta-glucosidase [Pseudoalteromonas atlantica T6c]
gi|109699433|gb|ABG39353.1| Beta-glucosidase [Pseudoalteromonas atlantica T6c]
Length = 733
Score = 368 bits (944), Expect = 9e-99, Method: Compositional matrix adjust.
Identities = 243/776 (31%), Positives = 380/776 (48%), Gaps = 103/776 (13%)
Query: 44 CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESL 103
P + +P+ +T L + R +SLI +TL+EK QL + AI RLG+P Y++W+E+L
Sbjct: 16 ASPTLANDHPWFDTQLPTNERIESLIDAMTLKEKASQLVNGNVAIERLGLPEYDFWNEAL 75
Query: 104 HGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ---- 159
HG+A NG T FPQ + AA+F++ L + ++ EARA +N+
Sbjct: 76 HGVARNG----------RATVFPQAIGMAATFDQDLLLQAATVISDEARAKFNVSSEIGN 125
Query: 160 ----AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
+GLTFW PNINIFRDPRWGRGQET GEDP + + V QG++ K
Sbjct: 126 RSKYSGLTFWTPNINIFRDPRWGRGQETYGEDPYLTAQMGKAMVNGLQGDHPKY------ 179
Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTF 275
L +A KH + + R+ F+A+ +E+D +T+
Sbjct: 180 ---------------------LKTAAAAKH---FAVHSGPEALRHEFDAIASEKDMYETY 215
Query: 276 QPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIF 334
P F + + + +M +YN+VNG PA L R++WGF G+I SDC +A
Sbjct: 216 FPAFEALVTEADVETVMAAYNRVNGHPAGGSDFLLNTVLRDKWGFSGHIVSDCWGLADFH 275
Query: 335 EYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
EY T +SAA + G D+NCG+ + A++ G V EK ID L + + + +
Sbjct: 276 EYHKVTANAVESAALAINTGTDLNCGS-VYTALPDAVEAGLVDEKTIDTRLHKVLATKFK 334
Query: 395 LGLFNGDPRKGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
LG F DP+ Y + D V + H +A + A + IVLL+N+ + LPL+KN + ++
Sbjct: 335 LGFF--DPKDDNPYNSISADVVNSDAHADVAYEMAVKSIVLLQNENQVLPLDKN-IRNVY 391
Query: 454 IIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCHDVPCNSDAGFHEA 510
+ GP ++ + G Y G+ ++L G+ A VS +Y G N +
Sbjct: 392 VTGPFASSSEVLLGNYYGLSGKTTNILDGITANVSVGTTINYKQGILPYQANVNPIDWTT 451
Query: 511 VRIAKKADFVIVVAGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPV 561
+ D +I V GL E E+ DR+SL LP Q+ + + + + +PV
Sbjct: 452 GEAKQMGDVIIAVMGLSGAYEGEEGEAIASPHKGDRLSLDLPEHQIEFLRKLRKDNDKPV 511
Query: 562 ILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPES 621
I+VLT G P++V+ E +I++ YPG+ G KA+A+I+FG+ +P GRLP+T +P+S
Sbjct: 512 IVVLTAGTPVNVT--EIAQLADAIVFAWYPGQEGGKAVADILFGERSPSGRLPIT-FPKS 568
Query: 622 FTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISAS 681
++P D D S Q GR+YR+ T +Y FG GLSY + ++ + +S+
Sbjct: 569 EAQLPPYD-----DYSMQ--GRTYRYMTEEPMYPFGFGLSYATVKFDNITLGNAEALSS- 620
Query: 682 LKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKV 741
T V ++VTN G + VV L+ + P
Sbjct: 621 -------------------------TDGQKGTLDVSVNVTNTGTRELEEVVQLYLKTPNA 655
Query: 742 SQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
P + L GF R+ + ++SF V + SI N G+ +L G++ ++VG
Sbjct: 656 GIDQPIQSLKGFQRIKLAPGQTGQVSFTVSKKQLYSI-NAKGKPVLLEGDYHVIVG 710
>gi|386347261|ref|YP_006045510.1| glycoside hydrolase family protein [Spirochaeta thermophila DSM
6578]
gi|339412228|gb|AEJ61793.1| glycoside hydrolase family 3 domain protein [Spirochaeta
thermophila DSM 6578]
Length = 693
Score = 367 bits (942), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 248/756 (32%), Positives = 362/756 (47%), Gaps = 113/756 (14%)
Query: 64 RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
R SL+S ++++EK + A IPRLGIP Y WW+E+LHG+A++G T
Sbjct: 6 RMTSLLSKMSIEEKAGLMLHRAKGIPRLGIPHYNWWNEALHGVANSG----------EAT 55
Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN-LG-------QAGLTFWAPNINIFRDP 175
FPQ + AA+F+ L + A++ EARA +N +G + GLTFW+PNINI+RDP
Sbjct: 56 VFPQAIGLAATFDPDLVRRVAEAISTEARAKFNAIGKERAAEYERGLTFWSPNINIYRDP 115
Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
RWGRGQET GEDP + S V FVK QG D
Sbjct: 116 RWGRGQETYGEDPFLTSKIGVSFVKGLQG---------------------------DHPY 148
Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
+ ++AC KH Y + R+ F+A ++E+D +T+ P F + ++ G +M +Y
Sbjct: 149 YMRVAACAKH---YAVHSGPEGLRHVFDARVSEKDLWETYLPAFEALVKAG-VEAVMGAY 204
Query: 296 NQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
N+VNG PAC L + R WGFKG++ SDC A+A + TK +S A L+AG
Sbjct: 205 NRVNGEPACGSKRLLDEILRKRWGFKGHVVSDCWAIADFHLHHKVTKDPIESIAMALEAG 264
Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDD 414
D+NCG H A+ G V E+ +DR++ L S RLGLF D Y +L D
Sbjct: 265 CDLNCGNTY-EHLLDAVKAGVVSEELVDRSVARLLSTLDRLGLFTDD---HPYARLSLSD 320
Query: 415 VCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPC 474
+ H+ LA +AA + +VLLKN+ LP ++ + + + GP N + G Y G+
Sbjct: 321 IDWEAHRALAREAAEKSVVLLKNNG-ILPFDRQKLRYIYVTGPNAANPVALLGNYAGVSS 379
Query: 475 SPKSLLRGLEAYVS---KTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQE 531
++L G+ Y Y GC + N A +A+ AD + V G D T E
Sbjct: 380 RLVTVLEGITGYAGPGITVTYKIGC-PLQGNKINPIDWASGVARYADVTVAVMGRDSTVE 438
Query: 532 TEDRDRV---------SLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQI 582
E+ D + L LP +Q+ + + K P+++VL G P V E +
Sbjct: 439 GEEGDAIFSDNYGDLSDLDLPREQIEYLRRIKEIGK-PLVVVLLSGAP--VCSPELEELA 495
Query: 583 SSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYP 641
+I++ YPGE G A+A ++FG+ +P GRLP+T +P ++P D +M
Sbjct: 496 DAIVYAWYPGEEGGNAIARVLFGEISPSGRLPIT-FPRGVDQLPPFTDYSME-------- 546
Query: 642 GRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYV 701
GR+YR+ +Y FG GLSY +SY+ L Q + SR D
Sbjct: 547 GRTYRYMREEPLYPFGFGLSYATFSYRGL---------------------QSSASRWDKR 585
Query: 702 HIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAK 761
E+ V N + VV L+ R P L GF RV A
Sbjct: 586 ETLELVC----------EVENTSSIPADEVVQLYVRWEDAPFRVPLWSLKGFTRVSLGAG 635
Query: 762 GSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
K++ F + P E+LS ++ GR++LP G VG
Sbjct: 636 ERKQVRFVLSP-EELSFIDEEGRKVLPEGRLHFHVG 670
>gi|160881137|ref|YP_001560105.1| glycoside hydrolase family 3 [Clostridium phytofermentans ISDg]
gi|160429803|gb|ABX43366.1| glycoside hydrolase family 3 domain protein [Clostridium
phytofermentans ISDg]
Length = 717
Score = 367 bits (942), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 248/773 (32%), Positives = 369/773 (47%), Gaps = 122/773 (15%)
Query: 57 TSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFN 116
T RA L+ +TL+EK+ Q +A +IPRL I AY +W+E+LHG+A G
Sbjct: 6 TQQEFQQRATELVKKMTLEEKVFQTLHSAPSIPRLDIKAYNYWNEALHGVARAG------ 59
Query: 117 GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFWAPN 168
T FPQ + AA+F+ L I ++ E R +N Q GLTFW+PN
Sbjct: 60 ----VATVFPQAIGLAATFDEDLIEEIADTISTEGRGKFNAQQKYGDHDIYKGLTFWSPN 115
Query: 169 INIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFG 228
+NIFRDPRWGRG ET GEDP + FV QG ++ LK
Sbjct: 116 VNIFRDPRWGRGHETFGEDPFLSGTLGGRFVDGIQG--------------HDETYLKA-- 159
Query: 229 EESDRGDELMLSACCKHLIAY----DLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIE 284
+AC KH + D+ R+SFNA ++EQD +T+ P F+ ++
Sbjct: 160 -----------AACAKHFAVHSGPEDI-------RHSFNAEVSEQDLRETYLPAFKKLVK 201
Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTH 343
+ K +M +YN+ NG P C L + R EW F G++TSDC A+ E+ T
Sbjct: 202 EHKVEAVMGAYNRTNGEPCCGSKTLLEDILRGEWEFVGHVTSDCWAIKDFHEHHMVTSNA 261
Query: 344 EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPR 403
+S A + G D+NCG + Q A+ G V+E+ ID AL+ LF+ +++LGLF+ +
Sbjct: 262 VESVALAMNRGCDLNCGNLYVNLLQ-AVRDGLVEEETIDTALIRLFTTRMKLGLFDKE-E 319
Query: 404 KGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNIS 463
+ + D V T K+L + A+++ +VLLKN+ LPLN ++S+ +IGP NN +
Sbjct: 320 SIPFNTITYDQVDTKSSKELNIKASKKCVVLLKNEDNILPLNPKKITSVGVIGPNANNRN 379
Query: 464 QMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCHDVPCNSDAGFHEAVRIA------ 514
+ G Y G ++L G++ V + +++ GCH E RIA
Sbjct: 380 ALVGNYEGTASEYITVLEGIKQVVPEDVRVYFSEGCHLFKNKLSNLSQENDRIAEVRAVC 439
Query: 515 KKADFVIVVAGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVL 565
+ +D VI GLD E E+ D+ +L LPG Q ++ ++ K PVIL+L
Sbjct: 440 EHSDVVIACLGLDPGLEGEEGDQGNQFASGDKKTLALPGIQEDVLKTIYECGK-PVILIL 498
Query: 566 TGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV 625
G L V + AD I +IL YPG G +A+AE+IFGD NP G+LP+T+Y +
Sbjct: 499 LSGSALAVPW--ADEHIPAILQGWYPGAQGGRAIAELIFGDGNPEGKLPVTFYRTTEELP 556
Query: 626 PMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAG 685
D M+ R+YR+ +Y FG+GLSYT + + L
Sbjct: 557 EFTDYAMK--------NRTYRYMKNEALYPFGYGLSYTTFEHTLL--------------- 593
Query: 686 SDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT 745
YV+ D + +++ V+ V N GD +GS V A V V +
Sbjct: 594 --------------YVNTDTLGKGSNVECMVR--VKNTGDYEGS--VTTQAYVKYVGEDA 635
Query: 746 PEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
P QL G +V + K+I +D + N+ G IL G + L + +
Sbjct: 636 PNCQLKGLKKVSLLPGEEKDIMIELDD-RAFGLYNEEGEFILNQGEYELYLSD 687
>gi|189201569|ref|XP_001937121.1| beta-xylosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
gi|187984220|gb|EDU49708.1| beta-xylosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
Length = 756
Score = 367 bits (941), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 240/764 (31%), Positives = 373/764 (48%), Gaps = 61/764 (7%)
Query: 40 PDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWW 99
PD C S C+ + S + RA +L++ + QEK+ L + + RLG+PAY WW
Sbjct: 22 PD--CVNGPLKSNAICDVTASPAKRAAALVAAMQTQEKLDNLVSKSKGVARLGLPAYNWW 79
Query: 100 SESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ 159
E+LHG+A PG+NF G + TSFP L+ +A+F+ L I + EARA N G
Sbjct: 80 GEALHGVAG-APGINFTGPYRTATSFPMPLLMSAAFDDDLIHQIAIVIGNEARAFGNGGI 138
Query: 160 AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFR 219
A + FW P+IN FRDPRWGRG ETPGED + + Y + +G+
Sbjct: 139 APVDFWTPDINPFRDPRWGRGSETPGEDILRIKGYTKSLLSGLEGDK------------A 186
Query: 220 EKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPF 279
+++++ A CKH + YD+E W R+SF+A IT QD + F PPF
Sbjct: 187 QRKII----------------ATCKHYVGYDMEDWNGTDRHSFDAKITTQDLAEYFMPPF 230
Query: 280 RSCIEQGKASCIMCSYNQVNGVPAC----LRGDLFQKARNEWGFKGYITSDCDAVATIFE 335
+ C K MCSYN VNGVP C + D+ +K N YITSDC+AV I
Sbjct: 231 QQCARDSKVGSFMCSYNAVNGVPTCADTYVLEDILRKHWNWTDSNNYITSDCEAVKDISL 290
Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
Y T +++ A GMD++C A +G + IDRAL + +
Sbjct: 291 RHKYVATLQEATAIAFNNGMDLSCEYSGSSDIPGAFSQGLLNVSVIDRALTRQYEGLVHA 350
Query: 396 GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAII 455
G F+G Y LG D+ T E +KL L A +G+ LLKND LPL+ + S +A++
Sbjct: 351 GYFDG--AAATYANLGVQDINTPEAQKLVLQVAAEGLTLLKNDDT-LPLSLKSGSKVAMV 407
Query: 456 GPLVNNISQMGGGYTG-IPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIA 514
G N+ S++ G Y+G P + G + + +D +A+ A
Sbjct: 408 GFWANDSSKLSGIYSGPAPYLHNPVYAGNKLGLDMAVATGPILQKSGAADNWTTKALDAA 467
Query: 515 KKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
KK+D ++ GLD + E DR + P Q+ L+T +A K V++ L G +D
Sbjct: 468 KKSDTILYFGGLDPSAAAEGSDRTDISWPSAQIDLITKLAALGKPLVVIAL--GDMVDHM 525
Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA 634
++S++W +PG+ G A+ ++I G+ GRLP+T YP +T++ M DMN+R
Sbjct: 526 PILNMKGVNSLIWANWPGQDGGTAVMQVITGEHAIAGRLPITQYPAKYTQLSMLDMNLRP 585
Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
+ PGR+YR+Y V FG GL YT ++ KF S S LT++ ++I++
Sbjct: 586 GGNN--PGRTYRWYN-ESVQPFGFGLHYTKFAAKFGSN-SSLTVNI-------QDIMKSC 634
Query: 695 GSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFD 754
D+ + +V ++++VTN G+ + + F + + P K L+ +
Sbjct: 635 TK--DHPDLCDVPP-------IEVAVTNKGNRTSDFIALAFIKGEVGPKPYPLKTLVSYA 685
Query: 755 RVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
R+ ++ + + LS ++ G + G + L++ E
Sbjct: 686 RLRDISGSQTKTASLALTLGTLSRVDQSGNLVAYPGEYTLLLDE 729
>gi|219887077|gb|ACL53913.1| unknown [Zea mays]
gi|224035251|gb|ACN36701.1| unknown [Zea mays]
gi|413919685|gb|AFW59617.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 405
Score = 367 bits (941), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 194/414 (46%), Positives = 264/414 (63%), Gaps = 12/414 (2%)
Query: 393 LRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSL 452
+RLG F+GDPR+ +G LGP DVCT +++LA +AARQGIVLLKN K LPL+ ++ S+
Sbjct: 1 MRLGFFDGDPRELPFGNLGPSDVCTPSNQELAREAARQGIVLLKNTGK-LPLSATSIKSM 59
Query: 453 AIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDA-GFHEAV 511
A+IGP N M G Y G PC + L+GL A V+ T Y GC +V C+ ++ A
Sbjct: 60 AVIGPNANASFTMIGNYEGTPCKYTTPLQGLGANVA-TVYQPGCTNVGCSGNSLQLDAAT 118
Query: 512 RIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPL 571
+ A AD ++V G D + E E DR SLLLPGQQ LV++VA S P ILV+ GGP
Sbjct: 119 KAAASADVTVLVVGADQSIERESLDRTSLLLPGQQPQLVSAVANASSGPCILVVMSGGPF 178
Query: 572 DVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMN 631
D+SFA++ +I++ILW+GYPGEAG A+A+++FG NP GRLP+TWYPESFTKVPM DM
Sbjct: 179 DISFAKSSDKIAAILWVGYPGEAGGAAIADVLFGYHNPSGRLPVTWYPESFTKVPMTDMR 238
Query: 632 MRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNIL 691
MR D S YPGR+YRFYTG VY FG GLSYT++++ +SAP +L +L+ L
Sbjct: 239 MRPDPSTGYPGRTYRFYTGDTVYAFGDGLSYTSFAHHLVSAPKQL----ALQLAEGHACL 294
Query: 692 QQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLI 751
+ ++ E C L F V + V NAG+ G H V LF+ P V P K L+
Sbjct: 295 TEQCPSVEA----EGAHCEGLAFDVHLRVRNAGERSGGHTVFLFSSPPAV-HNAPAKHLL 349
Query: 752 GFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
GF++V + ++F VD C+ LS+ ++ G R + LG+H L VG+L+H+L +
Sbjct: 350 GFEKVSLEPGQAGVVAFKVDVCKDLSVVDELGNRKVALGSHTLHVGDLKHTLNL 403
>gi|332307852|ref|YP_004435703.1| glycoside hydrolase family protein [Glaciecola sp. 4H-3-7+YE-5]
gi|332175181|gb|AEE24435.1| glycoside hydrolase family 3 domain protein [Glaciecola sp.
4H-3-7+YE-5]
Length = 733
Score = 366 bits (939), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 242/776 (31%), Positives = 377/776 (48%), Gaps = 103/776 (13%)
Query: 44 CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESL 103
P + P+ +T L R LI +TL+EK QL + AI RLG+P Y++W+E+L
Sbjct: 16 ATPALANDQPWFDTQLPTQERIDLLIDAMTLKEKTSQLVNGNVAIERLGLPEYDFWNEAL 75
Query: 104 HGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ---- 159
HG+A NG T FPQ + AA+F++ L S ++ EARA +N+
Sbjct: 76 HGVARNG----------RATVFPQAIGMAATFDQHLLLKAASVISDEARAKFNVSSEIGN 125
Query: 160 ----AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
+GLTFW PNINIFRDPRWGRGQET GEDP + + V QG++ K
Sbjct: 126 RSKYSGLTFWTPNINIFRDPRWGRGQETYGEDPYLTAQMGKAMVNGLQGDHPKY------ 179
Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTF 275
L +A KH + + R+ F+A+ + +D +T+
Sbjct: 180 ---------------------LKTAAAAKH---FAVHSGPEALRHEFDAIASPKDMYETY 215
Query: 276 QPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIF 334
P F + I + +M +YN+VNG PA L R++WGF G++ SDC +A
Sbjct: 216 FPAFEALITEANVETVMAAYNRVNGHPAGGSDFLLNTVLRDKWGFSGHVVSDCWGLADFH 275
Query: 335 EYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
+Y T +SAA + G D+NCG + A++ G V EK ID+ L + + + +
Sbjct: 276 QYHKVTANAVESAALAINTGTDLNCG-AVYNALPDAVEAGLVDEKTIDKRLSKVLATKFK 334
Query: 395 LGLFNGDPRKGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
LG F DP+ Y + D V + H ++A + A + IVLL+N LPL++N + +L
Sbjct: 335 LGFF--DPKDDNPYNNISADVVNSEAHAQVAYEMAVKSIVLLQNKNNILPLDRN-IRNLY 391
Query: 454 IIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCHDVPCNSDAGFHEA 510
+ GP ++ + G Y G+ ++L G+ A VS +Y G N +
Sbjct: 392 VTGPFASSSEVLLGNYYGLSGKTTNILDGITANVSVGTTINYKQGILPYQANVNPIDWTT 451
Query: 511 VRIAKKADFVIVVAGLDLTQETEDR---------DRVSLLLPGQQMSLVTSVARTSKRPV 561
+ D +I V GL E E+ DR+SL LP Q++ + + + + +PV
Sbjct: 452 GEAKQMGDVIIAVMGLSGAYEGEEGEAIASPHKGDRLSLDLPEHQIAFLRKLRKDNDKPV 511
Query: 562 ILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPES 621
I+VLT G P V+ E +I++ YPG+ G KA+A+I+FG+ +P GRLP+T +P+S
Sbjct: 512 IVVLTAGTP--VNLTEIAELADAIVFAWYPGQEGGKAVADILFGERSPSGRLPIT-FPKS 568
Query: 622 FTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISAS 681
++P D D S Q GR+YR+ T +Y FG GLSY + ++ +
Sbjct: 569 EAQLPPYD-----DYSMQ--GRTYRYMTQEPMYPFGFGLSYAQVKFDNITL-------GN 614
Query: 682 LKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKV 741
+A + KN Q+ V ++VTN G+ + VV L+ + P
Sbjct: 615 TQALASKNEPQE-------------------NMTVTVNVTNTGEREFEEVVQLYLKTPDA 655
Query: 742 SQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
P L GF R+ A ++++ F + P + L N+ G+ +L G + ++VG
Sbjct: 656 GVSQPLHSLKGFTRIKLAAGQTEQVLFSI-PKKHLYSINEQGKPVLLKGQYSVIVG 710
>gi|345519864|ref|ZP_08799275.1| beta-glucosidase [Bacteroides sp. 4_3_47FAA]
gi|254836262|gb|EET16571.1| beta-glucosidase [Bacteroides sp. 4_3_47FAA]
Length = 736
Score = 365 bits (937), Expect = 5e-98, Method: Compositional matrix adjust.
Identities = 243/777 (31%), Positives = 368/777 (47%), Gaps = 115/777 (14%)
Query: 50 DSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASN 109
++ PF N L + R K L++ LTL+EK+ + ++ AIPRLGIPAY+WW+E+LHG+A
Sbjct: 22 ENLPFRNADLPLEVRVKDLVARLTLEEKVLLMQHHSPAIPRLGIPAYDWWNEALHGVAR- 80
Query: 110 GPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN----LGQAG---- 161
T+ VT FPQ + AA+F+ +G + E RA++N G+ G
Sbjct: 81 --------TLEKVTVFPQAIGMAATFDTEALQKMGDITSTEGRALFNEDWKAGKTGTRYR 132
Query: 162 -LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFRE 220
LT+W PNINIFRDPRWGRGQET GEDP + + V+ +GE +
Sbjct: 133 GLTYWTPNINIFRDPRWGRGQETYGEDPYLTAKMGAAIVRGLEGE--------------D 178
Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFR 280
LK AC KH Y + ++R+SF+A + D DT+ P FR
Sbjct: 179 PHYLKSV-------------ACAKH---YAVHSGPEYNRHSFDARPSVFDLWDTYMPAFR 222
Query: 281 SCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNY 339
+ + K +MC+YN++NG P C L RN+W F GY+TSDC A+ E+
Sbjct: 223 ELVTKAKVHGVMCAYNRLNGQPCCGNDPLLVDILRNQWHFDGYVTSDCWALKDFAEFHKT 282
Query: 340 TKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
H + + L AG D+ CG + + KG E+DI+ +L LF++ ++G+F
Sbjct: 283 HPEHTIAMSDALLAGTDLECGN-LYHLLAEGVKKGLHSERDINVSLSRLFTILFKIGMF- 340
Query: 400 GDP-RKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
DP + Y +G + + HK+ A A++ IVLL+N LPL+ + + S+A+IGP
Sbjct: 341 -DPAERVPYSSIGREVLECEAHKQHAERMAKESIVLLENKNHILPLDASKIKSIALIGPN 399
Query: 459 VNNISQMGGGYTGIP---CSPKSLLRGLEAYVSKTHYASGCHDVPCNSDA-GFHEAVRIA 514
+N Y G P +P L+ K +Y G V DA F + A
Sbjct: 400 ADNGQTQLANYFGTPSEIVTPYMSLKRRLGDKIKINYLPGVGIVDKLKDAPSFVQVAHKA 459
Query: 515 KKADFVIVVAGLDLTQE-------------TEDRDRVSLLLPGQQMSLVTSVARTSKRPV 561
++D ++ V+G+ E DR ++ LP Q+ L+ + +T RP+
Sbjct: 460 AQSDVIVFVSGISADYEGEAGDAGAAGYGGFASGDRTTMQLPLVQIELLKKLKKTG-RPL 518
Query: 562 ILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPES 621
I+V G + SF ++L Y G+A A+ +++FG NP GR+P+T Y
Sbjct: 519 IIVNMSGSVM--SFEWESQNADALLQAWYGGQAAGDAIVDVLFGHCNPAGRMPLTTYKSD 576
Query: 622 FTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISAS 681
P + +M GR+YR++ G Y FG+GLSYT ++Y
Sbjct: 577 NDLPPFENYSML--------GRTYRYFKGEPRYPFGYGLSYTTFAYS------------- 615
Query: 682 LKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKV 741
D +DE + + R V ++V+N GD DG VV L+ P+
Sbjct: 616 -----------------DVQCVDETHTGDTAR--VTVTVSNTGDCDGDEVVQLYVVHPQD 656
Query: 742 SQG-TPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
+ P L GF R+H S +SF + P E+L++ G + G L VG
Sbjct: 657 GRKQIPLCALKGFKRIHLKRGESTSVSFTLTP-EELALTETDGNLVEKNGQVTLFVG 712
>gi|402074909|gb|EJT70380.1| hypothetical protein GGTG_11406 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 793
Score = 364 bits (935), Expect = 9e-98, Method: Compositional matrix adjust.
Identities = 255/775 (32%), Positives = 389/775 (50%), Gaps = 84/775 (10%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
C+ SLS S RA +L++ L + EK+ L NA+ R+G+P Y WWSE+LHG+A PG
Sbjct: 44 CDRSLSPSERAAALVAALNVTEKMANLVSNANGSARIGLPKYNWWSEALHGVAY-APGTQ 102
Query: 115 FN---GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
F G +S TSFP L+ AASF+ SL IG + E+RA N +GL +W PN+N
Sbjct: 103 FRRGPGDFNSSTSFPMPLLLAASFDDSLIEKIGDVIGTESRAFGNGRWSGLDYWTPNVNP 162
Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
F+DPRWGRG ETPGED + + YA +K +G + + +E+RV+
Sbjct: 163 FKDPRWGRGSETPGEDILRIKRYAASMIKGLEGPHPE----------KERRVV------- 205
Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
+ CKH A D E W SR+ F+A I+ QD + + PF+ C + I
Sbjct: 206 ---------STCKHYAANDFEDWNGTSRHDFDARISAQDLAEYYLMPFQQCARDSRVGSI 256
Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGFKG---YITSDCDAVATIFEYQNYTKTHEDSA 347
MC+YN VNGVP+C L R WG+ G Y+TSDC+AV + Y +T+ +
Sbjct: 257 MCAYNAVNGVPSCANSYLLDTVLRKHWGWTGHNNYVTSDCEAVLDVSAGHKYARTNAEGT 316
Query: 348 AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKY 407
A +AG D +C + A +G ++E+ +DRALL L+ +R+G F+G+ +
Sbjct: 317 AMCFEAGTDTSCEYTPSSDIRGAYAQGLLREETMDRALLRLYEGLVRVGYFDGN--SSAF 374
Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF-LPLNKNAVS-----------SLAII 455
+ DV + L+L +A +GIV+LKND LPL S LA+I
Sbjct: 375 SDISWADVNAPAAQDLSLQSAVEGIVMLKNDGTLPLPLGAKCSSKSKKRSSSGGPKLAMI 434
Query: 456 GPLVNNISQMGGGYTGIPC---SPKSLLR--GLEAYVSKTHYASGCHDVPCNSDAGFHEA 510
G + ++ GGY+G +P R GL+ + G +D A
Sbjct: 435 GFWADAPEKLRGGYSGTAAYLRTPAYAARQMGLDVVTAGGPVLQGAAAA--AADNWTAPA 492
Query: 511 VRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGP 570
+ A+ AD+++ GLD T E++DR + PG Q++LV +A K P+++V G
Sbjct: 493 LAAAEGADYIVYFGGLDETAAGENKDRWDVEWPGAQLALVKRLAALGK-PLVVVQMGDQ- 550
Query: 571 LDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK-VPMND 629
LD + A++ + ++LW +PG+ G A+ ++ G +P GRLP+T YP ++T+ VPM +
Sbjct: 551 LDGTPLLANAGVGAVLWASWPGQDGGPAVMRLLSGAASPAGRLPVTQYPANYTRLVPMTE 610
Query: 630 MNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKF-----LSAPSELTISASLKA 684
M +R +S PGR+YR+Y+ T V FG GL YTN++ L+A S +T S+ L+A
Sbjct: 611 MALRPSASGSRPGRTYRWYS-TPVLPFGFGLHYTNFTPAVTVPPALAAASGVTTSSLLEA 669
Query: 685 GSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQG 744
D + E + LR ++V N G +V + F +
Sbjct: 670 CRDPH--------------PERCALPPLR----VAVANTGRRASDYVALAFVSGDYGPRP 711
Query: 745 TPEKQLIGFDRVHTV-AKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
P K L + R+ V A GS E ++ ++ G +L G + + + E
Sbjct: 712 RPIKTLAAYARLRGVRAGGSAEADLAWT-LGDIARHDEDGNTVLYPGTYKVQIDE 765
>gi|359409694|ref|ZP_09202159.1| Beta-glucosidase [Clostridium sp. DL-VIII]
gi|357168578|gb|EHI96752.1| Beta-glucosidase [Clostridium sp. DL-VIII]
Length = 723
Score = 364 bits (935), Expect = 9e-98, Method: Compositional matrix adjust.
Identities = 244/771 (31%), Positives = 371/771 (48%), Gaps = 120/771 (15%)
Query: 56 NTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF 115
NT + AK L++ +TLQEK +QL+ N+ A+ RL IP Y WW+E LHG+A G
Sbjct: 20 NTHEAAVKAAKELVAKMTLQEKAEQLTYNSPAVKRLNIPEYNWWNEGLHGVARAG----- 74
Query: 116 NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFWAP 167
+ T FPQ + AA F+ + +A E RA YN GLT+W+P
Sbjct: 75 -----TATVFPQAIGLAAMFDEEFLGKVAGIIATEGRAKYNENSKKEDRDIYKGLTYWSP 129
Query: 168 NINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGF 227
N+NIFRDPRWGRG ET GEDP + S V FVK QG+
Sbjct: 130 NVNIFRDPRWGRGHETYGEDPYLTSRLGVAFVKGLQGD---------------------- 167
Query: 228 GEESDRGDELMLSACCKHLIAYDLEKWGNFS-RYSFNAMITEQDTEDTFQPPFRSCIEQG 286
G L LSAC KH + G S R+ FNA+++++D +T+ P F +C+++
Sbjct: 168 ------GKYLKLSACAKHFAVHS----GPESLRHEFNAVVSQKDLHETYLPAFEACVKEA 217
Query: 287 KASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHED 345
+M +YN+ NG P C L + R +WGFKG++ SDC A+A + T T +
Sbjct: 218 NVESVMGAYNRTNGEPCCGSKALLKDILRGKWGFKGHVVSDCWALADFHMHHKVTSTATE 277
Query: 346 SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG 405
S A ++ G D+NCG M + A +G V E+ I A L + + +LG+F+ D
Sbjct: 278 SVALAIENGCDLNCGN-MYLNLLLAYKEGLVTEEQITTAAERLMTTRFKLGMFDEDC--- 333
Query: 406 KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQM 465
+Y ++ + EH +++L+A+R+ +VLLKN+ LPL+K+ + ++A+IGP N+ +
Sbjct: 334 EYNQIPYEVNDCKEHNQVSLEASRKSMVLLKNN-GILPLDKSKLKAVAVIGPNANSEIML 392
Query: 466 GGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCH------DVPCNSDAGFHEAVRIAKK 516
G Y+G ++L G+ + + +Y+ GCH + D EAV +A++
Sbjct: 393 KGNYSGTASKYTTILDGIHDVLDDDVRVYYSEGCHLYKEKVEDLARRDDRLAEAVSVAER 452
Query: 517 ADFVIVVAGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTG 567
AD VI+ GLD T E E D++ L LPG Q L+ V T K PV++VL
Sbjct: 453 ADVVILCLGLDSTIEGEQGDAGNGYGAGDKLDLNLPGIQQELLEKVLETGK-PVVVVLGT 511
Query: 568 GGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM 627
G L ++ AE + ++IL YPG G A A+I+FG +P G+LP+T+Y ++
Sbjct: 512 GSGLTLNGAE--ERCAAILNAWYPGSHGGTAAADILFGKCSPSGKLPVTFYKDTDKLPEF 569
Query: 628 NDMNMRADSSRQYPGRSYRFYTGTQ-VYGFGHGLSYTNYSYKFLSAPSELTISASLKAGS 686
D M+ GR+YR+ + +Y FG+GL+Y+ L P+
Sbjct: 570 TDYAMK--------GRTYRYMDESNCLYPFGYGLTYSTVELSNLQVPA------------ 609
Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
R ++ ID + + + N G D VV + + +
Sbjct: 610 ---------VRGEFDGID-----------ISVEIENTGSYDIEEVVQCYIKDLESKYAVL 649
Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
L GF RV SK ++ ++ + G RIL L VG
Sbjct: 650 NHSLAGFKRVSLKKGESKTVTMKLNR-RAFEAVDDAGERILDSKKFKLFVG 699
>gi|358380569|gb|EHK18247.1| glycoside hydrolase family 3 protein, partial [Trichoderma virens
Gv29-8]
Length = 722
Score = 363 bits (932), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 252/745 (33%), Positives = 373/745 (50%), Gaps = 81/745 (10%)
Query: 72 LTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVS-------SVTS 124
LTL EK L +NA + RLG+P YEW +E+LHG+A PG N T + S T
Sbjct: 12 LTLDEKAANLVNNAPGVKRLGLPPYEWRNEALHGLAGVSPGQGINSTFTQGNVAFNSSTQ 71
Query: 125 FPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETP 184
FP +V A+F+ L +I +AV+ EARA N +AGL +WAPNIN +RDPRWGRGQETP
Sbjct: 72 FPSPIVLGAAFDDHLVHDIATAVSTEARAFSNHLKAGLDYWAPNINPYRDPRWGRGQETP 131
Query: 185 GEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCK 244
GEDP V+ YA +V +G G G + +V+ + CK
Sbjct: 132 GEDPYHVAQYAYNYVVGLKG----------GVGPAKSKVV----------------STCK 165
Query: 245 HLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC 304
H YD+E R S+NA+I+ QD + + P FRSC K +MCSYN VNG P+C
Sbjct: 166 HFAGYDIEDSDGVVRGSYNAIISTQDLAEYYLPSFRSCFRDAKTGAVMCSYNAVNGHPSC 225
Query: 305 LRGDLFQKA-RNEWGFKG---YITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG 360
+ R+ WG+ ++T DC AV +F + ++ A + G D++CG
Sbjct: 226 ANSYMLDTVLRDHWGWGSSAHWVTGDCGAVDGVFNQHHVGQSAAQGVAFAINNGTDLDCG 285
Query: 361 TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-YGKLGPDDVCTSE 419
T + SA+ E +D+AL L+S + LG F DP +G+ Y LG DV T
Sbjct: 286 TAYASNIASAVQNNYTTEAQLDQALSRLYSSLIVLGYF--DPPEGQEYRTLGVSDVNTPS 343
Query: 420 HKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNIS-QMGGGYTGIPCSPKS 478
+KLA A +GI +L P+ + ++ +GP NN S M G Y G+ +P
Sbjct: 344 TQKLAYTALVEGINIL----PIRPMGQ----TVLFVGPWANNASVSMFGNYNGV--APYK 393
Query: 479 LL----RGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETED 534
+ AY Y+ G V N + F AV A++AD V+ + G+D E E
Sbjct: 394 TIPVPTANSSAYNWNVTYSQGLQYVLSNDTSQFAAAVSAAQEADVVVYIGGIDEQVEAEA 453
Query: 535 RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEA 594
DR S+ PG Q++L+ +A + +PV++V GGG +D S + + +LW+GYPG+
Sbjct: 454 HDRTSIDWPGAQLNLIKQLA--AVKPVVVVQVGGGQVDDSSLLQNKNVKGLLWMGYPGQE 511
Query: 595 GAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRFYTGTQV 653
L +I+ G P GRLP+T YP ++ T+VPM D ++R SS PGR+YR+Y G+ V
Sbjct: 512 FGSGLIDILSGASAPAGRLPVTQYPANYITQVPMTDQSLRPSSSN--PGRTYRWYNGS-V 568
Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
FG G+ YT ++ + + GS + T D+++ ++
Sbjct: 569 IPFGTGIHYTKFNISWKT------------GGSGRG----TYDTADFINAEDPKDLAEFD 612
Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVS-QGTPEKQLIGFDRVHTVAKG-SKEISFGVD 771
QI+V N G +V +LF + Q P K L+ + R H G + +I V+
Sbjct: 613 V-FQINVENVGSTTSDYVALLFVKSSDSGPQPYPLKTLVSYARAHGTQPGETTKIDLRVN 671
Query: 772 PCEQLSIANKHGRRILPLGNHVLMV 796
Q++ + G +L G + L +
Sbjct: 672 -VGQIARNDSSGNLVLYPGAYTLEI 695
>gi|325192664|emb|CCA27085.1| unnamed protein product [Albugo laibachii Nc14]
Length = 2278
Score = 362 bits (930), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 254/791 (32%), Positives = 386/791 (48%), Gaps = 106/791 (13%)
Query: 34 KPVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNAS---AIPR 90
K +L+ C+ P FD +PFCN+SLS+ R + L+ L L EK++ L+ AS +IPR
Sbjct: 17 KLILSSDAHACEYP-FDYFPFCNSSLSLDLRVEDLLQRLQLDEKVRMLTARASTHGSIPR 75
Query: 91 LGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVE 150
LG+P Y W + +HG+ S GT TSFP + A F+ + + + E
Sbjct: 76 LGVPEYNWGANCVHGVQSTC------GT-HCATSFPNPVNLGAIFDPNEIYKMAQVIGKE 128
Query: 151 ARAM--------YNLG-QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKS 201
RA+ Y G GL W+PNINI RDPRWGR ETP EDP V + Y V + K
Sbjct: 129 LRALRLEGARENYARGPHIGLDCWSPNININRDPRWGRAMETPSEDPYVNAKYGVAYTKG 188
Query: 202 FQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYS 261
Q +G ++ R L+ KH +AY E +G R
Sbjct: 189 LQ-------EG------QDSRFLQAV-------------VTLKHYLAYSYENYGGTDRTQ 222
Query: 262 FNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKG 321
F+A+++ D DT+ P F + + GKA IMCSYN +NG+P C L Q R++ F G
Sbjct: 223 FDAIVSAYDFADTYFPAFEASVVDGKAKGIMCSYNSLNGIPTCANKWLNQLLRDDLEFDG 282
Query: 322 YITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDI 381
YITSD A+ IF+ YTKT ++ +++G+DI G + + I
Sbjct: 283 YITSDTGAIQGIFDGHKYTKTLCEATKIAMESGVDICSGNAYWNCLKQLANSTNFS-ASI 341
Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF 441
D A+ ++ +LGLF+ G GP+DV T++ +L+LD AR+ IVLL+N
Sbjct: 342 DEAIRRTLKLRFQLGLFDA---IGDQPHFGPEDVRTAKSLQLSLDLARKSIVLLQNHGNT 398
Query: 442 LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPC-----SPKSLLRGLEAYVS-----KTH 491
LPL +A+IGP + G Y G C + + LEA S TH
Sbjct: 399 LPLRLGL--RIAVIGPHSMTRRGIMGNYYGQLCHGDYDEVRCIQSPLEAIQSVNGRNNTH 456
Query: 492 YASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVT 551
+ +GC + S A F +A++ + AD ++ G+D++ E E +DR ++ +P Q+ L+
Sbjct: 457 HVNGC-GINDTSTAEFDDALQAVRTADVAVLFLGIDISIERESKDRDNIDVPHIQLELLK 515
Query: 552 SVARTSKRPVILVLTGGGPLDVS--FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNP 609
++ R + +P ++VL GG L + ADS + + YPG GA+A+AEI+FG NP
Sbjct: 516 AI-RVAGKPTVVVLFNGGILGIEKLILYADSVLEAF----YPGFFGAQAIAEILFGSINP 570
Query: 610 GGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKF 669
G+LP+T Y +F +ND++M++ S YPGRSYR+YT VY FG GLSYT +S +
Sbjct: 571 SGKLPVTMYRSNF----INDVDMKSMSMTLYPGRSYRYYTEVPVYSFGWGLSYTTFSIQS 626
Query: 670 LSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGS 729
+ + ++ L A +I +TN G G
Sbjct: 627 IDSHDTRAMNHVLTAQPKM---------------------------YRILITNNGKYYGE 659
Query: 730 HVVMLFARVPKVSQGTP----EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRR 785
V+ F R + P ++QL + RV +E+ V E L++ +++G
Sbjct: 660 EVLFAFFRPLDIHATGPVESLQQQLFNYTRVRLDPGDMREVPLHVKD-ENLALHDRNGNL 718
Query: 786 ILPLGNHVLMV 796
+ G + L++
Sbjct: 719 CVFEGFYELII 729
>gi|410639677|ref|ZP_11350222.1| beta-glucosidase [Glaciecola chathamensis S18K6]
gi|410140558|dbj|GAC08409.1| beta-glucosidase [Glaciecola chathamensis S18K6]
Length = 733
Score = 362 bits (930), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 240/776 (30%), Positives = 375/776 (48%), Gaps = 103/776 (13%)
Query: 44 CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESL 103
P + P+ +T L R LI +TL+EK QL + AI RLG+P Y++W+E+L
Sbjct: 16 ATPALANDQPWFDTQLPTQKRIDLLIDAMTLKEKTSQLVNGNVAIERLGLPEYDFWNEAL 75
Query: 104 HGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ---- 159
HG+A NG T FPQ + AA+F++ L S ++ EARA +N+
Sbjct: 76 HGVARNG----------RATVFPQAIGMAATFDQHLLLKAASVISDEARAKFNVSSEIGN 125
Query: 160 ----AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
+GLTFW PNINIFRDPRWGRGQET GEDP + + V QG++ K
Sbjct: 126 RSKYSGLTFWTPNINIFRDPRWGRGQETYGEDPYLTAQMGKAMVNGLQGDHPKY------ 179
Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTF 275
L +A KH + + R+ F+A+ + +D +T+
Sbjct: 180 ---------------------LKTAAAAKH---FAVHSGPEALRHEFDAIASPKDMYETY 215
Query: 276 QPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIF 334
P F + + + +M +YN+VNG PA L R++WGF G++ SDC +A
Sbjct: 216 FPAFEALVTEANVETVMAAYNRVNGHPAGGSDFLLNTVLRDKWGFSGHVVSDCWGLADFH 275
Query: 335 EYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
+Y T +SAA + G D+NCG + A++ G V EK ID+ L + + + +
Sbjct: 276 QYHKVTANAVESAALAINTGTDLNCG-AVYNALPDAVEAGLVDEKTIDKRLSKVLATKFK 334
Query: 395 LGLFNGDPRKGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
LG F DP+ Y + D V + H ++A + A + IVLL+N LPL++N + +L
Sbjct: 335 LGFF--DPKDDNPYNNISADVVNSEAHAQVAYEMAVKSIVLLQNKNNILPLDRN-IRNLY 391
Query: 454 IIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCHDVPCNSDAGFHEA 510
+ GP ++ + G Y G+ ++L G+ A VS +Y G N +
Sbjct: 392 VTGPFASSSEVLLGNYYGLSGKTTNILDGITANVSVGTTINYKQGILPYQANVNPIDWTT 451
Query: 511 VRIAKKADFVIVVAGLDLTQETEDR---------DRVSLLLPGQQMSLVTSVARTSKRPV 561
+ D +I V GL E E+ DR+SL LP Q++ + + + + +PV
Sbjct: 452 GEAKQMGDVIIAVMGLSGAYEGEEGEAIASPHKGDRLSLDLPEHQIAFLRKLRKDNDKPV 511
Query: 562 ILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPES 621
I+VLT G P V+ E +I++ YPG+ G KA+A+I+FG+ +P GRLP+T +P+S
Sbjct: 512 IVVLTAGTP--VNLTEIAELADAIVFAWYPGQEGGKAVADILFGERSPSGRLPIT-FPKS 568
Query: 622 FTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISAS 681
++P D D S Q R+YR+ T +Y FG GLSY + ++ +
Sbjct: 569 EAQLPPYD-----DYSMQE--RTYRYMTQEPMYPFGFGLSYAQVKFDNITL-------GN 614
Query: 682 LKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKV 741
+A + KN Q+ V ++VTN G+ + VV L+ + P
Sbjct: 615 TQALASKNEPQEN-------------------MTVTVNVTNTGEREFEEVVQLYLKTPDA 655
Query: 742 SQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
P L GF R+ A ++++ F + P + L N G+ +L G + ++VG
Sbjct: 656 GVSQPLHSLKGFTRIKLAAGQTEQVLFNI-PKKHLYSINAQGKPVLLKGQYSVIVG 710
>gi|367032987|ref|XP_003665776.1| glycoside hydrolase family 3 protein [Myceliophthora thermophila
ATCC 42464]
gi|347013048|gb|AEO60531.1| glycoside hydrolase family 3 protein [Myceliophthora thermophila
ATCC 42464]
Length = 835
Score = 362 bits (930), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 233/669 (34%), Positives = 342/669 (51%), Gaps = 46/669 (6%)
Query: 25 VNVIAFSNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDN 84
++ +A+ P PD C P C+ +L + RA +L++ LT +EK+Q L
Sbjct: 1 MSAVAYGLDGPFQTYPD--CTKPPLSDIKVCDRTLPEAERAAALVAALTDEEKLQNLVSK 58
Query: 85 ASAIPRLGIPAYEWWSESLHGIASNGPGVNFN---GTVSSVTSFPQVLVSAASFNRSLWS 141
A PR+G+PAY WWSE+LHG+A + PG F G +S TSFP L+ AA+F+ L
Sbjct: 59 APGAPRIGLPAYNWWSEALHGVA-HAPGTQFRDGPGDFNSSTSFPMPLLMAAAFDDELIE 117
Query: 142 NIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKS 201
+G + EARA N G +GL +W PN+N FRDPRWGRG ETPGED + + YA ++
Sbjct: 118 AVGDVIGTEARAFGNAGWSGLDYWTPNVNPFRDPRWGRGSETPGEDVVRLKRYAASMIRG 177
Query: 202 FQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYS 261
+G + S G G RV+ + CKH D E W +R+
Sbjct: 178 LEGRSSSSSSCSFGSGGEPPRVI----------------STCKHYAGNDFEDWNGTTRHD 221
Query: 262 FNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGF- 319
F+A+I+ QD + + PF+ C + +MC+YN VNGVP+C L R W +
Sbjct: 222 FDAVISAQDLAEYYLAPFQQCARDSRVGSVMCAYNAVNGVPSCANSYLMNTILRGHWNWT 281
Query: 320 --KGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQ 377
Y+TSDC+AV + + +Y T+ + +AGMD +C A G +
Sbjct: 282 EHDNYVTSDCEAVLDVSAHHHYADTNAEGTGLCFEAGMDTSCEYEGSSDIPGASAGGFLT 341
Query: 378 EKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKN 437
+DRAL L+ +R+G F+G + + LG DV E ++LAL AA +GIVLLKN
Sbjct: 342 WPAVDRALTRLYRSLVRVGYFDGP--ESPHASLGWADVNRPEAQELALRAAVEGIVLLKN 399
Query: 438 DKKFLPL---------NKNAVSSLAIIGPLVNNISQMGGGYTGIPC---SPKSLLRGLEA 485
D LPL +A+IG + ++ GGY+G P SP S R L
Sbjct: 400 DNDTLPLPLPDDVVVTADGGRRRVAMIGFWADAPDKLFGGYSGAPPFARSPASAARQLGW 459
Query: 486 YVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQ 545
V+ D D AV A AD+++ GLD + E +DR+++ P
Sbjct: 460 NVTVAGGPVLEGDSDEEEDTWTAPAVEAAADADYIVYFGGLDTSAAGETKDRMTIGWPAA 519
Query: 546 QMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFG 605
Q++L++ +AR K PV++V G D E D + ++LW +PG+ G A+ ++ G
Sbjct: 520 QLALISELARLGK-PVVVVQMGDQLDDTPLFELDG-VGAVLWANWPGQDGGTAVVRLLSG 577
Query: 606 DFNPGGRLPMTWYPESFTK-VPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTN 664
+P GRLP+T YP ++T VP+ DM +R ++ PGR+YR+Y T V FG GL YT
Sbjct: 578 AESPAGRLPVTQYPANYTDAVPLTDMTLRPSATN--PGRTYRWYP-TPVRPFGFGLHYTT 634
Query: 665 YSYKFLSAP 673
+ +F P
Sbjct: 635 FRAEFGPHP 643
>gi|291548352|emb|CBL21460.1| Beta-glucosidase-related glycosidases [Ruminococcus sp. SR1/5]
Length = 697
Score = 362 bits (929), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 242/751 (32%), Positives = 361/751 (48%), Gaps = 124/751 (16%)
Query: 64 RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
+A++L++ +TL+EK QL +A AI RLGIPAY WW+E LHG+A G T
Sbjct: 9 KAEALVARMTLEEKASQLRYDAPAIKRLGIPAYNWWNEGLHGVARAG----------QAT 58
Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFWAPNINIFRDP 175
FPQ + AA+F+R + + VA E RA YN GLTFW+PN+NIFRDP
Sbjct: 59 VFPQAIGMAAAFDRKSVAEMAGIVATEGRAKYNAYSVNGDRDIYKGLTFWSPNVNIFRDP 118
Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
RWGRG ET GEDP + V FVK+ QG GD
Sbjct: 119 RWGRGHETYGEDPYLTKELGVSFVKALQG----------------------------NGD 150
Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
+ +AC KH + + R+ F+A + +D E+T+ P F +++ K +M +Y
Sbjct: 151 TMKAAACAKHFAVHSGPE---ALRHEFDAEASAKDMEETYLPAFEGLVKEAKVEAVMGAY 207
Query: 296 NQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGM 355
N+ NG P C L +K R EW F+G+ SDC A+ E+ T T +SAA + G
Sbjct: 208 NRTNGEPCCGSPTLQKKLRGEWKFQGHFVSDCWAIRDFHEHHMVTDTAVESAALAINNGC 267
Query: 356 DINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDV 415
D+NCG L H A +KG V E+ I RA + LF+ + LGLF+G +Y L +V
Sbjct: 268 DLNCGNTYL-HIMKAYEKGLVTEETITRAAVRLFTTRYLLGLFDG----SEYDNLSYMEV 322
Query: 416 CTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCS 475
+ H A AA + VLLKN+ LPL+K + ++ IIGP ++ + G Y G
Sbjct: 323 ESPRHLDAAEKAAEKSFVLLKNN-GILPLDKEKLKTIGIIGPNADSRQALIGNYHGTASR 381
Query: 476 PKSLLRGLEAYVS---KTHYASGC-------HDVPCNSDAGFHEAVRIAKKADFVIVVAG 525
++ G++ YV + + GC + D EA +A+ +D VI+ G
Sbjct: 382 YITIQEGIQDYVGDDVRILTSRGCDLFRDRTEHLAFTRDR-IAEAKVVAENSDVVILCMG 440
Query: 526 LDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFA 576
LD T E E+ D+ + LPG Q L+ ++A T K PV+ L G LD+ +A
Sbjct: 441 LDETLEGEEGDTGNSYVSGDKEDIELPGVQRELMEAIADTGK-PVVFCLLAGSDLDLKYA 499
Query: 577 EADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRAD 635
+LW YPG G KA A+++FG+ +P G+LP+T+Y ES ++P D +M+
Sbjct: 500 AEKFDAVMMLW--YPGCQGGKAAAKVLFGEISPSGKLPVTFY-ESLEELPDFTDYSMK-- 554
Query: 636 SSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTG 695
GR+YR+ + FG+GL+Y+ K DK
Sbjct: 555 ------GRTYRYMERKAQFPFGYGLTYS-------------------KVAVDKA------ 583
Query: 696 SRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDR 755
EV +C + +V++ V N G D VV ++ + P L GF R
Sbjct: 584 ---------EVKTCGQ-KINVEVEVQNNGAYDTEDVVQIYVKNIDSKNAIPNPMLAGFQR 633
Query: 756 VHTVAKGSKEISFGVDPCEQLSIANKHGRRI 786
+ A ++I + + ++ ++ G+R+
Sbjct: 634 IFLKAGECRKIEIPIWE-KAFTVVDETGKRM 663
>gi|255284060|ref|ZP_05348615.1| beta-glucosidase [Bryantella formatexigens DSM 14469]
gi|255265405|gb|EET58610.1| glycosyl hydrolase family 3 C-terminal domain protein
[Marvinbryantia formatexigens DSM 14469]
Length = 700
Score = 362 bits (928), Expect = 5e-97, Method: Compositional matrix adjust.
Identities = 240/754 (31%), Positives = 366/754 (48%), Gaps = 130/754 (17%)
Query: 64 RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
RA++L++ +T++EK QL +A AI RLGIPAY WW+E+LHG+A G T
Sbjct: 9 RAEALVAQMTVEEKASQLKYDAPAIKRLGIPAYNWWNEALHGVARAG----------QAT 58
Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFWAPNINIFRDP 175
FPQ + A+F+ +L I +A E RA YN A GLTFW+PN+NIFRDP
Sbjct: 59 VFPQAIGLGATFDEALLGEIADVIATEGRAKYNAYAAKEDRDIYKGLTFWSPNVNIFRDP 118
Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
RWGRG ET GEDP + S V FVK QG+ G+
Sbjct: 119 RWGRGHETYGEDPCLTSRLGVAFVKGLQGD----------------------------GE 150
Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
+ +AC KH + + R+ FNA + +D E+T+ P F + +++ +M +Y
Sbjct: 151 TMKAAACAKH---FAVHSGPEAVRHEFNAEASAKDMEETYLPAFEALVKEADVEAVMGAY 207
Query: 296 NQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGM 355
N+ NG C L + R +WGF+G+ SDC A+ E+ T T ++SAA + +G
Sbjct: 208 NRTNGEACCASPVLQKILREDWGFEGHFVSDCWAIRDFHEHHMLTATAKESAAMAINSGC 267
Query: 356 DINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDV 415
D+NCG L H A G V E+ I A + LF+ + LGLF+G +Y + V
Sbjct: 268 DLNCGNTYL-HILHAYRDGLVSEETITEAAVRLFTTRFLLGLFDGS----EYDDIPYTVV 322
Query: 416 CTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCS 475
+ EH LA AA + VLLKN+ LPL K + ++ +IGP ++ + + G Y G
Sbjct: 323 ESKEHLALAEKAALESAVLLKNN-GILPLKKERLRTVGVIGPNADSRAALAGNYHGTASR 381
Query: 476 PKSLLRGLEAYVS---KTHYASGCHDVPCNSD----AG--FHEAVRIAKKADFVIVVAGL 526
+++ +GL+ Y+ + + GC ++ AG EA +A+ +D VI+ GL
Sbjct: 382 YETIQQGLQDYLGEDVRVLTSVGCALSEDRTEKLALAGDRLAEAQIVAENSDVVILCLGL 441
Query: 527 DLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAE 577
D T E E+ D+ +LLLP Q L+ +VA T K PV+L + G LD+S+A
Sbjct: 442 DETLEGEEGDTGNSYASGDKETLLLPEAQRDLMEAVAATGK-PVVLCMMSGSDLDMSYAA 500
Query: 578 ADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADS 636
+IL + YPG G A A+++FG+ +P G+LP+T+Y E+ ++P D +M+
Sbjct: 501 --EHFDAILQLWYPGSQGGSAAAKLLFGEVSPSGKLPVTFY-ETLEELPAFEDYSMK--- 554
Query: 637 SRQYPGRSYRFYTGTQVYGFGHGLSY-----TNYSYKFLSAPSELTISASLKAGSDKNIL 691
GR+YR+ Y FG GL+Y T+ + + SA +LT++
Sbjct: 555 -----GRTYRYMGHPAQYPFGFGLTYGDVRVTDANIRGASAEGDLTLA------------ 597
Query: 692 QQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLI 751
++ NAG+ V+ ++ + + P L
Sbjct: 598 --------------------------VTAENAGNAVTDEVLQIYVKCTDSANAVPNPALA 631
Query: 752 GFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRR 785
F R+H A K I V P ++ ++ G R
Sbjct: 632 AFGRIHLEAGEKKTIEMTV-PARAFTVVDEAGVR 664
>gi|339499234|ref|YP_004697269.1| beta-glucosidase [Spirochaeta caldaria DSM 7334]
gi|338833583|gb|AEJ18761.1| Beta-glucosidase [Spirochaeta caldaria DSM 7334]
Length = 699
Score = 359 bits (921), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 252/757 (33%), Positives = 367/757 (48%), Gaps = 120/757 (15%)
Query: 66 KSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSF 125
++LIS ++L+EKI + A IPRLGIP Y WW+E+LHG+A+NG T F
Sbjct: 13 ETLISNMSLEEKIGLMIHRAKGIPRLGIPDYNWWNEALHGVANNG----------EATVF 62
Query: 126 PQVLVSAASFNRSLWSNIGSAVAVEARAMYN-LGQA-------GLTFWAPNINIFRDPRW 177
PQ + A+F+ L + A+++EARA +N +G+ GLTFWAPNINIFRDPRW
Sbjct: 63 PQAIALGATFDEDLVHRVAEAISIEARAKFNAVGKEKAEQYHRGLTFWAPNINIFRDPRW 122
Query: 178 GRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDEL 237
GRGQET GEDP++ S +V+ QG + L
Sbjct: 123 GRGQETYGEDPVLTSRLGTAYVRGLQGSD---------------------------PYYL 155
Query: 238 MLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQ 297
+AC KH + + R++FNA ++++D E+T+ P F++ ++ G S +M +YN+
Sbjct: 156 RAAACAKHFAVHSGPE---GLRHTFNAEVSQKDLEETYLPAFKALVKSGVES-VMGAYNR 211
Query: 298 VNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMD 356
VNG PAC L QK R EW F+G++ SDC A+ + T +S A L++G D
Sbjct: 212 VNGEPACGSTYLLKQKLREEWQFQGHVVSDCWAICDFHKNHKVTNDILESIALALRSGCD 271
Query: 357 INCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVC 416
+NCG + A+ KG V E DI+RA++ L +LGL + D G Y + +
Sbjct: 272 LNCGDAY-NYLAEAVLKGYVTEDDINRAVVRLLITLDKLGLIHDD---GPYQGITIHQID 327
Query: 417 TSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSP 476
+H LAL+AA + IVLLKN+ LPL K+ +S + + GP N + G Y G+
Sbjct: 328 WKKHDSLALEAAEKSIVLLKNN-GVLPLKKDKISYIYVTGPNATNSDALLGNYAGV---S 383
Query: 477 KSLLRGLEAYVSK------THYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQ 530
LL LEA V + Y GC + A + K AD I V G D +
Sbjct: 384 SRLLTVLEAIVEEAGPEITVTYKKGCPLAERRVNPN-DWASGVTKYADVTIAVMGRDTSV 442
Query: 531 ETEDRDRV---------SLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQ 581
E E+ D + L L +Q+S + + + S +P+I+VL GG P + E
Sbjct: 443 EGEEGDAILSSTYGDFEDLNLNDEQLSYLHKL-KESGKPLIVVLMGGAP--ICSPELHEI 499
Query: 582 ISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQY 640
+IL YPG+AG A++ I+FG NP G+LP+T +P+S ++P + +M+
Sbjct: 500 ADAILVAWYPGQAGGTAVSNIVFGKTNPSGKLPVT-FPKSVRQLPEFENYSMQ------- 551
Query: 641 PGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDY 700
GR+YR+ T +Y FG GLSYT +K ++ R
Sbjct: 552 -GRTYRYMTEEPLYPFGFGLSYTKMEFKHVTG------------------------RWKS 586
Query: 701 VHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVA 760
DE+ T L N G +DG VV L+ P LI F RV A
Sbjct: 587 PEKDELIVSTELY--------NQGTIDGEEVVQLYYHWKDAPFAVPNWSLIDFKRVLVAA 638
Query: 761 KGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
S F + P E+L + G+ ++P G VG
Sbjct: 639 GASCICEFKI-PLEKLQCIDPSGKGVIPTGTLQFYVG 674
>gi|288870210|ref|ZP_06113312.2| beta-glucosidase [Clostridium hathewayi DSM 13479]
gi|288868024|gb|EFD00323.1| beta-glucosidase [Clostridium hathewayi DSM 13479]
Length = 730
Score = 358 bits (919), Expect = 6e-96, Method: Compositional matrix adjust.
Identities = 213/630 (33%), Positives = 321/630 (50%), Gaps = 79/630 (12%)
Query: 64 RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
+A+ L+ +TL+EK+ Q + A AI RLGI AY WW+E LHG+A G T
Sbjct: 24 KAEYLVKQMTLEEKVFQTMNQAPAIERLGIKAYNWWNEGLHGVARAG----------VAT 73
Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ--------AGLTFWAPNINIFRDP 175
FPQ + AA+F+ L +G AV+ EARA Y++ Q GLT WAPNINIFRDP
Sbjct: 74 IFPQAIGLAATFDEDLIETVGEAVSTEARAKYHMQQRYGDTDIYKGLTLWAPNINIFRDP 133
Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
RWGRG ET GEDP + S + +++ QG + K
Sbjct: 134 RWGRGHETYGEDPWLTSRLGIRYIRGLQGSHEKY-------------------------- 167
Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
L +AC KH + + R+SF+A ++E+D +T+ P F +C++ G +M +Y
Sbjct: 168 -LKTAACVKHFAVHSGPE---ELRHSFDAEVSEKDLRETYLPAFEACVKDGDVEAVMGAY 223
Query: 296 NQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
N+VNGVP C L + R EWGF G++ SDC A+ E T + +S + + G
Sbjct: 224 NRVNGVPCCGNEYLLETILRKEWGFHGHVVSDCWAIKDFHEGHGVTDSPVESVSMAMNHG 283
Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDD 414
D+NCG + + A+ +GKV+E+ +D A++ LF+ +L+LG Y + +
Sbjct: 284 CDLNCGN-LFTYLIQAVKEGKVKEERLDEAVIRLFTTRLKLGALGKMEEDDPYAGISYLE 342
Query: 415 VCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPC 474
V + KKL AA + +VLLKN + LP++ ++ +IGP ++ + G Y G
Sbjct: 343 VDSPAMKKLNRSAAGKSVVLLKNTEGLLPIDTKRYKTIGVIGPNADSRRALVGNYEGTAS 402
Query: 475 SPKSLLRGLEAYV---SKTHYASGCHDVPCN------SDAGFHEAVRIAKKADFVIVVAG 525
++L G+ ++ Y+ GCH N + E I +++D VI G
Sbjct: 403 EYVTVLEGIREAAEPEARVLYSEGCHLYKSNVSGLGARNDRLSEVKGICRESDIVIACMG 462
Query: 526 LDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFA 576
LD T E E D+ L+LPG Q ++ + A S +PV+LVL G + V++
Sbjct: 463 LDSTLEGEQGDTGNIYAGGDKPDLMLPGLQQKILET-AYDSGKPVVLVLLAGSAMAVTW- 520
Query: 577 EADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADS 636
AD + +IL YPG G + +A+++FG NP GRLP+T+Y + + +M
Sbjct: 521 -ADEHLPAILTAWYPGAEGGRGVADVLFGTVNPEGRLPVTFYRTTEELPDFTNYSME--- 576
Query: 637 SRQYPGRSYRFYTGTQVYGFGHGLSYTNYS 666
GR+YRF +Y FG GLSYT +S
Sbjct: 577 -----GRTYRFMKQKALYPFGFGLSYTEFS 601
>gi|336408348|ref|ZP_08588841.1| hypothetical protein HMPREF1018_00856 [Bacteroides sp. 2_1_56FAA]
gi|423248801|ref|ZP_17229817.1| hypothetical protein HMPREF1066_00827 [Bacteroides fragilis
CL03T00C08]
gi|423253750|ref|ZP_17234681.1| hypothetical protein HMPREF1067_01325 [Bacteroides fragilis
CL03T12C07]
gi|335937826|gb|EGM99722.1| hypothetical protein HMPREF1018_00856 [Bacteroides sp. 2_1_56FAA]
gi|392655379|gb|EIY49022.1| hypothetical protein HMPREF1067_01325 [Bacteroides fragilis
CL03T12C07]
gi|392657742|gb|EIY51373.1| hypothetical protein HMPREF1066_00827 [Bacteroides fragilis
CL03T00C08]
Length = 722
Score = 358 bits (919), Expect = 7e-96, Method: Compositional matrix adjust.
Identities = 229/745 (30%), Positives = 365/745 (48%), Gaps = 97/745 (13%)
Query: 56 NTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF 115
+ S ++ R ++LI +TL EK+ QL + +IPRL +PAY +W+E LHG+A G
Sbjct: 53 DLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAG----- 107
Query: 116 NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDP 175
VT FPQ + A++++ L + SA++ EAR Y GLT+W+P IN+ RDP
Sbjct: 108 -----EVTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDP 162
Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
RWGR +ET GEDP + S V FVK QG D
Sbjct: 163 RWGRNEETYGEDPHLTSRLGVAFVKGLQG---------------------------DHPT 195
Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
L A KH +A + E +R+S ++ I + + + P + +C+++ A +M +Y
Sbjct: 196 YLKTVATIKHFVANNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAY 251
Query: 296 NQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
N NGVP L R EWGF G++ SDC A+ + + E++AA + +G
Sbjct: 252 NAFNGVPPSGSHWLLDDVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSG 311
Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK-GKYGKLGPD 413
D+ CGT A+++G + E IDRAL + + + +LG F DP + Y
Sbjct: 312 CDLECGTTYKEKLVQAVEQGLISEAAIDRALTRVLTARFKLGEF--DPMELVPYNHYDKK 369
Query: 414 DVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIP 473
+ + +LA +AA + +VLLKND LPLNK + S+A++GP + GGY+G P
Sbjct: 370 LLAGKKFAELAYEAAVKSVVLLKND-ALLPLNKEKIKSVAVVGPFAD--YNYLGGYSGQP 426
Query: 474 CSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETE 533
SLL+G++ + K + + + ++D+ ++ K AD V+V G D E
Sbjct: 427 PYSVSLLKGVKELIGKKGKVTYLNGMGTSADS----IAQVVKGADIVLVALGSDEKMARE 482
Query: 534 DRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGE 593
+ D S+ LP +Q L+ + + + R ++LV G PL + AD+ I +I+ YPG+
Sbjct: 483 NHDMPSIYLPEEQEKLLKEIYQVNPR-IVLVFHTGNPLTSEW--ADTHIPAIMQAWYPGQ 539
Query: 594 AGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQV 653
+ALA ++FG+ NP G+LPMT Y + D +M + GR+YR+ G +
Sbjct: 540 EAGRALANLLFGNENPSGKLPMTIYKTEEQLPDILDFDM-------WKGRTYRYMKGEPL 592
Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
YGFGHGLSYT++ + + D + D + C+
Sbjct: 593 YGFGHGLSYTSFEFDNIQGN-------------------------DTLQSDAILQCS--- 624
Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEKQLIGFDRVHTVAKGSKEISFGVDP 772
+ ++N+G + G VV ++ T P K+L+ F +V + K++ F + P
Sbjct: 625 ----VELSNSGQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTIAP 680
Query: 773 CEQLSIANKHGRRILPLGNHVLMVG 797
E LS+ R+L G + L +G
Sbjct: 681 RE-LSVWEDGKWRMLS-GKYTLFIG 703
>gi|365120422|ref|ZP_09338009.1| hypothetical protein HMPREF1033_01355 [Tannerella sp.
6_1_58FAA_CT1]
gi|363647477|gb|EHL86692.1| hypothetical protein HMPREF1033_01355 [Tannerella sp.
6_1_58FAA_CT1]
Length = 735
Score = 358 bits (918), Expect = 8e-96, Method: Compositional matrix adjust.
Identities = 247/757 (32%), Positives = 357/757 (47%), Gaps = 112/757 (14%)
Query: 50 DSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASN 109
+++PF N LS R L+S LTL+EKI Q+ + A AI RLGIPAY+WW+E LHG+
Sbjct: 25 NTFPFQNPDLSFEKRVDDLVSRLTLEEKISQMLNKAPAIERLGIPAYDWWNECLHGVGR- 83
Query: 110 GPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------G 161
T VT FPQ + AA+++ +L+ + S++A E RA+Y+ + G
Sbjct: 84 --------TPYKVTVFPQAIGMAATWDDALFQQVASSIADEGRAIYHDAISKGVHEIYHG 135
Query: 162 LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREK 221
LT+W PNINIFRDPRWGRGQET GEDP + FV QG++ K
Sbjct: 136 LTYWTPNINIFRDPRWGRGQETYGEDPYLTGTLGKAFVNGLQGDDPKY------------ 183
Query: 222 RVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRS 281
L SAC KH Y + SR+ FN ++ D DT+ P FR
Sbjct: 184 ---------------LKASACAKH---YAVHSGPEISRHFFNTEVSMYDLWDTYLPAFRD 225
Query: 282 CIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYT 340
+ K S +MC+YN + G P C L Q R +W F GY+TSDC A+ +++ +
Sbjct: 226 LVVDAKVSSVMCAYNALAGQPCCGNDLLMQDILRKQWKFTGYVTSDCGAIDDFLKHKTHA 285
Query: 341 KTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNG 400
SA VL G D+ CG + A+ +G + E ID ++ LF + RLGLF+
Sbjct: 286 DAAHASADAVLH-GTDLECGQNIYVKLVDAVKQGLITEAQIDESVKRLFMTRFRLGLFDP 344
Query: 401 DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVN 460
R KY + EHK LAL +R+ +VLLKND LPL KN + +A+IGP +
Sbjct: 345 ADRV-KYADTPLSVLECDEHKALALKMSRESVVLLKND-NVLPLRKN-LKKIAVIGPNAD 401
Query: 461 NISQMGGGYTGIPCSPKSLLRGLEAYVSKTH---YASGCHDVPCNSDAGFHEAVRIAKKA 517
+ + + G Y G P + L + + V K Y V + + + + K
Sbjct: 402 DSTVVLGNYNGFPSKVITPLEAIRSKVGKRTQVIYDRAIDCVKPSDEKTLNALIERLKGV 461
Query: 518 DFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTG 567
D VI V G+ E E+ DR ++ LP Q L+ + + + PVI V+
Sbjct: 462 DQVIFVGGISPRLEGEELPISVDGFRGGDRTTIALPEVQTELMKKM-KEAGLPVIFVMMT 520
Query: 568 GGPLDVSFAEADSQ-ISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
G L + E +SQ I +IL Y G+ +A+A+++FGD+NP G+LP+T+Y P
Sbjct: 521 GSALGI---EWESQNIPAILNAWYGGQFAGQAIADVLFGDYNPSGKLPVTFYRSDSDLPP 577
Query: 627 MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGS 686
+M R+YR++ G +Y FG GLSYT + Y S + +K
Sbjct: 578 FGAFSM--------ANRTYRYFKGEALYPFGFGLSYTMFDYSVPQVVSGGKVGEPIK--- 626
Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
V + V N G +G VV L+ V + P
Sbjct: 627 -----------------------------VSVKVKNIGKKNGDEVVQLYLSHEGVEKA-P 656
Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHG 783
L GF RV+ A K +SF + P +S+ + +G
Sbjct: 657 ITALKGFKRVYLKAGEEKTLSFEISP-RDMSLPDDNG 692
>gi|375357164|ref|YP_005109936.1| putative glycosyl hydrolase [Bacteroides fragilis 638R]
gi|301161845|emb|CBW21389.1| putative glycosyl hydrolase [Bacteroides fragilis 638R]
Length = 722
Score = 358 bits (918), Expect = 8e-96, Method: Compositional matrix adjust.
Identities = 229/745 (30%), Positives = 365/745 (48%), Gaps = 97/745 (13%)
Query: 56 NTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF 115
+ S ++ R ++LI +TL EK+ QL + +IPRL +PAY +W+E LHG+A G
Sbjct: 53 DLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAG----- 107
Query: 116 NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDP 175
VT FPQ + A++++ L + SA++ EAR Y GLT+W+P IN+ RDP
Sbjct: 108 -----EVTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDP 162
Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
RWGR +ET GEDP + S V FVK QG D
Sbjct: 163 RWGRNEETYGEDPHLTSRLGVAFVKGLQG---------------------------DHPT 195
Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
L A KH +A + E +R+S ++ I + + + P + +C+++ A +M +Y
Sbjct: 196 YLKTVATIKHFVANNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAY 251
Query: 296 NQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
N NGVP L R EWGF G++ SDC A+ + + E++AA + +G
Sbjct: 252 NAFNGVPPSGSHWLLDDVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSG 311
Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK-GKYGKLGPD 413
D+ CGT A+++G + E IDRAL + + + +LG F DP + Y
Sbjct: 312 CDLECGTTYKEKLVQAVEQGLISEAAIDRALTRVLTARFKLGEF--DPMELVPYNHYDKK 369
Query: 414 DVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIP 473
+ + +LA +AA + +VLLKND LPLNK + S+A++GP + GGY+G P
Sbjct: 370 LLAGKKFAELAYEAAVKSVVLLKND-ALLPLNKEKIKSVAVVGPFAD--YNYLGGYSGQP 426
Query: 474 CSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETE 533
SLL+G++ + K + + + ++D+ ++ K AD V+V G D E
Sbjct: 427 PYSVSLLKGVKELIGKKGKVTYLNGMGTSADS----IAQVVKGADIVLVALGSDEKMARE 482
Query: 534 DRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGE 593
+ D S+ LP +Q L+ + + + R ++LV G PL + AD+ I +I+ YPG+
Sbjct: 483 NHDMPSIYLPEEQEKLLKKIYQVNPR-IVLVFHTGNPLTSEW--ADTHIPAIMQAWYPGQ 539
Query: 594 AGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQV 653
+ALA ++FG+ NP G+LPMT Y + D +M + GR+YR+ G +
Sbjct: 540 EAGRALANLLFGNENPSGKLPMTIYKTEEQLPDILDFDM-------WKGRTYRYMKGEPL 592
Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
YGFGHGLSYT++ + + D + D + C+
Sbjct: 593 YGFGHGLSYTSFEFDNIQGN-------------------------DTLQPDAILQCS--- 624
Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEKQLIGFDRVHTVAKGSKEISFGVDP 772
+ ++N+G + G VV ++ T P K+L+ F +V + K++ F + P
Sbjct: 625 ----VELSNSGQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTIAP 680
Query: 773 CEQLSIANKHGRRILPLGNHVLMVG 797
E LS+ R+L G + L +G
Sbjct: 681 RE-LSVWEDGKWRMLS-GKYTLFIG 703
>gi|423258868|ref|ZP_17239791.1| hypothetical protein HMPREF1055_02068 [Bacteroides fragilis
CL07T00C01]
gi|423264161|ref|ZP_17243164.1| hypothetical protein HMPREF1056_00851 [Bacteroides fragilis
CL07T12C05]
gi|387776448|gb|EIK38548.1| hypothetical protein HMPREF1055_02068 [Bacteroides fragilis
CL07T00C01]
gi|392706427|gb|EIY99550.1| hypothetical protein HMPREF1056_00851 [Bacteroides fragilis
CL07T12C05]
Length = 722
Score = 358 bits (918), Expect = 9e-96, Method: Compositional matrix adjust.
Identities = 229/745 (30%), Positives = 365/745 (48%), Gaps = 97/745 (13%)
Query: 56 NTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF 115
+ S ++ R ++LI +TL EK+ QL + +IPRL +PAY +W+E LHG+A G
Sbjct: 53 DLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAG----- 107
Query: 116 NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDP 175
VT FPQ + A++++ L + SA++ EAR Y GLT+W+P IN+ RDP
Sbjct: 108 -----EVTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDP 162
Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
RWGR +ET GEDP + S V FVK QG D
Sbjct: 163 RWGRNEETYGEDPHLTSRLGVAFVKGLQG---------------------------DHPT 195
Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
L A KH +A + E +R+S ++ I + + + P + +C+++ A +M +Y
Sbjct: 196 YLKTVATIKHFVANNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAY 251
Query: 296 NQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
N NGVP L R EWGF G++ SDC A+ + + E++AA + +G
Sbjct: 252 NAFNGVPPSGSHWLLDDVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSG 311
Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK-GKYGKLGPD 413
D+ CGT A+++G + E IDRAL + + + +LG F DP + Y
Sbjct: 312 CDLECGTTYKEKLVQAVEQGLISEAAIDRALTRVLTARFKLGEF--DPMELVPYNHYDKK 369
Query: 414 DVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIP 473
+ + +LA +AA + +VLLKND LPLNK + S+A++GP + GGY+G P
Sbjct: 370 LLAGKKFAELAYEAAVKSVVLLKND-ALLPLNKEKIKSVAVVGPFAD--YNYLGGYSGQP 426
Query: 474 CSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETE 533
SLL+G++ + K + + + ++D+ ++ K AD V+V G D E
Sbjct: 427 PYSVSLLKGVKELIGKKGKVTYLNGMGTSADS----IAQVVKGADIVLVALGSDEKMARE 482
Query: 534 DRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGE 593
+ D S+ LP +Q L+ + + + R ++LV G PL + AD+ I +I+ YPG+
Sbjct: 483 NHDMPSIYLPEEQEKLLKEIYQVNPR-IVLVFHTGNPLTSEW--ADTHIPAIMQAWYPGQ 539
Query: 594 AGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQV 653
+ALA ++FG+ NP G+LPMT Y + D +M + GR+YR+ G +
Sbjct: 540 EAGRALANLLFGNENPSGKLPMTIYKTEEQLPDILDFDM-------WKGRTYRYMKGEPL 592
Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
YGFGHGLSYT++ + + D + D + C+
Sbjct: 593 YGFGHGLSYTSFEFDNIQGN-------------------------DTLQPDAILQCS--- 624
Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEKQLIGFDRVHTVAKGSKEISFGVDP 772
+ ++N+G + G VV ++ T P K+L+ F +V + K++ F + P
Sbjct: 625 ----VELSNSGQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTIAP 680
Query: 773 CEQLSIANKHGRRILPLGNHVLMVG 797
E LS+ R+L G + L +G
Sbjct: 681 RE-LSVWEDGKWRMLS-GKYTLFIG 703
>gi|265765457|ref|ZP_06093732.1| beta-xylosidase [Bacteroides sp. 2_1_16]
gi|263254841|gb|EEZ26275.1| beta-xylosidase [Bacteroides sp. 2_1_16]
Length = 722
Score = 358 bits (918), Expect = 9e-96, Method: Compositional matrix adjust.
Identities = 229/745 (30%), Positives = 365/745 (48%), Gaps = 97/745 (13%)
Query: 56 NTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF 115
+ S ++ R ++LI +TL EK+ QL + +IPRL +PAY +W+E LHG+A G
Sbjct: 53 DLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAG----- 107
Query: 116 NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDP 175
VT FPQ + A++++ L + SA++ EAR Y GLT+W+P IN+ RDP
Sbjct: 108 -----EVTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDP 162
Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
RWGR +ET GEDP + S V FVK QG D
Sbjct: 163 RWGRNEETYGEDPHLTSRLGVAFVKGLQG---------------------------DHPT 195
Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
L A KH +A + E +R+S ++ I + + + P + +C+++ A +M +Y
Sbjct: 196 YLKTVATIKHFVANNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAY 251
Query: 296 NQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
N NGVP L R EWGF G++ SDC A+ + + E++AA + +G
Sbjct: 252 NAFNGVPPSGSHWLLDDVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSG 311
Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK-GKYGKLGPD 413
D+ CGT A+++G + E IDRAL + + + +LG F DP + Y
Sbjct: 312 CDLECGTTYKEKLVQAVEQGLISEAAIDRALTRVLTARFKLGEF--DPMELVPYNHYDKK 369
Query: 414 DVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIP 473
+ + +LA +AA + +VLLKND LPLNK + S+A++GP + GGY+G P
Sbjct: 370 LLAGKKFAELAYEAAVKSVVLLKND-ALLPLNKEKIKSVAVVGPFAD--YNYLGGYSGQP 426
Query: 474 CSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETE 533
SLL+G++ + K + + + ++D+ ++ K AD V+V G D E
Sbjct: 427 PYSVSLLKGVKELIGKKGKVTYLNGMGTSADS----IAQVVKGADIVLVALGSDEKMARE 482
Query: 534 DRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGE 593
+ D S+ LP +Q L+ + + + R ++LV G PL + AD+ I +I+ YPG+
Sbjct: 483 NHDMPSIYLPEEQEKLLKKIYQVNPR-IVLVFHTGNPLTSEW--ADTHIPAIMQAWYPGQ 539
Query: 594 AGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQV 653
+ALA ++FG+ NP G+LPMT Y + D +M + GR+YR+ G +
Sbjct: 540 EAGRALANLLFGNENPSGKLPMTIYKTEEQLPDILDFDM-------WKGRTYRYMKGEPL 592
Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
YGFGHGLSYT++ + + D + D + C+
Sbjct: 593 YGFGHGLSYTSFEFDNIQGN-------------------------DTLQPDAILQCS--- 624
Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEKQLIGFDRVHTVAKGSKEISFGVDP 772
+ ++N+G + G VV ++ T P K+L+ F +V + K++ F + P
Sbjct: 625 ----VELSNSGQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTIAP 680
Query: 773 CEQLSIANKHGRRILPLGNHVLMVG 797
E LS+ R+L G + L +G
Sbjct: 681 RE-LSVWEDGKWRMLS-GKYTLFIG 703
>gi|451996250|gb|EMD88717.1| glycoside hydrolase family 3 protein [Cochliobolus heterostrophus
C5]
Length = 763
Score = 358 bits (918), Expect = 9e-96, Method: Compositional matrix adjust.
Identities = 234/754 (31%), Positives = 364/754 (48%), Gaps = 67/754 (8%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
C+ + RA +L++ + QEK+ L + + RLG+PAY WW E+LHG+A PG+
Sbjct: 36 ICDVNAPPHERAAALVAAMEPQEKLDNLVSKSKGVSRLGLPAYNWWGEALHGVAG-APGI 94
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFR 173
F + TSFP ++ +A+F+ L I + + EARA N G A + +W P+IN R
Sbjct: 95 KFVEPYKNATSFPMPILMSAAFDDDLIFKIANIIGNEARAFGNGGVAPMDYWTPDINPVR 154
Query: 174 DPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDR 233
D RWGR E+PGED + Y + +G+ +++++
Sbjct: 155 DIRWGRASESPGEDIRRIKGYTKALLAGLEGDQ------------AQRKII--------- 193
Query: 234 GDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMC 293
A CKH + YD+E WG + R++F+A IT QD + + PPF+ C K MC
Sbjct: 194 -------ATCKHYVGYDMEAWGGYDRHNFSAKITMQDLAEYYMPPFQQCARDSKVGSFMC 246
Query: 294 SYNQVNGVPACLRGDLFQKA-RNEWGF---KGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
SYN VNGVP C + Q R+ W + YITSDC+AVA I E Y +T A
Sbjct: 247 SYNAVNGVPTCADTYVLQTILRDHWNWTDSNNYITSDCEAVADISENHKYVETLAQGTAL 306
Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGK 409
GMD++C A +G + ID+AL + + G F+G K Y
Sbjct: 307 AFAKGMDLSCEYSGSSDIPGAWSQGLLNLSVIDKALTRQYEGLVHAGYFDG--AKATYAN 364
Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKF-LPLNKNAVSSLAIIGPLVNNISQMGGG 468
L +D+ T E ++L+L +G+V+LKND LPL K S +A+IG N+ S++ G
Sbjct: 365 LSYNDINTPEARQLSLQVTSEGLVMLKNDHTLPLPLTKG--SKVAMIGFWANDSSKLQGI 422
Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASG----CHDVPCNSDAGFHEAVRIAKKADFVIVVA 524
Y+G P S + E A G VP N A+ A+K+D+++
Sbjct: 423 YSGPPPYRHSPVFAGEQMGLDMAIAWGPMIQNSSVPDNWTT---NALDAAEKSDYILYFG 479
Query: 525 GLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISS 584
G D T E DR ++ P Q+ L+ +A+ K V++ L G D S + I+S
Sbjct: 480 GQDWTVAQEGYDRTTISFPQVQIDLLAKLAKLGKPLVVITL--GDMTDHSPLLSMEGINS 537
Query: 585 ILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRS 644
I+W +PG+ G A+ +I G P GRLP+T YP + K+ M DMN+R + + PGR+
Sbjct: 538 IIWANWPGQDGGPAILNVISGVHAPAGRLPITEYPADYVKLSMLDMNLRPHA--ESPGRT 595
Query: 645 YRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHID 704
YR++ V FG GL YT + F SE ++ ++ D Q Y +
Sbjct: 596 YRWFN-ESVQPFGFGLHYTTFEAGFA---SEEGLTYDIQETLDSCTQQ-------YKDLC 644
Query: 705 EVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSK 764
EV ++++V N G+ V + F + + P K LI + R+ + G+K
Sbjct: 645 EVAP-------LEVTVANKGNRTSDFVALAFIKGEVGPKPYPLKTLITYGRLRDIHGGAK 697
Query: 765 EISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
+ + +L+ ++ G ++ G + L++ E
Sbjct: 698 KSASLPLTLGELARVDQSGNTVIYPGEYTLLLDE 731
>gi|291530120|emb|CBK95705.1| Beta-glucosidase-related glycosidases [Eubacterium siraeum 70/3]
Length = 689
Score = 357 bits (917), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 249/775 (32%), Positives = 376/775 (48%), Gaps = 143/775 (18%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ + LS RA +L L+ +E+ QQL +A AI + G+P+Y WW+E LHG+A G
Sbjct: 4 YKDKQLSAYERAAALADTLSTEEQAQQLKYDAPAIEKAGLPSYNWWNEGLHGVARAG--- 60
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
+ T FPQ + AA+F++ + +G V+ EARAMYN GLT W
Sbjct: 61 -------TATVFPQAIALAAAFDKDMMCRVGEVVSTEARAMYNSAAKHGDTDIYKGLTLW 113
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
APNINIFRDPRWGRG ET GEDP + S V FVK QGE EK
Sbjct: 114 APNINIFRDPRWGRGHETYGEDPYLTSRLGVNFVKGIQGE--------------EKY--- 156
Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFS-RYSFNAMITEQDTEDTFQPPFRSCIE 284
L +AC KH + G S R+ F+A ++E+D E+T+ P F++ ++
Sbjct: 157 -----------LRAAACAKHFAVHS----GPESLRHEFDARVSEKDLEETYLPAFKALVK 201
Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
+G+ +M +YN+VNG P+C L K R EWGF GY SDC A+ T T
Sbjct: 202 EGRVEGVMGAYNRVNGEPSCASEKLMGKLR-EWGFDGYFVSDCGAIRDFHTNHKITDTAP 260
Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
SAA LKAG D+NCG L H +A+++G + ++DI A ++ ++RLG + +
Sbjct: 261 QSAAMALKAGCDVNCGNTYL-HILAALEEGLITKQDIRTACIHALRTRIRLGQLDDN--- 316
Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
++ L D + +K L+L+AA + +VLL ND LPL+K+ +SS+A+IGP ++ +
Sbjct: 317 -EFDDLPFDIIACDGNKALSLEAAEKSMVLLHND-GILPLDKSRISSIAVIGPNADSRAA 374
Query: 465 MGGGYTGIPCSPKSLLRGLE-AYVSKTHYASGCH---------DVPCNSDAGFHEAVRIA 514
+ G Y G P + L G++ A+ + +YA GC +P + + EAV
Sbjct: 375 LLGNYEGTPDRSVTFLEGIQDAFDGRVYYAEGCQLFRDRTQGLALPGDR---YAEAVAAC 431
Query: 515 KKADFVIVVAGLDLTQETED-------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTG 567
+ AD +V GLD T E E+ D+ L LP Q L+ + T K P+I+VL
Sbjct: 432 EAADVTVVCVGLDSTLEGEEGDTENKSGDKPDLRLPEVQRVLLQKLKDTGK-PLIIVLAA 490
Query: 568 GGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM 627
G ++ E ++ I++ W YPG+ G KALAEI+FG+ +P G+LP+T+Y +
Sbjct: 491 GSSVNTE-CEGNALINA--W--YPGQYGGKALAEILFGEVSPSGKLPVTFYKSADMLPDF 545
Query: 628 NDMNMRADSSRQYPGRSYRFYTGTQ--VYGFGHGLSYTNYSYKFLSAPSELTISASLKAG 685
D +M+ R+YRF +Y FG+GL+Y+++
Sbjct: 546 TDYSMK--------NRTYRFCDDESNVLYPFGYGLTYSHF-------------------- 577
Query: 686 SDKNILQQTGSRLDYVHIDEVTSCTSLRFH---VQISVTNAGDVDGSHVVMLFARVPKVS 742
C + + + ++VTN G V+ ++ R
Sbjct: 578 ----------------------ECGDISYKDNTLAVNVTNTGSRSAEDVLQVYIR---SE 612
Query: 743 QGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
G L F+RV S+ IS + P + +G R + G + L G
Sbjct: 613 NGVKNHSLCAFERVSLFDGESRTISINI-PEGAFETVDDNGVRAVRSGRYTLYAG 666
>gi|291544853|emb|CBL17962.1| Beta-glucosidase-related glycosidases [Ruminococcus champanellensis
18P13]
Length = 697
Score = 357 bits (916), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 224/634 (35%), Positives = 336/634 (52%), Gaps = 87/634 (13%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ N SL+ RA+ L LT++E+ QL +A IPRLGIPAY WW+E LHG+A G
Sbjct: 9 YLNPSLTPDERAEDLADRLTVEEQASQLRYDALPIPRLGIPAYNWWNEGLHGVARAG--- 65
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ--------AGLTFW 165
+ T FPQ + AA+F+ +L IG A EARA + + GLT W
Sbjct: 66 -------TATMFPQAIGMAATFDTALLHQIGEITATEARAKHMAAREHGDFDIYKGLTLW 118
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
APNIN+FRDPRWGRG ET GEDP + + V FVK QGE +VLK
Sbjct: 119 APNINLFRDPRWGRGHETYGEDPFLTARLGVAFVKGMQGEG---------------KVLK 163
Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
+AC KH + + R+SF+A ++ +D E+++ P F + + +
Sbjct: 164 A-------------AACAKH---FAVHSGPEALRHSFDAQVSPKDLEESYLPAFHALVAE 207
Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHED 345
K +M +YN+VNG P+C L K ++WGF GY SDC A+ ++ TK +
Sbjct: 208 AKVEGVMGAYNRVNGEPSCASPMLMDKL-HQWGFAGYFVSDCWAIQDFHKHHGVTKNVTE 266
Query: 346 SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG 405
SAA L+ G D+NCG L + +A+++G + DI RA + + ++RLGLF+ +P
Sbjct: 267 SAALALRTGCDLNCGNTYL-YVLAALEEGLIDAADIRRACIRVLRTRIRLGLFDPEPH-- 323
Query: 406 KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQM 465
+ D + + HK ++L A + +VLLKND LPL+ + + ++A+IGP ++ + +
Sbjct: 324 -FAACTYDTIASPAHKAVSLSCAEKSMVLLKND-GILPLDLSKLHAIAVIGPNADSRAAL 381
Query: 466 GGGYTGIPCSPKSLLRGLE-AYVSKTHYASGCH---DVPCN---SDAGFHEAVRIAKKAD 518
G Y G + L G++ A+ + HYA GCH D N +D + EA+ A+ +D
Sbjct: 382 EGNYCGTADRYVTFLEGIQDAFPGRVHYAQGCHLYKDRTSNLAMADDRYAEALAAAEASD 441
Query: 519 FVIVVAGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGG 569
VI+ GLD T E E+ D+ L LP Q L+ + K PVILVL G
Sbjct: 442 VVILCLGLDATLEGEEGDTGNEFSSGDKADLRLPPPQCKLLEKLHAVGK-PVILVLAAGS 500
Query: 570 PLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MN 628
L+ + +++L YPG+ G +ALA I+FG +P G+LP+T+Y E+ ++P
Sbjct: 501 ALNPEIS-----CNAVLQAWYPGQCGGQALAHILFGKVSPSGKLPVTFY-ETAEQLPDFT 554
Query: 629 DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
D +M+ R+YR+ +Y FG+GL+Y
Sbjct: 555 DYSMQ--------NRTYRYARNNVLYPFGYGLTY 580
>gi|451851086|gb|EMD64387.1| glycoside hydrolase family 3 protein [Cochliobolus sativus ND90Pr]
Length = 763
Score = 357 bits (916), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 240/792 (30%), Positives = 376/792 (47%), Gaps = 70/792 (8%)
Query: 16 FRVLQLIVVVNVIAFSNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQ 75
L VV+ S S + PD C + C+ + RA +L++ + Q
Sbjct: 1 MHFLSTAVVLGASLISKSHAAIG-PD--CTNGPLSTNAICDVNAPPHERAAALVAAMEPQ 57
Query: 76 EKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASF 135
EK+ L + + RLG+PAY WW E+LHG+A PG+ F + TSFP ++ +A+F
Sbjct: 58 EKLDNLVSKSKGVSRLGLPAYNWWGEALHGVAG-APGIKFVEPYKNATSFPMPILMSAAF 116
Query: 136 NRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYA 195
+ L I + + EARA N G A + +W P+IN RD RWGR E+PGED + Y
Sbjct: 117 DDDLIFKIANIIGNEARAFGNGGVAPVDYWTPDINPVRDIRWGRASESPGEDIRRIKGYT 176
Query: 196 VEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG 255
+ +G+ +++++ A CKH + YD+E WG
Sbjct: 177 KALLAGLEGDQ------------AQRKII----------------ATCKHYVGYDMEAWG 208
Query: 256 NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-R 314
+ R++F+A IT QD + + PPF+ C K MCSYN VNG+P C + Q R
Sbjct: 209 GYDRHNFSAKITMQDLAEYYMPPFQQCARDSKVGSFMCSYNAVNGIPTCADTYVLQTILR 268
Query: 315 NEWGF---KGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAI 371
+ W + YITSDC+AVA I E Y +T A GMD++C A
Sbjct: 269 DHWNWTDSNNYITSDCEAVADISENHKYVETLAQGTALAFAKGMDLSCEYTGSSDIPGAW 328
Query: 372 DKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQG 431
+G + ID+AL + + G F+G K Y L D+ T E ++L+L +G
Sbjct: 329 AQGLLNISVIDKALTRQYEGLVHAGYFDG--AKATYANLSYKDINTPEARQLSLQVTSEG 386
Query: 432 IVLLKNDKKF-LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKT 490
+V+LKND LPL K S +A+IG N+ S++ G Y+G P S + E
Sbjct: 387 LVMLKNDHTLPLPLTKG--SKVAMIGFWANDSSKLQGIYSGPPPYRHSPVFAGEQMGLDM 444
Query: 491 HYASG----CHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQ 546
A G VP N A+ A+K+D+++ G D T E DR ++ P Q
Sbjct: 445 AIAWGPMIQNSSVPDNWTT---NALDAAEKSDYILYFGGQDWTVAQEGYDRTTISFPQVQ 501
Query: 547 MSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD 606
+ L+T +A+ K V++ L G D S + ++SI+W +PG+ G A+ ++ G
Sbjct: 502 IDLLTKLAKLGKPLVVITL--GDMTDHSPLLSMEGVNSIIWANWPGQDGGPAILNVVSGA 559
Query: 607 FNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYS 666
P GRLP+T YP + K+ M DMN+R + + PGR+YR++ + V FG GL YT +
Sbjct: 560 HAPAGRLPITEYPADYVKLSMLDMNLRPHT--ESPGRTYRWFNES-VQPFGFGLHYTTFE 616
Query: 667 YKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDV 726
F SE ++ ++ D G Y + EV ++++V N G+
Sbjct: 617 ASFA---SEEGLTYDIEEILD-------GCTQQYKDLCEVAP-------LEVTVANKGNR 659
Query: 727 DGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRI 786
V + F + + P K LI + R+ + G+K+ + +L+ ++ G +
Sbjct: 660 TSDFVALAFIKGEVGPKPYPLKTLITYGRLRDIHGGAKKSASLPLTLGELARVDQSGNTV 719
Query: 787 LPLGNHVLMVGE 798
+ G + L++ E
Sbjct: 720 IYPGEYTLLLDE 731
>gi|317474362|ref|ZP_07933636.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
gi|316909043|gb|EFV30723.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
Length = 723
Score = 357 bits (916), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 252/780 (32%), Positives = 378/780 (48%), Gaps = 136/780 (17%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
P+ N SLS + RA L+S LTL+EKI + +N+SA+ RLGI YEWW+E+LHG+A NG
Sbjct: 24 PYQNKSLSPTERAADLVSRLTLEEKITLMQNNSSAVKRLGIKPYEWWNEALHGVARNG-- 81
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ--------AGLTF 164
T +PQ + ASFN +L + ++++ EAR Y + GLTF
Sbjct: 82 --------LATVYPQAIGMGASFNDTLLYQVFTSISDEARVKYRQAREAGNYKRYTGLTF 133
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQG-ENWKSDDGGIGFGFREKRV 223
W PNINIFRDPRWGRGQET GEDP + S + V QG +N K +
Sbjct: 134 WTPNINIFRDPRWGRGQETYGEDPYLTSRMGLSVVNGLQGPQNTKYNK------------ 181
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSC 282
AC KH + +W +R+SFNA I +D +T+ P F+
Sbjct: 182 ---------------THACAKHYAVHSGPEW---NRHSFNAENINPRDLWETYLPAFQDL 223
Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTK 341
+ QG +MC+YN+ G P C L RNEW +KG + SDC A+ F ++ +
Sbjct: 224 VIQGNVKEVMCAYNRFEGDPCCGSDRLLINILRNEWNYKGLVVSDCGAIDN-FYFKGRHE 282
Query: 342 THE---DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
TH+ D++A + +G D+ CG SA+ +G + E ID++L L + LG
Sbjct: 283 THKNKADASAAAVLSGTDLECGRSYTGLI-SAVKEGLINESAIDQSLCRLMKARFELGEM 341
Query: 399 NGDPRKGKYGKLGPDDVCTSE-HKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
+ + +L PD + + H++LAL AR+ + LL+N K LPL+K ++A+IGP
Sbjct: 342 DD---TTPWDQL-PDSLLSCHAHQQLALQMARESMTLLQNHKNILPLDKEM--TVALIGP 395
Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYVSK---THYASGCHDVPCNSDAGFH-----E 509
N+ Y G P +LL GL Y+ + + +V ++
Sbjct: 396 NANDSVMQWANYNGFPVHTITLLEGLTQYLPQERLIYIPQKNIEVQKYPWVNYYPNDIQA 455
Query: 510 AVRIAKKADFVIVVAGLDLTQETEDRD----------RVSLLLPGQQMSLVTSVARTSKR 559
+ A KAD +I G+ + E E+ D R ++ LP Q LV ++ T K
Sbjct: 456 VINQAAKADVIIYAGGISASLEGEEMDVDAEGFRGGDRTTIELPNVQRKLVKALKATGK- 514
Query: 560 PVILVLTGGGPLDVSFAEADSQI-SSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWY 618
P++ V G + + + +SQI +IL YPG+AG A+AE++FGD+NP GRLP+T+Y
Sbjct: 515 PIVFVNFSGCAMGL---QPESQICDAILQAWYPGQAGGTAIAEVLFGDYNPAGRLPITFY 571
Query: 619 PESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTI 678
+ D NM+ GR+YR+ +Y FGHGLSYT +SY S P
Sbjct: 572 KKDNQLPDFEDYNMQ--------GRTYRYLNYEPLYPFGHGLSYTTFSY---STP----- 615
Query: 679 SASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARV 738
+ G +++ VTN+G+ +G V+ L+ +
Sbjct: 616 ------------FIENGK-------------------LKVKVTNSGNYNGDEVIQLYIKR 644
Query: 739 PKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPL-GNHVLMVG 797
G P K L GF R+H A + E+SF + + + +K + PL G + ++VG
Sbjct: 645 YDDPDG-PLKTLRGFQRIHIPAGQTSEVSFPLT-SDTFTWWDKDSNTVHPLQGRYKILVG 702
>gi|383117083|ref|ZP_09937830.1| hypothetical protein BSHG_0813 [Bacteroides sp. 3_2_5]
gi|251947612|gb|EES87894.1| hypothetical protein BSHG_0813 [Bacteroides sp. 3_2_5]
Length = 722
Score = 357 bits (916), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 229/745 (30%), Positives = 365/745 (48%), Gaps = 97/745 (13%)
Query: 56 NTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF 115
+ S ++ R ++LI +TL EK+ QL + +IPRL +PAY +W+E LHG+A G
Sbjct: 53 DLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAG----- 107
Query: 116 NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDP 175
VT FPQ + A++++ L + SA++ EAR Y GLT+W+P IN+ RDP
Sbjct: 108 -----EVTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDP 162
Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
RWGR +ET GEDP + S V FVK QG D
Sbjct: 163 RWGRNEETYGEDPHLTSRLGVAFVKGLQG---------------------------DHPT 195
Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
L A KH +A + E +R+S ++ I + + + P + +C+++ A +M +Y
Sbjct: 196 YLKTVATIKHFVANNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAY 251
Query: 296 NQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
N NGVP L R EWGF G++ SDC A+ + + E++AA + +G
Sbjct: 252 NAFNGVPPSGSHWLLDDVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSG 311
Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK-GKYGKLGPD 413
D+ CGT A+++G + E IDRAL + + + +LG F DP + Y
Sbjct: 312 CDLECGTTYKEKLVQAVEQGLISEVAIDRALTRVLTARFKLGEF--DPMELVPYNHYDKK 369
Query: 414 DVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIP 473
+ + +LA +AA + +VLLKND LPLNK + S+A++GP + GGY+G P
Sbjct: 370 LLAGKKFAELAYEAAVKSVVLLKND-ALLPLNKEKIKSVAVVGPFAD--YNYLGGYSGQP 426
Query: 474 CSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETE 533
SLL+G++ + K + + + ++D+ ++ K AD V+V G D E
Sbjct: 427 PYSVSLLKGVKELIGKKGKVTYLNGMGTSADS----IAQVVKGADIVLVALGSDEKMARE 482
Query: 534 DRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGE 593
+ D S+ LP +Q L+ + + + R ++LV G PL + AD+ I +I+ YPG+
Sbjct: 483 NHDMPSIYLPEEQEKLLKKIYQVNPR-IVLVFHTGNPLTSEW--ADTHIPAIMQAWYPGQ 539
Query: 594 AGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQV 653
+ALA ++FG+ NP G+LPMT Y + D +M + GR+YR+ G +
Sbjct: 540 EAGRALANLLFGNENPSGKLPMTIYKTEEQLPDILDFDM-------WKGRTYRYMKGEPL 592
Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
YGFGHGLSYT++ + + D + D + C+
Sbjct: 593 YGFGHGLSYTSFEFDNIQGN-------------------------DTLQPDAILQCS--- 624
Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEKQLIGFDRVHTVAKGSKEISFGVDP 772
+ ++N+G + G VV ++ T P K+L+ F +V + K++ F + P
Sbjct: 625 ----VELSNSGQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTIAP 680
Query: 773 CEQLSIANKHGRRILPLGNHVLMVG 797
E LS+ R+L G + L +G
Sbjct: 681 RE-LSVWEDGKWRMLS-GKYTLFIG 703
>gi|443695317|gb|ELT96258.1| hypothetical protein CAPTEDRAFT_179825 [Capitella teleta]
Length = 750
Score = 357 bits (915), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 249/782 (31%), Positives = 380/782 (48%), Gaps = 107/782 (13%)
Query: 16 FRVLQLIVVVNVIAFSNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQ 75
R L++ VV + + S N F DS+PF N SL I TR LIS LT++
Sbjct: 1 MRALRVTVVCLTLISTASFSQDNVLRFAPSSHALDSFPFRNVSLPIETRLNDLISRLTIE 60
Query: 76 EKIQQLSDN----ASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVS 131
+ I Q I RLGI E+ +E L G+ + T FPQ L
Sbjct: 61 DAINQTVARYGKFTPGIERLGIKPIEYITECLRGVRR-----------ENATGFPQALGL 109
Query: 132 AASFNRSLWSNIGSAVAVEARAMYN-------LGQAGLTFWAPNINIFRDPRWGRGQETP 184
AASF+R L + +AV+VE RA YN G G+T ++P INI R P WGR QET
Sbjct: 110 AASFSRDLMQRVATAVSVEVRAFYNHDIQRETYGAHGITCFSPVINILRHPLWGRNQETY 169
Query: 185 GEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCK 244
GEDP + A ++V QG++ + L +SA CK
Sbjct: 170 GEDPYLSGELASQYVSGLQGDDPRY---------------------------LRVSAGCK 202
Query: 245 HLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC 304
H A+ ++ F+A I E+D + TF P F+ CI K +MCS+N +NGVP+C
Sbjct: 203 HFDAHGGPDTIPVRKFGFDAKIEERDLQMTFLPAFKKCI-AAKPYNVMCSFNSINGVPSC 261
Query: 305 LRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDIN-CGTC 362
L R +WG++G++ SD AV IF +Y + E +A +K+G ++ G
Sbjct: 262 ANKRLLTDVLRAQWGYEGFVVSDDAAVEYIFTEHHYNSSFETAAVEAIKSGCNMELVGKF 321
Query: 363 MLRHTQ--SAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEH 420
+ Q A+++ + + ++ + +F + LG F+ P + ++ D V ++EH
Sbjct: 322 DPSYWQLTKALNEHLITKDELMENVRPVFLTRFLLGEFD-PPALNPFNQITKDVVLSAEH 380
Query: 421 KKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKS 478
++LAL+AA + VLLKND+ FLPL KN++ ++A++GP+ N + G Y+ P +
Sbjct: 381 QRLALEAAVKSFVLLKNDRNFLPLLKNSLKTVAVVGPMSNYTDGLIGDYSTDTDPSLILT 440
Query: 479 LLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRV 538
L G++ +ASGC + C +D + A V V G E E+ DR
Sbjct: 441 PLHGIKKLAPNVQFASGCSNSTC-TDYRATDVAAAVDGAQVVFVALGTGFIVEAENNDRS 499
Query: 539 SLLLPGQQMSLVT-SVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAK 597
++LPG Q+ L+ +V + RPV+L+L GGPLDV+FA+ S I SI+ +P +
Sbjct: 500 DIVLPGAQLQLLKDAVYHANGRPVVLLLFNGGPLDVTFAQLTSGIVSIVECFFPAMMTGE 559
Query: 598 ALAEIIF---GDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQV 653
A+ ++ G +P GRLP+TW P +VP + D M+ GR+YR+YT +
Sbjct: 560 AIYRMLINNEGISSPAGRLPLTW-PAYLNQVPNITDYTMK--------GRTYRYYTEDPL 610
Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
Y FG+GLSYT + Y S+L ++ EVT +R
Sbjct: 611 YPFGYGLSYTQFKY------SDLKVTPL-----------------------EVTKGQEIR 641
Query: 714 FHVQISVTNAG--DVDGSHVVMLFARV--PKVSQGTPEKQLIGFDRVHTVAKGSKEISFG 769
V++ VTN G D D ++++ A V PK P QL+ FDR+H + S+ +
Sbjct: 642 --VKVKVTNIGLYDADEVRIIVVQAYVSWPKTEIPVPRWQLVAFDRIHIASGKSETVELT 699
Query: 770 VD 771
++
Sbjct: 700 IE 701
>gi|268610157|ref|ZP_06143884.1| glycoside hydrolase family 3 protein [Ruminococcus flavefaciens
FD-1]
Length = 690
Score = 356 bits (914), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 252/775 (32%), Positives = 371/775 (47%), Gaps = 140/775 (18%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ + SLS RA+ L + LTL+E+ QL +A A+ RL IPAY WWSE LHG+A G
Sbjct: 4 YKDKSLSAQERAEDLTNRLTLEEQASQLKYDAPAVDRLDIPAYNWWSEGLHGVARAG--- 60
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
+ T FPQ + AA F+ + +GS + EARA YN A GL W
Sbjct: 61 -------TATMFPQAIGLAAMFDEEAMNKVGSIIGDEARAKYNEYSAHGDHDIYKGLCLW 113
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
+PN+NIFRDPRWGRGQET GEDP + + V F K QGE
Sbjct: 114 SPNVNIFRDPRWGRGQETYGEDPYLTTRLGVAFAKGLQGE-------------------- 153
Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
G+ L +AC KHL + + R+ F+A+ + +D E+T+ P F + +++
Sbjct: 154 --------GEVLKTAACAKHLAVHSGPE---AIRHEFDAVASPKDMEETYLPAFEALVKE 202
Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHED 345
K +M +YN+VNG PAC L K +EWGF GY SDC A+ TKT +
Sbjct: 203 AKVEGVMGAYNRVNGEPACASKFLMGKL-DEWGFDGYFVSDCWAIRDFHTNHMVTKTAPE 261
Query: 346 SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG 405
SAA LK G D+NCG L H A ++G + ++DI +A +L ++RLG+F+ +
Sbjct: 262 SAAMALKLGCDLNCGNTYL-HLLHAYNEGLINDEDIKKACTHLMRTRVRLGMFDDET--- 317
Query: 406 KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQM 465
+Y KL V E+K A + + +V+LKN+ LPL+ + + ++ +IGP ++ +
Sbjct: 318 EYDKLDYSIVANEENKAYARKCSERSMVMLKNN-GILPLDPSKIKTIGVIGPNADSRPAL 376
Query: 466 GGGYTGIPCSPKSLLRGLE-AYVSKTHYASGCHDVP------CNSDAGFHEAVRIAKKAD 518
G Y G + L G++ A+ + Y+ G H +D EA + + +D
Sbjct: 377 EGNYNGRADRYITFLEGIQDAFGGRVLYSEGSHLYKDRCMGLAVADDRLSEAEIVTEHSD 436
Query: 519 FVIVVAGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGG 569
V++ GLD T E E+ D+ L LP Q LV +V R K PVI+V G
Sbjct: 437 VVVLCVGLDATIEGEEGDTGNEFSSGDKNDLRLPEAQRKLVETVMRKGK-PVIIVTAAGS 495
Query: 570 PLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MN 628
++V EAD W YPG+ G ALA+I+FG +P G+LP+T+Y ++ TK+P
Sbjct: 496 AINV---EADCDALIHAW--YPGQFGGTALADILFGKISPSGKLPVTFYTDT-TKLPEFT 549
Query: 629 DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDK 688
D +M+ GR+YR+ +Y FG+GL+Y+
Sbjct: 550 DYSMK--------GRTYRYTQDNILYPFGYGLTYSK------------------------ 577
Query: 689 NILQQTGSRLDYVHIDEVTSCTSLRFH---VQISVTNAGDVDGSHVVMLFARVPKVSQGT 745
T + L+F + VTN GD D VV + + + S
Sbjct: 578 ------------------TEVSDLKFENGKASVKVTNTGDFDTEDVVQFYIK-GEGSDYV 618
Query: 746 PEKQLIGFDRVHTVAKGSK---EISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
P L GF RV + KG E++ G E + +++GRR +L G
Sbjct: 619 PFYSLCGFRRVF-LKKGESTVVEVTLGDSAFEAV---DENGRRSRSAKGFILYAG 669
>gi|423281966|ref|ZP_17260851.1| hypothetical protein HMPREF1204_00389 [Bacteroides fragilis HMW
615]
gi|404582453|gb|EKA87147.1| hypothetical protein HMPREF1204_00389 [Bacteroides fragilis HMW
615]
Length = 722
Score = 356 bits (914), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 229/745 (30%), Positives = 364/745 (48%), Gaps = 97/745 (13%)
Query: 56 NTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF 115
+ S ++ R ++LI +TL EK+ QL + +IPRL +PAY +W+E LHG+A G
Sbjct: 53 DLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAG----- 107
Query: 116 NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDP 175
VT FPQ + A++++ L + SA++ EAR Y GLT+W+P IN+ RDP
Sbjct: 108 -----EVTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDP 162
Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
RWGR +ET GEDP + S V FVK QG D
Sbjct: 163 RWGRNEETYGEDPHLTSRLGVAFVKGLQG---------------------------DHPT 195
Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
L A KH +A + E +R+S ++ I + + + P + +C+++ A +M +Y
Sbjct: 196 YLKTVATIKHFVANNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAY 251
Query: 296 NQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
N NGVP L R EWGF G++ SDC A+ + + E++AA + +G
Sbjct: 252 NAFNGVPPSGSHWLLDDVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSG 311
Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK-GKYGKLGPD 413
D+ CGT A+++G + E IDRAL + + + +LG F DP + Y
Sbjct: 312 CDLECGTTYKEKLVQAVEQGLISEAAIDRALTRVLTARFKLGEF--DPMELVPYNHYDKK 369
Query: 414 DVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIP 473
+ + +LA +AA + +VLLKND LPLNK + S+A++GP + GGY+G P
Sbjct: 370 LLAGKKFAELAYEAAVKSVVLLKND-ALLPLNKEKIKSVAVVGPFAD--YNYLGGYSGQP 426
Query: 474 CSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETE 533
SLL+G++ + K + + + ++D+ ++ K AD V+V G D E
Sbjct: 427 PYSVSLLKGVKELIGKKGKVTYLNGMGTSADS----IAQVVKGADIVLVALGSDEKMARE 482
Query: 534 DRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGE 593
+ D S+ LP +Q L+ + + + R + LV G PL + AD+ I +I+ YPG+
Sbjct: 483 NHDMPSIYLPEEQEKLLKEIYQVNPR-IALVFHTGNPLTSEW--ADTHIPAIMQAWYPGQ 539
Query: 594 AGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQV 653
+ALA ++FG+ NP G+LPMT Y + D +M + GR+YR+ G +
Sbjct: 540 EAGRALANLLFGNENPSGKLPMTIYKTEEQLPDILDFDM-------WKGRTYRYMKGEPL 592
Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
YGFGHGLSYT++ + + D + D + C+
Sbjct: 593 YGFGHGLSYTSFEFDNIQGN-------------------------DTLQPDAILQCS--- 624
Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEKQLIGFDRVHTVAKGSKEISFGVDP 772
+ ++N+G + G VV ++ T P K+L+ F +V + K++ F + P
Sbjct: 625 ----VELSNSGQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTIAP 680
Query: 773 CEQLSIANKHGRRILPLGNHVLMVG 797
E LS+ R+L G + L +G
Sbjct: 681 RE-LSVWEDGKWRMLS-GKYTLFIG 703
>gi|402308386|ref|ZP_10827395.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
sp. MSX73]
gi|400375830|gb|EJP28725.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
sp. MSX73]
Length = 721
Score = 356 bits (914), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 240/764 (31%), Positives = 365/764 (47%), Gaps = 121/764 (15%)
Query: 61 ISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVS 120
+S AK +I+ +T+ EKI QL + + AI LGI Y+WWSE LHG+ +G
Sbjct: 30 MSRHAKEIIARMTVSEKISQLMNESPAIEHLGIKPYDWWSEGLHGVGRDG---------- 79
Query: 121 SVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ--------AGLTFWAPNINIF 172
T FPQ + A+F+ +L IG AVA E RA +N+ Q AGLTFW+PN+NIF
Sbjct: 80 RATVFPQPIALGATFDEALVREIGDAVATEGRAKFNVAQKLKNYSRNAGLTFWSPNVNIF 139
Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
RDPRWGRG ET GEDP++ +V+ QG D
Sbjct: 140 RDPRWGRGMETYGEDPLLSGMLGTAYVRGLQG---------------------------D 172
Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
L AC KH Y + +R+ + + +D +T+ P F+ ++QG+ +M
Sbjct: 173 DAFYLKTGACAKH---YAVHSGPEGTRHEADIHPSRRDLFETYLPQFKMLVQQGRVEAVM 229
Query: 293 CSYNQVNGVPA----CLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAA 348
+YN+V G P L D+ +K+ WGF G+I SDCDA+ + Y KT E++ A
Sbjct: 230 SAYNRVYGEPCGGSKYLLTDILRKS---WGFNGHIVSDCDAINDFYGGHRYVKTPEEACA 286
Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
+KAG+++ CG + Q A+D+G + E D+DRAL L +L+LG+ D Y
Sbjct: 287 AAIKAGLNVECGHT-FKAMQGALDQGLLAEADLDRALFPLVMTRLKLGILEPD-SACPYN 344
Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
++C+ H LAL AA + +VLLKN+ LPL+KN + +L + GP ++ + G
Sbjct: 345 SYDESEICSPAHTALALRAADEAMVLLKNN-GILPLDKN-IRTLFVAGPGASDAFYLMGN 402
Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPC-----NSDAGFHEAVRIAKKADFVIVV 523
Y G+ + L+G+ VS+ + + P + AV A A+ IVV
Sbjct: 403 YFGLSNRYSTYLQGI---VSRVSSGTSVNFRPAFMQITEELNDMNWAVNEACAAEVAIVV 459
Query: 524 AGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
G + E E+ DRV + LP Q++ + V +++VLTGG P+D+
Sbjct: 460 MGNNGNMEGEEGEAIASASRGDRVGIGLPASQLNYLRRVKARKGGRIVVVLTGGSPIDLR 519
Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMR 633
+ + W YPG+ G +AL +++FGD N GRLP+T +P +P +D +M
Sbjct: 520 KISKLADAVVMAW--YPGQEGGEALGDLLFGDKNFSGRLPIT-FPADVDSLPAFDDYSMN 576
Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
GR+Y++ +G +Y FG+GLSY +Y ++ + +K G
Sbjct: 577 --------GRTYKYMSGNVMYPFGYGLSYGRVTY------TDARVVGRIKKGEP------ 616
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
V++ +TN GD V + P +G+P L+GF
Sbjct: 617 --------------------LAVEVVLTNNGDRTIDEVAQAYIATPTAGKGSPMASLVGF 656
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
RV K S + F + P E+L G L GN+ L +G
Sbjct: 657 RRVSIPPKSSVKAVFKIVP-ERLMTIQSDGSSKLLKGNYTLTIG 699
>gi|319641744|ref|ZP_07996426.1| beta-glucosidase [Bacteroides sp. 3_1_40A]
gi|317386631|gb|EFV67528.1| beta-glucosidase [Bacteroides sp. 3_1_40A]
Length = 702
Score = 356 bits (914), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 239/763 (31%), Positives = 361/763 (47%), Gaps = 115/763 (15%)
Query: 64 RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
R K L++ LTL+EK+ + ++ AIPRLGIPAY+WW+E+LHG+A T+ VT
Sbjct: 2 RVKDLVARLTLEEKVLLMQHHSPAIPRLGIPAYDWWNEALHGVAR---------TLEKVT 52
Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN----LGQAG-----LTFWAPNINIFRD 174
FPQ + AA+F+ +G + E RA++N G+ G LT+W PNINIFRD
Sbjct: 53 VFPQAIGMAATFDTEALQKMGDITSTEGRALFNEDWKAGKTGTRYRGLTYWTPNINIFRD 112
Query: 175 PRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRG 234
PRWGRGQET GEDP + + V+ +GE + LK
Sbjct: 113 PRWGRGQETYGEDPYLTAKMGAAIVRGLEGE--------------DPHYLKSV------- 151
Query: 235 DELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCS 294
AC KH Y + ++R+SF+A + D DT+ P FR + + K +MC+
Sbjct: 152 ------ACAKH---YAVHSGPEYNRHSFDARPSVFDLWDTYMPAFRELVTKAKVHGVMCA 202
Query: 295 YNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKA 353
YN++NG P C L RN+W F GY+TSDC A+ E+ H + + L A
Sbjct: 203 YNRLNGQPCCGNDPLLVDILRNQWHFDGYVTSDCWALKDFAEFHKTHPEHTIAMSDALLA 262
Query: 354 GMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDP-RKGKYGKLGP 412
G D+ CG + + KG E+DI+ +L LF++ ++G+F DP + Y +G
Sbjct: 263 GTDLECGN-LYHLLAEGVKKGLHSERDINVSLSRLFTILFKIGMF--DPAERVPYSSIGR 319
Query: 413 DDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGI 472
+ + HK+ A A++ IVLL+N LPL+ + + S+A+IGP +N Y G
Sbjct: 320 EVLECEAHKQHAERMAKESIVLLENKNHILPLDASKIKSIALIGPNADNGQTQLANYFGT 379
Query: 473 P---CSPKSLLRGLEAYVSKTHYASGCHDVPCNSDA-GFHEAVRIAKKADFVIVVAGLDL 528
P +P L+ K +Y G V DA F + A ++D ++ V+G+
Sbjct: 380 PSEIVTPYMSLKRRLGDKIKINYLPGVGIVDKLKDAPSFVQVAHKAAQSDVIVFVSGISA 439
Query: 529 TQE-------------TEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
E DR ++ LP Q+ L+ + +T RP+I+V G + SF
Sbjct: 440 DYEGEAGDAGAAGYGGFASGDRTTMQLPLVQIELLKKLKKTG-RPLIIVNMSGSVM--SF 496
Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRAD 635
++L Y G+A A+ +++FG NP GR+P+T Y P + +M
Sbjct: 497 EWESQNADALLQAWYGGQAAGDAIVDVLFGHCNPAGRMPLTTYKSDNDLPPFENYSML-- 554
Query: 636 SSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTG 695
GR+YR++ G Y FG+GLSYT ++Y
Sbjct: 555 ------GRTYRYFKGEPRYPFGYGLSYTTFAYS--------------------------- 581
Query: 696 SRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQG-TPEKQLIGFD 754
D +DE + + R V ++V+N GD DG VV L+ P+ + P L GF
Sbjct: 582 ---DVQCVDETHTGDTAR--VTVTVSNTGDCDGDEVVQLYVVHPQDGRKQIPLCALKGFK 636
Query: 755 RVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
R+H S +SF + P E+L++ G + G L VG
Sbjct: 637 RIHLKRGESTSVSFTLTP-EELALTETDGNLVEKNGQVTLFVG 678
>gi|288924872|ref|ZP_06418809.1| beta-glucosidase [Prevotella buccae D17]
gi|288338659|gb|EFC77008.1| beta-glucosidase [Prevotella buccae D17]
Length = 721
Score = 356 bits (913), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 240/764 (31%), Positives = 366/764 (47%), Gaps = 121/764 (15%)
Query: 61 ISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVS 120
+S AK +I+ +T+ EKI QL + + AI LGI Y+WWSE LHG+ +G
Sbjct: 30 MSRHAKEIIARMTVSEKISQLMNESPAIEHLGIKPYDWWSEGLHGVGRDG---------- 79
Query: 121 SVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ--------AGLTFWAPNINIF 172
T FPQ + A+F+ +L IG AVA E RA +N+ + AGLTFW+PN+NIF
Sbjct: 80 RATVFPQPIALGATFDEALVREIGDAVATEGRAKFNVARKLKNYSRNAGLTFWSPNVNIF 139
Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
RDPRWGRG ET GEDP++ +V+ QG D
Sbjct: 140 RDPRWGRGMETYGEDPLLSGMLGTAYVRGLQG---------------------------D 172
Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
L AC KH Y + +R+ + + +D +T+ P F+ ++QG+ +M
Sbjct: 173 DAFYLKTGACAKH---YAVHSGPEGTRHEADIHPSRRDLFETYLPQFKMLVQQGRVEAVM 229
Query: 293 CSYNQVNGVPA----CLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAA 348
+YN+V G P L D+ +K+ WGF G+I SDCDA+ + Y KT E++ A
Sbjct: 230 SAYNRVYGEPCGGSKYLLTDILRKS---WGFNGHIVSDCDAINDFYGGHRYVKTPEEACA 286
Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
+KAG+++ CG + Q A+D+G + E D+DRAL L +L+LG+ D Y
Sbjct: 287 AAIKAGLNVECGHT-FKAMQGALDQGLLAEADLDRALFPLVMTRLKLGILEPD-SACPYN 344
Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
++C+ H LAL AA + +VLLKN+ LPL+KN + +L + GP ++ + G
Sbjct: 345 SYDESEICSPAHTALALRAADEAMVLLKNN-GILPLDKN-IRTLFVAGPGASDAFYLMGN 402
Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPC-----NSDAGFHEAVRIAKKADFVIVV 523
Y G+ + L+G+ VS+ + + P + AV A A+ IVV
Sbjct: 403 YFGLSNRYSTYLQGI---VSRVSSGTSVNFRPAFMQITEELNDMNWAVNEACAAEVAIVV 459
Query: 524 AGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
G + E E+ DRV + LP QM+ + V +++VLTGG P+D+
Sbjct: 460 MGNNGNMEGEEGEAIASASRGDRVGIGLPASQMNYLRRVKARKGGRIVVVLTGGSPIDLR 519
Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMR 633
E +++ YPG+ G +AL +++FGD N GRLP+T +P +P +D +M
Sbjct: 520 --EISKLADAVVMAWYPGQEGGEALGDLLFGDKNFSGRLPIT-FPADVDSLPAFDDYSMN 576
Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
GR+Y++ +G +Y FG+GLSY +Y ++ + +K G
Sbjct: 577 --------GRTYKYMSGNVMYPFGYGLSYGRVTY------TDARVVGRIKKGEP------ 616
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
V++ +TN GD V + P +G+P L+GF
Sbjct: 617 --------------------LAVEVVLTNNGDRTIDEVAQAYIATPTAGKGSPMASLVGF 656
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
RV K S + F + P E+L G L GN+ L +G
Sbjct: 657 RRVSIPPKSSVKAVFKIVP-ERLMTIQSDGSSKLLKGNYTLTIG 699
>gi|53712125|ref|YP_098117.1| beta-xylosidase [Bacteroides fragilis YCH46]
gi|52214990|dbj|BAD47583.1| beta-xylosidase [Bacteroides fragilis YCH46]
Length = 722
Score = 356 bits (913), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 229/745 (30%), Positives = 364/745 (48%), Gaps = 97/745 (13%)
Query: 56 NTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF 115
+ S ++ R ++LI +TL EK+ QL + +IPRL +PAY +W+E LHG+A G
Sbjct: 53 DLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAG----- 107
Query: 116 NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDP 175
VT FPQ + A++++ L + SA++ EAR Y GLT+W+P IN+ RDP
Sbjct: 108 -----EVTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDP 162
Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
RWGR +ET GEDP + S V FVK QG D
Sbjct: 163 RWGRNEETYGEDPHLTSRLGVAFVKGLQG---------------------------DHPT 195
Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
L A KH +A + E +R+S ++ I + + + P + +C+++ A +M +Y
Sbjct: 196 YLKTVATIKHFVANNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAY 251
Query: 296 NQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
N NGVP L R EWGF G++ SDC A+ + + E++AA + +G
Sbjct: 252 NAFNGVPPSGSHWLLDDVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSG 311
Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK-GKYGKLGPD 413
D+ CGT A+++G + E IDRAL + + + +LG F DP + Y
Sbjct: 312 CDLECGTTYKEKLVQAVEQGLISEAAIDRALTRVLTARFKLGEF--DPMELVPYNHYDKK 369
Query: 414 DVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIP 473
+ + +LA +AA + +VLLKND LPLNK + S+A++GP + GGY+G P
Sbjct: 370 LLAGKKFAELAYEAAVKSVVLLKND-ALLPLNKEKIKSVAVVGPFAD--YNYLGGYSGQP 426
Query: 474 CSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETE 533
SLL+G++ + K + + + ++D+ ++ K AD V+V G D E
Sbjct: 427 PYSVSLLKGVKELIGKKGKVTYLNGMGTSADS----IAQVVKGADIVLVALGSDEKMARE 482
Query: 534 DRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGE 593
+ D S+ LP Q L+ + + + R ++LV G PL + AD+ I +I+ YPG+
Sbjct: 483 NHDMPSIYLPEGQEKLLKEIYQVNPR-IVLVFHTGNPLTSEW--ADTHIPAIMQAWYPGQ 539
Query: 594 AGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQV 653
+ALA ++FG+ NP G+LPMT Y + D +M + GR+YR+ G +
Sbjct: 540 EAGRALANLLFGNENPSGKLPMTIYKTEEQLPDILDFDM-------WKGRTYRYMKGEPL 592
Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
YGFGHGLSYT++ + + D + D + C+
Sbjct: 593 YGFGHGLSYTSFEFDNIQGN-------------------------DTLQPDAILQCS--- 624
Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEKQLIGFDRVHTVAKGSKEISFGVDP 772
+ ++N+G + G VV ++ T P K+L+ F +V + K++ F + P
Sbjct: 625 ----VELSNSGQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTIAP 680
Query: 773 CEQLSIANKHGRRILPLGNHVLMVG 797
E LS+ R+L G + L +G
Sbjct: 681 RE-LSVWEDGKWRMLS-GKYTLFIG 703
>gi|60680313|ref|YP_210457.1| glycosyl hydrolase [Bacteroides fragilis NCTC 9343]
gi|60491747|emb|CAH06504.1| putative glycosyl hydrolase [Bacteroides fragilis NCTC 9343]
Length = 722
Score = 356 bits (913), Expect = 4e-95, Method: Compositional matrix adjust.
Identities = 228/745 (30%), Positives = 364/745 (48%), Gaps = 97/745 (13%)
Query: 56 NTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF 115
+ S ++ R ++LI +TL EK+ QL + +IPRL +PAY +W+E LHG+A G
Sbjct: 53 DLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAG----- 107
Query: 116 NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDP 175
VT FPQ + A++++ L + SA++ EAR Y GLT+W+P IN+ RDP
Sbjct: 108 -----EVTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDP 162
Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
RWGR +ET GEDP + S V FVK QG D
Sbjct: 163 RWGRNEETYGEDPHLTSRLGVAFVKGLQG---------------------------DHPT 195
Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
L A KH +A + E +R+S ++ I + + + P + +C+++ A +M +Y
Sbjct: 196 YLKTVATIKHFVANNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAY 251
Query: 296 NQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
N NGVP L R EWGF G++ SDC A+ + + E++AA + +G
Sbjct: 252 NAFNGVPPSGSHWLLDDVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSG 311
Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK-GKYGKLGPD 413
D+ CGT A+++G + E IDRAL + + + +LG F DP + Y
Sbjct: 312 CDLECGTTYKEKLVQAVEQGLISEAAIDRALTRVLTARFKLGEF--DPMELVPYNHYDKK 369
Query: 414 DVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIP 473
+ + +LA +AA + +VLLKND LPLNK + S+A++GP + GGY+G P
Sbjct: 370 LLAGKKFAELAYEAAVKSVVLLKND-ALLPLNKEKIKSVAVVGPFAD--YNYLGGYSGQP 426
Query: 474 CSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETE 533
SLL+G++ + K + + + ++D+ ++ K AD V+V G D E
Sbjct: 427 PYSVSLLKGVKELIGKKGKVTYLNGMGTSADS----IAQVVKGADIVLVALGSDEKMARE 482
Query: 534 DRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGE 593
+ D S+ LP +Q + + + + R ++LV G PL + AD+ I +I+ YPG+
Sbjct: 483 NHDMPSIYLPEEQEKFLKKIYQVNPR-IVLVFHTGNPLTSEW--ADTHILAIMQAWYPGQ 539
Query: 594 AGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQV 653
+ALA ++FG+ NP G+LPMT Y + D +M + GR+YR+ G +
Sbjct: 540 EAGRALANLLFGNENPSGKLPMTIYKTEEQLPDILDFDM-------WKGRTYRYMKGEPL 592
Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
YGFGHGLSYT++ + + D + D + C+
Sbjct: 593 YGFGHGLSYTSFEFDNIQGN-------------------------DTLQPDAILQCS--- 624
Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEKQLIGFDRVHTVAKGSKEISFGVDP 772
+ ++N+G + G VV ++ T P K+L+ F +V + K++ F + P
Sbjct: 625 ----VELSNSGQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTIAP 680
Query: 773 CEQLSIANKHGRRILPLGNHVLMVG 797
E LS+ R+L G + L +G
Sbjct: 681 RE-LSVWEDGKWRMLS-GKYTLFIG 703
>gi|326791674|ref|YP_004309495.1| beta-glucosidase [Clostridium lentocellum DSM 5427]
gi|326542438|gb|ADZ84297.1| Beta-glucosidase [Clostridium lentocellum DSM 5427]
Length = 696
Score = 355 bits (912), Expect = 4e-95, Method: Compositional matrix adjust.
Identities = 241/750 (32%), Positives = 356/750 (47%), Gaps = 126/750 (16%)
Query: 64 RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
+AK+L++ +TL+E+ QL ++ AI RLG+PAY WW+E+LHG+A G T
Sbjct: 9 KAKALVAEMTLEERASQLKYDSPAIKRLGVPAYNWWNEALHGVARAG----------VAT 58
Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ--------AGLTFWAPNINIFRDP 175
SFPQ + AA+F+ L + +A E RA YN GLTFW+PN+NIFRDP
Sbjct: 59 SFPQAIGMAATFDDELLKRVAEVIAEEGRAKYNAYSQEGDRDIYKGLTFWSPNVNIFRDP 118
Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
RWGRG ET GEDP + S V FVK QGE +
Sbjct: 119 RWGRGHETYGEDPYLTSRLGVAFVKGLQGE-----------------------------E 149
Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
L +AC KH + + R+ F+A ++++D +T+ P F + +++ + +M +Y
Sbjct: 150 GLKTAACAKH---FAVHSGPEADRHHFDARVSQKDLWETYLPAFEALVKEAEVESVMGAY 206
Query: 296 NQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
N+ NG P C L + R +WGF+G+ SDC A+ E+ T T ++SAA LK+G
Sbjct: 207 NRTNGEPCCGSPTLMKDILREKWGFQGHYVSDCWAIKDFHEHHMVTSTAQESAALALKSG 266
Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDD 414
D+NCG L H A G V E++I A LF+ + LGLF+G Y + +
Sbjct: 267 CDLNCGNTYL-HILMAYQNGLVTEEEITTAAERLFTTRYLLGLFDGS----TYDAIPYEV 321
Query: 415 VCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPC 474
V + H +A +A + IVLLKN+ LPLNK ++ ++ +IGP N+ + G Y G
Sbjct: 322 VESKPHLSVADEATAKSIVLLKNN-GLLPLNKESIKTIGVIGPNANSRKALIGNYHGTSS 380
Query: 475 SPKSLLRGLEAYVS---KTHYASGCH------DVPCNSDAGFHEAVRIAKKADFVIVVAG 525
++L GL+ V + Y+ G H + EA +AK +D VIV G
Sbjct: 381 QYITILEGLQKEVGDEVRILYSEGSHLYADRVEPLAYQRDRLSEAKIVAKHSDVVIVCVG 440
Query: 526 LDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFA 576
LD T E E+ D+ L LP Q LV ++A+ K PVIL L+ G +D+ +
Sbjct: 441 LDETLEGEEGDTGNAYASGDKRDLALPEPQQELVEAMAKMGK-PVILCLSAGSAIDLQY- 498
Query: 577 EADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADS 636
AD+ ++L YPG G + +A+ + G+ P G+LP+T+Y + D +M+
Sbjct: 499 -ADAHYDAVLQAWYPGARGGQVIAKALLGEIVPSGKLPVTFYRDLSGLPAFEDYSMQ--- 554
Query: 637 SRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGS 696
GR+YR+ +Y FG+GL+Y
Sbjct: 555 -----GRTYRYMQEEALYPFGYGLTYGK-------------------------------C 578
Query: 697 RLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRV 756
R++ D+ SLR + V N D VV L+ + P L GF RV
Sbjct: 579 RIEEASYDQ----GSLR----VLVHNEVDFKLEEVVQLYIKNLDSEFAVPNHSLCGFKRV 630
Query: 757 HTVAKGSKEISFGVDPCEQLSIANKHGRRI 786
A +KEI V P + N+ G I
Sbjct: 631 SLEAGETKEIQINVSP-NAFKVVNEQGEWI 659
>gi|291556907|emb|CBL34024.1| Beta-glucosidase-related glycosidases [Eubacterium siraeum V10Sc8a]
Length = 691
Score = 355 bits (911), Expect = 5e-95, Method: Compositional matrix adjust.
Identities = 245/777 (31%), Positives = 376/777 (48%), Gaps = 145/777 (18%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ + LS RA +L L+ +E+ QQL +A AI + G+P+Y WW+E LHG+A G
Sbjct: 4 YKDKQLSAYERAAALADTLSTEEQAQQLKYDAPAIEKAGLPSYNWWNEGLHGVARAG--- 60
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
+ T FPQ + AA+F++ + +G ++ EARAMYN GLT W
Sbjct: 61 -------TATVFPQAIALAAAFDKDMMYRVGEVISTEARAMYNSAAKHGDTDIYKGLTLW 113
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
APNINIFRDPRWGRG ET GEDP + S V FVK QGE
Sbjct: 114 APNINIFRDPRWGRGHETYGEDPYLTSRLGVSFVKGIQGEE------------------- 154
Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFS-RYSFNAMITEQDTEDTFQPPFRSCIE 284
+ L +AC KH + G S R+ F+A ++E+D E+T+ P F++ ++
Sbjct: 155 ---------EYLRAAACAKHFAVHS----GPESLRHEFDARVSEKDMEETYLPAFKALVK 201
Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
+G+ +M +YN+VNG P+C L K R EWGF GY SDC A+ T T
Sbjct: 202 EGRVEGVMGAYNRVNGEPSCASEKLMGKLR-EWGFDGYFVSDCWAIRDFHTTHKITDTAP 260
Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
SAA LKAG D+NCG L H +A+++G + ++DI A ++ ++RLG + +
Sbjct: 261 QSAAMALKAGCDVNCGNTYL-HILAALEEGLITKQDIRTACIHALRTRIRLGQLDDN--- 316
Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
++ L D + +K L+L+AA + +VLL ND LPL+K+ +SS+A+IGP ++ +
Sbjct: 317 -EFDDLPFDIIACDGNKALSLEAAEKSMVLLHND-GILPLDKSRISSIAVIGPNADSRAA 374
Query: 465 MGGGYTGIPCSPKSLLRGLE-AYVSKTHYASGCH---------DVPCNSDAGFHEAVRIA 514
+ G Y G P + L G++ A+ + +YA GC +P + + EAV
Sbjct: 375 LLGNYNGTPDRSVTFLEGIQDAFDGRVYYAEGCQLFRDRTQGLALPGDR---YAEAVAAC 431
Query: 515 KKADFVIVVAGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVL 565
+ AD ++ GLD T E E+ D+ L LP Q L+ ++ T K P+I+VL
Sbjct: 432 EAADVTVICVGLDATLEGEEGDTGNEFASGDKPDLRLPEVQRVLLQNLKDTGK-PLIIVL 490
Query: 566 TGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV 625
G ++ E ++ I++ W YPG+ G KALAEI+FG+ +P G+LP+T+Y +
Sbjct: 491 AAGSSVNTE-CEGNALINA--W--YPGQYGGKALAEILFGEVSPSGKLPVTFYKSADMLP 545
Query: 626 PMNDMNMRADSSRQYPGRSYRFYTGTQ--VYGFGHGLSYTNYSYKFLSAPSELTISASLK 683
D +M+ R+YRF +Y FG+GL+Y+++
Sbjct: 546 DFTDYSMK--------NRTYRFCDDESNVLYPFGYGLTYSHF------------------ 579
Query: 684 AGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH---VQISVTNAGDVDGSHVVMLFARVPK 740
C + + + ++VTN G V+ ++ K
Sbjct: 580 ------------------------ECGDVSYKDNTLAVNVTNTGSRSAEDVLQVYI---K 612
Query: 741 VSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
G L F+RV S+ IS + P + +G R + G + L G
Sbjct: 613 SENGVKNHSLCAFERVSLFDGESRTISINI-PEGAFETVDDNGIRAVRSGRYTLYAG 668
>gi|326789672|ref|YP_004307493.1| beta-glucosidase [Clostridium lentocellum DSM 5427]
gi|326540436|gb|ADZ82295.1| Beta-glucosidase [Clostridium lentocellum DSM 5427]
Length = 704
Score = 355 bits (910), Expect = 7e-95, Method: Compositional matrix adjust.
Identities = 235/766 (30%), Positives = 370/766 (48%), Gaps = 124/766 (16%)
Query: 64 RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
+A L++ + L EK L ++ AI RLG+P Y WWSE+LHG+A G T
Sbjct: 8 KAGQLVAQMDLLEKASMLRYDSPAIKRLGVPTYNWWSEALHGVARAGVA----------T 57
Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFWAPNINIFRDP 175
FPQ + AA F+ I +A EARA YN G+T WAPNINIFRDP
Sbjct: 58 VFPQAIGMAAMFDEEYLYEIADIIATEARAKYNEFAKKEDRDIYKGMTLWAPNINIFRDP 117
Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGEN----WKSDDGGIGFGFREKRVLKGFGEES 231
RWGRG ET GEDP + S V F+ QG+ WK+
Sbjct: 118 RWGRGHETYGEDPYLTSRLGVAFIHGLQGDENHHYWKA---------------------- 155
Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
+AC KH + + R+ F+A+++++D +T+ P F + + +GK + +
Sbjct: 156 --------AACAKH---FAVHSGPEEERHHFDAVVSKKDLYETYLPAFEAAVTKGKVAGM 204
Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGV 350
M +YN+VNG PAC L Q + EWGF GY+ SDC A+ T T +SAA
Sbjct: 205 MGAYNRVNGEPACGSKVLLQDILKEEWGFDGYVVSDCWAIRDFHTEHMVTHTATESAALA 264
Query: 351 LKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKL 410
+ G +NCG L H A +G V E+ I ++ L +++++LGLF+ + +Y K+
Sbjct: 265 INNGCQLNCGNTYL-HMLQAYKEGLVTEETITKSAQKLMAIRMKLGLFD---KNCEYNKI 320
Query: 411 GPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT 470
+ H+ +ALD AR+ +VLLKN+ LPLN ++ +IGP N+ + + G Y
Sbjct: 321 PYEVNDCKVHRDIALDVARRSMVLLKNNG-ILPLNLKQTKAIGVIGPTANSRTVLQGNYF 379
Query: 471 GIPCSPKSLLRGLEAYV---SKTHYASGCHDVPCNSDAGFH-------EAVRIAKKADFV 520
G + L G++ YV ++ +YA GCH + NS +G EA+ +A+++D V
Sbjct: 380 GTASRYTTFLEGIQDYVGDAARVYYAEGCH-LFKNSISGLSWENDRLSEALIVAEQSDVV 438
Query: 521 IVVAGLDLTQETEDRD---------RVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPL 571
I+ GLD + E E D + L L G+Q L+ V + K P IL+L+ G +
Sbjct: 439 ILCLGLDASIEGEQGDTGNAFAAGDKSDLNLIGRQQLLLEEVLKIGK-PTILILSSGSAM 497
Query: 572 DVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMN 631
+ A+ +IL YPG++G KALA+++FG+++P G+LP+T+Y + D +
Sbjct: 498 AIHTAQ--EYCEAILETWYPGQSGGKALAQLLFGEYSPSGKLPITFYKTTEELPDFRDYS 555
Query: 632 MRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNIL 691
M GR+YR+ +Y FG+GL+Y K A +K ++NI
Sbjct: 556 M--------AGRTYRYMKNEALYPFGYGLNYAKVEVK----------DAVIK---ERNIE 594
Query: 692 QQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLI 751
+ + + +Q+ VTN +V VV ++ + + P L
Sbjct: 595 NE------------------IIYEIQLQVTNQSEVCTYDVVQVYIKDMESRWAVPNYSLC 636
Query: 752 GFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
+ ++ A +I+ + I ++ G+R + + L +G
Sbjct: 637 AYKSIYLAAYDEPQITLQIKQS-AFEIVDEEGKRYIDSHHFKLFIG 681
>gi|121700633|ref|XP_001268581.1| beta-xylosidase XylA [Aspergillus clavatus NRRL 1]
gi|119396724|gb|EAW07155.1| beta-xylosidase XylA [Aspergillus clavatus NRRL 1]
Length = 743
Score = 355 bits (910), Expect = 8e-95, Method: Compositional matrix adjust.
Identities = 238/751 (31%), Positives = 363/751 (48%), Gaps = 105/751 (13%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
C+T S RA +LISL TL+E + + + +PRLG+P Y+ W+E+LHG+ +
Sbjct: 63 CDTLTSPYDRAAALISLFTLEELVNATGNTSPGVPRLGLPPYQVWNEALHGL--DRAYFT 120
Query: 115 FNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRD 174
G S TSFP +++ ++ NR+L + + S ++ + RA N G+ GL ++PNIN FR
Sbjct: 121 DEGQFSWSTSFPMPILTMSALNRTLINQVASIISTQGRAFSNAGRYGLDVYSPNINSFRH 180
Query: 175 PRWGRGQETPGEDPMVVS-AYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDR 233
P WGRGQETPGED +S AYA E++ QG G K
Sbjct: 181 PVWGRGQETPGEDAYCLSSAYAYEYITGIQG------------GVDPK------------ 216
Query: 234 GDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMC 293
L L A KH YD+E W SR + IT+QD + + P F K +MC
Sbjct: 217 --SLKLVATAKHYAGYDIENWDGHSRLGNDMNITQQDLSEYYTPQFLVAARDAKVRSVMC 274
Query: 294 SYNQVNGVPACLRGDLFQKA-RNEWGF--KGYITSDCDAVATIFEYQNYTKTHEDSAAGV 350
SYN VNGVP+C Q R+ +GF GYI+SDCD+ +F Y +AA
Sbjct: 275 SYNAVNGVPSCANSFFLQTLLRDTFGFVEDGYISSDCDSAYNVFNPHEYAANVSSAAADS 334
Query: 351 LKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKL 410
++AG DI+CGT + A V + + RA + ++L L +L
Sbjct: 335 IRAGTDIDCGTTYQYYFDEA-----VDQNLLSRADIERGVIRLYSNLM----------RL 379
Query: 411 GPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT 470
G DV GP +N +Q+ G Y
Sbjct: 380 GYFDV----------------------------------------GPWMNVSTQLQGNYF 399
Query: 471 GIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQ 530
G S L +YA G ++ NS GF +A+ AKK+D +I G+D +
Sbjct: 400 GPAPYLISPLDAFRDSHLDVNYAFGT-NISSNSTDGFSKALSAAKKSDAIIFAGGIDNSL 458
Query: 531 ETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGY 590
E E DR+++ PG+Q+ L+ +++ K P+I++ GGG +D S +++ ++S++W GY
Sbjct: 459 EAETLDRMNITWPGKQLELIDQLSQLGK-PLIVLQMGGGQVDSSLLKSNKNVNSLIWGGY 517
Query: 591 PGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRFYT 649
PG++G +AL +II G P GRL +T YP + T+ P DM++R + PG++Y +YT
Sbjct: 518 PGQSGGQALLDIITGKRAPAGRLVVTQYPAEYATQFPATDMSLRPHGNN--PGQTYMWYT 575
Query: 650 GTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSC 709
GT VY FGHGL YT + +S + I + +++L Q Y+H++++
Sbjct: 576 GTPVYEFGHGLFYTTFR---VSHARAVKIKPTYNI---QDLLAQ--PHPGYIHVEQM--- 624
Query: 710 TSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFG 769
L F V I TN G + MLFA P+K L+GFDR+ T+ + ++
Sbjct: 625 PFLNFTVDI--TNTGKASSDYTAMLFANTTAGPAPYPKKWLVGFDRLPTLGPSTSKLMTI 682
Query: 770 VDPCEQLSIANKHGRRILPLGNHVLMVGELR 800
++ ++ G R+L G + L + R
Sbjct: 683 PVTINSMARTDELGNRVLYPGKYELALNNER 713
>gi|405955586|gb|EKC22647.1| Putative beta-D-xylosidase 2 [Crassostrea gigas]
Length = 745
Score = 354 bits (909), Expect = 9e-95, Method: Compositional matrix adjust.
Identities = 237/779 (30%), Positives = 378/779 (48%), Gaps = 112/779 (14%)
Query: 48 HFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNAS--------AIPRLGIPAYEWW 99
H YPF NTSL R K L+ LT++E + Q+S S A+PRLG+ + W
Sbjct: 22 HVQDYPFRNTSLPWDARVKDLVDRLTIEEIVVQMSRGGSGPRASPAPAVPRLGVGPFSWN 81
Query: 100 SESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ 159
+E L G G + TSFPQ L AA+F+ + ++ SA ++E RA +N Q
Sbjct: 82 TECLRGDVYAG----------NATSFPQALGLAATFSTEVICDVASATSIEVRAKFNDYQ 131
Query: 160 --------AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDD 211
G++ ++P INI R P WGR QET GEDP + A FVK QG
Sbjct: 132 RRKIYGDHKGISCFSPVINIMRHPLWGRNQETYGEDPFLSGELAAIFVKCLQG------- 184
Query: 212 GGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDT 271
D + +A CKH + + SR+SF+A ++E+D
Sbjct: 185 --------------------DDPTYIRANAGCKHFDVHGGPENIPVSRFSFDAKVSERDW 224
Query: 272 EDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAV 330
TF P F+ C++ G S +MCS+N++NGVPAC L R EWGF GY+ SD +A+
Sbjct: 225 RLTFLPAFKRCVQAGSYS-LMCSFNRINGVPACGNKRLLTDILRTEWGFTGYVVSDQEAI 283
Query: 331 ATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQ----SAIDKGKVQEKDIDRALL 386
I Y +YT D+AA +KAG ++ T ++ T A+ GK+ ++D+ +++
Sbjct: 284 ENIMTYHHYTNNSVDTAALCVKAGCNLELSTNEVKPTYFYIIDALKAGKLDKEDLVKSVS 343
Query: 387 NLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNK 446
LF ++RLG F+ P Y + + + EH+ ++L+AA + VLLKN FLP+ K
Sbjct: 344 PLFYTRMRLGEFD-PPDHNPYNFIDLSVIQSEEHRAISLNAAMKSFVLLKNKGGFLPITK 402
Query: 447 NAVSSLAIIGPLVNNISQMGGGYTG--IPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSD 504
+++++GP+ +N Q G Y +P + L+GL + YA+GC+D C S
Sbjct: 403 -LFDTISVLGPMADNKYQQIGSYAPDVMPSYTTTPLQGLSKLSKRVQYAAGCNDNAC-SK 460
Query: 505 AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKR--PVI 562
E R +D V G E ED DR S+ LPGQQ L+ S + P++
Sbjct: 461 YNRTEIQRAVNSSDIFFVCLGTGPMIENEDHDRASMELPGQQAQLLKDAIMFSAKGVPIV 520
Query: 563 LVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIF---GDFNPGGRLPMTWYP 619
L+L GGP+++++A+ ++ +I+ +P + +A+ ++ NP GRLP TW P
Sbjct: 521 LLLFNGGPVNITWADRSDRVVAIMECFFPAQETGEAVLRVVTNTGNSSNPAGRLPYTW-P 579
Query: 620 ESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTI 678
+ ++P M + +M GR+YR++ G +Y FG+GLSY+ +++ + +
Sbjct: 580 KYQDQIPSMVNYSME--------GRTYRYFHGDPLYPFGYGLSYSTFNF------TNAWM 625
Query: 679 SASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARV 738
+ + G D V++ V N G DG V+ ++ +
Sbjct: 626 NPIISQGQD--------------------------LTVRVEVCNEGPTDGDEVIQVYLKW 659
Query: 739 PKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
++ P QL+GF+RV AK + V E +++ N+ + G + L +G
Sbjct: 660 LDTNETMPIHQLVGFERVSLRAKETLSWLITVR-AENMAVWNESRGFYIEPGRYRLYIG 717
>gi|423269271|ref|ZP_17248243.1| hypothetical protein HMPREF1079_01325 [Bacteroides fragilis
CL05T00C42]
gi|423273165|ref|ZP_17252112.1| hypothetical protein HMPREF1080_00765 [Bacteroides fragilis
CL05T12C13]
gi|392701693|gb|EIY94850.1| hypothetical protein HMPREF1079_01325 [Bacteroides fragilis
CL05T00C42]
gi|392708197|gb|EIZ01305.1| hypothetical protein HMPREF1080_00765 [Bacteroides fragilis
CL05T12C13]
Length = 722
Score = 354 bits (909), Expect = 9e-95, Method: Compositional matrix adjust.
Identities = 228/745 (30%), Positives = 364/745 (48%), Gaps = 97/745 (13%)
Query: 56 NTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF 115
+ S ++ R ++LI +TL EK+ QL + +IPRL +PAY +W+E LHG+A G
Sbjct: 53 DLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAG----- 107
Query: 116 NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDP 175
VT FPQ + A++++ L + SA++ EAR Y GLT+W+P IN+ RDP
Sbjct: 108 -----EVTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDP 162
Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
RWGR +ET GE+P + S V FVK QG D
Sbjct: 163 RWGRNEETYGEEPHLTSRLGVAFVKGLQG---------------------------DHPT 195
Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
L A KH +A + E +R+S ++ I + + + P + +C+++ A +M +Y
Sbjct: 196 YLKTVATIKHFVANNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAY 251
Query: 296 NQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
N NGVP L R EWGF G++ SDC A+ + + E++AA + +G
Sbjct: 252 NAFNGVPPSGSHWLLDDVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSG 311
Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK-GKYGKLGPD 413
D+ CGT A+++G + E IDRAL + + + +LG F DP + Y
Sbjct: 312 CDLECGTTYKEKLVQAVEQGLISEAAIDRALTRVLTARFKLGEF--DPMELVPYNHYDKK 369
Query: 414 DVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIP 473
+ + +LA +AA + +VLLKND LPLNK + S+A++GP + GGY+G P
Sbjct: 370 LLAGKKFAELAYEAAVKSVVLLKND-ALLPLNKEKIKSVAVVGPFAD--YNYLGGYSGQP 426
Query: 474 CSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETE 533
SLL+G++ + K + + + ++D+ ++ K AD V+V G D E
Sbjct: 427 PYSVSLLKGVKELIGKKGKVTYLNGMGTSADS----IAQVVKGADIVLVALGSDEKMARE 482
Query: 534 DRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGE 593
+ D S+ LP Q L+ + + + R ++LV G PL + AD+ I +I+ YPG+
Sbjct: 483 NHDMPSIYLPEGQEKLLKEIYQVNPR-IVLVFHTGNPLTSEW--ADTHIPAIMQAWYPGQ 539
Query: 594 AGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQV 653
+ALA ++FG+ NP G+LPMT Y + D +M + GR+YR+ G +
Sbjct: 540 EAGRALANLLFGNENPSGKLPMTIYKTEEQLPDILDFDM-------WKGRTYRYMKGEPL 592
Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
YGFGHGLSYT++ + + D + D + C+
Sbjct: 593 YGFGHGLSYTSFEFDNIQGN-------------------------DTLQPDAILQCS--- 624
Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEKQLIGFDRVHTVAKGSKEISFGVDP 772
+ ++N+G + G VV ++ T P K+L+ F +V + K++ F + P
Sbjct: 625 ----VELSNSGQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTIAP 680
Query: 773 CEQLSIANKHGRRILPLGNHVLMVG 797
E LS+ R+L G + L +G
Sbjct: 681 RE-LSVWEDGKWRMLS-GKYTLFIG 703
>gi|307719075|ref|YP_003874607.1| glycoside hydrolase family protein [Spirochaeta thermophila DSM
6192]
gi|306532800|gb|ADN02334.1| glycoside hydrolase family 3 [Spirochaeta thermophila DSM 6192]
Length = 693
Score = 354 bits (908), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 245/756 (32%), Positives = 361/756 (47%), Gaps = 113/756 (14%)
Query: 64 RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
R SL+S ++++EK + A +PRLGIP Y WW+E+LHG+A++G T
Sbjct: 6 RMTSLLSRMSIEEKAGLMVHRAKGVPRLGIPNYNWWNEALHGVANSG----------EAT 55
Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN-LG-------QAGLTFWAPNINIFRDP 175
FPQ + AA+F+ L + A++ EARA +N +G + GLTFW+PNINI+RDP
Sbjct: 56 VFPQAIGLAATFDPDLVRRVADAISREARAKFNAVGKERAAEYERGLTFWSPNINIYRDP 115
Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
RWGRGQET GEDP + S V FVK QG D
Sbjct: 116 RWGRGQETYGEDPFLTSKIGVAFVKGLQG---------------------------DHPY 148
Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
L ++AC KH Y + R+ F+A ++E+D +T+ P F + ++ G +M +Y
Sbjct: 149 YLRVAACAKH---YAVHSGPEGLRHVFDARVSEKDLWETYLPAFEALVKAG-VEAVMGAY 204
Query: 296 NQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
N+VNG PAC L ++ R +WGFKG++ SDC A+A + TK +S A L+AG
Sbjct: 205 NRVNGEPACGSKRLLEEILRKKWGFKGHVVSDCWAIADFHLHHKVTKDPIESIAMALEAG 264
Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDD 414
D+NCG H A+ G V E+ +DR++ L S RLGLF D Y +L D
Sbjct: 265 CDLNCGNTY-EHLLDAVKAGAVSEELVDRSVARLLSTLDRLGLFTDD---HPYVRLSLAD 320
Query: 415 VCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPC 474
+ H+ LA +AA + +VLLKN+ LPL++ + + + GP N + G Y G+
Sbjct: 321 IDWEAHRALAREAAEKSVVLLKNN-GILPLDRRKLRYIYVTGPNAANPVALLGNYAGVSS 379
Query: 475 SPKSLLRGLEAYVS---KTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQE 531
++L G+ Y Y GC + N A +A+ AD + V G D E
Sbjct: 380 RLVTVLEGITGYAGPGITVTYKIGC-PLQGNKINPIDWASGVARYADVTVAVMGRDSAVE 438
Query: 532 TEDRDRV---------SLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQI 582
E+ D + L L +Q+ + + K P+++VL G P V E +
Sbjct: 439 GEEGDAIFSDNYGDLSDLNLSREQIDYLRRIKEIGK-PLVVVLLSGAP--VCSPELEELA 495
Query: 583 SSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV-PMNDMNMRADSSRQYP 641
+I++ YPGE G A+A ++FG+ +P GRLP+T +P+ ++ P D +M
Sbjct: 496 DAIVYAWYPGEEGGNAIARVLFGEVSPSGRLPIT-FPKGVDQLPPFTDYSME-------- 546
Query: 642 GRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYV 701
GR+YR+ +Y FG GLSY +SY+ P + + SR D
Sbjct: 547 GRTYRYMKEEPLYPFGFGLSYATFSYR---DP------------------KSSASRWDKR 585
Query: 702 HIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAK 761
EV V N + VV L+ R P L GF RV
Sbjct: 586 ETLEVVC----------EVENTSSIPADEVVQLYVRWEDAPFRVPLWSLKGFTRVSLGTG 635
Query: 762 GSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
++ F + P E LS ++ GR++LP G VG
Sbjct: 636 ERIQVRFVLSP-EDLSFIDEKGRKVLPEGRLRFHVG 670
>gi|167751044|ref|ZP_02423171.1| hypothetical protein EUBSIR_02029 [Eubacterium siraeum DSM 15702]
gi|167655962|gb|EDS00092.1| glycosyl hydrolase family 3 C-terminal domain protein [Eubacterium
siraeum DSM 15702]
Length = 691
Score = 353 bits (907), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 245/777 (31%), Positives = 375/777 (48%), Gaps = 145/777 (18%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ + LS RA +L L+ +E+ QQL +A AI + G+P+Y WW+E LHG+A G
Sbjct: 4 YKDKQLSAYERAAALADTLSTEEQAQQLKYDAPAIEKAGLPSYNWWNEGLHGVARAG--- 60
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
+ T FPQ + AA+F++ + +G ++ EARAMYN GLT W
Sbjct: 61 -------TATVFPQAIALAAAFDKDMMYRVGEVISTEARAMYNSAAKHGDTDIYKGLTLW 113
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
APNINIFRDPRWGRG ET GEDP + S V FVK QGE
Sbjct: 114 APNINIFRDPRWGRGHETYGEDPYLTSRLGVNFVKGIQGEE------------------- 154
Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFS-RYSFNAMITEQDTEDTFQPPFRSCIE 284
+ L +AC KH + G S R+ F+A ++E+D E+T+ P F++ ++
Sbjct: 155 ---------EYLRAAACAKHFAVHS----GPESLRHEFDARVSEKDMEETYLPAFKALVK 201
Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
+G+ +M +YN+VNG P+C L K R EWGF GY SDC A+ T T
Sbjct: 202 EGRVEGVMGAYNRVNGEPSCASEKLMGKLR-EWGFDGYFVSDCWAIRDFHTTHKITDTAP 260
Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
SAA LKAG D+NCG L H +A+++G + +++I A ++ ++RLG + +
Sbjct: 261 QSAAMALKAGCDVNCGNTYL-HILAALEEGLITKQNIRTACIHALRTRIRLGQLDDN--- 316
Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
++ L D + +K L+L+AA + +VLL ND LPL+K+ +SS+A+IGP ++ +
Sbjct: 317 -EFDDLPFDIIACDGNKALSLEAAEKSMVLLHND-GILPLDKSRISSIAVIGPNADSRAA 374
Query: 465 MGGGYTGIPCSPKSLLRGLE-AYVSKTHYASGCH---------DVPCNSDAGFHEAVRIA 514
+ G Y G P + L G++ A+ + +YA GC +P + + EAV
Sbjct: 375 LLGNYNGTPDRSVTFLEGIQDAFDGRVYYAEGCQLFRDRTQGLALPGDR---YAEAVAAC 431
Query: 515 KKADFVIVVAGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVL 565
+ AD +V GLD T E E+ D+ L LP Q L+ + T K P+I+VL
Sbjct: 432 EAADVTVVCVGLDATLEGEEGDTGNEFASGDKPDLRLPEVQRVLLQKLKDTGK-PLIIVL 490
Query: 566 TGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV 625
G ++ E ++ I++ W YPG+ G KALAEI+FG+ +P G+LP+T+Y +
Sbjct: 491 AAGSSVNTE-CEGNALINA--W--YPGQYGGKALAEILFGEVSPSGKLPVTFYKSADMLP 545
Query: 626 PMNDMNMRADSSRQYPGRSYRFYTGTQ--VYGFGHGLSYTNYSYKFLSAPSELTISASLK 683
D +M+ R+YRF +Y FG+GL+Y+++
Sbjct: 546 DFTDYSMK--------NRTYRFCDDESNVLYPFGYGLTYSHF------------------ 579
Query: 684 AGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH---VQISVTNAGDVDGSHVVMLFARVPK 740
C + + + ++VTN G V+ ++ K
Sbjct: 580 ------------------------ECGDISYKDNTLAVNVTNTGSRSAEDVLQVYI---K 612
Query: 741 VSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
G L F+RV S+ IS + P + +G R + G + L G
Sbjct: 613 SENGVKNHSLCAFERVSLFDGESRTISINI-PEGAFETVDDNGVRAVISGRYTLYAG 668
>gi|373852136|ref|ZP_09594936.1| Beta-glucosidase [Opitutaceae bacterium TAV5]
gi|372474365|gb|EHP34375.1| Beta-glucosidase [Opitutaceae bacterium TAV5]
Length = 740
Score = 353 bits (905), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 246/764 (32%), Positives = 361/764 (47%), Gaps = 117/764 (15%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
PF + L++ R + L+S LTL EK+ Q+ A+AIPRLGIPAY +W+E LHG+A NG
Sbjct: 22 PFRDPDLALDHRVRDLVSRLTLAEKVSQMEHAAAAIPRLGIPAYNYWNECLHGVARNG-- 79
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA-----------G 161
T FPQ++ AA+++ L + +A++ EARA ++ A G
Sbjct: 80 --------RATVFPQIIGLAATWDTDLVYRVATAISDEARAKHHAALARQGFAQTQQYQG 131
Query: 162 LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREK 221
LTFW PNIN+FRDPRWGRGQET GEDP + + A FV+ QG+ +
Sbjct: 132 LTFWTPNINLFRDPRWGRGQETWGEDPHLTARLAAAFVRGLQGDTPDT------------ 179
Query: 222 RVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRS 281
L L+AC KH Y + R++FNA +T D D++ P F
Sbjct: 180 --------------HLKLAACAKH---YAVHSGPENERHTFNARVTPHDLWDSYLPAFEH 222
Query: 282 CIEQGKASCIMCSYNQVNGVPACL-RGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYT 340
+ + +M +YN+ P C + L R WGF+G++ SDC A+ I E T
Sbjct: 223 LVRHARVESVMGAYNRTLDEPCCASQFLLLDILRERWGFEGHVVSDCWALRDIHETHRIT 282
Query: 341 KTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN- 399
+SAA L G D+ CGT A+ +G + E DIDRAL + +LG+F+
Sbjct: 283 TDPVESAALALTKGCDLACGT-TFELLGEAVQRGLITEADIDRALSRHLRARFKLGMFDP 341
Query: 400 -GDPRKGKYGKLGPDDVCT-SEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
D R P+ + T + H LA +AA VLL+N LPL + V S+ I GP
Sbjct: 342 ADDNRNPWSNPPAPEAIVTCAAHTALACEAAVASCVLLQNHNHILPLRPD-VRSIYITGP 400
Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCHDVPCNSDAGFHEAVRIA 514
L + G Y G+P +LL GL A + + Y G +A A
Sbjct: 401 LAATQDALLGNYYGLPPRAITLLDGLAAALPEGIRADYRPGALLSTPKQNA-LEWAEFDC 459
Query: 515 KKADFVIVVAGLDLTQETE-----------DRDRVSLLLPGQQMSLVTSVARTSKRPVIL 563
D I GL E E DRD +SL P Q++ L + + R ++ VI+
Sbjct: 460 ASCDVTIACLGLTALLEGEEGEAIASSLHGDRDDISLP-PPQRLFLESLIQRGAR--VIV 516
Query: 564 VLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT 623
+L GG L S ++ +ILW GYPG+ G +ALA+I+ G +P GRLP+T+Y
Sbjct: 517 ILFGGSAL--SLGPLADKVEAILWAGYPGQEGGRALADILLGRASPSGRLPITFYENIND 574
Query: 624 KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLK 683
P + +MR GR++R++ GT + FG GL+YT ++Y S+L +S
Sbjct: 575 LPPYANYSMR--------GRTHRWFDGTPAWPFGFGLTYTRFTY------SDLRVSDVYS 620
Query: 684 AGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLF-ARVPKVS 742
G+D + C S + +TN GD + + +V ++
Sbjct: 621 PGNDSPL------------------CGS------VLLTNTGDHEAAEIVQIYLTDFDAPG 656
Query: 743 QG-TPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRR 785
G P + L F RV S+ + F + P E + + + +GRR
Sbjct: 657 NGPVPRENLADFHRVTLAPGQSRRVEFSIPP-EHILLVDTNGRR 699
>gi|451821678|ref|YP_007457879.1| periplasmic beta-glucosidase BglX [Clostridium
saccharoperbutylacetonicum N1-4(HMT)]
gi|451787657|gb|AGF58625.1| periplasmic beta-glucosidase BglX [Clostridium
saccharoperbutylacetonicum N1-4(HMT)]
Length = 710
Score = 352 bits (904), Expect = 4e-94, Method: Compositional matrix adjust.
Identities = 242/762 (31%), Positives = 364/762 (47%), Gaps = 118/762 (15%)
Query: 64 RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
+AK L+S +TLQE+ +QL+ A AI L I Y WW+E LHG+A G + T
Sbjct: 15 KAKELVSKMTLQERAEQLTYKAPAIKHLNISRYNWWNEGLHGVARAG----------TAT 64
Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFWAPNINIFRDP 175
FPQ + AA F+ L I +A E RA YN GLTFW+PN+NIFRDP
Sbjct: 65 VFPQAIGLAAIFDDELLEKIAGIIATEGRAKYNENSKKEDKDIYKGLTFWSPNVNIFRDP 124
Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
RWGRG ET GEDP + S V FVK QG+ ++ LK
Sbjct: 125 RWGRGHETYGEDPYLTSRLGVAFVKGLQGD---------------EKYLK---------- 159
Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
++AC KH + + R+ FNA+++++D +T+ P F +C+++ +M +Y
Sbjct: 160 ---IAACAKHFAVHSGPEG---LRHEFNAVVSKKDLYETYLPAFEACVKEADVEAVMGAY 213
Query: 296 NQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
N+ N P C L + R +W FKG++ SDC A+A Y T T +SAA +K G
Sbjct: 214 NRTNDEPCCGSSLLLKDILRGKWQFKGHVVSDCWAIADFHLYHGVTSTATESAALAIKNG 273
Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDD 414
D+NCG L+ A +G V E+DI RA L + ++RLG+F+ + K D
Sbjct: 274 CDLNCGNVYLQMLL-AYKEGLVTEEDITRAAERLMATRIRLGMFDEECEFNKIPYTMND- 331
Query: 415 VCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPC 474
EH +++L A+R+ IV+L+N+ LPL+K+ + S+ IIGP ++ + G Y G
Sbjct: 332 --CKEHHEVSLMASRKSIVMLRNN-GLLPLDKSKLKSIGIIGPNADSELMLKGNYFGTAS 388
Query: 475 SPKSLLRGLEAYVS----KTHYASGCHDVP------CNSDAGFHEAVRIAKKADFVIVVA 524
++L G+ V + Y+ GCH D EAV +A+ +D VI+
Sbjct: 389 KYITVLEGIHEAVDSENIRIFYSEGCHLYKDRVQDLAEPDDRMAEAVTVAEHSDVVILCL 448
Query: 525 GLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
GLD + E E D+++L LPG+Q L+ V T K PVI+VL G L +
Sbjct: 449 GLDSSIEGEQGDAGNSDGAGDKLNLNLPGKQQELLEKVIATGK-PVIVVLGAGSALTLQG 507
Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRAD 635
E + ++IL YPG G +A+A++IFG +P G+LP+T+Y + D +M+
Sbjct: 508 QEENC--AAILNAWYPGSFGGRAIADLIFGKCSPSGKLPVTFYKTTEELPEFTDYSMK-- 563
Query: 636 SSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTG 695
R+YR+ +Y FG GL+Y+ S+L++S K
Sbjct: 564 ------NRTYRYMKNESLYPFGFGLTYSKVQL------SDLSVSDISK------------ 599
Query: 696 SRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDR 755
D+ ++ V I ++N G+ D V+ + + + L F R
Sbjct: 600 ---DFEGVE-----------VSIKISNVGNFDIEEVLQCYIKDLESKYAVDNHSLSAFKR 645
Query: 756 VHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
V SK + ++ + N G RIL L VG
Sbjct: 646 VALNKGESKVVKMTINK-RAFEVVNDEGDRILDSKKFKLFVG 686
>gi|348684866|gb|EGZ24681.1| hypothetical protein PHYSODRAFT_325770 [Phytophthora sojae]
Length = 805
Score = 352 bits (903), Expect = 5e-94, Method: Compositional matrix adjust.
Identities = 257/753 (34%), Positives = 376/753 (49%), Gaps = 102/753 (13%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPR-----LGIPAYEWWSESLHGIA 107
PFCNTSLS + R + L+S L LQEK L+ AS PR +G+P Y W + +HG+
Sbjct: 36 PFCNTSLSTADRVEDLLSRLPLQEKATLLTARAS--PRGNMSSIGLPEYNWGANCVHGVQ 93
Query: 108 SNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG--------- 158
S G N TSFP + A F+ + ++ + E RA++ G
Sbjct: 94 STC-GTN------CPTSFPNPVNLGAIFDPQVVFDMAQVIGWELRALWLEGATENYKGGP 146
Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
GL W+PNINI RDPRWGR ETP EDP+V S Y V + + Q
Sbjct: 147 HLGLDCWSPNININRDPRWGRNTETPSEDPLVNSKYGVAYTRGLQ--------------- 191
Query: 219 REKRVLKGFGEESDRGDELMLSACC--KHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQ 276
E R D L A KH AY E +G +R F+A+++ D DT+
Sbjct: 192 -----------EGKRQDPRFLQAVVTLKHYAAYSYENYGGVNRMEFDAIVSPYDFADTYF 240
Query: 277 PPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFE 335
P FRS + G A +MCSYN VNG+P C +L + R GF GY+TSD AV I +
Sbjct: 241 PAFRSSVVDGNAKGVMCSYNSVNGIPMCANKELVETLLRGTLGFDGYVTSDSGAVEAISD 300
Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
+Y + ++A + AG DIN G ++ +D +++EK +D AL + ++ L
Sbjct: 301 MHHYADSQCEAARLAILAGTDINSGKSYEACLKTLVDDNQLEEKALDDALRHTLKLRFEL 360
Query: 396 GLFNGDPRKGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
GLF DP + Y + P +V T+ K L+L+A R+ +V+L+N+ LPL K LA+
Sbjct: 361 GLF--DPIDDQPYWNVTPSEVNTAAAKALSLNATRKSLVMLQNNASVLPLQKGV--KLAV 416
Query: 455 IGP--------LVNNISQM-GGGYTGIPC--SPKSLLRGLEAYVSKTHYASGCHDVPCNS 503
+GP L N + QM G Y + C +P +R S T +A GC + NS
Sbjct: 417 LGPHAKSKRGLLGNYLGQMCHGDYDEVGCVQTPLDAIRAANG-ASNTTFAEGC-GISGNS 474
Query: 504 DAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVIL 563
AGF +AV AK+AD V++ G+D + E E DR ++ LP QM L+ V RP ++
Sbjct: 475 TAGFEKAVAAAKEADAVVLFLGIDKSIEGEVGDRNNIDLPNIQMQLLQRV-HAVGRPTVV 533
Query: 564 VLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT 623
VL GG + E + +++ YPG GA+A+A+++FGD NP G+LP+T Y +
Sbjct: 534 VLINGGVIGAE--EIIERTDALVEAFYPGFFGARAMADVLFGDTNPSGKLPVTMYRSDYV 591
Query: 624 -KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASL 682
+V M M+M A +PGR+YR++ G V+ FG GLSYT +S L++ +
Sbjct: 592 DQVEMKSMDMTA-----HPGRTYRYFKGEPVFPFGWGLSYTTFS---------LSVDSGT 637
Query: 683 KAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVS 742
+ S N +G EV+ ++ + + V N G+V G VV+ F R +
Sbjct: 638 NSSSHSNNAAFSGG--------EVSDTANVT--ISVVVKNDGEVAGDEVVLAFFRPVNSN 687
Query: 743 QGTP----EKQLIGFDRVHTVAKGSKEISFGVD 771
P +QL + RV S E+SF ++
Sbjct: 688 VTGPATLLNEQLFDYQRVSLGPLDSTEVSFTIE 720
>gi|333494646|gb|AEF56854.1| putative glycosyl hydrolase [synthetic construct]
Length = 743
Score = 352 bits (902), Expect = 6e-94, Method: Compositional matrix adjust.
Identities = 256/777 (32%), Positives = 379/777 (48%), Gaps = 128/777 (16%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ + +LS RA+ L+S +TL+EKI Q+ A +I RLG+PAY WW+E+LHG+A G
Sbjct: 30 YRDENLSFEERARDLVSRMTLEEKIAQMQHEAPSIERLGVPAYNWWNEALHGVARAG--- 86
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ--------AGLTFW 165
VS T FPQ + AA+F+ L ++ E RA Y+ Q GLTFW
Sbjct: 87 -----VS--TMFPQAIGMAATFDAELIEKTADVISTEGRARYHEFQRKGDRDIYKGLTFW 139
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
+P INI RDPRWGRGQET GEDP + S AV F++ QG
Sbjct: 140 SPTINIDRDPRWGRGQETYGEDPYLTSRLAVSFIRGIQG--------------------- 178
Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
RG L +AC KH + + R+ FNA ++++D +T+ P F + +++
Sbjct: 179 -------RGRYLKAAACAKHFAVHSGPE---SERHQFNAEVSQKDLWETYLPAFEASVKE 228
Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
K + +M +YN+VNG P C G L R EW F GY+TSDC A+ I E TKT E
Sbjct: 229 AKVAGVMGAYNRVNGEPCCGSGTLLGDVLRGEWEFGGYVTSDCWAIKDINEGHGVTKTIE 288
Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
+S+A +K+G D+NCG C A G + EK+ID A+ L ++RLG+F+ P K
Sbjct: 289 ESSALAVKSGCDLNCG-CAYASLVKAYRAGLIGEKEIDTAVHRLMLTRMRLGMFDA-PEK 346
Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
Y + + +EH+ AL+ A + +VLL+N FLPL+++ + S+A+IGP ++
Sbjct: 347 VPYSSIPYEKNDCAEHRAFALEVAEKSLVLLRNRSGFLPLDRSRIRSVAVIGPNADSRVA 406
Query: 465 MGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCHDVPCNSDAGFHE-------AVRIA 514
+ G Y G ++L G+ V ++ +YA G H + NS G + A A
Sbjct: 407 LEGNYNGTASEYVTVLDGIREAVGDRARVYYAEGSH-LFRNSMGGLSQKNDRLAEAAAAA 465
Query: 515 KKADFVIVVAGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVL 565
++AD +V GL+ E E+ D+ L LPG Q L+ +V T PV+LVL
Sbjct: 466 ERADVAVVCLGLNRDIEGEEGDPSNEYPAGDKRDLRLPGLQEELLETVKATGT-PVVLVL 524
Query: 566 TGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV 625
G L V++A+ ++ W YPG A A+ +FG P G P
Sbjct: 525 LSGSALAVNWADENADAVVQAW--YPG-AQAEGRRGALFGIIRPAGGFP----------- 570
Query: 626 PMNDMNMRADSSRQYPGRSYR----FYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISAS 681
+ +R +SR + G + G +Y FG+GLSYT + Y +L ++AS
Sbjct: 571 --SRSTVRTRTSRIF-GTIHENRLPLLQGDPLYPFGYGLSYTKFQY------GDLKLAAS 621
Query: 682 -LKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPK 740
+ AG D V ++V NAG+ D VV L+ + +
Sbjct: 622 EIPAGEDA--------------------------EVSVTVRNAGERDSDEVVQLYLQDLE 655
Query: 741 VSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
S P+ QL GF RVH S + F V Q+++ ++ GR +L G + G
Sbjct: 656 SSVPVPKWQLAGFRRVHLKPGESAGVRFTV-AARQMALIDEDGRCVLEPGGFRVYAG 711
>gi|315607899|ref|ZP_07882892.1| beta-glucosidase [Prevotella buccae ATCC 33574]
gi|315250368|gb|EFU30364.1| beta-glucosidase [Prevotella buccae ATCC 33574]
Length = 721
Score = 351 bits (901), Expect = 8e-94, Method: Compositional matrix adjust.
Identities = 238/764 (31%), Positives = 365/764 (47%), Gaps = 121/764 (15%)
Query: 61 ISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVS 120
+S AK +I+ +T+ EKI QL + + AI LGI Y+WWSE LHG+ +G
Sbjct: 30 MSRHAKEIIARMTVSEKISQLMNESPAIEHLGIKPYDWWSEGLHGVGRDG---------- 79
Query: 121 SVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ--------AGLTFWAPNINIF 172
T FPQ + A+F+ +L IG AVA E RA +N+ + AGLTFW+PN+NIF
Sbjct: 80 RATVFPQPIALGATFDEALVREIGDAVATEGRAKFNVARKLKNYSRNAGLTFWSPNVNIF 139
Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
RD RWGRG ET GEDP++ +V+ QG D
Sbjct: 140 RDLRWGRGMETYGEDPLLSGMLGTAYVRGLQG---------------------------D 172
Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
L AC KH Y + +R+ + + +D +T+ P F+ ++QG+ +M
Sbjct: 173 DAFYLKTGACAKH---YAVHSGPEGTRHEADIHPSRRDLFETYLPQFKMLVQQGRVEAVM 229
Query: 293 CSYNQVNGVPA----CLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAA 348
+YN+V G P L D+ +K+ WGF G+I SDCDA+ + Y KT E++ A
Sbjct: 230 SAYNRVYGEPCGGSKYLLTDILRKS---WGFNGHIVSDCDAINDFYGGHRYVKTPEEACA 286
Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
+KAG+++ CG + Q A+D+G + E D+DRAL L +L+LG+ D Y
Sbjct: 287 AAIKAGLNVECGHT-FKAMQGALDQGLLAEADLDRALFPLVMTRLKLGILEPD-SACPYN 344
Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
++C+ H LAL AA + +VLLKN+ LPL+KN + +L + GP ++ + G
Sbjct: 345 SYDESEICSPAHTALALRAADEAMVLLKNN-GILPLDKN-IRTLFVAGPGASDAFYLMGN 402
Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPC-----NSDAGFHEAVRIAKKADFVIVV 523
Y G+ + L+G+ VS+ + + P + AV A A+ IVV
Sbjct: 403 YFGLSNRYSTYLQGI---VSRVSSGTSVNFRPAFMQITEELNDMNWAVNEACAAEVAIVV 459
Query: 524 AGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
G + E E+ DRV + LP Q++ + V +++VLTGG P+D+
Sbjct: 460 MGNNGNMEGEEGEAIASASRGDRVGIGLPASQLNYLRRVKARKGGRIVVVLTGGSPIDLR 519
Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMR 633
E +++ YPG+ G +AL +++FGD N GRLP+T +P +P +D +M
Sbjct: 520 --EISKLADAVVMAWYPGQEGGEALGDLLFGDKNFSGRLPIT-FPADVDSLPAFDDYSMN 576
Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
GR+Y++ +G +Y FG+GLSY +Y ++ + +K G
Sbjct: 577 --------GRTYKYMSGNVMYPFGYGLSYGRVTY------TDARVVGRIKKGEP------ 616
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
V++ +TN GD V + P +G+P L+GF
Sbjct: 617 --------------------LAVEVVLTNNGDRTIDEVAQAYIATPTAGKGSPMASLVGF 656
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
RV K S + F + P E+L G L GN+ L +G
Sbjct: 657 RRVSIPPKSSVKAVFKIVP-ERLMTVQSDGSSKLLKGNYTLTIG 699
>gi|409385818|ref|ZP_11238358.1| Beta-glucosidase [Lactococcus raffinolactis 4877]
gi|399206850|emb|CCK19273.1| Beta-glucosidase [Lactococcus raffinolactis 4877]
Length = 695
Score = 350 bits (897), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 221/658 (33%), Positives = 337/658 (51%), Gaps = 90/658 (13%)
Query: 64 RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
A ++S +TL EKI Q+ +ASAI RL IP Y +W+E LHG+A G T
Sbjct: 11 EAIKIVSQMTLAEKISQIDFDASAIERLNIPHYNYWNEGLHGVARAGVA----------T 60
Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFWAPNINIFRDP 175
FPQ + AA+F+ L +I +++E RA YN GLTFW+PNIN+FRDP
Sbjct: 61 VFPQAIGLAATFDTELVKHIAEVISIEGRAKYNAYTKHGDRDIYKGLTFWSPNINLFRDP 120
Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
RWGRGQET GEDP + + V F+K QGE G
Sbjct: 121 RWGRGQETYGEDPFLTAQIGVAFIKGLQGE----------------------------GK 152
Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
L L+AC KH + + R+ F+A++ +D + + P F++ IE+ M +Y
Sbjct: 153 YLRLAACTKH---FAVHSGPEADRHYFDAVVNPKDLNEFYLPQFKAAIEEADVESFMGAY 209
Query: 296 NQVNGVPACLRGDLFQKAR-NEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
N +NG PAC+ +L K +WGF+G++ SD A+ + E +YT+T ++ A +K G
Sbjct: 210 NAINGQPACVNEELIAKTLLGKWGFEGHVVSDYAALEDVHENHHYTQTAAETMALAMKIG 269
Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDD 414
++ C + A+ KG V E +I +++ L++ +RLG+F D Y + +
Sbjct: 270 TNL-CAGKISDALFEAVGKGLVTETEITASVVKLYTTHVRLGMFAED---NDYDTIPYEV 325
Query: 415 VCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPC 474
++EH+ L+L AA + +VLLKND FLPL+++ + S+A+IGP NI + G Y G
Sbjct: 326 NASAEHEMLSLKAAEKSMVLLKNDN-FLPLSQSEIKSVAVIGPTARNIGALEGNYAGTAN 384
Query: 475 SPKSLLRGLEAYVS---KTHYASGCHDVPCNSDAGFH-------EAVRIAKKADFVIVVA 524
++ + G++ +S + YA GCH ++++ EA+ A+ AD ++
Sbjct: 385 HYETFVSGIQQALSNQARVTYALGCHLYADHAESSLSRANERESEAIIAAEHADIAVLCV 444
Query: 525 GLDLTQETEDRD---------RVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
GLD T E E D + SL LPGQQ L+ V T K VILVLT G L +
Sbjct: 445 GLDPTIEGEQGDAGNVYGSGDKPSLSLPGQQKRLIEKVLETGKT-VILVLTSGSALSLEG 503
Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRAD 635
E + + +I+ YPG G ALA I+ G +P G+LP+T+ ++ +D +M
Sbjct: 504 LEKHTGVKAIIQAWYPGAHGGTALANILLGKVSPSGKLPVTFCKDTQGLPDFSDYSM--- 560
Query: 636 SSRQYPGRSYRFYTGTQ---VYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNI 690
+ R Y TQ +Y FG+GL+Y + K L +LT+S + + D +I
Sbjct: 561 --------AERTYQNTQLEVLYPFGYGLTYGHAEIKTLQL-DDLTLSVTAENKGDYDI 609
>gi|350295750|gb|EGZ76727.1| glycoside hydrolase [Neurospora tetrasperma FGSC 2509]
Length = 839
Score = 348 bits (892), Expect = 8e-93, Method: Compositional matrix adjust.
Identities = 258/821 (31%), Positives = 381/821 (46%), Gaps = 118/821 (14%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
C+ + + RA SL+ LT+ EK+ L D A R+G+P Y WWSE LHG+A + PGV
Sbjct: 37 CDVTGTAPERAASLVDQLTIDEKLVNLVDQALGASRIGLPKYAWWSEGLHGVAGS-PGVT 95
Query: 115 FNGT---VSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
FN T S TSF + ASF+ L +G+A++ EARA N G GL +W PN+N
Sbjct: 96 FNTTGYPFSYATSFANAINLGASFDDDLVYEVGTAISTEARAFANFGFGGLDYWTPNVNP 155
Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGF-GEE 230
++DPRWGRG ETPGEDP+ + Y K +L G G E
Sbjct: 156 YKDPRWGRGAETPGEDPLHIKGYV-------------------------KAMLAGLEGNE 190
Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
+ R + A CKH AYDLE+W +RY F A++T QD + + PPF+ C K
Sbjct: 191 TVR----KVIATCKHYAAYDLERWHGLTRYEFEAIVTLQDLSEYYLPPFQQCARDSKVGS 246
Query: 291 IMCSYNQV------------------NGVPACLRGDLFQKARNEWGF---KGYITSDCDA 329
IMCSYN + PAC L R+ W + YITSDC+A
Sbjct: 247 IMCSYNALTIRDMAGGSKPDEIINLTTAQPACANTYLMTILRDHWNWTEHNNYITSDCNA 306
Query: 330 VATIF-EYQNYTKTHEDSAAGVLKAGMDINCGT--CMLRHTQSAIDKGKVQEKDIDRALL 386
+ + N+++T ++AA KAG D C L A ++ + E ID AL
Sbjct: 307 ILDFLPDNHNFSQTPAEAAAAAYKAGTDTVCEVSGSPLTDVVGAYNQSLLPEAVIDTALR 366
Query: 387 NLFSVQLRLGLFN-----------GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLL 435
L+ +R G + G Y L +DV T ++LAL +A +GIVLL
Sbjct: 367 RLYEGLIRAGYLDHGRSAVAGGDGGSFSSPAYDALNWNDVNTPSTQELALRSATEGIVLL 426
Query: 436 KNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASG 495
KN LPL+ + +A+IG N M G Y+GIP + L + YA+G
Sbjct: 427 KNSGSLLPLDFSG-KKVALIGHWANATGTMRGPYSGIPPFYHNPLYAAQQLNLSLSYANG 485
Query: 496 CHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVAR 555
+ D A+ A+ AD V+ G D T +ED DR S+ P QM L++ +A
Sbjct: 486 PVVNASDPDTWTAPALAAAEGADVVLYFGGTDTTVASEDLDRESIAWPEAQMKLLSELAG 545
Query: 556 TSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPM 615
K V++ L G +D S + +SSILW+GYPG++G A+ +++ G P GRLP+
Sbjct: 546 LGKPVVVIQL--GDQVDDSSLLNNGNVSSILWVGYPGQSGGTAVFDVLTGKKAPAGRLPV 603
Query: 616 TWYPESFT-KVPMNDMNMR-----------------------------ADSSRQYPGRSY 645
T YPE + +VP+ +M +R + + PGR+Y
Sbjct: 604 TQYPEGYVDEVPLTEMALRPFNHSSSNLEEEVSVQGGASLTIQARSTPGNKTLSSPGRTY 663
Query: 646 RFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDE 705
++Y+ T V FG+GL YT ++ + + + S S+ + +L + H+D
Sbjct: 664 KWYS-TPVLPFGYGLHYTTFNVSLSLSSNASSPSFSIPS-----LLTPCTA----THLDL 713
Query: 706 VTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKE 765
S + +S+TN G +V +LF + P K L+ + RV + G +
Sbjct: 714 CPFSPSANSALSVSITNTGTHTSDYVALLFLSGEFGPEPYPLKTLVSYKRVKDIKPG-ET 772
Query: 766 ISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
++ P +I+ G GN VL G R + ++
Sbjct: 773 VTVKDVPVSLGAISRVDGD-----GNTVLYPGTYRFVVDVD 808
>gi|348684865|gb|EGZ24680.1| family 3 glycoside hydrolase [Phytophthora sojae]
Length = 769
Score = 347 bits (890), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 267/793 (33%), Positives = 388/793 (48%), Gaps = 110/793 (13%)
Query: 16 FRVLQLIVVVNVIAFSNSKPVLNKPDFP--CKPPHFDSYPFCNTSLSISTRAKSLISLLT 73
R LQLI I N V DFP C PFCNTSLS + R + L+S L
Sbjct: 1 MRALQLI-----ITLFNWVAVYGDADFPRACVGKTNQELPFCNTSLSTADRVEDLLSRLP 55
Query: 74 LQEKIQQLSDNASAIPR-----LGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQV 128
LQEK L+ AS PR +G+P Y W + +HG+ S G N TSFP
Sbjct: 56 LQEKATLLTARAS--PRGNMSSIGLPEYNWGANCVHGVQSTC-GTN------CPTSFPNP 106
Query: 129 LVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTFWAPNINIFRDPRWGR 179
+ A F+ + ++ + E RA++ G GL W+PNINI RDPRWGR
Sbjct: 107 VNLGAIFDPQVVFDMAQVIGWELRALWLEGATENYKGGPHLGLDCWSPNININRDPRWGR 166
Query: 180 GQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELML 239
ETP EDP+V S Y V + + Q E R D L
Sbjct: 167 NTETPSEDPLVNSKYGVAYTRGLQ--------------------------EGKRQDPRFL 200
Query: 240 SACC--KHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQ 297
A KH AY E +G +R F+A+++ D DT+ P FRS + G A +MCSYN
Sbjct: 201 QAVVTLKHYAAYSYENYGGVNRMEFDAIVSPYDFADTYFPAFRSSVVDGNAKGVMCSYNS 260
Query: 298 VNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMD 356
VNG+P C +L + R GF GY+TSD AV I + +Y + ++A + AG D
Sbjct: 261 VNGIPMCANKELVETLLRGTLGFDGYVTSDSGAVEAISDMHHYADSQCEAARLAILAGTD 320
Query: 357 INCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-YGKLGPDDV 415
IN G ++ +D +++EK +D AL + ++ LGLF DP + Y + P +V
Sbjct: 321 INSGKSYEACLKTLVDDNQLEEKALDDALRHTLKLRFELGLF--DPIDDQPYWNVTPSEV 378
Query: 416 CTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP--------LVNNISQM-G 466
T+ K L+L+A R+ +V+L+N+ LPL K LA++GP L N + QM
Sbjct: 379 NTAAAKALSLNATRKSLVMLQNNASVLPLQKGV--KLAVLGPHAKSKRGLLGNYLGQMCH 436
Query: 467 GGYTGIPC--SPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVA 524
G Y + C +P +R S T +A GC + NS AGF +AV AK+AD V++
Sbjct: 437 GDYDEVGCVQTPLDAIRAANG-ASNTTFAEGC-GISGNSTAGFEKAVAAAKEADAVVLFL 494
Query: 525 GLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISS 584
G+D + E E DR ++ LP QM L+ V RP ++VL GG + E + +
Sbjct: 495 GIDKSIEGEVGDRNNIDLPNIQMQLLQRV-HAVGRPTVVVLINGGVIGAE--EIIERTDA 551
Query: 585 ILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGR 643
++ YPG GA+A+A+++FGD NP G+LP+T Y + +V M M+M A +PGR
Sbjct: 552 LVEAFYPGFFGARAMADVLFGDTNPSGKLPVTMYRSDYVDQVEMKSMDMTA-----HPGR 606
Query: 644 SYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHI 703
+YR++ G V+ FG GLSYT +S L++ + + S N +G
Sbjct: 607 TYRYFKGEPVFPFGWGLSYTTFS---------LSVDSGTNSSSHSNNAAFSGG------- 650
Query: 704 DEVTSCTSLRFHVQISVTNAGDVDGSHVV--MLFARVPKVSQGTPEKQLIGFDRVH--TV 759
EV+ ++ + + V N G+V G V+ + V ++ E L+ F + V
Sbjct: 651 -EVSDTANVT--ISVVVKNDGEVAGDEVLGPLDSTEVSTLALPDEEGNLVSFPGSYEVIV 707
Query: 760 AKGSKE-ISFGVD 771
+ G KE + F V+
Sbjct: 708 SNGVKERLRFSVE 720
>gi|373460527|ref|ZP_09552278.1| hypothetical protein HMPREF9944_00542 [Prevotella maculosa OT 289]
gi|371955145|gb|EHO72949.1| hypothetical protein HMPREF9944_00542 [Prevotella maculosa OT 289]
Length = 699
Score = 347 bits (889), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 254/770 (32%), Positives = 380/770 (49%), Gaps = 134/770 (17%)
Query: 61 ISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVS 120
+ +A+ LI+++TL EKI Q+ + IPRLGI Y+WW+E LHG+ +G
Sbjct: 9 MQQKARRLINMMTLDEKISQMMNETPGIPRLGIKPYDWWNEGLHGVGRDG---------- 58
Query: 121 SVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ--------AGLTFWAPNINIF 172
T FPQ + A+FN +L IG A+A E RA YN+ Q GLTFW+PNINIF
Sbjct: 59 RATVFPQPIGMGATFNPALIRQIGDAIATEGRAKYNVAQRNNNYARYTGLTFWSPNINIF 118
Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
RDPRWGRG ET GEDP + + +V+ QG +
Sbjct: 119 RDPRWGRGMETYGEDPFLTGTLGIAYVQGMQGND-------------------------- 152
Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
L ++AC KH Y + +R+ N T++D +T+ P F+ ++QG IM
Sbjct: 153 -PFYLKVAACGKH---YAVHSGPEATRHEANVSPTKRDLFETYLPAFKMLVQQGHVEAIM 208
Query: 293 CSYNQVNGVPAC-----LRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSA 347
+YN+V G AC L D+ +K +WGF+G+I SDCDAVA I KT ++
Sbjct: 209 GAYNRVYG-EACSGSKYLLTDVLRK---QWGFRGHIVSDCDAVADIHAGHKIVKTEAEAC 264
Query: 348 AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKY 407
A +KAG++I CG + A+ + + E++IDRALL L +L+LG+ D + Y
Sbjct: 265 AIAIKAGLNIECGHT-FEAMKQAVAQKLLTEQEIDRALLPLMMTRLKLGILEYDA-ECPY 322
Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
++ ++C+ EH LA AA + +VLLKN+ LPL+KN + +L I GP ++ + G
Sbjct: 323 NEVKETEICSPEHIALARKAATESMVLLKNN-GILPLDKN-LHTLFIAGPGASDSFWLMG 380
Query: 468 GYTGIPCSPKSLLRGLEAYVSK----THYASGCHDVPCNSDAGFHEAVRIAKKADFVIVV 523
Y GI + L+G+ VS + P + + A+ A A+ IVV
Sbjct: 381 NYFGISNRYCTYLQGIADKVSSGTAVNFRPAFGESTPTKNTINW--ALDEAIAAEKTIVV 438
Query: 524 AGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
G + E E+ DRVS+ LP QM + + + K +++VLTGG P+DV
Sbjct: 439 MGNNGNLEGEEGESIASETRGDRVSMRLPASQMKFLRDL-KARKNGIVVVLTGGSPIDVR 497
Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV-PMNDMNMR 633
E +++ YPG+ G ALA+++FGD N GRLP+T +PES + P D M+
Sbjct: 498 --EISRLADAVVMAWYPGQEGGYALADLLFGDENFSGRLPVT-FPESTDALPPFEDYAMK 554
Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSY---KFLSAPSE---LTISASLKAGSD 687
GR+Y++ T Y FG+GLSYT +Y K + P + +T+SA
Sbjct: 555 --------GRTYKYQTAHIQYPFGYGLSYTTVTYAHAKVETMPQKGRGMTVSA------- 599
Query: 688 KNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPE 747
+L+ TG++ +DEV V +S AG ++ F R+ + G E
Sbjct: 600 --VLKNTGNKA----VDEVA-------QVYLSAPGAGTTAALASLVAFKRI-GLQPG--E 643
Query: 748 KQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
+QL+ FD P ++L + G L GN+ + VG
Sbjct: 644 QQLVRFD----------------IPFDRLLTVQEDGTAQLLKGNYTITVG 677
>gi|424661938|ref|ZP_18098975.1| hypothetical protein HMPREF1205_02324 [Bacteroides fragilis HMW
616]
gi|404578249|gb|EKA82984.1| hypothetical protein HMPREF1205_02324 [Bacteroides fragilis HMW
616]
Length = 722
Score = 345 bits (886), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 226/745 (30%), Positives = 362/745 (48%), Gaps = 97/745 (13%)
Query: 56 NTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF 115
+ S I+ R K+LI +TL EK QL + +IPRL +PAY +W+E LHG+A G
Sbjct: 53 DLSQPIAVRVKTLIQQMTLAEKASQLVSESDSIPRLNLPAYNYWNECLHGVARAG----- 107
Query: 116 NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDP 175
VT FPQ + A++++ L + SA++ EAR Y GLT+W+P IN+ RDP
Sbjct: 108 -----EVTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDP 162
Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
RWGR +ET GEDP + S V FVK QG D
Sbjct: 163 RWGRNEETYGEDPYLTSRLGVAFVKGLQG---------------------------DHPA 195
Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
L A KH +A + E +R+S ++ I + + + P + +C+++ +M +Y
Sbjct: 196 YLKTVATIKHFVANNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEADVQSVMTAY 251
Query: 296 NQVNGVP-ACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
N NGVP + R L + R EWGF G++ SDC A+ + + E++AA + +G
Sbjct: 252 NAFNGVPPSGSRWLLGEVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSG 311
Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK-GKYGKLGPD 413
D+ CGT A+ +G + E ID+AL + + + +LG F DP + Y
Sbjct: 312 CDLECGTTYKEKLVQAVKQGLISEATIDQALTRVLTARFKLGEF--DPMELVPYNHYDKK 369
Query: 414 DVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIP 473
+ + +LA +AA + +VLLKN+ LPL+K S+A++GP ++ GGY+G P
Sbjct: 370 LLAGKKFAELAYEAAVKSVVLLKNE-NLLPLSKEKTKSVAVVGPFADH--NYLGGYSGQP 426
Query: 474 CSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETE 533
+LL+G++ + K + + + + D+ V K D V+V G D E
Sbjct: 427 PYSITLLKGVKDLMGKRGKVNYLNGIGASRDS----IVAAVKGVDVVLVALGSDEKMARE 482
Query: 534 DRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGE 593
+ D S+ LP +Q L+ ++ + + R ++LV G PL + AD+ I +I+ YPG+
Sbjct: 483 NHDMTSIYLPEEQEKLLKAIYQVNPR-IVLVFHSGNPLTSEW--ADTHIPAIMQAWYPGQ 539
Query: 594 AGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQV 653
+ALA ++FG+ NP G+LPMT Y + D +M + GR+YR+ G +
Sbjct: 540 EAGRALANLLFGNENPSGKLPMTIYKTEEQLPDILDFDM-------WKGRTYRYMKGEPL 592
Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
Y FGHGLSYT++ + + D + D + C+
Sbjct: 593 YSFGHGLSYTSFEFDNIQGN-------------------------DTLQPDAILQCS--- 624
Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEKQLIGFDRVHTVAKGSKEISFGVDP 772
+ ++N+G + G VV ++ T P K+L+ F +V + K++ F + P
Sbjct: 625 ----VELSNSGQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTIAP 680
Query: 773 CEQLSIANKHGRRILPLGNHVLMVG 797
E LS+ R+L G + L +G
Sbjct: 681 RE-LSVWEDGKWRMLS-GKYTLFIG 703
>gi|301090543|ref|XP_002895482.1| beta-glucosidase, putative [Phytophthora infestans T30-4]
gi|262098232|gb|EEY56284.1| beta-glucosidase, putative [Phytophthora infestans T30-4]
Length = 809
Score = 345 bits (885), Expect = 5e-92, Method: Compositional matrix adjust.
Identities = 255/758 (33%), Positives = 371/758 (48%), Gaps = 103/758 (13%)
Query: 44 CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASA---IPRLGIPAYEWWS 100
C P + FCN SLS + R + L+ L L EK+ L+ AS + +G+P Y W +
Sbjct: 27 CSSPEHQQFAFCNASLSTAERVEDLLRRLPLDEKVTLLTARASPKGNMSSIGLPEYNWGA 86
Query: 101 ESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG-- 158
+HG+ S G N TSFP + A F+ ++ V E RA++ G
Sbjct: 87 NCVHGVQSTC-GTN------CATSFPNPVNLGAIFDPRAVFDMAQVVGWELRALWLEGAR 139
Query: 159 -------QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDD 211
GL W+PNINI RDPRWGR ETP EDP+V S Y V + K Q +
Sbjct: 140 ENYATGPHLGLDCWSPNININRDPRWGRNMETPSEDPLVNSKYGVAYTKGLQ-------E 192
Query: 212 GGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDT 271
G ++KR L+ KH AY E + R +FNA+++ D
Sbjct: 193 G------KDKRFLQAV-------------VTLKHYAAYSYEHYDGIDRMAFNAVVSRYDF 233
Query: 272 EDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAV 330
DT+ P F + + GKA +MCSYN VNG+P C L K R+ GF GYITSD A+
Sbjct: 234 ADTYLPAFEASVVHGKAKGVMCSYNSVNGMPMCANEQLNSKLLRDALGFDGYITSDSGAI 293
Query: 331 ATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFS 390
A I+ ++YTKT ++ + +G D+N G+ + + G++ EK +D A+
Sbjct: 294 AGIYHQRHYTKTLCEAGRLAILSGTDVNSGSVYKQCLAELVTSGQLPEKAVDDAMRRTLK 353
Query: 391 VQLRLGLFNGDPRKGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAV 449
++ LGLF DP + Y + P++V T+E K+L+LD +R+ IVLL+N LPL K
Sbjct: 354 LRFELGLF--DPIDDQPYWHVAPNEVNTAESKQLSLDLSRKSIVLLQNHGNILPLAKG-- 409
Query: 450 SSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRG-----LEAYV-----SKTHYA--SGCH 497
LA+IGP + G Y G C L G LEA S T YA SG +
Sbjct: 410 KKLAVIGPHAAAKRALLGNYLGQMCHGDYLEVGCVQTPLEAITIANGASNTLYAKGSGIN 469
Query: 498 DVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTS 557
D S AGF EA A+KA+ V++ G+D + E E DR ++ +P QM L+ V R
Sbjct: 470 DT---STAGFDEAEAAARKAETVVLFLGIDTSIEREAWDRENIDMPNIQMQLLKRVRRAG 526
Query: 558 KRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTW 617
K P ++VL GG V E ++ YPG GA+A+++I+FGD P G+LP+T
Sbjct: 527 K-PTVVVLFNGGV--VGAEELILHTDGVVEAFYPGFFGAQAVSDILFGDAIPSGKLPVTM 583
Query: 618 YPESF-TKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
YP ++ T V M M+M +YPGRSYR+Y V+ FG GLSYT ++ S
Sbjct: 584 YPSNYVTSVDMKSMSM-----TKYPGRSYRYYKEVPVFPFGWGLSYTRFTMALDS----- 633
Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
S+ + S+ ++ + +D+ V + ++N G++ G VV F
Sbjct: 634 --SSGVTDPSEPIVVTR--------QLDQT---------VTVILSNDGNLVGDEVVFAFF 674
Query: 737 RVPKV----SQGTPEKQLIGFDRVHTVAKGSKEISFGV 770
R KV + +QL + RV +++ F +
Sbjct: 675 RPLKVNATGNAALLNEQLFDYRRVSLRPTQYRKLKFRI 712
>gi|363742357|ref|XP_003642627.1| PREDICTED: probable beta-D-xylosidase 5-like [Gallus gallus]
Length = 748
Score = 345 bits (884), Expect = 7e-92, Method: Compositional matrix adjust.
Identities = 250/755 (33%), Positives = 356/755 (47%), Gaps = 114/755 (15%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDN-------ASAIPRLGIPAYEWWSESLH 104
+PF + +L R + L+ LT E + Q++ A IPRLGI Y W +E L
Sbjct: 27 FPFRDPTLPWHRRLEDLLGRLTPAEMVLQMARGGALGNGPAPPIPRLGIAPYNWNTECLR 86
Query: 105 GIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL------- 157
G A PG T+FPQ L AA+F+ L + +A A E RA +N
Sbjct: 87 GDA-EAPG--------WATAFPQALGLAAAFSPELVYRVANATATEVRAKHNSFVAAGRY 137
Query: 158 -GQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGF 216
GL+ ++P +NI R P WGR QET GEDP + + A FV+ QG++
Sbjct: 138 DDHTGLSCFSPVLNIMRHPLWGRNQETYGEDPYLTAELATSFVQGLQGQH---------- 187
Query: 217 GFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQ 276
R +K SA CKH + + SR SF+A + E+D TF
Sbjct: 188 ----PRYIKA-------------SAGCKHFSVHGGPENIPVSRLSFDAKVLERDWHTTFL 230
Query: 277 PPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFE 335
P F++C+ G S MCSYN++NGVPAC L R EWGF+GY+ SD AV I
Sbjct: 231 PQFQACVRAGSYS-FMCSYNRINGVPACANKKLLTDILRGEWGFEGYVVSDEGAVELILL 289
Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCM----LRHTQSAIDKGKVQEKDIDRALLNLFSV 391
YT T ++A + AG+++ M H A+ G + + + + LF
Sbjct: 290 GHRYTHTFLETAIASVNAGLNLELSYGMRNNVFMHIPKALAMGNITLEMLRDRVRPLFYT 349
Query: 392 QLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSS 451
+LRLG F+ P Y L V +SEH+ L+L+AA + VLLKN + LPL +
Sbjct: 350 RLRLGEFD-PPAMNPYNALELSVVQSSEHRNLSLEAAIKSFVLLKNQRDTLPLRELHGKR 408
Query: 452 LAIIGPLVNNISQMGGGYTGIPCSPKSLL---RGLEAYVSKTHYASGCHDVPC--NSDAG 506
LA++GP +N + G Y +P P+ + RGL+ + +A+GC + C S
Sbjct: 409 LAVVGPFADNPRVLFGDYAPVP-EPQYIYTPRRGLQTLPANVSFAAGCREPRCWVYSRDE 467
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSK-RPVILVL 565
AVR AD V+V G + E E RDR L LPG Q+ L+ R + PVIL+L
Sbjct: 468 VENAVR---GADVVLVCLGTGIDVEMEARDRKDLSLPGHQLQLLQDAVRAAAGHPVILLL 524
Query: 566 TGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD--FNPGGRLPMTWYPESFT 623
GPLDVS+A+ + +IL +P +A A+A ++ G +P GRLP TW P
Sbjct: 525 FNAGPLDVSWAQLHDGVGAILACFFPAQATGLAIASVLLGKQGASPAGRLPATW-PAGMH 583
Query: 624 KV-PMNDMNMRADSSRQYPGRSYRFY-TGTQVYGFGHGLSYTNYSYK-FLSAPSELTISA 680
+V PM + M GR+YR+Y +Y FG+GLSYT + Y+ + +P L I A
Sbjct: 584 QVPPMENYTME--------GRTYRYYGQEAPLYPFGYGLSYTTFHYRDLVLSPPVLPICA 635
Query: 681 SLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPK 740
+L V + + N G D VV L+ R +
Sbjct: 636 NLS--------------------------------VSVVLENTGPRDSEEVVQLYLRWEQ 663
Query: 741 VSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
S P QL+ F RV A G+ ++SFGV ++
Sbjct: 664 PSVPVPRWQLVAFRRVAVPAGGATKLSFGVTAAQR 698
>gi|423279990|ref|ZP_17258903.1| hypothetical protein HMPREF1203_03120 [Bacteroides fragilis HMW
610]
gi|404584326|gb|EKA88991.1| hypothetical protein HMPREF1203_03120 [Bacteroides fragilis HMW
610]
Length = 722
Score = 345 bits (884), Expect = 8e-92, Method: Compositional matrix adjust.
Identities = 228/745 (30%), Positives = 360/745 (48%), Gaps = 97/745 (13%)
Query: 56 NTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF 115
+ S I+ R K+LI +TL EK QL + +IPRL +PAY +W+E LHG+A G
Sbjct: 53 DLSQPIAVRVKTLIQQMTLAEKASQLVSESDSIPRLNLPAYNYWNECLHGVARAG----- 107
Query: 116 NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDP 175
VT FPQ + A++++ L + SA++ EAR Y GLT+W+P IN+ RDP
Sbjct: 108 -----EVTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDP 162
Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
RWGR +ET GEDP + S V FVK QG D
Sbjct: 163 RWGRNEETYGEDPYLTSRLGVAFVKGLQG---------------------------DHPA 195
Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
L A KH +A + E +R+S ++ I + + + P + +C+++ +M +Y
Sbjct: 196 YLKTVATIKHFVANNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEAGVQSVMTAY 251
Query: 296 NQVNGVP-ACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
N NGVP + R L + R EWGF G++ SDC A+ + + E++AA + +G
Sbjct: 252 NAFNGVPPSGSRWLLGEVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSG 311
Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK-GKYGKLGPD 413
D+ CGT A+ +G + E ID+AL + + + +LG F DP + Y
Sbjct: 312 CDLECGTTYKEKLVQAVKQGLISEATIDQALTRVLTARFKLGEF--DPMELVPYNHYDKK 369
Query: 414 DVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIP 473
+ + +LA +AA + +VLLKN+ LPL+K S+A++GP ++ GGY+G P
Sbjct: 370 LLAGKKFAELAYEAAVKSVVLLKNE-NLLPLSKEKTKSVAVVGPFADH--NYLGGYSGQP 426
Query: 474 CSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETE 533
+LL+G++ + K + + + + D+ V K D V+V G D E
Sbjct: 427 PYSVTLLKGVKDLMGKRGKVNYLNGIGASRDS----IVAAVKGVDVVLVALGSDEKMARE 482
Query: 534 DRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGE 593
+ D S+ LP +Q L+ ++ + + R ++LV G PL + AD I +I+ YPG+
Sbjct: 483 NHDMTSIYLPEEQEKLLKAIYQVNPR-IVLVFHSGNPLTSEW--ADVHIPAIMQAWYPGQ 539
Query: 594 AGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQV 653
+ALA+++FG+ NP G+LPMT Y + D +M + GR+YR+ +
Sbjct: 540 EAGRALADLLFGNENPSGKLPMTIYRAEDQLPDILDFDM-------WKGRTYRYMKEDPL 592
Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
YGFGHGLSYT++ + + S +LK+G+ R
Sbjct: 593 YGFGHGLSYTSFGFDGIQG------SDTLKSGA--------------------------R 620
Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEKQLIGFDRVHTVAKGSKEISFGVDP 772
+ ++N G G VV ++ T P K+L+ F +V K + F + P
Sbjct: 621 LQCSVELSNTGKWTGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLAPGEKKRVEFNIPP 680
Query: 773 CEQLSIANKHGRRILPLGNHVLMVG 797
E LS+ R+L G + L +G
Sbjct: 681 RE-LSVWENGNWRML-TGKYTLFIG 703
>gi|340369765|ref|XP_003383418.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
queenslandica]
Length = 748
Score = 344 bits (883), Expect = 9e-92, Method: Compositional matrix adjust.
Identities = 246/781 (31%), Positives = 363/781 (46%), Gaps = 115/781 (14%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDN-------ASAIPRLGIPAYEWWSESLH 104
+PF + SL I R K ++ L+L + ++Q++ A IP+ I Y+W +E L
Sbjct: 27 FPFRDPSLPIEERVKDIVDQLSLDQLVEQMAHGGAGSNGPAPGIPKFNIKPYQWGTECLS 86
Query: 105 GIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG------ 158
G + G TSFP + AASFN L + +A A E RA
Sbjct: 87 GDVNAG----------DATSFPMSIGMAASFNYDLLKQVSNATAYEVRAKNTAAVLNGSY 136
Query: 159 --QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGF 216
GL+ W+P +NI RDPRWGR QET GEDP + FV QG
Sbjct: 137 AFHTGLSCWSPVLNIMRDPRWGRNQETYGEDPYLSGYLGQAFVTGLQG------------ 184
Query: 217 GFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQ 276
D ++ +A CKH + + R SF+A +T D TF
Sbjct: 185 ---------------DDPTYVIANAGCKHFDVHGGPEDTPLPRASFDANVTMIDWRMTFL 229
Query: 277 PPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFE 335
P F++C+E G S +MCSYN++NGVPAC L RNEW FKGY+ SD A+ I
Sbjct: 230 PQFKACVEAGALS-LMCSYNRINGVPACANKKLLTDILRNEWNFKGYVVSDQGALENIVT 288
Query: 336 YQNY-----TKTHEDSAAGV-LKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLF 389
+Y T + + AG L+ G G + + A++KG V + A+ LF
Sbjct: 289 QHHYAPDFVTAAADAANAGTCLEDGNSEGKGGNVFDNLDDAVEKGLVSVDTLKDAVSRLF 348
Query: 390 SVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKK---FLPLNK 446
V+ +LG F+ Y + + + EH KL++ AA + IVL+KND FLPL
Sbjct: 349 YVRTKLGEFDPPDNNNPYANIPLSIIQSDEHIKLSIQAAMETIVLMKNDNDGSPFLPLAA 408
Query: 447 NAVSSLAIIGPLVNNISQMGGGY-----TGIPCSPKSLLRGLEAYVSKTHYASGCHDVP- 500
+ ++GP + N M G Y T +P + ++ + +Y GC D P
Sbjct: 409 DDFKKACVVGPFIENADTMFGDYSPTMMTDYIVTPLAGIKTTQIGSDLLNYEDGCTDGPA 468
Query: 501 CNSDAGFHEAVRIA-KKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTS-K 558
C G+ VR A + D VIV AGL E E D + LPG QMSL+T S
Sbjct: 469 CEIYDGYK--VRTACEGVDLVIVTAGLSRYLEHEGHDISDIYLPGHQMSLLTDAESASGS 526
Query: 559 RPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWY 618
P+IL+L PLD+S+A+++ + ++IL YPG+ A+A ++ G +NP GRLP TW
Sbjct: 527 APIILLLFNANPLDISYAKSNPRFAAILEAYYPGQEAGVAIANVLTGSYNPAGRLPNTW- 585
Query: 619 PESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELT 677
P S +VP M D M+ R+YR++T +Y FG+GLS+T ++Y S+L
Sbjct: 586 PASLDQVPDMIDYTMKE--------RTYRYFTQEPLYPFGYGLSFTTFNY------SDLN 631
Query: 678 ISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFAR 737
++++ + +I V ++V N G +DG V + +
Sbjct: 632 VASTANTNGEGSIA------------------------VSVTVMNTGTMDGDEVTQAYVK 667
Query: 738 VPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCE-QLSIANKHGRRILPLGNHVLMV 796
V++ P QL+G R S +SF + P + Q+ I G+ +P G + L V
Sbjct: 668 WDNVAEA-PNIQLVGVSRKFISKGQSITVSFTIKPEQLQVWINGDDGKWSIPGGTYSLFV 726
Query: 797 G 797
G
Sbjct: 727 G 727
>gi|301118693|ref|XP_002907074.1| glycoside hydrolase, putative [Phytophthora infestans T30-4]
gi|262105586|gb|EEY63638.1| glycoside hydrolase, putative [Phytophthora infestans T30-4]
Length = 809
Score = 343 bits (880), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 254/758 (33%), Positives = 370/758 (48%), Gaps = 103/758 (13%)
Query: 44 CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASA---IPRLGIPAYEWWS 100
C P + FCN SLS + R + L+ L L EK+ L+ AS + +G+P Y W +
Sbjct: 27 CSSPEHQQFAFCNASLSTAERVEDLLRRLPLDEKVTLLTARASPKGNMSSIGLPEYNWGA 86
Query: 101 ESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG-- 158
+HG+ S G N TSFP + A F+ ++ V E RA++ G
Sbjct: 87 NCVHGVQSTC-GTN------CATSFPNPVNLGAIFDPRAVFDMAQVVGWELRALWLEGAR 139
Query: 159 -------QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDD 211
GL W+PNINI RDPRWGR ETP EDP+V S Y V + K Q +
Sbjct: 140 ENYATGPHLGLDCWSPNININRDPRWGRNMETPSEDPLVNSKYGVAYTKGLQ-------E 192
Query: 212 GGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDT 271
G ++KR L+ KH AY E + R +FNA+++ D
Sbjct: 193 G------KDKRFLQAV-------------VTLKHYAAYSYEHYDGIDRMAFNAVVSRYDF 233
Query: 272 EDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAV 330
DT+ P F + + GKA +MCSYN VNG+P C L K R+ GF GYITSD A+
Sbjct: 234 ADTYLPAFEASVVHGKAKGVMCSYNSVNGMPMCANEQLNSKLLRDALGFDGYITSDSGAI 293
Query: 331 ATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFS 390
A I+ ++YTKT ++ + +G D+N G+ + + G++ EK +D A+
Sbjct: 294 AGIYHQRHYTKTLCEAGRLAILSGTDVNSGSVYKQCLAELVTSGQLPEKAVDDAMRRTLK 353
Query: 391 VQLRLGLFNGDPRKGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAV 449
++ LGLF DP + Y + P++V T+E K+L+LD +R+ IVLL+N LPL K
Sbjct: 354 LRFELGLF--DPIDDQPYWHVAPNEVNTAESKQLSLDLSRKSIVLLQNHGNILPLAKG-- 409
Query: 450 SSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRG-----LEAYV-----SKTHYA--SGCH 497
LA+IGP + G Y G C L G LEA S T YA SG +
Sbjct: 410 KKLAVIGPHAAAKRALLGNYLGQMCHGDYLEVGCVQTPLEAITIANGASNTLYAKGSGIN 469
Query: 498 DVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTS 557
D S GF EA A+KA+ V++ G+D + E E DR ++ +P QM L+ V R
Sbjct: 470 DT---STGGFDEAEAAARKAETVVLFLGIDTSIEREAWDRENIDMPNIQMQLLKRVRRAG 526
Query: 558 KRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTW 617
K P ++VL GG V E ++ YPG GA+A+++I+FGD P G+LP+T
Sbjct: 527 K-PTVVVLFNGGV--VGAEELILHTDGVVEAFYPGFFGAQAVSDILFGDAIPSGKLPVTM 583
Query: 618 YPESF-TKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
YP ++ T V M M+M +YPGRSYR+Y V+ FG GLSYT ++ S
Sbjct: 584 YPSNYVTSVDMKSMSM-----TKYPGRSYRYYKEVPVFPFGWGLSYTRFTMALDS----- 633
Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
S+ + S+ ++ + +D+ V + ++N G++ G VV F
Sbjct: 634 --SSGVTDPSEPIVVTR--------QLDQT---------VTVILSNDGNLVGDEVVFAFF 674
Query: 737 RVPKV----SQGTPEKQLIGFDRVHTVAKGSKEISFGV 770
R KV + +QL + RV +++ F +
Sbjct: 675 RPLKVNATGNAALLNEQLFDYRRVSLRPTQYRKLKFRI 712
>gi|325679939|ref|ZP_08159508.1| glycosyl hydrolase family 3 C-terminal domain protein [Ruminococcus
albus 8]
gi|324108377|gb|EGC02624.1| glycosyl hydrolase family 3 C-terminal domain protein [Ruminococcus
albus 8]
Length = 691
Score = 343 bits (880), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 218/633 (34%), Positives = 330/633 (52%), Gaps = 85/633 (13%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ + SLS RA++L +T +E+ QL +A AI RLGIPAY WW+E +HG+A +G
Sbjct: 4 YLDESLSAEERAEALTDEMTTEEQASQLRYDAPAIERLGIPAYNWWNEGIHGLARSG--- 60
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
T FPQ + AA F+ L + EARA YN GLT W
Sbjct: 61 -------VATMFPQAIGLAAMFDDELTKRTAEITSEEARAKYNAYTVEGDRDIYKGLTLW 113
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
APNINIFRDPRWGRG ET GEDP + + V+ QG+ +V+K
Sbjct: 114 APNINIFRDPRWGRGHETFGEDPYLTAQNGKAVVRGLQGDG---------------KVMK 158
Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
+AC KH + + R+SF+A +D E+T+ P F + +++
Sbjct: 159 A-------------AACAKH---FAVHSGPEALRHSFDAKADAKDMEETYLPAFEALVKE 202
Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHED 345
K +M +YN+VNG PAC L +K + EW F GY SDC A+ E+ T +
Sbjct: 203 AKVESVMGAYNRVNGEPACASDYLMEKLK-EWEFDGYFVSDCWAIRDFHEHHMVTANAVE 261
Query: 346 SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG 405
SAA LKAG D+NCG C ++ +A+DKG + ++ I A ++L ++RLG+F+ +
Sbjct: 262 SAAMALKAGCDVNCG-CTYQNLLAALDKGLITKEQIRTACVHLMRTRIRLGMFD---KHT 317
Query: 406 KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQM 465
+ + V +EHK ++L+ A + +VLLKN+ LPL+ ++A+IGP ++ + +
Sbjct: 318 DFDDIPYSKVACAEHKAVSLECAEKSLVLLKNN-GILPLDDKKYKTIAVIGPNADSRTAL 376
Query: 466 GGGYTGIPCSPKSLLRGLE-AYVSKTHYASGCH----DVPCNSDAG--FHEAVRIAKKAD 518
G Y G+ + L G++ + + +A GCH + + AG + EAV AK AD
Sbjct: 377 EGNYNGLSDRYTTFLNGIQDRFEGRVIFAEGCHLYKKSISGLAQAGDRYAEAVAAAKNAD 436
Query: 519 FVIVVAGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGG 569
VI+ GLD T E E+ D+ L LP Q LV + K PV+ V+ G
Sbjct: 437 LVIMCVGLDATIEGEEGDTGNEFSSGDKNGLTLPPPQKILVEKIMSVGK-PVVTVVCAGS 495
Query: 570 PLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMND 629
++ +SQ +++ YPG G+KALAE++FGD +P G+LP+T+Y ++ D
Sbjct: 496 AINT-----ESQPDALIHAFYPGAEGSKALAEVLFGDVSPSGKLPVTFYEDTDKLPEFTD 550
Query: 630 MNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
+M+ GR+YR+ T ++ FG+GL+Y
Sbjct: 551 YSMK--------GRTYRYTTDNILFPFGYGLTY 575
>gi|291240559|ref|XP_002740189.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 745
Score = 343 bits (879), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 243/758 (32%), Positives = 355/758 (46%), Gaps = 110/758 (14%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDN-------ASAIPRLGIPAYEWWSESLH 104
+PF NTSL + R + L+ L L+E + Q+S A I RL I Y W +E L
Sbjct: 25 FPFRNTSLPWNKRVEDLVGRLKLEEIVLQMSRGGRYSNGPAPPIDRLNIGPYSWNTECLR 84
Query: 105 GIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL------- 157
G S GP TSFPQ AA+F+ L I +A A E RA YN
Sbjct: 85 GDLSAGPA----------TSFPQAFGLAATFDAVLIKQIANATAYEVRAKYNNYTKHKEY 134
Query: 158 -GQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGF 216
GL+ ++P INI R P WGR QET GEDP + A FV QG + + G
Sbjct: 135 GDHKGLSCFSPVINIARHPLWGRIQETYGEDPYLSGTLAASFVTGLQGNHPRYVTANAG- 193
Query: 217 GFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQ 276
CKH AY + SR +F+A ++++D TF
Sbjct: 194 --------------------------CKHFDAYAGPENIPSSRSTFDAKVSDRDLRMTFL 227
Query: 277 PPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFE 335
P F CI+ G S +MCSYN +NGVPAC L R EW F GY+ SD AV +++
Sbjct: 228 PAFHECIQAGTYS-LMCSYNSINGVPACANKKLLTDILRTEWNFTGYVISDQSAVEKVYD 286
Query: 336 YQNYTKTHEDSAAGVLKAGMDI----NCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSV 391
+YTK D+A + +G+++ N ++ T A+ +G V K + + LF
Sbjct: 287 AHHYTKDMLDTAIACVNSGLNLELSSNLTDNVMMQTTKAVKQGNVTMKTVKARVSPLFYT 346
Query: 392 QLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSS 451
++RLG F+ P Y KL + + EH++L+L AA + VLLKN+ +FLPL K +
Sbjct: 347 RMRLGEFD-PPEMNPYSKLDLSIIQSQEHQELSLKAAAKSFVLLKNENRFLPL-KEKIDK 404
Query: 452 LAIIGPLVNNISQMGGG----YTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGF 507
LA++GP +N ++ G + + +P+ L + T +ASGC C ++
Sbjct: 405 LAVVGPFGDNPIEIYGSKSPDVSNLTVTPRYGLSKIARLA--TTFASGCLSPAC-TEYDP 461
Query: 508 HEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVT-SVARTSKRPVILVLT 566
+ + D V+V G E E DR L LPGQQ+ L+ +V + +PVIL+L
Sbjct: 462 KSTKQAIDRVDMVVVCLGTGNEVENEAHDRSELTLPGQQLRLLQDAVTFAADKPVILLLF 521
Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD--FNPGGRLPMTWYPESFTK 624
GPLD+++A ++ I I+ +P + AL + NPGGRLP+TW P+S ++
Sbjct: 522 NAGPLDITWAVSNPAIPVIVECFFPAQTTGTALYHLFVNSPGSNPGGRLPITW-PKSMSQ 580
Query: 625 VP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLK 683
VP M D M GR+YR++ G ++ FG+GLSYT + Y S+L I+ S
Sbjct: 581 VPPMEDYTME--------GRTYRYFNGDPLFPFGYGLSYTTFHY------SDLLITPS-- 624
Query: 684 AGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ 743
+ C+S+ ++ + + N GDV G V + S
Sbjct: 625 --------------------TPIKPCSSI--NIDVFLENTGDVTGDEVTQFYLSWKNASI 662
Query: 744 GTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANK 781
P+ QL+G R +K I+ V P NK
Sbjct: 663 PVPKWQLVGVSRTQLQSKTFANIAIIVPPRLMAVYTNK 700
>gi|325970053|ref|YP_004246244.1| beta-glucosidase [Sphaerochaeta globus str. Buddy]
gi|324025291|gb|ADY12050.1| Beta-glucosidase [Sphaerochaeta globus str. Buddy]
Length = 698
Score = 342 bits (878), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 243/767 (31%), Positives = 366/767 (47%), Gaps = 120/767 (15%)
Query: 59 LSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGT 118
++ RA+ L+ + L + + QL +A AI LGIPAY WW+E LHG A +G
Sbjct: 1 MTARQRAQELVERMNLPQMMSQLRHDAPAIESLGIPAYNWWNEGLHGSARSG-------- 52
Query: 119 VSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ--------AGLTFWAPNIN 170
+ T FPQ + A+ F+ + S V+ E RA YNL GLT W+PN+N
Sbjct: 53 --TATVFPQAIGLASLFDPDFLYAVASVVSTEQRAKYNLFTHENDRDIYKGLTVWSPNVN 110
Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
IFRDPRWGRGQET GEDP + + AV F++ QGE
Sbjct: 111 IFRDPRWGRGQETFGEDPYLTARLAVAFIRGLQGE------------------------- 145
Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
G L ++C KH A+ + R+ FNA++ ++D E+T+ P F S +++ KA
Sbjct: 146 ---GPVLKTASCVKHFAAHSGPE---PLRHGFNAVVGKKDLEETYLPAFASAVKEAKADA 199
Query: 291 IMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
+M +Y+ +N P C L ++ R WGF+G SDC A+ TK E+SAA
Sbjct: 200 VMGAYSALNDEPCCASSFLMEETLRLRWGFEGMYISDCWAIRDFHLNHKVTKNEEESAAL 259
Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGK 409
LK G D+ CG C + + A KG + + I +A + + + + +LG F+ + Y
Sbjct: 260 ALKRGCDLACG-CEYQSLEKAFQKGLITREQIKKAAIRVMTTRFKLGQFD---QGTAYDT 315
Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY 469
LG + + + EH LA +A+ + +VLLKND LPL K AVS LA+IGP ++ + G Y
Sbjct: 316 LGLESLDSDEHAALAFEASCRSLVLLKND-ALLPLKKEAVSCLAVIGPNADSRQALWGNY 374
Query: 470 TGIPCSPKSLLRGLEAYV---SKTHYASGCH------DVPCNSDAGFHEAVRIAKKADFV 520
G ++L GL YV ++ Y+ G + + D EAV +AK +D V
Sbjct: 375 HGTSSRYVTILEGLRDYVGSSTRILYSEGSNLTKNKVERLAKDDDRLSEAVFMAKASDVV 434
Query: 521 IVVAGLDLTQETEDR---------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPL 571
++ GL+ T E E D+ L LP Q L+ +VA T K P+I+VL GG L
Sbjct: 435 VLCLGLNETVEGEMHDDGNGGWAGDKDDLRLPLCQRKLLKAVAETGK-PIIVVLLSGGSL 493
Query: 572 DVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMN 631
D E + + +++ YPG+ G KA+A +++G P G+LP+T+Y P D +
Sbjct: 494 DPEI-EQYANVKALIQAWYPGQEGGKAIAHLLYGALCPSGKLPVTFYKAEAKLPPFTDYS 552
Query: 632 MRADSSRQYPGRSYRFYTGTQV-YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNI 690
+ R+YR+ V Y FG GLSY ++S+ LSA E +++N
Sbjct: 553 LIR--------RTYRYCDDPDVLYPFGFGLSYASFSF-CLSAAQE----------TEQNG 593
Query: 691 LQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQL 750
+ T + V N +D VV L+ + + P L
Sbjct: 594 VAAT-----------------------VLVRNTSALDARTVVQLYLAM-EGKDLPPHPVL 629
Query: 751 IGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
G VH A +I+F ++ +Q + + G R G + L G
Sbjct: 630 CGMKSVHLKAGEETQITFILEE-KQFTAVQEDGNRYAVRGGYTLYAG 675
>gi|332638085|ref|ZP_08416948.1| glycoside hydrolase family 3 protein [Weissella cibaria KACC 11862]
Length = 713
Score = 342 bits (878), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 225/763 (29%), Positives = 370/763 (48%), Gaps = 121/763 (15%)
Query: 64 RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
+AK ++ +T+ EKI Q+ A AI RL IP Y +W+E+LHG+A G T
Sbjct: 13 QAKVIVDQMTIDEKIGQIKYEAPAIERLNIPEYNYWNEALHGVARAG----------VAT 62
Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFWAPNINIFRDP 175
FPQ + AA+F+ L ++I + E RA YN GLTFW+PN+NIFRDP
Sbjct: 63 VFPQAIGLAATFDDQLINDIADVIGTEGRAKYNEFTKHEDRDIYKGLTFWSPNVNIFRDP 122
Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
RWGRG ET GEDP + S + V F+K QG+
Sbjct: 123 RWGRGHETYGEDPFLTSKFGVAFIKGLQGQ----------------------------AK 154
Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
L L+A KH + + R+ F+A+++++D +T+ P F++ +E+ IM +Y
Sbjct: 155 YLKLAATAKHFAVHSGPEG---LRHGFDAVVSDKDLYETYLPAFKAAVEEADVESIMTAY 211
Query: 296 NQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
N V+GVPA + L + ++W F+G++ SD A + E YTK ++ +KAG
Sbjct: 212 NAVDGVPASVSEMLLRDILHDKWSFEGHVVSDYMAPEDVHENHKYTKDAAETMGLAIKAG 271
Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDD 414
+++ G + + A+++G V E++I A+++L++ ++RLG+F D +Y + +
Sbjct: 272 LNLVAG-HIEQSLHEALNRGLVTEEEITNAVISLYATRVRLGMFATD---NEYDAIPYEA 327
Query: 415 VCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPC 474
T H L+ AA + VLLKND LPL K + ++A++GP ++ + G Y G P
Sbjct: 328 NDTKAHNNLSEIAAEKSFVLLKNDG-VLPLRKETMEAIAVVGPNAHSEIALLGNYFGTPS 386
Query: 475 SPKSLLRGLEAYVS---KTHYASG-------CHDVPCNSDAGFHEAVRIAKKADFVIVVA 524
++L G++ + + HY+ G + +D EA+ A+ +D ++ V
Sbjct: 387 RSYTILEGIQERLGDDVRVHYSIGSGVFQDHAAEPLAKADERESEAIIAAEHSDVIVAVL 446
Query: 525 GLDLTQETEDRD---------RVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
GLD T E E+ D + +L LPG+Q L+ + K PV+++L G L +
Sbjct: 447 GLDSTIEGEEGDAGNSQGAGDKPNLSLPGRQRQLLERLLAVGK-PVVVLLASGSSLQLDG 505
Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRAD 635
E + +I+ I YPG G A+A+++FG +P G+LP+T+Y + D NM
Sbjct: 506 LENHPNLRAIMQIWYPGARGGLAVADVLFGTVSPSGKLPVTFYKNTDNLPAFEDYNM--- 562
Query: 636 SSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTG 695
GR+YR+ T +Y FG+GL+Y++ L S
Sbjct: 563 -----AGRTYRYMTEEALYPFGYGLTYSSVELSDLQVKS--------------------- 596
Query: 696 SRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDR 755
+E + T +++ N G+ D VV ++ + + P QL GF R
Sbjct: 597 -------YEETATAT-------VTIQNTGNFDTDEVVQVYVKDLESEFAVPNAQLKGFKR 642
Query: 756 VHTVAKGSKE-ISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
V + KGSK+ I+F + P + + ++ G + + VG
Sbjct: 643 VF-LGKGSKQTITFDLRP-QDFEVFDEQGHNFIDSNRFEISVG 683
>gi|313145345|ref|ZP_07807538.1| beta-glucosidase [Bacteroides fragilis 3_1_12]
gi|313134112|gb|EFR51472.1| beta-glucosidase [Bacteroides fragilis 3_1_12]
Length = 722
Score = 342 bits (878), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 230/740 (31%), Positives = 361/740 (48%), Gaps = 97/740 (13%)
Query: 61 ISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVS 120
I+ R K+LI +TL EK QL + +IPRL +PAY +W+E LHG+A G
Sbjct: 58 IAVRVKTLIQQMTLAEKASQLVSESDSIPRLNLPAYNYWNECLHGVARAG---------- 107
Query: 121 SVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRG 180
VT FPQ + A++++ L + SA++ EAR Y GLT+W+P IN+ RDPRWGR
Sbjct: 108 EVTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDPRWGRN 167
Query: 181 QETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLS 240
+ET GEDP + S V FVK QG D L
Sbjct: 168 EETYGEDPYLTSRLGVAFVKGLQG---------------------------DHPAYLKTV 200
Query: 241 ACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNG 300
A KH +A + E +R+S ++ I + + + P + +C+++ +M +YN NG
Sbjct: 201 ATIKHFVANNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEAGVQSVMTAYNAFNG 256
Query: 301 VP-ACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINC 359
VP + R L + R EWGF G++ SDC A+ + + E++AA + +G D+ C
Sbjct: 257 VPPSGSRWLLGEVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLEC 316
Query: 360 GTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK-GKYGKLGPDDVCTS 418
GT A+ +G + E ID+AL + + + +LG F DP + Y +
Sbjct: 317 GTTYKEKLVQAVKQGLISEATIDQALTRVLTARFKLGEF--DPMELVPYNHYDKKLLAGK 374
Query: 419 EHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKS 478
+ +LA +AA + +VLLKN+ LPL+K S+A++GP ++ GGY+G P +
Sbjct: 375 KFAELAYEAAVKSVVLLKNE-NLLPLSKEKTKSVAVVGPFADH--NYLGGYSGQPPYSVT 431
Query: 479 LLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRV 538
LL+G++ + K + + + + D+ V K D V+V G D E+ D
Sbjct: 432 LLKGVKDLMGKRGKVNYLNGIGASRDS----IVAAVKGVDVVLVALGSDEKMARENHDMT 487
Query: 539 SLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKA 598
S+ LP +Q L+ ++ + + R ++LV G PL + AD I +I+ YPG+ +A
Sbjct: 488 SIYLPEEQEKLLKAIYQVNPR-IVLVFHSGNPLTSEW--ADVHIPAIMQAWYPGQEAGRA 544
Query: 599 LAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGH 658
LA+++FG+ NP G+LPMT Y + D +M + GR+YR+ +YGFGH
Sbjct: 545 LADLLFGNENPSGKLPMTIYRAEDQLPDILDFDM-------WKGRTYRYMKEDPLYGFGH 597
Query: 659 GLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQI 718
GLSYT++ + + S +LK+G T+L+ V++
Sbjct: 598 GLSYTSFGFDGIQG------SDTLKSG------------------------TTLQCSVEL 627
Query: 719 SVTNAGDVDGSHVVMLFARVPKVSQGT-PEKQLIGFDRVHTVAKGSKEISFGVDPCEQLS 777
S N G G VV ++ T P K+L+ F +V K + F + P E LS
Sbjct: 628 S--NTGKWTGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLAPGEKKRVEFNIPPRE-LS 684
Query: 778 IANKHGRRILPLGNHVLMVG 797
+ R+L G + L +G
Sbjct: 685 VWENGNWRML-TGKYTLFIG 703
>gi|317057539|ref|YP_004106006.1| glycoside hydrolase family protein [Ruminococcus albus 7]
gi|315449808|gb|ADU23372.1| glycoside hydrolase family 3 domain protein [Ruminococcus albus 7]
Length = 691
Score = 342 bits (876), Expect = 6e-91, Method: Compositional matrix adjust.
Identities = 219/633 (34%), Positives = 331/633 (52%), Gaps = 85/633 (13%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ + +LS RA++L +T +E+ QL +A A+ RLGIPAY WW+E +HG+A +G
Sbjct: 4 YLDETLSAQERAEALTDEMTTEEQASQLRYDAPAVERLGIPAYNWWNEGIHGLARSG--- 60
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
T FPQ + AA F+ L + EARA YN GLT W
Sbjct: 61 -------VATMFPQAIGLAAMFDDELTKKTAEVTSEEARAKYNAYSGEEDRDIYKGLTLW 113
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
APNINIFRDPRWGRG ET GEDP + + + V+ QG DG +V+K
Sbjct: 114 APNINIFRDPRWGRGHETFGEDPYLTTKNGMAVVRGLQG------DG---------KVIK 158
Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
+AC KH + + R+SF+A +D E+T+ P F + +++
Sbjct: 159 A-------------AACAKH---FAVHSGPEAIRHSFDAKANAKDMEETYLPAFEALVKE 202
Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHED 345
K +M +YN+VNG PAC L K + EW F GY SDC A+ E T +
Sbjct: 203 AKVESVMGAYNRVNGEPACASNFLMDKLK-EWEFDGYFVSDCWAIRDFHENHMVTANAIE 261
Query: 346 SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG 405
S A LKAG D+NCG C ++ A++KG V ++DI A ++L ++RLG+F+ +K
Sbjct: 262 STAMALKAGCDVNCG-CTYQNLLVALEKGAVTKEDIRTACVHLMRTRIRLGMFD---KKT 317
Query: 406 KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQM 465
+Y + D V EHK ++L+ A + +V+L+N+ LP++ + ++A+IGP ++ + +
Sbjct: 318 EYDDIPYDKVACKEHKAISLECAEKSLVMLENN-GILPVDTSKYKTIAVIGPNADSRTAL 376
Query: 466 GGGYTGIPCSPKSLLRGLE-AYVSKTHYASGCH---DVPCN-SDAG--FHEAVRIAKKAD 518
G Y G+ + L G++ + + +A GCH D N + AG + EAV AK AD
Sbjct: 377 EGNYNGLSDRYTTFLNGIQDRFDGRVIFAEGCHLYKDRVSNLAQAGDRYAEAVAAAKFAD 436
Query: 519 FVIVVAGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGG 569
I+ GLD T E E+ D+ L LP Q LV + K PV+ V+ G
Sbjct: 437 MTILCLGLDATIEGEEGDTGNEFSSGDKNGLTLPPPQRELVKKIMAVGK-PVVTVVCAGS 495
Query: 570 PLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMND 629
++ +S+ +++ YPG G KALAE++FGD +P G+LP+T+Y ++ D
Sbjct: 496 AINT-----ESKPDALIHAFYPGAEGGKALAEVLFGDVSPSGKLPVTFYEDTDKLPEFTD 550
Query: 630 MNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
+M+ GR+YR+ T +Y FG+GL+Y
Sbjct: 551 YSMK--------GRTYRYTTENVLYPFGYGLTY 575
>gi|253579611|ref|ZP_04856880.1| glycoside hydrolase, family 3 domain-containing protein
[Ruminococcus sp. 5_1_39B_FAA]
gi|251849112|gb|EES77073.1| glycoside hydrolase, family 3 domain-containing protein
[Ruminococcus sp. 5_1_39BFAA]
Length = 706
Score = 342 bits (876), Expect = 7e-91, Method: Compositional matrix adjust.
Identities = 234/757 (30%), Positives = 356/757 (47%), Gaps = 121/757 (15%)
Query: 64 RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
+A+ L+S +TL EK QL +A+ + RLG+PAY +W+E+LHG+A G T
Sbjct: 14 KAEKLVSQMTLLEKASQLKYDAAPVKRLGVPAYNYWNEALHGVARAG----------VAT 63
Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFWAPNINIFRDP 175
FPQ + AA F+ +G +A E RA YN A GLTFW+PN+NIFRDP
Sbjct: 64 MFPQAIAMAAVFDDEEMKKVGDIIATEGRAKYNAYSAKEDRDIYKGLTFWSPNVNIFRDP 123
Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
RWGRG ET GEDP + S V+FV+ QG+ G
Sbjct: 124 RWGRGHETYGEDPYLTSRLGVKFVEGIQGD----------------------------GP 155
Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
+ +AC KH Y + R+ F+A + +D +T+ P F + + + +M +Y
Sbjct: 156 VMKAAACAKH---YAVHSGPESLRHEFDAQASMKDMWETYLPAFEALVTEADVEAVMGAY 212
Query: 296 NQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
N+ NG P C L + R +W F+G+ TSDC A+ E+ T T SAA L AG
Sbjct: 213 NRTNGEPCCAHKYLMEDVLRGKWKFEGHYTSDCWAIRDFHEHHMVTSTPRQSAAMALNAG 272
Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDD 414
D+NCG L H A G V E+ I + + L + + LGLF+G +Y K+
Sbjct: 273 CDLNCGNTYL-HMMGAYQDGLVTEEKITESAVRLLTTRYLLGLFDG----SEYDKIPYSV 327
Query: 415 VCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPC 474
V EH AL AR+ VLLKND LP++K V+++ +IGP ++ + + G Y G
Sbjct: 328 VECKEHIDEALKMARKSCVLLKND-GVLPIDKTKVNTIGVIGPNADSRAALIGNYHGTSS 386
Query: 475 SPKSLLRGLEAYVS---KTHYASGC-------HDVPCNSDAGFHEAVRIAKKADFVIVVA 524
++L G+ + Y+ GC ++ + D EAV A+ +D VI+
Sbjct: 387 EYITVLEGIREEAGDDVRILYSQGCDLYKDKVENLAWDQDR-ISEAVITAENSDVVILCV 445
Query: 525 GLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
GL+ T E E+ D+V L LP Q L+ V K P I+VL G +D+++
Sbjct: 446 GLNETLEGEEGDTGNSDASGDKVDLHLPKVQEELIEKVTAVGK-PTIVVLMAGSAIDLNY 504
Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRAD 635
A+ + IL YPG G +A+A+++FG +P G+LP+T+Y + D +M+
Sbjct: 505 AQ--DNCNGILLAWYPGARGGRAIADLLFGKESPSGKLPITFYKDLEGMPEFTDYSMK-- 560
Query: 636 SSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTG 695
R+YR+ +Y FG+GL+Y++ + +E + + A SD
Sbjct: 561 ------NRTYRYMEKEALYPFGYGLTYSD------TCVTEAEVVGEVSAESD-------- 600
Query: 696 SRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDR 755
++ +V N G VD VV ++ + L GF R
Sbjct: 601 ------------------IVLKATVKNNGTVDTDEVVQVYIKDLDSPLAVRNYSLCGFKR 642
Query: 756 VHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNH 792
V A K + F + + ++I ++ G R + G H
Sbjct: 643 VSLKAGEEKSVEFTISN-KAMNIVDEDGNRYIA-GKH 677
>gi|291240561|ref|XP_002740190.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 763
Score = 341 bits (875), Expect = 8e-91, Method: Compositional matrix adjust.
Identities = 245/756 (32%), Positives = 355/756 (46%), Gaps = 118/756 (15%)
Query: 51 SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIP--RLGIPAYEWWSESLHGIAS 108
+YPF NTSLS R L+S LTL E + Q++ + A P RLGI Y W SE LHG+
Sbjct: 25 AYPFQNTSLSWEERVDDLVSRLTLDEMVLQMARTSPAPPIDRLGIKPYVWNSECLHGVVP 84
Query: 109 NGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN--------LGQA 160
T+FPQ + AASF+ L S++ A+ +E RA +N
Sbjct: 85 PD---------GLATAFPQSIGLAASFSPDLLSDVAKAIGLEVRAKHNDYVQRGVYQEHT 135
Query: 161 GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFRE 220
GL+ ++P INI R P WGR QET GEDP ++ +V+ QG
Sbjct: 136 GLSCFSPVINIARHPLWGRNQETYGEDPFLIGELGSAYVRGLQG---------------- 179
Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFR 280
D ++ +A CKH + + SR+SF+A + E+D + TF P F
Sbjct: 180 -----------DHPRYVLANAGCKHFDVHGGPEDIPVSRFSFDAKVFERDWQMTFLPAFH 228
Query: 281 SCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNY 339
C++ G S +MCSYN++N VPAC L R EWGF GY+ SD AV I +Y
Sbjct: 229 ECVKAGVYS-VMCSYNRINEVPACANTRLLTDILRKEWGFDGYVVSDEGAVEFIMTSHHY 287
Query: 340 TKTHEDSAAGVLKAGMDIN----CGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
T + D+ A + AG +++ G M A+ GK++EK + + LF ++RL
Sbjct: 288 TDSIVDTVASAVNAGCNLDLAFPVGDGMYIKIGDAVTAGKIKEKTVVERVKPLFYTRMRL 347
Query: 396 GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQ-----GIVLLKNDKKFLPLNKNAVS 450
G F+ P Y L V + EH++LA+ AA Q VLLK + + LPL+ V+
Sbjct: 348 GEFD-PPELNPYANLNLSVVQSEEHRELAVKAALQSFVLLNFVLLKREGRVLPLD-TLVN 405
Query: 451 SLAIIGPLVNNISQMGGGYTGIPCSPKSLL----RGLEAYVSKTHYASGCHDVPCNSDAG 506
LA+IGP +N S + G Y+ P K + +GL T GC PC +
Sbjct: 406 KLAVIGPFADNPSYLFGDYS--PNPDKEFVVTPCKGLSNAARDTRCTPGCLTAPCTT--Y 461
Query: 507 FHEAVRIA-KKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSK-RPVILV 564
F E V+ A AD ++V G + E E DR L LPG+Q L+ V + + +P+IL+
Sbjct: 462 FSEMVKAAVTGADLIVVCLGTGVKIEAEFVDRSDLSLPGKQFQLLQDVVKYANGKPIILL 521
Query: 565 LTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIF-------GDFNPGGRLPMTW 617
L GPLD+ +A + I I+ +P +A AL + G+ NPGGRLP+TW
Sbjct: 522 LFNAGPLDIVWAVENPAIQVIVACFFPSQATGDALYRMFMNTHGVDTGNGNPGGRLPITW 581
Query: 618 YPESFTKV-PMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
P S +V PM + M GR+YR++ G ++ FG+GLSY ++SY S L
Sbjct: 582 -PRSMNQVPPMTNYTME--------GRTYRYFNGDPLFPFGYGLSYGSFSY------SSL 626
Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
I S + +C ++ V + G G V ++
Sbjct: 627 VIWPS-----------------------TIPACNGVKVSVTVYKLGPG---GDEVTQVYM 660
Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDP 772
S P+ QL+ F R + G E+ F + P
Sbjct: 661 SWNNASVVVPKLQLVAFKRFYLETNGVTEVHFTIAP 696
>gi|332377068|gb|AEE64772.1| Xyl3A [Ruminococcus albus 8]
Length = 691
Score = 340 bits (872), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 217/633 (34%), Positives = 328/633 (51%), Gaps = 85/633 (13%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ + SLS RA++L +T +E+ QL +A AI RLGIPAY WW+E +HG+A +G
Sbjct: 4 YLDESLSAEERAEALTDEMTTEEQASQLRYDAPAIERLGIPAYNWWNEGIHGLARSG--- 60
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
T FPQ + AA F+ L + EARA YN GLT W
Sbjct: 61 -------VATMFPQAIGLAAMFDDELTKRTAEITSEEARAKYNAYTVEGDRDIYKGLTLW 113
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
APNINIFRDPRWGR ET GEDP + + V+ QG+ +V+K
Sbjct: 114 APNINIFRDPRWGRSHETFGEDPYLTAQNGKAVVRGLQGDG---------------KVMK 158
Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
+AC KH + + R+SF+A +D E+T+ P F + +++
Sbjct: 159 A-------------AACAKH---FAVHSGPEALRHSFDAKADAKDMEETYLPAFEALVKE 202
Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHED 345
K +M +YN+VNG PAC L +K + EW F GY SDC A+ E+ T +
Sbjct: 203 AKVESVMGAYNRVNGEPACASDYLMEKLK-EWEFDGYFVSDCWAIRDFHEHHMVTANAVE 261
Query: 346 SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG 405
SAA LKAG D+NCG C ++ +A+DKG + ++ I A ++L ++RLG+F+ +
Sbjct: 262 SAAMALKAGCDVNCG-CTYQNLLAALDKGLITKEQIRTACVHLMRTRIRLGMFD---KHT 317
Query: 406 KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQM 465
+ + V +EHK ++L+ A + +VLLKN+ LPL+ ++A+IGP ++ + +
Sbjct: 318 DFDDIPYSKVACAEHKAVSLECAEKSLVLLKNN-GILPLDDKKYKTIAVIGPNADSRTAL 376
Query: 466 GGGYTGIPCSPKSLLRGLE-AYVSKTHYASGCH----DVPCNSDAG--FHEAVRIAKKAD 518
G Y G+ + L G++ + + +A GCH + + AG + EAV AK AD
Sbjct: 377 EGNYNGLSDRYTTFLNGIQDRFEGRVIFAEGCHLYKKSISGLAQAGDRYAEAVAAAKNAD 436
Query: 519 FVIVVAGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGG 569
VI+ GLD T E E+ D+ L LP Q LV + K PV+ V+ G
Sbjct: 437 LVIMCVGLDATIEGEEGDTGNEFSSGDKNGLTLPPPQKILVEKIMSVGK-PVVTVVCAGS 495
Query: 570 PLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMND 629
++ +SQ +++ YPG G KALAE++FGD +P G+LP+T+Y ++ D
Sbjct: 496 AINT-----ESQPDALIHAFYPGAEGGKALAEVLFGDVSPSGKLPVTFYEDTDKLPEFTD 550
Query: 630 MNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
+M+ GR+YR+ T ++ FG+GL+Y
Sbjct: 551 YSMK--------GRTYRYTTDNILFPFGYGLTY 575
>gi|413925161|gb|AFW65093.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 323
Score = 340 bits (871), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 163/322 (50%), Positives = 206/322 (63%), Gaps = 31/322 (9%)
Query: 36 VLNKPDFPCKPPHFDS---YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLG 92
V P F C P ++ FC+ +L+ + RA L+S LT EKI QL D A +PRLG
Sbjct: 25 VAGDPPFSCGPSSAEASEGLAFCDVTLAPAQRAADLVSRLTAAEKIAQLGDQAPGVPRLG 84
Query: 93 IPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEAR 152
+P Y+WW+E+LHG+A++G G++F+ V + TSFPQVL++AA+F+ LW IG A+ EAR
Sbjct: 85 VPGYKWWNEALHGLATSGKGLHFDAAVRAATSFPQVLLTAAAFDDDLWLRIGQAIGREAR 144
Query: 153 AMYNLGQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDD 211
A++N+GQA GLT W+PN+NIFRDPRWGRGQETPGEDP V S YAV FV+ QG + S
Sbjct: 145 ALFNVGQAEGLTIWSPNVNIFRDPRWGRGQETPGEDPAVASRYAVAFVRGIQGNSSSS-- 202
Query: 212 GGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDT 271
L SACCKH AYDLE W +RYSF A +TEQD
Sbjct: 203 ------------------------LLQTSACCKHATAYDLEDWNGVARYSFVARVTEQDL 238
Query: 272 EDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAV 330
EDTF PPFRSC+ + KASC+MC+Y +NGVPAC DL R +WG GY+ SDCDAV
Sbjct: 239 EDTFNPPFRSCVVEAKASCVMCAYTAINGVPACANSDLLTGTVRGDWGLDGYVASDCDAV 298
Query: 331 ATIFEYQNYTKTHEDSAAGVLK 352
A + + Q Y T ED+ A LK
Sbjct: 299 AIMRDAQRYAPTPEDAVAVSLK 320
>gi|358061481|ref|ZP_09148135.1| hypothetical protein HMPREF9473_00197 [Clostridium hathewayi
WAL-18680]
gi|356700240|gb|EHI61746.1| hypothetical protein HMPREF9473_00197 [Clostridium hathewayi
WAL-18680]
Length = 695
Score = 340 bits (871), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 209/626 (33%), Positives = 323/626 (51%), Gaps = 85/626 (13%)
Query: 64 RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
+A L+ +TL+E+ Q+ +A A+PRLGIPAY WW E LHG+A G + T
Sbjct: 9 KAVRLVEQMTLEERASQMRYDAPAVPRLGIPAYNWWGEGLHGVARAG----------TAT 58
Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG--------QAGLTFWAPNINIFRDP 175
FPQ + AA F+ L I + V+ E RA YN GLTFW+PN+NIFRDP
Sbjct: 59 MFPQAIAMAAMFDVELTEEIANVVSTEGRAKYNQFCEEGDRDIYKGLTFWSPNVNIFRDP 118
Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
RWGRG ET GEDP + S FV+ QG+ G+
Sbjct: 119 RWGRGHETYGEDPYLTSRLGTAFVRGLQGD----------------------------GE 150
Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
L ++AC KH + + R+ F A +++D +T+ P F +C+++ +M +Y
Sbjct: 151 HLKIAACAKH---FAVHSGPEALRHEFWADTSKKDLWETYLPAFEACVKEAHVESVMGAY 207
Query: 296 NQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
N +G P C L ++ R +WGF+G+ SDC A+ T T +SAA +K G
Sbjct: 208 NSYHGEPCCANTLLMEEILRGQWGFEGHFVSDCWAIRDFHMNYMVTDTAMESAALAVKKG 267
Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDD 414
D+NCG L+ A ++G + + + A++ LF+ + LG+ + +Y + +
Sbjct: 268 CDLNCGNTYLQ-VLKACEEGLLDDACVTEAVVRLFTTRYLLGM----GEETEYDDIPYEV 322
Query: 415 VCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPC 474
V EH++LA++AAR+ +VLLKND LPL+ ++++A+IGP +N + + G Y G
Sbjct: 323 VECKEHRELAVEAARRSMVLLKNDG-LLPLHAEKLNTIAVIGPNADNRTALIGNYHGTSS 381
Query: 475 SPKSLLRGLEAYVS---KTHYASGCH----DVPCNSDAG--FHEAVRIAKKADFVIVVAG 525
++L G++ V + YA GCH V + AG EA +AK +D V++ G
Sbjct: 382 CYTTILEGIQDAVGEDVRVLYAEGCHLFKDRVEHLAVAGDRLSEARIVAKHSDVVVLCVG 441
Query: 526 LDLTQETEDRD---------RVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFA 576
LD T E E+ D + LLLP Q L+ + K PV++ G +D+S A
Sbjct: 442 LDETLEGEEGDTGNSHASGDKKDLLLPESQRRLMEEILNLGK-PVVVCNMSGSAIDLSLA 500
Query: 577 EADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADS 636
+ + +++ + YPG G +ALA+++FG +P G+LP+T+Y + P D +M
Sbjct: 501 Q--EKAGAVIQVWYPGAEGGRALADLLFGKASPSGKLPVTFYKDLENLPPFEDYSM---- 554
Query: 637 SRQYPGRSYRFYTGTQVYGFGHGLSY 662
GR+YR+ T +Y FG GL+Y
Sbjct: 555 ----DGRTYRYLTAEPLYPFGFGLTY 576
>gi|429738050|ref|ZP_19271875.1| glycosyl hydrolase family 3 protein [Prevotella saccharolytica
F0055]
gi|429161155|gb|EKY03583.1| glycosyl hydrolase family 3 protein [Prevotella saccharolytica
F0055]
Length = 722
Score = 339 bits (869), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 241/760 (31%), Positives = 366/760 (48%), Gaps = 121/760 (15%)
Query: 64 RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
+AKS+IS LTL EKI QL+ +A I RLGI Y W +E+LHG+ +G T
Sbjct: 34 KAKSIISQLTLDEKISQLTQDAKGIDRLGIKPYYWLNEALHGVGRDG----------RAT 83
Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ--------AGLTFWAPNINIFRDP 175
FPQ + A+F+ + IG A+A E RA + + Q AGLTFWAPN+NIFRDP
Sbjct: 84 VFPQPINLGATFDPKIVHQIGDAIATEGRAKFIVAQRQKNYSMYAGLTFWAPNVNIFRDP 143
Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
RWGRG ET GEDP + FVK QG D
Sbjct: 144 RWGRGMETYGEDPFLTGTLGTAFVKGMQG---------------------------DDPF 176
Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
L +AC KH + + +R++ N T++D +T+ P F+ +++GK IM +Y
Sbjct: 177 YLKAAACGKH---FAVHSGPERTRHTANVEPTKRDLYETYLPAFKMLVQKGKVESIMGAY 233
Query: 296 NQVNGVPAC-----LRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGV 350
++ G +C L D+ R +WGFKG++ SDC AV ++E K+ ++ A
Sbjct: 234 QRLYG-ESCSGSKYLLTDIL---RKDWGFKGHVVSDCGAVTDMYEGHKLVKSEAEAVAFA 289
Query: 351 LKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKL 410
+KAG+++ CG M R + AI + + EKD+D+ALL L +L+LG+ D Y +
Sbjct: 290 IKAGLNLECGNSM-RTMKDAIQQKLITEKDLDKALLPLMMTRLKLGILQPDA-ACPYNEF 347
Query: 411 GPDDVCTSE-HKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY 469
P+ V SE ++K+A AA + +VLLKN+ LP+ K+ + +L + GP + + G Y
Sbjct: 348 -PESVIGSEANRKIAEQAAEESMVLLKNN-GVLPIAKD-IRTLFVTGPGATDAYYLMGNY 404
Query: 470 TGIPCSPKSLLRGLEAYVSK---THYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGL 526
G+ + L G+ VS +Y G V N + + +V ++ A+ I++ G
Sbjct: 405 FGLSNRYSTYLEGIVGKVSNGTSVNYKQGFMQVFKNLN-DVNWSVSESRGAEVSILIMGN 463
Query: 527 ---------DLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAE 577
D E DRV+L LP QM + V++ +++VLTGG P+DV E
Sbjct: 464 SGNTEGEEGDAIASAERGDRVNLRLPDSQMEYLREVSKDRTNKLVVVLTGGSPIDVK--E 521
Query: 578 ADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADS 636
+++ YPG+ G ALA ++FGD N GRLP+T +PES ++P +D +M+
Sbjct: 522 ITELADAVVMAWYPGQEGGVALANLLFGDANFSGRLPVT-FPESADRLPAFDDYSMK--- 577
Query: 637 SRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGS 696
GR+Y++ T +Y FG+GLSY+ +Y + T + +
Sbjct: 578 -----GRTYKYMTDNILYPFGYGLSYSKVTYSNAAVTKMPTKTTPMT------------- 619
Query: 697 RLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRV 756
V + VTN GD+ VV ++ P +P + LIGF RV
Sbjct: 620 -------------------VYVDVTNNGDMPVDEVVQVYLSTPGAGNTSPIESLIGFKRV 660
Query: 757 HTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
+ F + P E L G L G + + +
Sbjct: 661 KIYPHITVTKDFQI-PMELLETVQADGTSKLLKGEYQIKI 699
>gi|323344407|ref|ZP_08084632.1| beta-glucosidase [Prevotella oralis ATCC 33269]
gi|323094534|gb|EFZ37110.1| beta-glucosidase [Prevotella oralis ATCC 33269]
Length = 722
Score = 338 bits (866), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 237/759 (31%), Positives = 368/759 (48%), Gaps = 119/759 (15%)
Query: 64 RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
+AKS+IS LTL EKI QL+ +A I RLGI Y W +E+LHG+ +G T
Sbjct: 34 KAKSIISQLTLDEKISQLTQDAKGIDRLGIKPYYWLNEALHGVGRDG----------RAT 83
Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ--------AGLTFWAPNINIFRDP 175
FPQ + A+F+ + IG A+A E RA + + Q AGLTFWAPN+NIFRDP
Sbjct: 84 VFPQPISLGATFDPEIVQQIGDAIATEGRAKFIVAQRQKNYSMYAGLTFWAPNVNIFRDP 143
Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
RWGRG ET GEDP + FVK QG + F
Sbjct: 144 RWGRGMETYGEDPFLTGVLGTAFVKGMQGND----------PFY---------------- 177
Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
L +AC KH + + +R++ N T+ D +T+ P F+ ++QGK IM +Y
Sbjct: 178 -LKAAACGKH---FAVHSGPERTRHTANVEPTKHDLYETYLPAFKMLVQQGKVESIMGAY 233
Query: 296 NQVNGVPAC-----LRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGV 350
++ G +C L D+ R +WGFKG++ SDC AV ++E K+ ++ A
Sbjct: 234 QRLYG-ESCSGSKYLLTDIL---RKDWGFKGHVVSDCGAVTDMYEGHKLVKSEAEAVAFA 289
Query: 351 LKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKL 410
+KAG+++ CG M R + A+ + + EKD+D+ALL L +L+LG+ D Y +
Sbjct: 290 IKAGLNLECGNSM-RTMKDALKQKLITEKDLDKALLPLMMTRLKLGILQPD-VACPYNEF 347
Query: 411 GPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT 470
+ + +++ +A AA + +VLLKND LP+ K+ + +L + GP + + G Y
Sbjct: 348 PESVIGSIDNRNIAQRAAEESMVLLKND-GVLPIAKD-IRTLFVTGPGATDAYYLMGNYF 405
Query: 471 GIPCSPKSLLRGLEAYVSK---THYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGL- 526
G+ + L G+ VS +Y G V N + + +V ++ A+ I++ G
Sbjct: 406 GLSDRYSTYLEGIVGKVSNGTSVNYKQGFMQVFKNLN-DVNWSVSESRGAEVSIIIMGNS 464
Query: 527 --------DLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEA 578
D +E DRV L LP QM + V++ +++VLTGG P+DV E
Sbjct: 465 GNTEGEEGDAIASSERGDRVDLRLPEPQMQYLREVSKDRTNKLVVVLTGGSPIDVK--EI 522
Query: 579 DSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSS 637
+++ YPG+ G ALA ++FGD N GRLP+T +PE+ K+P +D +M+
Sbjct: 523 TELADAVVMAWYPGQEGGVALANLLFGDANFSGRLPVT-FPETTDKLPSFDDYSMK---- 577
Query: 638 RQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSR 697
GR+Y++ T +Y FG+GLSY +Y T ++
Sbjct: 578 ----GRTYKYMTDNILYPFGYGLSYGKVAYG-----------------------NATVTK 610
Query: 698 LDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVH 757
L H +S T V + ++N G++ VV ++ P +P + L+ F RV
Sbjct: 611 LPTKH----SSMT-----VSVDLSNDGNMPVDEVVQVYLSTPSAGVTSPIESLVAFKRVK 661
Query: 758 TVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
+ F + P E+L + G L G + +M+
Sbjct: 662 IAPHATVTTDFEI-PVERLETVQEDGTSKLLKGEYRVMI 699
>gi|405968899|gb|EKC33925.1| Putative beta-D-xylosidase 5 [Crassostrea gigas]
Length = 748
Score = 337 bits (864), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 234/760 (30%), Positives = 356/760 (46%), Gaps = 108/760 (14%)
Query: 42 FPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDN--------ASAIPRLGI 93
F P ++PF N SLS S R L+ LTL + +QQL+ A AI LGI
Sbjct: 14 FALTPLASSNFPFQNVSLSWSERVDDLVGRLTLDQIVQQLARGGAGLNGGPAPAIENLGI 73
Query: 94 PAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARA 153
Y+W +E L G G + TSFPQ + AA+F++ L N+ A A E RA
Sbjct: 74 GPYQWNTECLRGDVEAG----------NATSFPQAIGLAAAFSKDLIFNVSKAAATEVRA 123
Query: 154 MYN--------LGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGE 205
+N GL+ ++P +NI R P WGR QET GEDP + YA FV+ QG+
Sbjct: 124 KHNDFVKRGIFTDHTGLSCFSPVVNIMRHPLWGRNQETYGEDPYLSGTYASYFVQGLQGD 183
Query: 206 NWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM 265
+ DR + +A CKH A+ + SR F+A
Sbjct: 184 H-------------------------DR--YIQANAGCKHFDAHGGPEDIPESRMGFDAK 216
Query: 266 ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYIT 324
++ +D TF P F+ C++ G A +MCSYN +NGVPAC L R EW F GY+
Sbjct: 217 VSMRDLRLTFLPAFQKCVQAG-AYSLMCSYNSINGVPACSNKLLMMDILRGEWNFTGYVV 275
Query: 325 SDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDI----NCGTCMLRHTQSAIDKGKVQEKD 380
SD A+ + +Y ED+AAG + AG ++ N + A+ GK++E
Sbjct: 276 SDEGAIENQISFHHYYNNSEDAAAGSVNAGCNLELSGNLTEPVFMKIGDAVKSGKLEESV 335
Query: 381 IDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKK 440
+ + LF ++RLG F+ P Y + + + EH+ L+L AA + +VLLK K
Sbjct: 336 VRNRVKPLFYTRMRLGEFD-PPEMNPYSSVNLSVIQSEEHRNLSLTAAAKSLVLLKRPSK 394
Query: 441 FLP---LNKNAVSSLAIIGPLVNNISQMGGGY--TGIPCSPKSLLRGLEAYVSKTHYASG 495
F + +A+IGP+ NN Q+ G Y T P K+ L+GL +YA+G
Sbjct: 395 FSKRHLIGGFPSERMAVIGPMANNTDQIFGDYSPTTDPRFVKTPLKGLTELNFSMNYAAG 454
Query: 496 CHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVAR 555
C D + + AD V+V G E+E+ DR ++LPG+Q+ L+ V
Sbjct: 455 CVDGTRCLNYSQDDVKTALVGADLVVVCLGTGKDLESENVDRKDMMLPGKQLQLLQDVVS 514
Query: 556 TSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIF---GDFNPGGR 612
+ + V L++ GP+++++A+ ++ IL YP ++ A+ + + G FNP GR
Sbjct: 515 MTNKAVYLLVFSAGPVNITWAQESERVLIILQCFYPAQSAGDAITQALIMRDGRFNPAGR 574
Query: 613 LPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSA 672
LP TWY + M D +M ++YR++TG +Y FG+GLSY+ + +
Sbjct: 575 LPYTWYRYTEQIPEMTDYSMAR--------KTYRYFTGVPLYPFGYGLSYSTFVF----- 621
Query: 673 PSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVV 732
S+L + AG D N+ VQ+ V N G DG V+
Sbjct: 622 -SKLYFLPKVNAG-DPNV-------------------------VQVRVFNEGPFDGDEVL 654
Query: 733 MLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDP 772
++ + + P QL+ F+RV ++ +IS + P
Sbjct: 655 QVYIKWMSTKERMPRVQLVAFERVFIRSQQYVDISTAITP 694
>gi|115436096|ref|NP_001042806.1| Os01g0296700 [Oryza sativa Japonica Group]
gi|113532337|dbj|BAF04720.1| Os01g0296700, partial [Oryza sativa Japonica Group]
Length = 522
Score = 337 bits (863), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 202/527 (38%), Positives = 293/527 (55%), Gaps = 36/527 (6%)
Query: 298 VNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIF---EYQNYTKTHEDSAAGVLKA 353
+NGVPAC L + R +W GYI SDCD+V + ++ YT +AA +KA
Sbjct: 1 INGVPACADARLLTETVRRDWQLHGYIVSDCDSVRVMVRDAKWLGYTGVEATAAA--MKA 58
Query: 354 GMDINCGTC-------MLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK 406
G+D++CG + A+ +GK++E +D AL NL+ +RLG F+G P +
Sbjct: 59 GLDLDCGMFWEGVHDFFTTYGVDAVRQGKLKESAVDNALTNLYLTLMRLGFFDGIP---E 115
Query: 407 YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL--VNNISQ 464
LG DVCT EHK+LA DAARQG+VLLKND LPL+ V+S+A+ G L +N
Sbjct: 116 LESLGAADVCTEEHKELAADAARQGMVLLKNDAALLPLSPEKVNSVALFGQLQHINATDV 175
Query: 465 MGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVA 524
M G Y G PC + G+ VS T C C++ A AK D IVVA
Sbjct: 176 MLGDYRGKPCRVVTPYDGVRKVVSSTS-VHACDKGSCDT------AAAAAKTVDATIVVA 228
Query: 525 GLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISS 584
GL+++ E E DR LLLP Q S + +VA S P++LV+ G +DVSFA+ + +I +
Sbjct: 229 GLNMSVERESNDREDLLLPWSQASWINAVAEASPSPIVLVIMSAGGVDVSFAQDNPKIGA 288
Query: 585 ILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYP-ESFTKVPMNDMNMRADSSRQYPGR 643
++W GYPGE G A+A+++FG +NPGGRLP+TWY E +K+PM M +R D+ YPGR
Sbjct: 289 VVWAGYPGEEGGTAIADVLFGKYNPGGRLPLTWYKNEYVSKIPMTSMALRPDAEHGYPGR 348
Query: 644 SYRFYTGTQV-YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT-----GSR 697
+Y+FY G V Y FGHGLSYTN++Y +A + +T+ K G+ + Q T S
Sbjct: 349 TYKFYGGADVLYPFGHGLSYTNFTYASATAAAPVTV----KVGAWEYCKQLTYKAGVSSP 404
Query: 698 LDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVH 757
++ + ++V N G DG+HVV ++ P G P KQL+ F RV
Sbjct: 405 PACPAVNVASHACQEEVSFAVTVANTGGRDGTHVVPMYTAPPAEVDGAPRKQLVAFRRVR 464
Query: 758 TVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLT 804
A + E++F ++ C+ +I + ++P G ++VG+ SL+
Sbjct: 465 VAAGAAVEVAFALNVCKAFAIVEETAYTVVPSGVSRVLVGDDALSLS 511
>gi|340368019|ref|XP_003382550.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
queenslandica]
Length = 742
Score = 336 bits (862), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 244/787 (31%), Positives = 364/787 (46%), Gaps = 115/787 (14%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDN-------ASAIPRLGIPAYEWWSESLH 104
+PF NTSLSI R K ++ LTL+E ++Q++ A IPRL I Y+W +E L
Sbjct: 24 FPFQNTSLSIEDRVKDIVDNLTLEELVEQMAHGGATLNGPAPGIPRLHINPYQWGTECLS 83
Query: 105 GIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG------ 158
G S G TSFP + AASFN L + +A A E RA +
Sbjct: 84 GNVSAG----------DATSFPMPIGMAASFNYDLLKRVTNATAYEVRAKHAAAVKDGSY 133
Query: 159 --QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGF 216
GL+ W+P +NI RDPRWGR QET GEDP + +V QG N +
Sbjct: 134 AFHTGLSCWSPVLNIMRDPRWGRNQETYGEDPYLSGYLGQAYVNGLQGNNSRY------- 186
Query: 217 GFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQ 276
++ +A CKH + + SR+SF+A ++ +D TF
Sbjct: 187 --------------------IIANAGCKHFDVHGGPENIPTSRFSFDAKVSMRDWRMTFL 226
Query: 277 PPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFE 335
P F++C+E G S +MCSYN++NGVPAC L RNEW FKGY+ SD A+ I
Sbjct: 227 PQFKACVEAGALS-LMCSYNRINGVPACANKALLTDILRNEWDFKGYVVSDQGALEFIVI 285
Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTC------MLRHTQSAIDKGKVQEKDIDRALLNLF 389
+Y +AA AG + G + H A+ V + A+ LF
Sbjct: 286 EHHYAPDFMKAAADAANAGTCLEDGNIGRKFFNVFEHLVDAVKNNLVSVDTLKNAVSRLF 345
Query: 390 SVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKND----KKFLPLN 445
V+++LG F+ P Y + + + H L+L AA + IVL+KND FLP+
Sbjct: 346 YVRMKLGEFD-PPDNNPYANIPLSVIQSDAHINLSLQAAMESIVLMKNDDGFRSPFLPIT 404
Query: 446 KNAVSSLAIIGPLVNNISQMGGGYT-----GIPCSPKSLLRGLEAYVSKTHYASGCHDVP 500
N V ++GP ++ + G Y+ + + L+ +YA GC D P
Sbjct: 405 -NEVKKACMVGPFSDDPEVLFGDYSPTLMRDYVITSLAGLKNANIGTDTLNYAVGCEDGP 463
Query: 501 CNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRP 560
+ + + +IV AGL E+E +D + LPG Q+ L+ SK
Sbjct: 464 ACRNYDSAKVRSACDGVELIIVTAGLSKHLESEGKDLSDINLPGHQLDLMQDAEAASKNA 523
Query: 561 -VILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYP 619
VIL+L PLD+ +A+ D +I IL YPG+ KA+A ++ G++NP GRLP TW P
Sbjct: 524 SVILILFNASPLDIRYAKTDPRIVGILEAYYPGQTAGKAIANVLTGEYNPSGRLPNTW-P 582
Query: 620 ESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTI 678
S +VP + + M+ R+YR++T +Y FG+GLSYT + Y S L I
Sbjct: 583 ASLDQVPGITNYTMKE--------RTYRYFTQEPLYPFGYGLSYTTFHY------SNLNI 628
Query: 679 SASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARV 738
S++ A I V + VTN G +DG+ V ++
Sbjct: 629 SSTATASGAGMIA------------------------VSVLVTNTGSMDGTEVTQVYVWC 664
Query: 739 PKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
+S P+ QL+G ++ + E+SF + P EQL + + ++P+ + L V
Sbjct: 665 -NISYA-PKLQLVGVNKDFISKGKTLEVSFSIKP-EQLQVWTDDDKWVIPVSTYDLSVSG 721
Query: 799 LRHSLTI 805
+ + T+
Sbjct: 722 QQPNQTV 728
>gi|374316077|ref|YP_005062505.1| beta-glucosidase-like glycosyl hydrolase [Sphaerochaeta pleomorpha
str. Grapes]
gi|359351721|gb|AEV29495.1| beta-glucosidase-like glycosyl hydrolase [Sphaerochaeta pleomorpha
str. Grapes]
Length = 701
Score = 335 bits (860), Expect = 4e-89, Method: Compositional matrix adjust.
Identities = 217/638 (34%), Positives = 316/638 (49%), Gaps = 89/638 (13%)
Query: 64 RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
+AK L++ ++L+E QL A AIPRLG+P Y WW+E+LHG A +G + T
Sbjct: 9 QAKQLVAHMSLKEMFSQLLHEAPAIPRLGLPRYNWWNEALHGAARSG----------TAT 58
Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFWAPNINIFRDP 175
FPQ + AA F+ I + ++ E RA YN A GLT W+PN+NIFRDP
Sbjct: 59 VFPQAIGLAAMFDDVFLKEIATVISTEQRAKYNTFSALGDRGIYKGLTLWSPNVNIFRDP 118
Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
RWGRGQET GEDP + S V F++ QG+ G
Sbjct: 119 RWGRGQETYGEDPYLASQLGVSFIQGLQGD----------------------------GP 150
Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
L +AC KH + + R+ FNA+++ +D +T+ P F +C+++G+ + +M +Y
Sbjct: 151 YLKTAACVKH---FAVHSGPEPLRHDFNAIVSRKDLYETYLPAFEACVKEGEVNAVMGAY 207
Query: 296 NQVNGVPAC----LRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVL 351
+ VNG P C L D+ RN+WGF+G SDC A+ TK DS A L
Sbjct: 208 SAVNGEPCCGSPFLITDIL---RNDWGFEGMYISDCWAIRDFHLNHAVTKNQVDSVALAL 264
Query: 352 KAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLG 411
AG D+NCG C + A +G + K I +A + + + + LGLF+ D Y +G
Sbjct: 265 NAGCDLNCG-CEYLSLEKAYQQGLIDRKTITQACIRVMTTRFALGLFSED---CTYSNIG 320
Query: 412 PDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTG 471
+ T EH+K+A A+ +VLLKND LPL+ ++ ++AIIGP ++ + G Y G
Sbjct: 321 YEQNDTEEHRKVAFKASCNSLVLLKND-GMLPLDSRSLHAIAIIGPNADSREALWGNYHG 379
Query: 472 IPCSPKSLLRGLEAYVS---KTHYASGCH------DVPCNSDAGFHEAVRIAKKADFVIV 522
+ ++L G + K Y+ G + + EA+ +A +D +I+
Sbjct: 380 TSSTYTTVLEGFRKTLGESVKVKYSQGSAIQKEKLERLAEPNDRIAEAIAVATVSDTIIL 439
Query: 523 VAGLDLTQETEDR---------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
G D T E E D+ L LP Q +L+ +VA T K P++LVL GG +D
Sbjct: 440 CLGYDETVEGEMHDDGNGGWAGDKQDLRLPPCQRALLKAVASTGK-PIVLVLLSGGAIDP 498
Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMR 633
E + ++L YPG+ G A+A I G NP G LP+T+Y D M
Sbjct: 499 EI-ERFPNVKALLQGWYPGQEGGLAIAHTILGLNNPSGHLPVTFYRSETVLPDFCDYRME 557
Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLS 671
GR+YR+ +Y FG GLSYT +SY LS
Sbjct: 558 --------GRTYRYVQEKVLYPFGFGLSYTTFSYGNLS 587
>gi|413925165|gb|AFW65097.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 412
Score = 334 bits (856), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 168/346 (48%), Positives = 210/346 (60%), Gaps = 25/346 (7%)
Query: 14 ILFRVLQLIVVVNVIAFSNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLT 73
+L ++ L+V ++ + P F C PFCNT L + RA L+S +T
Sbjct: 58 LLAILMHLLVPALLLMRVVAVVAAGAPPFSCGGGPSLGLPFCNTKLPAAQRAADLVSRMT 117
Query: 74 LQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFN-GTVSSVTSFPQVLVSA 132
EK QL D A+ +PRLG+P+Y+WW+E+LHG+A +G G++ + G V S TSFPQVL++A
Sbjct: 118 PAEKASQLGDVANGVPRLGVPSYKWWNEALHGVAISGKGIHMDRGAVRSATSFPQVLLTA 177
Query: 133 ASFNRSLWSNIGSAVAVEARAMYNLGQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVV 191
ASFN +LW IG A EARA YN+GQA GLT W+PN+NIFRDPRWGRGQETPGEDP V
Sbjct: 178 ASFNDNLWFRIGQATGKEARAFYNIGQAEGLTMWSPNVNIFRDPRWGRGQETPGEDPAVA 237
Query: 192 SAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDL 251
S YA FV+ QG + L+ SACCKH AYDL
Sbjct: 238 SRYAAAFVRGLQGSS---------------------SNTKSVPPVLLTSACCKHATAYDL 276
Query: 252 EKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQ 311
E W +RYSF A +T QD DTF PPFRSC+ GKASC+MC+Y VNGVP+C DL
Sbjct: 277 EDWKGVTRYSFRATVTVQDLADTFNPPFRSCVVDGKASCVMCAYTSVNGVPSCANADLLT 336
Query: 312 KA-RNEWGFKG-YITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGM 355
K R WG G Y+ +DCDAV+ + Q Y T ED+ A LKAGM
Sbjct: 337 KTFRGSWGLDGRYVAADCDAVSIMRNSQFYRPTAEDTVATTLKAGM 382
>gi|372209036|ref|ZP_09496838.1| glycoside hydrolase [Flavobacteriaceae bacterium S85]
Length = 859
Score = 333 bits (854), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 244/753 (32%), Positives = 366/753 (48%), Gaps = 100/753 (13%)
Query: 56 NTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF 115
N + SI R L++ +TL+EKI I RLGIP +EW+ E+LHGI S
Sbjct: 27 NPNASIEDRVNDLLANMTLEEKISYCGSRIPEIKRLGIPYFEWYGEALHGIIS------- 79
Query: 116 NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDP 175
+ T FPQ + A++N L ++ +A++ EARA+ N G+ + ++P +N+ RDP
Sbjct: 80 ----WNCTQFPQNIAMGATWNPDLMFDVATAISNEARALKNAGKKEVMMFSPTVNMARDP 135
Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
RWGR E EDP ++S A +V+ QG + K
Sbjct: 136 RWGRNGECYAEDPHLMSEMARMYVRGMQGNDPKY-------------------------- 169
Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
+ KH +A ++E R ++ I ++D + + P +++CI +A+ IM +
Sbjct: 170 -VKTVTTVKHYVANNVET----KREWIHSNIGKKDLYEYYFPAYKTCIVDEEATGIMTAL 224
Query: 296 NQVNGVPACLRGDLFQKA--RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKA 353
N +NG+P C D RNEWGFKGY+ +D AV + + Y + +AA +KA
Sbjct: 225 NGLNGIP-CSAHDWLVNGVLRNEWGFKGYVIADWAAVQGLEKRMKYASSQAQAAAMAIKA 283
Query: 354 GMDINC------GTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKY 407
G+D C M++ A+ +G + EK++D + L ++ G F+ DP Y
Sbjct: 284 GVDQECFRNKVRQAPMVQALPDALQQGLITEKELDVTVKRLLRLRFMTGDFD-DPSLNPY 342
Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
+ + HK+LAL AA Q IVLLKND LPL K+ + S+A+IGP + G
Sbjct: 343 SAIPTSVLECDAHKQLALKAAEQSIVLLKNDA-VLPLKKD-LKSIAMIGPFADRCWM--G 398
Query: 468 GYTGIPCSPKSLLRGLEAYV-SKTHYASGCHDVPCNSDA-GFHEAVRIAKKADFVIVVAG 525
Y+G P S S L G++AY +K +A GC D EAV +AKK++ VI+V G
Sbjct: 399 IYSGHPKSKVSPLDGIKAYTNAKVSFAQGCEVTAKEDDEQKIAEAVALAKKSEQVILVVG 458
Query: 526 LDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSI 585
D T TE+ DR S+ LPG Q L+ +V +K VILVL GP V++ + + I I
Sbjct: 459 NDETTSTENTDRKSIKLPGNQHQLIKAVQAVNKN-VILVLVPSGPTAVTWEQKN--IPGI 515
Query: 586 LWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSY 645
+ G+ ALA+++FGD NPGG+L TWY +D M + R+Y
Sbjct: 516 VCAWPNGQEQGTALAKVLFGDVNPGGKLNATWYQSDKDLPNFHDYKMAGGN------RTY 569
Query: 646 RFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDE 705
++ G +Y FG+GLSYTN++ SD +I ++T +YV
Sbjct: 570 MYFKGKPLYPFGYGLSYTNFTI------------------SDVSINKKTLQANEYV---- 607
Query: 706 VTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKE 765
V+ V N G V G VV ++ R K + TP K L GF R+ A SK
Sbjct: 608 ---------TVKAKVNNTGAVAGDEVVQVYIRDVKSKEKTPLKALKGFQRISVAAGASKW 658
Query: 766 ISFGVDPCEQLSIAN-KHGRRILPLGNHVLMVG 797
+ + P E S N K ++ G ++VG
Sbjct: 659 VEIKI-PYEAFSHYNTKKEALMVAKGEFEILVG 690
>gi|336463686|gb|EGO51926.1| hypothetical protein NEUTE1DRAFT_125528 [Neurospora tetrasperma
FGSC 2508]
Length = 788
Score = 332 bits (851), Expect = 5e-88, Method: Compositional matrix adjust.
Identities = 248/791 (31%), Positives = 362/791 (45%), Gaps = 120/791 (15%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGT---VSSVTSFPQVLVSAASFNRSLWSNIGSA 146
R+G+P Y WWSE LHG+A + PGV FN T S TSF + ASF+ L +G+A
Sbjct: 13 RIGLPKYAWWSEGLHGVAGS-PGVTFNTTGYPFSYATSFANAINLGASFDDDLVYEVGTA 71
Query: 147 VAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGEN 206
++ EARA N G GL +W PN+N ++DPRWGRG ETPGEDP+ + Y
Sbjct: 72 ISTEARAFANFGFGGLDYWTPNVNPYKDPRWGRGAETPGEDPLHIKGYV----------- 120
Query: 207 WKSDDGGIGFGFREKRVLKGF-GEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM 265
K +L G G E+ R + A CKH AYDLE+W +RY F A+
Sbjct: 121 --------------KAMLAGLEGNETVR----KVIATCKHYAAYDLERWHGLTRYEFEAI 162
Query: 266 ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQV-----------------NGVPACLRGD 308
+T QD + + PPF+ C K IMCSYN + PAC
Sbjct: 163 VTLQDLSEYYLPPFQQCARDSKVGSIMCSYNALTIRDMAGGNPDEIINLTTAQPACANTY 222
Query: 309 LFQKARNEWGF---KGYITSDCDAVATIF-EYQNYTKTHEDSAAGVLKAGMDINCGT--C 362
L R+ W + YITSDC+A+ + N+++T ++AA KAG D C
Sbjct: 223 LMTILRDHWNWTEHNNYITSDCNAILDFLPDNHNFSQTPAEAAAAAYKAGTDTVCEVSGS 282
Query: 363 MLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN------------GDPRKGKYGKL 410
L A ++ + E ID AL L+ +R G + G Y L
Sbjct: 283 PLTDVVGAYNQSLLPEAVIDTALRRLYEGLIRAGYLDHGRSSAVAGGDGGSFSSPAYDAL 342
Query: 411 GPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT 470
+DV T ++LAL +A +GIVLLKN LPL+ ++ +A+IG N M G Y+
Sbjct: 343 NWEDVNTPSTQELALRSATEGIVLLKNSGSLLPLDFSSGKKVALIGHWANATGTMRGPYS 402
Query: 471 GIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQ 530
GIP + L + YA+G + D A+ A+ AD V+ G D T
Sbjct: 403 GIPPFYHNPLYAAQQLNLSFSYANGPVVNASDPDTWTAPALAAAEGADVVLYFGGTDTTV 462
Query: 531 ETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGY 590
+ED DR S+ P QM L++ +A K V++ L G +D SF + +SSILW+GY
Sbjct: 463 ASEDLDRESIAWPKAQMKLLSELAGLGKPLVVIQL--GDQVDDSFLLENGNVSSILWVGY 520
Query: 591 PGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMR---------------- 633
PG++G A+ +++ G P GRLP+T YPE + +VP+ +M +R
Sbjct: 521 PGQSGGTAVFDVLTGKKAPAGRLPVTQYPEGYVDEVPLTEMALRPFNHSSSTSSSSNPEE 580
Query: 634 ------------------ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSE 675
+ + PGR+Y++Y+ V FG+GL YT ++ + +
Sbjct: 581 EVSVQGSGSLTIQPRSTPGNKTLSSPGRTYKWYS-NPVLPFGYGLHYTTFNVSLSLSSNA 639
Query: 676 LTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLF 735
+ S S S T H+D S + IS+TN G +V +LF
Sbjct: 640 SSPSPSFSIPSLLTPCTAT-------HLDLCPFSPSANSALSISITNTGTHTSDYVALLF 692
Query: 736 ARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLM 795
+ P K L+ + RV + G + ++ P +I+ G GN VL
Sbjct: 693 LSGEFGPKPYPLKTLVSYKRVKDIKPG-ETVTVKDVPVSLGAISRVDGD-----GNTVLY 746
Query: 796 VGELRHSLTIE 806
G R ++ ++
Sbjct: 747 PGTYRFAVDVD 757
>gi|348684872|gb|EGZ24687.1| family 3 glycoside hydrolase [Phytophthora sojae]
Length = 805
Score = 331 bits (848), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 247/757 (32%), Positives = 358/757 (47%), Gaps = 105/757 (13%)
Query: 44 CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASA---IPRLGIPAYEWWS 100
C +PFC+ SLS S R + L+ L L EK+ L+ AS + +G+P Y W +
Sbjct: 26 CSDTEHQKFPFCDASLSTSERVEDLLRRLPLDEKVTLLTARASPKGNMSSIGLPEYNWGA 85
Query: 101 ESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG-- 158
+HG+ S G N TSFP + A F+ ++ + E RA++ G
Sbjct: 86 NCVHGVQSTC-GTN------CATSFPNPVNLGAIFDPQAVFDMAQVIGWELRALWLEGAR 138
Query: 159 -------QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDD 211
GL W+PNINI RDPRWGR ETP EDP+V S Y V + + Q +
Sbjct: 139 ENYAAGPHLGLDCWSPNININRDPRWGRNMETPSEDPLVNSKYGVAYTRGLQ-------E 191
Query: 212 GGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDT 271
G ++KR L+ KH AY E + R +FNA ++ D
Sbjct: 192 G------KDKRFLQAV-------------VTLKHYAAYSYEHYDGIDRMAFNAQVSRYDF 232
Query: 272 EDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAV 330
DT+ P F + + +GKA +MCSYN VNG+P C L K R GF GYITSD A+
Sbjct: 233 ADTYLPAFHASVVEGKAKGVMCSYNSVNGMPMCANEQLNTKLLREALGFDGYITSDSGAI 292
Query: 331 ATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFS 390
I+ ++YTK+ ++ + +G D+N G+ + + G++ EK +D A+
Sbjct: 293 EGIYRQRHYTKSLCEAGRLAIMSGTDVNSGSVYKKCLADLVTSGQLPEKAVDDAMRRTLK 352
Query: 391 VQLRLGLFNGDPRKGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAV 449
++ LGLF DP + Y + P +V +E K+L+L+ R+ IVLL+N LPL K
Sbjct: 353 LRFELGLF--DPIDDQPYWHVAPSEVGKTESKQLSLELTRKSIVLLQNHGNVLPLRKG-- 408
Query: 450 SSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRG-----LEAYV-----SKTHYA--SGCH 497
LA+IGP + G Y G C L G LEA S T YA SG +
Sbjct: 409 KKLAVIGPHAKAKRALLGNYLGQMCHGDYLEVGCVQTPLEAITAANGASNTVYAKGSGIN 468
Query: 498 DVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTS 557
D S A F A A+ AD V++ G+D + E E DR ++ +P QM L+ V R
Sbjct: 469 DT---STADFDAAEAAARGADAVVLFLGIDTSIEREAWDRENIDMPNIQMQLLKRVRRAG 525
Query: 558 KRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTW 617
K P ++VL GG V E + YPG GA+A+++I+FGD P G+LP+T
Sbjct: 526 K-PTVVVLFNGGV--VGAEELILHTDGVAEAFYPGFFGAQAVSDILFGDAIPSGKLPVTM 582
Query: 618 YPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELT 677
YP ++ +N ++M++ S +YPGRSYR+Y V+ FG GLSYT ++
Sbjct: 583 YPSNY----INSVDMKSMSMTKYPGRSYRYYKEVPVFPFGWGLSYTKFT----------- 627
Query: 678 ISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFAR 737
L G D D + L V + V+N GD+ G VV F R
Sbjct: 628 -------------LALDGEMPD----DPIVITRDLDQTVTVIVSNDGDLVGDEVVFAFFR 670
Query: 738 VPKVS----QGTPEKQLIGFDRVHTVAKGSKEISFGV 770
V+ +QL + RV ++++F +
Sbjct: 671 PLNVNATGDAALLNEQLFDYRRVSLRPTQYRKLTFRI 707
>gi|157676888|emb|CAP07659.1| beta-xylosidase [uncultured rumen bacterium]
Length = 761
Score = 330 bits (847), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 244/823 (29%), Positives = 368/823 (44%), Gaps = 169/823 (20%)
Query: 44 CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESL 103
C P + + S RAK+L+ L+L+EK + N+ A+ RLGI AY WWSE+L
Sbjct: 24 CAPQEIS---YTDKSQPAELRAKALLPKLSLEEKAGLVQYNSPAVERLGIKAYNWWSEAL 80
Query: 104 HGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ---- 159
HG+A NG S T FPQ + AASF+ + +AV+ EAR +
Sbjct: 81 HGVARNG----------SATVFPQPIGMAASFDVEKIETVFTAVSDEARVKNRIAAEDGR 130
Query: 160 ----AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
AGL+FW PNINIFRDPRWGRG ET GEDP ++ + V+ QG+
Sbjct: 131 VYQYAGLSFWTPNINIFRDPRWGRGMETYGEDPYLMGQLGMAVVRGLQGD---------- 180
Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTF 275
D L AC KH Y + +R+ F+A ++E+D +T+
Sbjct: 181 ----------------PDADVLKTHACAKH---YAVHSGLESNRHRFDAQVSERDLRETY 221
Query: 276 QPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIF 334
P F+ + + +M +YN+ G P L QK R EWG+KG + SDC A+ F
Sbjct: 222 LPAFKDLVTKAGVKEVMTAYNRFRGYPCAASEYLVQKILREEWGYKGLVVSDCWAIPDFF 281
Query: 335 E--YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQ 392
E + T E++AA + G+D+ CG+ + +AID+G ++E+D+DR LL + + +
Sbjct: 282 EPGRHGFVATGEEAAALAVANGLDVECGSTFSK-IPAAIDQGLLKEEDLDRNLLRVLTER 340
Query: 393 LRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSL 452
RLG +G+ + L P V EH+ L+LD AR+ +VLL+N+ LPL A +
Sbjct: 341 FRLGEMDGE---SPWDDLDPAIVEGPEHRALSLDIARETMVLLRNN-GVLPL--KAGEKI 394
Query: 453 AIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCN---------- 502
A+IGP ++ G Y +P S +LL+ ++A V Y C +
Sbjct: 395 ALIGPNADDAQMQWGNYNPVPKSTITLLQAMQARVPGLVYDRACGILDAEYAPQGSAYAN 454
Query: 503 ----SDAGFHEAVR-------------------------------IAKK---ADFVIVVA 524
S+A A R + KK D V+
Sbjct: 455 LIGASEAQLEAAARRYAVSVNDIKNYIRRDEEQRRSFMPALDEAAVLKKLEGVDVVVFAG 514
Query: 525 GLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
G+ E E+ DR + LPG Q L+ ++ K+ V++ +G +
Sbjct: 515 GISPRLEGEEMRVQVPGFSGGDRTDIELPGVQRRLLKALHDAGKKVVLVNFSGCA---IG 571
Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA 634
+IL YPG+ G A+A+++FGD NP G+LP+T+Y + D NM
Sbjct: 572 LVPETESCDAILQAWYPGQEGGTAIADVLFGDVNPSGKLPVTFYKNVDQLPDVEDYNME- 630
Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
G +YR++ G +Y FG+GLSYT++++ E + KN+
Sbjct: 631 -------GHTYRYFRGEPLYPFGYGLSYTSFAF------GEPKVKG-------KNL---- 666
Query: 695 GSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFD 754
+I VTN G V G+ VV L+ R P + G P K L F
Sbjct: 667 ----------------------EIDVTNTGSVAGTEVVQLYVRKPDDTAG-PVKTLRAFR 703
Query: 755 RVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
RV A + ++S +D L + K + G + L+ G
Sbjct: 704 RVSVPAGQTVKVSIPLDKETFLWWSEKDQDMVPVRGRYELLCG 746
>gi|164428543|ref|XP_964543.2| hypothetical protein NCU00709 [Neurospora crassa OR74A]
gi|157072187|gb|EAA35307.2| hypothetical protein NCU00709 [Neurospora crassa OR74A]
Length = 786
Score = 330 bits (845), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 249/787 (31%), Positives = 367/787 (46%), Gaps = 114/787 (14%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGT---VSSVTSFPQVLVSAASFNRSLWSNIGSA 146
RLG+P Y WWSE LHG+A + PGV FN T S TSF + ASF+ L +G+A
Sbjct: 13 RLGLPKYAWWSEGLHGVAGS-PGVKFNTTGYPFSYATSFANAINLGASFDDDLVYEVGTA 71
Query: 147 VAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGEN 206
++ EARA N G GL +W PN+N ++DPRWGRG ETPGEDP+ + Y
Sbjct: 72 ISTEARAFANFGFGGLDYWTPNVNPYKDPRWGRGAETPGEDPLHIKGYV----------- 120
Query: 207 WKSDDGGIGFGFREKRVLKGF-GEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM 265
K +L G G E+ R + A CKH AYDLE+W +RY F A+
Sbjct: 121 --------------KAILAGLEGNETVR----KVIATCKHYAAYDLERWHGLTRYEFEAI 162
Query: 266 ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQV-----------------NGVPACLRGD 308
+T QD + + PPF+ C K IMCSYN + PAC +
Sbjct: 163 VTLQDLSEYYLPPFQQCARDSKVGSIMCSYNALTIRDMASGKPDEEINLTTAQPACAKPY 222
Query: 309 LFQKARNEWGF---KGYITSDCDAVATIF-EYQNYTKTHEDSAAGVLKAGMDINCGT--C 362
L R+ W + YITSDC+A+ + N+++T ++AA KAG D C
Sbjct: 223 LMTILRDHWNWTEHNNYITSDCNAILDFLPDNHNFSQTPAEAAAAAYKAGTDTVCEVSGS 282
Query: 363 MLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN------------GDPRKGKYGKL 410
L A ++ + E ID AL L+ +R G + G Y L
Sbjct: 283 PLTDVVGAYNQSLLPEAVIDTALRRLYEGLIRAGYLDHGRSSAVAGGDGGSFSSPAYDAL 342
Query: 411 GPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT 470
+DV T ++LAL +A +GIVLLKN LPL+ + +A+IG N M G Y+
Sbjct: 343 NWEDVNTPSTQELALRSATEGIVLLKNAGSLLPLDFSG-KKVALIGHWANATGTMRGPYS 401
Query: 471 GIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQ 530
GIP + L + YA+G + D A+ A+ AD V+ G D T
Sbjct: 402 GIPPFYHNPLYAAQQLNLSFSYANGPVVNASDPDTWTAPALAAAEGADVVLYFGGTDTTV 461
Query: 531 ETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGY 590
+ED DR S+ P QM L++ +A K V++ L G +D S + +SSILW+GY
Sbjct: 462 ASEDLDRESIAWPETQMQLLSELAGLGKPLVVIQL--GDQVDDSSLLNNGNVSSILWVGY 519
Query: 591 PGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMR---------------- 633
PG++G A+ +++ G P GRLP+T YPE + +VP+ +M +R
Sbjct: 520 PGQSGGTAVFDVLTGKKAPAGRLPVTQYPEGYVDEVPLTEMALRPFNYSSSSNLEQEVSV 579
Query: 634 --------------ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS 679
+ + PGR+Y++Y+ + V FG+GL YT ++ + S + S
Sbjct: 580 QGRGSLTIQPRSTPGNKTLSSPGRTYKWYS-SPVLPFGYGLHYTTFNVSLSLSSSNASSS 638
Query: 680 ASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVP 739
+S + S ++L + H+D S + +S+TN G +VV+LF
Sbjct: 639 SSSPSFSIPSLLTPCTA----THLDLCPFSPSANSALSVSITNTGTHTSDYVVLLFLSGE 694
Query: 740 KVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGEL 799
+ P K L+ + RV + G + ++ P +I+ G GN VL G
Sbjct: 695 FGPKPYPLKTLVSYKRVKDIKPG-ETVTVKDVPVSLGAISRVDGD-----GNTVLYPGTY 748
Query: 800 RHSLTIE 806
R + ++
Sbjct: 749 RFVVDVD 755
>gi|390956994|ref|YP_006420751.1| beta-glucosidase-like glycosyl hydrolase [Terriglobus roseus DSM
18391]
gi|390411912|gb|AFL87416.1| beta-glucosidase-like glycosyl hydrolase [Terriglobus roseus DSM
18391]
Length = 742
Score = 328 bits (842), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 243/779 (31%), Positives = 365/779 (46%), Gaps = 115/779 (14%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ + S I R LI TLQEK QL+ +PRLG+P + W+++LHG+ S P
Sbjct: 38 YRDMSRPIEDRITDLIKRFTLQEKAMQLNHTNRGVPRLGLPMWGGWNQTLHGVWSKQP-- 95
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG------LTFWAP 167
T FP A+++ L + A++ EARA+YN G L + +P
Sbjct: 96 --------TTLFPIPTAMGATWDPELVHTVADAMSDEARALYNAHAEGPRTPHGLVYRSP 147
Query: 168 NINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGF 227
INI RDPRWGR QE EDP++ V +V+ QG++ +
Sbjct: 148 VINISRDPRWGRIQEVFSEDPLLTGRMGVAYVRGLQGDDLQ------------------- 188
Query: 228 GEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGK 287
L L+A KH ++E R NA + E++ + + P +R+ I +
Sbjct: 189 --------HLKLAATVKHFAVNNVES----GRQHLNADVDERNLFEFWLPHWRAAIMEAH 236
Query: 288 ASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVA---------TIFEYQ 337
A +M SYN +NG+P + L R +WGF G++T D AVA T Q
Sbjct: 237 AQSVMSSYNAINGMPDAVNHWLLTDVLRKKWGFDGFVTDDLGAVALLSGTRATNTSEPGQ 296
Query: 338 NYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
++++ +AA ++AG D + + A+ +G + EKD+D AL N+ V RLG
Sbjct: 297 HFSEDPVVAAAAAIRAGNDSD-DVEFETNLPLAVQRGLLTEKDVDGALRNVLRVGFRLGA 355
Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
++ P+ KY ++G D V + H+ L+ A + + LL N ++FLPL ++ V S+A+IGP
Sbjct: 356 YD-PPQASKYSRIGMDVVRSQAHRDLSQRVAEESMTLLLNRRQFLPLQRDQVKSVAVIGP 414
Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCHDVPCNSDAGFHEAVRIA 514
+ G Y G P S+ GL A + K Y G V D A +A
Sbjct: 415 AGGEAYET-GNYYGTPAVKTSVTEGLRALLGSGVKVEYEKGAGYVDLADDKEIERAANLA 473
Query: 515 KKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
+K+D V++ G +L E E RDR L LPG Q L+ +V + + V LVL GPL V+
Sbjct: 474 RKSDVVVLCLGTNLQVEAEGRDRRDLNLPGAQQRLLEAVYAANPK-VALVLMNAGPLGVT 532
Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA 634
+A + +IL YPGE G A+A +FG NPGG LP T Y P N+
Sbjct: 533 WAH--DHVPAILSAWYPGELGGAAIARTLFGLNNPGGHLPYTVYANLDGVPPQNEY---- 586
Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
D SR Y +Y+++ G +Y FGHGLSYT++ Y S+L ++ + +G N+
Sbjct: 587 DVSRGY---TYQYFKGVPLYPFGHGLSYTHFDY------SKLKVTQT--SGDHANVT--- 632
Query: 695 GSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFD 754
V ++TN G G+ V L++ K S+ P + L GF+
Sbjct: 633 ---------------------VSFTLTNTGQSAGAEVTQLYSHQVKSSEVQPLRTLRGFE 671
Query: 755 RVHTVAKGSKEISFGVDPCEQLSIANK--HGRRILPLGNHVLMVG------ELRHSLTI 805
RV SK ++ + P L + H R+ P G MVG LR ++TI
Sbjct: 672 RVTLQPGESKAVAISI-PTSALGWYDTAVHNFRVEP-GAFNFMVGSSSEDIRLRDTVTI 728
>gi|167537541|ref|XP_001750439.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163771117|gb|EDQ84789.1| predicted protein [Monosiga brevicollis MX1]
Length = 834
Score = 327 bits (838), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 231/764 (30%), Positives = 355/764 (46%), Gaps = 94/764 (12%)
Query: 51 SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASA-IPRLGIPAYEWWSESLHGIASN 109
SYPFC+T LS+ R K L+S ++ + QL SA I +G+PAY W + ++HG+ +
Sbjct: 106 SYPFCDTKLSVDDRLKDLVSRVSTADAATQLRARESAQIDNIGLPAYYWGTNAIHGMQNT 165
Query: 110 GPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG-QAGLTFWAPN 168
+ TSFP +A+FN SL ++G + E RA YN GL W+P
Sbjct: 166 ACLADGQ----CPTSFPAPNGLSATFNYSLVKDMGRIIGRELRAYYNTKFHNGLDTWSPT 221
Query: 169 INIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFG 228
IN RDPRWGR E+PGE P V Y + + Q
Sbjct: 222 INPSRDPRWGRNVESPGESPFVCGQYGAAYTEGLQ------------------------- 256
Query: 229 EESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKA 288
D D KH +AY +E + N +RY +NA+++E D DT+ P + ++ K
Sbjct: 257 -NGDDKDYTQAVVTLKHWVAYSVEDYDNVTRYEYNAIVSEYDLMDTYFPGWEYVVKNAKP 315
Query: 289 SCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAA 348
+MCSYN +NGVP C L R +WGF+GYITSD D++ I+ +Y +
Sbjct: 316 LGVMCSYNSLNGVPTCGNPALTAYLREDWGFEGYITSDSDSIHCIWADHHYESNAVLATR 375
Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPR-KGKY 407
L G DI+ G + ++A+++ V +D AL N + ++ LGLF DP Y
Sbjct: 376 DGLLGGCDIDSGDTYADNLEAAVNQSLVNRSAVDAALTNSYRMRFNLGLF--DPNVTNAY 433
Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
++ D+V S ++ +L AAR+ + LLKND + LP +A+IG N+ + G
Sbjct: 434 DRISADEVGMSSSQETSLLAARKSMTLLKNDGQTLPFATG--KKVAVIGKSSNSAEDILG 491
Query: 468 GYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSD-AGFHEAVRIAKKADFVIVVAGL 526
Y G P P ++ A+ + D A + A+++A AD V++
Sbjct: 492 NYVG-PICPSGAFDCVQTLYQGVAAANQGGATTLSDDVADINTAIQLAMDADQVVLTIS- 549
Query: 527 DLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSIL 586
+ Q E +DR + L Q LV +V + K P +V+ GG + + + + ++Q +IL
Sbjct: 550 NYGQAGEGKDRTYIGLDTDQQELVAAVLKVGK-PTAIVMLNGGLISLDWIKDEAQ--AIL 606
Query: 587 WIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK------------VPMNDMNMRA 634
PG G +A+AE IFG NPGG+LP+T Y + ++ MN+
Sbjct: 607 VAFAPGVHGGQAVAETIFGANNPGGKLPVTMYASDYVNDVDFLNMSMQAVAVLHLMNVNG 666
Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
+ PGRSY++YTG +Y F +GLSYT ++ + AP T +++L++ + + T
Sbjct: 667 ERDDTGPGRSYKYYTGEPLYPFAYGLSYTTFNLSWSPAPPMTTFTSTLRSTTYTATVTNT 726
Query: 695 GSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPK--------VSQGTP 746
GS V G VV F + PK V P
Sbjct: 727 GS-----------------------------VGGDEVVFAFYK-PKSESLKTLPVGNPVP 756
Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLG 790
K++ GF RV S +++F ++ E L+ G R L G
Sbjct: 757 IKEIFGFQRVALGPGQSTQVTFELN-AETLAQVTLDGHRELHSG 799
>gi|348688508|gb|EGZ28322.1| family 3 glycoside hydrolase [Phytophthora sojae]
Length = 701
Score = 327 bits (837), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 231/653 (35%), Positives = 333/653 (50%), Gaps = 108/653 (16%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPR-----LGIPAYEWWSESLHGIAS 108
FCNTSL +S R + L++ L L EK L+ AS PR +G+P Y W + +HG+ S
Sbjct: 34 FCNTSLPVSARVEDLLARLPLDEKAILLTARAS--PRGNMSSIGLPEYNWGANCVHGVRS 91
Query: 109 NGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPN 168
G N TSFP + NL +
Sbjct: 92 TC-GTN------CPTSFPNPV--------------------------NL----------S 108
Query: 169 INIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFG 228
I+ RDPRWGR ETP EDP+V S Y V + K Q
Sbjct: 109 IHRRRDPRWGRNTETPSEDPLVNSKYGVAYTKGLQ------------------------- 143
Query: 229 EESDRGDELMLSACC--KHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQG 286
E D L A KH +AY E +G +R +FNA+++ D DT+ P FRS I G
Sbjct: 144 -EGKHEDPRYLQAVVTLKHYVAYSYENYGGGNRKTFNAIVSPYDFADTYFPAFRSSIVDG 202
Query: 287 KASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFEYQNYTKTHED 345
A +MCSYN VNGVPAC +L K R GF GYITSD A+ I ++ +Y T +
Sbjct: 203 NAKGVMCSYNSVNGVPACANNELENKLLRGMLGFDGYITSDSGAIEAISDWLHYVPTRCE 262
Query: 346 SAAGVLKAGMDINCGTCM--LRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPR 403
+A + AG D+N G + + ++ ++ K +D L + ++ LGLF DP
Sbjct: 263 AARLAILAGTDVNSGRGFGYMACLKELVESNQLDVKVVDDVLRHTLKLRFELGLF--DPI 320
Query: 404 KGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP----- 457
+ + Y K+ P+DV T KKL+LD AR+ IVLL+N++ LPL + LA++GP
Sbjct: 321 EDQPYWKVTPNDVNTDAAKKLSLDLARKSIVLLQNNQPVLPLRRGV--KLAVVGPHAQAK 378
Query: 458 ---LVNNISQM-GGGYTGIPCSPKSLLRGLEAYV--SKTHYASGCHDVPCNSDAGFHEAV 511
L N + QM G Y + C K+ + A S T YA GC +V NS AGF EAV
Sbjct: 379 RALLGNYLGQMCHGDYNEVGCI-KTPFEAVSASNGDSSTTYALGC-NVTGNSTAGFVEAV 436
Query: 512 RIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPL 571
+ + A+ V++ G+D + E E RDR ++ LP Q+ L+ V R +P ++VL GG L
Sbjct: 437 KAVQGAEAVVLFLGIDKSVEAEVRDRNNIDLPAIQVQLLQRV-RAVGKPTVVVLMNGGVL 495
Query: 572 DVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMN 631
+ Q +++ YPG GA+A+ +I+FGD NPGG+LP+T Y + +N ++
Sbjct: 496 TAE--DIIGQTDALVEAFYPGFFGAQAMTDILFGDANPGGKLPVTMYRSDY----VNTVD 549
Query: 632 MRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKF--LSAPSELTISASL 682
M++ + YPGRSYR++ G V+ FG GLSYT++S K +A + ++SA++
Sbjct: 550 MKSMNVTAYPGRSYRYFKGEPVFPFGWGLSYTSFSLKADDATATTAKSVSATM 602
>gi|443692971|gb|ELT94448.1| hypothetical protein CAPTEDRAFT_221920 [Capitella teleta]
Length = 757
Score = 326 bits (835), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 244/730 (33%), Positives = 353/730 (48%), Gaps = 110/730 (15%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLS-----DNASAIPRLGIPAYEWWSESLHGI 106
+PF + SLS RA L++ LTL+E Q + AI RLGI Y W +E L G
Sbjct: 20 FPFQDPSLSWDDRADDLVARLTLEEIAPQTQASYGGQHTPAIERLGIKPYVWITECLAG- 78
Query: 107 ASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA------ 160
VN N T ++PQ + AASF+ L N+ ++ E RA +N +A
Sbjct: 79 -----QVNTNAT-----AYPQPIGMAASFSEELLFNVSRDISYEVRAHWNANRAVGKYST 128
Query: 161 --GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
GL+ ++P INI R P WGR QET GEDP++ A FV+ QG+
Sbjct: 129 KVGLSCFSPVINIMRHPLWGRNQETYGEDPLLSGTLAQSFVRGLQGD------------- 175
Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
+ R L+ +A CKH + + SR+SF+A + +D TF P
Sbjct: 176 -DPRYLRA-------------NAGCKHFDVHGGPEDIPVSRFSFDAKVNMRDWRMTFLPQ 221
Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFEYQ 337
F+ C++ G S +MCSYN++NG+PAC L R+EWGF GYI SD A++ I E
Sbjct: 222 FKMCVDAGSYS-LMCSYNRINGIPACANKQLLTDITRDEWGFHGYIVSDSGAISNIKEQH 280
Query: 338 NYTKTHEDSAAGVLKAG--MDINCGTCMLRHTQ-SAIDKGKVQEKDIDRALLNLFSVQLR 394
+YT + + +KAG +++ G+ M Q A+ +G + EK+I + L +LR
Sbjct: 281 HYTNSTVATVVAAIKAGTNLELGGGSNMYYPKQLDAMKQGLLTEKEIRDNVRPLLYTRLR 340
Query: 395 LGLFNGDPRKG-KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
LG F DP Y K+G D + + EH++ A+ AA G VLLKN LP+ K S LA
Sbjct: 341 LGEF--DPEAMVDYNKIGVDVIQSPEHREQAVKAAYMGFVLLKNHNNLLPIKKQ-YSKLA 397
Query: 454 IIGPLVNNISQMGGGYTGIPCSP--KSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAV 511
I+GP N S++ G Y+ ++ GL T A+GC + C+
Sbjct: 398 IVGPFTNATSELFGTYSSEVNLKFTSTIFEGLSPLGGSTRSANGCTNSACSGYVRDDVET 457
Query: 512 RIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVT-SVARTSKRPVILVLTGGGP 570
+A AD VIV G E+E DR L L G Q+ ++ +V ++ PVILVL GP
Sbjct: 458 AVA-GADLVIVALGSGQRFESEGNDRAYLDLHGHQLDILKDAVFFSNGAPVILVLINAGP 516
Query: 571 LDVSFAEADSQISSILWIGYPGEAGAKALAEIIF---GDFNPGGRLPMTWYPESFTKVP- 626
LD+++A+ D +++IL GYP ++ +AL + P GRL TW P + +VP
Sbjct: 517 LDITWAKLDPGVTAILSCGYPAQSTGEALRRSLTMSEPQAAPAGRLQATW-PLNLDQVPK 575
Query: 627 MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLS-APSELTISASLKAG 685
+ D M+ GR+YR+Y G +Y FG GLSYT++SY LS +PS +T
Sbjct: 576 ITDYTMQ--------GRTYRYYVGEPLYPFGFGLSYTSFSYTRLSISPSVIT-------- 619
Query: 686 SDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT 745
D VT V++ + N G D VV ++ P+
Sbjct: 620 ----------------QGDNVT--------VEVCLKNTGSYDSDEVVQVYMSWPQTPFPL 655
Query: 746 PEKQLIGFDR 755
P+ L F R
Sbjct: 656 PKWTLAAFAR 665
>gi|198425898|ref|XP_002119549.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 754
Score = 325 bits (834), Expect = 5e-86, Method: Compositional matrix adjust.
Identities = 247/770 (32%), Positives = 365/770 (47%), Gaps = 126/770 (16%)
Query: 50 DSYPFCNTSLSISTRAKSLISLLTLQEKIQQLS-----DN--ASAIPRLGIPAYEWWSES 102
+ +PF N SL I R + L++ LT++E I QLS DN A AI RLGI Y+W +E
Sbjct: 23 EEFPFRNFSLPIEERLEDLVNRLTIEEVILQLSRGGVRDNGPAPAITRLGIGPYQWNTEC 82
Query: 103 LHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN------ 156
L G A NG T FPQ + AA+F++ L + VA+EARA +N
Sbjct: 83 LRGYAMNG----------DATCFPQPIGLAATFDQGLIYKMAKTVALEARAKHNNFTKNG 132
Query: 157 -LG-QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGI 214
G GL+ ++P INI R P WGR QET GEDP++ S A +V QG
Sbjct: 133 NFGDHTGLSCFSPVINILRHPLWGRNQETYGEDPVLTSLMARAYVTGLQG---------- 182
Query: 215 GFGFREKRVLKGFGEESDRGDELML--SACCKHLIAYDLEKWGNFSRYSFNAMITEQDTE 272
DE+ L +A CKH +AY + +R+SF+A +++ D
Sbjct: 183 --------------------DEIYLPATAVCKHFVAYGGPENIPTTRFSFSANVSDHDIG 222
Query: 273 DTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVAT 332
TF P FR C+ G A +MCSYN +NGVP+C L R ++ F GY+ SD +A+
Sbjct: 223 TTFYPAFRECVHAG-AQGVMCSYNAINGVPSCANPMLETTLRKKFHFDGYVVSDENALEN 281
Query: 333 IFEYQNYTKTHEDSAAGVLKAGMDINC----GTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
I Y N+TK+ ++AA L AG+D+ T A+++G V E + R+ L
Sbjct: 282 IDLYFNFTKSKLETAAVALNAGVDLELTGFGKTNRYSLLNQAVEQGLVTEAALRRSAKRL 341
Query: 389 FSVQLRLGLFNGDPRKGKYGKLGPDDVCTS-EHKKLALDAARQGIVLLKNDKKFLPLNKN 447
F ++ LG F DP + + P DV S H+K A++ A + VLLKND LPL K
Sbjct: 342 FRTRMALGEF--DPPEFNHWLNVPIDVVQSLAHRKQAVEVAAKSFVLLKND-GILPL-KQ 397
Query: 448 AVSSLAIIGPLVNNISQMGGGYTG-----IPCSPKSLLRGLEAYVSKTHYASGC-----H 497
++I+GP +NN + G Y SP L + + +GC
Sbjct: 398 LYDKVSIVGPFINNSEALTGDYPAEFNLKYFSSPLFAANSLSS-SGVARFTTGCVGTNNQ 456
Query: 498 DVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTS 557
++P + + +D V+V G E E DR + LPG+Q+ L+ V + +
Sbjct: 457 NLPICATYNSTNVKEVVTGSDIVLVTLGTGRGVEAESNDRRDINLPGKQLQLIQDVVKYA 516
Query: 558 KRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTW 617
PVI+VL GPLDVS+ ++ ++++ + + +A+ E++ G NP GRLP TW
Sbjct: 517 NGPVIVVLFNAGPLDVSWVMGNT--AAVIACHFSAQMTGEAMLEVLTGVVNPAGRLPNTW 574
Query: 618 YPESFTKV-PMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
P S +V PM D +M R+YR+ T + ++ FG+GLSYT + Y
Sbjct: 575 -PASMEQVPPMTDYSMHE--------RTYRYSTSSPLFPFGYGLSYTKFWY--------- 616
Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
L A + +Q+ + V++ + N G +DG VV ++
Sbjct: 617 -----LDAVVEPTTIQRCQIPV-----------------VRVLIQNTGHLDGEEVVQIYM 654
Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLS--IANKHGR 784
K +QL+ F RV K +E+S + P LS I KH R
Sbjct: 655 TSKKKRDRELLRQLVAFQRVPI--KAGEEVSISL-PIPHLSRAIWMKHNR 701
>gi|365118446|ref|ZP_09337032.1| hypothetical protein HMPREF1033_00378 [Tannerella sp.
6_1_58FAA_CT1]
gi|363649697|gb|EHL88801.1| hypothetical protein HMPREF1033_00378 [Tannerella sp.
6_1_58FAA_CT1]
Length = 1283
Score = 323 bits (829), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 247/764 (32%), Positives = 366/764 (47%), Gaps = 121/764 (15%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDN--ASAIPRLGIPAYEWWSESLHGIASNGP 111
+ N ++ I R L+ LTL+EK+ QLSD+ + I RL IPA +E LHG
Sbjct: 72 YLNPNIPIEERIDDLLPRLTLEEKVIQLSDSWGSKGIARLKIPAM-LKTEGLHG------ 124
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
+ T FP + ++F+ L +G A A+EA+A NL + W+P +++
Sbjct: 125 ----QSYATGSTIFPHGINMGSTFDTELIQEVGKATAIEAKAA-NLRVS----WSPVLDV 175
Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
RD RWGR +ET GEDP +V V ++K FQGE+
Sbjct: 176 ARDARWGRVEETYGEDPYLVGRIGVAWIKGFQGEH------------------------- 210
Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
+ AC KH + G R S + ++++ + PFR I++ A +
Sbjct: 211 -------MFACPKHFAGHGQPVGG---RDSHDYGLSDRVMRNIHLAPFRDVIKEANAFGV 260
Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGV 350
M +Y NGVP +L QK R EWGF+G++ SDC I Q+ T E++AA
Sbjct: 261 MAAYGLWNGVPDNGSKELLQKILREEWGFEGFVVSDCSGPENIQRKQSVVGTMEEAAAMA 320
Query: 351 LKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-YGK 409
++AG+DI CG+ + SA+ KG ++E ++D L +F ++RLGLF+ + + K
Sbjct: 321 VRAGVDIECGSAYKKALASAVKKGIIKESELDANLRRVFRAKMRLGLFDRPSIENMVWNK 380
Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY 469
L D T EH+ LA A + VLLKN+ LPL+KN + ++A+IGP N G Y
Sbjct: 381 LPEYD--TPEHRALARKVAVKSTVLLKNENNLLPLDKN-IKTIAVIGP--NADQGQTGDY 435
Query: 470 TG--IPCSPKSLLRGLEAYVS---KTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVA 524
+ P S+L G++ +VS K YA GC + ++ GF EAV IAK+AD VI+V
Sbjct: 436 SAKYAPGQIISVLEGVKNHVSPSTKVLYAQGCTQLDMDT-TGFAEAVNIAKQADAVILVV 494
Query: 525 GLDLTQET---------EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
G + + E+ D +L +PG Q L+ +V T K PV+LVL G P +++
Sbjct: 495 GDNSNRHENGNKKSTTGENVDGATLEIPGVQRQLIKAVEATGK-PVVLVLVNGKPFTLTW 553
Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRAD 635
D I SIL YPGE G A A+IIFGD NP GRLP++ +P ++P+
Sbjct: 554 --EDENIESILETWYPGEEGGNATADIIFGDENPSGRLPIS-FPRHPGQLPLW------- 603
Query: 636 SSRQYPGRSYRFYTG--TQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
+ + GR+Y +Y T +Y FGHGLSYT + Y + A+ K+G +
Sbjct: 604 YNYETSGRNYDYYDMPFTPLYRFGHGLSYTTFRYS--------NLKATTKSGDPGFVT-- 653
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
V + + N G G V L+ S T L GF
Sbjct: 654 ----------------------VSVDIENTGKRPGEEVAQLYITDLVASVNTAVIDLKGF 691
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
RV K ++F ++P LS+ N +R+L G + VG
Sbjct: 692 KRVFLKPGEKKTVTFELNPY-LLSLLNPDMKRVLEAGKFRMHVG 734
>gi|390630430|ref|ZP_10258413.1| Beta-xylosidase B [Weissella confusa LBAE C39-2]
gi|390484359|emb|CCF30761.1| Beta-xylosidase B [Weissella confusa LBAE C39-2]
Length = 674
Score = 320 bits (820), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 217/736 (29%), Positives = 350/736 (47%), Gaps = 121/736 (16%)
Query: 91 LGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVE 150
+ IP Y +W+E+LHG+A G T FPQ + AA+F+ L + I + E
Sbjct: 1 MNIPEYNYWNEALHGVARAG----------VATVFPQAIGLAATFDDHLINEIADVIGTE 50
Query: 151 ARAMYNLGQA--------GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSF 202
RA YN GLTFW+PN+NIFRDPRWGRG ET GEDP + S + V F+K
Sbjct: 51 GRAKYNEFTKHDDRDIYKGLTFWSPNVNIFRDPRWGRGHETYGEDPFLTSKFGVAFIKGL 110
Query: 203 QGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSF 262
QG+ L L+A KH + + R+ F
Sbjct: 111 QGQ----------------------------AKYLKLAATAKHFAVHSGPEG---LRHGF 139
Query: 263 NAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKG 321
+A+++++D +T+ P F++ +E+ IM +YN V+GVPA + L + ++W F+G
Sbjct: 140 DAVVSDKDLYETYLPAFKAAVEEADVESIMTAYNAVDGVPASVSEMLLKDILHDKWSFEG 199
Query: 322 YITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDI 381
++ SD A + E YTK ++ +KAG+++ G + + A+D+G V E++I
Sbjct: 200 HVVSDYMAPEDVHENHKYTKDAAETMGLAIKAGLNLVAG-HIEQSLHEALDRGLVTEEEI 258
Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF 441
A+++L++ ++RLG+F D +Y + + T H L+ AA + VLLKND
Sbjct: 259 TNAVISLYATRVRLGMFATD---NEYDAIPYEANDTKAHNNLSEIAAEKSFVLLKNDG-V 314
Query: 442 LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASG--- 495
LPL K + ++A++GP ++ + G Y G P ++L G++ + + HY+ G
Sbjct: 315 LPLRKETMEAIAVVGPNAHSEIALLGNYFGTPSRSYTILEGIQERLGDDVRVHYSIGSGL 374
Query: 496 ----CHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRD---------RVSLLL 542
+ +D EAV A+ +D V+ V GLD T E E+ D + +L L
Sbjct: 375 FQDHAAEPLAKADERESEAVIAAEHSDVVVAVLGLDSTIEGEEGDAGNSQGAGDKPNLSL 434
Query: 543 PGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEI 602
PG+Q L+ + K PV+++L G L + E + +I+ I YPG G A+A++
Sbjct: 435 PGRQRQLLERLLAVGK-PVVVLLASGSSLQLDGLENHPNLRAIMQIWYPGARGGLAVADV 493
Query: 603 IFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
+FG +P G+LP+T+Y D NM GR+YR+ T +Y FG+GL+Y
Sbjct: 494 LFGAVSPSGKLPVTFYKNVDNLPAFEDYNM--------AGRTYRYMTDEALYPFGYGLTY 545
Query: 663 TNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTN 722
++ L S + D T V ++ N
Sbjct: 546 SSVELSDLQVKS---------------------------YEDTAT--------VTATIQN 570
Query: 723 AGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKE-ISFGVDPCEQLSIANK 781
G+ D VV ++ + P QL GF RV+ + KG+K+ I+F + P + + +
Sbjct: 571 TGNFDTDEVVQVYVKDLGSEFAVPNAQLKGFKRVY-LGKGAKQTITFDLRP-QDFEVFDA 628
Query: 782 HGRRILPLGNHVLMVG 797
GR + + VG
Sbjct: 629 QGRNFIDSDRFEISVG 644
>gi|308080460|ref|NP_001183746.1| uncharacterized protein LOC100502339 [Zea mays]
gi|238014360|gb|ACR38215.1| unknown [Zea mays]
Length = 344
Score = 320 bits (819), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 151/328 (46%), Positives = 209/328 (63%), Gaps = 1/328 (0%)
Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDL 528
Y G PC + L+GL++YV+ + +GC+ C+ A +AV +A D+V + GL
Sbjct: 7 YFGPPCESTTPLKGLQSYVNDVRFLAGCNSAACDV-AATDQAVALAGSEDYVFLFMGLSQ 65
Query: 529 TQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWI 588
QE+E +DR SLLLPG Q SL+T+VA SKRPVILVL GGP+D++FA+++ +I +ILW
Sbjct: 66 KQESEGKDRTSLLLPGMQQSLITAVADASKRPVILVLLSGGPVDITFAQSNPKIGAILWA 125
Query: 589 GYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFY 648
GYPG+AG A+A+++FGD NP GRLP+TWYPE FTKVPM DM MRAD + YPGRSYRFY
Sbjct: 126 GYPGQAGGLAIAKVLFGDHNPSGRLPVTWYPEEFTKVPMTDMRMRADPTSGYPGRSYRFY 185
Query: 649 TGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTS 708
G VY FG+GLSY+ +S + + S +S++L G + + Q G R +V
Sbjct: 186 QGNTVYKFGYGLSYSTFSRRLVHGTSVPALSSTLLTGLRETMTPQDGDRSYHVDAIGTEG 245
Query: 709 CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
C L+F + V N G +DG H V++F R P QG P QLIGF H A + ++ F
Sbjct: 246 CEQLKFPAMVEVQNHGPMDGKHSVLMFLRWPNTKQGRPASQLIGFRSQHLKAGETAKLRF 305
Query: 769 GVDPCEQLSIANKHGRRILPLGNHVLMV 796
+ PC+ S GR+++ +G+H LMV
Sbjct: 306 DISPCKHFSRVRADGRKVIDIGSHFLMV 333
>gi|297740661|emb|CBI30843.3| unnamed protein product [Vitis vinifera]
Length = 401
Score = 319 bits (818), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 172/435 (39%), Positives = 248/435 (57%), Gaps = 41/435 (9%)
Query: 373 KGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGI 432
+GK +E+D+D +L NL+ V ++G F+G P Y L D+CT EH +LA DAARQGI
Sbjct: 2 QGKAREEDVDTSLRNLYIVLTQVGFFDGIP---SYESLDKKDLCTKEHIELAADAARQGI 58
Query: 433 VLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHY 492
VLLKN + LPL+ + +LA+IGP N +M G Y G+PC S L G AY K Y
Sbjct: 59 VLLKNINETLPLDPAKLKNLALIGPHANATIEMLGNYAGVPCQYSSPLDGFSAY-GKVTY 117
Query: 493 ASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTS 552
GC++V C++ AV +K AD I++ GLD T E E DR LLLPG Q L+
Sbjct: 118 EMGCNNVTCDNKTFIMPAVEASKNADATILLVGLDKTVEGEGLDRNDLLLPGYQTELILQ 177
Query: 553 VARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGR 612
V SK P+ILV+ G +D+SF++ D ++ +ILW GYPGE G +A+A++++G +NPGGR
Sbjct: 178 VIVASKGPIILVIMSGSAVDISFSKTDDRVKAILWAGYPGEEGGRAIADVVYGKYNPGGR 237
Query: 613 LPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLS 671
LP+TW+ + + +PM M++R YPGR+Y+F+ G+ VY FGHGLSYT ++Y S
Sbjct: 238 LPLTWHQNDYLSMLPMTSMSLRP--VNNYPGRTYKFFNGSVVYPFGHGLSYTKFNYTLRS 295
Query: 672 APSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHV 731
+ SC F + I V N G G+ V
Sbjct: 296 SN---------------------------------MSCKD-HFELDIEVKNIGAKHGNEV 321
Query: 732 VMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGN 791
V+++++ P GT KQ+IGF RV A GS+ + F + C+ L I + ++LP G
Sbjct: 322 VLVYSKPPTGIVGTHAKQVIGFKRVFVPAGGSQNVKFEFNVCKSLGIVGYNAYKLLPSGE 381
Query: 792 HVLMVGELRHSLTIE 806
H +++G+ SL I+
Sbjct: 382 HKIIIGDSPTSLPID 396
>gi|5690010|emb|CAB51937.1| Family 3 Glycoside Hydrolase [Ruminococcus flavefaciens]
Length = 690
Score = 318 bits (815), Expect = 7e-84, Method: Compositional matrix adjust.
Identities = 241/784 (30%), Positives = 351/784 (44%), Gaps = 139/784 (17%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ + +LS RA+ + L+ +EK +Q +A A RLG AY WWSE LHG+A G
Sbjct: 6 YLDEALSDLERAEDITDRLSTEEKAEQQKYDAPAEERLGKDAYNWWSEGLHGVARAG--- 62
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
+ T FPQ + AA F+ G + EARA YN A GLT W
Sbjct: 63 -------TATMFPQTIGMAAMFDDEAVHRAGETTSREARAKYNEYSAHDDRDIYKGLTLW 115
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
+PN+NIFRDPRWGRGQET GEDP + S V + K QG+
Sbjct: 116 SPNVNIFRDPRWGRGQETYGEDPYLTSCLGVAYAKGLQGD-------------------- 155
Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
G L +AC KH + + +R+ F+A +D +T+ F + ++
Sbjct: 156 --------GKVLRTAACAKHFAVHSGPE---ATRHEFDAKANMKDMTETYIAAFEALVKD 204
Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHED 345
K +M +YN+VNG PAC D EWGF G+ SDC A+ TKT +
Sbjct: 205 AKVESVMGAYNRVNGEPACA-SDFVMNKLEEWGFDGHFVSDCWAIRDFHTNHGVTKTAPE 263
Query: 346 SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG 405
SAA LK G D+NCG L H +A ++G + E+D+ R+ + L ++RLG+F+ +
Sbjct: 264 SAALALKKGCDLNCGNTYL-HLLAAFNEGLINEEDLRRSCIKLMRTRVRLGMFD---KST 319
Query: 406 KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQM 465
+Y L D V EHK+ +L + + +VLLKN+ LPL+ + ++ +IGP +++ +
Sbjct: 320 EYDGLDYDIVACDEHKEFSLRCSERSMVLLKNN-GILPLDGSKYKTIGVIGPNADSVPAL 378
Query: 466 GGGYTGIPCSPKSLLRGL-EAYVSKTHYASGCH--DVPCNS----DAGFHEAVRIAKKAD 518
G Y G + L G+ EA+ + Y G H C D EA I +
Sbjct: 379 EGNYNGKADEYITFLSGIREAHDGRVLYTEGSHLYKDRCMGLALPDDRLSEAEIITRTLR 438
Query: 519 FVIVVAGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGG 569
+ LD T E E+ D+ L LP Q LV +V K PVI+V G
Sbjct: 439 CSGSLCWLDATIEGEEGDTGNEFSSGDKNDLRLPESQRKLVKTVMAKGK-PVIIVTAAGS 497
Query: 570 PLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MN 628
++V A+ D+ I + W YPG+ G +ALA I+FG +P G+LP+T+Y E +K+P +
Sbjct: 498 AINVE-ADCDALIQA--W--YPGQLGGRALANILFGKVSPSGKLPVTFY-EDASKLPDFS 551
Query: 629 DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDK 688
D +M+ R+YR+ G ++ FG+GL+Y+
Sbjct: 552 DYSMK--------NRTYRYSEGNILFPFGYGLTYSE------------------------ 579
Query: 689 NILQQTGSRLDYVHIDEVTSCTSLRFH---VQISVTNAGDVDGSHVVMLFARVPKVSQGT 745
T C+ L F + VTN G VV ++ +
Sbjct: 580 ------------------TECSELSFENGVATVKVTNTGSRFTEDVVQIYIK-GYSENAV 620
Query: 746 PEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG-----ELR 800
P L GF RV A S+ + + P N+ G I L G EL
Sbjct: 621 PNHSLCGFKRVALDAGESRIVQITL-PERAFMAVNEKGEWIKEGSEFTLYAGTSQPDELS 679
Query: 801 HSLT 804
LT
Sbjct: 680 KKLT 683
>gi|390340546|ref|XP_001186857.2| PREDICTED: probable beta-D-xylosidase 2-like [Strongylocentrotus
purpuratus]
Length = 623
Score = 318 bits (814), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 221/632 (34%), Positives = 320/632 (50%), Gaps = 84/632 (13%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDN-------ASAIPRLGIPAYEWWSESLHG 105
PF N SL R L+S L + + QL+ A AI RL I Y W +E L G
Sbjct: 31 PFWNQSLPWDQRLDDLLSRLKVDDMTYQLARGGADPNGPAPAIGRLQIGKYVWNTECLRG 90
Query: 106 IASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL-------- 157
A G + T+FPQ L +A+F+R L + +A E RA YN
Sbjct: 91 DAQAG----------NATAFPQALGLSAAFSRDLLFEVANATGYEVRAKYNYYLQKGDFN 140
Query: 158 GQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFG 217
GL ++P INI R P WGR QET GEDP + A FV QG + +
Sbjct: 141 NHQGLNCFSPVINIMRHPYWGRNQETYGEDPYLTGELAKSFVWGLQGNHPRY-------- 192
Query: 218 FREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQP 277
L+ +A CKH AY + SR+SF+A ++++D + TF P
Sbjct: 193 -------------------LLTNAGCKHFAAYSGPENYPSSRFSFDAKVSDKDLQVTFFP 233
Query: 278 PFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEY 336
F+ CI+ G S +MCSYN VNG+PAC L R EWGFKGY+ SD A+
Sbjct: 234 AFKECIKAGTYS-VMCSYNSVNGIPACANSYLLNDVLRTEWGFKGYVVSDQRALELEELA 292
Query: 337 QNYTKTHEDSAAGVLKAGMDINCGT---CMLRHTQSAIDKGKVQEKDIDRALLNLFSVQL 393
NYT ++ D+A LKAG +++ GT + + A++ G + +D+ ++ LF +L
Sbjct: 293 HNYTTSYLDTAIKSLKAGCNLDLGTTKPAVYDYLAEAVELGMLTAQDLRDSIAPLFYTRL 352
Query: 394 RLGLFNGDPRKGKYGKLGPDDVCTS-EHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSL 452
RLG F+ P Y KL D V S EH+++AL AA + VL+KND LP+ + + +L
Sbjct: 353 RLGEFD-PPDHNPYVKLNVDQVVESPEHQEIALKAALKSFVLVKNDGSTLPI-EGTIHTL 410
Query: 453 AIIGPLVNNISQMGGGYTGIPCSPK---SLLRGLEAYVSKTHYASGCHDVPCNS--DAGF 507
A++GP NN + G Y P P+ ++L GL +KT +ASGC C + G
Sbjct: 411 AVVGPFANNSKLLFGDYAPNP-DPRFVTTVLEGLSPMATKTRHASGCPSPKCVTYDQQGV 469
Query: 508 HEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVAR-TSKRPVILVLT 566
AV AD V+V G + E+E DR +LLPG+Q L+ AR + +PVIL+L
Sbjct: 470 LNAV---TGADVVVVCLGTGIELESEGNDRRDMLLPGKQEQLLQDAARYAAGKPVILLLF 526
Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD---FNPGGRLPMTWYPESFT 623
GPL++++A + + +I+ +P +A AL ++F + NPGGRLP TW P +
Sbjct: 527 NAGPLNITWALSSPSVQAIVECFFPAQATGVAL-RMMFQNAPGANPGGRLPSTW-PATVA 584
Query: 624 KV-PMNDMNMRADSSRQYPGRSYRFYTGTQVY 654
++ PM + +M GR+YR++ TQ +
Sbjct: 585 QIPPMENYSMD--------GRTYRYFMATQCF 608
>gi|289670678|ref|ZP_06491753.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. musacearum
NCPPB 4381]
Length = 886
Score = 317 bits (813), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 177/445 (39%), Positives = 246/445 (55%), Gaps = 56/445 (12%)
Query: 64 RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
RA +L++ ++ +EK+ Q ++A AIPRLGIPAYEWWSE LHGIA NG T
Sbjct: 37 RAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------HAT 86
Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTFWAPNINIFRD 174
FPQ + AAS+N +L +G+ V+ EARA +N AGLT W+PNINIFRD
Sbjct: 87 VFPQAIGLAASWNTNLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFRD 146
Query: 175 PRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRG 234
PRWGRG ET GEDP + AV F++ QGE+ R +
Sbjct: 147 PRWGRGMETYGEDPFLTGQMAVGFIRGLQGED-----------LNHPRTI---------- 185
Query: 235 DELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCS 294
A KHL + + G R+ F+ ++ D E T+ P FR+ + QG+A +MC+
Sbjct: 186 ------ATPKHLAVHSGPEPG---RHGFDVDVSPHDVEATYTPAFRAALVQGQAGSVMCA 236
Query: 295 YNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKA 353
YN ++G PAC L + R +WGFKG++ SDCDAV + ++ + + S+A LKA
Sbjct: 237 YNSLHGTPACAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKA 296
Query: 354 GMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPD 413
G D+NCG R +AI++G V E +D++L+ LF+ + RLG PRK Y +LG
Sbjct: 297 GHDLNCGHAY-RELGTAIERGDVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAK 354
Query: 414 DVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIP 473
DV + H+ LAL AA + IVLLKND LPL NA + LA+IGP + ++ + Y G
Sbjct: 355 DVDNAAHRALALQAAAESIVLLKNDANTLPL--NAGTRLAVIGPNADALAALEANYQGTS 412
Query: 474 CSPKSLLRGLEAY--VSKTHYASGC 496
+P + L GL + YA G
Sbjct: 413 SAPVTPLLGLRQRFGAQQVRYAQGA 437
Score = 119 bits (297), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 89/292 (30%), Positives = 145/292 (49%), Gaps = 54/292 (18%)
Query: 516 KADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVL 565
++D V+ GL E E+ DR + LP Q +L+ A+ S +P+++VL
Sbjct: 614 QSDAVVAFVGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVL 672
Query: 566 TGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV 625
G + +++A+ + W YPG++G A+A ++ GD NPGGRLP+T+Y
Sbjct: 673 MSGSAVALNWAKTHADAIVAAW--YPGQSGGTAIARMLAGDDNPGGRLPVTFY------- 723
Query: 626 PMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAG 685
+ ++ A S GR+YR++ G ++ FG+GLSYT ++Y AP +L+ S +L+AG
Sbjct: 724 -RSTKDLPAYVSYDMKGRTYRYFKGEPLFPFGYGLSYTRFAY---DAP-QLS-STTLQAG 777
Query: 686 SDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT 745
N LQ V +V N G G V ++ + P Q +
Sbjct: 778 ---NPLQ-----------------------VTTTVRNTGTHAGDEVAQVYLQYPDRPQ-S 810
Query: 746 PEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
P + L+GF RVH A + ++F +D LS ++ G+R + GN+ L VG
Sbjct: 811 PLRSLVGFQRVHLAAGEQRTLTFNLD-ARALSDVDRSGQRAVEAGNYTLFVG 861
>gi|427385932|ref|ZP_18882239.1| hypothetical protein HMPREF9447_03272 [Bacteroides oleiciplenus YIT
12058]
gi|425726971|gb|EKU89834.1| hypothetical protein HMPREF9447_03272 [Bacteroides oleiciplenus YIT
12058]
Length = 732
Score = 317 bits (811), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 228/782 (29%), Positives = 372/782 (47%), Gaps = 115/782 (14%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAY-EWWSESLHGIASNGPG 112
F N +S+ R L+S LTL++K Q L+ + G + W++ LHG+ P
Sbjct: 33 FLNQEMSMEARVADLMSRLTLEQKAQLLNHRGKTVVVDGFSIRADQWNQCLHGVKWTEP- 91
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ---------AGLT 163
T+FP + A+++ L + + ++ EARA+YN + GL
Sbjct: 92 ---------TTNFPTSIALGATWDTELIHRVATVISDEARAIYNGWKQDPEFRGEHKGLI 142
Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
+ +P INI R+P WGR E GEDP V +VK QG
Sbjct: 143 YRSPVINISRNPYWGRINEIFGEDPYHTGRMGVAYVKGLQG------------------- 183
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
D L L++ KH ++E R +A + E+ + + P F+ CI
Sbjct: 184 --------DDSHYLKLASTLKHYAVNNVE----VDRMKLSAQVPERMLYEYWLPHFKDCI 231
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
+GKA +M SYN +NGVP + L +N+WG +G++ SD V T+ E + +
Sbjct: 232 VEGKAQSVMASYNAINGVPNNINKLLLTDILKNQWGHEGFVVSDLGGVKTMVEGHHQRQI 291
Query: 343 HEDSAAG-VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
+ A G + AG D + ++ A+ KG + E+ ++ AL + V+ RLG F+ D
Sbjct: 292 SCEEAVGRSIMAGCDFSDAEYE-KYIPDALRKGYLTEERLNDALRRVLLVRFRLGEFD-D 349
Query: 402 PRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
+ Y ++ PD + EH+ L+L+AAR+ IVLLKN+KK LP++++ + +A+IGP +
Sbjct: 350 FKSVPYSRISPDVIGCKEHRNLSLEAARKSIVLLKNEKKLLPIDRSIIKRVAVIGPYADL 409
Query: 462 ISQMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCHDVPC------------NSDAG 506
+Q G Y G+P P + L+G++ V + Y G P + +A
Sbjct: 410 FNQ--GNYGGVPKDPVTPLQGIKNAVGNNVEVLYCKGAQITPVKVRKGQPIPPRFDKEAE 467
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
+AV +A+ +D V + G E E RDR +L+LPG Q LV +V +K+ V++VL
Sbjct: 468 MKKAVEMARNSDVVFLFVGTTADIEVEGRDRKTLVLPGNQNELVKAVYEVNKK-VVVVLM 526
Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
GP V+ E I ++L +PG+ G A+A+++FGD+NPGG+LP T Y S +VP
Sbjct: 527 SAGP--VAVPEVKKNIPAVLQAWWPGDEGGNAIADVLFGDYNPGGKLPYTMYA-SDEQVP 583
Query: 627 MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGS 686
D D S+ G +Y + ++ FGHGLSY+ + Y S+L IS+ +
Sbjct: 584 STD---EYDISK---GFTYMYLKKKPLFAFGHGLSYSKFHY------SDLQISSPV---- 627
Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
V +++ S V + V N G G VV L+ R K P
Sbjct: 628 --------------VSVNDTVS-------VVLKVKNMGKRTGEEVVQLYVRDVKAKVVRP 666
Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH-GRRILPLGNHVLMVGELRHSLTI 805
K+L GF R+ +EI + P + L+ ++ G ++ G+ +++G + +
Sbjct: 667 TKELRGFKRIALQPNEEQEIRLML-PVKSLAFYDESIGDFLVEPGSFEILLGSASDDIRL 725
Query: 806 ET 807
++
Sbjct: 726 QS 727
>gi|289664871|ref|ZP_06486452.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. vasculorum
NCPPB 702]
Length = 886
Score = 317 bits (811), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 177/445 (39%), Positives = 246/445 (55%), Gaps = 56/445 (12%)
Query: 64 RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
RA +L++ ++ +EK+ Q ++A AIPRLGIPAYEWWSE LHGIA NG T
Sbjct: 37 RAAALVAHMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------HAT 86
Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTFWAPNINIFRD 174
FPQ + AAS+N +L +G+ V+ EARA +N AGLT W+PNINIFRD
Sbjct: 87 VFPQAIGLAASWNTNLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFRD 146
Query: 175 PRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRG 234
PRWGRG ET GEDP + AV F++ QGE+ R +
Sbjct: 147 PRWGRGMETYGEDPFLTGQMAVGFIRGLQGED-----------LNHPRTI---------- 185
Query: 235 DELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCS 294
A KHL + + G R+ F+ ++ D E T+ P FR+ + QG+A +MC+
Sbjct: 186 ------ATPKHLAVHSGPEPG---RHGFDVDVSPHDVEATYTPAFRAALVQGQAGSVMCA 236
Query: 295 YNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKA 353
YN ++G PAC L + R +WGFKG++ SDCDAV + ++ + + S+A LKA
Sbjct: 237 YNSLHGTPACAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKA 296
Query: 354 GMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPD 413
G D+NCG R +AI++G V E +D++L+ LF+ + RLG PRK Y +LG
Sbjct: 297 GHDLNCGHAY-RELGTAIERGDVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAK 354
Query: 414 DVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIP 473
DV + H+ LAL AA + IVLLKND LPL NA + LA+IGP + ++ + Y G
Sbjct: 355 DVDNAAHRALALQAAAESIVLLKNDANTLPL--NAGTRLAVIGPNADALAALEANYQGTS 412
Query: 474 CSPKSLLRGLEAY--VSKTHYASGC 496
+P + L GL + YA G
Sbjct: 413 SAPVTPLLGLRQRFGAQQVRYAQGA 437
Score = 119 bits (297), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 89/292 (30%), Positives = 145/292 (49%), Gaps = 54/292 (18%)
Query: 516 KADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVL 565
++D V+ GL E E+ DR + LP Q +L+ A+ S +P+++VL
Sbjct: 614 QSDAVVAFVGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVL 672
Query: 566 TGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV 625
G + +++A+ + W YPG++G A+A ++ GD NPGGRLP+T+Y
Sbjct: 673 MSGSAVALNWAKTHADAIVAAW--YPGQSGGTAIARMLAGDDNPGGRLPVTFY------- 723
Query: 626 PMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAG 685
+ ++ A S GR+YR++ G ++ FG+GLSYT ++Y AP +L+ +A L+AG
Sbjct: 724 -RSTKDLPAYVSYDMKGRTYRYFKGEPLFPFGYGLSYTRFAY---DAP-QLSTTA-LQAG 777
Query: 686 SDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT 745
N LQ V +V N G G V ++ + P Q +
Sbjct: 778 ---NPLQ-----------------------VTTTVRNTGTRAGDEVAQVYLQYPDRPQ-S 810
Query: 746 PEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
P + L+GF RVH A + ++F +D LS ++ G+R + GN+ L VG
Sbjct: 811 PLRSLVGFQRVHLAAGEQRTLTFNLD-ARALSDVDRSGQRAVEAGNYTLFVG 861
>gi|282877070|ref|ZP_06285912.1| glycosyl hydrolase family 3 C-terminal domain protein [Prevotella
buccalis ATCC 35310]
gi|281300752|gb|EFA93079.1| glycosyl hydrolase family 3 C-terminal domain protein [Prevotella
buccalis ATCC 35310]
Length = 721
Score = 316 bits (809), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 220/749 (29%), Positives = 343/749 (45%), Gaps = 122/749 (16%)
Query: 51 SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
+YPF + LS RA L LTL+EK + +N+ +PRLGI ++WW E+LHG A G
Sbjct: 23 TYPFQDARLSFEQRADDLCKRLTLEEKAGLMQNNSKPVPRLGIKQFQWWGEALHGSARTG 82
Query: 111 PGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GL 162
T FPQ + AASF+ L + + + EARA YN+ +
Sbjct: 83 ----------LATVFPQTIGMAASFDDELLLQVFNIASTEARAKYNVAAKKGYFDTSWSV 132
Query: 163 TFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKR 222
+ W PN+NIFRDPRWGRGQET GEDP + S V+ QG G +
Sbjct: 133 SLWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGCAVVEGLQG------------GKGPHK 180
Query: 223 VLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNA-MITEQDTEDTFQPPFRS 281
K F AC KH + +W +R+S + ++ +D +T+ P F+
Sbjct: 181 YYKAF-------------ACAKHFAVHSGPEW---NRHSISIDDVSPRDFHETYLPAFKH 224
Query: 282 CIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFEYQNYT 340
++ G +MC+YN ++G P C L ++ R+EWGFKG + SDC A+ I+ + +
Sbjct: 225 LVQVGGVKEVMCAYNSIDGEPCCSDQRLLEQLLRDEWGFKGIVVSDCGAIDDIWR-KGFH 283
Query: 341 KTHEDSA---AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
+ D+A A +K G D++CG A+ GKV E+ ID++L L +++LG
Sbjct: 284 EVEPDAAHASARAVKGGTDMSCGQT-YGSLPEAVRLGKVTEERIDKSLKRLIVGRMQLGE 342
Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
F+ D ++ + DV T +++AL AR+ + LL N LPL+K + + ++GP
Sbjct: 343 FDPDSIT-RWNAISMKDVSTPASREVALKMARETMTLLHNPMHALPLSKQ-LKQVVVMGP 400
Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGCHDVP----CNSDAGFHEAV 511
N+ M G Y G P ++L G+ + + + GC V N + V
Sbjct: 401 NANDSVMMWGNYNGTPHHTVTILDGIRRKIGAQRVKFIEGCGLVEPHRRGNQALTTQQLV 460
Query: 512 RIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPV 561
VI V G+ E E DRV++ LP Q ++ ++ K+ +
Sbjct: 461 EEVGDNKTVIFVGGISPQLEGEQLEVEAKGFKGGDRVTIELPQVQREMIAALHAAGKQVI 520
Query: 562 ILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPES 621
++ +G + + +IL YPGE G +A+A+++FGD+NP G+LP+T+Y +
Sbjct: 521 MVNCSGSA---IGLVPEVTHTDAILQAWYPGERGGEAVADVLFGDYNPAGKLPVTFYRDD 577
Query: 622 FTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISAS 681
D NMR R+YR++ G ++ FGHGLSYT++ A
Sbjct: 578 SQLPDYLDYNMR--------NRTYRYFKGKPLFPFGHGLSYTSFKIG----------KAK 619
Query: 682 LKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKV 741
++ G + +SV N G DG VV L+
Sbjct: 620 MRNGK-----------------------------LTVSVKNTGKRDGEEVVQLYISCLDD 650
Query: 742 SQGTPEKQLIGFDRVHTVAKGSKEISFGV 770
G P K L GF R+ A + ++ +
Sbjct: 651 PNG-PIKSLRGFKRMALQAGEQRTVTLNL 678
>gi|58581402|ref|YP_200418.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae KACC
10331]
gi|58425996|gb|AAW75033.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae KACC
10331]
Length = 889
Score = 316 bits (809), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 178/453 (39%), Positives = 247/453 (54%), Gaps = 56/453 (12%)
Query: 56 NTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF 115
NT + RA L++ ++ +EK+ Q ++A AIPRLGIPAYEWWSE LHGIA NG
Sbjct: 32 NTDATPEQRAADLVAHMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----- 86
Query: 116 NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTFWA 166
T FPQ + AAS+N L +G+ V+ EARA +N AGLT W+
Sbjct: 87 -----YATVFPQAIGLAASWNTHLMQQVGTVVSTEARAKFNQAGRPGNDHKRYAGLTIWS 141
Query: 167 PNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKG 226
PNINIFRDPRWGRG ET GEDP + AV F+ QGE+
Sbjct: 142 PNINIFRDPRWGRGMETYGEDPFLTGQMAVGFIHGLQGEDL------------------- 182
Query: 227 FGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQG 286
D A KHL + + G R+ F+ ++ +D E T+ P FR+ I +G
Sbjct: 183 --------DHPRTIATPKHLAVHSGPEPG---RHGFDVDVSPRDVEATYTPAFRAAIVEG 231
Query: 287 KASCIMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHED 345
+A +MC+YN ++G PAC L + R +WGFKG++ SDCDAV + ++ + +
Sbjct: 232 QAGAVMCAYNSLHGTPACAADWLINGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAG 291
Query: 346 SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG 405
S+A LKAG D+NCG R +AI +G+V E +D++L+ LF+ + RLG PRK
Sbjct: 292 SSAAALKAGHDLNCGHAY-RELGTAIARGEVDEALLDQSLVRLFAARYRLGELEA-PRKD 349
Query: 406 KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQM 465
Y +LG DV ++H+ LAL AA + IVLLKN+ LPL NA + LA+IGP + ++ +
Sbjct: 350 PYARLGAKDVDNAQHRALALQAAAESIVLLKNNANTLPL--NAGTRLAVIGPNADALAAL 407
Query: 466 GGGYTGIPCSPKSLLRGLEAY--VSKTHYASGC 496
Y G +P + L GL + YA G
Sbjct: 408 EANYQGTSSAPVTPLLGLRQRFGAQQVSYAQGA 440
Score = 117 bits (292), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 80/262 (30%), Positives = 133/262 (50%), Gaps = 44/262 (16%)
Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
DR + LP Q +L+ A+ S +P+++VL G + +++A+ + W YPG++G
Sbjct: 647 DRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKTHADAIVAAW--YPGQSG 703
Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
A+A ++ GD NPGGRLP+T+Y + ++ A S GR+YR++ G ++
Sbjct: 704 GTAIARMLAGDDNPGGRLPVTFY--------RSTKDLPAYVSYDMKGRTYRYFKGEPLFP 755
Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
FG+GLSYT ++Y AP +L+ S +++AGS
Sbjct: 756 FGYGLSYTRFAY---DAP-QLS-STAVQAGST--------------------------LQ 784
Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
V +V N G G V ++ + P Q +P + L+GF RVH A + ++F +D
Sbjct: 785 VTTTVRNTGARAGDEVAQVYLQYPDRPQ-SPLRSLVGFQRVHLAAGEQRTLTFNLD-ARA 842
Query: 776 LSIANKHGRRILPLGNHVLMVG 797
LS ++ G+R + GN+ L VG
Sbjct: 843 LSDVDRSGQRAVEAGNYTLFVG 864
>gi|294462719|gb|ADE76904.1| unknown [Picea sitchensis]
Length = 304
Score = 315 bits (807), Expect = 7e-83, Method: Compositional matrix adjust.
Identities = 152/303 (50%), Positives = 208/303 (68%), Gaps = 9/303 (2%)
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
F EAV AKK D VIVV GLD TQE E+RDR+ L LPGQQ +LV V+R +KRPV+ V+
Sbjct: 2 FGEAVETAKKVDEVIVVVGLDQTQEKEERDRIKLTLPGQQENLVYQVSRAAKRPVVFVIL 61
Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
GGP+DVSFA D QISSI+W GYPG+AG +ALAEIIFGD+NPGGRLPMTWYP+ F K+P
Sbjct: 62 SGGPVDVSFAVNDPQISSIIWAGYPGQAGGQALAEIIFGDYNPGGRLPMTWYPQDFVKIP 121
Query: 627 MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAP-SELTISASLKAG 685
M DMNMR + YPGR+YRFYTG +V+ FG GLSY+ YSY F S ++ ++ +++
Sbjct: 122 MTDMNMRPNHDTGYPGRTYRFYTGKKVFEFGQGLSYSAYSYNFSSTTIQKIDLNVTMEH- 180
Query: 686 SDKNILQQTGSR-LDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQG 744
+ G+R +V + E T C L+F I V N +DG H V+L+++ P +G
Sbjct: 181 -----FEALGNRGKGHVRV-ENTPCRKLKFRSSIFVRNHDKMDGRHAVLLYSKSPATHKG 234
Query: 745 TPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLT 804
P+KQLIGF VH K + +++F V PC+ S ++G+R+L +G+H L+VG+ ++ ++
Sbjct: 235 APQKQLIGFRSVHVQGKHTAKVTFVVKPCDHFSTVEENGQRLLAIGSHSLIVGDTQYPVS 294
Query: 805 IET 807
+ T
Sbjct: 295 LVT 297
>gi|325925754|ref|ZP_08187127.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas perforans
91-118]
gi|325543811|gb|EGD15221.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas perforans
91-118]
Length = 874
Score = 315 bits (806), Expect = 7e-83, Method: Compositional matrix adjust.
Identities = 177/453 (39%), Positives = 248/453 (54%), Gaps = 56/453 (12%)
Query: 56 NTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF 115
NT + RA +L++ ++ +EK+ Q ++A AIPRLGIPAYEWWSE LHGIA NG
Sbjct: 17 NTDATPEQRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----- 71
Query: 116 NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTFWA 166
T FPQ + AAS+N L +G+ V+ EARA +N AGLT W+
Sbjct: 72 -----YATVFPQAIGLAASWNTRLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWS 126
Query: 167 PNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKG 226
PNINIFRDPRWGRG ET GEDP + AV F++ QGE+ R +
Sbjct: 127 PNINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQGED-----------LNHPRTI-- 173
Query: 227 FGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQG 286
A KH+ + + G R+ F+ ++ +D E T+ P FR+ I +G
Sbjct: 174 --------------ATPKHIAVHSGPEPG---RHGFDVDVSPRDVEATYTPAFRAAIVEG 216
Query: 287 KASCIMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHED 345
+A +MC+YN ++G PAC L + R +WGFKG++ SDCDAV + ++ + +
Sbjct: 217 QAGSVMCAYNSLHGTPACAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAG 276
Query: 346 SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG 405
S+A LKAG D+NCG R +AI +G+V E +D++L+ LF+ + RLG PRK
Sbjct: 277 SSAAALKAGHDLNCGHA-YRELGTAIARGEVDEALLDQSLVRLFAARYRLGELEA-PRKD 334
Query: 406 KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQM 465
Y +LG DV + H+ LAL AA + IVLLKND LPL A + LA+IGP + ++ +
Sbjct: 335 PYARLGAKDVDNAAHRALALQAAAESIVLLKNDANTLPL--KAGTRLAVIGPNADALAAL 392
Query: 466 GGGYTGIPCSPKSLLRGLEAY--VSKTHYASGC 496
Y G +P + L GL + YA G
Sbjct: 393 EANYQGTSSAPVTPLLGLRQRFGAQQVSYAQGA 425
Score = 119 bits (298), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 80/262 (30%), Positives = 130/262 (49%), Gaps = 44/262 (16%)
Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
DR + LP Q +L+ A+ S +P+++VL G + +++A+ + W YPG++G
Sbjct: 632 DRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKTHADAIVAAW--YPGQSG 688
Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
A+A ++ GD NPGGRLP+T+Y + ++ A S GR+YR++ G ++
Sbjct: 689 GTAIARMLAGDDNPGGRLPVTFY--------RSTKDLPAYVSYDMKGRTYRYFKGEPLFA 740
Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
FG+GLSYT ++Y AP T +L+AGS
Sbjct: 741 FGYGLSYTRFAY---DAPQLSTT--TLQAGSS--------------------------LQ 769
Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
V +V N G G V ++ + P Q +P + L+GF RVH A + ++F +D
Sbjct: 770 VTTTVRNTGARAGDEVAQVYLQYPDRPQ-SPLRSLVGFQRVHLAAGEQRTLTFNLD-ARA 827
Query: 776 LSIANKHGRRILPLGNHVLMVG 797
LS ++ G+R + GN+ L VG
Sbjct: 828 LSDVDRSGQRAVEAGNYTLFVG 849
>gi|336275603|ref|XP_003352555.1| hypothetical protein SMAC_01389 [Sordaria macrospora k-hell]
gi|380094444|emb|CCC07823.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 833
Score = 315 bits (806), Expect = 8e-83, Method: Compositional matrix adjust.
Identities = 242/830 (29%), Positives = 365/830 (43%), Gaps = 146/830 (17%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
C+++ S RA SL+ LT+ EK+ L D + PRLG+P Y WWSE LHG+A + PGV
Sbjct: 37 CDSTASAPDRAASLVEQLTIDEKLVNLVDQSKGAPRLGLPPYAWWSEGLHGVAGS-PGVV 95
Query: 115 FNGT---VSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
FN + S TSF V+ A+ + L +G+A++ EARA G GL +W PNIN
Sbjct: 96 FNTSGYPFSYATSFANVITLGAALDDDLVYEVGTAISTEARAFAKFGFGGLDYWTPNINP 155
Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
++DPRWGRG ETPGEDP+ + Y V +G
Sbjct: 156 YKDPRWGRGAETPGEDPLRIKGYVKAMVAGLEGN-------------------------- 189
Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
G + A CKH AYDLE+W +RY F+A+++ QD + + PPF+ C + I
Sbjct: 190 --GTVRKVIATCKHFAAYDLERWRGLTRYDFDAVVSLQDLSEYYLPPFQQCARDSRVGSI 247
Query: 292 MC--------------------------------SYNQVNGVPACLRGDLFQKA-RNEWG 318
MC SYN +NG PAC L R+ W
Sbjct: 248 MCRYVSFFLPPFPSFPRLVTRQSGNQVDIVDNFRSYNALNGTPACASTYLMTNILRDHWN 307
Query: 319 F---KGYITSDCDAVATIF-EYQNYTKTHEDSAAGVLKAGMDINC---GTCMLRHTQSAI 371
+ YITSDC+A+ + N+++T ++AA AG D C G A
Sbjct: 308 WTNHNNYITSDCNAIQDFLPDNHNFSQTPAEAAAAAYIAGTDTVCEVSGWPPYTDVVGAY 367
Query: 372 DKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQG 431
++ + E ID AL L+ +R G + G+ PD S
Sbjct: 368 NQSLLSESVIDTALRRLYEGLIRAGYLD----HGRPASSSPDKAPFSS------------ 411
Query: 432 IVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH 491
FLPL+ ++A+IG N + G Y+G+P + + + +
Sbjct: 412 -------PDFLPLDLTG-KTVALIGHWANATRTIRGPYSGLPPFYHNPMYAVRQLKLSFY 463
Query: 492 YASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVT 551
YA+G ++D A+ A+ AD V+ G D T +ED DR S+ P Q++L+
Sbjct: 464 YANGPVVNSTDADTWTAAAMLAAESADVVLYFGGTDTTVASEDLDRESIAWPKTQLTLIE 523
Query: 552 SVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGG 611
+A+ K V++ L G +D + + ISSILW+GYPG++G A+ +++ G G
Sbjct: 524 KLAQVGKPMVVIQL--GDQVDDTPLLNNKNISSILWVGYPGQSGGTAVFDVLTGKKASAG 581
Query: 612 RLPMTWYPESFT-KVPMNDMNM--------------------------------RADSSR 638
RLP+T YP + +VP+ +M + R + +
Sbjct: 582 RLPVTQYPAGYVDEVPLTEMGLRPFNHSSSTTSSDVSQSGVEEGNGLTIQTRSTRGNKTL 641
Query: 639 QYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAG-SDKNILQQTGSR 697
PGR+Y++Y V FG+GL YT ++ + S S + S +++L T
Sbjct: 642 SSPGRTYKWYP-RPVLPFGYGLHYTPFNISLSLSTSSNASSTTDNTSISIRSLL--TSQT 698
Query: 698 LDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVH 757
+H+D C F V S+TN G +V +LF + P K L+G+ RV
Sbjct: 699 CTAIHLD---LCPFSPFSV--SITNTGSHTSDYVALLFLSGKFGPKPDPLKTLVGYKRVK 753
Query: 758 TVAKGSKEISFGVD-PCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
+ G + G D P ++A G GN VL G + L ++
Sbjct: 754 DIKPGETRVVGGEDIPVNLAAVARVDGN-----GNTVLYPGTYKFRLDVD 798
>gi|449489074|ref|XP_002195511.2| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
[Taeniopygia guttata]
Length = 685
Score = 315 bits (806), Expect = 8e-83, Method: Compositional matrix adjust.
Identities = 236/722 (32%), Positives = 340/722 (47%), Gaps = 110/722 (15%)
Query: 85 ASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIG 144
A IPRLGI Y W +E L G PG T+FPQ L AA+F+ L +
Sbjct: 6 APPIPRLGIAPYNWNTECLRG-DGEAPG--------WATAFPQALGLAAAFSPELIYRVA 56
Query: 145 SAVAVEARAMYNLGQA--------GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAV 196
+A A E RA +N A GL+ ++P +NI R P WGR QET GEDP + A
Sbjct: 57 NATATEVRAKHNSFAAAGRYSDHTGLSCFSPVLNIMRHPLWGRNQETYGEDPFLSGELAR 116
Query: 197 EFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGN 256
FV+ QG + R +K SA CKH + G+
Sbjct: 117 SFVQGLQGPH--------------PRYVKA-------------SAGCKHFSVHG----GH 145
Query: 257 FSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RN 315
+ + + E+D TF P F++C+ G S MCSYN++NGVPAC L R
Sbjct: 146 ENILLYLLTVLERDWRMTFLPQFQACVRAGSYS-FMCSYNRINGVPACANKKLLTDILRG 204
Query: 316 EWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCM----LRHTQSAI 371
EWGF GY+ SD AV I +YT++ ++A + AG ++ M A+
Sbjct: 205 EWGFDGYVVSDEGAVELIMLGHHYTRSFLETAVASVNAGCNLELSYGMRNNVFMRIPEAL 264
Query: 372 DKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQG 431
G + + + + LF ++RLG F+ P Y L V + EH+ L+L+AA +
Sbjct: 265 AMGNITLQMLRDRVRPLFYTRMRLGEFD-PPAMNPYSSLDLSVVQSPEHRNLSLEAAVKS 323
Query: 432 IVLLKNDKKFLPLNKNAVSS--LAIIGPLVNNISQMGGGYTGIPCSPKSLL---RGLEAY 486
VLLKN + LPL +SS LA++GP +N + G Y +P P+ + RGLE
Sbjct: 324 FVLLKNVRGTLPLKAQDLSSQHLAVVGPFADNPRVLFGDYAPVP-EPRYIYTPRRGLEML 382
Query: 487 VSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQ 546
+ +A+GC + C + E V++ AD V+V G + ETE +DR L LPG Q
Sbjct: 383 GANVSFAAGCSEPRCQRYSR-AELVKVVGAADVVLVCLGTGVDVETEAKDRSDLSLPGHQ 441
Query: 547 MSLVT-SVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFG 605
+ L+ +V + RPVIL+L GPLDVS+A+A + +IL +P +A A+A ++ G
Sbjct: 442 LELLQDAVQAAAGRPVILLLFNAGPLDVSWAQAHDGVGAILACFFPAQATGLAIARVLLG 501
Query: 606 D--FNPGGRLPMTWYPESFTKV-PMNDMNMRADSSRQYPGRSYRFY-TGTQVYGFGHGLS 661
+ +P GRLP TW P +V PM + M GR+YR+Y +Y FG+GLS
Sbjct: 502 EAGASPAGRLPATW-PAGMHQVPPMENYTME--------GRTYRYYGQEAPLYPFGYGLS 552
Query: 662 YTNYSYK-FLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISV 720
YT + Y+ + +P L + A+L V + +
Sbjct: 553 YTTFRYRDLVLSPPVLPLCANLS--------------------------------VSVVL 580
Query: 721 TNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIAN 780
N G D VV L+ R S P QL+ F RV A ++SF V EQ ++
Sbjct: 581 ENTGLRDSEEVVQLYLRWEHSSVPVPRWQLVAFRRVAVPAGREAKLSFQVL-AEQRAVWA 639
Query: 781 KH 782
+H
Sbjct: 640 QH 641
>gi|84623339|ref|YP_450711.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae MAFF
311018]
gi|188577358|ref|YP_001914287.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae PXO99A]
gi|84367279|dbj|BAE68437.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae MAFF
311018]
gi|188521810|gb|ACD59755.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae PXO99A]
Length = 889
Score = 314 bits (805), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 177/453 (39%), Positives = 247/453 (54%), Gaps = 56/453 (12%)
Query: 56 NTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF 115
NT + RA L++ ++ +EK+ Q ++A AIPRLGIPAYEWWSE LHGIA NG
Sbjct: 32 NTDATPEQRAADLVAHMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----- 86
Query: 116 NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTFWA 166
T FPQ + AAS+N L +G+ V+ EARA +N AGLT W+
Sbjct: 87 -----YATVFPQAIGLAASWNTHLMQQVGTVVSTEARAKFNQAGRPGKDHKRYAGLTIWS 141
Query: 167 PNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKG 226
PNINIFRDPRWGRG ET GEDP + AV F+ QG++
Sbjct: 142 PNINIFRDPRWGRGMETYGEDPFLTGQMAVGFIHGLQGDDL------------------- 182
Query: 227 FGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQG 286
D A KHL + + G R+ F+ ++ +D E T+ P FR+ I +G
Sbjct: 183 --------DHPRTIATPKHLAVHSGPEPG---RHGFDVDVSPRDVEATYTPAFRAAIVEG 231
Query: 287 KASCIMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHED 345
+A +MC+YN ++G PAC L + R +WGFKG++ SDCDAV + ++ + +
Sbjct: 232 QAGAVMCAYNSLHGTPACAADWLINGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAG 291
Query: 346 SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG 405
S+A LKAG D+NCG R +AI +G+V E +D++L+ LF+ + RLG PRK
Sbjct: 292 SSAAALKAGHDLNCGHAY-RELGTAIARGEVDEALLDQSLVRLFAARYRLGELEA-PRKD 349
Query: 406 KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQM 465
Y +LG DV ++H+ LAL AA + IVLLKN+ LPL NA + LA+IGP + ++ +
Sbjct: 350 PYARLGAKDVDNAQHRALALQAAAESIVLLKNNANTLPL--NAGTRLAVIGPNADALAAL 407
Query: 466 GGGYTGIPCSPKSLLRGLEAY--VSKTHYASGC 496
Y G +P + L GL + YA G
Sbjct: 408 EANYQGTSSAPVTPLLGLRQRFGAQQVSYAQGA 440
Score = 117 bits (292), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 80/262 (30%), Positives = 133/262 (50%), Gaps = 44/262 (16%)
Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
DR + LP Q +L+ A+ S +P+++VL G + +++A+ + W YPG++G
Sbjct: 647 DRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKTHADAIVAAW--YPGQSG 703
Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
A+A ++ GD NPGGRLP+T+Y + ++ A S GR+YR++ G ++
Sbjct: 704 GTAIARMLAGDDNPGGRLPVTFY--------RSTKDLPAYVSYDMKGRTYRYFKGEPLFP 755
Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
FG+GLSYT ++Y AP +L+ S +++AGS
Sbjct: 756 FGYGLSYTRFAY---DAP-QLS-STAVQAGST--------------------------LQ 784
Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
V +V N G G V ++ + P Q +P + L+GF RVH A + ++F +D
Sbjct: 785 VTTTVRNTGARAGDEVAQVYLQYPDRPQ-SPLRSLVGFQRVHLAAGEQRTLTFNLD-ARA 842
Query: 776 LSIANKHGRRILPLGNHVLMVG 797
LS ++ G+R + GN+ L VG
Sbjct: 843 LSDVDRSGQRAVEAGNYTLFVG 864
>gi|346725879|ref|YP_004852548.1| beta-glucosidase-related glycosidase [Xanthomonas axonopodis pv.
citrumelo F1]
gi|346650626|gb|AEO43250.1| Beta-glucosidase-related glycosidase [Xanthomonas axonopodis pv.
citrumelo F1]
Length = 889
Score = 314 bits (804), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 177/453 (39%), Positives = 248/453 (54%), Gaps = 56/453 (12%)
Query: 56 NTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF 115
NT + RA +L++ ++ +EK+ Q ++A AIPRLGIPAYEWWSE LHGIA NG
Sbjct: 32 NTDATPDQRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----- 86
Query: 116 NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTFWA 166
T FPQ + AAS+N L +G+ V+ EARA +N AGLT W+
Sbjct: 87 -----YATVFPQSIGLAASWNTRLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWS 141
Query: 167 PNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKG 226
PNINIFRDPRWGRG ET GEDP + AV F++ QGE+ R +
Sbjct: 142 PNINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQGED-----------LNHPRTI-- 188
Query: 227 FGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQG 286
A KH+ + + G R+ F+ ++ +D E T+ P FR+ I +G
Sbjct: 189 --------------ATPKHIAVHSGPEPG---RHGFDVDVSPRDVEATYTPAFRAAIVEG 231
Query: 287 KASCIMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHED 345
+A +MC+YN ++G PAC L + R +WGFKG++ SDCDAV + ++ + +
Sbjct: 232 QAGSVMCAYNSLHGTPACAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAG 291
Query: 346 SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG 405
S+A LKAG D+NCG R +AI +G+V E +D++L+ LF+ + RLG PRK
Sbjct: 292 SSAAALKAGHDLNCGHAY-RELGTAIARGEVDEALLDQSLVRLFATRYRLGELEA-PRKD 349
Query: 406 KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQM 465
Y +LG DV + H+ LAL AA + IVLLKND LPL A + LA+IGP + ++ +
Sbjct: 350 PYARLGAKDVDNAAHRALALQAAAESIVLLKNDANTLPL--KAGTRLAVIGPNADALAAL 407
Query: 466 GGGYTGIPCSPKSLLRGLEAY--VSKTHYASGC 496
Y G +P + L GL + YA G
Sbjct: 408 EANYQGTSSAPVTPLLGLRQRFGAQQVSYAQGA 440
Score = 119 bits (298), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 80/262 (30%), Positives = 130/262 (49%), Gaps = 44/262 (16%)
Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
DR + LP Q +L+ A+ S +P+++VL G + +++A+ + W YPG++G
Sbjct: 647 DRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKTHADAIVAAW--YPGQSG 703
Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
A+A ++ GD NPGGRLP+T+Y + ++ A S GR+YR++ G ++
Sbjct: 704 GTAIARMLAGDDNPGGRLPVTFY--------RSTKDLPAYVSYDMKGRTYRYFKGEPLFA 755
Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
FG+GLSYT ++Y AP T +L+AGS
Sbjct: 756 FGYGLSYTRFAY---DAPQLSTT--TLQAGSS--------------------------LQ 784
Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
V +V N G G V ++ + P Q +P + L+GF RVH A + ++F +D
Sbjct: 785 VTTTVRNTGARAGDEVAQVYLQYPDRPQ-SPLRSLVGFQRVHLAAGEQRTLTFNLD-ARA 842
Query: 776 LSIANKHGRRILPLGNHVLMVG 797
LS ++ G+R + GN+ L VG
Sbjct: 843 LSDVDRSGQRAVEAGNYTLFVG 864
>gi|78048767|ref|YP_364942.1| beta-glucosidase precursor [Xanthomonas campestris pv. vesicatoria
str. 85-10]
gi|78037197|emb|CAJ24942.1| beta-glucosidase precursor [Xanthomonas campestris pv. vesicatoria
str. 85-10]
Length = 889
Score = 314 bits (804), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 177/453 (39%), Positives = 248/453 (54%), Gaps = 56/453 (12%)
Query: 56 NTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF 115
NT + RA +L++ ++ +EK+ Q ++A AIPRLGIPAYEWWSE LHGIA NG
Sbjct: 32 NTDATPEQRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----- 86
Query: 116 NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTFWA 166
T FPQ + AAS+N L +G+ V+ EARA +N AGLT W+
Sbjct: 87 -----YATVFPQSIGLAASWNTRLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWS 141
Query: 167 PNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKG 226
PNINIFRDPRWGRG ET GEDP + AV F++ QGE+ R +
Sbjct: 142 PNINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQGED-----------LNHPRTI-- 188
Query: 227 FGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQG 286
A KH+ + + G R+ F+ ++ +D E T+ P FR+ I +G
Sbjct: 189 --------------ATPKHIAVHSGPEPG---RHGFDVDVSPRDVEATYTPAFRAAIVEG 231
Query: 287 KASCIMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHED 345
+A +MC+YN ++G PAC L + R +WGFKG++ SDCDAV + ++ + +
Sbjct: 232 QAGSVMCAYNSLHGTPACAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAG 291
Query: 346 SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG 405
S+A LKAG D+NCG R +AI +G+V E +D++L+ LF+ + RLG PRK
Sbjct: 292 SSAAALKAGHDLNCGHAY-RELGTAIARGEVDEALLDQSLVRLFAARYRLGELEA-PRKD 349
Query: 406 KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQM 465
Y +LG DV + H+ LAL AA + IVLLKND LPL A + LA+IGP + ++ +
Sbjct: 350 PYARLGAKDVDNAAHRALALQAAAESIVLLKNDANTLPL--KAGTRLAVIGPNADALAAL 407
Query: 466 GGGYTGIPCSPKSLLRGLEAY--VSKTHYASGC 496
Y G +P + L GL + YA G
Sbjct: 408 EANYQGTSSAPVTPLLGLRQRFGAQQVSYAQGA 440
Score = 119 bits (298), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 80/262 (30%), Positives = 130/262 (49%), Gaps = 44/262 (16%)
Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
DR + LP Q +L+ A+ S +P+++VL G + +++A+ + W YPG++G
Sbjct: 647 DRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKTHADAIVAAW--YPGQSG 703
Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
A+A ++ GD NPGGRLP+T+Y + ++ A S GR+YR++ G ++
Sbjct: 704 GTAIARMLAGDDNPGGRLPVTFY--------RSTKDLPAYVSYDMKGRTYRYFKGEPLFA 755
Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
FG+GLSYT ++Y AP T +L+AGS
Sbjct: 756 FGYGLSYTRFAY---DAPQLSTT--TLQAGSS--------------------------LQ 784
Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
V +V N G G V ++ + P Q +P + L+GF RVH A + ++F +D
Sbjct: 785 VTTTVRNTGARAGDEVAQVYLQYPDRPQ-SPLRSLVGFQRVHLAAGEQRTLTFNLD-ARA 842
Query: 776 LSIANKHGRRILPLGNHVLMVG 797
LS ++ G+R + GN+ L VG
Sbjct: 843 LSDVDRSGQRAVEAGNYTLFVG 864
>gi|294627323|ref|ZP_06705909.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 11122]
gi|292598405|gb|EFF42556.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 11122]
Length = 886
Score = 312 bits (800), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 176/449 (39%), Positives = 245/449 (54%), Gaps = 56/449 (12%)
Query: 60 SISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTV 119
S RA +L++ ++ +EK+ Q ++A AIPRLGIPAYEWWSE LHGIA NG
Sbjct: 33 SPEQRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG--------- 83
Query: 120 SSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTFWAPNIN 170
T FPQ + AAS+N SL +G+ V+ EARA +N AGLT W+PNIN
Sbjct: 84 -YATVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSPNIN 142
Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
IFRDPRWGRG ET GEDP + AV F++ QGE+
Sbjct: 143 IFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDL----------------------- 179
Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
D A KH+ + + G R+ F+ ++ D E T+ P FR+ + +G+A
Sbjct: 180 ----DHPRTIATPKHIAVHSGPEPG---RHGFDVDVSPHDVEATYTPAFRAALVEGQAGS 232
Query: 291 IMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
+MC+YN ++G PAC L + R +WGFKG++ SDCDAV + ++ + + S+A
Sbjct: 233 VMCAYNSLHGTPACAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA 292
Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGK 409
LKAG D+NCG R +AI++G V E +D++L+ LF+ + RLG PRK Y +
Sbjct: 293 ALKAGHDLNCGHAY-RDLGTAIERGDVDEALLDQSLVRLFAARYRLGELEA-PRKDPYAR 350
Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY 469
LG DV + H+ LAL AA + IVLLKND LPL A + LA+IGP + ++ + Y
Sbjct: 351 LGAKDVDNAAHRALALQAAAESIVLLKNDANTLPL--KAGTRLAVIGPNADALAALEANY 408
Query: 470 TGIPCSPKSLLRGLEAY--VSKTHYASGC 496
G +P + L GL + YA G
Sbjct: 409 QGTSSAPVTPLLGLRQRFGAQQVSYAQGA 437
Score = 115 bits (287), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 81/262 (30%), Positives = 132/262 (50%), Gaps = 44/262 (16%)
Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
DR + LP Q +L+ A+ S +P+++VL G + +++A+ + W YPG++G
Sbjct: 644 DRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKTHADAIVAAW--YPGQSG 700
Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
A+A ++ GD NPGGRLP+T+Y + ++ A S GR+YR++ G ++
Sbjct: 701 GTAIARMLAGDDNPGGRLPVTFY--------RSTKDLPAYVSYDMKGRTYRYFKGEPLFP 752
Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
FG+GLSYT ++Y AP T +L+AG N LQ
Sbjct: 753 FGYGLSYTRFAY---DAPQLSTT--TLQAG---NPLQ----------------------- 781
Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
V +V N G G V ++ + P Q +P + L+GF RVH A + ++F +D
Sbjct: 782 VTTTVRNTGARAGDEVAQVYLQYPDRPQ-SPLRSLVGFQRVHLAAGEQRTLTFHLD-ARA 839
Query: 776 LSIANKHGRRILPLGNHVLMVG 797
LS ++ G+R + G++ L VG
Sbjct: 840 LSDVDRSGQRAVEAGDYTLFVG 861
>gi|294665226|ref|ZP_06730524.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
gi|292605014|gb|EFF48367.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
Length = 886
Score = 312 bits (800), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 176/449 (39%), Positives = 245/449 (54%), Gaps = 56/449 (12%)
Query: 60 SISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTV 119
S RA +L++ ++ +EK+ Q ++A AIPRLGIPAYEWWSE LHGIA NG
Sbjct: 33 SPEQRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG--------- 83
Query: 120 SSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTFWAPNIN 170
T FPQ + AAS+N SL +G+ V+ EARA +N AGLT W+PNIN
Sbjct: 84 -YATVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSPNIN 142
Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
IFRDPRWGRG ET GEDP + AV F++ QGE+
Sbjct: 143 IFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDL----------------------- 179
Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
D A KH+ + + G R+ F+ ++ D E T+ P FR+ + +G+A
Sbjct: 180 ----DHPRTIATPKHIAVHSGPEPG---RHGFDVDVSPHDVEATYTPAFRAALVEGQAGS 232
Query: 291 IMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
+MC+YN ++G PAC L + R +WGFKG++ SDCDAV + ++ + + S+A
Sbjct: 233 VMCAYNSLHGTPACAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA 292
Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGK 409
LKAG D+NCG R +AI++G V E +D++L+ LF+ + RLG PRK Y +
Sbjct: 293 ALKAGHDLNCGHA-YRDLGTAIERGDVDEALLDQSLVRLFAARYRLGELEA-PRKDPYAR 350
Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY 469
LG DV + H+ LAL AA + IVLLKND LPL A + LA+IGP + ++ + Y
Sbjct: 351 LGAKDVDNAAHRALALQAAAESIVLLKNDANTLPL--KAGTRLAVIGPNADALAALEANY 408
Query: 470 TGIPCSPKSLLRGLEAY--VSKTHYASGC 496
G +P + L GL + YA G
Sbjct: 409 QGTSSAPVTPLLGLRQRFGAQQVSYAQGA 437
Score = 115 bits (288), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 81/262 (30%), Positives = 132/262 (50%), Gaps = 44/262 (16%)
Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
DR + LP Q +L+ A+ S +P+++VL G + +++A+ + W YPG++G
Sbjct: 644 DRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKTHADAIVAAW--YPGQSG 700
Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
A+A ++ GD NPGGRLP+T+Y + ++ A S GR+YR++ G ++
Sbjct: 701 GTAMARMLAGDDNPGGRLPVTFY--------RSTKDLPAYVSYDMKGRTYRYFKGEPLFP 752
Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
FG+GLSYT ++Y AP T +L+AG N LQ
Sbjct: 753 FGYGLSYTRFAY---DAPQLSTT--TLQAG---NPLQ----------------------- 781
Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
V +V N G G V ++ + P Q +P + L+GF RVH A + ++F +D
Sbjct: 782 VTTTVRNTGARAGDEVAQVYLQYPDRPQ-SPLRSLVGFQRVHLAAGEQRTLTFHLD-ARA 839
Query: 776 LSIANKHGRRILPLGNHVLMVG 797
LS ++ G+R + G++ L VG
Sbjct: 840 LSDVDRSGQRAVEAGDYTLFVG 861
>gi|381169747|ref|ZP_09878910.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
citri pv. mangiferaeindicae LMG 941]
gi|380689765|emb|CCG35397.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
citri pv. mangiferaeindicae LMG 941]
Length = 874
Score = 310 bits (795), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 176/456 (38%), Positives = 247/456 (54%), Gaps = 56/456 (12%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
P + S RA +L++ ++ +EK+ Q ++A AIPRLGIPAYEWWSE LHGIA NG
Sbjct: 14 PPAKPAGSPEQRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG-- 71
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLT 163
T FPQ + AAS+N SL +G+ V+ EARA +N AGLT
Sbjct: 72 --------YATVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLT 123
Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
W+PNINIFRDPRWGRG ET GEDP + AV F++ QGE+ R
Sbjct: 124 IWSPNINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQGED-----------LNHPRT 172
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
+ A KH+ + + G R+ F+ ++ D E T+ P FR+ +
Sbjct: 173 I----------------ATPKHIAVHSGPEPG---RHGFDVDVSPHDVEATYTPAFRAAL 213
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
+G+A +MC+YN ++G P C L + R +WGFKG++ SDCDAV + ++ +
Sbjct: 214 VEGQAGSVMCAYNALHGTPVCAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPD 273
Query: 343 HEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDP 402
+ S+A LKAG D+NCG R +AI +G+V E +D++L+ LF+ + RLG P
Sbjct: 274 NAGSSAAALKAGHDLNCGHAY-RELGTAIARGEVDEALLDQSLVRLFAARYRLGELEA-P 331
Query: 403 RKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNI 462
RK Y +LG DV + H+ LAL AA + IVLLKND LPL A + LA+IGP + +
Sbjct: 332 RKDPYARLGAKDVDNAAHRALALQAAAESIVLLKNDANTLPL--RAGTRLAVIGPNADAL 389
Query: 463 SQMGGGYTGIPCSPKSLLRGLEAY--VSKTHYASGC 496
+ + Y G +P + L GL + YA G
Sbjct: 390 AALEANYQGTSSAPVTPLLGLRQRFGAQQVSYAQGA 425
Score = 116 bits (291), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 81/262 (30%), Positives = 132/262 (50%), Gaps = 44/262 (16%)
Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
DR + LP Q +L+ A+ S +P+++VL G + +++A+ + W YPG++G
Sbjct: 632 DRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKMHADAIVAAW--YPGQSG 688
Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
A+A ++ GD NPGGRLP+T+Y + P +M+ GR+YR++ G ++
Sbjct: 689 GTAIARMLAGDDNPGGRLPVTFYRSTKDLPPYVSYDMK--------GRTYRYFKGEPLFP 740
Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
FG+GLSYT ++Y AP T +L+AG N LQ
Sbjct: 741 FGYGLSYTRFAY---DAPQLSTT--TLQAG---NPLQ----------------------- 769
Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
V +V N G G V ++ + P Q +P + L+GF RVH A + ++F +D
Sbjct: 770 VTATVRNTGARAGDEVAQVYLQYPDRPQ-SPLRSLVGFQRVHLAAGEQRTLTFHLD-ARA 827
Query: 776 LSIANKHGRRILPLGNHVLMVG 797
LS ++ G+R + G++ L VG
Sbjct: 828 LSDVDRSGQRAVEAGDYTLFVG 849
>gi|21243803|ref|NP_643385.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. citri str.
306]
gi|21109396|gb|AAM37921.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. citri str.
306]
Length = 886
Score = 310 bits (793), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 175/449 (38%), Positives = 245/449 (54%), Gaps = 56/449 (12%)
Query: 60 SISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTV 119
S RA +L++ ++ +EK+ Q ++A AIPRLGIPAYEWWSE LHGIA NG
Sbjct: 33 SPEQRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG--------- 83
Query: 120 SSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTFWAPNIN 170
T FPQ + AAS+N SL +G+ V+ EARA +N AGLT W+PNIN
Sbjct: 84 -YATVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSPNIN 142
Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
IFRDPRWGRG ET GEDP + AV F++ QGE+ R +
Sbjct: 143 IFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQGED-----------LNHPRTI------ 185
Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
A KH+ + + G R+ F+ ++ D E T+ P FR+ + +G+A
Sbjct: 186 ----------ATPKHIAVHSGPEPG---RHGFDVDVSPHDVEATYTPAFRAALVEGQAGS 232
Query: 291 IMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
+MC+YN ++G P C L + R +WGFKG++ SDCDAV + ++ + + S+A
Sbjct: 233 VMCAYNALHGTPVCAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA 292
Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGK 409
LKAG D+NCG R +AI +G+V E +D++L+ LF+ + RLG PRK Y +
Sbjct: 293 ALKAGHDLNCGHAY-RELGTAIARGEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYAR 350
Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY 469
LG DV + H+ LAL AA + IVLLKND LPL A + LA+IGP + ++ + Y
Sbjct: 351 LGAKDVDNAAHRALALQAAAESIVLLKNDANTLPL--RAGTRLAVIGPNADALAALEANY 408
Query: 470 TGIPCSPKSLLRGLEAY--VSKTHYASGC 496
G +P + L GL + YA G
Sbjct: 409 QGTSSAPVTPLLGLRQRFGAQQVSYAQGA 437
Score = 116 bits (291), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 81/262 (30%), Positives = 132/262 (50%), Gaps = 44/262 (16%)
Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
DR + LP Q +L+ A+ S +P+++VL G + +++A+ + W YPG++G
Sbjct: 644 DRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKMHADAIVAAW--YPGQSG 700
Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
A+A ++ GD NPGGRLP+T+Y + P +M+ GR+YR++ G ++
Sbjct: 701 GTAIARMLAGDDNPGGRLPVTFYRSTKDLPPYVSYDMK--------GRTYRYFKGEPLFP 752
Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
FG+GLSYT ++Y AP T +L+AG N LQ
Sbjct: 753 FGYGLSYTRFAY---DAPQLSTT--TLQAG---NPLQ----------------------- 781
Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
V +V N G G V ++ + P Q +P + L+GF RVH A + ++F +D
Sbjct: 782 VTATVRNTGARAGDEVAQVYLQYPDRPQ-SPLRSLVGFQRVHLAAGEQRTLTFHLD-ARA 839
Query: 776 LSIANKHGRRILPLGNHVLMVG 797
LS ++ G+R + G++ L VG
Sbjct: 840 LSDVDRSGQRAVEAGDYTLFVG 861
>gi|118489157|gb|ABK96385.1| unknown [Populus trichocarpa x Populus deltoides]
Length = 343
Score = 310 bits (793), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 155/344 (45%), Positives = 223/344 (64%), Gaps = 11/344 (3%)
Query: 465 MGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVA 524
M G Y G+ C + L+G+ Y +KT + SGC+DV CN + F+ A A+ AD I+V
Sbjct: 1 MIGNYAGVACGYTTPLQGIRRY-AKTVHLSGCNDVFCNGNQQFNAAEVAARHADATILVM 59
Query: 525 GLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISS 584
GLD + E E RDR LLLPG Q LV+ VAR S+ P ILVL GGP+DVSFA+ D +I +
Sbjct: 60 GLDQSIEAEFRDRKGLLLPGYQQELVSRVARASRGPTILVLMSGGPIDVSFAKNDPRIGA 119
Query: 585 ILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGR 643
ILW+GYPG+AG A+A+++FG NPGG+LPMTWYP + KVPM +M MRAD SR YPGR
Sbjct: 120 ILWVGYPGQAGGAAIADVLFGTANPGGKLPMTWYPHDYLAKVPMTNMGMRADPSRGYPGR 179
Query: 644 SYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS-ASLKAGSDKNILQQTGSRLDYVH 702
+YRFY G V+ FGHG+SYT +++ + AP E+++ ASL ++ + T + +
Sbjct: 180 TYRFYKGPVVFPFGHGMSYTTFAHSLVQAPREVSVPLASL------HVSRNTTGASNAIR 233
Query: 703 IDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG 762
+ +C +L V I V N GD+DG+H +++F+ P T +KQLIGF++VH V
Sbjct: 234 VSH-ANCEALALGVHIDVKNTGDMDGTHTLLVFSSPPGGKWST-QKQLIGFEKVHLVTGS 291
Query: 763 SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
K + + C+ LS+ ++ G R +P G H L +G+L+HS++++
Sbjct: 292 QKRVKIDIHVCKHLSVVDRFGIRRIPNGEHYLYIGDLKHSISLQ 335
>gi|418519424|ref|ZP_13085476.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB2388]
gi|410704868|gb|EKQ63347.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB2388]
Length = 886
Score = 310 bits (793), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 175/449 (38%), Positives = 245/449 (54%), Gaps = 56/449 (12%)
Query: 60 SISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTV 119
S RA +L++ ++ +EK+ Q ++A AIPRLGIPAYEWWSE LHGIA NG
Sbjct: 33 SPDQRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG--------- 83
Query: 120 SSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTFWAPNIN 170
T FPQ + AAS+N SL +G+ V+ EARA +N AGLT W+PNIN
Sbjct: 84 -YATVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSPNIN 142
Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
IFRDPRWGRG ET GEDP + AV F++ QGE+ R +
Sbjct: 143 IFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQGED-----------LNHPRTI------ 185
Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
A KH+ + + G R+ F+ ++ D E T+ P FR+ + +G+A
Sbjct: 186 ----------ATPKHIAVHSGPEPG---RHGFDVDVSPHDVEATYTPAFRAALVEGQAGS 232
Query: 291 IMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
+MC+YN ++G P C L + R +WGFKG++ SDCDAV + ++ + + S+A
Sbjct: 233 VMCAYNALHGTPVCAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA 292
Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGK 409
LKAG D+NCG R +AI +G+V E +D++L+ LF+ + RLG PRK Y +
Sbjct: 293 ALKAGHDLNCGHAY-RELGTAIARGEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYAR 350
Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY 469
LG DV + H+ LAL AA + IVLLKND LPL A + LA+IGP + ++ + Y
Sbjct: 351 LGAKDVDNAAHRALALQAAAESIVLLKNDANTLPL--RAGTRLAVIGPNADALAALEANY 408
Query: 470 TGIPCSPKSLLRGLEAY--VSKTHYASGC 496
G +P + L GL + YA G
Sbjct: 409 QGTSSAPVTPLLGLRQRFGAQQVSYAQGA 437
Score = 115 bits (287), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 81/262 (30%), Positives = 132/262 (50%), Gaps = 44/262 (16%)
Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
DR + LP Q +L+ A+ S +P+++VL G + +++A+ + W YPG++G
Sbjct: 644 DRNDIALPAPQQTLLER-AKASGKPLVVVLMSGSAVALNWAKTHADAIVAAW--YPGQSG 700
Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
A+A ++ GD NPGGRLP+T+Y + ++ A S GR+YR++ G ++
Sbjct: 701 GTAIARMLAGDDNPGGRLPVTFY--------RSTKDLPAYVSYDMKGRTYRYFKGEPLFP 752
Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
FG+GLSYT ++Y AP T +L+AG N LQ
Sbjct: 753 FGYGLSYTRFAY---DAPQLSTT--TLQAG---NPLQ----------------------- 781
Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
V +V N G G V ++ + P Q +P + L+GF RVH A + ++F +D
Sbjct: 782 VTATVRNTGARAGDEVAQVYLQYPDRPQ-SPLRSLVGFQRVHLAAGEQRTLTFHLD-ARA 839
Query: 776 LSIANKHGRRILPLGNHVLMVG 797
LS ++ G+R + G++ L VG
Sbjct: 840 LSDVDRSGQRAVEAGDYTLFVG 861
>gi|390992294|ref|ZP_10262532.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
axonopodis pv. punicae str. LMG 859]
gi|372552957|emb|CCF69507.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
axonopodis pv. punicae str. LMG 859]
Length = 886
Score = 310 bits (793), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 175/449 (38%), Positives = 245/449 (54%), Gaps = 56/449 (12%)
Query: 60 SISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTV 119
S RA +L++ ++ +EK+ Q ++A AIPRLGIPAYEWWSE LHGIA NG
Sbjct: 33 SPEQRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG--------- 83
Query: 120 SSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTFWAPNIN 170
T FPQ + AAS+N SL +G+ V+ EARA +N AGLT W+PNIN
Sbjct: 84 -YATVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSPNIN 142
Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
IFRDPRWGRG ET GEDP + AV F++ QGE+ R +
Sbjct: 143 IFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQGED-----------LNHPRTI------ 185
Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
A KH+ + + G R+ F+ ++ D E T+ P FR+ + +G+A
Sbjct: 186 ----------ATPKHIAVHSGPEPG---RHGFDVDVSPHDVEATYTPAFRAALVEGQAGS 232
Query: 291 IMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
+MC+YN ++G P C L + R +WGFKG++ SDCDAV + ++ + + S+A
Sbjct: 233 VMCAYNALHGTPVCAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA 292
Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGK 409
LKAG D+NCG R +AI +G+V E +D++L+ LF+ + RLG PRK Y +
Sbjct: 293 ALKAGHDLNCGHAY-RELGTAIARGEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYAR 350
Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY 469
LG DV + H+ LAL AA + IVLLKND LPL A + LA+IGP + ++ + Y
Sbjct: 351 LGAKDVDNAAHRALALQAAAESIVLLKNDANTLPL--RAGTRLAVIGPNADALAALEANY 408
Query: 470 TGIPCSPKSLLRGLEAY--VSKTHYASGC 496
G +P + L GL + YA G
Sbjct: 409 QGTSSAPVTPLLGLRQRFGAQQVSYAQGA 437
Score = 117 bits (293), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 82/262 (31%), Positives = 132/262 (50%), Gaps = 44/262 (16%)
Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
DR + LP Q +L+ A+ S +P+++VL G + +++A+ + W YPG++G
Sbjct: 644 DRNDIALPAAQQTLLER-AKASGKPLVVVLMSGSAVALNWAKTHADAIVAAW--YPGQSG 700
Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
A+A ++ GD NPGGRLP+T+Y + ++ A S GR+YR++ G ++
Sbjct: 701 GTAIARMLAGDDNPGGRLPVTFY--------RSTKDLPAYVSYDMKGRTYRYFKGEPLFP 752
Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
FG+GLSYT ++Y AP T +L+AG N LQ
Sbjct: 753 FGYGLSYTRFAY---DAPQLSTT--TLQAG---NPLQ----------------------- 781
Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
V +V N G G V ++ + P Q +P + L+GF RVH A + ++F +D
Sbjct: 782 VTATVRNTGARAGDEVAQVYLQYPDRPQ-SPLRSLVGFQRVHLAAGEQRTLTFNLD-ARA 839
Query: 776 LSIANKHGRRILPLGNHVLMVG 797
LS ++ G+R + GN+ L VG
Sbjct: 840 LSDVDRSGQRAVEAGNYTLFVG 861
>gi|384420163|ref|YP_005629523.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzicola
BLS256]
gi|353463076|gb|AEQ97355.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzicola
BLS256]
Length = 889
Score = 309 bits (792), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 175/453 (38%), Positives = 245/453 (54%), Gaps = 56/453 (12%)
Query: 56 NTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF 115
T + RA L++ ++ +EK+ Q ++A AIPRLGIPAYEWWSE LHGIA NG
Sbjct: 32 TTDATPEQRAADLVAHMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----- 86
Query: 116 NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTFWA 166
T FPQ + AAS+N L +G+ V+ EARA +N AGLT W+
Sbjct: 87 -----YATVFPQAIGLAASWNTHLMQQVGTVVSTEARAKFNQAGRPGKDHKRYAGLTIWS 141
Query: 167 PNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKG 226
PNINIFRDPRWGRG ET GEDP + AV F+ QG++
Sbjct: 142 PNINIFRDPRWGRGMETYGEDPFLTGQMAVGFIHGLQGDDL------------------- 182
Query: 227 FGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQG 286
D A KHL + + G R+ F+ ++ +D E T+ P FR+ I +G
Sbjct: 183 --------DHPRTIATPKHLAVHSGPEPG---RHGFDVDVSPRDVEATYTPAFRAAIVEG 231
Query: 287 KASCIMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHED 345
+A +MC+YN ++G PAC L + R +WGFKG++ SDCDAV + ++ + +
Sbjct: 232 QAGAVMCAYNSLHGTPACAADWLINGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAG 291
Query: 346 SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG 405
S+A LKAG D+NCG R +AI +G+V E +D++L+ LF+ + RLG PRK
Sbjct: 292 SSAAALKAGHDLNCGHAY-RELGTAIARGEVDEALLDQSLVRLFAARYRLGELEA-PRKD 349
Query: 406 KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQM 465
Y +LG DV ++H+ LAL AA + IVLLKN+ LPL A + LA+IGP + ++ +
Sbjct: 350 PYARLGAKDVDNAQHRALALQAAAESIVLLKNNANTLPL--KAGTRLAVIGPNADALAAL 407
Query: 466 GGGYTGIPCSPKSLLRGLEAY--VSKTHYASGC 496
Y G +P + L GL + YA G
Sbjct: 408 EANYQGTSSAPVTPLLGLRQRFGAQQVSYAQGA 440
Score = 114 bits (286), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 80/262 (30%), Positives = 132/262 (50%), Gaps = 44/262 (16%)
Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
DR + LP Q +L+ A+ S +P+++VL G + +++A+ + W YPG++G
Sbjct: 647 DRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKTHADAIVAAW--YPGQSG 703
Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
A+A ++ GD NPGGRLP+T+Y + ++ A S GR+YR++ G ++
Sbjct: 704 GTAIARMLAGDDNPGGRLPVTFY--------RSTKDLPAYVSYDMKGRTYRYFKGEPLFP 755
Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
FG+GLSYT ++Y AP +L+ S +++AGS
Sbjct: 756 FGYGLSYTCFAY---DAP-QLS-STAVQAGST--------------------------LQ 784
Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
V +V N G G V ++ + P Q +P + L+GF RVH A + ++F +D
Sbjct: 785 VTTTVRNTGARAGDEVAQVYLQYPDRPQ-SPLRSLVGFQRVHLAAGEQRTLTFNLD-ARA 842
Query: 776 LSIANKHGRRILPLGNHVLMVG 797
LS + G+R + GN+ L VG
Sbjct: 843 LSDVDPSGQRAVEAGNYTLFVG 864
>gi|325922365|ref|ZP_08184139.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas gardneri ATCC
19865]
gi|325547147|gb|EGD18227.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas gardneri ATCC
19865]
Length = 889
Score = 308 bits (790), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 178/474 (37%), Positives = 253/474 (53%), Gaps = 66/474 (13%)
Query: 35 PVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIP 94
P++ P F K +T+ + RA +L++ ++ +EK+ Q ++A AIPRLGIP
Sbjct: 21 PLIGAPAFAAK----------STNETPEQRAAALVAQMSREEKVAQAMNDAPAIPRLGIP 70
Query: 95 AYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAM 154
AYEWWSE LHGIA NG T FPQ + AAS+N L +G+ V+ EARA
Sbjct: 71 AYEWWSEGLHGIARNG----------YATVFPQAIGLAASWNTQLMQQVGTVVSTEARAK 120
Query: 155 YNLG---------QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGE 205
+N AGLT W+PNINIFRDPRWGRG ET GEDP + AV F++ QG+
Sbjct: 121 FNQAGGPGKDHKRYAGLTIWSPNINIFRDPRWGRGMETYGEDPFLTGQLAVGFIRGLQGD 180
Query: 206 NWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM 265
+ D A KH+ + + G R+SF+
Sbjct: 181 DL---------------------------DHPRTIATPKHIAVHSGPEPG---RHSFDVD 210
Query: 266 ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYIT 324
++ +D E T+ P FR+ + G+A +MC+YN ++G PAC L + R +WGFKG++
Sbjct: 211 VSPRDVEATYTPAFRAALIDGQAGSVMCAYNSLHGTPACAADWLLNGRVRGDWGFKGFVV 270
Query: 325 SDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRA 384
SDCDAV + ++ + + S+A LKAG D+NCG R +AI++G+V E +D++
Sbjct: 271 SDCDAVDDMTQFHYFRPDNAGSSAASLKAGHDLNCGYAY-RALGTAIERGEVDEALLDQS 329
Query: 385 LLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPL 444
L+ LF+ + RLG P K Y LG D+ + ++ LAL AA Q IVLLKND LPL
Sbjct: 330 LVRLFAARYRLGELEA-PHKDPYATLGAKDIDNTANRALALKAAAQSIVLLKNDANTLPL 388
Query: 445 NKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAY--VSKTHYASGC 496
A LA+IGP + ++ + Y G +P + L GL V + YA G
Sbjct: 389 KAGA--RLAVIGPNADALAALEANYQGTSSTPVTPLLGLRQRFGVHQVSYAQGA 440
Score = 115 bits (289), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 79/262 (30%), Positives = 133/262 (50%), Gaps = 44/262 (16%)
Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
DR + LP Q +L+ A+ S +P+++VL G + +++A+ + W YPG++G
Sbjct: 647 DRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKTHADAIVAAW--YPGQSG 703
Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
A+A ++ GD NPGGRLP+T+Y + P +M+ GR+YR++ G ++
Sbjct: 704 GTAIARMLAGDDNPGGRLPVTFYRSTKDLPPYVSYDMK--------GRTYRYFKGEPLFP 755
Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
FG+GLSYT+++Y AP +L+ S +L+AGS
Sbjct: 756 FGYGLSYTSFAY---GAP-QLS-STTLQAGST--------------------------LQ 784
Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
V +V N G G V ++ + P Q +P + L+GF RVH + ++F +D
Sbjct: 785 VTTTVRNTGTRAGDEVAQVYLQYPDRPQ-SPLRSLVGFQRVHLKPGEQRTLTFTLD-ARA 842
Query: 776 LSIANKHGRRILPLGNHVLMVG 797
LS ++ G+R + G++ L VG
Sbjct: 843 LSDVDRTGQRAVEAGDYTLFVG 864
>gi|256394020|ref|YP_003115584.1| glycoside hydrolase family 3 [Catenulispora acidiphila DSM 44928]
gi|256360246|gb|ACU73743.1| glycoside hydrolase family 3 domain protein [Catenulispora
acidiphila DSM 44928]
Length = 1212
Score = 308 bits (788), Expect = 9e-81, Method: Compositional matrix adjust.
Identities = 248/808 (30%), Positives = 370/808 (45%), Gaps = 135/808 (16%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDN-ASAIPRLGIPAYEWWSESLHGIASNGPG 112
+ T S + RA L+S +TL EK+ QL N A AIPRLG+ +Y +WSE HGI G
Sbjct: 45 YLQTRYSFAERAADLVSRMTLPEKVAQLHTNSAPAIPRLGVQSYTYWSEGQHGINLLGAD 104
Query: 113 VNFNGTVSS--VTSFPQVLVSAASFNRSLWSNIGSAVAVEAR----------AMYNLGQA 160
N G TSFP L S S++ +L +A++ E R A N+G +
Sbjct: 105 SNNGGAAGGPHATSFPTNLSSTMSWDPALVYQETTAISDEVRGELDKSLWGVAQNNIGPS 164
Query: 161 G-----LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
LT+WAP +N+ RDPRWGR E GEDP +V A FV +QGE + DG
Sbjct: 165 ADDYGSLTYWAPTVNMDRDPRWGRTDEAFGEDPYLVGKMAGAFVAGYQGE---TIDG--- 218
Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTF 275
+ L ++A KH + E R++ +A +E D D +
Sbjct: 219 ---------------TPTSPYLKVAATAKHFALNNNEN----DRHADSADASESDIRDYY 259
Query: 276 QPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK---ARNEWGFKGYITSDCDAVAT 332
FRS +E + +M SYN +NG P+ D + A+ WGF GYITSDC AV
Sbjct: 260 TAQFRSLVEDSHVAGLMTSYNAINGTPSP--ADTYTTDALAQRTWGFDGYITSDCGAVGD 317
Query: 333 I--------------FEYQNYTKTHEDSAAGV------------LKAGMDINC--GTCML 364
+ N T T ++A GV L+AG D NC G L
Sbjct: 318 VTASSSHDWAPPGWTVSVVNGTSTWTNTATGVQVPADAGGQAYALRAGTDANCTGGDATL 377
Query: 365 RHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDP-RKGKYGKLGPDDVCTSEHKKL 423
+ ++AI + E ID AL+ LF+V+++ G F DP K Y ++ + + EH+ L
Sbjct: 378 GNIEAAIKAEILSEGVIDHALVQLFTVRMQTGEF--DPANKVAYTRITKAQIQSPEHQAL 435
Query: 424 ALDAARQGIVLLKND------KKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPK 477
A A +VLLKND K LP N +++++ ++G L N ++ GGY+G P
Sbjct: 436 AEKVAANSLVLLKNDPMPGSAAKVLPANPASLNNVVVVGDLANTVTL--GGYSGDPTLQV 493
Query: 478 SLLRGLEAYVSKTH-YASGCHDVPCNSDAGFHEAVRIA------KKADFVIVVAGLDLTQ 530
+ ++G+ + V + A+ D S A A K AD V+V G D +
Sbjct: 494 NAVQGITSAVKAANPNATVTFDACGTSTTATAAASCSAATQAAIKTADLVVVFVGTDGST 553
Query: 531 ETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGY 590
E DR SL +PG SL++ VA +L + GP+D+ + D +I++ Y
Sbjct: 554 AGESNDRASLAMPGNYDSLISQVAALGNPRTVLSMQTDGPVDIENVKGD--FPAIVYSAY 611
Query: 591 PGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMR-ADSSRQYPGRSYRFYT 649
GE+ ALA+++FG NP G L TWY + + + + AD+ GR+Y+++T
Sbjct: 612 NGESQGTALADVLFGKQNPSGHLDFTWYKDDSQLPSIKNYGLNPADTGGL--GRTYQYFT 669
Query: 650 GTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSC 709
GT Y FG+GLSYT+++Y + A H D
Sbjct: 670 GTPTYPFGYGLSYTDFAYSKVQATD---------------------------HADAQGKA 702
Query: 710 TSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT--PEKQLIGFDRVHTVAKG-SKEI 766
T V+ VTN G G+ V L+ P V GT P +QL GF + + G ++ +
Sbjct: 703 T-----VRFDVTNTGKTPGATVAQLYITPPSV-PGTQQPAEQLEGFAKTAVLKPGQTQHL 756
Query: 767 SFGVDPCEQLSIANKHGRRILPLGNHVL 794
S V+ + + ++ + + G + L
Sbjct: 757 SVSVNIADLATWDAQNAKNAVTDGTYTL 784
>gi|418518029|ref|ZP_13084183.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB1386]
gi|410705279|gb|EKQ63755.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB1386]
Length = 886
Score = 306 bits (784), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 173/449 (38%), Positives = 243/449 (54%), Gaps = 56/449 (12%)
Query: 60 SISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTV 119
S RA +L++ ++ +EK+ Q ++A AIPRLGIPAYEWWSE LHGIA NG
Sbjct: 33 SPEQRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG--------- 83
Query: 120 SSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTFWAPNIN 170
T FPQ + AAS+N SL +G+ V+ EARA +N AGLT W+PNIN
Sbjct: 84 -YATVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSPNIN 142
Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
IFRDPRWGRG ET GEDP + AV F++ QGE+ R +
Sbjct: 143 IFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQGED-----------LNHPRTI------ 185
Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
A KH+ + + G R+ F+ ++ D E T+ P FR+ + +G+A
Sbjct: 186 ----------ATPKHIAVHSGPEPG---RHGFDVDVSPHDVEATYTPAFRAALVEGQAGS 232
Query: 291 IMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
+MC+YN ++G P C L + R +WGFKG++ SDCDA+ + ++ + + S+
Sbjct: 233 VMCAYNALHGTPVCAADWLLNGRVRGDWGFKGFVVSDCDAIDDMTQFHYFRPDNAGSSVA 292
Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGK 409
LKAG D+NCG R +AI +G+V E +D++L+ LF+ + RLG PRK Y +
Sbjct: 293 ALKAGHDLNCGHA-YRELGTAIARGEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYAR 350
Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY 469
LG DV H+ LAL AA + IVLLKND LPL A + LA+IGP + ++ + Y
Sbjct: 351 LGAKDVDNVAHRALALQAAAESIVLLKNDANTLPL--RAGTRLAVIGPNADALAALEANY 408
Query: 470 TGIPCSPKSLLRGLEAY--VSKTHYASGC 496
G +P + L GL + YA G
Sbjct: 409 QGTSSAPVTPLLGLRQRFGAQQVSYAQGA 437
Score = 114 bits (284), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 81/262 (30%), Positives = 132/262 (50%), Gaps = 44/262 (16%)
Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
DR + LP Q +L+ A+ S +P+++VL G + +++A+ + W YPG++G
Sbjct: 644 DRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKTHADAIMAAW--YPGQSG 700
Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
A+A ++ GD NPGGRLP+T+Y + ++ A S GR+YR++ G ++
Sbjct: 701 GTAIARMLAGDDNPGGRLPVTFY--------RSTKDLPAYVSYDMKGRTYRYFKGEPLFP 752
Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
FG+GLSYT ++Y AP T +L+AG N LQ
Sbjct: 753 FGYGLSYTRFAY---DAPQLSTT--TLQAG---NPLQ----------------------- 781
Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
V +V N G G V ++ + P Q +P + L+GF RVH A + ++F +D
Sbjct: 782 VIATVRNTGARAGDEVAQVYLQYPDRPQ-SPLRSLVGFQRVHLAAGEQRTLTFHLD-ARA 839
Query: 776 LSIANKHGRRILPLGNHVLMVG 797
LS ++ G+R + G++ L VG
Sbjct: 840 LSDVDRSGQRAVEAGDYTLFVG 861
>gi|21232323|ref|NP_638240.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
str. ATCC 33913]
gi|21114093|gb|AAM42164.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
str. ATCC 33913]
Length = 888
Score = 306 bits (783), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 171/445 (38%), Positives = 245/445 (55%), Gaps = 56/445 (12%)
Query: 64 RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
RA +L++ ++ +EK+ Q + A AIPRLGIPAYEWWSE LHGIA NG T
Sbjct: 39 RAAALVAQMSREEKVAQAMNAAPAIPRLGIPAYEWWSEGLHGIARNG----------YAT 88
Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTFWAPNINIFRD 174
FPQ + AAS+N L +G+ V+ EARA +N AGLT W+PNINIFRD
Sbjct: 89 VFPQAIGLAASWNTQLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFRD 148
Query: 175 PRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRG 234
PRWGRG ET GEDP + AV F++ QG++ R +
Sbjct: 149 PRWGRGMETYGEDPFLTGQLAVGFIRGLQGDD-----------LEHPRTI---------- 187
Query: 235 DELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCS 294
A KH+ + + G R+ F+ ++ +D E T+ P FR+ + +G+A +MC+
Sbjct: 188 ------ATPKHIAVHSGPEPG---RHGFDVDVSPRDVEATYTPAFRAALVEGQAGSVMCA 238
Query: 295 YNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKA 353
YN ++G PAC L + R +WGFKG++ SDCDAV + ++ + + S+A LKA
Sbjct: 239 YNSLHGTPACAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAASLKA 298
Query: 354 GMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPD 413
G D+NCGT R +AI++G+V E +D++L+ LF+ + RLG PRK +Y +LG
Sbjct: 299 GHDLNCGTAY-RALGTAIERGEVDEALLDQSLVRLFAARYRLGELQA-PRKDRYARLGAK 356
Query: 414 DVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIP 473
D+ + ++ LAL AA + IVLLKN LPL A + LA+IGP + ++ + Y G
Sbjct: 357 DIDNAGNRALALQAAAESIVLLKNANATLPL--KAGTRLAVIGPNADALAALEANYQGTS 414
Query: 474 CSPKSLLRGLEAY--VSKTHYASGC 496
P + L GL + YA G
Sbjct: 415 SQPVTPLLGLRQRFGAQQVRYAQGA 439
Score = 113 bits (283), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 79/262 (30%), Positives = 131/262 (50%), Gaps = 44/262 (16%)
Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
DR + LP Q +L+ A+ S +P+++VL G + +++A+ + W YPG++G
Sbjct: 646 DRNDIALPAAQQALLER-AKASGKPLVVVLMSGSAVALNWAKTHADAIVAAW--YPGQSG 702
Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
A+A + GD NPGGRLP+T+Y + P +M+ GR+YR++ G ++
Sbjct: 703 GTAIARALAGDDNPGGRLPVTFYRSTKDLPPYVSYDMK--------GRTYRYFKGEALFP 754
Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
FG+GLSYT+++Y AP +L+ S +L+AGS
Sbjct: 755 FGYGLSYTSFAY---DAP-QLS-STTLQAGSP--------------------------LQ 783
Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
V +V N G G V ++ + P Q +P + L+GF RVH + ++F +D
Sbjct: 784 VTTTVRNTGTRAGDEVAQVYLQYPDRPQ-SPLRSLVGFQRVHLQPGEQRTLTFTLD-ARA 841
Query: 776 LSIANKHGRRILPLGNHVLMVG 797
LS ++ G R + G++ L VG
Sbjct: 842 LSDVDRTGTRAVEAGDYRLFVG 863
>gi|188990656|ref|YP_001902666.1| beta-glucosidase [Xanthomonas campestris pv. campestris str. B100]
gi|167732416|emb|CAP50610.1| exported beta-glucosidase [Xanthomonas campestris pv. campestris]
Length = 888
Score = 305 bits (782), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 171/445 (38%), Positives = 245/445 (55%), Gaps = 56/445 (12%)
Query: 64 RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
RA +L++ ++ +EK+ Q + A AIPRLGIPAYEWWSE LHGIA NG T
Sbjct: 39 RAAALVAQMSREEKVAQAMNAAPAIPRLGIPAYEWWSEGLHGIARNG----------YAT 88
Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTFWAPNINIFRD 174
FPQ + AAS+N L +G+ V+ EARA +N AGLT W+PNINIFRD
Sbjct: 89 VFPQAIGLAASWNTQLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFRD 148
Query: 175 PRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRG 234
PRWGRG ET GEDP + AV F++ QG++ R +
Sbjct: 149 PRWGRGMETYGEDPFLTGQLAVGFIRGLQGDD-----------LEHPRTI---------- 187
Query: 235 DELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCS 294
A KH+ + + G R+ F+ ++ +D E T+ P FR+ + +G+A +MC+
Sbjct: 188 ------ATPKHIAVHSGPEPG---RHGFDVDVSPRDVEATYTPAFRAALVEGQAGSVMCA 238
Query: 295 YNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKA 353
YN ++G PAC L + R +WGFKG++ SDCDAV + ++ + + S+A LKA
Sbjct: 239 YNSLHGTPACAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAASLKA 298
Query: 354 GMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPD 413
G D+NCGT R +AI++G+V E +D++L+ LF+ + RLG PRK +Y +LG
Sbjct: 299 GHDLNCGTAY-RALGTAIERGEVDEALLDQSLVRLFAARYRLGELQA-PRKDRYARLGAK 356
Query: 414 DVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIP 473
D+ + ++ LAL AA + IVLLKN LPL A + LA+IGP + ++ + Y G
Sbjct: 357 DIDNAGNRALALQAAAESIVLLKNANATLPL--KAGTRLAVIGPNADALAALEANYQGTS 414
Query: 474 CSPKSLLRGLEAY--VSKTHYASGC 496
P + L GL + YA G
Sbjct: 415 SQPVTPLLGLRQRFGAQQVRYAQGA 439
Score = 115 bits (288), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 76/262 (29%), Positives = 131/262 (50%), Gaps = 44/262 (16%)
Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
DR + LP Q +L+ A+ S +P+++VL G + +++A+ + W YPG++G
Sbjct: 646 DRNDIALPAAQQALLER-AKASGKPLVVVLMSGSAVALNWAKTHADAIVAAW--YPGQSG 702
Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
A+A + GD NPGGRLP+T+Y + P +M+ GR+YR++ G ++
Sbjct: 703 GTAIARALAGDDNPGGRLPVTFYRSTKDLPPYVSYDMK--------GRTYRYFKGEALFP 754
Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
FG+GLSYT ++Y+ L+++ +L+AGS
Sbjct: 755 FGYGLSYTRFAYET----PRLSVT-TLQAGSP--------------------------LQ 783
Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
V +V N G+ G V ++ + P Q +P + L+GF RVH + ++F +D
Sbjct: 784 VTTTVRNTGERAGDEVAQVYLQYPDRPQ-SPLRSLVGFQRVHLQPGEQRTLTFTLD-ARA 841
Query: 776 LSIANKHGRRILPLGNHVLMVG 797
LS ++ G R++ G++ L VG
Sbjct: 842 LSDVDRTGTRVVEAGDYRLFVG 863
>gi|384428895|ref|YP_005638255.1| beta-glucosidase [Xanthomonas campestris pv. raphani 756C]
gi|341937998|gb|AEL08137.1| beta-glucosidase [Xanthomonas campestris pv. raphani 756C]
Length = 888
Score = 305 bits (782), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 170/445 (38%), Positives = 245/445 (55%), Gaps = 56/445 (12%)
Query: 64 RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
RA +L++ ++ +EK+ Q + A AIPRLGIPAYEWW+E LHGIA NG T
Sbjct: 39 RAAALVAQMSREEKVAQAMNAAPAIPRLGIPAYEWWNEGLHGIARNG----------YAT 88
Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTFWAPNINIFRD 174
FPQ + AAS+N L +G+ V+ EARA +N AGLT W+PNINIFRD
Sbjct: 89 VFPQAIGLAASWNTQLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFRD 148
Query: 175 PRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRG 234
PRWGRG ET GEDP + AV F++ QG++ R +
Sbjct: 149 PRWGRGMETYGEDPFLTGQLAVGFIRGLQGDD-----------LEHPRTI---------- 187
Query: 235 DELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCS 294
A KH+ + + G R+ F+ ++ +D E T+ P FR+ + +G+A +MC+
Sbjct: 188 ------ATPKHIAVHSGPEPG---RHGFDVDVSPRDVEATYTPAFRAALVEGQAGSVMCA 238
Query: 295 YNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKA 353
YN ++G PAC L + R +WGFKG++ SDCDAV + ++ + + S+A LKA
Sbjct: 239 YNSLHGTPACAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKA 298
Query: 354 GMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPD 413
G D+NCGT R +AI++G+V E +D++L+ LF+ + RLG PRK +Y +LG
Sbjct: 299 GHDLNCGTAY-RALGTAIERGEVDEALLDQSLVRLFAARYRLGELQA-PRKDRYARLGAK 356
Query: 414 DVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIP 473
D+ + ++ LAL AA + IVLLKN LPL A + LA+IGP + ++ + Y G
Sbjct: 357 DIDNAGNRALALQAAAESIVLLKNANATLPL--KASTRLAVIGPNADALAALEANYQGTS 414
Query: 474 CSPKSLLRGLEAY--VSKTHYASGC 496
P + L GL + YA G
Sbjct: 415 SQPVTPLLGLRQRFGAQQVRYAQGA 439
Score = 115 bits (287), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 81/292 (27%), Positives = 139/292 (47%), Gaps = 54/292 (18%)
Query: 516 KADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVL 565
++D V+ GL E E+ DR + LP Q +L+ A+ S +P+++VL
Sbjct: 616 QSDAVVAFVGLSPDVEGEELRIDVPGFDGGDRNDIALPAAQQALLER-AKASGKPLVVVL 674
Query: 566 TGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV 625
G + +++A+ + W YPG++G A+A + GD NPGGRLP+T+Y +
Sbjct: 675 MSGSAVALNWAKTHADAIVAAW--YPGQSGGTAIARALAGDDNPGGRLPVTFYRSTKDLP 732
Query: 626 PMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAG 685
P +M+ GR+YR++ G ++ FG+GLSYT ++Y+ + + +L+AG
Sbjct: 733 PYVSYDMK--------GRTYRYFKGEALFPFGYGLSYTRFAYE-----TPRLSATTLQAG 779
Query: 686 SDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT 745
S V +V N G+ G V ++ + P+ Q +
Sbjct: 780 SP--------------------------LQVTTTVRNTGERAGDEVAQVYLQYPERPQ-S 812
Query: 746 PEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
P + L+GF RVH + ++F +D LS ++ G R + G++ L VG
Sbjct: 813 PLRSLVGFQRVHLQPGEQRTLTFTLD-ARALSDVDRTGTRAVEAGDYRLFVG 863
>gi|256393466|ref|YP_003115030.1| glycoside hydrolase family 3 [Catenulispora acidiphila DSM 44928]
gi|256359692|gb|ACU73189.1| glycoside hydrolase family 3 domain protein [Catenulispora
acidiphila DSM 44928]
Length = 1343
Score = 305 bits (782), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 234/776 (30%), Positives = 350/776 (45%), Gaps = 133/776 (17%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDN-ASAIPRLGIPAYEWWSESLHGIASNGPG 112
+ +T S + RA L+S +TL EK QL N A AIPRLG+ Y +WSE HG+ + G
Sbjct: 49 YLDTHYSFAERAADLVSRMTLPEKAAQLQTNSAPAIPRLGVQEYTYWSEGQHGVNTLGAD 108
Query: 113 VNFNGTVSSV--TSFPQVLVSAASFNRSLWSNIGSAVAVEAR-----AMYNLGQ------ 159
N V TSFP + S++ +L +AV+ E R +++ GQ
Sbjct: 109 SNRGDVTGGVHATSFPVNFAATMSWDPALTYKETTAVSDEVRGFLDKSLWGTGQNNLGPS 168
Query: 160 ----AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
LTFWAPN+N+ RDP WGR E+ GEDP + S A FV +QG++
Sbjct: 169 ASDYGALTFWAPNVNMDRDPLWGRTNESFGEDPYLTSTMAGAFVDGYQGQSMT------- 221
Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTF 275
G ++ LK ++A KH ++E SR++ ++ T+ + D +
Sbjct: 222 -GQQQTPYLK-------------VAATAKHYSLNNIED----SRHTGSSDTTDANIRDYY 263
Query: 276 QPPFRSCIEQGKASCIMCSYNQVNGVPACLR----GDLFQKARNEWGFKGYITSDCDAVA 331
F S + S IM SYN VNG P+ +L Q +GF GY TSDC A+
Sbjct: 264 TKQFASLVRDAHVSGIMTSYNAVNGTPSPADTYTVDELLQA---TYGFAGYTTSDCGAIG 320
Query: 332 TIFE------------------YQNYTKTHEDSAAG----VLKAGMDINC--GTCMLRHT 367
++ N T +AAG ++AG +NC G ++
Sbjct: 321 DVYGAASHGWAPPGWTSNGTSWTNNATGRQISAAAGGQAFAIRAGTQLNCAGGEMTAQNI 380
Query: 368 QSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPR-KGKYGKLGPDDVCTSEHKKLALD 426
+AID G + +D L LF+V++ G F DP K Y K+ D + + H+ LA
Sbjct: 381 SAAIDLGLLSNGVVDATLTRLFTVRMETGEF--DPAGKVGYTKITKDQIESPAHQALAEQ 438
Query: 427 AARQGIVLLKN------DKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLL 480
A IVLL+N K LP++ S+ I+G L N ++ GGY+G P + +
Sbjct: 439 VAANDIVLLQNGAVSGTSAKLLPVDPAKTDSVVIVGDLANKVTL--GGYSGEPTHEVNAV 496
Query: 481 RGLEAYV-----SKTHYASGCH-----DVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQ 530
+G+ A V S T C P + A A+ K A V+VVAG DL+
Sbjct: 497 QGITAAVQAANPSATVTFDACGTGTQITTPASCSAATQAAI---KSASLVLVVAGSDLSV 553
Query: 531 ETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGY 590
E DR +L LPG SL++ V+ LV+ GP D+ A+ D +I++ GY
Sbjct: 554 ADEANDRSTLALPGNYDSLISQVSALGNPRTALVMQADGPYDIQDAQKD--FPAIVFSGY 611
Query: 591 PGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTG 650
G++ ALA+++FG NP G L TWY PM++ + + GR+Y+++TG
Sbjct: 612 NGQSQGTALAQVLFGQQNPAGHLDFTWYSGDSQLAPMDNYGLTPSQTGGL-GRTYQYFTG 670
Query: 651 TQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCT 710
T Y FG+G SY++++Y +V + +
Sbjct: 671 TPTYPFGYGQSYSSFAYS-------------------------------HVQVGPQNTNA 699
Query: 711 SLRFHVQISVTNAGDVDGSHVVMLFARVPKV-SQGTPEKQLIGFDRVHTVAKGSKE 765
HV V N G V G+ V L+A P + T +QL GF + +T+ G +
Sbjct: 700 DGTVHVSFDVKNTGTVAGTTVAQLYAAPPGAGTNDTTREQLAGFQKTNTLKPGQSQ 755
>gi|66767544|ref|YP_242306.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
str. 8004]
gi|66572876|gb|AAY48286.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
str. 8004]
Length = 888
Score = 305 bits (781), Expect = 7e-80, Method: Compositional matrix adjust.
Identities = 171/445 (38%), Positives = 245/445 (55%), Gaps = 56/445 (12%)
Query: 64 RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
RA +L++ ++ +EK+ Q + A AIPRLGIPAYEWWSE LHGIA NG T
Sbjct: 39 RAAALVAQMSREEKVAQSMNAAPAIPRLGIPAYEWWSEGLHGIARNG----------YAT 88
Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTFWAPNINIFRD 174
FPQ + AAS+N L +G+ V+ EARA +N AGLT W+PNINIFRD
Sbjct: 89 VFPQAIGLAASWNTQLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFRD 148
Query: 175 PRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRG 234
PRWGRG ET GEDP + AV F++ QG++ R +
Sbjct: 149 PRWGRGMETYGEDPFLTGQLAVGFIRGLQGDD-----------LEHPRTI---------- 187
Query: 235 DELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCS 294
A KH+ + + G R+ F+ ++ +D E T+ P FR+ + +G+A +MC+
Sbjct: 188 ------ATPKHIAVHSGPEPG---RHGFDVDVSPRDVEATYTPAFRAALVEGQAGSVMCA 238
Query: 295 YNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKA 353
YN ++G PAC L + R +WGFKG++ SDCDAV + ++ + + S+A LKA
Sbjct: 239 YNSLHGTPACAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAASLKA 298
Query: 354 GMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPD 413
G D+NCGT R +AI++G+V E +D++L+ LF+ + RLG PRK +Y +LG
Sbjct: 299 GHDLNCGTAY-RALGTAIERGEVDEALLDQSLVRLFAARYRLGELQA-PRKDRYARLGAK 356
Query: 414 DVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIP 473
D+ + ++ LAL AA + IVLLKN LPL A + LA+IGP + ++ + Y G
Sbjct: 357 DIDNAGNRALALQAAAESIVLLKNANATLPL--KAGTRLAVIGPNADALAALEANYQGTS 414
Query: 474 CSPKSLLRGLEAY--VSKTHYASGC 496
P + L GL + YA G
Sbjct: 415 SQPVTPLLGLRQRFGAQQVRYAQGA 439
Score = 113 bits (283), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 79/262 (30%), Positives = 131/262 (50%), Gaps = 44/262 (16%)
Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
DR + LP Q +L+ A+ S +P+++VL G + +++A+ + W YPG++G
Sbjct: 646 DRNDIALPAAQQALLER-AKASGKPLVVVLMSGSAVALNWAKTHADAIVAAW--YPGQSG 702
Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
A+A + GD NPGGRLP+T+Y + P +M+ GR+YR++ G ++
Sbjct: 703 GTAIARALAGDDNPGGRLPVTFYRSTKDLPPYVSYDMK--------GRTYRYFKGEALFP 754
Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
FG+GLSYT+++Y AP +L+ S +L+AGS
Sbjct: 755 FGYGLSYTSFAY---DAP-QLS-STTLQAGSP--------------------------LQ 783
Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
V +V N G G V ++ + P Q +P + L+GF RVH + ++F +D
Sbjct: 784 VTTTVRNTGTRAGDEVAQVYLQYPDRPQ-SPLRSLVGFQRVHLQPGEQRTLTFTLD-ARA 841
Query: 776 LSIANKHGRRILPLGNHVLMVG 797
LS ++ G R + G++ L VG
Sbjct: 842 LSDVDRTGTRAVEAGDYRLFVG 863
>gi|443717728|gb|ELU08656.1| hypothetical protein CAPTEDRAFT_228276 [Capitella teleta]
Length = 731
Score = 304 bits (778), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 233/756 (30%), Positives = 348/756 (46%), Gaps = 116/756 (15%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQE----KIQQLSDNASAIPRLGIPAYEWWSESLHGIA 107
+PF + +LS R L+ LT++E + Q + + RLG+ Y++ +E + G+
Sbjct: 20 FPFEDVTLSWDKRVDDLVQRLTIEEVVNISVAQYGKSTIPVDRLGVKPYQFINECITGV- 78
Query: 108 SNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN-------LGQA 160
+ T+FPQ + ASF+ L N+ A+A E R YN G
Sbjct: 79 ----------RWENSTAFPQAIGLGASFSPDLAFNMSQAIARELRGFYNTEVKSQIYGHR 128
Query: 161 GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFRE 220
G+ + P INI R P WGR QET GEDP + +V FVK QG
Sbjct: 129 GVNCFTPVINIMRHPLWGRNQETYGEDPWLSGQLSVGFVKGLQG---------------- 172
Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFR 280
D + S CKH ++ + SR+ F+A ++E+D TF P F+
Sbjct: 173 -----------DHPRYIQASGGCKHFDVHNGPENIPVSRFGFDAKVSERDWRMTFLPQFK 221
Query: 281 SCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNY 339
+C+E G + IMCSYN++NGVPAC L R EWGF GY+ SD A+ I + Y
Sbjct: 222 TCVEAGSIN-IMCSYNRINGVPACANKKLLTDILRKEWGFNGYVISDSGAIENIVYHHKY 280
Query: 340 TKTHEDSAAGVLKAGMDINC-----GTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
TKT ++AA +KAG ++ + +A+ + + E+++ L ++R
Sbjct: 281 TKTLAEAAADSVKAGCNVELTGATGSGVAYFNLLNAVKQNLISEEELRENLKKPMYSRMR 340
Query: 395 LGLFNGDP-RKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
G F DP + K+ V + EH+ LA+ A+ VL+KN + LPL K LA
Sbjct: 341 QGEF--DPVDMNPFTKIDMSVVLSQEHQDLAVKASAMSFVLMKNLNRVLPLKKR-FDRLA 397
Query: 454 IIGPLVNNISQMGGGYTGIP-CSPKSL---LRGLEAYVSKTHYASGCHDVPC-NSDAGFH 508
IIGP +N + G Y IP PK + GL++ YASGC D C N D
Sbjct: 398 IIGPFADNAETLFGDY--IPNWDPKFVSTPYEGLKSLGDDVRYASGCDDPSCTNYDPKAI 455
Query: 509 EAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKR-PVILVLTG 567
E + K A FV V G+ E E DR L LPG Q+ ++ S+ P++LVL
Sbjct: 456 E--KAVKGAQFVFVCLGVGSNLEREGHDRADLDLPGYQLQILKDAEFFSREAPLVLVLFN 513
Query: 568 GGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFG---DFNPGGRLPMTWYPESFTK 624
GP+D+++ + ++ I+ YP KAL +++ D P RLP TW P +
Sbjct: 514 AGPVDLTWPKLSPEVDGIIECFYPAMGTGKALYQVVTATGDDGVPAARLPSTW-PAQLHQ 572
Query: 625 VP-MNDMNMRADSSRQYPGRSYRFYTGTQ-VYGFGHGLSYTNYSYKFLSAPSELTISASL 682
VP + D NM G +YR++ G +Y FG+GLSYT++ Y+ +S S+
Sbjct: 573 VPSITDYNMT--------GHTYRYFDGGDPLYPFGYGLSYTSFHYQTVSVS-----PTSV 619
Query: 683 KAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVS 742
+AG + + Q +R Y NA +V ++ + A VP
Sbjct: 620 RAGGNVTVTVQVLNRGPY---------------------NADEVTQVYMSWMEATVP--- 655
Query: 743 QGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
P L+GF R S +SF V EQ+++
Sbjct: 656 --VPRWTLVGFKRHRHTVNQSSSLSFVVS-AEQMAV 688
>gi|325914134|ref|ZP_08176487.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
ATCC 35937]
gi|325539637|gb|EGD11280.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
ATCC 35937]
Length = 874
Score = 304 bits (778), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 172/454 (37%), Positives = 247/454 (54%), Gaps = 56/454 (12%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
+ S + RA +L++ ++ EK+ Q ++A AIPRL IPAYEWWSE LHGIA NG
Sbjct: 16 ADRSGTPEQRAAALVAQMSRDEKVAQAMNDAPAIPRLDIPAYEWWSEGLHGIARNG---- 71
Query: 115 FNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTFW 165
T FPQ + AAS+N +L +G+ V+ EARA +N AGLT W
Sbjct: 72 ------YATVFPQAIGLAASWNTALMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIW 125
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
+PNINIFRDPRWGRG ET GEDP + AV F++ QG++ R +
Sbjct: 126 SPNINIFRDPRWGRGMETYGEDPFLTGQLAVGFIRGLQGDD-----------LNHPRTI- 173
Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
A KH+ + + G R+ F+ ++ +D E T+ P FR+ +
Sbjct: 174 ---------------ATPKHIAVHSGPEPG---RHGFDVDVSPRDMEATYTPAFRAALVD 215
Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
G+A +MC+YN ++G PAC L + R +WGFKG++ SDCDAV + ++ + +
Sbjct: 216 GQAWSVMCAYNSLHGTPACAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNA 275
Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
S+A LKAG D+NCG R +AI++G+V E +D++L+ LF+ + RLG PRK
Sbjct: 276 GSSAAALKAGHDLNCGHAY-RELGTAIERGEVDEALLDQSLVRLFAARYRLGELEA-PRK 333
Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
Y +LG DV + H+ LAL AA + IVLLKN LPL A + LA+IGP + ++
Sbjct: 334 DPYARLGAKDVDNAAHRALALQAAAESIVLLKNTATTLPL--KAGTRLAVIGPNADALAA 391
Query: 465 MGGGYTGIPCSPKSLLRGLEAY--VSKTHYASGC 496
+ Y G +P + L GL + + YA G
Sbjct: 392 LEANYQGTSATPITPLLGLRQHFGAQQVRYAQGA 425
Score = 114 bits (286), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 87/292 (29%), Positives = 143/292 (48%), Gaps = 54/292 (18%)
Query: 516 KADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVL 565
++D V+ GL E E+ DR + LP Q +L+ A+ S +P+++VL
Sbjct: 602 QSDAVVAFVGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVL 660
Query: 566 TGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV 625
G + +++A+A++ W YPG++G A+A + GD NPGGRLP+T+Y
Sbjct: 661 MSGSAVALNWAKANADAIVAAW--YPGQSGGTAIARALAGDDNPGGRLPVTFY------- 711
Query: 626 PMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAG 685
+ ++ A S GR+YR++ G ++ FG+GLSYT+++Y AP T +L+AG
Sbjct: 712 -RSTKDLPAYVSYDMKGRTYRYFKGEPLFPFGYGLSYTSFAY---DAPRLST--RTLQAG 765
Query: 686 SDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT 745
N LQ V +V N G G V ++ + P Q +
Sbjct: 766 ---NPLQ-----------------------VTTTVRNTGSRAGDEVAQVYLQYPDRPQ-S 798
Query: 746 PEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
P + L+GF RVH +E++F +D LS ++ G+R + G + + VG
Sbjct: 799 PLRSLVGFQRVHLKPGEQRELTFTLD-ARALSDVDRSGQRAVEAGEYRVFVG 849
>gi|167524198|ref|XP_001746435.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163775197|gb|EDQ88822.1| predicted protein [Monosiga brevicollis MX1]
Length = 834
Score = 302 bits (774), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 235/762 (30%), Positives = 349/762 (45%), Gaps = 118/762 (15%)
Query: 46 PPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDN-------ASAIPRLGIPAYEW 98
PP YPF N L + R L+ LTL+EK+QQL A A+ RLGI + W
Sbjct: 31 PPE---YPFRNPDLPWAARLDDLVGRLTLEEKLQQLQHGGAAQMTPAPAVERLGIGPFVW 87
Query: 99 WSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
SE + G+ ++G T+FPQ L AA+F+ +L +A+E RA N
Sbjct: 88 GSECVTGLGTDG-------NDPHGTAFPQPLGMAATFDPALLKRAAGTIALELRAQRNFD 140
Query: 159 QA--------GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSD 210
+ GL+ W+P +NI R P WGR ET GE P++ S A FV+ QG + +
Sbjct: 141 RENGVVKFHHGLSCWSPVVNINRHPLWGRNDETFGECPVLSSFMARSFVEGIQGNHTRY- 199
Query: 211 DGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQD 270
+A CKHL D+ + RY F+A +++ D
Sbjct: 200 --------------------------YAAAAACKHL---DVYGGPDNLRYVFDADVSQAD 230
Query: 271 TEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC--LRGDLFQKARNEWGFKGYITSDCD 328
TF F C G MCSYN + GVPAC R F AR +WGF+GY+ SD
Sbjct: 231 LTGTFLMAFEECAAAGVMG-YMCSYNSIRGVPACANYRTMTF-FAREQWGFEGYVVSDQG 288
Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDI----NCGTCMLRHTQSAIDKGKVQEKDIDRA 384
AV I E NYT A L AG D+ + + A+D ID +
Sbjct: 289 AVFRITESHNYTANQTLGAVAALNAGCDMEDSDDAQHVAYYNLSLALDLKLTDMATIDAS 348
Query: 385 LLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPL 444
+ LF V++RLG F+ P + L V + H ++A D A IVLLKN + LPL
Sbjct: 349 VSRLFYVRMRLGEFD-PPENDPWRSLNMSIVSSPAHVEMARDVATASIVLLKNQNETLPL 407
Query: 445 NKNAV-SSLAIIGPLVNNISQMGGGY-----TGIPCSPKSLLRGLEAYVSKT---HYASG 495
+ A +S ++GP +N M G Y T + + ++ L S+T Y G
Sbjct: 408 SAAAKNASYCLLGPFADNADLMMGKYSPHGSTNVTVTYRAGLAAALQNASQTASFQYLEG 467
Query: 496 CHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSV-- 553
C C+ I + D V++ G E+E DR ++ PG Q +LV +V
Sbjct: 468 CTGPFCDGLDTAAVTTFIQQGCDTVLLAVGTSYHVESESLDRSNMSFPGAQPTLVQTVLE 527
Query: 554 ARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRL 613
A +K+ ++L+++ GP+D++ E D+++++IL + Y G+ ALA+I+ G+ +P GRL
Sbjct: 528 ALGTKQRLVLLVSTAGPVDLAALEQDTRVAAILDLIYLGQTAGTALADILLGETSPSGRL 587
Query: 614 PMTWYPESFTKV-PMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSA 672
P +W P + V P++D M+ GR+YRF ++ FG+GLSYT ++ L+A
Sbjct: 588 PFSW-PNKVSDVPPIDDYTMQ--------GRTYRFAQADVLFPFGYGLSYTQFNLSHLAA 638
Query: 673 PSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVV 732
P L + C +LR V +VTN G + G+ +
Sbjct: 639 PYILPV------------------------------CQALRLSV--NVTNTGRLSGAIPL 666
Query: 733 MLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCE 774
++ P + G P +QL RV A SK + + P E
Sbjct: 667 QVYVEWPN-AVGGPIRQLATTTRVFVDAASSKTVQLSIRPRE 707
>gi|389736853|ref|ZP_10190363.1| glucan 1,4-beta-glucosidase [Rhodanobacter sp. 115]
gi|388438821|gb|EIL95541.1| glucan 1,4-beta-glucosidase [Rhodanobacter sp. 115]
Length = 868
Score = 300 bits (768), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 170/445 (38%), Positives = 242/445 (54%), Gaps = 55/445 (12%)
Query: 63 TRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSV 122
+RA +L++ +TL EK+ Q+ ++A AIPRLG+PAY+WWSE LHGIA NG
Sbjct: 31 SRAVALVAKMTLPEKVAQMQNDAPAIPRLGVPAYDWWSEGLHGIARNG----------YA 80
Query: 123 TSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL---GQA-----GLTFWAPNINIFRD 174
T FPQ + AAS++ SL +G+ ++ EARA +N G+A GLT W+PNINIFRD
Sbjct: 81 TVFPQAIGLAASWDTSLLHAVGTVISTEARAKFNASGSGRAHGLFQGLTLWSPNINIFRD 140
Query: 175 PRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRG 234
PRWGRGQET GEDP + AV FV+ QG++ + R +
Sbjct: 141 PRWGRGQETYGEDPYLTGQLAVAFVRGIQGDDPQ-----------HPRAI---------- 179
Query: 235 DELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCS 294
A KH +A+ + G R SF+ ++ D EDT+ P FR+ + G A +MC+
Sbjct: 180 ------ATPKHFVAHSGPEAG---RDSFDVDVSPHDLEDTYLPAFRTAVVDGHAGSVMCA 230
Query: 295 YNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKA 353
YN ++G PAC L + R +WGF GY+ SDCDAV I Y + ++ ++A
Sbjct: 231 YNALHGTPACANAGLLDTRLRKDWGFAGYVVSDCDAVGDIASYHYFKPDDVQASVAAVQA 290
Query: 354 GMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPD 413
G D++CG Q A+ +G + E +D +L+ LF+ + RLG G Y ++G D
Sbjct: 291 GTDLDCGHTYASLAQ-AVRQGDIAESALDASLVRLFTARYRLGEL-GSRGNDPYARIGAD 348
Query: 414 DVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIP 473
+ + H+KLAL AA + +VLLKN LPL +A LA+IGP + + + Y G
Sbjct: 349 QIDSPAHRKLALQAALESLVLLKNAHSTLPL--HAGMRLAVIGPDADALETLEANYHGTA 406
Query: 474 CSPKSLLRGLEAYVSKTH--YASGC 496
P + L+GL A H YA G
Sbjct: 407 RHPVTPLQGLRARFGADHVAYAQGA 431
Score = 123 bits (308), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 92/296 (31%), Positives = 138/296 (46%), Gaps = 54/296 (18%)
Query: 512 RIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPV 561
R+ AD V+ GL E E DR + LP Q +L+ AR S +P+
Sbjct: 593 RVLHDADAVVAFIGLSPDVEGEQLRIDVPGFDGGDRTDIGLPAPQRALLER-ARASGKPL 651
Query: 562 ILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPES 621
I+VL G + + +A+ +IL YPG+AG A+A+++ GD+NPGGRLP+T+Y +
Sbjct: 652 IVVLLSGSAVALDWAQ--QHADAILAAWYPGQAGGTAIAQVLAGDYNPGGRLPVTFYRST 709
Query: 622 FTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISAS 681
P M+ GR+YR++ G +Y FG+GLSYT ++Y +AP+ +A+
Sbjct: 710 RDLPPYVSYAMQ--------GRTYRYFDGRPLYPFGYGLSYTRFTY---AAPT--LSAAT 756
Query: 682 LKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKV 741
LKAG V V NAG G VV ++ P
Sbjct: 757 LKAGG--------------------------TLQVSAEVRNAGQRAGDEVVQVYLDTPP- 789
Query: 742 SQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
S P L+GF R+H A + + F + P QLS + G R + G + + +G
Sbjct: 790 SPLAPRHALVGFRRIHLAAGEQRLVRFTLAP-RQLSSVDAAGARAVEPGQYRVFIG 844
>gi|323451996|gb|EGB07871.1| hypothetical protein AURANDRAFT_71699 [Aureococcus anophagefferens]
Length = 1202
Score = 300 bits (768), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 235/694 (33%), Positives = 330/694 (47%), Gaps = 104/694 (14%)
Query: 47 PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGI 106
P +YP+C+ +L I R L + T+ E I Q+ A+A+PRLG+PA + E+LHG+
Sbjct: 336 PSAAAYPYCDRALPIRARVADLAARFTVNETISQMGTMAAAVPRLGLPALNYGGEALHGV 395
Query: 107 ASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ------- 159
S T FP ASF+R LW +G+A +EARA++ Q
Sbjct: 396 WST------CAAGRCPTQFPAPHAMGASFDRDLWRAVGAASGLEARALFRWNQRHNASDC 449
Query: 160 -------AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDG 212
GLTF+APN+N+ RDPRWGR +E P EDP++ Y EFV+ FQG+
Sbjct: 450 ARSLEGCLGLTFYAPNVNLARDPRWGRIEEVPSEDPLLNGVYGAEFVRGFQGD------- 502
Query: 213 GIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLE---------KWGNFS----- 258
G + +A KH Y+LE W +
Sbjct: 503 ---------------------GAYRVANAVVKHFAVYNLEVDVEDTPPADWCGSAACAPP 541
Query: 259 --RYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RN 315
R+SF+A ++ +D E+T+ PF + A+ MCSYN VNG PAC G L + A R
Sbjct: 542 NDRHSFDARVSPRDFEETYVGPFVA-PVAAGAAAAMCSYNAVNGEPACTDGALLRGALRG 600
Query: 316 EWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGK 375
F G + +DC A+ T ++AA + AG+D NCG + A+ G
Sbjct: 601 ALNFTGVLATDCGALEDAVARHKRYATEAEAAAAAIAAGVDSNCGKVLTSALPEALAAGL 660
Query: 376 VQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLL 435
V+ + L L +LRLGL + + D V + H+ LAL AAR+G+VLL
Sbjct: 661 VRPDALRPPLERLLEARLRLGLLDDWDADAPVPRPDVDAVDSPAHRALALRAAREGLVLL 720
Query: 436 KNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRG----LEAYV--SK 489
+N + LPL+ +LA+IGP N + GY G P P LLR LEA K
Sbjct: 721 QNPNQILPLDGR--GTLAVIGPNANASMNLLSGYHGTP--PPDLLRSPLQELEARWRGGK 776
Query: 490 THYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGL------------DLT----QETE 533
YA GC + + A EAV +AK AD V++ GL D T E E
Sbjct: 777 VVYAVGC-NASGAATAALDEAVDLAKTADVVVLGLGLCGDNYGGGPPKEDATCFSIDEAE 835
Query: 534 DRDRVSLLLPGQQMSLVTSVARTSKRPVILV-LTGGGPLDVSFAEADSQISSILWIGYPG 592
DR SL LPG Q +L + + K + V L G +D SFA+ +++L GY G
Sbjct: 836 SVDRTSLKLPGAQEALFSKIWALGKPVAVAVFLVSAGAVDASFAK---DKAALLLAGYGG 892
Query: 593 EAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQ 652
E G A+A+ + G +NPGG L T P++ P DM MR S PGR+YRF +
Sbjct: 893 EFGGVAVADALLGAYNPGGALTATMLPDAGLP-PFRDMAMR--PSAASPGRTYRFLDERR 949
Query: 653 V---YGFGHGLSYTNYSYKFLSAPSELTISASLK 683
V + FG GLSYT ++ L+ P+ + A+ +
Sbjct: 950 VAPLWRFGFGLSYTAFAVS-LAGPTRVPRRAATR 982
>gi|116621778|ref|YP_823934.1| glycoside hydrolase family 3 protein [Candidatus Solibacter
usitatus Ellin6076]
gi|116224940|gb|ABJ83649.1| glycoside hydrolase, family 3 domain protein [Candidatus Solibacter
usitatus Ellin6076]
Length = 850
Score = 300 bits (767), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 173/457 (37%), Positives = 253/457 (55%), Gaps = 56/457 (12%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
PF + LS RA L++ +TL EK+ Q+ ++A AIPRLGIPAY+WW+E+LHG+A G
Sbjct: 25 PFMDPDLSAERRAADLVARMTLDEKVLQMQNSAPAIPRLGIPAYDWWNEALHGVARAG-- 82
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG--------QAGLTF 164
T FPQ + AA+++ +L I ++ EARA YN GLTF
Sbjct: 83 --------LATVFPQAIGLAATWDATLMHRIAETISTEARAKYNEAIRNDDHSRYRGLTF 134
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
W+PNINIFRDPRWGRGQET GEDP + S AV F+K QGE+
Sbjct: 135 WSPNINIFRDPRWGRGQETYGEDPFLTSRMAVAFIKGMQGED------------------ 176
Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIE 284
+ A KH Y + SR+ F+ + +D DT+ P FR+ I
Sbjct: 177 ---------PHYYKVIATAKH---YAVHSGPESSRHQFDVKPSPRDLADTYLPAFRASIV 224
Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTH 343
+ +A +MC+YN+V+G+PAC DL +K R EWGF+G++ SDC AV+ IF +Y
Sbjct: 225 EARADSLMCAYNRVDGIPACASTDLLEKRLRGEWGFQGFVVSDCGAVSDIFRGHHYQPDA 284
Query: 344 EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPR 403
++A +KAG D+ CG R A+ G + E +I+R+L LF + +LG+F+ P
Sbjct: 285 ASASAVAVKAGTDLTCGN-EYRALVDAVKTGLITEPEINRSLERLFVARFKLGMFD-PPE 342
Query: 404 KGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNIS 463
+ + + +V ++ H+K+AL+AAR+ IVLLKND LPL K+++ +A+IGP ++
Sbjct: 343 RVPFSNIPYSEVDSAGHRKIALEAARKSIVLLKNDGT-LPL-KSSIKKIAVIGPAADDAE 400
Query: 464 QMGGGYTGIPCSPKSLLRGLE---AYVSKTHYASGCH 497
+ G Y G + L G+E A ++ YA G +
Sbjct: 401 ALLGNYNGFSSLQVTPLAGIEHQWAGKAEVRYALGAN 437
Score = 95.1 bits (235), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 78/282 (27%), Positives = 120/282 (42%), Gaps = 60/282 (21%)
Query: 510 AVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKR 559
A+ AD + GL+ + E E+ DR +L LP Q L+ + T K
Sbjct: 597 AIEAVSNADVTLAFVGLNPSLEGEEMPVSVPGFQGGDRTNLELPEPQEKLIEAAIATGK- 655
Query: 560 PVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYP 619
PV++VL G + ++FA S++L Y GE A+A+ + G NP GRLP+T+Y
Sbjct: 656 PVVVVLASGSAVAMNFAA--QHASALLETWYNGEETGTAIADTLAGINNPSGRLPVTFYR 713
Query: 620 ESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS 679
P + M+ GR+YR++ G +Y FG GLSY+ + Y L
Sbjct: 714 SVDQLPPFEEYAMK--------GRTYRYFNGDALYSFGFGLSYSKFQYSALK-------- 757
Query: 680 ASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVP 739
+ +AGS + + V NA ++G VV L+
Sbjct: 758 -TRRAGSGTIVASR--------------------------VRNASSIEGDEVVQLYVNG- 789
Query: 740 KVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANK 781
+ G P + L GF R+H S+E+ F P Q + N+
Sbjct: 790 SGADGDPIRSLRGFQRIHLRPGESREVHF---PLGQEDLTNR 828
>gi|389737578|ref|ZP_10190998.1| beta-glucosidase [Rhodanobacter sp. 115]
gi|388434298|gb|EIL91245.1| beta-glucosidase [Rhodanobacter sp. 115]
Length = 898
Score = 299 bits (765), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 170/457 (37%), Positives = 249/457 (54%), Gaps = 55/457 (12%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ +T+ S RA L+S +TL EK+ Q+ ++A AIPRLG+PAY+WW+E+LHG+A G
Sbjct: 43 YLDTAHSFQERAADLVSRMTLAEKVAQMQNSAPAIPRLGVPAYDWWNEALHGVARAG--- 99
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ--------AGLTFW 165
T FPQ + AA+F+ +L + +A++ EARA YN Q GLTFW
Sbjct: 100 -------EATVFPQAIGLAATFDPALLHHEATAISDEARAKYNDFQRRGMRGRYEGLTFW 152
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
+PN NIFRDPRWGRGQET GEDP + S V FV+ +G++
Sbjct: 153 SPNTNIFRDPRWGRGQETYGEDPYLTSRMGVAFVRGLEGDDPTYQK-------------- 198
Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
L A KH + + R+ F+ +E+D +T+ P F++ ++Q
Sbjct: 199 -------------LDATAKH---FAVHSGPESERHRFDVHPSERDLHETYLPAFQALVQQ 242
Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
G +M +YN+V+GVPA L Q R +WGFKGY+ SDCDAVA I+++ T E
Sbjct: 243 GGVDAVMGAYNRVDGVPATASHRLLQDILRRDWGFKGYVVSDCDAVADIYQFHKVVPTAE 302
Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
+AA + G D+NCGT + A+ G V E ID A+ L + RLG+F+ P +
Sbjct: 303 QAAALAVNNGDDLNCGTTYATLVK-AVHDGLVNEHTIDTAVTRLMLARFRLGMFD-PPGR 360
Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
+ L V + +H LAL A++ +VLLKND LPL+ N V +A+IGP +N++
Sbjct: 361 VPWSTLPMSVVQSPQHDALALRTAQESMVLLKND-GLLPLSHN-VRRIAVIGPTADNVTA 418
Query: 465 MGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGCHDV 499
+ G Y G P +P ++L+G+ V ++ Y G V
Sbjct: 419 LLGNYHGTPKAPVTILQGIREAVPNAQVTYVQGTELV 455
Score = 127 bits (320), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 96/337 (28%), Positives = 145/337 (43%), Gaps = 62/337 (18%)
Query: 480 LRGLEAYVSKTHYASGCHDVPCN-------SDAGFHEAVRIAKKADFVIVVAGLDLTQET 532
L+ +AY + Y HD + + F A+ A+ AD VI GL E
Sbjct: 589 LKAGKAYAIRLDYLRATHDAGVRLAWRMPGAKSPFEAALDAARHADVVIFAGGLSSDLEG 648
Query: 533 EDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQI 582
E+ DR +L LP Q L+ ++ T K PV+LVLT G L + +A+ +
Sbjct: 649 EEMPVDYPGFAGGDRTTLALPATQRKLLQALQVTGK-PVVLVLTTGSALAIDWAK--QHL 705
Query: 583 SSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPG 642
+IL YPG+ G A+A+ +FG+ +P GRLP+T+Y + P +D M+ G
Sbjct: 706 PAILLAWYPGQDGGHAVADALFGNVDPAGRLPVTFYKSARQLPPFDDYAMK--------G 757
Query: 643 RSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVH 702
R+YR++TG ++ FG GLSYT ++Y L
Sbjct: 758 RTYRYFTGQPLFPFGFGLSYTRFAYSDL-------------------------------Q 786
Query: 703 IDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG 762
+D T S R + + V N G G VV L+ R + K L GF R+
Sbjct: 787 LDRDTLGPSDRMRISLRVKNTGQRAGDEVVQLYLRPLRAPHARAIKSLRGFQRISLKPGE 846
Query: 763 SKEISFGVDPCEQLSIAN--KHGRRILPLGNHVLMVG 797
+ +SF + P L + H + P G + + VG
Sbjct: 847 ERSVSFDISPQTDLKYYDVAHHAYAVAP-GRYQVQVG 882
>gi|333377782|ref|ZP_08469515.1| hypothetical protein HMPREF9456_01110 [Dysgonomonas mossii DSM
22836]
gi|332883802|gb|EGK04082.1| hypothetical protein HMPREF9456_01110 [Dysgonomonas mossii DSM
22836]
Length = 727
Score = 297 bits (761), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 229/752 (30%), Positives = 343/752 (45%), Gaps = 127/752 (16%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
YPF NTSLS R +L+S++T+ EKI LS N +PRLGI SE LHG+A GP
Sbjct: 24 YPFQNTSLSDEKRLDNLLSIMTIDEKINALSTNL-GVPRLGIRN-TGHSEGLHGMALGGP 81
Query: 112 G-------VNFNGT--VSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN---LGQ 159
G VN+ V T+FPQ +++ L + A E R +
Sbjct: 82 GNWGGFKMVNYQRVPDVYPTTTFPQAYGLGETWDTELIKKVADIEATEIRYYTQNERYTK 141
Query: 160 AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGEN---WKSDDGGIGF 216
GL APN ++ RDPRWGR +E+ GEDP +VS AV F+K QGEN WKS
Sbjct: 142 GGLVMRAPNADLARDPRWGRTEESFGEDPFLVSEMAVAFIKGLQGENPRYWKS------- 194
Query: 217 GFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQ 276
++ KH +A E + + +F+ + + +
Sbjct: 195 -----------------------ASLMKHFLANSNEDGRDSTSSNFDNRLFHE----YYS 227
Query: 277 PPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEY 336
PFR IE+G + M +YN N +P + + +K R +W FKG I +D A+ + +
Sbjct: 228 YPFRKGIEKGGSQAFMAAYNSWNEIPMTIH-PILKKIRKDWNFKGIICTDGGALDLLIKA 286
Query: 337 QNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
TH + +A ++KAG+ + A++KG + E +ID+A+ F + L+LG
Sbjct: 287 HKTFPTHTEGSAAIVKAGVG-QFLDNFRPYIYQALEKGMLTEAEIDKAIRGNFYIALKLG 345
Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAAR----QGIVLLKNDKKFLPLNKNAVSSL 452
L +GD K Y +G D + K D R + +VLLKN+KK LPLNK + +
Sbjct: 346 LLDGDQTKLPYAHIGVTDTVSVWRNKEIQDFVRLVTAKSVVLLKNEKKLLPLNKGNIKRI 405
Query: 453 AIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVR 512
A+IGP N + + Y+G P S+L+G++ V +V S +A
Sbjct: 406 AVIGPRANEV--LLDWYSGTPPYTVSILQGIKNAVGNN------VEVIYESSNEIDKAYL 457
Query: 513 IAKKADFVIVVAGLDLT---------------QETEDRDRVSLLLPGQQMSLVTSVARTS 557
A+KAD IV G + +E DR +SL +Q LV V + +
Sbjct: 458 AAQKADIAIVCVGNHVYGTDPKWKYSPVPSDGREAVDRKALSL----EQEDLVKIVHKAN 513
Query: 558 KRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTW 617
++VL P +++++ + I +IL I + LA++IFG++NP GR TW
Sbjct: 514 PN-TVMVLVSSFPFAINWSQEN--IPAILHITNNSQELGNGLADVIFGNYNPAGRTNQTW 570
Query: 618 YPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELT 677
PM D ++R GR+Y + +Y FG+GLSYTN++Y S++
Sbjct: 571 VKSIADLPPMMDYDIRN-------GRTYMYAKEKPLYPFGYGLSYTNFTY------SDMA 617
Query: 678 ISAS-LKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
+S+S L G + V ++V N GD+DG V L+
Sbjct: 618 LSSSALSKGKN--------------------------LKVSVNVKNTGDMDGEEVAQLYV 651
Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
P+ P KQL GFDR+ SK F
Sbjct: 652 SFPQSKVVRPIKQLKGFDRISIKKGESKTFEF 683
>gi|224068504|ref|XP_002302759.1| predicted protein [Populus trichocarpa]
gi|222844485|gb|EEE82032.1| predicted protein [Populus trichocarpa]
Length = 273
Score = 297 bits (760), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 136/273 (49%), Positives = 174/273 (63%), Gaps = 29/273 (10%)
Query: 42 FPCKPPHFDS--YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWW 99
F C P S +PFC L I +R LI +TLQEK+ L ++A+A+PRLGI YEWW
Sbjct: 27 FACDPEDGTSRNFPFCQVKLPIQSRVSDLIGRMTLQEKVGLLVNDAAAVPRLGIKGYEWW 86
Query: 100 SESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ 159
SE+LHG+++ GPG F G TSFPQV+ +AASFN +LW IG V+ EARAM+N G
Sbjct: 87 SEALHGVSNVGPGTQFGGAFPGATSFPQVITTAASFNATLWEAIGRVVSDEARAMFNGGV 146
Query: 160 AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFR 219
AGLT+W+PN+NIFRDPRWGRGQETPGEDP+V YA +V+ QG +
Sbjct: 147 AGLTYWSPNVNIFRDPRWGRGQETPGEDPVVAGKYAASYVRGLQGND------------- 193
Query: 220 EKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPF 279
GD L ++ACCKH AYDL+ W R+ FNA +++QD EDTF PF
Sbjct: 194 --------------GDRLKVAACCKHFTAYDLDNWNGVDRFHFNAQVSKQDMEDTFDVPF 239
Query: 280 RSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
R C+++GK + +MCSYNQVNG+P C L +K
Sbjct: 240 RMCVKEGKVASVMCSYNQVNGIPTCADPKLLKK 272
>gi|380512525|ref|ZP_09855932.1| beta-glucosidase [Xanthomonas sacchari NCPPB 4393]
Length = 885
Score = 296 bits (757), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 169/442 (38%), Positives = 237/442 (53%), Gaps = 58/442 (13%)
Query: 68 LISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQ 127
L++ +T EKI Q + A AIPRLG+PAYEWWSE LHGIA NG T FPQ
Sbjct: 40 LVAKMTRAEKIAQAMNAAPAIPRLGVPAYEWWSEGLHGIARNG----------EATVFPQ 89
Query: 128 VLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTFWAPNINIFRDPRWG 178
+ AA++N L ++G+ + EARA +NL AGLT W+PNINIFRDPRWG
Sbjct: 90 AIGLAATWNPELLHDVGTVTSTEARAKFNLAGGPGKDHPRYAGLTIWSPNINIFRDPRWG 149
Query: 179 RGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELM 238
RG ET GEDP + AV F+ QG++ R +
Sbjct: 150 RGMETYGEDPYLTGRLAVGFIHGLQGDDPA-----------HPRTI-------------- 184
Query: 239 LSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQV 298
A KHL + + G R+ F+ ++ D E T+ P FR+ I G+A +MC+YN +
Sbjct: 185 --ATPKHLAVHSGPEPG---RHGFDVDVSPHDFEATYSPAFRAAIVDGQAGSVMCAYNSL 239
Query: 299 NGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDI 357
+G PAC L + R +WGFKG++ SDCDA+ + ++ Y + S+A LKAG D+
Sbjct: 240 HGTPACAADWLIDGRVRGDWGFKGFVVSDCDAIDDMTQFHYYRPDNAGSSAAALKAGHDL 299
Query: 358 NCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-YGKLGPDDVC 416
NCGT R A D+G+ E +DR+L+ LF+ + RLG PR+ Y +LG D+
Sbjct: 300 NCGTAY-RELGIAFDRGEADEALLDRSLVRLFAARYRLGELQ--PRRNDPYARLGARDID 356
Query: 417 TSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSP 476
++ H+ LAL AA+Q +VLLKN LPL LA++GP + ++ + Y G P
Sbjct: 357 SAAHRALALQAAQQSLVLLKNANATLPLRPGL--RLAVLGPNADALAALEANYQGTSVQP 414
Query: 477 KSLLRGLEAY--VSKTHYASGC 496
+ L+GL ++ YA G
Sbjct: 415 VTPLQGLRTRFGAAQVAYAQGA 436
Score = 124 bits (310), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 82/262 (31%), Positives = 133/262 (50%), Gaps = 44/262 (16%)
Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
DR L LP Q +L+ A+ S +P+++VL G + +++AE + W YPG++G
Sbjct: 643 DRNDLALPAAQQALLER-AKASGKPLVVVLMSGSAVALNWAEQHADAIIAAW--YPGQSG 699
Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
A+A+ + GD NPGGRLP+T+Y + P +M+ GR+YR++ G ++
Sbjct: 700 GTAIAQALAGDINPGGRLPVTFYRSTKDLPPYVSYDMK--------GRTYRYFKGEPLFP 751
Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
FG+GLSYT ++Y AP T +L+AG
Sbjct: 752 FGYGLSYTQFAY---DAPQLSTT--TLQAGQP--------------------------LQ 780
Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
V +V N G G VV ++ + P+ +Q +P + L+GF RVH ++ +SF +D Q
Sbjct: 781 VSTTVRNTGARAGDEVVQVYLQYPQRAQ-SPLRSLVGFQRVHLQPGEARTLSFALD-ARQ 838
Query: 776 LSIANKHGRRILPLGNHVLMVG 797
LS ++ G+R + G++ L VG
Sbjct: 839 LSDVDRSGQRAVEAGDYRLFVG 860
>gi|285018984|ref|YP_003376695.1| beta-glucosidase [Xanthomonas albilineans GPE PC73]
gi|283474202|emb|CBA16703.1| putative beta-glucosidase protein [Xanthomonas albilineans GPE
PC73]
Length = 904
Score = 295 bits (755), Expect = 7e-77, Method: Compositional matrix adjust.
Identities = 175/446 (39%), Positives = 241/446 (54%), Gaps = 58/446 (13%)
Query: 64 RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
RA +L++ +T EKI Q ++A AIPRLGIPAYEWWSE LHGIA NG T
Sbjct: 55 RATALVAKMTRAEKIAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------EAT 104
Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTFWAPNINIFRD 174
FPQ + AAS+N L +G+ + EARA +NL GLT W+PNINIFRD
Sbjct: 105 VFPQAIGLAASWNTDLLHAVGTVTSTEARAKFNLAGGPGKNHARYGGLTIWSPNINIFRD 164
Query: 175 PRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRG 234
PRWGRG ET GEDP + AV F+ QG++ R +
Sbjct: 165 PRWGRGMETYGEDPYLTGQLAVGFIHGLQGDDPT-----------HPRTI---------- 203
Query: 235 DELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCS 294
A KHL + + G R+ F+ ++ D E T+ P FR+ I +G A +MC+
Sbjct: 204 ------ATPKHLAVHSGPESG---RHGFDVDVSPHDFEATYSPAFRAAIVEGHAGSVMCA 254
Query: 295 YNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKA 353
YN ++G+PAC L + R WGFKG++ SDCDA+ + ++ Y + SAA LKA
Sbjct: 255 YNALHGIPACAADWLIDGRVRGNWGFKGFVVSDCDAIDDMTQFHYYRADNAGSAAAALKA 314
Query: 354 GMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPR-KGKYGKLGP 412
G D+NCG R +A+D+G+ +E +DR+L+ LF+ + RLG PR K Y +LG
Sbjct: 315 GHDLNCGYAY-RDLGTALDRGEAEEAMLDRSLVRLFAARYRLGELQ--PRSKDPYARLGA 371
Query: 413 DDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGI 472
D+ + H+ LAL AA+Q +VLL+N LPL LA+IGP + ++ + Y G
Sbjct: 372 KDIDSPTHRALALQAAQQSLVLLQNRNDTLPLRPGL--RLAVIGPNADALAALEANYQGT 429
Query: 473 PCSPKSLLRGLEAYVSKT--HYASGC 496
+P + L+GL A T HY G
Sbjct: 430 SVAPVTPLQGLRARFGTTQVHYTQGA 455
Score = 122 bits (307), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 84/262 (32%), Positives = 139/262 (53%), Gaps = 44/262 (16%)
Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
DR L LP Q +L+ A+ S +P+I+VL G + +++A+ +IL YPG++G
Sbjct: 662 DRNDLSLPAAQQALLER-AKASGKPLIVVLMSGSAVALNWAK--QHADAILAAWYPGQSG 718
Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
A+A+ + GD NPGGRLP+T+Y + P +M+ GR+YR++ G ++
Sbjct: 719 GTAIAQALAGDINPGGRLPVTFYRSTKDLPPYVSYDMK--------GRTYRYFKGEALFP 770
Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
FG+GLSYT+++Y +AP +L+ S +L+AG H
Sbjct: 771 FGYGLSYTHFAY---TAP-QLS-STTLQAGDT--------------------------LH 799
Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
V +V N G G VV ++ + P +Q +P + L+GF RV ++ +SF ++P Q
Sbjct: 800 VTTTVRNTGARAGDEVVQVYLQYPPRAQ-SPLRALVGFQRVSLQPGEARTLSFALEP-RQ 857
Query: 776 LSIANKHGRRILPLGNHVLMVG 797
LS ++ G+R + G++ L VG
Sbjct: 858 LSDVDRSGQRAVEAGDYRLFVG 879
>gi|361127339|gb|EHK99311.1| putative exo-1,4-beta-xylosidase bxlB [Glarea lozoyensis 74030]
Length = 569
Score = 294 bits (753), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 181/564 (32%), Positives = 276/564 (48%), Gaps = 75/564 (13%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
C+T+ + RA +L+ + EK+Q + ++ + RLG+P Y WWSE+LHG+A PG+
Sbjct: 65 CDTTAPPADRAAALVKAMQSSEKLQNIISKSAGVSRLGLPPYNWWSEALHGVA-GAPGIQ 123
Query: 115 FNGTV--SSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
F+ + + TS P ++ AA+F+ L +G+ + EARA N +G+ FW PNIN F
Sbjct: 124 FSSSSPWNYATSLPMPILMAAAFDDDLIEKVGTLIGTEARAFGNGNHSGIDFWTPNINPF 183
Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
+DPRWGRG ETPGED + + Y ++ +G ++R++
Sbjct: 184 KDPRWGRGSETPGEDTLRLKGYVAALLRGLEGNK------------AQRRII-------- 223
Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
A CKH A DLE W +R+ F+A I+ QD + + PF+ C K M
Sbjct: 224 --------ATCKHYAANDLESWNGVTRHDFDAKISMQDLAEYYLQPFQQCARDSKVGSFM 275
Query: 293 CSYNQVNGVPACLRGDLFQKA-RNEWGFKG---YITSDCDAVATIFEYQNYTKTHEDSAA 348
CSYN VNGVPAC L Q R+ W + Y+TSDC+AV I +Y T+ A
Sbjct: 276 CSYNSVNGVPACANKYLLQTILRDHWNWTSENQYVTSDCEAVQDISLNHHYASTNAAGTA 335
Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
AG D +C G F+G K Y
Sbjct: 336 LAFNAGTDSSC----------------------------------EAGYFDGS--KALYS 359
Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
LG DV T + ++LAL A GIV+LKND LPL ++ S +A+IG ++ S++ GG
Sbjct: 360 SLGWSDVNTPQAQQLALQATVDGIVMLKNDGT-LPLKLDSKSKVAMIGFWASDSSKLQGG 418
Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASG-CHDVPCNSDAGFHEAVRIAKKADFVIVVAGLD 527
Y+G ++ + + + A+G +D A+ A K+D+++ GLD
Sbjct: 419 YSGKAPYLRTPVYAAQQLGFTPNVATGPVQQSASATDNWTTNALAAASKSDYILYFGGLD 478
Query: 528 LTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILW 587
+ E DR SL P Q++L+ ++ K P+I++ G +D + + +SSILW
Sbjct: 479 TSAAAEGVDRTSLEWPSAQLALIKKLSALGK-PLIIIQE-GDQMDNTPLLTNKGVSSILW 536
Query: 588 IGYPGEAGAKALAEIIFGDFNPGG 611
+PG+ G A+ +II G +P G
Sbjct: 537 ASWPGQDGGPAVMQIISGAKSPAG 560
>gi|256393789|ref|YP_003115353.1| beta-glucosidase [Catenulispora acidiphila DSM 44928]
gi|256360015|gb|ACU73512.1| Beta-glucosidase [Catenulispora acidiphila DSM 44928]
Length = 1548
Score = 292 bits (748), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 232/799 (29%), Positives = 357/799 (44%), Gaps = 153/799 (19%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNAS-AIPRLGIPAYEWWSESLHGIASNGPG 112
+ +T S RA L+S +TL EK++QLS N+ AIPRLG+ Y +WSE HG+ + G
Sbjct: 64 YLDTHYSAQARAADLVSRMTLPEKVEQLSTNSGPAIPRLGVQQYTYWSEGQHGVNTLGAN 123
Query: 113 VNFNGTVSSV--TSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY----------NLGQA 160
+ G +V TSFP S S++ SL +A++ EAR NLG +
Sbjct: 124 QDNGGNGGAVHSTSFPTNFASTMSWDPSLIYQETTAISDEARGFLDKSLFGVNQNNLGPS 183
Query: 161 G-----LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
LTFWAP +N+ RDPRWGR E GEDP + S A FV ++G
Sbjct: 184 AADYGSLTFWAPTVNMDRDPRWGRTDEAFGEDPYLTSTMAGAFVNGYEGNTPTG------ 237
Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTF 275
+S G L ++A KH D+E+ R ++ +++ D D +
Sbjct: 238 --------------QSKTG-TLKVAATAKHYALNDVEQ----DRTGISSNVSDTDLHDYY 278
Query: 276 QPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF---QKARNEWGFKGYITSDCDAVAT 332
F S IE S +M SYN +NG P+ D + Q A+ ++GF GY+TSDC A+ T
Sbjct: 279 TKQFASLIENAHVSGLMTSYNAINGTPSVA--DTYTANQLAQRQFGFNGYVTSDCGAIGT 336
Query: 333 IFEY----------------QNYTKTHEDSAAGV------------LKAGMDINC--GTC 362
++ ++ T T ++A G L+AG D+NC G
Sbjct: 337 AYQSFPSGHDWAPPGWTTDGKSSTGTWTNTATGATVPAQAGGQAYALRAGTDLNCAGGEN 396
Query: 363 MLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPR-KGKYGKLGPDDVCTSEHK 421
+AI G + E ID AL+ +F+V++ G F DP Y + + + H+
Sbjct: 397 TYAQITAAISAGVLSEGVIDNALVKIFTVRVETGEF--DPAGSNPYTGITKAQIQSPAHQ 454
Query: 422 KLALDAARQGIVLLKND-------------------------KKFLPLNKNAVSSLAIIG 456
LA A +VLLKN K LPL+ A + + I+G
Sbjct: 455 ALATKVADNSLVLLKNQPPAASGTSTTPPAASSAASSAAAAAKPLLPLSAAATAKIVIVG 514
Query: 457 PLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSK-------THYASGCHDVPCNSDAGFHE 509
+ N ++ G Y+ P S ++G+ A V K T A G + + +
Sbjct: 515 DMANAVTL--GNYSSDPALKVSPVQGITAAVRKANPGASVTFDACGTSTTASAAASCSAQ 572
Query: 510 AVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGG 569
+ AD VIV G + E +DR S+ +PG SL++ VA ++L + GG
Sbjct: 573 TLADVAGADAVIVFVGTNQQIADEGKDRTSIAMPGNYDSLISQVAAVGNPRMVLAVQSGG 632
Query: 570 PLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMND 629
P+ + + D +SI++ G+ GE+ ALA+++FG NP G L TWY + M++
Sbjct: 633 PVRIDDVQKD--FASIVFSGFNGESQGTALADVLFGAQNPDGHLDFTWYADDSQLPAMSN 690
Query: 630 MNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKN 689
+ + GR+Y ++TGT Y FG+GLSY+ +S+ + A +
Sbjct: 691 YGLTPAQTGGL-GRTYMYFTGTPTYPFGYGLSYSTFSFSGVHA--------------EGR 735
Query: 690 ILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVS---QGTP 746
+ GS+ V ++V N G GS V L+A+ PK + Q P
Sbjct: 736 SVDANGSQ-----------------SVSVTVKNTGKTAGSTVAQLYAQ-PKFTVAGQTFP 777
Query: 747 EKQLIGFDRVHTVAKGSKE 765
+QL+GF + + G +
Sbjct: 778 NEQLVGFAKSKVLKPGESQ 796
>gi|255545660|ref|XP_002513890.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
gi|223546976|gb|EEF48473.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
Length = 336
Score = 292 bits (748), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 156/344 (45%), Positives = 209/344 (60%), Gaps = 19/344 (5%)
Query: 467 GGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGL 526
G Y G PC + L+GL+ Y+ T Y GC+ V C S A EAV+IA++ D V++V L
Sbjct: 5 GNYAGPPCKTVTPLQGLQNYIKNTKYHRGCNTVAC-SKATIREAVQIAREVDQVVLVMEL 63
Query: 527 DLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSIL 586
D TQE E D ++L LPG Q L+ SVAR + +PV+LVL GG +DVSFA + +I SIL
Sbjct: 64 DQTQEAERIDLLNLRLPGNQQKLIISVARAANKPVVLVLICGGLVDVSFAITEPKIGSIL 123
Query: 587 WIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYR 646
W GYPGEAG ALAEIIFGD NPGG+LP+TWYP+ +TK+PM D+ MR + YPGRSYR
Sbjct: 124 WAGYPGEAGGTALAEIIFGDHNPGGKLPVTWYPQQYTKIPMTDVRMRPQIASGYPGRSYR 183
Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLSAPSE---LTISASLKA--GSDKNILQQTGSRLDYV 701
FY G +V+ FG+GLSY+N SY+ S P + L +S+K S ++ + G L
Sbjct: 184 FYEGKKVFEFGYGLSYSNCSYEIASIPQDKIFLRSPSSIKGVKTSSYTLVSELGKEL--- 240
Query: 702 HIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAK 761
C +F V + V N G + G H V++F R PK G P K+L+ F V A
Sbjct: 241 -------CERSKFSVTVKVKNEGKIIGKHPVLVFLRQPKPGSGRPVKKLVAFQTVRLNAG 293
Query: 762 GSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
+ EI + PCE L+ AN+ G ++ G L+VGE + +TI
Sbjct: 294 QNAEIQRKLSPCEPLTRANEDGSMVIDGG---LVVGEKPYQITI 334
>gi|116181370|ref|XP_001220534.1| hypothetical protein CHGG_01313 [Chaetomium globosum CBS 148.51]
gi|88185610|gb|EAQ93078.1| hypothetical protein CHGG_01313 [Chaetomium globosum CBS 148.51]
Length = 549
Score = 292 bits (747), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 195/565 (34%), Positives = 286/565 (50%), Gaps = 56/565 (9%)
Query: 19 LQLIVVVNVIAFSNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKI 78
L L + + F+N PD C C+ + RA +L+ L ++EK+
Sbjct: 5 LSLRLALASALFANGVIGFQYPD--CANGPLADNTVCDPKATPPERAAALVKALNIEEKL 62
Query: 79 QQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFN----GTVSSVTSFPQVLVSAAS 134
Q L D + RLG+PAY WWSE+LHG+A++ PGV FN G SS TSF + +A+
Sbjct: 63 QNLVDMSKGAERLGLPAYAWWSEALHGVAAS-PGVRFNRTAGGRFSSATSFANSITLSAA 121
Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAY 194
F+ L + ++ EARA N G AGL +W PNIN ++DPRWGRG ETPGEDP+ + Y
Sbjct: 122 FDDELVYKVADTISTEARAFANAGLAGLDYWTPNINPYKDPRWGRGHETPGEDPVRIKGY 181
Query: 195 AVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKW 254
+ +G DD I ++V+ A CKH AYDLE+W
Sbjct: 182 VKALLAGLEG-----DDPSI------RKVV----------------ATCKHYAAYDLERW 214
Query: 255 GNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA- 313
+R+ F+A+++ QD + + PPF+ C K MCSYN +NG PAC L
Sbjct: 215 QGTTRHRFDAVVSLQDLSEYYLPPFQQCARDSKVGSFMCSYNALNGTPACASTYLMDDIL 274
Query: 314 RNEWGF---KGYITSDCDAVATIF---EYQNY--TKTHEDSAAGVLKAGMDINC---GTC 362
R WG+ YITSDC+A+ ++ N+ T+T ++AA +AG D C G
Sbjct: 275 RKHWGWTEHNNYITSDCNAIQDFLPGPKWHNFSSTQTEAEAAAVAYQAGTDTVCEVPGWP 334
Query: 363 MLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-YGKLGPDDVCTSEHK 421
A ++ + E+ ID AL L+ +R+G F DP G Y +G +DV T E +
Sbjct: 335 PYTDVIGAYNQTLLSEEVIDTALKRLYEGLVRVGYF--DPASGSPYRSIGWEDVNTPEAQ 392
Query: 422 KLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVN--NISQMGGGYTGIPCSPKSL 479
+LAL + G+VLLKND LPLN ++A+IG N N ++ GGY+G P S
Sbjct: 393 ELALQSGTDGLVLLKNDGT-LPLNLED-KTVALIGFWANSTNGGRILGGYSGFPPYIHSP 450
Query: 480 LRGLEAYVSKTHYASGC---HDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRD 536
+ E HYASG + D +A+ AKK++ ++ G D + ED D
Sbjct: 451 VDAAEKLNLTYHYASGPLAENITQAAIDDWVAKALEPAKKSNVILYFGGTDTSIAAEDLD 510
Query: 537 RVSLLLPGQQMSLVTSVARTSKRPV 561
R S+ P Q++++ +++ + P
Sbjct: 511 RDSIAWPEIQLAVIEALSALRQAPA 535
>gi|296081549|emb|CBI20072.3| unnamed protein product [Vitis vinifera]
Length = 333
Score = 291 bits (744), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 158/342 (46%), Positives = 214/342 (62%), Gaps = 12/342 (3%)
Query: 465 MGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVA 524
M G Y G P + L+GL A V+ T Y GC +V C + A EA +IA AD +++
Sbjct: 1 MIGNYEGTPGKYTTPLQGLTALVATT-YLPGCSNVACGT-AQIDEAKKIAAAADATVLIV 58
Query: 525 GLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISS 584
G+D + E E RDRV++ LPGQQ L+T VA+ SK VILV+ GG D+SFA+ D +I+S
Sbjct: 59 GIDQSIEAEGRDRVNIQLPGQQPLLITEVAKASKGNVILVVMSGGGFDISFAKNDDKITS 118
Query: 585 ILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGR 643
ILW+GYPGEAG A+A++IFG +NP GRLP TWYP+S+ KVPM +MNMR D + YPGR
Sbjct: 119 ILWVGYPGEAGGAAIADVIFGFYNPSGRLPTTWYPQSYVDKVPMTNMNMRPDPASGYPGR 178
Query: 644 SYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHI 703
+YRFYTG +Y FG GLSYT +++ + AP S S+ + +D V
Sbjct: 179 TYRFYTGETIYTFGDGLSYTQFNHHLIQAPK----SVSIPIEEGHSCHSSKCKSVDAVQ- 233
Query: 704 DEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGS 763
SC +L F + + V NAG++ GSH V LF+ P V +P+K L+GF++V AK
Sbjct: 234 ---ESCQNLAFDIHLRVNNAGNISGSHTVFLFSSPPSV-HNSPQKHLLGFEKVFVTAKAE 289
Query: 764 KEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
+ F VD C+ LSI ++ G R + LG HVL VG L+HSL +
Sbjct: 290 ALVRFKVDVCKDLSIVDELGTRKVALGLHVLHVGNLKHSLNV 331
>gi|433679952|ref|ZP_20511614.1| beta-glucosidase [Xanthomonas translucens pv. translucens DSM
18974]
gi|430814928|emb|CCP42243.1| beta-glucosidase [Xanthomonas translucens pv. translucens DSM
18974]
Length = 909
Score = 289 bits (739), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 165/437 (37%), Positives = 231/437 (52%), Gaps = 56/437 (12%)
Query: 72 LTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVS 131
+T +EK+ Q + A AIPRLG+PAYEWW+E LHGIA NG T FPQ +
Sbjct: 68 MTREEKVAQAMNAAPAIPRLGVPAYEWWNEGLHGIARNG----------YATVFPQAIGL 117
Query: 132 AASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTFWAPNINIFRDPRWGRGQE 182
AA++N +L +G+ + EARA +NL AGLT W+PNINIFRDPRWGRG E
Sbjct: 118 AATWNTALLEQVGTVTSTEARAKFNLAGGPGKDHPRYAGLTIWSPNINIFRDPRWGRGME 177
Query: 183 TPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSAC 242
T GEDP + AV F++ QG++ R + A
Sbjct: 178 TYGEDPYLTGQLAVGFIRGLQGDD-----------LTHPRTI----------------AT 210
Query: 243 CKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVP 302
KHL + + G R+ F+ ++ D E T+ P FR+ I G+A +MC+YN ++G P
Sbjct: 211 PKHLAVHSGPEPG---RHGFDVDVSPHDLEATYTPAFRAAIVDGRAGAVMCAYNSLHGTP 267
Query: 303 ACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGT 361
AC L + R +WGF G++ SDCDAV + ++ + + S+A LKAG D+NCG
Sbjct: 268 ACAADWLLNGRLRGDWGFTGFVVSDCDAVDDMTQFHYFRADNAGSSAAALKAGHDLNCGY 327
Query: 362 CMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHK 421
R AI +G E +D++L+ LF+ + RLG RK Y +LG DV ++ H+
Sbjct: 328 AY-RDLGKAIARGDADEAVLDQSLVRLFAARYRLGELQPQ-RKDPYARLGAKDVDSAAHR 385
Query: 422 KLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLR 481
LAL AA+Q IVLL+N LPL LA+IGP + ++ + Y G +P + L
Sbjct: 386 ALALQAAQQSIVLLQNRNATLPLRPGL--RLAVIGPNADALAALEANYQGTSAAPVTPLL 443
Query: 482 GLEAY--VSKTHYASGC 496
GL + YA G
Sbjct: 444 GLRERFGAANLRYAQGA 460
Score = 116 bits (291), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 78/262 (29%), Positives = 137/262 (52%), Gaps = 44/262 (16%)
Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
DR L LP Q +L+ A+ S +P+++VL G + +++A+ + W YPG++G
Sbjct: 667 DRNDLALPAAQQALLER-AKASGKPLVVVLMSGSAVALNWAKQHADAIVAAW--YPGQSG 723
Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
A+A+++ GD NPGGRLP+T+Y + ++ A S GR+YR++ G ++
Sbjct: 724 GTAIAQVLAGDVNPGGRLPVTFY--------RSTKDLPAYVSYDMKGRTYRYFKGEPLFA 775
Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
FG GLSYT ++Y A +L+ + +L+AG++
Sbjct: 776 FGSGLSYTRFTY----AAPQLSAT-TLQAGAN--------------------------LQ 804
Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
V+ V N+G G VV ++ + P+ +Q +P + L+GF RV ++E+ F + P Q
Sbjct: 805 VRTQVRNSGTRAGDEVVQVYLQPPQGAQ-SPLRTLVGFQRVTLQPGEAREVGFELTP-RQ 862
Query: 776 LSIANKHGRRILPLGNHVLMVG 797
LS ++ G+R + G++ + VG
Sbjct: 863 LSDVDRAGQRAVQPGDYRVFVG 884
>gi|371777036|ref|ZP_09483358.1| glycoside hydrolase [Anaerophaga sp. HS1]
Length = 890
Score = 288 bits (737), Expect = 7e-75, Method: Compositional matrix adjust.
Identities = 173/455 (38%), Positives = 245/455 (53%), Gaps = 55/455 (12%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ + +L RA L+S +TL+EK+ Q+ A AI RLGIP Y WW+E LHG+
Sbjct: 40 YLDPTLPFEERAADLVSKMTLEEKVSQMQHAAPAIERLGIPEYNWWNECLHGV------- 92
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN----LGQ----AGLTFW 165
G T FPQ + AA ++ I +AV+ EARA ++ G+ GLTFW
Sbjct: 93 ---GRAGIATVFPQAIGMAAMWDDEEMYRIATAVSDEARAKHHDFARRGKRGIYQGLTFW 149
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
PNINIFRDPRWGRG ET GEDP + AV+++K QG+ DD R LK
Sbjct: 150 TPNINIFRDPRWGRGMETYGEDPFLTGELAVDYIKGLQGD----DD----------RYLK 195
Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
L A KH + + + R+ F+A + +D+ T+ P F+ I++
Sbjct: 196 -------------LVATSKHFLVHSGPE---PDRHHFDARTSARDSLMTYTPHFKKTIQE 239
Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFE--YQNYTKTH 343
+MC+YN+ NG+P C + RNEWGFKGYI SDC AVA ++ + T
Sbjct: 240 AGVYSVMCAYNRYNGLPCCGSKPVENLLRNEWGFKGYIVSDCWAVADFYKKGHHEVVPTV 299
Query: 344 EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPR 403
E++AA +KAG D+NCG A+ +G V E++ID + L +LRLG+F+ P
Sbjct: 300 EEAAAMAVKAGTDLNCGNSYPALV-DAVKQGLVSEEEIDVLVKRLMEARLRLGMFD-PPE 357
Query: 404 KGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNIS 463
Y + V + EH++LAL AAR+ +VLLKND LPL+KN V ++A+IGP NN+
Sbjct: 358 MVPYTNIPYSVVDSKEHRELALIAARKSMVLLKNDNNTLPLDKN-VKNVAVIGPNANNLD 416
Query: 464 QMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGC 496
+ Y G P +P + L G+ + + YA GC
Sbjct: 417 VLLANYNGYPSNPVTPLDGIRQKLPNANVQYALGC 451
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 103/299 (34%), Positives = 153/299 (51%), Gaps = 55/299 (18%)
Query: 509 EAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSK 558
EA++IA +D V++ GL E E+ DRV + LP Q LV ++ K
Sbjct: 609 EAIQIAAASDVVLMFMGLSPNLEGEEMPVNVPGFSGGDRVDIKLPQIQTDLVKAIMSLGK 668
Query: 559 RPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWY 618
PV+LVL G L +++ EA++ + +IL YPG+AG A+A+++FGD+NP GRLP+T+Y
Sbjct: 669 -PVVLVLLNGSALAINW-EAEN-VPAILEAWYPGQAGGTAIADVLFGDYNPAGRLPVTFY 725
Query: 619 PESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTI 678
P D +M GR+Y+++ G ++ FG+GLSYT++ Y L P +
Sbjct: 726 KSVTQLPPFEDYSMD--------GRTYQYFKGEALFPFGYGLSYTSFKYDNLVVPDK--- 774
Query: 679 SASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARV 738
L+AG EVT V + VTN G+ DG VV L+
Sbjct: 775 ---LEAGK------------------EVT--------VHVDVTNTGNRDGDEVVQLYVSH 805
Query: 739 PKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
P V + P + L GFDR+ A +K +SF + P EQL++ ++P GN L VG
Sbjct: 806 PDV-ESAPIRSLQGFDRIALKAGETKTVSFTLKP-EQLAVYQPQNGLVVPAGNLKLSVG 862
>gi|392537607|ref|ZP_10284744.1| Beta-glucosidase [Pseudoalteromonas marina mano4]
Length = 870
Score = 288 bits (736), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 173/479 (36%), Positives = 250/479 (52%), Gaps = 68/479 (14%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ N S SI R L++ LTL+EK+ QL D + AI RL IP Y WW+E+LHG+A G
Sbjct: 34 YLNESASIDERVNDLVTRLTLEEKVAQLFDKSPAIERLNIPEYNWWNEALHGVARAG--- 90
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
T FPQ + AA+F+ L +G+A++ E RA ++ A GLT+W
Sbjct: 91 -------KATVFPQAIGLAATFDEDLMLRVGTAISDEGRAKHHAFLAENNRSMYTGLTYW 143
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
+PNINIFRDPRWGRGQET GEDP + + AV F+ QG+N
Sbjct: 144 SPNINIFRDPRWGRGQETYGEDPYLTTRIAVNFINGLQGDN------------------- 184
Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
+ L A KH Y + SR+S + +++D +T+ P F+ I Q
Sbjct: 185 --------TEYLKSVATLKH---YAVHSGPEVSRHSDDYTASKKDLAETYLPAFKDVIAQ 233
Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQ--NYTKT 342
K + +MC+YN VNG PAC +L Q K R+E+ F GYI SDC A+A ++ + N T
Sbjct: 234 TKVASVMCAYNSVNGTPACGNDELIQNKLRDEFNFDGYIVSDCGAIADFYDVKSHNIVNT 293
Query: 343 HEDSAAGVLKAGMDINCGT---CMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
+AA LK G D+NCG + A+ +G V+EKD+D+AL L + +LG+F+
Sbjct: 294 EAKAAAMALKTGTDLNCGDHHGNTYSYLSQAVKEGLVEEKDVDKALKRLMYARFKLGMFD 353
Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
+P Y D V +++H L +AA++ +VLLKN+ + LPL N +A+IGP
Sbjct: 354 -NPENVPYSDTSIDIVGSNKHLALTQEAAKKSLVLLKNE-QVLPLKGN--EKVALIGPNA 409
Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASG--------CHDVPCNSDAGFH 508
+N + + G Y G+P P + LE + K + Y +G H P + FH
Sbjct: 410 DNEAILLGNYNGMPIVPITPKLALEQRLGKNNLTYTAGSSITGEVYTHYQPIKAKHFFH 468
Score = 114 bits (286), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 79/301 (26%), Positives = 140/301 (46%), Gaps = 57/301 (18%)
Query: 508 HEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTS 557
+A+ A +AD ++ V G+ E E+ DR ++ LP Q++L+ + +T
Sbjct: 597 QQALNNANEADVIVFVGGISANLEGEEMPLQIDGFSHGDRTNINLPKSQLNLLKKLKQTG 656
Query: 558 KRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTW 617
K P++LV G + +++ + I +I+ YPGEA AL +++G+++P G+LP+T+
Sbjct: 657 K-PIVLVNMSGSAMALNWE--NENIDAIIQGFYPGEAAGSALVSLLYGEYSPSGKLPITF 713
Query: 618 YPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
Y +S + +P D +M+ R+Y++Y G +Y FG GLSY ++ YK
Sbjct: 714 Y-KSVSDLPDFKDYSMK--------NRTYKYYEGEVLYPFGFGLSYADFKYK-------- 756
Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
S+ AGS ++ ++TN VV ++
Sbjct: 757 NTRHSIDAGSGD-------------------------LNLTTTITNQSSFSADDVVQVYV 791
Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
+P TP KQL+GF + + +I F + P +LS N+ G + G ++ V
Sbjct: 792 SMPDAPIKTPNKQLVGFKHITLKNESKNDIKFTI-PKNKLSYINEQGIAVAYKGRLIITV 850
Query: 797 G 797
G
Sbjct: 851 G 851
>gi|332665860|ref|YP_004448648.1| beta-glucosidase [Haliscomenobacter hydrossis DSM 1100]
gi|332334674|gb|AEE51775.1| Beta-glucosidase [Haliscomenobacter hydrossis DSM 1100]
Length = 887
Score = 288 bits (736), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 175/500 (35%), Positives = 257/500 (51%), Gaps = 81/500 (16%)
Query: 2 SRPFMSIQLRERILFRVLQLIVVVNVIAFSNSKPVLNKPDFPCKPPHFDSYPFCNTSLSI 61
+R S++ I F L I++ NV C+ F +P +T+LS
Sbjct: 6 TRRLCSVKAHRAIFFLSLLTILIANV----------------CQAQSFKGFPMWDTNLSF 49
Query: 62 STRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSS 121
R K L+S LTL+EK+ Q+ + A AIPRLGIPAY+WW+E LHG+A T
Sbjct: 50 EVRVKDLVSRLTLEEKVGQMLNAAPAIPRLGIPAYDWWNEVLHGVAR---------TPFH 100
Query: 122 VTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA---------GLTFWAPNINIF 172
VT +PQ + AA ++ + + + A+E RA++N A GLT+W PNINIF
Sbjct: 101 VTVYPQAIGMAAGWDSTSLAMMAHYSALEGRAVFNKATALGRNNERYLGLTYWTPNINIF 160
Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
RDPRWGRGQET GEDP + S FV+ QG++ K
Sbjct: 161 RDPRWGRGQETYGEDPFLTSMLGRAFVRGLQGDDPKY----------------------- 197
Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
L +AC KH + + SR+S N + D DT+ P F+ + + K +M
Sbjct: 198 ----LKAAACAKHFAVHSGPE---PSRHSDNFSPSNYDLWDTYLPAFKELVTKAKVEGVM 250
Query: 293 CSYNQVNGVPAC----LRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAA 348
C+YN +G P C L D+ +K +W FKGY+TSDC A+ F++ KTH D+ +
Sbjct: 251 CAYNAFHGQPCCGSDVLMNDILRK---QWQFKGYVTSDCWAIDDFFKFH---KTHPDATS 304
Query: 349 GVLKA---GMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDP-RK 404
+ A G D+ CGT + + + KG + E +D +L+ LF+ + RLG+F DP
Sbjct: 305 ASVDAVLHGTDVECGTDVYKSLLDGVKKGMIAEAQLDISLIRLFTTRYRLGMF--DPVSM 362
Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
KY + + T+EHK +L A+Q IVLLKN+ LPL+KN + +A++GP +N
Sbjct: 363 VKYAQTPESILETAEHKAHSLKMAQQSIVLLKNEGNTLPLSKN-IKKIAVLGPNADNRIV 421
Query: 465 MGGGYTGIPCSPKSLLRGLE 484
+ G Y G P + L+G++
Sbjct: 422 VLGNYNGQPSEIITALQGIK 441
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 90/297 (30%), Positives = 136/297 (45%), Gaps = 54/297 (18%)
Query: 511 VRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRP 560
V K AD ++ V G+ E E+ DR S+LLP Q L+ + T K
Sbjct: 613 VNRVKDADAIVYVGGISPQLEGEEMRVDFPGFNGGDRTSILLPAVQTELLKMLKGTGKPL 672
Query: 561 VILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPE 620
V +V+TG ++ D I +I+ Y G++ A+A+++FGD+NP GRLP+T+Y
Sbjct: 673 VFVVMTGSA---IALPYEDQNIPAIVNAWYGGQSAGTAIADVLFGDYNPAGRLPVTFYKA 729
Query: 621 SFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISA 680
D ++ S R+YR++ G +Y FGHGLSYT++ Y L P ++ A
Sbjct: 730 --------DSDLPDFKSYDMNNRTYRYFKGDALYPFGHGLSYTSFQYSKLKTPGKIKSGA 781
Query: 681 SLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPK 740
S F V ++TN G DG VV L+ P+
Sbjct: 782 S--------------------------------FKVSATLTNTGKKDGDEVVQLYLAYPE 809
Query: 741 VSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
V+ P + L GF+R+ A SK +SF + P EQ + N+ G P G + +G
Sbjct: 810 VAGKAPIRALKGFNRIRLKAGESKTVSFTLSP-EQCQLVNEEGALYQPKGKMEISLG 865
>gi|217967241|ref|YP_002352747.1| glycoside hydrolase family 3 [Dictyoglomus turgidum DSM 6724]
gi|217336340|gb|ACK42133.1| glycoside hydrolase family 3 domain protein [Dictyoglomus turgidum
DSM 6724]
Length = 762
Score = 287 bits (735), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 229/756 (30%), Positives = 359/756 (47%), Gaps = 128/756 (16%)
Query: 72 LTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVS 131
+ L KIQ+ + RLGIPA E L G + G T FPQ +
Sbjct: 72 VELANKIQKFLKEKT---RLGIPAI-IHEECLSGFMAKG-----------ATVFPQAIGM 116
Query: 132 AASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVV 191
A++F L + + RA N+ Q GL+ P ++I RDPRWGR +ET GEDP +V
Sbjct: 117 ASTFEPELIRRVSDVIRQHMRAA-NVHQ-GLS---PVLDIPRDPRWGRTEETFGEDPYLV 171
Query: 192 SAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDL 251
S A E+VK QGE+W+ + A KH AY +
Sbjct: 172 SRMAAEYVKGLQGEDWREG----------------------------IIATVKHFTAYGI 203
Query: 252 EKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQ 311
+ +R A + E++ + F PF I++G+A +M +Y++++GVP L
Sbjct: 204 SEG---ARNLGPAKVGERELREVFLFPFEVAIKEGQAGSLMNAYHEIDGVPCASSKFLLT 260
Query: 312 KA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGT--CMLRHTQ 368
K R EWGFKGY+ SD A+ + + K +++A L+AG+DI + C
Sbjct: 261 KILRWEWGFKGYVVSDYIAIRMLENFHRVAKDAKEAAVLALEAGIDIELPSVDCYGEPLI 320
Query: 369 SAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAA 428
A+ +G + E+ I+ ++ + + LGLF+GD K K D E ++L+ + A
Sbjct: 321 QAVKEGLISEEVINASVERVLRAKFMLGLFDGDLEKDP--KKVYDIFDKPEFRELSREVA 378
Query: 429 RQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT---------------GIP 473
R+ IVLLKND LPL+KN + ++A+IGP +N + G Y+ IP
Sbjct: 379 RRSIVLLKNDG-ILPLSKN-IRTVAVIGPNADNPRNLHGDYSYTAHIPSVSETLEGVKIP 436
Query: 474 --CSPK--SLLRGLEAYVS---KTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG- 525
C+ + S+L G++ VS + YA GC ++ +S GF EA+ IAK+AD +I V G
Sbjct: 437 EECAVRTVSILEGIKNKVSAETQVLYAKGC-EILSDSKEGFDEAIEIAKRADVIIAVMGE 495
Query: 526 ----LDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQ 581
E DR +L L G Q L+ + + K P++LVL G P + + +
Sbjct: 496 ESGLFHRGISGEGNDRTTLELFGIQRDLLRELHKLGK-PIVLVLVNGRPQALKWEHEN-- 552
Query: 582 ISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYP 641
+++IL YPGE G A+A++IFGD+NP G+LP++ +P +VP+ N + + Y
Sbjct: 553 LNAILEAWYPGEEGGDAVADVIFGDYNPSGKLPIS-FPAVTGQVPVY-YNRKPSAFTDYV 610
Query: 642 GRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYV 701
S + +Y FGHGLSYT + Y L +
Sbjct: 611 EESAK-----PLYPFGHGLSYTTFEYSNLK-----------------------------I 636
Query: 702 HIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAK 761
H ++V + + + ++ N G +G VV L+ S P K+L GF ++H
Sbjct: 637 HPEKVNALE--KVEISFTIKNTGVREGEEVVQLYVHDQVASLERPVKELKGFKKIHLKPG 694
Query: 762 GSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
SK ++F + P EQL+ ++ R ++ G +M+G
Sbjct: 695 ESKRVTFILYP-EQLAFYDEFMRFVVEKGIFEIMIG 729
>gi|28199699|ref|NP_780013.1| family 3 glycoside hydrolase [Xylella fastidiosa Temecula1]
gi|182682443|ref|YP_001830603.1| beta-glucosidase [Xylella fastidiosa M23]
gi|417557804|ref|ZP_12208815.1| Beta-glucosidase [Xylella fastidiosa EB92.1]
gi|28057820|gb|AAO29662.1| family 3 glycoside hydrolase [Xylella fastidiosa Temecula1]
gi|182632553|gb|ACB93329.1| Beta-glucosidase [Xylella fastidiosa M23]
gi|338179587|gb|EGO82522.1| Beta-glucosidase [Xylella fastidiosa EB92.1]
Length = 882
Score = 287 bits (734), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 172/450 (38%), Positives = 239/450 (53%), Gaps = 58/450 (12%)
Query: 64 RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
A +L++ +T QEKI Q + A AIPRLGIPAY+WWSE LHGIA NG T
Sbjct: 33 HAAALVAQMTRQEKIAQTMNAAPAIPRLGIPAYDWWSEGLHGIARNG----------YAT 82
Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTFWAPNINIFRD 174
FPQ + AAS+N L ++G+ + EARA +NL AGLT W+PNINIFRD
Sbjct: 83 VFPQAIGLAASWNTDLLQHVGTVTSTEARAKFNLTGGPGKDHPRYAGLTLWSPNINIFRD 142
Query: 175 PRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRG 234
PRWGRG ET GEDP + S AV F++ QG D
Sbjct: 143 PRWGRGMETYGEDPYLTSQLAVSFIRGLQG---------------------------DTP 175
Query: 235 DELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCS 294
D A KH + + G R+SF+ ++ D E T+ P FR+ I G A +MC+
Sbjct: 176 DHPRTIATPKHFAVHSGPEQG---RHSFDVDVSAYDLEATYTPAFRAAIVDGHAGSVMCA 232
Query: 295 YNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKA 353
YN ++G PAC L + RN+WGF G++ SDCDA+ + + + + + ++A LK+
Sbjct: 233 YNALHGTPACASDWLLNTRLRNDWGFNGFVVSDCDAIEDMTRFHFFRQDNASASAAALKS 292
Query: 354 GMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK-GKYGKLGP 412
G D+NCG R AI +G + E +D+AL+ LF+ + RLG PR+ Y +G
Sbjct: 293 GNDLNCGNTY-RDLNQAIARGDIDESTLDQALIRLFTARQRLGTLQ--PREHDPYAAIGI 349
Query: 413 DDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGI 472
+ T H+ LAL AA Q +VLLKN LPL ++LA++GP ++++ + Y G
Sbjct: 350 KHIDTPAHRALALQAAAQSLVLLKNSGNTLPLPPE--TTLAVLGPDADSLTALEANYQGT 407
Query: 473 PCSPKSLLRGLEAY--VSKTHYASGCHDVP 500
+P + L GL +K HYA G P
Sbjct: 408 SSTPVTPLTGLRTRFGTAKVHYAQGASLAP 437
Score = 122 bits (307), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 95/299 (31%), Positives = 148/299 (49%), Gaps = 54/299 (18%)
Query: 509 EAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSK 558
EA R AD ++ GL E E+ DR ++ LP Q +L+ V +T+
Sbjct: 603 EAERAVAHADAIVAFVGLSPEVEGEELHIDTPGFSGGDRTTIDLPATQETLLQHV-KTTG 661
Query: 559 RPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWY 618
+P+I+VL G + +++A+ +IL YPG++G A+A+ + GD NPGGRLP+T+Y
Sbjct: 662 KPLIVVLMSGSAVALNWAQ--HHADAILAAWYPGQSGGTAIAQALAGDVNPGGRLPVTFY 719
Query: 619 PESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTI 678
+ P +M GR+YR++ G +Y FG+GLSYT ++Y+ AP T
Sbjct: 720 RSTQDLPPYISYDMT--------GRTYRYFKGQPLYPFGYGLSYTQFAYE---APQLST- 767
Query: 679 SASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARV 738
A+LKAG + ++ HV+ + T AGD VV L+
Sbjct: 768 -ATLKAG----------------------NTLTVTTHVRNTGTRAGD----EVVQLYLEP 800
Query: 739 PKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
P Q P + L+GF RV S+ ++F +D QLS + G+R + G++ L VG
Sbjct: 801 PYSPQA-PLRSLVGFKRVTLRPGESRLLTFTLD-ARQLSSVQQTGQRSVEAGHYHLFVG 857
>gi|423258860|ref|ZP_17239783.1| hypothetical protein HMPREF1055_02060 [Bacteroides fragilis
CL07T00C01]
gi|423264169|ref|ZP_17243172.1| hypothetical protein HMPREF1056_00859 [Bacteroides fragilis
CL07T12C05]
gi|387776440|gb|EIK38540.1| hypothetical protein HMPREF1055_02060 [Bacteroides fragilis
CL07T00C01]
gi|392706435|gb|EIY99558.1| hypothetical protein HMPREF1056_00859 [Bacteroides fragilis
CL07T12C05]
Length = 805
Score = 286 bits (732), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 234/738 (31%), Positives = 346/738 (46%), Gaps = 124/738 (16%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP + E HG + G T FP + A+++N L +G +A+
Sbjct: 140 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRQMGRVIAI 187
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
EA A Q + P +++ RDPRW R +ET GEDP + A V+ FQGE
Sbjct: 188 EASA-----QGAHIGYGPVLDLARDPRWSRVEETYGEDPYLNGAMGTALVRGFQGETL-- 240
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
+DG K V+ A KH +Y W A I E+
Sbjct: 241 NDG--------KSVI----------------ATLKHFASYG---WTEGGHNGGTAHIGER 273
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC-LRGDLFQKARNEWGFKGYITSDCD 328
+ E+ PPFR + G A +M SYN+++G P R L ++ W FKG++ SD
Sbjct: 274 ELEEAIFPPFREAVGAG-ALSVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVSDLY 332
Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSAIDKGKVQEKDIDRALLN 387
AV + E+ +E +A + AG+D + GT + +A+ +G V ID+A+
Sbjct: 333 AVGGLREHGVAGNDYE-AAIKAVNAGVDSDLGTNVYAEQLVAAVKRGDVAVATIDKAVRR 391
Query: 388 LFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKN 447
+ S++ ++GLF+ DP + K V +SEH LA + ARQ IVLLKN K LPL K+
Sbjct: 392 ILSLKFQMGLFD-DPFVDE--KQAVQLVASSEHTGLAREVARQSIVLLKNKDKLLPLKKD 448
Query: 448 AVSSLAIIGPLVNNISQMGGGYTGIPCSPK--SLLRGLEAYVSKTH---YASGCHDVPCN 502
+ +LA+IGP +N+ M G YT ++L G+ VSK YA GC V +
Sbjct: 449 -IRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCA-VRDS 506
Query: 503 SDAGFHEAVRIAKKADFVIVVAG----LDLTQETEDR-------------------DRVS 539
S GF +A+ A+ AD V++V G D + E E+ DR +
Sbjct: 507 SRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMESGEGYDRAT 566
Query: 540 LLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKAL 599
L L G+Q+ L+ ++R K PV+LVL G PL + A +++ W YPG G A+
Sbjct: 567 LHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAEAIVDAW--YPGMQGGNAV 623
Query: 600 AEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHG 659
A+++FGD+NP GRL ++ P S ++P+ N R +R Y GT Y FG+G
Sbjct: 624 ADVLFGDYNPAGRLTLS-VPRSVGQLPVY-YNTRRKGNRS----RYIEEPGTPRYPFGYG 677
Query: 660 LSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQIS 719
LSYT +SY + + GSD C R V ++
Sbjct: 678 LSYTTFSYT--------DMKVQVTEGSD--------------------DC---RVDVTVT 706
Query: 720 VTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIA 779
+ N G DG V L+ R S TP KQL F R+H A S+E++F +D + L++
Sbjct: 707 IQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTLDK-KSLALY 765
Query: 780 NKHGRRILPLGNHVLMVG 797
+ G ++ G +MVG
Sbjct: 766 MQEGEWVVEPGRFTIMVG 783
>gi|440731995|ref|ZP_20911965.1| glucan 1,4-beta-glucosidase [Xanthomonas translucens DAR61454]
gi|440370332|gb|ELQ07251.1| glucan 1,4-beta-glucosidase [Xanthomonas translucens DAR61454]
Length = 913
Score = 286 bits (732), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 161/462 (34%), Positives = 247/462 (53%), Gaps = 45/462 (9%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ +T S RA L++ +TL+EK Q+ + A AIPRLG+PAY+WW+E+LHG+A G
Sbjct: 37 YLDTQRSFEQRAADLVARMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 93
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG--------QAGLTFW 165
T FPQ + AA+F+ L + +A++ EARA ++ GLTFW
Sbjct: 94 -------GATVFPQAIGMAATFDVPLMHEVSTAISDEARAKHHEALRHDQHARYQGLTFW 146
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
+PNINIFRDPRWGRGQET GEDP + + V FV+ QGE + G +R+
Sbjct: 147 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGADAPKNAQGEAYRK----- 201
Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
L A KH + + R+ F+A +++D +T+ P F + +++
Sbjct: 202 -------------LDATAKHFAVHSGPE---ADRHHFDAHPSQRDLYETYLPAFEALVKE 245
Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
GK +M +YN+V G A L + R+ WGF GY+ SDC A+ I++ T E
Sbjct: 246 GKVDAVMGAYNRVYGESASASKFLLRDVLRDRWGFDGYVVSDCWAIVDIWKNHKIVATRE 305
Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
++AA +K G ++ CG SA+ KG + E D+D+AL L ++RLG+F+ P K
Sbjct: 306 EAAALAVKHGTELECG-AEYSTLPSAVRKGLISEADVDKALQKLMYSRMRLGMFD-PPEK 363
Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
+ ++ + EH LA AR+ +VLLKND LPL++ + +A++GP ++
Sbjct: 364 LAWAQIPLSANQSPEHDALARRTARESLVLLKND-GVLPLSRAKIKRIAVVGPTADDTMA 422
Query: 465 MGGGYTGIPCSPKSLLRGLE--AYVSKTHYASGCHDVPCNSD 504
+ G Y G P +P ++L+G+ A ++ YA G V D
Sbjct: 423 LLGNYYGTPAAPVTVLQGIREAAPDAEVLYARGADLVEGRDD 464
Score = 139 bits (351), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 96/302 (31%), Positives = 147/302 (48%), Gaps = 53/302 (17%)
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVART 556
+A+ A++AD V+ V GL E E+ DR L LP Q +L+ ++ T
Sbjct: 628 LQDALDAARRADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRALLEALHGT 687
Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
K PV+ VLT G L + +A+ + +IL YPG+ G A+A+++FGD NPGGRLP+T
Sbjct: 688 GK-PVVAVLTTGSALAIDWAQ--QHVPAILLAWYPGQRGGSAVADVLFGDANPGGRLPVT 744
Query: 617 WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
+Y ES T +D MR GR+YR++ GT +Y FGHGLSYT ++Y S+L
Sbjct: 745 FYKESETLPAFDDYAMR--------GRTYRYFAGTPLYPFGHGLSYTQFAY------SDL 790
Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
+ D++ L G R H + V N G G VV L+
Sbjct: 791 RL--------DRSKLAADG-----------------RLHATLKVKNTGQRAGDEVVQLYL 825
Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGR-RILPLGNHVLM 795
+ + K L GF R+ ++E+ F + P L + ++ + ++ G++ L
Sbjct: 826 QPLSPQRERASKDLRGFQRIALQPGETREVRFAISPQSDLRLYDEARKGYVVDPGDYELQ 885
Query: 796 VG 797
VG
Sbjct: 886 VG 887
>gi|359450637|ref|ZP_09240068.1| beta-glucosidase [Pseudoalteromonas sp. BSi20480]
gi|358043611|dbj|GAA76317.1| beta-glucosidase [Pseudoalteromonas sp. BSi20480]
Length = 468
Score = 286 bits (731), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 168/458 (36%), Positives = 244/458 (53%), Gaps = 60/458 (13%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ N S SI R L++ LTL+EK+ QL D + AI RL +P Y WW+E+LHG+A G
Sbjct: 34 YLNKSASIDERVNDLVTRLTLEEKVAQLFDKSPAIERLNMPEYNWWNEALHGVARAG--- 90
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL--------GQAGLTFW 165
T FPQ + AA+F+ L +G+A++ E RA ++ GLT+W
Sbjct: 91 -------KATVFPQAIGLAATFDEDLMLRVGTAISDEGRAKHHAFLEENNRSMYTGLTYW 143
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
+PNINIFRDPRWGRGQET GEDP + + AV F+ QG+N
Sbjct: 144 SPNINIFRDPRWGRGQETYGEDPYLTTRIAVNFINGLQGDN------------------- 184
Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
+ L A KH Y + SR+S + +E+D +T+ P F+ I Q
Sbjct: 185 --------AEYLKSVATLKH---YAVHSGPEVSRHSDDYTASEKDLAETYLPAFKDVIAQ 233
Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQ--NYTKT 342
K + +MC+YN VNG PAC +L Q K R+E+ F GYI SDC A+A ++ + N T
Sbjct: 234 TKVASVMCAYNSVNGTPACGNDELIQNKLRDEFNFDGYIVSDCGAIADFYDVKSHNIVNT 293
Query: 343 HEDSAAGVLKAGMDINCGT---CMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
+AA LK G D+NCG + A+ +G V+EKD+D+AL L + +LG+F+
Sbjct: 294 GAKAAAMALKTGTDLNCGDHHGNTYSYLTQAVKEGLVEEKDVDKALKRLMYARFKLGMFD 353
Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
+P Y D V +++H L +AA++ +VLLKN+ + LPL N +A+IGP
Sbjct: 354 -NPENVPYSDTSIDVVGSNKHLALTQEAAQKSLVLLKNE-QVLPLKGN--EKIALIGPNA 409
Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASG 495
+N + + G Y G+P P + LE + K + Y +G
Sbjct: 410 DNEAILLGNYNGMPIVPITPKLALEQRLGKNNLTYTAG 447
>gi|390991557|ref|ZP_10261819.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
axonopodis pv. punicae str. LMG 859]
gi|372553724|emb|CCF68794.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
axonopodis pv. punicae str. LMG 859]
Length = 901
Score = 286 bits (731), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 164/468 (35%), Positives = 248/468 (52%), Gaps = 45/468 (9%)
Query: 48 HFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIA 107
H + P+ +T S RA L+S +TL+EK Q+ + A AIPRL +PAY+WW+E+LHG+A
Sbjct: 28 HAATPPYLDTQRSFEARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWWNEALHGVA 87
Query: 108 SNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN--LGQ------ 159
G T FPQ + AA+F+ L + +A++ EARA ++ L Q
Sbjct: 88 RAG----------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHARY 137
Query: 160 AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFR 219
GLTFW+PNINIFRDPRWGRGQET GEDP + + V FV+ QGE + G +R
Sbjct: 138 QGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGAAAPKNAQGEPYR 197
Query: 220 EKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPF 279
+ L A KH + + R+ F+A +++D +T+ P F
Sbjct: 198 K------------------LDATAKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAF 236
Query: 280 RSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQN 338
+ ++ GK +M +YN+V G A L Q R++WGF+GY+ SDC A+ I+++
Sbjct: 237 EALVKDGKVDAVMGAYNRVYGESASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHK 296
Query: 339 YTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
T E +AA +K G ++ CG +A+ +G + E ID AL L + ++RLG+F
Sbjct: 297 IVATREQAAALAVKHGTELECGEEYAT-LPAAVRQGLIDEAQIDTALKTLMTARMRLGMF 355
Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
+ P + + + + H LA AR+ +VLLKND LPL++ + +A+IGP
Sbjct: 356 D-PPGQLPWSTIPASVNQSPAHDALARRTARESLVLLKND-GLLPLSRAKLKRIAVIGPT 413
Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGCHDVPCNSD 504
++ + G Y G P +P ++L+G+ A K YA G V D
Sbjct: 414 ADDTMALLGNYYGTPAAPVTVLQGIRAAAPKAQVLYARGADLVEGRDD 461
Score = 133 bits (334), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 98/320 (30%), Positives = 144/320 (45%), Gaps = 59/320 (18%)
Query: 480 LRGLEAYVSKTHYASGCHDVPCN-------SDAGFHEAVRIAKKADFVIVVAGLDLTQET 532
L+ AY + Y G D + EA+ +A+ AD V+ V GL E
Sbjct: 591 LQAGRAYEVRLEYFEGERDAAVRLAWRQPGAKPPLQEALDVARSADVVVFVGGLTGDVEG 650
Query: 533 EDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQI 582
E+ DR L LP Q L+ ++ T RPV+ VLT G L + +A+ +
Sbjct: 651 EEMKVNYPGFAGGDRTDLRLPKPQRDLLEALQATG-RPVVAVLTTGSALAIDWAQ--QHL 707
Query: 583 SSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPG 642
+IL YPG+ G A+A+ +FGD NPGGRLP+T+Y ES T +D MR G
Sbjct: 708 PAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYKESETLPAFDDYAMR--------G 759
Query: 643 RSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVH 702
R+YR++ GT +Y FGHGLSYT ++Y +G RLD
Sbjct: 760 RTYRYFGGTPLYPFGHGLSYTQFAY--------------------------SGLRLDRTT 793
Query: 703 IDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG 762
I S T+ ++V N G G VV L+ + K+L GF R+
Sbjct: 794 IATDGSLTAT-----VTVKNTGQRAGDEVVQLYLHPLAPQRERAGKELHGFQRIALQPGE 848
Query: 763 SKEISFGVDPCEQLSIANKH 782
+E+ F ++ + L + ++
Sbjct: 849 QRELGFTINAKDALRLYDEQ 868
>gi|21244948|ref|NP_644530.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. citri str.
306]
gi|21110666|gb|AAM39066.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. citri str.
306]
Length = 901
Score = 286 bits (731), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 164/468 (35%), Positives = 250/468 (53%), Gaps = 45/468 (9%)
Query: 48 HFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIA 107
H + P+ +T S RA L+S +TL+EK Q+ + A AIPRL +PAY+WW+E+LHG+A
Sbjct: 28 HAATPPYLDTQRSFEARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWWNEALHGVA 87
Query: 108 SNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN--LGQ------ 159
G T FPQ + AA+F+ L + +A++ EARA ++ L Q
Sbjct: 88 RAG----------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHARY 137
Query: 160 AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFR 219
GLTFW+PNINIFRDPRWGRGQET GEDP + + V FV+ QGE + G +R
Sbjct: 138 QGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGAAAPKNAQGEPYR 197
Query: 220 EKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPF 279
+ L A KHL + + R+ F+A +++D +T+ P F
Sbjct: 198 K------------------LDATAKHLAVHSGPE---ADRHHFDARPSQRDLYETYLPAF 236
Query: 280 RSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQN 338
+ +++GK +M +YN+V G A L Q R++WGF+GY+ SDC A+ I+++
Sbjct: 237 EALVKEGKVDAVMGAYNRVYGESASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHK 296
Query: 339 YTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
T E +AA +K G ++ CG +A+ +G + E ID AL L + ++RLG+F
Sbjct: 297 IVATREQAAALAVKHGTELECGEEYAT-LPAAVRQGLIDEAQIDTALKTLMTARMRLGMF 355
Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
+ P + + + + H LA AR+ +VLLKND LPL++ +A+IGP
Sbjct: 356 D-PPGQLPWSTIPASVNQSPAHDALARRTARESLVLLKND-GLLPLSRAKFKRIAVIGPT 413
Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGCHDVPCNSD 504
++ + G Y G P +P ++L+G+ A ++ YA G V D
Sbjct: 414 ADDTMALLGNYYGTPAAPVTVLQGIRAAAPNAQVLYARGADLVEGRDD 461
Score = 132 bits (333), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 95/320 (29%), Positives = 140/320 (43%), Gaps = 59/320 (18%)
Query: 480 LRGLEAYVSKTHYASGCHDVPCN-------SDAGFHEAVRIAKKADFVIVVAGLDLTQET 532
L+ AY + Y G D + EA+ +A+ AD V+ V GL E
Sbjct: 591 LQAGRAYEVRLEYFEGERDAAVRLAWRQPGARPPLQEALDVARSADVVVFVGGLTGDVEG 650
Query: 533 EDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQI 582
E+ DR L LP Q L+ ++ T RPV+ VLT G L + +A+ +
Sbjct: 651 EEMKVNYPGFAGGDRTDLRLPKPQRDLLEALQATG-RPVVAVLTTGSALAIDWAQ--QHL 707
Query: 583 SSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPG 642
+IL YPG+ G A+A+ +FGD NPGGRLP+T+Y ES T +D MR G
Sbjct: 708 PAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYKESETLPAFDDYAMR--------G 759
Query: 643 RSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVH 702
R+YR++ GT +Y FGHGLSYT ++Y L
Sbjct: 760 RTYRYFGGTPLYPFGHGLSYTQFAYSGL-------------------------------R 788
Query: 703 IDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG 762
+D T T ++V N G G VV L+ + K+L GF R+
Sbjct: 789 LDRTTIATDGSLAATVTVKNTGQRAGDEVVQLYLHPLAPQRERAGKELHGFQRIALQPGE 848
Query: 763 SKEISFGVDPCEQLSIANKH 782
+E+ F ++ + L + ++
Sbjct: 849 QRELGFTINAKDALRLYDEQ 868
>gi|289668505|ref|ZP_06489580.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. musacearum
NCPPB 4381]
Length = 902
Score = 286 bits (731), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 163/463 (35%), Positives = 248/463 (53%), Gaps = 45/463 (9%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
P+ +T S RA L+S +TL+EK Q+ + A AIPRLG+ AY+WW+E+LHG+A G
Sbjct: 34 PYLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVAAYDWWNEALHGVARAG-- 91
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN--LGQ------AGLTF 164
T FPQ + AA+F+ L + +A++ EARA ++ L Q GLTF
Sbjct: 92 --------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHERYQGLTF 143
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
W+PNINIFRDPRWGRGQET GEDP + + V FV+ QGE + G +R+
Sbjct: 144 WSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVRGLQGEGGDAPKNAQGESYRK---- 199
Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIE 284
L A KH + + R+ F+A +++D +T+ P F + ++
Sbjct: 200 --------------LDATAKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVK 242
Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTH 343
GK +M +YN+V G A L Q R +WGFKGY+ SDC A+ I+++ T
Sbjct: 243 DGKVDAVMGAYNRVYGESASASKFLLQDVLRQQWGFKGYVVSDCWAIVDIWKHHKIVATR 302
Query: 344 EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPR 403
E +AA +K G ++ CG +A+ +G ++E ID +L L + ++RLG+F+ P
Sbjct: 303 EQAAALAVKHGTELECGE-EYSTLPAAVHQGLIEEAQIDTSLQTLMTARMRLGMFD-PPG 360
Query: 404 KGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNIS 463
+ + K+ + H LA AR+ +VLLKND LPL++ + +A+IGP ++
Sbjct: 361 QLPWSKIPASVNQSPAHDALARRTARESLVLLKND-GLLPLSRTKLKRIAVIGPTADDTM 419
Query: 464 QMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGCHDVPCNSD 504
+ G Y G P +P ++L+G+ A ++ YA G V D
Sbjct: 420 ALLGNYYGTPAAPVTVLQGIRAAAPNAQVLYARGADLVEGRDD 462
Score = 129 bits (325), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 91/286 (31%), Positives = 134/286 (46%), Gaps = 52/286 (18%)
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVART 556
EA+ +A+ A+ V+ V GL E E+ DR L LP Q L+ ++ T
Sbjct: 626 LQEALDVARSAEVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRELLEALQAT 685
Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
K PV+ VLT G L + +A+ + +IL YPG+ G A+A+ +FGD NPGGRLP+T
Sbjct: 686 GK-PVVAVLTAGSALAIDWAQ--QHVPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVT 742
Query: 617 WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
+Y ES +D M GR+YR++ GT +Y FGHGLSYT ++Y S+L
Sbjct: 743 FYKESEALPAFDDYAMH--------GRTYRYFGGTPLYPFGHGLSYTQFAY------SDL 788
Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
+ D+N + GS F ++V N G G V L+
Sbjct: 789 RL--------DRNTVAADGS-----------------FTATVTVKNTGQRAGDEVAQLYL 823
Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
+ K+L GF RV +E+ F ++ E L I ++
Sbjct: 824 HPLTPQRERAGKELRGFQRVALHPGEQRELRFPINAKEALRIYDEQ 869
>gi|265765465|ref|ZP_06093740.1| periplasmic beta-glucosidase [Bacteroides sp. 2_1_16]
gi|263254849|gb|EEZ26283.1| periplasmic beta-glucosidase [Bacteroides sp. 2_1_16]
Length = 814
Score = 285 bits (730), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 234/738 (31%), Positives = 347/738 (47%), Gaps = 124/738 (16%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP + E HG + G T FP + A+++N L +G +A+
Sbjct: 149 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRQMGRVIAI 196
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
EA A Q + P +++ RDPRW R +ET GEDP + A V+ FQGE
Sbjct: 197 EASA-----QGAHIGYGPVLDLARDPRWSRVEETYGEDPYLNGAMGTALVRGFQGETL-- 249
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
+DG K V+ A KH +Y W A I E+
Sbjct: 250 NDG--------KSVI----------------ATLKHFASYG---WTEGGHNGGTAHIGER 282
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC-LRGDLFQKARNEWGFKGYITSDCD 328
+ E+ PPFR + G A +M SYN+++G P R L ++ W FKG++ SD
Sbjct: 283 ELEEAIFPPFREAVGAG-ALSVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVSDLY 341
Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSAIDKGKVQEKDIDRALLN 387
AV + E+ +E +A + AG+D + GT + +A+ +G V ID+A+
Sbjct: 342 AVGGLREHGVAGNDYE-AAIKAVNAGVDSDLGTNVYAEQLVAAVKRGDVAVATIDKAVRR 400
Query: 388 LFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKN 447
+ S++ ++GLF+ DP + K V +SEH LA + ARQ IVLLKN K LPL K+
Sbjct: 401 ILSLKFQMGLFD-DPFVDE--KQAVQLVASSEHTGLAREVARQSIVLLKNKDKLLPLKKD 457
Query: 448 AVSSLAIIGPLVNNISQMGGGYTGIPC--SPKSLLRGLEAYVSKTH---YASGCHDVPCN 502
+ +LA+IGP +N+ M G YT + ++L G+ VSK YA GC V +
Sbjct: 458 -IRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCA-VRDS 515
Query: 503 SDAGFHEAVRIAKKADFVIVVAG----LDLTQETEDR-------------------DRVS 539
S GF +A+ A+ AD V++V G D + E E+ DR +
Sbjct: 516 SRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMESGEGYDRAT 575
Query: 540 LLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKAL 599
L L G+Q+ L+ ++R K PV+LVL G PL + A +++ W YPG G A+
Sbjct: 576 LHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAEAIVDAW--YPGMQGGNAV 632
Query: 600 AEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHG 659
A+++FGD+NP GRL ++ P S ++P+ N R +R Y GT Y FG+G
Sbjct: 633 ADVLFGDYNPAGRLTLS-VPRSVGQLPVY-YNTRRKGNRS----RYIEEPGTPRYPFGYG 686
Query: 660 LSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQIS 719
LSYT +SY + + GSD C R V ++
Sbjct: 687 LSYTTFSYT--------DMKVQVTEGSD--------------------DC---RVDVTVT 715
Query: 720 VTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIA 779
+ N G DG V L+ R S TP KQL F R+H A S+E++F +D + L++
Sbjct: 716 IQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTLDK-KSLALY 774
Query: 780 NKHGRRILPLGNHVLMVG 797
+ G ++ G +MVG
Sbjct: 775 MQEGEWVVEPGRFTIMVG 792
>gi|333380553|ref|ZP_08472244.1| hypothetical protein HMPREF9455_00410 [Dysgonomonas gadei ATCC
BAA-286]
gi|332826548|gb|EGJ99377.1| hypothetical protein HMPREF9455_00410 [Dysgonomonas gadei ATCC
BAA-286]
Length = 957
Score = 285 bits (730), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 232/764 (30%), Positives = 368/764 (48%), Gaps = 120/764 (15%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDN--ASAIPRLGIPAYEWWSESLHGIASNGP 111
+ N +L + +R + L+S++T+++K++ L + IP LG+PA E++HG +
Sbjct: 170 YMNPNLPLESRVEDLLSVMTVEDKMELLREGWGIPGIPHLGVPAIHK-VEAIHGFSYG-- 226
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
S T FPQ + A++N+ L A+ E + N QA W+P +++
Sbjct: 227 --------SGATIFPQSIGMGATWNKRLIEAAAMAIGDETVSA-NAVQA----WSPVLDV 273
Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
+D RWGR +ET GEDP++V+ ++K +Q +
Sbjct: 274 AQDARWGRCEETYGEDPVLVTEIGGAWIKGYQSKG------------------------- 308
Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
L KH A+ G R S + ++E++ + PFR ++ K I
Sbjct: 309 -------LMTTPKHFAAHGAPLGG---RDSHDIGLSEREMREIHLVPFRDIYKKYKYQSI 358
Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHE-DSAAG 349
M SY+ GVP +L + R+EWGF G+I SDC A+ + ++YT + ++A
Sbjct: 359 MMSYSDFLGVPVAKSKELLKGILRDEWGFDGFIVSDCGAIGNLTARKHYTAVDKVEAARQ 418
Query: 350 VLKAGMDINCG-TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG-KY 407
L AG+ NCG T +A +G++ D+D L R GLF +P K +
Sbjct: 419 ALAAGIATNCGDTYNDPDVIAAAKRGELNMDDLDFTCKTLLRTLFRNGLFENNPCKPLDW 478
Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
K+ P + EH+ LA A++ IVLL+N LPL+K ++ ++A+IGP +N+ G
Sbjct: 479 NKIYPG-WNSPEHQALARKTAQESIVLLENKGNILPLSK-SLKTIAVIGPGADNLQP--G 534
Query: 468 GYTGIP--CSPKSLLRGLEAYV---SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIV 522
YT P KS+L G++A V +K Y GC + +AV+ A+ AD ++
Sbjct: 535 DYTSKPQPGQLKSVLTGIKAAVNSSTKVLYEEGCRFIGTEG-TDIAKAVKAAENADVAVL 593
Query: 523 VAGLDLTQET---------EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
V G T E E+ D +L+LPG+Q L+ +V +T K PV+L+L G P ++
Sbjct: 594 VLGDCSTSEALKGITNTSGENHDLATLILPGEQQKLLEAVCKTGK-PVVLILQAGRPYNL 652
Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMR 633
S+A + Q + W+ PG+ G A A+++FGD+NP GRLPMT +P ++P+ N +
Sbjct: 653 SYAAENCQAVLVNWL--PGQEGGYATADVLFGDYNPAGRLPMT-FPRDAAQLPLY-YNFK 708
Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
S R Y +Y +Y FG+GLSYT+++Y S+L IS L++
Sbjct: 709 T-SGRVYDYVDMPYYP---LYQFGYGLSYTSFNY------SDLNIS-----------LEK 747
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
G+ V +VTN G V G VV L+ S T +L F
Sbjct: 748 NGN-----------------VSVNATVTNTGKVAGDEVVQLYITDMYASVKTRVMELKDF 790
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
DRV+ SK++SF + P QLS+ N R++ G +MVG
Sbjct: 791 DRVYLNPGESKKVSFVLTPY-QLSLLNDEMDRVVEKGLFKIMVG 833
>gi|71731103|gb|EAO33170.1| Beta-glucosidase [Xylella fastidiosa subsp. sandyi Ann-1]
Length = 882
Score = 285 bits (730), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 172/454 (37%), Positives = 239/454 (52%), Gaps = 58/454 (12%)
Query: 60 SISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTV 119
S A +L++ +T QEKI Q + A AIPRLGIPAY+WWSE LHGIA NG
Sbjct: 29 SPEQHAAALVAQMTRQEKIAQTMNAAPAIPRLGIPAYDWWSEGLHGIARNG--------- 79
Query: 120 SSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTFWAPNIN 170
T FPQ + AAS+N L ++G+ + EARA +NL AGLT W+PNIN
Sbjct: 80 -YATVFPQAIGLAASWNTDLLQHVGTVTSTEARAKFNLTGGPGKDHPRYAGLTLWSPNIN 138
Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
IFRDPRWGRG ET GEDP + AV F++ QG
Sbjct: 139 IFRDPRWGRGMETYGEDPYLTGQLAVSFIRGLQG-------------------------- 172
Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
D D A KH + + G R+SF+ ++ D E T+ P FR+ I G A
Sbjct: 173 -DTPDHPRTIATPKHFAVHSGPEQG---RHSFDVDVSAYDLEATYTPAFRAAIVDGHAGS 228
Query: 291 IMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
+MC+YN ++G PAC L + RN+WGF G++ SDCDA+ + + + + + ++A
Sbjct: 229 VMCAYNALHGTPACASDWLLNTRLRNDWGFNGFVVSDCDAIEDMTRFHFFRQDNASASAA 288
Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK-GKYG 408
LK+G D+NCG R AI +G + E +D+AL+ LF+ + RLG PR+ Y
Sbjct: 289 ALKSGDDLNCGNTY-RDLNQAIARGDIDESTLDQALIRLFTARQRLGTLQ--PREHDPYA 345
Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
+G + T H+ LAL AA Q +VLLKN LPL ++LA++GP ++++ +
Sbjct: 346 AIGIKHIDTPAHRALALQAAAQSLVLLKNSGNTLPLPPE--TTLAVLGPDADSLTALEAN 403
Query: 469 YTGIPCSPKSLLRGLEAY--VSKTHYASGCHDVP 500
Y G +P + L GL +K HYA G P
Sbjct: 404 YQGTSSTPVTPLTGLRTRFGTAKVHYAQGASLAP 437
Score = 123 bits (308), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 98/299 (32%), Positives = 148/299 (49%), Gaps = 54/299 (18%)
Query: 509 EAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSK 558
EA R AD ++ GL E E+ DR ++ LP Q +L+ V +T+
Sbjct: 603 EAERAVAHADAIVAFVGLSPEVEGEELHIDTPGFSGGDRTTIDLPATQETLLQHV-KTTG 661
Query: 559 RPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWY 618
+P+I+VL G + +++A+ +IL YPG++G A+A+ + GD NPGGRLP+T+Y
Sbjct: 662 KPLIVVLMSGSAVALNWAQ--HHADAILAAWYPGQSGGTAIAQALAGDVNPGGRLPVTFY 719
Query: 619 PESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTI 678
+ P +M GR+YR++ G +Y FG+GLSYT ++Y+ AP T
Sbjct: 720 RSTQDLPPYISYDMT--------GRTYRYFKGQPLYPFGYGLSYTQFAYE---APQLST- 767
Query: 679 SASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARV 738
A+LKAG N L T HV+ + T AGD VV L+
Sbjct: 768 -ATLKAG---NTLTVTA-------------------HVRNTGTRAGD----EVVQLYLEP 800
Query: 739 PKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
P Q P + L+GF RV S+ ++F +D QLS + G+R + G++ L VG
Sbjct: 801 PYSPQA-PLRSLVGFKRVTLRPGESRLLTFTLD-ARQLSGVQQTGQRSVEAGHYHLFVG 857
>gi|289666226|ref|ZP_06487807.1| beta-glucosidase precursor [Xanthomonas campestris pv. vasculorum
NCPPB 702]
Length = 902
Score = 285 bits (729), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 163/463 (35%), Positives = 248/463 (53%), Gaps = 45/463 (9%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
P+ +T S RA L+S +TL+EK Q+ + A AIPRLG+ AY+WW+E+LHG+A G
Sbjct: 34 PYLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVAAYDWWNEALHGVARAG-- 91
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN--LGQ------AGLTF 164
T FPQ + AA+F+ L + +A++ EARA ++ L Q GLTF
Sbjct: 92 --------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHERYQGLTF 143
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
W+PNINIFRDPRWGRGQET GEDP + + V FV+ QGE + G +R+
Sbjct: 144 WSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVRGLQGEGGDAPKNAQGESYRK---- 199
Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIE 284
L A KH + + R+ F+A +++D +T+ P F + ++
Sbjct: 200 --------------LDATAKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVK 242
Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFEYQNYTKTH 343
GK +M +YN+V G A L Q R +WGFKGY+ SDC A+ I+++ T
Sbjct: 243 DGKVDAVMGAYNRVYGESASASKFLLQDLLRQQWGFKGYVVSDCWAIVDIWKHHKIVATR 302
Query: 344 EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPR 403
E +AA +K G ++ CG +A+ +G ++E ID +L L + ++RLG+F+ P
Sbjct: 303 EQAAALAVKHGTELECGE-EYSTLPAAVHQGLIEEAQIDTSLQTLMTARMRLGMFD-PPG 360
Query: 404 KGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNIS 463
+ + K+ + H LA AR+ +VLLKND LPL++ + +A+IGP ++
Sbjct: 361 QLPWSKIPASVNQSPAHDALARRTARESLVLLKND-GLLPLSRTKLKRIAVIGPTADDTM 419
Query: 464 QMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGCHDVPCNSD 504
+ G Y G P +P ++L+G+ A ++ YA G V D
Sbjct: 420 ALLGNYYGTPAAPVTVLQGIRAAAPNAQVLYARGADLVEGRDD 462
Score = 131 bits (330), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 92/286 (32%), Positives = 135/286 (47%), Gaps = 52/286 (18%)
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVART 556
EA+ +A+ A+ V+ V GL E E+ DR L LP Q L+ ++ T
Sbjct: 626 LQEALDVARSAEVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRELLEALQAT 685
Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
K PV+ VLT G L + +A+ + +IL YPG+ G A+A+ +FGD NPGGRLP+T
Sbjct: 686 GK-PVVAVLTAGSALAIDWAQ--QHVPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVT 742
Query: 617 WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
+Y ES +D M GR+YR++ GT +Y FGHGLSYT ++Y S+L
Sbjct: 743 FYKESEALPAFDDYAMH--------GRTYRYFGGTPLYPFGHGLSYTQFAY------SDL 788
Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
+ D+N + GS F ++V N G G V L+
Sbjct: 789 RL--------DRNTVAADGS-----------------FTATVTVKNTGQRAGDEVAQLYL 823
Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
+ K+L GF RV +E+SF ++ E L I ++
Sbjct: 824 HPLTPQRERAGKELRGFQRVALHPGEQRELSFPINAKEALRIYDEQ 869
>gi|218186207|gb|EEC68634.1| hypothetical protein OsI_37026 [Oryza sativa Indica Group]
Length = 1241
Score = 285 bits (729), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 157/342 (45%), Positives = 202/342 (59%), Gaps = 34/342 (9%)
Query: 146 AVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGE 205
AV+ EARAMYN+G+ GLT+W+PNIN+ RDPRWGR ETPGEDP VV YAV FV+ Q
Sbjct: 917 AVSTEARAMYNMGKGGLTYWSPNINVVRDPRWGRALETPGEDPYVVGRYAVNFVRGMQ-- 974
Query: 206 NWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM 265
D G + G+ + R L SACCKH AYDL+ W N +R+ F+A
Sbjct: 975 ----DIPG-------HEAVAAGGDPNTR--PLKTSACCKHYAAYDLDDWHNHTRFEFDAR 1021
Query: 266 ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYIT 324
+ E+D +TFQ PF C+ G S +MCSYN+VNG+PAC L Q R +WG GYI
Sbjct: 1022 VDERDMVETFQRPFEMCVRDGDVSSVMCSYNRVNGIPACADARLLSQTIRRDWGLHGYIV 1081
Query: 325 SDCDAVATIFEYQNYT-KTHEDSAAGVLKAGMDINCGTCMLRHT-------------QSA 370
SDCDAV + + + T +++A LKAG+D++CG T A
Sbjct: 1082 SDCDAVRVMTDNATWLGYTGAEASAAALKAGLDLDCGESWKNETDGHPLMDFLTTYGMEA 1141
Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQ 430
++KGK++E DID AL N + +RLG F+ +Y LG D+CT +HK LALD ARQ
Sbjct: 1142 VNKGKMRESDIDNALTNQYMTLMRLGYFDD---IAQYSSLGRQDICTDQHKTLALDGARQ 1198
Query: 431 GIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ-MGGGYTG 471
GIVLLKND K LPL+ N V + + GP V + M G YTG
Sbjct: 1199 GIVLLKNDNKLLPLDANKVGFVNVRGPHVQAPEKIMDGDYTG 1240
>gi|58584046|ref|YP_203062.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae KACC
10331]
gi|84625823|ref|YP_453195.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae MAFF
311018]
gi|58428640|gb|AAW77677.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae KACC
10331]
gi|84369763|dbj|BAE70921.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae MAFF
311018]
Length = 904
Score = 285 bits (728), Expect = 8e-74, Method: Compositional matrix adjust.
Identities = 163/463 (35%), Positives = 247/463 (53%), Gaps = 45/463 (9%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
P+ T S RA L+S +TL+EK Q+ + A AIPRLG+PAY+WW+E+LHG+A G
Sbjct: 36 PYLQTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG-- 93
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN--LGQ------AGLTF 164
T FPQ + AA+F+ L + +A++ EARA ++ L Q GLTF
Sbjct: 94 --------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHRFLRQHQHARYQGLTF 145
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
W+PNINIFRDPRWGRGQET GEDP + + V FV+ QGE + G +R+
Sbjct: 146 WSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGSDAPKNAQGERYRK---- 201
Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIE 284
L A KH + + R+ F+A +++D +T+ P F + ++
Sbjct: 202 --------------LDATAKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVK 244
Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTH 343
GK +M +YN+V G A L Q R +WGFKGY+ SDC A+ ++++ T
Sbjct: 245 DGKVDAVMGAYNRVYGESASASKFLLQDVLRQQWGFKGYVVSDCWAIVDVWKHHKIVATR 304
Query: 344 EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPR 403
E +AA + G ++ CG +A+ +G + E ID AL L + ++RLG+F+ P
Sbjct: 305 EQAAALAVTHGTELECGE-EYSTLPAAVHQGLIDEAQIDTALQTLMTARMRLGMFD-PPG 362
Query: 404 KGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNIS 463
+ + K+ + H LA AR+ +VLLKND LPL++ + +A+IGP ++
Sbjct: 363 QLPWSKIPASVNQSPAHDALARRTARESLVLLKND-GLLPLSRATLKRIAVIGPTADDTM 421
Query: 464 QMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGCHDVPCNSD 504
+ G Y G P +P ++L+G+ A ++ YA G V +D
Sbjct: 422 ALLGNYYGTPAAPVTVLQGIRAAAPNAQVLYARGADLVEGRND 464
Score = 131 bits (330), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 92/282 (32%), Positives = 134/282 (47%), Gaps = 52/282 (18%)
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVART 556
EA+ +A+ AD V+ V GL E E+ DR L LP Q L+ ++ T
Sbjct: 628 LQEALDVARSADVVVFVGGLTGDVEGEEMKVSYPGFAGGDRTDLRLPKPQRELLEALQAT 687
Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
K PV+ VLT G L V +A+ + +IL YPG+ G A+A+ +FGD NPGGRLP+T
Sbjct: 688 GK-PVVAVLTAGSALAVDWAQ--QHVPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVT 744
Query: 617 WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
+Y ES T +D M GR+YR++ GT +Y FGHGLSYT ++Y S+L
Sbjct: 745 FYKESETLPAFDDYAMH--------GRTYRYFGGTPLYPFGHGLSYTQFAY------SDL 790
Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
+ D++ L G+ ++V N G G VV L+
Sbjct: 791 RL--------DRSTLTADGA-----------------LTATVAVKNTGQRAGDEVVQLYL 825
Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
K + K+L GF R+ +E+ F ++ + L I
Sbjct: 826 HPLKPQRERAGKELRGFQRLALQPGQQRELRFTINAKDALRI 867
>gi|188574621|ref|YP_001911550.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae PXO99A]
gi|188519073|gb|ACD57018.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae PXO99A]
Length = 904
Score = 285 bits (728), Expect = 9e-74, Method: Compositional matrix adjust.
Identities = 163/463 (35%), Positives = 247/463 (53%), Gaps = 45/463 (9%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
P+ T S RA L+S +TL+EK Q+ + A AIPRLG+PAY+WW+E+LHG+A G
Sbjct: 36 PYLQTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG-- 93
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN--LGQ------AGLTF 164
T FPQ + AA+F+ L + +A++ EARA ++ L Q GLTF
Sbjct: 94 --------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHRFLRQHQHARYQGLTF 145
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
W+PNINIFRDPRWGRGQET GEDP + + V FV+ QGE + G +R+
Sbjct: 146 WSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGSDAPKNAQGERYRK---- 201
Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIE 284
L A KH + + R+ F+A +++D +T+ P F + ++
Sbjct: 202 --------------LDATAKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVK 244
Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTH 343
GK +M +YN+V G A L Q R +WGFKGY+ SDC A+ ++++ T
Sbjct: 245 DGKVDAVMGAYNRVYGESASASKFLLQDVLRQQWGFKGYVVSDCWAIVDVWKHHKIVATR 304
Query: 344 EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPR 403
E +AA + G ++ CG +A+ +G + E ID AL L + ++RLG+F+ P
Sbjct: 305 EQAAALAVTHGTELECGE-EYSTLPAAVHQGLIDEAQIDTALQTLMTARMRLGMFD-PPG 362
Query: 404 KGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNIS 463
+ + K+ + H LA AR+ +VLLKND LPL++ + +A+IGP ++
Sbjct: 363 QLPWSKIPASVNQSPAHDALARRTARESLVLLKND-GLLPLSRATLKRIAVIGPTADDTM 421
Query: 464 QMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGCHDVPCNSD 504
+ G Y G P +P ++L+G+ A ++ YA G V +D
Sbjct: 422 ALLGNYYGTPAAPVTVLQGIRAAAPNAQVLYARGADLVEGRND 464
Score = 130 bits (328), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 91/282 (32%), Positives = 134/282 (47%), Gaps = 52/282 (18%)
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVART 556
EA+ +A+ AD V+ V GL E E+ DR L LP Q L+ ++ T
Sbjct: 628 LQEALDVARSADVVVFVGGLTGDVEGEEMKVSYPGFAGGDRTDLRLPKPQRELLEALQAT 687
Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
K PV+ VLT G L + +A+ + +IL YPG+ G A+A+ +FGD NPGGRLP+T
Sbjct: 688 GK-PVVAVLTAGSALAIDWAQ--QHVPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVT 744
Query: 617 WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
+Y ES T +D M GR+YR++ GT +Y FGHGLSYT ++Y S+L
Sbjct: 745 FYKESETLPAFDDYAMH--------GRTYRYFGGTPLYPFGHGLSYTQFAY------SDL 790
Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
+ D++ L G+ ++V N G G VV L+
Sbjct: 791 RL--------DRSTLTADGA-----------------LTATVAVKNTGQRAGDEVVQLYL 825
Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
K + K+L GF R+ +E+ F ++ + L I
Sbjct: 826 HPLKPQRERAGKELRGFQRLALQPGQQRELRFTINAKDALRI 867
>gi|60680320|ref|YP_210464.1| beta-glucosidase [Bacteroides fragilis NCTC 9343]
gi|60491754|emb|CAH06512.1| putative beta-glucosidase [Bacteroides fragilis NCTC 9343]
Length = 814
Score = 285 bits (728), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 233/738 (31%), Positives = 346/738 (46%), Gaps = 124/738 (16%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP + E HG + G T FP + A+++N L +G +A+
Sbjct: 149 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRQMGRVIAI 196
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
EA A Q + P +++ RDPRW R +ET GEDP + V+ FQGE
Sbjct: 197 EASA-----QGAHIGYGPVLDLARDPRWSRVEETYGEDPYLNGVMGTALVRGFQGETL-- 249
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
+DG K V+ A KH +Y W A I E+
Sbjct: 250 NDG--------KSVI----------------ATLKHFASYG---WTEGGHNGGTAHIGER 282
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC-LRGDLFQKARNEWGFKGYITSDCD 328
+ E+ PPFR + G A +M SYN+++G P R L ++ W FKG++ SD
Sbjct: 283 ELEEAIFPPFREAVGAG-ALSVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVSDLY 341
Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSAIDKGKVQEKDIDRALLN 387
AV + E+ +E +A + AG+D + GT + +A+ +G V ID+A+
Sbjct: 342 AVGGLREHGVAGNDYE-AAIKAVNAGVDSDLGTNVYAEQLVAAVKRGDVAVATIDKAVRR 400
Query: 388 LFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKN 447
+ S++ ++GLF+ DP + K V +SEH LA + ARQ IVLLKN K LPL K+
Sbjct: 401 ILSLKFQMGLFD-DPFVDE--KQAVQLVASSEHTGLAREVARQSIVLLKNKDKLLPLKKD 457
Query: 448 AVSSLAIIGPLVNNISQMGGGYTGIPC--SPKSLLRGLEAYVSKTH---YASGCHDVPCN 502
+ +LA+IGP +N+ M G YT + ++L G+ VSK YA GC V +
Sbjct: 458 -IRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGC-TVRDS 515
Query: 503 SDAGFHEAVRIAKKADFVIVVAG----LDLTQETEDR-------------------DRVS 539
S GF +A+ A+ AD V++V G D + E E+ DR +
Sbjct: 516 SRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMESGEGYDRAT 575
Query: 540 LLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKAL 599
L L G+Q+ L+ ++R K PV+LVL G PL + A +++ W YPG G A+
Sbjct: 576 LHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAEAIVDAW--YPGMQGGNAV 632
Query: 600 AEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHG 659
A+++FGD+NP GRL ++ P S ++P+ N R +R Y GT Y FG+G
Sbjct: 633 ADVLFGDYNPAGRLTLS-VPRSVGQLPVY-YNTRRKGNRS----RYIEEPGTPRYPFGYG 686
Query: 660 LSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQIS 719
LSYT +SY + + GSD C R V ++
Sbjct: 687 LSYTTFSYT--------DMKVQVTEGSD--------------------DC---RVDVTVT 715
Query: 720 VTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIA 779
+ N G DG V L+ R S TP KQL F R+H A S+E++F +D + L++
Sbjct: 716 IQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTLDK-KSLALY 774
Query: 780 NKHGRRILPLGNHVLMVG 797
+ G ++ G +MVG
Sbjct: 775 MQEGEWVVEPGRFTIMVG 792
>gi|423269263|ref|ZP_17248235.1| hypothetical protein HMPREF1079_01317 [Bacteroides fragilis
CL05T00C42]
gi|423273173|ref|ZP_17252120.1| hypothetical protein HMPREF1080_00773 [Bacteroides fragilis
CL05T12C13]
gi|392701685|gb|EIY94842.1| hypothetical protein HMPREF1079_01317 [Bacteroides fragilis
CL05T00C42]
gi|392708205|gb|EIZ01313.1| hypothetical protein HMPREF1080_00773 [Bacteroides fragilis
CL05T12C13]
Length = 805
Score = 285 bits (728), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 233/738 (31%), Positives = 346/738 (46%), Gaps = 124/738 (16%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP + E HG + G T FP + A+++N L +G +A+
Sbjct: 140 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRQMGRVIAI 187
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
EA A Q + P +++ RDPRW R +ET GEDP + V+ FQGE
Sbjct: 188 EASA-----QGAHIGYGPVLDLARDPRWSRVEETYGEDPYLNGVMGTALVRGFQGETL-- 240
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
+DG K V+ A KH +Y W A I E+
Sbjct: 241 NDG--------KSVI----------------ATLKHFASYG---WTEGGHNGGTAHIGER 273
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC-LRGDLFQKARNEWGFKGYITSDCD 328
+ E+ PPFR + G A +M SYN+++G P R L ++ W FKG++ SD
Sbjct: 274 ELEEAIFPPFREAVGAG-ALSVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVSDLY 332
Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSAIDKGKVQEKDIDRALLN 387
AV + E+ +E +A + AG+D + GT + +A+ +G V ID+A+
Sbjct: 333 AVGGLREHGVAGNDYE-AAIKAVNAGVDSDLGTNVYAEQLVAAVKRGDVAVATIDKAVRR 391
Query: 388 LFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKN 447
+ S++ ++GLF+ DP + K V +SEH LA + ARQ IVLLKN K LPL K+
Sbjct: 392 ILSLKFQMGLFD-DPFVDE--KQAVQLVASSEHTGLAREVARQSIVLLKNKDKLLPLKKD 448
Query: 448 AVSSLAIIGPLVNNISQMGGGYTGIPC--SPKSLLRGLEAYVSKTH---YASGCHDVPCN 502
+ +LA+IGP +N+ M G YT + ++L G+ VSK YA GC V +
Sbjct: 449 -IRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCA-VRDS 506
Query: 503 SDAGFHEAVRIAKKADFVIVVAG----LDLTQETEDR-------------------DRVS 539
S GF +A+ A+ AD V++V G D + E E+ DR +
Sbjct: 507 SRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMESGEGYDRAT 566
Query: 540 LLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKAL 599
L L G+Q+ L+ ++R K PV+LVL G PL + A +++ W YPG G A+
Sbjct: 567 LHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAEAIVDAW--YPGMQGGNAV 623
Query: 600 AEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHG 659
A+++FGD+NP GRL ++ P S ++P+ N R +R Y GT Y FG+G
Sbjct: 624 ADVLFGDYNPAGRLTLS-VPRSVGQLPVY-YNTRRKGNRS----RYIEEPGTPRYPFGYG 677
Query: 660 LSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQIS 719
LSYT +SY + + GSD C R V ++
Sbjct: 678 LSYTTFSYT--------DMKVQVTEGSD--------------------DC---RVDVTVT 706
Query: 720 VTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIA 779
+ N G DG V L+ R S TP KQL F R+H A S+E++F +D + L++
Sbjct: 707 IQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTLDK-KSLALY 765
Query: 780 NKHGRRILPLGNHVLMVG 797
+ G ++ G +MVG
Sbjct: 766 MQEGEWVVEPGRFTIMVG 783
>gi|336408356|ref|ZP_08588849.1| hypothetical protein HMPREF1018_00864 [Bacteroides sp. 2_1_56FAA]
gi|335937834|gb|EGM99730.1| hypothetical protein HMPREF1018_00864 [Bacteroides sp. 2_1_56FAA]
Length = 805
Score = 284 bits (727), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 233/738 (31%), Positives = 346/738 (46%), Gaps = 124/738 (16%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP + E HG + G T FP + A+++N L +G +A+
Sbjct: 140 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRQMGRVIAI 187
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
EA A Q + P +++ RDPRW R +ET GEDP + V+ FQGE
Sbjct: 188 EASA-----QGAHIGYGPVLDLARDPRWSRVEETYGEDPYLNGVMGTALVRGFQGETL-- 240
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
+DG K V+ A KH +Y W A I E+
Sbjct: 241 NDG--------KSVI----------------ATLKHFASYG---WTEGGHNGGTAHIGER 273
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC-LRGDLFQKARNEWGFKGYITSDCD 328
+ E+ PPFR + G A +M SYN+++G P R L ++ W FKG++ SD
Sbjct: 274 ELEEAIFPPFREAVGAG-ALSVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVSDLY 332
Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSAIDKGKVQEKDIDRALLN 387
AV + E+ +E +A + AG+D + GT + +A+ +G V ID+A+
Sbjct: 333 AVGGLREHGVAGNDYE-AAIKAVNAGVDSDLGTNVYAEQLVAAVKRGDVAVATIDKAVRR 391
Query: 388 LFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKN 447
+ S++ ++GLF+ DP + K V +SEH LA + ARQ IVLLKN K LPL K+
Sbjct: 392 ILSLKFQMGLFD-DPFVDE--KQAVQLVASSEHTGLAREVARQSIVLLKNKDKLLPLKKD 448
Query: 448 AVSSLAIIGPLVNNISQMGGGYTGIPC--SPKSLLRGLEAYVSKTH---YASGCHDVPCN 502
+ +LA+IGP +N+ M G YT + ++L G+ VSK YA GC V +
Sbjct: 449 -IRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCA-VRDS 506
Query: 503 SDAGFHEAVRIAKKADFVIVVAG----LDLTQETEDR-------------------DRVS 539
S GF +A+ A+ AD V++V G D + E E+ DR +
Sbjct: 507 SRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMESGEGYDRAT 566
Query: 540 LLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKAL 599
L L G+Q+ L+ ++R K PV+LVL G PL + A +++ W YPG G A+
Sbjct: 567 LHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAEAIVDAW--YPGMQGGNAV 623
Query: 600 AEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHG 659
A+++FGD+NP GRL ++ P S ++P+ N R +R Y GT Y FG+G
Sbjct: 624 ADVLFGDYNPAGRLTLS-VPRSVGQLPVY-YNTRRKGNRS----RYIEEPGTPRYPFGYG 677
Query: 660 LSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQIS 719
LSYT +SY + + GSD C R V ++
Sbjct: 678 LSYTTFSYT--------DMKVQVTEGSD--------------------DC---RVDVTVT 706
Query: 720 VTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIA 779
+ N G DG V L+ R S TP KQL F R+H A S+E++F +D + L++
Sbjct: 707 IQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAAESREVTFTLDK-KSLALY 765
Query: 780 NKHGRRILPLGNHVLMVG 797
+ G ++ G +MVG
Sbjct: 766 MQEGEWVVEPGRFTIMVG 783
>gi|375357172|ref|YP_005109944.1| putative beta-glucosidase [Bacteroides fragilis 638R]
gi|301161853|emb|CBW21397.1| putative beta-glucosidase [Bacteroides fragilis 638R]
Length = 814
Score = 284 bits (727), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 233/738 (31%), Positives = 346/738 (46%), Gaps = 124/738 (16%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP + E HG + G T FP + A+++N L +G +A+
Sbjct: 149 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRQMGRVIAI 196
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
EA A Q + P +++ RDPRW R +ET GEDP + V+ FQGE
Sbjct: 197 EASA-----QGAHIGYGPVLDLARDPRWSRVEETYGEDPYLNGVMGTALVRGFQGETL-- 249
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
+DG K V+ A KH +Y W A I E+
Sbjct: 250 NDG--------KSVI----------------ATLKHFASYG---WTEGGHNGGTAHIGER 282
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC-LRGDLFQKARNEWGFKGYITSDCD 328
+ E+ PPFR + G A +M SYN+++G P R L ++ W FKG++ SD
Sbjct: 283 ELEEAIFPPFREAVGAG-ALSVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVSDLY 341
Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSAIDKGKVQEKDIDRALLN 387
AV + E+ +E +A + AG+D + GT + +A+ +G V ID+A+
Sbjct: 342 AVGGLREHGVAGNDYE-AAIKAVNAGVDSDLGTNVYAEQLVAAVKRGDVAVATIDKAVRR 400
Query: 388 LFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKN 447
+ S++ ++GLF+ DP + K V +SEH LA + ARQ IVLLKN K LPL K+
Sbjct: 401 ILSLKFQMGLFD-DPFVDE--KQAVQLVASSEHTGLAREVARQSIVLLKNKDKLLPLKKD 457
Query: 448 AVSSLAIIGPLVNNISQMGGGYTGIPC--SPKSLLRGLEAYVSKTH---YASGCHDVPCN 502
+ +LA+IGP +N+ M G YT + ++L G+ VSK YA GC V +
Sbjct: 458 -IRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCA-VRDS 515
Query: 503 SDAGFHEAVRIAKKADFVIVVAG----LDLTQETEDR-------------------DRVS 539
S GF +A+ A+ AD V++V G D + E E+ DR +
Sbjct: 516 SRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMESGEGYDRAT 575
Query: 540 LLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKAL 599
L L G+Q+ L+ ++R K PV+LVL G PL + A +++ W YPG G A+
Sbjct: 576 LHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAEAIVDAW--YPGMQGGNAV 632
Query: 600 AEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHG 659
A+++FGD+NP GRL ++ P S ++P+ N R +R Y GT Y FG+G
Sbjct: 633 ADVLFGDYNPAGRLTLS-VPRSVGQLPVY-YNTRRKGNRS----RYIEEPGTPRYPFGYG 686
Query: 660 LSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQIS 719
LSYT +SY + + GSD C R V ++
Sbjct: 687 LSYTTFSYT--------DMKVQVTEGSD--------------------DC---RVDVTVT 715
Query: 720 VTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIA 779
+ N G DG V L+ R S TP KQL F R+H A S+E++F +D + L++
Sbjct: 716 IQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTLDK-KSLALY 774
Query: 780 NKHGRRILPLGNHVLMVG 797
+ G ++ G +MVG
Sbjct: 775 MQEGEWVVEPGRFTIMVG 792
>gi|325918730|ref|ZP_08180824.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
ATCC 35937]
gi|325535054|gb|EGD06956.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
ATCC 35937]
Length = 391
Score = 284 bits (726), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 156/392 (39%), Positives = 219/392 (55%), Gaps = 52/392 (13%)
Query: 64 RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
RA +L++ ++ EK+ Q ++A AIPRL IPAYEWWSE LHGIA NG T
Sbjct: 35 RAAALVAQMSRDEKVAQAMNDAPAIPRLDIPAYEWWSEGLHGIARNG----------YAT 84
Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTFWAPNINIFRD 174
FPQ + AAS+N +L +G+ V+ EARA +N AGLT W+PNINIFRD
Sbjct: 85 VFPQAIGLAASWNTALMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFRD 144
Query: 175 PRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRG 234
PRWGRG ET GEDP + AV F++ QG++ R +
Sbjct: 145 PRWGRGMETYGEDPFLTGQLAVGFIRGLQGDD-----------LNHPRTI---------- 183
Query: 235 DELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCS 294
A KH+ + + G R+ F+ ++ +D E T+ P FR+ + G+A +MC+
Sbjct: 184 ------ATPKHIAVHSGPEPG---RHGFDVDVSPRDMEATYTPAFRAALVDGQAWSVMCA 234
Query: 295 YNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKA 353
YN ++G PAC L + R +WGFKG++ SDCDAV + ++ + + S+A LKA
Sbjct: 235 YNSLHGTPACAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKA 294
Query: 354 GMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPD 413
G D+NCG R +AI++G+V E +D++L+ LF+ + RLG PRK Y +LG
Sbjct: 295 GHDLNCGHA-YRELGTAIERGEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAK 352
Query: 414 DVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
DV + H+ LAL AA + IVLLKN LPL
Sbjct: 353 DVDNAAHRALALQAAAESIVLLKNTATTLPLK 384
>gi|323451833|gb|EGB07709.1| hypothetical protein AURANDRAFT_64764 [Aureococcus anophagefferens]
Length = 819
Score = 284 bits (726), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 229/763 (30%), Positives = 346/763 (45%), Gaps = 132/763 (17%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ + SL + R L + L++ I QL + A A+ + +PAY W +++ HG+
Sbjct: 71 YLDASLPEADRLAWLADNVPLEDMIGQLVNAAPAVDAVDLPAYNWLNDNEHGV------- 123
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN-----LGQA-------- 160
GT + T +P AS++ L +G+A+ E+RA +N G A
Sbjct: 124 --KGTAHA-TVYPMGASLGASWSVDLAWRVGAAIGNESRATHNGLADKSGNACGSTSTGE 180
Query: 161 ------GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGI 214
G+T +APN+N+ RDPRWGR +E GEDP + + AV V QG
Sbjct: 181 VVANGCGITLYAPNVNLVRDPRWGRAEEVYGEDPHLTAELAVGMVTGLQGN--------- 231
Query: 215 GFGFREKRVLKGFGEESDRGDELMLSACCKHLIAY-------DLEKWGNFSRYSFNAMIT 267
G L+ ACCKH A+ DL R +A ++
Sbjct: 232 ----------AEGSTSGPGGGPLVTGACCKHFAAHFAVYQNEDLPA----DRMVLDANVS 277
Query: 268 EQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSD 326
+D +T+ P ++C+ + KA+ VNG P C +L R WGF G++ SD
Sbjct: 278 SRDLWETYLPVMKACVVRAKAT-------HVNGKPTCAHPELLNDVLRESWGFDGFVVSD 330
Query: 327 CDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG---TCMLRHTQSAIDKGKVQEKDIDR 383
DA + + Y T E++AA + AGMD G + A+ G V + R
Sbjct: 331 YDAWSNLVTTHKYVSTWEEAAAAGINAGMDQEGGFGDYSPVDALPDAVRNGTVAAATVRR 390
Query: 384 ALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLAL--DAARQGIVLLKNDKKF 441
+ L V+LRLG+F+ YG+ D KLAL +AAR+GIVL KN
Sbjct: 391 SFERLMRVRLRLGMFDPPASTAVYGEAYQCDYQCETAAKLALAREAAREGIVLFKNAGGA 450
Query: 442 LPLNKNAVSSLAIIGPLVNNISQMGGGYT-----GIPCSPKSLLRGLEAYVSKTHYASGC 496
LPL K A +A++GP V++ + G G +P ++ +GLEA V+ A+GC
Sbjct: 451 LPLAKGA--RIALVGPQVDDWRVLLGAVNYAFEDGPDVAPVTIQKGLEA-VANVSVAAGC 507
Query: 497 HDVPCNSDAGFHEAVRIAKK--------ADFVIVVAGLDLTQ-------ETEDRDRVSLL 541
V C + A R+A D G L + E+E DR ++
Sbjct: 508 DSVACAALVDVDGAKRLAAAADATVVVLGDSFGATDGWPLCRGTRDDGCESESHDRATIE 567
Query: 542 LPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAE 601
LPG+Q++LV ++ S R ++ VL GG + + A D LW+ PG+ G ALA+
Sbjct: 568 LPGEQVALVAALRAASSR-LVCVLVHGGAVALGAAADDCDAVLDLWV--PGQMGGAALAD 624
Query: 602 IIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTG-TQVYGFGHGL 660
++FGD++P GR P+T Y + PM + A S G +YR+Y G Y FG GL
Sbjct: 625 VLFGDYSPAGRSPITMYAATSDLPPMGVFDEYAGESSN--GTTYRYYAGPAPTYAFGDGL 682
Query: 661 SYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISV 720
SY ++SY + +AP V +C ++R +++V
Sbjct: 683 SYASFSYAWAAAPPT-----------------------------TVDACGAIRL--RVAV 711
Query: 721 TNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGS 763
TN G V VV ++ARVP + P +L+ FDRV +A G+
Sbjct: 712 TNTGSVASDEVVQVYARVPDATVPAPAIRLVAFDRVRAIAPGA 754
>gi|433677589|ref|ZP_20509555.1| beta-glucosidase [Xanthomonas translucens pv. translucens DSM
18974]
gi|430817300|emb|CCP39963.1| beta-glucosidase [Xanthomonas translucens pv. translucens DSM
18974]
Length = 913
Score = 284 bits (726), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 160/462 (34%), Positives = 246/462 (53%), Gaps = 45/462 (9%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ +T S RA L++ +TL+EK Q+ + A AIPRLG+PAY+WW+E+LHG+A G
Sbjct: 37 YLDTQRSFEQRAADLVARMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 93
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG--------QAGLTFW 165
T FPQ + AA+F+ L + +A++ EARA ++ GLTFW
Sbjct: 94 -------GATVFPQAIGMAATFDLPLMHEVSTAISDEARAKHHEALRHDQHARYQGLTFW 146
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
+PNINIFRDPRWGRGQET GEDP + + V FV+ QGE+ G +R+
Sbjct: 147 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEDVDVPKNAQGEAYRK----- 201
Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
L A KH + + R+ F+A +++D +T+ P F + +++
Sbjct: 202 -------------LDATAKHFAVHSGPE---ADRHHFDAHPSQRDLYETYLPAFEALVKE 245
Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
GK +M +YN+V G A L + R+ WGF GY+ SDC A+ I++ T E
Sbjct: 246 GKVDAVMGAYNRVYGESASASKFLLRDVLRDRWGFDGYVVSDCWAIVDIWKNHKIVATRE 305
Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
++AA +K G ++ CG +A+ KG + E D+D AL L ++RLG+F+ P K
Sbjct: 306 EAAALAVKHGTELECG-AEYSTLPTAVRKGLISEADVDNALQKLMYSRMRLGMFD-PPEK 363
Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
+ ++ + EH LA AR+ +VLLKND LPL++ + +A++GP ++
Sbjct: 364 LAWAQIPLSANQSPEHDALARRTARESLVLLKND-GVLPLSRAKIKRIAVVGPTADDTMA 422
Query: 465 MGGGYTGIPCSPKSLLRGLE--AYVSKTHYASGCHDVPCNSD 504
+ G Y G P +P ++L+G+ A ++ YA G V D
Sbjct: 423 LLGNYYGTPAAPVTVLQGIREAAPDAEVLYARGADLVEGRDD 464
Score = 140 bits (352), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 96/302 (31%), Positives = 147/302 (48%), Gaps = 53/302 (17%)
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVART 556
+A+ A++AD V+ V GL E E+ DR L LP Q +L+ ++ T
Sbjct: 628 LQDALDAARRADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRALLEALHGT 687
Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
K PV+ VLT G L + +A+ + +IL YPG+ G A+A+++FGD NPGGRLP+T
Sbjct: 688 GK-PVVAVLTTGSALAIDWAQ--QHVPAILLAWYPGQRGGSAVADVLFGDANPGGRLPVT 744
Query: 617 WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
+Y ES T +D MR GR+YR++ GT +Y FGHGLSYT ++Y S+L
Sbjct: 745 FYKESETLPAFDDYAMR--------GRTYRYFAGTALYPFGHGLSYTQFAY------SDL 790
Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
+ D++ L G R H + V N G G VV L+
Sbjct: 791 RL--------DRSKLAADG-----------------RLHATLKVKNTGQRAGDEVVQLYL 825
Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRR-ILPLGNHVLM 795
+ + K L GF R+ ++E+ F + P L + ++ + ++ G++ L
Sbjct: 826 QPLSPQRERASKDLRGFQRIALQPGETREVRFAISPQSDLRLYDEARKAYVVDPGDYELQ 885
Query: 796 VG 797
VG
Sbjct: 886 VG 887
>gi|395803818|ref|ZP_10483061.1| glycoside hydrolase family 3 protein [Flavobacterium sp. F52]
gi|395434089|gb|EJG00040.1| glycoside hydrolase family 3 protein [Flavobacterium sp. F52]
Length = 875
Score = 284 bits (726), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 162/448 (36%), Positives = 236/448 (52%), Gaps = 53/448 (11%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
+PF NT L+ R ++L+S LTL+EK+ Q+ + A AIPRLGIPAY+WW+E+LHG+A
Sbjct: 27 FPFQNTDLTFEERVENLVSQLTLEEKVAQMLNAAPAIPRLGIPAYDWWNETLHGVAR--- 83
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGL 162
T T FPQ + AA+F+++ + A+E RA+YN GL
Sbjct: 84 ------TPFKTTVFPQAIAMAATFDKNSLFKMADYSALEGRAIYNKAVELNRTKERYLGL 137
Query: 163 TFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKR 222
T+W PNINIFRDPRWGRGQET GEDP + + FVK QG++ K
Sbjct: 138 TYWTPNINIFRDPRWGRGQETYGEDPYLTAVLGDAFVKGLQGDDPKY------------- 184
Query: 223 VLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSC 282
L +AC KH Y + R++F+ +T + DT+ P F+
Sbjct: 185 --------------LKAAACAKH---YAVHSGPESLRHTFDVDVTPYELWDTYLPAFKKL 227
Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTK 341
+ K + +MC+YN P C L RN+W F GY+TSDC A+ F+
Sbjct: 228 VTNSKVAGVMCAYNAFRTQPCCASDILMNDILRNQWKFTGYVTSDCWAIDDFFKNHKTHP 287
Query: 342 THEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
++A + G DI+CGT + A+ G++ EK ID ++ LF ++ RLG+F D
Sbjct: 288 DAASASADAVLHGTDIDCGTDAYKSLVQAVKNGQITEKQIDVSVKRLFMIRFRLGMF--D 345
Query: 402 P-RKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVN 460
P KY + + + EHK+ AL ARQ IVLLKN+K LPL+K + + ++GP +
Sbjct: 346 PVSMVKYAQTPSSVLESEEHKEHALKMARQSIVLLKNEKNTLPLSKK-LKKIVVLGPNAD 404
Query: 461 NISQMGGGYTGIPCSPKSLLRGLEAYVS 488
N + G Y G P ++L+G++ +S
Sbjct: 405 NSISILGNYNGTPSKLTTVLQGIKEKIS 432
Score = 113 bits (282), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 84/287 (29%), Positives = 133/287 (46%), Gaps = 54/287 (18%)
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVART 556
F + K AD I G+ E E+ DR S+LLP Q L+ ++ ++
Sbjct: 596 FANLIERHKNADAFIFAGGISPQLEGEEMPVDAPGFNGGDRTSILLPEVQTRLLKAL-QS 654
Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
S +PV+ ++ G + V + EA++ I +IL I Y G++ A A++IFGD+NP GRLP+T
Sbjct: 655 SGKPVVFLIMTGSAIAVPW-EAEN-IPAILNIWYGGQSAGTASADVIFGDYNPAGRLPVT 712
Query: 617 WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
+Y D ++ + + ++YR++ G +YGFG+GLSYT + Y L P +
Sbjct: 713 FY--------KGDSDLSSFVDYKMDNKTYRYFKGIPLYGFGYGLSYTEFKYSGLKTPDK- 763
Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
+K G I + VTN G ++G V L+
Sbjct: 764 -----IKKGQPVTI--------------------------SVKVTNTGKMEGEEVAQLYL 792
Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHG 783
P S +P K L GF+R + S ++F + P E LS + G
Sbjct: 793 INPNTSIKSPLKSLKGFERFNLKPGQSTVVNFTLSP-EDLSYVTESG 838
>gi|189467437|ref|ZP_03016222.1| hypothetical protein BACINT_03826 [Bacteroides intestinalis DSM
17393]
gi|189435701|gb|EDV04686.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 863
Score = 284 bits (726), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 169/446 (37%), Positives = 230/446 (51%), Gaps = 55/446 (12%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
F N L I R L+ LTL+EKI Q+ +NA AI RLGIPAY WW+E LHG+A
Sbjct: 26 FLNPELPIVERVNDLVGRLTLEEKISQMLNNAPAIDRLGIPAYNWWNECLHGVAR----- 80
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
+ VTSFPQ + AA+++ + + E RA+Y+ GLT+W
Sbjct: 81 ----SPYPVTSFPQAIAMAATWDTESVHQMAVYASDEGRAIYHDATRKGTPGIFRGLTYW 136
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
+PNINIFRDPRWGRGQET GEDP + ++ V FVK QG
Sbjct: 137 SPNINIFRDPRWGRGQETYGEDPFLTASIGVSFVKGLQG--------------------- 175
Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
D L SAC KH + +W +R++++A + D DT+ P F+ + +
Sbjct: 176 ------DDPVYLKSSACAKHYAVHSGPEW---NRHTYDAKVNNHDLWDTYLPAFKELVVE 226
Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
GK + +MC+YN G P C L RN W F GY+TSDC AV ++ N KTH+
Sbjct: 227 GKVTGVMCAYNSFFGQPCCGNDLLMMDILRNHWKFGGYVTSDCGAVE---DFYNTHKTHQ 283
Query: 345 DSAAGVLKA---GMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
D+AA A G D CG R A+ +G + EK ID +L LF ++ RLG+F+ D
Sbjct: 284 DAAAASADAVLHGTDCECGNGAYRALADAVLRGLITEKQIDESLKKLFEIRFRLGMFDPD 343
Query: 402 PRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
R Y + + HK AL ARQ IVLLKN + LPLNKN + +A++GP ++
Sbjct: 344 DRV-PYSNIPLSVLECDAHKAHALKIARQSIVLLKNQDQLLPLNKNKIKKIAVVGPNADD 402
Query: 462 ISQMGGGYTGIPCSPKSLLRGLEAYV 487
S + Y G P + L G++ V
Sbjct: 403 KSVLLANYYGYPSHITTALEGIQKKV 428
Score = 119 bits (299), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 91/311 (29%), Positives = 143/311 (45%), Gaps = 56/311 (18%)
Query: 486 YVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR---------- 535
YV A D+ A + + K AD +I V GL E E+
Sbjct: 568 YVQHADNAEIKFDMGILRKADYKQTAAAVKDADVIIFVGGLSAKVEGEEMGVEIEGFKRG 627
Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
DR S+ +P Q +L+ + T K PV+ V+ G L + + A + +IL Y G+AG
Sbjct: 628 DRTSISIPSVQQNLLKELYATGK-PVVFVMMTGSALGLEWESA--HLPAILNAWYGGQAG 684
Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
+A+A+++FGD+NP GRLP+T+Y +ND+ D S + R+YR++TGT VY
Sbjct: 685 GQAIADVLFGDYNPSGRLPLTFYKS------VNDLPDFEDYSME--NRTYRYFTGTPVYP 736
Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
FG+GLSYT + Y S L+ DK ++ T
Sbjct: 737 FGYGLSYTTFQYS----------SLKLQPSPDKRSVKVTA-------------------- 766
Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
+TN G ++G V L+ P+ TP + L GF R++ S+ + F V ++
Sbjct: 767 ---KITNTGKMEGEEVAQLYVSNPR-DFVTPIRALKGFKRINLKPGESQTVEF-VLTSKE 821
Query: 776 LSIANKHGRRI 786
LS+ + G+ +
Sbjct: 822 LSVVDISGKSV 832
>gi|423281958|ref|ZP_17260843.1| hypothetical protein HMPREF1204_00381 [Bacteroides fragilis HMW
615]
gi|404582445|gb|EKA87139.1| hypothetical protein HMPREF1204_00381 [Bacteroides fragilis HMW
615]
Length = 805
Score = 283 bits (725), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 233/738 (31%), Positives = 346/738 (46%), Gaps = 124/738 (16%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP + E HG + G T FP + A+++N L +G +A+
Sbjct: 140 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRQMGRVIAI 187
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
EA A Q + P +++ RDPRW R +ET GEDP + V+ FQGE
Sbjct: 188 EASA-----QGAHIGYGPVLDLARDPRWSRVEETYGEDPYLNGVMGTALVRGFQGETL-- 240
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
+DG K V+ A KH +Y W A I E+
Sbjct: 241 NDG--------KSVI----------------ATLKHFASYG---WTEGGHNGGTAHIGER 273
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC-LRGDLFQKARNEWGFKGYITSDCD 328
+ E+ PPFR + G A +M SYN+++G P R L ++ W FKG++ SD
Sbjct: 274 ELEEAIFPPFREAVGAG-ALSVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVSDLY 332
Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSAIDKGKVQEKDIDRALLN 387
AV + E+ +E +A + AG+D + GT + +A+ +G V ID+A+
Sbjct: 333 AVGGLREHGVAGNDYE-AAIKAVNAGVDSDLGTNVYAEQLVAAVKRGDVAVATIDKAVRR 391
Query: 388 LFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKN 447
+ S++ ++GLF+ DP + K V +SEH LA + ARQ IVLLKN K LPL K+
Sbjct: 392 ILSLKFQMGLFD-DPFVDE--KQAAQLVASSEHTGLAREVARQSIVLLKNKDKLLPLKKD 448
Query: 448 AVSSLAIIGPLVNNISQMGGGYTGIPC--SPKSLLRGLEAYVSKTH---YASGCHDVPCN 502
+ +LA+IGP +N+ M G YT + ++L G+ VSK YA GC V +
Sbjct: 449 -IRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCA-VRDS 506
Query: 503 SDAGFHEAVRIAKKADFVIVVAG----LDLTQETEDR-------------------DRVS 539
S GF +A+ A+ AD V++V G D + E E+ DR +
Sbjct: 507 SRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMESGEGYDRAT 566
Query: 540 LLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKAL 599
L L G+Q+ L+ ++R K PV+LVL G PL + A +++ W YPG G A+
Sbjct: 567 LHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAEAIVDAW--YPGMQGGNAV 623
Query: 600 AEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHG 659
A+++FGD+NP GRL ++ P S ++P+ N R +R Y GT Y FG+G
Sbjct: 624 ADVLFGDYNPAGRLTLS-VPRSVGQLPVY-YNTRRKGNRS----RYIEEPGTPRYPFGYG 677
Query: 660 LSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQIS 719
LSYT +SY + + GSD C R V ++
Sbjct: 678 LSYTTFSYT--------DMKVQVTEGSD--------------------DC---RVDVTVT 706
Query: 720 VTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIA 779
+ N G DG V L+ R S TP KQL F R+H A S+E++F +D + L++
Sbjct: 707 IQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTLDK-KSLALY 765
Query: 780 NKHGRRILPLGNHVLMVG 797
+ G ++ G +MVG
Sbjct: 766 MQEGEWVVEPGLFTIMVG 783
>gi|121308314|dbj|BAF43576.1| arabinofuranosidase/xylosidase homolog [Prunus persica]
Length = 349
Score = 283 bits (725), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 148/346 (42%), Positives = 215/346 (62%), Gaps = 12/346 (3%)
Query: 465 MGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVA 524
M G Y G+ C + L+G+ Y H A GC DV CN + F A A++AD ++V
Sbjct: 6 MIGNYAGVACGYTTPLQGIGRYTRTIHQA-GCTDVHCNGNQLFGAAEAAARQADATVLVM 64
Query: 525 GLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISS 584
GLD + E E DR LLLPG Q LV+ VAR S+ P ILVL GGP+DV+FA+ D +IS+
Sbjct: 65 GLDQSIEAEFVDRAGLLLPGHQQELVSRVARASRGPTILVLMSGGPIDVTFAKNDPRISA 124
Query: 585 ILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGR 643
I+W+GYPG+AG A+A+++FG NPGG+LPMTWYP+++ T +PM DM MRAD +R YPGR
Sbjct: 125 IIWVGYPGQAGGTAIADVLFGTTNPGGKLPMTWYPQNYVTHLPMTDMAMRADPARGYPGR 184
Query: 644 SYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS-ASLKAGSDKNILQQTGSRLDYVH 702
+YRFY G V+ FG GLSYT +++ P+ +++ SLKA ++ +L + + H
Sbjct: 185 TYRFYRGPVVFPFGLGLSYTTFAHNLAHGPTLVSVPLTSLKATANSTMLSKA---VRVSH 241
Query: 703 IDEVTSCTSLR-FHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAK 761
D C +L V + V N G +DG+H +++F P + KQL+GF ++H A
Sbjct: 242 AD----CNALSPLDVHVDVKNTGSMDGTHTLLVFTSPPD-GKWASSKQLMGFHKIHIAAG 296
Query: 762 GSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIET 807
K + V C+ LS+ ++ G R +PLG H L +G+L H ++++T
Sbjct: 297 SEKRVRIAVHVCKHLSVVDRFGIRRIPLGEHKLQIGDLSHHVSLQT 342
>gi|384421334|ref|YP_005630694.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzicola
BLS256]
gi|353464247|gb|AEQ98526.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzicola
BLS256]
Length = 904
Score = 283 bits (725), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 162/463 (34%), Positives = 248/463 (53%), Gaps = 45/463 (9%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
P+ +T+ S RA L+S +TL+EK Q+ + A AIPRL +PAY+WW+E+LHG+A G
Sbjct: 36 PYLDTARSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWWNEALHGVARAG-- 93
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN--LGQ------AGLTF 164
T FPQ + AA+F+ L + +A++ EARA ++ L Q GLTF
Sbjct: 94 --------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHARYQGLTF 145
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
W+PNINIFRDPRWGRGQET GEDP + + V FV+ QGE + G +R+
Sbjct: 146 WSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGAAAPKNAQGEPYRK---- 201
Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIE 284
L A KH + + R+ F+A +++D +T+ P F + ++
Sbjct: 202 --------------LDATAKHFAVHSGPE---AERHHFDARPSQRDLYETYLPAFEALVK 244
Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTH 343
GK +M +YN+V G A L Q R +WGFKGY+ SDC A+ ++++ T
Sbjct: 245 DGKVDAVMGAYNRVYGESASASKFLLQDVLRQQWGFKGYVVSDCWAIVDVWKHHKIVATR 304
Query: 344 EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPR 403
E +AA + G ++ CG +A+ +G + E ID AL L + ++RLG+F+ P
Sbjct: 305 EQAAALAVTHGTELECGE-EYSTLPAAVHQGLIDEAQIDTALQTLMTARMRLGMFD-PPG 362
Query: 404 KGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNIS 463
+ + K+ + H LA AR+ +VLLKND LPL++ + +A+IGP ++
Sbjct: 363 QLPWSKIPASVNQSPAHDALARRTARESLVLLKND-GLLPLSRATLKRIAVIGPTADDTM 421
Query: 464 QMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGCHDVPCNSD 504
+ G Y G P +P ++L+G+ A ++ YA G V +D
Sbjct: 422 ALLGNYYGTPAAPVTVLQGIRAAAPNAQVLYARGADLVEGRND 464
Score = 132 bits (331), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 91/282 (32%), Positives = 134/282 (47%), Gaps = 52/282 (18%)
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVART 556
EA+ +A+ AD V+ V GL E E+ DR L LP Q L+ ++ T
Sbjct: 628 LQEALDVARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRELLEALQAT 687
Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
K PV+ VLT G L + +A+ + +IL YPG+ G A+A+ +FGD NPGGRLP+T
Sbjct: 688 GK-PVVAVLTAGSALAIDWAQ--QHVPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVT 744
Query: 617 WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
+Y ES T +D M GR+YR++ GT +Y FGHGLSYT ++Y S+L
Sbjct: 745 FYKESETLPAFDDYTMH--------GRTYRYFGGTPLYPFGHGLSYTQFAY------SDL 790
Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
+ D++ L G+ ++V N G G VV L+
Sbjct: 791 RL--------DRSTLTADGA-----------------LTATVAVKNTGQRAGDEVVQLYL 825
Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
K + K+L GF R+ +E+ F ++ + L I
Sbjct: 826 HPLKPQRERAGKELRGFQRLALQPGEQRELRFTINATDALRI 867
>gi|346726970|ref|YP_004853639.1| beta-glucosidase-related glycosidase [Xanthomonas axonopodis pv.
citrumelo F1]
gi|346651717|gb|AEO44341.1| Beta-glucosidase-related glycosidase [Xanthomonas axonopodis pv.
citrumelo F1]
Length = 902
Score = 283 bits (725), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 163/462 (35%), Positives = 246/462 (53%), Gaps = 45/462 (9%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ +T S RA L+S +TL+EK Q+ + A AIPRLG+PAY+WW+E+LHG+A G
Sbjct: 35 YLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 91
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN--LGQ------AGLTFW 165
T FPQ + AA+F+ L + +A++ EARA ++ L Q GLTFW
Sbjct: 92 -------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHARYQGLTFW 144
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
+PNINIFRDPRWGRGQET GEDP + + V FV+ QGE + G +R+
Sbjct: 145 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGADAPKNAQGEPYRK----- 199
Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
L A KH + + R+ F+A +++D +T+ P F + ++
Sbjct: 200 -------------LDATAKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVKD 243
Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
GK +M +YN+V G A L Q R +WGFKGY+ SDC A+ I+++ T E
Sbjct: 244 GKVDAVMGAYNRVYGESASASKFLLQDVLRQQWGFKGYVVSDCWAIVDIWKHHKIVATRE 303
Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
+AA +K G ++ CG +A+ +G + E ID AL L + ++RLG+F+ P +
Sbjct: 304 QAAALAVKHGTELECGE-EYSTLPAAVHQGLIDEAQIDTALKTLMTARMRLGMFD-PPGQ 361
Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
+ + + H LA AR+ +VLLKND LPL++ + +A+IGP ++
Sbjct: 362 LPWSTIPASVNQSPAHDALARRTARESLVLLKND-GLLPLSRAKLKRIAVIGPTADDTMA 420
Query: 465 MGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGCHDVPCNSD 504
+ G Y G P +P ++L+G+ A ++ YA G V D
Sbjct: 421 LLGNYYGTPAAPVTVLQGIRAAAPNAQVLYARGADLVEGRDD 462
Score = 131 bits (329), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 93/282 (32%), Positives = 131/282 (46%), Gaps = 52/282 (18%)
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVART 556
EA+ +A+ AD V+ V GL E E+ DR L LP Q L+ ++ T
Sbjct: 626 LQEALDVARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEALQAT 685
Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
K PV+ VLT G L + +A+ + +IL YPG+ G A+A+ +FGD NPGGRLP+T
Sbjct: 686 GK-PVVAVLTTGSALAIDWAQ--QHLPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVT 742
Query: 617 WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
+Y ES T +D MR GR+YR++ GT +Y FGHGLSYT ++Y
Sbjct: 743 FYKESETLPAFDDYAMR--------GRTYRYFGGTPLYPFGHGLSYTQFAY--------- 785
Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
+G RLD I S T+ ++V N G G VV L+
Sbjct: 786 -----------------SGLRLDRTTIAADGSLTAT-----VTVKNTGQRAGDEVVQLYL 823
Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
+ K+L GF R+ + + F +D L I
Sbjct: 824 HPLTPQRERAGKELHGFQRIALQPGEQRALHFTLDAKNALRI 865
>gi|326427096|gb|EGD72666.1| hypothetical protein PTSG_04397 [Salpingoeca sp. ATCC 50818]
Length = 614
Score = 283 bits (725), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 198/607 (32%), Positives = 297/607 (48%), Gaps = 84/607 (13%)
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
+PNINI RDPRWGR QE P EDP++ + + Q
Sbjct: 11 SPNININRDPRWGRNQEVPSEDPLLNGEFGKLYTMGLQQ--------------------- 49
Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
GE+S R +++++ KH AY LE F+R++F+A ++ DT+ P FR + +
Sbjct: 50 --GEDS-RYTKVVVT--LKHWDAYSLEDSDGFTRHNFDAKVSNFALMDTYWPAFRKAVME 104
Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHED 345
G A +MCSYN +NG P C L + R+ W F GY+TSD A+ I+ +YT
Sbjct: 105 GNAKGVMCSYNALNGRPTCTHPLLTKVLRDIWKFDGYVTSDTGAIEDIYAKHHYTANASA 164
Query: 346 SAAGVLKAG-MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
+ A L+ G D++ G A++ G+ D+DRAL N ++ LGLF DP +
Sbjct: 165 AVAAALRDGRCDMDSGAVYHDALLDAVNSGECSMDDVDRALYNTLKLRFELGLF--DPIE 222
Query: 405 GK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNIS 463
+ Y ++ + T+ + L + + ++LL+N LP K +A+IGP +N
Sbjct: 223 DQPYWRINASSINTTYAQDLNMKITLESMILLQNHNNALPFKKG--RKVAVIGPHINAQE 280
Query: 464 QMGGGYTGIPCSPKSL------LRGLEAY--VSKTHYASGCHDVPCNSDAGFHEAVRIAK 515
+ G Y G C S L +EA +S T A G + C +DA EAV +AK
Sbjct: 281 ALVGNYLGQLCPDDSFDCITSPLAAIEAINGMSNTVSAMGSGVLAC-TDASIQEAVNVAK 339
Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
AD+V+++ G++ T E E DR S+ LP Q L ++A +K +++ GG ++
Sbjct: 340 DADYVVLLIGINDTIEAESNDRTSIDLPQCQHKLTAAIAHLNKTTAAVLINGGM---LAI 396
Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNP-GGRLPMTWYPESFT-KVPMNDMNMR 633
+ Q+ +I+ GYPG G A+A+ IFGD N GG+LP T YP + K+ M+DM M
Sbjct: 397 EQEKKQLPAIIEAGYPGFYGGAAIAKTIFGDNNHLGGKLPYTVYPADYIHKINMSDMEM- 455
Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
PGRSYR+YTG ++ FG GL+YT +S + PS T +
Sbjct: 456 ----TNSPGRSYRYYTGQPLWPFGFGLAYTTFSVQS-PGPSASTFA-------------- 496
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLF---ARVPKVSQGTPEKQL 750
TGS + F + + V N G G VV ++ +P S +KQL
Sbjct: 497 TGS--------------NTSFSLPVHVVNTGKRTGDTVVQVYMAPVSLPHRSFSL-KKQL 541
Query: 751 IGFDRVH 757
I F+RVH
Sbjct: 542 IAFERVH 548
>gi|304406707|ref|ZP_07388362.1| glycoside hydrolase family 3 domain protein [Paenibacillus
curdlanolyticus YK9]
gi|304344240|gb|EFM10079.1| glycoside hydrolase family 3 domain protein [Paenibacillus
curdlanolyticus YK9]
Length = 733
Score = 283 bits (724), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 203/687 (29%), Positives = 342/687 (49%), Gaps = 93/687 (13%)
Query: 123 TSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQ 181
T FP L AA++N + +A + EA G+T+ +AP I++ RDPRWGR
Sbjct: 112 TVFPIPLAMAAAWNPEVARQTSAAASTEALT------DGVTWVFAPMIDVSRDPRWGRIA 165
Query: 182 ETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSA 241
E+ GEDP + +AY +V+ G + D G ++
Sbjct: 166 ESIGEDPYLTAAYGRAWVE---------------------------GSQIDNGPGRATAS 198
Query: 242 CCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNG 300
C KH Y + + G +N + +++++ D PPF+ +E G S IM S+N++NG
Sbjct: 199 CPKHFAGYGMAEAGR----DYNTVDLSDRELRDIILPPFQDAVEAGALS-IMASFNEING 253
Query: 301 VPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINC 359
+PAC L + R+EWGF+G + SD +A+ + E++ + AG D++
Sbjct: 254 IPACANEYLLKTILRDEWGFEGVVASDYNALVELI-VHGVAANEEEACEMTVLAGCDMDM 312
Query: 360 GTCML-RHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTS 418
+ + R + G+V E +D ++ + +++++LGL + K + S
Sbjct: 313 HSGIFTRQLPKLVRAGRVPESVVDDSVRRILAMKIKLGLL--EQSKSDVSQSAATQPLKS 370
Query: 419 EHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG-----GYTGIP 473
E+ +LA +AARQ IVLL+N ++ LPL+K A +S+A+IGPL +N + G G +
Sbjct: 371 EYVELAREAARQSIVLLQNKEQVLPLSK-AGASIAVIGPLADNATDPLGCWALDGRSDEV 429
Query: 474 CSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETE 533
+ +R A + YA GC D+ +S+ GF A+ A+ +D V+++ G T E
Sbjct: 430 VTALEGIRQAAAEGTSIRYAQGC-DIDSDSEEGFEAALEAARSSDVVVMLLGESATMSGE 488
Query: 534 DRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGE 593
R R +L LPG+Q +LV +VA+ K P++ V+ G PL +FA Q S+I+ + G
Sbjct: 489 SRSRAALDLPGKQRALVEAVAKLGK-PIVAVILSGRPL--TFAWLPEQASAIVQAWHLGV 545
Query: 594 AGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFY---TG 650
A+A+++FGDFNP GRLP+T +P++ ++P+ + R + R G +Y T
Sbjct: 546 QSGNAIADVLFGDFNPSGRLPVT-FPQNVGQIPI--YHYRKKTGRPPAGAYSSYYIDSTT 602
Query: 651 TQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCT 710
+Y FG+GL+YT + Y + T +S+ G+D+
Sbjct: 603 EPLYPFGYGLTYTEFEYGAIQ-----TSKSSI--GADE---------------------- 633
Query: 711 SLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGV 770
+ V +S+ N G++ G VV + R S P K+L+ F +V A S +++F +
Sbjct: 634 --QLDVTVSIRNVGNLAGEEVVQCYVRDEVASVTQPLKRLVAFRKVKVAAGESVDVTFTI 691
Query: 771 DPCEQLSIANKHGRRILPLGNHVLMVG 797
E L+I +KH +R + G+ L +G
Sbjct: 692 GAAE-LAILDKHMKRTVEPGDFTLWIG 717
>gi|381170979|ref|ZP_09880130.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
citri pv. mangiferaeindicae LMG 941]
gi|380688543|emb|CCG36617.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
citri pv. mangiferaeindicae LMG 941]
Length = 901
Score = 283 bits (724), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 162/463 (34%), Positives = 248/463 (53%), Gaps = 45/463 (9%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
P+ +T S RA L+S +TL+EK Q+ + A AIPRL +PAY+WW+E+LHG+A G
Sbjct: 33 PYLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWWNEALHGVARAG-- 90
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN--LGQ------AGLTF 164
T FPQ + AA+F+ L + +A++ EARA ++ L Q GLTF
Sbjct: 91 --------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHARYQGLTF 142
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
W+PNINIFRDPRWGRGQET GEDP + + V FV+ QGE + G +R+
Sbjct: 143 WSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGAAAPKNAQGEPYRK---- 198
Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIE 284
L A KH + + R+ F+A +++D +T+ P F + ++
Sbjct: 199 --------------LDATAKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVK 241
Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTH 343
+GK +M +YN+V G A L Q R++WGF+GY+ SDC A+ I+++ T
Sbjct: 242 EGKVDAVMGAYNRVYGESASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVATR 301
Query: 344 EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPR 403
E +AA +K G ++ CG +A+ +G + E ID AL L + ++RLG+F+ P
Sbjct: 302 EQAAALAVKHGTELECGEEYAT-LPAAVRQGLIDEAQIDTALKTLMTARMRLGMFD-PPG 359
Query: 404 KGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNIS 463
+ + + + H LA AR+ +VLLKND LPL++ + +A+IGP ++
Sbjct: 360 QLPWSTIPASVNQSPAHDALARRTARESLVLLKND-GLLPLSRAKLKRIAVIGPTADDTM 418
Query: 464 QMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGCHDVPCNSD 504
+ G Y G P +P ++L+G+ A ++ YA G V D
Sbjct: 419 ALLGNYYGTPAAPVTVLQGIRAAAPNAQVLYARGADLVEGRDD 461
Score = 133 bits (334), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 98/320 (30%), Positives = 144/320 (45%), Gaps = 59/320 (18%)
Query: 480 LRGLEAYVSKTHYASGCHDVPCN-------SDAGFHEAVRIAKKADFVIVVAGLDLTQET 532
L+ AY + Y G D + EA+ +A+ AD V+ V GL E
Sbjct: 591 LQAGRAYDVRLEYFEGERDAAVRLAWRQPGAKPPLQEALDVARSADVVVFVGGLTGDVEG 650
Query: 533 EDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQI 582
E+ DR L LP Q L+ ++ T RPV+ VLT G L + +A+ +
Sbjct: 651 EEMKVNYPGFAGGDRTDLRLPKPQRDLLEALQATG-RPVVAVLTTGSALAIDWAQ--QHL 707
Query: 583 SSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPG 642
+IL YPG+ G A+A+ +FGD NPGGRLP+T+Y ES T +D MR G
Sbjct: 708 PAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYKESETLPAFDDYAMR--------G 759
Query: 643 RSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVH 702
R+YR++ GT +Y FGHGLSYT ++Y +G RLD
Sbjct: 760 RTYRYFGGTPLYPFGHGLSYTQFAY--------------------------SGLRLDRTT 793
Query: 703 IDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG 762
I S T+ ++V N G G VV L+ + K+L GF R+
Sbjct: 794 IATDGSLTAT-----VTVKNTGQRAGDEVVQLYLHPLAPQRERAGKELHGFQRIALQPGE 848
Query: 763 SKEISFGVDPCEQLSIANKH 782
+E+ F ++ + L + ++
Sbjct: 849 QRELGFTINAKDALRLYDEQ 868
>gi|294667502|ref|ZP_06732718.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
gi|292602731|gb|EFF46166.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
Length = 901
Score = 283 bits (723), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 162/463 (34%), Positives = 248/463 (53%), Gaps = 45/463 (9%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
P+ +T S RA L+S +TL+EK Q+ + A AIPRL +PAY+WW+E+LHG+A G
Sbjct: 33 PYLDTQRSFEARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWWNEALHGVARAG-- 90
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN--LGQ------AGLTF 164
T FPQ + AA+F+ L + +A++ EARA ++ L Q GLTF
Sbjct: 91 --------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHERYQGLTF 142
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
W+PNINIFRDPRWGRGQET GEDP + + V FV+ QGE + G +R+
Sbjct: 143 WSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGGDAPKNAQGERYRK---- 198
Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIE 284
L A KH + + R+ F+A +++D +T+ P F + ++
Sbjct: 199 --------------LDATAKHFAVHSGPE---ADRHHFDAHPSQRDLYETYLPAFEALVK 241
Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTH 343
GK +M +YN+V G A L Q R++WGF+GY+ SDC A+ I+++ T
Sbjct: 242 DGKVDAVMGAYNRVYGESASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVATR 301
Query: 344 EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPR 403
E +AA +K G ++ CG +A+ +G + E ID AL L + ++RLG+F+ P
Sbjct: 302 EQAAALAVKHGTELECGE-EYSTLPAAVRQGLIDEAQIDTALKTLMTARMRLGMFD-PPG 359
Query: 404 KGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNIS 463
+ + ++ + H LA AR+ +VLLKND LPL++ + +A+IGP ++
Sbjct: 360 QLPWSQIPASVNQSPAHDALARRTARESLVLLKND-GLLPLSRARLKRIAVIGPTADDTM 418
Query: 464 QMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGCHDVPCNSD 504
+ G Y G P +P ++L+G+ A ++ YA G V D
Sbjct: 419 ALLGNYYGTPAAPVTVLQGIRAAAPNAQVLYARGADLVEGRDD 461
Score = 131 bits (330), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 97/320 (30%), Positives = 144/320 (45%), Gaps = 59/320 (18%)
Query: 480 LRGLEAYVSKTHYASGCHDVPCN-------SDAGFHEAVRIAKKADFVIVVAGLDLTQET 532
L+ AY + Y G D + EA+ +A+ A+ V+ V GL E
Sbjct: 591 LQAGRAYEVRLEYFEGERDAAVRLAWRQPGAKPPLQEALDVARSAEVVVFVGGLTGDVEG 650
Query: 533 EDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQI 582
E+ DR L LP Q L+ ++ T K PV+ VLT G L + +A+ +
Sbjct: 651 EEMKVNYPGFAGGDRTDLRLPKPQRDLLEALHATGK-PVVAVLTTGSALAIDWAQ--QHL 707
Query: 583 SSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPG 642
+IL YPG+ G A+A+ +FGD NPGGRLP+T+Y ES T +D MR G
Sbjct: 708 PAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYKESETLPAFDDYAMR--------G 759
Query: 643 RSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVH 702
R+YR++ GT +Y FGHGLSYT ++Y +G RLD
Sbjct: 760 RTYRYFGGTPLYPFGHGLSYTQFAY--------------------------SGLRLDRTT 793
Query: 703 IDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG 762
I S T+ ++V N G G VV L+ + K+L GF R+
Sbjct: 794 IATDGSLTAT-----VTVKNTGQRAGDEVVQLYLHPLTPQRERAGKELHGFQRIALTPGE 848
Query: 763 SKEISFGVDPCEQLSIANKH 782
+E+ F ++ + L + ++
Sbjct: 849 QRELGFTINAKDALRLYDEQ 868
>gi|146301622|ref|YP_001196213.1| glycoside hydrolase family 3 protein [Flavobacterium johnsoniae
UW101]
gi|146156040|gb|ABQ06894.1| Candidate beta-xylosidase; Glycoside hydrolase family 3
[Flavobacterium johnsoniae UW101]
Length = 875
Score = 283 bits (723), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 164/448 (36%), Positives = 232/448 (51%), Gaps = 53/448 (11%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
+ F N SLS R L+S LTL+EK+ Q+ +++ I RLGIPAY+WW+E+LHG+A
Sbjct: 27 FQFQNPSLSFEQRVDDLVSRLTLEEKVSQMLNSSPEIARLGIPAYDWWNETLHGVAR--- 83
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGL 162
T T +PQ + AA+F+++ + A+E RA+YN GL
Sbjct: 84 ------TPFKTTVYPQAIGMAATFDKNSLFTMADYSALEGRAIYNKAVELKRTNERYLGL 137
Query: 163 TFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKR 222
T+W PNINIFRDPRWGRGQET GEDP + + FVK QG++ K
Sbjct: 138 TYWTPNINIFRDPRWGRGQETYGEDPYLTAVLGDAFVKGLQGDDPKY------------- 184
Query: 223 VLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSC 282
L +AC KH Y + R++F+ +T + DT+ P FR
Sbjct: 185 --------------LKAAACAKH---YAVHSGPESLRHTFDVDVTPYELWDTYLPAFRKL 227
Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTK 341
I + + +MC+YN P C L R EW F GY+TSDC A+ F+
Sbjct: 228 ITESNVAGVMCAYNAFRTQPCCASDILMNDILRKEWKFDGYVTSDCWAIDDFFKNHKTHP 287
Query: 342 THEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
E +AA + G DI+CGT + A+ GK+ EK ID ++ LF ++ RLG+F D
Sbjct: 288 DAESAAADAVFHGTDIDCGTDAYKALVQAVKNGKISEKQIDISVKRLFMIRFRLGMF--D 345
Query: 402 P-RKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVN 460
P KY + + + EH+ AL ARQ IVLLKN+K LPLNKN + + ++GP +
Sbjct: 346 PVSMVKYAQTPSSVLESKEHQLHALKMARQSIVLLKNEKNILPLNKN-LKKIVVLGPNAD 404
Query: 461 NISQMGGGYTGIPCSPKSLLRGLEAYVS 488
N + G Y G P ++L+G++ VS
Sbjct: 405 NAISILGNYNGTPSKLTTVLQGIKEKVS 432
Score = 116 bits (290), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 84/287 (29%), Positives = 133/287 (46%), Gaps = 54/287 (18%)
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVART 556
F + + K AD I G+ E E+ DR S+L P Q L+ ++ ++
Sbjct: 596 FADLIEHHKNADAFIFAGGISPQLEGEEMPVDFPGFKGGDRTSILFPEVQTKLLKAL-QS 654
Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
S +PV+ + G + + + EA++ I +IL I Y G++ A A++IFGD+NP GRLP+T
Sbjct: 655 SGKPVVFAMMTGSAIAIPW-EAEN-IPAILNIWYGGQSAGTAAADVIFGDYNPAGRLPVT 712
Query: 617 WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
+Y ND ++ + + ++YR++ GT +YGFG+GLSYT++ Y L P
Sbjct: 713 FYK--------NDSDLPSFVDYKMDNKTYRYFKGTPLYGFGYGLSYTSFKYSDLKTP--- 761
Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
+K G +IL + V N G +G V L+
Sbjct: 762 ---VKIKKGQSVSIL--------------------------VKVANTGKTEGEEVAQLYL 792
Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHG 783
+ TP K L GF+R + +K I+F + P E LS G
Sbjct: 793 INQDTAIKTPLKSLKGFERFNLKPGENKTITFNLSP-EDLSYVTPEG 838
>gi|427386425|ref|ZP_18882622.1| hypothetical protein HMPREF9447_03655 [Bacteroides oleiciplenus YIT
12058]
gi|425726465|gb|EKU89330.1| hypothetical protein HMPREF9447_03655 [Bacteroides oleiciplenus YIT
12058]
Length = 864
Score = 282 bits (722), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 161/445 (36%), Positives = 228/445 (51%), Gaps = 49/445 (11%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
Y F N L + R L+ LTL+EKI Q+ +NA AI RLGIPAY WW+E LHG+A
Sbjct: 24 YKFQNPDLPVEERVNDLVGHLTLEEKISQMMNNAPAIERLGIPAYNWWNECLHGVAR--- 80
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLT 163
+ VTSFPQ + AA+++ + + E RA+Y+ GLT
Sbjct: 81 ------SPYPVTSFPQAIAMAATWDTKSVYQMAEYASDEGRAIYHDAARKGTPGIFRGLT 134
Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
+W+PNINIFRDPRWGRGQET GEDP + +A V FVK QG
Sbjct: 135 YWSPNINIFRDPRWGRGQETYGEDPYLTAAIGVAFVKGLQG------------------- 175
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
D L SAC KH + +W +R+++NA ++ D DT+ P FR +
Sbjct: 176 --------DDPVYLKSSACAKHYAVHSGPEW---NRHTYNAEVSNHDLWDTYLPAFRELV 224
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
K + +MC+YN P C L RN+W F GY+TSDC A+ + N +
Sbjct: 225 VDAKVTGVMCAYNSFFEQPCCGNDLLMMDILRNQWKFDGYVTSDCGAIEDFYNTHNTHED 284
Query: 343 HEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDP 402
+++A + G D CG R AI +G + E+ +D +L LF ++ RLG+F+ D
Sbjct: 285 AAEASADAVLHGTDCECGNGAYRALADAIVRGLITEEQVDVSLKKLFEIRFRLGMFDPDD 344
Query: 403 RKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNI 462
R Y + + HK AL ARQ IVLLKN+K+ LPL+ N + +A++GP ++
Sbjct: 345 RV-PYSDIPISVLECDAHKAHALKMARQSIVLLKNEKQLLPLDMNKIKKIAVVGPNADDK 403
Query: 463 SQMGGGYTGIPCSPKSLLRGLEAYV 487
S + Y G P ++L G++ V
Sbjct: 404 SVLLANYYGYPSCVTTVLEGIKGKV 428
Score = 117 bits (294), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 105/376 (27%), Positives = 166/376 (44%), Gaps = 76/376 (20%)
Query: 444 LNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNS 503
L + ++ L I G N + + Y P + +G ++Y+ + HY +
Sbjct: 527 LKADDMADLYIDGVKQNKVGNINSYY------PLNAEKG-KSYLMEIHYVQHADNAEIKF 579
Query: 504 DAG------FHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQM 547
D G + + K AD V+ V GL E E+ DR S+ +P Q
Sbjct: 580 DMGTLRKADYRQTASSVKDADVVVFVGGLSAKVEGEEMKVEIDGFKRGDRTSISIPVVQQ 639
Query: 548 SLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDF 607
+L+ + T K PVI +L G + + + + +IL Y G+AG +A+A+++FGD+
Sbjct: 640 NLLKELYATGK-PVIFILMTGSAVGLEWES--EHLPAILNAWYGGQAGGQAIADVLFGDY 696
Query: 608 NPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSY 667
NP GRLP+T+Y +ND+ D S + R+YR++TG VY FG+GLSYT++ Y
Sbjct: 697 NPSGRLPLTFYKN------VNDLPDFEDYSMK--NRTYRYFTGIPVYPFGYGLSYTDFQY 748
Query: 668 KFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVD 727
+ L DK L V V+N G +
Sbjct: 749 NTIKVQPSL----------DK-----------------------LSVKVTAEVSNVGKYE 775
Query: 728 GSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRIL 787
G VV L+ P+ TP + L GF R++ S+ + F V ++LS+ +
Sbjct: 776 GEEVVQLYVSNPR-DFVTPIRALKGFRRINLKPGESQMVEF-VLTSKELSVVDV------ 827
Query: 788 PLGNHVLMVGELRHSL 803
GN V M GE++ SL
Sbjct: 828 -AGNFVPMKGEVQISL 842
>gi|423248809|ref|ZP_17229825.1| hypothetical protein HMPREF1066_00835 [Bacteroides fragilis
CL03T00C08]
gi|423253758|ref|ZP_17234689.1| hypothetical protein HMPREF1067_01333 [Bacteroides fragilis
CL03T12C07]
gi|392655387|gb|EIY49030.1| hypothetical protein HMPREF1067_01333 [Bacteroides fragilis
CL03T12C07]
gi|392657750|gb|EIY51381.1| hypothetical protein HMPREF1066_00835 [Bacteroides fragilis
CL03T00C08]
Length = 805
Score = 282 bits (722), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 232/738 (31%), Positives = 346/738 (46%), Gaps = 124/738 (16%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP + E HG + G T FP + A+++N L +G +A+
Sbjct: 140 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRQMGRVIAI 187
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
EA A Q + P +++ RDPRW R +ET GEDP + V+ FQGE
Sbjct: 188 EASA-----QGAHIGYGPVLDLARDPRWSRVEETYGEDPYLNGVMGTALVRGFQGETL-- 240
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
+DG K V+ A KH +Y W A I E+
Sbjct: 241 NDG--------KSVI----------------ATLKHFASYG---WTEGGHNGGTAHIGER 273
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC-LRGDLFQKARNEWGFKGYITSDCD 328
+ E+ PPFR + G A +M SYN+++G P R L ++ W FKG++ SD
Sbjct: 274 ELEEAIFPPFREAVGAG-ALSVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVSDLY 332
Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSAIDKGKVQEKDIDRALLN 387
AV + E+ +E +A + AG+D + GT + +A+ +G V ID+A+
Sbjct: 333 AVGGLREHGVAGNDYE-AAIKAVNAGVDSDLGTNVYAEQLVAAVKRGDVAVATIDKAVRR 391
Query: 388 LFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKN 447
+ S++ ++GLF+ DP + K V +SEH LA + ARQ IVLLKN K LPL K+
Sbjct: 392 ILSLKFQMGLFD-DPFVDE--KQAVQLVASSEHTGLAREVARQSIVLLKNKDKLLPLKKD 448
Query: 448 AVSSLAIIGPLVNNISQMGGGYTGIPC--SPKSLLRGLEAYVSKTH---YASGCHDVPCN 502
+ +LA+IGP +N+ M G YT + ++L G+ VSK YA GC V +
Sbjct: 449 -IRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCA-VRDS 506
Query: 503 SDAGFHEAVRIAKKADFVIVVAG----LDLTQETEDR-------------------DRVS 539
S GF +A+ A+ AD V++V G D + E E+ DR +
Sbjct: 507 SRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMESGEGYDRAT 566
Query: 540 LLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKAL 599
L L G+Q+ L+ ++R K PV+LVL G PL + A +++ W YPG G A+
Sbjct: 567 LHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAEAIVDAW--YPGMQGGNAV 623
Query: 600 AEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHG 659
A+++FGD+NP GRL ++ P S ++P+ N R +R Y GT Y FG+G
Sbjct: 624 ADVLFGDYNPAGRLTLS-VPRSVGQLPVY-YNTRRKGNRS----RYVEEPGTPRYPFGYG 677
Query: 660 LSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQIS 719
LSYT +SY + + GSD C R V ++
Sbjct: 678 LSYTTFSYT--------DMKVQVTEGSD--------------------DC---RVDVTVT 706
Query: 720 VTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIA 779
+ N G DG V L+ + S TP KQL F R+H A S+E++F +D + L++
Sbjct: 707 IQNQGTADGDEVAQLYFQDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTLDK-KSLALY 765
Query: 780 NKHGRRILPLGNHVLMVG 797
+ G ++ G +MVG
Sbjct: 766 MQEGEWVVEPGRFTIMVG 783
>gi|391417909|gb|AFM44649.1| Xyl3A [Caldanaerobius polysaccharolyticus]
Length = 789
Score = 282 bits (722), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 222/756 (29%), Positives = 345/756 (45%), Gaps = 146/756 (19%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIPA ES G + G T FPQ + A+++N L +GS +
Sbjct: 98 RLGIPAL-VHEESCSGYMAKG-----------ATCFPQTIGVASTWNTELVKQMGSVIRE 145
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
+ +A+ AP +++ RD RWGR +ET GEDP ++S V +++ QG N K
Sbjct: 146 QMKAV-----GAHQALAPLMDVARDARWGRVEETFGEDPYLISEMGVSYIEGLQGGNIK- 199
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFN---AMI 266
DG + A KH + Y FS N A I
Sbjct: 200 -DG--------------------------IMATVKHFVGY------GFSEGGMNWAPAHI 226
Query: 267 TEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITS 325
E++ + F PF + +++ K + +M +Y++++G+P L Q RNEWGF G + S
Sbjct: 227 PERELREVFLLPFEAAVKKAKTASVMAAYHELDGIPCHGSKKLLTQILRNEWGFDGLVVS 286
Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGT--CMLRHTQSAIDKGKVQEKDIDR 383
D V ++EY + + ++A L+AG+DI + C + + A+ KG V+E ID
Sbjct: 287 DYFGVNMLYEYHHVARDKGEAAKIALQAGVDIELPSRDCYGQPLKEAVQKGLVEEALIDE 346
Query: 384 ALLNLFSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKK 440
+ + ++ G+F D K + T + +KLA A+Q IVLLKN
Sbjct: 347 VVRRILRMKFLSGVFENPYVDVEKAA------EVFDTPDQRKLAYKLAQQSIVLLKNQGD 400
Query: 441 FLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPC----------------SP-------- 476
LPL K+ + S+A+IGP +++ + G Y PC +P
Sbjct: 401 LLPLKKD-IKSIAVIGPNADSVRNIIGDYA-YPCHIESLVETKEQSNVFNTPVPDKVSLV 458
Query: 477 ------KSLLRGLEAYVS---KTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG-- 525
KS+L G++ +S + HYA GC +V + GF EA+ AKK+D IVV G
Sbjct: 459 DNFVPIKSILEGIKGKISPETELHYAKGC-EVTGDDKGGFAEAIEAAKKSDVAIVVVGDK 517
Query: 526 ---LDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQI 582
D E RDR L LPG Q LV ++ T P ++VL G PL +++ I
Sbjct: 518 AGLTDDCTSGESRDRADLNLPGVQQELVEAIYNTGT-PTVVVLVNGRPLSINWI--SRHI 574
Query: 583 SSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPG 642
+I+ PGE GA A+A+++FGD+NPGG+LP++ +P S +VP+ N + R +
Sbjct: 575 PAIIEAWLPGEEGAAAVADVLFGDYNPGGKLPVS-FPRSVGQVPVY-YNHKPSGGRSHWK 632
Query: 643 RSYRFYTGTQVYGFGHGLSYTNYSYKFLS-APSELTISASLKAGSDKNILQQTGSRLDYV 701
Y + +Y FG+GLSYT + + L APSE+ ++
Sbjct: 633 GDYVEMSTKPLYPFGYGLSYTKFEFSNLEIAPSEVYDDGKVR------------------ 674
Query: 702 HIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAK 761
+ + V NAG ++G VV L+ R + P K+L GF RV
Sbjct: 675 --------------ISVDVQNAGKLEGDEVVQLYVRNEVSNVTRPVKELKGFKRVSLRPG 720
Query: 762 GSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
K++ F + QL ++ R ++ G +M+G
Sbjct: 721 EKKKVVFELS-VSQLGFYDEDMRYVVQPGTVKVMIG 755
>gi|418518550|ref|ZP_13084692.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB1386]
gi|418522850|ref|ZP_13088880.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB2388]
gi|410700720|gb|EKQ59264.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB2388]
gi|410703176|gb|EKQ61671.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB1386]
Length = 901
Score = 282 bits (722), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 162/463 (34%), Positives = 247/463 (53%), Gaps = 45/463 (9%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
P+ +T S RA L+S +TL+EK Q+ + A AIPRL +PAY+WW+E+LHG+A G
Sbjct: 33 PYLDTQRSFEARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWWNEALHGVARAG-- 90
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN--LGQ------AGLTF 164
T FPQ + AA+F+ L + +A++ EARA ++ L Q GLTF
Sbjct: 91 --------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHARYQGLTF 142
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
W+PNINIFRDPRWGRGQET GEDP + + V FV+ QGE + G +R+
Sbjct: 143 WSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGADAPKNAQGERYRK---- 198
Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIE 284
L A KH + + R+ F+A +++D +T+ P F + ++
Sbjct: 199 --------------LDATAKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVK 241
Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTH 343
GK +M +YN+V G A L Q R++WGF+GY+ SDC A+ I+++ T
Sbjct: 242 DGKVDAVMGAYNRVYGESASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVATR 301
Query: 344 EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPR 403
E +AA +K G ++ CG +A+ +G + E ID AL L + ++RLG+F+ P
Sbjct: 302 EQAAALAVKHGTELECGEEYAT-LPAAVRQGLIDEAQIDTALKTLMTARMRLGMFD-PPG 359
Query: 404 KGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNIS 463
+ + + + H LA AR+ +VLLKND LPL++ + +A+IGP ++
Sbjct: 360 QLPWSTIPASVNQSPAHDALARRTARESLVLLKND-GLLPLSRAKLKRIAVIGPTADDTM 418
Query: 464 QMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGCHDVPCNSD 504
+ G Y G P +P ++L+G+ A ++ YA G V D
Sbjct: 419 ALLGNYYGTPAAPVTVLQGIRAAAPNAQVLYARGADLVEGRDD 461
Score = 134 bits (336), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 98/320 (30%), Positives = 144/320 (45%), Gaps = 59/320 (18%)
Query: 480 LRGLEAYVSKTHYASGCHDVPCN-------SDAGFHEAVRIAKKADFVIVVAGLDLTQET 532
L+ AY + Y G D + EA+ +A+ AD V+ V GL E
Sbjct: 591 LQAGRAYEVRLEYFEGERDAAVRLAWRQPGAKPPLQEALDVARSADVVVFVGGLTGDVEG 650
Query: 533 EDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQI 582
E+ DR L LP Q L+ ++ T K PV+ VLT G L + +A+ +
Sbjct: 651 EEMKVNYPGFAGGDRTDLRLPKPQRDLLEALQATGK-PVVAVLTAGSALAIDWAQ--QHL 707
Query: 583 SSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPG 642
+IL YPG+ G A+A+ +FGD NPGGRLP+T+Y ES T +D MR G
Sbjct: 708 PAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYKESETLPAFDDYAMR--------G 759
Query: 643 RSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVH 702
R+YR++ GT +Y FGHGLSYT ++Y +G RLD
Sbjct: 760 RTYRYFGGTPLYPFGHGLSYTQFAY--------------------------SGLRLDRTT 793
Query: 703 IDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG 762
I S T+ ++V N G G VV L+ + K+L GF R+
Sbjct: 794 IATDGSLTAT-----VTVKNTGQRAGDEVVQLYLHPLAPQRERAGKELHGFQRIALQPGE 848
Query: 763 SKEISFGVDPCEQLSIANKH 782
+E+ F ++ + L + ++
Sbjct: 849 QRELGFTINAKDALRLYDEQ 868
>gi|78049893|ref|YP_366068.1| beta-glucosidase precursor [Xanthomonas campestris pv. vesicatoria
str. 85-10]
gi|78038323|emb|CAJ26068.1| beta-glucosidase precursor [Xanthomonas campestris pv. vesicatoria
str. 85-10]
Length = 902
Score = 282 bits (722), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 162/462 (35%), Positives = 246/462 (53%), Gaps = 45/462 (9%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ +T S RA L+S +TL+EK Q+ + A AIPRLG+PAY+WW+E+LHG+A G
Sbjct: 35 YLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 91
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN--LGQ------AGLTFW 165
T FPQ + AA+F+ L + +A++ EARA ++ L Q GLTFW
Sbjct: 92 -------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHARYQGLTFW 144
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
+PNINIFRDPRWGRGQET GEDP + + V FV+ +GE + G +R+
Sbjct: 145 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLRGEGADAPKNAQGEPYRK----- 199
Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
L A KH + + R+ F+A +++D +T+ P F + ++
Sbjct: 200 -------------LDATAKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVKD 243
Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
GK +M +YN+V G A L Q R +WGFKGY+ SDC A+ I+++ T E
Sbjct: 244 GKVDAVMGAYNRVYGESASASKFLLQDVLRQQWGFKGYVVSDCWAIVDIWKHHKIVATRE 303
Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
+AA +K G ++ CG +A+ +G + E ID AL L + ++RLG+F+ P +
Sbjct: 304 QAAALAVKHGTELECGE-EYSTLPAAVRQGLIDEAQIDTALTTLMTARMRLGMFD-PPGQ 361
Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
+ + + H LA AR+ +VLLKND LPL++ + +A+IGP ++
Sbjct: 362 LPWSTIPASVNQSPAHDALARRTARESLVLLKND-GLLPLSRAKLKRIAVIGPTADDTMA 420
Query: 465 MGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGCHDVPCNSD 504
+ G Y G P +P ++L+G+ A ++ YA G V D
Sbjct: 421 LLGNYYGTPAAPVTVLQGIRAAAPNAQVLYARGADLVEGRDD 462
Score = 131 bits (329), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 94/282 (33%), Positives = 131/282 (46%), Gaps = 52/282 (18%)
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVART 556
EA+ +A AD V+ V GL E E+ DR L LP Q L+ ++ T
Sbjct: 626 LQEALDVASSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEALQAT 685
Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
K PV+ VLT G L + +A+ + +IL YPG+ G A+A+ +FGD NPGGRLP+T
Sbjct: 686 GK-PVVAVLTTGSALAIDWAQ--QHLPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVT 742
Query: 617 WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
+Y ES T +D MR GR+YR++ GT +Y FGHGLSYT ++Y
Sbjct: 743 FYKESETLPAFDDYAMR--------GRTYRYFGGTPLYPFGHGLSYTQFAY--------- 785
Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
+G RLD I S T+ ++V N G G VV L+
Sbjct: 786 -----------------SGLRLDRTTIAADGSLTAT-----VTVKNTGQRAGDEVVQLYL 823
Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
+ K+L GF R+ A + + F +D L I
Sbjct: 824 HPLTPQRERAGKELHGFQRITLQAGEQRALHFILDAKNALRI 865
>gi|315607027|ref|ZP_07882031.1| beta-glucosidase [Prevotella buccae ATCC 33574]
gi|315251081|gb|EFU31066.1| beta-glucosidase [Prevotella buccae ATCC 33574]
Length = 866
Score = 282 bits (721), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 156/457 (34%), Positives = 249/457 (54%), Gaps = 44/457 (9%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
YP+ N LS RA+ L S LTL+EK + + +++ AIPRLGIP +EWWSE+LHGIA NG
Sbjct: 23 YPYQNLQLSSQERAEDLCSRLTLEEKTKLMRNSSPAIPRLGIPQFEWWSEALHGIARNG- 81
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLT 163
T FPQ AAS++ L + A + EA A NL + G++
Sbjct: 82 ---------FATVFPQTTAMAASWDDELLYRVFCAASDEAVAKNNLARKSGDIKRYQGVS 132
Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
W PNINIFRDPRWGRGQET GEDP + S + V QG+ ++ D
Sbjct: 133 IWTPNINIFRDPRWGRGQETYGEDPYLTSRMGLAVVNGLQGQPFRRD------------- 179
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNA-MITEQDTEDTFQPPFRSC 282
++ F E L AC KH + +W +R+ F+ + E+D +T+ P F+S
Sbjct: 180 MRPFTERPRYYKTL---ACAKHYAVHSGPEW---NRHVFDVERLPERDLWETYLPAFKSL 233
Query: 283 IEQGKASCIMCSYNQVNGVPACLRGD-LFQKARNEWGFKGYITSDCDAVATIFE--YQNY 339
+++G +MC+Y +++G P C L Q R EWG+ G + SDC A++ + + +
Sbjct: 234 VQEGNVREVMCAYQRIDGSPCCGNTRYLHQILRGEWGYNGLVVSDCGAISDFYREGHHHV 293
Query: 340 TKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
+T +++A ++AG D+ CG + A+++G + + ID +++ L + +G F+
Sbjct: 294 VETPAEASAMGVRAGTDVECG-AVYATLPRAVEQGLISREAIDTSVVRLLKARFEVGDFD 352
Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
+ + + GP+ + + H++LALD AR+ + LL+N + LPL+KN + +A++GP
Sbjct: 353 SE-KLVPWKLTGPEVIASETHRRLALDMARESMTLLQNRNRLLPLSKNGL-RIAVMGPNA 410
Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGC 496
N+ + G YTG P S ++L+G+ + V + GC
Sbjct: 411 NDSVMLWGNYTGYPISTTTILKGIRSKVPAARFVEGC 447
Score = 105 bits (262), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 93/334 (27%), Positives = 143/334 (42%), Gaps = 65/334 (19%)
Query: 486 YVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR---------- 535
YV A D+ S E A AD V+ V G+ E E+
Sbjct: 580 YVQNKAMAVCQFDIARKSPITASEIAAQAGDADVVVFVGGISPRLEGEEMKVDAPGFNGG 639
Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
DR S+ LP Q ++ + + K V + +GG V AEA ++L Y GEAG
Sbjct: 640 DRTSIELPEAQREVIRLLRQAGKLVVFVNCSGGAVALVPEAEA---CDAVLQAWYAGEAG 696
Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
+A+A+++FGD+NP G+LP+T+Y D M GR+YR++ GT ++
Sbjct: 697 GQAVADVLFGDYNPSGKLPVTFYKSDADLPDFLDYRM--------TGRTYRYFRGTPLFP 748
Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
FG GLSYT++ + G+ + + G
Sbjct: 749 FGFGLSYTSFVF-----------------GTPR---YENG-------------------K 769
Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
+ + VTN G DG+ VV ++ + P + G P K L GF R+ A + + + P E+
Sbjct: 770 LYVEVTNTGKRDGAEVVQVYVKNPADADG-PVKTLRGFARIDLKAGERRRVEIAM-PRER 827
Query: 776 LS--IANKHGRRILPLGNHVLMVGELRHSLTIET 807
A + R+ P GNH+LMVG ++T
Sbjct: 828 FEGWDATTNTMRVKP-GNHLLMVGSSSRDADLQT 860
>gi|408369545|ref|ZP_11167326.1| glycoside hydrolase [Galbibacter sp. ck-I2-15]
gi|407745291|gb|EKF56857.1| glycoside hydrolase [Galbibacter sp. ck-I2-15]
Length = 881
Score = 282 bits (721), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 168/443 (37%), Positives = 239/443 (53%), Gaps = 55/443 (12%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
YPF N L S R L+ LT++EKI QL + AI RLGIP Y WW+ESLHG+A G
Sbjct: 27 YPFQNPELDDSARVADLLERLTVEEKIDQLLYTSPAIERLGIPEYNWWNESLHGVARAG- 85
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN----LGQ----AGLT 163
T FPQ + AA+++ L + A++ EARA ++ GQ GLT
Sbjct: 86 ---------YATVFPQSITIAAAWDSDLLKEVADAISDEARAKHHEYIRRGQRGIYQGLT 136
Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
FW+PNINIFRDPRWGRG ET GEDP + + +VK QG +
Sbjct: 137 FWSPNINIFRDPRWGRGHETYGEDPYLTGQLGIAYVKGLQGND----------------- 179
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
+ L L A KH + + R+ F+ +++D +T+ P FR +
Sbjct: 180 ----------PNYLKLVATAKHFAVHSGPE---PLRHEFDVSPSKRDLWETYLPAFRYLV 226
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTH 343
+QG +M +YN+V G A LF R+ W F GY+ SDC A++ I++Y K
Sbjct: 227 KQGDVKSVMTAYNRVYGEAASASDTLFTILRDYWDFDGYVVSDCFAISDIWKYHKIAKDA 286
Query: 344 EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPR 403
+++A + G D+NCG + Q A +G V EKDID AL L +++LG+F DP
Sbjct: 287 AEASAMAVIEGCDLNCGDSYEKLNQ-AYQQGMVTEKDIDIALSRLMEARIKLGMF--DPE 343
Query: 404 K-GKYGKLGPDDVCTSE-HKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
+ Y ++ P +V TSE H +LAL AA++ IVLLKN LPL+K+ + S+A+IGP +N
Sbjct: 344 QLVPYAQI-PFNVNTSEKHNQLALKAAKESIVLLKNQGDLLPLSKD-LKSVAVIGPNADN 401
Query: 462 ISQMGGGYTGIPCSPKSLLRGLE 484
I + G Y G P P ++L+G++
Sbjct: 402 IQSLWGNYNGNPKDPITVLQGIQ 424
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 84/263 (31%), Positives = 135/263 (51%), Gaps = 46/263 (17%)
Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
DR +L LP Q +L+ VA+T K P++LVL G L +++A + I +I+ GY G+ G
Sbjct: 631 DRTALDLPASQRTLLKEVAKTGK-PIVLVLLNGSALSINWAAEN--IPAIMTAGYAGQQG 687
Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVY 654
A+AE++FGD+NP RLP+T+Y +S +P D NM GR+YR++ +Y
Sbjct: 688 GNAVAEVLFGDYNPAARLPVTYY-KSVEDLPDFEDYNMD--------GRTYRYFEKEPLY 738
Query: 655 GFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRF 714
FG+GLSYT + Y PS++ ++ S++
Sbjct: 739 PFGYGLSYTTFDYSKFQLPSKIDMNESIE------------------------------- 767
Query: 715 HVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCE 774
+ + VTN G DG VV ++ K S P ++L+GF R+H S+++ F ++P
Sbjct: 768 -LSVEVTNTGAYDGDEVVQVYLTDEKGSTPRPIRELVGFKRIHLKKGESQKVQFTIEP-R 825
Query: 775 QLSIANKHGRRILPLGNHVLMVG 797
QLS+ + G ++ G + VG
Sbjct: 826 QLSMIDDKGDLVIEPGVFSISVG 848
>gi|383117091|ref|ZP_09937838.1| hypothetical protein BSHG_0805 [Bacteroides sp. 3_2_5]
gi|382973702|gb|EES87886.2| hypothetical protein BSHG_0805 [Bacteroides sp. 3_2_5]
Length = 805
Score = 282 bits (721), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 233/738 (31%), Positives = 347/738 (47%), Gaps = 124/738 (16%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP + E HG + G T FP + A+++N L +G +A+
Sbjct: 140 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRQMGRVIAI 187
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
EA A Q + P +++ RDPRW R +ET GEDP + V+ FQGE
Sbjct: 188 EASA-----QGAHIGYGPVLDLARDPRWSRVEETYGEDPYLNGVMGTALVRGFQGETL-- 240
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
+DG K V+ A KH +Y W A I E+
Sbjct: 241 NDG--------KSVI----------------ATLKHFASYG---WTEGGHNGGTAHIGER 273
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC-LRGDLFQKARNEWGFKGYITSDCD 328
+ E+ PPFR + G A +M SYN+++G P R L ++ W FKG++ SD
Sbjct: 274 ELEEAIFPPFREAVGAG-ALSVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVSDLY 332
Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSAIDKGKVQEKDIDRALLN 387
AV + E+ +E +A + AG+D + GT + +A+ +G V ID+A+
Sbjct: 333 AVGGLREHGVAGNDYE-AAIKAVNAGVDSDLGTNVYAEQLVAAVKRGDVAVATIDKAVRR 391
Query: 388 LFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKN 447
+ S++ ++GLF+ DP + K V +SEH LA + ARQ IVLLKN K LPL K+
Sbjct: 392 ILSLKFQMGLFD-DPFVDE--KQAVQLVASSEHTGLAREVARQSIVLLKNKDKLLPLKKD 448
Query: 448 AVSSLAIIGPLVNNISQMGGGYTGIPC--SPKSLLRGLEAYVSKTH---YASGCHDVPCN 502
+ +LA+IGP +N+ M G YT + ++L G+ VSK YA GC V +
Sbjct: 449 -IRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCA-VRDS 506
Query: 503 SDAGFHEAVRIAKKADFVIVVAG----LDLTQETEDR-------------------DRVS 539
S GF +A+ A+ AD V++V G D + E E+ DR +
Sbjct: 507 SRTGFKDAIETARNADTVVMVMGGSSARDFSSEYEETGAAKVTINQISDMESGEGYDRAT 566
Query: 540 LLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKAL 599
L L G+Q+ L+ ++R K PV+LVL G PL + A +++ W YPG G A+
Sbjct: 567 LHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAEAIVDAW--YPGMQGGNAV 623
Query: 600 AEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHG 659
A+++FGD+NP GRL ++ P S ++P+ N R +R Y GT Y FG+G
Sbjct: 624 ADVLFGDYNPAGRLTLS-VPRSVGQLPVY-YNTRRKGNRS----RYVEEPGTPRYPFGYG 677
Query: 660 LSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQIS 719
LSYT +SY +D + GS D +D V ++
Sbjct: 678 LSYTTFSY------------------TDMKVQVTEGS--DDCWVD-----------VTVT 706
Query: 720 VTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIA 779
+ N G DG V L+ R S TP KQL F R+H A S+E++F +D + L++
Sbjct: 707 IQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTLDK-KSLALY 765
Query: 780 NKHGRRILPLGNHVLMVG 797
+ G ++ G +MVG
Sbjct: 766 MQEGEWVVEPGRFTIMVG 783
>gi|402304900|ref|ZP_10823963.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
sp. MSX73]
gi|400380686|gb|EJP33499.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
sp. MSX73]
Length = 866
Score = 281 bits (720), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 156/457 (34%), Positives = 249/457 (54%), Gaps = 44/457 (9%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
YP+ N LS RA+ L S LTL+EK + + +++ AIPRLGIP +EWWSE+LHGIA NG
Sbjct: 23 YPYQNPRLSSQERAEDLCSRLTLEEKTKLMRNSSPAIPRLGIPQFEWWSEALHGIARNG- 81
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLT 163
T FPQ AAS++ L + A + EA A NL + G++
Sbjct: 82 ---------FATVFPQTTAMAASWDDELLYRVFCAASDEAVAKNNLARKSGDIKRYQGVS 132
Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
W PNINIFRDPRWGRGQET GEDP + S + V QG+ ++ D
Sbjct: 133 IWTPNINIFRDPRWGRGQETYGEDPYLTSRMGLAVVNGLQGQPFRRD------------- 179
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNA-MITEQDTEDTFQPPFRSC 282
++ F E L AC KH + +W +R+ F+ + E+D +T+ P F+S
Sbjct: 180 MRPFTERPRYYKTL---ACAKHYAVHSGPEW---NRHVFDVERLPERDLWETYLPAFKSL 233
Query: 283 IEQGKASCIMCSYNQVNGVPACLRGD-LFQKARNEWGFKGYITSDCDAVATIFE--YQNY 339
+++G +MC+Y +++G P C L Q R EWG+ G + SDC A++ + + +
Sbjct: 234 VQEGNVREVMCAYQRIDGSPCCGNTRYLHQILRGEWGYNGLVVSDCGAISDFYREGHHHV 293
Query: 340 TKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
+T +++A ++AG D+ CG + A+++G + + ID +++ L + +G F+
Sbjct: 294 VETPAEASAMGVRAGTDVECG-AVYATLPRAVEQGLISREAIDTSVVRLLKARFEVGDFD 352
Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
+ + + GP+ + + H++LALD AR+ + LL+N + LPL+KN + +A++GP
Sbjct: 353 SE-KLVPWKLTGPEVIASETHRRLALDMARESMTLLQNRNRLLPLSKNGL-RIAVMGPNA 410
Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGC 496
N+ + G YTG P S ++L+G+ + V + GC
Sbjct: 411 NDSVMLWGNYTGYPISTTTILKGIRSKVPAARFVEGC 447
Score = 107 bits (266), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 93/334 (27%), Positives = 142/334 (42%), Gaps = 65/334 (19%)
Query: 486 YVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR---------- 535
YV A D+ S E A AD V+ V G+ E E+
Sbjct: 580 YVQNKAMAVCQFDIARKSPITASEIAAQAGDADVVVFVGGISPRLEGEEMKVDAPGFKGG 639
Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
DR S+ LP Q ++ + + K V + +GG V AEA ++L Y GEAG
Sbjct: 640 DRTSIELPEAQREVIRLLRQAGKLVVFVNCSGGAVALVPEAEA---CDAVLQAWYAGEAG 696
Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
+A+A+++FGD+NP G+LP+T+Y D M GR+YR++ GT ++
Sbjct: 697 GQAVADVLFGDYNPSGKLPVTFYKSDADLPDFLDYRM--------TGRTYRYFRGTPLFP 748
Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
FG GLSYT++++ K + +L
Sbjct: 749 FGFGLSYTSFAFG--------------KPRYENGML------------------------ 770
Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
+ VTN G DG+ VV ++ + P + G P K L GF R+ A + + + P E+
Sbjct: 771 -YVEVTNTGKRDGAEVVQVYVKNPADADG-PVKTLRGFARIDLKAGERRRVEIAM-PRER 827
Query: 776 LS--IANKHGRRILPLGNHVLMVGELRHSLTIET 807
A + R+ P GNH+LMVG ++T
Sbjct: 828 FEGWDATANTMRVKP-GNHLLMVGSSSRDADLQT 860
>gi|424796589|ref|ZP_18222299.1| exported beta-glucosidase [Xanthomonas translucens pv. graminis
ART-Xtg29]
gi|422794891|gb|EKU23686.1| exported beta-glucosidase [Xanthomonas translucens pv. graminis
ART-Xtg29]
Length = 913
Score = 281 bits (719), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 160/462 (34%), Positives = 245/462 (53%), Gaps = 45/462 (9%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ +T S RA L+S +TL+EK Q+ + A AIPRLG+PAY+WW+E+LHG+A G
Sbjct: 37 YLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 93
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG--------QAGLTFW 165
T FPQ + AA+F+ L + +A++ EARA ++ GLTFW
Sbjct: 94 -------GATVFPQAIGMAATFDLPLMHEVSTAISDEARAKHHEALRHDQHARYQGLTFW 146
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
+PNINIFRDPRWGRGQET GEDP + + V FV+ QGE + G +R+
Sbjct: 147 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGADAPKNAQGDAYRK----- 201
Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
L A KH + + R+ F+A +++D +T+ P F + +++
Sbjct: 202 -------------LDATAKHFAVHSGPE---ADRHHFDAHPSQRDLYETYLPAFEALVKE 245
Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
GK +M +YN+V G A L + R+ WGF GY+ SDC A+ I++ T E
Sbjct: 246 GKVDAVMGAYNRVYGESASASKFLLRDVLRDTWGFDGYVVSDCWAIVDIWKNHKIVATRE 305
Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
+AA + G ++ CG +A+ KG + E D+D+AL L ++RLG+F+ P
Sbjct: 306 QAAALAVNNGTELECGE-EYSTLPAAVRKGLISEADVDKALQKLMYSRMRLGMFD-PPDT 363
Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
++ ++ + EH LA AR+ +VLLKND LPL++ + +A+IGP ++
Sbjct: 364 LRWAQIPLSANQSPEHDALARRTARESLVLLKND-GVLPLSRGKIKRIAVIGPTADDTMA 422
Query: 465 MGGGYTGIPCSPKSLLRGLE--AYVSKTHYASGCHDVPCNSD 504
+ G Y G P +P ++L+G+ A ++ YA G V D
Sbjct: 423 LLGNYYGTPAAPVTVLQGIREAAPDAEVLYARGADLVEGRDD 464
Score = 139 bits (351), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 96/302 (31%), Positives = 146/302 (48%), Gaps = 53/302 (17%)
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVART 556
+A+ A++AD V+ V GL E E+ DR L LP Q L+ ++ T
Sbjct: 628 LQDALDAARRADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRELLEALQGT 687
Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
K PV+ VLT G L + +A+ + +IL YPG+ G A+A+++FGD NPGGRLP+T
Sbjct: 688 GK-PVVAVLTTGSALAIDWAQ--QHVPAILLAWYPGQRGGSAVADVLFGDANPGGRLPVT 744
Query: 617 WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
+Y ES +D MR GR+YR++ GT +Y FGHGLSYT ++Y S+L
Sbjct: 745 FYKESEKLPAFDDYAMR--------GRTYRYFAGTALYPFGHGLSYTQFAY------SDL 790
Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
+ D++ L GS H + V N G G VV L+
Sbjct: 791 RL--------DRSKLATDGS-----------------LHATLKVKNTGQRAGDEVVQLYL 825
Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRR-ILPLGNHVLM 795
+ K+L GF R+ ++E+SF + P L + ++ + ++ G++ L
Sbjct: 826 HPLSPQRERARKELRGFQRIALQPGETREVSFAISPQTDLRLYDEARKAYVVDPGDYELQ 885
Query: 796 VG 797
VG
Sbjct: 886 VG 887
>gi|297738404|emb|CBI27605.3| unnamed protein product [Vitis vinifera]
Length = 581
Score = 281 bits (719), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 150/388 (38%), Positives = 212/388 (54%), Gaps = 49/388 (12%)
Query: 237 LMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYN 296
L +S+CCKH YD++ W N ++EQD ++TF PF
Sbjct: 19 LKVSSCCKHYATYDIDSWLN---------VSEQDMKETFFSPFE---------------- 53
Query: 297 QVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNY-TKTHEDSAAGVLKAGM 355
R+EW GYI SDC + I + QNY ++ D+ A L+AG+
Sbjct: 54 -----------------RDEWDLHGYIVSDCYGLEVIVDNQNYLNESKVDAVAKTLQAGL 96
Query: 356 DINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDV 415
D+ CG ++ GKV + ++DRAL N++ + +R+G F+G P Y LG D+
Sbjct: 97 DLECGHYYTDALNESVLTGKVSQYELDRALKNIYVLLMRVGYFDGIP---AYESLGLKDI 153
Query: 416 CTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCS 475
C ++H +LA +AARQGIVLLKND + LPL L ++GP N M G Y G+P
Sbjct: 154 CAADHIELAREAARQGIVLLKNDYEVLPLKPG--KKLVLVGPHANATEVMIGNYAGLPYK 211
Query: 476 PKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR 535
S L A + YA+GC D C++D F EA AK A+ I+ G DL+ E E
Sbjct: 212 YVSPLEAFSA-IGNVTYATGCLDASCSNDTYFSEAKEAAKFAEVTIIFVGTDLSIEAEFV 270
Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
DRV LLPG Q L+ VA S PVILV+ G +D++FA+ + +IS+ILW+G+PGE G
Sbjct: 271 DRVDFLLPGNQTELIKQVAEVSSGPVILVVLSGSNIDITFAKNNPRISAILWVGFPGEQG 330
Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFT 623
A+A+++FG +NPGGRLP+TWY +
Sbjct: 331 GHAIADVVFGKYNPGGRLPVTWYEADYV 358
>gi|397642422|gb|EJK75223.1| hypothetical protein THAOC_03061, partial [Thalassiosira oceanica]
Length = 534
Score = 281 bits (718), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 192/584 (32%), Positives = 291/584 (49%), Gaps = 105/584 (17%)
Query: 239 LSACCKHLIAYDLEKWGNFSRYSFNA-MITEQDTEDTFQPPFRSCIEQGK---------- 287
++A CKHL AY LE R++F+A I D E T+ P F +C+ +
Sbjct: 10 IAATCKHLAAYSLET----DRFNFSADGIDRTDWEGTYLPAFDACVHAERFLLEHYNASG 65
Query: 288 ---------ASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQ 337
A +MCSYN ++GVPAC L + R +W F G + SDC AV I
Sbjct: 66 GGGGGQDRGALGVMCSYNAIDGVPACADPALLKDMLRRDWNFTGLVVSDCWAVDNIHSNH 125
Query: 338 NYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
+ ++E++ L++G+D++CG + A D+ + E DID AL LF V + LG
Sbjct: 126 RFVASYEEAVGLALRSGVDLDCGNTFQDFGRLAYDESLLDEDDIDEALSRLFRVLMDLGY 185
Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKN-----DKKFLPLNKNAVSSL 452
F D K D++ EH +LAL+AA Q IVLLKN + LPL+ +
Sbjct: 186 F--DETDEPDAKSSDDEM---EHDQLALEAALQSIVLLKNGINEDEPGPLPLSLAKHKEI 240
Query: 453 AIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAY---VSKTHYASGCHDVPCNSDAGFHE 509
A+ GPL +N + + G Y G+P + + L GL V+ AS C FH
Sbjct: 241 ALFGPLADNQTVLLGNYHGLPSTIVTPLMGLAKMGVEVAFRQRASVCD---------FH- 290
Query: 510 AVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSK---RPVILVLT 566
I+V GLD + E ED+DR +LLLP +Q L+ +++R SK PV+LV+
Sbjct: 291 ------GESATILVVGLDQSLEAEDQDRTTLLLPVEQRDLIKTISRCSKVRDLPVVLVVV 344
Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKV 625
GG +D+S + S I +++ + YPG+ G ALA++++G +NP G+L T YPES+ +V
Sbjct: 345 SGGMVDLSRYKNSSDIDAMIHMSYPGQNGGSALAQVLYGAYNPSGKLVGTMYPESYLNEV 404
Query: 626 PMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSY--KFLSAPSELTISASLK 683
++DM MR D ++PGR++R+Y G +Y FG+GLSYT++ Y +FL ++T+S
Sbjct: 405 SLHDMRMRPDG--KFPGRTHRYYRGDVIYPFGYGLSYTSFRYAMEFLGGTVKVTVS---- 458
Query: 684 AGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKV-S 742
N+G +DGS V+LF P+ +
Sbjct: 459 --------------------------------------NSGSMDGSVAVLLFHSAPQAGN 480
Query: 743 QGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRI 786
+ P + LIGF++++ S+ +SF V A H RI
Sbjct: 481 EQEPFRSLIGFEKIYVSVGDSQLVSFDVSKRMNPGEAGSHTFRI 524
>gi|317477144|ref|ZP_07936385.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
gi|316906687|gb|EFV28400.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
Length = 814
Score = 280 bits (717), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 228/743 (30%), Positives = 350/743 (47%), Gaps = 134/743 (18%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP + E HG + G T FP + A+++N L +G A+A
Sbjct: 149 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRRMGRAIAT 196
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
EA A Q + P +++ RDPRW R +ET GED + VK FQGE
Sbjct: 197 EASA-----QGAHIGYGPVLDLARDPRWSRVEETYGEDAYLNGVMGAALVKGFQGE---- 247
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
F + +V+ A KH AY W +A + +
Sbjct: 248 ------FPRTKGKVI----------------ATLKHFAAYG---WTEGGHNGGSAHVGNR 282
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCD 328
+ E+ PPFR + G A +M SYN+++G+P +L + W FKG++ SD
Sbjct: 283 EMEEAIYPPFREAVAAG-ALSVMSSYNEIDGIPCTANSNLLTGLLKKRWQFKGFVVSDLY 341
Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLR-HTQSAIDKGKVQEKDIDRALLN 387
A+ + E+ T ++A + AG+D + GT + +A+ +G VQE I++A+
Sbjct: 342 AIGGLREH-GVADTDYEAAVKAVNAGVDSDLGTNVYAGQLVNAVKRGDVQEVVINKAVSR 400
Query: 388 LFSVQLRLGLFNG---DPRKGKYGKLGPDDVCTS-EHKKLALDAARQGIVLLKNDKKFLP 443
+ +++ +GLF+ D R+ P+ V S EH +LA + ARQ I+LLKN + LP
Sbjct: 401 ILALKFHMGLFDHPFVDERE-------PEQVVASTEHLELAREVARQSIILLKNKNELLP 453
Query: 444 LNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSL---LRGLEAYVSK-TH--YASGCH 497
LNK + ++A+IGP +NI M G YT P S S+ L G+ VS TH YA GC
Sbjct: 454 LNKK-MKTIAVIGPNADNIYNMLGDYTA-PQSESSVVTVLDGIRQKVSNDTHIIYAKGCA 511
Query: 498 DVPCNSDAGFHEAVRIAKKADFVIVVAG----LDLTQETEDR------------------ 535
V +S +GF EA+ A+++D V++V G D + + E+
Sbjct: 512 -VRDSSKSGFQEAIEAARQSDVVVMVMGGSSARDFSSKYEETGAAKVSDSHISDMESGEG 570
Query: 536 -DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEA 594
DR +L L G+Q L+ V + +K P++LVL G PL + EA ++ +I+ YPG
Sbjct: 571 YDRSTLELLGRQRELIREVGKLNK-PIVLVLIKGRPLLLEGIEA--EVDAIVDAWYPGMQ 627
Query: 595 GAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVY 654
G A+A+++FGD+NP GRL ++ P S ++P+ R + +Y GT Y
Sbjct: 628 GGNAVADVLFGDYNPAGRLTIS-VPRSVGQLPVYYNTKRKGNRSKYIEEE-----GTPRY 681
Query: 655 GFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRF 714
FG+GLSYT+++Y L A + V + S
Sbjct: 682 PFGYGLSYTSFNYSDLKA-------------------------------EVVEAEDSCLV 710
Query: 715 HVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCE 774
++ + V N G DG VV L+ R S TP KQL GF R+H +KEI+F +D +
Sbjct: 711 NISVKVRNEGSRDGDEVVQLYLRDEVASFTTPFKQLCGFQRIHLKVGETKEITFRLDK-K 769
Query: 775 QLSIANKHGRRILPLGNHVLMVG 797
L++ ++ + G LM+G
Sbjct: 770 SLALYMQNEEWAVEPGRFTLMLG 792
>gi|399029285|ref|ZP_10730258.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
gi|398072895|gb|EJL64089.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
Length = 871
Score = 280 bits (716), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 162/459 (35%), Positives = 249/459 (54%), Gaps = 56/459 (12%)
Query: 50 DSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASN 109
+++ F N +L+ R L+S +++ EKI QL D++ AI RLG+P Y WW+ESLHG+A
Sbjct: 22 ENFAFKNPNLTTEQRVDDLVSRMSIDEKISQLMDSSPAIERLGVPEYNWWNESLHGVARA 81
Query: 110 GPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN----LGQ----AG 161
G T FPQ + A+S++R L ++ + ++ EARA ++ GQ G
Sbjct: 82 G----------YATVFPQSISIASSWDRQLIFDVANVISDEARAKHHEYLRRGQHGMYQG 131
Query: 162 LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREK 221
LTFW+PN+NIFRDPRWGRG ET GEDP + +++V QG N K
Sbjct: 132 LTFWSPNVNIFRDPRWGRGHETYGEDPFLTGQLGLKYVNGLQGTNEKY------------ 179
Query: 222 RVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRS 281
L + A KH Y + SR+ FNA ++ D +T+ P FR+
Sbjct: 180 ---------------LKVIATAKH---YAVHSGPEPSRHLFNAETSDIDLYETYLPAFRT 221
Query: 282 CIEQGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTK 341
+++G +M +YN+ G LF RN WGF GYI SDC AV I++Y T
Sbjct: 222 LVKEGHVYSVMGAYNRFRGESCSASPFLFNILRNVWGFDGYIVSDCGAVTDIWKYHKITG 281
Query: 342 THEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
++A LK G+D+ CG+ + + AID+ + E DID A+ LF+ + +LG+F D
Sbjct: 282 DAATASALALKDGLDLECGSS-FKSLKEAIDRKLISEADIDIAVKRLFTARFKLGMF--D 338
Query: 402 PRK-GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVN 460
P + Y ++ S H LA A+++ IVLLKN LPL+++ + ++A+IGP N
Sbjct: 339 PEEIVSYAQIPYSVNNNSAHDWLARVASQKSIVLLKNQNNTLPLSRD-IKTVAVIGPNAN 397
Query: 461 NISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGC 496
++ + G Y+G+P +P ++L+G++ + +K YA G
Sbjct: 398 DVQSLWGNYSGVPSNPITVLKGIQNKLEPNTKVLYAKGT 436
Score = 155 bits (393), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 98/301 (32%), Positives = 150/301 (49%), Gaps = 54/301 (17%)
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVART 556
EAV++A +AD +++V GL+ E E+ DR SL LP Q L+ ++ T
Sbjct: 589 LQEAVQVAGQADAIVLVLGLNERLEGEEMKVEADGFEGGDRTSLDLPSNQEELMKAMTAT 648
Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
K PVILVL G L +++A + + +IL GYPG+ G A+A+++FGD+NP GRLP+T
Sbjct: 649 GK-PVILVLINGSALSINWA--NDHVPAILTAGYPGQQGGNAIADVLFGDYNPAGRLPVT 705
Query: 617 WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
+Y + + +M+ GR+YR++ +Y FG GLSYT + Y L P+ +
Sbjct: 706 YYKSTEQLPAFENYDMK--------GRTYRYFQKKPLYPFGFGLSYTKFKYSNLKLPTNV 757
Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
T F + + VTN G+ DG V+ L+
Sbjct: 758 TPEKD--------------------------------FEILVDVTNIGERDGDEVIELYL 785
Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
+ K S P QL GF+RV+ +K + F + P QLS+ NK G+R++ G + V
Sbjct: 786 KDEKASTPRPILQLEGFERVNLKKGETKTVRFTITP-RQLSLINKKGQRVIEPGWFTISV 844
Query: 797 G 797
G
Sbjct: 845 G 845
>gi|189464211|ref|ZP_03012996.1| hypothetical protein BACINT_00548 [Bacteroides intestinalis DSM
17393]
gi|189438001|gb|EDV06986.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 814
Score = 280 bits (716), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 229/743 (30%), Positives = 349/743 (46%), Gaps = 134/743 (18%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP + E HG + G T FP + A+++N L +G A+A
Sbjct: 149 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRRMGRAIAT 196
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
EA A Q + P +++ RDPRW R +ET GED + VK FQGE
Sbjct: 197 EASA-----QGAHIGYGPVLDLARDPRWSRVEETYGEDAYLNGVMGAALVKGFQGE---- 247
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
F + +V+ A KH AY W +A + +
Sbjct: 248 ------FPRTKGKVI----------------ATLKHFAAYG---WTEGGHNGGSAHVGNR 282
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKARNE-WGFKGYITSDCD 328
+ E+ PPFR + G A +M SYN+++G+P +L E W FKG++ SD
Sbjct: 283 EMEEAIYPPFREAVAAG-ALSVMSSYNEIDGIPCTANSNLLTGLLKERWQFKGFVVSDLY 341
Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLR-HTQSAIDKGKVQEKDIDRALLN 387
A+ + E+ T ++A + AG+D + GT + +A+ +G VQE I++A+
Sbjct: 342 AIGGLREH-GVADTDYEAAVKAVNAGVDSDLGTNVYAGQLVNAVKRGDVQEVVINKAVSR 400
Query: 388 LFSVQLRLGLFNG---DPRKGKYGKLGPDDVCTS-EHKKLALDAARQGIVLLKNDKKFLP 443
+ +++ +GLF+ D R+ P+ V S EH +LA + ARQ I+LLKN + LP
Sbjct: 401 ILALKFHMGLFDHPFVDERE-------PEQVVASTEHLELAREVARQSIILLKNKNELLP 453
Query: 444 LNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSL---LRGLEAYVSK-TH--YASGCH 497
LNK ++A+IGP +NI M G YT P S S+ L G+ VS TH YA GC
Sbjct: 454 LNKK-TKTIAVIGPNADNIYNMLGDYTA-PQSESSVVTVLDGIRQKVSNDTHIIYAKGCA 511
Query: 498 DVPCNSDAGFHEAVRIAKKADFVIVVAG----LDLTQETEDR------------------ 535
V +S +GF EA+ A+++D V++V G D + + E+
Sbjct: 512 -VRDSSKSGFQEAIEAARQSDVVVMVMGGSSARDFSSKYEETGAAKVSDSHISDMESGEG 570
Query: 536 -DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEA 594
DR +L L G+Q L+ V + +K P++LVL G PL + EA ++ +I+ YPG
Sbjct: 571 YDRSTLELLGRQRELIREVGKLNK-PIVLVLIKGRPLLLEGIEA--EVDAIVDAWYPGMQ 627
Query: 595 GAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVY 654
G A+A+++FGD+NP GRL ++ P S ++P+ R + +Y GT Y
Sbjct: 628 GGNAVADVLFGDYNPAGRLTIS-VPRSVGQLPVYYNTKRKGNRSKYIEEE-----GTPRY 681
Query: 655 GFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRF 714
FG+GLSYT+++Y L A + V + S
Sbjct: 682 PFGYGLSYTSFNYSDLKA-------------------------------EVVEAEDSCLV 710
Query: 715 HVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCE 774
++ + V N G DG VV L+ R S TP KQL GF R+H +KEI+F +D +
Sbjct: 711 NISVKVRNEGSRDGDEVVQLYLRDEVASFTTPFKQLCGFQRIHLKVGETKEITFRLDK-K 769
Query: 775 QLSIANKHGRRILPLGNHVLMVG 797
L++ ++ + G LM+G
Sbjct: 770 SLALYMQNEEWAVEPGRFTLMLG 792
>gi|225873993|ref|YP_002755452.1| beta-xylosidase B [Acidobacterium capsulatum ATCC 51196]
gi|225791521|gb|ACO31611.1| beta-xylosidase B [Acidobacterium capsulatum ATCC 51196]
Length = 894
Score = 280 bits (715), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 165/454 (36%), Positives = 232/454 (51%), Gaps = 61/454 (13%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ N SL RA+ L+S +TL+EK QL + A AIPRL +PAY WWSE+LHG+A NG
Sbjct: 39 YLNPSLPPVVRARDLVSRMTLKEKASQLVNAARAIPRLKVPAYNWWSEALHGVAVNG--- 95
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
T FP+ + A+F+ + + E R +Y + GL FW
Sbjct: 96 --------TTEFPEPIGLGATFDVPAIHEMAVDIGTEGRVVYEENEKDGSSKIFHGLDFW 147
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
APN+NIFRDPRWGRGQET GEDP + V FV QG+N K RV+
Sbjct: 148 APNLNIFRDPRWGRGQETYGEDPFLTGKMGVAFVSGMQGDNPKY-----------YRVI- 195
Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
A KH +D+ +R+ + ++ D DT++P FR+ I Q
Sbjct: 196 ---------------ATPKH---FDVHSGPEPTRHFADVDVSLHDQLDTYEPAFRAAIMQ 237
Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
G A +MCSYN +NG PAC Q + R WGFKGY+ SDCDAV I+ Y T
Sbjct: 238 GHADSVMCSYNAINGQPACANQFTLQHQLRGAWGFKGYVVSDCDAVHDIYSGHKYRPTLA 297
Query: 345 DSAAGVLKAGMDINC-------GTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
+AA ++ GMD +C G + A+ +G + ++ +D AL+ LF+ +++LGL
Sbjct: 298 QAAAISMERGMDNDCADFAQPKGDDDYKAYIDAVQQGYLSQQAMDTALVRLFTARIKLGL 357
Query: 398 FNGDPR-KGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIG 456
F DP+ Y ++ + H+ A A + +VLLKND LPL +V S+A++G
Sbjct: 358 F--DPKGMDPYADTPHSELNSPAHRAYARKLADESMVLLKNDGT-LPLKPGSVHSIAVVG 414
Query: 457 PLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKT 490
PL + + + G Y G+P S L GL A T
Sbjct: 415 PLADQTAVLLGNYNGVPTHTVSFLEGLRAEYPNT 448
Score = 113 bits (282), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 83/263 (31%), Positives = 129/263 (49%), Gaps = 46/263 (17%)
Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
DR +L +P + +LV +VA+T K PV++VL G L V++ +++L Y GE G
Sbjct: 652 DRTNLQMPEPEEALVEAVAKTGK-PVVVVLMNGSALAVNWIS--QHANAVLEAWYSGEEG 708
Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVY 654
A+A+ + G +P GRLP+T+Y +S ++P D +M R+YR++ G +Y
Sbjct: 709 GAAIADTLSGKNDPAGRLPVTFY-KSVNQLPNFEDYSME--------NRTYRYFKGKPLY 759
Query: 655 GFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRF 714
FG+GLSYT + Y LS P A++ AG
Sbjct: 760 PFGYGLSYTTFRYSDLSIP-----HATVDAGQP--------------------------V 788
Query: 715 HVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCE 774
+VTN G V G VV L+ + PKV G P+ L GF R+H S+++ F +
Sbjct: 789 EASATVTNTGKVAGDEVVQLYLKFPKVD-GAPDIALRGFQRIHLEPGQSQQVHFELK-KR 846
Query: 775 QLSIANKHGRRILPLGNHVLMVG 797
LS+ G+ I+ G++ L +G
Sbjct: 847 DLSMVTALGQIIVAQGDYTLSIG 869
>gi|285016879|ref|YP_003374590.1| beta-glucosidase [Xanthomonas albilineans GPE PC73]
gi|283472097|emb|CBA14604.1| putative beta-glucosidase protein [Xanthomonas albilineans GPE
PC73]
Length = 914
Score = 280 bits (715), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 159/462 (34%), Positives = 245/462 (53%), Gaps = 45/462 (9%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ ++ + + RA L++ +TL+EK+ Q+ + A AIPRLG+PAY+WW+E LHG+A G
Sbjct: 34 YLDSQRTFAQRADDLVARMTLEEKVAQMQNAAPAIPRLGVPAYDWWNEGLHGVARAG--- 90
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG--------QAGLTFW 165
T FPQ + AA+F+ L + +A++ EARA ++ GLTFW
Sbjct: 91 -------GATVFPQAIGLAATFDLPLMHEVSTAISDEARAKHHEALRRGEHGRYQGLTFW 143
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
+PNINIFRDPRWGRGQET GEDP + + V FV+ QGE + G +R+
Sbjct: 144 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGMQGEGADAPKNAQGETYRK----- 198
Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
L A KH + + R+ F+A +++D +T+ P F + +++
Sbjct: 199 -------------LDATAKH---FAVHSGPESERHHFDARPSQRDLYETYLPAFEALVKE 242
Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
GK +M +YN++ G A L + R WGF GY+ SDC A+ I++ T E
Sbjct: 243 GKVDAVMGAYNRLFGESASASKFLLRDVLRERWGFHGYVVSDCWAIVDIWKNHKIVATRE 302
Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
+AA +K G + CG +A+ +G + E DID AL L + ++RLG+F+ P +
Sbjct: 303 QAAALAVKNGTQLECGQEYAT-LPAAVQQGLIGETDIDAALRTLMTARMRLGMFD-PPGQ 360
Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
++ +L + EH LA AR+ +VLLKND LPL++ +A+IGP ++
Sbjct: 361 LRWAQLPISVNQSPEHDALARRTARESLVLLKND-GLLPLSRAKHKRIAVIGPTADDTMA 419
Query: 465 MGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGCHDVPCNSD 504
+ G Y G P +P ++L+G+ A + YA G V SD
Sbjct: 420 LLGNYYGTPATPVTILQGIRAAAPDADVLYARGADLVEGRSD 461
Score = 137 bits (346), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 98/304 (32%), Positives = 143/304 (47%), Gaps = 57/304 (18%)
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVART 556
EA+ A++AD V+ V GL E E+ DR L LP Q L+ +++ T
Sbjct: 625 LQEALDTARRADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRELLQALSAT 684
Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
K PV+ VLT G L + +A+ + +IL YPG+ G A+A+++FGD NPGGRLP+T
Sbjct: 685 GK-PVVAVLTTGSALAIDWAQ--EHVPAILLAWYPGQRGGSAVADVLFGDTNPGGRLPVT 741
Query: 617 WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
+Y S T +D MR GR+YR++ GT +Y FGHGLSYT ++Y L
Sbjct: 742 FYKASETLPAFDDYAMR--------GRTYRYFAGTPLYPFGHGLSYTQFAYSDL------ 787
Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
RLD + + + VTN G G VV L+
Sbjct: 788 --------------------RLDRRKV-----AADGQLSATLKVTNTGTRAGDEVVQLYL 822
Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI---ANKHGRRILPLGNHV 793
++ K+L GF R+ S+++ F + P L I A KH ++ G++
Sbjct: 823 HPLAPTRARAIKELRGFQRIALAPGESRDVHFTISPQTDLRIYDEAQKH--YVVDPGDYE 880
Query: 794 LMVG 797
L VG
Sbjct: 881 LQVG 884
>gi|150002739|ref|YP_001297483.1| glycoside hydrolase family protein [Bacteroides vulgatus ATCC 8482]
gi|294776994|ref|ZP_06742455.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
vulgatus PC510]
gi|149931163|gb|ABR37861.1| glycoside hydrolase family 3, candidate beta-glycosidase
[Bacteroides vulgatus ATCC 8482]
gi|294449242|gb|EFG17781.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
vulgatus PC510]
Length = 788
Score = 280 bits (715), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 222/736 (30%), Positives = 345/736 (46%), Gaps = 129/736 (17%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ + T FP A++N+ L + IG V
Sbjct: 138 RLGIPV-DFTNEGIRGLCHD-----------RATYFPAQCGQGATWNKKLIARIGE---V 182
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
EA+ LG + ++P ++I +DPRWGR ET GEDP +V + + S Q N
Sbjct: 183 EAKEAVALGYTNI--YSPILDIAQDPRWGRCVETYGEDPYLVGELGKQMITSLQKYN--- 237
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
L A KH Y + G + + + +
Sbjct: 238 -----------------------------LVATPKHFAVYSIPIGGRDGKTRTDPHVAPR 268
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGD---LFQKARNEWGFKGYITSD 326
+ + PFR ++ A +M SYN +G P + G L + R EWGFKGY+ SD
Sbjct: 269 EMRTLYIEPFRMAFQEAGALGVMSSYNDYDGEP--ITGSYHFLTEILRQEWGFKGYVVSD 326
Query: 327 CDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTC----MLRHTQSAIDKGKVQEKDID 382
+AV I T+ED A + AG++I + + A+D GK+ ++ +D
Sbjct: 327 SEAVEFISNKHKVADTYEDGIAQAVNAGLNIRTHFTPPADFILPLRKAVDNGKISQETLD 386
Query: 383 RALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFL 442
+ + + ++ RLGLF+ +P +G GK V + EH+ ++L+AARQ +VLLKN+ L
Sbjct: 387 KRVAEILRIKFRLGLFD-NPYRGN-GKQAEQIVHSKEHQAVSLEAARQSLVLLKNETNLL 444
Query: 443 PLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGCH--- 497
PL+K ++ S+A+IGP N +Q+ Y K++ +G++ + T Y GC
Sbjct: 445 PLSK-SIRSIAVIGPNANEQTQLICRYGPANAPIKTVYQGIKELLPHTEVIYKKGCDIID 503
Query: 498 ---------DVPCNSDAG--FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQ 546
D P ++ EA+R AK+A+ V++V G + EDR R SL LPG+Q
Sbjct: 504 PHFPESEILDFPKTAEEVQLMEEAIRAAKQAEVVVMVLGGNELTVREDRSRTSLNLPGRQ 563
Query: 547 MSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD 606
L+ +V T K P+ILV+ G +++A A I +IL +PGE +A+AE +FGD
Sbjct: 564 EELLKAVCATGK-PIILVMLDGRASSINYAAA--HIPAILHAWFPGEFCGQAVAEALFGD 620
Query: 607 FNPGGRLPMTWYPESFTKVPMN-DMNMRADSSRQYPGRSYRFYTGTQVYG----FGHGLS 661
+NPGGRL +T +P+S ++P +D S + T VYG FGHGLS
Sbjct: 621 YNPGGRLAVT-FPKSVGQIPFAFPFKPGSDES-----------SSTSVYGALYPFGHGLS 668
Query: 662 YTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVT 721
YT ++Y S+L IS S G + D HV +
Sbjct: 669 YTTFTY------SDLHISPS-----------HQGVQGD--------------IHVSCKIK 697
Query: 722 NAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANK 781
N G + G VV L+ R S T K L GF+R+ A + + F + P + L + +K
Sbjct: 698 NTGKIKGDEVVQLYLRDEISSVTTYTKVLRGFERISLKAGEEQTVHFRLRP-QDLGLWDK 756
Query: 782 HGRRILPLGNHVLMVG 797
+ + LG+ +M+G
Sbjct: 757 NMNFRVELGSFKVMLG 772
>gi|399030621|ref|ZP_10730998.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
gi|398071229|gb|EJL62496.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
Length = 876
Score = 279 bits (714), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 160/445 (35%), Positives = 232/445 (52%), Gaps = 53/445 (11%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
F N LS R L++ LTL+EK+ Q+ +++ AIPRL IPAY+WW+E+LHG+A
Sbjct: 31 FQNPDLSFEKRVDDLVNRLTLEEKVSQMLNSSPAIPRLDIPAYDWWNETLHGVAR----- 85
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTF 164
T VT +PQ + AA+F+++ + A+E RA+YN GLT+
Sbjct: 86 ----TPFKVTVYPQAIAMAATFDKNSLYKMADFSALEGRAIYNKAVESGRTNERYLGLTY 141
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
W PNINIFRDPRWGRGQET GEDP + FVK QG++ K
Sbjct: 142 WTPNINIFRDPRWGRGQETYGEDPYLTGVLGDSFVKGLQGDDPKY--------------- 186
Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIE 284
L +AC KH Y + R++F+ +T + DT+ P F+ +
Sbjct: 187 ------------LKAAACAKH---YAVHSGPEPLRHTFDVDVTPYELWDTYLPAFQKLVT 231
Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTH 343
+ K + +MC+YN P C L RN+W F+GY+TSDC A+ F+
Sbjct: 232 ESKVAGVMCAYNAFRTQPCCASDILMTDILRNQWKFEGYVTSDCWAIDDFFKNHKTHPDA 291
Query: 344 EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPR 403
E ++A + G DI+CGT + A+ GK+ EK ID ++ LF ++ RLG+F DP
Sbjct: 292 ESASADAVFHGTDIDCGTDAYKALVQAVKDGKISEKQIDISVKRLFMIRFRLGMF--DPV 349
Query: 404 KG-KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNI 462
+ KY + + EHK AL ARQ IVLL+N+ K LPL+K + + ++GP V+N
Sbjct: 350 EMVKYAQTPTSVLENDEHKAHALKMARQSIVLLRNENKTLPLSKK-LKKIVVLGPNVDNA 408
Query: 463 SQMGGGYTGIPCSPKSLLRGLEAYV 487
+ G Y G P ++L G++ V
Sbjct: 409 IAILGNYNGTPSKLTTVLEGIKEKV 433
Score = 120 bits (300), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 92/307 (29%), Positives = 143/307 (46%), Gaps = 61/307 (19%)
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVART 556
F+ V K AD + V G+ E E+ DR S+LLP Q L+ ++ +T
Sbjct: 598 FNTLVNRVKDADAFVFVGGISPQLEGEEMKVNFPGFKGGDRTSILLPKIQTDLMKAL-KT 656
Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
+ +P++ V+ G + + + EA++ I +I Y G+A A+A+++FG++NP GRLP+T
Sbjct: 657 TGKPIVFVMMTGSAIAIPW-EAEN-IPAIANAWYGGQAAGTAVADVLFGNYNPAGRLPVT 714
Query: 617 WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
+Y P D M R+YR++ G +YGFG+GLSYT + Y L
Sbjct: 715 FYKSDADLSPFVDYKMD--------NRTYRYFKGKPLYGFGYGLSYTTFKY------DNL 760
Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
I+ S+ G KN+ + + VTN G V G VV L+
Sbjct: 761 KIAPSVIKG--KNV------------------------PITVKVTNTGKVSGEEVVQLYV 794
Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
+ P K L GF+R+ A SK I+F + P E LS I GNH
Sbjct: 795 INQNTAIKAPLKTLKGFERISLKAGKSKTITFTLSP-EDLSY-------ITAEGNHQQYN 846
Query: 797 GELRHSL 803
G+++ ++
Sbjct: 847 GKIKIAI 853
>gi|288927072|ref|ZP_06420962.1| beta-glucosidase [Prevotella buccae D17]
gi|288336152|gb|EFC74543.1| beta-glucosidase [Prevotella buccae D17]
Length = 866
Score = 279 bits (714), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 155/457 (33%), Positives = 249/457 (54%), Gaps = 44/457 (9%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
YP+ N LS RA+ L S LTL+EK + + +++ AIPRLGIP +EWWSE+LHGIA NG
Sbjct: 23 YPYQNPRLSSQERAEDLCSRLTLEEKTKLMRNSSPAIPRLGIPQFEWWSEALHGIARNG- 81
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLT 163
T FPQ AAS++ L ++ A + EA A NL + G++
Sbjct: 82 ---------FATVFPQTTAMAASWDDELLYHVFCAASDEAVAKNNLARKSGDIKRYQGVS 132
Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
W PNINIFRDPRWGRGQET GEDP + S + V QG+ ++ D
Sbjct: 133 IWTPNINIFRDPRWGRGQETYGEDPYLTSRMGLAVVNGLQGQPFRRD------------- 179
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNA-MITEQDTEDTFQPPFRSC 282
++ F E L AC KH + +W +R+ F+ + E+D +T+ P F+S
Sbjct: 180 MRPFTERPRYYKTL---ACAKHYAVHSGPEW---NRHVFDVERLPERDLWETYLPAFKSL 233
Query: 283 IEQGKASCIMCSYNQVNGVPACLRGD-LFQKARNEWGFKGYITSDCDAVATIFE--YQNY 339
+++G +MC+Y +++G P C L Q R EW + G + SDC A++ + + +
Sbjct: 234 VQEGNVREVMCAYQRIDGSPCCGNTRYLHQILRGEWEYNGLVVSDCGAISDFYREGHHHV 293
Query: 340 TKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
+T +++A ++AG D+ CG + A+++G + + ID +++ L + +G F+
Sbjct: 294 VETPAEASAMGVRAGTDVECG-AVYATLPRAVEQGLISREAIDTSVVRLLKARFEVGDFD 352
Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
+ + + GP+ + + H++LALD AR+ + LL+N + LPL+KN + +A++GP
Sbjct: 353 SE-KLVPWKLTGPEVIASETHRRLALDMARESMTLLQNRNRLLPLSKNGL-RIAVMGPNA 410
Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGC 496
N+ + G YTG P S ++L+G+ + V + GC
Sbjct: 411 NDSVMLWGNYTGYPISTTTILKGIRSKVPAARFVEGC 447
Score = 102 bits (254), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 91/334 (27%), Positives = 141/334 (42%), Gaps = 65/334 (19%)
Query: 486 YVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR---------- 535
YV A D+ S E A AD V+ V G+ E E+
Sbjct: 580 YVQNKAMAVCQFDIARKSPITASEIAAQAGDADVVVFVGGISPRLEGEEMKVDAPGFKGG 639
Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
DR S+ LP Q ++ + + K V + +GG V EA ++L Y GEAG
Sbjct: 640 DRTSIELPEAQREVIRLLRQAGKLVVFVNCSGGAVALVPETEA---CDAVLQAWYAGEAG 696
Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
+A+A+++FGD+NP G+LP+T+Y D M GR+YR++ G ++
Sbjct: 697 GQAVADVLFGDYNPSGKLPVTFYKSDADLPDFLDYRM--------TGRTYRYFRGIPLFP 748
Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
FG GLSYT++++ G + + G
Sbjct: 749 FGFGLSYTSFAF-----------------GKPR---YENGK------------------- 769
Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
+ + VTN G DG+ VV ++ + P + G P K L GF R+ A + + + P E+
Sbjct: 770 LYVEVTNTGKRDGAEVVQVYVKNPADADG-PVKTLRGFARIDLKAGERRRVEIAM-PRER 827
Query: 776 LS--IANKHGRRILPLGNHVLMVGELRHSLTIET 807
A + R+ P GNH+LMVG ++T
Sbjct: 828 FEGWDATTNTMRVKP-GNHLLMVGSSSRDADLQT 860
>gi|238620766|ref|YP_002915592.1| glycoside hydrolase family protein [Sulfolobus islandicus M.16.4]
gi|238381836|gb|ACR42924.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
M.16.4]
Length = 755
Score = 278 bits (712), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 227/769 (29%), Positives = 357/769 (46%), Gaps = 142/769 (18%)
Query: 74 LQEKIQQ-LSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSA 132
L KIQ+ L +N RL IPA E L G+ V + T+FPQ + A
Sbjct: 68 LTNKIQKFLMENT----RLKIPAI-VHEECLTGL-----------MVKTATAFPQAIGLA 111
Query: 133 ASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVS 192
++++ L + S + +A+ + Q +P +++ RDPRWGR +ET GED +V+
Sbjct: 112 STWDPDLIREVSSTIRYQAK-LIGTNQC----LSPVLDVCRDPRWGRCEETYGEDQYLVA 166
Query: 193 AYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLE 252
+ + +VK QGEN L A KH A+
Sbjct: 167 SIGLAYVKGLQGENE-------------------------------LIATVKHFAAHGFP 195
Query: 253 KWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
+ G R + ++ + F PF I+ GKA +M +Y++++G+P +L K
Sbjct: 196 EGG---RNIAPVHVGNRELREVFLFPFEVAIKLGKAMSVMPAYHEIDGIPCHSNAELLTK 252
Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGT--CMLRHTQS 369
R EWGF+G + SD DA+ + + +++A L+AG+D C
Sbjct: 253 ILRQEWGFEGIVVSDYDAIRQLEAIHKVSLNKKEAAILALEAGVDTEFPNIDCFGEPLLE 312
Query: 370 AIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAAR 429
A+ +G + E IDRA+ + ++ +LGLFN D + P+ + S+ ++LALD AR
Sbjct: 313 AVKEGLISESIIDRAVERVLRIKEKLGLFN-DHYINENNV--PEKLDNSKSRELALDVAR 369
Query: 430 QGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT-------GIPCSPKSLLRG 482
+ IVLLKND LPLNKN + ++A+IGP N + G YT + ++L G
Sbjct: 370 KSIVLLKNDN-ILPLNKN-IGTIAVIGPNANEPRNLLGDYTYTGHLNADVGIEVVTVLEG 427
Query: 483 LEAYVSKTH---YASGCHDVPCNSDAGFHEAVRIAKKADFVIVV----AGLDLTQET--- 532
+ VS YA GC D+ S GF EA+ IAKK D +I V +GL L+
Sbjct: 428 IMRKVSNNTNVLYAKGC-DIAAESKEGFSEAIEIAKKGDIIIAVMGEKSGLPLSWTDVPG 486
Query: 533 ------------EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS--FAEA 578
E DR SL LPG Q L+ + +T K P+ILVL G PL +S F E
Sbjct: 487 KDEFEKYQAVTGEGNDRTSLRLPGVQEELLKELHKTGK-PIILVLVNGRPLALSSIFNEV 545
Query: 579 DSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSR 638
++ I + W +PGE G A+A++IFGD+NP GRLP++ +P ++P+ N + S
Sbjct: 546 NAIIDA--W--FPGEEGGNAIADVIFGDYNPSGRLPIS-FPIDTGQIPIY-YNRKPSSL- 598
Query: 639 QYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSA-PSELTISASLKAGSDKNILQQTGSR 697
R Y ++ FG+GLSYT + Y L P E+ S +K
Sbjct: 599 ----RPYVMMKSKPLFPFGYGLSYTEFKYSNLEVTPKEVNSSGKIK-------------- 640
Query: 698 LDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVH 757
+ + V N G +G V L+ P K+L GF +V+
Sbjct: 641 ------------------ISLEVENVGKREGEETVQLYISKQYSGVSRPIKELKGFAKVY 682
Query: 758 TVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
++I+F + P E L+ +++ R I+ G++ +++G+ + ++
Sbjct: 683 LKPNEKRKITFSL-PLEALAFYDQYMRLIIDTGDYEILIGKSSEDIVLK 730
>gi|227828570|ref|YP_002830350.1| glycoside hydrolase [Sulfolobus islandicus M.14.25]
gi|229585800|ref|YP_002844302.1| glycoside hydrolase family protein [Sulfolobus islandicus M.16.27]
gi|227460366|gb|ACP39052.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
M.14.25]
gi|228020850|gb|ACP56257.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
M.16.27]
Length = 755
Score = 278 bits (711), Expect = 8e-72, Method: Compositional matrix adjust.
Identities = 229/771 (29%), Positives = 356/771 (46%), Gaps = 146/771 (18%)
Query: 74 LQEKIQQ-LSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSA 132
L KIQ+ L +N RL IPA E L G+ V + T+FPQ + A
Sbjct: 68 LTNKIQKFLMENT----RLKIPAI-VHEECLTGL-----------MVKTATAFPQAIGLA 111
Query: 133 ASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVS 192
++++ L + S + +A+ + Q +P +++ RDPRWGR +ET GED +V+
Sbjct: 112 STWDPDLIREVSSTIRYQAK-LIGTNQC----LSPVLDVCRDPRWGRCEETYGEDQYLVA 166
Query: 193 AYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLE 252
+ + +VK QGEN L A KH A+
Sbjct: 167 SIGLAYVKGLQGENE-------------------------------LIATVKHFAAHGFP 195
Query: 253 KWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
+ G R + ++ + F PF I+ GKA +M +Y++++G+P +L K
Sbjct: 196 EGG---RNIAPVHVGNRELREVFLFPFEVAIKLGKAMSVMPAYHEIDGIPCHSNAELLTK 252
Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGT--CMLRHTQS 369
R EWGF+G + SD DA+ + + +++A L+AG+D C
Sbjct: 253 ILRQEWGFEGIVVSDYDAIRQLEAIHKVSLNKKEAAILALEAGVDTEFPNIDCFGEPLLE 312
Query: 370 AIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAAR 429
A+ +G + E IDRA+ + ++ +LGLFN P+ + S+ ++LALD AR
Sbjct: 313 AVKEGLISESIIDRAVERVLRIKEKLGLFNNHYINENNV---PEKLDNSKSRELALDVAR 369
Query: 430 QGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT---------GIPCSPKSLL 480
+ IVLLKND LPLNKN + ++A+IGP N + G YT GI ++L
Sbjct: 370 KSIVLLKNDN-ILPLNKN-IGTIAVIGPNANEPRNLLGDYTYTGHLNADGGIEVV--TVL 425
Query: 481 RGLEAYVSKTH---YASGCHDVPCNSDAGFHEAVRIAKKADFVIVV----AGLDLTQET- 532
G+ VS YA GC D+ S GF EA+ IAKK D +I V +GL L+
Sbjct: 426 EGIMRKVSNNTNVLYAKGC-DIAAESKEGFSEAIEIAKKGDIIIAVMGEKSGLPLSWTDV 484
Query: 533 --------------EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS--FA 576
E DR SL LPG Q L+ + +T K P+ILVL G PL +S F
Sbjct: 485 PGKDEFEKYQAVTGEGNDRTSLRLPGVQEELLKELHKTGK-PIILVLVNGRPLALSSIFN 543
Query: 577 EADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADS 636
E ++ I + W +PGE G A+A++IFGD+NP GRLP++ +P ++P+ R S
Sbjct: 544 EVNAIIDA--W--FPGEEGGNAIADVIFGDYNPSGRLPIS-FPIDTGQIPI--YYNRKPS 596
Query: 637 SRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSA-PSELTISASLKAGSDKNILQQTG 695
S R Y ++ FG+GLSYT + Y L P E+ S +K
Sbjct: 597 SL----RPYVMMKSKPLFPFGYGLSYTEFKYSNLEVTPKEVNSSGKIK------------ 640
Query: 696 SRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDR 755
+ + V N G +G V L+ P K+L GF +
Sbjct: 641 --------------------ISLEVENVGKREGEETVQLYISKQYSGVSRPIKELKGFAK 680
Query: 756 VHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
V+ ++I+F + P E L+ +++ R I+ G++ +++G+ + ++
Sbjct: 681 VYLKPNEKRKITFSL-PLEALAFYDQYMRLIIDTGDYEILIGKSSEDIVLK 730
>gi|206901280|ref|YP_002250567.1| xylosidase/arabinosidase [Dictyoglomus thermophilum H-6-12]
gi|206740383|gb|ACI19441.1| xylosidase/arabinosidase [Dictyoglomus thermophilum H-6-12]
Length = 762
Score = 278 bits (710), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 228/760 (30%), Positives = 356/760 (46%), Gaps = 136/760 (17%)
Query: 72 LTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVS 131
+ L KIQ+ + RLGIPA E L G + G T FPQ +
Sbjct: 72 VELANKIQKFLKEKT---RLGIPAI-IHEECLSGFMAQG-----------ATVFPQAIGM 116
Query: 132 AASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVV 191
A++F L + + +A N+ Q GL+ P ++I RDPRWGR +ET GEDP +V
Sbjct: 117 ASTFEPELIRRVSDVIRQHMKAA-NVHQ-GLS---PVLDIPRDPRWGRTEETFGEDPYLV 171
Query: 192 SAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDL 251
S A E+VK QGE+W+ + A KH AY +
Sbjct: 172 SRMATEYVKGLQGEDWREG----------------------------IVATVKHFTAYGI 203
Query: 252 EKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQ 311
+ +R A + E++ + F PF I++G+A +M +Y++++GVP L
Sbjct: 204 SEG---ARNLGPAKVGERELREVFLFPFEVAIKEGQAGSLMNAYHEIDGVPCASSKFLLT 260
Query: 312 KA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGT--CMLRHTQ 368
K R EWGFKGY+ SD AV + + + +++A L+AG+DI + C
Sbjct: 261 KILRWEWGFKGYVVSDYIAVRMLENFHKVARDAKEAAVLALEAGIDIELPSVDCYGEPLI 320
Query: 369 SAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVC----TSEHKKLA 424
A+ +G + E+ I+ ++ + + LGLF+ + K P V E + L+
Sbjct: 321 QAVKEGLISEEVINASVERVLRAKFMLGLFDDNLEKD------PKKVYEVFDKPEFRDLS 374
Query: 425 LDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT-------------G 471
+ AR+ IVLLKND LPL+KN + +A+IGP +N + G Y+ G
Sbjct: 375 REVARRSIVLLKNDGT-LPLSKN-LKKVAVIGPNADNPRNLHGDYSYTAHIPSIAEGLEG 432
Query: 472 IPCSPK------SLLRGLEAYVS---KTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIV 522
+ K S+L G+ VS + YA GC D+ +S GF EA+ +AK+AD +I
Sbjct: 433 VKVEEKCVVRTVSILEGIRNKVSPETEVLYAKGC-DIISDSKDGFAEAIEMAKEADVIIA 491
Query: 523 VAG-----LDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAE 577
V G E DR +L L G Q L+ + + K P++LVL G P + +
Sbjct: 492 VMGEESGLFHRGISGEGNDRTTLELFGVQRDLLKELHKLGK-PIVLVLINGRPQALKWEH 550
Query: 578 ADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSS 637
+ +++IL YPGE G A+A++IFGD+NP G+LP++ +P ++P+ N + +
Sbjct: 551 EN--LNAILEAWYPGEEGGNAVADVIFGDYNPSGKLPIS-FPAVTGQIPVY-YNRKPSAF 606
Query: 638 RQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSR 697
Y S + +Y FGHGLSYT + Y S+L IS
Sbjct: 607 SDYIDESAK-----PLYPFGHGLSYTTFEY------SDLKISP----------------- 638
Query: 698 LDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVH 757
++V S + + ++ N G+ DG VV L+ S P K+L GF +++
Sbjct: 639 ------EKVNSLE--KVEISFTIKNTGNRDGEEVVQLYIHDQVASLERPVKELKGFKKIY 690
Query: 758 TVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
SK ++F + P EQL+ ++ R I+ G +M+G
Sbjct: 691 LKPGESKRVTFTLYP-EQLAFYDEFMRFIVEKGVFEVMIG 729
>gi|90021134|ref|YP_526961.1| Beta-glucosidase [Saccharophagus degradans 2-40]
gi|89950734|gb|ABD80749.1| b-xylosidase-like protein [Saccharophagus degradans 2-40]
Length = 893
Score = 277 bits (708), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 172/483 (35%), Positives = 253/483 (52%), Gaps = 66/483 (13%)
Query: 51 SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
+YPF + SLS+ R L+S LT EKI Q+ ++ AI RLGIPAY WW+ESLHG+A G
Sbjct: 42 TYPFRDASLSVDARVDDLVSRLTTTEKIAQMFNDTPAIERLGIPAYNWWNESLHGVARAG 101
Query: 111 PGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN--LGQ------AGL 162
T +PQ + A++F+ L + ++++ E RA Y+ L + GL
Sbjct: 102 ----------KATVYPQAIGLASTFDEDLMLRVATSISDEGRAKYHDFLSKDVRTIYGGL 151
Query: 163 TFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKR 222
TFW+PNINIFRDPRWGRGQET GEDP + A+ FVK QGEN
Sbjct: 152 TFWSPNINIFRDPRWGRGQETYGEDPFLTGRMAINFVKGIQGEN---------------- 195
Query: 223 VLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSC 282
D D L A KH Y + +R+S + T +D +T+ P FR
Sbjct: 196 ---------DNSDYLKAVATIKH---YAVHSGPEKTRHSDDYHPTRKDLFETYLPAFRMA 243
Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNY-- 339
I + +MC+YN+V+G PAC +L Q+ R + GF GY+ SDC A+A +E +++
Sbjct: 244 IAETNVQSLMCAYNRVDGAPACGNNELMQEILRGDMGFNGYVVSDCGAIADFYESRSHHV 303
Query: 340 TKTHEDSAAGVLKAGMDINCGTC---MLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
+ ++AA +K+G D+NCG + A+ +G + E ID A+ LF +++LG
Sbjct: 304 VDSPAEAAAWAVKSGTDLNCGDSHGNTYTNLHYALQQGLITEDYIDIAVKRLFKARIKLG 363
Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIG 456
+F+ R Y ++G D V + +H L +AA + IVLLKN+ LPL A +A+IG
Sbjct: 364 MFDEQDRV-PYSEIGMDVVGSPKHLALTQEAAEKSIVLLKNN-GVLPL--KAGVKVAVIG 419
Query: 457 PLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASG--------CHDVPCNSDAG 506
P + + G Y G+P P L G+ V + + YA G H P +++
Sbjct: 420 PNAVDEDVLVGNYHGVPVKPVLPLEGIVNRVGEANVFYAPGSAQIADIYSHYEPISAENF 479
Query: 507 FHE 509
+H+
Sbjct: 480 YHK 482
Score = 112 bits (280), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 83/299 (27%), Positives = 133/299 (44%), Gaps = 54/299 (18%)
Query: 509 EAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSK 558
EA+ A+KAD +I + G+D E E+ DR + LP Q +L+ + T K
Sbjct: 619 EALAAARKADVIIFMGGIDAHLEGEEMPLELDGFTHGDRTHINLPKVQTNLLKQLKATGK 678
Query: 559 RPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWY 618
PV++V G + +++ ++ +IL YPGEA ALA I++GD +P GRLP+T+Y
Sbjct: 679 -PVVMVNFSGSAMALNWES--EKLDAILQAFYPGEATGTALANILWGDVSPSGRLPVTFY 735
Query: 619 PESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTI 678
ND +M R+Y+FY G +Y FGHGL Y +++Y + L +
Sbjct: 736 KGVDDLPAFNDYHME--------NRTYKFYRGEPLYAFGHGLGYVDFAY------NNLVV 781
Query: 679 SASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARV 738
+ + +AG + +SVTN G + V ++ +
Sbjct: 782 ANTAEAGK--------------------------ALPIAVSVTNTGKMQAEDVAQVYISL 815
Query: 739 PKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
TP + L F R A S E+ F + P L+ + +G+ G + VG
Sbjct: 816 LDAPANTPIRDLKAFKRTKLAAGESTELEFNL-PARVLTYIDDNGKTQTYTGRVEVTVG 873
>gi|229580225|ref|YP_002838625.1| glycoside hydrolase family protein [Sulfolobus islandicus
Y.G.57.14]
gi|229581131|ref|YP_002839530.1| glycoside hydrolase family protein [Sulfolobus islandicus
Y.N.15.51]
gi|228010941|gb|ACP46703.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
Y.G.57.14]
gi|228011847|gb|ACP47608.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
Y.N.15.51]
Length = 754
Score = 276 bits (707), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 211/711 (29%), Positives = 336/711 (47%), Gaps = 123/711 (17%)
Query: 121 SVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRG 180
S T+FPQ + A+++N L +I S + + R + + Q +P +++ +DPRWGR
Sbjct: 100 SSTAFPQAIGLASTWNPELVMDIASVIRSQGR-LVGVNQC----LSPVLDVCKDPRWGRC 154
Query: 181 QETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLS 240
+ET GEDP +V++ + ++ QG+N L
Sbjct: 155 EETYGEDPYLVASMGLAYITGLQGDN-------------------------------QLV 183
Query: 241 ACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNG 300
A KH A+ + G R + ++ +TF PF ++ GK IM +Y++++G
Sbjct: 184 ATAKHFAAHGFPEGG---RNIAQVHVGNRELRETFLFPFEVAVKIGKVMSIMPAYHEIDG 240
Query: 301 VPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINC 359
+P L R EWGF G + SD D + + ++A L++G+DI
Sbjct: 241 IPCHGNPQLLTNILRQEWGFDGIVVSDYDGIRQLETIHRVASNKMEAAILALESGVDIEF 300
Query: 360 GT--CMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCT 417
T C +A+ +G V E IDRA+ + ++ RLGL + +P + P+ +
Sbjct: 301 PTIDCYGEPLVNALKEGLVPESLIDRAVERVLRIKDRLGLLD-NPFVNENSV--PEKLDD 357
Query: 418 SEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT------- 470
+ ++LAL AR+ IVLLKN+ LPL+KN V+ +A+IGP N+ M G YT
Sbjct: 358 HKSRELALKTARESIVLLKNENNILPLSKN-VNKIAVIGPNANDPRNMLGDYTYTGHLNI 416
Query: 471 --GIPCSPKSLLRGLEAYV--SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG- 525
GI ++L+G+ V SK YA GC D+ S GF EA+ IA++AD +I + G
Sbjct: 417 DSGIEIV--TVLQGIVKKVGESKVLYAKGC-DIASESKEGFAEAIEIARQADVIIAIMGE 473
Query: 526 --------LDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTG 567
+D+ E E + DR SL LPG Q L+ + +T K P+ILVL
Sbjct: 474 KSGLPLSWMDIPSEEEFKKYQAVTGEGNDRSSLRLPGVQEELLKELYKTGK-PIILVLIN 532
Query: 568 GGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM 627
G PL +S + + +++ +PGE G A+A++IFGD+NPGGRLP+T +P ++P+
Sbjct: 533 GRPLVLS--SIINYVKAVIEAWFPGEEGGNAIADVIFGDYNPGGRLPIT-FPMDTGQIPL 589
Query: 628 NDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSA-PSELTISASLKAGS 686
N + S R Y + ++ FG+GLSYT + Y L P E+ G
Sbjct: 590 Y-YNRKPSSF-----RPYVMLRSSPLFTFGYGLSYTQFEYSNLEVTPKEI--------GP 635
Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
+ NI + I V N G ++G VV L+ S P
Sbjct: 636 NSNI------------------------AISIDVKNVGKMEGDDVVQLYVSKTFSSVARP 671
Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
K+L GF ++H + + F + P E L+ + R ++ G + L++G
Sbjct: 672 VKELKGFAKIHLKPGEKRRVKF-ILPTEALAFYDSFMRLVVEKGEYQLLIG 721
>gi|325916103|ref|ZP_08178390.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
ATCC 35937]
gi|325537647|gb|EGD09356.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
ATCC 35937]
Length = 896
Score = 276 bits (707), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 161/462 (34%), Positives = 242/462 (52%), Gaps = 54/462 (11%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
P+ +T L TRA L+S +TL+EK Q+ + A AIPRL +PAY+WW+E+LHG+A G
Sbjct: 39 PYLDTQLPFETRAADLVSRMTLEEKAAQMQNAAPAIPRLRVPAYDWWNEALHGVARAG-- 96
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTF 164
T FPQ + AA+F+ L + + +A++ EARA ++ A GLTF
Sbjct: 97 --------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLARDEHKRYQGLTF 148
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
W+PNINIFRDPRWGRGQET GEDP + + V FV+ Q +
Sbjct: 149 WSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQ------------------- 189
Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIE 284
+G L A KH Y + R+ F+ +E+D +T+ P F++ ++
Sbjct: 190 --------QGPYRKLDATAKH---YAVHSGPEADRHHFDVHPSERDLHETYLPAFQALVQ 238
Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
+G + +M +YN+VNG A L R +WGF GYI SDC A+ I++ T E
Sbjct: 239 EGHVAAVMGAYNRVNGESASASTRLEGILRRDWGFDGYIVSDCAAIRDIWQNHKIVPTPE 298
Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
+AA +K G D++CG A+ G + E ID +L L + ++RLG+F+ P K
Sbjct: 299 AAAALGVKHGTDLDCGDTYA-ALPKAVRAGLIDEATIDTSLKRLMTTRMRLGMFD-PPAK 356
Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
+ ++ + +H LA AR+ +VLLKND LPL K + +A++GP ++
Sbjct: 357 VAWAQIPASVNQSPQHDALARRTARESLVLLKND-GLLPL-KPTLKRIAVVGPTADDPMS 414
Query: 465 MGGGYTGIPCSPKSLLRGLE--AYVSKTHYASGCHDVPCNSD 504
+ G Y G P +P ++L+G+ A ++ YA G V D
Sbjct: 415 LLGNYYGTPAAPVTILQGIRDAAPQAEVVYARGSDLVEGRED 456
Score = 119 bits (297), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 86/286 (30%), Positives = 130/286 (45%), Gaps = 52/286 (18%)
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRD----------RVSLLLPGQQMSLVTSVART 556
EAV A+ A+ V+ V GL E E+ D R LP Q L+ ++ T
Sbjct: 620 LQEAVDAARNAEVVVFVGGLTGDVEGEEMDVNYPGFAGGDRTDTRLPKPQRELLQALQAT 679
Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
PV+ VLT G L V +A+ + +IL YPG+ G A+ +++FG +PGGRLP+T
Sbjct: 680 GT-PVVAVLTTGSALAVDWAQ--QHVPAILLAWYPGQRGGSAVGDVLFGQASPGGRLPIT 736
Query: 617 WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
+Y E+ +D MR GR+YR++TGT +Y FGHGLSYT ++Y S+L
Sbjct: 737 FYKEAERLPAFDDYAMR--------GRTYRYFTGTALYPFGHGLSYTQFAY------SDL 782
Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
+ D+ L G+ + V N G G VV L+
Sbjct: 783 RL--------DRTTLGADGT-----------------LRATLKVRNTGKRAGDEVVQLYL 817
Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
+ K+L GF R+ +E++F + + L I ++
Sbjct: 818 HPLDPKRERAGKELRGFQRMTLQPGEQREVAFTLKAADALRIYDEQ 863
>gi|329850151|ref|ZP_08264997.1| beta-xylosidase B [Asticcacaulis biprosthecum C19]
gi|328842062|gb|EGF91632.1| beta-xylosidase B [Asticcacaulis biprosthecum C19]
Length = 877
Score = 276 bits (707), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 168/471 (35%), Positives = 242/471 (51%), Gaps = 63/471 (13%)
Query: 46 PPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHG 105
P + + +T+L RA L+S +TL+EK QL A AIPRLG+P Y WW+E LHG
Sbjct: 15 PADVAAMAYRDTALDPKARAADLVSRMTLEEKAAQLGHTAPAIPRLGVPKYNWWNEGLHG 74
Query: 106 IASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY---------N 156
+A G T FPQ + AA+++ + + +G V+ E RA Y
Sbjct: 75 VARAG----------VATVFPQAIGMAATWDEPMMTTVGDVVSTEFRAKYVERVHPDGGT 124
Query: 157 LGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGF 216
GLT W+PNINIFRDPRWGRGQET GEDP + S + ++ QG + K
Sbjct: 125 DWYRGLTVWSPNINIFRDPRWGRGQETYGEDPYLTSRIGIGYIHGLQGNDPKF------- 177
Query: 217 GFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQ 276
A KH + + +R+ + ++ D EDT+
Sbjct: 178 --------------------FKTVATSKHFAVHSGPE---SNRHKEDVYPSKFDLEDTYL 214
Query: 277 PPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIF- 334
P FR+ + +GKA +MC YN V GVP C L +K R WGF G++ SDC A A IF
Sbjct: 215 PAFRATVTEGKAYSVMCVYNAVYGVPGCASDFLMEEKLRQNWGFPGFVVSDCGAAANIFR 274
Query: 335 -EYQNYTKTHEDSAAGVLKAGMDINCG------TCMLRHTQSAIDKGKVQEKDIDRALLN 387
+ +YTKT E+ A LKAGMD+ CG + ++ +A+ G++ +D+AL+
Sbjct: 275 EDALHYTKTAEEGVAVGLKAGMDLICGDYRNKMSTEVQPIINAVKAGQLPIAVVDQALVR 334
Query: 388 LFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKN 447
LF ++RLG+F+ P + + DD T H +ALD A++ +VLLKND LPL K
Sbjct: 335 LFEGRIRLGMFD-PPASLPFAHITADDSDTPAHHAVALDMAKKSMVLLKND-GLLPL-KA 391
Query: 448 AVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAY--VSKTHYASGC 496
++A+IGP +++ + G Y G P P ++L G+ A +K YA G
Sbjct: 392 EPKTIAVIGPNADSLDALVGNYYGKPSKPVTVLDGIRARFPTAKIVYAEGT 442
Score = 113 bits (283), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 95/300 (31%), Positives = 141/300 (47%), Gaps = 58/300 (19%)
Query: 509 EAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSK 558
+AV +AK ADFV+ V GL E E+ DR S+ LP Q L+ V T K
Sbjct: 590 QAVDVAKTADFVVFVGGLSARVEGEEMKVEAEGFAGGDRTSIDLPKPQQQLLEKVIGTGK 649
Query: 559 RPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWY 618
P +LVL G L V++A D + +I+ YPG G A+A++I GD++P GRLP+T+Y
Sbjct: 650 -PTVLVLMSGSALGVNWA--DKHVPAIIEAWYPGGEGGHAVAQLIAGDYSPAGRLPVTFY 706
Query: 619 PESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELT 677
S +P +D M+ R+YR++ G +Y FGHGLSYT ++Y + P
Sbjct: 707 -RSVDALPGFSDYTMK--------NRTYRYFNGEVLYPFGHGLSYTTFAY---ANPKVSA 754
Query: 678 ISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFAR 737
S + + ++ +D DEV VQ+ V++ G
Sbjct: 755 ASVAAGSSVTVSVDVSNSGAMDS---DEV---------VQLYVSHPG------------- 789
Query: 738 VPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
GT + L GF RV +K + F +D LS+ ++HG R + G L +G
Sbjct: 790 ------GTAIRSLQGFQRVSLKKGETKTVQFKLD-DRALSVVDEHGGRKVQAGQVDLWIG 842
>gi|160901716|ref|YP_001567297.1| glycoside hydrolase family 3 protein [Petrotoga mobilis SJ95]
gi|160359360|gb|ABX30974.1| glycoside hydrolase family 3 domain protein [Petrotoga mobilis
SJ95]
Length = 777
Score = 276 bits (707), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 220/750 (29%), Positives = 351/750 (46%), Gaps = 141/750 (18%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIPA+ E L G + G T FPQ++ +A+++ L + +++
Sbjct: 91 RLGIPAF-MHEECLSGYMTRG-----------ATIFPQMIGAASTWEPPLIERMTTSIRN 138
Query: 150 EARAMYNLG-QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
+ +A LG GL+ P +++ RDPRWGR +ET GEDP +++ V +VK Q ++ K
Sbjct: 139 QMKA---LGIHQGLS---PVVDVTRDPRWGRTEETFGEDPYLIAKMGVAYVKGLQSDDLK 192
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG-NFSRYSFNAMIT 267
+ + A KH + Y + + G N++ A I
Sbjct: 193 NG----------------------------IVATLKHFVGYGVSEGGMNWAP----AHIP 220
Query: 268 EQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSD 326
E++ ++TF PF + I++GK +M +Y++++G+P L ++ R EWGF G + SD
Sbjct: 221 ERELKETFLFPFEAAIKEGKVKSVMNAYHEIDGIPCGASETLLRRILREEWGFDGIVVSD 280
Query: 327 CDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGT--CMLRHTQSAIDKGKVQEKDIDRA 384
A+ ++ EY E++A LKAG+D+ + C ++AI+ G+ E ID++
Sbjct: 281 YFAINSLMEYHKIALNKEEAAIKALKAGIDVELPSFDCYKEPLKNAIENGEFSEAFIDKS 340
Query: 385 LLNLFSVQLRLGLFNGDPRKGKYGKLG--PDDVCTSEHKKLALDAARQGIVLLKNDKKFL 442
+ N+ ++ +GLF + Y L PD++ T E +KLA + A++ IVLLKND +
Sbjct: 341 VRNILRLKFEMGLF-----ENPYVDLEKVPDNLDTPEDRKLAYEIAKKSIVLLKND-GIV 394
Query: 443 PLNKNA-VSSLAIIGPLVNNISQMGGGYT-------------------GIPCSP------ 476
PL KN+ + +A+IGP N+ + G YT GI S
Sbjct: 395 PLKKNSKIKKVAVIGPNANSARNLTGDYTYLTHLETLKQGAFGTSAMEGITFSESELPIK 454
Query: 477 ---KSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG------LD 527
+SL LE +T YA GC ++ ++ EAV +A+ +D ++V G LD
Sbjct: 455 TIYESLKEKLEKLNVETSYAKGC-EINDDNKEMIKEAVELAENSDVALLVLGDKSGLTLD 513
Query: 528 LTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILW 587
T E RD +L+LPG Q+ L+ SV T PVI+VL G P + + + +S+I
Sbjct: 514 CTT-GESRDSSTLILPGVQLDLLKSVINTG-TPVIVVLVNGRPYSLDWVSKN--VSAIFE 569
Query: 588 IGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRF 647
PGE G ALA+II GD +P G+LP++ +P ++P+ N + R Y
Sbjct: 570 AWLPGEEGGNALADIILGDESPSGKLPIS-FPRHVGQIPVY-YNHKPSGGRSQWWGDYTD 627
Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVT 707
+Y FGHGLSYT + Y N+ + R+ + +D
Sbjct: 628 SPAKPLYPFGHGLSYTQFEY--------------------GNLQIENNDRIVKISMD--- 664
Query: 708 SCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEIS 767
V N G+ G +V L+ S P K+L GF RV K I
Sbjct: 665 ------------VKNIGEETGDEIVQLYMNDEVASVTRPVKELKGFQRVTLKPSEKKRII 712
Query: 768 FGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
F + P E L++ N+ ++ G +MVG
Sbjct: 713 FNL-PIETLALYNEKMEFLVEKGYFKVMVG 741
>gi|373951852|ref|ZP_09611812.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
gi|373888452|gb|EHQ24349.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
Length = 871
Score = 276 bits (706), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 158/446 (35%), Positives = 228/446 (51%), Gaps = 59/446 (13%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
YP+ N L +TR L+ +TL+EK+ Q+ +++ AIPRL IPAY+WW+E LHG+A
Sbjct: 26 YPYQNYHLDFTTRVNDLVKRMTLEEKVSQMLNSSPAIPRLKIPAYDWWNEVLHGVAR--- 82
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGL 162
T VT +PQ + AA+F+R + + A+E RA++N GL
Sbjct: 83 ------TPFKVTVYPQAIAMAATFDRQSLNQMADYAALEGRAVHNKALQMRKPGEKYLGL 136
Query: 163 TFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKR 222
T+W PNINIFRDPRWGRGQET GEDP + A FV QG + K
Sbjct: 137 TYWTPNINIFRDPRWGRGQETYGEDPFLTGAMGSAFVSGLQGNDPKY------------- 183
Query: 223 VLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSC 282
L +AC KH Y + R+ FNA I+ D DT+ P F+
Sbjct: 184 --------------LKAAACAKH---YAVHSGPEPLRHVFNADISTYDLWDTYLPAFKKL 226
Query: 283 IEQGKASCIMCSYNQVNGVPAC----LRGDLFQKARNEWGFKGYITSDCDAVATIFEYQN 338
+ K + +MC+YN P C L D+ RN+W F GY+TSDC + F+
Sbjct: 227 VVDDKVAGVMCAYNAFKTQPCCGSDLLMVDIL---RNQWKFSGYVTSDCGGIDDFFKNHK 283
Query: 339 YTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
T ED++ + G DI CGT + +A+ +GK+ E ID ++ LF ++ RLG+F
Sbjct: 284 THATAEDASTDAVLHGTDIECGTDAYKSLVAAVKEGKISETQIDISVKRLFMIRFRLGMF 343
Query: 399 NGDPRK-GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
DP KY + + + EH+ AL ARQ +VLLKN LPL+K + + ++GP
Sbjct: 344 --DPSDVVKYAQTPVSVLESPEHQAHALKMARQSVVLLKNANHTLPLSKT-IRKIVVLGP 400
Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGL 483
+N + G Y G P + ++ +G+
Sbjct: 401 NADNPIAILGNYNGTPSNLTTVYQGI 426
Score = 116 bits (290), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 84/284 (29%), Positives = 141/284 (49%), Gaps = 56/284 (19%)
Query: 511 VRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRP 560
V+ AD ++ V G+ E E+ DR S+ LP Q +L+ ++ T K P
Sbjct: 597 VKRVADADAIVYVGGISPQLEGEEMQVNYPGFNGGDRTSIQLPAAQTNLMKTLQATGK-P 655
Query: 561 VILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPE 620
V+ V+ G L + EA++ I +I+ Y G+A A+A+++FGD+NP GRLP+T+Y +
Sbjct: 656 VVFVMMTGSALATPW-EAEN-IPAIVNAWYGGQAAGTAVADVLFGDYNPAGRLPVTFY-K 712
Query: 621 SFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS 679
S T +P D +M R+YR++ G +YGFG+GLSYT + Y +L +
Sbjct: 713 SDTDLPDFTDYSMT--------NRTYRYFKGIPLYGFGYGLSYTQFKY------DKLIVP 758
Query: 680 ASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVP 739
A++K+G H+ ++VTN+G + G VV ++ +
Sbjct: 759 ATVKSGK--------------------------AIHLSVTVTNSGQIAGDEVVQIYMKHH 792
Query: 740 KVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHG 783
P K L GF RV+ A + ++F + P + L++ + +G
Sbjct: 793 SQRIKVPLKALKGFARVYLKAGERRTLNFILSP-DDLAVTSSNG 835
>gi|397690575|ref|YP_006527829.1| glucan 1,4-beta-glucosidase [Melioribacter roseus P3M]
gi|395812067|gb|AFN74816.1| glucan 1,4-beta-glucosidase [Melioribacter roseus P3M]
Length = 860
Score = 275 bits (704), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 161/458 (35%), Positives = 239/458 (52%), Gaps = 54/458 (11%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ N +L RA+ L+ L+L EKI + + AI RLGIP Y WW+E+LHG+A NG
Sbjct: 23 YLNVNLPFEERAEDLLQRLSLDEKISLMVHQSPAIERLGIPEYNWWNEALHGVARNG--- 79
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG--------QAGLTFW 165
T FP + AA+++R L I ++ EARA YN G++ W
Sbjct: 80 -------RATVFPMPIGLAATWDRDLIYRIADVISNEARAKYNSALKKNQRGIYQGISLW 132
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
APNINIFRDPRWGRG ET GEDP + AV F+K QG+ +K+ LK
Sbjct: 133 APNINIFRDPRWGRGMETYGEDPYLTGELAVSFIKGLQGQ--------------DKKYLK 178
Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
A KHL + + R+ FNA+++ D +T+ P F+ I +
Sbjct: 179 TI-------------ATPKHLAVHSGPE---PERHHFNALVSNYDLNETYLPHFKKSIMK 222
Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
GKA +MC+YN++ G C L RN+WGF+G + SDC AV IF + E
Sbjct: 223 GKAYSVMCAYNRLRGKACCGHDTLLTDILRNKWGFEGIVVSDCWAVYDIFNSHKIVDSPE 282
Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
+AA + +G D+ CG L ++A G + EK+ID AL + + +LG+F+ P
Sbjct: 283 KAAALAVSSGTDLECGNTFL-SLKNAYRDGLITEKEIDSALRRVLLARFKLGMFD-PPEI 340
Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
Y ++ + S ++++AL+AAR+ IVLLKND K LPL+ ++++ +A+IGP +N+
Sbjct: 341 VSYSQIDESYLDNSYNREIALEAARKSIVLLKNDNKLLPLD-SSINKIAVIGPNADNLES 399
Query: 465 MGGGYTGIPCSPKSLLRGLEAYVS--KTHYASGCHDVP 500
+ G Y G P + L+ + + + Y GC P
Sbjct: 400 LLGNYHGFPSEYITPLQAIRRVLKNGEVFYEKGCDFAP 437
Score = 103 bits (256), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 83/298 (27%), Positives = 135/298 (45%), Gaps = 55/298 (18%)
Query: 510 AVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKR 559
A + A K+D VI+ GL E E DR+ L LP Q+ L+ + T K
Sbjct: 591 AYKTALKSDAVIMFMGLCPRMEGEALKIKLDGFKGGDRLKLSLPANQLKLIKKIHSTGK- 649
Query: 560 PVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYP 619
PVILVL GGP +S I +IL YPG+AG +A+ ++I+G +NP G+LP+T Y
Sbjct: 650 PVILVLLNGGP--ISTVWESENIPAILEAWYPGQAGGRAITDVIWGKYNPSGKLPVTIYK 707
Query: 620 ESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS 679
P + +M GR+YR++ G +Y FG GL+YT+ + S + +S
Sbjct: 708 SENDLPPFENYDME--------GRTYRYFKGEVLYPFGWGLNYTDITI------SNIELS 753
Query: 680 ASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVP 739
A +E+ ++R V + + N G++ G V L+ +
Sbjct: 754 A-----------------------NEIKDNDTIR--VVVKLKNNGNLAGEETVQLYTKAL 788
Query: 740 KVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
K ++ K L GF+++ + F + + + G +P G + ++VG
Sbjct: 789 KDNRTI--KTLRGFEKIKLEPGTEGMVEFYLSKSDLAVWVDGLGFETMP-GVYEIIVG 843
>gi|385774250|ref|YP_005646817.1| glycoside hydrolase family protein [Sulfolobus islandicus HVE10/4]
gi|323478365|gb|ADX83603.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
HVE10/4]
Length = 754
Score = 275 bits (704), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 212/711 (29%), Positives = 335/711 (47%), Gaps = 123/711 (17%)
Query: 121 SVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRG 180
S T+FPQ + A+++N L +I S + + R + + Q +P +++ +DPRWGR
Sbjct: 100 SSTAFPQAIGLASTWNLELVMDIASVIRSQGR-LVGVNQC----LSPVLDVCKDPRWGRC 154
Query: 181 QETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLS 240
+ET GEDP +V++ + ++ QG+N L
Sbjct: 155 EETYGEDPYLVASMGLAYITGLQGDN-------------------------------QLV 183
Query: 241 ACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNG 300
A KH A+ + G R + ++ +TF PF ++ GK IM +Y++++G
Sbjct: 184 ATAKHFAAHGFPEGG---RNIAQVHVGNRELRETFLFPFEVAVKIGKVMSIMPAYHEIDG 240
Query: 301 VPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINC 359
+P L R EWGF G + SD D + + ++A L++G+DI
Sbjct: 241 IPCHGNPQLLTNILRQEWGFDGIVVSDYDGIRQLETIHRVASNKMEAAILALESGVDIEF 300
Query: 360 GT--CMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCT 417
T C +A+ +G V E IDRA+ + ++ RLGL + +P + P+ +
Sbjct: 301 PTIDCYSEPLVNALTEGLVPESLIDRAVERVLRIKDRLGLLD-NPFVNENSV--PEKLDD 357
Query: 418 SEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT------- 470
+ ++LAL AR+ IVLLKN+ LPL+KN V+ +A+IGP N+ M G YT
Sbjct: 358 HKSRELALKTARESIVLLKNENNILPLSKN-VNKIAVIGPNANDPRNMLGDYTYTGHLNI 416
Query: 471 --GIPCSPKSLLRGLEAYV--SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG- 525
GI ++L+G+ V SK YA GC D+ S GF EA+ IA++AD +I V G
Sbjct: 417 DSGIEIV--TVLQGVVKKVGESKVLYAKGC-DIASESKEGFAEAIEIARQADVIIAVMGE 473
Query: 526 --------LDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTG 567
D+ E E + DR SL LPG Q L+ + +T K P+ILVL
Sbjct: 474 KSGLPLSWTDIPSEEEFKKYQAVTGEGNDRSSLRLPGVQEELLKELYKTGK-PIILVLIN 532
Query: 568 GGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM 627
G PL +S + + +++ +PGE G A+A++IFGD+NPGGRLP+T +P ++P+
Sbjct: 533 GRPLVLS--PIINYVKAVIEAWFPGEEGGNAIADVIFGDYNPGGRLPIT-FPMDTGQIPL 589
Query: 628 NDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSA-PSELTISASLKAGS 686
N + S R Y + ++ FG+GLSYT + Y L P E+ G
Sbjct: 590 Y-YNRKPSSF-----RPYVMLRSSPLFTFGYGLSYTQFEYSNLEVTPKEI--------GP 635
Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
+ NI + I V N G ++G VV L+ S P
Sbjct: 636 NSNI------------------------AISIDVKNVGKMEGDDVVQLYVSKTFSSVARP 671
Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
K+L GF ++H + + F + P E L+ + R ++ G + L++G
Sbjct: 672 VKELKGFAKIHLKPGEKRRVKF-ILPTEALAFYDSFMRLVVEKGEYQLLIG 721
>gi|308208211|gb|ADO20356.1| putative beta-D-xylosidase/alpha-L-arabinosidase [uncultured rumen
bacterium]
Length = 780
Score = 275 bits (704), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 219/761 (28%), Positives = 332/761 (43%), Gaps = 156/761 (20%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
P+ + SL RAK L+S LTL+EK ++ + LGI AY WWSE+LHG+A NG
Sbjct: 45 PYKDRSLPPEERAKDLVSRLTLEEKASLSMHPSAPVEALGIKAYNWWSEALHGVARNG-- 102
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTF 164
+ T FPQ + AASF+ L + +AV+ EAR Y + + G+TF
Sbjct: 103 --------AATVFPQPIGMAASFDEPLLYEVFTAVSDEARVKYKIAKESGHIGQYQGVTF 154
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
W PNINIFRDPRWGRG ET GEDP + + V+ QG + + VL
Sbjct: 155 WTPNINIFRDPRWGRGMETYGEDPYLTGQMGMAVVRGLQGPS-------------DSPVL 201
Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIE 284
K AC KH + +W +R+S++A ++E+D +T+ P F+ +
Sbjct: 202 KA-------------HACAKHYAVHSGPEW---NRHSYDAEVSERDLRETYLPAFKDLVT 245
Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQKA--RNEWGFKGYITSDCDAVATIFEY--QNYT 340
+ +M +YN+ G P C D R EWG+KG ITSDC AV + Y+
Sbjct: 246 KANVQEVMTAYNRFRGEP-CGASDYLINTILRGEWGYKGLITSDCWAVEDFYVQGRHGYS 304
Query: 341 KTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNG 400
+AA + AG+D CG RH A+++G + EKD+DR L+ LF+ + +LG +
Sbjct: 305 PDVASAAAAAVHAGVDTECGQAY-RHIPEAVERGLLDEKDLDRNLIRLFTARYQLGEMDD 363
Query: 401 DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVN 460
+ L + EH L+ A++ +VLL+N LPL + +A++GP +
Sbjct: 364 ---ISLWDDLPASILEGPEHLALSRKMAQESMVLLQNKGGILPLAPDV--RVALVGPNGD 418
Query: 461 NISQMGGGYTGIPCSPKSLLRGL------------------------------------- 483
+ G Y +P +L L
Sbjct: 419 DREMQWGNYNPVPGRTVTLYDALKERFPGIKYVRGCGIVGAEFAPKPDPNNPLSQALGKS 478
Query: 484 --EAYVSKTHYASGCHDV------PCNSDAGF------HEAVRIAKKADFVIVVAGLDLT 529
E YA G D+ A F ++ + D VI G+
Sbjct: 479 REEMEAIARQYAIGVQDILNYVRRQERMQASFLPELDVQSVLKELEGIDVVIFAGGISPR 538
Query: 530 QETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
E E+ DR + LP Q L+ ++ K+ +++ +G +
Sbjct: 539 FEGEEMPVNLPGFKGGDRTDIQLPQVQRDLMKALHDAGKKVILVNFSGCA---IGLVPET 595
Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSR 638
+IL YPGE G A+ +++FGD NP G+LP+T+Y S +P + +M+
Sbjct: 596 ESCDAILQAWYPGEEGGLAITDVLFGDVNPSGKLPVTFY-RSVEDLPDFENYDMK----- 649
Query: 639 QYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRL 698
G +YR++ G ++ FG+GLSY+ + YK A ++ S ++ TG R
Sbjct: 650 ---GHTYRYFKGKPLFPFGYGLSYSTFRYK----------RAKVRNNSLIIPVKNTGKR- 695
Query: 699 DYVHIDEVTSCTSLRFHVQISVTNAGDVDGS-HVVMLFARV 738
E T VQ+ V GD DG + F RV
Sbjct: 696 ------EATEV------VQVYVRRKGDPDGPVKTLRAFRRV 724
>gi|300777563|ref|ZP_07087421.1| beta-glucosidase [Chryseobacterium gleum ATCC 35910]
gi|300503073|gb|EFK34213.1| beta-glucosidase [Chryseobacterium gleum ATCC 35910]
Length = 896
Score = 275 bits (702), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 160/447 (35%), Positives = 233/447 (52%), Gaps = 59/447 (13%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
YPF N L ++ R ++L++LLT +EKI + DN+ A+PRL IPAY WW+E+LHG+A G
Sbjct: 44 YPFRNPDLPVNERIENLLTLLTTEEKIGMMMDNSQAVPRLEIPAYGWWNEALHGVARAG- 102
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGL 162
T FPQ + AA+++ ++ EARA YN GL
Sbjct: 103 ---------IATVFPQAIGMAATWDVPEHFKTFEMISDEARAKYNRSFDEALKTGRYEGL 153
Query: 163 TFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKR 222
TFW PNINIFRDPRWGRGQET GEDP + S V VK QG + K
Sbjct: 154 TFWTPNINIFRDPRWGRGQETYGEDPYLTSVLGVAAVKGLQGNDPKF------------- 200
Query: 223 VLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSC 282
AC KH + +W +R+S+NA I+++D +T+ P F++
Sbjct: 201 --------------FKTHACAKHFAVHSGPEW---NRHSYNAEISKRDLYETYLPAFKAL 243
Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTK 341
+++G +MC+YN +G P C L + R +W + G + SDC A+A F+ + Y
Sbjct: 244 VQEGNVREVMCAYNAFDGQPCCANNTLLTEILRGKWKYDGMVVSDCWALADFFQ-KKYHG 302
Query: 342 THED---SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
TH D +AA LK D+ CG + ++ G + EKDID ++ + LG+
Sbjct: 303 THPDEKTTAADALKHSTDLECGDT-YNNLNKSLASGLITEKDIDESMRRILKGWFELGML 361
Query: 399 NGDPRKGKYGKLGPDDVCTS-EHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
DP+ + P V S EHKK AL A++ IVL+KN+K LPLN+N + +A++GP
Sbjct: 362 --DPKSSVHWNTIPYSVVDSEEHKKQALKMAQKSIVLMKNEKNILPLNRN-IKKIAVVGP 418
Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLE 484
++ G Y G P S ++L G++
Sbjct: 419 NADDGLMQLGNYNGTPSSIVTILDGIK 445
Score = 94.0 bits (232), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 77/295 (26%), Positives = 127/295 (43%), Gaps = 47/295 (15%)
Query: 486 YVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR---------- 535
+ K Y S +V F K AD ++ GL + E E+
Sbjct: 592 HTQKGKYVSITFEVYRKDPVNFASVREKVKNADVIVFAGGLSPSLEGEEMMVNAEGFKGG 651
Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
D+ S+ LP Q L+ + +T K PV+ VL G L + + + ++L Y G++G
Sbjct: 652 DKTSIALPKVQRDLLAELRKTGK-PVVFVLCTGSALGLE--QDEKNYDALLNAWYGGQSG 708
Query: 596 AKALAEIIFGDFNPGGRLPMTWYP--ESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQV 653
A+A+++ GD+NP G+LP+T+Y E + GR+YR+ T +
Sbjct: 709 GTAVADVLAGDYNPSGKLPITFYKNLEQLDNALSKTSKHEGFENYDMQGRTYRYMTEKPL 768
Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
Y FGHGLSY+ KF+ S+L+ ++ + ++E + T
Sbjct: 769 YPFGHGLSYS----KFVYGDSKLSKNS--------------------ISVNENVTIT--- 801
Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
I VTN + +G VV ++ + +Q P K L F+R +K +K I
Sbjct: 802 ----IPVTNISEREGEEVVQVYIKRNNDAQA-PVKTLRAFERTPIKSKETKNIQL 851
>gi|53712134|ref|YP_098126.1| beta-glucosidase [Bacteroides fragilis YCH46]
gi|52214999|dbj|BAD47592.1| periplasmic beta-glucosidase precursor [Bacteroides fragilis YCH46]
Length = 812
Score = 275 bits (702), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 231/738 (31%), Positives = 346/738 (46%), Gaps = 126/738 (17%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP + E HG + G T FP + A+++N L +G +A+
Sbjct: 149 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRQMGRVIAI 196
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
EA A Q + P +++ RDPRW R +ET GEDP + V+ FQGE
Sbjct: 197 EASA-----QGAHIGYGPVLDLARDPRWSRVEETYGEDPYLNGVMGTALVRGFQGETL-- 249
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
+DG K V+ A KH +Y W A I E+
Sbjct: 250 NDG--------KSVI----------------ATLKHFASYG---WTEGGHNGGTAHIGER 282
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC-LRGDLFQKARNEWGFKGYITSDCD 328
+ E+ PPFR + G A +M SYN+++G P R L ++ W FKG++ SD
Sbjct: 283 ELEEAIFPPFREAVGAG-ALSVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVSDLY 341
Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSAIDKGKVQEKDIDRALLN 387
AV + E+ +E +A + AG+D + GT + +A+ +G V ID+A+
Sbjct: 342 AVGGLREHGVAGNDYE-AAIKAVNAGVDSDLGTNVYAEQLVAAVKRGDVAVATIDKAVRR 400
Query: 388 LFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKN 447
+ S++ ++GLF+ DP + K V +SEH LA + ARQ IVLLKN K LPL K+
Sbjct: 401 ILSLKFQMGLFD-DPFVDE--KQAVQLVASSEHTGLAREVARQSIVLLKNKDKLLPLKKD 457
Query: 448 AVSSLAIIGPLVNNISQMGGGYTGIPC--SPKSLLRGLEAYVSKTH---YASGCHDVPCN 502
+ +LA+IGP +N+ M G YT + ++L G+ VSK YA GC V +
Sbjct: 458 -IRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCA-VRDS 515
Query: 503 SDAGFHEAVRIAKKADFVIVVAG----LDLTQETEDR-------------------DRVS 539
S GF +A+ A+ AD V++V G D + E E+ DR +
Sbjct: 516 SRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMESGEGYDRAT 575
Query: 540 LLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKAL 599
L L G+Q+ L+ ++R K PV+L+ G PL + A +++ W YPG G A+
Sbjct: 576 LHLMGRQLELLEEISRLGK-PVVLI--KGRPLLMEGAIQEAEAIVDAW--YPGMQGGNAV 630
Query: 600 AEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHG 659
A+++FGD+NP GRL ++ P S ++P+ N R +R Y GT Y FG+G
Sbjct: 631 ADVLFGDYNPAGRLTLS-VPRSVGQLPVY-YNTRRKGNRS----RYVEEPGTPRYPFGYG 684
Query: 660 LSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQIS 719
LSYT +SY +D + GS D +D V ++
Sbjct: 685 LSYTTFSY------------------TDMKVQVTEGS--DDCWVD-----------VTVT 713
Query: 720 VTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIA 779
+ N G DG V L+ R S TP KQL F R+H A S+E++F +D + L++
Sbjct: 714 IQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTLDK-KSLALY 772
Query: 780 NKHGRRILPLGNHVLMVG 797
+ G ++ G +MVG
Sbjct: 773 MQEGEWVVEPGRFTIMVG 790
>gi|385776908|ref|YP_005649476.1| glycoside hydrolase family protein [Sulfolobus islandicus REY15A]
gi|323475656|gb|ADX86262.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
REY15A]
Length = 754
Score = 275 bits (702), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 212/711 (29%), Positives = 335/711 (47%), Gaps = 123/711 (17%)
Query: 121 SVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRG 180
S T+FPQ + A+++N L +I S + +AR + + Q +P +++ +DPRWGR
Sbjct: 100 SSTAFPQAIGLASTWNLELVMDIASVIRSQAR-LVGVNQC----LSPVLDVCKDPRWGRC 154
Query: 181 QETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLS 240
+ET GEDP +V++ + ++ QG+N L
Sbjct: 155 EETYGEDPYLVASMGLAYITGLQGDN-------------------------------QLV 183
Query: 241 ACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNG 300
A KH A+ + G R + ++ +TF PF ++ GK IM +Y++++G
Sbjct: 184 ATAKHFAAHGFPEGG---RNIAQVHVGNRELRETFLFPFEVAVKIGKVMSIMPAYHEIDG 240
Query: 301 VPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINC 359
+P L R EWGF G + SD D + + ++A L++G+DI
Sbjct: 241 IPCHGNPQLLTNILRQEWGFDGIVVSDYDGIRQLETIHRVASNKMEAAILALESGVDIEF 300
Query: 360 GT--CMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCT 417
T C +A+ +G V E IDRA+ + ++ RLGL + +P + P+ +
Sbjct: 301 PTIDCYSEPLVNALTEGLVPESLIDRAVERVLRIKDRLGLLD-NPFVNENSV--PEKLDD 357
Query: 418 SEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT------- 470
+ ++LAL AR+ IVLLKN+ LPL+KN V+ +A+IGP N+ M G YT
Sbjct: 358 HKSRELALKTARESIVLLKNENNILPLSKN-VNKIAVIGPNANDPRNMLGDYTYTGHLNI 416
Query: 471 --GIPCSPKSLLRGLEAYV--SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG- 525
GI ++L+G+ V SK YA GC D+ S GF EA+ IA++AD +I V G
Sbjct: 417 DSGIEIV--TVLQGVVKKVGESKVLYAKGC-DIASESKEGFAEAIEIARQADVIIAVMGE 473
Query: 526 --------LDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTG 567
D+ E E + DR SL LPG Q L+ + +T K P+ILVL
Sbjct: 474 KSGLPLSWTDIPSEEEFKKYQAVTGEGNDRSSLRLPGVQEELLKELYKTGK-PIILVLIN 532
Query: 568 GGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM 627
G PL +S + + +++ +PGE G A+A++IFGD+NP GRLP+T +P ++P+
Sbjct: 533 GRPLVLS--PIINYVKAVIEAWFPGEEGGNAIADVIFGDYNPSGRLPIT-FPMDTGQIPL 589
Query: 628 NDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSA-PSELTISASLKAGS 686
N + S R Y + ++ FG+GLSYT + Y L P E+ G
Sbjct: 590 Y-YNRKPSSF-----RPYVMLRSSPLFTFGYGLSYTQFEYSNLEVTPKEI--------GP 635
Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
+ NI + I V N G ++G VV L+ S P
Sbjct: 636 NSNI------------------------AISIDVKNVGKMEGDDVVQLYVSKTFSSVARP 671
Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
K+L GF ++H + + F + P E L+ + R ++ G + L++G
Sbjct: 672 VKELKGFAKIHLKPGEKRRVKF-ILPTEALAFYDSFMRLVVEKGEYQLLIG 721
>gi|423313768|ref|ZP_17291703.1| hypothetical protein HMPREF1058_02315 [Bacteroides vulgatus
CL09T03C04]
gi|392684303|gb|EIY77631.1| hypothetical protein HMPREF1058_02315 [Bacteroides vulgatus
CL09T03C04]
Length = 788
Score = 274 bits (701), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 220/736 (29%), Positives = 345/736 (46%), Gaps = 129/736 (17%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ + T FP A++N+ L + IG V
Sbjct: 138 RLGIPV-DFTNEGIRGLCHD-----------RATYFPAQCGQGATWNKKLIARIGE---V 182
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
EA+ LG + ++P ++I +DPRWGR ET GEDP +V + + S Q N
Sbjct: 183 EAKEAVALGYTNI--YSPILDIAQDPRWGRCVETYGEDPYLVGELGKQMITSLQKYN--- 237
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
L A KH Y + G + + + +
Sbjct: 238 -----------------------------LVATPKHFAVYSIPIGGRDGKTRTDPHVAPR 268
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGD---LFQKARNEWGFKGYITSD 326
+ + PFR ++ A +M SYN +G P + G L + R EWGFKGY+ SD
Sbjct: 269 EMRTLYIEPFRMAFQEAGALGVMSSYNDYDGEP--ITGSYHFLTEILRQEWGFKGYVVSD 326
Query: 327 CDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTC----MLRHTQSAIDKGKVQEKDID 382
+AV I T+ED A + AG++I + + A+D GK+ ++ +D
Sbjct: 327 SEAVEFISNKHKVADTYEDGIAQAVNAGLNIRTHFTPPADFILPLRKAVDNGKISQETLD 386
Query: 383 RALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFL 442
+ + + ++ RLGLF+ +P +G GK V + EH+ ++L+AARQ +VLLKN+ L
Sbjct: 387 KRVAEILRIKFRLGLFD-NPYRGN-GKQAEQIVHSKEHQAVSLEAARQSLVLLKNETNLL 444
Query: 443 PLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGCH--- 497
PL+K ++ S+A+IGP N +Q+ Y K++ +G++ + ++ Y GC
Sbjct: 445 PLSK-SIRSIAVIGPNANEQTQLICRYGPANAPIKTVYQGIKELLPHAEVIYKKGCDIID 503
Query: 498 ---------DVPCNSDAG--FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQ 546
D P ++ EA+R AK+A+ V++V G + EDR R SL LPG+Q
Sbjct: 504 PHFPESEILDFPKTAEEVQLMEEAIRAAKQAEVVVMVLGGNELTVREDRSRTSLNLPGRQ 563
Query: 547 MSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD 606
L+ +V T K PVILV+ G +++A A + +IL +PGE +A+AE +FGD
Sbjct: 564 EELLKAVCATGK-PVILVMLDGRASSINYAAA--HVPAILHAWFPGEFCGQAVAEALFGD 620
Query: 607 FNPGGRLPMTWYPESFTKVPMN-DMNMRADSSRQYPGRSYRFYTGTQVYG----FGHGLS 661
+NPGGRL +T +P+S ++P +D S + T VYG FGHGLS
Sbjct: 621 YNPGGRLAVT-FPKSVGQIPFAFPFKPGSDES-----------SSTSVYGALYPFGHGLS 668
Query: 662 YTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVT 721
YT ++Y S+L IS S G + D HV +
Sbjct: 669 YTTFTY------SDLHISPS-----------HQGVQGD--------------IHVSCKIK 697
Query: 722 NAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANK 781
N G + G VV L+ R S T K L GF+R+ A + + F + P + L + +K
Sbjct: 698 NTGKIKGDEVVQLYLRDEISSVTTYTKVLRGFERISLKAGEEQTVHFRLRP-QDLGLWDK 756
Query: 782 HGRRILPLGNHVLMVG 797
+ + G+ +M+G
Sbjct: 757 NMNFRVEPGSFKVMLG 772
>gi|383113360|ref|ZP_09934132.1| hypothetical protein BSGG_3064 [Bacteroides sp. D2]
gi|382948727|gb|EFS32364.2| hypothetical protein BSGG_3064 [Bacteroides sp. D2]
Length = 954
Score = 274 bits (701), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 226/780 (28%), Positives = 361/780 (46%), Gaps = 130/780 (16%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDN--ASAIPRLGIPAYEWWSESLHGIASNGP 111
+ + SL + R +SL++++T ++K++ + + IP L +P E++HG +
Sbjct: 170 YMDASLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSYG-- 226
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
S T FPQ L A++NR L + + E A N QA W+P +++
Sbjct: 227 --------SGATIFPQALAMGATWNRKLTEEVAMVIGDETVAA-NTKQA----WSPVLDV 273
Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
+D RWGR +ET GEDP++VS ++K +Q
Sbjct: 274 AQDARWGRCEETFGEDPVLVSQIGGAWIKGYQS--------------------------- 306
Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
RG L KH + G R S + ++E++ + PFR I +
Sbjct: 307 -RG----LFTTPKHFGGHGAPLGG---RDSHDIGLSEREMREVHLVPFRHAIRNYDCQSL 358
Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG- 349
M +Y+ G+P +L Q+ R EWGF G+I SDC A+ + ++YT + AA
Sbjct: 359 MMAYSDYMGIPVAKSTELLQQILRQEWGFNGFIVSDCGAIGNLTARKHYTAQDKIEAANQ 418
Query: 350 VLKAGMDINCG-TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG-KY 407
L AG+ NCG T + A G++ +++D + S R LF +P K +
Sbjct: 419 ALAAGIATNCGDTYNNKEVIQAAKDGRINMENLDNVCRTMLSTMFRNELFEKNPCKPLDW 478
Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
K+ P + HK++A AAR+ IV+L+N + LPL KN + ++A++GP +++ G
Sbjct: 479 KKIYPG-WNSDSHKEMARQAARESIVMLENKENLLPLTKN-LRTIAVLGPGADDLQP--G 534
Query: 468 GYTG--IPCSPKSLLRGLEAYV---SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIV 522
YT +P KS+L G++ V +K Y GC D + +AV+ A ++D V++
Sbjct: 535 DYTPKLLPGQLKSVLTGIKEAVGKQTKVLYEQGC-DFTNPDETNIPKAVKAASQSDVVVM 593
Query: 523 VAGLDLTQET---------EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
V G T E E+ D +L+LPG+Q L+ +V T K PVIL+L G P D+
Sbjct: 594 VLGDCSTSEATNDVRKTCGENNDWATLILPGKQQELLEAVCATGK-PVILILQAGRPYDI 652
Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMR 633
+A +IL PG+ G A+A+++FGD+NPGGRLPMT +P ++P+ N +
Sbjct: 653 --LKASEMCKAILVNWLPGQEGGPAMADVLFGDYNPGGRLPMT-FPRHVGQLPLY-YNFK 708
Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
S R+Y +Y +Y FG GLSYT++ Y SD I ++
Sbjct: 709 T-SGRRYEYVDMEYYP---LYRFGFGLSYTSFEY------------------SDLKIQEK 746
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
+ VQ +V N G G V L+ S T +L F
Sbjct: 747 PNGNVT----------------VQATVKNIGSRAGDEVAQLYVTDMYASVKTRVMELKDF 790
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG----------ELRHSL 803
DR++ SK +SF + P + +S+ N H R++ G + VG E++HS+
Sbjct: 791 DRIYLQPGESKTVSFELTPYD-ISLLNDHMDRVVEKGEFKICVGGMSPDYVAKNEIKHSV 849
>gi|15899739|ref|NP_344344.1| Beta-xylosidase [Sulfolobus solfataricus P2]
gi|13816430|gb|AAK43134.1| Beta-xylosidase [Sulfolobus solfataricus P2]
Length = 754
Score = 274 bits (701), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 213/711 (29%), Positives = 332/711 (46%), Gaps = 123/711 (17%)
Query: 121 SVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRG 180
S T+FPQ + A+++N L +N+ S + + R + + Q +P +++ RDPRWGR
Sbjct: 100 SSTAFPQAIGLASTWNPELLTNVASTIRSQGR-LIGVNQC----LSPVLDVCRDPRWGRC 154
Query: 181 QETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLS 240
+ET GEDP +V++ + ++ QGE L
Sbjct: 155 EETYGEDPYLVASMGLAYITGLQGET-------------------------------QLV 183
Query: 241 ACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNG 300
A KH A+ + G R + ++ +TF PF ++ GK IM +Y++++G
Sbjct: 184 ATAKHFAAHGFPEGG---RNIAQVHVGNRELRETFLFPFEVAVKIGKVMSIMPAYHEIDG 240
Query: 301 VPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINC 359
VP L R EWGF G + SD D + + ++A L++G+DI
Sbjct: 241 VPCHGNPQLLTNILRQEWGFDGIVVSDYDGIRQLEAIHKVASNKMEAAILALESGVDIEF 300
Query: 360 GT--CMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCT 417
T C +AI +G V E IDRA+ + ++ RLGL + +P + P+ +
Sbjct: 301 PTIDCYGEPLVTAIKEGLVSEAIIDRAVERVLRIKERLGLLD-NPFVDESAV--PERLDD 357
Query: 418 SEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT------- 470
+ ++LAL AAR+ IVLLKN+ LPL+KN ++ +A+IGP N+ M G YT
Sbjct: 358 RKSRELALKAARESIVLLKNENNMLPLSKN-INKIAVIGPNANDPRNMLGDYTYTGHLNI 416
Query: 471 --GIPCSPKSLLRGLEAYVS--KTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG- 525
GI ++L+G+ V K YA GC D+ S GF EA+ IAK+AD +I V G
Sbjct: 417 DSGIEIV--TVLQGIAKKVGEGKVLYAKGC-DIAGESKEGFSEAIEIAKQADVIIAVMGE 473
Query: 526 --------LDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTG 567
D+ E E + DR SL L G Q L+ + +T K P+ILVL
Sbjct: 474 KSGLPLSWTDIPSEEEFKKYQAVTGEGNDRASLRLLGVQEELLKELYKTGK-PIILVLIN 532
Query: 568 GGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM 627
G PL +S + + +I+ +PGE G A+A+IIFGD+NP GRLP+T +P ++P+
Sbjct: 533 GRPLVLS--PIINYVKAIIEAWFPGEEGGNAIADIIFGDYNPSGRLPIT-FPMDTGQIPL 589
Query: 628 NDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSA-PSELTISASLKAGS 686
S + R Y + ++ FG+GLSYT + Y L P E+
Sbjct: 590 ------YYSRKPSSFRPYVMLHSSPLFTFGYGLSYTQFEYSNLEVTPKEV---------- 633
Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
L Y+ I + V N G+++G VV L+ S P
Sbjct: 634 ---------GPLSYITI-------------LLDVKNVGNMEGDEVVQLYISKSFSSVARP 671
Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
K+L GF +VH + + F + P E L+ + R ++ G + +++G
Sbjct: 672 VKELKGFAKVHLKPGEKRRVKFAL-PMEALAFYDNFMRLVVEKGEYQILIG 721
>gi|284174578|ref|ZP_06388547.1| Beta-xylosidase [Sulfolobus solfataricus 98/2]
gi|356934752|gb|AET42953.1| beta-xylosidase-like protein [Sulfolobus solfataricus 98/2]
Length = 754
Score = 274 bits (701), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 213/711 (29%), Positives = 332/711 (46%), Gaps = 123/711 (17%)
Query: 121 SVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRG 180
S T+FPQ + A+++N L +N+ S + + R + + Q +P +++ RDPRWGR
Sbjct: 100 SSTAFPQAIGLASTWNPELLTNVASTIRSQGR-LIGVNQC----LSPVLDVCRDPRWGRC 154
Query: 181 QETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLS 240
+ET GEDP +V++ + ++ QGE L
Sbjct: 155 EETYGEDPYLVASMGLAYITGLQGET-------------------------------QLV 183
Query: 241 ACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNG 300
A KH A+ + G R + ++ +TF PF ++ GK IM +Y++++G
Sbjct: 184 ATAKHFAAHGFPEGG---RNIAQVHVGNRELRETFLFPFEVAVKIGKVMSIMPAYHEIDG 240
Query: 301 VPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINC 359
VP L R EWGF G + SD D + + ++A L++G+DI
Sbjct: 241 VPCHGNPQLLTNILRQEWGFDGIVVSDYDGIRQLEAIHKVASNKMEAAILALESGVDIEF 300
Query: 360 GT--CMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCT 417
T C +AI +G V E IDRA+ + ++ RLGL + +P + P+ +
Sbjct: 301 PTIDCYGEPLVTAIKEGLVSEAIIDRAVERVLRIKERLGLLD-NPFVDESAV--PERLDD 357
Query: 418 SEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT------- 470
+ ++LAL AAR+ IVLLKN+ LPL+KN ++ +A+IGP N+ M G YT
Sbjct: 358 RKSRELALKAARESIVLLKNENNMLPLSKN-INKIAVIGPNANDPRNMLGDYTYTGHLNI 416
Query: 471 --GIPCSPKSLLRGLEAYVS--KTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG- 525
GI ++L+G+ V K YA GC D+ S GF EA+ IAK+AD +I V G
Sbjct: 417 DSGIEIV--TVLQGIAKKVGEGKVLYAKGC-DIAGESKEGFSEAIEIAKQADVIIAVMGE 473
Query: 526 --------LDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTG 567
D+ E E + DR SL L G Q L+ + +T K P+ILVL
Sbjct: 474 KSGLPLSWTDIPSEEEFKKYQAVTGEGNDRASLRLLGVQEELLKELYKTGK-PIILVLIN 532
Query: 568 GGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM 627
G PL +S + + +I+ +PGE G A+A+IIFGD+NP GRLP+T +P ++P+
Sbjct: 533 GRPLVLS--PIINYVKAIIEAWFPGEEGGNAIADIIFGDYNPSGRLPIT-FPMDTGQIPL 589
Query: 628 NDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSA-PSELTISASLKAGS 686
S + R Y + ++ FG+GLSYT + Y L P E+
Sbjct: 590 ------YYSRKPSSFRPYVMLHSSPLFTFGYGLSYTQFEYSNLEVTPKEV---------- 633
Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
L Y+ I + V N G+++G VV L+ S P
Sbjct: 634 ---------GPLSYITI-------------LLDVKNVGNMEGDEVVQLYISKSFSSVARP 671
Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
K+L GF +VH + + F + P E L+ + R ++ G + +++G
Sbjct: 672 VKELKGFAKVHLKPGEKRRVKFAL-PMEALAFYDNFMRLVVEKGEYQILIG 721
>gi|333379224|ref|ZP_08470948.1| hypothetical protein HMPREF9456_02543 [Dysgonomonas mossii DSM
22836]
gi|332885492|gb|EGK05741.1| hypothetical protein HMPREF9456_02543 [Dysgonomonas mossii DSM
22836]
Length = 745
Score = 274 bits (701), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 211/692 (30%), Positives = 337/692 (48%), Gaps = 97/692 (14%)
Query: 123 TSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQ 181
T FP L + S++ A EA A G+ + +AP ++I RDPRWGR
Sbjct: 120 TIFPIPLAESCSWDLEAMERSARIAASEATA------EGINWIYAPMVDISRDPRWGRVA 173
Query: 182 ETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSA 241
E GED + S A VK FQG+N + + + A
Sbjct: 174 EGAGEDVYLGSLIAAARVKGFQGDNLSAVN--------------------------TVVA 207
Query: 242 CCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNG 300
C KH AY G +N + ++ + +T+ PPF++ ++ G IM S+N +NG
Sbjct: 208 CVKHYAAYGATMAGR----DYNTVDMSLNELWNTYLPPFKAALDAG-CGTIMTSFNDLNG 262
Query: 301 VPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINC 359
+PA L + R++W F G++ +D ++ + + Y + SA + AG+D++
Sbjct: 263 IPATGNKYLLKDILRDKWNFNGFVVTDYTSINEMIPH-GYANDEKHSAEIAMNAGVDMDM 321
Query: 360 -GTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTS 418
G + H ++ I++GKV EKD+ A + ++ +LGLF DP + D+ T
Sbjct: 322 QGGVYMNHLKTLIEEGKVSEKDVTEAARAILKIKYKLGLFE-DPYRYCDANREKTDILTP 380
Query: 419 EHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGI---PCS 475
+K+ A D AR+ +VLLKNDK+ LPL +N +A+IGPLV + ++ G ++ +
Sbjct: 381 ANKEAARDMARKSMVLLKNDKQTLPLKEN--KRVALIGPLVKDKYEILGCWSAMGNRDTI 438
Query: 476 PKSLLRGLEAYV--SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETE 533
P S+ GL + K YA GC D+ GF EAVR+A +D V++V G E
Sbjct: 439 PVSVYDGLVEAIGKDKISYAKGC-DIQSEDTKGFAEAVRVASASDVVVMVMGEFHNMSGE 497
Query: 534 DRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGE 593
+ R +L LPG Q+ L+ ++ +T K PV+LVL G PL +++ E D+ + +IL +PG
Sbjct: 498 NNSRTNLSLPGVQVDLLKAIKKTGK-PVVLVLMNGRPLTINW-EKDN-LDAILEAWFPGT 554
Query: 594 AGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQ- 652
G A+A+++ G +NP G+L MT +P++ ++P+ ++ R Y +F G++
Sbjct: 555 MGGAAIADVLTGKYNPSGKLTMT-FPQNVGQIPL--FYNHKNTGRPYDPNVPQFAYGSRY 611
Query: 653 -------VYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDE 705
+Y FG+GLSYT ++Y S+LT+S+ E
Sbjct: 612 WDVSNEPLYPFGYGLSYTTFTY------SDLTLSSK-----------------------E 642
Query: 706 VTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKE 765
+T L+ V + +TN+G+ DG VV L+ R S P K+L GF +V A SK
Sbjct: 643 ITKENPLK--VSVKLTNSGEYDGEEVVQLYTRDLVGSVTRPVKELKGFKKVFLKAGESKV 700
Query: 766 ISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
I F + L N + G+ L VG
Sbjct: 701 IDFTLS-VNDLRFYNSQLEYVYEPGDFHLFVG 731
>gi|284998833|ref|YP_003420601.1| glycoside hydrolase family protein [Sulfolobus islandicus L.D.8.5]
gi|284446729|gb|ADB88231.1| glycoside hydrolase, family 3 domain protein [Sulfolobus islandicus
L.D.8.5]
Length = 754
Score = 274 bits (700), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 210/711 (29%), Positives = 335/711 (47%), Gaps = 123/711 (17%)
Query: 121 SVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRG 180
S T+FPQ + A+++N L +I S + + R + + Q +P +++ +DPRWGR
Sbjct: 100 SSTAFPQAIGLASTWNPELVMDIASVIRSQGR-LVGVNQC----LSPVLDVCKDPRWGRC 154
Query: 181 QETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLS 240
+ET GEDP +V++ + ++ QG+N L
Sbjct: 155 EETYGEDPYLVASMGLAYITGLQGDN-------------------------------QLV 183
Query: 241 ACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNG 300
A KH A+ + G R + ++ +TF PF ++ GK IM +Y++++G
Sbjct: 184 ATAKHFAAHGFPEGG---RNIAQVHVGNRELRETFLFPFEVAVKIGKVMSIMPAYHEIDG 240
Query: 301 VPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINC 359
+P L R EWGF G + SD D + + ++A L++G+DI
Sbjct: 241 IPCHGNPQLLTNILRQEWGFDGIVVSDYDGIRQLETIHRVASNKMEAAILALESGVDIEF 300
Query: 360 GT--CMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCT 417
T C +A+ +G V E IDRA+ + ++ RLGL + +P + P+ +
Sbjct: 301 PTIDCYGEPLVNALKEGLVPESLIDRAVERVLRIKDRLGLLD-NPFVNENSV--PEKLDD 357
Query: 418 SEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT------- 470
+ ++LAL AR+ IVLLKN+ LPL+KN V+ +A+IGP N+ M G YT
Sbjct: 358 HKSRELALKTARESIVLLKNENNILPLSKN-VNKIAVIGPNANDPRNMLGDYTYTGHLNI 416
Query: 471 --GIPCSPKSLLRGLEAYV--SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG- 525
GI ++L+G+ V SK YA GC D+ S GF EA+ IA++AD +I + G
Sbjct: 417 DSGIEIV--TVLQGIVKKVGESKVLYAKGC-DIASESKEGFAEAIEIARQADVIIAIMGE 473
Query: 526 --------LDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTG 567
+D+ E E + DR SL LPG Q L+ + +T K P+ILVL
Sbjct: 474 KSGLPLSWMDIPSEEEFKKYQAVTGEGNDRSSLRLPGVQEELLKELYKTGK-PIILVLIN 532
Query: 568 GGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM 627
G PL +S + + +++ +PGE G A+A++IFGD+NP GRLP+T +P ++P+
Sbjct: 533 GRPLVLS--SIINYVKAVIEAWFPGEEGGNAIADVIFGDYNPSGRLPIT-FPMDTGQIPL 589
Query: 628 NDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSA-PSELTISASLKAGS 686
N + S R Y + ++ FG+GLSYT + Y L P E+ G
Sbjct: 590 Y-YNRKPSSF-----RPYVMLRSSPLFTFGYGLSYTQFEYSNLEVTPKEI--------GP 635
Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
+ NI + I V N G ++G VV L+ S P
Sbjct: 636 NSNI------------------------AISIDVKNVGKMEGDDVVQLYVSKTFSSVARP 671
Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
K+L GF ++H + + F + P E L+ + R ++ G + L++G
Sbjct: 672 VKELKGFAKIHLKPGEKRRVKF-ILPTEALAFYDSFMRLVVEKGEYQLLIG 721
>gi|299149395|ref|ZP_07042452.1| periplasmic beta-glucosidase [Bacteroides sp. 3_1_23]
gi|298512582|gb|EFI36474.1| periplasmic beta-glucosidase [Bacteroides sp. 3_1_23]
Length = 950
Score = 274 bits (700), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 226/780 (28%), Positives = 361/780 (46%), Gaps = 130/780 (16%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDN--ASAIPRLGIPAYEWWSESLHGIASNGP 111
+ + SL + R +SL++++T ++K++ + + IP L +P E++HG +
Sbjct: 166 YMDASLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSYG-- 222
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
S T FPQ L A++NR L + + E A N QA W+P +++
Sbjct: 223 --------SGATIFPQALAMGATWNRKLTEEVAMVIGDETVAA-NTKQA----WSPVLDV 269
Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
+D RWGR +ET GEDP++VS ++K +Q
Sbjct: 270 AQDARWGRCEETFGEDPVLVSQIGGAWIKGYQS--------------------------- 302
Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
RG L KH + G R S + ++E++ + PFR I +
Sbjct: 303 -RG----LFTTPKHFGGHGAPLGG---RDSHDIGLSEREMREVHLVPFRHAIRNYDCQSL 354
Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG- 349
M +Y+ G+P +L Q+ R EWGF G+I SDC A+ + ++YT + AA
Sbjct: 355 MMAYSDYMGIPVAKSTELLQQILRQEWGFNGFIVSDCGAIGNLTARKHYTAQDKIEAANQ 414
Query: 350 VLKAGMDINCG-TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG-KY 407
L AG+ NCG T + A G++ +++D + S R LF +P K +
Sbjct: 415 ALAAGIATNCGDTYNNKEVIQAAKDGRINMENLDNVCRTMLSTMFRNELFEKNPCKPLDW 474
Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
K+ P + HK++A AAR+ IV+L+N + LPL KN + ++A++GP +++ G
Sbjct: 475 KKIYPG-WNSDSHKEMARQAARESIVMLENKENLLPLTKN-LRTIAVLGPGADDLQP--G 530
Query: 468 GYTG--IPCSPKSLLRGLEAYV---SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIV 522
YT +P KS+L G++ V +K Y GC D + +AV+ A ++D V++
Sbjct: 531 DYTPKLLPGQLKSVLTGIKEAVGKQTKVLYEQGC-DFTNPDETNIPKAVKAASQSDVVVM 589
Query: 523 VAGLDLTQET---------EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
V G T E E+ D +L+LPG+Q L+ +V T K PVIL+L G P D+
Sbjct: 590 VLGDCSTSEATNDVRKTCGENNDWATLILPGKQQELLEAVCATGK-PVILILQAGRPYDI 648
Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMR 633
+A +IL PG+ G A+A+++FGD+NPGGRLPMT +P ++P+ N +
Sbjct: 649 --LKASEMCKAILVNWLPGQEGGPAMADVLFGDYNPGGRLPMT-FPRHVGQLPLY-YNFK 704
Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
S R+Y +Y +Y FG GLSYT++ Y SD I ++
Sbjct: 705 T-SGRRYEYVDMEYYP---LYRFGFGLSYTSFEY------------------SDLKIQEK 742
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
+ VQ +V N G G V L+ S T +L F
Sbjct: 743 PNGNVT----------------VQATVKNIGSRAGDEVAQLYVTDMYASVKTRVMELKDF 786
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG----------ELRHSL 803
DR++ SK +SF + P + +S+ N H R++ G + VG E++HS+
Sbjct: 787 DRIYLQPGESKTVSFELTPYD-ISLLNDHMDRVVEKGEFKICVGGMSPDYVAKNEIKHSV 845
>gi|188993706|ref|YP_001905716.1| beta-glucosidase [Xanthomonas campestris pv. campestris str. B100]
gi|167735466|emb|CAP53681.1| exported beta-glucosidase [Xanthomonas campestris pv. campestris]
Length = 896
Score = 274 bits (700), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 159/462 (34%), Positives = 242/462 (52%), Gaps = 54/462 (11%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
P+ + + + RA L+S +TL+EK Q+ + A AIPRL +P Y+WW+E+LHG+A G
Sbjct: 39 PYLDPTQPLQARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPEYDWWNEALHGVARAG-- 96
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG--------LTF 164
T FPQ + AA+F+ L + + +A++ EARA ++ AG LTF
Sbjct: 97 --------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLAGGEHKRYQGLTF 148
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
W+PNINIFRDPRWGRGQET GEDP + + V FV+ Q +
Sbjct: 149 WSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQ------------------- 189
Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIE 284
+G L A KH Y + R+ F+ +E+D +T+ P F++ ++
Sbjct: 190 --------QGPYRKLDATAKH---YAVHSGPEADRHHFDVHPSERDLYETYLPAFQALVQ 238
Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
+G + +M +YN+VNG A L R +WGF GYI SDC A+ I++ T E
Sbjct: 239 EGHVAAVMGAYNRVNGESASASTRLEGILRRDWGFDGYIVSDCAAIRDIWQNHKIVPTPE 298
Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
+AA +K G D++CG +A+ G + E IDR+L L + +LRLG+F+ P K
Sbjct: 299 AAAALGVKHGTDLDCGDTYA-ALPAAVRAGLIDEATIDRSLTRLMAARLRLGMFD-PPAK 356
Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
+ ++ + +H LA AR+ +VLLKND LPL K + +A++GP ++
Sbjct: 357 VPWAQIPASANQSPQHDALARRTARESLVLLKND-GLLPL-KPTLKRIAVVGPTADDPMS 414
Query: 465 MGGGYTGIPCSPKSLLRGLE--AYVSKTHYASGCHDVPCNSD 504
+ G Y G P +P ++L+G+ A ++ YA G V D
Sbjct: 415 LLGNYYGTPAAPVTILQGIRDAAPQAEVVYARGSDLVEGRED 456
Score = 124 bits (312), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 85/286 (29%), Positives = 124/286 (43%), Gaps = 52/286 (18%)
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRD----------RVSLLLPGQQMSLVTSVART 556
EAV A+ AD V+ V GL E E+ D R LP Q L+ ++ T
Sbjct: 620 LQEAVDAARNADVVVFVGGLTGDVEGEEMDVNYPGFAGGDRTDTRLPKPQRELLQALQAT 679
Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
PV+ VLT G L + +A+ + +IL YPG+ G A+ +++FG +PGGRLP+T
Sbjct: 680 GT-PVVAVLTTGSALAIDWAQ--QHVPAILLAWYPGQRGGTAVGDVLFGQASPGGRLPIT 736
Query: 617 WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
+Y E +D MR GR+YR++ G +Y FGHGL+YT ++Y L
Sbjct: 737 FYKEDERLPAFDDYAMR--------GRTYRYFDGKPLYPFGHGLAYTQFAYSNL------ 782
Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
+D T +SV N G G VV L+
Sbjct: 783 -------------------------RLDRTTVAADGTLRATVSVKNTGQRAGDEVVQLYL 817
Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
+ K+L GF R+ +E+SF + P E L I ++
Sbjct: 818 HPLNPQRERARKELRGFQRITLQPGEHREVSFNITPREALRIYDEQ 863
>gi|336417087|ref|ZP_08597416.1| hypothetical protein HMPREF1017_04524 [Bacteroides ovatus
3_8_47FAA]
gi|335936712|gb|EGM98630.1| hypothetical protein HMPREF1017_04524 [Bacteroides ovatus
3_8_47FAA]
Length = 954
Score = 274 bits (700), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 226/780 (28%), Positives = 361/780 (46%), Gaps = 130/780 (16%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDN--ASAIPRLGIPAYEWWSESLHGIASNGP 111
+ + SL + R +SL++++T ++K++ + + IP L +P E++HG +
Sbjct: 170 YMDASLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSYG-- 226
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
S T FPQ L A++NR L + + E A N QA W+P +++
Sbjct: 227 --------SGATIFPQALAMGATWNRKLTEEVAMVIGDETVAA-NTKQA----WSPVLDV 273
Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
+D RWGR +ET GEDP++VS ++K +Q
Sbjct: 274 AQDARWGRCEETFGEDPVLVSQIGGAWIKGYQS--------------------------- 306
Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
RG L KH + G R S + ++E++ + PFR I +
Sbjct: 307 -RG----LFTTPKHFGGHGAPLGG---RDSHDIGLSEREMREVHLVPFRHAIRNYDCQSL 358
Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG- 349
M +Y+ G+P +L Q+ R EWGF G+I SDC A+ + ++YT + AA
Sbjct: 359 MMAYSDYMGIPVAKSTELLQQILRQEWGFNGFIVSDCGAIGNLTARKHYTAQDKIEAANQ 418
Query: 350 VLKAGMDINCG-TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG-KY 407
L AG+ NCG T + A G++ +++D + S R LF +P K +
Sbjct: 419 ALAAGIATNCGDTYNNKEVIQAAKDGRIDMENLDNVCRTMLSTMFRNELFEKNPCKPLDW 478
Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
K+ P + HK++A AAR+ IV+L+N + LPL KN + ++A++GP +++ G
Sbjct: 479 KKIYPG-WNSDSHKEMARQAARESIVMLENKENLLPLTKN-LRTIAVLGPGADDLQP--G 534
Query: 468 GYTG--IPCSPKSLLRGLEAYV---SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIV 522
YT +P KS+L G++ V +K Y GC D + +AV+ A ++D V++
Sbjct: 535 DYTPKLLPGQLKSVLTGIKEAVGKQTKVLYEQGC-DFTNPDETNIPKAVKAASQSDVVVM 593
Query: 523 VAGLDLTQET---------EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
V G T E E+ D +L+LPG+Q L+ +V T K PVIL+L G P D+
Sbjct: 594 VLGDCSTSEATNDVRKTCGENNDWATLILPGKQQELLEAVCATGK-PVILILQAGRPYDI 652
Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMR 633
+A +IL PG+ G A+A+++FGD+NPGGRLPMT +P ++P+ N +
Sbjct: 653 --LKASEMCKAILVNWLPGQEGGPAMADVLFGDYNPGGRLPMT-FPRHVGQLPLY-YNFK 708
Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
S R+Y +Y +Y FG GLSYT++ Y SD I ++
Sbjct: 709 T-SGRRYEYVDMEYYP---LYRFGFGLSYTSFEY------------------SDLKIQEK 746
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
+ VQ +V N G G V L+ S T +L F
Sbjct: 747 PNGNVT----------------VQATVKNIGSRAGDEVAQLYVTDMYASVKTRVMELKDF 790
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG----------ELRHSL 803
DR++ SK +SF + P + +S+ N H R++ G + VG E++HS+
Sbjct: 791 DRIYLQPGESKTVSFELTPYD-ISLLNDHMDRVVEKGEFKICVGGMSPDYVAKNEIKHSV 849
>gi|197106390|ref|YP_002131767.1| glucan 1,4-beta-glucosidase [Phenylobacterium zucineum HLK1]
gi|196479810|gb|ACG79338.1| glucan 1,4-beta-glucosidase [Phenylobacterium zucineum HLK1]
Length = 888
Score = 274 bits (700), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 169/467 (36%), Positives = 238/467 (50%), Gaps = 71/467 (15%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ +T L RA L++ +TL+EK +Q+ A AIPRLG+PAY WW+E LHG+A G
Sbjct: 38 YRDTRLPAERRAADLVARMTLEEKSRQIGHTAPAIPRLGVPAYNWWNEGLHGVARAG--- 94
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY---------NLGQAGLTF 164
T FPQ + AA+++ + E RA Y GLT
Sbjct: 95 -------IATVFPQAIGMAATWDVDRMRGTADVIGTEFRAKYAERVHPDGSTDWYRGLTV 147
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
W+PNINIFRDPRWGRGQET GEDP + V F++ QG++ F+
Sbjct: 148 WSPNINIFRDPRWGRGQETYGEDPYLTGRMGVAFIRGLQGQDPNF--------FKTIATA 199
Query: 225 KGF----GEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFR 280
K + G ES+R E + H AYDL EDT+ P FR
Sbjct: 200 KHYAVHSGPESNRHREDV------HPSAYDL--------------------EDTYLPAFR 233
Query: 281 SCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQN- 338
+ + +GK +MC+YN V+GVPAC DL Q+ R +WGF G++ SDC A A I+ +
Sbjct: 234 AAVTEGKVQAVMCAYNAVDGVPACASEDLMDQRLRRDWGFSGHVVSDCGAAANIYREDSL 293
Query: 339 -YTKTHEDSAAGVLKAGMDINCG------TCMLRHTQSAIDKGKVQEKDIDRALLNLFSV 391
Y KT E+ L AGMD+ CG T SA+ KG + E +D AL+ LF+
Sbjct: 294 AYVKTPEEGITRALNAGMDLVCGDYRADWNTEAEATVSAVRKGMLDETVLDGALVRLFAD 353
Query: 392 QLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSS 451
++RLGLF+ P + + K+ T EH+ ++L+ A+ + LLKND LPL K
Sbjct: 354 RIRLGLFD-PPAEVPFSKITAAQNDTPEHRAMSLEMAKASMTLLKND-GVLPL-KGEPRR 410
Query: 452 LAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGC 496
+A++GP +++ + G Y G P +P ++L G+ A K YA G
Sbjct: 411 IAVVGPNADSVDALIGNYYGTPSNPVTVLAGIRARFPKAEVVYAEGT 457
Score = 121 bits (303), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 95/290 (32%), Positives = 135/290 (46%), Gaps = 54/290 (18%)
Query: 518 DFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTG 567
D V+ V GL E E+ DR SL LP Q L+ + T K PV+LVL
Sbjct: 613 DLVVFVGGLTARVEGEEMKLQVPGFAGGDRTSLDLPAPQQDLLRRLHATGK-PVVLVLMN 671
Query: 568 GGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM 627
G L V++A D+ + +I+ YPG G A+A+++ GD++P GRLP+T+Y + P
Sbjct: 672 GSALSVNWA--DANLPAIVEAWYPGGEGGHAVAQLLAGDYSPAGRLPVTFYRSAGDLPPF 729
Query: 628 NDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSD 687
D M+ GR+YR++ G +Y FG+GLSYT +SY AP S S
Sbjct: 730 ADYAMK--------GRTYRYFGGEVLYPFGYGLSYTRFSY---GAPQLSARSVSADG--- 775
Query: 688 KNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPE 747
E+T T VTN G +DG VV L+ P GTP
Sbjct: 776 -----------------EITVTT--------QVTNTGGMDGEEVVQLYVSHPG-RDGTPI 809
Query: 748 KQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
+ L GF R+ ++ +SF + QLS+ + G R + G + VG
Sbjct: 810 RALQGFQRIGLKRGETRPVSFTLK-DRQLSVVDAEGNRRVEPGRVEVWVG 858
>gi|325919363|ref|ZP_08181395.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas gardneri ATCC
19865]
gi|325550152|gb|EGD20974.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas gardneri ATCC
19865]
Length = 876
Score = 273 bits (698), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 159/462 (34%), Positives = 241/462 (52%), Gaps = 54/462 (11%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
P+ +T RA L++ +TL+EK Q+ + A AIPRL +P Y+WW+E+LHG+A G
Sbjct: 19 PYLDTQRPFDARAADLVARMTLEEKAAQMQNAAPAIPRLQVPEYDWWNEALHGVARAG-- 76
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTF 164
T FPQ + AA+F+ L + + +A++ EARA ++ A GLTF
Sbjct: 77 --------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLARGEYKRYQGLTF 128
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
W+PNINIFRDPRWGRGQET GEDP + + V FV+ Q +
Sbjct: 129 WSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQ------------------- 169
Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIE 284
+G L A KH + + R+ F+ +E+D +T+ P F++ ++
Sbjct: 170 --------QGPYRKLDATAKH---FAVHSGPEADRHHFDVHPSERDLHETYLPAFQALVQ 218
Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
+GK + +M +YN+VNG A L R +WGF GYI SDC A+ I++ T E
Sbjct: 219 EGKVAAVMGAYNRVNGESASASTRLEGILRRDWGFDGYIVSDCAAIRDIWQNHKIVPTPE 278
Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
+AA +K G D++CG +A+ G + E ID AL L + ++RLG+F+ P K
Sbjct: 279 AAAALGVKHGTDLDCGDTYA-ALPAAVRAGLIDEATIDTALKRLMTTRMRLGMFD-PPAK 336
Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
+ ++ + +H LA AR+ +VLLKND LPL K + +A+IGP ++
Sbjct: 337 VPWAQIPASANQSPQHDALARRTARESLVLLKND-GVLPL-KPTLKRIAVIGPTADDPMS 394
Query: 465 MGGGYTGIPCSPKSLLRGLE--AYVSKTHYASGCHDVPCNSD 504
+ G Y G P +P ++L+G+ A ++ YA G V D
Sbjct: 395 LLGNYYGTPAAPVTILQGIRDAAPQAQVIYARGSDLVEGRED 436
Score = 113 bits (282), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 85/320 (26%), Positives = 132/320 (41%), Gaps = 59/320 (18%)
Query: 480 LRGLEAYVSKTHYASGCHDVPCN-------SDAGFHEAVRIAKKADFVIVVAGLDLTQET 532
L G +AY + Y D + EAV A+ A+ V+ V GL E
Sbjct: 566 LEGGKAYDLRVEYYEATRDAGVRLAWRMPGAKPPLQEAVDAARDAEVVVFVGGLTGDVEG 625
Query: 533 EDRD----------RVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQI 582
E+ D R LP Q L+ ++ T PV+ VLT G L + +A+ +
Sbjct: 626 EEMDVNYPGFAGGDRTDTRLPKPQRELLQALQATGT-PVVAVLTTGSALAIDWAQ--QHV 682
Query: 583 SSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPG 642
+IL YPG+ G A+ +++FG +PGGRLP+T+Y E+ +D MR G
Sbjct: 683 PAILLAWYPGQRGGSAVGDVLFGQASPGGRLPVTFYKEAERLPAFDDYAMR--------G 734
Query: 643 RSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVH 702
R+YR++ G +Y FGHGLSYT ++Y L
Sbjct: 735 RTYRYFQGKPLYPFGHGLSYTQFAYSDL-------------------------------R 763
Query: 703 IDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG 762
+D T +++ N G G VV L+ K + K+L G R+
Sbjct: 764 LDRTTVAADGTLTATVTLKNTGQRAGDEVVQLYLHPLKPQRERALKELHGLQRITLQPGE 823
Query: 763 SKEISFGVDPCEQLSIANKH 782
+++ F + + L I ++
Sbjct: 824 QRQLRFTIKAQDALRIYDEQ 843
>gi|21233528|ref|NP_639445.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
str. ATCC 33913]
gi|66770493|ref|YP_245255.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
str. 8004]
gi|21115383|gb|AAM43327.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
str. ATCC 33913]
gi|66575825|gb|AAY51235.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
str. 8004]
Length = 896
Score = 273 bits (697), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 159/462 (34%), Positives = 241/462 (52%), Gaps = 54/462 (11%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
P+ + + + RA L+S +TL+EK Q+ + A AIPRL +P Y+WW+E+LHG+A G
Sbjct: 39 PYLDPTQPLQARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPEYDWWNEALHGVARAG-- 96
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG--------LTF 164
T FPQ + AA+F+ L + + +A++ EARA ++ AG LTF
Sbjct: 97 --------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLAGGEHKRYQGLTF 148
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
W+PNINIFRDPRWGRGQET GEDP + + V FV+ Q +
Sbjct: 149 WSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQ------------------- 189
Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIE 284
+G L A KH Y + R+ F+ +E+D +T+ P F++ ++
Sbjct: 190 --------QGPYRKLDATAKH---YAVHSGPEADRHHFDVHPSERDLYETYLPAFQALVQ 238
Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
+G + +M +YN+VNG A L R +WGF GYI SDC A+ I++ T E
Sbjct: 239 EGHVAAVMGAYNRVNGESASASTRLEGILRRDWGFDGYIVSDCAAIRDIWQNHKIVPTPE 298
Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
+AA +K G D++CG +A+ G + E IDR+L L + +LRLG+F+ P K
Sbjct: 299 AAAALGVKHGTDLDCGDTYA-ALPAAVRAGLIDEATIDRSLTRLMAARLRLGMFD-PPAK 356
Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
+ + + +H LA AR+ +VLLKND LPL K + +A++GP ++
Sbjct: 357 VPWAQTPASANQSPQHDALARRTARESLVLLKND-GLLPL-KPTLKRIAVVGPTADDPMS 414
Query: 465 MGGGYTGIPCSPKSLLRGLE--AYVSKTHYASGCHDVPCNSD 504
+ G Y G P +P ++L+G+ A ++ YA G V D
Sbjct: 415 LLGNYYGTPAAPVTILQGIRDAAPQAEVVYARGSDLVEGRED 456
Score = 124 bits (312), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 85/286 (29%), Positives = 124/286 (43%), Gaps = 52/286 (18%)
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRD----------RVSLLLPGQQMSLVTSVART 556
EAV A+ AD V+ V GL E E+ D R LP Q L+ ++ T
Sbjct: 620 LQEAVDAARNADVVVFVGGLTGDVEGEEMDVNYPGFAGGDRTDTRLPKPQRELLQALQAT 679
Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
PV+ VLT G L + +A+ + +IL YPG+ G A+ +++FG +PGGRLP+T
Sbjct: 680 GT-PVVAVLTTGSALAIDWAQ--QHVPAILLAWYPGQRGGTAVGDVLFGQASPGGRLPIT 736
Query: 617 WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
+Y E +D MR GR+YR++ G +Y FGHGL+YT ++Y L
Sbjct: 737 FYKEDERLPAFDDYAMR--------GRTYRYFDGKPLYPFGHGLAYTQFAYSNL------ 782
Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
+D T +SV N G G VV L+
Sbjct: 783 -------------------------RLDRTTVAADGTLRATVSVKNTGQRAGDEVVQLYL 817
Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
+ K+L GF R+ +E+SF + P E L I ++
Sbjct: 818 HPLNPQRERARKELRGFQRITLQPGEHREVSFNITPREALRIYDEQ 863
>gi|384430040|ref|YP_005639401.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. raphani
756C]
gi|341939144|gb|AEL09283.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. raphani
756C]
Length = 896
Score = 273 bits (697), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 159/462 (34%), Positives = 241/462 (52%), Gaps = 54/462 (11%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
P+ + + + RA L+S +TL+EK Q+ + A AIPRL +P Y+WW+E+LHG+A G
Sbjct: 39 PYLDPTQPLQARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPEYDWWNEALHGVARAG-- 96
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTF 164
T FPQ + AA+F+ L + + +A++ EARA ++ A GLTF
Sbjct: 97 --------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLARGEHKRYQGLTF 148
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
W+PNINIFRDPRWGRGQET GEDP + + V FV+ Q +
Sbjct: 149 WSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQ------------------- 189
Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIE 284
+G L A KH Y + R+ F+ +E+D +T+ P F++ ++
Sbjct: 190 --------QGPYRKLDATAKH---YAVHSGPEADRHHFDVHPSERDLYETYLPAFQALVQ 238
Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
+G + +M +YN+VNG A L R +WGF GYI SDC A+ I++ T E
Sbjct: 239 EGHVAAVMGAYNRVNGESASASTRLEGILRRDWGFDGYIVSDCAAIRDIWQNHKIVPTPE 298
Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
+AA +K G D++CG +A+ G + E IDR+L L + +LRLG+F+ P K
Sbjct: 299 AAAALGVKHGTDLDCGDTYA-ALPAAVRAGLIDEATIDRSLTRLMAARLRLGMFD-PPAK 356
Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
+ + + +H LA AR+ +VLLKND LPL K + +A++GP ++
Sbjct: 357 VPWAQTPASANQSPQHDALARRTARESLVLLKND-GLLPL-KPTLKRIAVVGPTADDPMS 414
Query: 465 MGGGYTGIPCSPKSLLRGLE--AYVSKTHYASGCHDVPCNSD 504
+ G Y G P +P ++L+G+ A ++ YA G V D
Sbjct: 415 LLGNYYGTPAAPVTILQGIRDAAPQAEVVYARGSDLVEGRED 456
Score = 121 bits (304), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 84/286 (29%), Positives = 123/286 (43%), Gaps = 52/286 (18%)
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRD----------RVSLLLPGQQMSLVTSVART 556
EAV A+ AD V+ V GL E E+ D R LP Q L+ ++ T
Sbjct: 620 LQEAVDAARNADVVVFVGGLTGDVEGEEMDVNYPGFAGGDRTDTRLPKPQRELLQALQAT 679
Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
PV+ VLT G L + +A+ + +IL YPG+ G A+ +++FG +PGGRLP+T
Sbjct: 680 GT-PVVAVLTTGSALAIDWAQ--QHVPAILLAWYPGQRGGTAVGDVLFGQASPGGRLPIT 736
Query: 617 WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
+Y E +D MR GR+YR++ G +Y FGHGL+YT ++Y L
Sbjct: 737 FYKEDERLPAFDDYAMR--------GRTYRYFDGKPLYPFGHGLAYTQFAYSNL------ 782
Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
+D T + V N G G VV L+
Sbjct: 783 -------------------------RLDRTTVAADGTLRATVWVKNTGQRAGDEVVQLYL 817
Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
+ K+L GF R+ +E+SF + P E L I ++
Sbjct: 818 HPLNPQRERARKELRGFQRITLQPGEHREVSFTITPREALRIYDEQ 863
>gi|227831319|ref|YP_002833099.1| glycoside hydrolase family protein [Sulfolobus islandicus L.S.2.15]
gi|227457767|gb|ACP36454.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
L.S.2.15]
Length = 754
Score = 272 bits (696), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 209/711 (29%), Positives = 335/711 (47%), Gaps = 123/711 (17%)
Query: 121 SVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRG 180
S T+FPQ + A+++N L +I S + + R + + Q +P +++ +DPRWGR
Sbjct: 100 SSTAFPQAIGLASTWNPELVMDIASVIRSQGR-LVGVNQC----LSPVLDVCKDPRWGRC 154
Query: 181 QETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLS 240
+ET GEDP +V++ + ++ QG+N L
Sbjct: 155 EETYGEDPYLVASMGLAYITGLQGDN-------------------------------QLV 183
Query: 241 ACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNG 300
A KH A+ + G R + ++ +TF PF ++ GK IM +Y++++G
Sbjct: 184 ATAKHFAAHGFPEGG---RNIAQVHVGNRELRETFLFPFEVAVKIGKVMSIMPAYHEIDG 240
Query: 301 VPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINC 359
+P L R EWGF G + SD D + + ++A L++G+DI
Sbjct: 241 IPCHGNPQLLTNILRQEWGFDGIVVSDYDGIRQLETIHRVASNKMEAAILALESGVDIEF 300
Query: 360 GT--CMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCT 417
T C +A+ +G V E IDRA+ + ++ RLGL + +P + P+ +
Sbjct: 301 PTIDCYGEPLVNALKEGLVPESLIDRAVERVLRIKDRLGLLD-NPFVNENSV--PEKLDD 357
Query: 418 SEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT------- 470
+ ++LAL AR+ IVLLKN+ LPL+KN V+ +A+IGP N+ M G YT
Sbjct: 358 HKSRELALKTARESIVLLKNENNILPLSKN-VNKIAVIGPNANDPRNMLGDYTYTGHLNI 416
Query: 471 --GIPCSPKSLLRGLEAYV--SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG- 525
GI ++L+G+ V SK YA GC D+ S GF EA+ IA++AD +I + G
Sbjct: 417 DSGIEIV--TVLQGIVKKVGESKVLYAKGC-DIASESKEGFAEAIEIARQADVIIAIMGE 473
Query: 526 --------LDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTG 567
+D+ + E + DR SL LPG Q L+ + +T K P+ILVL
Sbjct: 474 KSGLPLSWMDIPSKEEFKKYQAVTGEGNDRSSLRLPGVQEELLKELYKTGK-PIILVLIN 532
Query: 568 GGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM 627
G PL +S + + +++ +PGE G A+A++IFGD+NP GRLP+T +P ++P+
Sbjct: 533 GRPLVLS--SIINYVKAVIEAWFPGEEGGNAIADVIFGDYNPSGRLPIT-FPMDTGQIPL 589
Query: 628 NDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSA-PSELTISASLKAGS 686
N + S R Y + ++ FG+GLSYT + Y L P E+ G
Sbjct: 590 Y-YNRKPSSF-----RPYVMLRSSPLFTFGYGLSYTQFEYSNLEVTPKEI--------GP 635
Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
+ NI + I V N G ++G VV L+ S P
Sbjct: 636 NSNI------------------------AISIDVKNVGKMEGDDVVQLYVSKTFSSVARP 671
Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
K+L GF ++H + + F + P E L+ + R ++ G + L++G
Sbjct: 672 VKELKGFAKIHLKPGEKRRVKF-ILPTEALAFYDSFMRLVVEKGEYQLLIG 721
>gi|380696432|ref|ZP_09861291.1| beta-glucosidase [Bacteroides faecis MAJ27]
Length = 954
Score = 271 bits (694), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 226/781 (28%), Positives = 363/781 (46%), Gaps = 132/781 (16%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDN--ASAIPRLGIPAYEWWSESLHGIASNGP 111
+ + SL + R +SL++++T ++K++ + + IP L +P E++HG +
Sbjct: 170 YMDVSLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSYG-- 226
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
S T FPQ L A++N+ L + + E A N QA W+P +++
Sbjct: 227 --------SGATIFPQALAMGATWNKKLTEEVAMVIGDETVAA-NTKQA----WSPVLDV 273
Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
+D RWGR +ET GEDP++VS ++K +Q
Sbjct: 274 AQDARWGRCEETFGEDPVLVSQMGGAWIKGYQS--------------------------- 306
Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
RG L KH + G R S + ++E++ + PFR I +
Sbjct: 307 -RG----LFTTPKHFGGHGAPLGG---RDSHDIGLSEREMREIHLVPFRHAIRNYDCQSL 358
Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYT-KTHEDSAAG 349
M +Y+ GVP +L Q+ R EWGF G+I SDC A+ + ++YT K ++A
Sbjct: 359 MMAYSDYMGVPVAKSKELLQQILRQEWGFNGFIVSDCGAIGNLTARKHYTAKDKIEAANQ 418
Query: 350 VLKAGMDINCG-TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG-KY 407
L AG+ NCG T + A G++ +D+D + S R LF +P K +
Sbjct: 419 ALAAGIATNCGDTYNNKEVIQAAKDGRINMEDLDNVCRTMLSTMFRNELFEKNPCKPLDW 478
Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
K+ P + HK++A AAR+ IV+L+N + LPL+K + ++A++GP +++ G
Sbjct: 479 KKIYPG-WNSDSHKEMARQAARESIVMLENKENLLPLSK-TLRTIAVVGPGADDLQP--G 534
Query: 468 GYTG--IPCSPKSLLRGLEAYV---SKTHYASGCHDVPCNSDA-GFHEAVRIAKKADFVI 521
YT +P KS+L G+++ V +K Y GC N DA +AV+ A ++D VI
Sbjct: 535 DYTPKLLPGQLKSVLTGIKSAVGKQTKVLYEQGCDFT--NPDATNIPKAVKTASQSDVVI 592
Query: 522 VVAGLDLTQET---------EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLD 572
+V G T E E+ D +L+LPG+Q L+ +V T K PVIL+L G P D
Sbjct: 593 MVLGDCSTSEATNDVRKTCGENNDWATLILPGKQQELLEAVCATGK-PVILILQAGRPYD 651
Query: 573 VSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNM 632
+ +A +IL PG+ G A+A+++FGD+NP GRLPMT +P ++P+ N
Sbjct: 652 I--LKASEMCKAILVNWLPGQEGGPAMADVLFGDYNPAGRLPMT-FPRHVGQLPLY-YNF 707
Query: 633 RADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQ 692
+ S R+Y +Y +Y FG GLSYT++ Y L ++ ++ N+
Sbjct: 708 KT-SGRRYEYVDMEYYP---LYRFGFGLSYTSFEYSNL----------KIQEKANGNV-- 751
Query: 693 QTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIG 752
VQ +V N G G V L+ S T +L
Sbjct: 752 ----------------------EVQATVKNVGSCAGDEVAQLYVTDMYASVKTRVMELKD 789
Query: 753 FDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG----------ELRHS 802
F R+H SK +SF + P + +S+ N R++ G +M+G E++HS
Sbjct: 790 FTRIHLQPGESKTVSFEMTPYD-ISLLNDRMDRVVEKGEFKIMIGGMSPDYVAKNEIKHS 848
Query: 803 L 803
+
Sbjct: 849 V 849
>gi|397691073|ref|YP_006528327.1| beta-glucosidase [Melioribacter roseus P3M]
gi|395812565|gb|AFN75314.1| beta-glucosidase [Melioribacter roseus P3M]
Length = 923
Score = 271 bits (694), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 162/448 (36%), Positives = 244/448 (54%), Gaps = 57/448 (12%)
Query: 57 TSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFN 116
T+ S R LISL+T +EKI+QL++ A +IPRLG+ AY +W+ESLHG+ + G
Sbjct: 109 TTGSYKERLNDLISLMTTEEKIKQLTNQADSIPRLGLRAYNYWNESLHGVLAEG------ 162
Query: 117 GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPR 176
TSFPQ + A+++ L + + +AV+ EARA+ L GLT+W+P INI RDPR
Sbjct: 163 -----ATSFPQAIALGATWDPRLVNRVATAVSDEARALNRLYGKGLTYWSPTINIARDPR 217
Query: 177 WGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDE 236
WGR +E+ EDP ++S V F+K QG D
Sbjct: 218 WGRNEESYSEDPYLLSRMGVAFIKGMQG---------------------------DHPYY 250
Query: 237 LMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYN 296
L A KH IA + E+ R++ ++ + ++ + + P F+S I + +A IM +YN
Sbjct: 251 LKTVATPKHFIANNEEE----RRHTGSSDVDMRNLYEYYLPAFKSAIVEARAYSIMGAYN 306
Query: 297 QVNGVPA----CLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLK 352
++N VP+ L DL R +WGF+GY+ SDC A+ + + KT ++ A +
Sbjct: 307 ELNHVPSNANMFLMTDLL---RRQWGFEGYVVSDCGAIHDMLYGHKFFKTGAEAVARSIL 363
Query: 353 AGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP 412
AG D+NCG + A+D+G ++EKDID AL + S + RLG F+ P Y +G
Sbjct: 364 AGCDLNCGQAYREFIKDALDEGLLREKDIDSALFRVLSARFRLGEFD-PPELVPYSSIGK 422
Query: 413 DDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGI 472
D + + E+++LALDAAR+ IVLLKN+ LP++K+ + S+A+IGP N G Y+G
Sbjct: 423 DKLDSKENRRLALDAARKSIVLLKNN-DILPIDKSKIKSIAVIGP--NAREAQLGIYSGF 479
Query: 473 PCSPKSLLRGLEAYVS----KTHYASGC 496
P S L G++ + Y GC
Sbjct: 480 PNVLISPLEGIKNKADSLDIRVGYVKGC 507
Score = 122 bits (306), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 83/271 (30%), Positives = 129/271 (47%), Gaps = 41/271 (15%)
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
F +A +IA + D VI+V G+ E+ DR + LP Q LV A + +++VL
Sbjct: 658 FEKAKKIAAENDLVILVLGITPGISQEELDRKEIELPSVQRELVKQTAEVNPN-IVIVLV 716
Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
GGP V+ A A+ +I+ Y GE G +ALA+++FGD+NPGG+LP T+Y + P
Sbjct: 717 NGGP--VALAGAEKYAKAIVENWYNGEFGGQALADVLFGDYNPGGKLPQTFYASTEQLPP 774
Query: 627 MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGS 686
M+D ++ + R+Y + ++ FGHGLSYT + Y L S
Sbjct: 775 MSDYDIINNP------RTYMYLNEQALFPFGHGLSYTTFKYDSLKIVS------------ 816
Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
N L +T + +Q +TN G+ +G VV ++A P
Sbjct: 817 --NTLNETDT-----------------LSLQFRLTNVGNRNGDEVVQIYASCKDAKFKVP 857
Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLS 777
KQL F R+ SK + F + P ++L+
Sbjct: 858 RKQLKRFRRLTLQTGESKVLEFKI-PVDELA 887
>gi|323344052|ref|ZP_08084278.1| beta-glucosidase [Prevotella oralis ATCC 33269]
gi|323094781|gb|EFZ37356.1| beta-glucosidase [Prevotella oralis ATCC 33269]
Length = 779
Score = 271 bits (694), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 225/755 (29%), Positives = 345/755 (45%), Gaps = 142/755 (18%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP + E+ HG + G T FP L AA+++ + G +A
Sbjct: 128 RLGIPLF-LAEEAPHGHMAIG-----------ATVFPTGLGMAATWSTDVIEQAGVIIAK 175
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
E R Q G + P +++ +PRW R +ET GEDP++ AV VK
Sbjct: 176 EIRL-----QGGHISYGPVLDLAHEPRWSRVEETMGEDPVLSGTIAVAQVK--------- 221
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
G+G G D A KH IAY + + G S +I +
Sbjct: 222 ---GLGAG--------------DITKPFATIATLKHFIAYGIPESGQNGAPS---IIGTR 261
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCD 328
D D F PPFR I+ G A +M SYN ++G+P G L + RN+WGFKG++ SD
Sbjct: 262 DLLDNFLPPFRRAIDAG-ALSVMTSYNSMDGIPCTSNGHLLTEILRNQWGFKGFVVSDLY 320
Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
++ I+ + + +++ L+AG+D++ G A+ +G+V E ID A+L +
Sbjct: 321 SIDGIYGTHHTVSSLQEAGIEALRAGVDVDLGANAFALLCDAVRQGRVSEAAIDEAVLRI 380
Query: 389 FSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
+++ +GLF +P+ K G V T+E+ ++A A + I LLKN K LPL+
Sbjct: 381 LRMKIEMGLFEHPYVNPKTAKTG------VRTAENIQVAKRVAEESITLLKNSNKLLPLS 434
Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTG--IPCSPKSLLRGLEAYVSKTH--YASGC--HDV 499
KN +A+IGP +N M G YT + K++L G+ + +S + Y GC D
Sbjct: 435 KNI--KIAVIGPNADNRYNMLGDYTAPQQDSNVKTILDGIRSKLSPSQITYVKGCSIRDT 492
Query: 500 PCNSDAGFHEAVRIAKKADFVIVVAG----LDLT---QET----------------EDRD 536
N EAVR A++AD ++V G D QET E D
Sbjct: 493 VFNE---IGEAVRAAREADVIVVAVGGSSARDFKTSYQETGAAITSSKVVSDMESGEGFD 549
Query: 537 RVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGA 596
R SL L G Q L+ S+ T K P++++ G PLD ++ A Q ++L YPG+ G
Sbjct: 550 RASLSLMGIQSRLLQSLKETGK-PMVVIYIEGRPLDKTW--ASEQADALLTAYYPGQEGG 606
Query: 597 KALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGF 656
A+A ++FGD+NP GRLP+T P S ++P+ R +Y T +Y F
Sbjct: 607 NAIANVLFGDYNPAGRLPIT-VPRSVGQLPVYYNKKRPVV------HNYVEMASTPLYPF 659
Query: 657 GHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHV 716
G+GLSYT++ DY H++ +T + + V
Sbjct: 660 GYGLSYTSF---------------------------------DYSHLN-ITKKSEEEYEV 685
Query: 717 QISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQL 776
+ N+G+ DG V L+ S P KQL GF R+H +K I+ + + L
Sbjct: 686 SFDIRNSGERDGDEVAQLYISDKVASVVQPVKQLKGFARIHLKKGETKRITL-ILKKDDL 744
Query: 777 SIANKHGRRILPLGNHVLMVG------ELRHSLTI 805
SI +++ R++ G+ + +G L+ LTI
Sbjct: 745 SITDRNMERVVEAGDFEIQIGSSSEDIRLKAKLTI 779
>gi|298387489|ref|ZP_06997041.1| periplasmic beta-glucosidase [Bacteroides sp. 1_1_14]
gi|298259696|gb|EFI02568.1| periplasmic beta-glucosidase [Bacteroides sp. 1_1_14]
Length = 950
Score = 271 bits (693), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 225/780 (28%), Positives = 360/780 (46%), Gaps = 130/780 (16%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDN--ASAIPRLGIPAYEWWSESLHGIASNGP 111
+ + SL + R +SL++++T ++K++ + + IP L +P E++HG +
Sbjct: 166 YMDASLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSYG-- 222
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
S T FPQ L A++NR L + + E A N QA W+P +++
Sbjct: 223 --------SGATIFPQALAMGATWNRKLTEEVAMVIGDETVAA-NTKQA----WSPVLDV 269
Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
+D RWGR +ET GEDP++VS ++K +Q
Sbjct: 270 AQDARWGRCEETFGEDPVLVSQIGGAWIKGYQS--------------------------- 302
Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
RG L KH + G R S + ++E++ + PFR I +
Sbjct: 303 -RG----LFTTPKHFGGHGAPLGG---RDSHDIGLSEREMREIHLVPFRHAIRNYDCQSL 354
Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYT-KTHEDSAAG 349
M +Y+ GVP +L Q+ R EWGF G+I SDC A+ + ++YT K ++A
Sbjct: 355 MMAYSDYMGVPVAKSKELLQQILRQEWGFNGFIVSDCGAIGNLTARKHYTAKDKIEAANQ 414
Query: 350 VLKAGMDINCG-TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG-KY 407
L AG+ NCG T + A G++ +D+D + R LF +P K +
Sbjct: 415 ALAAGIATNCGDTYNNKEVIQAAKDGRINMEDLDNVCRTMLGTMFRNELFEKNPCKPLDW 474
Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
K+ P + HK++A AAR+ IV+L+N + LPL+K + ++A++GP +++ G
Sbjct: 475 KKIYPG-WNSDSHKEMARQAARESIVMLENKENLLPLSK-TLCTIAVLGPGADDLQP--G 530
Query: 468 GYTG--IPCSPKSLLRGLEAYV---SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIV 522
YT +P KS+L G++ V +K Y GC D + +AV+ A ++D VI+
Sbjct: 531 DYTPKLLPGQLKSVLTGIKGAVGKQTKVLYEQGC-DFTNPDETNIPKAVKAASQSDVVIM 589
Query: 523 VAGLDLTQET---------EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
V G T E E+ D +L+LPG+Q L+ +V T K PVIL+L G P D+
Sbjct: 590 VLGDCSTSEATNDVRKTCGENNDWATLILPGKQQELLEAVCATGK-PVILILQAGRPYDI 648
Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMR 633
+A +IL PG+ G A+A+++FGD+NP GRLPMT +P ++P+ N +
Sbjct: 649 --LKASEMCKAILVNWLPGQEGGPAMADVLFGDYNPAGRLPMT-FPRHVGQLPLY-YNFK 704
Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
S R+Y +Y +Y FG GLSYT++ Y L ++ ++ N+
Sbjct: 705 T-SGRRYEYVDMEYYP---LYRFGFGLSYTSFEYSNL----------KIQEKANGNV--- 747
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
VQ +V N G G V L+ S T +L F
Sbjct: 748 ---------------------EVQATVKNVGSRAGDEVAQLYVTDMYASVKTRVMELKDF 786
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG----------ELRHSL 803
R+H SK +SF + P + +S+ N R++ G +MVG E++HS+
Sbjct: 787 ARIHLQPGESKTVSFEMTPYD-ISLLNDRMDRVVEKGEFKIMVGGMSPDYVAKNEIKHSV 845
>gi|325929067|ref|ZP_08190221.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas perforans
91-118]
gi|325540562|gb|EGD12150.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas perforans
91-118]
Length = 850
Score = 271 bits (693), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 157/444 (35%), Positives = 237/444 (53%), Gaps = 45/444 (10%)
Query: 72 LTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVS 131
+TL+EK Q+ + A AIPRLG+PAY+WW+E+LHG+A G T FPQ +
Sbjct: 1 MTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG----------GATVFPQAIGM 50
Query: 132 AASFNRSLWSNIGSAVAVEARAMYN--LGQ------AGLTFWAPNINIFRDPRWGRGQET 183
AA+F+ L + +A++ EARA ++ L Q GLTFW+PNINIFRDPRWGRGQET
Sbjct: 51 AATFDLPLMHEVATAISDEARAKHHQFLRQNQHARYQGLTFWSPNINIFRDPRWGRGQET 110
Query: 184 PGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACC 243
GEDP + + V FV+ QGE + G +R+ L A
Sbjct: 111 YGEDPFLTARMGVTFVQGLQGEGADAPKNAQGEPYRK------------------LDATA 152
Query: 244 KHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPA 303
KH + + R+ F+A +++D +T+ P F + ++ GK +M +YN+V G A
Sbjct: 153 KHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGESA 209
Query: 304 CLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTC 362
L Q R +WGFKGY+ SDC A+ I+++ T E +AA +K G ++ CG
Sbjct: 210 SASKFLLQDVLRQQWGFKGYVVSDCWAIVDIWKHHKIVATREQAAALAVKHGTELECGE- 268
Query: 363 MLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKK 422
+A+ +G + E ID AL L + ++RLG+F+ P + + + + H
Sbjct: 269 EYSTLPAAVRQGLIDEAQIDTALTTLMTARMRLGMFD-PPGQLPWSTIPASVNQSPAHDA 327
Query: 423 LALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRG 482
LA AR+ +VLLKND LPL++ + +A+IGP ++ + G Y G P +P ++L+G
Sbjct: 328 LARRTARESLVLLKND-GLLPLSRAKLKRIAVIGPTADDTMALLGNYYGTPAAPVTVLQG 386
Query: 483 LEAYV--SKTHYASGCHDVPCNSD 504
+ A ++ YA G V D
Sbjct: 387 IRAAAPNAQVLYARGADLVEGRDD 410
Score = 130 bits (328), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 93/282 (32%), Positives = 131/282 (46%), Gaps = 52/282 (18%)
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVART 556
EA+ +A+ AD V+ V GL E E+ DR L LP Q L+ ++ T
Sbjct: 574 LQEALDVARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEALQAT 633
Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
K PV+ VLT G L + +A+ + +IL YPG+ G A+A+ +FGD NPGGRLP+T
Sbjct: 634 GK-PVVAVLTTGSALAIDWAQ--QHLPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVT 690
Query: 617 WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
+Y ES T +D MR GR+YR++ GT +Y FGHGLSYT ++Y
Sbjct: 691 FYKESETLPAFDDYAMR--------GRTYRYFGGTPLYPFGHGLSYTQFAY--------- 733
Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
+G RLD I S T+ ++V N G G VV L+
Sbjct: 734 -----------------SGLRLDRTTIAADGSLTAT-----VTVKNTGQRAGDEVVQLYL 771
Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
+ K+L GF R+ + + F +D L I
Sbjct: 772 HPLTPQRERAGKELHGFQRIALQPGEQRALHFTLDAKNALRI 813
>gi|29347190|ref|NP_810693.1| beta-glucosidase [Bacteroides thetaiotaomicron VPI-5482]
gi|29339089|gb|AAO76887.1| periplasmic beta-glucosidase precursor [Bacteroides
thetaiotaomicron VPI-5482]
Length = 950
Score = 271 bits (693), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 225/780 (28%), Positives = 359/780 (46%), Gaps = 130/780 (16%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDN--ASAIPRLGIPAYEWWSESLHGIASNGP 111
+ + SL + R +SL++++T ++K++ + + IP L +P E++HG +
Sbjct: 166 YMDASLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSYG-- 222
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
S T FPQ L A++NR L + + E A N QA W+P +++
Sbjct: 223 --------SGATIFPQALAMGATWNRKLTEEVAMVIGDETVAA-NTKQA----WSPVLDV 269
Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
+D RWGR +ET GEDP++VS ++K +Q
Sbjct: 270 AQDARWGRCEETFGEDPVLVSQIGGAWIKGYQS--------------------------- 302
Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
RG L KH + G R S + ++E++ + PFR I +
Sbjct: 303 -RG----LFTTPKHFGGHGAPLGG---RDSHDIGLSEREMREIHLVPFRHAIRNYDCQSL 354
Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYT-KTHEDSAAG 349
M +Y+ GVP +L Q+ R EWGF G+I SDC A+ + ++YT K ++A
Sbjct: 355 MMAYSDYMGVPVAKSKELLQQILRQEWGFNGFIVSDCGAIGNLTARKHYTAKDKIEAANQ 414
Query: 350 VLKAGMDINCG-TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG-KY 407
L AG+ NCG T + A G++ +D+D + R LF +P K +
Sbjct: 415 ALAAGIATNCGDTYNNKEVIQAAKDGRINMEDLDNVCRTMLGTMFRNELFEKNPCKPLDW 474
Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
K+ P + HK++A AAR+ IV+L+N LPL+K + ++A++GP +++ G
Sbjct: 475 KKIYPG-WNSDSHKEMARQAARESIVMLENKDNLLPLSK-TLRTIAVLGPGADDLQP--G 530
Query: 468 GYTG--IPCSPKSLLRGLEAYV---SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIV 522
YT +P KS+L G++ V +K Y GC D + +AV+ A ++D VI+
Sbjct: 531 DYTPKLLPGQLKSVLTGIKGAVGKQTKVLYEQGC-DFTNPDETNIPKAVKAASQSDVVIM 589
Query: 523 VAGLDLTQET---------EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
V G T E E+ D +L+LPG+Q L+ +V T K PVIL+L G P D+
Sbjct: 590 VLGDCSTSEATNDVRKTCGENNDWATLILPGKQQELLEAVCATGK-PVILILQAGRPYDI 648
Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMR 633
+A +IL PG+ G A+A+++FGD+NP GRLPMT +P ++P+ N +
Sbjct: 649 --LKASEMCKAILVNWLPGQEGGPAMADVLFGDYNPAGRLPMT-FPRHVGQLPLY-YNFK 704
Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
S R+Y +Y +Y FG GLSYT++ Y L ++ ++ N+
Sbjct: 705 T-SGRRYEYVDMEYYP---LYRFGFGLSYTSFEYSNL----------KIQEKANGNV--- 747
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
VQ +V N G G V L+ S T +L F
Sbjct: 748 ---------------------EVQATVKNVGSRAGDEVAQLYVTDMYASVKTRVMELKDF 786
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG----------ELRHSL 803
R+H SK +SF + P + +S+ N R++ G +MVG E++HS+
Sbjct: 787 ARIHLQPGESKTVSFEMTPYD-ISLLNDRMDRVVEKGEFKIMVGGMSPDYVAKNEIKHSV 845
>gi|399025517|ref|ZP_10727513.1| beta-glucosidase-like glycosyl hydrolase [Chryseobacterium sp.
CF314]
gi|398077894|gb|EJL68841.1| beta-glucosidase-like glycosyl hydrolase [Chryseobacterium sp.
CF314]
Length = 875
Score = 271 bits (693), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 164/474 (34%), Positives = 238/474 (50%), Gaps = 61/474 (12%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
YPF N +L + R ++L+ LLT+ EKI + DN+ A+PRL IPAY WW+E+LHG+A G
Sbjct: 23 YPFRNPNLPVEQRIENLLGLLTVDEKIGMMMDNSKAVPRLEIPAYGWWNEALHGVARAG- 81
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGL 162
+ T FPQ + AA+++ ++ EARA YN GL
Sbjct: 82 ---------TATVFPQAIGMAAAWDVPEHLKTFEMISDEARAKYNKSFDEASKTGRYEGL 132
Query: 163 TFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKR 222
TFW PNINIFRDPRWGRGQET GEDP + S V VK QG + K
Sbjct: 133 TFWTPNINIFRDPRWGRGQETYGEDPYLTSVLGVAAVKGLQGNDPKY------------- 179
Query: 223 VLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSC 282
AC KH + +W +R+S+NA ++++D +T+ P F+S
Sbjct: 180 --------------FKTHACAKHFAVHSGPEW---NRHSYNAEVSKRDLYETYLPAFKSL 222
Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTK 341
+ +G +MC+YN +G P C L + R +W + G + SDC A+A ++ + Y
Sbjct: 223 VLEGNVREVMCAYNAFDGQPCCASNTLLNEILRGKWKYDGMVVSDCWALADFYQ-EKYHG 281
Query: 342 THED---SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
TH D +AA LK D+ CG + ++ G + EKDID ++ + LG+
Sbjct: 282 THPDEKSTAADALKHSTDLECGDT-YNNLNKSLAGGLITEKDIDISMRRILKGWFELGML 340
Query: 399 NGDPRKGKYGKLGPDDVCTS-EHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
DP+ P V S EHKK AL A++ IVL+KN+ LP NKN + +A++GP
Sbjct: 341 --DPKSSVLWNQIPYSVVDSDEHKKQALKMAQKSIVLMKNENNILPFNKN-IKKIAVVGP 397
Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGCHDVPCNSDAGFHE 509
++ G Y G P S ++L G++A T Y G +S A ++
Sbjct: 398 NADDEMMQLGNYNGTPSSIVTILEGIKAKFPNTEIIYEKGSEVADPSSRASLYQ 451
Score = 95.9 bits (237), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 78/297 (26%), Positives = 121/297 (40%), Gaps = 47/297 (15%)
Query: 486 YVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR---------- 535
+ K Y S +V F K AD ++ GL + E E+
Sbjct: 571 HTQKGKYVSISFEVYRKDPVNFASVKEKVKDADVIVFAGGLSPSLEGEEMLVNAEGFKGG 630
Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
D+ S+ LP Q L+ + +T K PV+ VL G L + E + + W G G++G
Sbjct: 631 DKTSIELPKVQRELLAELRKTGK-PVVFVLCTGSSLGLEQDEKNYDVLLNAWYG--GQSG 687
Query: 596 AKALAEIIFGDFNPGGRLPMTWYP--ESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQV 653
A+A+++ GD+NP GRLP+T+Y E + + GR+YR+ T +
Sbjct: 688 GTAVADVLAGDYNPSGRLPVTFYKNLEQLDNALSKTSKHQGFENYDMQGRTYRYMTENPL 747
Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
Y FGHGLSY+ ++Y G+ K D +
Sbjct: 748 YAFGHGLSYSKFNY-----------------GNAKLSKNSISPNEDII------------ 778
Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGV 770
+ + VTN D DG VV ++ + P K L F+RV +K +K I +
Sbjct: 779 --ITVPVTNISDRDGEEVVQVYVKRNN-DVLAPVKTLRAFERVLIRSKETKNIQLTI 832
>gi|441498970|ref|ZP_20981160.1| Beta-glucosidase [Fulvivirga imtechensis AK7]
gi|441437215|gb|ELR70569.1| Beta-glucosidase [Fulvivirga imtechensis AK7]
Length = 752
Score = 271 bits (693), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 217/689 (31%), Positives = 335/689 (48%), Gaps = 117/689 (16%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP + ++ +HG T FP L SAAS++ A+
Sbjct: 104 RLGIPLL-FGADVIHGFK---------------TVFPIPLASAASWDLEAIEKAERVAAI 147
Query: 150 EARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
E+ A AG+ F +AP ++I RDPRWGR E GEDP + S A V+ FQ
Sbjct: 148 ESTA------AGINFNFAPMVDISRDPRWGRIAEGAGEDPFLGSEVAKARVRGFQ----- 196
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-IT 267
E D ++AC KH AY G +N + ++
Sbjct: 197 ---------------------EQSLTDPQTMAACVKHFAAYGAPDGGR----DYNTVDMS 231
Query: 268 EQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSD 326
E+ + + PP+++ I+ G A+ IM S+N++NG+ A L + R EWGFKG + SD
Sbjct: 232 ERLLREMYLPPYKAGIDAG-AATIMTSFNELNGIAASGSQFLLRDILRKEWGFKGMVVSD 290
Query: 327 CDAVATIFEYQNYTKTHEDSAAGVLKAGMDINC-GTCMLRHTQSAIDKGKVQEKDIDRAL 385
+V + + + ++A LKAG+D++ G L +++GK+ K +D A+
Sbjct: 291 WQSVNEMVAH-GNAANNAEAAMMALKAGVDMDMMGDVYLEEVPRLVNEGKLDIKFVDEAV 349
Query: 386 LNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
N+ ++ LGLF+ DP + +++ EH + A D A++ IVLLKN +K LPL
Sbjct: 350 RNVLKLKYDLGLFD-DPYRYSDTIREKNNIRAVEHLEAARDVAKKSIVLLKNKEKLLPLK 408
Query: 446 KNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLRGLEAYVS---KTHYASGCHDVP 500
K ++ ++A+IGPL +N + M G ++ G P + L+G++ VS + YA GC+
Sbjct: 409 K-SIGTIAVIGPLADNQADMNGTWSFFGEAQHPITFLQGIKDAVSGQSRVLYAEGCNLYD 467
Query: 501 CNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRP 560
+ D F EAV IAKKAD VI+ G E R + LPG Q LV +A+T K P
Sbjct: 468 RSKDK-FAEAVNIAKKADVVILAVGESAVMNGEAGSRSDIRLPGIQPELVMEIAKTGK-P 525
Query: 561 VILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPE 620
V+ ++ G PLD+S+ D I +IL + G A A+++FGD+NP G+LP+T +P
Sbjct: 526 VVALVMSGRPLDLSW--LDENIPAILEVWTLGSEAGNAAADVLFGDYNPSGKLPVT-FPR 582
Query: 621 SFTKVPMNDMNMRADSSRQYPG------------RSYRFYTGTQVYGFGHGLSYTNYSYK 668
+ +VP+ ++ R Y G YR + +Y FG+GLSY+ + Y
Sbjct: 583 NVGQVPI--YYNHKNTGRPYEGDYSEPLSERIYRSKYRDVQNSPLYPFGYGLSYSTFEY- 639
Query: 669 FLSAPSELTISA-SLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVD 727
S++T+SA +L AG + +T+ +S+TN G D
Sbjct: 640 -----SDITLSADTLNAG------------------ESITAS--------VSITNEGPYD 668
Query: 728 GSHVVMLFARVPKVSQGTPEKQLIGFDRV 756
G VV L+ R S P K+L GF ++
Sbjct: 669 GEEVVQLYIRDLVGSVTRPVKELKGFKKL 697
>gi|198425902|ref|XP_002120563.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 996
Score = 271 bits (692), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 198/594 (33%), Positives = 294/594 (49%), Gaps = 81/594 (13%)
Query: 64 RAKSLISLLTLQEKIQQLS-----DN--ASAIPRLGIPAYEWWSESLHGIASNGPGVNFN 116
R+ S +S LT++E + Q+S DN A AI RLGI Y+W +E L G A NG
Sbjct: 450 RSFSEVSRLTVKELVLQISRGGAGDNGPAPAITRLGIGPYQWNTECLRGYAMNG------ 503
Query: 117 GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL--------GQAGLTFWAPN 168
T FPQ + AA+F++ L + +A+EARA +N GL+ ++P
Sbjct: 504 ----DATCFPQPIGLAATFDQGLIYKLAKTIALEARAKHNNFTKNGNFGDHTGLSCFSPV 559
Query: 169 INIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFG 228
INI R P WGR QET GEDP++ S A +V QG
Sbjct: 560 INILRHPLWGRNQETFGEDPVMSSLMARAYVTGLQG------------------------ 595
Query: 229 EESDRGDELML--SACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQG 286
DE+ +A CKH AYD + SR SFNA ++ +D T+ P FR C+ G
Sbjct: 596 ------DEIYYPATANCKHFAAYDGPENIPSSRLSFNANVSIEDLGRTYFPAFRECVHSG 649
Query: 287 KASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDS 346
A I+CSYN +NG PAC L R+++ FKGY++SD A+ Y YTK++ S
Sbjct: 650 -AFGIVCSYNAINGEPACASSYLQTILRDKFNFKGYVSSDESAIEFFDIYFKYTKSNLLS 708
Query: 347 AAGVLKAGMDINCGTCMLRHTQS----AIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDP 402
A AG+D+ + + S A+++G V E + R+ LF ++ LG F DP
Sbjct: 709 AVAAFDAGVDLELTSYGKNNRYSLLNQAVEQGLVTEAALRRSAKRLFRTRMALGEF--DP 766
Query: 403 RKGKYGKLGPDDVCTS-EHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
++ + P DV S H+K A++ A + VLLKND LPLN + ++AI+GP +NN
Sbjct: 767 QEFNHWLNVPIDVVQSLAHRKQAVEVAAKSFVLLKND-GVLPLN-HRYENVAIVGPFINN 824
Query: 462 ISQMGGGY-----TGIPCSPKSLLRGLEAYVSKTHYASGC-----HDVPCNSDAGFHEAV 511
+ G Y SP L + + +GC ++P +
Sbjct: 825 SEALTGDYHPNYNLKYFSSPLFAANSLSSS-GVARFTTGCVGTNNQNLPICATYNSTHVK 883
Query: 512 RIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPL 571
+ +D V+V G E E RDR+S+ LPG+Q+ ++ V + + PVI+VL GPL
Sbjct: 884 EVVTGSDIVLVTLGTGTGIEAEGRDRLSMELPGKQLDMIKDVVKYANGPVIVVLFNAGPL 943
Query: 572 DVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV 625
DVS+ ++ ++++ + + +A+ E++ G NP GRLP TW P S +V
Sbjct: 944 DVSWVMGNT--AAVIACHFSAQMTGEAMLEVLTGIVNPAGRLPNTW-PASMQQV 994
>gi|189464325|ref|ZP_03013110.1| hypothetical protein BACINT_00666 [Bacteroides intestinalis DSM
17393]
gi|189438115|gb|EDV07100.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 935
Score = 271 bits (692), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 220/767 (28%), Positives = 353/767 (46%), Gaps = 120/767 (15%)
Query: 51 SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDN--ASAIPRLGIPAYEWWSESLHGIAS 108
S + + +L + R +SL+S++T ++K++ + + IP L +P E++HG +
Sbjct: 148 SLRYMDPTLPVEERVESLLSVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSY 206
Query: 109 NGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPN 168
S T FPQ L A++N+ L + AV E L + W+P
Sbjct: 207 G----------SGATIFPQALAMGATWNKKLTEEVAMAVGDET-----LSAGTMQAWSPV 251
Query: 169 INIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFG 228
+++ +D RWGR +ET GEDP++VS ++K +Q
Sbjct: 252 LDVAQDARWGRCEETFGEDPVLVSQIGGAWIKGYQS------------------------ 287
Query: 229 EESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKA 288
+ L KH + G R S + ++E++ + PFR I
Sbjct: 288 --------MGLYTTPKHFGGHGAPLGG---RDSHDIGLSEREMREVHLVPFRHVIRNYDC 336
Query: 289 SCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSA 347
+M +Y+ GVP +L R EWGF G+I SDC A+ + ++YT ++ A
Sbjct: 337 QSLMMAYSDFLGVPVAKSRELLHNILREEWGFSGFIVSDCGAIGNLTARKHYTAKNKIEA 396
Query: 348 AG-VLKAGMDINCG-TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG 405
A L AG+ NCG T + A G++ +++D + + R LF P K
Sbjct: 397 ANQALAAGIATNCGDTYNDKEVIQAAKDGRINMENLDEVCRTMLRMMFRNELFEKAPNKP 456
Query: 406 -KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
+ K+ P + HK++A AAR+ IVLL+N LPL+K+ + ++A++GP N++
Sbjct: 457 LDWNKIYPG-WNSDSHKEMARQAARESIVLLENKDNILPLSKD-MRTIAVLGPGANDLQP 514
Query: 465 MGGGYTGI--PCSPKSLLRGLEAYV---SKTHYASGCHDVPCNSDAGFHEAVRIAKKADF 519
G YT P KS+L G++ V +K Y GC D + +AV++A ++D
Sbjct: 515 --GDYTPKLQPGQLKSVLTGIKQAVGKQTKVIYEQGC-DFTSLGENNIAKAVKVASQSDV 571
Query: 520 VIVVAGLDLTQET---------EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGP 570
V++V G T E E+ D +L+LPG+Q L+ +V T K PVIL+L G P
Sbjct: 572 VLLVLGDCSTSEATTDVYKTSGENHDYATLILPGKQQELLEAVCATGK-PVILILQAGRP 630
Query: 571 LDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDM 630
++S A + + W+ PG+ G A A+++FGD+NP GRLPMT +P ++P+
Sbjct: 631 YNLSKASELCKAILVNWL--PGQEGGPATADVLFGDYNPAGRLPMT-FPRHVGQLPLY-Y 686
Query: 631 NMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNI 690
N + S R+Y +Y +Y FG+GLSYT++ Y L ++ + NI
Sbjct: 687 NFKT-SGRRYEYSDMEYYP---LYYFGYGLSYTSFEYSGL----------KIQEKENGNI 732
Query: 691 LQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQL 750
VQ +V N G G VV L+ S T +L
Sbjct: 733 T------------------------VQATVKNIGQRAGDEVVQLYVTDMYASVKTRITEL 768
Query: 751 IGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
F R+H +K +SF + P E LS+ N H R++ G ++VG
Sbjct: 769 KDFTRIHLKPGEAKTVSFELTPYE-LSLLNDHMDRVVEKGAFKILVG 814
>gi|393782428|ref|ZP_10370612.1| hypothetical protein HMPREF1071_01480 [Bacteroides salyersiae
CL02T12C01]
gi|392673256|gb|EIY66719.1| hypothetical protein HMPREF1071_01480 [Bacteroides salyersiae
CL02T12C01]
Length = 596
Score = 270 bits (691), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 193/650 (29%), Positives = 310/650 (47%), Gaps = 93/650 (14%)
Query: 162 LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREK 221
+T+W+PN+NIFRDPRWGRGQET GEDP + + +V+ QG +
Sbjct: 1 MTYWSPNVNIFRDPRWGRGQETYGEDPYLTAEIGKAYVRGLQG--------------NDP 46
Query: 222 RVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRS 281
LK +AC KH Y + R+ FNA +++D +T+ P F +
Sbjct: 47 FFLKA-------------AACAKH---YAVHSGPEALRHEFNASPSKRDLFETYLPAFEA 90
Query: 282 CIEQGKASCIMCSYNQVNGVPACLRGDLF---QKARNEWGFKGYITSDCDAVATIFEYQN 338
+++ K +M +YN+V G A G F R +WGFKG++ SDC AV I+
Sbjct: 91 LVKEAKVEAVMGAYNRVYGESAS--GSFFLLTDILRKKWGFKGHVVSDCGAVDDIYGGHK 148
Query: 339 YTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
K +++A LK+G+++NCG + A+++ + E D+D AL+ L +L+LG
Sbjct: 149 IAKDVAEASAIALKSGLNLNCGGS-FHALKEALERKLITEVDLDNALMPLMMTRLKLGNL 207
Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
D + Y + + + H +A + A++ +VLLKN+ LPL K+ V ++ + GP
Sbjct: 208 TDD-DESPYKNISDSVIASYTHAMVAREVAQKSMVLLKNNNHTLPLKKD-VKTIFVTGPY 265
Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCHDVPCN---SDAGFHEAVR 512
+ M G Y G+ + L+G+ A VS +Y G N +D E VR
Sbjct: 266 AADTYVMMGNYYGVSPRSNTFLQGIAAKVSGGTSINYKIGILPTTPNMNPADWTVGE-VR 324
Query: 513 IAKKADFVIVVAGLDLTQE------TEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
A+ A VI ++G+D +E + D+ +L LP Q+ + ++R ++ V+T
Sbjct: 325 AAEVAIVVIGLSGIDEGEEGDAIASSHRGDKQNLKLPEHQLKFLRDISRNRWNKLVTVIT 384
Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
GG P+D+ S + W YPG+ G AL +++FGD + GR+P+T+ P
Sbjct: 385 GGSPIDLEEVSELSDAVIMAW--YPGQEGGMALGDLLFGDVSFSGRMPVTF--------P 434
Query: 627 MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGS 686
+N + A GR+Y++ T +Y FG+GL+Y + SY S
Sbjct: 435 INSDWLPAFEDYNMQGRTYKYMTDNIMYPFGYGLTYGDVSY------------------S 476
Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
D IL Y E+ HVQ ++ N G+ + VV L+ P TP
Sbjct: 477 DVKILNPK-----YDGKQEI--------HVQATLRNNGNNEVEEVVQLYLSAPGAGVITP 523
Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
LIGF RV + S+ + F + P +QL + + G + L G + ++V
Sbjct: 524 ISSLIGFKRVTLESHLSQTVEFIIKP-DQLKMVMEDGSKNLLKGKYTIIV 572
>gi|383125188|ref|ZP_09945842.1| hypothetical protein BSIG_4348 [Bacteroides sp. 1_1_6]
gi|382983435|gb|EES66611.2| hypothetical protein BSIG_4348 [Bacteroides sp. 1_1_6]
Length = 954
Score = 270 bits (691), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 225/780 (28%), Positives = 359/780 (46%), Gaps = 130/780 (16%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDN--ASAIPRLGIPAYEWWSESLHGIASNGP 111
+ + SL + R +SL++++T ++K++ + + IP L +P E++HG +
Sbjct: 170 YMDASLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSYG-- 226
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
S T FPQ L A++NR L + + E A N QA W+P +++
Sbjct: 227 --------SGATIFPQALAMGATWNRKLTEEVAMVIGDETVAA-NTKQA----WSPVLDV 273
Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
+D RWGR +ET GEDP++VS ++K +Q
Sbjct: 274 AQDARWGRCEETFGEDPVLVSQIGGAWIKGYQS--------------------------- 306
Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
RG L KH + G R S + ++E++ + PFR I +
Sbjct: 307 -RG----LFTTPKHFGGHGAPLGG---RDSHDIGLSEREMREIHLVPFRHAIRNYDCQSL 358
Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYT-KTHEDSAAG 349
M +Y+ GVP +L Q+ R EWGF G+I SDC A+ + ++YT K ++A
Sbjct: 359 MMAYSDYMGVPVAKSKELLQQILRQEWGFNGFIVSDCGAIGNLTARKHYTAKDKIEAANQ 418
Query: 350 VLKAGMDINCG-TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG-KY 407
L AG+ NCG T + A G++ +D+D + R LF +P K +
Sbjct: 419 ALAAGIATNCGDTYNNKEVIQAAKDGRINMEDLDNVCRTMLGTMFRNELFEKNPCKPLDW 478
Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
K+ P + HK++A AAR+ IV+L+N LPL+K + ++A++GP +++ G
Sbjct: 479 KKIYPG-WNSDSHKEMARQAARESIVMLENKDNLLPLSK-TLRTIAVLGPGADDLQP--G 534
Query: 468 GYTG--IPCSPKSLLRGLEAYV---SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIV 522
YT +P KS+L G++ V +K Y GC D + +AV+ A ++D VI+
Sbjct: 535 DYTPKLLPGQLKSVLTGIKGAVGKQTKVLYEQGC-DFTNPDETNIPKAVKAASQSDVVIM 593
Query: 523 VAGLDLTQET---------EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
V G T E E+ D +L+LPG+Q L+ +V T K PVIL+L G P D+
Sbjct: 594 VLGDCSTSEATNDVRKTCGENNDWATLILPGKQQELLEAVCATGK-PVILILQAGRPYDI 652
Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMR 633
+A +IL PG+ G A+A+++FGD+NP GRLPMT +P ++P+ N +
Sbjct: 653 --LKASEMCKAILVNWLPGQEGGPAMADVLFGDYNPAGRLPMT-FPRHVGQLPLY-YNFK 708
Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
S R+Y +Y +Y FG GLSYT++ Y L ++ ++ N+
Sbjct: 709 T-SGRRYEYVDMEYYP---LYRFGFGLSYTSFEYSNL----------KIQEKANGNV--- 751
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
VQ +V N G G V L+ S T +L F
Sbjct: 752 ---------------------EVQATVKNVGSRAGDEVAQLYVTDMYASVKTRVMELKDF 790
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG----------ELRHSL 803
R+H SK +SF + P + +S+ N R++ G +MVG E++HS+
Sbjct: 791 ARIHLQPGESKTVSFEMTPYD-ISLLNDRMDRVVEKGEFKIMVGGMSPDYVAKNEIKHSV 849
>gi|116312033|emb|CAJ86398.1| OSIGBa0125M19.1 [Oryza sativa Indica Group]
Length = 247
Score = 270 bits (691), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 139/237 (58%), Positives = 167/237 (70%), Gaps = 28/237 (11%)
Query: 43 PCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSES 102
PC P +YPFCN +L RA++L+SLLTL EKI QLS+ A+ PRLG+P +EWWSES
Sbjct: 29 PCASPAASAYPFCNATLPFPARARALVSLLTLDEKIAQLSNTAAGAPRLGVPPFEWWSES 88
Query: 103 LHGIASNGPGVNFN-GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG 161
LHG+ NGPGVNF+ G V S T FPQV++SAA+FNRSLW A+AVEARAM+N GQAG
Sbjct: 89 LHGVCDNGPGVNFSSGPVRSATIFPQVILSAAAFNRSLWRAAARAIAVEARAMHNAGQAG 148
Query: 162 LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREK 221
LTFWAPNIN+FRDPRWGRGQETPGEDP VVSAY+VE+VK FQ
Sbjct: 149 LTFWAPNINVFRDPRWGRGQETPGEDPAVVSAYSVEYVKGFQ------------------ 190
Query: 222 RVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE--QDTEDTFQ 276
+ +GEE +MLSACCKH IAYDLEKW F+RY+FNA + ++ T+Q
Sbjct: 191 ---RDYGEEG----RMMLSACCKHYIAYDLEKWRGFTRYTFNAKVMNVLVPSQSTYQ 240
>gi|86143269|ref|ZP_01061671.1| beta-glucosidase precursor [Leeuwenhoekiella blandensis MED217]
gi|85830174|gb|EAQ48634.1| beta-glucosidase precursor [Leeuwenhoekiella blandensis MED217]
Length = 873
Score = 270 bits (689), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 160/457 (35%), Positives = 237/457 (51%), Gaps = 56/457 (12%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
+PF N L + TR L+S +TL+EKI QL +A AI RL IP Y WW+ESLHG+A G
Sbjct: 24 FPFQNEQLDLETRLNDLVSRMTLEEKISQLMSDAPAIERLNIPKYNWWNESLHGVARAG- 82
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARA---------MYNLGQAGL 162
T FPQ + AAS++ L + +A++ EARA +++ Q GL
Sbjct: 83 ---------YATVFPQSISIAASWDAQLVREVATAISDEARAKHHEYLRRDQHDIYQ-GL 132
Query: 163 TFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKR 222
T W+PNINIFRDPRWGRG ET GEDP + ++VK QG
Sbjct: 133 TMWSPNINIFRDPRWGRGHETYGEDPFLTGTLGAQYVKGLQG------------------ 174
Query: 223 VLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSC 282
D + L + A KH + + SR+ F+A +E+D +T+ P FR
Sbjct: 175 ---------DDPEYLKVVATAKHFAVHSGPE---ESRHYFDANTSERDLWETYLPAFRML 222
Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
++ + +M +YN+ G A LF RN+WGF GY+ SDC A+ I+E T
Sbjct: 223 VKDAQVQSVMTAYNRFRGEAASSNKLLFDILRNKWGFDGYVVSDCGAINDIWEDHKITAD 282
Query: 343 HEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDP 402
++A L+ G D+NCG + + AI G + E+ I+ A+ LF +L+LG+F+ +
Sbjct: 283 AASASALALETGTDLNCG-ATYKSLKEAIANGLITEEKINIAIERLFRARLKLGMFDTE- 340
Query: 403 RKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNI 462
Y + + H LA AA++ IVLLKN+ LPL+K+ + +A+IGP +N+
Sbjct: 341 ENLSYATIPFSVNTNASHTALARKAAQESIVLLKNEAHMLPLSKD-LKQIAVIGPNAHNV 399
Query: 463 SQMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGC 496
+ G Y G P +P ++++G+ V+ K Y G
Sbjct: 400 QSLWGNYNGTPKNPVTVVQGIRNKVAPQVKVVYEEGA 436
Score = 150 bits (380), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 97/302 (32%), Positives = 154/302 (50%), Gaps = 56/302 (18%)
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVART 556
AV +A+ +D I+V GL+ E E+ DR +L LP +Q L+ ++ T
Sbjct: 589 LERAVNLAEDSDVTILVLGLNERLEGEEMRIDVEGFSKGDRTALDLPLEQRELMRALVAT 648
Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
K P++LVL G L +++A+ + +IL GYPG+ G A+A+++FGD+NP GRLP+T
Sbjct: 649 GK-PIVLVLLNGSALAINYAQ--EHVPAILSAGYPGQEGGNAIADVLFGDYNPAGRLPVT 705
Query: 617 WYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSE 675
+Y +S +P D +M+ GR+YR++ G +Y FG+GLSYT +SY
Sbjct: 706 YY-KSVDDLPDFEDYSMK--------GRTYRYFEGEALYPFGYGLSYTQFSYD------- 749
Query: 676 LTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLF 735
I S + +DK + +VQ++VTN+GD DG VV L+
Sbjct: 750 -AIKTSGRLAADKVL------------------------NVQVTVTNSGDRDGDEVVQLY 784
Query: 736 ARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLM 795
+ S P+ QL+GF R+H ++ + F +D Q S+ N + ++ G L
Sbjct: 785 LKDEVASTTRPQVQLVGFKRIHLQKGETQTVEFRLD-ARQFSMINDQEQLVVEPGWFTLY 843
Query: 796 VG 797
G
Sbjct: 844 AG 845
>gi|427384392|ref|ZP_18880897.1| hypothetical protein HMPREF9447_01930 [Bacteroides oleiciplenus YIT
12058]
gi|425727653|gb|EKU90512.1| hypothetical protein HMPREF9447_01930 [Bacteroides oleiciplenus YIT
12058]
Length = 954
Score = 270 bits (689), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 219/764 (28%), Positives = 354/764 (46%), Gaps = 120/764 (15%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDN--ASAIPRLGIPAYEWWSESLHGIASNGP 111
+ + +L + R +SL+S++T ++K++ + + IP L +P E++HG +
Sbjct: 170 YMDPTLPVEERVESLLSVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSYG-- 226
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
S T FPQ L A++N+ L I AV E L + W+P +++
Sbjct: 227 --------SGATIFPQALAMGATWNKKLTEEIAMAVGDET-----LAAGTMQAWSPVLDV 273
Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
+D RWGR +ET GEDP++VS ++K +Q +
Sbjct: 274 AQDARWGRCEETFGEDPVLVSQIGGAWIKGYQSKG------------------------- 308
Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
L KH + G R S + ++E++ + PFR I +
Sbjct: 309 -------LFTTPKHFGGHGAPLGG---RDSHDIGLSEREMREVHLVPFRHVIRNYDCQSL 358
Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYT-KTHEDSAAG 349
M +Y+ GVP +L R EWGF G+I SDC A+ + ++YT K ++A
Sbjct: 359 MMAYSDFLGVPVAKSKELLHNILREEWGFDGFIVSDCGAIGNLTARKHYTAKDKIEAANQ 418
Query: 350 VLKAGMDINCG-TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG-KY 407
L AG+ NCG T + A G++ +++D + + R LF P K +
Sbjct: 419 ALAAGIATNCGDTYNDKEVIQAAKDGRLNMENLDNVCRTMLRMMFRNELFEKAPNKPLDW 478
Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
K+ P + HK++A AAR+ IV+L+N + LPL+K + S+A++GP +++ G
Sbjct: 479 NKIYPG-WNSDNHKEMARQAARESIVMLENKENILPLDK-GIRSIAVLGPGADDLQP--G 534
Query: 468 GYTG--IPCSPKSLLRGLEAYV---SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIV 522
YT +P KS+L G++ V +K Y GC D S+ +AV+ A ++D V++
Sbjct: 535 DYTPKLLPGQLKSVLTGIKQAVGKQTKVIYEQGC-DFTNLSETNIPKAVKAASQSDVVVM 593
Query: 523 VAGLDLTQET---------EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
V G T E E+ D +L+LPG+Q L+ +V T K PVILVL G P ++
Sbjct: 594 VLGDCSTSEATTDVYKTSGENHDYATLILPGKQQELLEAVCATGK-PVILVLQAGRPYNL 652
Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMR 633
+ A + + W+ PG+ G A A+++FGD+NP GRLPMT +P+ ++P+ N +
Sbjct: 653 TKASKLCKAIIVNWL--PGQEGGPATADVLFGDYNPAGRLPMT-FPQHVGQLPLY-YNFK 708
Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
S R+Y +Y +Y FG+GLSYT++ Y L ++ + NI
Sbjct: 709 T-SGRRYEYSDLEYYP---LYYFGYGLSYTSFEYSGL----------KVQEKDNGNIT-- 752
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
VQ +V N G G VV L+ S T +L F
Sbjct: 753 ----------------------VQATVKNVGQRAGDEVVQLYVTDMYASVKTRITELKDF 790
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
R++ SK +SF + P + LS+ N H R++ G ++VG
Sbjct: 791 TRINLKPGESKTVSFELTPYD-LSLLNDHMDRVVEKGEFKILVG 833
>gi|380694609|ref|ZP_09859468.1| periplasmic beta-glucosidase , xylosidase/arabinosidase
[Bacteroides faecis MAJ27]
Length = 804
Score = 270 bits (689), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 223/753 (29%), Positives = 339/753 (45%), Gaps = 137/753 (18%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP + E+ HG + G T FP + +A+++ +L +G A+A
Sbjct: 142 RLGIPVF-LAEEAPHGHMAIG-----------TTVFPTGIGMSATWSPTLIEEVGKAIAK 189
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
E R+ Q + P +++ RDPRW R +ET GEDP++ V
Sbjct: 190 EIRS-----QGAHISYGPVLDLSRDPRWSRVEETFGEDPVLSGRLGAAMVT--------- 235
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
G+G G D E A KH +AY + + G Y A + +
Sbjct: 236 ---GLGSG--------------DLSREHATIATLKHFLAYAVPEGGQNGNY---ASVGAR 275
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCD 328
D + F PPFR IE G A +M SYN ++G+P L Q RNEW F+G++ SD
Sbjct: 276 DLHENFLPPFREAIEAG-ALSVMTSYNSIDGIPCTANHYLLTQLLRNEWKFRGFVVSDLY 334
Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
++ I E T E++A L AG+DI+ G + A+ GK+ E I+ A+ +
Sbjct: 335 SIEGIHESHFVASTMEEAAVQALSAGVDIDLGGDAFMNLLQAVRSGKLDETQINAAVDRI 394
Query: 389 FSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
++ +GLF +P K V EH KLA A+ +VLL+N LPL+
Sbjct: 395 LRMKFEMGLFEHPYVNP------KTTTKMVRNKEHVKLARKVAQSSVVLLENKNSILPLS 448
Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSK-----THYASGC--HD 498
K + +A++GP +N M G YT P K + L+ +SK Y GC D
Sbjct: 449 KK-IKRVAVVGPNADNRYNMLGDYTA-PQEDKDIRTVLDGVISKLSPSRVEYVRGCAIRD 506
Query: 499 VPCNSDAGFHEAVRIAKKADFVIVVAG----LDLT---QET----------------EDR 535
N A EAV A +++ +I V G D QET E
Sbjct: 507 TTVNEIA---EAVEAAHRSEVIIAVVGGSSARDFKTSYQETGAAIADEKSISDMECGEGF 563
Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
DR +L L G+Q L+ ++ +T+ +P+I+V G PLD +A + ++L YPG+AG
Sbjct: 564 DRATLTLLGKQQDLLNAL-KTTGKPLIVVYIEGRPLDKVWASECAD--ALLTASYPGQAG 620
Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
A+A+++FGD+NP GRLP++ P S ++P+ N +A + Y + + +YG
Sbjct: 621 GDAIADVLFGDYNPAGRLPVS-VPRSVGQIPVY-YNKKAPRNHDYVEMA-----ASPLYG 673
Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
FG+GLSYT + Y SD I Q+ + C F
Sbjct: 674 FGYGLSYTTFEY------------------SDLQITQK-------------SPC---HFE 699
Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
V V N G+ DG V L+ + S P KQL F+R KEI F + +
Sbjct: 700 VSFKVKNTGNYDGEEVAQLYLKDEYASVVQPLKQLKHFERFFLRKGEEKEILFTLTE-KD 758
Query: 776 LSIANKHGRRILPLGNHVLMVGELRHSLTIETY 808
LSI ++ +R++ G+ +M+G + + T+
Sbjct: 759 LSIIDRSMKRVVETGDFRIMIGASSDDIRLTTH 791
>gi|150007848|ref|YP_001302591.1| glycoside hydrolase family protein [Parabacteroides distasonis ATCC
8503]
gi|301310124|ref|ZP_07216063.1| beta-glucosidase [Bacteroides sp. 20_3]
gi|423336365|ref|ZP_17314112.1| hypothetical protein HMPREF1059_00064 [Parabacteroides distasonis
CL09T03C24]
gi|149936272|gb|ABR42969.1| glycoside hydrolase family 3, candidate beta-glycosidase
[Parabacteroides distasonis ATCC 8503]
gi|300831698|gb|EFK62329.1| beta-glucosidase [Bacteroides sp. 20_3]
gi|409240840|gb|EKN33614.1| hypothetical protein HMPREF1059_00064 [Parabacteroides distasonis
CL09T03C24]
Length = 868
Score = 269 bits (687), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 162/471 (34%), Positives = 228/471 (48%), Gaps = 61/471 (12%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
YPF N L + R L+S LT +EKI Q+ + AI RLGIP Y+WW+E+LHG+A G
Sbjct: 26 YPFRNPDLPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVARAG- 84
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLT 163
T FPQ + AA+F+ + + V+ EARA Y+ Q GLT
Sbjct: 85 ---------RATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQYQKDKEYDRYKGLT 135
Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
FW PNINIFRDPRWGRG ET GEDP + V + QG
Sbjct: 136 FWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQG------------------- 176
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
D + AC KH + +W +R+ FNA T +D +T+ P F + +
Sbjct: 177 --------DDPNYYKTHACAKHYAVHSGPEW---NRHEFNAEATPRDLYETYLPAFEALV 225
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
++G +MC+YN+ G P C L RN WG+ I SDC A+ + T
Sbjct: 226 KEGDVQEVMCAYNRFEGKPCCSSDKLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPR 285
Query: 343 H------EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
H E ++A + G D+ CG R A+ GK+ EKD+D +L L + LG
Sbjct: 286 HETHPDAESASADAVLNGTDLECGGS-YRALNKALADGKISEKDLDVSLRRLLKGRFELG 344
Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIG 456
+F+ D R Y K+ V + EH ALD AR+ IVLLKN LPL+KN + +A++G
Sbjct: 345 MFDPDERV-PYSKIPYSVVESPEHIAKALDMARKSIVLLKNKNNMLPLDKN-IKKIAVVG 402
Query: 457 PLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGF 507
P + + + Y G P +++ G+ +K A +++ CN A F
Sbjct: 403 PNAADSTMLWANYNGFPTKTVTIVEGIR---NKVPNAEVIYELGCNHTADF 450
Score = 110 bits (275), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 83/298 (27%), Positives = 138/298 (46%), Gaps = 55/298 (18%)
Query: 486 YVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR---------- 535
Y+ +T A V + F K AD ++ V G+ E E+
Sbjct: 574 YMQRTGSADLNFTVGVRTPVDFQATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKG 633
Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
DR ++ +P Q +V ++ T K PV+ V+ G L +++ + +++IL Y G+ G
Sbjct: 634 DRTNIEIPAVQKEMVKALVATGK-PVVYVVCTGSALALNWE--NDHVNAILNAWYGGQEG 690
Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVY 654
A+A+++FGD+NP GRLP+T+Y +S ++P D +M+ GR+YR+ T T +Y
Sbjct: 691 GTAVADVLFGDYNPAGRLPITFY-KSVDQLPDFQDYSMK--------GRTYRYMTQTPLY 741
Query: 655 GFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRF 714
FG+GLSYT + YK +A L D++ S S+
Sbjct: 742 PFGYGLSYTTFDYK----------NAKLSK-------------------DKIASNESVTL 772
Query: 715 HVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDP 772
I+ N G +DG V ++ + P G P K + F RV+ A + +S ++P
Sbjct: 773 SFDIA--NTGKMDGDEVAQIYIKNPNDPAG-PLKAMKAFKRVNVKAGSEQPVSIQLEP 827
>gi|256840106|ref|ZP_05545615.1| glycoside hydrolase family beta-glycosidase [Parabacteroides sp.
D13]
gi|256739036|gb|EEU52361.1| glycoside hydrolase family beta-glycosidase [Parabacteroides sp.
D13]
Length = 868
Score = 269 bits (687), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 162/471 (34%), Positives = 228/471 (48%), Gaps = 61/471 (12%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
YPF N L + R L+S LT +EKI Q+ + AI RLGIP Y+WW+E+LHG+A G
Sbjct: 26 YPFRNPDLPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVARAG- 84
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLT 163
T FPQ + AA+F+ + + V+ EARA Y+ Q GLT
Sbjct: 85 ---------RATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQYQKDKEYDRYKGLT 135
Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
FW PNINIFRDPRWGRG ET GEDP + V + QG
Sbjct: 136 FWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQG------------------- 176
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
D + AC KH + +W +R+ FNA T +D +T+ P F + +
Sbjct: 177 --------DDPNYYKTHACAKHYAVHSGPEW---NRHEFNAEATPRDLYETYLPAFEALV 225
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
++G +MC+YN+ G P C L RN WG+ I SDC A+ + T
Sbjct: 226 KEGDVQEVMCAYNRFEGKPCCSSDKLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPR 285
Query: 343 H------EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
H E ++A + G D+ CG R A+ GK+ EKD+D +L L + LG
Sbjct: 286 HETHPDAESASADAVLNGTDLECGGS-YRALNKALADGKISEKDLDVSLRRLLKGRFELG 344
Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIG 456
+F+ D R Y K+ V + EH ALD AR+ IVLLKN LPL+KN + +A++G
Sbjct: 345 MFDPDERV-PYSKIPYSVVESPEHIAKALDMARKSIVLLKNKNNMLPLDKN-IKKIAVVG 402
Query: 457 PLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGF 507
P + + + Y G P +++ G+ +K A +++ CN A F
Sbjct: 403 PNAADSTMLWANYNGFPSKTVTIVEGIR---NKVPNAEVIYELGCNHTADF 450
Score = 110 bits (275), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 83/298 (27%), Positives = 138/298 (46%), Gaps = 55/298 (18%)
Query: 486 YVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR---------- 535
Y+ +T A V + F K AD ++ V G+ E E+
Sbjct: 574 YMQRTGSADLNFTVGVRTPVDFQATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKG 633
Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
DR ++ +P Q +V ++ T K PV+ V+ G L +++ + +++IL Y G+ G
Sbjct: 634 DRTNIEIPAVQKEMVKALVATGK-PVVYVVCTGSALALNWE--NDHVNAILNAWYGGQEG 690
Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVY 654
A+A+++FGD+NP GRLP+T+Y +S ++P D +M+ GR+YR+ T T +Y
Sbjct: 691 GTAVADVLFGDYNPAGRLPITFY-KSVDQLPDFQDYSMK--------GRTYRYMTQTPLY 741
Query: 655 GFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRF 714
FG+GLSYT + YK +A L D++ S S+
Sbjct: 742 PFGYGLSYTTFDYK----------NAKLSK-------------------DKIASNESVTL 772
Query: 715 HVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDP 772
I+ N G +DG V ++ + P G P K + F RV+ A + +S ++P
Sbjct: 773 SFDIA--NTGKMDGDEVAQIYIKNPNDPAG-PLKAMKAFKRVNVKAGSEQPVSIQLEP 827
>gi|298376791|ref|ZP_06986746.1| beta-glucosidase [Bacteroides sp. 3_1_19]
gi|298266669|gb|EFI08327.1| beta-glucosidase [Bacteroides sp. 3_1_19]
Length = 868
Score = 269 bits (687), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 162/471 (34%), Positives = 228/471 (48%), Gaps = 61/471 (12%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
YPF N L + R L+S LT +EKI Q+ + AI RLGIP Y+WW+E+LHG+A G
Sbjct: 26 YPFRNPELPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVARAG- 84
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLT 163
T FPQ + AA+F+ + + V+ EARA Y+ Q GLT
Sbjct: 85 ---------RATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQYQKDKEYDRYKGLT 135
Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
FW PNINIFRDPRWGRG ET GEDP + V + QG
Sbjct: 136 FWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQG------------------- 176
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
D + AC KH + +W +R+ FNA T +D +T+ P F + +
Sbjct: 177 --------DDPNYYKTHACAKHYAVHSGPEW---NRHEFNAEATPRDLYETYLPAFEALV 225
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
++G +MC+YN+ G P C L RN WG+ I SDC A+ + T
Sbjct: 226 KEGDVQEVMCAYNRFEGKPCCSSDKLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPR 285
Query: 343 H------EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
H E ++A + G D+ CG R A+ GK+ EKD+D +L L + LG
Sbjct: 286 HETHPDAESASADAVLNGTDLECGGS-YRALNKALADGKISEKDLDVSLRRLLKGRFELG 344
Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIG 456
+F+ D R Y K+ V + EH ALD AR+ IVLLKN LPL+KN + +A++G
Sbjct: 345 MFDPDERV-PYSKIPYSVVESPEHIAKALDMARKSIVLLKNKNNMLPLDKN-IKKIAVVG 402
Query: 457 PLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGF 507
P + + + Y G P +++ G+ +K A +++ CN A F
Sbjct: 403 PNAADSTMLWANYNGFPSKTVTIVEGIR---NKVPNAEVIYELGCNHTADF 450
Score = 110 bits (275), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 83/298 (27%), Positives = 138/298 (46%), Gaps = 55/298 (18%)
Query: 486 YVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR---------- 535
Y+ +T A V + F K AD ++ V G+ E E+
Sbjct: 574 YMQRTGSADLNFTVGVRTPVDFQATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKG 633
Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
DR ++ +P Q +V ++ T K PV+ V+ G L +++ + +++IL Y G+ G
Sbjct: 634 DRTNIEIPAVQKEMVKALVATGK-PVVYVVCTGSALALNWE--NDHVNAILNAWYGGQEG 690
Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVY 654
A+A+++FGD+NP GRLP+T+Y +S ++P D +M+ GR+YR+ T T +Y
Sbjct: 691 GTAVADVLFGDYNPAGRLPITFY-KSVDQLPDFQDYSMK--------GRTYRYMTQTPLY 741
Query: 655 GFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRF 714
FG+GLSYT + YK +A L D++ S S+
Sbjct: 742 PFGYGLSYTTFDYK----------NAKLSK-------------------DKIASNESVTL 772
Query: 715 HVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDP 772
I+ N G +DG V ++ + P G P K + F RV+ A + +S ++P
Sbjct: 773 SFDIA--NTGKMDGDEVAQIYIKNPNDPAG-PLKAMKAFKRVNVKAGSEQPVSIQLEP 827
>gi|262381651|ref|ZP_06074789.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
2_1_33B]
gi|262296828|gb|EEY84758.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
2_1_33B]
Length = 868
Score = 269 bits (687), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 162/471 (34%), Positives = 228/471 (48%), Gaps = 61/471 (12%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
YPF N L + R L+S LT +EKI Q+ + AI RLGIP Y+WW+E+LHG+A G
Sbjct: 26 YPFRNPELPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVARAG- 84
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLT 163
T FPQ + AA+F+ + + V+ EARA Y+ Q GLT
Sbjct: 85 ---------RATVFPQAIAMAATFDDNAVHETFTIVSDEARAKYHQYQKDKEYDRYKGLT 135
Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
FW PNINIFRDPRWGRG ET GEDP + V + QG
Sbjct: 136 FWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQG------------------- 176
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
D + AC KH + +W +R+ FNA T +D +T+ P F + +
Sbjct: 177 --------DDPNYYKTHACAKHYAVHSGPEW---NRHEFNAEATPRDLYETYLPAFEALV 225
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
++G +MC+YN+ G P C L RN WG+ I SDC A+ + T
Sbjct: 226 KEGDVQEVMCAYNRFEGKPCCSSDKLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPR 285
Query: 343 H------EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
H E ++A + G D+ CG R A+ GK+ EKD+D +L L + LG
Sbjct: 286 HETHPDAESASADAVLNGTDLECGGS-YRALNKALADGKISEKDLDVSLRRLLKGRFELG 344
Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIG 456
+F+ D R Y K+ V + EH ALD AR+ IVLLKN LPL+KN + +A++G
Sbjct: 345 MFDPDERV-PYSKIPYSVVESPEHIAKALDMARKSIVLLKNKNNMLPLDKN-IKKIAVVG 402
Query: 457 PLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGF 507
P + + + Y G P +++ G+ +K A +++ CN A F
Sbjct: 403 PNAADSTMLWANYNGFPSKTVTIVEGIR---NKVPNAEVIYELGCNHTADF 450
Score = 110 bits (275), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 83/298 (27%), Positives = 138/298 (46%), Gaps = 55/298 (18%)
Query: 486 YVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR---------- 535
Y+ +T A V + F K AD ++ V G+ E E+
Sbjct: 574 YMQRTGSADLNFTVGVRTPVDFQATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKG 633
Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
DR ++ +P Q +V ++ T K PV+ V+ G L +++ + +++IL Y G+ G
Sbjct: 634 DRTNIEIPAVQKEMVKALVATGK-PVVYVVCTGSALALNWE--NDHVNAILNAWYGGQEG 690
Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVY 654
A+A+++FGD+NP GRLP+T+Y +S ++P D +M+ GR+YR+ T T +Y
Sbjct: 691 GTAVADVLFGDYNPAGRLPITFY-KSVDQLPDFQDYSMK--------GRTYRYMTQTPLY 741
Query: 655 GFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRF 714
FG+GLSYT + YK +A L D++ S S+
Sbjct: 742 PFGYGLSYTTFDYK----------NAKLSK-------------------DKIASNESVTL 772
Query: 715 HVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDP 772
I+ N G +DG V ++ + P G P K + F RV+ A + +S ++P
Sbjct: 773 SFDIA--NTGKMDGDEVAQIYIKNPNDPAG-PLKAMKAFKRVNVKAGSEQPVSIQLEP 827
>gi|373956830|ref|ZP_09616790.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
gi|373893430|gb|EHQ29327.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
Length = 823
Score = 268 bits (686), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 213/735 (28%), Positives = 342/735 (46%), Gaps = 125/735 (17%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ + T+FP L ++++ L + +G+ +
Sbjct: 172 RLGIPV-DFTNEGIRGVEA-----------YQATAFPTQLNMGMTWDKPLVNQMGNITGM 219
Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
EARA+ G T +AP +++ RD RWGR +E GEDP +V+ VE K Q N
Sbjct: 220 EARAL------GYTNVYAPILDVARDQRWGRLEEVYGEDPYLVARLGVEMAKGMQQNN-- 271
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
++A KH Y K G + +
Sbjct: 272 -----------------------------QIAATAKHFAVYSANKGGREGLARTDPQVAP 302
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGD---LFQKARNEWGFKGYITS 325
++ E+ PF+ I++ +M SYN +G+P + G L Q+ R E+GFKGY+ S
Sbjct: 303 REVENILLYPFKKVIKEAGLMGVMSSYNDYDGIP--ISGSSYWLIQRLRQEFGFKGYVVS 360
Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDI 381
D DA+ ++ + +D+ AGM++ ++ + + + +GK+ I
Sbjct: 361 DSDALEYLYNKHHVAADLKDAVYQAFMAGMNVRTTFRTPDSIIIYARQLVKEGKLPIDTI 420
Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF 441
+ + ++ V+ +LGLF D + + V + ++ +AL A+++ IVLLKN
Sbjct: 421 NSRVRDVLRVKFKLGLF--DHPYVQDAEASAKLVNCAANQAVALQASKESIVLLKNKGAI 478
Query: 442 LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVS--KTHYASGCHDV 499
LPL+K +LA+IGP N Y + ++L G++A V K YA GC+ V
Sbjct: 479 LPLSKQ--QTLAVIGPNALNDDYAHTHYGPLASKSINILEGIQAKVGAGKVLYALGCNLV 536
Query: 500 ----------PCNSD----AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQ 545
P + D A AV IA+ AD +VV G + E++ R SL LPG
Sbjct: 537 DKHWPESEILPQDPDQAEQAKIDSAVTIARHADVAVVVLGGNTQTAGENKSRTSLDLPGY 596
Query: 546 QMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFG 605
Q+ LV +V T K PV++VL G P+ +++ D I I++ GYPG G A+A+++FG
Sbjct: 597 QLRLVKAVKATGK-PVVVVLIGSQPMTINWI--DQHIDGIIYAGYPGTQGGTAVADVLFG 653
Query: 606 DFNPGGRLPMTWYPESFTKVPMN---DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
D+NPGG+L +T +P+S ++P N N D + +Y FG GLSY
Sbjct: 654 DYNPGGKLTLT-FPKSVGQLPFNFPTKPNSETDEGELAKIKGL-------LYPFGFGLSY 705
Query: 663 TNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTN 722
T ++Y S+L IS +++ SD+ + SC VTN
Sbjct: 706 TTFAY------SDLKISPAIQ--SDQG--------------NVTVSC---------KVTN 734
Query: 723 AGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
G V G VV L+ R + T EK L GFDR+ +KE+ F + P + L + N+
Sbjct: 735 TGKVAGDEVVQLYLRDVLSTVTTYEKVLRGFDRLSLKPGETKEVMFTIVP-DDLKLYNRQ 793
Query: 783 GRRILPLGNHVLMVG 797
+ ++ G +MVG
Sbjct: 794 MKYVVEPGEFKVMVG 808
>gi|222099590|ref|YP_002534158.1| Beta-mannanase [Thermotoga neapolitana DSM 4359]
gi|2429092|gb|AAB70867.1| beta-xylosidase [Thermotoga neapolitana]
gi|221571980|gb|ACM22792.1| Beta-mannanase [Thermotoga neapolitana DSM 4359]
Length = 778
Score = 268 bits (686), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 224/754 (29%), Positives = 341/754 (45%), Gaps = 147/754 (19%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIPA E L G G T+FPQ + A++++ L + +A+
Sbjct: 92 RLGIPAM-IHEECLTGYMGLGG-----------TNFPQAIAMASTWDPDLIEKMTAAIRE 139
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
+ R + GL AP +++ RDPRWGR +ET GE P +V+ V +VK QGEN K
Sbjct: 140 DMRKLG--AHQGL---APVLDVARDPRWGRTEETFGESPYLVARMGVSYVKGLQGENIKE 194
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG-NFSRYSFNAMITE 268
+ A KH Y + G N++ + I E
Sbjct: 195 G----------------------------VVATVKHFAGYSASEGGKNWAPTN----IPE 222
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVP-ACLRGDLFQKARNEWGFKGYITSDC 327
++ + F PF + +++ + +M SY++++GVP A R L R +WGF+G + SD
Sbjct: 223 REFREVFLFPFEAAVKEARVLSVMNSYSEIDGVPCAANRRLLTDILRKDWGFEGIVVSDY 282
Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINC-GTCMLRHTQSAIDKGKVQEKDIDRALL 386
AV + EY K +SA L+AG+D+ T +H + ++KG V E ID A+
Sbjct: 283 FAVNMLGEYHRIAKDKSESARLALEAGIDVELPKTDCYQHLKDLVEKGIVPESLIDEAVS 342
Query: 387 NLFSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLP 443
+ ++ LGLF D K K H+ LAL+ AR+ I+LLKND LP
Sbjct: 343 RVLKLKFMLGLFENPYVDVEKAK----------IESHRDLALEIARKSIILLKNDGT-LP 391
Query: 444 LNKNAVSSLAIIGPLVNNISQMGGGYT----------------GIPCSPKS----LLRGL 483
L KN +A+IGP + + G Y G P P+ L + +
Sbjct: 392 LQKN--KKVALIGPNAGEVRNLLGDYMYLAHIRALLDNIDDVFGNPQIPRENYERLKKSI 449
Query: 484 EAYVSKT--------------HYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG---- 525
E ++ YA GC +V +GF EA+ +AK++D IVV G
Sbjct: 450 EEHMKSIPSVLDAFKEEGIDFEYAKGC-EVTGEDRSGFKEAIEVAKRSDVAIVVVGDRSG 508
Query: 526 --LDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQIS 583
LD T E RD +L LPG Q LV +A+T K PV+LVL G P S +++
Sbjct: 509 LTLDCTT-GESRDMANLKLPGVQEELVLEIAKTGK-PVVLVLITGRPY--SLKNLVDRVN 564
Query: 584 SILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGR 643
+IL + PGEAG +A+ ++I+G NP G+LP++ +P S ++P+ ++ R +
Sbjct: 565 AILQVWLPGEAGGRAIVDVIYGKVNPSGKLPIS-FPRSAGQIPVFHY-VKPSGGRSHWHG 622
Query: 644 SYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHI 703
Y + ++ FGHGLSYT + Y L +
Sbjct: 623 DYVDESTKPLFPFGHGLSYTRFEYSNLR-----------------------------IEP 653
Query: 704 DEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGS 763
EV S + +++ V N GD+DG VV L+ S P K+L GF RV AK
Sbjct: 654 KEVPSAGEV--VIKVDVENVGDMDGDEVVQLYIGREFASVTRPVKELKGFKRVSLKAKEK 711
Query: 764 KEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
K + F + + L+ ++ + ++ G +MVG
Sbjct: 712 KTVVFRLH-TDVLAYYDRDMKLVVEPGEFRVMVG 744
>gi|294675412|ref|YP_003576028.1| family 3 glycosyl hydrolase [Prevotella ruminicola 23]
gi|294472176|gb|ADE81565.1| glycosyl hydrolase, family 3 [Prevotella ruminicola 23]
Length = 875
Score = 268 bits (686), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 159/448 (35%), Positives = 237/448 (52%), Gaps = 51/448 (11%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
P+ N +LS + RA L+S LTL EK+ + D + AIPRLGIP ++WW+E+LHGI NG
Sbjct: 24 PYQNANLSAAQRADDLLSRLTLDEKVSLMMDTSPAIPRLGIPQFQWWNEALHGIGRNG-- 81
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTF 164
T FP + AAS++ +L + +AV+ EAR + L+F
Sbjct: 82 --------FATVFPITMAMAASWDDALLHQVFTAVSDEARVKAQQAKCTGDIKRYQSLSF 133
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
W PNINIFRDPRWGRGQET GEDP + + + V+ QG + +D G+ +R+
Sbjct: 134 WTPNINIFRDPRWGRGQETYGEDPYLTAKMGLAVVRGLQGVGYNGEDLGVS-KYRK---- 188
Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFN-AMITEQDTEDTFQPPFRSCI 283
L AC KH + +W +R+ FN + E+D +T+ P F++ +
Sbjct: 189 --------------LLACAKHFAVHSGPEW---NRHEFNIENLPERDLWETYLPAFKALV 231
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIF-EYQNYTK 341
++GK + +MC+Y +++G C + Q+ R+EWGF G ITSDC A+ + N +K
Sbjct: 232 QEGKVAEVMCAYQRIDGQACCAQTRYEQQILRDEWGFDGLITSDCGAIRDFLPRWHNVSK 291
Query: 342 THEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
+++A + AG D+ CG+ +H A+ +G V+E DIDR+L L + LG + D
Sbjct: 292 DGAEASAKAVLAGTDVECGS-EYKHLPEAVRRGDVKEADIDRSLRRLLIARFELGDMDSD 350
Query: 402 PRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFL----PLNKNAVS--SLAII 455
+ K+ V + HK LAL A + IVLL+N K L PLN A S + ++
Sbjct: 351 DLNA-WTKIPETVVASQAHKDLALKMALKSIVLLQNKIKVLPLGNPLNAGAGSDKDIVVM 409
Query: 456 GPLVNNISQMGGGYTGIPCSPKSLLRGL 483
GP N+ M G Y G P + L G+
Sbjct: 410 GPNANDSVMMWGNYAGYPTHTVTALDGI 437
Score = 74.3 bits (181), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 78/324 (24%), Positives = 126/324 (38%), Gaps = 65/324 (20%)
Query: 486 YVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR---------- 535
YV +T Y + D+ + E + +I V G+ E E+
Sbjct: 589 YVQETGYGALNFDIKKRVNPTAEELLAQIGNTQTIIFVGGISPNLEGEEMRVNEPGFKGG 648
Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
DR S+ LP Q L+ + + K+ + + +G ++ A +IL Y GE G
Sbjct: 649 DRTSIELPQAQRDLLAVLHKAGKKVIFVNCSGSA---MALAPELETCDAILQWWYGGEQG 705
Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
ALA +FG P G+LP+T+Y + D M+ R+YR+Y G ++
Sbjct: 706 GAALATTLFGMVAPSGKLPVTFYKSTDELPDFLDYTMK--------NRTYRYYEGEPLFP 757
Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
FG GL YT ++ DK I +
Sbjct: 758 FGFGLGYTTFNI-------------------DKPIYKNN--------------------K 778
Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
VQ+ V N G G+ V ++ R +G P+K L + +V A +K IS + P +
Sbjct: 779 VQVRVKNLGTTAGTETVQVYIRHLADKEG-PKKSLRAYQQVTLNAAEAKTISIEL-PRKS 836
Query: 776 LS--IANKHGRRILPLGNHVLMVG 797
+ R++P G + +MVG
Sbjct: 837 FEGWDVKTNTMRVVP-GKYEVMVG 859
>gi|261408260|ref|YP_003244501.1| glycoside hydrolase family protein [Paenibacillus sp. Y412MC10]
gi|261284723|gb|ACX66694.1| glycoside hydrolase family 3 domain protein [Paenibacillus sp.
Y412MC10]
Length = 763
Score = 268 bits (686), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 226/744 (30%), Positives = 339/744 (45%), Gaps = 135/744 (18%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP + E HG + G T FP L +++N L+ +I AVA
Sbjct: 104 RLGIPIL-FGEECSHGHMAIG-----------ATVFPVPLTIGSTWNTELFRSISRAVAA 151
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
E RA Q G ++P +++ RDPRWGR +ET GEDP +V+ +AV V+ QGE S
Sbjct: 152 ETRA-----QGGSATYSPVLDVVRDPRWGRTEETFGEDPHLVTEFAVAAVQGLQGERLDS 206
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
L A KH Y + G R + +
Sbjct: 207 --------------------------HTSLLATLKHFAGYGASEGG---RNGAPVHMGLR 237
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCD 328
+ + PFR +E G S +M +YN+++GVP G L Q R WGF G++ +DC
Sbjct: 238 ELHEVDLLPFRKAVEAGALS-VMTAYNEIDGVPCTSSGYLLQDVLREAWGFDGFVITDCG 296
Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLR-HTQSAIDKGKVQEKDIDRALLN 387
A+ + N + ++AA LKAG+D+ M R H A+++G + E+D++RA
Sbjct: 297 AIHMLACGHNTAGSGVEAAAQSLKAGVDMEMSGTMFRAHLHQALEQGLITEEDLNRAAGR 356
Query: 388 LFSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPL 444
+ ++ RLGLF+ DP + + EH LA AA +GIVLLKN+ LPL
Sbjct: 357 VLELKFRLGLFDRPYVDPAWAE------QVIGCKEHIALAYQAAAEGIVLLKNEGNLLPL 410
Query: 445 NKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV-----SKTHYASGCHDV 499
+ ++ ++A+IGP + G YT P P ++ L+ S+ YA GC +
Sbjct: 411 DSSS-GTIAVIGPNAHAPYHQLGDYTS-PQPPGQIVTVLDGIRRRLGDSRVLYAPGCR-I 467
Query: 500 PCNSDAGFHEAVRIAKKADFVIVVAG-----------LDLTQET--------------ED 534
+S GF A+ A++AD +++V G +DL E
Sbjct: 468 QGDSREGFPRALACAEQADVIVMVLGGSSARDFGEGTIDLRTGASVVTGHAESDMECGEG 527
Query: 535 RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEA 594
DR +L L G Q+ L+ + + K PVI+V G P+ + D I SI+ YPG+
Sbjct: 528 IDRSTLTLMGVQLELLQELHKLGK-PVIVVYINGRPITEPWI--DEHIPSIVEAWYPGQE 584
Query: 595 GAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVY 654
G A+A+++FGD NP GRLP++ P+ ++P N N +R+ G+ Y Y
Sbjct: 585 GGSAIADMLFGDINPSGRLPLS-IPKEVGQLP-NSYN-----ARRTRGKRYLETDLAPRY 637
Query: 655 GFGHGLSYTNYSYKFLSA-PSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
FG GLSYT + Y L+ P+ + I E T
Sbjct: 638 PFGFGLSYTEFRYGRLTVEPAVVPIGG------------------------EAT------ 667
Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPC 773
V+I VTNAG DG+ VV L+ S PEK L GF +V A ++E++F +
Sbjct: 668 --VRIDVTNAGARDGAEVVQLYVSDLAASVTRPEKALKGFRKVFLKAGETQEVTFTIG-S 724
Query: 774 EQLSIANKHGRRILPLGNHVLMVG 797
EQL + + ++ G + VG
Sbjct: 725 EQLELIGLDLKPVVEPGEFRIQVG 748
>gi|423223721|ref|ZP_17210190.1| hypothetical protein HMPREF1062_02376 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392638096|gb|EIY31949.1| hypothetical protein HMPREF1062_02376 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 954
Score = 268 bits (686), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 218/767 (28%), Positives = 354/767 (46%), Gaps = 120/767 (15%)
Query: 51 SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDN--ASAIPRLGIPAYEWWSESLHGIAS 108
S + + +L + R +SL+S++T ++K++ + + IP L +P E++HG +
Sbjct: 167 SLRYMDPTLPVEERVESLLSVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSY 225
Query: 109 NGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPN 168
S T FPQ L A++N+ L ++ AV E L + W+P
Sbjct: 226 G----------SGATIFPQALAMGATWNKKLTEDVAMAVGDE-----TLAAGTMQAWSPV 270
Query: 169 INIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFG 228
+++ +D RWGR +ET GEDP++VS ++K +Q +
Sbjct: 271 LDVAQDARWGRCEETFGEDPVLVSQIGGAWIKGYQSKG---------------------- 308
Query: 229 EESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKA 288
L KH + G R S + ++E++ + PFR I
Sbjct: 309 ----------LFTTPKHFGGHGAPLGG---RDSHDIGLSEREMREVHLVPFRHVIRNYDC 355
Query: 289 SCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYT-KTHEDS 346
+M +Y+ GVP +L R EWGF G+I SDC A+ + ++YT K ++
Sbjct: 356 QSVMMAYSDYLGVPVAKSRELLHSILREEWGFDGFIVSDCGAIGNLTARKHYTAKDKIEA 415
Query: 347 AAGVLKAGMDINCG-TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG 405
A L AG+ NCG T + A G++ +++D + + R LF P K
Sbjct: 416 ANQALAAGIATNCGDTYNDKEVIQAAKDGRINMENLDEVCRTMLRMMFRNELFEKTPNKP 475
Query: 406 -KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
+ K+ P + HK++A AAR+ IV+L+N LPL K+ + ++A++GP +++
Sbjct: 476 LDWNKIYPG-WNSDSHKEMARQAARESIVMLENKDNILPLAKD-MRTIAVVGPGADDLQP 533
Query: 465 MGGGYTG--IPCSPKSLLRGLEAYV---SKTHYASGCHDVPCNSDAGFHEAVRIAKKADF 519
G YT +P KS+L G++ V +K Y GC D ++ +AV+ A ++D
Sbjct: 534 --GDYTPKLLPGQLKSVLTGIKQAVGKQTKVVYEQGC-DFTSSNGTDIPKAVKAASQSDV 590
Query: 520 VIVVAGLDLTQET---------EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGP 570
V++V G T E+ E+ D +L+LPG+Q L+ +V T K PVIL+L G P
Sbjct: 591 VVLVLGDCSTSESTTDVYKTSGENHDYATLILPGKQQELLEAVCATGK-PVILILQAGRP 649
Query: 571 LDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDM 630
++S A + + W+ PG+ G A A+++FGD+NP GRLPMT +P ++P+
Sbjct: 650 YNLSKASELCKAILVNWL--PGQEGGPATADVLFGDYNPAGRLPMT-FPRHVGQLPLY-Y 705
Query: 631 NMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNI 690
N + S R+Y FY +Y FG+GLSYT++ Y L ++ + N+
Sbjct: 706 NFKT-SGRRYEYSDMEFYP---LYYFGYGLSYTSFEYSGL----------KIQEKDNGNV 751
Query: 691 LQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQL 750
+Q +V N G G VV L+ S T +L
Sbjct: 752 ------------------------AIQATVKNVGQRAGDEVVQLYITDMYASVKTRITEL 787
Query: 751 IGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
F RVH SK +SF + P E LS+ N R++ G ++VG
Sbjct: 788 KDFTRVHLQPDESKIVSFELTPYE-LSLLNDRMDRVVEKGEFKILVG 833
>gi|224536377|ref|ZP_03676916.1| hypothetical protein BACCELL_01251 [Bacteroides cellulosilyticus
DSM 14838]
gi|224522015|gb|EEF91120.1| hypothetical protein BACCELL_01251 [Bacteroides cellulosilyticus
DSM 14838]
Length = 954
Score = 268 bits (685), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 218/767 (28%), Positives = 354/767 (46%), Gaps = 120/767 (15%)
Query: 51 SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDN--ASAIPRLGIPAYEWWSESLHGIAS 108
S + + +L + R +SL+S++T ++K++ + + IP L +P E++HG +
Sbjct: 167 SLRYMDPTLPVEERVESLLSVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSY 225
Query: 109 NGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPN 168
S T FPQ L A++N+ L ++ AV E L + W+P
Sbjct: 226 G----------SGATIFPQALAMGATWNKKLTEDVAMAVGDE-----TLAAGTMQAWSPV 270
Query: 169 INIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFG 228
+++ +D RWGR +ET GEDP++VS ++K +Q +
Sbjct: 271 LDVAQDARWGRCEETFGEDPVLVSQIGGAWIKGYQSKG---------------------- 308
Query: 229 EESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKA 288
L KH + G R S + ++E++ + PFR I
Sbjct: 309 ----------LFTTPKHFGGHGAPLGG---RDSHDIGLSEREMREVHLVPFRHVIRNYDC 355
Query: 289 SCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYT-KTHEDS 346
+M +Y+ GVP +L R EWGF G+I SDC A+ + ++YT K ++
Sbjct: 356 QSVMMAYSDYLGVPVAKSRELLHSILREEWGFDGFIVSDCGAIGNLTARKHYTAKDKIEA 415
Query: 347 AAGVLKAGMDINCG-TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG 405
A L AG+ NCG T + A G++ +++D + + R LF P K
Sbjct: 416 ANQALAAGIATNCGDTYNDKEVIQAAKDGRINMENLDEVCRTMLRMMFRNELFEKTPNKP 475
Query: 406 -KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
+ K+ P + HK++A AAR+ IV+L+N LPL K+ + ++A++GP +++
Sbjct: 476 LDWNKIYPG-WNSDSHKEMARQAARESIVMLENKDNILPLAKD-MRTIAVVGPGADDLQP 533
Query: 465 MGGGYTG--IPCSPKSLLRGLEAYV---SKTHYASGCHDVPCNSDAGFHEAVRIAKKADF 519
G YT +P KS+L G++ V +K Y GC D ++ +AV+ A ++D
Sbjct: 534 --GDYTPKLLPGQLKSVLTGIKQAVGKQTKVVYEQGC-DFTSSNGTNIPKAVKAASQSDV 590
Query: 520 VIVVAGLDLTQET---------EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGP 570
V++V G T E+ E+ D +L+LPG+Q L+ +V T K PVIL+L G P
Sbjct: 591 VVLVLGDCSTSESTTDVYKTSGENHDYATLILPGKQQELLEAVCATGK-PVILILQAGRP 649
Query: 571 LDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDM 630
++S A + + W+ PG+ G A A+++FGD+NP GRLPMT +P ++P+
Sbjct: 650 YNLSKASELCKAILVNWL--PGQEGGPATADVLFGDYNPAGRLPMT-FPRHVGQLPLY-Y 705
Query: 631 NMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNI 690
N + S R+Y FY +Y FG+GLSYT++ Y L ++ + N+
Sbjct: 706 NFKT-SGRRYEYSDMEFYP---LYYFGYGLSYTSFEYSGL----------KIQEKDNGNV 751
Query: 691 LQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQL 750
+Q +V N G G VV L+ S T +L
Sbjct: 752 ------------------------AIQATVKNVGQRAGDEVVQLYITDMYASVKTRITEL 787
Query: 751 IGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
F RVH SK +SF + P E LS+ N R++ G ++VG
Sbjct: 788 KDFTRVHLQPGESKIVSFELTPYE-LSLLNDRMDRVVEKGEFKILVG 833
>gi|319643197|ref|ZP_07997825.1| glycoside hydrolase family 3 [Bacteroides sp. 3_1_40A]
gi|345520511|ref|ZP_08799899.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
4_3_47FAA]
gi|254835034|gb|EET15343.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
4_3_47FAA]
gi|317385101|gb|EFV66052.1| glycoside hydrolase family 3 [Bacteroides sp. 3_1_40A]
Length = 788
Score = 268 bits (685), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 217/736 (29%), Positives = 343/736 (46%), Gaps = 129/736 (17%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ + T FP A++N+ L + IG V
Sbjct: 138 RLGIPV-DFTNEGIRGLCHD-----------RATYFPAQCGQGATWNKKLIARIGE---V 182
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
EA+ LG + ++P ++I +DPRWGR ET GEDP +V + + S Q N
Sbjct: 183 EAKEAVALGYTNI--YSPILDIAQDPRWGRCVETYGEDPYLVGELGKQMITSLQKYN--- 237
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
L A KH Y + G + + + +
Sbjct: 238 -----------------------------LVATPKHFAVYSIPIGGRDGKTRTDPHVAPR 268
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGD---LFQKARNEWGFKGYITSD 326
+ + PFR ++ A +M SYN +G P + G L + R EWGFKGY+ SD
Sbjct: 269 EMRTLYIEPFRMAFQEAGALGVMSSYNDYDGEP--ITGSYHFLTEILRQEWGFKGYVVSD 326
Query: 327 CDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTC----MLRHTQSAIDKGKVQEKDID 382
+AV I T+ED A + AG++I + + A+D GK+ ++ +D
Sbjct: 327 SEAVEFISNKHKVADTYEDGIAQAVNAGLNIRTHFTPPADFILPLRKAVDDGKISQETLD 386
Query: 383 RALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFL 442
+ + + ++ LGLF+ +P +G GK V + EH+ ++L+AARQ +VLLKN+ L
Sbjct: 387 KRVAEILRIKFWLGLFD-NPYRGN-GKQAEQIVHSKEHQAVSLEAARQSLVLLKNETHLL 444
Query: 443 PLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGCH--- 497
PL+K ++ S+A+IGP + +Q+ Y K++ +G++ + ++ Y GC
Sbjct: 445 PLSK-SIRSIAVIGPNADEQTQLICRYGPANAPIKTVYQGIKELLPHAEVIYKKGCDIID 503
Query: 498 ---------DVPCNSDAG--FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQ 546
D P ++ E +R AK+A+ V++V G + EDR R SL LPG+Q
Sbjct: 504 PHFPESEILDFPKTAEEVRLMQEVIRAAKQAEVVVMVLGGNELTVREDRSRTSLNLPGRQ 563
Query: 547 MSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD 606
L+ +V T K PVILV+ G +++A A + +IL +PGE +A+AE +FGD
Sbjct: 564 EELLKAVCATGK-PVILVMLDGRASSINYAAA--HVPAILHAWFPGEFCGQAVAEALFGD 620
Query: 607 FNPGGRLPMTWYPESFTKVPMN-DMNMRADSSRQYPGRSYRFYTGTQVYG----FGHGLS 661
+NPGGRL +T +P+S ++P +D S + T VYG FGHGLS
Sbjct: 621 YNPGGRLAVT-FPKSVGQIPFAFPFKPGSDES-----------SSTSVYGALYPFGHGLS 668
Query: 662 YTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVT 721
YT ++Y S+L IS S G + D HV +
Sbjct: 669 YTTFTY------SDLHISPS-----------HQGVQGD--------------IHVSCKIK 697
Query: 722 NAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANK 781
N G + G VV L+ R S T K L GF+R+ A + + F + P + L + +K
Sbjct: 698 NTGKIKGDEVVQLYLRDEISSVTTYTKVLRGFERISLKAGEEQTVHFRLRP-QDLGLWDK 756
Query: 782 HGRRILPLGNHVLMVG 797
+ + G+ +M+G
Sbjct: 757 NMNFRVEPGSFKVMLG 772
>gi|6006601|emb|CAB56857.1| beta-mannanase [Thermotoga neapolitana]
Length = 821
Score = 268 bits (684), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 224/754 (29%), Positives = 341/754 (45%), Gaps = 147/754 (19%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIPA E L G G T+FPQ + A++++ L + +A+
Sbjct: 135 RLGIPAM-IHEECLTGYMGLGG-----------TNFPQAIAMASTWDPDLIEKMTAAIRE 182
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
+ R + GL AP +++ RDPRWGR +ET GE P +V+ V +VK QGEN K
Sbjct: 183 DMRKLG--AHQGL---APVLDVARDPRWGRTEETFGESPYLVARMGVSYVKGLQGENIKE 237
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG-NFSRYSFNAMITE 268
+ A KH Y + G N++ + I E
Sbjct: 238 G----------------------------VVATVKHFAGYSASEGGKNWAPTN----IPE 265
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVP-ACLRGDLFQKARNEWGFKGYITSDC 327
++ + F PF + +++ + +M SY++++GVP A R L R +WGF+G + SD
Sbjct: 266 REFREVFLFPFEAAVKEARVLSVMNSYSEIDGVPCAANRRLLTDILRKDWGFEGIVVSDY 325
Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINC-GTCMLRHTQSAIDKGKVQEKDIDRALL 386
AV + EY K +SA L+AG+D+ T +H + ++KG V E ID A+
Sbjct: 326 FAVNMLGEYHRIAKDKSESARLALEAGIDVELPKTDCYQHLKDLVEKGIVPESLIDEAVS 385
Query: 387 NLFSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLP 443
+ ++ LGLF D K K H+ LAL+ AR+ I+LLKND LP
Sbjct: 386 RVLKLKFMLGLFENPYVDVEKAK----------IESHRDLALEIARKSIILLKNDGT-LP 434
Query: 444 LNKNAVSSLAIIGPLVNNISQMGGGYT----------------GIPCSPKS----LLRGL 483
L KN +A+IGP + + G Y G P P+ L + +
Sbjct: 435 LQKN--KKVALIGPNAGEVRNLLGDYMYLAHIRALLDNIDDVFGNPQIPRENYERLKKSI 492
Query: 484 EAYVSKT--------------HYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG---- 525
E ++ YA GC +V +GF EA+ +AK++D IVV G
Sbjct: 493 EEHMKSIPSVLDAFKEEGIDFEYAKGC-EVTGEDRSGFKEAIEVAKRSDVAIVVVGDRSG 551
Query: 526 --LDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQIS 583
LD T E RD +L LPG Q LV +A+T K PV+LVL G P S +++
Sbjct: 552 LTLDCTT-GESRDMANLKLPGVQEELVLEIAKTGK-PVVLVLITGRPY--SLKNLVDRVN 607
Query: 584 SILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGR 643
+IL + PGEAG +A+ ++I+G NP G+LP++ +P S ++P+ ++ R +
Sbjct: 608 AILQVWLPGEAGGRAIVDVIYGKVNPSGKLPIS-FPRSAGQIPVFHY-VKPSGGRSHWHG 665
Query: 644 SYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHI 703
Y + ++ FGHGLSYT + Y L +
Sbjct: 666 DYVDESTKPLFPFGHGLSYTRFEYSNLR-----------------------------IEP 696
Query: 704 DEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGS 763
EV S + +++ V N GD+DG VV L+ S P K+L GF RV AK
Sbjct: 697 KEVPSAGEVV--IKVDVENVGDMDGDEVVQLYIGREFASVTRPVKELKGFKRVSLKAKEK 754
Query: 764 KEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
K + F + + L+ ++ + ++ G +MVG
Sbjct: 755 KTVVFRLH-TDVLAYYDRDMKLVVEPGEFRVMVG 787
>gi|297543748|ref|YP_003676050.1| glycoside hydrolase family 3 domain-containing protein
[Thermoanaerobacter mathranii subsp. mathranii str. A3]
gi|296841523|gb|ADH60039.1| glycoside hydrolase family 3 domain protein [Thermoanaerobacter
mathranii subsp. mathranii str. A3]
Length = 787
Score = 267 bits (683), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 226/751 (30%), Positives = 344/751 (45%), Gaps = 135/751 (17%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIPA ES G + G T FPQ + A+++N L + S +
Sbjct: 94 RLGIPAL-IHEESCSGYMAKG-----------ATIFPQTIGVASTWNPKLVEKMASVIRE 141
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
+ +A+ AP +++ RDPRWGR +ET GEDP +V V +++ Q EN K
Sbjct: 142 QMKAV-----GARQALAPLLDVTRDPRWGRTEETFGEDPYLVMHMGVSYIRGLQTENLK- 195
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG-NFSRYSFNAMITE 268
+G I G KH + Y + G N++ A I
Sbjct: 196 -EGVIATG--------------------------KHFVGYGNSEGGMNWA----PAHIPM 224
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDC 327
++ + F PF + +++ K IM Y++++G+P L R WGF G + SD
Sbjct: 225 RELYEIFLYPFEAAVKEAKLGSIMPGYHELDGIPCHKSKQLLTDILRKNWGFDGIVVSDY 284
Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGT--CMLRHTQSAIDKGKVQEKDIDRAL 385
A+ ++EY +++A L+AG+D+ + C + I++G + ++ A+
Sbjct: 285 FAINQLYEYHRLASNKKEAAKLALEAGVDVELPSTDCYGLPIKELIEQGDIDIDFVNDAV 344
Query: 386 LNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
+ + LGLF +P + K + T E ++LA A++ IVLLKN+ LPL
Sbjct: 345 RRILKAKFLLGLFE-NPYVDE--KRVVEIFDTQEQRQLAYKIAQESIVLLKNESNLLPLK 401
Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLR--------------GLEA---YV- 487
K+ + S+A+IGP +NI M G Y PC +SLL GLEA YV
Sbjct: 402 KD-LQSIAVIGPNADNIRNMIGDY-AYPCHIESLLEMREKDNVFNTPLPEGLEAKDIYVP 459
Query: 488 ---------------SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG-----LD 527
+K YA GC DV + AGF++AV IAK+AD IVV G D
Sbjct: 460 IVSVLQGIKEKVSPKTKVIYAKGC-DVISDDTAGFNKAVEIAKQADVAIVVVGDRAGLTD 518
Query: 528 LTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS-FAEADSQISSIL 586
E RDR L LPG Q LV ++ T PV++VL G P+ +S AE +I +I+
Sbjct: 519 GCTSGESRDRADLNLPGVQEQLVKAIYETGT-PVVVVLINGRPMSISRLAE---KIPAII 574
Query: 587 WIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYR 646
PGE G +A+A++IFGD+NPGG+LP++ P S ++P+ + + + G Y
Sbjct: 575 EAWLPGEEGGRAIADVIFGDYNPGGKLPIS-IPCSVGQLPVYYYHKPSGGRTNWKG-DYV 632
Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEV 706
+ +Y FG+GLSYT + Y S L IS K + + I++
Sbjct: 633 ESSTKPLYPFGYGLSYTEFLY------SNLNISNP-KVSTQEGIIE-------------- 671
Query: 707 TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEI 766
+ + V N G V G VV L+ +S P K+L GF R+ K +
Sbjct: 672 ---------ISVDVKNIGKVKGDEVVQLYIHREFLSVTRPVKELKGFKRITLDVGEQKTV 722
Query: 767 SFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
F + EQL N+ ++ G +M+G
Sbjct: 723 IFQLS-SEQLGFYNEEMEYVVEPGRVEVMIG 752
>gi|289577460|ref|YP_003476087.1| glycoside hydrolase [Thermoanaerobacter italicus Ab9]
gi|289527173|gb|ADD01525.1| glycoside hydrolase family 3 domain protein [Thermoanaerobacter
italicus Ab9]
Length = 787
Score = 267 bits (683), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 224/750 (29%), Positives = 342/750 (45%), Gaps = 133/750 (17%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIPA ES G + G T FPQ + A+++N L + S +
Sbjct: 94 RLGIPAL-IHEESCSGYMAKG-----------ATIFPQTIGVASTWNPKLVEKMASVIRE 141
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
+ +A+ AP +++ RDPRWGR +ET GEDP +V V +++ Q EN K
Sbjct: 142 QMKAV-----GARQALAPLLDVTRDPRWGRTEETFGEDPYLVMHMGVSYIRGLQTENLK- 195
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG-NFSRYSFNAMITE 268
+G I G KH + Y + G N++ A I
Sbjct: 196 -EGVIATG--------------------------KHFVGYGNSEGGMNWA----PAHIPM 224
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDC 327
++ + F PF + +++ K IM Y++++G+P L R WGF G + SD
Sbjct: 225 RELYEIFLYPFEAAVKEAKLGSIMPGYHELDGIPCHKSKQLLTDILRKNWGFDGIVVSDY 284
Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGT--CMLRHTQSAIDKGKVQEKDIDRAL 385
A+ ++EY +++A L+AG+D+ + C + I++G + ++ A+
Sbjct: 285 FAINQLYEYHRLASNKKEAAKLALEAGVDVELPSTDCYGLPIKELIEQGDIDIDFVNDAV 344
Query: 386 LNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
+ + LGLF +P + K + T E ++LA A++ IVLLKN+ LPL
Sbjct: 345 RRILKAKFLLGLFE-NPYVDE--KRVVEIFDTQEQRQLAYKIAQESIVLLKNESNLLPLK 401
Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLR--------------GLEA---YV- 487
K+ + S+A+IGP +NI M G Y PC +SLL GLEA YV
Sbjct: 402 KD-LQSIAVIGPNADNIRNMIGDY-AYPCHIESLLEMREKDNVFNTPLPEGLEAKDIYVP 459
Query: 488 ---------------SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG-----LD 527
+K YA GC DV + AGF++AV +AK+AD IVV G D
Sbjct: 460 IVSVLQGIKEKVSPKTKVIYAKGC-DVISDDTAGFNKAVEVAKQADVAIVVVGDRAGLTD 518
Query: 528 LTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILW 587
E RDR L LPG Q L+ +V T PVI+VL G P+ +S+ +I +I+
Sbjct: 519 GCTSGESRDRADLNLPGVQEELIKAVYETGT-PVIVVLINGRPMSISWIA--EKIPAIIE 575
Query: 588 IGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRF 647
PGE G +A+A++IFGD+NPGG+LP++ P S ++P+ + + + G Y
Sbjct: 576 AWLPGEEGGRAIADVIFGDYNPGGKLPIS-IPRSVGQLPVYYYHKPSGGRTNWKG-DYVE 633
Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVT 707
+ +Y FG+GLSYT + Y S L+IS K + I++
Sbjct: 634 SSTKPLYPFGYGLSYTEFLY------SNLSISHP-KVATQGGIIE--------------- 671
Query: 708 SCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEIS 767
+ V N G V G VV L+ +S P K+L GF R+ K +
Sbjct: 672 --------ISADVKNIGKVKGDEVVQLYIHREFLSVTRPVKELKGFKRITLDVGEQKTVI 723
Query: 768 FGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
F + EQL N+ ++ G +M+G
Sbjct: 724 FQLS-SEQLGFYNEEMEYVVEPGRVEVMIG 752
>gi|393786908|ref|ZP_10375040.1| hypothetical protein HMPREF1068_01320 [Bacteroides nordii
CL02T12C05]
gi|392658143|gb|EIY51773.1| hypothetical protein HMPREF1068_01320 [Bacteroides nordii
CL02T12C05]
Length = 854
Score = 266 bits (681), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 176/490 (35%), Positives = 253/490 (51%), Gaps = 75/490 (15%)
Query: 11 RERILFRVLQLIVVVNVIAFSNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLIS 70
+ RILF V+ L+ V V A N K V + + + R L+S
Sbjct: 3 KRRILFFVVSLVYVFTVQA-QNGKDV-----------------YLDMNAPRHERILDLLS 44
Query: 71 LLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLV 130
LT++EKI L + IPRL I Y +E+LHG+ G NF T FPQ +
Sbjct: 45 KLTIEEKISLLRATSPGIPRLHIDKYYHGNEALHGVVRPG---NF-------TVFPQAIG 94
Query: 131 SAASFNRSLWSNIGSAVAVEARAMYN--------LGQAG--LTFWAPNINIFRDPRWGRG 180
AA +N L + I + ++ EARA +N LGQ LTFW+P +N+ RDPRWGR
Sbjct: 95 LAAMWNPQLLNEISTVISDEARARWNELEQGKKQLGQFSDLLTFWSPTVNMARDPRWGRT 154
Query: 181 QETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLS 240
ET GEDP + V FVK QG+ + R LK +
Sbjct: 155 PETYGEDPFLSGKLGVSFVKGLQGD--------------DPRYLK-------------IV 187
Query: 241 ACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNG 300
+ KH A + E +R+ N +I+E+D + + P F CI +GKA+ IM +YN +N
Sbjct: 188 STPKHFAANNEEH----NRFECNPIISEKDLREYYLPAFEKCIIEGKAASIMTAYNAIND 243
Query: 301 VPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINC 359
VP L L +K R++WGF GY+ SDC + + + Y KT E +AA ++AG+D+ C
Sbjct: 244 VPCTLNNWLLKKVLRHDWGFDGYVVSDCGGPSFLVTHHKYVKTLEAAAALSIQAGLDLEC 303
Query: 360 G-TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTS 418
G + +A + V E +ID A ++ ++RLGLF+ DP Y K+ P V
Sbjct: 304 GDEVYMEPLLNAYKQYMVSEAEIDSAAYHVLRARMRLGLFD-DPALNPYNKISPSIVGCE 362
Query: 419 EHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCS-PK 477
+H KLAL+AARQ IVLLKN+KKFLPL+ + S+A++G +N + G Y+G P + P
Sbjct: 363 KHSKLALEAARQSIVLLKNEKKFLPLDSKKIKSIAVVG--INAGNSEFGDYSGTPVNQPV 420
Query: 478 SLLRGLEAYV 487
S+L G++ V
Sbjct: 421 SILEGIKKKV 430
Score = 144 bits (363), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 96/307 (31%), Positives = 149/307 (48%), Gaps = 57/307 (18%)
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
+ EA I +K D + V G++ + E E +DR S+ LP Q + + + V+ VL
Sbjct: 592 YGEAGDIMRKCDLTVAVLGINKSIEREGQDRYSIELPKDQQIFIEEAYKINPNTVV-VLV 650
Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
G L +++ D I +I+ YPGEAG A+AE++FGD+NPGG+LP+T+Y S ++P
Sbjct: 651 AGSSLAINWM--DEHIPAIVNAWYPGEAGGTAVAEVLFGDYNPGGKLPLTYY-RSLDELP 707
Query: 627 -MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAG 685
+D ++R GR+Y+F+ G +Y FGHGLSYT +SYK LS
Sbjct: 708 AFDDYDIRK-------GRTYQFFEGDPLYAFGHGLSYTTFSYKKLS-------------- 746
Query: 686 SDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT 745
ID S+ F ++ N G +G V L+ + QG+
Sbjct: 747 -----------------IDAAGDVVSVSF----TLKNTGKYEGDEVAQLYVKY----QGS 781
Query: 746 ------PEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGEL 799
P KQL GF+R+H SK+I+ V E + G P G+++ MVG
Sbjct: 782 DSQVKLPLKQLKGFERIHLKKGESKQINLTVPKSELRFWNEEKGEFYTPAGDYLFMVGTA 841
Query: 800 RHSLTIE 806
++ ++
Sbjct: 842 SDAIQLQ 848
>gi|148269983|ref|YP_001244443.1| glycoside hydrolase family 3 protein [Thermotoga petrophila RKU-1]
gi|147735527|gb|ABQ46867.1| glycoside hydrolase, family 3 domain protein [Thermotoga petrophila
RKU-1]
Length = 778
Score = 266 bits (681), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 223/752 (29%), Positives = 343/752 (45%), Gaps = 143/752 (19%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIPA E L G G T+FPQ + A++++ L + +A+
Sbjct: 92 RLGIPAM-IHEECLTGYMGLGG-----------TNFPQAIAMASTWDPDLIEKMTTAIRE 139
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
+ R + GL AP +++ RDPRWGR +ET GE P +V+ V +VK QGE+ K
Sbjct: 140 DMRKIG--AHQGL---APVLDVARDPRWGRTEETFGESPYLVARMGVSYVKGLQGEDIK- 193
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG-NFSRYSFNAMITE 268
KG + A KH Y + G N++ + I E
Sbjct: 194 ---------------KG------------VVATVKHFAGYSASEGGKNWAPTN----IPE 222
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVP-ACLRGDLFQKARNEWGFKGYITSDC 327
++ ++ F PF + +++ +M SY++++GVP A R L R +WGFKG + SD
Sbjct: 223 REFKEVFLFPFEAAVKEANVLSVMNSYSEIDGVPCAANRKLLTDILRKDWGFKGIVVSDY 282
Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINC-GTCMLRHTQSAIDKGKVQEKDIDRALL 386
AV + +Y + ++A L+AG+D+ T ++ + ++KG + E ID A+
Sbjct: 283 FAVKVLEDYHRIARDKSEAARLALEAGIDVELPKTECYQYLKDLVEKGIISEALIDEAVA 342
Query: 387 NLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNK 446
+ ++ LGLF + + K+ HK +ALD AR+ I+LLKND LPL K
Sbjct: 343 RVLRLKFMLGLFENPYVEVEKAKI-------ESHKDIALDIARKSIILLKNDG-ILPLQK 394
Query: 447 NAVSSLAIIGPLVNNISQMGGGYT----------------GIPCSPKS----LLRGLEAY 486
N +A+IGP + + G Y G P P+ L + +E +
Sbjct: 395 N--KKVALIGPNAGEVRNLLGDYMYLAHIRALLDNIDDVFGNPQIPRENYERLKKSIEEH 452
Query: 487 VSKT--------------HYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG------L 526
+ YA GC +V +GF EA+ IAKK+D IVV G L
Sbjct: 453 MKSIPSVLDAFKEEGIEFEYAKGC-EVTGEDRSGFEEAIEIAKKSDVAIVVVGDKSGLTL 511
Query: 527 DLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSIL 586
D T E RD +L LPG Q LV VA+T K PV+LVL G P S ++++IL
Sbjct: 512 DCTT-GESRDMANLKLPGVQEELVLEVAKTGK-PVVLVLITGRPY--SLKNVVDKVNAIL 567
Query: 587 WIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYR 646
+ PGEAG +A+ +II+G NP G+LP++ +P S ++P+ ++ R + Y
Sbjct: 568 QVWLPGEAGGRAIVDIIYGKVNPSGKLPIS-FPRSAGQIPVFHY-VKPSGGRSHWHGDYV 625
Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLS-APSELTISASLKAGSDKNILQQTGSRLDYVHIDE 705
+ ++ FGHGLSYT + Y L P E+ + +
Sbjct: 626 DESTKPLFPFGHGLSYTKFEYSNLRIEPKEVPPAGEVV---------------------- 663
Query: 706 VTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKE 765
+++ V N GD DG VV L+ S P K+L GF RV AK K
Sbjct: 664 ----------IKVDVENTGDRDGDEVVQLYIGREFASVTRPVKELKGFKRVSLKAKEKKT 713
Query: 766 ISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
+ F + + L+ ++ + ++ G +MVG
Sbjct: 714 VVFRLH-MDVLAYYDRDMKLVVEPGEFKVMVG 744
>gi|423289665|ref|ZP_17268515.1| hypothetical protein HMPREF1069_03558 [Bacteroides ovatus
CL02T12C04]
gi|423298158|ref|ZP_17276217.1| hypothetical protein HMPREF1070_04882 [Bacteroides ovatus
CL03T12C18]
gi|392663699|gb|EIY57246.1| hypothetical protein HMPREF1070_04882 [Bacteroides ovatus
CL03T12C18]
gi|392667376|gb|EIY60886.1| hypothetical protein HMPREF1069_03558 [Bacteroides ovatus
CL02T12C04]
Length = 955
Score = 266 bits (681), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 221/764 (28%), Positives = 355/764 (46%), Gaps = 120/764 (15%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDN--ASAIPRLGIPAYEWWSESLHGIASNGP 111
+ + SL + R +SL++++T ++K++ + + IP L +P E++HG +
Sbjct: 171 YMDASLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSYG-- 227
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
S T FPQ L A++NR L + + E + N QA W+P +++
Sbjct: 228 --------SGATIFPQALAMGATWNRKLTEEVAMVIGDET-VVANTKQA----WSPVLDV 274
Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
+D RWGR +ET GEDP++VS ++K +Q
Sbjct: 275 AQDARWGRCEETFGEDPVLVSQIGGAWIKGYQS--------------------------- 307
Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
RG L KH + G R S + ++E++ + PFR + +
Sbjct: 308 -RG----LFTTPKHFGGHGAPLGG---RDSHDIGLSEREMREVHLVPFRHVVRNYDCQSL 359
Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYT-KTHEDSAAG 349
M +Y+ G+P +L Q+ R EWGF G+I SDC A+ + ++YT K ++A
Sbjct: 360 MMAYSDYMGIPVAGSTELLQQILRQEWGFNGFIVSDCGAIGNLTARKHYTAKDKIEAANQ 419
Query: 350 VLKAGMDINCG-TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG-KY 407
L AG+ NCG T + A G++ ++D + + R LF +P K +
Sbjct: 420 ALAAGIATNCGDTYNDKEVIQAAKDGRINMVNLDNVCRTMLATMFRNELFEKNPCKPLDW 479
Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
K+ P + H+++A AAR+ IV+L+N LPL+K + ++A++GP +++ G
Sbjct: 480 NKIYPG-WNSDRHREMARQAARESIVMLENKDNLLPLSK-TLKTIAVLGPGADDLQP--G 535
Query: 468 GYTGI--PCSPKSLLRGLEAYV---SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIV 522
YT P KS+L G++A V +K Y GC D +AV+ A ++D V++
Sbjct: 536 DYTPKLQPGQLKSVLSGIKAAVGKQTKVLYEQGC-DFTTPDATNIPKAVKAASQSDVVVM 594
Query: 523 VAGLDLTQET---------EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
V G T E E+ D +L+LPG+Q L+ +V T K PV+L+L G P D+
Sbjct: 595 VLGDCSTSEATNNVRKTCGENNDWATLILPGKQQELLEAVCATGK-PVVLILQAGRPYDL 653
Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMR 633
+A +IL PG+ G A A+++FGD+NPGGRLPMT +P ++P+ N +
Sbjct: 654 --LKASEMCKAILVNWLPGQEGGPATADVLFGDYNPGGRLPMT-FPRHVGQLPLY-YNFK 709
Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
S R+Y FY +Y FG+GLSYT++ Y S+L I S+ N++
Sbjct: 710 T-SGRRYEYVDMEFYP---LYRFGYGLSYTSFEY------SDLKIQEK----SNGNVM-- 753
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
VQ +V N G G V L+ S T +L F
Sbjct: 754 ----------------------VQATVKNVGGCAGDEVAQLYITDMYASVKTRVMELKDF 791
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
R+H SK +SF + P + +S+ N R++ G +MVG
Sbjct: 792 TRIHLQPGESKNVSFELTPYD-ISLLNDRMDRVVEKGEFKVMVG 834
>gi|373460605|ref|ZP_09552356.1| hypothetical protein HMPREF9944_00620 [Prevotella maculosa OT 289]
gi|371955223|gb|EHO73027.1| hypothetical protein HMPREF9944_00620 [Prevotella maculosa OT 289]
Length = 858
Score = 266 bits (681), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 166/462 (35%), Positives = 245/462 (53%), Gaps = 61/462 (13%)
Query: 51 SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
+YP+ N +LS TRA+ L+S LTL+EK + D + AIPRLGI + WWSE+LHG A+ G
Sbjct: 30 AYPYQNPNLSALTRAQDLLSRLTLEEKALLMLDESPAIPRLGIKKFFWWSEALHGAANMG 89
Query: 111 PGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY-----NLGQ----AG 161
+VT FP+ + AASFN +L + SA + E RA Y N G+
Sbjct: 90 ----------NVTVFPEPIAMAASFNDALLYKVFSAASDEMRAQYHHRIRNGGEDEKFHS 139
Query: 162 LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQG-ENWKSDDGGIGFGFRE 220
L+ W PN+NIFRDPRWGRGQET GEDP + + V+ QG E+ K +R+
Sbjct: 140 LSVWTPNVNIFRDPRWGRGQETYGEDPYLTAVMGTAVVRGLQGPEDSK---------YRK 190
Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPF 279
L AC KH Y + ++R++ N ++ +D +T+ P F
Sbjct: 191 ------------------LWACAKH---YAVHSGPEYTRHTANLNNVSPRDLWETYLPAF 229
Query: 280 RSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQN 338
++ +E+ K +MC+Y ++ P C L Q+ R+EWGF+ + SDC AV+ I +QN
Sbjct: 230 KTLVEEAKVREVMCAYQALDDEPCCGNSRLLQQILRDEWGFQYLVVSDCGAVSDI--WQN 287
Query: 339 YTKTHEDSAAGVLK---AGMDINCG-TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
+ KT D+ K AG D+ CG + A+ +G + EK++D+ +L L +
Sbjct: 288 H-KTSSDAVHATAKAALAGTDVECGFNYTYKCIPEAVQRGLISEKEVDKHVLRLLEGRFD 346
Query: 395 LGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
LG + DP + K+ + + H++L+LD ARQ IVLL+N + LPL KN +A+
Sbjct: 347 LGEMD-DPALVPWSKIPYSVMDSKAHRQLSLDMARQSIVLLQNKQNMLPLKKNN-ERIAV 404
Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGC 496
IGP +N+ M G Y G P ++L G+ A Y GC
Sbjct: 405 IGPNADNVPMMWGNYNGTPNRTVTILDGIRAKHKNVKYIKGC 446
Score = 83.6 bits (205), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 68/271 (25%), Positives = 118/271 (43%), Gaps = 59/271 (21%)
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVART 556
+ E +R K + V+ V G+ E E+ DR + LP Q + ++
Sbjct: 598 YQEVIRALKGIEKVVFVGGISPALEGEEMPVDIPGFKGGDRTDIELPRVQRDFIKALHAA 657
Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
K+ V + +G ++ + +I+ Y G+ G A+A+++FGD+NP G+LP+T
Sbjct: 658 GKQLVYVNCSGSA---IALEPETTACDAIVQAWYAGQEGGTAVADVLFGDYNPSGKLPVT 714
Query: 617 WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
+Y S + +M+ GR+YR+++ ++ FGHGLSYT ++
Sbjct: 715 FYKNSNQLPDYENYSMK--------GRTYRYFSDP-LFAFGHGLSYTTFNMG-------- 757
Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
+A + +D + V+I V N G DG+ V+L+
Sbjct: 758 --TAEIIKKADSIV-------------------------VRIPVENVGSKDGTETVLLYI 790
Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEIS 767
+ + G P K L GF RV V G K ++
Sbjct: 791 KNHQDPNG-PIKSLRGFSRVF-VKAGHKAVA 819
>gi|299141953|ref|ZP_07035087.1| beta-glucosidase [Prevotella oris C735]
gi|298576415|gb|EFI48287.1| beta-glucosidase [Prevotella oris C735]
Length = 858
Score = 266 bits (680), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 166/462 (35%), Positives = 245/462 (53%), Gaps = 61/462 (13%)
Query: 51 SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
+YP+ N +LS TRA+ L+S LTL+EK + D + AIPRLGI + WWSE+LHG A+ G
Sbjct: 30 AYPYQNPNLSALTRAQDLLSRLTLEEKALLMLDESPAIPRLGIKKFFWWSEALHGAANMG 89
Query: 111 PGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY-----NLGQ----AG 161
+VT FP+ + AASFN +L + SA + E RA Y N G+
Sbjct: 90 ----------NVTVFPEPIAMAASFNDALLYKVFSAASDEMRAQYHHRIRNGGEDEKFHS 139
Query: 162 LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQG-ENWKSDDGGIGFGFRE 220
L+ W PN+NIFRDPRWGRGQET GEDP + + V+ QG E+ K +R+
Sbjct: 140 LSVWTPNVNIFRDPRWGRGQETYGEDPYLTAVMGTAVVRGLQGPEDSK---------YRK 190
Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPF 279
L AC KH Y + ++R++ N ++ +D +T+ P F
Sbjct: 191 ------------------LWACAKH---YAVHSGPEYTRHTANLNNVSPRDLWETYLPAF 229
Query: 280 RSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQN 338
++ +E+ K +MC+Y ++ P C L Q+ R+EWGF+ + SDC AV+ I +QN
Sbjct: 230 KTLVEEAKVREVMCAYQALDDEPCCGNSRLLQQILRDEWGFQYLVVSDCGAVSDI--WQN 287
Query: 339 YTKTHEDSAAGVLK---AGMDINCG-TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
+ KT D+ K AG D+ CG + A+ +G + EK++D+ +L L +
Sbjct: 288 H-KTSSDAVHATAKAALAGTDVECGFNYTYKCIPEAVQRGLISEKEVDKHVLRLLEGRFD 346
Query: 395 LGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
LG + DP + K+ + + H++L+LD ARQ IVLL+N + LPL KN +A+
Sbjct: 347 LGEMD-DPALVPWSKIPYSVMDSKAHRQLSLDMARQSIVLLQNKQNMLPLKKNN-ERIAV 404
Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGC 496
IGP +N+ M G Y G P ++L G+ A Y GC
Sbjct: 405 IGPNADNVPMMWGNYNGTPNRTVTILDGIRAKHKNVKYIKGC 446
Score = 83.2 bits (204), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 73/305 (23%), Positives = 132/305 (43%), Gaps = 60/305 (19%)
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVART 556
+ E +R K + V+ V G+ E E+ DR + LP Q + ++
Sbjct: 598 YQEVIRALKGIEKVVFVGGISPALEGEEMPVDIPGFKGGDRTDIELPRVQRDFIKALHAA 657
Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
K+ V + +G ++ + +I+ Y G+ G A+A+++FGD+NP G+LP+T
Sbjct: 658 GKQLVYVNCSGSA---IALEPETTACDAIVQAWYAGQEGGTAVADVLFGDYNPSGKLPVT 714
Query: 617 WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
+Y S + +M+ GR+YR+++ ++ FGHGLSYT ++
Sbjct: 715 FYKNSNQLPDYENYSMK--------GRTYRYFSDP-LFAFGHGLSYTTFNMG-------- 757
Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
+A + +D + V+I V N G DG+ V+L+
Sbjct: 758 --TAEIIKKADSIV-------------------------VRIPVENVGSKDGTETVLLYI 790
Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRI-LPLGNHVLM 795
+ + G P K L GF RV V G + ++ V + +++ + GN+ L+
Sbjct: 791 KNHQDPNG-PIKSLRGFSRVF-VKAGHQAVAELVLTRKSFEFFDENTNTVHFKEGNYDLL 848
Query: 796 VGELR 800
G+ R
Sbjct: 849 YGDKR 853
>gi|423331656|ref|ZP_17309440.1| hypothetical protein HMPREF1075_01453 [Parabacteroides distasonis
CL03T12C09]
gi|409230226|gb|EKN23094.1| hypothetical protein HMPREF1075_01453 [Parabacteroides distasonis
CL03T12C09]
Length = 868
Score = 266 bits (679), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 160/471 (33%), Positives = 227/471 (48%), Gaps = 61/471 (12%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
YPF N L + R L+S LT +EKI Q+ + AI RLGIP Y+WW+E+LHG+A G
Sbjct: 26 YPFRNPDLPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVARAG- 84
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLT 163
T FPQ + AA+F+ + + V+ EARA Y+ Q GLT
Sbjct: 85 ---------RATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQYQKDKEYDRYKGLT 135
Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
FW PNINIFRDPRWGRG ET GEDP + V + QG
Sbjct: 136 FWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQG------------------- 176
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
D + AC KH + +W +R+ F+ T +D +T+ P F + +
Sbjct: 177 --------DDPNYYKTHACAKHYAVHSGPEW---NRHEFDVEATPRDLYETYLPAFEALV 225
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
++G +MC+YN+ G P C L RN WG+ I SDC A+ + T
Sbjct: 226 KEGDVQEVMCAYNRFEGKPCCSSDKLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPR 285
Query: 343 H------EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
H E ++A + G D+ CG R A+ GK+ EKD+D +L L + LG
Sbjct: 286 HETHPDAESASADAVLNGTDLECGGS-YRALNKALADGKISEKDLDVSLRRLLKGRFELG 344
Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIG 456
+F+ D R Y K+ V + EH ALD AR+ IVLLKN LPL+KN + +A++G
Sbjct: 345 MFDPDERV-PYSKIPYSVVESPEHIAKALDMARKSIVLLKNKNNMLPLDKN-IKKIAVVG 402
Query: 457 PLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGF 507
P + + + Y G P +++ G+ +K A +++ CN A F
Sbjct: 403 PNAADSTMLWANYNGFPTKTVTIVEGIR---NKVPNAEVIYELGCNHTADF 450
Score = 110 bits (275), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 83/298 (27%), Positives = 139/298 (46%), Gaps = 55/298 (18%)
Query: 486 YVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR---------- 535
Y+ +T A V + F K AD ++ V G+ E E+
Sbjct: 574 YMQRTGSADLNFTVGVRTPVDFQATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKG 633
Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
DR ++ +P Q +V ++ T K PV+ V+ G L +++ + +++IL Y G+ G
Sbjct: 634 DRTNIEIPAVQKEMVKALVATGK-PVVYVVCTGSALALNWE--NDHVNAILNAWYGGQEG 690
Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVY 654
A+A+++FGD+NP GRLP+T+Y +S ++P D +M+ GR+YR+ T T +Y
Sbjct: 691 GTAVADVLFGDYNPAGRLPITFY-KSVDQLPDFQDYSMK--------GRTYRYMTQTPLY 741
Query: 655 GFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRF 714
FG+GLSYT + YK +A L D++ S S+
Sbjct: 742 PFGYGLSYTTFDYK----------NAKLSK-------------------DKIASNESVTL 772
Query: 715 HVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDP 772
I+ N G +DG V ++ + P G P K + F RV+ A ++ +S ++P
Sbjct: 773 SFDIA--NTGKMDGDEVAQIYIKNPNDPAG-PLKAMKAFKRVNVKAGSAQPVSIQLEP 827
>gi|371777646|ref|ZP_09483968.1| beta-glucosidase [Anaerophaga sp. HS1]
Length = 865
Score = 266 bits (679), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 156/449 (34%), Positives = 241/449 (53%), Gaps = 55/449 (12%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
PF N LS RAK LIS LT+QEK + L D + AIPRLGI + WWSE+LHG A+N
Sbjct: 23 PFQNPDLSSEERAKDLISRLTVQEKARLLCDQSEAIPRLGIKKFNWWSEALHGYANN--- 79
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA---------GLT 163
SVT FPQ + AASFN L I +A++ EARA Y+ Q L+
Sbjct: 80 -------DSVTVFPQPIGMAASFNEELVFEIFNAISDEARAKYHQAQRRGEENRRFLSLS 132
Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQG-ENWKSDDGGIGFGFREKR 222
W PN+NIFRDPRWGRGQET GEDP + S V+ VK QG E+ K +R+
Sbjct: 133 VWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVQVVKGLQGPEDAK---------YRK-- 181
Query: 223 VLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRS 281
L AC KH + +W SR+ N ++ ++ +T+ P F++
Sbjct: 182 ----------------LLACAKHYTVHSGPEW---SRHELNINDVSPREFYETYMPAFKA 222
Query: 282 CIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYT 340
+++ +MC+Y++++ P C + Q+ R+EWG++ + +DC A++ + +
Sbjct: 223 LVQKADVRQVMCAYHRLDDEPCCSNTRILQRILRDEWGYEHMVVADCGAISDFYTTHGIS 282
Query: 341 KTHEDSAAGVLKAGMDINC--GTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
T +AA L AG D+ C + A++K + EKDIDR+L+ + + LG
Sbjct: 283 STPVHAAATGLLAGTDLECIWDNYHYKMLPEALEKDLITEKDIDRSLMRVLKGRFDLGEM 342
Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
+ D + ++ P + +H++LA A+Q IVLL+N K LPL+K++++ +A++GP
Sbjct: 343 D-DNSLVPWAQIPPSVLNCEKHRQLAYKMAQQSIVLLQNKNKVLPLDKSSINKIAVVGPN 401
Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYV 487
++ + G Y G P ++L G+++ V
Sbjct: 402 ADDEVVLWGNYNGTPIRTITVLDGIKSKV 430
Score = 89.0 bits (219), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 80/306 (26%), Positives = 129/306 (42%), Gaps = 61/306 (19%)
Query: 503 SDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTS 552
D F ++ + + VI V GL E E+ DR ++ LP Q + + +
Sbjct: 589 EDIDFTALIKKLEGIETVIFVGGLSGFLEGEEMPVSYPGFKGGDRTNIELPSVQRNCLKA 648
Query: 553 VARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGR 612
+ K + + +G ++ +I+ Y GE+G +A+A+++FGD+NP G+
Sbjct: 649 LKEAGKTVIFVNCSGSA---IALEPETESCDAIIQAWYGGESGGQAIADVLFGDYNPSGK 705
Query: 613 LPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSA 672
LP+T+Y S D +M GR+YR YT ++ FG GLSYTN+
Sbjct: 706 LPVTFYRNSDNLGDFEDYSME--------GRTYR-YTNNHLFPFGFGLSYTNFEIG---- 752
Query: 673 PSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVV 732
KA K+ ++ DE S ++I V N G DG+ +V
Sbjct: 753 ----------KARLSKSTIKA----------DETIS-------IKIPVKNTGKRDGTEIV 785
Query: 733 MLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPC-------EQLSIANKHGRR 785
++ R G P K L GF R+ A +++ + + P Q +A G
Sbjct: 786 QVYVRKVNDIDG-PLKTLKGFQRIAVPAGKTRQANISLPPSTFEFYDWSQRKMAVTPGEY 844
Query: 786 ILPLGN 791
IL GN
Sbjct: 845 ILYYGN 850
>gi|299149090|ref|ZP_07042152.1| periplasmic beta-glucosidase [Bacteroides sp. 3_1_23]
gi|298513851|gb|EFI37738.1| periplasmic beta-glucosidase [Bacteroides sp. 3_1_23]
Length = 1049
Score = 266 bits (679), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 216/747 (28%), Positives = 340/747 (45%), Gaps = 116/747 (15%)
Query: 66 KSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVS----- 120
K L+S +T++EKI QLS G P E+ S+SL G +N +G +
Sbjct: 342 KDLLSRMTVEEKIGQLSQYVGRTLLTG-PESEYLSDSLIARGLVGSVLNISGAKTLRDLQ 400
Query: 121 ----------------------SVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
T FP L + S++ + A+E+ A
Sbjct: 401 EKNMRHSRIKIPILFGMDVIHGYKTIFPTPLAESCSWDLAAIERAAKIAAIESSA----- 455
Query: 159 QAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFG 217
AGL + +AP ++I RD RWGR E GED + S A V FQ W+++
Sbjct: 456 -AGLHWTFAPMVDIARDARWGRVVEGAGEDTYLGSEIAKARVNGFQWNLWENNS------ 508
Query: 218 FREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQP 277
+ AC KH +AY L + G R ++E+ DT+ P
Sbjct: 509 ---------------------VLACAKHWVAYGLPQAG---RDYAPVDMSERTLFDTYLP 544
Query: 278 PFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEY 336
PF++CI+ G M ++N +NG+PA L + R +W F G++ SD +AV +
Sbjct: 545 PFKACIDAG-VLTFMSAFNDINGIPASAHPFLLKDLLRGQWNFNGFVVSDWEAVKQLVA- 602
Query: 337 QNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
Q + +D+ +G+D++ + ++ + I+ GK+ +D+D ++ + ++ L
Sbjct: 603 QGVAEDDKDATRLAFNSGIDMDMTDGLYNKYMKELIEAGKISMEDVDNSVSRILHIKYAL 662
Query: 396 GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAII 455
GLF DP K + + E ALD A + VLLKND LPL KN V S+A++
Sbjct: 663 GLFV-DPYKFCNEEYESQTIMKKEFLDAALDMAHKSAVLLKNDNHTLPLAKN-VRSIAVV 720
Query: 456 GPLVNNISQMGGGYT--GIPCSPKSLLRGLEAYV----SKTHYASGCHDVPCNSDAGFHE 509
GPL +N +++ G + G ++L+G++ + +K YA GC D +GF E
Sbjct: 721 GPLADNQTELLGSWRARGEDRHVTTVLQGIKNKIGGNKTKVGYARGC-DFDGEDKSGFKE 779
Query: 510 AVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGG 569
AV++A K+D VI V G E R R L LPG Q L+ + T K PV++VL G
Sbjct: 780 AVKLASKSDMVIAVVGEKALMSGESRSRAQLDLPGVQEELIKELVATGK-PVVVVLMNGR 838
Query: 570 PLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMND 629
PL + + D +S+IL + G + A+A+I+FGD+NP GRL ++ +P +VP+
Sbjct: 839 PLSIEW--VDKNVSAILETWFLGTSAGTAIADILFGDYNPSGRLTIS-FPRVEGQVPVYY 895
Query: 630 MNMRADSSRQYPGRSYRFYT---GTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGS 686
++ P S + +Y FG+GLSYT +SY SAP
Sbjct: 896 NYKKSGRPGDMPHSSTTRHIDVPNAPLYPFGYGLSYTTFSY---SAPQ------------ 940
Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
++ +Y + ++ V ++VTN GD DG V L+ S P
Sbjct: 941 --------STQKEYTRQETIS--------VSVTVTNTGDRDGEETVQLYVNDKVASVVRP 984
Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPC 773
K+L F ++ A SK + F + P
Sbjct: 985 VKELKAFKKIFLKAGESKTVQFDISPL 1011
>gi|255013451|ref|ZP_05285577.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. 2_1_7]
gi|410103695|ref|ZP_11298616.1| hypothetical protein HMPREF0999_02388 [Parabacteroides sp. D25]
gi|409236424|gb|EKN29231.1| hypothetical protein HMPREF0999_02388 [Parabacteroides sp. D25]
Length = 868
Score = 265 bits (678), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 160/471 (33%), Positives = 227/471 (48%), Gaps = 61/471 (12%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
YPF N L + R L+S LT +EKI Q+ + AI RLGIP Y+WW+E+LHG+A G
Sbjct: 26 YPFRNPDLPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVARAG- 84
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLT 163
T FPQ + AA+F+ + + V+ EARA Y+ Q GLT
Sbjct: 85 ---------RATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQYQKDKEYDRYKGLT 135
Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
FW PNINIFRDPRWGRG ET GEDP + V + QG
Sbjct: 136 FWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQG------------------- 176
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
D + AC KH + +W +R+ F+ T +D +T+ P F + +
Sbjct: 177 --------DDPNYYKTHACAKHYAVHSGPEW---NRHEFDVEATPRDLYETYLPAFEALV 225
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
++G +MC+YN+ G P C L RN WG+ I SDC A+ + T
Sbjct: 226 KEGDVQEVMCAYNRFEGKPCCSSDKLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPR 285
Query: 343 H------EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
H E ++A + G D+ CG R A+ GK+ EKD+D +L L + LG
Sbjct: 286 HETHPDAESASADAVLNGTDLECGGS-YRALNKALADGKISEKDLDVSLRRLLKGRFELG 344
Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIG 456
+F+ D R Y K+ V + EH ALD AR+ IVLLKN LPL+KN + +A++G
Sbjct: 345 MFDPDERV-PYSKIPYSVVESPEHIAKALDMARKSIVLLKNKNNMLPLDKN-IKKIAVVG 402
Query: 457 PLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGF 507
P + + + Y G P +++ G+ +K A +++ CN A F
Sbjct: 403 PNAADSTMLWANYNGFPTKTVTIVEGIR---NKVPNAEVIYELGCNHTADF 450
Score = 110 bits (274), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 83/298 (27%), Positives = 138/298 (46%), Gaps = 55/298 (18%)
Query: 486 YVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR---------- 535
Y+ +T A V + F K AD ++ V G+ E E+
Sbjct: 574 YMQRTGSADLNFTVGVRTPVDFQATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKG 633
Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
DR ++ +P Q +V ++ T K PV+ V+ G L +++ + +++IL Y G+ G
Sbjct: 634 DRTNIEIPAVQKEMVKALVATGK-PVVYVVCTGSALALNWE--NDHVNAILNAWYGGQEG 690
Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVY 654
A+A+++FGD+NP GRLP+T+Y +S ++P D +M+ GR+YR+ T T +Y
Sbjct: 691 GTAVADVLFGDYNPAGRLPITFY-KSVDQLPDFQDYSMK--------GRTYRYMTQTPLY 741
Query: 655 GFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRF 714
FG+GLSYT + YK +A L D++ S S+
Sbjct: 742 PFGYGLSYTTFDYK----------NAKLSK-------------------DKIASNESVTL 772
Query: 715 HVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDP 772
I+ N G +DG V ++ + P G P K + F RV+ A + +S ++P
Sbjct: 773 SFDIA--NTGKMDGDEVAQIYIKNPNDPAG-PLKAMKAFKRVNVKAGSEQPVSIQLEP 827
>gi|409198206|ref|ZP_11226869.1| beta-glucosidase [Marinilabilia salmonicolor JCM 21150]
Length = 775
Score = 265 bits (678), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 230/812 (28%), Positives = 369/812 (45%), Gaps = 129/812 (15%)
Query: 23 VVVNVIAFSNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLS 82
+++ V+ F + ++ K P + SY + I + S+++L+TL+EKI Q++
Sbjct: 6 ILIVVLGFMMTLGACSEVGIAGKTPDWKSY---SGDPVIERKVDSVLNLMTLEEKIGQMN 62
Query: 83 DNASAIPRLG----IPAYEWWSESLHGIASNGPGVN--------------------FNGT 118
+ G + E+ + + G N G + F
Sbjct: 63 QYSGNFATTGEVADTKSGEYLKKGMIGSTFNVFGADHLRMLQEQNLKHSRLKIPMLFAAD 122
Query: 119 V--SSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDP 175
V T+FP L A S++ L A EA A +G+ + +AP I+I RDP
Sbjct: 123 VIHGLETTFPIPLAEACSWDLELMEKSARIAAEEATA------SGVAWNFAPMIDIGRDP 176
Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
RWGR E GED + + A V FQG + +D
Sbjct: 177 RWGRVMEGAGEDVYLATQVARARVIGFQG----------------------IEDYTDLSQ 214
Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
+ A KH + Y G R + ++E++ +TF PPF++ +++G AS M ++
Sbjct: 215 SNTMMATSKHFVGYGAALAG---RDYQSVDMSERELHETFLPPFKATVDEGVAS-FMTAF 270
Query: 296 NQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
N +NGVP LF++ R+ WGF G + +D A+ + + + K + +A + AG
Sbjct: 271 NDLNGVPCTGNQYLFKEILRDRWGFGGMVVTDYTAIMEMVAH-GFAKDLKHAAELAIDAG 329
Query: 355 MDIN-CGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPD 413
+D++ + H + +++G V E+ ID A+ + ++ LGLF+ DP + + +
Sbjct: 330 IDMDMISEAFVTHLKELVEEGDVSEEQIDVAVSRILEMKFLLGLFD-DPFRYFDAERQQE 388
Query: 414 DVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--G 471
V EH K A +AA++ IVLLKN+ LPL+KN +A+IGP V + G + G
Sbjct: 389 VVMNPEHLKTAREAAQRSIVLLKNEGNVLPLDKNTSKRVALIGPFVKERESLNGEWAIKG 448
Query: 472 IPCSPKSLLRGLEAYVSKTH----YASG--------------CHDVPCNSDAGFHEAVRI 513
+LL GLE + YA G +VP GF EAV +
Sbjct: 449 DRNKSVTLLEGLEEKYDGSRVEFTYAQGTTLPLIDRSTQKVSVTEVPDRR--GFAEAVNV 506
Query: 514 AKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
A+ +D ++V G + E R + LPG Q L+ + +T K P++LVL G PLD+
Sbjct: 507 ARNSDVIMVAMGENYHWSGEAASRTDITLPGNQRELLKELKKTGK-PIVLVLFNGRPLDL 565
Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM--NDMN 631
S+ E + + +I+ YPG A+A+I+ GD+NP +L MT +P + ++P+ N N
Sbjct: 566 SWEEEN--VDAIVEAWYPGMMSGHAVADILSGDYNPSAKLVMT-FPRNVGQIPIFYNMKN 622
Query: 632 MRADSSRQYPG---RSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDK 688
++P SY T ++ FG+GLSYT + Y A+ K SDK
Sbjct: 623 TGRPFDAEHPADYRSSYIDSPNTPLFPFGYGLSYTTFEY------------ANAKISSDK 670
Query: 689 NILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEK 748
Q+GS L + VTN GD+DG VV L+ R S P K
Sbjct: 671 F---QSGSSLT----------------ASVEVTNTGDLDGEEVVQLYLRDRVGSVVRPVK 711
Query: 749 QLIGFDRVHTVAKGSKEISFGVDPCEQLSIAN 780
+L GF+++H A +K + F +D E L + N
Sbjct: 712 ELKGFEKIHLKAGETKTVEFSIDE-EMLKMYN 742
>gi|281412136|ref|YP_003346215.1| glycoside hydrolase family 3 domain protein [Thermotoga
naphthophila RKU-10]
gi|281373239|gb|ADA66801.1| glycoside hydrolase family 3 domain protein [Thermotoga
naphthophila RKU-10]
Length = 778
Score = 265 bits (677), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 222/752 (29%), Positives = 343/752 (45%), Gaps = 143/752 (19%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIPA E L G G T+FPQ + A++++ L + +A+
Sbjct: 92 RLGIPAM-IHEECLTGYMGLGG-----------TNFPQAIAMASTWDPDLIEKMTTAIRE 139
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
+ R + GL AP +++ RDPRWGR +ET GE P +V+ V +VK QGE+ K
Sbjct: 140 DMRKIG--AHQGL---APVLDVARDPRWGRTEETFGESPYLVARMGVSYVKGLQGEDIK- 193
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG-NFSRYSFNAMITE 268
KG + A KH Y + G N++ + I E
Sbjct: 194 ---------------KG------------VVATVKHFAGYSASEGGKNWAPTN----IPE 222
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVP-ACLRGDLFQKARNEWGFKGYITSDC 327
++ ++ F PF + +++ +M SY++++GVP A R L R +WGFKG + SD
Sbjct: 223 REFKEVFLFPFEAAVKEANVLSVMNSYSEIDGVPCAANRKLLTDILRKDWGFKGIVVSDY 282
Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINC-GTCMLRHTQSAIDKGKVQEKDIDRALL 386
AV + +Y + ++A L+AG+D+ T ++ + ++KG + E ID A+
Sbjct: 283 FAVKVLEDYHRIARDKSEAARLALEAGIDVELPKTECYQYLKDLVEKGIISEALIDEAVA 342
Query: 387 NLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNK 446
+ ++ LGLF + + K+ HK +ALD AR+ I+LLKND LPL K
Sbjct: 343 RVLRLKFMLGLFENPYVEVEKAKI-------ESHKDIALDIARKSIILLKNDG-ILPLQK 394
Query: 447 NAVSSLAIIGPLVNNISQMGGGYT----------------GIPCSPKS----LLRGLEAY 486
N +A+IGP + + G Y G P P+ L + +E +
Sbjct: 395 N--KKVALIGPNAGEVRNLLGDYMYLAHIRALLDNIDDVFGNPQIPRENYERLKKSIEEH 452
Query: 487 VSKT--------------HYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG------L 526
+ YA GC +V +GF EA+ IAKK+D IVV G L
Sbjct: 453 MKSIPSVLDAFKEEGIEFEYAKGC-EVTGEDRSGFEEAIEIAKKSDVAIVVVGDKSGLTL 511
Query: 527 DLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSIL 586
D T E RD +L LPG Q LV VA+T K PV+LVL G P S ++++IL
Sbjct: 512 DCTT-GESRDMANLKLPGVQEELVLEVAKTGK-PVVLVLITGRPY--SLKNVVDKVNAIL 567
Query: 587 WIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYR 646
+ PGEAG +++ +II+G NP G+LP++ +P S ++P+ ++ R + Y
Sbjct: 568 QVWLPGEAGGRSIVDIIYGKVNPSGKLPIS-FPRSAGQIPVFHY-VKPSGGRSHWHGDYV 625
Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLS-APSELTISASLKAGSDKNILQQTGSRLDYVHIDE 705
+ ++ FGHGLSYT + Y L P E+ + +
Sbjct: 626 DESTKPLFPFGHGLSYTKFEYSNLRIEPKEVPPAGEVV---------------------- 663
Query: 706 VTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKE 765
+++ V N GD DG VV L+ S P K+L GF RV AK K
Sbjct: 664 ----------IKVDVENTGDRDGDEVVQLYIGREFASVTRPVKELKGFKRVSLKAKEKKT 713
Query: 766 ISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
+ F + + L+ ++ + ++ G +MVG
Sbjct: 714 VVFRLH-MDVLAYYDRDMKLVVEPGEFKVMVG 744
>gi|237712573|ref|ZP_04543054.1| glycoside hydrolase family 3 protein [Bacteroides sp. 9_1_42FAA]
gi|345512524|ref|ZP_08792050.1| glycoside hydrolase family beta-glycosidase [Bacteroides dorei
5_1_36/D4]
gi|423239901|ref|ZP_17221016.1| hypothetical protein HMPREF1065_01639 [Bacteroides dorei
CL03T12C01]
gi|229435409|gb|EEO45486.1| glycoside hydrolase family beta-glycosidase [Bacteroides dorei
5_1_36/D4]
gi|229453894|gb|EEO59615.1| glycoside hydrolase family 3 protein [Bacteroides sp. 9_1_42FAA]
gi|392644890|gb|EIY38624.1| hypothetical protein HMPREF1065_01639 [Bacteroides dorei
CL03T12C01]
Length = 788
Score = 265 bits (677), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 216/736 (29%), Positives = 340/736 (46%), Gaps = 129/736 (17%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ + T FP A++N+ L + IG V
Sbjct: 138 RLGIPV-DFTNEGIRGLCHD-----------RATYFPAQCGQGATWNKELIARIGE---V 182
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
EA+ LG + ++P ++I +DPRWGR ET GEDP +V + + S Q N
Sbjct: 183 EAKEAVALGYTNI--YSPILDIAQDPRWGRCVETYGEDPYLVGELGKQMITSLQKHN--- 237
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
L A KH Y + G + + + +
Sbjct: 238 -----------------------------LVATPKHFAVYSIPVGGRDGKTRTDPHVAPR 268
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGD---LFQKARNEWGFKGYITSD 326
+ + PFR ++ A +M SYN +G P + G L + R EWGFKGY+ SD
Sbjct: 269 EMRTLYIEPFRMAFQEAGALGVMSSYNDYDGEP--ITGSYHFLTEILRQEWGFKGYVVSD 326
Query: 327 CDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTC----MLRHTQSAIDKGKVQEKDID 382
+AV I T+ED A + AG++I + + A+ GK+ ++ +D
Sbjct: 327 SEAVEFISSKHKVANTYEDGIAQAVNAGLNIRTHFTPPADFILPLRKAVADGKISQETLD 386
Query: 383 RALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFL 442
+ + + V+ LGLF+ +P +G GK V + EH+ ++L+AARQ +VLLKN+ L
Sbjct: 387 KRVAEILRVKFWLGLFD-NPYRGN-GKQAEQIVHSKEHQAVSLEAARQSLVLLKNEMNLL 444
Query: 443 PLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGCH--- 497
PL+K ++ S+A+IGP + +Q+ Y K++ +G++ + T Y GC
Sbjct: 445 PLSK-SLRSIAVIGPNADERTQLICRYGPANAPIKTVYQGIKERLPHTEVIYRKGCDIID 503
Query: 498 ---------DVPCNSDAG--FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQ 546
D P ++ EA+ AK+A+ V++V G + EDR R SL LPG+Q
Sbjct: 504 PHFPESEVLDFPKTTEEARLMEEAIHAAKQAEVVVMVLGGNELTVREDRSRTSLNLPGRQ 563
Query: 547 MSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD 606
L+ +V T K PV+LVL G +++A A + +IL +PGE +A+AE +FGD
Sbjct: 564 EELLKAVCATGK-PVVLVLLDGRASSINYAAA--HVPAILHAWFPGEFCGQAVAEALFGD 620
Query: 607 FNPGGRLPMTWYPESFTKVPMN-DMNMRADSSRQYPGRSYRFYTGTQVYG----FGHGLS 661
+NPGGRL +T +P+S ++P +D S + T VYG FGHGLS
Sbjct: 621 YNPGGRLAVT-FPKSVGQIPFAFPFKPGSDES-----------SSTSVYGVLYPFGHGLS 668
Query: 662 YTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVT 721
YT +SY +L IS L+ G +I ++ +
Sbjct: 669 YTTFSY------GDLKIS-PLRQGVQGDI------------------------NISCKIK 697
Query: 722 NAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANK 781
N G + G VV L+ R S T K L GF+R+ A + + F + P + L + +K
Sbjct: 698 NTGKIKGDEVVQLYLRDEVSSVTTYTKVLRGFERISLEAGEEQMVHFRLRP-QDLGLWDK 756
Query: 782 HGRRILPLGNHVLMVG 797
+ + G +M+G
Sbjct: 757 NMNFRVEPGKFKVMIG 772
>gi|354580734|ref|ZP_08999639.1| glycoside hydrolase family 3 domain protein [Paenibacillus lactis
154]
gi|353203165|gb|EHB68614.1| glycoside hydrolase family 3 domain protein [Paenibacillus lactis
154]
Length = 766
Score = 265 bits (677), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 217/746 (29%), Positives = 339/746 (45%), Gaps = 136/746 (18%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP + E HG + G T FP L +++N L+ ++ AVA
Sbjct: 104 RLGIPIL-FGEECSHGHMAIG-----------ATVFPVPLTIGSTWNPELFRSMCRAVAA 151
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
E R+ Q G ++P +++ RDPRWGR +ET GEDP +V+ +AV V+ QG+ +
Sbjct: 152 ETRS-----QGGAATYSPVLDVVRDPRWGRTEETFGEDPHLVAEFAVAAVQGLQGDRLDA 206
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
+D L A KH Y + G R + +
Sbjct: 207 EDS--------------------------LLATLKHFAGYGASEGG---RNGAPVHMGLR 237
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVP-ACLRGDLFQKARNEWGFKGYITSDCD 328
+ + PFR +E G A +M +YN+++GVP R L R WGF G++ +DC
Sbjct: 238 ELHEIDLLPFRKAVEAG-AQSVMTAYNEIDGVPCTSSRYLLHDVLREAWGFDGFVITDCG 296
Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLR-HTQSAIDKGKVQEKDIDRALLN 387
A+ + N + E++AA L AG+D+ M R + + A+++G + E D++ A+
Sbjct: 297 AIDMLKSGHNTAASGEEAAAQALTAGVDMEMSGSMFRVYLRQALEQGHITEDDLNTAVGR 356
Query: 388 LFSVQLRLGLFN---GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPL 444
+ +++ RLGLF+ DP + + + EH +LA A +GIVLLKN+ LPL
Sbjct: 357 VLAMKFRLGLFDRPYTDPERAE------KVIGCEEHIELARRVAAEGIVLLKNEGNVLPL 410
Query: 445 NKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPK---SLLRGLEAYV-----SKTHYASGC 496
N +A+IGP N G YT P P ++L G+ ++ ++ YA GC
Sbjct: 411 NPK-TGKIAVIGPNANAPYNQLGDYTS-PQPPGQIITVLEGIRRHIGEDADTRVLYAPGC 468
Query: 497 HDVPCNSDAGFHEAVRIAKKADFVIVVAG-----------LDLTQET------------- 532
+ +S G A+ A +AD +++ G +DL
Sbjct: 469 R-IQGDSREGLSHALACAAEADVIVMAIGGSSARDFGEGTIDLRTGASVVTGLAQSDMEC 527
Query: 533 -EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYP 591
E DR +L L G Q+ L+ + + K PV++V G P+ + D I +IL YP
Sbjct: 528 GEGIDRSTLHLMGVQLELLQEIHKLGK-PVVVVYINGRPITEPW--IDEHIPAILEAWYP 584
Query: 592 GEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGT 651
G+ G A+A+I+FGD NP GRL +T P+ ++P+N ++++ G+ Y
Sbjct: 585 GQEGGSAIADILFGDVNPSGRLTLT-IPKEVGQLPIN------YNAKRTRGKRYLETDLE 637
Query: 652 QVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTS 711
Y FG+GLSYT++ Y LS + ++ GS
Sbjct: 638 PRYPFGYGLSYTDFHYGNLSV--------------EPAVIPADGSAA------------- 670
Query: 712 LRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVD 771
V+I VTN G DG+ VV L+ S PEK L F +V A S+E++F V
Sbjct: 671 ----VRIVVTNTGPRDGAEVVQLYVSDLAASVTRPEKALKAFSKVFLKAGESREVTFTVG 726
Query: 772 PCEQLSIANKHGRRILPLGNHVLMVG 797
P EQL + + ++ G + VG
Sbjct: 727 P-EQLELIGPDMKAVVEPGEFRIRVG 751
>gi|409197445|ref|ZP_11226108.1| glycoside hydrolase family protein [Marinilabilia salmonicolor JCM
21150]
Length = 737
Score = 265 bits (676), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 210/747 (28%), Positives = 340/747 (45%), Gaps = 118/747 (15%)
Query: 51 SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
SYPF N L + TR L+S +TL+EK+ LS + S +PRLGI E HG+A G
Sbjct: 39 SYPFQNADLDMETRVDDLLSRMTLEEKVSALSTDPS-VPRLGIKGAPH-IEGYHGVAMGG 96
Query: 111 PG--VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN---LGQAGLTFW 165
P T FPQ A++N L G ++EAR ++ + + GL
Sbjct: 97 PANWAPKGDERVPTTQFPQAYGMGATWNPELIRKAGEIESIEARYIFQNPEISKGGLVVR 156
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGEN---WKSDDGGIGFGFREKR 222
APN ++ RDPRWGR +E GEDP +V + F K QG++ W++
Sbjct: 157 APNADLGRDPRWGRTEEVLGEDPFLVGTLSTAFTKGLQGDDEKYWRT------------- 203
Query: 223 VLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSC 282
++ KH +A E + S +F+ + + + FR
Sbjct: 204 -----------------ASLLKHFLANSNENTRDSSSSNFDTQLFYE----YYGATFRRA 242
Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
I +G ++ M +YN VNGVPA + + + WG G I +D +
Sbjct: 243 ILEGGSNAYMTAYNAVNGVPAHIHPMHKEISMARWGVNGIICTDGGGYTLLVRAHKAYDD 302
Query: 343 HEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDP 402
+ +A GV+KAG++ A+ G + E+D+D L ++ V ++LG DP
Sbjct: 303 YYRAAEGVIKAGLNQFLDNYR-EGVWGALAHGYLAEEDLDEVLKGVYRVMIKLGQL--DP 359
Query: 403 R-KGKYGKLG----PDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
+ K Y +G P + EH++ AL AR+ +VLLKN+K+ LPL + + +A+IG
Sbjct: 360 QDKVPYASIGRDGKPAPWTSPEHQEAALQMARESVVLLKNEKQTLPLAGDELGKVAVIGH 419
Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKA 517
L + I + Y+G+P + L G++ + G V D ++ AV A +A
Sbjct: 420 LADTI--LLDWYSGMPPFMSTPLDGIKEKM-------GADKVLFAPDNDYNAAVEAASQA 470
Query: 518 DFVIVVAG-------------LDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILV 564
D IVV G D E DR +L L + ++ A + ILV
Sbjct: 471 DVAIVVLGNHPYCDSERWGDCPDPGMGREAVDRKTLRLTDEWLAQRVFEANPN---TILV 527
Query: 565 LTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK 624
L P +++++ + + +I+ I + G++ ALA+++FGD+NPGG+L TW P+S +
Sbjct: 528 LQSSFPYGINWSQEN--LPAIVHITHNGQSTGTALADVLFGDYNPGGKLTQTW-PKSEEQ 584
Query: 625 VP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLK 683
+P M + ++R G +Y ++ G +Y FG GLSYT++ + ++ I+ S
Sbjct: 585 LPDMMEYDIRK-------GHTYMYFNGEPLYPFGFGLSYTSFEW------VDMEITGSSV 631
Query: 684 AGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ 743
+++ ++ V + + N G V G V+ L+A P+ S
Sbjct: 632 KSNEEEVI------------------------VTVKLKNVGQVKGDEVIQLYASFPETSS 667
Query: 744 GTPEKQLIGFDRVHTVAKGSKEISFGV 770
P+K L GF RV SK + V
Sbjct: 668 RRPDKALKGFKRVTLEPGESKNVQIPV 694
>gi|237718444|ref|ZP_04548925.1| glycoside hydrolase [Bacteroides sp. 2_2_4]
gi|229452377|gb|EEO58168.1| glycoside hydrolase [Bacteroides sp. 2_2_4]
Length = 746
Score = 265 bits (676), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 215/747 (28%), Positives = 339/747 (45%), Gaps = 116/747 (15%)
Query: 66 KSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVS----- 120
K L+S +T++EKI QLS G P E+ S+SL G +N +G +
Sbjct: 39 KDLLSRMTVEEKIGQLSQYVGRTLLTG-PESEYLSDSLIARGLVGSVLNISGAKTLRDLQ 97
Query: 121 ----------------------SVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
T FP L + S++ + A+E+ A
Sbjct: 98 EKNMRYSRIKIPILFGMDVIHGYKTIFPTPLAESCSWDLAAIERAAKIAAIESSA----- 152
Query: 159 QAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFG 217
AGL + +AP ++I RD RWGR E GED + S A V FQ W+++
Sbjct: 153 -AGLHWTFAPMVDIARDARWGRVVEGAGEDTYLGSEIAKARVNGFQWNLWENNS------ 205
Query: 218 FREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQP 277
+ AC KH +AY L + G R ++E+ DT+ P
Sbjct: 206 ---------------------VLACAKHWVAYGLPQAG---RDYAPVDMSERTLFDTYLP 241
Query: 278 PFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEY 336
PF++CI+ G M ++N +NG+PA L + R +W F G++ SD +AV +
Sbjct: 242 PFKACIDAG-VLTFMSAFNDINGIPASAHPFLLKDLLRGQWNFNGFVVSDWEAVKQLVA- 299
Query: 337 QNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
Q + +D+ +G+D++ + ++ + I+ GK+ +D+D ++ + ++ L
Sbjct: 300 QGVAEDDKDATRLAFNSGIDMDMTDGLYNKYMKELIEAGKISMEDVDNSVSRILHIKYAL 359
Query: 396 GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAII 455
GLF DP K + + E ALD A + VLLKND LPL KN V S+A++
Sbjct: 360 GLFV-DPYKFCNEEYESQTIMKKEFLDAALDMAHKSAVLLKNDNHTLPLAKN-VRSIAVV 417
Query: 456 GPLVNNISQMGGGYT--GIPCSPKSLLRGLEAYV----SKTHYASGCHDVPCNSDAGFHE 509
GPL +N +++ G + G ++L+G++ + +K YA GC D +GF E
Sbjct: 418 GPLADNQTELLGSWRARGEDRHVTTVLQGIKNKIGGNKTKVGYARGC-DFDGEDKSGFKE 476
Query: 510 AVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGG 569
AV++A K+D VI V G E R R L LPG Q L+ + T K PV++VL G
Sbjct: 477 AVKLASKSDMVIAVVGEKALMSGESRSRAQLDLPGVQEELIKELVATGK-PVVVVLMNGR 535
Query: 570 PLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMND 629
PL + + D +S+IL + G + A+A+I+FGD+NP GRL ++ +P +VP+
Sbjct: 536 PLSIEWV--DKNVSAILETWFLGTSAGTAIADILFGDYNPSGRLTIS-FPRVEGQVPVYY 592
Query: 630 MNMRADSSRQYPGRSYRFYT---GTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGS 686
++ P S + +Y FG+GLSYT +SY S P
Sbjct: 593 NYKKSGRPGDMPHSSTTRHIDVPNAPLYPFGYGLSYTTFSY---SVPQ------------ 637
Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
++ +Y + ++ V ++VTN GD DG V L+ S P
Sbjct: 638 --------STQKEYTRQETIS--------VSVTVTNTGDRDGEETVQLYVNDKVASVVRP 681
Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPC 773
K+L F ++ A SK + F + P
Sbjct: 682 VKELKAFKKIFLKAGESKTVQFDISPL 708
>gi|15642851|ref|NP_227892.1| xylosidase [Thermotoga maritima MSB8]
gi|418046013|ref|ZP_12684107.1| Beta-glucosidase [Thermotoga maritima MSB8]
gi|4980564|gb|AAD35170.1|AE001694_6 xylosidase [Thermotoga maritima MSB8]
gi|351675566|gb|EHA58726.1| Beta-glucosidase [Thermotoga maritima MSB8]
Length = 778
Score = 264 bits (675), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 222/752 (29%), Positives = 343/752 (45%), Gaps = 143/752 (19%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIPA E L G G T+FPQ + A++++ L + +AV
Sbjct: 92 RLGIPAM-IHEECLTGYMGLGG-----------TNFPQAIAMASTWDPDLIEKMTTAVRE 139
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
+ R + GL AP +++ RDPRWGR +ET GE P +V+ V +VK QGE+ K
Sbjct: 140 DMRKIG--AHQGL---APVLDVARDPRWGRTEETFGESPYLVARMGVSYVKGLQGEDIK- 193
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG-NFSRYSFNAMITE 268
KG + A KH Y + G N++ + I E
Sbjct: 194 ---------------KG------------VVATVKHFAGYSASEGGKNWAPTN----IPE 222
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVP-ACLRGDLFQKARNEWGFKGYITSDC 327
++ ++ F PF + +++ +M SY++++GVP A R L R +WGF+G + SD
Sbjct: 223 REFKEVFLFPFEAAVKEANVLSVMNSYSEIDGVPCAANRKLLTDILRKDWGFEGIVVSDY 282
Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINC-GTCMLRHTQSAIDKGKVQEKDIDRALL 386
AV + +Y + ++A L+AG+D+ T ++ + ++KG + E ID A+
Sbjct: 283 FAVKVLEDYHRIARDKSEAARLALEAGIDVELPKTECYQYLKDLVEKGIISEALIDEAVT 342
Query: 387 NLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNK 446
+ ++ LGLF + + K+ H+ +AL+ AR+ I+LLKND LPL K
Sbjct: 343 RVLRLKFMLGLFENPYVEVEKAKI-------ESHRDIALEIARKSIILLKNDG-ILPLQK 394
Query: 447 NAVSSLAIIGPLVNNISQMGGGYT----------------GIPCSPKS----LLRGLEAY 486
N +A+IGP + + G Y G P P+ L + +E +
Sbjct: 395 N--KKVALIGPNAGEVRNLLGDYMYLAHIRALLDNIDDVFGNPQIPRENYERLKKSIEEH 452
Query: 487 VSKT--------------HYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG------L 526
+ YA GC +V +GF EA+ IAKK+D IVV G L
Sbjct: 453 MKSIPSVLDAFKEEGIEFEYAKGC-EVTGEDRSGFEEAIEIAKKSDVAIVVVGDKSGLTL 511
Query: 527 DLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSIL 586
D T E RD +L LPG Q LV VA+T K PV+LVL G P S ++++IL
Sbjct: 512 DCTT-GESRDMANLKLPGVQEELVLEVAKTGK-PVVLVLITGRPY--SLKNVVDKVNAIL 567
Query: 587 WIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYR 646
+ PGEAG +A+ +II+G NP G+LP++ +P S ++P+ ++ R + Y
Sbjct: 568 QVWLPGEAGGRAIVDIIYGKVNPSGKLPIS-FPRSAGQIPVFHY-VKPSGGRSHWHGDYV 625
Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLS-APSELTISASLKAGSDKNILQQTGSRLDYVHIDE 705
+ ++ FGHGLSYT + Y L P E+ + +
Sbjct: 626 DESTKPLFPFGHGLSYTKFEYSNLRIEPKEVPPAGEVV---------------------- 663
Query: 706 VTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKE 765
+++ V N GD DG VV L+ S P K+L GF RV AK K
Sbjct: 664 ----------IKVDVENIGDRDGDEVVQLYIGREFASVTRPVKELKGFKRVSLKAKEKKT 713
Query: 766 ISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
+ F + + L+ N+ + ++ G +MVG
Sbjct: 714 VVFRLH-MDVLAYYNRDMKLVVEPGEFKVMVG 744
>gi|316980598|dbj|BAJ51947.1| putative beta-D-xylosidase [Glycyrrhiza uralensis]
Length = 285
Score = 264 bits (675), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 123/285 (43%), Positives = 191/285 (67%), Gaps = 9/285 (3%)
Query: 525 GLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISS 584
GLD + E E RDRV LLLPG Q LV+ VAR ++ PVILVL GGP+DVSFA+ D +IS+
Sbjct: 2 GLDQSIEAEFRDRVGLLLPGHQQELVSRVARVARGPVILVLMSGGPIDVSFAKNDPKISA 61
Query: 585 ILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGR 643
ILW+GYPG+AG A+A++IFG NPGGRLPMTWYP+++ KVPM +M+MR + + YPGR
Sbjct: 62 ILWVGYPGQAGGTAIADVIFGTTNPGGRLPMTWYPQNYLAKVPMTNMDMRPNPATGYPGR 121
Query: 644 SYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS-ASLKAGSDKNILQQTGSRLDYVH 702
+YRFY G V+ FGHGLSYT +++ AP ++++ A+L+A ++ + R+ +
Sbjct: 122 TYRFYKGPVVFPFGHGLSYTRFTHSLAIAPKQVSVPFATLQAFTNSTVSTSKAVRVSH-- 179
Query: 703 IDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG 762
+C ++ + V N G +DG++ +++F++ P + + KQL+ F + + A
Sbjct: 180 ----ANCDAMEVGFHVDVKNEGSMDGTNTLLVFSK-PPPGKWSATKQLVSFHKTYVPAGS 234
Query: 763 SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIET 807
+ + GV C+ LS+ ++ G R +P+G H L +G+L+HS++++T
Sbjct: 235 KQRVKVGVHVCKHLSVVDEFGIRRIPMGEHELQIGDLKHSISVQT 279
>gi|255545654|ref|XP_002513887.1| hypothetical protein RCOM_1034150 [Ricinus communis]
gi|223546973|gb|EEF48470.1| hypothetical protein RCOM_1034150 [Ricinus communis]
Length = 246
Score = 264 bits (675), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 139/270 (51%), Positives = 173/270 (64%), Gaps = 33/270 (12%)
Query: 8 IQLRERILFRVLQLIVVVNVIAFSNSKPVLNKPDFPC--KPPHFDSYPFCNTSLSISTRA 65
++LR+ L + L + + F++S P +P F C P SY FC TSL IS RA
Sbjct: 1 MKLRKISLLTLFHLTSFL-IFIFADSAP---QPPFSCDYSNPSTKSYLFCQTSLPISQRA 56
Query: 66 KSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIA-----SNGPGVNFNGTVS 120
K L+S LTL EKI QL D A +IPRLG+PAY WWSE+LHG+A S G++FNGT+
Sbjct: 57 KDLVSRLTLDEKISQLVDTAPSIPRLGVPAYRWWSEALHGVAFLPDVSERQGIHFNGTIR 116
Query: 121 SVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA-GLTFWAPNINIFRDPRWGR 179
S TSFPQV+++AASF+ LW IG EARA+YN GQA G+TFWAPNINIFRDPRWGR
Sbjct: 117 SATSFPQVIITAASFDTKLWYRIGQVTGKEARAIYNAGQAIGMTFWAPNINIFRDPRWGR 176
Query: 180 GQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELML 239
GQETPGEDPMV YAV FV+ QG+ S +GG+ G++L
Sbjct: 177 GQETPGEDPMVTGKYAVSFVRGVQGD---SFEGGV------------------LGEKLQA 215
Query: 240 SACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
SACCKH AYDLEKW +R+ FNA ++ +
Sbjct: 216 SACCKHFTAYDLEKWNGTNRFIFNAQVSAR 245
>gi|423215778|ref|ZP_17202304.1| hypothetical protein HMPREF1074_03836 [Bacteroides xylanisolvens
CL03T12C04]
gi|392691421|gb|EIY84666.1| hypothetical protein HMPREF1074_03836 [Bacteroides xylanisolvens
CL03T12C04]
Length = 1049
Score = 264 bits (675), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 215/747 (28%), Positives = 339/747 (45%), Gaps = 116/747 (15%)
Query: 66 KSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVS----- 120
K L+S +T++EKI QLS G P E+ S+SL G +N +G +
Sbjct: 342 KDLLSRMTVEEKIGQLSQYVGRTLLTG-PESEYLSDSLIARGLVGSVLNISGAKTLRDLQ 400
Query: 121 ----------------------SVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
T FP L + S++ + A+E+ A
Sbjct: 401 EKNMRHSRIKIPILFGMDVIHGYKTIFPTPLAESCSWDLAAIERAAKIAAIESSA----- 455
Query: 159 QAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFG 217
AGL + +AP ++I RD RWGR E GED + S A V FQ W+++
Sbjct: 456 -AGLHWTFAPMVDIARDARWGRVVEGAGEDTYLGSEIAKARVNGFQWNLWENNS------ 508
Query: 218 FREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQP 277
+ AC KH +AY L + G R ++E+ DT+ P
Sbjct: 509 ---------------------VLACAKHWVAYGLPQAG---RDYAPVDMSERTLFDTYLP 544
Query: 278 PFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEY 336
PF++CI+ G M ++N +NG+PA L + R +W F G++ SD +AV +
Sbjct: 545 PFKACIDAG-VLTFMSAFNDINGIPASAHPFLLKDLLRGQWNFNGFVVSDWEAVKQLVA- 602
Query: 337 QNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
Q + +D+ +G+D++ + ++ + I+ GK+ +D+D ++ + ++ L
Sbjct: 603 QGVAEDDKDATRLAFNSGIDMDMTDGLYNKYMKELIEAGKISMEDVDNSVSRILHIKYAL 662
Query: 396 GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAII 455
GLF DP K + + E ALD A + VLLKND LPL KN V S+A++
Sbjct: 663 GLFV-DPYKFCNEEYESQTIMKKEFLDAALDMAHKSAVLLKNDNHTLPLAKN-VRSIAVV 720
Query: 456 GPLVNNISQMGGGYT--GIPCSPKSLLRGLEAYV----SKTHYASGCHDVPCNSDAGFHE 509
GPL +N +++ G + G ++L+G++ + +K YA GC D +GF E
Sbjct: 721 GPLADNQTELLGSWRARGEDRHVTTVLQGIKNKIGGNKTKVGYARGC-DFDGEDKSGFKE 779
Query: 510 AVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGG 569
AV++A K+D VI V G E R R L LPG Q L+ + T K PV++VL G
Sbjct: 780 AVKLASKSDMVIAVVGEKALMSGESRSRAQLDLPGVQEELIKELVATGK-PVVVVLMNGR 838
Query: 570 PLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMND 629
PL + + D +S+IL + G + A+A+I+FGD+NP GRL ++ +P +VP+
Sbjct: 839 PLSIEWV--DKNVSAILETWFLGTSAGTAIADILFGDYNPSGRLTIS-FPRVEGQVPVYY 895
Query: 630 MNMRADSSRQYPGRSYRFYT---GTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGS 686
++ P S + +Y FG+GLSYT +SY S P
Sbjct: 896 NYKKSGRPGDMPHSSTTRHIDVPNAPLYPFGYGLSYTTFSY---SVPQ------------ 940
Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
++ +Y + ++ V ++VTN GD DG V L+ S P
Sbjct: 941 --------STQKEYTRQETIS--------VSVTVTNTGDRDGEETVQLYVNDKVASVVRP 984
Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPC 773
K+L F ++ A SK + F + P
Sbjct: 985 VKELKAFKKIFLKAGESKTVQFDISPL 1011
>gi|423229063|ref|ZP_17215468.1| hypothetical protein HMPREF1063_01288 [Bacteroides dorei
CL02T00C15]
gi|423244903|ref|ZP_17225977.1| hypothetical protein HMPREF1064_02183 [Bacteroides dorei
CL02T12C06]
gi|392634816|gb|EIY28728.1| hypothetical protein HMPREF1063_01288 [Bacteroides dorei
CL02T00C15]
gi|392640944|gb|EIY34735.1| hypothetical protein HMPREF1064_02183 [Bacteroides dorei
CL02T12C06]
Length = 788
Score = 264 bits (675), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 215/736 (29%), Positives = 338/736 (45%), Gaps = 129/736 (17%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ + T FP A++N+ L + IG A
Sbjct: 138 RLGIPV-DFTNEGIRGLCHD-----------RATYFPAQCGQGATWNKELIARIGEVEAK 185
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
EA A+ ++P ++I +DPRWGR ET GEDP +V + + S Q N
Sbjct: 186 EAVAL-----EYTNIYSPILDIAQDPRWGRCVETYGEDPYLVGELGKQMITSLQKHN--- 237
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
L A KH Y + G + + + +
Sbjct: 238 -----------------------------LVATPKHFAVYSIPVGGRDGKTRTDPHVAPR 268
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGD---LFQKARNEWGFKGYITSD 326
+ + PFR ++ A +M SYN +G P + G L + R EWGFKGY+ SD
Sbjct: 269 EMRTLYIEPFRMAFQEAGALGVMSSYNDYDGEP--ITGSYHFLTEILRQEWGFKGYVVSD 326
Query: 327 CDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTC----MLRHTQSAIDKGKVQEKDID 382
+AV I T+ED A + AG++I + + A+ GK+ ++ +D
Sbjct: 327 SEAVEFISSKHKVANTYEDGIAQAVNAGLNIRTHFTPPADFILPLRKAVADGKISQETLD 386
Query: 383 RALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFL 442
+ + + V+ LGLF+ +P +G GK V + EH+ ++L+AARQ +VLLKN+ L
Sbjct: 387 KRVAEILRVKFWLGLFD-NPYRGN-GKQAEQIVHSKEHQAVSLEAARQSLVLLKNEMNLL 444
Query: 443 PLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGCH--- 497
PL+K ++ S+A+IGP + +Q+ Y K++ +G++ + T Y GC
Sbjct: 445 PLSK-SLRSIAVIGPNADERTQLICRYGPANAPIKTVYQGIKERLPHTEVIYRKGCDIID 503
Query: 498 ---------DVPCNSDAG--FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQ 546
D P ++ EA+ AK+A+ V++V G + EDR R SL LPG+Q
Sbjct: 504 PHFPESEVLDFPKTTEEARLMEEAIHAAKQAEVVVMVLGGNELTVREDRSRTSLNLPGRQ 563
Query: 547 MSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD 606
L+ +V T K PV+LVL G +++A A + +IL +PGE +A+AE +FGD
Sbjct: 564 EELLKAVCATGK-PVVLVLLDGRASSINYAAA--HVPAILHAWFPGEFCGQAVAEALFGD 620
Query: 607 FNPGGRLPMTWYPESFTKVPMN-DMNMRADSSRQYPGRSYRFYTGTQVYG----FGHGLS 661
+NPGGRL +T +P+S ++P +D S + T VYG FGHGLS
Sbjct: 621 YNPGGRLAVT-FPKSVGQIPFAFPFKPGSDES-----------SSTSVYGVLYPFGHGLS 668
Query: 662 YTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVT 721
YT +SY +L IS L+ G +I ++ +
Sbjct: 669 YTTFSY------GDLKIS-PLRQGVQGDI------------------------NISCKIK 697
Query: 722 NAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANK 781
N G + G VV L+ R S T K L GF+R+ A + + F + P + L + +K
Sbjct: 698 NTGKIKGDEVVQLYLRDEVSSVTTYTKVLRGFERISLEAGEEQMVHFRLRP-QDLGLWDK 756
Query: 782 HGRRILPLGNHVLMVG 797
+ + G +M+G
Sbjct: 757 NMNFRVEPGKFKVMIG 772
>gi|423342048|ref|ZP_17319763.1| hypothetical protein HMPREF1077_01193 [Parabacteroides johnsonii
CL02T12C29]
gi|409219455|gb|EKN12417.1| hypothetical protein HMPREF1077_01193 [Parabacteroides johnsonii
CL02T12C29]
Length = 868
Score = 264 bits (674), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 157/473 (33%), Positives = 227/473 (47%), Gaps = 61/473 (12%)
Query: 50 DSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASN 109
+ YPF N L I R L+ LT +EK+ Q+ + AI RLGIP Y+WW+E+LHG+A
Sbjct: 24 EDYPFRNPDLPIDERIDDLLKRLTAEEKVGQMMNTTPAIERLGIPQYDWWNEALHGVARA 83
Query: 110 GPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------G 161
G T FPQ + AA+F+ + V+ EARA Y+ Q G
Sbjct: 84 G----------KATVFPQAIAMAATFDDDALYETFTMVSDEARAKYHQYQKDKEYDRYKG 133
Query: 162 LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREK 221
LTFW PNINIFRDPRWGRG ET GEDP + V VK QG++ K
Sbjct: 134 LTFWTPNINIFRDPRWGRGMETYGEDPYLTERMGVAVVKGLQGDDPKY------------ 181
Query: 222 RVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRS 281
AC KH + +W +R+ F+ +T +D T+ P F +
Sbjct: 182 ---------------FKTHACAKHYAVHSGPEW---NRHEFDVTVTPRDLWQTYLPAFEA 223
Query: 282 CIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYT 340
+++G +MC+YN+ G P C L RN WG++ I SDC A+ +E T
Sbjct: 224 LVKEGNVQEVMCAYNRYQGKPCCSSDKLLIDILRNSWGYENIILSDCGAINDFWERDERT 283
Query: 341 KTH------EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
H E ++A + G D+ CG R A+ GK+ E D+D +L L +
Sbjct: 284 PRHETHPDAESASADAVLNGTDLECGNS-YRALVKALKDGKISENDLDVSLRRLLKGRFE 342
Query: 395 LGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
LG+F+ D R Y ++ + V + EH AL+ A + +VLLKN LPL+K + +A+
Sbjct: 343 LGMFDPDERV-PYAQIPYNVVESPEHVAQALEMAHKSMVLLKNKNNTLPLSK-TIRKIAV 400
Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGF 507
+GP + + + Y G P ++L G+ V T +++ CN A F
Sbjct: 401 VGPNAADSTMLWANYNGFPTHTVTILEGIRNKVPDTEV---IYELGCNHAADF 450
Score = 117 bits (294), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 84/269 (31%), Positives = 134/269 (49%), Gaps = 55/269 (20%)
Query: 515 KKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILV 564
K AD ++ V G+ E E+ DR ++ LP Q +V ++ T K PV+ V
Sbjct: 603 KDADVIVYVGGISPRLEGEEMPVNVEGFKKGDRTNIELPKVQQEMVKALKATGK-PVVYV 661
Query: 565 LTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK 624
L G L +++ EA+ I +IL Y G+ A+A+I+FGD+NP GRLP+T+Y +S +
Sbjct: 662 LCTGSALALNWEEAN--IDAILNAWYGGQEAGTAVADILFGDYNPSGRLPVTFY-KSIDQ 718
Query: 625 VP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLK 683
+P D +M+ GR+YR+ T T +Y FG+GLSYTN++Y+ +A L
Sbjct: 719 LPDFEDYSMK--------GRTYRYMTETPLYPFGYGLSYTNFAYR----------NAKLS 760
Query: 684 AGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ 743
+G + D+ + T + N G +DG + ++ + P +
Sbjct: 761 SGK--------------IAKDQSVTLT-------FDIANTGKMDGDEIAQIYIKNPNDPE 799
Query: 744 GTPEKQLIGFDRVHTVAKGSKEISFGVDP 772
G P K L F RVH A S+E++ + P
Sbjct: 800 G-PIKALKAFLRVHVKAGDSQEVNIELAP 827
>gi|383114360|ref|ZP_09935124.1| hypothetical protein BSGG_1469 [Bacteroides sp. D2]
gi|313693934|gb|EFS30769.1| hypothetical protein BSGG_1469 [Bacteroides sp. D2]
Length = 863
Score = 264 bits (674), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 177/551 (32%), Positives = 264/551 (47%), Gaps = 79/551 (14%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
YP+ +T L++ RA L+ LTL+EK+ + +N+ AIPRLGI YEWW+E+LHG+A G
Sbjct: 26 YPYQDTKLTVEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARAG- 84
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARA----MYNLGQ----AGLT 163
T FPQ + AASFN L + AV+ EARA GQ GLT
Sbjct: 85 ---------LATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNEKGQYKRYQGLT 135
Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
W PN+NIFRDPRWGRGQET GEDP + + V+ QG +D
Sbjct: 136 MWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQG----PEDA----------- 180
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSC 282
E D+ L AC KH + +W +R+SFNA I +D +T+ P F+
Sbjct: 181 ------EYDK-----LHACAKHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKEL 226
Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTK 341
+++ +MC+YN+ G P C L Q RN+WGFKG + +DC A+ F+ + + +
Sbjct: 227 VQKAGVKEVMCAYNRFEGDPCCGSNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKKH-E 285
Query: 342 THEDSA---AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
TH D+A A + +G D+ CG + A+ KG + E+ I+ ++ L + LG
Sbjct: 286 THPDAAHASADAVLSGTDLECGG-NFKSITDAVKKGLISEEKINTSVKRLLKARFELGEM 344
Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
N + + + +HK+LAL A + +VLL+N+ LPLN+ +A+IGP
Sbjct: 345 NS---THPWSNIPFSVIDCPKHKELALKMAHESLVLLQNNNNILPLNRQM--KVAVIGPN 399
Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEA--------------YVSKTHYASGCHDVPCNSD 504
N+ G Y G P +LL G+ A Y + T S + + +
Sbjct: 400 ANDSVMQWGNYNGFPSHTVTLLEGIRAKLPDAQIIYEPVCGYTNDTTLHSLFNQCSIDGE 459
Query: 505 AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRP---- 560
AGF+ ++ I T + ++ PG + T++ R++ RP
Sbjct: 460 AGFNATYWNNREYKGKIAATDRLTTPFHFSAEGSTVFAPGVGLKNFTAIYRSTFRPTDSG 519
Query: 561 --VILVLTGGG 569
V+T GG
Sbjct: 520 AATFRVMTNGG 530
Score = 110 bits (274), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 92/315 (29%), Positives = 141/315 (44%), Gaps = 63/315 (20%)
Query: 508 HEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTS 557
E + + AD VI G+ E E DR + LP Q ++ + +
Sbjct: 591 REILNKLQSADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNG 650
Query: 558 KRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTW 617
K+ V + +G ++ +IL YPG+AG A+A+++FGD+NP GRLP+T+
Sbjct: 651 KKTVFVNFSGSA---MAIVPETQNCDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITF 707
Query: 618 YPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
Y +S ++P D +M+ GR+YRF T T +Y FG+GLSYT +SY +
Sbjct: 708 Y-KSMQQLPDYEDYSMK--------GRTYRFMTETPLYPFGYGLSYTRFSY------GKA 752
Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
T++ S +K IL I V+N G DG VV ++
Sbjct: 753 TLNQSKLTKGEKAIL-------------------------TIPVSNVGQRDGEEVVQVYI 787
Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSK---EISFGVDPCEQLSIANKHGRRILPL-GNH 792
P +G P+K L GF RV ++AKG +I D E A I PL G +
Sbjct: 788 CRPDDKEG-PQKTLRGFQRV-SIAKGKTQNVQIELPYDSFEWFDAATN---TIRPLNGTY 842
Query: 793 VLMVGELRHSLTIET 807
++ G + ++T
Sbjct: 843 KILYGNSSNEKDLQT 857
>gi|410097652|ref|ZP_11292633.1| hypothetical protein HMPREF1076_01811 [Parabacteroides goldsteinii
CL02T12C30]
gi|409223742|gb|EKN16677.1| hypothetical protein HMPREF1076_01811 [Parabacteroides goldsteinii
CL02T12C30]
Length = 780
Score = 264 bits (674), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 228/756 (30%), Positives = 363/756 (48%), Gaps = 144/756 (19%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP + E HG + G T FP L A+++N L +G A+A+
Sbjct: 125 RLGIPVL-FAEECPHGHMAIG-----------TTVFPTSLSQASTWNAELMHRMGEAIAL 172
Query: 150 EARAM-YNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
EAR+ N+G + P ++I R+PRW R +ET GEDP++ + V F+K QG++
Sbjct: 173 EARSQGANIG------YGPVLDIAREPRWSRMEETFGEDPVLTTHLGVAFMKGMQGKS-- 224
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG-NFSRYSFNAMIT 267
+ D L + KH AY + + G N +R A +
Sbjct: 225 ------------------------QNDGKHLYSTLKHFAAYGIPEAGHNGAR----ANVG 256
Query: 268 EQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSD 326
+ + PPF+ +E+G A+ IM SYN ++GVP L R++WGFKG++ SD
Sbjct: 257 MRQLFSDYLPPFKKAVEEGVAT-IMTSYNTIDGVPCTSNKYLLTDVLRDQWGFKGFVYSD 315
Query: 327 CDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG-TCMLRHTQSAIDKGKVQEKDIDRAL 385
++ I + K ++++A LKAG+D++ G ++ Q A+++G + D++RA+
Sbjct: 316 LTSIEGIVGAR-VAKDNKEAAVLALKAGLDMDLGGNAYGKNLQKALEEGAITMDDLNRAV 374
Query: 386 LNLFSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFL 442
N+ ++ R+GLF P + K V + HK+LA + AR+GIVLLKN+ L
Sbjct: 375 ANVLRLKFRMGLFENPYVSPEQAK------QVVRSKAHKELAREVAREGIVLLKNEG-VL 427
Query: 443 PLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPK--SLLRGLEAYVS---KTHYASGC- 496
PL KN + ++A+IGP + + G YT + ++L G+ VS K +Y GC
Sbjct: 428 PLKKN-IGNIAVIGPNADMMYNQLGDYTAPQEREEIVTVLDGIRKAVSPSTKVNYVKGCA 486
Query: 497 -HDVPCNSDAGFHEAVRI----------AKKADF---VIVVAGLDLTQET---------- 532
D+ ++ EA R + DF I D++ +
Sbjct: 487 IRDITTSNITAAVEAARAADAVVLVVGGSSARDFKTKYIGTGAADVSNDGNQLLSDMDCG 546
Query: 533 EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPG 592
E DR +L L G Q L+ +VA T K P++++ G L+++ A +Q W YPG
Sbjct: 547 EGYDRSTLRLLGDQEKLLKAVAATGK-PLVVIYIQGRTLNMNLASEKAQALLTAW--YPG 603
Query: 593 EAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQ 652
E G A+A+++FGD+NP GRLP++ P S ++P+ Q R+Y GT
Sbjct: 604 EQGGTAIADVLFGDYNPAGRLPVS-VPRSEGQLPL--------FYSQGKQRAYVEEEGTP 654
Query: 653 VYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSL 712
+Y FG+GLSYT + Y L ++ G+ K++LQ SCT
Sbjct: 655 LYAFGYGLSYTKFDYSQL----------EMQKGNGKDVLQ-------------TVSCT-- 689
Query: 713 RFHVQISVTNAGDVDGSHVVMLFA--RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGV 770
VTN GD DG VV L+ +V VSQ +P L F+R+ SK+++F +
Sbjct: 690 -------VTNTGDCDGEEVVQLYICDKVASVSQ-SP-ILLKAFERISLKKGESKKVTFTL 740
Query: 771 DPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
E+LS+ N ++++ G+ +MVG + +E
Sbjct: 741 GE-EELSLYNMEMKQVVEPGDFKVMVGAASDDIRLE 775
>gi|167521708|ref|XP_001745192.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776150|gb|EDQ89770.1| predicted protein [Monosiga brevicollis MX1]
Length = 614
Score = 264 bits (674), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 184/610 (30%), Positives = 281/610 (46%), Gaps = 76/610 (12%)
Query: 71 LLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLV 130
LLT +E + R+G+P Y+W ++HG+ S+ + +GTV TSFP +
Sbjct: 55 LLTAREGGGGSPGPPGNVSRIGLPEYDWGMNAIHGVQSSCIKDD-DGTVYCPTSFPNPVN 113
Query: 131 SAASFNRSLWSNIGSAVAVEARAMYNLG-----------QAGLTFWAPNINIFRDPRWGR 179
++N S + +G + VE RA++ G GL W+PNINI R P WGR
Sbjct: 114 YGFTWNYSAYLELGRIIGVETRALWLAGAVEASTWSGRPHIGLDTWSPNINIARSPLWGR 173
Query: 180 GQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELML 239
QE PGEDP + + + QG+ D+ L
Sbjct: 174 NQEVPGEDPFMNGQFGKAYTLGLQGD-----------------------------DDTYL 204
Query: 240 SACC--KHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQ 297
A KH AY LE +R++FNA+++ DT+ P FR + +GKA +MCSYN
Sbjct: 205 QAIVTLKHWDAYSLEDSDGATRHNFNAIVSNFSLMDTYWPAFRVAVTEGKAKGVMCSYNA 264
Query: 298 VNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGM-D 356
VNG+P C L R+ W F GY++SD AV I + YT + +A ++ G D
Sbjct: 265 VNGIPTCAHPLLRTVLRDLWKFDGYVSSDTGAVEDISDNHKYTPSWATAACAAIRDGQTD 324
Query: 357 INCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-YGKLGPDDV 415
I+ G ++ + +G + +D+D AL N ++ LGLF DP + + Y + V
Sbjct: 325 IDSGAVYMKSLLQGVSEGHCRMEDVDNALRNTLRLRFELGLF--DPVENQSYWHVPLAAV 382
Query: 416 CTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPC- 474
T+ + + + +VLL+N LPL N + +A+IGP M G Y G C
Sbjct: 383 NTNASRATNMLHTLESMVLLQNKNNVLPLASN--TKVALIGPHAKAQEDMVGNYLGQLCP 440
Query: 475 --------SPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGL 526
SP L + + T YA G + C S + EAV +A AD +++ G+
Sbjct: 441 DNNFDCVVSPHDALVSILGTDAVT-YAPGTNVTTC-SQSHIDEAVSVATAADVAVLMLGI 498
Query: 527 DLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSIL 586
D + E E DR S+ LP Q L +++ K VI++L GG ++ Q +I+
Sbjct: 499 DESIEAESNDRKSIDLPECQHQLASAIFAVGKPTVIVLLNGGM---LAIENEKQQADAII 555
Query: 587 WIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYR 646
GYPG G A+A+ + G G + + M+DM M + PGR+YR
Sbjct: 556 EAGYPGFYGGTAIAQTLTGQNEHLG--------DYINWINMSDMEMTSG-----PGRTYR 602
Query: 647 FYTGTQVYGF 656
+Y ++ F
Sbjct: 603 YYKNETLWAF 612
>gi|293371439|ref|ZP_06617870.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
gi|292633636|gb|EFF52194.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
Length = 1049
Score = 264 bits (674), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 215/747 (28%), Positives = 339/747 (45%), Gaps = 116/747 (15%)
Query: 66 KSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVS----- 120
K L+S +T++EKI QLS G P E+ S+SL G +N +G +
Sbjct: 342 KDLLSRMTVEEKIGQLSQYVGRTLLTG-PESEYLSDSLIARGLVGSVLNISGAKTLRDLQ 400
Query: 121 ----------------------SVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
T FP L + S++ + A+E+ A
Sbjct: 401 EKNMRYSRIKIPILFGMDVIHGYKTIFPTPLAESCSWDLAAIERAAKIAAIESSA----- 455
Query: 159 QAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFG 217
AGL + +AP ++I RD RWGR E GED + S A V FQ W+++
Sbjct: 456 -AGLHWTFAPMVDIARDARWGRVVEGAGEDTYLGSEIAKARVNGFQWNLWENNS------ 508
Query: 218 FREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQP 277
+ AC KH +AY L + G R ++E+ DT+ P
Sbjct: 509 ---------------------VLACAKHWVAYGLPQAG---RDYAPVDMSERTLFDTYLP 544
Query: 278 PFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEY 336
PF++CI+ G M ++N +NG+PA L + R +W F G++ SD +AV +
Sbjct: 545 PFKACIDAG-VLTFMSAFNDINGIPASAHPFLLKDLLRGQWNFNGFVVSDWEAVKQLVA- 602
Query: 337 QNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
Q + +D+ +G+D++ + ++ + I+ GK+ +D+D ++ + ++ L
Sbjct: 603 QGVAEDDKDATRLAFNSGIDMDMTDGLYNKYMKELIEAGKISMEDVDNSVSRILHIKYAL 662
Query: 396 GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAII 455
GLF DP K + + E ALD A + VLLKND LPL KN V S+A++
Sbjct: 663 GLFV-DPYKFCNEEYESQTIMKKEFLDAALDMAHKSAVLLKNDNHTLPLAKN-VRSIAVV 720
Query: 456 GPLVNNISQMGGGYT--GIPCSPKSLLRGLEAYV----SKTHYASGCHDVPCNSDAGFHE 509
GPL +N +++ G + G ++L+G++ + +K YA GC D +GF E
Sbjct: 721 GPLADNQTELLGSWRARGEDRHVTTVLQGIKNKIGGNKTKVGYARGC-DFDGEDKSGFKE 779
Query: 510 AVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGG 569
AV++A K+D VI V G E R R L LPG Q L+ + T K PV++VL G
Sbjct: 780 AVKLASKSDMVIAVVGEKALMSGESRSRAQLDLPGVQEELIKELVATGK-PVVVVLMNGR 838
Query: 570 PLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMND 629
PL + + D +S+IL + G + A+A+I+FGD+NP GRL ++ +P +VP+
Sbjct: 839 PLSIEWV--DKNVSAILETWFLGTSAGTAIADILFGDYNPSGRLTIS-FPRVEGQVPVYY 895
Query: 630 MNMRADSSRQYPGRSYRFYT---GTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGS 686
++ P S + +Y FG+GLSYT +SY S P
Sbjct: 896 NYKKSGRPGDMPHSSTTRHIDVPNAPLYPFGYGLSYTTFSY---SVPQ------------ 940
Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
++ +Y + ++ V ++VTN GD DG V L+ S P
Sbjct: 941 --------STQKEYTRQETIS--------VSVTVTNTGDRDGEETVQLYVNDKVASVVRP 984
Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPC 773
K+L F ++ A SK + F + P
Sbjct: 985 VKELKAFKKIFLKAGESKTVQFDISPL 1011
>gi|423302093|ref|ZP_17280116.1| hypothetical protein HMPREF1057_03257 [Bacteroides finegoldii
CL09T03C10]
gi|408471184|gb|EKJ89716.1| hypothetical protein HMPREF1057_03257 [Bacteroides finegoldii
CL09T03C10]
Length = 1039
Score = 263 bits (673), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 216/762 (28%), Positives = 340/762 (44%), Gaps = 141/762 (18%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ S T+FP L ++NR L IG
Sbjct: 245 RLGIPT-DFTNEGIRGVES-----------YKATNFPTQLGLGHTWNRQLLRQIGLITGR 292
Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
EAR + G T +AP +++ RD RWGR +E GE P +V+ +E VK Q +
Sbjct: 293 EARML------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVKGMQHNH-- 344
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
++A KH IAY K + ++
Sbjct: 345 -----------------------------QVAATGKHFIAYSNNKGAREGMARVDPQMSP 375
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVP---------ACLRGDLFQKARNEWGF 319
++ E PF+ I + +M SYN +G P LRGD+ GF
Sbjct: 376 REVEMIHVYPFKRVIREAGLLGVMSSYNDYDGFPIQSSYYWLTTRLRGDM--------GF 427
Query: 320 KGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGK 375
+GY+ SD DAV ++ K +++ ++AG++I C + + + +G+
Sbjct: 428 RGYVVSDSDAVEYLYTKHGTAKDMKEAVRQSVEAGLNIRCTFRSPDSYVLPLRELVKEGE 487
Query: 376 VQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLL 435
+ E+ I+ + ++ V+ +GLF+ + G ++V + ++++AL A+R+ IVLL
Sbjct: 488 LSEEIINDRVRDILRVKFLVGLFDHPYQTDLKG--ADEEVEKASNEEIALQASRESIVLL 545
Query: 436 KNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHY 492
KNDK LPLN + + +A+ GP + S Y + S+L+G++ + ++ Y
Sbjct: 546 KNDKNVLPLNASTIKKIAVCGPNADEHSYALTHYGPLAVEVTSVLKGIQEKLGGKAEVLY 605
Query: 493 ASGCHDVPC--------------NSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRV 538
GC V N +AV K+AD +VV G E++ R
Sbjct: 606 TKGCELVDANWPESELMEYPLSENEQEEIEKAVSQTKQADVAVVVLGGGQRTCGENKSRS 665
Query: 539 SLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKA 598
SL LPG+Q+ L+ +V T K PV+LVL G PL +++ AD + +IL YPG G KA
Sbjct: 666 SLALPGRQLDLLKAVVATGK-PVVLVLINGRPLSINW--ADKFVPAILEAWYPGSKGGKA 722
Query: 599 LAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQ------ 652
+A+++FGD+NPGG+L +T +P++ ++P N S Q G G
Sbjct: 723 VADVLFGDYNPGGKLTVT-FPKTVGQIP---FNFPCKPSSQIDGGKNPGLNGNMSRVNGA 778
Query: 653 VYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSL 712
+Y FG GLSYT + Y S+L IS ++ + K
Sbjct: 779 LYPFGFGLSYTTFEY------SDLKISPAIITPNQKT----------------------- 809
Query: 713 RFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDP 772
+V VTN G G VV L+ R S T EK L GF+RVH +KEI+F +D
Sbjct: 810 --YVTCKVTNTGKRAGDEVVQLYVRDVLSSVTTYEKNLAGFERVHLKPGETKEITFPID- 866
Query: 773 CEQLSIANKHGRRILPLGNHVLMVG------ELRHSLTIETY 808
+ L + N ++ G LM+G L +LT+ Y
Sbjct: 867 RKALELLNADMHWVVEPGEFTLMIGASSTDIRLNGTLTVSGY 908
>gi|160882671|ref|ZP_02063674.1| hypothetical protein BACOVA_00625 [Bacteroides ovatus ATCC 8483]
gi|423289150|ref|ZP_17268000.1| hypothetical protein HMPREF1069_03043 [Bacteroides ovatus CL02T12C04]
gi|423298450|ref|ZP_17276507.1| hypothetical protein HMPREF1070_05172 [Bacteroides ovatus CL03T12C18]
gi|156111986|gb|EDO13731.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus ATCC 8483]
gi|392662991|gb|EIY56545.1| hypothetical protein HMPREF1070_05172 [Bacteroides ovatus CL03T12C18]
gi|392667846|gb|EIY61351.1| hypothetical protein HMPREF1069_03043 [Bacteroides ovatus CL02T12C04]
Length = 1049
Score = 263 bits (673), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 220/751 (29%), Positives = 342/751 (45%), Gaps = 124/751 (16%)
Query: 66 KSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVS----- 120
K L+S +T++EKI QLS G P E+ S+SL G +N +G +
Sbjct: 342 KDLLSRMTVEEKIGQLSQYVGRTLLTG-PESEYLSDSLIARGLVGSVLNISGAKTLRDLQ 400
Query: 121 ----------------------SVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
T FP L + S++ + A+E+ A
Sbjct: 401 EKNMRHSRIKIPILFGMDVIHGYKTIFPTPLAESCSWDLAAIERAAKIAAIESSA----- 455
Query: 159 QAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFG 217
AGL + +AP ++I RD RWGR E GED + S A V FQ W+++
Sbjct: 456 -AGLHWTFAPMVDIARDARWGRVVEGAGEDTYLGSEIAKARVNGFQWNLWENNS------ 508
Query: 218 FREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQP 277
+ AC KH +AY L + G R ++E+ DT+ P
Sbjct: 509 ---------------------VLACAKHWVAYGLPQAG---RDYAPVDMSERTLFDTYLP 544
Query: 278 PFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEY 336
PF++CI+ G M ++N +NG+PA L + R +W F G++ SD +AV +
Sbjct: 545 PFKACIDAG-VLTFMSAFNDINGIPASAHPFLLKDLLRGQWNFNGFVVSDWEAVKQLVA- 602
Query: 337 QNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
Q + +D+ +G+D++ + ++ + I+ GK+ +D+D ++ + ++ L
Sbjct: 603 QGVAEDDKDATRLAFNSGIDMDMTDGLYNKYMKELIEAGKISMEDVDNSVSRILHIKYAL 662
Query: 396 GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAII 455
GLF DP K + + E ALD A + VLLKND LPL KN V S+A++
Sbjct: 663 GLFV-DPYKFCNEEYESQTIMKKEFLDAALDMAHKSAVLLKNDNHTLPLAKN-VRSIAVV 720
Query: 456 GPLVNNISQMGGGYT--GIPCSPKSLLRGLEAYV----SKTHYASGCHDVPCNSDAGFHE 509
GPL +N +++ G + G ++L+G++ + +K YA GC D +GF E
Sbjct: 721 GPLADNQTELLGSWRARGEDRHVTTVLQGIKNKIGGNKTKVGYARGC-DFDGEDKSGFKE 779
Query: 510 AVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGG 569
AV++A K+D VI V G E R R L LPG Q L+ + T K PV++VL G
Sbjct: 780 AVKLASKSDMVIAVVGEKALMSGESRSRAQLDLPGVQEELIKELVATGK-PVVVVLMNGR 838
Query: 570 PLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMND 629
PL + + D +S+IL + G + A+A+I+FGD+NP GRL ++ +P +VP+
Sbjct: 839 PLSIEW--VDKNVSAILETWFLGTSAGTAIADILFGDYNPSGRLTIS-FPRVEGQVPIY- 894
Query: 630 MNMRADSSRQYPGRSYRFYTGTQV-------YGFGHGLSYTNYSYKFLSAPSELTISASL 682
N + S R PG T + Y FG+GLSYT +SY SAP
Sbjct: 895 YNYKK-SGR--PGDMLHSSTTRHIDVPNAPLYPFGYGLSYTTFSY---SAPQ-------- 940
Query: 683 KAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVS 742
++ +Y + ++ V ++VTN GD DG V L+ S
Sbjct: 941 ------------STQKEYTRQETIS--------VSVTVTNTGDRDGEETVQLYVNDKVAS 980
Query: 743 QGTPEKQLIGFDRVHTVAKGSKEISFGVDPC 773
P K+L F ++ A SK + F + P
Sbjct: 981 VVRPVKELKAFKKIFLKAGESKTVQFDISPL 1011
>gi|255690204|ref|ZP_05413879.1| xylosidase/arabinosidase [Bacteroides finegoldii DSM 17565]
gi|260624223|gb|EEX47094.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
finegoldii DSM 17565]
Length = 954
Score = 263 bits (673), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 222/781 (28%), Positives = 356/781 (45%), Gaps = 130/781 (16%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDN--ASAIPRLGIPAYEWWSESLHGIASNG 110
P+ + SL + R +SL++ +T +K++ + + IP L +P E++HG +
Sbjct: 169 PYMDASLPVDERVESLLAAMTPADKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSYG- 226
Query: 111 PGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNIN 170
S T FPQ L A++NR L + A+ E + N QA W+P ++
Sbjct: 227 ---------SGATIFPQALAMGATWNRQLTEEVAMAIGDET-VIANTKQA----WSPVLD 272
Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
+ +D RWGR +ET GEDP++VS ++K +Q +
Sbjct: 273 VAQDARWGRCEETFGEDPVLVSQMGGAWIKGYQSKG------------------------ 308
Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
L KH + G R S + ++E++ + PFR I
Sbjct: 309 --------LFTTPKHFGGHGAPLGG---RDSHDIGLSEREMREVHLVPFRHVIRNYDCQS 357
Query: 291 IMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYT-KTHEDSAA 348
+M +Y+ G+P +L Q+ R EWGF G+I SDC A+ + ++YT K ++A
Sbjct: 358 LMMAYSDYMGIPIAKSTELLQRILRQEWGFNGFIVSDCGAIGNLTARKHYTAKDKIEAAN 417
Query: 349 GVLKAGMDINCG-TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG-K 406
L AG+ NCG T + A G++ +++D + + R LF +P K
Sbjct: 418 QALAAGIATNCGDTYNNKEVIQAAKDGRINMENLDNVCRTMLATMFRNELFEKNPCKPLD 477
Query: 407 YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMG 466
+ K+ P + HK +A AA + IV+L+N LPL+K + ++A++GP +++
Sbjct: 478 WNKIYPG-WNSDSHKAMAHRAACESIVMLENKDNLLPLSKE-LRTIAVLGPGADDLQP-- 533
Query: 467 GGYTGI--PCSPKSLLRGLEAYVSKTH---YASGCHDVPCNSDAGFHEAVRIAKKADFVI 521
G YT P KS+L G++A VSK Y GC D +AV+ A +AD V+
Sbjct: 534 GDYTPKLQPGQLKSVLTGIKAAVSKQTKVLYEKGC-DFTETGMTDIPKAVKTASQADVVV 592
Query: 522 VVAG-LDLTQETED--------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLD 572
+V G +++ T+D D +L+LPG+Q L+ +V T K PVIL+L G P D
Sbjct: 593 MVLGDCSISEATKDVRKTCGENNDLATLVLPGKQQELLEAVCATGK-PVILILQAGRPYD 651
Query: 573 VSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNM 632
+ A + + W+ PG+ G A A+++FGD+NPGGRLPMT +P ++P+ N
Sbjct: 652 LLKASEMCKAILVNWL--PGQEGGPATADVLFGDYNPGGRLPMT-FPRHVGQLPLY-YNF 707
Query: 633 RADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQ 692
+ S R+Y +Y +Y FG+GLSYT++ Y L
Sbjct: 708 KT-SGRRYEYVDMEYYP---LYRFGYGLSYTSFEYSGL---------------------- 741
Query: 693 QTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIG 752
+V + V+ +V N G G V L+ S T +L
Sbjct: 742 ------------KVQEKPNGNVTVEATVKNVGGRAGDEVAQLYVTDMYASVKTRVMELKD 789
Query: 753 FDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG----------ELRHS 802
F R+H SK +SF + P + LS+ N H R++ G + VG E++HS
Sbjct: 790 FARIHLNPGESKTVSFELTPYD-LSLLNDHMDRVVEKGEFKICVGGMSPDYKANNEIKHS 848
Query: 803 L 803
+
Sbjct: 849 V 849
>gi|189463167|ref|ZP_03011952.1| hypothetical protein BACCOP_03878 [Bacteroides coprocola DSM 17136]
gi|189430146|gb|EDU99130.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
coprocola DSM 17136]
Length = 865
Score = 263 bits (673), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 165/461 (35%), Positives = 237/461 (51%), Gaps = 63/461 (13%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
+P+ NTSL+ RA L+ LTL+EK+ + + + AIPRLGI AY+WW+E+LHG+
Sbjct: 25 FPYQNTSLTPEQRASDLLERLTLEEKVSLMQNASPAIPRLGIKAYDWWNEALHGV----- 79
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY-------NLGQ-AGLT 163
G T FPQ + AASF+ L + +AV+ EARA Y NL + GLT
Sbjct: 80 -----GRAGIATVFPQTIGMAASFDDELIYKVFTAVSDEARAKYTEFSKSGNLKRYQGLT 134
Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQG-ENWKSDDGGIGFGFREKR 222
FW PNINIFRDPRWGRGQET GEDP + S V V+ QG +N K D
Sbjct: 135 FWTPNINIFRDPRWGRGQETYGEDPYLTSRMGVAVVRGLQGPDNMKYDK----------- 183
Query: 223 VLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRS 281
L AC KH + +W +R+SFNA I +D +T+ P F++
Sbjct: 184 ----------------LHACAKHYAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKA 224
Query: 282 CIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYT 340
+++ +MC+YN+ G P C L Q R+EW +KG I SDC A++ + ++
Sbjct: 225 LVQEADVKEVMCAYNRFEGEPCCGSNRLLMQILRDEWKYKGIIVSDCGAISDFWRKGDH- 283
Query: 341 KTHED---SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
+TH D ++AG + +G D+ CG + A+ KG + EK ID ++ L + + LG
Sbjct: 284 ETHPDKETASAGAVLSGTDLECGNN-YKSLPEAVQKGLIDEKQIDISVKRLLTARFELGE 342
Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
+ + + V + HK LAL+ AR+ IVLL+N LPL ++ +A+IGP
Sbjct: 343 MD---EHVCWDSIPYSVVDSKAHKDLALEIARKSIVLLQNRNNILPLKEDM--KIALIGP 397
Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGC 496
N+ G Y G P +L L+ + Y GC
Sbjct: 398 NANDSVMQWGNYNGFPSHTSTLYEALKERIPANQLIYDFGC 438
Score = 120 bits (301), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 99/325 (30%), Positives = 156/325 (48%), Gaps = 59/325 (18%)
Query: 486 YVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR---------- 535
+ SK A D+ N+ ++ K AD ++ G+ + E E+
Sbjct: 570 FYSKDRAAQLNFDMGRNAPVNLQASIDKVKAADVIVFAGGISPSLEGEEMPVNAEGFKGG 629
Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQI-SSILWIGYPGEA 594
DR ++ LP Q L++ + + K P+I V G + + E +S+I +IL YPG+A
Sbjct: 630 DRTTIELPAIQRRLISELKKLGK-PIIFVNYSGSAVGL---EPESKICDAILQAWYPGQA 685
Query: 595 GAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVY 654
G A+A+++FGD+NP G+LP+T+Y + D +M+ GR+YR+ T + +Y
Sbjct: 686 GGTAVADVLFGDYNPSGKLPVTFYKHTDQLPDFQDYSMK--------GRTYRYMTESPLY 737
Query: 655 GFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRF 714
FGHGLSYTN++Y P+ L+ QQT S+ EVT
Sbjct: 738 SFGHGLSYTNFTY----GPATLS--------------QQTISQG-----KEVT------- 767
Query: 715 HVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSK-EISFGVD-P 772
+ I V N G+ DG VV ++ +G P L F RVH +AKG + +SF +D
Sbjct: 768 -LTIPVQNTGNYDGEEVVQVYLSCSGDKEG-PSHTLRAFKRVH-IAKGQRANVSFTLDSE 824
Query: 773 CEQLSIANKHGRRILPLGNHVLMVG 797
Q N + R++ GN+ L+ G
Sbjct: 825 TFQWFDTNTNTMRMVE-GNYELLYG 848
>gi|336404627|ref|ZP_08585320.1| hypothetical protein HMPREF0127_02633 [Bacteroides sp. 1_1_30]
gi|335941531|gb|EGN03384.1| hypothetical protein HMPREF0127_02633 [Bacteroides sp. 1_1_30]
Length = 861
Score = 263 bits (673), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 169/519 (32%), Positives = 254/519 (48%), Gaps = 69/519 (13%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
P+ +TSL+ RA+ L+ LTL+EK+ + + + AIPRLGI YEWW+E+LHG+
Sbjct: 25 PYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGV------ 78
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL-GQAG-------LTF 164
G T FPQ + ASFN SL + +A + EAR + G++G LTF
Sbjct: 79 ----GRAGLATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQGLTF 134
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
W PN+NIFRDPRWGRGQET GEDP + + V+ QG +D G
Sbjct: 135 WTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQG----PEDAGYD--------- 181
Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
L AC KH + +W +R+SF+A I +D +T+ P F+ +
Sbjct: 182 -------------KLHACAKHFAVHSGPEW---NRHSFDAENIAPRDLWETYLPAFKDLV 225
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
++ +MC+YN+ G P C L Q R+EWG+KG + SDC A++ + + +T
Sbjct: 226 QKAHVKEVMCAYNRFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTH-ET 284
Query: 343 HED---SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
H D ++A ++ G D+ CG+ A+ G + EK+ID +L L + + LG +
Sbjct: 285 HPDKEHASAAAVRTGTDLECGS-EYASLADAVKAGLIDEKEIDISLKRLLTARFELGEMD 343
Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
P + ++ + + EH+ LAL AR+ +VLL+N LPLN N +A++GP
Sbjct: 344 EQP---AWAEIPTSVLNSKEHQALALRMARESLVLLQNKNNILPLNTNL--KIAVMGPNA 398
Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGCHDVPCNSDAGFHEAVRIAKKA 517
N+ G Y GIP +LL + A + + Y GC V + + I K
Sbjct: 399 NDSVMQWGNYNGIPAHTVTLLEAVRAKLPEGQIIYEPGCDRVDRKTLQSLFDECSINGKP 458
Query: 518 DFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVART 556
F L + +RDR ++ Q+S A T
Sbjct: 459 GF--------LAEYWNNRDREGEVVATDQISTPFHFATT 489
Score = 92.0 bits (227), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 71/269 (26%), Positives = 119/269 (44%), Gaps = 53/269 (19%)
Query: 510 AVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKR 559
AV+ AD ++ G+ + E E+ DR + LP Q L+ ++ + K+
Sbjct: 591 AVKKVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGKK 650
Query: 560 PVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYP 619
V + +G + + +IL YPG+AG A+ + ++G++NPGGRLP+T+Y
Sbjct: 651 VVFINYSGSA---IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707
Query: 620 ESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS 679
+ D +M+ GR+YR+ ++ FGHGLSYT+++Y E +S
Sbjct: 708 DVNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTDFTY------GEAKLS 753
Query: 680 ASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVP 739
+ A + +L I V+N G DG VV ++ R P
Sbjct: 754 KNTIAKGENVVL-------------------------TIPVSNVGQRDGEEVVQVYLRRP 788
Query: 740 KVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
+G P L F RVH A ++ ++
Sbjct: 789 GDKEG-PRYTLRAFKRVHIPAGKTESVAI 816
>gi|329962030|ref|ZP_08300041.1| glycosyl hydrolase family 3 protein [Bacteroides fluxus YIT 12057]
gi|328530678|gb|EGF57536.1| glycosyl hydrolase family 3 protein [Bacteroides fluxus YIT 12057]
Length = 941
Score = 263 bits (671), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 207/753 (27%), Positives = 343/753 (45%), Gaps = 125/753 (16%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ S T+FP L ++NR+L +G
Sbjct: 153 RLGIPV-DFTNEGIRGVES-----------YKATNFPTQLGLGHTWNRALIHKVGLITGR 200
Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
EAR + G T +AP +++ RD RWGR +E GE P +V+ +E V+ Q
Sbjct: 201 EARML------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGLQQH--- 251
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
++A KH AY K + +
Sbjct: 252 ------------------------------VAATGKHFAAYSNNKGAREGMARVDPQTSP 281
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF---QKARNEWGFKGYITS 325
+ E+ PFR I++ +M SYN +G+P ++G + + R+E GF+GY+ S
Sbjct: 282 HEVENIHIYPFRRVIKEAGLLGVMSSYNDYDGIP--IQGSYYWLTTRLRDEMGFRGYVVS 339
Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDI 381
D DAV ++ K +++ ++AG+++ C + + + +G + E+ +
Sbjct: 340 DSDAVEYLYTKHGTAKDMKEAVRQSVEAGLNVRCTFRSPDSFVLPLRELVKEGGLDEETV 399
Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF 441
+ + ++ V+ +GLF+ + G +V E++ +AL A+R+ +VLLKN+
Sbjct: 400 NDRVRDILRVKFLIGLFDAPYQTDLAG--ADKEVEKEENEAVALQASRESVVLLKNENST 457
Query: 442 LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCH- 497
LPLN N V +A+ GP + Y + ++L+G++ V ++ Y GC
Sbjct: 458 LPLNINTVKKIAVCGPNADEDGYALTHYGPLAVEVTTVLKGIQDKVNGKAEVLYTKGCDL 517
Query: 498 -----------DVPCNSD--AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPG 544
D P D A ++AV A++AD +VV G E++ R SL LPG
Sbjct: 518 VDANWPESEIIDYPLTPDEQAEINKAVENARRADVAVVVLGGGQRTCGENKSRSSLDLPG 577
Query: 545 QQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIF 604
+Q+ L+ +V T K PV+L+L G PL V++A D + +IL YPG G ALA+I+F
Sbjct: 578 RQLQLLQAVQATGK-PVVLILINGRPLSVNWA--DKYVPAILEAWYPGSKGGVALADILF 634
Query: 605 GDFNPGGRLPMTWYPESFTKVPMN---DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLS 661
GD+NPGG+L +T +P++ ++P N + D + +Y FG+GLS
Sbjct: 635 GDYNPGGKLTVT-FPKTVGQIPFNFPCKPASQIDGGKNAGPDGNMSRINGALYPFGYGLS 693
Query: 662 YTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVT 721
YT + Y S L I+ + ++K V++ VT
Sbjct: 694 YTTFEY------SNLEITPKVITPNEKAT-------------------------VRLKVT 722
Query: 722 NAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANK 781
N G G VV L+ R S T EK L GF+R+H +KE++F +D + L + +
Sbjct: 723 NTGKYAGDEVVQLYTRDVLSSVTTYEKNLAGFERIHLEPGETKEVTFILD-RKHLELLDA 781
Query: 782 HGRRILPLGNHVLMVG------ELRHSLTIETY 808
+R++ G+ +M G L LT+E Y
Sbjct: 782 DMKRVVEPGDFAIMAGASSEDIRLNGLLTVEDY 814
>gi|403744211|ref|ZP_10953568.1| glycoside hydrolase family 3 domain-containing protein
[Alicyclobacillus hesperidum URH17-3-68]
gi|403122228|gb|EJY56463.1| glycoside hydrolase family 3 domain-containing protein
[Alicyclobacillus hesperidum URH17-3-68]
Length = 789
Score = 263 bits (671), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 218/749 (29%), Positives = 340/749 (45%), Gaps = 134/749 (17%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RL IPA ES G + G T FPQ + A+++++ + IG +
Sbjct: 92 RLAIPAL-VHEESCSGYMAKG-----------ATCFPQSIGIASTWDQDIARKIGEVIRT 139
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
+ RA+ Q L AP +++ RDPRWGR +ET GEDP +V+ + +V QG++ +
Sbjct: 140 QMRAVG--AQQAL---APLLDVTRDPRWGRVEETFGEDPYLVAQMGIGYVGGLQGDDLR- 193
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG-NFSRYSFNAMITE 268
DG I G KH + Y + G N++ A I E
Sbjct: 194 -DGVIATG--------------------------KHFVGYGASEGGMNWA----PAHIPE 222
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDC 327
++ + + PF + + + K IM Y++++GVP DL + RN WGF+G + SD
Sbjct: 223 RELREVYLYPFEAVVREAKLQSIMPGYHELDGVPCHHNRDLLVETLRNRWGFEGIVVSDY 282
Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML--RHTQSAIDKGKVQEKDIDRAL 385
AV +FEY + ++A ++AG+D+ + + + A+++G+++ + +D +
Sbjct: 283 FAVNQLFEYHQVARDKVEAAVFAVEAGVDVELPSRDVYGQPLVEAVNQGRLRIEQVDALV 342
Query: 386 LNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
+ + + RLGLF R P+ E ++LA +AA + IVLLKN+ LPL
Sbjct: 343 RRVLTAKFRLGLFE---RPFVDEGRAPNLFDNHEQRQLAREAAAKSIVLLKNEGNLLPLE 399
Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV------------------ 487
+A+IGP ++I M G Y PC +SLL E V
Sbjct: 400 NRG--KIAVIGPNADSIRNMVGDYA-YPCHIESLLEQSEDNVFHSPMPKGMKSVDDFIEM 456
Query: 488 --------------SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG-----LDL 528
++ YA GC DV + +G EA +A++AD IVV G D
Sbjct: 457 KTIVQAIRDKVGDGAEVLYAKGC-DVLGDDTSGIAEAEHVARQADVAIVVVGDRAGLTDG 515
Query: 529 TQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWI 588
E RDR +L L G Q LV V T P ++VL GG PL +++ + +IL
Sbjct: 516 CTTGESRDRATLTLLGAQQELVERVVATGT-PTVVVLVGGRPLSITWIA--EHVPAILEA 572
Query: 589 GYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFY 648
PGE GA A+A+++FGD NP G+LP+T P S +VP+ + R + Y
Sbjct: 573 WLPGEEGAPAIADVVFGDMNPSGKLPIT-IPRSVGQVPIY-YGHKPSGGRSHWKGVYVDE 630
Query: 649 TGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTS 708
+ +Y FGHGLSYT ++Y+ EL +S S + G +H S
Sbjct: 631 SNKPLYAFGHGLSYTTFAYR------ELALSKS-----------EIG-----IHDTVDVS 668
Query: 709 CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
C + N GD G VV L+ P ++L GF RVH AK + ++F
Sbjct: 669 CV---------IENTGDRVGEEVVQLYVYDRAADVTRPVQELRGFARVHLEAKEAALVTF 719
Query: 769 GVDPCEQLSIANKHGRRILPLGNHVLMVG 797
+ QL N++ + ++ G + +G
Sbjct: 720 RLS-AHQLGFYNRNLQYVVEPGAIEVQIG 747
>gi|423290405|ref|ZP_17269254.1| hypothetical protein HMPREF1069_04297 [Bacteroides ovatus
CL02T12C04]
gi|392665792|gb|EIY59315.1| hypothetical protein HMPREF1069_04297 [Bacteroides ovatus
CL02T12C04]
Length = 861
Score = 263 bits (671), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 170/519 (32%), Positives = 256/519 (49%), Gaps = 69/519 (13%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
P+ +TSL+ RA+ L+ LTL+EK+ + + + AIPRLGI YEWW+E+LHG+
Sbjct: 25 PYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGV------ 78
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL-GQAG-------LTF 164
G T FPQ + ASFN SL + +A + EAR + G++G LTF
Sbjct: 79 ----GRAGLATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQGLTF 134
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
W PN+NIFRDPRWGRGQET GEDP + + V+ Q
Sbjct: 135 WTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQ--------------------- 173
Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
G E R D+ L AC KH + +W +R+SF+A I +D +T+ P F+ +
Sbjct: 174 ---GPEDARYDK--LHACAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLV 225
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
++ +MC+YN+ G P C L Q R+EWG+KG + SDC A++ + + +T
Sbjct: 226 QKAHVKEVMCAYNRFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTH-ET 284
Query: 343 HED---SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
H D ++A ++AG D+ CG+ A+ G + EK+ID +L L + + LG +
Sbjct: 285 HPDKEHASADAVRAGTDLECGS-EYASLADAVKAGLIDEKEIDISLKRLLTARFELGEMD 343
Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
P + ++ + + EH+ LAL AR+ +VLL+N LPLN + +A++GP
Sbjct: 344 EQP---AWAEIPTSVLNSKEHQALALRMARESLVLLQNKNNILPLNTHL--KVAVMGPNA 398
Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGCHDVPCNSDAGFHEAVRIAKKA 517
N+ G Y GIP +LL + A + + Y GC V + + I K
Sbjct: 399 NDSVMQWGNYNGIPAHTVTLLEAVRAKLPEGQIIYEPGCDRVDGKTLQSLFDECSINGKP 458
Query: 518 DFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVART 556
F L + +RDR ++ Q+S A T
Sbjct: 459 GF--------LAEYWNNRDREGEVVATDQISTPFHFATT 489
Score = 91.3 bits (225), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 71/269 (26%), Positives = 118/269 (43%), Gaps = 53/269 (19%)
Query: 510 AVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKR 559
AV+ AD ++ G+ + E E+ DR + LP Q L+ ++ + K+
Sbjct: 591 AVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGKK 650
Query: 560 PVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYP 619
V + +G + + +IL YPG+AG A+ + ++G++NPGGRLP+T+Y
Sbjct: 651 VVFINYSGSA---IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707
Query: 620 ESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS 679
+ D +M+ GR+YR+ ++ FGHGLSYT ++Y E +S
Sbjct: 708 DVNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTTFTY------GEAKLS 753
Query: 680 ASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVP 739
+ A + +L I V+N G DG VV ++ R P
Sbjct: 754 KNTIAKGENVVL-------------------------TIPVSNVGQRDGEEVVQVYLRRP 788
Query: 740 KVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
+G P L F RVH A ++ ++
Sbjct: 789 GDKEG-PRYTLRAFKRVHIPAGKTESVAI 816
>gi|262405256|ref|ZP_06081806.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|294644754|ref|ZP_06722499.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CC 2a]
gi|294810589|ref|ZP_06769241.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
xylanisolvens SD CC 1b]
gi|345508031|ref|ZP_08787672.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D1]
gi|229444722|gb|EEO50513.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D1]
gi|262356131|gb|EEZ05221.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|292639876|gb|EFF58149.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CC 2a]
gi|294442250|gb|EFG11065.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
xylanisolvens SD CC 1b]
Length = 861
Score = 263 bits (671), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 170/519 (32%), Positives = 256/519 (49%), Gaps = 69/519 (13%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
P+ +TSL+ RA+ L+ LTL+EK+ + + + AIPRLGI YEWW+E+LHG+
Sbjct: 25 PYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGV------ 78
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL-GQAG-------LTF 164
G T FPQ + ASFN SL + +A + EAR + G++G LTF
Sbjct: 79 ----GRAGLATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQGLTF 134
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
W PN+NIFRDPRWGRGQET GEDP + + V+ Q
Sbjct: 135 WTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQ--------------------- 173
Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
G E R D+ L AC KH + +W +R+SF+A I +D +T+ P F+ +
Sbjct: 174 ---GPEDARYDK--LHACAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLV 225
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
++ +MC+YN+ G P C L Q R+EWG+KG + SDC A++ + + T
Sbjct: 226 QKAHVKEVMCAYNRFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTH-GT 284
Query: 343 HED---SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
H D ++AG ++AG D+ CG+ A+ G + EK+ID +L L + + LG +
Sbjct: 285 HPDKEHASAGAVRAGTDLECGS-EYASLADAVKAGLIDEKEIDISLKRLLTARFELGEMD 343
Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
+ + ++ + + EH+ LAL AR+ +VLL+N LPLN + +A++GP
Sbjct: 344 ---EQSAWSEIPTSVLNSKEHQALALRMARESLVLLQNKNNILPLNTHL--KVAVMGPNA 398
Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGCHDVPCNSDAGFHEAVRIAKKA 517
N+ G Y GIP +LL + A + + Y GC V + + I K
Sbjct: 399 NDSVMQWGNYNGIPAHTVTLLEAVRAKLPEGQIIYEPGCDRVDGKTLQSLFDECSINGKP 458
Query: 518 DFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVART 556
F L + +RDR ++ Q+S A T
Sbjct: 459 GF--------LAEYWNNRDREGEVVATDQISTPFHFATT 489
Score = 91.7 bits (226), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 71/271 (26%), Positives = 119/271 (43%), Gaps = 53/271 (19%)
Query: 510 AVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKR 559
AV+ AD ++ G+ + E E+ DR + LP Q L+ ++ + K+
Sbjct: 591 AVKKVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGKK 650
Query: 560 PVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYP 619
V + +G + + +IL YPG+AG A+ + ++G++NPGGRLP+T+Y
Sbjct: 651 VVFINYSGSA---IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707
Query: 620 ESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS 679
+ D +M+ GR+YR+ ++ FGHGLSYT ++Y E +S
Sbjct: 708 DVNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTTFTY------GEAKLS 753
Query: 680 ASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVP 739
+ A + +L I V+N G DG VV ++ R P
Sbjct: 754 KNTIAKGENVVL-------------------------TIPVSNVGQRDGEEVVQVYLRRP 788
Query: 740 KVSQGTPEKQLIGFDRVHTVAKGSKEISFGV 770
+G P L F RVH A ++ ++ +
Sbjct: 789 GDKEG-PRYTLRAFKRVHIPAGKTESVAISL 818
>gi|424661946|ref|ZP_18098983.1| hypothetical protein HMPREF1205_02332 [Bacteroides fragilis HMW
616]
gi|404578257|gb|EKA82992.1| hypothetical protein HMPREF1205_02332 [Bacteroides fragilis HMW
616]
Length = 814
Score = 263 bits (671), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 228/744 (30%), Positives = 343/744 (46%), Gaps = 136/744 (18%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP + E HG + G T FP + A+++N L +G +A+
Sbjct: 149 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRQMGRVIAI 196
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
EA A Q + P +++ RDPRW R +ET GEDP + V+ FQG+ +
Sbjct: 197 EASA-----QGAHIGYGPVLDLARDPRWSRVEETYGEDPYLNGVMGAALVRGFQGDTLRG 251
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
K V+ A KH +Y W A + E+
Sbjct: 252 ----------RKSVI----------------ATLKHFASYG---WTEGGHNGGTAHLGER 282
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC-LRGDLFQKARNEWGFKGYITSDCD 328
+ E+ PPFR + G A +M SYN+++G P R L ++ W FKG++ SD
Sbjct: 283 ELEEAIFPPFREAVGAG-ALSVMSSYNEIDGNPCTGSRYLLTDILKDRWLFKGFVVSDLY 341
Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSAIDKGKVQEKDIDRALLN 387
A+ + E+ +E +A + AG+D + GT + +A+ KG V + +D+A+
Sbjct: 342 AIGGLREHGVAGSDYE-AAVKAVNAGVDSDLGTNVYAEQLVAAVRKGDVAMETVDKAVRR 400
Query: 388 LFSVQLRLGLFNG----DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLP 443
+ S++ +GLF+ D R + V + EH LA + ARQ IVLLKN+ K LP
Sbjct: 401 ILSLKFHMGLFDAPFVDDKRPAQL-------VASPEHIGLAREVARQSIVLLKNEDKLLP 453
Query: 444 LNKNAVSSLAIIGPLVNNISQMGGGYTGIPC--SPKSLLRGLEAYVSK---THYASGCHD 498
L K+ + +LA+IGP +N M G YT S ++L G+ VSK YA GC
Sbjct: 454 LKKD-IRTLAVIGPNADNGYNMLGDYTAPQADGSVVTVLEGIRQKVSKDTRVLYAKGCA- 511
Query: 499 VPCNSDAGFHEAVRIAKKADFVIVVAG----LDLTQETEDR------------------- 535
V +S GF +A+ A+ AD V++V G D + E E+
Sbjct: 512 VRDSSRTGFADAIEAARSADVVVMVVGGSSARDFSSEYEETGAAKVSANRVSDMESGEGY 571
Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS--FAEADSQISSILWIGYPGE 593
DR +L L G+Q+ L+ V + K P++LVL G PL + EAD +IL YPG
Sbjct: 572 DRATLHLMGRQLELLEEVRKLGK-PMVLVLIKGRPLLMEGVIQEAD----AILDAWYPGM 626
Query: 594 AGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQV 653
G A+A+++FGD+NP GRL ++ P S ++P+ R + +Y + GT
Sbjct: 627 QGGNAVADVLFGDYNPAGRLTLS-VPRSVGQLPVYYNTKRKGNRSRYIEEA-----GTPR 680
Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
Y FG+GLSYT +SY TG ++ +E C R
Sbjct: 681 YPFGYGLSYTMFSY--------------------------TGMKVRVS--EESNHC---R 709
Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPC 773
V ++V N G VDG VV L+ R S TP++QL F RV A ++EI+F +D
Sbjct: 710 VDVSVTVRNQGTVDGDEVVQLYLRDEVGSFTTPDRQLRAFSRVRLKAGETREITFTLDK- 768
Query: 774 EQLSIANKHGRRILPLGNHVLMVG 797
+ L++ + G + G +M G
Sbjct: 769 KSLALYMRDGEWAVEPGRFTVMAG 792
>gi|403253118|ref|ZP_10919422.1| xylosidase [Thermotoga sp. EMP]
gi|402811565|gb|EJX26050.1| xylosidase [Thermotoga sp. EMP]
Length = 778
Score = 262 bits (670), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 220/752 (29%), Positives = 343/752 (45%), Gaps = 143/752 (19%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIPA E L G G T+FPQ + A++++ L + +A+
Sbjct: 92 RLGIPAM-IHEECLTGYMGLGG-----------TNFPQAIAMASTWDPDLIEKMTTAIRE 139
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
+ R + GL AP +++ RDPRWGR +ET GE P +V+ V +VK QGE+ K
Sbjct: 140 DMRKIG--AHQGL---APVLDVARDPRWGRTEETFGESPYLVARMGVSYVKGLQGEDIK- 193
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG-NFSRYSFNAMITE 268
KG + A KH Y + G N++ + I E
Sbjct: 194 ---------------KG------------VVATVKHFAGYSASEGGKNWAPTN----IPE 222
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVP-ACLRGDLFQKARNEWGFKGYITSDC 327
++ ++ F PF + +++ +M SY++++GVP A R L R +WGF+G + SD
Sbjct: 223 REFKEVFLFPFEAAVKEANVLSVMNSYSEIDGVPCAANRKLLTDILRKDWGFEGIVVSDY 282
Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINC-GTCMLRHTQSAIDKGKVQEKDIDRALL 386
AV + +Y + ++A L+AG+D+ T ++ + ++KG + E ID A+
Sbjct: 283 FAVKVLEDYHRIARNKSEAARLALEAGIDVELPKTECYQYLKDLVEKGIISEALIDEAVA 342
Query: 387 NLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNK 446
+ ++ LGLF + + K+ H+ +AL+ AR+ I+LLKND LPL K
Sbjct: 343 RVLMLKFMLGLFENPYVEVEKAKI-------ENHRDIALEIARKSIILLKNDG-ILPLQK 394
Query: 447 NAVSSLAIIGPLVNNISQMGGGYT----------------GIPCSPKS----LLRGLEAY 486
N +A+IGP + + G Y G P P+ L + +E +
Sbjct: 395 N--KKVALIGPNAGEVRNLLGDYMYLAHIRALLDNIDDVFGNPQIPRENYERLKKSIEEH 452
Query: 487 VSKT--------------HYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG------L 526
+ YA GC +V +GF EA+ IAKK+D IVV G L
Sbjct: 453 MKSIPSVLDAFKEEGIEFEYAKGC-EVTGEDRSGFEEAIEIAKKSDVAIVVVGDKSGLTL 511
Query: 527 DLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSIL 586
D T E RD +L LPG Q LV VA+T K PV+LVL G P S ++++IL
Sbjct: 512 DCTT-GESRDMANLKLPGVQEELVLEVAKTGK-PVVLVLITGRPY--SLKNVVDKVNAIL 567
Query: 587 WIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYR 646
+ PGEAG +A+ +II+G NP G+LP++ +P S ++P+ ++ R + Y
Sbjct: 568 QVWLPGEAGGRAIVDIIYGKVNPSGKLPIS-FPRSAGQIPVFHY-VKPSGGRSHWHGDYV 625
Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLS-APSELTISASLKAGSDKNILQQTGSRLDYVHIDE 705
+ ++ FGHGLSYT + Y L P E+ + +
Sbjct: 626 DESTKPLFPFGHGLSYTKFEYSNLRIEPKEVPPAGEVV---------------------- 663
Query: 706 VTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKE 765
+++ V N GD DG VV L+ S P K+L GF RV AK K
Sbjct: 664 ----------IKVDVENIGDRDGDEVVQLYIGREFASVTRPVKELKGFKRVSLKAKEKKT 713
Query: 766 ISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
+ F + + L+ ++ + ++ G +MVG
Sbjct: 714 VVFRLH-MDVLAYYDRDMKLVVEPGEFKVMVG 744
>gi|336415363|ref|ZP_08595703.1| hypothetical protein HMPREF1017_02811 [Bacteroides ovatus
3_8_47FAA]
gi|335940959|gb|EGN02821.1| hypothetical protein HMPREF1017_02811 [Bacteroides ovatus
3_8_47FAA]
Length = 861
Score = 262 bits (670), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 170/519 (32%), Positives = 256/519 (49%), Gaps = 69/519 (13%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
P+ +TSL+ RA+ L+ LTL+EK+ + + + AIPRLGI YEWW+E+LHG+
Sbjct: 25 PYQDTSLAAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGV------ 78
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL-GQAG-------LTF 164
G T FPQ + ASFN SL + +A + EAR + G++G LTF
Sbjct: 79 ----GRAGLATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQGLTF 134
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
W PN+NIFRDPRWGRGQET GEDP + + V+ Q
Sbjct: 135 WTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQ--------------------- 173
Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
G E R D+ L AC KH + +W +R+SF+A I +D +T+ P F+ +
Sbjct: 174 ---GPEDARYDK--LHACAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLV 225
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
++ +MC+YN+ G P C L Q R+EWG+KG + SDC A++ + + T
Sbjct: 226 QKAHVKEVMCAYNRFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTH-GT 284
Query: 343 HED---SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
H D ++AG ++AG D+ CG+ A+ G + EK+ID +L L + + LG +
Sbjct: 285 HPDKEHASAGAVRAGTDLECGS-EYASLADAVKAGLIDEKEIDISLKRLLTARFELGEMD 343
Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
+ + ++ + + EH+ LAL AR+ +VLL+N LPLN + +A++GP
Sbjct: 344 ---EQSAWSEIPTSVLNSKEHQALALRMARESLVLLQNKNNILPLNTHL--KVAVMGPNA 398
Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGCHDVPCNSDAGFHEAVRIAKKA 517
N+ G Y GIP +LL + A + + Y GC V + + I K
Sbjct: 399 NDSVMQWGNYNGIPAHTVTLLEAVRAKLPEGQIIYEPGCDRVDGKTLQSLFDECSINGKP 458
Query: 518 DFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVART 556
F L + +RDR ++ Q+S A T
Sbjct: 459 GF--------LAEYWNNRDREGEVVATDQISTPFHFATT 489
Score = 99.4 bits (246), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 70/270 (25%), Positives = 119/270 (44%), Gaps = 53/270 (19%)
Query: 511 VRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRP 560
V+ AD ++ G+ + E E+ DR + LP Q L+ ++ + K+
Sbjct: 592 VKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKVGKKV 651
Query: 561 VILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPE 620
V + +G + + +IL YPG+AG A+ + ++G++NPGGRLP+T+Y +
Sbjct: 652 VFINYSGSA---IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYKD 708
Query: 621 SFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISA 680
D +M+ GR+YR+ ++ FGHGLSYT+++Y E +S
Sbjct: 709 VNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTDFTY------GEAKLSK 754
Query: 681 SLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPK 740
+ A + +L I V+N G DG VV ++ R P
Sbjct: 755 NTIAKGENVVL-------------------------TIPVSNVGQRDGEEVVQVYLRRPG 789
Query: 741 VSQGTPEKQLIGFDRVHTVAKGSKEISFGV 770
+G P L F RVH A ++ ++ +
Sbjct: 790 DKEG-PRYTLRAFKRVHIPAGKTESVAISL 818
>gi|224537384|ref|ZP_03677923.1| hypothetical protein BACCELL_02262 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521009|gb|EEF90114.1| hypothetical protein BACCELL_02262 [Bacteroides cellulosilyticus
DSM 14838]
Length = 863
Score = 262 bits (670), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 160/459 (34%), Positives = 235/459 (51%), Gaps = 59/459 (12%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
P+ N +LS RA L+ LTL+EK + + + AIPRLGI AY+WW+E+LHG+
Sbjct: 25 PYKNPALSPEERANDLVGRLTLEEKAALMQNTSPAIPRLGIKAYDWWNEALHGV------ 78
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN--------LGQAGLTF 164
G T FPQ + ASFN L ++ +AV+ EARA GLT
Sbjct: 79 ----GRAGLATVFPQAIGMGASFNNELLYDVFTAVSDEARAKNTEFSKEGGLKRYQGLTM 134
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
W PNINIFRDPRWGRGQET GEDP + + V+ QG
Sbjct: 135 WTPNINIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPE------------------ 176
Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
GE+ D+ L AC KH + +W +R+SFNA I +D +T+ P F++ +
Sbjct: 177 ---GEKYDK-----LHACAKHYAVHSGPEW---NRHSFNAENIDPRDLWETYLPAFKNLV 225
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFE---YQNY 339
++ +MC+YN+ G P C L Q R+EWG+K + SDC A++ + ++
Sbjct: 226 QKAHVKEVMCAYNRFEGEPCCGSNRLLMQILRDEWGYKEIVVSDCWAISDFYNKGAHETD 285
Query: 340 TKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
SA VL +G D+ CG + A+ +G + EK ID +L L + LG +
Sbjct: 286 PDKQHASAKAVL-SGTDVECGDSYASLPE-AVKEGLIDEKQIDISLKRLMKARFELGEMD 343
Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
+P + + ++ V + EH++LAL AR+ +VLL+N++ LPLNKN +A++GP
Sbjct: 344 -EPSQVSWAQIPYSVVDSKEHRELALRMARESLVLLQNNQSLLPLNKNL--KVAVVGPNA 400
Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGC 496
N+ G Y G P +LL G+ Y+ ++ Y GC
Sbjct: 401 NDSVMQWGNYNGFPSHTITLLEGIREYLPESQIIYEPGC 439
Score = 108 bits (269), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 84/294 (28%), Positives = 126/294 (42%), Gaps = 60/294 (20%)
Query: 486 YVSKTHYASGCHDVPCNSDAG------FHEAVRIAKKADFVIVVAGLDLTQETEDR---- 535
Y K +A D + D G + V K+AD +I G+ E E+
Sbjct: 563 YRIKIDFAFRNRDAALDFDMGREVPVDLKQTVDKVKEADVIIFAGGISPAVEGEEMHVNI 622
Query: 536 ------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIG 589
DR ++ LP Q L+ + + K+ V + +G ++ +IL
Sbjct: 623 PGFKGGDRETIELPSIQSRLLAELKKAGKKIVFVNFSGSA---IALTPESKTCDAILQAW 679
Query: 590 YPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYT 649
YPG+AG A+A ++FGD+NP GRLP+T+Y + D +M+ GR+YR+ T
Sbjct: 680 YPGQAGGTAIANVLFGDYNPAGRLPVTFYKSTKQLPDFEDYSMK--------GRTYRYMT 731
Query: 650 GTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSC 709
++ FGHGLSYT + Y +ASL K+ Q T
Sbjct: 732 ENPLFPFGHGLSYTTFQYG----------NASLNTSEIKDGEQVT--------------- 766
Query: 710 TSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGS 763
+ I V+N G DG VV ++ R P +G P L F RV +AKG+
Sbjct: 767 ------LTIPVSNTGKYDGEEVVQVYLRHPGDKEG-PSHALRAFKRV-AIAKGA 812
>gi|160886913|ref|ZP_02067916.1| hypothetical protein BACOVA_04927 [Bacteroides ovatus ATCC 8483]
gi|423288977|ref|ZP_17267828.1| hypothetical protein HMPREF1069_02871 [Bacteroides ovatus
CL02T12C04]
gi|423294866|ref|ZP_17272993.1| hypothetical protein HMPREF1070_01658 [Bacteroides ovatus
CL03T12C18]
gi|156107324|gb|EDO09069.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus ATCC 8483]
gi|392668741|gb|EIY62235.1| hypothetical protein HMPREF1069_02871 [Bacteroides ovatus
CL02T12C04]
gi|392676057|gb|EIY69498.1| hypothetical protein HMPREF1070_01658 [Bacteroides ovatus
CL03T12C18]
Length = 863
Score = 262 bits (669), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 177/551 (32%), Positives = 263/551 (47%), Gaps = 79/551 (14%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
YP+ +T L+ RA L+ LTL+EK+ + +N+ AIPRLGI YEWW+E+LHG+A G
Sbjct: 26 YPYQDTKLTAEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARAG- 84
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARA----MYNLGQ----AGLT 163
T FPQ + AASFN L + AV+ EARA GQ GLT
Sbjct: 85 ---------LATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNEKGQYKRYQGLT 135
Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
W PN+NIFRDPRWGRGQET GEDP + + V+ QG +D
Sbjct: 136 MWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQG----PEDA----------- 180
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSC 282
E D+ L AC KH + +W +R+SFNA I +D +T+ P F+
Sbjct: 181 ------EYDK-----LHACAKHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKEL 226
Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTK 341
+++ +MC+YN+ G P C L Q RN+WGFKG + +DC A+ F+ + + +
Sbjct: 227 VQKAGVKEVMCAYNRFEGDPCCGSNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKKH-E 285
Query: 342 THEDSA---AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
TH D+A A + +G D+ CG + A+ KG + E+ I+ ++ L + LG
Sbjct: 286 THPDAAHASADAVLSGTDLECGG-NFKSITDAVKKGLISEEKINTSVKRLLKARFELGEM 344
Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
N + + + +HK+LAL A + +VLL+N+ LPLN+ +A+IGP
Sbjct: 345 NS---THPWSNIPFSVIDCPKHKELALKMAHESLVLLQNNNNILPLNRQM--KVAVIGPN 399
Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEA--------------YVSKTHYASGCHDVPCNSD 504
N+ G Y G P +LL G+ A Y + T S + + +
Sbjct: 400 ANDSVMQWGNYNGFPSHTVTLLEGIRAKLPDAQIIYEPVCGYTNDTTLHSLFNQCSIDGE 459
Query: 505 AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRP---- 560
AGF+ ++ I T + ++ PG + T++ R++ RP
Sbjct: 460 AGFNATYWNNREYKGKIAATDRLTTPFHFSAEGSTVFAPGVGLKNFTAIYRSTFRPTDSG 519
Query: 561 --VILVLTGGG 569
V+T GG
Sbjct: 520 AATFRVMTNGG 530
Score = 109 bits (273), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 92/315 (29%), Positives = 141/315 (44%), Gaps = 63/315 (20%)
Query: 508 HEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTS 557
E + + AD VI G+ E E DR + LP Q ++ + +
Sbjct: 591 REILNKLQSADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNG 650
Query: 558 KRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTW 617
K+ V + +G ++ +IL YPG+AG A+A+++FGD+NP GRLP+T+
Sbjct: 651 KKTVFVNFSGSA---MAIVPETQNCDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITF 707
Query: 618 YPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
Y +S ++P D +M+ GR+YRF T T +Y FG+GLSYT +SY +
Sbjct: 708 Y-KSMQQLPDYEDYSMK--------GRTYRFMTETPLYPFGYGLSYTRFSY------GKA 752
Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
T++ S +K IL I V+N G DG VV ++
Sbjct: 753 TLNQSKLTKGEKAIL-------------------------TIPVSNVGQRDGEEVVQVYI 787
Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSK---EISFGVDPCEQLSIANKHGRRILPL-GNH 792
P +G P+K L GF RV ++AKG +I D E A I PL G +
Sbjct: 788 CRPDDKEG-PQKTLRGFQRV-SIAKGKTQNVQIELPYDSFEWFDAATN---TIRPLNGTY 842
Query: 793 VLMVGELRHSLTIET 807
++ G + ++T
Sbjct: 843 KILYGNSSNEKDLQT 857
>gi|423226625|ref|ZP_17213090.1| hypothetical protein HMPREF1062_05276 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392628884|gb|EIY22909.1| hypothetical protein HMPREF1062_05276 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 863
Score = 262 bits (669), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 160/459 (34%), Positives = 234/459 (50%), Gaps = 59/459 (12%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
P+ N +LS RA L+ LTL+EK + + + AIPRLGI AY+WW+E+LHG+
Sbjct: 25 PYKNPALSPEERANDLVGRLTLEEKAALMQNTSPAIPRLGIKAYDWWNEALHGV------ 78
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN--------LGQAGLTF 164
G T FPQ + ASFN L ++ +AV+ EARA GLT
Sbjct: 79 ----GRAGLATVFPQAIGMGASFNNELLYDVFTAVSDEARAKNTEFSKEGGLKRYQGLTM 134
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
W PNINIFRDPRWGRGQET GEDP + + V+ QG
Sbjct: 135 WTPNINIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPE------------------ 176
Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
GE+ D+ L AC KH + +W +R+SFNA I +D +T+ P F+ +
Sbjct: 177 ---GEKYDK-----LHACAKHYAVHSGPEW---NRHSFNAENIDPRDLWETYLPAFKDLV 225
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFE---YQNY 339
++ +MC+YN+ G P C L Q R+EWG+K + SDC A++ + ++
Sbjct: 226 QKAHVKEVMCAYNRFEGEPCCGSNRLLMQILRDEWGYKEIVVSDCWAISDFYNKDAHETD 285
Query: 340 TKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
SA VL +G D+ CG + A+ +G + EK ID +L L + LG +
Sbjct: 286 PDKQHASAKAVL-SGTDVECGDSYASLPE-AVKEGLIDEKQIDISLKRLMKARFELGEMD 343
Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
+P + + ++ V + EH++LAL AR+ +VLL+N++ LPLNKN +A++GP
Sbjct: 344 -EPSQVSWAQIPYSVVDSKEHRELALRMARESLVLLQNNQSLLPLNKNL--KVAVVGPNA 400
Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGC 496
N+ G Y G P +LL G+ Y+ ++ Y GC
Sbjct: 401 NDSVMQWGNYNGFPSHTITLLEGIREYLPESQIIYEPGC 439
Score = 108 bits (270), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 84/294 (28%), Positives = 126/294 (42%), Gaps = 60/294 (20%)
Query: 486 YVSKTHYASGCHDVPCNSDAG------FHEAVRIAKKADFVIVVAGLDLTQETEDR---- 535
Y K +A D + D G + V K+AD +I G+ E E+
Sbjct: 563 YRIKIDFAFRNRDAALDFDMGREVPVDLKQTVDKVKEADVIIFAGGISPAVEGEEMHVNI 622
Query: 536 ------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIG 589
DR ++ LP Q L+ + + K+ V + +G ++ +IL
Sbjct: 623 PGFKGGDRETIELPSIQSRLLAELKKAGKKIVFVNFSGSA---IALTPESKTCDAILQAW 679
Query: 590 YPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYT 649
YPG+AG A+A ++FGD+NP GRLP+T+Y + D +M+ GR+YR+ T
Sbjct: 680 YPGQAGGTAIANVLFGDYNPAGRLPVTFYKSTKQLPDFEDYSMK--------GRTYRYMT 731
Query: 650 GTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSC 709
++ FGHGLSYT + Y +ASL K+ Q T
Sbjct: 732 ENPLFPFGHGLSYTTFQYG----------NASLNTSEIKDGEQVT--------------- 766
Query: 710 TSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGS 763
+ I V+N G DG VV ++ R P +G P L F RV +AKG+
Sbjct: 767 ------LTIPVSNTGKYDGEEVVQVYLRHPGDKEG-PSHALRAFKRV-AIAKGA 812
>gi|218262493|ref|ZP_03476939.1| hypothetical protein PRABACTJOHN_02617 [Parabacteroides johnsonii
DSM 18315]
gi|218223341|gb|EEC95991.1| hypothetical protein PRABACTJOHN_02617 [Parabacteroides johnsonii
DSM 18315]
Length = 868
Score = 262 bits (669), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 156/473 (32%), Positives = 227/473 (47%), Gaps = 61/473 (12%)
Query: 50 DSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASN 109
+ YPF N L I R L+ LT +EK+ Q+ + AI RLGIP Y+WW+E+LHG+A
Sbjct: 24 EDYPFRNPDLPIDERIDDLLKRLTAEEKVGQMMNTTPAIERLGIPQYDWWNEALHGVARA 83
Query: 110 GPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------G 161
G T FPQ + AA+F+ + V+ EARA Y+ Q G
Sbjct: 84 G----------KATVFPQAIAMAATFDDDALYETFTMVSDEARAKYHQYQKDKEYDRYKG 133
Query: 162 LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREK 221
LTFW PNINIFRDPRWGRG ET GEDP + V VK QG++ K
Sbjct: 134 LTFWTPNINIFRDPRWGRGMETYGEDPYLTERMGVAVVKGLQGDDPKY------------ 181
Query: 222 RVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRS 281
AC KH + +W +R+ F+ +T +D T+ P F +
Sbjct: 182 ---------------FKTHACAKHYAVHSGPEW---NRHEFDVTVTPRDLWQTYLPAFEA 223
Query: 282 CIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYT 340
+++G +MC+YN+ G P C L RN WG++ I SDC A+ +E T
Sbjct: 224 LVKEGNVQEVMCAYNRYQGKPCCSSDKLLIDILRNSWGYENIILSDCGAINDFWERDERT 283
Query: 341 KTH------EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
H E ++A + G D+ CG R A+ GK+ E D+D +L L +
Sbjct: 284 PRHETHPDAESASADAVLNGTDLECGNS-YRALVKALKDGKISENDLDVSLRRLLKGRFE 342
Query: 395 LGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
LG+F+ D + Y ++ + V + EH AL+ A + +VLLKN LPL+K + +A+
Sbjct: 343 LGMFDPD-EQVPYAQIPYNVVESPEHVAQALEMAHKSMVLLKNKNNTLPLSK-TIRKIAV 400
Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGF 507
+GP + + + Y G P ++L G+ V T +++ CN A F
Sbjct: 401 VGPNAADSTMLWANYNGFPTHTVTILEGIRNKVPDTEV---IYELGCNHAADF 450
Score = 118 bits (295), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 85/269 (31%), Positives = 134/269 (49%), Gaps = 55/269 (20%)
Query: 515 KKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILV 564
K AD ++ V G+ E E+ DR ++ LP Q +V ++ T K PV+ V
Sbjct: 603 KDADVIVYVGGISPRLEGEEMPVNVEGFKKGDRTNIELPKVQQEMVKALKATGK-PVVYV 661
Query: 565 LTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK 624
L G L +++ EA+ I +IL Y G+ A+A+I+FGD+NP GRLP+T+Y +S +
Sbjct: 662 LCTGSALALNWEEAN--IDAILNAWYGGQEAGTAVADILFGDYNPSGRLPVTFY-KSIDQ 718
Query: 625 VP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLK 683
+P D +M+ GR+YR+ T T +Y FG+GLSYTN++Y+ +A L
Sbjct: 719 LPDFEDYSMK--------GRTYRYMTETPLYPFGYGLSYTNFAYR----------NAKLS 760
Query: 684 AGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ 743
+G + D+ + T + N G +DG V ++ + P +
Sbjct: 761 SGK--------------IAKDQSVTLT-------FDIANTGKMDGDEVAQIYIKNPNDPE 799
Query: 744 GTPEKQLIGFDRVHTVAKGSKEISFGVDP 772
G P K L F RVH A S+E++ + P
Sbjct: 800 G-PIKALKAFLRVHVKAGDSQEVNIELAP 827
>gi|219118959|ref|XP_002180246.1| beta-xylosidase [Phaeodactylum tricornutum CCAP 1055/1]
gi|217408503|gb|EEC48437.1| beta-xylosidase [Phaeodactylum tricornutum CCAP 1055/1]
Length = 682
Score = 262 bits (669), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 206/647 (31%), Positives = 301/647 (46%), Gaps = 86/647 (13%)
Query: 40 PDF-PCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASA---------IP 89
P+F C+ + P+C+ SLSI R + L+S LTL EK+ + + + +
Sbjct: 57 PNFWGCQNDVAKALPYCDMSLSIDERLEDLLSHLTLDEKVDMIGADPTQDVCMTHTMNVS 116
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
R+G+P Y W E+ + S N T F L AASFNRS W GS
Sbjct: 117 RIGLPDYYWLVETNTAVGSACIAEN-----KCATEFSGPLSIAASFNRSSWFLKGSVFGT 171
Query: 150 EARAMYNLG----------QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFV 199
E RA+ N+ GLT + PNIN RDPR+GR E PGEDP + YA V
Sbjct: 172 EQRALMNVHGERFHTHSGRHIGLTAFGPNINQQRDPRFGRSSELPGEDPFLSGQYAAHMV 231
Query: 200 KSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSR 259
+ Q E D + A KH AY E+
Sbjct: 232 QGMQ--------------------------ERDANGYPKVLAYLKHFTAYSREEGRGNDD 265
Query: 260 YSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWG 318
Y+ I+ D DT+ P + + QG A+ +MCSYN VNG+PAC L K R W
Sbjct: 266 YN----ISMYDLFDTYLPQYEMGMVQGGATGVMCSYNAVNGIPACANDYLLNKILRQRWN 321
Query: 319 -FKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHT-QSAIDKGKV 376
++T+DC AV + +AA L G DI G+ + H +AI G
Sbjct: 322 RSDAHVTTDCGAVNNLRGKPIQAADEAQAAAMALMNGADIEMGSTLFVHNLTTAITLGYA 381
Query: 377 QEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLK 436
E+ +++A+ + G F+ DP ++ LG DD+ + +H+++ L+AA QG+VLLK
Sbjct: 382 TEEAVNQAIRRSYRPHFIAGRFD-DPTLSEWFSLGLDDIQSKKHQEIQLEAALQGLVLLK 440
Query: 437 NDKKFLPLNKNAVSSLAIIGPL-----------VNNISQMGGGYTGIPCSPKSL--LRGL 483
++ LP+ A + LA++GPL ++ S GGG+ IP +S+ + G
Sbjct: 441 HEDSILPI--AAGTKLAVLGPLGMTRSGLMSDYESDQSCFGGGHDCIPTLAESIGFINGK 498
Query: 484 EAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLP 543
E V+ A+G DV + + +++A D +++ G TQE E DR LP
Sbjct: 499 EFTVA----AAGV-DVDSRNTSDVERILQLAADRDLIVLCLGNTKTQEQEGFDRKDTALP 553
Query: 544 GQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEII 603
GQQ +L +V T ++PV+LVL GG ++ S+I+ P G ALA +
Sbjct: 554 GQQYALFEAVL-TLRKPVVLVLVNGG--QIALDGMTGYPSAIIEAFNPNGIGGTALAASL 610
Query: 604 FGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTG 650
FG N G+LP T YP S M +M+ S PGR+YR++TG
Sbjct: 611 FGQENRWGKLPYTIYPYSV----MQSFDMKDHSMSAPPGRTYRYFTG 653
>gi|154493680|ref|ZP_02033000.1| hypothetical protein PARMER_03021 [Parabacteroides merdae ATCC
43184]
gi|423723902|ref|ZP_17698051.1| hypothetical protein HMPREF1078_02038 [Parabacteroides merdae
CL09T00C40]
gi|154086890|gb|EDN85935.1| glycosyl hydrolase family 3 C-terminal domain protein
[Parabacteroides merdae ATCC 43184]
gi|409240709|gb|EKN33484.1| hypothetical protein HMPREF1078_02038 [Parabacteroides merdae
CL09T00C40]
Length = 868
Score = 262 bits (669), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 156/473 (32%), Positives = 228/473 (48%), Gaps = 61/473 (12%)
Query: 50 DSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASN 109
+ YPF N L I R L+ LT +EKI Q+ + AI RLGIP Y+WW+E+LHG+A
Sbjct: 24 EDYPFRNPDLPIDERIDDLLKRLTAEEKIGQMMNTTPAIERLGIPEYDWWNEALHGVARA 83
Query: 110 GPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------G 161
G T FPQ + AA+F+ + V+ EARA Y+ Q G
Sbjct: 84 G----------KATVFPQAIAMAATFDDDALYETFTMVSDEARAKYHQYQKNKEYDRYKG 133
Query: 162 LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREK 221
LTFW PNINIFRDPRWGRG ET GEDP + V VK QG++ K
Sbjct: 134 LTFWTPNINIFRDPRWGRGMETYGEDPYLTERMGVAVVKGLQGDDPKY------------ 181
Query: 222 RVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRS 281
AC KH + +W +R+ F+ +T +D T+ P F +
Sbjct: 182 ---------------FKTHACAKHYAVHSGPEW---NRHEFDVTVTPRDLWQTYLPAFEA 223
Query: 282 CIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYT 340
+++G +MC+YN+ G P C L RN WG++ I SDC A+ ++ T
Sbjct: 224 LVKKGNVQEVMCAYNRYQGKPCCSSDKLLIDILRNSWGYENIILSDCGAINDFWQRDERT 283
Query: 341 KTH------EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
H E ++A + G D+ CG + A+ +GK+ E D+D +L L +
Sbjct: 284 PRHETHPDAESASADAVLNGTDLECGNS-YKALIKALKEGKISENDLDVSLRRLLKGRFE 342
Query: 395 LGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
LG+F+ D R Y ++ + V + EH AL+ A + +VLLKN LPL+K + +A+
Sbjct: 343 LGMFDPDERV-PYAQIPYNVVESPEHVAQALEMAHKSMVLLKNKNNTLPLSK-TIRKIAV 400
Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGF 507
+GP + + + Y G P ++L G+ V T +++ CN A F
Sbjct: 401 VGPNAADSTMLWANYNGFPTHTVTILEGIRNKVPDTEV---IYELGCNHAADF 450
Score = 116 bits (291), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 83/269 (30%), Positives = 133/269 (49%), Gaps = 55/269 (20%)
Query: 515 KKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILV 564
K AD ++ V G+ E E+ DR ++ +P Q +V ++ T K PV+ V
Sbjct: 603 KDADVIVYVGGISPRLEGEEMPVNVEGFKKGDRTNIEIPKVQQEMVKALKATGK-PVVYV 661
Query: 565 LTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK 624
L G L +++ D+ I +IL Y G+ A+A+I+FGD+NP GRLP+T+Y +S +
Sbjct: 662 LCTGSALALNWE--DANIDAILNAWYGGQEAGTAVADILFGDYNPSGRLPVTFY-KSIDQ 718
Query: 625 VP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLK 683
+P D +M+ GR+YR+ T T +Y FG+GLSYTN++Y+ +A L
Sbjct: 719 LPDFEDYSMK--------GRTYRYMTETPLYPFGYGLSYTNFAYR----------NAKLS 760
Query: 684 AGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ 743
+G + D+ + T + N G +DG V ++ + P +
Sbjct: 761 SGK--------------ITKDQSVTLT-------FDIANTGKMDGDEVAQIYIKNPNDPE 799
Query: 744 GTPEKQLIGFDRVHTVAKGSKEISFGVDP 772
G P K L F RVH A S+E++ + P
Sbjct: 800 G-PIKALKAFLRVHVKAGDSQEVNIELTP 827
>gi|295086418|emb|CBK67941.1| Beta-glucosidase-related glycosidases [Bacteroides xylanisolvens
XB1A]
Length = 861
Score = 261 bits (668), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 162/483 (33%), Positives = 245/483 (50%), Gaps = 61/483 (12%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
P+ +TSL+ RA+ L+ LTL+EK+ + + + AIPRLGI YEWW+E+LHG+
Sbjct: 25 PYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGV------ 78
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL-GQAG-------LTF 164
G T FPQ + ASFN SL + +A + EAR + G++G LTF
Sbjct: 79 ----GRAGLATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQGLTF 134
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
W PN+NIFRDPRWGRGQET GEDP + + V+ Q
Sbjct: 135 WTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQ--------------------- 173
Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
G E R D+ L AC KH + +W +R+SF+A I +D +T+ P F+ +
Sbjct: 174 ---GPEDARYDK--LHACAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLV 225
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
++ +MC+YN+ G P C L Q R+EWG+KG + SDC A++ + + +T
Sbjct: 226 QKAHVKEVMCAYNRFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTH-ET 284
Query: 343 HED---SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
H D ++AG ++AG D+ CG+ A+ G + EK+ID +L L + + LG +
Sbjct: 285 HPDKEHASAGAVRAGTDLECGS-EYASLADAVKAGLIDEKEIDISLKRLLTARFELGEMD 343
Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
+ + ++ + + EH+ LAL AR+ +VLL+N LPLN + +A++GP
Sbjct: 344 ---EQSAWSEIPTSVLNSKEHQALALRMARESLVLLQNKNNILPLNTHL--KVAVMGPNA 398
Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGCHDVPCNSDAGFHEAVRIAKKA 517
N+ G Y GIP +LL + A + + Y GC V + + I K
Sbjct: 399 NDSVMQWGNYNGIPAHTVTLLEAVRAKLPEGQIIYEPGCDRVDGKTLQSLFDECSINGKP 458
Query: 518 DFV 520
F+
Sbjct: 459 GFL 461
Score = 92.0 bits (227), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 71/269 (26%), Positives = 119/269 (44%), Gaps = 53/269 (19%)
Query: 510 AVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKR 559
AV+ AD ++ G+ + E E+ DR + LP Q L+ ++ + K+
Sbjct: 591 AVKKVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGKK 650
Query: 560 PVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYP 619
V + +G + + +IL YPG+AG A+ + ++G++NPGGRLP+T+Y
Sbjct: 651 VVFINYSGSA---IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707
Query: 620 ESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS 679
+ D +M+ GR+YR+ ++ FGHGLSYT+++Y E +S
Sbjct: 708 DVNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTDFTY------GEAKLS 753
Query: 680 ASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVP 739
+ A + +L I V+N G DG VV ++ R P
Sbjct: 754 KNTIAKGENVVL-------------------------TIPVSNVGQRDGEEVVQVYLRRP 788
Query: 740 KVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
+G P L F RVH A ++ ++
Sbjct: 789 GDKEG-PRYTLRAFKRVHIPAGKTESVAI 816
>gi|299147288|ref|ZP_07040353.1| beta-glucosidase [Bacteroides sp. 3_1_23]
gi|298514566|gb|EFI38450.1| beta-glucosidase [Bacteroides sp. 3_1_23]
Length = 861
Score = 261 bits (668), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 170/519 (32%), Positives = 255/519 (49%), Gaps = 69/519 (13%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
P+ +TSL+ RA+ L+ LTL+EK+ + + + AIPRLGI YEWW+E+LHG+
Sbjct: 25 PYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGV------ 78
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL-GQAG-------LTF 164
G T FPQ + ASFN SL + +A + EAR + G++G LTF
Sbjct: 79 ----GRAGLATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQGLTF 134
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
W PN+NIFRDPRWGRGQET GEDP + + V+ Q
Sbjct: 135 WTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQ--------------------- 173
Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
G E R D+ L AC KH + +W +R+SF+A I +D +T+ P F+ +
Sbjct: 174 ---GPEDARYDK--LHACAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLV 225
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
++ +MC+YN+ G P C L Q R+EWG+KG + SDC A++ + + T
Sbjct: 226 QKAHVKEVMCAYNRFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTH-GT 284
Query: 343 HED---SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
H D ++A ++AG D+ CG+ A+ G + EK+ID +L L + + LG +
Sbjct: 285 HPDKEHASADAVRAGTDLECGS-EYASLADAVKAGLIDEKEIDISLKRLLTARFELGEMD 343
Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
P + ++ + + EH+ LAL AR+ +VLL+N LPLN + +A++GP
Sbjct: 344 EQP---AWSEIPTSVLNSKEHQALALRMARESLVLLQNKNNILPLNTHL--KVAVMGPNA 398
Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGCHDVPCNSDAGFHEAVRIAKKA 517
N+ G Y GIP +LL + A + + Y GC V + + I K
Sbjct: 399 NDSVMQWGNYNGIPAHTVTLLEAVRAKLPEGQIIYEPGCDRVDGKTLQSLFDECSINGKP 458
Query: 518 DFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVART 556
F L + +RDR ++ Q+S A T
Sbjct: 459 GF--------LAEYWNNRDREGEVVATDQISTPFHFATT 489
Score = 90.9 bits (224), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 72/270 (26%), Positives = 123/270 (45%), Gaps = 55/270 (20%)
Query: 510 AVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKR 559
AV+ AD ++ G+ + E E+ DR + LP Q +L+ ++ + K+
Sbjct: 591 AVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRNLLKALKKAGKK 650
Query: 560 PVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYP 619
V + +G + + +IL YPG+AG A+ + ++G++NPGGRLP+T+Y
Sbjct: 651 VVFINYSGSA---IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFY- 706
Query: 620 ESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTI 678
++ ++P D +M+ GR+YR+ ++ FGHGLSYT ++Y E +
Sbjct: 707 KNVNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTTFTY------GEAKL 752
Query: 679 SASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARV 738
S + A + +L I V+N G DG VV ++ R
Sbjct: 753 SKNTIAKGENVVL-------------------------TIPVSNVGQRDGEEVVQVYLRR 787
Query: 739 PKVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
P +G P L F RVH A ++ ++
Sbjct: 788 PGDKEG-PRYTLRAFKRVHIPAGKTESVAI 816
>gi|189464583|ref|ZP_03013368.1| hypothetical protein BACINT_00926 [Bacteroides intestinalis DSM
17393]
gi|189436857|gb|EDV05842.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 879
Score = 261 bits (668), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 160/459 (34%), Positives = 233/459 (50%), Gaps = 59/459 (12%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
P+ N +LS RA L+ LTL+EK + + + AIPRLGI AY+WW+E+LHG+
Sbjct: 41 PYKNPALSPEERANDLVGRLTLEEKAALMQNTSPAIPRLGIKAYDWWNEALHGV------ 94
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN--------LGQAGLTF 164
G T FPQ + ASFN L ++ +A++ EARA GLT
Sbjct: 95 ----GRAGLATVFPQAIGMGASFNNELLYDVFTAISDEARAKNTEFSKEGGLKRYQGLTM 150
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
W PNINIFRDPRWGRGQET GEDP + S + V+ QG
Sbjct: 151 WTPNINIFRDPRWGRGQETYGEDPYLTSQMGMAVVRGLQGPE------------------ 192
Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
GE+ D+ L AC KH + +W +R+SFNA I +D +T+ P F+ +
Sbjct: 193 ---GEKYDK-----LHACAKHYAVHSGPEW---NRHSFNAENIDPRDLWETYLPAFKDLV 241
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFE---YQNY 339
++ +MC+YN+ G P C L R+EWG+K + SDC A++ + ++
Sbjct: 242 QKAHVKEVMCAYNRFEGEPCCGSNRLLMHILRDEWGYKEIVVSDCWAISDFYNKGAHETD 301
Query: 340 TKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
SA VL +G DI CG A+ +G + EK ID +L L + LG +
Sbjct: 302 PDKQHASAKAVL-SGTDIECGDS-YGSLPEAVKEGLIDEKQIDISLKRLMKARFELGEMD 359
Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
+P + + ++ V + EH++LAL AR+ +VLL+N++ LPLNKN +A++GP
Sbjct: 360 -EPSQVSWAQIPYSVVDSKEHRELALRMARESLVLLQNNQSLLPLNKNL--KVAVVGPNA 416
Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGC 496
N+ G Y G P +LL G+ Y+ ++ Y GC
Sbjct: 417 NDSVMQWGNYNGFPSHTITLLEGIREYLPESQIIYEPGC 455
Score = 102 bits (255), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 77/268 (28%), Positives = 121/268 (45%), Gaps = 56/268 (20%)
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVART 556
+ V K+AD +I G+ E E+ DR ++ LP Q L+ + +
Sbjct: 606 LKQTVNKVKEADVIIFAGGISPAVEGEEMHVNIPGFKGGDRETIELPSIQSRLLAELKKA 665
Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
K+ V + +G ++ +IL YPG+AG A+A ++FGD+NP GRLP+T
Sbjct: 666 GKKIVFVNFSGSA---IALTPESKTCDAILQAWYPGQAGGTAIANVLFGDYNPAGRLPVT 722
Query: 617 WYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSE 675
+Y +S +++P D +M+ R+YR+ T ++ FGHGLSYT + Y
Sbjct: 723 FY-KSTSQLPGFEDYSMKE--------RTYRYMTEAPLFPFGHGLSYTTFRYG------- 766
Query: 676 LTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLF 735
ASL K+ Q + I V+N G+ DG VV ++
Sbjct: 767 ---DASLNTQEVKDGEQTI---------------------LTIPVSNVGEYDGEEVVQVY 802
Query: 736 ARVPKVSQGTPEKQLIGFDRVHTVAKGS 763
R P +G P L F R + +AKG+
Sbjct: 803 LRRPGDKEG-PSHALRAFKRAN-IAKGA 828
>gi|423215029|ref|ZP_17201557.1| hypothetical protein HMPREF1074_03089 [Bacteroides xylanisolvens
CL03T12C04]
gi|392692292|gb|EIY85530.1| hypothetical protein HMPREF1074_03089 [Bacteroides xylanisolvens
CL03T12C04]
Length = 861
Score = 261 bits (668), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 170/519 (32%), Positives = 255/519 (49%), Gaps = 69/519 (13%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
P+ +TSL+ RA+ L+ LTL+EK+ + + + AIPRLGI YEWW+E+LHG+
Sbjct: 25 PYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGV------ 78
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL-GQAG-------LTF 164
G T FPQ + ASFN SL + +A + EAR + G++G LTF
Sbjct: 79 ----GRAGLATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQGLTF 134
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
W PN+NIFRDPRWGRGQET GEDP + + V+ Q
Sbjct: 135 WTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQ--------------------- 173
Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
G E R D+ L AC KH + +W +R+SF+A I +D +T+ P F+ +
Sbjct: 174 ---GPEDARYDK--LHACAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLV 225
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
++ +MC+YN+ G P C L Q R+EWG+KG + SDC A++ + + T
Sbjct: 226 QKAHVKEVMCAYNRFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTH-GT 284
Query: 343 HED---SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
H D ++A ++AG D+ CG+ A+ G + EK+ID +L L + + LG +
Sbjct: 285 HPDKEHASADAVRAGTDLECGS-EYASLADAVKAGLIDEKEIDISLKRLLTARFELGEMD 343
Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
P + ++ + + EH+ LAL AR+ +VLL+N LPLN + +A++GP
Sbjct: 344 EQP---AWSEIPTSVLNSKEHQALALRMARESLVLLQNKNNILPLNTHL--KVAVMGPNA 398
Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGCHDVPCNSDAGFHEAVRIAKKA 517
N+ G Y GIP +LL + A + + Y GC V + + I K
Sbjct: 399 NDSVMQWGNYNGIPAHTVTLLEAVRAKLPEGQIIYEPGCDRVDGKTLQSLFDECSINGKP 458
Query: 518 DFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVART 556
F L + +RDR ++ Q+S A T
Sbjct: 459 GF--------LAEYWNNRDREGEVVATDQISTPFHFATT 489
Score = 91.7 bits (226), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 71/269 (26%), Positives = 119/269 (44%), Gaps = 53/269 (19%)
Query: 510 AVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKR 559
AV+ AD ++ G+ + E E+ DR + LP Q L+ ++ + K+
Sbjct: 591 AVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGKK 650
Query: 560 PVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYP 619
V + +G + + +IL YPG+AG A+ + ++G++NPGGRLP+T+Y
Sbjct: 651 VVFINYSGSA---IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707
Query: 620 ESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS 679
+ D +M+ GR+YR+ ++ FGHGLSYT+++Y E +S
Sbjct: 708 DVNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTDFTY------GEAKLS 753
Query: 680 ASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVP 739
+ A + +L I V+N G DG VV ++ R P
Sbjct: 754 KNTIAKGENVVL-------------------------TIPVSNVGQRDGEEVVQVYLRRP 788
Query: 740 KVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
+G P L F RVH A ++ ++
Sbjct: 789 GDKEG-PRYTLRAFKRVHIPAGKTESVAI 816
>gi|189464219|ref|ZP_03013004.1| hypothetical protein BACINT_00556 [Bacteroides intestinalis DSM
17393]
gi|189438009|gb|EDV06994.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 865
Score = 261 bits (667), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 155/439 (35%), Positives = 230/439 (52%), Gaps = 53/439 (12%)
Query: 56 NTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF 115
N S IS R ++LIS +TL+EK+ QLS+ +IPRL +P+Y +W+E LHG+A G
Sbjct: 51 NLSQPISARVENLISKMTLEEKVAQLSNETDSIPRLNLPSYNYWNECLHGVARAG----- 105
Query: 116 NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDP 175
VT FPQ + A++++ L + SA++ EAR Y GLT+W+P IN+ RDP
Sbjct: 106 -----EVTVFPQAINLASTWDTLLIKKVASAISTEARLKYLEIGKGLTYWSPTINMARDP 160
Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
RWGR +ET GEDP + S V FVK QG D D
Sbjct: 161 RWGRNEETYGEDPYLTSRLGVAFVKGLQG---------------------------DHPD 193
Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
L A KH +A + E R+S ++ I + + + P + +C+++ A +M +Y
Sbjct: 194 YLKTVATIKHFVANNQEN----DRFSSSSQIPTKQLYEYYFPAYEACVKEADAQSVMTAY 249
Query: 296 NQVNGVP----ACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVL 351
N NGV L GD+ +K EWGF G++ SDC A+ + + E++AA +
Sbjct: 250 NAFNGVAPSGSTWLLGDVLRK---EWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGI 306
Query: 352 KAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK-GKYGKL 410
+G D+ CG +A+ G V E+ ID+AL + + + +LG F DP + Y
Sbjct: 307 NSGCDLECGGTYREKLVAAVKMGLVSEQAIDKALTRVLTARFKLGEF--DPIELVPYNHY 364
Query: 411 GPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT 470
+ + KLA +AA + IVLLKND FLP++K + S+AI+GP +N GGY+
Sbjct: 365 DKKLLAGEKFGKLAYEAAVKSIVLLKNDNDFLPVDKKKIRSVAIVGPFADN--NYLGGYS 422
Query: 471 GIPCSPKSLLRGLEAYVSK 489
G P SLL+G++ V K
Sbjct: 423 GKPVHNVSLLQGVKDLVGK 441
Score = 122 bits (307), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 104/325 (32%), Positives = 154/325 (47%), Gaps = 55/325 (16%)
Query: 486 YVSKTHYASGCHDVPCNSDAGFHEAVRIAKK----ADFVIVVAGLDLTQETEDRDRVSLL 541
Y++KT A+ C V SD G + + K+ AD V+V G D E+RD S+
Sbjct: 578 YINKTG-AAACMLV---SDFGNSDQIDKVKEFVSGADLVLVALGNDEKLARENRDLPSIY 633
Query: 542 LPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAE 601
LP Q L+ + + + R L+L G PL +A + + +IL YPG+ G KALA
Sbjct: 634 LPMTQELLLKEIYKVNPR-TALILHTGNPLTSKWAAEN--VPAILQAWYPGQEGGKALAG 690
Query: 602 IIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGL 660
I+FG NP G+LPMT Y ES ++P + D ++ + GR+Y++ + +YGFGHGL
Sbjct: 691 ILFGSENPSGKLPMTIY-ESEEQLPDILDYDI-------WKGRTYQYLSSKPLYGFGHGL 742
Query: 661 SYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISV 720
SY+N+ Y L + D V D C+ I +
Sbjct: 743 SYSNFEYTHLQSD-------------------------DVVRPDGTLQCS-------IEI 770
Query: 721 TNAGDVDGSHVVMLFARVPKVSQGT-PEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIA 779
N DV G VV ++ T P K+L+ F RV SK ++F + P QLSI
Sbjct: 771 KNISDVAGEEVVQVYISRENTPVYTFPLKKLVAFARVDLKPGESKTVTFTIAP-RQLSIW 829
Query: 780 NKHGRRILPLGNHVLMVGELRHSLT 804
+ ++LP G + L VG + L+
Sbjct: 830 QEGIWKMLP-GKYSLFVGSGQEGLS 853
>gi|383115540|ref|ZP_09936296.1| hypothetical protein BSGG_2590 [Bacteroides sp. D2]
gi|313695055|gb|EFS31890.1| hypothetical protein BSGG_2590 [Bacteroides sp. D2]
Length = 770
Score = 261 bits (666), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 209/708 (29%), Positives = 333/708 (47%), Gaps = 111/708 (15%)
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNIN 170
++ NG T +P + AA+F+ +L + A+E RA +G+ + + PN+
Sbjct: 136 AIHGNGLYRGSTIYPTPIGQAATFDPALVERMSRETAIEMRA------SGMHWTFTPNVE 189
Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
+ RD RWGR ET GEDP +V V+ FQ +++ +D I
Sbjct: 190 VARDARWGRVGETFGEDPYLVGQMGAATVRGFQTKDFTGNDKVI---------------- 233
Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFN---AMITEQDTEDTFQPPFRSCIEQGK 287
AC KHL+ G+ N A ++E+ ++ F PPF+ C+E G
Sbjct: 234 ----------ACAKHLVG------GSQPANGINGAPAELSERTLQEVFFPPFKDCLEAG- 276
Query: 288 ASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDS 346
+M ++N++NG+P L + RN+W F G++ SD + + +Y N +T +D+
Sbjct: 277 VFTVMTAHNELNGIPCHGNKYLMTEVLRNQWKFDGFVVSDWMDIERMHDYHNVAETLKDA 336
Query: 347 AAGVLKAGMDINC-GTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNG---DP 402
+ AGM ++ G + +G + EK ID A+ + V+ RLGLF D
Sbjct: 337 YRISVDAGMGMHMHGPEFYEAIIECVKEGSIPEKQIDAAVSKILEVKFRLGLFENPFIDL 396
Query: 403 RKGKYGKLGPDDVCTSE-HKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
+K D++ +E H++ AL+ AR+ IVLLKN+ LPL+ + + + G NN
Sbjct: 397 KK-------KDEIVFNEKHQQTALEGARKSIVLLKNEGNMLPLDASKYKKVFVTGHNANN 449
Query: 462 ISQMGGGYTGIPCSP-KSLLRGLEAYVSKTHY--ASGCHDVPCNSDAGFHEAVRIAKKAD 518
S +G P ++L+GL+A +T+Y +V SD EAV+ A+ +D
Sbjct: 450 QSILGDWAMEQPEEHVTTVLKGLKAISPETNYNFLDLGWNVRLLSDNQIKEAVQQARNSD 509
Query: 519 FVIVVAGLD-----LTQET--EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPL 571
I+V G + ++T E+ DR L LPG+Q LV +VA T P +++L G PL
Sbjct: 510 LAILVVGENSMRYHWNEKTCGENSDRYELSLPGRQQELVKAVAATG-VPTVVILVNGRPL 568
Query: 572 DVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMN 631
+ D + I+ PG AG +ALAEI++G NP G+LP+T +P +
Sbjct: 569 TTEW--IDENMPCIIEAWEPGVAGGQALAEILYGKVNPSGKLPIT--------IPRSTGQ 618
Query: 632 MRADSSRQYPGRSYRFYTGTQ--VYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKN 689
++ + ++ + + TG +Y FG+GLSYT Y Y+ L SE TI+ DK+
Sbjct: 619 IQCMYNHKFTNHWFPYATGNSLPLYEFGYGLSYTTYKYENLKL-SEATITP------DKS 671
Query: 690 ILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQ 749
+ V + VTN G +DG V L+ R S P K+
Sbjct: 672 V------------------------KVTVDVTNTGKMDGEETVQLYIRDEYSSATRPVKE 707
Query: 750 LIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
L F R+ A +KE+SF + P E LS + + + G +MVG
Sbjct: 708 LKDFARIPLKAGETKEVSFTLTP-EMLSYYDANMHYGVEKGTFKIMVG 754
>gi|237719778|ref|ZP_04550259.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
gi|229451047|gb|EEO56838.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
Length = 861
Score = 261 bits (666), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 161/483 (33%), Positives = 244/483 (50%), Gaps = 61/483 (12%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
P+ +TSL+ RA+ L+ LTL+EK+ + + + AIPRLGI YEWW+E+LHG+
Sbjct: 25 PYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGV------ 78
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL-GQAG-------LTF 164
G T FPQ + ASFN SL + +A + EAR + G++G LTF
Sbjct: 79 ----GRAGLATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQGLTF 134
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
W PN+NIFRDPRWGRGQET GEDP + + V+ Q
Sbjct: 135 WTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQ--------------------- 173
Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
G E R D+ L AC KH + +W +R+SF+A I +D +T+ P F+ +
Sbjct: 174 ---GPEDARYDK--LHACAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLV 225
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFE---YQNY 339
++ +MC+YN+ G P C L Q R+EWG+KG + SDC A++ + ++ Y
Sbjct: 226 QKAHVKEVMCAYNRFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHETY 285
Query: 340 TKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
E ++AG ++AG D+ CG+ A+ G + EK+ID +L L + + LG +
Sbjct: 286 PD-KEHASAGAVRAGTDLECGS-EYASLADAVKAGLIDEKEIDISLKRLLTARFELGEMD 343
Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
+ + ++ + + EH+ LAL AR+ +VLL+N LPLN + +A++GP
Sbjct: 344 ---EQSAWSEIPTSVLNSKEHQALALRMARESLVLLQNKNNILPLNTHL--KVAVMGPNA 398
Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGCHDVPCNSDAGFHEAVRIAKKA 517
N+ G Y GIP +LL + A + + Y GC V + + I K
Sbjct: 399 NDSVMQWGNYNGIPAHTVTLLEAVRAKLPEGQIIYEPGCDRVDGKTLQSLFDECSINGKP 458
Query: 518 DFV 520
F+
Sbjct: 459 GFL 461
Score = 92.0 bits (227), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 71/269 (26%), Positives = 119/269 (44%), Gaps = 53/269 (19%)
Query: 510 AVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKR 559
AV+ AD ++ G+ + E E+ DR + LP Q L+ ++ + K+
Sbjct: 591 AVKKVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGKK 650
Query: 560 PVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYP 619
V + +G + + +IL YPG+AG A+ + ++G++NPGGRLP+T+Y
Sbjct: 651 VVFINYSGSA---IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707
Query: 620 ESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS 679
+ D +M+ GR+YR+ ++ FGHGLSYT+++Y E +S
Sbjct: 708 DVNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTDFTY------GEAKLS 753
Query: 680 ASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVP 739
+ A + +L I V+N G DG VV ++ R P
Sbjct: 754 KNTIAKGENVVL-------------------------TIPVSNVGQRDGEEVVQVYLRRP 788
Query: 740 KVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
+G P L F RVH A ++ ++
Sbjct: 789 GDKEG-PRYTLRAFKRVHIPAGKTESVAI 816
>gi|255693560|ref|ZP_05417235.1| periplasmic beta-glucosidase [Bacteroides finegoldii DSM 17565]
gi|260620625|gb|EEX43496.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
finegoldii DSM 17565]
Length = 770
Score = 261 bits (666), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 209/708 (29%), Positives = 333/708 (47%), Gaps = 111/708 (15%)
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNIN 170
++ NG T +P + AA+F+ +L + A+E RA +G+ + + PN+
Sbjct: 136 AIHGNGLYRGSTIYPTPIGQAATFDPALVERMSRETAIEMRA------SGMHWTFTPNVE 189
Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
+ RD RWGR ET GEDP +V V+ FQ +++ +D I
Sbjct: 190 VARDARWGRVGETFGEDPYLVGQMGAATVRGFQTKDFTGNDKVI---------------- 233
Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFN---AMITEQDTEDTFQPPFRSCIEQGK 287
AC KHL+ G+ N A ++E+ ++ F PPF+ C+E G
Sbjct: 234 ----------ACAKHLVG------GSQPANGINGAPAELSERTLQEVFFPPFKDCLEAG- 276
Query: 288 ASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDS 346
+M ++N++NG+P L + RN+W F G++ SD + + +Y N +T +D+
Sbjct: 277 VFTVMTAHNELNGIPCHGNKYLMTEVLRNQWKFDGFVVSDWMDIERMHDYHNVAETLKDA 336
Query: 347 AAGVLKAGMDINC-GTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNG---DP 402
+ AGM ++ G + +G + EK ID A+ + V+ RLGLF D
Sbjct: 337 YQISVDAGMGMHMHGPEFYEAIIECVKEGSIPEKQIDAAVSKILEVKFRLGLFENPFIDL 396
Query: 403 RKGKYGKLGPDDVCTSE-HKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
+K D++ +E H++ AL+ AR+ IVLLKN+ LPL+ + + + G NN
Sbjct: 397 KK-------KDEIVFNEKHQQTALEGARKSIVLLKNEGNMLPLDASKYKKVFVTGHNANN 449
Query: 462 ISQMGGGYTGIPCSP-KSLLRGLEAYVSKTHY--ASGCHDVPCNSDAGFHEAVRIAKKAD 518
S +G P ++L+GL+A +T+Y +V SD EAV+ A+ +D
Sbjct: 450 QSILGDWAMEQPEEHVTTVLKGLKAISPETNYNFLDLGWNVRLLSDNQIKEAVQQARSSD 509
Query: 519 FVIVVAGLD-----LTQET--EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPL 571
I+V G + ++T E+ DR L LPG+Q LV +VA T P +++L G PL
Sbjct: 510 LAILVVGENSMRYHWNEKTCGENSDRYELSLPGRQQELVEAVAATG-VPTVVILVNGRPL 568
Query: 572 DVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMN 631
+ D + I+ PG AG +ALAEI++G NP G+LP+T +P +
Sbjct: 569 TTEW--IDENMPCIIEAWEPGVAGGQALAEILYGKVNPSGKLPIT--------IPRSTGQ 618
Query: 632 MRADSSRQYPGRSYRFYTGTQ--VYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKN 689
++ + ++ + + TG +Y FG+GLSYT Y Y+ L SE TI+ DK+
Sbjct: 619 IQCMYNHKFTNHWFPYATGNSLPLYEFGYGLSYTTYKYENLKL-SEATITP------DKS 671
Query: 690 ILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQ 749
+ V + VTN G +DG V L+ R S P K+
Sbjct: 672 V------------------------KVTVDVTNTGKMDGEETVQLYIRDEYSSATRPVKE 707
Query: 750 LIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
L F R+ A +KE+SF + P E LS + + + G +MVG
Sbjct: 708 LKDFARIPLKAGETKEVSFTLTP-EMLSYYDANMHYGVEKGTFKIMVG 754
>gi|423344787|ref|ZP_17322476.1| hypothetical protein HMPREF1060_00148 [Parabacteroides merdae
CL03T12C32]
gi|409224378|gb|EKN17311.1| hypothetical protein HMPREF1060_00148 [Parabacteroides merdae
CL03T12C32]
Length = 866
Score = 260 bits (665), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 155/473 (32%), Positives = 228/473 (48%), Gaps = 61/473 (12%)
Query: 50 DSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASN 109
+ YPF N L I R L+ LT +EKI Q+ + AI RLGIP Y+WW+E+LHG+A
Sbjct: 22 EDYPFRNPDLPIDERIDDLLKRLTAEEKIGQMMNTTPAIERLGIPEYDWWNEALHGVARA 81
Query: 110 GPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------G 161
G T FPQ + AA+F+ + V+ EARA Y+ Q G
Sbjct: 82 G----------KATVFPQAIAMAATFDDDALYETFTMVSDEARAKYHQYQKNKEYDRYKG 131
Query: 162 LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREK 221
LTFW PNINIFRDPRWGRG ET GEDP + + VK QG++ K
Sbjct: 132 LTFWTPNINIFRDPRWGRGMETYGEDPYLTERMGLAVVKGLQGDDPKY------------ 179
Query: 222 RVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRS 281
AC KH + +W +R+ F+ +T +D T+ P F +
Sbjct: 180 ---------------FKTHACAKHYAVHSGPEW---NRHEFDVTVTPRDLWQTYLPAFEA 221
Query: 282 CIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYT 340
+++G +MC+YN+ G P C L RN WG++ I SDC A+ ++ T
Sbjct: 222 LVKKGNVQEVMCAYNRYQGKPCCSSDKLLIDILRNSWGYENIILSDCGAINDFWQRDERT 281
Query: 341 KTH------EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
H E ++A + G D+ CG + A+ +GK+ E D+D +L L +
Sbjct: 282 PRHETHPDAESASADAVLNGTDLECGNS-YKALIKALKEGKISENDLDVSLRRLLKGRFE 340
Query: 395 LGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
LG+F+ D R Y ++ + V + EH AL+ A + +VLLKN LPL+K + +A+
Sbjct: 341 LGMFDPDERV-PYAQIPYNVVESPEHVAQALEMAHKSMVLLKNKNNTLPLSK-TIRKIAV 398
Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGF 507
+GP + + + Y G P ++L G+ V T +++ CN A F
Sbjct: 399 VGPNAADSTMLWANYNGFPTHTVTILEGIRNKVPDTEV---IYELGCNHAADF 448
Score = 116 bits (291), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 83/269 (30%), Positives = 133/269 (49%), Gaps = 55/269 (20%)
Query: 515 KKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILV 564
K AD ++ V G+ E E+ DR ++ +P Q +V ++ T K PV+ V
Sbjct: 601 KDADVIVYVGGISPRLEGEEMPVNVEGFKKGDRTNIEIPKVQQEMVKALKATGK-PVVYV 659
Query: 565 LTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK 624
L G L +++ D+ I +IL Y G+ A+A+I+FGD+NP GRLP+T+Y +S +
Sbjct: 660 LCTGSALALNWE--DANIDAILNAWYGGQEAGTAVADILFGDYNPSGRLPVTFY-KSIDQ 716
Query: 625 VP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLK 683
+P D +M+ GR+YR+ T T +Y FG+GLSYTN++Y+ +A L
Sbjct: 717 LPDFEDYSMK--------GRTYRYMTETPLYPFGYGLSYTNFAYR----------NAKLS 758
Query: 684 AGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ 743
+G + D+ + T + N G +DG V ++ + P +
Sbjct: 759 SGK--------------ITKDQSVTLT-------FDIANTGKMDGDEVAQIYIKNPNDPE 797
Query: 744 GTPEKQLIGFDRVHTVAKGSKEISFGVDP 772
G P K L F RVH A S+E++ + P
Sbjct: 798 G-PIKALKAFLRVHVKAGDSQEVNIELTP 825
>gi|423294294|ref|ZP_17272421.1| hypothetical protein HMPREF1070_01086 [Bacteroides ovatus
CL03T12C18]
gi|392675485|gb|EIY68926.1| hypothetical protein HMPREF1070_01086 [Bacteroides ovatus
CL03T12C18]
Length = 861
Score = 260 bits (665), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 169/520 (32%), Positives = 254/520 (48%), Gaps = 71/520 (13%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
P+ +TSL+ RA+ L+ LTL+EK+ + + + AIPRLGI YEWW+E+LHG+
Sbjct: 25 PYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGV------ 78
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL-GQAG-------LTF 164
G T FPQ + ASFN SL + +A + EAR + G++G LTF
Sbjct: 79 ----GRAGLATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGALKRYQGLTF 134
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQG-ENWKSDDGGIGFGFREKRV 223
W PN+NIFRDPRWGRGQET GEDP + + V+ QG E+ K D
Sbjct: 135 WTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDTKYDK------------ 182
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSC 282
L AC KH + +W +R+SF+A I +D +T+ P F+
Sbjct: 183 ---------------LHACAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDL 224
Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTK 341
+++ +MC+YN+ G P C L Q R+EWG+KG + SDC A++ + +
Sbjct: 225 VQKAHVKEVMCAYNRFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTH-G 283
Query: 342 THED---SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
TH D ++A ++ G D+ CG+ A+ G + EK+ID +L L + + LG
Sbjct: 284 THPDKEHASAAAVRTGTDLECGS-EYASLADAVKAGLIDEKEIDISLKRLLTARFELGEM 342
Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
+ P + ++ + + EH+ LAL AR+ +VLL+N LPLN + +A++GP
Sbjct: 343 DEQP---AWSEIPASVLNSKEHQALALRMARESLVLLQNKNNILPLNTHL--KVAVMGPN 397
Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGCHDVPCNSDAGFHEAVRIAKK 516
N+ G Y GIP +LL + A + + Y GC V + + I K
Sbjct: 398 ANDSVMQWGNYNGIPAHTVTLLEAVRAKLPEGQIIYEPGCDRVDGKTLQSLFDECSINGK 457
Query: 517 ADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVART 556
F L + +RDR ++ Q+S A T
Sbjct: 458 PGF--------LAEYWNNRDREGEVVATDQISTPFHFATT 489
Score = 91.7 bits (226), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 71/269 (26%), Positives = 119/269 (44%), Gaps = 53/269 (19%)
Query: 510 AVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKR 559
AV+ AD ++ G+ + E E+ DR + LP Q +L+ ++ + K+
Sbjct: 591 AVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRNLLKALKKAGKK 650
Query: 560 PVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYP 619
V + +G + + +IL YPG+AG A+ + ++G++NPGGRLP+T+Y
Sbjct: 651 VVFINYSGSA---IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707
Query: 620 ESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS 679
+ D +M+ GR+YR+ ++ FGHGLSYT ++Y E +S
Sbjct: 708 DVNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTTFTY------GEAKLS 753
Query: 680 ASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVP 739
+ A + +L I V+N G DG VV ++ R P
Sbjct: 754 KNTIAKGENVVL-------------------------TIPVSNVGQRDGEEVVQVYLRRP 788
Query: 740 KVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
+G P L F RVH A ++ ++
Sbjct: 789 GDKEG-PRYTLRAFKRVHIPAGKTESVAI 816
>gi|393781366|ref|ZP_10369565.1| hypothetical protein HMPREF1071_00433 [Bacteroides salyersiae
CL02T12C01]
gi|392676859|gb|EIY70281.1| hypothetical protein HMPREF1071_00433 [Bacteroides salyersiae
CL02T12C01]
Length = 854
Score = 260 bits (665), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 164/437 (37%), Positives = 235/437 (53%), Gaps = 57/437 (13%)
Query: 64 RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
R L+S LT++EKI L + IPRL I Y +E+LHG+ G NF T
Sbjct: 38 RILDLLSKLTIEEKISLLRATSPGIPRLQIDKYYHGNEALHGVVRPG---NF-------T 87
Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN--------LGQAG--LTFWAPNINIFR 173
FPQ + AA +N L + I +A++ EARA +N LGQ LTFW+P +N+ R
Sbjct: 88 VFPQAIGLAAMWNPQLLNEISTAISDEARARWNELEQGKKQLGQFSDLLTFWSPTVNMAR 147
Query: 174 DPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDR 233
DPRWGR ET GEDP + V FVK QG+ + R LK
Sbjct: 148 DPRWGRTPETYGEDPFLSGKLGVSFVKGLQGD--------------DPRYLK-------- 185
Query: 234 GDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMC 293
+ + KH A + E +R+ N +I+E+D + + P F CI +GKA+ IM
Sbjct: 186 -----IVSTPKHFAANNEEH----NRFECNPIISEKDLREYYLPAFEKCIIEGKAASIMT 236
Query: 294 SYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLK 352
+YN +N VP L L +K R++WGF GY+ SDC A + + Y KT E +A ++
Sbjct: 237 AYNAINDVPCTLNNWLLKKVLRHDWGFDGYVVSDCGAPDFLVTHHKYVKTLEAAATLSIQ 296
Query: 353 AGMDINCG-TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLG 411
AG+D+ CG + +A + V E +ID A ++ ++RLGLF+ DP Y K+
Sbjct: 297 AGLDLECGDNVYMEPLLNAYKQYMVTEAEIDSAAYHILRARMRLGLFD-DPNLNPYNKIS 355
Query: 412 PDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTG 471
P V +H +LAL+AARQ IVLLKN+KKFLPL+ + S+A++G +N + G Y+G
Sbjct: 356 PSVVGCEKHSQLALEAARQSIVLLKNEKKFLPLDLKKIKSIAVVG--INAGNCEFGDYSG 413
Query: 472 IPCS-PKSLLRGLEAYV 487
P + P S+L G++ V
Sbjct: 414 TPVNQPVSILEGIKKRV 430
Score = 140 bits (353), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 93/299 (31%), Positives = 142/299 (47%), Gaps = 57/299 (19%)
Query: 515 KKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
+K D I V G++ + E E +DR S+ LP Q + + + V+ VL G L ++
Sbjct: 600 RKCDLTIAVVGINKSIEREGQDRYSIELPKDQQIFIEEAYKINPNTVV-VLVAGSSLAIN 658
Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMR 633
+ D I +I+ YPGEAG A+AE++FGD+NPGG+LP+T+Y S ++P +D ++R
Sbjct: 659 WM--DEHIPAIVNAWYPGEAGGTAVAEVLFGDYNPGGKLPLTYY-RSLDELPAFDDYDIR 715
Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
GR+Y+F+ G +Y FGHGLSYT +SYK L
Sbjct: 716 K-------GRTYQFFEGNPLYAFGHGLSYTTFSYKKL----------------------- 745
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT------PE 747
+ID + F ++ N G DG V L+ + QG P
Sbjct: 746 --------NIDSTGDAVKVSF----ALKNTGKYDGDEVAQLYVKY----QGNDSLVKLPL 789
Query: 748 KQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
KQL GF+RVH SK ++ V E + G P G+++ MVG ++ ++
Sbjct: 790 KQLKGFERVHLKKGESKRVTLTVPKSELRFWDEEKGEFYTPAGDYLFMVGTASDAIQLQ 848
>gi|427383551|ref|ZP_18880271.1| hypothetical protein HMPREF9447_01304 [Bacteroides oleiciplenus YIT
12058]
gi|425728735|gb|EKU91590.1| hypothetical protein HMPREF9447_01304 [Bacteroides oleiciplenus YIT
12058]
Length = 939
Score = 260 bits (665), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 206/751 (27%), Positives = 343/751 (45%), Gaps = 123/751 (16%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ S T+FP L ++NR L +G
Sbjct: 150 RLGIPV-DFTNEGIRGVES-----------YRATNFPTQLGLGHTWNRKLIHQVGLITGR 197
Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
EAR + G T +AP +++ RD RWGR +E GE P +V+ +E V+ Q +
Sbjct: 198 EARML------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGMQHNH-- 249
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
++A KH +AY K + ++
Sbjct: 250 -----------------------------QVAATGKHFVAYSNNKGAREGMARVDPQMSP 280
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF---QKARNEWGFKGYITS 325
++ E PF+ I++ +M SYN +G+P ++G + ++ R E GF+GY+ S
Sbjct: 281 REVEMIHVYPFKRVIKEAGMLGVMSSYNDYDGIP--IQGSYYWLTKRLRGEMGFRGYVVS 338
Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDI 381
D DAV ++ + K +++ ++AG+++ C + + + +G + E I
Sbjct: 339 DSDAVEYLYTKHSTAKDMKEAVRQSVEAGLNVRCTFRSPDSYVLPLRELVKEGGLSEDII 398
Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF 441
+ + ++ V+ +GLF+ + G +V +E++ +AL A+R+ ++LLKN+
Sbjct: 399 NDRVRDILRVKFLIGLFDAPYQTDLAG--ADKEVEKAENEAVALQASRESLILLKNENNV 456
Query: 442 LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCHD 498
LPL+ N + ++A+ GP N Y + ++L G+ ++ YA GC
Sbjct: 457 LPLDINNIKTIAVCGPNANEEGYALTHYGPLAVEVITVLEGIRQKAEGKAEVLYAKGCDL 516
Query: 499 VPCN--------------SDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPG 544
V N A ++AV A+KAD +VV G E++ R SL LPG
Sbjct: 517 VDANWPESELIEYPMTNEEQAEINKAVENARKADVAVVVLGGGQRTCGENKSRSSLDLPG 576
Query: 545 QQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIF 604
+Q+ L+ +V T K PV+LVL G PL +++A D + +IL YPG G A+A+++F
Sbjct: 577 RQLKLLQAVQATGK-PVVLVLINGRPLSINWA--DKFVPAILETWYPGSKGGTAVADVLF 633
Query: 605 GDFNPGGRLPMTWYPESFTKVPMN---DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLS 661
GD+NPGG+L +T +P+S ++P N + + D + +Y FG+GLS
Sbjct: 634 GDYNPGGKLTVT-FPKSVGQIPFNFPCKPSSQIDGGKNPGPDGNMSRVNGSLYPFGYGLS 692
Query: 662 YTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVT 721
YT + Y S + IS + + K V+ VT
Sbjct: 693 YTTFEY------SNIEISPKMMTANQKAT-------------------------VRCKVT 721
Query: 722 NAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANK 781
N G G VV L+ R S T EK L GF+RVH +KE++F +D + L + +K
Sbjct: 722 NTGKRAGDEVVQLYIRDMLSSVTTYEKNLAGFERVHLQPGETKEVTFILD-RKHLELLDK 780
Query: 782 HGRRILPLGNHVLMVG------ELRHSLTIE 806
H ++ G+ +MVG L LT+E
Sbjct: 781 HMEWVVEPGDFSIMVGASSEDIRLNGMLTVE 811
>gi|313145353|ref|ZP_07807546.1| periplasmic beta-glucosidase [Bacteroides fragilis 3_1_12]
gi|313134120|gb|EFR51480.1| periplasmic beta-glucosidase [Bacteroides fragilis 3_1_12]
Length = 802
Score = 260 bits (665), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 227/744 (30%), Positives = 341/744 (45%), Gaps = 136/744 (18%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP + E HG + G T FP + A+++N L +G +A
Sbjct: 137 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRQMGRVIAT 184
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
EA A Q + P +++ RDPRW R +ET GEDP + V+ FQG+ +
Sbjct: 185 EASA-----QGAHIGYGPVLDLARDPRWSRVEETYGEDPYLNGVMGAALVRGFQGDTLRG 239
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
K V+ A KH +Y W A + E+
Sbjct: 240 ----------RKSVI----------------ATLKHFASYG---WTEGGHNGGTAHLGER 270
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC-LRGDLFQKARNEWGFKGYITSDCD 328
+ E+ PPFR + G A +M SYN+++G P R L ++ W FKG++ SD
Sbjct: 271 ELEEAIFPPFREAVGAG-ALSVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVSDLY 329
Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSAIDKGKVQEKDIDRALLN 387
A+ + E+ +E +A + AG+D + GT + +A+ KG V + +D+A+
Sbjct: 330 AIGGLREHGVAGSDYE-AAVKAVNAGVDSDLGTNVYAEQLVAAVRKGDVAMETVDKAVRR 388
Query: 388 LFSVQLRLGLFNG----DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLP 443
+ ++ +GLF+ D R + V + EH LA + ARQ IVLLKN+ K LP
Sbjct: 389 ILFLKFHMGLFDAPFVDDKRPAQL-------VASPEHIGLAREVARQSIVLLKNEDKLLP 441
Query: 444 LNKNAVSSLAIIGPLVNNISQMGGGYTGIPC--SPKSLLRGLEAYVSK---THYASGCHD 498
L K+ + +LA+IGP +N M G YT S ++L G+ VSK YA GC
Sbjct: 442 LKKD-IRTLAVIGPNADNGYNMLGDYTAPQADGSVVTVLEGIRQKVSKDTRVLYAKGCA- 499
Query: 499 VPCNSDAGFHEAVRIAKKADFVIVVAG----LDLTQETEDR------------------- 535
V +S GF +A+ A+ AD V++V G D + E E+
Sbjct: 500 VRDSSRTGFADAIEAARSADVVVMVVGGSSARDFSSEYEETGAAKVSANRVSDMESGEGY 559
Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS--FAEADSQISSILWIGYPGE 593
DR +L L G+Q+ L+ V + K P++LVL G PL + EAD +IL YPG
Sbjct: 560 DRATLHLMGRQLELLEEVRKLGK-PMVLVLIKGRPLLMEGVIQEAD----AILDAWYPGM 614
Query: 594 AGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQV 653
G A+A+++FGD+NP GRL ++ P S ++P+ R + +Y + GT
Sbjct: 615 QGGNAVADVLFGDYNPAGRLTLS-VPRSVGQLPVYYNTKRKGNRSRYIEEA-----GTPR 668
Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
Y FG+GLSYT +SY TG ++ +E C R
Sbjct: 669 YPFGYGLSYTTFSY--------------------------TGMKVRVS--EESNHC---R 697
Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPC 773
V ++V N G VDG VV L+ R S TP++QL F RV A ++EI+F +D
Sbjct: 698 VDVSVTVRNQGTVDGDEVVQLYLRDEVGSFTTPDRQLRAFSRVRLKAGETREITFTLDK- 756
Query: 774 EQLSIANKHGRRILPLGNHVLMVG 797
+ L++ + G + G +M G
Sbjct: 757 KSLALYMRDGEWAVEPGRFTVMAG 780
>gi|218132025|ref|ZP_03460829.1| hypothetical protein BACEGG_03650 [Bacteroides eggerthii DSM 20697]
gi|217985785|gb|EEC52125.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
eggerthii DSM 20697]
Length = 888
Score = 260 bits (665), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 166/468 (35%), Positives = 232/468 (49%), Gaps = 63/468 (13%)
Query: 44 CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESL 103
C YPF N L + R +L+SLLTLQEK+ + + + AI RLGIPAY WW E+
Sbjct: 20 CSEAEKYQYPFQNPKLKVEERVDNLMSLLTLQEKVSMMINRSLAIERLGIPAYNWWGEAC 79
Query: 104 HGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL----GQ 159
HG+ + G VT FPQ + AA+F+ S + V+ EARA YN G
Sbjct: 80 HGLIAGG-----------VTVFPQSIALAATFDDSSQLTTYTMVSDEARARYNTLPLDGD 128
Query: 160 AG--------LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDD 211
G LTFWAPNINIFRDPRWGRGQET GEDP ++S + V QG
Sbjct: 129 IGPYVSAIPNLTFWAPNINIFRDPRWGRGQETYGEDPFLMSRMGLNVVLGMQG------- 181
Query: 212 GGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDT 271
D AC KH Y + R+ FNA+++ +D
Sbjct: 182 --------------------DDEHYYKTHACAKH---YGVHSGPEPLRHEFNAVVSMRDL 218
Query: 272 EDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAV 330
+T+ P F + + +G +MC+Y+ G P C L RN WGF G + SDCDA+
Sbjct: 219 WETYLPAFETLVVKGNVREVMCAYSAYEGEPCCASNRLLVDILRNRWGFDGMVVSDCDAI 278
Query: 331 ATIF---EYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLN 387
+ ++ + SA VL G D+ CG A++KG ++E+D+D +L
Sbjct: 279 NDFYVKGRHETHPDAAAASADAVL-TGTDLECGRS-YNALIEAVEKGIIKEQDLDVSLRR 336
Query: 388 LFSVQLRLGLFNGDPRKG-KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNK 446
+ + + RLGL DP K Y + + EH+ AL A + VLLKN+ LPL+K
Sbjct: 337 ILTERFRLGLL--DPAKYVPYSTIPGSVIDCQEHRDHALKMAHESQVLLKNEGNILPLDK 394
Query: 447 NAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYAS 494
N + S+AI+GP +N+ M G Y+G P ++L+GL+ + T S
Sbjct: 395 N-IKSIAIVGPNINDSIMMRGNYSGSPTHCITILQGLKNKLPNTRIIS 441
Score = 115 bits (287), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 83/295 (28%), Positives = 137/295 (46%), Gaps = 47/295 (15%)
Query: 505 AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILV 564
A F E K + +I V G+ TQE E +R + LP Q + ++ T K PVI V
Sbjct: 619 AVFEELKEKIKDVEAIIYVGGITPTQEGEGHERAKIELPDVQKRFLKAMHETGK-PVIYV 677
Query: 565 LTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK 624
G ++ A+ D ++L YPG+ G A+A+++FGD+NP G+LP+T+Y +
Sbjct: 678 NCSGSA--IALADIDYAYDALLQAWYPGQEGGTAVADVLFGDYNPSGKLPVTFYKSTEQL 735
Query: 625 VPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKA 684
D +M R+YR++ G Y FG+GLSYT++ + L S+S+KA
Sbjct: 736 PEFTDYSME--------NRTYRYFKGEPQYAFGYGLSYTDFEFG-----EALLSSSSIKA 782
Query: 685 GSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQG 744
G + + I +TN G +DG+ VV ++ + +
Sbjct: 783 GE--------------------------KVEITIPLTNVGKMDGAEVVQVYVK-SLTNPD 815
Query: 745 TPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANK--HGRRILPLGNHVLMVG 797
P K L G+ R A S+++ ++P E + N G ++ P G + ++ G
Sbjct: 816 APIKSLKGYVRQEIKAGKSEKVRITLEP-ESFAYYNADVDGLKVFP-GKYQILYG 868
>gi|336399370|ref|ZP_08580170.1| Beta-glucosidase [Prevotella multisaccharivorax DSM 17128]
gi|336069106|gb|EGN57740.1| Beta-glucosidase [Prevotella multisaccharivorax DSM 17128]
Length = 862
Score = 260 bits (664), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 159/463 (34%), Positives = 235/463 (50%), Gaps = 55/463 (11%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
P+ + LS RAK L S LTL+EK + D + AIPRLGI + WWSE+LHG A+NG
Sbjct: 24 PYQDPGLSFEARAKDLCSRLTLEEKASLMCDVSPAIPRLGIKPFNWWSEALHGYANNG-- 81
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA---------GLT 163
VT FP+ + AASFN ++ + +A + EAR YN A L+
Sbjct: 82 --------DVTVFPEPIGMAASFNPTMVYQVFTATSDEARGKYNQSMAEGKEDTRFHSLS 133
Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
W PN+NIFRDPRWGRGQET GEDP + S VE VK QG
Sbjct: 134 VWTPNVNIFRDPRWGRGQETYGEDPYLTSVMGVEVVKGLQGP------------------ 175
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFN-AMITEQDTEDTFQPPFRSC 282
ES + +L AC KH + + ++R++ N A I+ +D +T+ P F++
Sbjct: 176 ------ESTKYRKLY--ACAKHFAVHSGPE---YTRHTANLADISPRDLWETYLPAFKAT 224
Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTK 341
++Q +MC+Y +++ P C L Q+ R+EWGF+ + SDC A+A + + +
Sbjct: 225 VQQAGVREVMCAYQRLDDEPCCGNSRLLQQILRDEWGFRHMVVSDCGAIADFYTNHHVSS 284
Query: 342 THEDSAAGVLKAGMDINCG-TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNG 400
+AA AG D+ CG A+ +G V E ++D+ ++ L + LG+ +
Sbjct: 285 DAVHAAAKGTLAGTDVECGFGYAYMKLPEAVRRGLVSEAEVDKHVIRLLKGRFELGVMD- 343
Query: 401 DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVN 460
DP+ + K+ P V + H++LAL+ ARQ + LL+N LPL K +A++GP
Sbjct: 344 DPKLVSWTKISPKVVDSDAHRQLALNMARQTMTLLQNRNNVLPLAKG--EKIAVVGPNAA 401
Query: 461 NISQMGGGYTGIPCSPKSLLRGLEAYVSK-THYASGCHDVPCN 502
+ + G Y G P ++L G+ A K Y GC V N
Sbjct: 402 DGPMLWGNYNGTPSRTTTILEGIRAKAGKDIPYLQGCDLVNKN 444
Score = 86.3 bits (212), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 75/285 (26%), Positives = 128/285 (44%), Gaps = 64/285 (22%)
Query: 497 HDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQ 546
H+ P N + A+R + V+ V G+ E E+ DR S+ LP Q
Sbjct: 584 HESPIN----YATAIRQLRGVRTVVFVGGISSKLEGEEMPVHVEGFKGGDRTSIELPAVQ 639
Query: 547 MSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD 606
+ ++ K V + +G ++ +IL Y GE G +A+A++++GD
Sbjct: 640 RDFLKALKAAGKTVVFVNCSGSA---IALTPEVESCDAILQAWYAGEEGGRAVADVLYGD 696
Query: 607 FNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNY 665
+NPGG+LP+T+Y S T++P +D +M+ GR+YR+++ ++ FG+GLSYT +
Sbjct: 697 YNPGGKLPVTFY-RSTTQLPAFDDYSMK--------GRTYRYFSDA-LFPFGYGLSYTRF 746
Query: 666 SY--KFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNA 723
+ LSAP ++KA + + + V+N
Sbjct: 747 AIGKGSLSAP-------AMKADG--------------------------KVTLTVPVSNV 773
Query: 724 GDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
G G VV ++ R + G P K L F RV A S++++
Sbjct: 774 GKRTGDEVVQVYVRDVNDADG-PLKSLKAFRRVSLKAGESRKVTI 817
>gi|325103214|ref|YP_004272868.1| glycoside hydrolase family protein [Pedobacter saltans DSM 12145]
gi|324972062|gb|ADY51046.1| glycoside hydrolase family 3 domain protein [Pedobacter saltans DSM
12145]
Length = 866
Score = 260 bits (664), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 162/462 (35%), Positives = 239/462 (51%), Gaps = 61/462 (13%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
YPF + L R L+ LT++EK+ + D + I RLGI Y WW+E+LHG+A G
Sbjct: 24 YPFQDNRLPFDKRVDDLLQRLTVEEKVLLMQDVSRPIERLGIKQYNWWNEALHGVARAG- 82
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN--LGQA------GLT 163
T FPQ + AASF+R N+ +AV+ EARA +N L Q GLT
Sbjct: 83 ---------LATVFPQPIGMAASFDRDALFNVFNAVSDEARAKHNYHLSQGSYGRYEGLT 133
Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQG-ENWKSDDGGIGFGFREKR 222
W P INIFRDPRWGRG ET GEDP + + V+ VK QG N K D
Sbjct: 134 MWTPTINIFRDPRWGRGIETYGEDPYLTAVMGVQAVKGLQGPSNGKYDK----------- 182
Query: 223 VLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRS 281
L AC KH + +W +R+SF+A I ++D +T+ P F +
Sbjct: 183 ----------------LHACAKHFAVHSGPEW---NRHSFDAANIKQRDLYETYLPAFEA 223
Query: 282 CIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYT 340
+++ K +MC+YN+ G P C L Q+ R +WGF+G + +DC A+A F+ +N
Sbjct: 224 LVKEAKVQEVMCAYNRFEGDPCCGSDRLLQQILRKKWGFEGIVVADCGAIADFFK-ENAH 282
Query: 341 KTHED---SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
KTH D ++A + +G D++CG+ T+ A+ KG ++EKDID ++ L + RLG
Sbjct: 283 KTHPDAASASAAAVYSGTDLDCGSSYKALTE-AVKKGLIEEKDIDVSVRRLLMARFRLGE 341
Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
+ D + K+ + V + H ++ALD AR+ I LL+N LPL + +A++GP
Sbjct: 342 MD-DQSLVPWSKISYNVVASKAHNQIALDMARKSITLLQNKNNILPLKSGGL-KIAVMGP 399
Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGC 496
+ G Y G P + ++L G++A + K Y C
Sbjct: 400 NAQDSVMQWGNYNGTPANTITILEGIKAALGPKDKLIYEQAC 441
Score = 109 bits (273), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 93/331 (28%), Positives = 153/331 (46%), Gaps = 63/331 (19%)
Query: 486 YVSKTHYASGCHDVPCNSDAGFHEAVRIAKK------ADFVIVVAGLDLTQETEDR---- 535
Y K ++A D N D G+ E I K AD V+ V G+ + E E+
Sbjct: 566 YDIKFYFAHNNGDAQLNFDIGYKEEANINKSIKNIAGADLVVFVGGISPSLEGEEMGVKL 625
Query: 536 ------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIG 589
DR + LP Q V ++ KR VI + G P+ ++ A+S+ W
Sbjct: 626 PGFRGGDRTDIQLPTIQRQFVKALKEAGKR-VIFINCSGSPIGLADEMANSEAIVQAW-- 682
Query: 590 YPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFY 648
YPG+AG +A+A+++FG +NP GRLP+T+Y ++ T++P + +M GR+YR+
Sbjct: 683 YPGQAGGQAVADVLFGKYNPSGRLPITFYRDT-TQLPDFENYDM--------AGRTYRYM 733
Query: 649 TGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTS 708
++ FG+GLSYT + Y IL Q +T+
Sbjct: 734 QDKPLFPFGYGLSYTQFQY-------------------GNPILNQ----------QVITN 764
Query: 709 CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
+++ V VTN G G VV ++ R + G P K L F R+ A ++++ F
Sbjct: 765 GQTIQLTV--PVTNTGKRSGDEVVQVYLRKKGDATG-PVKTLRDFRRLSFNAGQTQQVVF 821
Query: 769 GVDPCEQLSIANKHGRRI-LPLGNHVLMVGE 798
+ P +QL N+ + + + G++ L+VG+
Sbjct: 822 KITP-KQLEWWNEQSKAMQVQSGDYELLVGK 851
>gi|29350122|ref|NP_813625.1| periplasmic beta-glucosidase , xylosidase/arabinosidase
[Bacteroides thetaiotaomicron VPI-5482]
gi|29342034|gb|AAO79819.1| periplasmic beta-glucosidase precursor, xylosidase/arabinosidase
[Bacteroides thetaiotaomicron VPI-5482]
Length = 769
Score = 260 bits (664), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 218/742 (29%), Positives = 334/742 (45%), Gaps = 137/742 (18%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP + E+ HG + G T FP + AA+++ +L +G+ +A
Sbjct: 114 RLGIPLF-LAEEAPHGHMAIG-----------TTVFPTGIGMAATWSPTLIEEVGNVIAK 161
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
E R+ Q + P +++ RDPRW R +ET GEDP++ +
Sbjct: 162 EIRS-----QGAHISYGPVLDLSRDPRWSRVEETFGEDPVLSGRLGAAMIL--------- 207
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
G+G G D E A KH +AY + + G Y A + +
Sbjct: 208 ---GLGSG--------------DLSCEYATIATLKHFLAYAVPEGGQNGNY---ASVGTR 247
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCD 328
D + F PPFR I+ G S +M SYN ++GVP L Q RNEW F+G++ SD
Sbjct: 248 DLHENFLPPFREAIDAGALS-VMTSYNSIDGVPCTANHYLLTQLLRNEWRFRGFVVSDLY 306
Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
++ + E T E++A + AG DI+ G + A+ GK+ E ID A+ +
Sbjct: 307 SIEGVHESHFVAPTIEEAAMQAVSAGADIDLGGDAFMNLTHAVQFGKISEAVIDTAVCRV 366
Query: 389 FSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
++ +GLF +P K V + +H KLA A+ IVLLKN+ LPLN
Sbjct: 367 LRMKFEIGLFEHPYVNP------KTATKIVRSKDHIKLARKVAQSSIVLLKNENSILPLN 420
Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV-----SKTHYASGC--HD 498
K + +A++GP +N M G YT P +++ L+ + SK Y GC D
Sbjct: 421 KK-IKKVAVVGPNADNRYNMLGDYTA-PQEDENIKTVLDGVISKLSPSKVEYVRGCAIRD 478
Query: 499 VPCNSDAGFHEAVRIAKKADFVIVVAG----LDLT---QET----------------EDR 535
N A EAV A +++ +I V G D QET E
Sbjct: 479 TTVNEIA---EAVEAASRSEVIIAVVGGSSARDFKTSYQETGAAIADEKSISDMECGEGF 535
Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
DR +L L G+Q L+ ++ T K P+I+V G PLD + A ++L YPG+ G
Sbjct: 536 DRATLTLLGKQQDLLIALKATGK-PLIVVYIEGRPLDKVW--ASEYADALLTASYPGQEG 592
Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
A+A+++FGD+NP GRLP++ P S ++P+ N +A + Y ++ + +Y
Sbjct: 593 GYAIADVLFGDYNPAGRLPVS-IPRSVGQIPVY-YNKKAPRNHDYVEQA-----ASPLYT 645
Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
FG+GLSYT + Y SD ++++ + C F
Sbjct: 646 FGYGLSYTTFEY------------------SDLQVIRK-------------SPC---HFE 671
Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
V V N G DG V L+ R S P +QL F+R KEI F + +
Sbjct: 672 VSFKVKNTGSYDGEEVAQLYLRDEYASVVQPLRQLKCFERFFLKRGEEKEIFFTLTE-KD 730
Query: 776 LSIANKHGRRILPLGNHVLMVG 797
LSI +++ +R++ G+ +M+G
Sbjct: 731 LSIIDRNMKRVVETGDFRIMIG 752
>gi|261405721|ref|YP_003241962.1| glycoside hydrolase family protein [Paenibacillus sp. Y412MC10]
gi|261282184|gb|ACX64155.1| glycoside hydrolase family 3 domain protein [Paenibacillus sp.
Y412MC10]
Length = 765
Score = 260 bits (664), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 216/762 (28%), Positives = 353/762 (46%), Gaps = 135/762 (17%)
Query: 76 EKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASF 135
E + + A RLGIP E HG + G GTV FP L +++
Sbjct: 89 EAVNHIQRYAVEQSRLGIPIL-IGEECSHGHMAIG------GTV-----FPVPLSIGSTW 136
Query: 136 NRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYA 195
N L+ ++ AVA+E R+ Q G ++P +++ RDPRWGR +E GEDP ++S YA
Sbjct: 137 NVDLYRDMCRAVALETRS-----QGGAVTYSPVLDVVRDPRWGRTEECFGEDPYLISEYA 191
Query: 196 VEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG 255
V V+ QGE+ S ++A KH + Y + G
Sbjct: 192 VASVEGLQGESLDSPSS--------------------------VAATLKHFVGYGSSEGG 225
Query: 256 NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-R 314
R + + ++ + PF+ +E G AS IM +YN+++GVP + +L R
Sbjct: 226 ---RNAGPVHMGTRELMEVDMLPFKKAVEAGAAS-IMPAYNEIDGVPCTVNTELLDGILR 281
Query: 315 NEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSAIDK 373
EWGF G + +DC A+ + + + D+A ++AG+D+ M +H Q A++
Sbjct: 282 KEWGFDGMVITDCGAIDMLASGHDTAEDGMDAAVQAIRAGIDMEMSGEMFGKHLQKAVES 341
Query: 374 GKVQEKDIDRALLNLFSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQ 430
K++ +D A+ + +++ +LGLF DP+ + + + + +H LA A +
Sbjct: 342 NKLEVSVLDEAVRRVLTLKFKLGLFENPYVDPQTAE------NVIGSEQHVGLARQLAAE 395
Query: 431 GIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTG--IPCSPKSLLRGLEAYV- 487
GIVLLKN+ K LPL+K +A+IGP + G YT P + ++L G+ A +
Sbjct: 396 GIVLLKNEAKALPLSKEG-GVIAVIGPNADQGYNQLGDYTSPQPPAAVTTVLGGIRAKLG 454
Query: 488 ---SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG-----------LDL----T 529
+ YA GC + +S GF A+ A++AD V++V G +DL +
Sbjct: 455 EEAQRVLYAPGCR-IKDDSREGFEFALTCAEQADTVVMVLGGSSARDFGEGTIDLRTGAS 513
Query: 530 QETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
+ T+D DR++L L G Q+ LV + + KR +I+V G P+ + D
Sbjct: 514 KVTDDALSDMDCGEGIDRMTLQLSGVQLELVQEIHKLGKR-MIVVYINGRPIAEPW--ID 570
Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQ 639
+IL YPG+ G A+A+I+FGD NP G+L M+ P+ ++P+ R+
Sbjct: 571 EHADAILEAWYPGQEGGHAVADILFGDVNPSGKLTMS-IPKHVGQLPVYYNGKRSR---- 625
Query: 640 YPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLD 699
G+ Y Y FG+GLSYT +SY + E+ G+D +
Sbjct: 626 --GKRYLEEDSQPRYPFGYGLSYTEFSYSDIQMTPEVI-------GTDGTAV-------- 668
Query: 700 YVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTV 759
V ++VTN+GD +GS VV L+ P ++L GF ++
Sbjct: 669 ----------------VSVNVTNSGDCEGSEVVQLYVSDAASKYTRPARELKGFQKIFLQ 712
Query: 760 AKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRH 801
+++ F + P EQL + R+++ G +M+G RH
Sbjct: 713 PGERRKVEFTIGP-EQLQYIGQDYRQVVEPGLFRVMLG--RH 751
>gi|423279982|ref|ZP_17258895.1| hypothetical protein HMPREF1203_03112 [Bacteroides fragilis HMW
610]
gi|404584318|gb|EKA88983.1| hypothetical protein HMPREF1203_03112 [Bacteroides fragilis HMW
610]
Length = 812
Score = 260 bits (664), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 228/747 (30%), Positives = 343/747 (45%), Gaps = 142/747 (19%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP + E HG + G T FP + A+++N L +G +A+
Sbjct: 147 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRQMGRVIAI 194
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
EA A Q + P +++ RDPRW R +ET GEDP + V+ FQG+ +
Sbjct: 195 EASA-----QGAHIGYGPVLDLARDPRWSRVEETYGEDPYLNGVMGAALVRGFQGDTLRG 249
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
K V+ A KH +Y W A + E+
Sbjct: 250 ----------RKSVI----------------ATLKHFASYG---WTEGGHNGGTAHLGER 280
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC----LRGDLFQKARNEWGFKGYITS 325
+ E+ PPFR + G A +M SYN+++G P L D+ + + W FKG++ S
Sbjct: 281 ELEEAIFPPFREAVGAG-ALSVMSSYNEIDGNPCTGSRYLLTDILE---DRWLFKGFVVS 336
Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSAIDKGKVQEKDIDRA 384
D A+ + E+ +E +A + AG+D + GT + +A+ KG V + +D+A
Sbjct: 337 DLYAIGGLREHGVAGSDYE-AAVKAVNAGVDSDLGTNVYAEQLVAAVRKGDVAMETVDKA 395
Query: 385 LLNLFSVQLRLGLFNG----DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKK 440
+ + S++ +GLF+ D R + V + EH LA + ARQ IVLLKN+ K
Sbjct: 396 VRRILSLKFHMGLFDAPFVDDKRPAQL-------VASPEHIGLAREVARQSIVLLKNEDK 448
Query: 441 FLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPC--SPKSLLRGLEAYVSK---THYASG 495
LPL K+ + +LA+IGP +N M G YT S ++L G+ VSK YA G
Sbjct: 449 LLPLKKD-IRTLAVIGPNADNGYNMLGDYTAPQADGSVVTVLEGIRQKVSKDTRVFYAKG 507
Query: 496 CHDVPCNSDAGFHEAVRIAKKADFVIVVAG----LDLTQETEDR---------------- 535
C V +S GF +A+ A+ AD V++V G D + E E+
Sbjct: 508 CA-VRDSSRTGFADAIESARSADVVVMVVGGSSARDFSSEYEETGAAKVSANRVSDMESG 566
Query: 536 ---DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS--FAEADSQISSILWIGY 590
DR +L L G+Q+ L+ V + K P++LVL G PL + EAD +IL Y
Sbjct: 567 EGYDRATLHLMGRQLELLEEVRKLGK-PMVLVLIKGRPLLMEGVIQEAD----AILDAWY 621
Query: 591 PGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTG 650
PG G A+A+++FGD+NP GRL ++ P S ++P+ R + +Y + G
Sbjct: 622 PGMQGGNAVADVLFGDYNPAGRLTLS-VPRSVGQLPVYYNTKRKGNRSRYIEEA-----G 675
Query: 651 TQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCT 710
T Y FG+GLSYT +SY TG ++ +E C
Sbjct: 676 TPRYPFGYGLSYTTFSY--------------------------TGMKVRVS--EESNHC- 706
Query: 711 SLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGV 770
R V ++V N G VDG VV L+ R S TP++QL F RV A + EI+F +
Sbjct: 707 --RVDVSVTVRNQGTVDGDEVVQLYLRDEVGSFTTPDRQLRAFRRVRLKAGETWEITFTL 764
Query: 771 DPCEQLSIANKHGRRILPLGNHVLMVG 797
D + L++ + G + G +M G
Sbjct: 765 DK-KSLALYMRDGEWAVEPGRFTVMAG 790
>gi|293370402|ref|ZP_06616956.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
gi|292634550|gb|EFF53085.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
Length = 863
Score = 260 bits (664), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 175/551 (31%), Positives = 263/551 (47%), Gaps = 79/551 (14%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
YP+ +T L++ RA L+ LTL+EK+ + +N+ AIPRLGI YEWW+E+LHG+A G
Sbjct: 26 YPYQDTKLTVEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARAG- 84
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARA----MYNLGQ----AGLT 163
T FPQ + AASFN L + AV+ EARA GQ GLT
Sbjct: 85 ---------LATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNEKGQYKRYQGLT 135
Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
W PN+NIFRDPRWGRGQET GEDP + + V+ QG +D
Sbjct: 136 MWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQG----PEDA----------- 180
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSC 282
E D+ L AC KH + +W +R+SFNA I +D +T+ P F+
Sbjct: 181 ------EYDK-----LHACAKHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKEL 226
Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTK 341
+++ +MC+YN+ G P C L Q RN+WGFKG + +DC A+ F+ + + +
Sbjct: 227 VQKAGVKEVMCAYNRFEGDPCCGSNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKKH-E 285
Query: 342 THEDSA---AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
TH D+A A + +G D+ CG + A+ K + E+ I+ ++ + + LG
Sbjct: 286 THPDAAHASADAVLSGTDLECGG-NFKSITDAVKKDLISEEKINTSVKRVLKARFELGEM 344
Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
N + + + +HK+LAL A + +VLL+N+ LPLN+ +A+IGP
Sbjct: 345 NS---THPWSNIPFSVIDCPKHKELALKMAHESLVLLQNNNNILPLNRQM--KVAVIGPN 399
Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEA--------------YVSKTHYASGCHDVPCNSD 504
N+ G Y G P +LL G+ A Y + T S + + +
Sbjct: 400 ANDSVMQWGNYNGFPSHTVTLLEGIRAKLPDAQIIYEPVCGYTNDTTLHSLFNQCSIDGE 459
Query: 505 AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRP---- 560
AGF+ ++ I T + ++ PG + T++ R++ RP
Sbjct: 460 AGFNATYWNNREYKGKIAATDRLTTPFHFSAEGSTVFAPGVGLKNFTAIYRSTFRPTDSG 519
Query: 561 --VILVLTGGG 569
V+T GG
Sbjct: 520 AATFRVMTNGG 530
Score = 110 bits (274), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 92/315 (29%), Positives = 141/315 (44%), Gaps = 63/315 (20%)
Query: 508 HEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTS 557
E + + AD VI G+ E E DR + LP Q ++ + +
Sbjct: 591 REILNKLQSADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNG 650
Query: 558 KRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTW 617
K+ V + +G ++ +IL YPG+AG A+A+++FGD+NP GRLP+T+
Sbjct: 651 KKTVFVNFSGSA---MAIVPETQNCDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITF 707
Query: 618 YPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
Y +S ++P D +M+ GR+YRF T T +Y FG+GLSYT +SY +
Sbjct: 708 Y-KSMQQLPDYEDYSMK--------GRTYRFMTETPLYPFGYGLSYTRFSY------GKA 752
Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
T++ S +K IL I V+N G DG VV ++
Sbjct: 753 TLNQSKLTKGEKAIL-------------------------TIPVSNVGQRDGEEVVQVYI 787
Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSK---EISFGVDPCEQLSIANKHGRRILPL-GNH 792
P +G P+K L GF RV ++AKG +I D E A I PL G +
Sbjct: 788 CRPDDKEG-PQKTLRGFQRV-SIAKGKTQNVQIELPYDSFEWFDAATN---TIRPLNGTY 842
Query: 793 VLMVGELRHSLTIET 807
++ G + ++T
Sbjct: 843 KILYGNSSNEKDLQT 857
>gi|225872720|ref|YP_002754177.1| xylan 1,4-beta-xylosidase [Acidobacterium capsulatum ATCC 51196]
gi|225793233|gb|ACO33323.1| xylann 1,4-beta-xylosidase [Acidobacterium capsulatum ATCC 51196]
Length = 721
Score = 259 bits (663), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 228/782 (29%), Positives = 334/782 (42%), Gaps = 144/782 (18%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
YPF N +LS R L+S +TLQEKIQ L D+ +PRLGIP E LHG A GP
Sbjct: 24 YPFQNPALSPDQRIDDLLSRMTLQEKIQALGDD-PGVPRLGIPG-ALTEEGLHGAAIGGP 81
Query: 112 GVNFNG---TVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEAR-AMYNLGQAGLTFWAP 167
++ G V T FPQ +++ +L + A E R A+ GL AP
Sbjct: 82 A-HWEGRGRAVVPTTQFPQNHGLGQTWDPALLQKAANVEAYETRWAVNKYHDGGLIVRAP 140
Query: 168 NINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGEN---WKSDDGGIGFGFREKRVL 224
N N+ RDPRWGR +E+ GEDP +V AV ++K QG N W++
Sbjct: 141 NANLSRDPRWGRTEESYGEDPYLVGTLAVAWIKGLQGNNPRYWET--------------- 185
Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIE 284
+A KH AY E + S +F + + + PFR IE
Sbjct: 186 ---------------AALMKHFDAYSNEANRDGSSSNFGKRLFYE----YYSVPFRMGIE 226
Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
QG + M SYN NG+P L +WGF G I +D A++ + + +Y KT
Sbjct: 227 QGHSDAFMTSYNAWNGIPMTANPVLKSVVMKKWGFNGIICTDAGALSNMVTHFHYYKTMP 286
Query: 345 DSAAGVLKAGMDINCGTCMLRHTQ---SAIDKGKVQEKDIDRALLNLFSVQLRLGLF--- 398
++AAG + AG++ + R+ Q A+ + + E+ ID+ L ++ V LRLGL
Sbjct: 287 EAAAGAVHAGIN----QFLDRYQQPVEEALQQKLLTEQQIDQDLKGVYRVVLRLGLMDPS 342
Query: 399 -----------NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKN 447
N +P KG D H L + IVLLKN LPL+
Sbjct: 343 SMSPYSMIGLTNDNPAKG-------DPWDWPSHIALDRKVTDESIVLLKNQNHALPLDAK 395
Query: 448 AVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGF 507
+ S+A+IGP N ++ Y+G P + + G+ V V N +
Sbjct: 396 KLHSIAVIGPWANIVAL--DWYSGTPPFGVTPVEGIRQRVGPDV------KVTFNDGSNL 447
Query: 508 HEAVRIAKKADFVIVVAGLDLT------------QETEDRDRVSLLLPGQQMSLVTSVAR 555
A +AK++D IV+ G T + E DR +L LP + ++ A
Sbjct: 448 QAAAALAKQSDEAIVIIGNHPTCDAGWGKCALPSEGKEAFDRTALNLPDESIAKAVYAAN 507
Query: 556 TSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPM 615
++VL P + +A I +IL + + E ALA+++FGD++P GRL
Sbjct: 508 PH---TVVVLQTSFPYTTDWTQA--HIPAILEMAHNSEEQGTALADVLFGDYDPAGRLAQ 562
Query: 616 TWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSE 675
TW PM D N+R GR+Y + +Y FG GLSYT + Y L S
Sbjct: 563 TWVASIGQLPPMMDYNIRD-------GRTYMYLKSKPLYPFGFGLSYTTFKYSNLRLSSH 615
Query: 676 LTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLF 735
+L AG + V + VTN G +G VV ++
Sbjct: 616 -----TLPAGG--------------------------QLTVSVDVTNTGKYNGDEVVQMY 644
Query: 736 ARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIA----NKHGRRILPLGN 791
+ P + L GFDRV ++ ++ P + ++A N H R+ P +
Sbjct: 645 VKHLDSKVSRPLEALKGFDRVSIPVGQTRTVTL---PLKASALAYWDKNTHAFRVEP--D 699
Query: 792 HV 793
HV
Sbjct: 700 HV 701
>gi|298481648|ref|ZP_06999839.1| beta-glucosidase [Bacteroides sp. D22]
gi|298272189|gb|EFI13759.1| beta-glucosidase [Bacteroides sp. D22]
Length = 861
Score = 259 bits (663), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 162/483 (33%), Positives = 243/483 (50%), Gaps = 61/483 (12%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
P+ +TSL+ RA+ L+ LTL+EK+ + + + AIPRLGI YEWW+E+LHG+
Sbjct: 25 PYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGV------ 78
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL-GQAG-------LTF 164
G T FPQ + ASFN SL + +A + EAR + G++G LTF
Sbjct: 79 ----GRAGLATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQGLTF 134
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
W PN+NIFRDPRWGRGQET GEDP + + V+ Q
Sbjct: 135 WTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQ--------------------- 173
Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
G E R D+ L AC KH + +W +R+SF+A I +D +T+ P F+ +
Sbjct: 174 ---GPEDARYDK--LHACAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLV 225
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
++ +MC+YN+ G P C L Q R+EWG+KG + SDC A++ + + T
Sbjct: 226 QKAHVKEVMCAYNRFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTH-GT 284
Query: 343 HED---SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
H D ++A ++AG D+ CG+ A+ G + EK+ID +L L + + LG +
Sbjct: 285 HPDKEHASADAVRAGTDLECGS-EYASLADAVKAGLIDEKEIDISLKRLLTARFELGEMD 343
Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
P + ++ + + EH+ LAL AR+ +VLL+N LPLN + +A++GP
Sbjct: 344 EQP---AWSEIPTSVLNSKEHQALALRMARESLVLLQNKNNILPLNTHL--KVAVMGPNA 398
Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGCHDVPCNSDAGFHEAVRIAKKA 517
N+ G Y GIP +LL + A + + Y GC V + + I K
Sbjct: 399 NDSVMQWGNYNGIPAHTVTLLEAVRAKLPEGQIIYEPGCDRVDGKTLQSLFDECSINGKP 458
Query: 518 DFV 520
F+
Sbjct: 459 GFL 461
Score = 92.8 bits (229), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 71/269 (26%), Positives = 120/269 (44%), Gaps = 53/269 (19%)
Query: 510 AVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKR 559
AV+ AD ++ G+ + E E+ DR + LP Q +L+ ++ + K+
Sbjct: 591 AVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRNLLKALKKAGKK 650
Query: 560 PVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYP 619
V + +G + + +IL YPG+AG A+ + ++G++NPGGRLP+T+Y
Sbjct: 651 VVFINYSGSA---IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707
Query: 620 ESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS 679
+ D +M+ GR+YR+ ++ FGHGLSYT+++Y E +S
Sbjct: 708 DVNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTDFTY------GEAKLS 753
Query: 680 ASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVP 739
+ A + +L I V+N G DG VV ++ R P
Sbjct: 754 KNTIAKGENVVL-------------------------TIPVSNVGQCDGEEVVQVYLRRP 788
Query: 740 KVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
+G P L F RVH A ++ ++
Sbjct: 789 GDKEG-PRYTLRAFKRVHIPAGKTESVAI 816
>gi|317474349|ref|ZP_07933623.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
gi|316909030|gb|EFV30710.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
Length = 877
Score = 259 bits (663), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 165/468 (35%), Positives = 232/468 (49%), Gaps = 63/468 (13%)
Query: 44 CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESL 103
C YPF N L + R +L+SLLTLQEK+ + + + AI RLGIPAY WW E+
Sbjct: 9 CSEAEKYQYPFQNPKLKVEERVDNLMSLLTLQEKVSMMINRSLAIERLGIPAYNWWGEAC 68
Query: 104 HGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL----GQ 159
HG+ + G VT FPQ + AA+F+ S + V+ EARA YN G
Sbjct: 69 HGLIAGG-----------VTVFPQSIALAATFDDSSQLTTYTMVSDEARARYNTLPLDGD 117
Query: 160 AG--------LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDD 211
G LTFWAPNINIFRDPRWGRGQET GEDP ++S + V QG
Sbjct: 118 IGPYVSAIPNLTFWAPNINIFRDPRWGRGQETYGEDPFLMSRMGLNVVLGMQG------- 170
Query: 212 GGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDT 271
D AC KH Y + R+ FNA+++ +D
Sbjct: 171 --------------------DDEHYYKTHACAKH---YGVHSGPEPLRHEFNAVVSMRDL 207
Query: 272 EDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAV 330
+T+ P F + + +G +MC+Y+ G P C L RN WGF G + SDCDA+
Sbjct: 208 WETYLPAFETLVVKGNVREVMCAYSAYEGEPCCASNRLLVDILRNRWGFDGMVVSDCDAI 267
Query: 331 ATIF---EYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLN 387
+ ++ + SA VL G D+ CG A++KG ++E+D+D +L
Sbjct: 268 NDFYVKGRHETHPDAAAASADAVL-TGTDLECGRS-YNALIEAVEKGIIKEQDLDVSLRR 325
Query: 388 LFSVQLRLGLFNGDPRKG-KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNK 446
+ + + RLGL DP K Y + + EH+ AL A + VLLKN+ LPL+K
Sbjct: 326 ILTERFRLGLL--DPAKYVPYSTIPGSVIDCQEHRDHALKMAHESQVLLKNEGNILPLDK 383
Query: 447 NAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYAS 494
N + S+A++GP +N+ M G Y+G P ++L+GL+ + T S
Sbjct: 384 N-IKSIAVVGPNINDSIMMRGNYSGSPTHCITILQGLKNKLPNTRIIS 430
Score = 115 bits (287), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 83/295 (28%), Positives = 137/295 (46%), Gaps = 47/295 (15%)
Query: 505 AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILV 564
A F E K + +I V G+ TQE E +R + LP Q + ++ T K PVI V
Sbjct: 608 AVFEELKEKIKDVEAIIYVGGITPTQEGEGHERAKIELPDVQKRFLKAMHETGK-PVIYV 666
Query: 565 LTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK 624
G ++ A+ D ++L YPG+ G A+A+++FGD+NP G+LP+T+Y +
Sbjct: 667 NCSGSA--IALADIDYAYDALLQAWYPGQEGGTAVADVLFGDYNPSGKLPVTFYKSTEQL 724
Query: 625 VPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKA 684
D +M R+YR++ G Y FG+GLSYT++ + L S+S+KA
Sbjct: 725 PEFTDYSME--------NRTYRYFKGEPQYAFGYGLSYTDFEFG-----EALLSSSSIKA 771
Query: 685 GSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQG 744
G + + I +TN G +DG+ VV ++ + +
Sbjct: 772 GE--------------------------KVEITIPLTNVGKMDGAEVVQVYVK-SLTNPD 804
Query: 745 TPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANK--HGRRILPLGNHVLMVG 797
P K L G+ R A S+++ ++P E + N G ++ P G + ++ G
Sbjct: 805 APIKSLKGYVRQEIKAGKSEKVRITLEP-ESFAYYNADVDGLKVFP-GKYQILYG 857
>gi|261880507|ref|ZP_06006934.1| xylosidase [Prevotella bergensis DSM 17361]
gi|270332847|gb|EFA43633.1| xylosidase [Prevotella bergensis DSM 17361]
Length = 948
Score = 259 bits (663), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 206/749 (27%), Positives = 342/749 (45%), Gaps = 125/749 (16%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + GI S T+FP L ++NR L +G
Sbjct: 161 RLGIPV-DFTNEGIRGIES-----------YKATNFPTQLGLGTTWNRQLIRQVGYITGR 208
Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
EAR + G T +AP +++ RD RWGR +E GE P +V+ ++ + Q
Sbjct: 209 EARLL------GYTNVYAPILDVGRDQRWGRYEEIYGESPFLVAELGIQMTRGLQ----- 257
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
+ +++ KH AY K G + +
Sbjct: 258 --------------------------TDFQVASTAKHFAAYSNNKGGREGMSRVDPQMPP 291
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGD---LFQKARNEWGFKGYITS 325
++ E+ P+ +++ M SYN +G+P ++G L + R+ +GF+GYI S
Sbjct: 292 REVENIHLYPWERVVQEAGLLGAMSSYNDYDGIP--IQGSYHWLTEVLRHRFGFRGYIVS 349
Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDI 381
D DA+ +F + +++ + AG+++ C + + I +G++ I
Sbjct: 350 DSDALEYLFSKHHTAADMKEAVYQAVMAGLNVRCTFRSPDSFVLPLRELIREGRIPMSVI 409
Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF 441
DR + ++ V+ G+F+ +P + K +V + ++ +AL A+RQ IVLLKN +
Sbjct: 410 DRLVGDILRVKFITGIFD-NPYQMNL-KAADQEVNSERNQAVALQASRQSIVLLKNQDRL 467
Query: 442 LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCHD 498
LPL+++ + + + GP ++ S Y + ++L G+ V + YA GC
Sbjct: 468 LPLDRSKLRRILVCGPNADDASYALTHYGPLAVDVTTVLEGIRDKVENNIEVSYAKGCDV 527
Query: 499 V----PCNSDAGF----------HEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPG 544
V P + G+ AV +AK++D IVV G + E++ R SL LPG
Sbjct: 528 VDPHWPESEIIGYPMTSQEQQDIDHAVALAKESDVAIVVLGGNSRTCGENKSRSSLDLPG 587
Query: 545 QQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIF 604
+Q+ L+ +V T K PV+LVL G PL V++A D I +I+ YPG G A+A+++F
Sbjct: 588 RQLDLLKAVQATGK-PVVLVLINGRPLSVNWA--DRFIPAIVEAWYPGSQGGTAVADVLF 644
Query: 605 GDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGT------QVYGFGH 658
GD+NPGG+L +T +P+S ++P N + + Q G + G +Y FGH
Sbjct: 645 GDYNPGGKLTVT-FPKSVGQIP---FNFPSKPASQVDGGNKLGLQGNASRINGALYSFGH 700
Query: 659 GLSYTNYSYKFLSAPSE-LTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQ 717
GLSYT + Y L E +T++ S+ D
Sbjct: 701 GLSYTTFKYSNLRLSKETMTLNDSINISCD------------------------------ 730
Query: 718 ISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLS 777
V+N GD +G VV L+ R S T EK L GFDR+H +K ++F + P E L
Sbjct: 731 --VSNTGDREGDEVVQLYIRDVISSVTTYEKNLRGFDRIHLKPGETKTLTFTIKP-EHLK 787
Query: 778 IANKHGRRILPLGNHVLMVGELRHSLTIE 806
+ NK +++ G +M+G + +E
Sbjct: 788 LVNKDFEKVVEPGEFKIMIGASSEDIRLE 816
>gi|393773722|ref|ZP_10362116.1| glycosyl hydrolase family 3 N terminal domain protein
[Novosphingobium sp. Rr 2-17]
gi|392720897|gb|EIZ78368.1| glycosyl hydrolase family 3 N terminal domain protein
[Novosphingobium sp. Rr 2-17]
Length = 846
Score = 259 bits (662), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 153/425 (36%), Positives = 221/425 (52%), Gaps = 54/425 (12%)
Query: 69 ISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQV 128
I+ L EK+ QL A A +G+PAY+WW+E LHG+A NG T FPQ
Sbjct: 17 IAKADLAEKVGQLQSTAPAAKAMGLPAYDWWNEGLHGLARNG----------EATVFPQA 66
Query: 129 LVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFWAPNINIFRDPRWGRG 180
+ AA+F+ L +G+ ++ EARA +N GLT W+PNINIFRDPRWGRG
Sbjct: 67 IGLAATFDPVLMEKVGTVISTEARAKFNTRPVNADRRIYEGLTIWSPNINIFRDPRWGRG 126
Query: 181 QETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLS 240
QET GEDP + AV F+K QG + RV+
Sbjct: 127 QETYGEDPFLSGTLAVGFIKGLQGTD-----------PAHPRVI---------------- 159
Query: 241 ACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNG 300
A KH + + G R SF+ ++ QD E T+ P FR + +G+ +MC+YN +G
Sbjct: 160 ATPKHFAVHSGPEAG---RDSFDVDVSPQDREATYLPAFRKAVTEGRPLSVMCAYNSTHG 216
Query: 301 VPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINC 359
P C L Q+ R +WGFKG++ SDCDA+A ++ + +Y +++A +KAG D +C
Sbjct: 217 TPVCADDALINQRLRKDWGFKGFVVSDCDAIANVWMFHHYAADAAEASAASIKAGTDFDC 276
Query: 360 GTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSE 419
G Q A+++G V E IDRAL + LG+ G +G++ P V T
Sbjct: 277 GNTYAALPQ-AVERGLVDEATIDRALARSLQARHSLGIAFG--AANPWGRIKPSQVHTLA 333
Query: 420 HKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSL 479
+ LAL+AAR+ IVLL+ND LPL + LA++G +++S + Y G P +
Sbjct: 334 SRALALEAARKSIVLLQNDNARLPLKPG--TKLAVVGANADDLSVIEASYHGTAADPITP 391
Query: 480 LRGLE 484
L+G+
Sbjct: 392 LQGIR 396
Score = 114 bits (286), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 94/311 (30%), Positives = 141/311 (45%), Gaps = 63/311 (20%)
Query: 503 SDAGFHEAVRIAKKADFVIVVAGLDLTQETE----------DRDRVSLLLPGQQMSLVTS 552
+D EA+ + AD ++ V GL E E DR + LP Q L+ +
Sbjct: 558 ADVQRAEAIAATRNADAIVAVLGLSPDLEGEALQVQVPGFVGGDRSDIALPRPQAQLLEA 617
Query: 553 VARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGR 612
+ T K PVI+VLT G + + AD+ +++ W YPGE G ALAE + G NP GR
Sbjct: 618 LHATGK-PVIVVLTSGSAIAIDPKLADAVLAA--W--YPGEEGGTALAETLGGINNPSGR 672
Query: 613 LPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSA 672
LP+T+Y + D +M+ RSYR++TGT ++GFGHGLSYTN+ Y A
Sbjct: 673 LPVTFYRSTSDLPAYVDYSMKE--------RSYRYFTGTPLWGFGHGLSYTNFGY---DA 721
Query: 673 PSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVV 732
P + D V+ V ++V N G G VV
Sbjct: 722 PKAVATGIG----------------------DPVS--------VTVTVHNTGVRAGEDVV 751
Query: 733 MLFARVPKV------SQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRI 786
+ P + +Q ++QL F R+ ++ I+ +DP E +S+ ++ G R
Sbjct: 752 QAYVVPPTIEPEPIMTQAVLQRQLAAFQRIALAPGQARAITLNLDPRE-ISVVDRRGIRR 810
Query: 787 LPLGNHVLMVG 797
L G + + VG
Sbjct: 811 LVPGEYRVWVG 821
>gi|319787180|ref|YP_004146655.1| glycoside hydrolase [Pseudoxanthomonas suwonensis 11-1]
gi|317465692|gb|ADV27424.1| glycoside hydrolase family 3 domain protein [Pseudoxanthomonas
suwonensis 11-1]
Length = 903
Score = 259 bits (662), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 159/435 (36%), Positives = 220/435 (50%), Gaps = 64/435 (14%)
Query: 68 LISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQ 127
L++ LT +EK+ QL + A AIPRLG+P Y WW+ESLHG P T+F +
Sbjct: 54 LVAKLTTEEKLDQLLNTAPAIPRLGVPHYNWWTESLHGAMGTLP----------TTNFAE 103
Query: 128 VLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ---------AGLTFWAPNINIFRDPRWG 178
+ AA+F+ L +G +A E R ++ L + GL W+PNINIFRDPRWG
Sbjct: 104 PIGLAATFDDGLVKRVGQVIATEVRGLHALARQTGRTGRMGTGLNTWSPNINIFRDPRWG 163
Query: 179 RGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELM 238
RGQET GEDP + + V +VK QGE+ + D
Sbjct: 164 RGQETYGEDPFLAARMGVAYVKGVQGEDPRWYD--------------------------- 196
Query: 239 LSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQV 298
+ A KH Y + +R+S N ++ D EDT+ P FR+ I +G A +MC+YN+V
Sbjct: 197 IIATPKH---YAVHSGPESTRHSANVYVSRHDLEDTYLPAFRAAIVEGGAGSVMCAYNRV 253
Query: 299 NGVPACLRGDLFQK--ARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMD 356
+G PAC DL K R W FKGY+ SDCDAV I + +Y + A +++G+D
Sbjct: 254 DGQPACAN-DLLLKDYLRGAWDFKGYVVSDCDAVTDINRHHHYAPDAASAVAAAMRSGVD 312
Query: 357 INCGTCML-------RHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGK 409
C L R Q A+++G + D+DRAL+ LF+ + R G G R
Sbjct: 313 NECNGATLVDMAGLARPYQEALERGLISMADVDRALVRLFAARYRNGDLPGL-RPLSTET 371
Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY 469
P DV EH LAL+ A + +VLLKN LPL A LA++GPL + + G Y
Sbjct: 372 ASPADVGKPEHAALALEVAEKSLVLLKN-SGVLPLRPQA--KLAVVGPLADATRVLRGNY 428
Query: 470 TGIPCSPK-SLLRGL 483
+ P S+L GL
Sbjct: 429 SSPQSGPPVSVLEGL 443
Score = 117 bits (294), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 91/294 (30%), Positives = 138/294 (46%), Gaps = 54/294 (18%)
Query: 514 AKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVIL 563
A ++D V+ V GL E E+ D+ SL L Q +L+ + AR + +P+++
Sbjct: 629 AAQSDVVVAVVGLTSDLEAEEAPVEVPGFKGGDKTSLDLLADQQALLEA-ARATGKPLVV 687
Query: 564 VLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT 623
V+ G P+++++A ++I+ YPG++G A+ ++ G NP GRLP+T+Y
Sbjct: 688 VVMNGSPVNLAWAR--QHAAAIIEAWYPGQSGGLAIGNVLAGHANPAGRLPLTFYRSVDD 745
Query: 624 KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLK 683
P +D +MR GR+YR++ G VY FGHGLSYT++ Y P +L +
Sbjct: 746 LPPFDDYDMR--------GRTYRYFEGKPVYPFGHGLSYTSFGY----GPLKLEPA---- 789
Query: 684 AGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ 743
AG L+ T T LR N G G V L+ P V
Sbjct: 790 AGGAHEGLKVT---------------TELR--------NTGKRAGDEVAQLYLDFPDVD- 825
Query: 744 GTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
G P L GF RVH + + F +DP LS G+R + GN+ + VG
Sbjct: 826 GLPRIALRGFQRVHLAPGERRVLEFQLDP-RDLSAVTPEGQRQVFAGNYRVFVG 878
>gi|383115617|ref|ZP_09936373.1| hypothetical protein BSGG_2514 [Bacteroides sp. D2]
gi|313694979|gb|EFS31814.1| hypothetical protein BSGG_2514 [Bacteroides sp. D2]
Length = 946
Score = 259 bits (662), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 209/735 (28%), Positives = 339/735 (46%), Gaps = 115/735 (15%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ S T+FP L ++NR L +G
Sbjct: 153 RLGIPT-DFTNEGIRGVES-----------YKATNFPTQLGLGHTWNRELIRQVGVITGR 200
Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
EAR + G T +AP +++ RD RWGR +E GE P +V+ +E V+ Q
Sbjct: 201 EARML------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGMQ----- 249
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
+ ++A KH IAY K G + ++
Sbjct: 250 --------------------------QDYQVAATGKHFIAYSNNKGGREGMSRVDPQMSP 283
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVP-ACLRGDLFQKARNEWGFKGYITSDC 327
++ E PF+ I + +M SYN +G P L + R E GF+GY+ SD
Sbjct: 284 REVEMVHVYPFKRVIREAGLLGVMSSYNDYDGFPIQSSYYWLTTRLRGEMGFRGYVVSDS 343
Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDIDR 383
DAV ++ N K +++ ++AG+++ C + + + +G + E+ I+
Sbjct: 344 DAVEYLYTKHNTAKDMKEAVRQSVEAGLNVRCTFRSPDSYVLPLRELVKEGGLSEEVIND 403
Query: 384 ALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLP 443
+ ++ V+ +GLF+ + G ++V +E++++AL A+R+ IVLLKND+ LP
Sbjct: 404 RVRDILRVKFLVGLFDHPYQTDLKG--ADEEVEKAENEEVALQASRESIVLLKNDQDVLP 461
Query: 444 LNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCHDVP 500
L+ + + +A+ GP + S G Y + S+L+G++ + Y+ GC V
Sbjct: 462 LDISGIKKIAVCGPNADECSYALGHYGPLAVEVTSVLKGIQEKTDGKVEVLYSKGCELVD 521
Query: 501 CN--------------SDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQ 546
N AV AK+AD +VV G E++ R SL LPG+Q
Sbjct: 522 ANWPESELIDFPLTEEEQKEIDRAVSQAKEADVAVVVLGGGQRTCGENKSRSSLDLPGRQ 581
Query: 547 MSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD 606
+ L+ +V T K PV+LVL G PL +++ AD + +IL YPG G KA+A+++FGD
Sbjct: 582 LDLLKAVVATGK-PVVLVLINGRPLSINW--ADKFVPAILEAWYPGAKGGKAVADVLFGD 638
Query: 607 FNPGGRLPMTWYPESFTKVPMN---DMNMRADSSRQYPGRSYRFYTGT-QVYGFGHGLSY 662
+NPGG+L +T +P++ ++P N + + D + PG +Y FGHGLSY
Sbjct: 639 YNPGGKLTVT-FPKTVGQIPFNFPCKPSSQIDGGKN-PGMDGNMSRANGALYAFGHGLSY 696
Query: 663 TNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTN 722
T++ Y S+L I+ ++ + K +V VTN
Sbjct: 697 TSFEY------SDLKITPAVITPNQKT-------------------------YVTCKVTN 725
Query: 723 AGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
G G VV L+ R S T EK L GF+R+H +KE+ F +D + L + N
Sbjct: 726 TGKRAGDEVVQLYVRDVLSSVTTYEKNLAGFERIHLKPGETKEVFFPID-RKALELLNAD 784
Query: 783 GRRILPLGNHVLMVG 797
++ G+ LMVG
Sbjct: 785 MHWVVEPGDFTLMVG 799
>gi|336415490|ref|ZP_08595829.1| hypothetical protein HMPREF1017_02937 [Bacteroides ovatus
3_8_47FAA]
gi|335940369|gb|EGN02236.1| hypothetical protein HMPREF1017_02937 [Bacteroides ovatus
3_8_47FAA]
Length = 863
Score = 259 bits (662), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 177/551 (32%), Positives = 259/551 (47%), Gaps = 79/551 (14%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
YP+ +T L+ RA L+ LTL+EK+ + +N+ AIPRLGI YEWW+E+LHG+A G
Sbjct: 26 YPYQDTKLTAEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARAG- 84
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARA----MYNLGQ----AGLT 163
T FPQ + AASFN L + AV+ EARA GQ GLT
Sbjct: 85 ---------LATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNERGQYKRYQGLT 135
Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
W PN+NIFRDPRWGRGQET GEDP + + V+ QG +D
Sbjct: 136 MWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQG----PEDA----------- 180
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSC 282
E D+ L AC KH + +W +R+SFNA I +D +T+ P F+
Sbjct: 181 ------EYDK-----LHACAKHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKEL 226
Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTK 341
+++ +MC+YN+ G P C L Q RN+WGFKG + +DC A+ F+ + + +
Sbjct: 227 VQKAGVKEVMCAYNRFEGDPCCGSNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKKH-E 285
Query: 342 THEDSA---AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
TH D+A A + G D+ CG + A+ KG + E+ I+ ++ L + LG
Sbjct: 286 THPDAAHASADAVLNGTDLECGG-NFKSITDAVKKGLISEEKINTSVKRLLKARFELGEM 344
Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
N + + + +HK+LAL A + +VLL+N LPLN+ +A+IGP
Sbjct: 345 N---PTHPWSNIPYSVINCPKHKELALKMAHESLVLLQNKNNILPLNRQM--KVAVIGPN 399
Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEA--------------YVSKTHYASGCHDVPCNSD 504
N+ G Y G P +LL G+ A Y + T S + +
Sbjct: 400 ANDSVMQWGNYNGFPSHTVTLLEGIRAKLPDAQIIYEPVCGYTNDTTLHSLFNQCSIDGK 459
Query: 505 AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRP---- 560
AGF ++ I T + ++ PG + T++ R++ RP
Sbjct: 460 AGFSATYWNNREYKGKIAATDRLTTPFHFSAEGSTVFAPGVGLKNFTAIYRSTFRPTDSG 519
Query: 561 --VILVLTGGG 569
V+T GG
Sbjct: 520 AATFRVMTNGG 530
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 93/314 (29%), Positives = 142/314 (45%), Gaps = 63/314 (20%)
Query: 509 EAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSK 558
E + K AD VI G+ E E DR + LP Q ++ + + K
Sbjct: 592 EVLNKLKNADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGK 651
Query: 559 RPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWY 618
+ V + +G ++ +IL YPG+AG A+A+++FG++NP GRLP+T+Y
Sbjct: 652 KTVFVNFSGSA---MAIVPETQSCDAILQAWYPGQAGGTAVADVLFGNYNPAGRLPITFY 708
Query: 619 PESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELT 677
+S ++P D +M+ GR+YRF T T +Y FG+GLSYT +SY + T
Sbjct: 709 -KSIQQLPDYEDYSMK--------GRTYRFMTETPLYPFGYGLSYTRFSY------GKAT 753
Query: 678 ISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFAR 737
++ S A +K IL I V+N G DG VV ++
Sbjct: 754 LNQSKLAKGEKAIL-------------------------TIPVSNVGQRDGEEVVQVYIC 788
Query: 738 VPKVSQGTPEKQLIGFDRVHTVAKGSKE---ISFGVDPCEQLSIANKHGRRILPL-GNHV 793
P +G P+K L GF RV+ +AKG + I D E A I PL G +
Sbjct: 789 RPD-DKGGPQKTLRGFQRVN-IAKGKTQNVNIELPYDSFEWFDTATN---TIRPLSGTYK 843
Query: 794 LMVGELRHSLTIET 807
++ G + ++T
Sbjct: 844 ILYGNSSNENDLQT 857
>gi|170288668|ref|YP_001738906.1| glycoside hydrolase family 3 protein [Thermotoga sp. RQ2]
gi|170176171|gb|ACB09223.1| glycoside hydrolase family 3 domain protein [Thermotoga sp. RQ2]
Length = 778
Score = 259 bits (661), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 218/752 (28%), Positives = 343/752 (45%), Gaps = 143/752 (19%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIPA E L G G T+FPQ + A++++ L + +A+
Sbjct: 92 RLGIPAM-IHEECLTGYMGLGG-----------TNFPQAIAMASTWDPDLIEKMTTAIRE 139
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
+ R + GL AP +++ RDPRWGR +ET GE P +V+ V +VK QGE+ K
Sbjct: 140 DMRKIG--AHQGL---APVLDVARDPRWGRTEETFGESPYLVARMGVSYVKGLQGEDIK- 193
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG-NFSRYSFNAMITE 268
KG + A KH Y + G N++ + I E
Sbjct: 194 ---------------KG------------VVATVKHFAGYSASEGGKNWAPTN----IPE 222
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVP-ACLRGDLFQKARNEWGFKGYITSDC 327
++ ++ F PF + +++ +M SY++++GVP A R L R +WGF+G + SD
Sbjct: 223 REFKEVFLFPFEAAVKEANVLSVMNSYSEIDGVPCAANRKLLTDILRKDWGFEGIVVSDY 282
Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINC-GTCMLRHTQSAIDKGKVQEKDIDRALL 386
AV + +Y + ++A L+AG+D+ T ++ + ++KG + E ID A+
Sbjct: 283 FAVKVLEDYHRIARDKSEAARLALEAGIDVELPKTECYQYLKDLVEKGIISEALIDEAVA 342
Query: 387 NLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNK 446
+ ++ LGLF + + K+ H+ +AL+ AR+ I+LLKND LPL+K
Sbjct: 343 RVLRLKFMLGLFENPYVEVEKAKI-------ESHRDIALEIARKSIILLKNDG-ILPLSK 394
Query: 447 NAVSSLAIIGPLVNNISQMGGGYT----------------GIPCSPKS----LLRGLEAY 486
+A+IGP + + G Y G P P+ L + +E +
Sbjct: 395 E--KKVALIGPNAGEVRNLLGDYMYLAHIRALLDNIDDVFGNPQIPRENYERLKKSIEEH 452
Query: 487 VSKT--------------HYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG------L 526
+ YA GC +V +GF EA+ IAKK+D IVV G L
Sbjct: 453 MKSIPSVLDAFKEEGIEFEYAKGC-EVTGEDRSGFEEAIEIAKKSDVAIVVVGDKSGLTL 511
Query: 527 DLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSIL 586
D T E RD +L LPG Q LV VA+T K PV+LVL G P S ++++IL
Sbjct: 512 DCTT-GESRDMANLKLPGVQEELVLEVAKTGK-PVVLVLITGRPY--SLKNVVDKVNAIL 567
Query: 587 WIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYR 646
+ PGEAG +++ +II+G NP G+LP++ +P S ++P+ ++ R + Y
Sbjct: 568 QVWLPGEAGGRSIVDIIYGKVNPSGKLPIS-FPRSAGQIPVFHY-VKPSGGRSHWHGDYV 625
Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLS-APSELTISASLKAGSDKNILQQTGSRLDYVHIDE 705
+ ++ FGHGLSYT + Y L P E+ + +
Sbjct: 626 DESTKPLFPFGHGLSYTKFEYSNLRIEPKEVPPAGEVV---------------------- 663
Query: 706 VTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKE 765
+++ V N GD DG VV L+ S P K+L GF RV AK K
Sbjct: 664 ----------IKVDVENTGDRDGDEVVQLYIGREFASVTRPVKELKGFKRVSLKAKEKKT 713
Query: 766 ISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
+ F + + L+ ++ + ++ G +MVG
Sbjct: 714 VVFRLH-MDVLAYYDRDMKLVVEPGEFRVMVG 744
>gi|299148437|ref|ZP_07041499.1| beta-glucosidase [Bacteroides sp. 3_1_23]
gi|298513198|gb|EFI37085.1| beta-glucosidase [Bacteroides sp. 3_1_23]
Length = 863
Score = 259 bits (661), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 177/551 (32%), Positives = 259/551 (47%), Gaps = 79/551 (14%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
YP+ +T L+ RA L+ LTL+EK+ + +N+ AIPRLGI YEWW+E+LHG+A G
Sbjct: 26 YPYQDTKLTAEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARAG- 84
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARA----MYNLGQ----AGLT 163
T FPQ + AASFN L + AV+ EARA GQ GLT
Sbjct: 85 ---------LATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNERGQYKRYQGLT 135
Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
W PN+NIFRDPRWGRGQET GEDP + + V+ QG +D
Sbjct: 136 MWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQG----PEDA----------- 180
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSC 282
E D+ L AC KH + +W +R+SFNA I +D +T+ P F+
Sbjct: 181 ------EYDK-----LHACAKHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKEL 226
Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTK 341
+++ +MC+YN+ G P C L Q RN+WGFKG + +DC A+ F+ + + +
Sbjct: 227 VQKAGVKEVMCAYNRFEGDPCCGSNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKKH-E 285
Query: 342 THEDSA---AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
TH D+A A + G D+ CG + A+ KG + E+ I+ ++ L + LG
Sbjct: 286 THPDAAHASADAVLNGTDLECGG-NFKSITDAVKKGLISEEKINTSVKRLLKARFELGEM 344
Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
N + + + +HK+LAL A + +VLL+N LPLN+ +A+IGP
Sbjct: 345 N---PTHPWSNIPYSVINCPKHKELALKMAHESLVLLQNKNNILPLNRQM--KVAVIGPN 399
Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEA--------------YVSKTHYASGCHDVPCNSD 504
N+ G Y G P +LL G+ A Y + T S + +
Sbjct: 400 ANDSVMQWGNYNGFPSHTVTLLEGIRAKLPDAQIIYEPVCGYTNDTTLHSLFNQCSIDGK 459
Query: 505 AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRP---- 560
AGF ++ I T + ++ PG + T++ R++ RP
Sbjct: 460 AGFSATYWNNREYKGKIAATDRLTTPFHFSAEGSTVFAPGVGLKNFTAIYRSTFRPTDSG 519
Query: 561 --VILVLTGGG 569
V+T GG
Sbjct: 520 AATFRVMTNGG 530
Score = 113 bits (282), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 94/314 (29%), Positives = 142/314 (45%), Gaps = 63/314 (20%)
Query: 509 EAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSK 558
E + K AD VI G+ E E DR + LP Q ++ + + K
Sbjct: 592 EVLNKLKNADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGK 651
Query: 559 RPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWY 618
+ V + +G ++ +IL YPG+AG A+A+++FGD+NP GRLP+T+Y
Sbjct: 652 KTVFVNFSGSA---MAIVPETQSCDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITFY 708
Query: 619 PESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELT 677
+S ++P D +M+ GR+YRF T T +Y FG+GLSYT +SY + T
Sbjct: 709 -KSIQQLPDYEDYSMK--------GRTYRFMTETPLYPFGYGLSYTRFSY------GKAT 753
Query: 678 ISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFAR 737
++ S A +K IL I V+N G DG VV ++
Sbjct: 754 LNQSKLAKGEKAIL-------------------------TIPVSNVGQRDGEEVVQVYIC 788
Query: 738 VPKVSQGTPEKQLIGFDRVHTVAKGSKE---ISFGVDPCEQLSIANKHGRRILPL-GNHV 793
P +G P+K L GF RV+ +AKG + I D E A I PL G +
Sbjct: 789 RPD-DKGGPQKTLRGFQRVN-IAKGKTQNVNIELPYDSFEWFDTATN---TIRPLSGTYK 843
Query: 794 LMVGELRHSLTIET 807
++ G + ++T
Sbjct: 844 ILYGNSSNENDLQT 857
>gi|295135996|ref|YP_003586672.1| beta-glucosidase [Zunongwangia profunda SM-A87]
gi|294984011|gb|ADF54476.1| putative beta-glucosidase [Zunongwangia profunda SM-A87]
Length = 796
Score = 259 bits (661), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 216/754 (28%), Positives = 340/754 (45%), Gaps = 131/754 (17%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP E++HG + G T FP + A+++N L + +A
Sbjct: 126 RLGIPLL-LEEEAMHGHMAVG-----------TTVFPTAIGQASTWNPDLIKKMAHVIAK 173
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
E RA Q T + P I+I R+PRW R +ET GEDP +++ V FQG +
Sbjct: 174 EIRA-----QGSNTAYGPIIDIAREPRWSRVEETFGEDPYLIAEMGKSMVTGFQGSH--- 225
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
ESD ++A KH AY + + G+ I ++
Sbjct: 226 --------------------ESDLKSNEHVAATLKHFAAYGVSEGGH---NGAAVHIGQR 262
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKARNE-WGFKGYITSDCD 328
D + P + ++ G S +M +Y+ ++GVP+ +L E WGFKG++ SD
Sbjct: 263 DLFQNYMYPVKEAVDNGVMS-VMTAYSSIDGVPSTAHKNLLTNILKEKWGFKGFVISDLA 321
Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQ-SAIDKGKVQEKDIDRALLN 387
++ + + T ED+AA + AG+D++ G A++ GKV E+ ID A+
Sbjct: 322 SIEGLLGDHHIVDTEEDAAAMAMNAGVDVDLGGNGYDDALIDAVNAGKVAEERIDEAVRR 381
Query: 388 LFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKN 447
+ +V+ +LGLF +P + K V SEH +LA + ARQ I +LKN+ LPLNK
Sbjct: 382 ILTVKFKLGLFE-NPYANE--KQAEKIVRNSEHIELAREVARQSITMLKNEDNILPLNKE 438
Query: 448 AVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKT-----HYASGCHDVPCN 502
+ ++A+IG + G YT P S ++++ LE K Y G V
Sbjct: 439 -LQNIAVIGSNADMQYNQLGDYTA-PQSEENIITVLEGIQHKMPNANIEYVKGTA-VRDT 495
Query: 503 SDAGFHEAVRIAKKADFVIVVAG--------------------------LDLTQETEDRD 536
+ AV AK A+ IVV G L + E D
Sbjct: 496 TQTNIPAAVEAAKNAEVAIVVLGGSSARDFKTEYLETGAATISSKEDQVLSDMESGEGYD 555
Query: 537 RVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGA 596
R +L L G+Q+ L+ +V T P +LVL G PL +++ + + W YPG+ G
Sbjct: 556 RSTLNLMGKQLELLQAVVATGT-PTVLVLIKGRPLLLNWPAENVPVILDAW--YPGQEGG 612
Query: 597 KALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQ---V 653
A+A++IFGDFNP GRLP++ P+S ++P+ + +P R R Y T +
Sbjct: 613 SAIADVIFGDFNPAGRLPVS-VPKSLGQIPVY-------YNYWFPNR--RDYVETDAKPL 662
Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
Y FG+GLSY+ + Y S+L ++ S G +N +
Sbjct: 663 YPFGYGLSYSEFKY------SDLKVATS---GKGRNT----------------------K 691
Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPC 773
+ + ++N VDG V+ L+ R + +P KQL F+RV A +K + F + P
Sbjct: 692 IEISLKISNTSKVDGDEVIQLYIRDMVSTVLSPVKQLRAFERVSIKAGETKTVQFELLPK 751
Query: 774 EQLSIANKHGRRILPLGNHVLMVGELRHSLTIET 807
E LS+ + ++ + G LM+G + +ET
Sbjct: 752 E-LSLFDTEMKQKVQAGEFKLMIGASSEDIRLET 784
>gi|270296173|ref|ZP_06202373.1| conserved hypothetical protein [Bacteroides sp. D20]
gi|270273577|gb|EFA19439.1| conserved hypothetical protein [Bacteroides sp. D20]
Length = 942
Score = 259 bits (661), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 212/756 (28%), Positives = 343/756 (45%), Gaps = 129/756 (17%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ S T+FP L ++NR L +G
Sbjct: 153 RLGIPV-DFTNEGIRGVES-----------YRATNFPTQLGLGHTWNRELIRQVGLITGR 200
Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
EAR + G T +AP +++ RD RWGR +E GE P +V+ +E V+ Q +
Sbjct: 201 EARML------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGLQHNH-- 252
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
++A KH AY K + ++
Sbjct: 253 -----------------------------QVAATGKHFAAYSNNKGAREGMARVDPQMSP 283
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF---QKARNEWGFKGYITS 325
++ E+ PF+ I + +M SYN +G+P ++G + + R E GF+GY+ S
Sbjct: 284 REVENIHIYPFKRVIREAGMLGVMSSYNDYDGIP--VQGSYYWLTTRLRGEMGFRGYVVS 341
Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDI 381
D DAV ++ K +++ ++AG+++ C + + + +G + E+ I
Sbjct: 342 DSDAVEYLYTKHGTAKDMKEAVRQSVEAGLNVRCTFRSPDSFVLPLRELVKEGGLSEEVI 401
Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF 441
+ + ++ V+ +GLF+ + G +V E++ +AL A+R+ IVLLKN +
Sbjct: 402 NDRVRDILRVKFLIGLFDAPYQTDLAG--ADREVEKEENEAIALQASRESIVLLKNAGEL 459
Query: 442 LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCH- 497
LPL+ N+ +A+ GP N Y + ++L G++ ++ Y GC
Sbjct: 460 LPLDINSTKKIAVCGPNANEEGYALTHYGPLAVEVTTVLEGIQEKTKGKAEVLYTKGCDL 519
Query: 498 -----------DVPCNSD--AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPG 544
D P D A +AV A++AD IVV G E++ R SL LPG
Sbjct: 520 VDAHWPESEIIDYPLTDDEQAEIDKAVENARQADVAIVVLGGGQRTCGENKSRTSLDLPG 579
Query: 545 QQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIF 604
+Q+ L+ ++ T K PV+L+L G PL +++A D + +IL YPG G ALA+I+F
Sbjct: 580 RQLQLLQAIQATGK-PVVLILINGRPLSINWA--DKFVPAILEAWYPGSKGGTALADILF 636
Query: 605 GDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQ------VYGFGH 658
GD+NPGG+L +T +P++ ++P N S Q G TG +Y FG+
Sbjct: 637 GDYNPGGKLTVT-FPKTVGQIP---FNFPCKPSSQIDGGKNPGPTGNMSRINGALYPFGY 692
Query: 659 GLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQI 718
GLSYT + Y S+L I+ + I S T V++
Sbjct: 693 GLSYTTFEY------SDLDITPRV--------------------ITPNESAT-----VRL 721
Query: 719 SVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
VTN G G VV L+ R S T EK L GF R+H ++E+SF +D + L +
Sbjct: 722 KVTNTGKRAGDEVVQLYIRDVLSSITTYEKNLAGFQRIHLEPGEAQELSFTID-RKHLEL 780
Query: 779 ANKHGRRILPLGNHVLMVG------ELRHSLTIETY 808
+ + ++ G+ VLM G L +LT+E Y
Sbjct: 781 LDADMKWVVEPGDFVLMAGASSEDIRLNGTLTVEDY 816
>gi|404405497|ref|ZP_10997081.1| glycoside hydrolase family protein [Alistipes sp. JC136]
Length = 804
Score = 258 bits (660), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 206/711 (28%), Positives = 313/711 (44%), Gaps = 118/711 (16%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP E+ +E +HG+ S T P + +++NR+L G
Sbjct: 145 RLGIPV-EFTNEGIHGLNH-----------SRATPLPAPIAIGSTWNRALVHRAGEIAGH 192
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
EAR LG + +AP +++ RDPRWGR E GEDP +++ VE V+ Q +
Sbjct: 193 EARV---LGYKNV--YAPILDVARDPRWGRVVECYGEDPFLIAELGVEMVRGIQSQG--- 244
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
+++ KH AY + K G + I +
Sbjct: 245 -----------------------------VASTLKHYAAYSVPKGGRDGNCRTDPHIAPR 275
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACL-RGDLFQKARNEWGFKGYITSDCD 328
+ + PFR I + +M SYN +GVP R L R+E+GF GY+ SD +
Sbjct: 276 ELHQMYLYPFRRVIRESGPMGVMSSYNDWDGVPVTASRYFLTDLLRHEYGFDGYVVSDSE 335
Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDIDRA 384
AV + +T+ED+ VL+AG+++ + + + +G++ + +D+
Sbjct: 336 AVEYVHTKHAVAETYEDAVRQVLEAGLNVRTNFSPPARFILPVRKLVREGRLSMEVVDQR 395
Query: 385 LLNLFSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF 441
+ + V+ RLGLF+ DPR+ + G D +H+ LD RQ +VLLKN+ K
Sbjct: 396 VREVLRVKFRLGLFDNPYNDPREA-VAEAGAD-----KHRDFVLDIQRQSLVLLKNEDKT 449
Query: 442 LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCHD 498
LPL+K + + + GPL + + M Y ++L G+ Y+ ++ YA GC
Sbjct: 450 LPLDKKKTARVLVAGPLADEDNFMISRYGPNDLPTVTVLDGIRNYLGDGAEVRYAKGCDV 509
Query: 499 VPCN--------------SDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPG 544
V AG +EAV+ A D ++ V G D + E R SL LPG
Sbjct: 510 VDAGFPDSELTATPLTAAERAGINEAVKQAAGCDVIVAVLGEDDERVGESHSRTSLELPG 569
Query: 545 QQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIF 604
+Q L+ ++ T PV+LVL G PL V++A + + +IL +P G A+AE +F
Sbjct: 570 RQQQLLEALHATGV-PVVLVLINGQPLTVNWAAQN--VPAILEGWFPSVEGGTAIAETLF 626
Query: 605 GDFNPGGRLPMTWYPESFTKVPMN---DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLS 661
GD+NPGG+L +T +P S ++ +N R+ P +Y FG+GLS
Sbjct: 627 GDYNPGGKLTIT-FPRSTGQIELNFPYKKGSHGAQPRKGPNGGGVTRVLGSIYPFGYGLS 685
Query: 662 YTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVT 721
YT ++YK L E S T F V VT
Sbjct: 686 YTTFAYKNLRIAPE-------------------------------PSRTQGSFRVSCEVT 714
Query: 722 NAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDP 772
N GD G VV L+ S T E L GF+RV +K +SF V P
Sbjct: 715 NTGDRRGDEVVQLYISDKFSSVVTYESVLRGFERVTLEPGETKTVSFEVTP 765
>gi|254786805|ref|YP_003074234.1| glycoside hydrolase family 3 domain-containing protein
[Teredinibacter turnerae T7901]
gi|237686035|gb|ACR13299.1| glycoside hydrolase family 3 domain protein [Teredinibacter
turnerae T7901]
Length = 888
Score = 258 bits (660), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 161/460 (35%), Positives = 238/460 (51%), Gaps = 62/460 (13%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ +T+L I TR L+S + L EKI Q+ + + AI LGI Y+WW+E+LHG+A G
Sbjct: 47 YMDTTLDIDTRVDDLVSRMDLAEKISQMYNESPAIEHLGIAEYDWWNEALHGVARAG--- 103
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN--------LGQAGLTFW 165
T FPQ + AA ++R +I AV+ EARA ++ GLTFW
Sbjct: 104 -------KATVFPQAIGMAAMWDRETMFDIAEAVSDEARAKHHYFVENGVHFRYTGLTFW 156
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
+PNINIFRDPRWGRGQET GEDP + A+ ++ QGEN K
Sbjct: 157 SPNINIFRDPRWGRGQETYGEDPYLTGELALPYISGLQGENPKY---------------- 200
Query: 226 GFGEESDRGDELMLSACCKHLIAYD-LEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIE 284
L +A KH + EK SR+S N + + +D +T+ P F +
Sbjct: 201 -----------LKTAAMAKHFAVHSGPEK----SRHSDNYIASPKDLNETYLPAFEKAVV 245
Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIF--EYQNYTK 341
+G +MC+YN+VN PAC L ++ R +WGFKG++ SDC A+A + E +
Sbjct: 246 EGDVESVMCAYNRVNDEPACGNDMLLKETLRGKWGFKGHVVSDCGAIADFYAPEAHHVVM 305
Query: 342 THEDSAAGVLKAGMDINCGTCMLR---HTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
+AA +++G D+NCGT L + A+ + + + +ID+++ L + +LG+F
Sbjct: 306 APAAAAAWAVRSGTDLNCGTDRLSTFANLHFALQREMITQDEIDQSVKRLMKTRFKLGMF 365
Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
+ D + Y K+ D V + H L AA + VLLKN LPL K+ S +AIIGP
Sbjct: 366 DPDDQV-PYSKIPMDVVGSQAHLALTQKAAEKSFVLLKN-SGILPLKKS--SKVAIIGPN 421
Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGC 496
N + + G Y G P P + L G++ Y+ + + YA G
Sbjct: 422 ATNPTVLVGNYFGDPIKPVTPLDGIQQYLGEENVFYAPGS 461
Score = 98.6 bits (244), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 79/265 (29%), Positives = 128/265 (48%), Gaps = 50/265 (18%)
Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
DR + LP Q L+ ++ + +K P++LV G + +++A ++ + +IL YPGEA
Sbjct: 650 DRTDIRLPEPQRKLLATLKKLNK-PIVLVNFSGSAIALNWA--NNNVDAILQGFYPGEAT 706
Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVY 654
ALA I++G+ +P GRLP+T+Y S +P D M R+Y++Y G +Y
Sbjct: 707 GTALARILWGEVSPSGRLPITFY-RSLDDLPGFKDYAMT--------NRTYKYYQGDVLY 757
Query: 655 GFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRF 714
FG+GLSYT ++Y LSAP A++ +G I Q
Sbjct: 758 PFGYGLSYTQFAYSELSAP------ATMASGEPLAITAQ--------------------- 790
Query: 715 HVQISVTNAGDVDGSHVVMLFA--RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDP 772
V+N+G V VV ++ +VP +S P+++L F R++ S+ + F +
Sbjct: 791 -----VSNSGKVASDEVVQVYVSMKVPGLS--LPQRELKEFKRIYLEPGASQTVEFSI-A 842
Query: 773 CEQLSIANKHGRRILPLGNHVLMVG 797
+ LS + G R G L VG
Sbjct: 843 GKDLSYVDDQGVRHPYHGPLTLSVG 867
>gi|160885419|ref|ZP_02066422.1| hypothetical protein BACOVA_03419 [Bacteroides ovatus ATCC 8483]
gi|156109041|gb|EDO10786.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus ATCC 8483]
Length = 861
Score = 258 bits (660), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 167/519 (32%), Positives = 252/519 (48%), Gaps = 69/519 (13%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
P+ +TSL+ R + L+ LTL+EK+ + + + AIPRLGI YEWW+E+LHG+
Sbjct: 25 PYQDTSLAAEQRTEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGV------ 78
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL--------GQAGLTF 164
G T FPQ + ASFN SL + +A + EAR + GLTF
Sbjct: 79 ----GRAGLATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGDSGVLKRYQGLTF 134
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
W PN+NIFRDPRWGRGQET GEDP + + V+ Q
Sbjct: 135 WTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQ--------------------- 173
Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
G E R D+ L AC KH + +W +R+SF+A I +D +T+ P F+ +
Sbjct: 174 ---GPEDARYDK--LHACAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLV 225
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
++ +MC+YN+ G P C L Q R+EWG++G + SDC A++ + + T
Sbjct: 226 QKAHVKEVMCAYNRFEGEPCCGSNRLLMQILRDEWGYEGIVVSDCGAISDFYRPGTH-GT 284
Query: 343 HED---SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
H D ++AG ++AG D+ CG+ A+ G + EK+ID +L L + + LG +
Sbjct: 285 HPDKEHASAGAVRAGTDLECGS-EYASLADAVKAGLIDEKEIDISLKRLLTARFELGEMD 343
Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
+ + ++ + + EH+ LAL AR+ +VLL+N LPLN + +A++GP
Sbjct: 344 ---EQSAWSEIPTSVLNSKEHQALALRMARESLVLLQNKNNILPLNTHL--KVAVMGPNA 398
Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGCHDVPCNSDAGFHEAVRIAKKA 517
N+ G Y GIP +LL + A + + Y GC V + + I K
Sbjct: 399 NDSVMQWGNYNGIPAHTVTLLEAVRAKLPEGQIIYEPGCDRVDGKTLQSLFDECSINGKP 458
Query: 518 DFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVART 556
F L + +RDR ++ Q+S A T
Sbjct: 459 GF--------LAEYWNNRDREGEVVATDQISTPFHFATT 489
Score = 90.1 bits (222), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 70/268 (26%), Positives = 118/268 (44%), Gaps = 53/268 (19%)
Query: 511 VRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRP 560
V+ AD ++ G+ + E E+ DR + LP Q L+ ++ + K+
Sbjct: 592 VKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGKKV 651
Query: 561 VILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPE 620
V + +G + + +IL YPG+AG A+ + ++G++NPGGRLP+T+Y +
Sbjct: 652 VFINYSGSA---IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYKD 708
Query: 621 SFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISA 680
D +M+ GR+YR+ ++ FGHGLSYT+++Y E +S
Sbjct: 709 VNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTDFTY------GEAKLSK 754
Query: 681 SLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPK 740
+ A + +L I V+N G DG VV ++ R P
Sbjct: 755 NTIAKGENVVL-------------------------TIPVSNVGQRDGEEVVQVYLRRPG 789
Query: 741 VSQGTPEKQLIGFDRVHTVAKGSKEISF 768
+G P L F RVH A ++ ++
Sbjct: 790 DKEG-PRYTLRAFKRVHIPAGKTESVAI 816
>gi|329922637|ref|ZP_08278189.1| glycosyl hydrolase family 3 N-terminal domain protein
[Paenibacillus sp. HGF5]
gi|328941979|gb|EGG38262.1| glycosyl hydrolase family 3 N-terminal domain protein
[Paenibacillus sp. HGF5]
Length = 765
Score = 258 bits (659), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 215/762 (28%), Positives = 352/762 (46%), Gaps = 135/762 (17%)
Query: 76 EKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASF 135
E + + A RLGIP E HG + G GTV FP L +++
Sbjct: 89 EAVNHIQRYAIEQSRLGIPIL-IGEECSHGHMAIG------GTV-----FPVPLSIGSTW 136
Query: 136 NRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYA 195
N L+ ++ AVA+E R+ Q G ++P +++ RDPRWGR +E GEDP ++S YA
Sbjct: 137 NLDLYRDMCRAVALETRS-----QGGAVTYSPVLDVVRDPRWGRTEECFGEDPYLISEYA 191
Query: 196 VEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG 255
V V+ QGE+ S ++A KH + Y + G
Sbjct: 192 VASVEGLQGESLDSPSS--------------------------VAATLKHFVGYGSSEGG 225
Query: 256 NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-R 314
R + + ++ + PF+ +E G AS IM +YN+++GVP + +L R
Sbjct: 226 ---RNAGPVHMGTRELMEVDMLPFKKAVEAGAAS-IMPAYNEIDGVPCTVNTELLDGILR 281
Query: 315 NEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSAIDK 373
EWGF G + +DC A+ + + + D+A ++AG+D+ M +H Q A++
Sbjct: 282 KEWGFDGMVITDCGAIDMLASGHDTAEDGMDAAVQAIRAGIDLEMSGEMFGKHLQKAVES 341
Query: 374 GKVQEKDIDRALLNLFSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQ 430
K++ +D A+ + +++ +LGLF DP+ + + + + +H LA A +
Sbjct: 342 NKLEVSVLDEAVRRVLTLKFKLGLFENPYVDPQTAE------NVIGSGQHIGLARQLAAE 395
Query: 431 GIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTG--IPCSPKSLLRGLEAYV- 487
GIVLLKN+ K LPL+K +A+IGP + G YT P + ++L G+ A +
Sbjct: 396 GIVLLKNEAKALPLSKEG-GVIAVIGPNADQGYNQLGDYTSPQPPAAVTTVLGGIRAKLG 454
Query: 488 ---SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG-----------LDL----T 529
+ YA GC + +S GF A+ A++AD V++V G +DL +
Sbjct: 455 EEAQRVLYAPGCR-IKDDSREGFEFALSCAEQADTVVMVLGGSSARDFGEGTIDLRTGAS 513
Query: 530 QETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
+ T+D DR++L L G Q+ L + + KR +I+V G P+ + D
Sbjct: 514 KVTDDALSDMDCGEGIDRMTLQLSGVQLDLAQEIHKLGKR-MIVVYINGRPIAEPW--ID 570
Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQ 639
+IL YPG+ G A+A+I+FGD NP G+L M+ P+ ++P+ R+
Sbjct: 571 EHADAILEAWYPGQEGGHAIADILFGDVNPSGKLTMS-IPKHVGQLPVYYNGKRSR---- 625
Query: 640 YPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLD 699
G+ Y Y FG+GLSYT +SY + E+ G+D +
Sbjct: 626 --GKRYLEEDSQPRYPFGYGLSYTEFSYSDIQMTPEVI-------GTDGTAV-------- 668
Query: 700 YVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTV 759
V ++VTN+GD +GS VV L+ P ++L GF ++
Sbjct: 669 ----------------VSVNVTNSGDCEGSEVVQLYVSDAASKYTRPARELKGFQKISLQ 712
Query: 760 AKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRH 801
+++ F + P EQL + R+++ G +M+G RH
Sbjct: 713 PGERRKVEFTIGP-EQLQYIGQDYRQVVEPGLFRVMLG--RH 751
>gi|298482082|ref|ZP_07000270.1| beta-glucosidase [Bacteroides sp. D22]
gi|298271639|gb|EFI13212.1| beta-glucosidase [Bacteroides sp. D22]
Length = 863
Score = 258 bits (659), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 176/551 (31%), Positives = 259/551 (47%), Gaps = 79/551 (14%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
YP+ +T L+ RA L+ LTL+EK+ + +N+ AIPRLGI YEWW+E+LHG+A G
Sbjct: 26 YPYQDTKLTAEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARAG- 84
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARA----MYNLGQ----AGLT 163
T FPQ + AASFN L + AV+ EARA GQ GLT
Sbjct: 85 ---------LATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNERGQYKRYQGLT 135
Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
W PN+NIFRDPRWGRGQET GEDP + + V+ QG +D
Sbjct: 136 MWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAVVRGLQG----PEDA----------- 180
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSC 282
E D+ L AC KH + +W +R+SFNA I +D +T+ P F+
Sbjct: 181 ------EYDK-----LHACAKHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKEL 226
Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTK 341
+++ +MC+YN+ G P C L Q RN+WGFKG + +DC A+ F+ + + +
Sbjct: 227 VQKAGVKEVMCAYNRFEGDPCCGSNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKKH-E 285
Query: 342 THED---SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
TH D ++A + G D+ CG + A+ KG + E+ I+ ++ L + LG
Sbjct: 286 THPDAVHASADAVLNGTDLECGG-NFKSITDAVKKGLISEEKINTSVKRLLKARFELGEM 344
Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
N + + + +HK+LAL A + +VLL+N LPLN+ +A+IGP
Sbjct: 345 N---PTHPWSNIPYSVIDCPKHKELALKMAHESLVLLQNKNNILPLNRQM--KVAVIGPN 399
Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEA--------------YVSKTHYASGCHDVPCNSD 504
N+ G Y G P +LL G+ A Y + T S + +
Sbjct: 400 ANDSVMQWGNYNGFPSHTVTLLEGIRAKLPDAQIIYEPVCGYTNDTTLHSLFNQCSIDGK 459
Query: 505 AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRP---- 560
AGF ++ I T + ++ PG + T++ R++ RP
Sbjct: 460 AGFSATYWNNREYKGKIAATDRLTTPFHFSAEGSTVFAPGVGLKNFTAIYRSTFRPTDSG 519
Query: 561 --VILVLTGGG 569
V+T GG
Sbjct: 520 AATFRVMTNGG 530
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 88/311 (28%), Positives = 140/311 (45%), Gaps = 57/311 (18%)
Query: 509 EAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSK 558
E + K AD VI G+ E E DR + LP Q ++ + + K
Sbjct: 592 EVLNKLKNADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGK 651
Query: 559 RPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWY 618
+ V + +G ++ +IL YPG+AG A+A+++FGD+NP GRLP+T+Y
Sbjct: 652 KTVFVNFSGSA---MAIVPETQSCDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITFY 708
Query: 619 PESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELT 677
+S ++P D +M+ GR+YRF T T +Y FG+GLSYT +SY + T
Sbjct: 709 -KSIQQLPDYEDYSMK--------GRTYRFMTETPLYPFGYGLSYTRFSY------GKAT 753
Query: 678 ISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFAR 737
++ S +K IL I V+N G DG VV ++
Sbjct: 754 LNQSKLNKGEKAIL-------------------------TIPVSNVGQRDGEEVVQVYIC 788
Query: 738 VPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPL-GNHVLMV 796
P +G P+K L GF RV+ ++ +S + P + + I PL G + ++
Sbjct: 789 RPDDKEG-PQKTLRGFQRVNIAKGKTQNVSIEL-PYDSFEWFDTATNTIRPLSGTYKILY 846
Query: 797 GELRHSLTIET 807
G + ++T
Sbjct: 847 GNSSNENDLQT 857
>gi|346226088|ref|ZP_08847230.1| glycoside hydrolase family 3 domain protein [Anaerophaga
thermohalophila DSM 12881]
Length = 749
Score = 258 bits (659), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 210/748 (28%), Positives = 336/748 (44%), Gaps = 128/748 (17%)
Query: 50 DSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASN 109
+SYPF N L R L+S +TL EK+ LS + S +PRLG+ E HG+A
Sbjct: 50 ESYPFQNPELDSEARIDDLLSRMTLDEKVSALSTDPS-VPRLGVKGAPH-IEGYHGVAMG 107
Query: 110 GPG--VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN---LGQAGLTF 164
GP T+FPQ A++N L G ++EAR ++ + + GL
Sbjct: 108 GPANWAPKGDEAVPTTTFPQAYGMGATWNPELIRLAGEIESIEARYIFQNPEIAKGGLVV 167
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGEN---WKSDDGGIGFGFREK 221
APN ++ RDPRWGR +E GEDP +V A F K QG++ W++
Sbjct: 168 RAPNADLGRDPRWGRTEECFGEDPFLVGTSATAFTKGLQGDDDQYWRT------------ 215
Query: 222 RVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRS 281
++ KH +A E S F+ + + + FR
Sbjct: 216 ------------------ASLLKHFLANSNENGRESSSSDFDMQLYHE----YYGASFRR 253
Query: 282 CIEQGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTK 341
+G ++ M +YN +NGVPA + + WG G +D + Y
Sbjct: 254 AFIEGGSNAYMAAYNAINGVPAHVHDMHKEITERMWGVDGIKCTDGGGYQLLVYGHKYYD 313
Query: 342 THEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
+A GV+KAG++ A+ G + E DID L ++ V ++LG D
Sbjct: 314 DLYLAAEGVIKAGLNQFLDNYR-EGVYGALAHGYITEADIDEVLRGVYRVMIKLGQL--D 370
Query: 402 PRK-------GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
P++ G+ GK P T +HK AL AR+ IVLLKN+ K LPLN + ++ +A+
Sbjct: 371 PQEKVPYSAIGRDGKPAP--WTTQKHKDAALRMARESIVLLKNNNKTLPLNADKLNKVAV 428
Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCHDVPCNSDAGFHEAV 511
IG L + + + Y+G+P + L G+ + SK YA D ++ AV
Sbjct: 429 IGYLADTV--LLDWYSGLPPYRITPLEGIREKLGNDSKVLYA---------PDNDYNAAV 477
Query: 512 RIAKKADFVIVVAG-------------LDLTQETEDRDRVSLLLPGQQMSLVTSVARTSK 558
A +AD IV+ G D E DR +L L + + + A +
Sbjct: 478 EAASEADVAIVILGNYPTCNSEIWADCPDPGMGREAIDRKTLRLTDEYLVKLVMEANPN- 536
Query: 559 RPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWY 618
I VL P +++++ + + +IL + + G+ ALA+++FGD+NPGG+L TW
Sbjct: 537 --TIFVLQSSFPYAINWSQQN--VPAILHLTHNGQETGSALADVLFGDYNPGGKLTQTW- 591
Query: 619 PESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELT 677
P+S ++P M + ++R G +Y ++ +Y FGHGLSYT ++++ +++
Sbjct: 592 PKSEDQLPDMMEYDIRK-------GHTYMYFEDKPLYPFGHGLSYTTFAWE------DIS 638
Query: 678 ISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFAR 737
I+ + + D+ ++ + + + N GDV G VV L+A
Sbjct: 639 INKPVVSADDEEVI------------------------ITVKLKNTGDVKGDEVVQLYAS 674
Query: 738 VPKVSQGTPEKQLIGFDRVHTVAKGSKE 765
P+ + P K L GF RV T+ G K+
Sbjct: 675 FPESTVRRPAKALKGFKRV-TLEPGEKK 701
>gi|293370605|ref|ZP_06617157.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
gi|292634339|gb|EFF52876.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
Length = 861
Score = 258 bits (659), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 167/519 (32%), Positives = 253/519 (48%), Gaps = 69/519 (13%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
P+ +TSL+ RA+ L+ LTL+EK+ + + + AIPRLGI YEWW+E+LHG+
Sbjct: 25 PYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGV------ 78
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL-GQAG-------LTF 164
G T FPQ + ASFN SL + +A + EAR + G++G LTF
Sbjct: 79 ----GRAGLATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQGLTF 134
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
W PN+NIFRDPRWGRGQET GEDP + + V+ QG +D G
Sbjct: 135 WTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQG----PEDAGYD--------- 181
Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
L AC KH + +W +R+SF+A I +D +T+ P F+ +
Sbjct: 182 -------------KLHACAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLV 225
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
++ +MC+YN+ G P C L Q R+EWG+KG + SDC A++ + + T
Sbjct: 226 QKAHVKEVMCAYNRFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTH-GT 284
Query: 343 HED---SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
H D ++A ++ G D+ CG+ A+ G + EK+ID +L L + + LG +
Sbjct: 285 HPDKEHASAAAVRTGTDLECGS-EYASLADAVKAGLIDEKEIDISLKRLLTARFELGEMD 343
Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
+ + ++ + + EH+ LAL AR+ +VLL+N LPLN + +A++GP
Sbjct: 344 ---EQSAWSEIPTSVLNSKEHQALALRMARESLVLLQNKNNILPLNTHL--KVAVMGPNA 398
Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGCHDVPCNSDAGFHEAVRIAKKA 517
N+ G Y GIP +LL + A + + Y GC V + + I K
Sbjct: 399 NDSVMQWGNYNGIPAHTVTLLEAVRAKLPEGQIIYEPGCDRVDGKTLQSLFDECSINGKP 458
Query: 518 DFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVART 556
F L + +RDR ++ Q+S A T
Sbjct: 459 GF--------LAEYWNNRDREGEVVATDQISTPFHFATT 489
Score = 91.7 bits (226), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 71/269 (26%), Positives = 119/269 (44%), Gaps = 53/269 (19%)
Query: 510 AVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKR 559
AV+ AD ++ G+ + E E+ DR + LP Q L+ ++ + K+
Sbjct: 591 AVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGKK 650
Query: 560 PVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYP 619
V + +G + + +IL YPG+AG A+ + ++G++NPGGRLP+T+Y
Sbjct: 651 VVFINYSGSA---IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707
Query: 620 ESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS 679
+ D +M+ GR+YR+ ++ FGHGLSYT+++Y E +S
Sbjct: 708 DVNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTDFTY------GEAKLS 753
Query: 680 ASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVP 739
+ A + +L I V+N G DG VV ++ R P
Sbjct: 754 KNTIAKGENVVL-------------------------TIPVSNVGQRDGEEVVQVYLRRP 788
Query: 740 KVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
+G P L F RVH A ++ ++
Sbjct: 789 GDKEG-PRYTLRAFKRVHIPAGKTESVAI 816
>gi|224538725|ref|ZP_03679264.1| hypothetical protein BACCELL_03619 [Bacteroides cellulosilyticus
DSM 14838]
gi|224519667|gb|EEF88772.1| hypothetical protein BACCELL_03619 [Bacteroides cellulosilyticus
DSM 14838]
Length = 942
Score = 258 bits (659), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 210/754 (27%), Positives = 341/754 (45%), Gaps = 129/754 (17%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ S T+FP L ++NR L IG
Sbjct: 153 RLGIPV-DFTNEGIRGVES-----------YRATNFPTQLGLGHTWNRELIRQIGLITGR 200
Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
EAR + G T +AP +++ RD RWGR +E GE P +V+ +E V+ Q +
Sbjct: 201 EARML------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGMQHNH-- 252
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
++A KH +AY K + ++
Sbjct: 253 -----------------------------QVAATGKHFVAYSNNKGAREGMARVDPQMSP 283
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF---QKARNEWGFKGYITS 325
++ E PF+ I++ +M SYN +GVP ++G + + R E GF+GY+ S
Sbjct: 284 REVEMIHVYPFKRVIKEAGLLGVMSSYNDYDGVP--IQGSYYWLTTRLRGEMGFRGYVVS 341
Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDI 381
D DAV ++ + K +++ ++AG+++ C + + + +G + E+ I
Sbjct: 342 DSDAVEYLYTKHSTAKDMKEAVRQSVEAGLNVRCTFRSPDSYVLPLRELVKEGGLSEEVI 401
Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF 441
+ + ++ V+ +GLF+ + G +V +E++ LAL A+R+ +VLLKN+
Sbjct: 402 NDRVRDILRVKFLVGLFDTPYQTDLAG--ADKEVEKAENESLALQASRESLVLLKNENNV 459
Query: 442 LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCHD 498
LPL+ N V +A+ GP + Y + ++L G+ ++ Y GC
Sbjct: 460 LPLDINNVKKIAVCGPNADEEGYALTHYGPLAVEVTTVLEGIRQKAEGKAEVLYTKGCDL 519
Query: 499 VPCN--------------SDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPG 544
V N A +AV A++AD +VV G E++ R SL LPG
Sbjct: 520 VDANWPESELIDYPMTDSEQAEIDKAVENARQADVAVVVLGGGQRTCGENKSRSSLELPG 579
Query: 545 QQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIF 604
+Q+ L+ +V T K PV+LVL G PL +++A D + +IL YPG G A+A+++F
Sbjct: 580 RQLKLLQAVQATGK-PVVLVLINGRPLSINWA--DKFVPAILEAWYPGSKGGTAVADVLF 636
Query: 605 GDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQ------VYGFGH 658
GD+NPGG+L +T +P+S ++P N S Q G G +Y FG+
Sbjct: 637 GDYNPGGKLTVT-FPKSVGQIP---FNFPCKPSSQIDGGKNPGLDGNMSRVNGALYSFGY 692
Query: 659 GLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQI 718
GLSYT + Y S++ IS + + K V+
Sbjct: 693 GLSYTTFEY------SDIEISPKVITPNQKAT-------------------------VRC 721
Query: 719 SVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
VTN G G VV L+ R S T EK L GF+R+H +KE+ F +D +QL +
Sbjct: 722 KVTNTGKRAGDEVVQLYVRDILSSVTTYEKNLAGFERIHLQPGETKEVVFTLD-RKQLEL 780
Query: 779 ANKHGRRILPLGNHVLMVG------ELRHSLTIE 806
+KH ++ G+ +M+G L LT+E
Sbjct: 781 LDKHMEWVVEPGDFSIMIGASSEDIRLSGKLTVE 814
>gi|294675359|ref|YP_003575975.1| 1,4-beta-xylosidase [Prevotella ruminicola 23]
gi|225016052|gb|ACN78955.1| xylosidase/arabinofuranosidase [Prevotella ruminicola]
gi|294472720|gb|ADE82109.1| putative 1,4-beta-xylosidase [Prevotella ruminicola 23]
Length = 861
Score = 258 bits (658), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 163/462 (35%), Positives = 234/462 (50%), Gaps = 60/462 (12%)
Query: 51 SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
+ P+ N +LS RA L S LTL+EK + D + AIPRLGI + WWSE+LHG A+ G
Sbjct: 21 TLPYQNPNLSAKERAVDLCSRLTLEEKAMLMLDESPAIPRLGIKKFFWWSEALHGAANMG 80
Query: 111 PGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN----------LGQA 160
+VT+FP+ + AASFN L + + E RA YN +
Sbjct: 81 ----------NVTNFPEPVGMAASFNPHLLFKVFDIASTEFRAQYNHRMYDLNGEDMKMR 130
Query: 161 GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFRE 220
L+ W PN+NIFRDPRWGRGQET GEDP + S V+ VK QG +
Sbjct: 131 SLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSVMGVQVVKGLQGP-------------ED 177
Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPF 279
R K L AC KH Y + ++R++ N ++ +D +T+ P F
Sbjct: 178 ARYRK-------------LWACAKH---YAVHSGPEYTRHTANLTDVSARDFWETYMPAF 221
Query: 280 RSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQN 338
++ ++ K +MC+Y +++ P C L Q+ R+EWGF+ + SDC AV+ +E
Sbjct: 222 KTLVKDAKVREVMCAYQRLDDDPCCGSTRLLQQILRDEWGFEYLVVSDCGAVSDFYENH- 280
Query: 339 YTKTHEDSAAGVLK---AGMDINCG-TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
K+ D+ G K AG D+ CG + A+ KG + EK++D+ ++ L +
Sbjct: 281 --KSSSDAVHGTSKAVLAGTDVECGFNYAYKSLPEAVRKGLLSEKEVDKHVIRLLEGRFD 338
Query: 395 LGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
LG + DP ++ K+ + T +ALD ARQ IVLL+N LPL KNA +AI
Sbjct: 339 LGEMD-DPSLVEWSKIPYSAMSTKASANVALDMARQTIVLLQNKNNILPLKKNA-EKIAI 396
Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGC 496
IGP +N M G Y G P ++L G++A K Y GC
Sbjct: 397 IGPNAHNEPMMWGNYNGTPNHTVTILDGVKAKQKKLVYIPGC 438
Score = 87.8 bits (216), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 73/282 (25%), Positives = 121/282 (42%), Gaps = 56/282 (19%)
Query: 489 KTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRV 538
KT AS DV + + E + K + VI G+ + E E+ DR
Sbjct: 574 KTWGASMKIDVARELNIDYQETIAQLKGINKVIFCGGIAPSLEGEEMPVNIEGFKGGDRT 633
Query: 539 SLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKA 598
S+ LP Q + ++ K+ + + +G ++ +I+ YPG+ G A
Sbjct: 634 SIELPKVQREFLKALKAAGKQVIYVNCSGSA---IALQPETESCDAIVQAWYPGQEGGTA 690
Query: 599 LAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGH 658
+A+++FGD+NPGG+L +T+Y ND + GR+YR++ ++ FG+
Sbjct: 691 VADVLFGDYNPGGKLSVTFYK--------NDQQLPDYEDYSMKGRTYRYFDDA-LFPFGY 741
Query: 659 GLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQI 718
GLSYT + A ++A +D + ++VQI
Sbjct: 742 GLSYTTFEVG----------EAKVEAATDGAL-----------------------YNVQI 768
Query: 719 SVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVA 760
VTN G +GS + L+ R + G P K L GF+R+ A
Sbjct: 769 PVTNTGTKNGSETIQLYIRNLQDPDG-PLKSLRGFERLDIKA 809
>gi|160892207|ref|ZP_02073210.1| hypothetical protein BACUNI_04671 [Bacteroides uniformis ATCC 8492]
gi|156858685|gb|EDO52116.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
uniformis ATCC 8492]
Length = 990
Score = 258 bits (658), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 212/756 (28%), Positives = 343/756 (45%), Gaps = 129/756 (17%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ S T+FP L ++NR L +G
Sbjct: 201 RLGIPV-DFTNEGIRGVES-----------YRATNFPTQLGLGHTWNRELIRQVGLITGR 248
Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
EAR + G T +AP +++ RD RWGR +E GE P +V+ +E V+ Q +
Sbjct: 249 EARML------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGLQHNH-- 300
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
++A KH AY K + ++
Sbjct: 301 -----------------------------QVAATGKHFAAYSNNKGAREGMARVDPQMSP 331
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF---QKARNEWGFKGYITS 325
++ E+ PF+ I + +M SYN +G+P ++G + + R E GF+GY+ S
Sbjct: 332 REVENIHIYPFKRVIREAGMLGVMSSYNDYDGIP--VQGSYYWLTTRLRGEMGFRGYVVS 389
Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDI 381
D DAV ++ K +++ ++AG+++ C + + + +G + E+ I
Sbjct: 390 DSDAVEYLYTKHGTAKDMKEAVRQSVEAGLNVRCTFRSPDSFVLPLRELVKEGGLSEEVI 449
Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF 441
+ + ++ V+ +GLF+ P + +V E++ +AL A+R+ IVLLKN +
Sbjct: 450 NDRVRDILRVKFLIGLFDA-PYQTDLAD-ADREVEKEENEAIALQASRESIVLLKNAGEL 507
Query: 442 LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCH- 497
LPL+ N+ +A+ GP N Y + ++L G++ ++ Y GC
Sbjct: 508 LPLDINSTKKIAVCGPNANEEGYALTHYGPLAVEVTTVLEGIQEKTKGKAEVLYTKGCDL 567
Query: 498 -----------DVPCNSD--AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPG 544
D P D A +AV A++AD IVV G E++ R SL LPG
Sbjct: 568 VDAHWPESEIIDYPLTDDEQAEIDKAVENARQADVAIVVLGGGQRTCGENKSRTSLDLPG 627
Query: 545 QQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIF 604
+Q+ L+ ++ T K PV+L+L G PL +++A D + +IL YPG G ALA+I+F
Sbjct: 628 RQLQLLQAIQATGK-PVVLILINGRPLSINWA--DKFVPAILEAWYPGSKGGTALADILF 684
Query: 605 GDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQ------VYGFGH 658
GD+NPGG+L +T +P++ ++P N S Q G TG +Y FG+
Sbjct: 685 GDYNPGGKLTVT-FPKTVGQIP---FNFPCKPSSQIDGGKNPGPTGNMSRINGALYPFGY 740
Query: 659 GLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQI 718
GLSYT + Y S+L I+ + I S T V++
Sbjct: 741 GLSYTTFEY------SDLDITPRV--------------------ITPNESAT-----VRL 769
Query: 719 SVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
VTN G G VV L+ R S T EK L GF R+H ++E+SF +D + L +
Sbjct: 770 KVTNTGKRAGDEVVQLYIRDVLSSITTYEKNLAGFQRIHLEPGEAQELSFTID-RKHLEL 828
Query: 779 ANKHGRRILPLGNHVLMVG------ELRHSLTIETY 808
+ + ++ G+ VLM G L +LT+E Y
Sbjct: 829 LDADMKWVVEPGDFVLMAGASSEDIRLNGTLTVEDY 864
>gi|317480750|ref|ZP_07939836.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides sp. 4_1_36]
gi|316903091|gb|EFV24959.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides sp. 4_1_36]
Length = 942
Score = 258 bits (658), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 212/756 (28%), Positives = 343/756 (45%), Gaps = 129/756 (17%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ S T+FP L ++NR L +G
Sbjct: 153 RLGIPV-DFTNEGIRGVES-----------YRATNFPTQLGLGHTWNRELIRQVGLITGR 200
Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
EAR + G T +AP +++ RD RWGR +E GE P +V+ +E V+ Q +
Sbjct: 201 EARML------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGLQHNH-- 252
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
++A KH AY K + ++
Sbjct: 253 -----------------------------QVAATGKHFAAYSNNKGAREGMARVDPQMSP 283
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF---QKARNEWGFKGYITS 325
++ E+ PF+ I + +M SYN +G+P ++G + + R E GF+GY+ S
Sbjct: 284 REVENIHIYPFKRVIREAGMLGVMSSYNDYDGIP--VQGSYYWLTTRLRGEMGFRGYVVS 341
Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDI 381
D DAV ++ K +++ ++AG+++ C + + + +G + E+ I
Sbjct: 342 DSDAVEYLYTKHGTAKDMKEAVRQSVEAGLNVRCTFRSPDSFVLPLRELVKEGGLSEEVI 401
Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF 441
+ + ++ V+ +GLF+ P + +V E++ +AL A+R+ IVLLKN +
Sbjct: 402 NDRVRDILRVKFLIGLFDA-PYQTDLAD-ADREVEKEENEAIALQASRESIVLLKNAGEL 459
Query: 442 LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCH- 497
LPL+ N+ +A+ GP N Y + ++L G++ ++ Y GC
Sbjct: 460 LPLDINSTKKIAVCGPNANEEGYALTHYGPLAVEVTTVLEGIQEKTKGKAEVLYTKGCDL 519
Query: 498 -----------DVPCNSD--AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPG 544
D P D A +AV A++AD IVV G E++ R SL LPG
Sbjct: 520 VDAHWPESEIIDYPLTDDEQAEIDKAVENARQADVAIVVLGGGQRTCGENKSRTSLDLPG 579
Query: 545 QQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIF 604
+Q+ L+ ++ T K PV+L+L G PL +++A D + +IL YPG G ALA+I+F
Sbjct: 580 RQLQLLQAIQATGK-PVVLILINGRPLSINWA--DKFVPAILEAWYPGSKGGTALADILF 636
Query: 605 GDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQ------VYGFGH 658
GD+NPGG+L +T +P++ ++P N S Q G TG +Y FG+
Sbjct: 637 GDYNPGGKLTVT-FPKTVGQIP---FNFPCKPSSQIDGGKNPGPTGNMSRINGALYPFGY 692
Query: 659 GLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQI 718
GLSYT + Y S+L I+ + I S T V++
Sbjct: 693 GLSYTTFEY------SDLDITPRV--------------------ITPNESAT-----VRL 721
Query: 719 SVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
VTN G G VV L+ R S T EK L GF R+H ++E+SF +D + L +
Sbjct: 722 KVTNTGKRAGDEVVQLYIRDVLSSITTYEKNLAGFQRIHLEPGEAQELSFTID-RKHLEL 780
Query: 779 ANKHGRRILPLGNHVLMVG------ELRHSLTIETY 808
+ + ++ G+ VLM G L +LT+E Y
Sbjct: 781 LDADMKWVVEPGDFVLMAGASSEDIRLNGTLTVEDY 816
>gi|383123909|ref|ZP_09944579.1| hypothetical protein BSIG_4072 [Bacteroides sp. 1_1_6]
gi|382983834|gb|EES66944.2| hypothetical protein BSIG_4072 [Bacteroides sp. 1_1_6]
Length = 815
Score = 258 bits (658), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 216/742 (29%), Positives = 331/742 (44%), Gaps = 137/742 (18%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP + E+ HG + G T FP + AA+++ L +G+ +A
Sbjct: 160 RLGIPLF-LAEEAPHGHMAIG-----------TTVFPTGIGMAATWSPVLIEEVGNVIAK 207
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
E R+ Q + P +++ RDPRW R +ET GEDP++ V
Sbjct: 208 EIRS-----QGAHISYGPVLDLSRDPRWSRVEETFGEDPVLSGRLGAAMV---------- 252
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
IG G D E A KH +AY + + G Y A + +
Sbjct: 253 ----IGLG------------SGDLSREYATIATLKHFLAYAVPEGGQNGNY---ASVGTR 293
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCD 328
D + F PPF+ I+ G S +M SYN ++G+P L Q RNEW F+G++ SD
Sbjct: 294 DLHENFLPPFQEAIDAGALS-VMTSYNSIDGIPCTANYYLLTQLLRNEWRFRGFVVSDLY 352
Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
++ + E T E++A V+ AG+DI+ G + A+ GK+ E ID A+ +
Sbjct: 353 SIEGVHESHFVAPTIEEAAMQVVSAGVDIDLGGNAFMNLTHAVQSGKISEAVIDTAVCRV 412
Query: 389 FSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
++ +GLF +P+ V + EH +LA A+ IVLLKN LPLN
Sbjct: 413 LRMKFEMGLFEHPYVNPKSAT------KVVRSEEHIRLAHKVAQSSIVLLKNKNSILPLN 466
Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV-----SKTHYASGC--HD 498
K + +A++GP +N M G YT P +++ L+ + SK Y GC D
Sbjct: 467 KK-IKKVAVVGPNADNRYNMLGDYTA-PQEDENIKTVLDGVISKLSPSKVEYVRGCAIRD 524
Query: 499 VPCNSDAGFHEAVRIAKKADFVIVVAG----LDLT---QET----------------EDR 535
N A E V A +++ +I V G D QET E
Sbjct: 525 TTVNEIA---EVVEAASRSEVIIAVVGGSSARDFKTSYQETGAAIADEKSISDMECGEGF 581
Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
DR +L L G+Q L+ ++ T K P+I+V G PLD + A ++L YPG+ G
Sbjct: 582 DRATLTLLGKQQDLLNALKATGK-PLIVVYIEGRPLDKVW--ASEYADALLTASYPGQEG 638
Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
A+A+++FGD+NP GRLP++ P S ++P+ N +A + Y ++ + +Y
Sbjct: 639 GYAIADVLFGDYNPAGRLPVS-VPRSVGQIPVY-YNKKAPCNHDYVEQA-----ASPLYT 691
Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
FG+GLSYT + Y SD ++++ + C F
Sbjct: 692 FGYGLSYTTFEY------------------SDLQVIRK-------------SPC---YFE 717
Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
V V N G DG V L+ R S P +QL F+R KEI F + +
Sbjct: 718 VSFKVKNTGSYDGEEVAQLYLRDEYASVVQPLRQLKCFERFFLKRGEEKEIFFTLTE-KD 776
Query: 776 LSIANKHGRRILPLGNHVLMVG 797
LSI +++ R++ G+ +M+G
Sbjct: 777 LSIIDRNMARVVETGDFRIMIG 798
>gi|294647557|ref|ZP_06725134.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CC 2a]
gi|294807095|ref|ZP_06765914.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
xylanisolvens SD CC 1b]
gi|345508184|ref|ZP_08787819.1| periplasmic beta-glucosidase [Bacteroides sp. D1]
gi|292637099|gb|EFF55540.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CC 2a]
gi|294445794|gb|EFG14442.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
xylanisolvens SD CC 1b]
gi|345455214|gb|EEO50370.2| periplasmic beta-glucosidase [Bacteroides sp. D1]
Length = 783
Score = 258 bits (658), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 215/749 (28%), Positives = 339/749 (45%), Gaps = 132/749 (17%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP + E+ HG + G T FP + AA+++ L +G A+
Sbjct: 131 RLGIPLF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSPQLIREVGKAIGK 178
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
E R Q G + P +++ RDPRW R +ET GEDP++ GE K+
Sbjct: 179 EIRL-----QGGHISYGPVLDLARDPRWSRVEETFGEDPVLT------------GEIGKA 221
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
G+G G D A KH +AY + + G SF + +
Sbjct: 222 MVEGLGGG--------------DLSHPYSTLATLKHFLAYGISESGQNGNPSFAGI---R 264
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCD 328
+ + F PPFR I+ G A +M SYN ++GVP L + RNEW F+G + SD
Sbjct: 265 ELHENFLPPFRQAIDAG-ALSVMTSYNSMDGVPCTANHSLLTELLRNEWKFRGIVVSDLY 323
Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
++ I + T E++A L AG+D++ G + +A++ G++ + +D ++ +
Sbjct: 324 SIEGIHQSHFVAPTMEEAAILALSAGVDVDLGGDAYMNLMNAVNTGRISKTALDASVARV 383
Query: 389 FSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
++ +GLF DP K K +V + E LA A+ I LLKN+ LPLN
Sbjct: 384 LRLKFEMGLFENPYVDPEKAK------KEVRSEESVTLARRVAQASITLLKNEHSLLPLN 437
Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTGIPCSP--KSLLRGLEAYV--SKTHYASGCHDVPC 501
KN +A+IGP +N M G YT K++L G+ A + S+ Y GC +
Sbjct: 438 KN--RKVALIGPNADNRYNMLGDYTAPQEEENIKTVLDGIRAKLSSSQVEYVKGC-SIRD 494
Query: 502 NSDAGFHEAVRIAKKADFVIVVAG----LDLT---QET----------------EDRDRV 538
+AV A++++ +I V G D +ET E DR
Sbjct: 495 TVTTDIEQAVAAAQRSEVIIAVVGGSSARDFKTSYKETGAAIADEKTISDMECGEGFDRA 554
Query: 539 SLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKA 598
+L L G+Q L+ ++ T K P+I+V G PLD ++A ++ ++L YPG+ G A
Sbjct: 555 TLSLLGKQQELLKALKATGK-PLIVVYIEGRPLDKNWASENA--DAVLTAYYPGQEGGIA 611
Query: 599 LAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGH 658
+A+++FGDFNP GRLP + P S ++P+ N +A S Y S + +Y FG+
Sbjct: 612 IADVLFGDFNPAGRLPFS-VPRSVGQIPLY-YNKKAPQSHDYVEMS-----ASPLYPFGY 664
Query: 659 GLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQI 718
GLSYT++ Y S+L +SA + F +
Sbjct: 665 GLSYTSFDY------SDLHLSALMPRS----------------------------FEISF 690
Query: 719 SVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
V N G DG V L+ R S P KQL F R + +E+ F + E S+
Sbjct: 691 KVRNTGKYDGEEVAQLYLRDEYASVVQPLKQLKHFARFYLKRGEEREVKF-ILSEEDFSL 749
Query: 779 ANKHGRRILPLGNHVLMVGELRHSLTIET 807
+++ ++I+ G +M+G + + ++T
Sbjct: 750 VDRNLKKIVEPGTFQIMIGAASNDIRLQT 778
>gi|262405113|ref|ZP_06081663.1| periplasmic beta-glucosidase [Bacteroides sp. 2_1_22]
gi|262355988|gb|EEZ05078.1| periplasmic beta-glucosidase [Bacteroides sp. 2_1_22]
Length = 769
Score = 257 bits (657), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 215/749 (28%), Positives = 339/749 (45%), Gaps = 132/749 (17%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP + E+ HG + G T FP + AA+++ L +G A+
Sbjct: 117 RLGIPLF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSPQLIREVGKAIGK 164
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
E R Q G + P +++ RDPRW R +ET GEDP++ GE K+
Sbjct: 165 EIRL-----QGGHISYGPVLDLARDPRWSRVEETFGEDPVLT------------GEIGKA 207
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
G+G G D A KH +AY + + G SF + +
Sbjct: 208 MVEGLGGG--------------DLSHPYSTLATLKHFLAYGISESGQNGNPSFAGI---R 250
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCD 328
+ + F PPFR I+ G A +M SYN ++GVP L + RNEW F+G + SD
Sbjct: 251 ELHENFLPPFRQAIDAG-ALSVMTSYNSMDGVPCTANHSLLTELLRNEWKFRGIVVSDLY 309
Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
++ I + T E++A L AG+D++ G + +A++ G++ + +D ++ +
Sbjct: 310 SIEGIHQSHFVAPTMEEAAILALSAGVDVDLGGDAYMNLMNAVNTGRISKTALDASVARV 369
Query: 389 FSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
++ +GLF DP K K +V + E LA A+ I LLKN+ LPLN
Sbjct: 370 LRLKFEMGLFENPYVDPEKAK------KEVRSEESVTLARRVAQASITLLKNEHSLLPLN 423
Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTGIPCSP--KSLLRGLEAYV--SKTHYASGCHDVPC 501
KN +A+IGP +N M G YT K++L G+ A + S+ Y GC +
Sbjct: 424 KN--RKVALIGPNADNRYNMLGDYTAPQEEENIKTVLDGIRAKLSSSQVEYVKGC-SIRD 480
Query: 502 NSDAGFHEAVRIAKKADFVIVVAG----LDLT---QET----------------EDRDRV 538
+AV A++++ +I V G D +ET E DR
Sbjct: 481 TVTTDIEQAVAAAQRSEVIIAVVGGSSARDFKTSYKETGAAIADEKTISDMECGEGFDRA 540
Query: 539 SLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKA 598
+L L G+Q L+ ++ T K P+I+V G PLD ++A ++ ++L YPG+ G A
Sbjct: 541 TLSLLGKQQELLKALKATGK-PLIVVYIEGRPLDKNWASENA--DAVLTAYYPGQEGGIA 597
Query: 599 LAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGH 658
+A+++FGDFNP GRLP + P S ++P+ N +A S Y S + +Y FG+
Sbjct: 598 IADVLFGDFNPAGRLPFS-VPRSVGQIPLY-YNKKAPQSHDYVEMS-----ASPLYPFGY 650
Query: 659 GLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQI 718
GLSYT++ Y S+L +SA + F +
Sbjct: 651 GLSYTSFDY------SDLHLSALMPRS----------------------------FEISF 676
Query: 719 SVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
V N G DG V L+ R S P KQL F R + +E+ F + E S+
Sbjct: 677 KVRNTGKYDGEEVAQLYLRDEYASVVQPLKQLKHFARFYLKRGEEREVKF-ILSEEDFSL 735
Query: 779 ANKHGRRILPLGNHVLMVGELRHSLTIET 807
+++ ++I+ G +M+G + + ++T
Sbjct: 736 VDRNLKKIVEPGTFQIMIGAASNDIRLQT 764
>gi|383110854|ref|ZP_09931672.1| hypothetical protein BSGG_1962 [Bacteroides sp. D2]
gi|313694427|gb|EFS31262.1| hypothetical protein BSGG_1962 [Bacteroides sp. D2]
Length = 861
Score = 257 bits (657), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 158/460 (34%), Positives = 232/460 (50%), Gaps = 63/460 (13%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
P+ +TSL+ RA+ L+ LTL+EK+ + + + AIPRLGI Y+WW+E+LHG+
Sbjct: 25 PYQDTSLTAEQRAEDLLPRLTLEEKVALMQNASPAIPRLGIKEYDWWNEALHGV------ 78
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTF 164
G T FPQ + ASFN SL + AV+ EAR + GLTF
Sbjct: 79 ----GRAGLATVFPQSIGMGASFNDSLLYEVFDAVSDEARVKSRIFSENGVLKRYQGLTF 134
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQG-ENWKSDDGGIGFGFREKRV 223
W PN+NIFRDPRWGRGQET GEDP + + V+ QG EN K D
Sbjct: 135 WTPNVNIFRDPRWGRGQETYGEDPYLTGQLGMAVVRGLQGPENGKYDK------------ 182
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSC 282
L AC KH + +W +R+SF+A IT +D +T+ P F+
Sbjct: 183 ---------------LHACAKHFAVHSGPEW---NRHSFDAENITPRDLWETYLPAFKDL 224
Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTK 341
+++ +MC+YN+ G P C L Q R+EWG+KG + SDC A++ + +
Sbjct: 225 VQKADVKEVMCAYNRFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTH-G 283
Query: 342 THED---SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
TH D ++AG + +G D+ CG A+ G + EK ID +L L + + LG
Sbjct: 284 THPDKEHASAGAVLSGTDLECGG-EYGSLADAVKAGLIDEKQIDVSLKRLLTARFELGEM 342
Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
+ P + ++ + + EH+ LAL AR+ +VLL+N LPLN + +A++GP
Sbjct: 343 DEQP---AWAEIPASTLNSKEHQDLALRMARESLVLLQNKNDILPLNTDL--KVAVMGPN 397
Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGC 496
N+ G Y GIP +LL + + + + Y GC
Sbjct: 398 ANDSVMQWGNYNGIPGHTVTLLEAVRSKLPEGQVMYEPGC 437
Score = 98.2 bits (243), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 77/269 (28%), Positives = 122/269 (45%), Gaps = 53/269 (19%)
Query: 510 AVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKR 559
AV K AD V+ G+ + E E+ DR + LP Q L+ ++ + K+
Sbjct: 591 AVEKVKDADVVLFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPAVQRDLLKALKKAGKK 650
Query: 560 PVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYP 619
V + +G + + +IL YPG+AG A+ +++FGD+NP GRLP+T+Y
Sbjct: 651 VVFINYSGSA---IGLVPESNTCEAILQGWYPGQAGGTAIVDVLFGDYNPAGRLPVTFYK 707
Query: 620 ESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS 679
++ D +M+ GR+YR+ ++ FGHGLSYT ++Y
Sbjct: 708 DAGQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTTFTYG----------- 748
Query: 680 ASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVP 739
+A KN + G+ VT + I V+NAG DG VV ++ R
Sbjct: 749 ---EADLSKNTIGDGGT---------VT--------LTIPVSNAGQRDGDEVVQVYLRCM 788
Query: 740 KVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
+G P L F RVH A +K+++
Sbjct: 789 ADKEG-PHYTLRAFKRVHIPAGETKQVTI 816
>gi|317477153|ref|ZP_07936394.1| glycosyl hydrolase family 3 N terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
gi|316906696|gb|EFV28409.1| glycosyl hydrolase family 3 N terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
Length = 863
Score = 257 bits (657), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 149/436 (34%), Positives = 230/436 (52%), Gaps = 47/436 (10%)
Query: 56 NTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF 115
+ S IS R +++I +TL+EK+ QLS+ + +IPRL +P+Y +W+E LHG+A G
Sbjct: 51 DLSQPISVRIENIIRQMTLEEKVAQLSNESDSIPRLNLPSYNYWNECLHGVARAG----- 105
Query: 116 NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDP 175
VT FPQ + A++++ L I SA++ EAR Y GLT+WAP IN+ RDP
Sbjct: 106 -----EVTVFPQAINLASTWDTLLVKRIASAISTEARLKYLDIGKGLTYWAPTINMARDP 160
Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
RWGR +ET GEDP + S V FVK QG D +
Sbjct: 161 RWGRNEETYGEDPYLTSRLGVAFVKGLQG---------------------------DHPN 193
Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
L A KH +A + E R+S ++ I + + + P + +C+++ IM +Y
Sbjct: 194 YLKTVATVKHFVANNQEN----DRFSSSSQIPTKQLYEYYFPAYEACVKEANVQSIMTAY 249
Query: 296 NQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
N NG+P L + R EWGF G++ SDC A+ + + E++AA + +G
Sbjct: 250 NAFNGIPPSGSTWLLEDVLRKEWGFDGFVVSDCGAIGVMNWQHRIVNSLEEAAALGINSG 309
Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK-GKYGKLGPD 413
D+ CG + +A+ +G V E IDRAL + +++ +LG F DP + Y
Sbjct: 310 CDLECGGTYRENLVAAVQRGLVSEYAIDRALTRVLTMRFKLGEF--DPIELVPYNHYDKK 367
Query: 414 DVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIP 473
+ + ++LA +AA + I+LLKN+ FLP++K V S+AI+GP +N GGY+G P
Sbjct: 368 LLAGEQFRRLAYEAAVKSIILLKNEDNFLPIDKKDVRSIAIVGPFADN--NYLGGYSGKP 425
Query: 474 CSPKSLLRGLEAYVSK 489
SLL+G++ V +
Sbjct: 426 VHNISLLQGVKKMVGE 441
Score = 125 bits (314), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 96/288 (33%), Positives = 140/288 (48%), Gaps = 47/288 (16%)
Query: 512 RIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPL 571
+I +AD V+V G D E+RD S+ LP Q L+ + + + R + L+L G PL
Sbjct: 604 KIVSRADLVLVALGNDGKLARENRDLPSIYLPMTQELLLKEIYKVNPR-IALILQTGNPL 662
Query: 572 DVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDM 630
+A + SIL YPG+ G ALA I+FG NP G+LPMT Y ES ++P + D
Sbjct: 663 TSQWAA--EHVPSILQAWYPGQEGGAALAGILFGLENPSGKLPMTIY-ESEQQLPNILDY 719
Query: 631 NMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNI 690
++ + GR+Y++ + +YGFGHGLSY+N+ Y A L+
Sbjct: 720 DI-------WKGRTYQYLSSKPLYGFGHGLSYSNFEY------------ADLQCN----- 755
Query: 691 LQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEKQ 749
D VH+D C+ I V N DV G V+ ++ K T P K+
Sbjct: 756 --------DVVHVDGTLQCS-------IKVKNISDVVGEEVIQVYVSREKTPVYTFPLKK 800
Query: 750 LIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
LI F RV+ SK ++F + P QLS+ ++L G + L VG
Sbjct: 801 LIAFARVNLKPNESKTVTFTITP-RQLSVWQDGEWKMLS-GKYSLFVG 846
>gi|237721943|ref|ZP_04552424.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
gi|229448812|gb|EEO54603.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
Length = 792
Score = 257 bits (657), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 210/735 (28%), Positives = 339/735 (46%), Gaps = 127/735 (17%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ + T FP A++N+ L I A
Sbjct: 143 RLGIPV-DFTNEGIRGLCHD-----------RATMFPAQCGQGATWNKKLIREIAKVTAN 190
Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
EA+A+ G T ++P ++I +DPRWGR E+ GEDP + + + Q E
Sbjct: 191 EAKAL------GYTNIYSPILDIAQDPRWGRVVESYGEDPYLAGELGKQMILGLQNEG-- 242
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
+ A KH Y + G + +
Sbjct: 243 ------------------------------IVATPKHFAVYSIPVGGRDGGTRTDPHVAP 272
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDC 327
++ + + PFR I++ A +M SYN +G P + R +WGFKGY+ SD
Sbjct: 273 REMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYVVSDS 332
Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTC----MLRHTQSAIDKGKVQEKDIDR 383
+AV + T T E+ AA V+ AG++I + + AID+GKV +D+
Sbjct: 333 EAVEFLHTKHRITPTEEEMAAQVVNAGLNIRTNFTPPQDFILPLRRAIDEGKVSLHTLDQ 392
Query: 384 ALLNLFSVQLRLGLFN----GDPRKGKYGKLGPDDVCTSE-HKKLALDAARQGIVLLKND 438
+ + V+ +GLF+ GD R+ P+ V ++ HK +++ AA + +VLLKN+
Sbjct: 393 RVSEILRVKFMMGLFDNPYPGDDRR-------PETVVHNDAHKAVSMKAALESVVLLKNE 445
Query: 439 KKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGC 496
+ LPL+KN +A+IGP + ++ Y S K++ +G++ Y+ S+ YA GC
Sbjct: 446 NQMLPLSKN-FKKIAVIGPNAEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVRYAKGC 504
Query: 497 ------------HDVPCNS--DAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLL 542
++VP ++ A HEAV +AK +D I+V G + E+ R +L L
Sbjct: 505 DIIDKYFPESELYNVPLDTQEQAMIHEAVELAKASDIAILVLGGNEKTVREEFSRTNLDL 564
Query: 543 PGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEI 602
G+Q L+ +V T K PV+LV+ G +++ A+ I +I+ +PGE A+A++
Sbjct: 565 CGRQQQLLEAVYATGK-PVVLVMVDGRAATINW--ANKYIPAIIHAWFPGEFMGDAIAKV 621
Query: 603 IFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
+FGD+NPGGRL +T +P+S ++P S+ + +Y FG+GLSY
Sbjct: 622 LFGDYNPGGRLAVT-FPKSVGQIPFAFPFKPGSDSKG------KVRVDGALYPFGYGLSY 674
Query: 663 TNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTN 722
T + Y S+L IS + G +NI SCT V N
Sbjct: 675 TTFGY------SDLKISKPV-IGPQENI---------------TLSCT---------VKN 703
Query: 723 AGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
G G VV L+ R S T +K L GF+R+H + +SF + P + L + +K+
Sbjct: 704 TGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVSFTLTP-QDLGLWDKN 762
Query: 783 GRRILPLGNHVLMVG 797
R + G+ +MVG
Sbjct: 763 NRFTVEPGSFSVMVG 777
>gi|329923020|ref|ZP_08278536.1| glycosyl hydrolase family 3 N-terminal domain protein
[Paenibacillus sp. HGF5]
gi|328941793|gb|EGG38078.1| glycosyl hydrolase family 3 N-terminal domain protein
[Paenibacillus sp. HGF5]
Length = 763
Score = 257 bits (657), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 225/758 (29%), Positives = 342/758 (45%), Gaps = 135/758 (17%)
Query: 76 EKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASF 135
E + + A RLGIP + E HG + G T FP L +++
Sbjct: 90 EAVNAIQRYAMEHSRLGIPIL-FGEECSHGHMAIG-----------ATVFPVPLTIGSTW 137
Query: 136 NRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYA 195
N L+ +I AVA E RA Q G ++P +++ RDPRWGR +ET GEDP +V+ +A
Sbjct: 138 NTELFRSISRAVAAETRA-----QGGAATYSPVLDVVRDPRWGRTEETFGEDPHLVAEFA 192
Query: 196 VEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG 255
V V+ QGE S L A KH Y + G
Sbjct: 193 VAAVQGLQGERLDS--------------------------HTSLLATLKHFAGYGASEGG 226
Query: 256 NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-R 314
R + ++ + PFR +E G S IM +YN+++GVP L Q R
Sbjct: 227 ---RNGAPVHMGLRELHEVDLLPFRKAVESGALS-IMTAYNEIDGVPCTSSRYLLQNVLR 282
Query: 315 NEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLR-HTQSAIDK 373
WGF G++ +DC A+ + N + ++A LKAG+D+ M R H Q A+++
Sbjct: 283 EAWGFDGFVITDCGAIHMLACGHNTAGSGVEAATQSLKAGVDMEMSGTMFRAHLQQALEQ 342
Query: 374 GKVQEKDIDRALLNLFSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQ 430
G + E D++RA + ++ RLGLF+ DP + + EH LA AA +
Sbjct: 343 GLITEDDLNRAAGRVLELKFRLGLFDRPYVDPAWAE------QVIGCKEHIALAYQAAAE 396
Query: 431 GIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV--- 487
GIVLLKN+ LPL+ ++ ++A+IGP + G YT P P ++ L+
Sbjct: 397 GIVLLKNEGNLLPLDSSS-GTIAVIGPNAHTPYHQLGDYTS-PQPPGQIVTVLDGIRRRL 454
Query: 488 --SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG-----------LDLTQET-- 532
S+ YA GC + +S GF A+ A++AD +++V G +DL
Sbjct: 455 GDSRVLYAPGCR-IQGDSREGFPRALACAEQADVIVMVLGGSSARDFGEGTIDLRTGASV 513
Query: 533 ------------EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADS 580
E DR +L L G Q+ L+ + + K PVI+V G P+ + D
Sbjct: 514 VTGDAKSDMECGEGIDRSTLTLMGVQLELLQELQKLGK-PVIVVYINGRPITEPWI--DE 570
Query: 581 QISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQY 640
I +I+ YPG+ G A+A+++FGD NP GRLP++ P+ ++P++ ++R+
Sbjct: 571 FIPAIIEAWYPGQEGGGAIADMLFGDINPSGRLPLS-IPKEVGQLPIS------YNARRT 623
Query: 641 PGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSA-PSELTISASLKAGSDKNILQQTGSRLD 699
G+ Y Y FG GLSYT + Y L+ P+ + I
Sbjct: 624 RGKRYLETDLAPRYPFGFGLSYTEFRYGRLTVEPAVVPIGG------------------- 664
Query: 700 YVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTV 759
E T V+I VTNAG DG+ VV L+ S PEK L GF +V
Sbjct: 665 -----EAT--------VRIDVTNAGARDGAEVVQLYVSDLAASVTRPEKALKGFRKVFLK 711
Query: 760 AKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
A ++E++F + EQL + + ++ G + VG
Sbjct: 712 AGETQEVTFTIG-SEQLELIGLDLKPVVEPGEFRIQVG 748
>gi|380694149|ref|ZP_09859008.1| glycoside hydrolase 3 [Bacteroides faecis MAJ27]
Length = 946
Score = 257 bits (657), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 208/735 (28%), Positives = 339/735 (46%), Gaps = 115/735 (15%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ S T+FP L ++NR L +G
Sbjct: 153 RLGIPT-DFTNEGIRGVES-----------YKATNFPTQLGLGHTWNRRLIHQVGLITGR 200
Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
EAR + G T +AP +++ RD RWGR +E GE P +V+ +E V+ Q +
Sbjct: 201 EARML------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGMQHNH-- 252
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
++A KH IAY K + ++
Sbjct: 253 -----------------------------QIAATGKHFIAYSNNKGAREGMARVDPQMSP 283
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVP-ACLRGDLFQKARNEWGFKGYITSDC 327
++ E T PF+ I + +M SYN +G P L + R E GF+GY+ SD
Sbjct: 284 REVEMTHVYPFKRVIREAGLLGVMSSYNDYDGFPIQSSYYWLTTRLRGEMGFRGYVVSDS 343
Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDIDR 383
DAV ++ K +++ ++AG+++ C + + + +G + E+ I+
Sbjct: 344 DAVEYLYTKHGTAKDMKEAVRQSVEAGLNVRCTFRSPDSYVLPLRELVKEGGLSEEVIND 403
Query: 384 ALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLP 443
+ ++ V+ +GLF+ + G ++V + ++++AL A+R+ IVLLKNDK LP
Sbjct: 404 RVRDILRVKFLVGLFDHPYQIDLKG--ADEEVEKAANEEIALQASRESIVLLKNDKNILP 461
Query: 444 LNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCHDVP 500
L+ + + +A+ GP + S Y + S+L+G++ + ++ Y GC V
Sbjct: 462 LDASGIQKIAVCGPNADEHSYALTHYGPLAVEVTSVLKGIQEKMKGKAEVLYTKGCDLVD 521
Query: 501 CN--------------SDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQ 546
N +AV K+AD +VV G E++ R SL LPG+Q
Sbjct: 522 ANWPESELIDYPLTDEEQKEIEKAVDQTKQADVAVVVLGGGQRTCGENKSRSSLDLPGRQ 581
Query: 547 MSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD 606
+ L+ +VA T K PV+LVL G PL +++ AD + +I+ YPG G KA+A+++FG+
Sbjct: 582 LDLLKAVAATGK-PVVLVLINGRPLSINW--ADKFVPAIVEAWYPGSKGGKAVADVLFGE 638
Query: 607 FNPGGRLPMTWYPESFTKVPMN---DMNMRADSSRQYPGRSYRFYTGT-QVYGFGHGLSY 662
+NPGG+L +T +P++ ++P N + + D + PG +Y FG+GLSY
Sbjct: 639 YNPGGKLTVT-FPKTVGQIPFNFPCKPSSQIDGGKN-PGMEGNMSRANGALYPFGYGLSY 696
Query: 663 TNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTN 722
T + Y S+L IS ++ + QQT V VTN
Sbjct: 697 TTFEY------SDLKISPAIITPN-----QQT--------------------FVTCKVTN 725
Query: 723 AGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
G G VV L+ R S T EK L GF+RVH +KE++F +D + L + N
Sbjct: 726 TGKRAGDEVVQLYVRDVLSSVTTYEKNLAGFERVHLQPGETKEVTFPID-RKALELLNAD 784
Query: 783 GRRILPLGNHVLMVG 797
++ G+ LMVG
Sbjct: 785 MHWVVEPGDFTLMVG 799
>gi|373952814|ref|ZP_09612774.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
gi|373889414|gb|EHQ25311.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
Length = 862
Score = 257 bits (657), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 152/456 (33%), Positives = 234/456 (51%), Gaps = 53/456 (11%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
P+ N +LS RAK L++ LTL+EK+ + D + A+PRLGI + WWSE+LHG A+ GP
Sbjct: 23 PYQNPALSSEARAKDLVTRLTLKEKVGLMKDVSEAVPRLGIKKFNWWSEALHGYANQGP- 81
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTF 164
VT FP+ + AASF+ ++ AV+ EARA N + L+
Sbjct: 82 ---------VTVFPEPVGMAASFDDQKLFHVFDAVSDEARAKNNEYRKQVESQRFHDLSV 132
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
W PN+NIFRDPRWGRGQET GEDP + S V VK QG +R+
Sbjct: 133 WTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVSVVKGLQGP--------ADAKYRK---- 180
Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
L AC KH + +W SR+ N +T +D +T+ P F+S +
Sbjct: 181 --------------LLACAKHYAVHSGPEW---SRHEMNVTDVTPRDLWETYLPAFKSLV 223
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
+ +MC+Y +++ P C L Q R +WGFK + SDC A+ + + +
Sbjct: 224 QDADVREVMCAYQRLDDEPCCGNSRLLGQILREDWGFKYLVVSDCGAITDFYNSHHSSSD 283
Query: 343 HEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDP 402
++A + +G D+ C A+ +G ++EKDI+ +++ L + + LG + D
Sbjct: 284 ATHASAKAVLSGTDVECVGYAFDKIPDAVYRGLIKEKDINTSVVRLMTQRFELGEMDKD- 342
Query: 403 RKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNI 462
+ K+ V + +H+KLALD AR+ + LL+N+ LPL+K ++ LA+IGP N+
Sbjct: 343 ELVPWTKIPLSVVNSEDHQKLALDMARETMTLLQNNNNILPLSK-SIGKLAVIGPNANDS 401
Query: 463 SQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGC 496
+ G Y G P ++L G++ + H Y +GC
Sbjct: 402 QMLSGNYNGTPLRTINILEGIKTKLGADHVIYDAGC 437
Score = 87.8 bits (216), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 77/312 (24%), Positives = 134/312 (42%), Gaps = 59/312 (18%)
Query: 498 DVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQM 547
++P D G E V K AD V+ V G+ E E+ DR + LP Q
Sbjct: 584 EIPLTYD-GIIEKV---KDADIVVFVGGISPKLEGEEMPVQLPGFKGGDRTDIELPAVQR 639
Query: 548 SLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDF 607
+ + ++ + K+ V + +G ++ +IL Y GE+G +A+A+++FGD+
Sbjct: 640 NCIEALRKAGKKIVFVNCSGSA---IAMVPETQNCDAILQAWYAGESGGQAVADVLFGDY 696
Query: 608 NPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSY 667
NP G LP+T+Y +D +M+ GR+YR+ ++ FG GLSYT ++
Sbjct: 697 NPSGHLPVTFYRNVQQLPDFSDYSMK--------GRTYRYLKSAPLFPFGFGLSYTTFNI 748
Query: 668 KFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVD 727
++LT + K + +++ V NAG D
Sbjct: 749 ----GEAKLTKNNITKGEA---------------------------IQLRVPVANAGKTD 777
Query: 728 GSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCE-QLSIANKHGRRI 786
G+ ++ ++ R G K L GF R+ A ++ ++ + P + R+
Sbjct: 778 GTELLQVYIRKVDDPDGA-SKTLRGFKRIPVSAGKTEMVTLDLPPKTFEFFDPTDAVVRV 836
Query: 787 LPLGNHVLMVGE 798
P GN+ L+ GE
Sbjct: 837 SP-GNYQLLYGE 847
>gi|167646366|ref|YP_001684029.1| beta-glucosidase [Caulobacter sp. K31]
gi|167348796|gb|ABZ71531.1| Beta-glucosidase [Caulobacter sp. K31]
Length = 898
Score = 257 bits (657), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 164/442 (37%), Positives = 229/442 (51%), Gaps = 70/442 (15%)
Query: 65 AKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTS 124
A L+ +TL EK++QL + A AIPRLG+PAY WW+ESLHG P T+
Sbjct: 46 AAELVGKMTLDEKLEQLLNTAPAIPRLGVPAYNWWTESLHGALGALP----------TTN 95
Query: 125 FPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA---------GLTFWAPNINIFRDP 175
FP+ + AA+F+ L ++ A+ E + ++ L +A GL W+PNINIFRDP
Sbjct: 96 FPEPIGLAATFDAPLVHDVAGAIGAEMQDLHALARATGRMGRIGTGLNTWSPNINIFRDP 155
Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
RWGRGQET GEDP + + V FV+ QG D D
Sbjct: 156 RWGRGQETYGEDPFLTARMGVAFVEGIQG---------------------------DDPD 188
Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
+ A KH + + +R+ N ++ +D EDT+ P FR+ + +G+A IMC+Y
Sbjct: 189 HPRIIATPKH---FAVHSGPESTRHGANVFVSRRDLEDTYLPAFRAAVVEGRAGSIMCAY 245
Query: 296 NQVNGVPACLRGDLFQKA--RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKA 353
N+++G PAC DL K R W F GY+ SDCDAV I ++ Y + A L+A
Sbjct: 246 NRIDGQPACA-SDLLLKEHLRGAWKFDGYVVSDCDAVKDISDHHKYAPEAASAVAAALRA 304
Query: 354 GMDINCGTCMLRHT-------QSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD-PRKG 405
G+D C L T + A+D+G + ID AL+ LFS + R NGD P KG
Sbjct: 305 GVDNECNGATLTDTDGLAGRYREALDRGLISTAQIDTALVRLFSARFR----NGDLPAKG 360
Query: 406 KY-GKL-GPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNIS 463
G+L GP V T EH+ LAL A+ + +VLLKND LPL +A+IGPL +
Sbjct: 361 GSDGRLAGPSVVTTHEHEALALAASEKSLVLLKND-GVLPLKPGL--RIAVIGPLGDATR 417
Query: 464 QMGGGY-TGIPCSPKSLLRGLE 484
+ G Y + + P S++ GL
Sbjct: 418 VLRGNYSSALSAPPISVVDGLR 439
Score = 110 bits (274), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 83/290 (28%), Positives = 126/290 (43%), Gaps = 54/290 (18%)
Query: 518 DFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTG 567
D ++ V GL E E+ D+ +L LP Q +++ A+ +P+I+V
Sbjct: 628 DVLVAVVGLTSDMEAEEAPIEIPGFKGGDKTTLDLPADQRAMLEQ-AKALGKPLIVVAMN 686
Query: 568 GGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM 627
G PL+ FA A S++L YPG++G A+A ++ G NP GRLP+T+Y P
Sbjct: 687 GSPLN--FAWAKDNASALLEAWYPGQSGGLAIANVLTGKTNPAGRLPLTFYRSVDDLPPF 744
Query: 628 NDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSD 687
+D M GR+YR++ GT VY FG+GLSYT + Y L
Sbjct: 745 DDYAMA--------GRTYRYFEGTPVYPFGYGLSYTRFDYGPLK---------------- 780
Query: 688 KNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPE 747
I+ T V ++ N G G V L+ PK + G P
Sbjct: 781 ---------------IEPATKGAGQGLRVTTTIKNVGTRPGEEVAQLYLDFPK-TPGAPR 824
Query: 748 KQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
L GF R+ +++I+F + P LS + G + G + + VG
Sbjct: 825 LALRGFQRIALKPGETRDITFALSP-RDLSSVDLDGEHRVSAGLYRVSVG 873
>gi|218130693|ref|ZP_03459497.1| hypothetical protein BACEGG_02282 [Bacteroides eggerthii DSM 20697]
gi|217987037|gb|EEC53368.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
eggerthii DSM 20697]
Length = 954
Score = 257 bits (657), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 213/764 (27%), Positives = 353/764 (46%), Gaps = 120/764 (15%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDN--ASAIPRLGIPAYEWWSESLHGIASNGP 111
+ + L ++ R +SL+S++T ++K++ + + IP L +P E++HG +
Sbjct: 170 YMDPRLPVNERVESLLSVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSYG-- 226
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
S T FPQ L A++N++L + A+ E L + W+P +++
Sbjct: 227 --------SGATIFPQALAMGATWNKNLTEKVAMAIGEET-----LAAGTMQAWSPVLDV 273
Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
+D RWGR +ET GEDP++VS ++K +Q +
Sbjct: 274 AQDARWGRCEETFGEDPVLVSQIGGAWIKGYQSKG------------------------- 308
Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
L KH + G R S + ++E++ + PFR I +
Sbjct: 309 -------LFTTPKHFGGHGAPLGG---RDSHDIGLSEREMREVHLVPFRHVIRNYSCQSL 358
Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG- 349
M +Y+ GVP +L + R EWGF G++ SDC A+ + ++YT ++ AA
Sbjct: 359 MMAYSDFLGVPVAKSKELLRNILREEWGFDGFVVSDCGAIGNLTSRKHYTAKNKIEAANQ 418
Query: 350 VLKAGMDINCG-TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG-KY 407
L AG+ NCG T + A G++ ++D + + R LF + ++ +
Sbjct: 419 ALAAGIATNCGDTYNDKEVIQAAKDGRLDMANLDNVCRTMLRMMFRNELFEKEHKEPLDW 478
Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
K+ P + HK++A +AR+ IV+L+N LPL+K+ V ++A++GP +N+ G
Sbjct: 479 NKIYPG-WNSDSHKEIARQSARESIVMLENKDDVLPLSKH-VRTIAVLGPGADNLQP--G 534
Query: 468 GYTGI--PCSPKSLLRGLEAYV---SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIV 522
YT P KS+L G++ V +K Y GC N + AV++A ++D V++
Sbjct: 535 DYTPKLRPGQLKSVLTGIKQAVGNQTKILYEQGCEFTGSNGE-NIPNAVKVAAQSDVVVL 593
Query: 523 VAGLDLTQET---------EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
V G T E E+ D +L+LPG+Q L+ +V T K PV+LVL G P ++
Sbjct: 594 VLGDCSTSEATTDVYKTSGENHDYATLILPGRQQELLEAVCATGK-PVVLVLQIGRPYNL 652
Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMR 633
+ + + W+ PG+ G A A+++FGD+NP GRLPMT +P ++P+ N +
Sbjct: 653 TKESELCKAIIVNWL--PGQEGGLATADVLFGDYNPAGRLPMT-FPRHVGQLPLY-YNFK 708
Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
S R+Y +Y +Y FG+GLSYT++ Y S+L + + NI+
Sbjct: 709 T-SGRRYEYSDLEYYP---LYYFGYGLSYTSFEY------SDLKVEER----ENGNII-- 752
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
++V N G G VV L+ S T +L F
Sbjct: 753 ----------------------AHVNVKNIGHRAGDEVVQLYVTDMYASVKTRITELKDF 790
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
RVH SK ISF + P E LS+ N + R++ G ++VG
Sbjct: 791 TRVHLRPGESKSISFELTPYE-LSLLNDNMDRVVEKGTFKILVG 833
>gi|365122063|ref|ZP_09338970.1| hypothetical protein HMPREF1033_02316 [Tannerella sp.
6_1_58FAA_CT1]
gi|363643257|gb|EHL82578.1| hypothetical protein HMPREF1033_02316 [Tannerella sp.
6_1_58FAA_CT1]
Length = 819
Score = 256 bits (655), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 208/720 (28%), Positives = 316/720 (43%), Gaps = 124/720 (17%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E++HG+ + T P + +++N L GS
Sbjct: 151 RLGIPV-DFSNETIHGLNH-----------TKATPLPAPIGIGSTWNAPLVYKAGSIAGK 198
Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
EA+A+ G T +AP +++ RDPRWGR E GEDP +V+ + VK Q
Sbjct: 199 EAKAL------GYTNIYAPILDLARDPRWGRVLECYGEDPFLVATLGTQMVKGIQ----- 247
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
E ++A KH Y + K G + +
Sbjct: 248 ---------------------------EQGVAATLKHFAVYSVPKGGRDGSVRTDPHVAP 280
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGD-LFQKARNEWGFKGYITSDC 327
++ PF+ I+ +M SYN +GVP L Q R E+GF GY+ SD
Sbjct: 281 REMHQMHLYPFKKVIQDAHPMGVMSSYNDWDGVPVTASYYFLTQLLRQEFGFDGYVVSDS 340
Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDIDR 383
DAV ++ + +T+E++ VL+AG+++ + + + +G++ K ID
Sbjct: 341 DAVEYVYNKHHVAETYEEAVRMVLEAGLNVRTTFAAPDIFILPARKLVKEGRLSMKVIDE 400
Query: 384 ALLNLFSVQLRLGLFN----GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDK 439
+ ++ V+ RLGLF+ DP K V ++K LD RQ +VLLKN+
Sbjct: 401 RVADVLRVKFRLGLFDQPFVADP------KAADKIVGADKNKDFVLDIQRQSLVLLKNEN 454
Query: 440 KFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSK---THYASGC 496
LPL+KN +S + I GPL + M Y ++ G++ Y+ YA GC
Sbjct: 455 NLLPLDKNKLSRILITGPLAKEENYMVSRYGPQELENITVYEGIKNYLGNKVAVDYALGC 514
Query: 497 H--------------DVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLL 542
+ AV AK +D VI V G D E + R L L
Sbjct: 515 KVKDAKWPESEIIHSPLTTEEQQEIQNAVEKAKLSDIVIAVLGEDEESTGESKSRSGLDL 574
Query: 543 PGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEI 602
PG+Q L+ ++ T K PV+LVL G PL +++A D I +IL +PG+ G A+AE
Sbjct: 575 PGRQQQLLEALYATGK-PVVLVLINGQPLTINWA--DRYIPAILEAWFPGQMGGTAIAET 631
Query: 603 IFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQV----YGFGH 658
+FGD+NPGG+LP+T +P++ ++ +N A S+Q P Y T+V Y FG
Sbjct: 632 LFGDYNPGGKLPVT-FPKTLGQIELNFPFKPASQSKQ-PEAGPNGYGKTRVNGALYPFGF 689
Query: 659 GLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQI 718
GLSYT + Y S L +S ++ G + D V
Sbjct: 690 GLSYTTFEY------SNLKVSP-----------ERQGPKGD--------------IQVSF 718
Query: 719 SVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
+TN G G +V L+ + S + E L GF+RV +K I F + P E L I
Sbjct: 719 DITNTGKRAGDEIVQLYVKDKVSSVISYESLLRGFERVSLQPGETKNIQFTLHP-EDLEI 777
>gi|423222018|ref|ZP_17208488.1| hypothetical protein HMPREF1062_00674 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392644204|gb|EIY37946.1| hypothetical protein HMPREF1062_00674 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 942
Score = 256 bits (655), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 210/754 (27%), Positives = 340/754 (45%), Gaps = 129/754 (17%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ S T+FP L ++NR L +G
Sbjct: 153 RLGIPV-DFTNEGIRGVES-----------YRATNFPTQLGLGHTWNRELIRQVGLITGR 200
Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
EAR + G T +AP +++ RD RWGR +E GE P +V+ +E V+ Q +
Sbjct: 201 EARML------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGMQHSH-- 252
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
++A KH +AY K + ++
Sbjct: 253 -----------------------------QVAATGKHFVAYSNNKGAREGMARVDPQMSP 283
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF---QKARNEWGFKGYITS 325
++ E PF+ I++ +M SYN +GVP ++G + + R E GF+GY+ S
Sbjct: 284 REVEMIHVYPFKRVIKEAGLLGVMSSYNDYDGVP--IQGSYYWLTTRLRGEMGFRGYVVS 341
Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDI 381
D DAV ++ + K +++ ++AG+++ C + + + +G + E+ I
Sbjct: 342 DSDAVEYLYTKHSTAKDMKEAVRQSVEAGLNVRCTFRSPDSYVLPLRELVKEGGLSEEVI 401
Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF 441
+ + ++ V+ +GLF+ + G +V +E++ LAL A+R+ +VLLKN+
Sbjct: 402 NDRVRDILRVKFLVGLFDTPYQTDLAG--ADKEVEKAENESLALQASRESLVLLKNENNV 459
Query: 442 LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCHD 498
LPL+ N V +A+ GP + Y + ++L G+ ++ Y GC
Sbjct: 460 LPLDINNVKKIAVCGPNADEEGYALTHYGPLAVEVTTVLEGIRQKAEGKAEVLYTKGCDL 519
Query: 499 VPCN--------------SDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPG 544
V N A +AV A++AD +VV G E++ R SL LPG
Sbjct: 520 VDANWPESELIDYPMTDSEQAEIDKAVENARQADVAVVVLGGGQRTCGENKSRSSLDLPG 579
Query: 545 QQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIF 604
+Q+ L+ +V T K PV+LVL G PL +++A D + IL YPG G A+A+++F
Sbjct: 580 RQLKLLQAVQATGK-PVVLVLINGRPLSINWA--DKFVPVILEAWYPGSKGGTAVADVLF 636
Query: 605 GDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQ------VYGFGH 658
GD+NPGG+L +T +P+S ++P N S Q G G +Y FG+
Sbjct: 637 GDYNPGGKLTVT-FPKSVGQIP---FNFPCKPSSQIDGGKNPGLDGNMSRVNGALYSFGY 692
Query: 659 GLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQI 718
GLSYT + Y S++ IS + + K V+
Sbjct: 693 GLSYTTFEY------SDIEISPKVITPNQKAT-------------------------VRC 721
Query: 719 SVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
VTN G G VV L+ R S T EK L GF+R+H +KE+ F +D +QL +
Sbjct: 722 KVTNTGKRAGDEVVQLYVRDILSSVTTYEKNLAGFERIHLQPGETKEVVFTLD-RKQLEL 780
Query: 779 ANKHGRRILPLGNHVLMVG------ELRHSLTIE 806
+KH ++ G+ +MVG L LT+E
Sbjct: 781 LDKHMEWVVEPGDFSIMVGASSEDIRLSGKLTVE 814
>gi|189467715|ref|ZP_03016500.1| hypothetical protein BACINT_04107 [Bacteroides intestinalis DSM
17393]
gi|189435979|gb|EDV04964.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 943
Score = 256 bits (655), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 211/754 (27%), Positives = 341/754 (45%), Gaps = 129/754 (17%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + GI S T+FP L ++NR L +G
Sbjct: 153 RLGIPV-DFTNEGIRGIES-----------YRATNFPTQLGLGHTWNRELIRQVGLITGR 200
Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
EAR + G T +AP +++ RD RWGR +E GE P +V+ +E V+ Q +
Sbjct: 201 EARIL------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGMQHNH-- 252
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
++A KH +AY K + ++
Sbjct: 253 -----------------------------QVAATGKHFVAYSNNKGAREGMARVDPQMSP 283
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF---QKARNEWGFKGYITS 325
++ E PF+ I++ +M SYN +GVP ++G + + R E GF+GY+ S
Sbjct: 284 REVEMIHVYPFKRVIKEAGLLGVMSSYNDYDGVP--IQGSYYWLTTRLRGEMGFRGYVVS 341
Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDI 381
D DAV ++ + K +++ ++AG+++ C + + + +G + E+ I
Sbjct: 342 DSDAVEYLYTKHSTAKDMKEAVRQSVEAGLNVRCTFRSPDSYVLPLRELVKEGGLSEEVI 401
Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF 441
+ + ++ V+ +GLF+ + G +V +E++ LAL A+R+ +VLLKN+
Sbjct: 402 NDRVRDILRVKFLIGLFDAPYQTDLAG--ADREVEKAENESLALQASRESLVLLKNENNV 459
Query: 442 LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCH- 497
LPL+ N V +A+ GP + Y + ++L G+ ++ Y GC
Sbjct: 460 LPLDINNVKKIAVCGPNADEEGYALTHYGPLAVEVTTVLEGIRQKSEGKAEVLYTKGCDL 519
Query: 498 -----------DVPC--NSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPG 544
D P N A +AV A++AD +VV G E++ R SL LPG
Sbjct: 520 VDANWPESELIDYPMTDNEQAEIDKAVENARQADVAVVVLGGGQRTCGENKSRSSLDLPG 579
Query: 545 QQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIF 604
+Q+ L+ +V T K PV+LVL G PL +++A D + +IL YPG G A+A+++F
Sbjct: 580 RQLKLLQAVQATGK-PVVLVLINGRPLSINWA--DKFVPAILEAWYPGSKGGTAVADVLF 636
Query: 605 GDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQ------VYGFGH 658
GD+NPGG++ +T +P+S ++P N S Q G G +Y FG+
Sbjct: 637 GDYNPGGKMTVT-FPKSVGQIP---FNFPCKPSSQIDGGKNPGLDGNMSRVNGALYSFGY 692
Query: 659 GLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQI 718
GLSYT + Y + S I+ + KA V+
Sbjct: 693 GLSYTTFEYSGIEI-SPKVITPNQKA------------------------------TVRC 721
Query: 719 SVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
VTN G G VV L+ R S T EK L GF+R+H +KE+ F +D +QL +
Sbjct: 722 KVTNTGKRAGDEVVQLYVRDILSSVTTYEKNLAGFERIHLQPGETKEVVFTLD-RKQLEL 780
Query: 779 ANKHGRRILPLGNHVLMVG------ELRHSLTIE 806
+KH ++ G+ +MVG L LT+E
Sbjct: 781 LDKHMEWVVEPGDFSIMVGASSEDIRLSGKLTVE 814
>gi|212692496|ref|ZP_03300624.1| hypothetical protein BACDOR_01992 [Bacteroides dorei DSM 17855]
gi|212664971|gb|EEB25543.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
dorei DSM 17855]
Length = 864
Score = 256 bits (655), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 164/462 (35%), Positives = 234/462 (50%), Gaps = 64/462 (13%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ N++LS RA+ L+ LTL+EK+ + DN+ + RLGI Y WW+E+LHG+A +G
Sbjct: 24 YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSG--- 80
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
T FPQ + AASF I +AV+ EARA A GLT W
Sbjct: 81 -------LATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMW 133
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQ--GENWKSDDGGIGFGFREKRV 223
P +NI+RDPRWGRG ET GEDP + S V VK Q N K D
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCTDANQKYDK------------ 181
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSC 282
+ AC KH + +W +R+ FNA I +D +T+ PF +
Sbjct: 182 ---------------IHACAKHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEAL 223
Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTK 341
+++GK +MC+YN++ G P C L Q R EWG++G + SDC A+ + + + K
Sbjct: 224 VKEGKVKEVMCAYNRLEGDPCCGSDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGH-K 282
Query: 342 THED----SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
TH D SAA VL +G D+ CG+ +SA KG + EKDID ++ L + LG
Sbjct: 283 THPDAESASAAAVL-SGTDLECGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGE 340
Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
+ DP K ++ K+ VC++EH L+LD AR+ + LL N LPL + ++A++GP
Sbjct: 341 MD-DPSKVEWTKIPYSVVCSAEHDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGP 398
Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGC 496
N+ G Y G P +LL G+ + + K Y GC
Sbjct: 399 NANDSVMQWGNYNGTPKHTITLLEGIRSAMGENDKLIYEQGC 440
Score = 121 bits (304), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 97/325 (29%), Positives = 147/325 (45%), Gaps = 66/325 (20%)
Query: 491 HYASGCHDVPCNSDAGFHEAVRI------AKKADFVIVVAGLDLTQETEDR--------- 535
Y SG D N D GF E V I K AD VI G+ + E E+
Sbjct: 572 QYFSG--DAQLNFDLGFKEEVNIKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFRK 629
Query: 536 -DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEA 594
DR + LP Q L+ ++ K+ VI V G P ++ +IL YPG++
Sbjct: 630 GDRTDIELPAVQRELIKALCDAGKK-VIFVNFSGSP--IAMEPETKYCQAILQAWYPGQS 686
Query: 595 GAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQV 653
G KA AE++FGD+NP GRLP+T+Y + ++P D NM GR+YR++ G +
Sbjct: 687 GGKAAAEVLFGDYNPAGRLPVTFY-RNIAQLPDFEDYNMT--------GRTYRYFKGDPL 737
Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
+ FG+GLSYT ++Y + + ++K G ++
Sbjct: 738 FPFGYGLSYTTFNY------DNIKLDQTIKVGETAKMV---------------------- 769
Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPC 773
I VTNAG+ DG VV ++ + + ++G P K L F RV A + + + P
Sbjct: 770 ----IPVTNAGNRDGEEVVQVYLKKQEDAEG-PAKTLRAFKRVQIPAGKTVNVELELTP- 823
Query: 774 EQLSIANKHGRRILPL-GNHVLMVG 797
+QL + + + GN +MVG
Sbjct: 824 KQLEWWDAQTNTMRTIAGNFDIMVG 848
>gi|265752711|ref|ZP_06088280.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
3_1_33FAA]
gi|263235897|gb|EEZ21392.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
3_1_33FAA]
Length = 864
Score = 256 bits (655), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 164/462 (35%), Positives = 234/462 (50%), Gaps = 64/462 (13%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ N++LS RA+ L+ LTL+EK+ + DN+ + RLGI Y WW+E+LHG+A +G
Sbjct: 24 YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSG--- 80
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
T FPQ + AASF I +AV+ EARA A GLT W
Sbjct: 81 -------LATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMW 133
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQ--GENWKSDDGGIGFGFREKRV 223
P +NI+RDPRWGRG ET GEDP + S V VK Q N K D
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCTDANQKYDK------------ 181
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSC 282
+ AC KH + +W +R+ FNA I +D +T+ PF +
Sbjct: 182 ---------------IHACAKHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEAL 223
Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTK 341
+++GK +MC+YN++ G P C L Q R EWG++G + SDC A+ + + + K
Sbjct: 224 VKEGKVKEVMCAYNRLEGDPCCGSDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGH-K 282
Query: 342 THED----SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
TH D SAA VL +G D+ CG+ +SA KG + EKDID ++ L + LG
Sbjct: 283 THPDAESASAAAVL-SGTDLECGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGE 340
Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
+ DP K ++ K+ VC++EH L+LD AR+ + LL N LPL + ++A++GP
Sbjct: 341 MD-DPSKVEWTKIPYSVVCSAEHDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGP 398
Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGC 496
N+ G Y G P +LL G+ + + K Y GC
Sbjct: 399 NANDSVMQWGNYNGTPKHTITLLEGIRSAMGENDKLIYEQGC 440
Score = 120 bits (302), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 98/326 (30%), Positives = 150/326 (46%), Gaps = 68/326 (20%)
Query: 491 HYASGCHDVPCNSDAGFHEAVRI------AKKADFVIVVAGLDLTQETEDR--------- 535
Y SG D N D GF E V I K AD VI G+ + E E+
Sbjct: 572 QYFSG--DAQLNFDLGFKEEVNIKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFRK 629
Query: 536 -DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQ-ISSILWIGYPGE 593
DR + LP Q L+ ++ K+ VI V G P+ + E ++Q +IL YPG+
Sbjct: 630 GDRTDIELPAVQRELIKALCDAGKK-VIFVNFSGSPIAM---EPETQYCQAILQAWYPGQ 685
Query: 594 AGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQ 652
+G KA AE++FGD+NP GRLP+T+Y + ++P D NM GR+YR++ G
Sbjct: 686 SGGKAAAEVLFGDYNPAGRLPVTFY-RNIAQLPDFEDYNMT--------GRTYRYFKGDP 736
Query: 653 VYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSL 712
++ FG+GLSYT ++Y + + ++K G ++
Sbjct: 737 LFPFGYGLSYTTFNY------GNIKLEQTIKVGETAKMV--------------------- 769
Query: 713 RFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDP 772
I VTN G+ DG VV ++ + + ++G P K L F RV A + + + P
Sbjct: 770 -----IPVTNTGNRDGEEVVQVYLKKQEDTEG-PAKTLRAFKRVQIPAGKTVNVELELTP 823
Query: 773 CEQLSIANKHGRRILPL-GNHVLMVG 797
+QL + + + GN +MVG
Sbjct: 824 -KQLEWWDAQTNTMRTIAGNFDIMVG 848
>gi|237709184|ref|ZP_04539665.1| glycoside hydrolase family 3 protein [Bacteroides sp. 9_1_42FAA]
gi|229456880|gb|EEO62601.1| glycoside hydrolase family 3 protein [Bacteroides sp. 9_1_42FAA]
Length = 864
Score = 256 bits (655), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 164/462 (35%), Positives = 234/462 (50%), Gaps = 64/462 (13%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ N++LS RA+ L+ LTL+EK+ + DN+ + RLGI Y WW+E+LHG+A +G
Sbjct: 24 YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSG--- 80
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
T FPQ + AASF I +AV+ EARA A GLT W
Sbjct: 81 -------LATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMW 133
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQ--GENWKSDDGGIGFGFREKRV 223
P +NI+RDPRWGRG ET GEDP + S V VK Q N K D
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCTDANQKYDK------------ 181
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSC 282
+ AC KH + +W +R+ FNA I +D +T+ PF +
Sbjct: 182 ---------------IHACAKHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEAL 223
Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTK 341
+++GK +MC+YN++ G P C L Q R EWG++G + SDC A+ + + + K
Sbjct: 224 VKEGKVKEVMCAYNRLEGDPCCGSDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGH-K 282
Query: 342 THED----SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
TH D SAA VL +G D+ CG+ +SA KG + EKDID ++ L + LG
Sbjct: 283 THPDAESASAAAVL-SGTDLECGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGE 340
Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
+ DP K ++ K+ VC++EH L+LD AR+ + LL N LPL + ++A++GP
Sbjct: 341 MD-DPSKVEWTKIPYSVVCSAEHDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGP 398
Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGC 496
N+ G Y G P +LL G+ + + K Y GC
Sbjct: 399 NANDSVMQWGNYNGTPKHTITLLEGIRSAMGENDKLIYEQGC 440
Score = 121 bits (304), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 97/325 (29%), Positives = 147/325 (45%), Gaps = 66/325 (20%)
Query: 491 HYASGCHDVPCNSDAGFHEAVRI------AKKADFVIVVAGLDLTQETEDR--------- 535
Y SG D N D GF E V I K AD VI G+ + E E+
Sbjct: 572 QYFSG--DAQLNFDLGFKEEVNIKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFRK 629
Query: 536 -DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEA 594
DR + LP Q L+ ++ K+ VI V G P ++ +IL YPG++
Sbjct: 630 GDRTDIELPAVQRELIKALCDAGKK-VIFVNFSGSP--IAMEPETKYCQAILQAWYPGQS 686
Query: 595 GAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQV 653
G KA AE++FGD+NP GRLP+T+Y + ++P D NM GR+YR++ G +
Sbjct: 687 GGKAAAEVLFGDYNPAGRLPVTFY-RNIAQLPDFEDYNMT--------GRTYRYFKGDPL 737
Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
+ FG+GLSYT ++Y + + ++K G ++
Sbjct: 738 FPFGYGLSYTTFNY------DNIKLDQTIKVGETAKMV---------------------- 769
Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPC 773
I VTNAG+ DG VV ++ + + ++G P K L F RV A + + + P
Sbjct: 770 ----IPVTNAGNRDGEEVVQVYLKKQEDAEG-PAKTLRAFKRVQIPAGKTVNVELELTP- 823
Query: 774 EQLSIANKHGRRILPL-GNHVLMVG 797
+QL + + + GN +MVG
Sbjct: 824 KQLEWWDAQTNTMRTIAGNFDIMVG 848
>gi|345514226|ref|ZP_08793739.1| glycoside hydrolase family beta-glycosidase [Bacteroides dorei
5_1_36/D4]
gi|229437207|gb|EEO47284.1| glycoside hydrolase family beta-glycosidase [Bacteroides dorei
5_1_36/D4]
Length = 864
Score = 256 bits (655), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 164/462 (35%), Positives = 234/462 (50%), Gaps = 64/462 (13%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ N++LS RA+ L+ LTL+EK+ + DN+ + RLGI Y WW+E+LHG+A +G
Sbjct: 24 YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSG--- 80
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
T FPQ + AASF I +AV+ EARA A GLT W
Sbjct: 81 -------LATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMW 133
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQ--GENWKSDDGGIGFGFREKRV 223
P +NI+RDPRWGRG ET GEDP + S V VK Q N K D
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCTDANQKYDK------------ 181
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSC 282
+ AC KH + +W +R+ FNA I +D +T+ PF +
Sbjct: 182 ---------------IHACAKHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEAL 223
Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTK 341
+++GK +MC+YN++ G P C L Q R EWG++G + SDC A+ + + + K
Sbjct: 224 VKEGKVKEVMCAYNRLEGDPCCGSDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGH-K 282
Query: 342 THED----SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
TH D SAA VL +G D+ CG+ +SA KG + EKDID ++ L + LG
Sbjct: 283 THPDAESASAAAVL-SGTDLECGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGE 340
Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
+ DP K ++ K+ VC++EH L+LD AR+ + LL N LPL + ++A++GP
Sbjct: 341 MD-DPSKVEWTKIPYSVVCSAEHDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGP 398
Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGC 496
N+ G Y G P +LL G+ + + K Y GC
Sbjct: 399 NANDSVMQWGNYNGTPKHTITLLEGIRSAMGENDKLIYEQGC 440
Score = 121 bits (304), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 97/325 (29%), Positives = 147/325 (45%), Gaps = 66/325 (20%)
Query: 491 HYASGCHDVPCNSDAGFHEAVRI------AKKADFVIVVAGLDLTQETEDR--------- 535
Y SG D N D GF E V I K AD VI G+ + E E+
Sbjct: 572 QYFSG--DAQLNFDLGFKEEVNIKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFRK 629
Query: 536 -DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEA 594
DR + LP Q L+ ++ K+ VI V G P ++ +IL YPG++
Sbjct: 630 GDRTDIELPAVQRELIKALCDAGKK-VIFVNFSGSP--IAMEPETKYCQAILQAWYPGQS 686
Query: 595 GAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQV 653
G KA AE++FGD+NP GRLP+T+Y + ++P D NM GR+YR++ G +
Sbjct: 687 GGKAAAEVLFGDYNPAGRLPVTFY-RNIAQLPDFEDYNMT--------GRTYRYFKGDPL 737
Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
+ FG+GLSYT ++Y + + ++K G ++
Sbjct: 738 FPFGYGLSYTTFNY------DNIKLDQTIKVGETAKMV---------------------- 769
Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPC 773
I VTNAG+ DG VV ++ + + ++G P K L F RV A + + + P
Sbjct: 770 ----IPVTNAGNRDGEEVVQVYLKKQEDAEG-PAKTLRAFKRVQIPAGKTVNVELELTP- 823
Query: 774 EQLSIANKHGRRILPL-GNHVLMVG 797
+QL + + + GN +MVG
Sbjct: 824 KQLEWWDAQTNTMRTIAGNFDIMVG 848
>gi|293371677|ref|ZP_06618088.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
gi|292633374|gb|EFF51944.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
Length = 783
Score = 256 bits (655), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 214/749 (28%), Positives = 339/749 (45%), Gaps = 132/749 (17%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP + E+ HG + G T FP + AA+++ L +G A+
Sbjct: 131 RLGIPLF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSPQLIREVGKAIGK 178
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
E R Q G + P +++ RDPRW R +ET GEDP++
Sbjct: 179 EIRL-----QGGHISYGPVLDLARDPRWSRVEETFGEDPVLT------------------ 215
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
G IG K +++G G D A KH +AY + + G SF + +
Sbjct: 216 --GEIG-----KAIVEGLGG-GDLSRPYSTLATLKHFLAYGISESGQNGNPSFAGI---R 264
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCD 328
+ + F PPFR I+ G A +M SYN ++GVP L + RNEW F G + SD
Sbjct: 265 ELHENFLPPFRQAIDAG-ALSVMTSYNSMDGVPCTANHSLLTELLRNEWKFSGIVVSDLY 323
Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
++ I + T E++A L AG+D++ G + +A++ G++ + +D ++ +
Sbjct: 324 SIEGIHQSHFVAPTMEEAAVLALSAGVDVDLGGDAYMNLMNAVNTGRIGKTALDASVARV 383
Query: 389 FSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
++ +GLF DP K K +V + E LA A+ I LLKN+ LPLN
Sbjct: 384 LRLKFEMGLFENPYVDPEKAK------KEVRSEESVTLARRVAQASITLLKNEHSLLPLN 437
Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTGI--PCSPKSLLRGLEAYV--SKTHYASGCHDVPC 501
KN +A+IGP +N M G YT + K++L G+ + S+ Y GC +
Sbjct: 438 KN--RKVALIGPNADNRYNMLGDYTAPQEEANIKTVLDGIRTKLSSSQVEYVKGC-SIRD 494
Query: 502 NSDAGFHEAVRIAKKADFVIVVAG----LDLT---QET----------------EDRDRV 538
+AV A++++ +I V G D +ET E DR
Sbjct: 495 TVTTDIEQAVAAAQRSEIIIAVVGGSSARDFKTSYKETGAAIANEKTISDMECGEGFDRA 554
Query: 539 SLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKA 598
+L L G+Q L+ ++ +T+ +P+++V G PLD ++A ++ ++L YPG+ G A
Sbjct: 555 TLSLLGKQQELLKAL-KTTGKPLVVVYIEGRPLDKNWASENAD--AVLTAYYPGQEGGIA 611
Query: 599 LAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGH 658
+A+++FGDFNP GRLP + P S ++P+ N +A S Y S + +Y FG+
Sbjct: 612 IADVLFGDFNPAGRLPFS-VPRSVGQIPLY-YNKKAPQSHDYVEMS-----ASPLYPFGY 664
Query: 659 GLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQI 718
GLSYT++ Y S+L +SA T F V
Sbjct: 665 GLSYTSFDY------SDLHLSA----------------------------LTPRSFEVSF 690
Query: 719 SVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
V N G DG V L+ R S P KQL F R + +E+ F + E S+
Sbjct: 691 KVRNTGKYDGEEVAQLYLRDEYASVVQPLKQLKHFARFYLKRGEEREVKF-ILSEEDFSL 749
Query: 779 ANKHGRRILPLGNHVLMVGELRHSLTIET 807
+++ + I+ G +M+G + ++T
Sbjct: 750 VDRNLKSIVEPGTFQIMIGAASDDIRLQT 778
>gi|423230604|ref|ZP_17217008.1| hypothetical protein HMPREF1063_02828 [Bacteroides dorei
CL02T00C15]
gi|423244313|ref|ZP_17225388.1| hypothetical protein HMPREF1064_01594 [Bacteroides dorei
CL02T12C06]
gi|392630748|gb|EIY24734.1| hypothetical protein HMPREF1063_02828 [Bacteroides dorei
CL02T00C15]
gi|392642494|gb|EIY36260.1| hypothetical protein HMPREF1064_01594 [Bacteroides dorei
CL02T12C06]
Length = 864
Score = 256 bits (655), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 164/462 (35%), Positives = 234/462 (50%), Gaps = 64/462 (13%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ N++LS RA+ L+ LTL+EK+ + DN+ + RLGI Y WW+E+LHG+A +G
Sbjct: 24 YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSG--- 80
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
T FPQ + AASF I +AV+ EARA A GLT W
Sbjct: 81 -------LATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMW 133
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQ--GENWKSDDGGIGFGFREKRV 223
P +NI+RDPRWGRG ET GEDP + S V VK Q N K D
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCTDANQKYDK------------ 181
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSC 282
+ AC KH + +W +R+ FNA I +D +T+ PF +
Sbjct: 182 ---------------IHACAKHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEAL 223
Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTK 341
+++GK +MC+YN++ G P C L Q R EWG++G + SDC A+ + + + K
Sbjct: 224 VKEGKVKEVMCAYNRLEGDPCCGSDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGH-K 282
Query: 342 THED----SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
TH D SAA VL +G D+ CG+ +SA KG + EKDID ++ L + LG
Sbjct: 283 THPDAESASAAAVL-SGTDLECGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGE 340
Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
+ DP K ++ K+ VC++EH L+LD AR+ + LL N LPL + ++A++GP
Sbjct: 341 MD-DPSKVEWTKIPYSVVCSAEHDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGP 398
Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGC 496
N+ G Y G P +LL G+ + + K Y GC
Sbjct: 399 NANDSVMQWGNYNGTPKHTITLLEGIRSAMGENDKLIYEQGC 440
Score = 120 bits (300), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 96/325 (29%), Positives = 146/325 (44%), Gaps = 66/325 (20%)
Query: 491 HYASGCHDVPCNSDAGFHEAVRI------AKKADFVIVVAGLDLTQETEDR--------- 535
Y SG D N D GF E V I K AD VI G+ + E E+
Sbjct: 572 QYFSG--DAQLNFDLGFKEEVNIKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFRK 629
Query: 536 -DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEA 594
DR + LP Q L+ ++ K+ VI V G P ++ +IL YPG++
Sbjct: 630 GDRTDIELPAVQRELIKALCDAGKK-VIFVNFSGSP--IAMEPETKYCQAILQAWYPGQS 686
Query: 595 GAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQV 653
G KA AE++FGD+NP GRLP+T+Y + ++P D NM GR+YR++ G +
Sbjct: 687 GGKAAAEVLFGDYNPAGRLPVTFY-RNIAQLPDFEDYNMT--------GRTYRYFKGDPL 737
Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
+ FG+GLSYT ++Y + + ++K G ++
Sbjct: 738 FPFGYGLSYTTFNY------DNIKLEQTIKVGETAKMV---------------------- 769
Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPC 773
I VTN G+ DG VV ++ + + ++G P K L F RV A + + + P
Sbjct: 770 ----IPVTNTGNRDGEEVVQVYLKKQEDTEG-PTKTLRAFKRVQIPAGKTVNVELELTP- 823
Query: 774 EQLSIANKHGRRILPL-GNHVLMVG 797
+QL + + + GN +MVG
Sbjct: 824 KQLEWWDAQTNTMRTIAGNFDIMVG 848
>gi|62321294|dbj|BAD94522.1| beta-xylosidase - like protein [Arabidopsis thaliana]
Length = 287
Score = 256 bits (654), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 130/291 (44%), Positives = 183/291 (62%), Gaps = 9/291 (3%)
Query: 518 DFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAE 577
D V++V G D + E E DRV L LPG+Q LVT VA ++ PV+LV+ GG D++FA+
Sbjct: 1 DSVVLVVGADQSIEREGHDRVDLYLPGKQQELVTRVAMAARGPVVLVIMSGGGFDITFAK 60
Query: 578 ADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADS 636
D +I+SI+W+GYPGEAG A+ ++IFG NP G LPMTWYP+S+ KVPM++MNMR D
Sbjct: 61 NDKKITSIMWVGYPGEAGGLAIPDVIFGRHNPSGNLPMTWYPQSYVEKVPMSNMNMRPDK 120
Query: 637 SRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGS 696
S+ YPGRSYRFYTG VY F L+YT + ++ + AP +++S + Q
Sbjct: 121 SKGYPGRSYRFYTGETVYAFADALTYTKFDHQLIKAPRLVSLSLDENHPCRSSECQS--- 177
Query: 697 RLDYV--HIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFD 754
LD + H + S F V ++V N GD GSH V LF P+V G+P KQL+GF+
Sbjct: 178 -LDAIGPHCENAVEGGS-DFEVHLNVKNTGDRAGSHTVFLFTTSPQV-HGSPIKQLLGFE 234
Query: 755 RVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
++ + F V+ C+ LS+ ++ G+R + LG+H+L VG L+HSL I
Sbjct: 235 KIRLGKSEEAVVRFNVNVCKDLSVVDETGKRKIALGHHLLHVGSLKHSLNI 285
>gi|393784569|ref|ZP_10372732.1| hypothetical protein HMPREF1071_03600 [Bacteroides salyersiae
CL02T12C01]
gi|392665550|gb|EIY59074.1| hypothetical protein HMPREF1071_03600 [Bacteroides salyersiae
CL02T12C01]
Length = 929
Score = 256 bits (654), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 147/436 (33%), Positives = 230/436 (52%), Gaps = 49/436 (11%)
Query: 51 SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
++PF + SLS RAK+L+SLLTL+EKI Q+ AIPRL I Y +W+E++HG+A +G
Sbjct: 39 THPFQDESLSFHERAKNLVSLLTLEEKINQVGHQTLAIPRLNIKGYNYWNEAIHGVARSG 98
Query: 111 PGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNIN 170
TSFP +++++ L + A + EAR N GL +W P IN
Sbjct: 99 LA----------TSFPVSKAMSSTWDLPLIFDCAVATSDEARVYSNTKDKGLIYWCPTIN 148
Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
+ RDPRWGR +E GEDP + AVE++K QG++ K
Sbjct: 149 MSRDPRWGRDEENYGEDPFLTGKIAVEYIKGMQGDDPKY--------------------- 187
Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
A KH A + EK R+S ++ + ++ + + P F +++G
Sbjct: 188 ------YKTIATAKHFAANNYEK----GRHSTSSDMDARNLREYYLPAFEMAVKEGNVRS 237
Query: 291 IMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFE--YQNYTKTHEDSA 347
+M +YN +NG+P +L R EWGF G++TSDC AV +++ ++ T +++
Sbjct: 238 VMSAYNALNGIPCGANHELLIDILRTEWGFNGFVTSDCGAVDDVYQSNRHHFVNTAAEAS 297
Query: 348 AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKY 407
A + G D+NCG + + AI+KG +QE D+D AL+ +F + +G F+ + +
Sbjct: 298 AVSIVNGEDLNCGNTFQDYCKEAIEKGYMQEADLDTALVRVFEARFSVGEFD-NASNVPW 356
Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
+ D + EH++LA AA++ IVLLKND LPL+K S+A+IGP N I+ G
Sbjct: 357 RSISDDVLDCEEHRQLAYKAAQEAIVLLKNDNNILPLDK--TKSVAVIGPFGNTITL--G 412
Query: 468 GYTGIPCSPKSLLRGL 483
GY+G P + + G+
Sbjct: 413 GYSGSPTALTTPFGGI 428
Score = 119 bits (299), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 88/288 (30%), Positives = 136/288 (47%), Gaps = 42/288 (14%)
Query: 495 GCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVA 554
GC V ++ A IA KAD VI AG DLT E DR +L LPG Q L+ +V
Sbjct: 592 GCA-VTGTAETNLERAKEIAAKADVVIFAAGTDLTVSDESHDRTNLNLPGDQQKLLEAVY 650
Query: 555 RTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLP 614
+ ++L+ T V+ A + +I+ Y G+A KA+A++++GD+NP G+L
Sbjct: 651 SANPNVILLLQTCSS---VTINWAKEHVPAIIEAWYGGQAQGKAIADVLYGDYNPSGKLT 707
Query: 615 MTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPS 674
TWY + + +P +N ++ +Y ++ T +Y FG+G+SYT + Y+
Sbjct: 708 STWY-NALSDLPNGMLNYDIRDAK----YTYMYHDKTPLYPFGYGMSYTTFEYQ------ 756
Query: 675 ELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVML 734
+L IS S A ++ I V +TN G G+ +V L
Sbjct: 757 KLNISKSRLAAGEELI-------------------------VSADITNTGKYAGAEIVQL 791
Query: 735 FARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
+A V S P KQL+GF RV +K ++ + EQLS N+
Sbjct: 792 YAHV-NSSIERPLKQLVGFARVELEPGETKTVTMPLKH-EQLSYFNEQ 837
>gi|313205375|ref|YP_004044032.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
gi|312444691|gb|ADQ81047.1| glycoside hydrolase family 3 domain protein [Paludibacter
propionicigenes WB4]
Length = 858
Score = 256 bits (654), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 161/459 (35%), Positives = 227/459 (49%), Gaps = 62/459 (13%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
P+ N LS RA L++ LTL EK + +N+ AIPRLGI AYEWW+E+LHG+ +G
Sbjct: 23 PYQNPKLSAEVRATDLLARLTLAEKAALMQNNSPAIPRLGIKAYEWWNEALHGVGRSG-- 80
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTF 164
T FPQ + AASFN L + +AV+ EARA N GLT+
Sbjct: 81 --------VATVFPQAIGMAASFNNGLLFDAFTAVSDEARAKSNKFSEQGGLKRYQGLTY 132
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
W PN+NIFRDPRWGRGQET GEDP + S V VK QG
Sbjct: 133 WTPNVNIFRDPRWGRGQETYGEDPYLTSLMGVAVVKGLQG-------------------- 172
Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
D + L AC KH + +W +R+SFNA I +D +T+ P F++ +
Sbjct: 173 ------PDNAEYDKLHACAKHFAVHSGPEW---NRHSFNAENINPRDLWETYLPAFKALV 223
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
++ +MC+YN+ P C L Q RN+W F G + SDC A++ ++ N T
Sbjct: 224 QKADVKEVMCAYNRFEDEPCCGSNRLLTQILRNDWKFDGLVVSDCWAISDFYK-PNAHAT 282
Query: 343 HED---SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
D +AA + G D+ CG+ R+ A+ G ++EK ID +L L + LG N
Sbjct: 283 QPDATHAAANAVLNGTDLECGSD-FRNLPEAVKAGLIEEKRIDVSLKRLLKARFELGEMN 341
Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
D + + V + +H+ LAL A + IVLL+N+ LPL+K +A++GP
Sbjct: 342 SD----QVWPISYSVVNSEKHQNLALRMAEESIVLLQNNNNILPLSKKL--KIAVMGPNA 395
Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGC 496
N+ G Y G P +LL + ++ Y GC
Sbjct: 396 NDSVMQWGNYNGFPAHTVTLLEAMRKSFPGAQLIYEPGC 434
Score = 98.2 bits (243), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 90/336 (26%), Positives = 143/336 (42%), Gaps = 67/336 (19%)
Query: 450 SSLAIIGPLVN-NISQMGGGYTGIPCSPKSL-LRGLEAYVSKTHYASGCHDVPCNSDAG- 506
+SL I G +VN N+S P + L ++Y + ++ G D N D G
Sbjct: 526 TSLTINGEIVNRNVS------ANTPTNVYQLKAEAGKSYEIEIAFSQGNTDANLNFDFGT 579
Query: 507 -----FHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVT 551
++ K AD V+ G+ + E E+ DR + LP Q L+
Sbjct: 580 LVPLDLSASIAKVKDADVVVFAGGIAPSLEGEEMRVTVPGFKGGDRTDIELPAIQRRLLQ 639
Query: 552 SVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGG 611
++ K+ V + +G + +IL YPG+AG A+A ++ G++NP G
Sbjct: 640 ALKDAGKKVVFVNFSGSA---MGLVPETQSCEAILQAWYPGQAGGTAVANVLLGNYNPSG 696
Query: 612 RLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLS 671
RLP+T+Y D +M+ GR+YR+ T ++ FG+GLSYT KF+
Sbjct: 697 RLPVTFYKNVAQLPDFEDYSMK--------GRTYRYMTEKPLFSFGYGLSYT----KFVL 744
Query: 672 APSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHV 731
++L S S+KA + + VTNAG V G+ V
Sbjct: 745 GTAKLNKS-SIKANET--------------------------LKITVPVTNAGKVAGTEV 777
Query: 732 VMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEIS 767
+ ++ R K G P K L GF +V+ + +IS
Sbjct: 778 LQVYVRKVKDVDG-PAKTLRGFKKVNIEPGKTSQIS 812
>gi|427387354|ref|ZP_18883410.1| hypothetical protein HMPREF9447_04443 [Bacteroides oleiciplenus YIT
12058]
gi|425725515|gb|EKU88386.1| hypothetical protein HMPREF9447_04443 [Bacteroides oleiciplenus YIT
12058]
Length = 786
Score = 256 bits (654), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 222/743 (29%), Positives = 349/743 (46%), Gaps = 125/743 (16%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ + T FP A++N+ L S I A
Sbjct: 137 RLGIPV-DFTNEGIRGLCHD-----------RATMFPAQCGQGATWNKELISEIAKVTAE 184
Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
EA+A+ G T ++P ++I +DPRWGR E GEDP +V
Sbjct: 185 EAKAL------GYTNIYSPILDIAQDPRWGRVVECYGEDPFLV----------------- 221
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
G +G KR++KG E L + KH Y + G + + +
Sbjct: 222 ---GELG-----KRMIKGLQAEG-------LVSTPKHFAVYSIPVGGRDAGTRTDPHVAP 266
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGD---LFQKARNEWGFKGYITS 325
++ + PFR + A +M SYN +G P + G L + R+EWGFKGY+ S
Sbjct: 267 REMRTLYIEPFRKAFCEAGALGVMSSYNDYDGEP--ITGSYHFLTEILRHEWGFKGYVVS 324
Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCM----LRHTQSAIDKGKVQEKDI 381
D +AV ++ N D AA V+ AG+++ + +R + AI +GKV E+ I
Sbjct: 325 DSEAVEFLYSKHNVAANAVDGAAQVINAGLNVRTNFTLPENFIRPLRQAISEGKVSEQTI 384
Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTS-EHKKLALDAARQGIVLLKNDKK 440
D + ++ V+ +GLF+ +P KG K P+ V S EH+ +++ AA + IVLLKN+
Sbjct: 385 DSRVADVLRVKFMMGLFD-NPYKGDAKK--PEKVVHSKEHQAVSMRAALESIVLLKNENN 441
Query: 441 FLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGC-- 496
LPL+K + +A+IGP + + Y K++ +G++ Y+ + YA G
Sbjct: 442 ILPLSK-STKKVAVIGPNAAEVDNLICRYGPANAPIKTVYQGIKDYLPDADVRYAKGADI 500
Query: 497 ----------HDVPCNSD--AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPG 544
+DVP + D A EAV +AK++D I+V G + E+ R +L L G
Sbjct: 501 IDKYFPESELYDVPLDKDEQAMIDEAVALAKESDVAIMVLGGNEKTVREEYSRTNLDLCG 560
Query: 545 QQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIF 604
+Q L+ +V T K PV+L+L G +++AE I I+ +PGE A+A+++F
Sbjct: 561 RQEKLLQAVYATGK-PVVLLLVDGRAATINWAE--HYIPGIVHAWFPGEFMGDAVAKVLF 617
Query: 605 GDFNPGGRLPMTWYPESFTKVPMN-DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYT 663
GD+NPGG+L +T +P S ++P +DS + + TGT +Y FG+GLSYT
Sbjct: 618 GDYNPGGKLAVT-FPRSVGQIPFAFPFKPGSDS------KGFVRVTGT-LYPFGYGLSYT 669
Query: 664 NYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNA 723
++Y S+L I + I Q +L SC V N
Sbjct: 670 TFAY------SDLKIENPV-------IGVQGSVKL---------SC---------KVKNT 698
Query: 724 GDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHG 783
G V G VV L+ S T K L GF+RVH K ++F + P ++L + NK
Sbjct: 699 GKVAGDEVVQLYLHDEMSSVTTYVKVLRGFERVHLEPGEEKTVNFVLTP-QELGLWNKDN 757
Query: 784 RRILPLGNHVLMVGELRHSLTIE 806
++ G +MVG + ++
Sbjct: 758 HFVVEPGTFAVMVGSSSQDIRLQ 780
>gi|358342292|dbj|GAA27551.2| probable beta-D-xylosidase 7 [Clonorchis sinensis]
Length = 826
Score = 256 bits (653), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 226/835 (27%), Positives = 363/835 (43%), Gaps = 150/835 (17%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDN--------ASAIPRLGIPAYEWWSESL 103
+PF N SL + R L++ LT +E IQQ+S+ A I RL I AY+W +
Sbjct: 27 HPFRNPSLPANFRVDDLLARLTNEELIQQVSNGGAGPQHGPAPGIARLNISAYQWRTNPG 86
Query: 104 HGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--- 160
G +T FPQ + A+F+ + A +E RA +N +A
Sbjct: 87 DG---------------RITPFPQPVNLGATFDVHTVYRVARATGLEMRARWNRAKAKKT 131
Query: 161 -----GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
G+ +AP +N+ R P WGR QET GEDP ++ A FV+ G WK+ +
Sbjct: 132 YRDGNGIHLFAPVVNLLRHPLWGRNQETFGEDPFMIGKLARTFVRGLGG--WKNAEPQ-- 187
Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTF 275
E+ + S + D L++ A CKH + + SR SF A +T+ D T+
Sbjct: 188 -SLDEQNL-------SSQPDVLLVGANCKHFAVHTGPEDFPVSRLSFEANVTDVDLWQTY 239
Query: 276 QPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVA-TI 333
P FR+C+E G S +MC+Y+ +NG P C+ L + R +W FKG++ +DC A+ I
Sbjct: 240 LPAFRACLEAGAVS-VMCAYSGINGTPDCINHWLLTELLRQKWKFKGFVVTDCGALQFVI 298
Query: 334 FEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAID----KGKVQEKDIDRALLNLF 389
+++Q + +E + A V +AG+++ S + G + + LF
Sbjct: 299 WKHQIFNHYNETAMAAV-RAGVNLENSVVYATEVFSTLPHLLASGSLSRDQLIEMARPLF 357
Query: 390 SVQLRLGLFNG---DPRKGKYGKLGPDDVCTSE-HKKLALDAARQGIVLLKNDKKFLPLN 445
+L G FN DP Y L P++ +E H+++AL + IVLL+N +FLPL
Sbjct: 358 LTRLMQGEFNPVEMDP----YRLLAPEEAILNEDHRRVALATTARSIVLLQNRDRFLPLK 413
Query: 446 KNAVSS------LAIIGPLVNNISQMGGGYTGIPCSPK---SLLRGLEAYVSKTHYASGC 496
N S +AI+GP +++++ G Y P P+ L +GL + H + C
Sbjct: 414 NNMSDSGGPLRHIAIVGPFATSVTELYGHYRTAP-EPEIEVPLSKGLSQLSRRMHASDIC 472
Query: 497 HD-VPCNS--DAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSV 553
D C+S D H + D +++ G E E+ DR ++ LPG+Q L+
Sbjct: 473 TDGGRCSSLNDDALHSTLGY-DDLDLIVLSLGTGSEVEGENVDRQNITLPGKQPELLEET 531
Query: 554 AR---------TSKR--PVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEI 602
+ SKR P+IL++ GP+++S A + + +I W G+PG A+ +
Sbjct: 532 LKLSSGLGNSGLSKRTVPIILLVFSAGPINISRAVENENVKAIFWCGFPGPLVGDAMRHL 591
Query: 603 IFGD---------------------------------FNPGGRLPMTWYPESFTKVPMND 629
+ G + P RLP TWY +
Sbjct: 592 LLGSSGELFGPSKPISVGFHSFQEAYRWDVTPDDGYWWIPAARLPFTWYESIDQLANITV 651
Query: 630 MNMRADSSRQYPGRSYRFYTGTQV---YGFGHGLSYTNYSYKFLSAPSELTISASLKAGS 686
M + R P + + ++ Y FG+GLSY N++ LS S S + S
Sbjct: 652 YEMTNQTYRYLPTQCHMSSEDCKIPVLYPFGYGLSY-NFN---LSGASGFVYSDLIAPSS 707
Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKV----- 741
V+S + F+V +V N G + VV ++ +
Sbjct: 708 ------------------AVSSNQRIVFYV--TVQNEGPIACEEVVQVYTKWLNRTENDN 747
Query: 742 SQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
S+ P QL GF+RV K++ F + P E L++ + ++P G VL +
Sbjct: 748 SRNGPLIQLAGFERVRLDVGEYKQLKFTLIPSEHLAVWSLSENTMIP-GRGVLQI 801
>gi|189468358|ref|ZP_03017143.1| hypothetical protein BACINT_04755 [Bacteroides intestinalis DSM
17393]
gi|189436622|gb|EDV05607.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 865
Score = 256 bits (653), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 155/459 (33%), Positives = 237/459 (51%), Gaps = 58/459 (12%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
P+ N +LS S RA L+ +TL+EKI Q+ + + AI RLGIPAY+WW+E+LHG+A G
Sbjct: 24 PYRNPNLSPSERAWDLLKRMTLEEKISQMKNGSPAIERLGIPAYDWWNEALHGVARAG-- 81
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL--------GQAGLTF 164
T FPQ + AA+F+ V+ EARA Y+ G GLTF
Sbjct: 82 --------KATVFPQAIGLAATFDNQAVYETFDIVSDEARAKYHDFQRKGERGGYKGLTF 133
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
W PNINI+RDPRWGRG ET GEDP + S + VK QG G G +K
Sbjct: 134 WTPNINIYRDPRWGRGMETYGEDPYLTSLMGLAVVKGLQGN---------GAGKYDK--- 181
Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
AC KH + +W +R+SF++ I+++D +T+ P F++ +
Sbjct: 182 --------------AHACAKHYAVHSGPEW---NRHSFDSKNISQRDLWETYLPAFKTLV 224
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
+GK +MC+YN+ G P C L + R +WG+ + SDC A+ F Y N+ +T
Sbjct: 225 TEGKVKEVMCAYNRFEGEPCCSNKQLLIRILREDWGYDDIVVSDCGAIGD-FYYPNHHET 283
Query: 343 H---EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
H E ++A + +G D+ CG A+ KG + E I+ ++ L + +LG+F+
Sbjct: 284 HPSAEAASADAVVSGTDLECGGS-YSSLNEAVKKGLITEDKINESVFRLLRARFQLGMFD 342
Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
D + ++ V + EH AL+ AR+ +VLL N LPL+K ++ +A++GP
Sbjct: 343 DDTLVS-WSEIPYSVVESKEHVDKALEMARKSMVLLTNKNNSLPLSK-SIRKVAVLGPNA 400
Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSK--THYASGC 496
N+ + Y G P ++L G+ + + + +Y GC
Sbjct: 401 NDSVMLWANYNGFPTKSVTILEGIRSKLPEGAVYYEKGC 439
Score = 107 bits (268), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 94/312 (30%), Positives = 146/312 (46%), Gaps = 57/312 (18%)
Query: 498 DVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQM 547
D+ + +++ A +AD +I V GL E E+ DR ++ LP Q
Sbjct: 582 DIGTKKEIDYNKVAAKAAEADAIIFVGGLSSALEGEEMPVDLPGFKKGDRTNIDLPRVQE 641
Query: 548 SLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDF 607
++ ++ +T K PVI V+ G L + + EA++ + ++L YPG+ G A+A+++FGD+
Sbjct: 642 EMLKALKKTGK-PVIFVVCSGSTLALPW-EAEN-LDAMLEAWYPGQQGGTAVADVLFGDY 698
Query: 608 NPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSY 667
NP GRLP+T+Y D NM R+YR++ G ++ FG+GLSYT + Y
Sbjct: 699 NPAGRLPLTFYASDSDLPDFEDYNMS--------NRTYRYFKGKPLFPFGYGLSYTTFDY 750
Query: 668 KFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVD 727
KA DK + +TG D +T + I + N G +D
Sbjct: 751 G--------------KAKVDKKSI-KTG--------DSMT--------LTIPLKNTGKMD 779
Query: 728 GSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRR-- 785
G VV ++ R P +G P K L F RV A ++ I + P N R
Sbjct: 780 GDEVVQVYLRNPADKEG-PIKMLRAFRRVSLKAGQAENIQIEL-PASTFECFNPATNRME 837
Query: 786 ILPLGNHVLMVG 797
ILP GN+ L+ G
Sbjct: 838 ILP-GNYELLYG 848
>gi|423303577|ref|ZP_17281576.1| hypothetical protein HMPREF1072_00516 [Bacteroides uniformis
CL03T00C23]
gi|423307700|ref|ZP_17285690.1| hypothetical protein HMPREF1073_00440 [Bacteroides uniformis
CL03T12C37]
gi|392687941|gb|EIY81232.1| hypothetical protein HMPREF1072_00516 [Bacteroides uniformis
CL03T00C23]
gi|392689569|gb|EIY82846.1| hypothetical protein HMPREF1073_00440 [Bacteroides uniformis
CL03T12C37]
Length = 942
Score = 255 bits (652), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 209/756 (27%), Positives = 342/756 (45%), Gaps = 129/756 (17%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ S T+FP L ++NR L +G
Sbjct: 153 RLGIPV-DFTNEGIRGVES-----------YRATNFPTQLGLGHTWNRELIRQVGLITGR 200
Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
EAR + G T +AP +++ RD RWGR +E GE P +V+ +E V+ Q +
Sbjct: 201 EARML------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGLQHNH-- 252
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
++A KH AY K + ++
Sbjct: 253 -----------------------------QVAATGKHFAAYSNNKGAREGMARVDPQMSP 283
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF---QKARNEWGFKGYITS 325
++ E+ PF+ I + +M SYN +G+P ++G + + R E GF+GY+ S
Sbjct: 284 REVENIHIYPFKRVIREAGMLGVMSSYNDYDGIP--VQGSYYWLTTRLRGEMGFRGYVVS 341
Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDI 381
D DAV ++ K +++ ++AG+++ C + + + +G + E+ I
Sbjct: 342 DSDAVEYLYTKHGTAKDMKEAVRQSVEAGLNVRCTFRSPDSFVLPLRELVKEGGLSEEVI 401
Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF 441
+ + ++ V+ +GLF+ + G +V E++ +AL A+ + +VLLKN +
Sbjct: 402 NDRVRDILRVKFLIGLFDAPYQTDLAG--ADREVEKEENEAIALQASHESVVLLKNADEL 459
Query: 442 LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCH- 497
LPL+ N+ +A+ GP N Y + ++L G++ ++ Y GC
Sbjct: 460 LPLDINSTKKIAVCGPNANEEGYALTHYGPLAVEVTTVLEGIQEKTKSKAEVLYTKGCDL 519
Query: 498 -----------DVPCNSD--AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPG 544
D P D A +AV A++AD +VV G E++ R SL LPG
Sbjct: 520 VDAHWPESEIIDYPLTDDEQAEIDKAVENARQADVAVVVLGGGQRTCGENKSRTSLDLPG 579
Query: 545 QQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIF 604
+Q+ L+ ++ T K PV+L+L G PL +++A D + +IL YPG G ALA+I+F
Sbjct: 580 RQLQLLQAIQATGK-PVVLILINGRPLSINWA--DKFVPAILEAWYPGSKGGTALADILF 636
Query: 605 GDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQ------VYGFGH 658
GD+NPGG+L +T +P++ ++P N S Q G TG +Y FG+
Sbjct: 637 GDYNPGGKLTVT-FPKTVGQIP---FNFPCKPSSQIDGGKNPGPTGNMSRINGALYPFGY 692
Query: 659 GLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQI 718
GLSYT + Y S+L I+ + I S T V++
Sbjct: 693 GLSYTTFEY------SDLDITPRV--------------------ITPNESAT-----VRL 721
Query: 719 SVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
VTN G G VV L+ R S T EK L GF R+H ++E+SF +D + L +
Sbjct: 722 KVTNTGKRAGDEVVQLYIRDVLSSITTYEKNLAGFQRIHLEPGEAQELSFTID-RKHLEL 780
Query: 779 ANKHGRRILPLGNHVLMVG------ELRHSLTIETY 808
+ + ++ G+ VLM G L +LT+E Y
Sbjct: 781 LDADMKWVVEPGDFVLMAGASSEDIRLNGTLTVEDY 816
>gi|423240769|ref|ZP_17221883.1| hypothetical protein HMPREF1065_02506 [Bacteroides dorei
CL03T12C01]
gi|392643731|gb|EIY37480.1| hypothetical protein HMPREF1065_02506 [Bacteroides dorei
CL03T12C01]
Length = 864
Score = 255 bits (652), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 160/461 (34%), Positives = 233/461 (50%), Gaps = 62/461 (13%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ N++LS RA+ L+ LTL+EK+ + DN+ + RLGI Y WW+E+LHG+A +G
Sbjct: 24 YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSG--- 80
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
T FPQ + AASF I +AV+ EARA A GLT W
Sbjct: 81 -------LATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMW 133
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQ--GENWKSDDGGIGFGFREKRV 223
P +NI+RDPRWGRG ET GEDP + S V VK Q N K D
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCTDANQKYDK------------ 181
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSC 282
+ AC KH + +W +R+ FNA I +D +T+ PF +
Sbjct: 182 ---------------IHACAKHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEAL 223
Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTK 341
+++GK +MC+YN++ G P C L Q R EWG++G + SDC A+ + + + K
Sbjct: 224 VKEGKVKEVMCAYNRLEGDPCCGSDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGH-K 282
Query: 342 TH---EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
TH E ++A + +G D+ CG+ +SA KG + EKDID ++ L + LG
Sbjct: 283 THPNAESASAAAVLSGTDLECGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGEM 341
Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
+ DP K ++ K+ VC++EH L+LD AR+ + LL N LPL + ++A++GP
Sbjct: 342 D-DPSKVEWTKIPYSVVCSAEHDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGPN 399
Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGC 496
N+ G Y G P +LL G+ + + K Y GC
Sbjct: 400 ANDSVMQWGNYNGTPKHTITLLEGIRSAMGENDKLIYEQGC 440
Score = 121 bits (304), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 97/325 (29%), Positives = 147/325 (45%), Gaps = 66/325 (20%)
Query: 491 HYASGCHDVPCNSDAGFHEAVRI------AKKADFVIVVAGLDLTQETEDR--------- 535
Y SG D N D GF E V I K AD VI G+ + E E+
Sbjct: 572 QYFSG--DAQLNFDLGFKEEVNIKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFRK 629
Query: 536 -DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEA 594
DR + LP Q L+ ++ K+ VI V G P ++ +IL YPG++
Sbjct: 630 GDRTDIELPAVQRELIKALCDAGKK-VIFVNFSGSP--IAMEPETKYCQAILQAWYPGQS 686
Query: 595 GAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQV 653
G KA AE++FGD+NP GRLP+T+Y + ++P D NM GR+YR++ G +
Sbjct: 687 GGKAAAEVLFGDYNPAGRLPVTFY-RNIAQLPDFEDYNMT--------GRTYRYFKGDPL 737
Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
+ FG+GLSYT ++Y + + ++K G ++
Sbjct: 738 FPFGYGLSYTTFNY------DNIKLDQTIKVGETAKMV---------------------- 769
Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPC 773
I VTNAG+ DG VV ++ + + ++G P K L F RV A + + + P
Sbjct: 770 ----IPVTNAGNRDGEEVVQVYLKKQEDAEG-PAKTLRAFKRVQIPAGKTVNVELELTP- 823
Query: 774 EQLSIANKHGRRILPL-GNHVLMVG 797
+QL + + + GN +MVG
Sbjct: 824 KQLEWWDAQTNTMRTIAGNFDIMVG 848
>gi|325105782|ref|YP_004275436.1| glycoside hydrolase family protein [Pedobacter saltans DSM 12145]
gi|324974630|gb|ADY53614.1| glycoside hydrolase family 3 domain protein [Pedobacter saltans DSM
12145]
Length = 748
Score = 255 bits (651), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 229/786 (29%), Positives = 369/786 (46%), Gaps = 135/786 (17%)
Query: 61 ISTRAKSLISLLTLQEKIQQL----SDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFN 116
I R L+S +TLQEKI QL SD S P I A S L I + G N
Sbjct: 33 IEQRVNDLMSKMTLQEKIGQLNQFTSDRTSTGP---ISAN---SNKLKDIEAGKVGSMLN 86
Query: 117 GTVSSVT----------------------------SFPQVLVSAASFNRSLWSNIGSAVA 148
S T +FP L AAS++
Sbjct: 87 VRGSKDTRMVQEAAMRSRLKIPLLFGLDVIHGYRVTFPIPLAEAASWDLDAIEFASRVAG 146
Query: 149 VEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENW 207
EA A AG+ + +AP ++I RD RWGR E GED + S A V+ FQG
Sbjct: 147 REAAA------AGIHWTFAPMVDISRDARWGRVMEGAGEDTYLGSMIAKARVRGFQG--- 197
Query: 208 KSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-I 266
KG GE D +M AC KH AY G +N + +
Sbjct: 198 -----------------KGLGEL----DAVM--ACVKHFAAYGAAVAGR----DYNTVDM 230
Query: 267 TEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC----LRGDLFQKARNEWGFKGY 322
+E+ + + PPF++ ++ G A+ M S+N +NG+PA L+ D+ + +WGFKG+
Sbjct: 231 SERHLWEVYLPPFKAALDAG-AATFMNSFNDLNGIPATANTYLQRDIL---KGKWGFKGF 286
Query: 323 ITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGT-CMLRHTQSAIDKGKVQEKDI 381
+ SD +V + + + K +E++A +K G D++ + + + + +V E I
Sbjct: 287 VVSDWGSVREMVPH-GFAKDNEEAAKYAIKGGSDMDMESYAYVNFLEKLVKNKEVSEDLI 345
Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF 441
+ A+ + + + +GLF+ DP + K ++ ++ +KK+AL+ A++ IVLLKN +
Sbjct: 346 NDAVRRILTKKFEMGLFD-DPYRFSDEKREKKELNSAANKKVALEVAQKSIVLLKNQNQV 404
Query: 442 LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPK--SLLRGLEAYVSKT---HYASGC 496
LPL+K+ V S+A++GPL + M G ++ + + SL GLE K +YA GC
Sbjct: 405 LPLSKD-VKSIALVGPLAKSAEDMKGFWSVVWDNDHLVSLHEGLETKQGKKTQINYAKGC 463
Query: 497 HDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVART 556
+ + + +GF +AV +A+K+D V+V G E + R ++ LPG Q L+ + ++
Sbjct: 464 N-INDDDKSGFADAVEVARKSDVVVVAVGEAFDMSGEAKSRANIHLPGVQEELIKELKKS 522
Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
K PV++++ G PL ++ ++ +I++ + G A+A ++FGD+NP G+LPMT
Sbjct: 523 GK-PVVVLIMAGRPLIFNWTADNAD--AIMYTWWLGSEAGHAMANVLFGDYNPSGKLPMT 579
Query: 617 WYPESFTKVPM----NDMNMRADSSRQYPGRS-YRFYTGTQVYGFGHGLSYTNYSYKFLS 671
+P S ++P+ N+ + RS Y + Y FG+GLSYTN++Y
Sbjct: 580 -FPRSEGQLPIYYNYNNTGRPVINENNIHYRSAYIDSPNSPKYAFGYGLSYTNFAY---- 634
Query: 672 APSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHV 731
S LT+S+ L Q GS V+ ++ N G G V
Sbjct: 635 --SNLTLSSK--------SLNQDGS-----------------ITVKFTLKNDGKYAGEEV 667
Query: 732 VMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGN 791
V L+ R S P K+L F ++ SKE++F ++ E+LS NK+ + G+
Sbjct: 668 VQLYLRDKVASLVRPVKELKDFTKILLKPGESKEVTFVINK-EKLSFYNKNLDWVAEPGD 726
Query: 792 HVLMVG 797
LM+G
Sbjct: 727 FDLMIG 732
>gi|375254464|ref|YP_005013631.1| glycosyl hydrolase family 3, C-terminal domain-containing protein
[Tannerella forsythia ATCC 43037]
gi|363407375|gb|AEW21061.1| glycosyl hydrolase family 3, C-terminal domain protein [Tannerella
forsythia ATCC 43037]
Length = 775
Score = 255 bits (651), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 219/760 (28%), Positives = 335/760 (44%), Gaps = 143/760 (18%)
Query: 76 EKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASF 135
E + L A RLGIP + + E +HG + G T FP + A+++
Sbjct: 107 EALNALQKYAMENTRLGIPIF-FAEECMHGHMAIG-----------TTVFPTSIGQASTW 154
Query: 136 NRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYA 195
NR+L +G+A+A E R+ Q + P +++ R+PRW R +ET GEDP++
Sbjct: 155 NRTLIEKMGAAIAHETRS-----QGAHIAYGPVLDLAREPRWSRVEETFGEDPVLSGILG 209
Query: 196 VEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG 255
FV+ QG+ D D + KHL AY + G
Sbjct: 210 SAFVRGLQGK--------------------------DFADGRHTYSTLKHLAAYGIPVGG 243
Query: 256 NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-R 314
+ R A I ++ PF ++ G A +M SYN V+GVP + +K R
Sbjct: 244 HNGR---QAQIGARELIAEHLLPFEMAVKAG-AQSVMTSYNAVDGVPCTSNTYILKKILR 299
Query: 315 NEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG-TCMLRHTQSAIDK 373
EW F G++ SD ++ I + +AA L AG++++ G R+ + A
Sbjct: 300 GEWDFNGFVVSDLGSIEGIATTHRVAPDIKHAAAMALNAGVEMDLGGVAYTRNMEQAHTD 359
Query: 374 GKVQEKDIDRALLNLFSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQ 430
+ +ID A+ + ++ +GLF P + + + + EH +LA A +
Sbjct: 360 SLISMSEIDDAVSRILRLKFEMGLFESPYVQPSRTT------EIIRSKEHNRLARKVAEE 413
Query: 431 GIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPK--SLLRGLEAYVS 488
IVLLKN+ LPL+KN + S+A+IGP +N+ G YT ++L G+ VS
Sbjct: 414 SIVLLKNNANLLPLSKN-IGSIAVIGPNADNLYNQLGDYTAPQPEEHIVTILEGIRNAVS 472
Query: 489 KT---HYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG-------------------- 525
T Y GC V + + EAVR A ++ V++V G
Sbjct: 473 PTTVIRYVKGCA-VRDTTQSNIDEAVRAANASNAVVLVVGGSSARDFHTKYIETGAATVS 531
Query: 526 ------LDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
+ + E DR SL L G Q L+ S+A T K P+I+V G PL+++ AD
Sbjct: 532 SRENELIPDMESGEGYDRKSLTLLGHQEKLIESIAATGK-PLIMVYIQGRPLNMNL--AD 588
Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQ 639
+ S++L YPGE G A+A +IFGD NP GRLP++ P S ++P+
Sbjct: 589 KKASALLTAWYPGEEGGNAVANVIFGDVNPSGRLPIS-VPRSTGQLPV----------YY 637
Query: 640 YPGRSYRFYTGTQ--VYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSR 697
G+S + GT +Y FG+GLSYT + Y LTIS + G + +
Sbjct: 638 SLGKSNDYVEGTSTPLYAFGYGLSYTAFEY------GNLTIS---REGGNITV------- 681
Query: 698 LDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVH 757
SCT VTN G+ DG VV L+ R S P L F ++
Sbjct: 682 ----------SCT---------VTNTGNTDGDEVVQLYLRDHVASVSVPPVLLKDFAKIS 722
Query: 758 TVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
S ++F + P EQL+ N +R++ G +M+G
Sbjct: 723 LKKGESARVNFVLTP-EQLAFFNTDLKRVVEPGEFTVMIG 761
>gi|393781488|ref|ZP_10369683.1| hypothetical protein HMPREF1071_00551 [Bacteroides salyersiae
CL02T12C01]
gi|392676551|gb|EIY69983.1| hypothetical protein HMPREF1071_00551 [Bacteroides salyersiae
CL02T12C01]
Length = 850
Score = 255 bits (651), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 162/447 (36%), Positives = 228/447 (51%), Gaps = 62/447 (13%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
P+ N L+ RA L+ LT++EKI + +N+ IPRLGI YEWW+E+LHG+A G
Sbjct: 15 PYQNPDLTPEQRATDLLQRLTVEEKISLMQNNSPGIPRLGIRPYEWWNEALHGVARAG-- 72
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEA----RAMYNLGQ----AGLTF 164
T FPQ + AASFN SL + +AV+ EA RA + GQ GLT
Sbjct: 73 --------LATVFPQTIGMAASFNDSLVQKVFTAVSDEARAKNRAFNDQGQYKRYQGLTM 124
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
W PN+NIFRDPRWGRGQET GEDP + S V VK Q
Sbjct: 125 WTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVAVVKGLQ--------------------- 163
Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
G +S R D+ L AC KH + +W +R+SFNA I +D +T+ P F++ +
Sbjct: 164 ---GPDSARYDK--LHACAKHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKTLV 215
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
++ +MC+YN+ G P C L Q R+EWGF G + SDC A++ + + + T
Sbjct: 216 QEADVKEVMCAYNRFEGDPCCGSNRLLTQILRDEWGFNGIVVSDCGAISDFWGAKKHN-T 274
Query: 343 HEDSA---AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG-LF 398
H D+A A + +G D+ CG+ R A+ G + E+ ID ++ L + LG +
Sbjct: 275 HPDAAHASADAVLSGTDLECGSN-YRKLTDAVKAGIISEEQIDISVKRLLKARFELGEME 333
Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
P Y V EH+ LAL A + + LL+N + LPL+K+A +A+IGP
Sbjct: 334 ESHPWALPYSI-----VDCPEHRHLALQIAHETMTLLQNKENILPLDKHA--KVAVIGPN 386
Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEA 485
N+ G Y G P +LL L +
Sbjct: 387 ANDSVMQWGNYNGTPSHTSTLLSALRS 413
Score = 97.1 bits (240), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 75/278 (26%), Positives = 126/278 (45%), Gaps = 60/278 (21%)
Query: 496 CHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQ 545
D+P N +A + K + VI G+ E E+ DR + LP
Sbjct: 570 VEDIPLNMNATLEKL----KDTEIVIFAGGISPLLEGEEMKVSAAGFKGGDRTDIELPAV 625
Query: 546 QMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFG 605
Q +++ ++ + K+ + + +G ++ +IL YPG+ G A+A+++FG
Sbjct: 626 QRNVLAALKKAGKKVIFVNFSGSA---MALTPETENCDAILQAWYPGQEGGTAVADVLFG 682
Query: 606 DFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTN 664
D+NP GRLP+T+Y ++ ++P D +M+ GR+YR+ ++ FG+GLSYT
Sbjct: 683 DYNPAGRLPVTFY-KNMEQLPDFEDYSMQ--------GRTYRYMKEAPLFPFGYGLSYTT 733
Query: 665 YSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAG 724
++Y KA +DK + TG ++ + I V+N G
Sbjct: 734 FTYG--------------KARADKKRI-STGEKMT----------------LTIPVSNIG 762
Query: 725 DVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG 762
DG VV ++ R +G P K L F RV + KG
Sbjct: 763 SRDGEEVVQVYLRREDDPEG-PTKTLRAFKRVE-ITKG 798
>gi|393781363|ref|ZP_10369562.1| hypothetical protein HMPREF1071_00430 [Bacteroides salyersiae
CL02T12C01]
gi|392676856|gb|EIY70278.1| hypothetical protein HMPREF1071_00430 [Bacteroides salyersiae
CL02T12C01]
Length = 863
Score = 255 bits (651), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 159/459 (34%), Positives = 233/459 (50%), Gaps = 58/459 (12%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
PF N+ L + RA+ L+ LTLQEK+ + D +S IPRLGI Y WW+E+LHG+
Sbjct: 23 PFNNSDLPVEERAQDLLQRLTLQEKVLLMCDYSSPIPRLGIKRYNWWNEALHGV------ 76
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ--------AGLTF 164
G T FPQ + AA+F+ V+ EARA Y+ + GLTF
Sbjct: 77 ----GRAGLATVFPQAIGMAATFDDCAVRQAFECVSDEARAKYHHSENKEGSERYQGLTF 132
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
W PN+NIFRDPRWGRGQET GEDP + S + V+ QG + D
Sbjct: 133 WTPNVNIFRDPRWGRGQETYGEDPYLTSQMGLAVVRGLQGPSESKYD------------- 179
Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNA-MITEQDTEDTFQPPFRSCI 283
L AC KH + +W +R+SF+ I+ +D +T+ P F++ +
Sbjct: 180 -------------KLHACAKHYALHSGPEW---NRHSFDVDSISPRDLWETYLPAFKALV 223
Query: 284 EQGKASCIMCSYNQVNGVPACLRGD-LFQKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
+QG +MC+YN+ G P C L+ R EWGF G + SDC A++ F + + +T
Sbjct: 224 QQGGVKEVMCAYNRFEGEPCCGSNRLLYNILREEWGFDGLVVSDCGAISD-FYLKGHHET 282
Query: 343 H---EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
H E + A +KAG D++CG Q A+++G + EK ID +L L + LGL +
Sbjct: 283 HPTKEAAVAAAVKAGTDLDCGVDYYA-LQKAVEEGIITEKQIDVSLFRLLKARFELGLMD 341
Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
+ + + V + +H++ AL+ AR+ + LLKND LPL+K+ +A+IGP
Sbjct: 342 -EEHLVSWSDIPYTVVDSEKHREKALEMARKSMTLLKNDHGTLPLSKHC-GKIAVIGPNA 399
Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGC 496
N+ M G Y G P ++L G+ + Y GC
Sbjct: 400 NDSVMMWGNYNGFPSHTVTILEGITHKLGAEQIIYDKGC 438
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 79/269 (29%), Positives = 129/269 (47%), Gaps = 54/269 (20%)
Query: 510 AVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKR 559
A R+ A+ ++ V G+ E E+ DR + LP Q L+ + +T K
Sbjct: 594 AARVGD-AEVIVFVGGISPKVEGEELPVSFPGFKGGDRTVIELPQVQRDLLQELHKTGK- 651
Query: 560 PVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYP 619
PVIL+L G + +S AE D +I+ Y G+AG A+A+++FGD+NP GRLP+T+Y
Sbjct: 652 PVILILCSGSAIGLS-AEVDLA-DAIIQAWYLGQAGGTAVADVLFGDYNPAGRLPVTFYK 709
Query: 620 ESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS 679
+ D +M+ GR+YR++ G ++ FG+GLSYT++
Sbjct: 710 ATEQLPDFEDYSMQ--------GRTYRYFEGEALFPFGYGLSYTSF-------------- 747
Query: 680 ASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVP 739
+ +RL I E S + ++++V N G +DG V+ ++ R
Sbjct: 748 ------------EIGKARLSKKRIRENESVS-----LKLTVENTGKLDGDEVIQIYIRKL 790
Query: 740 KVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
+ +G P K L F R H A K+++F
Sbjct: 791 QDKEG-PLKTLRAFKRFHLRAGEKKDVTF 818
>gi|253574420|ref|ZP_04851761.1| glycoside hydrolase [Paenibacillus sp. oral taxon 786 str. D14]
gi|251846125|gb|EES74132.1| glycoside hydrolase [Paenibacillus sp. oral taxon 786 str. D14]
Length = 782
Score = 255 bits (651), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 216/759 (28%), Positives = 345/759 (45%), Gaps = 133/759 (17%)
Query: 76 EKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASF 135
E + + A RLGIP E HG + G T FP L +++
Sbjct: 106 EAVNAIQRYAIENSRLGIPIL-IGEECSHGHMAIG-----------ATVFPVPLSLGSTW 153
Query: 136 NRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYA 195
N L+ + AVA E RA Q G ++P +++ RDPRWGR +E GED ++S A
Sbjct: 154 NVELYREMCRAVARETRA-----QGGAVTYSPVLDVVRDPRWGRTEECFGEDAYLISEMA 208
Query: 196 VEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG 255
V V+ QGE+ +D ++A KH + Y + G
Sbjct: 209 VASVEGLQGESLDGEDS--------------------------VAATLKHFVGYGSSEGG 242
Query: 256 NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-R 314
R + + ++ + PFR +E G AS IM +YN+++GVP +L R
Sbjct: 243 ---RNAGPVHMGRRELLEVDLLPFRKAVEAGAAS-IMPAYNEIDGVPCTTNEELLDGVLR 298
Query: 315 NEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSAIDK 373
EWGF G + +DC A+ + + + D+A ++AG+D+ M +H A+
Sbjct: 299 GEWGFDGMVITDCGAIDMLASGHDVAEDGRDAAIQAIRAGIDMEMSGVMFGKHLVEAVRS 358
Query: 374 GKVQEKDIDRALLNLFSVQLRLGLFN---GDPRKGKYGKLGPDDVCTSEHKKLALDAARQ 430
G+++E+ +DRA+ + +++ RLGLF DP + + + ++EH +LA A +
Sbjct: 359 GQLEEEVLDRAVRRVLTLKFRLGLFERPYADPERAER------VIGSAEHVELARQLASE 412
Query: 431 GIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPK--SLLRGLEAYVS 488
G+VLLKN LPL+ +A ++A+IGP + G YT K ++L G+ + ++
Sbjct: 413 GVVLLKNKDGVLPLSADA-GTIAVIGPNADAGYNQLGDYTSPQPRSKVTTVLGGIRSKLA 471
Query: 489 KTH----YASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG-----------LDLTQET- 532
+T YA GC + NS GF A+ A+KAD V++V G +DL
Sbjct: 472 ETPERVLYAPGCR-INGNSREGFDVALSCAEKADTVVMVVGGSSARDFGEGTIDLRTGAS 530
Query: 533 -------------EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
E DR++L L G Q+ L+ + + K P+++V G P+ + D
Sbjct: 531 KVTDNAESDMDCGEGIDRMNLSLSGVQLELIQEIHKLGK-PLVVVYINGRPIAEPWI--D 587
Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQ 639
+IL YPG+ G A+A+I+FGD NP GRL ++ P+ +VP+ R+ R
Sbjct: 588 EHADAILEAWYPGQEGGHAIADILFGDVNPSGRLTIS-IPKHVGQVPVYYHGKRSRGKRY 646
Query: 640 YPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLD 699
G S Y FG+GLSYT ++Y L S+ TI+ K GS K
Sbjct: 647 LEGDSQPRYP------FGYGLSYTEFTYNNLKLESD-TIN---KDGSTK----------- 685
Query: 700 YVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTV 759
V + VTN G+ G+ V+ L+ P K+L GF ++
Sbjct: 686 ----------------VTVEVTNVGERAGAEVIQLYITDVASKVTRPAKELKGFRKIFLQ 729
Query: 760 AKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
++ + F V P EQL ++ + ++ G + VG+
Sbjct: 730 PGETQTVEFTVGP-EQLQYIGQNYKPVVEPGEFRVHVGK 767
>gi|336412865|ref|ZP_08593218.1| hypothetical protein HMPREF1017_00326 [Bacteroides ovatus
3_8_47FAA]
gi|335942911|gb|EGN04753.1| hypothetical protein HMPREF1017_00326 [Bacteroides ovatus
3_8_47FAA]
Length = 800
Score = 254 bits (650), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 211/735 (28%), Positives = 342/735 (46%), Gaps = 127/735 (17%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ + T FP A++N+ L I A
Sbjct: 151 RLGIPV-DFTNEGIRGLCHD-----------RATMFPAQCGQGATWNKKLIREIAKVTAN 198
Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
EA+A+ G T ++P ++I +DPRWGR E+ GEDP +V + + Q E
Sbjct: 199 EAKAL------GYTNIYSPILDIAQDPRWGRVVESYGEDPYLVGELGKQMILGLQNEG-- 250
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
+ A KH Y + G + +
Sbjct: 251 ------------------------------IVATPKHFAVYSIPVGGRDGGTRTDPHVAP 280
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDC 327
++ + + PFR I++ A +M SYN +G P + R +WGFKGY+ SD
Sbjct: 281 REMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYVVSDS 340
Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTC----MLRHTQSAIDKGKVQEKDIDR 383
+AV + T T E+ AA V+ AG++I + + AID+GKV +++
Sbjct: 341 EAVEFLHTKHRITPTEEEMAAQVVNAGLNIRTNFTPPQDFILPLRRAIDEGKVSLHTLNQ 400
Query: 384 ALLNLFSVQLRLGLFN----GDPRKGKYGKLGPDDVCTSE-HKKLALDAARQGIVLLKND 438
+ + V+ +GLF+ GD R+ P+ V ++ HK +++ AA + IVLLKN+
Sbjct: 401 RVSEILRVKFMMGLFDNPYPGDDRR-------PETVVHNDAHKAVSMKAALESIVLLKNE 453
Query: 439 KKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGC 496
+ LPL+KN +A+IGP + ++ Y S K++ +G++ Y+ S+ YA GC
Sbjct: 454 NQMLPLSKN-FKKIAVIGPNGEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVRYAKGC 512
Query: 497 ------------HDVPCNS--DAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLL 542
++VP ++ A HEAV +AK +D I+V G + E+ R +L L
Sbjct: 513 DIIDKYFPESELYNVPLDTQEQAMIHEAVELAKASDIAILVLGGNEKTVREEFSRTNLDL 572
Query: 543 PGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEI 602
G+Q L+ +V T K PV+LV+ G +++ A+ + +I+ +PGE A+A++
Sbjct: 573 CGRQQQLLEAVYATGK-PVVLVMVDGRAATINW--ANKYVPAIIHAWFPGEFMGDAIAKV 629
Query: 603 IFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
+FGD+NPGGRL +T +P+S ++P + DS + R +Y FG+GLSY
Sbjct: 630 LFGDYNPGGRLAVT-FPKSVGQIPFA-FPFKPDSDSKGKVR-----VDGVLYPFGYGLSY 682
Query: 663 TNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTN 722
T + Y S+L IS + G +NI SCT V N
Sbjct: 683 TIFGY------SDLKISKPV-IGPQENI---------------TLSCT---------VKN 711
Query: 723 AGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
G G VV L+ R S T +K L GF+R+H + +SF + P + L + +K+
Sbjct: 712 TGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVSFTLTP-QDLGLWDKN 770
Query: 783 GRRILPLGNHVLMVG 797
+ + G+ +MVG
Sbjct: 771 NQFTVEPGSFSVMVG 785
>gi|365122193|ref|ZP_09339098.1| hypothetical protein HMPREF1033_02444 [Tannerella sp.
6_1_58FAA_CT1]
gi|363642907|gb|EHL82241.1| hypothetical protein HMPREF1033_02444 [Tannerella sp.
6_1_58FAA_CT1]
Length = 853
Score = 254 bits (650), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 161/437 (36%), Positives = 226/437 (51%), Gaps = 57/437 (13%)
Query: 64 RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
R L+S LT++EKI L + IPRLGI Y +E+LHG+ G NF T
Sbjct: 40 RIMDLLSRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPG---NF-------T 89
Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG----------LTFWAPNINIFR 173
FPQ + A+ +N L I +A++ EAR +N G LTFW+P +N+ R
Sbjct: 90 VFPQAIGLASMWNPELLYEISTAISDEARGRWNELNRGKDQKGFFSDLLTFWSPTVNMAR 149
Query: 174 DPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDR 233
DPRWGR ET GEDP + V FVK QG + R LK
Sbjct: 150 DPRWGRTPETYGEDPFLSGKLGVAFVKGLQG--------------NDPRYLK-------- 187
Query: 234 GDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMC 293
+ + KH A + E +R+ N I+E++ + + P F SCI++GKA IM
Sbjct: 188 -----IVSTPKHFAANNEEH----NRFECNPHISERNLREYYLPAFESCIKEGKAQSIMS 238
Query: 294 SYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLK 352
+YN +N VP L L Q R EWGF GY+ SDC + + Y KT E +A +K
Sbjct: 239 AYNAINDVPCTLNPWLLTQVLRKEWGFNGYVVSDCGGPGFLVTHHKYVKTPEAAATLSIK 298
Query: 353 AGMDINCG-TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLG 411
AG+D+ CG + +A + V + DID A + ++ LGLF+ DP K Y +
Sbjct: 299 AGLDLECGDNVYIEPLMNAYKQCMVTDADIDTAAYRILRARMMLGLFD-DPEKNPYNAIS 357
Query: 412 PDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTG 471
P V +H++LAL+AARQ +VLLKN+K FLPLN V S+A++G +N + G Y+G
Sbjct: 358 PSIVGCEKHRQLALEAARQSLVLLKNEKNFLPLNPKKVKSIAVVG--INAGNCEFGDYSG 415
Query: 472 IPC-SPKSLLRGLEAYV 487
P +P S+L G++ V
Sbjct: 416 TPVNAPVSVLEGIKNRV 432
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 89/292 (30%), Positives = 140/292 (47%), Gaps = 47/292 (16%)
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
+ EA R ++ D + V G++ + E E +DR ++ LP Q + + + V+ VL
Sbjct: 594 YGEAGRAIRECDVTVAVLGINKSIEREGQDRYTIELPADQQLFIKEAYKANPNTVV-VLV 652
Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
G L +++ D I +IL YPGE G A+AE +FGD+NPGGRLP+T+Y S ++P
Sbjct: 653 AGSSLAINWI--DENIPAILNAWYPGEQGGTAVAEALFGDYNPGGRLPLTYY-RSLDELP 709
Query: 627 -MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAG 685
+D +++ GR+Y ++ +Y FG+GLSYT + YK L SE++ A
Sbjct: 710 AFDDYDIQK-------GRTYMYFENKPLYPFGYGLSYTRFDYKNLK--SEVSDDA----- 755
Query: 686 SDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT 745
+++ +V N G G V ++ R P+
Sbjct: 756 ----------------------------VNLKFTVKNTGKYAGDEVAQVYVRFPESGIKV 787
Query: 746 PEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
P KQL GF+RVH S ++S + E K G+ P GN++ MVG
Sbjct: 788 PLKQLKGFERVHIGKGKSAQVSVSIPKKELRLWDEKDGKFYTPSGNYIFMVG 839
>gi|218258058|ref|ZP_03474485.1| hypothetical protein PRABACTJOHN_00138 [Parabacteroides johnsonii
DSM 18315]
gi|218225777|gb|EEC98427.1| hypothetical protein PRABACTJOHN_00138 [Parabacteroides johnsonii
DSM 18315]
Length = 955
Score = 254 bits (650), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 204/734 (27%), Positives = 335/734 (45%), Gaps = 113/734 (15%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ S T+FP L ++NR L +G
Sbjct: 161 RLGIPT-DFTNEGIRGVES-----------YIATNFPTQLGLGHTWNRDLVHKVGYITGR 208
Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
E R + G T +AP +++ RD RWGR +E GE P +V+ VE K Q
Sbjct: 209 EGRLL------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGVEMAKGMQ----- 257
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
+ ++A KH IAY K G + ++
Sbjct: 258 --------------------------TDYQVAATSKHYIAYSNNKGGREGMARVDPQMSP 291
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVP-ACLRGDLFQKARNEWGFKGYITSDC 327
++ E P++ I++ +M SYN +G P L + R E+GF+GY+ SD
Sbjct: 292 REVEMLHVYPWKRVIKEAGILGVMSSYNDYDGFPIQSSYYWLTTRLRGEFGFRGYVVSDS 351
Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDIDR 383
DAV +F ++S + AG++I C + + I +G + ID
Sbjct: 352 DAVEYLFSKHGTAADMKESVLQSVLAGLNIRCTFRSPDSYVLPLRELIAEGALPMSTIDD 411
Query: 384 ALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLP 443
+ ++ V+ +GLF+ P + K +V ++E++++AL A+++ +VLLKN LP
Sbjct: 412 RVRDILRVKFLVGLFD-QPYQIDL-KQADKEVNSAENQQVALQASKESLVLLKNQDAVLP 469
Query: 444 LNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCHDVP 500
L+ N +S +A+ GP + + Y + ++L G++ V ++ + GC V
Sbjct: 470 LDVNKISKIAVCGPNADEEAYALTHYGPLAVEVTTVLEGIQNKVKPGTEVLFTKGCDLVD 529
Query: 501 CN--------------SDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQ 546
N + ++AV AKK+D +VV G E++ R SL LPG+Q
Sbjct: 530 ANWPESELIRYPLTSEEQSEINKAVENAKKSDVAVVVLGGSNRTCGENKSRSSLELPGRQ 589
Query: 547 MSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD 606
+ L+ +V T K PV+LVL G P+ +++A D + +IL YPG G A+A+ +FGD
Sbjct: 590 LDLLQAVVATGK-PVVLVLINGRPISINWA--DKYVPAILEAWYPGSQGGTAIADALFGD 646
Query: 607 FNPGGRLPMTWYPESFTKVPMN---DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYT 663
+NPGG+L +T +P++ ++P N N + D R +Y FG+GLSYT
Sbjct: 647 YNPGGKLTVT-FPKTVGQIPFNFPTKPNAQVDGGRNKGLDGNMSRVNGPLYPFGYGLSYT 705
Query: 664 NYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNA 723
+ Y S+++I ++ + +V T V+ VTN
Sbjct: 706 TFEY------SDISIQPAI--------------------VTQVQPVT-----VRCKVTNT 734
Query: 724 GDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHG 783
G G VV L+ R S T EK L+GFDR+H +KE++F ++P L + N
Sbjct: 735 GKRAGDEVVQLYVRDILSSVTTYEKNLVGFDRIHLNPGETKELTFTIEP-RDLQLLNSDN 793
Query: 784 RRILPLGNHVLMVG 797
++ G+ +MVG
Sbjct: 794 HWVVEPGDFKVMVG 807
>gi|317474222|ref|ZP_07933498.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
gi|316909532|gb|EFV31210.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
Length = 954
Score = 254 bits (650), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 212/764 (27%), Positives = 353/764 (46%), Gaps = 120/764 (15%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDN--ASAIPRLGIPAYEWWSESLHGIASNGP 111
+ + L ++ R +SL+S++T ++K++ + + IP L +P E++HG +
Sbjct: 170 YMDPRLPVNGRVESLLSVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSYG-- 226
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
S T FPQ L A++N++L + A+ E L + W+P +++
Sbjct: 227 --------SGATIFPQALAMGATWNKNLTEKVAMAIGEET-----LAAGTMQAWSPVLDV 273
Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
+D RWGR +ET GEDP++VS ++K +Q +
Sbjct: 274 AQDARWGRCEETFGEDPVLVSQIGGAWIKGYQSKG------------------------- 308
Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
L KH + G R S + ++E++ + PFR I +
Sbjct: 309 -------LFTTPKHFGGHGAPLGG---RDSHDIGLSEREMREIHLVPFRHVIRNYACQSL 358
Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG- 349
M +Y+ GVP +L + R EWGF G++ SDC A+ + ++YT ++ AA
Sbjct: 359 MMAYSDFLGVPVAKSKELLRNILREEWGFDGFVVSDCGAIGNLTSRKHYTAKNKIEAANQ 418
Query: 350 VLKAGMDINCG-TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG-KY 407
L AG+ NCG T + A G++ ++D + + R LF + ++ +
Sbjct: 419 ALAAGIATNCGDTYNDKEVIQAAKDGRLDMANLDNVCRTMLRMMFRNELFEKEHKEPLDW 478
Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
K+ P + HK++A +AR+ IV+L+N LPL+K+ V ++A++GP +N+ G
Sbjct: 479 NKIYPG-WNSDSHKEIARQSARESIVMLENKDDLLPLSKH-VRTIAVLGPGADNLQP--G 534
Query: 468 GYTGI--PCSPKSLLRGLEAYV---SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIV 522
YT P KS+L G++ V +K Y GC N + AV++A ++D V++
Sbjct: 535 DYTPKLRPGQLKSVLTGIKQAVGNQTKILYEQGCEFTGSNGE-NIPNAVKVAAQSDVVVL 593
Query: 523 VAGLDLTQET---------EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
V G T E E+ D +L+LPG+Q L+ +V T K PV+L+L G P ++
Sbjct: 594 VLGDCSTSEATTDVYKTSGENHDYATLILPGRQQELLEAVCATGK-PVVLLLQIGRPYNL 652
Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMR 633
+ + + W+ PG+ G A A+++FGD+NP GRLPMT +P ++P+ N +
Sbjct: 653 TKESELCKAIIVNWL--PGQEGGLATADVLFGDYNPAGRLPMT-FPRHVGQLPLY-YNFK 708
Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
S R+Y +Y +Y FG+GLSYT++ Y S+L + + NI+
Sbjct: 709 T-SGRRYEYLDLEYYP---LYYFGYGLSYTSFEY------SDLKVEER----ENGNII-- 752
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
++V N G G VV L+ S T +L F
Sbjct: 753 ----------------------AHVNVKNIGYRAGDEVVQLYVTDMYASVKTRITELKDF 790
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
RVH SK ISF + P E LS+ N + R++ G ++VG
Sbjct: 791 TRVHLRPGESKSISFELTPYE-LSLLNDNMDRVVEKGTFKILVG 833
>gi|380696428|ref|ZP_09861287.1| beta-glucosidase [Bacteroides faecis MAJ27]
Length = 851
Score = 254 bits (649), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 158/447 (35%), Positives = 228/447 (51%), Gaps = 57/447 (12%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ N + + R LIS LT++EKI L + IPRLGI Y +E+LHG+ G
Sbjct: 28 YKNENAPVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPG--- 84
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG----------LT 163
T FPQ + AA++N L + + ++ EARA +N G LT
Sbjct: 85 -------RFTVFPQAIGLAATWNPELQRRVATVISDEARARWNELDQGRAQKEQFSDVLT 137
Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
FW+P +N+ RDPRWGR ET GEDP + FVK QG
Sbjct: 138 FWSPTVNMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQG------------------- 178
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
D L + + KH A + E +R+ N I+E+ + + P F C+
Sbjct: 179 --------DDPHYLKIVSTPKHFAANNEEH----NRFVCNPQISEKQLREYYFPAFEMCV 226
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
++GKA+ IM +YN +N VP L L QK R +WGF+GY+ SDC A + Y KT
Sbjct: 227 KEGKAASIMSAYNALNDVPCTLNAWLLQKVLRKDWGFQGYVVSDCGGPALLVNAHKYLKT 286
Query: 343 HEDSAAGVLKAGMDINCGTCMLR-HTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
E +A LKAG+D+ CG + +A + V + DID A ++ + +++LGLF+G
Sbjct: 287 KEAAATLSLKAGLDLECGDDVYDGPLLNAYKQYMVSDADIDSAAYHVLTARMKLGLFDGV 346
Query: 402 PRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
R Y K+ P + + EH+++ALDAARQ IVLLKN K LPLN + + S+A++G +N
Sbjct: 347 ER-NPYTKISPSVIGSKEHQQIALDAARQCIVLLKNQKNMLPLNASKLKSIAVVG--INA 403
Query: 462 ISQMGGGYTGIP-CSPKSLLRGLEAYV 487
G Y+G P P S+L+G+ V
Sbjct: 404 GKCEFGDYSGAPVVEPVSILQGIRNRV 430
Score = 125 bits (315), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 82/302 (27%), Positives = 143/302 (47%), Gaps = 47/302 (15%)
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
+ EA + ++ + VI V G++ + E E +DR + LP Q + + + + +I++L
Sbjct: 592 YGEAGKAVRECETVIAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNSN-MIVILV 650
Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
G L +++ D + +I+ YPGE G A+AE++FGD+NP GRLP+T+Y P
Sbjct: 651 AGSSLAINWM--DEHVPAIVNAWYPGEQGGTAVAEVLFGDYNPAGRLPLTYYKSLDELPP 708
Query: 627 MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGS 686
+D ++ GR+Y+++ G +Y FG+GLSY+++ Y L E
Sbjct: 709 FDDYDITK-------GRTYKYFKGEVLYPFGYGLSYSSFKYSDLRVKDE----------- 750
Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
DEV V + N G +G V ++ R+P+ P
Sbjct: 751 ----------------ADEVA--------VSFRLKNTGKRNGDEVTQVYVRIPETGGIVP 786
Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIAN-KHGRRILPLGNHVLMVGELRHSLTI 805
K+L GF RV + S+ + ++ EQL + G+ ++P G +MVG + +
Sbjct: 787 VKELKGFRRVPLKSGESRRVEIRLNK-EQLRYWDVGKGQFVVPKGTFDIMVGASSKDIRL 845
Query: 806 ET 807
+T
Sbjct: 846 QT 847
>gi|393786911|ref|ZP_10375043.1| hypothetical protein HMPREF1068_01323 [Bacteroides nordii
CL02T12C05]
gi|392658146|gb|EIY51776.1| hypothetical protein HMPREF1068_01323 [Bacteroides nordii
CL02T12C05]
Length = 863
Score = 254 bits (649), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 153/459 (33%), Positives = 230/459 (50%), Gaps = 58/459 (12%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
PF N L + R + L+ LTL EK+ + D +S++PRLGI Y WW+E+LHG+
Sbjct: 23 PFNNPDLPVEERVEDLVRRLTLHEKVLLMCDYSSSVPRLGIKQYNWWNEALHGV------ 76
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ--------AGLTF 164
G T FPQ + AA+F+ + V+ EARA Y+ + GLTF
Sbjct: 77 ----GRAGLATVFPQAIGMAATFDDCAVKQVFECVSDEARAKYHHSENKDGSERYRGLTF 132
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
W PN+NIFRDPRWGRGQET GEDP + S + V+ QG + D
Sbjct: 133 WTPNVNIFRDPRWGRGQETYGEDPYLTSRMGLAVVRGLQGPSESKYD------------- 179
Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
L AC KH + +W +R+ F+ I+ +D +T+ P F++ +
Sbjct: 180 -------------KLHACAKHYALHSGPEW---NRHRFDVENISPRDLWETYLPAFKALV 223
Query: 284 EQGKASCIMCSYNQVNGVPACLRGD-LFQKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
+QG +MC+YN+ G P C L+ R EWGF G + SDC A++ F + + +T
Sbjct: 224 QQGGVKEVMCAYNRFEGEPCCGSNRLLYNILREEWGFDGLVVSDCGAISD-FYLKGHHET 282
Query: 343 H---EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
H E + A +KAG D++CG + + A++KG + EK ID +L L + LGL +
Sbjct: 283 HSTKESAVAAAVKAGTDLDCGV-DYQSLEKAVEKGIITEKQIDVSLSRLLKARFELGLMD 341
Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
+ + + V + +H+ AL+ AR+ + LLKN LPL+K+ + +IGP
Sbjct: 342 -EEHLVSWSDIPYTVVDSEKHRAKALEVARKSMTLLKNKNGTLPLSKHC-GKIVVIGPNA 399
Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVS--KTHYASGC 496
N+ M G Y G P ++L G+ + + Y GC
Sbjct: 400 NDSIMMWGNYNGFPSHTVTILEGITHKLDAGQVIYDKGC 438
Score = 107 bits (268), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 82/270 (30%), Positives = 126/270 (46%), Gaps = 53/270 (19%)
Query: 508 HEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTS 557
+E A+ ++ V G+ E E+ DR + LP Q L+ + +T
Sbjct: 591 NEIAATVSDAEAIVFVGGISPKVEGEELPVSFPGFKGGDRTVIELPQVQRDLLQELYKTG 650
Query: 558 KRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTW 617
K P+IL+L G + +S AE D +I+ YPG+AG A+A+++FGD+NP GRLP+T+
Sbjct: 651 K-PIILILCSGSAIGLS-AEVDLA-DAIIQAWYPGQAGGTAVADVLFGDYNPAGRLPVTF 707
Query: 618 YPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELT 677
Y + D NM+ GR+YR++ G ++ FG+GLSYT+ F ++L
Sbjct: 708 YKTTEQLPDFEDYNMQ--------GRTYRYFKGEALFPFGYGLSYTS----FEIGKAQL- 754
Query: 678 ISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFAR 737
S K I LD + + N G+ DG V+ ++ R
Sbjct: 755 --------SKKRIHANESVNLD------------------LWIKNTGERDGEEVIQVYIR 788
Query: 738 VPKVSQGTPEKQLIGFDRVHTVAKGSKEIS 767
K +G P K L F RVH + K+IS
Sbjct: 789 KLKDKEG-PLKTLRAFKRVHVKSGEKKQIS 817
>gi|423293673|ref|ZP_17271800.1| hypothetical protein HMPREF1070_00465 [Bacteroides ovatus
CL03T12C18]
gi|392677631|gb|EIY71047.1| hypothetical protein HMPREF1070_00465 [Bacteroides ovatus
CL03T12C18]
Length = 800
Score = 254 bits (648), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 212/735 (28%), Positives = 340/735 (46%), Gaps = 127/735 (17%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ + T FP A++N+ L I A
Sbjct: 151 RLGIPV-DFTNEGIRGLCHD-----------RATMFPAQCGQGATWNKKLIGEIAKVTAD 198
Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
EA+A+ G T ++P ++I +DPRWGR E+ GEDP +V + + Q E
Sbjct: 199 EAKAL------GYTNIYSPILDIAQDPRWGRVVESYGEDPYLVGELGKQMILGLQNEG-- 250
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
+ A KH Y + G + +
Sbjct: 251 ------------------------------IVATPKHFAVYSIPVGGRDGGTRTDPHVAP 280
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDC 327
++ + + PFR I++ A +M SYN +G P + R +WGFKGYI SD
Sbjct: 281 REMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYIVSDS 340
Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTC----MLRHTQSAIDKGKVQEKDIDR 383
+AV + T T E+ AA V+ AG++I + + AI++GKV +D+
Sbjct: 341 EAVEFLHTKHRITPTEEEMAAQVVNAGLNIRTNFTPPQDFILPLRRAINEGKVSLHTLDQ 400
Query: 384 ALLNLFSVQLRLGLFN----GDPRKGKYGKLGPDDVCTSE-HKKLALDAARQGIVLLKND 438
+ + V+ +GLF+ GD R+ P+ V ++ HK +++ AA + IVLLKN+
Sbjct: 401 RVGEILRVKFMMGLFDNPYPGDDRR-------PETVVHNDAHKAVSMKAALESIVLLKNE 453
Query: 439 KKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGC 496
+ LPL+KN S +A+IGP + ++ Y S K++ +G++ Y+ S+ YA GC
Sbjct: 454 NQMLPLSKN-FSKIAVIGPNGEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVRYAKGC 512
Query: 497 ------------HDVPCNS--DAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLL 542
++VP ++ A EAV +AK +D I+V G + E+ R +L L
Sbjct: 513 DIIDKYFPESELNNVPLDTQEQAMIQEAVELAKASDIAILVLGGNEKTVREEFSRTNLDL 572
Query: 543 PGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEI 602
G+Q L+ +V T K PVILV+ G +++ A+ I +I+ +PGE A+A++
Sbjct: 573 CGRQQQLLEAVYATGK-PVILVMVDGRAATINW--ANKYIPAIIHAWFPGEFMGDAIAKV 629
Query: 603 IFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
+FGD+NPGGRL +T +P+S ++P S+ + +Y FG+GLSY
Sbjct: 630 LFGDYNPGGRLAVT-FPKSVGQIPFAFPFKPGSDSKG------KVRVDGVLYPFGYGLSY 682
Query: 663 TNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTN 722
T + Y S+L IS + G +NI SCT V N
Sbjct: 683 TTFGY------SDLKISKPV-IGPQENI---------------TLSCT---------VKN 711
Query: 723 AGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
G G VV L+ R S T +K L GF+R+H + ++F + P + L + +K+
Sbjct: 712 TGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVNFTLTP-QDLGLWDKN 770
Query: 783 GRRILPLGNHVLMVG 797
R + G+ +MVG
Sbjct: 771 NRFTVEPGSFSVMVG 785
>gi|423303655|ref|ZP_17281654.1| hypothetical protein HMPREF1072_00594 [Bacteroides uniformis
CL03T00C23]
gi|423307623|ref|ZP_17285613.1| hypothetical protein HMPREF1073_00363 [Bacteroides uniformis
CL03T12C37]
gi|392688019|gb|EIY81310.1| hypothetical protein HMPREF1072_00594 [Bacteroides uniformis
CL03T00C23]
gi|392689492|gb|EIY82769.1| hypothetical protein HMPREF1073_00363 [Bacteroides uniformis
CL03T12C37]
Length = 801
Score = 254 bits (648), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 213/736 (28%), Positives = 335/736 (45%), Gaps = 129/736 (17%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ + T FP A++N+ L + IG A
Sbjct: 151 RLGIPV-DFTNEGIRGLCHD-----------RATYFPAQCGQGATWNKELIAQIGEA--- 195
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
EAR LG + ++P ++I +DPRWGR ET GEDP + + S Q
Sbjct: 196 EAREASVLGYTNI--YSPILDIAQDPRWGRCVETYGEDPYHAGQMGKQMILSLQKNK--- 250
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
L + KH Y + G + + + +
Sbjct: 251 -----------------------------LVSTPKHFAVYSIPVGGRDGKTRTDPHVAPR 281
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGD---LFQKARNEWGFKGYITSD 326
+ + PFR + A +M SYN +G P + G L + R EWGFKGY+ SD
Sbjct: 282 EMRTLYLDPFRVAFHEAGALGVMSSYNDYDGEP--ITGSYHFLTEILRQEWGFKGYVVSD 339
Query: 327 CDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTC----MLRHTQSAIDKGKVQEKDID 382
+AV I +ED+ A + AG++I + +SA+ KGK+ ++ ++
Sbjct: 340 SEAVEFISTKHQVANGYEDAVAQAVNAGLNIRTHFTPPADFILPLRSAVKKGKISQETLN 399
Query: 383 RALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFL 442
+ + + V+ LGLF+ +P +G + G V + EH++LAL+AARQ +VLLKN+ + L
Sbjct: 400 QRVAEILRVKFWLGLFD-NPYRGDEKRAG-QIVHSPEHQQLALEAARQSLVLLKNEHQTL 457
Query: 443 PLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGCH--- 497
PL+K ++ S+A+IGP + Q+ Y ++ G++ + + Y GC
Sbjct: 458 PLSK-SIRSVAVIGPNADERQQLICRYGPANAHITTIYEGIKKMLPQADVVYKKGCDIID 516
Query: 498 ---------DVPCNSDAG--FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQ 546
+ P + EA+ AK A+ ++V G + EDR R SL LPG+Q
Sbjct: 517 PHFPESEVLEFPKAAQEAQMMEEAIEAAKGAEVTVMVLGGNELTVREDRSRTSLDLPGRQ 576
Query: 547 MSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD 606
L+ + + K PV+LV+ G ++FA + + +I+ +PGE G +A+AE +FGD
Sbjct: 577 EELLKKICQLGK-PVVLVMIDGRASSINFAA--THVPAIIHAWFPGEFGGQAIAEALFGD 633
Query: 607 FNPGGRLPMTWYPESFTKVPMN-DMNMRADSSRQYPGRSYRFYTGTQVYG----FGHGLS 661
+NPGGRL +T +P+S ++P +D S + T VYG FGHGLS
Sbjct: 634 YNPGGRLAVT-FPKSVGQIPFAFPFKPGSDESSE-----------TSVYGALYPFGHGLS 681
Query: 662 YTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVT 721
YT + Y S+L IS S K G NI SCT +
Sbjct: 682 YTTFQY------SDLVISPS-KQGVQGNI---------------SISCT---------IK 710
Query: 722 NAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANK 781
N G +G VV L+ R S T + L GF+R+ + S + F + P ++L I +K
Sbjct: 711 NIGQREGDEVVQLYLRDEVSSVTTYTQVLRGFERITLKPEASHTVHFELTP-QELGIWDK 769
Query: 782 HGRRILPLGNHVLMVG 797
+ G +M+G
Sbjct: 770 QMNFTVEPGMFKVMIG 785
>gi|293373755|ref|ZP_06620101.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
gi|292631245|gb|EFF49877.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
Length = 800
Score = 254 bits (648), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 209/735 (28%), Positives = 340/735 (46%), Gaps = 127/735 (17%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ + T FP A++N+ L I A
Sbjct: 151 RLGIPV-DFTNEGIRGLCHD-----------RATMFPAQCGQGATWNKKLIREIAKVTAD 198
Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
EA+A+ G T ++P ++I +DPRWGR E+ GEDP +V + + Q E
Sbjct: 199 EAKAL------GYTNIYSPILDIAQDPRWGRVVESYGEDPYLVGELGKQMILGLQNEG-- 250
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
+ A KH Y + G + +
Sbjct: 251 ------------------------------IVATPKHFAVYSIPVGGRDGGTRTDPHVAP 280
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDC 327
++ + + PFR I++ A +M SYN +G P + R +WGFKGY+ SD
Sbjct: 281 REMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYVVSDS 340
Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTC----MLRHTQSAIDKGKVQEKDIDR 383
+AV + T T E+ AA V+ AG++I + + AI++GKV +D+
Sbjct: 341 EAVEFLHTKHRITPTEEEMAAQVVNAGLNIRTNFTPPQDFILPLRRAINEGKVSLHTLDQ 400
Query: 384 ALLNLFSVQLRLGLFN----GDPRKGKYGKLGPDDVCTSE-HKKLALDAARQGIVLLKND 438
+ + V+ +GLF+ GD R+ P+ V ++ HK +++ AA + IVLLKN+
Sbjct: 401 RVGEILRVKFMMGLFDNPYPGDDRR-------PETVVHNDAHKAVSMKAALESIVLLKNE 453
Query: 439 KKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGC 496
+ LPL+KN S +A+IGP + ++ Y S K++ +G++ Y+ S+ Y GC
Sbjct: 454 NQMLPLSKN-FSKIAVIGPNGEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVRYVKGC 512
Query: 497 ------------HDVPCNS--DAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLL 542
++VP ++ A HEAV +AK +D I+V G + E+ R +L L
Sbjct: 513 DIIDKYFPESELYNVPLDTQEQAMIHEAVELAKASDVAILVLGGNEKTVREEFSRTNLDL 572
Query: 543 PGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEI 602
G+Q L+ +V T K PV+LV+ G +++ A+ + +I+ +PGE A+A++
Sbjct: 573 CGRQQQLLEAVYATGK-PVVLVMVDGRAATINW--ANKYVPAIIHAWFPGEFMGDAIAKV 629
Query: 603 IFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
+FGD+NPGGRL +T +P+S ++P S+ + +Y FG+GLSY
Sbjct: 630 LFGDYNPGGRLAVT-FPKSVGQIPFAFPFKPGSDSKG------KVRVDGVLYPFGYGLSY 682
Query: 663 TNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTN 722
T + Y S+L IS + G +NI SCT V N
Sbjct: 683 TTFGY------SDLKISKPV-IGPQENI---------------TLSCT---------VKN 711
Query: 723 AGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
G G VV L+ R S T +K L GF+R+H + ++F + P + L + +K+
Sbjct: 712 TGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVNFTLTP-QDLGLWDKN 770
Query: 783 GRRILPLGNHVLMVG 797
R + G+ +MVG
Sbjct: 771 NRFTVEPGSFSVMVG 785
>gi|423342899|ref|ZP_17320613.1| hypothetical protein HMPREF1077_02043 [Parabacteroides johnsonii
CL02T12C29]
gi|409217154|gb|EKN10133.1| hypothetical protein HMPREF1077_02043 [Parabacteroides johnsonii
CL02T12C29]
Length = 955
Score = 254 bits (648), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 204/734 (27%), Positives = 334/734 (45%), Gaps = 113/734 (15%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ S T+FP L ++NR L +G
Sbjct: 161 RLGIPT-DFTNEGIRGVES-----------YIATNFPTQLGLGHTWNRDLVHKVGYITGR 208
Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
E R + G T +AP +++ RD RWGR +E GE P +V+ VE K Q
Sbjct: 209 EGRLL------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGVEMAKGMQ----- 257
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
+ ++A KH IAY K G + ++
Sbjct: 258 --------------------------TDYQVAATSKHYIAYSNNKGGREGMARVDPQMSP 291
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVP-ACLRGDLFQKARNEWGFKGYITSDC 327
++ E P++ I++ +M SYN +G P L + R E+GF+GY+ SD
Sbjct: 292 REVEMLHVYPWKRVIKEAGILGVMSSYNDYDGFPIQSSYYWLTTRLRGEFGFRGYVVSDS 351
Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDIDR 383
DAV +F ++S + AG++I C + + I +G + ID
Sbjct: 352 DAVEYLFSKHGTAADMKESVLQSVLAGLNIRCTFRSPDSYVLPLRELIAEGALPMSTIDD 411
Query: 384 ALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLP 443
+ ++ V+ +GLF+ P + K +V ++E++++AL A+++ +VLLKN LP
Sbjct: 412 RVRDILRVKFLVGLFD-QPYQIDL-KQADKEVNSAENQQVALQASKESLVLLKNQDAVLP 469
Query: 444 LNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCHDVP 500
L+ N +S +A+ GP + + Y + ++L G++ V ++ + GC V
Sbjct: 470 LDVNKISKIAVCGPNADEEAYALTHYGPLAVEVTTVLEGIQNKVKPGTEVLFTKGCDLVD 529
Query: 501 CN--------------SDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQ 546
N + +AV AKK+D +VV G E++ R SL LPG+Q
Sbjct: 530 ANWPESELIRYPLTSEEQSEIDKAVENAKKSDVAVVVLGGSNRTCGENKSRSSLELPGRQ 589
Query: 547 MSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD 606
+ L+ +V T K PV+LVL G P+ +++A D + +IL YPG G A+A+ +FGD
Sbjct: 590 LDLLQAVVATGK-PVVLVLINGRPISINWA--DKYVPAILEAWYPGSQGGTAIADALFGD 646
Query: 607 FNPGGRLPMTWYPESFTKVPMN---DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYT 663
+NPGG+L +T +P++ ++P N N + D R +Y FG+GLSYT
Sbjct: 647 YNPGGKLTVT-FPKTVGQIPFNFPTKPNAQVDGGRNKGLDGNMSRVNGPLYPFGYGLSYT 705
Query: 664 NYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNA 723
+ Y S+++I ++ + +V T V+ VTN
Sbjct: 706 TFEY------SDISIQPAI--------------------VTQVQPVT-----VRCKVTNT 734
Query: 724 GDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHG 783
G G VV L+ R S T EK L+GFDR+H +KE++F ++P L + N
Sbjct: 735 GKRAGDEVVQLYVRDILSSVTTYEKNLVGFDRIHLNPGETKELTFTIEP-RDLQLLNSDN 793
Query: 784 RRILPLGNHVLMVG 797
++ G+ +MVG
Sbjct: 794 HWVVEPGDFKVMVG 807
>gi|160890986|ref|ZP_02071989.1| hypothetical protein BACUNI_03433 [Bacteroides uniformis ATCC 8492]
gi|156859207|gb|EDO52638.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
uniformis ATCC 8492]
Length = 869
Score = 254 bits (648), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 243/858 (28%), Positives = 367/858 (42%), Gaps = 178/858 (20%)
Query: 10 LRERILFRVLQLIVVVNVIAFSNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLI 69
L +RIL + L ++ +IA + C P + DS S+S+ R L+
Sbjct: 3 LEDRILKKWLLPFILSVLIAVDSMAQ--------CIPVYKDS------SMSVERRVTDLL 48
Query: 70 SLLTLQEKIQQLS-------------------DNASAIP--------------------- 89
+TL+EKI QLS D A I
Sbjct: 49 GRMTLREKIAQLSHLHGYQLYNGQEVDYQKLRDAAGDISYGCIEGFNLTGENVRKAFHAI 108
Query: 90 --------RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWS 141
RLGIP + +ESLHG +G T FPQ + ++FN L
Sbjct: 109 QKYMVEETRLGIPVFTV-TESLHGSVHDGS-----------TIFPQSVAVGSTFNLDLAY 156
Query: 142 NIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKS 201
+ A+A E R+ Q + +P +++ RD RWGR +E+ GEDP +V + VK
Sbjct: 157 QMTKAIATELRS-----QGVIQTLSPGLDVVRDLRWGRVEESFGEDPWLVGQMGIAQVKG 211
Query: 202 FQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYS 261
+ DGGI +LK FG L L++
Sbjct: 212 YI-------DGGIS------PMLKPFGPGGAPLGGLNLASVES----------------- 241
Query: 262 FNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFK 320
E+D + P+ + + +M SYN NG+P L RNEWGFK
Sbjct: 242 -----GERDIRNIHIKPYEMAVRNTEVKAVMTSYNSWNGIPNSASSYLLTNILRNEWGFK 296
Query: 321 GYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKD 380
GY+ SD AVA + ++Q+ K ++A L AG+D+ + + I++G+ EK
Sbjct: 297 GYVYSDWGAVAMLKDFQHTAKDDSEAAIQALTAGVDLEASSNCYWALEQLIEQGRFDEKY 356
Query: 381 IDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKK 440
+D A+ + V+ LGLF +P +G + + T E +L+ A + IVLLKN+
Sbjct: 357 VDLAVGRILRVKFELGLFE-NPYQG--ADMPGVAMRTKEAVELSRRVADESIVLLKNENT 413
Query: 441 FLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSL--LRGLEAYVS---KTHYASG 495
LPLN N + SLA+IGP N + G YT + + L GL+ V K +YA+G
Sbjct: 414 LLPLNLNKIKSLAVIGPNANQVQ--FGDYTWSRSNKDGVTPLEGLKKRVGNKIKINYAAG 471
Query: 496 CHDVPCNSDAGFHEAVRIAKKADFVIVVAG-------LDLTQET--EDRDRVSLLLPGQQ 546
C D+ ++ +GF EAV K +D +V G D + T E D SL L G Q
Sbjct: 472 C-DLITDNKSGFDEAVAAVKASDMAVVFVGSSSASLARDYSDATCGEGFDLSSLDLTGVQ 530
Query: 547 MSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD 606
LV + K PVI+VL G P +S+ + I +I+ Y GE A+A+++ G+
Sbjct: 531 EELVEEIYAIGK-PVIVVLVTGKPFSISWIK--EHIPAIVVQWYGGEKAGDAIADMLLGN 587
Query: 607 FNPGGRLPMTWYPES-------FTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHG 659
NP +LP + +P+S + +P + R PGR Y F + ++ FGHG
Sbjct: 588 INPSAKLPFS-FPQSVGHLPVFYNHLPTDKGFYRRPGRPNEPGRDYVFSSPAPLWSFGHG 646
Query: 660 LSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQIS 719
LSYT + Y L+A SA L SD I V +S
Sbjct: 647 LSYTTFEY--LNA----HYSAELLHPSDTLI-------------------------VSVS 675
Query: 720 VTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIA 779
+ N G V G VV L+ R S TP KQL F + + G + P ++L++
Sbjct: 676 LKNTGSVAGKEVVQLYVRDVVSSVVTPVKQLKAFSKPF-LQPGEMQTVVLKLPIQELALY 734
Query: 780 NKHGRRILPLGNHVLMVG 797
+ ++++ G + + +G
Sbjct: 735 DLSMKKVVEEGEYEIQIG 752
>gi|373955060|ref|ZP_09615020.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
gi|373891660|gb|EHQ27557.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
Length = 754
Score = 254 bits (648), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 205/672 (30%), Positives = 320/672 (47%), Gaps = 105/672 (15%)
Query: 123 TSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFW--APNINIFRDPRWGRG 180
T+FP L AAS++ A+ + AR A W AP ++I RDPRWGR
Sbjct: 124 TTFPIPLAEAASWDMD-------AIRLSARIAATEAAASGIHWTFAPMVDISRDPRWGRV 176
Query: 181 QETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLS 240
E GEDP + S A+ VK FQG+ S D +
Sbjct: 177 MEGAGEDPYLGSQIAIARVKGFQGDGLGSLDA--------------------------VM 210
Query: 241 ACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVN 299
AC KH AY G +NA+ ++++ + + PPF++ ++ G A+ M S+N++N
Sbjct: 211 ACAKHFAAYGAAIAGR----DYNAVDMSDRMLWEVYLPPFKAALDAG-AATFMNSFNELN 265
Query: 300 GVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDIN 358
G+PA L +K + +WGFKG++ SD ++A + + + K ++A L AG D++
Sbjct: 266 GIPATGNSYLQRKVLKGKWGFKGFVVSDWGSIAEMVNH-GFVKDKNEAARVALNAGSDMD 324
Query: 359 C-GTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCT 417
G + + + GKV +D A+ + + + LGLF+ DP + K + V +
Sbjct: 325 MEGRSYIEYLPQLVRSGKVSITLVDDAVRRVLTKKFELGLFD-DPYRFSDEKREKEIVNS 383
Query: 418 SEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT---GIP- 473
+ ++A A++ IVLLKN+ + LPL++NA S+A+IGPL M G + GI
Sbjct: 384 PRNHEVAKMMAQKSIVLLKNEGELLPLSRNA-KSIAVIGPLAKAKKDMNGFWALPWGIAD 442
Query: 474 -CSPKSLLRGLEAYV---SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLT 529
P SL G++A V YA GC +V S AGF EA R+A+ AD V++ G
Sbjct: 443 ESEPVSLFEGIQAKVGDKKNVWYAKGC-NVNDTSRAGFAEAARVAQNADVVVMAIGEAYN 501
Query: 530 QETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIG 589
E + R ++ LPG Q LV ++ T K V+LV+ GG PL F + +IL+
Sbjct: 502 MSGEAKSRANIHLPGVQEELVKAIQATGKPTVVLVM-GGRPL--IFNWTADHVPAILFTW 558
Query: 590 YPGEAGAKALAEIIFGDFNPGGRLPMT----------WYPESFTKVPMNDMNMRADSSRQ 639
+ G G A+A+++FGD+NP +LP+T +Y T P+ D+N A S
Sbjct: 559 WLGVEGGNAMADVLFGDYNPSAKLPITFPRSEGQIPVYYASKNTGRPLVDVNDVAYKS-- 616
Query: 640 YPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLD 699
+Y T + FG+GLSYT++ Y + LK S K
Sbjct: 617 ----AYIDELNTPRFAFGYGLSYTHFKY------------SDLKLSSKK----------- 649
Query: 700 YVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTV 759
++ + +C+ ++TN+G G VV L+ R S P K+L F ++ +
Sbjct: 650 -INNTQTITCS-------FTITNSGKYTGEEVVQLYLRNRVASVTRPVKELKDFKKIKLM 701
Query: 760 AKGSKEISFGVD 771
SK +SF +D
Sbjct: 702 PSESKTVSFVID 713
>gi|380692929|ref|ZP_09857788.1| glycoside hydrolase family protein [Bacteroides faecis MAJ27]
Length = 777
Score = 253 bits (647), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 216/746 (28%), Positives = 331/746 (44%), Gaps = 131/746 (17%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ T FP +++N+ L + IG V
Sbjct: 127 RLGIPV-DFTNEGIRGVCHE-----------KATFFPAQCGQGSTWNKELIARIGE---V 171
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
EA+ LG + ++P ++I +DPRWGR E GEDP +V + ++S Q
Sbjct: 172 EAKEAVALGYTNI--YSPILDIAQDPRWGRAVECYGEDPYLVGQLGKQMIQSLQKHK--- 226
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
L A KH Y + G + + +
Sbjct: 227 -----------------------------LVATPKHFAVYSIPVGGRDGGTRTDPHVAPR 257
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGD---LFQKARNEWGFKGYITSD 326
+ + PFR ++ A +M SYN +G P + G L Q R EWGFKGY+ SD
Sbjct: 258 EMRTLYLEPFRVAFQEAGALGVMSSYNDYDGEP--ITGSYRFLTQILRQEWGFKGYVVSD 315
Query: 327 CDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDID 382
DAV I +E++ + AG+++ ++ +SAI KGKV + ID
Sbjct: 316 SDAVEFISSKHKVADNNEEAVVQSVNAGLNVRTNFSSPAGFIKPLRSAIAKGKVSQATID 375
Query: 383 RALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFL 442
+ + + V+ LGLF+ +P +G GKL V EH+ +AL+AARQ IVLLKN L
Sbjct: 376 QRVSEILYVKFWLGLFD-NPYRGD-GKLADKIVHCKEHQAVALEAARQSIVLLKNQDNLL 433
Query: 443 PLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGCH--- 497
PL K + S+A+IGP + ++ Y K++ +G++ + +K Y GC
Sbjct: 434 PLQKT-LKSVAVIGPNADEQKELICRYGPSNAPIKTVYKGIKEALPGAKVVYKKGCEIVD 492
Query: 498 -----------DVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQ 546
D+ EA+ AK A+ VI+V G E+R R SL LPG+Q
Sbjct: 493 PHFPESEVLPFDITPKEQQIMDEAIEAAKSAEVVIMVLGGSEVTVREERSRTSLDLPGRQ 552
Query: 547 MSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD 606
L+ +V + K P ILV+ G +++A+ + +IL +PGE +A+AE IFGD
Sbjct: 553 EELLKAVCKLGK-PTILVMIDGRASSINYAK--KYVPAILHAWFPGEFCGQAVAETIFGD 609
Query: 607 FNPGGRLPMTWYPESFTKVPMN-DMNMRADSSRQYPGRSYRFYTGTQVYG----FGHGLS 661
NPGG+L +T +P+S ++P +DS GT V G FGHGLS
Sbjct: 610 NNPGGKLAVT-FPKSVGQIPFAFPFKPGSDSG-----------CGTSVTGALFPFGHGLS 657
Query: 662 YTNYSYKFLS-APSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISV 720
YT + Y L +P + + +K SCT V
Sbjct: 658 YTTFEYNNLKISPEQQGVLGEVK-----------------------VSCT---------V 685
Query: 721 TNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIAN 780
N G G VV L+ R S T K L GF+R+ K+++F + P + L+I +
Sbjct: 686 KNTGKRPGDEVVQLYLRDEISSVTTYVKILRGFERITLQPNEEKKVTFTLSP-QDLAIWD 744
Query: 781 KHGRRILPLGNHVLMVGELRHSLTIE 806
K+ + + G +M+G + +E
Sbjct: 745 KNMKFQVEPGTFKVMIGASSKDIRLE 770
>gi|423227459|ref|ZP_17213920.1| hypothetical protein HMPREF1062_06106 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392623089|gb|EIY17195.1| hypothetical protein HMPREF1062_06106 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 864
Score = 253 bits (647), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 156/462 (33%), Positives = 241/462 (52%), Gaps = 58/462 (12%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
P+ N LS S RA L+ +TL+EK+ Q+ + + AI RLGIPAY+WW+E+LHG+A G
Sbjct: 23 PYKNPELSPSERAWDLLKRMTLEEKVSQMKNGSPAIERLGIPAYDWWNEALHGVARAG-- 80
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN--------LGQAGLTF 164
T FPQ + AA+F+ V+ EARA Y+ G GLTF
Sbjct: 81 --------KATVFPQAIGLAATFDNQAVYETFDIVSDEARAKYHDFQRKGERDGYKGLTF 132
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
W PNINI+RDPRWGRG ET GEDP + S + VK QG GG G
Sbjct: 133 WTPNINIYRDPRWGRGMETYGEDPYLTSLMGLAVVKGLQG-------GGTG--------- 176
Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
+ D+ AC KH + +W +R+SF+A I+++D +T+ P F++ +
Sbjct: 177 -----KYDKA-----HACAKHYAVHSGPEW---NRHSFDAKNISQRDLWETYLPAFKTLV 223
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
++GK +MC+YN+ G P C L + R +WG+ + SDC A+ F Y N+ +T
Sbjct: 224 KEGKVKEVMCAYNRFEGEPCCSNKQLLIRILREDWGYDDIVVSDCGAIGD-FYYPNHHET 282
Query: 343 H---EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
H ++A + +G D+ CG A+ KG + E+ I+ ++ L + +LG+F+
Sbjct: 283 HPTAAAASADAVVSGTDLECGGSY-SSLNEAVRKGLISEEKINESVFRLLRARFQLGMFD 341
Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
D + ++ V + EH AL+ AR+ +VLL N LPL+K ++ +A++GP
Sbjct: 342 DDALVS-WSEIPYSVVESKEHVAKALEMARKSMVLLTNKNHTLPLSK-SIRKVAVLGPNA 399
Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSK--THYASGCHDV 499
N+ + Y G P ++L G+++ + + +Y GC V
Sbjct: 400 NDSVMLWANYNGFPTKSVTILEGIKSKLPEGTVYYEKGCDYV 441
Score = 117 bits (292), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 98/318 (30%), Positives = 144/318 (45%), Gaps = 59/318 (18%)
Query: 493 ASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLL 542
AS D+ + + E A +AD +I V GL T E E+ DR ++ L
Sbjct: 576 ASLKFDIGIKKEINYKEVADKAAEADAIIFVGGLSPTLEGEEMPVDLPGFRKGDRTNIDL 635
Query: 543 PGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEI 602
P Q ++ ++ +T K PVI VL G L + + EA++ + +IL YPG+ G A+A++
Sbjct: 636 PHVQAEMLKALKKTGK-PVIFVLCSGSTLALPW-EAEN-LDAILEAWYPGQQGGTAVADV 692
Query: 603 IFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
+FGD+NP GRLP+T+Y S D +M R+YR++ G ++ FGHGLSY
Sbjct: 693 LFGDYNPAGRLPLTFYASSDDLPDFEDYDMS--------NRTYRYFKGKALFPFGHGLSY 744
Query: 663 TNYSYKFLSAPSELTISASLKAGSDK-NILQQTGSRLDYVHIDEVTSCTSLRFHVQISVT 721
T + Y KA DK N+ G L I +
Sbjct: 745 TIFDYG--------------KAKVDKQNVRAGEGMTL------------------TIPLK 772
Query: 722 NAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANK 781
N G +DG V+ ++ R P +G P K L F RV A ++ I + P N
Sbjct: 773 NTGKLDGDEVIQVYLRNPADKEG-PIKTLRAFRRVSLPAGQTENIRIEL-PASTFECFNP 830
Query: 782 HGRR--ILPLGNHVLMVG 797
R ILP G + L+ G
Sbjct: 831 STNRMEILP-GKYELLYG 847
>gi|270294308|ref|ZP_06200510.1| periplasmic beta-glucosidase [Bacteroides sp. D20]
gi|270275775|gb|EFA21635.1| periplasmic beta-glucosidase [Bacteroides sp. D20]
Length = 862
Score = 253 bits (647), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 236/824 (28%), Positives = 354/824 (42%), Gaps = 170/824 (20%)
Query: 44 CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLS-------------------DN 84
C P + DS S+S+ R L+ +TL+EKI QLS D
Sbjct: 22 CIPVYKDS------SMSVERRVTDLLGRMTLREKIAQLSHLHGYQLYNGQEVDYQKLRDA 75
Query: 85 ASAIP-----------------------------RLGIPAYEWWSESLHGIASNGPGVNF 115
A I RLGIP + +ESLHG +G
Sbjct: 76 AGDISYGCIEGFNLTGENVRKAFHAIQKYMVEETRLGIPVFTV-TESLHGSVHDGS---- 130
Query: 116 NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDP 175
T FPQ + ++FN L + A+A E R+ Q + +P +++ RD
Sbjct: 131 -------TIFPQSVAVGSTFNLDLAYQMTKAIATELRS-----QGVIQTLSPGLDVVRDL 178
Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
RWGR +E+ GEDP +V + VK + DGGI +LK FG
Sbjct: 179 RWGRVEESFGEDPWLVGQMGIAQVKGYI-------DGGI------SPMLKPFGPGGAPLG 225
Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
L L++ E+D + P+ + + +M SY
Sbjct: 226 GLNLASVES----------------------GERDIRNIHIKPYEMAVRNTEVKAVMTSY 263
Query: 296 NQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
N NG+P L RNEWGFKGY+ SD AVA + ++Q+ K ++A L AG
Sbjct: 264 NSWNGIPNSASSYLLTNILRNEWGFKGYVYSDWGAVAMLKDFQHTAKDDSEAAIQALTAG 323
Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDD 414
+D+ + + I++G+ EK +D A+ + V+ LGLF +P +G +
Sbjct: 324 VDLEASSNCYWALEQLIEQGRFDEKYVDLAVGRILRVKFELGLFE-NPYQG--ADMPGVA 380
Query: 415 VCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPC 474
+ T E +L+ A + IVLLKN+ LPLN N + SLA+IGP N + G YT
Sbjct: 381 MRTKEAVELSRRVADESIVLLKNENTLLPLNLNKIKSLAVIGPNANQVQ--FGDYTWSRS 438
Query: 475 SPKSL--LRGLEAYVS---KTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG---- 525
+ + L GL+ V K +YA+GC D+ ++ +GF EAV K +D +V G
Sbjct: 439 NKDGVTPLEGLKKRVGNKIKINYAAGC-DLITDNKSGFDEAVAAVKASDMAVVFVGSSSA 497
Query: 526 ---LDLTQET--EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADS 580
D + T E D SL L G Q LV + K PVI+VL G P +S+ +
Sbjct: 498 SLARDYSDATCGEGFDLSSLDLTGVQEELVEEIYAIGK-PVIVVLVTGKPFSISWIK--E 554
Query: 581 QISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPES-------FTKVPMNDMNMR 633
I +I+ Y GE A+A+++ G+ NP +LP + +P+S + +P + R
Sbjct: 555 HIPAIVVQWYGGEKAGDAIADMLLGNINPSAKLPFS-FPQSVGHLPVFYNHLPTDKGFYR 613
Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
PGR Y F + ++ FGHGLSYT + Y L+A SA L SD I
Sbjct: 614 RPGRPNEPGRDYVFSSPAPLWSFGHGLSYTTFEY--LNA----HYSAELLHPSDTLI--- 664
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
V +S+ N G V G VV L+ R S TP KQL F
Sbjct: 665 ----------------------VSVSLKNTGSVAGKEVVQLYVRDVVSSVVTPVKQLKAF 702
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
+ + G + P ++L++ + ++++ G + + +G
Sbjct: 703 SKPF-LQPGEMQTVVLKLPIQELALYDLSMKKVVEEGEYEIQIG 745
>gi|299144988|ref|ZP_07038056.1| xylosidase [Bacteroides sp. 3_1_23]
gi|298515479|gb|EFI39360.1| xylosidase [Bacteroides sp. 3_1_23]
Length = 800
Score = 253 bits (647), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 209/735 (28%), Positives = 340/735 (46%), Gaps = 127/735 (17%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ + T FP A++N+ L I A
Sbjct: 151 RLGIPV-DFTNEGIRGLCHD-----------RATMFPAQCGQGATWNKKLIREIAKVTAD 198
Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
EA+A+ G T ++P ++I +DPRWGR E+ GEDP +V + + Q E
Sbjct: 199 EAKAL------GYTNIYSPILDIAQDPRWGRVVESYGEDPYLVGELGKQMILGLQNEG-- 250
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
+ A KH Y + G + +
Sbjct: 251 ------------------------------IVATPKHFAVYSIPVGGRDGGTRTDPHVAP 280
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDC 327
++ + + PFR I++ A +M SYN +G P + R +WGFKGY+ SD
Sbjct: 281 REMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYVVSDS 340
Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTC----MLRHTQSAIDKGKVQEKDIDR 383
+AV + T T E+ AA V+ AG++I + + AI++GKV +D+
Sbjct: 341 EAVEFLHTKHRITPTEEEMAAQVVNAGLNIRTNFTPPQDFILPLRRAINEGKVSLHTLDQ 400
Query: 384 ALLNLFSVQLRLGLFN----GDPRKGKYGKLGPDDVCTSE-HKKLALDAARQGIVLLKND 438
+ + V+ +GLF+ GD R+ P+ V ++ HK +++ AA + IVLLKN+
Sbjct: 401 RVGEILRVKFMMGLFDNPYPGDDRR-------PEAVVHNDAHKAVSMKAALESIVLLKNE 453
Query: 439 KKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGC 496
+ LPL+KN S +A+IGP + ++ Y S K++ +G++ Y+ S+ Y GC
Sbjct: 454 NQMLPLSKN-FSKIAVIGPNGEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVRYVKGC 512
Query: 497 ------------HDVPCNS--DAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLL 542
++VP ++ A HEAV +AK +D I+V G + E+ R +L L
Sbjct: 513 DIIDKYFPESELYNVPLDTQEQAMIHEAVELAKASDVAILVLGGNEKTVREEFSRTNLDL 572
Query: 543 PGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEI 602
G+Q L+ +V T K PV+LV+ G +++ A+ + +I+ +PGE A+A++
Sbjct: 573 CGRQQQLLEAVYATGK-PVVLVMVDGRAATINW--ANKYVPAIIHAWFPGEFMGDAIAKV 629
Query: 603 IFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
+FGD+NPGGRL +T +P+S ++P S+ + +Y FG+GLSY
Sbjct: 630 LFGDYNPGGRLAVT-FPKSVGQIPFAFPFKPGSDSKG------KVRVDGVLYPFGYGLSY 682
Query: 663 TNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTN 722
T + Y S+L IS + G +NI SCT V N
Sbjct: 683 TTFGY------SDLKISKPV-IGPQENI---------------TLSCT---------VKN 711
Query: 723 AGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
G G VV L+ R S T +K L GF+R+H + ++F + P + L + +K+
Sbjct: 712 TGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVNFTLTP-QDLGLWDKN 770
Query: 783 GRRILPLGNHVLMVG 797
R + G+ +MVG
Sbjct: 771 NRFTVEPGSFSVMVG 785
>gi|270296098|ref|ZP_06202298.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D20]
gi|270273502|gb|EFA19364.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D20]
Length = 798
Score = 253 bits (647), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 213/736 (28%), Positives = 335/736 (45%), Gaps = 129/736 (17%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ + T FP A++N+ L + IG A
Sbjct: 148 RLGIPV-DFTNEGIRGLCHD-----------RATYFPAQCGQGATWNKELIAQIGEA--- 192
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
EAR LG + ++P ++I +DPRWGR ET GEDP + + S Q
Sbjct: 193 EAREASVLGYTNI--YSPILDIAQDPRWGRCVETYGEDPYHAGQMGKQMILSLQKNK--- 247
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
L + KH Y + G + + + +
Sbjct: 248 -----------------------------LVSTPKHFAVYSIPVGGRDGKTRTDPHVAPR 278
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGD---LFQKARNEWGFKGYITSD 326
+ + PFR + A +M SYN +G P + G L + R EWGFKGY+ SD
Sbjct: 279 EMRTLYLDPFRVAFHEAGALGVMSSYNDYDGEP--ITGSYHFLTEILRQEWGFKGYVVSD 336
Query: 327 CDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTC----MLRHTQSAIDKGKVQEKDID 382
+AV I +ED+ A + AG++I + +SA+ KGK+ ++ ++
Sbjct: 337 SEAVEFISTKHQVANGYEDAVAQAVNAGLNIRTHFTPPADFILPLRSAVKKGKISQETLN 396
Query: 383 RALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFL 442
+ + + V+ LGLF+ +P +G + G V + EH++LAL+AARQ +VLLKN+ + L
Sbjct: 397 QRVAEILRVKFWLGLFD-NPYRGDEKRAG-QIVHSPEHQQLALEAARQSLVLLKNEHQTL 454
Query: 443 PLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGCH--- 497
PL+K ++ S+A+IGP + Q+ Y ++ G++ + + Y GC
Sbjct: 455 PLSK-SIRSVAVIGPNADERQQLICRYGPANAHITTIYEGIKKMLPQADVVYKKGCDIID 513
Query: 498 ---------DVPCNSDAG--FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQ 546
+ P + EA+ AK A+ ++V G + EDR R SL LPG+Q
Sbjct: 514 PHFPESEVLEFPKAAQEAQMMEEAIEAAKGAEVTVMVLGGNELTVREDRSRTSLDLPGRQ 573
Query: 547 MSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD 606
L+ + + K PV+LV+ G ++FA + + +I+ +PGE G +A+AE +FGD
Sbjct: 574 KELLKKICQLGK-PVVLVMIDGRASSINFAA--THVPAIIHAWFPGEFGGQAIAEALFGD 630
Query: 607 FNPGGRLPMTWYPESFTKVPMN-DMNMRADSSRQYPGRSYRFYTGTQVYG----FGHGLS 661
+NPGGRL +T +P+S ++P +D S + T VYG FGHGLS
Sbjct: 631 YNPGGRLAVT-FPKSVGQIPFAFPFKPGSDESSE-----------TSVYGALYPFGHGLS 678
Query: 662 YTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVT 721
YT + Y S+L IS S K G NI SCT +
Sbjct: 679 YTTFQY------SDLAISPS-KQGVQGNI---------------SISCT---------IK 707
Query: 722 NAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANK 781
N G +G VV L+ R S T + L GF+R+ + S + F + P ++L I +K
Sbjct: 708 NIGQREGDEVVQLYLRDEVSSVTTYTQVLRGFERITLKPEASHTVHFELTP-QELGIWDK 766
Query: 782 HGRRILPLGNHVLMVG 797
+ G +M+G
Sbjct: 767 QMNFTVEPGMFKVMIG 782
>gi|298482587|ref|ZP_07000772.1| xylosidase [Bacteroides sp. D22]
gi|336405443|ref|ZP_08586122.1| hypothetical protein HMPREF0127_03435 [Bacteroides sp. 1_1_30]
gi|295085727|emb|CBK67250.1| Beta-glucosidase-related glycosidases [Bacteroides xylanisolvens
XB1A]
gi|298271294|gb|EFI12870.1| xylosidase [Bacteroides sp. D22]
gi|335938024|gb|EGM99918.1| hypothetical protein HMPREF0127_03435 [Bacteroides sp. 1_1_30]
Length = 800
Score = 253 bits (647), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 208/735 (28%), Positives = 338/735 (45%), Gaps = 127/735 (17%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ + T FP A++N+ L I A
Sbjct: 151 RLGIPV-DFTNEGIRGLCHD-----------RATMFPAQCGQGATWNKKLIREIAKVTAN 198
Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
EA+A+ G T ++P ++I +DPRWGR E+ GEDP + + + Q E
Sbjct: 199 EAKAL------GYTNIYSPILDIAQDPRWGRVVESYGEDPYLAGELGKQMILGLQSEG-- 250
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
+ A KH Y + G + +
Sbjct: 251 ------------------------------IVATPKHFAVYSIPVGGRDGGTRTDPHVAP 280
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDC 327
++ + + PFR I++ A +M SYN +G P + R +WGFKGY+ SD
Sbjct: 281 REMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYVVSDS 340
Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTC----MLRHTQSAIDKGKVQEKDIDR 383
+AV + T T E+ AA V+ AG++I + + AID+GKV +D+
Sbjct: 341 EAVEFLHTKHRITPTEEEMAAQVVNAGLNIRTNFTPPQDFILPLRRAIDEGKVSLHTLDQ 400
Query: 384 ALLNLFSVQLRLGLFN----GDPRKGKYGKLGPDDVCTSE-HKKLALDAARQGIVLLKND 438
+ + V+ +GLF+ GD R+ P+ V ++ HK +++ AA + +VLLKN
Sbjct: 401 RVSEILRVKFMMGLFDNPYPGDDRR-------PETVVHNDAHKAVSMKAALESVVLLKNK 453
Query: 439 KKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGC 496
+ LPL+KN +A+IGP + ++ Y S K++ +G++ Y+ S+ YA GC
Sbjct: 454 NQMLPLSKN-FKKIAVIGPNAEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVRYAKGC 512
Query: 497 ------------HDVPCNS--DAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLL 542
++VP ++ A HEAV +AK +D I+V G + E+ R +L L
Sbjct: 513 DIIDKYFPESELYNVPLDTQEQAMIHEAVELAKASDIAILVLGGNEKTVREEFSRTNLDL 572
Query: 543 PGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEI 602
G+Q L+ +V T K PV+LV+ G +++ A+ + +I+ +PGE A+A++
Sbjct: 573 CGRQQQLLEAVYATGK-PVVLVMVDGRAATINW--ANKYVPAIIHAWFPGEFMGDAIAKV 629
Query: 603 IFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
+FGD+NPGGRL +T +P+S ++P S+ + +Y FG+GLSY
Sbjct: 630 LFGDYNPGGRLAVT-FPKSVGQIPFAFPFKPGSDSKG------KVRVDGVLYPFGYGLSY 682
Query: 663 TNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTN 722
T + Y S+L IS + G +NI SCT V N
Sbjct: 683 TTFGY------SDLKISKPV-IGPQENI---------------TLSCT---------VKN 711
Query: 723 AGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
G G VV L+ R S T +K L GF+R+H + +SF + P + L + +K+
Sbjct: 712 TGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVSFTLTP-QDLGLWDKN 770
Query: 783 GRRILPLGNHVLMVG 797
+ + G+ +MVG
Sbjct: 771 NQFTVEPGSFSVMVG 785
>gi|313204104|ref|YP_004042761.1| beta-glucosidase [Paludibacter propionicigenes WB4]
gi|312443420|gb|ADQ79776.1| Beta-glucosidase [Paludibacter propionicigenes WB4]
Length = 871
Score = 253 bits (646), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 158/459 (34%), Positives = 233/459 (50%), Gaps = 57/459 (12%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
P+ N +LS RAK LIS LTL EK + D + A+PRLGI + WWSE+LHG A+NG
Sbjct: 31 PYKNPNLSSEERAKDLISRLTLSEKATLMCDISDAVPRLGIKKFNWWSEALHGFANNG-- 88
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA---------GLT 163
+VT FP+ + AASF+ +L I AV+ EARA YN L+
Sbjct: 89 --------NVTVFPEPVGMAASFDDNLVYKIFDAVSDEARAKYNEATKKGLENARFLSLS 140
Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
W PN+NIFRDPRWGRGQET GEDP + S V VK QG +R+
Sbjct: 141 VWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVSVVKGLQGP--------ADAKYRK--- 189
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSC 282
L AC KH + +W SR+S N + +D +T+ P F+S
Sbjct: 190 ---------------LLACAKHFAVHSGPEW---SRHSLNINNLDPRDLWETYLPAFKSL 231
Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTK 341
+++ +MC+Y +++ P C L Q+ R+EWG+K + +DC AV+ + +
Sbjct: 232 VQKSDVRQVMCAYQRLDDEPCCGNTQLLQRILRDEWGYKYMVVADCGAVSDFYTSHKVSS 291
Query: 342 THEDSAAGVLKAGMDINC--GTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
+A+ + AG D+ C + + A+ KG + E +I++ LLN+ + LG +
Sbjct: 292 DAVHAASKGVWAGTDVECQWDNHIYKQLPDAVAKGLITEAEINKHLLNVLIGRFDLGEMD 351
Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
D + K+ V EH+KLALD A + + LL+N LPL+K+ +A++GP
Sbjct: 352 DDALV-PWSKIPMSVVNNEEHRKLALDMALKSMTLLQNKNNILPLSKS--KKIAVVGPNA 408
Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVS--KTHYASGC 496
N+ + G Y G P ++L G+ + +S K Y GC
Sbjct: 409 NDKPMLWGNYNGTPVRTITILDGITSKLSANKVLYEKGC 447
Score = 79.3 bits (194), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 61/227 (26%), Positives = 100/227 (44%), Gaps = 48/227 (21%)
Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
DR + LP Q + + ++ K+ + + +G ++ +IL Y GE+G
Sbjct: 638 DRTDIELPAVQRNCLKALKEAGKQVIFVNCSGSA---IALIPETESCDAILQAWYGGESG 694
Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
+A+A+++FGD+NP G+LP+T+Y D +M+ GR+YR Y ++
Sbjct: 695 GQAVADVLFGDYNPSGKLPITFYKSVKQLSDFEDYSMK--------GRTYR-YMSDPLFP 745
Query: 656 FGHGLSYTNYSY--KFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
FG GLSYT ++ LS P E+ S S+
Sbjct: 746 FGFGLSYTTFAIGNAKLSNP-------------------------------EIKSNQSV- 773
Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVA 760
+ I V+N G G+ +V ++ R+ + G P K L GF RV A
Sbjct: 774 -ELTIPVSNKGKRSGTEIVQVYVRMVNDTDG-PLKTLKGFQRVEVAA 818
>gi|296081550|emb|CBI20073.3| unnamed protein product [Vitis vinifera]
Length = 309
Score = 253 bits (646), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 124/270 (45%), Positives = 168/270 (62%), Gaps = 38/270 (14%)
Query: 146 AVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGE 205
V+ EARAMYN+G AGLTFW+PN+NIF+DPRWGRGQETPGEDP++ S YA +V+ Q
Sbjct: 78 VVSTEARAMYNVGLAGLTFWSPNVNIFQDPRWGRGQETPGEDPLLSSKYASGYVRGLQ-- 135
Query: 206 NWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM 265
+SDDG D L ++ACCKH AYDL+ W + FNA+
Sbjct: 136 --QSDDG--------------------SPDRLKVAACCKHYTAYDLDNWKGVDCFHFNAV 173
Query: 266 ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITS 325
+T QD +DTFQPPF+SC+ G + +MCSYNQ L K + Y+ S
Sbjct: 174 VTNQDMDDTFQPPFKSCVIDGNVASVMCSYNQFY---------LIYKYHLQ-----YLVS 219
Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRAL 385
DCD+V + Q+YTKT E++ A + AG+D+NCG+ + +HT++A+ G V E +D+A+
Sbjct: 220 DCDSVDVFYNSQHYTKTPEEAVAKAILAGLDLNCGSFLGQHTEAAVKGGLVDESAVDKAV 279
Query: 386 LNLFSVQLRLGLFNGDPRKGKYGKLGPDDV 415
N F+ +RLG F+G+P K YGKLGP DV
Sbjct: 280 SNNFATLMRLGFFDGNPSKAIYGKLGPKDV 309
>gi|110740481|dbj|BAF02134.1| xylosidase [Arabidopsis thaliana]
Length = 284
Score = 253 bits (646), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 127/287 (44%), Positives = 184/287 (64%), Gaps = 12/287 (4%)
Query: 525 GLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISS 584
GLD + E E RDR LLLPG Q LVT VA+ S+ PVILVL GGP+DV+FA+ D ++++
Sbjct: 2 GLDQSIEAETRDRTGLLLPGYQQDLVTRVAQASRGPVILVLMSGGPIDVTFAKNDPRVAA 61
Query: 585 ILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGR 643
I+W GYPG+AG A+A IIFG NPGG+LPMTWYP+ + KVPM M MRA S YPGR
Sbjct: 62 IIWAGYPGQAGGAAIANIIFGAANPGGKLPMTWYPQDYVAKVPMTVMAMRA--SGNYPGR 119
Query: 644 SYRFYTGTQVYGFGHGLSYTNYSYKFLSAP-SELTISASLKAGSDKNILQQTGSRLDYVH 702
+YRFY G V+ FG GLSYT +++ +P ++L++S S S IL + + H
Sbjct: 120 TYRFYKGPVVFPFGFGLSYTTFTHSLAKSPLAQLSVSLS-NLNSANTILNSSSHSIKVSH 178
Query: 703 IDEVTSCTSL-RFHVQISVTNAGDVDGSHVVMLFARVP--KVSQGTPEKQLIGFDRVHTV 759
T+C S + + + V+N G+ DG+H V +FA P + KQLI F++VH +
Sbjct: 179 ----TNCNSFPKMPLHVEVSNTGEFDGTHTVFVFAEPPINGIKGLGVNKQLIAFEKVHVM 234
Query: 760 AKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
A + + VD C+ L + +++G+R +P+G H L +G+L+H++ ++
Sbjct: 235 AGAKQTVQVDVDACKHLGVVDEYGKRRIPMGEHKLHIGDLKHTILVQ 281
>gi|410634080|ref|ZP_11344720.1| beta-glucosidase [Glaciecola arctica BSs20135]
gi|410146740|dbj|GAC21587.1| beta-glucosidase [Glaciecola arctica BSs20135]
Length = 772
Score = 253 bits (645), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 181/622 (29%), Positives = 310/622 (49%), Gaps = 85/622 (13%)
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
+AP +++ RDPRWGR E GED + +A A V+ FQG+
Sbjct: 176 FAPMVDVARDPRWGRISEGSGEDVYLTTAIARARVQGFQGD------------------- 216
Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIE 284
D + A KH AY G R +++++ DT+ PPF++ ++
Sbjct: 217 -------DLSQPHTILATAKHFAAY---GQGQAGRDYHTTDMSDRELRDTYLPPFKAAVD 266
Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTH 343
G + M S+N++NGVPA L R+EW F+G++ +D ++ + ++ + + +
Sbjct: 267 AG-VTSFMTSFNELNGVPASANKYLLTDILRDEWSFEGFVVTDYTSINEMVKH-GFARDN 324
Query: 344 EDSAAGVLKAGMDINC-GTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDP 402
+ + +KAG+D++ G+ + + + +GKV + ID A + ++ RLGLF DP
Sbjct: 325 DHAGELAVKAGVDMDMQGSVYFDYLANQVTQGKVSPQQIDNAARRILEMKYRLGLFE-DP 383
Query: 403 RKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNI 462
+ + ++ + + A D AR+ +VLLKN+ + LPL+K+ ++ +A+IGPL ++
Sbjct: 384 YRYSNEEREAQEIYKEYNLQAAQDVARKSMVLLKNENQQLPLSKSDLT-IAVIGPLADSK 442
Query: 463 SQMGGGYTGIPC---SPKSLLRGLEAYV---SKTHYASGC-HDVPCNSDAGFHEAVRIAK 515
+ G ++ P +LL G++A V SK YA G ++ ++GF A+ IAK
Sbjct: 443 EDLIGSWSAAGDRYEKPITLLTGIKAKVADPSKVLYAKGASYEFSHQDNSGFEAAIAIAK 502
Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
KAD +++ G E R SL PG Q++L+ + + +K P++LVL G P+ + +
Sbjct: 503 KADVIVLAMGEKWDMTGEATSRTSLDFPGNQLALMQQLKKLAK-PMVLVLMNGRPMTIEW 561
Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMR-- 633
A D + +IL YPG G A+A+++FGD+NP G+LP+T +P + ++P+ NM+
Sbjct: 562 A--DQNVDAILEAWYPGTMGGPAIADVLFGDYNPSGKLPVT-FPRNVGQIPLY-YNMKNT 617
Query: 634 -----ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDK 688
D++ Q Y T +Y FGHGLSYT + Y S+++++ ++ +K
Sbjct: 618 GRPYSKDNAEQKYVSRYIDSLNTPLYHFGHGLSYTTFDY------SKISLNKAVITAKEK 671
Query: 689 NILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEK 748
I VTN+G+ DG VV L+ R S P K
Sbjct: 672 -------------------------LTASIDVTNSGNYDGEEVVQLYIRDRIGSVTRPVK 706
Query: 749 QLIGFDRVHTVAKGSKEISFGV 770
QL GF ++ +K +SF +
Sbjct: 707 QLKGFKKIFLHKGETKTVSFSI 728
>gi|160884749|ref|ZP_02065752.1| hypothetical protein BACOVA_02738 [Bacteroides ovatus ATCC 8483]
gi|156109784|gb|EDO11529.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus ATCC 8483]
Length = 800
Score = 253 bits (645), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 211/735 (28%), Positives = 340/735 (46%), Gaps = 127/735 (17%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ + T FP A++N+ L I A
Sbjct: 151 RLGIPV-DFTNEGIRGLCHD-----------RATMFPAQCGQGATWNKKLIGEIAKVTAD 198
Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
EA+A+ G T ++P ++I +DPRWGR E+ GEDP +V + + Q E
Sbjct: 199 EAKAL------GYTNIYSPILDIAQDPRWGRVVESYGEDPYLVGELGKQMILGLQNEG-- 250
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
+ A KH Y + G + +
Sbjct: 251 ------------------------------IVATPKHFAVYSIPVGGRDGGTRTDPHVAP 280
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDC 327
++ + + PFR I++ A +M SYN +G P + R +WGFKGYI SD
Sbjct: 281 REMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYIVSDS 340
Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTC----MLRHTQSAIDKGKVQEKDIDR 383
+AV + T T E+ AA V+ AG++I + + AI++GKV +D+
Sbjct: 341 EAVEFLHTKHRITPTEEEMAAQVVNAGLNIRTNFTPPQDFILPLRRAINEGKVSLHTLDQ 400
Query: 384 ALLNLFSVQLRLGLFN----GDPRKGKYGKLGPDDVCTSE-HKKLALDAARQGIVLLKND 438
+ + V+ +GLF+ GD R+ P+ V ++ HK +++ AA + IVLLKN+
Sbjct: 401 RVGEILRVKFMMGLFDNPYPGDDRR-------PETVVHNDAHKAVSMKAALESIVLLKNE 453
Query: 439 KKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGC 496
+ LPL+KN S +A+IGP + ++ Y S K++ +G++ Y+ S+ YA GC
Sbjct: 454 NQMLPLSKN-FSKIAVIGPNGEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVRYAKGC 512
Query: 497 ------------HDVPCNS--DAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLL 542
++VP ++ A EAV +AK +D I+V G + E+ R +L L
Sbjct: 513 DIIDKYFPESELYNVPLDTQEQAMIQEAVELAKASDIAILVLGGNEKTVREEFSRTNLDL 572
Query: 543 PGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEI 602
G+Q L+ +V T K PVILV+ G +++ A+ I +I+ +PGE A+A++
Sbjct: 573 CGRQQQLLEAVYATGK-PVILVMVDGRAATINW--ANKYIPAIIHAWFPGEFMGDAIAKV 629
Query: 603 IFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
+FGD+NPGGRL +T +P+S ++P S+ + +Y FG+GLSY
Sbjct: 630 LFGDYNPGGRLAVT-FPKSVGQIPFAFPFKPGSDSKG------KVRVDGVLYPFGYGLSY 682
Query: 663 TNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTN 722
T + Y S+L IS + G +NI SCT V N
Sbjct: 683 TTFGY------SDLKISKPV-IGPQENI---------------TLSCT---------VKN 711
Query: 723 AGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
G G VV L+ R S T +K L GF+R+H + ++F + P + L + +K+
Sbjct: 712 TGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVNFTLTP-QDLGLWDKN 770
Query: 783 GRRILPLGNHVLMVG 797
+ + G+ +MVG
Sbjct: 771 NQFTVEPGSFSVMVG 785
>gi|255532174|ref|YP_003092546.1| glycoside hydrolase family protein [Pedobacter heparinus DSM 2366]
gi|255345158|gb|ACU04484.1| glycoside hydrolase family 3 domain protein [Pedobacter heparinus
DSM 2366]
Length = 799
Score = 253 bits (645), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 195/735 (26%), Positives = 337/735 (45%), Gaps = 124/735 (16%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ + T FP L ++NR+L +G
Sbjct: 147 RLGIPV-DFTNEGIRGVEA-----------YEATGFPTQLNMGMTWNRNLIRKMGRITGQ 194
Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
EARA+ G T +AP +++ RD RWGR +E GEDP +V+ VE Q N
Sbjct: 195 EARAL------GYTNVYAPILDVARDQRWGRLEEVYGEDPYLVARLGVEMTLGMQENN-- 246
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
+++ KH Y K + ++
Sbjct: 247 -----------------------------QIASTAKHFAVYSANKGAREGLARTDPQVSP 277
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGD---LFQKARNEWGFKGYITS 325
++ ED PF+ I++ +M SYN NG+P + G L Q+ R ++GF GY+ S
Sbjct: 278 REVEDIMLYPFKKVIQEAGIMGVMSSYNDYNGIP--ITGSEYWLTQRLRKDFGFGGYVVS 335
Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDI 381
D DA+ ++ + +++ AG+++ ++ + + +++G++ + I
Sbjct: 336 DSDALEYLYNKHHVAANLKEAVFQAFMAGLNVRTTFRPPDSIIIYARQLVNEGRIPIETI 395
Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF 441
+ + ++ V+ +LGLF D K V + H+ +AL A+++ IVLLKN+ +
Sbjct: 396 NSRVKDVLRVKFKLGLF--DQPYVKDAAASEKLVNSIAHQAVALQASKESIVLLKNNNQI 453
Query: 442 LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGCHDV 499
LPL++ ++ +A+IGP + Y + ++L G+ + K YA GC V
Sbjct: 454 LPLSR-SLKKIAVIGPNAADNDYAHTHYGPLQSKSTNILEGIRNKIGADKVWYAKGCELV 512
Query: 500 ----------PCNSDAG----FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQ 545
P + DA +AV A KAD IVV G + E++ R +L LPG
Sbjct: 513 DKNWPESEIFPEDPDATAIALIEDAVNTAMKADVAIVVLGGNTKTAGENKSRTTLELPGF 572
Query: 546 QMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFG 605
Q++L+ ++ +T K PV+ V+ G P+ +++ D I I++ GYPG G A+A+++FG
Sbjct: 573 QLNLIKAIQKTGK-PVVAVMIGTQPMGINWI--DKYIDGIVYAGYPGVKGGIAVADVLFG 629
Query: 606 DFNPGGRLPMTWYPESFTKVPMN---DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
D+NPGG+L +T +P+S ++P+N N + D + +Y FG GLSY
Sbjct: 630 DYNPGGKLTLT-FPKSVGQLPLNFPSKPNAQTDEGELAKIKGL-------LYPFGFGLSY 681
Query: 663 TNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTN 722
T ++Y S L IS ++ D NI + + +TN
Sbjct: 682 TTFAY------SNLKISP-IEQEKDGNI------------------------SISVDITN 710
Query: 723 AGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
++G +V L+ R + T EK L GF+R+ +K + F + P + L + N+
Sbjct: 711 TAKLEGDEIVQLYIRDVLSTVTTYEKILRGFERISLKPNETKTLKFTLFP-DDLKLWNRE 769
Query: 783 GRRILPLGNHVLMVG 797
+ ++ G +M+G
Sbjct: 770 MQHVIEPGTFKVMIG 784
>gi|423212159|ref|ZP_17198688.1| hypothetical protein HMPREF1074_00220 [Bacteroides xylanisolvens
CL03T12C04]
gi|392695047|gb|EIY88272.1| hypothetical protein HMPREF1074_00220 [Bacteroides xylanisolvens
CL03T12C04]
Length = 768
Score = 252 bits (644), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 240/840 (28%), Positives = 376/840 (44%), Gaps = 150/840 (17%)
Query: 12 ERILFRVLQLIVVVNVIAFSNSKPVLNK--PDFPCKP-----------PHFD----SYPF 54
+R L +L +I V ++A S+ + +NK + K P+ D + P
Sbjct: 11 KRFLVLILAMIPVCGMMAQSSQESKMNKFVDNLMSKMTIEEKIGQLNMPNIDGAVITGPV 70
Query: 55 CNTSLSISTRAKSLISLLTLQE--KIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
NT+ R + +LL LQ+ KI+++ A R IP + + +HG
Sbjct: 71 KNTNTGERLRKNEVGALLNLQDVKKIREVQQLALEHSRFKIPLL-FGMDVIHGYK----- 124
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINI 171
T FP L AA++N + AVEA A G+ + ++P +++
Sbjct: 125 ----------TIFPIPLAQAATWNMDMVEEAARISAVEASA------NGICWTFSPMVDV 168
Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
RD RWGR E GEDP + A +V+ +QG
Sbjct: 169 TRDARWGRIAEGAGEDPWLGGEIAKAYVRGYQG--------------------------- 201
Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASC 290
D + AC KH Y G +N + ++ Q + + P+ + +E G S
Sbjct: 202 DLSANTNILACVKHFALYGAPDAGK----DYNTVDMSRQRMFNEYMLPYLAAVEAGVGSA 257
Query: 291 IMCSYNQVNGVPACL-RGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
M ++N+V G+PA R L R +WGF G++ SD DAV + E+ ++ +A
Sbjct: 258 -MSAFNEVEGIPAAANRWLLTDLLRGQWGFDGFVVSDWDAVRELTEHG--IGNMQEVSAR 314
Query: 350 VLKAGMDINCGTCMLRHT-QSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
L AG+D++ + L T + + D+GKV E+ I+ A + + +LGLF DP K
Sbjct: 315 ALIAGLDMDMASEGLVSTLKKSFDEGKVTEEQINTACRRILVAKYKLGLFK-DPFKYCNE 373
Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
+ D+ T EH++ A + A + VLLKN ++ LPL K+ +A+IGPL +N M G
Sbjct: 374 ERAKKDLFTPEHRRAAREMAAESFVLLKNKERLLPLQKSG--KIAVIGPLADNRRNMLGT 431
Query: 469 YT--GIPCSPKSLLRGLEAYV---SKTHYASG---CHDVPCNSDA-----GF-------- 507
+T +P S+L G+ V ++ +YA G +D A GF
Sbjct: 432 WTVSADLDTPVSVLEGIREAVGDKAEINYARGSNLTYDEELEKRASHWGKGFPRDGRTDS 491
Query: 508 ---HEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILV 564
EA+ IA ++D +I V G E RV L +P Q L+ + T K PV+LV
Sbjct: 492 QLQREALDIANQSDIIIAVMGEAAEMSGESTSRVDLNIPDAQKDLLKKLVDTGK-PVVLV 550
Query: 565 LTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK 624
L G PL + + E + + +IL + +PG A+A+++FGD NP G+L T +P S +
Sbjct: 551 LFAGRPLTLVWEEEN--VPAILNVWFPGTEAGNAVADVLFGDVNPSGKLTAT-FPRSVGQ 607
Query: 625 VPMN---DMNMRADSSRQYPGRSYRF----YTGTQVYGFGHGLSYTNYSYKFLSAPSELT 677
VP++ RA S++ P YR T +Y FG+GLSYT + Y LS
Sbjct: 608 VPISYSYKHTGRA-PSKEKPSEKYRTGYIDETYEPLYPFGYGLSYTQFEYGELSL----- 661
Query: 678 ISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFAR 737
DK ++ T E + + I+VTN G VDG +V L+ R
Sbjct: 662 ---------DKEVINNT----------EFLTAS-------ITVTNKGTVDGKEIVQLYLR 695
Query: 738 VPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
S P K+L G+++++ A SK + F + P E L N + + G +MVG
Sbjct: 696 DVVGSVTRPLKELKGYEKIYLKAGESKTVRFRITP-EMLKFYNYNIEYVYEPGEFDVMVG 754
>gi|224535195|ref|ZP_03675734.1| hypothetical protein BACCELL_00056 [Bacteroides cellulosilyticus
DSM 14838]
gi|224523186|gb|EEF92291.1| hypothetical protein BACCELL_00056 [Bacteroides cellulosilyticus
DSM 14838]
Length = 733
Score = 252 bits (644), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 210/742 (28%), Positives = 351/742 (47%), Gaps = 108/742 (14%)
Query: 67 SLISLLT---LQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
SLI L T L+ +IQ+ + S RLGIP + + +HG+ T
Sbjct: 79 SLIYLHTDPKLRNRIQRKAMEES---RLGIPIL-FGFDVIHGLR---------------T 119
Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQET 183
+P L A SFN L + A E+ L TF +P I++ RDPRWGR E
Sbjct: 120 VYPISLAQACSFNPDLVTQACGMAAKESV----LSGIDWTF-SPMIDVARDPRWGRISEC 174
Query: 184 PGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACC 243
GEDP + + + V VK +QGE D ++AC
Sbjct: 175 YGEDPYLNTVFGVASVKGYQGEKL--------------------------SDPYSIAACL 208
Query: 244 KHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPA 303
KH + Y + + G RY+ I+ Q +T+ PP+ +C++ G A+ +M S+N ++GVPA
Sbjct: 209 KHYVGYGVSEGGRDYRYT---DISPQALWETYLPPYEACVKAG-AATLMSSFNDISGVPA 264
Query: 304 CLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINC-GT 361
+ + +N+W G++ SD +A+ + YQ K +++A AG++++
Sbjct: 265 TSNHYILTEILKNKWRHDGFVVSDWNAIEQLI-YQGVAKNRKEAAYKAFHAGVEMDMRDN 323
Query: 362 CMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHK 421
+ + + + K++ ID A+ + V+ RLGLF+ +P + + + E
Sbjct: 324 VYYEYLEQLVAEKKIEISQIDDAVARILRVKFRLGLFD-EPYTKELTE--QERYLQKEDI 380
Query: 422 KLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG--YTGIPCSPKSL 479
LA A + +VLLKN+K LPL+ + V +A+IGP+V + S + G + G +++
Sbjct: 381 ALAARLAEESMVLLKNEKNLLPLS-STVKRVALIGPMVKDRSDLLGAWAFKGQAEDVETI 439
Query: 480 LRGLEAYVS---KTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRD 536
G++ + Y GC + N ++GF A++ A+ +D V+V G E+
Sbjct: 440 YEGMQKEFGDKVRLDYEQGCA-LDGNDESGFSAALKTAEASDVVVVCLGESKQWSGENAS 498
Query: 537 RVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGA 596
R ++ LP Q L+ + + +K P++LVL+ G PL++ + Q+ +I+ + PG AG
Sbjct: 499 RSTIALPDIQEKLLLHLKQANK-PIVLVLSSGRPLEL--IRLEPQVEAIIEMWQPGVAGG 555
Query: 597 KALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGR-SYRFYTGTQVYG 655
LA I+ G NP G+L +T +P S ++P+ NMR S+R + Y+ +Y
Sbjct: 556 TPLAGILSGRVNPSGKLSVT-FPLSTGQIPVY-YNMR-QSARPFDAMGDYQDIPTEPLYP 612
Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
FG+GLSYT ++Y S+ +S SLK ++ I
Sbjct: 613 FGYGLSYTTFTY------SDAKLS-SLKIKKNQKI------------------------T 641
Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
+++VTNAG V+G V+ + P S P K+L F++ S+ F +DP
Sbjct: 642 AEVTVTNAGKVEGKETVLWYVSDPFCSISRPMKELKFFEKQSLKVGESRVFRFEIDPMRD 701
Query: 776 LSIANKHGRRILPLGNHVLMVG 797
LS + G+R L G ++ VG
Sbjct: 702 LSYTDATGKRFLEAGEFIVSVG 723
>gi|224025503|ref|ZP_03643869.1| hypothetical protein BACCOPRO_02243 [Bacteroides coprophilus DSM
18228]
gi|224018739|gb|EEF76737.1| hypothetical protein BACCOPRO_02243 [Bacteroides coprophilus DSM
18228]
Length = 787
Score = 252 bits (644), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 217/747 (29%), Positives = 334/747 (44%), Gaps = 138/747 (18%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP + E HG + G T FP + A+++N SL +G + +
Sbjct: 125 RLGIPVL-FAEECPHGHMAIG-----------ATVFPTSMGQASTWNESLIRQMGEVIGL 172
Query: 150 EARAM-YNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
EAR N+G + P ++I R+PRW R +ET GEDP + FV+ QG+++K
Sbjct: 173 EARLQGANIG------YGPVLDIAREPRWSRVEETFGEDPYLTGILGTAFVQGMQGKDFK 226
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNF---SRYSFNAM 265
D + + KHL AY + + G+ + A+
Sbjct: 227 --------------------------DGRHVYSTLKHLAAYGVPRGGHNGGPADMGLRAL 260
Query: 266 ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYIT 324
+ D + P F+ +E GKA+ +M SYN ++GVP L R WGF G++
Sbjct: 261 L------DEYLPGFQRAVEVGKAATVMTSYNSIDGVPCTSNKFLIDSLLRKRWGFDGFVY 314
Query: 325 SDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRA 384
SD ++ I + ED+A ++AG D++ G R A+ GKV+E I+RA
Sbjct: 315 SDLASIDGIAG-AHVAANLEDAAIQAVEAGTDMDLGANAYRRLVKAVQTGKVKESAINRA 373
Query: 385 LLNLFSVQLRLGLFNGDPRKGKYGKLGPDD----VCTSEHKKLALDAARQGIVLLKNDKK 440
+ N+ ++ R+GLF + + P++ V +H+ LA AR+G VLLKN+
Sbjct: 374 VSNVLRLKFRMGLF-------EQPYVSPEEAARLVNCEDHRMLARKIAREGTVLLKNN-G 425
Query: 441 FLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPK--SLLRGLEAYVS--KTHYASGC 496
LPL K V +A+IGP + + G YT K +LL L + + Y GC
Sbjct: 426 ILPLGK--VKRIAVIGPNADVMYNYLGDYTAPQERSKVVTLLDALRNRMPDVRIDYVKGC 483
Query: 497 HDVPCNSDAGFHEAVRIAKKADFVIVVAG--------------------------LDLTQ 530
+ + + EAV A+KAD VI+ G L +
Sbjct: 484 A-IRDTTQSNIKEAVEAARKADLVILAVGGSSARDFKTKYINTGAATVDSENSGILSDME 542
Query: 531 ETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGY 590
E DR +L L G Q L+ ++A T K P++ V G PL+++ A S W Y
Sbjct: 543 CGEGFDRATLDLLGDQEKLIRAIAATEK-PLVTVYIAGRPLNMNLASEVSDALLTAW--Y 599
Query: 591 PGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTG 650
PGE G + +++ G++NP GRLPM+ P ++P++ Q R Y G
Sbjct: 600 PGEQGGNGIVDVLTGEYNPSGRLPMS-VPRHVGQIPVH--------YSQGTLRDYMDCPG 650
Query: 651 TQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCT 710
+Y FG+GLSYT ++Y S L +SA+ KA S Q G + + +CT
Sbjct: 651 KPLYTFGYGLSYTTFAY------SNLKLSATAKAAS-----QPAGDN----EVMQTITCT 695
Query: 711 SLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGV 770
VTN GD DG VV L+ S P +L GF ++ S+E++F +
Sbjct: 696 ---------VTNTGDRDGDEVVQLYLNDEVSSVAVPPIRLKGFQKIFLKKGESREVTFQL 746
Query: 771 DPCEQLSIANKHGRRILPLGNHVLMVG 797
+ LSI +++ G +M+G
Sbjct: 747 TR-QDLSIYDRNMNFTAEPGRFNVMIG 772
>gi|198274480|ref|ZP_03207012.1| hypothetical protein BACPLE_00628 [Bacteroides plebeius DSM 17135]
gi|198272682|gb|EDY96951.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
plebeius DSM 17135]
Length = 912
Score = 252 bits (644), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 209/754 (27%), Positives = 333/754 (44%), Gaps = 125/754 (16%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIPA ++ +E + G+ + T+FP L ++NR L IG
Sbjct: 118 RLGIPA-DFTNEGIRGVEN-----------YIATNFPTQLALGHTWNRELIRQIGYITGR 165
Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
EAR + G T +AP +++ RD RWGR +E GE P +V+ + K Q
Sbjct: 166 EARLL------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIAMGKGLQ----- 214
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
++ +++ KH IAY K + ++
Sbjct: 215 --------------------------TDMQVASTAKHFIAYSNNKGAREGFARVDPQMSW 248
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVP-ACLRGDLFQKARNEWGFKGYITSDC 327
++ E+ PF I++ +M SYN +G P L Q+ R GF+GY+ SD
Sbjct: 249 REVENIHAYPFTRVIQEAGILGVMSSYNDYDGFPIQSSYYWLTQRLRGTMGFRGYVVSDS 308
Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDIDR 383
DAV ++ K +++ ++AG+++ C + + I +G + + ID
Sbjct: 309 DAVEYLYSKHKTAKDMKEAVRQSVEAGLNVRCTFRSPESYVLPLRELIQEGGLSMETIDN 368
Query: 384 ALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLP 443
+ ++ V+ GLF+ P + L +V + H+++AL A+R+G+VLLKN LP
Sbjct: 369 RVRDILRVKFLTGLFD-TPYQTDLA-LADKEVNSEAHQQVALQASREGLVLLKNANNLLP 426
Query: 444 LNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCHDVP 500
L+K+ + +A+ GP + S Y + ++L G++ V +K Y GC V
Sbjct: 427 LDKSQIKRIAVCGPNADEASFALTHYGPVAVEVTTVLEGIKQQVKEGTKVTYTKGCDLVD 486
Query: 501 CN--------------SDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQ 546
N +AV K++D +VV G + E++ R SL LPG Q
Sbjct: 487 ANWPESEIISYPLTAEEKTEIQKAVDNVKESDVAVVVLGGGIRTCGENKSRTSLDLPGHQ 546
Query: 547 MSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD 606
L+ ++ T K PV+LVL G PL +++ AD + +IL YPG G A+AE +FGD
Sbjct: 547 QQLLEAIVATGK-PVVLVLINGRPLSINW--ADKFVPAILEAWYPGSQGGTAIAEALFGD 603
Query: 607 FNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQ------VYGFGHGL 660
+NPGG+L +T +P++ ++P N A + Q G G Q +Y FG+GL
Sbjct: 604 YNPGGKLTVT-FPKTVGQIP---FNFPAKPASQVDGGQTPGMKGNQSRINGPLYPFGYGL 659
Query: 661 SYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISV 720
SYT + Y L S + +DK + VT V +
Sbjct: 660 SYTTFEYSNLQLSSPVI--------TDK---------------EPVT--------VTCKI 688
Query: 721 TNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIAN 780
N G G VV L+ R S T EK L GF+RVH +K++SF + P + + N
Sbjct: 689 KNTGTRSGDEVVQLYTRDVISSVTTYEKNLRGFERVHLEPGETKKVSFQLLPRD-FQLLN 747
Query: 781 KHGRRILPLGNHVLMVG------ELRHSLTIETY 808
K ++ G +M+G L+ L I Y
Sbjct: 748 KDNHWVVEPGMFQIMIGASSEDIRLKKGLEIRAY 781
>gi|298386950|ref|ZP_06996504.1| beta-glucosidase [Bacteroides sp. 1_1_14]
gi|298260100|gb|EFI02970.1| beta-glucosidase [Bacteroides sp. 1_1_14]
Length = 846
Score = 252 bits (644), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 161/447 (36%), Positives = 225/447 (50%), Gaps = 57/447 (12%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ N + I R + L+S LT++EKI L + I R+GI Y +E+LHGI G
Sbjct: 23 YKNMNAPIHERIQDLLSKLTIEEKISLLRATSPGIERMGIDKYYMGNEALHGIIRPG--- 79
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG----------LT 163
T FPQ + A+ +N L I S ++ EARA +N + G LT
Sbjct: 80 -------KFTVFPQAIGLASMWNPELHHIIASVISDEARARWNELERGKKQKDQFSDLLT 132
Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
FW+P +N+ RDPRWGR ET GEDP + FVK QG++ R
Sbjct: 133 FWSPTVNMARDPRWGRTPETYGEDPYLSGVLGTAFVKGLQGDH--------------PRY 178
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
LK + KH A + E +R+ +A ITE D + + P F CI
Sbjct: 179 LKSV-------------STPKHFAANNEEH----NRFYCDAAITETDMREYYLPAFEKCI 221
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
+GKA IM +YN +NGVP L K + +WGF GYI SDC A + Y KT
Sbjct: 222 REGKAESIMTAYNAINGVPCTANNWLLNKVLKQDWGFNGYIVSDCGAPGLLMTDHRYVKT 281
Query: 343 HEDSAAGVLKAGMDINCGTCML-RHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
E +A +KAG+D+ CG + +A + V +ID A ++ ++RLG+F+ D
Sbjct: 282 PEAAAMIAIKAGLDLECGDYVFGAPLLNAYKQYMVSTAEIDSAAYHVLRARMRLGMFD-D 340
Query: 402 PRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
P K Y L P+ V +HK+LAL+AARQ IVLLKN K LPLN + S+A++G +N
Sbjct: 341 PEKNPYNHLSPEIVGCEKHKELALEAARQSIVLLKNQKNTLPLNAKKIKSIAVVG--INA 398
Query: 462 ISQMGGGYTGIPC-SPKSLLRGLEAYV 487
+ G Y+G P +P S+L G+ V
Sbjct: 399 ANCEFGDYSGTPVNAPVSVLDGIRNRV 425
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 89/302 (29%), Positives = 151/302 (50%), Gaps = 49/302 (16%)
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
+ +A ++ +++D VI V G++ + E E +DR S+ LP Q + + + I+VL
Sbjct: 587 YGDASKVIRESDVVIAVMGINQSIEREGQDRSSIELPKDQQIFIREAYKANPN-TIVVLV 645
Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
G + V + D I +I+ YPGE G A+AE++FGD+NP GRLP+T+Y S +P
Sbjct: 646 AGSSMAVGWM--DQNIPAIIDAWYPGEQGGTAIAEVLFGDYNPAGRLPLTFY-NSIEDLP 702
Query: 627 -MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAG 685
ND N++ + R+Y ++ G +Y FG+GLSYT + Y+ L
Sbjct: 703 AFNDYNVKNN-------RTYMYFEGKPLYAFGYGLSYTKFDYRNL--------------- 740
Query: 686 SDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT 745
+I + + +L F SV N+G +G V ++ + P + T
Sbjct: 741 ----------------NIKQDSQNITLNF----SVKNSGKYNGDEVAQVYVQFPDLGIKT 780
Query: 746 PEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRIL-PLGNHVLMVGELRHSLT 804
P KQL GF RVH +++IS + P E+L + + ++ P G + MVG+ ++
Sbjct: 781 PLKQLKGFKRVHIKKGATEQISIEI-PKEELRLWDDQKKQFYTPSGTYNFMVGKSSDNIC 839
Query: 805 IE 806
++
Sbjct: 840 LQ 841
>gi|294674604|ref|YP_003575220.1| family 3 glycosyl hydrolase [Prevotella ruminicola 23]
gi|294474051|gb|ADE83440.1| glycosyl hydrolase, family 3 [Prevotella ruminicola 23]
Length = 869
Score = 252 bits (644), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 155/474 (32%), Positives = 239/474 (50%), Gaps = 74/474 (15%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
YP+ +T LS R ++L+SLLT +EK+ + + + +I RLGIP+Y WWSE+ HG+ +G
Sbjct: 23 YPYLDTKLSNKERVENLLSLLTPEEKVGLMMNKSISIDRLGIPSYNWWSEACHGVRQDG- 81
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL-------------- 157
T +PQ + AA+FN L+ ++ S V+ EARA +N
Sbjct: 82 ----------YTVYPQPIGMAAAFNAQLFYDVFSQVSDEARANWNRTDHNDPKLFNVPMG 131
Query: 158 -----GQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDG 212
G LTFW PN+NIFRDPRWGRGQET GEDP + + V+ V QG N K
Sbjct: 132 VTYYPGNPELTFWCPNVNIFRDPRWGRGQETCGEDPYLNAVLGVQTVLGMQGNNDKY--- 188
Query: 213 GIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTE 272
AC KH Y + R+S N T +D
Sbjct: 189 ------------------------FKTHACAKH---YAVHSGPEPLRHSMNVEPTNRDLW 221
Query: 273 DTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVA 331
+T+ P F++ +++G +MC+Y + G P C L RN+WG+ + +DCDA+
Sbjct: 222 ETYLPAFKALVKKGNVREVMCAYQRFEGKPCCTSDRLLIDILRNKWGYDAIVLTDCDAIN 281
Query: 332 TIFEYQNYTKTHEDSAAGVLKA---GMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
F + +TH+D + + A G D+ CG + + + KG ++E D+D L
Sbjct: 282 NFFNRGQH-ETHKDGLSASVDAVLNGTDLECGKVFMSLVE-GLKKGLIKESDLDNHLRKT 339
Query: 389 FSVQLRLGLFNGDPRKG-KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKN 447
+ LG+F DP + KLG D + + ++ +A+ AAR+ +VLL+N LPL+K
Sbjct: 340 LMGRFELGMF--DPADMLPWAKLGADVISSEKNDAMAVQAARESMVLLENKAGILPLSK- 396
Query: 448 AVSSLAIIGPLVNNISQMGGGYTGIPCSP--KSLLRGLEAYV--SKTHYASGCH 497
++ +LA++GP ++++ + G Y G P + SLL G++A V +K Y C
Sbjct: 397 SIKTLAVLGPNADDVNMLNGNYGGTPTAAHQHSLLSGIKAAVPGAKIIYNKACE 450
Score = 113 bits (283), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 78/265 (29%), Positives = 131/265 (49%), Gaps = 43/265 (16%)
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
F + K+AD ++++ G+ E E D+ + LP Q LV ++ +T K PVI V
Sbjct: 606 FTDLANQVKEADAIVIIGGISAQMEGEGGDKQDIELPKVQQMLVKAMHKTGK-PVIFVNC 664
Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
G ++F + + ++L Y G+ GAKALAE++FGD+NPGG+LP+T+Y +
Sbjct: 665 SGSA--IAFGSVEGEYDALLQAWYAGQGGAKALAEVLFGDYNPGGKLPVTFYRSN----- 717
Query: 627 MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGS 686
ND+ D S + R+YR++TG Y FG+GLSYT + A + ISA
Sbjct: 718 -NDLPDFLDYSMK--NRTYRYFTGVPQYAFGYGLSYTTF------ALGDAKISAKQ---- 764
Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
+++ G + + + VTN G +G+ V ++ + G P
Sbjct: 765 ----MKKNG-----------------KVTLTVPVTNTGKREGTETVQVYVKRLD-DAGAP 802
Query: 747 EKQLIGFDRVHTVAKGSKEISFGVD 771
K L GF +++ A +++ + +D
Sbjct: 803 IKALKGFQKLNLKAGETQKATITLD 827
>gi|255690202|ref|ZP_05413877.1| beta-glucosidase [Bacteroides finegoldii DSM 17565]
gi|260624221|gb|EEX47092.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
finegoldii DSM 17565]
Length = 853
Score = 252 bits (644), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 159/460 (34%), Positives = 231/460 (50%), Gaps = 60/460 (13%)
Query: 44 CKPPHFDSYP---FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWS 100
C P SY + N + + R LIS LT++EKI L + IPRLGI Y +
Sbjct: 16 CGAPCLFSYAQELYKNANAPVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGN 75
Query: 101 ESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA 160
E+LHG+ G T FPQ + AA++N L I + ++ EARA +N
Sbjct: 76 EALHGVVRPG----------RFTVFPQAIGLAATWNPELQKRIATVISDEARARWNELDQ 125
Query: 161 G----------LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSD 210
G LTFW+P +N+ RDPRWGR ET GEDP + FVK QG
Sbjct: 126 GRNQKEQFSDVLTFWSPTVNMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQG------ 179
Query: 211 DGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQD 270
D L + + KH A + E +R+ N I+E+
Sbjct: 180 ---------------------DDPHYLKIVSTPKHFAANNEEH----NRFVCNPQISEKQ 214
Query: 271 TEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDA 329
+ + P F C+++GKA+ IM +YN +N VP L L QK R +WGF+GY+ SDC
Sbjct: 215 LREYYFPAFEMCVKEGKAASIMTAYNALNNVPCTLNSWLLQKVLRRDWGFQGYVVSDCGG 274
Query: 330 VATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSAIDKGKVQEKDIDRALLNL 388
+ + Y KT E +A +KAG+D+ CG + + +A + E DID A ++
Sbjct: 275 PSLLVNAHKYVKTKEAAATLSIKAGLDLECGDDVYDEYLLNAYKQYMASEADIDSAAYHV 334
Query: 389 FSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNA 448
+ +++LGLF+G R Y K+ P + + EH+ +AL+AAR+ IVLLKN K LPLN
Sbjct: 335 LTARMKLGLFDGVER-NPYAKISPSVIGSKEHQTVALNAARECIVLLKNQKNMLPLNVKK 393
Query: 449 VSSLAIIGPLVNNISQMGGGYTGIP-CSPKSLLRGLEAYV 487
+ S+A++G +N G Y+G P P S+L+G++ V
Sbjct: 394 LKSIAVVG--INAGKCEFGDYSGAPVVEPVSILQGIKNRV 431
Score = 120 bits (301), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 81/301 (26%), Positives = 144/301 (47%), Gaps = 46/301 (15%)
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
+ EA + ++ + V+ V G++ + E E +DR + LP Q + + + + +ILVL
Sbjct: 593 YGEAGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNPN-IILVLV 651
Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
G L V++ + + +I+ YPGE G A+AE++FGD+NP GRLP+T+Y +S ++P
Sbjct: 652 AGSSLAVNWE--NEHLPAIVNAWYPGEQGGTAVAEVLFGDYNPAGRLPLTYY-KSLEQLP 708
Query: 627 MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGS 686
D D ++ GR+Y+++ +Y FG+GLSYT + Y L
Sbjct: 709 AFD---DYDITK---GRTYQYFKKDVLYPFGYGLSYTTFKYSNLK--------------- 747
Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
+D+ ++ F ++ N G G V ++ R+P+++ T
Sbjct: 748 ----------------VDDAGKTVNVSF----TLKNTGKRAGDEVAQVYVRLPEIAGSTQ 787
Query: 747 E-KQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
+QL GF RV A S+++ +D + K ++P G+ MVG + +
Sbjct: 788 AIRQLKGFRRVALKAGESRKVEITLDKEQLRYWDEKQACFVVPQGSFTFMVGASSGDIRL 847
Query: 806 E 806
E
Sbjct: 848 E 848
>gi|423346097|ref|ZP_17323785.1| hypothetical protein HMPREF1060_01457 [Parabacteroides merdae
CL03T12C32]
gi|409220895|gb|EKN13848.1| hypothetical protein HMPREF1060_01457 [Parabacteroides merdae
CL03T12C32]
Length = 955
Score = 252 bits (644), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 202/734 (27%), Positives = 333/734 (45%), Gaps = 113/734 (15%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ S T+FP L ++NR+L +G
Sbjct: 161 RLGIPT-DFTNEGIRGVES-----------YIATNFPTQLGLGHTWNRNLVHKVGYITGR 208
Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
E R + G T +AP +++ RD RWGR +E GE P +V+ +E K Q ++
Sbjct: 209 EGRLL------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMAKGMQTDH-- 260
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
++A KH IAY K G + ++
Sbjct: 261 -----------------------------QVAATSKHYIAYSNNKGGREGMARVDPQMSP 291
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVP-ACLRGDLFQKARNEWGFKGYITSDC 327
++ E P++ I++ +M SYN +G P L + R E+GF+GY+ SD
Sbjct: 292 REVEMIHVYPWKRVIKEAGILGVMSSYNDYDGFPIQSSYYWLTTRLRGEFGFRGYVVSDS 351
Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDIDR 383
DAV +F ++S + AG++I C + + I +G + ID
Sbjct: 352 DAVEYLFSKHGTAADMKESVLQSVLAGLNIRCTFRSPDSYVLPLRELIAEGAIPMSTIDD 411
Query: 384 ALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLP 443
+ ++ V+ +GLF+ P + + + C +E++++AL A+++ +VLLKN LP
Sbjct: 412 RVRDILRVKFLVGLFD-HPYQIDLKETDKEVNC-AENQQVALQASKESLVLLKNQDAVLP 469
Query: 444 LNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCHDVP 500
L+ N +S +A+ GP + + Y + ++L G+ V + + GC V
Sbjct: 470 LDVNKISKIAVCGPNADEEAYALTHYGPLAVEVTTVLEGIRNKVKPGTNVLFTKGCDLVD 529
Query: 501 CN--------------SDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQ 546
N + +AV AKK+D +VV G E++ R SL LPG+Q
Sbjct: 530 ANWPESELIRYPLTAEEQSEIDKAVENAKKSDVTVVVLGGSDRTCGENKSRSSLDLPGRQ 589
Query: 547 MSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD 606
+ L+ +V T K PV+L+L G PL +++A D + +IL YPG G A+A+ +FGD
Sbjct: 590 LDLLQAVVATGK-PVVLILINGRPLSINWA--DKYVPAILEAWYPGSQGGTAIADALFGD 646
Query: 607 FNPGGRLPMTWYPESFTKVPMN---DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYT 663
+NPGG+L +T +P++ ++P N N + D R +Y FG+GLSYT
Sbjct: 647 YNPGGKLTVT-FPKTVGQIPFNFPTKPNAQVDGGRNKGLDGNMSRVNGPLYPFGYGLSYT 705
Query: 664 NYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNA 723
+ Y S+++I ++ + +V T V+ VTN
Sbjct: 706 TFEY------SDISIQPAI--------------------VTQVQPVT-----VRCKVTNT 734
Query: 724 GDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHG 783
G G VV L+ R S T EK L+GFDR+H +KE++F ++P L + N
Sbjct: 735 GKRAGDEVVQLYVRDILSSVTTYEKNLVGFDRIHLNPGETKELTFTIEP-RDLQLLNSDN 793
Query: 784 RRILPLGNHVLMVG 797
++ G+ +MVG
Sbjct: 794 HWVVEPGDFKVMVG 807
>gi|423289663|ref|ZP_17268513.1| hypothetical protein HMPREF1069_03556 [Bacteroides ovatus
CL02T12C04]
gi|423298156|ref|ZP_17276215.1| hypothetical protein HMPREF1070_04880 [Bacteroides ovatus
CL03T12C18]
gi|392663697|gb|EIY57244.1| hypothetical protein HMPREF1070_04880 [Bacteroides ovatus
CL03T12C18]
gi|392667374|gb|EIY60884.1| hypothetical protein HMPREF1069_03556 [Bacteroides ovatus
CL02T12C04]
Length = 850
Score = 252 bits (643), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 152/449 (33%), Positives = 235/449 (52%), Gaps = 57/449 (12%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ N + + R L+S LT++EKI L + IPRLGI Y +E+LHG+ G
Sbjct: 27 YKNENAPVHERVADLLSRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPG--- 83
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG----------LT 163
T FPQ + AA++N L + + ++ EARA +N G LT
Sbjct: 84 -------RFTVFPQAIGLAATWNPVLQQKVATVISDEARARWNELDQGRNQKEQFSDVLT 136
Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
FW+P +N+ RDPRWGR ET GEDP + FVK QGE+ R
Sbjct: 137 FWSPTVNMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQGED--------------PRY 182
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
LK + + KH +A + E +R+ N I+E+ + + P F C+
Sbjct: 183 LK-------------IVSTPKHFVANNEEH----NRFICNPQISEKQLREYYFPAFEMCV 225
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
++GKA+ IM +YN +N VP L L QK R +WGF+GY+ SDC + + Y KT
Sbjct: 226 KKGKAASIMTAYNALNDVPCTLNAWLLQKVLRQDWGFRGYVVSDCGGPSLLVNAHKYVKT 285
Query: 343 HEDSAAGVLKAGMDINCGTCML-RHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
E +A +KAG+D+ CG + + +A + V + DID A ++ + +++LG+F+
Sbjct: 286 KETAATLSIKAGLDLECGDDVYDEYLLNAYKQYMVSDADIDSAACHVLAARMKLGMFDSK 345
Query: 402 PRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
R Y ++ P + + +H+++ALDAAR+ IVLLKN K LPLN + + S+A++G +N
Sbjct: 346 ER-NPYARISPSVIGSKDHQQVALDAARECIVLLKNQKNMLPLNVDKLKSIAVVG--INA 402
Query: 462 ISQMGGGYTGIPC-SPKSLLRGLEAYVSK 489
+ G Y+G P P S+L+G++ V +
Sbjct: 403 GTCEFGDYSGAPVIEPVSVLQGIKNRVGE 431
Score = 125 bits (314), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 87/302 (28%), Positives = 143/302 (47%), Gaps = 47/302 (15%)
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
+ EA + + + V+ V G++ + E E +DR + LP Q + + + + +I+VL
Sbjct: 591 YGEAGKAVSECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNPN-IIVVLV 649
Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
G L V++ D I +I+ YPGE G A+A+++FGD+NP GRLP+T+Y P
Sbjct: 650 AGSSLAVNWM--DEHIPAIVNAWYPGEQGGTAVADVLFGDYNPAGRLPLTYYKSLDELPP 707
Query: 627 MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGS 686
+D ++ GR+Y+++ G +Y FG+GLSY+++ Y S+L + S
Sbjct: 708 FDDYDITK-------GRTYKYFKGDVLYPFGYGLSYSSFKY------SDLKVKDS----- 749
Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
D+VT V + N G G V ++ R+P+ P
Sbjct: 750 ----------------TDKVT--------VSFRLKNTGRRKGDEVAQVYVRIPETGGIVP 785
Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRR-ILPLGNHVLMVGELRHSLTI 805
K+L GF RV S+ I +D EQL + + ILP G +MVG + +
Sbjct: 786 IKELKGFRRVPLEPGESRAIDIELDK-EQLRYWDTTKEQFILPAGTFDVMVGASSKDIRL 844
Query: 806 ET 807
+T
Sbjct: 845 QT 846
>gi|336415919|ref|ZP_08596257.1| hypothetical protein HMPREF1017_03365 [Bacteroides ovatus
3_8_47FAA]
gi|335939822|gb|EGN01694.1| hypothetical protein HMPREF1017_03365 [Bacteroides ovatus
3_8_47FAA]
Length = 782
Score = 252 bits (643), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 216/750 (28%), Positives = 336/750 (44%), Gaps = 135/750 (18%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP + E+ HG + G T FP + AA+++ L +G +A
Sbjct: 129 RLGIPMF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSPELVKEVGQVIAK 176
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
E R+ Q G + P +++ RDPRW R +ET GEDP++ V G N
Sbjct: 177 EIRS-----QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGILGASMVDGLGGGNLSQ 231
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
I A KH +AY + + G Y A + +
Sbjct: 232 KYATI--------------------------ATLKHFLAYAVPEGGQNGNY---ASVGIR 262
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCD 328
D F PPFR I+ G S +M SYN ++G+P +L Q RNEW F+G++ SD
Sbjct: 263 DLHQNFLPPFRKAIDAGALS-VMTSYNSIDGIPCTSNHNLLTQLLRNEWKFRGFVVSDLY 321
Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
++ I E T E++A + AG+D++ G + A+ G++ + ID A+ +
Sbjct: 322 SIEGIHESHFVAPTKENAAIQSVTAGVDVDLGGDAYTNLCHAVQSGQMDKAVIDTAVCRV 381
Query: 389 FSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
++ +GLF DP K+ V EH +LA A+ I LLKN+ LPL+
Sbjct: 382 LRMKFEMGLFEHPYVDP------KIAAKTVRRKEHIELARKIAQSSITLLKNENSILPLS 435
Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTGIP--CSPKSLLRGLEAYVS--KTHYASGC--HDV 499
K ++ +A+IGP +N M G YT + K++L G+ +S + Y GC D
Sbjct: 436 KT-INKVAVIGPNADNRYNMLGDYTAPQEDSNVKTVLDGIITKLSPSRVEYVRGCAIRDT 494
Query: 500 PCNSDAGFHEAVRIAKKADFVIVVAG----LDLT---QET----------------EDRD 536
N +A+ A++++ VIVV G D +ET E D
Sbjct: 495 TVNE---IEQAIEAARRSEVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFD 551
Query: 537 RVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGA 596
R SL L G+Q L+ S+ +T K P+I+V G PL+ ++A ++L YPG+ G
Sbjct: 552 RASLSLLGRQQELLESLQKTGK-PLIVVYIEGRPLEKNWA--SEYADALLTAYYPGQEGG 608
Query: 597 KALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGF 656
A+A+++FGD+NP GRLP++ P S ++P+ N +A + Y S + +Y F
Sbjct: 609 NAIADVLFGDYNPSGRLPIS-VPRSVGQIPVY-YNQKAPRNHDYVEVS-----SSPLYSF 661
Query: 657 GHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHV 716
G+G+SYT + Y SD ++Q++ C F V
Sbjct: 662 GYGMSYTTFEY------------------SDLQVVQKSA------------RC----FEV 687
Query: 717 QISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQL 776
V N G DG V L+ R S P KQL F+R H K+++F V E
Sbjct: 688 SFKVKNTGKYDGEEVSQLYMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTF-VLTEEDF 746
Query: 777 SIANKHGRRILPLGNHVLMVGELRHSLTIE 806
+ N ++++ GN LM+G + + ++
Sbjct: 747 FLVNYTLKKVVESGNFHLMIGAASNDIRLQ 776
>gi|375143423|ref|YP_005005864.1| Beta-glucosidase [Niastella koreensis GR20-10]
gi|361057469|gb|AEV96460.1| Beta-glucosidase [Niastella koreensis GR20-10]
Length = 793
Score = 252 bits (643), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 207/737 (28%), Positives = 333/737 (45%), Gaps = 129/737 (17%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIPA ++ +E + G+ + T FP L ++N+ L G
Sbjct: 142 RLGIPA-DFTNEGIRGVEA-----------YEATGFPTELNMGMTWNKELVHQEGIITGR 189
Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
EARA+ G T +AP +++ RD RWGR +E+ GEDP +V++ + K Q
Sbjct: 190 EARAL------GYTNVYAPIMDVARDQRWGRLEESYGEDPYLVASMGIALAKGIQ----- 238
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
DG + ++ KH Y K + + +
Sbjct: 239 -QDGKV-------------------------ASTAKHFAVYSANKGAREGQARTDPQVAP 272
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGD-LFQKARNEWGFKGYITSDC 327
++ E+ PF+ I++ +M SYN +G+P L Q+ R E GF GY+ SD
Sbjct: 273 REVENLLLYPFKKVIKEAGIMGVMSSYNDYDGIPVSGSNYWLIQRLRVEMGFTGYVVSDS 332
Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDIDR 383
DA+ + + +++ AGM++ ++ + + + +G++ I+
Sbjct: 333 DALEYLATKHHVAANLKEAVFQAFMAGMNVRTTFKAPDSIIIYLRQLVKEGRIPMDTINH 392
Query: 384 ALLNLFSVQLRLGLFN-------GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLK 436
+ ++ V+ RLGLF+ + RK V + +++AL A+R+ +VLLK
Sbjct: 393 RVADVLRVKFRLGLFDHPYVESAAETRK---------VVNSDASQQIALQASRESVVLLK 443
Query: 437 NDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVS--KTHYAS 494
N+ LPL K ++ +A++GP + Y + ++L+G++A + K YA
Sbjct: 444 NNNNILPLVK-SLDKIAVVGPNATDDDYAHTHYGPLGSPSVNVLQGIQAKLGAGKVLYAK 502
Query: 495 GCHDV----------PCNSDAG----FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSL 540
G V P DAG AV I K+A IVV G + E + R L
Sbjct: 503 GVDLVDKNWPESEILPEPMDAGEQAMLDSAVNITKQAQMAIVVLGGNTRTAGESKSRTDL 562
Query: 541 LLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALA 600
LPG Q+ LV ++ T K PV++VL G P+ +++ D I I++ GYPG G A+A
Sbjct: 563 DLPGHQLELVKAIKATGK-PVVVVLLGTQPMTINWI--DKYIDGIVYAGYPGVKGGIAVA 619
Query: 601 EIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGL 660
+++FGD+NPGG+L +TW P+S ++P+N + S + G + +Y FG GL
Sbjct: 620 DVLFGDYNPGGKLTLTW-PKSVGQIPLNFPSKPGAQSDE--GEHAKI--KGLLYPFGFGL 674
Query: 661 SYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISV 720
SYT++ Y + L IS A D V V + V
Sbjct: 675 SYTSFGY------TNLKISTGKTAA------------------DPVA--------VTVDV 702
Query: 721 TNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIAN 780
TN G + G VV + R S T EK L GF+RVH A +K ISF + P E+L + N
Sbjct: 703 TNTGKLAGDEVVQCYIRDVLSSVTTYEKLLKGFERVHLQAGETKTISFTI-PREELKLYN 761
Query: 781 KHGRRILPLGNHVLMVG 797
+ + +L G +M+G
Sbjct: 762 REMKFVLEPGEFSVMIG 778
>gi|116621797|ref|YP_823953.1| glycoside hydrolase family protein [Candidatus Solibacter usitatus
Ellin6076]
gi|116224959|gb|ABJ83668.1| glycoside hydrolase, family 3 domain protein [Candidatus Solibacter
usitatus Ellin6076]
Length = 765
Score = 252 bits (643), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 220/746 (29%), Positives = 350/746 (46%), Gaps = 138/746 (18%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP + E LHG A+ G TSFPQ + A+F+ L ++ + A
Sbjct: 104 RLGIPVI-FHEECLHGHAAIGG-----------TSFPQPIGLGATFDPELVESLFAMTAA 151
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
EARA LT P +++ R+PRWGR +ET GEDP +VS + V+ FQG+
Sbjct: 152 EARARGT--HQALT---PVVDVAREPRWGRVEETYGEDPFLVSRMGIAAVRGFQGDA--- 203
Query: 210 DDGGIGFGFREK-RVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
FR+K RV+ A KH A+ + G ++
Sbjct: 204 -------TFRDKTRVI----------------ATLKHFAAHGQPESGT---NCAPVNVSM 237
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPA-CLRGDLFQKARNEWGFKGYITSDC 327
+ +TF PF+ +++G A +M SYN+++GVP+ R L R EWGFKG++ SD
Sbjct: 238 RVLRETFLFPFKEALDKGCAISVMASYNEIDGVPSHASRWLLRDVLRKEWGFKGFVVSDY 297
Query: 328 DAVATIFEYQNYTKTH--------EDSAAGVLKAGMDINCGT--CMLRHTQSAIDKGKVQ 377
A I+E ++H ++ A ++AG++I C L H + KG +Q
Sbjct: 298 YA---IYELSYRPESHGHFVAKDKREACALAVQAGVNIELPEPDCYL-HLVDLVHKGVLQ 353
Query: 378 EKDIDRALLNLFSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVL 434
E +D + + + ++GLF+ DP + + ++ D H++LA+ AAR+ I L
Sbjct: 354 ESQLDELVEPMLRWKFQMGLFDDPYVDPAEAE--RIAGCDA----HRELAMQAARETITL 407
Query: 435 LKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTH 491
LKND +PL+ +A+ ++A+IGP N + GGY+G+P ++L G+ V +K
Sbjct: 408 LKNDGPVVPLDLSAIKTIAVIGPNANR--SLLGGYSGVPKHDVTVLDGIRERVGSRAKVV 465
Query: 492 YASGCHDV--------------PCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETED--- 534
YA GC P EAV++AK+AD +++ G + E
Sbjct: 466 YAEGCKITIGGSWVQDEVTPSDPAEDRRQIAEAVKVAKRADVIVLAIGGNEQTSREAWSP 525
Query: 535 ---RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYP 591
DR SL L G+Q LV ++ T K PVI L G P+ +++ + +I Y
Sbjct: 526 KHLGDRPSLDLVGRQEELVRAMVATGK-PVIAFLFNGRPISINYLA--QSVPAIFECWYL 582
Query: 592 GEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGT 651
G+ +A+AE++FGD NPGG+LP+T P S +P S+R R Y F
Sbjct: 583 GQETGRAVAEVLFGDTNPGGKLPIT-IPRSAGHLPA--FYNHKPSAR----RGYLFDEVG 635
Query: 652 QVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTS 711
+Y FG+GLSYT ++++ L + K + +++ +R
Sbjct: 636 PLYAFGYGLSYTTFAFQNLRL-------------AKKKMHRESTAR-------------- 668
Query: 712 LRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVD 771
V + VTN G +G VV L+ R S P K+L GF ++ ++ + F +
Sbjct: 669 ----VLVDVTNTGAREGREVVQLYIRDLVSSVTRPIKELKGFRKITLQPGQTQTVEFEIT 724
Query: 772 PCEQLSIANKHGRRILPLGNHVLMVG 797
P + L+ N + ++ G+ +MVG
Sbjct: 725 P-DLLAFYNVDMKFVVEPGDFEIMVG 749
>gi|242206820|ref|XP_002469265.1| hypothetical protein POSPLDRAFT_51213 [Postia placenta Mad-698-R]
gi|220731725|gb|EED85567.1| hypothetical protein POSPLDRAFT_51213 [Postia placenta Mad-698-R]
Length = 312
Score = 252 bits (643), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 130/306 (42%), Positives = 175/306 (57%), Gaps = 33/306 (10%)
Query: 55 CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
C+TS + RA +LISL TL+EKI + A +PRLG+PAY+WW E+LHG+A + PGV
Sbjct: 34 CDTSATPLERATALISLFTLEEKINNTGNTAPGVPRLGLPAYQWWQEALHGVAES-PGVI 92
Query: 115 F--NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
F +G S TSFPQ ++ A+F+ +L +++ + V+ EARA N ++G+ FW PNIN F
Sbjct: 93 FAPSGEYSYATSFPQPILMGAAFDDALINHVATIVSTEARAFNNANRSGIDFWTPNINPF 152
Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
+DPRWGRGQETPGEDP + +Y + QG G KR++
Sbjct: 153 KDPRWGRGQETPGEDPFHLQSYVYNLITGLQG----------GLDPEYKRIV-------- 194
Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
A CKH AYDLE W RY F+A+++ QD + + FR+C M
Sbjct: 195 --------ATCKHFAAYDLENWEGNVRYGFDALVSLQDLSEFYTRSFRTCARDANVGSFM 246
Query: 293 CSYNQVNGVPACLRGDLFQKA-RNEWGFKG---YITSDCDAVATIFEYQNYTKTHEDSAA 348
CSYN VNGVP+C L Q R+ WG+ YITSDCDA+ I+E YT T ++ A
Sbjct: 247 CSYNAVNGVPSCANSYLLQDILRDHWGWTNEDQYITSDCDAIQNIYEPHYYTATRAETVA 306
Query: 349 GVLKAG 354
L AG
Sbjct: 307 DALNAG 312
>gi|427384989|ref|ZP_18881494.1| hypothetical protein HMPREF9447_02527 [Bacteroides oleiciplenus YIT
12058]
gi|425728250|gb|EKU91109.1| hypothetical protein HMPREF9447_02527 [Bacteroides oleiciplenus YIT
12058]
Length = 862
Score = 252 bits (643), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 154/460 (33%), Positives = 236/460 (51%), Gaps = 55/460 (11%)
Query: 51 SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
+ P+ N LS + RAK L+S LTL+EK + D++ AIPRLGI + WWSE+LHG+A+ G
Sbjct: 20 TLPYQNPELSPAERAKDLVSRLTLEEKALLMCDDSEAIPRLGIKKFNWWSEALHGVANQG 79
Query: 111 PGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN---------LGQAG 161
+VT FP+ + AASFN L I +A + E RA +N
Sbjct: 80 ----------NVTVFPEPVGMAASFNEKLVFEIFNATSDEMRAKHNERVRNGLEDTRFHS 129
Query: 162 LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREK 221
L+ W PN+NIFRDPRWGRGQET GEDP + S + VK Q
Sbjct: 130 LSVWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGIAVVKGLQ------------------ 171
Query: 222 RVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNA-MITEQDTEDTFQPPFR 280
G E ++ +L+ AC KH + +W SR+S N I+ +D +T+ P F+
Sbjct: 172 ------GPEDEKYRKLL--ACAKHYAVHSGPEW---SRHSANLNNISPRDLWETYLPAFK 220
Query: 281 SCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNY 339
+ +++ +MC+Y +++ P C L Q+ R+EWGFK + SDC A+A +
Sbjct: 221 ALVQKADVREVMCAYQRLDDDPCCGSTRLLQQILRDEWGFKYLVVSDCGAIADFWTSHKS 280
Query: 340 TKTHEDSAAGVLKAGMDINCGT-CMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
+ +A AG D+ CG + A+ +G + E++ID+ +L L + LG
Sbjct: 281 SSDAVHAAVKGTMAGTDVECGYGYAYQKLPEAVSRGLITEEEIDKHVLRLLEGRFELGEM 340
Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
+ DP K+ ++ V + HK L+L+ +RQ + LL+N LPL+K ++ +A+IGP
Sbjct: 341 D-DPSLVKWSQIPMSVVNSKAHKDLSLNMSRQTMTLLQNKNNVLPLSK-SIRKIAVIGPN 398
Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGC 496
++ + G Y G P ++L G + + K Y GC
Sbjct: 399 ADDKPMLWGNYNGTPNQTITILDGFKTKLKKNQIIYMKGC 438
Score = 81.6 bits (200), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 75/312 (24%), Positives = 132/312 (42%), Gaps = 62/312 (19%)
Query: 498 DVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQM 547
++P N D E++ K D V+ V G+ E E+ DR + LP Q
Sbjct: 585 ELPINYD----ESISKLKNVDMVVFVGGISPQLEGEEMPLNLPGFKNGDRTDIELPAVQR 640
Query: 548 SLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDF 607
+ + ++ SK+ V + +G ++ +IL Y GE G +A+A+++FGD+
Sbjct: 641 NFLKALKEASKQVVFVNCSGSS---MALLPETESCDAILQAWYGGELGGQAVADVLFGDY 697
Query: 608 NPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSY 667
NP G+LP+T+Y + D +M+ GR+YR Y ++ FG GLSYT+++
Sbjct: 698 NPSGKLPVTFYKSTKQLPDYEDYSMK--------GRTYR-YMSDPLFPFGFGLSYTDFTV 748
Query: 668 KFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVD 727
Q + ++L +E + T + ++N G
Sbjct: 749 ---------------------GTAQCSKTQL---RTEEALTLT-------VPISNTGKRS 777
Query: 728 GSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI--ANKHGRR 785
G+ V+ ++ R G P K L + R A +++I + P E + + R
Sbjct: 778 GTEVIQVYIRKTD-DTGGPLKSLKAYARAELAAGATQDIEIQL-PAESFECFDPSTNTMR 835
Query: 786 ILPLGNHVLMVG 797
+ P G + L G
Sbjct: 836 VAP-GEYELFYG 846
>gi|295135338|ref|YP_003586014.1| glycoside hydrolase [Zunongwangia profunda SM-A87]
gi|294983353|gb|ADF53818.1| glycoside hydrolase family protein [Zunongwangia profunda SM-A87]
Length = 764
Score = 252 bits (643), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 215/742 (28%), Positives = 337/742 (45%), Gaps = 136/742 (18%)
Query: 76 EKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASF 135
EKI+ D A R+GIP S+ +HG T+FP L +AAS+
Sbjct: 90 EKIRVAQDYAVNDTRMGIPLL-IGSDVIHGYK---------------TTFPIPLGTAASW 133
Query: 136 NRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSAY 194
+ + A EA A G+ + ++P ++I RDPRWGR E GEDP + S
Sbjct: 134 DMEMIKKTAEIAAQEATA------DGINWNFSPMVDIARDPRWGRIAEGAGEDPYLGSQI 187
Query: 195 AVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKW 254
A V+ +QG+ D E + A KH Y +
Sbjct: 188 AKAMVEGYQGD--------------------------DLAKENTMIATVKHFALYGASEA 221
Query: 255 G---NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC----LRG 307
G N + S M E + PP+++ I+ G A +M S+N V+GVPA L
Sbjct: 222 GRDYNTTDMSRVKMFNE------YLPPYKAAIDAG-AESVMSSFNDVDGVPATGNKWLLT 274
Query: 308 DLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDIN-CGTCMLRH 366
DL R+ WGF+G++TSD ++ + + + +A LKAG+D++ G L+
Sbjct: 275 DLL---RDRWGFEGFVTSDYTSLNEMIAHG--MGDLQAVSALALKAGLDMDMVGEGYLKT 329
Query: 367 TQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALD 426
+ ++D+GKV E +I A + + +LGLF+ DP K D+ + E++ +
Sbjct: 330 LKKSLDEGKVTEAEITTAARRILEAKYKLGLFD-DPYKYLDESRPEKDILSEENRTFSRK 388
Query: 427 AARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY--TGIPCSPKSLLRGLE 484
A VLLK D PL KNA +A+IGPL NN + M G + TG P +L+G++
Sbjct: 389 VAAHSFVLLKKDAGVFPLKKNA--KIALIGPLANNKNNMLGTWAPTGNPQLSVPVLQGVK 446
Query: 485 --AYVSKTHYASGCHDV-------------------PCNSDAGFHEAVRIAKKADFVIVV 523
A +K YA G + + + EA+++AKK+D ++ V
Sbjct: 447 NVAPKAKVTYAQGANITDDAQLAENINVFGPRAEISETSPEKMLEEALKVAKKSDVIVAV 506
Query: 524 AGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQIS 583
G E R +LL+P Q L+ +A+T K P+ LVL G PL++S E++ I
Sbjct: 507 VGEATEMSGEAASRTNLLIPESQKKLIRELAKTGK-PMALVLMSGRPLNIS-EESEMNID 564
Query: 584 SILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPG- 642
IL + +PG A+A++IFGD+NP G++ +W P + +VP+ R + G
Sbjct: 565 -ILQVWHPGVEAGNAIADVIFGDYNPSGKITASW-PRNVGQVPVYYAMKRTGRPGEVEGF 622
Query: 643 ----RSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRL 698
+ + +Y FG+GLSYT + Y + +KA +D+
Sbjct: 623 QKFKSEFLDTPNSPLYPFGYGLSYTEFEY------------SDVKASADE---------- 660
Query: 699 DYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHT 758
+ +D + +++ +TN GD DG VV L+ S P KQLIGF+++
Sbjct: 661 --LKMDGTLTLSAI-------ITNTGDYDGEEVVQLYIHDKVRSITPPMKQLIGFEKIML 711
Query: 759 VAKGSKEISFGVDPCEQLSIAN 780
SK ++F + E L N
Sbjct: 712 KKGESKTVTFEIS-AEDLKFYN 732
>gi|340616359|ref|YP_004734812.1| xylosidase/arabinosidase [Zobellia galactanivorans]
gi|339731156|emb|CAZ94420.1| Xylosidase/arabinosidase, family GH3 [Zobellia galactanivorans]
Length = 801
Score = 251 bits (642), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 205/734 (27%), Positives = 323/734 (44%), Gaps = 114/734 (15%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ TSFP L A++N++L IG
Sbjct: 142 RLGIPV-DFTNEGIRGLCH-----------EKATSFPSQLGVGATWNKNLVGKIGHITGK 189
Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
EAR + G T ++P ++I RDPRWGR E GEDP +V + VK Q E
Sbjct: 190 EARLL------GYTNVYSPILDIARDPRWGRVVECYGEDPYLVGELGYQMVKGIQQEK-- 241
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
+ + KH Y K G +A ITE
Sbjct: 242 ------------------------------VVSTPKHFAIYSAPKGGRDGDARTDAHITE 271
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDC 327
++ + PF+ I+ A +M SYN NGVP R +WGFKGY+ SD
Sbjct: 272 RELFSLYLHPFKRAIKDAGAMGVMSSYNDYNGVPVSSSKYFLNDILREDWGFKGYVVSDS 331
Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCM----LRHTQSAIDKGKVQEKDIDR 383
AV I + + K +D+ + AG+++ M + + + +G + ID
Sbjct: 332 RAVEFIADKHHVAKDRKDAVRQAVLAGLNVRTDFTMPEDFILPVRELVKEGGLDMATIDD 391
Query: 384 ALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLP 443
+ ++ V+ GLF D GK K V E++++A A+ + IVLLKN++ LP
Sbjct: 392 RVRDILRVKFWQGLF--DAPYGKQMKEADKTVGKPEYQEVAYQASLESIVLLKNEENILP 449
Query: 444 LNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSK---THYASGC---- 496
L+ + S+ + GP I+ Y S+ G++ K Y GC
Sbjct: 450 LDFSKYKSVLVTGPNAKAINHSVSRYGPSHIDVVSVFDGIKEKFPKDVEIKYTKGCVFFD 509
Query: 497 ---------HDVPCNSD-AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQ 546
+ P ++ + +AV +AK IVV G D E R R SL LPG Q
Sbjct: 510 ENWPDSELMNTPPTEAEQSEIDKAVAMAKTVGLAIVVLGDDEETVGESRSRTSLDLPGNQ 569
Query: 547 MSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD 606
LV + +T PVI+VL G P+ +++ D + I+ + G+ G A+A+++ G
Sbjct: 570 QKLVEEIYKTGT-PVIVVLINGRPMTINW--VDKYVPGIVEGWFQGKFGGSAIADVLVGS 626
Query: 607 FNPGGRLPMTWYPESFTKVPMN---DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYT 663
+NPGG+LP++ +P++ ++PMN +AD + P S + G +Y FG+GLSYT
Sbjct: 627 YNPGGKLPVS-FPKTVGQLPMNFPSKPGAQADQPAKGPNGSGKTRVGGFLYPFGYGLSYT 685
Query: 664 NYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNA 723
+ Y + L I +++K G + V + +TN+
Sbjct: 686 TFEY------TNLKIRSNIKNGLGDVV-------------------------VSVDITNS 714
Query: 724 GDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHG 783
G G +V L+ S EKQL GF+R+ A +K ++F + P E LS+ N+
Sbjct: 715 GKRKGDEIVQLYFSDETSSVTVYEKQLRGFERISLEAGETKTVNFTLSP-EDLSLYNRQM 773
Query: 784 RRILPLGNHVLMVG 797
+L G+ +M+G
Sbjct: 774 EFVLEPGSFTIMIG 787
>gi|206901921|ref|YP_002251428.1| xylosidase/arabinosidase [Dictyoglomus thermophilum H-6-12]
gi|206741024|gb|ACI20082.1| xylosidase/arabinosidase [Dictyoglomus thermophilum H-6-12]
Length = 756
Score = 251 bits (642), Expect = 9e-64, Method: Compositional matrix adjust.
Identities = 199/681 (29%), Positives = 321/681 (47%), Gaps = 112/681 (16%)
Query: 101 ESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA 160
E+LHG + G T FPQ + A+++N L + +A+ E R+ +
Sbjct: 138 EALHGCMAKGS-----------TIFPQAIGMASTWNPELIYQVATAIGKETRS-----RG 181
Query: 161 GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFRE 220
+P INI RDPR GR +ET GEDP + S AV ++K G +E
Sbjct: 182 IHQVLSPTINIARDPRCGRTEETYGEDPYLASRMAVAYIK----------------GVQE 225
Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFR 280
+ V+ A KH +A + G+ R S+ +E+ + + P FR
Sbjct: 226 QGVI----------------ATPKHFVANFV---GDGGRDSYPIHFSERLLREIYFPAFR 266
Query: 281 SCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNY 339
+ IE+ A +M +YN ++G+P L + R EWGFKGY+ SD +V +
Sbjct: 267 ASIEEAGALSLMAAYNSLDGIPCSSNKWLLTRILRKEWGFKGYVVSDYFSVLHLMTKHKV 326
Query: 340 TKTHEDSAAGVLKAGMDINC-GTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
++ ++A L+AG+D+ + I + K+ + +D A+ + V+ +GLF
Sbjct: 327 AESKAEAAKLSLEAGLDMELPDSDCFEEIPGLIRESKLSQDTLDEAVRRVLRVKFWIGLF 386
Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
+ Y + D SEH++LAL AR+ IVLLKN+ LPLNK+ + S+A+IGP
Sbjct: 387 DNPFVDPDYAERIND---CSEHRELALRVARESIVLLKNEG-ILPLNKD-IRSIAVIGP- 440
Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCHDVPCNSDAGFHEAVRIAK 515
N GGY+G + L G++ + K ++A GC + S +GF EA++IA+
Sbjct: 441 -NAAVPRLGGYSGYGVKVVTPLEGIKNKLGDKVKVYFAEGC-GLNDTSKSGFDEAIKIAQ 498
Query: 516 KADFVIVVAGLDLTQ-ETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
K+D I+ G + + E E RDR +L LPG Q L+ + T+ PVI+VL G ++
Sbjct: 499 KSDVAILFMGNSVPETEGEQRDRHNLNLPGVQEDLIKEICNTNT-PVIVVLINGSA--IT 555
Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM---NDMN 631
++ +++ YPGE G A+A+++FGD+NPGG+LP++ +P+ +++P+ + +
Sbjct: 556 MMNWIDKVQAVIEAWYPGEEGGNAIADVLFGDYNPGGKLPIS-FPKYSSQLPLYYNHKPS 614
Query: 632 MRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLS-APSELTISASLKAGSDKNI 690
R D G Y F FG+GLSYT++ Y L P E+ D +
Sbjct: 615 GRVDDYVDLRGNQYLF-------PFGYGLSYTDFKYSNLRITPEEIP--------RDGEV 659
Query: 691 LQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQL 750
+ + + N G G VV L+ S P K+L
Sbjct: 660 V------------------------ITFDIENIGKYKGDEVVQLYLHDEFASVARPIKEL 695
Query: 751 IGFDRVHTVAKGSKEISFGVD 771
F+RV K +SF ++
Sbjct: 696 KRFERVTLDVGERKTVSFKLN 716
>gi|423722678|ref|ZP_17696831.1| hypothetical protein HMPREF1078_00891 [Parabacteroides merdae
CL09T00C40]
gi|409241951|gb|EKN34716.1| hypothetical protein HMPREF1078_00891 [Parabacteroides merdae
CL09T00C40]
Length = 955
Score = 251 bits (642), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 204/734 (27%), Positives = 331/734 (45%), Gaps = 113/734 (15%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ S T+FP L ++NR+L +G
Sbjct: 161 RLGIPT-DFTNEGIRGVES-----------YIATNFPTQLGLGHTWNRNLVHKVGYITGR 208
Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
E R + G T +AP +++ RD RWGR +E GE P +V+ VE K Q
Sbjct: 209 EGRLL------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGVEMAKGMQ----- 257
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
+ ++A KH IAY K G + ++
Sbjct: 258 --------------------------TDYQVAATSKHYIAYSNNKGGREGMARVDPQMSP 291
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVP-ACLRGDLFQKARNEWGFKGYITSDC 327
++ E P++ I++ +M SYN +G P L + R E+GF+GY+ SD
Sbjct: 292 REVEMIHVYPWKRVIKEAGILGVMSSYNDYDGFPIQSSYYWLTTRLRGEFGFRGYVVSDS 351
Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDIDR 383
DAV +F ++S + AG++I C + + I +G + ID
Sbjct: 352 DAVEYLFSKHGTAADMKESVLQSVLAGLNIRCTFRSPDSYVLPLRELIAEGAIPMSTIDD 411
Query: 384 ALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLP 443
+ ++ V+ +GLF+ P + + + C +E++ +AL A+++ +VLLKN LP
Sbjct: 412 RVRDILRVKFLVGLFD-HPYQIDLKETDKEVNC-AENQLVALQASKESLVLLKNQDAVLP 469
Query: 444 LNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCHDVP 500
L+ N +S +A+ GP + + Y + ++L G+ V + + GC V
Sbjct: 470 LDVNKISKIAVCGPNADEEAYALTHYGPLAVEVTTVLEGIRNKVKPGTDVLFTKGCDLVD 529
Query: 501 CN--------------SDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQ 546
N + +AV AKK+D +VV G E++ R SL LPG+Q
Sbjct: 530 ANWPESELIRYPLTAEEQSEIDKAVENAKKSDVTVVVLGGSNRTCGENKSRSSLDLPGRQ 589
Query: 547 MSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD 606
+ L+ +V T K PV+LVL G PL +++A D + +IL YPG G A+A+ +FGD
Sbjct: 590 LDLLQAVVATGK-PVVLVLINGRPLSINWA--DKYVPAILEAWYPGSQGGTAIADALFGD 646
Query: 607 FNPGGRLPMTWYPESFTKVPMN---DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYT 663
+NPGG+L +T +P++ ++P N N + D R +Y FG+GLSYT
Sbjct: 647 YNPGGKLTVT-FPKTVGQIPFNFPTKPNAQVDGGRNKGLDGNMSRVNGPLYPFGYGLSYT 705
Query: 664 NYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNA 723
+ Y S+++I ++ + +V T V+ VTN
Sbjct: 706 TFEY------SDISIQPAI--------------------VTQVQPVT-----VRCKVTNT 734
Query: 724 GDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHG 783
G G VV L+ R S T EK L+GFDR+H +KE++F ++P L + N
Sbjct: 735 GKRAGDEVVQLYVRDILSSVTTYEKNLVGFDRIHLNPGETKELTFTIEP-RDLQLLNSDN 793
Query: 784 RRILPLGNHVLMVG 797
++ G+ +MVG
Sbjct: 794 HWVVEPGDFKVMVG 807
>gi|427387362|ref|ZP_18883418.1| hypothetical protein HMPREF9447_04451 [Bacteroides oleiciplenus YIT
12058]
gi|425725523|gb|EKU88394.1| hypothetical protein HMPREF9447_04451 [Bacteroides oleiciplenus YIT
12058]
Length = 865
Score = 251 bits (641), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 156/459 (33%), Positives = 235/459 (51%), Gaps = 58/459 (12%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
P+ N L+ S RA L+ +TL+EKI Q+ + + AI RLGIPAY WW+E+LHG+A G
Sbjct: 24 PYKNPDLTPSERAWDLLKRMTLEEKISQMKNGSPAIERLGIPAYNWWNEALHGVARAG-- 81
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL--------GQAGLTF 164
T FPQ + AA+F+ S V+ EARA Y+ G GLTF
Sbjct: 82 --------KATVFPQAIGLAATFDNQAVHETFSIVSDEARAKYHDFQRKGERDGYKGLTF 133
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
W PNINI+RDPRWGRG ET GEDP + S + VK QG+ G G +K
Sbjct: 134 WTPNINIYRDPRWGRGMETYGEDPYLTSLMGLAVVKGLQGD---------GTGKYDK--- 181
Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
AC KH + +W +R+SF+A I+++D +T+ P F++ +
Sbjct: 182 --------------THACAKHYAVHSGPEW---NRHSFDAKNISQRDLWETYLPAFKTLV 224
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
+GK +MC+YN+ G P C L + R +WG+ + SDC A+ F Y N+ +T
Sbjct: 225 TEGKVKEVMCAYNRYEGEPCCSNKQLLIRILREDWGYDDIVVSDCGAIGD-FYYPNHHET 283
Query: 343 H---EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
H ++A + +G D+ CG A+ KG + E I+ ++ L + +LG+F+
Sbjct: 284 HPTAAAASADAVVSGTDLECGGSY-SSLNEAVRKGLISEDKINESVFRLLRARFQLGMFD 342
Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
D + ++ V + EH AL+ AR+ +VLL N LPL+K +V +A++GP
Sbjct: 343 -DNTLVSWSEIPYSVVESKEHVAKALEMARKSMVLLTNKNNILPLSK-SVRKVAVLGPNA 400
Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSK--THYASGC 496
N+ + Y G P ++L G+ + + +Y GC
Sbjct: 401 NDSVMLWANYNGFPTKSVTILEGIRNKLPEGAVYYEKGC 439
Score = 116 bits (291), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 94/317 (29%), Positives = 148/317 (46%), Gaps = 57/317 (17%)
Query: 493 ASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLL 542
AS D+ + + E A +AD +I V GL + E E+ DR ++ L
Sbjct: 577 ASLKFDIGIKKEINYKEMADKAAEADVIIFVGGLSSSLEGEEMPVDLPGFRKGDRTNIDL 636
Query: 543 PGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEI 602
P Q ++ ++ +T K PV+ VL G L + + EA++ + +I+ YPG+ G A+A++
Sbjct: 637 PQVQEEMLKALKKTGK-PVVFVLCSGSTLALPW-EAEN-LDAIIEAWYPGQQGGTAVADV 693
Query: 603 IFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
+FGD+NP GRLP+T+Y S D +M R+YR++ G ++ FGHGLSY
Sbjct: 694 LFGDYNPAGRLPLTFYASSSDLPDFEDYDMS--------NRTYRYFKGRPLFPFGHGLSY 745
Query: 663 TNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTN 722
T + Y KA +DK IL + G L + I + N
Sbjct: 746 TTFDYG--------------KAKADKKIL-RAGEGLT----------------LTIPLKN 774
Query: 723 AGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
G + G VV ++ R P +G P K L F R+ A ++++ F + P N
Sbjct: 775 IGKLSGDEVVQVYLRNPGDKEG-PIKTLRAFRRISLEAGQAEDVLFEL-PVSTFEWFNPA 832
Query: 783 GRR--ILPLGNHVLMVG 797
R +LP G + L+ G
Sbjct: 833 TNRMEVLP-GKYELLYG 848
>gi|154493932|ref|ZP_02033252.1| hypothetical protein PARMER_03276 [Parabacteroides merdae ATCC
43184]
gi|154086192|gb|EDN85237.1| glycosyl hydrolase family 3 C-terminal domain protein
[Parabacteroides merdae ATCC 43184]
Length = 955
Score = 251 bits (641), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 203/734 (27%), Positives = 332/734 (45%), Gaps = 113/734 (15%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ S T+FP L ++NR+L +G
Sbjct: 161 RLGIPT-DFTNEGIRGVES-----------YIATNFPTQLGLGHTWNRNLVHKVGYITGR 208
Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
E R + G T +AP +++ RD RWGR +E GE P +V+ +E K Q ++
Sbjct: 209 EGRLL------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMAKGMQTDH-- 260
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
++A KH IAY K G + ++
Sbjct: 261 -----------------------------QVAATSKHYIAYSNNKGGREGMARVDPQMSP 291
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVP-ACLRGDLFQKARNEWGFKGYITSDC 327
++ E P++ I++ +M SYN +G P L + R E+GF+GY+ SD
Sbjct: 292 REVEMIHVYPWKRVIKEAGILGVMSSYNDYDGFPIQSSYYWLTTRLRGEFGFRGYVVSDS 351
Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDIDR 383
DAV +F ++S + AG++I C + + I +G + ID
Sbjct: 352 DAVEYLFSKHGTAADMKESVLQSVLAGLNIRCTFRSPDSYVLPLRELIAEGAIPMSTIDD 411
Query: 384 ALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLP 443
+ ++ V+ +GLF+ P + + + C +E++ +AL A+++ +VLLKN LP
Sbjct: 412 RVRDILRVKFLVGLFD-HPYQIDLKETDKEVNC-AENQLVALQASKESLVLLKNQDAVLP 469
Query: 444 LNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCHDVP 500
L+ N +S +A+ GP + + Y + ++L G+ V + + GC V
Sbjct: 470 LDVNKISKIAVCGPNADEEAYALTHYGPLAVEVTTVLEGIRNKVKPGTDVLFTKGCDLVD 529
Query: 501 CN--------------SDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQ 546
N + +AV AKK+D +VV G E++ R SL LPG+Q
Sbjct: 530 ANWPESELIRYPLTAEEQSEIDKAVENAKKSDVTVVVLGGSNRTCGENKSRSSLDLPGRQ 589
Query: 547 MSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD 606
+ L+ +V T K PV+LVL G PL +++A D + +IL YPG G A+A+ +FGD
Sbjct: 590 LDLLQAVVATGK-PVVLVLINGRPLSINWA--DKYVPAILEAWYPGSQGGTAIADALFGD 646
Query: 607 FNPGGRLPMTWYPESFTKVPMN---DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYT 663
+NPGG+L +T +P++ ++P N N + D R +Y FG+GLSYT
Sbjct: 647 YNPGGKLTVT-FPKTVGQIPFNFPTKPNAQVDGGRNKGLDGNMSRVNGPLYPFGYGLSYT 705
Query: 664 NYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNA 723
+ Y S+++I ++ + +V T V+ VTN
Sbjct: 706 TFEY------SDISIQPAI--------------------VTQVQPVT-----VRCKVTNT 734
Query: 724 GDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHG 783
G G VV L+ R S T EK L+GFDR+H +KE++F ++P L + N
Sbjct: 735 GKRAGDEVVQLYVRDILSSVTTYEKNLVGFDRIHLNPGETKELTFTIEP-RDLQLLNSDN 793
Query: 784 RRILPLGNHVLMVG 797
++ G+ +MVG
Sbjct: 794 HWVVEPGDFKVMVG 807
>gi|423313129|ref|ZP_17291065.1| hypothetical protein HMPREF1058_01677 [Bacteroides vulgatus
CL09T03C04]
gi|392686343|gb|EIY79649.1| hypothetical protein HMPREF1058_01677 [Bacteroides vulgatus
CL09T03C04]
Length = 864
Score = 251 bits (641), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 162/462 (35%), Positives = 232/462 (50%), Gaps = 64/462 (13%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ ++SLS RA+ L+ LTL+EK+ + DN+ + RLGI Y WW+E+LHG+A +G
Sbjct: 24 YKDSSLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSG--- 80
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
T FPQ + AASF I +AV+ EARA A GLT W
Sbjct: 81 -------LATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNAAYSAAGSYERYQGLTMW 133
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQ--GENWKSDDGGIGFGFREKRV 223
P +NI+RDPRWGRG ET GEDP + S V VK Q N K D
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCMDANQKYDK------------ 181
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSC 282
+ AC KH + +W +R+ FNA I +D +T+ PF +
Sbjct: 182 ---------------IHACAKHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEAL 223
Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTK 341
+++ K +MC+YN++ G P C L Q R +WG+ G + SDC A+ + + + K
Sbjct: 224 VKEAKVKEVMCAYNRLEGDPCCGSDRLLMQILRQDWGYDGIVLSDCGAIDDFYREKGH-K 282
Query: 342 THED----SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
TH D SAA VL +G D+ CG+ +SA KG + EKDID ++ L + LG
Sbjct: 283 THPDAESASAAAVL-SGTDLECGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGE 340
Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
+ DP K ++ K+ VC++EH L+LD AR+ + LL N LPL + ++A++GP
Sbjct: 341 MD-DPDKVEWTKIPYSVVCSAEHDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGP 398
Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGC 496
N+ G Y G P +LL G+ + + K Y GC
Sbjct: 399 NANDSVMQWGNYNGTPKHTITLLEGIRSAMGENDKLIYEQGC 440
Score = 123 bits (309), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 98/325 (30%), Positives = 148/325 (45%), Gaps = 66/325 (20%)
Query: 491 HYASGCHDVPCNSDAGFHEAVRI------AKKADFVIVVAGLDLTQETEDR--------- 535
Y SG D N D GF E V I K AD VI G+ + E E+
Sbjct: 572 QYFSG--DAQLNFDLGFKEEVNIKNTVAKVKDADVVIFAGGISPSLEGEEMGVNLPGFRK 629
Query: 536 -DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEA 594
DR + LP Q L+ ++ K+ VI V G P ++ +IL YPG++
Sbjct: 630 GDRTDIELPAVQRELIKALCDAGKK-VIFVNFSGSP--IAMEPETKYCQAILQAWYPGQS 686
Query: 595 GAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQV 653
G KA+AE++FGD+NP GRLP+T+Y + T++P D NM GR+YR++ G +
Sbjct: 687 GGKAVAEVLFGDYNPAGRLPVTFY-RNITQLPNFEDYNMT--------GRTYRYFKGDPL 737
Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
+ FG+GLSYT ++Y + + ++K G I+
Sbjct: 738 FPFGYGLSYTTFNY------GNIKLEQTIKVGETAKII---------------------- 769
Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPC 773
+ VTN G+ DG VV ++ + + ++G P K L F RV A + + + P
Sbjct: 770 ----VPVTNTGNRDGEEVVQVYLKKQEDAEG-PVKTLRAFKRVQIPAGKTVNVELELTP- 823
Query: 774 EQLSIANKHGRRILPL-GNHVLMVG 797
+QL + + L GN +MVG
Sbjct: 824 KQLEWWDTQTNTMRTLAGNFDIMVG 848
>gi|383113364|ref|ZP_09934136.1| hypothetical protein BSGG_3068 [Bacteroides sp. D2]
gi|382948729|gb|EFS32368.2| hypothetical protein BSGG_3068 [Bacteroides sp. D2]
Length = 850
Score = 251 bits (641), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 153/447 (34%), Positives = 232/447 (51%), Gaps = 57/447 (12%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ N + + R L+S LT++EKI L + IPRLGI Y +E+LHG+ G
Sbjct: 27 YKNENAPVHERVADLLSRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPG--- 83
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG----------LT 163
T FPQ + AA++N L + + ++ EARA +N G LT
Sbjct: 84 -------RFTVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQFSDVLT 136
Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
FW+P +N+ RDPRWGR ET GEDP + FVK QG+ + R
Sbjct: 137 FWSPTVNMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQGD--------------DPRY 182
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
LK + + KH A + E +R+ N I+E+ + + P F C+
Sbjct: 183 LK-------------IVSTPKHFAANNEEH----NRFVCNPQISEKQLREYYFPAFEMCV 225
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
++GKA+ IM +YN +N VP L L +K R +WGF+GY+ SDC + + Y KT
Sbjct: 226 KEGKAASIMTAYNALNDVPCTLNAWLLKKVLRQDWGFQGYVVSDCGGPSLLVNAHKYVKT 285
Query: 343 HEDSAAGVLKAGMDINCGTCML-RHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
E +A ++AG+D+ CG + + +A + V + DID A ++ + +++LGLF+G
Sbjct: 286 KEAAATLSIQAGLDLECGDDVYDEYLLNAYKQYMVSDADIDSAACHVLTARMKLGLFDGT 345
Query: 402 PRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
R Y ++ P + + EH+++ALDAAR+ IVLLKN LPLN N V S+A++G +N
Sbjct: 346 ER-NPYTRISPSVIGSKEHQQIALDAARECIVLLKNKNNMLPLNVNKVKSIAVVG--INA 402
Query: 462 ISQMGGGYTGIP-CSPKSLLRGLEAYV 487
G Y+G P P S+L+G++ V
Sbjct: 403 GKCEFGDYSGAPVVDPVSILQGIKDRV 429
Score = 121 bits (303), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 84/303 (27%), Positives = 148/303 (48%), Gaps = 49/303 (16%)
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
+ EA + ++ + V+ V G++ + E E +DR + LP Q + + + + +I+VL
Sbjct: 591 YGEAGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNPN-IIVVLV 649
Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
G L V++ D I +I+ YPGE G A+A+++FGD+NP GRLP+T+Y +S ++P
Sbjct: 650 AGSSLAVNWM--DEHIPAIVNAWYPGEQGGTAVADVLFGDYNPAGRLPLTYY-KSLDELP 706
Query: 627 -MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAG 685
+D ++ GR+Y+++ G +Y FG+GLSY+++ Y L +K G
Sbjct: 707 AFDDYDITQ-------GRTYKYFKGDVLYPFGYGLSYSSFKYSDL----------KVKDG 749
Query: 686 SDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT 745
++ T S R N G G V ++ R+P+
Sbjct: 750 AN-------------------TVSVSFRLK------NTGKRKGDEVAQVYVRIPETGGVV 784
Query: 746 PEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH-GRRILPLGNHVLMVGELRHSLT 804
P K+L GF R+ + S+ + +D EQL + G+ I+P G +M+G +
Sbjct: 785 PIKELKGFRRIPLKSGESRVVEIELDK-EQLRYWDAGLGQFIVPQGAFDIMIGASSKDIR 843
Query: 805 IET 807
++T
Sbjct: 844 LQT 846
>gi|150003731|ref|YP_001298475.1| glycoside hydrolase family protein [Bacteroides vulgatus ATCC 8482]
gi|319640047|ref|ZP_07994774.1| glycoside hydrolase family 3 [Bacteroides sp. 3_1_40A]
gi|345517061|ref|ZP_08796539.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
4_3_47FAA]
gi|149932155|gb|ABR38853.1| glycoside hydrolase family 3, candidate beta-glycosidase
[Bacteroides vulgatus ATCC 8482]
gi|254833833|gb|EET14142.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
4_3_47FAA]
gi|317388325|gb|EFV69177.1| glycoside hydrolase family 3 [Bacteroides sp. 3_1_40A]
Length = 864
Score = 251 bits (641), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 162/462 (35%), Positives = 232/462 (50%), Gaps = 64/462 (13%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ ++SLS RA+ L+ LTL+EK+ + DN+ + RLGI Y WW+E+LHG+A +G
Sbjct: 24 YKDSSLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSG--- 80
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
T FPQ + AASF I +AV+ EARA A GLT W
Sbjct: 81 -------LATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNAAYSAAGSYERYQGLTMW 133
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQ--GENWKSDDGGIGFGFREKRV 223
P +NI+RDPRWGRG ET GEDP + S V VK Q N K D
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCMDANQKYDK------------ 181
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSC 282
+ AC KH + +W +R+ FNA I +D +T+ PF +
Sbjct: 182 ---------------IHACAKHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEAL 223
Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTK 341
+++ K +MC+YN++ G P C L Q R +WG+ G + SDC A+ + + + K
Sbjct: 224 VKEAKVKEVMCAYNRLEGDPCCGSDRLLMQILRQDWGYDGIVLSDCGAIDDFYREKGH-K 282
Query: 342 THED----SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
TH D SAA VL +G D+ CG+ +SA KG + EKDID ++ L + LG
Sbjct: 283 THPDAESASAAAVL-SGTDLECGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGE 340
Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
+ DP K ++ K+ VC++EH L+LD AR+ + LL N LPL + ++A++GP
Sbjct: 341 MD-DPDKVEWTKIPYSVVCSAEHDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGP 398
Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGC 496
N+ G Y G P +LL G+ + + K Y GC
Sbjct: 399 NANDSVMQWGNYNGTPKHTITLLEGIRSAMGENDKLIYEQGC 440
Score = 121 bits (304), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 97/325 (29%), Positives = 147/325 (45%), Gaps = 66/325 (20%)
Query: 491 HYASGCHDVPCNSDAGFHEAVRI------AKKADFVIVVAGLDLTQETEDR--------- 535
Y SG D N D GF E V I K AD VI G+ + E E+
Sbjct: 572 QYFSG--DAQLNFDLGFKEEVNIKNTVAKVKDADVVIFAGGISPSLEGEEMGVNLPGFRK 629
Query: 536 -DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEA 594
DR + LP Q L+ ++ K+ VI V G P ++ +IL YPG++
Sbjct: 630 GDRTDIELPAVQRELIKALCDAGKK-VIFVNFSGSP--IAMEPETKYCQAILQAWYPGQS 686
Query: 595 GAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQV 653
G KA AE++FGD+NP GRLP+T+Y + T++P D NM GR+YR++ G +
Sbjct: 687 GGKAAAEVLFGDYNPAGRLPVTFY-RNITQLPDFEDYNMT--------GRTYRYFKGDPL 737
Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
+ FG+GLSYT ++Y + + ++K G I+
Sbjct: 738 FPFGYGLSYTTFNY------GNIKLEQTIKVGETAKII---------------------- 769
Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPC 773
+ VTN G+ DG VV ++ + + ++G P K L F RV A + + + P
Sbjct: 770 ----VPVTNTGNRDGEEVVQVYLKKQEDAEG-PVKTLRAFKRVQIPAGKTVNVELELTP- 823
Query: 774 EQLSIANKHGRRILPL-GNHVLMVG 797
+QL + + + GN +MVG
Sbjct: 824 KQLEWWDAQTNTMRTIAGNFDIMVG 848
>gi|261880245|ref|ZP_06006672.1| beta-glucosidase [Prevotella bergensis DSM 17361]
gi|270333079|gb|EFA43865.1| beta-glucosidase [Prevotella bergensis DSM 17361]
Length = 854
Score = 251 bits (641), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 151/463 (32%), Positives = 238/463 (51%), Gaps = 55/463 (11%)
Query: 46 PPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHG 105
P +P+ NT LS RA L S LTL+EK + + + + AIPRLGIP +EWWSE+LHG
Sbjct: 17 PMKAQQFPYQNTDLSPKERAADLCSRLTLEEKSKIMQNGSPAIPRLGIPQFEWWSEALHG 76
Query: 106 IASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ------ 159
I NG T FP + A+S++ +L + AV+ E R +
Sbjct: 77 IGRNG----------FATVFPITMGMASSWDDALLQKVFDAVSDEGRVKAQQAKRSGTIK 126
Query: 160 --AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFG 217
GL+FW PNINIFRDPRWGRGQET GEDP + S + V+ QG SD
Sbjct: 127 RYQGLSFWTPNINIFRDPRWGRGQETYGEDPYLTSRMGLAVVRGLQG---PSDSK----- 178
Query: 218 FREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQ 276
+R+ L AC KH + +W +R++FN + E+D +T+
Sbjct: 179 YRK------------------LLACAKHFAVHSGPEW---NRHTFNVEDLPERDLWETYL 217
Query: 277 PPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFE 335
P F++ ++QG + +MC+Y +++G P C + RNEW ++G + SDC AV ++
Sbjct: 218 PAFKALVQQGDVAEVMCAYQRIDGQPCCGNNRFLKSILRNEWNYQGMVVSDCWAVPDFWK 277
Query: 336 --YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQL 393
+ + ++A + +G D+ CG+ + A+ G ++E D+D ++ L +
Sbjct: 278 KGHHEVSPDATHASAKAVLSGTDVECGSD-YSNLPEAVRAGIIKEADVDVSVRRLLEARF 336
Query: 394 RLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
LG F+ D + K+ V + HK+LALD AR+ +VLL+N+ LPL ++ +
Sbjct: 337 ALGDFDPD-ELVPWTKISESVVASKAHKQLALDMARKSMVLLQNN-DILPLKRSG-QKIV 393
Query: 454 IIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGC 496
++G + + M G Y+G P ++L+GL+ + + GC
Sbjct: 394 VVGANAIDSTMMWGNYSGYPTQTVTILQGLQTKSDQVTFIPGC 436
Score = 112 bits (279), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 91/334 (27%), Positives = 148/334 (44%), Gaps = 65/334 (19%)
Query: 486 YVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR---------- 535
YV T A D+ + + VR AD VI V G+ E E+
Sbjct: 568 YVQVTSLAMIKFDITHTGLSTPQDIVRKTAGADVVIFVGGISPRLEGEEMEVSDPGFKGG 627
Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
DR ++ LP Q ++ +++ +R V + +G ++ ++ +IL YPGE G
Sbjct: 628 DRTTIELPQAQREVIKALSEAGRRIVFVNCSGSA---IALTPESQRVDAILQAWYPGEQG 684
Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
A+A+++FGD+NP G+LP+T+Y ND + + GR+YR++ T ++
Sbjct: 685 GTAVADVLFGDYNPSGKLPVTFY--------KNDAQLPDFLDYRMAGRTYRYFKETPLFP 736
Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
FG+GLSYT ++ G + I Q
Sbjct: 737 FGYGLSYTQFTI-----------------GQPRYINNQ---------------------- 757
Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
VQ+SV+N G DG VV ++ R + G P K L GF RV +K++S + P E
Sbjct: 758 VQVSVSNTGKRDGDEVVQVYIRRTDDAAG-PIKTLRGFQRVSLKVGETKQVSVSL-PRES 815
Query: 776 LSI--ANKHGRRILPLGNHVLMVGELRHSLTIET 807
A+ + R++P GN+ +MVG + ++T
Sbjct: 816 FEWWDASSNTMRVIP-GNYEVMVGSSSMAKNLKT 848
>gi|255693561|ref|ZP_05417236.1| periplasmic beta-glucosidase(Cellobiase) [Bacteroides finegoldii
DSM 17565]
gi|260620626|gb|EEX43497.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
finegoldii DSM 17565]
Length = 800
Score = 251 bits (641), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 209/735 (28%), Positives = 341/735 (46%), Gaps = 127/735 (17%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ + T FP A++N+ L I A
Sbjct: 151 RLGIPV-DFTNEGIRGLCHD-----------RATMFPAQCGQGATWNKKLIREIAKVTAD 198
Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
EA+A+ G T +AP ++I +DPRWGR E+ GEDP +V + + Q E
Sbjct: 199 EAKAL------GYTNIYAPILDIAQDPRWGRVVESYGEDPYLVGELGKQMILGLQSEG-- 250
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
+ A KH Y + G + +
Sbjct: 251 ------------------------------IVATPKHFAVYSIPVGGRDGGTRTDPHVAP 280
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDC 327
++ + + PFR I++ A +M SYN +G P + R +WGFKGY+ SD
Sbjct: 281 REMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYVVSDS 340
Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTC----MLRHTQSAIDKGKVQEKDIDR 383
+AV + T T E+ AA V+ AG++I + + AI +GKV +D+
Sbjct: 341 EAVEFLHTKHRITPTEEEMAAQVVNAGLNIRTNFTPPQDFILPLRRAISEGKVSLHTLDQ 400
Query: 384 ALLNLFSVQLRLGLFN----GDPRKGKYGKLGPDDVC-TSEHKKLALDAARQGIVLLKND 438
+ + V+ +GLF+ GD R+ P+ V + H+ +++ AA + IVLLKN+
Sbjct: 401 RVGEILRVKFMMGLFDNPYPGDDRR-------PEVVVHNAAHQDVSMRAALESIVLLKNE 453
Query: 439 KKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGC 496
K+ LPL+K + S +A+IGP + ++ Y S K++ +G++ Y+ ++ YA GC
Sbjct: 454 KEMLPLSK-SFSKIAVIGPNAEEVKELTCRYGPANASIKTVYQGIKEYLPNAEVRYAKGC 512
Query: 497 ------------HDVPCNS--DAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLL 542
++VP ++ A +EAV +AK +D I+V G + E+ R +L L
Sbjct: 513 DIIDKYFPESELYNVPLDTQEQAMINEAVELAKASDVAILVLGGNEKTVREEFSRTNLDL 572
Query: 543 PGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEI 602
G+Q L+ +V T K PV+LV+ G +++ A+ + +I+ +PGE A+A++
Sbjct: 573 CGRQQQLLEAVYATGK-PVVLVMVDGRAATINW--ANKYVPAIIHAWFPGEFMGDAIAKV 629
Query: 603 IFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
+FGD+NPGGRL +T +P+S ++P S+ + +Y FG+GLSY
Sbjct: 630 LFGDYNPGGRLAVT-FPKSVGQIPFAFPFKPGSDSKG------KVRVAGVLYPFGYGLSY 682
Query: 663 TNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTN 722
T + Y S+L IS + G+ +NI SCT V N
Sbjct: 683 TTFGY------SDLKISKPV-IGAQENI---------------TLSCT---------VKN 711
Query: 723 AGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
G G VV L+ R S T +K L GF+R+H + ISF + P + L + +K+
Sbjct: 712 TGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTISFTLTP-QDLGLWDKN 770
Query: 783 GRRILPLGNHVLMVG 797
+ + G+ +MVG
Sbjct: 771 NQFTVEPGSFSVMVG 785
>gi|294777452|ref|ZP_06742903.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
vulgatus PC510]
gi|294448520|gb|EFG17069.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
vulgatus PC510]
Length = 864
Score = 251 bits (641), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 162/462 (35%), Positives = 232/462 (50%), Gaps = 64/462 (13%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ ++SLS RA+ L+ LTL+EK+ + DN+ + RLGI Y WW+E+LHG+A +G
Sbjct: 24 YKDSSLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSG--- 80
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
T FPQ + AASF I +AV+ EARA A GLT W
Sbjct: 81 -------LATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNAAYSAAGSYERYQGLTMW 133
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQ--GENWKSDDGGIGFGFREKRV 223
P +NI+RDPRWGRG ET GEDP + S V VK Q N K D
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCMDANQKYDK------------ 181
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSC 282
+ AC KH + +W +R+ FNA I +D +T+ PF +
Sbjct: 182 ---------------IHACAKHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEAL 223
Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTK 341
+++ K +MC+YN++ G P C L Q R +WG+ G + SDC A+ + + + K
Sbjct: 224 VKEAKVKEVMCAYNRLEGDPCCGSDRLLMQILRQDWGYDGIVLSDCGAIDDFYREKGH-K 282
Query: 342 THED----SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
TH D SAA VL +G D+ CG+ +SA KG + EKDID ++ L + LG
Sbjct: 283 THPDAESASAAAVL-SGTDLECGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGE 340
Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
+ DP K ++ K+ VC++EH L+LD AR+ + LL N LPL + ++A++GP
Sbjct: 341 MD-DPDKVEWTKIPYSVVCSAEHDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGP 398
Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGC 496
N+ G Y G P +LL G+ + + K Y GC
Sbjct: 399 NANDSVMQWGNYNGTPKHTITLLEGIRSAMGENDKLIYEQGC 440
Score = 121 bits (303), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 95/324 (29%), Positives = 143/324 (44%), Gaps = 64/324 (19%)
Query: 491 HYASGCHDVPCNSDAGFHEAVRI------AKKADFVIVVAGLDLTQETEDR--------- 535
Y SG D N D GF E V I K AD VI G+ + E E+
Sbjct: 572 QYFSG--DAQLNFDLGFKEEVNIKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFRK 629
Query: 536 -DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEA 594
DR + LP Q L+ ++ K+ VI V G P ++ +IL YPG++
Sbjct: 630 GDRTDIELPAVQRELIKALCDAGKK-VIFVNFSGSP--IAMEPETKYCQAILQAWYPGQS 686
Query: 595 GAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVY 654
G KA AE++FGD+NP GRLP+T+Y + D NM GR+YR++ G ++
Sbjct: 687 GGKAAAEVLFGDYNPAGRLPVTFYRNTAQLPDFEDYNMT--------GRTYRYFKGDPLF 738
Query: 655 GFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRF 714
FG+GLSYT ++Y + + ++K G I+
Sbjct: 739 PFGYGLSYTTFNY------GNIKLEQTIKVGETAKII----------------------- 769
Query: 715 HVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCE 774
+ VTN G+ DG VV ++ + + ++G P K L F RV A + + + P +
Sbjct: 770 ---VPVTNTGNRDGEEVVQVYLKKQEDAEG-PVKTLRAFKRVQIPAGKTVNVELELTP-K 824
Query: 775 QLSIANKHGRRILPL-GNHVLMVG 797
QL + + + GN +MVG
Sbjct: 825 QLEWWDAQTNTMRTIAGNFDIMVG 848
>gi|299149391|ref|ZP_07042448.1| beta-glucosidase [Bacteroides sp. 3_1_23]
gi|298512578|gb|EFI36470.1| beta-glucosidase [Bacteroides sp. 3_1_23]
Length = 853
Score = 251 bits (640), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 152/444 (34%), Positives = 231/444 (52%), Gaps = 57/444 (12%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ N + + R L+S LT++EKI L + IPRLGI Y +E+LHG+ G
Sbjct: 30 YKNENAPVHERVADLLSRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPG--- 86
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG----------LT 163
T FPQ + AA++N L + + ++ EARA +N G LT
Sbjct: 87 -------RFTVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQFSDVLT 139
Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
FW+P +N+ RDPRWGR ET GEDP + FVK QG+ + R
Sbjct: 140 FWSPTVNMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQGD--------------DPRY 185
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
LK + + KH A + E +R+ N I+E+ + + P F C+
Sbjct: 186 LK-------------IVSTPKHFAANNEEH----NRFVCNPQISEKQLREYYFPAFEMCV 228
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
++GKA+ IM +YN +N VP L L +K R +WGF+GY+ SDC + + Y KT
Sbjct: 229 KEGKAASIMTAYNALNDVPCTLNAWLLKKVLRQDWGFQGYVVSDCGGPSLLVNAHKYVKT 288
Query: 343 HEDSAAGVLKAGMDINCGTCML-RHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
E +A ++AG+D+ CG + + +A + V + DID A ++ + +++LGLF+G
Sbjct: 289 KEAAATLSIQAGLDLECGDDVYDEYLLNAYKQYMVSDADIDSAACHVLTARMKLGLFDGT 348
Query: 402 PRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
R Y ++ P + + EH+++ALDAAR+ IVLLKN LPLN N V S+A++G +N
Sbjct: 349 ER-NPYTRISPSVIGSKEHQQIALDAARECIVLLKNKNNMLPLNVNKVKSIAVVG--INA 405
Query: 462 ISQMGGGYTGIP-CSPKSLLRGLE 484
G Y+G P P S+L+G++
Sbjct: 406 GKCEFGDYSGAPVVDPVSILQGIK 429
Score = 121 bits (304), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 83/303 (27%), Positives = 146/303 (48%), Gaps = 49/303 (16%)
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
+ EA + ++ + V+ V G++ + E E +DR + LP Q + + + + +I+VL
Sbjct: 594 YGEAGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNPN-IIVVLV 652
Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
G L V++ D I +I+ YPGE G A+A+++FGD+NP GRLP+T+Y +S ++P
Sbjct: 653 AGSSLAVNWM--DEHIPAIVNAWYPGEQGGTAVADVLFGDYNPAGRLPLTYY-KSLDELP 709
Query: 627 -MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAG 685
+D ++ GR+Y+++ G +Y FG+GLSY+++ Y + LK
Sbjct: 710 AFDDYDITK-------GRTYKYFKGDVLYPFGYGLSYSSFKY------------SDLKVK 750
Query: 686 SDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT 745
N + V + N G G V ++ R+P+
Sbjct: 751 DGANTIS-----------------------VSFRLKNTGKRKGDEVAQVYVRIPETGGVV 787
Query: 746 PEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH-GRRILPLGNHVLMVGELRHSLT 804
P K+L GF R+ + S+ + +D EQL + G+ I+P G +MVG +
Sbjct: 788 PIKELKGFRRIPLKSGESRVVDIELDK-EQLRYWDAGLGQFIVPQGAFDIMVGASSKDIR 846
Query: 805 IET 807
++T
Sbjct: 847 LQT 849
>gi|262405837|ref|ZP_06082387.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|294647798|ref|ZP_06725350.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CC 2a]
gi|294806192|ref|ZP_06765039.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
xylanisolvens SD CC 1b]
gi|345510348|ref|ZP_08789916.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D1]
gi|262356712|gb|EEZ05802.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|292636706|gb|EFF55172.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CC 2a]
gi|294446448|gb|EFG15068.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
xylanisolvens SD CC 1b]
gi|345454537|gb|EEO48843.2| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D1]
Length = 800
Score = 251 bits (640), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 209/738 (28%), Positives = 338/738 (45%), Gaps = 133/738 (18%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP + +E + G+ + T FP A++N+ L I A
Sbjct: 151 RLGIPV-DLTNEGIRGLCHD-----------RATMFPAQCGQGATWNKKLIREIAKVTAN 198
Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
EA+A+ G T ++P ++I +DPRWGR E+ GEDP + + + Q E
Sbjct: 199 EAKAL------GYTNIYSPILDIAQDPRWGRVVESYGEDPYLAGELGKQMILGLQSEG-- 250
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
+ A KH Y + G + +
Sbjct: 251 ------------------------------IVATPKHFAVYSIPVGGRDGGTRTDPHVAP 280
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDC 327
++ + + PFR I++ A +M SYN +G P + R +WGFKGY+ SD
Sbjct: 281 REMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYVVSDS 340
Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG-------TCMLRHTQSAIDKGKVQEKD 380
+AV + T T E+ AA V+ AG++I LRH AI++GKV
Sbjct: 341 EAVEFLHTKHRITPTEEEMAAQVVNAGLNIRTNFTPPQDFILPLRH---AINEGKVSLHT 397
Query: 381 IDRALLNLFSVQLRLGLFN----GDPRKGKYGKLGPDDVCTSE-HKKLALDAARQGIVLL 435
+D+ + + V+ +GLF+ GD R+ P+ V ++ HK +++ AA + +VLL
Sbjct: 398 LDQRVSEILRVKFMMGLFDNPYPGDDRR-------PETVVHNDAHKAVSMKAALESVVLL 450
Query: 436 KNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYA 493
KN + LPL+KN +A+IGP + ++ Y S K++ +G++ Y+ S+ YA
Sbjct: 451 KNKNQMLPLSKN-FKKIAVIGPNAEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVRYA 509
Query: 494 SGC------------HDVPCNS--DAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVS 539
GC ++VP ++ A HEAV +AK +D I+V G + E+ R +
Sbjct: 510 KGCDIIDKYFPESELYNVPLDTQEQAMIHEAVELAKASDIAILVLGGNEKTVREEFSRTN 569
Query: 540 LLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKAL 599
L L G+Q L+ +V T K PV+LV+ G +++ A+ + +I+ +PGE A+
Sbjct: 570 LDLCGRQQQLLEAVYATGK-PVVLVMVDGRAATINW--ANKYVPAIIHAWFPGEFMGDAI 626
Query: 600 AEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHG 659
A+++FGD+NPGGRL +T +P+S ++P S+ + +Y FG+G
Sbjct: 627 AKVLFGDYNPGGRLAVT-FPKSVGQIPFAFPFKPGSDSKG------KVRVDGVLYPFGYG 679
Query: 660 LSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQIS 719
LSYT + Y S+L IS + G +NI SCT
Sbjct: 680 LSYTTFGY------SDLKISKPV-IGPQENI---------------TLSCT--------- 708
Query: 720 VTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIA 779
V N G G VV L+ R S T +K L GF+R+H + ++F + P + L +
Sbjct: 709 VKNTGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVNFTLTP-QDLGLW 767
Query: 780 NKHGRRILPLGNHVLMVG 797
+K+ + + G+ +MVG
Sbjct: 768 DKNNQFTVEPGSFSVMVG 785
>gi|398386387|ref|ZP_10544389.1| beta-glucosidase-like glycosyl hydrolase [Sphingobium sp. AP49]
gi|397718418|gb|EJK79007.1| beta-glucosidase-like glycosyl hydrolase [Sphingobium sp. AP49]
Length = 791
Score = 251 bits (640), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 226/736 (30%), Positives = 332/736 (45%), Gaps = 124/736 (16%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP + E LHG A+ G TSFPQ + A+S++ ++ + +
Sbjct: 138 RLGIPIL-FHEEGLHGYAAVG-----------ATSFPQSIAMASSWDPTMLRQVNQVIGR 185
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
E RA + +P ++I RDPRWGR +ET GEDP +V V V+ QGE
Sbjct: 186 EIRA-----RGVPMVLSPVVDIARDPRWGRIEETYGEDPYLVGEMGVAAVEGLQGEG--- 237
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
R + + G + A KHL + + G A ++E+
Sbjct: 238 ---------RSRLLRPGH-----------VFATLKHLTGHGQPESGT---NVGPAPVSER 274
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPA-CLRGDLFQKARNEWGFKGYITSDCD 328
+ + F PPF +++ +M SYN+++GVP+ R L R EWGF+G + SD
Sbjct: 275 ELRENFFPPFEQVVKRTGIEAVMASYNEIDGVPSHANRWLLDNVLRQEWGFRGAVVSDYS 334
Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSA-IDKGKVQEKDIDRALLN 387
AV + + E++A L AG+D + + T + +GKV E +D A+
Sbjct: 335 AVDQLMSIHHIAANLEEAAMRALDAGVDADLPEGLSYATLGKLVREGKVSEAKVDLAVRR 394
Query: 388 LFSVQLRLGLFNGDPRKGKYGKLGPDDVCTS--EHKKLALDAARQGIVLLKNDKKFLPLN 445
+ ++ R GLF + Y T+ E + LA AA++ I LLKND LPL
Sbjct: 395 MLELKFRAGLF-----ENPYADANAAAAITNNDEARALARTAAQRSITLLKNDG-MLPLK 448
Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGC------ 496
++A+IGP ++++GG Y G P S+L G++A V + +A G
Sbjct: 449 PEG--TIAVIGPSAA-VARLGG-YYGQPPHSVSILEGIKARVGTKANIVFAQGVKITEND 504
Query: 497 ---HDVPCNSDAG-----FHEAVRIAKKADFVIVVAGLDLTQETEDR-------DRVSLL 541
D SD +AV A+ D +I+ G D Q + + DR SL
Sbjct: 505 DWWEDKVVKSDPAENRKLIAQAVEAARNVDRIILTLG-DTEQSSREGWADNHLGDRPSLD 563
Query: 542 LPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAE 601
L G+Q L ++ K P+ +VL G P S + Q ++IL Y GE G A+A+
Sbjct: 564 LVGEQQELFDALKALGK-PITVVLINGRP--ASTVKVSEQANAILEGWYLGEQGGNAVAD 620
Query: 602 IIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLS 661
I+FGD NPGG+LP+T P S ++PM NM+ + R Y F T +Y FG GLS
Sbjct: 621 ILFGDVNPGGKLPVT-VPRSVGQLPMF-YNMKPSAR-----RGYLFDTTDPLYPFGFGLS 673
Query: 662 YTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVT 721
YTN+S LSAP RL I T + V + V
Sbjct: 674 YTNFS---LSAP-----------------------RLSATKI-----GTGGKTSVSVDVR 702
Query: 722 NAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANK 781
N G +G VV L+ R S P K+L GF RV S+ ++F V P E L + N
Sbjct: 703 NTGAREGDEVVQLYIRDKVSSVTRPVKELKGFQRVTLKPGESRTVTFTVGP-EALQMWND 761
Query: 782 HGRRILPLGNHVLMVG 797
RR++ G+ +M G
Sbjct: 762 QMRRVVEPGDFEIMTG 777
>gi|336417083|ref|ZP_08597412.1| hypothetical protein HMPREF1017_04520 [Bacteroides ovatus
3_8_47FAA]
gi|335936708|gb|EGM98626.1| hypothetical protein HMPREF1017_04520 [Bacteroides ovatus
3_8_47FAA]
Length = 850
Score = 251 bits (640), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 152/444 (34%), Positives = 231/444 (52%), Gaps = 57/444 (12%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ N + + R L+S LT++EKI L + IPRLGI Y +E+LHG+ G
Sbjct: 27 YKNENAPVHERVADLLSRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPG--- 83
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG----------LT 163
T FPQ + AA++N L + + ++ EARA +N G LT
Sbjct: 84 -------RFTVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQFSDVLT 136
Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
FW+P +N+ RDPRWGR ET GEDP + FVK QG+ + R
Sbjct: 137 FWSPTVNMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQGD--------------DPRY 182
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
LK + + KH A + E +R+ N I+E+ + + P F C+
Sbjct: 183 LK-------------IVSTPKHFAANNEEH----NRFVCNPQISEKQLREYYFPAFEMCV 225
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
++GKA+ IM +YN +N VP L L +K R +WGF+GY+ SDC + + Y KT
Sbjct: 226 KEGKAASIMTAYNALNDVPCTLNAWLLKKVLRQDWGFQGYVVSDCGGPSLLVNAHKYVKT 285
Query: 343 HEDSAAGVLKAGMDINCGTCML-RHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
E +A ++AG+D+ CG + + +A + V + DID A ++ + +++LGLF+G
Sbjct: 286 KEAAATLSIQAGLDLECGDDVYDEYLLNAYKQYMVSDADIDSAACHVLTARMKLGLFDGT 345
Query: 402 PRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
R Y ++ P + + EH+++ALDAAR+ IVLLKN LPLN N V S+A++G +N
Sbjct: 346 ER-NPYTRISPSVIGSKEHQQIALDAARECIVLLKNKNNMLPLNVNKVKSIAVVG--INA 402
Query: 462 ISQMGGGYTGIP-CSPKSLLRGLE 484
G Y+G P P S+L+G++
Sbjct: 403 GKCEFGDYSGAPVVDPVSILQGIK 426
Score = 123 bits (309), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 86/303 (28%), Positives = 148/303 (48%), Gaps = 49/303 (16%)
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
+ EA + ++ + V+ V G++ + E E +DR + LP Q + + + + +I+VL
Sbjct: 591 YGEAGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNPN-IIVVLV 649
Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
G L V++ D I +I+ YPGE G A+A+++FGD+NP GRLP+T+Y +S ++P
Sbjct: 650 AGSSLAVNWM--DEHIPAIVNAWYPGEQGGTAVADVLFGDYNPAGRLPLTYY-KSLDELP 706
Query: 627 -MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAG 685
+D ++ GR+Y+++ G +Y FG+GLSY+++ Y L +K G
Sbjct: 707 AFDDYDITK-------GRTYKYFKGDVLYPFGYGLSYSSFKYSDL----------KVKDG 749
Query: 686 SDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT 745
++ T S R N G G V ++ R+P+
Sbjct: 750 AN-------------------TVSVSFRLK------NTGKRKGDEVAQVYVRIPETGGVV 784
Query: 746 PEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH-GRRILPLGNHVLMVGELRHSLT 804
P K+L GF R+ + S+ + +D EQL + GR I+P G +MVG +
Sbjct: 785 PIKELKGFRRIPLKSGESRVVEIELDK-EQLRYWDAGLGRFIVPQGAFDIMVGASSKDIR 843
Query: 805 IET 807
++T
Sbjct: 844 LQT 846
>gi|300773468|ref|ZP_07083337.1| possible beta-glucosidase [Sphingobacterium spiritivorum ATCC
33861]
gi|300759639|gb|EFK56466.1| possible beta-glucosidase [Sphingobacterium spiritivorum ATCC
33861]
Length = 777
Score = 251 bits (640), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 199/730 (27%), Positives = 330/730 (45%), Gaps = 127/730 (17%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP + E+ HG + G T FP + A+++N +L + + VA
Sbjct: 126 RLGIPVF-LAEEAPHGHMAIG-----------TTVFPTGIGQASTWNPALLQKMSATVAK 173
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
E R Q + P +++ RDPRW R +E+ GEDP++ A V+
Sbjct: 174 EVRQ-----QGAHISYGPVLDLSRDPRWSRVEESYGEDPVLTGTLAAAIVR--------- 219
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
G+G G + D KH +AY + + G+ A + E+
Sbjct: 220 ---GLGSG--------------NLSDPFATIPTLKHFVAYGIPEGGHNGS---AASVGER 259
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCD 328
+ + F PPF+S + G A +M +YN V+G+P L R EW F G+ SD
Sbjct: 260 ELREYFLPPFQSAVAAG-AKSVMAAYNSVDGIPCSSNKFLLTDILRKEWSFNGFTVSDLG 318
Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
++ I K H+ +A ++AG+D + G A+ +G+VQE ID+A+ +
Sbjct: 319 SIEGIKGSHRVAKDHKQAAILAIEAGLDADLGGNAYVRLIEAVKQGEVQENSIDQAVSRI 378
Query: 389 FSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNA 448
+++ +GLF K K +V T + L+ AR+ IVLL+N LPL K+
Sbjct: 379 LALKFEMGLFEKPFVDVKTAK---KEVKTESNIALSRQVARESIVLLENKNNILPLRKDV 435
Query: 449 VSSLAIIGPLVNNISQMGGGYTG-IPCSPKSLLR-GLEAYVSKTH--YASGCHDVPCNSD 504
+AI+GP +N+ M G YT P + +R + A + K Y GC + ++
Sbjct: 436 --KIAIVGPNADNVYNMLGDYTAPQPDGAVTTVRQAISARLPKAQVSYVKGCA-IRDTTN 492
Query: 505 AGFHEAVRIAKKADFVIVVAG-----------------------LDLTQETEDRDRVSLL 541
+ AV A+++D ++ V G + + E DR +L
Sbjct: 493 SDIPAAVTAARQSDIIVAVVGGSSARDFKTEYISTGAAVASDKSVSDMESGEGFDRSTLD 552
Query: 542 LPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAE 601
L G+QM L+ ++ +T K P++++ G PL++++A +Q ++L YPG+ G A+A+
Sbjct: 553 LLGRQMELLKALKQTGK-PLVVIYIQGRPLNMNWAA--TQADALLCAWYPGQEGGHAIAD 609
Query: 602 IIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLS 661
++FGD+NP G++P++ P S ++P++ N ++ +Y + T +Y FG+G S
Sbjct: 610 VLFGDYNPAGKMPLS-VPRSVGQIPVH-YNRKSSLDHRYVEEA-----ATPLYAFGYGKS 662
Query: 662 YTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVT 721
Y+++ YK L E T DY HV ++T
Sbjct: 663 YSDFEYKDLKIQKENT---------------------DY--------------HVSFTLT 687
Query: 722 NAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANK 781
N G DG V L+ R S P +QL F+R+H SK +SF V S+ N
Sbjct: 688 NTGKYDGDEVPQLYIRNQYASVSQPVQQLKHFERIHLKTGESKTVSF-VLTAGDFSVINT 746
Query: 782 HGRRILPLGN 791
+++L G+
Sbjct: 747 QMKKVLEPGS 756
>gi|431797765|ref|YP_007224669.1| beta-glucosidase-like glycosyl hydrolase [Echinicola vietnamensis
DSM 17526]
gi|430788530|gb|AGA78659.1| beta-glucosidase-like glycosyl hydrolase [Echinicola vietnamensis
DSM 17526]
Length = 799
Score = 251 bits (640), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 210/754 (27%), Positives = 348/754 (46%), Gaps = 134/754 (17%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP E HG + G T FP + A+++N +L + +A+A+
Sbjct: 137 RLGIPMM-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPALIQEMAAAIAL 184
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
EAR Q G + P +++ R+PRW R +ET GEDP + S V FQGE+ S
Sbjct: 185 EARL-----QGGHIGYGPVLDLAREPRWSRVEETYGEDPYINSQMGRAMVSGFQGESIAS 239
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
I + KH AY + + G+ + + ++
Sbjct: 240 GKNVI--------------------------STLKHFTAYGVPEGGHNGT---SVSVGQR 270
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCD 328
+ +++ PPF++ + +G A +M +YN ++GVP G L R++WGF G++ SD
Sbjct: 271 ELHESYLPPFKAAVAEG-ALSVMTAYNSIDGVPCTSNGHLLNDVLRDDWGFNGFVVSDLG 329
Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSAIDKGKVQEKDIDRALLN 387
+++ + + T+T E +A + AG+D + G ++ +A+ G V ++ +D A+
Sbjct: 330 SISGLRGSHHVTETAEGAAQLAINAGVDSDLGGYGFGKNLLAAVQAGGVSQEVLDEAVRR 389
Query: 388 LFSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPL 444
+ V+ +GLF DP K + V +++H LA AR+ +VLLKN+ LPL
Sbjct: 390 VLKVKFDMGLFENPYVDPSKAE------SLVRSAKHIALARKVARESVVLLKNENDLLPL 443
Query: 445 NKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPK--SLLRGLEAYVSK---THYASGCHDV 499
K V+S+A+IGP +N G YT + ++L G++ V K +Y GC +
Sbjct: 444 RKK-VNSIAVIGPNADNTYNQLGDYTAPQPNENVVTVLEGIKNKVGKDVRVNYVKGCA-I 501
Query: 500 PCNSDAGFHEAVRIAKKADFVIVVAG----LDLTQETEDR-------------------- 535
+ + +A +A ++D +VV G D E E+
Sbjct: 502 RDTTQSEIGKAASLAARSDVAVVVLGGSSARDFDTEYEETAAAKVSEAEEGQVISDMESG 561
Query: 536 ---DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPG 592
DR++L L G Q+ LV +V T PV++VL G PL++++ D + +I+ YPG
Sbjct: 562 EGFDRMTLDLLGDQLKLVQAVQATGT-PVVVVLIKGRPLNLNWI--DEHVPAIVDAWYPG 618
Query: 593 EAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQ 652
+ G A+A+++FGD+NP GRL ++ P S ++P+ N R Y S
Sbjct: 619 QEGGNAIADVLFGDYNPSGRLTIS-VPRSVGQLPVF-YNYRNPKRHDYVEGS-----AEP 671
Query: 653 VYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSL 712
+Y FGHGLSY ++ Y L ++AS AGS ++
Sbjct: 672 LYAFGHGLSYADFEY------DNLEVTASGMAGS-----------------------PTV 702
Query: 713 RFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDP 772
R H Q+S N +VDG VV L+ R S P +L F++V A S +I+F +
Sbjct: 703 RVHFQVS--NISNVDGEEVVQLYVRDEAGSTVRPLLELKRFEKVMVPAGESSKITFMLT- 759
Query: 773 CEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
E L + + ++ G+ ++VG + +E
Sbjct: 760 AEDLQVLGQDMNWLVEPGSFQVLVGRSSRDIRLE 793
>gi|383115541|ref|ZP_09936297.1| hypothetical protein BSGG_2589 [Bacteroides sp. D2]
gi|313695054|gb|EFS31889.1| hypothetical protein BSGG_2589 [Bacteroides sp. D2]
Length = 800
Score = 251 bits (640), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 209/735 (28%), Positives = 341/735 (46%), Gaps = 127/735 (17%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ + T FP A++N+ L I A
Sbjct: 151 RLGIPV-DFTNEGIRGLCHD-----------RATMFPAQCGQGATWNKKLIREIAKVTAD 198
Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
EA+A+ G T +AP ++I +DPRWGR E+ GEDP +V + + Q E
Sbjct: 199 EAKAL------GYTNIYAPILDIAQDPRWGRVVESYGEDPYLVGELGKQMILGLQSEG-- 250
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
+ A KH Y + G + +
Sbjct: 251 ------------------------------IVATPKHFAVYSIPVGGRDGGTRTDPHVAP 280
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDC 327
++ + + PFR I++ A +M SYN +G P + R +WGFKGY+ SD
Sbjct: 281 REMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYVVSDS 340
Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTC----MLRHTQSAIDKGKVQEKDIDR 383
+AV + T T E+ AA V+ AG++I + + AI +GKV +D+
Sbjct: 341 EAVEFLHTKHRITPTEEEMAAQVVNAGLNIRTNFTPPQDFILPLRRAISEGKVSLHTLDQ 400
Query: 384 ALLNLFSVQLRLGLFN----GDPRKGKYGKLGPDDVC-TSEHKKLALDAARQGIVLLKND 438
+ + V+ +GLF+ GD R+ P+ V + H+ +++ AA + IVLLKN+
Sbjct: 401 RVGEILRVKFMMGLFDNPYPGDDRR-------PEVVVHNAAHQDVSMRAALESIVLLKNE 453
Query: 439 KKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGC 496
K+ LPL+K + S +A+IGP + ++ Y S K++ +G++ Y+ ++ YA GC
Sbjct: 454 KEMLPLSK-SFSKIAVIGPNAEEVKELTCRYGPANASIKTVYQGIKEYLPNAEVRYAKGC 512
Query: 497 ------------HDVPCNS--DAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLL 542
++VP ++ A +EAV +AK +D I+V G + E+ R +L L
Sbjct: 513 DIIDKYFPESELYNVPLDTQEQAMINEAVELAKASDVAILVLGGNEKTVREEFSRTNLDL 572
Query: 543 PGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEI 602
G+Q L+ +V T K PV+LV+ G +++ A+ + +I+ +PGE A+A++
Sbjct: 573 CGRQQQLLEAVYATGK-PVVLVMVDGRAATINW--ANKYVPAIIHAWFPGEFMGDAIAKV 629
Query: 603 IFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
+FGD+NPGGRL +T +P+S ++P S+ + +Y FG+GLSY
Sbjct: 630 LFGDYNPGGRLAVT-FPKSVGQIPFAFPFKPGSDSKG------KVRVAGVLYPFGYGLSY 682
Query: 663 TNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTN 722
T ++Y S L IS + G+ +NI SCT V N
Sbjct: 683 TTFNY------SNLKISKPV-IGAQENI---------------TLSCT---------VKN 711
Query: 723 AGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
G G VV L+ R S T +K L GF+R+H + ISF + P + L + +K+
Sbjct: 712 TGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTISFTLTP-QDLGLWDKN 770
Query: 783 GRRILPLGNHVLMVG 797
+ + G+ +MVG
Sbjct: 771 NQFTVEPGSFSVMVG 785
>gi|393786770|ref|ZP_10374902.1| hypothetical protein HMPREF1068_01182 [Bacteroides nordii
CL02T12C05]
gi|392658005|gb|EIY51635.1| hypothetical protein HMPREF1068_01182 [Bacteroides nordii
CL02T12C05]
Length = 864
Score = 251 bits (640), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 161/457 (35%), Positives = 229/457 (50%), Gaps = 62/457 (13%)
Query: 49 FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIAS 108
+ P+ + +L+ RA L+ LT++EK+ + +N+ I RLGI YEWW+E+LHG+A
Sbjct: 25 YSQLPYQDPNLTPEQRATDLLQRLTIEEKVSLMQNNSPGILRLGIKPYEWWNEALHGVAR 84
Query: 109 NGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARA----MYNLGQ----A 160
G T FPQ + AASF+ +L + +A++ EARA LGQ
Sbjct: 85 AG----------LATVFPQTIGMAASFDDTLIYEVFNAISDEARAKNRHFNTLGQYKRYQ 134
Query: 161 GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFRE 220
GLT W PNINIFRDPRWGRGQET GEDP + S V VK QG
Sbjct: 135 GLTMWTPNINIFRDPRWGRGQETYGEDPYLTSRMGVAVVKGLQG---------------- 178
Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPF 279
D L AC KH + +W +R+SFNA I +D +T+ P F
Sbjct: 179 ----------PDSARYNKLHACAKHFAVHSGPEW---NRHSFNAENIIPRDLWETYLPAF 225
Query: 280 RSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQN 338
++ +++ +MC+YN+ G P C L Q RNEWGFKG + SDC A++ + +
Sbjct: 226 KTLVQEADVKEVMCAYNRFEGDPCCGSNRLLTQILRNEWGFKGIVVSDCGAISDFWGTKK 285
Query: 339 YTKTHEDSAAGVLKA---GMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
+ TH D+A +A G D+ CG+ + T+ AI G + EK I+ ++ L + L
Sbjct: 286 HN-THPDAAHASAEAVLNGTDLECGSNYRKLTE-AIKAGIISEKQINVSVKRLLKARFEL 343
Query: 396 G-LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
G + N P Y V + +H+ LAL A + + LL+N K LPL+K A +AI
Sbjct: 344 GEMENIHPWTLPYSI-----VDSPKHRCLALKMAHETMTLLQNKGKVLPLDKQA--RIAI 396
Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH 491
IGP N+ G Y G P +LL + +H
Sbjct: 397 IGPNANDSVMQWGNYNGTPSHTSTLLSAFRKRLPISH 433
Score = 90.1 bits (222), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 63/263 (23%), Positives = 114/263 (43%), Gaps = 55/263 (20%)
Query: 515 KKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILV 564
K D +I G+ + E E+ DR + P Q ++ ++ K+ +++
Sbjct: 599 KDIDIIIFAGGISPSLEGEEMNVSATGFKGGDRTDIEFPAVQRKVLAALKEAGKKVILVN 658
Query: 565 LTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK 624
+G ++ +IL YPGE G A+ ++FGD+NP GRLP+T+Y +S +
Sbjct: 659 FSGSA---MALTPETKSCDAILQAWYPGEEGGMAIVNVLFGDYNPAGRLPITFY-KSIDQ 714
Query: 625 VP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLK 683
+P + +M+ GR+YR+ ++ FG+GLSYT +++
Sbjct: 715 LPDFENYSMK--------GRTYRYMQEEPLFPFGYGLSYTTFAF---------------- 750
Query: 684 AGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ 743
+HI++ + + + I + N GD DG VV ++ + +
Sbjct: 751 ---------------GKIHINKNSLSAGEKVTLHIPIKNIGDRDGVEVVQIYIQRQADKE 795
Query: 744 GTPEKQLIGFDRVHTVAKGSKEI 766
G P K L F RV ++E+
Sbjct: 796 G-PVKTLRAFKRVEIPKGKTQEV 817
>gi|427385138|ref|ZP_18881643.1| hypothetical protein HMPREF9447_02676 [Bacteroides oleiciplenus YIT
12058]
gi|425727306|gb|EKU90166.1| hypothetical protein HMPREF9447_02676 [Bacteroides oleiciplenus YIT
12058]
Length = 863
Score = 250 bits (639), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 155/458 (33%), Positives = 229/458 (50%), Gaps = 57/458 (12%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
P+ N +L+ RA L+ LTL+EK + + + AIPRLGI AY+WW+E+LHG+
Sbjct: 25 PYKNPALTPEERAADLVGRLTLEEKASLMQNTSPAIPRLGIKAYDWWNEALHGV------ 78
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY-------NLGQ-AGLTF 164
G T FPQ + ASFN L ++ +AV+ EARA L + GLT
Sbjct: 79 ----GRAGLATVFPQAIGMGASFNNDLLYDVFTAVSDEARAKTAEFSKEGGLKRYQGLTM 134
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
W PN+NIFRDPRWGRGQET GEDP + + V+ QG
Sbjct: 135 WTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPE------------------ 176
Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
G + D+ L AC KH + +W +R+SF+A + +D +T+ P F+ +
Sbjct: 177 ---GGKYDK-----LHACAKHFAVHSGPEW---NRHSFDAENVDPRDLWETYLPAFKDLV 225
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
++ +MC+YN+ G P C L Q R+EW + G I SDC A+ F +
Sbjct: 226 QKAHVKEVMCAYNRFEGEPCCGSNRLLVQILRDEWAYDGIIVSDCWAINDFFNKGAHETE 285
Query: 343 --HEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNG 400
E ++A + G D+ CG Q A+ G + EK ID +L L + LG +
Sbjct: 286 PDKEHASAKAVLTGTDVECGESYASLPQ-AVKAGLIDEKKIDISLKRLMKARFELGEMD- 343
Query: 401 DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVN 460
+P + ++ V + EH++LAL AR+ +VLL+N++ LPLNK+ +A++GP N
Sbjct: 344 NPELVSWAQIPYSVVDSKEHRELALRMARESLVLLQNNQNVLPLNKSL--KVAVVGPNAN 401
Query: 461 NISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGC 496
+ G Y G P +LL G+ Y+ + Y GC
Sbjct: 402 DSVMQWGNYNGFPGHTVTLLEGIRQYLPEAQLIYEPGC 439
Score = 103 bits (256), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 72/267 (26%), Positives = 119/267 (44%), Gaps = 54/267 (20%)
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVART 556
++ ++ K AD ++ G+ E E+ DR ++ LP Q L+ + +
Sbjct: 590 LNKTIQRVKDADIIVFAGGISPAVEGEEMRVTIPGFKGGDRETIELPSIQSRLLAELKKA 649
Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
K+ V + +G ++ +IL YPG+AG A+A ++FGD+NP GRLP+T
Sbjct: 650 GKKVVFVNFSGSA---IALTPETKTCDAILQAWYPGQAGGTAIANVLFGDYNPAGRLPVT 706
Query: 617 WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
+Y + D +M+ GR+YR+ ++ FGHGLSYT + Y +
Sbjct: 707 FYKSTSQLPDFEDYSMK--------GRTYRYMAEAPLFPFGHGLSYTTFRY------GDA 752
Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
++S ++ IL I V+N G+ DG VV ++
Sbjct: 753 SLSTQEVKEGEQAIL-------------------------TIPVSNTGERDGEEVVQVYL 787
Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGS 763
R P +G P L F RV+ +AKG+
Sbjct: 788 RRPGDKEG-PSHALRAFKRVN-IAKGT 812
>gi|224537265|ref|ZP_03677804.1| hypothetical protein BACCELL_02142 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521119|gb|EEF90224.1| hypothetical protein BACCELL_02142 [Bacteroides cellulosilyticus
DSM 14838]
Length = 885
Score = 250 bits (639), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 155/461 (33%), Positives = 238/461 (51%), Gaps = 57/461 (12%)
Query: 51 SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
+ P+ N LS + RAK L+ LTL+EK + D++ AIPRLGI + WWSE+LHG+A+ G
Sbjct: 20 TLPYQNPELSPAERAKDLVKRLTLEEKALLMCDDSEAIPRLGIKKFNWWSEALHGVANQG 79
Query: 111 PGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN---------LGQAG 161
+VT FP+ + AASFN L +I +AV+ E RA +N +
Sbjct: 80 ----------NVTVFPEPVGMAASFNDKLVFDIFNAVSDEMRAKHNERVRNGLEDVRFHS 129
Query: 162 LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQG-ENWKSDDGGIGFGFRE 220
L+ W PN+NIFRDPRWGRGQET GEDP + S + VK QG EN K +R+
Sbjct: 130 LSVWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGIAVVKGLQGPENEK---------YRK 180
Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNA-MITEQDTEDTFQPPF 279
L AC KH + +W SR++ N ++ +D +T+ P F
Sbjct: 181 ------------------LLACAKHYAVHSGPEW---SRHTANLNNVSPRDLWETYLPAF 219
Query: 280 RSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQN 338
++ +++ +MC+Y +++ P C L Q+ R+EWGFK + SDC A+A +
Sbjct: 220 KALVQKADVREVMCAYQRLDDDPCCGNTRLLQQILRDEWGFKYLVVSDCGAIADFWTSHK 279
Query: 339 YTKTHEDSAAGVLKAGMDINCGT-CMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
+ +A AG D+ CG + A+ KG + E+++D+ +L L + LG
Sbjct: 280 SSSDAVHAAVKGTMAGTDVECGYGYAYQKLPEAVSKGLITEEEVDKHVLRLMEGRFELGE 339
Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
+ DP + K+ V HK L+L+ +RQ + LL+N LPL+K ++ +A+IGP
Sbjct: 340 MD-DPSLVNWTKIPMSVVNCKAHKDLSLNMSRQTMTLLQNKNNVLPLSK-SIRKIAVIGP 397
Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGC 496
++ + G Y G P ++L G ++ + K Y GC
Sbjct: 398 NADDKPMLWGNYNGTPNQTITILDGFKSKLKKNQIVYMKGC 438
Score = 82.0 bits (201), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 70/270 (25%), Positives = 115/270 (42%), Gaps = 54/270 (20%)
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVART 556
+ E++ K D V+ V G+ E E+ DR + LP Q + + ++
Sbjct: 590 YSESISKLKGIDVVVFVGGISPQLEGEEMPVNIPGFKGGDRTDIELPAVQRNFLKALKDA 649
Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
K+ V + +G ++ +IL Y GE G A+A+++FGD+NP G+LP+T
Sbjct: 650 GKQVVFVNCSGSS---MALLPETESCDAILQAWYGGELGGYAVADVLFGDYNPSGKLPVT 706
Query: 617 WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
+Y + D +M+ GR+YR Y ++ FG GLSYT++
Sbjct: 707 FYKSTKQLPDYEDYSMK--------GRTYR-YMSDPLFPFGFGLSYTDF----------- 746
Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
A A +K L H DE + T + V+N G G+ VV ++
Sbjct: 747 ---AVGTASCNKTQL----------HTDESLTLT-------VPVSNTGKRSGTEVVQVYI 786
Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEI 766
R + G P K L + RV A +++
Sbjct: 787 RKTDDADG-PLKSLKAYARVELAAGAKQDV 815
>gi|224535242|ref|ZP_03675781.1| hypothetical protein BACCELL_00103 [Bacteroides cellulosilyticus
DSM 14838]
gi|224523140|gb|EEF92245.1| hypothetical protein BACCELL_00103 [Bacteroides cellulosilyticus
DSM 14838]
Length = 864
Score = 250 bits (639), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 155/462 (33%), Positives = 240/462 (51%), Gaps = 58/462 (12%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
P+ N LS S RA L+ +TL+EK+ Q+ + + AI RLGIPAY+WW+E+LHG+A G
Sbjct: 23 PYKNPELSPSERAWDLLKRMTLEEKVSQMKNGSPAIERLGIPAYDWWNEALHGVARAG-- 80
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN--------LGQAGLTF 164
T FPQ + AA+F+ V+ EARA Y+ G GLTF
Sbjct: 81 --------KATVFPQAIGLAATFDNQAVYETFDIVSDEARAKYHDFQRKGERDGYKGLTF 132
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
W PNINI+RDPRWGRG ET GEDP + S + VK QG GG G
Sbjct: 133 WTPNINIYRDPRWGRGMETYGEDPYLTSLMGLAVVKGLQG-------GGTG--------- 176
Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
+ D+ AC KH + +W +R+SF+A I+++D +T+ F++ +
Sbjct: 177 -----KYDKA-----HACAKHYAVHSGPEW---NRHSFDAKNISQRDLWETYLSAFKTLV 223
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
++GK +MC+YN+ G P C L + R +WG+ + SDC A+ F Y N+ +T
Sbjct: 224 KEGKVKEVMCAYNRFEGEPCCSNKQLLIRILREDWGYDDIVVSDCGAIGD-FYYPNHHET 282
Query: 343 H---EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
H ++A + +G D+ CG A+ KG + E+ I+ ++ L + +LG+F+
Sbjct: 283 HPTAAAASADAVVSGTDLECGGSY-SSLNEAVRKGLISEEKINESVFRLLRARFQLGMFD 341
Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
D + ++ V + EH AL+ AR+ +VLL N LPL+K ++ +A++GP
Sbjct: 342 DDALVS-WSEIPYSVVESKEHVTKALEMARKSMVLLTNKNHTLPLSK-SIRKVAVLGPNA 399
Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSK--THYASGCHDV 499
N+ + Y G P ++L G+++ + + +Y GC V
Sbjct: 400 NDSVMLWANYNGFPTKSVTILEGIKSKLPEGTVYYEKGCDYV 441
Score = 117 bits (292), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 98/318 (30%), Positives = 144/318 (45%), Gaps = 59/318 (18%)
Query: 493 ASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLL 542
AS D+ + + E A +AD +I V GL T E E+ DR ++ L
Sbjct: 576 ASLKFDIGIKKEINYKEVADKAAEADAIIFVGGLSPTLEGEEMPVDLPGFRKGDRTNIDL 635
Query: 543 PGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEI 602
P Q ++ ++ +T K PVI VL G L + + EA++ + +IL YPG+ G A+A++
Sbjct: 636 PHVQAEMLKALKKTGK-PVIFVLCSGSTLALPW-EAEN-LDAILEAWYPGQQGGTAVADV 692
Query: 603 IFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
+FGD+NP GRLP+T+Y S D +M R+YR++ G ++ FGHGLSY
Sbjct: 693 LFGDYNPAGRLPLTFYASSNDLPDFEDYDMS--------NRTYRYFKGKALFPFGHGLSY 744
Query: 663 TNYSYKFLSAPSELTISASLKAGSDK-NILQQTGSRLDYVHIDEVTSCTSLRFHVQISVT 721
T + Y KA DK N+ G L I +
Sbjct: 745 TIFDYG--------------KAKVDKQNVRAGEGMTL------------------TIPLK 772
Query: 722 NAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANK 781
N G +DG V+ ++ R P +G P K L F RV A ++ I + P N
Sbjct: 773 NTGKLDGDEVIQVYLRNPADKEG-PIKTLRAFRRVSLPAGQTENIRIEL-PASTFECFNP 830
Query: 782 HGRR--ILPLGNHVLMVG 797
R ILP G + L+ G
Sbjct: 831 STNRMEILP-GKYELLYG 847
>gi|383119099|ref|ZP_09939838.1| hypothetical protein BSHG_1822 [Bacteroides sp. 3_2_5]
gi|251946311|gb|EES86688.1| hypothetical protein BSHG_1822 [Bacteroides sp. 3_2_5]
Length = 859
Score = 250 bits (639), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 227/828 (27%), Positives = 357/828 (43%), Gaps = 168/828 (20%)
Query: 51 SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSD-NASAI--------------------- 88
++ + N SL + R + L+S +TL+EKI Q+ +A +I
Sbjct: 23 NFKYKNASLPVEVRVQDLLSRMTLEEKIAQMRHIHAYSIMENGKLNEEKLEKMIGGQNYG 82
Query: 89 --------------------------PRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSV 122
PRLGIP + +ESLHG +G
Sbjct: 83 FIEGITLPGKECLTLMNEVQKYMREKPRLGIPVFTL-TESLHGSVHDGS----------- 130
Query: 123 TSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQE 182
T FPQ + ++FN L + SA+A E A Q P I++ RD RWGR +E
Sbjct: 131 TIFPQAIALGSTFNPILAYEMTSAIAKELSA-----QGITQSLTPVIDVCRDLRWGRVEE 185
Query: 183 TPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSAC 242
GEDP +VS V V+ G+ + +V S
Sbjct: 186 CFGEDPFLVSRMGVSQVR----------------GYLDNQV----------------SPM 213
Query: 243 CKHLIAYDLEKWG-NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGV 301
KH A+ + G N + S +++ + F + +++ K +M SYN N
Sbjct: 214 IKHFGAHGTPQGGLNLASVSCG----QRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNE 269
Query: 302 PACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG 360
P L + R+ W F+GY+ SD A+ + + + ++A L AG+D
Sbjct: 270 PNSSSHYLMTELLRDRWDFQGYVYSDWGAIGMLNYFHKTAQNSAEAAIQALTAGLDAEAS 329
Query: 361 TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD-PRKGKYGKLGPDDVCTSE 419
Q ++ G + K ID+A+ + + + +GLF P + Y K+ V
Sbjct: 330 DNSYAELQQLVENGMLDVKYIDQAVARILTAKFNMGLFEYPLPMEKNYDKV----VHAPA 385
Query: 420 HKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPK-- 477
H LA A + IVLL+N+ LPL N + S+A+IGP + + G YT +
Sbjct: 386 HVSLARKIAEESIVLLQNENNILPLQMNKLKSIAVIGPNADQVQ--FGDYTWSRDNKDGV 443
Query: 478 SLLRGLEAYVSKT---HYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG-------LD 527
+LL L+ VS +YA GC D+ + +GF EAV +AKK+D IVV G D
Sbjct: 444 TLLEALKERVSNQLTLNYAKGC-DLVTDDCSGFKEAVDVAKKSDVCIVVVGSASASLARD 502
Query: 528 LTQET--EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSI 585
+ T E D L L G Q LV ++ T K PVI+VL G P +S+ + + I I
Sbjct: 503 YSNATCGEGFDLSDLTLTGVQEDLVEAIHATGK-PVIVVLLSGKPFAMSWIKEN--IPGI 559
Query: 586 LWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPES-------FTKVPMNDMNMRADSSR 638
+ YPGE G ALA+++ G NP G+L + +P+S + +P + R+ S+
Sbjct: 560 VVQWYPGEQGGLALADMLLGKVNPSGKLNYS-FPQSVGHLPCYYNYLPTDKGFYRSPGSK 618
Query: 639 QYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRL 698
PG+ Y F + ++ FGHGLSYT++ Y L T S+
Sbjct: 619 NKPGKDYVFSSPKALWAFGHGLSYTDFEY-----------------------LSATTSKE 655
Query: 699 DYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHT 758
DY +C + V I++ N GD DG V ++ R S P ++L GF++V
Sbjct: 656 DY-------ACEDV-IEVTIAIRNTGDYDGLEVPQVYVRDMVSSVVMPVQELKGFEKVLI 707
Query: 759 VAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
+K++ + P +L++ NK ++++ G L +G + I+
Sbjct: 708 KKGETKQVIIKI-PVSELALYNKEMKKVVEPGAFELQIGRASDDIRIK 754
>gi|224538282|ref|ZP_03678821.1| hypothetical protein BACCELL_03173 [Bacteroides cellulosilyticus
DSM 14838]
gi|224520107|gb|EEF89212.1| hypothetical protein BACCELL_03173 [Bacteroides cellulosilyticus
DSM 14838]
Length = 864
Score = 250 bits (639), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 158/455 (34%), Positives = 234/455 (51%), Gaps = 60/455 (13%)
Query: 50 DSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASN 109
+ +P+ +TSL+ RA L+ LTL+EK + + + AIPRL I AY WW+E+LHG+A
Sbjct: 25 EKFPYQDTSLTAEERADDLLKRLTLEEKASLMMNGSPAIPRLSIKAYGWWNEALHGLART 84
Query: 110 GPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY-------NLGQ-AG 161
G T FPQ + ASF+ SL + +AV+ EARA NL +
Sbjct: 85 G----------LATVFPQAIGMGASFDDSLLYEVFTAVSDEARAKSRRLDSKGNLTRYQA 134
Query: 162 LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREK 221
LT W PN+NIFRDPRWGRGQET GEDP + S V V QG
Sbjct: 135 LTVWTPNVNIFRDPRWGRGQETYGEDPYLTSRLGVAVVNGLQG----------------- 177
Query: 222 RVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFR 280
D L AC KH + +W +R+SFNA I+ +D +T+ P F+
Sbjct: 178 ---------PDTARYNKLHACAKHYAVHSGPEW---NRHSFNAENISPRDLWETYLPAFK 225
Query: 281 SCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNY 339
+ +++ K +MC+YN+ G P C L Q R+EWGF G + SDC AV+ ++ + +
Sbjct: 226 TLVQEAKVKEVMCAYNRFEGEPCCGSNRLLTQILRDEWGFDGVVVSDCGAVSDFWQKRKH 285
Query: 340 TKTHEDSAAGVLKA---GMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
+TH D+A+ A G D+ CG + A+ G + E ID ++ L + LG
Sbjct: 286 -ETHPDAASASADAVLNGTDVECGNS-YKSLPDAVKAGLITENQIDISVKRLLKARFELG 343
Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIG 456
+ + G + D V + +H++LAL AR+ + LL+N+ LPL+K A +A+IG
Sbjct: 344 EMDENVWTG----ISSDVVDSPKHRQLALQMARETMTLLQNNNNILPLSKQA--KIALIG 397
Query: 457 PLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH 491
P N+ G Y G+P +LL G++ Y+ ++
Sbjct: 398 PNANDSVMQWGNYNGLPSHTITLLEGMQRYLPTSN 432
Score = 117 bits (293), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 105/363 (28%), Positives = 154/363 (42%), Gaps = 64/363 (17%)
Query: 452 LAIIGPLVNN----ISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGF 507
L I G LV + QM YT S KS L Y A+ D+ N+
Sbjct: 533 LYINGKLVKEDEGAVKQMTSIYTLKAESGKSY-DILLKYRHFDKTANLSFDMGVNAQIDV 591
Query: 508 HEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTS 557
+ K D VI G+ E E+ DR + LP Q +V ++
Sbjct: 592 KGLLERIKDVDVVIFAGGISPALEGEEMPVDAAGFRGGDRTEIELPAVQRRVVEALKTAG 651
Query: 558 KRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTW 617
KR V + +G ++ +IL YPG+AG +A+AE++FGD+NP G+LP+T+
Sbjct: 652 KRIVFVNFSGAA---IALEPESQNCEAILQAWYPGQAGGQAVAEVLFGDYNPAGKLPLTF 708
Query: 618 YPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
Y + ++P D NM GR+YR+ T ++ FGHGLSYT + Y
Sbjct: 709 Y-RNLAQIPDFEDYNM--------TGRTYRYMKETPLFPFGHGLSYTTFKY--------- 750
Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
LK DK Q ++ I VTN G DG VV ++
Sbjct: 751 ---GKLKMNDDKIAAGQN-------------------LNLAIPVTNTGSRDGDEVVQVYL 788
Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGR--RILPLGNHVL 794
+ ++G P K L F RV A + E+ F +D QL ++ R+ P GN+ +
Sbjct: 789 KKMDDTEG-PVKTLRAFKRVRIPAGKTVEVKFSLDDT-QLEWWDEQSNTMRVCP-GNYTV 845
Query: 795 MVG 797
M+G
Sbjct: 846 MIG 848
>gi|299146513|ref|ZP_07039581.1| beta-glucosidase [Bacteroides sp. 3_1_23]
gi|298517004|gb|EFI40885.1| beta-glucosidase [Bacteroides sp. 3_1_23]
Length = 736
Score = 250 bits (639), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 216/750 (28%), Positives = 335/750 (44%), Gaps = 135/750 (18%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP + E+ HG + G T FP + AA+++ L +G +A
Sbjct: 83 RLGIPMF-LAEEAPHGHMAIG-----------TTVFPTGIGMAATWSPELVKEVGQVIAK 130
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
E R+ Q G + P +++ RDPRW R +ET GEDP++ V G N
Sbjct: 131 EIRS-----QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGILGASMVDGLGGGNLSQ 185
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
I A KH +AY + + G Y A + +
Sbjct: 186 KYATI--------------------------ATLKHFLAYAVPEGGQNGNY---ASVGIR 216
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCD 328
D F PPFR I+ G A +M SYN ++G+P L K RNEW F+G++ SD
Sbjct: 217 DLHQNFLPPFRKAIDAG-ALSVMTSYNSIDGIPCTSNHYLLTKLLRNEWKFRGFVVSDLY 275
Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
++ I E T E++A + AG+D++ G + A+ G++ + ID A+ +
Sbjct: 276 SIEGIHESHFVAPTKENAAIQSVMAGVDVDLGGDAYTNLCHAVQSGQMDKTVIDTAVCRV 335
Query: 389 FSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
++ +GLF DP K+ V EH +LA A+ I LLKN+ LPL+
Sbjct: 336 LRMKFEMGLFEHPYVDP------KIAAKTVRRKEHIELARKIAQSSITLLKNENSILPLS 389
Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTGIP--CSPKSLLRGLEAYVS--KTHYASGC--HDV 499
K ++ +A+IGP +N M G YT + K++L G+ +S + Y GC D
Sbjct: 390 K-MINKVAVIGPNADNRYNMLGDYTAPQEDSNVKTVLDGIITKLSPSRVEYVRGCAIRDT 448
Query: 500 PCNSDAGFHEAVRIAKKADFVIVVAG----LDLT---QET----------------EDRD 536
N +A+ A++++ VIVV G D +ET E D
Sbjct: 449 TVNE---IEQAIEAARRSEVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFD 505
Query: 537 RVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGA 596
R SL L G+Q L+ S+ +T K P+I+V G PL+ ++A ++L YPG+ G
Sbjct: 506 RASLSLLGRQQELLESLQKTGK-PLIVVYIEGRPLEKNWA--SEYADALLTAYYPGQEGG 562
Query: 597 KALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGF 656
A+A+++FGD+NP GRLP++ P S ++P+ N +A + Y S + +Y F
Sbjct: 563 NAIADVLFGDYNPSGRLPIS-VPRSVGQIPVY-YNQKAPRNHDYVEVS-----SSPLYSF 615
Query: 657 GHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHV 716
G+G+SYT + Y SD ++Q++ C F V
Sbjct: 616 GYGMSYTTFEY------------------SDLQVVQKSA------------RC----FEV 641
Query: 717 QISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQL 776
V N G DG V L+ R S P KQL F+R H K+++F V E
Sbjct: 642 SFKVKNTGKYDGEEVSQLYMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTF-VLTEEDF 700
Query: 777 SIANKHGRRILPLGNHVLMVGELRHSLTIE 806
+ N ++++ GN LM+G + + ++
Sbjct: 701 FLVNYTLKKVVESGNFHLMIGAASNDIRLQ 730
>gi|374374543|ref|ZP_09632202.1| Beta-glucosidase [Niabella soli DSM 19437]
gi|373233985|gb|EHP53779.1| Beta-glucosidase [Niabella soli DSM 19437]
Length = 799
Score = 250 bits (639), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 218/780 (27%), Positives = 346/780 (44%), Gaps = 145/780 (18%)
Query: 51 SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
SYPF N + +I+ K I RLGIP ++ +E +HG+ +
Sbjct: 116 SYPFSNHAEAINKIQKWFIEET-----------------RLGIPV-DFTNEGIHGLNQD- 156
Query: 111 PGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLT-FWAPNI 169
T+FP + +++N+ L +G + EA+A+ G T +AP +
Sbjct: 157 ----------HATAFPAPIGIGSTWNKELVHQMGQIIGREAKAL------GYTNVYAPIL 200
Query: 170 NIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGE 229
++ RD RWGR ET GEDP +V+ GGI
Sbjct: 201 DVARDQRWGRVVETYGEDPFLVAGLGTALA------------GGI--------------- 233
Query: 230 ESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKAS 289
E +++ KH Y + K G + + ++ + F PFR I+
Sbjct: 234 -----QENGVASTLKHFAVYSVPKGGRDGNARTDPHVAPREMQQLFLYPFRKVIQNVHPL 288
Query: 290 CIMCSYNQVNGVPACLRGD-LFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAA 348
+M SYN +G+P L Q R ++GF GY+ SD AV ++E + K ++++
Sbjct: 289 GVMSSYNDWDGMPVTASNYFLTQLLRQQFGFDGYVVSDSRAVEFVYEKHHVAKDYKEAVK 348
Query: 349 GVLKAGMDI----NCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
V++AG+++ N + + + I +G + + +++ + + SV+ RLGLF D
Sbjct: 349 MVMEAGLNVRTEFNAPSNFILPLRQLIKEGGLSMETLNQRVGEVLSVKFRLGLF--DAPY 406
Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN--- 461
K K V T + +AL R+ +VLLKNDK LPL+ ++ + GPL +
Sbjct: 407 VKDPKAADKIVATEASEAVALQMNRESLVLLKNDKNILPLSLGQYRNILVTGPLADEKEH 466
Query: 462 -ISQMGGGYTGIPCSPK--SLLRGLEAYVSK---THYASGCH------------DVPCNS 503
IS+ G P + K S+L G+ + +K +Y GC D P
Sbjct: 467 AISRYG------PSNKKVISVLEGIRHFAAKKATINYIKGCEAADATWPESEIIDTPPTP 520
Query: 504 D--AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPV 561
A ++AV AK+ D +I V G + Q E R L LPG+Q+ L+ + +T K P+
Sbjct: 521 QEIAEMNKAVEAAKQNDIIIAVMGENDKQVGESLSRTGLNLPGRQLRLLEELKKTGK-PM 579
Query: 562 ILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPES 621
+L+L G PL +++ + + +IL +PG AG A+AE IFG +NPGG+L T +P++
Sbjct: 580 VLILINGQPLTINWE--NRYLDAILETWFPGPAGGTAVAEAIFGAYNPGGKLTTT-FPKT 636
Query: 622 FTKVPMNDMNMRADSSRQYPGRSYRFYTGTQV----YGFGHGLSYTNYSYKFLSAPSELT 677
++ MN + S PG Y T V Y FG+GLSYT + Y
Sbjct: 637 TGQIEMN-FPFKPASHAGQPGDGPNGYGKTAVVGPLYPFGYGLSYTTFEY---------- 685
Query: 678 ISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFAR 737
A+LK +D + T V + V N G V G VV L+ +
Sbjct: 686 --ANLK-------------------VDPEKARTQADISVAVDVKNTGKVKGDEVVQLYVK 724
Query: 738 VPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
S T E L GF+RV +K + F + P + LSI +K+ ++ G +MVG
Sbjct: 725 QLVSSVTTYESILRGFERVSLSPGETKTVHFKLTP-DDLSILDKNMNFVVEPGAFDIMVG 783
>gi|423221630|ref|ZP_17208100.1| hypothetical protein HMPREF1062_00286 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392645869|gb|EIY39591.1| hypothetical protein HMPREF1062_00286 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 864
Score = 250 bits (638), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 158/455 (34%), Positives = 234/455 (51%), Gaps = 60/455 (13%)
Query: 50 DSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASN 109
+ +P+ +TSL+ RA L+ LTL+EK + + + AIPRL I AY WW+E+LHG+A
Sbjct: 25 EKFPYQDTSLTAEERADDLLKRLTLEEKASLMMNGSPAIPRLSIKAYGWWNEALHGLART 84
Query: 110 GPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY-------NLGQ-AG 161
G T FPQ + ASF+ SL + +AV+ EARA NL +
Sbjct: 85 G----------LATVFPQAIGMGASFDDSLLYEVFTAVSDEARAKSRRLDSKGNLTRYQA 134
Query: 162 LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREK 221
LT W PN+NIFRDPRWGRGQET GEDP + S V V QG
Sbjct: 135 LTVWTPNVNIFRDPRWGRGQETYGEDPYLTSRLGVAVVNGLQG----------------- 177
Query: 222 RVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFR 280
D L AC KH + +W +R+SFNA I+ +D +T+ P F+
Sbjct: 178 ---------PDTARYNKLHACAKHYAVHSGPEW---NRHSFNAENISPRDLWETYLPAFK 225
Query: 281 SCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNY 339
+ +++ K +MC+YN+ G P C L Q R+EWGF G + SDC AV+ ++ + +
Sbjct: 226 TLVQEAKVKEVMCAYNRFEGEPCCGSNRLLTQILRDEWGFDGVVVSDCGAVSDFWQKRKH 285
Query: 340 TKTHEDSAAGVLKA---GMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
+TH D+A+ A G D+ CG + A+ G + E ID ++ L + LG
Sbjct: 286 -ETHPDAASASADAVLNGTDVECGNS-YKSLPDAVKAGLITENQIDISVKRLLKARFELG 343
Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIG 456
+ + G + D V + +H++LAL AR+ + LL+N+ LPL+K A +A+IG
Sbjct: 344 EMDENVWTG----ISSDVVDSPKHRQLALQMARETMTLLQNNNNILPLSKQA--KIALIG 397
Query: 457 PLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH 491
P N+ G Y G+P +LL G++ Y+ ++
Sbjct: 398 PNANDSVMQWGNYNGLPSHTITLLEGMQRYLPTSN 432
Score = 116 bits (290), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 107/363 (29%), Positives = 155/363 (42%), Gaps = 64/363 (17%)
Query: 452 LAIIGPLVNN----ISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGF 507
L I G LV + QM YT S KS L Y A+ D+ N+
Sbjct: 533 LYINGKLVKEDEGAVKQMTSIYTLKAESGKSY-DILLKYRHFDKTANLSFDMGVNAQIDV 591
Query: 508 HEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTS 557
+ K D VI G+ E E+ DR + LP Q +V ++
Sbjct: 592 KGLLERIKDVDVVIFAGGISPALEGEEMPVDAAGFRGGDRTEIELPAVQRRVVEALKTAG 651
Query: 558 KRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTW 617
KR V + +G ++ +IL YPG+AG +A+AE++FGD+NP G+LP+T+
Sbjct: 652 KRIVFVNFSGAA---IALEPESLNCEAILQAWYPGQAGGQAVAEVLFGDYNPAGKLPLTF 708
Query: 618 YPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
Y + ++P D NM GR+YR+ T ++ FGHGLSYT + Y
Sbjct: 709 Y-RNLAQIPDFEDYNM--------TGRTYRYMKETPLFPFGHGLSYTTFKY--------- 750
Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
LK DK G L+ V I VTN G DG VV ++
Sbjct: 751 ---GKLKMNDDK---IAAGQNLNLV----------------IPVTNTGSRDGDEVVQVYL 788
Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGR--RILPLGNHVL 794
+ ++G P K L F RV A + E+ F +D QL ++ R+ P GN+ +
Sbjct: 789 KKMDDTEG-PVKTLRAFKRVRIPAGKTVEVKFSLDDT-QLEWWDEQSNTMRVCP-GNYTV 845
Query: 795 MVG 797
M+G
Sbjct: 846 MIG 848
>gi|333380551|ref|ZP_08472242.1| hypothetical protein HMPREF9455_00408 [Dysgonomonas gadei ATCC
BAA-286]
gi|332826546|gb|EGJ99375.1| hypothetical protein HMPREF9455_00408 [Dysgonomonas gadei ATCC
BAA-286]
Length = 854
Score = 250 bits (638), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 155/437 (35%), Positives = 227/437 (51%), Gaps = 57/437 (13%)
Query: 64 RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
R L+S LT++EKI L + IPRL IP Y +ESLHG+ G T
Sbjct: 40 RIMDLLSRLTIEEKISLLRATSPGIPRLQIPKYYHGNESLHGVVRPG----------RFT 89
Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG----------LTFWAPNINIFR 173
FPQ + A+ +N L I +A++ EAR +N + G LTFW+P +N+ R
Sbjct: 90 VFPQAIGLASMWNPELHHKIATAISDEARGRWNELEQGKLQTQRFTDLLTFWSPTVNMAR 149
Query: 174 DPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDR 233
DPRWGR ET GEDP + FV+ QG+ + R LK
Sbjct: 150 DPRWGRTPETYGEDPYLSGILGTAFVRGLQGD--------------DPRYLK-------- 187
Query: 234 GDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMC 293
+ + KH A + E +R+ N I+E+ + + P F C++ GK++ IM
Sbjct: 188 -----IVSTPKHFAANNEEH----NRFVCNPQISERQLREYYFPAFEMCVKDGKSASIMS 238
Query: 294 SYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLK 352
+YN +N VP L K R++WGF GY+ SDC + + Y KT E +A +K
Sbjct: 239 AYNAINDVPCTANPWLLTKVLRHDWGFNGYVVSDCGGPSLLVSAMKYVKTKEAAATLSIK 298
Query: 353 AGMDINCGT-CMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLG 411
AG+D+ CG ++ +A ++ V DID A + ++ LGLF+ DP Y K+
Sbjct: 299 AGLDLECGDDVYMQPLLNAYNQYMVSRADIDTAAYRVLRARMHLGLFD-DPDLNPYNKIS 357
Query: 412 PDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTG 471
P V ++EHK+LAL+AARQ IVLLKN+ + LPLN V S+A++G +N + G Y+G
Sbjct: 358 PSVVGSAEHKQLALEAARQSIVLLKNNNRTLPLNPKKVKSIAVVG--INAGNSEFGDYSG 415
Query: 472 IPC-SPKSLLRGLEAYV 487
IP +P S+L+G++ V
Sbjct: 416 IPANAPVSILQGIKDKV 432
Score = 119 bits (299), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 83/295 (28%), Positives = 141/295 (47%), Gaps = 53/295 (17%)
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
+ EA + ++ + VI V G++ T E E +DR + LP Q + + + + +++VL
Sbjct: 594 YGEAGKAVRECEQVIAVLGINKTIEREGQDRYDIHLPADQEEFIREIYKVNPN-IVVVLV 652
Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
G L +++ D + +I+ YPGE G A+AE++FG++NPGGRLP+T+Y S ++P
Sbjct: 653 AGSSLAINWM--DEHVPAIVNAWYPGEQGGTAVAEVLFGEYNPGGRLPVTYY-NSLEEIP 709
Query: 627 -MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAG 685
+D ++ GR+Y+++ G +Y FG+GLSYT ++YK L
Sbjct: 710 SFDDYDITK-------GRTYQYFKGKPLYPFGYGLSYTTFAYKNLQ-------------- 748
Query: 686 SDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT 745
+ G+ + V + N G +DG V ++ ++P
Sbjct: 749 -----INDNGNNI----------------KVSFELKNTGRMDGDEVSQVYVKIPSSGIFM 787
Query: 746 PEKQLIGFDRVHTVAKGSK---EISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
P K+L GF R T+ KG+ EI+ D A + I P G + M+G
Sbjct: 788 PIKELKGFQR-STLKKGATKNVEINIRKDLLRYWDDATE--TFITPKGEYEFMIG 839
>gi|389794138|ref|ZP_10197297.1| beta-glucosidase [Rhodanobacter fulvus Jip2]
gi|388432924|gb|EIL89908.1| beta-glucosidase [Rhodanobacter fulvus Jip2]
Length = 869
Score = 250 bits (638), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 156/436 (35%), Positives = 228/436 (52%), Gaps = 65/436 (14%)
Query: 68 LISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQ 127
L++ +TL EK+ QL + A AIPRLGIP Y WW+ESLHG P T+FP+
Sbjct: 25 LVARMTLAEKLPQLLNVAPAIPRLGIPRYNWWTESLHGALGTLP----------TTNFPE 74
Query: 128 VLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ---------AGLTFWAPNINIFRDPRWG 178
+ ASF+ L + +A++ E R ++ L + GL W+PN+NIFRDPRWG
Sbjct: 75 PIGLGASFDAPLVKRVAAAISTEVRGLHTLARQTGRLGRIGTGLDTWSPNLNIFRDPRWG 134
Query: 179 RGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELM 238
RGQET GEDP + + V FV+ QG N +DR + +
Sbjct: 135 RGQETYGEDPFLTARMGVAFVEGMQGPN------------------------ADRPEVI- 169
Query: 239 LSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQV 298
A KH ++ + +R+ N ++ D DT+ P FR+ I + KA IMC+YN++
Sbjct: 170 --ATPKHFAVHNGPE---STRHEANVFVSRHDLVDTYLPAFRAAITEAKAGSIMCAYNRI 224
Query: 299 NGVPACLRGDLFQK--ARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMD 356
+G PAC DL K R WGF GY+ SDCDAV I ++ +Y + A ++AG+D
Sbjct: 225 DGQPACAN-DLLLKHYLRGAWGFTGYVVSDCDAVKDISDHHHYAPDAATAVAAAVRAGVD 283
Query: 357 INCGTCMLRHT-------QSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGK 409
C L T A+ + + DIDRAL+ LF+ + R+G G P G
Sbjct: 284 NECNGATLSDTAGLEDRYSEALQRELISVADIDRALVRLFAARYRVGDLAGLP--GTAPA 341
Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY 469
+ P + ++ + +LALDAA++ +VLLKND LPL +A+IGPL + + G Y
Sbjct: 342 VPPSAIGSAANSQLALDAAQKTLVLLKND-GVLPLKAGV--KVAVIGPLGDATRVLRGNY 398
Query: 470 TG-IPCSPKSLLRGLE 484
+ + +P S+L GL+
Sbjct: 399 SSPLSGTPISVLDGLQ 414
Score = 129 bits (325), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 89/296 (30%), Positives = 143/296 (48%), Gaps = 54/296 (18%)
Query: 512 RIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPV 561
R A +AD ++ V GL E E+ D+ +L LP Q++++ AR + +P+
Sbjct: 593 RAAAQADVLVAVVGLTSDLEAEETSVTVPGFAGGDKTTLDLPADQIAMLKQ-ARATGKPL 651
Query: 562 ILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPES 621
I+V G P+D+++++ ++ ++IL YPG++G A+A ++ G +PGGRLP+T+Y
Sbjct: 652 IVVAMNGSPIDLAWSKQNA--AAILEAWYPGQSGGLAIANVLSGKTDPGGRLPLTFYRSV 709
Query: 622 FTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISAS 681
P D M GR+YR++ G VY FG GLSYT ++Y L
Sbjct: 710 ADLPPFGDYRME--------GRTYRYFKGEPVYPFGFGLSYTEFTYGPL----------- 750
Query: 682 LKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKV 741
N+ G + +H+ V VTNAG +G V L+ R P +
Sbjct: 751 -------NVTPAAGGAANGLHVSTV-------------VTNAGQREGGDVAQLYLRFPAI 790
Query: 742 SQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
G P+ L GF RV ++ +SF + P LS G+R + G++ + VG
Sbjct: 791 P-GAPQVALRGFQRVALKPGETRTVSFDLSP-RDLSAVTADGQREVMAGDYTVSVG 844
>gi|423295566|ref|ZP_17273693.1| hypothetical protein HMPREF1070_02358 [Bacteroides ovatus
CL03T12C18]
gi|392672275|gb|EIY65744.1| hypothetical protein HMPREF1070_02358 [Bacteroides ovatus
CL03T12C18]
Length = 782
Score = 250 bits (638), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 216/750 (28%), Positives = 335/750 (44%), Gaps = 135/750 (18%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP + E+ HG + G T FP + AA+++ L +G +A
Sbjct: 129 RLGIPMF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSPELVKEVGQVIAK 176
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
E R+ Q G + P +++ RDPRW R +ET GEDP++ V G N
Sbjct: 177 EIRS-----QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGTLGASMVDGLGGGNLSQ 231
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
I A KH +AY + + G Y A + +
Sbjct: 232 KYATI--------------------------ATLKHFLAYAVPEGGQNGNY---ASVGIR 262
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCD 328
D F PPFR I+ G S +M SYN ++G+P L Q RNEW F+G++ SD
Sbjct: 263 DLHQNFLPPFRKAIDAGALS-VMTSYNSIDGIPCTSNHYLLTQLLRNEWKFRGFVVSDLY 321
Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
++ I E T E++A + AG+D++ G + A+ G++ + ID A+ +
Sbjct: 322 SIEGIHESHFVAPTKENAAIQSVMAGVDVDLGGDAYTNLCHAVQSGQMDKAVIDTAVCRV 381
Query: 389 FSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
++ +GLF DP K+ V EH +LA A+ I LLKN+ LPL+
Sbjct: 382 LRMKFEMGLFEHPYVDP------KIAAKTVRRKEHIELARKIAQSSITLLKNENSILPLS 435
Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTGIP--CSPKSLLRGLEAYVS--KTHYASGC--HDV 499
K ++ +A+IGP +N M G YT + K++L G+ +S + Y GC D
Sbjct: 436 K-MINKVAVIGPNADNRYNMLGDYTAPQEDSNVKTVLDGIITKLSPSRVEYVRGCAIRDT 494
Query: 500 PCNSDAGFHEAVRIAKKADFVIVVAG----LDLT---QET----------------EDRD 536
N +A+ A++++ VIVV G D +ET E D
Sbjct: 495 TVNE---IEQAIEAARRSEVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFD 551
Query: 537 RVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGA 596
R SL L G+Q L+ S+ +T K P+I+V G PL+ ++A ++L YPG+ G
Sbjct: 552 RASLSLLGRQQELLESLQKTGK-PLIVVYIEGRPLEKNWA--SEYADALLTAYYPGQEGG 608
Query: 597 KALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGF 656
A+A+++FGD+NP GRLP++ P S ++P+ N +A + Y S + +Y F
Sbjct: 609 NAIADVLFGDYNPSGRLPIS-VPRSVGQIPVY-YNQKAPRNHDYVEVS-----SSPLYSF 661
Query: 657 GHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHV 716
G+G+SYT + Y SD ++Q++ C F V
Sbjct: 662 GYGMSYTTFEY------------------SDLQVVQKSA------------RC----FEV 687
Query: 717 QISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQL 776
V N G DG V L+ R S P KQL F+R H K+++F V E
Sbjct: 688 SFKVKNTGKYDGEEVSQLYMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTF-VLTEEDF 746
Query: 777 SIANKHGRRILPLGNHVLMVGELRHSLTIE 806
+ N ++++ GN LM+G + + ++
Sbjct: 747 FLVNYTLKKVVESGNFHLMIGAASNDIRLQ 776
>gi|182413194|ref|YP_001818260.1| glycoside hydrolase family 3 [Opitutus terrae PB90-1]
gi|177840408|gb|ACB74660.1| glycoside hydrolase family 3 domain protein [Opitutus terrae
PB90-1]
Length = 859
Score = 250 bits (638), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 217/754 (28%), Positives = 326/754 (43%), Gaps = 114/754 (15%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ S TSFP L A++++ +L IG
Sbjct: 161 RLGIPV-DFTNEGIRGLLH-----------SKATSFPAELAVASTWDPALVREIGRITGR 208
Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
EARA+ G T ++P +++ RDPRWGR ET GEDP +V VE V+ Q E+
Sbjct: 209 EARAL------GYTNIYSPVLDLARDPRWGRTIETYGEDPFLVGTLGVEQVRGLQAEH-- 260
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
+ + KH Y + K G + T
Sbjct: 261 ------------------------------VVSTLKHFAVYSIPKGGRDGEARTDPQATW 290
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGD---LFQKARNEWGFKGYITS 325
++ + F PFR I + A +M SYN +GVP + G L + R +WGF+GY+ S
Sbjct: 291 REVQTIFLEPFRRAIREAGALGVMASYNDYDGVP--VEGSALFLSEILRGQWGFRGYVVS 348
Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDI 381
D AV I T D+ ++AG++I + + GK+ I
Sbjct: 349 DSAAVEFIHSKHRVAPTPADAIRQAVEAGLNIRTNFTPPAAYAEPLRQLVRDGKLAMATI 408
Query: 382 DRALLNLFSVQLRLGLFN----GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKN 437
D + ++ V+ +LGLF+ DP P EH +A A R+ IVLLKN
Sbjct: 409 DARVRDVLRVKFQLGLFDRPYVADPAAADRVVRAP------EHLVVAQRAGREAIVLLKN 462
Query: 438 DKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVS---KTHYAS 494
+ LPL++ + + + GPL ++ Y + L GL A + + YA
Sbjct: 463 EPALLPLDRAKLQRVLVAGPLADDAHAWWSRYGAQRLDFVTPLPGLRAKLGAAVEVRYAK 522
Query: 495 GCH-------------DVP-CNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSL 540
G D P AG AV A+ D +I V G E R+SL
Sbjct: 523 GVEAKDAAWPASDVLKDPPSAEVRAGIEAAVAAAQNVDVIIAVLGETDELCRESSSRISL 582
Query: 541 LLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALA 600
LPG Q L+ ++ T K P++LVL+ G PL V +A + +I+ + +PGE G ALA
Sbjct: 583 ALPGYQQELLEALHATGK-PLVLVLSNGRPLSVVWAA--RHVPAIVELWFPGEDGGAALA 639
Query: 601 EIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQ--VYGFGH 658
++ GD NP GRLP+T +P+S ++P N +PG R + + ++ FGH
Sbjct: 640 AVLLGDANPSGRLPIT-FPQSVGQLPYN--------FPAHPGSQARDFGQVEGSLFPFGH 690
Query: 659 GLSYTNYSYKFLS-APSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQ 717
GLSYT + Y L P + + AG L+ + SR + V F +
Sbjct: 691 GLSYTTFRYSDLRITPERIPVDGFGAAGGGDPGLRGSASRATPYSVSTVPE-----FTIT 745
Query: 718 ISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLS 777
VTN G G VV L+ R S T + L GF RV +K ++F + L
Sbjct: 746 CDVTNTGTRAGDEVVQLYLRDDYSSVTTYDIALRGFARVTLAPGETKPVTFTLHRA-HLE 804
Query: 778 IANKHGRRILPLGNHVLMVG------ELRHSLTI 805
+ N+ G ++ G +M+G LR + T+
Sbjct: 805 LYNRDGDWVVEPGRFTVMLGASSADIRLRGTFTV 838
>gi|167765093|ref|ZP_02437206.1| hypothetical protein BACSTE_03479 [Bacteroides stercoris ATCC
43183]
gi|167696721|gb|EDS13300.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
stercoris ATCC 43183]
Length = 944
Score = 249 bits (637), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 205/753 (27%), Positives = 331/753 (43%), Gaps = 123/753 (16%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ S T+FP L ++NR L +G
Sbjct: 153 RLGIPV-DFTNEGIRGVES-----------YKATNFPTQLGLGHTWNRELIRQVGLITGR 200
Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
EAR + G T +AP +++ RD RWGR +E GE P +V+ +E V+ Q +
Sbjct: 201 EARML------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGLQHNH-- 252
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
++A KH AY K + +
Sbjct: 253 -----------------------------QVAATAKHFAAYSNNKGAREGMARVDPQMPP 283
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF---QKARNEWGFKGYITS 325
++ E+ PF+ I + +M SYN +G+P ++G + + R E GF+GY+ S
Sbjct: 284 REVENIHIYPFKRVIREAGLLGVMSSYNDYDGIP--IQGSYYWLTTRLRKEMGFRGYVVS 341
Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDI 381
D DAV ++ N K +++ ++AG+++ C + + + +G + E+ I
Sbjct: 342 DSDAVEYLYTKHNTAKDMKEAVRQSVEAGLNVRCTFRSPDSFVLPLRELVKEGGLSEEVI 401
Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF 441
+ + ++ V+ +GLF+ + G D+V ++ +AL A+R+ IVLLKN
Sbjct: 402 NDRVRDILRVKFLIGLFDAPYQTDLAG--ADDEVEKEANEAVALQASRESIVLLKNTDNT 459
Query: 442 LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCH- 497
LPLN + + +A+ GP + Y + ++L G+ ++ Y GC
Sbjct: 460 LPLNIDKIKKIAVCGPNADEEGYALTHYGPLAVEVTTVLEGIREKAQGKAEVLYTKGCDL 519
Query: 498 -----------DVPCNSD--AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPG 544
+ P D A AV A++AD +VV G E++ R SL LPG
Sbjct: 520 VDAHWPESEIMEYPLTPDEQAEIDRAVANARQADVAVVVLGGGQRTCGENKSRTSLELPG 579
Query: 545 QQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIF 604
Q+ L+ +V T K PVIL+L G PL V++A D + +IL YPG G +A+I+F
Sbjct: 580 HQLKLLQAVQATGK-PVILILINGRPLSVNWA--DKFVPAILEAWYPGSKGGTVVADILF 636
Query: 605 GDFNPGGRLPMTWYPESFTKVPMN---DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLS 661
GD+NPGG+L +T +P++ ++P N + D + +Y FG+GLS
Sbjct: 637 GDYNPGGKLTVT-FPKTVGQIPFNFPYKPASQIDGGKNPGPDGNMSRINGALYPFGYGLS 695
Query: 662 YTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVT 721
YT + Y S+L I+ + + K +++ VT
Sbjct: 696 YTTFEY------SDLEITPKVITPNQKAT-------------------------IRLKVT 724
Query: 722 NAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANK 781
N G G VV L+ R S T EK L GF+R+H SKEI F +D + L + N
Sbjct: 725 NTGKRAGDEVVQLYTRDILSSVTTYEKNLAGFERIHLKPGESKEIVFTLD-RKHLELLNA 783
Query: 782 HGRRILPLGNHVLMVG------ELRHSLTIETY 808
+ + G +M G L LT+E Y
Sbjct: 784 DMKWTVEPGEFAIMAGASSEDIRLNGILTVEDY 816
>gi|393787054|ref|ZP_10375186.1| hypothetical protein HMPREF1068_01466 [Bacteroides nordii
CL02T12C05]
gi|392658289|gb|EIY51919.1| hypothetical protein HMPREF1068_01466 [Bacteroides nordii
CL02T12C05]
Length = 958
Score = 249 bits (637), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 208/751 (27%), Positives = 344/751 (45%), Gaps = 119/751 (15%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ S T+FP L ++NR L +G
Sbjct: 165 RLGIPV-DFTNEGIRGVES-----------YKATNFPTQLGLGHTWNRKLIHQVGLITGR 212
Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
EAR + G T +AP +++ RD RWGR +E GE P +V+ +E VK Q N++
Sbjct: 213 EARML------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVKGMQ-HNYQ 265
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
++A KH IAY K + ++
Sbjct: 266 ------------------------------VAATGKHFIAYSNNKGAREGMARVDPQMSP 295
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPA-CLRGDLFQKARNEWGFKGYITSDC 327
++ E PF+ I++ +M SYN +G+P L + R + GF+GY+ SD
Sbjct: 296 REVEMIHVYPFKRVIQEAGLLGVMSSYNDYDGLPVQSSYYWLMTRLRGQMGFRGYVVSDS 355
Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDIDR 383
DAV ++ K +++ ++AG+++ C + + + +G + E+ I+
Sbjct: 356 DAVEYLYTKHGTAKDMKEAVRQSVEAGLNVRCTFRSPDSYVLPLRELVQEGGLSEEIIND 415
Query: 384 ALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLP 443
+ ++ V+ +GLF+ + G ++V E++ +AL A+R+ IVLLKNDK LP
Sbjct: 416 RVRDILRVKFLVGLFDTPYQTDLKG--ADEEVEKEENEIVALQASRESIVLLKNDKNALP 473
Query: 444 LNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCH--- 497
L+ ++ +A+ GP + + Y + ++L G+ V ++ Y GC
Sbjct: 474 LDVASIRKIAVCGPNADETAYALTHYGPLAVDVTTVLSGIRQKVDGKAEVLYTKGCELVD 533
Query: 498 ---------DVPCNSDA--GFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQ 546
D P +D +AV AK+AD +VV G E++ R SL LPG+Q
Sbjct: 534 ANWPESEIIDYPLTNDEQNKIDKAVAQAKEADVAVVVLGGGQRTCGENKSRSSLDLPGRQ 593
Query: 547 MSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD 606
+ L+ +V T K PV+LVL G PL V++A D + +I+ YPG G A+A+++FGD
Sbjct: 594 LDLLKAVQATGK-PVVLVLINGRPLSVNWA--DKFVPAIIEAWYPGSKGGTAVADVLFGD 650
Query: 607 FNPGGRLPMTWYPESFTKVPMN---DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYT 663
+NPGG+L +T +P+S ++P N + + D + + +Y FGHGLSYT
Sbjct: 651 YNPGGKLTVT-FPKSVGQIPFNFPCKPSSQIDGGKNPGPKGNMSRVNGALYPFGHGLSYT 709
Query: 664 NYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNA 723
+ Y S+++IS + + K V+ +TN
Sbjct: 710 TFEY------SDISISPKVITPNQK-------------------------VQVRCKITNT 738
Query: 724 GDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHG 783
G G VV L+ R S T EK L GF+R+H +KE+SF +D + L + N
Sbjct: 739 GKRAGDEVVQLYVRDILSSVTTYEKNLEGFERIHLQPGETKEVSFTLD-RKALELLNAKN 797
Query: 784 RRILPLGNHVLMVG------ELRHSLTIETY 808
++ G+ +M+G L LT++ Y
Sbjct: 798 DWVVEPGDFSIMLGASSEDIRLSDILTVKEY 828
>gi|150003144|ref|YP_001297888.1| glycoside hydrolase family protein [Bacteroides vulgatus ATCC 8482]
gi|149931568|gb|ABR38266.1| glycoside hydrolase family 3, candidate beta-glycosidase
[Bacteroides vulgatus ATCC 8482]
Length = 785
Score = 249 bits (637), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 221/754 (29%), Positives = 349/754 (46%), Gaps = 141/754 (18%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP + E HG + G T FP L +A+++N L +G A+A+
Sbjct: 131 RLGIPVL-FAEECPHGHMAIG-----------TTVFPTALSAASTWNEGLMLKMGEAIAL 178
Query: 150 EARAM-YNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
EAR N+G + P +++ R+PRW R +ET GEDP++ + V +K QG+
Sbjct: 179 EARLQGANIG------YGPVLDVAREPRWSRMEETFGEDPVLTTIMGVAMMKGMQGKV-- 230
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG-NFSRYS--FNAM 265
+ D L A KH AY + + G N SR + +
Sbjct: 231 ------------------------QNDGKHLYATLKHFAAYGVPESGHNGSRANCGMRQL 266
Query: 266 ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYIT 324
++E + PPFR +++G A +M SYN ++GVP +L RN+WGFKG++
Sbjct: 267 LSE------YLPPFRKAVKEG-AGTLMTSYNAIDGVPCTANKELLTDVLRNQWGFKGFVY 319
Query: 325 SDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG-TCMLRHTQSAIDKGKVQEKDIDR 383
SD ++ I + K ++++A LKAG+D++ G ++ + A ++G + D+DR
Sbjct: 320 SDLISIEGIVGMRA-AKDNKEAAVKALKAGLDMDLGGNAFGKNLKKAYEEGLITMADLDR 378
Query: 384 ALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLP 443
A+ N+ ++ ++GLF +P +L V + EHK+LA AR+G+VLLKN+ LP
Sbjct: 379 AVGNVLRLKFQMGLFE-NPYVSP--ELAKKLVHSKEHKELARQVAREGVVLLKNEG-VLP 434
Query: 444 LNKNAVSSLAIIGPLVNNISQMGGGYTG--IPCSPKSLLRGLEAYVS---KTHYASGCH- 497
L+K+ + LA+IGP + + G YT + ++L G+ A VS + Y GC
Sbjct: 435 LSKH-IGHLAVIGPNADEMYNQLGDYTAPQVREEVATVLDGIRAAVSESTRVTYVKGCAV 493
Query: 498 ------DVPCNSDAGFHE-----AVRIAKKADFVIVVAGLDLTQETEDR----------- 535
D+P A V + DF +ED
Sbjct: 494 RDTTATDIPAAVAAAQKADAVVLVVGGSSARDFKTKYISTGAATVSEDAKTLPDMDCGEG 553
Query: 536 -DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEA 594
DR SL L G Q L+++VA T K P+++V G ++++ A +Q W YPGE
Sbjct: 554 FDRSSLRLLGDQEKLISAVASTGK-PLVVVYIQGRTMNMNLAAEKAQALLTAW--YPGEQ 610
Query: 595 GAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVY 654
G +A+I+FGD++P GRLP++ P S ++P+ Q R Y GT +Y
Sbjct: 611 GGMGIADILFGDYSPAGRLPVS-VPRSEGQLPV--------FYSQGTQRDYVESKGTPLY 661
Query: 655 GFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRF 714
FG+GLSYT ++Y L L+ G++ LQ +CT
Sbjct: 662 AFGYGLSYTRFTYSGL----------ELQKGTEMETLQ-------------TVACT---- 694
Query: 715 HVQISVTNAGDVDGSHVVMLFA--RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDP 772
VTN G+ DG VV L+ +V VSQ P L F R+ S+++ F +
Sbjct: 695 -----VTNTGNRDGEEVVQLYIGDKVASVSQ--PPLLLKAFQRIFLKKGESRQVIFHLKK 747
Query: 773 CEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
+ L I + ++ G +MVG + + +E
Sbjct: 748 -DDLGIYDSEMNYVVEPGEFKVMVGAASNDIRLE 780
>gi|423300893|ref|ZP_17278917.1| hypothetical protein HMPREF1057_02058 [Bacteroides finegoldii
CL09T03C10]
gi|408472228|gb|EKJ90756.1| hypothetical protein HMPREF1057_02058 [Bacteroides finegoldii
CL09T03C10]
Length = 798
Score = 249 bits (637), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 207/735 (28%), Positives = 340/735 (46%), Gaps = 127/735 (17%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ + T FP A++N+ L I A
Sbjct: 149 RLGIPV-DFTNEGIRGLCHD-----------RATMFPAQCGQGATWNKKLIREIAKVTAD 196
Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
EA+A+ G T +AP ++I +DPRWGR E+ GEDP +V + + Q E
Sbjct: 197 EAKAL------GYTNIYAPILDIAQDPRWGRVVESYGEDPYLVGELGKQMILGLQNEG-- 248
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
+ A KH Y + G + +
Sbjct: 249 ------------------------------IVATPKHFAVYSIPVGGRDGGTRTDPHVAP 278
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDC 327
++ + + PFR I++ A +M SYN +G P + R +WGFKGY+ SD
Sbjct: 279 REMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYVVSDS 338
Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTC----MLRHTQSAIDKGKVQEKDIDR 383
+AV + T T E+ AA V+ AG++I + + AI +GKV +D+
Sbjct: 339 EAVEFLHTKHRITPTEEEMAAQVVNAGLNIRTNFTPPQDFILPLRRAISEGKVSLHTLDQ 398
Query: 384 ALLNLFSVQLRLGLFN----GDPRKGKYGKLGPDDVC-TSEHKKLALDAARQGIVLLKND 438
+ + V+ +GLF+ GD R+ P+ V + H+ +++ AA + IVLLKN+
Sbjct: 399 RVGEILRVKFMMGLFDNPYPGDDRR-------PEVVVHNAAHQDVSMRAALESIVLLKNE 451
Query: 439 KKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGC 496
K+ LPL+K + + +A+IGP + ++ Y S K++ +G++ Y+ ++ YA GC
Sbjct: 452 KEMLPLSK-SFNKIAVIGPNAEEVKELTCRYGPANASIKTVYQGIKEYLPNAEVRYAKGC 510
Query: 497 ------------HDVPCNSD--AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLL 542
++VP ++ A +EAV +AK +D I+V G + E+ R +L L
Sbjct: 511 DIIDKYFPESELYNVPLDTQEKAMINEAVELAKASDVAILVLGGNEKTVREEFSRTNLDL 570
Query: 543 PGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEI 602
G+Q L+ +V T K PV+LV+ G +++ A+ + +I+ +PGE A+A++
Sbjct: 571 CGRQQQLLEAVYATGK-PVVLVMVDGRAATINW--ANKYVPAIIHAWFPGEFMGDAIAKV 627
Query: 603 IFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
+FGD+NPGGRL +T +P+S ++P S+ + +Y FG+GLSY
Sbjct: 628 LFGDYNPGGRLAVT-FPKSVGQIPFAFPFKPGSDSKG------KVRVAGVLYPFGYGLSY 680
Query: 663 TNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTN 722
T + Y S+L +S + G+ +NI SCT V N
Sbjct: 681 TTFGY------SDLKVSKPV-IGAQENI---------------TLSCT---------VKN 709
Query: 723 AGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
G G VV L+ R S T +K L GF+R+H + ISF + P + L + +K+
Sbjct: 710 TGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEERTISFTLTP-QDLGLWDKN 768
Query: 783 GRRILPLGNHVLMVG 797
+ G+ +MVG
Sbjct: 769 NHFTVEPGSFSVMVG 783
>gi|313203744|ref|YP_004042401.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
gi|312443060|gb|ADQ79416.1| glycoside hydrolase family 3 domain protein [Paludibacter
propionicigenes WB4]
Length = 1286
Score = 249 bits (636), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 159/443 (35%), Positives = 228/443 (51%), Gaps = 46/443 (10%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ NTS S RA LIS LTL+EK L ++ +AIPRLGI + WSE+LHGI G
Sbjct: 33 YLNTSYSFEERAADLISRLTLEEKESLLGNSMAAIPRLGIKSMNVWSEALHGILG---GA 89
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFR 173
N + +S TSFP + ++++ +L A+A EARA+ G GLT+W+P + R
Sbjct: 90 NQSVGISGPTSFPNSVALGSAWDPALMQREAMAIADEARAINQTGTKGLTYWSPVVEPIR 149
Query: 174 DPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDR 233
DPRWGR E+ GEDP + + A FV+ G + LK
Sbjct: 150 DPRWGRTGESYGEDPFLAAEIAGGFVRGMVG--------------NDPTYLKSV------ 189
Query: 234 GDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMC 293
C KH A + E F R+ ++ + +D + + P++ IEQ IM
Sbjct: 190 -------PCAKHYFANNSE----FDRHVSSSNMDSRDMREFYLAPYKKLIEQDNLPSIMS 238
Query: 294 SYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLK 352
SYN VNGVP AR +G KGYIT DC A+ I+ Y KT E++ A LK
Sbjct: 239 SYNAVNGVPTSASQLYLDTIARRTYGLKGYITGDCAAIEDIYTGHYYVKTAEEATAKGLK 298
Query: 353 AGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP 412
AG+D +CG+ R+ +A+ KG + DIDRALLN+F V++R G F+ P K Y + P
Sbjct: 299 AGVDSDCGSIYQRYAIAALKKGLITMADIDRALLNIFIVRMRTGEFD-PPAKVLYAQFQP 357
Query: 413 DDVCTSEHKKLALDAARQGIVLLKN------DKKFLPLNKNAVSSLAIIGPLVNNISQMG 466
+ V + +K LA + A + VLLKN ++K LPLN + +A+IGP + +
Sbjct: 358 NIVNSPANKALAKEIATKTPVLLKNNISLKTNRKALPLNPADLKKIALIGPHADKVEL-- 415
Query: 467 GGYTGIPCSPKSL--LRGLEAYV 487
G Y+G P + G++ Y+
Sbjct: 416 GPYSGRPAQENMITPFAGIKKYI 438
Score = 110 bits (275), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 79/255 (30%), Positives = 127/255 (49%), Gaps = 39/255 (15%)
Query: 525 GLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISS 584
G D TE+ DR++LLLPG Q+ L+ +VA + I+V+ G ++V + I
Sbjct: 619 GTDEKTATEEADRLTLLLPGNQVELIKAVAAVNPN-TIVVMQTLGCVEVEEFKNLQNIPG 677
Query: 585 ILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGR 643
I+W+GY G+A A+A ++FG+ NPGG+L TWY +S +P + D +R + + GR
Sbjct: 678 IIWVGYNGQAQGDAIASVLFGEVNPGGKLNGTWY-KSVKDLPEITDYTLRGGNGKN--GR 734
Query: 644 SYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHI 703
++ ++ Y FG G+SYT + Y S IS KN + +
Sbjct: 735 TFWYFDKDVSYEFGFGMSYTTFEY------SNFRIS--------KNSI---------IPH 771
Query: 704 DEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPK--VSQGTPEKQLIGFDRVHTVAK 761
D++T V + V N G V+G V+ ++ + P S P K+L GF RV A
Sbjct: 772 DKIT--------VSVDVKNTGKVEGDEVIQVYMKTPDSPASLQRPIKRLKGFKRVTLPAG 823
Query: 762 GSKEISFGVDPCEQL 776
+K ++ ++ C L
Sbjct: 824 QTKTVNIDIN-CADL 837
>gi|383110724|ref|ZP_09931543.1| hypothetical protein BSGG_1833 [Bacteroides sp. D2]
gi|382949470|gb|EFS31133.2| hypothetical protein BSGG_1833 [Bacteroides sp. D2]
Length = 783
Score = 249 bits (636), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 212/739 (28%), Positives = 331/739 (44%), Gaps = 132/739 (17%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP + E+ HG + G T FP + AA+++ L +G A+
Sbjct: 131 RLGIPLF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSPQLIREVGKAIGK 178
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
E R Q G + P +++ RDPRW R +ET GEDP++
Sbjct: 179 EIRL-----QGGHISYGPVLDLARDPRWSRVEETFGEDPVLT------------------ 215
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
G IG K +++G G D A KH +AY + + G SF + +
Sbjct: 216 --GEIG-----KAMVEGLGS-GDLSHPYSTLATLKHFLAYGISESGQNGNPSFAGI---R 264
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCD 328
+ + F PPFR I+ G A +M SYN ++G+P L + RNEW F G + SD
Sbjct: 265 ELHENFLPPFRQAIDAG-ALSVMTSYNSMDGIPCTANHSLLTELLRNEWKFSGIVVSDLY 323
Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
++ I + T E +A L AG+D++ G + +A++ G++ + +D ++ +
Sbjct: 324 SIEGIHQSHFVAPTMEAAAILALSAGVDVDLGGDAYMNLMNAVNTGRISKTALDASVARV 383
Query: 389 FSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
++ +GLF DP K K +V + E LA A+ I LLKN+ LPLN
Sbjct: 384 LRLKFEMGLFENPYVDPEKAK------KEVRSEESVTLARRVAQASITLLKNEHSLLPLN 437
Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTGIPCSP--KSLLRGLEAYV--SKTHYASGCHDVPC 501
KN +A+IGP +N M G YT K++L G+ + S+ Y GC +
Sbjct: 438 KN--RKVALIGPNADNRYNMLGDYTAPQEEENIKTVLDGIRTKLSSSQVEYVKGC-SIRD 494
Query: 502 NSDAGFHEAVRIAKKADFVIVVAG----LDLT---QET----------------EDRDRV 538
+AV A++++ +I V G D +ET E DR
Sbjct: 495 TVTTDIEQAVAAAQRSEVIIAVVGGSSARDFKTSYKETGAAIADEKTISDMECGEGFDRA 554
Query: 539 SLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKA 598
+L L G+Q L+ ++ T K P+I+V G PLD ++A ++ ++L YPG+ G A
Sbjct: 555 TLSLLGKQQELLKALKATGK-PLIVVYIEGRPLDKTWASENAD--AVLTAYYPGQEGGNA 611
Query: 599 LAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGH 658
+A+++FGD+NP GRLP+T P S ++P+ N +A + Y S + +Y FG+
Sbjct: 612 IADVLFGDYNPAGRLPLT-VPRSVGQIPIY-YNKKAPQNHDYVELS-----ASPLYAFGY 664
Query: 659 GLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQI 718
GLSYT + Y S+L +SA + F V
Sbjct: 665 GLSYTTFEY------SDLRVSA----------------------------ISPHSFEVSF 690
Query: 719 SVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
V N G DG V L+ R S P KQL F+R KE+ F + + +I
Sbjct: 691 KVKNTGRYDGEEVSQLYLRDEYASVVQPLKQLKHFERFCLKRGEVKEVKFVLSESD-FTI 749
Query: 779 ANKHGRRILPLGNHVLMVG 797
+++ + ++ G +MVG
Sbjct: 750 IDRNLKTVVESGTFQVMVG 768
>gi|295689400|ref|YP_003593093.1| glycosyl hydrolase family protein [Caulobacter segnis ATCC 21756]
gi|295431303|gb|ADG10475.1| glycoside hydrolase family 3 domain protein [Caulobacter segnis
ATCC 21756]
Length = 895
Score = 249 bits (636), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 158/449 (35%), Positives = 228/449 (50%), Gaps = 64/449 (14%)
Query: 65 AKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTS 124
A L++ +T +EK+ QL + A AIPRLG+PAY WW+ESLHG + P T+
Sbjct: 43 AADLVAKMTTEEKLDQLLNTAPAIPRLGVPAYNWWTESLHGALGSLP----------TTN 92
Query: 125 FPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA---------GLTFWAPNINIFRDP 175
FP+ + AA+F+ L ++G A+ E R ++ L +A L W+PNINIFRDP
Sbjct: 93 FPEPVGLAATFDTDLVKDVGGAIGAEVRGLHTLARATGRMGRIGTALNTWSPNINIFRDP 152
Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
RWGRGQET GEDP + + V FV+ QG + D
Sbjct: 153 RWGRGQETYGEDPHLAARMGVAFVEGVQGPD---------------------------PD 185
Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
+ A KH Y + +R+ N ++ +D EDT+ P FR+ I + KA IMC+Y
Sbjct: 186 LPQVIATPKH---YAVHSGPESTRHHANVYVSRRDLEDTYLPAFRAAIVEAKAGSIMCAY 242
Query: 296 NQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
N+++G PAC L + R W F GY+ SDCDAV I + Y + A ++AG
Sbjct: 243 NRIDGQPACASDMLLKDYLRTAWKFDGYVVSDCDAVKDINDNHKYAPDAATAVAAAVRAG 302
Query: 355 MDINCGTCMLRHT-------QSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKY 407
+D C L T + A+ +G + D+D+AL+ L+S +LR+G G R
Sbjct: 303 VDNECNGATLTDTAGLSNRYRDALQQGLITIGDVDQALVRLYSARLRVGDLPGV-RPVDT 361
Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
P V T EH+ LAL A+ + +VLLKND LPL +A+IGPL + + G
Sbjct: 362 APKAPSAVMTPEHEALALAASEKSLVLLKND-GLLPLKPGL--RVAVIGPLGDATRVLRG 418
Query: 468 GY-TGIPCSPKSLLRGLEAYV--SKTHYA 493
Y + + P S++ GL + +K YA
Sbjct: 419 NYSSALSAPPISVVEGLRRALPGAKVAYA 447
Score = 108 bits (269), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 83/290 (28%), Positives = 129/290 (44%), Gaps = 54/290 (18%)
Query: 518 DFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTG 567
D ++ V GL E E+ D+ +L +P Q +L+ A+ +P+++V
Sbjct: 625 DVLVAVVGLTSDLEAEESPVEIPGFKGGDKTTLDIPADQQALLEQ-AKALGKPLVVVAMN 683
Query: 568 GGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM 627
G PL++S+A+ ++ ++IL YPG++G A+A ++ G NP GRLP+T+Y P
Sbjct: 684 GSPLNLSWAKDNA--AAILEAWYPGQSGGLAIANVLTGKTNPSGRLPLTFYKSVEDLPPF 741
Query: 628 NDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSD 687
D M GR+YR++TG VY FG+GLSYT + Y A LK
Sbjct: 742 GDYRMD--------GRTYRYFTGQPVYPFGYGLSYTRFDY------------APLK---- 777
Query: 688 KNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPE 747
++ + V +V N G G V L+ P S G P
Sbjct: 778 ---------------VEPIKGDAGQGLRVTTTVRNVGQRAGDEVAQLYLNFPN-SPGAPR 821
Query: 748 KQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
L GF RV +K ++F + LS + G R + G + + VG
Sbjct: 822 VALRGFQRVSLKPGEAKSVTFSLS-SRDLSSVDPDGVRQVMTGRYKVSVG 870
>gi|189468349|ref|ZP_03017134.1| hypothetical protein BACINT_04746 [Bacteroides intestinalis DSM
17393]
gi|189436613|gb|EDV05598.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 786
Score = 249 bits (636), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 218/742 (29%), Positives = 348/742 (46%), Gaps = 123/742 (16%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ + T FP A++N+ L S I A
Sbjct: 137 RLGIPV-DFTNEGIRGLCHD-----------RATMFPAQCGQGATWNKELISEIAQVTAE 184
Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
EA+A+ G T ++P ++I +DPRWGR E GEDP +V
Sbjct: 185 EAKAL------GYTNIYSPILDIAQDPRWGRVVECYGEDPFLV----------------- 221
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
G +G KR++KG +E L A KH Y + G + + +
Sbjct: 222 ---GELG-----KRMIKGLQQEG-------LVATPKHFAVYSIPVGGRDAGTRTDPHVAP 266
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGD---LFQKARNEWGFKGYITS 325
++ + PFR + A +M SYN +G P + G L + R+EWGFKGY+ S
Sbjct: 267 REMRTLYIEPFRKAFCEAGALGVMSSYNDYDGEP--ITGSYHFLTEILRHEWGFKGYVVS 324
Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCM----LRHTQSAIDKGKVQEKDI 381
D +AV ++ D AA V+ AG+++ + +R + AI +GKV + I
Sbjct: 325 DSEAVEFLYSKHQVAVDAVDGAAQVVNAGLNVRTNFTLPENFIRPLRQAISEGKVSMQTI 384
Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF 441
D + ++ V+ +GLF+ +P KG K V + EH+ +++ AA + IVLLKN+
Sbjct: 385 DSRVADVLRVKFGMGLFD-NPYKGD-AKHPEKVVHSKEHQAVSMRAALESIVLLKNENNI 442
Query: 442 LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGC--- 496
LPL+K+ + +A+IGP N + + Y K++ +G++ Y+ ++ YA G
Sbjct: 443 LPLSKD-LKKIAVIGPNANEVQNLICRYGPANAPIKTVYQGIKEYLPDAEVRYAKGTDII 501
Query: 497 ---------HDVPCNSD--AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQ 545
++VP + + A EAV +A+++D I+V G + E+ R +L L G+
Sbjct: 502 DKYFPESELYEVPLDQEEQAMMDEAVTLAEESDVAIMVLGGNEKTVREEYSRTNLDLCGR 561
Query: 546 QMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFG 605
Q L+ +V T K PVIL+L G +++AE I I+ +PGE A+A+++FG
Sbjct: 562 QEKLLQAVYATGK-PVILLLVDGRVATINWAE--RYIPGIVHAWFPGEFMGDAVAQVLFG 618
Query: 606 DFNPGGRLPMTWYPESFTKVPMN-DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTN 664
D+NPGG+L +T +P S ++P +DS + + TGT +Y FG+GLSYT
Sbjct: 619 DYNPGGKLAVT-FPRSVGQIPFAFPFKPGSDS------KGFVRVTGT-LYPFGYGLSYTT 670
Query: 665 YSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAG 724
++Y S+L I + I Q +L SC V N G
Sbjct: 671 FAY------SDLKIENPV-------IGVQGSVKL---------SC---------KVKNTG 699
Query: 725 DVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGR 784
V G VV L+ S T K L GF+R+H K I F + P ++L + NK
Sbjct: 700 KVAGDEVVQLYLHDEMSSVTTYVKVLRGFERIHLEPGEEKVIDFVLTP-QELGLWNKDNH 758
Query: 785 RILPLGNHVLMVGELRHSLTIE 806
++ G +MVG + ++
Sbjct: 759 FVVEPGTFAVMVGSSSQDIRLQ 780
>gi|393781221|ref|ZP_10369422.1| hypothetical protein HMPREF1071_00290 [Bacteroides salyersiae
CL02T12C01]
gi|392677556|gb|EIY70973.1| hypothetical protein HMPREF1071_00290 [Bacteroides salyersiae
CL02T12C01]
Length = 946
Score = 249 bits (636), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 203/734 (27%), Positives = 333/734 (45%), Gaps = 113/734 (15%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ S T+FP L ++NR L IG
Sbjct: 153 RLGIPV-DFTNEGIRGVES-----------YKATNFPTQLGLGHTWNRKLIHQIGLITGR 200
Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
EAR + G T +AP +++ RD RWGR +E GE P +V+ +E V+ Q +
Sbjct: 201 EARML------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGMQHNH-- 252
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
++A KH IAY K + ++
Sbjct: 253 -----------------------------QVAATGKHFIAYSNNKGAREGMARVDPQMSP 283
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVP-ACLRGDLFQKARNEWGFKGYITSDC 327
++ E PF+ I++ +M SYN +G P L + R + GF+GY+ SD
Sbjct: 284 REVEMIHVYPFKRVIQEAGLLGVMSSYNDYDGFPIQSSYYWLTTRLRGQMGFRGYVVSDS 343
Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDIDR 383
DAV ++ K +++ ++AG+++ C + + + +G + E+ I+
Sbjct: 344 DAVEYLYTKHGTAKDMKEAVRQSVEAGLNVRCTFRSPDSYVLPLRELVQEGGLSEEVIND 403
Query: 384 ALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLP 443
+ ++ V+ +GLF+ + G D+V E++ +AL A+R+ IVLLKN+ LP
Sbjct: 404 RVRDILRVKFLVGLFDAPYQTDLKG--ADDEVEKEENEAVALQASRESIVLLKNENNTLP 461
Query: 444 LNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCH--- 497
L+ +V +A+ GP + Y + +++ GL + ++ Y GC
Sbjct: 462 LDITSVKKIAVCGPNAAEKAYALTHYGPLAVEVTTVVDGLREKLNGKAEVLYTKGCDLVD 521
Query: 498 ---------DVPCNSD--AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQ 546
D P + D + +AV A++AD +VV G E++ R SL LPG+Q
Sbjct: 522 AHWPESEIIDYPLSKDEQSEIDKAVAQAQEADVAVVVLGGGQRTCGENKSRSSLDLPGRQ 581
Query: 547 MSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD 606
+ L+ +V T K PVILVL G PL V++A D + +IL YPG G A+A+++FGD
Sbjct: 582 LDLLKAVQATGK-PVILVLINGRPLSVNWA--DKFVPAILEAWYPGSKGGTAIADVLFGD 638
Query: 607 FNPGGRLPMTWYPESFTKVPMN---DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYT 663
+NPGG+L +T +P+S ++P N + + D + + +Y FG+GLSYT
Sbjct: 639 YNPGGKLTVT-FPKSVGQIPFNFPHKPSSQIDGGKNPGTKGDMSRVNGALYPFGYGLSYT 697
Query: 664 NYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNA 723
+ Y S++ IS + + K V+ VTN
Sbjct: 698 TFEY------SDINISPKVITPNQK-------------------------VQVRCKVTNT 726
Query: 724 GDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHG 783
G G VV L+ R S T EK L GF+R+H +KE+SF +D + L + N
Sbjct: 727 GKHAGDEVVQLYVRDLISSVTTYEKNLEGFERIHLQPGETKEVSFTLD-RKALELLNAKN 785
Query: 784 RRILPLGNHVLMVG 797
++ G+ +M+G
Sbjct: 786 DWVVEPGDFSIMLG 799
>gi|423222970|ref|ZP_17209439.1| hypothetical protein HMPREF1062_01625 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392640546|gb|EIY34345.1| hypothetical protein HMPREF1062_01625 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 862
Score = 249 bits (635), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 154/461 (33%), Positives = 237/461 (51%), Gaps = 57/461 (12%)
Query: 51 SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
+ P+ N LS + RAK L+ LTL+EK + D++ AIPRLGI + WWSE+LHG+A+ G
Sbjct: 20 TLPYQNPELSPAERAKDLVKRLTLEEKALLMCDDSEAIPRLGIKKFNWWSEALHGVANQG 79
Query: 111 PGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN---------LGQAG 161
+VT FP+ + AASFN L I +AV+ E RA +N +
Sbjct: 80 ----------NVTVFPEPVGMAASFNDKLVFEIFNAVSDEMRAKHNERVRNGLEDVRFHS 129
Query: 162 LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQG-ENWKSDDGGIGFGFRE 220
L+ W PN+NIFRDPRWGRGQET GEDP + S + VK QG EN K +R+
Sbjct: 130 LSVWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGIAVVKGLQGPENEK---------YRK 180
Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNA-MITEQDTEDTFQPPF 279
L AC KH + +W SR++ N ++ +D +T+ P F
Sbjct: 181 ------------------LLACAKHYAVHSGPEW---SRHTANLNNVSPRDLWETYLPAF 219
Query: 280 RSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQN 338
++ +++ +MC+Y +++ P C L Q+ R+EWGFK + SDC A+A +
Sbjct: 220 KALVQKADVREVMCAYQRLDDDPCCGNTRLLQQILRDEWGFKYLVVSDCGAIADFWTSHK 279
Query: 339 YTKTHEDSAAGVLKAGMDINCGT-CMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
+ +A AG D+ CG + A+ +G + E+++D+ +L L + LG
Sbjct: 280 SSSDAVHAAVKGTMAGTDVECGYGYAYQKLPEAVSRGLITEEEVDKHVLRLMEGRFELGE 339
Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
+ DP + K+ V HK L+L+ +RQ + LL+N LPL+K ++ +A+IGP
Sbjct: 340 MD-DPSLVNWTKIPMSVVNCKAHKDLSLNMSRQTMTLLQNKNNVLPLSK-SIRKIAVIGP 397
Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGC 496
++ + G Y G P ++L G ++ + K Y GC
Sbjct: 398 NADDKPMLWGNYNGTPNQTITILDGFKSKLKKNQIVYMKGC 438
Score = 74.3 bits (181), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 61/233 (26%), Positives = 101/233 (43%), Gaps = 44/233 (18%)
Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
DR + LP Q + + ++ K+ V + +G ++ +IL Y GE G
Sbjct: 629 DRTDIELPAVQRNFLKALKDAGKQVVFVNCSGSS---MALLPETESCDAILQAWYGGELG 685
Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
A+A+++FGD+NP G+LP+T+Y + D +M+ GR+YR Y ++
Sbjct: 686 GYAVADVLFGDYNPSGKLPVTFYKSTKQLPDYEDYSMK--------GRTYR-YMSDPLFP 736
Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
FG GLSYT++ A A +K L+ DE + T
Sbjct: 737 FGFGLSYTDF--------------AVGTASCNKTQLR----------TDESLTLT----- 767
Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
+ V+N G G+ VV ++ R + G P K L + RV A +++
Sbjct: 768 --VPVSNTGKRSGTEVVQVYIRKTDDADG-PLKSLKAYARVELAAGAKQDVKI 817
>gi|365121873|ref|ZP_09338785.1| hypothetical protein HMPREF1033_02131 [Tannerella sp.
6_1_58FAA_CT1]
gi|363644185|gb|EHL83481.1| hypothetical protein HMPREF1033_02131 [Tannerella sp.
6_1_58FAA_CT1]
Length = 850
Score = 249 bits (635), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 159/437 (36%), Positives = 221/437 (50%), Gaps = 57/437 (13%)
Query: 64 RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
R L+S LT++EKI L + IPRL I Y +E+LHGI G NF T
Sbjct: 37 RIMDLLSRLTIEEKISLLRATSPGIPRLEIEKYYHGNEALHGIVRPG---NF-------T 86
Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG----------LTFWAPNINIFR 173
FPQ + A+ +N I + ++ EARA +N G LTFW+P +N+ R
Sbjct: 87 VFPQAIGLASMWNPDFLYEISTVISDEARARWNELNRGKDQKRLFSDLLTFWSPTVNMAR 146
Query: 174 DPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDR 233
DPRWGR ET GEDP + V FVK QG + R LK
Sbjct: 147 DPRWGRTPETYGEDPFLSGKLGVAFVKGLQGN--------------DPRYLK-------- 184
Query: 234 GDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMC 293
+ + KH A + E +R+ N I+E+D + + P F CI GKA IM
Sbjct: 185 -----VVSTPKHFAANNEEH----NRFECNPQISERDLREYYLPAFERCIIDGKAQSIMT 235
Query: 294 SYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLK 352
+YN +N VP L L +K R +WGF GY+ SDC A + + + Y KT E +A LK
Sbjct: 236 AYNAINDVPCTLNTWLLKKVLRTDWGFNGYVVSDCGAPSLLVTHHKYVKTPEAAATLALK 295
Query: 353 AGMDINCG-TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLG 411
AG+D+ CG + +A + V E +ID A + ++ LGLF+ DP K Y L
Sbjct: 296 AGLDLECGDNVYIEPLMNAYKQYMVSEAEIDTAAYRILRARMMLGLFD-DPAKNPYNALS 354
Query: 412 PDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTG 471
P V +HK +AL+AARQ +VLLKN+ FLP+N + S+A++G +N + G Y+G
Sbjct: 355 PSIVGCEKHKNMALEAARQSLVLLKNENNFLPINPKKIKSIAVVG--INAGNCEFGDYSG 412
Query: 472 IPCS-PKSLLRGLEAYV 487
P + P S+L G+ V
Sbjct: 413 KPVNVPVSVLDGIRNRV 429
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 87/292 (29%), Positives = 141/292 (48%), Gaps = 47/292 (16%)
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
+ +A + ++ D I V G++ + E E RDR + LP Q + + + + + +VL
Sbjct: 591 YGDAKKAIQECDMTIAVMGINKSIEREGRDRDHIELPKDQELFIEEAYKLNPK-MAVVLV 649
Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
G L V++ D + +IL YPGE G A+AE +FGD+NP GRLP+T+Y P
Sbjct: 650 AGSSLAVNWM--DEHVPAILNAWYPGEQGGTAVAEALFGDYNPAGRLPLTYYRSLDDLPP 707
Query: 627 MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGS 686
+D ++ + R+Y ++TG +Y FG+GLSYT + Y+ LS
Sbjct: 708 FDDYAVQKN-------RTYMYFTGKPLYAFGYGLSYTKFDYRKLS--------------- 745
Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
+D+ L F ++ N+G +G V ++ + P++ P
Sbjct: 746 ----------------VDQDAENVRLSF----TIKNSGKYNGDEVAQVYVQFPEIGVKVP 785
Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIAN-KHGRRILPLGNHVLMVG 797
KQL GF+RVH + ++ V P ++L I N + G P GN+V MVG
Sbjct: 786 IKQLKGFERVHIAKGKTLPVTITV-PKKELRIWNERKGEFFTPSGNYVFMVG 836
>gi|423214394|ref|ZP_17200922.1| hypothetical protein HMPREF1074_02454 [Bacteroides xylanisolvens
CL03T12C04]
gi|392692809|gb|EIY86045.1| hypothetical protein HMPREF1074_02454 [Bacteroides xylanisolvens
CL03T12C04]
Length = 800
Score = 249 bits (635), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 208/738 (28%), Positives = 338/738 (45%), Gaps = 133/738 (18%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ + T FP A++N+ L I A
Sbjct: 151 RLGIPV-DFTNEGIRGLCHD-----------RATMFPAQCGQGATWNKKLIREIAKVTAN 198
Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
EA+A+ G T ++P ++I +DPRWGR E+ GEDP + + + Q E
Sbjct: 199 EAKAL------GYTNIYSPILDIAQDPRWGRVVESYGEDPYLAGELGKQMILGLQSEG-- 250
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
+ A KH Y + G + +
Sbjct: 251 ------------------------------IVATPKHFAVYSIPVGGRDGGTRTDPHVAP 280
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDC 327
++ + + PFR I++ A +M SYN +G P + R +WGFKGY+ SD
Sbjct: 281 REMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYVVSDS 340
Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG-------TCMLRHTQSAIDKGKVQEKD 380
+AV + T T E+ AA V+ AG++I LRH AI++GKV
Sbjct: 341 EAVEFLHTKHRITPTEEEMAAQVVNAGLNIRTNFTPPQDFILPLRH---AINEGKVSLHT 397
Query: 381 IDRALLNLFSVQLRLGLFN----GDPRKGKYGKLGPDDVCTSE-HKKLALDAARQGIVLL 435
+D+ + + V+ +GLF+ GD R+ P+ V ++ HK +++ AA + +VLL
Sbjct: 398 LDQRVSEILRVKFMMGLFDNPYPGDDRR-------PETVVHNDAHKAVSMKAALESVVLL 450
Query: 436 KNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYA 493
KN + LPL+KN +A+IGP + ++ Y S K++ +G++ Y+ S+ YA
Sbjct: 451 KNKNQMLPLSKN-FKKIAVIGPNAEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVRYA 509
Query: 494 SGC------------HDVPCNS--DAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVS 539
GC ++VP ++ A EAV +AK +D I+V G + E+ R +
Sbjct: 510 KGCDIIDKYFPESELYNVPLDTQEQAMIQEAVELAKASDIAILVLGGNEKTVREEFSRTN 569
Query: 540 LLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKAL 599
L L G+Q L+ +V T K PV+LV+ G +++ A+ + +I+ +PGE A+
Sbjct: 570 LDLCGRQQQLLEAVYATGK-PVVLVMVDGRAATINW--ANKYVPAIIHAWFPGEFMGDAI 626
Query: 600 AEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHG 659
A+++FGD+NPGGRL +T +P+S ++P S+ + +Y FG+G
Sbjct: 627 AKVLFGDYNPGGRLAVT-FPKSVGQIPFAFPFKPGSDSKG------KVRVDGVLYPFGYG 679
Query: 660 LSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQIS 719
LSYT + Y S+L IS + G +NI SCT
Sbjct: 680 LSYTTFGY------SDLKISKPV-IGPQENI---------------TLSCT--------- 708
Query: 720 VTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIA 779
V N G G VV L+ R S T +K L GF+R+H + ++F + P + L +
Sbjct: 709 VKNTGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVNFTLTP-QDLGLW 767
Query: 780 NKHGRRILPLGNHVLMVG 797
+K+ + + G+ +MVG
Sbjct: 768 DKNNQFTVEPGSFSVMVG 785
>gi|160887545|ref|ZP_02068548.1| hypothetical protein BACOVA_05565 [Bacteroides ovatus ATCC 8483]
gi|156107956|gb|EDO09701.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus ATCC 8483]
Length = 736
Score = 249 bits (635), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 216/750 (28%), Positives = 334/750 (44%), Gaps = 135/750 (18%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP + E+ HG + G T FP + AA+++ L +G +A
Sbjct: 83 RLGIPMF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSPELVKEVGQVIAK 130
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
E R+ Q G + P +++ RDPRW R +ET GEDP++ V G N
Sbjct: 131 EIRS-----QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGILGASMVDGLGGGNLSQ 185
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
I A KH +AY + + G Y A + +
Sbjct: 186 KYATI--------------------------ATLKHFLAYAVPEGGQNGNY---ASVGIR 216
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCD 328
D F PPFR I+ G A +M SYN ++G P L Q RNEW F+G++ SD
Sbjct: 217 DLHQNFLPPFRKAIDAG-ALSVMTSYNSIDGTPCTSNHYLLTQLLRNEWKFRGFVVSDLY 275
Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
++ I E T E++A + AG+D++ G + A+ G++ + ID A+ +
Sbjct: 276 SIEGIHESHFVAPTKENAAIQSVMAGVDVDLGGDAYTNLCHAVQSGQMDKTVIDTAVCRV 335
Query: 389 FSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
++ +GLF DP K+ V EH +LA A+ I LLKN+ LPL+
Sbjct: 336 LRMKFEMGLFEHPYVDP------KIAAKTVRRKEHIELARKIAQSSITLLKNENSILPLS 389
Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTGIP--CSPKSLLRGLEAYVS--KTHYASGC--HDV 499
K ++ +A+IGP +N M G YT + K++L G+ +S + Y GC D
Sbjct: 390 KT-INKVAVIGPNADNRYNMLGDYTAPQEDSNVKTVLDGILTKLSPFRVEYVRGCAIRDT 448
Query: 500 PCNSDAGFHEAVRIAKKADFVIVVAG----LDLT---QET----------------EDRD 536
N +A++ A++++ VIVV G D +ET E D
Sbjct: 449 TVNE---IEQAIKAARRSEVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFD 505
Query: 537 RVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGA 596
R SL L G+Q L+ S+ +T K P+I+V G PL+ ++A ++L YPG+ G
Sbjct: 506 RASLSLLGRQQELLESLQKTGK-PLIVVYIEGRPLEKNWA--SEYADALLTAYYPGQEGG 562
Query: 597 KALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGF 656
A+A+++FGD+NP GRLP++ P S ++P+ N +A + Y S +Y F
Sbjct: 563 NAIADVLFGDYNPSGRLPIS-VPRSVGQIPVY-YNKKAPRNHDYVEMS-----SFPLYSF 615
Query: 657 GHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHV 716
G+G+SYT + Y SD ++Q++ C F V
Sbjct: 616 GYGMSYTTFEY------------------SDLQVVQKSA------------RC----FEV 641
Query: 717 QISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQL 776
V N G DG V L+ R S P KQL F+R H K+++F V E
Sbjct: 642 SFKVKNTGKYDGEEVSQLYMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTF-VLTEEDF 700
Query: 777 SIANKHGRRILPLGNHVLMVGELRHSLTIE 806
+ N ++++ GN LM+G + + ++
Sbjct: 701 FLVNYTLKKVVESGNFHLMIGAASNDIRLQ 730
>gi|423291211|ref|ZP_17270059.1| hypothetical protein HMPREF1069_05102 [Bacteroides ovatus
CL02T12C04]
gi|392663822|gb|EIY57367.1| hypothetical protein HMPREF1069_05102 [Bacteroides ovatus
CL02T12C04]
Length = 800
Score = 249 bits (635), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 207/735 (28%), Positives = 339/735 (46%), Gaps = 127/735 (17%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ + T FP A++N+ L I A
Sbjct: 151 RLGIPV-DFTNEGIRGLCHD-----------RATMFPAQCGQGATWNKKLIREIAKVTAN 198
Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
EA+A+ G T ++P ++I +DPRWGR E+ GEDP + + + Q E
Sbjct: 199 EAKAL------GYTNIYSPILDIAQDPRWGRVVESYGEDPYLAGELGKQMILGLQSEG-- 250
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
+ A KH Y + G + +
Sbjct: 251 ------------------------------IVATPKHFAVYSIPVGGRDGGTRTDPHVAP 280
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDC 327
++ + + PFR I++ A +M SYN +G P + R +WGFKGY+ SD
Sbjct: 281 REMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYVVSDS 340
Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTC----MLRHTQSAIDKGKVQEKDIDR 383
+AV + T T E+ AA V+ AG++I + + AI++GKV +D+
Sbjct: 341 EAVEFLHTKHRITPTEEEMAAQVVNAGLNIRTNFTPPQDFILPLRRAINEGKVSLHTLDQ 400
Query: 384 ALLNLFSVQLRLGLFN----GDPRKGKYGKLGPDDVCTSE-HKKLALDAARQGIVLLKND 438
+ + V+ +GLF+ GD R+ P+ V ++ HK +++ AA + IVLLKN+
Sbjct: 401 RVGEILRVKFMMGLFDNPYPGDDRR-------PETVVHNDAHKAVSMKAALESIVLLKNE 453
Query: 439 KKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGC 496
+ LPL+KN S +A+IGP + ++ Y S K++ +G++ Y+ S+ YA GC
Sbjct: 454 NQMLPLSKN-FSKIAVIGPNGEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVRYAKGC 512
Query: 497 ------------HDVPCNS--DAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLL 542
++VP ++ A EAV +AK +D I+V G + E+ R +L L
Sbjct: 513 DIIDKYFPESELYNVPLDTQEQAMIQEAVELAKASDVAILVLGGNEKTVREEFSRTNLDL 572
Query: 543 PGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEI 602
G+Q L+ +V T K PV+LV+ G +++ A+ + +I+ +PGE A+A++
Sbjct: 573 CGRQQQLLEAVYATGK-PVVLVMVDGRAATINW--ANKYVPAIIHAWFPGEFMGDAIAKV 629
Query: 603 IFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
+FGD+NPGGRL +T +P+S ++P S+ + +Y FG+GLSY
Sbjct: 630 LFGDYNPGGRLAVT-FPKSVGQIPFAFPFKPGSDSKG------KVRVDGVLYPFGYGLSY 682
Query: 663 TNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTN 722
T + Y S+L IS + G +NI SCT V N
Sbjct: 683 TTFGY------SDLKISKPV-IGPQENI---------------TLSCT---------VKN 711
Query: 723 AGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
G G VV L+ R S T +K L GF+R+H + ++F + P + L + +K+
Sbjct: 712 TGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVNFTLTP-QDLGLWDKN 770
Query: 783 GRRILPLGNHVLMVG 797
+ + G+ +MVG
Sbjct: 771 NQFTVEPGSFSVMVG 785
>gi|427384377|ref|ZP_18880882.1| hypothetical protein HMPREF9447_01915 [Bacteroides oleiciplenus YIT
12058]
gi|425727638|gb|EKU90497.1| hypothetical protein HMPREF9447_01915 [Bacteroides oleiciplenus YIT
12058]
Length = 1050
Score = 248 bits (634), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 155/449 (34%), Positives = 230/449 (51%), Gaps = 57/449 (12%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ N + R L+S LT++EKI L + I RL IP Y +E+LHG+ G
Sbjct: 29 YKNENAPTHERIMDLLSRLTVEEKISLLRATSPGISRLDIPKYYHGNEALHGVVRPG--- 85
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG----------LT 163
T FPQ + AA++N L + + ++ EARA +N G LT
Sbjct: 86 -------RFTVFPQAIGLAATWNPVLQEQVATVISDEARARWNELDQGREQKSQFSDLLT 138
Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
FW+P +N+ RDPRWGR ET GEDP + FVK QG + R
Sbjct: 139 FWSPTVNMARDPRWGRTPETYGEDPYLSGIMGTAFVKGLQGND--------------SRY 184
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
LK + + KH A + E +R+ N I+E+ + + P F +C+
Sbjct: 185 LK-------------IVSTPKHFAANNEE----HNRFVCNPQISEKQLREYYLPAFEACV 227
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
+ GK++ IM +YN +N VP L L K RN+WGFKGY+ SDC + + Y KT
Sbjct: 228 KDGKSASIMSAYNALNDVPCTLNAWLLTKVLRNDWGFKGYVVSDCGGPSLLVNAHKYVKT 287
Query: 343 HEDSAAGVLKAGMDINCGTCMLRH-TQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
E +A +KAG+D+ CG + SA + V + DID A + +++LGLF+
Sbjct: 288 KEAAATLSIKAGLDLECGDDVYDEPLLSAYRQYMVTDADIDSAAYRVLRARMQLGLFDSG 347
Query: 402 PRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
K Y K+ P + + EH+++AL+AAR+ IVLLKN KK LPLN + S+A++G +N
Sbjct: 348 -EKNPYTKISPAVIGSKEHQEVALNAARECIVLLKNQKKMLPLNAKKIKSIAVVG--INA 404
Query: 462 ISQMGGGYTGIPC-SPKSLLRGLEAYVSK 489
S G Y+G+P +P S+L+G++ V +
Sbjct: 405 GSSEFGDYSGLPVIAPVSVLQGIKDRVGE 433
Score = 131 bits (329), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 82/292 (28%), Positives = 142/292 (48%), Gaps = 47/292 (16%)
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
+ EA + ++ + V+ V G++ + E E +DR + LP Q + + + + +++VL
Sbjct: 593 YGEAGKAVRECETVVAVLGINKSIEREGQDRYDIQLPADQREFLQEIYKVNPN-IVVVLV 651
Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
G L V++ D + +I+ YPGE+G KA+AE++FGD+NPGGRLP+T+Y P
Sbjct: 652 AGSSLAVNWM--DEHVPAIVNAWYPGESGGKAVAEVLFGDYNPGGRLPLTYYRSLDELPP 709
Query: 627 MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLS-APSELTISASLKAG 685
+D ++ GR+Y+++ G +Y FG+GLSYT++ Y L A E +S S +
Sbjct: 710 FDDYDITK-------GRTYKYFKGDVLYPFGYGLSYTSFKYSNLQVADGEEEVSVSFQ-- 760
Query: 686 SDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT 745
+ N G G V ++ ++P+ +
Sbjct: 761 ----------------------------------LKNTGRYAGDEVAQVYVKLPEREEVM 786
Query: 746 PEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
P K+L GF+RV + SK+++ + G+ I P GN+ +MVG
Sbjct: 787 PVKELKGFERVSLKSGESKKVTIKLRKDLLRYWDEAKGKFIYPSGNYNIMVG 838
>gi|325299987|ref|YP_004259904.1| Beta-glucosidase [Bacteroides salanitronis DSM 18170]
gi|324319540|gb|ADY37431.1| Beta-glucosidase [Bacteroides salanitronis DSM 18170]
Length = 864
Score = 248 bits (634), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 154/459 (33%), Positives = 226/459 (49%), Gaps = 61/459 (13%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
P+ N L+ RA L+ LTL+EK + + + AIPRLGI AY+WW+E+LHG+
Sbjct: 26 PYQNPKLTPEERANDLVGRLTLEEKASLMQNTSPAIPRLGIKAYDWWNEALHGV------ 79
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTF 164
G T FPQ + AASF+ L + +AV+ EARA Y + GLTF
Sbjct: 80 ----GRAGIATVFPQTIGMAASFDDELLYQVFTAVSDEARAKYTQFRKEGDLKRYQGLTF 135
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
W PN+NIFRDPRWGRGQET GEDP + S + V+ QG D
Sbjct: 136 WTPNVNIFRDPRWGRGQETYGEDPYLTSQMGMAVVRGLQGPEDAPYD------------- 182
Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
L AC KH + +W +R+ FNA I +D +T+ P F+ +
Sbjct: 183 -------------KLHACAKHFAVHSGPEW---NRHEFNAENIAPRDLWETYMPAFKDLV 226
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
++ +MC+YN++ G P C L R+EWG++G + SDC A++ + ++ +T
Sbjct: 227 QKAHVKEVMCAYNRLEGEPCCGNNRLLTHILRDEWGYQGIVVSDCGAISDFWRKGDH-ET 285
Query: 343 HEDSA---AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
H D A AG + +G D+ CG+ + A+ G + E +D ++ L + LG +
Sbjct: 286 HPDKAHASAGAVLSGTDLECGSN-YKSLPEAVKAGLIAESQLDISVKRLLKARFELGEMD 344
Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
D + + V HK LAL AR+ IVLL+N LPL K+ +A++GP
Sbjct: 345 KDV---CWDTIPYSVVDCQAHKDLALRMARESIVLLQNRNNILPLRKDM--KIALVGPNA 399
Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGC 496
N+ G Y G P ++L L+ + S+ Y GC
Sbjct: 400 NDSIMHWGNYNGFPSHTETLYEALKKRLPASQLIYEFGC 438
Score = 106 bits (265), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 82/282 (29%), Positives = 125/282 (44%), Gaps = 59/282 (20%)
Query: 498 DVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQM 547
D+P N A K AD ++ G+ T E E+ DR S+ LP Q
Sbjct: 586 DIPVNLQA----TADKVKDADVILFAGGISPTLEGEEMPVDAEGFRGGDRTSIELPAIQR 641
Query: 548 SLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQI-SSILWIGYPGEAGAKALAEIIFGD 606
LV + + K P++ + G + ++ +S+I ++ YPG+AG A+A+++FGD
Sbjct: 642 QLVGELKKLGK-PIVFINYSGSAMGLA---PESEICDGMIQAWYPGQAGGTAIADVLFGD 697
Query: 607 FNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYS 666
+NP G+LP+T+Y + D M+ GR+YR+ T T ++ FGHGLSYT +
Sbjct: 698 YNPAGKLPVTFYRNTEQLPDFEDYAMK--------GRTYRYMTETPLFRFGHGLSYTTFD 749
Query: 667 YKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDV 726
Y KA +N + E + T I V+N G
Sbjct: 750 YG--------------KARLSQNTFSK----------GETLTLT-------IPVSNTGTR 778
Query: 727 DGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
DG V ++ R P + P L F RV+ G+KEI F
Sbjct: 779 DGEETVQVYLRRPGDADA-PSHTLRAFKRVYVPKGGTKEIKF 819
>gi|387789568|ref|YP_006254633.1| beta-glucosidase-like glycosyl hydrolase [Solitalea canadensis DSM
3403]
gi|379652401|gb|AFD05457.1| beta-glucosidase-like glycosyl hydrolase [Solitalea canadensis DSM
3403]
Length = 742
Score = 248 bits (634), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 230/809 (28%), Positives = 366/809 (45%), Gaps = 136/809 (16%)
Query: 47 PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGI 106
PHF +I R SL+ L+TL+EK+ Q++ + G + L +
Sbjct: 17 PHF----VAAQKKTIDQRVDSLLQLMTLEEKVGQMNQYSGPWAHTGPITED--GNILQQV 70
Query: 107 ASN--GPGVNFNGTVSSV--------------------------TSFPQVLVSAASFNRS 138
G +N NG + T+FP L AAS++
Sbjct: 71 QEGKLGSMLNINGVAHTKELQTLALKSRLKIPLLFGQDVIHGYRTTFPIPLGEAASWD-- 128
Query: 139 LWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVE 197
+ + A +G+ + +AP ++I RDPRWGR E GEDP + S A
Sbjct: 129 ----LEAMEQSARVAATEAAASGIHWTFAPMVDISRDPRWGRVMEGAGEDPYLGSLIAKA 184
Query: 198 FVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNF 257
VK FQG + GD + AC KH AY G
Sbjct: 185 RVKGFQG--------------------------NKLGDINSVMACAKHFAAYGAAIGGR- 217
Query: 258 SRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC----LRGDLFQK 312
+N++ ++++ + + PPF++ E G A+ M S+N +NGVPA L+ D+
Sbjct: 218 ---DYNSVDMSDRTLWEIYLPPFKAAAEAGVAT-FMNSFNDLNGVPASASSYLQRDIL-- 271
Query: 313 ARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGT-CMLRHTQSAI 371
+ +W F G++ SD ++ + ++ +E S A V+ AG D++ + C ++ +
Sbjct: 272 -KGKWNFTGFVVSDWGSIGEMIKHGYVKDCYEASQAAVM-AGSDMDMESRCYTQNLVHLV 329
Query: 372 DKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQG 431
+GKV E ID A+ + + LGLF DP + K + +HK +A D A++
Sbjct: 330 KEGKVPETVIDDAVRRILRKKFELGLFE-DPFRFCDVKREQQALNNPKHKAIARDVAKKS 388
Query: 432 IVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY-TGIPCSPK--SLLRGLEAYV- 487
+VLLKN+ LPL+K A ++AIIGPL + M G + P S S G++ +
Sbjct: 389 VVLLKNNNNVLPLSKQA-KTIAIIGPLAKSERDMLGFWAVDWPDSSYIVSQFEGIQQKLG 447
Query: 488 --SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQ 545
SK YA GC ++ NS +GF EA+ +A+KAD V++ G E + R ++ LPG
Sbjct: 448 NNSKLLYAMGC-NIEDNSTSGFAEAIEVAQKADVVLLSVGERRDMSGEAKSRSTIHLPGV 506
Query: 546 QMSLVTSVARTSKRPVILVLTGGGPLDVSF-AEADSQISSILWIGYPGEAGAKALAEIIF 604
Q L+ ++ T K PV++++ G PL ++ A+ I W+G EAG A+A+++F
Sbjct: 507 QEELIKAIKATGK-PVVVLINAGRPLIFNWTADNADAIVYTWWLG--SEAG-NAIADVLF 562
Query: 605 GDFNPGGRLPMTWYPESFTKVPM--NDMNM----RADSSRQYPGRSYRFYTGTQVYGFGH 658
GD+NP G+LPMT +P S ++P+ N +N D R Y +Y + Y FG+
Sbjct: 563 GDYNPAGKLPMT-FPRSEGQLPIYYNFLNTGRPAENDKDRFYRS-AYNDLSIYPKYAFGY 620
Query: 659 GLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQI 718
GLSYTN+ Y L +E T + +
Sbjct: 621 GLSYTNFKYDNLKLSAE-------------------------------TMNMDGKITLSF 649
Query: 719 SVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
SVTN G+ G VV L+ R S P K+L F ++ +KEI+F +D E+LS
Sbjct: 650 SVTNTGNYTGEEVVQLYLRDEFASLVRPVKELKDFKKILLKPGETKEINFVIDK-EKLSF 708
Query: 779 ANKHGRRILPLGNHVLMVGELRHSLTIET 807
N+ + G +M+G + +++
Sbjct: 709 YNQKLEWVAEPGTFQVMIGSASDDIRLKS 737
>gi|409196594|ref|ZP_11225257.1| beta-glucosidase [Marinilabilia salmonicolor JCM 21150]
Length = 876
Score = 248 bits (634), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 149/448 (33%), Positives = 228/448 (50%), Gaps = 55/448 (12%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
YPF N SLS R K LIS LT++EK L D + AIPRLGI + WWSE+LHG A+N
Sbjct: 33 YPFQNPSLSFEERVKDLISRLTVEEKAALLCDQSDAIPRLGIEKFNWWSEALHGYANN-- 90
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA---------GL 162
SVT FPQ + AASFN L +I +A + E RA Y+ Q L
Sbjct: 91 --------DSVTVFPQPVGMAASFNDELVFDIFNATSDEGRAKYHQAQRRGEENRRFLSL 142
Query: 163 TFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQG-ENWKSDDGGIGFGFREK 221
+ W PN+NIFRDPRWGRGQET GEDP + S V+ VK QG E+ K +R+
Sbjct: 143 SVWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVQVVKGLQGPEDAK---------YRK- 192
Query: 222 RVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFR 280
L AC KH + +W SR+ N ++ ++ +T+ P F+
Sbjct: 193 -----------------LLACAKHYTVHSGPEW---SRHELNVNDVSPREFYETYMPAFK 232
Query: 281 SCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNY 339
+ ++ +MC+Y++++ P C + Q+ R+EWG++ + +DC A+
Sbjct: 233 ALVQDADVRQVMCAYHRLDDEPCCSNTRILQRILRDEWGYEHLVAADCGAITDFHTTHGI 292
Query: 340 TKTHEDSAAGVLKAGMDINC--GTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
+ T +AA L AG D+ C + A++K ++E+ ID +L + + LG
Sbjct: 293 SSTPVHAAATGLLAGTDLECIWDNYHYKMLPEALEKDLIKEEHIDSSLFRVLMGRFELGE 352
Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
+ D + + + EH++LA A+Q + LL+N LPL+K++ +A+IGP
Sbjct: 353 MDDDAIV-PWAQTPASVLNNKEHRELAYKMAQQSMTLLQNKNNVLPLDKSSTQKIAVIGP 411
Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEA 485
++ + G Y G P ++L G+++
Sbjct: 412 NADDEEVLWGNYNGTPIRTITILDGMKS 439
Score = 87.0 bits (214), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 68/263 (25%), Positives = 110/263 (41%), Gaps = 51/263 (19%)
Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
DR + LP Q + + ++ K + + +G ++ +IL Y GE+G
Sbjct: 643 DRTHIELPSVQRNALKALKEAGKTVIFVNFSGSA---IALEPETESCDAILQAWYAGESG 699
Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
+A+A+++FGD+NP G+LP+T+Y S D +M GR+YR YT ++
Sbjct: 700 GQAIADVLFGDYNPSGKLPLTFYRNSDNLGDFEDYSME--------GRTYR-YTNDHLFP 750
Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
FG GLSYTN+ Q G+ + + T
Sbjct: 751 FGFGLSYTNF---------------------------QIGN----AKLSKATITPDETIS 779
Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPC-- 773
+ I V N G DG+ ++ ++ R G P K L GF+RV A + + + + P
Sbjct: 780 INIPVKNTGKRDGTEIIQVYVRKVNDIDG-PIKTLKGFERVDVPAGKTTQANISLPPSSF 838
Query: 774 -----EQLSIANKHGRRILPLGN 791
Q I G+ I+ GN
Sbjct: 839 EFYDWSQRKITVTPGQYIVYYGN 861
>gi|329957143|ref|ZP_08297710.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
gi|328523411|gb|EGF50510.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
Length = 803
Score = 248 bits (634), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 211/773 (27%), Positives = 341/773 (44%), Gaps = 133/773 (17%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
YPF N + +++T + + RLGIP ++ +E +HG+
Sbjct: 121 YPFSNHAEAVNTVQRWFVEET-----------------RLGIPV-DFTNEGIHGLNH--- 159
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLT-FWAPNIN 170
+ T P + +++N+ L G EA+A+ G T +AP ++
Sbjct: 160 --------TKATPLPAPIAIGSTWNKELVRRAGVIAGQEAKAL------GYTNVYAPILD 205
Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
+ RDPRWGR E GE+P +++A E V Q +
Sbjct: 206 VVRDPRWGRTLECYGEEPFLIAALGTEMVNGIQSQG------------------------ 241
Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
++A KH Y + K G + + ++ + F PF+ I+
Sbjct: 242 --------VAATLKHYAVYSVPKGGRDGHCRTDPHVAPRELHELFLYPFKKVIQNSHPMG 293
Query: 291 IMCSYNQVNGVPACLRGD-LFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
+M SYN +GVP L + R E+GF GY+ SD AV + + T++++
Sbjct: 294 VMSSYNDWDGVPVSASYYFLTELLREEYGFDGYVVSDSQAVEFVESKHHVADTYDEAVRQ 353
Query: 350 VLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG 405
VL+AG+++ + + + +++ K+ ID+ + + V+ RLGLF+ P
Sbjct: 354 VLEAGLNVRTHFTPPSDFILPIRRLLEEKKISMATIDKRVSEVLRVKFRLGLFD-RPYVT 412
Query: 406 KYGKLGPDDVCTSEHK-KLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
G D+V ++ + +Q +VLLKN+ LPL+K + + + GPL + +
Sbjct: 413 DTG--AADNVGGADRNMDFVKEMQQQALVLLKNENNILPLDKQRIKKVLVTGPLADEDNF 470
Query: 465 MGGGYTGIPCSPKSLLRGLEAY---VSKTHYASGCH------------DVPCNS--DAGF 507
M Y ++L GL AY V++ YA GC VP N G
Sbjct: 471 MTSRYGPNGLETVTVLAGLRAYLQGVAEVDYAKGCDIVDAGWPATEILPVPMNEREKRGI 530
Query: 508 HEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTG 567
EAV A ++D VI V G D + E R R SL LPG+Q L+ ++ T K PVILVL
Sbjct: 531 AEAVAKAGESDVVIAVLGEDEYRTGESRSRTSLDLPGRQQQLLEALHATGK-PVILVLIN 589
Query: 568 GGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM 627
G PL V++A A I +IL +PG G +AE +FG+ NPGG+L +T +P+S ++ +
Sbjct: 590 GQPLTVNWANA--YIPAILESWFPGCQGGTVIAETLFGEHNPGGKLTVT-FPKSVGQIEL 646
Query: 628 N---DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKA 684
N + P S ++Y FG GLSYT ++Y S+L +S
Sbjct: 647 NFPFKPGSHGSQPKSGPNGSGATRVIGELYPFGFGLSYTTFAY------SDLEVSP---- 696
Query: 685 GSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQG 744
L+Q T + V+++VTN G G VV L+ R S
Sbjct: 697 ------LRQR---------------TQGEYTVKVNVTNTGKRAGDEVVQLYVRDKVSSVI 735
Query: 745 TPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
T + QL GF+RV +++++F + P E L I +++ + G +M+G
Sbjct: 736 TYDSQLRGFERVSLKPGETRQVTFSLKP-EDLQILDRNMNWTVEPGEFEVMIG 787
>gi|418855797|ref|ZP_13410448.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19593]
gi|392820838|gb|EJA76679.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19593]
Length = 765
Score = 248 bits (634), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 211/749 (28%), Positives = 347/749 (46%), Gaps = 139/749 (18%)
Query: 75 QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
++ I+Q+ D A+ RL IP + + + +HG T FP L A+S
Sbjct: 86 RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 129
Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
FN +G A EA GL WAP +++ RDPRWGR E GED + S
Sbjct: 130 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 183
Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
VK+ QG++ D + KH AY +
Sbjct: 184 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 217
Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
G +N + ++ Q + + PP+++ ++ G + +M + N +NG PA L +
Sbjct: 218 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 272
Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
R+EWGFKG SD A+ + ++ ED+ LKAG+D++ ++
Sbjct: 273 VLRDEWGFKGITVSDHGAIKELIKHGTAADP-EDAVRVALKAGVDMSMADEYYSKYLPGL 331
Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
I GKV ++D A ++ +V+ +GLFN DP Y LGP D V T+ H+K
Sbjct: 332 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 386
Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
A + AR+ +VLLKN + LPL K+ ++A++GPL ++ + G ++ G+ ++L
Sbjct: 387 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 444
Query: 482 GLEAYV---SKTHYASGCH-----DV--------------PCNSDAGFHEAVRIAKKADF 519
G++ V +K YA G + D+ P + A EAV+ AK+AD
Sbjct: 445 GIQNAVGDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 504
Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
V+ V G E R ++ +P Q L+T++ T K P++LVL G PL + + D
Sbjct: 505 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 561
Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
Q +IL + G G A+A+++FGD+NP G+LP++ +P S ++P+ ++
Sbjct: 562 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 620
Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
+ +Y R + G +Y FG+GLSYT ++ S++T+S+ +Q+
Sbjct: 621 PEKPNKYTSRYFDEANGP-LYSFGYGLSYTTFTV------SDVTLSSP--------TMQR 665
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
G +VT+ + VTN G +G+ V+ ++ + S P KQL GF
Sbjct: 666 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 708
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
+++ SK +SF +D E L N+
Sbjct: 709 EKITLKPGESKTVSFPID-IEALKFWNQQ 736
>gi|298387490|ref|ZP_06997042.1| beta-glucosidase [Bacteroides sp. 1_1_14]
gi|298259697|gb|EFI02569.1| beta-glucosidase [Bacteroides sp. 1_1_14]
Length = 853
Score = 248 bits (634), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 155/447 (34%), Positives = 226/447 (50%), Gaps = 57/447 (12%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ N + + R LIS LT++EKI L + IPRLGI Y +E+LHG+ G
Sbjct: 30 YKNENAPVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPG--- 86
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG----------LT 163
T FPQ + AA++N L + + ++ EARA +N G LT
Sbjct: 87 -------RFTVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQFSDVLT 139
Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
FW+P +N+ RDPRWGR ET GEDP + FV QG
Sbjct: 140 FWSPTVNMARDPRWGRTPETYGEDPFLSGIMGTAFVNGLQG------------------- 180
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
D L + + KH A + E +R+ N I+E+ + + P F C+
Sbjct: 181 --------DDPHYLKIVSTPKHFAANNEEH----NRFVCNPQISEKQLREYYFPAFEMCV 228
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
++GKA+ IM +YN +N VP L L QK R +WGF+GY+ SDC A + Y KT
Sbjct: 229 KEGKAASIMSAYNALNDVPCTLNSWLLQKVLRQDWGFQGYVVSDCGGPALLVNAHKYVKT 288
Query: 343 HEDSAAGVLKAGMDINCGTCMLR-HTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
E +A +KAG+D+ CG + +A + V + DID A ++ + +++LGLF+
Sbjct: 289 KEAAATLSIKAGLDLECGDDVYDGPLLNAYKQYMVSDADIDSAACHVLTARMKLGLFDSG 348
Query: 402 PRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
R Y K+ P + + EH+++ALDAARQ IVLLKN K LPLN + + S+A++G +N
Sbjct: 349 ER-NPYTKISPSVIGSKEHQQIALDAARQCIVLLKNQKNRLPLNADKLKSIAVVG--INA 405
Query: 462 ISQMGGGYTGIP-CSPKSLLRGLEAYV 487
G Y+G P P S+L+G+ V
Sbjct: 406 GKCEFGDYSGAPVVEPVSILQGIRNRV 432
Score = 130 bits (326), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 86/302 (28%), Positives = 148/302 (49%), Gaps = 47/302 (15%)
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
+ EA + ++ + V+ V G++ + E E +DR + LP Q + + + + +I+VL
Sbjct: 594 YGEAGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNPN-IIVVLV 652
Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
G L V++ D + +I+ YPGE G A+AE++FGD+NP GRLP+T+Y P
Sbjct: 653 AGSSLAVNWM--DEHVPAIVNAWYPGEQGGTAVAEVLFGDYNPAGRLPLTYYKSLDELPP 710
Query: 627 MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGS 686
+D ++ GR+Y+++ G +Y FG+GLSY++++Y L +K G
Sbjct: 711 FDDYDITK-------GRTYKYFKGDVLYPFGYGLSYSSFTYSDL----------QVKDGG 753
Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
DEVT V + N G +G V ++ R+P+ P
Sbjct: 754 -----------------DEVT--------VSFRLKNTGKRNGDEVAQVYVRIPETGGIVP 788
Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIAN-KHGRRILPLGNHVLMVGELRHSLTI 805
K+L GF RV + S+ + +D EQL + + G+ ++P G +MVG + +
Sbjct: 789 LKELKGFRRVPLKSGESRRVEIKLDK-EQLRYWDVEKGQFVVPKGAFDVMVGASSKDIRL 847
Query: 806 ET 807
+T
Sbjct: 848 QT 849
>gi|194446334|ref|YP_002041437.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Newport str. SL254]
gi|194469038|ref|ZP_03075022.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Kentucky str. CVM29188]
gi|195874002|ref|ZP_02700028.2| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. SL317]
gi|205358223|ref|ZP_02654575.2| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
subsp. enterica serovar Kentucky str. CDC 191]
gi|205359498|ref|ZP_02830128.2| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
subsp. enterica serovar Weltevreden str. HI_N05-537]
gi|417374418|ref|ZP_12144175.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Inverness str. R8-3668]
gi|418765082|ref|ZP_13321175.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 35199]
gi|418772136|ref|ZP_13328140.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 21539]
gi|418825678|ref|ZP_13380951.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 22462]
gi|194404997|gb|ACF65219.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. SL254]
gi|194455402|gb|EDX44241.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Kentucky str. CVM29188]
gi|195631424|gb|EDX49984.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. SL317]
gi|205335699|gb|EDZ22463.1| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
subsp. enterica serovar Kentucky str. CDC 191]
gi|205344851|gb|EDZ31615.1| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
subsp. enterica serovar Weltevreden str. HI_N05-537]
gi|353599534|gb|EHC55680.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Inverness str. R8-3668]
gi|392732250|gb|EIZ89461.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 21539]
gi|392742031|gb|EIZ99126.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 35199]
gi|392812566|gb|EJA68550.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 22462]
Length = 755
Score = 248 bits (634), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 211/749 (28%), Positives = 347/749 (46%), Gaps = 139/749 (18%)
Query: 75 QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
++ I+Q+ D A+ RL IP + + + +HG T FP L A+S
Sbjct: 76 RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 119
Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
FN +G A EA GL WAP +++ RDPRWGR E GED + S
Sbjct: 120 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 173
Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
VK+ QG++ D + KH AY +
Sbjct: 174 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 207
Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
G +N + ++ Q + + PP+++ ++ G + +M + N +NG PA L +
Sbjct: 208 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 262
Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
R+EWGFKG SD A+ + ++ ED+ LKAG+D++ ++
Sbjct: 263 VLRDEWGFKGITVSDHGAIKELIKH-GTAADPEDAVRVALKAGVDMSMADEYYSKYLPGL 321
Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
I GKV ++D A ++ +V+ +GLFN DP Y LGP D V T+ H+K
Sbjct: 322 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 376
Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
A + AR+ +VLLKN + LPL K+ ++A++GPL ++ + G ++ G+ ++L
Sbjct: 377 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 434
Query: 482 GLEAYV---SKTHYASGCH-----DV--------------PCNSDAGFHEAVRIAKKADF 519
G++ V +K YA G + D+ P + A EAV+ AK+AD
Sbjct: 435 GIQNAVGDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 494
Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
V+ V G E R ++ +P Q L+T++ T K P++LVL G PL + + D
Sbjct: 495 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 551
Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
Q +IL + G G A+A+++FGD+NP G+LP++ +P S ++P+ ++
Sbjct: 552 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 610
Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
+ +Y R + G +Y FG+GLSYT ++ S++T+S+ +Q+
Sbjct: 611 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 655
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
G +VT+ + VTN G +G+ V+ ++ + S P KQL GF
Sbjct: 656 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 698
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
+++ SK +SF +D E L N+
Sbjct: 699 EKITLKPGESKTVSFPID-IEALKFWNQQ 726
>gi|410096880|ref|ZP_11291865.1| hypothetical protein HMPREF1076_01043 [Parabacteroides goldsteinii
CL02T12C30]
gi|409225497|gb|EKN18416.1| hypothetical protein HMPREF1076_01043 [Parabacteroides goldsteinii
CL02T12C30]
Length = 799
Score = 248 bits (634), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 213/772 (27%), Positives = 328/772 (42%), Gaps = 131/772 (16%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
YPF N + +I+ + I RLGIP ++ +E +HG+
Sbjct: 118 YPFSNHAEAINKTQRWFIEET-----------------RLGIPV-DFSNEGIHGLNH--- 156
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAM-YNLGQAGLTFWAPNIN 170
+ T P + +++NR L G EA+A+ YN +AP ++
Sbjct: 157 --------TKATPLPAPINIGSTWNRDLVHQAGDIAGKEAKALGYN------NVYAPILD 202
Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
+ RDPRWGR ET GEDP +V ++ VK Q
Sbjct: 203 VARDPRWGRVLETYGEDPYLVGELGIQMVKGIQQNG------------------------ 238
Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
+++ KH Y + K G + + + ++ + PF+ +++
Sbjct: 239 --------VASTLKHFAVYSIPKGGRDAAVRTDPHVAPRELHEIHLYPFKRVVQKAHPKG 290
Query: 291 IMCSYNQVNGVPACLRGD-LFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
+M SYN +GVP L Q R E+GFKGYI SD +AV + + ++E++
Sbjct: 291 VMSSYNDWDGVPVTASYYFLTQLLRQEYGFKGYIVSDSEAVEFVQTKHHVADSYEEAVRQ 350
Query: 350 VLKAGMDINCGTCMLRH----TQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG 405
V++AG+++ + + + +GK+ K +DR + ++ V+ LGLF D
Sbjct: 351 VVEAGLNVRTNFTHPKDYILPVRKLVKEGKLSMKSVDRMVADVLRVKFELGLF--DSPYV 408
Query: 406 KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQM 465
K K V +H+ LD +Q +VLLKN+ LPL+KN + I GPL + M
Sbjct: 409 KDPKAADKIVGADKHRDFVLDMQKQSLVLLKNENNLLPLDKNQTKKVLIAGPLAKETNYM 468
Query: 466 GGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCHDVPCN--------------SDAGFH 508
Y ++ G++ Y+ ++ YA GC N G
Sbjct: 469 ISRYGPQGLDNITVYDGIKDYLGNQTEVVYAKGCEVKDANWPDSEIVPTPLTDEEKKGIA 528
Query: 509 EAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGG 568
EA A D +I V G D + E + R L LPG+Q L+ ++ T K PV+LVL G
Sbjct: 529 EAATAAADCDVIIAVLGEDESCTGESKSRTGLDLPGRQQQLLEALHATGK-PVVLVLING 587
Query: 569 GPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMN 628
PL +++A D I SIL +PG+ G +A+A+ +FGD+NPGGRL +T +P S ++ N
Sbjct: 588 QPLTINWA--DRNIPSILEAWFPGQLGGEAIAQTLFGDYNPGGRLSVT-FPRSIGQIEFN 644
Query: 629 DMNMRADSSRQY---PGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAG 685
QY P S R +Y FG+GLSYT ++Y LS E S S
Sbjct: 645 FPFKPGSQDGQYFEGPNGSGRTRVNGALYPFGYGLSYTTFAYSNLSVKQETPYSQSPVT- 703
Query: 686 SDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT 745
V + VTN G G VV L+ R S
Sbjct: 704 ------------------------------VTVDVTNTGKRAGDEVVQLYIRDKVSSVIA 733
Query: 746 PEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
E L GF+R+ +K +SF + P E L I ++H + G + +G
Sbjct: 734 YESVLRGFERISLQPGETKTVSFVLLP-EDLQILDRHMEWTVEPGEFEVRIG 784
>gi|153807033|ref|ZP_01959701.1| hypothetical protein BACCAC_01310 [Bacteroides caccae ATCC 43185]
gi|423219984|ref|ZP_17206480.1| hypothetical protein HMPREF1061_03253 [Bacteroides caccae
CL03T12C61]
gi|149130153|gb|EDM21363.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
caccae ATCC 43185]
gi|392624247|gb|EIY18340.1| hypothetical protein HMPREF1061_03253 [Bacteroides caccae
CL03T12C61]
Length = 786
Score = 248 bits (633), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 207/735 (28%), Positives = 337/735 (45%), Gaps = 127/735 (17%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ N T FP A++N+ L I A
Sbjct: 137 RLGIPV-DFTNEGIRGLCHN-----------RATMFPAQCGQGATWNKKLIREIAKVTAD 184
Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
EA+A+ G T +AP ++I +DPRWGR E+ GEDP + + + Q E
Sbjct: 185 EAKAL------GYTNIYAPILDIAQDPRWGRVVESYGEDPYLAGELGKQMILGLQAEG-- 236
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
L+A KH Y + G + +
Sbjct: 237 ------------------------------LAATPKHFAVYSIPVGGRDGGTRTDPHVAP 266
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDC 327
++ + + PFR I++ A +M SYN +G P + R +WGFKGY+ SD
Sbjct: 267 REMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYVVSDS 326
Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTC----MLRHTQSAIDKGKVQEKDIDR 383
+AV + T T E+ AA V+ AG++I + + AI +GK+ +D+
Sbjct: 327 EAVEFLHTKHRITPTEEEMAAQVVNAGLNIRTNFTPPQDFILPLRRAISEGKISLHTLDQ 386
Query: 384 ALLNLFSVQLRLGLFN----GDPRKGKYGKLGPDDVC-TSEHKKLALDAARQGIVLLKND 438
+ + V+ LGLF+ GD R P+ V + H+++++ AA + IVLLKN+
Sbjct: 387 RVGEILRVKFMLGLFDNPYPGDDRH-------PETVVHNAAHQEVSMKAALESIVLLKNE 439
Query: 439 KKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGC 496
+ LPL+K +++ +A+IGP + ++ Y K++ +G++ Y+ ++ YA GC
Sbjct: 440 NQMLPLSK-SLNKIAVIGPNAEEVKELTCRYGPAHAPIKTVYQGIKEYLPNAEVSYAKGC 498
Query: 497 ------------HDVPCNS--DAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLL 542
++VP ++ A +EAV +AK +D I+V G + E+ R SL L
Sbjct: 499 NIIDKYFPESELYNVPLDTQEQAMINEAVELAKVSDIAILVLGGNEKTVREEFSRTSLDL 558
Query: 543 PGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEI 602
G+Q L+ +V T K PV+LV+ G +++ A+ + +I+ +PGE A+A++
Sbjct: 559 CGRQQQLLEAVYATGK-PVVLVMVDGRAATINW--ANKYVPAIVHAWFPGEFMGNAIAKV 615
Query: 603 IFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
+FGD+NPGGRL +T +P+S +VP S+ R +Y FG+GLSY
Sbjct: 616 LFGDYNPGGRLAVT-FPKSVGQVPFAFPFKPGSDSKG------RVRVDGVLYPFGYGLSY 668
Query: 663 TNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTN 722
T + Y S L IS + G +N+ SC V N
Sbjct: 669 TTFEY------SALKISKPV-IGPQENM---------------TLSCI---------VKN 697
Query: 723 AGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
G G VV L+ R S T +K L GF+R+H + ISF + P + L + +K+
Sbjct: 698 TGKRAGDEVVQLYIRDDFSSVTTYDKMLRGFERIHLQPGEEQTISFTLTP-QDLGLWDKN 756
Query: 783 GRRILPLGNHVLMVG 797
+ + G+ +M+G
Sbjct: 757 NQFTVEPGSFSIMIG 771
>gi|293371041|ref|ZP_06617583.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
gi|292633971|gb|EFF52518.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
Length = 791
Score = 248 bits (633), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 210/754 (27%), Positives = 331/754 (43%), Gaps = 141/754 (18%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP + E+ HG + G +T FP + AA+++ L +G +A
Sbjct: 138 RLGIPMF-LAEEAPHGHMAIG-----------ITVFPTGIGMAATWSPELVKEVGQVIAK 185
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
E R+ Q G + P +++ RDPRW R +ET GEDP++ V N
Sbjct: 186 EIRS-----QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGTLGAAMVDGLINGNISR 240
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
+ I A KH +AY + + G A++ +
Sbjct: 241 KNSTI--------------------------ATLKHFLAYAVPEGGQNGN---QALVGMR 271
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCD 328
+ + F PPF+ I+ G S +M SYN ++G+P L Q RNEW F+G++ SD
Sbjct: 272 ELHENFLPPFKKAIDAGALS-VMTSYNSIDGIPCTANSYLLNQLLRNEWKFRGFVVSDLY 330
Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
++ I+E + ED+A + AG+D++ G + A+ + ++ E ID + +
Sbjct: 331 SIEGIYESHYTASSIEDAAIQAVSAGVDVDLGGEAYTNIYRAVKEKRLSEAIIDEVVCRV 390
Query: 389 FSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
++ +GLF DP ++ + V + H A A+ + LLKN LPL+
Sbjct: 391 LRLKFEMGLFENPYVDP------QIAIERVRNANHIANARRMAQASVTLLKNRHDILPLS 444
Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSK-----THYASGC--HD 498
KN + +A+IGP +N M G YT P +++ L+ +SK Y GC D
Sbjct: 445 KN-IRKVAVIGPNADNCYNMLGDYTA-PQKDENIKTVLDGIISKLSLSRVEYVRGCAIRD 502
Query: 499 VPCNSDAGFHEAVRIAKKADFVIVVAG------------------LDLTQET-----EDR 535
N A +AV A +AD VI V G D +Q + E
Sbjct: 503 TTNNEIA---KAVEAANRADVVIAVVGGSSARDFKTTYKETGAAIADKSQISDMECGEGF 559
Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
DR +L L G+Q+ L+ S+ +++++P+I+V G PL+ ++A + ++L YPG+ G
Sbjct: 560 DRATLSLLGKQLELLESL-KSTRKPLIVVYIEGRPLNKNWAAEHA--DALLTAYYPGQEG 616
Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
A+A+++FGD+NP GRLP++ P S ++P+ N + Y S + +Y
Sbjct: 617 GDAIADVLFGDYNPAGRLPVS-VPRSEGQIPVY-YNKKTPKCHDYVEMS-----ASPLYS 669
Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
FG+GLSY+ + Y L +VT L F
Sbjct: 670 FGYGLSYSTFEYSNL----------------------------------KVTQQAPLHFE 695
Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
+ V N G DG V L+ R S +QL F R K I F + E
Sbjct: 696 ISFDVENTGKYDGEEVAQLYIRDEYASVVRALRQLKHFKRFFLKQGEKKTIVFTLVE-ED 754
Query: 776 LSIANKHGRRILPLGNHVLMVG----ELRHSLTI 805
LSI N+ RI+ G+ LM+G ++R S TI
Sbjct: 755 LSIINQKMERIVEPGSFQLMIGAASDDIRLSKTI 788
>gi|427387416|ref|ZP_18883472.1| hypothetical protein HMPREF9447_04505 [Bacteroides oleiciplenus YIT
12058]
gi|425725577|gb|EKU88448.1| hypothetical protein HMPREF9447_04505 [Bacteroides oleiciplenus YIT
12058]
Length = 733
Score = 248 bits (633), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 206/739 (27%), Positives = 344/739 (46%), Gaps = 102/739 (13%)
Query: 67 SLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFP 126
SLI L T + Q+ A RLGIP + + +HG+ T +P
Sbjct: 79 SLIYLHTDPKLRNQIQRKAMEESRLGIPIL-FGFDVIHGLR---------------TVYP 122
Query: 127 QVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGE 186
L A SFN L + A E+ L TF +P I++ RDPRWGR E GE
Sbjct: 123 ISLAQACSFNPDLVTQACGMAAKESV----LSGIDWTF-SPMIDVARDPRWGRISECYGE 177
Query: 187 DPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHL 246
DP + + + V V+ +QGE D ++AC KH
Sbjct: 178 DPYLNTVFGVASVQGYQGEKL--------------------------SDPYSIAACLKHY 211
Query: 247 IAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLR 306
+ Y + G RY+ I+ Q +T+ PP+ +C++ G A+ +M S+N ++GVPA
Sbjct: 212 VGYGASEGGRDYRYT---DISPQALWETYLPPYEACVKAG-AATLMSSFNDISGVPATSN 267
Query: 307 GDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINC-GTCML 364
+ + +N+W G++ SD +A+ + YQ K +++A AG++++
Sbjct: 268 HYILTEILKNKWRHDGFVVSDWNAIEQLI-YQGVAKDRKEAAYKAFHAGVEMDMRDNIYY 326
Query: 365 RHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLA 424
+ + + + K+Q ID A+ + V+ RLGLF+ +P + + + E LA
Sbjct: 327 EYLEQLVAEKKIQMSQIDDAVARILRVKFRLGLFD-EPYTKELTE--QERYLQKEDIALA 383
Query: 425 LDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG--YTGIPCSPKSLLRG 482
A + +VLLKN+ LPL+ + V +A+IGP+ + + + G + G +++ G
Sbjct: 384 ARLAEESMVLLKNENNLLPLS-STVKRVALIGPMAKDSANLLGAWAFKGHAEDVETIYEG 442
Query: 483 LEAYVS---KTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVS 539
++ + Y GC + N ++GF A++ A+ +D V+V G E+ R +
Sbjct: 443 MQKEFGDKVQLDYEQGCA-LDGNDESGFSAALKTAEASDVVVVCLGESKQWSGENASRST 501
Query: 540 LLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKAL 599
+ LP Q L+ + + +K P++LVL+ G PL++ + Q+ +I+ + PG AG L
Sbjct: 502 IALPDIQEKLLLHLKQANK-PIVLVLSSGRPLEL--IRLEPQVEAIIEMWQPGVAGGTPL 558
Query: 600 AEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGR-SYRFYTGTQVYGFGH 658
A I+ G NP G+L +T +P S ++P+ NMR S+R + Y+ +Y FGH
Sbjct: 559 AGILSGRVNPSGKLSVT-FPLSTGQIPVY-YNMR-QSARPFDAMGDYQDIPTKPLYPFGH 615
Query: 659 GLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQI 718
GLSYT + Y S+ +S SLK ++ I ++
Sbjct: 616 GLSYTTFVY------SDAKLS-SLKIRKNQKIT------------------------AEV 644
Query: 719 SVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
+VTNAG ++G V+ + P S P K+L F++ A S+ F +DP LS
Sbjct: 645 TVTNAGKMEGKETVLWYVSDPFCSISRPMKELKFFEKHSLNAGESRVFRFEIDPMRDLSY 704
Query: 779 ANKHGRRILPLGNHVLMVG 797
+ G+R L G ++ VG
Sbjct: 705 TDATGKRFLEAGEFIVSVG 723
>gi|380692997|ref|ZP_09857856.1| beta-glucosidase [Bacteroides faecis MAJ27]
Length = 837
Score = 248 bits (633), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 158/447 (35%), Positives = 222/447 (49%), Gaps = 57/447 (12%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ N + I R + L+S LT++EK+ L + I R+GI Y +E+LHGI G
Sbjct: 14 YKNMNAPIHERVQDLLSKLTIEEKVSLLRATSPGIERMGIDKYYMGNEALHGIIRPG--- 70
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG----------LT 163
T FPQ + A+ +N L I ++ EARA +N + G LT
Sbjct: 71 -------KFTVFPQAIGLASMWNPELHHIIAGVISDEARARWNELERGKKQKDQFSDLLT 123
Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
FW+P +N+ RDPRWGR ET GEDP + FVK QG++ R
Sbjct: 124 FWSPTVNMARDPRWGRTPETYGEDPYLSGVLGTAFVKGLQGDH--------------PRY 169
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
LK A KH A + E +R+ +A ITE D + + P F CI
Sbjct: 170 LKAV-------------ATPKHFAANNEEH----NRFYCDAAITETDLREYYFPAFEKCI 212
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
+GKA IM +YN +NGVP L K + +WGF GYI SDC A + Y KT
Sbjct: 213 REGKAESIMTAYNAINGVPCTANNWLLNKVLKQDWGFNGYIVSDCGAPGLLMTDHRYVKT 272
Query: 343 HEDSAAGVLKAGMDINCGTCMLRHTQ-SAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
E +A +KAG+D+ CG + + +A + V +ID A + ++RLG+F+ D
Sbjct: 273 PEAAAMIAIKAGLDVECGDYVFANPLLNAYKQYMVSAAEIDSAAYRVLRARMRLGMFD-D 331
Query: 402 PRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
P K Y L P+ V +H LAL+AARQ IVLLKN + LPLN + S+A++G +N
Sbjct: 332 PEKNPYNHLSPEIVGCKKHHDLALEAARQSIVLLKNQQNTLPLNAQKIKSIAVVG--INA 389
Query: 462 ISQMGGGYTGIPC-SPKSLLRGLEAYV 487
+ G Y+G P +P S+L G+ V
Sbjct: 390 ANCEFGDYSGTPVNAPVSVLDGIRNRV 416
Score = 137 bits (345), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 85/302 (28%), Positives = 150/302 (49%), Gaps = 49/302 (16%)
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
+ +A +I +++D VI V G++ + E E +DR S+ LP Q + + + +++++
Sbjct: 578 YGDASKIIRESDVVIAVMGINQSIEREGQDRNSIELPKDQQIFIREAYKANPNTIVVLVA 637
Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
G ++ D I +I+ YPGE G A+AE++FGD+NP GRLP+T+Y S +P
Sbjct: 638 GSS---MAIGWMDQHIPAIIDAWYPGEQGGTAIAEVLFGDYNPAGRLPLTFY-NSIEDLP 693
Query: 627 -MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAG 685
+D N++ + R+Y ++ G +Y FG+GLSYT + Y+ L+
Sbjct: 694 AFDDYNVKNN-------RTYMYFEGKPLYAFGYGLSYTKFDYRNLN-------------- 732
Query: 686 SDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT 745
I + T +L F S+ N+G +G V ++ + P T
Sbjct: 733 -----------------IKQDTQNVTLNF----SIKNSGKYNGDEVAQVYVKFPDQGIKT 771
Query: 746 PEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRIL-PLGNHVLMVGELRHSLT 804
P KQL GF RVH +++IS + P E+L + + ++ P G + MVG+ ++
Sbjct: 772 PLKQLKGFKRVHIKKGATEQISIEI-PKEELRLWDDQKKQFYTPSGTYHFMVGKSSDNIC 830
Query: 805 IE 806
++
Sbjct: 831 LQ 832
>gi|423287910|ref|ZP_17266761.1| hypothetical protein HMPREF1069_01804 [Bacteroides ovatus
CL02T12C04]
gi|392671925|gb|EIY65396.1| hypothetical protein HMPREF1069_01804 [Bacteroides ovatus
CL02T12C04]
Length = 782
Score = 248 bits (633), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 216/750 (28%), Positives = 334/750 (44%), Gaps = 135/750 (18%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP + E+ HG + G T FP + AA+++ L +G +A
Sbjct: 129 RLGIPMF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSLELVKEVGQVIAK 176
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
E R+ Q G + P +++ RDPRW R +ET GEDP++ V G N
Sbjct: 177 EIRS-----QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGILGASMVDGLGGGNLSQ 231
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
I A KH +AY + + G Y A + +
Sbjct: 232 KYATI--------------------------ATLKHFLAYAVPEGGQNGNY---ASVGIR 262
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCD 328
D F PPFR I+ G S +M SYN ++G+P L Q RNEW F G++ SD
Sbjct: 263 DLHQNFLPPFRKAIDSGALS-VMTSYNSIDGIPCTSNHYLLTQLLRNEWKFCGFVVSDLY 321
Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
++ I E T E++A + AG+D++ G + A+ G++ + ID A+ +
Sbjct: 322 SIEGIHESHFVALTKENAAIQSVTAGVDVDLGGDAYTNLCHAVQSGQMDKAVIDTAVCRV 381
Query: 389 FSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
++ +GLF DP K+ V EH +LA A+ I LLKN+ LPL+
Sbjct: 382 LRMKFEMGLFEHPYVDP------KIAAKTVRRKEHIELARKIAQSSITLLKNENSILPLS 435
Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTGIP--CSPKSLLRGLEAYVS--KTHYASGC--HDV 499
K ++ +A+IGP +N M G YT + K++L G+ +S + Y GC D
Sbjct: 436 KT-INKVAVIGPNADNRYNMLGDYTAPQEDSNVKTVLDGILTKLSPFRVEYVRGCAIRDT 494
Query: 500 PCNSDAGFHEAVRIAKKADFVIVVAG----LDLT---QET----------------EDRD 536
N +A++ A++++ VIVV G D +ET E D
Sbjct: 495 TVNE---IEQAIKAARRSEVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFD 551
Query: 537 RVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGA 596
R SL L G+Q L+ S+ +T K P+I+V G PL+ ++A ++L YPG+ G
Sbjct: 552 RASLSLLGRQQELLESLQKTGK-PLIVVYIEGRPLEKNWA--SEYADALLTAYYPGQEGG 608
Query: 597 KALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGF 656
A+A+++FGD+NP GRLP++ P S ++P+ N +A + Y S +Y F
Sbjct: 609 NAIADVLFGDYNPSGRLPIS-VPRSVGQIPVY-YNKKAPRNHDYVEMS-----SFPLYSF 661
Query: 657 GHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHV 716
G+G+SYT + Y SD ++Q++ C F V
Sbjct: 662 GYGMSYTTFEY------------------SDLQVVQKSA------------RC----FEV 687
Query: 717 QISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQL 776
V N G DG V L+ R S P KQL F+R H K+++F V E
Sbjct: 688 SFKVKNTGKYDGEEVSQLYMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTF-VLTEEDF 746
Query: 777 SIANKHGRRILPLGNHVLMVGELRHSLTIE 806
+ N ++++ GN LM+G + + ++
Sbjct: 747 FLVNYTLKKVVESGNFHLMIGAASNDIRLQ 776
>gi|383125190|ref|ZP_09945844.1| hypothetical protein BSIG_4346 [Bacteroides sp. 1_1_6]
gi|251838523|gb|EES66609.1| hypothetical protein BSIG_4346 [Bacteroides sp. 1_1_6]
Length = 853
Score = 248 bits (632), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 155/447 (34%), Positives = 226/447 (50%), Gaps = 57/447 (12%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ N + + R LIS LT++EKI L + IPRLGI Y +E+LHG+ G
Sbjct: 30 YKNENAPVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPG--- 86
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG----------LT 163
T FPQ + AA++N L + + ++ EARA +N G LT
Sbjct: 87 -------RFTVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQFSDVLT 139
Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
FW+P +N+ RDPRWGR ET GEDP + FV QG
Sbjct: 140 FWSPTVNMARDPRWGRTPETYGEDPFLSGIMGTAFVNGLQG------------------- 180
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
D L + + KH A + E +R+ N I+E+ + + P F C+
Sbjct: 181 --------DDPHYLKIVSTPKHFAANNEEH----NRFVCNPQISEKQLREYYFPAFEMCV 228
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
++GKA+ IM +YN +N VP L L QK R +WGF+GY+ SDC A + Y KT
Sbjct: 229 KEGKAASIMSAYNALNDVPCTLNPWLLQKVLRQDWGFQGYVVSDCGGPALLVNAHKYVKT 288
Query: 343 HEDSAAGVLKAGMDINCGTCMLR-HTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
E +A +KAG+D+ CG + +A + V + DID A ++ + +++LGLF+
Sbjct: 289 KEAAATLSIKAGLDLECGDDVYDGPLLNAYKQYMVSDADIDSAAYHVLTARMKLGLFDSG 348
Query: 402 PRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
R Y K+ P + + EH+++ALDAARQ IVLLKN K LPLN + + S+A++G +N
Sbjct: 349 ER-NPYTKISPSVIGSKEHQQIALDAARQCIVLLKNQKNRLPLNADKLKSIAVVG--INA 405
Query: 462 ISQMGGGYTGIP-CSPKSLLRGLEAYV 487
G Y+G P P S+L+G+ V
Sbjct: 406 GKCEFGDYSGAPVVEPVSILQGIRNRV 432
Score = 126 bits (316), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 84/302 (27%), Positives = 148/302 (49%), Gaps = 47/302 (15%)
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
+ EA + ++ + V+ V G++ + E E +DR + LP Q + + + + +I+VL
Sbjct: 594 YGEAGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNPN-IIVVLV 652
Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
G L +++ D I +I+ YPGE G A+AE++FGD+NP GRLP+T+Y P
Sbjct: 653 AGSSLAINWM--DEHIPAIVNAWYPGEQGGTAVAEVLFGDYNPAGRLPLTYYKSLDELPP 710
Query: 627 MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGS 686
+D ++ GR+Y+++ G +Y FG+GLSY++++Y L +K G
Sbjct: 711 FDDYDITK-------GRTYKYFKGDVLYPFGYGLSYSSFTYSDL----------QVKDG- 752
Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
+ EVT V + N G +G V ++ R+P+ P
Sbjct: 753 ----------------VGEVT--------VSFRLKNTGKRNGDEVAQVYVRIPETGGIVP 788
Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIAN-KHGRRILPLGNHVLMVGELRHSLTI 805
K+L GF RV + S+ + ++ EQL + + G+ ++P G +MVG + +
Sbjct: 789 LKELKGFRRVPLKSGESRRVEIKLNK-EQLRYWDVEKGQFVVPKGAFDVMVGASSKDIRL 847
Query: 806 ET 807
+T
Sbjct: 848 QT 849
>gi|409250776|ref|YP_006886584.1| beta-D-glucoside glucohydrolase, periplasmic [Salmonella enterica
subsp. enterica serovar Weltevreden str. 2007-60-3289-1]
gi|418760521|ref|ZP_13316675.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 35185]
gi|418775352|ref|ZP_13331310.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 33953]
gi|418781475|ref|ZP_13337358.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 35188]
gi|418783741|ref|ZP_13339586.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 21559]
gi|418789285|ref|ZP_13345072.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 19447]
gi|418794086|ref|ZP_13349808.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 19449]
gi|418796683|ref|ZP_13352374.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 19567]
gi|418803316|ref|ZP_13358937.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 35202]
gi|418809719|ref|ZP_13365271.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 21550]
gi|418813873|ref|ZP_13369394.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 22513]
gi|418815139|ref|ZP_13370647.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Newport str. CVM 21538]
gi|418822928|ref|ZP_13378339.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Newport str. CVM 22425]
gi|418830026|ref|ZP_13384989.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM N18486]
gi|418836408|ref|ZP_13391292.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM N1543]
gi|418840973|ref|ZP_13395796.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 21554]
gi|418844732|ref|ZP_13399518.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 19443]
gi|418850848|ref|ZP_13405564.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 37978]
gi|418857905|ref|ZP_13412528.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 19470]
gi|418862200|ref|ZP_13416744.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 19536]
gi|419789701|ref|ZP_14315381.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Newport str. Levine 1]
gi|419792237|ref|ZP_14317879.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Newport str. Levine 15]
gi|320086604|emb|CBY96375.1| beta-D-glucoside glucohydrolase, periplasmic [Salmonella enterica
subsp. enterica serovar Weltevreden str. 2007-60-3289-1]
gi|392615486|gb|EIW97925.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Newport str. Levine 1]
gi|392618961|gb|EIX01347.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Newport str. Levine 15]
gi|392742601|gb|EIZ99688.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 35185]
gi|392747243|gb|EJA04244.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 35188]
gi|392748425|gb|EJA05411.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 33953]
gi|392756753|gb|EJA13648.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 21559]
gi|392760526|gb|EJA17361.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 19447]
gi|392762081|gb|EJA18897.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 19449]
gi|392770056|gb|EJA26784.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 19567]
gi|392772845|gb|EJA29542.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 35202]
gi|392773804|gb|EJA30500.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 21550]
gi|392775099|gb|EJA31794.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 22513]
gi|392787185|gb|EJA43733.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Newport str. CVM 22425]
gi|392793518|gb|EJA49962.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Newport str. CVM 21538]
gi|392800769|gb|EJA56999.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM N1543]
gi|392802259|gb|EJA58473.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM N18486]
gi|392809639|gb|EJA65673.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 21554]
gi|392813541|gb|EJA69505.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 19443]
gi|392818698|gb|EJA74582.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 37978]
gi|392834583|gb|EJA90187.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 19470]
gi|392836652|gb|EJA92232.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 19536]
Length = 765
Score = 248 bits (632), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 211/749 (28%), Positives = 347/749 (46%), Gaps = 139/749 (18%)
Query: 75 QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
++ I+Q+ D A+ RL IP + + + +HG T FP L A+S
Sbjct: 86 RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 129
Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
FN +G A EA GL WAP +++ RDPRWGR E GED + S
Sbjct: 130 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 183
Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
VK+ QG++ D + KH AY +
Sbjct: 184 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 217
Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
G +N + ++ Q + + PP+++ ++ G + +M + N +NG PA L +
Sbjct: 218 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 272
Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
R+EWGFKG SD A+ + ++ ED+ LKAG+D++ ++
Sbjct: 273 VLRDEWGFKGITVSDHGAIKELIKHGTAADP-EDAVRVALKAGVDMSMADEYYSKYLPGL 331
Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
I GKV ++D A ++ +V+ +GLFN DP Y LGP D V T+ H+K
Sbjct: 332 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 386
Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
A + AR+ +VLLKN + LPL K+ ++A++GPL ++ + G ++ G+ ++L
Sbjct: 387 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 444
Query: 482 GLEAYV---SKTHYASGCH-----DV--------------PCNSDAGFHEAVRIAKKADF 519
G++ V +K YA G + D+ P + A EAV+ AK+AD
Sbjct: 445 GIQNAVGDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 504
Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
V+ V G E R ++ +P Q L+T++ T K P++LVL G PL + + D
Sbjct: 505 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 561
Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
Q +IL + G G A+A+++FGD+NP G+LP++ +P S ++P+ ++
Sbjct: 562 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 620
Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
+ +Y R + G +Y FG+GLSYT ++ S++T+S+ +Q+
Sbjct: 621 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 665
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
G +VT+ + VTN G +G+ V+ ++ + S P KQL GF
Sbjct: 666 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 708
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
+++ SK +SF +D E L N+
Sbjct: 709 EKITLKPGESKTVSFPID-IEALKFWNQQ 736
>gi|423217451|ref|ZP_17203947.1| hypothetical protein HMPREF1061_00720 [Bacteroides caccae
CL03T12C61]
gi|392628610|gb|EIY22636.1| hypothetical protein HMPREF1061_00720 [Bacteroides caccae
CL03T12C61]
Length = 946
Score = 248 bits (632), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 207/735 (28%), Positives = 336/735 (45%), Gaps = 115/735 (15%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ S T+FP L ++NR L +G
Sbjct: 153 RLGIPT-DFTNEGIRGVES-----------YKATNFPTQLGLGHTWNRQLIHQVGLITGR 200
Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
EAR + G T +AP +++ RD RWGR +E GE P +V+ +E V+ Q +
Sbjct: 201 EARML------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGMQHNH-- 252
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
++A KH IAY K + ++
Sbjct: 253 -----------------------------QVAATGKHFIAYSNNKGAREGMARVDPQMSP 283
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVP-ACLRGDLFQKARNEWGFKGYITSDC 327
++ E PF+ I + +M SYN +G P L + R E GF+GY+ SD
Sbjct: 284 REVEMLHAYPFKRVIREAGLLGVMSSYNDYDGFPIQSSYYWLTTRLRGEMGFRGYVVSDS 343
Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDIDR 383
DAV ++ K +++ ++AG+++ C + + + +G + E+ I+
Sbjct: 344 DAVEYLYTKHGTAKDMKEAVRQSVEAGLNVRCTFRSPDSYVLPLRELVKEGGLSEEVIND 403
Query: 384 ALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLP 443
+ ++ V+ +GLF+ + G ++V E++++AL A+R+ IVLLKN+K LP
Sbjct: 404 RVRDILRVKFLVGLFDTPYQTDLKG--ADEEVEKKENEEVALQASRESIVLLKNEKNVLP 461
Query: 444 LNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCHDVP 500
L+ + + +A+ GP + S Y + S+L+G++ + + Y GC V
Sbjct: 462 LDPSKIRKIAVCGPNADEHSYALTHYGPLAVEVTSVLKGIQEKMKDKADVLYTKGCDLVD 521
Query: 501 CN--------------SDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQ 546
N +AV AK+AD IVV G E++ R SL LPG+Q
Sbjct: 522 ANWPESELIDYPLTDEEQKEIDKAVSQAKQADVAIVVLGGGQRTCGENKSRSSLDLPGRQ 581
Query: 547 MSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD 606
+ L+ +V T K PV+LVL G PL +++A D + +IL YPG G A+A+I+FGD
Sbjct: 582 LDLLKAVVATGK-PVVLVLINGRPLSINWA--DKFVPAILEAWYPGSKGGIAVADILFGD 638
Query: 607 FNPGGRLPMTWYPESFTKVPMN---DMNMRADSSRQYPGRSYRFYTGT-QVYGFGHGLSY 662
+NPGG+L +T +P++ ++P N + + D + PG +Y FG+GLSY
Sbjct: 639 YNPGGKLTVT-FPKTVGQIPFNFPCKPSSQIDGGKN-PGPDGNMSRANGALYPFGYGLSY 696
Query: 663 TNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTN 722
T + Y S+L IS ++ + K +V VTN
Sbjct: 697 TTFEY------SDLKISPAIITPNQKA-------------------------YVTCKVTN 725
Query: 723 AGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
G G V+ L+ R S T EK L+GF+RVH +KEI+F +D + L + N
Sbjct: 726 TGKRSGDEVIQLYVRDVLSSVTTYEKNLVGFERVHLKPGETKEITFPID-RKALELLNAD 784
Query: 783 GRRILPLGNHVLMVG 797
++ G+ LM+G
Sbjct: 785 MHWVVEPGDFTLMLG 799
>gi|161502686|ref|YP_001569798.1| hypothetical protein SARI_00733 [Salmonella enterica subsp.
arizonae serovar 62:z4,z23:- str. RSK2980]
gi|160864033|gb|ABX20656.1| hypothetical protein SARI_00733 [Salmonella enterica subsp.
arizonae serovar 62:z4,z23:-]
Length = 771
Score = 248 bits (632), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 212/763 (27%), Positives = 352/763 (46%), Gaps = 137/763 (17%)
Query: 75 QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
++ I+Q+ D A+ RL IP + + + +HG T FP L A+S
Sbjct: 92 RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 135
Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
FN +G A EA GL WAP +++ RDPRWGR E GED + S
Sbjct: 136 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 189
Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
VK+ QG+N D + KH AY +
Sbjct: 190 MGETMVKAMQGKN--------------------------PADRYSVMTSVKHFAAYGAVE 223
Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
G +N++ ++ Q + + PP+++ ++ G + +M + N +NG PA L ++
Sbjct: 224 GGK----EYNSVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKE 278
Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
R+EWGFKG SD A+ + ++ ED+ LK+G+D++ ++
Sbjct: 279 VLRDEWGFKGITVSDHGAIKELIKHGTAADP-EDAVRVALKSGVDMSMADEYYSKYLPGL 337
Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
I GKV ++D A ++ +V+ +GLFN DP Y LGP D V T+ H+K
Sbjct: 338 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 392
Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
A + AR+ +VLLKN + LPL K+ ++A++GPL ++ + G ++ G+ ++L
Sbjct: 393 AREVARESLVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVADQSVTVLA 450
Query: 482 GLEAYV---SKTHYASGCHDV-------------------PCNSDAGFHEAVRIAKKADF 519
G++ V +K YA G + P + A EAV+ AK+AD
Sbjct: 451 GIQNAVGDGAKILYAKGANITNNKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 510
Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
V+ V G E R + +P Q L+T++ T K P++LVL G PL + + D
Sbjct: 511 VVAVVGESQGMAHEASSRTDITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 567
Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM--NDMNMRADSS 637
Q +IL + G G A+A+++FGD+NP G+LPM+ +P S ++P+ + +N
Sbjct: 568 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPMS-FPRSVGQIPVYYSHLNTGRPYD 626
Query: 638 RQYPGR-SYRFYTGTQ--VYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
+ P + + R++ +Y FG+GLSYT ++ S++T+SA +++
Sbjct: 627 TEKPNKYTSRYFDEANGPLYPFGYGLSYTTFTV------SDVTLSAP--------TMKRD 672
Query: 695 GSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFD 754
G+ VT+ + VTN G +G+ V+ ++ + S P KQL GF+
Sbjct: 673 GN---------VTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFE 715
Query: 755 RVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
++ K +SF +D E L N+ + G + +G
Sbjct: 716 KITLKPGERKTVSFPID-IEALKFWNQQMKYDAEPGKFNIFIG 757
>gi|146301613|ref|YP_001196204.1| beta-glucosidase [Flavobacterium johnsoniae UW101]
gi|146156031|gb|ABQ06885.1| Candidate beta-xylosidase; Glycoside hydrolase family 3
[Flavobacterium johnsoniae UW101]
Length = 864
Score = 248 bits (632), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 152/460 (33%), Positives = 236/460 (51%), Gaps = 56/460 (12%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
YP+ + SLS RA+ LIS LTL+EK + D + AIPRLGI + WWSE+LHG A+N
Sbjct: 22 YPYKDPSLSSKKRAEDLISRLTLEEKAALMCDQSDAIPRLGIKKFNWWSEALHGYANN-- 79
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG-QAG--------L 162
+VT FP+ + AASF+ L + AV+ E RA YN Q G L
Sbjct: 80 --------DNVTVFPEPIGMAASFDDQLVFRVFDAVSDEGRAKYNQWIQNGNENKRFLSL 131
Query: 163 TFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKR 222
+ W PN+NIFRDPRWGRGQET GEDP + S V VK QG +R+
Sbjct: 132 SVWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVSVVKGLQGP--------ADAKYRK-- 181
Query: 223 VLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRS 281
L AC KH + +W SR+ N + ++ +T+ P F++
Sbjct: 182 ----------------LLACAKHFAVHSGPEW---SRHELNLNNVKPRELYETYLPAFKA 222
Query: 282 CIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYT 340
+++ +MC+Y +++ P C L Q+ R+EWGF+ + SDC AV + +
Sbjct: 223 LVQEADVRQVMCAYQRLDDEPCCSNTRLLQRILRDEWGFQYLVVSDCGAVTDFYTTHKVS 282
Query: 341 KTHEDSAAGVLKAGMDINC--GTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
+A+ + AG D+ C + A++K ++E++I+++LL + + LG
Sbjct: 283 SDEVHAASKAVLAGTDVECVWDKYPFKKLPEAVEKDLIKEEEINKSLLRVLIGRFDLGEM 342
Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
+ D + ++ + + EH++LAL+ A++ + LL+N LPLNKN V+ +A+IGP
Sbjct: 343 DDDSIV-PWAQIPASVLNSKEHQQLALEMAQKSMTLLQNKNNILPLNKN-VNKVAVIGPN 400
Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYVS--KTHYASGC 496
+N + G Y G P ++ +G+E ++ K Y C
Sbjct: 401 ADNEPMLWGNYNGTPNKTITIKKGIEGKLAANKVLYDKSC 440
Score = 82.8 bits (203), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 68/278 (24%), Positives = 120/278 (43%), Gaps = 56/278 (20%)
Query: 504 DAGFHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSV 553
D F ++ K D VI GL E E+ DR ++ LP Q + +
Sbjct: 589 DIDFKALIQKLKGIDTVIFAGGLSTLLEGEEMPVSFPGFKGGDRTNIELPAVQRKCLKEL 648
Query: 554 ARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRL 613
K+ + + +G ++ +IL YPGE+G +A+A+++FGD+NP G+L
Sbjct: 649 KAAGKKVIFVNCSGSA---IALTPETESCDAILQAWYPGESGGQAVADVLFGDYNPAGKL 705
Query: 614 PMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAP 673
P+++Y S D +++ GR+YR YT ++ FG GLSY+ +
Sbjct: 706 PISFYKNSDKLGDFEDYSLK--------GRTYR-YTTDVLFPFGFGLSYSKF-------- 748
Query: 674 SELTISASLKAGS-DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVV 732
++AG+ KN ++ T L+F S+ N+ G+ +V
Sbjct: 749 -------DIQAGNLSKNKIKPNED-------------TQLKF----SIKNSSKRSGTEIV 784
Query: 733 MLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGV 770
++ R ++G P K L F R+ A + ++ +
Sbjct: 785 QVYVRKLNDTEG-PLKTLKAFKRIELKAGEKQNVTIDL 821
>gi|29347188|ref|NP_810691.1| beta-glucosidase [Bacteroides thetaiotaomicron VPI-5482]
gi|29339087|gb|AAO76885.1| beta-glucosidase (gentiobiase) [Bacteroides thetaiotaomicron
VPI-5482]
Length = 853
Score = 248 bits (632), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 154/447 (34%), Positives = 226/447 (50%), Gaps = 57/447 (12%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ N + + R LIS LT++EKI L + IPRLGI Y +E+LHG+ G
Sbjct: 30 YKNENAPVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPG--- 86
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG----------LT 163
T FPQ + AA++N L + + ++ EARA +N G LT
Sbjct: 87 -------RFTVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQFSDVLT 139
Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
FW+P +N+ RDPRWGR ET GEDP + FV QG
Sbjct: 140 FWSPTVNMARDPRWGRTPETYGEDPFLSGIMGTAFVNGLQG------------------- 180
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
D L + + KH A + E +R+ N I+E+ + + P F C+
Sbjct: 181 --------DDPHYLKIVSTPKHFAANNEEH----NRFVCNPQISEKQLREYYFPAFEMCV 228
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
++GKA+ IM +YN +N VP L L QK R +WGF+GY+ SDC A + Y KT
Sbjct: 229 KEGKAASIMSAYNALNDVPCTLNPWLLQKVLRQDWGFQGYVVSDCGGPALLVNAHKYVKT 288
Query: 343 HEDSAAGVLKAGMDINCGTCMLR-HTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
E +A +KAG+D+ CG + +A + V + DID A ++ + +++LGLF+
Sbjct: 289 KEAAATLSIKAGLDLECGDDVYDGPLLNAYKQYMVSDADIDSAAYHVLTARMKLGLFDSG 348
Query: 402 PRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
R Y K+ P + + EH+++ALDAARQ +VLLKN K LPLN + + S+A++G +N
Sbjct: 349 ER-NPYTKISPSVIGSKEHQQIALDAARQCVVLLKNQKNRLPLNADKLKSIAVVG--INA 405
Query: 462 ISQMGGGYTGIP-CSPKSLLRGLEAYV 487
G Y+G P P S+L+G+ V
Sbjct: 406 GKCEFGDYSGAPVVEPVSILQGIRNRV 432
Score = 127 bits (320), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 84/305 (27%), Positives = 146/305 (47%), Gaps = 53/305 (17%)
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
+ EA + ++ + V+ V G++ + E E +DR + LP Q + + + + +I+VL
Sbjct: 594 YGEAGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNPN-IIVVLV 652
Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
G L +++ D I +I+ YPGE G A+AE++FGD+NP GRLP+T+Y P
Sbjct: 653 AGSSLAINWM--DEHIPAIVNAWYPGEQGGTAVAEVLFGDYNPAGRLPLTYYKSLDELPP 710
Query: 627 MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPS---ELTISASLK 683
+D ++ GR+Y+++ G +Y FG+GLSY++++Y L E+T+S LK
Sbjct: 711 FDDYDITK-------GRTYKYFKGDVLYPFGYGLSYSSFTYSDLQVKDGGGEVTVSFRLK 763
Query: 684 AGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ 743
N G +G V ++ R+P+
Sbjct: 764 --------------------------------------NTGKRNGDEVAQVYVRIPETGG 785
Query: 744 GTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIAN-KHGRRILPLGNHVLMVGELRHS 802
P K+L GF RV + S+ + +D EQL + + G+ ++P G +MVG
Sbjct: 786 IVPLKELKGFRRVPLKSGESRRVEIKLDK-EQLRYWDVEKGQFVVPKGAFDVMVGASSKD 844
Query: 803 LTIET 807
+ ++T
Sbjct: 845 IRLQT 849
>gi|53714352|ref|YP_100344.1| beta-glucosidase [Bacteroides fragilis YCH46]
gi|52217217|dbj|BAD49810.1| periplasmic beta-glucosidase precursor [Bacteroides fragilis YCH46]
Length = 859
Score = 248 bits (632), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 216/759 (28%), Positives = 339/759 (44%), Gaps = 123/759 (16%)
Query: 72 LTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVS 131
LTL ++Q+ + RLGIP + +ESLHG +G T FPQ +
Sbjct: 95 LTLMNEVQKYMREKT---RLGIPVFTL-TESLHGSVHDGS-----------TIFPQAIAL 139
Query: 132 AASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVV 191
++FN L + SA+A E A Q P I++ RD RWGR +E GEDP +V
Sbjct: 140 GSTFNPILAYEMTSAIAKELTA-----QGITQSLTPVIDVCRDLRWGRVEECFGEDPYLV 194
Query: 192 SAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDL 251
S V V+ G+ + +V S KH A+
Sbjct: 195 SRMGVSQVR----------------GYLDNQV----------------SPMIKHFGAHGA 222
Query: 252 EKWG-NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF 310
+ G N + S +++ + F + +++ K +M SYN N P L
Sbjct: 223 PQGGLNLASVSCG----QRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNEPNSSSHYLM 278
Query: 311 QKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQS 369
+ R+ W F+GY+ SD A+ + + + ++A L AG+D Q
Sbjct: 279 TELLRDRWDFQGYVYSDWGAIGMLNYFHKTAQNSAEAAIQALTAGLDAEASDNSYAELQQ 338
Query: 370 AIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD-PRKGKYGKLGPDDVCTSEHKKLALDAA 428
++ G + K ID+A+ + + + +GLF P + Y K+ V H LA A
Sbjct: 339 LVENGMLDVKYIDQAVARILTAKFNMGLFEYPLPMEKNYDKV----VHAPAHVSLARKIA 394
Query: 429 RQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPK--SLLRGLEAY 486
+ IVLL+N+ LPL N + S+A+IGP + + G YT + +LL L+
Sbjct: 395 EESIVLLQNENNILPLQMNKLKSIAVIGPNADQVQ--FGDYTWSRDNKDGVTLLEALKER 452
Query: 487 VSKT---HYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG-------LDLTQET--ED 534
V +YA GC D+ + +GF EAV +AKK+D IVV G D + T E
Sbjct: 453 VGNQLTLNYAKGC-DLVTDDRSGFKEAVDVAKKSDVCIVVVGSASASLARDYSNATCGEG 511
Query: 535 RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEA 594
D L L G Q LV ++ T K PVI+VL G PL +S+ + + I I+ YPGE
Sbjct: 512 FDLSDLTLTGVQEDLVEAIHATGK-PVIVVLLSGKPLAMSWIKEN--IPGIVVQWYPGEQ 568
Query: 595 GAKALAEIIFGDFNPGGRLPMTWYPES-------FTKVPMNDMNMRADSSRQYPGRSYRF 647
G ALA+++ G NP G+L + +P+S + +P + R+ S+ PG+ Y F
Sbjct: 569 GGLALADMLLGKVNPSGKLNYS-FPQSVGHLPCYYNYLPTDKGFYRSPGSKNKPGKDYVF 627
Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVT 707
+ ++ FGHGLSYT++ Y L T S+ DY
Sbjct: 628 SSPKALWAFGHGLSYTDFEY-----------------------LSATTSKEDY------- 657
Query: 708 SCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEIS 767
+C + V I++ N GD DG V ++ R S P ++L GF++V +K++
Sbjct: 658 ACEDV-IEVTIAIRNTGDYDGLEVPQVYVRDMVSSVVMPVQELKGFEKVLIKKGETKQVI 716
Query: 768 FGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
+ P +L++ NK ++++ G L +G + I+
Sbjct: 717 IKI-PVSELALYNKEMKKVVEPGAFELQIGRASDDIRIK 754
>gi|197248682|ref|YP_002147137.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar Agona
str. SL483]
gi|440763077|ref|ZP_20942125.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Agona str. SH11G1113]
gi|440769230|ref|ZP_20948190.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Agona str. SH08SF124]
gi|440771374|ref|ZP_20950293.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Agona str. SH10GFN094]
gi|197212385|gb|ACH49782.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Agona str. SL483]
gi|436415218|gb|ELP13139.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Agona str. SH08SF124]
gi|436421310|gb|ELP19157.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Agona str. SH11G1113]
gi|436422409|gb|ELP20247.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Agona str. SH10GFN094]
Length = 765
Score = 247 bits (631), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 208/749 (27%), Positives = 346/749 (46%), Gaps = 139/749 (18%)
Query: 75 QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
++ I+Q+ D A+ RL IP + + + +HG T FP L A+S
Sbjct: 86 RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 129
Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
FN +G A EA GL WAP +++ RDPRWGR E GED + S
Sbjct: 130 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 183
Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
VK+ QG++ D + KH AY +
Sbjct: 184 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 217
Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
G +N + ++ Q + + PP+++ ++ G + +M + N +NG PA L +
Sbjct: 218 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 272
Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
R+EWGFKG SD A+ + ++ ED+ LKAG+D++ ++
Sbjct: 273 VLRDEWGFKGITVSDHGAIKELIKHGTAADP-EDAVRVALKAGVDMSMADEYYSKYLPGL 331
Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSE-------HKKL 423
I GKV ++D A ++ +V+ +GLFN DP Y LGP + ++ H+K
Sbjct: 332 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPADTNAESRLHRKE 386
Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
A + AR+ +VLLKN + LPL K+ ++A++GPL ++ + G ++ G+ ++L
Sbjct: 387 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 444
Query: 482 GLEAYV---SKTHYASGCH-----DV--------------PCNSDAGFHEAVRIAKKADF 519
G++ V +K YA G + D+ P + A EAV+ AK+AD
Sbjct: 445 GIQNAVGDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 504
Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
V+ V G E R ++ +P Q L+T++ T K P++LVL G PL + + D
Sbjct: 505 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 561
Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
Q +IL + G G A+A+++FGD+NP G+LP++ +P S ++P+ ++
Sbjct: 562 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 620
Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
+ +Y R + G +Y FG+GLSYT ++ S++T+S+ +Q+
Sbjct: 621 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 665
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
G +VT+ + VTN G +G+ V+ ++ + S P KQL GF
Sbjct: 666 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 708
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
+++ SK +SF +D E L N+
Sbjct: 709 EKITLKPGESKTVSFPID-IEALKFWNQQ 736
>gi|383114908|ref|ZP_09935668.1| hypothetical protein BSGG_5166 [Bacteroides sp. D2]
gi|382948422|gb|EIC71783.1| hypothetical protein BSGG_5166 [Bacteroides sp. D2]
Length = 782
Score = 247 bits (631), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 214/750 (28%), Positives = 332/750 (44%), Gaps = 135/750 (18%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP + E+ HG + G T FP + AA+++ L +G +A
Sbjct: 129 RLGIPMF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSPELVKEVGQVIAK 176
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
E R+ Q G + P +++ RDPRW R +ET GEDP++ V G N
Sbjct: 177 EIRS-----QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGILGASMVDGLGGGNLSQ 231
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
I A KH +AY + + G Y A + +
Sbjct: 232 KYATI--------------------------ATLKHFLAYAVPEGGQNGNY---ASVGIR 262
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCD 328
D F PPFR I+ G S +M SYN ++G+P L Q RNEW F+G++ SD
Sbjct: 263 DLHQNFLPPFRKAIDSGALS-VMTSYNSIDGIPCTSNHYLLTQLLRNEWKFRGFVVSDLY 321
Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
++ I E T E++A + AG+D++ G + A+ G++ + ID A+ +
Sbjct: 322 SIEGIHESHFVALTKENAAIQSVTAGVDVDLGGDAYTNLCHAVQSGQMDKAVIDTAVCRV 381
Query: 389 FSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
++ +GLF DP K+ V EH +LA A+ I LLKN+ LPL+
Sbjct: 382 LRMKFEMGLFEHPYVDP------KIAAKTVRRKEHIELARKIAQSSITLLKNENSILPLS 435
Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTGIP--CSPKSLLRGLEAYVS--KTHYASGC--HDV 499
K ++ +A+IGP +N M G YT + K++L G+ +S + Y GC D
Sbjct: 436 KT-INKVAVIGPNADNRYNMLGDYTAPQEDSNVKTVLDGIITKLSPSRVEYVRGCAIRDT 494
Query: 500 PCNSDAGFHEAVRIAKKADFVIVVAG----LDLT---QET----------------EDRD 536
N +A+ A++++ VIVV G D +ET E D
Sbjct: 495 TVNE---IEQAIEAARRSEVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFD 551
Query: 537 RVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGA 596
R SL L G+Q L+ S+ +T K P+I+V G PL+ ++A ++L YPG+ G
Sbjct: 552 RASLSLLGRQQELLESLQKTGK-PLIVVYIEGRPLEKNWA--SEYADALLTAYYPGQEGG 608
Query: 597 KALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGF 656
A+A+++FGD+NP GRLP++ P S ++P+ N +A + Y S + +Y F
Sbjct: 609 NAIADVLFGDYNPSGRLPIS-VPRSVGQIPVY-YNKKAPRNHDYVEVS-----SSPLYSF 661
Query: 657 GHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHV 716
G+G+SYT + Y L +V ++ F V
Sbjct: 662 GYGMSYTTFEYSAL----------------------------------QVVQKSARCFEV 687
Query: 717 QISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQL 776
V N G DG V L+ R S P KQL F+R H K+++F V E
Sbjct: 688 SFKVKNTGKYDGEEVSQLYMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTF-VLTEEDF 746
Query: 777 SIANKHGRRILPLGNHVLMVGELRHSLTIE 806
+ N ++++ GN LM+G + + ++
Sbjct: 747 FLVNYTLKKVVESGNFHLMIGAASNDIRLQ 776
>gi|227536644|ref|ZP_03966693.1| possible beta-glucosidase [Sphingobacterium spiritivorum ATCC
33300]
gi|227243445|gb|EEI93460.1| possible beta-glucosidase [Sphingobacterium spiritivorum ATCC
33300]
Length = 777
Score = 247 bits (631), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 202/731 (27%), Positives = 325/731 (44%), Gaps = 129/731 (17%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP + E+ HG + G T FP + A+++N +L + + VA
Sbjct: 126 RLGIPVF-LAEEAPHGHMAIG-----------TTVFPTGIGQASTWNPALLQKMSATVAK 173
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
E R Q + P +++ RDPRW R +E+ GEDP++ A V
Sbjct: 174 EVRQ-----QGAHISYGPVLDLSRDPRWSRVEESYGEDPVLTGTLAAAIVT--------- 219
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
G+G G + D KH +AY + + G+ A I E+
Sbjct: 220 ---GLGSG--------------NLSDPFATIPTLKHFVAYGIPEGGHNGS---AASIGER 259
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCD 328
+ + F PPF+S + G A +M +YN V+G+P L R EW F G+ SD
Sbjct: 260 ELREYFLPPFQSAVAAG-AKSVMAAYNSVDGIPCSSNKFLLTDILRKEWNFNGFTVSDLG 318
Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
++ I K H+ +A ++AG+D + G A+ +G+VQE ID+A+ +
Sbjct: 319 SIEGIKGSHRVAKDHKQAAILAIEAGLDADLGGNAYVRLIEAVKQGEVQENSIDQAVSRV 378
Query: 389 FSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNA 448
+++ +GLF K K +V T + L+ AR+ IVLL+N LPL K+
Sbjct: 379 LALKFEMGLFEKPFVDAKTAK---KEVKTEANIALSRQVARESIVLLENKNNILPLRKDV 435
Query: 449 VSSLAIIGPLVNNISQMGGGYTG-IPCSPKSLLR-GLEAYVSKTH--YASGCHDVPCNSD 504
+AIIGP +NI M G YT P + +R + A + K Y GC + ++
Sbjct: 436 --KIAIIGPNADNIYNMLGDYTAPQPDGAVTTVRQAISARLPKAQVSYVKGC-SIRDTTN 492
Query: 505 AGFHEAVRIAKKADFVIVVAG-----------------------LDLTQETEDRDRVSLL 541
+ AV A+++D ++ V G + + E DR +L
Sbjct: 493 SDIPAAVTAAQQSDIIVAVVGGSSARDFKTEYISTGAAVASDKSVSDMESGEGFDRSTLD 552
Query: 542 LPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAE 601
L G+QM L+ ++ +T K P++++ G PL++++A + W YPG+ G A+A+
Sbjct: 553 LLGRQMELLKALKQTGK-PLVVIYIQGRPLNMNWAATHADALLCAW--YPGQEGGHAIAD 609
Query: 602 IIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYP-GRSYRFYTGTQVYGFGHGL 660
++FGD+NP G++P++ P S ++P++ +R+ P Y T +Y FG+G
Sbjct: 610 VLFGDYNPAGKMPLS-VPRSVGQIPVH-------YNRKSPLDHRYVEEAATPLYAFGYGK 661
Query: 661 SYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISV 720
SY+++ YK LK D DY V ++
Sbjct: 662 SYSDFEYK------------DLKIQKDNK---------DY--------------RVSFTL 686
Query: 721 TNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIAN 780
TN G DG V L+ R S P +QL F+R+H SK +SF V LS+ N
Sbjct: 687 TNTGKYDGDEVAQLYIRNQYASVSQPVQQLKHFERIHLKTGESKTVSF-VLTAGDLSVIN 745
Query: 781 KHGRRILPLGN 791
+++L G+
Sbjct: 746 TQMKKVLEPGS 756
>gi|329956938|ref|ZP_08297506.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
gi|328523695|gb|EGF50787.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
Length = 944
Score = 247 bits (631), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 204/753 (27%), Positives = 333/753 (44%), Gaps = 123/753 (16%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ S T+FP L ++NR L +G
Sbjct: 153 RLGIPV-DFTNEGIRGVES-----------YKATNFPTQLGLGHTWNRELIRQVGLITGR 200
Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
EAR + G T +AP +++ RD RWGR +E GE P +V+ +E V+ Q +
Sbjct: 201 EARML------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGLQHNH-- 252
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
++A KH AY K + ++
Sbjct: 253 -----------------------------QVAATAKHFAAYSNNKGAREGMSRVDPQMSP 283
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF---QKARNEWGFKGYITS 325
++ E+ PF+ I + IM SYN +G+P ++G + + R E GF+GY+ S
Sbjct: 284 REVENIHIYPFKRVIRETGLLGIMSSYNDYDGIP--VQGSYYWLTTRLRQEMGFRGYVVS 341
Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDI 381
D DAV ++ N K +++ ++AG+++ C + + + +G + E+ I
Sbjct: 342 DSDAVEYLYTKHNTAKDMKEAVRQSVEAGLNVRCTFRSPDSFVLPLRELVKEGGLSEEVI 401
Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF 441
+ + ++ V+ +GLF+ + G ++V + ++ +AL A+R+ +VLLKN
Sbjct: 402 NDRVRDILRVKFLIGLFDSPYQTDLAG--ADNEVEKAANEAVALQASRESVVLLKNADNT 459
Query: 442 LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCH- 497
LPLN + + +A+ GP + Y + ++L G+ ++ Y GC
Sbjct: 460 LPLNIDKIKKIAVCGPNADEEGYALTHYGPLAVEVTTVLEGIREKAQGKAEVLYTKGCDL 519
Query: 498 -----------DVPCNSD--AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPG 544
+ P D A A A++AD +VV G E++ R SL LPG
Sbjct: 520 VDAHWPESEIIEYPLTPDEQAEIDRAAANARQADVAVVVLGGGQRTCGENKSRTSLDLPG 579
Query: 545 QQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIF 604
Q+ L+ +V T K PV+LVL G PL V++A D + +IL YPG G A+A+I+F
Sbjct: 580 HQLKLLQAVQATGK-PVVLVLINGRPLSVNWA--DKFVPAILEAWYPGSKGGTAVADILF 636
Query: 605 GDFNPGGRLPMTWYPESFTKVPMN---DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLS 661
GD+NPGG+L +T +P++ ++P N + D + +Y FG+GLS
Sbjct: 637 GDYNPGGKLTVT-FPKTVGQIPFNFPCKPASQIDGGKNPGADGNMSRINGALYPFGYGLS 695
Query: 662 YTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVT 721
YT + Y S+L IS + K V++ VT
Sbjct: 696 YTTFEY------SDLEISPKVITPDQKAT-------------------------VRLKVT 724
Query: 722 NAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANK 781
N G G VV L+ R S T EK L GF+R+ +KE++F +D + L + N
Sbjct: 725 NTGKRAGDEVVQLYTRDILSSITTYEKNLAGFERIRLKPGETKEVTFTLD-RKHLELLNA 783
Query: 782 HGRRILPLGNHVLMVG------ELRHSLTIETY 808
+ I+ G +M G L LT+E Y
Sbjct: 784 DMKWIVEPGEFAIMAGASSEDIRLNGILTVEDY 816
>gi|397691065|ref|YP_006528319.1| glycoside hydrolase family 3 protein [Melioribacter roseus P3M]
gi|395812557|gb|AFN75306.1| glycoside hydrolase family 3 protein [Melioribacter roseus P3M]
Length = 769
Score = 247 bits (631), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 218/741 (29%), Positives = 341/741 (46%), Gaps = 130/741 (17%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP + E LHG+A+ TS+P + AA+FN L I SA+A
Sbjct: 110 RLGIPVI-FHEECLHGLAA-----------KDATSYPVPIGLAATFNPELIEKIFSAIAE 157
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
+AR+ + P +++ RDPRWGR +ET GED +VS + VK QG
Sbjct: 158 DARS-----RGAHQALTPVVDVVRDPRWGRVEETFGEDTYLVSQMGIASVKGLQG----- 207
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG-NFSRYSFNAMITE 268
DG + +V+ A KH A+ + G N + +F +E
Sbjct: 208 -DGSLN---NNNKVI----------------ATLKHFAAHGQPESGTNCAPANF----SE 243
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDC 327
+ DTF PF+ I++ +M SYN+++G+P+ L +K R+EW FKG++ SD
Sbjct: 244 RFLRDTFLMPFKEAIDKAGVISVMASYNEIDGIPSHANKWLLRKVLRDEWNFKGFVVSDY 303
Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGT--------CMLRHTQSAIDKGKVQEK 379
A+ +F ++ T +H +A V A + + G C T+ + G E
Sbjct: 304 YAITELF-HKEETVSHGVAANKVEAAKLALEAGVNIEFPNPDCYPNLTE-MVKGGLADES 361
Query: 380 DIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDK 439
DID +L + + LGLF+ + + G+ + + ++LAL AAR+ I LLKN+
Sbjct: 362 DIDALVLPMLKYKFELGLFDNPYVEAEPGQF---ENKLEQDRELALQAARETITLLKNEG 418
Query: 440 KFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSK---THYASGC 496
LPL +A+IGP N + GGY G P S+ +G++ V K Y+ GC
Sbjct: 419 NLLPLKD--FKKIAVIGP--NADRTLLGGYHGTPKYYTSVYQGIKDKVGKNGEVFYSEGC 474
Query: 497 HDV--------------PCNSDAGFHEAVRIAKKADFVIVVAGLD--LTQETEDR----D 536
P + +EAV +A+K+D ++V G + ++E ++ D
Sbjct: 475 KITVGGSWNDDEVILPDPAEDEKLINEAVAVAQKSDVAVLVLGGNEQTSREAWNKKHLGD 534
Query: 537 RVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGA 596
R SL L G+Q LV + +T K PV+++L G P + F + + +IL Y G+
Sbjct: 535 RPSLELVGRQNKLVEEILKTGK-PVVVLLFNGRPNSIGFIK--DNVPAILECWYLGQETG 591
Query: 597 KALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGF 656
+A+A+++FGD+NP G+LP++ P S +P + S + R Y F + ++ F
Sbjct: 592 RAVADVLFGDYNPSGKLPVS-IPRSAGHIPAH------YSHKPSARRGYLFDDVSPLFAF 644
Query: 657 GHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHV 716
G+GLSYT +S+ L S+ TISA DE S V
Sbjct: 645 GYGLSYTKFSFDNLRL-SKDTISA-----------------------DEKVS-------V 673
Query: 717 QISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQL 776
I V N G + G VV L+ R S P K+L GF ++ + + F + P E L
Sbjct: 674 SIEVKNEGAIAGEEVVQLYIRDKVSSVTRPVKELKGFRKITLAPGQTSTVVFELLP-EHL 732
Query: 777 SIANKHGRRILPLGNHVLMVG 797
+ N + + G +MVG
Sbjct: 733 AFTNVDMKFTVEPGEFEIMVG 753
>gi|423212854|ref|ZP_17199383.1| hypothetical protein HMPREF1074_00915 [Bacteroides xylanisolvens
CL03T12C04]
gi|392694712|gb|EIY87939.1| hypothetical protein HMPREF1074_00915 [Bacteroides xylanisolvens
CL03T12C04]
Length = 782
Score = 247 bits (631), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 215/750 (28%), Positives = 333/750 (44%), Gaps = 135/750 (18%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP + E+ HG + G T FP + AA+++ L +G +A
Sbjct: 129 RLGIPMF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSPELVKEVGQVIAK 176
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
E R+ Q G + P +++ RDPRW R +ET GEDP++ V G N
Sbjct: 177 EIRS-----QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGILGASMVDGLGGGNLSQ 231
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
I A KH +AY + + G Y A + +
Sbjct: 232 KYATI--------------------------ATLKHFLAYAVPEGGQNGNY---ASVGIR 262
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCD 328
D F PPFR I+ G S +M SYN ++G+P L Q RNEW F+G++ SD
Sbjct: 263 DLHQNFLPPFRKAIDAGALS-VMTSYNSIDGIPCTSNHYLLTQLLRNEWKFRGFVVSDLY 321
Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
++ I E T E++A + AG+D++ G + A+ G++ + ID A+ +
Sbjct: 322 SIEGIHESHFVAPTKENAAIQSVTAGVDVDLGGDAYTNLCHAVQSGQMDKTVIDTAVCRV 381
Query: 389 FSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
++ +GLF DP K+ V EH +LA A+ I LLKN+ LPL+
Sbjct: 382 LRMKFEMGLFEHPYVDP------KIAAKTVRRKEHIELARKIAQSSITLLKNENSILPLS 435
Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTGIP--CSPKSLLRGLEAYVS--KTHYASGC--HDV 499
K ++ +A+IGP +N M G YT + K++L G+ +S + Y GC D
Sbjct: 436 KT-INKVAVIGPNADNRYNMLGDYTAPQEDSNVKTVLDGILTKLSPFRVEYVRGCAIRDT 494
Query: 500 PCNSDAGFHEAVRIAKKADFVIVVAG----LDLT---QET----------------EDRD 536
N +A+ A++++ VIVV G D +ET E D
Sbjct: 495 TVNE---IEQAIEAARRSEVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFD 551
Query: 537 RVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGA 596
R SL L G+Q L+ S+ +T K P+I+V G PL+ ++A ++L YPG+ G
Sbjct: 552 RASLSLLGRQQELLESLQKTGK-PLIVVYIEGRPLEKNWA--SEYADALLTAYYPGQEGG 608
Query: 597 KALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGF 656
A+A+++FGD+NP GRLP++ P S ++P+ N +A + Y S + +Y F
Sbjct: 609 NAIADVLFGDYNPSGRLPIS-VPRSVGQIPVY-YNKKAPRNHDYVEVS-----SSPLYSF 661
Query: 657 GHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHV 716
G+G+SYT + Y SD ++Q++ C F V
Sbjct: 662 GYGMSYTTFEY------------------SDLQVVQKSA------------RC----FEV 687
Query: 717 QISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQL 776
V N G DG V L+ R S P KQL F+R H K+++F V E
Sbjct: 688 SFKVKNTGKYDGEEVSQLYMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTF-VLTEEDF 746
Query: 777 SIANKHGRRILPLGNHVLMVGELRHSLTIE 806
+ N ++++ G +M+G + +E
Sbjct: 747 FLVNYTLKKVVESGTFQVMIGSSSDDIRLE 776
>gi|238912609|ref|ZP_04656446.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Tennessee str. CDC07-0191]
Length = 765
Score = 247 bits (630), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 208/749 (27%), Positives = 346/749 (46%), Gaps = 139/749 (18%)
Query: 75 QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
++ I+Q+ D A+ RL IP + + + +HG T FP L A+S
Sbjct: 86 RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 129
Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
FN +G A EA GL WAP +++ RDPRWGR E GED + S
Sbjct: 130 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 183
Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
VK+ QG++ D + KH AY +
Sbjct: 184 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 217
Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
G +N + ++ Q + + PP+++ ++ G + +M + N +NG PA L +
Sbjct: 218 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 272
Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
R+EWGFKG SD A+ + ++ ED+ LKAG+D++ ++
Sbjct: 273 VLRDEWGFKGITVSDHGAIKELIKH-GTAADPEDAVRVALKAGVDMSMADEYYSKYLPGL 331
Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSE-------HKKL 423
I GKV ++D A ++ +V+ +GLFN DP Y LGP + ++ H+K
Sbjct: 332 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPADTNAESRLHRKE 386
Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
A + AR+ +VLLKN + LPL K+ ++A++GPL ++ + G ++ G+ ++L
Sbjct: 387 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 444
Query: 482 GLEAYV---SKTHYASGCH-----DV--------------PCNSDAGFHEAVRIAKKADF 519
G++ V +K YA G + D+ P + A EAV+ AK+AD
Sbjct: 445 GIQNAVGDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 504
Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
V+ V G E R ++ +P Q L+T++ T K P++LVL G PL + + D
Sbjct: 505 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 561
Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
Q +IL + G G A+A+++FGD+NP G+LP++ +P S ++P+ ++
Sbjct: 562 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 620
Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
+ +Y R + G +Y FG+GLSYT ++ S++T+S+ +Q+
Sbjct: 621 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 665
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
G +VT+ + VTN G +G+ V+ ++ + S P KQL GF
Sbjct: 666 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 708
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
+++ SK +SF +D E L N+
Sbjct: 709 EKITLKPGESKTVSFPID-IEALKFWNQQ 736
>gi|417366389|ref|ZP_12138693.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Hvittingfoss str. A4-620]
gi|353592073|gb|EHC50188.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Hvittingfoss str. A4-620]
Length = 755
Score = 247 bits (630), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 210/749 (28%), Positives = 344/749 (45%), Gaps = 139/749 (18%)
Query: 75 QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
++ I+Q+ D A+ RL IP + + + +HG T FP L A+S
Sbjct: 76 RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 119
Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
FN +G A EA GL WAP +++ RDPRWGR E GED + S
Sbjct: 120 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 173
Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
VK+ QG++ D + KH AY +
Sbjct: 174 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 207
Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
G +N + ++ Q + + PP+++ ++ G + +M + N +NG PA L +
Sbjct: 208 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 262
Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
R+EWGFKG SD A+ + ++ ED+ LKAG+D++ ++
Sbjct: 263 VLRDEWGFKGITVSDHGAIKELIKH-GTAADPEDAVRVALKAGVDMSMADEYYSKYLPGL 321
Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
I GKV ++D A ++ +V+ +GLFN DP Y LGP D V T+ H+K
Sbjct: 322 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 376
Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
A + AR+ +VLLKN + LPL K+ ++A++GPL ++ + G ++ G+ ++L
Sbjct: 377 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 434
Query: 482 GLEAYV---SKTHYASGCHDV-------------------PCNSDAGFHEAVRIAKKADF 519
G++ V +K YA G + P + A EAV+ AK+AD
Sbjct: 435 GIQNAVGDGAKILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 494
Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
V+ V G E R + +P Q L+T++ T K P++LVL G PL + + D
Sbjct: 495 VVAVVGESQGMAHEASSRTDITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 551
Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
Q +IL + G G A+A+++FGD+NP G+LP++ +P S ++P+ ++
Sbjct: 552 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 610
Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
+ +Y R + G +Y FG+GLSYT ++ S++T+S+ +Q+
Sbjct: 611 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 655
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
G +VT+ + VTN G +G+ V+ ++ + S P KQL GF
Sbjct: 656 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 698
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
+++ SK +SF +D E L N+
Sbjct: 699 EKITLKPGESKTVSFPID-IEALKFWNQQ 726
>gi|223936933|ref|ZP_03628842.1| Beta-glucosidase [bacterium Ellin514]
gi|223894502|gb|EEF60954.1| Beta-glucosidase [bacterium Ellin514]
Length = 774
Score = 247 bits (630), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 215/749 (28%), Positives = 343/749 (45%), Gaps = 144/749 (19%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP + E LHG A+ TSFPQ + A+FN +L + + A
Sbjct: 112 RLGIPVM-FHEECLHGHAAR-----------DGTSFPQPIGLGATFNPALVEKLYAMTAH 159
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
E R + G P +++ RD RWGR +ET GEDP + + + V+ FQG+
Sbjct: 160 ETRV-----RGGHQALTPVVDVARDARWGRVEETYGEDPFLNTQLGIAAVRGFQGDA--- 211
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
F++K+ + A KH A+ + G + ++E+
Sbjct: 212 -------SFKDKK---------------HVIATLKHFAAHGQPESG---QNCAPVNVSER 246
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPA-CLRGDLFQKARNEWGFKGYITSDCD 328
+TF PFR C+++G A +M SYN+++GVP+ R L R EWGFKG++ SD
Sbjct: 247 LLRETFLHPFRDCLKKGGAISVMASYNEIDGVPSHASRWLLRDVLRKEWGFKGFVVSDYY 306
Query: 329 AVATIFEYQNYTKTHEDSAAG--------VLKAGMDI-----NCGTCMLRHTQSAIDKGK 375
A I+E + +H A +KAG++I +C RH + K
Sbjct: 307 A---IWELSHRPDSHGHHVAADKKEACVLAVKAGVNIEFPEPDC----YRHLVELVRKKV 359
Query: 376 VQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDD----VCTSEHKKLALDAARQG 431
+ E ++D + + + ++GLF+ DP + P++ V H++LA +AAR+
Sbjct: 360 LHETELDELIAPMLLWKFKMGLFD-DPY------VDPEEAARVVGCEVHRELASEAARET 412
Query: 432 IVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVS--- 488
I LLKN+ LPLN + ++A+IGP N + GGY+G+P ++L G++A +
Sbjct: 413 ITLLKNENDLLPLNPAKLKTVAVIGPNANR--SLLGGYSGVPAHNVTVLDGIKARLGGAV 470
Query: 489 KTHYASGCHDV--------------PCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETED 534
K +A GC P EAV++A AD VIV G + E
Sbjct: 471 KVVHAEGCKITVGGSWQQDEVLASDPAEDRKQIDEAVKVAWSADVVIVAIGGNEQTSREA 530
Query: 535 ------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWI 588
DR SL L G Q L+ ++ T K PV+ ++ G PL ++ + + +IL
Sbjct: 531 WSLKHMGDRTSLDLIGHQDELIRALLATGK-PVVALVFNGRPLAINHVAQN--VPAILEC 587
Query: 589 GYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFY 648
Y G+ A+A ++FGD NPGG+LP++ P S ++P+ S+R R + +
Sbjct: 588 WYLGQECGSAVAAVLFGDHNPGGKLPIS-IPRSVGQLPV--FYNHKPSAR----RGFLWD 640
Query: 649 TGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTS 708
T ++ FG GLSYT +++K + K I+ +TGS
Sbjct: 641 EATPLFPFGFGLSYTKFTFKNVRLA--------------KKIISRTGST----------- 675
Query: 709 CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
HV + VTNAG G+ VV ++ R S P K+L F ++ +K +S
Sbjct: 676 ------HVSVDVTNAGKRAGTEVVQVYVRDLISSVTRPVKELKVFQKITLAPGETKTVSL 729
Query: 769 GVDPCEQLSIANKHGRRILPLGNHVLMVG 797
+ P E L+ + + + ++ G +MVG
Sbjct: 730 DLTP-ESLAFYDVNMKYVVEPGEFEIMVG 757
>gi|224536538|ref|ZP_03677077.1| hypothetical protein BACCELL_01413 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521794|gb|EEF90899.1| hypothetical protein BACCELL_01413 [Bacteroides cellulosilyticus
DSM 14838]
Length = 863
Score = 247 bits (630), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 161/460 (35%), Positives = 235/460 (51%), Gaps = 61/460 (13%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ + SLS RA+ L+ LTL+EK + D + ++ RLGI Y WW+E+LHG+A G
Sbjct: 23 YKDASLSPERRAELLVKELTLEEKAHLMMDGSRSVERLGIKPYNWWNEALHGVARAG--- 79
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARA--MYNLGQ------AGLTFW 165
T FPQ + AASFN + + +AV+ EARA Y Q GLT W
Sbjct: 80 -------LATVFPQPIGMAASFNPEMVYEVFNAVSDEARAKNTYYASQDSRERYQGLTMW 132
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
P +NI+RDPRWGRG ET GEDP + S V VK QG DG
Sbjct: 133 TPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQG----PADG------------- 175
Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIE 284
+ D+ L AC KH + +W +R+SFNA I +D +T+ PPF + ++
Sbjct: 176 ----KYDK-----LHACAKHFAVHSGPEW---NRHSFNAENIKPRDLYETYLPPFEALVK 223
Query: 285 QGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKTH 343
+GK +MC+YN+ G P C L Q R EWGF G + SDC A+A + + + TH
Sbjct: 224 EGKVEEVMCAYNRFEGDPCCGSDRLLMQILRGEWGFDGIVVSDCGAIADFYNDRGH-HTH 282
Query: 344 ED----SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
D SAA V+ +G D+ CG+ +S + KG + E+ +D ++ L + LG +
Sbjct: 283 PDAESASAAAVI-SGTDLECGSSYKALIES-VKKGLISEETVDTSVKRLMKARFALGEMD 340
Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
+P K + K+ V ++ H LAL+ AR+ + LL N FLPL + + ++A++GP
Sbjct: 341 -EPEKVSWTKIPFSVVASAAHDSLALNMARESMTLLMNKDNFLPLKRGGL-TVAVMGPNA 398
Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGC 496
N+ G Y G+P ++L G+ + K Y GC
Sbjct: 399 NDSVMQWGNYNGMPAHTVTILDGVRNLLGTDDKLIYEQGC 438
Score = 116 bits (290), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 100/319 (31%), Positives = 150/319 (47%), Gaps = 65/319 (20%)
Query: 498 DVPCNSDAGFHEAVRI------AKKADFVIVVAGLDLTQETEDR----------DRVSLL 541
D N D GF + V I K AD VI +G+ + E E+ DR +
Sbjct: 575 DAQLNFDLGFKKDVDIRKSVERVKDADIVIFASGISPSLEGEEMGVNLPGFKKGDRTDIE 634
Query: 542 LPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQ-ISSILWIGYPGEAGAKALA 600
LP Q L+ ++ R K+ +ILV G P+ + E ++Q +IL YPG+ G KA+A
Sbjct: 635 LPAVQRELIDALHRAGKK-IILVNCSGSPIGL---EPETQKCEAILQAWYPGQQGGKAVA 690
Query: 601 EIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHG 659
E++FGD+NP G+LP+T+Y + +++P D NM GR+YR+ ++ FG+G
Sbjct: 691 EVLFGDYNPAGKLPVTFY-RNVSQLPDFEDYNMT--------GRTYRYMQDVPLFPFGYG 741
Query: 660 LSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQIS 719
LSYT + Y K DKN E+T+ SL+ V
Sbjct: 742 LSYTTFGYG--------------KTVLDKN---------------ELTAGQSLKLTV--P 770
Query: 720 VTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCE-QLSI 778
VTN G +G VV ++ R ++G P K L F RV A + + F + E +
Sbjct: 771 VTNTGKRNGEEVVQVYLRKQGDAEG-PIKTLRAFKRVSIPAGKTVNVEFDLKDKELEWWD 829
Query: 779 ANKHGRRILPLGNHVLMVG 797
+ R+ P GN+ +MVG
Sbjct: 830 DQSNTVRVCP-GNYDIMVG 847
>gi|365877135|ref|ZP_09416640.1| glycoside hydrolase family protein [Elizabethkingia anophelis Ag1]
gi|442587941|ref|ZP_21006755.1| glycoside hydrolase family protein [Elizabethkingia anophelis R26]
gi|365754995|gb|EHM96929.1| glycoside hydrolase family protein [Elizabethkingia anophelis Ag1]
gi|442562440|gb|ELR79661.1| glycoside hydrolase family protein [Elizabethkingia anophelis R26]
Length = 827
Score = 247 bits (630), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 207/778 (26%), Positives = 346/778 (44%), Gaps = 140/778 (17%)
Query: 51 SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
SYP+ N + +++ K I RLGIP ++ +E +HG+
Sbjct: 143 SYPYSNHTEALNNIQKWFIEET-----------------RLGIPV-DFTNEGIHGL---- 180
Query: 111 PGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNIN 170
T T FP + +++++ L IG+ + EA Y LG + +AP ++
Sbjct: 181 -------THDRATPFPAPINIGSTWDKDLVGKIGNTIGKEA---YYLGYTNV--YAPILD 228
Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
+ RDPRWGR ET GEDP ++ Y KR++KG +
Sbjct: 229 VSRDPRWGRVVETYGEDPFMIGEYG-------------------------KRMVKGIQQN 263
Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
+++ KH Y + K G + + ++ + PF+ I +
Sbjct: 264 G-------VASTLKHYAVYSVPKGGRDGLARTDPHVAPKEMHTMYLYPFKEVIRKEHPLG 316
Query: 291 IMCSYNQVNGVPAC----LRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDS 346
+M SYN +GVP DL R E+GF GY+ SD DA+ + + K +E+
Sbjct: 317 VMASYNDYDGVPVISSKYFLTDLL---RKEYGFDGYVVSDSDALEFLHGKHHVAKDYEEG 373
Query: 347 AAGVLKAGMDINCGTCM----LRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDP 402
L+AG+D+ L A+ GK++E+ ++ + ++ + RLGLF+
Sbjct: 374 IQKALEAGLDVRTNFTQPKEYLTALMDALKSGKIKEEVLNERVRSVLKTKFRLGLFDEPI 433
Query: 403 RKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNI 462
R + K V T E + L++D R+ +VLLKN+K+ LPL+ + ++ I GPL + +
Sbjct: 434 RN--FIKEADRKVHTKEDEALSVDVNRRSVVLLKNEKQTLPLDTGKLKNILITGPLADAV 491
Query: 463 SQMGGGYTGIPCSPKSLLR-GLEAYVSKTH----YASGCHDV-------------PCNSD 504
+ Y G +P + +R G+E Y S H Y G + P +
Sbjct: 492 NYTTSRY-GPSNNPVTTIRKGIEDYASLHHINTSYTKGVDVIDEGWPETEIIPVEPTEKE 550
Query: 505 -AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVIL 563
+ + + +A+K+D +I V G + E R R SL LPG+Q + + +T ++P++L
Sbjct: 551 KSEISKTISMAEKSDVIIAVMGESEKEVGESRSRSSLNLPGKQTYFLQQLYKT-RKPIVL 609
Query: 564 VLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT 623
VL G PL +++ + + +IL + G +AE +FG+ NPGG+LP++ +P+S
Sbjct: 610 VLVNGRPLTINW--ENKYLPAILETWFLGPQSGNIVAETLFGENNPGGKLPIS-FPKSIG 666
Query: 624 KVPMNDMNMRADSSRQYPGRSYRFYTGTQV----YGFGHGLSYTNYSYKFLSAPSELTIS 679
++ MN A + Q PG ++V Y FG+GLSYTN+ + S S+
Sbjct: 667 QLEMNFPTKPAAQAGQ-PGTGPNGSGSSRVTGFLYPFGYGLSYTNFEFTDFSLSSK---- 721
Query: 680 ASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVP 739
+KAG++ H ++ VTN G V G VV L+
Sbjct: 722 -KIKAGNE--------------------------LHAKLKVTNTGKVKGDEVVQLYLSDL 754
Query: 740 KVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
S T E L GF+RV +KE+ F ++ E + + N ++ G + VG
Sbjct: 755 VSSVTTYEMDLRGFERVTLEPGEAKEVQFTLN-KEHMQLLNDKMEWVVEPGEFRVSVG 811
>gi|265766195|ref|ZP_06094236.1| periplasmic beta-glucosidase [Bacteroides sp. 2_1_16]
gi|263253863|gb|EEZ25328.1| periplasmic beta-glucosidase [Bacteroides sp. 2_1_16]
Length = 859
Score = 247 bits (630), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 216/759 (28%), Positives = 339/759 (44%), Gaps = 123/759 (16%)
Query: 72 LTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVS 131
LTL ++Q+ + RLGIP + +ESLHG +G T FPQ +
Sbjct: 95 LTLMNEVQKYMREKT---RLGIPVFTL-TESLHGSVHDGS-----------TIFPQAIAL 139
Query: 132 AASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVV 191
++FN L + SA+A E A Q P I++ RD RWGR +E GEDP +V
Sbjct: 140 GSTFNPILAYEMTSAIAKELTA-----QGITQSLTPVIDVCRDLRWGRVEECFGEDPYLV 194
Query: 192 SAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDL 251
S V V+ G+ + +V S KH A+
Sbjct: 195 SRMGVSQVR----------------GYLDNQV----------------SPMIKHFGAHGA 222
Query: 252 EKWG-NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF 310
+ G N + S +++ + F + +++ K +M SYN N P L
Sbjct: 223 PQGGLNLASVSCG----QRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNEPNSSSHYLM 278
Query: 311 QKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQS 369
+ R+ W F+GY+ SD A+ + + + ++A L AG+D Q
Sbjct: 279 TELLRDRWDFQGYVYSDWGAIGMLNYFHKTAQNSAEAAIQALTAGLDAEASDNSYAELQQ 338
Query: 370 AIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD-PRKGKYGKLGPDDVCTSEHKKLALDAA 428
++ G + K ID+A+ + + + +GLF P + Y K+ V H LA A
Sbjct: 339 LVENGMLDVKYIDQAVARILTAKFNMGLFEYPLPMEKNYDKV----VHAPAHVSLARKIA 394
Query: 429 RQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPK--SLLRGLEAY 486
+ IVLL+N+ LPL N + S+A+IGP + + G YT + +LL L+
Sbjct: 395 EESIVLLQNENNILPLQMNKLKSIAVIGPNADQVQ--FGDYTWSRDNKDGVTLLEALKER 452
Query: 487 VSKT---HYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG-------LDLTQET--ED 534
VS +YA GC D+ + +GF EAV +AKK+D IVV G D + T E
Sbjct: 453 VSNQLTLNYAKGC-DLVTDDCSGFKEAVDVAKKSDVCIVVVGSASASLARDYSNATCGEG 511
Query: 535 RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEA 594
D L L G Q LV ++ T K PVI+VL G P +S+ + + I I+ YPGE
Sbjct: 512 FDLSDLTLTGVQEDLVEAIHATGK-PVIVVLLSGKPFAMSWIKEN--IPGIVVQWYPGEQ 568
Query: 595 GAKALAEIIFGDFNPGGRLPMTWYPES-------FTKVPMNDMNMRADSSRQYPGRSYRF 647
G ALA+++ G NP G+L + +P+S + +P + R+ S+ PG+ Y F
Sbjct: 569 GGLALADMLLGKVNPSGKLNYS-FPQSVGHLPCYYNYLPTDKGFYRSPGSKNKPGKDYVF 627
Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVT 707
+ ++ FGHGLSYT++ Y L T S+ DY
Sbjct: 628 SSPKALWAFGHGLSYTDFEY-----------------------LSATTSKEDY------- 657
Query: 708 SCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEIS 767
+C + V I++ N GD DG V ++ R S P ++L GF++V +K++
Sbjct: 658 ACEDV-IEVTIAIRNTGDYDGLEVPQVYVRDMVSSVVMPVQELKGFEKVLIKKGETKQVI 716
Query: 768 FGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
+ P +L++ NK ++++ G L +G + I+
Sbjct: 717 IKI-PVSELALYNKEMKKVVEPGAFELQIGRASDDIRIK 754
>gi|189464698|ref|ZP_03013483.1| hypothetical protein BACINT_01042 [Bacteroides intestinalis DSM
17393]
gi|189436972|gb|EDV05957.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 862
Score = 247 bits (630), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 153/461 (33%), Positives = 237/461 (51%), Gaps = 57/461 (12%)
Query: 51 SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
+ P+ N LS + RAK L+ LTL+EK + D++ AIPRLGI + WWSE+LHG+A+ G
Sbjct: 20 TLPYQNPELSPAERAKDLVKRLTLEEKALLMCDDSEAIPRLGIKKFNWWSEALHGVANQG 79
Query: 111 PGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN---------LGQAG 161
+VT FP+ + AASFN L I +AV+ E RA +N +
Sbjct: 80 ----------NVTVFPEPVGMAASFNDKLVFEIFNAVSDEMRAKHNERVRNGLEDVRFHS 129
Query: 162 LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQG-ENWKSDDGGIGFGFRE 220
L+ W PN+NIFRDPRWGRGQET GEDP + S + VK QG EN K +R+
Sbjct: 130 LSVWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGIAVVKGLQGPENEK---------YRK 180
Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNA-MITEQDTEDTFQPPF 279
L AC KH + +W SR++ N ++ +D +T+ P F
Sbjct: 181 ------------------LLACAKHYAVHSGPEW---SRHTANLNNVSPRDLWETYLPAF 219
Query: 280 RSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQN 338
++ +++ +MC+Y +++ P C L Q+ R+EWGFK + SDC A+A +
Sbjct: 220 KALVQKADVREVMCAYQRLDDDPCCGNTRLLQQILRDEWGFKYLVVSDCGAIADFWTSHK 279
Query: 339 YTKTHEDSAAGVLKAGMDINCGT-CMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
+ +A AG D+ CG + A+ +G + E+++++ +L L + LG
Sbjct: 280 SSSDAVHAAVKGTMAGTDVECGYGYAYQKLPEAVSRGLITEEEVNKHVLRLMEGRFELGE 339
Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
+ DP + K+ V HK L+L+ +RQ + LL+N LPL+K ++ +A+IGP
Sbjct: 340 MD-DPSLVNWTKIPMSVVNCKAHKDLSLNMSRQTMTLLQNKNNVLPLSK-SIRKIAVIGP 397
Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGC 496
++ + G Y G P ++L G ++ + K Y GC
Sbjct: 398 NADDKPMLWGNYNGTPNQTITILDGFKSKLKKNQIVYMKGC 438
Score = 77.0 bits (188), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 67/272 (24%), Positives = 114/272 (41%), Gaps = 54/272 (19%)
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVART 556
+ E++ K D V+ V G+ E E+ DR + LP Q + + ++
Sbjct: 590 YSESISKLKGIDVVVFVGGISPQLEGEEMPVNIPGFKGGDRTDIELPAVQRNFLKALKDA 649
Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
K+ V + +G ++ +IL Y GE G A+A+++FGD+NP G+LP+T
Sbjct: 650 GKQVVFVNCSGSS---MALLPETESCDAILQAWYGGELGGYAVADVLFGDYNPSGKLPVT 706
Query: 617 WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
+Y + D +M+ GR+YR Y ++ FG GLSYT++
Sbjct: 707 FYKNTKQLPDYEDYSMK--------GRTYR-YMSDPLFPFGFGLSYTDF----------- 746
Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
A A +K L+ DE + T + V+N G G+ +V ++
Sbjct: 747 ---AIGTASCNKTQLR----------TDESLTLT-------VPVSNTGKRSGTEIVQVYI 786
Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
R + G P K L + RV +++
Sbjct: 787 RKTDDADG-PLKSLKAYARVELATGAQQDVKI 817
>gi|417519247|ref|ZP_12181440.1| Periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Uganda str. R8-3404]
gi|353647113|gb|EHC90324.1| Periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Uganda str. R8-3404]
Length = 755
Score = 247 bits (630), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 207/749 (27%), Positives = 344/749 (45%), Gaps = 139/749 (18%)
Query: 75 QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
++ I+Q+ D A+ RL IP + + + +HG T FP L A+S
Sbjct: 76 RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 119
Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
FN +G A EA GL WAP +++ RDPRWGR E GED + S
Sbjct: 120 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 173
Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
VK+ QG++ D + KH AY +
Sbjct: 174 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 207
Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
G +N + ++ Q + + PP+++ ++ G + +M + N +NG PA L +
Sbjct: 208 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 262
Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
R+EWGFKG SD A+ + ++ ED+ LKAG+D++ ++
Sbjct: 263 VLRDEWGFKGITVSDHGAIKELIKH-GTAADPEDAVRVALKAGVDMSMADEYYSKYLPGL 321
Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSE-------HKKL 423
I GKV ++D A ++ +V+ +GLFN DP Y LGP + ++ H+K
Sbjct: 322 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPADTNAESRLHRKE 376
Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
A + AR+ +VLLKN + LPL K+ ++A++GPL ++ + G ++ G+ ++L
Sbjct: 377 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 434
Query: 482 GLEAYV---SKTHYASGCHDV-------------------PCNSDAGFHEAVRIAKKADF 519
G++ V +K YA G + P + A EAV+ AK+AD
Sbjct: 435 GIQNAVGDGAKILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 494
Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
V+ V G E R ++ +P Q L+T++ T K P++LVL G PL + + D
Sbjct: 495 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 551
Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
Q +IL + G G A+A+++FGD+NP G+LP++ +P S ++P+ ++
Sbjct: 552 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 610
Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
+ +Y R + G +Y FG+GLSYT ++ S++T+S+ +Q+
Sbjct: 611 PEKPNKYTSRYFDEVNGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 655
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
G +VT+ + VTN G +G+ V+ ++ + S P KQL GF
Sbjct: 656 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 698
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
+++ SK +SF +D E L N+
Sbjct: 699 EKITLKPGESKTVSFPID-IEALKFWNQQ 726
>gi|198245676|ref|YP_002216250.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Dublin str. CT_02021853]
gi|375119733|ref|ZP_09764900.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Dublin str. SD3246]
gi|197940192|gb|ACH77525.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Dublin str. CT_02021853]
gi|326624000|gb|EGE30345.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Dublin str. SD3246]
Length = 755
Score = 247 bits (630), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 210/749 (28%), Positives = 346/749 (46%), Gaps = 139/749 (18%)
Query: 75 QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
++ I+Q+ D A+ RL IP + + + +HG T FP L A+S
Sbjct: 76 RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 119
Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
FN +G A EA GL WAP +++ RDPRWGR E GED + S
Sbjct: 120 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 173
Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
VK+ QG++ D + KH AY +
Sbjct: 174 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 207
Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
G +N + ++ Q + + PP+++ ++ G + +M + N +NG PA L +
Sbjct: 208 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 262
Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
R+EWGFKG SD A+ + ++ ED+ LKAG+D++ ++
Sbjct: 263 VLRDEWGFKGITVSDHGAIKELIKH-GTAADPEDAVRVALKAGVDMSMADEYYSKYLPGL 321
Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
I GKV ++D A ++ +V+ +GLFN DP Y LGP D V T+ H+K
Sbjct: 322 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 376
Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
A + AR+ +VLLKN + LPL K+ ++A++GPL ++ + G ++ G+ ++L
Sbjct: 377 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 434
Query: 482 GLEAYV---SKTHYASGCH-----DV--------------PCNSDAGFHEAVRIAKKADF 519
G++ V +K YA G + D+ P + A EAV+ AK+AD
Sbjct: 435 GIQNAVGDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADM 494
Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
V+ V G E R ++ +P Q L+T++ T K P++LVL G PL + + D
Sbjct: 495 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 551
Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
Q +IL + G G A+A+++FGD+NP G+LP++ +P S ++P+ ++
Sbjct: 552 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 610
Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
+ +Y R + G +Y FG+GLSYT ++ S++T+S+ +Q+
Sbjct: 611 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 655
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
G +VT+ + VTN G +G+ V+ ++ + S P KQL GF
Sbjct: 656 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 698
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
+++ K +SF +D E L N+
Sbjct: 699 EKITLKPGERKTVSFPID-IEALKFWNQQ 726
>gi|319901412|ref|YP_004161140.1| glycoside hydrolase 3 [Bacteroides helcogenes P 36-108]
gi|319416443|gb|ADV43554.1| glycoside hydrolase family 3 domain protein [Bacteroides helcogenes
P 36-108]
Length = 944
Score = 247 bits (630), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 209/754 (27%), Positives = 342/754 (45%), Gaps = 125/754 (16%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ S T+FP L ++NR L IG
Sbjct: 153 RLGIPV-DFTNEGIRGVES-----------YKATNFPTQLGLGHTWNRELIHKIGFITGR 200
Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
EAR + G T +AP +++ RD RWGR +E GE P +V+ +E V+ Q +
Sbjct: 201 EARML------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGMQYNH-- 252
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
++A KH AY K + I+
Sbjct: 253 -----------------------------QVAATGKHFAAYSNNKGAREGMSRVDPQISP 283
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGD---LFQKARNEWGFKGYITS 325
++ E+ PFR I + +M SYN +G+P ++G L + R E GF+GY+ S
Sbjct: 284 REVENIHIYPFRRVIREAGLLGVMSSYNDYDGIP--IQGSHYWLTTRLRGEIGFRGYVVS 341
Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDI 381
D DAV ++ K +++ ++AG++I C + + + +G + E+ I
Sbjct: 342 DSDAVEYLYTKHGTAKDMKEAIRQSVEAGLNIRCTFRSPDSFVLPLRELVKEGGLSEEII 401
Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPD-DVCTSEHKKLALDAARQGIVLLKNDKK 440
+ + ++ V+ GLF+ P + G D +V E+ +AL A+R+ IVLLKN+
Sbjct: 402 NDRVRDILRVKFLTGLFD-TPYQSDLA--GADREVEKEENGSIALQASRESIVLLKNENN 458
Query: 441 FLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGC- 496
LPL+ + V +A+ GP + + Y + ++L+G++ VS + Y GC
Sbjct: 459 MLPLDLSTVKRIAVCGPNADEKNYALTHYGPLAVEVITVLKGIQDKVSGKAEVLYTKGCD 518
Query: 497 -------------HDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLP 543
H + + A ++A A+++D +VV G E++ R SL LP
Sbjct: 519 LVDANWPESEIINHPLTADEQAEINKAAENARQSDVAVVVLGGGQRTCGENKSRSSLDLP 578
Query: 544 GQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEII 603
G+Q+ L+ ++ T K PVILVL G PL V++A D + +IL YPG G ALA+++
Sbjct: 579 GRQLQLLQAIQATGK-PVILVLINGRPLSVNWA--DKYVPAILEAWYPGAKGGIALADVL 635
Query: 604 FGDFNPGGRLPMTWYPESFTKVPMN---DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGL 660
FGD+NPGG+L +T +P++ ++P N + D + +Y FG+GL
Sbjct: 636 FGDYNPGGKLTVT-FPKTVGQIPFNFPYKPASQIDGGKNPGPEGNMSRINGALYPFGYGL 694
Query: 661 SYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISV 720
SYT + Y S+L I+ + + +E T V++ V
Sbjct: 695 SYTTFEY------SDLEITPKVITPN-----------------EEAT--------VRLKV 723
Query: 721 TNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIAN 780
TN G G VV L+ R S T EK L GF+RVH +KE+ F + + L + +
Sbjct: 724 TNTGKRAGDEVVQLYIRDVVSSVITYEKNLAGFERVHLEPGETKEVVFTLG-RKHLELLD 782
Query: 781 KHGRRILPLGNHVLMVG------ELRHSLTIETY 808
+ + ++ G+ +M L LT+E Y
Sbjct: 783 ANMQWVVEPGDFTIMAAASSEDIRLNGLLTVEDY 816
>gi|423223593|ref|ZP_17210062.1| hypothetical protein HMPREF1062_02248 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392638218|gb|EIY32065.1| hypothetical protein HMPREF1062_02248 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 863
Score = 247 bits (630), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 161/460 (35%), Positives = 235/460 (51%), Gaps = 61/460 (13%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ + SLS RA+ L+ LTL+EK + D + ++ RLGI Y WW+E+LHG+A G
Sbjct: 23 YKDASLSPERRAELLVKELTLEEKAHLMMDGSRSVERLGIKPYNWWNEALHGVARAG--- 79
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARA--MYNLGQ------AGLTFW 165
T FPQ + AASFN + + +AV+ EARA Y Q GLT W
Sbjct: 80 -------LATVFPQPIGMAASFNPEMVYEVFNAVSDEARAKNTYYASQDSRERYQGLTMW 132
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
P +NI+RDPRWGRG ET GEDP + S V VK QG DG
Sbjct: 133 TPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQG----PADG------------- 175
Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIE 284
+ D+ L AC KH + +W +R+SFNA I +D +T+ PPF + ++
Sbjct: 176 ----KYDK-----LHACAKHFAVHSGPEW---NRHSFNAENIKPRDLYETYLPPFEALVK 223
Query: 285 QGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKTH 343
+GK +MC+YN+ G P C L Q R EWGF G + SDC A+A + + + TH
Sbjct: 224 EGKVEEVMCAYNRFEGDPCCGSDRLLMQILRGEWGFDGIVVSDCGAIADFYNDRGH-HTH 282
Query: 344 ED----SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
D SAA V+ +G D+ CG+ +S + KG + E+ +D ++ L + LG +
Sbjct: 283 PDAESASAAAVI-SGTDLECGSSYKALIES-VKKGLISEETVDTSVKRLMKARFALGEMD 340
Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
+P K + K+ V ++ H LAL+ AR+ + LL N FLPL + + ++A++GP
Sbjct: 341 -EPEKVSWTKIPFSVVASAAHDSLALNMARESMTLLMNKDNFLPLKRGGL-TVAVMGPNA 398
Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGC 496
N+ G Y G+P ++L G+ + K Y GC
Sbjct: 399 NDSVMQWGNYNGMPAHTVTILDGVRNLLGTDDKLIYEQGC 438
Score = 116 bits (290), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 99/319 (31%), Positives = 151/319 (47%), Gaps = 65/319 (20%)
Query: 498 DVPCNSDAGFHEAVRI------AKKADFVIVVAGLDLTQETEDR----------DRVSLL 541
D N D GF + V I K AD VI +G+ + E E+ DR +
Sbjct: 575 DAQLNFDLGFKKDVDIRKSVERVKDADIVIFASGISPSLEGEEMGVNLPGFKKGDRTDIE 634
Query: 542 LPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQ-ISSILWIGYPGEAGAKALA 600
LP Q L+ ++ R K+ +ILV G P+ + E ++Q +IL YPG+ G KA+A
Sbjct: 635 LPAVQRELIDALHRAGKK-IILVNCSGSPIGL---EPETQKCEAILQAWYPGQQGGKAVA 690
Query: 601 EIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHG 659
E++FGD+NP G+LP+T+Y + +++P D NM GR+YR+ ++ FG+G
Sbjct: 691 EVLFGDYNPAGKLPVTFY-RNVSQLPDFEDYNMT--------GRTYRYMQDVPLFPFGYG 741
Query: 660 LSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQIS 719
LSYT + Y K DKN E+T+ SL+ + +
Sbjct: 742 LSYTTFGYG--------------KTVLDKN---------------ELTAGQSLK--LTVP 770
Query: 720 VTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCE-QLSI 778
VTN G +G VV ++ R ++G P K L F RV A + + F + E +
Sbjct: 771 VTNTGKRNGEEVVQVYLRKQGDAEG-PIKTLRAFKRVSIPAGKTVNVEFDLKDKELEWWD 829
Query: 779 ANKHGRRILPLGNHVLMVG 797
+ R+ P GN+ +MVG
Sbjct: 830 DQSNTVRVCP-GNYDIMVG 847
>gi|194736410|ref|YP_002115255.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Schwarzengrund str. CVM19633]
gi|197300833|ref|ZP_02661390.2| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
subsp. enterica serovar Schwarzengrund str. SL480]
gi|204929066|ref|ZP_03220209.1| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
subsp. enterica serovar Javiana str. GA_MM04042433]
gi|205360240|ref|ZP_02681470.2| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
subsp. enterica serovar Hadar str. RI_05P066]
gi|416491454|ref|ZP_11727088.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Montevideo str. OH_2009072675]
gi|416496191|ref|ZP_11729048.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Montevideo str. CASC_09SCPH15965]
gi|416586610|ref|ZP_11775622.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Montevideo str. MD_MDA09249507]
gi|416674556|ref|ZP_11821234.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Montevideo str. MB102109-0047]
gi|417359373|ref|ZP_12133763.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar Give
str. S5-487]
gi|417391848|ref|ZP_12154881.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Minnesota str. A4-603]
gi|417476332|ref|ZP_12170882.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Rubislaw str. A4-653]
gi|417511967|ref|ZP_12176427.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Senftenberg str. A4-543]
gi|194711912|gb|ACF91133.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Schwarzengrund str. CVM19633]
gi|197290594|gb|EDY29949.1| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
subsp. enterica serovar Schwarzengrund str. SL480]
gi|204321610|gb|EDZ06809.1| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
subsp. enterica serovar Javiana str. GA_MM04042433]
gi|205350836|gb|EDZ37467.1| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
subsp. enterica serovar Hadar str. RI_05P066]
gi|322649375|gb|EFY45811.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Montevideo str. OH_2009072675]
gi|322656495|gb|EFY52784.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Montevideo str. CASC_09SCPH15965]
gi|322667165|gb|EFY63332.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Montevideo str. MD_MDA09249507]
gi|323222876|gb|EGA07232.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Montevideo str. MB102109-0047]
gi|353589745|gb|EHC48461.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar Give
str. S5-487]
gi|353614040|gb|EHC65987.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Minnesota str. A4-603]
gi|353641236|gb|EHC86018.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Rubislaw str. A4-653]
gi|353641325|gb|EHC86084.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Senftenberg str. A4-543]
Length = 755
Score = 246 bits (629), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 210/749 (28%), Positives = 346/749 (46%), Gaps = 139/749 (18%)
Query: 75 QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
++ I+Q+ D A+ RL IP + + + +HG T FP L A+S
Sbjct: 76 RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 119
Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
FN +G A EA GL WAP +++ RDPRWGR E GED + S
Sbjct: 120 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 173
Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
VK+ QG++ D + KH AY +
Sbjct: 174 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 207
Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
G +N + ++ Q + + PP+++ ++ G + +M + N +NG PA L +
Sbjct: 208 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 262
Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
R+EWGFKG SD A+ + ++ ED+ LKAG+D++ ++
Sbjct: 263 VLRDEWGFKGITVSDHGAIKELIKH-GTAADPEDAVRVALKAGVDMSMADEYYSKYLPGL 321
Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
I GKV ++D A ++ +V+ +GLFN DP Y LGP D V T+ H+K
Sbjct: 322 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 376
Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
A + AR+ +VLLKN + LPL K+ ++A++GPL ++ + G ++ G+ ++L
Sbjct: 377 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 434
Query: 482 GLEAYV---SKTHYASGCH-----DV--------------PCNSDAGFHEAVRIAKKADF 519
G++ V +K YA G + D+ P + A EAV+ AK+AD
Sbjct: 435 GIQNAVGDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 494
Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
V+ V G E R ++ +P Q L+T++ T K P++LVL G PL + + D
Sbjct: 495 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 551
Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
Q +IL + G G A+A+++FGD+NP G+LP++ +P S ++P+ ++
Sbjct: 552 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 610
Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
+ +Y R + G +Y FG+GLSYT ++ S++T+S+ +Q+
Sbjct: 611 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 655
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
G +VT+ + VTN G +G+ V+ ++ + S P KQL GF
Sbjct: 656 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 698
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
+++ K +SF +D E L N+
Sbjct: 699 EKITLKPGERKTVSFPID-IEALKFWNQQ 726
>gi|62180752|ref|YP_217169.1| beta-D-glucoside glucohydrolase, periplasmic [Salmonella enterica
subsp. enterica serovar Choleraesuis str. SC-B67]
gi|375115081|ref|ZP_09760251.1| beta-D-glucoside glucohydrolase, periplasmic [Salmonella enterica
subsp. enterica serovar Choleraesuis str. SCSA50]
gi|62128385|gb|AAX66088.1| beta-D-glucoside glucohydrolase, periplasmic [Salmonella enterica
subsp. enterica serovar Choleraesuis str. SC-B67]
gi|322715227|gb|EFZ06798.1| beta-D-glucoside glucohydrolase, periplasmic [Salmonella enterica
subsp. enterica serovar Choleraesuis str. SCSA50]
Length = 765
Score = 246 bits (629), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 210/749 (28%), Positives = 348/749 (46%), Gaps = 139/749 (18%)
Query: 75 QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
++ I+Q+ D A+ RL IP + + + +HG T FP L A+S
Sbjct: 86 RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 129
Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
FN+ +G A EA GL WAP +++ RDPRWGR E GED + S
Sbjct: 130 FNQDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 183
Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
VK+ QG++ D + KH AY +
Sbjct: 184 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 217
Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
G +N + ++ Q + + PP+++ ++ G + +M + N +NG PA L +
Sbjct: 218 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 272
Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
R+EWGFKG SD A+ + ++ ED+ LKAG+D++ ++
Sbjct: 273 VLRDEWGFKGITVSDHGAIKELIKHGTAADP-EDAVRVALKAGVDMSMADEYYSKYLPGL 331
Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
I GKV ++D A ++ +V+ +GLFN DP Y LGP D V T+ H+K
Sbjct: 332 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 386
Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
A + AR+ +VLLKN + LPL K+ ++A++GPL ++ + G ++ G+ ++L
Sbjct: 387 AREVARESVVLLKNRLETLPLKKSG--AIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 444
Query: 482 GLEAYV---SKTHYASGCH-----DV--------------PCNSDAGFHEAVRIAKKADF 519
G++ V +K +A G + D+ P + A EAV+ AK+AD
Sbjct: 445 GIQNAVGDGAKILHAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 504
Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
V+ V G E R ++ +P Q L+T++ T K P++LVL G PL + + D
Sbjct: 505 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 561
Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
Q +IL + G G A+A+++FGD+NP G+LP++ +P S ++P+ ++
Sbjct: 562 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 620
Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
+ +Y R + G +Y FG+GLSYT ++ S++T+S+ +Q+
Sbjct: 621 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 665
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
G +VT+ + VTN G +G+ V+ ++ + S P KQL GF
Sbjct: 666 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 708
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
+++ SK +SF +D E L N+
Sbjct: 709 EKITLKPGESKTVSFPID-IEALKFWNQQ 736
>gi|423271149|ref|ZP_17250120.1| hypothetical protein HMPREF1079_03202 [Bacteroides fragilis
CL05T00C42]
gi|423274973|ref|ZP_17253919.1| hypothetical protein HMPREF1080_02572 [Bacteroides fragilis
CL05T12C13]
gi|392699073|gb|EIY92255.1| hypothetical protein HMPREF1079_03202 [Bacteroides fragilis
CL05T00C42]
gi|392704252|gb|EIY97391.1| hypothetical protein HMPREF1080_02572 [Bacteroides fragilis
CL05T12C13]
Length = 859
Score = 246 bits (629), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 215/759 (28%), Positives = 338/759 (44%), Gaps = 123/759 (16%)
Query: 72 LTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVS 131
LTL ++Q+ + RLGIP + +ESLHG +G T FPQ +
Sbjct: 95 LTLMNEVQKYMREKT---RLGIPVFTL-TESLHGSVHDGS-----------TIFPQAIAL 139
Query: 132 AASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVV 191
++FN L + SA+A E A Q P I++ RD RWGR +E GEDP +V
Sbjct: 140 GSTFNPILAYEMTSAIAKELSA-----QGITQSLTPVIDVCRDLRWGRVEECFGEDPFLV 194
Query: 192 SAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDL 251
S V V+ G+ + +V S KH A+
Sbjct: 195 SRMGVSQVR----------------GYLDNQV----------------SPMIKHFGAHGA 222
Query: 252 EKWG-NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF 310
+ G N + S +++ + F + +++ K +M SYN N P L
Sbjct: 223 PQGGLNLASVSCG----QRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNEPNSSSHYLM 278
Query: 311 QKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQS 369
+ R+ W F+GY+ SD A+ + + + ++A L AG+D Q
Sbjct: 279 TELLRDRWDFQGYVYSDWGAIGMLNYFHKTAQNSAEAAIQALTAGLDAEASDNSYAELQQ 338
Query: 370 AIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD-PRKGKYGKLGPDDVCTSEHKKLALDAA 428
++ G + K ID+A+ + + + +GLF P + Y K+ V H LA A
Sbjct: 339 LVENGMLDVKYIDQAVARILTAKFNMGLFEYPLPMEKNYDKV----VHAPAHVSLARKIA 394
Query: 429 RQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPK--SLLRGLEAY 486
+ IVLL+N+ LPL N + S+A+IGP + + G YT + +LL L+
Sbjct: 395 EESIVLLQNENNILPLQMNKLKSIAVIGPNADQVQ--FGDYTWSRDNKDGVTLLEALKER 452
Query: 487 VSKT---HYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG-------LDLTQET--ED 534
V +YA GC D+ + +GF EAV +AKK+D IVV G D + T E
Sbjct: 453 VGNQLTLNYAKGC-DLVTDDRSGFKEAVDVAKKSDVCIVVVGSASASLARDYSNATCGEG 511
Query: 535 RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEA 594
D L L G Q LV ++ T K PVI+VL G P +S+ + + I I+ YPGE
Sbjct: 512 FDLSDLTLTGVQEDLVEAIHATGK-PVIVVLLSGKPFAMSWIKEN--IPGIVVQWYPGEQ 568
Query: 595 GAKALAEIIFGDFNPGGRLPMTWYPES-------FTKVPMNDMNMRADSSRQYPGRSYRF 647
G ALA+++ G NP G+L + +P+S + +P + R+ S+ PG+ Y F
Sbjct: 569 GGLALADMLLGKVNPSGKLNYS-FPQSVGHLPCYYNYLPTDKGFYRSPGSKNKPGKDYVF 627
Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVT 707
+ ++ FGHGLSYT++ Y L T S+ DY
Sbjct: 628 SSPKALWAFGHGLSYTDFEY-----------------------LSATTSKEDY------- 657
Query: 708 SCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEIS 767
+C + V I++ N GD DG V ++ R S P ++L GF++V +K++
Sbjct: 658 ACEDV-IEVTIAIRNTGDYDGLEVPQVYVRDMVSSVVMPVQELKGFEKVLIKKGETKQVI 716
Query: 768 FGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
+ P +L++ NK ++++ G L +G + I+
Sbjct: 717 IKI-PVSELALYNKEMKKVVEPGAFELQIGRASDDIRIK 754
>gi|336411808|ref|ZP_08592268.1| hypothetical protein HMPREF1018_04286 [Bacteroides sp. 2_1_56FAA]
gi|335940152|gb|EGN02020.1| hypothetical protein HMPREF1018_04286 [Bacteroides sp. 2_1_56FAA]
Length = 859
Score = 246 bits (629), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 216/760 (28%), Positives = 337/760 (44%), Gaps = 125/760 (16%)
Query: 72 LTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVS 131
LTL ++Q+ + RLGIP + +ESLHG +G T FPQ +
Sbjct: 95 LTLMNEVQKYMREKT---RLGIPVFTL-TESLHGSVHDGS-----------TIFPQAIAL 139
Query: 132 AASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVV 191
++FN L + SA+A E A Q P I++ RD RWGR +E GEDP +V
Sbjct: 140 GSTFNPILAYEMTSAIAKELSA-----QGITQSLTPVIDVCRDLRWGRVEECFGEDPFLV 194
Query: 192 SAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDL 251
S V V+ G+ + +V S KH A+
Sbjct: 195 SRMGVSQVR----------------GYLDNQV----------------SPMIKHFGAHGA 222
Query: 252 EKWG-NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF 310
+ G N + S +++ + F + +++ K +M SYN N P L
Sbjct: 223 PQGGLNLASVSCG----QRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNEPNSSSHYLM 278
Query: 311 QKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQS 369
+ R+ W F+GY+ SD A+ + + + ++A L AG+D Q
Sbjct: 279 TELLRDRWDFQGYVYSDWGAIGMLNYFHKTAQNSAEAAIQALTAGLDAEASDNSYAELQQ 338
Query: 370 AIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD-PRKGKYGKLGPDDVCTSEHKKLALDAA 428
++ G + K ID+A+ + + + +GLF P + Y K+ V H LA A
Sbjct: 339 LVENGMLDVKYIDQAVARILTAKFNMGLFEYPLPMEKNYDKV----VHAPAHVSLARKIA 394
Query: 429 RQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVS 488
+ IVLL+N+ LPL N + S+A+IGP N Q+ G K + LEA
Sbjct: 395 EESIVLLQNENNILPLQMNKLKSIAVIGP---NADQVQFGDYTWSRDNKDGVTLLEALKE 451
Query: 489 KT------HYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG-------LDLTQET--E 533
+ +YA GC D+ + +GF EAV +AKK+D IVV G D + T E
Sbjct: 452 RAGNQLTLNYAKGC-DLVTDDRSGFKEAVDVAKKSDVCIVVVGSASASLARDYSNATCGE 510
Query: 534 DRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGE 593
D L L G Q LV ++ T K PVI+VL G PL +S+ + + I I+ YPGE
Sbjct: 511 GFDLSDLTLTGVQEDLVEAIHATGK-PVIVVLLSGKPLAMSWIKEN--IPGIVVQWYPGE 567
Query: 594 AGAKALAEIIFGDFNPGGRLPMTWYPES-------FTKVPMNDMNMRADSSRQYPGRSYR 646
G ALA+++ G NP G+L + +P+S + +P + R+ S+ PG+ Y
Sbjct: 568 QGGLALADMLLGKVNPSGKLNYS-FPQSVGHLPCYYNYLPTDKGFYRSPGSKNKPGKDYV 626
Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEV 706
F + ++ FGHGLSYT++ Y L T S+ DY
Sbjct: 627 FSSPKALWAFGHGLSYTDFEY-----------------------LSATTSKEDY------ 657
Query: 707 TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEI 766
+C + V I++ N GD DG V ++ R S P ++L GF++V +K++
Sbjct: 658 -ACEDV-IEVTIAIRNTGDYDGLEVPQVYVRDMVSSVVMPVQELKGFEKVLIKKGETKQV 715
Query: 767 SFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
+ P +L++ NK ++++ G L +G + I+
Sbjct: 716 IIKI-PVSELALYNKEMKKVVEPGAFELQIGRASDDIRIK 754
>gi|445146604|ref|ZP_21387818.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Dublin str. SL1438]
gi|445154484|ref|ZP_21391811.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Dublin str. HWS51]
gi|444845501|gb|ELX70711.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Dublin str. SL1438]
gi|444850729|gb|ELX75826.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Dublin str. HWS51]
Length = 765
Score = 246 bits (629), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 210/749 (28%), Positives = 346/749 (46%), Gaps = 139/749 (18%)
Query: 75 QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
++ I+Q+ D A+ RL IP + + + +HG T FP L A+S
Sbjct: 86 RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 129
Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
FN +G A EA GL WAP +++ RDPRWGR E GED + S
Sbjct: 130 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 183
Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
VK+ QG++ D + KH AY +
Sbjct: 184 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 217
Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
G +N + ++ Q + + PP+++ ++ G + +M + N +NG PA L +
Sbjct: 218 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 272
Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
R+EWGFKG SD A+ + ++ ED+ LKAG+D++ ++
Sbjct: 273 VLRDEWGFKGITVSDHGAIKELIKH-GTAADPEDAVRVALKAGVDMSMADEYYSKYLPGL 331
Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
I GKV ++D A ++ +V+ +GLFN DP Y LGP D V T+ H+K
Sbjct: 332 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 386
Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
A + AR+ +VLLKN + LPL K+ ++A++GPL ++ + G ++ G+ ++L
Sbjct: 387 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 444
Query: 482 GLEAYV---SKTHYASGCH-----DV--------------PCNSDAGFHEAVRIAKKADF 519
G++ V +K YA G + D+ P + A EAV+ AK+AD
Sbjct: 445 GIQNAVGDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADM 504
Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
V+ V G E R ++ +P Q L+T++ T K P++LVL G PL + + D
Sbjct: 505 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 561
Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
Q +IL + G G A+A+++FGD+NP G+LP++ +P S ++P+ ++
Sbjct: 562 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 620
Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
+ +Y R + G +Y FG+GLSYT ++ S++T+S+ +Q+
Sbjct: 621 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 665
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
G +VT+ + VTN G +G+ V+ ++ + S P KQL GF
Sbjct: 666 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 708
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
+++ K +SF +D E L N+
Sbjct: 709 EKITLKPGERKTVSFPID-IEALKFWNQQ 736
>gi|423227452|ref|ZP_17213913.1| hypothetical protein HMPREF1062_06099 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392623082|gb|EIY17188.1| hypothetical protein HMPREF1062_06099 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 786
Score = 246 bits (629), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 219/742 (29%), Positives = 349/742 (47%), Gaps = 123/742 (16%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ + T FP A++N+ L S I A
Sbjct: 137 RLGIPV-DFTNEGIRGLCHD-----------RATMFPAQCGQGATWNKELISEIAQVTAE 184
Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
EA+A+ G T ++P ++I +DPRWGR E GEDP +V
Sbjct: 185 EAKAL------GYTNIYSPILDIAQDPRWGRVVECYGEDPFLV----------------- 221
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
G +G KR++KG +E L A KH Y + G + + +
Sbjct: 222 ---GELG-----KRMIKGLQQEG-------LVATPKHFAVYSIPVGGRDAGTRTDPHVAP 266
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGD---LFQKARNEWGFKGYITS 325
++ + PFR + A +M SYN +G P + G L + R+EWGFKGY+ S
Sbjct: 267 REMRTLYIEPFRKAFCEAGALGVMSSYNDYDGEP--ITGSYHFLTEILRHEWGFKGYVVS 324
Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCM----LRHTQSAIDKGKVQEKDI 381
D +AV ++ D AA V+ AG+++ + +R + AI +GKV + I
Sbjct: 325 DSEAVEFLYSKHQVAADAVDGAAQVVNAGLNVRTNFTLPENFIRPLRQAISEGKVSMQTI 384
Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF 441
D + ++ V+ +GLF+ +P KG K V + EH+ +++ AA + IVLLKN+
Sbjct: 385 DSRVADVLRVKFGMGLFD-NPYKGD-AKHPEKVVHSKEHQAVSMRAALESIVLLKNENNI 442
Query: 442 LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGC--- 496
LPL+K+ + +A+IGP N + + Y K++ +G++ Y+ ++ YA G
Sbjct: 443 LPLSKD-LKKVAVIGPNANEVQNLICRYGPANAPIKTVYQGIKEYLPDAEVRYAKGTDII 501
Query: 497 ---------HDVPCNSD--AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQ 545
++VP + + A EAV +AK++D I+V G + E+ R +L L G+
Sbjct: 502 DKYFPESELYEVPLDQEEQAMMDEAVALAKESDVAIMVLGGNEKTVREEYSRTNLDLCGR 561
Query: 546 QMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFG 605
Q L+ +V T K PVIL+L G +++AE I I+ +PGE A+A+++FG
Sbjct: 562 QEKLLQAVYATGK-PVILLLVDGRAATINWAE--RYIPGIVHAWFPGEFMGDAVAQVLFG 618
Query: 606 DFNPGGRLPMTWYPESFTKVPMN-DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTN 664
D+NPGG+L +T +P S ++P +DS + + TGT +Y FG+GLSYT
Sbjct: 619 DYNPGGKLAVT-FPRSVGQIPFAFPFKPGSDS------KGFVRVTGT-LYPFGYGLSYTT 670
Query: 665 YSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAG 724
++Y S+L I + I Q +L SC V N G
Sbjct: 671 FAY------SDLKIENPV-------IGVQGSVKL---------SC---------KVKNTG 699
Query: 725 DVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGR 784
V G VV L+ S T K L GF+R+H + K I F + P ++L + NK
Sbjct: 700 KVAGDEVVQLYLHDEMSSVTTYVKVLRGFERIHLESGEEKVIDFVLTP-QELGLWNKDNH 758
Query: 785 RILPLGNHVLMVGELRHSLTIE 806
++ G +MVG + ++
Sbjct: 759 FVVEPGTFAVMVGSSSQDIKLQ 780
>gi|205357092|ref|ZP_02344874.2| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Saintpaul str. SARA29]
gi|213650030|ref|ZP_03380083.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Typhi str. J185]
gi|289824263|ref|ZP_06543858.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Typhi str. E98-3139]
gi|378958841|ref|YP_005216327.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar Typhi
str. P-stx-12]
gi|205324021|gb|EDZ11860.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Saintpaul str. SARA29]
gi|374352713|gb|AEZ44474.1| Periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Typhi str. P-stx-12]
Length = 755
Score = 246 bits (628), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 210/748 (28%), Positives = 348/748 (46%), Gaps = 137/748 (18%)
Query: 75 QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
++ I+Q+ D A+ RL IP + + + +HG T FP L A+S
Sbjct: 76 RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 119
Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
FN +G A EA GL WAP +++ RDPRWGR E GED + S
Sbjct: 120 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 173
Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
VK+ QG++ D + KH AY +
Sbjct: 174 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 207
Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
G +N + ++ Q + + PP+++ ++ G + +M + N +NG PA L +
Sbjct: 208 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 262
Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
R+EWGFKG SD A+ + ++ ED+ LKAG+D++ ++
Sbjct: 263 VLRDEWGFKGITVSDHGAIKELIKH-GTAADPEDAVRVALKAGVDMSMADEYYSKYLPGL 321
Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
I GKV ++D A ++ +V+ +GLFN DP Y LGP D V T+ H+K
Sbjct: 322 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 376
Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
A + AR+ +VLLKN + LPL K+ ++A++GPL ++ + G ++ G+ ++L
Sbjct: 377 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 434
Query: 482 GLEAYV---SKTHYASGCH-----DV--------------PCNSDAGFHEAVRIAKKADF 519
G++ V +K YA G + D+ P + A EAV+ AK+AD
Sbjct: 435 GIQNAVGDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 494
Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
V+ V G E R ++ +P Q L+T++ T K P++LVL G PL + + D
Sbjct: 495 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 551
Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM--NDMNMRADSS 637
Q +IL + G G A+A+++FGD+NP G+LP++ +P S ++P+ + +N +
Sbjct: 552 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 610
Query: 638 RQYPGR-SYRFYTGTQ--VYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
+ P + + R++ +Y FG+GLSYT ++ S++T+S+ +Q+
Sbjct: 611 PEKPNKYTSRYFDEANGPLYPFGYGLSYTTFTV------SDVTLSSP--------TMQRD 656
Query: 695 GSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFD 754
G +VT+ + VTN G +G V+ ++ + S P KQL GF+
Sbjct: 657 G---------KVTAS--------VEVTNTGKREGETVIQMYLQDVTASMSRPVKQLKGFE 699
Query: 755 RVHTVAKGSKEISFGVDPCEQLSIANKH 782
++ K +SF +D E L N+
Sbjct: 700 KITLKPGERKTVSFPID-IEALKFWNQQ 726
>gi|423140736|ref|ZP_17128374.1| glycosyl hydrolase family 3 protein [Salmonella enterica subsp.
houtenae str. ATCC BAA-1581]
gi|379053290|gb|EHY71181.1| glycosyl hydrolase family 3 protein [Salmonella enterica subsp.
houtenae str. ATCC BAA-1581]
Length = 765
Score = 246 bits (628), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 208/738 (28%), Positives = 341/738 (46%), Gaps = 138/738 (18%)
Query: 75 QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
++ I+Q+ D A+ RL IP + + + +HG T FP L A+S
Sbjct: 86 RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 129
Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
FN +G A EA GL WAP +++ RDPRWGR E GED + S
Sbjct: 130 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTWLTSI 183
Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
VK+ QG++ D + KH AY +
Sbjct: 184 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 217
Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
G +N + ++ Q + + PP+++ ++ G + +M + N +NG PA L +
Sbjct: 218 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 272
Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
R+EWGFKG SD A+ + ++ ED+ LK+G+D++ ++
Sbjct: 273 VLRDEWGFKGITVSDHGAIKELIKHGTAADP-EDAVRVALKSGVDMSMADEYYSKYLPGL 331
Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
I GKV ++D A ++ +V+ +GLFN DP Y LGP D V T+ H+K
Sbjct: 332 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 386
Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
A + AR+ +VLLKN + LPL K+ ++A++GPL ++ + G ++ G+ ++L
Sbjct: 387 AREVARESLVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVADQSVTVLA 444
Query: 482 GLEAYV---SKTHYASGCHDV-------------------PCNSDAGFHEAVRIAKKADF 519
G++ V +K YA G + P + A EAV+ AK+AD
Sbjct: 445 GIQNAVGDGAKILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 504
Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
V+ V G E R + LP Q L+T++ T K P++LVL G PL + + D
Sbjct: 505 VVAVVGESQGMAHEASSRTDITLPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 561
Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMR------ 633
Q +IL + G G A+A+++FGD+NP G+LPM+ +P S ++P+ ++
Sbjct: 562 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPMS-FPRSVGQIPVYYSHLNTGRPYV 620
Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
++ +Y R + G +Y FG+GLSYT ++ S++T+SA +++
Sbjct: 621 SEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSAP--------TMKR 665
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
G +VT+ + VTN G +G+ V+ ++ + S P KQL GF
Sbjct: 666 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 708
Query: 754 DRVHTVAKGSKEISFGVD 771
+++ SK +SF +D
Sbjct: 709 EKIALKPGESKTVSFPID 726
>gi|404404031|ref|ZP_10995615.1| glycoside hydrolase family protein [Alistipes sp. JC136]
Length = 740
Score = 246 bits (628), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 201/666 (30%), Positives = 316/666 (47%), Gaps = 93/666 (13%)
Query: 123 TSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQ 181
T P L + S++ AVEA A AGL + +AP ++I RDPRWGR
Sbjct: 118 TISPVPLAESCSWDMETIEASARMAAVEASA------AGLQWTFAPMVDIARDPRWGRVM 171
Query: 182 ETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSA 241
E GEDP + S A V+ FQG++ + + + A
Sbjct: 172 EGAGEDPYLGSHIARARVRGFQGDDLSAPN--------------------------TILA 205
Query: 242 CCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNG 300
C KH Y + G +N + I++Q + + PPF++ + G A+ M S+N+++G
Sbjct: 206 CAKHFAGYGASEGGR----DYNTVDISDQRLRELYLPPFKAAADAGAAT-FMNSFNELSG 260
Query: 301 VPACLRGDLF---QKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDI 357
VPA G+ F Q RNEWG+ G I SD +VA + + + + +A +K DI
Sbjct: 261 VPAT--GNRFLVKQILRNEWGWDGVIVSDWGSVAEMIPH-GIAEDKKQAALLAVKNECDI 317
Query: 358 NC-GTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVC 416
+ G C + + +GKV EK+IDR++ + ++ LGLF+ DP + + +
Sbjct: 318 DMEGNCYPSSLEELVKEGKVSEKEIDRSVRRILRLKYELGLFD-DPYRYCDEQREKEVTL 376
Query: 417 TSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY--TGIPC 474
++ H++ A D AR+ IVLL+N K LPL K S+A++GPL ++ M G + G P
Sbjct: 377 SAAHREAARDMARKSIVLLENRKSVLPLGKP--RSIAVVGPLADSPVDMLGEWRAKGDPK 434
Query: 475 SPKSLLRGLEAYV---SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQE 531
++LRG+E ++ +A GC DV + +GF EAVR A+ AD VI G
Sbjct: 435 EVVTILRGIEKTAGAGTRVTHAKGC-DVTGSDRSGFAEAVRAARSADVVIACLGESADMS 493
Query: 532 TEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYP 591
E R L LPG Q L+ + +T K P++L+L+ G PL +++ + + I +I+ +
Sbjct: 494 GEGYCRSELGLPGVQQELLKELKKTGK-PIVLLLSNGRPLTLAWEKEN--IETIVETWFL 550
Query: 592 GEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGT 651
G A+A+++FG +NP G+L M+ +P + ++P+ + + R Y
Sbjct: 551 GTEAGNAVADVLFGKYNPSGKLVMS-FPYNVGQIPVYYNHKHTGRPFEPNQRYVMHYIDA 609
Query: 652 QV---YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTS 708
V Y FG+GLSYT + Y E T+S+ A D
Sbjct: 610 PVDALYPFGYGLSYTRFEY------GEPTLSSDRMAAGDT-------------------- 643
Query: 709 CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
+ VTNAGD DG VV L+ R K P K+L GF ++ S +++F
Sbjct: 644 -----ITATVKVTNAGDYDGEEVVQLYIRDLKAQITRPVKELKGFRKIFLKKGESADVTF 698
Query: 769 GVDPCE 774
+ E
Sbjct: 699 DITRAE 704
>gi|346226406|ref|ZP_08847548.1| beta-glucosidase [Anaerophaga thermohalophila DSM 12881]
Length = 775
Score = 246 bits (628), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 218/774 (28%), Positives = 347/774 (44%), Gaps = 126/774 (16%)
Query: 61 ISTRAKSLISLLTLQEKIQQLSDNASAIPRLG----IPAYEWWSESLHGIASNGPGVNF- 115
I + S+++L+TL+EKI Q++ + G A E+ + + G N G +
Sbjct: 41 IERKVDSVLNLMTLEEKIGQMNQYSGNFAVTGEVAETKAGEYLKKGMIGSTFNTFGADHI 100
Query: 116 ------NGTVSSV---------------TSFPQVLVSAASFNRSLWSNIGSAVAVEARAM 154
N S + T+FP L A S++ L A EA A
Sbjct: 101 RMLQEQNLKYSRLKIPMLFAADVIHGLETTFPIPLAEACSWDLELMEQSARIAAEEATA- 159
Query: 155 YNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGG 213
+G+ + +AP I+I RDPRWGR E GEDP + S A V+ FQG D
Sbjct: 160 -----SGIAWNFAPMIDIARDPRWGRVMEGAGEDPYLGSLVARARVRGFQGIETYKDFSK 214
Query: 214 IGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTED 273
I + A KH + Y + G R + ++ + +
Sbjct: 215 IN----------------------TMMATSKHFVGYGAVQAG---RDYHSVDMSVRTLHE 249
Query: 274 TFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVAT 332
T+ PPF++ +++G + M ++N +NGVP LF++ R+ WGF G + +D A+
Sbjct: 250 TYLPPFKAAVDEG-VTAFMTAFNDLNGVPCTGNKYLFKEILRDRWGFGGMVVTDYTAIQE 308
Query: 333 IFEYQNYTKTHEDSAAGVLKAGMDIN-CGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSV 391
+ + + + + + + AG+D++ + + + +++GKV EK ID A+ + +
Sbjct: 309 MVAH-GFARDLKHATELAIDAGIDMDMISEGFVTYLKELVEEGKVSEKQIDVAVSRILEM 367
Query: 392 QLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSS 451
+ LGLF+ DP K + + V EH K A + A++ IVLL+N LPL KN
Sbjct: 368 KFLLGLFD-DPFKYCNAERQKEVVMNPEHLKAAREVAQRSIVLLENKNNVLPLKKNEPKR 426
Query: 452 LAIIGPLVNNISQMGGGYT--GIPCSPKSLLRGLEAYVS----KTHYASGCH-------- 497
+A+IGP V + G + G P +L+ GLE K YA G
Sbjct: 427 VALIGPFVKERESLTGEWAIKGDPDKSVTLMEGLEEKYKDSQVKFSYAKGTSLPVIDRTT 486
Query: 498 ------DVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVT 551
VP S GF EA+ +A+ +D ++V G E R + LPG Q L+
Sbjct: 487 QKVSTTRVPDRS--GFSEAINLARTSDVILVAMGEKFHWSGEAASRTDITLPGNQRELLK 544
Query: 552 SVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGG 611
+ +T K P+ILVL G PLD+S+ EA++ + +I+ YPG A+A+++ GD+NP
Sbjct: 545 ELKKTGK-PIILVLFNGRPLDLSW-EAEN-VDAIVEAWYPGIMAGHAVADVLSGDYNPSA 601
Query: 612 RLPMTWYPESFTKVPM--NDMNMRADSSRQYPG---RSYRFYTGTQVYGFGHGLSYTNYS 666
+L MT +P + ++P+ N N P SY + +Y FG+GLSYT++
Sbjct: 602 KLVMT-FPRNVGQIPIFYNVKNTGRPFDEDNPADYRSSYIDCPNSPLYPFGYGLSYTSFE 660
Query: 667 YKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDV 726
Y + K S K L++ G V + VTN G +
Sbjct: 661 YD------------NAKISSKK--LERGGI-----------------LTVSVDVTNTGTM 689
Query: 727 DGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIAN 780
DG VV L+ S P K+L GF ++H +K + F +D E+L + N
Sbjct: 690 DGEEVVQLYIHDKVGSVVRPVKELKGFKKIHLKKGETKTVEFTIDE-ERLKMYN 742
>gi|423250669|ref|ZP_17231684.1| hypothetical protein HMPREF1066_02694 [Bacteroides fragilis
CL03T00C08]
gi|423253995|ref|ZP_17234925.1| hypothetical protein HMPREF1067_01569 [Bacteroides fragilis
CL03T12C07]
gi|392651626|gb|EIY45288.1| hypothetical protein HMPREF1066_02694 [Bacteroides fragilis
CL03T00C08]
gi|392654553|gb|EIY48200.1| hypothetical protein HMPREF1067_01569 [Bacteroides fragilis
CL03T12C07]
Length = 859
Score = 246 bits (628), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 215/759 (28%), Positives = 338/759 (44%), Gaps = 123/759 (16%)
Query: 72 LTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVS 131
LTL ++Q+ + RLGIP + +ESLHG +G T FPQ +
Sbjct: 95 LTLMNEVQKYMREKT---RLGIPVFTL-TESLHGSVHDGS-----------TIFPQAIAL 139
Query: 132 AASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVV 191
++FN L + SA+A E A Q P I++ RD RWGR +E GEDP +V
Sbjct: 140 GSTFNPILAYEMTSAIAKELSA-----QGITQSLTPVIDVCRDLRWGRVEECFGEDPYLV 194
Query: 192 SAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDL 251
S V V+ G+ + +V S KH A+
Sbjct: 195 SRMGVSQVR----------------GYLDNQV----------------SPMIKHFGAHGA 222
Query: 252 EKWG-NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF 310
+ G N + S +++ + F + +++ K +M SYN N P L
Sbjct: 223 PQGGLNLASVSCG----QRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNEPNSSSHYLM 278
Query: 311 QKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQS 369
+ R+ W F+GY+ SD A+ + + + ++A L AG+D Q
Sbjct: 279 TELLRDRWDFQGYVYSDWGAIGMLNYFHKTAQNSAEAAIQALTAGLDAEASDNSYAELQQ 338
Query: 370 AIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD-PRKGKYGKLGPDDVCTSEHKKLALDAA 428
++ G + K ID+A+ + + + +GLF P + Y K+ V H LA A
Sbjct: 339 LVENGMLDVKYIDQAVARILTAKFNMGLFEYPLPMEKNYDKV----VHAPAHVSLARKIA 394
Query: 429 RQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPK--SLLRGLEAY 486
+ IVLL+N+ LPL N + S+A+IGP + + G YT + +LL L+
Sbjct: 395 EESIVLLQNENNILPLQMNKLKSIAVIGPNADQVQ--FGDYTWSRDNKDGVTLLEALKER 452
Query: 487 VSKT---HYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG-------LDLTQET--ED 534
V +YA GC D+ + +GF EAV +AKK+D IVV G D + T E
Sbjct: 453 VGNQLTLNYAKGC-DLVTDDRSGFKEAVDVAKKSDVCIVVVGSASASLARDYSNATCGEG 511
Query: 535 RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEA 594
D L L G Q LV ++ T K PVI+VL G P +S+ + + I I+ YPGE
Sbjct: 512 FDLSDLTLTGVQEDLVEAIHATGK-PVIVVLLSGKPFAMSWIKEN--IPGIVVQWYPGEQ 568
Query: 595 GAKALAEIIFGDFNPGGRLPMTWYPES-------FTKVPMNDMNMRADSSRQYPGRSYRF 647
G ALA+++ G NP G+L + +P+S + +P + R+ S+ PG+ Y F
Sbjct: 569 GGLALADMLLGKVNPSGKLNYS-FPQSVGHLPCYYNYLPTDKGFYRSPGSKNKPGKDYVF 627
Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVT 707
+ ++ FGHGLSYT++ Y L T S+ DY
Sbjct: 628 SSPKALWAFGHGLSYTDFEY-----------------------LSATTSKEDY------- 657
Query: 708 SCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEIS 767
+C + V I++ N GD DG V ++ R S P ++L GF++V +K++
Sbjct: 658 ACEDV-IEVTIAIRNTGDYDGLEVPQVYVRDMVSSVVIPVQELKGFEKVLIKKGETKQVI 716
Query: 768 FGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
+ P +L++ NK ++++ G L +G + I+
Sbjct: 717 IKI-PVSELALYNKEMKKVVEPGAFELQIGRASDDIRIK 754
>gi|416424053|ref|ZP_11691321.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 315996572]
gi|416434178|ref|ZP_11697512.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 495297-1]
gi|416438422|ref|ZP_11699509.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 495297-3]
gi|416447728|ref|ZP_11705981.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 495297-4]
gi|416450525|ref|ZP_11707600.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 515920-1]
gi|416460485|ref|ZP_11714793.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 515920-2]
gi|416470222|ref|ZP_11718747.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 531954]
gi|416476726|ref|ZP_11721214.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. NC_MB110209-0054]
gi|416508101|ref|ZP_11735884.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. SARB31]
gi|416513698|ref|ZP_11738019.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. ATCC BAA710]
gi|416540687|ref|ZP_11750492.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 19N]
gi|416561671|ref|ZP_11761668.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 42N]
gi|416575337|ref|ZP_11768369.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 81038-01]
gi|416594582|ref|ZP_11780414.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 414877]
gi|416602160|ref|ZP_11785217.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 366867]
gi|416607231|ref|ZP_11788413.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 413180]
gi|416611818|ref|ZP_11791047.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 446600]
gi|416623896|ref|ZP_11797678.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 609458-1]
gi|416630098|ref|ZP_11800505.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 556150-1]
gi|416636573|ref|ZP_11803136.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 609460]
gi|416652084|ref|ZP_11811486.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Montevideo str. 507440-20]
gi|416655624|ref|ZP_11812629.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 556152]
gi|416669115|ref|ZP_11819140.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. MB101509-0077]
gi|416700997|ref|ZP_11829262.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. MB110209-0055]
gi|416705382|ref|ZP_11830863.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. MB111609-0052]
gi|416713638|ref|ZP_11837193.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 2009083312]
gi|416720123|ref|ZP_11841889.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 2009085258]
gi|416721559|ref|ZP_11842724.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 315731156]
gi|416729877|ref|ZP_11848338.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2009159199]
gi|416738151|ref|ZP_11853179.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008282]
gi|416744242|ref|ZP_11856524.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008283]
gi|416759732|ref|ZP_11864557.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Montevideo str. IA_2010008284]
gi|416760387|ref|ZP_11864780.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008285]
gi|416767537|ref|ZP_11869997.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008287]
gi|418484908|ref|ZP_13053898.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 80959-06]
gi|418492113|ref|ZP_13058613.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035278]
gi|418496941|ref|ZP_13063366.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035318]
gi|418500305|ref|ZP_13066703.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035320]
gi|418503319|ref|ZP_13069684.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035321]
gi|418509304|ref|ZP_13075600.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035327]
gi|418512924|ref|ZP_13079159.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Pomona str. ATCC 10729]
gi|418527981|ref|ZP_13093934.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008286]
gi|452119629|ref|YP_007469877.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Javiana str. CFSAN001992]
gi|322615194|gb|EFY12116.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 315996572]
gi|322617785|gb|EFY14681.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 495297-1]
gi|322624621|gb|EFY21452.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 495297-3]
gi|322626928|gb|EFY23724.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 495297-4]
gi|322634114|gb|EFY30850.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 515920-1]
gi|322635695|gb|EFY32405.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 515920-2]
gi|322640170|gb|EFY36834.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 531954]
gi|322646407|gb|EFY42919.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. NC_MB110209-0054]
gi|322661471|gb|EFY57695.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 19N]
gi|322665641|gb|EFY61825.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 81038-01]
gi|322671171|gb|EFY67299.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 414877]
gi|322675188|gb|EFY71265.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 366867]
gi|322680797|gb|EFY76832.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 413180]
gi|322686972|gb|EFY82949.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 446600]
gi|323192736|gb|EFZ77963.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 609458-1]
gi|323198832|gb|EFZ83931.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 556150-1]
gi|323205158|gb|EFZ90136.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 609460]
gi|323213417|gb|EFZ98214.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 556152]
gi|323215774|gb|EGA00517.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. MB101509-0077]
gi|323223860|gb|EGA08163.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. MB110209-0055]
gi|323231220|gb|EGA15335.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. MB111609-0052]
gi|323233654|gb|EGA17746.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 2009083312]
gi|323237724|gb|EGA21784.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 2009085258]
gi|323245707|gb|EGA29701.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 315731156]
gi|323248929|gb|EGA32853.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2009159199]
gi|323253000|gb|EGA36833.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008282]
gi|323258676|gb|EGA42338.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008283]
gi|323259508|gb|EGA43143.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Montevideo str. IA_2010008284]
gi|323268291|gb|EGA51766.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008285]
gi|323271920|gb|EGA55336.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008287]
gi|363552716|gb|EHL36999.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. SARB31]
gi|363567278|gb|EHL51278.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. ATCC BAA710]
gi|363573864|gb|EHL57738.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 42N]
gi|366055944|gb|EHN20277.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035318]
gi|366057212|gb|EHN21516.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 80959-06]
gi|366058491|gb|EHN22779.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035278]
gi|366069158|gb|EHN33284.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035320]
gi|366073161|gb|EHN37236.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035321]
gi|366078494|gb|EHN42495.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035327]
gi|366082861|gb|EHN46791.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Pomona str. ATCC 10729]
gi|366827184|gb|EHN54093.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Montevideo str. 507440-20]
gi|372204105|gb|EHP17636.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008286]
gi|451908633|gb|AGF80439.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Javiana str. CFSAN001992]
Length = 765
Score = 246 bits (628), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 210/749 (28%), Positives = 346/749 (46%), Gaps = 139/749 (18%)
Query: 75 QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
++ I+Q+ D A+ RL IP + + + +HG T FP L A+S
Sbjct: 86 RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 129
Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
FN +G A EA GL WAP +++ RDPRWGR E GED + S
Sbjct: 130 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 183
Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
VK+ QG++ D + KH AY +
Sbjct: 184 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 217
Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
G +N + ++ Q + + PP+++ ++ G + +M + N +NG PA L +
Sbjct: 218 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 272
Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
R+EWGFKG SD A+ + ++ ED+ LKAG+D++ ++
Sbjct: 273 VLRDEWGFKGITVSDHGAIKELIKH-GTAADPEDAVRVALKAGVDMSMADEYYSKYLPGL 331
Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
I GKV ++D A ++ +V+ +GLFN DP Y LGP D V T+ H+K
Sbjct: 332 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 386
Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
A + AR+ +VLLKN + LPL K+ ++A++GPL ++ + G ++ G+ ++L
Sbjct: 387 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 444
Query: 482 GLEAYV---SKTHYASGCH-----DV--------------PCNSDAGFHEAVRIAKKADF 519
G++ V +K YA G + D+ P + A EAV+ AK+AD
Sbjct: 445 GIQNAVGDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 504
Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
V+ V G E R ++ +P Q L+T++ T K P++LVL G PL + + D
Sbjct: 505 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 561
Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
Q +IL + G G A+A+++FGD+NP G+LP++ +P S ++P+ ++
Sbjct: 562 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 620
Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
+ +Y R + G +Y FG+GLSYT ++ S++T+S+ +Q+
Sbjct: 621 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 665
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
G +VT+ + VTN G +G+ V+ ++ + S P KQL GF
Sbjct: 666 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 708
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
+++ K +SF +D E L N+
Sbjct: 709 EKITLKPGERKTVSFPID-IEALKFWNQQ 736
>gi|437730260|ref|ZP_20831186.1| beta-D-glucoside glucohydrolase, partial [Salmonella enterica
subsp. enterica serovar Enteritidis str. 648901 16-16]
gi|435289521|gb|ELO66481.1| beta-D-glucoside glucohydrolase, partial [Salmonella enterica
subsp. enterica serovar Enteritidis str. 648901 16-16]
Length = 764
Score = 246 bits (627), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 210/749 (28%), Positives = 346/749 (46%), Gaps = 139/749 (18%)
Query: 75 QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
++ I+Q+ D A+ RL IP + + + +HG T FP L A+S
Sbjct: 86 RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 129
Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
FN +G A EA GL WAP +++ RDPRWGR E GED + S
Sbjct: 130 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 183
Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
VK+ QG++ D + KH AY +
Sbjct: 184 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 217
Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
G +N + ++ Q + + PP+++ ++ G + +M + N +NG PA L +
Sbjct: 218 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 272
Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
R+EWGFKG SD A+ + ++ ED+ LKAG+D++ ++
Sbjct: 273 VLRDEWGFKGITVSDHGAIKELIKHGTAADP-EDAVRVALKAGVDMSMADEYYSKYLPGL 331
Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
I GKV ++D A ++ +V+ +GLFN DP Y LGP D V T+ H+K
Sbjct: 332 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 386
Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
A + AR+ +VLLKN + LPL K+ ++A++GPL ++ + G ++ G+ ++L
Sbjct: 387 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 444
Query: 482 GLEAYV---SKTHYASGCH-----DV--------------PCNSDAGFHEAVRIAKKADF 519
G++ V +K YA G + D+ P + A EAV+ AK+AD
Sbjct: 445 GIQNAVDDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 504
Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
V+ V G E R ++ +P Q L+T++ T K P++LVL G PL + + D
Sbjct: 505 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 561
Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
Q +IL + G G A+A+++FGD+NP G+LP++ +P S ++P+ ++
Sbjct: 562 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 620
Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
+ +Y R + G +Y FG+GLSYT ++ S++T+S+ +Q+
Sbjct: 621 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 665
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
G +VT+ + VTN G +G+ V+ ++ + S P KQL GF
Sbjct: 666 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 708
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
+++ K +SF +D E L N+
Sbjct: 709 EKITLKPGERKTVSFPID-IEALKFWNQQ 736
>gi|437849166|ref|ZP_20847216.1| beta-D-glucoside glucohydrolase, partial [Salmonella enterica
subsp. enterica serovar Enteritidis str. 6.0562-1]
gi|435338617|gb|ELP07830.1| beta-D-glucoside glucohydrolase, partial [Salmonella enterica
subsp. enterica serovar Enteritidis str. 6.0562-1]
Length = 792
Score = 246 bits (627), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 210/749 (28%), Positives = 346/749 (46%), Gaps = 139/749 (18%)
Query: 75 QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
++ I+Q+ D A+ RL IP + + + +HG T FP L A+S
Sbjct: 113 RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 156
Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
FN +G A EA GL WAP +++ RDPRWGR E GED + S
Sbjct: 157 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 210
Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
VK+ QG++ D + KH AY +
Sbjct: 211 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 244
Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
G +N + ++ Q + + PP+++ ++ G + +M + N +NG PA L +
Sbjct: 245 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 299
Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
R+EWGFKG SD A+ + ++ ED+ LKAG+D++ ++
Sbjct: 300 VLRDEWGFKGITVSDHGAIKELIKH-GTAADPEDAVRVALKAGVDMSMADEYYSKYLPGL 358
Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
I GKV ++D A ++ +V+ +GLFN DP Y LGP D V T+ H+K
Sbjct: 359 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 413
Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
A + AR+ +VLLKN + LPL K+ ++A++GPL ++ + G ++ G+ ++L
Sbjct: 414 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 471
Query: 482 GLEAYV---SKTHYASGCH-----DV--------------PCNSDAGFHEAVRIAKKADF 519
G++ V +K YA G + D+ P + A EAV+ AK+AD
Sbjct: 472 GIQNAVDDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 531
Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
V+ V G E R ++ +P Q L+T++ T K P++LVL G PL + + D
Sbjct: 532 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 588
Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
Q +IL + G G A+A+++FGD+NP G+LP++ +P S ++P+ ++
Sbjct: 589 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 647
Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
+ +Y R + G +Y FG+GLSYT ++ S++T+S+ +Q+
Sbjct: 648 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 692
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
G +VT+ + VTN G +G+ V+ ++ + S P KQL GF
Sbjct: 693 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 735
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
+++ K +SF +D E L N+
Sbjct: 736 EKITLKPGERKTVSFPID-IEALKFWNQQ 763
>gi|60682370|ref|YP_212514.1| hydrolase [Bacteroides fragilis NCTC 9343]
gi|60493804|emb|CAH08594.1| putative exported hydrolase [Bacteroides fragilis NCTC 9343]
Length = 859
Score = 246 bits (627), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 214/759 (28%), Positives = 338/759 (44%), Gaps = 123/759 (16%)
Query: 72 LTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVS 131
LTL ++Q+ + RLGIP + +ESLHG +G T FPQ +
Sbjct: 95 LTLMNEVQKYMREKT---RLGIPVFTL-TESLHGSVHDGS-----------TIFPQAIAL 139
Query: 132 AASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVV 191
++FN L + SA+A E A Q P I++ RD RWGR +E GEDP +V
Sbjct: 140 GSTFNPILAYEMTSAIAKELSA-----QGITQSLTPVIDVCRDLRWGRVEECFGEDPFLV 194
Query: 192 SAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDL 251
S V V+ G+ + +V S KH A+
Sbjct: 195 SRMGVSQVR----------------GYLDNQV----------------SPMIKHFGAHGA 222
Query: 252 EKWG-NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF 310
+ G N + + + +++ + F + +++ K +M SYN N P L
Sbjct: 223 PQGGLNLA----SVLCGQRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNEPNSSSHYLM 278
Query: 311 QKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQS 369
+ R+ W F+GY+ SD A+ + + + ++A L AG+D Q
Sbjct: 279 TELLRDRWDFQGYVYSDWGAIGMLNYFHKTAQNSAEAAIQALTAGLDAEASDNSYAELQQ 338
Query: 370 AIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD-PRKGKYGKLGPDDVCTSEHKKLALDAA 428
++ G + K ID+A+ + + + +GLF P + Y K+ V H LA A
Sbjct: 339 LVENGMLDVKYIDQAVARILTAKFNMGLFEYPLPMEKNYDKV----VHAPAHVSLARKIA 394
Query: 429 RQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPK--SLLRGLEAY 486
+ IVLL+N LPL N + S+A+IGP + + G YT + +LL L+
Sbjct: 395 EESIVLLQNKNNILPLQMNKLKSIAVIGPNADQVQ--FGDYTWSRDNKDGVTLLEALKER 452
Query: 487 VSKT---HYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG-------LDLTQET--ED 534
V +YA GC D+ + +GF EAV +AKK+D IVV G D + T E
Sbjct: 453 VGNQLTLNYAKGC-DLVTDDRSGFKEAVDVAKKSDVCIVVVGSASASLARDYSNATCGEG 511
Query: 535 RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEA 594
D L L G Q LV ++ T K PVI+VL G P +S+ + + I I+ YPGE
Sbjct: 512 FDLSDLTLTGVQEDLVEAIHATGK-PVIVVLLSGKPFAMSWIKEN--IPGIVVQWYPGEQ 568
Query: 595 GAKALAEIIFGDFNPGGRLPMTWYPES-------FTKVPMNDMNMRADSSRQYPGRSYRF 647
G ALA+++ G NP G+L + +P+S + +P + R+ S+ PG+ Y F
Sbjct: 569 GGLALADMLLGKVNPSGKLNYS-FPQSVGHLPCYYNYLPTDKGFYRSPGSKNKPGKDYVF 627
Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVT 707
+ ++ FGHGLSYT++ Y L T S+ DY
Sbjct: 628 SSPKALWAFGHGLSYTDFEY-----------------------LSATTSKEDY------- 657
Query: 708 SCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEIS 767
+C + V I++ N GD DG V ++ R S P ++L GF++V +K++
Sbjct: 658 ACEDV-IEVTIAIRNTGDYDGLEVPQVYVRDMVSSVVMPVQELKGFEKVLIKKGETKQVI 716
Query: 768 FGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
+ P +L++ NK ++++ G L +G + I+
Sbjct: 717 IKI-PVSELALYNKEMKKVVEPGAFELQIGRASDDIRIK 754
>gi|378984729|ref|YP_005247884.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
subsp. enterica serovar Typhimurium str. T000240]
gi|312913157|dbj|BAJ37131.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
subsp. enterica serovar Typhimurium str. T000240]
Length = 765
Score = 246 bits (627), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 209/749 (27%), Positives = 344/749 (45%), Gaps = 139/749 (18%)
Query: 75 QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
++ I+Q+ D A+ RL IP + + + +HG T FP L A+S
Sbjct: 86 RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 129
Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
FN +G A EA GL WAP +++ RDPRWGR E GED + S
Sbjct: 130 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 183
Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
VK+ QG++ D + KH AY +
Sbjct: 184 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 217
Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
G +N + ++ Q + + PP+++ ++ G + +M + N +NG PA L +
Sbjct: 218 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 272
Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
R+EWGFKG SD A+ + ++ ED+ LKAG+D++ ++
Sbjct: 273 VLRDEWGFKGITVSDHGAIKELIKHGTAADP-EDAVRVALKAGVDMSMADEYYSKYLPGL 331
Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
I GKV ++D A ++ +V+ +GLFN DP Y LGP D V T+ H+K
Sbjct: 332 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 386
Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
A + AR+ +VLLKN + LPL K+ ++A++GPL ++ + G ++ G+ ++L
Sbjct: 387 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 444
Query: 482 GLEAYV---SKTHYASGCHDV-------------------PCNSDAGFHEAVRIAKKADF 519
G++ V +K YA G + P + A EAV+ AK+AD
Sbjct: 445 GIQNAVGDGAKILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 504
Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
V+ V G E R ++ +P Q L+T++ T K P++LVL G PL + + D
Sbjct: 505 VVAVVGESQEMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 561
Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
Q +IL + G G A+A+++FGD+NP G+LP++ +P S ++P+ ++
Sbjct: 562 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 620
Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
+ +Y R + G +Y FG+GLSYT ++ S++T+S+ +Q+
Sbjct: 621 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 665
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
G +VT+ + VTN G +G+ V+ ++ + S P KQL GF
Sbjct: 666 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 708
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
+++ K +SF +D E L N+
Sbjct: 709 EKITLKPGERKTVSFPID-IEALKFWNQQ 736
>gi|153809292|ref|ZP_01961960.1| hypothetical protein BACCAC_03604 [Bacteroides caccae ATCC 43185]
gi|149128062|gb|EDM19283.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
caccae ATCC 43185]
Length = 946
Score = 246 bits (627), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 207/735 (28%), Positives = 335/735 (45%), Gaps = 115/735 (15%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ S T+FP L ++NR L +G
Sbjct: 153 RLGIPT-DFTNEGIRGVES-----------YKATNFPTQLGLGHTWNRQLIHQVGLITGR 200
Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
EAR + G T +AP +++ RD RWGR +E GE P +V+ +E V+ Q +
Sbjct: 201 EARML------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGMQHNH-- 252
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
++A KH IAY K + ++
Sbjct: 253 -----------------------------QVAATGKHFIAYSNNKGAREGMARVDPQMSP 283
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVP-ACLRGDLFQKARNEWGFKGYITSDC 327
++ E PF+ I + +M SYN +G P L + R E GF+GY+ SD
Sbjct: 284 REVEMLHAYPFKRVIREAGLLGVMSSYNDYDGFPIQSSYYWLTTRLRGEMGFRGYVVSDS 343
Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDIDR 383
DAV ++ K +++ ++AG+++ C + + + +G + E+ I+
Sbjct: 344 DAVEYLYTKHGTAKDMKEAVRQSVEAGLNVRCTFRSPDSYVLPLRELVKEGGLSEEVIND 403
Query: 384 ALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLP 443
+ ++ V+ +GLF+ + G ++V E++++AL A+R+ IVLLKN+K LP
Sbjct: 404 RVRDILRVKFLVGLFDTPYQTDLKG--ADEEVEKKENEEVALQASRESIVLLKNEKNVLP 461
Query: 444 LNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCHDVP 500
L+ + + +A+ GP + S Y + S+L+G++ + + Y GC V
Sbjct: 462 LDPSKIRKIAVCGPNADEHSYALTHYGPLAVEVTSVLKGIQEKMKDKADVLYTKGCDLVD 521
Query: 501 CN--------------SDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQ 546
N +AV AK+AD IVV G E++ R SL LPG+Q
Sbjct: 522 ANWPESELIDYPLTDEEQKEIDKAVSQAKQADVAIVVLGGGQRTCGENKSRSSLDLPGRQ 581
Query: 547 MSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD 606
+ L+ +V T K PV+LVL G PL +++A D + +IL YPG G A+A+I+FGD
Sbjct: 582 LDLLKAVVATGK-PVVLVLINGRPLSINWA--DKFVPAILEAWYPGSKGGIAVADILFGD 638
Query: 607 FNPGGRLPMTWYPESFTKVPMN---DMNMRADSSRQYPGRSYRFYTGT-QVYGFGHGLSY 662
+NPGG+L +T +P++ ++P N + + D + PG +Y FG+GLSY
Sbjct: 639 YNPGGKLTVT-FPKTVGQIPFNFPCKPSSQIDGGKN-PGPDGNMSRANGALYPFGYGLSY 696
Query: 663 TNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTN 722
T + Y S+L IS ++ + K +V VTN
Sbjct: 697 TTFEY------SDLKISPAIITPNQKA-------------------------YVTCKVTN 725
Query: 723 AGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
G G V+ L+ R S T EK L GF+RVH +KEI+F +D + L + N
Sbjct: 726 TGKRSGDEVIQLYVRDVLSSVTTYEKNLAGFERVHLKPGETKEITFPID-RKALELLNAD 784
Query: 783 GRRILPLGNHVLMVG 797
++ G+ LM+G
Sbjct: 785 MHWVVEPGDFTLMLG 799
>gi|194449077|ref|YP_002046219.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Heidelberg str. SL476]
gi|205359117|ref|ZP_02667180.2| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
subsp. enterica serovar Heidelberg str. SL486]
gi|386591978|ref|YP_006088378.1| Periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Heidelberg str. B182]
gi|417384271|ref|ZP_12149694.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Johannesburg str. S5-703]
gi|194407381|gb|ACF67600.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Heidelberg str. SL476]
gi|205338504|gb|EDZ25268.1| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
subsp. enterica serovar Heidelberg str. SL486]
gi|353609401|gb|EHC62716.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Johannesburg str. S5-703]
gi|383799022|gb|AFH46104.1| Periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Heidelberg str. B182]
Length = 755
Score = 246 bits (627), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 207/749 (27%), Positives = 345/749 (46%), Gaps = 139/749 (18%)
Query: 75 QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
++ I+Q+ D A+ RL IP + + + +HG T FP L A+S
Sbjct: 76 RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 119
Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
FN +G A EA GL WAP +++ RDPRWGR E GED + S
Sbjct: 120 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 173
Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
VK+ QG++ D + KH AY +
Sbjct: 174 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 207
Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
G +N + ++ Q + + PP+++ ++ G + +M + N +NG PA L +
Sbjct: 208 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 262
Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
R+EWGFKG SD A+ + ++ ED+ LKAG+D++ ++
Sbjct: 263 VLRDEWGFKGITVSDHGAIKELIKH-GTAADPEDAVRVALKAGVDMSMADEYYSKYLPGL 321
Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSE-------HKKL 423
I GKV ++D A ++ +V+ +GLFN DP Y LGP + ++ H+K
Sbjct: 322 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPADTNAESRLHRKE 376
Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
A + AR+ +VLLKN + LPL K+ ++A++GPL ++ + G ++ G+ ++L
Sbjct: 377 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 434
Query: 482 GLEAYV---SKTHYASGCH-----DV--------------PCNSDAGFHEAVRIAKKADF 519
G++ V +K YA G + D+ P + A EAV+ AK+AD
Sbjct: 435 GIQNAVGDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 494
Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
V+ V G E R ++ +P Q L+T++ T K P++LVL G PL + + D
Sbjct: 495 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 551
Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
Q +IL + G G A+A+++FGD+NP G+LP++ +P S ++P+ ++
Sbjct: 552 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 610
Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
+ +Y R + G +Y FG+GLSYT ++ S++T+S+ +Q+
Sbjct: 611 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 655
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
G +VT+ + VTN G +G+ V+ ++ + S P KQL GF
Sbjct: 656 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 698
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
+++ K +SF +D E L N+
Sbjct: 699 EKITLKPGERKTVSFPID-IEALKFWNQQ 726
>gi|383761254|ref|YP_005440236.1| beta-xylosidase [Caldilinea aerophila DSM 14535 = NBRC 104270]
gi|381381522|dbj|BAL98338.1| beta-xylosidase [Caldilinea aerophila DSM 14535 = NBRC 104270]
Length = 790
Score = 246 bits (627), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 226/830 (27%), Positives = 350/830 (42%), Gaps = 198/830 (23%)
Query: 47 PHFDSYPFCNTSLSISTRAKSLISLLTLQEK----------------------------- 77
P +YP+ N SL + R LI +T++EK
Sbjct: 4 PTKTNYPYQNASLPVEERVNDLIGRMTIEEKVAQLGSLWIYEIADGQGLNRAWAQARMAN 63
Query: 78 ----IQQLSDNASAIP------------------RLGIPAYEWWSESLHGIASNGPGVNF 115
+ +L+ ++ P RLGIPA E G + G
Sbjct: 64 GLGQVTRLAGGSTLGPVETAKLANQIQKFLIEETRLGIPAL-IHDECCSGFLAKG----- 117
Query: 116 NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDP 175
T+FPQ++ A+++ L + + + RA+ GL AP ++I RDP
Sbjct: 118 ------ATNFPQIIGVASTWEPELVEAMTRVIRQQMRAVGV--HHGL---APVLDIARDP 166
Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
RWGR +ET GEDP + S V +++ QGE+W DG + G
Sbjct: 167 RWGRTEETFGEDPYLTSTMGVAYIRGLQGESWA--DGVMATG------------------ 206
Query: 236 ELMLSACCKHLIAYDLEKWG-NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCS 294
KH + Y + G N++ A IT ++ + + PF + + + + IM +
Sbjct: 207 --------KHFVGYSASEGGLNWA----PAHITPRELREVYLAPFEAAVRAARLASIMPA 254
Query: 295 YNQVNGVP-ACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKA 353
Y++++G P + R + + R+EWGF G + SD A+ + Y + +A L A
Sbjct: 255 YHEIDGEPCSSSRWLMTELLRHEWGFDGLVVSDYMAINQLHNYHRLARDKTQAAKLALAA 314
Query: 354 GMDINC------GTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNG---DPRK 404
GMD+ G ++ A+ G+V + IDR++ + + + GLF +P
Sbjct: 315 GMDLELPNIDAYGQALI----DAVHAGEVPLEWIDRSVRRILTFKFAFGLFEHPYVEP-- 368
Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
+ P+ T ++LA AR+ +VLLKN+ LPL+K +SS+A+IGP +
Sbjct: 369 ----EAVPEVFNTPPQRELARTIARKSLVLLKNEGNLLPLSKQ-LSSIAVIGPNADTKRN 423
Query: 465 MGGGYT-------------------------GIPCSPKSLLRGLEAY------VSKTHYA 493
+ G Y+ G+ S+L +EA ++ +A
Sbjct: 424 LLGDYSYPAHIETLITLRQLGFSEHPLPESVGLVDDYGSMLSIVEAIRQVVSPETQVQHA 483
Query: 494 SGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG-----LDLTQETEDRDRVSLLLPGQQMS 548
GC D+ S GF EAV A+KAD IVV G + E RD L LPG Q +
Sbjct: 484 KGC-DILSTSTEGFAEAVEAARKADVAIVVVGDKAGLIPECTSGEFRDSAHLTLPGVQQA 542
Query: 549 LVTSVARTSKRPVILVLTGGGPLDV-SFAEADSQISSILWIGYPGEAGAKALAEIIFGDF 607
LV ++ T PV+LVL G P + EA + W+ PG GA ALAE++FGD
Sbjct: 543 LVEAILATGT-PVVLVLVTGRPYAIPQLVEAAPAVIEA-WL--PGAEGAPALAEVLFGDV 598
Query: 608 NPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSY 667
NPGG+LP+T+ +VP+ + R +R + Y + ++ FG+GLSYT +++
Sbjct: 599 NPGGKLPITFL-RHVGQVPLFYAH-RPSGARSFFYGPYMDESNEPLFPFGYGLSYTQFAF 656
Query: 668 KFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVD 727
+ L+ E T T V + V N G
Sbjct: 657 ENLTVTPEAT-------------------------------TTDGEVQVSVEVVNIGTRA 685
Query: 728 GSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLS 777
G VV L+ RV S P K+L GF RVH K + F + P E L+
Sbjct: 686 GEEVVQLYTRVDGASVTRPVKELKGFKRVHLDPGERKRVHFTL-PVELLA 734
>gi|197263061|ref|ZP_03163135.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Saintpaul str. SARA23]
gi|207857601|ref|YP_002244252.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Enteritidis str. P125109]
gi|421359420|ref|ZP_15809713.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 622731-39]
gi|421364010|ref|ZP_15814248.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 639016-6]
gi|421367050|ref|ZP_15817252.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 640631]
gi|421373030|ref|ZP_15823175.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 77-0424]
gi|421375395|ref|ZP_15825508.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607307-6]
gi|421382177|ref|ZP_15832228.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 485549-17]
gi|421386891|ref|ZP_15836897.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 596866-22]
gi|421391230|ref|ZP_15841201.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 596866-70]
gi|421395600|ref|ZP_15845536.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 629164-26]
gi|421397961|ref|ZP_15847870.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 629164-37]
gi|421402484|ref|ZP_15852342.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 639672-46]
gi|421409068|ref|ZP_15858863.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 639672-50]
gi|421411752|ref|ZP_15861516.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 77-1427]
gi|421418166|ref|ZP_15867872.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 77-2659]
gi|421422724|ref|ZP_15872392.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 78-1757]
gi|421424918|ref|ZP_15874555.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 22510-1]
gi|421433210|ref|ZP_15882778.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 8b-1]
gi|421434152|ref|ZP_15883702.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648905 5-18]
gi|421441895|ref|ZP_15891355.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 6-18]
gi|421443218|ref|ZP_15892660.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 50-3079]
gi|436600421|ref|ZP_20512957.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 22704]
gi|436686654|ref|ZP_20517957.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE30663]
gi|436802328|ref|ZP_20525319.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CHS44]
gi|436809476|ref|ZP_20528856.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1882]
gi|436814772|ref|ZP_20532323.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1884]
gi|436844194|ref|ZP_20537952.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1594]
gi|436854476|ref|ZP_20544110.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1566]
gi|436855800|ref|ZP_20544925.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1580]
gi|436864301|ref|ZP_20550268.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1543]
gi|436870232|ref|ZP_20554038.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1441]
gi|436877723|ref|ZP_20558651.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1810]
gi|436886662|ref|ZP_20563082.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1558]
gi|436893559|ref|ZP_20567466.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1018]
gi|436901307|ref|ZP_20572217.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1010]
gi|436912656|ref|ZP_20578485.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1729]
gi|436920338|ref|ZP_20582934.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0895]
gi|436926675|ref|ZP_20586501.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0899]
gi|436936605|ref|ZP_20592045.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1457]
gi|436940618|ref|ZP_20594562.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1747]
gi|436950717|ref|ZP_20599772.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0968]
gi|436961959|ref|ZP_20605333.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1444]
gi|436969147|ref|ZP_20608268.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1445]
gi|436976823|ref|ZP_20612073.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1559]
gi|436989589|ref|ZP_20616596.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1565]
gi|437001001|ref|ZP_20620797.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1808]
gi|437022365|ref|ZP_20628334.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1811]
gi|437036112|ref|ZP_20633844.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0956]
gi|437044148|ref|ZP_20637101.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1455]
gi|437052068|ref|ZP_20641628.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1575]
gi|437059125|ref|ZP_20645972.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1725]
gi|437064832|ref|ZP_20648606.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1745]
gi|437077112|ref|ZP_20655320.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1791]
gi|437083638|ref|ZP_20659292.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1795]
gi|437092016|ref|ZP_20663616.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 576709]
gi|437113855|ref|ZP_20669057.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 635290-58]
gi|437122048|ref|ZP_20672085.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607308-16]
gi|437128670|ref|ZP_20675357.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607308-19]
gi|437139171|ref|ZP_20681653.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607307-2]
gi|437144157|ref|ZP_20684771.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607308-9]
gi|437151251|ref|ZP_20689128.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 629163]
gi|437162727|ref|ZP_20696289.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE15-1]
gi|437169975|ref|ZP_20700070.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_N202]
gi|437179039|ref|ZP_20705157.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_56-3991]
gi|437181201|ref|ZP_20706372.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_76-3618]
gi|437239027|ref|ZP_20714239.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 13183-1]
gi|437261953|ref|ZP_20718699.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_81-2490]
gi|437267405|ref|ZP_20721157.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. SL909]
gi|437280676|ref|ZP_20728053.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. SL913]
gi|437289916|ref|ZP_20731294.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_69-4941]
gi|437311898|ref|ZP_20736006.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 638970-15]
gi|437330435|ref|ZP_20741599.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 17927]
gi|437346849|ref|ZP_20747003.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CHS4]
gi|437411031|ref|ZP_20752807.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 543463 22-17]
gi|437442646|ref|ZP_20757878.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 543463 40-18]
gi|437461006|ref|ZP_20761959.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 561362 1-1]
gi|437479748|ref|ZP_20768095.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 642044 4-1]
gi|437494385|ref|ZP_20772414.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 642046 4-7]
gi|437505366|ref|ZP_20775420.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648898 4-5]
gi|437535206|ref|ZP_20781440.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648899 3-17]
gi|437559607|ref|ZP_20785823.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648900 1-16]
gi|437572273|ref|ZP_20789035.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 1-17]
gi|437583575|ref|ZP_20792569.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 39-2]
gi|437599978|ref|ZP_20797137.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648902 6-8]
gi|437616848|ref|ZP_20802600.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648903 1-6]
gi|437631373|ref|ZP_20806367.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648904 3-6]
gi|437662149|ref|ZP_20813366.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 653049 13-19]
gi|437676329|ref|ZP_20816941.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 642044 8-1]
gi|437695929|ref|ZP_20822252.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 561362 9-7]
gi|437711846|ref|ZP_20826864.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 543463 42-20]
gi|437813102|ref|ZP_20841687.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 33944]
gi|438082703|ref|ZP_20857889.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 81-2625]
gi|438099351|ref|ZP_20863367.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 62-1976]
gi|438109484|ref|ZP_20867447.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 53-407]
gi|445165173|ref|ZP_21394056.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE8a]
gi|445182314|ref|ZP_21398520.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 20037]
gi|445229380|ref|ZP_21405024.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE10]
gi|445333762|ref|ZP_21414961.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 18569]
gi|445344304|ref|ZP_21417576.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 13-1]
gi|445361577|ref|ZP_21423869.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. PT23]
gi|197241316|gb|EDY23936.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Saintpaul str. SARA23]
gi|206709404|emb|CAR33745.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Enteritidis str. P125109]
gi|395985043|gb|EJH94216.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 622731-39]
gi|395985485|gb|EJH94655.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 639016-6]
gi|395989705|gb|EJH98839.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 640631]
gi|395998658|gb|EJI07685.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 77-0424]
gi|395999277|gb|EJI08299.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 485549-17]
gi|396005381|gb|EJI14360.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607307-6]
gi|396011532|gb|EJI20442.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 596866-22]
gi|396012239|gb|EJI21137.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 629164-26]
gi|396012640|gb|EJI21536.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 596866-70]
gi|396025999|gb|EJI34772.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 639672-50]
gi|396032021|gb|EJI40746.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 639672-46]
gi|396032157|gb|EJI40881.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 629164-37]
gi|396039333|gb|EJI47961.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 77-2659]
gi|396042039|gb|EJI50662.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 78-1757]
gi|396045253|gb|EJI53847.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 77-1427]
gi|396049426|gb|EJI57969.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 8b-1]
gi|396058514|gb|EJI66975.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 22510-1]
gi|396060758|gb|EJI69199.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648905 5-18]
gi|396062572|gb|EJI70983.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 6-18]
gi|396071002|gb|EJI79329.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 50-3079]
gi|434957421|gb|ELL51060.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CHS44]
gi|434967293|gb|ELL60128.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1882]
gi|434972888|gb|ELL65276.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1884]
gi|434975403|gb|ELL67694.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 22704]
gi|434978780|gb|ELL70772.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1594]
gi|434983279|gb|ELL75087.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1566]
gi|434992014|gb|ELL83484.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1580]
gi|434995336|gb|ELL86652.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1543]
gi|435002560|gb|ELL93625.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1441]
gi|435003859|gb|ELL94860.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE30663]
gi|435008607|gb|ELL99430.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1810]
gi|435011905|gb|ELM02608.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1558]
gi|435018555|gb|ELM09017.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1018]
gi|435020741|gb|ELM11130.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1010]
gi|435026901|gb|ELM17032.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1729]
gi|435027857|gb|ELM17949.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0895]
gi|435036516|gb|ELM26335.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0899]
gi|435039443|gb|ELM29224.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1457]
gi|435045523|gb|ELM35151.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1747]
gi|435051098|gb|ELM40602.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1444]
gi|435051184|gb|ELM40686.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0968]
gi|435059248|gb|ELM48538.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1445]
gi|435071156|gb|ELM60106.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1565]
gi|435071299|gb|ELM60247.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1559]
gi|435074446|gb|ELM63278.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0956]
gi|435075555|gb|ELM64369.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1808]
gi|435076993|gb|ELM65767.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1811]
gi|435081352|gb|ELM69994.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1455]
gi|435091470|gb|ELM79861.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1575]
gi|435094938|gb|ELM83277.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1725]
gi|435100574|gb|ELM88742.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1745]
gi|435103950|gb|ELM92024.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1791]
gi|435107381|gb|ELM95366.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1795]
gi|435112922|gb|ELN00787.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 576709]
gi|435116153|gb|ELN03904.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 635290-58]
gi|435124194|gb|ELN11661.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607308-16]
gi|435128186|gb|ELN15537.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607308-19]
gi|435132693|gb|ELN19891.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607307-2]
gi|435139148|gb|ELN26152.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607308-9]
gi|435142647|gb|ELN29534.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE15-1]
gi|435145134|gb|ELN31963.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 629163]
gi|435147755|gb|ELN34507.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_N202]
gi|435152456|gb|ELN39086.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_56-3991]
gi|435162324|gb|ELN48508.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_81-2490]
gi|435166749|gb|ELN52715.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_76-3618]
gi|435170164|gb|ELN55920.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. SL913]
gi|435175921|gb|ELN61323.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. SL909]
gi|435182180|gb|ELN67212.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_69-4941]
gi|435182531|gb|ELN67539.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 13183-1]
gi|435183030|gb|ELN68005.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 638970-15]
gi|435189277|gb|ELN73922.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CHS4]
gi|435189619|gb|ELN74243.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 17927]
gi|435204236|gb|ELN87933.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 543463 22-17]
gi|435211340|gb|ELN94479.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 543463 40-18]
gi|435219425|gb|ELO01787.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 642044 4-1]
gi|435220698|gb|ELO02980.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 561362 1-1]
gi|435227283|gb|ELO08792.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 642046 4-7]
gi|435235423|gb|ELO16226.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648898 4-5]
gi|435238918|gb|ELO19527.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648900 1-16]
gi|435240567|gb|ELO20958.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648899 3-17]
gi|435247913|gb|ELO27842.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 1-17]
gi|435261576|gb|ELO40730.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648903 1-6]
gi|435262896|gb|ELO41978.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648902 6-8]
gi|435263473|gb|ELO42520.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 39-2]
gi|435267795|gb|ELO46460.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 653049 13-19]
gi|435276722|gb|ELO54719.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 642044 8-1]
gi|435277229|gb|ELO55183.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 561362 9-7]
gi|435283446|gb|ELO61011.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648904 3-6]
gi|435288066|gb|ELO65157.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 543463 42-20]
gi|435297184|gb|ELO73479.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 33944]
gi|435320361|gb|ELO93000.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 81-2625]
gi|435325905|gb|ELO97749.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 62-1976]
gi|435332633|gb|ELP03544.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 53-407]
gi|444865473|gb|ELX90243.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE10]
gi|444867011|gb|ELX91716.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE8a]
gi|444871369|gb|ELX95805.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 20037]
gi|444875449|gb|ELX99648.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 18569]
gi|444880476|gb|ELY04551.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 13-1]
gi|444884583|gb|ELY08407.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. PT23]
Length = 765
Score = 246 bits (627), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 210/749 (28%), Positives = 346/749 (46%), Gaps = 139/749 (18%)
Query: 75 QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
++ I+Q+ D A+ RL IP + + + +HG T FP L A+S
Sbjct: 86 RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 129
Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
FN +G A EA GL WAP +++ RDPRWGR E GED + S
Sbjct: 130 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 183
Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
VK+ QG++ D + KH AY +
Sbjct: 184 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 217
Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
G +N + ++ Q + + PP+++ ++ G + +M + N +NG PA L +
Sbjct: 218 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 272
Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
R+EWGFKG SD A+ + ++ ED+ LKAG+D++ ++
Sbjct: 273 VLRDEWGFKGITVSDHGAIKELIKHGTAADP-EDAVRVALKAGVDMSMADEYYSKYLPGL 331
Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
I GKV ++D A ++ +V+ +GLFN DP Y LGP D V T+ H+K
Sbjct: 332 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 386
Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
A + AR+ +VLLKN + LPL K+ ++A++GPL ++ + G ++ G+ ++L
Sbjct: 387 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 444
Query: 482 GLEAYV---SKTHYASGCH-----DV--------------PCNSDAGFHEAVRIAKKADF 519
G++ V +K YA G + D+ P + A EAV+ AK+AD
Sbjct: 445 GIQNAVDDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 504
Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
V+ V G E R ++ +P Q L+T++ T K P++LVL G PL + + D
Sbjct: 505 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 561
Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
Q +IL + G G A+A+++FGD+NP G+LP++ +P S ++P+ ++
Sbjct: 562 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 620
Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
+ +Y R + G +Y FG+GLSYT ++ S++T+S+ +Q+
Sbjct: 621 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 665
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
G +VT+ + VTN G +G+ V+ ++ + S P KQL GF
Sbjct: 666 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 708
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
+++ K +SF +D E L N+
Sbjct: 709 EKITLKPGERKTVSFPID-IEALKFWNQQ 736
>gi|421449803|ref|ZP_15899183.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 58-6482]
gi|396068619|gb|EJI76965.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 58-6482]
Length = 765
Score = 246 bits (627), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 210/749 (28%), Positives = 346/749 (46%), Gaps = 139/749 (18%)
Query: 75 QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
++ I+Q+ D A+ RL IP + + + +HG T FP L A+S
Sbjct: 86 RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 129
Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
FN +G A EA GL WAP +++ RDPRWGR E GED + S
Sbjct: 130 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 183
Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
VK+ QG++ D + KH AY +
Sbjct: 184 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 217
Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
G +N + ++ Q + + PP+++ ++ G + +M + N +NG PA L +
Sbjct: 218 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 272
Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
R+EWGFKG SD A+ + ++ ED+ LKAG+D++ ++
Sbjct: 273 VLRDEWGFKGITVSDHGAIKELIKHGTAADP-EDAVRVALKAGVDMSMADEYYSKYLPGL 331
Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
I GKV ++D A ++ +V+ +GLFN DP Y LGP D V T+ H+K
Sbjct: 332 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 386
Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
A + AR+ +VLLKN + LPL K+ ++A++GPL ++ + G ++ G+ ++L
Sbjct: 387 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 444
Query: 482 GLEAYV---SKTHYASGCH-----DV--------------PCNSDAGFHEAVRIAKKADF 519
G++ V +K YA G + D+ P + A EAV+ AK+AD
Sbjct: 445 GIQNAVDDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 504
Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
V+ V G E R ++ +P Q L+T++ T K P++LVL G PL + + D
Sbjct: 505 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 561
Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
Q +IL + G G A+A+++FGD+NP G+LP++ +P S ++P+ ++
Sbjct: 562 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 620
Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
+ +Y R + G +Y FG+GLSYT ++ S++T+S+ +Q+
Sbjct: 621 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 665
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
G +VT+ + VTN G +G+ V+ ++ + S P KQL GF
Sbjct: 666 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 708
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
+++ K +SF +D E L N+
Sbjct: 709 EKITLKPGERKTVSFPID-IEALKFWNQQ 736
>gi|224536087|ref|ZP_03676626.1| hypothetical protein BACCELL_00952 [Bacteroides cellulosilyticus
DSM 14838]
gi|224522306|gb|EEF91411.1| hypothetical protein BACCELL_00952 [Bacteroides cellulosilyticus
DSM 14838]
Length = 791
Score = 246 bits (627), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 209/748 (27%), Positives = 331/748 (44%), Gaps = 135/748 (18%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ + T FP +++N+ L + IG V
Sbjct: 142 RLGIPV-DFTNEGIRGLCHD-----------RATFFPSQSGQGSTWNKELIARIGE---V 186
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
EA+ LG + ++P ++I +DPRWGR E GEDP +V + ++S Q
Sbjct: 187 EAKEAIALGYTNI--YSPILDICQDPRWGRSVECYGEDPYLVGQLGKQMIQSLQKHR--- 241
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
L + KH Y + G + + ++ +
Sbjct: 242 -----------------------------LVSTVKHFAVYSIPVGGRDGKTRTDPHVSPR 272
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCD 328
+ + PFR + A +M SYN +G P + R E+GFKGY+ SD +
Sbjct: 273 EMRTLYLEPFRRAFCEAGALGVMSSYNDYDGEPITSSHHFLTEILRQEYGFKGYVVSDSE 332
Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTC----MLRHTQSAIDKGKVQEKDIDRA 384
AV I + + A + AG++I + + AI +GKV + I+
Sbjct: 333 AVEFITTKHHVVSNEVEGVAQAVNAGLNIRTHFTKPEDFVLPLRQAIKEGKVSPETINSR 392
Query: 385 LLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPL 444
+ ++ ++ LGLF+ +P +G K V EH+++AL+AARQ +VLLKN+ + LPL
Sbjct: 393 VADILRIKFWLGLFD-NPYRGD-EKQEEKIVHCKEHQQVALEAARQSLVLLKNENQLLPL 450
Query: 445 NKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGCHDVPCN 502
K V S+A+IGP N +Q+ Y K++ +G++ + +T Y GC + +
Sbjct: 451 KKT-VKSVAVIGPNANEQTQLICRYGPANAPIKTVYQGIKELLPETEVVYRKGCEIIDSH 509
Query: 503 --------------SDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMS 548
EAV A+ A+ V++V G EDR R SL LPG Q
Sbjct: 510 FPESEILPFEKTTEEQQMLDEAVAAARNAEVVVLVLGGSELTVREDRSRTSLDLPGHQQE 569
Query: 549 LVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFN 608
L+ ++ T K P +LVL G +++A + I +IL +PGE A+AE +FGD+N
Sbjct: 570 LMQAIHATGK-PTVLVLLDGRAATINYA--NQYIPAILHAWFPGEFAGTAVAEALFGDYN 626
Query: 609 PGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTG------TQVYG----FGH 658
PGGRL +T +P+S ++P ++ F G T VYG FG+
Sbjct: 627 PGGRLAVT-FPKSVGQIPF----------------AFPFKPGSDEPCETAVYGALYPFGY 669
Query: 659 GLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQI 718
GLSYT +SYK L E + G I SC
Sbjct: 670 GLSYTKFSYKNLQITPE-------EQGPQGEI---------------TVSC--------- 698
Query: 719 SVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
VTN GD G VV L+ R S T K L GF+R+ +K+++F + P + L +
Sbjct: 699 EVTNIGDRTGDEVVQLYLRDEVSSVTTYMKVLRGFERITLNPGETKKVTFILTP-QDLGL 757
Query: 779 ANKHGRRILPLGNHVLMVGELRHSLTIE 806
+K+ + ++ G +M+G + +E
Sbjct: 758 WDKNNKFVVEPGMFKVMIGAASTDIRLE 785
>gi|16761109|ref|NP_456726.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Typhi str. CT18]
gi|29141195|ref|NP_804537.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Typhi str. Ty2]
gi|213428232|ref|ZP_03360982.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Typhi str. E02-1180]
gi|25289432|pir||AD0778 beta-glucosidase (EC 3.2.1.21) - Salmonella enterica subsp.
enterica serovar Typhi (strain CT18)
gi|16503407|emb|CAD02546.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Typhi]
gi|29136821|gb|AAO68386.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Typhi str. Ty2]
Length = 765
Score = 246 bits (627), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 210/749 (28%), Positives = 345/749 (46%), Gaps = 139/749 (18%)
Query: 75 QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
++ I+Q+ D A+ RL IP + + + +HG T FP L A+S
Sbjct: 86 RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 129
Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
FN +G A EA GL WAP +++ RDPRWGR E GED + S
Sbjct: 130 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 183
Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
VK+ QG++ D + KH AY +
Sbjct: 184 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 217
Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
G +N + ++ Q + + PP+++ ++ G + +M + N +NG PA L +
Sbjct: 218 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 272
Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
R+EWGFKG SD A+ + ++ ED+ LKAG+D++ ++
Sbjct: 273 VLRDEWGFKGITVSDHGAIKELIKH-GTAADPEDAVRVALKAGVDMSMADEYYSKYLPGL 331
Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
I GKV ++D A ++ +V+ +GLFN DP Y LGP D V T+ H+K
Sbjct: 332 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 386
Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
A + AR+ +VLLKN + LPL K+ ++A++GPL ++ + G ++ G+ ++L
Sbjct: 387 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 444
Query: 482 GLEAYV---SKTHYASGCH-----DV--------------PCNSDAGFHEAVRIAKKADF 519
G++ V +K YA G + D+ P + A EAV+ AK+AD
Sbjct: 445 GIQNAVGDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 504
Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
V+ V G E R ++ +P Q L+T++ T K P++LVL G PL + + D
Sbjct: 505 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 561
Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
Q +IL + G G A+A+++FGD+NP G+LP++ +P S ++P+ ++
Sbjct: 562 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 620
Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
+ +Y R + G +Y FG+GLSYT ++ S++T+S+ +Q+
Sbjct: 621 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 665
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
G +VT+ + VTN G +G V+ ++ + S P KQL GF
Sbjct: 666 DG---------KVTAS--------VEVTNTGKREGETVIQMYLQDVTASMSRPVKQLKGF 708
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
+++ K +SF +D E L N+
Sbjct: 709 EKITLKPGERKTVSFPID-IEALKFWNQQ 736
>gi|200387658|ref|ZP_03214270.1| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
subsp. enterica serovar Virchow str. SL491]
gi|199604756|gb|EDZ03301.1| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
subsp. enterica serovar Virchow str. SL491]
Length = 765
Score = 246 bits (627), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 210/749 (28%), Positives = 344/749 (45%), Gaps = 139/749 (18%)
Query: 75 QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
++ I+Q+ D A+ RL IP + + + +HG T FP L A+S
Sbjct: 86 RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 129
Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
FN +G A EA GL WAP +++ RDPRWGR E GED + S
Sbjct: 130 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 183
Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
VK+ QG++ D + KH AY +
Sbjct: 184 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 217
Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
G +N + ++ Q + + PP+++ ++ G + +M + N +NG PA L +
Sbjct: 218 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 272
Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
R+EWGFKG SD A+ + ++ ED+ LKAG+D++ ++
Sbjct: 273 VLRDEWGFKGITVSDHGAIKELIKH-GTAADPEDAVRVALKAGVDMSMADEYYSKYLPGL 331
Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
I GKV ++D A ++ +V+ +GLFN DP Y LGP D V T+ H+K
Sbjct: 332 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 386
Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
A + AR+ +VLLKN + LPL K+ ++A++GPL ++ + G ++ G+ ++L
Sbjct: 387 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 444
Query: 482 GLEAYV---SKTHYASGCHDV-------------------PCNSDAGFHEAVRIAKKADF 519
G++ V +K YA G + P + A EAV+ AK+AD
Sbjct: 445 GIQNAVGDGAKILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 504
Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
V+ V G E R + +P Q L+T++ T K P++LVL G PL + + D
Sbjct: 505 VVAVVGESQGMAHEASSRTDITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 561
Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
Q +IL + G G A+A+++FGD+NP G+LP++ +P S ++P+ ++
Sbjct: 562 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 620
Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
+ +Y R + G +Y FG+GLSYT ++ S++T+S+ +Q+
Sbjct: 621 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 665
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
G +VT+ + VTN G +G+ V+ ++ + S P KQL GF
Sbjct: 666 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 708
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
+++ SK +SF +D E L N+
Sbjct: 709 EKLTLKPGESKTVSFPID-IEALKFWNQQ 736
>gi|417419081|ref|ZP_12159904.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Mississippi str. A4-633]
gi|353619740|gb|EHC70042.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Mississippi str. A4-633]
Length = 769
Score = 246 bits (627), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 212/762 (27%), Positives = 346/762 (45%), Gaps = 151/762 (19%)
Query: 75 QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
++ I+Q+ D A+ RL IP + + + +HG T FP L A+S
Sbjct: 76 RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 119
Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
FN +G A EA GL WAP +++ RDPRWGR E GED + S
Sbjct: 120 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 173
Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
VK+ QG++ D + KH AY +
Sbjct: 174 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 207
Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
G +N + ++ Q + + PP+++ ++ G + +M + N +NG PA L +
Sbjct: 208 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 262
Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
R+EWGFKG SD A+ + ++ ED+ LKAG+D++ ++
Sbjct: 263 VLRDEWGFKGITVSDHGAIKELIKHGTAADP-EDAVRVALKAGVDMSMADEYYSKYLPGL 321
Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
I GKV ++D A ++ +V+ +GLFN DP Y LGP D V T+ H+K
Sbjct: 322 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 376
Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
A + AR+ +VLLKN + LPL K+ ++A++GPL ++ + G ++ G+ ++L
Sbjct: 377 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 434
Query: 482 GLEAYV---SKTHYASGCHDV-------------------PCNSDAGFHEAVRIAKKADF 519
G++ V +K YA G + P + A EAV+ AK+AD
Sbjct: 435 GIQNAVGDGAKILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 494
Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
V+ V G E R ++ +P Q L+T++ T K P++LVL G PL + + D
Sbjct: 495 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 551
Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQ 639
Q +IL + G G A+A+++FGD+NP G+LP++ +P S ++P+ ++
Sbjct: 552 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYY 610
Query: 640 Y-------PGRSY----------RFYTGTQ--VYGFGHGLSYTNYSYKFLSAPSELTISA 680
Y PGR Y R++ +Y FG+GLSYT ++ S++T+S+
Sbjct: 611 YSHLNTGRPGRPYNPEKPNKYTSRYFDEANGPLYPFGYGLSYTTFTV------SDVTLSS 664
Query: 681 SLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPK 740
+Q+ G +VT+ + VTN G +G+ V+ ++ +
Sbjct: 665 P--------TMQRDG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVT 699
Query: 741 VSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
S P KQL GF+++ K +SF +D E L N+
Sbjct: 700 ASMSRPVKQLKGFEKITLKPGERKTVSFPID-IEALKFWNQQ 740
>gi|224583333|ref|YP_002637131.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Paratyphi C strain RKS4594]
gi|224467860|gb|ACN45690.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Paratyphi C strain RKS4594]
Length = 765
Score = 246 bits (627), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 210/749 (28%), Positives = 347/749 (46%), Gaps = 139/749 (18%)
Query: 75 QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
++ I+Q+ D A+ RL IP + + + +HG T FP L A+S
Sbjct: 86 RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 129
Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
FN +G A EA GL WAP +++ RDPRWGR E GED + S
Sbjct: 130 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 183
Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
VK+ QG++ D + KH AY +
Sbjct: 184 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 217
Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
G +N + ++ Q + + PP+++ ++ G + +M + N +NG PA L +
Sbjct: 218 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 272
Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
R+EWGFKG SD A+ + ++ ED+ LKAG+D++ ++
Sbjct: 273 VLRDEWGFKGITVSDHGAIKELIKHGTAADP-EDAVRVALKAGVDMSMADEYYSKYLPGL 331
Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
I GKV ++D A ++ +V+ +GLFN DP Y LGP D V T+ H+K
Sbjct: 332 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 386
Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
A + AR+ +VLLKN + LPL K+ ++A++GPL ++ + G ++ G+ ++L
Sbjct: 387 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 444
Query: 482 GLEAYV---SKTHYASGCH-----DV--------------PCNSDAGFHEAVRIAKKADF 519
G++ V +K +A G + D+ P + A EAV+ AK+AD
Sbjct: 445 GIQNAVGDGAKILHAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 504
Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
V+ V G E R ++ +P Q L+T++ T K P++LVL G PL + + D
Sbjct: 505 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 561
Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
Q +IL + G G A+A+++FGD+NP G+LP++ +P S ++P+ ++
Sbjct: 562 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 620
Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
+ +Y R + G +Y FG+GLSYT ++ S++T+S+ +Q+
Sbjct: 621 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 665
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
G +VT+ + VTN G +G+ V+ ++ + S P KQL GF
Sbjct: 666 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 708
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
+++ SK +SF +D E L N+
Sbjct: 709 EKITLKPGESKTVSFPID-IEALKFWNQQ 736
>gi|213852437|ref|ZP_03381969.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Typhi str. M223]
Length = 721
Score = 246 bits (627), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 210/749 (28%), Positives = 345/749 (46%), Gaps = 139/749 (18%)
Query: 75 QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
++ I+Q+ D A+ RL IP + + + +HG T FP L A+S
Sbjct: 42 RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 85
Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
FN +G A EA GL WAP +++ RDPRWGR E GED + S
Sbjct: 86 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 139
Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
VK+ QG++ D + KH AY +
Sbjct: 140 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 173
Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
G +N + ++ Q + + PP+++ ++ G + +M + N +NG PA L +
Sbjct: 174 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 228
Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
R+EWGFKG SD A+ + ++ ED+ LKAG+D++ ++
Sbjct: 229 VLRDEWGFKGITVSDHGAIKELIKH-GTAADPEDAVRVALKAGVDMSMADEYYSKYLPGL 287
Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
I GKV ++D A ++ +V+ +GLFN DP Y LGP D V T+ H+K
Sbjct: 288 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 342
Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
A + AR+ +VLLKN + LPL K+ ++A++GPL ++ + G ++ G+ ++L
Sbjct: 343 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 400
Query: 482 GLEAYV---SKTHYASGCH-----DV--------------PCNSDAGFHEAVRIAKKADF 519
G++ V +K YA G + D+ P + A EAV+ AK+AD
Sbjct: 401 GIQNAVGDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 460
Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
V+ V G E R ++ +P Q L+T++ T K P++LVL G PL + + D
Sbjct: 461 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 517
Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
Q +IL + G G A+A+++FGD+NP G+LP++ +P S ++P+ ++
Sbjct: 518 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 576
Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
+ +Y R + G +Y FG+GLSYT ++ S++T+S+ +Q+
Sbjct: 577 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 621
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
G +VT+ + VTN G +G V+ ++ + S P KQL GF
Sbjct: 622 DG---------KVTAS--------VEVTNTGKREGETVIQMYLQDVTASMSRPVKQLKGF 664
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
+++ K +SF +D E L N+
Sbjct: 665 EKITLKPGERKTVSFPID-IEALKFWNQQ 692
>gi|1483154|dbj|BAA13102.1| T-cell inhibitor(STI) [Salmonella enterica subsp. enterica serovar
Typhimurium]
Length = 765
Score = 246 bits (627), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 209/748 (27%), Positives = 343/748 (45%), Gaps = 137/748 (18%)
Query: 75 QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
++ I+Q+ D A+ RL IP + + + +HG T FP L A+S
Sbjct: 86 RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 129
Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
FN +G A EA GL WAP +++ RDPRWGR E GED + S
Sbjct: 130 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 183
Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
VK+ QG++ D + KH AY +
Sbjct: 184 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 217
Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
G +N + ++ Q + + PP+++ ++ G + +M + N +NG PA L +
Sbjct: 218 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 272
Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
R+EWGFKG SD A+ + ++ ED+ LKAG+D++ ++
Sbjct: 273 VLRDEWGFKGITVSDHGAIKELIKHGTAADP-EDAVRVALKAGVDMSMADEYYSKYLPGL 331
Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
I GKV ++D A ++ +V+ +GLFN DP Y LGP D V T+ H+K
Sbjct: 332 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 386
Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
A + AR+ +VLLKN + LPL K+ ++A++GPL ++ + G ++ G+ ++L
Sbjct: 387 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 444
Query: 482 GLEAYV---SKTHYASGCHDV-------------------PCNSDAGFHEAVRIAKKADF 519
G++ V +K YA G + P + A EAV+ AK+AD
Sbjct: 445 GIQNAVGDGAKILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 504
Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
V+ V G E R ++ +P Q L+T++ T K P++LVL G PL + + D
Sbjct: 505 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 561
Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM--NDMNMRADSS 637
Q +IL + G G A+A+++FGD+NP G+LP++ +P S ++P+ + +N +
Sbjct: 562 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 620
Query: 638 RQYPGRSYRFY---TGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
+ P + Y +Y FG+GLSYT ++ S+ T+S+ +Q+
Sbjct: 621 PEKPNKYTSLYFDKPNGPLYPFGYGLSYTTFTV------SDFTLSSP--------TMQRD 666
Query: 695 GSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFD 754
G +VT+ + VTN G +G+ V+ ++ + S P KQL GF+
Sbjct: 667 G---------KVTAS--------VKVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFE 709
Query: 755 RVHTVAKGSKEISFGVDPCEQLSIANKH 782
++ K +SF +D E L N+
Sbjct: 710 KITLKPGKRKTVSFPID-IEALKFWNQQ 736
>gi|423223731|ref|ZP_17210200.1| hypothetical protein HMPREF1062_02386 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392638106|gb|EIY31959.1| hypothetical protein HMPREF1062_02386 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 854
Score = 245 bits (626), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 156/440 (35%), Positives = 229/440 (52%), Gaps = 57/440 (12%)
Query: 61 ISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVS 120
+ R L+S LT++EKI L + I RL IP Y +E+LHG+ G
Sbjct: 35 MHERIMDLLSRLTVEEKISLLRATSPGISRLDIPKYYHGNEALHGVVRPG---------- 84
Query: 121 SVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG----------LTFWAPNIN 170
T FPQ + AA++N L + + ++ EARA +N G LTFW+P +N
Sbjct: 85 RFTVFPQAIGLAATWNPELQLQVATVISDEARARWNELDQGREQKSQFSDLLTFWSPTVN 144
Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
+ RDPRWGR ET GEDP + FVK QG+ DD R LK
Sbjct: 145 MARDPRWGRTPETYGEDPYLSGIMGTAFVKGLQGD----DD----------RYLK----- 185
Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
+ + KH A + E +R+ N I+E+ + + P F +C++ GK++
Sbjct: 186 --------IVSTPKHFAANNEE----HNRFVCNPQISEKQLREYYLPAFEACVKDGKSAS 233
Query: 291 IMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
IM +YN +N VP L L K R +WGFKGY+ SDC + + Y KT E +AA
Sbjct: 234 IMSAYNALNDVPCTLNAWLLTKVLRKDWGFKGYVVSDCGGPSLLVNAHKYVKTKEAAAAL 293
Query: 350 VLKAGMDINCGTCMLRH-TQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
+KAG+D+ CG + SA + V + DID A + ++ LGLF+ + Y
Sbjct: 294 SIKAGLDLECGDDVYDQPLLSAYRQYMVTDADIDSAAYRVLRARMELGLFDSG-EQNPYT 352
Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
K+ P + ++EH+++AL+AAR+ IVLLKN KK LPLN V S+A++G +N S G
Sbjct: 353 KISPAVIGSAEHQEVALNAARECIVLLKNQKKMLPLNAKKVKSIAVVG--INAGSSEFGD 410
Query: 469 YTGIPC-SPKSLLRGLEAYV 487
Y+G+P +P S+L+G++ V
Sbjct: 411 YSGLPVIAPISVLQGIKDRV 430
Score = 126 bits (317), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 77/291 (26%), Positives = 141/291 (48%), Gaps = 45/291 (15%)
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
+ EA + ++ + V+ V G++ + E E +DR + LP Q + + + + +++VL
Sbjct: 592 YGEAGKAVRECETVVAVLGINKSIEREGQDRYDIQLPADQQEFLQEIYKVNPN-IVVVLV 650
Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
G L +++ D I +I+ YPGE+G KA+AE++FGD+NPGGRLP+T+Y P
Sbjct: 651 AGSSLAINWM--DEHIPAIVNAWYPGESGGKAVAEVLFGDYNPGGRLPLTYYRSLDELPP 708
Query: 627 MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGS 686
+D ++ GR+Y+++ G +Y FG+GLSYT + Y L A
Sbjct: 709 FDDYDITK-------GRTYKYFKGDVLYPFGYGLSYTTFKYSNLQV-----------ADG 750
Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
++ I +V + N+G G V ++ ++P+ + P
Sbjct: 751 EEEI------------------------NVSFQLKNSGKYAGDEVAQVYVKLPERDEVMP 786
Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
K+L GF+RV + +K+++ + + + P G++ +MVG
Sbjct: 787 IKELKGFERVTLKSGENKKVTLKLRKDLLRYWDEAKDKFVCPSGDYTIMVG 837
>gi|218132023|ref|ZP_03460827.1| hypothetical protein BACEGG_03648 [Bacteroides eggerthii DSM 20697]
gi|217985783|gb|EEC52123.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
eggerthii DSM 20697]
Length = 762
Score = 245 bits (626), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 212/764 (27%), Positives = 349/764 (45%), Gaps = 122/764 (15%)
Query: 59 LSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGT 118
+S ++ S +S+ +QE + ++ + RLGIP +ESLHG+ +G
Sbjct: 81 MSYASVFGSRLSVEQMQESMFAINKYMAEHNRLGIPVL-GEAESLHGLIHDG-------- 131
Query: 119 VSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWG 178
T FPQ + +++FN + + + +A EA+A + Q +P +++ R+ RWG
Sbjct: 132 ---ATIFPQSIALSSTFNPDITHRVATVIAQEAKAT-GVDQV----LSPVLDLARELRWG 183
Query: 179 RGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELM 238
R +ET GEDP +V V +V +F E
Sbjct: 184 RVEETYGEDPYLVGRMGVAYVSAFNKEG-------------------------------- 211
Query: 239 LSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQV 298
+ KH +A+ G + S E+D + PF+ + + +M SYN
Sbjct: 212 VMTTLKHFLAHG-SPTGGLNLASVTG--CERDLRSLYLKPFQDVMREAMPYSVMNSYNSY 268
Query: 299 NGVP-ACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDI 357
VP A L R E GFKGYI+SD +V + + K D+A + AG+D+
Sbjct: 269 ESVPVAASHWILDDILRGEMGFKGYISSDWGSVEMLRSLHHTAKDKADAACQAVIAGVDV 328
Query: 358 NCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCT 417
S + G + EK+ID+ + + + + +GLF+ D K L V T
Sbjct: 329 EVDGDCYETLDSLVRSGVLPEKEIDKCVSRVLTAKFAMGLFDKD--YTKRANLS-QTVHT 385
Query: 418 SEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMG-GGYTGIPCSP 476
E +LAL AAR+ +L+KN+ LPL+ N + S+A+IGP N +Q+ G Y +
Sbjct: 386 PEAVELALVAARESAILVKNENSLLPLDANKLRSVAVIGP---NAAQVQFGDYMWTNSNE 442
Query: 477 KSL--LRGLEAYVS---KTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGL----- 526
+ L+G+EA K +YA GC ++ +GF +AV A+ +D ++ G
Sbjct: 443 YGITPLQGIEAVTQGKVKINYAKGC-EIHTQDRSGFSQAVTAARNSDVALLFVGAMSGSP 501
Query: 527 ------DLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADS 580
++ E+ D +S LPG Q +L+ +V T K P I+VL G P + + + +
Sbjct: 502 GRPWPNSVSGESFDLSDIS--LPGCQEALIRAVKATGK-PTIVVLVAGKPFAIPWVKDNC 558
Query: 581 QISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM------NDMNMRA 634
+ + W Y GE +A+AEI+FG+ NP GRL ++ +P+S +P+ +D
Sbjct: 559 EAVIVQW--YGGEQEGRAIAEILFGEVNPSGRLNVS-FPQSTGHLPVFYNYYPSDKGFYH 615
Query: 635 D-SSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
D + + PGR Y F + V+ FGHGLSYT + YK ++ S K +D +
Sbjct: 616 DHGTLEKPGRDYVFSSPDPVWAFGHGLSYTTFKYK--------SMQISNKEFTDDD---- 663
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
+C + + V N G DG VV L+ S TP K+L F
Sbjct: 664 --------------TC-----EITVEVANTGKRDGKEVVQLYVNDIVSSVVTPVKELRRF 704
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
++V A ++ + F + P ++L++ N + ++ G+ L VG
Sbjct: 705 EKVFIPAGETRTVKFNL-PIKELALWNTDMKEVVEPGDFELQVG 747
>gi|224535250|ref|ZP_03675789.1| hypothetical protein BACCELL_00111 [Bacteroides cellulosilyticus
DSM 14838]
gi|224523135|gb|EEF92240.1| hypothetical protein BACCELL_00111 [Bacteroides cellulosilyticus
DSM 14838]
Length = 786
Score = 245 bits (626), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 219/742 (29%), Positives = 348/742 (46%), Gaps = 123/742 (16%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ + T FP A++N+ L S I A
Sbjct: 137 RLGIPV-DFTNEGIRGLCHD-----------RATMFPAQCGQGATWNKELISEIAQVTAE 184
Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
EA+A+ G T ++P ++I +DPRWGR E GEDP +V
Sbjct: 185 EAKAL------GYTNIYSPILDIAQDPRWGRVVECYGEDPFLV----------------- 221
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
G +G KR++KG +E L A KH Y + G + + +
Sbjct: 222 ---GELG-----KRMIKGLQQEG-------LVATPKHFAVYSIPVGGRDAGTRTDPHVAP 266
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGD---LFQKARNEWGFKGYITS 325
++ + PFR + A +M SYN +G P + G L + R+EWGFKGY+ S
Sbjct: 267 REMRTLYIEPFRKAFCEAGALGVMSSYNDYDGEP--ITGSYHFLTEILRHEWGFKGYVVS 324
Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCM----LRHTQSAIDKGKVQEKDI 381
D +AV ++ D AA V+ AG+++ + +R + AI +GKV + I
Sbjct: 325 DSEAVEFLYSKHQVAADAVDGAAQVVNAGLNVRTNFTLPENFIRPLRQAISEGKVSMQTI 384
Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF 441
D + ++ V+ +GLF+ +P KG K V + EH+ +++ AA + IVLLKN+
Sbjct: 385 DSRVADVLRVKFGMGLFD-NPYKGD-AKHPEKVVHSKEHQAVSMRAALESIVLLKNENNI 442
Query: 442 LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGC--- 496
LPL+K+ + +A+IGP N + + Y K++ +G++ Y+ ++ YA G
Sbjct: 443 LPLSKD-LKKVAVIGPNANEVQNLICRYGPANAPIKTVYQGIKEYLPDAEVRYAKGTDII 501
Query: 497 ---------HDVPCNSD--AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQ 545
++VP + + A EAV +AK++D I+V G + E+ R +L L G+
Sbjct: 502 DKYFPESELYEVPLDQEEQAMMDEAVALAKESDVAIMVLGGNEKTVREEYSRTNLDLCGR 561
Query: 546 QMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFG 605
Q L+ +V T K PVIL+L G +++AE I I+ +PGE A+A+++FG
Sbjct: 562 QEKLLQAVYATGK-PVILLLVDGRAATINWAE--RYIPGIVHAWFPGEFMGDAVAQVLFG 618
Query: 606 DFNPGGRLPMTWYPESFTKVPMN-DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTN 664
D+NPGG+L +T +P S ++P +DS + + TGT +Y FG+GLSYT
Sbjct: 619 DYNPGGKLAVT-FPRSVGQIPFAFPFKPGSDS------KGFVRVTGT-LYPFGYGLSYTT 670
Query: 665 YSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAG 724
++Y S+L I + I Q +L SC V N G
Sbjct: 671 FAY------SDLKIENLV-------IGVQGSVKL---------SC---------KVKNTG 699
Query: 725 DVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGR 784
V G VV L+ S T K L GF+R+H K I F + P ++L + NK
Sbjct: 700 KVAGDEVVQLYLHDEMSSVTTYVKVLRGFERIHLEPGEEKVIDFVLTP-QELGLWNKDNH 758
Query: 785 RILPLGNHVLMVGELRHSLTIE 806
++ G +MVG + ++
Sbjct: 759 FVVEPGTFAVMVGSSSQDIKLQ 780
>gi|419728314|ref|ZP_14255280.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41579]
gi|419736497|ref|ZP_14263337.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41563]
gi|419741060|ref|ZP_14267772.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41573]
gi|419745173|ref|ZP_14271816.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41566]
gi|419746816|ref|ZP_14273391.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41565]
gi|421574444|ref|ZP_16020065.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Heidelberg str. CFSAN00325]
gi|421579948|ref|ZP_16025510.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Heidelberg str. CFSAN00326]
gi|421585722|ref|ZP_16031214.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Heidelberg str. CFSAN00328]
gi|381291806|gb|EIC33035.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41563]
gi|381293004|gb|EIC34177.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41573]
gi|381299851|gb|EIC40919.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41579]
gi|381306597|gb|EIC47470.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41566]
gi|381320690|gb|EIC61232.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41565]
gi|402521423|gb|EJW28761.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Heidelberg str. CFSAN00326]
gi|402524644|gb|EJW31941.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Heidelberg str. CFSAN00325]
gi|402529572|gb|EJW36805.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Heidelberg str. CFSAN00328]
Length = 765
Score = 245 bits (626), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 207/749 (27%), Positives = 345/749 (46%), Gaps = 139/749 (18%)
Query: 75 QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
++ I+Q+ D A+ RL IP + + + +HG T FP L A+S
Sbjct: 86 RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 129
Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
FN +G A EA GL WAP +++ RDPRWGR E GED + S
Sbjct: 130 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 183
Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
VK+ QG++ D + KH AY +
Sbjct: 184 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 217
Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
G +N + ++ Q + + PP+++ ++ G + +M + N +NG PA L +
Sbjct: 218 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 272
Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
R+EWGFKG SD A+ + ++ ED+ LKAG+D++ ++
Sbjct: 273 VLRDEWGFKGITVSDHGAIKELIKH-GTAADPEDAVRVALKAGVDMSMADEYYSKYLPGL 331
Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSE-------HKKL 423
I GKV ++D A ++ +V+ +GLFN DP Y LGP + ++ H+K
Sbjct: 332 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPADTNAESRLHRKE 386
Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
A + AR+ +VLLKN + LPL K+ ++A++GPL ++ + G ++ G+ ++L
Sbjct: 387 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 444
Query: 482 GLEAYV---SKTHYASGCH-----DV--------------PCNSDAGFHEAVRIAKKADF 519
G++ V +K YA G + D+ P + A EAV+ AK+AD
Sbjct: 445 GIQNAVGDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 504
Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
V+ V G E R ++ +P Q L+T++ T K P++LVL G PL + + D
Sbjct: 505 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 561
Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
Q +IL + G G A+A+++FGD+NP G+LP++ +P S ++P+ ++
Sbjct: 562 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 620
Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
+ +Y R + G +Y FG+GLSYT ++ S++T+S+ +Q+
Sbjct: 621 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 665
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
G +VT+ + VTN G +G+ V+ ++ + S P KQL GF
Sbjct: 666 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 708
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
+++ K +SF +D E L N+
Sbjct: 709 EKITLKPGERKTVSFPID-IEALKFWNQQ 736
>gi|205357644|ref|ZP_02572188.2| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar 4,[5],12:i:- str. CVM23701]
gi|374981198|ref|ZP_09722528.1| Periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Typhimurium str. TN061786]
gi|383496875|ref|YP_005397564.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Typhimurium str. 798]
gi|205330573|gb|EDZ17337.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar 4,[5],12:i:- str. CVM23701]
gi|321224818|gb|EFX49881.1| Periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Typhimurium str. TN061786]
gi|380463696|gb|AFD59099.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Typhimurium str. 798]
Length = 755
Score = 245 bits (626), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 209/749 (27%), Positives = 344/749 (45%), Gaps = 139/749 (18%)
Query: 75 QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
++ I+Q+ D A+ RL IP + + + +HG T FP L A+S
Sbjct: 76 RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 119
Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
FN +G A EA GL WAP +++ RDPRWGR E GED + S
Sbjct: 120 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 173
Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
VK+ QG++ D + KH AY +
Sbjct: 174 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 207
Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
G +N + ++ Q + + PP+++ ++ G + +M + N +NG PA L +
Sbjct: 208 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 262
Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
R+EWGFKG SD A+ + ++ ED+ LKAG+D++ ++
Sbjct: 263 VLRDEWGFKGITVSDHGAIKELIKH-GTAADPEDAVRVALKAGVDMSMADEYYSKYLPGL 321
Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
I GKV ++D A ++ +V+ +GLFN DP Y LGP D V T+ H+K
Sbjct: 322 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 376
Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
A + AR+ +VLLKN + LPL K+ ++A++GPL ++ + G ++ G+ ++L
Sbjct: 377 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 434
Query: 482 GLEAYV---SKTHYASGCHDV-------------------PCNSDAGFHEAVRIAKKADF 519
G++ V +K YA G + P + A EAV+ AK+AD
Sbjct: 435 GIQNAVGDGAKILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 494
Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
V+ V G E R ++ +P Q L+T++ T K P++LVL G PL + + D
Sbjct: 495 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 551
Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
Q +IL + G G A+A+++FGD+NP G+LP++ +P S ++P+ ++
Sbjct: 552 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 610
Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
+ +Y R + G +Y FG+GLSYT ++ S++T+S+ +Q+
Sbjct: 611 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 655
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
G +VT+ + VTN G +G+ V+ ++ + S P KQL GF
Sbjct: 656 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 698
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
+++ K +SF +D E L N+
Sbjct: 699 EKITLKPGERKTVSFPID-IEALKFWNQQ 726
>gi|16765496|ref|NP_461111.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
subsp. enterica serovar Typhimurium str. LT2]
gi|378445601|ref|YP_005233233.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Typhimurium str. D23580]
gi|378450850|ref|YP_005238209.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
subsp. enterica serovar Typhimurium str. 14028S]
gi|378700080|ref|YP_005182037.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Typhimurium str. SL1344]
gi|378989553|ref|YP_005252717.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
subsp. enterica serovar Typhimurium str. UK-1]
gi|379701394|ref|YP_005243122.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
subsp. enterica serovar Typhimurium str. ST4/74]
gi|422026439|ref|ZP_16372832.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm1]
gi|422031463|ref|ZP_16377632.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm2]
gi|427551058|ref|ZP_18928136.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm8]
gi|427567333|ref|ZP_18932851.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm9]
gi|427587543|ref|ZP_18937641.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm3]
gi|427611175|ref|ZP_18942507.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm4]
gi|427634848|ref|ZP_18947401.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm6]
gi|427656473|ref|ZP_18952166.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm10]
gi|427661625|ref|ZP_18957079.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm11]
gi|427670100|ref|ZP_18961878.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm12]
gi|20141276|sp|Q56078.2|BGLX_SALTY RecName: Full=Periplasmic beta-glucosidase; AltName:
Full=Beta-D-glucoside glucohydrolase; AltName:
Full=Cellobiase; AltName: Full=Gentiobiase; AltName:
Full=T-cell inhibitor; Flags: Precursor
gi|16420702|gb|AAL21070.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
subsp. enterica serovar Typhimurium str. LT2]
gi|261247380|emb|CBG25205.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Typhimurium str. D23580]
gi|267994228|gb|ACY89113.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
subsp. enterica serovar Typhimurium str. 14028S]
gi|301158728|emb|CBW18240.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Typhimurium str. SL1344]
gi|323130493|gb|ADX17923.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
subsp. enterica serovar Typhimurium str. ST4/74]
gi|332989100|gb|AEF08083.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
subsp. enterica serovar Typhimurium str. UK-1]
gi|414017758|gb|EKT01456.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm1]
gi|414018535|gb|EKT02182.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm8]
gi|414020221|gb|EKT03810.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm2]
gi|414032204|gb|EKT15215.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm9]
gi|414033625|gb|EKT16574.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm3]
gi|414036785|gb|EKT19597.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm4]
gi|414046773|gb|EKT29088.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm10]
gi|414047776|gb|EKT30042.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm6]
gi|414052520|gb|EKT34556.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm11]
gi|414059327|gb|EKT40912.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm12]
Length = 765
Score = 245 bits (626), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 209/749 (27%), Positives = 344/749 (45%), Gaps = 139/749 (18%)
Query: 75 QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
++ I+Q+ D A+ RL IP + + + +HG T FP L A+S
Sbjct: 86 RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 129
Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
FN +G A EA GL WAP +++ RDPRWGR E GED + S
Sbjct: 130 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 183
Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
VK+ QG++ D + KH AY +
Sbjct: 184 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 217
Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
G +N + ++ Q + + PP+++ ++ G + +M + N +NG PA L +
Sbjct: 218 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 272
Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
R+EWGFKG SD A+ + ++ ED+ LKAG+D++ ++
Sbjct: 273 VLRDEWGFKGITVSDHGAIKELIKHGTAADP-EDAVRVALKAGVDMSMADEYYSKYLPGL 331
Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
I GKV ++D A ++ +V+ +GLFN DP Y LGP D V T+ H+K
Sbjct: 332 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 386
Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
A + AR+ +VLLKN + LPL K+ ++A++GPL ++ + G ++ G+ ++L
Sbjct: 387 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 444
Query: 482 GLEAYV---SKTHYASGCHDV-------------------PCNSDAGFHEAVRIAKKADF 519
G++ V +K YA G + P + A EAV+ AK+AD
Sbjct: 445 GIQNAVGDGAKILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 504
Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
V+ V G E R ++ +P Q L+T++ T K P++LVL G PL + + D
Sbjct: 505 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 561
Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
Q +IL + G G A+A+++FGD+NP G+LP++ +P S ++P+ ++
Sbjct: 562 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 620
Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
+ +Y R + G +Y FG+GLSYT ++ S++T+S+ +Q+
Sbjct: 621 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 665
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
G +VT+ + VTN G +G+ V+ ++ + S P KQL GF
Sbjct: 666 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 708
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
+++ K +SF +D E L N+
Sbjct: 709 EKITLKPGERKTVSFPID-IEALKFWNQQ 736
>gi|375001990|ref|ZP_09726330.1| glycosyl hydrolase family 3 protein [Salmonella enterica subsp.
enterica serovar Infantis str. SARB27]
gi|353076678|gb|EHB42438.1| glycosyl hydrolase family 3 protein [Salmonella enterica subsp.
enterica serovar Infantis str. SARB27]
Length = 751
Score = 245 bits (626), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 207/749 (27%), Positives = 345/749 (46%), Gaps = 139/749 (18%)
Query: 75 QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
++ I+Q+ D A+ RL IP + + + +HG T FP L A+S
Sbjct: 72 RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 115
Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
FN +G A EA GL WAP +++ RDPRWGR E GED + S
Sbjct: 116 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 169
Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
VK+ QG++ D + KH AY +
Sbjct: 170 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 203
Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
G +N + ++ Q + + PP+++ ++ G + +M + N +NG PA L +
Sbjct: 204 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 258
Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
R+EWGFKG SD A+ + ++ ED+ LKAG+D++ ++
Sbjct: 259 VLRDEWGFKGITVSDHGAIKELIKHGTAADP-EDAVRVALKAGVDMSMADEYYSKYLPGL 317
Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSE-------HKKL 423
I GKV ++D A ++ +V+ +GLFN DP Y LGP + ++ H+K
Sbjct: 318 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPADTNAESRLHRKE 372
Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
A + AR+ +VLLKN + LPL K+ ++A++GPL ++ + G ++ G+ ++L
Sbjct: 373 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 430
Query: 482 GLEAYV---SKTHYASGCH-----DV--------------PCNSDAGFHEAVRIAKKADF 519
G++ V +K YA G + D+ P + A EAV+ AK+AD
Sbjct: 431 GIQNAVGDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 490
Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
V+ V G E R ++ +P Q L+T++ T K P++LVL G PL + + D
Sbjct: 491 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 547
Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
Q +IL + G G A+A+++FGD+NP G+LP++ +P S ++P+ ++
Sbjct: 548 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 606
Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
+ +Y R + G +Y FG+GLSYT ++ S++T+S+ +Q+
Sbjct: 607 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 651
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
G +VT+ + VTN G +G+ V+ ++ + S P KQL GF
Sbjct: 652 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 694
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
+++ K +SF +D E L N+
Sbjct: 695 EKITLKPGERKTVSFPID-IEALKFWNQQ 722
>gi|375359159|ref|YP_005111931.1| putative exported hydrolase [Bacteroides fragilis 638R]
gi|423283738|ref|ZP_17262622.1| hypothetical protein HMPREF1204_02160 [Bacteroides fragilis HMW
615]
gi|301163840|emb|CBW23395.1| putative exported hydrolase [Bacteroides fragilis 638R]
gi|404580776|gb|EKA85484.1| hypothetical protein HMPREF1204_02160 [Bacteroides fragilis HMW
615]
Length = 859
Score = 245 bits (626), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 215/759 (28%), Positives = 338/759 (44%), Gaps = 123/759 (16%)
Query: 72 LTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVS 131
LTL ++Q+ + RLGIP + +ESLHG +G T FPQ +
Sbjct: 95 LTLMNEVQKYMREKT---RLGIPVFTL-TESLHGSVHDGS-----------TIFPQAIAL 139
Query: 132 AASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVV 191
++FN L + SA+A E A Q P I++ RD RWGR +E GEDP +V
Sbjct: 140 GSTFNPILAYEMTSAIAKELTA-----QGITQSLTPVIDVCRDLRWGRVEECFGEDPYLV 194
Query: 192 SAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDL 251
S V V+ G+ + +V S KH A+
Sbjct: 195 SRMGVSQVR----------------GYLDNQV----------------SPMIKHFGAHGA 222
Query: 252 EKWG-NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF 310
+ G N + S +++ + F + +++ K +M SYN N P L
Sbjct: 223 PQGGLNLASVSCG----QRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNEPNSSSHYLM 278
Query: 311 QKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQS 369
+ R+ W F+GY+ SD A+ + + + ++A L AG+D Q
Sbjct: 279 TELLRDRWDFQGYVYSDWGAIGMLNYFHKTAQNSAEAAIQALTAGLDAEASDNSYAELQQ 338
Query: 370 AIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD-PRKGKYGKLGPDDVCTSEHKKLALDAA 428
++ G + K ID+A+ + + + +GLF P + Y K+ V H LA A
Sbjct: 339 LVENGMLDVKYIDQAVARILTAKFNMGLFEYPLPMEKNYDKV----VHAPAHVSLARKIA 394
Query: 429 RQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPK--SLLRGLEAY 486
+ IVLL+N+ LPL N + S+A+IGP + + G YT + +LL L+
Sbjct: 395 EESIVLLQNENNILPLQMNKLKSIAVIGPNADQVQ--FGDYTWSRDNKDGVTLLEALKER 452
Query: 487 VSKT---HYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG-------LDLTQET--ED 534
V +YA GC D+ + +GF EAV +AKK+D IVV G D + T E
Sbjct: 453 VGNQLTLNYAKGC-DLVTDDCSGFKEAVDVAKKSDVCIVVVGSASASLARDYSNATCGEG 511
Query: 535 RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEA 594
D L L G Q LV ++ T K PVI+VL G P +S+ + + I I+ YPGE
Sbjct: 512 FDLSDLTLTGVQEDLVEAIHATGK-PVIVVLLSGKPFAMSWIKEN--IPGIVVQWYPGEQ 568
Query: 595 GAKALAEIIFGDFNPGGRLPMTWYPES-------FTKVPMNDMNMRADSSRQYPGRSYRF 647
G ALA+++ G NP G+L + +P+S + +P + R+ S+ PG+ Y F
Sbjct: 569 GGLALADMLLGKVNPSGKLNYS-FPQSVGHLPCYYNYLPTDKGFYRSPGSKNKPGKDYVF 627
Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVT 707
+ ++ FGHGLSYT++ Y L T S+ DY
Sbjct: 628 SSPKALWAFGHGLSYTDFEY-----------------------LSATTSKEDY------- 657
Query: 708 SCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEIS 767
+C + V I++ N GD DG V ++ R S P ++L GF++V +K++
Sbjct: 658 ACEDV-IEVTIAIRNTGDYDGLEVPQVYVRDMVSSVVMPVQELKGFEKVLIKKGETKQVI 716
Query: 768 FGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
+ P +L++ NK ++++ G L +G + I+
Sbjct: 717 IKI-PVSELALYNKEMKKVVEPGAFELQIGRASDDIRIK 754
>gi|423260853|ref|ZP_17241755.1| hypothetical protein HMPREF1055_04032 [Bacteroides fragilis
CL07T00C01]
gi|423266988|ref|ZP_17245970.1| hypothetical protein HMPREF1056_03657 [Bacteroides fragilis
CL07T12C05]
gi|387774614|gb|EIK36724.1| hypothetical protein HMPREF1055_04032 [Bacteroides fragilis
CL07T00C01]
gi|392697691|gb|EIY90874.1| hypothetical protein HMPREF1056_03657 [Bacteroides fragilis
CL07T12C05]
Length = 859
Score = 245 bits (626), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 215/759 (28%), Positives = 338/759 (44%), Gaps = 123/759 (16%)
Query: 72 LTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVS 131
LTL ++Q+ + RLGIP + +ESLHG +G T FPQ +
Sbjct: 95 LTLMNEVQKYMREKT---RLGIPVFTL-TESLHGSVHDGS-----------TIFPQAIAL 139
Query: 132 AASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVV 191
++FN L + SA+A E A Q P I++ RD RWGR +E GEDP +V
Sbjct: 140 GSTFNPILAYEMTSAIAKELSA-----QGITQSLTPVIDVCRDLRWGRVEECFGEDPYLV 194
Query: 192 SAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDL 251
S V V+ G+ + +V S KH A+
Sbjct: 195 SRMGVSQVR----------------GYLDNQV----------------SPMIKHFGAHGA 222
Query: 252 EKWG-NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF 310
+ G N + S +++ + F + +++ K +M SYN N P L
Sbjct: 223 PQGGLNLASVSCG----QRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNEPNSSSHYLM 278
Query: 311 QKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQS 369
+ R+ W F+GY+ SD A+ + + + ++A L AG+D Q
Sbjct: 279 TELLRDRWDFQGYVYSDWGAIGMLNYFHKTAQNSAEAAIQALTAGLDAEASDNSYAELQQ 338
Query: 370 AIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD-PRKGKYGKLGPDDVCTSEHKKLALDAA 428
++ G + K ID+A+ + + + +GLF P + Y K+ V H LA A
Sbjct: 339 LVENGMLDVKYIDQAVARILTAKFNMGLFEYPLPMEKNYDKV----VHAPAHVSLARKIA 394
Query: 429 RQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPK--SLLRGLEAY 486
+ IVLL+N+ LPL N + S+A+IGP + + G YT + +LL L+
Sbjct: 395 EESIVLLQNENNILPLQMNKLKSIAVIGPNADQVQ--FGDYTWSRDNKDGVTLLEALKER 452
Query: 487 VSKT---HYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG-------LDLTQET--ED 534
V +YA GC D+ + +GF EAV +AKK+D IVV G D + T E
Sbjct: 453 VGNQLTLNYAKGC-DLVTDDRSGFKEAVDVAKKSDVCIVVVGSASASLARDYSNATCGEG 511
Query: 535 RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEA 594
D L L G Q LV ++ T K PVI+VL G P +S+ + + I I+ YPGE
Sbjct: 512 FDLSDLTLTGVQEDLVEAIHATGK-PVIVVLLSGKPFAMSWIKEN--IPGIVVQWYPGEQ 568
Query: 595 GAKALAEIIFGDFNPGGRLPMTWYPES-------FTKVPMNDMNMRADSSRQYPGRSYRF 647
G ALA+++ G NP G+L + +P+S + +P + R+ S+ PG+ Y F
Sbjct: 569 GGLALADMLLGKVNPSGKLNYS-FPQSVGHLPCYYNYLPTDKGFYRSPGSKNKPGKDYVF 627
Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVT 707
+ ++ FGHGLSYT++ Y L T S+ DY
Sbjct: 628 SSPKALWAFGHGLSYTDFEY-----------------------LSATISKEDY------- 657
Query: 708 SCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEIS 767
+C + V I++ N GD DG V ++ R S P ++L GF++V +K++
Sbjct: 658 ACEDV-IEVTIAIRNTGDYDGLEVPQVYVRDMVSSVVMPVQELKGFEKVLIKKGETKQVI 716
Query: 768 FGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
+ P +L++ NK ++++ G L +G + I+
Sbjct: 717 IKI-PVSELALYNKEMKKVVEPGAFELQIGRASDDIRIK 754
>gi|365121645|ref|ZP_09338561.1| hypothetical protein HMPREF1033_01907 [Tannerella sp.
6_1_58FAA_CT1]
gi|363645135|gb|EHL84409.1| hypothetical protein HMPREF1033_01907 [Tannerella sp.
6_1_58FAA_CT1]
Length = 868
Score = 245 bits (625), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 153/463 (33%), Positives = 231/463 (49%), Gaps = 60/463 (12%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
P+ N LS RA L++ +TL+EK Q+ +N I RLG+ Y+WW+E+LHGIA G
Sbjct: 25 PYKNPELSPDERALDLLNRMTLKEKFAQMHNNTGGIERLGVRPYDWWNEALHGIARAG-- 82
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARA---------MYNLGQAGLT 163
T FPQ + AA+F+ + + V+ E RA MYN G GLT
Sbjct: 83 --------KATVFPQAIGLAATFDDTAVYEMFDMVSDEGRAKYHDFQRKGMYN-GYKGLT 133
Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
FW PNINIFRDPRWGRG ET GEDP + + + VK QG+ + D
Sbjct: 134 FWTPNINIFRDPRWGRGMETYGEDPFLTTKMGLAVVKGLQGDGTQKYDKA---------- 183
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSC 282
AC KH + +W +R+S+NA I+ +D +T+ P F++
Sbjct: 184 ----------------HACAKHYAVHSGPEW---NRHSYNAENISIRDLRETYLPAFKAL 224
Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIF---EYQN 338
+ +GK +MC+YN+ G P C L ++EWGF I SDC A+A + ++
Sbjct: 225 VTEGKVKEVMCAYNRFEGEPCCSNKTLLINILKDEWGFDDVIVSDCGAIADFYTKGRHET 284
Query: 339 YTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
+ + SA V+ +G D+ CG A++KG + E I+ ++ L + LG+F
Sbjct: 285 HASAADASADAVI-SGTDLECGGSYW-ALDEALEKGLITETKINESVFRLLRARFELGMF 342
Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
+ D + + VC +HK AL+ AR+ +VLL N LPL+K ++ +A++GP
Sbjct: 343 DDDSLVS-WSSIPYSVVCCDKHKAKALEMARKSMVLLSNKNNTLPLSK-SIKKVAVMGPN 400
Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGCHDV 499
N+ + Y G P ++L G++A + + Y GC V
Sbjct: 401 ANDSVMLWANYNGTPDRSVTILEGIKAKLPEGSVIYEKGCDYV 443
Score = 113 bits (282), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 82/273 (30%), Positives = 127/273 (46%), Gaps = 50/273 (18%)
Query: 498 DVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQM 547
DV + + K AD +I V G+ + E E+ DR ++ LP Q
Sbjct: 583 DVGLSRQIDYKAVAEKVKDADAIIFVGGISSSLEGEEMGVKYPGFRNGDRTNIDLPQVQK 642
Query: 548 SLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDF 607
+++ ++ T K PVI VL G + +S+ D + +IL YPG+ G A+A+++FGD+
Sbjct: 643 NMMKALKETGK-PVIFVLCSGSTMALSWE--DKNMDAILQAWYPGQEGGTAVADVLFGDY 699
Query: 608 NPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSY 667
NP GRLP+T+Y S + NM GR+YR++ G +Y FGHGLSYT +SY
Sbjct: 700 NPAGRLPLTFYASSDDLPDFENYNMSEGQ-----GRTYRYFKGKPLYPFGHGLSYTGFSY 754
Query: 668 KFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVD 727
S + +++ S D V + +++ N G D
Sbjct: 755 ------------------SKAKLNKKSMSVNDSVFL-------------SLNLKNTGLRD 783
Query: 728 GSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVA 760
G VV ++ R + +G P K L G+ RV A
Sbjct: 784 GDEVVQVYIRNLQDPEG-PSKSLRGYKRVSVKA 815
>gi|256838673|ref|ZP_05544183.1| glycoside hydrolase, family 3 [Parabacteroides sp. D13]
gi|256739592|gb|EEU52916.1| glycoside hydrolase, family 3 [Parabacteroides sp. D13]
Length = 758
Score = 245 bits (625), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 220/778 (28%), Positives = 347/778 (44%), Gaps = 129/778 (16%)
Query: 44 CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESL 103
C+ P + + +I R S++SL+TL+EK+ Q++ + G P E+L
Sbjct: 20 CQQPAVSGWKSFSGDKNIERRVDSVLSLMTLEEKVGQMAQYSCNWDVTG-PVMTGDYETL 78
Query: 104 HGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLT 163
G N TV V ++ +S + + G V R ++ + A
Sbjct: 79 LKQGLVGSLFNVY-TVDGVRKMQEMALSESRLKIPVL--FGYDVVHGYRTIFPMPLAESC 135
Query: 164 FW-------------------------APNINIFRDPRWGRGQETPGEDPMVVSAYAVEF 198
W AP ++I RD RWGR E GEDP + S A
Sbjct: 136 SWDLERMRKSAAIAADEASASGIAWTFAPMVDISRDARWGRVMEGAGEDPYLGSLIAKAR 195
Query: 199 VKSFQGEN-WKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG-- 255
V+ FQG N W+S D + ACCKH AY + G
Sbjct: 196 VEGFQGGNDWRS-----------------------LADVNTVLACCKHFAAYGAAEAGRD 232
Query: 256 -NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC----LRGDLF 310
N S S N ++ + + PP+ + E G A+ M S+N++NGVP+ L DL
Sbjct: 233 YNTSELSQNTLM------NYYMPPYLAAKEAGVAT-FMASFNEINGVPSTGNKWLMTDLL 285
Query: 311 QKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVL--KAGMDIN-CGTCMLRHT 367
+K +WGFKG++ +D I E ++ D AG L AG+D++ G ++
Sbjct: 286 RK---DWGFKGFVVTDYTG---INEMVAHSIVRNDKEAGELAANAGIDMDMTGGIYSQYL 339
Query: 368 QSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDA 427
++ +GKV E++IDRA+ ++ ++ LGLF+ DP + + + + E + A +
Sbjct: 340 VQSVKEGKVSEENIDRAVASILEMKFLLGLFD-DPYRYLDNEREKNTIMKPEFLQEARET 398
Query: 428 ARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPK--SLLRGLEA 485
+ + IVLLKND F P++K+ ++A+IGP+V + G + G + SL GL
Sbjct: 399 SARSIVLLKNDNNFFPISKDKHITVALIGPMVKDKINQNGEWAGRGEREESISLFEGLTE 458
Query: 486 YVSKTH----YASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLL 541
+ T+ YA GC D+ + + F EA+ A++AD V+ G D E R L
Sbjct: 459 KYAGTNVKFIYAEGC-DLLTDDSSKFAEAIATARRADIVLAAMGEDFNWSGEAACRTDLK 517
Query: 542 LPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAE 601
LPG Q +L+ + +T K P+ L+L G PLD+S+ D + IL Y G +A+
Sbjct: 518 LPGAQQALLKELKKTGK-PLGLILVNGRPLDLSW--EDQHVDGILEAWYLGTMAGHGMAD 574
Query: 602 IIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRS--------YRFYTGTQV 653
+I GD+NP RL M+ +P + ++P+ + + R P + Y T +
Sbjct: 575 VISGDYNPSARLTMS-FPRTVGQLPL--YYNQKPTGRPVPPEAPDTDYKSRYMDVPNTPL 631
Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
Y FG+GLSYT ++ S+K D+N + G +
Sbjct: 632 YPFGYGLSYTTFAVN------------SMKL--DQNSFTKGG-----------------K 660
Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVD 771
V V N G VDG VV ++ R S P K+L GF++V A K++SF +D
Sbjct: 661 ITVTAEVENTGKVDGETVVQMYIRDLAGSVTRPVKELKGFEKVALKAGEKKQVSFTID 718
>gi|218130696|ref|ZP_03459500.1| hypothetical protein BACEGG_02285 [Bacteroides eggerthii DSM 20697]
gi|217987040|gb|EEC53371.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
eggerthii DSM 20697]
Length = 858
Score = 245 bits (625), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 154/450 (34%), Positives = 227/450 (50%), Gaps = 57/450 (12%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ N I R L+S LT++EKI L + I RL IP Y +E+LHG+ G
Sbjct: 29 YKNEKAPIHERIMDLLSRLTVEEKISLLRATSPGISRLDIPKYYHGNEALHGVVRPG--- 85
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG----------LT 163
T FPQ + AA++N L + + ++ EARA +N G LT
Sbjct: 86 -------RFTVFPQAIGLAATWNPVLQKQVATVISDEARARWNELDQGREQNSQFSDLLT 138
Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
FW+P +N+ RDPRWGR ET GEDP + FVK QG + R
Sbjct: 139 FWSPTVNMARDPRWGRTPETYGEDPYLSGIMGTAFVKGLQGN--------------DSRY 184
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
LK + + KH A + E +R+ N I+E+ + + P F +C+
Sbjct: 185 LK-------------IVSTPKHFAANNEE----HNRFVCNPQISEKQLREYYLPAFEACV 227
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
++GK++ IM +YN +N VP L L K R +WGFKGY+ SDC A + Y KT
Sbjct: 228 KEGKSASIMSAYNALNDVPCTLNAWLLTKVLREDWGFKGYVVSDCGGPALLVNAHKYVKT 287
Query: 343 HEDSAAGVLKAGMDINCGTCMLRH-TQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
E +A +KAG+D+ CG + +A + V + DID A + +++LGLF+
Sbjct: 288 KEAAATLSIKAGLDLECGDDVYDAPLLNAYRQYMVTDADIDSAAYRVLRARMQLGLFDSG 347
Query: 402 PRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
Y K+ P + + EH+K+ALDAAR+ IVLLKN K LPL+ + S+A++G +N
Sbjct: 348 -ENNPYTKISPKVIGSKEHQKVALDAARECIVLLKNQNKMLPLDAKKIKSIAVVG--INA 404
Query: 462 ISQMGGGYTGIPC-SPKSLLRGLEAYVSKT 490
G Y+G+P +P S+L+G++ V T
Sbjct: 405 GRSEFGDYSGLPVIAPVSILQGIKDRVGDT 434
Score = 124 bits (312), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 79/291 (27%), Positives = 144/291 (49%), Gaps = 45/291 (15%)
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
+ EA R+ ++ + V+ V G++ E E +DR + LP Q + + + + +++VL
Sbjct: 593 YGEAGRVVRECEKVVAVLGINKAIEREGQDRSDIQLPADQREFLKEIYKVNPN-IVVVLV 651
Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
G L +++ D I +I+ YPGE+G KA+AE++FGD+NPGGRLP+T+Y P
Sbjct: 652 AGSSLSINWM--DEHIPAIINAWYPGESGGKAVAEVLFGDYNPGGRLPLTYYRSLDELPP 709
Query: 627 MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGS 686
+D ++ GR+Y+++ G +Y FG+GLSYT++ Y S+L + +
Sbjct: 710 FDDYDITK-------GRTYQYFKGNVLYPFGYGLSYTSFKY------SDLQV-------T 749
Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
D N EV +V + N G G V ++ ++P+ + P
Sbjct: 750 DGN--------------QEV--------NVSFCLKNVGKYAGDEVAQIYVKLPERDKIMP 787
Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
K+L GF+R+ S++++ + + + P G++ +M+G
Sbjct: 788 IKELKGFERISLKRGESRKVTIRLKKDLLRYWDEEKECFVHPSGDYTIMIG 838
>gi|254445290|ref|ZP_05058766.1| Glycosyl hydrolase family 3 C terminal domain protein
[Verrucomicrobiae bacterium DG1235]
gi|198259598|gb|EDY83906.1| Glycosyl hydrolase family 3 C terminal domain protein
[Verrucomicrobiae bacterium DG1235]
Length = 730
Score = 245 bits (625), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 215/741 (29%), Positives = 322/741 (43%), Gaps = 121/741 (16%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
YPF + L R LI+ +TL+EK+ L IPRL + Y SE HG+A GP
Sbjct: 27 YPFQDPDLPNEERIDDLITCMTLEEKVD-LMGFVPGIPRLDV-KYTRISEGYHGVAQGGP 84
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN---LGQAGLTFWAPN 168
+ T FPQ AA+++ +L S + + A E R +Y ++GL APN
Sbjct: 85 SNWGKRNPTPTTQFPQAYGLAATWDPALISRVSANQATEVRYLYQSPKYQRSGLVVMAPN 144
Query: 169 INIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFG 228
++ RDPRWGR +E GEDP + A F G++ R LK
Sbjct: 145 ADLARDPRWGRTEEVYGEDPFLTGTLAAAFASGLAGDH--------------PRYLKA-- 188
Query: 229 EESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKA 288
++ KH +A E FS F+ E+ + + PF I G A
Sbjct: 189 -----------TSLLKHFLANSNEDDRFFSSSDFD----ERLWREYYAKPFEMAIRDGGA 233
Query: 289 SCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHED--- 345
+M +YN +NG PA + L EWG G I +D +A + N KT+ D
Sbjct: 234 RSMMAAYNAINGTPAHVHPMLRDIVMGEWGLDGTICTDGGGLAHLV---NQHKTYPDLPT 290
Query: 346 SAAGVLKAGMDINCGTCMLRHTQSAID---KGKVQEKDIDRALLNLFSVQLRLGLFNGDP 402
+ A +KAG+++ + HTQ+A+D + V E +ID + + L LGL + P
Sbjct: 291 ATAACIKAGINL----FLDNHTQAALDAVEQSLVTEAEIDDVIRGRIRLFLDLGLLD-PP 345
Query: 403 RKGKYGKLGPDDVCT----SEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
Y +G + E + R+ IVLLKN+ LPL+ + ++S+AI+GPL
Sbjct: 346 ELVPYSNIGHEPGLEPWELPETHAFVREVTRKSIVLLKNENNILPLDPSKINSVAIVGPL 405
Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYAS----GCHDVPCNSDAGFHEAVRIA 514
N + + Y+G P G+E Y + + S G + V SD A+ +A
Sbjct: 406 AN--TTLLDWYSGTPPYAIPPRDGIEGYANSGPFPSPAKFGSNWVADMSDT----ALEVA 459
Query: 515 KKADFVIVVAG------------LDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVI 562
D IVV G ++ E DR ++L Q + V + I
Sbjct: 460 ASRDVAIVVVGNHPESNAGWGVVTSPSEGKEAVDRQEIILQPDQEEFIQKVYAANPN-TI 518
Query: 563 LVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF 622
+VL P + +A ++ +I+ I + + ALA+++FGD+NPGG+ TW P+S
Sbjct: 519 VVLVSNFPYAMPWAAENA--PAIVHITHASQEQGNALADVLFGDYNPGGKTVQTW-PKSL 575
Query: 623 TKV-PMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISAS 681
++ PM D ++R GR+Y + Y FG+GLSYT + L AP +L A+
Sbjct: 576 DQLPPMMDYDIRR-------GRTYMYSQHEPQYPFGYGLSYTTFELSKLKAPKKLKADAT 628
Query: 682 LKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKV 741
+++ V N G+ DG VV L+ R P
Sbjct: 629 AT--------------------------------IKVRVANTGERDGDEVVQLYVRYPNS 656
Query: 742 SQGTPEKQLIGFDRVHTVAKG 762
P KQL GF RV TV G
Sbjct: 657 KVERPSKQLKGFQRV-TVPAG 676
>gi|354585466|ref|ZP_09004353.1| glycoside hydrolase family 3 domain protein [Paenibacillus lactis
154]
gi|353188735|gb|EHB54255.1| glycoside hydrolase family 3 domain protein [Paenibacillus lactis
154]
Length = 734
Score = 245 bits (625), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 207/697 (29%), Positives = 331/697 (47%), Gaps = 106/697 (15%)
Query: 123 TSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQE 182
T FP L A+S+ L + A E A +TF AP ++ RDPRWGR E
Sbjct: 108 TVFPVPLGLASSWEPKLAEDTSRVAAAEGAA----SGINVTF-APMADLVRDPRWGRVME 162
Query: 183 TPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSAC 242
+PGEDP + A V+ FQG++ K+ D ++AC
Sbjct: 163 SPGEDPYLNGLMAAAMVRGFQGDDLKALD--------------------------TVAAC 196
Query: 243 CKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGV 301
KH AY + G +N + ++E + + P +++ I+ G A IM S+N +GV
Sbjct: 197 VKHFAAYGAAEGGR----DYNTVDMSEAALRNYYLPAYKAGIDAG-AELIMTSFNVYDGV 251
Query: 302 PACLRGDLFQKA-RNEWGFKGYITSDCDAV-ATIFEYQNYTKTHEDSAAGVLKAGMDIN- 358
PA L +K R EWGF+G + SD ++ TIF +K ED+A L+AG+DI
Sbjct: 252 PATTNSFLLRKVLREEWGFEGVVISDYTSLWETIFHMS--SKDGEDAAKQGLEAGLDIEM 309
Query: 359 CGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDP------RKGKYGKLGP 412
T + H + +++G+V ID A+ + +++ +LGLF+ DP + K G L P
Sbjct: 310 ISTEYISHLEQLVERGEVDVALIDEAVRRILTLKFKLGLFD-DPYRYINIEREKKGHLKP 368
Query: 413 DDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGI 472
E+++LA +AA++ +VLLKND LPL K+ + S+A+IGP +N ++ GG++G+
Sbjct: 369 ------EYRQLAREAAQKSMVLLKND-GVLPLKKD-IKSVAVIGPFADN-GRILGGWSGL 419
Query: 473 --PCSPKSLLRGLEAYVS---KTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLD 527
P ++ +GL + K A+GC D N +GF A+ A ++ VI+ G +
Sbjct: 420 GKPEEAVTVKQGLINKLGNDVKVTAAAGC-DYSSNDVSGFQAALEAAASSEVVILAMGEE 478
Query: 528 LTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILW 587
E R L LPG Q LV V + K P +LVL G PL++ + + +IL
Sbjct: 479 DHMSGEAGSRAYLTLPGVQPKLVEEVLKLGK-PTVLVLFNGRPLELKWYH--EHVPAILE 535
Query: 588 IGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRF 647
+PG G A+A+++FGD NP +L M+ +P + +VP+ + + + +RF
Sbjct: 536 AWFPGTEGGNAVADLLFGDANPSAKLTMS-FPYTVGQVPVYYNCLNTGRPKGHEDNDFRF 594
Query: 648 YT------GTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYV 701
+ Y FG+GLSYT Y+Y SD ++
Sbjct: 595 LSKYLDIPNAPFYPFGYGLSYTEYTY------------------SDAALIG--------- 627
Query: 702 HIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAK 761
D +T S++ ++V N GD G +V ++ R S P +L GF ++H
Sbjct: 628 --DTLTPGGSVK--AMVTVQNTGDYAGEEIVQMYVRDLWGSTSRPLLELKGFHKIHLAPG 683
Query: 762 GSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
+ F ++P E LS + + G+ + VG+
Sbjct: 684 EKATVEFVIEP-EMLSYFTAKKQYEVEEGDFKVFVGK 719
>gi|161613146|ref|YP_001587111.1| hypothetical protein SPAB_00854 [Salmonella enterica subsp.
enterica serovar Paratyphi B str. SPB7]
gi|416529744|ref|ZP_11744511.1| hypothetical protein SEEM010_09961 [Salmonella enterica subsp.
enterica serovar Montevideo str. LQC 10]
gi|417466542|ref|ZP_12165045.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Montevideo str. S5-403]
gi|161362510|gb|ABX66278.1| hypothetical protein SPAB_00854 [Salmonella enterica subsp.
enterica serovar Paratyphi B str. SPB7]
gi|353629977|gb|EHC77663.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Montevideo str. S5-403]
gi|363552147|gb|EHL36453.1| hypothetical protein SEEM010_09961 [Salmonella enterica subsp.
enterica serovar Montevideo str. LQC 10]
Length = 755
Score = 245 bits (625), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 206/749 (27%), Positives = 343/749 (45%), Gaps = 139/749 (18%)
Query: 75 QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
++ I+Q+ D A+ RL IP + + + +HG T FP L A+S
Sbjct: 76 RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 119
Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
FN +G A EA GL WAP +++ RDPRWGR E GED + S
Sbjct: 120 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 173
Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
VK+ QG++ D + KH AY +
Sbjct: 174 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 207
Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
G +N + ++ Q + + PP+++ ++ G + +M + N +NG PA L +
Sbjct: 208 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 262
Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
R+EWGFKG SD A+ + ++ ED+ LKAG+D++ ++
Sbjct: 263 VLRDEWGFKGITVSDHGAIKELIKH-GTAADPEDAVRVALKAGVDMSMADEYYSKYLPGL 321
Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSE-------HKKL 423
I GKV ++D A ++ +V+ +GLFN DP Y LGP + ++ H+K
Sbjct: 322 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPADTNAESRLHRKE 376
Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
A + AR+ +VLLKN + LPL K+ ++A++GPL ++ + G ++ G+ ++L
Sbjct: 377 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 434
Query: 482 GLEAYV---SKTHYASGCHDV-------------------PCNSDAGFHEAVRIAKKADF 519
G++ V +K YA G + P + A EAV+ AK+AD
Sbjct: 435 GIQNAVGDGAKILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 494
Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
V+ V G E R ++ +P Q L+T++ T K P++LVL G PL + + D
Sbjct: 495 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 551
Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
Q +IL + G G A+A+++FGD+NP G+LP++ +P S ++P+ ++
Sbjct: 552 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 610
Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
+ +Y R + G +Y FG+GLSYT ++ S++T+S+ +Q+
Sbjct: 611 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 655
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
G +VT+ + VTN G +G+ V+ ++ + S P KQL GF
Sbjct: 656 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 698
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
+++ K +SF +D E L N+
Sbjct: 699 EKITLKPGERKTVSFPID-IEALKFWNQQ 726
>gi|317474225|ref|ZP_07933501.1| glycosyl hydrolase family 3 N terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
gi|316909535|gb|EFV31213.1| glycosyl hydrolase family 3 N terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
Length = 858
Score = 245 bits (625), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 154/450 (34%), Positives = 227/450 (50%), Gaps = 57/450 (12%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ N I R L+S LT++EKI L + I RL IP Y +E+LHG+ G
Sbjct: 29 YKNEKAPIHERIMDLLSRLTVEEKISLLRATSPGISRLDIPKYYHGNEALHGVVRPG--- 85
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG----------LT 163
T FPQ + AA++N L + + ++ EARA +N G LT
Sbjct: 86 -------RFTVFPQAIGLAATWNPVLQKQVATVISDEARARWNELDQGREQNSQFSDLLT 138
Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
FW+P +N+ RDPRWGR ET GEDP + FVK QG + R
Sbjct: 139 FWSPTVNMARDPRWGRTPETYGEDPYLSGIMGTAFVKGLQGN--------------DSRY 184
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
LK + + KH A + E +R+ N I+E+ + + P F +C+
Sbjct: 185 LK-------------IVSTPKHFAANNEE----HNRFVCNPQISEKQLREYYLPAFEACV 227
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
++GK++ IM +YN +N VP L L K R +WGFKGY+ SDC A + Y KT
Sbjct: 228 KEGKSASIMSAYNALNDVPCTLNAWLLTKVLREDWGFKGYVVSDCGGPALLVNAHKYVKT 287
Query: 343 HEDSAAGVLKAGMDINCGTCMLRH-TQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
E +A +KAG+D+ CG + +A + V + DID A + +++LGLF+
Sbjct: 288 KEAAATLSIKAGLDLECGDDVYDAPLLNAYRQYMVTDADIDSAAYRVLRARMQLGLFDSG 347
Query: 402 PRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
Y K+ P + + EH+K+ALDAAR+ IVLLKN K LPL+ + S+A++G +N
Sbjct: 348 -ENNPYTKISPKVIGSKEHQKVALDAARECIVLLKNQNKMLPLDAKKIKSIAVVG--INA 404
Query: 462 ISQMGGGYTGIPC-SPKSLLRGLEAYVSKT 490
G Y+G+P +P S+L+G++ V T
Sbjct: 405 GRSEFGDYSGLPVIAPVSILQGIKDRVGDT 434
Score = 130 bits (328), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 78/291 (26%), Positives = 143/291 (49%), Gaps = 45/291 (15%)
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
+ EA R+ ++ + V+ V G++ E E +DR + LP Q + + + + +++VL
Sbjct: 593 YGEAGRVVRECEKVVAVLGINKAIEREGQDRSDIQLPADQREFLKEIYKVNPN-IVVVLV 651
Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
G L +++ D I +I+ YPGE+G KA+AE++FGD+NPGGRLP+T+Y P
Sbjct: 652 AGSSLSINWM--DEHIPAIINAWYPGESGGKAVAEVLFGDYNPGGRLPLTYYRSLDELPP 709
Query: 627 MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGS 686
+D ++ GR+Y+++ G +Y FG+GLSYT++ Y S+L ++
Sbjct: 710 FDDYDITK-------GRTYQYFKGNVLYPFGYGLSYTSFKY------SDLQVTEG----- 751
Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
+ +V + N G G V ++ ++P+ + P
Sbjct: 752 ------------------------NQEVNVSFCLKNVGKYAGDEVAQIYVKLPERDKIMP 787
Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
K+L GF+R+ GS++++ + + G + P G++ +MVG
Sbjct: 788 IKELKGFERISLKRGGSRKVTIRLKKDLLRYWDEEKGCFVHPSGDYTIMVG 838
>gi|429745624|ref|ZP_19279029.1| glycosyl hydrolase family 3 protein [Capnocytophaga sp. oral taxon
380 str. F0488]
gi|429168470|gb|EKY10301.1| glycosyl hydrolase family 3 protein [Capnocytophaga sp. oral taxon
380 str. F0488]
Length = 770
Score = 245 bits (625), Expect = 9e-62, Method: Compositional matrix adjust.
Identities = 233/827 (28%), Positives = 356/827 (43%), Gaps = 164/827 (19%)
Query: 44 CKPPHF---DSYPFCNTSLS----ISTRAKSLISLLTLQEKIQQLSD------------- 83
C P D+ F S S I R S++ L+TL+EKI Q++
Sbjct: 19 CATPAILKSDNEKFTQKSFSGDKHIEQRVDSVLRLMTLEEKIGQMTQFSADWSVTGPVMA 78
Query: 84 ----------------NASAIP-------------RLGIPAYEWWSESLHGIASNGPGVN 114
NA+++ RLGIP + + +HG
Sbjct: 79 DKYQPYLEKGLVGSIFNATSVAGIRKLQKIAVEQTRLGIPIL-FGQDVIHGYK------- 130
Query: 115 FNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFR 173
T FP L + S++ +L A EA A G+ + +AP ++I R
Sbjct: 131 --------TIFPIPLAESCSWDLALMRKTAELAAREASA------DGINWTFAPMVDITR 176
Query: 174 DPRWGRGQETPGEDPMVVSAYAVEFVKSFQG-ENWKSDDGGIGFGFREKRVLKGFGEESD 232
D RWGR E GEDP + S A VK FQG +NW+
Sbjct: 177 DARWGRAMEGAGEDPYLGSLIAEARVKGFQGGDNWQM----------------------- 213
Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFN-AMITEQDTEDTFQPPFRSCIEQGKASCI 291
L AC KH Y + G +N A ++ + + PP+ + + G S I
Sbjct: 214 LSSPHTLLACGKHFAGYGAAESGK----DYNTAELSMHTLRNVYLPPYEATLNAGVGS-I 268
Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGV 350
M S N++NGVPA L + R EWGF G + SD I E + +D A
Sbjct: 269 MASLNEINGVPATADKWLLTEVLRKEWGFNGLLVSDYTG---INELVRHGVAKDDKQAAN 325
Query: 351 LKAG----MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK 406
L A MD+N G +++ + + +GKV E ID+A+ ++ ++ LGLF+ DP +
Sbjct: 326 LSANAGIEMDMN-GATFIKYLSALVKEGKVTEAQIDKAVRHILEMKFLLGLFD-DPYRYL 383
Query: 407 YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMG 466
++ T E+ K+A A +VLLKN+ + LP+ K++ ++A+IGP++NN S +
Sbjct: 384 DETRAKENTFTEEYLKVARQAVASSVVLLKNEAEVLPIKKDSGKTIAVIGPMMNNTSDIN 443
Query: 467 GGYT--GIPCSPKSLLRGL-EAYVS---KTHYASGCHDVPCNSDAGFHEAVRIAKKADFV 520
G +T G SLL GL E Y K YA GC +++ EAV IA+KAD V
Sbjct: 444 GSWTCLGDGKQSVSLLTGLTEKYKGTNVKLLYAEGCGFTTISTEQ-LKEAVAIARKADRV 502
Query: 521 IVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADS 580
+V G + E R + LP Q L+ ++ +K P+ +V G PLD+S+ +
Sbjct: 503 LVAVGEQSSWAGESAVRTDIRLPQAQRQLLEALKAINK-PIAIVTFSGRPLDLSW--ENE 559
Query: 581 QISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMN----------DM 630
+ +IL +PG G +A++I GD NP G L M+ +P S ++P+
Sbjct: 560 NVQAILQAWFPGTQGGNGIADVIAGDVNPSGHLTMS-FPRSVGQIPIYYNYKSTGRPVYT 618
Query: 631 NMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNI 690
N R + Y + T +Y FG+GLSYT ++ + + S+K +D I
Sbjct: 619 NNEEVDHRPHYNAGYLDSSITPLYPFGYGLSYTTFAISNVHLNKK-----SIKRYNDSII 673
Query: 691 LQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQL 750
V SV N G +G VV L+ R S P K+L
Sbjct: 674 -------------------------VNASVQNTGTTEGEIVVQLYTRQLVASVSRPVKEL 708
Query: 751 IGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
GF ++ A SK++ F + P E L+ + +G++ +L VG
Sbjct: 709 KGFQKISLKAGESKQVRFEL-PSEALAFYSINGKKDTEPSECLLWVG 754
>gi|146299327|ref|YP_001193918.1| glycoside hydrolase family 3 protein [Flavobacterium johnsoniae
UW101]
gi|146153745|gb|ABQ04599.1| Candidate beta-glucosidase; Glycoside hydrolase family 3
[Flavobacterium johnsoniae UW101]
Length = 743
Score = 244 bits (624), Expect = 9e-62, Method: Compositional matrix adjust.
Identities = 208/698 (29%), Positives = 326/698 (46%), Gaps = 105/698 (15%)
Query: 123 TSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFW--APNINIFRDPRWGRG 180
T+FP L AAS++ A+ + AR A W AP ++I RDPRWGR
Sbjct: 111 TTFPLPLAEAASWDLQ-------AIELAARVAATEASASGIHWTFAPMVDISRDPRWGRV 163
Query: 181 QETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLS 240
E GED + S A VK FQG + GD +
Sbjct: 164 MEGAGEDTYLGSKIAYARVKGFQG--------------------------NKLGDLNSVM 197
Query: 241 ACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVN 299
AC KH AY G +N++ ++E+ +T+ PPF++ ++ G A+ M S+N +N
Sbjct: 198 ACVKHFAAYGAGVGGR----DYNSVDMSERMLWETYLPPFKAALDAG-AATFMNSFNDIN 252
Query: 300 GVPAC----LRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGM 355
G+PA L+ D+ + +W F+G++ SD ++ + + Y+K +++A + AG
Sbjct: 253 GIPATGNAHLQRDIL---KGKWNFQGFVVSDWGSIGEMVAH-GYSKNLKEAAYSAITAGS 308
Query: 356 DINCGTCMLRHTQSAIDK-GKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDD 414
D++ + R+ + + K G+V ID A+ + + LGLF+ DP + K
Sbjct: 309 DMDMESNAYRYNLAQLVKEGRVSVDLIDDAVKRILRKKFELGLFD-DPYRYSDEKRAEKA 367
Query: 415 VCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ-MGGGYTGIP 473
+ EH+K ALD A++ IVLLKN+ + LP++K +V ++A IGP+V + MG +P
Sbjct: 368 LNNPEHRKAALDVAQKSIVLLKNENQTLPISK-SVKTIAFIGPMVKEYKENMGFWSVELP 426
Query: 474 CSPK-----SLLRGLEAYVSKTH---YASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG 525
S GL+ V K YA GC N D GF EAV AK+AD VI+ G
Sbjct: 427 EVDYNKWIVSQWDGLQNKVGKNTKLLYAKGCEIEGTNKD-GFAEAVETAKQADVVILSIG 485
Query: 526 LDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSI 585
E + R + LPG Q LV ++ T K PV++++ G PL F + ++
Sbjct: 486 ERRDMSGEAKSRSDIHLPGVQEDLVKAIQATGK-PVVVLINAGRPL--VFNWTADNVPAV 542
Query: 586 LWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM--NDMNM----RADSSRQ 639
++ + G A+A ++FGD+NP G+LPMT +P ++P+ N + + ++
Sbjct: 543 VYTWWLGTEAGNAIANVLFGDYNPSGKLPMT-FPREVGQIPIYYNHFSTGRPAKTENETN 601
Query: 640 YPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLD 699
Y +Y + + FG+GLSYT +SY S+L +S S K S++ I
Sbjct: 602 YVS-AYIDLKNSPKFPFGYGLSYTQFSY------SDLKLS-STKIKSNETI--------- 644
Query: 700 YVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTV 759
V ++N G V G V L+ + S P +L F++V
Sbjct: 645 ---------------KVSFKLSNVGKVAGEEVAQLYLKDKFGSVVRPVLELRDFEKVKLN 689
Query: 760 AKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
A SK I F +D E+LS N G+ LM+G
Sbjct: 690 AGESKTIEFTIDK-EKLSFYNDKLEWTTEPGDFELMIG 726
>gi|256819849|ref|YP_003141128.1| glycoside hydrolase [Capnocytophaga ochracea DSM 7271]
gi|256581432|gb|ACU92567.1| glycoside hydrolase family 3 domain protein [Capnocytophaga
ochracea DSM 7271]
Length = 804
Score = 244 bits (624), Expect = 9e-62, Method: Compositional matrix adjust.
Identities = 217/743 (29%), Positives = 335/743 (45%), Gaps = 113/743 (15%)
Query: 78 IQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNR 137
I++L A RLGIP + + +HG T FP L + S++
Sbjct: 136 IRKLQKIAVEQTRLGIPIL-FGQDVIHGYK---------------TIFPIPLAESCSWDL 179
Query: 138 SLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSAYAV 196
+L A EA A G+ + +AP ++I RD RWGR E GEDP + S A
Sbjct: 180 ALMRKTAELAAREASA------DGINWTFAPMVDITRDARWGRAMEGAGEDPYLGSLIAE 233
Query: 197 EFVKSFQG-ENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG 255
VK FQG +NW++ L AC KH Y + G
Sbjct: 234 ARVKGFQGGDNWQT-----------------------LSSPHTLLACGKHFAGYGAAESG 270
Query: 256 NFSRYSFN-AMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA- 313
+N A ++ + + PP+ + ++ G S IM S N++NGVPA L +
Sbjct: 271 K----DYNTAELSMHTLRNVYLPPYEATLKAGVGS-IMASLNEINGVPATADKWLLTEVL 325
Query: 314 RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG--MDINCGTCMLRHTQSAI 371
R EWGF G + SD + + + K + A AG MD+N G +++ + +
Sbjct: 326 RKEWGFNGLLVSDYTGINELVRH-GVAKDDKQVANLSANAGIEMDMN-GATFIKYLSALV 383
Query: 372 DKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQG 431
+GKV E ID+A+ ++ ++ LGLF+ DP + ++ T E+ K+A A
Sbjct: 384 KEGKVTENQIDKAVRHILEMKFLLGLFD-DPYRYLDETRAKENTFTEEYLKVARQAVASS 442
Query: 432 IVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLRGL-EAYVS 488
+VLLKN+ + LP+ KN+ ++A+IGP++NN S + G +T G SLL GL E Y
Sbjct: 443 VVLLKNEAEALPIKKNSDKTIAVIGPMMNNTSDINGSWTCLGDGKQSVSLLTGLTEKYKG 502
Query: 489 ---KTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQ 545
K YA GC +++ EAV IA+KAD V+V G + E R + LP
Sbjct: 503 TNVKLLYAEGCGFTTISTEQ-LKEAVAIARKADRVLVAVGEQSSWAGESAVRTDIRLPQA 561
Query: 546 QMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFG 605
Q L+ ++ +K P+ ++ G PLD+S+ + + +IL +PG G +A++I G
Sbjct: 562 QRQLLEALKAINK-PIAIITFSGRPLDLSW--ENENVQAILQAWFPGTQGGNGIADVIAG 618
Query: 606 DFNPGGRLPMTWYPESFTKVPM-----------NDMNMRADSSRQYPGRSYRFYTGTQVY 654
D NP G L M+ +P S ++P+ + N D Y Y + T +Y
Sbjct: 619 DVNPSGHLTMS-FPRSVGQIPIYYNYKSTGRPVHTNNEEVDHRPHYNA-GYLDSSITPLY 676
Query: 655 GFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRF 714
FG+GLSYT ++ + + S+K +D I
Sbjct: 677 PFGYGLSYTTFAISNVHLNKK-----SIKRYNDSII------------------------ 707
Query: 715 HVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCE 774
V SV N G +G VV L+ R S P K+L GF ++ A SK++ F + P E
Sbjct: 708 -VNASVQNTGRTEGEIVVQLYTRQLVASVSRPVKELKGFQKIPLKAGESKQVRFEL-PSE 765
Query: 775 QLSIANKHGRRILPLGNHVLMVG 797
L+ +G++ +L VG
Sbjct: 766 ALAFYGINGKKDTEPSECLLWVG 788
>gi|441500080|ref|ZP_20982250.1| Beta-glucosidase [Fulvivirga imtechensis AK7]
gi|441436171|gb|ELR69545.1| Beta-glucosidase [Fulvivirga imtechensis AK7]
Length = 704
Score = 244 bits (624), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 213/741 (28%), Positives = 344/741 (46%), Gaps = 117/741 (15%)
Query: 72 LTLQEKIQQLSDNASAIPRLGIPAYEWWSESLH----GIASNGPGVNFNGTVSSV----- 122
+TL+EKI Q++ S G E + E L G N V +N + +
Sbjct: 1 MTLEEKIGQMTLYTSGWDVTGPTLNENYKEDLKAGKVGALFNAHTVKYNSDLQRMAVEET 60
Query: 123 -----------------TSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTF- 164
T FP L A+S+N L A EA A +GL +
Sbjct: 61 RLGIPLLFGYDVIHGHRTIFPLPLAEASSWNLDLIKETARLSAKEAAA------SGLNWT 114
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
+ P ++I RDPRWGR E GED + S A V+ +QG+
Sbjct: 115 FNPMVDIARDPRWGRIAEGSGEDTYLGSLIAKAKVEGYQGD------------------- 155
Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIE 284
D D + AC KH AY + G R ++++ +T+ PP+++ I+
Sbjct: 156 -------DLSDPFTVLACVKHFAAYGASQAG---RDYHTVDMSDRVLRETYLPPYKAAID 205
Query: 285 QGKASCIMCSYNQVNGVPAC-LRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTH 343
G A+ +M S+N+++GVPA R + + R EW FKG++ +D ++ + +
Sbjct: 206 AG-AATVMTSFNELHGVPASGSRYLMTEILREEWRFKGFVVTDYTSINEMVPH-GVVANE 263
Query: 344 EDSAAGVLKAGMDINC-GTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDP 402
+++A L AG+D++ G H + +++GKV EK +D A+ + ++ RLGLF DP
Sbjct: 264 KEAADLALNAGVDMDMQGGVYNDHLATLVNEGKVSEKQVDEAVRRILEMKWRLGLFK-DP 322
Query: 403 RKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKND----KKFLPLNKNAVSSLAIIGPL 458
+ K + + E AL +AR+ IVLLKN+ KK LP+ N V S+A+IGPL
Sbjct: 323 YRYLDEKRELQVLFSKELMDHALVSARESIVLLKNEPYNNKKLLPI-ANDVKSIALIGPL 381
Query: 459 VNNISQMGGGY--TGIPCSPKSLLRGL-EAY-VSKTHYASGCHDVPCNSDAGFHEAVRIA 514
+N M G + +G ++L+GL EA+ +K Y G D + +GF EA + A
Sbjct: 382 GDNQIDMLGTWHASGDANKVVTVLQGLKEAFPKAKITYTKGA-DFMGSDKSGFEEATKNA 440
Query: 515 KKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
+ AD VI+ G + Q E R L LPG Q LV ++ +T K P++ ++ G PL +
Sbjct: 441 RAADLVIMAVGENHQQSGEAASRSGLDLPGVQQELVEAIYQTGK-PIVALVMAGRPLTIG 499
Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP----MNDM 630
+ D I +I+ + G KA+AE++ G +NP G+L +T +P + ++P M +
Sbjct: 500 W--MDENIPAIVNTWHLGTMAGKAIAEVLAGKYNPSGKLTIT-FPRNVGQIPIYYSMKNT 556
Query: 631 NMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNI 690
D+ +Y + Y + +Y FG+GLSYT + Y P
Sbjct: 557 GRPFDADSKYTSK-YLDVSNEPLYPFGYGLSYTTFEY---GEP----------------- 595
Query: 691 LQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQL 750
+L + I E + T +++ V N G+ +G VV L+ R S P K+L
Sbjct: 596 ------KLSKIEIKEHENLT-----IEVMVKNTGEYEGQEVVQLYVRDLVGSVTRPVKEL 644
Query: 751 IGFDRVHTVAKGSKEISFGVD 771
GF+++ SK ++F ++
Sbjct: 645 KGFEKISLKPGESKVVTFTIN 665
>gi|421883572|ref|ZP_16314801.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
subsp. enterica serovar Senftenberg str. SS209]
gi|379986798|emb|CCF87074.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
subsp. enterica serovar Senftenberg str. SS209]
Length = 765
Score = 244 bits (624), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 208/749 (27%), Positives = 344/749 (45%), Gaps = 139/749 (18%)
Query: 75 QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
++ I+Q+ D A+ RL IP + + + +HG T FP L A+S
Sbjct: 86 RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 129
Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
FN +G A EA GL WAP +++ RDPRWGR E GED + S
Sbjct: 130 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 183
Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
VK+ QG++ D + KH AY +
Sbjct: 184 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 217
Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
G +N + ++ Q + + PP+++ ++ G + +M + N +NG PA L +
Sbjct: 218 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 272
Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
R+EWGFKG SD A+ + ++ ED+ LKAG+D++ ++
Sbjct: 273 VLRDEWGFKGITVSDHGAIKELIKHGTAADP-EDAVRVALKAGVDMSMADEYYSKYLPGL 331
Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
I GKV ++D A ++ +V+ +GLFN Y LGP D V T+ H+K
Sbjct: 332 IKSGKVTMAELDDATRHVLNVKYDMGLFNA-----PYSHLGPKESDPVDTNAESRLHRKE 386
Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
A + AR+ +VLLKN + LPL K+ ++A++GPL ++ + G ++ G+ ++L
Sbjct: 387 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 444
Query: 482 GLEAYV---SKTHYASGCH-----DV--------------PCNSDAGFHEAVRIAKKADF 519
G++ V +K YA G + D+ P + A EAV+ AK+AD
Sbjct: 445 GIQNAVGDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 504
Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
V+ V G E R ++ +P Q L+T++ T K P++LVL G PL + + D
Sbjct: 505 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 561
Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
Q +IL + G G A+A+++FGD+NP G+LP++ +P S ++P+ ++
Sbjct: 562 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 620
Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
+ +Y R + G +Y FG+GLSYT ++ S++T+S+ +Q+
Sbjct: 621 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 665
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
G +VT+ + VTN G +G+ V+ ++ + S P KQL GF
Sbjct: 666 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 708
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
+++ K +SF +D E L N+
Sbjct: 709 EKITLKPGERKTVSFPID-IEALKFWNQQ 736
>gi|378954476|ref|YP_005211963.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Gallinarum/pullorum str. RKS5078]
gi|357205087|gb|AET53133.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Gallinarum/pullorum str. RKS5078]
Length = 755
Score = 244 bits (624), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 209/748 (27%), Positives = 349/748 (46%), Gaps = 137/748 (18%)
Query: 75 QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
++ I+Q+ D A+ RL IP + + + +HG T FP L A+S
Sbjct: 76 RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 119
Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
FN +G A EA GL WAP +++ RDPRWGR E GED + S
Sbjct: 120 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 173
Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
VK+ QG++ D + KH AY +
Sbjct: 174 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 207
Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
G +N + ++ Q + + PP+++ ++ G + +M + N +NG PA L +
Sbjct: 208 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 262
Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
R+EWGFKG SD A+ + ++ ED+ LKAG+D++ ++
Sbjct: 263 VLRDEWGFKGITVSDHGAIKELIKHGTAADP-EDAVRVALKAGVDMSMADEYYSKYLPGL 321
Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
I GKV ++D A ++ +V+ +GLFN DP Y LGP D V T+ H+K
Sbjct: 322 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 376
Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
A + AR+ +VLLKN + LPL K+ ++A++GPL ++ + G ++ G+ ++L
Sbjct: 377 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 434
Query: 482 GLEAYV---SKTHYASGCH-----DV--------------PCNSDAGFHEAVRIAKKADF 519
G++ V +K YA G + D+ P + A EAV+ AK+A+
Sbjct: 435 GIQNAVDDGAKILYAKGANITNDKDIVDFLNLHEEAVKIDPRSPQAMIDEAVQAAKQANV 494
Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
V+ V G E R ++ +P Q L+T++ T K P++LVL G PL + + D
Sbjct: 495 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 551
Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM--NDMNMRADSS 637
Q +IL + G G A+A+++FGD+NP G+LP++ +P S ++P+ + +N +
Sbjct: 552 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 610
Query: 638 RQYPGR-SYRFYTGTQ--VYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
+ P + + R++ +Y FG+GLSYT ++ S++T+S+ +Q+
Sbjct: 611 PEKPNKYTSRYFDEANGPLYPFGYGLSYTTFTV------SDVTLSSP--------TMQRD 656
Query: 695 GSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFD 754
G +VT+ + VTN G +G+ V+ ++ + S P KQL GF+
Sbjct: 657 G---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFE 699
Query: 755 RVHTVAKGSKEISFGVDPCEQLSIANKH 782
++ K +SF +D E L N+
Sbjct: 700 KITLKPGERKTVSFPID-IEALKFWNQQ 726
>gi|319953334|ref|YP_004164601.1| beta-glucosidase [Cellulophaga algicola DSM 14237]
gi|319421994|gb|ADV49103.1| Beta-glucosidase [Cellulophaga algicola DSM 14237]
Length = 756
Score = 244 bits (624), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 217/771 (28%), Positives = 353/771 (45%), Gaps = 137/771 (17%)
Query: 76 EKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASF 135
EKI+ D A RLGIP + + S+ +HG T+FP L ++S+
Sbjct: 83 EKIKTAQDFAVKKTRLGIPLF-FGSDIIHGYK---------------TTFPIPLGLSSSW 126
Query: 136 NRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSAY 194
+ L A+EA A G+ + ++P ++I RDPRWGR E GEDP + S
Sbjct: 127 DMELLKRTAQVAALEATA------DGINWNFSPMVDISRDPRWGRISEGAGEDPYLGSQI 180
Query: 195 AVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKW 254
A V +QGE+ + + + A KH Y +
Sbjct: 181 AKAMVTGYQGEDLMAKN--------------------------TMLATVKHFALYGAAEA 214
Query: 255 GNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC----LRGDL 309
G +N++ ++ + + PP+++ I+ G S +M S+N ++G+PA L DL
Sbjct: 215 GR----DYNSVDMSRLKMYNEYLPPYKAAIDAGVGS-VMSSFNDIDGIPASGNKWLLTDL 269
Query: 310 FQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDIN-CGTCMLRHTQ 368
R++W F G++ SD +V + + + +A LKAG+D++ G L +
Sbjct: 270 L---RDDWKFNGFVVSDYTSVNEMIAHG--LGDLQAVSALSLKAGLDMDMVGEGFLTTLK 324
Query: 369 SAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAA 428
++D+GKV ++I A + + +LGLF+ DP K K D+ E++ LA +AA
Sbjct: 325 KSLDEGKVTAEEITTACRRILEAKFKLGLFD-DPYKYIDKKRPAKDILKDENRALAREAA 383
Query: 429 RQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY--TGIPCSPKSLLRGLE-- 484
++ VLLKND K LP+NK+ S +A+IG L N+ M G + TG P S+L+G +
Sbjct: 384 KKSFVLLKNDTKNLPINKS--SKIALIGDLANSKDNMLGTWAPTGDPQLSVSILQGFKNV 441
Query: 485 ---AYVSKTHYASGCHDVPC----------------NSDAGFHEAVRIAKKADFVIVVAG 525
A ++ A+ D +++ +EAV +AKK+D ++ V G
Sbjct: 442 APNAQITHAKGANITDDAALAKKINVFGERVTIDKRSAEEMLNEAVELAKKSDIIVAVVG 501
Query: 526 LDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSI 585
E R + +P Q L+ ++A T K P++LVL G PL E + +SI
Sbjct: 502 EATEFTGESSSRTDISIPQSQKKLIRALAATGK-PLVLVLMSGRPL--VLEEELALSASI 558
Query: 586 LWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM-----NDMNMRADSSRQY 640
L + +PG A+A+++FGD+NP G+L TW P + ++P+ N + S +
Sbjct: 559 LQVWFPGVEAGNAIADVVFGDYNPSGKLTATW-PRNVGQIPIYHSIKNTGRPQLTSEFEK 617
Query: 641 PGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDY 700
+Y T + FG+GLSYT + Y S L ++AS ++ I
Sbjct: 618 FTSNYLDAPNTPLLPFGYGLSYTEFEY------SNLNVNASQINQNEPLI---------- 661
Query: 701 VHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVA 760
V +SVTN G+ DG VV L+ R S P KQL GF +V
Sbjct: 662 ---------------VTVSVTNTGNFDGEEVVQLYLRDVVRSITQPVKQLKGFKKVMLKK 706
Query: 761 KGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG-----ELRHSLTIE 806
+K+++ + P + L N + + G+ + VG L+ S T++
Sbjct: 707 GETKQVTLTLTP-DDLKFYNSNLDFVAEPGDFEIYVGTNSDATLKSSFTLK 756
>gi|416539671|ref|ZP_11750078.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
subsp. enterica serovar Montevideo str. SARB30]
gi|416551719|ref|ZP_11756625.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
subsp. enterica serovar Montevideo str. 29N]
gi|416566632|ref|ZP_11763924.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
subsp. enterica serovar Montevideo str. 4441 H]
gi|417327454|ref|ZP_12112866.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Adelaide str. A4-669]
gi|418868407|ref|ZP_13422850.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
subsp. enterica serovar Newport str. CVM 4176]
gi|353570549|gb|EHC34783.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Adelaide str. A4-669]
gi|363559751|gb|EHL43903.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
subsp. enterica serovar Montevideo str. SARB30]
gi|363566134|gb|EHL50153.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
subsp. enterica serovar Montevideo str. 29N]
gi|363579468|gb|EHL63250.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
subsp. enterica serovar Montevideo str. 4441 H]
gi|392838313|gb|EJA93877.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
subsp. enterica serovar Newport str. CVM 4176]
Length = 765
Score = 244 bits (624), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 206/749 (27%), Positives = 343/749 (45%), Gaps = 139/749 (18%)
Query: 75 QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
++ I+Q+ D A+ RL IP + + + +HG T FP L A+S
Sbjct: 86 RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 129
Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
FN +G A EA GL WAP +++ RDPRWGR E GED + S
Sbjct: 130 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 183
Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
VK+ QG++ D + KH AY +
Sbjct: 184 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 217
Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
G +N + ++ Q + + PP+++ ++ G + +M + N +NG PA L +
Sbjct: 218 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 272
Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
R+EWGFKG SD A+ + ++ ED+ LKAG+D++ ++
Sbjct: 273 VLRDEWGFKGITVSDHGAIKELIKHGTAADP-EDAVRVALKAGVDMSMADEYYSKYLPGL 331
Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSE-------HKKL 423
I GKV ++D A ++ +V+ +GLFN DP Y LGP + ++ H+K
Sbjct: 332 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPADTNAESRLHRKE 386
Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
A + AR+ +VLLKN + LPL K+ ++A++GPL ++ + G ++ G+ ++L
Sbjct: 387 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 444
Query: 482 GLEAYV---SKTHYASGCHDV-------------------PCNSDAGFHEAVRIAKKADF 519
G++ V +K YA G + P + A EAV+ AK+AD
Sbjct: 445 GIQNAVGDGAKILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 504
Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
V+ V G E R ++ +P Q L+T++ T K P++LVL G PL + + D
Sbjct: 505 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 561
Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
Q +IL + G G A+A+++FGD+NP G+LP++ +P S ++P+ ++
Sbjct: 562 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 620
Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
+ +Y R + G +Y FG+GLSYT ++ S++T+S+ +Q+
Sbjct: 621 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 665
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
G +VT+ + VTN G +G+ V+ ++ + S P KQL GF
Sbjct: 666 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 708
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
+++ K +SF +D E L N+
Sbjct: 709 EKITLKPGERKTVSFPID-IEALKFWNQQ 736
>gi|387790798|ref|YP_006255863.1| beta-glucosidase-like glycosyl hydrolase [Solitalea canadensis DSM
3403]
gi|379653631|gb|AFD06687.1| beta-glucosidase-like glycosyl hydrolase [Solitalea canadensis DSM
3403]
Length = 730
Score = 244 bits (624), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 224/774 (28%), Positives = 343/774 (44%), Gaps = 139/774 (17%)
Query: 64 RAKSLISLLTLQEKIQQLSDNAS----AIPRLGIPAYEWWSESLHGI-ASNGPGVNFNGT 118
+ + LI +TL+EK+ + N+S I RLGIP S+ HG+ +G +
Sbjct: 40 KIEQLIEKMTLEEKVGMIHGNSSFTSAGIERLGIPELVT-SDGPHGVRVEHGRDWTVDTN 98
Query: 119 VS-SVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRW 177
V + T P AA++N L G+ + EA + P +NI R P
Sbjct: 99 VDDAATYLPTGNTLAATWNTDLGYQFGAVLGSEANY-----RGKDVILGPGVNIIRSPLC 153
Query: 178 GRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDEL 237
GR E EDP ++S AV ++K Q D+G
Sbjct: 154 GRNFEYLSEDPYLISKMAVGYIKGVQ----------------------------DQG--- 182
Query: 238 MLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQ 297
+SAC KH A + E R + + ++E+ + + P F++ + G + +M SYN+
Sbjct: 183 -VSACVKHYAANNEE----VDRNTVDVQMSERALREIYLPAFKAAVVDGGVNTVMGSYNK 237
Query: 298 VNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFE-YQNYTKTHEDSAAGVLKAGM 355
G A L +K + EWGFKG + SD AV E QN T + G+L
Sbjct: 238 FRGQYATHNEYLVKKILKGEWGFKGVLMSDWGAVHNTMEAMQNGTDLEMGTDLGMLP--- 294
Query: 356 DINCGTCMLRHTQSAIDK-GKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDD 414
+ N + T A+ K GK+ E+ ID + + V + + +G K P
Sbjct: 295 NPNYNKFFMADTVLALVKSGKLSEQLIDEKVRRILWVMFKTNMIDG--------KRQPGS 346
Query: 415 VCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPC 474
T EH+K+AL A +GIVLLKN+ LPL KN + S+A+IG N + MGGG + +
Sbjct: 347 FNTKEHQKVALKVAEEGIVLLKNENGILPLQKNDLKSIAVIGENANRPNSMGGGSSQVKA 406
Query: 475 SPK-SLLRGLEAYVSKT---HYASGCHDVPC-NSDAGF-HEAVRIAKKADFVIVVAGL-- 526
+ +LL+GL+ + T YA G +DA EAV A KA+ I+V G
Sbjct: 407 KYEITLLQGLKNLLGSTVNIQYAQGYKIARGQQADAKLISEAVSAASKAEIAILVVGWTH 466
Query: 527 --------DLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEA 578
D + E D+ + +P Q L+ +V + + V+ VLTGGGP+DV+
Sbjct: 467 GYDYSVWNDNAYDAEGVDKPDMDMPFGQNELIKAVLKANPHTVV-VLTGGGPIDVTQWIG 525
Query: 579 DSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSR 638
D++ +L Y G G ALA+I+FG+ NP G+LPMT +P+ P A
Sbjct: 526 DAK--GVLEGWYAGMEGGNALAKILFGEVNPSGKLPMT-FPKKLEDSP-------AHKFG 575
Query: 639 QYPGRS------------YRFYTGTQV---YGFGHGLSYTNYSYKFLSAPSELTISASLK 683
+PG + YR++ +V + FGHGLSYT +SY+ + +
Sbjct: 576 DFPGVNNVAHYKEDIFVGYRYFDTYKVQPQFAFGHGLSYTTFSYE----------NMKVA 625
Query: 684 AGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ 743
AG DK T+ T I++ N G V G+ V L+ + K S
Sbjct: 626 AGDDK------------------TTAT-------ITIKNTGKVGGAEVAQLYVKQVKSSL 660
Query: 744 GTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
PEK+L F ++ SKEISF ++ +K + ++ G +++G
Sbjct: 661 KRPEKELKAFQKIFLKPGESKEISFELNDEAFHYFNDKENKWVVEPGKFDILIG 714
>gi|332184951|ref|ZP_08386700.1| glycosyl hydrolase family 3 N terminal domain protein [Sphingomonas
sp. S17]
gi|332014675|gb|EGI56731.1| glycosyl hydrolase family 3 N terminal domain protein [Sphingomonas
sp. S17]
Length = 886
Score = 244 bits (624), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 152/423 (35%), Positives = 212/423 (50%), Gaps = 63/423 (14%)
Query: 65 AKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTS 124
A +L+ +T EK+ QL + A AIPRLG+PAY WW+ESLHG P T+
Sbjct: 34 AATLVGKMTTDEKLYQLLNTAPAIPRLGVPAYNWWTESLHGALGPVP----------TTN 83
Query: 125 FPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ---------AGLTFWAPNINIFRDP 175
FP+ + AA+F+ +L A++ E R ++ L + GL W+PNINIFRDP
Sbjct: 84 FPEPVGLAATFDDALVGRAAQAISREVRGLHALARQTGRLGRIGTGLDSWSPNINIFRDP 143
Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
RWGRGQET GEDP + + V FV+ QG + D
Sbjct: 144 RWGRGQETYGEDPFLTARMGVAFVRGMQGTDPNRID------------------------ 179
Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
+ A KH Y + +R+ N +++ D DT+ P FR+ I +G A IMC+Y
Sbjct: 180 ---VVATPKH---YAVHSGPESTRHEANVYVSQHDLVDTYLPAFRAAIVEGGAGSIMCAY 233
Query: 296 NQVNGVPACLRGDLFQK--ARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKA 353
N+++G PAC DL K R WGF+GY+ SDCDAV I +Y + A ++A
Sbjct: 234 NRIDGQPACA-SDLLLKDYLRGRWGFRGYVVSDCDAVKDIDANHHYAPDPATAVAAAMRA 292
Query: 354 GMDINCGTCMLRHT-------QSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK 406
G+D C L T + A+ +G + DIDR+L LF+ + R G G R
Sbjct: 293 GVDNECNNATLTDTDGLTNPYRDALARGLISVADIDRSLTRLFAARYRTGDLPGV-RPLS 351
Query: 407 YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMG 466
P D+ T H+ LAL+ A + +VLLKN K LPL A +A+IGPL + +
Sbjct: 352 MQSSSPADIGTPAHQALALETAEKSLVLLKN-KGILPLKAQA--RIAVIGPLGDATRVLR 408
Query: 467 GGY 469
G Y
Sbjct: 409 GNY 411
Score = 128 bits (322), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 99/306 (32%), Positives = 150/306 (49%), Gaps = 56/306 (18%)
Query: 503 SDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTS 552
SD E AK +D ++ V GL E E+ D+ +L LP Q +L+
Sbjct: 601 SDDPAAELRTAAKDSDALVAVVGLTSDLEAEEAPISVPGFKGGDKTTLDLPADQQALLEQ 660
Query: 553 VARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGR 612
A+ + +P+I+VL G P+++++A+A++ W YPG+AG A+ ++ G NP GR
Sbjct: 661 -AKATGKPLIVVLMNGSPVNLAWAKANADAIVEAW--YPGQAGGLAVGNVLSGKTNPAGR 717
Query: 613 LPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLS 671
LP+T+Y S +P D M+ GR+YR++TGT VY FG+GLSYT+++Y
Sbjct: 718 LPLTFY-RSVEDLPAFGDYAMK--------GRTYRYFTGTPVYPFGYGLSYTSFAY---- 764
Query: 672 APSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHV 731
AP +LT + TG + D +T T +R N G G V
Sbjct: 765 APVKLTPAV-------------TGVQ------DGLTVTTEVR--------NTGRRAGDEV 797
Query: 732 VMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGN 791
L+ PKV G P L GF RVH + + F +DP LS + G R++ G+
Sbjct: 798 AQLYLNFPKV-DGAPRVALRGFQRVHLAPGERRMLRFTLDP-RDLSAVDADGHRLVMKGD 855
Query: 792 HVLMVG 797
+ + VG
Sbjct: 856 YRVSVG 861
>gi|423301451|ref|ZP_17279475.1| hypothetical protein HMPREF1057_02616 [Bacteroides finegoldii
CL09T03C10]
gi|408472052|gb|EKJ90581.1| hypothetical protein HMPREF1057_02616 [Bacteroides finegoldii
CL09T03C10]
Length = 781
Score = 244 bits (623), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 211/749 (28%), Positives = 338/749 (45%), Gaps = 132/749 (17%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP + E+ HG + G T FP + AA+++ L + +G A+
Sbjct: 129 RLGIPLF-LAEEAPHGHMAIG-----------TTVFPTGIGMAATWSPQLINEVGKAIGK 176
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
E R Q G + P +++ RDPRW R +ET GEDP++ GE K+
Sbjct: 177 EIRL-----QGGHISYGPVLDLARDPRWSRVEETFGEDPVLT------------GEIGKA 219
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
G+G G D A KH +AY + + G SF M +
Sbjct: 220 MVAGLGSG--------------DLSRPYSTLATLKHFLAYGISESGQNGNPSFAGM---R 262
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCD 328
+ + F PPF I G A +M SYN ++G P L + R++W FKG + SD
Sbjct: 263 ELHENFLPPFGQAINAG-ALSVMTSYNSMDGTPCTANHYLLTELLRDDWKFKGVVVSDLY 321
Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
++ I + T +++A L AG+DI+ G + A+++ ++ ++ +D A+ +
Sbjct: 322 SIEGIHQSHFVASTMKEAAVMALSAGVDIDLGGDAYMNLMDAVNRKEISKEILDAAVSRV 381
Query: 389 FSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
++ +GLF DP K K +V + E+ LA A+ I LLKN+ LPL+
Sbjct: 382 LRLKFEMGLFENPYVDPGKAK------KEVRSKEYVALARQVAQASITLLKNEHSLLPLD 435
Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTGIPCSP--KSLLRGLEAYV--SKTHYASGCHDVPC 501
++ +A+IGP +N M G YT K++L G+ A + S+ Y GC +
Sbjct: 436 RSM--KVALIGPNADNRYNMLGDYTAPQEEENVKTVLDGIRAKLSSSQVEYVKGC-SIRD 492
Query: 502 NSDAGFHEAVRIAKKADFVIVVAG----LDLT---QET----------------EDRDRV 538
+ +AV A++++ VI V G D +ET E DR
Sbjct: 493 TVTSDIEQAVAAARRSEVVIAVVGGSSARDFKTSYKETGAAIADEKTISDMECGEGFDRA 552
Query: 539 SLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKA 598
+L L G+Q L+ ++ T K P+I+V G PLD ++A ++ ++L YPG+ G A
Sbjct: 553 TLSLLGKQQELLKALKATGK-PLIVVYIEGRPLDKNWASENA--DALLTAYYPGQEGGNA 609
Query: 599 LAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGH 658
+A+++FG+FNP GRLP + P S +VP+ N +A S Y S + +Y FG+
Sbjct: 610 IADVLFGEFNPAGRLPFS-VPRSVGQVPVY-YNKKAPQSHDYVEVS-----ASPLYSFGY 662
Query: 659 GLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQI 718
GLSYT + Y S+L +SA T F V
Sbjct: 663 GLSYTTFEY------SDLHLSA----------------------------LTPHSFEVSC 688
Query: 719 SVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
+ N G DG VV L+ R S P KQL F R+ +++ F + E ++
Sbjct: 689 KIRNTGKYDGEEVVQLYLRDEYASVVQPLKQLKHFARLFLKCGEEQKVKFILSE-EDFAL 747
Query: 779 ANKHGRRILPLGNHVLMVGELRHSLTIET 807
+++ +R++ G +M+G + ++T
Sbjct: 748 VDRNLKRVVEPGTFQVMIGAASDDIRLQT 776
>gi|189464310|ref|ZP_03013095.1| hypothetical protein BACINT_00651 [Bacteroides intestinalis DSM
17393]
gi|189438100|gb|EDV07085.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 864
Score = 244 bits (623), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 156/440 (35%), Positives = 228/440 (51%), Gaps = 57/440 (12%)
Query: 61 ISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVS 120
+ R L+S LT++EKI L + IPRL IP Y +E+LHG+ G
Sbjct: 35 MHERIMDLLSRLTVEEKISLLRATSPGIPRLDIPKYYHGNEALHGVVRPG---------- 84
Query: 121 SVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG----------LTFWAPNIN 170
T FPQ + AA++N L + + ++ EARA +N G LTFW+P +N
Sbjct: 85 RFTVFPQAIGLAATWNPELQLQVATVISDEARARWNELDQGREQKSQFSDLLTFWSPTVN 144
Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
+ RDPRWGR ET GEDP + FVK QG+ DD R LK
Sbjct: 145 MARDPRWGRTPETYGEDPYLSGVMGTAFVKGLQGD----DD----------RYLK----- 185
Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
+ + KH A + E +R+ N I+E+ + + P F +C++ GK++
Sbjct: 186 --------IVSTPKHFAANNEEH----NRFVCNPQISEKQLREYYLPAFEACVKDGKSAS 233
Query: 291 IMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
IM +YN +N VP L L K R +WGFKGY+ SDC + + Y KT E +A
Sbjct: 234 IMSAYNALNDVPCTLNAWLLTKVLREDWGFKGYVVSDCGGPSLLVNAHKYVKTKEAAATL 293
Query: 350 VLKAGMDINCGTCMLRH-TQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
+KAG+D+ CG + SA + V DID A + +++LGLF+ K Y
Sbjct: 294 SIKAGLDLECGDDVFDEPLLSAYRQYMVTNADIDSAAYRVLRARMQLGLFDSG-EKNPYT 352
Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
K+ P V +++H+++AL+AAR+ IVLLKN KK LPLN V S+A++G +N + G
Sbjct: 353 KISPAVVGSAKHQEVALNAARECIVLLKNQKKMLPLNAKKVKSIAVVG--INAGNCEFGD 410
Query: 469 YTGIPC-SPKSLLRGLEAYV 487
Y+G P +P S+L+G++ V
Sbjct: 411 YSGSPVIAPISVLQGIKDRV 430
Score = 132 bits (333), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 80/291 (27%), Positives = 143/291 (49%), Gaps = 45/291 (15%)
Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
+ EA + ++ + V+ V G++ + E E +DR + LP QM + + + + +++VL
Sbjct: 592 YGEAGKAVRECETVVAVLGINKSIEREGQDRYDIQLPADQMEFLQEIYKVNPN-IVVVLV 650
Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
G L V++ D + +I+ YPGE+G KA+AE++FGD+NPGGRLP+T+Y P
Sbjct: 651 AGSSLAVNWM--DEHVPAIVNAWYPGESGGKAVAEVLFGDYNPGGRLPLTYYRSLDELPP 708
Query: 627 MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGS 686
+D ++ GR+Y+++ G +Y FG+GLSYT + Y L A
Sbjct: 709 FDDYDITK-------GRTYKYFKGDVLYPFGYGLSYTTFKYSNLQV-----------ADG 750
Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
++ I +V + NAG G V ++ ++P+ + P
Sbjct: 751 EEEI------------------------NVSFQLKNAGKYAGDEVAQVYVKLPERDEVMP 786
Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
K+L GF+RV + +K+++ + G+ + P G++ +MVG
Sbjct: 787 VKELKGFERVALKSGENKKMTLKLRKDLLRYWDEAKGKFVYPSGDYTIMVG 837
>gi|384914665|ref|ZP_10015417.1| Periplasmic beta-glucosidase [Methylacidiphilum fumariolicum SolV]
gi|384527282|emb|CCG91285.1| Periplasmic beta-glucosidase [Methylacidiphilum fumariolicum SolV]
Length = 717
Score = 244 bits (623), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 178/646 (27%), Positives = 314/646 (48%), Gaps = 82/646 (12%)
Query: 162 LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREK 221
LTF AP +++ RDPRWGR E+PGEDP + S + ++K FQG+N S
Sbjct: 129 LTF-APMLDVCRDPRWGRVAESPGEDPFLASLLSSAWIKGFQGKNLSSSS---------- 177
Query: 222 RVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFR 280
+L+AC KH + Y + G +N++ I+ + ++ + PPF
Sbjct: 178 ----------------LLAACPKHFVGYGAVEGGR----DYNSVTISHRLLQEIYLPPFL 217
Query: 281 SCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNY 339
++G A M ++ +NG+P L + R + G++G + SD +AV + ++
Sbjct: 218 QAFKEG-AFATMAAFVAINGLPPAANPQLLKGLLRKDIGWEGPVISDWNAVRELVQH-GI 275
Query: 340 TKTHEDSAAGVLKAGMDINCGTCML-RHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
T+ + + A + +G+DI+ + + ++ + + +GKV+E+ ID A L + +++ +LGLF
Sbjct: 276 TENEKTAVALAINSGIDIDMVSGLYSQYLEELLKEGKVKEETIDEAALRVLTLKKKLGLF 335
Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
+P K + + + +K++AL+AA++ +VLLKN+ LP+ ++ S+A+IGP
Sbjct: 336 TQEPHPLKIDQNQWQTILKT-NKQIALEAAKKSVVLLKNEDGLLPIAP-SIRSIALIGPF 393
Query: 459 VNNISQMGGGYTGI-PCSPK-SLLRGLEAYV---SKTHYASGCHDVPCNSDAGFHEAVRI 513
+ G ++ I C+ +L GL+ Y +K H GC+ + + +A++I
Sbjct: 394 SEERKEHLGPWSAIGDCNDVITLAEGLKRYAPRGTKIHVVEGCNKF-SHGTVDYQQALKI 452
Query: 514 AKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
A+ + ++ G E R L LP QQ L+ + T PV+L++ G PLD+
Sbjct: 453 AQSSQLIVAAVGEKANMSGEAASRAFLDLPTQQQKLIEVLLETGV-PVVLIVFSGRPLDL 511
Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMR 633
S S+ +IL + G +AL +I+FG +NP G+LP+T +P S ++P+ +
Sbjct: 512 S--RIISKTKAILQAWFLGTETGEALGQILFGLYNPSGKLPIT-FPRSVGQIPIYYAQLP 568
Query: 634 A--DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNIL 691
++R +P Y + T ++ FG+GLSYT + Y F +
Sbjct: 569 TGRPANRSFPSCRYIDQSNTPLFPFGYGLSYTTFHYSFPTL------------------- 609
Query: 692 QQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLI 751
SR++Y E+ + + N G V G +V L+ R P S P K+L
Sbjct: 610 --NRSRINYHETLEILT----------EIENTGTVPGEEIVQLYIRDPVASISRPLKELK 657
Query: 752 GFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
G RV ++I F + + L+ + G+R + G L VG
Sbjct: 658 GIRRVFLKPGERQKIKFSLSSVD-LAFIDYEGKRKIEPGRFFLWVG 702
>gi|150009653|ref|YP_001304396.1| beta-glucosidase [Parabacteroides distasonis ATCC 8503]
gi|149938077|gb|ABR44774.1| glycoside hydrolase family 3, candidate beta-glucosidase
[Parabacteroides distasonis ATCC 8503]
Length = 758
Score = 244 bits (623), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 193/632 (30%), Positives = 299/632 (47%), Gaps = 100/632 (15%)
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGEN-WKSDDGGIGFGFREKRV 223
+AP ++I RD RWGR E GEDP + S A V+ FQG N W+S
Sbjct: 162 FAPMVDISRDARWGRVMEGAGEDPYLGSLIAKVRVEGFQGGNDWRS-------------- 207
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWG---NFSRYSFNAMITEQDTEDTFQPPFR 280
D + ACCKH AY + G N S S N ++ + + PP+
Sbjct: 208 ---------LADVNTVLACCKHFAAYGAAEAGRDYNTSELSQNTLM------NYYMPPYL 252
Query: 281 SCIEQGKASCIMCSYNQVNGVPAC----LRGDLFQKARNEWGFKGYITSDCDAVATIFEY 336
+ E G A+ M S+N++NGVP+ L DL +K +WGF G++ +D I E
Sbjct: 253 AAKEAGVAT-FMASFNEINGVPSTGNKWLMTDLLRK---DWGFNGFVVTDYTG---INEM 305
Query: 337 QNYTKTHEDSAAGVL--KAGMDIN-CGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQL 393
++ D AG L AG+D++ G +H ++ +GKV E++IDRA+ ++ ++
Sbjct: 306 VAHSIVRNDKEAGELAANAGIDMDMTGGIYSQHLVQSVKEGKVSEENIDRAVASILEMKF 365
Query: 394 RLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
LGLF+ DP + + + + E + A + + + IVLLKND F P++K+ ++A
Sbjct: 366 LLGLFD-DPYRYLDNEREKNTIMKPEFLQEARETSARSIVLLKNDNNFFPISKDKNITVA 424
Query: 454 IIGPLVNNISQMGGGYTGIPCSPK--SLLRGLEAYVSKTH----YASGCHDVPCNSDAGF 507
+IGP+V + G + G + SL GL + T+ YA GC D+ + + F
Sbjct: 425 LIGPMVKDKINQNGEWAGRGEREESISLFEGLTEKYAGTNVKFIYAEGC-DLLTDDSSKF 483
Query: 508 HEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTG 567
EA+ A++AD V+ G D E R L LPG Q +L+ + +T K P+ L+L
Sbjct: 484 AEAIATARRADIVLAAMGEDFNWSGEAACRTDLKLPGAQQALLKELKKTGK-PLGLILVN 542
Query: 568 GGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM 627
G PLD+S+ D + IL Y G +A++I GD+NP RL M+ +P + ++P+
Sbjct: 543 GRPLDLSW--EDQHVDGILEAWYLGTMAGHGMADVISGDYNPSARLTMS-FPRTVGQLPL 599
Query: 628 NDMNMRADSSRQYPGRS--------YRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS 679
+ + R P + Y T +Y FG+GLSYT ++
Sbjct: 600 --YYNQKPTGRPVPPEAPDTDYKSRYMDVPNTPLYPFGYGLSYTTFAVN----------- 646
Query: 680 ASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVP 739
S+K D+N + G + V V N G VDG VV ++ R
Sbjct: 647 -SMKL--DQNSFTKGG-----------------KITVTAEVENTGKVDGETVVQMYIRDL 686
Query: 740 KVSQGTPEKQLIGFDRVHTVAKGSKEISFGVD 771
S P K+L GF++V A K++SF +D
Sbjct: 687 AGSVTRPVKELKGFEKVALKAGEKKQVSFTID 718
>gi|205353304|ref|YP_002227105.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Gallinarum str. 287/91]
gi|375124143|ref|ZP_09769307.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Gallinarum str. SG9]
gi|445128447|ref|ZP_21380240.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Gallinarum str. 9184]
gi|205273085|emb|CAR38038.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Gallinarum str. 287/91]
gi|326628393|gb|EGE34736.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Gallinarum str. SG9]
gi|444854909|gb|ELX79964.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Gallinarum str. 9184]
Length = 765
Score = 244 bits (622), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 209/749 (27%), Positives = 346/749 (46%), Gaps = 139/749 (18%)
Query: 75 QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
++ I+Q+ D A+ RL IP + + + +HG T FP L A+S
Sbjct: 86 RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 129
Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
FN +G A EA GL WAP +++ RDPRWGR E GED + S
Sbjct: 130 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 183
Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
VK+ QG++ D + KH AY +
Sbjct: 184 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 217
Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
G +N + ++ Q + + PP+++ ++ G + +M + N +NG PA L +
Sbjct: 218 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 272
Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
R+EWGFKG SD A+ + ++ ED+ LKAG+D++ ++
Sbjct: 273 VLRDEWGFKGITVSDHGAIKELIKHGTAADP-EDAVRVALKAGVDMSMADEYYSKYLPGL 331
Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
I GKV ++D A ++ +V+ +GLFN DP Y LGP D V T+ H+K
Sbjct: 332 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 386
Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
A + AR+ +VLLKN + LPL K+ ++A++GPL ++ + G ++ G+ ++L
Sbjct: 387 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 444
Query: 482 GLEAYV---SKTHYASGCH-----DV--------------PCNSDAGFHEAVRIAKKADF 519
G++ V +K YA G + D+ P + A EAV+ AK+A+
Sbjct: 445 GIQNAVDDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQANV 504
Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
V+ V G E R ++ +P Q L+T++ T K P++LVL G PL + + D
Sbjct: 505 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 561
Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
Q +IL + G G A+A+++FGD+NP G+LP++ +P S ++P+ ++
Sbjct: 562 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 620
Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
+ +Y R + G +Y FG+GLSYT ++ S++T+S+ +Q+
Sbjct: 621 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 665
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
G +VT+ + VTN G +G+ V+ ++ + S P KQL GF
Sbjct: 666 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 708
Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
+++ K +SF +D E L N+
Sbjct: 709 EKITLKPGERKTVSFPID-IEALKFWNQQ 736
>gi|429756169|ref|ZP_19288778.1| glycosyl hydrolase family 3 protein [Capnocytophaga sp. oral taxon
324 str. F0483]
gi|429171889|gb|EKY13478.1| glycosyl hydrolase family 3 protein [Capnocytophaga sp. oral taxon
324 str. F0483]
Length = 755
Score = 244 bits (622), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 214/749 (28%), Positives = 336/749 (44%), Gaps = 125/749 (16%)
Query: 78 IQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNR 137
I++L A RLGIP + + +HG T FP L + S++
Sbjct: 87 IRKLQKIAVEQTRLGIPIL-FGQDVIHGYK---------------TIFPIPLAESCSWDL 130
Query: 138 SLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSAYAV 196
+L A EA A G+ + +AP ++I RD RWGR E GEDP + S A
Sbjct: 131 ALMRKTAELAAREATA------DGINWTFAPMVDITRDARWGRAMEGAGEDPYLGSLIAE 184
Query: 197 EFVKSFQG-ENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG 255
VK FQG +NW++ L AC KH Y + G
Sbjct: 185 ARVKGFQGGDNWQT-----------------------LSSPHTLLACGKHFAGYGAAESG 221
Query: 256 NFSRYSFN-AMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACL-RGDLFQKA 313
+N A ++ + + PP+ + + G S IM S N++NGVPA + L ++
Sbjct: 222 K----DYNTAELSMHTLRNVYLPPYEATLNAGVGS-IMASLNEINGVPATADKWLLTEEL 276
Query: 314 RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG----MDINCGTCMLRHTQS 369
R EWGF G + SD I E + +D A L A MD+N G +++ +
Sbjct: 277 RKEWGFNGLLVSDYTG---INELVRHGVAKDDKQAANLSANAGIEMDMN-GATFIKYLSA 332
Query: 370 AIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAAR 429
+ +GK E ID+A+ ++ ++ LGLF+ DP + ++ T E+ K+A A
Sbjct: 333 LVKEGKATEAQIDKAVRHILEMKFLLGLFD-DPYRYLDETRAKENTFTEEYLKVARQAVA 391
Query: 430 QGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLRGLEAYV 487
+VLLKN+ + LP+ KN+ ++A+IGP++NN S + G +T G SLL GL
Sbjct: 392 SSVVLLKNEAEVLPIKKNSGKTIAVIGPMMNNTSDINGSWTCLGDGKQSVSLLSGLTQKY 451
Query: 488 SKTH----YASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLP 543
T+ YA GC +++ EAV IA+KAD V+V G + E R + LP
Sbjct: 452 KGTNVKLLYAEGCGFTTISTEQ-LKEAVAIARKADRVLVAVGEQSSWAGESAVRTDIRLP 510
Query: 544 GQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEII 603
Q L+ ++ +K P+ ++ G PLD+S+ + + +IL +PG G +A++I
Sbjct: 511 QAQRQLLEALKAINK-PITIITFSGRPLDLSW--ENENVQAILQAWFPGTQGGNGIADVI 567
Query: 604 FGDFNPGGRLPMTWYPESFTKVPM---------------NDMNMRADSSRQYPGRSYRFY 648
GD NP G L M+ +P S ++P+ ++++R + Y S
Sbjct: 568 AGDVNPSGHLTMS-FPRSVGQIPIYYNYKNTGRPVYTNNEEVDLRPHYNAGYLDSSI--- 623
Query: 649 TGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTS 708
T +Y FG+GLSYT ++ + + S+K +D I
Sbjct: 624 --TPLYPFGYGLSYTTFAISNVHLNKK-----SMKRYNDSII------------------ 658
Query: 709 CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
V SV N G +G V+ L+ R S P K+L GF ++ A SK++ F
Sbjct: 659 -------VNASVQNTGTTEGEIVLQLYTRQLVASVSRPVKELKGFQKISLKAGESKQVRF 711
Query: 769 GVDPCEQLSIANKHGRRILPLGNHVLMVG 797
+ P E L+ +G++ +L VG
Sbjct: 712 EL-PSEALAFYGINGKKDTEPSECLLWVG 739
>gi|445499551|ref|ZP_21466406.1| periplasmic beta-glucosidase BglX [Janthinobacterium sp. HH01]
gi|444789546|gb|ELX11094.1| periplasmic beta-glucosidase BglX [Janthinobacterium sp. HH01]
Length = 743
Score = 244 bits (622), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 196/656 (29%), Positives = 304/656 (46%), Gaps = 87/656 (13%)
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
+AP ++I RDPRWGR E GED + SA A V FQG+ + D
Sbjct: 152 FAPMVDIARDPRWGRVMEGAGEDTFLGSAIARARVHGFQGKQLGATDS------------ 199
Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
+ A KH AY G +NA+ ++ Q + + PPF++ +
Sbjct: 200 --------------VMATAKHFAAYGAAVAGR----DYNAVDMSNQQLFEVYLPPFKAAL 241
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQK---ARNEWGFKGYITSDCDAVATIFEYQNYT 340
+ G A+ M S+N +NG+PA G++F + + W FKG++ SD +V + + Y
Sbjct: 242 DAG-AATFMNSFNTLNGIPAT--GNVFLQRDILKGSWNFKGFVVSDWGSVREMVPH-GYA 297
Query: 341 KTHEDSAAGVLKAGMDINCGT-CMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
D++ + AG D++ + ++H + A+ GKV+ K +D A+ + + LGLF+
Sbjct: 298 VDLADASVKAINAGSDMDMESYAHIKHLEDAVKSGKVKMKTLDDAVYRILYKKFELGLFD 357
Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
DP + + H+ ALD A++ +VLLKN LPL++ A +A+IGPL
Sbjct: 358 -DPYRYSVAAREKAVMADPSHRAAALDVAQKSLVLLKNAAGTLPLSR-AAQKIAVIGPLA 415
Query: 460 NNISQMGGGYT--GIPCSPKSLLRGLEAYVSKTH--YASGCHDVPCNSDAGFHEAVRIAK 515
+ + GG+ G S+L G+ ++ K YA C D C GF +AV A
Sbjct: 416 DAKRDLEGGWVVEGDRAPVVSILEGIRSHAGKAEVSYAQAC-DNRCTGTEGFADAVAAAT 474
Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
+AD V++V G E + R + LPG Q L ++ T K PV++V+ G PL F
Sbjct: 475 RADVVVLVVGETWDLSGEAKSRTDITLPGHQEQLFAALKATGK-PVVVVMLAGRPL--VF 531
Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMR-- 633
Q +I++ +PG G A+A ++FGD+NP G+LP+T +P S ++P++
Sbjct: 532 NTIADQADAIVYAWFPGSEGGNAVANVLFGDYNPSGKLPIT-FPRSVGQIPLSYAQYSTG 590
Query: 634 ---ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNI 690
D +Y T Y FGHGLSYT++ Y S LTIS+ A S
Sbjct: 591 RPVTDEKNVVYKSAYIDSVNTPRYAFGHGLSYTDFKY------SGLTISSKEMAPS---- 640
Query: 691 LQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQL 750
QQ D + N G +G+ +V L+ R S P K+L
Sbjct: 641 -QQVTLSFD--------------------LANTGKAEGTEIVQLYLRDLVASVVRPVKEL 679
Query: 751 IGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
GF ++H + + +SF +D E LS N G+ LMVG + +E
Sbjct: 680 KGFQKIHLLPGEQRRVSFTID-RELLSFFNSQLVWGAEAGDFKLMVGSASDDIRLE 734
>gi|317476310|ref|ZP_07935559.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
gi|316907336|gb|EFV29041.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
Length = 862
Score = 244 bits (622), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 167/470 (35%), Positives = 239/470 (50%), Gaps = 66/470 (14%)
Query: 44 CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESL 103
C P + + TSL+ RA+ L+ LTL+EK+ + DN+ A+ RLGI Y WW+E+L
Sbjct: 17 CSQPTYKT-----TSLAPEKRAELLLKELTLEEKVALMMDNSQAVERLGIKPYNWWNEAL 71
Query: 104 HGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--- 160
HG+A G T FPQ + AASF+ + SAV+ EARA N A
Sbjct: 72 HGVARAG----------LATVFPQPIGMAASFSPESVYEVFSAVSDEARAK-NANYASQG 120
Query: 161 ------GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGI 214
GLT W P +NI+RDPRWGRG ET GEDP + S V VK QG N DG
Sbjct: 121 SYARYQGLTMWTPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGTN----DG-- 174
Query: 215 GFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTED 273
+ D+ L AC KH + +W +R+SFN ++ +D +
Sbjct: 175 ---------------KYDK-----LHACAKHFAVHSGPEW---NRHSFNVEDLSARDLYE 211
Query: 274 TFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVAT 332
T+ PPF + +++ K +MC+YN G P C L Q RN+WGF G I SDC A+A
Sbjct: 212 TYLPPFEALVKEAKVKEVMCAYNSFEGEPCCGSNRLLMQILRNDWGFDGIILSDCGAIAD 271
Query: 333 IFE---YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLF 389
+ ++ Y SAA VL G D+ CG+ + A+ G++ EKDID+A++ L
Sbjct: 272 FYNEHGHKAYPDAESASAAAVLN-GTDLECGSS-YKALVKAVRAGQINEKDIDKAVMRLL 329
Query: 390 SVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAV 449
+ LG + DP + K+ V ++EH LALD AR+ + LL+N LPL + +
Sbjct: 330 EARFALGEMD-DPDNVSWTKIPFSVVASAEHDSLALDMARKSMTLLQNTGNILPLKRGGL 388
Query: 450 SSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGC 496
++A++GP N+ G Y G+P ++L G+ + K Y GC
Sbjct: 389 -TVAVMGPNANDSVMQWGNYNGMPPHTVTILGGIRKALGADDKLIYEQGC 437
Score = 119 bits (299), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 96/318 (30%), Positives = 146/318 (45%), Gaps = 63/318 (19%)
Query: 498 DVPCNSDAGFHEAVRI------AKKADFVIVVAGLDLTQETEDR----------DRVSLL 541
D N D GF + V I K AD V+ +G+ E E+ DR +
Sbjct: 574 DAQLNFDIGFKQKVDINASVARVKDADIVVFASGVSPVLEGEEMGVNLPGFKGGDRTDIE 633
Query: 542 LPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAE 601
LP Q L++++ R K+ +ILV G P ++ +IL YPG+ G A+AE
Sbjct: 634 LPAIQRELISALHRAGKK-IILVNCSGSP--IALEPETKNCEAILQAWYPGQQGGTAVAE 690
Query: 602 IIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGL 660
++FGD+NPGGRLP+T+Y + +++P D NM GR+YR+ T ++ FG+GL
Sbjct: 691 VLFGDYNPGGRLPVTFY-RNMSQLPDFEDYNMT--------GRTYRYMTQQPLFPFGYGL 741
Query: 661 SYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISV 720
SYT + Y LS E +K G + +SV
Sbjct: 742 SYTTFDYGKLSLGKE-----QIKQGEP--------------------------LKLAVSV 770
Query: 721 TNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCE-QLSIA 779
TN+G DG VV ++ + ++G P K L F RV+ A S ++ F + E + A
Sbjct: 771 TNSGQRDGEEVVQVYLKKQDDAEG-PGKTLRAFKRVYIPAGKSVDVEFDLKDKELEWWDA 829
Query: 780 NKHGRRILPLGNHVLMVG 797
+ R+ G + +MVG
Sbjct: 830 QSNTMRVCA-GKYDIMVG 846
>gi|46127231|ref|XP_388169.1| hypothetical protein FG07993.1 [Gibberella zeae PH-1]
Length = 712
Score = 244 bits (622), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 176/534 (32%), Positives = 268/534 (50%), Gaps = 54/534 (10%)
Query: 282 CIEQGKASCIMCSYNQVNGVPAC----LRGDLFQKARNEWGFKGYITSDCDAVATIFEYQ 337
C K IMCSYN VNG+PAC L+ L +K N +ITSDC A+ I+++
Sbjct: 189 CTRDAKVGSIMCSYNAVNGIPACANSYLQETLLRKHWNWTHTNNWITSDCGAMQDIWQHH 248
Query: 338 NYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
NYTKT ++A + G D +C + + ++G + EK +DRAL LF + G
Sbjct: 249 NYTKTGAEAAKAAFENGQDSSCEYTTTKDISDSYEQGLLTEKVMDRALKRLFEGLVHTGF 308
Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
F+GD K ++ L DDV T + LAL +A +G VLLKND LPLN S+A+IG
Sbjct: 309 FDGD--KSEWSSLDFDDVNTRHAQDLALQSAVRGAVLLKNDNT-LPLNIKKKESVALIGF 365
Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYVSK-----THYASG----CHDVPCNSDAGFH 508
++ +++ GGY+G P +R AY +K T+ A G VP N
Sbjct: 366 WADDKTKLQGGYSG----PAPHVR-TPAYAAKMLGLNTNVAWGPTLQNSSVPDNWTTNAL 420
Query: 509 EAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGG 568
EA AKK+D+++ + GLD T E+RDR L P Q++L+ ++ K V++ L G
Sbjct: 421 EA---AKKSDYIVYLGGLDATAAGEERDRTDLDWPSTQLTLLKKLSNLGKPLVVVQL--G 475
Query: 569 GPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPM 627
+D + + ++SILW+ YPG+ G A+ E+I G P GRLP+T YP +T +V M
Sbjct: 476 DQVDDTPLLKNKGVNSILWVNYPGQEGGTAVMELITGRKGPAGRLPLTQYPSKYTEQVGM 535
Query: 628 NDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSD 687
+M +R ++ PGR+YR+Y+ + V FG G YT + F S E+ I LK
Sbjct: 536 LEMELRP--TKSSPGRTYRWYSDS-VLPFGFGKHYTTFKAMFKSQKIEMNIQKILKG--- 589
Query: 688 KNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPE 747
D ++D +C H +SV N G V ++F + + P
Sbjct: 590 ----------CDATYVD---TCPLPPIH--LSVKNTGRTTSDFVSLVFIQGKVGPKPYPL 634
Query: 748 KQLIGFDRVHTV---AKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
K L + R H + A E+ + +D ++ K+G ++ G + L++ E
Sbjct: 635 KTLAAYSRSHDIKPRATKDVELQWTMD---NIARREKNGDLVVYPGTYTLLLDE 685
Score = 136 bits (342), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 70/160 (43%), Positives = 97/160 (60%), Gaps = 6/160 (3%)
Query: 39 KPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEW 98
+PD C S C+T+ S + RA +L+S LT +EK+ L NA+ PR+G+P Y W
Sbjct: 28 QPD--CTKEPLRSNGICDTTASPAERAAALVSALTPREKVNNLVSNATGAPRIGLPRYNW 85
Query: 99 WSESLHGIASNGPGVNFNGT--VSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARA-MY 155
W+E+LHG+A PG ++N S TSFP L+ ++F+ L +IG + EARA
Sbjct: 86 WNEALHGVAG-APGNDYNDKPPYDSATSFPMPLLMGSTFDDDLIHDIGEVIGTEARAWNN 144
Query: 156 NLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYA 195
G+ +W PN+N F+DPRWGRG ETPGED + VS YA
Sbjct: 145 GGWGGGVDYWTPNVNPFKDPRWGRGSETPGEDALHVSRYA 184
>gi|347536214|ref|YP_004843639.1| glycoside hydrolase family protein [Flavobacterium branchiophilum
FL-15]
gi|345529372|emb|CCB69402.1| Glycoside hydrolase precursor, family 3 [Flavobacterium
branchiophilum FL-15]
Length = 740
Score = 244 bits (622), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 205/710 (28%), Positives = 338/710 (47%), Gaps = 109/710 (15%)
Query: 123 TSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQ 181
T+FP L AAS++ A EA +G+ + +AP ++I RDPRWGR
Sbjct: 111 TTFPIPLAEAASWDVEAIEKSARVAATEA------ASSGIHWTFAPMVDISRDPRWGRVM 164
Query: 182 ETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSA 241
E GED + S A VK FQ ++ GD + A
Sbjct: 165 EGAGEDTYLGSKIAFARVKGFQ---------------------------ANLGDVHSVMA 197
Query: 242 CCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNG 300
C KH AY G +N++ I+E+ +T+ PPF++ ++ G A+ M ++N +NG
Sbjct: 198 CVKHFAAYGAAVGGR----DYNSVDISERMLWETYLPPFKAALDAG-AATFMNAFNDING 252
Query: 301 VPAC----LRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMD 356
+PA ++ D+ + +W F+G++ SD ++ + + Y K ++ +A L AG D
Sbjct: 253 IPATANKHIQRDIL---KGKWQFQGFVVSDWGSIGEMVAH-GYAKDYKQAAEKALLAGSD 308
Query: 357 INC-GTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDV 415
++ + + H + + + KV ID A+ + ++ LGLF DP K + +
Sbjct: 309 MDMESSAYIGHLATLVKENKVPIALIDDAVRRILRKKMELGLFE-DPFKFCNPERQNKAL 367
Query: 416 CTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCS 475
EH K+A + A + IVLLKNDK+ LPL+K+ + ++A IGP+V + + G+ +
Sbjct: 368 NNPEHTKIAREVAAKSIVLLKNDKQVLPLSKD-LKTIAFIGPMVQS-KRDNHGFWAVDLK 425
Query: 476 PK------SLLRGLEAYV---SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGL 526
S GL+ V +K YA GC DV + +GF EA+ +A +AD V+V G
Sbjct: 426 DVDSTYIVSQWEGLQRKVGKNTKLLYAKGC-DVLSTNKSGFEEAIAVAHQADVVVVSVGE 484
Query: 527 DLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSIL 586
E + R SL LPG Q L+ + +T K P+++++ G PL ++ AD+ + +IL
Sbjct: 485 KHNMSGEAKSRSSLQLPGVQEDLIMELQKTGK-PIVVLINAGRPLIFNWT-ADN-MPTIL 541
Query: 587 WIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM--NDMNM----RADSSRQY 640
+ + G A+A+++FGD+NP +LP+T +P S +VP+ N + ++D + Y
Sbjct: 542 YTWWLGSEAGNAIADVLFGDYNPSAKLPIT-FPRSEGQVPIYYNHFSTGRPAKSDDDKIY 600
Query: 641 PGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDY 700
+Y + + FG+GLSYT + Y S+L +S +D
Sbjct: 601 KS-AYIDLQNSPKFPFGYGLSYTTFEY------SDLKLSTQKITTND------------- 640
Query: 701 VHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVA 760
R VQ ++ N G G+ +V L+ + S P +L F ++ A
Sbjct: 641 ------------RIMVQATIKNTGKYAGTEIVQLYIKDQFGSVVRPVLELKDFQKITLEA 688
Query: 761 KGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG----ELRHSLTIE 806
SK ISF +D E+LS N + + G +M+G +LR T E
Sbjct: 689 GASKTISFVIDK-EKLSFYNADLQYVAEPGTFEIMIGASAADLRLKSTFE 737
>gi|427411073|ref|ZP_18901275.1| hypothetical protein HMPREF9718_03749 [Sphingobium yanoikuyae ATCC
51230]
gi|425710258|gb|EKU73280.1| hypothetical protein HMPREF9718_03749 [Sphingobium yanoikuyae ATCC
51230]
Length = 791
Score = 243 bits (621), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 226/737 (30%), Positives = 334/737 (45%), Gaps = 126/737 (17%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP + E LHG A+ G TSFPQ + A+S++ ++ + +A
Sbjct: 138 RLGIPIL-FHEEGLHGYAAVG-----------ATSFPQSIAMASSWDPAMLRQVNQVIAR 185
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
E RA + +P ++I RDPRWGR +ET GEDP +V V V+ QG
Sbjct: 186 EIRA-----RGVPMVLSPVVDIARDPRWGRIEETYGEDPYLVGEMGVAAVEGLQG----- 235
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG-NFSRYSFNAMITE 268
+G R L+ + A KHL + + G N A ++E
Sbjct: 236 ----VG----RSRTLQ----------PNHVFATLKHLTGHGQPESGTNIG----PAPVSE 273
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPA-CLRGDLFQKARNEWGFKGYITSDC 327
++ + F PPF +++ +M SYN+++GVP+ R L R EWGF+G + SD
Sbjct: 274 RELRENFFPPFEQVVKRTGIEAVMASYNEIDGVPSHANRWLLDNVLRQEWGFRGAVVSDY 333
Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSA-IDKGKVQEKDIDRALL 386
AV + + E++A L AG+D + + T + +GKV E +D A+
Sbjct: 334 SAVDQLMSIHHIAANLEEAAMRALDAGVDADLPEGLSYATLGKLVREGKVSEAKVDLAVR 393
Query: 387 NLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTS--EHKKLALDAARQGIVLLKNDKKFLPL 444
+ ++ R GLF + Y T+ + + LA AA++ I LLKND LPL
Sbjct: 394 RMLELKFRAGLF-----ENPYADANAAAAITNNEDARALARTAAQRSITLLKNDG-MLPL 447
Query: 445 NKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCH---- 497
++A+IGP ++++GG Y G P S+L G++A V + +A G
Sbjct: 448 KPEG--TIAVIGPSAA-VARLGG-YYGQPPHSVSILEGIKARVGTKANIVFAQGVKITED 503
Query: 498 -----DVPCNSDAG-----FHEAVRIAKKADFVIVVAGLDLTQETEDR-------DRVSL 540
D SD +AV A+ D +I+ G D Q + + DR SL
Sbjct: 504 DDWWADSVTKSDPAENRKLIAQAVEAARNVDRIILTLG-DTEQSSREGWADNHLGDRPSL 562
Query: 541 LLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALA 600
L G+Q L ++ K P+ +VL G P S + Q ++IL Y GE G A+A
Sbjct: 563 DLVGEQQELFDALKALGK-PITVVLINGRP--ASTVKVSEQANAILEGWYLGEQGGNAVA 619
Query: 601 EIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGL 660
+I+FGD NPGG+LP+T P S ++P+ NM+ + R Y F T +Y FG GL
Sbjct: 620 DILFGDVNPGGKLPVT-VPRSAGQLPLF-YNMKPSAR-----RGYLFDTTDPLYPFGFGL 672
Query: 661 SYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISV 720
SYT++S LSAP RL I T + V + V
Sbjct: 673 SYTSFS---LSAP-----------------------RLSATRI-----GTGGKTSVSVDV 701
Query: 721 TNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIAN 780
N G +G VV L+ R S P K+L GF RV S+ I+F V P E L + N
Sbjct: 702 RNTGAREGDEVVQLYIRDKVSSVTRPVKELKGFQRVTLKPGESRTITFTVGP-EALQMWN 760
Query: 781 KHGRRILPLGNHVLMVG 797
RR++ G+ +M G
Sbjct: 761 DQMRRVVEPGDFEIMTG 777
>gi|260062042|ref|YP_003195122.1| beta-glucosidase [Robiginitalea biformata HTCC2501]
gi|88783604|gb|EAR14775.1| beta-glucosidase [Robiginitalea biformata HTCC2501]
Length = 763
Score = 243 bits (621), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 189/633 (29%), Positives = 301/633 (47%), Gaps = 104/633 (16%)
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
+AP ++I RDPRWGR E GEDP + S V V+ FQG+
Sbjct: 163 FAPMVDISRDPRWGRVMEGAGEDPYLGSRVGVARVRGFQGD------------------- 203
Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWG---NFSRYSFNAMITEQDTEDTFQPPFRS 281
D D L ++AC KH Y + G N + + + + + PPF++
Sbjct: 204 -------DLSDPLTIAACLKHFAGYGFAEGGRDYNTADFGLSTLY------NVVLPPFQA 250
Query: 282 CIEQGKASCIMCSYNQVNGVPAC----LRGDLFQKARNEWGFKGYITSDCDAVATIFEYQ 337
++ G A+ +M S+N +NG+PA L+ D+ + A W F+G++ SD ++ + +
Sbjct: 251 GVDAG-AATVMNSFNVLNGIPATADAFLQRDILKAA---WDFQGFVVSDWGSIGEMIPH- 305
Query: 338 NYTKTHEDSAAGVLKAGMDINCGTCM-LRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
Y + ++A AG D++ + M L + GKV E +D A+L + ++ LG
Sbjct: 306 GYARDRNEAALRAAVAGSDMDMESGMYLTELPELVRDGKVPESLVDEAVLRILGLKYDLG 365
Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIG 456
LF DP + + + + D AR+ IVLLKN+ LPL+KN S+A+IG
Sbjct: 366 LF-ADPYRYADAEREKRILSNPARLETVRDMARKSIVLLKNEGGVLPLSKNG-GSIALIG 423
Query: 457 PLVNNISQMGGGY--TGIPCSPKSLLRGLEAYVSKT-HYASGC------------HDVPC 501
PL ++ G + T P S S+L G++AY T Y G +
Sbjct: 424 PLASDKDSPLGSWRLTAEPNSAVSVLEGMQAYSGNTLAYERGVPLAEGETAFVFETKINT 483
Query: 502 NSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPV 561
+G AV +A+ ++ V++V G Q E R R +L LPG Q L+ +V + +
Sbjct: 484 TDRSGIPAAVELARSSETVVMVLGEHGFQSGEGRSRAALGLPGLQQELLEAVHAVNPN-I 542
Query: 562 ILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPES 621
+LVL G PL +++A + +IL + G A+AE+++GD+NP G+LPMT +P+S
Sbjct: 543 VLVLMNGRPLTINWAA--EHVPAILEAWHLGTESGHAIAEVLYGDYNPSGKLPMT-FPKS 599
Query: 622 FTKVPMNDMNMRADSSRQYPGRSYRFYT------GTQVYGFGHGLSYTNYSYKFLSAPSE 675
++P+ ++ A +YPG F++ +Y FGHGLSY+++ Y
Sbjct: 600 VGQIPVYYSHL-ATGRPEYPGNDLVFWSHYIDQVNEPLYPFGHGLSYSDFRY-------- 650
Query: 676 LTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLF 735
A LK LQ T E+ SL V + + NA D G+ +V L+
Sbjct: 651 ----ADLK-------LQTT----------EIRPGGSL--EVSVRLENASDTPGTEIVQLY 687
Query: 736 ARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
R S+ P ++L GF++V A GS E+SF
Sbjct: 688 VRDHFGSRARPVRELKGFEKVFLEAGGSAEVSF 720
>gi|451821117|ref|YP_007457318.1| periplasmic beta-glucosidase BglX [Clostridium
saccharoperbutylacetonicum N1-4(HMT)]
gi|451787096|gb|AGF58064.1| periplasmic beta-glucosidase BglX [Clostridium
saccharoperbutylacetonicum N1-4(HMT)]
Length = 750
Score = 243 bits (620), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 207/708 (29%), Positives = 329/708 (46%), Gaps = 106/708 (14%)
Query: 76 EKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASF 135
EK +L A RLGIP + + +HG T FP L A SF
Sbjct: 95 EKSNELQKIAVEESRLGIPIL-FGLDVIHGYR---------------TIFPIPLAEACSF 138
Query: 136 NRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSAY 194
+ A EA A AGL + +AP ++I RDPRWGR E GEDP + S
Sbjct: 139 DIEKIKESARIAAKEASA------AGLHWTFAPMVDISRDPRWGRVAEGAGEDPYLGSVI 192
Query: 195 AVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKW 254
A V+ FQGE+ + + + AC KH Y
Sbjct: 193 AKARVEGFQGESLDNPES--------------------------ILACAKHFAGYGAPDG 226
Query: 255 GNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA 313
G +N + ++ Q D + PPF++ E G M ++N +NG+P + L
Sbjct: 227 GR----DYNTVDMSLQTLHDVYLPPFKAAAEAG-VGTFMSAFNDLNGIPCTVNKYLLTDV 281
Query: 314 -RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHT-QSAI 371
R ++GF G++ SD +++ + Y + ++ ++ L AG+D++ R+ +
Sbjct: 282 LREKFGFNGFVVSDANSIPEVV-VHGYAEDNKAASKKALNAGLDMDMSQGTYRNELPELV 340
Query: 372 DKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQG 431
+G + E+ +D A+ + V+ LGLF+ +P + K +C EH + A D +R+
Sbjct: 341 KEGDILEEVLDEAVRRVLRVKFLLGLFD-NPYRTDAKKEEKTLLC-KEHLEAARDISRRS 398
Query: 432 IVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG--GYTGIPCSPKSLLRGLEAYVS- 488
IVLLKN+ LPL K+ + +A++GPL N ++M G +TG P +++ G++A VS
Sbjct: 399 IVLLKNENNALPLKKD-LKKIAVVGPLAENAAEMLGTWSHTGNPSDVVTIISGIKAAVST 457
Query: 489 --KTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQ 546
+ YA GC + F AVR+AK++D +I V G + E R+ + LPG+Q
Sbjct: 458 ETEILYAEGCK-ITGEECIDFEGAVRVAKESDVIIAVVGENSDMSGEAASRIDINLPGKQ 516
Query: 547 MSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD 606
L+ + + K P+I+VL G PL + + EA++ + +++ G A+A+++FGD
Sbjct: 517 EELLKELRKIGK-PLIVVLINGRPLTIPW-EAEN-VDALVEAWQLGTQSGNAIADVLFGD 573
Query: 607 FNPGGRLPMTWYPESFTKVPM---NDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYT 663
+NP G+L T +P S +VP+ N M R ++ + Y +Y FG GLSYT
Sbjct: 574 YNPSGKLVAT-FPYSVGQVPIYYNNPMTGRPAGKIKFTSK-YIDGPAEPLYPFGFGLSYT 631
Query: 664 NYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNA 723
+ Y+ LS +SA K G D V V++ VTN
Sbjct: 632 TFKYENLS-----ILSAENKIG------------------DTVA--------VKVYVTNT 660
Query: 724 GDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVD 771
G+V G VV L+ S+ P K+L F++V K K I F ++
Sbjct: 661 GEVSGEEVVQLYVSDVVASRVRPVKELKSFEKVLLQPKECKTIIFKLN 708
>gi|390945417|ref|YP_006409177.1| beta-glucosidase-like glycosyl hydrolase [Alistipes finegoldii DSM
17242]
gi|390421986|gb|AFL76492.1| beta-glucosidase-like glycosyl hydrolase [Alistipes finegoldii DSM
17242]
Length = 771
Score = 243 bits (620), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 206/744 (27%), Positives = 323/744 (43%), Gaps = 123/744 (16%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP + E+ HG + G T+FP A+++N L +G +A
Sbjct: 120 RLGIPLF-LAEEAPHGHMAIG-----------ATTFPTAPGQASTWNPELIERMGKVIAA 167
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
E R Q G + P ++I RDPRW R +E+ GED + + +V+
Sbjct: 168 EIRL-----QGGHICYGPVLDIVRDPRWSRTEESYGEDCYLTARIGEAYVR--------- 213
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
G G G D + KH IAY + G + + ++ E+
Sbjct: 214 ---GTGSG--------------DLSQSRHALSTLKHFIAYGASEGG---QNGGSNLLGER 253
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACL-RGDLFQKARNEWGFKGYITSDCD 328
+ +T+ PPF + ++ G A +M +YN V+G+P R L R EWGF G++ SD
Sbjct: 254 ELRETYLPPFEAAVKAG-ARSVMTAYNSVDGIPCTANRRMLTDILRGEWGFDGFVVSDLL 312
Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
++ + E + ++A L+AG+D + + A + G V E +IDRA+ +
Sbjct: 313 SIEGLHETHGVAGSVREAAVQALRAGVDADLKGGAFASLREAAEAGDVAEAEIDRAVERV 372
Query: 389 FSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNA 448
+++ +GLF +P + +V + H +LAL+AARQ + LL+N LPL+
Sbjct: 373 LALKFEMGLFE-NP---YIDEAAAAEVGCAAHSELALEAARQSVTLLENRSGTLPLDPRR 428
Query: 449 VSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGCHDVPCNSDAG 506
+ +A+IGP +NI G YT + ++ GLE + + Y+ GC V +
Sbjct: 429 LRRVAVIGPNADNIYNQLGDYTAQQTAANTVRDGLEKLLGRDRVVYSRGC-TVRGGDRSE 487
Query: 507 FHEAVRIAKKADFVIVVAG----LDLTQE-------------------TEDRDRVSLLLP 543
AV A+ D +VV G D E E DR +L L
Sbjct: 488 IAAAVSAARGTDAAVVVIGGSSARDFDTEFLQTGAAKAAHDEVRDMECGEGFDRATLALL 547
Query: 544 GQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEII 603
G+Q L+ + T P+I+V G PLD+ A Q ++L YPG G A+AE I
Sbjct: 548 GEQEELLRRIKATGT-PLIVVCIAGRPLDLR--RASEQADALLMAWYPGARGGDAVAETI 604
Query: 604 FGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYT 663
G NP GRLP+T P + ++P+ R + Y T +Y FG+GLSY+
Sbjct: 605 LGHNNPAGRLPIT-IPRAEGQIPVYYNKKRPAN------HDYTDLTAAPLYPFGYGLSYS 657
Query: 664 NYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNA 723
+ Y L A S N+L+ SC + N
Sbjct: 658 TFEYGSLEARQ-----------SGDNVLE--------------VSC---------RIRNT 683
Query: 724 GDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHG 783
D +G VV L+ S P +QL GF R+ +++SF + E LS+ + G
Sbjct: 684 SDREGDEVVQLYISDMVASTVRPPRQLGGFRRIRLAPGEQRQVSFTLGD-EALSLIDPQG 742
Query: 784 RRILPLGNHVLMVGELRHSLTIET 807
RR++ G+ V+ VG + ++T
Sbjct: 743 RRVVEKGDFVIAVGSSSQDIRLQT 766
>gi|371776218|ref|ZP_09482540.1| beta-glucosidase [Anaerophaga sp. HS1]
Length = 774
Score = 243 bits (620), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 198/685 (28%), Positives = 324/685 (47%), Gaps = 98/685 (14%)
Query: 123 TSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQ 181
T+FP L A S++ L A EA A +G+ + +AP ++I RDPRWGR
Sbjct: 128 TTFPIPLAEACSWDLQLMEKSARIAAEEATA------SGVAWNFAPMVDISRDPRWGRIM 181
Query: 182 ETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSA 241
E GEDP + S A V+ FQG + K F + + + A
Sbjct: 182 EGAGEDPFLGSLIARARVRGFQGID----------------SYKDFSKPN------TMMA 219
Query: 242 CCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGV 301
C KH + Y + G R I+E+ +T+ PPF++ +++G AS M ++N++NGV
Sbjct: 220 CAKHFVGYGAAQAG---RDYHTVDISERTLFETYLPPFKAAVDEGVAS-FMTAFNELNGV 275
Query: 302 PACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDIN-C 359
P +FQ R++W F G + +D A+ + + + K + ++ + AG+D++
Sbjct: 276 PCTGNKYIFQDILRHQWNFNGMVVTDYTAIQEMVAH-GFAKDLKQASKLAIDAGIDMDMI 334
Query: 360 GTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSE 419
+ + + +++G+V EK ID A+ + ++ LGLF+ DP K + + + +
Sbjct: 335 SEGFVTYLKELVEEGQVSEKQIDVAVARILEMKFLLGLFD-DPYKYCDAEREKEVLMNPQ 393
Query: 420 HKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPK 477
H + A + A++ IVLLKN+ LPL K+ +A+IGP V + G + G
Sbjct: 394 HLQAAREVAQRSIVLLKNENNVLPLRKDIPKRVALIGPFVKERESLNGEWAIKGDRSKSV 453
Query: 478 SLLRGLEAYVSKT----HYASGCHDVPCNSDA-------------GFHEAVRIAKKADFV 520
+L GL+ + T +YA G +P A GF EA+R+AK +D +
Sbjct: 454 TLWEGLQEKYADTPVRFNYAKGT-SLPLIDGATRHVSLEQGFDKSGFAEALRVAKTSDLI 512
Query: 521 IVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADS 580
+V G E R + LPG Q L+ + +T K P++LVL G PLD+S+ EA++
Sbjct: 513 LVAMGEHYHWSGEAASRTDITLPGNQRELLKELKKTGK-PIVLVLFNGRPLDLSW-EAEN 570
Query: 581 QISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM--NDMNMRADSSR 638
+ +I+ YPG A+A+++ GD+NP RL +T +P + ++P+ N N
Sbjct: 571 -VDAIVEAWYPGIMAGHAVADVLSGDYNPSARLVVT-FPRNVGQIPIFYNMKNTGRPFDE 628
Query: 639 QYPG---RSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTG 695
+P SY + ++ FG GLSYT++ Y TIS+ L + G
Sbjct: 629 NHPADYKSSYIDSPNSPLFPFGFGLSYTSFQY------DNATISSQK--------LTKGG 674
Query: 696 SRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDR 755
S + V + VTN G+VDG VV L+ S P K+L GF +
Sbjct: 675 SLI-----------------VSVDVTNTGNVDGEEVVQLYIHDKVGSVTRPVKELKGFKK 717
Query: 756 VHTVAKGSKEISFGVDPCEQLSIAN 780
+ +K + F ++ E L + N
Sbjct: 718 IFLKKGETKTVEFTINE-EMLKMYN 741
>gi|393789623|ref|ZP_10377743.1| hypothetical protein HMPREF1068_04023 [Bacteroides nordii
CL02T12C05]
gi|392650339|gb|EIY44008.1| hypothetical protein HMPREF1068_04023 [Bacteroides nordii
CL02T12C05]
Length = 863
Score = 243 bits (620), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 154/459 (33%), Positives = 227/459 (49%), Gaps = 58/459 (12%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
P+ N +LS RA+ L+ LTL+EK+ + + + I RLGI Y+WW+E+LHG+A G
Sbjct: 24 PYRNPNLSPEERAEDLLGRLTLKEKVTLMQNESFPIERLGIAHYDWWNEALHGVARAG-- 81
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ---------AGLT 163
T FP + A+SF+ +I +AV+ E RA Y+ GLT
Sbjct: 82 --------IATVFPITMGMASSFDDKAIEDIFTAVSDEVRAKYHDAHRQGRRGVRCEGLT 133
Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
FW PNINIFRDPRWGRGQET GEDP + S V VK QG D
Sbjct: 134 FWTPNINIFRDPRWGRGQETYGEDPYLTSRMGVAVVKGLQGPADAKYD------------ 181
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSC 282
L AC KH Y + R+SFNA I+ +D +T+ P F++
Sbjct: 182 --------------KLHACAKH---YAVHSGPEAKRHSFNAENISPRDLWETYLPAFKTL 224
Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTK 341
+++ +MC+YN+ G P C L Q R++WG+K + SDC A++ F Y +
Sbjct: 225 VQEADVKEVMCAYNRFEGDPCCGSNRLLTQILRDDWGYKHVVVSDCGAISDFF-YSDRHA 283
Query: 342 THED---SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
TH++ ++A + +G D+ CG H A+ +G + E+ I+ +L L + LG
Sbjct: 284 THKNAADASAAAVLSGTDLECG-IEYAHLDKAVKEGLISEERINTSLFRLLKARFELGEM 342
Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
+ D + K+ D V HK++ALD R+ +VLL N+ LPL K + +A++GP
Sbjct: 343 DDDALV-PWSKISIDTVDCQMHKQMALDITRKSMVLLHNN-GVLPLGKTG-ARIAVMGPN 399
Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCH 497
N+ G Y G P ++L G+ + Y GC
Sbjct: 400 ANDSVMQWGNYEGTPSHTVTVLEGIRNKIGNVIYEKGCE 438
Score = 98.2 bits (243), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 82/301 (27%), Positives = 140/301 (46%), Gaps = 56/301 (18%)
Query: 515 KKADFVIVVAGLDLTQETEDR-----------DRVSLLLPGQQMSLVTSVARTSKRPVIL 563
K+AD VI V G+ E E++ DR S+ LP Q +++ ++ + K+ + +
Sbjct: 597 KEADVVIFVGGISPNLEGEEKNFVNCPGFVGGDRTSIELPEVQRNILKALKKAGKKVIFV 656
Query: 564 VLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT 623
+G ++ +IL YPG+AG A+A+IIFGD+NP G+LP+T+Y +
Sbjct: 657 NCSGSA---MALVPETQSCDAILQAWYPGQAGGTAVADIIFGDYNPSGKLPVTFYKNTEQ 713
Query: 624 KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLK 683
D +M+ GR+YR+ T + ++ FG+GLSYT + + K
Sbjct: 714 LPDFEDYSMK--------GRTYRYMTESPLFPFGYGLSYTTFQFS--------------K 751
Query: 684 AGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ 743
AG +K + T ++F +++V N G DG+ VV ++ R +
Sbjct: 752 AGLNKRV---------------ATINQPVQF--KVNVKNTGKRDGTEVVQIYIRKADDEE 794
Query: 744 GTPEKQLIGFDRVHTVAKGSKEISFGVDP-CEQLSIANKHGRRILPLGNHVLMVGELRHS 802
G P K L F V A S ++ + P + + RI+P G++ +M G ++
Sbjct: 795 G-PVKSLRAFRPVTLKAGKSATVNITLSPDAFEFFDPETNTMRIVP-GDYEIMYGNSSNT 852
Query: 803 L 803
L
Sbjct: 853 L 853
>gi|393779898|ref|ZP_10368130.1| glycosyl hydrolase family 3, N-terminal domain protein
[Capnocytophaga sp. oral taxon 412 str. F0487]
gi|392609318|gb|EIW92128.1| glycosyl hydrolase family 3, N-terminal domain protein
[Capnocytophaga sp. oral taxon 412 str. F0487]
Length = 770
Score = 243 bits (620), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 221/788 (28%), Positives = 347/788 (44%), Gaps = 127/788 (16%)
Query: 61 ISTRAKSLISLLTLQEKIQQLS----DNASAIPRLGIPAYEWWSESLHGIASNGPGVNFN 116
I R S++ L+TL+EKI Q++ D + P + + + L G N V
Sbjct: 43 IEQRVDSVLRLMTLEEKIGQMTQFSADWSVTGPVMADKYQPYLEKGLVGSIFNATSVAGI 102
Query: 117 GTVSSV----------------------TSFPQVLVSAASFNRSLWSNIGSAVAVEARAM 154
+ + T FP L + S++ +L A EA A
Sbjct: 103 RNLQKIAVEQTRLGIPILFGQDVIHGYKTIFPIPLAESCSWDLTLMRKTAELAAREASA- 161
Query: 155 YNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQG-ENWKSDDG 212
G+ + +AP ++I RD RWGR E GEDP + S A VK FQG +NW+
Sbjct: 162 -----DGINWTFAPMVDITRDARWGRAMEGAGEDPYLGSLIAEARVKGFQGGDNWQM--- 213
Query: 213 GIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFN-AMITEQDT 271
L AC KH Y + G +N A ++
Sbjct: 214 --------------------LSSPHTLLACGKHFAGYGAAESGK----DYNTAELSMHTL 249
Query: 272 EDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAV 330
+ + PP+ + + + IM S N++NGVPA L + R EWGF G + SD
Sbjct: 250 RNVYLPPYEATL-NARVGSIMASLNEINGVPATADKWLLTEVLRKEWGFNGLLVSDYTG- 307
Query: 331 ATIFEYQNYTKTHEDSAAGVLKAG----MDINCGTCMLRHTQSAIDKGKVQEKDIDRALL 386
I E + +D A L A MD+N G +++ + + +GKV E ID+A+
Sbjct: 308 --INELVRHGVAKDDKQAANLSANAGIEMDMN-GATFIKYLSALVKEGKVTEAQIDKAVR 364
Query: 387 NLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNK 446
++ ++ LGLF+ DP + ++ T ++ K+A A +VLLKN+ + LP+ K
Sbjct: 365 HILEIKFLLGLFD-DPYRYLDETRAKENTFTEKYLKVARQAVASSVVLLKNEAEVLPIKK 423
Query: 447 NAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLRGL-EAYVS---KTHYASGCHDVP 500
++ ++A+IGP++NN S + G +T G SLL GL E Y + K YA GC
Sbjct: 424 DSGKTIAVIGPMMNNTSDINGSWTCLGDGKQSVSLLTGLTEKYKATNVKLLYAEGCGFTT 483
Query: 501 CNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRP 560
+++ EAV +A+KAD V+V G + E R + LP Q L+ ++ +T +P
Sbjct: 484 ISTEQ-LKEAVAMARKADRVLVAVGEQSSWSGESAVRTDIRLPQAQRQLLEAL-KTINKP 541
Query: 561 VILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPE 620
+ ++ G PLD+S+ + + +IL +PG G +A++I GD NP G L M+ +P
Sbjct: 542 IAIITFSGRPLDLSW--ENENVQAILQAWFPGTQGGYGIADVIAGDVNPSGHLTMS-FPR 598
Query: 621 SFTKVPM-----------NDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKF 669
S ++P+ + N D Y Y + T +Y FG+GLSYT ++
Sbjct: 599 SVGQIPIYYNYKSTGRPVHTNNEEVDHRPHYNA-GYLDSSITPLYPFGYGLSYTTFAISN 657
Query: 670 LSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGS 729
+ + SLK +D I V SV N G +G
Sbjct: 658 VHLNKK-----SLKRYNDSII-------------------------VNASVQNTGTTEGE 687
Query: 730 HVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPL 789
VV L+ R S P K+L GF ++ A SK++ F + P E L+ +G++
Sbjct: 688 IVVQLYTRQLVASVSRPVKELKGFQKISLKAGESKQVRFEL-PSEALAFYGINGKKDTEP 746
Query: 790 GNHVLMVG 797
+L VG
Sbjct: 747 SECLLWVG 754
>gi|430736195|gb|AGA60127.1| glycoside hydrolase [Aminobacter sp. Gsoil204]
Length = 772
Score = 243 bits (620), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 200/666 (30%), Positives = 313/666 (46%), Gaps = 97/666 (14%)
Query: 123 TSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQ 181
T FP L AAS++ A EA A G+ + +AP +++ RDPRWGR
Sbjct: 141 TIFPISLGEAASWDLKAIEKAARISATEASA------EGIHWTFAPMVDVARDPRWGRIS 194
Query: 182 ETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSA 241
E GED + S A V+ FQG + K+ D + A
Sbjct: 195 EGAGEDVYLGSRIAEARVRGFQGNDLKAVD--------------------------TVLA 228
Query: 242 CCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGV 301
KH AY + G R I+E+ D + PPF++ + A+ M S+N V+G+
Sbjct: 229 TAKHFAAYGAAQAG---RDYGTVDISERTLRDVYLPPFKAAADA-GAATFMTSFNDVDGI 284
Query: 302 PACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINC- 359
PA L R++WGFKG++ +D ++ + + Y+K + + + AG+D++
Sbjct: 285 PASGNHHLLTDVLRDKWGFKGFVVTDYTSINEMVAH-GYSKDLQQAGEQAINAGVDMDLQ 343
Query: 360 GTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSE 419
G + H ++ +GKV ID A+ + ++ RLGLF DP + V +
Sbjct: 344 GAVFMEHLAKSVAEGKVDVARIDAAVKAILEMKYRLGLFE-DPYRYSDEAREKATVYRPD 402
Query: 420 HKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGI---PCSP 476
+ A D AR+ +VLLKN LPL +A S+A+IGPL ++ + M G ++ P
Sbjct: 403 FLEAARDVARKSMVLLKNANNALPLAASA-KSIAVIGPLGDSKADMIGSWSAAGDRKTRP 461
Query: 477 KSLLRGLEAYVSKTHY------ASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQ 530
+LL G++A K AS + +D GF EA+ +A+K+D ++ G
Sbjct: 462 VTLLEGMQARAPKGQSVAYVRGASYAFEDAGKTD-GFAEAIALAQKSDVIVAAMGERWDM 520
Query: 531 ETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGY 590
E R SL LPG Q +L+ + +T K P+ILVL G P + +A D+ + +IL Y
Sbjct: 521 TGEAASRTSLDLPGNQQALLQELKKTGK-PIILVLMSGRPNSIEWA--DANVDAILEAWY 577
Query: 591 PGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP----MNDMNMRADSSR---QYPGR 643
PG G A+A++++GD+NP G+LP T +P + +VP M + D ++ +Y R
Sbjct: 578 PGTMGGHAIADVLYGDYNPSGKLPAT-FPRNVGQVPLYYDMKNTGRPIDPAKPDAKYVSR 636
Query: 644 SYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS-ASLKAGSDKNILQQTGSRLDYVH 702
Y T +Y FG+GLSYT+++Y S +T+S A +K G
Sbjct: 637 -YLNTPNTPLYPFGYGLSYTSFTY------SPVTLSKARIKPGEP--------------- 674
Query: 703 IDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG 762
++VTN+G DG VV L+ R S P ++L GF ++
Sbjct: 675 -----------LTASVTVTNSGARDGEEVVQLYVRDLVGSVTRPVRELKGFRKIPLKKGE 723
Query: 763 SKEISF 768
SK +SF
Sbjct: 724 SKTVSF 729
>gi|420148909|ref|ZP_14656095.1| glycosyl hydrolase family 3, N-terminal domain protein
[Capnocytophaga sp. oral taxon 335 str. F0486]
gi|394754508|gb|EJF37885.1| glycosyl hydrolase family 3, N-terminal domain protein
[Capnocytophaga sp. oral taxon 335 str. F0486]
Length = 770
Score = 243 bits (620), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 226/803 (28%), Positives = 351/803 (43%), Gaps = 157/803 (19%)
Query: 61 ISTRAKSLISLLTLQEKIQQLSD-----------------------------NASAI--- 88
I R S++ L+TL+EKI Q++ NA+++
Sbjct: 43 IEQRVDSVLRLMTLEEKIGQMTQFSADWSVTGPVMADKYQPYLEKGLVGSIFNATSVVGM 102
Query: 89 ----------PRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRS 138
RLGIP + + +HG T FP L + S++ +
Sbjct: 103 RKLQKIAVEQTRLGIPIL-FGQDVIHGYK---------------TIFPIPLAESCSWDLA 146
Query: 139 LWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVE 197
L A EA A G+ + +AP ++I RD RWGR E GEDP + S A
Sbjct: 147 LMRKTAELAAREATA------DGINWTFAPMVDITRDARWGRAMEGAGEDPYLGSLIAEA 200
Query: 198 FVKSFQG-ENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGN 256
VK FQG +NW++ L AC KH Y + G
Sbjct: 201 RVKGFQGGDNWQT-----------------------LSSPHTLLACGKHFAGYGAAESGK 237
Query: 257 FSRYSFN-AMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPA-CLRGDLFQKAR 314
+N A ++ + + PP+ + + G S IM S N++NGVPA + L + R
Sbjct: 238 ----DYNTAELSMHTLRNVYLPPYEATLNAGVGS-IMASLNEINGVPATAYKWLLTEVLR 292
Query: 315 NEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG----MDINCGTCMLRHTQSA 370
EWGF G + SD I E + +D A L A MD+N G +++ +
Sbjct: 293 KEWGFNGLLVSDYTG---INELVRHGVAKDDKQAANLSANAGIEMDMN-GATFIKYLSAL 348
Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQ 430
+ +GKV E ID+A+ ++ ++ LGLF+ DP + ++ T E+ K+A A
Sbjct: 349 VKEGKVTEAQIDKAVRHILEMKFLLGLFD-DPYRYLDETRAKENTFTEEYLKVARQAVAS 407
Query: 431 GIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLRGL-EAYV 487
+VLLKN+ + LP+ K++ ++A+IGP++NN S + G +T G SLL GL E Y
Sbjct: 408 SVVLLKNEAEVLPIKKDSGKTIAVIGPMMNNTSDINGSWTCLGDGKQSVSLLTGLTEKYK 467
Query: 488 S---KTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPG 544
K YA GC +++ EAV IA+KAD V+V G + E R + LP
Sbjct: 468 GTNVKLLYAEGCGFTTISTEQ-LKEAVAIARKADRVLVAVGEQSSWSGESAVRTDIRLPQ 526
Query: 545 QQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIF 604
Q L+ ++ +K P+ ++ G PLD+S+ + + +IL +PG G +A++I
Sbjct: 527 AQRQLLEALKAINK-PIAIITFSGRPLDLSW--ENENVQAILQAWFPGTQGGYGIADVIA 583
Query: 605 GDFNPGGRLPMTWYPESFTKVPMN----------DMNMRADSSRQYPGRSYRFYTGTQVY 654
GD NP G+L M+ +P S ++P+ N R + Y + T +Y
Sbjct: 584 GDVNPSGQLTMS-FPRSVGQIPIYYNYKSTGRPVYTNNEEVDHRPHYNAGYLDSSITPLY 642
Query: 655 GFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRF 714
FG+GLSYT ++ + + S+K +D I
Sbjct: 643 PFGYGLSYTTFAINNVHLNKK-----SIKRYNDSII------------------------ 673
Query: 715 HVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCE 774
V SV N G +G VV L+ R S P K+L GF ++ A SK++ F + P E
Sbjct: 674 -VNASVQNTGTTEGEIVVQLYTRQLVASVSRPVKELKGFQKIPLKAGESKQVHFEL-PSE 731
Query: 775 QLSIANKHGRRILPLGNHVLMVG 797
L+ +G++ +L VG
Sbjct: 732 ALAFYGINGKKDTEPSECLLWVG 754
>gi|156740905|ref|YP_001431034.1| glycoside hydrolase family 3 [Roseiflexus castenholzii DSM 13941]
gi|156232233|gb|ABU57016.1| glycoside hydrolase family 3 domain protein [Roseiflexus
castenholzii DSM 13941]
Length = 790
Score = 243 bits (620), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 219/761 (28%), Positives = 336/761 (44%), Gaps = 158/761 (20%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIPA E G +NG T+FPQ++ A+++ L + +
Sbjct: 97 RLGIPAL-IHDECCSGFLANG-----------ATNFPQIIGIASAWEPELVEAMTRVIRQ 144
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
+ RA+ GL AP ++I RDPRWGR +ET GEDP + S +++ QG +W
Sbjct: 145 QMRAVGV--HHGL---APVLDIARDPRWGRTEETFGEDPYLTSVMGAAYIRGLQGADWSE 199
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG-NFSRYSFNAMITE 268
+ A KH + Y + G N++ A IT
Sbjct: 200 G----------------------------VMATGKHFVGYSASEGGLNWA----PAHITT 227
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPA-----CLRGDLFQKARNEWGFKGYI 323
++ + + PF + + + + IM +Y++++G P L G L R+EWGF+G +
Sbjct: 228 RELREVYLAPFETAVRAARLASIMPAYHEIDGEPCSGAHWLLTGIL----RDEWGFEGLV 283
Query: 324 TSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINC------GTCMLRHTQSAIDKGKVQ 377
SD A+ + Y + +A L+AGMDI G +L A+ G++
Sbjct: 284 VSDYMAIDQLRNYHKLARDKAHAARLALEAGMDIELPNVEAYGQPLL----DALAAGEIP 339
Query: 378 EKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVC----TSEHKKLALDAARQGIV 433
+ +DR++ + +++ GLF + PD V T ++LA + AR+ IV
Sbjct: 340 MEWVDRSVRRILTLKFAFGLFENP-------YVDPDAVPAVFDTPAQRELAREIARKSIV 392
Query: 434 LLKNDKKFLPLNKNAVSSLAIIGPLVNN-------------------ISQMGGGYTGIPC 474
LLKN+ LPL K +S++A+IGP ++ +SQ+G +P
Sbjct: 393 LLKNEGNRLPLPKT-LSAIAVIGPNADSKRNLLGDYSYPAHIETLITLSQLGFSEHPLPD 451
Query: 475 SPK---------SLLRGLEAYVSKTH---YASGCHDVPCNSDAGFHEAVRIAKKADFVIV 522
S + S++ + VS T YA GC DV S GF EA+ A+KA+ IV
Sbjct: 452 SIRLIENDSSMLSIVEAIRRTVSPTTQVLYARGC-DVNSPSTDGFAEAIEAARKAEVAIV 510
Query: 523 VAG--LDLTQET---EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAE 577
V G LT E E RD L LPG Q LV ++ T PV+LVL G P +
Sbjct: 511 VVGDKAGLTPECTSGEFRDSAHLTLPGVQQQLVAAILATGT-PVVLVLVTGRPYAIPHLV 569
Query: 578 ADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSS 637
+ W+ PG GA ALAE +FGD NPGG+LP+T +P +VP+ + R +
Sbjct: 570 DATPAVVEAWL--PGAEGAPALAEALFGDVNPGGKLPIT-FPRHVGQVPLFYAH-RPSGA 625
Query: 638 RQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSR 697
R + Y + ++ FG GLSYT ++++ L+ ++T ++A D
Sbjct: 626 RSFFYGPYMDESNQPLFPFGFGLSYTQFAFENLTVTPDVTTDGEVQASVD---------- 675
Query: 698 LDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVH 757
V N G+ G VV L+ R S P K+L GF RVH
Sbjct: 676 ----------------------VINTGERSGDEVVQLYTRTEGASVTRPVKELRGFKRVH 713
Query: 758 TVAKGSKEISFGVDPCEQLSIANKHGRRIL-PLGNHVLMVG 797
K + F + P E L+ + ++ P+ H+ MVG
Sbjct: 714 LQPGERKRVIFTL-PVELLAYYDTTMHLVVEPVEVHI-MVG 752
>gi|393789624|ref|ZP_10377744.1| hypothetical protein HMPREF1068_04024 [Bacteroides nordii
CL02T12C05]
gi|392650340|gb|EIY44009.1| hypothetical protein HMPREF1068_04024 [Bacteroides nordii
CL02T12C05]
Length = 855
Score = 243 bits (619), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 164/490 (33%), Positives = 243/490 (49%), Gaps = 69/490 (14%)
Query: 16 FRVLQLI-VVVNVIAFSNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTL 74
++L+LI ++ ++ FS S NK + F + + R L++ LT+
Sbjct: 1 MKILKLIGCLIPLLVFSLSGQAQNKTEL-----------FRDMTAPQHERILDLLNRLTV 49
Query: 75 QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
+EK+ L ++A IPRL I Y +E+LHG+ G T FPQ + AA+
Sbjct: 50 EEKVSLLVNDAREIPRLNIDKYNHGNEALHGVVRPG----------EFTVFPQAIGLAAT 99
Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAG----------LTFWAPNINIFRDPRWGRGQETP 184
+N +L + +A++ EAR + G LTFW+P +N+ RDPRWGR ET
Sbjct: 100 WNPNLIFRVSTAISDEARGRWKELDYGKKQIAGGSDLLTFWSPTVNMARDPRWGRTPETY 159
Query: 185 GEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCK 244
GEDP + EFVK QG+N R LK + K
Sbjct: 160 GEDPFLSGRIGCEFVKGLQGDN--------------PRYLKTV-------------STPK 192
Query: 245 HLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC 304
H A + E +R S NA ++E+D + + P F CI GKA IM +YN VN VP
Sbjct: 193 HFAANNEEH----NRSSCNARMSERDLREYYLPAFERCIVDGKAQSIMMAYNAVNDVPCT 248
Query: 305 LRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCM 363
+ L +K R +W F GYI SDC A + Y K E +A LKAG+D+ CG +
Sbjct: 249 VNIYLIKKVLRGDWNFNGYIVSDCSAPEWMVTKHKYVKNLEAAATLALKAGLDLECGDRV 308
Query: 364 LRH-TQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKK 422
A ++ V E +ID A ++ ++ LGLF+ DP + Y K+ P + EH++
Sbjct: 309 YTAPLLKAYNEYMVSEAEIDSAAYHILRGRMLLGLFD-DPSQNPYNKIEPSVIGCKEHQE 367
Query: 423 LALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIP-CSPKSLLR 481
LAL+ ARQ +VLLKN K FLPLN+ + S+A++G ++ G Y+G P +P S+L
Sbjct: 368 LALETARQSMVLLKNQKNFLPLNRKKIRSIAVVG--ISAAHCEFGDYSGNPKNTPVSVLD 425
Query: 482 GLEAYVSKTH 491
G++ Y +
Sbjct: 426 GIKKYAENAN 435
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 95/306 (31%), Positives = 154/306 (50%), Gaps = 48/306 (15%)
Query: 502 NSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPV 561
N A F +A ++AK+ D + V G++ + E E +DR SL LP Q + + + + V
Sbjct: 588 NRLALFGDAGKVAKECDVTVAVLGINKSIEREGQDRYSLELPIDQQEFIKELYKVNPNTV 647
Query: 562 ILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPES 621
+ VL G + +++ D + +IL YPGE G A+AE++FGD+NPGGRLP+T+Y S
Sbjct: 648 V-VLVAGSSMAINWM--DENVPAILNAWYPGEQGGNAVAEVLFGDYNPGGRLPLTYY-NS 703
Query: 622 FTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISAS 681
++P D D S + R+Y+++ G +Y FG+GLSYTN+ YK
Sbjct: 704 LDELPAFD-----DYSVK--NRTYQYFEGKPLYEFGYGLSYTNFKYK------------- 743
Query: 682 LKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKV 741
K + Q+ +D + +++N G DG V ++ R P+
Sbjct: 744 ------KKSIMQSNDTVD----------------ITFNLSNVGKYDGDEVAQVYVRYPET 781
Query: 742 SQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRR-ILPLGNHVLMVGELR 800
P KQL GF RVH S +I+ + P ++L ++ R+ + P G +V VG
Sbjct: 782 GTYMPLKQLKGFSRVHLKKGKSADITISI-PKKELRYWDEKTRQFVTPTGEYVFQVGGSS 840
Query: 801 HSLTIE 806
+++IE
Sbjct: 841 ENISIE 846
>gi|295132888|ref|YP_003583564.1| beta-glucosidase [Zunongwangia profunda SM-A87]
gi|294980903|gb|ADF51368.1| beta-glucosidase [Zunongwangia profunda SM-A87]
Length = 855
Score = 243 bits (619), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 152/463 (32%), Positives = 231/463 (49%), Gaps = 57/463 (12%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
P+ N +LS RA+ L++ LTL+EK + D + AIPRLGI + WWSE+LHG A+N
Sbjct: 13 PYQNPNLSPEERAEDLVNRLTLEEKASLMFDVSEAIPRLGIKKFNWWSEALHGFANN--- 69
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY---------NLGQAGLT 163
VT FP+ + AASF+ L + A + E RA Y N L+
Sbjct: 70 -------DDVTVFPEPVGMAASFDDELVYQVFDATSDEVRAKYHEALRNGEENKRFLSLS 122
Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
W PN+NIFRDPRWGRGQET GEDP + S V+ VK QG +D + K++
Sbjct: 123 VWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVQVVKGLQG----PEDA------KYKKL 172
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSC 282
L AC KH + +W SR+ N ++++D +T+ P F+
Sbjct: 173 L----------------ACAKHYAVHSGPEW---SRHELNLNNVSQRDLWETYLPAFKVL 213
Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTK 341
++ +MC+Y +++ P C L Q+ R +WGF+ + SDC A+ + N +
Sbjct: 214 VQDANVRQVMCAYQRLDDEPCCGSDRLLQQILREKWGFEHLVVSDCGAIQDFYTSHNVSS 273
Query: 342 THEDSAAGVLKAGMDINC--GTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
+AA + AG D+ C + A++KG V+E+DIDR++ + + LG +
Sbjct: 274 DAVHAAAKAVLAGTDVECQWDKHNYKLLPEAVEKGLVKEEDIDRSVKRVLIGRFELGEMD 333
Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
D Y ++ + EH++LAL AR+ + LL+N LPL+K +A+IGP
Sbjct: 334 PD-EIVPYAQIPASVINNEEHRQLALKMARESMTLLQNKNNILPLSKGQ-DRIAVIGPNA 391
Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCN 502
++ + G Y G P S+L G+ SK S +D C+
Sbjct: 392 DDEPMLWGNYNGTPVRTISILDGI---TSKIGEKSIVYDKACD 431
Score = 105 bits (262), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 85/313 (27%), Positives = 144/313 (46%), Gaps = 63/313 (20%)
Query: 498 DVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQM 547
++P N + + ++ K + VI V GL E E+ DR + LP Q
Sbjct: 577 EIPVN----YEDLLKKLKGIETVIFVGGLSTKLEGEEMPVSYPGFKGGDRTDIALPSVQR 632
Query: 548 SLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDF 607
+ + ++ K+ VI V G + + +IL Y GE+G +A+A+++FGD+
Sbjct: 633 NCLKTLKDAGKK-VIFVNNSGSA--IGLVPETTSCDAILQAWYGGESGGQAVADVLFGDY 689
Query: 608 NPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYS 666
NP G+LP+T+Y ++ T++P D +M GR+YRF ++ FGHGLSYTN+
Sbjct: 690 NPSGKLPVTFYKDT-TQLPDFEDYSMN--------GRTYRFMKAEPLFPFGHGLSYTNF- 739
Query: 667 YKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDV 726
K G ++LD ID TS ++ IS++N G
Sbjct: 740 ----------------KIGE---------AQLDKSEID-----TSSSVNITISISNEGKT 769
Query: 727 DGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGR-- 784
+G ++ ++ + +G P K L GF RV+ K ++ + P +K R
Sbjct: 770 EGVEIIQVYVHKQGLEEG-PIKTLKGFKRVNLKPNEMKNVTINL-PSNSFEFYDKKARSM 827
Query: 785 RILPLGNHVLMVG 797
+++P GN+ + G
Sbjct: 828 KVMP-GNYEIFYG 839
>gi|56412920|ref|YP_149995.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Paratyphi A str. ATCC 9150]
gi|197361851|ref|YP_002141487.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Paratyphi A str. AKU_12601]
gi|56127177|gb|AAV76683.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Paratyphi A str. ATCC 9150]
gi|197093327|emb|CAR58775.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Paratyphi A str. AKU_12601]
Length = 768
Score = 243 bits (619), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 209/751 (27%), Positives = 343/751 (45%), Gaps = 140/751 (18%)
Query: 75 QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
++ I+Q+ D A+ RL IP + + + +HG T FP L A+S
Sbjct: 86 RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 129
Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
FN +G A EA GL WAP +++ RDPRWGR E GED + S
Sbjct: 130 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 183
Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
VK+ QG++ D + KH AY +
Sbjct: 184 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 217
Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMC--SYNQVNGVPACLRGDLF 310
G +N + ++ Q + + PP+++ ++ G + ++ S N +NG PA L
Sbjct: 218 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAGSGAVMVALNSLNSLNGTPATSDSWLL 273
Query: 311 QKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQ 368
+ R+EWGFKG SD A+ + ++ ED+ LKAG+D++ ++
Sbjct: 274 KDVLRDEWGFKGITVSDHGAIKELIKH-GTAADPEDAVRVALKAGVDMSMADEYYSKYLP 332
Query: 369 SAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HK 421
I GKV ++D A ++ +V+ +GLFN DP Y LGP D V T+ H+
Sbjct: 333 GLIKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHR 387
Query: 422 KLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSL 479
K A + AR+ +VLLKN + LPL K+ ++A++GPL ++ + G ++ G+ ++
Sbjct: 388 KEAREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTV 445
Query: 480 LRGLEAYV---SKTHYASGCHDV-------------------PCNSDAGFHEAVRIAKKA 517
L G++ V +K YA G + P + A EAV+ AK+A
Sbjct: 446 LAGIQNAVGDGAKILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQA 505
Query: 518 DFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAE 577
D V+ V G E R + +P Q L+T++ T K P++LVL G PL + +
Sbjct: 506 DVVVAVVGESQGMAHEASSRTDITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVK 562
Query: 578 ADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA--- 634
D Q +IL + G G A+A+++FGD+NP G+LP++ +P S ++P+ ++
Sbjct: 563 EDQQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRP 621
Query: 635 ---DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNIL 691
+ +Y R + G +Y FG+GLSYT ++ S++T+S+ +
Sbjct: 622 YNPEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TM 666
Query: 692 QQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLI 751
Q+ G +VT+ + VTN G +G+ V+ ++ + S P KQL
Sbjct: 667 QRDG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLK 709
Query: 752 GFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
GF+++ K +SF +D E L N+
Sbjct: 710 GFEKITLKPGERKTVSFPID-IEALKFWNQQ 739
>gi|329956868|ref|ZP_08297436.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
gi|328523625|gb|EGF50717.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
Length = 864
Score = 243 bits (619), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 152/461 (32%), Positives = 230/461 (49%), Gaps = 62/461 (13%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
+ + S S + RA+ L+ LTL+EK+ + DN+ + RLGI Y WW+E+LHG+A +G
Sbjct: 24 YKDNSYSPAERAEDLVKQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSG--- 80
Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
T FPQ + AASF+ AV+ EARA A GLT W
Sbjct: 81 -------WATVFPQPIGMAASFSPEALHTAFVAVSDEARAKNAAYSAEGSYKRYQGLTIW 133
Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQ--GENWKSDDGGIGFGFREKRV 223
P +NI+RDPRWGRG ET GEDP + S V VK Q EN K D
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLASVMGVSVVKGLQCLDENEKYDK------------ 181
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSC 282
+ AC KH + +W +R+SFNA I+ +D +T+ PPF +
Sbjct: 182 ---------------VHACAKHFAVHSGPEW---NRHSFNAENISPRDLYETYLPPFEAL 223
Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTK 341
+++GK +MC+YN+ G P C L R EWG+ G + +DC A++ F K
Sbjct: 224 VKEGKVKEVMCAYNRFEGEPCCGSNRLLNHILRREWGYDGIVVADCSAISD-FHNDKGHK 282
Query: 342 TH---EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
TH +++ + +G D+ CG+ R + KG + E DIDR++ L + LG
Sbjct: 283 THADAASASSAAVLSGTDLECGS-NYRSLTEGVKKGFIDEADIDRSVKRLLQARFELGEM 341
Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
+ +P + ++ ++ VC+ +H L+LD AR+ + LL N LPL + +++A++GP
Sbjct: 342 D-EPDQVRWAQIPYSVVCSDKHDSLSLDMARKSMTLLLNKNNALPLERGG-TTIAVMGPN 399
Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH---YASGC 496
N+ G Y G+P ++L G+ + + K Y GC
Sbjct: 400 ANDSVMQWGNYNGLPKRTITILDGIRSAMGKDDKLIYEQGC 440
Score = 107 bits (268), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 82/292 (28%), Positives = 124/292 (42%), Gaps = 62/292 (21%)
Query: 498 DVPCNSDAGFHEAVRI------AKKADFVIVVAGLDLTQETEDR----------DRVSLL 541
D N D GF E I K AD VI G+ E E+ DR +
Sbjct: 577 DAQLNFDLGFKEEADIQRSVAKVKDADVVIFAGGISPQLEGEEMGVKLPGFRGGDRTDIE 636
Query: 542 LPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAE 601
LP Q ++ ++ K+ VI V G P ++ +IL YPG++G KA+AE
Sbjct: 637 LPAVQREMIKALHDAGKK-VIFVNCSGSP--IAMEPETEYCQAILQAWYPGQSGGKAVAE 693
Query: 602 IIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGL 660
++FGD+NP GRLP T+Y + ++P D NM G +YRF+ G ++ FG+GL
Sbjct: 694 VLFGDYNPAGRLPATFY-RNLAQLPDFEDYNM--------AGHTYRFFNGEPLFPFGYGL 744
Query: 661 SYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISV 720
SYT + Y + S ++K + + V
Sbjct: 745 SYTTFKYGKIQLKSSAQTDETVK--------------------------------ITVPV 772
Query: 721 TNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDP 772
TN G +G VV ++ + + G P K L F RV+ A + ++ + P
Sbjct: 773 TNTGSRNGEEVVQVYLKKQGETDG-PVKTLRAFKRVYIPAGKTVKVELELTP 823
>gi|381200964|ref|ZP_09908096.1| beta-glucosidase [Sphingobium yanoikuyae XLDN2-5]
Length = 792
Score = 243 bits (619), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 221/749 (29%), Positives = 336/749 (44%), Gaps = 133/749 (17%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP + ESLHG + + T FP + A SF+R L +++ S +A
Sbjct: 140 RLGIPIL-FHEESLHGYMA-----------TDATMFPMAIGLAGSFDRQLMTDVQSVIAR 187
Query: 150 EARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
E RA G+ +P ++I RDPRWGR +ET GEDP + V V QGE+
Sbjct: 188 EVRAR------GVHLALSPVVDIARDPRWGRIEETFGEDPYLCGEMGVAAVLGLQGES-- 239
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG-NFSRYSFNAMIT 267
E D++M A KH+ + + G N + A I+
Sbjct: 240 ---------------------EQIGPDKVM--ATLKHMTGHGQPQAGENIA----PAPIS 272
Query: 268 EQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPA----CLRGDLFQKARNEWGFKGYI 323
E++ + F PPFR +++ + +M SYN+++GVP+ L GD+ R EW F G +
Sbjct: 273 ERELRENFFPPFRQVVKRTGIAAVMPSYNEIDGVPSHQNKWLLGDIL---RGEWHFDGAV 329
Query: 324 TSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCM-LRHTQSAIDKGKVQEKDID 382
SD AV + + E +A L+AG+D + R + GKV + ++
Sbjct: 330 VSDYGAVPELDTIHHVQPDLEATARAALRAGVDCELPDGLAYRTLVEQVRAGKVPLEAVN 389
Query: 383 RALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFL 442
A + +++ R GLF + Y L +E + LAL AA + IVLLKND L
Sbjct: 390 LACTRMLTLKFRAGLFENPWPRADYDAL----TGNAEARALALKAAHKSIVLLKNDG-VL 444
Query: 443 PLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCHDV 499
PL A +A+IGP N GGY+ IP SLL G++A + ++ +A G
Sbjct: 445 PLKPGAHRKVAVIGP--NAAIARLGGYSSIPRQAVSLLDGVKAKLGNRAEIVHAQGVFIT 502
Query: 500 --------------PCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR-------DRV 538
P + EAV +AK AD +++ G D Q + + DR
Sbjct: 503 QSEDRSVDEVLLADPAKNRQLIAEAVEVAKTADIILLAIG-DTEQTSREGFAKNHLGDRT 561
Query: 539 SLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKA 598
SL L G+Q L ++ T K V+ + G P S+ ++++ YPG+ G A
Sbjct: 562 SLDLVGEQNDLFAAMKATGKPVVVCAINGRPP---SYPAVVDGANALIECWYPGQEGGTA 618
Query: 599 LAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGH 658
+A+I+FGD NPG +LP+T ++ ++P+ R SSR R Y F + ++ FG
Sbjct: 619 MADILFGDVNPGAKLPVTVARDA-GQIPI--FYNRKPSSR----RGYLFADASPLFPFGF 671
Query: 659 GLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQI 718
GLSYT KF+ P L+ S + G D V++
Sbjct: 672 GLSYT----KFVFGPPRLSAS-RIGVGGD--------------------------VTVEV 700
Query: 719 SVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
V N G+V G VV L+ S P K+L GF+R+ S+ + + P E ++
Sbjct: 701 DVRNVGNVAGEEVVQLYVHDQTASVTRPLKELKGFERIALAPGESRTVRLTIGP-EAFAL 759
Query: 779 ANKHGRRILPLGNHVLMVGELRHSLTIET 807
N R ++ G +MVG S T++T
Sbjct: 760 WNLDMREVVEPGLFDIMVGP--DSATLQT 786
>gi|282878201|ref|ZP_06286997.1| glycosyl hydrolase family 3 C-terminal domain protein [Prevotella
buccalis ATCC 35310]
gi|281299619|gb|EFA91992.1| glycosyl hydrolase family 3 C-terminal domain protein [Prevotella
buccalis ATCC 35310]
Length = 947
Score = 243 bits (619), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 201/756 (26%), Positives = 336/756 (44%), Gaps = 139/756 (18%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ S T+FP L ++NR L +G
Sbjct: 159 RLGIPV-DFTNEGIRGVES-----------FKATNFPTQLGLGTTWNRKLIHQVGYITGR 206
Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
EAR + G T +AP +++ RD RWGR +E GE P +V+ ++ + Q
Sbjct: 207 EARLL------GYTNVYAPILDVGRDQRWGRYEEVYGESPFLVAELGIQMTRGLQ----- 255
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
+++ KH AY K + ++
Sbjct: 256 --------------------------TNYQVASTGKHFAAYSNNKGAREGMARVDPQMSP 289
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGD---LFQKARNEWGFKGYITS 325
++ ++ P+ + + M SYN +GVP ++G L + R ++GFKGY+ S
Sbjct: 290 REVQNIHLYPWGRVVREAGLLGAMSSYNDYDGVP--IQGSFHWLTEVLRQQFGFKGYVVS 347
Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDI 381
D DA+ +F +++ + AG+++ C + + + +G++ K I
Sbjct: 348 DSDALEYLFSKHRTAANMKEAVYKAVMAGLNVRCTFRSPDSFVLPLRELVKEGRIPMKVI 407
Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPD-DVCTSEHKKLALDAARQGIVLLKNDKK 440
D L ++ V+ +G+F+ R + D +V H+++AL A+R+ IVLLKN
Sbjct: 408 DERLRDILRVKFMVGIFD---RPYQMNLQAADKEVDGKSHQQVALQASRESIVLLKNQNN 464
Query: 441 FLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCH 497
LPLNK ++ +A+ GP N+ + Y + ++ G+ V + Y GC
Sbjct: 465 TLPLNKASIKKIAVCGPNANDAAYALTHYGPLAVEVTTVFEGIRNKVGSDVEVTYTKGCD 524
Query: 498 ------------DVPCNSDAG--FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLP 543
D P +D +AV +++D +VV G + E++ R SL LP
Sbjct: 525 LVDAHWPESELVDYPMTADEQNEIDKAVEQVRQSDVAVVVLGGNSRTCGENKSRSSLELP 584
Query: 544 GQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEII 603
G+Q+ L+ +V T K VILVL G PL V++A D + +I+ YPG G A+A+++
Sbjct: 585 GRQLQLLKAVQATGK-TVILVLINGRPLSVNWA--DKFVPAIVEAWYPGSQGGTAVADVL 641
Query: 604 FGDFNPGGRLPMTWYPESFTKVPMN-------------DMNMRADSSRQYPGRSYRFYTG 650
FGD+NPGG+L +T +P++ ++P N + + ++SR
Sbjct: 642 FGDYNPGGKLTVT-FPKTVGQIPFNFPSKPAALVDGGNKLGLHGNASR----------AN 690
Query: 651 TQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCT 710
+Y FGHGLSYT + Y S L +SA Q S D V V SC
Sbjct: 691 GALYYFGHGLSYTTFKY------SNLRLSA------------QNISPTDSV----VVSC- 727
Query: 711 SLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGV 770
+TN G G VV L+ + + T EK L GF+RVH ++ +SF +
Sbjct: 728 --------DITNTGQRAGDEVVQLYIQDVLSTVTTYEKNLRGFERVHLKPGETRTLSFVI 779
Query: 771 DPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
P E L + N+ + ++ G+ +M+G + +E
Sbjct: 780 KP-EHLQLINEQYQHVVEPGDFKVMMGASSEDIRLE 814
>gi|408386263|gb|AFU63317.1| beta-glucosidase [uncultured bacterium]
Length = 764
Score = 243 bits (619), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 196/623 (31%), Positives = 298/623 (47%), Gaps = 87/623 (13%)
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
+AP ++I RD RWGR E GEDP + S A V FQG
Sbjct: 167 FAPMVDIGRDARWGRVMEGAGEDPYLGSKVAEARVLGFQG-------------------- 206
Query: 225 KGFGEESDRGDELM-LSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
G + + EL L A KHL AY + G R A ++E + + PP+R+
Sbjct: 207 ---GRDVNSLRELTTLLATGKHLAAYGAAESG---RDYNTAELSEHTLRNVYLPPYRAAH 260
Query: 284 EQGKASCIMCSYNQVNGVPACL-RGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
E G M S+N++NGVPA R + + RNEWGFKG++ SD V + + +
Sbjct: 261 EAG-VGTFMASFNEINGVPATADRWLMTEVLRNEWGFKGFVVSDYTGVNELVPH-GVARD 318
Query: 343 HEDSAAGVLKAGMDIN-CGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
+ +A + AG+D++ G ++H ++ +GKV EK ID A+ + ++ LGLF+ D
Sbjct: 319 EKHAAELAVNAGIDMDMTGATFIKHLTESVKEGKVSEKSIDTAVRRILEMKFLLGLFD-D 377
Query: 402 PRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
P + + + T E + A A +VLLKND + LP+ K + ++A+IGPL+N+
Sbjct: 378 PYRYLDEERAKQNTMTPEFMETARKAVSSSVVLLKNDNQVLPVTKESGKTVALIGPLMND 437
Query: 462 ISQMGGGYTGIPCSPKS--LLRGL----EAYVSKTHYASGCHDVPCNSDAGFHEAVRIAK 515
+ G + G+ KS +L+GL E + YA G + + A EAV A+
Sbjct: 438 SINLNGEWAGLGDRNKSVPILKGLTEKYEGSTVRLIYAEGAS-ITETTPAKIAEAVAAAR 496
Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
+AD V+ G D E R + LP Q L+ ++ T K PV+LV G PLD+S+
Sbjct: 497 RADIVVAAMGEDFNWSGEAAVRTDIRLPEPQRELLRALKETGK-PVVLVTLSGRPLDLSW 555
Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM-------- 627
D + +IL +PG G +A++I GD+NP GRL M+ +P + ++P+
Sbjct: 556 --EDETMDAILQAWFPGTQGGHGIADVIAGDYNPSGRLVMS-FPRNIGQIPIYYNQKNTG 612
Query: 628 --NDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAG 685
D+N + S Y +Y + T +Y FG+GLSYT + A
Sbjct: 613 RPADLN---NESVDYKS-NYLDTSITPLYPFGYGLSYTTFEIS--------------NAQ 654
Query: 686 SDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT 745
DK L+ D++T ++R NAG+VDG VV L+ R S
Sbjct: 655 VDKKTLK--------TEDDKITVTANVR--------NAGEVDGEIVVQLYTRQLVASVTR 698
Query: 746 PEKQLIGFDRVHTVAKGSKEISF 768
P K+L GF +V A SK+++F
Sbjct: 699 PVKELKGFQKVALQAGKSKQVTF 721
>gi|332671963|ref|YP_004454971.1| glycoside hydrolase family 3 domain-containing protein
[Cellulomonas fimi ATCC 484]
gi|332341001|gb|AEE47584.1| glycoside hydrolase family 3 domain protein [Cellulomonas fimi ATCC
484]
Length = 771
Score = 243 bits (619), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 192/638 (30%), Positives = 293/638 (45%), Gaps = 98/638 (15%)
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
+AP ++I RDPRWGR E GEDP + SA A V+ FQG G IG +RV+
Sbjct: 166 FAPMVDIARDPRWGRIVEGAGEDPYLGSAVAAAQVRGFQG-------GRIG---TPERVI 215
Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIE 284
G KH Y G R +++Q + + PPF++ ++
Sbjct: 216 AG----------------PKHFAGYGAAIGG---RDYDEVDLSDQQLRNVYLPPFKAAVD 256
Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTH 343
G A IM +Y +NGVPA L + R+EWGF G++ SD +AV + + Y
Sbjct: 257 AG-AGNIMTAYMPLNGVPATANAWLLKDVLRDEWGFDGFLVSDANAVRNLVTH-GYAADL 314
Query: 344 EDSAAGVLKAGMDINCGTC--MLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
D+A ++ G+D+ H A++ G V + +D + + V++RLGLF +
Sbjct: 315 PDAAVRAVEVGLDLEMAISDPAYAHLPEAVESGAVSIEAVDACVRRILEVKVRLGLFE-E 373
Query: 402 PRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
P + + H+++A +AA++ VLL+ND + LPL+ +A+ S+A+IGPL ++
Sbjct: 374 PYVDE--DRAAAVLADPAHRQVAREAAQRSAVLLRNDGRLLPLDPDALGSVAVIGPLADS 431
Query: 462 ISQMGGGYT--GIPCSPKSLLRGLEAYVSKT---HYASGCHDV-----------PCNS-- 503
G + ++L GL A + T YA G P N+
Sbjct: 432 KRDTLGPWVFDDDLDETVTILDGLRARLGDTTDVRYAPGVRPAQRTFPSMFEMFPGNATP 491
Query: 504 -------DAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVART 556
+A AV +A+ +D ++V G E R SL LPG+Q+ L+ +VA T
Sbjct: 492 DPEDFDDEAELARAVELARTSDVAVLVVGEWQGMIGEAASRSSLELPGRQLELLQAVAAT 551
Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
PV+L++ G PLD+ +A + +IL + YPG G A+A+++ GD PGG+LP T
Sbjct: 552 GT-PVVLLVMNGRPLDLRWAA--QHVPAILDVWYPGTQGGTAVADLLVGDVAPGGKLPFT 608
Query: 617 WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
W P + +VPM + Q GR Y T ++ FG GL Y ++Y L
Sbjct: 609 W-PRTVGQVPMI-YSHTTSFEPQNQGRRYWDEESTPLFPFGFGLGYGEFTYSDL------ 660
Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
RLD I S T+ ++VTNAG DG VV L+
Sbjct: 661 --------------------RLDAETIPLDGSLTA-----SVTVTNAGQHDGDEVVQLYV 695
Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCE 774
S P ++L GF R+H A S+ + + P E
Sbjct: 696 HQRHGSAARPVRELKGFQRIHLAAGESRTVELTIGPDE 733
>gi|298374050|ref|ZP_06984008.1| periplasmic beta-glucosidase [Bacteroides sp. 3_1_19]
gi|298268418|gb|EFI10073.1| periplasmic beta-glucosidase [Bacteroides sp. 3_1_19]
Length = 758
Score = 242 bits (618), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 188/632 (29%), Positives = 298/632 (47%), Gaps = 100/632 (15%)
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGEN-WKSDDGGIGFGFREKRV 223
+AP ++I RD RWGR E GEDP + S A V+ FQG N W+S
Sbjct: 162 FAPMVDISRDARWGRVMEGAGEDPYLGSLIAKARVEGFQGGNDWRS-------------- 207
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWG---NFSRYSFNAMITEQDTEDTFQPPFR 280
D + ACCKH AY + G N S S N ++ + + PP+
Sbjct: 208 ---------LADVNTVLACCKHFAAYGAAEAGRDYNTSELSQNTLM------NYYMPPYL 252
Query: 281 SCIEQGKASCIMCSYNQVNGVPAC----LRGDLFQKARNEWGFKGYITSDCDAVATIFEY 336
+ E G A+ M S+N++NGVP+ L DL +K +WGF G++ +D I E
Sbjct: 253 AAKEAGVAT-FMASFNEINGVPSTGNKWLMTDLLRK---DWGFNGFVVTDYTG---INEM 305
Query: 337 QNYTKTHEDSAAGVL--KAGMDIN-CGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQL 393
++ D AG L AG+D++ G ++ ++ +GKV E++IDRA+ ++ ++
Sbjct: 306 VAHSIVRNDKEAGELAANAGIDMDMTGGIYSQYLVQSVKEGKVSEENIDRAVASILEMKF 365
Query: 394 RLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
LGLF+ DP + + + + E + A + + + IVLLKND F P++K+ ++A
Sbjct: 366 LLGLFD-DPYRYLDNEREKNTIMKPEFLQEARETSARSIVLLKNDNNFFPISKDKNITVA 424
Query: 454 IIGPLVNNISQMGGGYTGIPCSPK--SLLRGLEAYVSKTH----YASGCHDVPCNSDAGF 507
+IGP+V + G + G + SL GL + T+ YA GC D+ + + F
Sbjct: 425 LIGPMVKDKINQNGEWAGRGEREESISLFEGLTEKYAGTNVKFIYAEGC-DLLTDDSSKF 483
Query: 508 HEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTG 567
EA+ A++AD V+ G D E R L LPG Q +L+ + +T K P+ L+L
Sbjct: 484 AEAIATARRADIVLAAMGEDFNWSGEAACRTDLKLPGAQQALLKELKKTGK-PLGLILVN 542
Query: 568 GGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM 627
G PLD+S+ D + IL Y G +A++I GD+NP RL M+ +P + ++P+
Sbjct: 543 GRPLDLSW--EDQHVDGILEAWYLGTMAGHGMADVISGDYNPSARLTMS-FPRTVGQLPL 599
Query: 628 NDMNMRADSSRQYPGRS--------YRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS 679
+ + R P + Y T +Y FG+GLSYT ++
Sbjct: 600 --YYNQKPTGRPVPPEAPDTDYKSRYMDVPNTPLYPFGYGLSYTTFA------------- 644
Query: 680 ASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVP 739
++ + +D+ + + V V N G VDG VV ++ R
Sbjct: 645 ------------------VNSMKLDQNSFTKGGKITVMAEVENTGKVDGETVVQMYIRDL 686
Query: 740 KVSQGTPEKQLIGFDRVHTVAKGSKEISFGVD 771
S P K+L GF++V A K++SF +D
Sbjct: 687 AGSVTRPVKELKGFEKVTLKAGEKKQVSFTID 718
>gi|315225249|ref|ZP_07867066.1| periplasmic beta-glucosidase [Capnocytophaga ochracea F0287]
gi|420158631|ref|ZP_14665447.1| glycosyl hydrolase family 3, N-terminal domain protein
[Capnocytophaga ochracea str. Holt 25]
gi|314944932|gb|EFS96964.1| periplasmic beta-glucosidase [Capnocytophaga ochracea F0287]
gi|394763447|gb|EJF45542.1| glycosyl hydrolase family 3, N-terminal domain protein
[Capnocytophaga ochracea str. Holt 25]
Length = 770
Score = 242 bits (618), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 219/744 (29%), Positives = 330/744 (44%), Gaps = 115/744 (15%)
Query: 78 IQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNR 137
I++L A RLGIP + + +HG T FP L + S++
Sbjct: 102 IRKLQKIAVEQTRLGIPIL-FGQDVIHGYK---------------TIFPIPLAESCSWDL 145
Query: 138 SLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSAYAV 196
+L A EA A G+ + +AP ++I RD RWGR E GEDP + S A
Sbjct: 146 ALMRKTTELAAREASA------DGINWTFAPMVDITRDARWGRAMEGAGEDPYLGSLIAE 199
Query: 197 EFVKSFQG-ENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG 255
VK FQG +NW++ L AC KH Y + G
Sbjct: 200 ARVKGFQGGDNWQT-----------------------LSSPHTLLACGKHFAGYGAAESG 236
Query: 256 NFSRYSFN-AMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA- 313
+N A ++ + + PP+ + + G S IM S N++NGVPA L +
Sbjct: 237 K----DYNTAELSMHTFRNVYLPPYEATLNAGVGS-IMASLNEINGVPATADKWLLTEVL 291
Query: 314 RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG----MDINCGTCMLRHTQS 369
R EWGF G + SD I E + +D A L A MD+N G +++ +
Sbjct: 292 RKEWGFNGLLVSDYTG---INELVRHGVAKDDKQAANLSANAGIEMDMN-GATFIKYLSA 347
Query: 370 AIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAAR 429
+ +GKV E ID+A+ ++ ++ LGLF+ DP + ++ T E+ K+A A
Sbjct: 348 LVKEGKVTEAQIDKAVRHILEMKFLLGLFD-DPYRYLDETRAKENTFTEEYLKVARQAVA 406
Query: 430 QGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLRGL-EAY 486
+VLLKN+ + LP+ K++ ++A+IGP++NN S + G +T G SL GL E Y
Sbjct: 407 SSVVLLKNEAEVLPIKKDSGKTIAVIGPMMNNTSDINGSWTCLGDGKQSVSLFTGLTEKY 466
Query: 487 VS---KTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLP 543
K YA GC +++ EAV IA+KAD V+V G E R + LP
Sbjct: 467 KGTNVKLLYAEGCGFTTISTEQ-LKEAVAIARKADRVLVAVGEQSNWAGESAVRTDIRLP 525
Query: 544 GQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEII 603
Q L+ ++ +K P+ +V G PLD+S+ + + +IL +PG G +A++I
Sbjct: 526 QAQRQLLEALKAINK-PIAIVTFSGRPLDLSW--ENENVQAILQAWFPGTQGGNGIADVI 582
Query: 604 FGDFNPGGRLPMT----------WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQV 653
GD NP G L M+ +Y T P+ N D Y Y + T +
Sbjct: 583 AGDVNPSGHLTMSFPRNVGQIPIYYNYKSTGRPVYTNNEEVDHRPHYNA-GYLDSSITPL 641
Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
Y FG+GLSYT ++ + + SLK +D I
Sbjct: 642 YPFGYGLSYTTFAISNVHLNKK-----SLKRYNDSII----------------------- 673
Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPC 773
V SV N G +G VV L+ R S P K+L GF+++ A SK++ F + P
Sbjct: 674 --VNASVQNTGTTEGEIVVQLYTRQLVASVSRPVKELKGFEKISLKAGESKQVCFEL-PS 730
Query: 774 EQLSIANKHGRRILPLGNHVLMVG 797
E L+ +G++ +L VG
Sbjct: 731 EALAFYGINGKKDTEPSECLLWVG 754
>gi|218131278|ref|ZP_03460082.1| hypothetical protein BACEGG_02889 [Bacteroides eggerthii DSM 20697]
gi|217986495|gb|EEC52831.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
eggerthii DSM 20697]
Length = 862
Score = 242 bits (618), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 166/470 (35%), Positives = 239/470 (50%), Gaps = 66/470 (14%)
Query: 44 CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESL 103
C P + + TSL+ RA+ L+ LTL+EK+ + DN+ A+ RLGI Y WW+E+L
Sbjct: 17 CSQPTYKT-----TSLAPEKRAELLLKELTLEEKVALMMDNSQAVERLGIKPYNWWNEAL 71
Query: 104 HGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--- 160
HG+A G T FPQ + AASF+ + SAV+ EARA N A
Sbjct: 72 HGVARAG----------LATVFPQPIGMAASFSPESVYEVFSAVSDEARAK-NANYASQG 120
Query: 161 ------GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGI 214
GLT W P +NI+RDPRWGRG ET GEDP + S V VK QG N DG
Sbjct: 121 SYARYQGLTMWTPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGTN----DG-- 174
Query: 215 GFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTED 273
+ D+ L AC KH + +W +R+SFN ++ +D +
Sbjct: 175 ---------------KYDK-----LHACAKHFAVHSGPEW---NRHSFNVEDLSARDLYE 211
Query: 274 TFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVAT 332
T+ PPF + +++ K +MC+YN G P C L Q RN+WGF G I SDC A+A
Sbjct: 212 TYLPPFEALVKEAKVKEVMCAYNSFEGEPCCGSNRLLMQILRNDWGFDGIILSDCGAIAD 271
Query: 333 IFE---YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLF 389
+ ++ Y SAA VL G D+ CG+ + A+ G++ EKDID+A++ L
Sbjct: 272 FYNEHGHKAYPDAESASAAAVLN-GTDLECGSS-YKALVKAVRAGQINEKDIDKAVMRLL 329
Query: 390 SVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAV 449
+ LG + DP + K+ V ++EH LAL+ AR+ + LL+N LPL + +
Sbjct: 330 EARFALGEMD-DPDNVSWTKIPFSVVASAEHDSLALNMARKSMTLLQNTGNILPLKRGGL 388
Query: 450 SSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGC 496
++A++GP N+ G Y G+P ++L G+ + K Y GC
Sbjct: 389 -TVAVMGPNANDSVMQWGNYNGMPPHTVTILGGIRKALGADDKLIYEQGC 437
Score = 119 bits (298), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 96/318 (30%), Positives = 145/318 (45%), Gaps = 63/318 (19%)
Query: 498 DVPCNSDAGFHEAVRI------AKKADFVIVVAGLDLTQETEDR----------DRVSLL 541
D N D GF + V I K AD V+ +G+ E E+ DR +
Sbjct: 574 DAQLNFDIGFKQKVDINASVARVKDADIVVFASGVSPVLEGEEMGVNLPGFKGGDRTDIE 633
Query: 542 LPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAE 601
LP Q L++++ R K+ +ILV G P ++ +IL YPG+ G A+AE
Sbjct: 634 LPAIQRELISALHRAGKK-IILVNCSGSP--IALEPETKNCEAILQAWYPGQQGGTAVAE 690
Query: 602 IIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGL 660
++FGD+NPGGRLP+T+Y + +++P D NM GR+YR+ T ++ FG+GL
Sbjct: 691 VLFGDYNPGGRLPVTFY-RNMSQLPDFEDYNMT--------GRTYRYMTQQPLFPFGYGL 741
Query: 661 SYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISV 720
SYT + Y LS E +K G + +SV
Sbjct: 742 SYTTFDYGKLSLDKE-----QIKQGEP--------------------------LKLAVSV 770
Query: 721 TNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCE-QLSIA 779
TN+G DG VV ++ + ++G P K L F RV+ A S + F + E + A
Sbjct: 771 TNSGQRDGEEVVQVYLKKQDDAEG-PGKTLRAFKRVYIPAGKSVNVEFDLKDKELEWWDA 829
Query: 780 NKHGRRILPLGNHVLMVG 797
+ R+ G + +MVG
Sbjct: 830 QSNTMRVCA-GKYDIMVG 846
>gi|311031835|ref|ZP_07709925.1| glycoside hydrolase family 3 domain protein [Bacillus sp. m3-13]
Length = 719
Score = 242 bits (617), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 202/699 (28%), Positives = 334/699 (47%), Gaps = 92/699 (13%)
Query: 123 TSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQE 182
T FP L S++ L A EA ++ +TF AP +++ RDPRWGR E
Sbjct: 96 TIFPVPLAIGCSWDTELAEKSAEIAAKEA----SVSGVHVTF-APMVDLVRDPRWGRVME 150
Query: 183 TPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSAC 242
+ GEDP + S +A FV+ FQGE+ +D RV +AC
Sbjct: 151 STGEDPYLNSVFAKAFVRGFQGEDLTNDT---------HRV----------------AAC 185
Query: 243 CKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGV 301
KH AY + G +N + ++E+ +++ P +++ +++G +M ++N V+G+
Sbjct: 186 VKHFAAYGAPEGGR----DYNTVNMSERQLRESYLPAYKAALDEG-CEMVMTAFNTVDGI 240
Query: 302 PACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG 360
PA L + R EWGF G + SD AV + + E + G L+AG+DI
Sbjct: 241 PASGNKKLMRDLLREEWGFDGVMISDWGAVKELIPHGVAEDEREAALKG-LEAGVDIEMM 299
Query: 361 T-CMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSE 419
T C ++ + ++ G++ E +D ++L + +++ +LGLF +P +G +L + + ++E
Sbjct: 300 TACYAKNLKDLVESGELSEALVDESVLRILNLKNKLGLFE-NPYRGTSEELEKEVIMSAE 358
Query: 420 HKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMG-GGYTGIPCSPKS 478
H+ +A + A + VLLKND + LPLNK +A+IGP N +G + G +
Sbjct: 359 HRHVAKELAIKSSVLLKND-QVLPLNKK--QKIALIGPFAQNGDVLGPWSWQGSKEAAVQ 415
Query: 479 LLRGLEAYVSKTHY--ASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRD 536
L GL A S+ + A G D+ +D EAV AK AD +++ G E
Sbjct: 416 LYDGLLAKTSEANLLVAQGS-DIEAITDEQLTEAVETAKGADVIVLALGEASEMSGEGGC 474
Query: 537 RVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS--FAEADSQISSILWIGYPGEA 594
R + LP Q+ LV + R +PV+ VL G PLD+ AE+D+ + + W YPG
Sbjct: 475 RADIRLPEAQLKLVAKI-RELGKPVVTVLFNGRPLDLHGVIAESDALLEA--W--YPGTE 529
Query: 595 GAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM-----NDMNMRADSSRQYPGRSYRFYT 649
A+A+++FGD NP GRL M+ +P S ++P+ N + ++ ++ Y
Sbjct: 530 AGSAIADLLFGDANPSGRLTMS-FPYSVGQIPVYYNCFNTGRPQGENKQERYVTHYLDIP 588
Query: 650 GTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSC 709
+ FG+GLSYT ++Y S++++SA D +T
Sbjct: 589 NEPLLPFGYGLSYTTFAY------SDMSLSA-----------------------DSLTEN 619
Query: 710 TSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFG 769
+SL V ++VTN G+ G V + R P K+L G+++V +K+I+F
Sbjct: 620 SSL--DVTVNVTNTGERAGEETVQFYIRDLSGEIVRPLKELKGYEKVWLEPGETKQITFT 677
Query: 770 VDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIETY 808
+ EQL + G M+G + LTI+ +
Sbjct: 678 ITE-EQLRYYHSDLSFQSDKGKFHAMIGPNSNELTIKEF 715
>gi|375145746|ref|YP_005008187.1| Beta-glucosidase [Niastella koreensis GR20-10]
gi|361059792|gb|AEV98783.1| Beta-glucosidase [Niastella koreensis GR20-10]
Length = 866
Score = 242 bits (617), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 152/446 (34%), Positives = 219/446 (49%), Gaps = 54/446 (12%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
P+ N LS RAK LI+ LTL+EK + D + AIPRLGI + WWSE+LHG+A+N
Sbjct: 25 PYQNPDLSSEERAKDLITRLTLEEKASLMFDQSPAIPRLGIKKFNWWSEALHGLANN--- 81
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY---------NLGQAGLT 163
VT FP+ + AASF+ +L I A + E RA Y N L+
Sbjct: 82 -------DKVTVFPEPVGMAASFDDTLVYKIFDATSDEVRAKYHDAIRNGKENRRFLSLS 134
Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
W PNINIFRDPRWGRGQET GEDP + S V VK QG + R
Sbjct: 135 VWTPNINIFRDPRWGRGQETYGEDPYLTSRMGVAVVKGLQGP-------------ADARY 181
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNA-MITEQDTEDTFQPPFRSC 282
K L AC KH + +W SR++ N + +D +T+ P F+S
Sbjct: 182 RK-------------LLACAKHFAVHSGPEW---SRHTLNLYQVRPRDLYETYLPAFKSL 225
Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTK 341
+ + +MC+Y +++ P C L Q R+ WGFK + SDC A+ F +
Sbjct: 226 VMEADVRQVMCAYQRLDDEPCCGNNRLLQTILRDNWGFKHVVVSDCGAITDFFTSHKVSS 285
Query: 342 THEDSAAGVLKAGMDINC--GTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
++A AG D+ C ++ A+ +G + EK+ID+ LL + + + LG +
Sbjct: 286 DAVHASAKGALAGTDVECVWEGYAFKNLPEAVSRGLITEKEIDKHLLRVLAGRFDLGEMD 345
Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
D + + V H+KLALD A++ + LL+N LPL +NA +A+IGP
Sbjct: 346 DDAVV-PWAAIPMSVVNNEAHRKLALDMAQESMTLLQNRNNILPLKRNA-KKIAVIGPNA 403
Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEA 485
+N + G Y G P +++ G+ A
Sbjct: 404 DNDPVLWGNYNGRPVRTINIVSGIRA 429
Score = 90.9 bits (224), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 73/304 (24%), Positives = 124/304 (40%), Gaps = 54/304 (17%)
Query: 504 DAGFHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSV 553
D F + K D V+ V GL E E+ DR + LP Q + + ++
Sbjct: 591 DIDFSGLISKLKGIDEVVFVGGLSTLLEGEEMPVSYPGFKGGDRTDIQLPEVQRNCLKAL 650
Query: 554 ARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRL 613
K+ + + +G + F +IL Y GE+G +A+A+++FGD+NP G+L
Sbjct: 651 KAAGKKVIFVNCSGSA---IGFVPETESCDAILQAWYGGESGGQAVADVLFGDYNPSGKL 707
Query: 614 PMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAP 673
P+T+Y ++ D +++ GR+YRF + ++ FG GLSYT +S
Sbjct: 708 PVTFYKDTTQLAGFEDYSLK--------GRTYRFMS-DPLFPFGFGLSYTTFS------- 751
Query: 674 SELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVM 733
+ + + S + I VTN G +G+ +V
Sbjct: 752 ------------------------IGTAQLSKTNISNSESIELSIPVTNTGKRNGTEIVQ 787
Query: 734 LFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHV 793
++ R S G P K + GF R+ A + + P K G+ + G +
Sbjct: 788 VYVRKTGDSDG-PLKTMRGFKRIAIPAGKTTTAVINLPPAAFEFYDEKFGKMTVAAGEYE 846
Query: 794 LMVG 797
L+ G
Sbjct: 847 LLYG 850
>gi|317474221|ref|ZP_07933497.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
gi|316909531|gb|EFV31209.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
Length = 786
Score = 242 bits (617), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 218/745 (29%), Positives = 342/745 (45%), Gaps = 129/745 (17%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ + T FP A++N+ L S I A
Sbjct: 137 RLGIPV-DFTNEGIRGLCHD-----------RATMFPAQCGQGATWNKELISEIARVTAE 184
Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
EA+A+ G T ++P ++I +DPRWGR E GEDP +V
Sbjct: 185 EAKAL------GYTNIYSPILDIAQDPRWGRVVECYGEDPFLV----------------- 221
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
G +G KR++KG E L A KH Y + G + + +
Sbjct: 222 ---GELG-----KRMIKGLQAEG-------LVATPKHFAVYSVPVGGRDAGTRTDPHVAP 266
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGD---LFQKARNEWGFKGYITS 325
++ + PFR + A +M SYN +G P + G L + R+EWGFKGY+ S
Sbjct: 267 REMRTLYLEPFRKAFCEAGALGVMSSYNDYDGEP--ITGSYHFLTEILRHEWGFKGYVVS 324
Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCM-------LRHTQSAIDKGKVQE 378
D +AV ++ D AA V+ AG+++ + LRH AI +GK+
Sbjct: 325 DSEAVEFLYSKHKVAADAVDGAAQVINAGLNVRTNFSLPENFIYPLRH---AISEGKISM 381
Query: 379 KDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKND 438
+ ID + ++ V+ +GLF+ +P KG KL C EH+ +++ AA + IVLLKN+
Sbjct: 382 QTIDSRVADVLRVKFMMGLFD-NPYKGDTKKLEKVVHC-KEHQNVSMRAALESIVLLKNE 439
Query: 439 KKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGC 496
LPL+K ++ +A+IGP + ++ Y K++ +G++ Y+ S+ YA G
Sbjct: 440 NNILPLSK-SIKKVAVIGPNAAEVDKLICRYGPANAPIKTVYQGIKDYLPDSEVRYAKGA 498
Query: 497 ------------HDVPCN--SDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLL 542
+DVP N + EAV +AK++D I+V G + E+ R +L L
Sbjct: 499 DIIDEYFPESELYDVPLNKTEETMIDEAVALAKESDVAIMVLGGNEKTVREEYSRTNLDL 558
Query: 543 PGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEI 602
G+Q L+ +V T K PVILV+ G +++AE I I+ +PGE A+A++
Sbjct: 559 CGRQEKLLQAVYATGK-PVILVMIDGRAATINWAE--RYIPGIVHAWFPGEFMGDAIAKV 615
Query: 603 IFGDFNPGGRLPMTWYPESFTKVPMN-DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLS 661
IFGD+NPGG+L +T +P S ++P +DS + + TG+ +Y FG+GLS
Sbjct: 616 IFGDYNPGGKLAVT-FPRSVGQIPFAFPFKPGSDS------KGFVRVTGS-LYPFGYGLS 667
Query: 662 YTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVT 721
YT + Y L + + I Q +L SC V
Sbjct: 668 YTTFDYSALKIENPV-------------IGTQGSVKL---------SC---------RVK 696
Query: 722 NAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANK 781
N G + G VV L+ S T K L GF+R+H K + F + P + + + NK
Sbjct: 697 NTGKIAGDEVVQLYLHDEVSSVTTYVKVLRGFERIHLQPGEEKVVDFVLTP-QDMGLWNK 755
Query: 782 HGRRILPLGNHVLMVGELRHSLTIE 806
I+ G +MVG + ++
Sbjct: 756 DNHFIVEPGTFAVMVGSSSQDIRLQ 780
>gi|160891510|ref|ZP_02072513.1| hypothetical protein BACUNI_03961 [Bacteroides uniformis ATCC 8492]
gi|156858917|gb|EDO52348.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
uniformis ATCC 8492]
Length = 756
Score = 242 bits (617), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 184/619 (29%), Positives = 295/619 (47%), Gaps = 81/619 (13%)
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
+AP ++I RD RWGR E GEDP + S + V+ FQGE
Sbjct: 161 FAPMVDISRDARWGRVMEGSGEDPYLGSLLSAARVRGFQGEK------------------ 202
Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
E+ R D+++ AC KH AY + G +N ++E+ D + PPF++
Sbjct: 203 ---PEDLMRLDKML--ACAKHFCAYGAAEAGR----DYNTTDVSERSLRDIYFPPFKAAK 253
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
+ G A+ M ++N+++GVP L+Q R+EW F G++ +D A+ + + +
Sbjct: 254 DAGVAT-FMTAFNEISGVPCTSSKFLYQDVLRDEWRFNGFVVTDYTAINELVPH-GVARD 311
Query: 343 HEDSAAGVLKAGMDIN-CGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
+A AG++++ G H A+ +GKV E+ ID A+ + ++ LG+ + D
Sbjct: 312 EAHAAELAANAGIEMDMTGGVFHAHLLQAVKEGKVNEETIDNAVRRILEMKFLLGIMD-D 370
Query: 402 PRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
P + + + E + A DAAR+ +VLLKN+ F P+ + ++A+IGP+V
Sbjct: 371 PYRYLNEEREKATIMKPEFLEAARDAARKSVVLLKNENDFFPIQPSERKTVALIGPMVKE 430
Query: 462 ISQMGGGYTGIPCSPKS--LLRGLEAYVSKTH----YASGCHDVPCNSDAGFHEAVRIAK 515
+ + GG+ G +S L GLE ++ YA GC D+ AGF +AV +A+
Sbjct: 431 RNSVNGGWGGRGDRQRSVTLFEGLEKKYGNSNVRFLYAEGC-DLRKPGTAGFAQAVSVAR 489
Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
+AD ++V AG D E R + LP Q L+ + +T K P+ LVL G PL++++
Sbjct: 490 QADVILVAAGEDQNWSAEAACRTDITLPASQRDLLKELKKTGK-PIGLVLMNGRPLELTW 548
Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM--NDMN-- 631
D + +IL YPG G A+A++I GD+NP G+L M+ +P S ++P+ N N
Sbjct: 549 --EDENMDAILEAWYPGTMGGHAIADVIAGDYNPAGKLTMS-FPRSVGQLPLYYNHKNTG 605
Query: 632 --MRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKN 689
+ D+ + SY + +Y FG+GLSYT++ L E LK G
Sbjct: 606 RPLPPDNPKMDYKSSYIDCPNSPLYPFGYGLSYTSFEVDNLKLDKE-----ELKKGE--- 657
Query: 690 ILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQ 749
V + V N G V G VV L+ R S P K+
Sbjct: 658 -----------------------TLTVTVDVANIGKVGGEEVVQLYIRDLVGSVTRPVKE 694
Query: 750 LIGFDRVHTVAKGSKEISF 768
L GF +++ A K ++F
Sbjct: 695 LKGFQKLYLKAGEKKSLTF 713
>gi|393782348|ref|ZP_10370533.1| hypothetical protein HMPREF1071_01401 [Bacteroides salyersiae
CL02T12C01]
gi|392673619|gb|EIY67078.1| hypothetical protein HMPREF1071_01401 [Bacteroides salyersiae
CL02T12C01]
Length = 852
Score = 242 bits (617), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 157/441 (35%), Positives = 222/441 (50%), Gaps = 57/441 (12%)
Query: 64 RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
R L+S LT++EKI L ++A IPRL I Y +E+LHGI G T
Sbjct: 40 RLLDLLSRLTIEEKISLLVNDAREIPRLNIDKYYHGNEALHGIVRPG----------EFT 89
Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG----------LTFWAPNINIFR 173
FPQ + AA++N L + SA++ EAR + G LTFW+P +N+ R
Sbjct: 90 VFPQAIGLAATWNPGLIFEVSSAISDEARGRWKELDYGKKQIAGASDLLTFWSPTVNMAR 149
Query: 174 DPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDR 233
DPRWGR ET GEDP + EFVK QG++ R LK
Sbjct: 150 DPRWGRTPETYGEDPFLTGVIGCEFVKGLQGDH--------------PRYLKTV------ 189
Query: 234 GDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMC 293
+ KH A + E +R S NA ++E+D + + P F CI KA IM
Sbjct: 190 -------STPKHFAANNEEH----NRSSCNARMSERDLREFYLPSFERCIVDAKAQSIMM 238
Query: 294 SYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLK 352
+YN VNGVP + L + R +WGF GYI SDC A + Y + + +A +K
Sbjct: 239 AYNAVNGVPCTVNTYLIKNVLRGDWGFNGYIVSDCSAPEWMVTKHKYVRDLDAAATLAIK 298
Query: 353 AGMDINCGTCMLRH-TQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLG 411
AG+D+ CG + A ++ V + DID A + ++ LGLF+ DP + Y ++
Sbjct: 299 AGLDLECGDRVYTAPLLKAYNESMVSKADIDSAAYRVLRGRMLLGLFD-DPSQNPYNQIE 357
Query: 412 PDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTG 471
P + +H++LAL+ ARQ +VLLKN K FLPLN V S+A++G +N G Y+G
Sbjct: 358 PSVIGCKKHQELALETARQSMVLLKNQKNFLPLNLKKVKSIAVVG--INAGHCEFGDYSG 415
Query: 472 IP-CSPKSLLRGLEAYVSKTH 491
IP +P S+L G+ Y K +
Sbjct: 416 IPKNAPVSVLDGIRKYAEKAN 436
Score = 140 bits (353), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 90/303 (29%), Positives = 148/303 (48%), Gaps = 48/303 (15%)
Query: 505 AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILV 564
A F +A + AK+ D + V G++ + E E +DR SL LP Q + + + + V+ V
Sbjct: 592 ALFGDAGKAAKECDVTVAVLGINKSIEREGQDRYSLELPTDQQEFIRELYKVNPNTVV-V 650
Query: 565 LTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK 624
L G L +++ D + +IL YPGE G A+AE++FGD+NPGGRLP+T+Y S +
Sbjct: 651 LVAGSSLAINWI--DENVPAILNAWYPGEQGGTAIAEVLFGDYNPGGRLPLTYY-NSLDE 707
Query: 625 VPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKA 684
+P D + R+Y+++ G +Y FG+GLSYT ++YK
Sbjct: 708 LPSFD-------NYSVQNRTYQYFKGKPLYEFGYGLSYTKFNYK---------------- 744
Query: 685 GSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQG 744
KN+ S + + V+NAG DG V ++ + P+
Sbjct: 745 --KKNV-----------------SIANDTIDITFKVSNAGKYDGDEVAQVYVQYPETGTY 785
Query: 745 TPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRR-ILPLGNHVLMVGELRHSL 803
P KQL GF RVH S +++ V P ++L ++ R+ + P G +V ++G +
Sbjct: 786 MPLKQLRGFSRVHIKKGKSADVTISV-PKKELRYWDEKTRQFVTPEGKYVFLIGSSSEDI 844
Query: 804 TIE 806
++
Sbjct: 845 KLQ 847
>gi|334365132|ref|ZP_08514098.1| glycosyl hydrolase family 3 N-terminal domain protein [Alistipes
sp. HGB5]
gi|313158675|gb|EFR58064.1| glycosyl hydrolase family 3 N-terminal domain protein [Alistipes
sp. HGB5]
Length = 771
Score = 242 bits (617), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 205/744 (27%), Positives = 323/744 (43%), Gaps = 123/744 (16%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP + E+ HG + G T+FP A+++N L +G +A
Sbjct: 120 RLGIPLF-LAEEAPHGHMAIG-----------ATTFPTAPGQASTWNPELIERMGKVIAA 167
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
E R Q G + P ++I RDPRW R +E+ GED + + +V+
Sbjct: 168 EIRL-----QGGHICYGPVLDIVRDPRWSRTEESYGEDCYLTARIGEAYVR--------- 213
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
G G G D + KH IAY + G + + ++ E+
Sbjct: 214 ---GTGSG--------------DLSQSRHALSTLKHFIAYGASEGG---QNGGSNLLGER 253
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACL-RGDLFQKARNEWGFKGYITSDCD 328
+ +T+ PPF + ++ G A +M +YN V+G+P R L R EWGF G++ SD
Sbjct: 254 ELRETYLPPFEAAVKAG-ARSVMTAYNSVDGIPCTANRRMLTDILRGEWGFDGFVVSDLL 312
Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
++ + E + ++A L+AG+D + + A + G V E +IDRA+ +
Sbjct: 313 SIEGLHETHGVAGSVREAAVQALRAGVDADLKGGAFASLREAAEAGDVAEAEIDRAVERV 372
Query: 389 FSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNA 448
+++ +GLF +P + +V + H +LAL+AARQ + LL+N LPL+
Sbjct: 373 LALKFEMGLFE-NP---YIDEAAAAEVGCAAHSELALEAARQSVTLLENRSGTLPLDPRR 428
Query: 449 VSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGCHDVPCNSDAG 506
+ +A+IGP +NI G YT + ++ GLE + + Y+ GC V +
Sbjct: 429 LRRVAVIGPNADNIYNQLGDYTAQQTAANTVRDGLEKLLGRDRVVYSRGC-TVRGGDRSE 487
Query: 507 FHEAVRIAKKADFVIVVAG----LDLTQE-------------------TEDRDRVSLLLP 543
AV A+ D +VV G D E E DR +L L
Sbjct: 488 IAAAVSAARGTDAAVVVIGGSSARDFDTEFLQTGAAKAAHDEVRDMECGEGFDRATLALL 547
Query: 544 GQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEII 603
G+Q L+ + T P+I+V G PLD+ A Q ++L YPG G A+AE I
Sbjct: 548 GEQEELLRRIKATGT-PLIVVCIAGRPLDLR--RASEQADALLMAWYPGARGGDAVAETI 604
Query: 604 FGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYT 663
G NP GRLP+T P + ++P+ R + Y T +Y FG+GLSY+
Sbjct: 605 LGRNNPAGRLPIT-IPRAEGQIPVYYNKKRPAN------HDYTDLTAAPLYPFGYGLSYS 657
Query: 664 NYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNA 723
+ Y L A S N+L+ SC + N
Sbjct: 658 TFEYGSLEARQ-----------SGDNVLE--------------VSC---------RIRNT 683
Query: 724 GDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHG 783
D +G VV L+ S P +QL GF R+ +++SF + E L++ + G
Sbjct: 684 SDREGDEVVQLYISDMVASTVRPPRQLGGFRRIRLAPGEQRQVSFTLGD-EALALIDPQG 742
Query: 784 RRILPLGNHVLMVGELRHSLTIET 807
RR++ G+ V+ VG + ++T
Sbjct: 743 RRVVEKGDFVIAVGSSSQDIRLQT 766
>gi|332982620|ref|YP_004464061.1| glycoside hydrolase [Mahella australiensis 50-1 BON]
gi|332700298|gb|AEE97239.1| glycoside hydrolase family 3 domain protein [Mahella australiensis
50-1 BON]
Length = 753
Score = 242 bits (617), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 216/744 (29%), Positives = 345/744 (46%), Gaps = 131/744 (17%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIPA E L G ++G T FPQ + A++++ + +
Sbjct: 89 RLGIPAI-VHEECLSGFMADG-----------ATVFPQAIGLASTWDAEAIEAMAGVIRQ 136
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
+ +A GL+ P +++ RDPRWGR +ET GEDP +V++ AV +V+ QG++
Sbjct: 137 QMKAAG--AHQGLS---PVLDVARDPRWGRVEETFGEDPYLVASMAVSYVRGLQGQD--- 188
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
+ KG + A KH + + G R + E+
Sbjct: 189 -------------LTKG------------IFATLKHFAGHSFSEGG---RNCAPVHVGER 220
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVP-ACLRGDLFQKARNEWGFKGYITSDCD 328
+ D F PF + + + A +M +Y+ ++GVP A R L R +GF G + SD D
Sbjct: 221 ELWDIFLFPFEAAVREANAKSVMNAYHDIDGVPCAASRELLTDILRGHFGFDGIVVSDYD 280
Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGT--CMLRHTQSAIDKGKVQEKDIDRALL 386
A+ + + +++A L+AG+DI C + A+ +G + E I+ ++
Sbjct: 281 AIDRLRKAHFTAGNKKEAAVQALEAGIDIELPKMDCYGQPLMDAVKEGMISEATINESVE 340
Query: 387 NLFSVQLRLGLFNG-----DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF 441
+ + + LGLF+G D G + T E ++++ D AR+ IVLLKND
Sbjct: 341 RVLTAKFELGLFDGVYVDVDSVPGLFE--------TPEQREMSRDIARKSIVLLKNDN-V 391
Query: 442 LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIP--------CSPKSLLRGLEAYV---SKT 490
LPL+K+ + S+A+IGP +N M G Y + ++L G++ V +
Sbjct: 392 LPLSKD-IKSIAVIGPNADNARNMLGDYAFMAHRSYDKTSVHIVTVLEGIKNKVLDSCRI 450
Query: 491 HYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG-----LDLTQETEDRDRVSLLLPGQ 545
YA GC + ++D GF EAV A+ AD IVV G E+ DR + LPG
Sbjct: 451 TYAKGCDIIDPSTD-GFVEAVNAARAADAAIVVVGDNSGIFGKGTSGENDDRTDITLPGV 509
Query: 546 QMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFG 605
QM LV ++ T K PVI+VL G + E S+++ YPGE G A+A+++FG
Sbjct: 510 QMQLVKAIKDTGK-PVIVVLINGRAF--AAKELADNASALMEAWYPGEEGGNAVADVLFG 566
Query: 606 DFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQV---YGFGHGLSY 662
D+NP GRLP++ P ++P+N N++ SY Y T+ + FG+G+SY
Sbjct: 567 DYNPAGRLPISL-PCEVGQIPIN-YNLKP--------ASYINYLSTETKPAFAFGYGMSY 616
Query: 663 TNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTN 722
T + Y S+L+I+ ++ + K + VTN
Sbjct: 617 TTFGY------SDLSITPAVAPSAGK-------------------------VDISFKVTN 645
Query: 723 AGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
AG + G VV L+ R S P K+L GF RV+ +KEI+F + +QL+ +K
Sbjct: 646 AGQLAGDEVVQLYIRDEVSSIVRPVKELKGFKRVNLQPGETKEITFTL-YADQLAFHDKD 704
Query: 783 GRRILPLGNHVLMVGELRHSLTIE 806
R ++ G +MVG + +E
Sbjct: 705 MRLVVEPGTFKIMVGSSSDDIRLE 728
>gi|300726322|ref|ZP_07059774.1| beta-xylosidase B [Prevotella bryantii B14]
gi|291292284|gb|ADD92014.1| Xyl3A [Prevotella bryantii B14]
gi|299776347|gb|EFI72905.1| beta-xylosidase B [Prevotella bryantii B14]
Length = 885
Score = 242 bits (617), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 153/456 (33%), Positives = 225/456 (49%), Gaps = 52/456 (11%)
Query: 53 PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
P+ N +LS RA L LTL+EK + D + AIPRLGI + WWSE+LHG A+ G
Sbjct: 49 PYQNPNLSAYERAIDLCHRLTLEEKALLMQDESPAIPRLGIKKFFWWSEALHGAANMG-- 106
Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY-----NLGQ----AGLT 163
+VT+FP+ + A+SFN +L ++ SA + E RA Y N G+ L+
Sbjct: 107 --------NVTNFPEPIAMASSFNPTLLKSVFSAASDEMRAQYHHRMDNGGEDEKFHSLS 158
Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
W PN+NIFRDPRWGRGQET GEDP + S V+ QG
Sbjct: 159 VWTPNVNIFRDPRWGRGQETYGEDPYLTSVMGCAVVEGLQGP------------------ 200
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSC 282
ES + +L AC KH + + +R++ N I+ +D +T+ P F+S
Sbjct: 201 ------ESSKYRKLW--ACAKHFAVHSGPE---STRHTANLNNISPRDLYETYLPAFQST 249
Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTK 341
++ G +MC+Y +++ P C L Q+ R EWGFK + SDC AV+ I++ +
Sbjct: 250 VQDGHVREVMCAYQRLDDEPCCSNNRLLQQILREEWGFKYLVVSDCGAVSDIWQSHKTSS 309
Query: 342 THEDSAAGVLKAGMDINCGT-CMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNG 400
++ AG D+ CG A+ +G + E++ID+ ++ L + LG +
Sbjct: 310 DAVHASRQATLAGTDVECGYGYTYAKIPEAVKRGLLTEEEIDKHVIRLLEGRFDLGEMD- 368
Query: 401 DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVN 460
D + ++ K+ + H +LALD ARQ IVLL+N LPL +A+IGP +
Sbjct: 369 DSKLVEWSKIPYSIMSCKAHAQLALDMARQSIVLLQNKGNILPLQLKKNERIAVIGPNAD 428
Query: 461 NISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGC 496
N M G Y G P S+L G+ Y C
Sbjct: 429 NKPMMWGNYNGTPNHTVSILEGIRKQYKNVVYLPAC 464
Score = 76.3 bits (186), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 69/294 (23%), Positives = 127/294 (43%), Gaps = 59/294 (20%)
Query: 515 KKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILV 564
K D VI V G+ + E E+ DR + +P Q + ++A K+ +++
Sbjct: 624 KGIDKVIFVGGIAPSLEGEEMPVNIPGFKGGDRTDIEMPQVQRDFIKALAEAGKQIILVN 683
Query: 565 LTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK 624
+G ++ + +I+ YPG+ G A+A+I+ G NP G+LP+T+Y +
Sbjct: 684 CSGSA---IALTPEAQRCQAIIQAWYPGQEGGTAVADILMGKVNPMGKLPVTFYKSTQQL 740
Query: 625 VPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKA 684
D +M+ R+YR++ +Y FG+GLSYT S +
Sbjct: 741 PDFEDYSMK--------NRTYRYFEDA-LYPFGYGLSYT-----------------SFEI 774
Query: 685 GSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQG 744
G+ K + + T+ +QI VTN G +G+ +V ++ R +G
Sbjct: 775 GTAK-----------------LQTLTNNSITLQIPVTNTGKREGTELVQVYLRRDDDVEG 817
Query: 745 TPEKQLIGFDRVHTVAKGSKEISFGVDPCE-QLSIANKHGRRILPLGNHVLMVG 797
P K L F + A +K+ ++ + + A+ + R++P G + + G
Sbjct: 818 -PSKTLRSFAHITLKAGETKKAILKLNRNQFECWDASTNTMRVIP-GKYTIFYG 869
>gi|317478381|ref|ZP_07937545.1| glycosyl hydrolase family 3 N terminal domain-containing protein
[Bacteroides sp. 4_1_36]
gi|316905540|gb|EFV27330.1| glycosyl hydrolase family 3 N terminal domain-containing protein
[Bacteroides sp. 4_1_36]
Length = 756
Score = 242 bits (617), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 184/619 (29%), Positives = 295/619 (47%), Gaps = 81/619 (13%)
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
+AP ++I RD RWGR E GEDP + S + V+ FQGE
Sbjct: 161 FAPMVDISRDARWGRVMEGSGEDPYLGSLLSAARVRGFQGEK------------------ 202
Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
E+ R D+++ AC KH AY + G +N ++E+ D + PPF++
Sbjct: 203 ---PEDLMRLDKML--ACAKHFCAYGAAEAGR----DYNTTDVSERSLRDIYFPPFKAAK 253
Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
+ G A+ M ++N+++GVP L+Q R+EW F G++ +D A+ + + +
Sbjct: 254 DAGVAT-FMTAFNEISGVPCTSSKFLYQDVLRDEWRFNGFVVTDYTAINELVPH-GVARD 311
Query: 343 HEDSAAGVLKAGMDIN-CGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
+A AG++++ G H A+ +GKV E+ ID A+ + ++ LG+ + D
Sbjct: 312 EAHAAELAANAGIEMDMTGGVFHAHLLQAVKEGKVNEETIDNAVRRILEMKFLLGIMD-D 370
Query: 402 PRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
P + + + E + A DAAR+ +VLLKN+ F P+ + ++A+IGP+V
Sbjct: 371 PYRYLNEEREKATIMKPEFLEAARDAARKSVVLLKNENNFFPIQPSERKTVALIGPMVKE 430
Query: 462 ISQMGGGYTGIPCSPKS--LLRGLEAYVSKTH----YASGCHDVPCNSDAGFHEAVRIAK 515
+ + GG+ G +S L GLE ++ YA GC D+ AGF +AV +A+
Sbjct: 431 RNSVNGGWGGRGDRQRSVTLFEGLEKKYGNSNVRFLYAEGC-DLRKPGTAGFAQAVSVAR 489
Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
+AD ++V AG D E R + LP Q L+ + +T K P+ LVL G PL++++
Sbjct: 490 QADVILVAAGEDQNWSAEAACRTDITLPASQRDLLKELKKTGK-PIGLVLMNGRPLELTW 548
Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM--NDMN-- 631
D + +IL YPG G A+A++I GD+NP G+L M+ +P S ++P+ N N
Sbjct: 549 --EDENMDAILEAWYPGTMGGHAIADVIAGDYNPAGKLTMS-FPRSVGQLPLYYNHKNTG 605
Query: 632 --MRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKN 689
+ D+ + SY + +Y FG+GLSYT++ L E LK G
Sbjct: 606 RPLPPDNPKMDYKSSYIDCPNSPLYPFGYGLSYTSFEVDNLKLDKE-----ELKKGE--- 657
Query: 690 ILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQ 749
V + V N G V G VV L+ R S P K+
Sbjct: 658 -----------------------TLTVTVDVANIGKVGGEEVVQLYIRDLVGSVTRPVKE 694
Query: 750 LIGFDRVHTVAKGSKEISF 768
L GF +++ A K ++F
Sbjct: 695 LKGFQKLYLKAGEKKSLTF 713
>gi|167765233|ref|ZP_02437346.1| hypothetical protein BACSTE_03621 [Bacteroides stercoris ATCC
43183]
gi|167696861|gb|EDS13440.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
stercoris ATCC 43183]
Length = 818
Score = 241 bits (616), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 209/777 (26%), Positives = 342/777 (44%), Gaps = 141/777 (18%)
Query: 52 YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
YPF N + +++T + + RLGIP ++ +E +HG+
Sbjct: 136 YPFSNHAEAVNTVQRWFVEET-----------------RLGIPV-DFTNEGIHGLNH--- 174
Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLT-FWAPNIN 170
+ T P + +++N+ L G EA+A+ G T +AP ++
Sbjct: 175 --------TKATPLPAPIAIGSTWNKELVRRAGVIAGQEAKAL------GYTNVYAPILD 220
Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
I RDPRWGR E GE+P +++A E V Q +
Sbjct: 221 IVRDPRWGRTLECYGEEPYLIAALGTEMVNGIQSQG------------------------ 256
Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
++A KH Y + K G + + ++ + F PF+ I+
Sbjct: 257 --------VAATLKHYAVYSVPKGGRDGNCRTDPHVAPRELHELFLYPFKKVIQNSHPMG 308
Query: 291 IMCSYNQVNGVPACLRGD-LFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
+M SYN +GVP L + R E+GF GY+ SD +AV + + T++++
Sbjct: 309 VMSSYNDWDGVPVSASYYFLTELLREEYGFDGYVVSDSEAVEFVESKHHVADTYDEAVRQ 368
Query: 350 VLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN----GD 401
VL+AG+++ + + + +++ K+ ID+ + + V+ RLGLF+ D
Sbjct: 369 VLEAGLNVRTHFTPPSDFILPIRRLLEEKKISMAVIDKRVSEVLRVKFRLGLFDQPYVAD 428
Query: 402 PRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
+ ++G D K++ +Q +VLLKN+ LPL+K + + + GPL +
Sbjct: 429 TKAAD--RVGGADRNMDFVKQMQ----QQALVLLKNENNILPLDKRQIKKVLVTGPLADE 482
Query: 462 ISQMGGGYTGIPCSPKSLLRGLEAY---VSKTHYASGCHDVPCNSDA------------- 505
+ M Y ++L GL Y +++ YA GC V A
Sbjct: 483 DNFMTSRYGPNGLETVTVLAGLRNYLKGIAEVDYAKGCDIVDAGWPATEILPAPMSEQEK 542
Query: 506 -GFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILV 564
G EAV A ++D +I V G D + E R R SL LPG+Q L+ ++ T K PVILV
Sbjct: 543 QGIAEAVAKAGESDVIIAVLGEDEYRTGESRSRTSLDLPGRQQQLLEALHATGK-PVILV 601
Query: 565 LTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK 624
L G PL V++A A I +IL +PG G +AE +FG+ NPGG+L +T +P+S +
Sbjct: 602 LINGQPLTVNWANA--YIPAILESWFPGCQGGTVIAETLFGEHNPGGKLTVT-FPKSVGQ 658
Query: 625 VPMNDMNMRADSSRQYPGRSYRFYTGTQV----YGFGHGLSYTNYSYKFLSAPSELTISA 680
+ +N + S P T++ Y FG GLSYT ++Y S+L +S
Sbjct: 659 IELN-FPFKPGSHGAQPHSGPNGSGATRIIGELYPFGFGLSYTTFAY------SDLEVSP 711
Query: 681 SLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPK 740
LQQ T + ++++VTN G G VV L+ R
Sbjct: 712 ----------LQQH---------------TQGEYTIKVNVTNTGKRAGDEVVQLYVRDKV 746
Query: 741 VSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
S T + QL GF+RV +++++F + P E L I +++ + G +M+G
Sbjct: 747 SSVITYDSQLRGFERVSLQPGETRQVTFSLKP-EDLQILDRNMNWTVEPGEFEVMIG 802
>gi|255013061|ref|ZP_05285187.1| beta-glucosidase [Bacteroides sp. 2_1_7]
gi|410102523|ref|ZP_11297449.1| hypothetical protein HMPREF0999_01221 [Parabacteroides sp. D25]
gi|409238595|gb|EKN31386.1| hypothetical protein HMPREF0999_01221 [Parabacteroides sp. D25]
Length = 758
Score = 241 bits (616), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 191/632 (30%), Positives = 298/632 (47%), Gaps = 100/632 (15%)
Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGEN-WKSDDGGIGFGFREKRV 223
+AP ++I RD RWGR E GEDP + S A V+ FQG N W+S
Sbjct: 162 FAPMVDISRDARWGRVMEGAGEDPYLGSLIAKARVEGFQGGNDWRS-------------- 207
Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWG---NFSRYSFNAMITEQDTEDTFQPPFR 280
D + ACCKH AY + G N S S N ++ + + PP+
Sbjct: 208 ---------LADVNTVLACCKHFAAYGAAEAGRDYNTSELSQNTLM------NYYMPPYL 252
Query: 281 SCIEQGKASCIMCSYNQVNGVPAC----LRGDLFQKARNEWGFKGYITSDCDAVATIFEY 336
+ E G A+ M S+N++NGVP+ L DL R +WGF G++ +D I E
Sbjct: 253 AAKEAGVAT-FMASFNEINGVPSTGNKWLMTDLL---REDWGFNGFVVTDYTG---INEM 305
Query: 337 QNYTKTHEDSAAGVL--KAGMDIN-CGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQL 393
++ D AG L AG+D++ G ++ ++ +GKV E++IDRA+ ++ ++
Sbjct: 306 VAHSIVRNDKEAGELAANAGIDMDMTGGIYSQYLVQSVKEGKVSEENIDRAVASILEMKF 365
Query: 394 RLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
LGLF+ DP + + + + E + A + + + IVLLKND F P++K+ ++A
Sbjct: 366 LLGLFD-DPYRYLDNEREKNTIMKPEFLQEARETSARSIVLLKNDNNFFPISKDKHITVA 424
Query: 454 IIGPLVNNISQMGGGYTGIPCSPK--SLLRGLEAYVSKTH----YASGCHDVPCNSDAGF 507
+IGP+V + G + G + SL GL + T+ YA GC D+ + + F
Sbjct: 425 LIGPMVKDKINQNGEWAGRGEREESISLFEGLTEKYAGTNVKFIYAEGC-DLLTDDSSKF 483
Query: 508 HEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTG 567
EA+ A++AD V+ G D E R L LPG Q +L+ + +T K P+ L+L
Sbjct: 484 AEAIATARRADIVLAAMGEDFNWSGEAACRTDLKLPGAQQALLKELKKTGK-PLGLILVN 542
Query: 568 GGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM 627
G PLD+S+ D + IL Y G +A++I GD+NP RL M+ +P + ++P+
Sbjct: 543 GRPLDLSW--EDQHVDGILEAWYLGTMAGHGMADVISGDYNPSARLTMS-FPRTVGQLPL 599
Query: 628 NDMNMRADSSRQYPGRS--------YRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS 679
+ + R P + Y T +Y FG+GLSYT ++
Sbjct: 600 --YYNQKPTGRPVPPEAPDTDYKSRYMDVPNTPLYPFGYGLSYTTFAVN----------- 646
Query: 680 ASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVP 739
S+K D+N + G + V V N G VDG V+ ++ R
Sbjct: 647 -SMKL--DQNSFTKGG-----------------KITVTAEVENTGKVDGETVIQMYIRDL 686
Query: 740 KVSQGTPEKQLIGFDRVHTVAKGSKEISFGVD 771
S P K+L GF++V A K++SF +D
Sbjct: 687 AGSVTRPVKELKGFEKVTLKAGEKKQVSFTID 718
>gi|409198859|ref|ZP_11227522.1| periplasmic beta-glucosidase , xylosidase/arabinosidase
[Marinilabilia salmonicolor JCM 21150]
Length = 777
Score = 241 bits (616), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 201/756 (26%), Positives = 331/756 (43%), Gaps = 141/756 (18%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP + E+ HG + G T FP + A+++ L +G +
Sbjct: 124 RLGIPLF-LAEEAPHGHMAIG-----------ATVFPTGIGLASTWAPELIEEVGKVIGR 171
Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
E R Q G + P +++ RD RW R +ET GEDP++ V+ G+
Sbjct: 172 EVRL-----QGGHIGYGPVLDLVRDARWSRVEETYGEDPVLAGIMGAAMVRGMAGDGLHD 226
Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
G L A KH +AY + G + I
Sbjct: 227 KHG--------------------------LIATLKHFLAYGTPEAGQNGGIT---TIGPL 257
Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCD 328
D + PPF+ I G A +M SYN ++G+P+ L R +W F G++ SD
Sbjct: 258 DVMQNYLPPFKEAINAG-ALSVMTSYNSIDGIPSTSNHYLLTDVLRKKWNFDGFVVSDLA 316
Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
++ ++ ++ +++A L AG+D++ G+ ++++ G+VQE ID A+ +
Sbjct: 317 SIEGLYGSHYVAQSVQEAANMALSAGVDVDLGSNAFVKLVNSVNAGEVQESLIDTAVTRV 376
Query: 389 FSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
++ +GLF+ DP K + K+ V ++EH LA + A+ +VLLKND + LPL
Sbjct: 377 LRLKFEMGLFDNPYVDPTKAE--KM----VRSAEHIALAREVAQNTLVLLKNDHEILPLK 430
Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSK-----THYASGCHDVP 500
++ + ++GP +N M G YT P +++ LE ++K Y GC +
Sbjct: 431 RDP-GKILVVGPNADNQYNMLGDYTA-PQDSDNVVTILEGVLNKLSPEQVEYVKGCA-IR 487
Query: 501 CNSDAGFHEAVRIAKKADFVIVVAG----LDLTQET-------------------EDRDR 537
S+ +AV A+ +D ++ V G D E E DR
Sbjct: 488 DTSNVNIDKAVEAAEHSDVILAVVGGSSARDFKTEYLETGAAVANNQSVSDMDAGEGFDR 547
Query: 538 VSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAK 597
+L L G+Q L+ ++ +T K P+++V G P+ +++A ++ W YPG+ G
Sbjct: 548 ATLDLLGKQTDLLKALKKTGK-PLVVVYIQGRPMKMNWAAENADALICAW--YPGQEGGN 604
Query: 598 ALAEIIFGDFNPGGRLPMTWYPESFTKVPM--NDMNMRADSSRQYPGRSYRFYTGTQVYG 655
A+A+++FGD+NP GRLPM+ P ++P+ N N R Y + T +Y
Sbjct: 605 AIADVLFGDYNPAGRLPMS-VPRDVGQLPIYYNKKNPRP--------HDYVELSATPLYS 655
Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
FG+GLSY++++Y L EV F
Sbjct: 656 FGYGLSYSSFTYSDL----------------------------------EVAPVGQEEFV 681
Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
V V N G+ DG VV L+ R S P +QL F R+ K++SF ++ +
Sbjct: 682 VNFKVKNTGNFDGDEVVQLYLRDEYASVVQPVQQLKHFKRLFLKKGEEKQVSFHLNE-KD 740
Query: 776 LSIANKHGRRILPLGNHVLMVG----ELRHSLTIET 807
L+I N+ ++ G LM+G ++R S TI+
Sbjct: 741 LAILNQKKEWVVEPGTFQLMIGAASDDIRLSGTIKV 776
>gi|329963634|ref|ZP_08301109.1| glycosyl hydrolase family 3 protein [Bacteroides fluxus YIT 12057]
gi|328528179|gb|EGF55158.1| glycosyl hydrolase family 3 protein [Bacteroides fluxus YIT 12057]
Length = 863
Score = 241 bits (616), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 169/470 (35%), Positives = 240/470 (51%), Gaps = 66/470 (14%)
Query: 44 CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESL 103
C P + + TSL RA+ L+ LTL+EK+ + D A + RLGI Y WW+E+L
Sbjct: 18 CSQPTYKT-----TSLPPEERAELLLKELTLEEKVSLMMDGAKPVERLGIKPYNWWNEAL 72
Query: 104 HGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAM--YNLGQA- 160
HG+A G T FPQ + AASF+ + AV+ EARA Y Q
Sbjct: 73 HGVARAG----------LATVFPQPIGMAASFSPEAVYGVFDAVSDEARAKNSYYASQGS 122
Query: 161 -----GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
GLT W P +NI+RDPRWGRG ET GEDP + S VE VK QG N DG
Sbjct: 123 YERYQGLTMWTPTVNIYRDPRWGRGIETYGEDPYLTSRMGVEVVKGLQGGN----DG--- 175
Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDT 274
R D+ L AC KH + +W +R+ FNA I +D +T
Sbjct: 176 -----------------RYDK--LHACAKHFAVHSGPEW---NRHEFNAENIKPRDLYET 213
Query: 275 FQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATI 333
+ PPF + +++GK +MC+YN+ G P C L Q R+EWGF G + SDC A+A
Sbjct: 214 YLPPFEALVKEGKVKEVMCAYNRFEGDPCCGSNRLLMQILRDEWGFDGIVLSDCGAIADF 273
Query: 334 FEYQNYTKTHED----SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLF 389
+ + + KTH D SAA VL +G D+ CG+ + A+ GK++EK +D A+ L
Sbjct: 274 YNDRGH-KTHPDAESASAAAVL-SGTDLECGSS-YKALVEAVKLGKIEEKAVDVAVKRLL 330
Query: 390 SVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAV 449
+ + LG + +P K + K+ V +++H LALD AR+ + LL N LPL + +
Sbjct: 331 TARFALGEMD-EPDKVSWTKIPFSVVASAKHDSLALDIARKSMTLLMNKNDILPLKQGGL 389
Query: 450 SSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGC 496
++A++GP N+ G Y G+P ++L G+ + K Y GC
Sbjct: 390 -TVAVMGPNANDSVMQWGNYNGMPSHTVTILEGVRKALGADDKLIYEQGC 438
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 94/333 (28%), Positives = 147/333 (44%), Gaps = 68/333 (20%)
Query: 498 DVPCNSDAGFHEAVRIAKKADFV------IVVAGLDLTQETEDR----------DRVSLL 541
D N D GF + V I K V I V G+ + E E+ DR +
Sbjct: 575 DAQLNFDLGFRKEVDIKKSVAVVKDADVVIFVGGISPSLEGEEMGVNLPGFKKGDRTDIE 634
Query: 542 LPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAE 601
LP Q ++ ++ R+ K+ VI V G P + + +IL YPG+AG A+AE
Sbjct: 635 LPAVQREMIAALHRSGKK-VIFVNCSGSP--IGLEPETKECEAILQAWYPGQAGGTAVAE 691
Query: 602 IIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGL 660
++FG +NP GRLP+T+Y + +++P D +M GR+YR+ T ++ FG+GL
Sbjct: 692 VLFGKYNPAGRLPVTFY-RNVSQLPDFEDYDMS--------GRTYRYMTQEPLFPFGYGL 742
Query: 661 SYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISV 720
SYT + Y + +L Q +++ + +L + + V
Sbjct: 743 SYTAFEYGQM-------------------VLDQ----------NKINAGQTL--ELTVPV 771
Query: 721 TNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIAN 780
TNAG DG VV ++ R + G P K L F RV+ A + + F + E
Sbjct: 772 TNAGKRDGEEVVQVYLRKRDDTDG-PNKTLRAFKRVNIPAGKTVNVDFELKDKELEWWDT 830
Query: 781 KHGRRILPLGNHVLMVGE-------LRHSLTIE 806
+ G++ +MVG L +S TIE
Sbjct: 831 ATNTMRVCAGDYDIMVGGSSQTEDLLLYSFTIE 863
>gi|218130692|ref|ZP_03459496.1| hypothetical protein BACEGG_02281 [Bacteroides eggerthii DSM 20697]
gi|217987036|gb|EEC53367.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
eggerthii DSM 20697]
Length = 786
Score = 241 bits (615), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 218/745 (29%), Positives = 341/745 (45%), Gaps = 129/745 (17%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP ++ +E + G+ + T FP A++N+ L S I A
Sbjct: 137 RLGIPV-DFTNEGIRGLCHD-----------RATMFPAQCGQGATWNKELISEIARVTAE 184
Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
EA+A+ G T ++P ++I +DPRWGR E GEDP +V
Sbjct: 185 EAKAL------GYTNIYSPILDIAQDPRWGRVVECYGEDPFLV----------------- 221
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
G +G KR++KG E L A KH Y + G + + +
Sbjct: 222 ---GELG-----KRMIKGLQAEG-------LVATPKHFAVYSVPVGGRDAGTRTDPHVAP 266
Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGD---LFQKARNEWGFKGYITS 325
++ + PFR + A +M SYN +G P + G L + R+EWGFKGY+ S
Sbjct: 267 REMRTLYLEPFRKAFCEAGALGVMSSYNDYDGEP--ITGSYHFLTEILRHEWGFKGYVVS 324
Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCM-------LRHTQSAIDKGKVQE 378
D +AV ++ D AA V+ AG+++ + LRH AI +GK+
Sbjct: 325 DSEAVEFLYSKHKVAADAVDGAAQVINAGLNVRTNFSLPENFIYPLRH---AISEGKISM 381
Query: 379 KDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKND 438
+ ID + ++ V+ +GLF+ +P KG KL C EH+ +++ AA + IVLLKN+
Sbjct: 382 QTIDSRVADVLRVKFMMGLFD-NPYKGDTKKLEKVVHC-KEHQNVSMRAALESIVLLKNE 439
Query: 439 KKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGC 496
LPL+K ++ +A+IGP + + Y K++ +G++ Y+ S+ YA G
Sbjct: 440 NNILPLSK-SIKKVAVIGPNAAEVDNLICRYGPANAPIKTVYQGIKDYLPDSEVRYAKGA 498
Query: 497 ------------HDVPCN--SDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLL 542
+DVP N + EAV +AK++D I+V G + E+ R +L L
Sbjct: 499 DIIDEYFPESELYDVPLNKTEETMIDEAVALAKESDVAIMVLGGNEKTVREEYSRTNLDL 558
Query: 543 PGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEI 602
G+Q L+ +V T K PVILV+ G +++AE I I+ +PGE A+A++
Sbjct: 559 CGRQEKLLQAVYATGK-PVILVMIDGRAATINWAE--RYIPGIVHAWFPGEFMGDAIAKV 615
Query: 603 IFGDFNPGGRLPMTWYPESFTKVPMN-DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLS 661
IFGD+NPGG+L +T +P S ++P +DS + + TG+ +Y FG+GLS
Sbjct: 616 IFGDYNPGGKLAVT-FPRSVGQIPFAFPFKPGSDS------KGFVRVTGS-LYPFGYGLS 667
Query: 662 YTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVT 721
YT + Y L + + I Q +L SC V
Sbjct: 668 YTTFDYSALKIENPV-------------IGTQGSVKL---------SC---------RVK 696
Query: 722 NAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANK 781
N G + G VV L+ S T K L GF+R+H K + F + P + + + NK
Sbjct: 697 NTGKIAGDEVVQLYLHDEVSSVTTYVKVLRGFERIHLQPGEEKVVDFVLTP-QDMGLWNK 755
Query: 782 HGRRILPLGNHVLMVGELRHSLTIE 806
I+ G +MVG + ++
Sbjct: 756 DNHFIVEPGTFAVMVGSSSQDIRLQ 780
>gi|423333917|ref|ZP_17311698.1| hypothetical protein HMPREF1075_03349 [Parabacteroides distasonis
CL03T12C09]
gi|409226752|gb|EKN19658.1| hypothetical protein HMPREF1075_03349 [Parabacteroides distasonis
CL03T12C09]
Length = 758
Score = 241 bits (615), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 214/778 (27%), Positives = 345/778 (44%), Gaps = 129/778 (16%)
Query: 44 CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESL 103
C+ P + + +I R S++SL+TL+EK+ Q++ + G P E+L
Sbjct: 20 CQQPAVSGWKSFSGDKNIERRVDSVLSLMTLEEKVGQMAQYSCNWDVTG-PVMTGDYETL 78
Query: 104 HGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLT 163
G N TV V ++ +S + + G V R ++ + A
Sbjct: 79 LKQGLVGSLFNVY-TVDGVRKMQEMALSESRLKIPVL--FGYDVVHGYRTIFPMPLAESC 135
Query: 164 FW-------------------------APNINIFRDPRWGRGQETPGEDPMVVSAYAVEF 198
W AP ++I RD RWGR E GEDP + S A
Sbjct: 136 SWDLERMRKSAAIAADEASASGIAWTFAPMVDISRDARWGRVMEGAGEDPYLGSLIAKAR 195
Query: 199 VKSFQGEN-WKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG-- 255
V+ FQG N W+S D + ACCKH AY + G
Sbjct: 196 VEGFQGGNDWRS-----------------------LADVNTVLACCKHFAAYGAAEAGRD 232
Query: 256 -NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC----LRGDLF 310
N S S N ++ + + PP+ + E G A+ M S+N++NGVP+ L DL
Sbjct: 233 YNTSELSQNTLM------NYYMPPYLAAKEAGVAT-FMASFNEINGVPSTGNKWLMTDLL 285
Query: 311 QKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVL--KAGMDIN-CGTCMLRHT 367
+K +WGF G++ +D I E ++ D AG L AG+D++ G ++
Sbjct: 286 RK---DWGFNGFVVTDYTG---INEMVAHSIVRNDKEAGELAANAGIDMDMTGGIYNQYL 339
Query: 368 QSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDA 427
++ +GKV E++I+RA+ ++ ++ LGLF+ DP + + + + E + A +
Sbjct: 340 VQSVKEGKVSEENINRAVASILEMKFLLGLFD-DPYRYLDNEREKNTIMKPEFLQEARET 398
Query: 428 ARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPK--SLLRGLEA 485
+ + IVLLKND F P++K+ ++A+IGP+V + G + G + SL GL
Sbjct: 399 SARSIVLLKNDNNFFPISKDKHITVALIGPMVKDKINQNGEWAGRGEREESISLFEGLTE 458
Query: 486 YVSKTH----YASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLL 541
+ T+ YA GC D+ + + F EA+ A++AD V+ G D E R L
Sbjct: 459 KYAGTNVKFIYAEGC-DLLTDDSSKFAEAIATARRADIVLAAMGEDFNWSGEAACRTDLK 517
Query: 542 LPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAE 601
LPG Q +L+ + +T K P+ L+L G PLD+S+ D + IL Y G +A+
Sbjct: 518 LPGAQQALLKELKKTGK-PLGLILVNGRPLDLSW--EDQHVDGILEAWYLGTMAGHGMAD 574
Query: 602 IIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRS--------YRFYTGTQV 653
+I GD+NP RL M+ +P + ++P+ + + R P + Y T +
Sbjct: 575 VISGDYNPSARLTMS-FPRTVGQLPL--YYNQKPTGRPVPPEAPDTDYKSRYMDVPNTPL 631
Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
Y FG+GLSYT ++ ++ + +D+ + +
Sbjct: 632 YPFGYGLSYTTFA-------------------------------VNSMKLDQNSFTKGGK 660
Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVD 771
V V N G VDG VV ++ R S P K+L GF++V A K++SF +D
Sbjct: 661 ITVTAEVENTGKVDGETVVQMYIRDLAGSVTRPVKELKGFEKVALKAGEKKQVSFTID 718
>gi|399025370|ref|ZP_10727374.1| beta-glucosidase-like glycosyl hydrolase [Chryseobacterium sp.
CF314]
gi|398078230|gb|EJL69151.1| beta-glucosidase-like glycosyl hydrolase [Chryseobacterium sp.
CF314]
Length = 775
Score = 241 bits (615), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 232/857 (27%), Positives = 379/857 (44%), Gaps = 161/857 (18%)
Query: 19 LQLIVVVNVIAFSNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKI 78
++ ++V+ +A + PV + + KP SY +L+S +TL EKI
Sbjct: 1 MKKLIVIATLALA---PVFSAQEMVSKP--VQSYQTAQYQAKKKAFVNNLLSKMTLDEKI 55
Query: 79 QQLSDNASAIPRLGIPAY----EWWSESLHGIASNGPGVNFNGTVSSV------------ 122
QL+ +S G+ + + L G N G + V V
Sbjct: 56 GQLNLPSSGDFTTGLAQSSDIGKKIEQGLVGGLFNIKGADKIRAVQKVAVEKSRLKIPLI 115
Query: 123 ----------TSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINI 171
T+FP L AAS++ +L A EA + G+ + ++P ++I
Sbjct: 116 FGMDVIHGYETTFPIPLGLAASWDMNLVQQSARVAAKEASS------DGINWTFSPMVDI 169
Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
R+PRWGR E GEDP + S A V +QG++
Sbjct: 170 SREPRWGRVSEGSGEDPYLGSEIAKNMVYGYQGKDLSV---------------------- 207
Query: 232 DRGDELMLSACCKHLIAYDLEKWG---NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKA 288
G+ ++ AC KH Y + G N S M E + PP+++ ++ G A
Sbjct: 208 --GNTIL--ACVKHFALYGAGEAGRDYNTVDMSHVRMFNE------YFPPYKAAVDAGVA 257
Query: 289 SCIMCSYNQVNGVPAC----LRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
S +M S+N+V+GVPA L+ ++ RN+W FKG++ +D + + ++ +
Sbjct: 258 S-VMASFNEVDGVPATGNRWLQTEVL---RNKWNFKGFVVTDYTGINEMVDHG--MGDLQ 311
Query: 345 DSAAGVLKAGMDIN-CGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPR 403
+A LKAG+D++ G L + ++ +GK+ + DID A + + LGLF+ DP
Sbjct: 312 QVSALALKAGVDMDMVGEGFLTTLKKSLSEGKITQADIDIAAKRILEAKYDLGLFD-DPY 370
Query: 404 KGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNIS 463
+ KL +V E++ +A +AA Q +VL+KND + LPL K+ ++A+IGPLVNN
Sbjct: 371 RYGDAKLAAKEVYNMENRNIARNAAAQSMVLMKNDNQVLPLKKSG--TVAVIGPLVNNSL 428
Query: 464 QMGGGYTGIPCSPKS--LLRGLEA-----------------YVSKTHYASGCHDVPCNSD 504
M G ++ ++ L++GL+A Y +K H + D
Sbjct: 429 NMAGTWSVAAKHDQAVNLMQGLQANFGKEVKFISAKGANIDYDAKLEDIYAAHGKKTDRD 488
Query: 505 -----AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKR 559
A EAV +A KAD +++ G E R + +P Q+ L+ + +T K
Sbjct: 489 NRPKEALLKEAVDVANKADVIVLAIGESAEMSGESSSRTEITIPQSQVDLLNELKKTGK- 547
Query: 560 PVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYP 619
P+ +VL G PL + SIL + + G A+++++FG NP G+LPMT +P
Sbjct: 548 PIAVVLFTGRPL--ALTNIKDTPDSILNVWFAGSEAGNAISDVLFGKVNPSGKLPMT-FP 604
Query: 620 ESFTKVPM--NDMNMRADSSRQYPGR--------SYRFYTGTQVYGFGHGLSYTNYSYKF 669
S +VP+ N N S++ + +Y T +Y FG+GLSYT + Y
Sbjct: 605 RSLGQVPVYYNAKNTGRPLSKELTDKCEYQRFRSNYMDECNTPLYPFGYGLSYTKFGY-- 662
Query: 670 LSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGS 729
S++T+S + G+ QT + VTN+G+ DG+
Sbjct: 663 ----SDVTVSNANPKGN------QT-------------------IQASVMVTNSGNYDGA 693
Query: 730 HVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPL 789
VV L+ R S P K+L GF +V SK+++F + P E L N +
Sbjct: 694 EVVQLYIRDMVGSITRPVKELKGFQKVMLKKGESKKVTFDITP-ENLKFYNGDLKYDWES 752
Query: 790 GNHVLMVG----ELRHS 802
G +M+G E++HS
Sbjct: 753 GEFDIMIGTNSEEVKHS 769
>gi|423301011|ref|ZP_17279035.1| hypothetical protein HMPREF1057_02176 [Bacteroides finegoldii
CL09T03C10]
gi|408472346|gb|EKJ90874.1| hypothetical protein HMPREF1057_02176 [Bacteroides finegoldii
CL09T03C10]
Length = 746
Score = 241 bits (615), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 201/712 (28%), Positives = 326/712 (45%), Gaps = 114/712 (16%)
Query: 123 TSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQ 181
T FP L +AS++ A EA ++ GL + ++P +I RDPRWGR
Sbjct: 121 TIFPVNLGISASWDLEEIERFARVSAEEASSV------GLHWTFSPMCDISRDPRWGRVS 174
Query: 182 ETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSA 241
E GEDP + A V+ +QG++ D+ + +
Sbjct: 175 EGSGEDPYLGGLIAAAMVRGYQGDDLSKDN--------------------------TILS 208
Query: 242 CCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGV 301
C KH IAY + G R I+E++ DT+ PPF++ I+ G A+ +M ++N+ +GV
Sbjct: 209 CAKHFIAYGAAQAG---RDYHTVDISERELRDTYLPPFKAAIDAG-AATVMSAFNEYDGV 264
Query: 302 PACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG 360
P L + R E GFKG++ SD AV + + +++ AA + GM+
Sbjct: 265 PCTANSFLMRDLLRKEMGFKGFVVSDYTAVKELIAHG--VAGNDEEAASL---GMNAGVN 319
Query: 361 TCM-----LRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDV 415
CM L H + + +G V+E+ ++ + +++ +LGLF+ R G K +
Sbjct: 320 MCMVDNLYLFHGEKLVKEGVVREETVNELCAEILTMKYKLGLFDDPYRYGSEEK-EKSTI 378
Query: 416 CTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCS 475
+ A + A++ +VLL+N+ LP+ + +A+IGP N+ +M G + P
Sbjct: 379 FNDSNLLAAKELAKKSMVLLQNNNNVLPVKEG--KKIALIGPFANHKREMLGSWVLAP-E 435
Query: 476 PK---SLLRGLEAYVSK--THYASGC---HDVPCNSDAGFHEAVRIAKKADFVIVVAGLD 527
PK + L GLE K ++A GC ++P GF AV A+KAD + GL
Sbjct: 436 PKRTVTFLEGLEERYGKQNIYFAPGCRIHEEIP----DGFKNAVAQAQKADVIFYTMGLS 491
Query: 528 LTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILW 587
+ E SL LP QM L+ + T K P++++L P+++ E ++SS+L
Sbjct: 492 QKESGEAASMTSLKLPEVQMQLLEKLKSTGK-PIVVLLVVARPIELE--EVTEKVSSLLL 548
Query: 588 IGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM--------NDMNMRADSSRQ 639
PG AL E++ GD+NP G+L ++ +P +VP+ M +++ SS +
Sbjct: 549 TWNPGTMAGAALGEVVSGDYNPSGKLTLS-FPLLTGQVPLYYNMKNTGRPMGLKSVSSEK 607
Query: 640 YPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLD 699
Y R Y +Y FG+GLSYT + Y + S ++ + G L
Sbjct: 608 YKSR-YIDCPNEPLYPFGYGLSYTTFEYSDIQPSS---------------VIAEEGDTLQ 651
Query: 700 YVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTV 759
+ ++VTN G+VDG VV L+ R K S P K+L GF ++
Sbjct: 652 F----------------SVNVTNTGNVDGEEVVQLYIRDMKGSVTRPVKELKGFKKIMIK 695
Query: 760 AKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG-----ELRHSLTIE 806
A S++++F + P E LS K G + L +G L S TI+
Sbjct: 696 AGESRKVTFNLSP-EDLSFTRKDMTWGQEKGEYKLWIGGDSNANLEASFTIK 746
>gi|301307646|ref|ZP_07213603.1| periplasmic beta-glucosidase [Bacteroides sp. 20_3]
gi|423337347|ref|ZP_17315091.1| hypothetical protein HMPREF1059_01016 [Parabacteroides distasonis
CL09T03C24]
gi|300834320|gb|EFK64933.1| periplasmic beta-glucosidase [Bacteroides sp. 20_3]
gi|409237807|gb|EKN30603.1| hypothetical protein HMPREF1059_01016 [Parabacteroides distasonis
CL09T03C24]
Length = 758
Score = 241 bits (614), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 217/778 (27%), Positives = 346/778 (44%), Gaps = 129/778 (16%)
Query: 44 CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESL 103
C+ P + + +I R S++SL+TL+EK+ Q++ + G P E+L
Sbjct: 20 CQQPAVSGWKSFSGDKNIERRVDSVLSLMTLEEKVGQMAQYSCNWDVTG-PVMTGDYETL 78
Query: 104 HGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLT 163
G N TV V ++ +S + + G V R ++ + A
Sbjct: 79 LKQGLVGSLFNVY-TVDGVRKMQEMALSESRLKIPVL--FGYDVVHGYRTIFPMPLAESC 135
Query: 164 FW-------------------------APNINIFRDPRWGRGQETPGEDPMVVSAYAVEF 198
W AP ++I RD RWGR E GEDP + S A
Sbjct: 136 SWDLERMRKSAAIAADEASASGIAWTFAPMVDISRDARWGRVMEGAGEDPYLGSLIAKAR 195
Query: 199 VKSFQGEN-WKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG-- 255
V+ FQG N W+S D + ACCKH AY + G
Sbjct: 196 VEGFQGGNDWRS-----------------------LADVNTVLACCKHFAAYGAAEAGRD 232
Query: 256 -NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC----LRGDLF 310
N S S N ++ + + PP+ + E G A+ M S+N++NGVP+ L DL
Sbjct: 233 YNTSELSQNTLM------NYYMPPYLAAKEAGVAT-FMASFNEINGVPSTGNKWLMTDLL 285
Query: 311 QKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVL--KAGMDIN-CGTCMLRHT 367
+K +WGF G++ +D I E ++ D AG L AG+D++ G ++
Sbjct: 286 RK---DWGFNGFVVTDYTG---INEMVAHSIVRNDKEAGELAANAGIDMDMTGGIYSQYL 339
Query: 368 QSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDA 427
++ +GKV E++I+RA+ ++ ++ LGLF+ DP + + + + E + A +
Sbjct: 340 VQSVKEGKVSEENINRAVASILEMKFLLGLFD-DPYRYLDNEREKNTIMKPEFLQEARET 398
Query: 428 ARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPK--SLLRGLEA 485
+ + IVLLKND F P++K+ ++A+IGP+V + G + G + SL GL
Sbjct: 399 SARSIVLLKNDNNFFPISKDKHITVALIGPMVKDKINQNGEWAGRGEREESISLFEGLTE 458
Query: 486 YVSKTH----YASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLL 541
+ T+ YA GC D+ + + F EA+ A++AD V+ G D E R L
Sbjct: 459 KYAGTNVKFIYAEGC-DLLTDDSSKFAEAIATARRADIVLAAMGEDFNWSGEAACRTDLK 517
Query: 542 LPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAE 601
LPG Q +L+ + +T K P+ L+L G PLD+S+ D + IL Y G +A+
Sbjct: 518 LPGAQQALLKELKKTGK-PLGLILVNGRPLDLSW--EDQHVDGILEAWYLGTMAGHGMAD 574
Query: 602 IIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRS--------YRFYTGTQV 653
+I GD+NP RL M+ +P + ++P+ + + R P + Y T +
Sbjct: 575 VISGDYNPSARLTMS-FPRTVGQLPL--YYNQKPTGRPVPPEAPDTDYKSRYMDVPNTPL 631
Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
Y FG+GLSYT ++ S+K D+N + G +
Sbjct: 632 YPFGYGLSYTTFAVN------------SMKL--DQNSFTKGG-----------------K 660
Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVD 771
V V N G VDG V+ ++ R S P K+L GF++V A K++SF +D
Sbjct: 661 ITVTAEVENTGKVDGETVIQMYIRDLAGSVTRPVKELKGFEKVTLKAGEKKQVSFTID 718
>gi|373461603|ref|ZP_09553342.1| hypothetical protein HMPREF9944_01606 [Prevotella maculosa OT 289]
gi|371951907|gb|EHO69749.1| hypothetical protein HMPREF9944_01606 [Prevotella maculosa OT 289]
Length = 739
Score = 241 bits (614), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 195/690 (28%), Positives = 315/690 (45%), Gaps = 113/690 (16%)
Query: 123 TSFPQVLVSAASFNRSLWSNIGSAVAVEARAM-YNLGQAGLTFWAPNINIFRDPRWGRGQ 181
T FP L + +++ L A+EA A+ YN ++P ++I RDPRWGR
Sbjct: 103 TVFPIPLGLSCTWDSVLIKKTARIAAIEATAIGYN------NTYSPMVDITRDPRWGRVM 156
Query: 182 ETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSA 241
E GEDP + S A VK +QGEN + L A
Sbjct: 157 EGSGEDPYLGSVIARAMVKGYQGENLDASTS--------------------------LLA 190
Query: 242 CCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGV 301
C KH Y + G + + +T + + PP+++ +E G A +M S+N V G+
Sbjct: 191 CVKHFGMYGAIEAGRDYNTTDMSRVT---MYNVYLPPYKAAVEAG-AGSVMTSFNDVEGI 246
Query: 302 PAC----LRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDI 357
P+ L DL RN+WGFKG++ +D +AV + + +E++A +KAG D+
Sbjct: 247 PSTANKWLLTDLL---RNQWGFKGFMVTDYNAVQELIPH-GVAADYEEAAEKAIKAGTDM 302
Query: 358 NCGT-CMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVC 416
+ + C + + +G + I+RA + V+ LGLFN DP KG L +
Sbjct: 303 DMASECYSMFMEKLVTEGHIDTSYINRACRRVLEVKYDLGLFN-DPYKGYQKGLAERVIL 361
Query: 417 TSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPC 474
+ EHK +A +A ++ +VLLKN+++ LPL + S +A+IGP ++N ++M ++ G+
Sbjct: 362 SKEHKAVAREATQRAMVLLKNERQTLPLRPD--SRVALIGPFLDNKAEMLSMWSPDGVID 419
Query: 475 SPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFH---------------EAVRIAKKADF 519
S S++ G++ + A G + N D EAV +A ++D
Sbjct: 420 SVVSIVAGMKKQFREVKTAVGTY---MNDDPALKSFEPFDSAQQAKMIAEAVELAHQSDV 476
Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS--FAE 577
V+ V G E R + +P Q L+ ++ +T K PVILVL G P+ ++
Sbjct: 477 VVAVLGESRWLSGEARSMTKISMPACQKELLKALKKTGK-PVILVLLSGRPMVLTDIIPH 535
Query: 578 ADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM-----NDMNM 632
AD+ I++ W PG A+A+++ G +NP G+L MT +P S ++P+ N
Sbjct: 536 ADAIIAA--W--RPGTMAGDAIADVLSGKYNPSGKLTMT-FPRSVGQIPIYYNHKNTGRP 590
Query: 633 RADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQ 692
D R Y + ++ FG+GLSYT + Y ++ +S ++ G+ K+I
Sbjct: 591 FKDGDRGQFKSKYLDEMNSPLFPFGYGLSYTTFRY------GDIVLSDTIVNGAKKHIT- 643
Query: 693 QTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIG 752
ISVTN G G V L+ P S P K+L
Sbjct: 644 -----------------------ASISVTNTGKYSGEETVQLYINDPVASITRPVKELKD 680
Query: 753 FDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
+ +V +K+I+F + P E L N H
Sbjct: 681 YQKVMLAPGETKKITFNITP-EALKFYNHH 709
>gi|409195436|ref|ZP_11224099.1| glycoside hydrolase family protein [Marinilabilia salmonicolor JCM
21150]
Length = 867
Score = 241 bits (614), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 151/448 (33%), Positives = 220/448 (49%), Gaps = 62/448 (13%)
Query: 55 CNT------SLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIAS 108
CNT S S RA L+ LTL+EK+ + D +AI RLGI Y WW+E+LHG+A
Sbjct: 21 CNTEIWKDNSYSPEERADDLLKELTLEEKVSLMVDRNTAIERLGIEEYNWWNEALHGVAR 80
Query: 109 NGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA-------- 160
G T FPQ + AA+F+R + ++ SA + EARA ++ +
Sbjct: 81 AG----------QATVFPQPVGMAAAFDRDMVLDVFSAASDEARAKHHFFKERGERGRYQ 130
Query: 161 GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFRE 220
GLT W PNIN+FRDPRWGRG E GEDP + VK QG+
Sbjct: 131 GLTMWTPNINVFRDPRWGRGMEAYGEDPFMNGVLGTAVVKGLQGDR-------------- 176
Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPF 279
G L AC KH + +W +R+SFNA I +D +T+ P F
Sbjct: 177 ------------SGKYDKLHACAKHYAVHSGPEW---NRHSFNAENIRPRDLHETYLPAF 221
Query: 280 RSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFE--- 335
+ + G +MC+YN+ G P C L + RNEWGF G + SDC A+ F
Sbjct: 222 KKLVIDGDVRMVMCAYNRFEGEPCCGNNQLLRDILRNEWGFDGVVVSDCWAINDFFNKDA 281
Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
+ Y S VL AG D+NCG + A+++G + E+ +D +L L + L
Sbjct: 282 HAMYPDAKTASTDAVL-AGTDLNCGDSYPSLVE-AVEQGLITEEQLDISLRRLLIARFEL 339
Query: 396 GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAII 455
G + D + ++ K+ V + H ++AL+AAR+ + LL N LPL K + ++A++
Sbjct: 340 GEMDPD-EEVEWSKIPHSVVSSPTHSEMALEAARKSMTLLMNKNGALPLKKEGL-TVAVM 397
Query: 456 GPLVNNISQMGGGYTGIPCSPKSLLRGL 483
GP N+ G Y G P + ++L+G+
Sbjct: 398 GPNANDSLMQWGNYNGTPATTTTILQGI 425
Score = 126 bits (317), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 92/293 (31%), Positives = 142/293 (48%), Gaps = 57/293 (19%)
Query: 517 ADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
AD V+ +G+ E E+ DR + LP Q ++ ++ + K +++ +
Sbjct: 603 ADVVVFASGISPFLEGEEMGVDLPGFKGGDRTDIALPAIQKEMLKALHKAGKEIILVNCS 662
Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
G + F EA S+IL YPG+AG +A+AE++FGD+NP GRLP+T+Y +S ++P
Sbjct: 663 GSA---IGFEEATDYSSAILQAWYPGQAGGQAVAEVLFGDYNPAGRLPVTFY-KSVDQLP 718
Query: 627 -MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAG 685
D NM R+YR++ G +Y FG+GLSYT +SY
Sbjct: 719 DFQDYNMT--------NRTYRYFEGEPLYPFGYGLSYTTFSY------------------ 752
Query: 686 SDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT 745
D+ L QT + T +++SV N GD DG VV L+ + P ++G
Sbjct: 753 -DQPELSQT------------SISTEEEASLKVSVANTGDYDGEEVVQLYLQKPDDTEG- 798
Query: 746 PEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPL-GNHVLMVG 797
P L GF RV + E+ F + E L N +R+ PL G++ L+VG
Sbjct: 799 PSLTLRGFQRVFIPKGETVEVEFQLTE-EVLEWWNADAQRMTPLAGDYRLLVG 850
>gi|339999930|ref|YP_004730813.1| periplasmic beta-glucosidase [Salmonella bongori NCTC 12419]
gi|339513291|emb|CCC31041.1| periplasmic beta-glucosidase precursor [Salmonella bongori NCTC
12419]
Length = 765
Score = 240 bits (613), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 206/737 (27%), Positives = 343/737 (46%), Gaps = 136/737 (18%)
Query: 75 QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
++ I+Q+ D A+ RL IP + + + +HG T FP L A++
Sbjct: 86 RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLAST 129
Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
FN +G A EA GL WAP +++ RDPRWGR E GED + S
Sbjct: 130 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 183
Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
VK+ QG++ D + KH AY +
Sbjct: 184 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 217
Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
G +N + ++ Q + + PP+++ ++ G + +M + N +NG PA L +
Sbjct: 218 GGK----EYNTVDMSPQRLFNDYMPPYKAGLDAG-SGAVMIALNSLNGTPATSDAWLLKD 272
Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
R++WGFKG SD A+ + ++ T ED+ LK+G+D++ ++
Sbjct: 273 VLRHQWGFKGITVSDHGAIKELIKHGTATDP-EDAVRVALKSGVDMSMADEYYSKYLPGL 331
Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
I GKV ++D A ++ +V+ +GLFN DP Y LGP D V T+ H+K
Sbjct: 332 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 386
Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
A + AR+ +VLLKN LPL+KNA ++A++GPL ++ + G ++ G+ ++L
Sbjct: 387 AREVARESLVLLKNRLNTLPLSKNA--TIAVVGPLADSKRDIMGSWSAAGVADQSVTVLT 444
Query: 482 GLEAYVS---KTHYASGCH-----DV--------------PCNSDAGFHEAVRIAKKADF 519
G++ + K YA G + D+ P + A EAV+ AK+AD
Sbjct: 445 GIKNALGERGKVLYAKGANITNDKDIVDFLNLYEEVVKIDPRSPQAMIDEAVQAAKQADV 504
Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
V+ V G E R + +P Q L+T++ T K P++LVL G PL + + D
Sbjct: 505 VVAVVGESQGMAHEASSRTDITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 561
Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM--NDMNMRADSS 637
Q +IL + G G A+A+++FGD NP G+LPM+ +P S ++P+ + +N +
Sbjct: 562 QQADAILETWFAGTEGGNAIADVLFGDDNPSGKLPMS-FPRSVGQIPVYYSHLNTGRPYN 620
Query: 638 RQYPGR-SYRFYTGTQ--VYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
+ P + + R++ +Y FG+GLSY + S++T+SA +++
Sbjct: 621 PEKPDKYTSRYFDEANGPLYPFGYGLSYNTFKV------SDVTLSAP--------TMKRD 666
Query: 695 GSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFD 754
G +VT+ + VTN G +G+ V+ ++ + S P KQL GF+
Sbjct: 667 G---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFE 709
Query: 755 RVHTVAKGSKEISFGVD 771
++ +K +SF +D
Sbjct: 710 KIILKPGETKTVSFPID 726
>gi|300777062|ref|ZP_07086920.1| beta-glucosidase [Chryseobacterium gleum ATCC 35910]
gi|300502572|gb|EFK33712.1| beta-glucosidase [Chryseobacterium gleum ATCC 35910]
Length = 775
Score = 240 bits (613), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 211/717 (29%), Positives = 330/717 (46%), Gaps = 124/717 (17%)
Query: 123 TSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQE 182
T+FP L AAS++ +L I + V AR + G TF +P ++I R+PRWGR E
Sbjct: 126 TTFPIPLGLAASWDMNL---IQQSARVAAREAASDG-INWTF-SPMVDISREPRWGRVSE 180
Query: 183 TPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSAC 242
GEDP + S + V +QG+ D + + AC
Sbjct: 181 GSGEDPYLGSEISKNMVYGYQGK--------------------------DLANGSNILAC 214
Query: 243 CKHLIAYDLEKWG---NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVN 299
KH Y + G N S M E + PP+++ ++ G AS +M S+N+V+
Sbjct: 215 VKHFALYGAGEAGRDYNTVDMSHVRMFNE------YFPPYKAAVDAGVAS-VMASFNEVD 267
Query: 300 GVPAC----LRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGM 355
GVPA L+ ++ RN+W FKG++ +D + + E+ + +A LKAG+
Sbjct: 268 GVPATGNRWLQTEVL---RNQWKFKGFVVTDYTGINEMVEHG--MGDLQQVSALALKAGV 322
Query: 356 DIN-CGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDD 414
D++ G L + ++ +GKV + +ID A + + LGLF+ +P K KL +
Sbjct: 323 DMDMVGEGFLTTLKKSLAEGKVTQAEIDMAARRILEAKYDLGLFD-NPYKHGDAKLAAKE 381
Query: 415 VCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIP- 473
V E++ +A A Q +VLLKND + LPL K+ ++A+IGPLVNN M G ++
Sbjct: 382 VYNMENRNIARSTAAQSMVLLKNDNQVLPLKKSG--TVAVIGPLVNNSMNMAGTWSVATK 439
Query: 474 -CSPKSLLRGLEA-----------------YVSKTHYASGCHDVPCNSDAG-----FHEA 510
S SL++GL++ Y +K H + D EA
Sbjct: 440 HASSVSLMQGLQSNYGKEVKFLSAKGANIDYDAKLEEIYAAHGKKTDRDNRSKEELLKEA 499
Query: 511 VRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGP 570
V +A KAD +++ G E R + +P Q+ L+ + +T K P+ +VL G P
Sbjct: 500 VEVANKADVIVLAIGESAEMSGESSSRTEITIPQSQVDLLNELKKTGK-PIAMVLFTGRP 558
Query: 571 LDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM--N 628
L + +IL + G A+A+++FG NP G+LPMT +P S +VP+ N
Sbjct: 559 L--ALTNVKDTPDAILNAWFAGSEAGNAIADVLFGKVNPSGKLPMT-FPRSLGQVPIYYN 615
Query: 629 DMNM-RADSSRQYPGRSY-RFYTG------TQVYGFGHGLSYTNYSYKFLSAPSELTISA 680
N R S + +Y RF + T +Y FG+GLSYT ++Y S++T+S
Sbjct: 616 AKNTGRPLSQDKVDKCTYERFRSNYMDECNTPLYAFGYGLSYTKFNY------SDMTVSN 669
Query: 681 SLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPK 740
+ G+ QT ++VTNAG+ DG+ VV L+ R
Sbjct: 670 TSPKGN------QT-------------------VQASVTVTNAGNYDGAEVVQLYIRDMV 704
Query: 741 VSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
S P K+L GF +V SK+++F + P E L N + G +M+G
Sbjct: 705 GSITRPVKELKGFQKVFLKKGESKKVTFDITP-ENLKFYNGELKYDWEPGEFDIMIG 760
>gi|423303939|ref|ZP_17281938.1| hypothetical protein HMPREF1072_00878 [Bacteroides uniformis
CL03T00C23]
gi|423307339|ref|ZP_17285329.1| hypothetical protein HMPREF1073_00079 [Bacteroides uniformis
CL03T12C37]
gi|392686630|gb|EIY79933.1| hypothetical protein HMPREF1072_00878 [Bacteroides uniformis
CL03T00C23]
gi|392690354|gb|EIY83622.1| hypothetical protein HMPREF1073_00079 [Bacteroides uniformis
CL03T12C37]
Length = 736
Score = 240 bits (612), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 209/802 (26%), Positives = 343/802 (42%), Gaps = 146/802 (18%)
Query: 54 FCNTSLSISTRAKSLISLLTLQEKIQQLSD--------------NASAIP---------- 89
+ + I R L+S +TL+EK+QQL+ IP
Sbjct: 29 YKDAKAPIEERVNDLVSRMTLEEKVQQLNQYTLGRNNNENNRGEEVKKIPATLGSLIYFD 88
Query: 90 -----------------RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSA 132
RLGIP + + +HG T +P L A
Sbjct: 89 EDANLRNEAQRKAMEESRLGIPIL-FGYDVIHGFR---------------TIYPISLGQA 132
Query: 133 ASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVS 192
S+N L + A EAR + TF +P I++ RD RWGR E GEDP +
Sbjct: 133 CSWNPQLVEQACAVAAQEAR----MSGVDWTF-SPMIDVARDGRWGRVAEGYGEDPYTNA 187
Query: 193 AYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLE 252
+ V +K +QGE D D ++AC KH I Y
Sbjct: 188 VFGVASIKGYQGE--------------------------DMSDSKRVAACLKHYIGYGAS 221
Query: 253 KWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRG-DLFQ 311
+ G Y+ I+ Q DT+ PP+ + ++ G A+ +M S+N ++G P + +
Sbjct: 222 EAGRDYVYT---EISNQTLWDTYIPPYEAGVKAG-AATLMSSFNDISGTPGSANHYTMTE 277
Query: 312 KARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDIN-CGTCMLRHTQSA 370
+N W G++ SD AV + + Q + +++A AG++++ G C +H
Sbjct: 278 ILKNRWKHDGFVVSDWSAVPQLID-QGHAADRKEAARLAFNAGLEMDMMGHCYDKHMAKL 336
Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQ 430
+++GK+ + +D A+ + ++ RLGLF+ K + + +A A +
Sbjct: 337 VEEGKISMQLVDDAVKRVLRIKFRLGLFDNPYTPTSTEK---ERFLLPQSLTIAEKLAEE 393
Query: 431 GIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTG-------IPCSPKSLLRGL 483
IVLLKN+ K LPL ++A++GPLV N +++ G + G +P K L
Sbjct: 394 TIVLLKNENKVLPLANGNKPTIAVMGPLVQNSAELLGSWYGHGHAEDVLPI--KKALDAE 451
Query: 484 EAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLP 543
A ++ Y GC D N + F EA+ +A+KAD +++ G E+ R + LP
Sbjct: 452 FAGKAELIYTEGC-DFDGNDTSKFSEALAVARKADIILLCMGEKKKWSGENASRSIIELP 510
Query: 544 GQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEII 603
Q + + + K P++L L G PL +S E +W PG G K LA ++
Sbjct: 511 AIQEKFIAEMKKAGK-PIVLALANGRPLGLSKVEPLCDAIVEMW--QPGVPGGKPLAGVL 567
Query: 604 FGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYT 663
G NP G+L +T +P S ++P+ N R ++R G+ Y+ T +Y FG+GLSYT
Sbjct: 568 SGRVNPSGKLSIT-FPRSTGQIPIY-YNQR-KTARPQSGK-YQNIPSTPLYEFGYGLSYT 623
Query: 664 NYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNA 723
++Y ++ P E +++ G + ++I VTN
Sbjct: 624 TFNYGNINLPKE-----TIRRGE--------------------------KLVMEIPVTNV 652
Query: 724 GDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHG 783
G DG+ VV F P + P K+L F++ A + F +DP L+ N +G
Sbjct: 653 GKRDGAEVVHWFISDPFSTITRPCKELKHFEKQLIKAGETHIFRFEIDPMRDLAFVNANG 712
Query: 784 RRILPLGNHVLMVGELRHSLTI 805
L G + ++V + + T+
Sbjct: 713 EHFLENGEYYVIVKDQKVKFTV 734
>gi|427411074|ref|ZP_18901276.1| hypothetical protein HMPREF9718_03750 [Sphingobium yanoikuyae ATCC
51230]
gi|425710259|gb|EKU73281.1| hypothetical protein HMPREF9718_03750 [Sphingobium yanoikuyae ATCC
51230]
Length = 792
Score = 240 bits (612), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 220/749 (29%), Positives = 334/749 (44%), Gaps = 133/749 (17%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP + ESLHG + + T FP + A SF+R L +++ S +A
Sbjct: 140 RLGIPIL-FHEESLHGYMA-----------TDATMFPMAIGLAGSFDRQLMTDVQSVIAR 187
Query: 150 EARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
E RA G+ +P ++I RDPRWGR +ET GEDP + V V QGE
Sbjct: 188 EVRAR------GVHLALSPVVDIARDPRWGRIEETFGEDPYLCGEMGVAAVLGLQGEG-- 239
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG-NFSRYSFNAMIT 267
K G D++M A KH+ + + G N + A I+
Sbjct: 240 ----------------KQIGP-----DKVM--ATLKHMTGHGQPQAGENIA----PAPIS 272
Query: 268 EQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPA----CLRGDLFQKARNEWGFKGYI 323
E++ + F PPFR +++ + +M SYN+++GVP+ L GD+ R EW F G +
Sbjct: 273 ERELRENFFPPFRQVVKRTGIAAVMPSYNEIDGVPSHQNKWLLGDIL---RGEWHFDGAV 329
Query: 324 TSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCM-LRHTQSAIDKGKVQEKDID 382
SD AV + + E +A L+AG+D + R + GKV + ++
Sbjct: 330 VSDYGAVPELDTIHHVQPDLEATARAALRAGVDCELPDGLAYRTLVEQVRAGKVPLEAVN 389
Query: 383 RALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFL 442
A + +++ R GLF + Y L +E + LAL AA + IVLLKND L
Sbjct: 390 LACTRMLTLKFRAGLFENPWPRADYDAL----TGNAEARALALKAAHKSIVLLKNDGT-L 444
Query: 443 PLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCHDV 499
PL A +A+IGP N GGY+ IP SLL G++A + + +A G
Sbjct: 445 PLKPGAHRKVAVIGP--NAAIARLGGYSSIPRQAVSLLDGVKAKLGNRADIVHAQGVFIT 502
Query: 500 --------------PCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR-------DRV 538
P + EAV +AK AD +++ G D Q + + DR
Sbjct: 503 QSEDRSVDEVLLADPAKNRQLIAEAVEVAKTADIILLAIG-DTEQTSREGFAKNHLGDRT 561
Query: 539 SLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKA 598
SL L G+Q L ++ T K V+ + G P S+ ++++ YPG+ G A
Sbjct: 562 SLDLVGEQNDLFAAMKATGKPVVVCAINGRPP---SYPAVVDGANALIECWYPGQEGGTA 618
Query: 599 LAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGH 658
+A+I+FGD NPG +LP+T ++ ++P+ R SSR R Y F + ++ FG
Sbjct: 619 MADILFGDVNPGAKLPVTVARDA-GQIPI--FYNRKPSSR----RGYLFADTSPLFPFGF 671
Query: 659 GLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQI 718
GLSYT +++ P L+ S + G D V++
Sbjct: 672 GLSYTKFAF----GPPRLSAS-RIGVGGD--------------------------VTVEV 700
Query: 719 SVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
V N G V G VV L+ S P K+L GF+R+ S+ + + P E ++
Sbjct: 701 DVRNVGTVAGEEVVQLYVHDQTASVTRPLKELKGFERIALAPGESRTVRLTIGP-EAFAL 759
Query: 779 ANKHGRRILPLGNHVLMVGELRHSLTIET 807
N R ++ G +MVG S T++T
Sbjct: 760 WNLDMREVVEPGLFDIMVGP--DSATLQT 786
>gi|401676349|ref|ZP_10808334.1| glucosidase [Enterobacter sp. SST3]
gi|400216388|gb|EJO47289.1| glucosidase [Enterobacter sp. SST3]
Length = 765
Score = 240 bits (612), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 203/738 (27%), Positives = 339/738 (45%), Gaps = 138/738 (18%)
Query: 75 QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
++ I+++ D + RL IP + + + +HG T FP L A+S
Sbjct: 86 RQDIRKMQDQVMDLSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 129
Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
FN +G A EA GL WAP +++ RDPRWGRG E GED + S
Sbjct: 130 FNLDAVKTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRGSEGFGEDTFLTST 183
Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
V++ QG++ D + KH AY +
Sbjct: 184 MGKTMVEAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 217
Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
G +N + ++ Q + + PP+++ ++ G + +M + N +NG PA L +
Sbjct: 218 GGK----EYNTVDMSPQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 272
Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
R++WGFKG SD A+ + ++ + ED+ LK+G++++ ++
Sbjct: 273 VLRDQWGFKGITVSDHGAIKELIKHGTASDP-EDAVRVALKSGINMSMSDEYYSKYLPGL 331
Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSE-------HKKL 423
+ GKV ++D A ++ +V+ +GLFN DP Y LGP D ++ H+K
Sbjct: 332 VKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKDSDPADTNAESRLHRKE 386
Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
A D AR+ +VLLKN LPL K+ ++A++GPL ++ + G ++ G+ ++L
Sbjct: 387 ARDVARESLVLLKNRLDTLPLKKSG--TIAVVGPLADSKRDVMGSWSAAGVADQSVTVLT 444
Query: 482 GLEAYV---SKTHYASGCH-----DV--------------PCNSDAGFHEAVRIAKKADF 519
G+++ V +K YA G + D+ P EAV AK++D
Sbjct: 445 GIKSAVGENAKVVYAKGANVTNDKDIVTFLNQYEEAVKVDPRTPKEMIDEAVNAAKQSDV 504
Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
V+ V G E R + +P Q L+ ++ T K P++LVL G PL + + D
Sbjct: 505 VVAVVGEAQGMAHEASSRTDITIPQSQRDLIAALKATGK-PLVLVLMNGRPL--ALVKED 561
Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM--NDMNM----R 633
Q +IL + G G A+A+++FGD+NP G+LPM+ +P S ++P+ + +N
Sbjct: 562 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPMS-FPRSVGQIPVYYSHLNTGRPYN 620
Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
AD +Y R + G +Y FG+GLSYT + S++ +SA L++
Sbjct: 621 ADKPNKYTSRYFDEANGP-LYPFGYGLSYTTFKV------SDVKMSAP--------TLKR 665
Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
G +VT+ + VTN G +G+ V+ ++ + S P KQL GF
Sbjct: 666 DG---------KVTAS--------VEVTNTGKREGATVIQMYVQDVTASMSRPVKQLRGF 708
Query: 754 DRVHTVAKGSKEISFGVD 771
++V+ +K ISF +D
Sbjct: 709 EKVNLKPGETKTISFPID 726
>gi|398386388|ref|ZP_10544390.1| beta-glucosidase-like glycosyl hydrolase [Sphingobium sp. AP49]
gi|397718419|gb|EJK79008.1| beta-glucosidase-like glycosyl hydrolase [Sphingobium sp. AP49]
Length = 792
Score = 240 bits (612), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 218/740 (29%), Positives = 331/740 (44%), Gaps = 133/740 (17%)
Query: 90 RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
RLGIP + ESLHG + + T FP + A SF+R L +++ S +A
Sbjct: 140 RLGIPIL-FHEESLHGYMA-----------TDATMFPMAIGLAGSFDRQLMTDVQSVIAR 187
Query: 150 EARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
E RA G+ +P ++I RDPRWGR +ET GEDP + V V QGE+
Sbjct: 188 EVRAR------GVHLALSPVVDIARDPRWGRIEETFGEDPYLCGEMGVAAVLGLQGES-- 239
Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG-NFSRYSFNAMIT 267
K G D++M A KH+ + + G N + A I+
Sbjct: 240 ----------------KQIGP-----DKVM--ATLKHMTGHGQPQAGENIA----PAPIS 272
Query: 268 EQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPA----CLRGDLFQKARNEWGFKGYI 323
E++ + F PPFR +++ + +M SYN+++GVP+ L GD+ R EW F G +
Sbjct: 273 ERELRENFFPPFRQVVKRTGIAAVMPSYNEIDGVPSHQNKWLLGDIL---RGEWHFDGAV 329
Query: 324 TSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCM-LRHTQSAIDKGKVQEKDID 382
SD AV + + E +A L+AG+D + R + GKV + ++
Sbjct: 330 VSDYGAVPELDTIHHVQPDLEATARAALRAGVDCELPDGLAYRTLVEQVRAGKVPLEAVN 389
Query: 383 RALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFL 442
A + +++ R GLF + Y L +E + LAL AA + IVLLKND L
Sbjct: 390 LACTRMLTLKFRAGLFENPWPRADYDAL----TGNAEARALALKAAHKSIVLLKNDGT-L 444
Query: 443 PLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCHDV 499
PL A +A++GP N GGY+ IP SLL G++A + + +A G
Sbjct: 445 PLRPGAHRKVAVVGP--NAAIARLGGYSSIPRQAVSLLDGVKAKLGNRADIVHAQGVFIT 502
Query: 500 --------------PCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR-------DRV 538
P + EAV +AK AD +++ G D Q + + DR
Sbjct: 503 QSEDRSVDEVFLADPAKNRQLIAEAVEVAKTADIILLAIG-DTEQTSREGFAKNHLGDRT 561
Query: 539 SLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKA 598
SL L G+Q L ++ T K V+ + G P S+ +++L YPG+ G A
Sbjct: 562 SLDLVGEQNELFAAMKATGKPVVVCAINGRPP---SYPAVVDGANALLECWYPGQEGGTA 618
Query: 599 LAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGH 658
+A+I+FGD NPGG+LP+T ++ ++P+ R SSR R Y F + ++ FG
Sbjct: 619 MADILFGDVNPGGKLPVTVARDA-GQIPI--FYNRKPSSR----RGYVFEDSSPLFPFGF 671
Query: 659 GLSYTNYSYKFLSAPSELTISAS-LKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQ 717
GLSYT +++ + +SA + G D V+
Sbjct: 672 GLSYTKFTF------GKPRLSAGRIGVGGD--------------------------VFVE 699
Query: 718 ISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLS 777
I V N G V G VV L+ S P K+L GF+R+ S+ + + P E S
Sbjct: 700 IEVRNVGSVAGEEVVQLYVHDQTASVTRPIKELKGFERIALAPGESRTVRLAIGP-EAFS 758
Query: 778 IANKHGRRILPLGNHVLMVG 797
+ N + ++ G +MVG
Sbjct: 759 LWNLDMQEVVEPGLFDIMVG 778
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.135 0.408
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 12,736,236,143
Number of Sequences: 23463169
Number of extensions: 545838363
Number of successful extensions: 1160191
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 4767
Number of HSP's successfully gapped in prelim test: 2450
Number of HSP's that attempted gapping in prelim test: 1110655
Number of HSP's gapped (non-prelim): 18956
length of query: 808
length of database: 8,064,228,071
effective HSP length: 151
effective length of query: 657
effective length of database: 8,816,256,848
effective search space: 5792280749136
effective search space used: 5792280749136
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 81 (35.8 bits)