BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 004190
(770 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255548487|ref|XP_002515300.1| Beta-glucosidase, putative [Ricinus communis]
gi|223545780|gb|EEF47284.1| Beta-glucosidase, putative [Ricinus communis]
Length = 768
Score = 1327 bits (3435), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 622/748 (83%), Positives = 680/748 (90%), Gaps = 2/748 (0%)
Query: 23 AAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIK 82
AA EPFACD KD TT+ LPFCQV LPI RV DLIGRL+L EKV LL++ A AV RLGIK
Sbjct: 23 AALEPFACDSKDGTTKNLPFCQVKLPIQDRVKDLIGRLTLAEKVGLLVNNAGAVSRLGIK 82
Query: 83 GYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAM 142
GYEWWSEALHGVSNVGPGTKFGG FPGATSFPQVITTA+SFN+TLWEAIGRVVSDEARAM
Sbjct: 83 GYEWWSEALHGVSNVGPGTKFGGSFPGATSFPQVITTAASFNSTLWEAIGRVVSDEARAM 142
Query: 143 YNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAA 202
YNGG AGLTYWSPNVNI RDPRWGRGQETPGEDP+L GKYAASYV+GLQG+DG+RLKVAA
Sbjct: 143 YNGGAAGLTYWSPNVNILRDPRWGRGQETPGEDPLLVGKYAASYVKGLQGNDGERLKVAA 202
Query: 203 SCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGV 262
CKHFTAYDLDNWNGVDRFHFNAKVSKQD++DTFDVPFRMCV EGKVASVMCSYNQVNG+
Sbjct: 203 CCKHFTAYDLDNWNGVDRFHFNAKVSKQDMKDTFDVPFRMCVKEGKVASVMCSYNQVNGI 262
Query: 263 PTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG 322
PTCADPN+L++T+R +W LNGYIVSDCDSVGV+YD QH+TSTPEEAAADAI+AGLDLDCG
Sbjct: 263 PTCADPNLLRKTVRTQWGLNGYIVSDCDSVGVFYDKQHYTSTPEEAAADAIKAGLDLDCG 322
Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
PFL +HT+ AV+RGL+SE D+N AL NTLTVQMRLGMFDGEPS+QPYG+LGPKDVCTP H
Sbjct: 323 PFLAVHTQDAVKRGLISEADVNGALFNTLTVQMRLGMFDGEPSAQPYGNLGPKDVCTPAH 382
Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
QELALEA RQGIVLLKN GPSLPLS RHRTVA+IGPNS+VTVTMIGNYAG+AC YTTPL
Sbjct: 383 QELALEAGRQGIVLLKNHGPSLPLSPRRHRTVAIIGPNSNVTVTMIGNYAGVACQYTTPL 442
Query: 443 QGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLL 502
QGIG YA+TIHQQGC DV C DQLF AIDA+RQADAT+LVMGLDQSIEAE DR GLL
Sbjct: 443 QGIGSYAKTIHQQGCADVGCVTDQLFSGAIDAARQADATVLVMGLDQSIEAEFRDRTGLL 502
Query: 503 LPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIAD 562
LPGRQQELVSKV+MASKGPTILVLMSGGPIDV+FAK DP+IAAI+WAGYPGQAGG AIAD
Sbjct: 503 LPGRQQELVSKVAMASKGPTILVLMSGGPIDVSFAKKDPKIAAILWAGYPGQAGGAAIAD 562
Query: 563 ILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGM 622
+LFGT NPGGKLPMTWYPQEYITNLPMTEMAMR SQSK YPGRTYRFY+G VVYPFGHGM
Sbjct: 563 VLFGTINPGGKLPMTWYPQEYITNLPMTEMAMRSSQSKGYPGRTYRFYQGKVVYPFGHGM 622
Query: 623 SYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSK 682
SYT+FVH +A+APT+V+VPLDG G N +ISGKAI+VTH KCN+L+LG+QVDVKNVGSK
Sbjct: 623 SYTHFVHNIASAPTMVSVPLDGHRG--NTSISGKAIRVTHTKCNKLSLGIQVDVKNVGSK 680
Query: 683 DGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRI 742
DG HTLLV+S PPAG W+PHKQLVAFE+VHV AG Q+RVGI+IHVCK LSVVDRSG RRI
Sbjct: 681 DGTHTLLVYSAPPAGRWSPHKQLVAFERVHVSAGTQERVGISIHVCKLLSVVDRSGIRRI 740
Query: 743 PLGEHNIHIGGTKHSVSLHAATLGVIKS 770
P+GEH+IHIG KHSVSL A LGV+KS
Sbjct: 741 PIGEHSIHIGNVKHSVSLQATVLGVVKS 768
>gi|225437531|ref|XP_002270249.1| PREDICTED: probable beta-D-xylosidase 2 [Vitis vinifera]
gi|297743965|emb|CBI36935.3| unnamed protein product [Vitis vinifera]
Length = 768
Score = 1292 bits (3343), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 602/769 (78%), Positives = 684/769 (88%), Gaps = 8/769 (1%)
Query: 2 ASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLS 61
+S++ F + L ++S AR+PFACDPKD PFC+ S+ I +RV DLIGRL+
Sbjct: 8 SSSLLIFLVVLAVVSGE-----ARDPFACDPKDGANAGFPFCRKSIGIGERVKDLIGRLT 62
Query: 62 LQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTAS 121
L+EKV+LL++ AA VPRLGIKGYEWWSEALHGVSNVGPGTKF GDFPGATSFPQVITTA+
Sbjct: 63 LEEKVRLLVNNAAGVPRLGIKGYEWWSEALHGVSNVGPGTKFSGDFPGATSFPQVITTAA 122
Query: 122 SFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGK 181
SFN++LWEAIG+VVSDEARAMYNGG AGLT+WSPNVNIFRDPRWGRGQETPGEDPVL+GK
Sbjct: 123 SFNSSLWEAIGQVVSDEARAMYNGGAAGLTFWSPNVNIFRDPRWGRGQETPGEDPVLAGK 182
Query: 182 YAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFR 241
YAA YVRGLQG+ GDRLKVAA CKHFTAYDLDNWNGVDRFHF+A+VSKQ++EDTFDVPFR
Sbjct: 183 YAARYVRGLQGNAGDRLKVAACCKHFTAYDLDNWNGVDRFHFDARVSKQEMEDTFDVPFR 242
Query: 242 MCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHF 301
CV+EGKVASVMCSYNQVNGVPTCADPN+L+ T+R +W LNGY+VSDCDSVGV+YD QH+
Sbjct: 243 SCVVEGKVASVMCSYNQVNGVPTCADPNLLRNTVRKQWHLNGYVVSDCDSVGVFYDNQHY 302
Query: 302 TSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFD 361
T+TPEEAAADAI+AGLDLDCGPFL +HT+ A+++GL+SE D+++ALVNT+TVQMRLGMFD
Sbjct: 303 TNTPEEAAADAIKAGLDLDCGPFLAVHTQDAIKKGLVSEADVDSALVNTVTVQMRLGMFD 362
Query: 362 GEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNS 421
GEPS+QP+G LGPKDVC+P HQELA+EAARQGIVLLKN G SLPLS HR++AVIGPNS
Sbjct: 363 GEPSAQPFGDLGPKDVCSPAHQELAIEAARQGIVLLKNHGHSLPLSTRSHRSIAVIGPNS 422
Query: 422 DVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADAT 481
D VTMIGNYAGI C YTTPLQGIGRY+RTIHQ+GC DVAC++DQLF AIDA+ QADAT
Sbjct: 423 DANVTMIGNYAGIPCEYTTPLQGIGRYSRTIHQKGCADVACSEDQLFAGAIDAASQADAT 482
Query: 482 ILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDP 541
+LVMGLDQSIEAEA DRA LLLPGRQQELVSKV+MAS+GPT+LVLMSGGP+DV+FAK DP
Sbjct: 483 VLVMGLDQSIEAEAKDRADLLLPGRQQELVSKVAMASRGPTVLVLMSGGPVDVSFAKKDP 542
Query: 542 RIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKR 601
RIAAI+WAGYPGQAGG AIADILFG +NPGGKLPMTWYPQEY++ +PMT MAMR SK
Sbjct: 543 RIAAIVWAGYPGQAGGAAIADILFGVANPGGKLPMTWYPQEYLSKVPMTTMAMRAIPSKA 602
Query: 602 YPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVT 661
YPGRTYRFYKGPVVY FGHG+SYTNFVHT+A APT VA+PL G H N T+SGKAI+VT
Sbjct: 603 YPGRTYRFYKGPVVYRFGHGLSYTNFVHTIAQAPTAVAIPLHGHH---NTTVSGKAIRVT 659
Query: 662 HAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRV 721
HAKCNRL++ + +DVKNVG+KDG+HTLLVFS PPAGHWAPHKQLVAFEKVHV A QQRV
Sbjct: 660 HAKCNRLSIALHLDVKNVGNKDGSHTLLVFSKPPAGHWAPHKQLVAFEKVHVAARTQQRV 719
Query: 722 GINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIKS 770
INIHVCKYLSVVDRSG RRIP+G+H +HIG TKH VSL AA LGVIKS
Sbjct: 720 QINIHVCKYLSVVDRSGIRRIPMGQHGLHIGDTKHLVSLQAAALGVIKS 768
>gi|356534827|ref|XP_003535953.1| PREDICTED: probable beta-D-xylosidase 2-like [Glycine max]
Length = 771
Score = 1290 bits (3338), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 603/753 (80%), Positives = 686/753 (91%), Gaps = 4/753 (0%)
Query: 21 GLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLG 80
G AR+PFACDPK+ T+ LPFC+ L RV DLIGRL+LQEKV LL++ AAAVPRLG
Sbjct: 20 GGEARDPFACDPKNTATKNLPFCKAWLATGARVKDLIGRLTLQEKVNLLVNNAAAVPRLG 79
Query: 81 IKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEAR 140
IKGYEWWSEALHGVSNVGPGTKFGG FP ATSFPQVITTA+SFNA+LWEAIGRV SDEAR
Sbjct: 80 IKGYEWWSEALHGVSNVGPGTKFGGQFPAATSFPQVITTAASFNASLWEAIGRVASDEAR 139
Query: 141 AMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKV 200
AMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDP+L+GKYAASYVRGLQ +DG+RLKV
Sbjct: 140 AMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPILAGKYAASYVRGLQETDGNRLKV 199
Query: 201 AASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVN 260
AASCKHFTAYDLDNWNGVDRFHFNA+VSKQDIEDTF+VPFRMCV EGKVASVMCSYNQVN
Sbjct: 200 AASCKHFTAYDLDNWNGVDRFHFNAQVSKQDIEDTFNVPFRMCVKEGKVASVMCSYNQVN 259
Query: 261 GVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLD 320
GVPTCADP +LKRT+RG+W LNGYIVSDCDSVGV+Y++QH+TSTPEEAAADAI+AGLDLD
Sbjct: 260 GVPTCADPILLKRTVRGQWGLNGYIVSDCDSVGVFYNSQHYTSTPEEAAADAIKAGLDLD 319
Query: 321 CGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTP 380
CGPFLG HT++AV++GL+SE D+N AL+NTLTVQMRLGM+DGEPSS PYG LGP+DVCTP
Sbjct: 320 CGPFLGQHTQNAVKKGLISETDVNGALLNTLTVQMRLGMYDGEPSSHPYGKLGPRDVCTP 379
Query: 381 DHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTT 440
HQELALEAARQGIVLLKN+GPSLPLS RH TVAVIGPNS+VTVTMIGNYAGIACGYT+
Sbjct: 380 SHQELALEAARQGIVLLKNKGPSLPLSTRRHPTVAVIGPNSNVTVTMIGNYAGIACGYTS 439
Query: 441 PLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAG 500
PL+GIGRY +TIH+ GC +VAC +D+ FG AI+ ++QADAT+LVMGLDQSIEAE +DRAG
Sbjct: 440 PLEGIGRYTKTIHELGCANVACTNDKQFGRAINVAQQADATVLVMGLDQSIEAETVDRAG 499
Query: 501 LLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
LLLPGRQQ+LVSKV+ ASKGPTILV+MSGGP+D+ FAKN+PRI AI+WAGYPGQAGG AI
Sbjct: 500 LLLPGRQQDLVSKVAAASKGPTILVIMSGGPVDITFAKNNPRIQAILWAGYPGQAGGAAI 559
Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGH 620
ADILFGTSNPGGKLPMTWYPQ YI NLPMT MAMR S+SK YPGRTYRFY GPVVYPFG+
Sbjct: 560 ADILFGTSNPGGKLPMTWYPQGYIKNLPMTNMAMRASRSKGYPGRTYRFYNGPVVYPFGY 619
Query: 621 GMSYTNFVHTVANAPTVVAVPLDG-RHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNV 679
G+SYT+FVHT+A+AP +V++P+DG RHG+ +++I+ KAIKVTHA+C +L++ +QVDVKNV
Sbjct: 620 GLSYTHFVHTLASAPKLVSIPVDGHRHGN-SSSIANKAIKVTHARCGKLSISLQVDVKNV 678
Query: 680 GSKDGAHTLLVFSTPPA--GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRS 737
GSKDG HTLLVFS PPA GHWAPHKQLVAF+K+H+P+ AQQRV +NIHVCK LSVVDRS
Sbjct: 679 GSKDGTHTLLVFSAPPAGNGHWAPHKQLVAFQKLHIPSKAQQRVNVNIHVCKLLSVVDRS 738
Query: 738 GTRRIPLGEHNIHIGGTKHSVSLHAATLGVIKS 770
GTRR+P+G H++HIG KH VSL A TLG+IKS
Sbjct: 739 GTRRVPMGLHSLHIGDVKHYVSLQAETLGIIKS 771
>gi|356501877|ref|XP_003519750.1| PREDICTED: probable beta-D-xylosidase 2-like [Glycine max]
Length = 772
Score = 1283 bits (3321), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 603/753 (80%), Positives = 680/753 (90%), Gaps = 4/753 (0%)
Query: 21 GLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLG 80
G AR+PFACDPK+ T+ LPFC+ SL RV DLIGRL+LQEKV LL++ AAAVPRLG
Sbjct: 21 GGEARDPFACDPKNTATKNLPFCKASLATGARVKDLIGRLTLQEKVNLLVNNAAAVPRLG 80
Query: 81 IKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEAR 140
IKGYEWWSEALHGVSNVGPGTKFGG FP ATSFPQVITTA+SFNA+LWEAIGRV SDEAR
Sbjct: 81 IKGYEWWSEALHGVSNVGPGTKFGGQFPAATSFPQVITTAASFNASLWEAIGRVASDEAR 140
Query: 141 AMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKV 200
AMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDP+L+GKYAASYVRGLQG+DG+RLKV
Sbjct: 141 AMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPILAGKYAASYVRGLQGTDGNRLKV 200
Query: 201 AASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVN 260
AASCKHFTAYDLDNWNGVDRFHFNA+VSKQDIEDTF+VPFRMCV EGKVASVMCSYNQVN
Sbjct: 201 AASCKHFTAYDLDNWNGVDRFHFNAQVSKQDIEDTFNVPFRMCVKEGKVASVMCSYNQVN 260
Query: 261 GVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLD 320
GVPTCADP +LKRT+RG+W LNGYIVSDCDSVGV+Y++QH+TSTPEEAAADAI+AGLDLD
Sbjct: 261 GVPTCADPILLKRTVRGQWGLNGYIVSDCDSVGVFYNSQHYTSTPEEAAADAIKAGLDLD 320
Query: 321 CGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTP 380
CGPFLG HT++AV++GL+SE D+N AL+NTLTVQMRLGM+DGEPSS PY +LGP+DVCT
Sbjct: 321 CGPFLGQHTQNAVKKGLISEADVNGALLNTLTVQMRLGMYDGEPSSHPYNNLGPRDVCTQ 380
Query: 381 DHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTT 440
HQELALEAARQGIVLLKN+GPSLPLS R RTVAVIGPNS+VT TMIGNYAGIACGYT+
Sbjct: 381 SHQELALEAARQGIVLLKNKGPSLPLSTRRGRTVAVIGPNSNVTFTMIGNYAGIACGYTS 440
Query: 441 PLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAG 500
PLQGIG Y +TI++ GC +VAC DD+ FG AI+A++QADAT+LVMGLDQSIEAE +DRA
Sbjct: 441 PLQGIGTYTKTIYEHGCANVACTDDKQFGRAINAAQQADATVLVMGLDQSIEAETVDRAS 500
Query: 501 LLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
LLLPG QQ+LVSKV+ ASKGPTILV+MSGGP+D+ FAKNDPRI I+WAGYPGQAGG AI
Sbjct: 501 LLLPGHQQDLVSKVAAASKGPTILVIMSGGPVDITFAKNDPRIQGILWAGYPGQAGGAAI 560
Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGH 620
ADILFGTSNPGGKLPMTWYPQ YI NLPMT MAMR S+SK YPGRTYRFY GPVVYPFG+
Sbjct: 561 ADILFGTSNPGGKLPMTWYPQGYIKNLPMTNMAMRASRSKGYPGRTYRFYNGPVVYPFGY 620
Query: 621 GMSYTNFVHTVANAPTVVAVPLDG-RHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNV 679
G+SYT+FVHT+ +AP +V++P+DG RHG+ ++ I+ KAIKVTHA+C +L++ + VDVKNV
Sbjct: 621 GLSYTHFVHTLTSAPKLVSIPVDGHRHGN-SSNIANKAIKVTHARCGKLSINLHVDVKNV 679
Query: 680 GSKDGAHTLLVFSTPPA--GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRS 737
GSKDG HTLLVFS PPA GHWAPHKQLVAFEKVH+PA AQQRV + IHVCK LSVVDRS
Sbjct: 680 GSKDGIHTLLVFSAPPAGNGHWAPHKQLVAFEKVHIPAKAQQRVRVKIHVCKLLSVVDRS 739
Query: 738 GTRRIPLGEHNIHIGGTKHSVSLHAATLGVIKS 770
GTRRIP+G H++HIG KHSVSL A TLG+IKS
Sbjct: 740 GTRRIPMGLHSLHIGDVKHSVSLQAETLGIIKS 772
>gi|356503923|ref|XP_003520749.1| PREDICTED: probable beta-D-xylosidase 2-like [Glycine max]
Length = 775
Score = 1279 bits (3309), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 598/772 (77%), Positives = 684/772 (88%), Gaps = 6/772 (0%)
Query: 1 MASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
+ + IA F L L L+ + AR+PFACDPK+ T +PFC+ SL IP+RV DL+GRL
Sbjct: 8 LLNLIAVFLL-LFLVRHTCE---ARDPFACDPKNGATENMPFCKASLAIPERVKDLVGRL 63
Query: 61 SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
+LQEKV+LL++ AAAVPRLG+KGYEWWSEALHGVSNVGPG KF FPGATSFPQVITTA
Sbjct: 64 TLQEKVRLLVNNAAAVPRLGMKGYEWWSEALHGVSNVGPGVKFNAQFPGATSFPQVITTA 123
Query: 121 SSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSG 180
+SFNA+LWEAIG+VVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVL+G
Sbjct: 124 ASFNASLWEAIGQVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLAG 183
Query: 181 KYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPF 240
YAASYVRGLQG+DG+RLKVAA CKHFTAYDLDNWNG+DRFHFNA+VSKQDIE+TFDVPF
Sbjct: 184 TYAASYVRGLQGTDGNRLKVAACCKHFTAYDLDNWNGMDRFHFNAQVSKQDIEETFDVPF 243
Query: 241 RMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQH 300
RMCV EGKVASVMCSYNQVNGVPTCADPN+LK+T+RG W+L+GYIVSDCDSVGV+YD QH
Sbjct: 244 RMCVSEGKVASVMCSYNQVNGVPTCADPNLLKKTVRGLWQLDGYIVSDCDSVGVFYDNQH 303
Query: 301 FTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMF 360
+T TPEEAAADAI+AGLDLDCGPFL +HT++AV++GLLSE D+N ALVNTLTVQMRLGMF
Sbjct: 304 YTPTPEEAAADAIKAGLDLDCGPFLAVHTQNAVEKGLLSEADVNGALVNTLTVQMRLGMF 363
Query: 361 DGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPN 420
DGEPS+ YG LGPKDVC P HQELALEAARQGIVLLKN GP LPLS RH TVAVIGPN
Sbjct: 364 DGEPSAHAYGKLGPKDVCKPAHQELALEAARQGIVLLKNTGPVLPLSPQRHHTVAVIGPN 423
Query: 421 SDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADA 480
S TVTMIGNYAG+ACGYT PLQGIGRYA+TIHQ GC++VAC +D+LFG+AI+A+RQADA
Sbjct: 424 SKATVTMIGNYAGVACGYTNPLQGIGRYAKTIHQLGCENVACKNDKLFGSAINAARQADA 483
Query: 481 TILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKND 540
T+LVMGLDQSIEAE +DR GLLLPGRQQ+LVSKV+ ASKGPTILV+MSGG +D+ FAKN+
Sbjct: 484 TVLVMGLDQSIEAETVDRTGLLLPGRQQDLVSKVAAASKGPTILVIMSGGSVDITFAKNN 543
Query: 541 PRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSK 600
PRI I+WAGYPGQAGG AIADILFGT+NPGGKLP+TWYPQEY+T LPMT MAMR S+S
Sbjct: 544 PRIVGILWAGYPGQAGGAAIADILFGTTNPGGKLPVTWYPQEYLTKLPMTNMAMRGSKSA 603
Query: 601 RYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKV 660
YPGRTYRFY GPVVYPFGHG++YT+FVHT+A+APTVV+VPL+G + IS +AI+V
Sbjct: 604 GYPGRTYRFYNGPVVYPFGHGLTYTHFVHTLASAPTVVSVPLNGHRRANVTNISNRAIRV 663
Query: 661 THAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA--GHWAPHKQLVAFEKVHVPAGAQ 718
THA+C++L++ ++VD+KNVGS+DG HTLLVFS PPA GHWA KQLVAFEK+HVPA
Sbjct: 664 THARCDKLSISLEVDIKNVGSRDGTHTLLVFSAPPAGFGHWALEKQLVAFEKIHVPAKGL 723
Query: 719 QRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIKS 770
QRVG+NIHVCK LSVVD+SG RRIPLGEH+ +IG KHSVSL AA LG+IKS
Sbjct: 724 QRVGVNIHVCKLLSVVDKSGIRRIPLGEHSFNIGDVKHSVSLQAAALGIIKS 775
>gi|357442285|ref|XP_003591420.1| Beta xylosidase [Medicago truncatula]
gi|355480468|gb|AES61671.1| Beta xylosidase [Medicago truncatula]
Length = 765
Score = 1273 bits (3294), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 600/769 (78%), Positives = 677/769 (88%), Gaps = 12/769 (1%)
Query: 5 IAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQE 64
I FL L+L+S+ AR+PFACDPK+ +T PFC+ SLPIP RVNDLIGRL+LQE
Sbjct: 6 ITIVFL-LLLMSSE-----ARDPFACDPKNTSTNNFPFCKASLPIPTRVNDLIGRLTLQE 59
Query: 65 KVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFN 124
KV +L++ AAAVPR+GIKGYEWWSEALHGVSNVGPGTKF G FP ATSFPQVITT +SFN
Sbjct: 60 KVSMLVNNAAAVPRVGIKGYEWWSEALHGVSNVGPGTKFAGQFPAATSFPQVITTVASFN 119
Query: 125 ATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAA 184
A+LWEAIGRV SDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDP+L+GKYAA
Sbjct: 120 ASLWEAIGRVASDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPILAGKYAA 179
Query: 185 SYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCV 244
SYVRGLQG+D RLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQD+EDTF+VPFRMCV
Sbjct: 180 SYVRGLQGTDSSRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDMEDTFNVPFRMCV 239
Query: 245 MEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTST 304
EG VASVMCSYNQVNGVPTCADPN+LKRTIRG+W L+GYIVSDCDSVGV+Y QH+TST
Sbjct: 240 KEGNVASVMCSYNQVNGVPTCADPNLLKRTIRGQWHLDGYIVSDCDSVGVFYTNQHYTST 299
Query: 305 PEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP 364
PEEAAADAI+AGLDLDCGPFL HT++AV++GLL+E D+N AL NTLTVQMRLGMFDGEP
Sbjct: 300 PEEAAADAIKAGLDLDCGPFLAQHTQNAVKKGLLTETDVNGALANTLTVQMRLGMFDGEP 359
Query: 365 SSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVT 424
S+QPYG+LGP DVCTP HQELAL+AARQGIVLLKN GPSLPLS H+TVAVIGPNS+ T
Sbjct: 360 SAQPYGNLGPTDVCTPTHQELALDAARQGIVLLKNTGPSLPLSTKNHQTVAVIGPNSNAT 419
Query: 425 VTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILV 484
VTMIGNYAGIACGYT+PLQGIG+YARTIH+ GC +VAC DD+ FG+A++A+RQADAT+LV
Sbjct: 420 VTMIGNYAGIACGYTSPLQGIGKYARTIHEPGCANVACNDDKQFGSALNAARQADATVLV 479
Query: 485 MGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIA 544
MGLDQSIEAE +DR GLLLPG QQ+LVSKV+ AS+GPTILVLMSGGPID+ FAKNDPRI
Sbjct: 480 MGLDQSIEAEMVDRTGLLLPGHQQDLVSKVAAASRGPTILVLMSGGPIDITFAKNDPRIM 539
Query: 545 AIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPG 604
I+WAGYPGQAGG AIADILFGT+NPG KLPMTWYPQ Y+ NL MT MAMRPS S YPG
Sbjct: 540 GILWAGYPGQAGGAAIADILFGTTNPGAKLPMTWYPQGYLKNLAMTNMAMRPSSSTGYPG 599
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDG-RHGSINATISGKAIKVTHA 663
RTYRFY GPVVYPFG+G+SYTNFVHT+A+AP VV+VP+DG R G+ + + AI+VTHA
Sbjct: 600 RTYRFYNGPVVYPFGYGLSYTNFVHTLASAPKVVSVPVDGHRRGN---SSNKAAIRVTHA 656
Query: 664 KCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA--GHWAPHKQLVAFEKVHVPAGAQQRV 721
+C +L++ + +DVKNVGSKDG +TLLVFS PP GHWAP KQLVAFEKV+VPA AQQRV
Sbjct: 657 RCGKLSIRLDIDVKNVGSKDGTNTLLVFSVPPTGNGHWAPQKQLVAFEKVYVPAKAQQRV 716
Query: 722 GINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIKS 770
INIHVCK LSVVD+SGTRRIP+G H+IHIG KH VSL LG+IK+
Sbjct: 717 RINIHVCKLLSVVDKSGTRRIPMGAHSIHIGDVKHFVSLQEQKLGIIKT 765
>gi|357511337|ref|XP_003625957.1| Beta-xylosidase [Medicago truncatula]
gi|355500972|gb|AES82175.1| Beta-xylosidase [Medicago truncatula]
Length = 771
Score = 1272 bits (3291), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 599/773 (77%), Positives = 681/773 (88%), Gaps = 5/773 (0%)
Query: 1 MASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
M+ST + L + + S AR+ FACD KDA T+ LPFC V L IP+RV DLIGRL
Sbjct: 1 MSSTFSLSPLITLFILLLQSSCDARDSFACDAKDAATKNLPFCNVKLAIPERVKDLIGRL 60
Query: 61 SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
++QEKV LL++ A AVPR+G+K YEWWSEALHGVSNVGPGT+FGG FP ATSFPQVITTA
Sbjct: 61 TMQEKVNLLVNNAPAVPRVGMKSYEWWSEALHGVSNVGPGTRFGGVFPAATSFPQVITTA 120
Query: 121 SSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSG 180
+SFNA+LWEAIGRVVSDEARAMYNGG AGLTYWSPNVNIFRDPRWGRGQETPGEDPVL+G
Sbjct: 121 ASFNASLWEAIGRVVSDEARAMYNGGAAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLAG 180
Query: 181 KYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPF 240
+YAASYV+GLQG+DG++LKVAA CKHFTAYD+DNWNGVDRFHFNA VSKQDIEDTFDVPF
Sbjct: 181 RYAASYVKGLQGTDGNKLKVAACCKHFTAYDVDNWNGVDRFHFNALVSKQDIEDTFDVPF 240
Query: 241 RMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQH 300
RMCV EGKVASVMCSYNQVNGVPTCADPN+LK+T+RG W L+GYIVSDCDSVGV Y++QH
Sbjct: 241 RMCVKEGKVASVMCSYNQVNGVPTCADPNLLKKTVRGVWGLDGYIVSDCDSVGVLYNSQH 300
Query: 301 FTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMF 360
+TSTPEEAAADAI+AGLDLDCGPFLG+HT+ AV++GLL+E D+NNALVNTL VQMRLGMF
Sbjct: 301 YTSTPEEAAADAIKAGLDLDCGPFLGVHTQDAVKKGLLTEADVNNALVNTLKVQMRLGMF 360
Query: 361 DGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPN 420
DGEPS+Q YG LGPKDVC P HQELALEAARQGIVLLKN GP+LPLS RHRTVAVIGPN
Sbjct: 361 DGEPSAQAYGRLGPKDVCKPAHQELALEAARQGIVLLKNTGPTLPLSPQRHRTVAVIGPN 420
Query: 421 SDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADA 480
SDVTVTMIGNYAGIACGYT+PLQGIGRYA+TIHQQGC +VAC DD+ FG A+DA+R ADA
Sbjct: 421 SDVTVTMIGNYAGIACGYTSPLQGIGRYAKTIHQQGCSNVACRDDKQFGPALDAARHADA 480
Query: 481 TILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKND 540
TILV+GLDQSIEAE +DR LLLPG QQ+LVSKV+ ASKGPTILVLMSGGP+D+ FAKND
Sbjct: 481 TILVIGLDQSIEAETVDRTSLLLPGHQQDLVSKVAAASKGPTILVLMSGGPVDITFAKND 540
Query: 541 PRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSK 600
P++A I+WAGYPGQAGG AIADILFGT++PGGKLP+TWYPQEY+ NL MT MAMRPS+
Sbjct: 541 PKVAGILWAGYPGQAGGAAIADILFGTASPGGKLPVTWYPQEYLKNLAMTNMAMRPSKIG 600
Query: 601 RYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDG-RHGSINATISGKAIK 659
YPGRTYRFYKGPVVYPFGHG++YT+FVH +++APTVV+VP+ G RHG+ N IS KAI+
Sbjct: 601 -YPGRTYRFYKGPVVYPFGHGLTYTHFVHELSSAPTVVSVPVHGHRHGN-NTNISNKAIR 658
Query: 660 VTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG--HWAPHKQLVAFEKVHVPAGA 717
VTHA+C +L++ + VDVKNVGS+DG HTLLVFS PP G HW P K LVAFEKVHVPA
Sbjct: 659 VTHARCGKLSIALHVDVKNVGSRDGTHTLLVFSAPPNGGNHWVPQKSLVAFEKVHVPAKT 718
Query: 718 QQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIKS 770
+QRV +NIHVCK LSVVD+SG RRIP+GEH++HIG KHSVSL A LG+IKS
Sbjct: 719 KQRVRVNIHVCKLLSVVDKSGIRRIPMGEHSLHIGDVKHSVSLQAEALGIIKS 771
>gi|356572781|ref|XP_003554544.1| PREDICTED: probable beta-D-xylosidase 2-like [Glycine max]
Length = 771
Score = 1270 bits (3286), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 588/749 (78%), Positives = 671/749 (89%), Gaps = 2/749 (0%)
Query: 24 AREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKG 83
AREPFACDPK+ T+ + FC+VSL I +RV DLIGRL+L+EKV+LL++ AAAVPRLG+KG
Sbjct: 23 AREPFACDPKNGGTKKMAFCKVSLAIAERVKDLIGRLTLEEKVRLLVNNAAAVPRLGMKG 82
Query: 84 YEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY 143
YEWWSEALHGVSN+GP KF FP ATSFPQVITTA+SFNA+LWEAIG+VVSDEARAMY
Sbjct: 83 YEWWSEALHGVSNLGPAVKFNAQFPAATSFPQVITTAASFNASLWEAIGQVVSDEARAMY 142
Query: 144 NGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAAS 203
NGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVL+G YAA+YVRGLQG+ +RLKVAA
Sbjct: 143 NGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLAGTYAATYVRGLQGTHANRLKVAAC 202
Query: 204 CKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVP 263
CKHFTAYDLDNWNG+DRFHFNA+VSKQDIEDTFDVPF+MCV EGKVASVMCSYNQVNGVP
Sbjct: 203 CKHFTAYDLDNWNGMDRFHFNAQVSKQDIEDTFDVPFKMCVSEGKVASVMCSYNQVNGVP 262
Query: 264 TCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGP 323
TCADPN+LK+T+RG W+L+GYIVSDCDSVGV+YD QH+T TPEEAAADAI+AGLDLDCGP
Sbjct: 263 TCADPNLLKKTVRGLWQLDGYIVSDCDSVGVFYDNQHYTPTPEEAAADAIKAGLDLDCGP 322
Query: 324 FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQ 383
FL +HT++AV++GLLSE D+N ALVNTLTVQMRLGMFDGEP++ PYGHLGPKDVC P HQ
Sbjct: 323 FLAVHTQNAVKKGLLSEADVNGALVNTLTVQMRLGMFDGEPTAHPYGHLGPKDVCKPAHQ 382
Query: 384 ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQ 443
ELALEAARQGIVLLKN GP LPLS HRTVAVIGPNS T+TMIGNYAG+ACGYT PLQ
Sbjct: 383 ELALEAARQGIVLLKNTGPVLPLSSQLHRTVAVIGPNSKATITMIGNYAGVACGYTNPLQ 442
Query: 444 GIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLL 503
GIGRYART+HQ GC++VAC +D+LFG AI+A+RQADAT+LVMGLDQSIEAE +DR GLLL
Sbjct: 443 GIGRYARTVHQLGCQNVACKNDKLFGPAINAARQADATVLVMGLDQSIEAETVDRTGLLL 502
Query: 504 PGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADI 563
PGRQ +LVSKV+ ASKGPTILVLMSGGP+D+ FAKN+PRI I+WAGYPGQAGG AIADI
Sbjct: 503 PGRQPDLVSKVAAASKGPTILVLMSGGPVDITFAKNNPRIVGILWAGYPGQAGGAAIADI 562
Query: 564 LFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMS 623
LFGT+NPGGKLP+TWYP+EY+T LPMT MAMR ++S YPGRTYRFY GPVVYPFGHG++
Sbjct: 563 LFGTANPGGKLPVTWYPEEYLTKLPMTNMAMRATKSAGYPGRTYRFYNGPVVYPFGHGLT 622
Query: 624 YTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKD 683
YT+FVHT+A+APTVV+VPL+G + IS +AI+VTHA+C++L++ +QVD+KNVGS+D
Sbjct: 623 YTHFVHTLASAPTVVSVPLNGHRRANVTNISNRAIRVTHARCDKLSITLQVDIKNVGSRD 682
Query: 684 GAHTLLVFSTPPA--GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRR 741
G HTLLVFS PPA GHWA KQLVAFEKVHVPA Q RVG+NIHVCK LSVVDRSG RR
Sbjct: 683 GTHTLLVFSAPPAGFGHWALEKQLVAFEKVHVPAKGQHRVGVNIHVCKLLSVVDRSGIRR 742
Query: 742 IPLGEHNIHIGGTKHSVSLHAATLGVIKS 770
IPLGEH+ +IG KHSVSL AA LG+IKS
Sbjct: 743 IPLGEHSFNIGDVKHSVSLQAAALGIIKS 771
>gi|18378991|ref|NP_563659.1| beta-glucosidase [Arabidopsis thaliana]
gi|75250279|sp|Q94KD8.1|BXL2_ARATH RecName: Full=Probable beta-D-xylosidase 2; Short=AtBXL2; Flags:
Precursor
gi|14194121|gb|AAK56255.1|AF367266_1 At1g02640/T14P4_11 [Arabidopsis thaliana]
gi|23506063|gb|AAN28891.1| At1g02640/T14P4_11 [Arabidopsis thaliana]
gi|332189332|gb|AEE27453.1| beta-glucosidase [Arabidopsis thaliana]
Length = 768
Score = 1243 bits (3217), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 584/765 (76%), Positives = 668/765 (87%), Gaps = 7/765 (0%)
Query: 9 FLGLIL---LSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEK 65
FL +IL +S+SS + +RE FACD KDA T TL FCQ+S+PIP+RV DLIGRL+L EK
Sbjct: 8 FLAVILFFLISSSSVCVHSRETFACDTKDAATATLRFCQLSVPIPERVRDLIGRLTLAEK 67
Query: 66 VKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNA 125
V LL + AAA+PRLGIKGYEWWSEALHGVSNVGPGTKFGG +P ATSFPQVITT +SFNA
Sbjct: 68 VSLLGNTAAAIPRLGIKGYEWWSEALHGVSNVGPGTKFGGVYPAATSFPQVITTVASFNA 127
Query: 126 TLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAAS 185
+LWE+IGRVVS+EARAMYNGG GLTYWSPNVNI RDPRWGRGQETPGEDPV++GKYAAS
Sbjct: 128 SLWESIGRVVSNEARAMYNGGVGGLTYWSPNVNILRDPRWGRGQETPGEDPVVAGKYAAS 187
Query: 186 YVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVM 245
YVRGLQG+D RLKVAA CKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCV
Sbjct: 188 YVRGLQGNDRSRLKVAACCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVK 247
Query: 246 EGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP 305
EG VAS+MCSYNQVNGVPTCADPN+LK+TIR +W LNGYIVSDCDSVGV YDTQH+T TP
Sbjct: 248 EGNVASIMCSYNQVNGVPTCADPNLLKKTIRNQWGLNGYIVSDCDSVGVLYDTQHYTGTP 307
Query: 306 EEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPS 365
EEAAAD+I+AGLDLDCGPFLG HT AV++ LL E D++NAL+NTLTVQMRLGMFDG+ +
Sbjct: 308 EEAAADSIKAGLDLDCGPFLGAHTIDAVKKNLLRESDVDNALINTLTVQMRLGMFDGDIA 367
Query: 366 SQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTV 425
+QPYGHLGP VCTP H+ LALEAA+QGIVLLKN G SLPLS RHRTVAVIGPNSD TV
Sbjct: 368 AQPYGHLGPAHVCTPVHKGLALEAAQQGIVLLKNHGSSLPLSSQRHRTVAVIGPNSDATV 427
Query: 426 TMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVM 485
TMIGNYAG+ACGYT+P+QGI YARTIHQ+GC DV C DD+LF AA++A+R ADAT+LVM
Sbjct: 428 TMIGNYAGVACGYTSPVQGITGYARTIHQKGCVDVHCMDDRLFDAAVEAARGADATVLVM 487
Query: 486 GLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAA 545
GLDQSIEAE DR LLLPG+QQELVS+V+ A+KGP ILVLMSGGPID++FA+ D +I A
Sbjct: 488 GLDQSIEAEFKDRNSLLLPGKQQELVSRVAKAAKGPVILVLMSGGPIDISFAEKDRKIPA 547
Query: 546 IIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGR 605
I+WAGYPGQ GGTAIADILFG++NPGGKLPMTWYPQ+Y+TNLPMTEM+MRP SKR PGR
Sbjct: 548 IVWAGYPGQEGGTAIADILFGSANPGGKLPMTWYPQDYLTNLPMTEMSMRPVHSKRIPGR 607
Query: 606 TYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKC 665
TYRFY GPVVYPFGHG+SYT F H +A+AP V+ + + GR+G T+SGK+I+VTHA+C
Sbjct: 608 TYRFYDGPVVYPFGHGLSYTRFTHNIADAPKVIPIAVRGRNG----TVSGKSIRVTHARC 663
Query: 666 NRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINI 725
+RL+LGV V+V NVGS+DG HT+LVFS PP G WAP KQLVAFE+VHV G ++RV +NI
Sbjct: 664 DRLSLGVHVEVTNVGSRDGTHTMLVFSAPPGGEWAPKKQLVAFERVHVAVGEKKRVQVNI 723
Query: 726 HVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIKS 770
HVCKYLSVVDR+G RRIP+G+H IHIG H+VSL A+TLGVIKS
Sbjct: 724 HVCKYLSVVDRAGNRRIPIGDHGIHIGDESHTVSLQASTLGVIKS 768
>gi|9972374|gb|AAG10624.1|AC022521_2 Similar to xylosidase [Arabidopsis thaliana]
Length = 763
Score = 1243 bits (3215), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 584/765 (76%), Positives = 668/765 (87%), Gaps = 7/765 (0%)
Query: 9 FLGLIL---LSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEK 65
FL +IL +S+SS + +RE FACD KDA T TL FCQ+S+PIP+RV DLIGRL+L EK
Sbjct: 3 FLAVILFFLISSSSVCVHSRETFACDTKDAATATLRFCQLSVPIPERVRDLIGRLTLAEK 62
Query: 66 VKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNA 125
V LL + AAA+PRLGIKGYEWWSEALHGVSNVGPGTKFGG +P ATSFPQVITT +SFNA
Sbjct: 63 VSLLGNTAAAIPRLGIKGYEWWSEALHGVSNVGPGTKFGGVYPAATSFPQVITTVASFNA 122
Query: 126 TLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAAS 185
+LWE+IGRVVS+EARAMYNGG GLTYWSPNVNI RDPRWGRGQETPGEDPV++GKYAAS
Sbjct: 123 SLWESIGRVVSNEARAMYNGGVGGLTYWSPNVNILRDPRWGRGQETPGEDPVVAGKYAAS 182
Query: 186 YVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVM 245
YVRGLQG+D RLKVAA CKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCV
Sbjct: 183 YVRGLQGNDRSRLKVAACCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVK 242
Query: 246 EGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP 305
EG VAS+MCSYNQVNGVPTCADPN+LK+TIR +W LNGYIVSDCDSVGV YDTQH+T TP
Sbjct: 243 EGNVASIMCSYNQVNGVPTCADPNLLKKTIRNQWGLNGYIVSDCDSVGVLYDTQHYTGTP 302
Query: 306 EEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPS 365
EEAAAD+I+AGLDLDCGPFLG HT AV++ LL E D++NAL+NTLTVQMRLGMFDG+ +
Sbjct: 303 EEAAADSIKAGLDLDCGPFLGAHTIDAVKKNLLRESDVDNALINTLTVQMRLGMFDGDIA 362
Query: 366 SQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTV 425
+QPYGHLGP VCTP H+ LALEAA+QGIVLLKN G SLPLS RHRTVAVIGPNSD TV
Sbjct: 363 AQPYGHLGPAHVCTPVHKGLALEAAQQGIVLLKNHGSSLPLSSQRHRTVAVIGPNSDATV 422
Query: 426 TMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVM 485
TMIGNYAG+ACGYT+P+QGI YARTIHQ+GC DV C DD+LF AA++A+R ADAT+LVM
Sbjct: 423 TMIGNYAGVACGYTSPVQGITGYARTIHQKGCVDVHCMDDRLFDAAVEAARGADATVLVM 482
Query: 486 GLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAA 545
GLDQSIEAE DR LLLPG+QQELVS+V+ A+KGP ILVLMSGGPID++FA+ D +I A
Sbjct: 483 GLDQSIEAEFKDRNSLLLPGKQQELVSRVAKAAKGPVILVLMSGGPIDISFAEKDRKIPA 542
Query: 546 IIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGR 605
I+WAGYPGQ GGTAIADILFG++NPGGKLPMTWYPQ+Y+TNLPMTEM+MRP SKR PGR
Sbjct: 543 IVWAGYPGQEGGTAIADILFGSANPGGKLPMTWYPQDYLTNLPMTEMSMRPVHSKRIPGR 602
Query: 606 TYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKC 665
TYRFY GPVVYPFGHG+SYT F H +A+AP V+ + + GR+G T+SGK+I+VTHA+C
Sbjct: 603 TYRFYDGPVVYPFGHGLSYTRFTHNIADAPKVIPIAVRGRNG----TVSGKSIRVTHARC 658
Query: 666 NRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINI 725
+RL+LGV V+V NVGS+DG HT+LVFS PP G WAP KQLVAFE+VHV G ++RV +NI
Sbjct: 659 DRLSLGVHVEVTNVGSRDGTHTMLVFSAPPGGEWAPKKQLVAFERVHVAVGEKKRVQVNI 718
Query: 726 HVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIKS 770
HVCKYLSVVDR+G RRIP+G+H IHIG H+VSL A+TLGVIKS
Sbjct: 719 HVCKYLSVVDRAGNRRIPIGDHGIHIGDESHTVSLQASTLGVIKS 763
>gi|297843058|ref|XP_002889410.1| hypothetical protein ARALYDRAFT_470222 [Arabidopsis lyrata subsp.
lyrata]
gi|297335252|gb|EFH65669.1| hypothetical protein ARALYDRAFT_470222 [Arabidopsis lyrata subsp.
lyrata]
Length = 763
Score = 1242 bits (3214), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 584/770 (75%), Positives = 667/770 (86%), Gaps = 7/770 (0%)
Query: 1 MASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
MA A F L+S+SS + +RE FACD KDA T TL FCQ+S+PI +RV DLIGRL
Sbjct: 1 MAFLAAILFF---LISSSSVCVQSRETFACDIKDAATATLRFCQLSVPITERVKDLIGRL 57
Query: 61 SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
+L EKV LL + AAA+PRLGIKGYEWWSEALHGVSNVGPGTKFGG +P ATSFPQVITT
Sbjct: 58 TLVEKVSLLGNTAAAIPRLGIKGYEWWSEALHGVSNVGPGTKFGGVYPAATSFPQVITTV 117
Query: 121 SSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSG 180
+SFNA+LWE+IGRVVS+EARAMYNGG GLTYWSPNVNI RDPRWGRGQETPGEDPV++G
Sbjct: 118 ASFNASLWESIGRVVSNEARAMYNGGVGGLTYWSPNVNILRDPRWGRGQETPGEDPVVAG 177
Query: 181 KYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPF 240
KYAASYVRGLQG+D RLKVAA CKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPF
Sbjct: 178 KYAASYVRGLQGNDRSRLKVAACCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPF 237
Query: 241 RMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQH 300
RMCV EG VAS+MCSYN+VNGVPTCADPN+LK+TIR EW LNGYIVSDCDSVGV YDTQH
Sbjct: 238 RMCVKEGNVASIMCSYNEVNGVPTCADPNLLKKTIRNEWGLNGYIVSDCDSVGVLYDTQH 297
Query: 301 FTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMF 360
+T TPEEAAAD+I+AGLDLDCGPFLG HT AV++ LL E D++NAL+NTLTVQMRLGMF
Sbjct: 298 YTGTPEEAAADSIKAGLDLDCGPFLGAHTIDAVKKNLLRESDVDNALINTLTVQMRLGMF 357
Query: 361 DGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPN 420
DG+ ++QPYGHLGP VCTP H+ LALEAA+QGIVLLKN G SLPLS RHRTVAVIGPN
Sbjct: 358 DGDIAAQPYGHLGPAHVCTPVHKGLALEAAQQGIVLLKNHGSSLPLSSQRHRTVAVIGPN 417
Query: 421 SDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADA 480
SD TV MIGNYAGIACGYT+P+QGI YART+HQ+GC DV C DD+LF AA++A+R ADA
Sbjct: 418 SDATVAMIGNYAGIACGYTSPVQGITGYARTVHQKGCVDVHCMDDRLFDAAVEAARGADA 477
Query: 481 TILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKND 540
T+LVMGLDQSIEAE DR LLLPG+QQEL+S+V+ A+KGP ILVLMSGGPID++FA+ D
Sbjct: 478 TVLVMGLDQSIEAEFKDRNSLLLPGKQQELISRVAKAAKGPVILVLMSGGPIDISFAEKD 537
Query: 541 PRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSK 600
+I AI+WAGYPGQ GGTAIADILFG++NPGGKLPMTWYPQ+Y+TNLPMTEM+MRP SK
Sbjct: 538 RKIPAIVWAGYPGQEGGTAIADILFGSANPGGKLPMTWYPQDYLTNLPMTEMSMRPIHSK 597
Query: 601 RYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKV 660
R PGRTYRFY GPVVYPFGHG+SYT F H++A+AP V+ + + GR+G T+SGK+I+V
Sbjct: 598 RIPGRTYRFYDGPVVYPFGHGLSYTRFTHSIADAPKVIPIAVRGRNG----TVSGKSIRV 653
Query: 661 THAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQR 720
THA+CNRL+LGV VDV NVGS+DG HT+LVFS PP G WAP KQLVAFE+VHV G ++R
Sbjct: 654 THARCNRLSLGVHVDVTNVGSRDGTHTMLVFSAPPGGEWAPKKQLVAFERVHVAVGEKKR 713
Query: 721 VGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIKS 770
V +NIHVCKYLSVVDR+G RRIP+G+H IHIG H+VSL A+TLGVIKS
Sbjct: 714 VQVNIHVCKYLSVVDRAGNRRIPIGDHGIHIGDESHTVSLQASTLGVIKS 763
>gi|371917282|dbj|BAL44717.1| SlArf/Xyl2 [Solanum lycopersicum]
Length = 774
Score = 1225 bits (3170), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 579/769 (75%), Positives = 661/769 (85%), Gaps = 5/769 (0%)
Query: 3 STIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSL 62
ST + F + +S AAR PFACD K+ R PFCQ +LPI RV DLIGRL+L
Sbjct: 10 STSSLFIFIFLFVSIQ----AARPPFACDQKNRAFRNFPFCQTNLPIGDRVRDLIGRLTL 65
Query: 63 QEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASS 122
QEKVKLL + AAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGG+FPGATSFPQVITTA+S
Sbjct: 66 QEKVKLLGNNAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGEFPGATSFPQVITTAAS 125
Query: 123 FNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKY 182
FNA+LWE IGRVVSDEARAMYNG GLTYWSPNVNIFRDPRWGRGQETPGEDPV++ Y
Sbjct: 126 FNASLWEEIGRVVSDEARAMYNGEMGGLTYWSPNVNIFRDPRWGRGQETPGEDPVVAALY 185
Query: 183 AASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFR 241
A YVRGLQG+ DGD LKVAA CKH+TAYDLDNW GVDRFHFNAKV+KQDIEDTFDVPFR
Sbjct: 186 AERYVRGLQGNEDGDSLKVAACCKHYTAYDLDNWGGVDRFHFNAKVTKQDIEDTFDVPFR 245
Query: 242 MCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHF 301
CV +GKVAS+MCSYNQVNG+PTCADP +L++TIRG W LNGYIVSDCDSVGV+YDTQH+
Sbjct: 246 SCVKQGKVASIMCSYNQVNGIPTCADPQLLRKTIRGGWGLNGYIVSDCDSVGVFYDTQHY 305
Query: 302 TSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFD 361
TSTPEEAAA AI+AGLDLDCGPFL HTE+AV G+L E I+ L NT+ VQMRLGMFD
Sbjct: 306 TSTPEEAAAAAIKAGLDLDCGPFLSQHTENAVHIGILKEAAIDTNLANTVAVQMRLGMFD 365
Query: 362 GEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNS 421
GEPS+Q YGHLGP+DVC+P HQELA+EAARQGIVLLKN GP+LPLS RHRTVAVIGPNS
Sbjct: 366 GEPSAQQYGHLGPRDVCSPAHQELAVEAARQGIVLLKNHGPALPLSPRRHRTVAVIGPNS 425
Query: 422 DVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADAT 481
DVTVTMIGNYAG+ACGYT+PLQGI +YA+TIH++GC DVAC+DD+LF A++A+RQADAT
Sbjct: 426 DVTVTMIGNYAGVACGYTSPLQGISKYAKTIHEKGCGDVACSDDKLFAGAVNAARQADAT 485
Query: 482 ILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDP 541
+LVMGLDQSIEAE DR GLLLPG QQEL+S+VS AS+GP +LVLMSGGP+DV FA NDP
Sbjct: 486 VLVMGLDQSIEAEFRDRTGLLLPGFQQELISEVSKASRGPVVLVLMSGGPVDVTFANNDP 545
Query: 542 RIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKR 601
RI AI+WAGYPGQ GG AIAD+LFG NPGGKLPMTWYPQEY+ NLPMT M MR + +K
Sbjct: 546 RIGAIVWAGYPGQGGGAAIADVLFGAHNPGGKLPMTWYPQEYLNNLPMTTMDMRSNLAKG 605
Query: 602 YPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVT 661
YPGRTYRFYKGP+VYPFGHG+SYT F+ T+ AP +A+P+DGRH ++TIS K+I+VT
Sbjct: 606 YPGRTYRFYKGPLVYPFGHGLSYTKFITTIFEAPKTLAIPIDGRHTYNSSTISNKSIRVT 665
Query: 662 HAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRV 721
HAKC+++++ + VDVKNVG KDG+HTLLVFS PP W PHKQLVAF+KV+VPA ++QRV
Sbjct: 666 HAKCSKISVQIHVDVKNVGPKDGSHTLLVFSKPPVDIWVPHKQLVAFQKVYVPARSKQRV 725
Query: 722 GINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIKS 770
INIHVCKYLSVVDR+G RRIP+GEH+IHIG KHS+SL A+ LGVIKS
Sbjct: 726 AINIHVCKYLSVVDRAGVRRIPIGEHSIHIGDAKHSLSLQASVLGVIKS 774
>gi|449436749|ref|XP_004136155.1| PREDICTED: probable beta-D-xylosidase 2-like [Cucumis sativus]
Length = 772
Score = 1200 bits (3105), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 568/763 (74%), Positives = 657/763 (86%), Gaps = 6/763 (0%)
Query: 12 LILLSA-SSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLI 70
LI+LSA G AREPFACDPKDA PFC+V+LPIP+RV DLIGRL+LQEKV+LL+
Sbjct: 12 LIILSAIFRHGGGAREPFACDPKDAALSRYPFCRVALPIPERVKDLIGRLTLQEKVRLLV 71
Query: 71 SGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEA 130
+ AAAVPRLGIKGYEWWSEALHGVSNVGPGT+FGGDFPGATSFPQVITT +SFN +LWEA
Sbjct: 72 NNAAAVPRLGIKGYEWWSEALHGVSNVGPGTEFGGDFPGATSFPQVITTVASFNVSLWEA 131
Query: 131 IGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGL 190
IGRVVSDEARAMYNGG AGLTYWSPNVNIFRDPRWGRGQETPGEDPV++G+YAA Y++GL
Sbjct: 132 IGRVVSDEARAMYNGGAAGLTYWSPNVNIFRDPRWGRGQETPGEDPVVAGEYAARYIKGL 191
Query: 191 QGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVA 250
QG+DGDRLKVAA CKHFTAYDLDNWNG DRFHFNAKV++QD+ DTF+VPFR CV EGKVA
Sbjct: 192 QGNDGDRLKVAACCKHFTAYDLDNWNGTDRFHFNAKVTRQDMVDTFEVPFRKCVKEGKVA 251
Query: 251 SVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAA 310
SVMCSYNQVNGVPTCADPN+LK TIR +W LNGYIVSDCDSVGV+YD QH+TST EEAAA
Sbjct: 252 SVMCSYNQVNGVPTCADPNLLKGTIRNQWGLNGYIVSDCDSVGVFYDNQHYTSTAEEAAA 311
Query: 311 DAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYG 370
DAI+AGLDLDCGPFL +HTE AV++GLL++ INNAL NT+TVQMRLGMFDG PSS YG
Sbjct: 312 DAIKAGLDLDCGPFLAVHTEDAVKKGLLTQTHINNALANTITVQMRLGMFDGAPSSHAYG 371
Query: 371 HLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGN 430
LGPK+VC+P HQ+LAL+AARQGIVLLKN+ P LPLS HRTVAVIGPNSDV VTMIGN
Sbjct: 372 KLGPKNVCSPSHQQLALDAARQGIVLLKNRLPGLPLSADHHRTVAVIGPNSDVNVTMIGN 431
Query: 431 YAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQS 490
YAG+ACGY TPL+GI RY +H++GC +VACA D F A+ A+ ADAT+LVMGLDQS
Sbjct: 432 YAGVACGYVTPLEGIKRYTTVVHRKGCDNVACATDYSFTDALAAASTADATVLVMGLDQS 491
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+EAE DR GLLLPGRQQELV KV+ AS+GPT+++LMSGGPIDV+FA NDPRI+AI+W G
Sbjct: 492 VEAETKDRDGLLLPGRQQELVLKVAAASRGPTVVILMSGGPIDVSFADNDPRISAILWVG 551
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFY 610
YPGQAGG AIAD+LFGT+NPGGKLPMTWYPQ Y++NLPMT MAMR + S YPGRTYRFY
Sbjct: 552 YPGQAGGAAIADVLFGTTNPGGKLPMTWYPQSYLSNLPMTNMAMRSTSS--YPGRTYRFY 609
Query: 611 KGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDG-RHGSINATISGKAIKVTHAKCNRLT 669
GPVVY FGHG+SYTNF+HT+ APT+V++ L G R +T+S KAI+VTHAKC +L+
Sbjct: 610 AGPVVYEFGHGLSYTNFIHTIVKAPTIVSISLSGHRQTHSASTLSSKAIRVTHAKCQKLS 669
Query: 670 LGVQVDVKNVGSKDGAHTLLVFSTPPA--GHWAPHKQLVAFEKVHVPAGAQQRVGINIHV 727
L + VDV+N G +DG HT+LVFSTPPA W P KQLVAFEK+H+ + ++R+ +++HV
Sbjct: 670 LVIHVDVENKGDRDGFHTMLVFSTPPANGATWVPRKQLVAFEKLHLASREKRRLQVHVHV 729
Query: 728 CKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIKS 770
CKYLSVVD+ G RRIPLG+H IHIG KH+VSL AATLG+IK+
Sbjct: 730 CKYLSVVDKLGVRRIPLGDHYIHIGNVKHTVSLQAATLGIIKT 772
>gi|449505346|ref|XP_004162442.1| PREDICTED: LOW QUALITY PROTEIN: probable beta-D-xylosidase 2-like
[Cucumis sativus]
Length = 772
Score = 1198 bits (3099), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 567/763 (74%), Positives = 656/763 (85%), Gaps = 6/763 (0%)
Query: 12 LILLSA-SSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLI 70
LI+LSA G AREPFACDPKDA PFC+V+LPIP+RV DLIGRL+LQEKV+LL+
Sbjct: 12 LIILSAIFRHGGGAREPFACDPKDAALSRYPFCRVALPIPERVKDLIGRLTLQEKVRLLV 71
Query: 71 SGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEA 130
+ AAAVPRLGIKGYEWWSEALHGVSNVGPGT+FGGDFPGATSFPQVITT +SFN +LWEA
Sbjct: 72 NNAAAVPRLGIKGYEWWSEALHGVSNVGPGTEFGGDFPGATSFPQVITTVASFNVSLWEA 131
Query: 131 IGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGL 190
IGRVVSDEARAMYNGG AGLTYWSPNVNIFRDPRWGRGQETPGEDPV++G+YAA Y++GL
Sbjct: 132 IGRVVSDEARAMYNGGAAGLTYWSPNVNIFRDPRWGRGQETPGEDPVVAGEYAARYIKGL 191
Query: 191 QGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVA 250
QG+DGDRLKVAA CKHFTAYDLDNWNG DRFHFNAKV++QD+ DTF+VPFR CV EGKVA
Sbjct: 192 QGNDGDRLKVAACCKHFTAYDLDNWNGTDRFHFNAKVTRQDMVDTFEVPFRKCVKEGKVA 251
Query: 251 SVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAA 310
SVMCSYNQVNGVPTCADPN+LK TIR +W LNGYIVSDCDSVGV+YD QH+TST EEAAA
Sbjct: 252 SVMCSYNQVNGVPTCADPNLLKGTIRNQWGLNGYIVSDCDSVGVFYDNQHYTSTAEEAAA 311
Query: 311 DAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYG 370
DAI+AGLDLDCGPFL +HTE AV++ LL++ INNAL NT+TVQMRLGMFDG PSS YG
Sbjct: 312 DAIKAGLDLDCGPFLAVHTEDAVKKXLLTQTHINNALANTITVQMRLGMFDGAPSSHAYG 371
Query: 371 HLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGN 430
LGPK+VC+P HQ+LAL+AARQGIVLLKN+ P LPLS HRTVAVIGPNSDV VTMIGN
Sbjct: 372 KLGPKNVCSPSHQQLALDAARQGIVLLKNRLPGLPLSAXHHRTVAVIGPNSDVNVTMIGN 431
Query: 431 YAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQS 490
YAG+ACGY TPL+GI RY +H++GC +VACA D F A+ A+ ADAT+LVMGLDQS
Sbjct: 432 YAGVACGYVTPLEGIKRYTTVVHRKGCDNVACATDYSFTDALAAASTADATVLVMGLDQS 491
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+EAE DR GLLLPGRQQELV KV+ AS+GPT+++LMSGGPIDV+FA NDPRI+AI+W G
Sbjct: 492 VEAETKDRDGLLLPGRQQELVLKVAAASRGPTVVILMSGGPIDVSFADNDPRISAILWVG 551
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFY 610
YPGQAGG AIAD+LFGT+NPGGKLPMTWYPQ Y++NLPMT MAMR + S YPGRTYRFY
Sbjct: 552 YPGQAGGAAIADVLFGTTNPGGKLPMTWYPQSYLSNLPMTNMAMRSTSS--YPGRTYRFY 609
Query: 611 KGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDG-RHGSINATISGKAIKVTHAKCNRLT 669
GPVVY FGHG+SYTNF+HT+ APT+V++ L G R +T+S KAI+VTHAKC +L+
Sbjct: 610 AGPVVYEFGHGLSYTNFIHTIVKAPTIVSISLSGHRQTHSASTLSSKAIRVTHAKCQKLS 669
Query: 670 LGVQVDVKNVGSKDGAHTLLVFSTPPA--GHWAPHKQLVAFEKVHVPAGAQQRVGINIHV 727
L + VDV+N G +DG HT+LVFSTPPA W P KQLVAFEK+H+ + ++R+ +++HV
Sbjct: 670 LVIHVDVENKGDRDGFHTMLVFSTPPANGATWVPRKQLVAFEKLHLASREKRRLQVHVHV 729
Query: 728 CKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIKS 770
CKYLSVVD+ G RRIPLG+H IHIG KH+VSL AATLG+IK+
Sbjct: 730 CKYLSVVDKLGVRRIPLGDHYIHIGNVKHTVSLQAATLGIIKT 772
>gi|224111912|ref|XP_002316021.1| predicted protein [Populus trichocarpa]
gi|222865061|gb|EEF02192.1| predicted protein [Populus trichocarpa]
Length = 768
Score = 1187 bits (3070), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 560/749 (74%), Positives = 641/749 (85%), Gaps = 2/749 (0%)
Query: 22 LAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGI 81
+ +REPFACDPK TR+L FC+V+LPI RV DLIGRL+LQEK++LL++ AAAVPRLGI
Sbjct: 22 VESREPFACDPKLGLTRSLKFCRVNLPIHVRVRDLIGRLTLQEKIRLLVNNAAAVPRLGI 81
Query: 82 KGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA 141
+GYEWWSEALHGVSNVGPGTKFGG FPGAT+FPQVITTA+SFN +LWE IGRVVSDEARA
Sbjct: 82 QGYEWWSEALHGVSNVGPGTKFGGAFPGATAFPQVITTAASFNESLWEEIGRVVSDEARA 141
Query: 142 MYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVA 201
MYNGG AGLTYWSPNVN+FRDPRWGRGQETPGEDPV++GKYAASYVRGLQG++G RLKVA
Sbjct: 142 MYNGGMAGLTYWSPNVNVFRDPRWGRGQETPGEDPVVAGKYAASYVRGLQGNNGLRLKVA 201
Query: 202 ASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNG 261
A CKH+TAYDLDNWNGVDR+HFNA+VSKQD+EDT++VPF+ CV+ GKVASVMCSYNQVNG
Sbjct: 202 ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKSCVVAGKVASVMCSYNQVNG 261
Query: 262 VPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC 321
PTCADP +LK TIRGEW LNGYIVSDCDSVGV +DTQH+T+TPEEAAA IRAGLDLDC
Sbjct: 262 KPTCADPYLLKNTIRGEWGLNGYIVSDCDSVGVLFDTQHYTATPEEAAASTIRAGLDLDC 321
Query: 322 GPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD 381
GPFL +HTE+AV+ GLL E D+N AL NT+TVQMRLGMFDGEPS+QP+G+LGP+DVCTP
Sbjct: 322 GPFLAIHTENAVKGGLLKEEDVNMALANTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 381
Query: 382 HQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTP 441
HQ+LAL+AARQGIVLL+N+G +LPLS +TVAVIGPNSDVTVTMIGNYAG+ACGYTTP
Sbjct: 382 HQQLALQAARQGIVLLQNRGRTLPLSRTL-QTVAVIGPNSDVTVTMIGNYAGVACGYTTP 440
Query: 442 LQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGL 501
LQGI RYA+T+H GC DV C +Q F AA A+R ADATILVMGLDQSIEAE DR GL
Sbjct: 441 LQGIRRYAKTVHHPGCNDVFCNGNQQFNAAEVAARHADATILVMGLDQSIEAEFRDRKGL 500
Query: 502 LLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIA 561
LLPG QQELVS V+ AS+GPTILVLMSGGPIDV+FAKNDPRI AI+W GYPGQAGG AIA
Sbjct: 501 LLPGYQQELVSIVARASRGPTILVLMSGGPIDVSFAKNDPRIGAILWVGYPGQAGGAAIA 560
Query: 562 DILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHG 621
D+LFGT+NPGGKLPMTWYP Y+ +PMT M MR S+ YPGRTYRFYKGPVV+PFGHG
Sbjct: 561 DVLFGTANPGGKLPMTWYPHNYLAKVPMTNMGMRADPSRGYPGRTYRFYKGPVVFPFGHG 620
Query: 622 MSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGS 681
MSYT F H++ AP V+VPL H S N T + AI+V+HA C L LGV +DVKN G
Sbjct: 621 MSYTTFAHSLVQAPREVSVPLASLHVSRNTTGASNAIRVSHANCEALALGVHIDVKNTGD 680
Query: 682 KDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRR 741
DG HTLLVFS+PP G W+ KQL+ FEKVH+ G+Q+RV I+IHVCK+LSVVDR G RR
Sbjct: 681 MDGTHTLLVFSSPPGGKWSTQKQLIGFEKVHLVTGSQKRVKIDIHVCKHLSVVDRFGIRR 740
Query: 742 IPLGEHNIHIGGTKHSVSLHAATLGVIKS 770
IP+GEH+++IG KHS+SL A L IKS
Sbjct: 741 IPIGEHDLYIGDLKHSISLQ-ANLEEIKS 768
>gi|225431898|ref|XP_002276351.1| PREDICTED: beta-D-xylosidase 1-like [Vitis vinifera]
Length = 770
Score = 1176 bits (3041), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 563/748 (75%), Positives = 654/748 (87%), Gaps = 2/748 (0%)
Query: 24 AREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKG 83
AREPFACDP++ TR LPFC+VSLPI +R DL+GRL+LQEK++LL++ A VPRLGIKG
Sbjct: 23 AREPFACDPRNGVTRNLPFCRVSLPIQERARDLVGRLTLQEKIRLLVNNAIDVPRLGIKG 82
Query: 84 YEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY 143
YEWWSEALHGVSNVGPGTKFGG FPGATSFPQVITTA+SFNA+LWE IGRVVSDEARAMY
Sbjct: 83 YEWWSEALHGVSNVGPGTKFGGSFPGATSFPQVITTAASFNASLWEEIGRVVSDEARAMY 142
Query: 144 NGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAAS 203
NGG AGLTYWSPNVNIFRDPRWGRGQETPGEDP ++ KYAA+YVRGLQG+ DRLKVAA
Sbjct: 143 NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPAVAAKYAAAYVRGLQGNARDRLKVAAC 202
Query: 204 CKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVP 263
CKH+TAYDLD+W G+DRFHFNA+VSKQD+EDT+DVPF+ CV+EG VASVMCSYNQVNG P
Sbjct: 203 CKHYTAYDLDHWGGIDRFHFNARVSKQDLEDTYDVPFKACVVEGNVASVMCSYNQVNGKP 262
Query: 264 TCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGP 323
TCADP++L+ TIRGEW+LNGYIVSDCDSVGV+YD QH+T+TPEEAAA AI+AGLDLDCGP
Sbjct: 263 TCADPHLLRDTIRGEWKLNGYIVSDCDSVGVFYDEQHYTATPEEAAAVAIKAGLDLDCGP 322
Query: 324 FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQ 383
FL +HTE+A++ G L+E D+N AL+NT++VQMRLGMFDGEPS+QPYG+LGP+DVCTP HQ
Sbjct: 323 FLAIHTEAAIRGGKLTEADVNGALMNTISVQMRLGMFDGEPSAQPYGNLGPRDVCTPAHQ 382
Query: 384 ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQ 443
+LALEAARQGIVL++N+GP+LPLS RHRT+AVIGPNSDVT TMIGNYAG+ACGYTTPLQ
Sbjct: 383 QLALEAARQGIVLVQNRGPALPLSTSRHRTIAVIGPNSDVTETMIGNYAGVACGYTTPLQ 442
Query: 444 GIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLL 503
GIGRYARTIHQ GC VAC DDQ FGAA+ A+RQADAT+LVMGLDQSIEAE DR +LL
Sbjct: 443 GIGRYARTIHQAGCSGVACRDDQQFGAAVAAARQADATVLVMGLDQSIEAEFRDRVDILL 502
Query: 504 PGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADI 563
PGRQQELVSKV++AS+GPT+LVLMSGGPIDV+FAKNDPRIAAIIW GYPGQAGGTAIAD+
Sbjct: 503 PGRQQELVSKVAVASRGPTVLVLMSGGPIDVSFAKNDPRIAAIIWVGYPGQAGGTAIADV 562
Query: 564 LFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMS 623
LFG +NPGGKLP+TWYPQ Y+ PMT MAMR S+ YPGRTYRFY GPVV+PFGHG+S
Sbjct: 563 LFGRTNPGGKLPVTWYPQSYLRKAPMTNMAMRAIPSRGYPGRTYRFYNGPVVFPFGHGLS 622
Query: 624 YTNFVHTVANAPTVVAVPLDGRHGSINATI-SGKAIKVTHAKCNRLTLGVQVDVKNVGSK 682
Y+ F H++A APT V+V L N+TI S AI+++HA CN LG +DVKN G+
Sbjct: 623 YSTFAHSLAQAPTTVSVSLASLQTIKNSTIVSSGAIRISHANCNTQPLGFHIDVKNTGTM 682
Query: 683 DGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRI 742
DG+HTLL+FSTPP G W+P+K+L+AFEKVHV AG+Q+RV ++HVCK+LSVVD G RI
Sbjct: 683 DGSHTLLLFSTPPPGTWSPNKRLLAFEKVHVGAGSQERVRFDVHVCKHLSVVDHFGIHRI 742
Query: 743 PLGEHNIHIGGTKHSVSLHAATLGVIKS 770
P+GEH+ HIG KHS+SL ATL IKS
Sbjct: 743 PMGEHHFHIGDLKHSISLQ-ATLEEIKS 769
>gi|408354266|gb|AFU54452.1| alpha-L-arabinofuranosidase/beta-D-xylosidase [Prunus salicina]
Length = 775
Score = 1170 bits (3028), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 559/740 (75%), Positives = 641/740 (86%), Gaps = 2/740 (0%)
Query: 24 AREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKG 83
AR PFACDP + TR L FC+V++PI RV DLIGRL+LQEK++LL++ A AVPRLGI+G
Sbjct: 28 ARPPFACDPHNPITRGLKFCRVTVPIHVRVQDLIGRLTLQEKIRLLVNNAIAVPRLGIQG 87
Query: 84 YEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY 143
YEWWSEALHGVSNVGPGTKFGG FPGATSFPQVITTA+SFN +LW+ IGRVV DEARAMY
Sbjct: 88 YEWWSEALHGVSNVGPGTKFGGAFPGATSFPQVITTAASFNESLWQEIGRVVPDEARAMY 147
Query: 144 NGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAA 202
NGG AGLTYWSPNVNIFRDPRWGRGQETPGEDPVL+ KYAA YV+GLQG G+RLKVAA
Sbjct: 148 NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLASKYAARYVKGLQGDGAGNRLKVAA 207
Query: 203 SCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGV 262
CKH+TAYDLDNWNGV+RFHFNA+VSKQD+ DT++VPF+ CV+EG VASVMCSYNQVNG
Sbjct: 208 CCKHYTAYDLDNWNGVNRFHFNARVSKQDLADTYNVPFKACVVEGHVASVMCSYNQVNGK 267
Query: 263 PTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG 322
PTCADP++LK TIRG+WRLNGYIVSDCDSVGV Y+ QH+T TPEEAAADAI+AGLDLDCG
Sbjct: 268 PTCADPDLLKGTIRGQWRLNGYIVSDCDSVGVLYEEQHYTRTPEEAAADAIKAGLDLDCG 327
Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
PFL +HTE+AV+RGL+S+++IN AL NT+TVQMRLGMFDGEPS+ YG+LGP+DVCTP H
Sbjct: 328 PFLAIHTEAAVRRGLVSQLEINWALANTMTVQMRLGMFDGEPSAHQYGNLGPRDVCTPAH 387
Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
Q+LALEAARQGIVLL+N+G SLPLS RHRTVAVIGPNSDVTVTMIGNYAG+ACGYTTPL
Sbjct: 388 QQLALEAARQGIVLLENRGRSLPLSIRRHRTVAVIGPNSDVTVTMIGNYAGVACGYTTPL 447
Query: 443 QGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLL 502
QGIGRY RTIHQ GC DV C +QLFGAA A+RQADAT+LVMGLDQSIEAE +DR GLL
Sbjct: 448 QGIGRYTRTIHQAGCTDVHCNGNQLFGAAEAAARQADATVLVMGLDQSIEAEFVDRVGLL 507
Query: 503 LPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIAD 562
LPG QQELVS+V+ AS+GPTILVLMSGGPIDV FAKNDPRI+AIIW GYPGQAGGTAIAD
Sbjct: 508 LPGHQQELVSRVARASRGPTILVLMSGGPIDVTFAKNDPRISAIIWVGYPGQAGGTAIAD 567
Query: 563 ILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGM 622
+LFGT+NPGGKLPMTWYPQ Y+T+LPMT+MAMR ++ YPGRTYRFY+GPVV+PFG G+
Sbjct: 568 VLFGTTNPGGKLPMTWYPQNYVTHLPMTDMAMRADPARGYPGRTYRFYRGPVVFPFGLGL 627
Query: 623 SYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT-LGVQVDVKNVGS 681
SYT F H +A+ PT V+VPL + N+T+ KA++V+HA CN L+ L V VDVKN GS
Sbjct: 628 SYTTFAHNLAHGPTSVSVPLTSLKATANSTMLSKAVRVSHADCNALSPLDVHVDVKNTGS 687
Query: 682 KDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRR 741
DG HTLLVF++PP G WA KQLV F K+H+ AG++ RV I +HVCK+LSVVDR G RR
Sbjct: 688 MDGTHTLLVFTSPPDGKWAASKQLVGFHKIHIAAGSETRVRIAVHVCKHLSVVDRFGIRR 747
Query: 742 IPLGEHNIHIGGTKHSVSLH 761
IPLGEH + IG H VSL
Sbjct: 748 IPLGEHKLQIGDLSHHVSLQ 767
>gi|408354264|gb|AFU54451.1| alpha-L-arabinofuranosidase/beta-D-xylosidase [Prunus salicina]
Length = 775
Score = 1168 bits (3021), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 558/740 (75%), Positives = 640/740 (86%), Gaps = 2/740 (0%)
Query: 24 AREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKG 83
AR PFACDP + TR L FC+V++PI RV DLIGRL+LQEK++LL++ A AVPRLGI+G
Sbjct: 28 ARPPFACDPHNPITRGLKFCRVTVPIHVRVQDLIGRLTLQEKIRLLVNNAIAVPRLGIQG 87
Query: 84 YEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY 143
YEWWSEALHGVSNVGPGTKFGG FPGATSFPQVITTA+SFN +LW+ IGR V DEARAMY
Sbjct: 88 YEWWSEALHGVSNVGPGTKFGGAFPGATSFPQVITTAASFNESLWQEIGRGVPDEARAMY 147
Query: 144 NGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAA 202
NGG AGLTYWSPNVNIFRDPRWGRGQETPGEDPVL+ KYAA YV+GLQG G+RLKVAA
Sbjct: 148 NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLASKYAARYVKGLQGDGAGNRLKVAA 207
Query: 203 SCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGV 262
CKH+TAYDLDNWNGV+RFHFNA+VSKQD+ DT++VPF+ CV+EG VASVMCSYNQVNG
Sbjct: 208 CCKHYTAYDLDNWNGVNRFHFNARVSKQDLADTYNVPFKACVVEGHVASVMCSYNQVNGK 267
Query: 263 PTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG 322
PTCADP++LK TIRG+WRLNGYIVSDCDSVGV Y+ QH+T TPEEAAADAI+AGLDLDCG
Sbjct: 268 PTCADPDLLKGTIRGQWRLNGYIVSDCDSVGVLYEEQHYTRTPEEAAADAIKAGLDLDCG 327
Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
PFL +HTE+AV+RGL+S+++IN AL NT+TVQMRLGMFDGEPS+ YG+LGP+DVCTP H
Sbjct: 328 PFLAIHTEAAVRRGLVSQLEINWALANTMTVQMRLGMFDGEPSAHQYGNLGPRDVCTPAH 387
Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
Q+LALEAARQGIVLL+N+G SLPLS RHRTVAVIGPNSDVTVTMIGNYAG+ACGYTTPL
Sbjct: 388 QQLALEAARQGIVLLENRGRSLPLSIRRHRTVAVIGPNSDVTVTMIGNYAGVACGYTTPL 447
Query: 443 QGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLL 502
QGIGRY RTIHQ GC DV C +QLFGAA A+RQADAT+LVMGLDQSIEAE +DR GLL
Sbjct: 448 QGIGRYTRTIHQAGCTDVHCNGNQLFGAAEAAARQADATVLVMGLDQSIEAEFVDRVGLL 507
Query: 503 LPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIAD 562
LPG QQELVS+V+ AS+GPTILVLMSGGPIDV FAKNDPRI+AIIW GYPGQAGGTAIAD
Sbjct: 508 LPGHQQELVSRVARASRGPTILVLMSGGPIDVTFAKNDPRISAIIWVGYPGQAGGTAIAD 567
Query: 563 ILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGM 622
+LFGT+NPGGKLPMTWYPQ Y+T+LPMT+MAMR ++ YPGRTYRFY+GPVV+PFG G+
Sbjct: 568 VLFGTTNPGGKLPMTWYPQNYVTHLPMTDMAMRADPARGYPGRTYRFYRGPVVFPFGLGL 627
Query: 623 SYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT-LGVQVDVKNVGS 681
SYT F H +A+ PT V+VPL + N+T+ KA++V+HA CN L+ L V VDVKN GS
Sbjct: 628 SYTTFAHNLAHGPTSVSVPLTSLKATANSTMLSKAVRVSHADCNALSPLDVHVDVKNTGS 687
Query: 682 KDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRR 741
DG HTLLVF++PP G WA KQLV F K+H+ AG++ RV I +HVCK+LSVVDR G RR
Sbjct: 688 MDGTHTLLVFTSPPDGKWAASKQLVGFHKIHIAAGSETRVRIAVHVCKHLSVVDRFGIRR 747
Query: 742 IPLGEHNIHIGGTKHSVSLH 761
IPLGEH + IG H VSL
Sbjct: 748 IPLGEHKLQIGDLSHHVSLQ 767
>gi|255556320|ref|XP_002519194.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
gi|223541509|gb|EEF43058.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
Length = 782
Score = 1158 bits (2996), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 550/742 (74%), Positives = 641/742 (86%), Gaps = 3/742 (0%)
Query: 22 LAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGI 81
+ +R PFACDP++ TR L FC+ +LPI RV DLI RL+LQEK++LL++ AAAVPRLGI
Sbjct: 36 VESRAPFACDPRNGVTRNLKFCRANLPIHVRVRDLISRLTLQEKIRLLVNNAAAVPRLGI 95
Query: 82 KGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA 141
+GYEWWSEALHGVSNVGPG KFGG FPGATSFPQVITTA+SFN +LWE IGRVVSDEARA
Sbjct: 96 QGYEWWSEALHGVSNVGPGVKFGGAFPGATSFPQVITTAASFNQSLWEQIGRVVSDEARA 155
Query: 142 MYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVA 201
MYNGG AGLTYWSPNVN+FRDPRWGRGQETPGEDPVL+GKYAASYVRGLQ S G +LKVA
Sbjct: 156 MYNGGLAGLTYWSPNVNVFRDPRWGRGQETPGEDPVLAGKYAASYVRGLQSSTGLKLKVA 215
Query: 202 ASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNG 261
A CKH+TAYDLDNWNGVDR+HFNA+VSKQD+EDT+DVPF+ CV+EGKVASVMCSYNQVNG
Sbjct: 216 ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYDVPFKACVVEGKVASVMCSYNQVNG 275
Query: 262 VPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC 321
PTCADP +LK TIRG+W LNGYIVSDCDSVGV YD QH+TSTPEEAAA I+AGLDLDC
Sbjct: 276 KPTCADPILLKNTIRGQWGLNGYIVSDCDSVGVLYDNQHYTSTPEEAAAATIKAGLDLDC 335
Query: 322 GPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD 381
GPFL +HTE+AV++GLL E D+N AL NT+TVQMRLGMFDGEPS+ PYG+LGP+DVCTP
Sbjct: 336 GPFLAIHTENAVKKGLLVEEDVNLALANTITVQMRLGMFDGEPSAHPYGNLGPRDVCTPA 395
Query: 382 HQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTP 441
HQELALEAARQGIVLL+N+G +LPLS RH T+AVIGPNSDVTVTMIGNYAGIAC YT+P
Sbjct: 396 HQELALEAARQGIVLLENRGQALPLSSSRHHTIAVIGPNSDVTVTMIGNYAGIACKYTSP 455
Query: 442 LQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGL 501
LQGI RYA+T+HQ GC DVAC +Q FGAA A+RQADAT+LVMGLDQSIEAE DR GL
Sbjct: 456 LQGISRYAKTLHQNGCGDVACHSNQQFGAAEAAARQADATVLVMGLDQSIEAEFRDRVGL 515
Query: 502 LLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIA 561
LLPG QQELVS+V+ AS+GPTILVLMSGGPIDV+FAKNDPR+ AI+WAGYPGQAGG AIA
Sbjct: 516 LLPGHQQELVSRVARASRGPTILVLMSGGPIDVSFAKNDPRVGAILWAGYPGQAGGAAIA 575
Query: 562 DILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHG 621
D+LFGT+NPGGKLPMTWYPQ Y+ +PMT M MRP + YPGRTYRFYKG VV+PFGHG
Sbjct: 576 DVLFGTTNPGGKLPMTWYPQGYLAKVPMTNMGMRPDPATGYPGRTYRFYKGNVVFPFGHG 635
Query: 622 MSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGS 681
MSYT+F H++ AP V++P+ + ++N TIS KAI+V+H C + +LG+ ++VKN G+
Sbjct: 636 MSYTSFSHSLTQAPKEVSLPITNLY-ALNTTISSKAIRVSHINC-QTSLGIDINVKNTGT 693
Query: 682 KDGAHTLLVFSTPPAGH-WAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTR 740
DG HTLLVFS+PP+G + +KQL+ FEKV + AG+Q +V I+IHVCK+LS VDR G R
Sbjct: 694 MDGTHTLLVFSSPPSGEKESSNKQLIGFEKVDLVAGSQIQVKIDIHVCKHLSAVDRFGIR 753
Query: 741 RIPLGEHNIHIGGTKHSVSLHA 762
RIP+G+H+I+IG KHS+SL A
Sbjct: 754 RIPIGDHHIYIGDLKHSISLQA 775
>gi|183579871|dbj|BAG28345.1| arabinofuranosidase [Citrus unshiu]
Length = 769
Score = 1158 bits (2995), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 548/746 (73%), Positives = 634/746 (84%), Gaps = 11/746 (1%)
Query: 19 SSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPR 78
S + +R PFACDP++ TR+L FC+ S+PI RV DLIGRL+LQEK++LL++ AAAVPR
Sbjct: 19 SRFVESRVPFACDPRNGLTRSLRFCRTSVPIHVRVQDLIGRLTLQEKIRLLVNNAAAVPR 78
Query: 79 LGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDE 138
LGI+GYEWWSEALHGVSNVGPGTKFGG FPGATSFPQVITTA++FN +LWE IGRVVSDE
Sbjct: 79 LGIQGYEWWSEALHGVSNVGPGTKFGGAFPGATSFPQVITTAAAFNESLWEEIGRVVSDE 138
Query: 139 ARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL 198
ARAMYNGG AGLTYWSPNVNIFRDPRWGRGQETPGEDPVL+GKYAASYVR LQG+ G RL
Sbjct: 139 ARAMYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLAGKYAASYVRRLQGNTGSRL 198
Query: 199 KVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQ 258
KVAA CKH+TAYDLDNWNGVDR+HFNA+VSKQD+EDT++VPF+ CV+EGKVASVMCSYNQ
Sbjct: 199 KVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQ 258
Query: 259 VNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLD 318
VNG PTCADP+ILK TIRG+WRL+GYIVSDCDSVGV Y+TQH+T TPEEAAADAI+AGLD
Sbjct: 259 VNGKPTCADPDILKNTIRGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLD 318
Query: 319 LDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVC 378
LDCGPFL +HTE AV+ GLL E D+N A T+TVQMRLGMFDGEPS+QP+G+LGP+DVC
Sbjct: 319 LDCGPFLAIHTEGAVRGGLLREEDVNLASAYTITVQMRLGMFDGEPSAQPFGNLGPRDVC 378
Query: 379 TPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGY 438
TP HQ+LAL+AA QGIVLLKN +LPLS +RH TVAVIGPNSDVTVTMIGNYAG+ACGY
Sbjct: 379 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 438
Query: 439 TTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDR 498
TTPLQGI RYA+TIHQ GC VAC +QL GAA A+RQADAT+LVMGLDQSIEAE +DR
Sbjct: 439 TTPLQGISRYAKTIHQAGCLGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 498
Query: 499 AGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGT 558
AGLLLPGRQQELVS+V+ AS+GP +LVLM GGP+DV+FAKNDPRI AI+W GYPGQAGG
Sbjct: 499 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 558
Query: 559 AIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPF 618
AIAD+LFG +NPGGKLPMTWYPQ+Y+ LPMT+M MR + YPGRTYRFYKGPVV+PF
Sbjct: 559 AIADVLFGRANPGGKLPMTWYPQDYVARLPMTDMRMRAGRG--YPGRTYRFYKGPVVFPF 616
Query: 619 GHGMSYTNFVHTVANAPTVVAVPL-DGRHGSINATISGKAIKVTHAKCN-RLTLGVQVDV 676
GHGMSYT F HT++ AP +VP+ + N TIS AI+V H CN ++LG+ VDV
Sbjct: 617 GHGMSYTTFAHTLSKAPNQFSVPIATSLYAFKNTTISSNAIRVAHTNCNDAMSLGLHVDV 676
Query: 677 KNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDR 736
KN G G HTLLVF+ PPAG+W+P+KQL+ F+KVHV AGA Q V ++IHVCK+LSVVD+
Sbjct: 677 KNTGDMAGTHTLLVFAKPPAGNWSPNKQLIGFKKVHVTAGALQSVRLDIHVCKHLSVVDK 736
Query: 737 SGTRRIP-------LGEHNIHIGGTK 755
G RRIP LG +IH+ +K
Sbjct: 737 FGIRRIPWVNIVSILGISSIHLSSSK 762
>gi|298364130|gb|ADI79208.1| alpha-L-arabinofuranosidase/beta-D-xylosidase [Malus x domestica]
Length = 774
Score = 1157 bits (2992), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 551/748 (73%), Positives = 631/748 (84%), Gaps = 4/748 (0%)
Query: 24 AREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKG 83
AR PFACDP++ TRTL FC+V +PI RV DLIGRL+LQEK+ LL++ A AVPRLGI+G
Sbjct: 28 ARPPFACDPRNPITRTLKFCRVRVPIHVRVQDLIGRLTLQEKIGLLVNNAIAVPRLGIQG 87
Query: 84 YEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY 143
YEWWSEALHGVSNVGPGTKFG F GATSFPQVITTA+SFN +LWE IGRVVSDEARAMY
Sbjct: 88 YEWWSEALHGVSNVGPGTKFG-TFLGATSFPQVITTAASFNESLWEEIGRVVSDEARAMY 146
Query: 144 NGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAA 202
NGG AGLT+WSPNVNIFRDPRWGRGQETPGEDP+L+ KY A YV+GLQG G+RLKVAA
Sbjct: 147 NGGAAGLTFWSPNVNIFRDPRWGRGQETPGEDPILAAKYGARYVKGLQGDGAGNRLKVAA 206
Query: 203 SCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGV 262
CKH+TAYDLDNWNGVDRFHFNA+VSKQD+EDT++VPFR CV++G VASVMCSYNQVNG
Sbjct: 207 CCKHYTAYDLDNWNGVDRFHFNARVSKQDLEDTYNVPFRACVVDGNVASVMCSYNQVNGK 266
Query: 263 PTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG 322
PTCADP +LK TIRG+W+LNGYIVSDCDSVGVYYD QH+T TPEEAAA AI+AGLDLDCG
Sbjct: 267 PTCADPELLKGTIRGQWKLNGYIVSDCDSVGVYYDNQHYTKTPEEAAAYAIKAGLDLDCG 326
Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
PFLG+HTE+AV+ G ++EIDIN AL NT+TVQMRLGMFDGEPS+Q YG+LG DVC P
Sbjct: 327 PFLGIHTEAAVRFGQVNEIDINYALANTITVQMRLGMFDGEPSAQRYGNLGLADVCKPSS 386
Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
ELALEAARQGIVLL+N+G SLPLS +RHRTVAVIGPNSDVT TMIGNYAGIACGYTTPL
Sbjct: 387 NELALEAARQGIVLLENRGNSLPLSTMRHRTVAVIGPNSDVTETMIGNYAGIACGYTTPL 446
Query: 443 QGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLL 502
QGI RY RTIHQ GC DV C +QL GAA A+RQADAT+LV+GLDQSIEAE DR LL
Sbjct: 447 QGIARYTRTIHQAGCTDVHCNGNQLIGAAEVAARQADATVLVIGLDQSIEAEFRDRTDLL 506
Query: 503 LPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIAD 562
LPG QQELVS+V+ AS+GPTILV+MSGGPIDV FAKNDPRI AIIW GYPGQAGGTAIAD
Sbjct: 507 LPGHQQELVSRVARASRGPTILVIMSGGPIDVTFAKNDPRIGAIIWVGYPGQAGGTAIAD 566
Query: 563 ILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGM 622
+LFGT+NP GKLPMTWYPQ Y+ NLPMT+MAMR ++ YPGRTYRFYKGPVV+PFG G+
Sbjct: 567 VLFGTTNPSGKLPMTWYPQNYVANLPMTDMAMRADPARGYPGRTYRFYKGPVVFPFGLGL 626
Query: 623 SYTNFVHTVANAPTVVAVPLDGRHGSINATISGKA-IKVTHAKCNRLTLGVQVDVKNVGS 681
SYT F H++A PT+V+VP S N T+ G I+V+H C+ L+L V +D+KN G+
Sbjct: 627 SYTRFSHSLAQGPTLVSVPFTSLVASKNTTMLGNHDIRVSHTNCDSLSLDVHIDIKNSGT 686
Query: 682 KDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRR 741
DG HTLLVF+TPP G WAP+KQLV F KVH+ AG+++RV + + VCK+LSVVD G RR
Sbjct: 687 MDGTHTLLVFATPPTGKWAPNKQLVGFHKVHIVAGSERRVRVGVQVCKHLSVVDELGIRR 746
Query: 742 IPLGEHNIHIGGTKHSVSLHAATLGVIK 769
IPLG+H + IG +H VS+ A LG IK
Sbjct: 747 IPLGQHKLEIGDLQHHVSVE-ANLGEIK 773
>gi|449484229|ref|XP_004156823.1| PREDICTED: LOW QUALITY PROTEIN: probable beta-D-xylosidase 2-like
[Cucumis sativus]
Length = 769
Score = 1155 bits (2988), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 547/772 (70%), Positives = 643/772 (83%), Gaps = 5/772 (0%)
Query: 1 MASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
M ST ILLSA G A+R PFACDP ++ T PFC+ SL + +RV DLIGRL
Sbjct: 1 MPSTSTLILSIFILLSAIH-GRASRAPFACDPNNSVTTDYPFCRRSLVVEERVKDLIGRL 59
Query: 61 SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
+L+EKVKLL+S A VPRLGIK Y+WWSEALHGVSNVGPGT+FGG+FP ATSFPQVI+TA
Sbjct: 60 TLEEKVKLLVSNAGGVPRLGIKAYQWWSEALHGVSNVGPGTRFGGEFPAATSFPQVISTA 119
Query: 121 SSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSG 180
+SFNA+LWEAIGRVVSDEARAMYNGG GLTYWSPNVNIFRDPRWGRGQETPGEDP+L+G
Sbjct: 120 ASFNASLWEAIGRVVSDEARAMYNGGVGGLTYWSPNVNIFRDPRWGRGQETPGEDPILAG 179
Query: 181 KYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPF 240
YA +YVRGLQG++G+RLKVAA CKHFTAYDLDNWNGVDRFHFNA+VSKQDIEDTF+VPF
Sbjct: 180 TYAVNYVRGLQGTEGNRLKVAACCKHFTAYDLDNWNGVDRFHFNAQVSKQDIEDTFEVPF 239
Query: 241 RMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQH 300
RMCV GKV+SVMCSYNQVNGVPTCADPN+L T+R +W L+GYIVSDCDSVGV+Y++QH
Sbjct: 240 RMCVKGGKVSSVMCSYNQVNGVPTCADPNLLTNTLRSQWHLDGYIVSDCDSVGVFYNSQH 299
Query: 301 FTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMF 360
+TSTPEEAAA AI+AGLDLDCG FL HTE+AV+RGLL+E IN AL NTL+VQMRLGMF
Sbjct: 300 YTSTPEEAAAMAIKAGLDLDCGSFLETHTENAVKRGLLNESHINGALSNTLSVQMRLGMF 359
Query: 361 DGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPN 420
DG+ +QPY HLG K VC+ +++LA++AARQGIVLL+N+ SLPLS RHR VAV+GPN
Sbjct: 360 DGDLKTQPYAHLGAKHVCSDHNRQLAVDAARQGIVLLENRRGSLPLSTNRHRIVAVVGPN 419
Query: 421 SDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADA 480
S+ T+TMIGNYAGIAC Y TPLQGI +Y RTIHQ+GC+ VAC ++ FG AI+A+R ADA
Sbjct: 420 SNATLTMIGNYAGIACEYITPLQGISKYTRTIHQEGCRGVACRSNKFFGGAIEAARVADA 479
Query: 481 TILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKND 540
+LVMGLDQSIEAE DRAGLLLPG Q +LV KV+ +KGP ILVLMSGGPIDV+FAK+
Sbjct: 480 VVLVMGLDQSIEAEFRDRAGLLLPGLQPDLVLKVASVAKGPVILVLMSGGPIDVSFAKDH 539
Query: 541 PRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSK 600
P+I+ IIW GYPGQAGG AIAD+LFG +NPGGKLPMTWYPQ+Y++ LPMT M++RP S
Sbjct: 540 PKISGIIWGGYPGQAGGLAIADVLFGQTNPGGKLPMTWYPQDYVSKLPMTTMSLRPGTS- 598
Query: 601 RYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDG-RHGSINATISGKAIK 659
YPGRTYRFYKGPVVYPFGHG+SYT F H + +APT + VP+ G RH + GKA++
Sbjct: 599 -YPGRTYRFYKGPVVYPFGHGLSYTAFTHKILSAPTTLTVPVTGHRHPHNGSEFWGKAVR 657
Query: 660 VTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQ 719
VTHAKC+RL+L ++V V+N+G++DGAHTLLV+S PP G W P KQLVAFEKVH+ A A +
Sbjct: 658 VTHAKCDRLSLVIKVAVRNIGARDGAHTLLVYSIPPMGVWVPQKQLVAFEKVHIDAQALK 717
Query: 720 RVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG-GTKHSVSLHAATLGVIKS 770
V INIHVCK LSVVD+ G RR+P+GEH I IG +H VSL TLG+IKS
Sbjct: 718 EVQINIHVCKLLSVVDKYGIRRVPMGEHGIDIGDNVRHIVSLQPQTLGIIKS 769
>gi|86553064|gb|AAS17751.2| beta xylosidase [Fragaria x ananassa]
Length = 772
Score = 1155 bits (2987), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 554/767 (72%), Positives = 651/767 (84%), Gaps = 9/767 (1%)
Query: 10 LGLILLSASSSGLA-----AREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQE 64
L LI L S L AR PFACDP++ TR FC+ +P+ RV DLIGRL+LQE
Sbjct: 9 LSLIALVLCVSALLFNLVHARPPFACDPRNPLTRGFKFCRTRVPVHVRVQDLIGRLTLQE 68
Query: 65 KVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFN 124
K++LL++ A AVPRLGI+GYEWWSEALHGVSNVGPGTKFGG FPGATSFPQVITTA+SFN
Sbjct: 69 KIRLLVNNAIAVPRLGIQGYEWWSEALHGVSNVGPGTKFGGAFPGATSFPQVITTAASFN 128
Query: 125 ATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAA 184
+LW+ IG+VVSDEARAMYNGG AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLS KYAA
Sbjct: 129 QSLWQEIGQVVSDEARAMYNGGQAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSAKYAA 188
Query: 185 SYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMC 243
SYV+GLQG G+RLKVAA CKH+TAYDLDNWNGVDRFHFNA+VSKQD+ DT+DVPFR C
Sbjct: 189 SYVKGLQGDGAGNRLKVAACCKHYTAYDLDNWNGVDRFHFNARVSKQDLADTYDVPFRGC 248
Query: 244 VMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTS 303
V+EGKVASVMCSYNQVNG PTCADP++LK TIRGEW+LNGYIVSDCDSVGV+YD QH+T
Sbjct: 249 VLEGKVASVMCSYNQVNGKPTCADPDLLKNTIRGEWKLNGYIVSDCDSVGVFYDQQHYTR 308
Query: 304 TPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGE 363
TPEEAAA+AI+AGLDLDCGPFL +HTE A++ GLL EID++ AL NTLTVQMRLGMFDGE
Sbjct: 309 TPEEAAAEAIKAGLDLDCGPFLAIHTEGAIKAGLLPEIDVDYALANTLTVQMRLGMFDGE 368
Query: 364 PSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDV 423
PS+Q YG+LGP+DVCTP HQELALEA+RQGIVLL+N G +LPLS +RHRTVAV+GPNSDV
Sbjct: 369 PSAQQYGNLGPRDVCTPAHQELALEASRQGIVLLQNNGHTLPLSTVRHRTVAVVGPNSDV 428
Query: 424 TVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATIL 483
T TMIGNYAG+ACGYTTPLQGIGRY +TIHQQGC +VAC +QLFGAA A+RQADAT+L
Sbjct: 429 TETMIGNYAGVACGYTTPLQGIGRYTKTIHQQGCTNVACTTNQLFGAAEAAARQADATVL 488
Query: 484 VMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRI 543
VMGLDQSIEAE DR L++PG QQELVS+V+ AS+GPT+LVLMSGGPIDV+FAKNDP+I
Sbjct: 489 VMGLDQSIEAEFRDRTDLVMPGHQQELVSRVARASRGPTVLVLMSGGPIDVSFAKNDPKI 548
Query: 544 AAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYP 603
AIIW GYPGQAGGTA+AD+LFGT+NP GKLPMTWYPQ+Y++ +PMT MAMR + YP
Sbjct: 549 GAIIWVGYPGQAGGTAMADVLFGTTNPSGKLPMTWYPQDYVSKVPMTNMAMRAGRG--YP 606
Query: 604 GRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHA 663
GRTYRFYKGPVV+PFG G+SYT F H++A PT V+VPL + N+T+ A++V+H
Sbjct: 607 GRTYRFYKGPVVFPFGLGLSYTTFAHSLAQVPTSVSVPLTSLSATTNSTMLSSAVRVSHT 666
Query: 664 KCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGI 723
CN L+L + V VKN G++DG HTLLVFS+PP+G WA +KQLV F KVH+ AG+ +RV +
Sbjct: 667 NCNPLSLALHVVVKNTGARDGTHTLLVFSSPPSGKWAANKQLVGFHKVHIVAGSHKRVKV 726
Query: 724 NIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIKS 770
++HVCK+LSVVD+ G RRIP+GEH + IG +H +S+ A +G I+S
Sbjct: 727 DVHVCKHLSVVDQFGIRRIPIGEHKLQIGDLEHHISVE-ANVGEIRS 772
>gi|449469042|ref|XP_004152230.1| PREDICTED: probable beta-D-xylosidase 2-like [Cucumis sativus]
Length = 769
Score = 1154 bits (2985), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 547/772 (70%), Positives = 643/772 (83%), Gaps = 5/772 (0%)
Query: 1 MASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
M ST ILLSA G A+R PFACDP ++ T PFC+ SL + +RV DLIGRL
Sbjct: 1 MPSTSTLILSIFILLSAIH-GRASRAPFACDPNNSVTTDYPFCRRSLVVGERVKDLIGRL 59
Query: 61 SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
+L+EKVKLL+S A VPRLGIK Y+WWSEALHGVSNVGPGT+FGG+FP ATSFPQVI+TA
Sbjct: 60 TLEEKVKLLVSNAGGVPRLGIKAYQWWSEALHGVSNVGPGTRFGGEFPAATSFPQVISTA 119
Query: 121 SSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSG 180
+SFNA+LWEAIGRVVSDEARAMYNGG GLTYWSPNVNIFRDPRWGRGQETPGEDP+L+G
Sbjct: 120 ASFNASLWEAIGRVVSDEARAMYNGGVGGLTYWSPNVNIFRDPRWGRGQETPGEDPILAG 179
Query: 181 KYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPF 240
YA +YVRGLQG++G+RLKVAA CKHFTAYDLDNWNGVDRFHFNA+VSKQDIEDTF+VPF
Sbjct: 180 TYAVNYVRGLQGTEGNRLKVAACCKHFTAYDLDNWNGVDRFHFNAQVSKQDIEDTFEVPF 239
Query: 241 RMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQH 300
RMCV GKV+SVMCSYNQVNGVPTCADPN+L T+R +W L+GYIVSDCDSVGV+Y++QH
Sbjct: 240 RMCVKGGKVSSVMCSYNQVNGVPTCADPNLLTNTLRSQWHLDGYIVSDCDSVGVFYNSQH 299
Query: 301 FTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMF 360
+TSTPEEAAA AI+AGLDLDCG FL HTE+AV+RGLL+E IN AL NTL+VQMRLGMF
Sbjct: 300 YTSTPEEAAAMAIKAGLDLDCGSFLETHTENAVKRGLLNESHINGALSNTLSVQMRLGMF 359
Query: 361 DGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPN 420
DG+ +QPY HLG K VC+ +++LA++AARQGIVLL+N+ SLPLS RHR VAV+GPN
Sbjct: 360 DGDLKTQPYAHLGAKHVCSDHNRQLAVDAARQGIVLLENRRGSLPLSTNRHRIVAVVGPN 419
Query: 421 SDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADA 480
S+ T+TMIGNYAGIAC Y TPLQGI +Y RTIHQ+GC+ VAC ++ FG AI+A+R ADA
Sbjct: 420 SNATLTMIGNYAGIACEYITPLQGISKYTRTIHQEGCRGVACRSNKFFGGAIEAARVADA 479
Query: 481 TILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKND 540
+LVMGLDQSIEAE DRAGLLLPG Q +LV KV+ +KGP ILVLMSGGPIDV+FAK+
Sbjct: 480 VVLVMGLDQSIEAEFRDRAGLLLPGLQPDLVLKVASVAKGPVILVLMSGGPIDVSFAKDH 539
Query: 541 PRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSK 600
P+I+ IIW GYPGQAGG AIAD+LFG +NPGGKLPMTWYPQ+Y++ LPMT M++RP S
Sbjct: 540 PKISGIIWGGYPGQAGGLAIADVLFGQTNPGGKLPMTWYPQDYVSKLPMTTMSLRPGTS- 598
Query: 601 RYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDG-RHGSINATISGKAIK 659
YPGRTYRFYKGPVVYPFGHG+SYT F H + +APT + VP+ G RH + GKA++
Sbjct: 599 -YPGRTYRFYKGPVVYPFGHGLSYTAFTHKILSAPTTLTVPVTGHRHPHNGSEFWGKAVR 657
Query: 660 VTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQ 719
VTHAKC+RL+L ++V V+N+G++DGAHTLLV+S PP G W P KQLVAFEKVH+ A A +
Sbjct: 658 VTHAKCDRLSLVIKVAVRNIGARDGAHTLLVYSIPPMGVWVPQKQLVAFEKVHIDAQALK 717
Query: 720 RVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG-GTKHSVSLHAATLGVIKS 770
V INIHVCK LSVVD+ G RR+P+GEH I IG +H VSL TLG+IKS
Sbjct: 718 EVQINIHVCKLLSVVDKYGIRRVPMGEHGIDIGDNVRHIVSLQPQTLGIIKS 769
>gi|157041199|dbj|BAF79669.1| beta-D-xylosidase [Pyrus pyrifolia]
Length = 774
Score = 1149 bits (2973), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 550/748 (73%), Positives = 634/748 (84%), Gaps = 4/748 (0%)
Query: 24 AREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKG 83
AR PFACDP++ TRTL FC+V +PI RV DLIGRL+LQEK+ LL++ A AVPRLGI+G
Sbjct: 28 ARPPFACDPRNPITRTLKFCRVRVPIHVRVQDLIGRLTLQEKIGLLVNNAIAVPRLGIQG 87
Query: 84 YEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY 143
YEWWSEALHGVSNVGPGTKFG F GATSFPQVITTA+SFN +LWE IGRVVSDEARAMY
Sbjct: 88 YEWWSEALHGVSNVGPGTKFG-TFLGATSFPQVITTAASFNESLWEEIGRVVSDEARAMY 146
Query: 144 NGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAA 202
NGG AGLT+WSPNVNIFRDPRWGRGQETPGEDPVL+ KY A YV+GLQG G+RLKVAA
Sbjct: 147 NGGAAGLTFWSPNVNIFRDPRWGRGQETPGEDPVLAAKYGARYVKGLQGDGAGNRLKVAA 206
Query: 203 SCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGV 262
CKH+TAYDLDNWNGVDRFHFNA+VSKQD+EDT++VPF+ CV++G VASVMCSYNQVNG
Sbjct: 207 CCKHYTAYDLDNWNGVDRFHFNARVSKQDLEDTYNVPFKACVVDGNVASVMCSYNQVNGK 266
Query: 263 PTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG 322
PTCADP++LK TIRG+W+LNGYIVSDCDSVGVYYD QH+T TPE AAA AI+AGLDLDCG
Sbjct: 267 PTCADPDLLKGTIRGQWKLNGYIVSDCDSVGVYYDNQHYTKTPEAAAAYAIKAGLDLDCG 326
Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
PFLG+HTE+A++ G ++EIDIN AL NT+TVQMRLGMFDGEPS+Q YG+LG DVC P
Sbjct: 327 PFLGIHTEAAIRTGQVNEIDINYALANTITVQMRLGMFDGEPSTQRYGNLGLADVCKPSS 386
Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
ELALEAARQGIVLL+N+G SLPLS IRHRTVAVIGPNSDVT TMIGNYAGIACGYTTPL
Sbjct: 387 NELALEAARQGIVLLENRGNSLPLSTIRHRTVAVIGPNSDVTETMIGNYAGIACGYTTPL 446
Query: 443 QGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLL 502
QGI RY RTIHQ GC DV C +QL GAA A+RQADAT+LV+GLDQSIEAE DR GLL
Sbjct: 447 QGIARYTRTIHQAGCTDVHCNGNQLIGAAEVAARQADATVLVIGLDQSIEAEFRDRTGLL 506
Query: 503 LPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIAD 562
LPG QQELVS+V+ AS+GPTILV+MSGGPIDV FAKNDPRI AIIW GYPGQAGGTAIAD
Sbjct: 507 LPGHQQELVSRVARASRGPTILVIMSGGPIDVTFAKNDPRIGAIIWVGYPGQAGGTAIAD 566
Query: 563 ILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGM 622
+LFGT+NP GKLPMTWYPQ Y+ NLPMT+MAMR ++ YPGRTYRFYKGPVV+PFG G+
Sbjct: 567 VLFGTTNPSGKLPMTWYPQNYVANLPMTDMAMRADPARGYPGRTYRFYKGPVVFPFGMGL 626
Query: 623 SYTNFVHTVANAPTVVAVPLDGRHGSINAT-ISGKAIKVTHAKCNRLTLGVQVDVKNVGS 681
SYT F H++A PT+V+VPL + N T +S ++V+H C+ L+L +D+KN G+
Sbjct: 627 SYTRFSHSLAQGPTLVSVPLTSLVAAKNTTMLSNHGVRVSHTNCDSLSLDFHIDIKNTGT 686
Query: 682 KDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRR 741
DG HTLLVF+T PAG WAP+KQLV F KVH+ AG+++RV + +HVCK+LS+VD+ G RR
Sbjct: 687 MDGTHTLLVFATQPAGKWAPNKQLVGFHKVHIVAGSERRVRVGVHVCKHLSIVDKLGIRR 746
Query: 742 IPLGEHNIHIGGTKHSVSLHAATLGVIK 769
IPLG+H + IG KH VS+ A LG IK
Sbjct: 747 IPLGQHKLEIGDLKHYVSIE-ANLGEIK 773
>gi|65736613|dbj|BAD98523.1| alpha-L-arabinofuranosidase / beta-D-xylosidase [Pyrus pyrifolia]
Length = 774
Score = 1146 bits (2965), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 549/748 (73%), Positives = 633/748 (84%), Gaps = 4/748 (0%)
Query: 24 AREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKG 83
AR PFACDP++ TRTL FC+V +PI RV DLIGRL+LQEK+ LL++ A AVPRLGI+G
Sbjct: 28 ARPPFACDPRNPITRTLKFCRVRVPIHVRVQDLIGRLTLQEKIGLLVNNAIAVPRLGIQG 87
Query: 84 YEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY 143
YEWWSEALHGVSNVGPGTKFG F GATSFPQVITTA+SFN +LWE IGRVVSDEARAMY
Sbjct: 88 YEWWSEALHGVSNVGPGTKFG-TFLGATSFPQVITTAASFNESLWEEIGRVVSDEARAMY 146
Query: 144 NGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAA 202
NGG AGLT+WSPNVNIFRDPRWGRGQETPGEDPVL+ KY A YV+GLQG G+RLKVAA
Sbjct: 147 NGGAAGLTFWSPNVNIFRDPRWGRGQETPGEDPVLAAKYGARYVKGLQGDGAGNRLKVAA 206
Query: 203 SCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGV 262
CKH+TAYDLDNWNGVDRFHFNA+VSKQD+EDT++VPF+ CV++G VASVMCSYNQVNG
Sbjct: 207 CCKHYTAYDLDNWNGVDRFHFNARVSKQDLEDTYNVPFKACVVDGNVASVMCSYNQVNGK 266
Query: 263 PTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG 322
PTCADP++LK TIRG+W+LNGYIVSDCDSVGVYYD QH+T TPE AAA AI+AGLDLDCG
Sbjct: 267 PTCADPDLLKGTIRGQWKLNGYIVSDCDSVGVYYDNQHYTKTPEAAAAYAIKAGLDLDCG 326
Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
PFLG+HTE+A++ G ++EIDIN AL NT+TVQMRLGMFDGEPS+Q YG+LG DVC P
Sbjct: 327 PFLGIHTEAAIRTGQVNEIDINYALANTITVQMRLGMFDGEPSTQRYGNLGLADVCKPSS 386
Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
ELALEAARQGIVLL+N+G SLPLS IRHRTVAVIGPNSDVT TMIGNYAGIACGYTTPL
Sbjct: 387 NELALEAARQGIVLLENRGNSLPLSTIRHRTVAVIGPNSDVTETMIGNYAGIACGYTTPL 446
Query: 443 QGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLL 502
QGI RY RTIHQ GC DV C +QL GAA A+RQADAT+LV+GLDQSIEAE DR GLL
Sbjct: 447 QGIARYTRTIHQAGCTDVHCNGNQLIGAAEVAARQADATVLVIGLDQSIEAEFRDRTGLL 506
Query: 503 LPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIAD 562
LPG QQELVS+V+ AS+GPTILV+MSGGPIDV FAKNDP I AIIW GYPGQAGGTAIAD
Sbjct: 507 LPGHQQELVSRVARASRGPTILVIMSGGPIDVTFAKNDPCIGAIIWVGYPGQAGGTAIAD 566
Query: 563 ILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGM 622
+LFGT+NP GKLPMTWYPQ Y+ NLPMT+MAMR ++ YPGRTYRFYKGPVV+PFG G+
Sbjct: 567 VLFGTTNPSGKLPMTWYPQNYVANLPMTDMAMRADPARGYPGRTYRFYKGPVVFPFGMGL 626
Query: 623 SYTNFVHTVANAPTVVAVPLDGRHGSINAT-ISGKAIKVTHAKCNRLTLGVQVDVKNVGS 681
SYT F H++A PT+V+VPL + N T +S ++V+H C+ L+L +D+KN G+
Sbjct: 627 SYTRFSHSLAQGPTLVSVPLTSLVAAKNTTMLSNHGVRVSHTNCDSLSLDFHIDIKNTGT 686
Query: 682 KDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRR 741
DG HTLLVF+T PAG WAP+KQLV F KVH+ AG+++RV + +HVCK+LS+VD+ G RR
Sbjct: 687 MDGTHTLLVFATQPAGKWAPNKQLVGFHKVHIVAGSERRVRVGVHVCKHLSIVDKLGIRR 746
Query: 742 IPLGEHNIHIGGTKHSVSLHAATLGVIK 769
IPLG+H + IG KH VS+ A LG IK
Sbjct: 747 IPLGQHKLEIGDLKHYVSIE-ANLGEIK 773
>gi|32481073|gb|AAP83934.1| auxin-induced beta-glucosidase [Chenopodium rubrum]
Length = 767
Score = 1142 bits (2954), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 546/764 (71%), Positives = 637/764 (83%), Gaps = 5/764 (0%)
Query: 1 MASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
M+S FF ++ + S+ AA P ACDPK TR L FC+V+LPI RV DLIGRL
Sbjct: 1 MSSRNNFFCFLVLFILLSAEARAA--PLACDPKSGLTRALRFCRVNLPIRARVQDLIGRL 58
Query: 61 SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
+LQEKVKLL++ AA VPRLGI GYEWWSEALHGVSNVGPGTKF G FP ATSFPQVITTA
Sbjct: 59 NLQEKVKLLVNNAAPVPRLGISGYEWWSEALHGVSNVGPGTKFRGAFPAATSFPQVITTA 118
Query: 121 SSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSG 180
+SFNA+LWEAIG+VVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDP L+
Sbjct: 119 ASFNASLWEAIGQVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPTLAS 178
Query: 181 KYAASYVRGLQG-SDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVP 239
+YAASYVRGLQG + +RLKVAA CKH+TAYDLDNWN VDRFHFNAKVSKQD+EDT++VP
Sbjct: 179 QYAASYVRGLQGIYNKNRLKVAACCKHYTAYDLDNWNAVDRFHFNAKVSKQDLEDTYNVP 238
Query: 240 FRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQ 299
F+ CV EG+VASVMCSYNQVNG PTCADP++L+ TIRG+WRLNGYIVSDCDSVGV YD Q
Sbjct: 239 FKGCVQEGRVASVMCSYNQVNGKPTCADPDLLRNTIRGQWRLNGYIVSDCDSVGVLYDDQ 298
Query: 300 HFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGM 359
H+T TPEEAAAD I+AGLDLDCGPFL +HTE+AV+RGLL+E D+N AL NT TVQMRLGM
Sbjct: 299 HYTRTPEEAAADTIKAGLDLDCGPFLAVHTEAAVKRGLLTEADVNQALTNTFTVQMRLGM 358
Query: 360 FDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGP 419
FDGE ++QP+GHLGPKDVC+P HQ+LAL+AARQGIVLL+N+G SLPLS RHR +AVIGP
Sbjct: 359 FDGEAAAQPFGHLGPKDVCSPAHQDLALQAARQGIVLLQNRGRSLPLSTARHRNIAVIGP 418
Query: 420 NSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQAD 479
N+D TVTMIGNYAG+ACGYT+PLQGI RYA+T+HQ GC VAC +Q FGAA A+ AD
Sbjct: 419 NADATVTMIGNYAGVACGYTSPLQGIARYAKTVHQAGCIGVACTSNQQFGAATAAAAHAD 478
Query: 480 ATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKN 539
AT+LVMGLDQSIEAE DRA +LLPG QQELVSKV++AS+GPTILVLM GGP+DV FAKN
Sbjct: 479 ATVLVMGLDQSIEAEFRDRASVLLPGHQQELVSKVALASRGPTILVLMCGGPVDVTFAKN 538
Query: 540 DPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQS 599
DP+I+AI+W GYPGQAGGTAIAD+LFGT+NPGGKLP TWYPQ Y+ +PMT++AMR + S
Sbjct: 539 DPKISAILWVGYPGQAGGTAIADVLFGTTNPGGKLPNTWYPQSYVAKVPMTDLAMRANPS 598
Query: 600 KRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGK-AI 658
YPGRTYRFYKGPVV+PFG G+SYT F ++A+APT V VPL + + N T K A+
Sbjct: 599 NGYPGRTYRFYKGPVVFPFGFGLSYTRFTQSLAHAPTKVMVPLANQFTNSNITSFNKDAL 658
Query: 659 KVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQ 718
KV H C+ + L + +DVKN G DG+HT+LVFSTPP G + KQL+ F++VHV AG++
Sbjct: 659 KVLHTNCDNIPLSLHIDVKNKGKVDGSHTILVFSTPPKGTKSSEKQLIGFKRVHVFAGSK 718
Query: 719 QRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGT-KHSVSLH 761
QRV +NIHVC +LS D G RRIP+GEH +HIG KH +SLH
Sbjct: 719 QRVRMNIHVCNHLSRADEFGVRRIPIGEHTLHIGDDHKHKLSLH 762
>gi|224099193|ref|XP_002311398.1| predicted protein [Populus trichocarpa]
gi|222851218|gb|EEE88765.1| predicted protein [Populus trichocarpa]
Length = 755
Score = 1140 bits (2948), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 545/741 (73%), Positives = 636/741 (85%), Gaps = 1/741 (0%)
Query: 22 LAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGI 81
+ R PFACD K+ TR+L FC+V++P+ RV DLIGRL+LQEK++LL++ AAAVPRLGI
Sbjct: 14 VECRAPFACDAKNGLTRSLKFCRVNMPLHVRVRDLIGRLTLQEKIRLLVNNAAAVPRLGI 73
Query: 82 KGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA 141
+GYEWWSEALHGVSNVGPGTKFGG FPGATSFPQVITTA+SFN +LWE IGRVVSDEARA
Sbjct: 74 QGYEWWSEALHGVSNVGPGTKFGGAFPGATSFPQVITTAASFNKSLWEEIGRVVSDEARA 133
Query: 142 MYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVA 201
M+NGG AGLTYWSPNVN+FRDPRWGRGQETPGEDPV++GKYAASYVRGLQG+ G RLKVA
Sbjct: 134 MFNGGMAGLTYWSPNVNVFRDPRWGRGQETPGEDPVVAGKYAASYVRGLQGNSGFRLKVA 193
Query: 202 ASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNG 261
A CKH+TAYDLDNWNGVDR+HFNA+VSKQD+EDT+DVPF+ CV+EGKVASVMCSYNQVNG
Sbjct: 194 ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYDVPFKSCVVEGKVASVMCSYNQVNG 253
Query: 262 VPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC 321
PTCADPN+LK TIRGEWRLNGYIVSDCDSVGV Y+ QH+T+TPEEAAA I+AGLDLDC
Sbjct: 254 KPTCADPNLLKNTIRGEWRLNGYIVSDCDSVGVLYENQHYTATPEEAAAATIKAGLDLDC 313
Query: 322 GPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD 381
GPFL +HTE+AV+ GLL+E D+N AL NT+TVQMRLG+FDGEPS+QP+G LGP+DVCTP
Sbjct: 314 GPFLAIHTENAVKGGLLNEEDVNMALANTITVQMRLGLFDGEPSAQPFGKLGPRDVCTPA 373
Query: 382 HQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTP 441
HQ+LAL AA+QGIVLL+N G +LPLS + TVAVIGP +DVTVTMIGNYAG+ACGYTTP
Sbjct: 374 HQQLALHAAQQGIVLLQNSGRTLPLSR-PNLTVAVIGPIADVTVTMIGNYAGVACGYTTP 432
Query: 442 LQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGL 501
LQGI RYA+TIHQ GC DVAC +Q FG A A+ QADAT+LVMGLDQSIEAE DR L
Sbjct: 433 LQGISRYAKTIHQSGCIDVACNGNQQFGMAEAAASQADATVLVMGLDQSIEAEFRDRKDL 492
Query: 502 LLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIA 561
LLPG QQEL+S+V+ AS+GPTILVLMSGGPIDV+FAKNDPRI AI+WAGYPGQAGG AIA
Sbjct: 493 LLPGYQQELISRVARASRGPTILVLMSGGPIDVSFAKNDPRIGAILWAGYPGQAGGAAIA 552
Query: 562 DILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHG 621
D+LFGT+NPGGKLPMTWYPQ+Y+ +PMT M MR S+ YPGRTYRFYKGPVV+PFGHG
Sbjct: 553 DVLFGTTNPGGKLPMTWYPQDYLAKVPMTNMGMRADPSRGYPGRTYRFYKGPVVFPFGHG 612
Query: 622 MSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGS 681
MSYT F H++ AP VAVP + N T + +I+V+HA C L LGV +DVKN G
Sbjct: 613 MSYTTFAHSLVQAPQEVAVPFTSLYALQNTTAARNSIRVSHANCEPLVLGVHIDVKNTGD 672
Query: 682 KDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRR 741
DG TLLVFS+PP G W+ +K+L+ FEKVH+ AG+++RV I+I VCK+LSVVDR G RR
Sbjct: 673 MDGIQTLLVFSSPPEGKWSANKKLIGFEKVHIVAGSKKRVKIDIPVCKHLSVVDRFGIRR 732
Query: 742 IPLGEHNIHIGGTKHSVSLHA 762
+P+G+H++HIG KHS+SL A
Sbjct: 733 LPIGKHDLHIGDLKHSISLQA 753
>gi|296083274|emb|CBI22910.3| unnamed protein product [Vitis vinifera]
Length = 738
Score = 1138 bits (2943), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 550/747 (73%), Positives = 636/747 (85%), Gaps = 32/747 (4%)
Query: 24 AREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKG 83
AREPFACDP++ TR LPFC+VSLPI +R DL+GRL+LQEK++LL++ A VPRLGIKG
Sbjct: 23 AREPFACDPRNGVTRNLPFCRVSLPIQERARDLVGRLTLQEKIRLLVNNAIDVPRLGIKG 82
Query: 84 YEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY 143
YEWWSEALHGVSNVGPGTKFGG FPGATSFPQVITTA+SFNA+LWE IGRVVSDEARAMY
Sbjct: 83 YEWWSEALHGVSNVGPGTKFGGSFPGATSFPQVITTAASFNASLWEEIGRVVSDEARAMY 142
Query: 144 NGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAAS 203
NGG AGLTYWSPNVNIFRDPRWGRGQETPGEDP ++ KYAA+YVRGLQG+ DRLKVAA
Sbjct: 143 NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPAVAAKYAAAYVRGLQGNARDRLKVAAC 202
Query: 204 CKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVP 263
CKH+TAYDLD+W G+DRFHFNA+VSKQD+EDT+DVPF+ CV+EG VASVMCSYNQVNG P
Sbjct: 203 CKHYTAYDLDHWGGIDRFHFNARVSKQDLEDTYDVPFKACVVEGNVASVMCSYNQVNGKP 262
Query: 264 TCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGP 323
TCADP++L+ TIRGEW+LNGYIVSDCDSVGV+YD QH+T+TPEEAAA AI+AGLDLDCGP
Sbjct: 263 TCADPHLLRDTIRGEWKLNGYIVSDCDSVGVFYDEQHYTATPEEAAAVAIKAGLDLDCGP 322
Query: 324 FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQ 383
FL +HTE+A++ G L+E D+N AL+NT++VQMRLGMFDGEPS+QPYG+LGP+DVCTP HQ
Sbjct: 323 FLAIHTEAAIRGGKLTEADVNGALMNTISVQMRLGMFDGEPSAQPYGNLGPRDVCTPAHQ 382
Query: 384 ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQ 443
+LALEAARQGIVL++N+GP+LPLS RHRT+AVIGPNSDVT TMIGNYAG+ACGYTTPLQ
Sbjct: 383 QLALEAARQGIVLVQNRGPALPLSTSRHRTIAVIGPNSDVTETMIGNYAGVACGYTTPLQ 442
Query: 444 GIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLL 503
GIGRYARTIHQ GC VAC DDQ FGAA+ A+RQADAT+LVMGLDQSIEAE DR +LL
Sbjct: 443 GIGRYARTIHQAGCSGVACRDDQQFGAAVAAARQADATVLVMGLDQSIEAEFRDRVDILL 502
Query: 504 PGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADI 563
PGRQQELVSKV++AS+GPT+LVLMSGGPIDV+FAKNDPRIAAIIW GYPGQAGGTAIAD+
Sbjct: 503 PGRQQELVSKVAVASRGPTVLVLMSGGPIDVSFAKNDPRIAAIIWVGYPGQAGGTAIADV 562
Query: 564 LFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMS 623
LFG +NPGGKLP+TWYPQ Y+ PMT MAMR S+ YPGRTYRFY GPVV+PFGHG+S
Sbjct: 563 LFGRTNPGGKLPVTWYPQSYLRKAPMTNMAMRAIPSRGYPGRTYRFYNGPVVFPFGHGLS 622
Query: 624 YTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKD 683
Y+ F H++A APT LG +DVKN G+ D
Sbjct: 623 YSTFAHSLAQAPTT-------------------------------PLGFHIDVKNTGTMD 651
Query: 684 GAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIP 743
G+HTLL+FSTPP G W+P+K+L+AFEKVHV AG+Q+RV ++HVCK+LSVVD G RIP
Sbjct: 652 GSHTLLLFSTPPPGTWSPNKRLLAFEKVHVGAGSQERVRFDVHVCKHLSVVDHFGIHRIP 711
Query: 744 LGEHNIHIGGTKHSVSLHAATLGVIKS 770
+GEH+ HIG KHS+SL ATL IKS
Sbjct: 712 MGEHHFHIGDLKHSISLQ-ATLEEIKS 737
>gi|356556038|ref|XP_003546334.1| PREDICTED: beta-D-xylosidase 1-like [Glycine max]
Length = 775
Score = 1126 bits (2913), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 522/739 (70%), Positives = 626/739 (84%), Gaps = 2/739 (0%)
Query: 25 REPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGY 84
R PFACDP++ TR FC +PI RV DLI RL+L EK++L+++ A AVPRLGI+GY
Sbjct: 34 RVPFACDPRNGLTRGFKFCNTHVPIHVRVQDLIARLTLPEKIRLVVNNAIAVPRLGIQGY 93
Query: 85 EWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN 144
EWWSEALHGVSNVGPGTKFGG FPGAT FPQVI+TA+SFN +LW+ IGRVVSDEARAMYN
Sbjct: 94 EWWSEALHGVSNVGPGTKFGGAFPGATMFPQVISTAASFNQSLWQEIGRVVSDEARAMYN 153
Query: 145 GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAAS 203
GG AGLTYWSPNVNIFRDPRWGRGQETPGEDP L+ KYAASYV+GLQG S G+ LKVAA
Sbjct: 154 GGQAGLTYWSPNVNIFRDPRWGRGQETPGEDPTLAAKYAASYVKGLQGDSAGNHLKVAAC 213
Query: 204 CKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVP 263
CKH+TAYDLDNWNGVDRFHFNAKVSKQD+EDT+DVPF+ CV+EG+VASVMCSYNQVNG P
Sbjct: 214 CKHYTAYDLDNWNGVDRFHFNAKVSKQDLEDTYDVPFKACVLEGQVASVMCSYNQVNGKP 273
Query: 264 TCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGP 323
TCADP++L+ TIRG+WRLNGYIVSDCDSVGV++D QH+T TPEEAAA+AI+AGLDLDCGP
Sbjct: 274 TCADPDLLRNTIRGQWRLNGYIVSDCDSVGVFFDNQHYTKTPEEAAAEAIKAGLDLDCGP 333
Query: 324 FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQ 383
FL +HT+SA+++GL+SE D+N AL N ++VQMRLGMFDGEPS+QPYG+LGP+DVCT HQ
Sbjct: 334 FLAIHTDSAIRKGLISENDLNLALANLISVQMRLGMFDGEPSTQPYGNLGPRDVCTSAHQ 393
Query: 384 ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQ 443
+LALEAAR+ IVLL+N+G SLPLS R RT+ V+GPN+D TVTMIGNYAG+ACGYTTPLQ
Sbjct: 394 QLALEAARESIVLLQNKGNSLPLSPSRLRTIGVVGPNADATVTMIGNYAGVACGYTTPLQ 453
Query: 444 GIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLL 503
GI RY +T HQ GC+ VAC ++LFGAA +RQADA +LVMGLDQ++EAE DR GLLL
Sbjct: 454 GIARYVKTAHQVGCRGVACRGNELFGAAETIARQADAIVLVMGLDQTVEAETRDRVGLLL 513
Query: 504 PGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADI 563
PG QQELV++V+ A+KGP IL++MSGGP+D++FAKNDP+I+AI+W GYPGQAGGTAIAD+
Sbjct: 514 PGLQQELVTRVARAAKGPVILLIMSGGPVDISFAKNDPKISAILWVGYPGQAGGTAIADV 573
Query: 564 LFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMS 623
+FGT+NPGG+LPMTWYPQ Y+ +PMT M MRP+ + YPGRTYRFYKGPVV+PFGHG+S
Sbjct: 574 IFGTTNPGGRLPMTWYPQGYLAKVPMTNMDMRPNPTTGYPGRTYRFYKGPVVFPFGHGLS 633
Query: 624 YTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKC-NRLTLGVQVDVKNVGSK 682
Y+ F H++A AP V+VP+ N+T+S KA+KV+HA C + L + VDVKN GS
Sbjct: 634 YSRFSHSLALAPKQVSVPIMSLQALTNSTLSSKAVKVSHANCDDSLEMEFHVDVKNEGSM 693
Query: 683 DGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRI 742
DG HTLL+FS PP G W+ KQLV F K HV AG++QRV + +HVCK+LSVVD+ G RRI
Sbjct: 694 DGTHTLLIFSQPPHGKWSQIKQLVGFHKTHVLAGSKQRVKVGVHVCKHLSVVDQFGVRRI 753
Query: 743 PLGEHNIHIGGTKHSVSLH 761
P GEH +HIG KHS+S+
Sbjct: 754 PTGEHELHIGDVKHSISVQ 772
>gi|356529243|ref|XP_003533205.1| PREDICTED: beta-D-xylosidase 1-like [Glycine max]
Length = 774
Score = 1117 bits (2888), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 521/739 (70%), Positives = 622/739 (84%), Gaps = 2/739 (0%)
Query: 25 REPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGY 84
R PFACDP++ TR FC +PI RV DLI RL+L EK++L+++ A AVPRLGI+GY
Sbjct: 33 RVPFACDPRNGLTRGFKFCNTHVPIHVRVQDLIARLTLPEKIRLVVNNAIAVPRLGIQGY 92
Query: 85 EWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN 144
EWWSEALHGVSNVGPGTKFGG FPGAT FPQVI+TA+SFN +LW+ IGRVVSDEARAMYN
Sbjct: 93 EWWSEALHGVSNVGPGTKFGGAFPGATMFPQVISTAASFNQSLWQEIGRVVSDEARAMYN 152
Query: 145 GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAAS 203
GG AGLTYWSPNVNIFRDPRWGRGQETPGEDP L+ KYAASYV+GLQG G+RLKVAA
Sbjct: 153 GGQAGLTYWSPNVNIFRDPRWGRGQETPGEDPTLAAKYAASYVKGLQGDGAGNRLKVAAC 212
Query: 204 CKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVP 263
CKH+TAYDLDNWNGVDRFHFNAKVSKQD+EDT+DVPF+ CV+EG+VASVMCSYNQVNG P
Sbjct: 213 CKHYTAYDLDNWNGVDRFHFNAKVSKQDLEDTYDVPFKACVLEGQVASVMCSYNQVNGKP 272
Query: 264 TCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGP 323
TCADP++L+ TIRG+W LNGYIVSDCDSVGV++D QH+T TPEEAAA+AI+AGLDLDCGP
Sbjct: 273 TCADPDLLRNTIRGQWGLNGYIVSDCDSVGVFFDNQHYTRTPEEAAAEAIKAGLDLDCGP 332
Query: 324 FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQ 383
FL +HT+SA+++GL+SE D+N AL N +TVQMRLGMFDGEPS+QP+G+LGP+DVCTP HQ
Sbjct: 333 FLAIHTDSAIRKGLISENDLNLALANLITVQMRLGMFDGEPSTQPFGNLGPRDVCTPAHQ 392
Query: 384 ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQ 443
+LALEAAR+ IVLL+N+G SLPLS R R V VIGPN+D TVTMIGNYAG+ACGYTTPLQ
Sbjct: 393 QLALEAARESIVLLQNKGNSLPLSPSRLRIVGVIGPNTDATVTMIGNYAGVACGYTTPLQ 452
Query: 444 GIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLL 503
GI RY +T HQ GC+ VAC ++LFGAA +RQ DAT+LVMGLDQ+IEAE DR GLLL
Sbjct: 453 GIARYVKTAHQVGCRGVACRGNELFGAAEIIARQVDATVLVMGLDQTIEAETRDRVGLLL 512
Query: 504 PGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADI 563
PG QQELV++V+ A+KGP ILV+MSGGP+DV+FAKN+P+I+AI+W GYPGQAGGTAIAD+
Sbjct: 513 PGLQQELVTRVARAAKGPVILVIMSGGPVDVSFAKNNPKISAILWVGYPGQAGGTAIADV 572
Query: 564 LFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMS 623
+FG +NPGG+LPMTWYPQ Y+ +PMT M MRP+ + YPGRTYRFYKGPVV+PFGHG+S
Sbjct: 573 IFGATNPGGRLPMTWYPQGYLAKVPMTNMDMRPNPATGYPGRTYRFYKGPVVFPFGHGLS 632
Query: 624 YTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKC-NRLTLGVQVDVKNVGSK 682
Y+ F ++A AP V+V + N+T+S KA+KV+HA C + L VDVKN GS
Sbjct: 633 YSRFSQSLALAPKQVSVQILSLQALTNSTLSSKAVKVSHANCDDSLETEFHVDVKNEGSM 692
Query: 683 DGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRI 742
DG HTLL+FS PP G W+ KQLV F K HVPAG++QR+ +N+H CK+LSVVD+ G RRI
Sbjct: 693 DGTHTLLIFSKPPPGKWSQIKQLVTFHKTHVPAGSKQRLKVNVHSCKHLSVVDQFGVRRI 752
Query: 743 PLGEHNIHIGGTKHSVSLH 761
P GEH +HIG KHS+++
Sbjct: 753 PTGEHELHIGDLKHSINVQ 771
>gi|357449039|ref|XP_003594795.1| Beta xylosidase [Medicago truncatula]
gi|355483843|gb|AES65046.1| Beta xylosidase [Medicago truncatula]
Length = 762
Score = 1097 bits (2838), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 527/766 (68%), Positives = 618/766 (80%), Gaps = 13/766 (1%)
Query: 5 IAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQE 64
I F L IL++ S R PFACDPK+ TR+ FC +PI RV DLIGRL+L E
Sbjct: 8 ITFMLLISILVTLSE----GRVPFACDPKNGLTRSYKFCNTRVPIHARVQDLIGRLALPE 63
Query: 65 KVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFN 124
K++L+++ A AVPRLGI+GYEWWSEALHGVSNVGPGTKFGG F ATSFPQVITTA+SFN
Sbjct: 64 KIRLVVNNAIAVPRLGIQGYEWWSEALHGVSNVGPGTKFGGAFSAATSFPQVITTAASFN 123
Query: 125 ATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAA 184
+LW IGR+VSDEARAMYNGG AGLT+WSPNVNIFRDPRWGRGQETPGEDP ++GKYAA
Sbjct: 124 QSLWLEIGRIVSDEARAMYNGGAAGLTFWSPNVNIFRDPRWGRGQETPGEDPTVAGKYAA 183
Query: 185 SYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMC 243
SYV+GLQG+ G+RLKVAA CKH+TAYDLDNWNGVDRFHFNAKVSKQD+ DT+DVPF+ C
Sbjct: 184 SYVQGLQGNGAGNRLKVAACCKHYTAYDLDNWNGVDRFHFNAKVSKQDLADTYDVPFKAC 243
Query: 244 VMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTS 303
V +GKVASVMCSYNQVNG PTCADP +L+ TIRGEW LNGYIVSDCDSVGV YD QH+T
Sbjct: 244 VRDGKVASVMCSYNQVNGKPTCADPELLRNTIRGEWGLNGYIVSDCDSVGVLYDNQHYTR 303
Query: 304 TPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGE 363
TPE+AAA AI+AGLDLDCGPFL LHT+ A+++GL+SE D+N AL N +TVQMRLGMFDG+
Sbjct: 304 TPEQAAAAAIKAGLDLDCGPFLALHTDGAIKQGLISENDLNLALANLITVQMRLGMFDGD 363
Query: 364 PSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDV 423
+QPYG+LG +DVC P H ++ALEAARQGIVLL+N+G +LPLS R+RTV VIGPNSDV
Sbjct: 364 --AQPYGNLGTRDVCLPSHNDVALEAARQGIVLLQNKGNALPLSPTRYRTVGVIGPNSDV 421
Query: 424 TVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATIL 483
TVTMIGNYAGIACGYTTPLQGI RY +TIHQ GCKDV C +QLFG + +RQADAT+L
Sbjct: 422 TVTMIGNYAGIACGYTTPLQGIARYVKTIHQAGCKDVGCGGNQLFGLSEQVARQADATVL 481
Query: 484 VMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRI 543
VMGLDQSIEAE DR GLLLPG QQELVS+V+ A++GP ILVLMSGGPIDV FAKNDP+I
Sbjct: 482 VMGLDQSIEAEFRDRTGLLLPGHQQELVSRVARAARGPVILVLMSGGPIDVTFAKNDPKI 541
Query: 544 AAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYP 603
+AI+W GYPGQ+GGTAIAD++FG +NP G+LP TWYPQ+Y+ +PMT M MR + + YP
Sbjct: 542 SAILWVGYPGQSGGTAIADVIFGRTNPSGRLPNTWYPQDYVRKVPMTNMDMRANPATGYP 601
Query: 604 GRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHA 663
GRTYRFYKGPVV+PFGHG+SY+ F H++A AP V+V S KA+KV+HA
Sbjct: 602 GRTYRFYKGPVVFPFGHGLSYSRFTHSLALAPKQVSVQFTTPLTQAFTNSSNKAMKVSHA 661
Query: 664 KCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGI 723
C+ L +G VDVKN GS DGAHTLLV+S P G KQLV F K +VPAG++ RV +
Sbjct: 662 NCDELEVGFHVDVKNEGSMDGAHTLLVYSKAPNG----VKQLVNFHKTYVPAGSKTRVKV 717
Query: 724 NIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIK 769
+HVC +LS VD G RRIP+GEH + IG KHS+ + TL IK
Sbjct: 718 GVHVCNHLSAVDEFGVRRIPMGEHELQIGDLKHSILVQ--TLDQIK 761
>gi|449466797|ref|XP_004151112.1| PREDICTED: beta-D-xylosidase 1-like [Cucumis sativus]
Length = 770
Score = 1088 bits (2814), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 519/735 (70%), Positives = 615/735 (83%), Gaps = 7/735 (0%)
Query: 30 CDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSE 89
CD ++ TR + FCQ SL I +RV DLIGRL+L EK++LL++ A AVPRLGI+GYEWWSE
Sbjct: 34 CDKRNVGTRNMGFCQESLGIEERVKDLIGRLTLGEKIRLLVNNAIAVPRLGIRGYEWWSE 93
Query: 90 ALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG 149
ALHGVSNVGPGTKFGG FPGATSFPQVITTA+SFN +LW IGRVVSDEARAMYNGGTAG
Sbjct: 94 ALHGVSNVGPGTKFGGTFPGATSFPQVITTAASFNQSLWLLIGRVVSDEARAMYNGGTAG 153
Query: 150 LTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD-RLKVAASCKHFT 208
LTYWSPNVNIFRDPRWGRGQETPGEDP+L+ KYAA+YV+GLQG+DG RLKVAA CKH+T
Sbjct: 154 LTYWSPNVNIFRDPRWGRGQETPGEDPILAAKYAANYVQGLQGNDGKKRLKVAACCKHYT 213
Query: 209 AYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADP 268
AYDLDNWNGVDR+HFNAKVSKQD+EDT++VPF+ CV+EGKVASVMCSYNQVNG PTCADP
Sbjct: 214 AYDLDNWNGVDRYHFNAKVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 273
Query: 269 NILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLH 328
++LK TIRG W L+GYIVSDCDSVGV YD+QHFT TPEEAAA I+AGLDLDCGPFL +H
Sbjct: 274 DLLKNTIRGAWGLDGYIVSDCDSVGVLYDSQHFTPTPEEAAASTIKAGLDLDCGPFLAVH 333
Query: 329 TESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALE 388
T +AV RGLL E+D+NNAL N L+VQMRLGMFDGEP++QPYG+LGPKDVCTP H+ LALE
Sbjct: 334 TATAVGRGLLKEVDLNNALANLLSVQMRLGMFDGEPAAQPYGNLGPKDVCTPAHKHLALE 393
Query: 389 AARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY 448
AARQGIVLL+N+ +LPLS RHRTVAVIGPNSD TVTMIGNYAG+AC YTTP+QGI +Y
Sbjct: 394 AARQGIVLLQNRAGALPLSPTRHRTVAVIGPNSDATVTMIGNYAGVACEYTTPVQGISKY 453
Query: 449 ARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQ 508
+TIH +GC +VAC DQL G A A+R ADA ++V+GLDQSIEAE+ DR G+LLPG+Q+
Sbjct: 454 VKTIHAKGCANVACVGDQLIGEAEAAARVADAAVVVVGLDQSIEAESRDRNGVLLPGKQE 513
Query: 509 ELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTS 568
ELV ++ +A KGPT++VLMSGGPIDV+FAKND +I+ I+W GYPGQAGG AIAD+LFG +
Sbjct: 514 ELVRRIGLACKGPTVVVLMSGGPIDVSFAKNDGKISGILWVGYPGQAGGAAIADVLFGAT 573
Query: 569 NPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFV 628
NPGGKLPMTWYPQ Y+ +PMT M +RP S YPGRTYRFYKGPVV+PFG G+SY+ F
Sbjct: 574 NPGGKLPMTWYPQSYLAKVPMTNMGLRPDPSTGYPGRTYRFYKGPVVFPFGFGLSYSKFS 633
Query: 629 HTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT-LGVQVDVKNVGSKDGAHT 687
+ A APT +++PL + +AT +KV+H C ++ L + +DVKN G+ DG+HT
Sbjct: 634 QSFAEAPTKISLPLSSLSPNSSAT-----VKVSHTDCASVSDLPIMIDVKNTGTVDGSHT 688
Query: 688 LLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEH 747
+LVFST P W+P K L+ FEKVH+ AG+Q+RV I IHVC +LS VD GTRRIP+GEH
Sbjct: 689 ILVFSTVPNQTWSPEKHLIGFEKVHLIAGSQKRVRIGIHVCDHLSRVDEFGTRRIPMGEH 748
Query: 748 NIHIGGTKHSVSLHA 762
+HIG HS+SL A
Sbjct: 749 KLHIGDLTHSISLQA 763
>gi|189380221|gb|ACD93208.1| beta xylosidase [Camellia sinensis]
Length = 767
Score = 1072 bits (2773), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 520/753 (69%), Positives = 609/753 (80%), Gaps = 18/753 (2%)
Query: 24 AREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKG 83
+R FACD TR LPFC+VSLPI RV DLIGRL+LQEK++LL++ AAAVPRLGIKG
Sbjct: 27 SRPAFACD---GATRNLPFCRVSLPIQDRVRDLIGRLTLQEKIRLLVNNAAAVPRLGIKG 83
Query: 84 YEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY 143
YEWWSEALHGVSN PG KFGG FPGATSFPQVI+TA+SFNA+LWE IGRVVSDEARAMY
Sbjct: 84 YEWWSEALHGVSNADPGVKFGGAFPGATSFPQVISTAASFNASLWEHIGRVVSDEARAMY 143
Query: 144 NGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAAS 203
NGG AGLTYWSPNVNIFRDPRWGRGQETPGEDPVL+GKYAASYVRGLQG+ G++LKVAA
Sbjct: 144 NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLAGKYAASYVRGLQGNSGNQLKVAAC 203
Query: 204 CKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVP 263
CKH+TAYDLDNWN VDR+ FNA+VSKQD+ DT+DVPF+ CV+EGK V C++ +
Sbjct: 204 CKHYTAYDLDNWNSVDRYRFNARVSKQDLADTYDVPFKACVVEGKY-QVYCAHT----IK 258
Query: 264 TCADPNILK----RTIRGEWR--LNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGL 317
A+P +L + W L+ + + C + + T H STPE+AAA I+AGL
Sbjct: 259 LMANPLVLTLISPQHHPWSWHSWLHCFRLYRCWGF-ICHSTLH--STPEDAAAATIKAGL 315
Query: 318 DLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDV 377
DL+CGPFL +HTE AV++G L E D+N AL+NTL+VQMRLGMFDGEPSSQPYG+LGP+DV
Sbjct: 316 DLECGPFLAIHTEQAVRQGKLGEADVNGALINTLSVQMRLGMFDGEPSSQPYGNLGPRDV 375
Query: 378 CTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACG 437
CTP HQ+LALEAARQGIVLL+N+G SLPLS HRTVAVIGPNSDVTVTM+GNYAG+ACG
Sbjct: 376 CTPAHQQLALEAARQGIVLLQNRGRSLPLSTQLHRTVAVIGPNSDVTVTMLGNYAGVACG 435
Query: 438 YTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALD 497
+TTPLQGI RY RTIHQ GC VAC+++QLFG A A+RQADAT+LVMGLDQSIE E D
Sbjct: 436 FTTPLQGIERYVRTIHQSGCDSVACSNNQLFGVAETAARQADATVLVMGLDQSIETEFKD 495
Query: 498 RAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGG 557
R GLLLPG QQELVS+V+MAS+GP +LVLMSGGPIDV+FAKNDPRI AI+W GYPGQAGG
Sbjct: 496 RVGLLLPGPQQELVSRVAMASRGPVVLVLMSGGPIDVSFAKNDPRIGAILWVGYPGQAGG 555
Query: 558 TAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYP 617
TAIAD+LFG +NPGG+LPMTWYPQ+Y+ PMT MAMR + S YPGRTYRFYKGPVV+P
Sbjct: 556 TAIADVLFGRTNPGGRLPMTWYPQDYLAKAPMTNMAMRANPSSGYPGRTYRFYKGPVVFP 615
Query: 618 FGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVK 677
FGHGMSYT F H +A+APT V+VPL +G N+T I+VTH C+ L LG+ +DVK
Sbjct: 616 FGHGMSYTTFAHELAHAPTTVSVPLTSLYGLQNSTTFNNGIRVTHTNCDTLILGIHIDVK 675
Query: 678 NVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRS 737
N G DG HT+LVFSTPP G W +KQL+ F+KVHV A +QRV I++HVC LSVVD+
Sbjct: 676 NTGDMDGTHTVLVFSTPPVGKWGANKQLIGFKKVHVVARGRQRVKIHVHVCNQLSVVDQF 735
Query: 738 GTRRIPLGEHNIHIGGTKHSVSLHAATLGVIKS 770
G RRIP+GEH++HIG KHS+SL TL IK+
Sbjct: 736 GIRRIPIGEHSLHIGDIKHSISLQ-VTLDNIKT 767
>gi|15239867|ref|NP_199747.1| beta-xylosidase 1 [Arabidopsis thaliana]
gi|75262458|sp|Q9FGY1.1|BXL1_ARATH RecName: Full=Beta-D-xylosidase 1; Short=AtBXL1; AltName:
Full=Alpha-L-arabinofuranosidase; Flags: Precursor
gi|9759419|dbj|BAB09906.1| xylosidase [Arabidopsis thaliana]
gi|21539545|gb|AAM53325.1| xylosidase [Arabidopsis thaliana]
gi|332008419|gb|AED95802.1| beta-xylosidase 1 [Arabidopsis thaliana]
Length = 774
Score = 1072 bits (2771), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 525/767 (68%), Positives = 618/767 (80%), Gaps = 17/767 (2%)
Query: 1 MASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
+ + + + L+ L SS L R FACDP + TRTL FC+ ++PI RV DL+GRL
Sbjct: 10 IGNKVVVILVFLLCLVHSSESL--RPLFACDPANGLTRTLRFCRANVPIHVRVQDLLGRL 67
Query: 61 SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
+LQEK++ L++ AAAVPRLGI GYEWWSEALHG+S+VGPG KFGG FPGATSFPQVITTA
Sbjct: 68 TLQEKIRNLVNNAAAVPRLGIGGYEWWSEALHGISDVGPGAKFGGAFPGATSFPQVITTA 127
Query: 121 SSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSG 180
+SFN +LWE IGRVVSDEARAMYNGG AGLTYWSPNVNI RDPRWGRGQETPGEDP+++
Sbjct: 128 ASFNQSLWEEIGRVVSDEARAMYNGGVAGLTYWSPNVNILRDPRWGRGQETPGEDPIVAA 187
Query: 181 KYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVP 239
KYAASYVRGLQG+ G+RLKVAA CKH+TAYDLDNWNGVDRFHFNAKV++QD+EDT++VP
Sbjct: 188 KYAASYVRGLQGTAAGNRLKVAACCKHYTAYDLDNWNGVDRFHFNAKVTQQDLEDTYNVP 247
Query: 240 FRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQ 299
F+ CV EGKVASVMCSYNQVNG PTCAD N+LK TIRG+WRLNGYIVSDCDSV V+++ Q
Sbjct: 248 FKSCVYEGKVASVMCSYNQVNGKPTCADENLLKNTIRGQWRLNGYIVSDCDSVDVFFNQQ 307
Query: 300 HFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGM 359
H+TSTPEEAAA +I+AGLDLDCGPFL + TE AV++GLL+E DIN AL NTLTVQMRLGM
Sbjct: 308 HYTSTPEEAAARSIKAGLDLDCGPFLAIFTEGAVKKGLLTENDINLALANTLTVQMRLGM 367
Query: 360 FDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGP 419
FDG PY +LGP+DVCTP H+ LALEAA QGIVLLKN SLPLS RHRTVAVIGP
Sbjct: 368 FDGNLG--PYANLGPRDVCTPAHKHLALEAAHQGIVLLKNSARSLPLSPRRHRTVAVIGP 425
Query: 420 NSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQAD 479
NSDVT TMIGNYAG AC YT+PLQGI RYART+HQ GC VAC +Q FGAA A+R+AD
Sbjct: 426 NSDVTETMIGNYAGKACAYTSPLQGISRYARTLHQAGCAGVACKGNQGFGAAEAAAREAD 485
Query: 480 ATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKN 539
AT+LVMGLDQSIEAE DR GLLLPG QQ+LV++V+ AS+GP ILVLMSGGPIDV FAKN
Sbjct: 486 ATVLVMGLDQSIEAETRDRTGLLLPGYQQDLVTRVAQASRGPVILVLMSGGPIDVTFAKN 545
Query: 540 DPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQS 599
DPR+AAIIWAGYPGQAGG AIA+I+FG +NPGGKLPMTWYPQ+Y+ +PMT MAMR S +
Sbjct: 546 DPRVAAIIWAGYPGQAGGAAIANIIFGAANPGGKLPMTWYPQDYVAKVPMTVMAMRASGN 605
Query: 600 KRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAP----TVVAVPLDGRHGSINATISG 655
YPGRTYRFYKGPVV+PFG G+SYT F H++A +P +V L+ + +N+ S
Sbjct: 606 --YPGRTYRFYKGPVVFPFGFGLSYTTFTHSLAKSPLAQLSVSLSNLNSANTILNS--SS 661
Query: 656 KAIKVTHAKCNRL-TLGVQVDVKNVGSKDGAHTLLVFSTPPAG---HWAPHKQLVAFEKV 711
+IKV+H CN + + V+V N G DG HT+ VF+ PP +KQL+AFEKV
Sbjct: 662 HSIKVSHTNCNSFPKMPLHVEVSNTGEFDGTHTVFVFAEPPINGIKGLGVNKQLIAFEKV 721
Query: 712 HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSV 758
HV AGA+Q V +++ CK+L VVD G RRIP+GEH +HIG KH++
Sbjct: 722 HVMAGAKQTVQVDVDACKHLGVVDEYGKRRIPMGEHKLHIGDLKHTI 768
>gi|297795695|ref|XP_002865732.1| beta-xylosidase 1 [Arabidopsis lyrata subsp. lyrata]
gi|297311567|gb|EFH41991.1| beta-xylosidase 1 [Arabidopsis lyrata subsp. lyrata]
Length = 774
Score = 1069 bits (2765), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 526/767 (68%), Positives = 620/767 (80%), Gaps = 17/767 (2%)
Query: 1 MASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
+ + + + L+ L SS L R FACDP + TRTL FC+V++PI RV DLIGRL
Sbjct: 10 IGNKVVVILVFLLCLVHSSESL--RPLFACDPANGLTRTLRFCRVNVPIHVRVQDLIGRL 67
Query: 61 SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
+LQEK++ L++ AAAVPRLGI GYEWWSEALHGVS+VGPG+KFGG FPGATSFPQVITTA
Sbjct: 68 TLQEKIRNLVNNAAAVPRLGIGGYEWWSEALHGVSDVGPGSKFGGAFPGATSFPQVITTA 127
Query: 121 SSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSG 180
+SFN +LWE IGRVVSDEARAMYNGG AGLTYWSPNVNI RDPRWGRGQETPGEDP+++
Sbjct: 128 ASFNQSLWEEIGRVVSDEARAMYNGGVAGLTYWSPNVNILRDPRWGRGQETPGEDPIVAA 187
Query: 181 KYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVP 239
KYAASYVRGLQG+ G+RLKVAA CKH+TAYDLDNWNGVDRFHFNAKV++QD+EDT++VP
Sbjct: 188 KYAASYVRGLQGTAAGNRLKVAACCKHYTAYDLDNWNGVDRFHFNAKVTQQDLEDTYNVP 247
Query: 240 FRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQ 299
F+ CV EGKVASVMCSYNQVNG PTCAD N+LK TIRG+WRLNGYIVSDCDSV V+++ Q
Sbjct: 248 FKSCVYEGKVASVMCSYNQVNGKPTCADENLLKNTIRGKWRLNGYIVSDCDSVDVFFNQQ 307
Query: 300 HFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGM 359
H+TSTPEEAAA +I+AGLDLDCGPFL + TE AV++GLL+E DIN AL NTLTVQMRLGM
Sbjct: 308 HYTSTPEEAAAASIKAGLDLDCGPFLAIFTEGAVKKGLLTENDINLALANTLTVQMRLGM 367
Query: 360 FDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGP 419
FDG PY +LGP+DVC+ H+ LALEAA QGIVLLKN G SLPLS RHRTVAVIGP
Sbjct: 368 FDGNLG--PYANLGPRDVCSLAHKHLALEAAHQGIVLLKNSGRSLPLSPRRHRTVAVIGP 425
Query: 420 NSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQAD 479
NSDVT TMIGNYAG AC YTTPLQGI RYART+HQ GC VAC +Q FGAA A+R+AD
Sbjct: 426 NSDVTETMIGNYAGKACAYTTPLQGISRYARTLHQAGCAGVACKGNQGFGAAEAAAREAD 485
Query: 480 ATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKN 539
AT+LVMGLDQSIEAE DR GLLLPG QQ+LV++V+ AS+GP ILVLMSGGPIDV FAKN
Sbjct: 486 ATVLVMGLDQSIEAETRDRTGLLLPGYQQDLVTRVAQASRGPVILVLMSGGPIDVTFAKN 545
Query: 540 DPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQS 599
DPR+AAIIWAGYPGQAGG AIA+I+FG +NPGGKLPMTWYPQ+Y+ +PMT MAMR S +
Sbjct: 546 DPRVAAIIWAGYPGQAGGAAIANIIFGAANPGGKLPMTWYPQDYVAKVPMTVMAMRASGN 605
Query: 600 KRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAP----TVVAVPLDGRHGSINATISG 655
YPGRTYRFYKGPVV+PFG G+SYT F +++A +P +V L+ + +N+T
Sbjct: 606 --YPGRTYRFYKGPVVFPFGFGLSYTTFTNSLAKSPLAQLSVSLSNLNSANAILNST--S 661
Query: 656 KAIKVTHAKCNRL-TLGVQVDVKNVGSKDGAHTLLVFSTPPAG---HWAPHKQLVAFEKV 711
+IKV+H CN + + V+V N G DG HT+ VF+ PP +KQL+AFEKV
Sbjct: 662 HSIKVSHTNCNSFPKMPLHVEVSNTGEFDGTHTVFVFAEPPKNGIKGLGVNKQLIAFEKV 721
Query: 712 HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSV 758
HV AGA+Q V +++ CK+L VVD G RRIP+G+H +HIG KH++
Sbjct: 722 HVMAGAKQTVRVDVDACKHLGVVDEYGKRRIPMGKHKLHIGDLKHTI 768
>gi|224128360|ref|XP_002320310.1| predicted protein [Populus trichocarpa]
gi|222861083|gb|EEE98625.1| predicted protein [Populus trichocarpa]
Length = 635
Score = 1052 bits (2720), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 504/639 (78%), Positives = 560/639 (87%), Gaps = 6/639 (0%)
Query: 133 RVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG 192
+VVSDEARAM+NGG AGLTYWSPNVNIFRDPRWGRGQETPGEDPV+ GKYAASYVRGLQG
Sbjct: 2 QVVSDEARAMFNGGVAGLTYWSPNVNIFRDPRWGRGQETPGEDPVVVGKYAASYVRGLQG 61
Query: 193 SDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASV 252
SDG+RLKVAA CKHFTAYDLDNWNGVDRFHFNA+VSKQD+EDTFDVPFRMCV EGKVASV
Sbjct: 62 SDGNRLKVAACCKHFTAYDLDNWNGVDRFHFNAEVSKQDMEDTFDVPFRMCVKEGKVASV 121
Query: 253 MCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADA 312
MCSYNQVNG+PTCADPN+LK+T+RG ++ +G Q P
Sbjct: 122 MCSYNQVNGIPTCADPNLLKKTVRGTLFQTVTLLEFI--MGSNTILQPRRKQPRMLLK-- 177
Query: 313 IRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHL 372
+A LDLDCGPFLG HTE AV++GLL+E +INNAL+NTLTVQMRLGMFDGEPSSQ YG+L
Sbjct: 178 -QASLDLDCGPFLGQHTEDAVKKGLLNEAEINNALLNTLTVQMRLGMFDGEPSSQLYGNL 236
Query: 373 GPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYA 432
GP DVCTP HQELALEAARQGIVLLKN GPSLPLS RH +VA++GPNS+VT TMIGNYA
Sbjct: 237 GPNDVCTPAHQELALEAARQGIVLLKNHGPSLPLSTRRHLSVAIVGPNSNVTATMIGNYA 296
Query: 433 GIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIE 492
G+ACGYTTPLQGI RYA+TIH+QGC DVAC DQ F AAIDA+RQADAT+LVMGLDQSIE
Sbjct: 297 GLACGYTTPLQGIQRYAQTIHRQGCADVACVSDQQFSAAIDAARQADATVLVMGLDQSIE 356
Query: 493 AEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYP 552
AE DR GLLLPGRQQELVSKV+ ASKGPTILVLMSGGPIDV+FA+NDP+I +I+WAGYP
Sbjct: 357 AEFRDRTGLLLPGRQQELVSKVAAASKGPTILVLMSGGPIDVSFAENDPKIGSIVWAGYP 416
Query: 553 GQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKG 612
GQAGG AI+D+LFG +NPGGKLPMTWYPQ+YITNLPMT MAMR S+SK YPGRTYRFYKG
Sbjct: 417 GQAGGAAISDVLFGITNPGGKLPMTWYPQDYITNLPMTNMAMRSSKSKGYPGRTYRFYKG 476
Query: 613 PVVYPFGHGMSYTNFVHTVANAPTVVAVPLDG-RHGSINATISGKAIKVTHAKCNRLTLG 671
VVYPFGHG+SYTNFVHT+A+APT+V+VPLDG RHGS NATISGKAI+VTHA+CNRL+LG
Sbjct: 477 KVVYPFGHGISYTNFVHTIASAPTMVSVPLDGHRHGSGNATISGKAIRVTHARCNRLSLG 536
Query: 672 VQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYL 731
+QVDVKN GS DG HTLLV+S PPA HWAPHKQLVAFEKVHV AG QQRVGINIHVCK L
Sbjct: 537 MQVDVKNTGSMDGTHTLLVYSRPPARHWAPHKQLVAFEKVHVAAGTQQRVGINIHVCKSL 596
Query: 732 SVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIKS 770
SVVD SG RRIP+GEH++HIG KHSVSL A+ LGV++S
Sbjct: 597 SVVDGSGIRRIPMGEHSLHIGDVKHSVSLQASILGVVES 635
>gi|371917280|dbj|BAL44716.1| SlArf/Xyl1 [Solanum lycopersicum]
Length = 771
Score = 1051 bits (2719), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 507/764 (66%), Positives = 600/764 (78%), Gaps = 14/764 (1%)
Query: 7 FFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKV 66
+F L + +L+ + SG +R+PFACDP +A R L FC+ SLPI RV DLI RL+LQEK+
Sbjct: 5 YFILIIFVLAFAYSG-ESRQPFACDPANAGIRNLRFCKTSLPIHVRVQDLIARLTLQEKI 63
Query: 67 KLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNAT 126
+LL++ AA V RLGI GYEWWSEALHGVSN G G KFGG FPGATSFPQVITTA+SFNA+
Sbjct: 64 RLLVNNAAPVQRLGISGYEWWSEALHGVSNTGYGVKFGGAFPGATSFPQVITTAASFNAS 123
Query: 127 LWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASY 186
LWE IGRVVS+E RAMYNGG AGLT+WSPNVNIFRDPRWGRGQETPGEDP L +Y SY
Sbjct: 124 LWEEIGRVVSEEGRAMYNGGAAGLTFWSPNVNIFRDPRWGRGQETPGEDPHLVAQYGVSY 183
Query: 187 VRGLQGSD---GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMC 243
V+GLQG RLKVAA CKH+TAYDLD+WNG DR+HFNAKVS QD+EDT++ PF+ C
Sbjct: 184 VKGLQGGGGRGNTRLKVAACCKHYTAYDLDDWNGYDRYHFNAKVSMQDLEDTYNAPFKAC 243
Query: 244 VMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTS 303
V+EG VASVMCSYNQ+NG P+CADP +L+ TIR +W LNGYIVSDCDSVGV ++ QH+T
Sbjct: 244 VVEGNVASVMCSYNQINGKPSCADPTLLRDTIRNQWHLNGYIVSDCDSVGVLFEKQHYTR 303
Query: 304 TPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGE 363
PE+AAA I+AGLDLDCGPFL +HT+ AV G +S+++INNAL NT+TVQMRLGMFDG
Sbjct: 304 YPEDAAAITIKAGLDLDCGPFLAIHTDKAVHTGKVSQVEINNALANTITVQMRLGMFDG- 362
Query: 364 PSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDV 423
+ PY +LGPKDVC+P HQ+LAL+AAR+GIVLLKN G +LPLS RHRTVAVIGPNSD
Sbjct: 363 -PNGPYANLGPKDVCSPAHQQLALQAAREGIVLLKNIGQALPLSTKRHRTVAVIGPNSDA 421
Query: 424 TVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATIL 483
T+ MIGNYAG+ CGY +PLQGI RYARTIHQQGC VAC +Q FG A A+R ADAT+L
Sbjct: 422 TLAMIGNYAGVPCGYISPLQGISRYARTIHQQGCMGVACPGNQNFGLAEVAARHADATVL 481
Query: 484 VMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRI 543
VMGLDQSIEAEA DR LLLPG QQ+L+S+V+MASKGP +LVLMSGGPIDV FAKNDPR+
Sbjct: 482 VMGLDQSIEAEAKDRVTLLLPGHQQDLISRVAMASKGPVVLVLMSGGPIDVTFAKNDPRV 541
Query: 544 AAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYP 603
++I+W GYPGQAGG AIAD+LFG +NPGGKLPMTWYPQ+Y+ + M M MR + SK YP
Sbjct: 542 SSIVWVGYPGQAGGAAIADVLFGATNPGGKLPMTWYPQDYVAKVSMANMDMRANPSKGYP 601
Query: 604 GRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHG--SINATISGKA---I 658
GRTYRFYKGP V+PFG G+SYT F + +AP V+VP H S N T KA +
Sbjct: 602 GRTYRFYKGPTVFPFGAGISYTTFSQHLVSAPITVSVPTLHSHDLVSNNTTTLMKAKATV 661
Query: 659 KVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQ 718
+ H C L + + +DVKN G DG H +L+FSTPP KQLVAFEKVHV AGA+
Sbjct: 662 RTIHTNCESLDIDMHIDVKNTGDMDGTHAVLIFSTPPDP--TETKQLVAFEKVHVVAGAK 719
Query: 719 QRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG-TKHSVSLH 761
QRV IN++ CK+LSV D G RRI +GEH IH+G KHS++
Sbjct: 720 QRVKINMNACKHLSVADEYGVRRIYMGEHKIHVGDHLKHSITFQ 763
>gi|357130854|ref|XP_003567059.1| PREDICTED: probable beta-D-xylosidase 2-like [Brachypodium
distachyon]
Length = 779
Score = 1004 bits (2597), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 489/756 (64%), Positives = 588/756 (77%), Gaps = 18/756 (2%)
Query: 25 REPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGY 84
R PFAC P +TR LPFC+ +LP R DL+ RL+ EKV+LL++ AA VPRLG++GY
Sbjct: 24 RPPFACAPGGPSTR-LPFCRQALPPRARARDLVARLTRAEKVRLLVNNAAGVPRLGVEGY 82
Query: 85 EWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN 144
EWWSEALHGVS+ GPG +FGG FPGAT+FPQVI TA+SFNA+LWE IGR VSDE RA+YN
Sbjct: 83 EWWSEALHGVSDTGPGVRFGGAFPGATAFPQVIGTAASFNASLWELIGRAVSDEGRAIYN 142
Query: 145 GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASC 204
G AGLT+WSPNVNIFRDPRWGRGQETPGEDP +SG+YAA+YVRGLQ RLK AA C
Sbjct: 143 GRQAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVSGRYAAAYVRGLQQQHAGRLKTAACC 202
Query: 205 KHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
KHFTAYDLD W+G DRFHFNA V+ QD+EDTF+ PFR CV+EG+ A+VMCSYNQVNGVPT
Sbjct: 203 KHFTAYDLDRWSGADRFHFNAIVTPQDLEDTFNAPFRACVVEGRAAAVMCSYNQVNGVPT 262
Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPF 324
CAD L+ TIRG+W+L+GYIVSDCDSV V+Y QH+T T E+A A +RAGLDLDCGPF
Sbjct: 263 CADQGFLRGTIRGKWKLDGYIVSDCDSVDVFYREQHYTRTREDAVAATLRAGLDLDCGPF 322
Query: 325 LGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQE 384
L +TE+AV +G + E DI+ A+VNT+TVQMRLGMFDG+ ++QP+GHLGP+ VCTP H+E
Sbjct: 323 LAQYTEAAVAQGKVKEADIDAAVVNTVTVQMRLGMFDGDVAAQPFGHLGPQHVCTPAHRE 382
Query: 385 LALEAARQGIVLLKNQGPS---LPL-SHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTT 440
LALEAA Q IVLLKN G + LPL SH R TVAV+GP+S+ TV MIGNYAG C YTT
Sbjct: 383 LALEAACQSIVLLKNGGGNNMRLPLSSHHRRGTVAVVGPHSEATVAMIGNYAGKPCAYTT 442
Query: 441 PLQGIGRYAR-TIHQQGCKDVAC-ADDQLFGAAIDASRQADATILVMGLDQSIEAEALDR 498
PLQG+GRYAR T+HQ GC DVAC Q AA+DA+R ADAT++V+GLDQS+EAE LDR
Sbjct: 443 PLQGVGRYARATVHQAGCTDVACQGSGQPIDAAVDAARHADATVVVVGLDQSVEAEGLDR 502
Query: 499 AGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGT 558
LLLPGRQ ELVS V+ ASKGP ILVLMSGGP+D+AFA+ND +AAI+WAGYPGQAGG
Sbjct: 503 TTLLLPGRQAELVSAVARASKGPVILVLMSGGPVDIAFAQNDRNVAAILWAGYPGQAGGQ 562
Query: 559 AIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPF 618
AIAD++FG NPGGKLP+TWYP++Y+ PMT MAMR ++ YPGRTYRFY GP ++PF
Sbjct: 563 AIADVIFGHHNPGGKLPVTWYPEDYLRKAPMTNMAMRADPARGYPGRTYRFYAGPTIHPF 622
Query: 619 GHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKA-----IKVTHAKCNRLTLGVQ 673
GHG+SYT F HT+A+AP + V H + A + A ++V HA+C L++ V
Sbjct: 623 GHGLSYTKFAHTLAHAPAHLTVRRAAGHRTTAAINTTTASHLNDVRVAHAQCEGLSVSVH 682
Query: 674 VDVKNVGSKDGAHTLLVFSTPP--AGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYL 731
VDVKNVGS+DGAHT+ V+++PP A H AP +QLVAFEKVHV AGA RV + + VC L
Sbjct: 683 VDVKNVGSRDGAHTVFVYASPPIAAIHGAPVRQLVAFEKVHVAAGAVARVKMGVDVCGSL 742
Query: 732 SVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGV 767
S+ D+ G RRIP+GEH + IG HA LGV
Sbjct: 743 SIADQEGVRRIPIGEHRLMIG----EELTHAVMLGV 774
>gi|318136853|gb|ADV41671.1| alpha-L-arabinofuranosidase/beta-D-xylosidase [Actinidia deliciosa
var. deliciosa]
Length = 634
Score = 995 bits (2573), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 475/634 (74%), Positives = 548/634 (86%), Gaps = 2/634 (0%)
Query: 137 DEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD 196
DEARAMYNGG AGLT+WSPNVNIFRDPRWGRGQETPGEDP+L+G YAASYVRGLQG+DG+
Sbjct: 1 DEARAMYNGGMAGLTFWSPNVNIFRDPRWGRGQETPGEDPMLAGNYAASYVRGLQGNDGE 60
Query: 197 RLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSY 256
RLKVAA CKH+TAYDLDNW GVDRFHFNA+VSKQDI+DTF++PFR CV+ GKVASVMCSY
Sbjct: 61 RLKVAACCKHYTAYDLDNWRGVDRFHFNARVSKQDIKDTFEIPFRECVLGGKVASVMCSY 120
Query: 257 NQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAG 316
NQVNG+PTCA+P +LK TIRG WRLNGYIVSDCDSVGV+++ QH+TS PEEA A AI+AG
Sbjct: 121 NQVNGIPTCANPKLLKGTIRGSWRLNGYIVSDCDSVGVFFENQHYTSKPEEAVAAAIKAG 180
Query: 317 LDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKD 376
LDLDCGPFL +HTE+AV+RGL+S+++IN AL NT+T QMRLGMFDGEPS+ YG+LGP+D
Sbjct: 181 LDLDCGPFLAIHTEAAVRRGLVSQLEINWALANTMTAQMRLGMFDGEPSAHQYGNLGPRD 240
Query: 377 VCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIAC 436
VCTP HQ+LALEAARQGIVLL+N+G SLPLS RHRTVAVIGPNSDVTVTMIGNYAG+AC
Sbjct: 241 VCTPAHQQLALEAARQGIVLLENRGRSLPLSIRRHRTVAVIGPNSDVTVTMIGNYAGVAC 300
Query: 437 GYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL 496
GYTTPLQGIGRY RTIHQ GC DV C +QLFGAA A+RQADAT+LVMGLDQSIEAE +
Sbjct: 301 GYTTPLQGIGRYTRTIHQAGCTDVHCNGNQLFGAAEAAARQADATVLVMGLDQSIEAEFV 360
Query: 497 DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAG 556
DRAG LLPG QQELVS+V+ AS+GPTILVLMSGGPIDV FAKNDPRI+AIIW GYPGQAG
Sbjct: 361 DRAGPLLPGHQQELVSRVARASRGPTILVLMSGGPIDVTFAKNDPRISAIIWVGYPGQAG 420
Query: 557 GTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVY 616
GTAIAD+LFGT+NPGGKLPMTWYPQ Y+T+LPMT+MAMR ++ YPGRTYRFY+GPVV+
Sbjct: 421 GTAIADVLFGTTNPGGKLPMTWYPQNYVTHLPMTDMAMRADPARGYPGRTYRFYRGPVVF 480
Query: 617 PFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT-LGVQVD 675
PFG G+SYT F H +A+ PT+V+VPL + N+T+ KA++V+HA CN L+ L V VD
Sbjct: 481 PFGLGLSYTTFAHNLAHGPTLVSVPLTSLKATANSTMLSKAVRVSHADCNALSPLDVHVD 540
Query: 676 VKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD 735
VKN GS DG HTLLVF++PP G WA KQLV F K+H+ AG++ RV I +HVCK+LSVVD
Sbjct: 541 VKNTGSMDGTHTLLVFTSPPDGKWAASKQLVGFHKIHIAAGSETRVRIAVHVCKHLSVVD 600
Query: 736 RSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIK 769
R G RRIPLGEH + IG H VSL + G IK
Sbjct: 601 RFGIRRIPLGEHKLQIGDLSHHVSLQTNS-GEIK 633
>gi|226531269|ref|NP_001145980.1| uncharacterized protein LOC100279508 precursor [Zea mays]
gi|219885199|gb|ACL52974.1| unknown [Zea mays]
gi|413920228|gb|AFW60160.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 794
Score = 979 bits (2532), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 485/769 (63%), Positives = 581/769 (75%), Gaps = 30/769 (3%)
Query: 28 FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
FAC P +LPFC+ SLP+ R DL+ RL+ EKV+LL++ AA VPRLG+ GYEWW
Sbjct: 27 FACAPGGPAA-SLPFCRQSLPLRARARDLVSRLTRAEKVRLLVNNAAGVPRLGVAGYEWW 85
Query: 88 SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT 147
SEALHGVS+ GPG +FGG FPGAT+FPQVI TA+S NATLWE +GR VSDEARAMYNGG
Sbjct: 86 SEALHGVSDTGPGVRFGGAFPGATAFPQVIGTAASLNATLWELVGRAVSDEARAMYNGGR 145
Query: 148 AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQ-------GSDGDRLKV 200
AGLT+WSPNVNIFRDPRWGRGQETPGEDP +S +YAA+YVRGLQ G +RLK+
Sbjct: 146 AGLTFWSPNVNIFRDPRWGRGQETPGEDPAVSARYAAAYVRGLQQPYAAPNGGHRNRLKL 205
Query: 201 AASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVN 260
AA CKHFTAYDLD W G DRFHFNA V+ QD+EDTF+VPFR CV +G+ ASVMCSYNQVN
Sbjct: 206 AACCKHFTAYDLDKWGGTDRFHFNAVVAAQDLEDTFNVPFRACVEDGRAASVMCSYNQVN 265
Query: 261 GVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLD 320
GVPTCAD L+ TIRG W L+GYIVSDCDSV V++ QH+T TPE+AAA +RAGLDLD
Sbjct: 266 GVPTCADAAFLRGTIRGRWGLDGYIVSDCDSVDVFFRDQHYTRTPEDAAAATLRAGLDLD 325
Query: 321 CGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTP 380
CGPFL L+ SAV G +++ D++ AL+NT+TVQMRLGMFDG+P++ P+G LGP DVCT
Sbjct: 326 CGPFLALYAGSAVAAGKVADADVDAALLNTVTVQMRLGMFDGDPAAGPFGRLGPADVCTR 385
Query: 381 DHQELALEAARQGIVLLKNQGPS------LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGI 434
+HQ+LAL+AARQG+VLLKN+ + LPL HR VAV+GP++D TV MIGNYAG
Sbjct: 386 EHQDLALDAARQGVVLLKNRRGARHNRDVLPLRPAAHRVVAVVGPHADATVAMIGNYAGK 445
Query: 435 ACGYTTPLQGIGRY-ARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEA 493
C YTTPLQG+ Y AR HQ GC DVAC +Q AA++A+RQADAT++V GLDQ +EA
Sbjct: 446 PCRYTTPLQGVAAYAARVAHQAGCTDVACRGNQPIAAAVEAARQADATVVVAGLDQRVEA 505
Query: 494 EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPG 553
E LDR LLLPGRQ EL+S V+ ASKGP ILVLMSGGPID+AFA+NDPRI I+W GYPG
Sbjct: 506 EGLDRTTLLLPGRQAELISAVAKASKGPVILVLMSGGPIDIAFAQNDPRIDGILWVGYPG 565
Query: 554 QAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGP 613
QAGG AIAD++FG NPG KLP+TWY Q+Y+ +PMT MAMR + ++ YPGRTYRFY GP
Sbjct: 566 QAGGQAIADVIFGHHNPGAKLPVTWYHQDYLQKVPMTNMAMRANPARGYPGRTYRFYTGP 625
Query: 614 VVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG---------KAIKVTHAK 664
+YPFGHG+SYT F HT+A+APT + V L G S + S +A++V HA+
Sbjct: 626 TIYPFGHGLSYTQFTHTLAHAPTQLTVRLSGSGHSAASAASLLNATLARPVRAVRVAHAR 685
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA------GHWAPHKQLVAFEKVHVPAGAQ 718
C LT+ V VDV NVG +DGAH +LV+ P+ G AP +QLVAFEKVHVPAG
Sbjct: 686 CEGLTVPVHVDVSNVGDRDGAHAVLVYHAAPSPSHAAPGADAPARQLVAFEKVHVPAGGV 745
Query: 719 QRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGV 767
RV + I VC LSV DR+G RR+P+GEH + IG HSVSL LGV
Sbjct: 746 ARVEMRIGVCDRLSVADRNGVRRVPVGEHRLMIGELTHSVSLGVEQLGV 794
>gi|242071935|ref|XP_002451244.1| hypothetical protein SORBIDRAFT_05g026400 [Sorghum bicolor]
gi|241937087|gb|EES10232.1| hypothetical protein SORBIDRAFT_05g026400 [Sorghum bicolor]
Length = 790
Score = 944 bits (2440), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 470/756 (62%), Positives = 560/756 (74%), Gaps = 38/756 (5%)
Query: 39 TLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVG 98
TLPFC+ SLP+ R DL+ RL+ EKV+LL++ AA V RLG+ GYEWWSEALHGVS+ G
Sbjct: 44 TLPFCRQSLPLHARARDLVSRLTRAEKVRLLVNNAAGVARLGVGGYEWWSEALHGVSDTG 103
Query: 99 PGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVN 158
PG KFGG FPGAT+FPQVI A++ NATLWE IGR VSDEARAMYNGG AGLT+WSPNVN
Sbjct: 104 PGVKFGGAFPGATAFPQVIGAAAALNATLWELIGRAVSDEARAMYNGGRAGLTFWSPNVN 163
Query: 159 IFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNG 217
IFRDPRWGRGQETPGEDP +S +YAA+YVRGLQ D +RLK+AA CKHFTAYDLD+W G
Sbjct: 164 IFRDPRWGRGQETPGEDPAISSRYAAAYVRGLQQPYDHNRLKLAACCKHFTAYDLDSWGG 223
Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
DRFHFNA VS QD+EDTF+VPFR CV G+ ASVMCSYNQVNGVPTCAD L+ TIR
Sbjct: 224 TDRFHFNAVVSPQDLEDTFNVPFRACVAGGRAASVMCSYNQVNGVPTCADQGFLRGTIRK 283
Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGL 337
W L+GYIVSDCDSV V++ QH+T T E+A A +RAGLDLDCGPFL L+TE+AV R
Sbjct: 284 AWGLDGYIVSDCDSVDVFFRDQHYTRTAEDAVAATLRAGLDLDCGPFLALYTENAVARKK 343
Query: 338 LSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLL 397
+S+ D++ AL+NT+TVQMRLGMFDG+P+S P+GHLG DVCT HQ+LAL+AARQ +VLL
Sbjct: 344 VSDADVDAALLNTVTVQMRLGMFDGDPASGPFGHLGAADVCTKAHQDLALDAARQSVVLL 403
Query: 398 KNQGPS-------LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY-A 449
KNQ LPL HR VAV+GP++D TV MIGNYAG C YTTPLQG+ Y A
Sbjct: 404 KNQRGRKHRDRDVLPLRPAAHRVVAVVGPHADATVAMIGNYAGKPCRYTTPLQGVAAYAA 463
Query: 450 RTIHQQGCKDVAC-ADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQ 508
R +HQ GC DVAC +Q AA+DA+R+ GL +S LLLPGRQ
Sbjct: 464 RVVHQAGCADVACQGKNQPIAAAVDAARRLTPPSSSPGLTRS----------LLLPGRQA 513
Query: 509 ELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTS 568
EL+S V+ A+KGP ILVLMSGGPID+AFA+NDPRI I+W GYPGQAGG AIAD++FG
Sbjct: 514 ELISAVAKAAKGPVILVLMSGGPIDIAFAQNDPRIDGILWVGYPGQAGGQAIADVIFGQH 573
Query: 569 NPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFV 628
NPGGKLP+TWYPQ+Y+ +PMT MAMR + ++ YPGRTYRFY GP ++ FGHG+SYT F
Sbjct: 574 NPGGKLPVTWYPQDYLEKVPMTNMAMRANPARGYPGRTYRFYTGPTIHAFGHGLSYTQFT 633
Query: 629 HTVANAPTVVAVPLDGRHGS----------INATISGKAIKVTHAKCNRLTLGVQVDVKN 678
HT+A+AP + V L S +NAT +A++V HA+C LT+ V VDV+N
Sbjct: 634 HTLAHAPAQLTVRLSTSSASASASASAASLLNATRPSRAVRVAHARCEGLTVPVHVDVRN 693
Query: 679 VGSKDGAHTLLVF--------STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKY 730
VG +DGAH +LV+ S+ PAG AP +QLVAFEKVHVPAG RV + I VC
Sbjct: 694 VGDRDGAHAVLVYHVAPSSSSSSAPAGTDAPARQLVAFEKVHVPAGGVARVEMGIDVCDR 753
Query: 731 LSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLG 766
LSV DR G RRIP+GEH + IG HSV+L LG
Sbjct: 754 LSVADRDGVRRIPVGEHRLMIGELTHSVTLGVEQLG 789
>gi|115486595|ref|NP_001068441.1| Os11g0673200 [Oryza sativa Japonica Group]
gi|113645663|dbj|BAF28804.1| Os11g0673200 [Oryza sativa Japonica Group]
Length = 822
Score = 920 bits (2377), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 485/783 (61%), Positives = 580/783 (74%), Gaps = 54/783 (6%)
Query: 39 TLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVG 98
TLPFC+ SLP R DL+ RL+ EKV+LL++ AA VPRLG+ GYEWWSEALHGVS+ G
Sbjct: 40 TLPFCRRSLPARARARDLVARLTRAEKVRLLVNNAAGVPRLGVAGYEWWSEALHGVSDTG 99
Query: 99 PGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRV------------------------ 134
PG +FGG FPGAT+FPQVI TA+SFNATLWE IG+V
Sbjct: 100 PGVRFGGAFPGATAFPQVIGTAASFNATLWELIGQVMPILKGGHARCNQRPSCIRISVFM 159
Query: 135 --------VSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASY 186
VSDE RAMYNGG AGLT+WSPNVNIFRDPRWGRGQETPGEDP ++ +YAA+Y
Sbjct: 160 YVYVCAQAVSDEGRAMYNGGQAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVAARYAAAY 219
Query: 187 VRGLQGSD--GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCV 244
VRGLQ RLK+AA CKHFTAYDLDNW+G DRFHFNA V++QD+EDTF+VPFR CV
Sbjct: 220 VRGLQQQQPSSGRLKLAACCKHFTAYDLDNWSGTDRFHFNAVVTRQDLEDTFNVPFRSCV 279
Query: 245 MEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTST 304
++G+ ASVMCSYNQVNGVPTCAD L+ TIR W L GYIVSDCDSV V+Y QH+T T
Sbjct: 280 VDGRAASVMCSYNQVNGVPTCADAAFLRGTIRRRWGLAGYIVSDCDSVDVFYSDQHYTRT 339
Query: 305 PEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP 364
E+A A +RAGLDLDCGPFL +TE AV +G + + DI+ A+ NT+TVQMRLGMFDG+P
Sbjct: 340 REDAVAATLRAGLDLDCGPFLAQYTEGAVAQGKVGDGDIDAAVTNTVTVQMRLGMFDGDP 399
Query: 365 SSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS-HIRHRTVAVIGPNSDV 423
++QP+GHLGP+ VCT HQELA+EAARQGIVLLKN G +LPLS R VAV+GP+++
Sbjct: 400 AAQPFGHLGPQHVCTAAHQELAVEAARQGIVLLKNDGRALPLSPATARRAVAVVGPHAEA 459
Query: 424 TVTMIGNYAGIACGYTTPLQGIGRY-ARTIHQQGCKDVACA-DDQLFGAAIDASRQADAT 481
TV MIGNYAG C YTTPLQG+ RY AR HQ GC DVACA Q AA+DA+R+ADAT
Sbjct: 460 TVAMIGNYAGKPCRYTTPLQGVARYAARAAHQPGCTDVACAGSGQPIAAAVDAARRADAT 519
Query: 482 ILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDP 541
I+V GLDQ IEAE LDRA LLLPGRQ EL+S V+ ASKGP ILVLMSGGPID+ FA+NDP
Sbjct: 520 IVVAGLDQKIEAEGLDRASLLLPGRQAELISSVAKASKGPVILVLMSGGPIDIGFAQNDP 579
Query: 542 RIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKR 601
+IA I+WAGYPGQAGG AIAD++FG NPGGKLP+TWYPQ+Y+ +PMT MAMR + +K
Sbjct: 580 KIAGILWAGYPGQAGGQAIADVIFGHHNPGGKLPVTWYPQDYLQKVPMTNMAMRANPAKG 639
Query: 602 YPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPL------DGRHGSINATI-- 653
YPGRTYRFY GP ++PFGHG+SYT+F H++A+AP+ + V L S+NAT
Sbjct: 640 YPGRTYRFYTGPTIHPFGHGLSYTSFTHSIAHAPSQLTVRLSAHHAAASASASLNATARL 699
Query: 654 -SGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPH--------KQ 704
A++V HA+C L + V VDV+NVG +DGAHT+LV++ PA A +Q
Sbjct: 700 SRAAAVRVAHARCEELRMPVHVDVRNVGERDGAHTVLVYAAAPASSAAEAAAGHGAPVRQ 759
Query: 705 LVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAAT 764
LVAFEKVHV AG RV + I VC LSV DR+G RRIP+GEH + IG H+V++
Sbjct: 760 LVAFEKVHVGAGGTARVEMGIDVCDGLSVADRNGVRRIPVGEHRLIIGELTHTVTIALEQ 819
Query: 765 LGV 767
LGV
Sbjct: 820 LGV 822
>gi|449438167|ref|XP_004136861.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
[Cucumis sativus]
Length = 782
Score = 915 bits (2365), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 447/737 (60%), Positives = 545/737 (73%), Gaps = 7/737 (0%)
Query: 28 FACDPK-DATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEW 86
FACD + + + FC SL RV DL+ RL+LQEK+ LI+ A V RLGI YEW
Sbjct: 47 FACDAETNPSVSGFAFCDSSLGFEARVEDLVKRLTLQEKIGFLINNARNVTRLGIPKYEW 106
Query: 87 WSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGG 146
WSEALHGVS VGPGTKF PGATSFPQVI TA+SFNA+L+EAIG+VVS EARAMYN G
Sbjct: 107 WSEALHGVSYVGPGTKFSNVVPGATSFPQVILTAASFNASLFEAIGKVVSTEARAMYNVG 166
Query: 147 TAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDG---DRLKVAAS 203
AGLTYWSPNVNIFRDPRWGRGQETPGEDP+LS KYAA YVRGLQ D DRLKVAA
Sbjct: 167 LAGLTYWSPNVNIFRDPRWGRGQETPGEDPLLSSKYAAGYVRGLQQRDDGDPDRLKVAAC 226
Query: 204 CKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVP 263
CKH+TAYDLDNW G DR+HFNA VS QD+EDTF PF+ CV++G VASVMCSYNQVNG P
Sbjct: 227 CKHYTAYDLDNWKGTDRYHFNAVVSPQDLEDTFQPPFKSCVIDGNVASVMCSYNQVNGKP 286
Query: 264 TCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGP 323
TCADP++L IRG+W+LNGYIVSDCDSV V Y++QH+T +PEEAAA I AGLDLDCG
Sbjct: 287 TCADPDLLAGVIRGQWKLNGYIVSDCDSVDVLYNSQHYTKSPEEAAAKTILAGLDLDCGD 346
Query: 324 FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQ 383
FLG HTE+AV GL++E I+ A+ N L MRLG FDG PS Q YG LGPKDVCTP+HQ
Sbjct: 347 FLGKHTEAAVTGGLVNEAAISKAVFNNLLTLMRLGFFDGNPSKQLYGKLGPKDVCTPEHQ 406
Query: 384 ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQ 443
ELA EAARQGIVLLKN SLPLS +++AVIGPN++VT TMIGNY G C YTTPLQ
Sbjct: 407 ELAREAARQGIVLLKNSPKSLPLSSSAIKSLAVIGPNANVTKTMIGNYEGTPCKYTTPLQ 466
Query: 444 GIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLL 503
G+ T Q GC +VAC QL A A+ ADAT+LV+G DQSIEAE+ DR L L
Sbjct: 467 GLSAVVSTSFQPGCANVACTSAQLDEAKKIAA-SADATVLVVGSDQSIEAESRDRVDLNL 525
Query: 504 PGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADI 563
PG+Q L+++V+ ASKGP ILV+M+GG +D+ FAK D +I +I+W G+PG+AGG AIAD+
Sbjct: 526 PGQQALLITEVAKASKGPVILVIMTGGGMDITFAKKDDKITSILWVGFPGEAGGAAIADV 585
Query: 564 LFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMS 623
+FG+ NP G+LPMTWYPQ Y+ +PMT+M MRPS S +PGRTYRFY G +Y FG G+S
Sbjct: 586 IFGSFNPSGRLPMTWYPQSYVEKVPMTDMRMRPSASNGFPGRTYRFYTGETIYSFGDGLS 645
Query: 624 YTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKD 683
Y++F H + AP +V++PL+ G I + +++V C L V + VKNVG +
Sbjct: 646 YSDFKHHLVKAPKLVSIPLE--EGHICHSSKCHSLEVVQESCQNLGFDVHLRVKNVGQRS 703
Query: 684 GAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIP 743
G+HT+ ++STPP+ H +P K L+ FEKV + G + V + VCK LSV D G+R++
Sbjct: 704 GSHTVFLYSTPPSVHNSPQKHLLGFEKVSLGRGGETVVRFKVDVCKDLSVADEVGSRKVA 763
Query: 744 LGEHNIHIGGTKHSVSL 760
LG H +H+G KHS+++
Sbjct: 764 LGLHILHVGTLKHSLNV 780
>gi|255545293|ref|XP_002513707.1| Beta-glucosidase, putative [Ricinus communis]
gi|223547158|gb|EEF48654.1| Beta-glucosidase, putative [Ricinus communis]
Length = 777
Score = 915 bits (2364), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 447/755 (59%), Positives = 550/755 (72%), Gaps = 11/755 (1%)
Query: 14 LLSASSSGLAAREP-FACDPK-DATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLIS 71
+LS+ + LA P FACD K + + + FC VSL I RV DL+ RL+LQEK+ L++
Sbjct: 26 VLSSRNQVLAQSSPVFACDVKSNPSLASFGFCNVSLGISDRVTDLVNRLTLQEKIGFLVN 85
Query: 72 GAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAI 131
A +V RLGI YEWWSEALHGVS VGPGT F PGATSFPQVI TA+SFNA+L+EAI
Sbjct: 86 SAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNIVPGATSFPQVILTAASFNASLFEAI 145
Query: 132 GRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQ 191
G+VVS EARAMYN G AGLT+WSPN+NIFRDPRWGRGQETPGEDP+LS KY + YVRGLQ
Sbjct: 146 GKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLSSKYGSCYVRGLQ 205
Query: 192 GSDG---DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGK 248
+D +RLKVAA CKH+TAYDLDNW G DR+HFNA V+KQD++DTF PF+ CV++G
Sbjct: 206 QTDNGDSERLKVAACCKHYTAYDLDNWKGTDRYHFNAVVTKQDLDDTFQPPFKSCVIDGN 265
Query: 249 VASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEA 308
VASVMCSYNQVNG PTCADP++L IRGEW+LNGYIVSDCDSV V Y++QH+T TPEEA
Sbjct: 266 VASVMCSYNQVNGKPTCADPDLLAGIIRGEWKLNGYIVSDCDSVDVIYNSQHYTKTPEEA 325
Query: 309 AADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQP 368
AA I AGLDL+CG FLG HTE+AV GLL+ ++ A+ N MRLG FDG+PS Q
Sbjct: 326 AAITILAGLDLNCGSFLGKHTEAAVNAGLLNVSAVDKAVSNNFATLMRLGFFDGDPSKQL 385
Query: 369 YGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMI 428
YG LGPKDVCT +QELA EAARQGIVLLKN SLPLS +T+AVIGPN++VT TMI
Sbjct: 386 YGKLGPKDVCTAVNQELAREAARQGIVLLKNSPGSLPLSPTAIKTLAVIGPNANVTKTMI 445
Query: 429 GNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLD 488
GNY G C YTTPLQG+ T + GC +VACA Q+ A A+ ADAT+LVMG D
Sbjct: 446 GNYEGTPCKYTTPLQGLTASVATTYLAGCSNVACAAAQVDDAKKLAA-SADATVLVMGAD 504
Query: 489 QSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIW 548
QSIEAE+ DR +LLPG+QQ L+++V+ SKGP ILV+MSGG +DV+FAK + +I +I+W
Sbjct: 505 QSIEAESRDRVDVLLPGQQQLLITQVANVSKGPVILVIMSGGGMDVSFAKTNDKITSILW 564
Query: 549 AGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYR 608
GYPG+AGG AIAD++FG NP G+LPMTWYPQ Y+ +PMT M MRP S YPGRTYR
Sbjct: 565 VGYPGEAGGAAIADVIFGYYNPSGRLPMTWYPQAYVDKVPMTNMNMRPDPSSGYPGRTYR 624
Query: 609 FYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVT--HAKCN 666
FY G VY FG G+SY+ + H + AP +V++PL+ H + S K I V C
Sbjct: 625 FYTGETVYSFGDGLSYSEYKHQLVQAPQLVSIPLEDDHVCRS---SSKCISVDAGEQNCQ 681
Query: 667 RLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIH 726
L + + V+N+G G HT+ +F TPP+ H +P K LV FEKV + A V +
Sbjct: 682 GLAFNIDLKVRNIGKVRGTHTVFLFFTPPSVHNSPQKHLVDFEKVSLDAKTYGMVSFKVD 741
Query: 727 VCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
VCK+LSVVD G+R++ LG H +H+G +HS+++
Sbjct: 742 VCKHLSVVDEFGSRKVALGGHVLHVGNLEHSLTVR 776
>gi|449479116|ref|XP_004155509.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
[Cucumis sativus]
Length = 809
Score = 913 bits (2359), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 446/737 (60%), Positives = 544/737 (73%), Gaps = 7/737 (0%)
Query: 28 FACDPK-DATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEW 86
FACD + + + FC SL RV DL+ RL+LQEK+ LI+ A V RLGI YEW
Sbjct: 74 FACDAETNPSVSGFAFCDSSLGFEARVEDLVKRLTLQEKIGFLINNARNVTRLGIPKYEW 133
Query: 87 WSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGG 146
WSEALHGVS VGPGTKF PGATSFPQVI TA+SFNA+L+EAIG+VVS EARAMYN G
Sbjct: 134 WSEALHGVSYVGPGTKFSNVVPGATSFPQVILTAASFNASLFEAIGKVVSTEARAMYNVG 193
Query: 147 TAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDG---DRLKVAAS 203
AGLTYWSPNVNIFRDPRWGRGQETPGEDP+LS KYAA YVRGLQ D DRLKVAA
Sbjct: 194 LAGLTYWSPNVNIFRDPRWGRGQETPGEDPLLSSKYAAGYVRGLQQRDDGDPDRLKVAAC 253
Query: 204 CKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVP 263
CKH+TAYDLDNW G DR+HFNA VS QD+EDTF PF+ CV++G VASVMCSYNQVNG P
Sbjct: 254 CKHYTAYDLDNWKGTDRYHFNAVVSPQDLEDTFQPPFKSCVIDGNVASVMCSYNQVNGKP 313
Query: 264 TCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGP 323
TCADP++L IRG+W+LNGYIVSDCDSV V Y++QH+T +PEEAAA I AGLDLDCG
Sbjct: 314 TCADPDLLAGVIRGQWKLNGYIVSDCDSVDVLYNSQHYTKSPEEAAAKTILAGLDLDCGD 373
Query: 324 FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQ 383
FLG HTE+AV GL++E I+ A+ N L MRLG FDG PS Q YG LGPKDVCTP+HQ
Sbjct: 374 FLGKHTEAAVTGGLVNEAAISKAVFNNLLTLMRLGFFDGNPSKQLYGKLGPKDVCTPEHQ 433
Query: 384 ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQ 443
ELA EAARQGIVLLKN SLPLS +++AVIGPN++VT TMIGNY G C YTTPLQ
Sbjct: 434 ELAREAARQGIVLLKNSPKSLPLSSSAIKSLAVIGPNANVTKTMIGNYEGTPCKYTTPLQ 493
Query: 444 GIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLL 503
G+ T Q GC +VAC QL A + ADAT+LV+G DQSIEAE+ DR L L
Sbjct: 494 GLSAVVSTSFQPGCANVACTSAQL-DEAKKIAASADATVLVVGSDQSIEAESRDRVDLNL 552
Query: 504 PGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADI 563
PG+Q L+++V+ ASKGP ILV+M+GG +D+ FAK D +I +I+W G+PG+AGG AIAD+
Sbjct: 553 PGQQALLITEVAKASKGPVILVIMTGGGMDITFAKKDDKITSILWVGFPGEAGGAAIADV 612
Query: 564 LFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMS 623
+FG+ NP G+LPMTWYPQ Y+ +PMT+M MRPS S +PGRTYRFY G +Y FG G+S
Sbjct: 613 IFGSFNPSGRLPMTWYPQSYVEKVPMTDMRMRPSASNGFPGRTYRFYTGETIYSFGDGLS 672
Query: 624 YTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKD 683
Y++F H + AP +V++PL+ G I + +++V C L V + VKNVG +
Sbjct: 673 YSDFKHHLVKAPKLVSIPLE--EGHICHSSKCHSLEVVQESCQNLGFDVHLRVKNVGQRS 730
Query: 684 GAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIP 743
G+HT+ ++STPP+ H +P K L+ FEKV + G + V + VCK LSV D G+R++
Sbjct: 731 GSHTVFLYSTPPSVHNSPQKHLLGFEKVSLGRGGETVVRFKVDVCKDLSVADEVGSRKVA 790
Query: 744 LGEHNIHIGGTKHSVSL 760
LG H +H+G KHS+++
Sbjct: 791 LGLHILHVGTLKHSLNV 807
>gi|359485890|ref|XP_002264183.2| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
[Vitis vinifera]
Length = 774
Score = 908 bits (2347), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 445/764 (58%), Positives = 548/764 (71%), Gaps = 10/764 (1%)
Query: 4 TIAFFFLGLI--LLSASSSGLAAREP-FACD-PKDATTRTLPFCQVSLPIPQRVNDLIGR 59
T+ FL LS+ LA P FACD + T FC SL RV DL+ R
Sbjct: 12 TVFLCFLSCFSHFLSSPKWVLAQSSPVFACDVENNPTLGQFGFCNTSLETAARVADLVKR 71
Query: 60 LSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITT 119
L+L+EK+ L++ AA+V RLGI YEWWSEALHGVS VGPGT F PGATSFPQVI T
Sbjct: 72 LTLEEKIGFLVNSAASVSRLGIPKYEWWSEALHGVSYVGPGTHFNSVVPGATSFPQVILT 131
Query: 120 ASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLS 179
A+SFNA+L+EAIG+ VS EARAMYN G AGLT+WSPNVNIFRDPRWGRGQETPGEDP+LS
Sbjct: 132 AASFNASLFEAIGKAVSTEARAMYNVGLAGLTFWSPNVNIFRDPRWGRGQETPGEDPLLS 191
Query: 180 GKYAASYVRGLQGSDG---DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTF 236
KYA+ YVRGLQ SD DRLKVAA CKH+TAYDLDNW GVDRFHFNA V+KQD++DTF
Sbjct: 192 SKYASGYVRGLQQSDDGSPDRLKVAACCKHYTAYDLDNWKGVDRFHFNAVVTKQDMDDTF 251
Query: 237 DVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYY 296
PF+ CV++G VASVMCSYNQVNG P CADP++L +RGEW+LNGYIVSDCDSV V+Y
Sbjct: 252 QPPFKSCVIDGNVASVMCSYNQVNGKPACADPDLLSGIVRGEWKLNGYIVSDCDSVDVFY 311
Query: 297 DTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMR 356
++QH+T TPEEAAA AI AGLDL+CG FLG HTE+AV+ GL+ E ++ A+ N MR
Sbjct: 312 NSQHYTKTPEEAAAKAILAGLDLNCGSFLGQHTEAAVKGGLVDESAVDKAVSNNFATLMR 371
Query: 357 LGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAV 416
LG FDG PS YG LGPKDVCT +HQELA EAARQGIVLLKN SLPLS +T+AV
Sbjct: 372 LGFFDGNPSKAIYGKLGPKDVCTSEHQELAREAARQGIVLLKNSKGSLPLSPTAIKTLAV 431
Query: 417 IGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASR 476
IGPN++VT TMIGNY G C YTTPLQG+ T + GC +VAC Q+ A +
Sbjct: 432 IGPNANVTKTMIGNYEGTPCKYTTPLQGLTALVATTYLPGCSNVACGTAQI-DEAKKIAA 490
Query: 477 QADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAF 536
ADAT+L++G+DQSIEAE DR + LPG+Q L+++V+ ASKG ILV+MSGG D++F
Sbjct: 491 AADATVLIVGIDQSIEAEGRDRVNIQLPGQQPLLITEVAKASKGNVILVVMSGGGFDISF 550
Query: 537 AKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRP 596
AKND +I +I+W GYPG+AGG AIAD++FG NP G+LPMTWYPQ Y+ +PMT M MRP
Sbjct: 551 AKNDDKITSILWVGYPGEAGGAAIADVIFGFYNPSGRLPMTWYPQSYVDKVPMTNMNMRP 610
Query: 597 SQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGK 656
+ YPGRTYRFY G +Y FG G+SYT F H + AP V++P++ H ++ K
Sbjct: 611 DPASGYPGRTYRFYTGETIYTFGDGLSYTQFNHHLVQAPKSVSIPIEEGHSCHSSKC--K 668
Query: 657 AIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAG 716
++ C L + + V N G+ G+HT+ +FS+PP+ H +P K L+ FEKV V A
Sbjct: 669 SVDAVQESCQNLVFDIHLRVNNAGNISGSHTVFLFSSPPSVHNSPQKHLLGFEKVFVTAK 728
Query: 717 AQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
A+ V + VCK LS+VD GTR++ LG H +H+G KHS+++
Sbjct: 729 AKALVRFKVDVCKDLSIVDELGTRKVALGLHVLHVGNLKHSLNV 772
>gi|359481045|ref|XP_002268626.2| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
[Vitis vinifera]
gi|296089342|emb|CBI39114.3| unnamed protein product [Vitis vinifera]
Length = 774
Score = 905 bits (2339), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 443/764 (57%), Positives = 551/764 (72%), Gaps = 10/764 (1%)
Query: 4 TIAFFFLGLI--LLSASSSGLAAREP-FACD-PKDATTRTLPFCQVSLPIPQRVNDLIGR 59
T+ FL LS+ L P FACD + T FC SL RV DL+ R
Sbjct: 12 TVFLCFLSCFSHFLSSPKWVLGQSSPVFACDVENNPTLGQFGFCNTSLETAARVADLVKR 71
Query: 60 LSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITT 119
L+L+EK+ L++ AA+V RLGI YEWWSEALHGVS VGPGT F PGATSFPQVI T
Sbjct: 72 LTLEEKIGFLVNSAASVSRLGIPKYEWWSEALHGVSYVGPGTHFNSIVPGATSFPQVILT 131
Query: 120 ASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLS 179
A+SFNA+L+EAIG+VVS EARAMYN G AGLT+WSPNVNIFRDPRWGRGQETPGEDP+LS
Sbjct: 132 AASFNASLFEAIGKVVSTEARAMYNVGLAGLTFWSPNVNIFRDPRWGRGQETPGEDPLLS 191
Query: 180 GKYAASYVRGLQ-GSDG--DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTF 236
KYA++YVRGLQ G DG DRLKVAA CKH+TAYDLDNW GVDR HFNA V+KQD++DTF
Sbjct: 192 SKYASAYVRGLQQGDDGSPDRLKVAACCKHYTAYDLDNWKGVDRLHFNAVVTKQDMDDTF 251
Query: 237 DVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYY 296
PF+ CV++G VASVMCS+NQVNG PTCADP++L +RGEW+LNGYIVSDCDSV V+Y
Sbjct: 252 QPPFKSCVIDGNVASVMCSFNQVNGKPTCADPDLLSGIVRGEWKLNGYIVSDCDSVDVFY 311
Query: 297 DTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMR 356
++QH+T TPEEAAA AI AGLDL+CG FLG HTE+AV+ GL+ E ++ A+ N MR
Sbjct: 312 NSQHYTKTPEEAAAKAILAGLDLNCGSFLGQHTEAAVKGGLVDESAVDKAVSNNFATLMR 371
Query: 357 LGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAV 416
LG FDG PS YG LGPKDVCT +HQE+A EAARQGIVLLKN SLPLS +T+A+
Sbjct: 372 LGFFDGNPSKAIYGKLGPKDVCTSEHQEMAREAARQGIVLLKNSKGSLPLSPTAIKTLAI 431
Query: 417 IGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASR 476
IGPN++VT TMIGNY G C YTTPLQG+ T + GC +VAC Q+ A +
Sbjct: 432 IGPNANVTKTMIGNYEGTPCKYTTPLQGLTALVATTYLPGCSNVACGTAQI-DEAKKIAA 490
Query: 477 QADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAF 536
ADAT+L++G+DQSIEAE DR + LPG+Q L+++V+ ASKG ILV+MSGG D++F
Sbjct: 491 AADATVLIVGIDQSIEAEGRDRVSIQLPGQQPLLITEVAKASKGNVILVVMSGGGFDISF 550
Query: 537 AKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRP 596
AKND +IA+I+W GYPG+AGG AIAD++FG NP G+LPMTWYPQ Y+ +PMT M MRP
Sbjct: 551 AKNDDKIASILWVGYPGEAGGAAIADVIFGFYNPSGRLPMTWYPQSYVDKVPMTNMNMRP 610
Query: 597 SQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGK 656
+ YPGRTYRFY G +Y FG G+SYT F H + AP V++P++ H ++ K
Sbjct: 611 DPASGYPGRTYRFYTGETIYTFGDGLSYTQFNHHLVQAPKSVSIPIEEGHSCHSSKC--K 668
Query: 657 AIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAG 716
++ C L + + V N G+ G+HT+ +FS+PP+ H +P K L+ FEKV V A
Sbjct: 669 SVDAVQESCQNLAFDIHLRVNNAGNISGSHTVFLFSSPPSVHNSPQKHLLGFEKVFVTAK 728
Query: 717 AQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
A+ V + VCK LS+VD GT+++ LG H +H+G KHS+++
Sbjct: 729 AEALVRFKVDVCKDLSIVDELGTQKVALGLHVLHVGSLKHSLNV 772
>gi|224054312|ref|XP_002298197.1| predicted protein [Populus trichocarpa]
gi|222845455|gb|EEE83002.1| predicted protein [Populus trichocarpa]
Length = 741
Score = 904 bits (2337), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 438/739 (59%), Positives = 543/739 (73%), Gaps = 9/739 (1%)
Query: 28 FACDP-KDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEW 86
FACD + + + FC SL + RV DL+ RL+LQEK+ L++ A +V RLGI YEW
Sbjct: 4 FACDVVSNPSLASFGFCNTSLGVSDRVVDLVKRLTLQEKILFLVNSAGSVSRLGIPKYEW 63
Query: 87 WSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGG 146
WSEALHGVS VGPGT F PGATSFPQVI TA+SFN +L+ AIG+VVS EARAMYN G
Sbjct: 64 WSEALHGVSYVGPGTHFSSVVPGATSFPQVILTAASFNTSLFVAIGKVVSTEARAMYNVG 123
Query: 147 TAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDG---DRLKVAAS 203
AGLT+WSPN+NIFRDPRWGRGQETPGEDP+LS KY + YV+GLQ D D LKVAA
Sbjct: 124 LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLSSKYGSGYVKGLQQRDDGNPDGLKVAAC 183
Query: 204 CKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVP 263
CKH+TAYDLDNW GVDR+HFNA V+KQD++DTF PF+ CV++G VASVMCSYN+VNG+P
Sbjct: 184 CKHYTAYDLDNWKGVDRYHFNAVVTKQDMDDTFQPPFKSCVVDGNVASVMCSYNKVNGIP 243
Query: 264 TCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAG--LDLDC 321
TCADP++L IRGEW+LNGYIV+DCDS+ V+Y++QH+T TPEEAAA AI AG LDL+C
Sbjct: 244 TCADPDLLSGVIRGEWKLNGYIVTDCDSIDVFYNSQHYTKTPEEAAAKAILAGIRLDLNC 303
Query: 322 GPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD 381
G FLG HTE+AV GL++E I+ A+ N MRLG FDG+PS Q YG LGPKDVCT +
Sbjct: 304 GSFLGKHTEAAVTAGLVNESAIDRAVSNNFATLMRLGFFDGDPSKQLYGKLGPKDVCTAE 363
Query: 382 HQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTP 441
+QELA EAARQGIVLLKN SLPLS + +AVIGPN++VT TMIGNY G C YTTP
Sbjct: 364 NQELAREAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTP 423
Query: 442 LQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGL 501
LQG+ T + GC +VAC+ Q+ A + ADAT+LVMG D SIEAE+ DR +
Sbjct: 424 LQGLAALVATTYLPGCSNVACSTAQV-DDAKKIAAAADATVLVMGADLSIEAESRDRVDI 482
Query: 502 LLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIA 561
LLPG+QQ L++ V+ AS GP ILV+MSGG +DV+FAK + +I +I+W GYPG+AGG AIA
Sbjct: 483 LLPGQQQLLITAVANASTGPVILVIMSGGGMDVSFAKTNDKITSILWVGYPGEAGGAAIA 542
Query: 562 DILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHG 621
DI+FG+ NP G+LPMTWYPQ Y+ +PMT M MRP S YPGRTYRFY G VY FG G
Sbjct: 543 DIIFGSYNPSGRLPMTWYPQSYVDKVPMTNMNMRPDPSNGYPGRTYRFYTGETVYSFGDG 602
Query: 622 MSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGS 681
+SY+ F H + AP +V+VPL+ H + + K++ C LT V + +KN G+
Sbjct: 603 LSYSEFSHELTQAPGLVSVPLEENH--VCYSSECKSVAAAEQTCQNLTFDVHLRIKNTGT 660
Query: 682 KDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRR 741
G+HT+ +FSTPP+ H +P K LV FEKV + A VG + VCK LSVVD G+++
Sbjct: 661 TSGSHTVFLFSTPPSVHNSPQKHLVGFEKVFLHAQTDSHVGFKVDVCKDLSVVDELGSKK 720
Query: 742 IPLGEHNIHIGGTKHSVSL 760
+ LGEH +HIG KHS+++
Sbjct: 721 VALGEHVLHIGSLKHSMTV 739
>gi|356524862|ref|XP_003531047.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
[Glycine max]
Length = 765
Score = 902 bits (2330), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 435/739 (58%), Positives = 541/739 (73%), Gaps = 7/739 (0%)
Query: 26 EPFACDP-KDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGY 84
+ FACD K FC SL + RV DL+GRL+LQEK+ L++ A V RLGI Y
Sbjct: 28 QTFACDVGKSPAVAGYGFCDKSLGVEARVKDLVGRLTLQEKIGNLVNSAVDVSRLGIPKY 87
Query: 85 EWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN 144
EWWSEALHGVSNVGPGT+F PGATSFP I TA+SFN +L+E IGRVVS EARAMYN
Sbjct: 88 EWWSEALHGVSNVGPGTRFSNVIPGATSFPMPILTAASFNTSLFEVIGRVVSTEARAMYN 147
Query: 145 GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDG---DRLKVA 201
G AGLTYWSPN+NIFRDPRWGRG ETPGEDPVL+ KYAA YV+GLQ +DG ++LKVA
Sbjct: 148 VGLAGLTYWSPNINIFRDPRWGRGLETPGEDPVLTSKYAAGYVKGLQQTDGGDPNKLKVA 207
Query: 202 ASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNG 261
A CKH+TAYD+DNW G+ R+ FNA V+KQD+EDTF PF+ CV++G VASVMCSYN+VNG
Sbjct: 208 ACCKHYTAYDVDNWKGIQRYTFNAVVTKQDMEDTFQPPFKSCVIDGNVASVMCSYNKVNG 267
Query: 262 VPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC 321
PTCADP++LK +RGEW+LNGYIVSDCDSV V Y QH+T TPEEAAA +I AGLDL+C
Sbjct: 268 KPTCADPDLLKGVVRGEWKLNGYIVSDCDSVEVLYKDQHYTKTPEEAAAISILAGLDLNC 327
Query: 322 GPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD 381
G FLG +TE AV++GL+ E INNA+ N MRLG FDG+P QPYG+LGPKDVCT +
Sbjct: 328 GRFLGQYTEGAVKQGLIDEASINNAVTNNFATLMRLGFFDGDPRKQPYGNLGPKDVCTQE 387
Query: 382 HQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTP 441
+QELA EAARQGIVLLKN SLPL+ +++AVIGPN++ T MIGNY GI C Y +P
Sbjct: 388 NQELAREAARQGIVLLKNSPASLPLNAKAIKSLAVIGPNANATRVMIGNYEGIPCKYISP 447
Query: 442 LQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGL 501
LQG+ +A T + GC DV C + L A A+ ADAT++V+G +IEAE+LDR +
Sbjct: 448 LQGLTAFAPTSYAAGCLDVRCPNPVLDDAKKIAA-SADATVIVVGASLAIEAESLDRVNI 506
Query: 502 LLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIA 561
LLPG+QQ LVS+V+ ASKGP ILV+MSGG +DV+FAKN+ +I +I+W GYPG+AGG AIA
Sbjct: 507 LLPGQQQLLVSEVANASKGPVILVIMSGGGMDVSFAKNNNKITSILWVGYPGEAGGAAIA 566
Query: 562 DILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHG 621
D++FG NP G+LPMTWYPQ Y+ +PMT M MRP + YPGRTYRFYKG V+ FG G
Sbjct: 567 DVIFGFHNPSGRLPMTWYPQSYVDKVPMTNMNMRPDPATGYPGRTYRFYKGETVFAFGDG 626
Query: 622 MSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGS 681
+SY++ VH + AP +V+V L H + + K+I V C L + + +KN G
Sbjct: 627 LSYSSIVHKLVKAPQLVSVQLAEDH--VCRSSECKSIDVVGEHCQNLVFDIHLRIKNKGK 684
Query: 682 KDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRR 741
AHT+ +FSTPPA H AP K L+ FEKVH+ ++ V + VCK LS+VD G R+
Sbjct: 685 MSSAHTVFLFSTPPAVHNAPQKHLLGFEKVHLIGKSEALVSFKVDVCKDLSIVDELGNRK 744
Query: 742 IPLGEHNIHIGGTKHSVSL 760
+ LG+H +H+G KH +S+
Sbjct: 745 VALGQHLLHVGDLKHPLSV 763
>gi|356525896|ref|XP_003531557.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 1-like
[Glycine max]
Length = 776
Score = 900 bits (2325), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 430/760 (56%), Positives = 546/760 (71%), Gaps = 7/760 (0%)
Query: 5 IAFFFLGLILLSASSSGLAAREPFACD-PKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQ 63
+F F+ +LL+ FACD K+ FC SL + RV DL+ RL+LQ
Sbjct: 18 FSFMFVATVLLNCDRVSGQTSSVFACDVAKNPALAGYGFCDKSLSLEDRVADLVKRLTLQ 77
Query: 64 EKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSF 123
EK+ L++ A +V RLGI YEWWSEALHGVSNVGPGT F PGATSFP I TA+SF
Sbjct: 78 EKIGSLVNSATSVSRLGIPKYEWWSEALHGVSNVGPGTHFSSLVPGATSFPMPILTAASF 137
Query: 124 NATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYA 183
NA+L+EAIGRVVS EARAMYN G AGLTYWSPN+NIFRDPRWGRGQETPGEDP+LS KYA
Sbjct: 138 NASLFEAIGRVVSTEARAMYNVGLAGLTYWSPNINIFRDPRWGRGQETPGEDPLLSSKYA 197
Query: 184 ASYVRGLQGSD---GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPF 240
YV+GLQ +D ++LKVAA CKH+TAYDLDNW G+ R+ FNA V++QD++DTF PF
Sbjct: 198 TGYVKGLQQTDDGDSNKLKVAACCKHYTAYDLDNWKGIQRYTFNAVVTQQDMDDTFQPPF 257
Query: 241 RMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQH 300
+ CV++G VASVMCSYNQVNG PTCADP++LK IRGEW+LNGYIVSDCDSV V + QH
Sbjct: 258 KSCVIDGNVASVMCSYNQVNGKPTCADPDLLKGVIRGEWKLNGYIVSDCDSVEVLFKDQH 317
Query: 301 FTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMF 360
+T TPEEAAA+ I AGLDL+CG +LG +TE AV++GLL E INNA+ N MRLG F
Sbjct: 318 YTKTPEEAAAETILAGLDLNCGNYLGQYTEGAVKQGLLDEASINNAVSNNFATLMRLGFF 377
Query: 361 DGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPN 420
DG+PS Q YG+LGP DVCT +++ELA EAARQGIVLLKN SLPL+ +++AVIGPN
Sbjct: 378 DGDPSKQTYGNLGPNDVCTSENRELAREAARQGIVLLKNSLGSLPLNAKAIKSLAVIGPN 437
Query: 421 SDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADA 480
++ T MIGNY GI C Y +PLQ + T + GC +V CA+ +L A + ADA
Sbjct: 438 ANATRVMIGNYEGIPCNYISPLQALTALVPTSYAAGCPNVQCANAEL-DDATQIAASADA 496
Query: 481 TILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKND 540
T++V+G +IEAE+LDR +LLPG+QQ LVS+V+ ASKGP ILV+MSGG +DV+FAK++
Sbjct: 497 TVIVVGASLAIEAESLDRINILLPGQQQLLVSEVANASKGPVILVIMSGGGMDVSFAKSN 556
Query: 541 PRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSK 600
+I +I+W GYPG+AGG AIAD++FG NP G+LPMTWYPQ Y+ +PMT M MR +
Sbjct: 557 DKITSILWVGYPGEAGGAAIADVIFGFYNPSGRLPMTWYPQSYVNKVPMTNMNMRADPAT 616
Query: 601 RYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKV 660
YPGRTYRFYKG V+ FG G+S++N H + AP +V+VPL H ++ ++ V
Sbjct: 617 GYPGRTYRFYKGETVFSFGDGISFSNIEHKIVKAPQLVSVPLAEDHECRSSEC--MSLDV 674
Query: 661 THAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQR 720
C L + + VKN+G +H +L+F TPP H AP K L+ FEKVH+P ++ +
Sbjct: 675 ADEHCQNLAFDIHLGVKNMGKMSSSHVVLLFFTPPDVHNAPQKHLLGFEKVHLPGKSEAQ 734
Query: 721 VGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
V + +CK LSVVD G R++PLG+H +H+G KH +S+
Sbjct: 735 VRFKVDICKDLSVVDELGNRKVPLGQHLLHVGNLKHQLSV 774
>gi|356558612|ref|XP_003547598.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 1-like
[Glycine max]
Length = 776
Score = 899 bits (2324), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 433/761 (56%), Positives = 546/761 (71%), Gaps = 7/761 (0%)
Query: 5 IAFFFLGLILLSASSSGLAAREPFACD-PKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQ 63
+F F+ +LL+ + FACD K+ FC SL + RV DL+ RL+LQ
Sbjct: 18 FSFTFVASVLLNCNRVSGQTSAVFACDVAKNPALAGYGFCDKSLSVEDRVADLVKRLTLQ 77
Query: 64 EKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSF 123
EK+ L++ A +V RLGI YEWWSEALHGVSNVGPGT F PGATSFP I TA+SF
Sbjct: 78 EKIGSLVNSATSVSRLGIPKYEWWSEALHGVSNVGPGTHFSSLVPGATSFPMPILTAASF 137
Query: 124 NATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYA 183
NA+L+EAIGRVVS EARAMYN G AGLTYWSPN+NIFRDPRWGRGQETPGEDP+LS KYA
Sbjct: 138 NASLFEAIGRVVSTEARAMYNVGLAGLTYWSPNINIFRDPRWGRGQETPGEDPLLSSKYA 197
Query: 184 ASYVRGLQGSD---GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPF 240
YV+GLQ +D ++LKVAA CKH+TAYDLDNW G+ R+ FNA V++QD++DTF PF
Sbjct: 198 TGYVKGLQQTDDGDSNKLKVAACCKHYTAYDLDNWKGIQRYTFNAVVTQQDMDDTFQPPF 257
Query: 241 RMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQH 300
+ CV++G VASVMCSYNQVNG PTCADP++LK IRGEW+LNGYIVSDCDSV V + QH
Sbjct: 258 KSCVIDGNVASVMCSYNQVNGKPTCADPDLLKGIIRGEWKLNGYIVSDCDSVEVLFKDQH 317
Query: 301 FTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMF 360
+T TPEEAAA I AGLDL+CG +LG +TE AV++GLL E INNA+ N MRLG F
Sbjct: 318 YTKTPEEAAAQTILAGLDLNCGNYLGQYTEGAVKQGLLDEASINNAVSNNFATLMRLGFF 377
Query: 361 DGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPN 420
DG+PS QPYG+LGPKDVCT +++ELA EAARQGIVLLKN SLPL+ +++AVIGPN
Sbjct: 378 DGDPSKQPYGNLGPKDVCTSENRELAREAARQGIVLLKNSPGSLPLNAKTIKSLAVIGPN 437
Query: 421 SDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADA 480
++ T MIGNY GI C Y +PLQ + T + GC +V CA+ +L A + ADA
Sbjct: 438 ANATRVMIGNYEGIPCNYISPLQTLTALVPTSYAAGCPNVQCANAEL-DDATQIAASADA 496
Query: 481 TILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKND 540
T++++G +IEAE+LDR +LLPG+QQ LVS+V+ ASKGP ILV+MSGG +DV+FAK++
Sbjct: 497 TVIIVGASLAIEAESLDRINILLPGQQQLLVSEVANASKGPVILVIMSGGGMDVSFAKSN 556
Query: 541 PRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSK 600
+I +I+W GYPG+AGG AIAD++FG NP G+LPMTWYPQ Y+ +PMT M MR +
Sbjct: 557 DKITSILWVGYPGEAGGAAIADVIFGFYNPSGRLPMTWYPQAYVNKVPMTNMNMRADPAT 616
Query: 601 RYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKV 660
YPGRTYRFYKG V+ FG G+S+++ H + AP +V+VPL H ++ I
Sbjct: 617 GYPGRTYRFYKGETVFSFGDGISFSSIEHKIVKAPQLVSVPLAEDHECRSSECMSLDIAD 676
Query: 661 THAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQR 720
H C L + + VKN G +H +L+F TPP H AP K L+ FEKVH+P ++ +
Sbjct: 677 EH--CQNLAFDIHLGVKNTGKMSTSHVVLLFFTPPDVHNAPQKHLLGFEKVHLPGKSEAQ 734
Query: 721 VGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
V + VCK LSVVD G R++PLG+H +H+G KH +SL
Sbjct: 735 VRFKVDVCKDLSVVDELGNRKVPLGQHLLHVGNLKHPLSLR 775
>gi|74355968|dbj|BAE44362.1| alpha-L-arabinofuranosidase [Raphanus sativus]
Length = 780
Score = 899 bits (2323), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 442/768 (57%), Positives = 550/768 (71%), Gaps = 16/768 (2%)
Query: 6 AFFFLGLILLSASSSGLAAREP-FACD-PKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQ 63
+ F L LI L S A P FACD + + FC ++ I RV DL+ RL+LQ
Sbjct: 16 SVFSLSLIFLCLLDSSNAQSTPVFACDVAGNPSLAAYGFCNTAIKIEYRVADLVARLTLQ 75
Query: 64 EKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSF 123
EK+ +L S V RLGI YEWWSEALHGVS VGPGT+F G PGATSFPQVI TA+SF
Sbjct: 76 EKIGVLTSKLHGVARLGIPTYEWWSEALHGVSYVGPGTRFSGQVPGATSFPQVILTAASF 135
Query: 124 NATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYA 183
N +L++AIG+VVS EARAMYN G AGLTYWSPNVNIFRDPRWGRGQETPGEDP+LS KYA
Sbjct: 136 NVSLFQAIGKVVSTEARAMYNVGLAGLTYWSPNVNIFRDPRWGRGQETPGEDPLLSSKYA 195
Query: 184 ASYVRGLQ---GSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPF 240
+ YV+GLQ SD +RLKVAA CKH+TAYD+DNW GV+R+ FNA V++QD++DT+ PF
Sbjct: 196 SGYVKGLQETDSSDANRLKVAACCKHYTAYDVDNWKGVERYSFNAVVNQQDLDDTYQPPF 255
Query: 241 RMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQH 300
+ CV++G VASVMCSYN+VNG PTCADP++L IRGEW+LNGYIVSDCDSV V Y QH
Sbjct: 256 KSCVVDGNVASVMCSYNKVNGKPTCADPDLLSGVIRGEWKLNGYIVSDCDSVDVLYKNQH 315
Query: 301 FTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMF 360
+T TPEEAAA +I AGLDL+CG FLG HTE+AV+ GL+ E I+ A+ N MRLG F
Sbjct: 316 YTKTPEEAAAISINAGLDLNCGYFLGDHTEAAVKAGLVKEAAIDKAITNNFLTLMRLGFF 375
Query: 361 DGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPN 420
DG+P Q YG LGPKDVCTP +QELA EAARQGIVLLKN G +LPLS +T+AVIGPN
Sbjct: 376 DGDPKKQIYGGLGPKDVCTPANQELAAEAARQGIVLLKNTG-ALPLSPKTIKTLAVIGPN 434
Query: 421 SDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADA 480
++VT TMIGNY G C YTTPLQG+ T + GC +VACA + G+ A+ +DA
Sbjct: 435 ANVTKTMIGNYEGTPCKYTTPLQGLAGTVHTTYLPGCSNVACAVADVAGSTKLAA-ASDA 493
Query: 481 TILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKND 540
T+LV+G DQSIEAE+ DR L LPG+QQELV++V+ A+KGP LV+MSGG D+ FAKND
Sbjct: 494 TVLVIGADQSIEAESRDRVDLNLPGQQQELVTQVAKAAKGPVFLVIMSGGGFDITFAKND 553
Query: 541 PRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSK 600
+IA I+W GYPG+AGG A AD++FG NP G+LPMTWYPQ Y+ +PMT M MRP +S
Sbjct: 554 AKIAGILWVGYPGEAGGIATADVIFGRYNPSGRLPMTWYPQSYVEKVPMTNMNMRPDKSN 613
Query: 601 RYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKV 660
YPGRTYRFY G VY FG G+SYT F H++ AP +V++ L+ H + + +++
Sbjct: 614 GYPGRTYRFYTGETVYAFGDGLSYTKFSHSLVKAPRLVSLSLEENH--VCRSSECQSLNA 671
Query: 661 THAKCNRLTLG-------VQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHV 713
C+ G V + V+N G ++G HT+ +F+TPPA H +P K L+ FEK+ +
Sbjct: 672 IGPHCDNAVSGTGGKAFEVHIKVQNGGDREGIHTVFLFTTPPAVHGSPRKHLLGFEKIRL 731
Query: 714 PAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
+ V + VCK LSVVD G R+I LG+H +H+G KHS+S+
Sbjct: 732 GKMEEAVVKFKVDVCKDLSVVDEVGKRKIGLGQHLLHVGDVKHSLSIR 779
>gi|297797477|ref|XP_002866623.1| beta-xylosidase 4 [Arabidopsis lyrata subsp. lyrata]
gi|297312458|gb|EFH42882.1| beta-xylosidase 4 [Arabidopsis lyrata subsp. lyrata]
Length = 784
Score = 895 bits (2313), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 442/767 (57%), Positives = 549/767 (71%), Gaps = 19/767 (2%)
Query: 5 IAFFFLGLILLSASSSGLAAREPFACD-PKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQ 63
+ FF L L +A SS + FACD + + FC L I RV DL+ RL+LQ
Sbjct: 26 LCFFLYFLDLSNAQSSPV-----FACDVAANPSLAAYGFCNTVLKIEYRVADLVARLTLQ 80
Query: 64 EKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSF 123
EK+ L+S A V RLGI YEWWSEALHGVS +GPGT F PGATSFPQVI TA+SF
Sbjct: 81 EKIGFLVSKANGVTRLGIPTYEWWSEALHGVSYIGPGTHFSSQVPGATSFPQVILTAASF 140
Query: 124 NATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYA 183
N +L++AIG+VVS EARAMYN G AGLTYWSPNVNIFRDPRWGRGQETPGEDP+L+ KYA
Sbjct: 141 NVSLFQAIGKVVSTEARAMYNVGLAGLTYWSPNVNIFRDPRWGRGQETPGEDPLLASKYA 200
Query: 184 ASYVRGLQGSDG---DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPF 240
+ YV+GLQ +DG +RLKVAA CKH+TAYD+DNW GV+R+ FNA V++QD++DT+ PF
Sbjct: 201 SGYVKGLQETDGGDSNRLKVAACCKHYTAYDVDNWKGVERYSFNAVVTQQDMDDTYQPPF 260
Query: 241 RMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQH 300
+ CV++G VASVMCSYNQVNG PTCADP++L IRGEW+LNGYIVSDCDSV V Y QH
Sbjct: 261 KSCVVDGNVASVMCSYNQVNGKPTCADPDLLSGVIRGEWKLNGYIVSDCDSVDVLYKNQH 320
Query: 301 FTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMF 360
+T TP EAAA +I AGLDL+CG FLG HTE AV+ GL++E I+ A+ N MRLG F
Sbjct: 321 YTKTPAEAAAISILAGLDLNCGSFLGQHTEEAVKSGLVNEAAIDKAISNNFLTLMRLGFF 380
Query: 361 DGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPN 420
DG P +Q YG LGP DVCT +QELA +AARQGIVLLKN G LPLS +T+AVIGPN
Sbjct: 381 DGNPKNQIYGGLGPTDVCTSANQELAADAARQGIVLLKNTG-FLPLSPKSIKTLAVIGPN 439
Query: 421 SDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADA 480
++VT TMIGNY G C YTTPLQG+ T + GC +VACA + GA A+ AD
Sbjct: 440 ANVTKTMIGNYEGTPCKYTTPLQGLAGAVSTTYLPGCSNVACAVADVAGATKLAA-TADV 498
Query: 481 TILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKND 540
T+L++G DQSIEAE+ DR L LPG+QQELV +V+ A+KGP +LV+MSGG D+ FAKND
Sbjct: 499 TVLLIGADQSIEAESRDRVDLNLPGQQQELVIQVAKAAKGPVLLVIMSGGGFDITFAKND 558
Query: 541 PRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSK 600
P+IA I+W GYPG+AGG AIADI+FG NP G+LPMTWYPQ Y+ +PMT M MRP +SK
Sbjct: 559 PKIAGILWVGYPGEAGGIAIADIIFGRYNPSGRLPMTWYPQSYVEKVPMTIMNMRPDKSK 618
Query: 601 RYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKV 660
YPGRTYRFY G VY FG G+SYT F H++ AP++V++ L+ H + + +++
Sbjct: 619 GYPGRTYRFYTGETVYAFGDGLSYTKFSHSLVKAPSLVSLSLEENH--VCRSSECQSLDA 676
Query: 661 THAKCNRLTLG------VQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVP 714
C G VQ+ V+N G ++G HT+ +F+TPPA H +P K L+ FEK+ +
Sbjct: 677 IGPHCENAVSGGGSAFEVQIKVRNGGDREGIHTVFLFTTPPAIHGSPRKHLLGFEKIRLG 736
Query: 715 AGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
+ V + VCK LSVVD G R+I LG+H +H+G KHS+S+
Sbjct: 737 KMEEAVVRFKVEVCKDLSVVDEIGKRKIGLGKHLLHVGDLKHSLSIR 783
>gi|15237736|ref|NP_201262.1| beta-D-xylosidase 4 [Arabidopsis thaliana]
gi|75262663|sp|Q9FLG1.1|BXL4_ARATH RecName: Full=Beta-D-xylosidase 4; Short=AtBXL4; Flags: Precursor
gi|10178060|dbj|BAB11424.1| beta-xylosidase [Arabidopsis thaliana]
gi|332010539|gb|AED97922.1| beta-D-xylosidase 4 [Arabidopsis thaliana]
Length = 784
Score = 894 bits (2309), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 439/769 (57%), Positives = 546/769 (71%), Gaps = 14/769 (1%)
Query: 3 STIAFFFLGLILLSASSSGLAAREPFACD-PKDATTRTLPFCQVSLPIPQRVNDLIGRLS 61
S++ FL L + S + FACD + + FC L I RV DL+ RL+
Sbjct: 19 SSVYLIFLCFFLYFLNFSNAQSSPVFACDVAANPSLAAYGFCNTVLKIEYRVADLVARLT 78
Query: 62 LQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTAS 121
LQEK+ L+S A V RLGI YEWWSEALHGVS +GPGT F PGATSFPQVI TA+
Sbjct: 79 LQEKIGFLVSKANGVTRLGIPTYEWWSEALHGVSYIGPGTHFSSQVPGATSFPQVILTAA 138
Query: 122 SFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGK 181
SFN +L++AIG+VVS EARAMYN G AGLTYWSPNVNIFRDPRWGRGQETPGEDP+L+ K
Sbjct: 139 SFNVSLFQAIGKVVSTEARAMYNVGLAGLTYWSPNVNIFRDPRWGRGQETPGEDPLLASK 198
Query: 182 YAASYVRGLQGSDG---DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDV 238
YA+ YV+GLQ +DG +RLKVAA CKH+TAYD+DNW GV+R+ FNA V++QD++DT+
Sbjct: 199 YASGYVKGLQETDGGDSNRLKVAACCKHYTAYDVDNWKGVERYSFNAVVTQQDMDDTYQP 258
Query: 239 PFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDT 298
PF+ CV++G VASVMCSYNQVNG PTCADP++L IRGEW+LNGYIVSDCDSV V Y
Sbjct: 259 PFKSCVVDGNVASVMCSYNQVNGKPTCADPDLLSGVIRGEWKLNGYIVSDCDSVDVLYKN 318
Query: 299 QHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLG 358
QH+T TP EAAA +I AGLDL+CG FLG HTE AV+ GL++E I+ A+ N MRLG
Sbjct: 319 QHYTKTPAEAAAISILAGLDLNCGSFLGQHTEEAVKSGLVNEAAIDKAISNNFLTLMRLG 378
Query: 359 MFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIG 418
FDG P +Q YG LGP DVCT +QELA +AARQGIVLLKN G LPLS +T+AVIG
Sbjct: 379 FFDGNPKNQIYGGLGPTDVCTSANQELAADAARQGIVLLKNTG-CLPLSPKSIKTLAVIG 437
Query: 419 PNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQA 478
PN++VT TMIGNY G C YTTPLQG+ T + GC +VACA + GA A+ A
Sbjct: 438 PNANVTKTMIGNYEGTPCKYTTPLQGLAGTVSTTYLPGCSNVACAVADVAGATKLAA-TA 496
Query: 479 DATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAK 538
D ++LV+G DQSIEAE+ DR L LPG+QQELV +V+ A+KGP +LV+MSGG D+ FAK
Sbjct: 497 DVSVLVIGADQSIEAESRDRVDLHLPGQQQELVIQVAKAAKGPVLLVIMSGGGFDITFAK 556
Query: 539 NDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQ 598
NDP+IA I+W GYPG+AGG AIADI+FG NP GKLPMTWYPQ Y+ +PMT M MRP +
Sbjct: 557 NDPKIAGILWVGYPGEAGGIAIADIIFGRYNPSGKLPMTWYPQSYVEKVPMTIMNMRPDK 616
Query: 599 SKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAI 658
+ YPGRTYRFY G VY FG G+SYT F HT+ AP++V++ L+ H + + +++
Sbjct: 617 ASGYPGRTYRFYTGETVYAFGDGLSYTKFSHTLVKAPSLVSLGLEENH--VCRSSECQSL 674
Query: 659 KVTHAKCNRLTLG------VQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVH 712
C G V + V+N G ++G HT+ +F+TPPA H +P K LV FEK+
Sbjct: 675 DAIGPHCENAVSGGGSAFEVHIKVRNGGDREGIHTVFLFTTPPAIHGSPRKHLVGFEKIR 734
Query: 713 VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
+ + V + +CK LSVVD G R+I LG+H +H+G KHS+S+
Sbjct: 735 LGKREEAVVRFKVEICKDLSVVDEIGKRKIGLGKHLLHVGDLKHSLSIR 783
>gi|357445735|ref|XP_003593145.1| Beta-xylosidase/alpha-L-arabinofuranosidase [Medicago truncatula]
gi|355482193|gb|AES63396.1| Beta-xylosidase/alpha-L-arabinofuranosidase [Medicago truncatula]
Length = 775
Score = 890 bits (2300), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 427/738 (57%), Positives = 540/738 (73%), Gaps = 7/738 (0%)
Query: 28 FACD-PKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEW 86
FACD K+ + FC SL + RV+DL+ RL+LQEK+ L + A V RLGI YEW
Sbjct: 40 FACDVAKNTNVSSYGFCDKSLSVEDRVSDLVKRLTLQEKIGNLGNSAVEVSRLGIPKYEW 99
Query: 87 WSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGG 146
WSEALHGVSN+GPGT F PGATSFP I TA+SFN +L++AIG VVS+EARAMYN G
Sbjct: 100 WSEALHGVSNIGPGTHFSSLVPGATSFPMPILTAASFNTSLFQAIGSVVSNEARAMYNVG 159
Query: 147 TAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD---GDRLKVAAS 203
AGLTYWSPN+NIFRDPRWGRGQETPGEDP+LS KYAA YV+GLQ +D D+LKVAA
Sbjct: 160 LAGLTYWSPNINIFRDPRWGRGQETPGEDPLLSSKYAAGYVKGLQQTDDGDSDKLKVAAC 219
Query: 204 CKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVP 263
CKH+TAYD+DNW GV R+ F+A VS+QD++DTF PF+ CV++G VASVMCSYN+VNG P
Sbjct: 220 CKHYTAYDVDNWKGVQRYTFDAVVSQQDLDDTFQPPFKSCVIDGNVASVMCSYNKVNGKP 279
Query: 264 TCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGP 323
TCADP++LK IRG+W+LNGYIVSDCDSV V + QH+T TPEEAAA I +GLDLDCG
Sbjct: 280 TCADPDLLKGVIRGKWKLNGYIVSDCDSVEVLFKDQHYTKTPEEAAAKTILSGLDLDCGS 339
Query: 324 FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQ 383
+LG +T AV++GL+ E INNA+ N MRLG FDG+PS QPYG+LGPKDVCTP++Q
Sbjct: 340 YLGQYTGGAVKQGLVDEASINNAVSNNFATLMRLGFFDGDPSKQPYGNLGPKDVCTPENQ 399
Query: 384 ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQ 443
ELA EAARQGIVLLKN SLPLS +++AVIGPN++ T MIGNY GI C YT+PLQ
Sbjct: 400 ELAREAARQGIVLLKNSPGSLPLSSKAIKSLAVIGPNANATRVMIGNYEGIPCKYTSPLQ 459
Query: 444 GIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLL 503
G+ + T + GC DV CA+ Q+ A + ADATI+V+G + +IEAE+LDR +LL
Sbjct: 460 GLTAFVPTSYAPGCPDVQCANAQI-DDAAKIAASADATIIVVGANLAIEAESLDRVNILL 518
Query: 504 PGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADI 563
PG+QQ+LV++V+ SKGP ILV+MSGG +DV+FAK + +I +I+W GYPG+AGG AIAD+
Sbjct: 519 PGQQQQLVNEVANVSKGPVILVIMSGGGMDVSFAKTNDKITSILWVGYPGEAGGAAIADV 578
Query: 564 LFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMS 623
+FG+ NP G+LPMTWYPQ Y+ +PMT M MR + YPGRTYRFYKG V+ FG GMS
Sbjct: 579 IFGSYNPSGRLPMTWYPQSYVEKIPMTNMNMRSDPATGYPGRTYRFYKGETVFSFGDGMS 638
Query: 624 YTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKD 683
+ H + AP +V+VPL H ++ K++ V C L + + VKN+G
Sbjct: 639 FGTVEHKIVKAPQLVSVPLAEDHEC--RSLECKSLDVADEHCQNLAFDIHLSVKNMGKMS 696
Query: 684 GAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIP 743
+H++L+F TPP H AP K L+ FEKV + ++ V + VC LSVVD G R++P
Sbjct: 697 SSHSVLLFFTPPNVHNAPQKHLLGFEKVQLAGKSEGMVRFKVDVCNDLSVVDELGNRKVP 756
Query: 744 LGEHNIHIGGTKHSVSLH 761
LG+H +H+G KHS+S+
Sbjct: 757 LGDHMLHVGNLKHSLSVR 774
>gi|297745522|emb|CBI40687.3| unnamed protein product [Vitis vinifera]
Length = 751
Score = 888 bits (2295), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 442/764 (57%), Positives = 538/764 (70%), Gaps = 33/764 (4%)
Query: 4 TIAFFFLGLI--LLSASSSGLAAREP-FACD-PKDATTRTLPFCQVSLPIPQRVNDLIGR 59
T+ FL LS+ LA P FACD + T FC SL RV DL+ R
Sbjct: 12 TVFLCFLSCFSHFLSSPKWVLAQSSPVFACDVENNPTLGQFGFCNTSLETAARVADLVKR 71
Query: 60 LSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITT 119
L+L+EK+ L++ AA+V RLGI YEWWSEALHGVS VGPGT F PGATSFPQVI T
Sbjct: 72 LTLEEKIGFLVNSAASVSRLGIPKYEWWSEALHGVSYVGPGTHFNSVVPGATSFPQVILT 131
Query: 120 ASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLS 179
A+SFNA+L+EAIG+ VS EARAMYN G AGLT+WSPNVNIFRDPRWGRGQETPGEDP+LS
Sbjct: 132 AASFNASLFEAIGKAVSTEARAMYNVGLAGLTFWSPNVNIFRDPRWGRGQETPGEDPLLS 191
Query: 180 GKYAASYVRGLQGSDG---DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTF 236
KYA+ YVRGLQ SD DRLKVAA CKH+TAYDLDNW GVDRFHFNA V+KQD++DTF
Sbjct: 192 SKYASGYVRGLQQSDDGSPDRLKVAACCKHYTAYDLDNWKGVDRFHFNAVVTKQDMDDTF 251
Query: 237 DVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYY 296
PF+ CV++G VASVMCSYNQVNG P CADP++L +RGEW+LNGYIVSDCDSV V+Y
Sbjct: 252 QPPFKSCVIDGNVASVMCSYNQVNGKPACADPDLLSGIVRGEWKLNGYIVSDCDSVDVFY 311
Query: 297 DTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMR 356
++QH+T TPEEAAA AI AGLDL+CG FLG HTE+AV+ GL+ E ++ A+ N MR
Sbjct: 312 NSQHYTKTPEEAAAKAILAGLDLNCGSFLGQHTEAAVKGGLVDESAVDKAVSNNFATLMR 371
Query: 357 LGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAV 416
LG FDG PS YG LGPKDVCT +HQELA EAARQGIVLLKN SLPLS +T+AV
Sbjct: 372 LGFFDGNPSKAIYGKLGPKDVCTSEHQELAREAARQGIVLLKNSKGSLPLSPTAIKTLAV 431
Query: 417 IGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASR 476
IGPN++VT TMIGNY G C YTTPLQG+ T + GC +VAC Q+ A +
Sbjct: 432 IGPNANVTKTMIGNYEGTPCKYTTPLQGLTALVATTYLPGCSNVACGTAQI-DEAKKIAA 490
Query: 477 QADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAF 536
ADAT+L++G+DQSIEAE DR + LPG+Q L+++V+ ASKG ILV+MSGG D++F
Sbjct: 491 AADATVLIVGIDQSIEAEGRDRVNIQLPGQQPLLITEVAKASKGNVILVVMSGGGFDISF 550
Query: 537 AKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRP 596
AKND +I +I+W GYPG+AGG AIAD++FG NP G+LPMTWYPQ Y+ +PMT M MRP
Sbjct: 551 AKNDDKITSILWVGYPGEAGGAAIADVIFGFYNPSGRLPMTWYPQSYVDKVPMTNMNMRP 610
Query: 597 SQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGK 656
+ YPGRTYRFY G +Y FG G+SYT F H H S++A
Sbjct: 611 DPASGYPGRTYRFYTGETIYTFGDGLSYTQFNH----------------HLSVDA----- 649
Query: 657 AIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAG 716
C L + + V N G+ G+HT+ +FS+PP+ H +P K L+ FEKV V A
Sbjct: 650 ----VQESCQNLVFDIHLRVNNAGNISGSHTVFLFSSPPSVHNSPQKHLLGFEKVFVTAK 705
Query: 717 AQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
A+ V + VCK LS+VD GTR++ LG H +H+G KHS+++
Sbjct: 706 AKALVRFKVDVCKDLSIVDELGTRKVALGLHVLHVGNLKHSLNV 749
>gi|292630922|sp|A5JTQ2.1|XYL1_MEDVA RecName: Full=Beta-xylosidase/alpha-L-arabinofuranosidase 1;
AltName: Full=Xylan
1,4-beta-xylosidase/Alpha-N-arabinofuranosidase 1;
Short=MsXyl1; Includes: RecName: Full=Beta-xylosidase;
AltName: Full=1,4-beta-D-xylan xylohydrolase; AltName:
Full=Xylan 1,4-beta-xylosidase; Includes: RecName:
Full=Alpha-N-arabinofuranosidase; AltName:
Full=Alpha-L-arabinofuranosidase; Short=Arabinosidase;
Flags: Precursor
gi|146762261|gb|ABQ45227.1| beta-xylosidase/alpha-L-arabinosidase [Medicago sativa subsp. x
varia]
Length = 774
Score = 888 bits (2295), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 426/738 (57%), Positives = 539/738 (73%), Gaps = 7/738 (0%)
Query: 28 FACD-PKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEW 86
FACD K+ + FC SL + RV+DL+ RL+LQEK+ L + A V RLGI YEW
Sbjct: 39 FACDVAKNTNVSSYGFCDNSLSVEDRVSDLVKRLTLQEKIGNLGNSAVEVSRLGIPKYEW 98
Query: 87 WSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGG 146
WSEALHGVSN+GPGT F PGAT+FP I TA+SFN +L++AIG VVS+EARAMYN G
Sbjct: 99 WSEALHGVSNIGPGTHFSSLVPGATNFPMPILTAASFNTSLFQAIGSVVSNEARAMYNVG 158
Query: 147 TAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD---GDRLKVAAS 203
AGLTYWSPN+NIFRDPRWGRGQETPGEDP+LS KYAA YV+GLQ +D D+LKVAA
Sbjct: 159 LAGLTYWSPNINIFRDPRWGRGQETPGEDPLLSSKYAAGYVKGLQQTDDGDSDKLKVAAC 218
Query: 204 CKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVP 263
CKH+TAYD+DNW GV R+ F+A VS+QD++DTF PF+ CV++G VASVMCSYN+VNG P
Sbjct: 219 CKHYTAYDVDNWKGVQRYTFDAVVSQQDLDDTFQPPFKSCVIDGNVASVMCSYNKVNGKP 278
Query: 264 TCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGP 323
TCADP++LK IRG+W+LNGYIVSDCDSV V Y QH+T TPEEAAA I +GLDLDCG
Sbjct: 279 TCADPDLLKGVIRGKWKLNGYIVSDCDSVEVLYKDQHYTKTPEEAAAKTILSGLDLDCGS 338
Query: 324 FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQ 383
+LG +T AV++GL+ E I NA+ N MRLG FDG+PS QPYG+LGPKDVCTP++Q
Sbjct: 339 YLGQYTGGAVKQGLVDEASITNAVSNNFATLMRLGFFDGDPSKQPYGNLGPKDVCTPENQ 398
Query: 384 ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQ 443
ELA EAARQGIVLLKN SLPLS +++AVIGPN++ T MIGNY GI C YT+PLQ
Sbjct: 399 ELAREAARQGIVLLKNSPRSLPLSSKAIKSLAVIGPNANATRVMIGNYEGIPCKYTSPLQ 458
Query: 444 GIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLL 503
G+ + T + GC DV CA+ Q+ A + ADATI+V+G + +IEAE+LDR +LL
Sbjct: 459 GLTAFVPTSYAPGCPDVQCANAQI-DDAAKIAASADATIIVVGANLAIEAESLDRVNILL 517
Query: 504 PGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADI 563
PG+QQ+LV++V+ SKGP ILV+MSGG +DV+FAK + +I +I+W GYPG+AGG AIAD+
Sbjct: 518 PGQQQQLVNEVANVSKGPVILVIMSGGGMDVSFAKTNDKITSILWVGYPGEAGGAAIADV 577
Query: 564 LFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMS 623
+FG+ NP G+LPMTWYPQ Y+ +PMT M MR + YPGRTYRFYKG V+ FG GMS
Sbjct: 578 IFGSYNPSGRLPMTWYPQSYVEKVPMTNMNMRADPATGYPGRTYRFYKGETVFSFGDGMS 637
Query: 624 YTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKD 683
+ H + AP +V+VPL H ++ K++ V C L + + VKN+G
Sbjct: 638 FGTVEHKIVKAPQLVSVPLAEDHEC--RSLECKSLDVADKHCQNLAFDIHLSVKNMGKMS 695
Query: 684 GAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIP 743
+H++L+F TPP H AP K L+ FEKV + ++ V + VC LSVVD G R++P
Sbjct: 696 SSHSVLLFFTPPNVHNAPQKHLLGFEKVQLAGKSEGMVRFKVDVCNDLSVVDELGNRKVP 755
Query: 744 LGEHNIHIGGTKHSVSLH 761
LG+H +H+G KHS+S+
Sbjct: 756 LGDHMLHVGNLKHSLSVR 773
>gi|350534908|ref|NP_001233910.1| beta-D-xylosidase 1 precursor [Solanum lycopersicum]
gi|37359706|dbj|BAC98298.1| LEXYL1 [Solanum lycopersicum]
Length = 770
Score = 883 bits (2281), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 427/770 (55%), Positives = 547/770 (71%), Gaps = 17/770 (2%)
Query: 1 MASTIAFFFLGLILLSAS-SSGLAAREP-FACD-PKDATTRTLPFCQVSLPIPQRVNDLI 57
MA F +G ILLS+ LA P FACD + L FC SL + RVNDL+
Sbjct: 6 MAYNNHFSIIGFILLSSLLKQVLAQNSPVFACDVTSNPALGNLTFCDASLAVENRVNDLV 65
Query: 58 GRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVI 117
RL+L EK+ L+SGA V RLGI YEWWSEALHGV+ GPG F PGATSFPQVI
Sbjct: 66 NRLTLGEKIGFLVSGAGGVSRLGIPKYEWWSEALHGVAYTGPGVHFTSLVPGATSFPQVI 125
Query: 118 TTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPV 177
TA+SFN TL++ IG+VVS EARAMYN G AGLTYWSPNVNIFRDPRWGRGQETPGEDP
Sbjct: 126 LTAASFNVTLFQTIGKVVSTEARAMYNVGLAGLTYWSPNVNIFRDPRWGRGQETPGEDPT 185
Query: 178 LSGKYAASYVRGLQGSDG---DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIED 234
L+ KY +YV GLQ +D ++LKVAA CKH+TAYD+DNW G++R+ FNA V +QD++D
Sbjct: 186 LTSKYGVAYVEGLQQTDDGSTNKLKVAACCKHYTAYDVDNWKGIERYSFNAVVRQQDLDD 245
Query: 235 TFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGV 294
TF PFR CV+EG VASVMCSYNQVNG PTC DPN+L +RGEW+LNGYIV+DCDS+ V
Sbjct: 246 TFQPPFRSCVLEGAVASVMCSYNQVNGKPTCGDPNLLAGIVRGEWKLNGYIVTDCDSLQV 305
Query: 295 YYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQ 354
+ +Q++T TPEEAAA + +G+DL+CG +L +T+ AV + L++E I+ A+ N
Sbjct: 306 IFKSQNYTKTPEEAAALGLNSGVDLNCGSWLSTYTQGAVNQKLVNESVIDRAISNNFATL 365
Query: 355 MRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTV 414
MRLG FDG P S+ YG+LGPKDVCTP++QELA EAARQGIVLLKN SLPL+ +++
Sbjct: 366 MRLGFFDGNPKSRIYGNLGPKDVCTPENQELAREAARQGIVLLKNTAGSLPLTPTAIKSL 425
Query: 415 AVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDA 474
AVIGPN++VT TMIGNY GI C YTTPLQG+ TI++ GC DV+C Q ID
Sbjct: 426 AVIGPNANVTKTMIGNYEGIPCKYTTPLQGLTASVATIYKPGCADVSCNTAQ-----IDD 480
Query: 475 SRQ----ADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGG 530
++Q ADA +LVMG DQSIE E+LDR + LPG+Q LV++V+ +KGP ILV+MSGG
Sbjct: 481 AKQIATTADAVVLVMGSDQSIEKESLDRTSITLPGQQSILVAEVAKVAKGPVILVIMSGG 540
Query: 531 PIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMT 590
+DV FA ++P+I +I+W G+PG+AGG A+AD++FG NP G+LPMTWYPQ Y +PMT
Sbjct: 541 GMDVQFAVDNPKITSILWVGFPGEAGGAALADVIFGYYNPSGRLPMTWYPQSYADVVPMT 600
Query: 591 EMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSIN 650
+M MRP+ + YPGRTYRFY GP V+ FGHG+SY+ F H + AP V++PL +H
Sbjct: 601 DMNMRPNPATNYPGRTYRFYTGPTVFTFGHGLSYSQFKHHLDKAPQFVSLPLGEKHTCRL 660
Query: 651 ATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEK 710
+ K + C+ + + + VKNVG G+H + +F++PP+ H AP K L+ FEK
Sbjct: 661 SKC--KTVDAVGQSCSNMGFDIHLRVKNVGKISGSHIIFLFTSPPSVHNAPKKHLLGFEK 718
Query: 711 VHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
VH+ + V N++VCK+LSV D G R++ LG H +HIG KHS+++
Sbjct: 719 VHLTPQGEGVVKFNVNVCKHLSVHDELGNRKVALGPHVLHIGDLKHSLTV 768
>gi|15242492|ref|NP_196535.1| beta-xylosidase 3 [Arabidopsis thaliana]
gi|75264323|sp|Q9LXD6.1|BXL3_ARATH RecName: Full=Beta-D-xylosidase 3; Short=AtBXL3; AltName:
Full=Alpha-L-arabinofuranosidase; Flags: Precursor
gi|7671416|emb|CAB89357.1| beta-xylosidase-like protein [Arabidopsis thaliana]
gi|9759004|dbj|BAB09531.1| beta-xylosidase [Arabidopsis thaliana]
gi|15450735|gb|AAK96639.1| AT5g09730/F17I14_80 [Arabidopsis thaliana]
gi|332004056|gb|AED91439.1| beta-xylosidase 3 [Arabidopsis thaliana]
Length = 773
Score = 881 bits (2276), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 424/766 (55%), Positives = 543/766 (70%), Gaps = 13/766 (1%)
Query: 4 TIAFFFLGLILLSASSSGLAAREPFACD-PKDATTRTLPFCQVSLPIPQRVNDLIGRLSL 62
+++ FL I+ + S + FACD + + L FC L I RV DL+GRL+L
Sbjct: 10 SVSTLFLCFIVCISEQSNNQSSPVFACDVTGNPSLAGLRFCNAGLSIKARVTDLVGRLTL 69
Query: 63 QEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASS 122
+EK+ L S A V RLGI Y+WWSEALHGVSNVG G++F G PGATSFPQVI TA+S
Sbjct: 70 EEKIGFLTSKAIGVSRLGIPSYKWWSEALHGVSNVGGGSRFTGQVPGATSFPQVILTAAS 129
Query: 123 FNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKY 182
FN +L++AIG+VVS EARAMYN G+AGLT+WSPNVNIFRDPRWGRGQETPGEDP LS KY
Sbjct: 130 FNVSLFQAIGKVVSTEARAMYNVGSAGLTFWSPNVNIFRDPRWGRGQETPGEDPTLSSKY 189
Query: 183 AASYVRGLQGSDG---DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVP 239
A +YV+GLQ +DG +RLKVAA CKH+TAYD+DNW V+R FNA V++QD+ DTF P
Sbjct: 190 AVAYVKGLQETDGGDPNRLKVAACCKHYTAYDIDNWRNVNRLTFNAVVNQQDLADTFQPP 249
Query: 240 FRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQ 299
F+ CV++G VASVMCSYNQVNG PTCADP++L IRG+W+LNGYIVSDCDSV V + Q
Sbjct: 250 FKSCVVDGHVASVMCSYNQVNGKPTCADPDLLSGVIRGQWQLNGYIVSDCDSVDVLFRKQ 309
Query: 300 HFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGM 359
H+ TPEEA A ++ AGLDL+C F G H AV+ GL++E I+ A+ N MRLG
Sbjct: 310 HYAKTPEEAVAKSLLAGLDLNCDHFNGQHAMGAVKAGLVNETAIDKAISNNFATLMRLGF 369
Query: 360 FDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGP 419
FDG+P Q YG LGPKDVCT D+QELA + ARQGIVLLKN SLPLS +T+AVIGP
Sbjct: 370 FDGDPKKQLYGGLGPKDVCTADNQELARDGARQGIVLLKNSAGSLPLSPSAIKTLAVIGP 429
Query: 420 NSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQAD 479
N++ T TMIGNY G+ C YTTPLQG+ + +Q GC +VAC D + G+A+D + AD
Sbjct: 430 NANATETMIGNYHGVPCKYTTPLQGLAETVSSTYQLGC-NVACVDADI-GSAVDLAASAD 487
Query: 480 ATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKN 539
A +LV+G DQSIE E DR L LPG+QQELV++V+MA++GP +LV+MSGG D+ FAKN
Sbjct: 488 AVVLVVGADQSIEREGHDRVDLYLPGKQQELVTRVAMAARGPVVLVIMSGGGFDITFAKN 547
Query: 540 DPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQS 599
D +I +I+W GYPG+AGG AIAD++FG NP G LPMTWYPQ Y+ +PM+ M MRP +S
Sbjct: 548 DKKITSIMWVGYPGEAGGLAIADVIFGRHNPSGNLPMTWYPQSYVEKVPMSNMNMRPDKS 607
Query: 600 KRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIK 659
K YPGR+YRFY G VY F ++YT F H + AP +V++ LD H ++ +++
Sbjct: 608 KGYPGRSYRFYTGETVYAFADALTYTKFDHQLIKAPRLVSLSLDENHPCRSSEC--QSLD 665
Query: 660 VTHAKCNRLTLG-----VQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVP 714
C G V ++VKN G + G+HT+ +F+T P H +P KQL+ FEK+ +
Sbjct: 666 AIGPHCENAVEGGSDFEVHLNVKNTGDRAGSHTVFLFTTSPQVHGSPIKQLLGFEKIRLG 725
Query: 715 AGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
+ V N++VCK LSVVD +G R+I LG H +H+G KHS+++
Sbjct: 726 KSEEAVVRFNVNVCKDLSVVDETGKRKIALGHHLLHVGSLKHSLNI 771
>gi|292630923|sp|A5JTQ3.1|XYL2_MEDVA RecName: Full=Beta-xylosidase/alpha-L-arabinofuranosidase 2;
AltName: Full=Xylan
1,4-beta-xylosidase/Alpha-N-arabinofuranosidase 2;
Short=MsXyl2; Includes: RecName: Full=Beta-xylosidase;
AltName: Full=1,4-beta-D-xylan xylohydrolase; AltName:
Full=Xylan 1,4-beta-xylosidase; Includes: RecName:
Full=Alpha-N-arabinofuranosidase; AltName:
Full=Alpha-L-arabinofuranosidase; Short=Arabinosidase;
Flags: Precursor
gi|146762263|gb|ABQ45228.1| beta-xylosidase/alpha-L-arabinosidase [Medicago sativa subsp. x
varia]
Length = 774
Score = 880 bits (2273), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 422/762 (55%), Positives = 540/762 (70%), Gaps = 15/762 (1%)
Query: 7 FFFLGLILLSASSSGLAAREPFACD-PKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEK 65
F +LLS FACD K+ FC L + RV DL+ RL+LQEK
Sbjct: 18 FVLFATLLLSGGRVSSQTSAVFACDVAKNPALANYGFCNKKLSVDARVKDLVRRLTLQEK 77
Query: 66 VKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNA 125
V L++ A V RLGI YEWWSEALHGVSN+GPGT F PGATSFP I A+SFNA
Sbjct: 78 VGNLVNSAVDVSRLGIPKYEWWSEALHGVSNIGPGTHFSNVIPGATSFPMPILIAASFNA 137
Query: 126 TLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAAS 185
+L++ IG+VVS EARAM+N G AGLTYWSPN+NIFRDPRWGRGQETPGEDP+L+ KYAA
Sbjct: 138 SLFQTIGKVVSTEARAMHNVGLAGLTYWSPNINIFRDPRWGRGQETPGEDPLLASKYAAG 197
Query: 186 YVRGLQGSD---GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRM 242
YV+GLQ +D ++LKVAA CKH+TAYD+D+W GV R+ FNA V++QD++DT+ PF+
Sbjct: 198 YVKGLQQTDDGDSNKLKVAACCKHYTAYDVDDWKGVQRYTFNAVVTQQDLDDTYQPPFKS 257
Query: 243 CVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFT 302
CV++G VASVMCSYNQVNG PTCADP++LK IRG+W+LNGYIVSDCDSV V + QH+T
Sbjct: 258 CVIDGNVASVMCSYNQVNGKPTCADPDLLKGVIRGKWKLNGYIVSDCDSVDVLFKNQHYT 317
Query: 303 STPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDG 362
TPEEAAA +I AGLDL+CG FLG +TE AV++GL+ E INNA+ N MRLG FDG
Sbjct: 318 KTPEEAAAKSILAGLDLNCGSFLGRYTEGAVKQGLIGEASINNAVYNNFATLMRLGFFDG 377
Query: 363 EPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSD 422
+PS QPYG+LGPKDVCT +QELA EAARQGIVLLKN SLPL+ +++AVIGPN++
Sbjct: 378 DPSKQPYGNLGPKDVCTSANQELAREAARQGIVLLKNCAGSLPLNAKAIKSLAVIGPNAN 437
Query: 423 VTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQ----A 478
T MIGNY GI C YT+PLQG+ T GC DV C + AA+D +++ A
Sbjct: 438 ATRAMIGNYEGIPCKYTSPLQGLTALVPTSFAAGCPDVQCTN-----AALDDAKKIAASA 492
Query: 479 DATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAK 538
DAT++V+G + +IEAE+ DR +LLPG+QQ+LV++V+ +KGP IL +MSGG +DV+FAK
Sbjct: 493 DATVIVVGANLAIEAESHDRINILLPGQQQQLVTEVANVAKGPVILAIMSGGGMDVSFAK 552
Query: 539 NDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQ 598
+ +I +I+W GYPG+AGG AIAD++FG NP G+LPMTWYPQ Y+ +PMT M MRP
Sbjct: 553 TNKKITSILWVGYPGEAGGAAIADVIFGYHNPSGRLPMTWYPQSYVDKVPMTNMNMRPDP 612
Query: 599 SKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAI 658
+ YPGRTYRFYKG V+ FG G+SY+ F H + AP +V+VPL H + + K++
Sbjct: 613 ATGYPGRTYRFYKGETVFSFGDGISYSTFEHKLVKAPQLVSVPLAEDH--VCRSSKCKSL 670
Query: 659 KVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQ 718
V C L + + +KN G + T+ +FSTPPA H AP K L+AFEKV + ++
Sbjct: 671 DVVGEHCQNLAFDIHLRIKNKGKMSSSQTVFLFSTPPAVHNAPQKHLLAFEKVLLTGKSE 730
Query: 719 QRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
V + VCK L +VD G R++ LG+H +H+G KH +S+
Sbjct: 731 ALVSFKVDVCKDLGLVDELGNRKVALGKHMLHVGDLKHPLSV 772
>gi|224070626|ref|XP_002303181.1| predicted protein [Populus trichocarpa]
gi|222840613|gb|EEE78160.1| predicted protein [Populus trichocarpa]
Length = 773
Score = 877 bits (2265), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 436/759 (57%), Positives = 546/759 (71%), Gaps = 8/759 (1%)
Query: 7 FFFLGLILLSASSSGLAAREP-FACDP-KDATTRTLPFCQVSLPIPQRVNDLIGRLSLQE 64
F + L+ S+ A P FACD + + +L FC S+ I RV DL+ RL+LQE
Sbjct: 16 FLLFCMFLVFLSTHVSAQSSPVFACDVVSNPSLASLGFCNTSIGINDRVVDLVKRLTLQE 75
Query: 65 KVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFN 124
K+ L++ A V RLGI YEWWSEALHGVS VGPGT F D GATSFPQVI TA+SFN
Sbjct: 76 KIVFLVNSAGNVSRLGIPKYEWWSEALHGVSYVGPGTHFSDDVAGATSFPQVILTAASFN 135
Query: 125 ATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAA 184
+L+EAIG+VVS EARAMYN G AGLT+WSPN+NIFRDPRWGRGQETPGEDP+LS KY +
Sbjct: 136 TSLFEAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLSSKYGS 195
Query: 185 SYVRGLQGSDG---DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFR 241
YV+GLQ D D+LKVAA CKH+TAYDLDNW G DR+HFNA V+KQD++DTF PF+
Sbjct: 196 CYVKGLQQRDDGDPDKLKVAACCKHYTAYDLDNWKGSDRYHFNAVVTKQDMDDTFQPPFK 255
Query: 242 MCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHF 301
CV++G VASVMCSYNQVNG PTCADP++L IRGEW LNGYIV+DCDS+ V+Y +Q++
Sbjct: 256 SCVIDGNVASVMCSYNQVNGKPTCADPDLLSGVIRGEWNLNGYIVTDCDSLDVFYKSQNY 315
Query: 302 TSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFD 361
T TPEEAAA AI AG+DL+CG FLG HTE+AV+ GL++E I+ A+ N MRLG FD
Sbjct: 316 TKTPEEAAAAAILAGVDLNCGSFLGQHTEAAVKGGLVNEHAIDIAVSNNFATLMRLGFFD 375
Query: 362 GEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNS 421
G+PS Q YG LGPKDVCT ++QELA EAARQGIVLLKN SLPLS + +AVIGPN+
Sbjct: 376 GDPSKQLYGKLGPKDVCTAENQELAREAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNA 435
Query: 422 DVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADAT 481
+VT TMIGNY G C YTTPLQG+ T + GC +VAC+ Q+ A + ADAT
Sbjct: 436 NVTKTMIGNYEGTPCKYTTPLQGLAASVATTYLPGCSNVACSTAQV-DDAKKLAAAADAT 494
Query: 482 ILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDP 541
+LVMG D SIEAE+ DR +LLPG+QQ L++ V+ S GP ILV+MSGG +DV+FA+ +
Sbjct: 495 VLVMGADLSIEAESRDRVDVLLPGQQQLLITAVANVSCGPVILVIMSGGGMDVSFARTND 554
Query: 542 RIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKR 601
+I +I+W GYPG+AGG AIADI+FG NP G+LPMTWYPQ Y+ +PMT M MRP S
Sbjct: 555 KITSILWVGYPGEAGGAAIADIIFGYYNPSGRLPMTWYPQSYVDKVPMTNMNMRPDPSNG 614
Query: 602 YPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVT 661
YPGRTYRFY G VY FG G+SY+ F H + AP +V VPL+ H + + +++ +
Sbjct: 615 YPGRTYRFYTGETVYSFGDGLSYSQFTHELIQAPQLVYVPLEESH--VCHSSECQSVVAS 672
Query: 662 HAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRV 721
C T + + VKN G+ G+HT+ +FS+PPA H +P K LV FEKV + A + V
Sbjct: 673 EQTCQNSTFDMLLRVKNEGTISGSHTVFLFSSPPAVHNSPQKHLVGFEKVFLNAQTGRHV 732
Query: 722 GINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
+ +CK LSVVD G++++ LGEH +H+G KH +S+
Sbjct: 733 RFKVDICKDLSVVDELGSKKVALGEHVLHVGSLKHFLSV 771
>gi|413919688|gb|AFW59620.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 773
Score = 874 bits (2259), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 430/736 (58%), Positives = 534/736 (72%), Gaps = 7/736 (0%)
Query: 28 FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
FACD +AT + FC S R DL+ RL+L EKV L+ AA+PRLG+ YEWW
Sbjct: 40 FACDASNATLASYGFCNRSAAAAARAADLVSRLTLAEKVGFLVDKQAALPRLGVPLYEWW 99
Query: 88 SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT 147
SEALHGVS VGPGT+F PGATSFPQ I TA+SFNATL+ AIG VVS+EARAM+N G
Sbjct: 100 SEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNATLFRAIGEVVSNEARAMHNVGL 159
Query: 148 AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG--SDGDRLKVAASCK 205
AGLT+WSPN+NIFRDPRWGRGQETPGEDP+L+ KYA YV GLQG S LKVAA CK
Sbjct: 160 AGLTFWSPNINIFRDPRWGRGQETPGEDPLLTSKYAVGYVTGLQGAVSGAGALKVAACCK 219
Query: 206 HFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTC 265
H+TAYD+DNW GV+R+ F+A VS+QD++DTF PF+ CV++G VASVMCSYNQVNG PTC
Sbjct: 220 HYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQPPFKSCVVDGNVASVMCSYNQVNGKPTC 279
Query: 266 ADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFL 325
AD ++L IRG+W+LNGYI SDCDSV V Y+ QH+T TPE+AAA +I+AGLDL+CG FL
Sbjct: 280 ADKDLLSGVIRGDWKLNGYISSDCDSVDVLYNNQHYTKTPEDAAAISIKAGLDLNCGTFL 339
Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
HT +AVQ G LSE D++ A+ N L MRLG FDG+P P+G+LGP DVCTP +QEL
Sbjct: 340 AQHTVAAVQAGKLSESDVDRAVTNNLVTLMRLGFFDGDPRELPFGNLGPSDVCTPSNQEL 399
Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
A EAARQGIVLLKN G LPLS +++AVIGPN++ + TMIGNY G C YTTPLQG+
Sbjct: 400 AREAARQGIVLLKNTG-KLPLSATSIKSMAVIGPNANASFTMIGNYEGTPCKYTTPLQGL 458
Query: 446 GRYARTIHQQGCKDVACADDQL-FGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLP 504
G T++Q GC +V C+ + L AA A+ AD T+LV+G DQSIE E+LDR LLLP
Sbjct: 459 GANVATVYQPGCTNVGCSGNSLQLDAATKAAASADVTVLVVGADQSIERESLDRTSLLLP 518
Query: 505 GRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADIL 564
G+Q +LVS V+ AS GP ILV+MSGGP D++FAK+ +IAAI+W GYPG+AGG AIAD+L
Sbjct: 519 GQQPQLVSAVANASSGPCILVVMSGGPFDISFAKSSDKIAAILWVGYPGEAGGAAIADVL 578
Query: 565 FGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSY 624
FG NP G+LP+TWYP+ + T +PMT+M MRP S YPGRTYRFY G VY FG G+SY
Sbjct: 579 FGYHNPSGRLPVTWYPESF-TKVPMTDMRMRPDPSTGYPGRTYRFYTGDTVYAFGDGLSY 637
Query: 625 TNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDG 684
T+F H + +AP +A+ L H + T +++ A C L V + V+N G + G
Sbjct: 638 TSFAHHLVSAPKQLALQLAEGHACL--TEQCPSVEAEGAHCEGLAFDVHLRVRNAGERSG 695
Query: 685 AHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPL 744
HT+ +FS+PPA H AP K L+ FEKV + G V + VCK LSVVD G R++ L
Sbjct: 696 GHTVFLFSSPPAVHNAPAKHLLGFEKVSLEPGQAGVVAFKVDVCKDLSVVDELGNRKVAL 755
Query: 745 GEHNIHIGGTKHSVSL 760
G H +H+G KH+++L
Sbjct: 756 GSHTLHVGDLKHTLNL 771
>gi|242077366|ref|XP_002448619.1| hypothetical protein SORBIDRAFT_06g030270 [Sorghum bicolor]
gi|241939802|gb|EES12947.1| hypothetical protein SORBIDRAFT_06g030270 [Sorghum bicolor]
Length = 767
Score = 872 bits (2253), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 431/736 (58%), Positives = 532/736 (72%), Gaps = 7/736 (0%)
Query: 28 FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
FACD +AT + FC S R DL+ RL+L EKV L+ AA+PRLGI YEWW
Sbjct: 34 FACDASNATLASYGFCNRSASASARAADLVSRLTLAEKVGFLVDKQAALPRLGIPLYEWW 93
Query: 88 SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT 147
SEALHGVS VGPGT+F P ATSFPQ I TA+SFNATL+ AIG VVS+EARAM+N G
Sbjct: 94 SEALHGVSYVGPGTRFSSLVPAATSFPQPILTAASFNATLFRAIGEVVSNEARAMHNVGL 153
Query: 148 AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQ--GSDGDRLKVAASCK 205
AGLT+WSPN+NIFRDPRWGRGQETPGEDP+L+ KYA YV GLQ GS LKVAA CK
Sbjct: 154 AGLTFWSPNINIFRDPRWGRGQETPGEDPLLTSKYAVGYVTGLQDAGSGSGSLKVAACCK 213
Query: 206 HFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTC 265
H+TAYD+DNW GV+R+ FNA VS+QD++DTF PF+ CV++G VASVMCSYNQVNG PTC
Sbjct: 214 HYTAYDVDNWKGVERYTFNAVVSQQDLDDTFQPPFKSCVVDGNVASVMCSYNQVNGKPTC 273
Query: 266 ADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFL 325
AD ++L IRG+W+LNGYI SDCDSV V Y+ QH+T TPE+AAA +I+AGLDL+CG FL
Sbjct: 274 ADKDLLSGVIRGDWKLNGYISSDCDSVDVLYNNQHYTKTPEDAAAISIKAGLDLNCGNFL 333
Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
HT +AVQ G LSE D++ A+ N MRLG FDG+P P+G+LGP DVCT +QEL
Sbjct: 334 AQHTVAAVQAGKLSESDVDRAITNNFITLMRLGFFDGDPRKLPFGNLGPSDVCTSSNQEL 393
Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
A EAARQGIVLLKN G +LPLS +++AVIGPN++ + TMIGNY G C YTTPLQG+
Sbjct: 394 AREAARQGIVLLKNSG-ALPLSASSIKSLAVIGPNANASFTMIGNYEGTPCKYTTPLQGL 452
Query: 446 GRYARTIHQQGCKDVACADDQL-FGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLP 504
G T++Q GC +V C+ + L AA A+ AD T+LV+G DQSIE E+LDR LLLP
Sbjct: 453 GANVATVYQPGCTNVGCSGNSLQLDAATKAAASADVTVLVVGADQSIERESLDRTSLLLP 512
Query: 505 GRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADIL 564
G+Q +LVS V+ AS+GP ILV+MSGGP D++FAK+ +IAAI+W GYPG+AGG AIAD+L
Sbjct: 513 GQQPQLVSAVANASRGPCILVIMSGGPFDISFAKSSDKIAAILWVGYPGEAGGAAIADVL 572
Query: 565 FGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSY 624
FG NP G+LP+TWYP+ + T +PM +M MRP S YPGRTYRFY G VY FG G+SY
Sbjct: 573 FGHHNPSGRLPVTWYPESF-TKVPMIDMRMRPDASTGYPGRTYRFYTGDTVYAFGDGLSY 631
Query: 625 TNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDG 684
T+F H + +AP VA+ L H + T +++ A C L V + V+N G G
Sbjct: 632 TSFAHHLVSAPKQVALQLAEGHTCL--TEQCPSVEAEGAHCEGLAFDVHLRVRNAGDMSG 689
Query: 685 AHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPL 744
AHT+ +FS+PPA H AP K L+ FEKV + G V + VCK LSVVD G R++ L
Sbjct: 690 AHTVFLFSSPPAVHNAPAKHLLGFEKVSLEPGQAGVVAFKVDVCKDLSVVDELGNRKVAL 749
Query: 745 GEHNIHIGGTKHSVSL 760
G H +H+G KH++SL
Sbjct: 750 GNHTLHVGDLKHTLSL 765
>gi|115460876|ref|NP_001054038.1| Os04g0640700 [Oryza sativa Japonica Group]
gi|38344900|emb|CAE02971.2| OSJNBb0079B02.3 [Oryza sativa Japonica Group]
gi|113565609|dbj|BAF15952.1| Os04g0640700 [Oryza sativa Japonica Group]
gi|116310882|emb|CAH67823.1| OSIGBa0138H21-OSIGBa0138E01.14 [Oryza sativa Indica Group]
gi|218195682|gb|EEC78109.1| hypothetical protein OsI_17615 [Oryza sativa Indica Group]
Length = 765
Score = 872 bits (2252), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 425/736 (57%), Positives = 531/736 (72%), Gaps = 6/736 (0%)
Query: 28 FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
FACD +AT FC + R DL+GRL+L EKV L++ AA+PRLGI YEWW
Sbjct: 31 FACDASNATVSGYGFCDRTKSSAARAADLLGRLTLAEKVGFLVNKQAALPRLGIPAYEWW 90
Query: 88 SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT 147
SEALHGVS VGPGT+F PGATSFPQ I TA+SFNA+L+ AIG VVS EARAM+N G
Sbjct: 91 SEALHGVSYVGPGTRFSTLVPGATSFPQPILTAASFNASLFRAIGEVVSTEARAMHNVGL 150
Query: 148 AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDG--DRLKVAASCK 205
AGLT+WSPN+NIFRDPRWGRGQETPGEDP+L+ KYA YV GLQ + G D LKVAA CK
Sbjct: 151 AGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQDAGGGSDALKVAACCK 210
Query: 206 HFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTC 265
H+TAYD+DNW GV+R+ F+A VS+QD++DTF PF+ CV++G VASVMCSYN+VNG PTC
Sbjct: 211 HYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQPPFKSCVIDGNVASVMCSYNKVNGKPTC 270
Query: 266 ADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFL 325
AD ++L IRG+W+LNGYIVSDCDSV V Y+ QH+T PE+AAA I++GLDL+CG FL
Sbjct: 271 ADKDLLSGVIRGDWKLNGYIVSDCDSVDVLYNNQHYTKNPEDAAAITIKSGLDLNCGNFL 330
Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
HT +AVQ G LSE D++ A+ N V MRLG FDG+P P+G LGPKDVCT +QEL
Sbjct: 331 AQHTVAAVQAGKLSESDVDRAITNNFIVLMRLGFFDGDPRKLPFGSLGPKDVCTSSNQEL 390
Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
A EAARQGIVLLKN G +LPLS +++AVIGPN++ + TMIGNY G C YTTPLQG+
Sbjct: 391 AREAARQGIVLLKNTG-ALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPCKYTTPLQGL 449
Query: 446 GRYARTIHQQGCKDVACADDQL-FGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLP 504
G T++Q GC +V C+ + L AA A+ AD T+LV+G DQS+E E+LDR LLLP
Sbjct: 450 GANVATVYQPGCTNVGCSGNSLQLSAATQAAASADVTVLVVGADQSVERESLDRTSLLLP 509
Query: 505 GRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADIL 564
G+Q +LVS V+ AS+GP ILV+MSGGP D++FAK+ +I+AI+W GYPG+AGG A+ADIL
Sbjct: 510 GQQPQLVSAVANASRGPVILVVMSGGPFDISFAKSSDKISAILWVGYPGEAGGAALADIL 569
Query: 565 FGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSY 624
FG NPGG+LP+TWYP + + MT+M MRP S YPGRTYRFY G VY FG G+SY
Sbjct: 570 FGYHNPGGRLPVTWYPASFADKVSMTDMRMRPDSSTGYPGRTYRFYTGDTVYAFGDGLSY 629
Query: 625 TNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDG 684
T F H++ +AP VAV L H T +++ C L+ V + V+N G G
Sbjct: 630 TKFAHSLVSAPEQVAVQLAEGHAC--HTEHCFSVEAAGEHCGSLSFDVHLRVRNAGGMAG 687
Query: 685 AHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPL 744
HT+ +FS+PP+ H AP K L+ FEKV + G V + VCK LSVVD G R++ L
Sbjct: 688 GHTVFLFSSPPSVHSAPAKHLLGFEKVSLEPGQAGVVAFKVDVCKDLSVVDELGNRKVAL 747
Query: 745 GEHNIHIGGTKHSVSL 760
G H +H+G KH+++L
Sbjct: 748 GSHTLHVGDLKHTLNL 763
>gi|297811069|ref|XP_002873418.1| beta-xylosidase 3 [Arabidopsis lyrata subsp. lyrata]
gi|297319255|gb|EFH49677.1| beta-xylosidase 3 [Arabidopsis lyrata subsp. lyrata]
Length = 780
Score = 871 bits (2251), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 431/770 (55%), Positives = 543/770 (70%), Gaps = 14/770 (1%)
Query: 4 TIAFFFLGLILLSASSSGLAAREPFACD-PKDATTRTLPFCQVSLPIPQRVNDLIGRLSL 62
+++ L +L + S + FACD + + L FC L I RV DL+GRL+L
Sbjct: 15 SVSTLLLCFLLCISEQSNAQSSPVFACDVTGNPSLAGLRFCNTGLNIKSRVTDLVGRLTL 74
Query: 63 QEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASS 122
+EK+ L S A V RLGI Y+WWSEALHGVSNVG G+ F G PGATSFPQVI TA+S
Sbjct: 75 EEKIGFLGSNAIGVSRLGIPAYKWWSEALHGVSNVGGGSSFSGQVPGATSFPQVILTAAS 134
Query: 123 FNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKY 182
FN +L++AIG+VVS EARAMYN G+AGLT+WSPNVNIFRDPRWGRGQETPGEDP LS KY
Sbjct: 135 FNVSLFQAIGKVVSTEARAMYNVGSAGLTFWSPNVNIFRDPRWGRGQETPGEDPELSSKY 194
Query: 183 AASYVRGLQGSDG---DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVP 239
A +YVRGLQ +DG +RLKVAA CKH+TAYD+DNW V RF FNA V++QD+ DTF P
Sbjct: 195 AVAYVRGLQETDGGDPNRLKVAACCKHYTAYDVDNWKDVHRFTFNAVVNQQDMADTFQPP 254
Query: 240 FRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQ 299
F+ CV++G VASVMCSYNQVNG PTCADP++L IRG+W+LNGYIVSDCDSV V Y Q
Sbjct: 255 FKSCVVDGNVASVMCSYNQVNGKPTCADPDLLSGVIRGQWKLNGYIVSDCDSVDVLYTKQ 314
Query: 300 HFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGM 359
H+T TPEEA A +I AGLDL+C F G + AV+ GL++E I+ A+ N MRLG
Sbjct: 315 HYTKTPEEAVAKSILAGLDLNCDHFTGQYAMKAVKVGLVNETAIDKAISNNFATLMRLGF 374
Query: 360 FDGEPSSQP-YGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIG 418
FDG+P Q YG LGP DVCT ++QELA +AARQGIVLLKN SLPLS +T+AVIG
Sbjct: 375 FDGDPKKQQLYGGLGPNDVCTANNQELARDAARQGIVLLKNSAGSLPLSPSAIKTLAVIG 434
Query: 419 PNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQA 478
PN++ T TMIGNY GI C YTTPLQG+ + +Q GC +VACA+ L G+A + A
Sbjct: 435 PNANATETMIGNYNGIPCKYTTPLQGLAETVSSTYQLGC-NVACAEPDL-GSAAALAASA 492
Query: 479 DATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAK 538
DA +LVMG DQSIE E LDR L LPG+QQELV++V+ +KGP +LV+MSGG D+ FAK
Sbjct: 493 DAVVLVMGADQSIEQENLDRLDLYLPGKQQELVTQVAKVAKGPVVLVIMSGGAFDITFAK 552
Query: 539 NDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQ 598
N+ +I I+W GYPG+AGG AIAD++FG NP G LPMTWYPQ Y+ +PMT M MRP +
Sbjct: 553 NEEKITGIMWVGYPGEAGGLAIADVIFGRHNPSGNLPMTWYPQSYVEKVPMTNMNMRPDK 612
Query: 599 SKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAI 658
S YPGRTYRFY G VY FG G+SYTNF H + AP +V++ LD H ++ +++
Sbjct: 613 SNGYPGRTYRFYTGETVYAFGDGLSYTNFNHQILKAPKLVSLDLDENHACRSSEC--QSV 670
Query: 659 KVTHAKCNR-----LTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHV 713
C+ L VQ+ V+NVG ++G+HT+ +F+TPP H +P K L+ FEK+ +
Sbjct: 671 DAIGPHCDNAVGGGLNFEVQLKVRNVGDREGSHTVFLFTTPPEVHGSPRKHLLGFEKIRL 730
Query: 714 PAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAA 763
+ + N+ VCK LSVVD G R+I LG + +H+G KHS+++ +
Sbjct: 731 GEKEETVIRFNVDVCKDLSVVDEIGKRKIALGHYLLHVGSFKHSLTISVS 780
>gi|9294427|dbj|BAB02547.1| beta-1,4-xylosidase [Arabidopsis thaliana]
Length = 876
Score = 848 bits (2191), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 402/767 (52%), Positives = 537/767 (70%), Gaps = 17/767 (2%)
Query: 7 FFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKV 66
F L L++++ SS +++ FACD T FC VSL R DL+ RLSL+EKV
Sbjct: 6 FVRLSLLIIALVSSLCESQKNFACDISAPATAKYGFCNVSLSYEARAKDLVSRLSLKEKV 65
Query: 67 KLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNAT 126
+ L++ A VPRLG+ YEWWSEALHGVS+VGPG F G PGATSFP I TA+SFN +
Sbjct: 66 QQLVNKATGVPRLGVPPYEWWSEALHGVSDVGPGVHFNGTVPGATSFPATILTAASFNTS 125
Query: 127 LWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASY 186
LW +G VVS EARAM+N G AGLTYWSPNVN+FRDPRWGRGQETPGEDP++ KYA +Y
Sbjct: 126 LWLKMGEVVSTEARAMHNVGLAGLTYWSPNVNVFRDPRWGRGQETPGEDPLVVSKYAVNY 185
Query: 187 VRGLQ----GSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRM 242
V+GLQ RLKV++ CKH+TAYDLDNW G+DRFHF+AKV+KQD+EDT+ PF+
Sbjct: 186 VKGLQDVHDAGKSRRLKVSSCCKHYTAYDLDNWKGIDRFHFDAKVTKQDLEDTYQTPFKS 245
Query: 243 CVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFT 302
CV EG V+SVMCSYN+VNG+PTCADPN+L+ IRG+WRL+GYIVSDCDS+ VY++ H+T
Sbjct: 246 CVEEGDVSSVMCSYNRVNGIPTCADPNLLRGVIRGQWRLDGYIVSDCDSIQVYFNDIHYT 305
Query: 303 STPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDG 362
T E+A A A++AGL+++CG FLG +TE+AV+ L+ D++ AL+ V MRLG FDG
Sbjct: 306 KTREDAVALALKAGLNMNCGDFLGKYTENAVKLKKLNGSDVDEALIYNYIVLMRLGFFDG 365
Query: 363 EPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSD 422
+P S P+G+LGP DVC+ DHQ LALEAA+QGIVLL+N+G LPL + +AVIGPN++
Sbjct: 366 DPKSLPFGNLGPSDVCSKDHQMLALEAAKQGIVLLENRG-DLPLPKTTVKKLAVIGPNAN 424
Query: 423 VTVTMIGNYAGIACGYTTPLQGIGRYA--RTIHQQGCKDVACADDQLFGAAIDASRQADA 480
T MI NYAG+ C YT+P+QG+ +Y + +++ GCKDV C D L AA+ A +AD
Sbjct: 425 ATKVMISNYAGVPCKYTSPIQGLQKYVPEKIVYEPGCKDVKCGDQTLISAAVKAVSEADV 484
Query: 481 TILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKND 540
T+LV+GLDQ++EAE LDR L LPG Q++LV V+ A+K +LV+MS GPID++FAKN
Sbjct: 485 TVLVVGLDQTVEAEGLDRVNLTLPGYQEKLVRDVANAAKKTVVLVIMSAGPIDISFAKNL 544
Query: 541 PRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSK 600
I A++W GYPG+AGG AIA ++FG NP G+LP TWYPQE+ + MT+M MRP+ +
Sbjct: 545 STIRAVLWVGYPGEAGGDAIAQVIFGDYNPSGRLPETWYPQEFADKVAMTDMNMRPNSTS 604
Query: 601 RYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKV 660
+PGR+YRFY G +Y FG+G+SY++F V +AP+++ + + ++N T S + +
Sbjct: 605 GFPGRSYRFYTGKPIYKFGYGLSYSSFSTFVLSAPSIIHIKTNPIM-NLNKTTS---VDI 660
Query: 661 THAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPP------AGHWAPHKQLVAFEKVHVP 714
+ C+ L + + + VKN G + G+H +LVF PP G P QLV FE+V V
Sbjct: 661 STVNCHDLKIRIVIGVKNHGLRSGSHVVLVFWKPPKCSKSLVGGGVPLTQLVGFERVEVG 720
Query: 715 AGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
++ ++ VCK LS+VD G R++ G H + IG H
Sbjct: 721 RSMTEKFTVDFDVCKALSLVDTHGKRKLVTGHHKLVIGSNSDQQIYH 767
>gi|15230897|ref|NP_188596.1| putative beta-D-xylosidase 5 [Arabidopsis thaliana]
gi|259585724|sp|Q9LJN4.2|BXL5_ARATH RecName: Full=Probable beta-D-xylosidase 5; Short=AtBXL5; Flags:
Precursor
gi|332642747|gb|AEE76268.1| putative beta-D-xylosidase 5 [Arabidopsis thaliana]
Length = 781
Score = 848 bits (2191), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 402/767 (52%), Positives = 537/767 (70%), Gaps = 17/767 (2%)
Query: 7 FFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKV 66
F L L++++ SS +++ FACD T FC VSL R DL+ RLSL+EKV
Sbjct: 6 FVRLSLLIIALVSSLCESQKNFACDISAPATAKYGFCNVSLSYEARAKDLVSRLSLKEKV 65
Query: 67 KLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNAT 126
+ L++ A VPRLG+ YEWWSEALHGVS+VGPG F G PGATSFP I TA+SFN +
Sbjct: 66 QQLVNKATGVPRLGVPPYEWWSEALHGVSDVGPGVHFNGTVPGATSFPATILTAASFNTS 125
Query: 127 LWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASY 186
LW +G VVS EARAM+N G AGLTYWSPNVN+FRDPRWGRGQETPGEDP++ KYA +Y
Sbjct: 126 LWLKMGEVVSTEARAMHNVGLAGLTYWSPNVNVFRDPRWGRGQETPGEDPLVVSKYAVNY 185
Query: 187 VRGLQ----GSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRM 242
V+GLQ RLKV++ CKH+TAYDLDNW G+DRFHF+AKV+KQD+EDT+ PF+
Sbjct: 186 VKGLQDVHDAGKSRRLKVSSCCKHYTAYDLDNWKGIDRFHFDAKVTKQDLEDTYQTPFKS 245
Query: 243 CVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFT 302
CV EG V+SVMCSYN+VNG+PTCADPN+L+ IRG+WRL+GYIVSDCDS+ VY++ H+T
Sbjct: 246 CVEEGDVSSVMCSYNRVNGIPTCADPNLLRGVIRGQWRLDGYIVSDCDSIQVYFNDIHYT 305
Query: 303 STPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDG 362
T E+A A A++AGL+++CG FLG +TE+AV+ L+ D++ AL+ V MRLG FDG
Sbjct: 306 KTREDAVALALKAGLNMNCGDFLGKYTENAVKLKKLNGSDVDEALIYNYIVLMRLGFFDG 365
Query: 363 EPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSD 422
+P S P+G+LGP DVC+ DHQ LALEAA+QGIVLL+N+G LPL + +AVIGPN++
Sbjct: 366 DPKSLPFGNLGPSDVCSKDHQMLALEAAKQGIVLLENRG-DLPLPKTTVKKLAVIGPNAN 424
Query: 423 VTVTMIGNYAGIACGYTTPLQGIGRYA--RTIHQQGCKDVACADDQLFGAAIDASRQADA 480
T MI NYAG+ C YT+P+QG+ +Y + +++ GCKDV C D L AA+ A +AD
Sbjct: 425 ATKVMISNYAGVPCKYTSPIQGLQKYVPEKIVYEPGCKDVKCGDQTLISAAVKAVSEADV 484
Query: 481 TILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKND 540
T+LV+GLDQ++EAE LDR L LPG Q++LV V+ A+K +LV+MS GPID++FAKN
Sbjct: 485 TVLVVGLDQTVEAEGLDRVNLTLPGYQEKLVRDVANAAKKTVVLVIMSAGPIDISFAKNL 544
Query: 541 PRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSK 600
I A++W GYPG+AGG AIA ++FG NP G+LP TWYPQE+ + MT+M MRP+ +
Sbjct: 545 STIRAVLWVGYPGEAGGDAIAQVIFGDYNPSGRLPETWYPQEFADKVAMTDMNMRPNSTS 604
Query: 601 RYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKV 660
+PGR+YRFY G +Y FG+G+SY++F V +AP+++ + + ++N T S + +
Sbjct: 605 GFPGRSYRFYTGKPIYKFGYGLSYSSFSTFVLSAPSIIHIKTNPIM-NLNKTTS---VDI 660
Query: 661 THAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPP------AGHWAPHKQLVAFEKVHVP 714
+ C+ L + + + VKN G + G+H +LVF PP G P QLV FE+V V
Sbjct: 661 STVNCHDLKIRIVIGVKNHGLRSGSHVVLVFWKPPKCSKSLVGGGVPLTQLVGFERVEVG 720
Query: 715 AGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
++ ++ VCK LS+VD G R++ G H + IG H
Sbjct: 721 RSMTEKFTVDFDVCKALSLVDTHGKRKLVTGHHKLVIGSNSDQQIYH 767
>gi|357166259|ref|XP_003580652.1| PREDICTED: beta-D-xylosidase 4-like [Brachypodium distachyon]
Length = 774
Score = 845 bits (2184), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 419/738 (56%), Positives = 526/738 (71%), Gaps = 8/738 (1%)
Query: 28 FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
FACD ++T FC + R DL+ RL+L +KV L++ A+ RLGI YEWW
Sbjct: 38 FACDAANSTVAGYAFCDRAKSASARAADLVSRLTLADKVGFLVNKQPALARLGIPAYEWW 97
Query: 88 SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT 147
SEALHGVS VGPGT+F PGATSFPQ I TA+SFNA+L+ AIG VVS+EARAM+N G
Sbjct: 98 SEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVSNEARAMHNVGL 157
Query: 148 AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQ--GSDGD-RLKVAASC 204
AGLT+WSPN+NIFRDPRWGRGQETPGEDP+L+ +YA YV GLQ G+D D LKVAA C
Sbjct: 158 AGLTFWSPNINIFRDPRWGRGQETPGEDPLLASRYAVGYVSGLQDAGADADGPLKVAACC 217
Query: 205 KHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
KH+TAYD+DNW GV+R+ F+AKVS+QD++DTF PF+ CV++GKVASVMCSYN+VNG PT
Sbjct: 218 KHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVIDGKVASVMCSYNKVNGKPT 277
Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPF 324
CAD ++L IRG+W+LNGYIVSDCDSV V Y QH+T TPEEAAA I++GLDL+CG F
Sbjct: 278 CADKDLLSGVIRGDWKLNGYIVSDCDSVDVLYSQQHYTKTPEEAAAITIKSGLDLNCGDF 337
Query: 325 LGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQE 384
L HT +AVQ G LSE D++ A+ N + MRLG FDG+P YG LGPKDVCT +QE
Sbjct: 338 LAKHTVAAVQAGNLSESDVDRAITNNFIMLMRLGFFDGDPRKLAYGSLGPKDVCTSSNQE 397
Query: 385 LALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG 444
LA E ARQGIVLLKN G +LPLS +++AVIGPN++ + TMIGNY G C YTTPL G
Sbjct: 398 LARETARQGIVLLKNDG-ALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPCKYTTPLHG 456
Query: 445 IGRYARTIHQQGCKDVACADDQL-FGAAIDASRQADATILVMGLDQSIEAEALDRAGLLL 503
+G T++Q GC +V C+ + L AA A+ AD T+LV+G DQSIE EALDR LLL
Sbjct: 457 LGNNVATVYQPGCSNVGCSGNSLQLSAATAAAASADVTVLVVGADQSIEREALDRTSLLL 516
Query: 504 PGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADI 563
PG+Q +L+S V+ ASKG ILV+MSGGP D++FAK +I+AI+W GYPG+AGG AIADI
Sbjct: 517 PGQQPDLISAVANASKGHVILVVMSGGPFDISFAKASDKISAILWVGYPGEAGGAAIADI 576
Query: 564 LFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMS 623
+FG NP G+LP+TWYP + +PMT+M MRP S YPGRTYRFY G V+ FG G+S
Sbjct: 577 IFGKYNPSGRLPVTWYPASFADKVPMTDMRMRPDNSTGYPGRTYRFYTGETVFAFGDGLS 636
Query: 624 YTNFVHT-VANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSK 682
YT H VA P+ V++ L H T +++ C + V++ V N G
Sbjct: 637 YTTMSHNLVAAPPSEVSMQLAEGHAC--HTKECASVEAAGDHCEGMAFEVRLRVHNTGEM 694
Query: 683 DGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRI 742
GAHT+L+FS+PPA H AP K L+ FEK+++ G + VCK LSVVD G R++
Sbjct: 695 AGAHTVLLFSSPPAVHNAPAKHLLGFEKLNLEPGQAGVAAFKVDVCKDLSVVDELGNRKV 754
Query: 743 PLGEHNIHIGGTKHSVSL 760
LG H +H+G KH+++L
Sbjct: 755 ALGGHTLHVGDLKHTLNL 772
>gi|326492918|dbj|BAJ90315.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 775
Score = 842 bits (2176), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 418/741 (56%), Positives = 524/741 (70%), Gaps = 10/741 (1%)
Query: 28 FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
FACD +AT FC R DL+ RL+L EKV L++ A+ RLGI YEWW
Sbjct: 37 FACDASNATLAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPALGRLGIPAYEWW 96
Query: 88 SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT 147
SEALHGVS VGPGT+F PGATSFPQ I TA+SFNA+L+ AIG VVS EARAM+N G
Sbjct: 97 SEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVSTEARAMHNVGL 156
Query: 148 AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQ-----GSDGDRLKVAA 202
AGLT+WSPN+NIFRDPRWGRGQETPGEDP+L+ KYA YV GLQ G LKVAA
Sbjct: 157 AGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQDAGAGGVTDGALKVAA 216
Query: 203 SCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGV 262
CKH+TAYD+DNW GV+R+ F+AKVS+QD++DTF PF+ CV++G VASVMCSYN+VNG
Sbjct: 217 CCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVLDGNVASVMCSYNKVNGK 276
Query: 263 PTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG 322
PTCAD ++L+ IRG+W+LNGYIVSDCDSV V Y QH+T TPEEAAA I++GLDL+CG
Sbjct: 277 PTCADKDLLEGVIRGDWKLNGYIVSDCDSVDVLYTQQHYTKTPEEAAAITIKSGLDLNCG 336
Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
FL HT +AVQ G LSE D++ A+ N + MRLG FDG+P +G LGPKDVCT +
Sbjct: 337 NFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLGFFDGDPRQLAFGSLGPKDVCTSSN 396
Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
+ELA E ARQGIVLLKN G +LPLS +++AVIGPN++ + TMIGNY G C YTTPL
Sbjct: 397 RELARETARQGIVLLKNSG-ALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPCKYTTPL 455
Query: 443 QGIGRYARTIHQQGCKDVACADDQL-FGAAIDASRQADATILVMGLDQSIEAEALDRAGL 501
QG+G T++Q GC +V C+ + L A+ A+ AD T+LV+G DQSIE E+LDR L
Sbjct: 456 QGLGAKVNTVYQPGCTNVGCSGNSLQLSTAVAAAASADVTVLVVGADQSIERESLDRTSL 515
Query: 502 LLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIA 561
LLPG+Q +LVS V+ AS GP ILV+MSGGP D++FAK +IAAI+W GYPG+AGG A+A
Sbjct: 516 LLPGQQTQLVSAVANASSGPVILVVMSGGPFDISFAKASDKIAAILWVGYPGEAGGAALA 575
Query: 562 DILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHG 621
DILFG+ NP GKLP+TWYP Y + MT+M MRP S YPGRTYRFY G V+ FG G
Sbjct: 576 DILFGSHNPSGKLPVTWYPASYADTVTMTDMRMRPDTSTGYPGRTYRFYTGDTVFAFGDG 635
Query: 622 MSYTNFVHTVANA-PTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVG 680
+SYT H++ +A P+ V++ L H + +++ C+ L V++ V+N G
Sbjct: 636 LSYTKMSHSLVSAPPSYVSMRLAEDHPCRAEECA--SVEAAGDHCDDLAFDVKLQVRNAG 693
Query: 681 SKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTR 740
GAH++L+FS+PP H AP K L+ FEKV + G V + VC+ LSVVD G R
Sbjct: 694 EVAGAHSVLLFSSPPPAHNAPAKHLLGFEKVSLAPGEAGTVAFRVDVCRDLSVVDELGGR 753
Query: 741 RIPLGEHNIHIGGTKHSVSLH 761
++ LG H +H+G KH+V L
Sbjct: 754 KVALGGHTLHVGDLKHTVELR 774
>gi|326494302|dbj|BAJ90420.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326521150|dbj|BAJ96778.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326527851|dbj|BAK08165.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 775
Score = 841 bits (2173), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 417/741 (56%), Positives = 524/741 (70%), Gaps = 10/741 (1%)
Query: 28 FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
FACD +AT FC R DL+ RL+L EKV L++ A+ RLGI YEWW
Sbjct: 37 FACDASNATLAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPALGRLGIPAYEWW 96
Query: 88 SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT 147
SEALHGVS VGPGT+F PGATSFPQ I TA+SFNA+L+ AIG VVS EARAM+N G
Sbjct: 97 SEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVSTEARAMHNVGL 156
Query: 148 AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQ-----GSDGDRLKVAA 202
AGLT+WSPN+NIFRDPRWGRGQETPGEDP+L+ KYA YV GLQ G LKVAA
Sbjct: 157 AGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQDAGAGGVTDGALKVAA 216
Query: 203 SCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGV 262
CKH+TAYD+DNW GV+R+ F+AKVS+QD++DTF PF+ CV++G VASVMCSYN+VNG
Sbjct: 217 CCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVLDGNVASVMCSYNKVNGK 276
Query: 263 PTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG 322
PTCAD ++L+ IRG+W+LNGYIVSDCDSV V Y QH+T TPEEAAA I++GLDL+CG
Sbjct: 277 PTCADKDLLEGVIRGDWKLNGYIVSDCDSVDVLYTQQHYTKTPEEAAAITIKSGLDLNCG 336
Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
FL HT +AVQ G LSE D++ A+ N + MRLG FDG+P +G LGPKDVCT +
Sbjct: 337 NFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLGFFDGDPRQLAFGSLGPKDVCTSSN 396
Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
+ELA E ARQGIVLLKN G +LPLS +++AVIGPN++ + TMIGNY G C YTTPL
Sbjct: 397 RELARETARQGIVLLKNSG-ALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPCKYTTPL 455
Query: 443 QGIGRYARTIHQQGCKDVACADDQL-FGAAIDASRQADATILVMGLDQSIEAEALDRAGL 501
QG+G T++Q GC +V C+ + L A+ A+ AD T+LV+G DQSIE E+LDR L
Sbjct: 456 QGLGAKVNTVYQPGCTNVGCSGNSLQLSTAVAAAASADVTVLVVGADQSIERESLDRTSL 515
Query: 502 LLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIA 561
LLPG+Q +LVS V+ AS GP ILV+MSGGP D++FAK +IAAI+W GYPG+AGG A+A
Sbjct: 516 LLPGQQTQLVSAVANASSGPVILVVMSGGPFDISFAKASDKIAAILWVGYPGEAGGAALA 575
Query: 562 DILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHG 621
DILFG+ NP G+LP+TWYP Y + MT+M MRP S YPGRTYRFY G V+ FG G
Sbjct: 576 DILFGSHNPSGRLPVTWYPASYADTVTMTDMRMRPDTSTGYPGRTYRFYTGDTVFAFGDG 635
Query: 622 MSYTNFVHTVANA-PTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVG 680
+SYT H++ +A P+ V++ L H + +++ C+ L V++ V+N G
Sbjct: 636 LSYTKMSHSLVSAPPSYVSMRLAEDHPCRAEECA--SVEAAGDHCDDLAFDVKLQVRNAG 693
Query: 681 SKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTR 740
GAH++L+FS+PP H AP K L+ FEKV + G V + VC+ LSVVD G R
Sbjct: 694 EVAGAHSVLLFSSPPPAHNAPAKHLLGFEKVSLAPGEAGTVAFRVDVCRDLSVVDELGGR 753
Query: 741 RIPLGEHNIHIGGTKHSVSLH 761
++ LG H +H+G KH+V L
Sbjct: 754 KVALGGHTLHVGDLKHTVELR 774
>gi|18025340|gb|AAK38481.1| alpha-L-arabinofuranosidase/beta-D-xylosidase isoenzyme ARA-I
[Hordeum vulgare]
Length = 777
Score = 834 bits (2155), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 415/741 (56%), Positives = 521/741 (70%), Gaps = 10/741 (1%)
Query: 28 FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
FACD +AT FC R DL+ RL+L EKV L++ A+ RLGI YEWW
Sbjct: 39 FACDASNATLAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPALGRLGIPAYEWW 98
Query: 88 SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT 147
SEALHGVS VGPGT+F PGATSFPQ I TA+SFNA+L+ AIG VVS EARAM+N G
Sbjct: 99 SEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVSTEARAMHNVGL 158
Query: 148 AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQ-----GSDGDRLKVAA 202
AGLT+WSPN+NIFRDPRWGRGQETPGEDP+L+ KYA YV GLQ G LKVAA
Sbjct: 159 AGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQDAGAGGVTDGALKVAA 218
Query: 203 SCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGV 262
CKH+TAYD+DNW GV+R+ F+AKVS+QD++DTF PF+ CV++G VASVMCSYN+VNG
Sbjct: 219 CCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVLDGNVASVMCSYNKVNGK 278
Query: 263 PTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG 322
PTCAD ++L+ IRG+W+LNGYIVSDCDSV V Y QH+T TPEEAAA I++G+DL+CG
Sbjct: 279 PTCADKDLLEGVIRGDWKLNGYIVSDCDSVDVLYTQQHYTKTPEEAAAITIKSGVDLNCG 338
Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
FL HT +AVQ G LSE D++ A+ N + MRLG FDG+P +G LGPKDVCT +
Sbjct: 339 NFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLGFFDGDPRQLAFGSLGPKDVCTSSN 398
Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
+ELA E ARQGIVLLKN G +LPLS +++AVIGPN++ + TMIGNY G C YTTPL
Sbjct: 399 RELARETARQGIVLLKNSG-ALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPCKYTTPL 457
Query: 443 QGIGRYARTIHQQGCKDVACADDQL-FGAAIDASRQADATILVMGLDQSIEAEALDRAGL 501
QG+G T++Q GC +V C+ + L A+ A+ AD T+LV+G DQSIE E+LDR L
Sbjct: 458 QGLGAKVNTVYQPGCTNVGCSGNSLQLSTAVAAAASADVTVLVVGADQSIERESLDRTSL 517
Query: 502 LLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIA 561
LLPG+Q +LVS V+ AS GP ILV+MSGGP D++FAK +IAA +W GYPG+AGG A+
Sbjct: 518 LLPGQQTQLVSAVANASSGPVILVVMSGGPFDISFAKASDKIAATLWVGYPGEAGGAALD 577
Query: 562 DILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHG 621
D LFG+ NP G+LP+TWYP Y + MT+M MRP S YPGRTYRFY G V+ FG G
Sbjct: 578 DTLFGSHNPSGRLPVTWYPASYADTVTMTDMRMRPDTSTGYPGRTYRFYTGDTVFAFGDG 637
Query: 622 MSYTNFVHTVANA-PTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVG 680
+SYT H++ +A P+ V++ L H + +++ C+ L L V++ V+N G
Sbjct: 638 LSYTKMSHSLVSAPPSYVSMRLAEDH--LCRAEECASVEAAGDHCDDLALDVKLQVRNAG 695
Query: 681 SKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTR 740
GAH++L+FS+PP H AP K LV FEKV + G V + VC+ LSVVD G R
Sbjct: 696 EVAGAHSVLLFSSPPPAHNAPAKHLVGFEKVSLAPGEAGTVAFRVDVCRDLSVVDELGGR 755
Query: 741 RIPLGEHNIHIGGTKHSVSLH 761
++ LG H +H G KH+V L
Sbjct: 756 KVALGGHTLHDGDLKHTVELR 776
>gi|85813772|emb|CAJ65922.1| xylan 1,4-beta-xylosidase [Populus tremula x Populus alba]
Length = 757
Score = 834 bits (2154), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 427/777 (54%), Positives = 530/777 (68%), Gaps = 60/777 (7%)
Query: 7 FFFLGLI---LLSASSSGLAAREP-FACDP-KDATTRTLPFCQVSLPIPQRVNDLIGRLS 61
F F L+ L S+S LA P FACD + + + FC SL + RV DL+ RL+
Sbjct: 16 FLFFSLVCFLLFSSSHVVLAQSSPVFACDVVSNPSLASFGFCNTSLGVSDRVVDLVKRLT 75
Query: 62 LQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTAS 121
LQEK+ L++ A +V RLGI YEWWSEALHGVS VGPGT F PGATSFPQVI TA+
Sbjct: 76 LQEKILFLVNSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSSVVPGATSFPQVILTAA 135
Query: 122 SFNATLWEAIGR----VVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPV 177
SFN +L+ AIG+ VVS EARAMYN G AGLT+WSPN+NIFRDPRWGRGQETPGEDP+
Sbjct: 136 SFNTSLFVAIGKVISQVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPL 195
Query: 178 LSGKYAASYVRGLQGSDG---DRLKVAASCKHFTAYDLDNWNGVDRFHFNA-KVSKQDIE 233
LS KY + YV+GLQ D D LKVAA CKH+TAYDLDNW GVDR+HFNA V+KQD++
Sbjct: 196 LSSKYGSGYVKGLQQRDDGNPDGLKVAACCKHYTAYDLDNWKGVDRYHFNAVVVTKQDMD 255
Query: 234 DTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNG--YIVSDCDS 291
DTF PF+ CV++G VASVMCSYN+VNG+PTCADP++L IRGEW+LNG YIV+DCDS
Sbjct: 256 DTFQPPFKSCVVDGNVASVMCSYNKVNGIPTCADPDLLSGVIRGEWKLNGYVYIVTDCDS 315
Query: 292 VGVYYDTQHFTSTPEEAAADAIRA--GLDLDCGPFLGLHTESAVQRGLLSEIDINNALVN 349
+ V+Y++QH+T TPEEAAA AI A GLDL+CG FLG HTE+AV GL++E I+ A+ N
Sbjct: 316 IDVFYNSQHYTKTPEEAAAKAILAGIGLDLNCGSFLGKHTEAAVTAGLVNESAIDRAVSN 375
Query: 350 TLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHI 409
MRLG FDG+PS Q YG LGPKDVCT ++QELA EAARQGIVLLKN
Sbjct: 376 NFATLMRLGFFDGDPSKQLYGKLGPKDVCTAENQELAREAARQGIVLLKN---------- 425
Query: 410 RHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFG 469
G C YTTPLQG+ T + GC +VAC+ Q+
Sbjct: 426 ----------------------TGTPCKYTTPLQGLAALVATTYLPGCSNVACSTAQV-D 462
Query: 470 AAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSG 529
A + ADAT+LVMG D SIEAE+ DR +LLPG+QQ L++ V+ AS GP ILV+MSG
Sbjct: 463 DAKKIAAAADATVLVMGADLSIEAESRDRVDILLPGQQQLLITAVANASTGPVILVIMSG 522
Query: 530 GPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSN------PGGKLPMTWYPQEY 583
G +DV+FAK + +I +I+W GYPG+AGG AIADI+FG+ N PGG+LPMTWYPQ Y
Sbjct: 523 GGMDVSFAKTNDKITSILWVGYPGEAGGAAIADIIFGSYNPSTHQPPGGRLPMTWYPQSY 582
Query: 584 ITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLD 643
+ +PMT M MRP S YPGRTYRFY G VY FG G+SY+ F H + AP +V+VPL+
Sbjct: 583 VDKVPMTNMNMRPDPSNGYPGRTYRFYTGETVYSFGDGLSYSEFSHELTQAPGLVSVPLE 642
Query: 644 GRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHK 703
H + + K++ C V + +KN G+ G+HT+ +FSTPP+ H +P K
Sbjct: 643 ENH--VCYSSECKSVAAAEQTCQNFD--VHLRIKNTGTTSGSHTVFLFSTPPSVHNSPQK 698
Query: 704 QLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
LV FEKV + A VG + VCK LSVVD G++++ LGEH +HIG KHS+++
Sbjct: 699 HLVGFEKVFLHAQTDSHVGFKVDVCKDLSVVDELGSKKVALGEHVLHIGSLKHSMTV 755
>gi|297834874|ref|XP_002885319.1| beta-1,4-xylosidase [Arabidopsis lyrata subsp. lyrata]
gi|297331159|gb|EFH61578.1| beta-1,4-xylosidase [Arabidopsis lyrata subsp. lyrata]
Length = 865
Score = 832 bits (2148), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 401/767 (52%), Positives = 529/767 (68%), Gaps = 30/767 (3%)
Query: 9 FLGLILLSAS--SSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKV 66
F+G+ LL A+ SS +++ FACD D T FC VSL R DL+ RLSL+EKV
Sbjct: 6 FVGVSLLIAALVSSLCESQKNFACDRNDPATAKYGFCNVSLSYEARAKDLVSRLSLKEKV 65
Query: 67 KLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNAT 126
+ L++ A V RLG+ YEWWSEALHGVS+VGPG +F G PGATSFP I TA+SFN +
Sbjct: 66 QQLVNKATGVSRLGVPPYEWWSEALHGVSDVGPGVRFNGTVPGATSFPATILTAASFNTS 125
Query: 127 LWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASY 186
LW +G VVS EARAM+N G AGLTYWSPNVNIFRDPRWGRGQETPGEDP++ KYA +Y
Sbjct: 126 LWLKMGEVVSTEARAMHNVGLAGLTYWSPNVNIFRDPRWGRGQETPGEDPLVVSKYAVNY 185
Query: 187 VRGLQ----GSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRM 242
V+GLQ RLKV++ CKH+TAYDLDNW G+DRFHF+AKV+KQD+EDT+ PF+
Sbjct: 186 VKGLQDVQDAGKSRRLKVSSCCKHYTAYDLDNWKGIDRFHFDAKVTKQDLEDTYQPPFKS 245
Query: 243 CVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFT 302
CV EG V+SVMCSYN+VNG+PTCADPN+L+ IRG+WRL+GYIVSDCDS+ VY+D H+T
Sbjct: 246 CVEEGDVSSVMCSYNRVNGIPTCADPNLLRGVIRGQWRLDGYIVSDCDSIQVYFDDIHYT 305
Query: 303 STPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDG 362
T L+++CG FLG +TE+AV+ L+ +++ AL+ V MRLG FDG
Sbjct: 306 KTR-----------LNMNCGDFLGKYTENAVKLKKLNGSEVDEALIYNYIVLMRLGFFDG 354
Query: 363 EPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSD 422
+P S P+G LGP DVC+ DHQ LALEAA+QGIVLL+N+G LPLS + +AVIGPN++
Sbjct: 355 DPKSLPFGQLGPSDVCSKDHQMLALEAAKQGIVLLENRG-DLPLSKTAVKKIAVIGPNAN 413
Query: 423 VTVTMIGNYAGIACGYTTPLQGIGRYA--RTIHQQGCKDVACADDQLFGAAIDASRQADA 480
T MI NYAG+ C YT+PLQG+ +Y + +++ GCKDV C + L AA+ A +AD
Sbjct: 414 ATKVMISNYAGVPCKYTSPLQGLQKYVPEKVVYEPGCKDVNCGEQTLISAAVKAVSEADV 473
Query: 481 TILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKND 540
T+LV+GLDQ++EAE LDR L LPG Q++LV V+ A+K +LV+MS GPID++FAKN
Sbjct: 474 TVLVVGLDQTVEAEGLDRVNLTLPGYQEKLVRDVANAAKKTVVLVIMSAGPIDISFAKNL 533
Query: 541 PRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSK 600
I+A++W GYPG+AGG AIA ++FG NP G+LP TWY QE+ + MT+M MRP+ +
Sbjct: 534 STISAVLWVGYPGEAGGDAIAQVIFGDYNPSGRLPETWYSQEFADKVAMTDMNMRPNSTS 593
Query: 601 RYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKV 660
+PGR+YRFY G +Y FG+G+SY+ F V +AP+++ + + ++N T S I +
Sbjct: 594 GFPGRSYRFYTGKPIYKFGYGLSYSAFSTFVLSAPSIIHIKTNPIL-NLNKTTS---IDI 649
Query: 661 THAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPP------AGHWAPHKQLVAFEKVHVP 714
+ C+ L + + + VKN G + G+H +LVF PP G P QLV FE+V V
Sbjct: 650 STVNCHDLKIRIVIGVKNRGQRSGSHVVLVFWKPPKCSKTLVGAGVPQTQLVGFERVEVG 709
Query: 715 AGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
++V + VCK LS+VD G R++ G H + IG H
Sbjct: 710 RSMTEKVTVEFDVCKALSLVDTHGKRKLVTGHHTLVIGSNSDQQIYH 756
>gi|357444469|ref|XP_003592512.1| Xylosidase [Medicago truncatula]
gi|355481560|gb|AES62763.1| Xylosidase [Medicago truncatula]
Length = 781
Score = 826 bits (2134), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 396/748 (52%), Positives = 527/748 (70%), Gaps = 19/748 (2%)
Query: 29 ACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWS 88
ACD T PFC SL R DL+ RL+LQEK + L++ + + RLG+ YEWWS
Sbjct: 25 ACDKGSPKTSNFPFCNTSLSYETRAKDLVSRLTLQEKAQQLVNPSTGISRLGVPAYEWWS 84
Query: 89 EALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA 148
EALHGVSNVGPGT+F PGATSFP VI +A+SFN TLW +G+VVS+EARAMYN A
Sbjct: 85 EALHGVSNVGPGTRFDSRVPGATSFPAVILSAASFNETLWYTMGQVVSNEARAMYNVDLA 144
Query: 149 GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQ------GSDGDRLKVAA 202
GLT+WSPNVN+FRDPRWGRGQETPGEDP++ +YA +YVRGLQ + GDRLKV++
Sbjct: 145 GLTFWSPNVNVFRDPRWGRGQETPGEDPLVVSRYAVNYVRGLQEVGDEASAKGDRLKVSS 204
Query: 203 SCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGV 262
CKH+TAYD+DNW GVDRFHF+AKV+KQD+EDT+ PF+ CV+EG V+SVMCSYN+VNG+
Sbjct: 205 CCKHYTAYDVDNWKGVDRFHFDAKVTKQDLEDTYQPPFKSCVLEGHVSSVMCSYNRVNGI 264
Query: 263 PTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG 322
PTCADP++L+ IRG+W L+GYIVSDCDSV VYY++ H+T TPE+A A A++AGL+++CG
Sbjct: 265 PTCADPDLLQGVIRGQWGLDGYIVSDCDSVEVYYNSIHYTKTPEDAVALALKAGLNMNCG 324
Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
FL +T +AV + ++ ALV V MRLG F+ P S P+ +LGP DVCT ++
Sbjct: 325 DFLKKYTANAVNLKKVDVSIVDQALVYNYIVLMRLGFFE-NPKSLPFANLGPSDVCTKEN 383
Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
Q+LALEAA+QGIVLL+N +LPLS + + +AVIGPN++ T MI NYAGI C Y++PL
Sbjct: 384 QQLALEAAKQGIVLLENNKGALPLSKTKIKNLAVIGPNANATTVMISNYAGIPCRYSSPL 443
Query: 443 QGIGRYARTI-HQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGL 501
QG+ +Y ++ + +GC DV C++ LF AA+ A+ ADA +LV+GLDQSIEAE LDR L
Sbjct: 444 QGLQKYISSVTYARGCSDVKCSNQNLFAAAVKAAASADAVVLVVGLDQSIEAEGLDRVNL 503
Query: 502 LLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIA 561
LPG Q++LV V+ A+KG ILV+M+ GPID++F K+ I I+W GYPGQ GG AIA
Sbjct: 504 TLPGFQEKLVKDVAAATKGTLILVIMAAGPIDISFTKSVSNIGGILWVGYPGQDGGNAIA 563
Query: 562 DILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHG 621
++FG NPGG+ P TWYPQ Y+ +PMT+M MR + S+ +PGRTYRFY G +Y FG+G
Sbjct: 564 QVIFGDYNPGGRSPFTWYPQSYVDQVPMTDMNMRANSSRNFPGRTYRFYNGKSLYEFGYG 623
Query: 622 MSYTNFVHTVANAPTVVAVPLDGRHGSI-----NATISGKAIKVTHAKCNRLTLGVQVDV 676
+SY+ F +A+AP+ + + ++ SI N + + I ++ C LT + + V
Sbjct: 624 LSYSTFSTHIASAPSTIMLQ---KNTSISKPLNNIFLDDQVIDISTISCFNLTFSLVIGV 680
Query: 677 KNVGSKDGAHTLLVFSTPPAGHW---APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSV 733
KN G DG+H +LVF PP+ P KQL+ FE+ V G + V + I +CK LS
Sbjct: 681 KNNGPFDGSHVVLVFLEPPSSEAVSGVPLKQLIGFERAQVKVGKTEFVTVKIDICKMLSN 740
Query: 734 VDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
VD G R++ +G+HNI +G + H
Sbjct: 741 VDSDGKRKLVIGQHNILVGTSSEKQVTH 768
>gi|255573163|ref|XP_002527511.1| Beta-glucosidase, putative [Ricinus communis]
gi|223533151|gb|EEF34909.1| Beta-glucosidase, putative [Ricinus communis]
Length = 810
Score = 826 bits (2134), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 408/760 (53%), Positives = 523/760 (68%), Gaps = 23/760 (3%)
Query: 24 AREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKG 83
A + FACD T FC SL R DLI RL+LQEKV+ +++ AA +PRLGI
Sbjct: 21 ASQNFACDKNSPQTNDYSFCNTSLSYQDRAKDLISRLTLQEKVQQVVNHAAGIPRLGIPA 80
Query: 84 YEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY 143
YEWWSEALHGVSNVG G +F G PGATSFP +I +A+SFN TLW +G+VVS EAR M+
Sbjct: 81 YEWWSEALHGVSNVGFGVRFNGTVPGATSFPAMILSAASFNETLWLKMGQVVSTEARTMH 140
Query: 144 NGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQ------GSDGDR 197
+ G AGLTYWSPNVN+FRDPRWGRGQETPGEDP++ +YA +YVRGLQ S D+
Sbjct: 141 SVGLAGLTYWSPNVNVFRDPRWGRGQETPGEDPLVVSRYAVNYVRGLQEVGDEGNSTADK 200
Query: 198 LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYN 257
LKV++ CKH+TAYDLD W GVDRFHF+AKV+KQD+EDT+ PFR CV E V+SVMCSYN
Sbjct: 201 LKVSSCCKHYTAYDLDKWKGVDRFHFDAKVTKQDLEDTYQPPFRSCVEEAHVSSVMCSYN 260
Query: 258 QVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGL 317
+VNG+PTCADP++LK IRGEW L+GYIVSDCDS+ VYYD+ ++T+TPE+A A A++AGL
Sbjct: 261 RVNGIPTCADPDLLKGIIRGEWNLDGYIVSDCDSIEVYYDSINYTATPEDAVALALKAGL 320
Query: 318 DLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDV 377
+++CG FLG +T AV+ + E ++ AL+ V MRLG FDG+P S +G+LGP DV
Sbjct: 321 NMNCGEFLGKYTVDAVKLNKVEESVVDQALIYNFIVLMRLGFFDGDPKSLLFGNLGPSDV 380
Query: 378 CTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACG 437
C+ HQ+LAL+AARQGIVLL N+G +LPLS R +AVIGPN++VT TMI NYAGI C
Sbjct: 381 CSDGHQKLALDAARQGIVLLYNKG-ALPLSKNNTRNLAVIGPNANVTTTMISNYAGIPCK 439
Query: 438 YTTPLQGIGRYARTI-HQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL 496
YTTPLQG+ +Y T+ + GCK V+C+DD L AA A+ ADA +L++GLDQSIE E L
Sbjct: 440 YTTPLQGLQKYVSTVTYAAGCKSVSCSDDTLIDAATQAAAAADAVVLLVGLDQSIEREGL 499
Query: 497 DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAG 556
DR L LPG Q++LV V A+ G +LV+MS PIDV+FA N +I I+W GYPGQAG
Sbjct: 500 DRENLTLPGFQEKLVVDVVNATNGTVVLVVMSSSPIDVSFAVNKSKIKGILWVGYPGQAG 559
Query: 557 GTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVY 616
G A+A ++FG NP G+ P TWYPQEY +PMT+M MR + + +PGRTYRFY G +Y
Sbjct: 560 GDAVAQVMFGDYNPAGRSPFTWYPQEYAHQVPMTDMNMRANSTANFPGRTYRFYAGNTLY 619
Query: 617 PFGHGMSYTNFVHTVANAPTVVAVPLD---------GRHGSINA--TISGKAIKVTHAKC 665
FGHG+SY+ F + + + P+ + + + H S I+ +A+ +T C
Sbjct: 620 KFGHGLSYSTFSNFIISGPSTLLLKTNSDLKPDIILSTHNSTEEHPFINSQAMDITTLNC 679
Query: 666 NRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW---APHKQLVAFEKVHVPAGAQQRVG 722
L + + V+N G G H +LVF PP A + QLV F +V V G Q V
Sbjct: 680 TNSLLSLILGVRNNGPVSGDHVVLVFWKPPNSSEVTGAANVQLVGFSRVEVNRGKTQNVT 739
Query: 723 INIHVCKYLSVVDRSGTRRIPLGEHNIHIGG-TKHSVSLH 761
+ I VCK LS+VD G R++ G+H IG T+H V H
Sbjct: 740 LEIDVCKRLSLVDSEGKRKLVTGQHIFTIGSDTEHQVRHH 779
>gi|225428983|ref|XP_002264114.1| PREDICTED: probable beta-D-xylosidase 5-like [Vitis vinifera]
Length = 818
Score = 824 bits (2129), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 401/784 (51%), Positives = 532/784 (67%), Gaps = 26/784 (3%)
Query: 8 FFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVK 67
FL L LL+ S +ACD D + PFC SLP R +DL+ RL+LQEK K
Sbjct: 35 IFLSLPLLAISQ----PTHRYACDRTDPNSSQFPFCNTSLPYQDRASDLVSRLTLQEKAK 90
Query: 68 LLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATL 127
LI+ A + RLG+ YEWWSEALHGVSN G G F P T FP VI +A+SFN +L
Sbjct: 91 QLINSATGISRLGVPDYEWWSEALHGVSNSGIGVHFHDPIPAVTIFPAVILSAASFNESL 150
Query: 128 WEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYV 187
W +G+VVS E RAMYN G AGLTYWSPNVNIFRDPRWGRGQETPGEDP++ +YA +YV
Sbjct: 151 WYTMGQVVSTEGRAMYNVGQAGLTYWSPNVNIFRDPRWGRGQETPGEDPLVVSRYAVNYV 210
Query: 188 RGLQ--GSDG----DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFR 241
RGLQ G +G DRLKV++ CKH+TAYD+D W GVDRFHF+AKV+ QD+EDT+ PF+
Sbjct: 211 RGLQEVGKEGNFAADRLKVSSCCKHYTAYDVDKWKGVDRFHFDAKVTLQDLEDTYQPPFK 270
Query: 242 MCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHF 301
CV EG V+SVMCSYN+VNGVPTCA+P +LK IR +W L+GYIVSDCDS+ VY++ ++
Sbjct: 271 SCVEEGHVSSVMCSYNRVNGVPTCANPELLKGVIRDQWGLDGYIVSDCDSIMVYHERMNY 330
Query: 302 TSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFD 361
T TPE+A A A++AGL+L+CG +LG +T++AV G + E +N AL+ V MRLG FD
Sbjct: 331 TETPEDAVALALKAGLNLNCGSYLGDYTKNAVNLGKVKESIVNQALIYNYIVLMRLGFFD 390
Query: 362 GEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNS 421
G+P+ P+G +GP DVCT DHQ LAL+AA+QGIVLL N G +LPLS +T+AVIGPN+
Sbjct: 391 GDPTMLPFGKMGPSDVCTVDHQLLALDAAKQGIVLLHNNG-ALPLSPNTTKTLAVIGPNA 449
Query: 422 DVTVTMIGNYAGIACGYTTPLQGIGRYARTI-HQQGCKDVACADDQLFGAAIDASRQADA 480
D T TM+ NYAG+ C YT+PLQG+ +Y + +++GC +V+C+++ L A + ADA
Sbjct: 450 DATNTMLSNYAGVPCRYTSPLQGLQKYVSAVSYEKGCANVSCSEETLIEGAASIASMADA 509
Query: 481 TILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKND 540
T++V+GLD IEAE LDR L LPG Q++LV + + A+ G ILV+MS GP+D++F KN
Sbjct: 510 TVVVVGLDLFIEAEDLDRVNLTLPGFQEKLVMEAAKAANGTVILVVMSAGPVDISFVKNV 569
Query: 541 PRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSK 600
+I I+W GYPGQAGG AI+ ++FG NPGG+ P TWYPQEY+ +PMT+M MRP+ +
Sbjct: 570 SKIGGILWVGYPGQAGGDAISQVIFGDYNPGGRSPFTWYPQEYVDQVPMTDMNMRPNATS 629
Query: 601 RYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPL----------DGRHGSI- 649
+PGRTYRFY G +Y FGHG+SY+ F + +APT V V L + ++
Sbjct: 630 NFPGRTYRFYTGKSLYQFGHGLSYSTFYKFIKSAPTTVLVHLLPQMDMPNIFSSNYPTMP 689
Query: 650 NATISGKAIKVTHAKCNRLT-LGVQVDVKNVGSKDGAHTLLVFSTPPAG--HWAPHKQLV 706
N +G+AI ++ C L+ + + + VKN G DG H +L F PP AP +LV
Sbjct: 690 NPNTNGQAIDISAIDCRNLSNIDIVIGVKNAGEIDGTHVVLAFWKPPRSGVRGAPGVELV 749
Query: 707 AFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLG 766
FE+V V G + VG+ + VC +S VD G R++ +G H + +G + + H
Sbjct: 750 GFERVEVKRGKTEMVGMRLDVCGKISNVDEEGKRKLVMGMHTLVVGSSSEQQARHHVNFR 809
Query: 767 VIKS 770
+ KS
Sbjct: 810 LAKS 813
>gi|195614824|gb|ACG29242.1| auxin-induced beta-glucosidase [Zea mays]
gi|413920229|gb|AFW60161.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 655
Score = 821 bits (2120), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 410/655 (62%), Positives = 492/655 (75%), Gaps = 29/655 (4%)
Query: 142 MYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQ-------GSD 194
MYNGG AGLT+WSPNVNIFRDPRWGRGQETPGEDP +S +YAA+YVRGLQ G
Sbjct: 1 MYNGGRAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVSARYAAAYVRGLQQPYAAPNGGH 60
Query: 195 GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMC 254
+RLK+AA CKHFTAYDLD W G DRFHFNA V+ QD+EDTF+VPFR CV +G+ ASVMC
Sbjct: 61 RNRLKLAACCKHFTAYDLDKWGGTDRFHFNAVVAAQDLEDTFNVPFRACVEDGRAASVMC 120
Query: 255 SYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIR 314
SYNQVNGVPTCAD L+ TIRG W L+GYIVSDCDSV V++ QH+T TPE+AAA +R
Sbjct: 121 SYNQVNGVPTCADAAFLRGTIRGRWGLDGYIVSDCDSVDVFFRDQHYTRTPEDAAAATLR 180
Query: 315 AGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGP 374
AGLDLDCGPFL L+ SAV G +++ D++ AL+NT+TVQMRLGMFDG+P++ P+G LGP
Sbjct: 181 AGLDLDCGPFLALYAGSAVAAGKVADADVDAALLNTVTVQMRLGMFDGDPAAGPFGRLGP 240
Query: 375 KDVCTPDHQELALEAARQGIVLLKNQGPS------LPLSHIRHRTVAVIGPNSDVTVTMI 428
DVCT +HQ+LAL+AARQG+VLLKN+ + LPL HR VAV+GP++D TV MI
Sbjct: 241 ADVCTREHQDLALDAARQGVVLLKNRRGARHNRDVLPLRPAAHRVVAVVGPHADATVAMI 300
Query: 429 GNYAGIACGYTTPLQGIGRY-ARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGL 487
GNYAG C YTTPLQG+ Y AR HQ GC DVAC +Q AA++A+RQADAT++V GL
Sbjct: 301 GNYAGKPCRYTTPLQGVAAYAARVAHQAGCTDVACRGNQPIAAAVEAARQADATVVVAGL 360
Query: 488 DQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAII 547
DQ +EAE LDR LLLPGRQ EL+S V+ ASKGP ILVLMSGGPID+AFA+NDPRI I+
Sbjct: 361 DQRVEAEGLDRTTLLLPGRQAELISAVAKASKGPVILVLMSGGPIDIAFAQNDPRIDGIL 420
Query: 548 WAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTY 607
W GYPGQAGG AIAD++FG NPG KLP+TWY Q+Y+ +PMT MAMR + ++ YPGRTY
Sbjct: 421 WVGYPGQAGGQAIADVIFGHHNPGAKLPVTWYHQDYLQKVPMTNMAMRANPARGYPGRTY 480
Query: 608 RFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG---------KAI 658
RFY GP +YPFGHG+SYT F HT+A+APT + V L G S + S +A+
Sbjct: 481 RFYTGPTIYPFGHGLSYTQFTHTLAHAPTQLTVRLSGSGHSAASAASLLNATLARPVRAV 540
Query: 659 KVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA------GHWAPHKQLVAFEKVH 712
+V HA+C LT+ V VDV NVG +DGAH +LV+ P+ G AP +QLVAFEKVH
Sbjct: 541 RVAHARCEGLTVPVHVDVSNVGDRDGAHAVLVYHAAPSPSHAAPGADAPARQLVAFEKVH 600
Query: 713 VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGV 767
VPAG RV + I VC LSV DR+G RR+P+GEH + IG HSVSL LGV
Sbjct: 601 VPAGGVARVEMRIGVCDRLSVADRNGVRRVPVGEHRLMIGELTHSVSLGVEQLGV 655
>gi|147844622|emb|CAN82161.1| hypothetical protein VITISV_035506 [Vitis vinifera]
Length = 925
Score = 821 bits (2120), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 399/784 (50%), Positives = 531/784 (67%), Gaps = 26/784 (3%)
Query: 8 FFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVK 67
FL L LL+ S +ACD D + PFC SLP R +DL+ RL+LQEK K
Sbjct: 11 IFLSLPLLAISQ----PTHRYACDRTDPNSSQFPFCNTSLPYQDRASDLVSRLTLQEKAK 66
Query: 68 LLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATL 127
LI+ A + RLG+ YEWWSEALHGVSN G G F P T FP VI +A+SFN +L
Sbjct: 67 QLINSATGISRLGVPDYEWWSEALHGVSNSGIGVHFHDPIPAVTIFPAVILSAASFNESL 126
Query: 128 WEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYV 187
W +G+VVS E RAMYN G AGLTYWSPNVNIFRDPRWGRGQETPGEDP++ +YA +YV
Sbjct: 127 WYTMGQVVSTEGRAMYNVGQAGLTYWSPNVNIFRDPRWGRGQETPGEDPLVVSRYAVNYV 186
Query: 188 RGLQ--GSDG----DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFR 241
RGLQ G +G DRLKV++ CKH+TAYD+D W GVDRFHF+AKV+ QD+EDT+ PF+
Sbjct: 187 RGLQEVGKEGNFAADRLKVSSCCKHYTAYDVDKWKGVDRFHFDAKVTLQDLEDTYQPPFK 246
Query: 242 MCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHF 301
CV EG V+SVMCSYN+VNGVPTCA+P +LK IR +W L+GYIVSDCDS+ VY++ ++
Sbjct: 247 XCVEEGHVSSVMCSYNRVNGVPTCANPELLKGVIRDQWGLDGYIVSDCDSIMVYHERMNY 306
Query: 302 TSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFD 361
T TPE+A A A++AGL+L+CG +LG +T++AV G + E +B AL+ V MRLG FD
Sbjct: 307 TETPEDAVALALKAGLNLNCGSYLGDYTKNAVNLGKVKESIVBQALIYNYIVLMRLGFFD 366
Query: 362 GEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNS 421
G+P+ P+G +GP DVCT DHQ LAL+AA+QGIVLL N G +LPLS +T+AVIGPN+
Sbjct: 367 GDPTMLPFGKMGPSDVCTVDHQLLALDAAKQGIVLLHNNG-ALPLSPNTTKTLAVIGPNA 425
Query: 422 DVTVTMIGNYAGIACGYTTPLQGIGRYARTI-HQQGCKDVACADDQLFGAAIDASRQADA 480
D T TM+ NYAG+ C YT+PLQG+ +Y + +++GC +V+C+++ L A + ADA
Sbjct: 426 DATNTMLSNYAGVPCRYTSPLQGLQKYVSAVSYEKGCANVSCSEETLIEGAASIASMADA 485
Query: 481 TILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKND 540
T++V+GLD IEAE LDR L LPG Q++LV + + A+ G ILV+MS GP+D++F KN
Sbjct: 486 TVVVVGLDLFIEAEDLDRVNLTLPGFQEKLVMEAAKAANGTVILVVMSAGPVDISFVKNV 545
Query: 541 PRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSK 600
+I I+W GYPGQAGG AI+ ++FG NPGG+ P TWYPQEY+ +PMT+M MRP+ +
Sbjct: 546 SKIGGILWVGYPGQAGGDAISQVIFGDYNPGGRSPFTWYPQEYVDQVPMTDMNMRPNATX 605
Query: 601 RYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPL----------DGRHGSI- 649
+PGRTYRFY G +Y FGHG+SY+ F + +AP V V L + ++
Sbjct: 606 NFPGRTYRFYTGKSLYQFGHGLSYSTFYKFIKSAPXTVLVHLLPQMDMPNIFSSNYPTMP 665
Query: 650 NATISGKAIKVTHAKCNRLT-LGVQVDVKNVGSKDGAHTLLVFSTPPAG--HWAPHKQLV 706
N +G+AI ++ C L+ + + + VKN G DG H +L F PP AP +LV
Sbjct: 666 NPNTNGQAIDISAIDCRNLSNIDIVIGVKNAGEIDGTHVVLAFWKPPRSGVRGAPGVELV 725
Query: 707 AFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLG 766
FE+V V G + VG+ + VC +S VD G R++ +G H + +G + + H
Sbjct: 726 GFERVEVKRGKTEMVGMRLDVCGKISNVDEEGKRKLVMGMHTLVVGSSSEQQARHHVNFR 785
Query: 767 VIKS 770
+ KS
Sbjct: 786 LAKS 789
>gi|326489197|dbj|BAK01582.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 709
Score = 813 bits (2101), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 404/711 (56%), Positives = 509/711 (71%), Gaps = 10/711 (1%)
Query: 58 GRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVI 117
GR +KV L++ A+ RLGI YEWWSEALHGVS VGPGT+F PGATSFPQ I
Sbjct: 1 GRGGESQKVGFLVNKQPALGRLGIPAYEWWSEALHGVSYVGPGTRFSPLVPGATSFPQPI 60
Query: 118 TTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPV 177
TA+SFNA+L+ AIG VVS EARAM+N G AGLT+WSPN+NIFRDPRWGRGQETPGEDP+
Sbjct: 61 LTAASFNASLFRAIGEVVSTEARAMHNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPL 120
Query: 178 LSGKYAASYVRGLQ-----GSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDI 232
L+ KYA YV GLQ G LKVAA CKH+TAYD+DNW GV+R+ F+AKVS+QD+
Sbjct: 121 LASKYAVGYVTGLQDAGAGGVTDGALKVAACCKHYTAYDVDNWKGVERYTFDAKVSQQDL 180
Query: 233 EDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSV 292
+DTF PF+ CV++G VASVMCSYN+VNG PTCAD ++L+ IRG+W+LNGYIVSDCDSV
Sbjct: 181 DDTFQPPFKSCVLDGNVASVMCSYNKVNGKPTCADKDLLEGVIRGDWKLNGYIVSDCDSV 240
Query: 293 GVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLT 352
V Y QH+T TPEEAAA I++GLDL+CG FL HT +AVQ G LSE D++ A+ N
Sbjct: 241 DVLYTQQHYTKTPEEAAAITIKSGLDLNCGNFLAQHTVAAVQAGELSEEDVDRAITNNFI 300
Query: 353 VQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHR 412
+ MRLG FDG+P +G LGPKDVCT ++ELA E ARQGIVLLKN G +LPLS +
Sbjct: 301 MLMRLGFFDGDPRQLAFGSLGPKDVCTSSNRELARETARQGIVLLKNSG-ALPLSAKSIK 359
Query: 413 TVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQL-FGAA 471
++AVIGPN++ + TMIGNY G C YTTPLQG+G T++Q GC +V C+ + L A
Sbjct: 360 SMAVIGPNANASFTMIGNYEGTPCKYTTPLQGLGAKVNTVYQPGCTNVGCSGNSLQLSTA 419
Query: 472 IDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGP 531
+ A+ AD T+LV+G DQSIE E+LDR LLLPG+Q +LVS V+ AS GP ILV+MSGGP
Sbjct: 420 VAAAASADVTVLVVGADQSIERESLDRTSLLLPGQQTQLVSAVANASSGPVILVVMSGGP 479
Query: 532 IDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTE 591
D++FAK +IAAI+W GYPG+AGG A+ADILFG+ NP G+LP+TWYP Y + MT+
Sbjct: 480 FDISFAKASDKIAAILWVGYPGEAGGAALADILFGSHNPSGRLPVTWYPASYADTVTMTD 539
Query: 592 MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANA-PTVVAVPLDGRHGSIN 650
M MRP S YPGRTYRFY G V+ FG G+SYT H++ +A P+ V++ L H
Sbjct: 540 MRMRPDTSTGYPGRTYRFYTGDTVFAFGDGLSYTKMSHSLVSAPPSYVSMRLAEDHPCRA 599
Query: 651 ATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEK 710
+ +++ C+ L V++ V+N G GAH++L+FS+PP H AP K L+ FEK
Sbjct: 600 EECA--SVEAAGDHCDDLAFDVKLQVRNAGEVAGAHSVLLFSSPPPAHNAPAKHLLGFEK 657
Query: 711 VHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
V + G V + VC+ LSVVD G R++ LG H +H+G KH+V L
Sbjct: 658 VSLAPGEAGTVAFRVDVCRDLSVVDELGGRKVALGGHTLHVGDLKHTVELR 708
>gi|356574315|ref|XP_003555294.1| PREDICTED: LOW QUALITY PROTEIN: probable beta-D-xylosidase 5-like
[Glycine max]
Length = 901
Score = 812 bits (2098), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 399/785 (50%), Positives = 539/785 (68%), Gaps = 33/785 (4%)
Query: 1 MASTIAFF--FLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIG 58
M + + F FL ++L +S + AC A T PFC SL R DL+
Sbjct: 1 MKNQLMFLSMFLSFLILPTAS------QKHACG--SAKTSNFPFCDTSLSYEDRAKDLVS 52
Query: 59 RLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVIT 118
RL+LQEK + L++ +A + RLG+ YEWWSEALHGVSN+GPGT+F PGATSFP VI
Sbjct: 53 RLTLQEKTQQLVNPSAGISRLGVPAYEWWSEALHGVSNLGPGTRFDKKVPGATSFPAVIL 112
Query: 119 TASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVL 178
+A+SFNA+LW+ +G+VVS EARAMYN AGLT+WSPNVN+FRDPRWGRGQETPGEDP++
Sbjct: 113 SAASFNASLWQKMGQVVSTEARAMYNVDLAGLTFWSPNVNVFRDPRWGRGQETPGEDPLV 172
Query: 179 SGKYAASYVRGLQ------GSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDI 232
+YA Y+RGLQ + DRLKV++ CKH+TAYDLDNW G+DRFHF+AKV+KQD+
Sbjct: 173 VSRYAVMYLRGLQEVEDEASAKADRLKVSSCCKHYTAYDLDNWKGIDRFHFDAKVTKQDL 232
Query: 233 EDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSV 292
ED++ PF+ CV+EG V+SVMCSYN+VNG+PTCADP++LK IRG+W L+GYIVSDCDSV
Sbjct: 233 EDSYQPPFKSCVVEGHVSSVMCSYNRVNGIPTCADPDLLKGIIRGQWGLDGYIVSDCDSV 292
Query: 293 GVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLT 352
VYY+ H+T+TPE+A A A++AGL+++CG FL +T +AV + ++ ALV
Sbjct: 293 EVYYNAIHYTATPEDAVALALKAGLNMNCGDFLKKYTANAVNLKKVDVATVDQALVYNYI 352
Query: 353 VQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHR 412
V MRLG FD +P S P+ +LGP DVCT D+Q+LAL+AA+QGIVLL+N +LPLS +
Sbjct: 353 VLMRLGFFD-DPKSLPFANLGPSDVCTKDNQQLALDAAKQGIVLLENNNGALPLSQTNIK 411
Query: 413 TVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTI-HQQGCKDVACADDQLFGAA 471
+AVIGPN++ T MI NYAGI C YT+PLQG+ +Y ++ + GC +V C + L AA
Sbjct: 412 KLAVIGPNANATTVMISNYAGIPCRYTSPLQGLQKYISSVNYAPGCSNVKCDNQSLIAAA 471
Query: 472 IDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGP 531
+ A+ ADA +LV+GLDQSIEAE LDR L LPG Q++ V V+ A+KG ILV+M+ GP
Sbjct: 472 VKAAASADAVVLVVGLDQSIEAEGLDRENLTLPGFQEKFVKDVAGATKGKVILVIMAAGP 531
Query: 532 IDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTE 591
ID++ K+ I I+W GYPGQAGG AIA ++FG NPGG+ P TWYPQ Y+ +PMT+
Sbjct: 532 IDISSTKSVSNIGGILWVGYPGQAGGDAIAQVIFGDYNPGGRSPFTWYPQSYVDQVPMTD 591
Query: 592 MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVP---LDGRHGS 648
M MR ++S+ +PGRTYRFY G +Y FGHG+SY+ F VA+AP+ + + + H
Sbjct: 592 MNMRANKSRNFPGRTYRFYNGNSLYEFGHGLSYSTFSMYVASAPSSIMIENTSISEPHNM 651
Query: 649 INATIS---------GKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW 699
+++ S G+AI ++ C LT + + VKN G +G+H +LVF P +
Sbjct: 652 LSSNNSGTQVESLSDGQAIDISTINCQDLTFLLVIGVKNNGPLNGSHVVLVFWEPATSEF 711
Query: 700 ---APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKH 756
AP KQL+ FE+V V G + V + I +C+ +S VD G R++ +G+H I +G +
Sbjct: 712 VIGAPIKQLIGFERVQVVVGVTEFVTVKIDICQLISNVDSDGKRKLVIGQHTILVGSSSE 771
Query: 757 SVSLH 761
+ H
Sbjct: 772 TQVRH 776
>gi|78482949|emb|CAJ41429.1| beta (1,4)-xylosidase [Populus tremula x Populus alba]
Length = 732
Score = 810 bits (2093), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 444/780 (56%), Positives = 534/780 (68%), Gaps = 68/780 (8%)
Query: 4 TIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQ 63
T++ FLG+ L ++ A +PFACDPKD T R LPFCQV+LPI RVNDLIGR++LQ
Sbjct: 8 TLSVLFLGVSLQTSK-----ALDPFACDPKDGTNRDLPFCQVNLPIHTRVNDLIGRMTLQ 62
Query: 64 EKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSF 123
EKV LL++ AAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGG FP ATSFPQVITTA+SF
Sbjct: 63 EKVGLLVNNAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGAFPVATSFPQVITTAASF 122
Query: 124 NATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYA 183
NATLWEAIGRVVSDEARAM+NGG AGLTYWSPNV PRWGRGQETPGEDPV+ GKYA
Sbjct: 123 NATLWEAIGRVVSDEARAMFNGGVAGLTYWSPNVTYSVYPRWGRGQETPGEDPVVVGKYA 182
Query: 184 ASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMC 243
ASYVRGLQGSDG RLKVAA CKHFTAYDLDNWNGVDRFHFNAKVSKQD+ DTFDVPFRMC
Sbjct: 183 ASYVRGLQGSDGIRLKVAACCKHFTAYDLDNWNGVDRFHFNAKVSKQDMVDTFDVPFRMC 242
Query: 244 VMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTS 303
V EGKVASVMCSYNQVNG+PTCADPN+LK+T+RG+WRLNGYIVSDCDS GVYY QHFTS
Sbjct: 243 VKEGKVASVMCSYNQVNGIPTCADPNLLKKTVRGQWRLNGYIVSDCDSFGVYYGQQHFTS 302
Query: 304 TPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGE 363
P ++ +AGLDLDCGPFL H + AV++ E +INNA + TLT Q+ LG+FDG
Sbjct: 303 -PRRSSLGCYKAGLDLDCGPFLVTHRD-AVKKA-AEEAEINNAWLKTLTFQISLGIFDGS 359
Query: 364 PSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDV 423
P Q G + P + P +Q+LA+ A ++ + + KN+ + L RH + GP +
Sbjct: 360 P-LQAVGDVVPT-MGPPTNQDLAVNAPKR-LFIFKNR--AFLLYSPRH----IFGPVALF 410
Query: 424 TVT--MIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADAT 481
M+GNY G+ C Y PLQG+ + ++ GC +V CA + G+A+D + ADA
Sbjct: 411 KSLPFMLGNYEGLPCKYLFPLQGLAGFVSLLYLPGCSNVICAVADV-GSAVDLAASADAV 469
Query: 482 ILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDP 541
+LV+G DQSIE E DR LPG+QQELV++V+MA+KGP +LV+M D+A +
Sbjct: 470 VLVVGADQSIEREGHDRVDFYLPGKQQELVTRVAMAAKGPVLLVIM-----DLAISGGG- 523
Query: 542 RIAAIIWAGYPGQAGGTAIADILFGT-------SNPGGKLPMTWYPQEYITNLPMTEMAM 594
Y Q G I+D+ G+ SN G +P Y + L T++
Sbjct: 524 -------CSY-NQVNGIPISDVCEGSSYRWPSFSNCHGYMPWISYSRAIWETLRFTKVNW 575
Query: 595 RPSQSKRYPGRTYRFYKGPVVYPFG--HGMSYTNFVHTVANAPTVV--AVPLDGRHGSIN 650
P T+ + K ++ FG H + T P + GR ++
Sbjct: 576 VP---------TWSWNK---LHKFGSHHSKCTDDGFGTPRRPPPWLRKCNHFQGRQSELH 623
Query: 651 ATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEK 710
++ LG+QVDVKN GS DG HTLLV+ PPA HWAPHKQLVAFEK
Sbjct: 624 MLDVIDSL-----------LGMQVDVKNTGSMDGTHTLLVYFRPPARHWAPHKQLVAFEK 672
Query: 711 VHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIKS 770
VHV AG QQRVGINIHVCK LSVVD SG RRIP+GEH++HIG KHSVSL A+ LGV++S
Sbjct: 673 VHVAAGTQQRVGINIHVCKSLSVVDGSGIRRIPMGEHSLHIGDVKHSVSLQASILGVVES 732
>gi|296083056|emb|CBI22460.3| unnamed protein product [Vitis vinifera]
Length = 896
Score = 803 bits (2074), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 391/772 (50%), Positives = 512/772 (66%), Gaps = 50/772 (6%)
Query: 8 FFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVK 67
FL L LL+ S +ACD D + PFC SLP R +DL+ RL+LQEK K
Sbjct: 35 IFLSLPLLAISQ----PTHRYACDRTDPNSSQFPFCNTSLPYQDRASDLVSRLTLQEKAK 90
Query: 68 LLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATL 127
LI+ A + RLG+ YEWWSEALHGVSN G G F P T FP VI +A+SFN +L
Sbjct: 91 QLINSATGISRLGVPDYEWWSEALHGVSNSGIGVHFHDPIPAVTIFPAVILSAASFNESL 150
Query: 128 WEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYV 187
W +G+VVS E RAMYN G AGLTYWSPNVNIFRDPRWGRGQETPGEDP++ +YA +YV
Sbjct: 151 WYTMGQVVSTEGRAMYNVGQAGLTYWSPNVNIFRDPRWGRGQETPGEDPLVVSRYAVNYV 210
Query: 188 RGLQ--GSDG----DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFR 241
RGLQ G +G DRLKV++ CKH+TAYD+D W GVDRFHF+AKV+ QD+EDT+ PF+
Sbjct: 211 RGLQEVGKEGNFAADRLKVSSCCKHYTAYDVDKWKGVDRFHFDAKVTLQDLEDTYQPPFK 270
Query: 242 MCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHF 301
CV EG V+SVMCSYN+VNGVPTCA+P +LK IR +W L+GYIVSDCDS+ VY++ ++
Sbjct: 271 SCVEEGHVSSVMCSYNRVNGVPTCANPELLKGVIRDQWGLDGYIVSDCDSIMVYHERMNY 330
Query: 302 TSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFD 361
T TPE+A A A++AGL+L+CG +LG +T++AV G + E +N AL+ V MRLG FD
Sbjct: 331 TETPEDAVALALKAGLNLNCGSYLGDYTKNAVNLGKVKESIVNQALIYNYIVLMRLGFFD 390
Query: 362 GEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNS 421
G+P+ P+G +GP DVCT DHQ LAL+AA+QGIVLL N G +LPLS +T+AVIGPN+
Sbjct: 391 GDPTMLPFGKMGPSDVCTVDHQLLALDAAKQGIVLLHNNG-ALPLSPNTTKTLAVIGPNA 449
Query: 422 DVTVTMIGNYAGIACGYTTPLQGIGRYARTI-HQQGCKDVACADDQLFGAAIDASRQADA 480
D T TM+ NYAG+ C YT+PLQG+ +Y + +++GC +V+C+++ L A + ADA
Sbjct: 450 DATNTMLSNYAGVPCRYTSPLQGLQKYVSAVSYEKGCANVSCSEETLIEGAASIASMADA 509
Query: 481 TILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKND 540
T++V+GLD IEAE LDR L LPG Q++LV + + A+ G ILV+MS GP+D++F KN
Sbjct: 510 TVVVVGLDLFIEAEDLDRVNLTLPGFQEKLVMEAAKAANGTVILVVMSAGPVDISFVKNV 569
Query: 541 PRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSK 600
+I I+W GYPGQAGG AI+ ++FG NPGG+ P TWYPQEY+ +PMT+M MRP+ +
Sbjct: 570 SKIGGILWVGYPGQAGGDAISQVIFGDYNPGGRSPFTWYPQEYVDQVPMTDMNMRPNATS 629
Query: 601 RYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKV 660
+PGRTYRFY G +Y FGHG+SY+ F ++N V+
Sbjct: 630 NFPGRTYRFYTGKSLYQFGHGLSYSTFYKNLSNIDIVIG--------------------- 668
Query: 661 THAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG--HWAPHKQLVAFEKVHVPAGAQ 718
VKN G DG H +L F PP AP +LV FE+V V G
Sbjct: 669 ---------------VKNAGEIDGTHVVLAFWKPPRSGVRGAPGVELVGFERVEVKRGKT 713
Query: 719 QRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIKS 770
+ VG+ + VC +S VD G R++ +G H + +G + + H + KS
Sbjct: 714 EMVGMRLDVCGKISNVDEEGKRKLVMGMHTLVVGSSSEQQARHHVNFRLAKS 765
>gi|302811516|ref|XP_002987447.1| hypothetical protein SELMODRAFT_426207 [Selaginella moellendorffii]
gi|300144853|gb|EFJ11534.1| hypothetical protein SELMODRAFT_426207 [Selaginella moellendorffii]
Length = 779
Score = 799 bits (2064), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 396/761 (52%), Positives = 512/761 (67%), Gaps = 41/761 (5%)
Query: 22 LAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGI 81
+A + P+ACD ++AT FC LP RV DLI R++LQEK+ L++ AA +PRLG+
Sbjct: 26 VAGQAPYACDQRNATLLQFGFCNTRLPTSTRVEDLISRMTLQEKIIQLVNNAAGIPRLGL 85
Query: 82 KGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA 141
YEWW EALHGV+ V PG KFGG FPGATSFP I TA+SF+A VS EARA
Sbjct: 86 PRYEWWQEALHGVA-VSPGVKFGGKFPGATSFPMPILTAASFDA---------VSTEARA 135
Query: 142 MYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD--GDRLK 199
M+N AGLTYWSPNVNI+RDPRWGRGQETPGEDP+LS KYA YVRGLQ ++ GD+LK
Sbjct: 136 MHNYQRAGLTYWSPNVNIYRDPRWGRGQETPGEDPLLSSKYATFYVRGLQDTNLGGDKLK 195
Query: 200 VAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQV 259
V+A CKH TAYD+DNW G RF FNA V++QD+ DT++ PF+ CV + KV+SVMCSYN+V
Sbjct: 196 VSACCKHMTAYDVDNWKGTTRFKFNAIVTQQDLSDTYNPPFQSCVEDAKVSSVMCSYNRV 255
Query: 260 NGVPTCADPNILKRTIRGEWRLNG----------------YIVSDCDSVGVYYDTQHFTS 303
NGVPTCAD N+L T+R W LNG YIVSDCDS+ ++D ++
Sbjct: 256 NGVPTCADYNLLSATVRSSWNLNGSILLTCEVLLLYLPCSYIVSDCDSLQTFFDNTNYAK 315
Query: 304 TPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGE 363
T E+ ADA+ AGL+LDCGPFL +HT+SA+ G ++E ++N AL VQMRLG++DG
Sbjct: 316 TAEDVVADALLAGLNLDCGPFLAIHTQSAITNGKITEANVNQALRYLYNVQMRLGLYDGN 375
Query: 364 PSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDV 423
P SQPYG+LGP+ VCT ++Q+LAL+AA++GIVLLKN G LP S RTVA IGP++
Sbjct: 376 PRSQPYGNLGPQSVCTGENQQLALDAAKEGIVLLKNNGNVLPFSKSNIRTVAAIGPHAKA 435
Query: 424 TVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATIL 483
T MIGNY GI C YTTP G+ YAR ++ GC DVAC D L G+A+ + QADA +L
Sbjct: 436 TRAMIGNYQGIPCKYTTPHDGLSAYARVVYSAGCSDVACYSDSLIGSAVSTASQADAVVL 495
Query: 484 VMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRI 543
+GLD + EAE DR LLLPG+QQELV++V+ A+KGP +LV+ SGG +DV+FAK + ++
Sbjct: 496 FVGLDLNQEAEGKDRTSLLLPGKQQELVTEVTKAAKGPAVLVIFSGGSVDVSFAKYNNKV 555
Query: 544 AAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYP 603
I+WAGYPG+AGG AIA +LFG NPGG+LP+TWYP+ + T + M +M MRP S+ YP
Sbjct: 556 QGILWAGYPGEAGGAAIAQVLFGDHNPGGRLPVTWYPESF-TGITMLDMNMRPDASRGYP 614
Query: 604 GRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHA 663
GRTYRFY G VY FG+G +Y+ H AP + P ++ + G + H
Sbjct: 615 GRTYRFYTGQSVYNFGYGKTYSKLSHKFKEAPLSLGFP---EAAAVKRSCDGN-LTCFHL 670
Query: 664 K------CNRLTLGVQVDVKNVGSKDGAHTLLVFSTPP-AGH-WAPHKQLVAFEKVHVPA 715
C+ LT V++ V N G + +L++S+PP AG AP +QL F KV V
Sbjct: 671 NAHDEITCSTLTSKVRILVHNKGDRPSNRAVLLYSSPPNAGRDGAPIRQLAGFGKVSVAP 730
Query: 716 GAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKH 756
GA + V I I CK+LS +G R + G H + +G +H
Sbjct: 731 GAVENVEIEIDPCKHLSHAGANGVRILHGGIHTLAVGNARH 771
>gi|302796585|ref|XP_002980054.1| hypothetical protein SELMODRAFT_419541 [Selaginella moellendorffii]
gi|300152281|gb|EFJ18924.1| hypothetical protein SELMODRAFT_419541 [Selaginella moellendorffii]
Length = 779
Score = 797 bits (2058), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 395/765 (51%), Positives = 513/765 (67%), Gaps = 41/765 (5%)
Query: 22 LAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGI 81
+A + P+ACD ++AT FC LP RV DLI R++LQEK+ L++ AA +PRLG+
Sbjct: 26 VAGQAPYACDQRNATLLQFGFCNTRLPTSTRVEDLISRMTLQEKIIQLVNNAAGIPRLGL 85
Query: 82 KGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA 141
YEWW EALHGV+ V PG KFGG FPGATSFP I TA+SF+A VS EARA
Sbjct: 86 PRYEWWQEALHGVA-VSPGVKFGGKFPGATSFPMPILTAASFDA---------VSTEARA 135
Query: 142 MYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD--GDRLK 199
M+N AGLTYWSPNVNI+RDPRWGRGQETPGEDP+LS KYA YVRGLQ ++ GD+LK
Sbjct: 136 MHNYQRAGLTYWSPNVNIYRDPRWGRGQETPGEDPLLSSKYATFYVRGLQDTNLGGDKLK 195
Query: 200 VAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQV 259
V+A CKH TAYD+DNW G RF FNA V++QD+ DT++ PF+ CV + KV+SVMCSYN+V
Sbjct: 196 VSACCKHMTAYDVDNWKGTTRFKFNAIVTQQDLSDTYNPPFQSCVEDAKVSSVMCSYNRV 255
Query: 260 NGVPTCADPNILKRTIRGEWRLNG----------------YIVSDCDSVGVYYDTQHFTS 303
NGVPTCAD N+L T+R W LNG YIVSDCDS+ ++D ++
Sbjct: 256 NGVPTCADYNLLSATVRSSWNLNGSILLTCEVLLLYLPCSYIVSDCDSLQTFFDNTNYAK 315
Query: 304 TPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGE 363
T E+ ADA+ AGL+LDCGPFL +HT+SA+ G ++E ++N AL VQMRLG++DG
Sbjct: 316 TAEDVVADALLAGLNLDCGPFLAIHTQSAITNGKITEANVNQALRYLYNVQMRLGLYDGN 375
Query: 364 PSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDV 423
P SQPYG+LGP+ VCT ++Q+LAL+AA++GIVLLKN G LP S RTVA IGP++
Sbjct: 376 PRSQPYGNLGPQSVCTGENQQLALDAAKEGIVLLKNNGNVLPFSKSNIRTVAAIGPHAKA 435
Query: 424 TVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATIL 483
T MIGNY GI C YTTP G+ YAR ++ GC DVAC + L G+A + QADA +L
Sbjct: 436 TRAMIGNYQGIPCKYTTPHDGLSAYARVVYSAGCSDVACYSNSLIGSAASTASQADAVVL 495
Query: 484 VMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRI 543
+GLD + EAE DR LLLPG+QQELV++V+ A+KGP +LV+ SGG +DV+FAK D ++
Sbjct: 496 FVGLDLNQEAEGKDRTSLLLPGKQQELVTEVTKAAKGPVVLVIFSGGSVDVSFAKYDKKV 555
Query: 544 AAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYP 603
++WAGYPG+AGG AIA +LFG NPGG+LP+TWYP+ + T + M +M MRP S+ YP
Sbjct: 556 QGMLWAGYPGEAGGAAIAQVLFGDHNPGGRLPVTWYPESF-TGITMLDMNMRPDASRGYP 614
Query: 604 GRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHA 663
GRTYRFY G VY FG+G +Y+ H AP + P ++ + G + H
Sbjct: 615 GRTYRFYTGQSVYNFGYGKTYSKLSHKFKEAPLSLGFP---EAAAVKRSCDGN-LTCFHL 670
Query: 664 K------CNRLTLGVQVDVKNVGSKDGAHTLLVFSTPP-AGH-WAPHKQLVAFEKVHVPA 715
C+ LT V++ V N G + +L++S+PP AG AP +QL F KV V
Sbjct: 671 NAHDEITCSTLTSKVRILVHNEGDRPSNRAVLLYSSPPNAGRDGAPIRQLAGFGKVSVAP 730
Query: 716 GAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
GA + V I I CK+LS +G R + G H + +G +H + +
Sbjct: 731 GAVENVEIEIDPCKHLSHAGANGVRILHGGIHTLAVGNARHPLPI 775
>gi|77552476|gb|ABA95273.1| Beta-D-xylosidase, putative, expressed [Oryza sativa Japonica
Group]
Length = 883
Score = 795 bits (2052), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 418/657 (63%), Positives = 502/657 (76%), Gaps = 22/657 (3%)
Query: 133 RVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG 192
+ VSDE RAMYNGG AGLT+WSPNVNIFRDPRWGRGQETPGEDP ++ +YAA+YVRGLQ
Sbjct: 227 QAVSDEGRAMYNGGQAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVAARYAAAYVRGLQQ 286
Query: 193 SD--GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVA 250
RLK+AA CKHFTAYDLDNW+G DRFHFNA V++QD+EDTF+VPFR CV++G+ A
Sbjct: 287 QQPSSGRLKLAACCKHFTAYDLDNWSGTDRFHFNAVVTRQDLEDTFNVPFRSCVVDGRAA 346
Query: 251 SVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAA 310
SVMCSYNQVNGVPTCAD L+ TIR W L GYIVSDCDSV V+Y QH+T T E+A A
Sbjct: 347 SVMCSYNQVNGVPTCADAAFLRGTIRRRWGLAGYIVSDCDSVDVFYSDQHYTRTREDAVA 406
Query: 311 DAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYG 370
+RAGLDLDCGPFL +TE AV +G + + DI+ A+ NT+TVQMRLGMFDG+P++QP+G
Sbjct: 407 ATLRAGLDLDCGPFLAQYTEGAVAQGKVGDGDIDAAVTNTVTVQMRLGMFDGDPAAQPFG 466
Query: 371 HLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS-HIRHRTVAVIGPNSDVTVTMIG 429
HLGP+ VCT HQELA+EAARQGIVLLKN G +LPLS R VAV+GP+++ TV MIG
Sbjct: 467 HLGPQHVCTAAHQELAVEAARQGIVLLKNDGRALPLSPATARRAVAVVGPHAEATVAMIG 526
Query: 430 NYAGIACGYTTPLQGIGRY-ARTIHQQGCKDVACA-DDQLFGAAIDASRQADATILVMGL 487
NYAG C YTTPLQG+ RY AR HQ GC DVACA Q AA+DA+R+ADATI+V GL
Sbjct: 527 NYAGKPCRYTTPLQGVARYAARAAHQPGCTDVACAGSGQPIAAAVDAARRADATIVVAGL 586
Query: 488 DQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAII 547
DQ IEAE LDRA LLLPGRQ EL+S V+ ASKGP ILVLMSGGPID+ FA+NDP+IA I+
Sbjct: 587 DQKIEAEGLDRASLLLPGRQAELISSVAKASKGPVILVLMSGGPIDIGFAQNDPKIAGIL 646
Query: 548 WAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTY 607
WAGYPGQAGG AIAD++FG NPGGKLP+TWYPQ+Y+ +PMT MAMR + +K YPGRTY
Sbjct: 647 WAGYPGQAGGQAIADVIFGHHNPGGKLPVTWYPQDYLQKVPMTNMAMRANPAKGYPGRTY 706
Query: 608 RFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPL------DGRHGSINATI---SGKAI 658
RFY GP ++PFGHG+SYT+F H++A+AP+ + V L S+NAT A+
Sbjct: 707 RFYTGPTIHPFGHGLSYTSFTHSIAHAPSQLTVRLSAHHAAASASASLNATARLSRAAAV 766
Query: 659 KVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPH--------KQLVAFEK 710
+V HA+C L + V VDV+NVG +DGAHT+LV++ PA A +QLVAFEK
Sbjct: 767 RVAHARCEELRMPVHVDVRNVGERDGAHTVLVYAAAPASSAAEAAAGHGAPVRQLVAFEK 826
Query: 711 VHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGV 767
VHV AG RV + I VC LSV DR+G RRIP+GEH + IG H+V++ LGV
Sbjct: 827 VHVGAGGTARVEMGIDVCDGLSVADRNGVRRIPVGEHRLIIGELTHTVTIALEQLGV 883
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 67/98 (68%), Positives = 79/98 (80%)
Query: 39 TLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVG 98
TLPFC+ SLP R DL+ RL+ EKV+LL++ AA VPRLG+ GYEWWSEALHGVS+ G
Sbjct: 40 TLPFCRRSLPARARARDLVARLTRAEKVRLLVNNAAGVPRLGVAGYEWWSEALHGVSDTG 99
Query: 99 PGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVS 136
PG +FGG FPGAT+FPQVI TA+SFNATLWE IG+ S
Sbjct: 100 PGVRFGGAFPGATAFPQVIGTAASFNATLWELIGQFRS 137
>gi|125535275|gb|EAY81823.1| hypothetical protein OsI_36995 [Oryza sativa Indica Group]
Length = 885
Score = 794 bits (2050), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 418/659 (63%), Positives = 502/659 (76%), Gaps = 24/659 (3%)
Query: 133 RVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG 192
+ VSDE RAMYNGG AGLT+WSPNVNIFRDPRWGRGQETPGEDP ++ +YAA+YVRGLQ
Sbjct: 227 QAVSDEGRAMYNGGQAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVAARYAAAYVRGLQQ 286
Query: 193 SD--GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVA 250
RLK+AA CKHFTAYDLDNW+G DRFHFNA V++QD+EDTF+VPFR CV++G+ A
Sbjct: 287 QQPSSGRLKLAACCKHFTAYDLDNWSGTDRFHFNAVVTRQDLEDTFNVPFRSCVVDGRAA 346
Query: 251 SVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAA 310
SVMCSYNQVNGVPTCAD L+ TIR W L GYIVSDCDSV V+Y QH+T T E+A A
Sbjct: 347 SVMCSYNQVNGVPTCADAAFLRGTIRRRWGLAGYIVSDCDSVDVFYSDQHYTRTREDAVA 406
Query: 311 DAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYG 370
+RAGLDLDCGPFL +TE AV +G + + DI+ A+ NT+TVQMRLGMFDG+P++QP+G
Sbjct: 407 ATLRAGLDLDCGPFLAQYTEGAVAQGKVGDGDIDAAVTNTVTVQMRLGMFDGDPAAQPFG 466
Query: 371 HLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS-HIRHRTVAVIGPNSDVTVTMIG 429
HLGP+ VCT HQELA+EAARQGIVLLKN G +LPLS R VAV+GP+++ TV MIG
Sbjct: 467 HLGPQHVCTAAHQELAVEAARQGIVLLKNDGRALPLSPATARRAVAVVGPHAEATVAMIG 526
Query: 430 NYAGIACGYTTPLQGIGRY-ARTIHQQGCKDVACA-DDQLFGAAIDASRQADATILVMGL 487
NYAG C YTTPLQG+ RY AR HQ GC DVACA Q AA+DA+R+ADATI+V GL
Sbjct: 527 NYAGKPCRYTTPLQGVARYAARAAHQPGCTDVACAGSGQPIAAAVDAARRADATIVVAGL 586
Query: 488 DQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAII 547
DQ IEAE LDRA LLLPGRQ EL+S V+ ASKGP ILVLMSGGPID+ FA+NDP+IA I+
Sbjct: 587 DQKIEAEGLDRASLLLPGRQAELISSVAKASKGPVILVLMSGGPIDIGFAQNDPKIAGIL 646
Query: 548 WAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTY 607
WAGYPGQAGG AIAD++FG NPGGKLP+TWYPQ+Y+ +PMT MAMR + +K YPGRTY
Sbjct: 647 WAGYPGQAGGQAIADVIFGHHNPGGKLPVTWYPQDYLQKVPMTNMAMRANPAKGYPGRTY 706
Query: 608 RFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPL--------DGRHGSINATI---SGK 656
RFY GP ++PFGHG+SYT+F H++A+AP+ + V L S+NAT
Sbjct: 707 RFYTGPTIHPFGHGLSYTSFTHSIAHAPSQLTVRLAAHHAAASASASASLNATARLSRAA 766
Query: 657 AIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPH--------KQLVAF 708
A++V HA+C L + V VDV+NVG +DGAHT+LV++ PA A +QLVAF
Sbjct: 767 AVRVAHARCEELRMPVHVDVRNVGERDGAHTVLVYAAAPASSAAEAAAGHGAPVRQLVAF 826
Query: 709 EKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGV 767
EKVHV AG RV + I VC LSV DR+G RRIP+GEH + IG H+V++ LGV
Sbjct: 827 EKVHVGAGGTARVEMGIDVCDGLSVADRNGVRRIPVGEHRLIIGELTHTVTIALEQLGV 885
Score = 147 bits (370), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 66/98 (67%), Positives = 79/98 (80%)
Query: 39 TLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVG 98
TLPFC+ SLP R DL+ R++ EKV+LL++ AA VPRLG+ GYEWWSEALHGVS+ G
Sbjct: 40 TLPFCRRSLPARARARDLVARMTRAEKVRLLVNNAAGVPRLGVAGYEWWSEALHGVSDTG 99
Query: 99 PGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVS 136
PG +FGG FPGAT+FPQVI TA+SFNATLWE IG+ S
Sbjct: 100 PGVRFGGAFPGATAFPQVIGTAASFNATLWELIGQFRS 137
>gi|302791321|ref|XP_002977427.1| hypothetical protein SELMODRAFT_106899 [Selaginella moellendorffii]
gi|300154797|gb|EFJ21431.1| hypothetical protein SELMODRAFT_106899 [Selaginella moellendorffii]
Length = 772
Score = 794 bits (2050), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 380/777 (48%), Positives = 518/777 (66%), Gaps = 36/777 (4%)
Query: 7 FFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKV 66
FF + A S G A AC D+ + + PFC VSLP+P RV DL+GR++L EK+
Sbjct: 5 LFFFSIAWAPAESRGGAGA---AC--TDSRSSSFPFCDVSLPVPDRVADLVGRMNLSEKI 59
Query: 67 KLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNAT 126
++S A+ +PRLGI GY+WW EALHGV+ PG KF P ATSFPQVI T +SFN++
Sbjct: 60 AQIVSNASGIPRLGIPGYQWWEEALHGVAE-SPGVKFAAPVPSATSFPQVILTVASFNSS 118
Query: 127 LWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASY 186
LW I + +S EA AMYN G +GLT+WSPN+NIFRDPRWGRGQETPGEDP+LS KYAA +
Sbjct: 119 LWNKIAQAISIEAIAMYNAGRSGLTFWSPNINIFRDPRWGRGQETPGEDPLLSSKYAAYF 178
Query: 187 VRGLQGSDGD-------------RLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIE 233
VRGLQ D D RLKV++ CKHFTAYD++ G D FHFNA+V+ QD++
Sbjct: 179 VRGLQEGDYDEGTAISTMQGSPTRLKVSSCCKHFTAYDMEKSEGTDCFHFNAQVTVQDLQ 238
Query: 234 DTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVG 293
DTFD PFR C+++G+ + +MCSYN+VNGVP+CAD L T+R W GYIVSDCD+V
Sbjct: 239 DTFDPPFRSCIVDGQASGLMCSYNRVNGVPSCADYTFLTETVRNSWGFEGYIVSDCDAVA 298
Query: 294 VYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTV 353
+ Y+ ++T+T E+A AD + AG+DL+CG FL HT +A+++G ++E ++ AL N +TV
Sbjct: 299 LLYEYINYTTTAEDAVADVLSAGMDLNCGTFLLRHTAAAIEQGKVTEAAVDRALSNVMTV 358
Query: 354 QMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRT 413
+MRLG+FDG S + Y +GP VCTP+H++L+LEAA QGIVLLKN G LP T
Sbjct: 359 RMRLGLFDGN-SGETYNSIGPDAVCTPEHRQLSLEAAEQGIVLLKNSGNVLPFPRNDLMT 417
Query: 414 VAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYAR-TIHQQGCKDVACADDQLFGAAI 472
+AVIGP+ + T TM+GNYAG+ C Y TP QG+ Y + + + GCKD+ C D LF AA+
Sbjct: 418 IAVIGPSGNATETMLGNYAGVPCQYITPFQGLQEYTKCVVFEPGCKDIMCNDTTLFLAAV 477
Query: 473 DASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPI 532
A+ +DA ++V+GLD+ E E LDR LLLPG QQ LV +VS +KGP ILV+MSGGPI
Sbjct: 478 RAAENSDAVVIVVGLDKDQEREGLDRTSLLLPGNQQGLVLEVSKVAKGPVILVVMSGGPI 537
Query: 533 DVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEM 592
DV FAK + +I+ ++W GYPG+AGG AIA ++FG NP G+LPMTWYPQ + ++ + M
Sbjct: 538 DVTFAKENCKISNVLWVGYPGEAGGKAIARVIFGDHNPAGRLPMTWYPQAFAEHVSILNM 597
Query: 593 AMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT------VVAVPLDGRH 646
+RP+ S +PGRTYRFY G VY FGHG+SYTNF +T AP+ VA+ R
Sbjct: 598 HLRPNTSTGFPGRTYRFYTGENVYEFGHGLSYTNFTYTNFCAPSNITARNTVAIRTPLRE 657
Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA---PHK 703
+ + + C L V + N G++D H L+++ PPA + P K
Sbjct: 658 D------GARQFPIDYTGCEALAFKVVAYISNTGTRDSDHISLLYAIPPAASSSLSPPRK 711
Query: 704 QLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
QL++F++ H+ AG +V ++ CK L + + +GT+ + G++ + +G +H +SL
Sbjct: 712 QLISFKRQHLIAGRCAKVEFDVDTCKDLGLTNEAGTKVLVHGDYKLSLGDIEHVISL 768
>gi|302786474|ref|XP_002975008.1| hypothetical protein SELMODRAFT_103038 [Selaginella moellendorffii]
gi|300157167|gb|EFJ23793.1| hypothetical protein SELMODRAFT_103038 [Selaginella moellendorffii]
Length = 772
Score = 793 bits (2048), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/782 (49%), Positives = 523/782 (66%), Gaps = 46/782 (5%)
Query: 7 FFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKV 66
FF + A S G A AC D+ + + PFC VSLP+P RV DL+GR++L EK+
Sbjct: 5 LFFFSIAWAPAESRGGAGA---AC--TDSRSSSFPFCDVSLPVPDRVADLVGRMNLSEKI 59
Query: 67 KLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNAT 126
++S A+ +PRLGI GY+WW EALHGV+ PG KF P ATSFPQVI T +SFN++
Sbjct: 60 AQIVSNASGIPRLGIPGYQWWEEALHGVAE-SPGVKFAAPVPSATSFPQVILTVASFNSS 118
Query: 127 LWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASY 186
LW I + +S EA AMYN G +GLT+WSPN+NIFRDPRWGRGQETPGEDP+LS KYAA +
Sbjct: 119 LWNKIAQAISIEAIAMYNAGRSGLTFWSPNINIFRDPRWGRGQETPGEDPLLSSKYAAYF 178
Query: 187 VRGLQGSDGD-------------RLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIE 233
VRGLQ D D RLKV++ CKHFTAYD++ G D FHFNA+V+ QD++
Sbjct: 179 VRGLQEGDYDEGTAISTMQRRPTRLKVSSCCKHFTAYDMEKSEGTDCFHFNAQVTVQDLQ 238
Query: 234 DTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVG 293
DTFD PFR C+++G+ + +MCSYN+VNGVP+CAD L T+R W GYIVSDCD+V
Sbjct: 239 DTFDPPFRSCIVDGQASGLMCSYNRVNGVPSCADYTFLTETVRNSWGFEGYIVSDCDAVA 298
Query: 294 VYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTV 353
+ Y+ ++T+T E+A AD + AG+DL+CG FL HT +A+++G ++E ++ AL N +TV
Sbjct: 299 LLYEYINYTTTAEDAVADVLSAGMDLNCGTFLLRHTAAAIEQGKVTEAAVDRALSNVMTV 358
Query: 354 QMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRT 413
+MRLG+FDG S + Y +GP VCT +H++L+LEAA QGIVLLKN G LP T
Sbjct: 359 RMRLGLFDGN-SGETYNSIGPDAVCTREHRQLSLEAAEQGIVLLKNSGNVLPFPRNDLMT 417
Query: 414 VAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYAR-TIHQQGCKDVACADDQLFGAAI 472
+AVIGP+ + T TM+GNYAG+ C Y TP QG+ Y + + + GCKD+ C D LF AA+
Sbjct: 418 IAVIGPSGNATETMLGNYAGVPCQYITPFQGLQEYTKGVVFEPGCKDIMCNDTTLFLAAV 477
Query: 473 DASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPI 532
A+ +DA ++V+GLD+ E E LDR LLLPG QQ+LV +VS +KGP ILV+MSGGPI
Sbjct: 478 RAAENSDAVVIVVGLDKDQEREGLDRTSLLLPGYQQDLVLEVSKVAKGPVILVVMSGGPI 537
Query: 533 DVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEM 592
DV FAK + +I++++W GYPG+AGG AIA ++FG NP G+LPMTWYPQ + ++ + M
Sbjct: 538 DVTFAKGNCKISSVLWVGYPGEAGGKAIARVIFGDHNPAGRLPMTWYPQAFAEHVSILNM 597
Query: 593 AMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT--------VVAVPL-- 642
+RP+ S +PGRTYRFY G VY FGHG+SYTNF +T +AP+ + PL
Sbjct: 598 HLRPNTSTGFPGRTYRFYTGENVYEFGHGLSYTNFTYTNFSAPSNITARNTVAIRTPLRE 657
Query: 643 DG-RHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA- 700
DG RH I+ T C L V + N G++D H L+++ PPA +
Sbjct: 658 DGARHFPIDYT-----------GCEALAFKVVAYISNTGTRDSDHISLLYAIPPAASSSL 706
Query: 701 --PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSV 758
P KQL++F++ H+ AG +V ++ CK L + + +GT+ + G++ + +G +H +
Sbjct: 707 SPPRKQLISFKRQHLIAGRCAKVEFDVDTCKDLGLTNEAGTKVLVHGDYKLSLGDIEHVI 766
Query: 759 SL 760
SL
Sbjct: 767 SL 768
>gi|302811514|ref|XP_002987446.1| hypothetical protein SELMODRAFT_426206 [Selaginella moellendorffii]
gi|300144852|gb|EFJ11533.1| hypothetical protein SELMODRAFT_426206 [Selaginella moellendorffii]
Length = 772
Score = 793 bits (2048), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 394/748 (52%), Positives = 527/748 (70%), Gaps = 12/748 (1%)
Query: 22 LAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGI 81
LA + P+ACD +AT PFC SLPI RV D + RL+L+EK+ LI+ A +PRLG+
Sbjct: 24 LAGQAPYACDQSNATLAAFPFCNTSLPITDRVEDYVARLTLEEKISQLINTATGIPRLGV 83
Query: 82 KGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA 141
Y+WW EALHGV++ PG +FGG P ATSFP ITTA+SFN +L+ IG+ VS EARA
Sbjct: 84 PKYQWWQEALHGVAS-SPGVQFGGSVPAATSFPMPITTAASFNTSLFYGIGQAVSTEARA 142
Query: 142 MYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDG--DRLK 199
M+N G +GLT+WSPN+NI+RDPRWGRGQETPGEDP+LS +A YVRGLQ S D+LK
Sbjct: 143 MHNLGQSGLTFWSPNINIYRDPRWGRGQETPGEDPLLSSNFATYYVRGLQESQAGSDKLK 202
Query: 200 VAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQV 259
V+A CKH TAYD+DNW G DR+HFNA V++QD+EDT++ PF+ CV +G V+SVMCSYN++
Sbjct: 203 VSACCKHMTAYDVDNWLGTDRYHFNAIVTEQDLEDTYNAPFKSCVEDGGVSSVMCSYNRL 262
Query: 260 NGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDL 319
NGVPTCAD +L T+R W+LNGYIVSDCDS+ V++D ++ +T E+AAADA+ AGL+L
Sbjct: 263 NGVPTCADHELLTTTVRETWKLNGYIVSDCDSLQVFFDNTNYAATAEDAAADALLAGLNL 322
Query: 320 DCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCT 379
+CG FL HT SA+Q+ ++E IN AL +TVQMRLG++DG+P SQ YG LG DVCT
Sbjct: 323 NCGTFLAKHTLSAIQQKKVTEATINQALTYLVTVQMRLGLYDGDPKSQTYGSLGASDVCT 382
Query: 380 PDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYT 439
+HQ LALEAARQG+VLLKN G +LPLS + +++AV+GP+++ T MIGNYAGI C YT
Sbjct: 383 SEHQTLALEAARQGMVLLKNLG-ALPLSTSKIKSLAVVGPHANATRAMIGNYAGIPCKYT 441
Query: 440 TPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRA 499
+PLQ +YA+ + GC +VAC+ D L A+ A+ ADA ++ +GLD +IEAE+LDR
Sbjct: 442 SPLQAFQKYAQVSYAPGCANVACSSDSLISGAVSAAAAADAVVVAVGLDLTIEAESLDRT 501
Query: 500 GLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTA 559
LLLPG+QQELVS+V A+KGP ++V++S G ID+ FA +D RIA I+WAGYPGQAGG A
Sbjct: 502 SLLLPGKQQELVSQVMQAAKGPVVIVILSAGAIDIPFALSDSRIAGILWAGYPGQAGGAA 561
Query: 560 IADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFG 619
IA+++FG NP GKLP TWYPQ + T++ M +M MRP+ S YPGRTYRFY GP ++ FG
Sbjct: 562 IAEVIFGDHNPSGKLPATWYPQNF-TSISMLDMNMRPNASTGYPGRTYRFYTGPTIFKFG 620
Query: 620 HGMSYTNFVHTVANAPTVVAVPLDGRH---GSINATISGKAIKVTHAK-CNRLTLGVQVD 675
G+SYT+ AP+ +++P + + S + T K C L V +
Sbjct: 621 DGLSYTSLSAKFIKAPSFLSIPSTAPMQPCTGLKKSSSCFHLDATDEKSCESLKSQVAIS 680
Query: 676 VKNVGSKDGAHTLLVFSTPPAG--HWAPHKQLVAFEKVHVPAGA-QQRVGINIHVCKYLS 732
V+N G+ +HTL++FSTPP+ P +QLV F K+ + + V ++ C++
Sbjct: 681 VRNKGAMAISHTLMLFSTPPSAGSDGVPQRQLVGFNKIQIAGDSISNPVIFDLDPCRHFV 740
Query: 733 VVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
DR G + + G H + G +HS+ L
Sbjct: 741 HADRDGKKLLRSGTHVLTAGNEQHSLRL 768
>gi|302786124|ref|XP_002974833.1| hypothetical protein SELMODRAFT_101733 [Selaginella moellendorffii]
gi|300157728|gb|EFJ24353.1| hypothetical protein SELMODRAFT_101733 [Selaginella moellendorffii]
Length = 784
Score = 791 bits (2043), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 389/766 (50%), Positives = 530/766 (69%), Gaps = 33/766 (4%)
Query: 13 ILLSASSSGLAAREPFACD-PKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLIS 71
IL +A +S R +ACD +A+ + PFC L I RV DL+ RL+L EKV +++
Sbjct: 23 ILSNARASTAQLR--YACDVSSNASLGSFPFCDTKLGIDVRVQDLVSRLTLDEKVDEMVN 80
Query: 72 GAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAI 131
A +PRLG+ Y+WW EALHGV++ PG +FGG P ATSFP I TA+SFN+TL+ +I
Sbjct: 81 AAQGIPRLGVPSYQWWQEALHGVAS-SPGVQFGGLAPAATSFPMPIATAASFNSTLFYSI 139
Query: 132 GRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQ 191
G VS EARA++N G AGLT+WSPNVNIFRDPRWGRGQETPGEDP+L+ K+A+ YVRGLQ
Sbjct: 140 GEAVSSEARALHNLGRAGLTFWSPNVNIFRDPRWGRGQETPGEDPLLASKFASLYVRGLQ 199
Query: 192 G-------SDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCV 244
G SDG LKV+A CKH TAYD+DNW G+DR+HFNA+VS+QD+ DT++ PF+ C+
Sbjct: 200 GGAYEGSASDG-FLKVSACCKHLTAYDVDNWKGMDRYHFNAEVSEQDLVDTYNPPFQSCI 258
Query: 245 MEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTST 304
+G+V+SVMCSYN+VNGVPTCAD N+L T+R W NGYIVSDCD++ V ++ + +
Sbjct: 259 EDGRVSSVMCSYNRVNGVPTCADRNLLTETVRNSWGFNGYIVSDCDALQVLFEDTTYAPS 318
Query: 305 PEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP 364
E+A AD+I AGLDL+CG FLG H +SA+Q G ++E D+++A+ N + +MRLG+FDG+P
Sbjct: 319 AEDAVADSILAGLDLNCGTFLGKHAKSALQAGKITEADLDHAVSNLMRTRMRLGLFDGDP 378
Query: 365 SSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVT 424
+SQPY LG D+C+ DHQ+LAL+AA QG+VLLKN G SLPLS +TVA+IGPN++ T
Sbjct: 379 NSQPYSSLGATDICSNDHQQLALDAALQGVVLLKNDG-SLPLS-TALKTVALIGPNANAT 436
Query: 425 VTMIGNYAGIACGYTTPLQGIGRYARTI-HQQGCKDVACADDQLFGAAIDASRQADATIL 483
TM+GNY GI C Y +PLQG+ Y+ I + GC++VAC + L +A++ + +ADA +L
Sbjct: 437 YTMLGNYEGIPCKYISPLQGMQIYSSNILYSPGCRNVACNEGDLVASAVEVATKADAVVL 496
Query: 484 VMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRI 543
V+GLDQS E E DR LLLPG Q +LVS ++ A P +LV+MS GP+D++ K++ RI
Sbjct: 497 VVGLDQSQERETFDRTSLLLPGMQSQLVSNIANAVTSPIVLVIMSAGPVDISTFKDNSRI 556
Query: 544 AAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYP 603
+++IW GYPGQ+GG A+A ++FG NPGG+LP TWY +E+ TN+ M +M MRP+ YP
Sbjct: 557 SSVIWLGYPGQSGGAALAHVVFGAYNPGGRLPNTWYHEEF-TNVSMLDMQMRPNPLSGYP 615
Query: 604 GRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHA 663
GR+YRFY G +Y FG G+SY+ + + APT L + + A+ + A
Sbjct: 616 GRSYRFYTGTPLYNFGDGLSYSTYFYKFLLAPT----KLSFFKSNTGNSRGCPAVNRSKA 671
Query: 664 K-------------CNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEK 710
K CN + V V+V N+G + G+H++L+FS PP AP KQL+AF+K
Sbjct: 672 KSGCFHLPADDLETCNSILFQVSVEVSNLGPRSGSHSVLIFSAPPPVEGAPLKQLIAFQK 731
Query: 711 VHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKH 756
VH+ + QR+ I CK+LS V R+G R + G H + IG H
Sbjct: 732 VHLESDTTQRLIFGIDPCKHLSSVRRNGKRFLHSGRHKLLIGNAVH 777
>gi|449433577|ref|XP_004134574.1| PREDICTED: probable beta-D-xylosidase 2-like [Cucumis sativus]
gi|449530107|ref|XP_004172038.1| PREDICTED: probable beta-D-xylosidase 2-like [Cucumis sativus]
Length = 812
Score = 790 bits (2041), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 388/748 (51%), Positives = 504/748 (67%), Gaps = 27/748 (3%)
Query: 28 FACDPKDATTRTLPF-----CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIK 82
F CDP L F C SL P+R DLI R++L EK L A+ V RLG+
Sbjct: 56 FVCDPSRYDKLGLDFSSFGFCDSSLSFPERAKDLIDRMTLSEKAAQLGHVASGVDRLGLP 115
Query: 83 GYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAM 142
Y WWSEALHGVSNVGPGT+F PGATSFP VITTASSFN LW+ IG+ VS EARAM
Sbjct: 116 PYNWWSEALHGVSNVGPGTQFDKVVPGATSFPNVITTASSFNEDLWKTIGQAVSTEARAM 175
Query: 143 YNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR----- 197
YN G AGLTYWSP +N+ RDPRWGR ETPGEDP + GKYA +YVRGLQ +G
Sbjct: 176 YNLGRAGLTYWSPTINVIRDPRWGRTVETPGEDPFVVGKYAKNYVRGLQDVEGSENVTDL 235
Query: 198 ----LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVM 253
LKV++ CKH+ AYD+DNW GV+R+ F+A+V++QD+ +TF+ PF MCV EG V+SVM
Sbjct: 236 NSRPLKVSSCCKHYAAYDVDNWLGVERYSFDARVTEQDMLETFNKPFEMCVKEGDVSSVM 295
Query: 254 CSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHF-TSTPEEAAADA 312
CSYN+VNG+PTCADP +LK TIRG W L+GYIVSDCDSV V + H+ T E+A A
Sbjct: 296 CSYNRVNGIPTCADPVLLKDTIRGNWGLHGYIVSDCDSVKVMVEDAHYLQDTNEDAVAQT 355
Query: 313 IRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHL 372
++AGLDLDCG +TES V++G + +I+NAL N V MRLG FDG + L
Sbjct: 356 LKAGLDLDCGQIYPNYTESTVRQGKVGMRNIDNALNNLYVVLMRLGYFDGNTG---FESL 412
Query: 373 GPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYA 432
G D+C+ +H ELA EAARQG VLLKN +LP ++T+AV+GP+++ T M+GNYA
Sbjct: 413 GKPDICSDEHIELATEAARQGTVLLKNDNDTLPFDPSNYKTLAVVGPHANATSAMLGNYA 472
Query: 433 GIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIE 492
G+ C +P+ G+ YA+ +Q GC VAC +D A++A+R +DAT++ +G+D SIE
Sbjct: 473 GVPCRMNSPMDGLSEYAKVKYQMGCDSVACKNDTFIFGAMEAARTSDATVIFVGIDLSIE 532
Query: 493 AEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYP 552
AE+LDR LLLPG Q +LV +V+ SKGP +LV++S G IDV+FAKN+ I AIIWAGYP
Sbjct: 533 AESLDRVDLLLPGYQTQLVQQVATVSKGPVVLVILSAGGIDVSFAKNNSNIKAIIWAGYP 592
Query: 553 GQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKG 612
G+ GG AIAD++FG NPGG+LP+TWY +Y+ LPMT M +RP +S YPGRTY+FY G
Sbjct: 593 GEEGGRAIADVIFGKFNPGGRLPLTWYENDYVYQLPMTSMPLRPVKSLGYPGRTYKFYDG 652
Query: 613 PVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH-----GSINATISGK--AIKVTHAKC 665
PVVYPFGHG+SYT F+H + +A +A+ L R N T + A+ V C
Sbjct: 653 PVVYPFGHGLSYTFFLHNLTSAKRSIAIDLSNRTQCRDIAYTNGTFKPECPAVLVDDLTC 712
Query: 666 NRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPH-KQLVAFEKVHVPAGAQQRVGIN 724
+ Q++V+N G +DG+ LLV+S PP G + H KQ+V F++V + AG + V
Sbjct: 713 TE-EIEFQMEVENTGERDGSQVLLVYSVPPGGISSTHIKQVVGFQRVFLKAGDSETVTFK 771
Query: 725 IHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
++ CK L +VD +G +P G H I +G
Sbjct: 772 LNACKSLGLVDFTGYNLLPAGGHTIVVG 799
>gi|302796583|ref|XP_002980053.1| hypothetical protein SELMODRAFT_112087 [Selaginella moellendorffii]
gi|300152280|gb|EFJ18923.1| hypothetical protein SELMODRAFT_112087 [Selaginella moellendorffii]
Length = 772
Score = 786 bits (2029), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 392/748 (52%), Positives = 524/748 (70%), Gaps = 12/748 (1%)
Query: 22 LAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGI 81
LA + P+ACD +AT PFC SL I RV D + RL+L+EK+ LI+ A +PRLG+
Sbjct: 24 LAGQAPYACDQSNATLAAFPFCNTSLAITDRVEDYVARLTLEEKISQLINTATGIPRLGV 83
Query: 82 KGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA 141
Y+WW EALHGV++ PG +FGG P ATSFP ITTA+SFN +L+ IG+ VS EARA
Sbjct: 84 PKYQWWQEALHGVAS-SPGVQFGGSVPAATSFPMPITTAASFNTSLFYGIGQAVSTEARA 142
Query: 142 MYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDG--DRLK 199
M+N G +GLT+WSPN+NI+RDPRWGRGQETPGEDP+LS +A YVRGLQ S D+LK
Sbjct: 143 MHNLGQSGLTFWSPNINIYRDPRWGRGQETPGEDPLLSSNFATYYVRGLQESQAGSDKLK 202
Query: 200 VAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQV 259
V+A CKH TAYD+DNW G DR+HFNA V++QD+EDT++ PF+ CV +G V+SVMCSYN++
Sbjct: 203 VSACCKHMTAYDVDNWLGTDRYHFNAIVTEQDLEDTYNAPFKSCVEDGGVSSVMCSYNRL 262
Query: 260 NGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDL 319
NGVPTCAD +L T+R W+LNGYIVSDCDS+ V++D ++ +T E+AAADA+ AGL+L
Sbjct: 263 NGVPTCADHELLTTTVRETWKLNGYIVSDCDSLQVFFDNTNYAATAEDAAADALLAGLNL 322
Query: 320 DCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCT 379
+CG FL HT SA+Q+ ++E IN AL +TVQMRLG++DG+P SQ YG LG DVCT
Sbjct: 323 NCGTFLAKHTLSAIQQKKVTEATINQALTYLVTVQMRLGLYDGDPKSQTYGSLGASDVCT 382
Query: 380 PDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYT 439
+HQ LALEAARQG+VLLKN G +LPLS + +++AV+GP+++ T MIGNYAGI C YT
Sbjct: 383 SEHQTLALEAARQGMVLLKNLG-ALPLSTSKIKSLAVVGPHANATRAMIGNYAGIPCKYT 441
Query: 440 TPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRA 499
+PLQ +YA+ + GC +VAC+ D L A+ A+ ADA ++ +GLD +IEAE+LDR
Sbjct: 442 SPLQAFQKYAQVSYAPGCANVACSSDSLISGAVSAAAAADAVVVAVGLDLTIEAESLDRT 501
Query: 500 GLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTA 559
LLLPG+QQELVS+V A+KGP ++V++S G ID+ FA +D RIA I+WAGYPGQAGG A
Sbjct: 502 SLLLPGKQQELVSQVMQAAKGPVVIVILSAGAIDIPFALSDSRIAGILWAGYPGQAGGAA 561
Query: 560 IADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFG 619
IA+++FG NP GKLP TWYPQ + T++ M +M MRP+ S YPGRTYRFY GP ++ FG
Sbjct: 562 IAEVIFGDHNPSGKLPATWYPQNF-TSISMLDMNMRPNASTGYPGRTYRFYTGPTIFKFG 620
Query: 620 HGMSYTNFVHTVANAPTVVAVPLDGRH---GSINATISGKAIKVTHAK-CNRLTLGVQVD 675
G+SYT+ AP+ +++P + + S + T K C L V +
Sbjct: 621 DGLSYTSLSAKFIKAPSFLSIPSTAPMQPCTGLKKSSSCFHLDATDEKSCESLKSQVAIS 680
Query: 676 VKNVGSKDGAHTLLVFSTPP--AGHWAPHKQLVAFEKVHVPAGA-QQRVGINIHVCKYLS 732
V+N G+ +HTL++FSTPP P +QLV F K+ + + V ++ C++
Sbjct: 681 VRNKGAMAISHTLMLFSTPPNAGSDGVPQRQLVGFNKIQIAGDSISNPVIFDLDPCRHFV 740
Query: 733 VVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
D G + + G H + G +HS+ L
Sbjct: 741 HADPDGKKLLRSGTHVLTAGNEQHSLRL 768
>gi|302760655|ref|XP_002963750.1| hypothetical protein SELMODRAFT_80102 [Selaginella moellendorffii]
gi|300169018|gb|EFJ35621.1| hypothetical protein SELMODRAFT_80102 [Selaginella moellendorffii]
Length = 785
Score = 783 bits (2021), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 378/751 (50%), Positives = 523/751 (69%), Gaps = 23/751 (3%)
Query: 24 AREPFACD-PKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIK 82
A+ +ACD +A+ + PFC L + RV DL+ RL+L EKV +++ A +PRLG+
Sbjct: 33 AQPRYACDVSSNASLGSFPFCDTKLGVDVRVQDLVSRLTLDEKVDEMVNAAQGIPRLGVP 92
Query: 83 GYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAM 142
Y+WW EALHGV++ PG +FGG P ATSFP I A+SFN+TL+ +IG VS EARA+
Sbjct: 93 SYQWWQEALHGVAS-SPGVQFGGLAPAATSFPMPIAMAASFNSTLFYSIGEAVSSEARAL 151
Query: 143 YNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-------SDG 195
+N G AGLT+WSPNVNIFRDPRWGRGQETPGEDP+L+ K+A+ YVRGLQG SDG
Sbjct: 152 HNLGRAGLTFWSPNVNIFRDPRWGRGQETPGEDPLLASKFASLYVRGLQGGAYGGSASDG 211
Query: 196 DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCS 255
LKV+A CKH TAYD+DNW G+DR+HFNA+VS+QD+ DT++ PF+ C+ +G+V+SVMCS
Sbjct: 212 -FLKVSACCKHLTAYDMDNWKGMDRYHFNAEVSEQDLVDTYNPPFQSCIEDGRVSSVMCS 270
Query: 256 YNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRA 315
YN+VNGVPTCAD ++L T+R W NGYIVSDCD++ V ++ + + E+A AD+I A
Sbjct: 271 YNRVNGVPTCADRSLLTETVRNSWGFNGYIVSDCDALQVLFEDTTYAPSAEDAVADSILA 330
Query: 316 GLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPK 375
GLDL+CG FLG H +SA+Q G ++E D+++A+ N + +MRLG+FDG+ +++PY LG
Sbjct: 331 GLDLNCGTFLGKHAKSALQAGKVTEADLDHAISNLMRTRMRLGLFDGDLNTRPYSSLGAT 390
Query: 376 DVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIA 435
D+C+ DHQ+LAL+AA QG+VLLKN G SLPLS +TVA+IGPN++ T TM+GNY GI
Sbjct: 391 DICSNDHQQLALDAALQGVVLLKNDG-SLPLS-TALKTVALIGPNANATYTMLGNYEGIP 448
Query: 436 CGYTTPLQGIGRYARTI-HQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE 494
C Y +PLQG+ Y I + GC+DVAC++ L +A++ + +ADA +LV+GLDQS E E
Sbjct: 449 CKYVSPLQGMQIYNNNILYSPGCRDVACSEGDLVASAVEVATKADAVVLVVGLDQSQERE 508
Query: 495 ALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQ 554
DR LLLPG Q +LVS ++ A P +LV+MS GP+D++ K++ RI+++IW GYPGQ
Sbjct: 509 TFDRTSLLLPGMQSQLVSNIANAVTCPIVLVIMSAGPVDISTFKDNSRISSVIWIGYPGQ 568
Query: 555 AGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPV 614
+GG A+A ++FG NPGG+LP TWY +E+ TN+ M +M MRP+ YPGR+YRFY G
Sbjct: 569 SGGAALAHVVFGAYNPGGRLPNTWYHEEF-TNVSMLDMRMRPNPPSGYPGRSYRFYTGTP 627
Query: 615 VYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKA---IKVTH------AKC 665
+Y FG G+SY+ +++ APT ++ S + ++ H C
Sbjct: 628 LYNFGDGLSYSTYLYKFLLAPTRLSFFKSNTRNSRDCPTVNRSEAEFGCFHLPADDLETC 687
Query: 666 NRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINI 725
N + V V+V N+G + G+H++L+FS PP AP KQL+AF+KVH+ + QR+ I
Sbjct: 688 NSILFQVSVEVSNLGPRSGSHSVLIFSAPPPVEGAPLKQLIAFQKVHLESDTTQRLIFGI 747
Query: 726 HVCKYLSVVDRSGTRRIPLGEHNIHIGGTKH 756
CK+LS V R+G R + G H + IG H
Sbjct: 748 DPCKHLSSVRRNGKRFLHSGRHKLLIGNAVH 778
>gi|255545664|ref|XP_002513892.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
gi|223546978|gb|EEF48475.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
Length = 774
Score = 780 bits (2015), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/767 (48%), Positives = 523/767 (68%), Gaps = 19/767 (2%)
Query: 7 FFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKV 66
F L++L +S+ PF+CDP + +T + FC+ SLPI QRV DL+ RL+L EK+
Sbjct: 12 FTIFPLLILQITST----EPPFSCDPSNPSTSSFLFCKTSLPISQRVRDLVSRLTLDEKI 67
Query: 67 KLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNAT 126
L+S A ++PRLGI YEWWSEALHGV+NVG G F G ATSFPQVI TA+SF+A
Sbjct: 68 SQLVSSAPSIPRLGIPAYEWWSEALHGVANVGRGIHFEGAIKAATSFPQVILTAASFDAY 127
Query: 127 LWEAIGRVVSDEARAMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAAS 185
W IG+V+ EARA+YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP+++GKYA S
Sbjct: 128 QWYRIGQVIGREARAVYNAGQATGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVS 187
Query: 186 YVRGLQGS--DGDRLK----VAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVP 239
YVRG+QG G +LK +A CKHFTAYDLDNW GV+RF F+A+V+ QD+ DT+ P
Sbjct: 188 YVRGVQGDSFQGGKLKGHLQASACCKHFTAYDLDNWKGVNRFVFDARVTMQDLADTYQPP 247
Query: 240 FRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQ 299
F+ CV +GK + +MC+YN+VNG+P+CAD N+L RT RG+W +GYI SDCD+V + YD Q
Sbjct: 248 FQSCVQQGKASGIMCAYNRVNGIPSCADFNLLSRTARGQWDFHGYIASDCDAVSIIYDNQ 307
Query: 300 HFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGM 359
+ +PE+A D ++AG+D++CG +L HT++AV++ L E I+ AL N +V+MRLG+
Sbjct: 308 GYAKSPEDAVVDVLKAGMDVNCGSYLQKHTKAAVEQKKLPEASIDRALHNLFSVRMRLGL 367
Query: 360 FDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGP 419
F+G P+ QP+ ++GP VC+ +HQ LALEAAR GIVLLKN LPL + ++AVIGP
Sbjct: 368 FNGNPTEQPFSNIGPDQVCSQEHQILALEAARNGIVLLKNSARLLPLQKSKTVSLAVIGP 427
Query: 420 NSDVTVTMIGNYAGIACGYTTPLQGIGRYAR-TIHQQGCKDVACADDQLFGAAIDASRQA 478
N++ T++GNYAG C TPLQ + Y + TI+ GC V C+ + A+D ++
Sbjct: 428 NANSVQTLLGNYAGPPCKTVTPLQALQYYVKNTIYYSGCDTVKCSSASI-DKAVDIAKGV 486
Query: 479 DATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAK 538
D +++MGLDQ+ E E LDR L+LPG+QQEL++ V+ ++K P +LVL+SGGP+D++FAK
Sbjct: 487 DRVVMIMGLDQTQEREELDRLDLVLPGKQQELITNVAKSAKNPIVLVLLSGGPVDISFAK 546
Query: 539 NDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQ 598
D I +I+WAGYPG+AGG A+A+I+FG NPGGKLPMTWYPQE++ +PMT+M MRP
Sbjct: 547 YDENIGSILWAGYPGEAGGIALAEIIFGDHNPGGKLPMTWYPQEFV-KVPMTDMRMRPDP 605
Query: 599 SKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVAN-APTVVAVPLDGRHGSINATISGKA 657
S YPGRTYRFYKG V+ FG+G+SY+ + + + + T + + I+ + +A
Sbjct: 606 SSGYPGRTYRFYKGRNVFEFGYGLSYSKYSYELKYVSQTKLYLNQSSTMRIIDNSDPVRA 665
Query: 658 IKVTH---AKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA-GHWAPHKQLVAFEKVHV 713
V C V+V V+N G G H +L+F+ G+ P +QL+ F+ V +
Sbjct: 666 TLVAQLGAEFCKESKFSVKVGVENQGEMAGKHPVLLFARHARHGNGRPRRQLIGFKSVIL 725
Query: 714 PAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
AG + + + C++ S + G R + G H + +GG K+ +S+
Sbjct: 726 NAGEKAEIEFELSPCEHFSRANEDGLRVMEEGTHFLMVGGDKYPISV 772
>gi|224068498|ref|XP_002302758.1| predicted protein [Populus trichocarpa]
gi|222844484|gb|EEE82031.1| predicted protein [Populus trichocarpa]
Length = 462
Score = 778 bits (2009), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 367/458 (80%), Positives = 413/458 (90%), Gaps = 1/458 (0%)
Query: 314 RAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLG 373
+A LDLDCGPFLG HTE AV++GLL+E +INNAL+NTLTVQMRLGMFDGEPSS+PYG+LG
Sbjct: 5 QASLDLDCGPFLGQHTEDAVRKGLLTEAEINNALLNTLTVQMRLGMFDGEPSSKPYGNLG 64
Query: 374 PKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG 433
P DVCTP HQELALEAARQGIVLLKN GP LPLS H++VA+IGPNS+VTVTMIGNYAG
Sbjct: 65 PTDVCTPAHQELALEAARQGIVLLKNHGPPLPLSTRHHQSVAIIGPNSNVTVTMIGNYAG 124
Query: 434 IACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEA 493
+ACGYTTPLQGIGRYA+TI+QQGC DVAC DQ F AA+DA+RQADAT+LVMGLDQSIEA
Sbjct: 125 VACGYTTPLQGIGRYAKTIYQQGCADVACVSDQQFVAAMDAARQADATVLVMGLDQSIEA 184
Query: 494 EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPG 553
E+ DR LLLPGRQQEL+SKV+ ASKGPTILVLMSGGPIDV+FA+NDP+I I+WAGYPG
Sbjct: 185 ESRDRTELLLPGRQQELISKVAAASKGPTILVLMSGGPIDVSFAENDPKIGGIVWAGYPG 244
Query: 554 QAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGP 613
QAGG AI+D+LFGT+NPGGKLPMTWYPQ+Y+TNLPMT MAMRPS+S YPGRTYRFYKG
Sbjct: 245 QAGGAAISDVLFGTTNPGGKLPMTWYPQDYVTNLPMTNMAMRPSKSNGYPGRTYRFYKGK 304
Query: 614 VVYPFGHGMSYTNFVHTVANAPTVVAVPLDG-RHGSINATISGKAIKVTHAKCNRLTLGV 672
VVYPFGHG+SYTNFVHT+A+APT+V+VPLDG R S NATISGKAI+VTHA+CNRL+ GV
Sbjct: 305 VVYPFGHGISYTNFVHTIASAPTMVSVPLDGHRQASRNATISGKAIRVTHARCNRLSFGV 364
Query: 673 QVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLS 732
QVDVKN GS DG HTLLV+S PPAGHWAP KQLVAFEKVHV AG QQRVGIN+HVCK+LS
Sbjct: 365 QVDVKNTGSMDGTHTLLVYSKPPAGHWAPLKQLVAFEKVHVAAGTQQRVGINVHVCKFLS 424
Query: 733 VVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIKS 770
VVDRSG RRIP+G H++HIG KHSVSL A+ LGVIKS
Sbjct: 425 VVDRSGIRRIPMGAHSLHIGDVKHSVSLQASILGVIKS 462
>gi|224082152|ref|XP_002306583.1| predicted protein [Populus trichocarpa]
gi|222856032|gb|EEE93579.1| predicted protein [Populus trichocarpa]
Length = 745
Score = 769 bits (1986), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 362/765 (47%), Positives = 508/765 (66%), Gaps = 39/765 (5%)
Query: 5 IAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQE 64
F +I+L S+ + PF+CD + +T+T PFC+ +LPI QR NDL+ RL+L+E
Sbjct: 9 FTFTIFTVIVLQVDST----QPPFSCDSSNPSTKTFPFCKTTLPISQRANDLVSRLTLEE 64
Query: 65 KVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFN 124
K+ L++ A +PRLGI GY+WWSEALHGV+ GPG +F G ATSFPQVI +A+SF+
Sbjct: 65 KISQLVNSAQPIPRLGIPGYQWWSEALHGVAYAGPGIRFNGTIKRATSFPQVILSAASFD 124
Query: 125 ATLWEAIGRVVSDEARAMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYA 183
A W I + + EARA+YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP+++GKYA
Sbjct: 125 ANQWYRISQAIGKEARALYNAGQATGMTFWAPNINIFRDPRWGRGQETPGEDPLMTGKYA 184
Query: 184 ASYVRGLQGSD------GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFD 237
SYVRGLQG L+ +A CKHFTAYDL+NWNG R+ F+A V+ QD+ DT+
Sbjct: 185 VSYVRGLQGDSFKGGEIKGPLQASACCKHFTAYDLENWNGTSRYVFDAYVTAQDLADTYQ 244
Query: 238 VPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYD 297
PF+ CV EG+ + +MC+YN+VNG+P CAD N L RT R +W +GYI SDCD+V + +D
Sbjct: 245 PPFKSCVEEGRASGIMCAYNRVNGIPNCADSNFLSRTARAQWGFDGYIASDCDAVSIIHD 304
Query: 298 TQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRL 357
Q + TPE+A ++AG+D++CG +L HT++AV + L+ +I+ AL N +V+MRL
Sbjct: 305 AQGYAKTPEDAVVAVLKAGMDVNCGSYLQQHTKAAVDQKKLTISEIDRALHNLFSVRMRL 364
Query: 358 GMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVI 417
G+F+G P+ Q +G++GP VC+ ++Q LAL+AAR GIVLLKN LPLS + ++AVI
Sbjct: 365 GLFNGNPTGQQFGNIGPDQVCSQENQILALDAARNGIVLLKNSAGLLPLSKSKTMSLAVI 424
Query: 418 GPNSDVTVTMIGNYAGIACGYTTPLQGIGRYAR-TIHQQGCKDVACADDQLFGAAIDASR 476
GPN++ T++GNYAG C TPLQ + Y + TI GC V C+ + G A++ ++
Sbjct: 425 GPNANSVQTLLGNYAGPPCKLVTPLQALQSYIKHTIPYPGCDSVQCSSASIVG-AVNVAK 483
Query: 477 QADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAF 536
AD +L+MGLD + E E LDR L+LPG+QQEL+ V+ A+K P +LVL+SGGP+D++F
Sbjct: 484 GADHVVLIMGLDDTQEKEGLDRRDLVLPGKQQELIISVAKAAKNPVVLVLLSGGPVDISF 543
Query: 537 AKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRP 596
AKND I +I+WAGYPG+AG A+A+I+FG NPGGKLPMTWYPQE++ +PMT+M MRP
Sbjct: 544 AKNDKNIGSILWAGYPGEAGAIALAEIIFGDHNPGGKLPMTWYPQEFV-KVPMTDMRMRP 602
Query: 597 SQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGK 656
S YPGRTYRFYKGP V+ FG+G+SY+ + + + +
Sbjct: 603 ETSSGYPGRTYRFYKGPTVFEFGYGLSYSKYTYEL------------------------R 638
Query: 657 AIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFST-PPAGHWAPHKQLVAFEKVHVPA 715
AI + +C + V V VKN G G H +L+F+ G P K+LV F+ V + A
Sbjct: 639 AIYIGEEQCENIKFKVTVSVKNEGQMAGKHPVLLFARHAKPGKGRPIKKLVGFQTVKLGA 698
Query: 716 GAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
G + + + C++LS + G + G + +G +H V++
Sbjct: 699 GEKTEIEYELSPCEHLSSANEDGVMVMEEGSQILLVGDKEHPVTI 743
>gi|222629651|gb|EEE61783.1| hypothetical protein OsJ_16354 [Oryza sativa Japonica Group]
Length = 771
Score = 769 bits (1986), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 391/731 (53%), Positives = 490/731 (67%), Gaps = 50/731 (6%)
Query: 75 AVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIG-- 132
A+PRLGI YEWWSEALHGVS VGPGT+F PGATSFPQ I TA+SFNA+L+ AIG
Sbjct: 44 ALPRLGIPAYEWWSEALHGVSYVGPGTRFSTLVPGATSFPQPILTAASFNASLFRAIGES 103
Query: 133 ----------------------------------------RVVSDEARAMYNGGTAGLTY 152
RVVS EARAM+N G AGLT+
Sbjct: 104 ACNNTSQFFFSSKSPFSICIAMENLHCDFRSRLVRFYRGARVVSTEARAMHNVGLAGLTF 163
Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDG--DRLKVAASCKHFTAY 210
WSPN+NIFRDPRWGRGQETPGEDP+L+ KYA YV GLQ + G D LKVAA CKH+TAY
Sbjct: 164 WSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQDAGGGSDALKVAACCKHYTAY 223
Query: 211 DLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNI 270
D+DNW GV+R+ F+A VS+QD++DTF PF+ CV++G VASVMCSYN+VNG PTCAD ++
Sbjct: 224 DVDNWKGVERYTFDAVVSQQDLDDTFQPPFKSCVIDGNVASVMCSYNKVNGKPTCADKDL 283
Query: 271 LKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE 330
L IRG+W+LNGYIVSDCDSV V Y+ QH+T PE+AAA I++GLDL+CG FL HT
Sbjct: 284 LSGVIRGDWKLNGYIVSDCDSVDVLYNNQHYTKNPEDAAAITIKSGLDLNCGNFLAQHTV 343
Query: 331 SAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAA 390
+AVQ G LSE D++ A+ N V MRLG FDG+P P+G LGPKDVCT +QELA EAA
Sbjct: 344 AAVQAGKLSESDVDRAITNNFIVLMRLGFFDGDPRKLPFGSLGPKDVCTSSNQELAREAA 403
Query: 391 RQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYAR 450
RQGIVLLKN G +LPLS +++AVIGPN++ + TMIGNY G C YTTPLQG+G
Sbjct: 404 RQGIVLLKNTG-ALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPCKYTTPLQGLGANVA 462
Query: 451 TIHQQGCKDVACADDQL-FGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQE 509
T++Q GC +V C+ + L AA A+ AD T+LV+G DQS+E E+LDR LLLPG+Q +
Sbjct: 463 TVYQPGCTNVGCSGNSLQLSAATQAAASADVTVLVVGADQSVERESLDRTSLLLPGQQPQ 522
Query: 510 LVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSN 569
LVS V+ AS+GP ILV+MSGGP D++FAK+ +I+AI+W GYP ++
Sbjct: 523 LVSAVANASRGPVILVVMSGGPFDISFAKSSDKISAILWVGYPRRSRWRRPRRHPLRIPQ 582
Query: 570 PGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVH 629
LP+TWYP + + MT+M MRP S YPGRTYRFY G VY FG G+SYT F H
Sbjct: 583 --SWLPVTWYPASFADKVSMTDMRMRPDSSTGYPGRTYRFYTGDTVYAFGDGLSYTKFAH 640
Query: 630 TVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLL 689
++ +AP VAV L H T +++ C L+ V + V+N G G HT+
Sbjct: 641 SLVSAPEQVAVQLAEGHAC--HTEHCFSVEAAGEHCGSLSFDVHLRVRNAGGMAGGHTVF 698
Query: 690 VFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNI 749
+FS+PP+ H AP K L+ FEKV + G V + VCK LSVVD G R++ LG H +
Sbjct: 699 LFSSPPSVHSAPAKHLLGFEKVSLEPGQAGVVAFKVDVCKDLSVVDELGNRKVALGSHTL 758
Query: 750 HIGGTKHSVSL 760
H+G KH+++L
Sbjct: 759 HVGDLKHTLNL 769
>gi|224066931|ref|XP_002302285.1| predicted protein [Populus trichocarpa]
gi|222844011|gb|EEE81558.1| predicted protein [Populus trichocarpa]
Length = 773
Score = 764 bits (1974), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 358/751 (47%), Positives = 511/751 (68%), Gaps = 15/751 (1%)
Query: 23 AAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIK 82
+ + PF+CD + +T+ PFC+ +LPI QR DL+ RL+L EK+ L++ A +PRLGI
Sbjct: 23 STQPPFSCDSSNPSTKAFPFCETTLPISQRARDLVSRLTLDEKISQLVNSAPPIPRLGIP 82
Query: 83 GYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAM 142
GYEWWSEALHGVSN GPG F + GATSFPQVI TA+SF+A W IG+ + EARA+
Sbjct: 83 GYEWWSEALHGVSNAGPGIHFNDNIKGATSFPQVILTAASFDAYQWYRIGQAIGKEARAL 142
Query: 143 YNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD----- 196
YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP+++G YAASYV+G+QG +
Sbjct: 143 YNAGQATGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGLYAASYVKGVQGDSFEGGKIK 202
Query: 197 -RLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCS 255
L+ +A CKHFTAYDLDNW G++RF F+A+V+ QD+ DT+ PF+ CV +G+ + +MC+
Sbjct: 203 GHLQASACCKHFTAYDLDNWKGMNRFVFDARVTMQDLADTYQPPFKSCVEQGRASGIMCA 262
Query: 256 YNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRA 315
YN+VNGVP+CAD N+L +T R +W GYI SDCD+V + +D Q + +PE+A D ++A
Sbjct: 263 YNKVNGVPSCADSNLLSKTARAQWGFRGYITSDCDAVSIIHDDQGYAKSPEDAVVDVLKA 322
Query: 316 GLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPK 375
G+D++CG +L H + AV++ LSE DI+ AL N +V+MRLG+F+G P Q +G++GP
Sbjct: 323 GMDVNCGSYLLKHAKVAVEQKKLSESDIDKALHNLFSVRMRLGLFNGRPEGQLFGNIGPD 382
Query: 376 DVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIA 435
VC+ +HQ LALEAAR GIVLLKN LPLS + +++AVIGPN++ ++GNYAG
Sbjct: 383 QVCSQEHQILALEAARNGIVLLKNSARLLPLSKSKTKSLAVIGPNANSGQMLLGNYAGPP 442
Query: 436 CGYTTPLQGIGRYAR-TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE 494
C + TPLQ + Y + T++ C V C+ + A+D ++ AD +L+MGLDQ+ E E
Sbjct: 443 CRFVTPLQALQSYIKQTVYHPACDTVQCSSASV-DRAVDVAKGADNVVLMMGLDQTQERE 501
Query: 495 ALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQ 554
LDR LLLPG+QQEL+ V+ A+K P +LVL SGGP+D++FAKND I +I+WAGYPG+
Sbjct: 502 ELDRTDLLLPGKQQELIIAVAKAAKNPVVLVLFSGGPVDISFAKNDKNIGSILWAGYPGE 561
Query: 555 AGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPV 614
G A+A+I+FG NPGG+LPMTWYPQE++ +PMT+M MRP S YPGRTYRFY+G
Sbjct: 562 GGAIALAEIVFGDHNPGGRLPMTWYPQEFV-KVPMTDMGMRPEASSGYPGRTYRFYRGRS 620
Query: 615 VYPFGHGMSYTNFVHTV-ANAPTVVAVPLDGRHGSINATISGKAIKVTHAK---CNRLTL 670
V+ FG+G+SY+ + + + A + + + IN S ++ ++ C +
Sbjct: 621 VFEFGYGISYSKYSYELTAVSQNTLYLNQSSTMHIINDFDSVRSTLISELGTEFCEQNKC 680
Query: 671 GVQVDVKNVGSKDGAHTLLVFSTPPA-GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCK 729
++ VKN G G H +L+F+ G+ P KQL+ F+ V + AG + + + C+
Sbjct: 681 RARIGVKNHGEMAGKHPVLLFARQEKHGNGRPRKQLIGFQSVVLGAGERAEIEFEVSPCE 740
Query: 730 YLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
+LS + G + G H + + G ++ +S+
Sbjct: 741 HLSRANEDGLMVMEEGRHFLVVDGDEYPISV 771
>gi|449508468|ref|XP_004163321.1| PREDICTED: LOW QUALITY PROTEIN: probable beta-D-xylosidase 7-like
[Cucumis sativus]
Length = 783
Score = 761 bits (1964), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 360/754 (47%), Positives = 510/754 (67%), Gaps = 15/754 (1%)
Query: 20 SGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRL 79
+G +++ P+ACD + T+TLPFC+ LPI R DL+ RL+L EKV L++ +PRL
Sbjct: 30 AGSSSQPPYACDSSNPLTKTLPFCKTYLPIKLRARDLVSRLTLDEKVLQLVNTVPPIPRL 89
Query: 80 GIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEA 139
GI YEWWSEALHGV+NVG G + G ATSFPQVI TA+SF+ LW IG+ + EA
Sbjct: 90 GIPAYEWWSEALHGVANVGYGIRLNGTITAATSFPQVILTAASFDENLWYQIGQAIGTEA 149
Query: 140 RAMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD---- 194
RA+YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP+++GKY+ +YVRG+QG
Sbjct: 150 RAVYNAGQAKGMTFWTPNINIFRDPRWGRGQETPGEDPLMTGKYSVAYVRGIQGDAIEGG 209
Query: 195 --GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASV 252
G++LK +A CKHFTAYDLD WNG+ R+ F+AKV+ QD+ DT+ PF CV EGK + +
Sbjct: 210 KLGNQLKASACCKHFTAYDLDRWNGMTRYVFDAKVTMQDMADTYQPPFESCVEEGKASGI 269
Query: 253 MCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADA 312
MC+YN+VNGVP+CAD ++L T R +W+ NGYI SDCD+V + +D Q + PE+A AD
Sbjct: 270 MCAYNRVNGVPSCADHHLLTATARKQWKFNGYITSDCDAVSIIHDAQGYAKIPEDAVADV 329
Query: 313 IRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHL 372
+RAG+D++CG +L HT+SAV+ + + I+ AL N +V+MRLG+FDG P+ P+G +
Sbjct: 330 LRAGMDVNCGTYLKEHTKSAVEMKKVPMLHIDRALRNLFSVRMRLGLFDGNPTKLPFGQI 389
Query: 373 GPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYA 432
G VC+ HQ LAL+AAR+GIVLLKN LPLS ++AVIG N + T+ GNYA
Sbjct: 390 GRDQVCSQQHQNLALQAAREGIVLLKNSAKLLPLSKSNTHSLAVIGHNGNDPKTLRGNYA 449
Query: 433 GIACGYTTPLQGIGRYAR-TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSI 491
GI C TP QG+ Y + T++ +GC C + ++ A+ ++ D +LVMGLDQ+
Sbjct: 450 GIPCKSATPFQGLNNYVKNTVYHRGCNYANCTEATIY-QAVKIAKSVDYVVLVMGLDQTQ 508
Query: 492 EAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGY 551
E E DR L LPG+Q +L+++V+ A+K P ILV++SGGP+D++ AK + +I +I+WAGY
Sbjct: 509 EREDFDRTELGLPGKQDKLIAEVAKAAKXPVILVILSGGPVDISSAKYNEKIGSILWAGY 568
Query: 552 PGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYK 611
PGQAGGTAIA+I+FG NPGG+LP+TWYP ++I PMT+M MR S YPGRTYRFY
Sbjct: 569 PGQAGGTAIAEIIFGDHNPGGRLPLTWYPHDFI-KFPMTDMRMRADSSTGYPGRTYRFYN 627
Query: 612 GPVVYPFGHGMSYTNFVH---TVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK-CNR 667
GP VY FG+G+SY+N ++ +V+ + +++ P + + +S + + K C
Sbjct: 628 GPKVYEFGYGLSYSNHIYEFTSVSESKLLLSHPKASQPAKNSDLVSYRLVSELDKKFCES 687
Query: 668 LTLGVQVDVKNVGSKDGAHTLLVFSTPPAG-HWAPHKQLVAFEKVHVPAGAQQRVGINIH 726
T+ V V V+N G G H++L+F P + +P KQLV F+KV + AG ++ + +
Sbjct: 688 KTVNVTVGVRNEGEMGGKHSVLLFIKPSKPINGSPVKQLVGFKKVEINAGERREIEFLVS 747
Query: 727 VCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
C ++S G I G +++ +G +H + +
Sbjct: 748 PCDHISKASEEGLMIIEEGSYSLVVGDVEHPLDI 781
>gi|85813770|emb|CAJ65921.1| xylan 1,4-beta-xylosidase [Populus tremula x Populus alba]
Length = 704
Score = 760 bits (1963), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 390/671 (58%), Positives = 478/671 (71%), Gaps = 30/671 (4%)
Query: 5 IAFFFLGLILLSASSSGLAAREP-FACDP-KDATTRTLPFCQVSLPIPQRVNDLIGRLSL 62
+ F + L+ S+ A P FACD + + +L FC S+ I RV DL+ RL+L
Sbjct: 15 VVFLLFCMFLVFLSTHVSAQSSPVFACDVVSNPSLASLGFCNTSIGINDRVVDLVKRLTL 74
Query: 63 QEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASS 122
QEK+ L++ A V RLGI YEWWSEALHGVS VGPGT F D GATSFPQVI TA+S
Sbjct: 75 QEKIVFLVNSAGNVSRLGIPKYEWWSEALHGVSYVGPGTHFSDDVAGATSFPQVILTAAS 134
Query: 123 FNATLWEAIGRV-----VSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPV 177
FN +L+EAIG+V VS EARAMYN G AGLT+WSPN+NIFRDPRWGRGQETPGEDP+
Sbjct: 135 FNTSLFEAIGKVYYTQVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPL 194
Query: 178 LSGKYAASYVRGLQGSDG---DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV-SKQDIE 233
LS KY + YV+GLQ D D+LKVAA CKH+TAYDLDNW G DR+HFNA V +KQD++
Sbjct: 195 LSSKYGSCYVKGLQQRDDGDPDKLKVAACCKHYTAYDLDNWKGSDRYHFNAVVVTKQDMD 254
Query: 234 DTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY-------IV 286
DTF PF+ CV++G VASVMCSYNQVNG PTCADP++L IRGEW LNGY IV
Sbjct: 255 DTFQPPFKSCVIDGNVASVMCSYNQVNGKPTCADPDLLSGVIRGEWNLNGYQWGCCRYIV 314
Query: 287 SDCDSVGVYYDTQHFTSTPEEA------AADAIRAGLDLDCGPFLGLHTESAVQRGLLSE 340
+DCDS+ V+Y +Q++T TPEEA A +++ G+DL+CG FLG HTE+AV+ GL++E
Sbjct: 315 TDCDSLDVFYKSQNYTKTPEEAAAAAILAGNSLVTGVDLNCGSFLGQHTEAAVKGGLVNE 374
Query: 341 IDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
I+ A+ N MRLG FDG+PS Q YG LGPKDVCT ++QELA EAARQGIVLLKN
Sbjct: 375 HAIDIAVSNNFATLMRLGFFDGDPSKQLYGKLGPKDVCTAENQELAREAARQGIVLLKNT 434
Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY-AGIACGYTTPLQGIGRYARTIHQQGCKD 459
SLPLS + +AVIGPN++VT TMIGNY G C YTTPLQG+ T + GC +
Sbjct: 435 AGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGGTPCKYTTPLQGLAASVATTYLPGCSN 494
Query: 460 VACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASK 519
VAC+ Q+ A + ADAT+LVMG D SIEAE+ DR +LLPG+QQ L++ V+ S
Sbjct: 495 VACSTAQV-DDAKKLAAAADATVLVMGADLSIEAESRDRVDVLLPGQQQLLITAVANVSC 553
Query: 520 GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPG----GKLP 575
GP ILV+MSGG +DV+FA+ + +I +I+W GYPG+AGG AIADI+FG NP G+LP
Sbjct: 554 GPVILVIMSGGGMDVSFARTNDKITSILWVGYPGEAGGAAIADIIFGYYNPSTHQPGRLP 613
Query: 576 MTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAP 635
MTWYPQ Y+ +PMT M MRP S YPGRTYRFY G VY FG G+SY+ F H + AP
Sbjct: 614 MTWYPQSYVDKVPMTNMNMRPDPSNGYPGRTYRFYTGETVYSFGDGLSYSQFTHELIQAP 673
Query: 636 TVVAVPLDGRH 646
+V VPL+ H
Sbjct: 674 QLVYVPLEESH 684
>gi|449465962|ref|XP_004150696.1| PREDICTED: probable beta-D-xylosidase 7-like [Cucumis sativus]
Length = 783
Score = 760 bits (1962), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 360/754 (47%), Positives = 510/754 (67%), Gaps = 15/754 (1%)
Query: 20 SGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRL 79
+G +++ P+ACD + T+TLPFC+ LPI R DL+ RL+L EKV L++ +PRL
Sbjct: 30 AGSSSQPPYACDSSNPLTKTLPFCKTYLPIKLRARDLVSRLTLDEKVLQLVNTVPPIPRL 89
Query: 80 GIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEA 139
GI YEWWSEALHGV+NVG G + G ATSFPQVI TA+SF+ LW IG+ + EA
Sbjct: 90 GIPAYEWWSEALHGVANVGYGIRLNGTITAATSFPQVILTAASFDENLWYQIGQAIGTEA 149
Query: 140 RAMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD---- 194
RA+YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP+++GKY+ +YVRG+QG
Sbjct: 150 RAVYNAGQAKGMTFWTPNINIFRDPRWGRGQETPGEDPLMTGKYSVAYVRGIQGDAIEGG 209
Query: 195 --GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASV 252
G++LK +A CKHFTAYDLD WNG+ R+ F+AKV+ QD+ DT+ PF CV EGK + +
Sbjct: 210 KLGNQLKASACCKHFTAYDLDRWNGMTRYVFDAKVTMQDMADTYQPPFESCVEEGKASGI 269
Query: 253 MCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADA 312
MC+YN+VNGVP+CAD ++L T R +W+ NGYI SDCD+V + +D Q + PE+A AD
Sbjct: 270 MCAYNRVNGVPSCADHHLLTATARKQWKFNGYITSDCDAVSIIHDAQGYAKIPEDAVADV 329
Query: 313 IRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHL 372
+RAG+D++CG +L HT+SAV+ + + I+ AL N +V+MRLG+FDG P+ P+G +
Sbjct: 330 LRAGMDVNCGTYLKEHTKSAVEMKKVPMLHIDRALRNLFSVRMRLGLFDGNPTKLPFGQI 389
Query: 373 GPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYA 432
G VC+ HQ LAL+AAR+GIVLLKN LPLS ++AVIG N + T+ GNYA
Sbjct: 390 GRDQVCSQQHQNLALQAAREGIVLLKNSAKLLPLSKSNTHSLAVIGHNGNDPKTLRGNYA 449
Query: 433 GIACGYTTPLQGIGRYAR-TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSI 491
GI C TP QG+ Y + T++ +GC C + ++ A+ ++ D +LVMGLDQ+
Sbjct: 450 GIPCKSATPFQGLNNYVKNTVYHRGCNYANCTEATIY-QAVKIAKSVDYVVLVMGLDQTQ 508
Query: 492 EAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGY 551
E E DR L LPG+Q +L+++V+ A+K P ILV++SGGP+D++ AK + +I +I+WAGY
Sbjct: 509 EREDFDRTELGLPGKQDKLIAEVAKAAKRPVILVILSGGPVDISSAKYNEKIGSILWAGY 568
Query: 552 PGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYK 611
PGQAGGTAIA+I+FG NPGG+LP+TWYP ++I PMT+M MR S YPGRTYRFY
Sbjct: 569 PGQAGGTAIAEIIFGDHNPGGRLPLTWYPHDFI-KFPMTDMRMRADSSTGYPGRTYRFYN 627
Query: 612 GPVVYPFGHGMSYTNFVH---TVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK-CNR 667
GP VY FG+G+SY+N ++ +V+ + +++ P + + +S + + K C
Sbjct: 628 GPKVYEFGYGLSYSNHIYEFTSVSESKLLLSHPKASQPAKNSDLVSYRLVSELDKKFCES 687
Query: 668 LTLGVQVDVKNVGSKDGAHTLLVFSTPPAG-HWAPHKQLVAFEKVHVPAGAQQRVGINIH 726
T+ V V V+N G G H++L+F P + +P KQLV F+KV + AG ++ + +
Sbjct: 688 KTVNVTVGVRNEGEMGGKHSVLLFIKPSKPINGSPVKQLVGFKKVEINAGERREIEFLVS 747
Query: 727 VCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
C ++S G I G +++ +G +H + +
Sbjct: 748 PCDHISKASEEGLMIIEEGSYSLVVGDVEHPLDI 781
>gi|371917286|dbj|BAL44719.1| SlArf/Xyl4 [Solanum lycopersicum]
Length = 775
Score = 754 bits (1947), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 362/766 (47%), Positives = 511/766 (66%), Gaps = 22/766 (2%)
Query: 12 LILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLIS 71
LI LS S + + PF+CD + T++L FCQ LPI RV DL+ RL+L EK+ L++
Sbjct: 13 LISLSLVSIVQSTQPPFSCDSSNPQTKSLKFCQTGLPISVRVLDLVSRLTLDEKISQLVN 72
Query: 72 GAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAI 131
A A+PRLGI YEWWSE+LHGV + G G F G GATSFPQVI TA++F+ LW I
Sbjct: 73 SAPAIPRLGIPAYEWWSESLHGVGSAGKGIFFNGSIAGATSFPQVILTAATFDENLWYRI 132
Query: 132 GRVVSDEARAMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGL 190
G+V+ EAR +YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP+++GKYA YVRG+
Sbjct: 133 GQVIGVEARGVYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPIMTGKYAIRYVRGV 192
Query: 191 QGS--DGDRLK-----VAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMC 243
QG +G +LK +A CKHFTAYDLD W +DRF FNA V+ QD+ DTF PF+ C
Sbjct: 193 QGDSFNGGQLKKGHLQASACCKHFTAYDLDQWKNLDRFSFNAIVTPQDMADTFQPPFQDC 252
Query: 244 VMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTS 303
+ + + + +MCSYN VNG+P+CA+ N+L +T R +W +GYI SDCD+V V +D + +
Sbjct: 253 IQKAQASGIMCSYNSVNGIPSCANYNLLTKTARQQWGFHGYITSDCDAVQVMHDNHRYGN 312
Query: 304 TPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGE 363
TPE++ A A++AG+D+DCG +L +T+SAV + +S++ I+ AL N +++MRLG+F+G+
Sbjct: 313 TPEDSTAFALKAGMDIDCGDYLKKYTKSAVMKKKVSQVHIDRALHNLFSIRMRLGLFNGD 372
Query: 364 PSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDV 423
P Q YG++ P VC P HQ+LALEAAR GIVLLKN G LPLS + ++AVIG N++
Sbjct: 373 PRKQLYGNISPSQVCAPQHQQLALEAARNGIVLLKNTGKLLPLSKAKTNSLAVIGHNANN 432
Query: 424 TVTMIGNYAGIACGYTTPLQGIGRYARTI-HQQGCKDVACADDQLFGAAIDASRQADATI 482
+ GNY G C Y L+ + YA+++ +QQGC C + A++ +R AD +
Sbjct: 433 AYILRGNYDGPPCKYIEILKALVGYAKSVQYQQGCNAANCTSANI-DQAVNIARNADYVV 491
Query: 483 LVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPR 542
L+MGLDQ+ E E DR L+LPG+Q+ L++ V+ A+K P ILV++SGGP+D++FAK +P+
Sbjct: 492 LIMGLDQTQEREQFDRDDLVLPGQQENLINSVAKAAKKPVILVILSGGPVDISFAKYNPK 551
Query: 543 IAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRY 602
I +I+WAGYPG+AGG A+A+I+FG NPGGKLP+TWYPQ ++ +PMT+M MRP Y
Sbjct: 552 IGSILWAGYPGEAGGIALAEIIFGEHNPGGKLPVTWYPQAFV-KIPMTDMRMRPDPKTGY 610
Query: 603 PGRTYRFYKGPVVYPFGHGMSYTNFVHTVANA-PTVVAVPLDGRHGSINATISGKAIKVT 661
PGRTYRFYKGP VY FG+G+SYT + + +A P + + + S+ + +I+ T
Sbjct: 611 PGRTYRFYKGPKVYEFGYGLSYTTYSYGFHSATPNTIQL---NQLLSVKTVENSDSIRYT 667
Query: 662 HA------KCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVP 714
C + V V+N G DG H +L+F A + +P KQLV F+ V +
Sbjct: 668 FVDEIGSDNCEKAKFSAHVSVENSGEMDGKHPVLLFVKQDKARNGSPIKQLVGFQSVSLK 727
Query: 715 AGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
AG ++ I C++LS + G I G + +G +H +++
Sbjct: 728 AGENSQLVFEISPCEHLSSANEDGLMMIEEGSRYLVVGDAEHPINI 773
>gi|168065036|ref|XP_001784462.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162663987|gb|EDQ50724.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 726
Score = 753 bits (1943), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/732 (51%), Positives = 502/732 (68%), Gaps = 27/732 (3%)
Query: 40 LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGP 99
+ FC SL RV DL+ RL+L+EKV L++ A+A+PRL I YEWW E LHGV++V
Sbjct: 1 MRFCDTSLSDEIRVFDLVSRLTLEEKVTQLVNTASAIPRLSIPAYEWWQEGLHGVAHV-- 58
Query: 100 GTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNI 159
FGG P ATSFP I T +SFN LW IG+ S EARA YN G AGLTYWSP +NI
Sbjct: 59 --SFGGSLPRATSFPLPILTTASFNKDLWNQIGQAFSTEARAFYNDGIAGLTYWSPVINI 116
Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD--RLKVAASCKHFTAYDLDNWNG 217
RDPRWGR QET GEDP + YA +V+G+Q D + RLK++A CKHFTAYD+DNW G
Sbjct: 117 ARDPRWGRIQETSGEDPYTTSAYATHFVQGMQEGDANSKRLKLSACCKHFTAYDVDNWEG 176
Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
+DR+HF+AK ++ DT++ PF+ CV EG+ AS+MCSYN+VNGVPTCA+ + L+ T+R
Sbjct: 177 IDRYHFDAKA---NLADTYNPPFQSCVQEGRSASLMCSYNKVNGVPTCANYDFLENTVRR 233
Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGL 337
W LNGYIVSDCDSV V +++ ++ T E+AAADA+ AGLDL+CG +L +TE AV G
Sbjct: 234 AWGLNGYIVSDCDSVLVMHESTNYAPTTEDAAADALNAGLDLNCGDYLASYTEGAVAMGK 293
Query: 338 LSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLL 397
++ ++NA+ N V+MRLGMFDG P++Q +G++G DVCTP HQELA+EAARQGIVLL
Sbjct: 294 VNASRVDNAVYNVFLVRMRLGMFDGNPANQEFGNIGVADVCTPAHQELAVEAARQGIVLL 353
Query: 398 KNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI-----GRYARTI 452
KN G LPLS ++ AVIGPN++ T TM+GNY GI C Y TPLQG+ G Y +
Sbjct: 354 KNDGNILPLS--KNINTAVIGPNANATHTMLGNYEGIPCQYITPLQGLVKFGSGDYHKVW 411
Query: 453 HQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVS 512
+GC + AC D +A+ + ADA +LV+GL Q E+EALDR LLLPG QQ L+
Sbjct: 412 FSEGCVNTACQQDDQISSAVSTAAVADAVVLVVGLSQVQESEALDRTSLLLPGYQQTLID 471
Query: 513 KVSMASKG-PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPG 571
+V+ A+ G P +LVLM GP+D+ FAKND RI +I+W GYPGQ+GG AIA+++FG NPG
Sbjct: 472 EVAGAAAGRPVVLVLMCAGPVDINFAKNDKRIQSILWVGYPGQSGGQAIAEVIFGAHNPG 531
Query: 572 GKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTV 631
GKLPM+WYP++Y T + MT M MRP YPGRTYRFY G +Y FG+G+SYT + H+
Sbjct: 532 GKLPMSWYPEDY-TKISMTNMNMRPDSRSNYPGRTYRFYTGEKIYDFGYGLSYTEYKHSF 590
Query: 632 ANAPTVVAVPLDGRHGSINATISGKAIKVTHAK-CNRLTLGVQVDVKNVGSKDGAHTLLV 690
A APT V P SI++ + +K C+ V ++V+N+G+ G HTLL+
Sbjct: 591 ALAPTTVMTP------SIHSQLCDPHQTSAGSKTCSSSNFDVHINVENIGAMAGNHTLLL 644
Query: 691 FSTPPAG--HWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHN 748
F T P+ + P KQL AF+ V++ +G+Q++V + ++ C++L V GTR + G H
Sbjct: 645 FFTAPSAGKNGTPLKQLAAFDSVYIRSGSQEKVVLTLNPCQHLGTVAEDGTRMLEAGNHI 704
Query: 749 IHIGGTKHSVSL 760
+ +G KHS+S+
Sbjct: 705 LSVGDAKHSLSV 716
>gi|358349509|ref|XP_003638778.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
gi|355504713|gb|AES85916.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
Length = 776
Score = 750 bits (1937), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/780 (47%), Positives = 510/780 (65%), Gaps = 28/780 (3%)
Query: 1 MASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
++ST F + + L+ + S LA PFACD + +TR+ PFC LPI QR DL+ RL
Sbjct: 3 LSSTFTFVTIISLFLTLTYSVLAQLPPFACDYSNPSTRSYPFCNPKLPITQRTKDLVSRL 62
Query: 61 SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
+L EK+ L++ A +PRLGI YEWWSEALHG+ NVG G F G ATSFPQVI TA
Sbjct: 63 TLDEKLAQLVNSAPPIPRLGIPAYEWWSEALHGIGNVGRGIFFNGSITSATSFPQVILTA 122
Query: 121 SSFNATLWEAIGRVVSDEARAMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLS 179
+SF++ LW IG+ + EARA+YNGG A G+T+W+PN+NIFRDPRWGRGQET GEDP+++
Sbjct: 123 ASFDSHLWYRIGQAIGVEARAIYNGGQAMGMTFWAPNINIFRDPRWGRGQETAGEDPMMT 182
Query: 180 GKYAASYVRGLQGSDGD------RLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIE 233
YA SYVRGLQG L+ +A CKHFTAYDLDNW GV+RFHF+A+VS QD+
Sbjct: 183 SNYAVSYVRGLQGDSFQGGKLRGHLQASACCKHFTAYDLDNWKGVNRFHFDARVSLQDLA 242
Query: 234 DTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVG 293
DT+ PFR C+ +G+ + +MC+YN+VNG+P+CAD N+L T+R +W +GYIVSDC +VG
Sbjct: 243 DTYQPPFRSCIEQGRASGIMCAYNRVNGIPSCADFNLLTNTVRKQWEFHGYIVSDCGAVG 302
Query: 294 VYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTV 353
+ +D Q + + E+A AD + AG+DL+CG +L H +SAVQ+ L + I+ AL N ++
Sbjct: 303 IIHDEQGYAKSAEDAVADVLHAGMDLECGSYLTDHAKSAVQQKKLPIVRIDRALHNLFSI 362
Query: 354 QMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRT 413
++RLG FDG P+ P+G +GP VC+ +H LALEAAR GIVLLKN LPL +
Sbjct: 363 RIRLGQFDGNPAKLPFGMIGPNHVCSENHLYLALEAARNGIVLLKNTASLLPLPKTS-IS 421
Query: 414 VAVIGPNSDVT-VTMIGNYAGIACGYTTPLQGIGRYART-IHQQGCKDVACADDQLFGAA 471
+AVIGPN++ + +T++GNYAG C T LQG Y + + GC A
Sbjct: 422 LAVIGPNANASPLTLLGNYAGPPCKSITILQGFQHYVKNAVFHPGCDGGPKCASAPIDKA 481
Query: 472 IDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGP 531
+ ++ AD +LVMGLDQS+E E DR L LPG+Q EL++ V+ ASK P ILVL+ GGP
Sbjct: 482 VKVAKNADYVVLVMGLDQSVEREERDRVHLDLPGKQLELINSVAKASKRPVILVLLCGGP 541
Query: 532 IDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTE 591
ID++ AKN+ +I IIWAGYPG+ GG A+A I+FG NPGG+LP+TWYP++YI +PMT+
Sbjct: 542 IDISSAKNNDKIGGIIWAGYPGELGGIALAQIIFGDHNPGGRLPITWYPKDYI-KVPMTD 600
Query: 592 MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINA 651
M MR + YPGRTYRFYKGP VY FGHG+SYT + + V+V D H + ++
Sbjct: 601 MRMRADPTTGYPGRTYRFYKGPTVYEFGHGLSYTKYSY------EFVSVTHDKLHFNQSS 654
Query: 652 ---------TISGKAI-KVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGH-WA 700
TI K + ++ C +++ V V VKN G+ G H +L+F P +
Sbjct: 655 THLMTENSETIRYKLVSELDEETCKSMSVSVTVGVKNHGNIVGRHPILLFMRPQKHRTRS 714
Query: 701 PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
P KQLV F + + AG VG + C++LS + +G + I G H +H+G ++ + +
Sbjct: 715 PMKQLVGFHSLLLDAGEMSHVGFELSPCEHLSRANEAGLKIIEEGSHLLHVGEEEYLIDI 774
>gi|296084630|emb|CBI25718.3| unnamed protein product [Vitis vinifera]
Length = 768
Score = 750 bits (1937), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 368/772 (47%), Positives = 496/772 (64%), Gaps = 16/772 (2%)
Query: 1 MASTIAFFFLGLIL--LSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIG 58
M++ F F+ L L L S + F C P T PFC SLPI R L+
Sbjct: 1 MSTEWRFLFICLFLQVLPLFSISESTHPQFPCMP--PTNSDYPFCNTSLPISTRAQSLVS 58
Query: 59 RLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVIT 118
L+L EK++ L AAA+PRL I YEWWSE+LHG++ GPG F G ATSFPQV+
Sbjct: 59 LLTLSEKIQQLSDEAAAIPRLYIPAYEWWSESLHGIATNGPGVSFNGTVSAATSFPQVLL 118
Query: 119 TASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVL 178
TA+SFN +LW +IG ++ EARAMYN G AGLT+W+PN+NIFRDPRWGRGQETPGEDP++
Sbjct: 119 TAASFNRSLWFSIGSAIAVEARAMYNVGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMV 178
Query: 179 SGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFD 237
+ YA +VRG QG SDGD L ++A CKH TAYDL+ W R+ F+A VS QD+EDT+
Sbjct: 179 ASAYAVEFVRGFQGDSDGDGLMLSACCKHLTAYDLEKWGNFSRYSFDAVVSNQDLEDTYQ 238
Query: 238 VPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYD 297
PFR CV +GK + +MCSYN+VNGVP CA ++ ++ + EW GYI SDCD+V Y+
Sbjct: 239 PPFRSCVQQGKASCLMCSYNRVNGVPACARQDLFQKA-KTEWGFKGYITSDCDAVATVYE 297
Query: 298 TQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRL 357
QH+ ++PE+A AD ++AG D++CG ++ HT+SA+ +G + E DI+ AL N +VQMRL
Sbjct: 298 YQHYANSPEDAVADVLKAGTDINCGSYMLRHTQSAIDQGKVKEEDIDRALFNLFSVQMRL 357
Query: 358 GMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVI 417
G+FDG+P++ YG+LGPKDVCT +H+ LALEAARQGIVLLKN LPL R ++A+I
Sbjct: 358 GLFDGDPANGLYGNLGPKDVCTKEHRTLALEAARQGIVLLKNDKKFLPLDKSRISSLAII 417
Query: 418 GPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA-RTIHQQGCKDVACADDQLFGAAIDASR 476
GP +D + G Y GI C + ++G+ Y +T GC DV C D F A+ +R
Sbjct: 418 GPQADQPF-LGGGYTGIPCKPESLVEGLKTYVEKTSFAAGCVDVPCLSDTGFDEAVSIAR 476
Query: 477 QADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAF 536
+AD ++V GLD S E E DR LLLPG+Q L+S V+ A + P +LVL GGP+DV+F
Sbjct: 477 KADIVVVVAGLDLSQETEDHDRVSLLLPGKQMALISSVASAIQKPLVLVLTGGGPLDVSF 536
Query: 537 AKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRP 596
A+ DPRIA+I+W GYPG+AG A+A+I+FG NPGG+LPMTWYP+ + T +PM +M MR
Sbjct: 537 AEQDPRIASILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TRVPMNDMNMRA 595
Query: 597 SQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGK 656
+ YPGRTYRFY G VY FG G+SYT F + +AP + + S +
Sbjct: 596 DPYRGYPGRTYRFYIGHRVYGFGQGLSYTKFAYQFVSAPNKLNLLRSSDTVSSKNLPRQR 655
Query: 657 AIKVTH------AKCNRLTLGVQVDVKNVGSKDGAHTLLVFS-TPPAGHWAPHKQLVAFE 709
+V + C+ L V++ V NVG DG+H +++FS P P KQL+ F
Sbjct: 656 REEVNYFHIEELDTCDSLRFHVEISVTNVGDMDGSHVVMLFSRVPKIVKGTPEKQLIGFS 715
Query: 710 KVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
+VH + I + C++ S+ + G R +PLG+H I +G HSVS+
Sbjct: 716 RVHTVSRRSTETSIMVDPCEHFSIANEQGKRIMPLGDHTIMLGDVVHSVSVE 767
>gi|32488698|emb|CAE03635.1| OSJNBb0003B01.27 [Oryza sativa Japonica Group]
Length = 839
Score = 749 bits (1933), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 363/639 (56%), Positives = 461/639 (72%), Gaps = 6/639 (0%)
Query: 125 ATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAA 184
+ ++ I VVS EARAM+N G AGLT+WSPN+NIFRDPRWGRGQETPGEDP+L+ KYA
Sbjct: 202 SMMYNLIVLVVSTEARAMHNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAV 261
Query: 185 SYVRGLQGSDG--DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRM 242
YV GLQ + G D LKVAA CKH+TAYD+DNW GV+R+ F+A VS+QD++DTF PF+
Sbjct: 262 GYVTGLQDAGGGSDALKVAACCKHYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQPPFKS 321
Query: 243 CVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFT 302
CV++G VASVMCSYN+VNG PTCAD ++L IRG+W+LNGYIVSDCDSV V Y+ QH+T
Sbjct: 322 CVIDGNVASVMCSYNKVNGKPTCADKDLLSGVIRGDWKLNGYIVSDCDSVDVLYNNQHYT 381
Query: 303 STPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDG 362
PE+AAA I++GLDL+CG FL HT +AVQ G LSE D++ A+ N V MRLG FDG
Sbjct: 382 KNPEDAAAITIKSGLDLNCGNFLAQHTVAAVQAGKLSESDVDRAITNNFIVLMRLGFFDG 441
Query: 363 EPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSD 422
+P P+G LGPKDVCT +QELA EAARQGIVLLKN G +LPLS +++AVIGPN++
Sbjct: 442 DPRKLPFGSLGPKDVCTSSNQELAREAARQGIVLLKNTG-ALPLSAKSIKSMAVIGPNAN 500
Query: 423 VTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQL-FGAAIDASRQADAT 481
+ TMIGNY G C YTTPLQG+G T++Q GC +V C+ + L AA A+ AD T
Sbjct: 501 ASFTMIGNYEGTPCKYTTPLQGLGANVATVYQPGCTNVGCSGNSLQLSAATQAAASADVT 560
Query: 482 ILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDP 541
+LV+G DQS+E E+LDR LLLPG+Q +LVS V+ AS+GP ILV+MSGGP D++FAK+
Sbjct: 561 VLVVGADQSVERESLDRTSLLLPGQQPQLVSAVANASRGPVILVVMSGGPFDISFAKSSD 620
Query: 542 RIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKR 601
+I+AI+W GYPG+AGG A+ADILFG NPGG+LP+TWYP + + MT+M MRP S
Sbjct: 621 KISAILWVGYPGEAGGAALADILFGYHNPGGRLPVTWYPASFADKVSMTDMRMRPDSSTG 680
Query: 602 YPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVT 661
YPGRTYRFY G VY FG G+SYT F H++ +AP VAV L H T +++
Sbjct: 681 YPGRTYRFYTGDTVYAFGDGLSYTKFAHSLVSAPEQVAVQLAEGHAC--HTEHCFSVEAA 738
Query: 662 HAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRV 721
C L+ V + V+N G G HT+ +FS+PP+ H AP K L+ FEKV + G V
Sbjct: 739 GEHCGSLSFDVHLRVRNAGGMAGGHTVFLFSSPPSVHSAPAKHLLGFEKVSLEPGQAGVV 798
Query: 722 GINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
+ VCK LSVVD G R++ LG H +H+G KH+++L
Sbjct: 799 AFKVDVCKDLSVVDELGNRKVALGSHTLHVGDLKHTLNL 837
Score = 129 bits (324), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 63/106 (59%), Positives = 74/106 (69%)
Query: 28 FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
FACD +AT FC + R DL+GRL+L EKV L++ AA+PRLGI YEWW
Sbjct: 31 FACDASNATVSGYGFCDRTKSSAARAADLLGRLTLAEKVGFLVNKQAALPRLGIPAYEWW 90
Query: 88 SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGR 133
SEALHGVS VGPGT+F PGATSFPQ I TA+SFNA+L+ AIG
Sbjct: 91 SEALHGVSYVGPGTRFSTLVPGATSFPQPILTAASFNASLFRAIGE 136
>gi|357485313|ref|XP_003612944.1| Beta-D-xylosidase [Medicago truncatula]
gi|355514279|gb|AES95902.1| Beta-D-xylosidase [Medicago truncatula]
Length = 783
Score = 746 bits (1926), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 361/782 (46%), Positives = 501/782 (64%), Gaps = 31/782 (3%)
Query: 3 STIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSL 62
STI F L+L+ + C P + PFC +SLPI R LI L+L
Sbjct: 8 STIIIFLFSLLLIHLPK--FFTTPDYPCKPPHSH---YPFCNISLPISTRTTSLISLLTL 62
Query: 63 QEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASS 122
+K+ L + A+++ LGI Y+WWSEALHG++ GPG F G AT+FPQVI +A++
Sbjct: 63 SDKINQLSNTASSISHLGIPSYQWWSEALHGIATNGPGVNFNGSVKSATNFPQVIVSAAA 122
Query: 123 FNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKY 182
FN +LW IG V E RAM+N G AGL++W+PNVN+FRDPRWGRGQETPGEDP++ Y
Sbjct: 123 FNRSLWFLIGYAVGVEGRAMFNVGQAGLSFWAPNVNVFRDPRWGRGQETPGEDPMVGSAY 182
Query: 183 AASYVRGLQGSDG------------DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQ 230
A +VRG+QG DG D L V+A CKHFTAYDL+ W R++FNA V++Q
Sbjct: 183 AVEFVRGIQGVDGIKKVLNDHDSDDDGLMVSACCKHFTAYDLEKWGEFSRYNFNAVVTQQ 242
Query: 231 DIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCD 290
D+EDT+ PFR CV +GK + +MCSYN+VNGVP CA ++L +R +W GYI SDCD
Sbjct: 243 DLEDTYQPPFRGCVQQGKASCLMCSYNEVNGVPACASKDLLG-LVRNKWGFEGYIASDCD 301
Query: 291 SVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNT 350
+V ++ Q + + E+A AD ++AG+D++CG F+ HTESA+++GL+ E D++ AL N
Sbjct: 302 AVATVFEYQKYAKSAEDAVADVLKAGMDINCGTFMLRHTESAIEQGLVKEEDLDRALFNL 361
Query: 351 LTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIR 410
+VQMRLG+F+G+P +G LGP+DVCTP+H++LALEAARQGIVLLKN LPL
Sbjct: 362 FSVQMRLGLFNGDPEKGKFGKLGPQDVCTPEHKKLALEAARQGIVLLKNDNKFLPLDKKD 421
Query: 411 HRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTI-HQQGCKDVACADDQLFG 469
++A+IGP + T + G Y+GI C + G+ Y +TI + GC DV C D F
Sbjct: 422 RVSLAIIGPMA-TTSELGGGYSGIPCSPRSLYDGLKEYVKTISYAFGCSDVKCDSDDGFA 480
Query: 470 AAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSG 529
AID ++QAD ++V GLD ++E E LDR LLLPG+Q +LVS+V+ ASK P ILVL G
Sbjct: 481 VAIDIAKQADFVVIVAGLDTTLETEDLDRVSLLLPGKQMDLVSRVAAASKRPVILVLTGG 540
Query: 530 GPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM 589
GP+DV+FA+++ I +I+W GYPG+AGG A+A+I+FG NP G+LPMTWYP+ + TN+PM
Sbjct: 541 GPLDVSFAESNQLITSILWIGYPGEAGGKALAEIIFGEFNPAGRLPMTWYPESF-TNVPM 599
Query: 590 TEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLD---GRH 646
+M MR S+ YPGRTYRFY G +Y FGHG+SY++F + V +AP+ +++ G
Sbjct: 600 NDMGMRADPSRGYPGRTYRFYTGSRIYGFGHGLSYSDFSYRVLSAPSKLSLSKTTNGGLR 659
Query: 647 GSINATISGKAIKVTHA------KCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG-HW 699
S+ + +V H CN L+ V + V NVG DG+H +++FS P
Sbjct: 660 RSLLNKVEKDVFEVDHVHVDELQNCNSLSFSVHISVMNVGDMDGSHVVMLFSKWPKNIQG 719
Query: 700 APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVS 759
+P QLV ++H + I C++ S D G R +PLG H +++G +H VS
Sbjct: 720 SPESQLVGPSRLHTVSNKSIETSILADPCEHFSFADEQGKRILPLGNHILNVGDVEHIVS 779
Query: 760 LH 761
+
Sbjct: 780 IE 781
>gi|225469218|ref|XP_002264031.1| PREDICTED: probable beta-D-xylosidase 6-like [Vitis vinifera]
Length = 789
Score = 741 bits (1914), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 368/793 (46%), Positives = 496/793 (62%), Gaps = 37/793 (4%)
Query: 1 MASTIAFFFLGLIL--LSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIG 58
M++ F F+ L L L S + F C P T PFC SLPI R L+
Sbjct: 1 MSTEWRFLFICLFLQVLPLFSISESTHPQFPCMP--PTNSDYPFCNTSLPISTRAQSLVS 58
Query: 59 RLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVIT 118
L+L EK++ L AAA+PRL I YEWWSE+LHG++ GPG F G ATSFPQV+
Sbjct: 59 LLTLSEKIQQLSDEAAAIPRLYIPAYEWWSESLHGIATNGPGVSFNGTVSAATSFPQVLL 118
Query: 119 TASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVL 178
TA+SFN +LW +IG ++ EARAMYN G AGLT+W+PN+NIFRDPRWGRGQETPGEDP++
Sbjct: 119 TAASFNRSLWFSIGSAIAVEARAMYNVGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMV 178
Query: 179 SGKYAASYVRGLQG----------------------SDGDRLKVAASCKHFTAYDLDNWN 216
+ YA +VRG QG SDGD L ++A CKH TAYDL+ W
Sbjct: 179 ASAYAVEFVRGFQGGNWKGGDEIRGAVGKKRVLRGDSDGDGLMLSACCKHLTAYDLEKWG 238
Query: 217 GVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIR 276
R+ F+A VS QD+EDT+ PFR CV +GK + +MCSYN+VNGVP CA ++ ++ +
Sbjct: 239 NFSRYSFDAVVSNQDLEDTYQPPFRSCVQQGKASCLMCSYNRVNGVPACARQDLFQKA-K 297
Query: 277 GEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRG 336
EW GYI SDCD+V Y+ QH+ ++PE+A AD ++AG D++CG ++ HT+SA+ +G
Sbjct: 298 TEWGFKGYITSDCDAVATVYEYQHYANSPEDAVADVLKAGTDINCGSYMLRHTQSAIDQG 357
Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
+ E DI+ AL N +VQMRLG+FDG+P++ YG+LGPKDVCT +H+ LALEAARQGIVL
Sbjct: 358 KVKEEDIDRALFNLFSVQMRLGLFDGDPANGLYGNLGPKDVCTKEHRTLALEAARQGIVL 417
Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA-RTIHQQ 455
LKN LPL R ++A+IGP +D + G Y GI C + ++G+ Y +T
Sbjct: 418 LKNDKKFLPLDKSRISSLAIIGPQADQPF-LGGGYTGIPCKPESLVEGLKTYVEKTSFAA 476
Query: 456 GCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVS 515
GC DV C D F A+ +R+AD ++V GLD S E E DR LLLPG+Q L+S V+
Sbjct: 477 GCVDVPCLSDTGFDEAVSIARKADIVVVVAGLDLSQETEDHDRVSLLLPGKQMALISSVA 536
Query: 516 MASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLP 575
A + P +LVL GGP+DV+FA+ DPRIA+I+W GYPG+AG A+A+I+FG NPGG+LP
Sbjct: 537 SAIQKPLVLVLTGGGPLDVSFAEQDPRIASILWIGYPGEAGAKALAEIIFGDFNPGGRLP 596
Query: 576 MTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAP 635
MTWYP+ + T +PM +M MR + YPGRTYRFY G VY FG G+SYT F + +AP
Sbjct: 597 MTWYPESF-TRVPMNDMNMRADPYRGYPGRTYRFYIGHRVYGFGQGLSYTKFAYQFVSAP 655
Query: 636 TVVAVPLDGRHGSINATISGKAIKVTH------AKCNRLTLGVQVDVKNVGSKDGAHTLL 689
+ + S + +V + C+ L V++ V NVG DG+H ++
Sbjct: 656 NKLNLLRSSDTVSSKNLPRQRREEVNYFHIEELDTCDSLRFHVEISVTNVGDMDGSHVVM 715
Query: 690 VFS-TPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHN 748
+FS P P KQL+ F +VH + I + C++ S+ + G R +PLG+H
Sbjct: 716 LFSRVPKIVKGTPEKQLIGFSRVHTVSRRSTETSIMVDPCEHFSIANEQGKRIMPLGDHT 775
Query: 749 IHIGGTKHSVSLH 761
I +G HSVS+
Sbjct: 776 IMLGDVVHSVSVE 788
>gi|302141935|emb|CBI19138.3| unnamed protein product [Vitis vinifera]
Length = 1411
Score = 741 bits (1914), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 363/749 (48%), Positives = 499/749 (66%), Gaps = 37/749 (4%)
Query: 22 LAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGI 81
L++ PFACD D T++ FC +L I QR +DLI RL+L EK+ LIS AA++PRLGI
Sbjct: 692 LSSSPPFACDSSDPLTKSYAFCNTTLRISQRASDLISRLTLDEKISQLISSAASIPRLGI 751
Query: 82 KGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA 141
YEWWSEALHG+ + G +F G ATSFPQVI TA+SF+A LW IG+ + E RA
Sbjct: 752 PAYEWWSEALHGIRDRH-GIRFNGTIRSATSFPQVILTAASFDAHLWYRIGQAIGIETRA 810
Query: 142 MYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGS-----DG 195
MYN G A G+T+W+PN+NIFRDPRWGRGQETPGEDPV++GKYA SYVRGLQG
Sbjct: 811 MYNAGQAMGMTFWAPNINIFRDPRWGRGQETPGEDPVVAGKYAVSYVRGLQGDTFEGGKV 870
Query: 196 DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCS 255
D L+ +A CKHFTAYDLDNW +DR+ F+A+V+ QD+ DT+ PFR C+ EG+ + +MC+
Sbjct: 871 DVLQASACCKHFTAYDLDNWTSIDRYTFDARVTMQDLADTYQPPFRSCIEEGRASGLMCA 930
Query: 256 YNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRA 315
YN VNGVP CAD N+L +T RG+W +GYIVSDCD+V + +D Q + +PE+A A + A
Sbjct: 931 YNLVNGVPNCADFNLLSKTARGQWGFDGYIVSDCDAVSLVHDVQGYAKSPEDAVAIVLTA 990
Query: 316 GLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPK 375
G+D+ CG +L H +SAV + L+E +I+ AL+N TV+MRLG+F+G P P+G++GP
Sbjct: 991 GMDVACGGYLQKHAKSAVSQKKLTESEIDRALLNLFTVRMRLGLFNGNPRKLPFGNIGPD 1050
Query: 376 DVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIA 435
VC+ +HQ LALEAAR GIVLLKN LPLS ++AVIGPN++ T T++GNYAG
Sbjct: 1051 QVCSTEHQTLALEAARSGIVLLKNSDRLLPLSKGETLSLAVIGPNANATDTLLGNYAGPP 1110
Query: 436 CGYTTPLQGIGRYA-RTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE 494
C + +PLQG+ Y T++ GC DVAC+ + A+D ++QAD +LVMGLDQ+ E E
Sbjct: 1111 CKFISPLQGLQSYVNNTMYHAGCNDVACSSASIEN-AVDVAKQADYVVLVMGLDQTQERE 1169
Query: 495 ALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQ 554
DR L+LPG+Q++L++ V+ A+K P +LVL+ GGP+D++FAK I +I+WAGYPG+
Sbjct: 1170 KYDRLDLVLPGKQEQLITGVAKAAKKPVVLVLLCGGPVDISFAKGSSNIGSILWAGYPGE 1229
Query: 555 AGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPV 614
AGG AIA+ +FG NPGG+LP+TWYP+++I +PMT+M MRP YPGRT+RFY G
Sbjct: 1230 AGGAAIAETIFGDHNPGGRLPVTWYPKDFI-KIPMTDMRMRPEPQSGYPGRTHRFYTGKT 1288
Query: 615 VYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQV 674
V+ FG+G+SY+ + S + + VT K
Sbjct: 1289 VFEFGNGLSYSPY--------------------------SYEFLSVTPNKLYLNQPSTTH 1322
Query: 675 DVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSV 733
V+N G G H +L+F AG+ +P KQLV F+ V + AG V + C++LS
Sbjct: 1323 VVENSGKMAGKHPVLLFVKQAKAGNGSPMKQLVGFQNVFLDAGESSNVEFILSPCEHLSR 1382
Query: 734 VDRSGTRRIPLGEHNIHIGGTKHSVSLHA 762
++ G + G H + +G ++ +++ A
Sbjct: 1383 ANKDGLMVMEQGIHLLVVGDKEYPIAIVA 1411
Score = 691 bits (1784), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 331/631 (52%), Positives = 451/631 (71%), Gaps = 13/631 (2%)
Query: 5 IAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQE 64
I ++ +IL+ S+ PF+CD + +T++ FC+ +LPIP RV DL+ RL+L E
Sbjct: 9 INLIYVTVILVGVEST---QSPPFSCDSSNPSTKSYHFCKTTLPIPDRVRDLVSRLTLDE 65
Query: 65 KVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFN 124
K+ L++ A A+PRLGI YEWWSEALHGV++ GPG +F G ATSFPQVI TA+SF+
Sbjct: 66 KISQLVNSAPAIPRLGIPAYEWWSEALHGVADAGPGIRFNGTIRSATSFPQVILTAASFD 125
Query: 125 ATLWEAIGRVVSDEARAMYNGG-TAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYA 183
LW IGR + EARA+YN G T G+T+W+PN+NIFRDPRWGRGQETPGEDP+++G YA
Sbjct: 126 VHLWYRIGRAIGVEARAVYNAGQTKGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGSYA 185
Query: 184 ASYVRGLQG------SDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFD 237
SYVRG+QG L+ +A CKHFTAYDLD+W G+DRF F+A+V+ QD+ DT+
Sbjct: 186 VSYVRGVQGDCLRGLKRCGELQASACCKHFTAYDLDDWKGIDRFKFDARVTMQDLADTYQ 245
Query: 238 VPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYD 297
PF C+ EG+ + +MC+YN+VNGVP+CAD N+L T R W GYI SDCD+V + +D
Sbjct: 246 PPFHRCIEEGRASGIMCAYNRVNGVPSCADFNLLTNTARKRWNFQGYITSDCDAVSLIHD 305
Query: 298 TQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRL 357
+ F TPE+A D ++AG+D++CG +L HT+SAV + L E +++ AL N V+MRL
Sbjct: 306 SYGFAKTPEDAVVDVLKAGMDVNCGTYLLNHTKSAVMQKKLPESELDRALENLFAVRMRL 365
Query: 358 GMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVI 417
G+F+G P QPYG +GP VC+ +HQ LAL+AAR GIVLLKN LPL + ++AVI
Sbjct: 366 GLFNGNPKGQPYGDIGPNQVCSVEHQTLALDAARDGIVLLKNSQRLLPLPKGKTMSLAVI 425
Query: 418 GPNSDVTVTMIGNYAGIACGYTTPLQGIGRYAR-TIHQQGCKDVACADDQLFGAAIDASR 476
GPN++ T+IGNYAG C + TPLQ + Y + T++ GC VAC+ + A++ ++
Sbjct: 426 GPNANSPKTLIGNYAGPPCKFITPLQALQSYVKSTMYHPGCDAVACSSPSI-EKAVEIAQ 484
Query: 477 QADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAF 536
+AD +LVMGLDQ+ E EA DR L+LPG+QQ+L+ V+ A+K P +LVL+SGGP+D++F
Sbjct: 485 KADYVVLVMGLDQTQEREAHDRLDLVLPGKQQQLIICVANAAKKPVVLVLLSGGPVDISF 544
Query: 537 AKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRP 596
AK I +I+WAGYPG AGG AIA+ +FG NPGG+LP+TWYPQ++ T +PMT+M MRP
Sbjct: 545 AKYSNNIGSILWAGYPGGAGGAAIAETIFGDHNPGGRLPVTWYPQDF-TKIPMTDMRMRP 603
Query: 597 SQSKRYPGRTYRFYKGPVVYPFGHGMSYTNF 627
+ YPGRTYRFY G V+ FG+G+SY+ +
Sbjct: 604 ESNSGYPGRTYRFYTGEKVFEFGYGLSYSTY 634
>gi|356531391|ref|XP_003534261.1| PREDICTED: probable beta-D-xylosidase 6-like [Glycine max]
Length = 780
Score = 740 bits (1910), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 360/746 (48%), Positives = 496/746 (66%), Gaps = 24/746 (3%)
Query: 37 TRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSN 96
+++ PFC SLP R L+ L+L EK+ LL + A+++PRLGI Y+WWSE+LHG++
Sbjct: 36 SQSHPFCDTSLPTLTRARSLVSLLTLPEKILLLSNNASSIPRLGIPAYQWWSESLHGLAL 95
Query: 97 VGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPN 156
GPG F G P ATSFPQVI +A+SFN +LW ++ EARAM+N G AGLT+W+PN
Sbjct: 96 NGPGVSFAGAVPSATSFPQVILSAASFNRSLWLRTAAAIAREARAMFNVGQAGLTFWAPN 155
Query: 157 VNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR----------LKVAASCKH 206
+N+FRDPRWGRGQETPGEDP+L+ YA YVRGLQG G + L V+A CKH
Sbjct: 156 INLFRDPRWGRGQETPGEDPMLASAYAVEYVRGLQGLSGIQDAVVVDDDDTLMVSACCKH 215
Query: 207 FTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCA 266
FTAYDLD W R++FNA VS+QD+EDT+ PFR C+ +GK + +MCSYN+VNGVP CA
Sbjct: 216 FTAYDLDMWGQFSRYNFNAVVSQQDLEDTYQPPFRSCIQQGKASCLMCSYNEVNGVPACA 275
Query: 267 DPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLG 326
+L R +W GYI SDCD+V Y+ Q + + E+A AD ++AG+D++CG F+
Sbjct: 276 SEELLGLA-RDKWGFKGYITSDCDAVATVYEYQKYAKSQEDAVADVLKAGMDINCGTFML 334
Query: 327 LHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELA 386
HTESA+++G + E D++ AL+N +VQ+RLG+FDG+P +G LGPKDVCT +H+ LA
Sbjct: 335 RHTESAIEQGKVKEEDLDRALLNLFSVQLRLGLFDGDPIRGRFGKLGPKDVCTQEHKTLA 394
Query: 387 LEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIG 446
L+AARQGIVLLKN LPL ++AVIGP + T + G Y+GI C ++ +G+G
Sbjct: 395 LDAARQGIVLLKNDKKFLPLDRDIGASLAVIGPLA-TTTKLGGGYSGIPCSSSSLYEGLG 453
Query: 447 RYARTI-HQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPG 505
+A I + GC DV C D F AID ++QAD ++V GLD + E E DR LLLPG
Sbjct: 454 EFAERISYAFGCYDVPCDSDDGFAEAIDTAKQADFVVIVAGLDATQETEDHDRVSLLLPG 513
Query: 506 RQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILF 565
+Q LVS V+ ASK P ILVL+ GGP+DV+FA+ +P+IA+IIW GYPG+AGG A+A+I+F
Sbjct: 514 KQMNLVSSVADASKNPVILVLIGGGPLDVSFAEKNPQIASIIWLGYPGEAGGKALAEIIF 573
Query: 566 GTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYT 625
G NP G+LPMTWYP+ + TN+PM EM+MR S+ YPGRTYRFY G VY FGHG+S++
Sbjct: 574 GEFNPAGRLPMTWYPEAF-TNVPMNEMSMRADPSRGYPGRTYRFYTGGRVYGFGHGLSFS 632
Query: 626 NFVHTVANAPTVVAVPL---DGRHGSINATISGKAIKVTHA------KCNRLTLGVQVDV 676
+F + +AP+ +++ DG + + + V + CN+L+ V + V
Sbjct: 633 DFSYNFLSAPSKISLSRTIKDGSRKRLLYQVENEVYGVDYVPVNQLQNCNKLSFSVHISV 692
Query: 677 KNVGSKDGAHTLLVFST-PPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD 735
N+G DG+H +++FS P +P QLV F ++H + I +H C++LS D
Sbjct: 693 MNLGGLDGSHVVMLFSKGPKVVDGSPETQLVGFSRLHTISSKPTETSILVHPCEHLSFAD 752
Query: 736 RSGTRRIPLGEHNIHIGGTKHSVSLH 761
+ G R +PLG H + +G +H VS+
Sbjct: 753 KQGKRILPLGPHTLSVGDLEHVVSIE 778
>gi|449531013|ref|XP_004172482.1| PREDICTED: beta-D-xylosidase 1-like, partial [Cucumis sativus]
Length = 534
Score = 738 bits (1905), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 355/532 (66%), Positives = 432/532 (81%), Gaps = 6/532 (1%)
Query: 232 IEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDS 291
+EDT++VPF+ CV+EGKVASVMCSYNQVNG PTCADP++LK TIRG W L+GYIVSDCDS
Sbjct: 1 LEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDLLKNTIRGAWGLDGYIVSDCDS 60
Query: 292 VGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTL 351
VGV YD+QHFT TPEEAAA I+AGLDLDCGPFL +HT +AV RGLL E+D+NNAL N L
Sbjct: 61 VGVLYDSQHFTPTPEEAAASTIKAGLDLDCGPFLAVHTATAVGRGLLKEVDLNNALANLL 120
Query: 352 TVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRH 411
+VQMRLGMFDGEP++QPYG+LGPKDVCTP H+ LALEAARQGIVLL+N+ +LPLS RH
Sbjct: 121 SVQMRLGMFDGEPAAQPYGNLGPKDVCTPAHKHLALEAARQGIVLLQNRAGALPLSPTRH 180
Query: 412 RTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAA 471
RTVAVIGPNSD TVTMIGNYAG+AC YTTP+QGI +Y +TIH +GC +VAC DQL G A
Sbjct: 181 RTVAVIGPNSDATVTMIGNYAGVACEYTTPVQGISKYVKTIHAKGCANVACVGDQLIGEA 240
Query: 472 IDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGP 531
A+R ADA ++V+GLDQSIEAE+ DR G+LLPG+Q+ELV ++ +A KGPT++VLMSGGP
Sbjct: 241 EAAARVADAAVVVVGLDQSIEAESRDRNGVLLPGKQEELVRRIGLACKGPTVVVLMSGGP 300
Query: 532 IDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTE 591
IDV+FAKND +I+ I+W GYPGQAGG AIAD+LFG +NPGGKLPMTWYPQ Y+ +PMT
Sbjct: 301 IDVSFAKNDGKISGILWVGYPGQAGGAAIADVLFGATNPGGKLPMTWYPQSYLAKVPMTN 360
Query: 592 MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINA 651
M +RP S YPGRTYRFYKGPVV+PFG G+SY+ F + A APT +++PL + +A
Sbjct: 361 MGLRPDPSTGYPGRTYRFYKGPVVFPFGFGLSYSKFSQSFAEAPTKISLPLSSLSPNSSA 420
Query: 652 TISGKAIKVTHAKCNRLT-LGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEK 710
T +KV+H C ++ L + +DVKN G+ DG+HT+LVFST P W+P K L+ FEK
Sbjct: 421 T-----VKVSHTDCASVSDLPIMIDVKNTGTVDGSHTILVFSTVPNQTWSPEKHLIGFEK 475
Query: 711 VHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHA 762
VH+ AG+Q+RV I IHVC +LS VD GTRRIP+GEH +HIG HS+SL A
Sbjct: 476 VHLIAGSQKRVRIGIHVCDHLSRVDEFGTRRIPMGEHKLHIGDLTHSISLQA 527
>gi|225432136|ref|XP_002274651.1| PREDICTED: probable beta-D-xylosidase 5-like [Vitis vinifera]
Length = 809
Score = 738 bits (1904), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 372/783 (47%), Positives = 506/783 (64%), Gaps = 40/783 (5%)
Query: 5 IAFFFLGLILLSASSSG-----LAAREP------FACDPKDATT-----RTLPFCQVSLP 48
IAFF + + S G +AAR P + CD + +C S P
Sbjct: 17 IAFFAVCSAIKSPLKDGPAAAPMAARGPIDGNYTYVCDESRFAALGLDMKDFHYCDSSSP 76
Query: 49 IPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFP 108
R DL+ R++L EKV A+ V R+G+ Y WWSEALHGVSN G F P
Sbjct: 77 YEVRAKDLVDRMTLSEKVMQTGDQASGVERIGLPKYNWWSEALHGVSNFGRCVFFDEVVP 136
Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
GATSFP VI +A+SFN +LW+ +G+ VS EARAMYN G AGLT+WSPN+N+ RDPRWGR
Sbjct: 137 GATSFPTVILSAASFNQSLWKTLGQAVSTEARAMYNSGNAGLTFWSPNINVVRDPRWGRI 196
Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDR---------LKVAASCKHFTAYDLDNWNGVD 219
ETPGEDP L G YA +YVRGLQ G LKV++ CKH+ AYDLDNW G D
Sbjct: 197 LETPGEDPHLVGLYAVNYVRGLQDVVGAENTTDLNSRPLKVSSCCKHYAAYDLDNWKGAD 256
Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
R HF+A+VS QD+ +TF +PF MCV EG V+SVMCSYN++NG+P+CAD +LK+TIRGEW
Sbjct: 257 RVHFDARVSVQDMAETFVLPFEMCVKEGDVSSVMCSYNKINGIPSCADSRLLKQTIRGEW 316
Query: 280 RLNGYIVSDCDSVGVY-YDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLL 338
L+GYIVSDCDSV V D + S+ ++AA A+ AG++LDCG F AV +G
Sbjct: 317 DLHGYIVSDCDSVEVMAVDQKWLDSSFSDSAAQALNAGMNLDCGTFNNRSLTEAVNQGKA 376
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
++ D++++L + MR+G FDG P+ + LG D+C+ +H ELA EAARQGIVLLK
Sbjct: 377 NQADLDHSLRYLYVLLMRVGFFDGIPA---FASLGKDDICSAEHIELAREAARQGIVLLK 433
Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCK 458
N +LPL +++ +A++GP+++ T MIGNYAGI C Y +PL +++GC
Sbjct: 434 NDNATLPLKSVKN--IALVGPHANATDAMIGNYAGIPCYYVSPLDAFSSMGEVRYEKGCA 491
Query: 459 DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMAS 518
DV C ++ A++A+++ADATI+ G D SIEAEALDR LLLPG Q +L+++V+ S
Sbjct: 492 DVQCLNETYIFNAMEAAKRADATIIFAGTDLSIEAEALDRVDLLLPGYQTQLINQVADLS 551
Query: 519 KGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTW 578
GP +LV+MSGG +D++FA+++P+IAAI+WAGYPG+ GG AIAD++ G NPGG+LP+TW
Sbjct: 552 TGPVVLVIMSGGGVDISFARDNPKIAAILWAGYPGEQGGNAIADVILGKYNPGGRLPITW 611
Query: 579 YPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
Y +Y+ LPMT MA+RP S YPGRTY+F+ G VYPFG+GMSYTNF ++++ +
Sbjct: 612 YEADYVDMLPMTSMALRPVDSLGYPGRTYKFFNGSTVYPFGYGMSYTNFSYSLSTSQRWT 671
Query: 639 AVPLDGRHGS-----INATI--SGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF 691
+ L IN T A+ V C ++ +V VKNVG DG+ ++V+
Sbjct: 672 NINLRKLQRCRSMVYINDTFVPDCPAVLVDDLSCKE-SIEFEVAVKNVGRMDGSEVVVVY 730
Query: 692 STPPAGHWAPH-KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIH 750
S+PP G H K++V FE+V V G ++V +++VCK L +VD +G +P G H I
Sbjct: 731 SSPPLGIAGTHIKKVVGFERVFVKVGGTEKVKFSMNVCKSLGIVDSTGYALLPSGSHTIK 790
Query: 751 IGG 753
+GG
Sbjct: 791 VGG 793
>gi|255557375|ref|XP_002519718.1| Beta-glucosidase, putative [Ricinus communis]
gi|223541135|gb|EEF42691.1| Beta-glucosidase, putative [Ricinus communis]
Length = 802
Score = 734 bits (1895), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 361/741 (48%), Positives = 497/741 (67%), Gaps = 25/741 (3%)
Query: 39 TLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVG 98
T FC SL R DL+ +++L+EKV+ L A VPRLGI YEWWSEALHGVS+VG
Sbjct: 65 TFGFCDSSLSYEVRAKDLVNQMTLKEKVQQLGDLAYGVPRLGIPKYEWWSEALHGVSDVG 124
Query: 99 PGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVN 158
PGT F PGATSFP I T +SFN +LW+ IG+ S +ARAMYN G AGLTYWSPNVN
Sbjct: 125 PGTFFDDLVPGATSFPTTILTTASFNESLWKNIGQA-SAKARAMYNLGRAGLTYWSPNVN 183
Query: 159 IFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR---------LKVAASCKHFTA 209
+ RDPRWGR ETPGEDP + G+YA +YVRGLQ +G LKV++ CKH+ A
Sbjct: 184 VVRDPRWGRTVETPGEDPYVVGRYAVNYVRGLQDVEGTENYTDLNTRPLKVSSCCKHYAA 243
Query: 210 YDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
YD++ W GV+R F+A+V++QD+ +TF PF MCV EG V+SVMCS+N+VNG+PTCADP
Sbjct: 244 YDVEKWQGVERLTFDARVTEQDMVETFLRPFEMCVKEGDVSSVMCSFNRVNGIPTCADPK 303
Query: 270 ILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCGPFLGLH 328
+L +TIRG+W L+GYIVSDCDS+ V D F T E+A A ++AGLDLDCG +
Sbjct: 304 LLNQTIRGDWDLHGYIVSDCDSIEVMVDNHKFLGDTNEDAVAQVLKAGLDLDCGGYYTNF 363
Query: 329 TESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALE 388
TE++V++G E I+ +L V MRLG FDG P Y LG KD+CT ++ ELA +
Sbjct: 364 TETSVKQGKAREEYIDRSLKYLYVVLMRLGFFDGTPQ---YQKLGKKDICTKENVELAKQ 420
Query: 389 AARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY 448
AAR+GIVLLKN +LPLS + + +AV+GP+++ T MIGNYAG+ C Y +P+ G Y
Sbjct: 421 AAREGIVLLKNND-TLPLSMDKVKNLAVVGPHANATRVMIGNYAGVPCRYVSPIDGFSIY 479
Query: 449 ARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQ 508
+ ++ GC DV C ++ L A+ A++ ADATI+V GLD +IEAE LDR LLLPG Q
Sbjct: 480 SNVTYEIGC-DVPCKNESLVFPAVHAAKNADATIIVAGLDLTIEAEGLDRNDLLLPGYQT 538
Query: 509 ELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTS 568
+L+++V+ A+ GP ILV+M+ G +D++FA+++ +I AI+W GYPGQ GG AIAD++FG
Sbjct: 539 QLINQVAGAANGPVILVIMAAGGVDISFARDNEKIKAILWVGYPGQEGGHAIADVVFGKY 598
Query: 569 NPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFV 628
NPGG+LP+TWY +++ +PMT M +RP + YPG+TY+FY G VYPFG+G+SYT F
Sbjct: 599 NPGGRLPITWYEADFVEQVPMTYMQLRPDEELGYPGKTYKFYDGSTVYPFGYGLSYTTFS 658
Query: 629 HTVANAPTVVAVPL-------DGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGS 681
+ + +A + L D R+G+ S A+ H CN ++V+V+N GS
Sbjct: 659 YNITSAKRSKHIALNKFQHCRDLRYGNETFKPSCPAVLTDHLPCND-DFELEVEVENTGS 717
Query: 682 KDGAHTLLVFSTPPAGHWAPH-KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTR 740
+DG+ ++V+S P G + KQ++ F++V V AG+ ++V +VCK ++D +
Sbjct: 718 RDGSEVVMVYSKTPEGIVGSYIKQVIGFKRVFVQAGSVEKVNFRFNVCKSFRIIDYNAYS 777
Query: 741 RIPLGEHNIHIGGTKHSVSLH 761
+P G H I +G S+ L+
Sbjct: 778 ILPSGGHTIMVGDDIVSIPLY 798
>gi|18025342|gb|AAK38482.1| beta-D-xylosidase [Hordeum vulgare]
Length = 777
Score = 734 bits (1895), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 367/757 (48%), Positives = 489/757 (64%), Gaps = 23/757 (3%)
Query: 22 LAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGI 81
+AA PF+C + FC LPI QR DL+ +L+L+EK+ L + AV RLG+
Sbjct: 28 VAADPPFSC----GAPSSAAFCDRRLPIEQRAADLVSKLTLEEKISQLGDESPAVDRLGV 83
Query: 82 KGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA 141
Y+WWSEALHGV+N G G G ATSFPQVI TA+SFN LW IG+V+ EAR
Sbjct: 84 PAYKWWSEALHGVANAGRGVHLDGPLRAATSFPQVILTAASFNPHLWYRIGQVIGTEARG 143
Query: 142 MYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGS------D 194
+YN G A GLT+W+PN+N+FRDPRWGRGQETPGEDP ++GKYAA +VRG+QG +
Sbjct: 144 VYNNGQAEGLTFWAPNINVFRDPRWGRGQETPGEDPTMTGKYAAVFVRGVQGYGMSGAIN 203
Query: 195 GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMC 254
L+ +A CKHFTAYDL+NW GV RF F+AKV++QD+ DT++ PF+ CV +G + +MC
Sbjct: 204 SSDLEASACCKHFTAYDLENWKGVTRFAFDAKVTEQDLADTYNPPFKSCVEDGGASGIMC 263
Query: 255 SYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIR 314
SYN+VNGVPTCAD N+L +T RG+W NGYI SDCD+V + +D Q + PE+A AD ++
Sbjct: 264 SYNRVNGVPTCADHNLLSKTARGDWSFNGYITSDCDAVAIIHDVQGYAKAPEDAVADVLK 323
Query: 315 AGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGP 374
AG+D++CG ++ H SA Q+G ++ DI+ AL N ++MRLG+FDG P YG++G
Sbjct: 324 AGMDVNCGGYIQTHGVSAYQQGKITGEDIDRALRNLFAIRMRLGLFDGNPKYNRYGNIGA 383
Query: 375 KDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGI 434
VC+ +HQ+LAL+AAR GIVLLKN G +LPLS + ++AVIGPN + ++GNY G
Sbjct: 384 DQVCSKEHQDLALQAARDGIVLLKNDGAALPLSKSKVSSLAVIGPNGNNASLLLGNYFGP 443
Query: 435 ACGYTTPLQGIGRYARTIH-QQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEA 493
C TPLQ + Y + QGC C + G A+ A+ AD +L MGLDQ+ E
Sbjct: 444 PCISVTPLQALQGYVKDARFVQGCNAAVCNVSNI-GEAVHAAGSADYVVLFMGLDQNQER 502
Query: 494 EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPG 553
E +DR L LPG Q+ LV+ V+ A+K P ILVL+ GGP+DV FAKN+P+I AI+WAGYPG
Sbjct: 503 EEVDRLELGLPGMQESLVNSVADAAKKPVILVLLCGGPVDVTFAKNNPKIGAIVWAGYPG 562
Query: 554 QAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGP 613
QAGG AIA +LFG NPGG+LP+TWYP+E+ T +PMT+M MR S YPGRTYRFYKG
Sbjct: 563 QAGGIAIAQVLFGDHNPGGRLPVTWYPKEF-TAVPMTDMRMRADPSTGYPGRTYRFYKGK 621
Query: 614 VVYPFGHGMSYTNFVHTVANAPT-------VVAVPLDGRHGSINATISGKAIKVTHAKCN 666
VY FG+G+SY+ + H A+ T + + R S T+S ++ C+
Sbjct: 622 TVYNFGYGLSYSKYSHRFASKGTKPPSMSGIEGLKATAR-ASAAGTVSYDVEEMGAEACD 680
Query: 667 RLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINI 725
RL V V+N G DG H +L+F P A P QL+ F+ VH+ A V +
Sbjct: 681 RLRFPAVVRVQNHGPMDGGHLVLLFLRWPNATDGRPASQLIGFQSVHLRADEAAHVEFEV 740
Query: 726 HVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHA 762
CK+LS G + I G H + +G + +S A
Sbjct: 741 SPCKHLSRAAEDGRKVIDQGSHFVRVGDDEFELSFMA 777
>gi|224093292|ref|XP_002309869.1| predicted protein [Populus trichocarpa]
gi|222852772|gb|EEE90319.1| predicted protein [Populus trichocarpa]
Length = 694
Score = 733 bits (1891), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 355/717 (49%), Positives = 483/717 (67%), Gaps = 38/717 (5%)
Query: 55 DLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFP 114
DL+ +++L EKV L + A VPRLG+ Y+WWSEALHGVSNVGPGT F PG+TSFP
Sbjct: 2 DLVNQMTLNEKVLQLGNKAYGVPRLGLAEYQWWSEALHGVSNVGPGTFFDDLIPGSTSFP 61
Query: 115 QVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGE 174
VITTA++FN +LW+ IG+ VS EARAMYN G AGLTYWSPN+N+ RDPRWGR ETPGE
Sbjct: 62 TVITTAAAFNESLWKVIGQAVSTEARAMYNLGRAGLTYWSPNINVVRDPRWGRAIETPGE 121
Query: 175 DPVLSGKYAASYVRGLQGSDGDR---------LKVAASCKHFTAYDLDNWNGVDRFHFNA 225
DP L G+YA +YVRGLQ +G LKV++ CKH+ AYD+DNW GV+R+ F+A
Sbjct: 122 DPYLVGRYAVNYVRGLQDVEGSENYTDPNSRPLKVSSCCKHYAAYDVDNWKGVERYTFDA 181
Query: 226 KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYI 285
+VS+QD+ +TF PF MCV +G V+SVMCSYN+VNG+PTCADP +L +TIRG+W L+GYI
Sbjct: 182 RVSEQDMVETFLRPFEMCVKDGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWDLHGYI 241
Query: 286 VSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINN 345
VSDCDS+ V + + GLDLDCG + + E+AV++G + E DI+
Sbjct: 242 VSDCDSLQVMVENHKWL-------------GLDLDCGAYYTENVEAAVRQGKVREADIDK 288
Query: 346 ALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLP 405
+L V MRLG FDG P Y G DVC+ ++ ELA EAAR+G VLLKN+ SLP
Sbjct: 289 SLNFLYVVLMRLGFFDGIPQ---YNSFGKNDVCSKENIELATEAAREGAVLLKNENDSLP 345
Query: 406 LSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADD 465
LS + +T+AVIGP+S+ T MIGNYAGI C TP++G+ +YA+ +Q GC D+AC D+
Sbjct: 346 LSIEKVKTLAVIGPHSNATSAMIGNYAGIPCQIITPIEGLSKYAKVDYQMGCSDIACKDE 405
Query: 466 QLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILV 525
A++++++ADATI++ G+D SIEAE+LDR LLLPG Q +L+++V+ S GP +LV
Sbjct: 406 SFIFPAMESAKKADATIILAGIDLSIEAESLDRDDLLLPGYQTQLINQVASVSNGPVVLV 465
Query: 526 LMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYIT 585
LMS G +D++FAK++ I +I+W GYPG+ GG AIAD++FG NPGG+LP+TW+ +Y+
Sbjct: 466 LMSAGGVDISFAKSNGDIKSILWVGYPGEEGGNAIADVIFGKYNPGGRLPLTWHEADYVD 525
Query: 586 NLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGR 645
LPMT M +RP S YPGRTY+F+ G VYPFGHG+SYT F + + + + + LD
Sbjct: 526 MLPMTSMPLRPIDSLGYPGRTYKFFNGSTVYPFGHGLSYTQFTYKLTSTIRSLDIKLDKY 585
Query: 646 HGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPH-KQ 704
+ + K + +V N G+KDG+ ++V++ PP G A + KQ
Sbjct: 586 QYCHDLGYKNDSFKPSF------------EVLNAGAKDGSEVVIVYAKPPEGIDATYIKQ 633
Query: 705 LVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
++ F++V VPAG ++V + K L VVD + +P G H I +G S S+
Sbjct: 634 VIGFKRVFVPAGGSEKVKFEFNASKSLQVVDFNAYSVLPSGGHTIMLGDDIISFSVQ 690
>gi|225432134|ref|XP_002274619.1| PREDICTED: probable beta-D-xylosidase 5-like [Vitis vinifera]
Length = 805
Score = 732 bits (1889), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 371/741 (50%), Positives = 489/741 (65%), Gaps = 24/741 (3%)
Query: 38 RTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNV 97
+ +C SLP RV DL+ R++L+EK + +I A+ VPR+G+ Y+WWSEALHGV+NV
Sbjct: 63 KDFVYCDSSLPYDVRVKDLVDRITLEEKARNVIDVASGVPRIGLPPYKWWSEALHGVANV 122
Query: 98 GPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNV 157
G T F PGATSFP VI +A+SFN +LW+ +G+VVS EARAMYN G AGLT+WSPN+
Sbjct: 123 GSATFFDEVVPGATSFPNVILSAASFNQSLWKTLGQVVSTEARAMYNLGHAGLTFWSPNI 182
Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR---------LKVAASCKHFT 208
N+ RDPRWGR ETPGEDP+ G Y +YVRGLQ +G LK+A+SCKHF
Sbjct: 183 NVARDPRWGRILETPGEDPLTVGVYGVNYVRGLQDIEGTENTTDLNSRPLKIASSCKHFA 242
Query: 209 AYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADP 268
AYDLD W VDR HF+AKVS+QD+ +TF PF MCV EG +SVMCS+N +NG+P CADP
Sbjct: 243 AYDLDQWFNVDRRHFDAKVSEQDMTETFLRPFEMCVKEGDTSSVMCSFNNINGIPPCADP 302
Query: 269 NILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTS-TPEEAAADAIRAGLDLDCGPFLGL 327
LK IR +W L+GYIVSDC ++ Q F T EE A +++AGLDL+CG +
Sbjct: 303 RFLKGVIREQWNLHGYIVSDCWAIDTIVQDQKFLDVTSEEGVALSMKAGLDLECGHYYND 362
Query: 328 HTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELAL 387
+AV+ G +SE D++ +L V MR+G FDG PS LG KD+C +H ELA
Sbjct: 363 SLATAVREGRVSEHDVDKSLSYLYVVLMRVGFFDGIPS---LASLGKKDICNDEHIELAR 419
Query: 388 EAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGR 447
EAARQGIVLLKN +LPL ++ +A++GP+++ TV MIGNYAGI C Y +PL
Sbjct: 420 EAARQGIVLLKNDNATLPLKPVKK--LALVGPHANATVAMIGNYAGIPCHYVSPLDAFSE 477
Query: 448 YARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQ 507
++ GC DV C +D A +A++ ADATI+++G D SIEAE DR LLLPG Q
Sbjct: 478 LGDVTYEVGCADVKCHNDTHVYKAAEAAKNADATIILVGTDLSIEAEERDREDLLLPGYQ 537
Query: 508 QELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGT 567
E+V++V+ S GP ILV+M GGPID++FAKN+P+IAAI+WAG+PG+ GG AIADI+FG
Sbjct: 538 TEMVNQVTDLSTGPVILVVMCGGPIDISFAKNNPKIAAILWAGFPGEQGGNAIADIVFGK 597
Query: 568 SNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNF 627
NPGG+ P+TWY Y+ LPMT MA+RP +S YPGRTY+F+ G VYPFG+G+SYTNF
Sbjct: 598 YNPGGRSPITWYENGYVGMLPMTSMALRPIESLGYPGRTYKFFNGSTVYPFGYGLSYTNF 657
Query: 628 VHTVANAPTVVAVPLDGRHGSINATISG-------KAIKVTHAKCNRLTLGVQVDVKNVG 680
+++ V + L + S A+ V C+ + QV VKNVG
Sbjct: 658 SYSLTAPTRSVHISLTRLQQCRSMAYSSDSFQPECSAVLVDDLSCDE-SFEFQVAVKNVG 716
Query: 681 SKDGAHTLLVFSTPPAGHWAPH-KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGT 739
S DG+ ++V+S+PP+G H KQ++ FE+V V G ++V +++VCK L +VD SG
Sbjct: 717 SMDGSEVVMVYSSPPSGIVGTHIKQVIGFERVFVKVGNTEKVKFSMNVCKSLGLVDSSGY 776
Query: 740 RRIPLGEHNIHIGGTKHSVSL 760
+P G H I G SVS
Sbjct: 777 ILLPSGSHTIMAGDNSTSVSF 797
>gi|297842585|ref|XP_002889174.1| glycosyl hydrolase family 3 protein [Arabidopsis lyrata subsp.
lyrata]
gi|297335015|gb|EFH65433.1| glycosyl hydrolase family 3 protein [Arabidopsis lyrata subsp.
lyrata]
Length = 766
Score = 732 bits (1889), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 354/757 (46%), Positives = 506/757 (66%), Gaps = 27/757 (3%)
Query: 23 AAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIK 82
+A P +CDP + TT+ FC+ LPI QR DL+ RL++ EK+ L + A +PRLG+
Sbjct: 18 SAPPPHSCDPSNPTTKLYQFCRTDLPISQRARDLVSRLNIDEKISQLGNTAPGIPRLGVP 77
Query: 83 GYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAM 142
YEWWSEALHGV+ GPG +F G ATSFPQVI TA+SF++ W I +V+ EAR +
Sbjct: 78 AYEWWSEALHGVAYAGPGIRFNGTVKAATSFPQVILTAASFDSYEWFRIAQVIGKEARGV 137
Query: 143 YNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD----- 196
YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP+++G YA +YVRGLQG D
Sbjct: 138 YNAGQAQGMTFWAPNINIFRDPRWGRGQETPGEDPIMTGTYAVAYVRGLQGDSFDGRKTL 197
Query: 197 --RLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMC 254
L+ +A CKHFTAYDLD W G+ R+ FNA+VS D+ +T+ PF+ C+ EG+ + +MC
Sbjct: 198 SIHLQASACCKHFTAYDLDRWKGITRYVFNAQVSLADLAETYQPPFKKCIEEGRASGIMC 257
Query: 255 SYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIR 314
+YN+VNG+P+CADPN+L RT RG WR GYI SDCD+V + +D Q + TPE+A AD ++
Sbjct: 258 AYNRVNGIPSCADPNLLTRTARGLWRFRGYITSDCDAVSIIHDAQGYAKTPEDAVADVLK 317
Query: 315 AGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGP 374
AG+D++CG +L HT+SA+Q+ +SE DI+ AL+N +V++RLG+F+G+P+ PYG++ P
Sbjct: 318 AGMDVNCGSYLQKHTKSALQQKKVSETDIDRALLNLFSVRIRLGLFNGDPTKLPYGNISP 377
Query: 375 KDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGI 434
DVC+P HQ LALEAAR GIVLLKN LP S ++AVIGPN+ V T++GNYAG
Sbjct: 378 NDVCSPAHQALALEAARNGIVLLKNNLKLLPFSKRSVSSLAVIGPNAHVAKTLLGNYAGP 437
Query: 435 ACGYTTPLQGIGRYART-IHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEA 493
C TPL + Y + ++ GC VAC++ + A+ +R AD +L+MGLDQ+ E
Sbjct: 438 PCKTVTPLDALRSYVKNAVYHNGCDSVACSNAAI-DQAVAIARNADHVVLIMGLDQTQEK 496
Query: 494 EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPG 553
E +DR L LPG+QQEL++ V+ A+K P +LVL+ GGP+D++FA N+ +I +I+WAGYPG
Sbjct: 497 EDMDRVDLSLPGKQQELITSVANAAKKPVVLVLICGGPVDISFATNNDKIGSIMWAGYPG 556
Query: 554 QAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGP 613
+AGG A+A+I+FG NPGG+LP+TWYPQ ++ N+ MT+M MR + YPGRTY+FYKGP
Sbjct: 557 EAGGIALAEIIFGDHNPGGRLPVTWYPQSFV-NVQMTDMRMRSATG--YPGRTYKFYKGP 613
Query: 614 VVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVT------HAKCNR 667
V+ FGHG+SY+ + + T+ A L A ++ +++ T CN
Sbjct: 614 KVFEFGHGLSYSTYSYRFK---TLGATNLYLNQS--KAQLNSDSVRYTLVSEMGEEGCNI 668
Query: 668 LTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW---APHKQLVAFEKVHVPAGAQQRVGIN 724
V V V+N G G H +L+F+ G KQLV F+ + + G + +
Sbjct: 669 AKTKVIVTVENQGEMAGKHPVLMFARHERGGENGKRAEKQLVGFKSIVLSNGEKAEMEFE 728
Query: 725 IHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
I +C++LS + G + G++ + +G ++ ++++
Sbjct: 729 IGLCEHLSRANEVGVMVVEEGKYFLTVGDSELPLTIN 765
>gi|15218202|ref|NP_177929.1| putative beta-D-xylosidase 7 [Arabidopsis thaliana]
gi|259585708|sp|Q9SGZ5.2|BXL7_ARATH RecName: Full=Probable beta-D-xylosidase 7; Short=AtBXL7; Flags:
Precursor
gi|18086336|gb|AAL57631.1| At1g78060/F28K19_32 [Arabidopsis thaliana]
gi|332197942|gb|AEE36063.1| putative beta-D-xylosidase 7 [Arabidopsis thaliana]
Length = 767
Score = 731 bits (1888), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 348/746 (46%), Positives = 502/746 (67%), Gaps = 17/746 (2%)
Query: 23 AAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIK 82
+A P +CDP + TT+ FC+ LPI +R DL+ RL++ EK+ L++ A +PRLG+
Sbjct: 19 SAPPPHSCDPSNPTTKLYQFCRTDLPIGKRARDLVSRLTIDEKISQLVNTAPGIPRLGVP 78
Query: 83 GYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAM 142
YEWWSEALHGV+ GPG +F G ATSFPQVI TA+SF++ W I +V+ EAR +
Sbjct: 79 AYEWWSEALHGVAYAGPGIRFNGTVKAATSFPQVILTAASFDSYEWFRIAQVIGKEARGV 138
Query: 143 YNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD----- 196
YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP+++G YA +YVRGLQG D
Sbjct: 139 YNAGQANGMTFWAPNINIFRDPRWGRGQETPGEDPMMTGTYAVAYVRGLQGDSFDGRKTL 198
Query: 197 --RLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMC 254
L+ +A CKHFTAYDLD W G+ R+ FNA+VS D+ +T+ PF+ C+ EG+ + +MC
Sbjct: 199 SNHLQASACCKHFTAYDLDRWKGITRYVFNAQVSLADLAETYQPPFKKCIEEGRASGIMC 258
Query: 255 SYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIR 314
+YN+VNG+P+CADPN+L RT RG+W GYI SDCD+V + YD Q + +PE+A AD ++
Sbjct: 259 AYNRVNGIPSCADPNLLTRTARGQWAFRGYITSDCDAVSIIYDAQGYAKSPEDAVADVLK 318
Query: 315 AGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGP 374
AG+D++CG +L HT+SA+Q+ +SE DI+ AL+N +V++RLG+F+G+P+ PYG++ P
Sbjct: 319 AGMDVNCGSYLQKHTKSALQQKKVSETDIDRALLNLFSVRIRLGLFNGDPTKLPYGNISP 378
Query: 375 KDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGI 434
+VC+P HQ LAL+AAR GIVLLKN LP S ++AVIGPN+ V T++GNYAG
Sbjct: 379 NEVCSPAHQALALDAARNGIVLLKNNLKLLPFSKRSVSSLAVIGPNAHVVKTLLGNYAGP 438
Query: 435 ACGYTTPLQGIGRYART-IHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEA 493
C TPL + Y + ++ QGC VAC++ + A+ ++ AD +L+MGLDQ+ E
Sbjct: 439 PCKTVTPLDALRSYVKNAVYHQGCDSVACSNAAI-DQAVAIAKNADHVVLIMGLDQTQEK 497
Query: 494 EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPG 553
E DR L LPG+QQEL++ V+ A+K P +LVL+ GGP+D++FA N+ +I +IIWAGYPG
Sbjct: 498 EDFDRVDLSLPGKQQELITSVANAAKKPVVLVLICGGPVDISFAANNNKIGSIIWAGYPG 557
Query: 554 QAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGP 613
+AGG AI++I+FG NPGG+LP+TWYPQ ++ N+ MT+M MR + YPGRTY+FYKGP
Sbjct: 558 EAGGIAISEIIFGDHNPGGRLPVTWYPQSFV-NIQMTDMRMRSATG--YPGRTYKFYKGP 614
Query: 614 VVYPFGHGMSYTNFVHTVAN-APTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGV 672
VY FGHG+SY+ + + A T + + + ++ ++ C+ V
Sbjct: 615 KVYEFGHGLSYSAYSYRFKTLAETNLYLNQSKAQTNSDSVRYTLVSEMGKEGCDVAKTKV 674
Query: 673 QVDVKNVGSKDGAHTLLVFSTPPAGHW---APHKQLVAFEKVHVPAGAQQRVGINIHVCK 729
V+V+N G G H +L+F+ G KQLV F+ + + G + + I +C+
Sbjct: 675 TVEVENQGEMAGKHPVLMFARHERGGEDGKRAEKQLVGFKSIVLSNGEKAEMEFEIGLCE 734
Query: 730 YLSVVDRSGTRRIPLGEHNIHIGGTK 755
+LS + G + G++ + +G ++
Sbjct: 735 HLSRANEFGVMVLEEGKYFLTVGDSE 760
>gi|297811163|ref|XP_002873465.1| glycosyl hydrolase family 3 protein [Arabidopsis lyrata subsp.
lyrata]
gi|297319302|gb|EFH49724.1| glycosyl hydrolase family 3 protein [Arabidopsis lyrata subsp.
lyrata]
Length = 796
Score = 731 bits (1886), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 364/783 (46%), Positives = 495/783 (63%), Gaps = 40/783 (5%)
Query: 15 LSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAA 74
++ + L + F C P ++ PFC VSL I QR L+ L+L EK+ L + AA
Sbjct: 17 IAETFKNLDSHPQFPCKPPHFSS--YPFCNVSLSIKQRAISLVSLLTLPEKIGQLSTTAA 74
Query: 75 AVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRV 134
+VPRLGI YEWWSE+LHG+++ GPG F G ATSFPQVI +A+SFN TLW IG
Sbjct: 75 SVPRLGIPPYEWWSESLHGLADNGPGVSFNGSISAATSFPQVIVSAASFNRTLWYEIGSA 134
Query: 135 VSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQ--- 191
V+ EARAMYNGG AGLT+W+PN+N+FRDPRWGRGQETPGEDP + +Y +VRG Q
Sbjct: 135 VAVEARAMYNGGQAGLTFWAPNINLFRDPRWGRGQETPGEDPKVVSEYGVEFVRGFQEKK 194
Query: 192 ---------GSDG------------DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQ 230
GSD +L ++A CKHFTAYDL+ W R+ FNA V++Q
Sbjct: 195 KRKVLKTRFGSDNVDDDARYDDDADGKLMLSACCKHFTAYDLEKWGNFTRYDFNAVVTEQ 254
Query: 231 DIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCD 290
D+EDT+ PF C+ +GK + +MCSYN VNGVP CA ++L++ R EW +GYI SDCD
Sbjct: 255 DMEDTYQPPFETCIKDGKASCLMCSYNAVNGVPACAQGDLLQKA-RVEWGFDGYITSDCD 313
Query: 291 SVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNT 350
+V ++ Q +T +PEEA ADAI+AG+D++CG ++ +T+SA+++G +SE ++ AL+N
Sbjct: 314 AVATIFEYQGYTKSPEEAVADAIKAGVDINCGTYMLRNTQSAIEQGKVSEELVDRALLNL 373
Query: 351 LTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIR 410
VQ+RLG+FDG+P YG LG D+C+ DH++LALEAARQGIVLLKN LPL+
Sbjct: 374 FAVQLRLGLFDGDPRGGHYGKLGSNDICSSDHRKLALEAARQGIVLLKNDYKLLPLNKNH 433
Query: 411 HRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA-RTIHQQGCKDVACADDQLFG 469
++A++GP ++ M G Y G C T + Y +T + GC DV+C D FG
Sbjct: 434 VSSLAIVGPMANNISNMGGTYTGKPCQRKTLFTELLEYVKKTSYASGCSDVSCVSDTGFG 493
Query: 470 AAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSG 529
A+ ++ AD I+V GLD S E E DR L LPG+Q++LVS V+ SK P ILVL G
Sbjct: 494 EAVAIAKGADFVIVVAGLDLSQETEDKDRFSLSLPGKQKDLVSSVAAVSKKPVILVLTGG 553
Query: 530 GPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM 589
GP+DV FAK DPRI +IIW GYPG+ GG A+A+I+FG NPGG+LP+TWYP+ + ++PM
Sbjct: 554 GPVDVTFAKTDPRIGSIIWIGYPGETGGQALAEIIFGDFNPGGRLPITWYPESF-ADVPM 612
Query: 590 TEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA----VPLDGR 645
++M MR S+ YPGRTYRFY GP VY FG G+SYT F + + +AP ++ +P
Sbjct: 613 SDMHMRADSSRGYPGRTYRFYTGPQVYSFGTGLSYTKFDYKIISAPIRLSLSELLPQQSS 672
Query: 646 HGSINATISGKAIK------VTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFST-PPAGH 698
H + ++ V C L V+V+V+N G DG+H L++FS
Sbjct: 673 HKKQLLQHGEEQLQYIQLDDVMVNSCESLRFNVRVNVRNTGEIDGSHVLMLFSKMARVLS 732
Query: 699 WAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSV 758
P KQL+ F++VH+ + I CKYLSV + G R IPLG H + +G +HS+
Sbjct: 733 GVPEKQLIGFDRVHIRSNEMMETVFVIDPCKYLSVANDVGKRVIPLGIHALFLGDLQHSL 792
Query: 759 SLH 761
S+
Sbjct: 793 SVE 795
>gi|371917284|dbj|BAL44718.1| SlArf/Xyl3 [Solanum lycopersicum]
Length = 777
Score = 730 bits (1885), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 352/771 (45%), Positives = 505/771 (65%), Gaps = 23/771 (2%)
Query: 4 TIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQ 63
T+ F F+ L+LL + + + PF+CD + T + PFC +LPIPQRVNDL+ RL++
Sbjct: 10 TVMFIFVILVLLFRRTE--STKPPFSCDSSNPNTSSYPFCNAALPIPQRVNDLVSRLTVD 67
Query: 64 EKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSF 123
EK+ L++GA +PRLGI YEWWSE LHG+S G GT F G AT FPQ+I TASSF
Sbjct: 68 EKILQLVNGAPEIPRLGISAYEWWSEGLHGISRHGKGTLFNGTIKAATQFPQIILTASSF 127
Query: 124 NATLWEAIGRVVSDEARAMYNGGT-AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKY 182
+ LW I + + EARA+YN G G+T W+PN+NI RDPRWGRGQETPGEDP++ GKY
Sbjct: 128 DENLWYRIAQAIGREARAVYNAGQLKGITLWAPNINILRDPRWGRGQETPGEDPMMVGKY 187
Query: 183 AASYVRGLQGS--DGDRLK-----VAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDT 235
+YVRGLQG +G +LK +A CKHF A D+DNW+ R+ F+A+V KQD+ D+
Sbjct: 188 GVAYVRGLQGDSFEGGKLKDGHLQTSACCKHFIAQDMDNWHNFSRYTFDAQVLKQDLADS 247
Query: 236 FDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVY 295
++ PF+ CV +GK +SVMC+YN VNG+P CA+ ++L T RG+W L GYIVSDCD+V
Sbjct: 248 YEPPFKDCVEQGKASSVMCAYNLVNGIPNCANFDLLTTTARGKWGLQGYIVSDCDAVDKM 307
Query: 296 YDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQM 355
Y QH+ PE+A A ++AG+D++CG L +T+SA+++ + E DI+ AL N +V+M
Sbjct: 308 YSEQHYAKEPEDAVAATLKAGMDVNCGSHLKTYTKSALEKQKVKESDIDRALHNLFSVRM 367
Query: 356 RLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVA 415
RLG+F+G+PS YG + +VC+ +H+ LA+EAAR G VLLKN LPLS ++ ++A
Sbjct: 368 RLGLFNGDPSKLEYGDISAAEVCSEEHRALAVEAARSGSVLLKNSNRLLPLSKMKTASLA 427
Query: 416 VIGPNSDVTVTMIGNYAGIACGYTTPLQGI-GRYARTIHQQGCKDVACADDQLFGAAIDA 474
VIGP ++ + ++GNY G +C T QG+ G A T++ GC + C + A++
Sbjct: 428 VIGPKANDSEVLLGNYEGFSCKNVTLFQGLQGYVANTMYHPGCDFINCTSPAI-DEAVNI 486
Query: 475 SRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDV 534
+++AD +LVMGLDQ++E E DR L LPG Q++L++ ++ A+ P ILVLM GGP+DV
Sbjct: 487 AKKADYVVLVMGLDQTLEREKFDRTELGLPGMQEKLITSIAEAASKPVILVLMCGGPVDV 546
Query: 535 AFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM 594
FAK++P+I I+W GYPG+ G A+A ILFG NPGG+ P+TWYP+E+ + M +M M
Sbjct: 547 TFAKDNPKIGGILWVGYPGEGGAAALAQILFGEHNPGGRSPVTWYPKEF-NKVAMNDMRM 605
Query: 595 RPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSIN-ATI 653
RP S YPGRTYRFY GP V+ FG+G+SYTN+ +T A +V L ++ IN +T
Sbjct: 606 RPESSSGYPGRTYRFYNGPKVFEFGYGLSYTNYSYTFA---SVSKNQLLFKNPKINQSTE 662
Query: 654 SGKAIKVTHAK-----CNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG-HWAPHKQLVA 707
G + + + CN + V+V VKN G G H +L+F + P K L+
Sbjct: 663 KGSVLNIAVSDVGPEVCNSAMITVKVAVKNQGEMAGKHPVLLFLKHSSTVDEVPKKTLIG 722
Query: 708 FEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSV 758
F+ V++ AGA +V ++ C++ + +R GT I G+H + +G ++ +
Sbjct: 723 FKSVNLEAGANTQVTFDVKPCEHFTRANRDGTLVIDEGKHFLLLGDQEYPI 773
>gi|225459350|ref|XP_002285805.1| PREDICTED: probable beta-D-xylosidase 7-like [Vitis vinifera]
Length = 774
Score = 729 bits (1883), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 364/771 (47%), Positives = 513/771 (66%), Gaps = 22/771 (2%)
Query: 5 IAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQE 64
I ++ +IL+ S+ PF+CD + +T++ FC+ +LPIP RV DL+ RL+L E
Sbjct: 9 INLIYVTVILVGVEST---QSPPFSCDSSNPSTKSYHFCKTTLPIPDRVRDLVSRLTLDE 65
Query: 65 KVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFN 124
K+ L++ A A+PRLGI YEWWSEALHGV++ GPG +F G ATSFPQVI TA+SF+
Sbjct: 66 KISQLVNSAPAIPRLGIPAYEWWSEALHGVADAGPGIRFNGTIRSATSFPQVILTAASFD 125
Query: 125 ATLWEAIGRVVSDEARAMYNGG-TAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYA 183
LW IGR + EARA+YN G T G+T+W+PN+NIFRDPRWGRGQETPGEDP+++G YA
Sbjct: 126 VHLWYRIGRAIGVEARAVYNAGQTKGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGSYA 185
Query: 184 ASYVRGLQG------SDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFD 237
SYVRG+QG L+ +A CKHFTAYDLD+W G+DRF F+A+V+ QD+ DT+
Sbjct: 186 VSYVRGVQGDCLRGLKRCGELQASACCKHFTAYDLDDWKGIDRFKFDARVTMQDLADTYQ 245
Query: 238 VPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYD 297
PF C+ EG+ + +MC+YN+VNGVP+CAD N+L T R W GYI SDCD+V + +D
Sbjct: 246 PPFHRCIEEGRASGIMCAYNRVNGVPSCADFNLLTNTARKRWNFQGYITSDCDAVSLIHD 305
Query: 298 TQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRL 357
+ F TPE+A D ++AG+D++CG +L HT+SAV + L E +++ AL N V+MRL
Sbjct: 306 SYGFAKTPEDAVVDVLKAGMDVNCGTYLLNHTKSAVMQKKLPESELDRALENLFAVRMRL 365
Query: 358 GMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVI 417
G+F+G P QPYG +GP VC+ +HQ LAL+AAR GIVLLKN LPL + ++AVI
Sbjct: 366 GLFNGNPKGQPYGDIGPNQVCSVEHQTLALDAARDGIVLLKNSQRLLPLPKGKTMSLAVI 425
Query: 418 GPNSDVTVTMIGNYAGIACGYTTPLQGIGRYAR-TIHQQGCKDVACADDQLFGAAIDASR 476
GPN++ T+IGNYAG C + TPLQ + Y + T++ GC VAC+ + A++ ++
Sbjct: 426 GPNANSPKTLIGNYAGPPCKFITPLQALQSYVKSTMYHPGCDAVACSSPSI-EKAVEIAQ 484
Query: 477 QADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAF 536
+AD +LVMGLDQ+ E EA DR L+LPG+QQ+L+ V+ A+K P +LVL+SGGP+D++F
Sbjct: 485 KADYVVLVMGLDQTQEREAHDRLDLVLPGKQQQLIICVANAAKKPVVLVLLSGGPVDISF 544
Query: 537 AKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRP 596
AK I +I+WAGYPG AGG AIA+ +FG NPGG+LP+TWYPQ++ T +PMT+M MRP
Sbjct: 545 AKYSNNIGSILWAGYPGGAGGAAIAETIFGDHNPGGRLPVTWYPQDF-TKIPMTDMRMRP 603
Query: 597 SQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGK 656
+ YPGRTYRFY G V+ FG+G+SY+ + + P + + + +
Sbjct: 604 ESNSGYPGRTYRFYTGEKVFEFGYGLSYSTY--SCETIPVTRNKLYFNQSSTAHVYENTD 661
Query: 657 AIKVTHAK------CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFE 709
+I+ T C+ + + + V+N G G H++L+F A +P KQLVAF+
Sbjct: 662 SIRYTSVAELGKELCDSNNISISIRVRNDGEMAGKHSVLLFVRRLKASAGSPIKQLVAFQ 721
Query: 710 KVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
VH+ G VG ++ C++ S ++ G I G H + +G +H V++
Sbjct: 722 SVHLNGGESADVGFLLNPCEHFSGPNKDGLMVIEEGTHFLVVGDQEHPVTV 772
>gi|224066929|ref|XP_002302284.1| predicted protein [Populus trichocarpa]
gi|222844010|gb|EEE81557.1| predicted protein [Populus trichocarpa]
Length = 742
Score = 729 bits (1881), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 354/707 (50%), Positives = 482/707 (68%), Gaps = 27/707 (3%)
Query: 12 LILLSASSSGL-----AAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKV 66
LIL++ ++ L + + P++CD D +T+ PFCQ LPI QRV DL+ RL+L EKV
Sbjct: 11 LILIAIHTTSLHLYVESTQPPYSCDSSDPSTKLYPFCQTKLPISQRVEDLVSRLTLDEKV 70
Query: 67 KLLISGAAAVPRLGIKGYEWWSEALHGV---SNVGPGTKFGGDFPGATSFPQVITTASSF 123
L+ A A+PRLGI YEWWSEALHGV + V G +F G ATSFPQVI TA+SF
Sbjct: 71 SQLVDTAPAIPRLGIPAYEWWSEALHGVALQTTVRQGIRFNGTIRFATSFPQVILTAASF 130
Query: 124 NATLWEAIGRVVSDEARAMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKY 182
+A LW IG+V+ EAR +YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP+++GKY
Sbjct: 131 DAHLWYRIGQVIGKEARGIYNAGQATGMTFWAPNINIFRDPRWGRGQETPGEDPLVAGKY 190
Query: 183 AASYVRGLQGSD------GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTF 236
A SYVRG+QG G++L+ +A CKHFTAYDLD W G++RF F+A QD+ DT+
Sbjct: 191 AVSYVRGVQGDSFGGGTLGEQLQASACCKHFTAYDLDKWKGMNRFVFDA----QDLADTY 246
Query: 237 DVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYY 296
PF+ C+ EGK + +MC+YN+VNGVP CAD N+L + RG+W GYI SDCD+V + +
Sbjct: 247 QPPFQSCIQEGKASGIMCAYNRVNGVPNCADYNLLSKKARGQWGFYGYITSDCDAVAIIH 306
Query: 297 DTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMR 356
D Q + +PE+A AD ++AG+D++CG +L +T+SAV++ L E +I+ AL N +++MR
Sbjct: 307 DDQGYAKSPEDAVADVLKAGMDVNCGDYLKNYTKSAVKKKKLPESEIDRALHNLFSIRMR 366
Query: 357 LGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAV 416
LG+F+G P+ QPYG++ P VC+ +HQ LAL+AA+ GIVLLKN LPLS + +++AV
Sbjct: 367 LGLFNGNPTKQPYGNIAPDQVCSQEHQALALKAAQDGIVLLKNPDKLLPLSKLETKSLAV 426
Query: 417 IGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYAR-TIHQQGCKDVACADDQLFGAAIDAS 475
IGPN++ + ++GNY G C TPLQG+ Y + T + GC VAC+ + A+ +
Sbjct: 427 IGPNANNSTKLLGNYFGPPCKTVTPLQGLQNYIKNTRYHPGCSRVACSSASI-NQAVKIA 485
Query: 476 RQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
+ AD ILVMGLDQ+ E E DR L+LPG+Q+EL++ V+ A+K P +LVL GGP+DV+
Sbjct: 486 KGADQVILVMGLDQTQEKEEQDRVDLVLPGKQRELITAVAKAAKKPVVLVLFCGGPVDVS 545
Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
FAK D I +IIWAGYPG+AGGTA+A I+FG NPGG+LPMTWYPQ++ T +PMT+M MR
Sbjct: 546 FAKYDQNIGSIIWAGYPGEAGGTALAQIIFGDHNPGGRLPMTWYPQDF-TKVPMTDMRMR 604
Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINA---T 652
P S YPGRTYRFY G V+ FG+G+SY+N+ + +A + T + L I T
Sbjct: 605 PQLSSGYPGRTYRFYNGKKVFEFGYGLSYSNYSYELA-SDTQNKLYLRASSNQITKNSNT 663
Query: 653 ISGKAI-KVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGH 698
I K I + C + V V VKN G G + + + P H
Sbjct: 664 IRHKLISNIGKELCEKTKFTVTVRVKNHGEMAGENAEIQYELSPCEH 710
>gi|242062502|ref|XP_002452540.1| hypothetical protein SORBIDRAFT_04g027700 [Sorghum bicolor]
gi|241932371|gb|EES05516.1| hypothetical protein SORBIDRAFT_04g027700 [Sorghum bicolor]
Length = 784
Score = 727 bits (1877), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 362/757 (47%), Positives = 494/757 (65%), Gaps = 20/757 (2%)
Query: 21 GLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLG 80
G A+ P+ C +PFC +LPI +RV+DL+ RL++ EK+ L + A+PRLG
Sbjct: 33 GAASEPPYTCGA--GAPPNIPFCDTALPIDRRVDDLVSRLTVAEKISQLGDESPAIPRLG 90
Query: 81 IKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEAR 140
+ Y+WWSEALHGV+N G G G ATSFPQVI TA+SFN LW IG+V+ EAR
Sbjct: 91 VPAYKWWSEALHGVANAGRGIHLDGPLRAATSFPQVILTAASFNPHLWYRIGQVIGVEAR 150
Query: 141 AMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGS------ 193
A+YN G A GLT+W+PN+N+FRDPRWGRGQETPGEDP ++GKYAA +VRG+QG
Sbjct: 151 AVYNNGQAEGLTFWAPNINVFRDPRWGRGQETPGEDPTMTGKYAAVFVRGVQGYGVAGPV 210
Query: 194 DGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVM 253
+ L+ +A CKHFTAYDL+NW G+ R+ ++AKV+ QD+EDT++ PF+ CV +G + +M
Sbjct: 211 NSTDLEASACCKHFTAYDLENWKGITRYVYDAKVTAQDLEDTYNPPFKSCVEDGHASGIM 270
Query: 254 CSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAI 313
CSYN+VNGVPTCAD N+L +T R W GYI SDCD+V + +D Q + T E+A AD +
Sbjct: 271 CSYNRVNGVPTCADYNLLSKTARQSWGFYGYITSDCDAVSIIHDAQGYAKTSEDAVADVL 330
Query: 314 RAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLG 373
+AG+D++CG ++ + SA+Q+G ++E DIN AL N TV+MRLG+F+G+P YG++G
Sbjct: 331 KAGMDVNCGGYVQKYGASALQQGKITEQDINRALHNLFTVRMRLGLFNGDPRRNRYGNIG 390
Query: 374 PKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG 433
P VCT +HQ+LALEAA+ GIVLLKN G +LPLS ++AVIG N++ +++GNY G
Sbjct: 391 PDQVCTQEHQDLALEAAQDGIVLLKNDGGALPLSKSGVASLAVIGFNANNATSLLGNYFG 450
Query: 434 IACGYTTPLQGIGRYAR-TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIE 492
C TPLQ + Y + T GC AC + A+ A+ AD+ +L MGLDQ+ E
Sbjct: 451 PPCVTVTPLQVLQGYVKDTSFVAGCNSAAC-NVTTIPEAVQAASSADSVVLFMGLDQNQE 509
Query: 493 AEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYP 552
E +DR L LPG+QQ L+ V+ A+K P ILVL+ GGP+DV+FAK +P+I AI+WAGYP
Sbjct: 510 REEVDRLDLTLPGQQQTLIESVANAAKKPVILVLLCGGPVDVSFAKTNPKIGAILWAGYP 569
Query: 553 GQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKG 612
G+AGG AIA +LFG NPGG+LP+TWYPQ++ T +PMT+M MR + YPGRTYRFY+G
Sbjct: 570 GEAGGIAIAQVLFGEHNPGGRLPVTWYPQDF-TKVPMTDMRMRADPATGYPGRTYRFYRG 628
Query: 613 PVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHA----KCNRL 668
P V+ FG+G+SY+ + H P + G ++ T G A A C+RL
Sbjct: 629 PTVFNFGYGLSYSKYSHRFVTKPPPSMSNVAGLK-ALATTAGGVATYDVEAIGSETCDRL 687
Query: 669 TLGVQVDVKNVGSKDGAHTLLVFSTPPA---GHWAPHKQLVAFEKVHVPAGAQQRVGINI 725
V V+N G DG H +LVF P G P +QL+ F+ +H+ A V +
Sbjct: 688 KFPAVVRVQNHGPMDGKHPVLVFLRWPNATDGSGRPARQLIGFQSLHLRATQTAHVEFEV 747
Query: 726 HVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHA 762
CK+ S G + I G H + +G + +S A
Sbjct: 748 SPCKHFSRATEDGRKVIDQGSHFVMVGDDEFEMSFMA 784
>gi|297736787|emb|CBI25988.3| unnamed protein product [Vitis vinifera]
Length = 774
Score = 727 bits (1876), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 371/734 (50%), Positives = 486/734 (66%), Gaps = 41/734 (5%)
Query: 38 RTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNV 97
+ +C SLP RV DL+ R++L+EK + +I A+ VPR+G+ Y+WWSEALHGV+NV
Sbjct: 63 KDFVYCDSSLPYDVRVKDLVDRITLEEKARNVIDVASGVPRIGLPPYKWWSEALHGVANV 122
Query: 98 GPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNV 157
G T F PGATSFP VI +A+SFN +LW+ +G+VVS EARAMYN G AGLT+WSPN+
Sbjct: 123 GSATFFDEVVPGATSFPNVILSAASFNQSLWKTLGQVVSTEARAMYNLGHAGLTFWSPNI 182
Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR---------LKVAASCKHFT 208
N+ RDPRWGR ETPGEDP+ G Y +YVRGLQ +G LK+A+SCKHF
Sbjct: 183 NVARDPRWGRILETPGEDPLTVGVYGVNYVRGLQDIEGTENTTDLNSRPLKIASSCKHFA 242
Query: 209 AYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADP 268
AYDLD W VDR HF+AKVS+QD+ +TF PF MCV EG +SVMCS+N +NG+P CADP
Sbjct: 243 AYDLDQWFNVDRRHFDAKVSEQDMTETFLRPFEMCVKEGDTSSVMCSFNNINGIPPCADP 302
Query: 269 NILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTS-TPEEAAADAIRAGLDLDCGPFLGL 327
LK IR +W L+GYIVSDC ++ Q F T EE A +++AGLDL+CG +
Sbjct: 303 RFLKGVIREQWNLHGYIVSDCWAIDTIVQDQKFLDVTSEEGVALSMKAGLDLECGHYYND 362
Query: 328 HTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELAL 387
+AV+ G +SE D++ +L V MR+G FDG PS LG KD+C +H ELA
Sbjct: 363 SLATAVREGRVSEHDVDKSLSYLYVVLMRVGFFDGIPS---LASLGKKDICNDEHIELAR 419
Query: 388 EAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGR 447
EAARQGIVLLKN +LPL ++ +A++GP+++ TV MIGNYAGI C Y +PL
Sbjct: 420 EAARQGIVLLKNDNATLPLKPVKK--LALVGPHANATVAMIGNYAGIPCHYVSPLDAFSE 477
Query: 448 YARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQ 507
++ GC DV C +D A +A++ ADATI+++G D SIEAE DR LLLPG Q
Sbjct: 478 LGDVTYEVGCADVKCHNDTHVYKAAEAAKNADATIILVGTDLSIEAEERDREDLLLPGYQ 537
Query: 508 QELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGT 567
E+V++V+ S GP ILV+M GGPID++FAKN+P+IAAI+WAG+PG+ GG AIADI+FG
Sbjct: 538 TEMVNQVTDLSTGPVILVVMCGGPIDISFAKNNPKIAAILWAGFPGEQGGNAIADIVFGK 597
Query: 568 SNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNF 627
NPGG+ P+TWY Y+ LPMT MA+RP +S YPGRTY+F+ G VYPFG+G+SYTNF
Sbjct: 598 YNPGGRSPITWYENGYVGMLPMTSMALRPIESLGYPGRTYKFFNGSTVYPFGYGLSYTNF 657
Query: 628 VHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHT 687
+++ APT R I+ T + QV VKNVGS DG+
Sbjct: 658 SYSLT-APT--------RSVHISLT----------------SFEFQVAVKNVGSMDGSEV 692
Query: 688 LLVFSTPPAGHWAPH-KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGE 746
++V+S+PP+G H KQ++ FE+V V G ++V +++VCK L +VD SG +P G
Sbjct: 693 VMVYSSPPSGIVGTHIKQVIGFERVFVKVGNTEKVKFSMNVCKSLGLVDSSGYILLPSGS 752
Query: 747 HNIHIGGTKHSVSL 760
H I G SVS
Sbjct: 753 HTIMAGDNSTSVSF 766
>gi|15238197|ref|NP_196618.1| putative beta-D-xylosidase 6 [Arabidopsis thaliana]
gi|75264319|sp|Q9LXA8.1|BXL6_ARATH RecName: Full=Probable beta-D-xylosidase 6; Short=AtBXL6; Flags:
Precursor
gi|7671447|emb|CAB89387.1| beta-xylosidase-like protein [Arabidopsis thaliana]
gi|15982753|gb|AAL09717.1| AT5g10560/F12B17_90 [Arabidopsis thaliana]
gi|332004180|gb|AED91563.1| putative beta-D-xylosidase 6 [Arabidopsis thaliana]
Length = 792
Score = 726 bits (1873), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 364/787 (46%), Positives = 490/787 (62%), Gaps = 38/787 (4%)
Query: 7 FFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKV 66
FF I + + L + F C P ++ PFC VSL I QR L+ L L EK+
Sbjct: 11 LFFTSAI--AETFKNLDSHPQFPCKPPHFSS--YPFCNVSLSIKQRAISLVSLLMLPEKI 66
Query: 67 KLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNAT 126
L + AA+VPRLGI YEWWSE+LHG+++ GPG F G ATSFPQVI +A+SFN T
Sbjct: 67 GQLSNTAASVPRLGIPPYEWWSESLHGLADNGPGVSFNGSISAATSFPQVIVSAASFNRT 126
Query: 127 LWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASY 186
LW IG V+ E RAMYNGG AGLT+W+PN+N+FRDPRWGRGQETPGEDP + +Y +
Sbjct: 127 LWYEIGSAVAVEGRAMYNGGQAGLTFWAPNINVFRDPRWGRGQETPGEDPKVVSEYGVEF 186
Query: 187 VRGLQGSDGD----------------------RLKVAASCKHFTAYDLDNWNGVDRFHFN 224
VRG Q +L ++A CKHFTAYDL+ W R+ FN
Sbjct: 187 VRGFQEKKKRKVLKRRFSDDVDDDRHDDDADGKLMLSACCKHFTAYDLEKWGNFTRYDFN 246
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
A V++QD+EDT+ PF C+ +GK + +MCSYN VNGVP CA ++L++ R EW GY
Sbjct: 247 AVVTEQDMEDTYQPPFETCIRDGKASCLMCSYNAVNGVPACAQGDLLQKA-RVEWGFEGY 305
Query: 285 IVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDIN 344
I SDCD+V + Q +T +PEEA ADAI+AG+D++CG ++ HT+SA+++G +SE ++
Sbjct: 306 ITSDCDAVATIFAYQGYTKSPEEAVADAIKAGVDINCGTYMLRHTQSAIEQGKVSEELVD 365
Query: 345 NALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSL 404
AL+N VQ+RLG+FDG+P YG LG D+C+ DH++LALEA RQGIVLLKN L
Sbjct: 366 RALLNLFAVQLRLGLFDGDPRRGQYGKLGSNDICSSDHRKLALEATRQGIVLLKNDHKLL 425
Query: 405 PLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA-RTIHQQGCKDVACA 463
PL+ ++A++GP ++ M G Y G C T + Y +T + GC DV+C
Sbjct: 426 PLNKNHVSSLAIVGPMANNISNMGGTYTGKPCQRKTLFTELLEYVKKTSYASGCSDVSCD 485
Query: 464 DDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTI 523
D FG A+ ++ AD I+V GLD S E E DR L LPG+Q++LVS V+ SK P I
Sbjct: 486 SDTGFGEAVAIAKGADFVIVVAGLDLSQETEDKDRVSLSLPGKQKDLVSHVAAVSKKPVI 545
Query: 524 LVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEY 583
LVL GGP+DV FAKNDPRI +IIW GYPG+ GG A+A+I+FG NPGG+LP TWYP+ +
Sbjct: 546 LVLTGGGPVDVTFAKNDPRIGSIIWIGYPGETGGQALAEIIFGDFNPGGRLPTTWYPESF 605
Query: 584 ITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVP-- 641
T++ M++M MR + S+ YPGRTYRFY GP VY FG G+SYT F + + +AP +++
Sbjct: 606 -TDVAMSDMHMRANSSRGYPGRTYRFYTGPQVYSFGTGLSYTKFEYKILSAPIRLSLSEL 664
Query: 642 LDGRHGSINATISGKAIK------VTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFST-P 694
L + G+ ++ V C L V+V V N G DG+H +++FS P
Sbjct: 665 LPQQSSHKKQLQHGEELRYLQLDDVIVNSCESLRFNVRVHVSNTGEIDGSHVVMLFSKMP 724
Query: 695 PAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGT 754
P P KQL+ +++VHV + I CK LSV + G R IPLG H + +G
Sbjct: 725 PVLSGVPEKQLIGYDRVHVRSNEMMETVFVIDPCKQLSVANDVGKRVIPLGSHVLFLGDL 784
Query: 755 KHSVSLH 761
+HS+S+
Sbjct: 785 QHSLSVE 791
>gi|449451581|ref|XP_004143540.1| PREDICTED: probable beta-D-xylosidase 6-like [Cucumis sativus]
Length = 777
Score = 725 bits (1871), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 348/749 (46%), Positives = 484/749 (64%), Gaps = 28/749 (3%)
Query: 39 TLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVG 98
+ PFC SL R L+ L+L EK++ L + A+++PRLGI Y+WWSE LHG++ G
Sbjct: 29 SYPFCNRSLSFTARAQSLVSLLTLDEKIQQLSNNASSIPRLGIPSYQWWSEGLHGIATNG 88
Query: 99 PGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVN 158
PG F G ATSFPQV+ TA+SFN TLW IG ++ EARAM+N G GLT W+PN+N
Sbjct: 89 PGVSFNGSITSATSFPQVLVTAASFNRTLWFLIGSAIAVEARAMFNVGQCGLTIWAPNIN 148
Query: 159 IFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR-----------------LKVA 201
IFRDPRWGRGQETPGEDP+++ Y+ +VRGLQ + + L V+
Sbjct: 149 IFRDPRWGRGQETPGEDPMVASAYSIQFVRGLQSGNWMKEHEIRNEVLEEDNGMGSLMVS 208
Query: 202 ASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNG 261
A CKHFTAYDL+ WN R+ F++ V++QD+ DT+ PFR C+ +GK + +MCSYN VNG
Sbjct: 209 ACCKHFTAYDLEKWNNFTRYTFDSVVTEQDLGDTYQPPFRSCIQQGKASCLMCSYNAVNG 268
Query: 262 VPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC 321
VP CA+P++LK+ R +W L GYI SDCD+V Y+ Q +T TPE+A AD ++AG+D++C
Sbjct: 269 VPACANPDLLKKA-RNDWGLKGYITSDCDAVATVYEYQKYTDTPEDAIADVLKAGMDINC 327
Query: 322 GPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD 381
G F+ T+SA+ +G + E ++++AL+N +VQ RLG FDG P +G LG +DVCT
Sbjct: 328 GTFMLRGTKSAIDQGKVREEELDSALINLFSVQARLGFFDGNPREGKFGELGAQDVCTAQ 387
Query: 382 HQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTP 441
H+ LALEAARQGIVLLKN+ LPL ++ VIG ++ + ++G YAG+ C +
Sbjct: 388 HKTLALEAARQGIVLLKNENKFLPLDKNAISSLTVIGSLANDSSKLLGGYAGVPCSPMSL 447
Query: 442 LQGIGRYARTI-HQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAG 500
++G YA TI GC DV CA D F AI +++AD I V GLD S E E LDR
Sbjct: 448 VEGFQEYAETIFFASGCLDVPCASDNRFEDAILIAKKADFVIAVAGLDASQETEDLDRVS 507
Query: 501 LLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
LLLPG+Q +LVS V+ SK P ILVL+ GGP+D++FAK D R+A+I+W G PG+AGG A+
Sbjct: 508 LLLPGKQMDLVSSVASVSKKPIILVLIGGGPLDISFAKKDSRVASILWIGNPGEAGGKAL 567
Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGH 620
A+++FG NPGG+LP+TWYPQ + TN+PM +M MRP+ S+ YPGRTYRFY G +Y FG
Sbjct: 568 AEVIFGDYNPGGRLPVTWYPQSF-TNVPMNDMHMRPNPSRGYPGRTYRFYTGDRIYGFGE 626
Query: 621 GMSYTNFVHTVANAPTVVAV---PLDGRHGSINATISGKAIKVTHAK----CNRLTLGVQ 673
G+SYT+F + + +AP V + R I G + + C+ L V+
Sbjct: 627 GLSYTSFKYRLLSAPKKVNLLGKAETSRRRIIPQVRDGVNMSYMEVEEVESCDLLRFEVK 686
Query: 674 VDVKNVGSKDGAHTLLVFST-PPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLS 732
+ V N+G DG+H +++FS P P +QL+ F++++V I + C ++S
Sbjct: 687 LSVSNIGEFDGSHVVMMFSEFPKVLTGTPQRQLIGFDRLYVKRNQSAESSIMVDPCNHVS 746
Query: 733 VVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
+ D G R IPLG+H I +G +H +S+
Sbjct: 747 LADEYGKRVIPLGDHTISLGDLEHVISIQ 775
>gi|356515806|ref|XP_003526589.1| PREDICTED: probable beta-D-xylosidase 7-like [Glycine max]
Length = 772
Score = 725 bits (1871), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 365/754 (48%), Positives = 491/754 (65%), Gaps = 26/754 (3%)
Query: 27 PFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEW 86
PFACD + ++R+ PFC LPIPQR DL+ RL+L EK+ L++ A +PRLGI Y+W
Sbjct: 25 PFACDFSNPSSRSYPFCNPKLPIPQRTKDLLSRLTLDEKLSQLVNTAPPIPRLGIPAYQW 84
Query: 87 WSEALHGVSNVGPGTKFGGD--FPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN 144
WSEALHGVS VGPG F + ATSFPQVI TA+SF++ LW IG + EARA++N
Sbjct: 85 WSEALHGVSGVGPGILFDNNSTISSATSFPQVILTAASFDSRLWYRIGHAIGIEARAIFN 144
Query: 145 GGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD--GDRLKVA 201
G A GLT+W+PN+NIFRDPRWGRGQET GEDP+L+ +YA S+VRGLQG G L +
Sbjct: 145 AGQANGLTFWAPNINIFRDPRWGRGQETAGEDPLLTSRYAVSFVRGLQGDSFKGAHLLAS 204
Query: 202 ASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNG 261
A CKHFTAYDLDNW GVDRF F+A+VS QD+ DT+ PF+ CV +G+ + +MC+YN+VNG
Sbjct: 205 ACCKHFTAYDLDNWKGVDRFVFDARVSLQDLADTYQPPFQSCVQQGRASGIMCAYNRVNG 264
Query: 262 VPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC 321
VP CAD +L +T R +W NGYI SDC +VG +D Q + +PE+ AD +RAG+DL+C
Sbjct: 265 VPNCADYGLLTQTARNQWDFNGYITSDCGAVGFIHDRQRYAKSPEDVVADVLRAGMDLEC 324
Query: 322 GPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD 381
G +L H +SAV + L +I+ AL N +++MRLG+FDG P+ +G +G VC+ +
Sbjct: 325 GSYLTYHAKSAVLQKKLGMSEIDRALQNLFSIRMRLGLFDGNPTRLSFGLIGSNHVCSKE 384
Query: 382 HQELALEAARQGIVLLKNQGPSLPLSHIRHR-TVAVIGPNSDVT-VTMIGNYAGIACGYT 439
HQ LALEAAR GIVLLKN LPL ++AVIGPN++ + +T++GNYAG C Y
Sbjct: 385 HQYLALEAARNGIVLLKNSPTLLPLPKTSPSISLAVIGPNANSSPLTLLGNYAGPPCKYV 444
Query: 440 TPLQGIGRYART-IHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDR 498
T LQG Y + + GC A++ +++ D +LVMGLDQS E E DR
Sbjct: 445 TILQGFRHYVKNAFYHPGCDGGPKCSSAQIDQAVEVAKKVDYVVLVMGLDQSEEREERDR 504
Query: 499 AGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGT 558
L LPG+Q EL++ V+ ASK P ILVL+SGGP+D+ AK + +I I+WAGYPG+ GG
Sbjct: 505 VHLDLPGKQLELINGVAEASKKPVILVLLSGGPLDITSAKYNHKIGGILWAGYPGELGGI 564
Query: 559 AIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPF 618
A+A I+FG NPGG+LP TWYP++YI +PMT+M MR S YPGRTYRFYKGP VY F
Sbjct: 565 ALAQIIFGDHNPGGRLPTTWYPKDYI-KVPMTDMRMRADPSTGYPGRTYRFYKGPKVYEF 623
Query: 619 GHGMSYTNFVHTVANAPTVVAVPLDGRHGSINA---------TISGKAI-KVTHAKCNRL 668
G+G+SY+ + + V+V D H + ++ TIS K + ++ C +
Sbjct: 624 GYGLSYSKYSY------EFVSVTHDKLHFNQSSTHLMVENSETISYKLVSELDEQTCQSM 677
Query: 669 TLGVQVDVKNVGSKDGAHTLLVFSTPP-AGHWAPHKQLVAFEKVHVPAGAQQRVGINIHV 727
+L V V V+N GS G H +L+F P +P KQLV FE V + AG V +
Sbjct: 678 SLSVTVRVQNHGSMVGKHPVLLFIRPKRQKSGSPVKQLVGFESVMLDAGEMAHVEFEVSP 737
Query: 728 CKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
C++LS + +G I G H + + +H + ++
Sbjct: 738 CEHLSRANEAGAMIIEEGSHMLLVDDLEHPIDIY 771
>gi|224058158|ref|XP_002299457.1| predicted protein [Populus trichocarpa]
gi|222846715|gb|EEE84262.1| predicted protein [Populus trichocarpa]
Length = 780
Score = 725 bits (1871), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 360/742 (48%), Positives = 493/742 (66%), Gaps = 23/742 (3%)
Query: 18 SSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVP 77
S+S A F C P T T FC SLPI +R LI L+LQEK++ L A+ +P
Sbjct: 20 SNSKSVANPQFPCKP--PTHNTYSFCNKSLPITRRAQSLISHLTLQEKIQQLSDNASGIP 77
Query: 78 RLGIKGYEWWSEALHGVSNVGPGTKF--GGDFPGATSFPQVITTASSFNATLWEAIGRVV 135
RLGI YEWWSE+LHG+S GPG F GG AT FPQVI +A+SFN TLW IG +
Sbjct: 78 RLGIPHYEWWSESLHGISINGPGVSFKNGGPVTSATGFPQVIVSAASFNRTLWFLIGSAI 137
Query: 136 SDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG--- 192
+ EARAMYN G AGLT+W+PN+NIFRDPRWGRGQETPGEDP+++ YA +V+G QG
Sbjct: 138 AIEARAMYNVGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVASAYAIEFVKGFQGGHW 197
Query: 193 --SDG----DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVME 246
DG D+L ++A CKH TAYDL+ W R+ FNA V++QD+EDT+ PFR C+ +
Sbjct: 198 KNEDGEINDDKLMLSACCKHSTAYDLEKWGNFSRYSFNAVVTEQDMEDTYQPPFRSCIQK 257
Query: 247 GKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPE 306
GK + +MCSYN+VNGVP CA ++L++ R EW GYI SDCD+V ++ Q+++ +PE
Sbjct: 258 GKASCLMCSYNEVNGVPACAREDLLQKP-RTEWGFKGYITSDCDAVATIFEYQNYSKSPE 316
Query: 307 EAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSS 366
+A A A++AG+D++CG ++ + +SAV++G L E DI+ AL N +VQ+RLG+FDG+P
Sbjct: 317 DAVAIALKAGMDINCGTYVLRNAQSAVEKGKLQEEDIDRALHNLFSVQLRLGLFDGDPRK 376
Query: 367 QPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVT 426
+G LGPK+VCT +H+ LALEAARQGIVLLKN LPL+ ++A+IGP +++ +
Sbjct: 377 GQFGKLGPKNVCTKEHKTLALEAARQGIVLLKNDKKLLPLNKKAVSSLAIIGPLANMANS 436
Query: 427 MIGNYAGIACGYTTPLQGIGRYA-RTIHQQGCKDVACADDQLFGAAIDASRQADATILVM 485
+ G+Y G C + +G+ Y +T + GC DVAC D F AI +++AD I+V
Sbjct: 437 LGGDYTGYPCDPQSLFEGLKAYVKKTSYAIGCLDVACVSDTQFHKAIIVAKRADFVIIVA 496
Query: 486 GLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAA 545
GLD S E E DR LLLPG+Q LVS V+ ASK P ILVL GGP+DV+FAK DPRIA+
Sbjct: 497 GLDLSQETEEHDRVSLLLPGKQMSLVSSVAAASKKPVILVLTGGGPLDVSFAKGDPRIAS 556
Query: 546 IIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGR 605
I+W GYPG+AG A+A+I+FG NPGG+LPMTWYP+ + T + MT+M MRP+ S+ YPGR
Sbjct: 557 ILWIGYPGEAGAKALAEIIFGEYNPGGRLPMTWYPESF-TEVSMTDMNMRPNPSRGYPGR 615
Query: 606 TYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTH--- 662
TYRFY G VY FG G+SYTNF + + +AP+ +++ S + ++++
Sbjct: 616 TYRFYTGNRVYGFGGGLSYTNFTYKILSAPSKLSLSGSLSSNSRKRILQQGGERLSYINI 675
Query: 663 ---AKCNRLTLGVQVDVKNVGSKDGAHTLLVFS-TPPAGHWAPHKQLVAFEKVHVPAGAQ 718
C+ L +Q+ V+NVG+ DG H +++FS P AP KQLV F++VH +
Sbjct: 676 NEITSCDSLRFYMQILVENVGNMDGGHVVMLFSRVPTVFRGAPEKQLVGFDRVHTISHRS 735
Query: 719 QRVGINIHVCKYLSVVDRSGTR 740
+ I + C++LSV + G +
Sbjct: 736 TEMSILVDPCEHLSVANEQGKK 757
>gi|449496501|ref|XP_004160150.1| PREDICTED: probable beta-D-xylosidase 6-like, partial [Cucumis
sativus]
Length = 767
Score = 724 bits (1870), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 348/749 (46%), Positives = 484/749 (64%), Gaps = 28/749 (3%)
Query: 39 TLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVG 98
+ PFC SL R L+ L+L EK++ L + A+++PRLGI Y+WWSE LHG++ G
Sbjct: 19 SYPFCNRSLSFTARAQSLVSLLTLDEKIQQLSNNASSIPRLGIPSYQWWSEGLHGIATNG 78
Query: 99 PGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVN 158
PG F G ATSFPQV+ TA+SFN TLW IG ++ EARAM+N G GLT W+PN+N
Sbjct: 79 PGVSFNGSITSATSFPQVLVTAASFNRTLWFLIGSAIAVEARAMFNVGQCGLTIWAPNIN 138
Query: 159 IFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR-----------------LKVA 201
IFRDPRWGRGQETPGEDP+++ Y+ +VRGLQ + + L V+
Sbjct: 139 IFRDPRWGRGQETPGEDPMVASAYSIQFVRGLQSGNWMKEHEIRNEVLEEDNGMGSLMVS 198
Query: 202 ASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNG 261
A CKHFTAYDL+ WN R+ F++ V++QD+ DT+ PFR C+ +GK + +MCSYN VNG
Sbjct: 199 ACCKHFTAYDLEKWNNFTRYTFDSVVTEQDLGDTYQPPFRSCIQQGKASCLMCSYNAVNG 258
Query: 262 VPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC 321
VP CA+P++LK+ R +W L GYI SDCD+V Y+ Q +T TPE+A AD ++AG+D++C
Sbjct: 259 VPACANPDLLKKA-RNDWGLKGYITSDCDAVATVYEYQKYTDTPEDAIADVLKAGMDINC 317
Query: 322 GPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD 381
G F+ T+SA+ +G + E ++++AL+N +VQ RLG FDG P +G LG +DVCT
Sbjct: 318 GTFMLRGTKSAIDQGKVREEELDSALINLFSVQARLGFFDGNPREGKFGELGAQDVCTAQ 377
Query: 382 HQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTP 441
H+ LALEAARQGIVLLKN+ LPL ++ VIG ++ + ++G YAG+ C +
Sbjct: 378 HKTLALEAARQGIVLLKNENKFLPLDKNAISSLTVIGSLANDSSKLLGGYAGVPCSPMSL 437
Query: 442 LQGIGRYARTI-HQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAG 500
++G YA TI GC DV CA D F AI +++AD I V GLD S E E LDR
Sbjct: 438 VEGFQEYAETIFFASGCLDVPCASDNRFEDAILIAKKADFVIAVAGLDASQETEDLDRVS 497
Query: 501 LLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
LLLPG+Q +LVS V+ SK P ILVL+ GGP+D++FAK D R+A+I+W G PG+AGG A+
Sbjct: 498 LLLPGKQMDLVSSVASVSKKPIILVLIGGGPLDISFAKKDSRVASILWIGNPGEAGGKAL 557
Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGH 620
A+++FG NPGG+LP+TWYPQ + TN+PM +M MRP+ S+ YPGRTYRFY G +Y FG
Sbjct: 558 AEVIFGDYNPGGRLPVTWYPQSF-TNVPMNDMHMRPNPSRGYPGRTYRFYTGDRIYGFGE 616
Query: 621 GMSYTNFVHTVANAPTVVAV---PLDGRHGSINATISGKAIKVTHAK----CNRLTLGVQ 673
G+SYT+F + + +AP V + R I G + + C+ L V+
Sbjct: 617 GLSYTSFKYRLLSAPKKVNLLGKAETSRRRIIPQVRDGVNMSYMEVEEVESCDLLRFEVK 676
Query: 674 VDVKNVGSKDGAHTLLVFST-PPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLS 732
+ V N+G DG+H +++FS P P +QL+ F++++V I + C ++S
Sbjct: 677 LSVSNIGEFDGSHVVMMFSEFPKVLTGTPQRQLIGFDRLYVKRNQSAESSIMVDPCNHVS 736
Query: 733 VVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
+ D G R IPLG+H I +G +H +S+
Sbjct: 737 LADEYGKRVIPLGDHTISLGDLEHVISIQ 765
>gi|37359708|dbj|BAC98299.1| LEXYL2 [Solanum lycopersicum]
Length = 633
Score = 724 bits (1869), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 347/633 (54%), Positives = 460/633 (72%), Gaps = 6/633 (0%)
Query: 131 IGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGL 190
IG+VVS E RAMYN G AGLTYWSPNVNI+RDPRWGRGQET GEDP LS +Y +YV+GL
Sbjct: 2 IGKVVSTEGRAMYNVGQAGLTYWSPNVNIYRDPRWGRGQETAGEDPTLSSRYGVAYVKGL 61
Query: 191 QGSDG---DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEG 247
Q D D LKVA+ CKH+TAYD+D+W G+ R++FNAKV++QD++DTF+ PF+ CV++G
Sbjct: 62 QQRDDGKKDMLKVASCCKHYTAYDVDDWKGIQRYNFNAKVTQQDLDDTFNPPFKSCVLDG 121
Query: 248 KVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEE 307
VASVMCSYNQV+G PTC D ++L IRG+W+LNGYIV+DCDS+ Y QH+T TPEE
Sbjct: 122 NVASVMCSYNQVDGKPTCGDYDLLAGVIRGQWKLNGYIVTDCDSLNEMYWAQHYTKTPEE 181
Query: 308 AAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQ 367
AA ++ AGL L+CG +LG +T+ AV +GL++E I+ A+ N MRLG FDG P +Q
Sbjct: 182 TAALSLNAGLGLNCGSWLGKYTQGAVNQGLVNESVIDRAVTNNFATLMRLGFFDGNPKNQ 241
Query: 368 PYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTM 427
YG+LGPKD+CT DHQELA EAARQGIVLLKN SLPLS +++AVIGPN+++ TM
Sbjct: 242 LYGNLGPKDICTEDHQELAREAARQGIVLLKNTAGSLPLSPKSIKSLAVIGPNANLAYTM 301
Query: 428 IGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGL 487
+G+Y G C YTTPL G+G T++QQGC D+ACA Q+ A + ADA +LVMG
Sbjct: 302 VGSYEGSPCKYTTPLDGLGASVSTVYQQGC-DIACATAQVDNAKK-VAAAADAVVLVMGS 359
Query: 488 DQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAII 547
DQ+IE E+ DR + LPG+Q LV++V+ SKGP ILV+MSGG +DV FA ++P++ +I+
Sbjct: 360 DQTIERESKDRFNITLPGQQSLLVTEVASVSKGPVILVIMSGGGMDVKFAVDNPKVTSIL 419
Query: 548 WAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTY 607
W G+PG+AGG A+AD++FG NPGG+LPMTWYPQ Y+ + MT M MR +PGR+Y
Sbjct: 420 WVGFPGEAGGAALADVVFGYHNPGGRLPMTWYPQSYVDKVDMTNMNMRADPKTGFPGRSY 479
Query: 608 RFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNR 667
RFYKGP V+ FG G+SYT + H + AP V++PL+ H + +T V CN
Sbjct: 480 RFYKGPTVFNFGDGLSYTQYKHHLVKAPKFVSIPLEEGH-ACRSTKCKSIDAVNEQGCNN 538
Query: 668 LTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHV 727
L L + + V+NVG G+HT+L+F++PP+ H AP K L+ F+K+H+ ++ V N+ V
Sbjct: 539 LGLDIHLKVQNVGKMRGSHTVLLFTSPPSVHNAPQKHLLDFQKIHLTPQSEGVVKFNLDV 598
Query: 728 CKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
CK+LSVVD G R++ LG H +HIG KHS++L
Sbjct: 599 CKHLSVVDEVGNRKVALGLHVLHIGDLKHSLTL 631
>gi|357489441|ref|XP_003615008.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
gi|355516343|gb|AES97966.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
Length = 798
Score = 723 bits (1866), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 369/792 (46%), Positives = 514/792 (64%), Gaps = 42/792 (5%)
Query: 5 IAFFFLGLI----LLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
I+F FL L L+ A S P++CD + T++LPFC ++L I QR D++ RL
Sbjct: 11 ISFIFLFLTRYHRLVHADSP--THVPPYSCDTTNPLTKSLPFCNLNLTITQRAKDIVSRL 68
Query: 61 SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
+L EK+ L++ A ++PRLGI Y+WW EALHGV+N G G + G GATSFPQVI TA
Sbjct: 69 TLDEKISQLVNTAPSIPRLGIPSYQWWDEALHGVANAGKGIRLNGSVAGATSFPQVILTA 128
Query: 121 SSFNATLWEAIGRVVSDEARAMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLS 179
+SF++ LW I +V+ EAR +YN G A G+T+W+PN+NIFRDPRWGRGQET GEDP+++
Sbjct: 129 ASFDSKLWYQISKVIGTEARGVYNAGQAQGMTFWAPNINIFRDPRWGRGQETAGEDPLVN 188
Query: 180 GKYAASYVRGLQGSD-------GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVS---- 228
KY SYVRGLQG GDRLK +A CKHFTAYDLDNW G+DRF F+AKVS
Sbjct: 189 SKYGVSYVRGLQGDSFEGGKLIGDRLKASACCKHFTAYDLDNWKGLDRFDFDAKVSFLFS 248
Query: 229 ------------KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIR 276
QD+ DT+ PF C+++G+ + +MC+YN+VNGVP CAD N+L +T R
Sbjct: 249 MAYSPWMINYVTLQDLADTYQPPFHSCIVQGRSSGIMCAYNRVNGVPNCADYNLLTKTAR 308
Query: 277 GEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRG 336
+W NGYI SDC++V + YD Q + TPE+A AD ++AG+D++CG +L H ++AV +
Sbjct: 309 QKWNFNGYITSDCEAVRIIYDNQGYAKTPEDAVADVLQAGMDVECGDYLTKHAKAAVLQK 368
Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
+ I+ AL N T+++RLG+FDG P+ YG +GP VC+ ++ +LALEAAR GIVL
Sbjct: 369 KVPISQIDRALHNLFTIRIRLGLFDGNPTKLQYGRIGPNQVCSKENLDLALEAARSGIVL 428
Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMI-GNYAGIACGYTTPLQGIGRYARTIH-Q 454
LKN LPL R T+ VIGPN++ + ++ GNY G C L+G YA H +
Sbjct: 429 LKNTASILPLP--RVNTLGVIGPNANKSSKVVLGNYFGRPCRLVPILKGFYTYASQTHYR 486
Query: 455 QGCKD-VACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSK 513
GC D CA ++ A++ ++ +D ILVMGLDQS E E+ DR L LPG+QQEL++
Sbjct: 487 SGCLDGTKCASAEI-DRAVEVAKISDYVILVMGLDQSQERESRDRDDLELPGKQQELINS 545
Query: 514 VSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGK 573
V+ ASK P ILVL+ GGP+D+ FAKN+ +I IIWAGYPG+ GG A+A ++FG NPGG+
Sbjct: 546 VAKASKKPVILVLLCGGPVDITFAKNNDKIGGIIWAGYPGELGGRALAQVVFGDYNPGGR 605
Query: 574 LPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHT-VA 632
LPMTWYP+++I +PMT+M MR S YPGRTYRFY GP VY FG+G+SY+N+ + ++
Sbjct: 606 LPMTWYPKDFI-KIPMTDMRMRADPSSGYPGRTYRFYTGPKVYEFGYGLSYSNYSYNFIS 664
Query: 633 NAPTVVAVPLDGRHGSI--NATISGKAI-KVTHAKCNRLTLGVQVDVKNVGSKDGAHTLL 689
+ + H + + TI K + ++ C +++ V + + N GS G H +L
Sbjct: 665 VKNNNLHINQSTTHSILENSETIYYKLVSELGEETCKTMSISVTLGITNTGSMAGKHPVL 724
Query: 690 VFSTPPAG-HWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHN 748
+F P G + P KQLV FE V V G + VG + VC++LS + SG + I G H
Sbjct: 725 LFVKPKKGRNGNPVKQLVGFESVTVEGGGKGEVGFEVSVCEHLSRANESGVKVIEEGGHL 784
Query: 749 IHIGGTKHSVSL 760
+ +G ++S+++
Sbjct: 785 LVVGEEEYSINI 796
>gi|357489431|ref|XP_003615003.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
gi|355516338|gb|AES97961.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
Length = 780
Score = 723 bits (1865), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 363/776 (46%), Positives = 506/776 (65%), Gaps = 23/776 (2%)
Query: 4 TIAFFFLGLIL---LSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
TI F FL L L + S P++CD + T++ PFC ++L I QR D++ RL
Sbjct: 7 TITFIFLFLTRYHRLVHADSLATNVPPYSCDTSNPLTKSFPFCNLNLTITQRAKDIVSRL 66
Query: 61 SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
+L EK+ L++ A A+PRLGI Y+WW+EALHGVS VG G + G ATSFPQ+I A
Sbjct: 67 TLDEKISQLVNTAPAIPRLGIPSYQWWNEALHGVSYVGKGIRLNGSITAATSFPQIILIA 126
Query: 121 SSFNATLWEAIGRVVSDEARAMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLS 179
+SF+ LW I +V+ EAR +YN G A G+T+W+PN+NIFRDPRWGRGQET GEDP+++
Sbjct: 127 ASFDPKLWYRISKVIGTEARGVYNAGQAQGMTFWAPNINIFRDPRWGRGQETAGEDPLVN 186
Query: 180 GKYAASYVRGLQGSD-------GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDI 232
KY SYVRGLQG G RLK +A CKHFTAYDL+NW GV+R+ F+AKV+ QD+
Sbjct: 187 SKYGVSYVRGLQGDSFEGGKLIGGRLKASACCKHFTAYDLENWKGVNRYVFDAKVTLQDL 246
Query: 233 EDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSV 292
DT+ F CV++G+ + +MC+YN+VNGVP CAD N+L T R +W NGYI SDCD+V
Sbjct: 247 ADTYQPSFHSCVVQGRSSGIMCAYNRVNGVPNCADYNLLTNTARKKWNFNGYIASDCDAV 306
Query: 293 GVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLT 352
Y+ Q + TPE+ AD +RAG+D++CG ++ H +SAV + + I+ AL N T
Sbjct: 307 RFIYEKQGYAKTPEDVVADVLRAGMDVECGNYMTKHAKSAVLQKKIPISQIDRALHNLFT 366
Query: 353 VQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHR 412
+++RLG+FDG P+ YG +GP VC+ ++ +LALEAAR GIVLLKN LPL R
Sbjct: 367 IRIRLGLFDGNPTKLQYGRIGPNQVCSKENLDLALEAARSGIVLLKNTASILPLP--RVN 424
Query: 413 TVAVIGPNSD-VTVTMIGNYAGIACGYTTPLQGIGRYARTIH-QQGCKD-VACADDQLFG 469
T+ VIGPN++ ++ ++GNY G C + L+G YA H + GC D V CA ++
Sbjct: 425 TLGVIGPNANKSSIVLLGNYFGQPCKQVSILKGFYTYASQTHYRSGCTDGVKCASAEI-D 483
Query: 470 AAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSG 529
A++ ++ +D ILVMGLDQS E E LDR L LPG+QQ+L++ V+ ASK P ILV++ G
Sbjct: 484 RAVEVAKISDYVILVMGLDQSQETETLDRDHLELPGKQQKLINSVAKASKKPVILVILCG 543
Query: 530 GPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM 589
GP+D+ FAKN+ +I IIWAGYPG+ GG A+A ++FG NPGG+LPMTWYP+++I +PM
Sbjct: 544 GPVDITFAKNNDKIGGIIWAGYPGELGGRALAQVVFGDYNPGGRLPMTWYPKDFI-KIPM 602
Query: 590 TEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHT-VANAPTVVAVPLDGRHGS 648
T+M MR S YPGRTYRFY GP VY FG+G+SY+N+ + ++ + + H
Sbjct: 603 TDMRMRADPSSGYPGRTYRFYTGPKVYEFGYGLSYSNYSYNFISVKNNNIHINQSTTHSI 662
Query: 649 I--NATISGKAIKVTHAK-CNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG-HWAPHKQ 704
+ + TI K + K C +++ V + + N GS G H +L+F P G + P KQ
Sbjct: 663 LENSETIRYKLVSELGKKACKTMSISVTLGITNTGSMAGKHPVLLFVKPKKGRNGNPVKQ 722
Query: 705 LVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
LV FE V V G + VG + VC++LS + SG + I G + +G ++S+++
Sbjct: 723 LVGFESVTVEGGGKGEVGFEVSVCEHLSRANESGVKVIEEGGYLFLVGELEYSINI 778
>gi|225432132|ref|XP_002274591.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 1-like
[Vitis vinifera]
Length = 805
Score = 721 bits (1862), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 352/761 (46%), Positives = 491/761 (64%), Gaps = 29/761 (3%)
Query: 22 LAAREPFACDPKDAT-----TRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAV 76
LA + CD ++ FC SL +R DL+ R++LQEKV + A+ V
Sbjct: 41 LAKNYTYVCDESRYALLGLDMKSFAFCDKSLSYKERAKDLVSRMTLQEKVMQSVHTASGV 100
Query: 77 PRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVS 136
RLG+ Y WWSEALHG+SN+GPG F PGATS P VI + ++FN TLW+ +GRVVS
Sbjct: 101 RRLGLPEYSWWSEALHGISNLGPGVFFDETIPGATSLPTVILSTAAFNQTLWKTLGRVVS 160
Query: 137 DEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD 196
E RAMYN G AGLT+WSPN+N+ RD RWGR QET GEDP + G++A +YVRGLQ +G
Sbjct: 161 TEGRAMYNLGHAGLTFWSPNINVVRDTRWGRTQETSGEDPFIVGEFAVNYVRGLQDVEGT 220
Query: 197 R---------LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEG 247
LKV++ CKH+ AYD+D+W VDR F+A+VS+QD+++TF PF CV EG
Sbjct: 221 ENVTDLNSRPLKVSSCCKHYAAYDIDSWLNVDRHTFDARVSEQDMKETFVSPFERCVREG 280
Query: 248 KVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPE- 306
V+SVMCS+N++NG+P C+DP +LK IR EW L+GYIVSDC + V D Q++ + +
Sbjct: 281 DVSSVMCSFNKINGIPPCSDPRLLKGVIRDEWDLHGYIVSDCYGLEVIVDNQNYLNDSKV 340
Query: 307 EAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSS 366
+A A ++AGLDL+CG + +V G +S+ +++ AL N + MR+G FDG P+
Sbjct: 341 DAVAKTLQAGLDLECGHYYTDALNESVLTGKVSQYELDRALKNIYVLLMRVGYFDGIPA- 399
Query: 367 QPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVT 426
Y LG KD+C DH ELA EAARQGIVLLKN LPL + +A++GP+++ T
Sbjct: 400 --YESLGLKDICAADHIELAREAARQGIVLLKNDYEVLPLKP--GKKIALVGPHANATEV 455
Query: 427 MIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMG 486
MIGNYAG+ C Y +PL+ + GC D +C++D F A +A++ A+ TI+ +G
Sbjct: 456 MIGNYAGLPCKYVSPLEAFSAIGNVTYATGCLDASCSNDTYFSEAKEAAKSAEVTIIFVG 515
Query: 487 LDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAI 546
D SIEAE +DR LLPG Q EL+ +V+ S GP ILV++SG ID+ FAKN+PRI+AI
Sbjct: 516 TDLSIEAEFVDRVDFLLPGNQTELIKQVAEVSSGPVILVVLSGSNIDITFAKNNPRISAI 575
Query: 547 IWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRT 606
+W G+PG+ GG AIAD++FG NPGG+LP+TWY +Y+ LPM+ M++RP YPGRT
Sbjct: 576 LWVGFPGEQGGHAIADVVFGKYNPGGRLPVTWYEADYVDMLPMSSMSLRPVDELGYPGRT 635
Query: 607 YRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDG--RHGSINATISGK-----AIK 659
Y+F+ G VYPFG+GMSYT F +++A + + + L+ + ++ T K A+
Sbjct: 636 YKFFDGSTVYPFGYGMSYTKFSYSLATSKISIDIDLNKFQKCRTVAYTEDQKVPSCPAVL 695
Query: 660 VTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPH-KQLVAFEKVHVPAGAQ 718
+ C+ T+ +V V NVG DG+ L+V+S PP+G H KQ++ F+KV V AG
Sbjct: 696 LDDMSCDD-TIEFEVAVTNVGMVDGSEVLMVYSIPPSGIVGTHIKQVIGFQKVFVAAGDT 754
Query: 719 QRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVS 759
+RV +++ CK L +VD +G +P G H I +G +S S
Sbjct: 755 ERVKFSMNACKSLRIVDSTGYSLLPSGSHTIRVGDYSNSAS 795
>gi|212275712|ref|NP_001130324.1| uncharacterized protein LOC100191418 precursor [Zea mays]
gi|194688848|gb|ACF78508.1| unknown [Zea mays]
gi|413938927|gb|AFW73478.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 780
Score = 721 bits (1861), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 362/766 (47%), Positives = 490/766 (63%), Gaps = 19/766 (2%)
Query: 12 LILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLIS 71
L+ L + +A+ P+ C +PFC LPI +RV+DL+ R+++ EK+ L
Sbjct: 19 LLQLHGGAVVVASEPPYTCGA--GAPPNIPFCDAGLPIDRRVDDLVSRMTVAEKISQLGD 76
Query: 72 GAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAI 131
+ A+PRLG+ Y+WWSEALHG+SN G G G ATSFPQVI TA+SFN LW I
Sbjct: 77 QSPAIPRLGVPAYKWWSEALHGISNQGRGIHLDGPLRAATSFPQVILTAASFNPHLWYRI 136
Query: 132 GRVVSDEARAMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGL 190
G+V+ EARA+YN G A GLT+W+PN+N+FRDPRWGRGQETPGEDP ++GKYAA +VRG+
Sbjct: 137 GQVIGVEARAVYNNGQAEGLTFWAPNINVFRDPRWGRGQETPGEDPTMTGKYAAVFVRGV 196
Query: 191 QGS------DGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCV 244
QG + L+ +A CKHFTAYDL+NW GV R+ F+AKV+ QD+ DT++ PF+ CV
Sbjct: 197 QGYGLAGPVNSTGLEASACCKHFTAYDLENWKGVTRYVFDAKVTAQDLADTYNPPFKSCV 256
Query: 245 MEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTST 304
+G + +MCSYN+VNGVPTCAD N+L T R +W GYI SDCD+V + +D Q + T
Sbjct: 257 EDGHASGIMCSYNRVNGVPTCADYNLLSTTARQDWGFYGYITSDCDAVAIIHDAQGYAKT 316
Query: 305 PEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP 364
E+A AD ++AG+D++CG ++ H SA+Q+G ++E DIN AL N V+MRLG+F+G+P
Sbjct: 317 AEDAVADVLKAGMDVNCGSYVQDHGASALQQGKITEQDINRALHNLFAVRMRLGLFNGDP 376
Query: 365 SSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQG--PSLPLSHIRHRTVAVIGPNSD 422
YG +GP VCT +HQ+LALEAA+ GIVLLKN G +LPLS ++AVIG N++
Sbjct: 377 RRNLYGDIGPDQVCTQEHQDLALEAAQDGIVLLKNDGGAGALPLSKPNVASLAVIGFNAN 436
Query: 423 VTVTMIGNYAGIACGYTTPLQGIGRYAR-TIHQQGCKDVACADDQLFGAAIDASRQADAT 481
+ + GNY G C TPLQ + Y + T GC AC + A+ A+ AD+
Sbjct: 437 DAIRLRGNYFGPPCVTVTPLQVLQGYVKDTSFVAGCNSAAC-NVTTIPEAVQAASSADSV 495
Query: 482 ILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDP 541
+L MGLDQ E E +DR L LPG+QQ L+ V+ A+K P ILVL+ GGP+DV+FAK +P
Sbjct: 496 VLFMGLDQDQEREEVDRLDLTLPGQQQTLIESVANAAKKPVILVLLCGGPVDVSFAKTNP 555
Query: 542 RIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKR 601
+I AI+WAGYPG+AGG AIA +LFG NPGG+LP+TWYPQ++ T +PMT+M MR +
Sbjct: 556 KIGAILWAGYPGEAGGIAIAQVLFGEHNPGGRLPVTWYPQDF-TRVPMTDMRMRADPATG 614
Query: 602 YPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANA--PTVVAVPLDGRHGSINATISGKAIK 659
YPGRTYRFY+GP V+ FG+G+SY+ + H A PT L + S
Sbjct: 615 YPGRTYRFYRGPTVFNFGYGLSYSKYSHRFATKPPPTSNVAGLKAVEATAGGMASYDVEA 674
Query: 660 VTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA---GHWAPHKQLVAFEKVHVPAG 716
+ C+RL V V+N G DG H++LVF P G P QL+ F+ +H+ A
Sbjct: 675 IGSETCDRLKFPAVVRVQNHGPMDGKHSVLVFMRWPNATDGSGRPASQLIGFQSLHLRAT 734
Query: 717 AQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHA 762
V + CK+ S G + I G H + +G + +S A
Sbjct: 735 QTAHVEFEVSPCKHFSRATEDGRKVIDQGSHFVMVGEDEFEMSFMA 780
>gi|218191593|gb|EEC74020.1| hypothetical protein OsI_08964 [Oryza sativa Indica Group]
Length = 774
Score = 720 bits (1858), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 361/750 (48%), Positives = 486/750 (64%), Gaps = 22/750 (2%)
Query: 27 PFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEW 86
PF+C + FC LPI QR +DL+ RL+L+EK+ L + AV RLG+ Y+W
Sbjct: 29 PFSC----GAPSSAAFCNPRLPIEQRADDLVSRLTLEEKISQLGDQSPAVDRLGVPAYKW 84
Query: 87 WSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGG 146
WSEALHGVSN G G G ATSFPQVI TA+SFN LW IG+V+ EARA+YN G
Sbjct: 85 WSEALHGVSNAGRGIHLDGPLRAATSFPQVILTAASFNPHLWYRIGQVIGTEARAVYNNG 144
Query: 147 TA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGS------DGDRLK 199
A GLT+W+PN+N+FRDPRWGRGQETPGEDP ++GKYAA +VRG+QG + L+
Sbjct: 145 QAEGLTFWAPNINVFRDPRWGRGQETPGEDPTVTGKYAAVFVRGVQGYALAGAINSTDLE 204
Query: 200 VAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQV 259
+A CKHFTAYDL+NW GV R+ F+AKV+ QD+ DT++ PFR CV +G + +MCSYN+V
Sbjct: 205 ASACCKHFTAYDLENWKGVTRYAFDAKVTAQDLADTYNPPFRSCVEDGGASGIMCSYNRV 264
Query: 260 NGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDL 319
NGVPTCAD N+L +T RG+WR GYI SDCD+V + +D Q + T E+A AD ++AG+D+
Sbjct: 265 NGVPTCADYNLLSKTARGDWRFYGYITSDCDAVSIIHDVQGYAKTAEDAVADVLKAGMDV 324
Query: 320 DCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCT 379
+CG ++ H SA+Q+G ++E DIN AL N V+MRLG+F+G P YG++GP VCT
Sbjct: 325 NCGSYVQEHGLSAIQQGKITEQDINRALHNLFAVRMRLGLFNGNPKYNRYGNIGPDQVCT 384
Query: 380 PDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYT 439
+HQ LALEAA+ G+VLLKN +LPLS + ++AVIG N++ ++GNY G C
Sbjct: 385 QEHQNLALEAAQHGVVLLKNDANALPLSKSQVSSIAVIGHNANDATRLLGNYFGPPCISV 444
Query: 440 TPLQGIGRYAR-TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDR 498
TPLQ + Y + T GC AC + G A + D +L MGLDQ E E +DR
Sbjct: 445 TPLQVLQGYVKDTRFLAGCNSAACNVSSI-GEAAQLASSVDYVVLFMGLDQDQEREEVDR 503
Query: 499 AGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGT 558
L LPG Q+ L++ V+ A+K P ILVL+ GGP+DV FAK +P+I AI+WAGYPG+AGG
Sbjct: 504 LELSLPGMQENLINTVANAAKKPVILVLLCGGPVDVTFAKYNPKIGAILWAGYPGEAGGI 563
Query: 559 AIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPF 618
AIA +LFG NPGG+LP+TWYP+E+ T++PMT+M MR S YPGRTYRFY+G VY F
Sbjct: 564 AIAQVLFGEHNPGGRLPVTWYPKEF-TSVPMTDMRMRADPSTGYPGRTYRFYRGNTVYKF 622
Query: 619 GHGMSYTNFVHT-VANAPTVVAV-PLDGRHGSINA---TISGKAIKVTHAKCNRLTLGVQ 673
G+G+SY+ + H VAN + ++ +DG A T+S ++ C++L
Sbjct: 623 GYGLSYSKYSHHFVANGTKLPSLSSIDGLKAMATAAAGTVSYDVEEIGTETCDKLKFPAL 682
Query: 674 VDVKNVGSKDGAHTLLVFSTPPAGHW---APHKQLVAFEKVHVPAGAQQRVGINIHVCKY 730
V V+N G DG H +L+F P G P QL+ F+ +H+ + V + CK+
Sbjct: 683 VRVQNHGPMDGRHPVLLFLRWPNGAADGGRPASQLIGFQSLHLKSMQTVHVEFEVSPCKH 742
Query: 731 LSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
S G + I G H + +G + +S
Sbjct: 743 FSRATEDGKKVIDHGSHFMMVGDDEFEMSF 772
>gi|115448721|ref|NP_001048140.1| Os02g0752200 [Oryza sativa Japonica Group]
gi|46390122|dbj|BAD15557.1| putative beta-D-xylosidase [Oryza sativa Japonica Group]
gi|46390225|dbj|BAD15656.1| putative beta-D-xylosidase [Oryza sativa Japonica Group]
gi|113537671|dbj|BAF10054.1| Os02g0752200 [Oryza sativa Japonica Group]
gi|125583710|gb|EAZ24641.1| hypothetical protein OsJ_08409 [Oryza sativa Japonica Group]
Length = 780
Score = 719 bits (1855), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 361/750 (48%), Positives = 486/750 (64%), Gaps = 22/750 (2%)
Query: 27 PFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEW 86
PF+C + FC LPI QR +DL+ RL+L+EK+ L + AV RLG+ Y+W
Sbjct: 35 PFSC----GAPSSAAFCNPRLPIEQRADDLVSRLTLEEKISQLGDQSPAVDRLGVPAYKW 90
Query: 87 WSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGG 146
WSEALHGVSN G G G ATSFPQVI TA+SFN LW IG+V+ EARA+YN G
Sbjct: 91 WSEALHGVSNAGRGIHLDGPLRAATSFPQVILTAASFNPHLWYRIGQVIGTEARAVYNNG 150
Query: 147 TA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGS------DGDRLK 199
A GLT+W+PN+N+FRDPRWGRGQETPGEDP ++GKYAA +VRG+QG + L+
Sbjct: 151 QAEGLTFWAPNINVFRDPRWGRGQETPGEDPTVTGKYAAVFVRGVQGYALAGAINSTDLE 210
Query: 200 VAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQV 259
+A CKHFTAYDL+NW GV R+ F+AKV+ QD+ DT++ PFR CV +G + +MCSYN+V
Sbjct: 211 ASACCKHFTAYDLENWKGVTRYAFDAKVTAQDLADTYNPPFRSCVEDGGASGIMCSYNRV 270
Query: 260 NGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDL 319
NGVPTCAD N+L +T RG+WR GYI SDCD+V + +D Q + T E+A AD ++AG+D+
Sbjct: 271 NGVPTCADYNLLSKTARGDWRFYGYITSDCDAVSIIHDVQGYAKTAEDAVADVLKAGMDV 330
Query: 320 DCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCT 379
+CG ++ H SA+Q+G ++E DIN AL N V+MRLG+F+G P YG++GP VCT
Sbjct: 331 NCGSYVQEHGLSAIQQGKITEQDINRALHNLFAVRMRLGLFNGNPKYNRYGNIGPDQVCT 390
Query: 380 PDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYT 439
+HQ LALEAA+ G+VLLKN +LPLS + ++AVIG N++ ++GNY G C
Sbjct: 391 QEHQNLALEAAQHGVVLLKNDANALPLSKSQVSSIAVIGHNANDATRLLGNYFGPPCISV 450
Query: 440 TPLQGIGRYAR-TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDR 498
TPLQ + Y + T GC AC + G A + D +L MGLDQ E E +DR
Sbjct: 451 TPLQVLQGYVKDTRFLAGCNSAACNVSSI-GEAAQLASSVDYVVLFMGLDQDQEREEVDR 509
Query: 499 AGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGT 558
L LPG Q+ L++ V+ A+K P ILVL+ GGP+DV FAK +P+I AI+WAGYPG+AGG
Sbjct: 510 LELSLPGMQENLINTVANAAKKPVILVLLCGGPVDVTFAKYNPKIGAILWAGYPGEAGGI 569
Query: 559 AIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPF 618
AIA +LFG NPGG+LP+TWYP+E+ T++PMT+M MR S YPGRTYRFY+G VY F
Sbjct: 570 AIAQVLFGEHNPGGRLPVTWYPKEF-TSVPMTDMRMRADPSTGYPGRTYRFYRGNTVYKF 628
Query: 619 GHGMSYTNFVHT-VANAPTVVAV-PLDGRHGSINA---TISGKAIKVTHAKCNRLTLGVQ 673
G+G+SY+ + H VAN + ++ +DG A T+S ++ C++L
Sbjct: 629 GYGLSYSKYSHHFVANGTKLPSLSSIDGLKAMATAAAGTVSYDVEEIGPETCDKLKFPAL 688
Query: 674 VDVKNVGSKDGAHTLLVFSTPPAGHW---APHKQLVAFEKVHVPAGAQQRVGINIHVCKY 730
V V+N G DG H +L+F P G P QL+ F+ +H+ + V + CK+
Sbjct: 689 VRVQNHGPMDGRHPVLLFLRWPNGAADGGRPASQLIGFQSLHLKSMQTVHVEFEVSPCKH 748
Query: 731 LSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
S G + I G H + +G + +S
Sbjct: 749 FSRATEDGKKVIDHGSHFMMVGDDEFEMSF 778
>gi|242076578|ref|XP_002448225.1| hypothetical protein SORBIDRAFT_06g023450 [Sorghum bicolor]
gi|241939408|gb|EES12553.1| hypothetical protein SORBIDRAFT_06g023450 [Sorghum bicolor]
Length = 766
Score = 713 bits (1840), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 349/748 (46%), Positives = 489/748 (65%), Gaps = 22/748 (2%)
Query: 30 CDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSE 89
C + + PFC SL IP R L+ L+L EK+ L + A VPRLGI Y+WWSE
Sbjct: 24 CASPYSGSSAYPFCDASLSIPARARALVSLLTLDEKIAQLSNTAGGVPRLGIPPYQWWSE 83
Query: 90 ALHGVSNVGPGTKFG-GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA 148
+LHG+++ GPG F G AT+FPQVI + ++FN +LW A+ V+ EA M+N G A
Sbjct: 84 SLHGLADNGPGVNFSSGPVRAATTFPQVILSTAAFNRSLWRAVAEAVATEALGMHNAGQA 143
Query: 149 GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD--RLKVAASCKH 206
GLTYW+PN+NIFRDPRWGRGQET GEDP ++ Y+ YV+G QG G+ R++++A CKH
Sbjct: 144 GLTYWAPNINIFRDPRWGRGQETSGEDPAVAAAYSLEYVKGFQGEQGEEGRIRLSACCKH 203
Query: 207 FTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCA 266
+TAYD++ W G R+ FNAKV+ QD+EDT+ PF+ C+ E + + +MC+YNQVNGVP CA
Sbjct: 204 YTAYDMEKWEGFSRYTFNAKVNAQDLEDTYQPPFKTCIQEARASCLMCAYNQVNGVPMCA 263
Query: 267 DPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLG 326
+ ++L++T R EW GYI SDCD+V + ++ Q +T + E++ A ++AG+D++CG FL
Sbjct: 264 NKDLLQKT-RDEWGFQGYITSDCDAVAIIHENQTYTKSDEDSIAIVLKAGMDINCGSFLV 322
Query: 327 LHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELA 386
HT+SAV++G + E DI+ AL N +VQ+RLG+FD ++Q LGP +VCT +H+ELA
Sbjct: 323 RHTKSAVEKGKVQEQDIDRALFNLFSVQLRLGIFDKPNNNQWSTQLGPNNVCTKEHRELA 382
Query: 387 LEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIG 446
EA RQG VLLKN LPL R VA+IGP+++ M G+Y G+AC TT L+GI
Sbjct: 383 AEAVRQGAVLLKNDHSFLPLKRSEVRHVAIIGPSANDVYAMGGDYTGVACNPTTFLKGIQ 442
Query: 447 RYA-RTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPG 505
YA +T GCKDV+C +LFG AI A+++AD ++V GL+ + E E DR LLLPG
Sbjct: 443 AYATQTTFAAGCKDVSCNSTELFGEAIAAAKRADIVVVVAGLNLTEEREDFDRVSLLLPG 502
Query: 506 RQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILF 565
+Q L+ V+ +K P +LVL+ GGP+DV+FAK DPRIA+I+W GYPG+ GG + +ILF
Sbjct: 503 KQMSLIHAVASVAKKPLVLVLLGGGPVDVSFAKQDPRIASILWLGYPGEVGGQVLPEILF 562
Query: 566 GTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYT 625
G NPGGKL MTWYP+ + T +PMT+M MR S+ YPGRTYRFY G VVY FG+G+SY+
Sbjct: 563 GEYNPGGKLAMTWYPESF-TAIPMTDMNMRADPSRGYPGRTYRFYTGDVVYGFGYGLSYS 621
Query: 626 NFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTH-----------AKCNRLTLGVQV 674
+ +++ +AP + + S+ IS K + A C L V V
Sbjct: 622 KYSYSILSAPKKITM----SRSSVLDIISRKPSYIRRDGLDFVKTEDIASCEALAFSVHV 677
Query: 675 DVKNVGSKDGAHTLLVFSTPPAG-HWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSV 733
V N GS DG+H +L+F+ + P KQLV FE+VH AG+ V I++ CK++S
Sbjct: 678 AVSNHGSMDGSHAVLLFARSKSSVPGFPIKQLVGFERVHTAAGSASNVEISVDPCKHMSA 737
Query: 734 VDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
+ G R + LG+H + +G + + +
Sbjct: 738 ANPEGKRVLLLGDHVLTVGDEEFELFIE 765
>gi|168046596|ref|XP_001775759.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162672911|gb|EDQ59442.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 784
Score = 711 bits (1836), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 364/757 (48%), Positives = 491/757 (64%), Gaps = 26/757 (3%)
Query: 28 FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
+ACDP PFC S+ RV DLI RL++QEK++ L++ AA V RLGI Y+WW
Sbjct: 20 YACDPDGPADLLFPFCNTSISDDDRVEDLISRLTIQEKIEQLVNTAANVSRLGIPPYQWW 79
Query: 88 SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT 147
E LHGV+ + P FGG P ATSFP + S+N TLW IG+VVS E RAMYN G
Sbjct: 80 GEGLHGVA-ISPSVYFGGATPAATSFPLPCLSVCSYNRTLWNKIGQVVSTEGRAMYNQGR 138
Query: 148 AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD----------- 196
+GLTYWSPN+NI RDPRWGR QETPGEDP LS YA +V+GLQ D D
Sbjct: 139 SGLTYWSPNINIARDPRWGRTQETPGEDPKLSSGYAVHFVKGLQEGDYDQNQPQAVSRGP 198
Query: 197 -RLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCS 255
RLK++A CKHFTA+DLD W DR HF++KV++QD+EDT++ F+ CV EG+ +SVMCS
Sbjct: 199 RRLKISACCKHFTAHDLDRWKDYDRDHFDSKVTQQDLEDTYNPSFKSCVKEGQSSSVMCS 258
Query: 256 YNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRA 315
YN++NG+P C +L T+R +W +GYIVSDCD+V + +D ++ T E+A + + A
Sbjct: 259 YNRLNGIPMCTHYELLTLTVRNQWGFDGYIVSDCDAVALIHDYINYAPTSEDAVSYVMLA 318
Query: 316 GLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPK 375
G+DL+CG +H +A+ + L+ E I+ L N V+MRLGMFDG PS+ PYG LGP+
Sbjct: 319 GMDLNCGSTTLVHGLAALDKKLIWEGLIDMHLRNLFRVRMRLGMFDGNPSTLPYGSLGPE 378
Query: 376 DVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIA 435
D+CT D+Q LALEAARQ +VLLKN+ +LP +AVIG ++D T M+GNY G
Sbjct: 379 DMCTEDNQHLALEAARQSLVLLKNEKNALPWKKTHGLKLAVIGHHADATREMLGNYEGYP 438
Query: 436 CGYTTPLQGIGRYA-----RTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQS 490
C + +PLQG + R H++GC D AC D AA +A+ QADA +LV+G+ Q+
Sbjct: 439 CKFVSPLQGFAKVLSDHSPRISHERGCSDAACEDQFYIYAAKEAAAQADAVVLVLGISQA 498
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKG-PTILVLMSGGPIDVAFAKNDPRIAAIIWA 549
E E DR LLLPGRQ ELVS V AS G P +LVL+SG P+DV+FA +DPRI +IIWA
Sbjct: 499 QEKEGRDRDSLLLPGRQMELVSSVVEASAGRPVVLVLLSGSPLDVSFANDDPRIQSIIWA 558
Query: 550 GYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRF 609
GYPGQ+GG AIA+ +FG NPGG+L +WY + Y TN+ M+ M MRP+ S YPGRTYRF
Sbjct: 559 GYPGQSGGEAIAEAIFGLVNPGGRLAQSWYYENY-TNIDMSNMNMRPNASTGYPGRTYRF 617
Query: 610 YKGPVVYPFGHGMSYTNFVHTVANAPTVVAVP-LDGRHGSINATISGKAIKVTHAK---C 665
+ ++ FGHG+SY++F +T+ +AP + P L + S + + + H + C
Sbjct: 618 FTDTPLWEFGHGLSYSDFKYTMVSAPQSIMAPHLRYQLCSSDRAVMTSDLNCLHYEKEAC 677
Query: 666 NRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG--HWAPHKQLVAFEKVHVPAGAQQRVGI 723
+ V+V V N G G H++L+FS PP+ P KQLV+FE+VH+ AGA Q +
Sbjct: 678 KESSFHVRVWVINHGPLSGDHSVLLFSKPPSRGIDGIPLKQLVSFERVHLEAGAGQEILF 737
Query: 724 NIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
++ C+ L V G R + LGEH + +G +H +++
Sbjct: 738 KVNPCEDLGTVGDDGIRTVELGEHTLMVGMVQHVLTV 774
>gi|125534137|gb|EAY80685.1| hypothetical protein OsI_35867 [Oryza sativa Indica Group]
Length = 779
Score = 709 bits (1831), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 358/742 (48%), Positives = 483/742 (65%), Gaps = 28/742 (3%)
Query: 28 FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
F C P A + FC +LP QR DL+ RL+ EKV L A VPRLGI Y+WW
Sbjct: 37 FTCGPASAQ-KGFAFCNAALPAEQRAADLVARLTTAEKVGQLGDQAPGVPRLGIPVYKWW 95
Query: 88 SEALHGVSNVGPGTKFG-GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGG 146
SEALHG++ G G FG G ATSFPQVI TA++F+ LW IG+ + E RA YN G
Sbjct: 96 SEALHGLAISGKGIHFGNGPARTATSFPQVIHTAAAFDDGLWFRIGQAIGKEGRAFYNLG 155
Query: 147 TA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCK 205
A GL WSPNVNIFRDPRWGRGQETPGEDP + KY A++V+GLQGS L+ +A CK
Sbjct: 156 QAEGLAMWSPNVNIFRDPRWGRGQETPGEDPATASKYGAAFVKGLQGSSLTNLQTSACCK 215
Query: 206 HFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTC 265
H TAYD++ W GV R++FNAKV+ QD+ DT++ PFR CV++GK + +MC+Y +NGVP C
Sbjct: 216 HITAYDIEEWKGVSRYNFNAKVTPQDLADTYNPPFRSCVVDGKASCIMCAYTLINGVPAC 275
Query: 266 ADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFL 325
A ++L +T+RGEW+L+GY SDCD+V + + ++HFT T EEA A A++AGLD++CG ++
Sbjct: 276 ASSDLLTKTVRGEWKLDGYTASDCDAVAILHKSEHFTRTAEEAVAVALKAGLDINCGVYM 335
Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP-SSQPYGHLGPKDVCTPDHQE 384
+ SA+Q+G ++E D++ AL N ++MRLG FDG+P ++ YG LG DVCTP H+
Sbjct: 336 QQNAASALQQGKMTEKDVDKALKNLFAIRMRLGHFDGDPRGNKLYGRLGAADVCTPVHKA 395
Query: 385 LALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG 444
LALEAAR+G+VLLKN LPL + AVIG N++ + ++GNY G+ C TTP G
Sbjct: 396 LALEAARRGVVLLKNDARLLPLRAPTVSSAAVIGHNANDILALLGNYYGLPCETTTPFGG 455
Query: 445 IGRYARTIH-QQGCKDVAC---ADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAG 500
I +Y ++ GC AC A DQ A ++ +D LVMGL Q E E LDR
Sbjct: 456 IQKYVKSAKFLPGCSSAACDVAATDQ----ATALAKSSDYVFLVMGLSQKQEQEGLDRTS 511
Query: 501 LLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
LLLPG+QQ L++ V+ ASK P IL+L++GGP+D+ FA+ +P+I AI+WAGYPGQAGG AI
Sbjct: 512 LLLPGKQQALITAVATASKRPVILILLTGGPVDITFAQTNPKIGAILWAGYPGQAGGQAI 571
Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGH 620
AD+LFG NP GKLP+TWYP+E+ T MT+M MRP + YPGR+YRFYKG VY FG+
Sbjct: 572 ADVLFGEFNPSGKLPVTWYPEEF-TKFTMTDMRMRPDPATGYPGRSYRFYKGKTVYKFGY 630
Query: 621 GMSYTNF----VHTVANAPTVVAVPLDGRHGSINATISGKAI----KVTHAKCNRLTLGV 672
G+SY+ F V N+ + L G + T G A+ ++ +C RL V
Sbjct: 631 GLSYSKFACRIVSGAGNSSSYGKAALAGLRAAT--TPEGDAVYRVDEIGDDRCERLRFPV 688
Query: 673 QVDVKNVGSKDGAHTLLVF---STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCK 729
V+V+N G DG HT+L+F S+ G P +QL+ F H+ G ++++ + I C+
Sbjct: 689 MVEVQNHGPMDGKHTVLMFVRWSSTDGGR--PVRQLIGFRNQHLKVGEKKKLKMEISPCE 746
Query: 730 YLSVVDRSGTRRIPLGEHNIHI 751
+LS G + I G H + +
Sbjct: 747 HLSRARVDGEKVIDRGSHFLMV 768
>gi|62734691|gb|AAX96800.1| Glycosyl hydrolase family 3 C terminal domain, putative [Oryza
sativa Japonica Group]
gi|77549994|gb|ABA92791.1| beta-D-xylosidase, putative, expressed [Oryza sativa Japonica
Group]
Length = 853
Score = 709 bits (1830), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 357/742 (48%), Positives = 482/742 (64%), Gaps = 28/742 (3%)
Query: 28 FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
F C P A + FC +LP QR DL+ RL+ EKV L A VPRLGI Y+WW
Sbjct: 111 FTCGPASAQ-KGFAFCNAALPAEQRAADLVARLTTAEKVGQLGDQAPGVPRLGIPVYKWW 169
Query: 88 SEALHGVSNVGPGTKFG-GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGG 146
SEALHG++ G G FG G ATSFPQVI TA++F+ LW IG+ + E RA YN G
Sbjct: 170 SEALHGLAISGKGIHFGNGPARTATSFPQVIHTAAAFDDGLWFRIGQAIGKEGRAFYNLG 229
Query: 147 TA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCK 205
A GL WSPNVNIFRDPRWGRGQETPGEDP + KY A++V+GLQGS L+ +A CK
Sbjct: 230 QAEGLAMWSPNVNIFRDPRWGRGQETPGEDPATASKYGAAFVKGLQGSSLTNLQTSACCK 289
Query: 206 HFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTC 265
H TAYD++ W GV R++FNAKV+ QD+ DT++ PFR CV++GK + +MC+Y +NGVP C
Sbjct: 290 HITAYDIEEWKGVSRYNFNAKVTPQDLADTYNPPFRSCVVDGKASCIMCAYTLINGVPAC 349
Query: 266 ADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFL 325
A ++L +T+RGEW+L+GY SDCD+V + + ++HFT T EEA A A++AGLD++CG ++
Sbjct: 350 ASSDLLTKTVRGEWKLDGYTASDCDAVAILHKSEHFTRTAEEAVAVALKAGLDINCGVYM 409
Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP-SSQPYGHLGPKDVCTPDHQE 384
+ SA+Q+G ++E D++ AL N ++MRLG FDG+P ++ YG L DVCTP H+
Sbjct: 410 QQNAASALQQGKMTEKDVDKALKNLFAIRMRLGHFDGDPRGNKLYGRLSAADVCTPVHKA 469
Query: 385 LALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG 444
LALEAAR+G+VLLKN LPL + AVIG N++ + ++GNY G+ C TTP G
Sbjct: 470 LALEAARRGVVLLKNDARLLPLRAPTVASAAVIGHNANDILALLGNYYGLPCETTTPFGG 529
Query: 445 IGRYARTIH-QQGCKDVAC---ADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAG 500
I +Y ++ GC AC A DQ A ++ +D LVMGL Q E E LDR
Sbjct: 530 IQKYVKSAKFLPGCSSAACDVAATDQ----ATALAKSSDYVFLVMGLSQKQEQEGLDRTS 585
Query: 501 LLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
LLLPG+QQ L++ V+ ASK P IL+L++GGP+D+ FA+ +P+I AI+WAGYPGQAGG AI
Sbjct: 586 LLLPGKQQALITAVATASKRPVILILLTGGPVDITFAQTNPKIGAILWAGYPGQAGGQAI 645
Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGH 620
AD+LFG NP GKLP+TWYP+E+ T MT+M MRP + YPGR+YRFYKG VY FG+
Sbjct: 646 ADVLFGEFNPSGKLPVTWYPEEF-TKFTMTDMRMRPDPATGYPGRSYRFYKGKTVYKFGY 704
Query: 621 GMSYTNF----VHTVANAPTVVAVPLDGRHGSINATISGKAI----KVTHAKCNRLTLGV 672
G+SY+ F V N+ + L G + T G A+ ++ +C RL V
Sbjct: 705 GLSYSKFACRIVSGAGNSSSYGKAALAGLRAAT--TPEGDAVYRVDEIGDDRCERLRFPV 762
Query: 673 QVDVKNVGSKDGAHTLLVF---STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCK 729
V+V+N G DG HT+L+F S+ G P +QL+ F H+ G ++++ + I C+
Sbjct: 763 MVEVQNHGPMDGKHTVLMFVRWSSTDGGR--PVRQLIGFRNQHLKVGEKKKLKMEISPCE 820
Query: 730 YLSVVDRSGTRRIPLGEHNIHI 751
+LS G + I G H + +
Sbjct: 821 HLSRARVDGEKVIDRGSHFLMV 842
>gi|253761872|ref|XP_002489310.1| hypothetical protein SORBIDRAFT_0010s010920 [Sorghum bicolor]
gi|241946958|gb|EES20103.1| hypothetical protein SORBIDRAFT_0010s010920 [Sorghum bicolor]
Length = 772
Score = 709 bits (1830), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 357/756 (47%), Positives = 485/756 (64%), Gaps = 24/756 (3%)
Query: 22 LAAREPFACDPKDA-TTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLG 80
LA PF+C P A + L FC V+L QR DL+ RL+ EK+ L A VPRLG
Sbjct: 22 LAGDPPFSCGPTSAEASEGLAFCDVTLSPAQRAADLVSRLTPAEKIAQLGDQATGVPRLG 81
Query: 81 IKGYEWWSEALHGVSNVGPGTKFG--GDFPGATSFPQVITTASSFNATLWEAIGRVVSDE 138
+ GY+WW+EALHG++ G G F G ATSFPQV+ TA++F+ LW IG+ + E
Sbjct: 82 VPGYKWWNEALHGLATSGKGLHFDVVGGVRAATSFPQVLLTAAAFDDDLWFRIGQAIGRE 141
Query: 139 ARAMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGD 196
ARA++N G A GLT WSPNVNIFRDPRWGRGQETPGEDP ++ +YA ++VRG+QG S
Sbjct: 142 ARALFNVGQAEGLTIWSPNVNIFRDPRWGRGQETPGEDPAVASRYAVAFVRGIQGNSSSS 201
Query: 197 RLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSY 256
L+ +A CKH TAYDL++WNGV R+ F A+V+ QD+EDTF+ PFR CV+EGK + +MC+Y
Sbjct: 202 LLQTSACCKHATAYDLEDWNGVARYSFVARVTAQDLEDTFNPPFRSCVVEGKASCIMCAY 261
Query: 257 NQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAG 316
+NGVP CA+ ++L T+RG+W L+GY+ SDCD+V + D Q + TPE+A A +++AG
Sbjct: 262 TAINGVPACANTDLLTGTVRGDWGLDGYVASDCDAVAIMRDAQRYAPTPEDAVAVSLKAG 321
Query: 317 LDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKD 376
LD+DCG ++ H +A+Q+G L+E+DI+ ALVN V+MRLG FDG+P YG L D
Sbjct: 322 LDIDCGSYIQQHATAAIQQGKLTELDIDKALVNLFAVRMRLGHFDGDPRKNMYGALSAAD 381
Query: 377 VCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIAC 436
+CTP+H+ LALEAA+ GIVLLKN G LPL + AVIGPNS+ + +I NY G C
Sbjct: 382 ICTPEHRSLALEAAQDGIVLLKNDGGILPLDRSTVTSAAVIGPNSNDGMALIANYFGPPC 441
Query: 437 GYTTPLQGIGRYARTIH-QQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEA 495
TTPLQG+ Y + GC AC D + A+ S D L MGL Q E+E
Sbjct: 442 ESTTPLQGLQSYVNNVRFLAGCSSAAC-DVAVTDQAVVLSGSEDYVFLFMGLSQQQESEG 500
Query: 496 LDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQA 555
DR LLLPG QQ L++ V+ ASK P ILVL+SGGP+D+ FA+++P+I AI+WAGYPGQA
Sbjct: 501 KDRTSLLLPGMQQSLITAVADASKRPVILVLLSGGPVDITFAQSNPKIGAILWAGYPGQA 560
Query: 556 GGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVV 615
GG AIA +LFG NP G+LPMTWYP+++ T +PMT+M MR + YPGR+YRFY+G V
Sbjct: 561 GGLAIAKVLFGDHNPSGRLPMTWYPEDF-TKVPMTDMRMRADPTSGYPGRSYRFYQGNAV 619
Query: 616 YPFGHGMSYTNFVHTV---ANAPTVVAVPLDGRHGSINATISGKAIKVTHAK------CN 666
Y FG+G+SY+ F + + P + + L G + T++ + + H C
Sbjct: 620 YKFGYGLSYSTFSSRLLYGTSMPALSSTVLAG----LRETVTEEGDRSYHIDDIGTDGCE 675
Query: 667 RLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINI 725
+L V+V+N G DG H+ L+F P P QL+ F H+ AG + +I
Sbjct: 676 QLKFPAMVEVQNHGPMDGKHSALMFLRWPNTNGGRPASQLIGFMSQHLKAGETANLRFDI 735
Query: 726 HVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
C++ S V G + I +G H + + H++ +
Sbjct: 736 SPCEHFSRVRADGMKVIDIGSHFLTV--DNHAIEIR 769
>gi|62701898|gb|AAX92971.1| beta-D-xylosidase [Oryza sativa Japonica Group]
gi|62733926|gb|AAX96035.1| beta-D-xylosidase [Oryza sativa Japonica Group]
gi|77550045|gb|ABA92842.1| Glycosyl hydrolase family 3 C terminal domain containing protein,
expressed [Oryza sativa Japonica Group]
gi|125576900|gb|EAZ18122.1| hypothetical protein OsJ_33667 [Oryza sativa Japonica Group]
Length = 771
Score = 709 bits (1829), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 363/741 (48%), Positives = 479/741 (64%), Gaps = 22/741 (2%)
Query: 27 PFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEW 86
P++C P+ ++ FC LP +R DL+ RL+ EKV L A VPRLG+ Y+W
Sbjct: 26 PYSCGPRSPSS-GYAFCDARLPPARRAADLVSRLTAAEKVAQLGDEAGGVPRLGVPPYKW 84
Query: 87 WSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGG 146
WSE LHG+S G G F G TSFPQV+ TA++F+ LW IG+ + EARA+YN G
Sbjct: 85 WSEGLHGLSYWGHGMHFNGAVTAITSFPQVLLTAAAFDDRLWFRIGQAIGTEARALYNLG 144
Query: 147 TA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCK 205
A GLT WSPNVNI+RDPRWGRGQETPGEDP + KYA ++V+GLQGS L+ +A CK
Sbjct: 145 QAEGLTIWSPNVNIYRDPRWGRGQETPGEDPTTASKYAVAFVKGLQGSTPGTLQTSACCK 204
Query: 206 HFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTC 265
H TAYDL+ WNGV R++FNAKV+ QD+ DTF+ PF+ CV++ K + VMC+Y +NGVP C
Sbjct: 205 HATAYDLEEWNGVARYNFNAKVTAQDLADTFNPPFKSCVVDAKASCVMCAYTDINGVPAC 264
Query: 266 ADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFL 325
A ++L +T RG+W L+GY+ SDCD+V + D Q + TPE+ A AI+AGLDL+CG +
Sbjct: 265 ASSDLLSKTFRGQWGLDGYVSSDCDAVALLRDAQRYAPTPEDTVAVAIKAGLDLNCGNYT 324
Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP-SSQPYGHLGPKDVCTPDHQE 384
+H +A+Q+G + E D++ AL N V+MRLG FDG+P S+ YGHLG DVCT H++
Sbjct: 325 QVHGMAALQQGKMRESDVDRALTNLFAVRMRLGHFDGDPRSNAAYGHLGAADVCTQAHRD 384
Query: 385 LALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG 444
LALEAA+ GIVLLKN +LPL R+ AVIGPN++ + GNY G C TTPLQG
Sbjct: 385 LALEAAQDGIVLLKNDAGALPLDRATVRSAAVIGPNANDPAALNGNYFGPPCETTTPLQG 444
Query: 445 IGRYARTIH-QQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLL 503
+ RY ++ GC AC AA AS +D I+ MGL Q E E LDR LLL
Sbjct: 445 VQRYISSVRFLAGCDSPACGFAATGQAAALAS-SSDQVIMFMGLSQDQEKEGLDRTSLLL 503
Query: 504 PGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADI 563
PG+QQ L++ V+ A++ P ILVL++GGP+DV FAKN+P+I AI+WAGYPGQAGG AIA +
Sbjct: 504 PGKQQSLITAVASAARRPVILVLLTGGPVDVTFAKNNPKIGAILWAGYPGQAGGLAIAKV 563
Query: 564 LFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMS 623
LFG NP G+LP+TWYP+E+ T +PMT+M MR + YPGR+YRFY+G VY FG+G+S
Sbjct: 564 LFGDHNPSGRLPVTWYPEEF-TRIPMTDMRMRADPATGYPGRSYRFYQGNPVYKFGYGLS 622
Query: 624 YTNFVHTVANAPTVVAVPLDGRHGSINATI---SGKAIKVTHAK------CNRLTLGVQV 674
Y+ F + A A P + I +G + H + C RL V
Sbjct: 623 YSKFSRRLVAA----AKPRRPNRNLLAGVIPKPAGDGGESYHVEEIGEEGCERLKFPATV 678
Query: 675 DVKNVGSKDGAHTLLVFSTPP---AGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYL 731
+V N G DG H++LVF P AG P +QLV F HV AG + R+ + I+ C++L
Sbjct: 679 EVHNHGPMDGKHSVLVFVRWPNATAGASRPARQLVGFSSQHVRAGEKARLTMEINPCEHL 738
Query: 732 SVVDRSGTRRIPLGEHNIHIG 752
S GT+ I G H + +G
Sbjct: 739 SRAREDGTKVIDRGSHFLKVG 759
>gi|297736788|emb|CBI25989.3| unnamed protein product [Vitis vinifera]
Length = 746
Score = 708 bits (1827), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 356/767 (46%), Positives = 485/767 (63%), Gaps = 71/767 (9%)
Query: 5 IAFFFLGLILLSASSSG-----LAAREP------FACDPKDATT-----RTLPFCQVSLP 48
IAFF + + S G +AAR P + CD + +C S P
Sbjct: 17 IAFFAVCSAIKSPLKDGPAAAPMAARGPIDGNYTYVCDESRFAALGLDMKDFHYCDSSSP 76
Query: 49 IPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFP 108
R DL+ R++L EKV A+ V R+G+ Y WWSEALHGVSN G F P
Sbjct: 77 YEVRAKDLVDRMTLSEKVMQTGDQASGVERIGLPKYNWWSEALHGVSNFGRCVFFDEVVP 136
Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
GATSFP VI +A+SFN +LW+ +G+ VS EARAMYN G AGLT+WSPN+N+ RDPRWGR
Sbjct: 137 GATSFPTVILSAASFNQSLWKTLGQAVSTEARAMYNSGNAGLTFWSPNINVVRDPRWGRI 196
Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVS 228
ETPGEDP L G YA +Y H+ AYDLDNW G DR HF+A+VS
Sbjct: 197 LETPGEDPHLVGLYAVNY-------------------HYAAYDLDNWKGADRVHFDARVS 237
Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
QD+ +TF +PF MCV EG V+SVMCSYN++NG+P+CAD +LK+TIRGEW L+GYIVSD
Sbjct: 238 VQDMAETFVLPFEMCVKEGDVSSVMCSYNKINGIPSCADSRLLKQTIRGEWDLHGYIVSD 297
Query: 289 CDSVGVY-YDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
CDSV V D + S+ ++AA A+ AG++LDCG F AV +G ++ D++++L
Sbjct: 298 CDSVEVMAVDQKWLDSSFSDSAAQALNAGMNLDCGTFNNRSLTEAVNQGKANQADLDHSL 357
Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
+ MR+G FDG P+ + LG D+C+ +H ELA EAARQGIVLLKN +LPL
Sbjct: 358 RYLYVLLMRVGFFDGIPA---FASLGKDDICSAEHIELAREAARQGIVLLKNDNATLPLK 414
Query: 408 HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQL 467
+++ +A++GP+++ T MIGNYAGI C Y +PL +++GC DV C ++
Sbjct: 415 SVKN--IALVGPHANATDAMIGNYAGIPCYYVSPLDAFSSMGEVRYEKGCADVQCLNETY 472
Query: 468 FGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLM 527
A++A+++ADATI+ G D SIEAEALDR LLLPG Q +L+++V+ S GP +LV+M
Sbjct: 473 IFNAMEAAKRADATIIFAGTDLSIEAEALDRVDLLLPGYQTQLINQVADLSTGPVVLVIM 532
Query: 528 SGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL 587
SGG +D++FA+++P+IAAI+WAGYPG+ GG AIAD++ G NPGG+LP+TWY +Y+ L
Sbjct: 533 SGGGVDISFARDNPKIAAILWAGYPGEQGGNAIADVILGKYNPGGRLPITWYEADYVDML 592
Query: 588 PMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHG 647
PMT MA+RP S YPGRTY+F+ G VYPFG+GMSYTNF ++++
Sbjct: 593 PMTSMALRPVDSLGYPGRTYKFFNGSTVYPFGYGMSYTNFSYSLS--------------- 637
Query: 648 SINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPH-KQLV 706
T C ++ +V VKNVG DG+ ++V+S+PP G H K++V
Sbjct: 638 -------------TSQSCKE-SIEFEVAVKNVGRMDGSEVVVVYSSPPLGIAGTHIKKVV 683
Query: 707 AFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
FE+V V G ++V +++VCK L +VD +G +P G H I +GG
Sbjct: 684 GFERVFVKVGGTEKVKFSMNVCKSLGIVDSTGYALLPSGSHTIKVGG 730
>gi|115485163|ref|NP_001067725.1| Os11g0297300 [Oryza sativa Japonica Group]
gi|113644947|dbj|BAF28088.1| Os11g0297300 [Oryza sativa Japonica Group]
Length = 779
Score = 707 bits (1825), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 357/742 (48%), Positives = 482/742 (64%), Gaps = 28/742 (3%)
Query: 28 FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
F C P A + FC +LP QR DL+ RL+ EKV L A VPRLGI Y+WW
Sbjct: 37 FTCGPASAQ-KGFAFCNAALPAEQRAADLVARLTTAEKVGQLGDQAPGVPRLGIPVYKWW 95
Query: 88 SEALHGVSNVGPGTKFG-GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGG 146
SEALHG++ G G FG G ATSFPQVI TA++F+ LW IG+ + E RA YN G
Sbjct: 96 SEALHGLAISGKGIHFGNGPARTATSFPQVIHTAAAFDDGLWFRIGQAIGKEGRAFYNLG 155
Query: 147 TA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCK 205
A GL WSPNVNIFRDPRWGRGQETPGEDP + KY A++V+GLQGS L+ +A CK
Sbjct: 156 QAEGLAMWSPNVNIFRDPRWGRGQETPGEDPATASKYGAAFVKGLQGSSLTNLQTSACCK 215
Query: 206 HFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTC 265
H TAYD++ W GV R++FNAKV+ QD+ DT++ PFR CV++GK + +MC+Y +NGVP C
Sbjct: 216 HITAYDIEEWKGVSRYNFNAKVTPQDLADTYNPPFRSCVVDGKASCIMCAYTLINGVPAC 275
Query: 266 ADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFL 325
A ++L +T+RGEW+L+GY SDCD+V + + ++HFT T EEA A A++AGLD++CG ++
Sbjct: 276 ASSDLLTKTVRGEWKLDGYTASDCDAVAILHKSEHFTRTAEEAVAVALKAGLDINCGVYM 335
Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP-SSQPYGHLGPKDVCTPDHQE 384
+ SA+Q+G ++E D++ AL N ++MRLG FDG+P ++ YG L DVCTP H+
Sbjct: 336 QQNAASALQQGKMTEKDVDKALKNLFAIRMRLGHFDGDPRGNKLYGRLSAADVCTPVHKA 395
Query: 385 LALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG 444
LALEAAR+G+VLLKN LPL + AVIG N++ + ++GNY G+ C TTP G
Sbjct: 396 LALEAARRGVVLLKNDARLLPLRAPTVASAAVIGHNANDILALLGNYYGLPCETTTPFGG 455
Query: 445 IGRYARTIH-QQGCKDVAC---ADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAG 500
I +Y ++ GC AC A DQ A ++ +D LVMGL Q E E LDR
Sbjct: 456 IQKYVKSAKFLPGCSSAACDVAATDQ----ATALAKSSDYVFLVMGLSQKQEQEGLDRTS 511
Query: 501 LLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
LLLPG+QQ L++ V+ ASK P IL+L++GGP+D+ FA+ +P+I AI+WAGYPGQAGG AI
Sbjct: 512 LLLPGKQQALITAVATASKRPVILILLTGGPVDITFAQTNPKIGAILWAGYPGQAGGQAI 571
Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGH 620
AD+LFG NP GKLP+TWYP+E+ T MT+M MRP + YPGR+YRFYKG VY FG+
Sbjct: 572 ADVLFGEFNPSGKLPVTWYPEEF-TKFTMTDMRMRPDPATGYPGRSYRFYKGKTVYKFGY 630
Query: 621 GMSYTNF----VHTVANAPTVVAVPLDGRHGSINATISGKAI----KVTHAKCNRLTLGV 672
G+SY+ F V N+ + L G + T G A+ ++ +C RL V
Sbjct: 631 GLSYSKFACRIVSGAGNSSSYGKAALAGLRAAT--TPEGDAVYRVDEIGDDRCERLRFPV 688
Query: 673 QVDVKNVGSKDGAHTLLVF---STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCK 729
V+V+N G DG HT+L+F S+ G P +QL+ F H+ G ++++ + I C+
Sbjct: 689 MVEVQNHGPMDGKHTVLMFVRWSSTDGGR--PVRQLIGFRNQHLKVGEKKKLKMEISPCE 746
Query: 730 YLSVVDRSGTRRIPLGEHNIHI 751
+LS G + I G H + +
Sbjct: 747 HLSRARVDGEKVIDRGSHFLMV 768
>gi|357152329|ref|XP_003576084.1| PREDICTED: probable beta-D-xylosidase 7-like [Brachypodium
distachyon]
Length = 779
Score = 706 bits (1823), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 352/748 (47%), Positives = 480/748 (64%), Gaps = 15/748 (2%)
Query: 27 PFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEW 86
PF+C P AT + FC +LP+ +R DL+ RL+L EKV L A AVPRLG+ Y+W
Sbjct: 35 PFSCGPGSAT-QGYAFCDKALPVERRAADLVSRLTLAEKVSQLGDEADAVPRLGVPAYKW 93
Query: 87 WSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGG 146
WSE LHG+S G G F G TSFPQV+ TA+SF+ +W IG+ + EARA+YN G
Sbjct: 94 WSEGLHGLSFWGHGMHFDGAVRAITSFPQVLLTAASFDQDIWYRIGQAIGTEARALYNLG 153
Query: 147 TA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCK 205
A GLT WSPNVNI+RDPRWGRGQETPGEDP + KYA ++V+GLQG+ L+ +A CK
Sbjct: 154 QAQGLTIWSPNVNIYRDPRWGRGQETPGEDPTTASKYAVAFVKGLQGTSATTLQTSACCK 213
Query: 206 HFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTC 265
H TAYDL++WNGV R++FNAKV+ QD+ DTF+ PF+ CV EGK VMC+Y +NGVP C
Sbjct: 214 HATAYDLEDWNGVVRYNFNAKVTLQDLADTFNPPFKSCVEEGKATCVMCAYTNINGVPAC 273
Query: 266 ADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFL 325
A +++ +T +G+W LNGY+ SDCD+V + D Q + +TPE+ A A++AGLDL+CG +
Sbjct: 274 ASSDLITKTFKGDWGLNGYVSSDCDAVALLRDAQRYRATPEDTVAVALKAGLDLNCGNYT 333
Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP-SSQPYGHLGPKDVCTPDHQE 384
+H SA+Q+G ++E D++NAL N V+MRLG FDG+P +S YG LG DVC+P H+
Sbjct: 334 QVHGMSALQQGKMTEQDVDNALKNLFAVRMRLGHFDGDPRTSALYGSLGAADVCSPAHKN 393
Query: 385 LALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG 444
LALEAA+ GIVLLKN LPL + A IG N++ + GNY G C TTPLQG
Sbjct: 394 LALEAAQSGIVLLKNDAGILPLDPSAVASAAAIGHNANDPAALNGNYFGPPCETTTPLQG 453
Query: 445 IGRYARTIH-QQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLL 503
+ Y + + GC AC G A+ + +D IL MGL Q E E +DR LLL
Sbjct: 454 LQGYVKNVKFLAGCDSAACGFAAT-GQAVTLASSSDYVILFMGLSQKEEQEGIDRTSLLL 512
Query: 504 PGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADI 563
PG+QQ L++ V+ ASK P ILVL++GG +D+ FAK++P+I AI+WAGYPGQAGG AIA +
Sbjct: 513 PGKQQNLITAVASASKRPVILVLLTGGSVDITFAKSNPKIGAILWAGYPGQAGGLAIARV 572
Query: 564 LFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMS 623
LFG NP G+LP+TWYP+E+ T +PMT+M MR + YPGR+YRFY+G VY FG G+S
Sbjct: 573 LFGDHNPSGRLPVTWYPEEF-TKVPMTDMRMRADPATGYPGRSYRFYQGKTVYKFGDGLS 631
Query: 624 YTNFVHTVANAPTVVAVPLDGRHGSINA-TISGKAIKVTHAK------CNRLTLGVQVDV 676
Y+ F + ++ VP + A T + + H + C++L V+V
Sbjct: 632 YSKFSRQLVSSTNTHQVPNTNLLTGLTARTATDGGMSYYHVEEIGVEGCDKLKFPAVVEV 691
Query: 677 KNVGSKDGAHTLLVFSTPP--AGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVV 734
+N G DG H++++F P G P QLV F H+ AG + + ++ C++ +
Sbjct: 692 QNHGPMDGKHSVMMFLRWPNSTGTGRPVSQLVGFRSQHLKAGEKASLTFDVSPCEHFARA 751
Query: 735 DRSGTRRIPLGEHNIHIGGTKHSVSLHA 762
G + I G H + +G + +S H+
Sbjct: 752 REDGKKVIDRGSHFLVVGKDEREISFHS 779
>gi|125534112|gb|EAY80660.1| hypothetical protein OsI_35838 [Oryza sativa Indica Group]
Length = 771
Score = 706 bits (1821), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 362/741 (48%), Positives = 478/741 (64%), Gaps = 22/741 (2%)
Query: 27 PFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEW 86
P++C P+ + + FC LP +R DL+ RL+ EKV L A V RLG+ Y+W
Sbjct: 26 PYSCGPR-SPSLGYAFCDARLPPARRAADLVSRLTAAEKVAQLGDEAGGVARLGVPPYKW 84
Query: 87 WSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGG 146
WSE LHG+S G G F G TSFPQV+ TA++F+ LW IG+ + EARA+YN G
Sbjct: 85 WSEGLHGLSYWGHGMHFNGAVTAITSFPQVLLTAAAFDDRLWFRIGQAIGTEARALYNLG 144
Query: 147 TA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCK 205
A GLT WSPNVNI+RDPRWGRGQETPGEDP + KYA ++V+GLQGS L+ +A CK
Sbjct: 145 QAEGLTIWSPNVNIYRDPRWGRGQETPGEDPTTASKYAVAFVKGLQGSTPGTLQTSACCK 204
Query: 206 HFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTC 265
H TAYDL+ WNGV R++FNAKV+ QD+ DTF+ PF+ CV++ K + VMC+Y +NGVP C
Sbjct: 205 HATAYDLEEWNGVARYNFNAKVTAQDLADTFNPPFKSCVVDAKASCVMCAYTDINGVPAC 264
Query: 266 ADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFL 325
A ++L +T RG+W L+GY+ SDCD+V + D Q + TPE+ A AI+AGLDL+CG +
Sbjct: 265 ASSDLLSKTFRGQWGLDGYVSSDCDAVALLRDAQRYAPTPEDTVAVAIKAGLDLNCGNYT 324
Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP-SSQPYGHLGPKDVCTPDHQE 384
+H +A+Q+G + E D++ AL N V+MRLG FDG+P S+ YGHLG DVCT H++
Sbjct: 325 QVHGMAALQQGKMRESDVDRALTNLFAVRMRLGHFDGDPRSNAAYGHLGAADVCTQAHRD 384
Query: 385 LALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG 444
LALEAA+ GIVLLKN +LPL R+ AVIGPN++ + GNY G C TTPLQG
Sbjct: 385 LALEAAQNGIVLLKNDAGALPLDRATVRSAAVIGPNANDPAALNGNYFGPPCETTTPLQG 444
Query: 445 IGRYARTIH-QQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLL 503
+ RY ++ GC AC AA AS +D I+ MGL Q E E LDR LLL
Sbjct: 445 VQRYISSVRFLAGCDSPACGFAATGQAAALAS-SSDQVIMFMGLSQDQEKEGLDRTSLLL 503
Query: 504 PGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADI 563
PG+QQ L++ V+ A++ P ILVL++GGP+DV FAKN+P+I AI+WAGYPGQAGG AIA +
Sbjct: 504 PGKQQSLITAVASAARRPVILVLLTGGPVDVTFAKNNPKIGAILWAGYPGQAGGLAIAKV 563
Query: 564 LFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMS 623
LFG NP G+LP+TWYP+E+ T +PMT+M MR + YPGR+YRFY+G VY FG+G+S
Sbjct: 564 LFGDHNPSGRLPVTWYPEEF-TRIPMTDMRMRADPATGYPGRSYRFYQGNPVYKFGYGLS 622
Query: 624 YTNFVHTVANAPTVVAVPLDGRHGSINATI---SGKAIKVTHAK------CNRLTLGVQV 674
Y+ F + A A P + I +G + H + C RL V
Sbjct: 623 YSKFTRRLVAA----AKPRRPNRNLLAGVIPKPAGDGGESYHVEEIGEEGCERLKFPATV 678
Query: 675 DVKNVGSKDGAHTLLVFSTPP---AGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYL 731
+V N G DG H++LVF P AG P +QLV F HV AG + R+ + I+ C++L
Sbjct: 679 EVHNHGPMDGKHSVLVFVQWPNATAGASRPARQLVGFSSQHVRAGEKARLTMEINPCEHL 738
Query: 732 SVVDRSGTRRIPLGEHNIHIG 752
S GT+ I G H + +G
Sbjct: 739 SRARDDGTKVIDRGSHFLKVG 759
>gi|357164885|ref|XP_003580200.1| PREDICTED: probable beta-D-xylosidase 6-like [Brachypodium
distachyon]
Length = 771
Score = 706 bits (1821), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 342/739 (46%), Positives = 488/739 (66%), Gaps = 22/739 (2%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
PFC SLP P R L+ L+L EK+ L + AA VPRLGI YEWWSE+LHG+++ GPG
Sbjct: 38 PFCDASLPFPVRARALVSLLTLDEKIAQLSNTAAGVPRLGIPPYEWWSESLHGLADNGPG 97
Query: 101 TKFG-GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNI 159
F G AT FPQVI +A+SFN +LW A+ V+ EARAM+N G AGLTYW+PN+N+
Sbjct: 98 VNFSSGPVGAATIFPQVILSAASFNRSLWRAVAEAVAVEARAMHNAGQAGLTYWAPNINV 157
Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD----RLKVAASCKHFTAYDLDNW 215
FRDPRWGRGQETPGEDP + Y+ YV+G QG GD R+ ++A CKH+ AYDL+ W
Sbjct: 158 FRDPRWGRGQETPGEDPAVIAAYSVEYVKGFQGEYGDGKEGRMMLSACCKHYVAYDLEKW 217
Query: 216 NGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTI 275
R+ FNAKV++QD EDT++ PF+ C+ EG+ + +MCSYNQVNGVP CA ++L++ +
Sbjct: 218 GNFTRYTFNAKVNEQDFEDTYEPPFKSCIQEGRASCLMCSYNQVNGVPACARKDLLQK-V 276
Query: 276 RGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQR 335
R EW GY+VSDCD+VG+ Y Q++T++ E++ A ++AG+D++CG FL HT+SA+Q+
Sbjct: 277 RDEWGFQGYVVSDCDAVGIIYGYQNYTNSDEDSIAIVLKAGMDINCGSFLIRHTKSAIQK 336
Query: 336 GLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIV 395
G ++E DIN+AL N +VQ+RLG+FD +Q + LGP ++CT +H+ELA EAARQG V
Sbjct: 337 GKITEEDINHALFNLFSVQLRLGLFDKTSGNQWFTQLGPSNICTKEHRELAAEAARQGTV 396
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA-RTIHQ 454
LLKN LPL +A+IGP ++ M G+Y G+ C TT L+G+ +T
Sbjct: 397 LLKNDNSFLPLKRSEVSHIAIIGPVANDAYIMGGDYTGVPCNPTTFLKGMQAVVPQTTIA 456
Query: 455 QGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKV 514
GCKD++C FG AI+ +++AD +L+ GL+ + E E LDR LLLPG+Q +L++ +
Sbjct: 457 AGCKDISCNSTDGFGEAIEVAKRADIVVLIAGLNLTQETEDLDRVSLLLPGKQMDLINSI 516
Query: 515 SMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKL 574
+ +K P +LV+ GGP+DV+FAK D RIA+++W GYPG+ GG + +ILFG NPGGKL
Sbjct: 517 ASVTKKPLVLVITGGGPVDVSFAKQDKRIASVLWIGYPGEVGGQVLPEILFGEYNPGGKL 576
Query: 575 PMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANA 634
P+TWYP+ + T +PM +M MR S+ YPGRTYRFY G VVY FG+G+SY+ + + + A
Sbjct: 577 PITWYPESF-TAVPMNDMNMRADPSRSYPGRTYRFYTGDVVYGFGYGLSYSKYSYNIIQA 635
Query: 635 PTVVAVPLDGRHGSINATISGKA---------IKVTH-AKCNRLTLGVQVDVKNVGSKDG 684
PT +++ R +++ + +A ++V A C + V + V N G+ DG
Sbjct: 636 PTKISL---SRSSAVDFISTKRAHTRRDGLDYVQVEDIASCESIKFSVHISVANDGAMDG 692
Query: 685 AHTLLVFSTPPAG-HWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIP 743
+H +L+F+ + P KQLV FE+++ AG V I + CK +S + G R +
Sbjct: 693 SHAVLLFTRSKSSVPGFPLKQLVGFERLYAAAGKATNVEITVDPCKLMSSANTEGRRVLL 752
Query: 744 LGEHNIHIGGTKHSVSLHA 762
LG H + +G +H + A
Sbjct: 753 LGSHLLMVGDEEHEFFMEA 771
>gi|326491679|dbj|BAJ94317.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 772
Score = 705 bits (1820), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 347/746 (46%), Positives = 489/746 (65%), Gaps = 24/746 (3%)
Query: 35 ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
A + FC SLP P R L+ L+L EK+ L + AA VPRLG+ YEWWSE+LHG+
Sbjct: 33 AEANSYAFCDGSLPFPVRARALVSLLTLDEKIAQLSNTAAGVPRLGVPPYEWWSESLHGL 92
Query: 95 SNVGPGTKFG-GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYW 153
++ GPG F G AT FPQVI +A++FN +LW A+ V+ EARAM+N G AGLTYW
Sbjct: 93 ADNGPGVNFSSGPVAAATIFPQVILSAAAFNRSLWRAVAEAVAVEARAMHNAGQAGLTYW 152
Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD----RLKVAASCKHFTA 209
+PN+N+FRDPRWGRGQETPGEDP + Y+ YV+G QG GD R+ ++A CKH+ A
Sbjct: 153 APNINVFRDPRWGRGQETPGEDPAMIAAYSVEYVKGFQGEYGDGREGRMMLSACCKHYIA 212
Query: 210 YDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
YDL+ W R+ FNA+V+ QD EDT++ PF+ C+ EG+ + +MCSYNQVNGVP CA +
Sbjct: 213 YDLEKWGKFARYTFNAEVNAQDFEDTYEPPFKSCIQEGRASCLMCSYNQVNGVPACARKD 272
Query: 270 ILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHT 329
+L++ IR EW GYIVSDCD+V + ++ Q +TS+ E++ A ++AG+D++CG FL HT
Sbjct: 273 LLQK-IRDEWGFKGYIVSDCDAVAIIHENQTYTSSDEDSVAIVLKAGMDVNCGSFLIRHT 331
Query: 330 ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEA 389
+SA+++G + E DIN+AL N +VQ+RLG+F+ +Q + LGP +VCT +H+ELA EA
Sbjct: 332 KSAIEKGKIQEEDINHALYNLFSVQLRLGLFEKANENQWFTRLGPSNVCTKEHRELAAEA 391
Query: 390 ARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA 449
RQG VLLKN LPL + +A+IG ++ M G+Y G+ C T L+G+ +
Sbjct: 392 VRQGTVLLKNDNSFLPLKRSKVSHIALIGAAANDAYIMGGDYTGVPCDPITFLKGMQAFV 451
Query: 450 -RTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQ 508
+T GCKDV+C FG AI+A+++AD +++ GL+ + E+E LDR LLLPGRQQ
Sbjct: 452 PQTTVAAGCKDVSCDSPDGFGEAIEAAKRADIVVVIAGLNLTQESEDLDRVTLLLPGRQQ 511
Query: 509 ELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTS 568
+LV+ ++ +K P +LV+ GGP+DVAFAK DPRIA+++W GYPG+ GG + +ILFG
Sbjct: 512 DLVNIIASVTKKPIVLVITGGGPVDVAFAKQDPRIASVLWIGYPGEVGGQVLPEILFGEY 571
Query: 569 NPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFV 628
NPGGKLPMTWYP+ + T +PM +M MR S+ YPGRTYRFY G VVY FG+G+SY+ +
Sbjct: 572 NPGGKLPMTWYPESF-TAVPMNDMNMRADPSRGYPGRTYRFYTGEVVYGFGYGLSYSKYS 630
Query: 629 HTVANAPTVVAVPLDGRHGSINATISGKA----------IKVTH-AKCNRLTLGVQVDVK 677
+ + AP +++ H + IS K ++V A C L V + V
Sbjct: 631 YNIVQAPQRISL----SHSPVPGLISRKPAYTRRDGLDYVQVEDIASCESLVFSVHISVA 686
Query: 678 NVGSKDGAHTLLVFSTPPAG-HWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDR 736
N G+ DG+H +L+F+ + P KQLV FE+V+ AG+ + V I + CKY+S +
Sbjct: 687 NDGAMDGSHAVLLFARSKSSVPGFPLKQLVGFERVYTAAGSSKNVAITVDPCKYMSAANT 746
Query: 737 SGTRRIPLGEHNIHIGGTKHSVSLHA 762
G R + LG H++ +G H + A
Sbjct: 747 EGRRVLLLGSHHLMVGDEVHEFVIEA 772
>gi|384872601|gb|AFI25186.1| putative beta-D-xylosidase [Nicotiana tabacum]
Length = 791
Score = 702 bits (1813), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 346/748 (46%), Positives = 470/748 (62%), Gaps = 34/748 (4%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
FC +LPI RV LI L++ EK+ L ++PRLG+ YEWWSE+LHG++ GP
Sbjct: 43 FCNKNLPISTRVQSLISLLTIDEKILHLSDNTTSIPRLGLPAYEWWSESLHGIATNGPAV 102
Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFR 161
F G G TSFPQVI TA++FN TLW +I ++ EARAMYN G AGLT+W+PN+NI R
Sbjct: 103 NFNGQIKGVTSFPQVILTAAAFNRTLWHSIATAIAVEARAMYNLGQAGLTFWAPNINILR 162
Query: 162 DPRWGRGQETPGEDPVLSGKYAASYVRGLQG------------------------SDGDR 197
DPRWGRGQETPGEDP++ YA YV G QG +DG+R
Sbjct: 163 DPRWGRGQETPGEDPMVVSAYAIEYVTGFQGLNPKAKKGNRNGYGKKRRVLKEDDNDGER 222
Query: 198 LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYN 257
L ++A CKHFTAYDL+ W R+ FNA V+KQD+EDTF PFR C+ +GK + +MCSYN
Sbjct: 223 LMLSACCKHFTAYDLEKWGDATRYDFNAVVTKQDMEDTFQAPFRSCIQQGKASCLMCSYN 282
Query: 258 QVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGL 317
VNGVP CAD +L + +R +W +GYI SDCD+V Y+ Q +T TPE+A A A++AG
Sbjct: 283 SVNGVPACADKELLDK-VRTDWGFDGYITSDCDAVATIYENQKYTKTPEDAVAVALKAGT 341
Query: 318 DLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDV 377
+++CG ++ H +SA Q+G + E D++ AL +VQ RLG+FDG P+ + + G +DV
Sbjct: 342 NINCGTYMLRHMKSAFQQGSVLEEDLDRALQYLFSVQFRLGLFDGNPADGQFANFGAQDV 401
Query: 378 CTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACG 437
CT +H LAL+AARQGIVLLKN LPL T+A++GP ++V+ + G Y+G+ C
Sbjct: 402 CTSNHLNLALDAARQGIVLLKNDQKFLPLDKTSVSTLAIVGPMANVS-SPGGTYSGVPCK 460
Query: 438 YTTPLQGIGRYA-RTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL 496
+ +G R+ RT++ GC DV C F AI ++AD I+V G D S E E
Sbjct: 461 LKSIREGFHRHINRTLYAAGCLDVGCNSTAGFQDAISIVKEADYVIVVAGSDLSEETEDH 520
Query: 497 DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAG 556
DR LLLPG+Q LV+ ++ ASK P ILVL GGP+DV+FA+ DPRIA+I+W YPG+ G
Sbjct: 521 DRYSLLLPGQQTNLVTTLAAASKKPIILVLTGGGPVDVSFAEKDPRIASILWVAYPGETG 580
Query: 557 GTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVY 616
G A+++I+FG NPGGKLPMTWY + + T +PMT+M MR S YPGRTYRFY G V+Y
Sbjct: 581 GKALSEIIFGYQNPGGKLPMTWYLESF-TKVPMTDMNMRADPSNGYPGRTYRFYTGDVLY 639
Query: 617 PFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKA----IKVTHAK-CNRLTLG 671
FGHG+SYT+F + +AP+ +++ L + + G++ I V + C+
Sbjct: 640 GFGHGLSYTSFSSQLLSAPSRLSLSLAKSNRKRSILAKGRSRLGYIHVDEVESCHSSKFF 699
Query: 672 VQVDVKNVGSKDGAHTLLVFSTPPAG-HWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKY 730
V + V N G DG+H L++FS AP KQLV F++VHVPA + + C+
Sbjct: 700 VHISVTNDGDMDGSHVLMLFSRVLQNFQGAPQKQLVGFDRVHVPARKYVETSLLVDPCEL 759
Query: 731 LSVVDRSGTRRIPLGEHNIHIGGTKHSV 758
S + G R + LGEH + +H V
Sbjct: 760 FSFANDQGNRILALGEHTFILDDIEHVV 787
>gi|115459584|ref|NP_001053392.1| Os04g0530700 [Oryza sativa Japonica Group]
gi|38346629|emb|CAD41212.2| OSJNBa0074L08.23 [Oryza sativa Japonica Group]
gi|38346760|emb|CAE03865.2| OSJNBa0081C01.11 [Oryza sativa Japonica Group]
gi|113564963|dbj|BAF15306.1| Os04g0530700 [Oryza sativa Japonica Group]
gi|218195263|gb|EEC77690.1| hypothetical protein OsI_16749 [Oryza sativa Indica Group]
Length = 770
Score = 702 bits (1811), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 347/738 (47%), Positives = 481/738 (65%), Gaps = 22/738 (2%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
PFC +LP P R L+ L+L EK+ L + AA PRLG+ +EWWSE+LHGV + GPG
Sbjct: 39 PFCNATLPFPARARALVSLLTLDEKIAQLSNTAAGAPRLGVPPFEWWSESLHGVCDNGPG 98
Query: 101 TKFG-GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNI 159
F G AT FPQVI +A++FN +LW A R ++ EARAM+N G AGLT+W+PN+N+
Sbjct: 99 VNFSSGPVRSATIFPQVILSAAAFNRSLWRAAARAIAVEARAMHNAGQAGLTFWAPNINV 158
Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD--RLKVAASCKHFTAYDLDNWNG 217
FRDPRWGRGQETPGEDP + Y+ YV+G Q G+ R+ ++A CKH+ AYDL+ W G
Sbjct: 159 FRDPRWGRGQETPGEDPAVVSAYSVEYVKGFQRDYGEEGRMMLSACCKHYIAYDLEKWRG 218
Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
R+ FNAKV+ QD+EDT+ PF+ C+ EG+ + +MCSYNQVNGVP CA +IL+R R
Sbjct: 219 FTRYTFNAKVNAQDMEDTYQPPFKSCIQEGRASCLMCSYNQVNGVPACARKDILQRA-RD 277
Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGL 337
EW GYI SDCD+V + ++ Q +T++ E++ A ++AG+D++CG FL HT+SA+++G
Sbjct: 278 EWGFQGYITSDCDAVAIIHENQTYTASDEDSIAVVLKAGMDINCGSFLIRHTKSAIEKGK 337
Query: 338 LSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLL 397
+ E DIN+AL N +VQ+RLG FD +Q + LGP +VCT +H+ELA EA RQG VLL
Sbjct: 338 VQEEDINHALFNLFSVQLRLGFFDKTNENQWFTQLGPNNVCTTEHRELAAEAVRQGTVLL 397
Query: 398 KNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA-RTIHQQG 456
KN LPL +A+IGP ++ + G+Y G+ C TT ++G+ Y +T G
Sbjct: 398 KNDNGFLPLKRSEVGHIALIGPAANDPYILGGDYTGVPCHSTTFVKGMQAYVPKTTFAAG 457
Query: 457 CKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSM 516
CKDV C FG AI+A+++AD +L+ GL+ + E E DR LLLPGRQ +L+ V+
Sbjct: 458 CKDVPCNSTDGFGEAIEAAKRADVVVLIAGLNLTEETEDHDRVSLLLPGRQMDLIHTVAS 517
Query: 517 ASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPM 576
+K P +LVLM GGP+DV+FAK+DPRIA+I+W GYPG+ GG + +ILFG NPGGKLP+
Sbjct: 518 VTKKPVVLVLMGGGPVDVSFAKHDPRIASILWIGYPGEVGGNVLPEILFGKYNPGGKLPI 577
Query: 577 TWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
TWYP+ + T +PM +M MR S+ YPGRTYRFY G VVY FG+G+SY+ + +++ AP
Sbjct: 578 TWYPESF-TAVPMDDMNMRADASRGYPGRTYRFYTGDVVYGFGYGLSYSKYSYSILQAPK 636
Query: 637 VVAVPLDGRHGSINATISGKAIKVTH-----------AKCNRLTLGVQVDVKNVGSKDGA 685
+++ S+ IS K A C L V + V N G+ DG+
Sbjct: 637 KISL----SRSSVPDLISRKPAYTRRDGVDYVQVEDIASCEALQFPVHISVSNDGAMDGS 692
Query: 686 HTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPL 744
H +L+F S+ P+ +P KQLV FE+VH AG V I + CK +S + GTR + L
Sbjct: 693 HAVLLFASSKPSFPGSPIKQLVGFERVHTAAGRSTDVEITVDPCKLMSFANTEGTRVLFL 752
Query: 745 GEHNIHIGGTKHSVSLHA 762
G H + +G +H + + A
Sbjct: 753 GTHVLMVGDEEHELLIEA 770
>gi|115485165|ref|NP_001067726.1| Os11g0297800 [Oryza sativa Japonica Group]
gi|62734696|gb|AAX96805.1| beta-D-xylosidase [Oryza sativa Japonica Group]
gi|77549999|gb|ABA92796.1| Glycosyl hydrolase family 3 C terminal domain containing protein,
expressed [Oryza sativa Japonica Group]
gi|113644948|dbj|BAF28089.1| Os11g0297800 [Oryza sativa Japonica Group]
gi|125534139|gb|EAY80687.1| hypothetical protein OsI_35869 [Oryza sativa Indica Group]
gi|215766717|dbj|BAG98945.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 782
Score = 701 bits (1810), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 355/752 (47%), Positives = 482/752 (64%), Gaps = 21/752 (2%)
Query: 27 PFACDPKDATTRTLP-FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYE 85
PF+C A FC +LP QR DL+ RL+ EKV L AA VPRLG+ Y+
Sbjct: 36 PFSCGGAAAGGGQGYAFCDATLPAEQRAADLVARLTAAEKVAQLGDQAAGVPRLGVPAYK 95
Query: 86 WWSEALHGVSNVGPGTKFGGDFPG-----ATSFPQVITTASSFNATLWEAIGRVVSDEAR 140
WWSEALHG++ G G F D PG ATSFPQV+ TA++F+ LW IG+ + EAR
Sbjct: 96 WWSEALHGLATSGRGLHF--DAPGSAARAATSFPQVLLTAAAFDDDLWFRIGQAIGTEAR 153
Query: 141 AMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLK 199
A+YN G A GLT WSPNVNIFRDPRWGRGQETPGEDP ++ KYA ++V+G+QG+ L+
Sbjct: 154 ALYNIGQAEGLTMWSPNVNIFRDPRWGRGQETPGEDPTMASKYAVAFVKGMQGNSSAILQ 213
Query: 200 VAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQV 259
+A CKH TAYDL++WNGV R++FNAKV+ QD+EDT++ PFR CV++ K +MC+Y +
Sbjct: 214 TSACCKHVTAYDLEDWNGVQRYNFNAKVTAQDLEDTYNPPFRSCVVDAKATCIMCAYTGI 273
Query: 260 NGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDL 319
NGVP CA+ ++L +T+RG+W L+GYI SDCD+V + D Q +T TPE+A A A++AGLD+
Sbjct: 274 NGVPACANADLLTKTVRGDWGLDGYIASDCDAVAIMRDAQRYTQTPEDAVAVALKAGLDM 333
Query: 320 DCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQP-YGHLGPKDVC 378
+CG ++ H +A+Q+G L+E DI+ AL N ++MRLG FDG+P S YG LG D+C
Sbjct: 334 NCGTYMQQHATAAIQQGKLTEEDIDKALKNLFAIRMRLGHFDGDPRSNSVYGGLGAADIC 393
Query: 379 TPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGY 438
TP+H+ LALEAA GIVLLKN LPL + AVIGPN++ + +IGNY G C
Sbjct: 394 TPEHRSLALEAAMDGIVLLKNDAGILPLDRTAVASAAVIGPNANDGLALIGNYFGPPCES 453
Query: 439 TTPLQGIGRYARTIH-QQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALD 497
TTPL GI Y + + GC AC AA AS +D L MGL Q E+E D
Sbjct: 454 TTPLNGILGYIKNVRFLAGCNSAACDVAATDQAAAVAS-SSDYVFLFMGLSQKQESEGRD 512
Query: 498 RAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGG 557
R LLLPG QQ L++ V+ A+K P ILVL++GGP+DV FA+ +P+I AI+WAGYPGQAGG
Sbjct: 513 RTSLLLPGEQQSLITAVADAAKRPVILVLLTGGPVDVTFAQTNPKIGAILWAGYPGQAGG 572
Query: 558 TAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYP 617
AIA +LFG NPGG+LP+TWYP+E+ T +PMT+M MR + YPGR+YRFY+G VY
Sbjct: 573 LAIARVLFGDHNPGGRLPVTWYPEEF-TKVPMTDMRMRADPATGYPGRSYRFYQGKTVYK 631
Query: 618 FGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK------CNRLTLG 671
FG+G+SY+++ + + A S+ T + + + H + C +L
Sbjct: 632 FGYGLSYSSYSRQLVSGGK-PAESYTNLLASLRTTTTSEGDESYHIEEIGTDGCEQLKFP 690
Query: 672 VQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKY 730
V+V+N G DG H++L++ P A P QL+ F H+ G + + +I C++
Sbjct: 691 AVVEVQNHGPMDGKHSVLMYLRWPNAKGGRPTTQLIGFRSQHLKVGEKANIRFDISPCEH 750
Query: 731 LSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHA 762
S V + G + I G H + + + + A
Sbjct: 751 FSRVRKDGKKVIDRGSHYLMVDKDELEIRFEA 782
>gi|26449574|dbj|BAC41913.1| putative beta-xylosidase [Arabidopsis thaliana]
Length = 732
Score = 701 bits (1810), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 346/732 (47%), Positives = 466/732 (63%), Gaps = 34/732 (4%)
Query: 62 LQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTAS 121
L EK+ L + AA+VPRLGI YEWWSE+LHG+++ GPG F G ATSFPQVI +A+
Sbjct: 2 LPEKIGQLSNTAASVPRLGIPPYEWWSESLHGLADNGPGVSFNGSISAATSFPQVIVSAA 61
Query: 122 SFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGK 181
SFN TLW IG V+ E RAMYNGG AGLT+W+PN+N+FRDPRWGRGQETPGEDP + +
Sbjct: 62 SFNRTLWYEIGSAVAVEGRAMYNGGQAGLTFWAPNINVFRDPRWGRGQETPGEDPKVVSE 121
Query: 182 YAASYVRGLQ----------------------GSDGDRLKVAASCKHFTAYDLDNWNGVD 219
Y +VRG Q +L ++A CKHFTAYDL+ W
Sbjct: 122 YGVEFVRGFQEKKKRKVLKRRFSDDVDDDRHDDDADGKLMLSACCKHFTAYDLEKWGNFT 181
Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
R+ FNA V++QD+EDT+ PF C+ +GK + +MCSYN VNGVP CA ++L++ R EW
Sbjct: 182 RYDFNAVVTEQDMEDTYQPPFETCIRDGKASCLMCSYNAVNGVPACAQGDLLQKA-RVEW 240
Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLS 339
GYI SDCD+V + Q +T +PEEA ADAI+AG+D++CG ++ HT+SA+++G +S
Sbjct: 241 GFEGYITSDCDAVATIFAYQGYTKSPEEAVADAIKAGVDINCGTYMLRHTQSAIEQGKVS 300
Query: 340 EIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKN 399
E ++ AL+N VQ+RLG+FDG+P YG LG D+C+ DH++LALEA RQGIVLLKN
Sbjct: 301 EELVDRALLNLFAVQLRLGLFDGDPRRGQYGKLGSNDICSSDHRKLALEATRQGIVLLKN 360
Query: 400 QGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA-RTIHQQGCK 458
LPL+ ++A++GP ++ M G Y G C T + Y +T + GC
Sbjct: 361 DHKLLPLNKNHVSSLAIVGPMANNISNMGGTYTGKPCQRKTLFTELLEYVKKTSYASGCS 420
Query: 459 DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMAS 518
DV+C D FG A+ ++ AD I+V GLD S E E DR L LPG+Q++LVS V+ S
Sbjct: 421 DVSCDSDTGFGEAVAIAKGADFVIVVAGLDLSQETEDKDRVSLSLPGKQKDLVSHVAAVS 480
Query: 519 KGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTW 578
K P ILVL GGP+DV FAKNDPRI +IIW GYPG+ GG A+A+I+FG NPGG+LP TW
Sbjct: 481 KKPVILVLTGGGPVDVTFAKNDPRIGSIIWIGYPGETGGQALAEIIFGDFNPGGRLPTTW 540
Query: 579 YPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
YP+ + T++ M++M MR + S+ YPGRTYRFY GP VY FG G+SYT F + + +AP +
Sbjct: 541 YPESF-TDVAMSDMHMRANSSRGYPGRTYRFYTGPQVYSFGTGLSYTKFEYKILSAPIRL 599
Query: 639 AVP--LDGRHGSINATISGKAIK------VTHAKCNRLTLGVQVDVKNVGSKDGAHTLLV 690
++ L + G+ ++ V C L V+V V N G DG+H +++
Sbjct: 600 SLSELLPQQSSHKKQLQHGEELRYLQLDDVIVNSCESLRFNVRVHVSNTGEIDGSHVVML 659
Query: 691 FST-PPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNI 749
FS PP P KQL+ +++VHV + I CK LSV + G R IPLG H +
Sbjct: 660 FSKMPPVLSGVPEKQLIGYDRVHVRSNEMMETVFVIDPCKQLSVANDVGKRVIPLGSHVL 719
Query: 750 HIGGTKHSVSLH 761
+G +HS+S+
Sbjct: 720 FLGDLQHSLSVE 731
>gi|357489463|ref|XP_003615019.1| hypothetical protein MTR_5g062650 [Medicago truncatula]
gi|355516354|gb|AES97977.1| hypothetical protein MTR_5g062650 [Medicago truncatula]
Length = 785
Score = 701 bits (1810), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 357/784 (45%), Positives = 504/784 (64%), Gaps = 39/784 (4%)
Query: 5 IAFFFLGLI----LLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
I+F FL L L+ A S L P++CD + T++ FC ++L QR D++ RL
Sbjct: 11 ISFIFLFLTRYNQLVHADSPTLVP--PYSCDITNPLTKSYTFCNLNLTTIQRAKDIVSRL 68
Query: 61 SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGD--FPGATSFPQVIT 118
+L EK+ L++ A A+PRLGI Y+WWSEALHGV++ G G + G+ AT FPQVI
Sbjct: 69 TLDEKLAQLVNTAPAIPRLGIHSYQWWSEALHGVADYGKGIRLNGNVTIKAATIFPQVIL 128
Query: 119 TASSFNATLWEAIGRVVSDEARAMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPV 177
TA+SF++ LW I +V+ EARA+YN G A G+T+W+PN+NIFRDPRWGRGQET GEDP+
Sbjct: 129 TAASFDSKLWYRISKVIGTEARAVYNAGQAEGMTFWAPNINIFRDPRWGRGQETAGEDPL 188
Query: 178 LSGKYAASYVRGLQGS-------DGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQ 230
+S KYA S+VRGLQG + DRLK +A CKHFTAYDLDNW GVDRF F+A V+ Q
Sbjct: 189 VSAKYAVSFVRGLQGDSFEGGKLNEDRLKASACCKHFTAYDLDNWKGVDRFDFDANVTLQ 248
Query: 231 DIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCD 290
D+ DT+ PF C+++G+ + +MC+YN+VNG+P CAD N+L T R +W NGYI SDC
Sbjct: 249 DLADTYQPPFHSCIVQGRSSGIMCAYNRVNGIPNCADYNLLTNTARKKWNFNGYITSDCS 308
Query: 291 SVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNT 350
+V + +D Q + PE+A AD ++AG+D++CG + H++SAV + + I+ AL N
Sbjct: 309 AVDIIHDRQGYAKAPEDAVADVLQAGMDVECGDYFTSHSKSAVLQKKVPISQIDRALHNL 368
Query: 351 LTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIR 410
++++RLG+FDG P+ YG +GP VC+ + +ALEAAR GIVLLKN LPL
Sbjct: 369 FSIRIRLGLFDGHPTKLKYGKIGPNRVCSKQNLNIALEAARSGIVLLKNAASILPLPK-S 427
Query: 411 HRTVAVIGPNSDVTVTMI-GNYAGIACGYTTPLQGIGRYA-RTIHQQGCKD-VACADDQL 467
++ VIGPN++ + ++ GNY G C T LQG Y+ ++ GC D C ++
Sbjct: 428 TDSIVVIGPNANSSSQVVLGNYFGRPCNLVTILQGFENYSDNLLYHPGCSDGTKCVSAEI 487
Query: 468 FGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLM 527
A++ ++ D +LVMGLDQS E+E DR L LPG+QQEL++ V+ ASK P ILVL
Sbjct: 488 -DRAVEVAKVVDYVVLVMGLDQSQESEGHDRDDLELPGKQQELINSVAKASKRPVILVLF 546
Query: 528 SGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL 587
GGP+D++FAK D +I I+WAGYPG+ GG A+A ++FG NPGG+LPMTWYP+++I +
Sbjct: 547 CGGPVDISFAKVDDKIGGILWAGYPGELGGMALAQVVFGDYNPGGRLPMTWYPKDFI-KI 605
Query: 588 PMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHG 647
PMT+M MR S YPGRTYRFY GP VY FG+G+SY+N+ + ++V + H
Sbjct: 606 PMTDMRMRADPSSGYPGRTYRFYTGPKVYEFGYGLSYSNYSY------NFISVKNNNLHI 659
Query: 648 SINATIS-GKAIKVTHAK---------CNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG 697
+ + T S + + H K C +++ V + + N GS G H +L+F P G
Sbjct: 660 NQSTTYSILEKSQTIHYKLVSELGKKACKTMSISVTLGITNTGSMAGKHPVLLFVKPKKG 719
Query: 698 -HWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKH 756
+ P KQLV FE V V G + VG + VC++LS + SG + I G + +G ++
Sbjct: 720 RNGNPVKQLVGFESVTVEGGGKGEVGFEVSVCEHLSRANESGVKVIEEGGYLFLVGELEY 779
Query: 757 SVSL 760
S+++
Sbjct: 780 SINI 783
>gi|356552866|ref|XP_003544783.1| PREDICTED: probable beta-D-xylosidase 7-like [Glycine max]
Length = 776
Score = 701 bits (1810), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 343/771 (44%), Positives = 502/771 (65%), Gaps = 20/771 (2%)
Query: 6 AFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEK 65
A F+ +LL+ + + P++CD + + PFC LPI +R DL+ RL+L EK
Sbjct: 8 AAIFISFLLLTLHHHAESTQPPYSCD-SSSNSPYYPFCNTRLPISKRAQDLVSRLTLDEK 66
Query: 66 VKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNA 125
+ L++ A A+PRLGI Y+WWSEALHGV++ G G +F G ATSFPQVI TA+SF+
Sbjct: 67 LAQLVNTAPAIPRLGIPSYQWWSEALHGVADAGFGIRFNGTIKSATSFPQVILTAASFDP 126
Query: 126 TLWEAIGRVVSDEARAMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAA 184
LW I + + EARA+YN G A G+T+W+PN+N+FRDPRWGRGQET GEDP+++ KY
Sbjct: 127 NLWYQISKTIGKEARAVYNAGQATGMTFWAPNINVFRDPRWGRGQETAGEDPLMNAKYGV 186
Query: 185 SYVRGLQGSD------GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDV 238
+YVRGLQG G+RL+ +A CKHFTAYDLD+W G+DRF ++A+V+ QD+ DT+
Sbjct: 187 AYVRGLQGDSFEGGKLGERLQASACCKHFTAYDLDHWKGLDRFVYDARVTSQDLADTYQP 246
Query: 239 PFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDT 298
PF+ C+ +G+ + +MC+YN+VNGVP CA+ N+L +T R +W+ +GYI SDC +V + +D
Sbjct: 247 PFQSCIEQGRASGIMCAYNRVNGVPNCANFNLLTKTARQQWKFDGYITSDCGAVSIIHDE 306
Query: 299 QHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLG 358
Q + T E+A AD RAG+D++CG ++ H +SAV + L I+ AL N ++++RLG
Sbjct: 307 QGYAKTAEDAIADVFRAGMDVECGDYITKHGKSAVSQKKLPISQIDRALQNLFSIRIRLG 366
Query: 359 MFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIG 418
+ DG P+ P+G +GP VC+ +LALEAAR GIVLLKN LPL + T+A+IG
Sbjct: 367 LLDGNPTKLPFGTIGPDQVCSKQSLQLALEAARDGIVLLKNTNSLLPLPKT-NPTIALIG 425
Query: 419 PNSDVTV-TMIGNYAGIACGYTTPLQGIGRYAR-TIHQQGCKD-VACADDQLFGAAIDAS 475
PN++ + +GNY G C T LQG YA+ T++ GC D CA Q+ G A++ +
Sbjct: 426 PNANASSKVFLGNYYGRPCNLVTLLQGFEGYAKDTVYHPGCDDGPQCAYAQIEG-AVEVA 484
Query: 476 RQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
++ D +LVMGLDQS E E+ DR L LPG+Q+EL+ V+ ASK P +LVL+ GGP+D+
Sbjct: 485 KKVDYVVLVMGLDQSQERESHDREYLGLPGKQEELIKSVARASKRPVVLVLLCGGPVDIT 544
Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
AK D ++ I+WAGYPG+ GG A+A ++FG NPGGKLP+TWYP+++I +PMT+M MR
Sbjct: 545 SAKFDDKVGGILWAGYPGELGGVALAQVVFGDHNPGGKLPITWYPKDFI-KVPMTDMRMR 603
Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVAN-APTVVAVPLDGRHGSI--NAT 652
+ YPGRTYRFY GP VY FG+G+SYT + + + + + + + H + + T
Sbjct: 604 ADPASGYPGRTYRFYTGPKVYEFGYGLSYTKYSYKLLSLSHNTLHINQSSTHLTTQNSET 663
Query: 653 ISGKAI-KVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF--STPPAGHWAPHKQLVAFE 709
I K + ++ C + L + + V N G+ G H +L+F + P KQLV F+
Sbjct: 664 IRYKLVSELAEETCQTMLLSIALGVTNHGNMAGKHPVLLFVRQGKVRNNGNPVKQLVGFQ 723
Query: 710 KVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
V + AG +VG + C++LSV + +G+ I G + + +G ++ + +
Sbjct: 724 SVKLNAGETVQVGFELSPCEHLSVANEAGSMVIEEGSYLLLVGDQEYPIEI 774
>gi|356548162|ref|XP_003542472.1| PREDICTED: probable beta-D-xylosidase 7-like [Glycine max]
Length = 778
Score = 701 bits (1808), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 345/773 (44%), Positives = 500/773 (64%), Gaps = 25/773 (3%)
Query: 5 IAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQE 64
I+FF L L A S+ R P++CD + + FC LPI +R DL+ RL+L E
Sbjct: 12 ISFFLLNLHHHHAEST----RPPYSCD-SSSNSPYYSFCNTKLPITKRAQDLVSRLTLDE 66
Query: 65 KVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFN 124
K+ L++ A A+PRLGI Y+WWSEALHGV++ G G +F G ATSFPQVI TA+SF+
Sbjct: 67 KLAQLVNTAPAIPRLGIPSYQWWSEALHGVADAGFGIRFNGTIKSATSFPQVILTAASFD 126
Query: 125 ATLWEAIGRVVSDEARAMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYA 183
LW I + + EARA+YN G A G+T+W+PN+N+FRDPRWGRGQET GEDP+++ KY
Sbjct: 127 PNLWYQISKTIGREARAVYNAGQATGMTFWAPNINVFRDPRWGRGQETAGEDPLMNAKYG 186
Query: 184 ASYVRGLQGSD------GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFD 237
+YVRGLQG +RL+ +A CKHFTAYDLD W G+DRF F+A+V+ QD+ DT+
Sbjct: 187 VAYVRGLQGDSFEGGKLAERLQASACCKHFTAYDLDQWKGLDRFVFDARVTSQDLADTYQ 246
Query: 238 VPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYD 297
PF+ C+ +G+ + +MC+YN+VNGVP CAD N+L +T R +W+ +GYI SDC +V + ++
Sbjct: 247 PPFQSCIEQGRASGIMCAYNRVNGVPNCADFNLLTKTARQQWKFDGYITSDCGAVSIIHE 306
Query: 298 TQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRL 357
Q + T E+A AD RAG+D++CG ++ H +SAV + L I+ AL N ++++RL
Sbjct: 307 KQGYAKTAEDAIADVFRAGMDVECGDYITKHAKSAVFQKKLPISQIDRALQNLFSIRIRL 366
Query: 358 GMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVI 417
G+FDG P+ P+G +GP +VC+ +LALEAAR GIVLLKN LPL + T+A+I
Sbjct: 367 GLFDGNPTKLPFGTIGPNEVCSKQSLQLALEAARDGIVLLKNTNSLLPLPKT-NPTIALI 425
Query: 418 GPNSDVTV-TMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKD-VACADDQLFGAAIDAS 475
GPN++ + +GNY G C T LQG YA+T++ GC D CA Q+ A++ +
Sbjct: 426 GPNANASSKVFLGNYYGRPCNLVTLLQGFEGYAKTVYHPGCDDGPQCAYAQI-EEAVEVA 484
Query: 476 RQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
++ D +LVMGLDQS E E+ DR L LPG+Q+EL+ V+ A+K P ++VL+ GGP+D+
Sbjct: 485 KKVDYVVLVMGLDQSQERESHDREYLGLPGKQEELIKSVARAAKRPVVVVLLCGGPVDIT 544
Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
AK D ++ I+WAGYPG+ GG A+A ++FG NPGGKLP+TWYP+++I +PMT+M MR
Sbjct: 545 SAKFDDKVGGILWAGYPGELGGVALAQVVFGDHNPGGKLPITWYPKDFI-KVPMTDMRMR 603
Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVAN-APTVVAVPLDGRH--GSINAT 652
+ YPGRTYRFY GP VY FG+G+SYT + + + + + + + + H + T
Sbjct: 604 ADPASGYPGRTYRFYTGPKVYEFGYGLSYTKYSYKLLSLSHSTLHINQSSTHLMTQNSET 663
Query: 653 ISGKAI-KVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF----STPPAGHWAPHKQLVA 707
I K + ++ C + L + + V N G+ G H +L+F + P KQLV
Sbjct: 664 IRYKLVSELAEETCQTMLLSIALGVTNRGNLAGKHPVLLFVRQGKVRNINNGNPVKQLVG 723
Query: 708 FEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
F+ V V AG +VG + C++LSV + +G+ I G + +G ++ + +
Sbjct: 724 FQSVKVNAGETVQVGFELSPCEHLSVANEAGSMVIEEGSYLFIVGDQEYPIEV 776
>gi|413925162|gb|AFW65094.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 774
Score = 698 bits (1802), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 354/754 (46%), Positives = 483/754 (64%), Gaps = 19/754 (2%)
Query: 22 LAAREPFACDPKDA-TTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLG 80
+A PF+C P A + L FC V+L QR DL+ RL+ EK+ L A VPRLG
Sbjct: 25 VAGDPPFSCGPSSAEASEGLAFCDVTLAPAQRAADLVSRLTAAEKIAQLGDQAPGVPRLG 84
Query: 81 IKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEAR 140
+ GY+WW+EALHG++ G G F ATSFPQV+ TA++F+ LW IG+ + EAR
Sbjct: 85 VPGYKWWNEALHGLATSGKGLHFDAAVRAATSFPQVLLTAAAFDDDLWLRIGQAIGREAR 144
Query: 141 AMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRL 198
A++N G A GLT WSPNVNIFRDPRWGRGQETPGEDP ++ +YA ++VRG+QG S L
Sbjct: 145 ALFNVGQAEGLTIWSPNVNIFRDPRWGRGQETPGEDPAVASRYAVAFVRGIQGNSSSSLL 204
Query: 199 KVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQ 258
+ +A CKH TAYDL++WNGV R+ F A+V++QD+EDTF+ PFR CV+E K + VMC+Y
Sbjct: 205 QTSACCKHATAYDLEDWNGVARYSFVARVTEQDLEDTFNPPFRSCVVEAKASCVMCAYTA 264
Query: 259 VNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLD 318
+NGVP CA+ ++L T+RG+W L+GY+ SDCD+V + D Q + TPE+A A +++AGLD
Sbjct: 265 INGVPACANSDLLTGTVRGDWGLDGYVASDCDAVAIMRDAQRYAPTPEDAVAVSLKAGLD 324
Query: 319 LDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVC 378
+DCG ++ H +A+Q+G L+E DI+ AL N V+MRLG FDG+P YG LG D+C
Sbjct: 325 IDCGSYVQQHAAAAIQQGKLTEQDIDKALTNLYAVRMRLGHFDGDPRKNMYGVLGAADIC 384
Query: 379 TPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGY 438
TP+H+ LALEAA+ GIVLLKN G LPL + AVIGPN++ + +I NY G C
Sbjct: 385 TPEHRNLALEAAQDGIVLLKNDGGILPLDRSTVTSAAVIGPNANDGMALIANYFGPPCES 444
Query: 439 TTPLQGIGRYARTIH-QQGCKDVAC---ADDQLFGAAIDASRQADATILVMGLDQSIEAE 494
TTPL+G+ Y + GC AC A DQ A+ + D L MGL Q E+E
Sbjct: 445 TTPLKGLQSYVNDVRFLAGCNSAACDVAATDQ----AVALAGSEDYVFLFMGLSQKQESE 500
Query: 495 ALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQ 554
DR LLLPG QQ L++ V+ ASK P ILVL+SGGP+D+ FA+++P+I AI+WAGYPGQ
Sbjct: 501 GKDRTSLLLPGMQQSLITAVADASKRPVILVLLSGGPVDITFAQSNPKIGAILWAGYPGQ 560
Query: 555 AGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPV 614
AGG AIA +LFG NP G+LP+TWYP+E+ T +PMT+M MR + YPGR+YRFY+G
Sbjct: 561 AGGLAIAKVLFGDHNPSGRLPVTWYPEEF-TKVPMTDMRMRADPTSGYPGRSYRFYQGNT 619
Query: 615 VYPFGHGMSYTNFVHTVANAPTVVAVP---LDGRHGSINATISGKAIKVTH---AKCNRL 668
VY FG+G+SY+ F + + +V A+ L G ++ ++ V C +L
Sbjct: 620 VYKFGYGLSYSTFSRRLVHGTSVPALSSTLLTGLRETMTPQDGDRSYHVDAIGTEGCEQL 679
Query: 669 TLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHV 727
V+V+N G DG H++L+F P P QL+ F H+ AG ++ +I
Sbjct: 680 KFPAMVEVQNHGPMDGKHSVLMFLRWPNTKQGRPASQLIGFRSQHLKAGETAKLRFDISP 739
Query: 728 CKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
CK+ S V G + I +G H + + + +
Sbjct: 740 CKHFSRVRADGRKVIDIGSHFLMVDNHEMEIRFE 773
>gi|357156390|ref|XP_003577440.1| PREDICTED: probable beta-D-xylosidase 7-like [Brachypodium
distachyon]
Length = 755
Score = 698 bits (1801), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 357/747 (47%), Positives = 480/747 (64%), Gaps = 17/747 (2%)
Query: 28 FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
F+C P FC +LP QR DL+ +L+L+EKV L A VPR G+ GY WW
Sbjct: 14 FSCGPPQQAQYA--FCNRALPAEQRAADLVAKLTLEEKVSQLGDQAPGVPRFGVPGYNWW 71
Query: 88 SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT 147
SE LHGVS G G F G G T+FPQV+ T +SF+ ++W IG+ + EARAM+N G
Sbjct: 72 SEGLHGVSMWGHGMHFNGAVRGVTTFPQVLLTTASFDDSIWYRIGQAIGTEARAMFNLGQ 131
Query: 148 A-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKH 206
A GLT WSPNVNI+RDPRWGRGQETPGEDP + KYA ++VRGLQG+ L+ +A CKH
Sbjct: 132 ADGLTIWSPNVNIYRDPRWGRGQETPGEDPATASKYAVAFVRGLQGTSTTTLQTSACCKH 191
Query: 207 FTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCA 266
TAYDLD+WN + R++FNAKV+ QD+E+TF+ PF+ CV+EGK VMC+Y VNG+P CA
Sbjct: 192 ATAYDLDDWNRIGRYNFNAKVTAQDLEETFNPPFKSCVVEGKATCVMCAYTSVNGIPACA 251
Query: 267 DPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLG 326
D +L +TI+GEW +NGYI SDCD+V + Y T+ ++ TPE+A A AI+AGLD++CG F
Sbjct: 252 DSGLLTKTIKGEWGMNGYISSDCDAVALLYGTR-YSGTPEDAVAAAIKAGLDMNCGNFSQ 310
Query: 327 LHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQP-YGHLGPKDVCTPDHQEL 385
+H +A+Q+ +SE D++ AL N ++MRLG FDG+P P YG LG +DVC+P H++L
Sbjct: 311 VHGMAALQQRKMSEQDVDKALRNLFAIRMRLGHFDGDPLQSPLYGRLGAQDVCSPAHKDL 370
Query: 386 ALEAARQGIVLLKNQGPSLPLSH--IRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQ 443
ALEAA+ GIVLLKN +LPLS + AVIGPN++ ++GNY G C TTPLQ
Sbjct: 371 ALEAAQNGIVLLKNDAATLPLSRPTAASASFAVIGPNANEPGALLGNYFGPPCETTTPLQ 430
Query: 444 GIGR-YARTIH-QQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGL 501
+ + Y++ + GC AC + A+ A+ +D TIL MGL Q E E LDR L
Sbjct: 431 ALQKFYSKNVRFVPGCDSAACNVADTYQASGLAA-TSDYTILFMGLSQKQEQEGLDRTSL 489
Query: 502 LLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIA 561
LLPG+Q+ L++ V+ A+K P ILVL++GGP+D+ FAK +P+I AI+WAGYPGQAGG AIA
Sbjct: 490 LLPGKQESLITAVAAAAKRPIILVLLTGGPVDITFAKFNPKIGAILWAGYPGQAGGLAIA 549
Query: 562 DILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHG 621
+LFG NP G+LP+TWYP+EY T +PM +M MR + YPGR+YRFYKG VY FG+G
Sbjct: 550 KVLFGEHNPSGRLPVTWYPEEY-TKVPMDDMRMRADPATGYPGRSYRFYKGNAVYKFGYG 608
Query: 622 MSYTNFV-HTVANAPTVVAVP----LDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDV 676
+SY+ F V N+ + P L A+ ++ C RL V+V
Sbjct: 609 LSYSKFSRQLVRNSSSNNRAPNTELLAAAAVDCGASRYYLVEEIGGEVCERLKFPAVVEV 668
Query: 677 KNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD 735
+N G DG ++L+F P A P QLV F + AG + V +I C++ S
Sbjct: 669 ENHGPMDGKQSVLLFLRWPTATEGRPASQLVGFRSQDLRAGEKASVSFDISPCEHFSRTT 728
Query: 736 RSGTRRIPLGEHNIHIGGTKHSVSLHA 762
GT+ I G H + + + +S +
Sbjct: 729 VDGTKVIDRGSHFLMVDEDEMEISFDS 755
>gi|356510699|ref|XP_003524073.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
[Glycine max]
Length = 613
Score = 692 bits (1787), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 339/549 (61%), Positives = 415/549 (75%), Gaps = 6/549 (1%)
Query: 26 EPFACDP-KDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGY 84
+ FACD K FC SL + RV DL+GRL+LQEK+ L++ A V RLGI Y
Sbjct: 28 QTFACDVGKSPAVAGYGFCDKSLGVEARVKDLVGRLTLQEKIGNLVNSAGDVSRLGIPRY 87
Query: 85 EWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN 144
EWWSEALHGVSNVG GT+F PGATSFP I TA+SFN +L+E IGRVVS EA AMYN
Sbjct: 88 EWWSEALHGVSNVGLGTRFSNVVPGATSFPMPILTAASFNTSLFEVIGRVVSTEAGAMYN 147
Query: 145 GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDG---DRLKVA 201
G AGLTYWSPN+NIFRDPRWGRG ETPGEDPVL+ KYAA YV+GLQ +DG ++LKVA
Sbjct: 148 VGLAGLTYWSPNINIFRDPRWGRGLETPGEDPVLTSKYAAGYVKGLQQTDGGDPNKLKVA 207
Query: 202 ASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNG 261
A CKH+TAYD+D W G+ R+ FNA ++KQD+EDTF PF+ CV++G VASVMCSYN+VNG
Sbjct: 208 ACCKHYTAYDVDKWKGIQRYTFNAVLTKQDLEDTFQPPFKSCVIDGNVASVMCSYNKVNG 267
Query: 262 VPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC 321
PTCADP++LK +RGEW+LNGY+VSDCDSV V Y QH+T TPEEAAA +I AGLDL+C
Sbjct: 268 KPTCADPDLLKGVVRGEWKLNGYMVSDCDSVEVLYKYQHYTKTPEEAAAISILAGLDLNC 327
Query: 322 GPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD 381
G FLG +TE AV++GL+ E INNA+ N MRLG FDG+P QPYG+LGPKDVCTP
Sbjct: 328 GRFLGQYTEGAVKQGLIDE-SINNAVSNNFATLMRLGFFDGDPRKQPYGNLGPKDVCTPA 386
Query: 382 HQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTP 441
+QELA EAARQGIV LKN SLPL+ +++AVIGPN++ T MIGNY GI C Y +P
Sbjct: 387 NQELAREAARQGIVSLKNSPASLPLNAKAIKSLAVIGPNANATRVMIGNYEGIPCKYISP 446
Query: 442 LQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGL 501
LQG+ + T + GC DV C + L A S DAT++V+G +IEAE+LDR +
Sbjct: 447 LQGLTAFVPTSYAAGCLDVRCPNPVLDDAK-KISASGDATVIVVGASLAIEAESLDRVNI 505
Query: 502 LLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIA 561
LLPG+QQ LV++V+ ASKGP ILV+MSGG +DV+FAK++ +I +I+W GYPG+AGG AIA
Sbjct: 506 LLPGQQQLLVTEVANASKGPVILVIMSGGGMDVSFAKDNNKITSILWVGYPGEAGGAAIA 565
Query: 562 DILFGTSNP 570
D++FG NP
Sbjct: 566 DVIFGFHNP 574
>gi|253761874|ref|XP_002489311.1| hypothetical protein SORBIDRAFT_0010s012040 [Sorghum bicolor]
gi|241946959|gb|EES20104.1| hypothetical protein SORBIDRAFT_0010s012040 [Sorghum bicolor]
Length = 791
Score = 692 bits (1787), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 356/775 (45%), Positives = 484/775 (62%), Gaps = 20/775 (2%)
Query: 1 MASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
MA+ ++ L LL + +A PF+C P + ++ LPFC + LP QR DL+ R+
Sbjct: 24 MAAHVSVLLLVPALLMRVA--VAGAPPFSCGP-SSPSKGLPFCNMKLPASQRAADLVSRM 80
Query: 61 SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
+ EK L A VPRLG+ Y+WW+EALHGV+ G G ATSFPQV+ TA
Sbjct: 81 TPAEKASQLGDIANGVPRLGVPSYKWWNEALHGVAISGKGIHMNQGVRSATSFPQVLHTA 140
Query: 121 SSFNATLWEAIGRVVSDEARAMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLS 179
+SFN LW IG+ EARA YN G A GLT WSPNVNIFRDPRWGRGQETPGEDP ++
Sbjct: 141 ASFNDNLWFRIGQATGKEARAFYNIGQAEGLTMWSPNVNIFRDPRWGRGQETPGEDPAVA 200
Query: 180 GKYAASYVRGLQGSDGDR------LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIE 233
+Y A++VRGLQGS + L+ +A CKH TAYDL++W GV R+ F A V+ QD+
Sbjct: 201 SRYGAAFVRGLQGSSSNTKSVPPVLQTSACCKHATAYDLEDWKGVSRYSFKATVTIQDLA 260
Query: 234 DTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVG 293
DTF+ PFR CV++GK + VMC+Y VNGVP+CA+ ++L +T RG W L+GY+ +DCD+V
Sbjct: 261 DTFNPPFRSCVVDGKASCVMCAYTIVNGVPSCANGDLLTKTFRGSWGLDGYVAADCDAVA 320
Query: 294 VYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTV 353
+ ++Q + T E+ A ++AGLD+DCGP++ + +A+Q+G L++ D++ A+ N LT
Sbjct: 321 IMRNSQFYRPTAEDTVAATLKAGLDIDCGPYIQQYAMAAIQKGKLTQQDVDKAVKNLLTT 380
Query: 354 QMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRT 413
+MRLG FDG+P + YG+LG +CT +H+ LALEAA GIVLLKN LPL +
Sbjct: 381 RMRLGHFDGDPKTNVYGNLGAGHICTAEHKNLALEAALDGIVLLKNSAGVLPLKRGTVNS 440
Query: 414 VAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIH-QQGCKDVACADDQLFGAAI 472
AVIG N++ + ++GNY G C TTPLQGI Y + + GC AC A
Sbjct: 441 AAVIGHNANDVLALLGNYWGPPCAPTTPLQGIQGYVKNVKFLAGCNKAACNVAATPQATA 500
Query: 473 DASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPI 532
AS +DA IL MGL Q E+E DR LLLPG QQ L++ V+ A+K P ILVL++GGP+
Sbjct: 501 LAS-SSDAVILFMGLSQEQESEGKDRTTLLLPGNQQSLINAVANAAKRPVILVLLTGGPV 559
Query: 533 DVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEM 592
D+ FA+ +P+I AI+WAGYPGQAGG AIA +LFG NP GKLP TWYP+E+ T +PMT+M
Sbjct: 560 DITFAQANPKIGAILWAGYPGQAGGLAIAKVLFGEKNPSGKLPNTWYPEEF-TRIPMTDM 618
Query: 593 AMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVA----NAPTVVAVPLDGRHGS 648
MR + S YPGRTYRFY G +Y FG+G+SY+ F H V N ++ G
Sbjct: 619 RMRAAGS--YPGRTYRFYNGKTIYKFGYGLSYSKFSHRVVTGRKNPAHNTSLLAAGLAAM 676
Query: 649 INATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVA 707
+S + C++L V V+N G DG HT L+F P A P +QL+
Sbjct: 677 TEDNLSYHVEHIGDVVCDQLKFLAVVKVQNHGPIDGKHTALMFLRWPSATDGRPTRQLIG 736
Query: 708 FEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHA 762
F+ H+ AG + + + C++ S V + G + I G H + +G + +S A
Sbjct: 737 FQSQHIKAGEKANLRFEVSPCEHFSRVRQDGRKVIDKGSHFLKVGKHELEISFGA 791
>gi|414588273|tpg|DAA38844.1| TPA: putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 775
Score = 690 bits (1781), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 352/750 (46%), Positives = 477/750 (63%), Gaps = 19/750 (2%)
Query: 28 FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
F+C P A+ R PFC SLP +R DL+ RL++ EKV L AA VPRLG+ Y+WW
Sbjct: 30 FSCGPSSAS-RAYPFCDRSLPAARRAADLVSRLTVAEKVSQLGDEAAGVPRLGVPPYKWW 88
Query: 88 SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT 147
SE LHG++ G G +F G TSFPQV+ T +SF+ +LW IG+ + EARA+YN G
Sbjct: 89 SEGLHGLAFWGHGMRFNGTVSAVTSFPQVLLTTASFDESLWFRIGQAIGREARALYNLGQ 148
Query: 148 A-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD-----GDRLKVA 201
A GLT WSPNVNIFRDPRWGRGQETPGEDP ++ KYA ++VRG+QGS+ L+ +
Sbjct: 149 AEGLTIWSPNVNIFRDPRWGRGQETPGEDPAVASKYAVAFVRGIQGSNPAGAAAAPLQAS 208
Query: 202 ASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNG 261
A CKH TAYDL++WNGV R++F+A+V+ QD+ DTF+ PF+ CV++GK + VMC+Y +NG
Sbjct: 209 ACCKHATAYDLEDWNGVARYNFDARVTLQDLADTFNPPFQSCVVDGKASCVMCAYTVING 268
Query: 262 VPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC 321
VP CA ++L +T RG W L+GY+ SDCD+V + D Q + TPE+ A A++AGLDL+C
Sbjct: 269 VPACASSDLLTKTFRGAWGLDGYVSSDCDAVAIMRDAQRYEPTPEDTVAVALKAGLDLNC 328
Query: 322 GPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQP-YGHLGPKDVCTP 380
G + H +A+Q+G ++E D++ AL N V+MRLG FDG+P YG LG DVCT
Sbjct: 329 GTYTQQHGMAAIQQGKMTEKDVDKALTNLFAVRMRLGHFDGDPRGNALYGRLGAADVCTA 388
Query: 381 DHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTT 440
DH+ LALEAA+ GIVLLKN LPL + AVIG N++ + + GNY G AC TT
Sbjct: 389 DHKNLALEAAQDGIVLLKNDAGILPLDRSAVGSAAVIGHNANDPLVLSGNYFGPACETTT 448
Query: 441 PLQGIGRYARTIH-QQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRA 499
PL+G+ Y R + GC AC AA AS A+ L MGL Q E E LDR
Sbjct: 449 PLEGLQSYVRNVRFLAGCSSAACGYAATGQAAALAS-SAEYVFLFMGLSQDQEKEGLDRT 507
Query: 500 GLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTA 559
LLLPG+QQ LV+ V+ A+K P +LVL++GGP+D+ FA+++P+I AI+WAGYPGQAGG A
Sbjct: 508 SLLLPGKQQSLVTAVASAAKRPVVLVLLTGGPVDITFAQSNPKIGAILWAGYPGQAGGLA 567
Query: 560 IADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFG 619
IA +LFG NP G+LP+TWY +++ T +PMT+M MR + YPGRTYRFY+G +Y FG
Sbjct: 568 IARVLFGDHNPSGRLPVTWYTEDF-TKVPMTDMRMRADPATGYPGRTYRFYRGKTIYKFG 626
Query: 620 HGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK------CNRLTLGVQ 673
+G+SY+ F + +A P ++A A H C +L +
Sbjct: 627 YGLSYSKFSRQLVTGDKNLA-PNTSLLAHLSAKTQHAATSYYHVDDIGTVGCEQLKFPAE 685
Query: 674 VDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLS 732
V+V N G DG H++L+F P A P +QL+ F H+ AG + V ++ C++ S
Sbjct: 686 VEVLNHGPMDGKHSVLMFLRWPNATDGRPVRQLIGFRSQHIKAGEKANVRFHVSPCEHFS 745
Query: 733 VVDRSGTRRIPLGEHNIHIGGTKHSVSLHA 762
G + I G H + +G + +S A
Sbjct: 746 RTRADGKKVIDRGSHFLMVGKEELEISFEA 775
>gi|414586138|tpg|DAA36709.1| TPA: putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 769
Score = 689 bits (1779), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 344/738 (46%), Positives = 483/738 (65%), Gaps = 14/738 (1%)
Query: 36 TTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVS 95
+T PFC SL IP R L+ L+L EK+ L + A VPRLGI Y+WWSE+LHG++
Sbjct: 33 STSAYPFCDASLSIPARARALVSLLTLDEKIAQLSNTAGGVPRLGIPPYQWWSESLHGLA 92
Query: 96 NVGPGTKFG-GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWS 154
+ GPG F G AT FPQVI + ++FN +LW A+ V+ EA M+N G AGLTYW+
Sbjct: 93 DNGPGVNFSSGPVRAATDFPQVILSTAAFNRSLWRAVAEAVATEALGMHNAGQAGLTYWA 152
Query: 155 PNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD--RLKVAASCKHFTAYDL 212
PN+NIFRDPRWGRGQET GEDP ++ Y+ YV+G QG +G+ R++++A CKH+TAYD+
Sbjct: 153 PNINIFRDPRWGRGQETSGEDPAVAAAYSLEYVKGFQGEEGEEGRIRLSACCKHYTAYDM 212
Query: 213 DNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILK 272
+ W G R+ FNAKV+ QD+EDT+ PF+ C+ E + + +MC+YNQVNGVP CA ++L+
Sbjct: 213 EKWEGFSRYTFNAKVNAQDLEDTYQPPFKTCIQEARASCLMCAYNQVNGVPMCAHKDLLQ 272
Query: 273 RTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESA 332
+T R EW GYI SDCD+V + ++ Q +T + E++ A ++AG+D++CG FL HT+SA
Sbjct: 273 KT-RDEWGFQGYITSDCDAVAIIHENQTYTKSGEDSIAIVLKAGMDINCGSFLVRHTKSA 331
Query: 333 VQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQ 392
+++G + E DI+ AL N +VQ+RLG+FD ++Q + LGP VCT +H+ELA EA RQ
Sbjct: 332 IEKGKIQEEDIDRALFNLFSVQLRLGIFDKPSNNQWFSQLGPNSVCTKEHRELAAEAVRQ 391
Query: 393 GIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA-RT 451
G VLLKN LPL R VA+IGP+++ M G+Y G+ C TT L+GI YA +T
Sbjct: 392 GAVLLKNDHNFLPLKRSEVRHVAIIGPSANDAYAMGGDYTGVPCNPTTFLKGIQAYATQT 451
Query: 452 IHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELV 511
GCKD +C LFG A++A+++AD +++ GL+ + E E DR LLLPG+Q L+
Sbjct: 452 SFAPGCKDASCNSTDLFGEAVEAAKRADIVVVIAGLNLTEEREDFDRVSLLLPGKQMGLI 511
Query: 512 SKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPG 571
++ +K P +LVL+ GGP+DV+FAK DPRIA+I+W GYPG+ GG + +ILFG NPG
Sbjct: 512 HAIASVAKKPLVLVLLGGGPVDVSFAKQDPRIASILWLGYPGEVGGQVLPEILFGEYNPG 571
Query: 572 GKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTV 631
GKLP+TWYP+ + T +PMT+M MR S+ YPGRTYRFY G VVY FG+G+SY+ + +++
Sbjct: 572 GKLPITWYPESF-TAIPMTDMNMRADPSRGYPGRTYRFYTGDVVYGFGYGLSYSKYSYSI 630
Query: 632 ANAPTVVAVPLDGRHGSINATIS-------GKAIKVTHAKCNRLTLGVQVDVKNVGSKDG 684
++AP + V G I+ + G A C L V V V N GS DG
Sbjct: 631 SSAPKKITVSRSSDLGIISRKPAYTRRDGLGSVKTEDIASCEALVFSVHVAVSNHGSMDG 690
Query: 685 AHTLLVFSTPPAG-HWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIP 743
+H +L+F+ + P KQLV FE VH AG+ V I + CK +S + G R +
Sbjct: 691 SHAVLLFARSKSSVPGFPIKQLVGFESVHTAAGSASNVEITVDPCKQMSAANPEGKRVLL 750
Query: 744 LGEHNIHIGGTKHSVSLH 761
LG H + +G + +S+
Sbjct: 751 LGAHVLTVGDEEFELSIE 768
>gi|357156904|ref|XP_003577615.1| PREDICTED: probable beta-D-xylosidase 7-like [Brachypodium
distachyon]
Length = 767
Score = 689 bits (1777), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 353/750 (47%), Positives = 487/750 (64%), Gaps = 18/750 (2%)
Query: 22 LAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGI 81
LA PF+C + + FC +LP+ QR DL+ RL+ EKV L AA VPRLG+
Sbjct: 21 LAGDPPFSCG---QASSSYAFCDAALPVAQRAADLVSRLTAAEKVAQLGDEAAGVPRLGV 77
Query: 82 KGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA 141
GY+WW+EALHG++ G G F G ATSFPQV TA++F+ LW IG+ + EARA
Sbjct: 78 PGYKWWNEALHGLATSGKGLHFDGAVRSATSFPQVCLTAAAFDDDLWFRIGQAIGREARA 137
Query: 142 MYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKV 200
+YN G A GLT WSPNVNI+RDPRWGRGQETPGEDP + +YA ++VRG+QG+ L+
Sbjct: 138 LYNLGQAEGLTMWSPNVNIYRDPRWGRGQETPGEDPTTASRYAVAFVRGMQGNSTSLLQA 197
Query: 201 AASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVN 260
+A CKH TAYDL++WNGV R++F+AKV+ QD+EDTF+ PFR CV++GK + VMC+Y +N
Sbjct: 198 SACCKHATAYDLEDWNGVARYNFDAKVTAQDLEDTFNPPFRSCVVDGKASCVMCAYTGIN 257
Query: 261 GVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLD 320
GVP CA+ ++L +T+RG+W L+GY SDCD+V + D Q + +PE+A A A++AGLD+D
Sbjct: 258 GVPACANADLLTKTVRGDWGLDGYTASDCDAVAIMRDAQRYAQSPEDAVALALKAGLDID 317
Query: 321 CGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTP 380
CG ++ H +A+Q+G ++E DI+ AL N ++MRLG FDG+P + YG LG D+CT
Sbjct: 318 CGTYMQQHAAAAIQQGKITEEDIDKALKNLFAIRMRLGHFDGDPRTNMYGGLGAADICTA 377
Query: 381 DHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTT 440
+H+ LAL+AA+ GIVLLKN LPL + AVIGPN++ +I NY G C TT
Sbjct: 378 EHRSLALDAAQDGIVLLKNDAGILPLDRAAVASTAVIGPNANNPGALIANYFGPPCESTT 437
Query: 441 PLQGIGRYARTIH-QQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRA 499
PL+GI Y + GC AC AA AS +D L MGL Q E+E DR
Sbjct: 438 PLKGIQGYVKDARFLAGCSSTACDVATTDQAAALAS-TSDYVFLFMGLGQRQESEGRDRT 496
Query: 500 GLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTA 559
LLLPG+QQ L++ V+ A++ P ILVL+SGGP+DV FA+ +P+I AI+WAGYPGQAGG A
Sbjct: 497 SLLLPGKQQSLITAVADAAQRPVILVLLSGGPVDVTFAQTNPKIGAILWAGYPGQAGGLA 556
Query: 560 IADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFG 619
IA +LFG NP G+LP+TWYP+E+ TN+PMT+M MR + YPGR+YRFY+G VY FG
Sbjct: 557 IARVLFGDHNPSGRLPVTWYPEEF-TNVPMTDMRMRADPANGYPGRSYRFYQGKTVYKFG 615
Query: 620 HGMSYTNFVHTVANAPTVVAVPLDGRHGSINATI-SGKAI-------KVTHAKCNRLTLG 671
+G+SY+++ + ++ T P S+ T+ S + I ++ C L
Sbjct: 616 YGLSYSSYSRRLLSSGTSTPAPNADLLASLTTTMPSAENILGSYHVEQIGAQGCEMLKFP 675
Query: 672 VQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKY 730
V+V+N G DG ++L++ P A P +QL+ F+K H+ AG + + I C++
Sbjct: 676 AVVEVQNHGPMDGKQSVLMYLRWPNATAGRPERQLIGFKKEHLKAGEKAHIKFEIRPCEH 735
Query: 731 LSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
LS V G + I G H + + KH + +
Sbjct: 736 LSRVREDGNKVIDRGSHFLRV--DKHELEI 763
>gi|297611657|ref|NP_001067709.2| Os11g0291000 [Oryza sativa Japonica Group]
gi|255680005|dbj|BAF28072.2| Os11g0291000 [Oryza sativa Japonica Group]
Length = 764
Score = 687 bits (1774), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 355/733 (48%), Positives = 467/733 (63%), Gaps = 19/733 (2%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
FC L QR DL+ L+L EKV L AA V RLG+ YEWWSE LHG+S G G
Sbjct: 31 FCDAWLTAEQRAADLVANLTLAEKVSQLGDRAAGVARLGVPAYEWWSEGLHGLSIWGRGI 90
Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA-GLTYWSPNVNIF 160
+F G TSFPQVI TA++F+A LW +G V EARA+YN G A GLT WSPNVNIF
Sbjct: 91 RFNGTVRAVTSFPQVILTAAAFDAGLWRRVGEAVGAEARALYNLGQANGLTIWSPNVNIF 150
Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDR 220
RDPRWGRGQETPGEDPV + +YA ++V GLQG G+ +A CKH TAYDLD WN V R
Sbjct: 151 RDPRWGRGQETPGEDPVTASRYAVAFVTGLQGIGGE---ASACCKHATAYDLDYWNNVVR 207
Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
+++++KV+ QD+EDT++ PF+ CV EGK +MC YN +NGVP CA ++L + +R EW
Sbjct: 208 YNYDSKVTLQDLEDTYNPPFKSCVAEGKATCIMCGYNSINGVPACASSDLLTKKVRQEWG 267
Query: 281 LNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSE 340
+NGY+ SDCD+V D H+T +PE+ A +I+ G+D++CG + +H +AVQ+G L+E
Sbjct: 268 MNGYVASDCDAVATIRDAHHYTLSPEDTVAVSIKVGMDVNCGNYTQVHAMAAVQKGNLTE 327
Query: 341 IDINNALVNTLTVQMRLGMFDGEPSSQP-YGHLGPKDVCTPDHQELALEAARQGIVLLKN 399
DI+ ALVN V+MRLG FDG+P S YGHLG DVC+P H+ LALEAA+ GIVLLKN
Sbjct: 328 KDIDRALVNLFAVRMRLGHFDGDPRSNAVYGHLGAADVCSPAHKSLALEAAQDGIVLLKN 387
Query: 400 QGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA--RTIHQQGC 457
+LPL ++AVIGPN+D + GNY G C TTPLQGI Y R GC
Sbjct: 388 DAGALPLQPSAVTSLAVIGPNADNLGALHGNYFGPPCETTTPLQGIKGYLGDRARFLAGC 447
Query: 458 KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMA 517
ACA AA AS +D +L MGL Q E + LDR LLLPG QQ L++ V+ A
Sbjct: 448 DSPACAVAATNEAAALAS-SSDHVVLFMGLSQKQEQDGLDRTSLLLPGEQQGLITAVANA 506
Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
++ P ILVL++GGP+DV FAK++P+I AI+WAGYPGQAGG AIA +LFG NP G+LP+T
Sbjct: 507 ARRPVILVLLTGGPVDVTFAKDNPKIGAILWAGYPGQAGGLAIAKVLFGDHNPSGRLPVT 566
Query: 578 WYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTV------ 631
WYP+E+ T +PMT+M MR + YPGR+YRFY+G VY FG+G+SY+ F +
Sbjct: 567 WYPEEF-TKVPMTDMRMRADPATGYPGRSYRFYQGNTVYNFGYGLSYSKFSRRMFSSFST 625
Query: 632 --ANAPTVVAVPLDGRHGSINATISGKAIK-VTHAKCNRLTLGVQVDVKNVGSKDGAHTL 688
A +++A + R G +S +K + +C+RL V+V+N G DG H++
Sbjct: 626 SNAGNLSLLAGVMARRAGDDGGGMSSYLVKEIGVERCSRLVFPAVVEVQNHGPMDGKHSV 685
Query: 689 LVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEH 747
L++ P P +QL+ F HV G + V + C++ S V G R I G H
Sbjct: 686 LMYLRWPTTSGGRPARQLIGFRSQHVKVGEKAMVSFEVSPCEHFSWVGEDGERVIDGGAH 745
Query: 748 NIHIGGTKHSVSL 760
+ +G + S
Sbjct: 746 FLMVGDEELETSF 758
>gi|297736786|emb|CBI25987.3| unnamed protein product [Vitis vinifera]
Length = 745
Score = 684 bits (1764), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 340/745 (45%), Positives = 466/745 (62%), Gaps = 57/745 (7%)
Query: 22 LAAREPFACDPKDAT-----TRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAV 76
LA + CD ++ FC SL +R DL+ R++LQEKV + A+ V
Sbjct: 41 LAKNYTYVCDESRYALLGLDMKSFAFCDKSLSYKERAKDLVSRMTLQEKVMQSVHTASGV 100
Query: 77 PRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVS 136
RLG+ Y WWSEALHG+SN+GPG F PGATS P VI + ++FN TLW+ +GRVVS
Sbjct: 101 RRLGLPEYSWWSEALHGISNLGPGVFFDETIPGATSLPTVILSTAAFNQTLWKTLGRVVS 160
Query: 137 DEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD 196
E RAMYN G AGLT+WSPN+N+ RD RWGR QET GEDP + G++A +YVRGLQ +G
Sbjct: 161 TEGRAMYNLGHAGLTFWSPNINVVRDTRWGRTQETSGEDPFIVGEFAVNYVRGLQDVEGT 220
Query: 197 RLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSY 256
V++ CKH+ AYD+D+W VDR F+A+VS+QD+++TF PF CV EG V+SVMCS+
Sbjct: 221 E-NVSSCCKHYAAYDIDSWLNVDRHTFDARVSEQDMKETFVSPFERCVREGDVSSVMCSF 279
Query: 257 NQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPE-EAAADAIRA 315
N++NG+P C+DP +LK IR EW L+GYIVSDC + V D Q++ + + +A A ++A
Sbjct: 280 NKINGIPPCSDPRLLKGVIRDEWDLHGYIVSDCYGLEVIVDNQNYLNDSKVDAVAKTLQA 339
Query: 316 GLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPK 375
GLDL+CG + +V G +S+ +++ AL N + MR+G FDG P+ Y LG K
Sbjct: 340 GLDLECGHYYTDALNESVLTGKVSQYELDRALKNIYVLLMRVGYFDGIPA---YESLGLK 396
Query: 376 DVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIA 435
D+C DH ELA EAARQGIVLLKN LPL + +A++GP+++ T MIGNYAG+
Sbjct: 397 DICAADHIELAREAARQGIVLLKNDYEVLPLK--PGKKIALVGPHANATEVMIGNYAGLP 454
Query: 436 CGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEA 495
C Y +PL+ F A + + TI+ +G D SIEAE
Sbjct: 455 CKYVSPLEA-----------------------FSAIGNVTYATGFTIIFVGTDLSIEAEF 491
Query: 496 LDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQA 555
+DR LLPG Q EL+ +V+ S GP ILV++SG ID+ FAKN+PRI+AI+W G+PG+
Sbjct: 492 VDRVDFLLPGNQTELIKQVAEVSSGPVILVVLSGSNIDITFAKNNPRISAILWVGFPGEQ 551
Query: 556 GGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVV 615
GG AIAD++FG NPGG+LP+TWY +Y+ LPM+ M++RP YPGRTY+F+ G V
Sbjct: 552 GGHAIADVVFGKYNPGGRLPVTWYEADYVDMLPMSSMSLRPVDELGYPGRTYKFFDGSTV 611
Query: 616 YPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVD 675
YPFG+GMSYT F +++A + + + L+ KC +V
Sbjct: 612 YPFGYGMSYTKFSYSLATSKISIDIDLN-----------------KFQKCRTF----EVA 650
Query: 676 VKNVGSKDGAHTLLVFSTPPAGHWAPH-KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVV 734
V NVG DG+ L+V+S PP+G H KQ++ F+KV V AG +RV +++ CK L +V
Sbjct: 651 VTNVGMVDGSEVLMVYSIPPSGIVGTHIKQVIGFQKVFVAAGDTERVKFSMNACKSLRIV 710
Query: 735 DRSGTRRIPLGEHNIHIGGTKHSVS 759
D +G +P G H I +G +S S
Sbjct: 711 DSTGYSLLPSGSHTIRVGDYSNSAS 735
>gi|413925164|gb|AFW65096.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 829
Score = 682 bits (1759), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 354/766 (46%), Positives = 475/766 (62%), Gaps = 21/766 (2%)
Query: 12 LILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLIS 71
L+L+ + A PF+C + LPFC LP QR DL+ R++ EK L
Sbjct: 70 LLLMRVVAVVAAGAPPFSCG--GGPSLGLPFCNTKLPAAQRAADLVSRMTPAEKASQLGD 127
Query: 72 GAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFG-GDFPGATSFPQVITTASSFNATLWEA 130
A VPRLG+ Y+WW+EALHGV+ G G G ATSFPQV+ TA+SFN LW
Sbjct: 128 VANGVPRLGVPSYKWWNEALHGVAISGKGIHMDRGAVRSATSFPQVLLTAASFNDNLWFR 187
Query: 131 IGRVVSDEARAMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRG 189
IG+ EARA YN G A GLT WSPNVNIFRDPRWGRGQETPGEDP ++ +YAA++VRG
Sbjct: 188 IGQATGKEARAFYNIGQAEGLTMWSPNVNIFRDPRWGRGQETPGEDPAVASRYAAAFVRG 247
Query: 190 LQGSDGDR------LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMC 243
LQGS + L +A CKH TAYDL++W GV R+ F A V+ QD+ DTF+ PFR C
Sbjct: 248 LQGSSSNTKSVPPVLLTSACCKHATAYDLEDWKGVTRYSFRATVTVQDLADTFNPPFRSC 307
Query: 244 VMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTS 303
V++GK + VMC+Y VNGVP+CA+ ++L +T RG W L+GY+ +DCD+V + ++Q +
Sbjct: 308 VVDGKASCVMCAYTSVNGVPSCANADLLTKTFRGSWGLDGYVAADCDAVSIMRNSQFYRP 367
Query: 304 TPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGE 363
T E+ A ++AGLD+DCGP++ H +A+Q+G L++ D++ A+ N T +MRLG FDG+
Sbjct: 368 TAEDTVATTLKAGLDIDCGPYVQQHAMAAIQKGKLTQQDVDKAVKNLFTTRMRLGHFDGD 427
Query: 364 PSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDV 423
P + YG+LG +CT +H+ LALEAA GIVLLKN LPL + AVIG N++
Sbjct: 428 PKAHVYGNLGAAHICTQEHKNLALEAALDGIVLLKNSAGVLPLKRGSVASAAVIGHNAND 487
Query: 424 TVTMIGNYAGIACGYTTPLQGIGRYARTIH-QQGCKDVACADDQLFGAAIDASRQADATI 482
+ ++GNY G C TTPLQGI Y + + GC AC AA AS +D+ I
Sbjct: 488 VLALLGNYWGPPCAPTTPLQGIQGYVKNVRFLAGCHKAACNVAATPQAAALAS-TSDSVI 546
Query: 483 LVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPR 542
L MGL Q E+E DR LLLPG QQ L++ V+ A+K P ILVL++GGP+D+ FA+ +P+
Sbjct: 547 LFMGLSQEQESEGKDRTTLLLPGNQQSLITAVANAAKRPVILVLLTGGPVDITFAQANPK 606
Query: 543 IAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRY 602
I AI+WAGYPGQAGG AIA +LFG NP G+LP+TWYP+E+ T +PMT+M MR + S Y
Sbjct: 607 IGAILWAGYPGQAGGLAIAKVLFGEKNPSGRLPVTWYPEEF-TKVPMTDMRMRSAGS--Y 663
Query: 603 PGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAP-----TVVAVPLDGRHGSINATISGKA 657
PGR+YRFYKG +Y FG+G+SY+ F H V A + G + +S
Sbjct: 664 PGRSYRFYKGKTIYKFGYGLSYSKFSHRVVTARNNPAHNTTLLLAAGHAATTEDNLSYHV 723
Query: 658 IKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAG 716
+ C +L V V+N G DG HT L+F P A P +QLV F+ H+ AG
Sbjct: 724 DHIGDELCRQLKFLAVVKVQNHGPMDGKHTALMFLRWPNATDGRPARQLVGFQSQHIKAG 783
Query: 717 AQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHA 762
+ + + C+ S V G + I G H + +G + +S A
Sbjct: 784 EKAHLRFEVSPCEDFSRVRDDGRKVIDKGSHFLKVGKHELEISFGA 829
>gi|326517420|dbj|BAK00077.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 781
Score = 681 bits (1757), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 345/746 (46%), Positives = 480/746 (64%), Gaps = 23/746 (3%)
Query: 22 LAAREPFACDPKD-ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLG 80
LAA PF+C P A T+ FC +LP+ QR DL+ RL+ EKV L AA VPRLG
Sbjct: 32 LAADPPFSCGPSSTAATQGYAFCDATLPVAQRAADLVARLTTAEKVAQLGDEAAGVPRLG 91
Query: 81 IKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEAR 140
+ Y+WW+EALHG++ G G F G ATSFPQV TA++F+ LW IG+ + EAR
Sbjct: 92 VPAYKWWNEALHGLATSGKGLHFNGAVRSATSFPQVSLTAAAFDDDLWLRIGQAIGREAR 151
Query: 141 AMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLK 199
A+YN G A GLT WSPNVNI+RDPRWGRGQETPGEDP + +Y ++V+GLQG+
Sbjct: 152 ALYNVGQAEGLTMWSPNVNIYRDPRWGRGQETPGEDPTTASRYGVAFVKGLQGNSTSSSL 211
Query: 200 VA--ASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYN 257
+ A CKH TAYDL++W GV R++F+A+V+ QD+EDT++ PFR CV++GK + VMC+Y
Sbjct: 212 LQTSACCKHATAYDLEDWGGVARYNFDARVTAQDLEDTYNPPFRSCVVDGKASCVMCAYT 271
Query: 258 QVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGL 317
+NGVP CA+ +L T+R +W L+GY+ SDCD+V + D Q + TPE+A A A++AGL
Sbjct: 272 AINGVPACANSGLLTNTVRADWGLDGYVASDCDAVAIMRDAQRYAPTPEDAVALALKAGL 331
Query: 318 DLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDV 377
D+DCG ++ H +A+Q+G ++E D++ AL N ++MRLG FDG+P + YG L +
Sbjct: 332 DIDCGTYMQQHAPAALQQGKITEDDVDKALKNLFAIRMRLGHFDGDPRANIYGGLNAAHI 391
Query: 378 CTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACG 437
CTP+H+ LALEAA+ GIVLLKN LPL + AVIGPN++ +IGNY G C
Sbjct: 392 CTPEHRSLALEAAQDGIVLLKNDAGILPLDRAAIASAAVIGPNANNPGLLIGNYFGPPCE 451
Query: 438 YTTPLQGIGRYARTIH-QQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL 496
TPL+G+ Y + + GC AC AA A +D +L MGL Q E+E
Sbjct: 452 SVTPLKGVQGYVKDVRFMAGCGSAACDVADTDQAATLAG-SSDYVLLFMGLSQQQESEGR 510
Query: 497 DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAG 556
DR LLLPG+QQ L++ V+ A+K P ILVL++GGP+DV FAKN+P+I AI+WAGYPGQAG
Sbjct: 511 DRTSLLLPGQQQSLITAVADAAKRPVILVLLTGGPVDVTFAKNNPKIGAILWAGYPGQAG 570
Query: 557 GTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVY 616
G AIA +LFG NPGG+LP+TWYP+E+ T +PMT+M MR + YPGR+YRFY+G VY
Sbjct: 571 GLAIARVLFGDHNPGGRLPVTWYPEEF-TKVPMTDMRMRADPATGYPGRSYRFYQGETVY 629
Query: 617 PFGHGMSYTNFVHTVANAPT--------VVAVPLDGRHGSINATISGKAIKVTHAKCNRL 668
FG+G+SY+++ + ++ T + +P G + S + C +L
Sbjct: 630 KFGYGLSYSSYSRRLLSSGTPNTDLLAGLSTMPTPAEEGGV---ASYHVEHIGARGCEQL 686
Query: 669 TLGVQVDVKNVGSKDGAHTLLVF---STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINI 725
V+V+N G DG H++L++ + AG P KQL+ F + H+ AG + + +I
Sbjct: 687 KFPAVVEVENHGPMDGKHSVLMYLRWANATAGR--PAKQLIGFRRQHLKAGEKASLTFDI 744
Query: 726 HVCKYLSVVDRSGTRRIPLGEHNIHI 751
C++ S V + G + + G H + +
Sbjct: 745 SPCEHFSRVRKDGNKVVDRGSHFLMV 770
>gi|357138088|ref|XP_003570630.1| PREDICTED: probable beta-D-xylosidase 7-like [Brachypodium
distachyon]
Length = 1026
Score = 680 bits (1754), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 348/693 (50%), Positives = 456/693 (65%), Gaps = 26/693 (3%)
Query: 1 MASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
MA F + L L + +A PF+C + + PFC LPI QR DL RL
Sbjct: 1 MAGRSIIFHVVLPLCLVLQATMATDPPFSC----GSPSSYPFCDRKLPIGQRAADLASRL 56
Query: 61 SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNV---GPGTKFG-GDFPGATSFPQV 116
+++EKV LL + VPRLG+ Y+WWSEALHGV+N G +F G ATSFPQV
Sbjct: 57 TVEEKVSLLGDVSPGVPRLGVPAYKWWSEALHGVANAPADRAGVRFDDGPVRAATSFPQV 116
Query: 117 ITTASSFNATLWEAIGRVVSDEARAMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGED 175
+ TA+SFN LW IG+V+ EAR +YN G A GLT+W+PN+N+FRDPRWGRGQETPGED
Sbjct: 117 LVTAASFNPHLWYRIGQVIGREARGIYNSGQAEGLTFWAPNINVFRDPRWGRGQETPGED 176
Query: 176 PVLSGKYAASYVRGLQGS------DGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSK 229
P ++GKYAA +VRG+QG + L+ +A CKHFTAYDL+NWNGV RF FNAKVS+
Sbjct: 177 PTMTGKYAAVFVRGVQGYGASGAVNSSGLEASACCKHFTAYDLENWNGVTRFAFNAKVSE 236
Query: 230 QDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDC 289
QD+ DT++ PFR CV +G + +MCSYN+VNGVPTCAD N+L +T RG+WR NGYI SDC
Sbjct: 237 QDLADTYNPPFRSCVEDGGASGIMCSYNRVNGVPTCADHNLLSKTARGDWRFNGYITSDC 296
Query: 290 DSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVN 349
D+V + +D Q + PE+A AD ++AG+D++CG ++ H SA +G ++E DI+ AL N
Sbjct: 297 DAVAIIHDVQGYAKEPEDAVADVLKAGMDVNCGDYVQKHGVSAFHQGKITEQDIDRALQN 356
Query: 350 TLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHI 409
++MRLG+FDG P YG++G VC +HQ+LALEAA+ GIVLLKN +LPL
Sbjct: 357 LFAIRMRLGLFDGNPKYNRYGNIGADQVCKKEHQDLALEAAQDGIVLLKNDAGTLPLPKQ 416
Query: 410 RHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYAR-TIHQQGCKDVACADDQLF 468
+ ++AVIG N++ + GNY G C +PLQ + Y R T GC C +
Sbjct: 417 KISSLAVIGHNANDAQRLQGNYFGPPCISVSPLQALQGYVRETKFVAGCNAAVCNVSDIA 476
Query: 469 GAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMS 528
GAA A+ +A+ +L MGLDQ E E LDR L LPG Q+ LV+ V+ A+K P +LVL+
Sbjct: 477 GAAK-AASEAEYVVLFMGLDQDQEREDLDRIELGLPGMQESLVNAVADAAKKPVVLVLLC 535
Query: 529 GGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLP 588
GGP+DV FAK +P+I AIIWAGYPGQAGG AIA +LFG NPGG+LP+TWYP+EY T +
Sbjct: 536 GGPVDVTFAKGNPKIGAIIWAGYPGQAGGIAIAQVLFGEHNPGGRLPVTWYPKEYATAVA 595
Query: 589 MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGS 648
MT+M MR S YPGRTYRFYKG VY FG+G+SY+ + H+ + P P G
Sbjct: 596 MTDMRMRADASTGYPGRTYRFYKGKTVYNFGYGLSYSKYSHSFVSKP-----PASMMIGE 650
Query: 649 INAT---ISGK-AIKVTHAKCNRLTLGVQVDVK 677
+ A+ +SGK A+ ++ R + DVK
Sbjct: 651 VRASLGQLSGKSALYCSNGSVARYLVARNWDVK 683
>gi|413925166|gb|AFW65098.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 830
Score = 677 bits (1748), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 354/767 (46%), Positives = 475/767 (61%), Gaps = 22/767 (2%)
Query: 12 LILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLIS 71
L+L+ + A PF+C + LPFC LP QR DL+ R++ EK L
Sbjct: 70 LLLMRVVAVVAAGAPPFSCG--GGPSLGLPFCNTKLPAAQRAADLVSRMTPAEKASQLGD 127
Query: 72 GAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFG-GDFPGATSFPQVITTASSFNATLWEA 130
A VPRLG+ Y+WW+EALHGV+ G G G ATSFPQV+ TA+SFN LW
Sbjct: 128 VANGVPRLGVPSYKWWNEALHGVAISGKGIHMDRGAVRSATSFPQVLLTAASFNDNLWFR 187
Query: 131 IGRVVSDEARAMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRG 189
IG+ EARA YN G A GLT WSPNVNIFRDPRWGRGQETPGEDP ++ +YAA++VRG
Sbjct: 188 IGQATGKEARAFYNIGQAEGLTMWSPNVNIFRDPRWGRGQETPGEDPAVASRYAAAFVRG 247
Query: 190 LQGSDGDR------LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMC 243
LQGS + L +A CKH TAYDL++W GV R+ F A V+ QD+ DTF+ PFR C
Sbjct: 248 LQGSSSNTKSVPPVLLTSACCKHATAYDLEDWKGVTRYSFRATVTVQDLADTFNPPFRSC 307
Query: 244 VMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNG-YIVSDCDSVGVYYDTQHFT 302
V++GK + VMC+Y VNGVP+CA+ ++L +T RG W L+G Y+ +DCD+V + ++Q +
Sbjct: 308 VVDGKASCVMCAYTSVNGVPSCANADLLTKTFRGSWGLDGRYVAADCDAVSIMRNSQFYR 367
Query: 303 STPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDG 362
T E+ A ++AGLD+DCGP++ H +A+Q+G L++ D++ A+ N T +MRLG FDG
Sbjct: 368 PTAEDTVATTLKAGLDIDCGPYVQQHAMAAIQKGKLTQQDVDKAVKNLFTTRMRLGHFDG 427
Query: 363 EPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSD 422
+P + YG+LG +CT +H+ LALEAA GIVLLKN LPL + AVIG N++
Sbjct: 428 DPKAHVYGNLGAAHICTQEHKNLALEAALDGIVLLKNSAGVLPLKRGSVASAAVIGHNAN 487
Query: 423 VTVTMIGNYAGIACGYTTPLQGIGRYARTIH-QQGCKDVACADDQLFGAAIDASRQADAT 481
+ ++GNY G C TTPLQGI Y + + GC AC AA AS +D+
Sbjct: 488 DVLALLGNYWGPPCAPTTPLQGIQGYVKNVRFLAGCHKAACNVAATPQAAALAS-TSDSV 546
Query: 482 ILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDP 541
IL MGL Q E+E DR LLLPG QQ L++ V+ A+K P ILVL++GGP+D+ FA+ +P
Sbjct: 547 ILFMGLSQEQESEGKDRTTLLLPGNQQSLITAVANAAKRPVILVLLTGGPVDITFAQANP 606
Query: 542 RIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKR 601
+I AI+WAGYPGQAGG AIA +LFG NP G+LP+TWYP+E+ T +PMT+M MR + S
Sbjct: 607 KIGAILWAGYPGQAGGLAIAKVLFGEKNPSGRLPVTWYPEEF-TKVPMTDMRMRSAGS-- 663
Query: 602 YPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAP-----TVVAVPLDGRHGSINATISGK 656
YPGR+YRFYKG +Y FG+G+SY+ F H V A + G + +S
Sbjct: 664 YPGRSYRFYKGKTIYKFGYGLSYSKFSHRVVTARNNPAHNTTLLLAAGHAATTEDNLSYH 723
Query: 657 AIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPA 715
+ C +L V V+N G DG HT L+F P A P +QLV F+ H+ A
Sbjct: 724 VDHIGDELCRQLKFLAVVKVQNHGPMDGKHTALMFLRWPNATDGRPARQLVGFQSQHIKA 783
Query: 716 GAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHA 762
G + + + C+ S V G + I G H + +G + +S A
Sbjct: 784 GEKAHLRFEVSPCEDFSRVRDDGRKVIDKGSHFLKVGKHELEISFGA 830
>gi|242052713|ref|XP_002455502.1| hypothetical protein SORBIDRAFT_03g012290 [Sorghum bicolor]
gi|241927477|gb|EES00622.1| hypothetical protein SORBIDRAFT_03g012290 [Sorghum bicolor]
Length = 825
Score = 676 bits (1744), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 348/755 (46%), Positives = 471/755 (62%), Gaps = 41/755 (5%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG- 100
+C SLP +RV DL+GRLSL+EKV+ L A PR+G+ Y+WW EALHGVS+VGPG
Sbjct: 68 YCDASLPYAERVRDLVGRLSLEEKVRNLGDQAEGAPRVGLPPYKWWGEALHGVSDVGPGG 127
Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIF 160
T FG PGATSFP VI +A++FN +LW AIG VVS E RAMYN G A LTYWSPN+N+
Sbjct: 128 TWFGDVVPGATSFPLVINSAAAFNESLWRAIGGVVSTEIRAMYNLGHAELTYWSPNINVV 187
Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQG------------SDGDRLKVAASCKHFT 208
RDPRWGR ETPGEDP + G+YA ++VRG+Q +KV++ CKHF
Sbjct: 188 RDPRWGRASETPGEDPFVVGRYAVNFVRGMQDVVIAAGAAATADPFSRPIKVSSCCKHFA 247
Query: 209 AYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADP 268
AYD+D W DR F+A+V ++D+ +TF+ PF MC+ +G + VMCSYN++NG+P CAD
Sbjct: 248 AYDVDAWFKADRLTFDAQVEERDMVETFERPFEMCIRDGDASCVMCSYNRINGIPACADA 307
Query: 269 NILKRTIRGEWRLNGYIVSDCDSVGVYY-DTQHFTSTPEEAAADAIRAGLDLDCG----- 322
+L T+R +W+L+GYIVSDCDSV V D + T EA A A++AGLDLDCG
Sbjct: 308 RLLSETVRSQWQLHGYIVSDCDSVRVMVRDAKWLNYTGVEATAAAMKAGLDLDCGMFWEG 367
Query: 323 --PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTP 380
F + AV++G + E D++NAL N T MRLG FDG P + LG DVCT
Sbjct: 368 ARDFFTTYGVDAVRQGKIKEADVDNALGNVYTTLMRLGFFDGMPE---FESLGADDVCTR 424
Query: 381 DHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVT--MIGNYAGIACGY 438
DH+ELA +AARQG+VLLKN LPL + +V+++G + T M+G+Y G C
Sbjct: 425 DHKELAADAARQGMVLLKNDARRLPLDPSKINSVSLVGLLEHINATDVMLGDYRGKPCRI 484
Query: 439 TTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDR 498
TP I + + C AC+ + G A ++ ADATI++ GL+ S+E E+ DR
Sbjct: 485 VTPYDAIRQVVNATYVHACDSGACSTAEGMGRASRTAKIADATIVIAGLNMSVERESNDR 544
Query: 499 AGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGT 558
LLLP Q ++ V+ AS P +LV+MS G +DV+FA+N+ +I AI+WAGYPG+ GGT
Sbjct: 545 EDLLLPWNQSSWINAVAEASTTPIVLVIMSAGGVDVSFAQNNTKIGAIVWAGYPGEEGGT 604
Query: 559 AIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGP-VVYP 617
AIAD+LFG NPGG+LP+TW+ EY+ +PMT MA+RP + YPGRTY+FY GP V+YP
Sbjct: 605 AIADVLFGKYNPGGRLPLTWFKNEYVNQIPMTSMALRPDAAHGYPGRTYKFYGGPAVLYP 664
Query: 618 FGHGMSYTNFVHTVANAPTVVAVPLDG-RHGSINATISGK---------AIKVTHAKCNR 667
FGHG+SYT+F + V +P+ H + SGK A+ V +C+
Sbjct: 665 FGHGLSYTSFTYASGTTGATVTIPIGAWEHCKMLTYKSGKAPSPSPACPALNVASHRCDE 724
Query: 668 LTLGVQVDVKNVGSKDGAHTLLVFSTPP--AGHWAPHKQLVAFEKVHVPAGAQQRVGINI 725
+ + + V N G G H + V++ PP G AP KQLV F +V VPAGA V +
Sbjct: 725 V-VSFSLRVANTGGVGGDHVVPVYTAPPPEVGD-APRKQLVEFRRVFVPAGAAVDVPFAL 782
Query: 726 HVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
+VCK ++V+ + +P G + +G ++S
Sbjct: 783 NVCKTFAIVEETAYTVVPSGVSTVIVGDDALALSF 817
>gi|30316196|sp|P83344.1|XYNB_PRUPE RecName: Full=Putative beta-D-xylosidase; AltName: Full=PpAz152
gi|19879972|gb|AAM00218.1|AF362990_1 beta-D-xylosidase, partial [Prunus persica]
Length = 461
Score = 675 bits (1741), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 331/461 (71%), Positives = 387/461 (83%), Gaps = 2/461 (0%)
Query: 310 ADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPY 369
ADAI+AGLDLDCGPFL +HTE+AV+RGL+S+++IN AL NT+TVQMRLGMFDGEPS+ Y
Sbjct: 1 ADAIKAGLDLDCGPFLAIHTEAAVRRGLVSQLEINWALANTMTVQMRLGMFDGEPSAHQY 60
Query: 370 GHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIG 429
G+LGP+DVCTP HQ+LALEAARQGIVLL+N+G SLPLS RHRTVAVIGPNSDVTVTMIG
Sbjct: 61 GNLGPRDVCTPAHQQLALEAARQGIVLLENRGRSLPLSTRRHRTVAVIGPNSDVTVTMIG 120
Query: 430 NYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQ 489
NYAG+ACGYTTPLQGIGRY RTIHQ GC DV C +QLFGAA A+RQADAT+LVMGLDQ
Sbjct: 121 NYAGVACGYTTPLQGIGRYTRTIHQAGCTDVHCNGNQLFGAAEAAARQADATVLVMGLDQ 180
Query: 490 SIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWA 549
SIEAE +DRAGLLLPG QQELVS+V+ AS+GPTILVLMSGGPIDV FAKNDPRI+AIIW
Sbjct: 181 SIEAEFVDRAGLLLPGHQQELVSRVARASRGPTILVLMSGGPIDVTFAKNDPRISAIIWV 240
Query: 550 GYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRF 609
GYPGQAGGTAIA++LFGT+NPGGKLPMTWYPQ Y+T+LPMT+MAMR ++ YPGRTYRF
Sbjct: 241 GYPGQAGGTAIANVLFGTANPGGKLPMTWYPQNYVTHLPMTDMAMRADPARGYPGRTYRF 300
Query: 610 YKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT 669
Y GPVV+PFG G+SYT F H +A+ PT+V+VPL + N+T+ K ++V+H CN L+
Sbjct: 301 YIGPVVFPFGLGLSYTTFAHNLAHGPTLVSVPLTSLKATANSTMLSKTVRVSHPDCNALS 360
Query: 670 -LGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVC 728
L V VDVKN GS DG HTLLVF++PP G WA KQL+ F K+H+ G+++RV I +HVC
Sbjct: 361 PLDVHVDVKNTGSMDGTHTLLVFTSPPDGKWASSKQLMGFHKIHIATGSEKRVRIAVHVC 420
Query: 729 KYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIK 769
K+LSVVDR G RRIPLGEH + IG H VSL LG IK
Sbjct: 421 KHLSVVDRFGIRRIPLGEHKLQIGDLSHHVSLQ-TNLGEIK 460
>gi|253761860|ref|XP_002489304.1| hypothetical protein SORBIDRAFT_0010s007570 [Sorghum bicolor]
gi|241946952|gb|EES20097.1| hypothetical protein SORBIDRAFT_0010s007570 [Sorghum bicolor]
Length = 750
Score = 673 bits (1737), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 353/765 (46%), Positives = 479/765 (62%), Gaps = 29/765 (3%)
Query: 12 LILLSAS--SSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLL 69
++LL+A SS +A+ F+C P + +R PFC SLP +R DL+ RL++ EKV L
Sbjct: 1 MLLLAAVFFSSAVASDPLFSCGPS-SPSRAYPFCDRSLPAARRAADLVSRLTVAEKVSQL 59
Query: 70 ISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWE 129
AA VPRLG+ Y+WWSE LHG++ G G +F G G TSFPQV+ T +SF+ LW
Sbjct: 60 GDEAAGVPRLGVPPYKWWSEGLHGLAFWGHGMRFNGTVTGVTSFPQVLLTTASFDDGLWF 119
Query: 130 AIGRVVSDEARAMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVR 188
IG+ + EARA+YN G A GLT WSPNVNIFRDPRWGRGQETPGEDP ++ KYA ++VR
Sbjct: 120 RIGQAIGREARALYNLGQAEGLTIWSPNVNIFRDPRWGRGQETPGEDPAVASKYAVAFVR 179
Query: 189 GLQGSDGDR----LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCV 244
G+QGS L+ +A CKH TAYDL++WNGV R++F+A+V+ QD+ DTF+ PF+ CV
Sbjct: 180 GIQGSSAAGAAAPLQASACCKHATAYDLEDWNGVARYNFDARVTAQDLADTFNPPFQSCV 239
Query: 245 MEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTST 304
++GK VMC+Y +NGVP CA ++L +T RG W +GY+ SDCD+V + +D Q + T
Sbjct: 240 VDGKATCVMCAYTGINGVPACASSDLLTKTFRGAWGHDGYVSSDCDAVAIMHDAQRYVPT 299
Query: 305 PEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP 364
PE+ A A++ H +A+Q+G ++E D++ AL N V+MRLG FDG+P
Sbjct: 300 PEDTVAVALKE------------HGMAAIQQGKMTEKDVDKALTNLFAVRMRLGHFDGDP 347
Query: 365 SSQP-YGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDV 423
YGHLG DVCT DH+ LALEAA+ GIVLLKN LPL + AVIG N++
Sbjct: 348 RGNALYGHLGAADVCTADHKNLALEAAQDGIVLLKNDAGILPLDRSAMGSAAVIGHNAND 407
Query: 424 TVTMIGNYAGIACGYTTPLQGIGRYARTIH-QQGCKDVACADDQLFGAAIDASRQADATI 482
+ + GNY G AC TTPLQG+ Y + GC AC AA AS ++
Sbjct: 408 ALVLRGNYFGPACETTTPLQGVQSYVSNVRFLAGCSSAACGYAATGQAAALAS-SSEYVF 466
Query: 483 LVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPR 542
L MGL Q E E LDR LLLPG+QQ L++ V+ A+K P ILVL++GGP+D+ FA+++P+
Sbjct: 467 LFMGLSQDQEKEGLDRTSLLLPGKQQSLITAVASAAKRPVILVLLTGGPVDITFAQSNPK 526
Query: 543 IAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRY 602
I AI+WAGYPGQAGG AIA +LFG NP G+LP+TWYP+E+ T +PMT+M MR + Y
Sbjct: 527 IGAILWAGYPGQAGGLAIARVLFGDHNPSGRLPVTWYPEEF-TKVPMTDMRMRADPANGY 585
Query: 603 PGRTYRFYKGPVVYPFGHGMSYTNFV-HTVANAPTVVAVPLDGRHGSI---NATISGKAI 658
PGR+YRFY+G +Y FG+G+SY+ F V +A L G + +AT
Sbjct: 586 PGRSYRFYRGNTIYKFGYGLSYSKFSRQLVTGGKNQLASLLAGLSATTKDDDATSYYHVD 645
Query: 659 KVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGA 717
+ C +L +V+V+N G DG H++L+F P A P QL+ F H+ AG
Sbjct: 646 DIGADGCEQLRFPAEVEVQNHGPMDGKHSVLMFLRWPNATDGRPVSQLIGFTSQHIKAGE 705
Query: 718 QQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHA 762
+ V ++ C++ S G + I G H + +G + VS A
Sbjct: 706 KANVRFDVRPCEHFSRARADGKKVIDRGSHFLMVGKEEVEVSFEA 750
>gi|62701894|gb|AAX92967.1| beta-xylosidase, putative [Oryza sativa Japonica Group]
gi|77550041|gb|ABA92838.1| Glycosyl hydrolase family 3 C terminal domain containing protein
[Oryza sativa Japonica Group]
Length = 793
Score = 672 bits (1734), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 355/761 (46%), Positives = 467/761 (61%), Gaps = 47/761 (6%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
FC L QR DL+ L+L EKV L AA V RLG+ YEWWSE LHG+S G G
Sbjct: 32 FCDAWLTAEQRAADLVANLTLAEKVSQLGDRAAGVARLGVPAYEWWSEGLHGLSIWGRGI 91
Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA-GLTYWSPNVNIF 160
+F G TSFPQVI TA++F+A LW +G V EARA+YN G A GLT WSPNVNIF
Sbjct: 92 RFNGTVRAVTSFPQVILTAAAFDAGLWRRVGEAVGAEARALYNLGQANGLTIWSPNVNIF 151
Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDR 220
RDPRWGRGQETPGEDPV + +YA ++V GLQG G+ +A CKH TAYDLD WN V R
Sbjct: 152 RDPRWGRGQETPGEDPVTASRYAVAFVTGLQGIGGE---ASACCKHATAYDLDYWNNVVR 208
Query: 221 FHFNAK----------------------------VSKQDIEDTFDVPFRMCVMEGKVASV 252
+++++K V+ QD+EDT++ PF+ CV EGK +
Sbjct: 209 YNYDSKDGASTGKSGETSSQVEKKHGPYEKGYFAVTLQDLEDTYNPPFKSCVAEGKATCI 268
Query: 253 MCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADA 312
MC YN +NGVP CA ++L + +R EW +NGY+ SDCD+V D H+T +PE+ A +
Sbjct: 269 MCGYNSINGVPACASSDLLTKKVRQEWGMNGYVASDCDAVATIRDAHHYTLSPEDTVAVS 328
Query: 313 IRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQP-YGH 371
I+ G+D++CG + +H +AVQ+G L+E DI+ ALVN V+MRLG FDG+P S YGH
Sbjct: 329 IKVGMDVNCGNYTQVHAMAAVQKGNLTEKDIDRALVNLFAVRMRLGHFDGDPRSNAVYGH 388
Query: 372 LGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY 431
LG DVC+P H+ LALEAA+ GIVLLKN +LPL ++AVIGPN+D + GNY
Sbjct: 389 LGAADVCSPAHKSLALEAAQDGIVLLKNDAGALPLQPSAVTSLAVIGPNADNLGALHGNY 448
Query: 432 AGIACGYTTPLQGIGRYA--RTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQ 489
G C TTPLQGI Y R GC ACA AA AS +D +L MGL Q
Sbjct: 449 FGPPCETTTPLQGIKGYLGDRARFLAGCDSPACAVAATNEAAALAS-SSDHVVLFMGLSQ 507
Query: 490 SIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWA 549
E + LDR LLLPG QQ L++ V+ A++ P ILVL++GGP+DV FAK++P+I AI+WA
Sbjct: 508 KQEQDGLDRTSLLLPGEQQGLITAVANAARRPVILVLLTGGPVDVTFAKDNPKIGAILWA 567
Query: 550 GYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRF 609
GYPGQAGG AIA +LFG NP G+LP+TWYP+E+ T +PMT+M MR + YPGR+YRF
Sbjct: 568 GYPGQAGGLAIAKVLFGDHNPSGRLPVTWYPEEF-TKVPMTDMRMRADPATGYPGRSYRF 626
Query: 610 YKGPVVYPFGHGMSYTNFVHTV--------ANAPTVVAVPLDGRHGSINATISGKAIK-V 660
Y+G VY FG+G+SY+ F + A +++A + R G +S +K +
Sbjct: 627 YQGNTVYNFGYGLSYSKFSRRMFSSFSTSNAGNLSLLAGVMARRAGDDGGGMSSYLVKEI 686
Query: 661 THAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQ 719
+C+RL V+V+N G DG H++L++ P P +QL+ F HV G +
Sbjct: 687 GVERCSRLVFPAVVEVQNHGPMDGKHSVLMYLRWPTTSGGRPARQLIGFRSQHVKVGEKA 746
Query: 720 RVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
V + C++ S V G R I G H + +G + S
Sbjct: 747 MVSFEVSPCEHFSWVGEDGERVIDGGAHFLMVGDEELETSF 787
>gi|226506870|ref|NP_001146482.1| uncharacterized protein LOC100280070 precursor [Zea mays]
gi|219887469|gb|ACL54109.1| unknown [Zea mays]
gi|413947917|gb|AFW80566.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 835
Score = 670 bits (1728), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 351/763 (46%), Positives = 471/763 (61%), Gaps = 46/763 (6%)
Query: 30 CDPKDATTRTLP-----FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGY 84
CDP L +C SLP RV DL+GRL+L+EKV+ L A PR+G+ Y
Sbjct: 63 CDPARFVALGLDMSRFRYCDASLPYADRVRDLVGRLALEEKVRNLGDQAEGAPRVGLPPY 122
Query: 85 EWWSEALHGVSNVGPG-TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY 143
+WW EALHGVS+VGPG T FG PGATSFP VI +A++FN +LW AIG VVS E RAMY
Sbjct: 123 KWWGEALHGVSDVGPGGTWFGDVVPGATSFPLVINSAAAFNESLWRAIGGVVSTEIRAMY 182
Query: 144 NGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR------ 197
N G A LTYWSPN+N+ RDPRWGR ETPGEDP + G+YA ++VRG+Q D DR
Sbjct: 183 NLGHAELTYWSPNINVVRDPRWGRASETPGEDPFVVGRYAVNFVRGMQDVD-DRPYAAAA 241
Query: 198 ------LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVAS 251
+KV++ CKHF AYD+D W DR F+A+V ++D+ +TF+ PF MC+ +G +
Sbjct: 242 DPFSRPIKVSSCCKHFAAYDVDAWFKADRLTFDAQVEERDMVETFERPFEMCIRDGDASC 301
Query: 252 VMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYY-DTQHFTSTPEEAAA 310
VMCSYN++NG+P CAD +L T+R +W+L+GYIVSDCDSV V D + T EA A
Sbjct: 302 VMCSYNRINGIPACADARLLSETVRSQWQLHGYIVSDCDSVRVMVRDAKWLNYTGVEATA 361
Query: 311 DAIRAGLDLDCG-------PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGE 363
A++AGLDLDCG F + AV++G + E D++NAL N T MRLG FDG
Sbjct: 362 AAMKAGLDLDCGMFWEGARDFFTTYGVDAVRQGKIKEGDVDNALSNVYTTLMRLGFFDGM 421
Query: 364 PSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDV 423
P + LG +VCT H+ELA +AARQG+VLLKN LPL + +V+++G +
Sbjct: 422 PE---FESLGASNVCTDGHKELAADAARQGMVLLKNDARRLPLDPNKINSVSLVGLLEHI 478
Query: 424 TVT--MIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADAT 481
T M+G+Y G C TP I + C AC + G A ++ ADAT
Sbjct: 479 NATDVMLGDYRGKPCRIVTPYNAIRNMVNATYVHACDSGACNTAEGMGRASSTAKIADAT 538
Query: 482 ILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDP 541
I++ GL+ S+E E+ DR LLLP Q ++ V+MAS P +LV+MS G +DV+FA N+
Sbjct: 539 IVIAGLNMSVERESNDREDLLLPWNQSSWINAVAMASPTPIVLVIMSAGGVDVSFAHNNT 598
Query: 542 RIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKR 601
+I AI+WAGYPG+ GGTAIAD+LFG NPGG+LP+TW+ EY+ +PMT MA+RP +
Sbjct: 599 KIGAIVWAGYPGEEGGTAIADVLFGKYNPGGRLPLTWFKNEYVNQIPMTSMALRPDAALG 658
Query: 602 YPGRTYRFYKGP-VVYPFGHGMSYTNFVHTVANAPTVVAV---------PLDGRHGSINA 651
YPGRTY+FY GP V+YPFGHG+SYTNF + V + L + G+ +
Sbjct: 659 YPGRTYKFYGGPAVLYPFGHGLSYTNFSYASGTTGATVTIHIGAWEHCKMLTYKMGAPSP 718
Query: 652 TISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPP--AGHWAPHKQLVAFE 709
+ + A+ V C+ + + + V N G G H + V++ PP G AP KQLVAF
Sbjct: 719 SPACPALNVASHMCSEV-VSFSLRVANTGGVGGDHVVPVYTAPPPEVGD-APLKQLVAFR 776
Query: 710 KVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
+V VPAGA V ++VCK ++V+ + +P G + +G
Sbjct: 777 RVFVPAGAAVDVPFALNVCKTFAIVEETAYTVVPSGVSTVVVG 819
>gi|326523729|dbj|BAJ93035.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 810
Score = 666 bits (1718), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 355/771 (46%), Positives = 469/771 (60%), Gaps = 54/771 (7%)
Query: 30 CDPKDATTRTLP-----FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGY 84
CDP L +C SLP RV DL+GRL+L+EKV+ L A R+G+ Y
Sbjct: 46 CDPARFAALGLEMAGFRYCDASLPYADRVRDLVGRLTLEEKVRNLGDRAEGAARVGLPPY 105
Query: 85 EWWSEALHGVSNVGPG-TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY 143
WW EALHGVS+ GPG T+FG PGATSFP VI +A++FN TLW AIG VS E RAMY
Sbjct: 106 LWWGEALHGVSDTGPGGTRFGDVVPGATSFPLVINSAAAFNETLWGAIGGAVSTEIRAMY 165
Query: 144 NGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDG-------- 195
N G A LTYWSPN+N+ RDPRWGR ETPGEDP + G+YA S+VR +Q DG
Sbjct: 166 NLGHAELTYWSPNINVVRDPRWGRASETPGEDPFVVGRYAVSFVRAMQDIDGAGPGAGAD 225
Query: 196 ---DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASV 252
+KV++ CKH+ AYD+D W DR F+A+V ++D+ +TF+ PF MCV +G + V
Sbjct: 226 PFARPIKVSSCCKHYAAYDVDAWLTADRLTFDAQVEERDMIETFERPFEMCVRDGDASCV 285
Query: 253 MCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYY-DTQHFTSTPEEAAAD 311
MCSYN++NGVP CA+ +L T+RGEW+L+GYIVSDCDSV V D + EA A
Sbjct: 286 MCSYNRINGVPACANARLLSETVRGEWQLHGYIVSDCDSVRVMVRDAKWLGYNGVEATAA 345
Query: 312 AIRAGLDLDCGPF----------LGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFD 361
A++AGLDLDCG F GL AV++G L E +++NAL N MRLG FD
Sbjct: 346 AMKAGLDLDCGMFWEGAQDFFTAFGL---DAVRQGKLRESEVDNALRNLYLTLMRLGFFD 402
Query: 362 GEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNS 421
G P + LG DVCT +H+ELA +AARQG+VL+KN LPL + +++++G
Sbjct: 403 GIPELE---SLGANDVCTEEHKELAADAARQGMVLIKNDHGRLPLDTSKVNSLSLVGLLQ 459
Query: 422 DVTVT--MIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQAD 479
+ T M+G+Y G C TP I + Q C AC+ + D
Sbjct: 460 HINATDVMLGDYRGKPCRVVTPYDAIRKVVSATSMQVCDHGACS-------TAANGKTVD 512
Query: 480 ATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKN 539
ATI++ GL+ S+E E DR LLLP Q ++ V+ AS P ILV++S G +DV+FA+N
Sbjct: 513 ATIVIAGLNMSVEKEGNDREDLLLPWNQTNWINAVAEASPYPIILVIISAGGVDVSFAQN 572
Query: 540 DPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQS 599
+P+I AI+WAGYPG+ GGTAIAD+LFG NPGG+LP+TWY EYI+ +PMT MA+RP
Sbjct: 573 NPKIGAIVWAGYPGEEGGTAIADVLFGKYNPGGRLPLTWYKSEYISKIPMTSMALRPVAD 632
Query: 600 KRYPGRTYRFYKGP-VVYPFGHGMSYTNFVHTVANAPTVVAVPLDG--------RHGSIN 650
K YPGRTY+FY GP V+YPFGHG+SY+NF + V V + R
Sbjct: 633 KGYPGRTYKFYGGPEVLYPFGHGLSYSNFSYASDTTGASVTVRVGAWESCKQLTRKPGTT 692
Query: 651 ATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG-HWAPHKQLVAFE 709
A ++ A+ V C + + V N GS+DGAH ++V++ PPA AP KQLVAF
Sbjct: 693 APLACPAVNVAGHGCKE-EVSFSLTVANRGSRDGAHVVMVYTVPPAEVDDAPLKQLVAFR 751
Query: 710 KVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
+V VPAGA +V ++VCK ++V+ + +P G + +G S S
Sbjct: 752 RVFVPAGAAVQVPFTLNVCKAFAIVEETAYTVVPSGVSTVLVGDDALSFSF 802
>gi|14164501|dbj|BAB55751.1| putative alpha-L-arabinofuranosidase/beta-D- xylosidase isoenzyme
ARA-I [Oryza sativa Japonica Group]
Length = 818
Score = 665 bits (1715), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 360/772 (46%), Positives = 475/772 (61%), Gaps = 51/772 (6%)
Query: 30 CDPKDATTRTL-----PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGY 84
CDP L P+C SLP RV DL+GR++L+EKV L A PR+G+ Y
Sbjct: 47 CDPARFAAAGLDMAGFPYCDASLPYADRVRDLVGRMTLEEKVANLGDRAGGAPRVGLPRY 106
Query: 85 EWWSEALHGVSNVGPG-TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY 143
WW EALHGVS+VGPG T FG PGATSFP VI +A+SFN TLW AIG VVS E RAMY
Sbjct: 107 LWWGEALHGVSDVGPGGTWFGDAVPGATSFPLVINSAASFNETLWRAIGGVVSTEIRAMY 166
Query: 144 NGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR------ 197
N G A LTYWSPN+N+ RDPRWGR ETPGEDP + G+YA ++VRG+Q DG
Sbjct: 167 NLGHAELTYWSPNINVVRDPRWGRASETPGEDPFVVGRYAVNFVRGMQDIDGATTAASAA 226
Query: 198 ---------LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGK 248
+KV++ CKH+ AYD+D WNG DR F+A+V ++D+ +TF+ PF MC+ +G
Sbjct: 227 AATDAFSRPIKVSSCCKHYAAYDVDAWNGTDRLTFDARVQERDMVETFERPFEMCIRDGD 286
Query: 249 VASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYY-DTQHFTSTPEE 307
+ VMCSYN++NGVP CAD +L T+R +W+L+GYIVSDCDSV V D + T E
Sbjct: 287 ASCVMCSYNRINGVPACADARLLTETVRRDWQLHGYIVSDCDSVRVMVRDAKWLGYTGVE 346
Query: 308 AAADAIRAGLDLDCGPFL-GLHT------ESAVQRGLLSEIDINNALVNTLTVQMRLGMF 360
A A A++AGLDLDCG F G+H AV++G L E ++NAL N MRLG F
Sbjct: 347 ATAAAMKAGLDLDCGMFWEGVHDFFTTYGVDAVRQGKLKESAVDNALTNLYLTLMRLGFF 406
Query: 361 DGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPN 420
DG P + LG DVCT +H+ELA +AARQG+VLLKN LPLS + +VA+ G
Sbjct: 407 DGIPELE---SLGAADVCTEEHKELAADAARQGMVLLKNDAALLPLSPEKVNSVALFGQL 463
Query: 421 SDVTVT--MIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQA 478
+ T M+G+Y G C TP G+ + + C +C A A++
Sbjct: 464 QHINATDVMLGDYRGKPCRVVTPYDGVRKVVSSTSVHACDKGSC------DTAAAAAKTV 517
Query: 479 DATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAK 538
DATI+V GL+ S+E E+ DR LLLP Q ++ V+ AS P +LV+MS G +DV+FA+
Sbjct: 518 DATIVVAGLNMSVERESNDREDLLLPWSQASWINAVAEASPSPIVLVIMSAGGVDVSFAQ 577
Query: 539 NDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQ 598
++P+I A++WAGYPG+ GGTAIAD+LFG NPGG+LP+TWY EY++ +PMT MA+RP
Sbjct: 578 DNPKIGAVVWAGYPGEEGGTAIADVLFGKYNPGGRLPLTWYKNEYVSKIPMTSMALRPDA 637
Query: 599 SKRYPGRTYRFYKGP-VVYPFGHGMSYTNFVHTVANAPTVVAVPLDG----RHGSINATI 653
YPGRTY+FY G V+YPFGHG+SYTNF + A A V V + + + A +
Sbjct: 638 EHGYPGRTYKFYGGADVLYPFGHGLSYTNFTYASATAAAPVTVKVGAWEYCKQLTYKAGV 697
Query: 654 SG----KAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG-HWAPHKQLVAF 708
S A+ V C + V V N G +DG H + +++ PPA AP KQLVAF
Sbjct: 698 SSPPACPAVNVASHACQE-EVSFAVTVANTGGRDGTHVVPMYTAPPAEVDGAPRKQLVAF 756
Query: 709 EKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
+V V AGA V ++VCK ++V+ + +P G + +G S+S
Sbjct: 757 RRVRVAAGAAVEVAFALNVCKAFAIVEETAYTVVPSGVSRVLVGDDALSLSF 808
>gi|343172466|gb|AEL98937.1| beta-xylosidase, partial [Silene latifolia]
gi|343172468|gb|AEL98938.1| beta-xylosidase, partial [Silene latifolia]
Length = 374
Score = 662 bits (1708), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 302/374 (80%), Positives = 341/374 (91%)
Query: 78 RLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSD 137
RLG++GYEWWSEALHGVSNVGPGTKF G FP ATSFPQVITTA+SFNA+LW+AIG+ VSD
Sbjct: 1 RLGLQGYEWWSEALHGVSNVGPGTKFQGAFPAATSFPQVITTAASFNASLWQAIGQAVSD 60
Query: 138 EARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR 197
EARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDP LS +YAASYV GLQG+ G+R
Sbjct: 61 EARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPTLSAQYAASYVTGLQGNYGNR 120
Query: 198 LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYN 257
LKVAA CKH+TAYDLDNWNG+DRFHFNAKVSKQD+EDT++VPF+ CV+EGKVASVMCSYN
Sbjct: 121 LKVAACCKHYTAYDLDNWNGMDRFHFNAKVSKQDLEDTYNVPFKACVLEGKVASVMCSYN 180
Query: 258 QVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGL 317
QVNG PTCADP+IL+ TIRG+W LNGYIVSDCDSVGV YD QH+T TPEEAAAD I AGL
Sbjct: 181 QVNGKPTCADPDILRNTIRGQWHLNGYIVSDCDSVGVLYDDQHYTRTPEEAAADTINAGL 240
Query: 318 DLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDV 377
DLDCGPFL +HTE A+++GL++E +N AL NT+TVQMRLGMFDGEPS+QP+G+LGP+DV
Sbjct: 241 DLDCGPFLAVHTEGAIRQGLVTEAAVNQALANTITVQMRLGMFDGEPSAQPFGNLGPRDV 300
Query: 378 CTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACG 437
CTP HQ+LAL+AAR+GIVLLKNQ SLPLS +RHR +AVIGPN+ T TMIGNYAGIACG
Sbjct: 301 CTPAHQDLALQAAREGIVLLKNQVGSLPLSTVRHRNIAVIGPNAQATTTMIGNYAGIACG 360
Query: 438 YTTPLQGIGRYART 451
YT+PLQGI RYART
Sbjct: 361 YTSPLQGISRYART 374
>gi|326513064|dbj|BAK03439.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 694
Score = 659 bits (1701), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 336/643 (52%), Positives = 436/643 (67%), Gaps = 26/643 (4%)
Query: 126 TLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAAS 185
TL E +G +V+ + A+ G +WS ETPGEDP+L+ KYA
Sbjct: 70 TLAEKVGFLVNKQP-ALGRLGIPAYEWWS---------------ETPGEDPLLASKYAVG 113
Query: 186 YVRGLQ-----GSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPF 240
YV GLQ G LKVAA CKH+TAYD+DNW GV+R+ F+AKVS+QD++DTF PF
Sbjct: 114 YVTGLQDAGAGGVTDGALKVAACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPF 173
Query: 241 RMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQH 300
+ CV++G VASVMCSYN+VNG PTCAD ++L+ IRG+W+LNGYIVSDCDSV V Y QH
Sbjct: 174 KSCVLDGNVASVMCSYNKVNGKPTCADKDLLEGVIRGDWKLNGYIVSDCDSVDVLYTQQH 233
Query: 301 FTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMF 360
+T TPEEAAA I++GLDL+CG FL HT +AVQ G LSE D++ A+ N + MRLG F
Sbjct: 234 YTKTPEEAAAITIKSGLDLNCGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLGFF 293
Query: 361 DGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPN 420
DG+P +G LGPKDVCT ++ELA E ARQGIVLLKN G +LPLS +++AVIGPN
Sbjct: 294 DGDPRQLAFGSLGPKDVCTSSNRELARETARQGIVLLKNSG-ALPLSAKSIKSMAVIGPN 352
Query: 421 SDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQL-FGAAIDASRQAD 479
++ + TMIGNY G C YTTPLQG+G T++Q GC +V C+ + L A+ A+ AD
Sbjct: 353 ANASFTMIGNYEGTPCKYTTPLQGLGAKVNTVYQPGCTNVGCSGNSLQLSTAVAAAASAD 412
Query: 480 ATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKN 539
T+LV+G DQSIE E+LDR LLLPG+Q +LVS V+ AS GP ILV+MSGGP D++FAK
Sbjct: 413 VTVLVVGADQSIERESLDRTSLLLPGQQTQLVSAVANASSGPVILVVMSGGPFDISFAKA 472
Query: 540 DPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQS 599
+IAAI+W GYPG+AGG A+ADILFG+ NP G+LP+TWYP Y + MT+M MRP S
Sbjct: 473 SDKIAAILWVGYPGEAGGAALADILFGSHNPSGRLPVTWYPASYADTVTMTDMRMRPDTS 532
Query: 600 KRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANA-PTVVAVPLDGRHGSINATISGKAI 658
YPGRTYRFY G V+ FG G+SYT H++ +A P+ V++ L H + ++
Sbjct: 533 TGYPGRTYRFYTGDTVFAFGDGLSYTKMSHSLVSAPPSYVSMRLAEDHPCRAEECA--SV 590
Query: 659 KVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQ 718
+ C+ L V++ V+N G GAH++L+FS+PP H AP K L+ FEKV + G
Sbjct: 591 EAAGDHCDDLAFDVKLQVRNAGEVAGAHSVLLFSSPPPAHNAPAKHLLGFEKVSLAPGEA 650
Query: 719 QRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
V + VC+ LSVVD G R++ LG H +H+G KH+V L
Sbjct: 651 GTVAFRVDVCRDLSVVDELGGRKVALGGHTLHVGDLKHTVELR 693
Score = 63.5 bits (153), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 29/62 (46%), Positives = 35/62 (56%)
Query: 28 FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
FACD +AT FC R DL+ RL+L EKV L++ A+ RLGI YEWW
Sbjct: 37 FACDASNATLAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPALGRLGIPAYEWW 96
Query: 88 SE 89
SE
Sbjct: 97 SE 98
>gi|359477633|ref|XP_003632006.1| PREDICTED: LOW QUALITY PROTEIN: beta-D-xylosidase 3-like [Vitis
vinifera]
Length = 781
Score = 655 bits (1690), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 361/756 (47%), Positives = 474/756 (62%), Gaps = 40/756 (5%)
Query: 30 CDPKDATT-----RTLPFCQVSLPIPQ-RVNDLIGRLSLQEKVKLLISGAAAVPRLGIKG 83
CDP + +C SLPI RV DL+ R++L+EK +I AA V R+G+
Sbjct: 17 CDPARFAALGFDMKDFVYCNSSLPIYDVRVKDLVDRMTLEEKATNVIYKAAGVERIGLPP 76
Query: 84 YEWWSEALHGVSNV---GPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEAR 140
Y+WWSEALHGVS+V GP T F PGATSFP VI +A+SFN +LW+ I +VVS EAR
Sbjct: 77 YQWWSEALHGVSSVSINGP-TFFDETVPGATSFPNVILSAASFNQSLWKTIRQVVSKEAR 135
Query: 141 AMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR--- 197
A YN G AGLT+W PNVN+ RDPRWGR QET GEDP YA SYVRGLQ +G
Sbjct: 136 ATYNLGHAGLTFWCPNVNVARDPRWGRTQETXGEDPFTVSVYAVSYVRGLQDVEGTENTT 195
Query: 198 ------LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVAS 251
LKV++S KHF AYDLDNW VDR HFNA+VS+QD+ +TF PF CV EG V+
Sbjct: 196 DLNSRPLKVSSSGKHFAAYDLDNWLNVDRNHFNARVSEQDMAETFLRPFEACVREGDVSG 255
Query: 252 VMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTS-TPEEAAA 310
VMCS+N +NG+P CADP + K TIR EW L+GYIVSDC S+ + Q F T EEA A
Sbjct: 256 VMCSFNNINGIPPCADPRLFKGTIRDEWNLHGYIVSDCWSIETIVEDQKFLDVTGEEAVA 315
Query: 311 DAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYG 370
++AGLDL+CG + SAV G + + D++ +L N V MRLG FDG P+
Sbjct: 316 LNLKAGLDLECGHYYNDSPASAVMAGRVGQHDLDQSLSNLYVVLMRLGFFDGIPA---LA 372
Query: 371 HLGPKDVC-TPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIG 429
LG D+C + +H ELA EAARQGIVLLKN +LPL +++ +A++GPN+D M+G
Sbjct: 373 SLGKDDICLSAEHIELAREAARQGIVLLKNDNATLPLKSVKN--LALVGPNADAYGAMMG 430
Query: 430 NYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGL-D 488
NYAG C +P ++ GC DV C +D A++A++ AD TI+V+G+ D
Sbjct: 431 NYAGPPCRSVSPRDAFSAIGNVTYEMGCGDVLCHNDTYVYKAVEAAKHADTTIIVVGITD 490
Query: 489 QSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLM--SGGPIDVAFAKNDPRIAAI 546
SI E DR LLLPG Q LV++++ A+ P ILV+ GGPID++FA+++P I I
Sbjct: 491 VSIGTEDKDRVDLLLPGYQTHLVNQIAKATTAPIILVVCGHCGGPIDISFARDNPGIEPI 550
Query: 547 IWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRT 606
+WAG+PG+ GG AIAD+++G NPGG+LP+TWY Y+ LPMT MA+R +S YPGR
Sbjct: 551 LWAGFPGEEGGNAIADVVYGKYNPGGRLPVTWYENGYVGMLPMTSMALRSVESLGYPGRK 610
Query: 607 YRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA-------VPLDGRHGSINATI-SGKAI 658
Y+F+ G VYPFG G+SYTNF +++ APT P SI + I A+
Sbjct: 611 YKFFSGSTVYPFGCGLSYTNFSYSLT-APTRSIHTHLKKLQPCRSMAYSICSVIPQCPAV 669
Query: 659 KVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPH-KQLVAFEKVHVPAGA 717
V CN T +V VK VGS DG+ ++V+S+PP+G H KQ++ FE+V V G
Sbjct: 670 LVDDLSCNE-TFEFEVAVKTVGSMDGSEVVIVYSSPPSGIVGTHIKQVIGFERVFVKVGX 728
Query: 718 QQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
++V +++VCK L +V SG +P G I GG
Sbjct: 729 VEKVKFSMNVCKSLGIVHSSGHTLLPSGSDIIKAGG 764
>gi|222629257|gb|EEE61389.1| hypothetical protein OsJ_15562 [Oryza sativa Japonica Group]
Length = 771
Score = 654 bits (1686), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 335/743 (45%), Positives = 467/743 (62%), Gaps = 31/743 (4%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV-----S 95
PFC +LP P R L+ L+L EK+ L+ P + GV +
Sbjct: 39 PFCNATLPFPARARALVSLLTLDEKIAQLLQHRRGRPPPRRPALR----VVVGVPSTASA 94
Query: 96 NVGPG-TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWS 154
GPG T G AT FPQVI +A++FN +LW A R ++ EARAM+N G AGLT+W+
Sbjct: 95 TTGPGSTSPRGPVRSATIFPQVILSAAAFNRSLWRAAARAIAVEARAMHNAGQAGLTFWA 154
Query: 155 PNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD--RLKVAASCKHFTAYDL 212
PN+N+FRDPRWGRGQETPGEDP + Y+ YV+G Q G+ R+ ++A CKH+ AYDL
Sbjct: 155 PNINVFRDPRWGRGQETPGEDPAVVSAYSVEYVKGFQRDYGEEGRMMLSACCKHYIAYDL 214
Query: 213 DNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILK 272
+ W G R+ FNAKV+ QD+EDT+ PF+ C+ EG+ + +MCSYNQVNGVP CA +IL+
Sbjct: 215 EKWRGFTRYTFNAKVNAQDMEDTYQPPFKSCIQEGRASCLMCSYNQVNGVPACARKDILQ 274
Query: 273 RTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESA 332
R R EW GYI SDCD+V + ++ Q +T++ E++ A ++AG+D++CG FL HT+SA
Sbjct: 275 RA-RDEWGFQGYITSDCDAVAIIHENQTYTASDEDSIAVVLKAGMDINCGSFLIRHTKSA 333
Query: 333 VQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQ 392
+++G + E DIN+AL N +VQ+RLG FD +Q + LGP +VCT +H+ELA EA RQ
Sbjct: 334 IEKGKVQEEDINHALFNLFSVQLRLGFFDKTNENQWFTQLGPNNVCTTEHRELAAEAVRQ 393
Query: 393 GIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA-RT 451
G VLLKN LPL +A+IGP ++ + G+Y G+ C TT ++G+ Y +T
Sbjct: 394 GTVLLKNDNGFLPLKRSEVGHIALIGPAANDPYILGGDYTGVPCHSTTFVKGMQAYVPKT 453
Query: 452 IHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELV 511
GCKDV C FG AI+A+++AD +L+ GL+ + E E DR LLLPGRQ +L+
Sbjct: 454 TFAAGCKDVPCNSTDGFGEAIEAAKRADVVVLIAGLNLTEETEDHDRVSLLLPGRQMDLI 513
Query: 512 SKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPG 571
V+ +K P +LVLM GGP+DV+FAK+DPRIA+I+W GYPG+ GG + +ILFG NPG
Sbjct: 514 HTVASVTKKPVVLVLMGGGPVDVSFAKHDPRIASILWIGYPGEVGGNVLPEILFGKYNPG 573
Query: 572 GKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTV 631
GKLP+TWYP+ + T +PM +M MR S+ YPGRTYRFY G VVY FG+G+SY+ + +++
Sbjct: 574 GKLPITWYPESF-TAVPMDDMNMRADASRGYPGRTYRFYTGDVVYGFGYGLSYSKYSYSI 632
Query: 632 ANAPTVVAVPLDGRHGSINATISGKAIKVTH-----------AKCNRLTLGVQVDVKNVG 680
AP +++ S+ IS K A C L V + V N G
Sbjct: 633 LQAPKKISL----SRSSVPDLISRKPAYTRRDGVDYVQVEDIASCEALQFPVHISVSNDG 688
Query: 681 SKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGT 739
+ DG+H +L+F S+ P+ +P KQLV FE+VH AG V I + CK +S + GT
Sbjct: 689 AMDGSHAVLLFASSKPSFPGSPIKQLVGFERVHTAAGRSTDVEITVDPCKLMSFANTEGT 748
Query: 740 RRIPLGEHNIHIGGTKHSVSLHA 762
R + LG H + +G +H + + A
Sbjct: 749 RVLFLGTHVLMVGDEEHELLIEA 771
>gi|357128056|ref|XP_003565692.1| PREDICTED: beta-D-xylosidase 3-like [Brachypodium distachyon]
Length = 821
Score = 652 bits (1683), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 353/774 (45%), Positives = 466/774 (60%), Gaps = 49/774 (6%)
Query: 30 CDPKDATTRTLP-----FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGA-AAVPRLGIKG 83
CDP + L +C SLP +RV DL+GRL+L+EKV L A A R+G+
Sbjct: 46 CDPARFASLGLDMAGFRYCDASLPYAERVRDLVGRLTLEEKVANLGDQAKGAEQRVGLPR 105
Query: 84 YEWWSEALHGVSNVGPG-TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAM 142
Y WW EALHGVS+ PG T+FG PGATSFP V+ +A++FN TLW AIG S E RAM
Sbjct: 106 YMWWGEALHGVSDTNPGGTRFGDVVPGATSFPLVLNSAAAFNETLWRAIGGATSTEIRAM 165
Query: 143 YNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD-------- 194
YN G A LTYWSPN+N+ RDPRWGR ETPGEDP L G++A S+VR +Q D
Sbjct: 166 YNLGHAELTYWSPNINVVRDPRWGRASETPGEDPFLVGRFAVSFVRAMQDIDDGANAGAG 225
Query: 195 -----GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKV 249
RLKV++ CKH+ AYD+D W G DR F+A V ++D+ +TF+ PF MCV +G
Sbjct: 226 AADPFARRLKVSSCCKHYAAYDVDKWFGADRLSFDANVQERDMVETFERPFEMCVRDGDA 285
Query: 250 ASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYY-DTQHFTSTPEEA 308
+ VMCSYN++NGVP CA+ +L T+R +W+L+GYIVSDCDSV V D + +A
Sbjct: 286 SCVMCSYNRINGVPACANGRLLTGTVRRDWQLHGYIVSDCDSVRVMVRDAKWLGYDGVQA 345
Query: 309 AADAIRAGLDLDCG-------PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFD 361
A A++AGLDLDCG F + AV++G L E +++ AL + MRLG FD
Sbjct: 346 TAAAMKAGLDLDCGMFWEGAKDFFTAYGLQAVRQGKLKEAEVDEALGHLYLTLMRLGFFD 405
Query: 362 GEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNS 421
G P Q LG DVCT +H+E+A EAARQG+VLLKN LPL + ++A++G
Sbjct: 406 GSPEFQ---SLGASDVCTEEHKEMAAEAARQGMVLLKNDHDRLPLDANKVNSLALVGLLQ 462
Query: 422 DVTVT--MIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQAD 479
+ T M+G+Y G C TP + I + Q C AC L GAAI A++ D
Sbjct: 463 HINATDVMLGDYRGKPCRVVTPYEAIRKVVSGTSMQACDKGACGTTAL-GAAI-AAKTVD 520
Query: 480 ATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKN 539
ATI++ GL+ S+E E DR LLLP Q + ++ V+ AS+ P LV++S G +D++FA+N
Sbjct: 521 ATIVITGLNMSVEREGNDREDLLLPWDQTQWINAVAEASRDPITLVIISAGGVDISFAQN 580
Query: 540 DPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQS 599
+P+I AI+WAGYPG+ GGT IAD+LFG NPGG+LP+TWY EYI LPMT MA+RP
Sbjct: 581 NPKIGAILWAGYPGEEGGTGIADVLFGKYNPGGRLPLTWYKNEYIGKLPMTSMALRPVAD 640
Query: 600 KRYPGRTYRFYKGP-VVYPFGHGMSYTNFVHTVANAPTVVAVP-----------LDGRHG 647
K YPGRTY+FY GP V+YPFGHG+SYTNF + V V L + G
Sbjct: 641 KGYPGRTYKFYSGPDVLYPFGHGLSYTNFTYDSYTTGASVTVKIGTAWEDSCKNLTYKPG 700
Query: 648 SINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG-HWAPHKQLV 706
+ +T AI V C + + V N G G+H + V++ PPA AP KQLV
Sbjct: 701 TTASTAPCPAINVAGHGCQE-EVSFTLKVSNTGGIGGSHVVPVYTAPPAEVDDAPLKQLV 759
Query: 707 AFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
AF ++ VPAG V + VCK ++V+ + +P G + +G S S
Sbjct: 760 AFRRMFVPAGDAVEVPFTLSVCKAFAIVEGTAYTVVPAGVSRVLVGDESLSFSF 813
>gi|90399376|emb|CAJ86207.1| B1011H02.4 [Oryza sativa Indica Group]
Length = 738
Score = 642 bits (1656), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 330/738 (44%), Positives = 456/738 (61%), Gaps = 54/738 (7%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
PFC +LP P R L+ L+L EK+ L + AA PRLG+ +EWWSE+LHGV + GPG
Sbjct: 39 PFCNATLPFPARARALVSLLTLDEKIAQLSNTAAGAPRLGVPPFEWWSESLHGVCDNGPG 98
Query: 101 TKFG-GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNI 159
F G AT FPQVI +A++FN +LW A R ++ EARAM+N G AGLT+W+PN+N+
Sbjct: 99 VNFSSGPVRSATIFPQVILSAAAFNRSLWRAAARAIAVEARAMHNAGQAGLTFWAPNINV 158
Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD--RLKVAASCKHFTAYDLDNWNG 217
FRDPRWGRGQETPGEDP + Y+ YV+G Q G+ R+ ++A CKH+ AYDL+ W G
Sbjct: 159 FRDPRWGRGQETPGEDPAVVSAYSVEYVKGFQRDYGEEGRMMLSACCKHYIAYDLEKWRG 218
Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
R+ FNAKV NGVP CA +IL+R R
Sbjct: 219 FTRYTFNAKV--------------------------------NGVPACARKDILQRA-RD 245
Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGL 337
EW GYI SDCD+V + ++ Q +T++ E++ A ++AG+D++CG FL HT+SA+++G
Sbjct: 246 EWGFQGYITSDCDAVAIIHENQTYTASDEDSIAVVLKAGMDINCGSFLIRHTKSAIEKGK 305
Query: 338 LSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLL 397
+ E DIN+AL N +VQ+RLG FD +Q + LGP +VCT +H+ELA EA RQG VLL
Sbjct: 306 VQEEDINHALFNLFSVQLRLGFFDKTNENQWFTQLGPNNVCTTEHRELAAEAVRQGTVLL 365
Query: 398 KNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA-RTIHQQG 456
KN LPL +A+IGP ++ + G+Y G+ C TT ++G+ Y +T G
Sbjct: 366 KNDNGFLPLKRSEVGHIALIGPAANDPYILGGDYTGVPCHSTTFVKGMQAYVPKTTFAAG 425
Query: 457 CKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSM 516
CKDV C FG AI+A+++AD +L+ GL+ + E E DR LLLPGRQ +L+ V+
Sbjct: 426 CKDVPCNSTDGFGEAIEAAKRADVVVLIAGLNLTEETEDHDRVSLLLPGRQMDLIHTVAS 485
Query: 517 ASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPM 576
+K P +LVLM GGP+DV+FAK+DPRIA+I+W GYPG+ GG + +ILFG NPGGKLP+
Sbjct: 486 VTKKPVVLVLMGGGPVDVSFAKHDPRIASILWIGYPGEVGGNVLPEILFGKYNPGGKLPI 545
Query: 577 TWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
TWYP+ + T +PM +M MR S+ YPGRTYRFY G VVY FG+G+SY+ + +++ AP
Sbjct: 546 TWYPESF-TAVPMDDMNMRADASRGYPGRTYRFYTGDVVYGFGYGLSYSKYSYSILQAPK 604
Query: 637 VVAVPLDGRHGSINATISGKAIKVTH-----------AKCNRLTLGVQVDVKNVGSKDGA 685
+++ S+ IS K A C L V + V N G+ DG+
Sbjct: 605 KISL----SRSSVPDLISRKPAYTRRDGVDYVQVEDIASCEALQFPVHISVSNDGAMDGS 660
Query: 686 HTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPL 744
H +L+F S+ P+ +P KQLV FE+VH AG V I + CK +S + GTR + L
Sbjct: 661 HAVLLFASSKPSFPGSPIKQLVGFERVHTAAGRSTDVEITVDPCKLMSFANTEGTRVLFL 720
Query: 745 GEHNIHIGGTKHSVSLHA 762
G H + +G +H + + A
Sbjct: 721 GTHVLMVGDEEHELLIEA 738
>gi|163889365|gb|ABY48135.1| beta-D-xylosidase [Medicago truncatula]
Length = 776
Score = 642 bits (1656), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 335/795 (42%), Positives = 471/795 (59%), Gaps = 63/795 (7%)
Query: 3 STIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSL 62
STI F L+L+ + C P + PFC +SLPI R LI L+L
Sbjct: 7 STIIIFLFSLLLIHLPK--FFTTPDYPCKPPHSH---YPFCNISLPISTRTTSLISLLTL 61
Query: 63 QEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASS 122
+K+ L + A+++ LGI Y+WWSEALHG++ GPG F G AT+FPQVI +A++
Sbjct: 62 SDKINQLSNTASSISHLGIPSYQWWSEALHGIATNGPGVNFNGSVKSATNFPQVIVSAAA 121
Query: 123 FNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKY 182
FN +LW IG V E RAM+N G AGL++W+PNVN+FRDPRWGRGQETPGEDP++ Y
Sbjct: 122 FNRSLWFLIGYAVGVEGRAMFNVGQAGLSFWAPNVNVFRDPRWGRGQETPGEDPMVGSAY 181
Query: 183 AASYVRGLQGSDG------------DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQ 230
A +VRG+QG DG D L V+A CKHFTAYDL+ W R++FNA V
Sbjct: 182 AVEFVRGIQGVDGIKKVLNDHDSDDDGLMVSACCKHFTAYDLEKWGEFSRYNFNAVV--- 238
Query: 231 DIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY-IVSDC 289
+T+ PFR CV +GK + +MCSYN+VNGVP CA ++L +R +W G I+
Sbjct: 239 ---NTYQPPFRGCVQQGKASCLMCSYNEVNGVPACASKDLLG-LVRNKWGFEGVGILPQT 294
Query: 290 DSVGVYY-DTQHFTSTPEEAAADAIRA-----------GLDLDCGPFLGLHTESAVQRGL 337
+ + + + + P+ ++ +D++CG F+ HTESA+++GL
Sbjct: 295 VMLWLLFLSIKSMQNLPKMLLLMFLKQVFFYVFENLWFCMDINCGTFMLRHTESAIEQGL 354
Query: 338 LSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLL 397
+ E D++ AL N +VQMRLG+F+G+P +G LGP+DVCTP+H++LALEAARQGIVLL
Sbjct: 355 VKEEDLDRALFNLFSVQMRLGLFNGDPEKGKFGKLGPQDVCTPEHKKLALEAARQGIVLL 414
Query: 398 KNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTI-HQQG 456
KN LPL ++A+IGP + T + G Y+GI C + G+ Y +TI + G
Sbjct: 415 KNDNKFLPLDKKDRVSLAIIGPMA-TTSELGGGYSGIPCSPRSLYDGLKEYVKTISYAFG 473
Query: 457 CKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSM 516
C DV C D F AID ++QAD ++V GLD ++E E LDR LLLPG+Q +LVS+V+
Sbjct: 474 CSDVKCDSDDGFAVAIDIAKQADFVVIVAGLDTTLETEDLDRVSLLLPGKQMDLVSRVAA 533
Query: 517 ASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPM 576
ASK P ILVL GGP+DV+FA+++ I +I+W GYP D + G+LPM
Sbjct: 534 ASKRPVILVLTGGGPLDVSFAESNQLITSILWIGYP--------VDF-----DAAGRLPM 580
Query: 577 TWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
TWYP+ + TN+PM +M MR S+ YPGRTYRFY G +Y FGHG+SY++F + V +AP+
Sbjct: 581 TWYPESF-TNVPMNDMGMRADPSRGYPGRTYRFYTGSRIYGFGHGLSYSDFSYRVLSAPS 639
Query: 637 VVAVPLD---GRHGSINATISGKAIKVTHA------KCNRLTLGVQVDVKNVGSKDGAHT 687
+++ G S+ + +V H CN L+ V + V NVG DG+H
Sbjct: 640 KLSLSKTTNGGLRRSLLNKVEKDVFEVDHVHVDELQNCNSLSFSVHISVMNVGDMDGSHV 699
Query: 688 LLVFSTPPAG-HWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGE 746
+++FS P +P QLV ++H + I C++ S D G R +PLG
Sbjct: 700 VMLFSKWPKNIQGSPESQLVGPSRLHTVSNKSIETSILADPCEHFSFADEQGKRILPLGN 759
Query: 747 HNIHIGGTKHSVSLH 761
H +++G +H VS+
Sbjct: 760 HILNVGDVEHIVSIE 774
>gi|222615852|gb|EEE51984.1| hypothetical protein OsJ_33664 [Oryza sativa Japonica Group]
Length = 753
Score = 641 bits (1654), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 340/734 (46%), Positives = 448/734 (61%), Gaps = 32/734 (4%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
FC L QR DL+ L+L EKV L AA V RLG+ YEWWSE LHG+S G G
Sbjct: 31 FCDAWLTAEQRAADLVANLTLAEKVSQLGDRAAGVARLGVPAYEWWSEGLHGLSIWGRGI 90
Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA-GLTYWSPNVNIF 160
+F G TSFPQVI TA++F+A LW +G V EARA+YN G A GLT WSPNVNIF
Sbjct: 91 RFNGTVRAVTSFPQVILTAAAFDAGLWRRVGEAVGAEARALYNLGQANGLTIWSPNVNIF 150
Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRG-LQGSDGDRLKVAASCKHFTAYDLDNWNGVD 219
RDP SG RG G G + +A CKH TAYDLD WN V
Sbjct: 151 RDP---------------SGTRPGDARRGPRHGEQGIGGEASACCKHATAYDLDYWNNVV 195
Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
R+++++KV+ QD+EDT++ PF+ CV EGK +MC YN +NGVP CA ++L + +R EW
Sbjct: 196 RYNYDSKVTLQDLEDTYNPPFKSCVAEGKATCIMCGYNSINGVPACASSDLLTKKVRQEW 255
Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLS 339
+NGY+ SDCD+V D H+T +PE+ A +I+ G+D++CG + +H +AVQ+G L+
Sbjct: 256 GMNGYVASDCDAVATIRDAHHYTLSPEDTVAVSIKVGMDVNCGNYTQVHAMAAVQKGNLT 315
Query: 340 EIDINNALVNTLTVQMRLGMFDGEPSSQP-YGHLGPKDVCTPDHQELALEAARQGIVLLK 398
E DI+ ALVN V+MRLG FDG+P S YGHLG DVC+P H+ LALEAA+ GIVLLK
Sbjct: 316 EKDIDRALVNLFAVRMRLGHFDGDPRSNAVYGHLGAADVCSPAHKSLALEAAQDGIVLLK 375
Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA--RTIHQQG 456
N +LPL ++AVIGPN+D + GNY G C TTPLQGI Y R G
Sbjct: 376 NDAGALPLQPSAVTSLAVIGPNADNLGALHGNYFGPPCETTTPLQGIKGYLGDRARFLAG 435
Query: 457 CKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSM 516
C ACA D AA AS +D +L MGL Q E + LDR LLLPG QQ L++ V+
Sbjct: 436 CDSPACAVDATNEAAALAS-SSDHVVLFMGLSQKQEQDGLDRTSLLLPGEQQGLITAVAN 494
Query: 517 ASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPM 576
A++ P ILVL++GGP+DV FAK++P+I AI+WAGYPGQAGG AIA +LFG NP G+LP+
Sbjct: 495 AARRPVILVLLTGGPVDVTFAKDNPKIGAILWAGYPGQAGGLAIAKVLFGDHNPSGRLPV 554
Query: 577 TWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTV----- 631
TWYP+E+ T +PMT+M MR + YPGR+YRFY+G VY FG+G+SY+ F +
Sbjct: 555 TWYPEEF-TKVPMTDMRMRADPATGYPGRSYRFYQGNTVYNFGYGLSYSKFSRRMFSSFS 613
Query: 632 ---ANAPTVVAVPLDGRHGSINATISGKAIK-VTHAKCNRLTLGVQVDVKNVGSKDGAHT 687
A +++A + R G +S +K + +C+RL V+V+N G DG H+
Sbjct: 614 TSNAGNLSLLAGVMARRAGDDGGGMSSYLVKEIGVERCSRLVFPAVVEVQNHGPMDGKHS 673
Query: 688 LLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGE 746
+L++ P P +QL+ F HV G + V + C++ S V G R I G
Sbjct: 674 VLMYLRWPTTSGGRPARQLIGFRSQHVKVGEKAMVSFEVSPCEHFSWVGEDGERVIDGGA 733
Query: 747 HNIHIGGTKHSVSL 760
H + +G + S
Sbjct: 734 HFLMVGDEELETSF 747
>gi|147857580|emb|CAN78858.1| hypothetical protein VITISV_030325 [Vitis vinifera]
Length = 699
Score = 635 bits (1638), Expect = e-179, Method: Compositional matrix adjust.
Identities = 323/631 (51%), Positives = 406/631 (64%), Gaps = 71/631 (11%)
Query: 133 RVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG 192
+VVS EARAMYN G AGLT+WSPNVNIF+DPRWGRGQETPGEDP+LS KYA+ YVRGLQ
Sbjct: 135 KVVSTEARAMYNVGLAGLTFWSPNVNIFQDPRWGRGQETPGEDPLLSSKYASGYVRGLQQ 194
Query: 193 SDG---DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKV 249
SD DRLKVAA CKH+TAYDLDNW GVD FHFNA V+ QD++DTF PF+ CV++G V
Sbjct: 195 SDDGSPDRLKVAACCKHYTAYDLDNWKGVDCFHFNAVVTNQDMDDTFQPPFKSCVIDGNV 254
Query: 250 ASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAA 309
ASV+ YIVSDCDSV V+Y++QH+T TPEEAA
Sbjct: 255 ASVI------------------------------YIVSDCDSVDVFYNSQHYTKTPEEAA 284
Query: 310 ADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPY 369
A AI AGLDL+CG FLG HTE+AV+ GL+ E ++ A+ N MRLG FDG PS Y
Sbjct: 285 AKAILAGLDLNCGSFLGQHTEAAVKGGLVDESAVDKAVSNNFATLMRLGFFDGNPSKAIY 344
Query: 370 GHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIG 429
G LGPKDVCT +HQE A EA RQGIV
Sbjct: 345 GKLGPKDVCTSEHQERAREAPRQGIV---------------------------------- 370
Query: 430 NYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQ 489
+AG C YTTPLQG+ T + GC +VAC Q+ A + ADAT+L++G+DQ
Sbjct: 371 -FAGTPCKYTTPLQGLTALVATTYLPGCSNVACGTAQI-DEAKKIAAAADATVLIVGIDQ 428
Query: 490 SIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWA 549
SIEAE DR + LPG+Q L+++V+ SKG ILV+MSGG D++FAKND +I +I W
Sbjct: 429 SIEAEGRDRVNIQLPGQQPLLITEVAKXSKGNVILVVMSGGGFDISFAKNDDKITSIQWV 488
Query: 550 GYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRF 609
GYPG+AGG AIAD++FG NP GKLPMTWYPQ Y+ +PMT M MRP + YPGRTYRF
Sbjct: 489 GYPGEAGGAAIADVIFGFYNPSGKLPMTWYPQSYVDKVPMTNMNMRPDPASGYPGRTYRF 548
Query: 610 YKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT 669
Y G +Y FG G+SYT F H + AP V++P++ H ++ K++ C L
Sbjct: 549 YTGETIYTFGDGLSYTQFNHHLVQAPKSVSIPIEEAHSCHSSKC--KSVDAVQESCQNLA 606
Query: 670 LGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCK 729
+ + V N G+ G+HT+ +FS+PP+ H +P K L+ FEKV V A A+ V + VCK
Sbjct: 607 FDIHLRVNNAGNISGSHTVFLFSSPPSVHNSPQKHLLGFEKVFVTAKAKALVRFKVDVCK 666
Query: 730 YLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
LS+VD GTR++ LG H +H+G KHS+++
Sbjct: 667 DLSIVDELGTRKVALGLHVLHVGNLKHSLNV 697
>gi|357153280|ref|XP_003576399.1| PREDICTED: probable beta-D-xylosidase 2-like [Brachypodium
distachyon]
Length = 807
Score = 634 bits (1636), Expect = e-179, Method: Compositional matrix adjust.
Identities = 346/765 (45%), Positives = 464/765 (60%), Gaps = 73/765 (9%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
+C LP RV DLIG ++++EKV L AA PR+G+ Y+WWSEALHG+S+ GP T
Sbjct: 62 YCDAKLPYGDRVRDLIGWMTVEEKVSNLGDWAAGAPRVGLPPYKWWSEALHGLSSTGPTT 121
Query: 102 KF------------GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG 149
KF F G T F VI +A+SFN +LW +IG+ +S EARAMYN G G
Sbjct: 122 KFDDLKKPRLHSGRAAVFNG-TVFANVINSAASFNESLWRSIGQAISTEARAMYNLGKGG 180
Query: 150 LTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQ-------GSDGDRL---- 198
LTYWSPN+N+ RDPRWGR ETPGEDP + G+YA ++VRG+Q G +GD L
Sbjct: 181 LTYWSPNINVVRDPRWGRALETPGEDPFVVGRYAVNFVRGMQDVDDAAAGFNGDPLSRPL 240
Query: 199 KVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQ 258
K +A CKH+ AYD+D+W G RF F+A+V+++D+ +TF PF MCV +G ++VMCSYN+
Sbjct: 241 KTSACCKHYAAYDVDDWYGHTRFKFDARVTERDMVETFQRPFEMCVRDGDASAVMCSYNR 300
Query: 259 VNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTS-TPEEAAADAIRAGL 317
VNG+P CAD +L T+R +W L+GYIVSDCD+V V D + TP EA+A +++AGL
Sbjct: 301 VNGIPACADARLLAGTLRRDWGLHGYIVSDCDAVRVMTDNATWLGYTPAEASAASLKAGL 360
Query: 318 DLDCGP------------FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPS 365
DLDCG FL + +AV++G + E DI+NALVN T MRLG FDG P
Sbjct: 361 DLDCGESWIVQKGKPVMDFLSTYGMAAVRQGKMRESDIDNALVNLYTTLMRLGYFDGMPR 420
Query: 366 SQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDV-T 424
Y L KD+C+ H+ LAL+ ARQ +VLLKN LPL + +VAV GP+++
Sbjct: 421 ---YESLDEKDICSEAHRSLALDGARQSMVLLKNLDGLLPLDASKLASVAVRGPHAEAPE 477
Query: 425 VTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILV 484
M G+Y G C Y TP +GI + I QQG D TI +
Sbjct: 478 KVMDGDYTGPPCRYITPREGISKDV-NISQQG---------------------GDVTIYM 515
Query: 485 MGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIA 544
G++ IE E DR LLLP Q E + +V+ AS P +LV++SGG IDV+FA++ P+I
Sbjct: 516 GGINMHIEREGNDREDLLLPKNQTEEILRVAAASPSPIVLVILSGGGIDVSFAQSHPKIG 575
Query: 545 AIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPG 604
AI+WAGYPG GG AIAD++FG NPGG+LP+TW+ +YI LPMT MA+RP YPG
Sbjct: 576 AILWAGYPGGEGGHAIADVIFGRYNPGGRLPLTWFKNKYIHQLPMTSMALRPRPEHGYPG 635
Query: 605 RTYRFYKGP-VVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGK------A 657
RTY+FY GP V+YPFG+G+SYT F + + N T V + RH + +G A
Sbjct: 636 RTYKFYDGPDVLYPFGYGLSYTKFRYELLNKETAVTLAPGRRHCRQLSYKTGSVGPDCPA 695
Query: 658 IKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG-HWAPHKQLVAFEKVHVPAG 716
+ V C T+ V V N G DGA+ +LV++ PPA AP KQ+ AF +V V AG
Sbjct: 696 VDVASHACAE-TVSFNVSVVNAGKADGANAVLVYTAPPAELAGAPIKQVAAFRRVAVKAG 754
Query: 717 AQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHI-GGTKHSVSL 760
A + V ++VCK +V+++ +P G + + G +VS
Sbjct: 755 AAETVVFTLNVCKAFGIVEKTAYTVVPSGVSTVIVENGDSSAVSF 799
>gi|357489437|ref|XP_003615006.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
gi|355516341|gb|AES97964.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
Length = 685
Score = 634 bits (1636), Expect = e-179, Method: Compositional matrix adjust.
Identities = 321/677 (47%), Positives = 442/677 (65%), Gaps = 20/677 (2%)
Query: 100 GTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA-GLTYWSPNVN 158
G G P ATSFPQVI TA+SF+ LW I +V+ EAR +YN G A G+ +W+PN+N
Sbjct: 2 GIILNGSIPAATSFPQVILTAASFDPKLWYQISKVIGTEARGVYNAGQAQGMNFWAPNIN 61
Query: 159 IFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD-------GDRLKVAASCKHFTAYD 211
IFRDPRWGRGQET GEDP+++ KY SYVRGLQG G RLK +A CKHFTAYD
Sbjct: 62 IFRDPRWGRGQETAGEDPLVNSKYGVSYVRGLQGDSFEGGKLIGGRLKASACCKHFTAYD 121
Query: 212 LDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
L+NW GV+R+ F+AKV+ QD+ DT+ F CV++G+ + +MC+YN+VNGVP CAD N+L
Sbjct: 122 LENWKGVNRYVFDAKVTLQDLADTYQPSFHSCVVQGRSSGIMCAYNRVNGVPNCADYNLL 181
Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTES 331
T R +W NGYI SDCD+V Y+ Q + TPE+ AD +RAG+DL+CG ++ H +S
Sbjct: 182 TNTARKKWNFNGYIASDCDAVRFIYEKQGYAKTPEDVVADVLRAGMDLECGNYMTKHAKS 241
Query: 332 AVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAAR 391
AV + + I+ AL N T+++RLG+FDG P+ YG +GP VC+ ++ +LALEAAR
Sbjct: 242 AVLQKKIPISQIDRALHNLFTIRIRLGLFDGNPTKLQYGRIGPNQVCSKENLDLALEAAR 301
Query: 392 QGIVLLKNQGPSLPLSHIRHRTVAVIGPNSD-VTVTMIGNYAGIACGYTTPLQGIGRYAR 450
GIVLLKN LPL R T+ VIGPN++ ++ ++GNY G C + L+G YA
Sbjct: 302 SGIVLLKNTASILPLP--RVNTLGVIGPNANKSSIVLLGNYIGPPCKNVSILKGFYTYAS 359
Query: 451 TIH-QQGCKD-VACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQ 508
H GC D CA ++ A++ ++ +D ILVMGLDQS E E LDR L LPG+QQ
Sbjct: 360 QTHYHSGCTDGTKCASAEI-DRAVEVAKISDYVILVMGLDQSQETETLDRDHLELPGKQQ 418
Query: 509 ELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTS 568
+L++ V+ ASK P ILVL+ GGP+D+ FAKN+ +I IIWAGYPG+ GG A+A ++FG
Sbjct: 419 KLINSVAKASKKPVILVLLCGGPVDITFAKNNDKIGGIIWAGYPGELGGRALAQVVFGDY 478
Query: 569 NPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFV 628
NPGG+LPMTWYP+++I +PMT+M MR S YPGRTYRFY GP VY FG+G+SY+N+
Sbjct: 479 NPGGRLPMTWYPKDFI-KIPMTDMRMRADPSSGYPGRTYRFYTGPKVYEFGYGLSYSNYS 537
Query: 629 H---TVANAPTVVAVPLDGRHGSINATISGKAI-KVTHAKCNRLTLGVQVDVKNVGSKDG 684
+ +V N + + TI+ K + ++ C +++ V + + N GS G
Sbjct: 538 YNFISVKNNNLHINQSTTYSILENSETINYKLVSELGEETCKTMSISVTLGITNTGSMAG 597
Query: 685 AHTLLVFSTPPAG-HWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIP 743
H +L+F P G + P KQLV FE V V G + VG + VC++LS + SG + I
Sbjct: 598 KHPVLLFVKPKKGRNGNPVKQLVGFESVTVEGGGKGEVGFEVSVCEHLSRANESGVKVIE 657
Query: 744 LGEHNIHIGGTKHSVSL 760
G + +G ++S+++
Sbjct: 658 EGGYLFLVGQEEYSINI 674
>gi|115486735|ref|NP_001068511.1| Os11g0696400 [Oryza sativa Japonica Group]
gi|77552754|gb|ABA95551.1| Glycosyl hydrolase family 3 C terminal domain containing protein
[Oryza sativa Japonica Group]
gi|113645733|dbj|BAF28874.1| Os11g0696400 [Oryza sativa Japonica Group]
Length = 816
Score = 627 bits (1618), Expect = e-177, Method: Compositional matrix adjust.
Identities = 347/766 (45%), Positives = 451/766 (58%), Gaps = 74/766 (9%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
+C SLP RV DLIGR++++EKV L R+G+ Y WWSEALHG+S+ GP T
Sbjct: 70 YCDASLPYADRVRDLIGRMTVEEKVGALGDWTDGAARIGLPAYRWWSEALHGLSSTGPTT 129
Query: 102 KFG-----------GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGL 150
KF AT F VI +A+SFN TLW++IG+ VS EARAMYN G GL
Sbjct: 130 KFDDLATPHLHSGVSAVYNATVFANVINSAASFNETLWKSIGQAVSTEARAMYNMGKGGL 189
Query: 151 TYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR------------L 198
TYWSPN+N+ RDPRWGR ETPGEDP + G+YA ++VRG+Q G L
Sbjct: 190 TYWSPNINVVRDPRWGRALETPGEDPYVVGRYAVNFVRGMQDIPGHEAVAAGGDPNTRPL 249
Query: 199 KVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQ 258
K +A CKH+ AYDLD+W+ RF F+A+V ++D+ +TF PF MCV +G V+SVMCSYN+
Sbjct: 250 KTSACCKHYAAYDLDDWHNHTRFEFDARVDERDMVETFQRPFEMCVRDGDVSSVMCSYNR 309
Query: 259 VNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTS-TPEEAAADAIRAGL 317
VNG+P CAD +L +TIR +W L+GYIVSDCD+V V D + T EA+A A++AGL
Sbjct: 310 VNGIPACADARLLSQTIRRDWGLHGYIVSDCDAVRVMTDNATWLGYTGAEASAAALKAGL 369
Query: 318 DLDCGP-------------FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP 364
DLDCG FL + AV +G + E DI+NAL N MRLG FD
Sbjct: 370 DLDCGESWKNDTDGHPLMDFLTTYGMEAVNKGKMRESDIDNALTNQYMTLMRLGYFD--- 426
Query: 365 SSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDV- 423
Y LG +D+CT H+ LAL+ ARQGIVLLKN LPL + V V GP+
Sbjct: 427 DIAQYSSLGRQDICTDQHKTLALDGARQGIVLLKNDNKLLPLDANKVGFVNVRGPHVQAP 486
Query: 424 TVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATIL 483
M G+Y G C Y TP QG+ +Y R H+ A+ TI
Sbjct: 487 EKIMDGDYTGPPCRYVTPRQGVSKYVRFSHR-----------------------ANTTIY 523
Query: 484 VMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRI 543
GL+ +IE E DR +LLP Q E + +V+ AS P ILV++SGG IDV+FA+N+P+I
Sbjct: 524 FGGLNLNIEREGNDREDILLPKNQTEEIIRVAKASPNPIILVILSGGGIDVSFAQNNPKI 583
Query: 544 AAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYP 603
AI+WAGYPG GG AIAD++FG NP G+LP+TW+ +YI LPMT M +RP YP
Sbjct: 584 GAILWAGYPGGEGGNAIADVIFGKHNPSGRLPLTWFKNKYIYQLPMTSMDLRPVAKHGYP 643
Query: 604 GRTYRFYKGP-VVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG------- 655
GRTY+FY GP V+YPFG+G+SYT F++ + T + VP+ G H + SG
Sbjct: 644 GRTYKFYDGPDVLYPFGYGLSYTKFLYEMGTNGTALIVPVAGGHCKKLSYKSGVSTAPAC 703
Query: 656 KAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG-HWAPHKQLVAFEKVHVP 714
AI V C T+ V V N G G+H ++VFS PPA AP KQ+VAF+ V VP
Sbjct: 704 PAINVNGHVCTE-TVSFNVSVTNGGDTGGSHPVIVFSKPPAEVDDAPMKQVVAFKSVFVP 762
Query: 715 AGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
A + V ++VCK +V+++ +P G I + SVS
Sbjct: 763 AWSTVSVSFELNVCKAFGIVEKTAYTVVPSGVSTILVENVDSSVSF 808
>gi|125535311|gb|EAY81859.1| hypothetical protein OsI_37025 [Oryza sativa Indica Group]
Length = 816
Score = 627 bits (1616), Expect = e-176, Method: Compositional matrix adjust.
Identities = 346/767 (45%), Positives = 451/767 (58%), Gaps = 75/767 (9%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
+C SLP RV DLIGR++++EKV L R+G+ Y WWSEALHG+S+ GP T
Sbjct: 69 YCDASLPYADRVRDLIGRMTVEEKVGALGDWTDGAARIGLPAYRWWSEALHGLSSTGPTT 128
Query: 102 KFG-----------GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGL 150
KF AT F VI +A+SFN TLW++IG+ VS EARAMYN G GL
Sbjct: 129 KFDDLATPHLHSGVSAVYNATVFANVINSAASFNETLWKSIGQAVSTEARAMYNMGKGGL 188
Query: 151 TYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR------------L 198
TYWSPN+N+ RDPRWGR ETPGEDP + G+YA ++VRG+Q G L
Sbjct: 189 TYWSPNINVVRDPRWGRALETPGEDPYVVGRYAVNFVRGMQDIPGHEAVAAGGDPNTRPL 248
Query: 199 KVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQ 258
K +A CKH+ AYDLD+W+ RF F+A+V ++D+ +TF PF MCV +G V+SVMCSYN+
Sbjct: 249 KTSACCKHYAAYDLDDWHNHTRFEFDARVDERDMVETFQRPFEMCVRDGDVSSVMCSYNR 308
Query: 259 VNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTS-TPEEAAADAIRAGL 317
VNG+P CAD +L +TIR +W L+GYIVSDCD+V V D + T EA+A A++AGL
Sbjct: 309 VNGIPACADARLLSQTIRRDWGLHGYIVSDCDAVRVMTDNATWLGYTGAEASAAALKAGL 368
Query: 318 DLDCGP-------------FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP 364
DLDCG FL + AV +G + E DI+NAL N MRLG FD
Sbjct: 369 DLDCGESWKNDTEGHPLMDFLTTYGMEAVNKGKMRESDIDNALTNQYMTLMRLGYFD--- 425
Query: 365 SSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDV- 423
Y LG +D+CT H+ LAL+ ARQGIVLLKN LPL + V V GP+
Sbjct: 426 DITQYSSLGRQDICTDQHKTLALDGARQGIVLLKNDNKLLPLDANKVGFVNVRGPHVQAP 485
Query: 424 TVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATIL 483
M G+Y G C Y TP QG+ +Y R H+ A+ TI
Sbjct: 486 EKIMDGDYTGPPCRYVTPRQGVSKYVRFSHR-----------------------ANTTIY 522
Query: 484 VMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRI 543
GL+ +IE E DR +LLP Q E + +V+ AS P ILV++SGG IDV+FA+N+P+I
Sbjct: 523 FGGLNLNIEREGNDREDILLPKNQTEEIIRVAKASPNPIILVILSGGGIDVSFAQNNPKI 582
Query: 544 AAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYP 603
AI+WAGYPG GG AIAD++FG NP G+LP+TW+ +YI LPMT M +RP YP
Sbjct: 583 GAILWAGYPGGEGGNAIADVIFGKHNPSGRLPLTWFKNKYIYQLPMTSMDLRPVAKHGYP 642
Query: 604 GRTYRFYKGP-VVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH--------GSINATIS 654
GRTY+FY GP V+YPFG+G+SYT F++ + T + VP+ G H G +A +
Sbjct: 643 GRTYKFYNGPDVLYPFGYGLSYTKFLYEMGTNGTALTVPVAGGHCKKLSYKSGVSSAAPA 702
Query: 655 GKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG-HWAPHKQLVAFEKVHV 713
AI V C T+ V V N G G+H ++VFS PPA AP KQ+VAF V V
Sbjct: 703 CPAINVNGHACTE-TVSFNVSVTNGGDTGGSHPVIVFSKPPAEVDDAPIKQVVAFRSVFV 761
Query: 714 PAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
PA + V ++VCK +V+++ +P G + + SVS
Sbjct: 762 PAWSTVSVSFELNVCKAFGIVEKTAYTVVPSGVSTVLVENVDSSVSF 808
>gi|242093144|ref|XP_002437062.1| hypothetical protein SORBIDRAFT_10g020500 [Sorghum bicolor]
gi|241915285|gb|EER88429.1| hypothetical protein SORBIDRAFT_10g020500 [Sorghum bicolor]
Length = 809
Score = 626 bits (1614), Expect = e-176, Method: Compositional matrix adjust.
Identities = 342/770 (44%), Positives = 458/770 (59%), Gaps = 78/770 (10%)
Query: 40 LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGP 99
P+C SLP RV DLIG ++++EKV L + PR+G+ Y+WWSEALHGVS+ GP
Sbjct: 61 FPYCDASLPYADRVRDLIGWMTVEEKVGNLGDVSHGAPRVGLPPYKWWSEALHGVSSTGP 120
Query: 100 GTKFG------GDFPG------ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT 147
F G+ G AT F VI +A+SFN TLW++IG+ VS EARAMYN G
Sbjct: 121 TMLFDDLHSKPGNHSGRATVNNATVFANVINSAASFNETLWKSIGQAVSTEARAMYNLGK 180
Query: 148 AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQ---GSDG-------DR 197
GLTYWSPN+N+ RDPRWGR ETPGEDP ++G+YA ++VRG+Q G DG
Sbjct: 181 GGLTYWSPNINVVRDPRWGRALETPGEDPFVAGRYAVNFVRGMQDIPGHDGGGDDPSTRP 240
Query: 198 LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYN 257
+K +A CKH+ AYD+D+W+ RF F+A+VS++D+ +TF PF MCV +G + VMCSYN
Sbjct: 241 IKTSACCKHYAAYDVDDWHNHTRFTFDARVSERDMAETFLRPFEMCVRDGDASGVMCSYN 300
Query: 258 QVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQ---HFTSTPEEAAADAIR 314
+VNG+P CAD +L TIRG+W+L+GYIVSDCD+V V D HFT E++A +IR
Sbjct: 301 RVNGIPACADARLLSGTIRGDWQLHGYIVSDCDAVRVMTDNATWLHFTGA--ESSAASIR 358
Query: 315 AGLDLDCGP------------FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDG 362
AGLDLDC FL + ++AV +G + E DI++AL N MRLG FD
Sbjct: 359 AGLDLDCAESWIEEKGRPLRDFLSEYGKAAVAQGKMRESDIDSALRNQYMTLMRLGYFDN 418
Query: 363 EPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSD 422
P Y L D+CT +H+ LA + ARQG+VLLKN LPL + VAV GP++
Sbjct: 419 IPR---YASLNETDICTDEHKSLAHDGARQGMVLLKNDDGLLPLDPEKILAVAVHGPHAR 475
Query: 423 V-TVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADAT 481
M G+Y G C Y TP QGI + + H+ A+ T
Sbjct: 476 APEKIMDGDYTGPPCRYVTPRQGISKDVKISHR-----------------------ANTT 512
Query: 482 ILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDP 541
I + G++ IE E DR LLLP Q E + + AS P ILV++SGG ID++FA P
Sbjct: 513 IYLGGINLHIEREGNDREDLLLPKNQTEEILHFAKASPNPIILVILSGGGIDISFAHKHP 572
Query: 542 RIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKR 601
+I AI+WAGYPG GG AIAD++FG NPGG+LP+TW+ +YI +PMT M RP K
Sbjct: 573 KIGAILWAGYPGGEGGNAIADVIFGRYNPGGRLPLTWFKNKYIQQIPMTSMEFRPVPEKG 632
Query: 602 YPGRTYRFYKGP-VVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHG-------SINATI 653
YPGRTY+FY GP V+YPFG+G+SYT F++ + T V +P G H S+ T
Sbjct: 633 YPGRTYKFYDGPEVLYPFGYGLSYTKFLYETSTNGTAVTLPATGGHCKGLSYKPSVATTP 692
Query: 654 SGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG-HWAPHKQLVAFEKVH 712
+ +A+ V C T+ + V N G + GAH +LV++ PP AP KQ+ AF +V
Sbjct: 693 ACQAVDVAGHACTE-TVSFNISVTNAGGRGGAHVVLVYTAPPPEVAQAPIKQVAAFRRVF 751
Query: 713 VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHI--GGTKHSVSL 760
VPA + V ++VCK +V+R+ +P G + + G + SVS
Sbjct: 752 VPARSTATVPFTLNVCKAFGIVERTAYTVVPSGVSKVLVQNGDSSSSVSF 801
>gi|413954831|gb|AFW87480.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 814
Score = 626 bits (1614), Expect = e-176, Method: Compositional matrix adjust.
Identities = 343/766 (44%), Positives = 452/766 (59%), Gaps = 72/766 (9%)
Query: 40 LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGP 99
P+C SLP RV DLIG ++++EKV L + PR+G+ Y+WWSEALHGVS+ GP
Sbjct: 68 FPYCDASLPYADRVRDLIGWMTVEEKVGNLGDISHGAPRVGLPPYKWWSEALHGVSSTGP 127
Query: 100 GTKFG------GDFPG------ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT 147
F G+ G AT F VI +A+SFN TLW +IG+ VS EARAMYN G
Sbjct: 128 TMLFDDLHSKPGNHSGRATVNNATVFANVINSAASFNETLWNSIGQAVSTEARAMYNLGK 187
Query: 148 AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR--------LK 199
GLTYWSPN+N+ RDPRWGR ETPGEDP ++G+YA ++VRG+Q G +K
Sbjct: 188 GGLTYWSPNINVVRDPRWGRALETPGEDPYVAGRYAVNFVRGMQDIPGHYSGDPSARPIK 247
Query: 200 VAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQV 259
+A CKH AYD+DNW+ RF ++A+VS++D+ +TF PF MCV EG V+SVMCSYN+V
Sbjct: 248 TSACCKHHAAYDVDNWHNQTRFTYDARVSERDMAETFLRPFEMCVREGDVSSVMCSYNRV 307
Query: 260 NGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTS-TPEEAAADAIRAGLD 318
NGVP CAD +L T+RGEW LNGYIVSDCD+V V D + + T E++A ++RAG+D
Sbjct: 308 NGVPACADARLLSGTVRGEWHLNGYIVSDCDAVRVMTDNATWLNFTAAESSAVSLRAGMD 367
Query: 319 LDCGP------------FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSS 366
LDC +L + +AV +G + E DI+NAL N MRLG FD P
Sbjct: 368 LDCAESWIEEEGRPLRDYLSEYGMAAVAQGKMRESDIDNALTNLYMTLMRLGYFDNIPR- 426
Query: 367 QPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDV-TV 425
Y L DVCT +H+ LAL+ ARQGIVLLKN LPL + VAV GP++
Sbjct: 427 --YASLNETDVCTDEHKSLALDGARQGIVLLKNDHGLLPLDPKKTLAVAVHGPHARAPEK 484
Query: 426 TMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVM 485
M G+Y G C Y TP QGI R + H+ A TI +
Sbjct: 485 IMDGDYTGPPCRYVTPRQGISRDVKISHK-----------------------AKMTIYLG 521
Query: 486 GLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAA 545
G++ IE E DR LLLP Q E + + AS P ILV++SGG ID++FA+ P+I A
Sbjct: 522 GINLYIEREGNDREDLLLPKNQTEEILHFAQASPTPIILVILSGGGIDISFAQKHPKIGA 581
Query: 546 IIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGR 605
I+WAGYPG GG AIAD++FG NPGG+LP+TW+ +YI +PMT M RP K YPGR
Sbjct: 582 ILWAGYPGGEGGNAIADVIFGRYNPGGRLPLTWFKNKYIEQIPMTSMEFRPVPEKGYPGR 641
Query: 606 TYRFYKGP-VVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHG-------SINATISGKA 657
TY+FY GP V+YPFG+G+SYT F + + V++P G H S+ + +A
Sbjct: 642 TYKFYDGPEVLYPFGYGLSYTKFQYETSTDGVSVSLPAPGGHCKGLSYKPSVATVPACQA 701
Query: 658 IKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG-HWAPHKQLVAFEKVHVPAG 716
+ V C T+ V V N G + GAH +LV++ PP AP KQ+ AF +V V A
Sbjct: 702 VNVADHACTE-TVSFNVSVTNAGGRGGAHVVLVYTAPPPEVAEAPIKQVAAFRRVFVAAR 760
Query: 717 AQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHI--GGTKHSVSL 760
+ V ++VCK +V+R+ +P G + + G + SVS
Sbjct: 761 STATVPFALNVCKAFGIVERTAYTVVPSGVSKVLVENGDSSSSVSF 806
>gi|293336530|ref|NP_001167905.1| uncharacterized protein LOC100381616 [Zea mays]
gi|223944757|gb|ACN26462.1| unknown [Zea mays]
Length = 630
Score = 602 bits (1551), Expect = e-169, Method: Compositional matrix adjust.
Identities = 294/631 (46%), Positives = 417/631 (66%), Gaps = 13/631 (2%)
Query: 142 MYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD--RLK 199
M+N G AGLTYW+PN+NIFRDPRWGRGQET GEDP ++ Y+ YV+G QG +G+ R++
Sbjct: 1 MHNAGQAGLTYWAPNINIFRDPRWGRGQETSGEDPAVAAAYSLEYVKGFQGEEGEEGRIR 60
Query: 200 VAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQV 259
++A CKH+TAYD++ W G R+ FNAKV+ QD+EDT+ PF+ C+ E + + +MC+YNQV
Sbjct: 61 LSACCKHYTAYDMEKWEGFSRYTFNAKVNAQDLEDTYQPPFKTCIQEARASCLMCAYNQV 120
Query: 260 NGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDL 319
NGVP CA ++L++T R EW GYI SDCD+V + ++ Q +T + E++ A ++AG+D+
Sbjct: 121 NGVPMCAHKDLLQKT-RDEWGFQGYITSDCDAVAIIHENQTYTKSGEDSIAIVLKAGMDI 179
Query: 320 DCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCT 379
+CG FL HT+SA+++G + E DI+ AL N +VQ+RLG+FD ++Q + LGP VCT
Sbjct: 180 NCGSFLVRHTKSAIEKGKIQEEDIDRALFNLFSVQLRLGIFDKPSNNQWFSQLGPNSVCT 239
Query: 380 PDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYT 439
+H+ELA EA RQG VLLKN LPL R VA+IGP+++ M G+Y G+ C T
Sbjct: 240 KEHRELAAEAVRQGAVLLKNDHNFLPLKRSEVRHVAIIGPSANDAYAMGGDYTGVPCNPT 299
Query: 440 TPLQGIGRYA-RTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDR 498
T L+GI YA +T GCKD +C LFG A++A+++AD +++ GL+ + E E DR
Sbjct: 300 TFLKGIQAYATQTSFAPGCKDASCNSTDLFGEAVEAAKRADIVVVIAGLNLTEEREDFDR 359
Query: 499 AGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGT 558
LLLPG+Q L+ ++ +K P +LVL+ GGP+DV+FAK DPRIA+I+W GYPG+ GG
Sbjct: 360 VSLLLPGKQMGLIHAIASVAKKPLVLVLLGGGPVDVSFAKQDPRIASILWLGYPGEVGGQ 419
Query: 559 AIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPF 618
+ +ILFG NPGGKLP+TWYP+ + T +PMT+M MR S+ YPGRTYRFY G VVY F
Sbjct: 420 VLPEILFGEYNPGGKLPITWYPESF-TAIPMTDMNMRADPSRGYPGRTYRFYTGDVVYGF 478
Query: 619 GHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATIS-------GKAIKVTHAKCNRLTLG 671
G+G+SY+ + +++++AP + V G I+ + G A C L
Sbjct: 479 GYGLSYSKYSYSISSAPKKITVSRSSDLGIISRKPAYTRRDGLGSVKTEDIASCEALVFS 538
Query: 672 VQVDVKNVGSKDGAHTLLVFSTPPAG-HWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKY 730
V V V N GS DG+H +L+F+ + P KQLV FE VH AG+ V I + CK
Sbjct: 539 VHVAVSNHGSMDGSHAVLLFARSKSSVPGFPIKQLVGFESVHTAAGSASNVEITVDPCKQ 598
Query: 731 LSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
+S + G R + LG H + +G + +S+
Sbjct: 599 MSAANPEGKRVLLLGAHVLTVGDEEFELSIE 629
>gi|125576920|gb|EAZ18142.1| hypothetical protein OsJ_33692 [Oryza sativa Japonica Group]
Length = 618
Score = 601 bits (1549), Expect = e-169, Method: Compositional matrix adjust.
Identities = 294/615 (47%), Positives = 407/615 (66%), Gaps = 25/615 (4%)
Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL 212
WSPNVNIFRDPRWGRGQETPGEDP + KY A++V+GLQGS L+ +A CKH TAYD+
Sbjct: 2 WSPNVNIFRDPRWGRGQETPGEDPATASKYGAAFVKGLQGSSLTNLQTSACCKHITAYDI 61
Query: 213 DNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILK 272
+ W GV R++FNAKV+ QD+ DT++ PFR CV++GK + +MC+Y +NGVP CA ++L
Sbjct: 62 EEWKGVSRYNFNAKVTPQDLADTYNPPFRSCVVDGKASCIMCAYTLINGVPACASSDLLT 121
Query: 273 RTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESA 332
+T+RGEW+L+GY SDCD+V + + ++HFT T EEA A A++AGLD++CG ++ + SA
Sbjct: 122 KTVRGEWKLDGYTASDCDAVAILHKSEHFTRTAEEAVAVALKAGLDINCGVYMQQNAASA 181
Query: 333 VQRGLLSEIDINNALVNTLTVQMRLGMFDGEPS-SQPYGHLGPKDVCTPDHQELALEAAR 391
+Q+G ++E D++ AL N ++MRLG FDG+P ++ YG L DVCTP H+ LALEAAR
Sbjct: 182 LQQGKMTEKDVDKALKNLFAIRMRLGHFDGDPRGNKLYGRLSAADVCTPVHKALALEAAR 241
Query: 392 QGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYART 451
+G+VLLKN LPL + AVIG N++ + ++GNY G+ C TTP GI +Y ++
Sbjct: 242 RGVVLLKNDARLLPLRAPTVASAAVIGHNANDILALLGNYYGLPCETTTPFGGIQKYVKS 301
Query: 452 IH-QQGCKDVAC---ADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQ 507
GC AC A DQ A ++ +D LVMGL Q E E LDR LLLPG+Q
Sbjct: 302 AKFLPGCSSAACDVAATDQ----ATALAKSSDYVFLVMGLSQKQEQEGLDRTSLLLPGKQ 357
Query: 508 QELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGT 567
Q L++ V+ ASK P IL+L++GGP+D+ FA+ +P+I AI+WAGYPGQAGG AIAD+LFG
Sbjct: 358 QALITAVATASKRPVILILLTGGPVDITFAQTNPKIGAILWAGYPGQAGGQAIADVLFGE 417
Query: 568 SNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNF 627
NP GKLP+TWYP+E+ T MT+M MRP + YPGR+YRFYKG VY FG+G+SY+ F
Sbjct: 418 FNPSGKLPVTWYPEEF-TKFTMTDMRMRPDPATGYPGRSYRFYKGKTVYKFGYGLSYSKF 476
Query: 628 ----VHTVANAPTVVAVPLDGRHGSINATISGKAI----KVTHAKCNRLTLGVQVDVKNV 679
V N+ + L G + T G A+ ++ +C RL V V+V+N
Sbjct: 477 ACRIVSGAGNSSSYGKAALAGLRAA--TTPEGDAVYRVDEIGDDRCERLRFPVMVEVQNH 534
Query: 680 GSKDGAHTLLVF---STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDR 736
G DG HT+L+F S+ G P +QL+ F H+ G ++++ + I C++LS
Sbjct: 535 GPMDGKHTVLMFVRWSSTDGGR--PVRQLIGFRNQHLKVGEKKKLKMEISPCEHLSRARV 592
Query: 737 SGTRRIPLGEHNIHI 751
G + I G H + +
Sbjct: 593 DGEKVIDRGSHFLMV 607
>gi|348667575|gb|EGZ07400.1| xylosidase [Phytophthora sojae]
Length = 751
Score = 599 bits (1545), Expect = e-168, Method: Compositional matrix adjust.
Identities = 313/748 (41%), Positives = 447/748 (59%), Gaps = 59/748 (7%)
Query: 30 CDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSE 89
CD D +LPFC SLPI RV+DL+ R+ L++ V LL++ A+A P + + YEWW+E
Sbjct: 24 CD--DPKVSSLPFCDGSLPIDARVSDLVNRIPLEQAVGLLVNKASAAPSVNVPSYEWWNE 81
Query: 90 ALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG 149
ALHGV+ + PG F G ATSFPQV++TA+SFN TL+ I +S EARA YN AG
Sbjct: 82 ALHGVA-LSPGVTFKGPLTAATSFPQVLSTAASFNRTLFYQIAEAISTEARAFYNEKNAG 140
Query: 150 LTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGS---------DGDRLKV 200
LT+W+PNVNIFRDPRWGRGQETPGEDP L+G+YA ++VRGLQG D LK+
Sbjct: 141 LTFWTPNVNIFRDPRWGRGQETPGEDPYLTGEYAVAFVRGLQGEAMEGHENKDDNKFLKI 200
Query: 201 AASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVN 260
++ CKHF+AY + V R +A V+KQD DT+ F CV G V+S+MCSYN VN
Sbjct: 201 SSCCKHFSAYSQE----VPRHRNDAIVTKQDQADTYFPAFEDCVKRGHVSSIMCSYNAVN 256
Query: 261 GVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLD 320
G+P+CAD +L +R +W+ +GYI SDC++V HFT +PE+ A + AG+DL+
Sbjct: 257 GIPSCADKGLLTDLVRNQWKFDGYITSDCEAVADVIYRHHFTQSPEQTCATTLDAGMDLN 316
Query: 321 CGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTP 380
CG FL H SA+++G++S ++NAL N V MRLGMF E +QP+ ++ V T
Sbjct: 317 CGEFLRQHLSSAIEQGIVSTEMVHNALKNQFRVMMRLGMF--EKGTQPFSNITKDAVDTA 374
Query: 381 DHQELALEAARQGIVLLKNQGPSLPLS---HIRHRTVAVIGPNSDVTVTMIGNYAGIACG 437
H++LALEAARQ +VLLKN+ +LPL+ + ++A+IGP+ + + ++GNY GI
Sbjct: 375 AHRQLALEAARQSVVLLKNEDNTLPLATDVFSKDGSLALIGPHFNASTALLGNYFGIPSH 434
Query: 438 YTTPLQGIGRYARTI-HQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL 496
TPL+G+ Y + + GCK V+ F AI+ ++AD ++ MGLDQS E E +
Sbjct: 435 IVTPLKGVSSYVPNVAYSLGCK-VSGEVLPDFDEAIEVVKKADRVVVFMGLDQSQEREEI 493
Query: 497 DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAG 556
DR L LPG Q L++++ A+ P +LVL+SGG +D++ KN P++ AI++ GY GQAG
Sbjct: 494 DRYHLKLPGFQIALLNRILAAASHPIVLVLISGGSVDLSLYKNHPKVGAIVFGGYLGQAG 553
Query: 557 GTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVY 616
G A+AD+LFG +P G+L T+Y +Y+ +P+ +M MRP+ PGRTYRF+ G VY
Sbjct: 554 GQALADMLFGKYSPAGRLTQTFYDSDYVNTMPIYDMHMRPTFVTGNPGRTYRFFSGAPVY 613
Query: 617 PFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDV 676
FG G+SYT F H C ++ V
Sbjct: 614 EFGFGLSYTTF----------------------------------HKACRSCVASFEITV 639
Query: 677 KNVGSKDGAHTLLVFSTPP-AGHWA-PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVV 734
N+G +G +L+++ PP AG P + LVAFE+ + + K ++
Sbjct: 640 TNLGDVEGEDAILIYAEPPHAGEGGRPLRSLVAFERTALVTTGKTATADFCLEAKAFALA 699
Query: 735 DRSGTRRIPLGEHNIHIGGTKHSVSLHA 762
+ G+ + G IH+ +H V++ A
Sbjct: 700 NAEGSWVVEQGNWTIHVDTLQHRVNVQA 727
>gi|326488213|dbj|BAJ89945.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 525
Score = 598 bits (1541), Expect = e-168, Method: Compositional matrix adjust.
Identities = 297/490 (60%), Positives = 362/490 (73%), Gaps = 7/490 (1%)
Query: 28 FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
FACD +AT FC R DL+ RL+L EKV L++ A+ RLGI YEWW
Sbjct: 37 FACDASNATLAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPALGRLGIPAYEWW 96
Query: 88 SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT 147
SEALHGVS VGPGT+F PGATSFPQ I TA+SFNA+L+ AIG VVS EARAM+N G
Sbjct: 97 SEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVSTEARAMHNVGL 156
Query: 148 AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQ-----GSDGDRLKVAA 202
AGLT+WSPN+NIFRDPRWGRGQETPGEDP+L+ KYA YV GLQ G LKVAA
Sbjct: 157 AGLTFWSPNINIFRDPRWGRGQETPGEDPLLTSKYAVGYVTGLQDAGAGGVTDGALKVAA 216
Query: 203 SCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGV 262
CKH+TAYD+DNW GV+R+ F+AKVS+QD++DTF PF+ CV++G VASVMCSYN+VNG
Sbjct: 217 CCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVLDGNVASVMCSYNKVNGK 276
Query: 263 PTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG 322
PTCAD ++L+ IRG+W+LNGYIVSDCDSV V Y QH+T TPEEAAA I++GLDL+CG
Sbjct: 277 PTCADKDLLEGVIRGDWKLNGYIVSDCDSVDVLYTQQHYTKTPEEAAAITIKSGLDLNCG 336
Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
FL HT +AVQ G LSE D++ A+ N + MRLG FDG+P +G LGPKDVCT +
Sbjct: 337 NFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLGFFDGDPRQLAFGSLGPKDVCTSSN 396
Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
+ELA E ARQGIVLLKN G +LPLS +++AVIGPN++ + TMIGNY G C YTTPL
Sbjct: 397 RELARETARQGIVLLKNSG-ALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPCKYTTPL 455
Query: 443 QGIGRYARTIHQQGCKDVACADDQL-FGAAIDASRQADATILVMGLDQSIEAEALDRAGL 501
QG+G T++Q GC +V C+ + L A+ A+ AD T+LV+G DQSIE E+LDR L
Sbjct: 456 QGLGAKVNTVYQPGCTNVGCSGNSLQLSTAVAAAASADVTVLVVGADQSIERESLDRTSL 515
Query: 502 LLPGRQQELV 511
LLPG+Q +LV
Sbjct: 516 LLPGQQTQLV 525
>gi|340370206|ref|XP_003383637.1| PREDICTED: probable beta-D-xylosidase 5-like [Amphimedon
queenslandica]
Length = 728
Score = 594 bits (1531), Expect = e-167, Method: Compositional matrix adjust.
Identities = 325/767 (42%), Positives = 456/767 (59%), Gaps = 58/767 (7%)
Query: 5 IAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQE 64
+ F L + L ++S + + PF T +C + IP+RVNDL+ R+++ +
Sbjct: 3 MKFTVLLVFLFASSVADYCEKAPF---------NTYKYCDYTQSIPERVNDLLSRMTILD 53
Query: 65 KVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFN 124
K+ LI+ A A+P L I Y+WWSE LHGV+ PG FGG+FP ATSFPQVI ++FN
Sbjct: 54 KIPQLITSAPAIPSLDIPAYQWWSEGLHGVAG-SPGVHFGGNFPNATSFPQVIGLGATFN 112
Query: 125 ATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAA 184
+L A+ +V+S EARA NGG AGLTY++PN+NIFRDPRWGRGQETPGEDP LS +YAA
Sbjct: 113 MSLVLAMAQVISTEARAFANGGQAGLTYFAPNINIFRDPRWGRGQETPGEDPYLSSQYAA 172
Query: 185 SYVRGLQ-GSDGDR-LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRM 242
++V+G+Q G+D R LK A+CKH+ AYDL+N+ + R FNA VS QD E+T+ FR
Sbjct: 173 NFVKGMQEGADDTRYLKTIATCKHYAAYDLENYLNLSRHTFNAIVSDQDFEETYFPAFRS 232
Query: 243 CVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFT 302
CV EGKV S+MCSYN VNGVP+CA+ I RG+W GY+VSDC ++ ++ +T
Sbjct: 233 CVEEGKVGSIMCSYNAVNGVPSCANDFINNEVARGKWGFEGYVVSDCGAISDIINSHKYT 292
Query: 303 STPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDG 362
S ++ A +R G DL+CG F H ++A G +++ DI+ A+ T +MRLGMFD
Sbjct: 293 SNTDDTVAAGLRGGCDLNCGHFYSDHAQAAYDNGAITDDDIDRAMTRLFTYRMRLGMFD- 351
Query: 363 EPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSD 422
PS QP+ V T H+ LAL+A+R+ IVLL+N LPLS HR +A++GP+
Sbjct: 352 PPSMQPFRDYTNDKVDTKQHEALALDASRESIVLLQNNKDILPLSLTTHRKIALVGPHGQ 411
Query: 423 VTVTMIGNYAGIACGYTTPLQGIGRYARTI-HQQGCKDVACADDQLFGAAIDASRQ--AD 479
M GNY G A +P+QG+ ++ GC VAC F + +
Sbjct: 412 AQGAMQGNYKGTAPYLISPMQGLQDLGLSVTFAAGCTQVACPTIAGFSEVTKLVEEHSIE 471
Query: 480 ATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKG--PTILVLMSGGPIDVAFA 537
A I V+GLD+S E+E DR L LPG+Q +L+ + + P I+V+MSGGP+D++
Sbjct: 472 AIIAVIGLDESQESEGHDRTSLTLPGQQVQLLEDIKKKAVPGIPFIVVVMSGGPVDLSGV 531
Query: 538 KNDPRIA-AIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRP 596
K+ IA AI+WAGYPGQ+GG AIA++++G NP G+LP+T+YP YI +P T M+MR
Sbjct: 532 KD---IADAILWAGYPGQSGGQAIAEVIYGKVNPSGRLPVTFYPASYINEIPYTNMSMRV 588
Query: 597 SQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGK 656
PGR+Y+FY G V+PFG G+SYT F N P V
Sbjct: 589 P-----PGRSYKFYTGTPVFPFGFGLSYTTFEMKWKNPPNVT------------------ 625
Query: 657 AIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF--STPPAGHWAPHKQLVAFEKVHVP 714
+K TH + + +V V N G + G+ ++L + ST P AP K+L F+K+++
Sbjct: 626 HLKTTH----DVDVNYEVVVTNAGKRSGSVSVLAYITSTVPG---APMKELFGFQKIYLK 678
Query: 715 AGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGT---KHSV 758
+ + K + VD+ G R+I G + I IG T KH+V
Sbjct: 679 PEQSMTLSF-VAEPKVFTTVDKHGERKIRPGTYKITIGDTSDLKHTV 724
>gi|326431595|gb|EGD77165.1| beta-glucosidase [Salpingoeca sp. ATCC 50818]
Length = 900
Score = 593 bits (1528), Expect = e-166, Method: Compositional matrix adjust.
Identities = 329/748 (43%), Positives = 458/748 (61%), Gaps = 20/748 (2%)
Query: 13 ILLSASSSGLA----AREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKL 68
+ + A+ SGL + P CD + ++LPFC +L R+ DLI R++ + L
Sbjct: 153 LTIRANDSGLCLDAGSPTPRTCDVEPG--KSLPFCNTALSYDDRIRDLISRINDSDLPGL 210
Query: 69 LISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLW 128
L++ A V L + Y+WWSEALHGV + PG FGGD P ATSFPQVI T ++FN TL+
Sbjct: 211 LVNSATGVEHLNLPAYQWWSEALHGVGH-SPGVHFGGDVPAATSFPQVIHTGATFNKTLY 269
Query: 129 EAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVR 188
IG V+S EARAM N AG T+W+PN+NI RDPRWGRGQETPGEDP +G+YAA++V
Sbjct: 270 RKIGTVISTEARAMNNVQRAGNTFWAPNINIIRDPRWGRGQETPGEDPFATGEYAANFVS 329
Query: 189 GLQ-GSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEG 247
G Q G D + +K ++ CKHF Y+L+NW+GVDR H+NA + QDI DT+ F CV G
Sbjct: 330 GFQDGEDMNYIKASSCCKHFFDYNLENWHGVDRHHYNAIATDQDIADTYLPSFEACVRYG 389
Query: 248 KVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEE 307
+ + +MCSYN VNGVP+CA+ +I+ R W +GYI SDC +V ++ FT E
Sbjct: 390 RASGLMCSYNAVNGVPSCANGDIMTVMARESWGFDGYITSDCGAVADVLNSHKFTRNTSE 449
Query: 308 AAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQ 367
+ AG+D DCG F+ + A+Q G++ +N AL VQ RLG+FD S Q
Sbjct: 450 TIRAVLEAGMDTDCGSFVQQYLAKAMQEGVVPRELVNTALHRLFMVQFRLGLFD-PVSKQ 508
Query: 368 PYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTM 427
PY + V TP +Q+LALEAA+QGIVLLKN LPL H VA+IGPN+D T M
Sbjct: 509 PYTNYSVARVNTPANQQLALEAAQQGIVLLKNTNARLPLKTGLH--VALIGPNADATTVM 566
Query: 428 IGNYAGIACGYTTPLQGIGRYARTI-HQQGCKDVACADDQLFGAAIDASRQADATILVMG 486
GNY G A +P++G Y+ + + +GC DVAC D F AA+ A+++ADA ++V+G
Sbjct: 567 QGNYQGTAPFLISPVRGFKNYSAAVTYAKGC-DVACKDTSGFDAAVAAAKEADAVVVVVG 625
Query: 487 LDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAI 546
LDQ E+E DR + LPG Q++LV++V+ A+K P ++ +M+GG +D++ K + +A I
Sbjct: 626 LDQGQESEGHDRTSITLPGHQEDLVAQVAAAAKSPIVVFVMTGGAVDLSTIKANKNVAGI 685
Query: 547 IWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRT 606
+W GYPGQ+GG A+AD++FG +PGG+LP T YP Y+ M + MRP+++ PGRT
Sbjct: 686 LWCGYPGQSGGQAMADVVFGAVSPGGRLPYTIYPGSYVDACSMLDNGMRPNKTSGNPGRT 745
Query: 607 YRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCN 666
YRFY G VY +G G+SYT+F + + T+ + +A + K I+ +
Sbjct: 746 YRFYTGKPVYEYGTGLSYTSFSYHIHYLNTMDTSLATVQTYVQDAKQNHKFIRYDAPEFT 805
Query: 667 RLTLGVQVDVKNVGSKDGAHTLLVFSTP--PAGHWAPHKQLVAFEKVHVPAGAQQRVGIN 724
R V+V+V NVG GA + VF P PA AP K L+ FE+V + G V +
Sbjct: 806 R----VEVNVTNVGRVAGADVVQVFVEPKTPAELGAPIKTLIGFERVFLNPGQWTIVQFS 861
Query: 725 IHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
++ L+ VD SG R GE +HIG
Sbjct: 862 VNAHD-LTFVDASGKRVARAGEWLVHIG 888
>gi|62321271|dbj|BAD94481.1| beta-xylosidase [Arabidopsis thaliana]
Length = 523
Score = 587 bits (1512), Expect = e-165, Method: Compositional matrix adjust.
Identities = 291/524 (55%), Positives = 366/524 (69%), Gaps = 10/524 (1%)
Query: 244 VMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTS 303
V++G VASVMCSYNQVNG PTCADP++L IRGEW+LNGYIVSDCDSV V Y QH+T
Sbjct: 3 VVDGNVASVMCSYNQVNGKPTCADPDLLSGVIRGEWKLNGYIVSDCDSVDVLYKNQHYTK 62
Query: 304 TPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGE 363
TP EAAA +I AGLDL+CG FLG HTE AV+ GL++E I+ A+ N MRLG FDG
Sbjct: 63 TPAEAAAISILAGLDLNCGSFLGQHTEEAVKSGLVNEAAIDKAISNNFLTLMRLGFFDGN 122
Query: 364 PSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDV 423
P +Q YG LGP DVCT +QELA +AARQGIVLLKN G LPLS +T+AVIGPN++V
Sbjct: 123 PKNQIYGGLGPTDVCTSANQELAADAARQGIVLLKNTG-CLPLSPKSIKTLAVIGPNANV 181
Query: 424 TVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATIL 483
T TMIGNY G C YTTPLQG+ T + GC +VACA + GA A+ AD ++L
Sbjct: 182 TKTMIGNYEGTPCKYTTPLQGLAGTVSTTYLPGCSNVACAVADVAGATKLAA-TADVSVL 240
Query: 484 VMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRI 543
V+G DQSIEAE+ DR L LPG+QQELV +V+ A+KGP +LV+MSGG D+ FAKNDP+I
Sbjct: 241 VIGADQSIEAESRDRVDLRLPGQQQELVIQVAKAAKGPVLLVIMSGGGFDITFAKNDPKI 300
Query: 544 AAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYP 603
A I+W GYPG+AGG AIADI+FG NP GKLPMTWYPQ Y+ +PMT M MRP ++ YP
Sbjct: 301 AGILWVGYPGEAGGIAIADIIFGRYNPSGKLPMTWYPQSYVEKVPMTIMNMRPDKASGYP 360
Query: 604 GRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHA 663
GRTYRFY G VY FG G+SYT F HT+ AP++V++ L+ H + + +++
Sbjct: 361 GRTYRFYTGETVYAFGDGLSYTKFSHTLVKAPSLVSLGLEENH--VCRSSECQSLDAIGP 418
Query: 664 KCNRLTLG------VQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGA 717
C G V + V+N G ++G HT+ +F+TPPA H +P K LV FEK+ +
Sbjct: 419 HCENAVSGGGSAFEVHIKVRNGGDREGIHTVFLFTTPPAIHGSPRKHLVGFEKIRLGKRE 478
Query: 718 QQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
+ V + +CK LSVVD G R+I LG+H +H+G KHS+S+
Sbjct: 479 EAVVRFKVEICKDLSVVDEIGKRKIGLGKHLLHVGDLKHSLSIR 522
>gi|340370204|ref|XP_003383636.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
queenslandica]
Length = 755
Score = 583 bits (1503), Expect = e-163, Method: Compositional matrix adjust.
Identities = 324/762 (42%), Positives = 451/762 (59%), Gaps = 51/762 (6%)
Query: 3 STIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSL 62
S IA F L+ L AS + + PF +C S I +RV DL+ RL++
Sbjct: 31 SAIAVF---LLFLVASRADYCEKPPF---------NAYLYCNYSASITERVKDLLSRLTV 78
Query: 63 QEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASS 122
EK+ + A+A+ RL I Y+WWSE LHG++ PG F D ATSFPQVI ++
Sbjct: 79 LEKMSQTATNASAIERLDIPAYDWWSECLHGLAQ-SPGVFFENDLTSATSFPQVIGLGAT 137
Query: 123 FNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKY 182
FN +L A+G+V+S EARA N G +GLT+++PN+NI+RDPRWGRGQETPGEDP L+ +Y
Sbjct: 138 FNMSLVLAMGQVISTEARAFANNGQSGLTFFAPNINIYRDPRWGRGQETPGEDPYLTSQY 197
Query: 183 AASYVRGLQGSDGDR--LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPF 240
AA++V+G+Q DR LK A+CKH+ AY+L+ + V R +FNA VS QD+E+T+ F
Sbjct: 198 AANFVKGIQEGSEDRRYLKAIATCKHYAAYNLERYLDVRRVNFNAIVSDQDLEETYLPAF 257
Query: 241 RMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQH 300
+ CV EG+V S+MCSYN +NGVP CA+ I + R W GYIVSDC ++ +
Sbjct: 258 KACVQEGQVGSIMCSYNAINGVPNCANDFINNKIARDTWGFEGYIVSDCGAILDIQYKHN 317
Query: 301 FTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMF 360
+TS ADA++ G DL+CG F + E A ++E DI+ +L T +MRLGMF
Sbjct: 318 YTSDTNITVADALKGGCDLNCGHFYEKYMEDAFDNSTITEEDIDKSLTRLFTSRMRLGMF 377
Query: 361 DGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPN 420
D P QP+ KDV TP+ Q+LAL AAR+GIVLL+N+G LPL ++H +A IGPN
Sbjct: 378 D-PPEIQPFRQYSVKDVNTPEAQDLALNAAREGIVLLQNKGSVLPLDIVKHSNIAAIGPN 436
Query: 421 SDVTVTMIGNYAGIACGYTTPLQGIGRYA-RTIHQQGCKDVACADDQLFGAAIDASRQAD 479
+D T M GNY GIA +PLQG +Q GC VAC D + F A+ A + D
Sbjct: 437 ADATHIMQGNYHGIAPYLISPLQGFSNLGINATYQIGCP-VACNDTEGFPDAVKAVQGVD 495
Query: 480 ATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSM-ASKG-PTILVLMSGGPIDVAFA 537
A I V+GL+ + E E+ DR + LPG Q++L+ ++ A+KG P I+V+MSGG +D+
Sbjct: 496 AVIAVIGLNNTQEGESHDRTSIALPGHQEDLLLELKKNAAKGTPLIVVVMSGGSVDLTGV 555
Query: 538 KNDPRIA-AIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRP 596
K+ IA AI+WAGYPGQ+GG AIA++++G NP G+LP+T+YP YI +P T M+MR
Sbjct: 556 KD---IADAILWAGYPGQSGGQAIAEVIYGKVNPSGRLPVTFYPASYINEIPYTNMSMRV 612
Query: 597 SQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGK 656
PGR+Y+FY G V+PFG G+SYT F + +T
Sbjct: 613 P-----PGRSYKFYTGTPVFPFGFGLSYTTFEIKWKDT----------------STAKDY 651
Query: 657 AIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAG 716
+K TH + + + V N GS+ G+ ++L F T AP K+L AF+K+++
Sbjct: 652 YLKTTHDE----VVNYEATVTNSGSRPGSVSVLAFITSSVPG-APMKELFAFKKIYLEPT 706
Query: 717 AQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSV 758
V + K + VD G R+I G + I IG H +
Sbjct: 707 ESVDVSF-VAEPKVFTTVDIYGIRKIRPGAYKIIIGDDDHHI 747
>gi|226491558|ref|NP_001146416.1| uncharacterized protein LOC100279996 [Zea mays]
gi|223975771|gb|ACN32073.1| unknown [Zea mays]
Length = 507
Score = 573 bits (1478), Expect = e-161, Method: Compositional matrix adjust.
Identities = 283/509 (55%), Positives = 360/509 (70%), Gaps = 5/509 (0%)
Query: 253 MCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADA 312
MCSYNQVNG PTCAD ++L IRG+W+LNGYI SDCDSV V Y+ QH+T TPE+AAA +
Sbjct: 1 MCSYNQVNGKPTCADKDLLSGVIRGDWKLNGYISSDCDSVDVLYNNQHYTKTPEDAAAIS 60
Query: 313 IRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHL 372
I+AGLDL+CG FL HT +AVQ G LSE D++ A+ N L MRLG FDG+P P+G+L
Sbjct: 61 IKAGLDLNCGTFLAQHTVAAVQAGKLSESDVDRAVTNNLVTLMRLGFFDGDPRELPFGNL 120
Query: 373 GPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYA 432
GP DVCTP +QELA EAARQGIVLLKN G LPLS +++AVIGPN++ + TMIGNY
Sbjct: 121 GPSDVCTPSNQELAREAARQGIVLLKNTG-KLPLSATSIKSMAVIGPNANASFTMIGNYE 179
Query: 433 GIACGYTTPLQGIGRYARTIHQQGCKDVACADDQL-FGAAIDASRQADATILVMGLDQSI 491
G C YTTPLQG+G T++Q GC +V C+ + L AA A+ AD T+LV+G DQSI
Sbjct: 180 GTPCKYTTPLQGLGANVATVYQPGCTNVGCSGNSLQLDAATKAAASADVTVLVVGADQSI 239
Query: 492 EAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGY 551
E E+LDR LLLPG+Q +LVS V+ AS GP ILV+MSGGP D++FAK+ +IAAI+W GY
Sbjct: 240 ERESLDRTSLLLPGQQPQLVSAVANASSGPCILVVMSGGPFDISFAKSSDKIAAILWVGY 299
Query: 552 PGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYK 611
PG+AGG AIAD+LFG NP G+LP+TWYP+ + T +PMT+M MRP S YPGRTYRFY
Sbjct: 300 PGEAGGAAIADVLFGYHNPSGRLPVTWYPESF-TKVPMTDMRMRPDPSTGYPGRTYRFYT 358
Query: 612 GPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLG 671
G VY FG G+SYT+F H + +AP +A+ L H + T +++ A C L
Sbjct: 359 GDTVYAFGDGLSYTSFAHHLVSAPKQLALQLAEGHACL--TEQCPSVEAEGAHCEGLAFD 416
Query: 672 VQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYL 731
V + V+N G + G HT+ +FS+PPA H AP K L+ FEKV + G V + VCK L
Sbjct: 417 VHLRVRNAGERSGGHTVFLFSSPPAVHNAPAKHLLGFEKVSLEPGQAGVVAFKVDVCKDL 476
Query: 732 SVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
SVVD G R++ LG H +H+G KH+++L
Sbjct: 477 SVVDELGNRKVALGSHTLHVGDLKHTLNL 505
>gi|301110280|ref|XP_002904220.1| beta-D-xylosidase, putative [Phytophthora infestans T30-4]
gi|262096346|gb|EEY54398.1| beta-D-xylosidase, putative [Phytophthora infestans T30-4]
Length = 709
Score = 565 bits (1455), Expect = e-158, Method: Compositional matrix adjust.
Identities = 301/719 (41%), Positives = 423/719 (58%), Gaps = 57/719 (7%)
Query: 57 IGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQV 116
+ R+ L + V LL++ AA P + I YEWW+EALHGV+ + PG F G ATSFPQV
Sbjct: 12 LTRIPLDQAVGLLVNKAAPAPSVNIPSYEWWNEALHGVA-LSPGVTFKGSITAATSFPQV 70
Query: 117 ITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDP 176
++TA+SFN +L+ I V+S EARA +N AGLT+W+PNVNIFRDPRWGRGQETPGEDP
Sbjct: 71 LSTAASFNRSLFYQIADVISTEARAFHNAKDAGLTFWTPNVNIFRDPRWGRGQETPGEDP 130
Query: 177 VLSGKYAASYVRGLQGS--------DGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVS 228
L+G+YA ++VRGLQG + LK+++ CKHF+AY + V R NA V+
Sbjct: 131 YLTGEYAVAFVRGLQGEGMEGREVENSKFLKISSCCKHFSAYSQE----VPRHRNNAMVT 186
Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
KQD DT+ F CV G V+S+MCSYN VNG+P+CAD +L +RG+W+ +GYI SD
Sbjct: 187 KQDQADTYFPAFEDCVKRGHVSSIMCSYNAVNGIPSCADKGLLTDLVRGQWKFDGYIASD 246
Query: 289 CDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALV 348
C++V D H+T +PE+ A + AG+DL+CG FL H A+++G+++ I+NAL
Sbjct: 247 CEAVADVIDHHHYTQSPEQTCATTLDAGMDLNCGEFLRQHLPKALEQGIVTTEMIHNALK 306
Query: 349 NTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS- 407
N V MRLGMF+ +P+ ++ V T H++LALEAARQ IVLLKN G +LPL+
Sbjct: 307 NQFRVLMRLGMFE---KVEPFANITKDSVDTTMHRQLALEAARQSIVLLKNDGNTLPLAT 363
Query: 408 --HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTI-HQQGCKDVACAD 464
R R++A+IGP+ + + ++GNY GI TPL+GI ++ + H GCK V+
Sbjct: 364 KDFTRDRSLALIGPHFNASAALLGNYFGIPSHIVTPLEGISQFVPNVAHSLGCK-VSGEV 422
Query: 465 DQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTIL 524
F AI +++AD I+ +GLDQS E E +DR + LP Q L+ +V + P +
Sbjct: 423 LPDFDDAIAVAKKADRLIVFVGLDQSQEREEIDRYHIGLPAFQSTLLKRVLEVASHPIVF 482
Query: 525 VLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYI 584
V++SGG +D++ KN P++ AI++ GY GQAGG A+AD+LFG NP GKLP T+Y EY+
Sbjct: 483 VVISGGCVDLSAYKNHPKVGAIVFGGYLGQAGGQALADVLFGKYNPSGKLPQTFYDSEYV 542
Query: 585 TNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDG 644
+ + +M MRP+ GRTYRF+ G VY FG G+SYT F
Sbjct: 543 NAMSIYDMHMRPTPVTGNSGRTYRFFTGVPVYEFGFGLSYTTF----------------- 585
Query: 645 RHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPP-AGHWA-PH 702
H C+ + V N G+ G +L + PP AG P
Sbjct: 586 -----------------HKNCHACVATFNITVTNAGAISGEDVILTYVEPPLAGEGGRPL 628
Query: 703 KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
K LVAFE+ + A Q+ K ++ + +G + G IH+ +H V +
Sbjct: 629 KSLVAFERTPLIAAGQRATAKICLEAKAFALANEAGNWVVEPGNWTIHVDTLQHKVDIQ 687
>gi|340377241|ref|XP_003387138.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
queenslandica]
Length = 733
Score = 563 bits (1452), Expect = e-158, Method: Compositional matrix adjust.
Identities = 305/726 (42%), Positives = 425/726 (58%), Gaps = 43/726 (5%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
+C L RV DL+ RL+L+EK+ L + A+A+ RLGI GY+WWSE LHGV+ V PG
Sbjct: 37 YCNYRLSFKDRVKDLLSRLTLEEKISQLGNSASAIDRLGIPGYQWWSEGLHGVA-VSPGL 95
Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFR 161
GG+ TSFPQ+ITTASSFN +L+ IG VS EAR + G GLTY++PN+NI R
Sbjct: 96 HLGGNLTCTTSFPQIITTASSFNKSLFYEIGEAVSTEARGFADNGQGGLTYFTPNINIVR 155
Query: 162 DPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNW-NGVDR 220
DPRWGRGQET GEDP L+ +YA + VRG QG+D + K+ A+CKHF AYDL+++ NG R
Sbjct: 156 DPRWGRGQETAGEDPYLTSQYAVNLVRGAQGNDSEYKKIIATCKHFAAYDLESYINGDVR 215
Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
FNA+V+KQD+E+T+ FR CV G V S+MCSYN VNGVP+C D + R +W+
Sbjct: 216 DSFNAEVTKQDLEETYFPAFRSCVTAGGVGSIMCSYNSVNGVPSCVDGVFNNKIARNKWK 275
Query: 281 LNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSE 340
+GY+VSDC ++ + H+TSTP + A ++ G DL+CG F H A G ++E
Sbjct: 276 FDGYLVSDCGAIDDVMNKHHYTSTPTDTVAAGLKGGTDLNCGSFYQTHAMDAFLNGSITE 335
Query: 341 IDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
+DI+ A+ T +MRLG+FD P QPY + V T HQ+LAL+AAR+ IVLL+N
Sbjct: 336 VDIDRAVGRLFTARMRLGLFD-LPKYQPYSYFNTDVVNTKQHQDLALQAARESIVLLQNN 394
Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI-GRYARTIHQQGCKD 459
G LPLS+ H +AV+GPN VTM G IA +P+ G + + GC D
Sbjct: 395 G-KLPLSYEDHHKIAVVGPNILANVTMQGISQVIAPYLISPVDGFKSKGLHVTYSLGC-D 452
Query: 460 VACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELV-----SKV 514
V C F A + A A + VMGLDQ IE E +DR + LPG Q + + +
Sbjct: 453 VKCIVTDGFHDAFKLVKDAKAVVAVMGLDQGIERETVDREDIFLPGLQDKFLLGLRDTLT 512
Query: 515 SMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKL 574
++ S P I+V+MSG +D++ +K+ AI+W GYPGQ+GG AIA++++G NP G+L
Sbjct: 513 NLQSPVPLIVVIMSGSSVDLSESKS--LADAILWVGYPGQSGGQAIAEVIYGEVNPSGRL 570
Query: 575 PMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANA 634
P+T+YP EYI + M+MR PGRTYRFY V+PFGHG+SYT F + N
Sbjct: 571 PLTFYPGEYIDLVAYRHMSMREP-----PGRTYRFYTENPVFPFGHGLSYTTFELSWTN- 624
Query: 635 PTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLL--VFS 692
+ ++ + ++ + + + V N G GA ++L V S
Sbjct: 625 ----------KMNNVTEIVISDSVDI--------NIDFDITVVNTGYLSGAVSVLGYVSS 666
Query: 693 TPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
P AP ++L F+KV + +++ + + VD G R I GE++I I
Sbjct: 667 NIPD---APLRELFDFDKVFIDKYESKKISL-FATNDAFTTVDEKGRRNILPGEYDIAIE 722
Query: 753 GTKHSV 758
H +
Sbjct: 723 NLSHKI 728
>gi|320170454|gb|EFW47353.1| beta-xylosidase [Capsaspora owczarzaki ATCC 30864]
Length = 779
Score = 563 bits (1450), Expect = e-157, Method: Compositional matrix adjust.
Identities = 317/732 (43%), Positives = 438/732 (59%), Gaps = 36/732 (4%)
Query: 33 KDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALH 92
+DA R LPFC +L QR +DL+GRL+LQEK+ + A V RLG+ YEWWSEALH
Sbjct: 28 EDAALRNLPFCNPNLAWEQRADDLVGRLTLQEKISQFGTTAPGVARLGVNAYEWWSEALH 87
Query: 93 GVSNVGPGTKFGGDFPGATSFPQVI--------TTASSFNATLWEAIGRVVSDEARAMYN 144
GV+ PG F G+ P +T FPQ+I ++FN A+ +V+S EARA N
Sbjct: 88 GVAE-SPGVNFTGNTPVSTCFPQIIGNNCSSLSRVGATFNLDSVAAMAQVISTEARAFAN 146
Query: 145 GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQ-GSDGDRLKVAAS 203
G AGLTY++PN+NIFRDPRWGRGQETPGEDP L+ +Y + V+ LQ G D LKV A+
Sbjct: 147 AGHAGLTYFTPNINIFRDPRWGRGQETPGEDPYLTSRYVETLVQNLQNGEDARYLKVVAT 206
Query: 204 CKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVP 263
CKH+TAYD+++W G+DRFHFNA VS QD+ +TF PF CV GK AS+MCSYN VNG+P
Sbjct: 207 CKHYTAYDMEDWGGIDRFHFNAVVSDQDLVETFMPPFEACVRVGKGASLMCSYNAVNGIP 266
Query: 264 TCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGP 323
+CAD I R +W +GYIVSDC ++ T ++T+T + A I+ G DLDCG
Sbjct: 267 SCADDFINNEIAREQWGFDGYIVSDCGAIDCIQYTHNYTNTTQATCAAGIQGGCDLDCGD 326
Query: 324 FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQ 383
F H A+ L E D++ +L ++RLG FD S QPY + + + +HQ
Sbjct: 327 FYQSHLMDAIGNATLHEADLDFSLRRLFGHRIRLGEFDAA-SIQPYRQIPVSAINSQEHQ 385
Query: 384 ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQ 443
ELAL+ AR+ IVLL N +LP S R +A+IGPN+D T++GNY G A TPL+
Sbjct: 386 ELALQIARESIVLLGNDNNTLPFSLATVRKLAIIGPNADDAETLLGNYYGDAPYLITPLK 445
Query: 444 GIGRYARTI---HQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAG 500
G + T+ +GC DV D F AA A++ ADATI+V+GL+Q++E+E LDR
Sbjct: 446 GFQQLDPTLSITFVKGC-DVNSTDTSGFVAAAAAAKAADATIVVVGLNQTVESENLDRTT 504
Query: 501 LLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
L+LPG Q EL+ ++ A++GP ILV+MSG PID++ + R A +W GYPGQAGG A+
Sbjct: 505 LVLPGVQAELILALTAAARGPVILVVMSGSPIDLSNVIHPVR--AALWIGYPGQAGGRAL 562
Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGH 620
A+ +FG +P G+LP T YP +Y+ LPMT M MR PGRTYRFY G ++ FGH
Sbjct: 563 AEAVFGVFSPAGRLPFTVYPADYVNQLPMTNMDMRAG-----PGRTYRFYTGTPLFEFGH 617
Query: 621 GMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVG 680
G+SY+ F +T +N+ + + S++ A ++ +V V+N G
Sbjct: 618 GLSYSTFQYTWSNS-SSSSSSSATSQHSLSTAALAAQHLAARAPVEAVSF--RVLVQNTG 674
Query: 681 SKDGAHTLLVFSTPPAGHWA----------PHKQLVAFEKVHVPAGAQQRVGINIHVCKY 730
+L F++ A P + LV F ++H+ GA Q + + +
Sbjct: 675 KMASDDVVLAFASFNASSIIDQSSSQFASPPIRSLVGFRRIHLAPGASQEIFFAVTSSQ- 733
Query: 731 LSVVDRSGTRRI 742
L+ VD +G + +
Sbjct: 734 LAQVDSTGAQTL 745
>gi|413919687|gb|AFW59619.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 451
Score = 549 bits (1414), Expect = e-153, Method: Compositional matrix adjust.
Identities = 264/411 (64%), Positives = 316/411 (76%), Gaps = 3/411 (0%)
Query: 28 FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
FACD +AT + FC S R DL+ RL+L EKV L+ AA+PRLG+ YEWW
Sbjct: 40 FACDASNATLASYGFCNRSAAAAARAADLVSRLTLAEKVGFLVDKQAALPRLGVPLYEWW 99
Query: 88 SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT 147
SEALHGVS VGPGT+F PGATSFPQ I TA+SFNATL+ AIG VVS+EARAM+N G
Sbjct: 100 SEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNATLFRAIGEVVSNEARAMHNVGL 159
Query: 148 AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG--SDGDRLKVAASCK 205
AGLT+WSPN+NIFRDPRWGRGQETPGEDP+L+ KYA YV GLQG S LKVAA CK
Sbjct: 160 AGLTFWSPNINIFRDPRWGRGQETPGEDPLLTSKYAVGYVTGLQGAVSGAGALKVAACCK 219
Query: 206 HFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTC 265
H+TAYD+DNW GV+R+ F+A VS+QD++DTF PF+ CV++G VASVMCSYNQVNG PTC
Sbjct: 220 HYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQPPFKSCVVDGNVASVMCSYNQVNGKPTC 279
Query: 266 ADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFL 325
AD ++L IRG+W+LNGYI SDCDSV V Y+ QH+T TPE+AAA +I+AGLDL+CG FL
Sbjct: 280 ADKDLLSGVIRGDWKLNGYISSDCDSVDVLYNNQHYTKTPEDAAAISIKAGLDLNCGTFL 339
Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
HT +AVQ G LSE D++ A+ N L MRLG FDG+P P+G+LGP DVCTP +QEL
Sbjct: 340 AQHTVAAVQAGKLSESDVDRAVTNNLVTLMRLGFFDGDPRELPFGNLGPSDVCTPSNQEL 399
Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIAC 436
A EAARQGIVLLKN G LPLS +++AVIGPN++ + TMIGNY G +C
Sbjct: 400 AREAARQGIVLLKNTG-KLPLSATSIKSMAVIGPNANASFTMIGNYEGTSC 449
>gi|167525174|ref|XP_001746922.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774702|gb|EDQ88329.1| predicted protein [Monosiga brevicollis MX1]
Length = 1620
Score = 546 bits (1406), Expect = e-152, Method: Compositional matrix adjust.
Identities = 305/725 (42%), Positives = 416/725 (57%), Gaps = 52/725 (7%)
Query: 27 PFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEW 86
P CD A + PFC SL + R+ D+I RLS+Q+KV L + A A G+ Y+W
Sbjct: 913 PRPCDELPA--KNFPFCNASLDLDTRIRDVISRLSIQDKVALTANTAGAAADAGLPAYQW 970
Query: 87 WSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGG 146
WSEALHGV PG F G ATSFPQVI T++SFN TLW IG +S EARAM N
Sbjct: 971 WSEALHGVG-FSPGVTFMGKVQAATSFPQVIHTSASFNKTLWHHIGMTISTEARAMNNVN 1029
Query: 147 TAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQ-GSDGDRLKVAASCK 205
AGLT+W+PN+NI RDPRWGRGQETPGEDP +G YAA++V G+Q G D +K ++ CK
Sbjct: 1030 QAGLTFWAPNINIIRDPRWGRGQETPGEDPYATGLYAANFVPGMQEGEDTRYIKASSCCK 1089
Query: 206 HFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTC 265
HF Y+L++W+ VDR HFNA + QDI DT+ F CV G+ +S+MCSYN VNGVP+C
Sbjct: 1090 HFFDYNLEDWHNVDRHHFNAIATDQDIADTYLPAFESCVRFGRASSLMCSYNAVNGVPSC 1149
Query: 266 ADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFL 325
A+ +I+ R W +GYI SDC +V Y + +T + AG+D+DCG FL
Sbjct: 1150 ANADIMTTLAREAWGFDGYITSDCGAVEDVYSNHKYYNTTGATVNGVLSAGMDVDCGSFL 1209
Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
H A+ G ++ ++ AL N VQ RLGMFD QPY +L V TP+HQ+L
Sbjct: 1210 SQHLADAIDSGDVTNATVDQALYNLFRVQFRLGMFD-PAEDQPYLNLTTDAVNTPEHQQL 1268
Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
ALEAARQG+ LL+N+ LPL + +A+IGPN++ T M GNY G A +P QG+
Sbjct: 1269 ALEAARQGMTLLENRDSRLPLDASSIKQLALIGPNANATGVMQGNYNGKAPFLISPQQGV 1328
Query: 446 GRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPG 505
+Y + L A+ A++ AD ++V+GLDQ+ E+E DR + LPG
Sbjct: 1329 QQYVSNV-------------ALELGAVTAAKAADTVVMVIGLDQTQESEGHDREIIALPG 1375
Query: 506 RQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILF 565
Q ELV++V+ AS P ++V+M+GG +D+ K+ + GQAGG A+A+ LF
Sbjct: 1376 MQAELVAQVANASSSPIVVVVMTGGAVDLTPVKDLDNV---------GQAGGQALAETLF 1426
Query: 566 GTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYT 625
G +NPGG+LP T YP + + + M + MRP+ + PGRTYRFY G VY +G G+SYT
Sbjct: 1427 GDNNPGGRLPYTLYPADLVNQVSMFDDGMRPNATSGNPGRTYRFYTGTPVYAYGTGLSYT 1486
Query: 626 NFVHTVANAPTVVA-------VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKN 678
+F + + V+ V G+ I + + + V V+N
Sbjct: 1487 SFSYETSTPSLRVSAERVRAWVAARGQTSFIRDEVDAEDY-------------ITVTVQN 1533
Query: 679 VGSKDGAHTLLVF--STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKY-LSVVD 735
G+ GA + VF +T P P K L FE+V + G + I V + LSVV+
Sbjct: 1534 NGTVAGADVVQVFIKTTTPGADGNPIKSLCGFERVFLKPG--ETTSIQFPVTPHDLSVVN 1591
Query: 736 RSGTR 740
G R
Sbjct: 1592 SRGER 1596
>gi|300121549|emb|CBK22068.2| unnamed protein product [Blastocystis hominis]
Length = 690
Score = 539 bits (1389), Expect = e-150, Method: Compositional matrix adjust.
Identities = 291/675 (43%), Positives = 390/675 (57%), Gaps = 41/675 (6%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
+R L+ L+L EK+ L+ A+ V RL I Y+WWSEALHGV+ PG F P A
Sbjct: 3 ERARALVAELTLAEKMSLMGHTASEVKRLNIPKYQWWSEALHGVA-ASPGVVFQEPTPFA 61
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQE 170
T+FPQV TA SF+ L+ I ++S EAR M N A LTYWSPNVN++RDPRWGRGQE
Sbjct: 62 TAFPQVALTAQSFDKPLFHDIASIISTEARVMNNAERANLTYWSPNVNVYRDPRWGRGQE 121
Query: 171 TPGEDPVLSGKYAASYVRGLQ-GSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSK 229
TPGEDP L YA +VRGLQ G D LKV+A CKH++AYDL+NW+GV+RF F+A VS
Sbjct: 122 TPGEDPFLVATYAVEFVRGLQEGEDPRYLKVSACCKHYSAYDLENWHGVERFEFDAIVSD 181
Query: 230 QDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDC 289
+D+ DTF VPF CV +G V+S+MCSYN +NG+P CAD +L T RG W GYI SDC
Sbjct: 182 RDMTDTFQVPFEQCVKKGHVSSLMCSYNAINGIPACADRELLYGTARGGWGFEGYITSDC 241
Query: 290 DSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVN 349
++ H+T+ + A +RA DLDCG F H +V+ G L E ++++AL N
Sbjct: 242 GAIDTIIYNHHYTNDTDTTAMLGVRATCDLDCGGFYQQHILHSVESGRLKEAEVDDALAN 301
Query: 350 TLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHI 409
VQMRLG+FD Q Y H G + T +HQ +AL AAR+GI LLKNQ LPLS +
Sbjct: 302 LFKVQMRLGLFD-PVEQQVYTHYGLDKLNTKEHQAMALRAAREGIALLKNQNDFLPLS-L 359
Query: 410 RHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFG 469
+ + V V+GP ++ M+GNY GI T QG+ R + C V
Sbjct: 360 KDKHVVVMGPYAEDAGVMLGNYNGIPEFIVTVAQGL----RNV----CDHVDVVKSL--- 408
Query: 470 AAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMS- 528
A+ D ++ +GL+Q IE E LDR LLLP Q+ L+ + + P +L L+S
Sbjct: 409 EALSKLEGVDLIVVTVGLNQEIEREGLDREDLLLPASQRALLDGLLAQTDVPVVLTLLSG 468
Query: 529 GGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLP 588
GG +D++ + + + ++ GY G GG AIA+++ G NP G+L T Y +Y+TNL
Sbjct: 469 GGSVDISAYEQNEHVVGVLAVGYGGMFGGQAIAEVIVGDVNPSGRLVNTMYYNDYVTNLD 528
Query: 589 MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGS 648
+M MRP + +PGRTYRF+ GPV++PFG G+SYT F H V G
Sbjct: 529 YFDMNMRPKEETGFPGRTYRFFAGPVIHPFGFGLSYTTFAHAV-------------EIGQ 575
Query: 649 INATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWA-PHKQLV 706
+ H + L + V V V N GS+ G ++L+F +P AG P K L
Sbjct: 576 MR----------NHRLRSALAIDVYVKVTNTGSRQGDESVLLFVKSPLAGKQGYPLKSLA 625
Query: 707 AFEKVHVPAGAQQRV 721
F +V + G Q V
Sbjct: 626 DFSRVSLAPGETQTV 640
>gi|40363751|dbj|BAD06320.1| putative beta-xylosidase [Triticum aestivum]
Length = 573
Score = 538 bits (1385), Expect = e-150, Method: Compositional matrix adjust.
Identities = 266/571 (46%), Positives = 364/571 (63%), Gaps = 8/571 (1%)
Query: 198 LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYN 257
L+ +A CKHFTAYDL+NW GV RF F+AKV++QD+ DT++ PF+ CV +G + +MCSYN
Sbjct: 5 LEASACCKHFTAYDLENWKGVTRFAFDAKVTEQDLADTYNPPFKSCVEDGGASGIMCSYN 64
Query: 258 QVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGL 317
+VNGVPTCAD N+L +T RG+W NGYI SDCD+V + +D Q + PE+A AD ++AG+
Sbjct: 65 RVNGVPTCADHNLLSKTARGDWSFNGYITSDCDAVAIIHDVQGYAKAPEDAVADVLKAGM 124
Query: 318 DLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDV 377
D++CG ++ H SA Q+G ++ DI+ AL N ++MRLG+F+G P YG++G V
Sbjct: 125 DVNCGGYIQTHGVSAYQQGKITGEDIDRALRNLFAIRMRLGLFNGNPKYNRYGNIGADQV 184
Query: 378 CTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACG 437
C +HQ+LAL+AA+ GIVLLKN +LPLS + +VAVIGPN + ++GNY G C
Sbjct: 185 CKKEHQDLALQAAQDGIVLLKNDAGALPLSKSKVSSVAVIGPNGNNASLLLGNYFGPPCI 244
Query: 438 YTTPLQGIGRYAR-TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL 496
TP Q + Y + QGC C + G A+ A+ AD +L MGLDQ+ E E +
Sbjct: 245 SVTPFQALQGYVKDATFVQGCNAAVCNVSNI-GEAVHAASSADYVVLFMGLDQNQEREEV 303
Query: 497 DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAG 556
DR L LPG Q+ LV+KV+ A+K P ILVL+ GGP+DV FAKN+P+I AI+WAGYPGQAG
Sbjct: 304 DRLELGLPGMQESLVNKVADAAKKPVILVLLCGGPVDVTFAKNNPKIGAIVWAGYPGQAG 363
Query: 557 GTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVY 616
G AIA +LFG NPGG+LP+TWYP+E+ T +PMT+M MR S YPGRTYRFYKG VY
Sbjct: 364 GIAIAQVLFGEHNPGGRLPVTWYPKEF-TAVPMTDMRMRADPSTGYPGRTYRFYKGKTVY 422
Query: 617 PFGHGMSYTNFVHTVANA----PTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGV 672
FG+G+SY+ + H A+ P++ + S T+S ++ C+RL
Sbjct: 423 NFGYGLSYSKYSHRFASEGTKPPSMSGIEGLKATASAAGTVSYDVEEMGAEACDRLRFPA 482
Query: 673 QVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYL 731
V V+N G DG H +L+F P A P QL+ F+ VH+ A V + CK+
Sbjct: 483 VVRVQNHGPMDGRHPVLLFLRWPNATDGRPASQLIGFQSVHLRADEAAHVEFEVSPCKHF 542
Query: 732 SVVDRSGTRRIPLGEHNIHIGGTKHSVSLHA 762
S G + I G H + +G + +S A
Sbjct: 543 SRAAEDGRKVIDQGSHFVKVGDDEFELSFMA 573
>gi|125576923|gb|EAZ18145.1| hypothetical protein OsJ_33695 [Oryza sativa Japonica Group]
Length = 591
Score = 535 bits (1378), Expect = e-149, Method: Compositional matrix adjust.
Identities = 266/594 (44%), Positives = 378/594 (63%), Gaps = 12/594 (2%)
Query: 178 LSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFD 237
++ KYA ++V+G+QG+ L+ +A CKH TAYDL++WNGV R++FNAKV+ QD+EDT++
Sbjct: 1 MASKYAVAFVKGMQGNSSAILQTSACCKHVTAYDLEDWNGVQRYNFNAKVTAQDLEDTYN 60
Query: 238 VPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYD 297
PFR CV++ K +MC+Y +NGVP CA+ ++L +T+RG+W L+GYI SDCD+V + D
Sbjct: 61 PPFRSCVVDAKATCIMCAYTGINGVPACANADLLTKTVRGDWGLDGYIASDCDAVAIMRD 120
Query: 298 TQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRL 357
Q +T TPE+A A A++AGLD++CG ++ H +A+Q+G L+E DI+ AL N ++MRL
Sbjct: 121 AQRYTQTPEDAVAVALKAGLDMNCGTYMQQHATAAIQQGKLTEEDIDKALKNLFAIRMRL 180
Query: 358 GMFDGEPSSQP-YGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAV 416
G FDG+P S YG LG D+CTP+H+ LALEAA GIVLLKN LPL + AV
Sbjct: 181 GHFDGDPRSNSVYGGLGAADICTPEHRSLALEAAMDGIVLLKNDAGILPLDRTAVASAAV 240
Query: 417 IGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIH-QQGCKDVACADDQLFGAAIDAS 475
IGPN++ + +IGNY G C TTPL GI Y + + GC AC AA AS
Sbjct: 241 IGPNANDGLALIGNYFGPPCESTTPLNGILGYIKNVRFLAGCNSAACDVAATDQAAAVAS 300
Query: 476 RQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
+D L MGL Q E+E DR LLLPG QQ L++ V+ A+K P ILVL++GGP+DV
Sbjct: 301 -SSDYVFLFMGLSQKQESEGRDRTSLLLPGEQQSLITAVADAAKRPVILVLLTGGPVDVT 359
Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
FA+ +P+I AI+WAGYPGQAGG AIA +LFG NPGG+LP+TWYP+E+ T +PMT+M MR
Sbjct: 360 FAQTNPKIGAILWAGYPGQAGGLAIARVLFGDHNPGGRLPVTWYPEEF-TKVPMTDMRMR 418
Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG 655
+ YPGR+YRFY+G VY FG+G+SY+++ + + A S+ T +
Sbjct: 419 ADPATGYPGRSYRFYQGKTVYKFGYGLSYSSYSRQLVSGGK-PAESYTNLLASLRTTTTS 477
Query: 656 KAIKVTHAK------CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAF 708
+ + H + C +L V+V+N G DG H++L++ P A P QL+ F
Sbjct: 478 EGDESYHIEEIGTDGCEQLKFPAVVEVQNHGPMDGKHSVLMYLRWPNAKGGRPTTQLIGF 537
Query: 709 EKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHA 762
H+ G + + +I C++ S V + G + I G H + + + + A
Sbjct: 538 RSQHLKVGEKANIRFDISPCEHFSRVRKDGKKVIDRGSHYLMVDKDELEIRFEA 591
>gi|340370208|ref|XP_003383638.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
queenslandica]
Length = 732
Score = 532 bits (1371), Expect = e-148, Method: Compositional matrix adjust.
Identities = 304/742 (40%), Positives = 418/742 (56%), Gaps = 59/742 (7%)
Query: 38 RTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNV 97
++ +C SLPI RV DL+ R++L EK+ L + A ++ RL I Y+WWSE LHGV++
Sbjct: 30 QSFSYCNYSLPISDRVKDLLSRMTLAEKITQLGNTAGSIDRLDIPAYQWWSEGLHGVAD- 88
Query: 98 GPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNV 157
PG F G F ATSFPQVITTASSFN TL+ I V+S EARA N G+ Y+ +
Sbjct: 89 SPGVHFNGMFHNATSFPQVITTASSFNKTLYHEIAAVMSTEARAFAN---QGIVYFKQHQ 145
Query: 158 NIF--------RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTA 209
+ RDPRWGR QETPGEDP L+ +YA +V G QG D LKV +CKHF
Sbjct: 146 QLLSNYLLFYCRDPRWGRAQETPGEDPYLNSQYAIQFVTGAQG-DSKYLKVVTTCKHFAG 204
Query: 210 YDLDNW-NGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADP 268
YDL+++ +G R FNAK++ QD E+T+ F+ CV E VAS+MCSYN+VNGVP+CAD
Sbjct: 205 YDLEDYVDGETRHSFNAKITPQDFEETYYPAFKACVEEANVASIMCSYNEVNGVPSCADG 264
Query: 269 NILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLH 328
I + R W +G+I SDC ++ + H+T+ ++ A A++ G DL+CG + H
Sbjct: 265 QINNKLARDTWGFDGFIASDCGAIDDIQNKHHYTNNTDDTVAAALKGGCDLNCGSYYQSH 324
Query: 329 TESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALE 388
+SA G ++ +IN AL T +M+LGMFD P QPY + P V + +HQ LAL
Sbjct: 325 AQSAFLNGTITIGEINLALTRLFTARMKLGMFD-PPELQPYNAISPDVVNSLEHQALALN 383
Query: 389 AARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY 448
AAR+ IVLL+N LPL+ +H T+AV+GP++ T M GNY G+A +P++G
Sbjct: 384 AARESIVLLQNNNDVLPLNFEKHSTIAVVGPHAMATDVMQGNYNGVAPYLISPVEGFENL 443
Query: 449 A--RTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGR 506
+ GC DV C F A D + +ADA I V+GLDQS E+E DR L LP
Sbjct: 444 GIDSVLTASGC-DVNCEVTDGFQDAFDIAVKADAVIAVLGLDQSHESEGHDREDLFLPNL 502
Query: 507 QQELVSKV-----SMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIA 561
Q + V + + + P I+V+MSG +D+ K AI+WAGYPGQ+GG AIA
Sbjct: 503 QDKFVQDLKNTLKAAGTNAPLIVVVMSGSSVDLTVTKK--HADAILWAGYPGQSGGQAIA 560
Query: 562 DILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHG 621
+I++G NP G+LP+T+YP YI + M+MR YPGRTY+FY + FG G
Sbjct: 561 EIIYGKVNPSGRLPVTFYPGSYIDLVAFRHMSMR-----EYPGRTYKFYNDTPDFSFGDG 615
Query: 622 MSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGS 681
+SYT F + + V + T+ V V N G
Sbjct: 616 LSYTTFYLEWSKPVNMSGV----------------------RSVSYPTVVYNVTVTNTGK 653
Query: 682 KDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHV-CKYLSVVDRSGT 739
GA ++L + S +G AP K+L FEKV + Q V + K S VD+SG
Sbjct: 654 MPGAISVLAYISYNNSG--APKKKLFGFEKVFL--NPLQSVSVTFPADSKAFSTVDKSGK 709
Query: 740 RRIPLGEHNIHIGGTK-HSVSL 760
R + G++++ IG H +SL
Sbjct: 710 RSVNPGDYHVTIGDQLIHKISL 731
>gi|334187562|ref|NP_196532.2| Glycosyl hydrolase family protein [Arabidopsis thaliana]
gi|332004052|gb|AED91435.1| Glycosyl hydrolase family protein [Arabidopsis thaliana]
Length = 526
Score = 531 bits (1368), Expect = e-148, Method: Compositional matrix adjust.
Identities = 259/481 (53%), Positives = 332/481 (69%), Gaps = 5/481 (1%)
Query: 284 YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDI 343
YIVSDCDS+G+ Y +QH+T TPEEAAA +I AGLDL+CG FLG HTE+AV++GL+ E I
Sbjct: 45 YIVSDCDSLGILYGSQHYTKTPEEAAAKSILAGLDLNCGSFLGNHTENAVKKGLIDEAAI 104
Query: 344 NNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
N A+ N MRLG FDG P +QPYG LGPKDVCT +++ELA+E ARQGIVLLKN S
Sbjct: 105 NKAISNNFATLMRLGFFDGNPKNQPYGGLGPKDVCTVENRELAVETARQGIVLLKNSAGS 164
Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTI-HQQGCKDVAC 462
LPLS +T+AVIGPN++VT TMIGNY G+AC YTTPLQG+ R T + +GC +V C
Sbjct: 165 LPLSPSAIKTLAVIGPNANVTKTMIGNYEGVACKYTTPLQGLERTVLTTKYHRGCFNVTC 224
Query: 463 ADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPT 522
+ L A A+ ADAT+LVMG DQ+IE E LDR L LPG+QQELV++V+ A++GP
Sbjct: 225 TEADLDSAKTLAA-SADATVLVMGADQTIEKETLDRIDLNLPGKQQELVTQVAKAARGPV 283
Query: 523 ILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQE 582
+LV+MSGG D+ FAKND +I +I+W GYPG+AGG AIAD++FG NP GKLPMTWYPQ
Sbjct: 284 VLVIMSGGGFDITFAKNDEKITSIMWVGYPGEAGGIAIADVIFGRHNPSGKLPMTWYPQS 343
Query: 583 YITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPL 642
Y+ +PMT M MRP +S Y GRTYRFY G VY FG G+SYTNF H + AP V++ L
Sbjct: 344 YVEKVPMTNMNMRPDKSNGYLGRTYRFYIGETVYAFGDGLSYTNFSHQLIKAPKFVSLNL 403
Query: 643 DGRHGSIN---ATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW 699
D + ++ A R VQ+ V+NVG ++G T+ +F+TPP H
Sbjct: 404 DESQSCRSPECQSLDAIGPHCEKAVGERSDFEVQLKVRNVGDREGTETVFLFTTPPEVHG 463
Query: 700 APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVS 759
+P KQL+ FEK+ + + V + VCK L VVD G R++ LG H +H+G KHS +
Sbjct: 464 SPRKQLLGFEKIRLGKKEETVVRFKVDVCKDLGVVDEIGKRKLALGHHLLHVGSLKHSFN 523
Query: 760 L 760
+
Sbjct: 524 I 524
>gi|407922988|gb|EKG16078.1| Glycoside hydrolase family 3 [Macrophomina phaseolina MS6]
Length = 800
Score = 502 bits (1292), Expect = e-139, Method: Compositional matrix adjust.
Identities = 281/724 (38%), Positives = 420/724 (58%), Gaps = 36/724 (4%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C S R L+ L+L+EK+ + + VPRLGI Y+WW+EALHGV+ PG
Sbjct: 41 CDSSATPLARATALVKELTLEEKLNNTGNTSPGVPRLGIPEYQWWNEALHGVAFTYPGQP 100
Query: 103 F--GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIF 160
G+F ATSFPQ I ++F+ L + VVS EARA NGG +GL YW+PN+N +
Sbjct: 101 MTESGNFSSATSFPQPILMGAAFDDELIYEVASVVSTEARAYSNGGRSGLDYWTPNINPY 160
Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL-KVAASCKHFTAYDLDNWNGVD 219
+DPRWGRGQETPGEDP Y + +RGL+G+ D K+ A+CKHFT YD++NWNG
Sbjct: 161 KDPRWGRGQETPGEDPFHLASYVQNLIRGLEGNQNDPYKKIVATCKHFTGYDMENWNGNF 220
Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
R+ F+A+++ +D+ + + PF+ C E KV + MCSYN VNGVPTCADP +L+ +R W
Sbjct: 221 RYQFDAQINMRDMVEYYMPPFQACAREAKVGAFMCSYNAVNGVPTCADPWLLQTVLREHW 280
Query: 280 RLN---GYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRG 336
N ++VSDCD++ Y + + E+A AD + AG DL+CG + + A ++G
Sbjct: 281 GWNQEDQWVVSDCDAIQNVYLPHEWAESREQAVADTLNAGTDLNCGTYYQRYLPGAYEQG 340
Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
L+++ ++ AL T + ++LG FD SQPY +G +DV + QELAL+AA++GIVL
Sbjct: 341 LINDTTLDRALTRTYSSLIKLGYFDNA-DSQPYRQIGWQDVNSQHAQELALKAAQEGIVL 399
Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTI-HQQ 455
LKN G LPLS ++A+IG ++ T M GNYAG+A +PL + + + +
Sbjct: 400 LKNDG-LLPLSLDGVSSIALIGSWANATEQMQGNYAGVAPYLHSPLYAAEQLGVKVNYAE 458
Query: 456 GCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVS 515
G DQ +GA A+ +D I+V G+D IE+E LDR + G Q ++++K++
Sbjct: 459 GASQSNPTTDQ-WGAEYTAAENSDVIIVVGGIDNDIESEELDRVAIAWSGPQLDMITKLA 517
Query: 516 MASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLP 575
K P I+V M G +D ++ I+A++W GYPGQ GGTA+ DI+ G P G+LP
Sbjct: 518 TYGK-PVIVVQMGAGQLDSTPLVSNANISALLWGGYPGQDGGTALFDIITGAVAPAGRLP 576
Query: 576 MTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAP 635
+T YP Y + MT+M++RPS + GRTY++Y G V+PFG G+ YTNF + + P
Sbjct: 577 ITQYPARYTKEVAMTDMSLRPSSTSA--GRTYKWYNGTAVFPFGFGLHYTNFSAAIPSPP 634
Query: 636 TVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTL----GVQVDVKNVGSKDGAHTLLVF 691
+ + IS + ++L L + VD+ N G++ L F
Sbjct: 635 ------------ASSFAISDLVASCSANDTSKLDLCPFTSLAVDIANDGTRASDFVALAF 682
Query: 692 ST---PPAGHWAPHKQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEH 747
T P+ H P LVA++++H + AG Q +N+ + L VD +G + + G++
Sbjct: 683 LTGEFGPSPH--PKSSLVAYQRLHAIAAGETQTARLNLTLGS-LVRVDENGDKLLYPGDY 739
Query: 748 NIHI 751
++ I
Sbjct: 740 SVLI 743
>gi|121308314|dbj|BAF43576.1| arabinofuranosidase/xylosidase homolog [Prunus persica]
Length = 349
Score = 501 bits (1289), Expect = e-139, Method: Compositional matrix adjust.
Identities = 247/349 (70%), Positives = 288/349 (82%), Gaps = 2/349 (0%)
Query: 422 DVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADAT 481
DVTVTMIGNYAG+ACGYTTPLQGIGRY RTIHQ GC DV C +QLFGAA A+RQADAT
Sbjct: 1 DVTVTMIGNYAGVACGYTTPLQGIGRYTRTIHQAGCTDVHCNGNQLFGAAEAAARQADAT 60
Query: 482 ILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDP 541
+LVMGLDQSIEAE +DRAGLLLPG QQELVS+V+ AS+GPTILVLMSGGPIDV FAKNDP
Sbjct: 61 VLVMGLDQSIEAEFVDRAGLLLPGHQQELVSRVARASRGPTILVLMSGGPIDVTFAKNDP 120
Query: 542 RIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKR 601
RI+AIIW GYPGQAGGTAIAD+LFGT+NPGGKLPMTWYPQ Y+T+LPMT+MAMR ++
Sbjct: 121 RISAIIWVGYPGQAGGTAIADVLFGTTNPGGKLPMTWYPQNYVTHLPMTDMAMRADPARG 180
Query: 602 YPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVT 661
YPGRTYRFY+GPVV+PFG G+SYT F H +A+ PT+V+VPL + N+T+ KA++V+
Sbjct: 181 YPGRTYRFYRGPVVFPFGLGLSYTTFAHNLAHGPTLVSVPLTSLKATANSTMLSKAVRVS 240
Query: 662 HAKCNRLT-LGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQR 720
HA CN L+ L V VDVKN GS DG HTLLVF++PP G WA KQL+ F K+H+ AG+++R
Sbjct: 241 HADCNALSPLDVHVDVKNTGSMDGTHTLLVFTSPPDGKWASSKQLMGFHKIHIAAGSEKR 300
Query: 721 VGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIK 769
V I +HVCK+LSVVDR G RRIPLGEH + IG H VSL LG IK
Sbjct: 301 VRIAVHVCKHLSVVDRFGIRRIPLGEHKLQIGDLSHHVSLQ-TNLGEIK 348
>gi|297745533|emb|CBI40698.3| unnamed protein product [Vitis vinifera]
Length = 461
Score = 499 bits (1285), Expect = e-138, Method: Compositional matrix adjust.
Identities = 243/375 (64%), Positives = 291/375 (77%), Gaps = 4/375 (1%)
Query: 142 MYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDG---DRL 198
MYN G AGLT+WSPNVNIFRDPRWGRGQETPGEDP+LS KYA+ YVRGLQ SD DRL
Sbjct: 1 MYNVGLAGLTFWSPNVNIFRDPRWGRGQETPGEDPLLSSKYASGYVRGLQQSDDGSPDRL 60
Query: 199 KVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQ 258
K+AA CKH+TAYDLDNW GVDRFHFNA V+KQD++DTF PF+ CV++G VASVMCSYNQ
Sbjct: 61 KIAACCKHYTAYDLDNWKGVDRFHFNAVVTKQDMDDTFQPPFKSCVIDGNVASVMCSYNQ 120
Query: 259 VNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLD 318
VNG P CADP++L +RGEW+LNGYIVSDCDSV V+Y++QH+T TPEEAAA AI AGLD
Sbjct: 121 VNGKPACADPDLLSGIVRGEWKLNGYIVSDCDSVDVFYNSQHYTKTPEEAAAKAILAGLD 180
Query: 319 LDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVC 378
L+CG FLG HTE+AV+ GL+ E ++ A+ N MRLG FDG PS YG LGPKDVC
Sbjct: 181 LNCGSFLGQHTEAAVKGGLVDESAVDKAVSNNFATLMRLGFFDGNPSKAIYGKLGPKDVC 240
Query: 379 TPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGY 438
T +HQELA EAARQGI+LLKN SLPLS +T+A+IGPN++VT TMIGNY G C Y
Sbjct: 241 TLEHQELAREAARQGIMLLKNSKGSLPLSPTAIKTLAIIGPNANVTKTMIGNYEGTPCKY 300
Query: 439 TTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDR 498
TTPLQG+ T + GC +VAC+ Q+ A + ADAT+L++G+DQSIEAE DR
Sbjct: 301 TTPLQGLMALVATTYLSGCSNVACSTAQI-DEAKKIAAAADATVLIVGIDQSIEAEGRDR 359
Query: 499 AGLLLPGRQQELVSK 513
+ LPG+Q L+++
Sbjct: 360 VNIQLPGQQPLLITE 374
>gi|409041356|gb|EKM50841.1| glycoside hydrolase family 3 protein [Phanerochaete carnosa
HHB-10118-sp]
Length = 764
Score = 498 bits (1281), Expect = e-138, Method: Compositional matrix adjust.
Identities = 295/730 (40%), Positives = 417/730 (57%), Gaps = 29/730 (3%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C S R N L+ L+L+E V ++ + VPRLG+ Y WWSEALHGV+ + PGT
Sbjct: 38 CNPSADPTSRANALVDALTLEELVNNTVNASPGVPRLGLPPYNWWSEALHGVA-LSPGTN 96
Query: 103 F---GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNI 159
F G F ATSFPQ I ++F+ L +I V+S EARA N G AGL +++PN+N
Sbjct: 97 FSVPGSPFSSATSFPQPIILGATFDDDLVTSIATVISTEARAFNNAGRAGLDFFTPNINP 156
Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG--SDGDRLKVAASCKHFTAYDLDNWNG 217
F+DPRWGRGQETPGEDP +Y V GLQG S KV A CKHF YDL+NW G
Sbjct: 157 FKDPRWGRGQETPGEDPFHIAQYVYQLVTGLQGGLSPDPYYKVIADCKHFAGYDLENWEG 216
Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
R FNA +S QD+ + + F+ CV + V SVMCSYN VNG+P+CA+ +L+ IRG
Sbjct: 217 NSRMAFNAIISTQDLAEYYTPSFQSCVRDAHVGSVMCSYNAVNGIPSCANSYLLQDIIRG 276
Query: 278 EWRL-NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRG 336
+ L +G+I SDCD+V + +T+T A+A A++AG D+DCG AV +
Sbjct: 277 HFGLGDGWITSDCDAVANIFSPHQYTTTLVNASAVALKAGTDVDCGTTYSQTLVDAVDQN 336
Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
L++E DI N+++ +RLG FD P+ QP+ LG DV TP Q LAL AA +G+ L
Sbjct: 337 LVTEDDIKNSMIRLYRSLVRLGYFD-SPAEQPFRQLGWSDVNTPSSQALALTAAEEGVTL 395
Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQG 456
LKN G +LPLS R +A++GP ++ T M GNY GIA +PLQ + +
Sbjct: 396 LKNDG-TLPLSSAIKR-IALVGPWANATTQMQGNYQGIAPFLVSPLQALQDAGFQVTFAN 453
Query: 457 CKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSM 516
+ DD F AA+ A + ADA I G+D++IE+E DR + PG Q +LVS+++
Sbjct: 454 GTAINSTDDSGFAAAVSAVQVADAVIYAGGIDETIESEGNDREIITWPGNQLDLVSQLAA 513
Query: 517 ASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPM 576
K P +++ M GG +D + K++ + A+IW GYPGQ+GG AI +IL G P G+LP+
Sbjct: 514 VGK-PFVVLQMGGGQVDSSSLKSNKAVNALIWGGYPGQSGGAAIVNILTGKIAPAGRLPI 572
Query: 577 TWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNF-VHTVANAP 635
T YP +Y+ +PMT+MA+RP+ + PGRTY+++ G ++ FG G+ YT F + P
Sbjct: 573 TQYPADYVNEIPMTDMALRPNGTS--PGRTYKWFTGTPIFGFGFGLHYTTFSLDWAPTPP 630
Query: 636 TVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTL-GVQVDVKNVGSKDGAHTLLVFSTP 694
+ A+ +T+ +A + N L +V+VKN G + L+FS
Sbjct: 631 SSFAI----------STLVSEANTAGVSFTNLAPLFTFRVNVKNTGKVGSDYVALLFSNT 680
Query: 695 PAG-HWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHI-- 751
AG AP KQLV++ +V A Q ++ +D +G + G +NI +
Sbjct: 681 TAGPQPAPLKQLVSYTRVKGIAPGQTETAELKVTLGSIARIDENGDSALYPGRYNIWVDT 740
Query: 752 -GGTKHSVSL 760
G HS L
Sbjct: 741 TGDIVHSFEL 750
>gi|440799679|gb|ELR20723.1| betaxylosidase [Acanthamoeba castellanii str. Neff]
Length = 748
Score = 497 bits (1279), Expect = e-137, Method: Compositional matrix adjust.
Identities = 286/736 (38%), Positives = 402/736 (54%), Gaps = 80/736 (10%)
Query: 33 KDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALH 92
+DA + LPFC SL QR +DL+ RL+L + + + A AVP LGI Y WW+E LH
Sbjct: 6 QDAPLKDLPFCNTSLTAGQRTDDLVSRLTLDQLIGQMGHQAPAVPSLGIPAYNWWTECLH 65
Query: 93 GVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY 152
GV TK G + P TSFP ++FN L + R +S+EARA+ N G GL +
Sbjct: 66 GVL-----TKCGTNCP--TSFPAPCALGAAFNMKLIHKMARAISNEARALNNEGIGGLDF 118
Query: 153 WSPNV-----------------------NIFRDPRWGRGQETPGEDPVLSGKYAASYVRG 189
W+PN+ +I RDPRWGR E PGEDP ++ +Y A ++RG
Sbjct: 119 WAPNIKYSTQPTNKTRQESQLRNAMVCISINRDPRWGRNMEVPGEDPFMTAQYVAHFMRG 178
Query: 190 LQ-GSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGK 248
LQ G D +V +CKHF AY L+ W DRF F+A VS D +T+ F+ C++EG+
Sbjct: 179 LQEGEDSRYPQVVGTCKHFAAYSLEAWKDYDRFMFDAIVSDYDFVETYLPAFKGCIVEGR 238
Query: 249 VASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEA 308
S+MCSYN VNGVP+CA+ +L+ +R W +GY+VSDCD+V Y+ HFT TPE A
Sbjct: 239 ARSIMCSYNSVNGVPSCANDFLLRTILRDSWSFDGYVVSDCDAVDTIYNNHHFTKTPEGA 298
Query: 309 AADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQP 368
A A+ AG DL+CG F H A G ++E ++ A+ +M LGM+D P+ QP
Sbjct: 299 CAVALHAGTDLNCGDFYQKHLGKAHSEGRVTEDEVRLAVKRLFRQRMELGMWD-PPAEQP 357
Query: 369 YGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMI 428
Y P V + +H +LAL+AAR+ +VLL+N+ LPL R VAVIGPN++ T TM+
Sbjct: 358 YKQYPPSVVGSREHSDLALQAARESMVLLQNRRGVLPLRKSVRR-VAVIGPNANATETML 416
Query: 429 GNYAGIACGYTTPLQGIGRY---------ARTIHQQGCKDVACADDQLFGAAIDASRQAD 479
GNY G C T + Y A + GC DV + A+ A++ AD
Sbjct: 417 GNYYGSRCHDGTYDCIVSPYLAIKAKLPQALVTYNLGC-DVDSTNTTGIPEAVKAAQAAD 475
Query: 480 ATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKN 539
I+V+GL+ S+E+E DR + LPG Q L+ + +A+ PT++V+M GG + + + K+
Sbjct: 476 VAIVVLGLNTSVESEGKDRVAITLPGMQDHLIKSI-VATNTPTVVVMMHGGAVAIEWIKD 534
Query: 540 DPRIAAIIWAGYPGQAGGTAIADILFGTSNPG----------GKLPMTWYPQEYITNLPM 589
++ I+ A YPG+ GG AIAD+LFG NPG G+LP+T P Y+ +P+
Sbjct: 535 --QVDGIVDAFYPGENGGQAIADVLFGDYNPGDNKTDGTTLLGRLPVTVLPANYVDMVPL 592
Query: 590 TEMAMRPSQSKRYPGRTYRFYKGPV-VYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGS 648
T M+MR S + PGRTYR+Y GP ++ FG G+SYT F + P A+ R +
Sbjct: 593 TNMSMRASGNN--PGRTYRYYTGPAPLWEFGFGLSYTTFKTEWLSTPQPSALKSYARDEA 650
Query: 649 INATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAF 708
++ +V V NVG G +L F T P KQL AF
Sbjct: 651 VS---------------------FRVRVTNVGPVAGDEVVLAFVTRDNADRGPLKQLFAF 689
Query: 709 EKVHVPAGAQQRVGIN 724
E+VH+ G + + N
Sbjct: 690 ERVHLNPGESKEIFFN 705
>gi|118489157|gb|ABK96385.1| unknown [Populus trichocarpa x Populus deltoides]
Length = 343
Score = 496 bits (1276), Expect = e-137, Method: Compositional matrix adjust.
Identities = 238/344 (69%), Positives = 273/344 (79%), Gaps = 1/344 (0%)
Query: 427 MIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMG 486
MIGNYAG+ACGYTTPLQGI RYA+T+H GC DV C +Q F AA A+R ADATILVMG
Sbjct: 1 MIGNYAGVACGYTTPLQGIRRYAKTVHLSGCNDVFCNGNQQFNAAEVAARHADATILVMG 60
Query: 487 LDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAI 546
LDQSIEAE DR GLLLPG QQELVS+V+ AS+GPTILVLMSGGPIDV+FAKNDPRI AI
Sbjct: 61 LDQSIEAEFRDRKGLLLPGYQQELVSRVARASRGPTILVLMSGGPIDVSFAKNDPRIGAI 120
Query: 547 IWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRT 606
+W GYPGQAGG AIAD+LFGT+NPGGKLPMTWYP +Y+ +PMT M MR S+ YPGRT
Sbjct: 121 LWVGYPGQAGGAAIADVLFGTANPGGKLPMTWYPHDYLAKVPMTNMGMRADPSRGYPGRT 180
Query: 607 YRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCN 666
YRFYKGPVV+PFGHGMSYT F H++ AP V+VPL H S N T + AI+V+HA C
Sbjct: 181 YRFYKGPVVFPFGHGMSYTTFAHSLVQAPREVSVPLASLHVSRNTTGASNAIRVSHANCE 240
Query: 667 RLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIH 726
L LGV +DVKN G DG HTLLVFS+PP G W+ KQL+ FEKVH+ G+Q+RV I+IH
Sbjct: 241 ALALGVHIDVKNTGDMDGTHTLLVFSSPPGGKWSTQKQLIGFEKVHLVTGSQKRVKIDIH 300
Query: 727 VCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIKS 770
VCK+LSVVDR G RRIP GEH ++IG KHS+SL ATL IKS
Sbjct: 301 VCKHLSVVDRFGIRRIPNGEHYLYIGDLKHSISLQ-ATLEEIKS 343
>gi|222618262|gb|EEE54394.1| hypothetical protein OsJ_01415 [Oryza sativa Japonica Group]
Length = 776
Score = 495 bits (1275), Expect = e-137, Method: Compositional matrix adjust.
Identities = 309/783 (39%), Positives = 412/783 (52%), Gaps = 115/783 (14%)
Query: 30 CDPKDATTRTL-----PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGY 84
CDP L P+C SLP RV DL+GR++L+EKV L A PR+G+ Y
Sbjct: 47 CDPARFAAAGLDMAGFPYCDASLPYADRVRDLVGRMTLEEKVANLGDRAGGAPRVGLPRY 106
Query: 85 EWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA--- 141
G G + A A RVV +A A
Sbjct: 107 ------------CGGGRRCTACPTSARRDVVWRRRARRHQLPARHQQRRVVQRDAVARHR 154
Query: 142 ---------MYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG 192
MYN G A LTYWSPN+N+ RDPRWGR ETPGEDP + G+YA ++VRG+Q
Sbjct: 155 RRGVDGDQGMYNLGHAELTYWSPNINVVRDPRWGRASETPGEDPFVVGRYAVNFVRGMQD 214
Query: 193 SDGDR---------------LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFD 237
DG +KV++ CKH+ A
Sbjct: 215 IDGATTAASAAAATDAFSRPIKVSSCCKHYAA---------------------------- 246
Query: 238 VPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYY- 296
VMCSYN++NGVP CAD +L T+R +W+L+GYIVSDCDSV V
Sbjct: 247 -------------CVMCSYNRINGVPACADARLLTETVRRDWQLHGYIVSDCDSVRVMVR 293
Query: 297 DTQHFTSTPEEAAADAIRAGLDLDCGPFL-GLHT------ESAVQRGLLSEIDINNALVN 349
D + T EA A A++AGLDLDCG F G+H AV++G L E ++NAL N
Sbjct: 294 DAKWLGYTGVEATAAAMKAGLDLDCGMFWEGVHDFFTTYGVDAVRQGKLKESAVDNALTN 353
Query: 350 TLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHI 409
MRLG FDG P + LG DVCT +H+ELA +AARQG+VLLKN LPLS
Sbjct: 354 LYLTLMRLGFFDGIPELE---SLGAADVCTEEHKELAADAARQGMVLLKNDAALLPLSPE 410
Query: 410 RHRTVAVIGPNSDVTVT--MIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQL 467
+ +VA+ G + T M+G+Y G C TP G+ + + C +C
Sbjct: 411 KVNSVALFGQLQHINATDVMLGDYRGKPCRVVTPYDGVRKVVSSTSVHACDKGSC----- 465
Query: 468 FGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLM 527
A A++ DATI+V GL+ S+E E+ DR LLLP Q ++ V+ AS P +LV+M
Sbjct: 466 -DTAAAAAKTVDATIVVAGLNMSVERESNDREDLLLPWSQASWINAVAEASPSPIVLVIM 524
Query: 528 SGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL 587
S G +DV+FA+++P+I A++WAGYPG+ GGTAIAD+LFG NPGG+LP+TWY EY++ +
Sbjct: 525 SAGGVDVSFAQDNPKIGAVVWAGYPGEEGGTAIADVLFGKYNPGGRLPLTWYKNEYVSKI 584
Query: 588 PMTEMAMRPSQSKRYPGRTYRFYKGP-VVYPFGHGMSYTNFVHTVANAPTVVAVPLDG-- 644
PMT MA+RP YPGRTY+FY G V+YPFGHG+SYTNF + A A V V +
Sbjct: 585 PMTSMALRPDAEHGYPGRTYKFYGGADVLYPFGHGLSYTNFTYASATAAAPVTVKVGAWE 644
Query: 645 --RHGSINATISG----KAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG- 697
+ + A +S A+ V C + V V N G +DG H + +++ PPA
Sbjct: 645 YCKQLTYKAGVSSPPACPAVNVASHACQE-EVSFAVTVANTGGRDGTHVVPMYTAPPAEV 703
Query: 698 HWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHS 757
AP KQLVAF +V V AGA V ++VCK ++V+ + +P G + +G S
Sbjct: 704 DGAPRKQLVAFRRVRVAAGAAVEVAFALNVCKAFAIVEETAYTVVPSGVSRVLVGDDALS 763
Query: 758 VSL 760
+S
Sbjct: 764 LSF 766
>gi|409079878|gb|EKM80239.1| hypothetical protein AGABI1DRAFT_120267 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 786
Score = 495 bits (1274), Expect = e-137, Method: Compositional matrix adjust.
Identities = 295/741 (39%), Positives = 424/741 (57%), Gaps = 39/741 (5%)
Query: 38 RTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNV 97
++ P C + R LI + E ++ + + VPRLG+ YEWWSEALHGV +
Sbjct: 33 KSTPVCDSAKDPATRAQSLIQMFTDDELIQNGDNASPGVPRLGLPPYEWWSEALHGVGH- 91
Query: 98 GPGTKFG--GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSP 155
PG F GDF ATSFPQ I ++F+ L +A+ VVS EARA N G AGL Y++P
Sbjct: 92 SPGVVFAPSGDFSSATSFPQPIVIGAAFDDDLVKAVANVVSTEARAFNNFGRAGLNYFTP 151
Query: 156 NVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG--SDGDRLKVAASCKHFTAYDLD 213
N+N F+DPRWGRGQETPGEDP +Y V GLQG +KVAA CKHF AYDL+
Sbjct: 152 NINPFKDPRWGRGQETPGEDPFHLSQYVYHLVDGLQGGIDPWPYIKVAADCKHFAAYDLE 211
Query: 214 NWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKR 273
NW G+DRFHF+A+VS+QD+ + + PF+ CV + K ASVMCSYN VNGVP CA +L+
Sbjct: 212 NWEGIDRFHFDAQVSQQDLSEYYLPPFQSCVRDAKAASVMCSYNSVNGVPACASTYLLQD 271
Query: 274 TIRGEWRLNG--YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTES 331
+R W + ++ SDC ++ +D+ +FT + EAAA +++AG D+DCG H +
Sbjct: 272 ILRDAWGFDDDRWVTSDCWALDKIFDSHNFTRSFAEAAAISLKAGTDIDCGSTFADHLPA 331
Query: 332 AVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPS-SQPYGHLGPKDVCTPDHQELALEAA 390
A+ + L+S D+ A + T +RLG FD PS SQ Y DV TP+ Q L+ AA
Sbjct: 332 ALNQSLISRDDLTRAFIRQYTSLIRLGYFD--PSDSQTYRQFDWSDVNTPEAQALSRRAA 389
Query: 391 RQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYAR 450
+G+VLLKN G LPL+ +T+A+IGP ++ T +M GNY G A T+P QG
Sbjct: 390 VEGLVLLKNDG-LLPLAP-DGKTIAIIGPYTNATSSMQGNYFGNAPIITSPFQGAQDVGF 447
Query: 451 TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQEL 510
+ V F AI+ ++ AD + V G+D ++E E LDR+ + PG Q +L
Sbjct: 448 KVVSAAGTTVNGTSSAGFAEAINTAKAADVVVFVGGIDNTLEREGLDRSSISWPGNQLDL 507
Query: 511 VSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNP 570
V ++ K P I+V GG +D + ++ AIIWAGYPGQ+GGTAI DI+ G++ P
Sbjct: 508 VKDLASLGK-PLIVVQFGGGQVDDTEILANKKVQAIIWAGYPGQSGGTAIFDIIVGSTAP 566
Query: 571 GGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHT 630
G+LP+T YP +Y + MT+M++RPS PGRTY++YK PV+ +GHG+ +T F +
Sbjct: 567 AGRLPVTQYPADYTHQVRMTDMSLRPSSHN--PGRTYKWYKTPVL-EYGHGLHFTTFDFS 623
Query: 631 VANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTL--GVQVDVKNVGSKDGAHTL 688
P A D + + I+ +H+K L ++ V+N G+ +
Sbjct: 624 WQRQP---AAEYDIQ----------ELIRASHSKFLDLAHFDTFEICVRNTGNITSDYVG 670
Query: 689 LVF---STPPAGHWAPHKQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPL 744
L+F +T P H P K LVA+ +VH + G + + + + ++ VD++G +
Sbjct: 671 LLFLSGNTGPGPH--PIKSLVAYSRVHDIQGGTSATLTLKVTLGS-VARVDKNGDLWLFP 727
Query: 745 GEHNIHIGGTKHSVSLHAATL 765
G + + + TK V H L
Sbjct: 728 GPYRL-VLDTKDGVLTHPFRL 747
>gi|452989371|gb|EME89126.1| glycoside hydrolase family 3 protein [Pseudocercospora fijiensis
CIRAD86]
Length = 790
Score = 494 bits (1273), Expect = e-137, Method: Compositional matrix adjust.
Identities = 274/710 (38%), Positives = 398/710 (56%), Gaps = 23/710 (3%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C + R LI +L EK+ S + VPRLG+ YEWW EALHGV++ PG
Sbjct: 39 CDTAADPLTRAKALIAEFTLAEKINNTGSTSPGVPRLGLLPYEWWQEALHGVAS-SPGVN 97
Query: 103 FG--GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIF 160
F G+F ATSFPQ I ++F+ L + V+S EARA N AGL +W+PN+N F
Sbjct: 98 FSVSGEFRYATSFPQPILMGAAFDDQLIHDVASVISTEARAFSNDDRAGLDFWTPNINPF 157
Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDR 220
+DPRWGRGQETPGEDP Y S +RGLQG + KV A+CKHF AYD++NWNG R
Sbjct: 158 KDPRWGRGQETPGEDPYHLSSYVHSLIRGLQGDNPSYKKVVATCKHFVAYDVENWNGNFR 217
Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
+ +A ++ QD+ + + PFR C + V + MCSYN +NGVPTCADP +L+ +R W
Sbjct: 218 YQLDAHINSQDLVEYYMPPFRSCARDSNVGAFMCSYNSLNGVPTCADPYLLQTVLREHWN 277
Query: 281 LNG---YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGL 337
++ SDCDSV + ++ S+ EEAAA +++AG D++CG + H A ++GL
Sbjct: 278 WTAEEQWVTSDCDSVQNVFLYHNYASSREEAAAISLKAGTDINCGTYYQEHLPRAYEQGL 337
Query: 338 LSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLL 397
++E D++ +L+ +RLG FDG+ PY +L DV TP Q+LAL+AA GI LL
Sbjct: 338 INETDVDTSLIRQYGSLIRLGYFDGD--RVPYRNLTWNDVSTPYAQDLALKAATSGITLL 395
Query: 398 KNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY-ARTIHQQG 456
KN G LPL +A+IG ++ T M+GNY GI + +PL + A + QG
Sbjct: 396 KNDG-ILPLQITNGTKIALIGDWANATDQMLGNYHGIPPYFHSPLWAAQQTGAEVTYVQG 454
Query: 457 CKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSM 516
+ + A+ ++D I + G+D+ +EAE DR + G Q +++ +++
Sbjct: 455 PGGQSDPTTYTWRPIWSAANKSDVIIYIGGMDERVEAEEKDRVSIAWSGPQLDVIGQLAD 514
Query: 517 ASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPM 576
PTI+V M GG +D + +P I A++W GYPGQ GG AI DIL G S P G+LP+
Sbjct: 515 YYDKPTIVVQMGGGSLDSSPLVKNPNIRALLWGGYPGQDGGKAIFDILQGISAPAGRLPI 574
Query: 577 TWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
T Y +YI+ +PMT+ ++RP+ + PGRTY + V+ FG+G+ YTNF T+
Sbjct: 575 TQYRADYISKVPMTDTSLRPNATSGSPGRTYIWLNEEPVFEFGYGLHYTNFTATIP---- 630
Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGV-QVDVKNVGSKDGAHTLLVFSTPP 695
D +I A T + +R +DV N GS + L F T
Sbjct: 631 ------DAESSDTTYSIDSLASDCTESYLDRCPFKTFSIDVTNTGSVTSDYVTLGFLTGA 684
Query: 696 AG-HWAPHKQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIP 743
G P+K+LV+++++H + AG+ Q +N+ + V D+ T P
Sbjct: 685 HGPEPCPNKRLVSYQRLHNITAGSTQTAALNLTLGSLSRVDDKGNTVLFP 734
>gi|426198356|gb|EKV48282.1| hypothetical protein AGABI2DRAFT_67675 [Agaricus bisporus var.
bisporus H97]
Length = 763
Score = 494 bits (1271), Expect = e-137, Method: Compositional matrix adjust.
Identities = 296/748 (39%), Positives = 425/748 (56%), Gaps = 39/748 (5%)
Query: 31 DPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEA 90
D + ++ P C + R LI + E ++ + + VPRLG+ YEWWSEA
Sbjct: 26 DCTNGPLKSTPVCDSTKDPATRAQSLIQMFTDDELIQNGDNASPGVPRLGLPPYEWWSEA 85
Query: 91 LHGVSNVGPGTKFG--GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA 148
LHGV + PG F GDF ATSFPQ I ++F+ L +A+ VVS EARA N G A
Sbjct: 86 LHGVGH-SPGVVFAPSGDFSSATSFPQPIVIGAAFDDDLVKAVANVVSTEARAFNNFGRA 144
Query: 149 GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG--SDGDRLKVAASCKH 206
GL Y++PN+N F+DPRWGRGQETPGEDP +Y V GLQG +KVAA CKH
Sbjct: 145 GLNYFTPNINPFKDPRWGRGQETPGEDPFHLSQYVYHLVDGLQGGIDPWPYIKVAADCKH 204
Query: 207 FTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCA 266
F AYDL+NW G+DRFHF+A+VS+QD+ + + PF+ CV + K ASVMCSYN VNGVP CA
Sbjct: 205 FAAYDLENWEGIDRFHFDAQVSQQDLSEYYLPPFQSCVRDAKAASVMCSYNSVNGVPACA 264
Query: 267 DPNILKRTIRGEWRLNG--YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPF 324
+L+ +R W + ++ SDC ++ +D+ +FT + EAAA +++AG D+DCG
Sbjct: 265 STYLLQDILRDAWGFDDDRWVTSDCWALDKIFDSHNFTRSFAEAAAISLKAGTDIDCGST 324
Query: 325 LGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPS-SQPYGHLGPKDVCTPDHQ 383
H +A+ + L+S D+ A + T +RLG FD PS SQ Y DV TP+ Q
Sbjct: 325 FADHLPAALNQSLISRDDLTRAFIRQYTSLIRLGYFD--PSHSQTYRQFDWSDVNTPEAQ 382
Query: 384 ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQ 443
L+ AA +G+VLLKN G LPL+ +T+A+IGP ++ T +M GNY G A T+P Q
Sbjct: 383 ALSRRAAVEGLVLLKNDG-LLPLAP-DGKTIAIIGPYTNATSSMQGNYFGNAPFITSPFQ 440
Query: 444 GIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLL 503
G + V F AI+ +R AD + V G+D ++E E LDR+ +
Sbjct: 441 GAQDVGFKVVSAAGTIVNGTSSAGFAEAINTARAADVVVFVGGIDNTLEREGLDRSSISW 500
Query: 504 PGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADI 563
PG Q +LV ++ K P I+V GG +D + ++ AIIWAGYPGQ+GGTAI DI
Sbjct: 501 PGNQLDLVKDLASLGK-PLIVVQFGGGQVDDTEILANEKVQAIIWAGYPGQSGGTAIFDI 559
Query: 564 LFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMS 623
+ G + P G+LP+T YP +Y + MT+M++RPS PGRTY++YK PV+ +GHG+
Sbjct: 560 IVGATAPAGRLPVTQYPADYTHQVRMTDMSLRPSSHN--PGRTYKWYKTPVL-EYGHGLH 616
Query: 624 YTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTL--GVQVDVKNVGS 681
+T F + P A D + + I+ +H+K L ++ V+N G+
Sbjct: 617 FTTFDFSWQRQP---AAEYDIQ----------ELIRASHSKFLDLAHFDTFEICVRNTGN 663
Query: 682 KDGAHTLLVF---STPPAGHWAPHKQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRS 737
+ L+F ++ P H P K LVA+ +VH + G + + + + ++ VD++
Sbjct: 664 ITSDYVGLLFLSGNSGPGPH--PIKSLVAYSRVHDIQGGTSATLTLKVTLGS-VARVDKN 720
Query: 738 GTRRIPLGEHNIHIGGTKHSVSLHAATL 765
G + G + + + TK V H L
Sbjct: 721 GDLWLFPGPYRL-VLDTKDGVLTHPFRL 747
>gi|396473219|ref|XP_003839293.1| similar to beta-1,4-xylosidase [Leptosphaeria maculans JN3]
gi|312215862|emb|CBX95814.1| similar to beta-1,4-xylosidase [Leptosphaeria maculans JN3]
Length = 789
Score = 493 bits (1268), Expect = e-136, Method: Compositional matrix adjust.
Identities = 289/735 (39%), Positives = 417/735 (56%), Gaps = 34/735 (4%)
Query: 31 DPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEA 90
D ++ +T C S R L+ +L+EK+ SG+ VPRLGI Y+WWSE
Sbjct: 23 DCQNGPLQTETICNTSASPLDRAKSLVTLYTLEEKINATSSGSPGVPRLGIPPYQWWSEG 82
Query: 91 LHGVSNVGPGTKF---GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT 147
LHG++ GP T F G ++ +TSFPQ I ++F+ L + +V+S EARA N
Sbjct: 83 LHGIA--GPYTNFSTSGIEYSYSTSFPQPILMGAAFDDHLITDVAKVISTEARAFNNANR 140
Query: 148 AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLK-VAASCKH 206
GL +W+PN+N FRDPRWGRGQETPGED Y + + GLQG D K V A+CKH
Sbjct: 141 TGLDFWTPNINPFRDPRWGRGQETPGEDAFHLSSYVKALIAGLQGETTDPYKRVVATCKH 200
Query: 207 FTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCA 266
F YD+++WNG R+ F+A++S+QD+ + + PF+ CV + V + MCSYN VNGVPTCA
Sbjct: 201 FAGYDIEDWNGNLRYQFDAQISQQDLVEYYLQPFQACV-QANVGAFMCSYNAVNGVPTCA 259
Query: 267 DPNILKRTIRGEWRLNG---YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGP 323
DP +L+ +R W ++ SDCD+V Y +++T E+A ADA+ AG DLDCG
Sbjct: 260 DPYLLQTILREHWGWTNEEQWVTSDCDAVQNIYLPHQWSATREQAVADALIAGTDLDCGT 319
Query: 324 FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQ 383
++ H A +GL++E ++ ALV + +RLG FD + + QPY G V T Q
Sbjct: 320 YMQEHLPGAFAQGLVNENVLDQALVRQYSSLVRLGWFD-DAADQPYRQFGWDSVATDASQ 378
Query: 384 ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQ 443
LA AA +GIVLLKN G LPLS ++ V G ++ T ++GNYAG+ +PL
Sbjct: 379 ALARRAAVEGIVLLKNDG-VLPLSIDSSVSLGVFGDWANATSQLLGNYAGVPTYLHSPLW 437
Query: 444 GIGRYARTIHQQGCKDVACAD--DQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGL 501
+ + TI+ G D + + A +D I + G+D SIE E DR L
Sbjct: 438 ALQQENLTINYAGGNPGGQGDPTTNRWSSLSGAIATSDILIYIGGIDNSIEEEGHDRTSL 497
Query: 502 LLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIA 561
G Q +++ +++ K PTI+V+M GG ID A N+ I+AI+WAGYPGQ GG AI
Sbjct: 498 AWTGAQLDVIFQLAATGK-PTIVVVMGGGQIDSAPLANNANISAILWAGYPGQDGGPAIV 556
Query: 562 DILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHG 621
DIL G S P G+LP T YP Y + +PMT+M +RPS++ PGRTY++Y G Y FGHG
Sbjct: 557 DILTGKSPPAGRLPQTQYPASYTSLVPMTDMGLRPSENN--PGRTYKWYNGTATYEFGHG 614
Query: 622 MSYTNFVHTVANAPTVVAVPLDGRHGSINATISGK-AIKVTHAKCNRLTLGVQVDVKNVG 680
+ YTNF TV + P+ + + + K A +T +C + V + V N G
Sbjct: 615 LHYTNFSATVTS-------PMQQSYRIADLMSTCKNATSITLERCAFTS--VDISVTNTG 665
Query: 681 SKDGAHTLLVF---STPPAGHWAPHKQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDR 736
+ + L + S PA H P K LV ++++ + AGA I++ + + L+ VD
Sbjct: 666 AVASDYVTLCYISGSHGPAPH--PKKSLVGYQRLFGIAAGASDTARIDLTL-ESLARVDE 722
Query: 737 SGTRRIPLGEHNIHI 751
G + + GE+++ +
Sbjct: 723 VGNKVLYPGEYSLMV 737
>gi|389748500|gb|EIM89677.1| glycoside hydrolase family 3 protein [Stereum hirsutum FP-91666
SS1]
Length = 770
Score = 488 bits (1257), Expect = e-135, Method: Compositional matrix adjust.
Identities = 289/734 (39%), Positives = 411/734 (55%), Gaps = 25/734 (3%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C S R L+ ++L+E V ++ + VPRLG+ YEWWSEALHGV++ PG
Sbjct: 36 CNTSANFLDRAKALVNAMTLEEMVNNTVNTSPGVPRLGLPPYEWWSEALHGVAS-SPGVT 94
Query: 103 F--GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIF 160
F GDF GATSFP+ I +++F+ L ++ +S EARA N +GL +++PN+N F
Sbjct: 95 FETSGDFSGATSFPEPILMSAAFDDDLIFSVASTISTEARAFGNTNHSGLDFFTPNINPF 154
Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR--LKVAASCKHFTAYDLDNWNGV 218
+DPRWGRGQETPGEDP+ + +Y + GLQG G K+ A CKHF AYDL+NW G
Sbjct: 155 KDPRWGRGQETPGEDPLHTSRYVYQLITGLQGGVGPSPYYKIIADCKHFAAYDLENWEGN 214
Query: 219 DRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGE 278
+R FNA VS QD+ + + F+ CV + KV SVMCSYN VNGVP C P +L+ +R
Sbjct: 215 NRMAFNAIVSTQDLAEFYTPSFQSCVRDAKVGSVMCSYNAVNGVPACGSPYLLQDLVRDY 274
Query: 279 WRL--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRG 336
+ L + +I SDCD+VG +D ++T+T A+A A+ AG D+DCG AV G
Sbjct: 275 FELGNDTWITSDCDAVGNIFDPHNYTTTLTNASAVALLAGTDVDCGTSYSETLGEAVSEG 334
Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
L+S+ D+ ALV +RLG FD E S PY LG DV TP Q LA AA +GIVL
Sbjct: 335 LVSKSDVERALVRLYGSLVRLGYFDPE-DSVPYRALGASDVNTPAAQTLAYTAAVEGIVL 393
Query: 397 LKNQGPSLPLS-HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQ 455
LKN G LPLS ++ H +A+IGP ++ T M GNY GIA +PL G +
Sbjct: 394 LKNDG-LLPLSSNVSH--IALIGPWANATTQMQGNYEGIAPLLISPLDGFTSAGFNVSFT 450
Query: 456 GCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVS 515
++ F A+ + AD + + G+D ++EAE DR + PG Q EL+ ++
Sbjct: 451 NGTTISGNSTSGFADALSMASAADVIVYIGGIDDTVEAEGQDRTSITWPGNQLELIGELG 510
Query: 516 MASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLP 575
K P +++ M GG +D K + + A++W GYPGQAGG A+ADI+ G P G+L
Sbjct: 511 AFGK-PFVVIQMGGGQVDDTELKANSSVNALLWGGYPGQAGGKALADIITGVQAPAGRLT 569
Query: 576 MTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNF-VHTVANA 634
T YP Y+ + MT+M++RPS S PGRTY++Y G V+ FG G+ YT F V +
Sbjct: 570 TTQYPASYVDQVAMTDMSVRPSNSTGSPGRTYKWYTGTPVFEFGFGLHYTTFDVEWAEGS 629
Query: 635 PTVVAVPLDGRHGSINATISGKAIKVTHAKCNRL-TLGVQVDVKNVGSKDGAHTLLVFST 693
P D + A+ + + V H L T VQ V N G+ + L+FS
Sbjct: 630 PAASYSIQD-----LVASANSSSSAVAHVDSAILDTFTVQ--VTNTGNVTSDYVALLFSN 682
Query: 694 PPAG-HWAPHKQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHI 751
AG AP ++LV++ +V + G +N+ + ++ VD G I G +N+ +
Sbjct: 683 TTAGPSPAPLQELVSYARVKGITPGVSATASLNVTLGT-IARVDEDGNSIIYPGVYNLWV 741
Query: 752 GGTKHSVSLHAATL 765
T + ++ + L
Sbjct: 742 DTTGQAKAVTSFEL 755
>gi|344303941|gb|EGW34190.1| hypothetical protein SPAPADRAFT_65353 [Spathaspora passalidarum
NRRL Y-27907]
Length = 788
Score = 488 bits (1256), Expect = e-135, Method: Compositional matrix adjust.
Identities = 284/727 (39%), Positives = 402/727 (55%), Gaps = 31/727 (4%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C LP QR ++ ++ E + + + + V RLG+ Y+WWSEALHG++ +
Sbjct: 61 CNPHLPTEQRAKAVVDLFTVDELIANMGNTSPGVERLGLPPYQWWSEALHGIAR----SN 116
Query: 103 F--GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIF 160
F G++ ATSFPQ I +FN L++ +G V+ EARA N G AGL ++SPN+N F
Sbjct: 117 FTASGEYSHATSFPQPILMGGAFNNDLYKQVGNVIGTEARAFNNVGRAGLDFYSPNINPF 176
Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-----SDGDRLKVAASCKHFTAYDLDNW 215
RD RWGRGQE E PVL G YA +YV+GLQG + D L+VAA+CKHF YD+++W
Sbjct: 177 RDARWGRGQEVASESPVLVGNYALNYVQGLQGGLDSNQNDDTLQVAATCKHFVGYDMESW 236
Query: 216 NGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTI 275
N R +NA +S QD+ D + F+ CV + K A MCSYN VNGVP CA L +
Sbjct: 237 NQHSRLGYNAIISDQDLADFYLPTFQSCVRDAKAAGAMCSYNAVNGVPACASEFFLNTVL 296
Query: 276 RGEWRL-NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQ 334
R + NG I SDCD++ ++ + AAADAI+AG+D++CG + A+
Sbjct: 297 RDGFDFQNGVIHSDCDAIYNVWNPHLYAQDLGGAAADAIKAGVDVNCGDTYQNNLGYALG 356
Query: 335 RGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGI 394
++E I ++ + +RLG FD P + Y DV TP +LA +AA +GI
Sbjct: 357 NKTINENQIRTSVTRQYSNLIRLGYFD-SPQTNKYRKYDWNDVSTPQANQLAYQAAVEGI 415
Query: 395 VLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQ 454
LLKN G +LP + + R VAVIGP ++ T M+G+YAG +PLQG +
Sbjct: 416 ALLKNDG-TLPFNKQKVRKVAVIGPWANATTQMLGDYAGTPPYMISPLQGAQSEGFQVEY 474
Query: 455 QGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKV 514
+ D + AA++A++ ADA + G+D S+E EALDR L PG Q +LVSK+
Sbjct: 475 ALGTQINTTDTSGYTAALNAAKGADAIVYFGGIDNSVENEALDRESLAWPGNQLDLVSKL 534
Query: 515 SMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKL 574
S K P +++ GG ID KN+ + AI++AGYPGQ+GGTAI DIL G P G+L
Sbjct: 535 S-GLKKPLVVLQFGGGQIDDTEIKNNKNVNAIVYAGYPGQSGGTAIWDILSGKYAPAGRL 593
Query: 575 PMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANA 634
T YP Y +PMT+M +RP Q YPGRT+ +Y G VY FG+G+ YT F ++ANA
Sbjct: 594 TTTQYPASYADQVPMTDMTLRPRQG--YPGRTFMWYNGEPVYEFGYGLHYTTFSASLANA 651
Query: 635 PTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP 694
P G H S N A K + L V++KN G + L++S
Sbjct: 652 PR-------GGHQSFNIEQVVAAAKRSQYVDTGLITTFDVNIKNTGKTTSDYAALLYSKT 704
Query: 695 PAG---HWAPHKQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIH 750
AG H P+K LV+F+K+H + AG Q + + + L D +G + + G +
Sbjct: 705 TAGPGPH--PNKILVSFDKLHQIHAGQTQTAKLPVTIGSLLQ-TDTNGNKWLYPGTYTFF 761
Query: 751 IGGTKHS 757
+ K +
Sbjct: 762 VDNDKKA 768
>gi|451992719|gb|EMD85198.1| glycoside hydrolase family 3 protein [Cochliobolus heterostrophus
C5]
Length = 781
Score = 488 bits (1256), Expect = e-135, Method: Compositional matrix adjust.
Identities = 288/757 (38%), Positives = 429/757 (56%), Gaps = 43/757 (5%)
Query: 10 LGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLL 69
L L+L +A G+ A+ F D K+ + C S R L+ +L+EK+
Sbjct: 8 LSLLLFAA---GVFAQGNFP-DCKNGPLKNETICDPSASTLARAKSLVALYTLEEKINAT 63
Query: 70 ISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFG--GDFPGATSFPQVITTASSFNATL 127
+ A V RLG+ Y+WW+E LHG++ GP T F GD+ +TSFPQ I ++F+ L
Sbjct: 64 SNSAPGVARLGVPPYQWWNEGLHGIA--GPFTSFAKQGDYSYSTSFPQPILMGAAFDDDL 121
Query: 128 WEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYV 187
+ +V+S EARA N GL +W+PN+N FRDPRWGRGQETPGED Y + +
Sbjct: 122 ITEVAKVISTEARAFNNANRTGLDFWTPNINPFRDPRWGRGQETPGEDSYHLSSYVKALI 181
Query: 188 RGLQGSDGDRL-KVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVME 246
GLQG+ D +V A+CKH+ YD++NWNG R+ + ++S+QD+ + + PF CV +
Sbjct: 182 HGLQGNATDPYRRVVATCKHYAGYDIENWNGNLRYQNDVQISQQDLVEYYLAPFEACV-Q 240
Query: 247 GKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNG---YIVSDCDSVGVYYDTQHFTS 303
V + MCSYN VNG P CADP +L+ +R W + ++ SDCD++ Y ++S
Sbjct: 241 ANVGAFMCSYNAVNGAPPCADPYLLQTVLREHWGWSSDDHWVTSDCDAIQNVYLPHQWSS 300
Query: 304 TPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGE 363
T E AAAD++ AG DLDCG +L H AV++GL E ++ AL+ + ++LG FD
Sbjct: 301 TREGAAADSLNAGTDLDCGTYLQTHLPGAVKQGLTDETTLDKALIRQYSSLIKLGYFDA- 359
Query: 364 PSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDV 423
P +QPY LG V T Q LAL+AA +GIVLLKN G LP+ ++ + V + G ++
Sbjct: 360 PENQPYRQLGFDAVATSASQALALKAAEEGIVLLKNDG-VLPI-NLGSKQVGIYGDWANA 417
Query: 424 TVTMIGNYAGIACGYTTP---LQGIG---RYARTIHQQGCKDVACADDQLFGAAIDASRQ 477
T + GNY G+A T+P LQ +G +YA + A L G
Sbjct: 418 TSQLQGNYFGVAKFLTSPLMALQNLGVDVKYAGNLPGGQGDPTTGAWSSLSGVIT----T 473
Query: 478 ADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFA 537
+D I V G+D +E+E DR+ L L G Q +++ +++ K P I+V+M GG ID +
Sbjct: 474 SDVHIWVGGIDNGVESEDRDRSWLTLTGGQLDVIGQLADTGK-PVIVVIMGGGQIDTSPL 532
Query: 538 KNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPS 597
+P+I+A++WAGYPGQ GGTAI +IL G + P G+LP T YP +Y++ +PMT+MAMRPS
Sbjct: 533 IRNPKISAVLWAGYPGQDGGTAIVNILTGKAAPAGRLPQTQYPSKYVSEVPMTDMAMRPS 592
Query: 598 QSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKA 657
+ PGRTY++Y G ++ FG+G+ YTNF ++ N P + +I+ + G
Sbjct: 593 D--KNPGRTYKWYTGEPIFEFGYGLHYTNFSASITNQPK--------QSYAISDLVKG-- 640
Query: 658 IKVTHAKCNRLTL-GVQVDVKNVGSKDGAHTLLVFSTPPAG-HWAPHKQLVAFEKV-HVP 714
T R G+ V V+N G + L F T G P K LVA++++ ++
Sbjct: 641 CNSTGGFLERCPFTGITVSVQNTGKISSDYVTLGFLTGSFGPKPYPKKSLVAYDRLFNIA 700
Query: 715 AGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHI 751
AG+ +N+ + L+ VD SG + + G++ + I
Sbjct: 701 AGSSSTATLNLTLAS-LARVDESGNKVLYPGDYELQI 736
>gi|409079872|gb|EKM80233.1| hypothetical protein AGABI1DRAFT_57801 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 767
Score = 486 bits (1250), Expect = e-134, Method: Compositional matrix adjust.
Identities = 293/725 (40%), Positives = 416/725 (57%), Gaps = 38/725 (5%)
Query: 52 RVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFG--GDFPG 109
R LI + +E ++ + + VPRLG+ Y+WWSEALHGV+ PG F G+F
Sbjct: 47 RATTLIQMFTDEELMQNTDNVSPGVPRLGVPSYQWWSEALHGVAG-SPGVSFAPSGEFSS 105
Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQ 169
ATSFPQ I S+F+ L +A+ V+S EARA N AGL Y++PN+N F+DPRWGRGQ
Sbjct: 106 ATSFPQSIVLGSTFDIDLVKAVATVISTEARAFNNFHRAGLDYFTPNINPFKDPRWGRGQ 165
Query: 170 ETPGEDPVLSGKYAASYVRGLQGSDGDR--LKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
ETPGEDP +Y S + GLQG R KVAA CKH+ AYDLD+W G+DRFHF+AKV
Sbjct: 166 ETPGEDPFHVSQYVYSLIDGLQGGIDPRPYFKVAADCKHYAAYDLDSWEGIDRFHFDAKV 225
Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNG--YI 285
S QD+ + + F+ CV + KVASVMCSYN VNG+P CA+P +L+ +R W + ++
Sbjct: 226 SLQDLSEYYLPSFQSCVRDAKVASVMCSYNSVNGIPACANPYLLQDILRDFWGFDDDRWV 285
Query: 286 VSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINN 345
SDCD++G + T +FT T EA ADA++AG D+DCG H A+ + L++ D+
Sbjct: 286 TSDCDAIGNIFTTHNFTDTFAEAVADALKAGTDVDCGTSYSTHLPDALNQSLITRDDLER 345
Query: 346 ALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLP 405
AL T MRLG FD P SQP L DV PD Q LA AA +G+VLLKN G LP
Sbjct: 346 ALTRQYTSLMRLGYFD-PPESQPLRQLAWSDVNKPDAQALAHTAAVEGLVLLKNDG-FLP 403
Query: 406 LSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI--GRYARTIHQQGCKDVACA 463
+S +T+A+IGP ++ T M GNY G A TP QG + + G +
Sbjct: 404 VS-ASGKTIAIIGPYANATKDMQGNYFGTAPFIVTPFQGAVDAGFNEVVSAAGTS-INGT 461
Query: 464 DDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTI 523
+ F AAI + +D I G++ SIE+EA DR + G Q LV +++ K P +
Sbjct: 462 SEADFAAAIAVANSSDIIIFAGGINNSIESEAKDRLTIAWTGNQLSLVKQLASLGK-PVV 520
Query: 524 LVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEY 583
+V GG +D + ++ + A+IWAGYPGQ+GGTAI D++ G P G+L +T YP+++
Sbjct: 521 VVQFGGGQLDDSDLLDNDAVRAVIWAGYPGQSGGTAIFDVITGAVAPAGRLSVTQYPEDF 580
Query: 584 ITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLD 643
+ + MT+MA+RP + PGRTY++Y G V FGHG+ +T F + P
Sbjct: 581 VNQVGMTDMALRPGSAN--PGRTYKWYTGRPVLEFGHGLHFTTFDFSWRGRP-------- 630
Query: 644 GRHGSINATISGKAIKVTHAKCNRLTLGV-QVDVKNVGSKDGAHTLLVFSTPPAGHWAPH 702
GR +I + K + + L V+++N G+ + L+F AG +APH
Sbjct: 631 GRKYNIQHLLHTADKKFP----DLIPLDTFHVNIRNTGNITSDYVALLFLKSNAG-FAPH 685
Query: 703 --KQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHI----GGTK 755
K LV+F + H + AG+ V + +++ ++ VD G + G++ + + G
Sbjct: 686 PKKSLVSFARAHRIDAGSSATVDLGVNLGS-IARVDEHGDSWLFAGDYQLVLDIGDGVLS 744
Query: 756 HSVSL 760
HS SL
Sbjct: 745 HSFSL 749
>gi|426198365|gb|EKV48291.1| hypothetical protein AGABI2DRAFT_219902 [Agaricus bisporus var.
bisporus H97]
Length = 767
Score = 485 bits (1249), Expect = e-134, Method: Compositional matrix adjust.
Identities = 294/725 (40%), Positives = 416/725 (57%), Gaps = 38/725 (5%)
Query: 52 RVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFG--GDFPG 109
R LI + +E ++ + + VPRLG+ Y+WWSEALHGV+ PG F G+F
Sbjct: 47 RAKTLIQMFTDEELMQNTDNVSPGVPRLGVPSYQWWSEALHGVAG-SPGVSFAPSGEFSS 105
Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQ 169
ATSFPQ I S+F+ L +A+ V+S EARA N AGL Y++PN+N F+DPRWGRGQ
Sbjct: 106 ATSFPQSIVLGSTFDIDLVKAVATVISTEARAFNNFHRAGLDYFTPNINPFKDPRWGRGQ 165
Query: 170 ETPGEDPVLSGKYAASYVRGLQGSDGDR--LKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
ETPGEDP +Y S + GLQG R KVAA CKH+ AYDLD+W G+DRFHF+AKV
Sbjct: 166 ETPGEDPFHVSQYVYSLIDGLQGGIDPRPYFKVAADCKHYAAYDLDSWEGIDRFHFDAKV 225
Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNG--YI 285
S QD+ + + F+ CV + KVASVMCSYN VNG+P CA+P +L+ +R W + ++
Sbjct: 226 SLQDLSEYYLPSFQSCVRDAKVASVMCSYNSVNGIPACANPYLLQDILRDFWGFDDDRWV 285
Query: 286 VSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINN 345
SDCD++G + T +FT T EA ADA++AG D+DCG H A+ + L++ D+
Sbjct: 286 TSDCDAIGNIFTTHNFTDTFAEAVADALKAGTDVDCGTSYSTHLPDALNQSLITRDDLER 345
Query: 346 ALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLP 405
AL T MRLG FD P SQP L DV PD Q LA AA +G+VLLKN G LP
Sbjct: 346 ALTRQYTSLMRLGYFD-PPESQPLRQLAWSDVNKPDAQALAHTAAVEGLVLLKNDG-FLP 403
Query: 406 LSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI--GRYARTIHQQGCKDVACA 463
+S +T+A+IGP ++ T M GNY G A TP QG + + G +
Sbjct: 404 VS-ASGKTIAIIGPYANATKDMQGNYFGTAPFIVTPFQGAVDAGFNEVVSAAGTS-INGT 461
Query: 464 DDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTI 523
+ F AAI + +D I G++ SIE+EA DR + G Q LV +++ K P +
Sbjct: 462 SEADFAAAIAVANSSDIIIFAGGINNSIESEAKDRLTIAWTGNQLSLVKQLASLGK-PVV 520
Query: 524 LVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEY 583
+V GG +D + ++ + A+IWAGYPGQ+GGTAI D++ G P G+L +T YP+++
Sbjct: 521 VVQFGGGQLDDSDLLDNDAVRAVIWAGYPGQSGGTAIFDVITGAVAPAGRLSVTQYPEDF 580
Query: 584 ITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLD 643
+ + MT+MA+RP + PGRTY++Y G V FGHG+ +T F + P
Sbjct: 581 VNQVGMTDMALRPGSAN--PGRTYKWYTGRPVLEFGHGLHFTTFDFSWRGRP-------- 630
Query: 644 GRHGSINATISGKAIKVTHAKCNRLTLGV-QVDVKNVGSKDGAHTLLVFSTPPAGHWAPH 702
GR +I + K + + L V+++N G+ + L+F AG +APH
Sbjct: 631 GRKYNIQHLLHTADKKFP----DLIPLDTFHVNIRNTGNITSDYVALLFLRSNAG-FAPH 685
Query: 703 --KQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEH----NIHIGGTK 755
K LV+F + H + AG+ V + +++ ++ VD G + G++ +I G
Sbjct: 686 PKKSLVSFARAHRIDAGSSATVDLGVNLGS-IARVDEHGDSWLFAGDYQLVLDIGDGVLS 744
Query: 756 HSVSL 760
HS SL
Sbjct: 745 HSFSL 749
>gi|395334835|gb|EJF67211.1| beta-xylosidase [Dichomitus squalens LYAD-421 SS1]
Length = 774
Score = 485 bits (1248), Expect = e-134, Method: Compositional matrix adjust.
Identities = 300/741 (40%), Positives = 419/741 (56%), Gaps = 44/741 (5%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C S R LI + +E ++ + VPRLG+ Y WWSE LHGV+ PG
Sbjct: 41 CDTSKDPITRATALIDLWTDEELTNNTVNASPGVPRLGLPAYNWWSEGLHGVAQ-SPGVT 99
Query: 103 FG--GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIF 160
F G+F ATSFPQ I ++F+ L +A+ VVS E RA N G AGL YW+PN+N F
Sbjct: 100 FAPSGNFSYATSFPQPILMGAAFDDPLIQAVASVVSTEGRAFNNVGRAGLDYWTPNINPF 159
Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQG--SDGDRLKVAASCKHFTAYDLDNWNGV 218
+DPRWGRGQETPGEDP Y + + GLQG KV A CKHF AYD+DNW G
Sbjct: 160 KDPRWGRGQETPGEDPFHLQGYVYNLILGLQGGLDPTPYFKVVADCKHFAAYDMDNWEGN 219
Query: 219 DRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGE 278
R+ FNA V++QD+ + + F+ CV + KVASVMCSYN VNG+P+CA+ +L+ +R
Sbjct: 220 VRYGFNAVVTQQDLSEYYLPSFQTCVRDAKVASVMCSYNAVNGIPSCANSFLLQDILRDY 279
Query: 279 WRLNG--YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRG 336
W + ++ SDCD+V Y ++T P +AAADA+ AG D+DCG F + A+ +G
Sbjct: 280 WGFDDTRWVTSDCDAVQNIYTPHNYTDNPAQAAADALLAGTDIDCGTFSSTYLPDALSQG 339
Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
L++ D+ A + +RLG FD P SQPY LG DV TP+ Q+LA AA +G+VL
Sbjct: 340 LVNATDLKRAAIRQYASLVRLGYFD-PPESQPYRQLGWSDVNTPEAQQLAHTAAVEGMVL 398
Query: 397 LKNQGPSLPLS-HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQ 455
LKN G +LPLS H+R +A+IGP ++ T M GNYAGIA +PL G + +
Sbjct: 399 LKNDG-TLPLSKHVRK--LALIGPWANATTLMQGNYAGIAPYLISPLLGAQQAGFDVEYV 455
Query: 456 GCKDVACADDQL-FGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKV 514
+V +D F AA+ A+++ADA I GLD+++E E +DR + PG Q +LV+++
Sbjct: 456 FGTNVTTTNDTSGFAAAVAAAKRADAVIFAGGLDETVEREEVDRLNVTWPGNQLDLVAEL 515
Query: 515 SMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKL 574
+ K P I+ GG +D + K+ + AIIW GYPGQ+GGTA+ DIL G + P G+L
Sbjct: 516 ASVGK-PLIVAQFGGGQLDDSALKSKRSVNAIIWGGYPGQSGGTALFDILTGKAAPAGRL 574
Query: 575 PMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANA 634
P+T YP EY +PMT+M +RPS + PGRTY++Y G V+ FG G+ YT F A+
Sbjct: 575 PITQYPAEYANQVPMTDMTLRPSATN--PGRTYKWYTGTPVFEFGFGLHYTTFSFAWAS- 631
Query: 635 PTVVAVPLDGRHGSINATISGKAIKVTHAKCNR----LTLG----VQVDVKNVGSKDGAH 686
+ N + +I A N+ L L V V N G +
Sbjct: 632 -----------NAHANTPAASYSIDALMASGNKSAAFLDLAPLDTFAVRVTNTGKMTSDY 680
Query: 687 TLLVFST---PPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIP 743
L+F++ PA H P+KQLVA+ +VH A Q + ++ D SG + +
Sbjct: 681 VALLFASGTFGPAPH--PNKQLVAYTRVHGVAPKQSTIAELTVTLGAIARADESGAKWVY 738
Query: 744 LGEHNIHIGGT---KHSVSLH 761
G + + + T KH+ +L
Sbjct: 739 PGTYTLALDTTEQLKHTFTLE 759
>gi|392560759|gb|EIW53941.1| glycoside hydrolase family 3 protein [Trametes versicolor FP-101664
SS1]
Length = 783
Score = 482 bits (1240), Expect = e-133, Method: Compositional matrix adjust.
Identities = 291/687 (42%), Positives = 402/687 (58%), Gaps = 28/687 (4%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C ++ R LIG + +E ++ + VPRLG+ Y WWSE LHGV+ PG
Sbjct: 41 CDITKDPITRATALIGLWTDEELTSNTVNASPGVPRLGLPAYNWWSEGLHGVAQ-SPGVT 99
Query: 103 FG--GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIF 160
F G+F ATSFPQ I ++F+ TL +AI +VS E RA N G AGL YW+PN+N F
Sbjct: 100 FAPSGNFSHATSFPQPILMGAAFDDTLIQAIATIVSTEGRAFNNAGRAGLDYWTPNINPF 159
Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR--LKVAASCKHFTAYDLDNWNGV 218
+DPRWGRGQETPGEDP +Y + + GLQG + KV A CKHF AYDL+NW G+
Sbjct: 160 KDPRWGRGQETPGEDPFHLSQYVYNLILGLQGGLDPKPYFKVVADCKHFAAYDLENWEGI 219
Query: 219 DRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGE 278
R F+A VS+QD+ + + PF+ CV + KVASVMCSYN VNG+P+CA+ +L+ +R
Sbjct: 220 VRNGFDAIVSQQDLSEFYLPPFQTCVRDAKVASVMCSYNAVNGIPSCANSFLLQDVLRDH 279
Query: 279 WRL--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRG 336
W + ++ SDCD+V +T+ P +AAADA+ AG D+DCG F + A+QRG
Sbjct: 280 WGFTDDRWVTSDCDAVENILTPHKYTTDPAQAAADALLAGTDIDCGTFSSTYLPEALQRG 339
Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
L++ D+ A + +RLG FD +P++QPY LG DV TP Q+LA AA +GIVL
Sbjct: 340 LVNSTDLRRAAIRQYASLVRLGYFD-DPAAQPYRQLGWSDVNTPQAQQLAHTAAVEGIVL 398
Query: 397 LKNQGPSLPLS-HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQ 455
LKN G LP S H+R +A+IGP ++ T + G+Y G+A +PLQG +
Sbjct: 399 LKNDG-VLPFSKHVRK--LALIGPWANATSLLQGSYIGVAPYLVSPLQGAQEAGFEVEYV 455
Query: 456 GCKDVACADDQL-FGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKV 514
+V +D F AA+ A R+ADA + GLD+++E E DR + PG Q +LV+++
Sbjct: 456 LGTNVTTQNDMSGFAAAVAAVRRADAVVFAGGLDETVECEGTDRLNVTWPGNQLDLVAEL 515
Query: 515 SMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKL 574
K P I+ GG +D K+ + AIIW GYPGQ+GGTA+ DIL G + P G+L
Sbjct: 516 ERVGK-PLIVAQFGGGQLDDTALKHSKAVNAIIWGGYPGQSGGTALFDILTGKAAPAGRL 574
Query: 575 PMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANA 634
P+T YP Y +PMT+M++RPS + PGRTY++Y G V+ FG G+ YT FV + A
Sbjct: 575 PITQYPAAYTKQVPMTDMSLRPSATN--PGRTYKWYSGTPVFEFGFGLHYTTFVFSWAAP 632
Query: 635 PTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGV------QVDVKNVGSKDGAHTL 688
AV GS+ + S + V H + + L + V V N G +
Sbjct: 633 SAAAAVDSTASFGSLAKSYSISQL-VAHGQESTAFLDLAPLDTFAVRVTNTGRVASDYVA 691
Query: 689 LVFST---PPAGHWAPHKQLVAFEKVH 712
L+F + PA H P KQLVA+ +VH
Sbjct: 692 LLFVSGAFGPAPH--PKKQLVAYTRVH 716
>gi|452846807|gb|EME48739.1| glycoside hydrolase family 3 protein [Dothistroma septosporum
NZE10]
Length = 802
Score = 481 bits (1239), Expect = e-133, Method: Compositional matrix adjust.
Identities = 279/728 (38%), Positives = 415/728 (57%), Gaps = 34/728 (4%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C + R LI +LQEK+ S + VPRLG+ Y WW EALHGV++ PG
Sbjct: 39 CDTTADPLTRATALINAFTLQEKLNNTGSTSPGVPRLGLPAYTWWQEALHGVAS-SPGVN 97
Query: 103 FG--GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIF 160
F G F ATSFPQ I ++F+ L + V+S EARA N AGL +W+PN+N F
Sbjct: 98 FSDSGPFRYATSFPQPILMGAAFDDDLIRDVATVISTEARAFNNDKRAGLDFWTPNINPF 157
Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLK-VAASCKHFTAYDLDNWNGVD 219
+D RWGRGQETPGEDP Y A+ + GLQGS D+ K V A+CKHF AYD+++WNG
Sbjct: 158 KDSRWGRGQETPGEDPYHLSSYVAALIEGLQGSPDDKYKRVVATCKHFVAYDMESWNGNF 217
Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
R+ F+A+VS QD+ + + PF+ C + V + MCSYN +NGVPTCADP +L+ +R +W
Sbjct: 218 RYQFDAQVSSQDLVEYYMPPFQQCARDSNVGAFMCSYNALNGVPTCADPWLLQTVLREKW 277
Query: 280 RLNG---YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRG 336
++ SDCD+V + + ST EEAAA +++AG D++CG + H +A +G
Sbjct: 278 NWTSEQQWVTSDCDAVQNVFLPHDYASTREEAAALSLKAGTDINCGTYYQDHLPAAYDQG 337
Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
L++ D++ +L+ + +RLG FDG + PY +L DV TP Q+LA +AA +GI L
Sbjct: 338 LINTTDLDISLIRQYSSLVRLGYFDGL--AVPYRNLTWNDVSTPHAQQLAYKAAAEGITL 395
Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIH--- 453
LKN G LPL+ ++A+IG ++ T M+GNY GI + +PL + T++
Sbjct: 396 LKNDG-VLPLTISNGTSIALIGDWANATDQMLGNYDGIPPFFHSPLYAAQQTGATVNFAT 454
Query: 454 QQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSK 513
G + D L A+ ++D I G+D S+E+E +DR L G Q +++ +
Sbjct: 455 GPGGQGDPTTDHWL--PVWAAANKSDVIIYAGGIDNSVESEGMDRVSLTWTGAQLDMIGQ 512
Query: 514 VSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGK 573
++M K P I++ M GG ID + N+P ++A+IW GYPGQ GG A+ DI+ G + P G+
Sbjct: 513 LAMYGK-PVIVLQMGGGQIDSSPLVNNPNVSALIWGGYPGQDGGVALFDIIRGITAPAGR 571
Query: 574 LPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVAN 633
LP T YP +YI+ +PMT+M +RP+ + PGRTY +Y V+P+G G+ YTNF T A
Sbjct: 572 LPTTQYPAKYISQVPMTDMTLRPNSTTGSPGRTYIWYNENAVFPYGLGLHYTNF--TAAI 629
Query: 634 APTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT------LGVQVDVKNVGSKDGAHT 687
P+ P S N+ + I + C V + N G +
Sbjct: 630 KPS---FPSTYDSSSSNSGSASYDISTLTSNCTATYKDLCPFTSFSVSITNTGEIMSDYV 686
Query: 688 LLVFST---PPAGHWAPHKQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIP 743
L F PA H P+K+LV+++++H + AG+ Q +N+ + L+ VD G + +
Sbjct: 687 TLGFLAGIHGPAPH--PNKRLVSYQRLHNITAGSSQTAWLNLTLGS-LARVDEMGNKVLY 743
Query: 744 LGEHNIHI 751
G++ + +
Sbjct: 744 PGDYALLV 751
>gi|451849522|gb|EMD62825.1| glycoside hydrolase family 3 protein [Cochliobolus sativus ND90Pr]
Length = 849
Score = 481 bits (1238), Expect = e-133, Method: Compositional matrix adjust.
Identities = 279/715 (39%), Positives = 411/715 (57%), Gaps = 39/715 (5%)
Query: 52 RVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFG--GDFPG 109
R L+ +L+EK+ + A V RLGI Y+WW+E LHG++ GP T F GD+
Sbjct: 114 RAKSLVALYTLEEKINATSNSAPGVARLGIPPYQWWNEGLHGIA--GPFTSFAKQGDYSY 171
Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQ 169
+TSFPQ I ++F+ L + V+S EARA N GL +W+PN+N FRDPRWGRGQ
Sbjct: 172 STSFPQPILMGAAFDDNLITEVANVISTEARAFNNVNRTGLDFWTPNINPFRDPRWGRGQ 231
Query: 170 ETPGEDPVLSGKYAASYVRGLQGSDGDRL-KVAASCKHFTAYDLDNWNGVDRFHFNAKVS 228
ETPGED Y + + GLQG++ D +V A+CKH+ YD++NWNG R+ + ++S
Sbjct: 232 ETPGEDSYHLSSYVKALIHGLQGNETDPYRRVVATCKHYAGYDIENWNGNLRYQNDVQIS 291
Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNG---YI 285
+QD+ + + PF CV + V + MCSYN VNG P CADP +L+ +R W + ++
Sbjct: 292 QQDLVEYYLAPFEACV-QANVGAFMCSYNAVNGAPPCADPYMLQTVLREHWGWSSDEHWV 350
Query: 286 VSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINN 345
SDCDS+ Y ++ST E AAAD++ AG DLDCG +L H AV++GL +E ++N
Sbjct: 351 TSDCDSIQNVYLPHQWSSTREGAAADSLNAGTDLDCGTYLQSHLPGAVKQGLTNETTLDN 410
Query: 346 ALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLP 405
AL+ + ++LG FD P +QPY LG V T Q LAL+AA +GIVLLKN G LP
Sbjct: 411 ALIRQYSSLIKLGYFD-IPENQPYRQLGFDAVATSASQALALKAAEEGIVLLKNDG-VLP 468
Query: 406 LSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTP---LQGIG---RYARTIHQQGCKD 459
+ + + V + G ++ T + GNY G+A T+P L+ +G RYA + G D
Sbjct: 469 I-NFGSKNVGIYGDWANATSQLQGNYFGVAKFLTSPYMALEKLGVNVRYAGNL-PGGQGD 526
Query: 460 VACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASK 519
I S D I V G+D IE+E DR+ L L G Q +++ +++ K
Sbjct: 527 PTTGSWPRLSGVITTS---DVHIWVGGMDNGIESEDRDRSWLTLTGSQLDVIGQLADTGK 583
Query: 520 GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWY 579
P I+++M GG ID + +P+I+A++WAGYPGQ GGTAI +IL G + P G+LP T Y
Sbjct: 584 -PVIVIIMGGGQIDTSPLIKNPKISAVLWAGYPGQDGGTAIVNILTGKAAPAGRLPQTQY 642
Query: 580 PQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
+Y++ +PMT+MAMRP S + PGRTY++Y G ++ FG+G+ YTNF ++ N P
Sbjct: 643 LYKYVSEVPMTDMAMRP--SNKNPGRTYKWYTGKPIFEFGYGLHYTNFSASITNQPK--- 697
Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTL-GVQVDVKNVGSKDGAHTLLVFSTPPAG- 697
+ +I+ + G T R G+ V V+N G + L F T G
Sbjct: 698 -----QSYAISDLVKG--CNSTGGFLERCPFTGINVSVQNTGKTSSDYVTLGFLTGSFGP 750
Query: 698 HWAPHKQLVAFEKV-HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHI 751
P K LVA++++ ++ A + +N+ + L+ VD SG + + G++ + I
Sbjct: 751 KPYPKKSLVAYDRLFNIAASSSSTATLNLTLAS-LARVDESGNKVLYPGDYELQI 804
>gi|413919686|gb|AFW59618.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 475
Score = 479 bits (1233), Expect = e-132, Method: Compositional matrix adjust.
Identities = 241/447 (53%), Positives = 309/447 (69%), Gaps = 5/447 (1%)
Query: 315 AGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGP 374
AGLDL+CG FL HT +AVQ G LSE D++ A+ N L MRLG FDG+P P+G+LGP
Sbjct: 31 AGLDLNCGTFLAQHTVAAVQAGKLSESDVDRAVTNNLVTLMRLGFFDGDPRELPFGNLGP 90
Query: 375 KDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGI 434
DVCTP +QELA EAARQGIVLLKN G LPLS +++AVIGPN++ + TMIGNY G
Sbjct: 91 SDVCTPSNQELAREAARQGIVLLKNTG-KLPLSATSIKSMAVIGPNANASFTMIGNYEGT 149
Query: 435 ACGYTTPLQGIGRYARTIHQQGCKDVACADDQL-FGAAIDASRQADATILVMGLDQSIEA 493
C YTTPLQG+G T++Q GC +V C+ + L AA A+ AD T+LV+G DQSIE
Sbjct: 150 PCKYTTPLQGLGANVATVYQPGCTNVGCSGNSLQLDAATKAAASADVTVLVVGADQSIER 209
Query: 494 EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPG 553
E+LDR LLLPG+Q +LVS V+ AS GP ILV+MSGGP D++FAK+ +IAAI+W GYPG
Sbjct: 210 ESLDRTSLLLPGQQPQLVSAVANASSGPCILVVMSGGPFDISFAKSSDKIAAILWVGYPG 269
Query: 554 QAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGP 613
+AGG AIAD+LFG NP G+LP+TWYP+ + T +PMT+M MRP S YPGRTYRFY G
Sbjct: 270 EAGGAAIADVLFGYHNPSGRLPVTWYPESF-TKVPMTDMRMRPDPSTGYPGRTYRFYTGD 328
Query: 614 VVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQ 673
VY FG G+SYT+F H + +AP +A+ L H + T +++ A C L V
Sbjct: 329 TVYAFGDGLSYTSFAHHLVSAPKQLALQLAEGHACL--TEQCPSVEAEGAHCEGLAFDVH 386
Query: 674 VDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSV 733
+ V+N G + G HT+ +FS+PPA H AP K L+ FEKV + G V + VCK LSV
Sbjct: 387 LRVRNAGERSGGHTVFLFSSPPAVHNAPAKHLLGFEKVSLEPGQAGVVAFKVDVCKDLSV 446
Query: 734 VDRSGTRRIPLGEHNIHIGGTKHSVSL 760
VD G R++ LG H +H+G KH+++L
Sbjct: 447 VDELGNRKVALGSHTLHVGDLKHTLNL 473
>gi|393247584|gb|EJD55091.1| beta-xylosidase [Auricularia delicata TFB-10046 SS5]
Length = 763
Score = 478 bits (1229), Expect = e-132, Method: Compositional matrix adjust.
Identities = 287/755 (38%), Positives = 410/755 (54%), Gaps = 34/755 (4%)
Query: 10 LGLILLSASSSGLAAREPFACDP-KDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKL 68
GL+L + + + L A A P KD C + R LI + +E V
Sbjct: 8 FGLVLAALTPATLGAFPDCANGPLKDNLV-----CNTTANFMDRAKALIDEFTTEELVNN 62
Query: 69 LISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKF---GGDFPGATSFPQVITTASSFNA 125
++G+ VPRLG+ Y+WWSEALHGV+ PG F G DF ATSFPQ I ++F+
Sbjct: 63 TVNGSPGVPRLGLPPYQWWSEALHGVAGANPGVHFAPAGEDFDHATSFPQPILMGAAFDD 122
Query: 126 TLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAAS 185
L + V+S EARA N G +G+ +++PN+N FRDPRWGRGQETPGEDP+ +Y
Sbjct: 123 ELIHEVATVISTEARAFNNFGFSGIDFFTPNINPFRDPRWGRGQETPGEDPLHISRYVFQ 182
Query: 186 YVRGLQGSDGDR--LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMC 243
V LQG G K+ A CKHF YDL++W G+DRFHF+A ++ QD+ + + F+ C
Sbjct: 183 LVTALQGGLGPSPYYKIVADCKHFAGYDLESWEGIDRFHFDAVITTQDLAEFYTPSFQSC 242
Query: 244 VMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRL-NGYIVSDCDSVGVYYDTQHFT 302
V + KV SVMCSYN VNGVP CA +L+ +R + L +G+I SDCD+V + T +FT
Sbjct: 243 VRDAKVGSVMCSYNSVNGVPACASSYLLQDIVRDFYGLGDGWITSDCDAVQNVFTTHNFT 302
Query: 303 STPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDG 362
+T A+A +++AG D+DCG A+ +GL+ E D+ ALV +R G FD
Sbjct: 303 TTQANASAISLKAGTDVDCGNVYAQSLGDALDQGLVEEDDLKQALVRLYGSLVRTGYFD- 361
Query: 363 EPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSD 422
P QP+ LG DV TP + LAL AA +GIVLLKN G LPLS V ++GP +
Sbjct: 362 SPEEQPFRQLGWADVDTPASRRLALLAAEEGIVLLKNDG-LLPLSSRDVPNVIMVGPWGN 420
Query: 423 VTVTMIGNYAGIACGYTTPLQGIGRYARTI-HQQGCKDVACADDQLFGAAIDASRQADAT 481
T M GNY G A +P QG + G D F A+ A+ D
Sbjct: 421 ATTMMQGNYFGNAPYLVSPRQGFVDAGFNVTFFNGTVGTNGTDTSGFDEAVAAAGDTDLI 480
Query: 482 ILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDP 541
+ V G D +E E+ DR + PG Q +L+ +++ K P I++ M G +D + K
Sbjct: 481 VFVGGPDNVVERESRDRINITWPGVQLDLIKELAGVGK-PMIVLQMGAGQVDDTWLKESD 539
Query: 542 RIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKR 601
I A+IW GYPGQ+GGTA+A+I+ G + P +LP+T YP++YI+ LPMT+M +RPS S
Sbjct: 540 AINALIWGGYPGQSGGTALANIVTGKTAPAARLPITQYPEDYIS-LPMTDMNVRPSNSS- 597
Query: 602 YPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVT 661
PGRTY+++ G ++ FG G+ Y+ F A P P G + A S T
Sbjct: 598 -PGRTYKWFTGEPIFEFGFGLHYSKFDFAWAEEP-----PASFAIGDLVANASSPVDLAT 651
Query: 662 HAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG-HWAPHKQLVAFEKV-HVPAGAQQ 719
QV+V N+G ++F AG AP K+LV + ++ ++P GA
Sbjct: 652 FHT-------FQVNVTNLGPVASDFVAMLFGNTTAGPSPAPLKELVGYTRLTNIPVGATV 704
Query: 720 RVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGT 754
+ + + ++ D G + G++++ + T
Sbjct: 705 TASVPVTLGT-IARADEDGNSVLFPGQYSVWLDTT 738
>gi|224068504|ref|XP_002302759.1| predicted protein [Populus trichocarpa]
gi|222844485|gb|EEE82032.1| predicted protein [Populus trichocarpa]
Length = 273
Score = 477 bits (1227), Expect = e-131, Method: Compositional matrix adjust.
Identities = 218/251 (86%), Positives = 237/251 (94%)
Query: 24 AREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKG 83
A +PFACDP+D T+R PFCQV LPI RV+DLIGR++LQEKV LL++ AAAVPRLGIKG
Sbjct: 23 ALDPFACDPEDGTSRNFPFCQVKLPIQSRVSDLIGRMTLQEKVGLLVNDAAAVPRLGIKG 82
Query: 84 YEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY 143
YEWWSEALHGVSNVGPGT+FGG FPGATSFPQVITTA+SFNATLWEAIGRVVSDEARAM+
Sbjct: 83 YEWWSEALHGVSNVGPGTQFGGAFPGATSFPQVITTAASFNATLWEAIGRVVSDEARAMF 142
Query: 144 NGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAAS 203
NGG AGLTYWSPNVNIFRDPRWGRGQETPGEDPV++GKYAASYVRGLQG+DGDRLKVAA
Sbjct: 143 NGGVAGLTYWSPNVNIFRDPRWGRGQETPGEDPVVAGKYAASYVRGLQGNDGDRLKVAAC 202
Query: 204 CKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVP 263
CKHFTAYDLDNWNGVDRFHFNA+VSKQD+EDTFDVPFRMCV EGKVASVMCSYNQVNG+P
Sbjct: 203 CKHFTAYDLDNWNGVDRFHFNAQVSKQDMEDTFDVPFRMCVKEGKVASVMCSYNQVNGIP 262
Query: 264 TCADPNILKRT 274
TCADP +LK+T
Sbjct: 263 TCADPKLLKKT 273
>gi|398403795|ref|XP_003853364.1| putative xylan 1,4-beta-Xylosidase [Zymoseptoria tritici IPO323]
gi|339473246|gb|EGP88340.1| putative xylan 1,4-beta-Xylosidase [Zymoseptoria tritici IPO323]
Length = 785
Score = 473 bits (1218), Expect = e-130, Method: Compositional matrix adjust.
Identities = 274/713 (38%), Positives = 394/713 (55%), Gaps = 33/713 (4%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C + R LI +++EK+ S A VPRLG+ Y WW EALHGV+ PG
Sbjct: 39 CDFTADPLTRATALIAAFTIEEKINNTGSTAPGVPRLGLPAYTWWQEALHGVAQ-SPGVN 97
Query: 103 FG--GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIF 160
F GDF ATSFPQ I ++F+ L + + V+S EARA N +GL YW+PN+N F
Sbjct: 98 FSDSGDFRYATSFPQPILMGAAFDDDLIKDVATVISTEARAFNNDARSGLDYWTPNINPF 157
Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDR 220
+D RWGRGQETPGEDP Y S + GLQG DG KV A+CKHF AYDL+ WNG R
Sbjct: 158 KDSRWGRGQETPGEDPYHLSSYVKSLIAGLQG-DGKYKKVVATCKHFVAYDLETWNGNFR 216
Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
+ F+ V Q++ + + PF+ C + V + MCSYN +NG+PTCADP +L+ +R W
Sbjct: 217 YQFDPHVGSQELVEYYMPPFQACARDANVGAFMCSYNSLNGIPTCADPYLLQTILREHWN 276
Query: 281 LNG---YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGL 337
++ SDCDS+ Y +TST EEA A +++AG D++CG + A+ GL
Sbjct: 277 WTSEEQWVTSDCDSIQNVYLPHEYTSTREEAVAVSLKAGTDVNCGTYYQEFLPGALSLGL 336
Query: 338 LSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLL 397
++E DI+ AL+ + +RLG FDG ++ Y L KDV TP Q+LAL+AA +GI LL
Sbjct: 337 VTEKDIDMALIRQYSSLVRLGYFDG--TAVEYRSLSWKDVSTPYAQQLALKAAVEGITLL 394
Query: 398 KNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGC 457
KN G LPL+ + +AVIG ++ T M+GNY GI +PL + + G
Sbjct: 395 KNDG-ILPLAITKDTKIAVIGDWANATEQMLGNYDGIPPYLHSPLWAAQQTGANVTYSGN 453
Query: 458 K----DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSK 513
D + A+D +AD + G+D +EAE +DR + G Q +++ +
Sbjct: 454 PGGQGDPTTNNWLHIWTAVD---EADVILFAGGIDNGVEAEGMDRVSIAWTGAQLDVIGQ 510
Query: 514 VSMASKG-PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGG 572
+AS+G P I+ M +D N+ I+A++W GYPGQ GG A+ DI+ G S P G
Sbjct: 511 --LASRGKPVIVAQMGTNGVDSTPLLNNQNISALLWGGYPGQDGGVALLDIIQGKSAPAG 568
Query: 573 KLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVA 632
+LP T YP YI+ +PMT+M +RP+ + +PGRTY +Y V+ FG+G+ YTNF T++
Sbjct: 569 RLPTTQYPASYISKVPMTDMHLRPNSTTGFPGRTYMWYNEKPVFEFGYGLHYTNFSATIS 628
Query: 633 NAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFS 692
P D SI A ++ + +C + ++ V N G+ + L F
Sbjct: 629 --------PTDTTSFSI-ADLTKDCTEHYMDRCPFADM--KIAVTNTGNVTSDYVTLGFL 677
Query: 693 TPPAGHW-APHKQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIP 743
G P+K+LV ++++H + AGA Q +N+ + V D T P
Sbjct: 678 AGEHGPAPCPNKRLVNYQRLHNITAGASQTTSLNLTLASLARVDDMGNTVLYP 730
>gi|389748262|gb|EIM89440.1| hypothetical protein STEHIDRAFT_182874, partial [Stereum hirsutum
FP-91666 SS1]
Length = 772
Score = 471 bits (1212), Expect = e-130, Method: Compositional matrix adjust.
Identities = 280/722 (38%), Positives = 402/722 (55%), Gaps = 28/722 (3%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C + R LI +L + V ++G+ V RLG+ Y+WW+EALHGV + PG
Sbjct: 39 CNTTAHFVDRATSLIEEFNLTDLVNNTVNGSPGVDRLGLPPYQWWNEALHGVGS-SPGVN 97
Query: 103 FG----GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVN 158
+G +F ATSFP I ++FN +L +I V+S EARA N AGLT+++PN+N
Sbjct: 98 WGSGPDANFTSATSFPAPILLGATFNDSLIASIADVISTEARAFNNFNYAGLTFFTPNIN 157
Query: 159 IFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG--SDGDRLKVAASCKHFTAYDLDNWN 216
FRDPRWGRGQETPGEDP +Y YV GLQG S KV A+CKH AYD++NW
Sbjct: 158 PFRDPRWGRGQETPGEDPYHLSRYVYQYVVGLQGGLSPDPYYKVLANCKHVLAYDVENWE 217
Query: 217 GVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIR 276
G DR FNA V+ QD+ + + F+ C+ + + AS MCSYN VNGVP+CA ILK +R
Sbjct: 218 GNDRTGFNAVVTTQDLSEFYTPSFQGCLRDAQGASAMCSYNAVNGVPSCASSYILKDLVR 277
Query: 277 GEWRL---NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAV 333
W L G+I DC +V Y +T T A A A+ AG DLDCG + +AV
Sbjct: 278 DFWGLGEREGWITGDCGAVQNIYQPHGYTDTLVNATAVAMDAGTDLDCGDVYSPNLWTAV 337
Query: 334 QRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQG 393
GL++ I AL+ +RLG FD QPY +V TP Q+LA AA QG
Sbjct: 338 VEGLITAGQIQTALIRLYGSLIRLGYFD-PAEQQPYRSFDWSNVNTPSSQDLAYNAAVQG 396
Query: 394 IVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIH 453
IVLL+N G LPLS + +A+IGP ++ T+++ GNYAGIA +P Q +
Sbjct: 397 IVLLENDG-LLPLS-TNVKNIALIGPMANATLSLQGNYAGIAPFVISPQQAFETAGYNVT 454
Query: 454 QQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSK 513
++ +D+ + A++A++ AD + V G+D SIEAE DR + PG Q +L+ +
Sbjct: 455 FAFGTGISNSDNSGYSEALEAAQGADVVVFVGGIDNSIEAEGQDRTSIEWPGSQLDLIGQ 514
Query: 514 VSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGK 573
+ K P ++V M GG D + K + + A++WAGYPGQ+GGTA+ DI+ G +P G+
Sbjct: 515 LGELGK-PLVVVRMGGGQCDDSTLKANATVNALLWAGYPGQSGGTALVDIISGKQSPSGR 573
Query: 574 LPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVAN 633
LP+T YP Y++ + MT+MA+RP+ S PGRTY++Y G +YPFG+G+ YT F ++
Sbjct: 574 LPVTQYPSSYVSEIDMTDMAIRPNSSGS-PGRTYKWYTGAPIYPFGYGIHYTTFRLAWSD 632
Query: 634 APTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSK-DGAHTLLVFS 692
+ + D I A K + + V N GS + L+F+
Sbjct: 633 SSSTTYNIQD---------IVSSANKSGGFADTEILDTFSLLVTNTGSNYTSDYVALLFA 683
Query: 693 TPPAG-HWAPHKQLVAFEKV-HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIH 750
+G AP ++LV + +V H+ G +N+ + +S VD +G + G +N+
Sbjct: 684 NSTSGPSPAPLQELVGYTRVPHITPGGTATAELNVTLGS-ISRVDENGNWILYPGTYNLW 742
Query: 751 IG 752
+G
Sbjct: 743 VG 744
>gi|392570764|gb|EIW63936.1| glycoside hydrolase family 3 protein [Trametes versicolor FP-101664
SS1]
Length = 781
Score = 471 bits (1212), Expect = e-130, Method: Compositional matrix adjust.
Identities = 292/725 (40%), Positives = 411/725 (56%), Gaps = 28/725 (3%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C V+ R LI + +E ++ + VPRLG+ Y WWSE LHGV+ PG
Sbjct: 41 CDVTKDPITRATALISIWTDEELTNNTVNASPGVPRLGLPAYNWWSEGLHGVAQ-SPGVT 99
Query: 103 FG--GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIF 160
F G+F ATSFPQ I ++F+ L +AI +VS E RA N G AGL YW+PN+N F
Sbjct: 100 FAPSGNFSYATSFPQPILMGAAFDDPLIQAIATIVSTEGRAFNNAGRAGLDYWTPNINPF 159
Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR--LKVAASCKHFTAYDLDNWNGV 218
+DPRWGRGQETPGEDP +Y + + GLQG + KV A CKHF AYD+DNW GV
Sbjct: 160 KDPRWGRGQETPGEDPFHLSQYVYNLILGLQGGLDPKPYFKVVADCKHFAAYDMDNWEGV 219
Query: 219 DRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGE 278
R+ FNA VS+QD+ + + PF+ CV + KVASVMCSYN VNG+P+CA+ +L+ +R
Sbjct: 220 VRYGFNAVVSQQDLSEFYLPPFQTCVRDAKVASVMCSYNAVNGIPSCANSFLLQDVLRDH 279
Query: 279 WRL--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRG 336
W + ++ SDCD+V + ++T+ P +AAADA+ AG D+DCG F + A+QRG
Sbjct: 280 WGFTDDRWVTSDCDAVQNIFTPHNYTTDPAQAAADALLAGTDIDCGTFSSTYLPEALQRG 339
Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
L++ D+ A + +RLG FD +P++QPY LG DV T Q+LA AA +G+VL
Sbjct: 340 LVNSTDLRRAAIRQYASLVRLGYFD-DPAAQPYRQLGWSDVNTLQAQQLAHTAAVEGMVL 398
Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQG 456
LKN G LPLS R R +A+IGP ++ T + GNY GIA +P+QG + +
Sbjct: 399 LKNDG-LLPLSK-RVRKLALIGPWANATRLLQGNYFGIAPYLVSPVQGAQQAGFEVEYVF 456
Query: 457 CKDVACADDQL-FGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVS 515
+V +D F AA+ A+++ADA + GLD+++E E +DR + PG Q +LV+++
Sbjct: 457 GTNVTTRNDTSGFAAAVAAAKRADAVVFAGGLDETVEREEIDRLNVTWPGNQLDLVAELE 516
Query: 516 MASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLP 575
K P I+ GG +D K + AIIW GYPGQ+GGTA+ DIL G + P G+LP
Sbjct: 517 RVGK-PLIVAQFGGGQLDNTALKRSKAVNAIIWGGYPGQSGGTALFDILTGKAAPAGRLP 575
Query: 576 MTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAP 635
+T YP Y +PMT+M +RPS + PGRTY++Y G V+ FG G+ YT F A
Sbjct: 576 ITQYPAAYAEQVPMTDMTLRPSATN--PGRTYKWYSGTPVFEFGFGLHYTTFAFAWAAPG 633
Query: 636 TVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGV------QVDVKNVGSKDGAHTLL 689
+ + +IS V H + + L + V V N G + L
Sbjct: 634 AAADSTASFGGPAKSYSISQ---LVAHGQESAAFLDLAPLDTFAVRVTNTGKVASDYVAL 690
Query: 690 VF---STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGE 746
+F S PA H P K LVA+ ++H A VG ++ D +G + + G
Sbjct: 691 LFVSGSFGPAPH--PKKTLVAYTRIHGLAPRGSTVGQLPVTLGAIARADENGEKWVHPGT 748
Query: 747 HNIHI 751
+ + +
Sbjct: 749 YTLAL 753
>gi|392590128|gb|EIW79457.1| glycoside hydrolase family 3 protein [Coniophora puteana RWD-64-598
SS2]
Length = 770
Score = 469 bits (1206), Expect = e-129, Method: Compositional matrix adjust.
Identities = 275/727 (37%), Positives = 414/727 (56%), Gaps = 33/727 (4%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C SL QR L+ +++E + ++G+ VPRLG+ Y+WWSE LHGV++ PG
Sbjct: 37 CDTSLNATQRAAALVELFTVEELINNTVNGSPGVPRLGLPAYQWWSEGLHGVAD-SPGVN 95
Query: 103 FG--GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIF 160
F G F ATSFPQ I +++F+ L +A+G VV E R+ N G AGL +W+PN+N F
Sbjct: 96 FSTSGPFSYATSFPQPIVMSAAFDDALIKAVGGVVGMEGRSFNNYGHAGLDFWTPNINPF 155
Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR--LKVAASCKHFTAYDLDNWNGV 218
+DPRWGRGQETPGEDP +Y + ++GLQG +V A+CKHF YDL++W
Sbjct: 156 KDPRWGRGQETPGEDPYHIAQYVYNLIQGLQGGVNPEPYFQVVATCKHFAGYDLEDWENN 215
Query: 219 DRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGE 278
R+ F+A ++ QD+ + + F+ C + + + MCSYN VNG+PTCAD +L+ +R
Sbjct: 216 FRYGFDALITTQDLSEFYLPSFQSCYRDAQAGASMCSYNAVNGIPTCADTYLLQDILRDY 275
Query: 279 WRLNG--YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRG 336
W + ++ SDCD+V Y+ ++T+ P++AAADA+RAG DLDCG F + A +
Sbjct: 276 WNFDETRWVTSDCDAVENIYNPHNYTALPQQAAADALRAGTDLDCGTFYTEYLPLAYNQS 335
Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
L++E ++ AL +RLG FD + QPY G +V TP Q+LA AA +GI L
Sbjct: 336 LITETELRAALTRQYASLVRLGYFD-PAAQQPYRQYGWSNVDTPYAQQLAYTAATEGITL 394
Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQG 456
LKN G +LPL + +A+IGP ++ T M GNY G+A +PLQG +
Sbjct: 395 LKNDG-TLPLPSTL-KNIALIGPWANATNQMQGNYFGVAPYLVSPLQGALAAGYNVTYVF 452
Query: 457 CKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSM 516
++ F AAI A+R+ADA + G+D ++EAEA+DR + PG Q +L+ +++
Sbjct: 453 GTNITSNSTAGFAAAIAAAREADAVVYAGGIDVTVEAEAMDRYNVTWPGNQLQLIGELAA 512
Query: 517 ASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPM 576
K P ++ GG +D K + + ++IWAGYPGQ+GG A+ DI+ G P G+L
Sbjct: 513 LGK-PFVVAQFGGGQVDDTEIKANASVNSLIWAGYPGQSGGQALFDIISGKVAPAGRLVT 571
Query: 577 TWYPQEYITNLPMTEMAMRPSQS-KRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAP 635
T YP +Y+ +PMT+M +RP+ + PGRTY++Y G VY FG+G+ YTNF +T AP
Sbjct: 572 TQYPADYVYEIPMTDMNLRPNANGTTSPGRTYKWYTGAPVYEFGYGLHYTNFTYTWTKAP 631
Query: 636 TVVAVPLDGRHGSINATISGK--AIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-- 691
+I +S A + A + L+ V V N G+ ++ L+F
Sbjct: 632 ASTY--------NIQTLVSAASGAAHIDLAPFDTLS----VAVTNAGAVTSDYSALLFVN 679
Query: 692 -STPPAGHWAPHKQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNI 749
+ PA + P+K L A+ ++H V AGA Q ++ V ++ D G + G + +
Sbjct: 680 GTYGPAPY--PNKALAAYTRLHSVAAGAAQTATFDV-VLNQIARADAYGNFWLYPGAYEL 736
Query: 750 HIGGTKH 756
+ T+
Sbjct: 737 ALDTTRE 743
>gi|83774566|dbj|BAE64689.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 822
Score = 467 bits (1202), Expect = e-128, Method: Compositional matrix adjust.
Identities = 278/734 (37%), Positives = 411/734 (55%), Gaps = 48/734 (6%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
P C SL I +RV+ L+ L+L+EK+ L+ +A RLG+ YEWWSEA HGV + PG
Sbjct: 78 PVCDTSLSIAERVDSLVKSLTLEEKILNLVDASAGSTRLGLPSYEWWSEATHGVGS-APG 136
Query: 101 TKFG---GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNV 157
+F +F ATSFP I TA+SF+ TL I V+ E RA N G +G +W+PN+
Sbjct: 137 VQFTSKPANFSYATSFPAPILTAASFDDTLIRKIAEVIGREGRAFGNNGFSGFDFWAPNI 196
Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
N FRDPRWGRGQETPGEDP+++ Y ++V GLQG D +V A+CKH+ YDL+
Sbjct: 197 NGFRDPRWGRGQETPGEDPLVAQNYIRNFVPGLQGDDPKNKQVIATCKHYAVYDLE---- 252
Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
R+ N ++QD+ D F PF+ CV + V S+MCSYN V+G+P CA+ +L +R
Sbjct: 253 TGRYGNNYNPTQQDLSDYFLAPFKTCVRDTDVGSIMCSYNSVSGIPACANEYLLSEVLRK 312
Query: 278 EWRLNG---YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAV 333
W N Y+VSDC +V + +FT T E AA+ A+ AG+DL+CG +L L+ A
Sbjct: 313 HWNFNSDYHYVVSDCGAVTDIWQYHNFTDTEEAAASVALNAGVDLECGSSYLKLNESLAA 372
Query: 334 QRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQG 393
+ + +D +L + +G FDG Y L DV TPD Q LA EAA +G
Sbjct: 373 NQTSVKVMD--QSLARLYSALFTVGFFDG----GKYDKLDFSDVSTPDAQALAYEAAVEG 426
Query: 394 IVLLKNQGPSLPL-SHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTI 452
+ LLKN LPL S ++++VAVIGP ++ T M G+Y+G A +PL+ G +
Sbjct: 427 MTLLKNDD-LLPLDSPHKYKSVAVIGPFANATTQMQGDYSGDAPYLISPLEAFGDSRWKV 485
Query: 453 HQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVS 512
+ + + F A+ A+ ++D I + G+D S+E+E LDR L PG Q +L++
Sbjct: 486 NYALGTAMNNQNTSGFEEALAAANKSDLIIYLGGIDNSLESETLDRTSLTWPGNQLDLIT 545
Query: 513 KVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGG 572
+S SK P ++V GG +D + + I A++WAGYP Q+GGTA+ D+L G +P G
Sbjct: 546 SLSKLSK-PLVVVQFGGGQVDDSDILKNKDIQALVWAGYPSQSGGTALLDVLVGKRSPAG 604
Query: 573 KLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNF----- 627
+LP+T YP Y + + ++ +RP+ S YPGRTY++Y G V PFG+G+ YT F
Sbjct: 605 RLPVTQYPASYADQVNIFDINLRPTDS--YPGRTYKWYTGKPVLPFGYGLHYTKFMFDWE 662
Query: 628 --VHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGA 685
++ N +VA + G IN N V+V VKNVG K
Sbjct: 663 KTLNREYNIQDLVASCRNSSGGPIND--------------NTPLTTVKVRVKNVGHKTSD 708
Query: 686 H-TLLVFSTPPAGHWA-PHKQLVAFEK-VHVPAGAQQRVGINIHVCKYLSVVDRSGTRRI 742
+ +LL S+ AG P+K LV++ + +++ G+ Q + + + L+ D +G+ I
Sbjct: 709 YVSLLFLSSKNAGPAPRPNKSLVSYVRLLNIARGSDQVAELPLTLGS-LARADENGSLVI 767
Query: 743 PLGEHNIHIGGTKH 756
G + I + ++
Sbjct: 768 FPGRYKIALDHSEE 781
>gi|391865040|gb|EIT74331.1| beta-glucosidase-related glycosidase [Aspergillus oryzae 3.042]
Length = 822
Score = 466 bits (1200), Expect = e-128, Method: Compositional matrix adjust.
Identities = 276/734 (37%), Positives = 409/734 (55%), Gaps = 48/734 (6%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
P C SL I +RV+ L+ L+L+EK+ L+ +A RLG+ YEWWSEA HGV + PG
Sbjct: 78 PVCDTSLSIAERVDSLVKSLTLEEKILNLVDASAGSTRLGLPSYEWWSEATHGVGS-APG 136
Query: 101 TKFG---GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNV 157
+F +F ATSFP I TA+SF+ TL I V+ E RA N G +G +W+PN+
Sbjct: 137 VQFTSKPANFSYATSFPAPILTAASFDDTLIRKIAEVIGREGRAFGNNGFSGFDFWAPNI 196
Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
N FRDPRWGRGQETPGEDP+++ Y ++V GLQG D +V A+CKH+ YDL+
Sbjct: 197 NGFRDPRWGRGQETPGEDPLVAQNYIRNFVPGLQGDDPKNKQVIATCKHYAVYDLE---- 252
Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
R+ N ++QD+ D F PF+ CV + V S+MCSYN V+G+P CA+ +L +R
Sbjct: 253 TGRYGNNYNPTQQDLSDYFLAPFKTCVRDTDVGSIMCSYNSVSGIPACANEYLLDEVLRK 312
Query: 278 EWRLNG---YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAV 333
W N Y+VSDC +V + +FT T E AA+ A+ AG+DL+CG +L L+ A
Sbjct: 313 HWNFNSDYYYVVSDCGAVTDIWQYHNFTDTEEAAASVALNAGVDLECGSSYLKLNESLAA 372
Query: 334 QRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQG 393
+ + +D +L + +G FDG Y L DV TPD Q LA EAA +G
Sbjct: 373 NQTSVKVMD--RSLARLYSALFTVGFFDG----GKYDKLDFSDVSTPDAQALAYEAAVEG 426
Query: 394 IVLLKNQGPSLPLSHI-RHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTI 452
+ LLKN LPL ++++VAVIGP ++ T M G+Y+G A +PL+ G +
Sbjct: 427 MTLLKNDD-LLPLDFPHKYKSVAVIGPFANATTQMQGDYSGDAPYLISPLEAFGDSRWKV 485
Query: 453 HQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVS 512
+ + + F A+ A+ ++D I + G+D S+E+E LDR L PG Q +L++
Sbjct: 486 NYALGTAINNQNTSGFEEALAAANKSDLIIYLGGIDNSLESETLDRTSLAWPGNQLDLIT 545
Query: 513 KVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGG 572
+S SK P ++V GG +D + + I A++WAGYP Q+GGTA+ D+L G +P G
Sbjct: 546 SLSKLSK-PLVVVQFGGGQVDDSAILKNKDIQALVWAGYPSQSGGTALLDVLVGKRSPAG 604
Query: 573 KLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNF----- 627
+LP+T YP Y + + ++ +RP+ S YPGRTY++Y G V PFG+G+ YT F
Sbjct: 605 RLPVTQYPASYADQVNIFDINLRPTDS--YPGRTYKWYTGKPVLPFGYGLHYTKFMFDWE 662
Query: 628 --VHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGA 685
++ N +VA + G IN N V+ VKNVG K
Sbjct: 663 KTLNREYNIQDLVASCRNSSGGPIND--------------NTPLTTVKARVKNVGHKTSD 708
Query: 686 H-TLLVFSTPPAGHWA-PHKQLVAFEK-VHVPAGAQQRVGINIHVCKYLSVVDRSGTRRI 742
+ +LL S+ AG P+K LV++ + +++ G+ Q + + + L+ D +G+ I
Sbjct: 709 YVSLLFLSSKNAGPAPRPNKSLVSYVRLLNIARGSDQVAELPLTLGS-LARADENGSLVI 767
Query: 743 PLGEHNIHIGGTKH 756
G + I + ++
Sbjct: 768 FPGRYKIALDNSEE 781
>gi|317156541|ref|XP_001825822.2| exo-1,4-beta-xylosidase xlnD [Aspergillus oryzae RIB40]
Length = 882
Score = 466 bits (1200), Expect = e-128, Method: Compositional matrix adjust.
Identities = 278/729 (38%), Positives = 409/729 (56%), Gaps = 48/729 (6%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
P C SL I +RV+ L+ L+L+EK+ L+ +A RLG+ YEWWSEA HGV + PG
Sbjct: 138 PVCDTSLSIAERVDSLVKSLTLEEKILNLVDASAGSTRLGLPSYEWWSEATHGVGS-APG 196
Query: 101 TKFG---GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNV 157
+F +F ATSFP I TA+SF+ TL I V+ E RA N G +G +W+PN+
Sbjct: 197 VQFTSKPANFSYATSFPAPILTAASFDDTLIRKIAEVIGREGRAFGNNGFSGFDFWAPNI 256
Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
N FRDPRWGRGQETPGEDP+++ Y ++V GLQG D +V A+CKH+ YDL+
Sbjct: 257 NGFRDPRWGRGQETPGEDPLVAQNYIRNFVPGLQGDDPKNKQVIATCKHYAVYDLE---- 312
Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
R+ N ++QD+ D F PF+ CV + V S+MCSYN V+G+P CA+ +L +R
Sbjct: 313 TGRYGNNYNPTQQDLSDYFLAPFKTCVRDTDVGSIMCSYNSVSGIPACANEYLLSEVLRK 372
Query: 278 EWRLNG---YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAV 333
W N Y+VSDC +V + +FT T E AA+ A+ AG+DL+CG +L L+ A
Sbjct: 373 HWNFNSDYHYVVSDCGAVTDIWQYHNFTDTEEAAASVALNAGVDLECGSSYLKLNESLAA 432
Query: 334 QRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQG 393
+ + +D +L + +G FDG Y L DV TPD Q LA EAA +G
Sbjct: 433 NQTSVKVMD--QSLARLYSALFTVGFFDG----GKYDKLDFSDVSTPDAQALAYEAAVEG 486
Query: 394 IVLLKNQGPSLPL-SHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTI 452
+ LLKN LPL S ++++VAVIGP ++ T M G+Y+G A +PL+ G +
Sbjct: 487 MTLLKNDD-LLPLDSPHKYKSVAVIGPFANATTQMQGDYSGDAPYLISPLEAFGDSRWKV 545
Query: 453 HQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVS 512
+ + + F A+ A+ ++D I + G+D S+E+E LDR L PG Q +L++
Sbjct: 546 NYALGTAMNNQNTSGFEEALAAANKSDLIIYLGGIDNSLESETLDRTSLTWPGNQLDLIT 605
Query: 513 KVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGG 572
+S SK P ++V GG +D + + I A++WAGYP Q+GGTA+ D+L G +P G
Sbjct: 606 SLSKLSK-PLVVVQFGGGQVDDSDILKNKDIQALVWAGYPSQSGGTALLDVLVGKRSPAG 664
Query: 573 KLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNF----- 627
+LP+T YP Y + + ++ +RP+ S YPGRTY++Y G V PFG+G+ YT F
Sbjct: 665 RLPVTQYPASYADQVNIFDINLRPTDS--YPGRTYKWYTGKPVLPFGYGLHYTKFMFDWE 722
Query: 628 --VHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGA 685
++ N +VA + G IN N V+V VKNVG K
Sbjct: 723 KTLNREYNIQDLVASCRNSSGGPIND--------------NTPLTTVKVRVKNVGHKTSD 768
Query: 686 H-TLLVFSTPPAGHWA-PHKQLVAFEK-VHVPAGAQQRVGINIHVCKYLSVVDRSGTRRI 742
+ +LL S+ AG P+K LV++ + +++ G+ Q + + + L+ D +G+ I
Sbjct: 769 YVSLLFLSSKNAGPAPRPNKSLVSYVRLLNIARGSDQVAELPLTLGS-LARADENGSLVI 827
Query: 743 PLGEHNIHI 751
G + I +
Sbjct: 828 FPGRYKIAL 836
>gi|388857998|emb|CCF48443.1| related to Beta-xylosidase [Ustilago hordei]
Length = 782
Score = 466 bits (1199), Expect = e-128, Method: Compositional matrix adjust.
Identities = 290/784 (36%), Positives = 424/784 (54%), Gaps = 56/784 (7%)
Query: 7 FFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKV 66
F G+ L S + L+ P +C P ++ C ++P R L+ + + +E +
Sbjct: 2 LFRRGITLSSLARMHLSFSFP-SCAPDSPLSKIPDICDPTIPFYTRATSLVNQFTTEELL 60
Query: 67 KLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFG-----GDFPGATSFPQVITTAS 121
I+ A VPRLGI Y+WW+EALHGV+ PG F +F AT FPQ I +
Sbjct: 61 NNTINYAPGVPRLGIPNYQWWTEALHGVAK-SPGVNFDLSDPHAEFTSATQFPQTINLGA 119
Query: 122 SFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSP-NVNIFRDPRWGRGQETPGEDPVLSG 180
+F+ L++ I V++ E RA N G AGL +SP N+N FRDPRWGRGQET GEDP+
Sbjct: 120 TFDDDLYQQIASVIASEVRAYNNAGKAGLNLYSPLNINCFRDPRWGRGQETVGEDPLHMS 179
Query: 181 KYAASYVRGLQG------SDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIED 234
++A S V GLQG ++G++L VAA+CKHF AYDL+ ++ +R+ F+A VSKQD+ D
Sbjct: 180 RFAVSIVHGLQGPHAQNEAEGNKLTVAATCKHFLAYDLEQYDRGERYQFDAIVSKQDLSD 239
Query: 235 TFDVP-FRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLN---GYIVSDCD 290
F +P FR CV +G ++M SYN VN VP A L+ R W L+ Y+ SDCD
Sbjct: 240 -FHLPQFRACVRDGGATTLMTSYNAVNNVPPSASKYYLQTLARQAWGLDKTHNYVTSDCD 298
Query: 291 SVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNT 350
+V YD + EAAA +I AG DLDCG + +A+++ L I A++
Sbjct: 299 AVANVYDGHRYAQNYVEAAAKSINAGTDLDCGATYSENLGAALKQKLTDIATIRRAVIRM 358
Query: 351 LTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIR 410
+RLG FD +P+SQP L KDV +P Q LA +A I LLKN +LP+ +
Sbjct: 359 YASLVRLGYFD-DPASQPLRQLTWKDVNSPSSQRLAYTSALSSITLLKNLDSTLPIKQ-K 416
Query: 411 HRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY---ARTIHQQGCKDVAC----A 463
+A+IGP ++V+ + GNYAG A T + + A+ + G D++ +
Sbjct: 417 PTKIAIIGPYTNVSTSFSGNYAGPAAFNMTMVHAASQVFPDAKIVWVNGT-DISGPYIPS 475
Query: 464 DDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMA----SK 519
D Q A+ + AD+ + G+D SIE E+ DR + P Q L+ ++S + K
Sbjct: 476 DAQ---DAVKLTSDADSVVFAGGIDASIERESHDRKDIAWPPNQLRLIHELSQSRKKDKK 532
Query: 520 GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWY 579
++V GG +D A K+D + A++WAGYPGQ+ A+ DIL G + P G+LP+T Y
Sbjct: 533 SKLVVVQFGGGQLDGASLKSDDAVGALVWAGYPGQSASLAVWDILAGKAVPAGRLPVTQY 592
Query: 580 PQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
P YI LP + M++RP YPGRTY++YKG YPFGHG+ YT F ++A P A
Sbjct: 593 PASYIDGLPESAMSLRPKAG--YPGRTYKWYKGVPTYPFGHGLHYTTFSASLAK-PQPYA 649
Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF---STPPA 696
+P T + K + HA+ + VQ ++KN G +T L+F S PA
Sbjct: 650 IP---------TTPAAKGPEGVHAEHISVA-DVQANIKNTGKVASDYTALLFARHSNGPA 699
Query: 697 GHWAPHKQLVAFEKV-HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTK 755
+ P K LV + KV ++ AG + V I I L+ D G + + G + + + +
Sbjct: 700 PY--PRKTLVGYTKVKNLSAGEESSVTIKITQAA-LARADEEGNQFLYPGSYQLELDTEE 756
Query: 756 HSVS 759
H ++
Sbjct: 757 HRLA 760
>gi|359473427|ref|XP_002265788.2| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 1-like
[Vitis vinifera]
Length = 464
Score = 466 bits (1199), Expect = e-128, Method: Compositional matrix adjust.
Identities = 226/467 (48%), Positives = 311/467 (66%), Gaps = 16/467 (3%)
Query: 142 MYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR---- 197
MYN G AGLT+WSPN+N+ RD RWGR QET EDP + G++A +YVRGLQ +G
Sbjct: 1 MYNLGHAGLTFWSPNINVVRDTRWGRTQETSREDPFMVGEFAVNYVRGLQDVEGTENVTD 60
Query: 198 -----LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASV 252
LKV++ CKH+ AYD+D+W +DR F+A+VS+QD+++TF PF CV EG V+SV
Sbjct: 61 LNSRPLKVSSCCKHYAAYDIDSWLNIDRHTFDARVSEQDMKETFVSPFERCVREGDVSSV 120
Query: 253 MCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPE-EAAAD 311
MCS+N++NG+P C+DP +LK IR EW L+GYIVSDC + V D Q++ + + +A A
Sbjct: 121 MCSFNKINGIPPCSDPRLLKGVIRDEWDLHGYIVSDCYGLEVIVDNQNYLNDSKVDAVAK 180
Query: 312 AIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGH 371
++AGLDL+CG + V G +S+ +++ AL N + MR+G FDG P+ Y
Sbjct: 181 TLQAGLDLECGHYYTDALNELVLTGKVSQYELDRALKNIYVLLMRVGYFDGIPA---YES 237
Query: 372 LGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY 431
LG KD+C DH ELA EAARQGIVLLKN PL + +A++GP+++ T MIGNY
Sbjct: 238 LGLKDICAADHIELAREAARQGIVLLKNDYEVFPLK--PGKKLALVGPHANATEVMIGNY 295
Query: 432 AGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSI 491
AG+ Y +PL+ + GC D +C++D F A +A++ A+ TI+ +G D SI
Sbjct: 296 AGLPRKYVSPLEAFSAIGNVTYTTGCLDASCSNDTYFSEAKEAAKSAEVTIIFVGTDLSI 355
Query: 492 EAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGY 551
EAE +DR LLPG Q EL+ +V+ S GP ILV++SG ID+ FAKN+PRI+AI+W G+
Sbjct: 356 EAEFVDRVDFLLPGNQTELIKQVAEVSSGPVILVVLSGSNIDITFAKNNPRISAILWVGF 415
Query: 552 PGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEM-AMRPS 597
PG+ GG AIAD++FG NPGG+LP+TWY +Y+ L M A PS
Sbjct: 416 PGEQGGHAIADVVFGKYNPGGRLPVTWYEADYVACLETHIMDAKTPS 462
>gi|443893988|dbj|GAC71176.1| hypothetical protein PANT_1d00031 [Pseudozyma antarctica T-34]
Length = 759
Score = 466 bits (1199), Expect = e-128, Method: Compositional matrix adjust.
Identities = 276/726 (38%), Positives = 397/726 (54%), Gaps = 49/726 (6%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C SL R L+ + QE + I+ A VPRLGI Y+WW+EALHGV+ PG
Sbjct: 36 CDTSLDYWTRATSLVAEFTTQELINNTINTAPGVPRLGIPPYQWWTEALHGVAG-SPGVN 94
Query: 103 FGGD----FPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSP-NV 157
F D + AT+FPQ+I ++F+ L+E + +++E RA N G AGL +SP N+
Sbjct: 95 FADDVEAPYGSATNFPQIINLGATFDDALYEQVATHIANETRAFNNAGKAGLNMYSPLNI 154
Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
N FRDPRWGRGQET GEDP+ +YA V+GLQG + D L++AA+CKH+ AYDL+ W+G
Sbjct: 155 NCFRDPRWGRGQETTGEDPLHMSRYAVKMVQGLQGPNQDELRLAATCKHYLAYDLEKWDG 214
Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
V+R+ F+A+VS+Q++ + + FR CV +GK ++M SYN VN VP A L+ R
Sbjct: 215 VERYQFDAQVSRQELAEFYLPQFRACVRDGKAVTLMTSYNAVNNVPPSASRYYLETLARK 274
Query: 278 EWRLN---GYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQ 334
EW L+ Y+ SDCD+V +D H+ + +AAAD+I AG DL+CG + A++
Sbjct: 275 EWGLDKKHNYVTSDCDAVANVFDGHHYADSYVQAAADSINAGTDLNCGATYSDNLGQALE 334
Query: 335 RGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGI 394
+ L I A+ Q+RLG+FD + QP LG + V T Q+LA +A +
Sbjct: 335 QNLTDVETIRTAVARMYASQVRLGLFDPK-QGQPLRELGWEHVNTKAAQDLAYSSAAASV 393
Query: 395 VLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG---IACGYTTPLQGIGRYART 451
LLKN G +LP+ VAVIGP S+ T + GNYAG A T Q + A
Sbjct: 394 TLLKNNG-TLPVDGATK--VAVIGPYSNATFALRGNYAGPGPFAITMTEAAQRVFSQATI 450
Query: 452 IHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELV 511
G + AA+ +++AD I G+D +IE+E LDRA + P Q +L+
Sbjct: 451 SSANGTTISGTYNHTDAEAAMQLAKEADLVIFAGGIDPTIESEELDRATIAWPPNQLQLI 510
Query: 512 SKV-SMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNP 570
+ MA K +V GG ID A K D I A++WAGYPGQ+G A+ D++ G + P
Sbjct: 511 HALGGMAKK--MAVVQFGGGQIDGASIKADGNIGALLWAGYPGQSGALAVMDVIAGNTAP 568
Query: 571 GGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHT 630
G+LP+T YP EYI L T MA+RP+ + YPGRTY++Y G YP+ HG+ YT F
Sbjct: 569 AGRLPITQYPAEYIDGLAETTMALRPNAT--YPGRTYKWYSGTPTYPYAHGLHYTEFKAE 626
Query: 631 VAN-APTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLL 689
+A AP +A +A+ R+ VQ + N G + + L
Sbjct: 627 LAQPAPYTIAT-------------------AGYAEFERVAT-VQATITNAGQRTSDYAAL 666
Query: 690 VF---STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCK-YLSVVDRSGTRRIPLG 745
VF + PA H P+K LV ++KV A + R + + + + L+ D G + G
Sbjct: 667 VFARHTNGPAPH--PNKTLVGYKKVKAIAPGESR-SVEVEITQAALARGDEEGNLVLYPG 723
Query: 746 EHNIHI 751
++ + +
Sbjct: 724 KYELEL 729
>gi|302683060|ref|XP_003031211.1| glycoside hydrolase family 3 protein [Schizophyllum commune H4-8]
gi|300104903|gb|EFI96308.1| glycoside hydrolase family 3 protein [Schizophyllum commune H4-8]
Length = 761
Score = 465 bits (1197), Expect = e-128, Method: Compositional matrix adjust.
Identities = 291/717 (40%), Positives = 394/717 (54%), Gaps = 28/717 (3%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C SL +R L+ L++ E + + A VPRLG+ Y WW+EALHGV+ PG
Sbjct: 35 CDTSLGHVERARALVEELTVAEMINNTVHTAPGVPRLGLPPYNWWNEALHGVA-ASPGVV 93
Query: 103 F---GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNI 159
F G +F ATSFP I S+F+ L A+G V S EARA N G AGL YW+PN+N
Sbjct: 94 FTSPGEEFSSATSFPMPINMGSAFDDALMLAVGNVTSTEARAFNNAGLAGLDYWTPNINP 153
Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGV 218
F+DPRWGRG ETPGEDP+ + +Y + V GLQG D LKVAA CKH+ AYDL++W GV
Sbjct: 154 FKDPRWGRGAETPGEDPLHAARYVRTLVEGLQGGIDPPSLKVAADCKHWAAYDLEDWGGV 213
Query: 219 DRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGE 278
R+ F+A V+ QD+ + + PF+ CV + + ASVMCSYN VNGVP CA P +LK +R
Sbjct: 214 ARYAFDAVVTPQDLAEYYSPPFKSCVRDARAASVMCSYNAVNGVPACASPYLLKTVLRDA 273
Query: 279 WRL--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRG 336
W L + ++ SDCD+VG YD +T +A +++AG DLDCG + A RG
Sbjct: 274 WGLAEDRWVTSDCDAVGNVYDPHGYTEDFVNGSAVSLKAGSDLDCGTTYSQYLPEAYDRG 333
Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
L+ E D+ AL + LG FD P QPY + DV TP Q LA AA + VL
Sbjct: 334 LIDEDDLKAALTRLYASLVWLGYFDA-PEDQPYRQISWADVNTPAAQALAYTAAIESFVL 392
Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVT-VTMIGNYAGIACGYTTPLQGIGRYARTIHQQ 455
LKN G +LPL+ ++A+IGP ++ + V + GNY GI PLQG +
Sbjct: 393 LKNDG-TLPLTD-SSLSIALIGPMANASAVQLQGNYNGIPPFAIAPLQGFLDAGFNVTYV 450
Query: 456 GCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVS 515
+V D A+ A+ AD I V G+D ++E EA DR + P Q L+S +
Sbjct: 451 LGTNVTGNDADDIDGAVAAAEAADVVIYVGGIDSTVEEEAKDRTEISWPDNQLALLSALE 510
Query: 516 MASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLP 575
A K P ++V M GG +D K + AI+WAGYPGQ+GGTAIAD + G P G+L
Sbjct: 511 EAGK-PLVVVQMGGGQLDDTPLKESDAVNAILWAGYPGQSGGTAIADTVMGKVAPAGRLS 569
Query: 576 MTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNF-VHTVANA 634
+T YP Y+ + MT+M +RP S PGRTY++Y G VYP+G+G+ YTNF V ++A
Sbjct: 570 ITQYPASYVDAVAMTDMTLRPDNSTGNPGRTYKWYTGTPVYPYGYGLHYTNFSVAWASDA 629
Query: 635 PTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP 694
P SI S V A + +V V N G L+F +
Sbjct: 630 PEACY--------SIQDLTSSADGFVDLAPLDTF----RVTVTNDGDVASDFVALLFVST 677
Query: 695 PAGHW-APHKQLVAFEKV-HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNI 749
AG AP K+LVA+ + V G V + + + L+ D SG + G++ +
Sbjct: 678 QAGPAPAPMKELVAYARASDVQPGDSTDVDLEVTLGA-LARSDESGDASLYPGDYEL 733
>gi|242216161|ref|XP_002473890.1| beta-xylosidase [Postia placenta Mad-698-R]
gi|220726990|gb|EED80923.1| beta-xylosidase [Postia placenta Mad-698-R]
Length = 741
Score = 465 bits (1197), Expect = e-128, Method: Compositional matrix adjust.
Identities = 282/730 (38%), Positives = 409/730 (56%), Gaps = 31/730 (4%)
Query: 30 CDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSE 89
C TT T+ C S +R LI +L+EK+ + A VPRLG+ Y+WW E
Sbjct: 23 CANGPLTTNTV--CDTSATPLERATALISLFTLEEKINNTGNTAPGVPRLGLPAYQWWQE 80
Query: 90 ALHGVSNVGPGTKFG--GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT 147
ALHGV+ PG F G++ ATSFPQ I ++F+ L + +VS EARA N
Sbjct: 81 ALHGVAE-SPGVIFAPSGEYSYATSFPQPILMGAAFDDALINHVATIVSTEARAFNNANR 139
Query: 148 AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGS-DGDRLKVAASCKH 206
+G+ +W+PN+N F+DPRWGRGQETPGEDP Y + + GLQG D + ++ A+CKH
Sbjct: 140 SGIDFWTPNINPFKDPRWGRGQETPGEDPFHLQSYVYNLITGLQGGLDPEYKRIVATCKH 199
Query: 207 FTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCA 266
F AYDL+NW G R+ F+A VS QD+ + + FR C + V S MCSYN VNGVP+CA
Sbjct: 200 FAAYDLENWEGNVRYGFDALVSLQDLSEFYTRSFRTCARDANVGSFMCSYNAVNGVPSCA 259
Query: 267 DPNILKRTIRGEW---RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGP 323
+ +L+ +R W + YI SDCD++ Y+ ++T+T E ADA+ AG DLDCG
Sbjct: 260 NSYLLQDILRDHWGWTNEDQYITSDCDAIQNIYEPHYYTATRAETVADALNAGTDLDCGE 319
Query: 324 FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSS-QPYGHLGPKDVCTPDH 382
+ + +A +GL +E +N AL+ ++LG FD P+ QPY +G +V TP+
Sbjct: 320 YYPENLGAAYDQGLFTESTLNRALIRQYAALVKLGYFD--PADIQPYRQIGWANVSTPEA 377
Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
+ELA AA +GI LLKN G +LPLS +T+A+IGP ++ T M GNY G+A +PL
Sbjct: 378 EELAYTAAVEGITLLKNDG-TLPLSP-SIKTIALIGPWANATTQMQGNYYGVAPYLISPL 435
Query: 443 QGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLL 502
T++ V F AA A+ ADA I G+D ++EAEA+DR L
Sbjct: 436 MAAEELGFTVYYSAGPGVDDPTTSSFPAAFAAAEAADAIIYAGGIDITVEAEAMDRYTLD 495
Query: 503 LPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIAD 562
PG Q + + ++S+ K P I++ GG ID + +P + A++W GYPGQ+GG AI D
Sbjct: 496 WPGVQPDFIDQLSLLGK-PLIVLQFGGGQIDDSALLPNPGVNALVWGGYPGQSGGKAIMD 554
Query: 563 ILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGM 622
I+ G + P G+LP+T YP +Y+ + MT+M++RPS + PGRTY +Y G + FG G+
Sbjct: 555 IIVGNAAPAGRLPITQYPLDYVYQVAMTDMSLRPSPTN--PGRTYMWYTGTPIVEFGFGL 612
Query: 623 SYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSK 682
YT F +++ P+ + + AT+ V H +V N GS
Sbjct: 613 HYTTFTASLSQ-PSAPSYDI--------ATLVSLCSGVAHPDLCPFA-SYTANVTNTGSS 662
Query: 683 DGAH--TLLVFSTPPAGHWAPHKQLVAFEKVHVPAG-AQQRVGINIHVCKYLSVVDRSGT 739
+ +LL + P+K LVA++++H A A Q +N+ + LS VD G
Sbjct: 663 VTSDFVSLLFLAGEHGPAPYPNKVLVAYDRLHAIAPLASQTTTLNLTLGS-LSRVDDYGN 721
Query: 740 RRIPLGEHNI 749
+ GE+ +
Sbjct: 722 TILYPGEYTL 731
>gi|392596548|gb|EIW85871.1| hypothetical protein CONPUDRAFT_80240 [Coniophora puteana
RWD-64-598 SS2]
Length = 770
Score = 465 bits (1196), Expect = e-128, Method: Compositional matrix adjust.
Identities = 283/751 (37%), Positives = 404/751 (53%), Gaps = 38/751 (5%)
Query: 22 LAAREPFACDPKDATTRTL---PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPR 78
LAA + A D TT L C SL QR LI ++ E + ++ A VPR
Sbjct: 13 LAASQVSAYGFPDCTTAPLVNNSVCDTSLNATQRAAALIDLFTVDELIVNTVNWAPGVPR 72
Query: 79 LGIKGYEWWSEALHGVSN-VGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSD 137
LG+ YEWWSE LHGV+N G G F ATSFPQ I +++F+ L +A+G V+
Sbjct: 73 LGLPAYEWWSEGLHGVANSAGVTWSITGPFSYATSFPQPILMSAAFDDALIKAVGGVIGM 132
Query: 138 EARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG--SDG 195
E RA N G AGL +W+PN+N F+DPRWGRGQETPGEDP +Y + ++GLQG
Sbjct: 133 EGRAFNNYGHAGLDFWTPNINPFKDPRWGRGQETPGEDPYHIAQYVYNLIQGLQGGLDPE 192
Query: 196 DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCS 255
+V A+CKHF YDL++W+ R+ +NA +S QD+ + + F+ C + + MCS
Sbjct: 193 PYFQVVATCKHFAGYDLEDWDFNYRYGYNAIISTQDLSEYYLPSFQSCYRDAFAGASMCS 252
Query: 256 YNQVNGVPTCADPNILKRTIRGEWRLNG--YIVSDCDSVGVYYDTQHFTSTPEEAAADAI 313
YN +NG+PTCAD +L+ +RG W + ++ DCDSV YD H+T+ P++AAADA+
Sbjct: 253 YNAINGIPTCADTYLLQDILRGFWGFDQTRWVTGDCDSVEDIYDFHHYTALPQQAAADAL 312
Query: 314 RAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLG 373
+AG D+DCG F A L++E D+ AL +RLG FD S QPY
Sbjct: 313 KAGSDIDCGIFYTTWLPLAYTESLITEQDLRAALTRQYASLVRLGYFD-PASEQPYRQYN 371
Query: 374 PKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG 433
+V T QELA AA +GI LLKN G +LP S + +A+IGP + T M GNY G
Sbjct: 372 WSNVDTSYAQELAYTAAVEGITLLKNDG-TLPFSSAI-KNIALIGPWTFATTQMQGNYYG 429
Query: 434 IACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEA 493
A +P QG I +V + AA A++ ADA + V G+D ++EA
Sbjct: 430 NAPYLISPYQGAQLAGYNISYVLETNVTSNTTDGYAAAFTAAQGADAIVFVGGIDNTVEA 489
Query: 494 EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPG 553
EA+DR + P Q L+ ++ K P ++V GG +D +P + A++W GYPG
Sbjct: 490 EAMDRNDITWPAFQLWLIGELGKLGK-PLVVVQFGGGQVDDTEINANPDVNALLWGGYPG 548
Query: 554 QAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRP-SQSKRYPGRTYRFYKG 612
Q+GG A+ DI+ G P G+L T YP +Y+ +PMT M +RP + PGRTY++Y G
Sbjct: 549 QSGGQALFDIISGKVAPAGRLVSTQYPADYVNEIPMTNMNLRPDANGTTSPGRTYKWYTG 608
Query: 613 PVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLG- 671
VY FG+G+ YTNF + AP AT S +A+ + L
Sbjct: 609 TPVYEFGYGLHYTNFTYAWTKAPA--------------ATYSIEALVAAGQGSAHIDLAP 654
Query: 672 ---VQVDVKNVGSKDGAHTLLVF---STPPAGHWAPHKQLVAFEKVH-VPAGAQQRVGIN 724
+ V+V N G+ ++ L+F + PA + P+K L A+ ++H V AGA Q
Sbjct: 655 FDTLSVEVTNAGAVTSDYSALLFVNGTYGPAPY--PNKSLAAYTRLHNVTAGASQTATFE 712
Query: 725 IHVCKYLSVVDRSGTRRIPLGEHNIHIGGTK 755
+ V ++ D G + G + + + T+
Sbjct: 713 V-VLNQIARADVQGNFWLYPGAYEVALDTTR 742
>gi|115436902|ref|XP_001217674.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|121734342|sp|Q0CB82.1|BXLB_ASPTN RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
Flags: Precursor
gi|114188489|gb|EAU30189.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 765
Score = 464 bits (1194), Expect = e-128, Method: Compositional matrix adjust.
Identities = 272/713 (38%), Positives = 395/713 (55%), Gaps = 37/713 (5%)
Query: 24 AREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKG 83
A+ PF D ++ C +L R L+ ++L+EK+ + VPRLG+
Sbjct: 25 AQSPFP-DCENGPLSKNAVCDTTLDPVTRAQALLAAMTLEEKINNTQYNSPGVPRLGLPA 83
Query: 84 YEWWSEALHGVSNVGPGTKFG--GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA 141
Y WWSEALHGV+ PG F G+F ATSFP IT ++F+ L + I V+ E RA
Sbjct: 84 YNWWSEALHGVAG-SPGVHFADSGNFSYATSFPSPITLGAAFDDDLVKQIATVIGTEGRA 142
Query: 142 MYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDG-DRLKV 200
N G AGL YW+PN+N +RDPRWGRGQETPGEDP + +Y + GLQ G ++ K+
Sbjct: 143 FGNAGHAGLDYWTPNINPYRDPRWGRGQETPGEDPFHTSRYVYHLIDGLQDGIGPEKPKI 202
Query: 201 AASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVN 260
A+CKHF YD+++W G +R+ F+A +S QD+ + + PF+ C + KV +VMCSYN VN
Sbjct: 203 VATCKHFAGYDIEDWEGNERYAFDAVISDQDMAEYYFPPFKTCTRDAKVDAVMCSYNSVN 262
Query: 261 GVPTCADPNILKRTIRGEWRLNG---YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGL 317
G+PTCADP +L+ +R W G ++ SDC ++ Y + + AAA A+ AG
Sbjct: 263 GIPTCADPWLLQTVLREHWEWEGVGHWVTSDCGAIDNIYKDHKYVADGAHAAAVAVNAGT 322
Query: 318 DLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDV 377
DLDCG SA+ +GLL ++ AL + ++LG FD + QPY +G DV
Sbjct: 323 DLDCGSVYPQFLGSAISQGLLGNRTLDRALTRLYSSLVKLGYFD-PAADQPYRSIGWSDV 381
Query: 378 CTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACG 437
TPD ++LA AA +G VLLKN G +LPL ++ TVA++GP ++ T + GNY G A
Sbjct: 382 ATPDAEQLAHTAAVEGTVLLKNDG-TLPLK--KNGTVAIVGPYANATTQLQGNYEGTAKY 438
Query: 438 YTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALD 497
T L + + + F A++A++ +D I G+D +EAEALD
Sbjct: 439 IHTMLSAAAQQGYKVKYAPGTGINSNSTSGFEQALNAAKGSDLVIYFGGIDHEVEAEALD 498
Query: 498 RAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGG 557
R + PG Q +L+ ++S K P ++V GG +D + ++ + ++WAGYP QAGG
Sbjct: 499 RTSIAWPGNQLDLIQQLSDLKK-PLVVVQFGGGQVDDSSLLSNAGVNGLLWAGYPSQAGG 557
Query: 558 TAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYP 617
A+ DIL G + P G+LP+T YP+EY+ +PMT+M +RP S PGRTYR+Y V+ P
Sbjct: 558 AAVFDILTGKTAPAGRLPVTQYPEEYVDQVPMTDMNLRPGPSN--PGRTYRWYDKAVI-P 614
Query: 618 FGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVK 677
FG+GM YT F + +G N A+K +A +L VK
Sbjct: 615 FGYGMHYTTFDVSWKRK----------NYGPYNTA----AVKAENAVLETFSL----QVK 656
Query: 678 NVGSKDGAHTLLVFSTPPAGHWAPH--KQLVAFEKVH-VPAGAQQRVGINIHV 727
N G + LVF T P+ K LV +++V + G ++ V I++ V
Sbjct: 657 NTGKVTSDYVALVFLTTTDAGPKPYPIKTLVGYQRVKAIRPGERKVVDIDVTV 709
>gi|330934749|ref|XP_003304687.1| hypothetical protein PTT_17336 [Pyrenophora teres f. teres 0-1]
gi|311318569|gb|EFQ87188.1| hypothetical protein PTT_17336 [Pyrenophora teres f. teres 0-1]
Length = 798
Score = 462 bits (1190), Expect = e-127, Method: Compositional matrix adjust.
Identities = 274/739 (37%), Positives = 404/739 (54%), Gaps = 43/739 (5%)
Query: 31 DPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEA 90
D + + + C S R L+ +L+EK+ SGA VPRLG+ Y+WW+E
Sbjct: 25 DCTNGPLKNVTICDPSASPLARAKSLVALYTLEEKINATSSGAPGVPRLGVPPYQWWNEG 84
Query: 91 LHGVSNVGPGTKF---GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT 147
LHG++ GP T F G ++ +TSFPQ I ++F+ L + +V+S EARA N
Sbjct: 85 LHGIA--GPYTNFSHSGVEWSYSTSFPQPILMGAAFDDDLITEVAKVISTEARAFNNANR 142
Query: 148 AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLK-VAASCKH 206
GL +W+PN+N FRDPRWGRGQETPGED Y + + GLQG D K V A+CKH
Sbjct: 143 TGLDFWTPNINPFRDPRWGRGQETPGEDAYHLSSYVQALIHGLQGEATDPYKRVVATCKH 202
Query: 207 FTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCA 266
F YD+++WNG R+ + ++++QD+ + + PF+ CV + V + MCSYN VNG P CA
Sbjct: 203 FAGYDVEDWNGNLRYQNDVQITQQDLVEYYLAPFQACV-QANVGAFMCSYNAVNGAPPCA 261
Query: 267 DPNILKRTIRGEWRLNG---YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGP 323
DP +L+ +R W N ++ DCD+V Y ++ST AAAD++ AG D+ CG
Sbjct: 262 DPYLLQTILREHWGWNKEEQWVTGDCDAVQNVYFPHQWSSTRAGAAADSLVAGTDITCGT 321
Query: 324 FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQ 383
++ H +A ++ LL+E ++ AL+ + +RLG FD P +QPY LG V T Q
Sbjct: 322 YMQEHLPAAFRQKLLNESSLDLALIRQYSSLVRLGYFDA-PENQPYRQLGFDAVATNASQ 380
Query: 384 ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQ 443
LA AA +GIVLLKN G +LPLS TV + G ++ T ++GNYAG+A +PL
Sbjct: 381 ALARRAAAEGIVLLKNDG-TLPLSLDSSMTVGLFGDWANATTQLLGNYAGVATYLHSPLY 439
Query: 444 GIGRYARTIHQQGCKDVACAD------DQLFGAAIDASRQADATILVMGLDQSIEAEALD 497
+ + I+ G K D L+G A +D I V G+D +E E D
Sbjct: 440 ALKQTGVKINYAGGKPGGQGDPTTNRWSNLYG----AYSTSDVLIYVGGIDNGVEEEGHD 495
Query: 498 RAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGG 557
R L G Q +++ +++ K P I+V+ GG ID + N+P I+AI+WAGYPGQ GG
Sbjct: 496 RGYLTWTGPQLDVIGQLAETGK-PVIVVVTGGGQIDSSPLVNNPNISAIMWAGYPGQDGG 554
Query: 558 TAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYP 617
+AI DI+ G + P G+LP T YP Y + M M +RP ++ PGRTY++Y G V+
Sbjct: 555 SAIIDIISGKTAPAGRLPQTQYPASYAAAVSMMNMNLRPGENN--PGRTYKWYNGSAVFE 612
Query: 618 FGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLG-VQVDV 676
FG+GM YTNF ++ + +I++ SG T R V V V
Sbjct: 613 FGYGMHYTNFSAAISTQMQ--------QSYAISSLASG--CNSTGGFLERCPFASVDVQV 662
Query: 677 KNVGSKDGAHTLLVF---STPPAGHWAPHKQLVAFEKVH-VPAGAQQRVGINIHVCKYLS 732
N G + L + + PA H P K LV+++++H + GA +N+ + ++
Sbjct: 663 HNTGKVTSDYVTLGYMAGTFGPAPH--PRKTLVSYKRLHNIAGGATSTAKLNLTLAS-VA 719
Query: 733 VVDRSGTRRIPLGEHNIHI 751
VD G + + G +++ I
Sbjct: 720 RVDEYGNKVLYPGHYSLQI 738
>gi|344302281|gb|EGW32586.1| hypothetical protein SPAPADRAFT_51129 [Spathaspora passalidarum
NRRL Y-27907]
Length = 788
Score = 462 bits (1188), Expect = e-127, Method: Compositional matrix adjust.
Identities = 271/721 (37%), Positives = 398/721 (55%), Gaps = 31/721 (4%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C LP QR ++ ++ E + + + + V RLG+ Y+WWSE LHG++ +
Sbjct: 61 CNPYLPNNQRAKAVVDLFTVDELIANMGNTSPGVERLGLPPYQWWSEGLHGIAR----SN 116
Query: 103 F--GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIF 160
F G++ ATSFPQ I +FN+ L++ +G V+ EARA N G AGL Y+SPN+N F
Sbjct: 117 FTASGEYSHATSFPQPILMGGAFNSDLYKQVGNVIGTEARAFNNVGRAGLDYYSPNINPF 176
Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGS-----DGDRLKVAASCKHFTAYDLDNW 215
+DPRWGRGQE E PVL G YA +YV+GLQG + D L+VAA+CKHF YD+++W
Sbjct: 177 KDPRWGRGQEVASESPVLVGNYALNYVQGLQGGIDSNPNDDTLQVAATCKHFAGYDMESW 236
Query: 216 NGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTI 275
R +NA +S QD+ D + F+ CV + K A MCSYN +NG+P CA L I
Sbjct: 237 KQHSRLGYNAIISDQDLADYYFPTFQSCVRDAKAAGAMCSYNAINGIPVCASEFFLGTVI 296
Query: 276 RGEWRL-NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQ 334
R + NG I SDCDS+ ++ + AAAD I+AG+D++CG + A+
Sbjct: 297 REGFDFQNGVIHSDCDSLYSIWNPHLYVQDLGAAAADGIKAGVDVNCGDTYQNNLGYALG 356
Query: 335 RGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGI 394
++E I ++ + +RLG FD P + Y DV T +LA +AA +GI
Sbjct: 357 NKTINEDQIRASVTRQYSNLIRLGYFD-SPQTNKYRTYNWSDVSTSQANQLAYQAAVEGI 415
Query: 395 VLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQ 454
LLKN G +LP + + + VAVIGP ++ T M+G+YAG +PLQG +
Sbjct: 416 TLLKNDG-TLPFNKDKVKNVAVIGPWANATTDMLGDYAGTPPYLISPLQGAQDSGFKVQY 474
Query: 455 QGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKV 514
+ + AA++A++ ADA + G+D SIE EALDR L PG Q +LVSK+
Sbjct: 475 AYGTQINTTLTTNYTAALNAAKGADAIVYFGGIDNSIENEALDRESLAWPGNQLDLVSKL 534
Query: 515 SMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKL 574
S +K P ++V G +D KN+ + +I++AGYPGQ+GGTAI D+L G P G+L
Sbjct: 535 SGLNK-PLVVVQFGAGQVDDTEIKNNNNVNSIVYAGYPGQSGGTAIWDVLNGIYAPAGRL 593
Query: 575 PMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANA 634
T YP Y +PMT+M +RP YPGRT+ +Y G VY FG+G+ YT F ++ANA
Sbjct: 594 STTQYPASYADQVPMTDMTLRPRDG--YPGRTFMWYNGEPVYEFGYGLHYTTFSVSLANA 651
Query: 635 PTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP 694
P A + +I+ I+ K+ + L V++KN G + L++S
Sbjct: 652 PPKGAP----QSFNIDQFIAAKSSQYVDTS---LITTFDVNIKNTGKVTSDYAALLYSNT 704
Query: 695 PAG---HWAPHKQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIH 750
+G H P+K LV+F+K+H + G Q + + + L D +G + + G +
Sbjct: 705 TSGPGPH--PNKILVSFDKLHQIHPGQIQTASLPVTIGSLLQ-TDTNGNKWLYPGAYTFF 761
Query: 751 I 751
+
Sbjct: 762 V 762
>gi|238492365|ref|XP_002377419.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
gi|220695913|gb|EED52255.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
Length = 775
Score = 461 bits (1187), Expect = e-127, Method: Compositional matrix adjust.
Identities = 274/734 (37%), Positives = 408/734 (55%), Gaps = 48/734 (6%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
P C SL I +RV+ L+ L+L+EK+ L+ +A RLG+ YEWWSEA HGV + PG
Sbjct: 31 PVCDTSLSIAERVDSLVKSLTLEEKILNLVDASAGSTRLGLPSYEWWSEATHGVGS-APG 89
Query: 101 TKFG---GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNV 157
+F +F ATSFP I TA+SF+ TL I V+ E R N G +G +W+PN+
Sbjct: 90 VQFTSKPANFSYATSFPAPILTAASFDDTLIRKIAEVIGREGRVFGNNGFSGFDFWAPNI 149
Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
N FRDPRWGRGQETPGEDP+++ Y ++V GLQG D +V A+CKH+ YDL+
Sbjct: 150 NGFRDPRWGRGQETPGEDPLVAQNYIRNFVPGLQGDDPKNKQVIATCKHYAVYDLE---- 205
Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
R+ N ++QD+ + F PF+ CV + V S+MCSYN V+G+P CA+ +L +R
Sbjct: 206 TGRYGNNYNPTQQDLSEYFLAPFKTCVRDTDVGSIMCSYNSVSGIPACANEYLLDEVLRK 265
Query: 278 EWRLNG---YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAV 333
W N Y+VSDC +V + +FT T E AA+ A+ AG+DL+CG +L L+ A
Sbjct: 266 HWNFNSDYHYVVSDCGAVTDIWQYHNFTDTEEAAASVALNAGVDLECGSSYLKLNESLAA 325
Query: 334 QRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQG 393
+ + +D +L + +G FDG Y L DV TPD Q LA EAA +G
Sbjct: 326 NQTSVKVMD--QSLARLYSALFTVGFFDG----GKYDKLDFSDVSTPDAQALAYEAAVEG 379
Query: 394 IVLLKNQGPSLPL-SHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTI 452
+ LLKN LPL S ++++VAVIGP ++ T M G+Y+G A +PL+ G +
Sbjct: 380 MTLLKNDD-LLPLDSPHKYKSVAVIGPFANATTQMQGDYSGDAPYLISPLEAFGDSRWKV 438
Query: 453 HQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVS 512
+ + + F A+ A+ ++D I + G+D S+E+E LDR L PG Q +L++
Sbjct: 439 NYALGTAINNQNTSGFEEALAAANKSDLIIYLGGIDNSLESETLDRTSLAWPGNQLDLIT 498
Query: 513 KVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGG 572
+S SK P ++V GG +D + + I A++WAGYP Q+GGTA+ D+L G +P G
Sbjct: 499 SLSKLSK-PLVVVQFGGGQVDDSAILKNKDIQALVWAGYPSQSGGTALLDVLVGKRSPAG 557
Query: 573 KLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNF----- 627
+LP+T YP Y + + ++ +RP+ YPGRTY++Y G V PFG+G+ YT F
Sbjct: 558 RLPVTQYPASYADQVNIFDINLRPTD--LYPGRTYKWYTGKPVLPFGYGLHYTKFMFDWE 615
Query: 628 --VHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGA 685
++ N +VA + G IN N V+ VKNVG K
Sbjct: 616 KTLNREYNIQDLVASCRNSSGGPIND--------------NTPLTTVKARVKNVGHKTSD 661
Query: 686 H-TLLVFSTPPAGHWA-PHKQLVAFEK-VHVPAGAQQRVGINIHVCKYLSVVDRSGTRRI 742
+ +LL S+ AG P+K LV++ + +++ G+ Q + + + L+ D +G+ I
Sbjct: 662 YVSLLFLSSKNAGPAPRPNKSLVSYVRLLNIARGSDQVAELPLTLGS-LARADENGSLVI 720
Query: 743 PLGEHNIHIGGTKH 756
G + I + ++
Sbjct: 721 FPGRYKIALDNSEE 734
>gi|403412992|emb|CCL99692.1| predicted protein [Fibroporia radiculosa]
Length = 760
Score = 457 bits (1177), Expect = e-126, Method: Compositional matrix adjust.
Identities = 270/716 (37%), Positives = 403/716 (56%), Gaps = 27/716 (3%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C S R LIG +L+EK+ + + VPRLG+ Y+WW EALHGV+ PG
Sbjct: 34 CDTSASPVARATALIGLFTLEEKINNTGNTSPGVPRLGLPAYQWWQEALHGVAE-SPGVI 92
Query: 103 FG--GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIF 160
F G++ ATSFPQ I ++F+ L + +VS EARA N +GL +W+PN+N F
Sbjct: 93 FAETGEYSYATSFPQPILMGAAFDDELINQVATIVSTEARAFNNANRSGLDFWTPNINPF 152
Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVD 219
+DPRWGRGQETPGEDP Y + + GLQG D + ++ A+CKH+ YDL+NW G
Sbjct: 153 KDPRWGRGQETPGEDPFHLQSYVYNLITGLQGGLDPEYKRIVATCKHYAGYDLENWEGNV 212
Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
R+ F+A +S QD+ + + F C + V + MCSYN VNGVP+CA+ +L+ +RG W
Sbjct: 213 RYGFDALISIQDLSEFYTRSFETCARDANVGAFMCSYNAVNGVPSCANSYLLQDILRGHW 272
Query: 280 RL---NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRG 336
+ +I SDCD++ Y+ ++ T E ADA+ AG DLDCG + + +A G
Sbjct: 273 NWTSDDQWITSDCDAIQNIYEPHYYAPTRELTVADALNAGADLDCGTYYPENLGAAYDEG 332
Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
L +E ++ AL+ ++LG FD +QPY +G +V TP+ +ELA AA +GI L
Sbjct: 333 LFAESTLDRALIRQYASLVKLGYFD-PAENQPYRQIGWANVSTPEAEELAYRAAVEGITL 391
Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQG 456
+KN G +LPLS +++A+IGP ++ T M GNY G +PL T++
Sbjct: 392 IKNDG-TLPLSP-SIKSLALIGPWANATTQMQGNYYGQPPYLISPLMAAEALNYTVYYSP 449
Query: 457 CKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSM 516
V F AA A++ ADA I + G+D ++EAEA+DR L PG Q + + ++S
Sbjct: 450 GPGVDDPTTSSFPAAFAAAQAADAIIYIGGIDTTVEAEAMDRYTLDWPGVQPDFIDQLSQ 509
Query: 517 ASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPM 576
K P +++ M GG +D + + + A+IW GYPGQ+GGTA+ DI+ G + P G+LP
Sbjct: 510 FGK-PLVVLQMGGGQVDDSCLLPNTNVNALIWGGYPGQSGGTALMDIIVGNAAPAGRLPT 568
Query: 577 TWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
T YP +Y+ + MT+M++RPS + PGRTY +Y G + FG G+ YTNF ++ P+
Sbjct: 569 TQYPLDYVYQVAMTDMSLRPSATN--PGRTYMWYTGTPIVEFGFGLHYTNFSAELSQ-PS 625
Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSK-DGAHTLLVFSTPP 695
+ + A++ G V H V+V N+GSK + L+F
Sbjct: 626 APSYDI--------ASLVGACEGVAHLDLCAFE-SYTVNVTNIGSKVTSDYVALLFVAGE 676
Query: 696 AGHWA-PHKQLVAFEKVHVPAG-AQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNI 749
G P+K L A++++H A + Q+ +N+ + LS VD G R + GE+ +
Sbjct: 677 HGPAPIPNKVLAAYDRLHTIAPLSSQQATLNLTLGS-LSRVDEYGNRVLYPGEYTL 731
>gi|189203341|ref|XP_001938006.1| beta-xylosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
gi|187985105|gb|EDU50593.1| beta-xylosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
Length = 761
Score = 457 bits (1177), Expect = e-126, Method: Compositional matrix adjust.
Identities = 270/722 (37%), Positives = 399/722 (55%), Gaps = 38/722 (5%)
Query: 45 VSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFG 104
S P R L+ +L+EK+ SGA VPRLG+ Y+WWSE LHG++ GP T F
Sbjct: 3 TSRPPLARAQSLVALYTLEEKINATSSGAPGVPRLGVPPYQWWSEGLHGIA--GPYTNFS 60
Query: 105 --GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRD 162
G++ +TSFPQ I ++F+ L + +V+S EARA N GL +W+PN+N FRD
Sbjct: 61 DSGEWSYSTSFPQPILMGAAFDDDLITDVAKVISTEARAFNNANRTGLDFWTPNINPFRD 120
Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLK-VAASCKHFTAYDLDNWNGVDRF 221
PRWGRGQETPGED Y + + GLQG D K V A+CKHF YD+++WNG R+
Sbjct: 121 PRWGRGQETPGEDAYHLSSYVQALIHGLQGESTDPYKRVVATCKHFAGYDVEDWNGNLRY 180
Query: 222 HFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRL 281
+ ++++Q++ + + PF+ CV + V + MCSYN VNG P CADP +L+ +R W
Sbjct: 181 QNDVQITQQELVEYYLAPFQACV-QANVGAFMCSYNAVNGAPPCADPYLLQTILREHWGW 239
Query: 282 NG---YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLL 338
++ DCD+V Y ++ T AAAD++ AG D+ CG ++ H +A Q+ LL
Sbjct: 240 TNEEQWVTGDCDAVQNVYLPHQWSPTRAGAAADSLVAGTDVTCGTYMQEHLPAAFQQKLL 299
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
+E ++ AL+ + +RLG FD +QPY LG V T Q LA AA +GIVLLK
Sbjct: 300 NESSLDQALIRQYSSLVRLGYFDAS-ENQPYRQLGFDAVATNASQALARRAAAEGIVLLK 358
Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCK 458
N G +LPLS TV + G ++ T ++GNYAG+A +PL + + I+ G
Sbjct: 359 NDG-TLPLSLDSSVTVGLFGDWANATSQLLGNYAGVATYLHSPLYALEQTGVKINYAGGN 417
Query: 459 DVACAD------DQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVS 512
D L+G A +D I V G+D S+E E DR L G Q +++
Sbjct: 418 PGGQGDPTTNRWSNLYG----AYSTSDVLIYVGGIDNSVEEEGRDRGYLTWTGAQLDVIG 473
Query: 513 KVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGG 572
+++ K P I+V+ GG ID + N+P I+AI+WAGYPGQ GG+AI DI+ G + P G
Sbjct: 474 QLADTGK-PVIVVVTGGGQIDSSPLVNNPNISAIMWAGYPGQDGGSAIIDIIGGKTAPAG 532
Query: 573 KLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVA 632
+LP T YP Y + M M +RP ++ PGRTY++Y G + FG+GM YTNF +
Sbjct: 533 RLPQTQYPANYTAAVSMMNMNLRPGENS--PGRTYKWYNGSATFEFGYGMHYTNFSAEIT 590
Query: 633 NAPTVVAVPLDGRHGSINATISG-KAIKVTHAKCNRLTLGVQV-DVKNVGSKDGAHTLLV 690
+ +I++ SG + +C ++ VQV + NV S +
Sbjct: 591 TQMQ--------QSYAISSLASGCNSTGGFLERCPFASVNVQVHNTGNVTSDYITLGYMA 642
Query: 691 FSTPPAGHWAPHKQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNI 749
+ PA H P K LV+++++H + GA +N+ + L+ VD G + + G++++
Sbjct: 643 GTFGPAPH--PRKTLVSYKRLHSIAGGATSTATLNLTLAS-LARVDEHGNKVLYPGDYSL 699
Query: 750 HI 751
I
Sbjct: 700 QI 701
>gi|336377735|gb|EGO18896.1| glycoside hydrolase family 3 protein [Serpula lacrymans var.
lacrymans S7.9]
Length = 766
Score = 457 bits (1177), Expect = e-126, Method: Compositional matrix adjust.
Identities = 273/718 (38%), Positives = 413/718 (57%), Gaps = 43/718 (5%)
Query: 9 FLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKL 68
FL L+L ++GLA+ F D + C SL R ++ ++ E +
Sbjct: 8 FLPLVL----TAGLASAYSFP-DCTNGPLAQNAICDTSLDPISRATAVVDLFTIDELINN 62
Query: 69 LISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFG--GDFPGATSFPQVITTASSFNAT 126
+S + VPRLG+ Y+WWSE LHGV++ PG F G+F ATSFPQ I ++F+
Sbjct: 63 TVSTSPGVPRLGLPPYQWWSEGLHGVAD-SPGVNFSASGEFSYATSFPQPIIMGAAFDDE 121
Query: 127 LWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASY 186
L +++G +V E R+ N G AGL +W+PN+N F+DPRWGRGQETPGEDP +Y +
Sbjct: 122 LIKSVGAIVGMEGRSFNNYGRAGLDFWTPNINPFKDPRWGRGQETPGEDPYHLAQYVYNL 181
Query: 187 VRGLQGSDGDR--LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCV 244
V+GLQG + +V ++CKHF AYDL++W+G R+ F+A V+ QD+ + + F+ C
Sbjct: 182 VQGLQGGLDPKPYYQVISTCKHFAAYDLEDWDGNYRYGFDAIVTTQDLSEYYLPSFQSCY 241
Query: 245 MEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRL--NGYIVSDCDSVGVYYDTQHFT 302
+ KV + MCSYN VNG+P+CA+ +L+ +R W + ++ SDCD+V YD ++T
Sbjct: 242 RDAKVGAAMCSYNAVNGIPSCANTYLLQSILRDFWGFAEDRWVTSDCDAVDNIYDPHNYT 301
Query: 303 STPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDG 362
TPEEA ADA++AG D+DCG F + A + L++E ++ AL+ +RLG FD
Sbjct: 302 KTPEEAVADALKAGTDIDCGTFYSEYLPGAYNQSLITETELRQALIRQYASLVRLGYFD- 360
Query: 363 EPSS-QPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNS 421
P+ QPY +V TP Q+LA +AA +GIVLLKN G +LPLS + +A+IGP
Sbjct: 361 -PTDIQPYRQYNWNNVDTPQAQQLAYQAAAEGIVLLKNDG-TLPLSS-DIKNIALIGPWG 417
Query: 422 DVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADAT 481
+ T M GNY G+A +PL G + ++ D F AAI A++ AD
Sbjct: 418 NATGEMQGNYYGVAPYLISPLMGAVATGYNVTYVFGTNITSNDTSGFAAAIAAAQGADVV 477
Query: 482 ILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDP 541
I G+D+++E+E DR + PG Q +LV +++ K P ++V GG +D K +
Sbjct: 478 IYAGGIDETVESEGNDRNYITWPGNQLDLVGELAAVGK-PLVVVQFGGGQVDDTSLKANS 536
Query: 542 RIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKR 601
+ A++WAGYPGQ+GG+A+ DI+ G P G+LP+T YP +Y+ +PMT+M +RP+ +
Sbjct: 537 TVNALLWAGYPGQSGGSALFDIISGKVAPAGRLPVTQYPADYVYEIPMTDMDLRPNATS- 595
Query: 602 YPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVT 661
PGRTY++Y G +Y FG+G+ YT F + A AP+ +T + + + +
Sbjct: 596 -PGRTYKWYTGTPIYDFGYGLHYTTFSYKWAKAPS--------------STYNIQTLVQS 640
Query: 662 HAKCNRLTLG----VQVDVKNVG--SKDGAHTLLVFST-PPAGHWAPHKQLVAFEKVH 712
+ L L V+V N G + D A L V T P+ + P+K L+ + ++H
Sbjct: 641 GNLYSYLDLAPFDTFTVNVTNTGNVTSDFASLLFVNGTYGPSPY--PNKSLITYARLH 696
>gi|238508313|ref|XP_002385353.1| beta-xylosidase, putative [Aspergillus flavus NRRL3357]
gi|296439537|sp|B8NYD8.1|BXLB_ASPFN RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
Flags: Precursor
gi|220688872|gb|EED45224.1| beta-xylosidase, putative [Aspergillus flavus NRRL3357]
Length = 776
Score = 456 bits (1174), Expect = e-125, Method: Compositional matrix adjust.
Identities = 271/697 (38%), Positives = 389/697 (55%), Gaps = 35/697 (5%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C SL R L+ ++L+EK+ ++ PRLG+ Y WW+EALHGV+ G G
Sbjct: 41 CDTSLDPVSRAKSLVAAMTLEEKINNTKYDSSGAPRLGLPAYNWWNEALHGVAE-GHGVS 99
Query: 103 FG--GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIF 160
F G+F ATSFP I ++F+ L + + V+S EARA NGG AGL YW+PN+N F
Sbjct: 100 FSDSGNFSYATSFPMPILLGAAFDDDLVKQVATVISTEARAFANGGHAGLDYWTPNINPF 159
Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDG-DRLKVAASCKHFTAYDLDNWNGVD 219
RDPRWGRGQETPGEDP+ +Y V GLQ G +R KV A+CKHF AYDL+NW G++
Sbjct: 160 RDPRWGRGQETPGEDPLHLSRYVYHLVDGLQDGIGPERPKVVATCKHFAAYDLENWEGIE 219
Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
R+ F+A VS QD+ + + F+ C + KV +VMCSYN +NG+PTCAD +L+ +R W
Sbjct: 220 RYAFDAVVSPQDLSEYYLPSFKTCTRDAKVDAVMCSYNSLNGIPTCADRWLLQTLLREHW 279
Query: 280 ---RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRG 336
+ ++ DC ++ Y H+ + AAA A+ AG DLDCG + SA+Q+G
Sbjct: 280 GWEQTGHWVTGDCGAIDNIYADHHYVADGAHAAAAALNAGTDLDCGSVFPEYLRSALQQG 339
Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
L + +NNAL+ + ++LG FD QPY +G +V TP +ELA +A +GIV+
Sbjct: 340 LYNNQTLNNALIRLYSSLVKLGYFD-PADDQPYRSIGWNEVFTPAAEELAHKATVEGIVM 398
Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQG 456
LKN G +LPL + TVA+IGP ++ T + GNY G T + +
Sbjct: 399 LKNDG-TLPLK--SNGTVAIIGPFANATTQLQGNYEGPPKYIRTLIWAAVHNGYKVKFSQ 455
Query: 457 CKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSM 516
D+ F AI A+++AD I G+D +IE E+ DR ++ PG Q +L+ ++S
Sbjct: 456 GTDINSNSSAGFAEAISAAKEADTVIYAGGIDNTIEKESQDRTTIVWPGNQLDLIEQLSD 515
Query: 517 ASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPM 576
K P I+V GG +D + + + A++WAGYP QAGG A+ DIL G S P G+LP+
Sbjct: 516 LEK-PLIVVQFGGGQVDDSSLLANAGVGALLWAGYPSQAGGAAVFDILTGKSAPAGRLPV 574
Query: 577 TWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
T YP Y+ +PMT+M +RP + PGRTYR+Y V+ PFG G+ YT F + +A
Sbjct: 575 TQYPASYVDEVPMTDMTLRPGSNN--PGRTYRWYDKAVL-PFGFGLHYTTFNVSWNHA-- 629
Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCN-RLTLGVQVDVKNVGSKDGAHTLLVFSTP- 694
+G N A T+A + L + V N G+ + L+F T
Sbjct: 630 --------EYGPYNT--DSVASGTTNAPVDTELFDTFSITVTNTGNVASDYIALLFLTAD 679
Query: 695 ---PAGHWAPHKQLVAFEKVH-VPAGAQQRVGINIHV 727
P + P K LV + + + G Q+V +++ V
Sbjct: 680 RVGPEPY--PIKTLVGYSRAKGIEPGQSQQVKLDVSV 714
>gi|336365124|gb|EGN93476.1| glycoside hydrolase family 3 protein [Serpula lacrymans var.
lacrymans S7.3]
Length = 732
Score = 456 bits (1173), Expect = e-125, Method: Compositional matrix adjust.
Identities = 271/731 (37%), Positives = 418/731 (57%), Gaps = 40/731 (5%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C SL R ++ ++ E + +S + VPRLG+ Y+WWSE LHGV++ PG
Sbjct: 22 CDTSLDPISRATAVVDLFTIDELINNTVSTSPGVPRLGLPPYQWWSEGLHGVAD-SPGVN 80
Query: 103 FG--GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIF 160
F G+F ATSFPQ I ++F+ L +++G +V E R+ N G AGL +W+PN+N F
Sbjct: 81 FSASGEFSYATSFPQPIIMGAAFDDELIKSVGAIVGMEGRSFNNYGRAGLDFWTPNINPF 140
Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR--LKVAASCKHFTAYDLDNWNGV 218
+DPRWGRGQETPGEDP +Y + V+GLQG + +V ++CKHF AYDL++W+G
Sbjct: 141 KDPRWGRGQETPGEDPYHLAQYVYNLVQGLQGGLDPKPYYQVISTCKHFAAYDLEDWDGN 200
Query: 219 DRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGE 278
R+ F+A V+ QD+ + + F+ C + KV + MCSYN VNG+P+CA+ +L+ +R
Sbjct: 201 YRYGFDAIVTTQDLSEYYLPSFQSCYRDAKVGAAMCSYNAVNGIPSCANTYLLQSILRDF 260
Query: 279 WRL--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRG 336
W + ++ SDCD+V YD ++T TPEEA ADA++AG D+DCG F + A +
Sbjct: 261 WGFAEDRWVTSDCDAVDNIYDPHNYTKTPEEAVADALKAGTDIDCGTFYSEYLPGAYNQS 320
Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSS-QPYGHLGPKDVCTPDHQELALEAARQGIV 395
L++E ++ AL+ +RLG FD P+ QPY +V TP Q+LA +AA +GIV
Sbjct: 321 LITETELRQALIRQYASLVRLGYFD--PTDIQPYRQYNWNNVDTPQAQQLAYQAAAEGIV 378
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQ 455
LLKN G +LPLS + +A+IGP + T M GNY G+A +PL G +
Sbjct: 379 LLKNDG-TLPLSS-DIKNIALIGPWGNATGEMQGNYYGVAPYLISPLMGAVATGYNVTYV 436
Query: 456 GCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVS 515
++ D F AAI A++ AD I G+D+++E+E DR + PG Q +LV +++
Sbjct: 437 FGTNITSNDTSGFAAAIAAAQGADVVIYAGGIDETVESEGNDRNYITWPGNQLDLVGELA 496
Query: 516 MASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLP 575
K P ++V GG +D K + + A++WAGYPGQ+GG+A+ DI+ G P G+LP
Sbjct: 497 AVGK-PLVVVQFGGGQVDDTSLKANSTVNALLWAGYPGQSGGSALFDIISGKVAPAGRLP 555
Query: 576 MTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAP 635
+T YP +Y+ +PMT+M +RP+ + PGRTY++Y G +Y FG+G+ YT F + A AP
Sbjct: 556 VTQYPADYVYEIPMTDMDLRPNATS--PGRTYKWYTGTPIYDFGYGLHYTTFSYKWAKAP 613
Query: 636 TVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLG----VQVDVKNVG--SKDGAHTLL 689
+ +T + + + + + L L V+V N G + D A L
Sbjct: 614 S--------------STYNIQTLVQSGNLYSYLDLAPFDTFTVNVTNTGNVTSDFASLLF 659
Query: 690 VFST-PPAGHWAPHKQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEH 747
V T P+ + P+K L+ + ++H + +G V + + + ++ D G + G +
Sbjct: 660 VNGTYGPSPY--PNKSLITYARLHDIASGDTASVALGVTLGS-IARADTYGNMWLYPGTY 716
Query: 748 NIHIGGTKHSV 758
+ + +SV
Sbjct: 717 QVTLDTLGNSV 727
>gi|121797681|sp|Q2TYT2.1|BXLB_ASPOR RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
Flags: Precursor
gi|83775471|dbj|BAE65591.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 797
Score = 456 bits (1173), Expect = e-125, Method: Compositional matrix adjust.
Identities = 270/695 (38%), Positives = 388/695 (55%), Gaps = 31/695 (4%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C SL R L+ ++L+EK+ ++ PRLG+ Y WW+EALHGV+ G G
Sbjct: 62 CDTSLDPVSRAKSLVAAMTLEEKINNTKYDSSGAPRLGLPAYNWWNEALHGVAE-GHGVS 120
Query: 103 FG--GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIF 160
F G+F ATSFP I ++F+ L + + V+S EARA NGG AGL YW+PN+N F
Sbjct: 121 FSDSGNFSYATSFPMPILLGAAFDDDLVKQVATVISTEARAFANGGHAGLDYWTPNINPF 180
Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDG-DRLKVAASCKHFTAYDLDNWNGVD 219
RDPRWGRGQETPGEDP+ +Y V GLQ G +R KV A+CKHF AYDL+NW G++
Sbjct: 181 RDPRWGRGQETPGEDPLHLSRYVYHLVDGLQDGIGPERPKVVATCKHFAAYDLENWEGIE 240
Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
R+ F+A VS QD+ + + F+ C + KV +VMCSYN +NG+PTCAD +L+ +R W
Sbjct: 241 RYAFDAVVSPQDLSEYYLPSFKTCTRDAKVDAVMCSYNSLNGIPTCADRWLLQTLLREHW 300
Query: 280 ---RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRG 336
+ ++ DC ++ Y H+ + AAA A+ AG DLDCG + SA+Q+G
Sbjct: 301 GWEQTGHWVTGDCGAIDNIYADHHYVADGAHAAAAALNAGTDLDCGSVFPEYLGSALQQG 360
Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
L + +NNAL+ + ++LG FD QPY +G +V TP +ELA +A +GIV+
Sbjct: 361 LYNNQTLNNALIRLYSSLVKLGYFD-PADDQPYRSIGWNEVFTPAAEELAHKATVEGIVM 419
Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQG 456
LKN G +LPL + TVA+IGP ++ T + GNY G T + +
Sbjct: 420 LKNDG-TLPLK--SNGTVAIIGPFANATTQLQGNYEGPPKYIRTLIWAAVHNGYKVKFSQ 476
Query: 457 CKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSM 516
D+ F AI A+++AD I G+D +IE E+ DR ++ PG Q +L+ ++S
Sbjct: 477 GTDINSNSSAGFAEAISAAKEADTVIYAGGIDNTIEKESQDRTTIVWPGNQLDLIEQLSD 536
Query: 517 ASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPM 576
K P I+V GG +D + + + A++WAGYP QAGG A+ DIL G S P G+LP+
Sbjct: 537 LEK-PLIVVQFGGGQVDDSSLLANAGVGALLWAGYPSQAGGAAVFDILTGKSAPAGRLPV 595
Query: 577 TWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
T YP Y+ +PMT+M +RP + PGRTYR+Y V+ PFG G+ YT F + +A
Sbjct: 596 TQYPASYVDEVPMTDMTLRPGSNN--PGRTYRWYDKAVL-PFGFGLHYTTFNVSWNHA-- 650
Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCN-RLTLGVQVDVKNVGSKDGAHTLLVFSTPP 695
+G N A T+A + L + V N G+ + L+F T
Sbjct: 651 --------EYGPYNT--DSVASGTTNAPVDTELFDTFSITVTNTGNVASDYIALLFLTAD 700
Query: 696 AGHWAPH--KQLVAFEKVH-VPAGAQQRVGINIHV 727
P+ K LV + + + G Q+V +++ V
Sbjct: 701 GVGPEPYPIKTLVGYSRAKGIEPGQSQQVKLDVSV 735
>gi|317158006|ref|XP_001826724.2| exo-1,4-beta-xylosidase xlnD [Aspergillus oryzae RIB40]
Length = 776
Score = 456 bits (1172), Expect = e-125, Method: Compositional matrix adjust.
Identities = 270/695 (38%), Positives = 388/695 (55%), Gaps = 31/695 (4%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C SL R L+ ++L+EK+ ++ PRLG+ Y WW+EALHGV+ G G
Sbjct: 41 CDTSLDPVSRAKSLVAAMTLEEKINNTKYDSSGAPRLGLPAYNWWNEALHGVAE-GHGVS 99
Query: 103 FG--GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIF 160
F G+F ATSFP I ++F+ L + + V+S EARA NGG AGL YW+PN+N F
Sbjct: 100 FSDSGNFSYATSFPMPILLGAAFDDDLVKQVATVISTEARAFANGGHAGLDYWTPNINPF 159
Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDG-DRLKVAASCKHFTAYDLDNWNGVD 219
RDPRWGRGQETPGEDP+ +Y V GLQ G +R KV A+CKHF AYDL+NW G++
Sbjct: 160 RDPRWGRGQETPGEDPLHLSRYVYHLVDGLQDGIGPERPKVVATCKHFAAYDLENWEGIE 219
Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
R+ F+A VS QD+ + + F+ C + KV +VMCSYN +NG+PTCAD +L+ +R W
Sbjct: 220 RYAFDAVVSPQDLSEYYLPSFKTCTRDAKVDAVMCSYNSLNGIPTCADRWLLQTLLREHW 279
Query: 280 ---RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRG 336
+ ++ DC ++ Y H+ + AAA A+ AG DLDCG + SA+Q+G
Sbjct: 280 GWEQTGHWVTGDCGAIDNIYADHHYVADGAHAAAAALNAGTDLDCGSVFPEYLGSALQQG 339
Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
L + +NNAL+ + ++LG FD QPY +G +V TP +ELA +A +GIV+
Sbjct: 340 LYNNQTLNNALIRLYSSLVKLGYFD-PADDQPYRSIGWNEVFTPAAEELAHKATVEGIVM 398
Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQG 456
LKN G +LPL + TVA+IGP ++ T + GNY G T + +
Sbjct: 399 LKNDG-TLPLK--SNGTVAIIGPFANATTQLQGNYEGPPKYIRTLIWAAVHNGYKVKFSQ 455
Query: 457 CKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSM 516
D+ F AI A+++AD I G+D +IE E+ DR ++ PG Q +L+ ++S
Sbjct: 456 GTDINSNSSAGFAEAISAAKEADTVIYAGGIDNTIEKESQDRTTIVWPGNQLDLIEQLSD 515
Query: 517 ASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPM 576
K P I+V GG +D + + + A++WAGYP QAGG A+ DIL G S P G+LP+
Sbjct: 516 LEK-PLIVVQFGGGQVDDSSLLANAGVGALLWAGYPSQAGGAAVFDILTGKSAPAGRLPV 574
Query: 577 TWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
T YP Y+ +PMT+M +RP + PGRTYR+Y V+ PFG G+ YT F + +A
Sbjct: 575 TQYPASYVDEVPMTDMTLRPGSNN--PGRTYRWYDKAVL-PFGFGLHYTTFNVSWNHA-- 629
Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCN-RLTLGVQVDVKNVGSKDGAHTLLVFSTPP 695
+G N A T+A + L + V N G+ + L+F T
Sbjct: 630 --------EYGPYNT--DSVASGTTNAPVDTELFDTFSITVTNTGNVASDYIALLFLTAD 679
Query: 696 AGHWAPH--KQLVAFEKVH-VPAGAQQRVGINIHV 727
P+ K LV + + + G Q+V +++ V
Sbjct: 680 GVGPEPYPIKTLVGYSRAKGIEPGQSQQVKLDVSV 714
>gi|302683012|ref|XP_003031187.1| glycoside hydrolase family 3 protein [Schizophyllum commune H4-8]
gi|300104879|gb|EFI96284.1| glycoside hydrolase family 3 protein [Schizophyllum commune H4-8]
Length = 752
Score = 455 bits (1171), Expect = e-125, Method: Compositional matrix adjust.
Identities = 293/753 (38%), Positives = 405/753 (53%), Gaps = 37/753 (4%)
Query: 5 IAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQE 64
+AF L I LS ++ A P D + + P C SL +R L+ ++ E
Sbjct: 1 MAFLLLTSIALSLAAGSRAQGFP---DCVNGPLASNPVCDASLGHVERARALVEEFTVPE 57
Query: 65 KVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPG-ATSFPQVITTASSF 123
+ ++ A VPRLG+ YEWW+EALHGV + PG F P ATSFP I S+F
Sbjct: 58 MINNTVNAAFGVPRLGLPPYEWWNEALHGVG-LSPGVVFFEPEPAVATSFPMPINMGSAF 116
Query: 124 NATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYA 183
+ L A+G V+S EARA N G AGL YW+PN+N F+DPRWGRG ETPGEDP+ + +Y
Sbjct: 117 DDALMLAMGDVISTEARAFSNAGRAGLDYWTPNINPFKDPRWGRGAETPGEDPLHAARYV 176
Query: 184 ASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRM 242
S V GLQG D LKVAA+CKH+ AYDL+NW GV R+ F+A V+ QD+ + + PFR
Sbjct: 177 RSLVEGLQGGIDPPSLKVAAACKHWAAYDLENWGGVTRYAFDAVVTPQDLAEYYAPPFRS 236
Query: 243 CVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRL--NGYIVSDCDSVGVYYDTQH 300
CV + + AS MCSYN VNGVP CA P +LK +R W L + ++ SDC +VG YD
Sbjct: 237 CVRDARAASAMCSYNAVNGVPACASPYLLKTVLRDAWGLAEDRWVTSDCGAVGNVYDPHG 296
Query: 301 FTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMF 360
+T A+ +++AG DL+CG + A RGL+ E D+ AL + LG F
Sbjct: 297 YTEDLVNASTVSLKAGTDLNCGTNYTQYLPEAYDRGLIDEDDLKAALTRLYASLVWLGYF 356
Query: 361 DGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPN 420
D P QPY + DV TP+ Q LA AA + VLLKN G +LPL+ ++A+IGP
Sbjct: 357 DA-PEDQPYRQITWADVNTPEAQALAYTAAIKSFVLLKNDG-TLPLTD-STLSLALIGPM 413
Query: 421 SDVT-VTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQAD 479
++ + + M+GNY GI PLQG + +V D F AA+ A+ AD
Sbjct: 414 ANASALQMLGNYFGIPPFVIAPLQGFLDAGFNVTYVLGTNVTGNDAGSFDAAVAAAEAAD 473
Query: 480 ATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKN 539
I V G+D ++E E DR + P Q L+S + K P ++V M GG +D K
Sbjct: 474 VVIYVGGIDNTLEMEEKDRTEISWPDNQLALLSALEGVGK-PLVVVQMGGGQLDDTPLKE 532
Query: 540 DPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQS 599
+ AI+WAGYPGQ+GGTAIAD + G P G+L Y+ + MT+M +RP +
Sbjct: 533 SDAVNAILWAGYPGQSGGTAIADTVTGKVAPAGRL--------YVDEVAMTDMTLRPDNA 584
Query: 600 KRYPGRTYRFYKGPVVYPFGHGMSYTNF-VHTVANAPTVVAVPLDGRHGSINATISGKAI 658
PGRTY++Y G VYP+G+G+ YTN V ++AP SI +
Sbjct: 585 TGNPGRTYKWYTGTPVYPYGYGLHYTNISVAWASDAPEACY--------SIQDLTGEASG 636
Query: 659 KVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW-APHKQLVAFEKV-HVPAG 716
V A + +V V N G L+F + AG AP K++VA+ + V G
Sbjct: 637 FVDLAPLDTF----RVTVTNEGDIASDFVALLFVSTQAGPAPAPIKEMVAYARASDVQPG 692
Query: 717 AQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNI 749
V + + + L+ D SG + G++ +
Sbjct: 693 NSTEVELEVTLGA-LARTDESGDASLYPGKYEL 724
>gi|332982588|ref|YP_004464029.1| glycoside hydrolase [Mahella australiensis 50-1 BON]
gi|332700266|gb|AEE97207.1| glycoside hydrolase family 3 domain protein [Mahella australiensis
50-1 BON]
Length = 714
Score = 455 bits (1171), Expect = e-125, Method: Compositional matrix adjust.
Identities = 276/735 (37%), Positives = 399/735 (54%), Gaps = 79/735 (10%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
+ VSL R DL+ R++L EK+ +I A A+PRL I Y WW+E LHGV+ G
Sbjct: 13 YKDVSLSFEDRAKDLVSRMTLPEKISQMIYDAPAIPRLDIPAYNWWNECLHGVARAGI-- 70
Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYW 153
AT FPQ I A++FN L + +SDEARA ++ G GLT+W
Sbjct: 71 --------ATVFPQAIAMAATFNPELIHKVAEAISDEARAKHHEAVRNGDRGIYKGLTFW 122
Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLD 213
SPN+NIFRDPRWGRG ET GEDP L+ + ++V+GLQG D LKV A+ KH+ +
Sbjct: 123 SPNINIFRDPRWGRGHETYGEDPYLTSRMGVAFVKGLQGDDPKYLKVVATPKHYAVH--- 179
Query: 214 NWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKR 273
+ R F+A+VS++D+ +T+ F CV EGK S+M +YN+ NG P CA +LK
Sbjct: 180 SGPESQRHSFDARVSQKDLRETYLPAFEECVKEGKAVSIMGAYNRTNGEPCCASKTLLKD 239
Query: 274 TIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAV 333
+R EW +GY+VSDC ++ + T T E+AA A+ G +L+CG + AV
Sbjct: 240 ILRDEWGFDGYVVSDCGAIDDIHMHHKVTKTAAESAALAVNNGCELNCGKTYEYLCQ-AV 298
Query: 334 QRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVC-TPDHQELALEAARQ 392
++GL+SE I+ A++ T +MRLGMFD P Y H+ P DV +P+H+ELALE ARQ
Sbjct: 299 EQGLISEETIDQAVIKLFTARMRLGMFD-PPEMVRYAHI-PYDVNDSPEHRELALETARQ 356
Query: 393 GIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA--- 449
IVLLKN LPLS + +T+AVIGPN+D ++ NY G Y TPL+GI
Sbjct: 357 SIVLLKNDENILPLSK-KLKTIAVIGPNADDLDVLLANYFGTPSKYVTPLEGIKNKVSPD 415
Query: 450 -RTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRA 499
+ ++ +GC+ + D F A++ + AD I+ +GL IE E DR
Sbjct: 416 TKVLYAKGCEVTGNSVDG-FDEAVNIAEMADIVIMCLGLSPRIEGEEGDVADSDGGGDRL 474
Query: 500 GLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTA 559
+ LPG Q++L+ + K P +LVL++G I + +A + AII A YPG+ GGTA
Sbjct: 475 HIDLPGMQEQLLETIYGTGK-PIVLVLLNGSAIAINWAHE--HVPAIIEAWYPGEEGGTA 531
Query: 560 IADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFG 619
IAD+LFG NP G+LP+T+ + +LP P GRTYR+++ +YPFG
Sbjct: 532 IADVLFGDYNPAGRLPITFV--RSLDDLP-------PFTDYNMKGRTYRYFEKEPLYPFG 582
Query: 620 HGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNV 679
+G+SYT+F ++ N +S + + L + VDV+N
Sbjct: 583 YGLSYTSFKYS-------------------NLRLSAMRLPAGN------NLDINVDVENT 617
Query: 680 GSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSG 738
G G + L S A P +QL + + + G +Q V + +++S+ D G
Sbjct: 618 GKLAGREVVQLYISDVEASVEVPMRQLCGIQCITLEPGQKQTVSFTVEP-QHMSLFDYDG 676
Query: 739 TRRIPLGEHNIHIGG 753
R + G+ I +GG
Sbjct: 677 KRILEPGQFIIAVGG 691
>gi|223945397|gb|ACN26782.1| unknown [Zea mays]
Length = 516
Score = 454 bits (1169), Expect = e-125, Method: Compositional matrix adjust.
Identities = 235/518 (45%), Positives = 319/518 (61%), Gaps = 10/518 (1%)
Query: 253 MCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADA 312
MCSYN+VNGVPTCAD N+L T R +W GYI SDCD+V + +D Q + T E+A AD
Sbjct: 1 MCSYNRVNGVPTCADYNLLSTTARQDWGFYGYITSDCDAVAIIHDAQGYAKTAEDAVADV 60
Query: 313 IRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHL 372
++AG+D++CG ++ H SA+Q+G ++E DIN AL N V+MRLG+F+G+P YG +
Sbjct: 61 LKAGMDVNCGSYVQDHGASALQQGKITEQDINRALHNLFAVRMRLGLFNGDPRRNLYGDI 120
Query: 373 GPKDVCTPDHQELALEAARQGIVLLKNQG--PSLPLSHIRHRTVAVIGPNSDVTVTMIGN 430
GP VCT +HQ+LALEAA+ GIVLLKN G +LPLS ++AVIG N++ + + GN
Sbjct: 121 GPDQVCTQEHQDLALEAAQDGIVLLKNDGGAGALPLSKPNVASLAVIGFNANDAIRLRGN 180
Query: 431 YAGIACGYTTPLQGIGRYAR-TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQ 489
Y G C TPLQ + Y + T GC AC + A+ A+ AD+ +L MGLDQ
Sbjct: 181 YFGPPCVTVTPLQVLQGYVKDTSFVAGCNSAAC-NVTTIPEAVQAASSADSVVLFMGLDQ 239
Query: 490 SIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWA 549
E E +DR L LPG+QQ L+ V+ A+K P ILVL+ GGP+DV+FAK +P+I AI+WA
Sbjct: 240 DQEREEVDRLDLTLPGQQQTLIESVANAAKKPVILVLLCGGPVDVSFAKTNPKIGAILWA 299
Query: 550 GYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRF 609
GYPG+AGG AIA +LFG NPGG+LP+TWYPQ++ T +PMT+M MR + YPGRTYRF
Sbjct: 300 GYPGEAGGIAIAQVLFGEHNPGGRLPVTWYPQDF-TRVPMTDMRMRADPATGYPGRTYRF 358
Query: 610 YKGPVVYPFGHGMSYTNFVHTVANA--PTVVAVPLDGRHGSINATISGKAIKVTHAKCNR 667
Y+GP V+ FG+G+SY+ + H A PT L + S + C+R
Sbjct: 359 YRGPTVFNFGYGLSYSKYSHRFATKPPPTSNVAGLKAVEATAGGMASYDVEAIGSETCDR 418
Query: 668 LTLGVQVDVKNVGSKDGAHTLLVFSTPPA---GHWAPHKQLVAFEKVHVPAGAQQRVGIN 724
L V V+N G DG H++LVF P G P QL+ F+ +H+ A V
Sbjct: 419 LKFPAVVRVQNHGPMDGKHSVLVFMRWPNATDGSGRPASQLIGFQSLHLRATQTAHVEFE 478
Query: 725 IHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHA 762
+ CK+ S G + I G H + +G + +S A
Sbjct: 479 VSPCKHFSRATEDGRKVIDQGSHFVMVGEDEFEMSFMA 516
>gi|391864313|gb|EIT73609.1| beta-glucosidase-related glycosidase [Aspergillus oryzae 3.042]
Length = 797
Score = 453 bits (1165), Expect = e-124, Method: Compositional matrix adjust.
Identities = 269/695 (38%), Positives = 387/695 (55%), Gaps = 31/695 (4%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C SL R L+ ++L+EK+ ++ PRLG+ Y WW+EALHGV+ G G
Sbjct: 62 CDTSLDPVSRAKSLVAAMTLEEKINNTKYDSSGAPRLGLPAYNWWNEALHGVAE-GHGVS 120
Query: 103 FG--GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIF 160
F G+F ATSFP I ++F+ L + + V+S EARA NGG AGL YW+PN+N F
Sbjct: 121 FSDSGNFSYATSFPMPILLGAAFDDDLVKQVATVISTEARAFANGGHAGLDYWTPNINPF 180
Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDG-DRLKVAASCKHFTAYDLDNWNGVD 219
RDPRWGRGQETPGEDP+ +Y V GLQ G +R KV A+CKHF AYDL+NW G++
Sbjct: 181 RDPRWGRGQETPGEDPLHLSRYVYHLVDGLQDGIGPERPKVVATCKHFAAYDLENWEGIE 240
Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
R+ F+A VS QD+ + + F+ C + KV +VMCSYN +NG+PTCAD +L+ +R W
Sbjct: 241 RYAFDAVVSPQDLSEYYLPSFKTCTRDAKVDAVMCSYNSLNGIPTCADRWLLQTLLREHW 300
Query: 280 ---RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRG 336
+ ++ DC ++ Y H+ + AAA A+ AG DLDCG + SA+Q+G
Sbjct: 301 GWEQTGHWVTGDCGAIDNIYADHHYVADGAHAAAAALNAGTDLDCGSVFPEYLGSALQQG 360
Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
L + + NAL+ + ++LG FD QPY +G +V TP +ELA +A +GIV+
Sbjct: 361 LYNNQTLYNALIRLYSSLVKLGYFD-PADDQPYRSIGWNEVFTPAAEELAHKATVEGIVM 419
Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQG 456
LKN G +LPL + TVA+IGP ++ T + GNY G T + +
Sbjct: 420 LKNDG-TLPLK--SNGTVAIIGPFANATTQLQGNYEGPPKYIRTLIWAAVHNGYKVKFSQ 476
Query: 457 CKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSM 516
D+ F AI A+++AD I G+D +IE E+ DR ++ PG Q +L+ ++S
Sbjct: 477 GTDINSNSSAGFAEAISAAKEADTVIYAGGIDNTIEKESQDRTTIVWPGNQLDLIEQLSD 536
Query: 517 ASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPM 576
K P I+V GG +D + + + A++WAGYP QAGG A+ DIL G S P G+LP+
Sbjct: 537 LEK-PLIVVQFGGGQVDDSSLLANAGVGALLWAGYPSQAGGAAVFDILTGKSAPAGRLPV 595
Query: 577 TWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
T YP Y+ +PMT+M +RP + PGRTYR+Y V+ PFG G+ YT F + +A
Sbjct: 596 TQYPASYVDEVPMTDMTLRPGSNN--PGRTYRWYDKAVL-PFGFGLHYTTFNVSWNHA-- 650
Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCN-RLTLGVQVDVKNVGSKDGAHTLLVFSTPP 695
+G N A T+A + L + V N G+ + L+F T
Sbjct: 651 --------EYGPYNT--DSVASGTTNAPVDTELFDTFSITVTNTGNVASDYIALLFLTAD 700
Query: 696 AGHWAPH--KQLVAFEKVH-VPAGAQQRVGINIHV 727
P+ K LV + + + G Q+V +++ V
Sbjct: 701 GVGPEPYPIKTLVGYSRAKGIEPGQSQQVKLDVSV 735
>gi|442803736|ref|YP_007371885.1| beta-xylosidase BxlB [Clostridium stercorarium subsp. stercorarium
DSM 8532]
gi|442739586|gb|AGC67275.1| beta-xylosidase BxlB [Clostridium stercorarium subsp. stercorarium
DSM 8532]
Length = 715
Score = 452 bits (1162), Expect = e-124, Method: Compositional matrix adjust.
Identities = 278/743 (37%), Positives = 397/743 (53%), Gaps = 92/743 (12%)
Query: 46 SLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGG 105
S +R DL+ R++++EKV ++ + A+ RLGI Y WW+EALHGV+ G
Sbjct: 11 SYSFEERAKDLVSRMTIEEKVSQMLYNSPAIERLGIPAYNWWNEALHGVARAGT------ 64
Query: 106 DFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNV 157
AT FPQ I A++F+ L + V+S E RA Y+ + GLT+WSPN+
Sbjct: 65 ----ATMFPQAIGMAATFDEELIYKVADVISTEGRAKYHASSKKGDRGIYKGLTFWSPNI 120
Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
NIFRDPRWGRGQET GEDP L+ + ++V+GLQG+ LK AA KHF + +G
Sbjct: 121 NIFRDPRWGRGQETYGEDPYLTARLGVAFVKGLQGNHPKYLKAAACAKHFAVH-----SG 175
Query: 218 VD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTI 275
+ R FNA VSK+D+ +T+ F+ V E KV SVM +YN+ NG P C +L +
Sbjct: 176 PESLRHEFNAVVSKKDLYETYLPAFKALVQEAKVESVMGAYNRTNGEPCCGSKTLLSDIL 235
Query: 276 RGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQR 335
RGEW G++VSDC ++ ++ H T+T E+AA A+R G DL+CG G + A++
Sbjct: 236 RGEWGFKGHVVSDCWAIRDFHMHHHVTATAPESAALAVRNGCDLNCGNMFG-NLLIALKE 294
Query: 336 GLLSEIDINNALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAARQGI 394
GL++E +I+ A+ + +M+LGMFD P Q PY + V +H+ELAL+ A++ I
Sbjct: 295 GLITEEEIDRAVTRLMITRMKLGMFD--PEDQVPYASISYDFVDCKEHRELALDVAKKSI 352
Query: 395 VLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA----R 450
VLLKN G LPL + R++AVIGPN+D +IGNY G A Y T L GI A R
Sbjct: 353 VLLKNDG-LLPLDRKKIRSIAVIGPNADSRQALIGNYEGTASEYVTVLDGIREMAGDDVR 411
Query: 451 TIHQQGC-------KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL------- 496
+ GC +++ D++ A+ + AD I+ +GLD +IE E +
Sbjct: 412 IYYSVGCHLYKDRVENLGEPGDRI-AEAVTCAEHADVVIMCLGLDSTIEGEEMHESNIYG 470
Query: 497 --DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQ 554
D+ L LPG+QQEL+ V K P +LVL++G + V +A D I AI+ A YPG
Sbjct: 471 SGDKPDLNLPGQQQELLEAVYATGK-PIVLVLLTGSALAVTWA--DEHIPAILNAWYPGA 527
Query: 555 AGGTAIADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGP 613
GG AIA +LFG +NP GKLP+T+Y LP T+ +M RTYRF K
Sbjct: 528 LGGRAIASVLFGETNPSGKLPVTFY--RTTEELPDFTDYSME--------NRTYRFMKNE 577
Query: 614 VVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQ 673
+YPFG G+SYT F ++ + +S I+ V
Sbjct: 578 ALYPFGFGLSYTTFDYS-------------------DLKLSKDTIRAGEG------FNVS 612
Query: 674 VDVKNVGSKDGAHTLLVFSTPPAGHW-APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLS 732
V V N G G + V+ W P+ QL ++V + +G + I + L+
Sbjct: 613 VKVTNTGKMAGEEVVQVYIKDLEASWRVPNWQLSGMKRVRLESGETAEITFEIR-PEQLA 671
Query: 733 VVDRSGTRRIPLGEHNIHIGGTK 755
VV G I GE I++GG++
Sbjct: 672 VVTDEGKSVIEPGEFEIYVGGSQ 694
>gi|291167620|dbj|BAI82526.1| 1,4-beta-D-xylosidase [Aureobasidium pullulans var. melanogenum]
Length = 805
Score = 451 bits (1161), Expect = e-124, Method: Compositional matrix adjust.
Identities = 267/718 (37%), Positives = 404/718 (56%), Gaps = 27/718 (3%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C S R L+ ++ EK+ L + + VPRLG+ Y+WW EALHGV++ PG
Sbjct: 44 CDKSADPVARAKALVAAFTVAEKLNLTGNNSPGVPRLGLPVYQWWQEALHGVAS-SPGVT 102
Query: 103 FG--GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIF 160
F G F ATSFPQ I ++F+ L +++ VVS EARA N G AGL +W+PN+N +
Sbjct: 103 FNATGQFDSATSFPQPILMGAAFDDALIQSVAEVVSTEARAFNNYGRAGLDFWTPNINPY 162
Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDLDNWNGVD 219
RDPRWGRGQETPGEDP Y S + GLQG D + K+ A+CKHF YD+++WNG
Sbjct: 163 RDPRWGRGQETPGEDPYHLSSYVHSLIMGLQGGEDPEIRKITATCKHFAGYDIESWNGNL 222
Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
R+ + ++ ++D+ + + FR C + V + MC+Y+ +NGVPTCADP +L +R W
Sbjct: 223 RYQNDVQIPQRDLVEYYLPSFRSCARDSNVGAFMCTYSALNGVPTCADPWLLNDVLREHW 282
Query: 280 RLNG---YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRG 336
++ SDCDS+ + +F+ T + AAA A+ AG DLDCG + H A +G
Sbjct: 283 GWTNEEQWVTSDCDSIQNIFLPHNFSDTRQGAAAAALNAGTDLDCGTYYQHHLPLAYSQG 342
Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
L+++ ++ ALV T +R G FDG P++ Y +L DV T Q+LAL+AA +G+VL
Sbjct: 343 LINQTTVDQALVRLYTSLVRTGYFDG-PNAM-YRNLTWSDVGTTHAQQLALQAAEEGMVL 400
Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY-ARTIHQQ 455
LKN G LPLS +A+IG ++ T M GNY G+ +PL + A+ + Q
Sbjct: 401 LKNDG-LLPLSISNGTKIALIGSWANATTQMQGNYYGVPTYLHSPLYAAQQTGAQVFYAQ 459
Query: 456 GCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVS 515
G + A+ +AD I + G+D S+EAE +DR + G Q +++ +++
Sbjct: 460 GPGGQGDPTTDHWLPVWTAAEKADIIIYIGGVDISVEAEGMDREDINWTGAQLDIIGELA 519
Query: 516 MASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLP 575
M K P +L M G +D N+ I+A+IW GYPGQ GG A+ +I+ G + P G+LP
Sbjct: 520 MYGK-PMVLAQM-GDQLDNTPIVNNANISALIWGGYPGQDGGVALFNIITGKTAPAGRLP 577
Query: 576 MTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAP 635
+T YP YI ++PMT+M +RP+ + PGRTY++Y G V+ FG+GM YT F ++
Sbjct: 578 VTQYPAHYIADIPMTDMTLRPNATTGSPGRTYKWYNGTAVFEFGYGMHYTKFSADIS--- 634
Query: 636 TVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQV-DVKNVGSKDGAHTLLVFSTP 694
P+ I++ +SG + +C ++ V V + NV S A +
Sbjct: 635 -----PMSKSSYDISSLLSG-CNETYKDRCAFESISVNVHNTGNVTSDYAALGFIAGQFG 688
Query: 695 PAGHWAPHKQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHI 751
P+ + P K LV ++++H + G+ Q +N+ + LS VD G + G++ + I
Sbjct: 689 PSPY--PKKSLVNYQRLHNIAGGSSQTATLNLTLGS-LSRVDDHGNTYLYPGDYALMI 743
>gi|242813865|ref|XP_002486253.1| beta-xylosidase, putative [Talaromyces stipitatus ATCC 10500]
gi|218714592|gb|EED14015.1| beta-xylosidase, putative [Talaromyces stipitatus ATCC 10500]
Length = 893
Score = 451 bits (1161), Expect = e-124, Method: Compositional matrix adjust.
Identities = 276/698 (39%), Positives = 391/698 (56%), Gaps = 30/698 (4%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
P C SL R L+ ++ +EKV+ +G+ RLG+ Y+WW+EALHGV+ PG
Sbjct: 163 PICDTSLDPLTRAKGLVDAMTFEEKVQNTQNGSPGAARLGLPAYQWWNEALHGVAG-SPG 221
Query: 101 TKF--GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVN 158
F G+F ATSFPQ I +++F+ L + +G VVS E RA N G AGL +W+PN+N
Sbjct: 222 VTFQPSGNFSYATSFPQPILMSAAFDDALIKEVGTVVSIEGRAFNNYGNAGLDFWTPNIN 281
Query: 159 IFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQ-GSDGDRLKVAASCKHFTAYDLDNWNG 217
FRDPRWGRGQETPGEDP +Y + V GLQ G +V A+CKHF YD+++W G
Sbjct: 282 PFRDPRWGRGQETPGEDPYHIARYVYNLVDGLQNGIAPANPRVVATCKHFAGYDIEDWEG 341
Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
R+ FNA +S QD+ + + PF+ C + +V ++MCSYN VNG+PTCAD +L +R
Sbjct: 342 NSRYGFNAIISTQDLSEYYLPPFKSCARDAQVDAIMCSYNAVNGIPTCADSYLLDTILRD 401
Query: 278 EWRLNG---YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQ 334
W N ++ SDCD+V Y +TS+ AAADA+ AG +LDCG + + +A
Sbjct: 402 HWNWNQTGHWVTSDCDAVDNIYSDHRYTSSLAAAAADALNAGTNLDCGTTMSNNLAAAAA 461
Query: 335 RGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGI 394
+ L +N+ALV + +RLG FD E S Y LG DV T Q+LA AA +GI
Sbjct: 462 QDLFKNATLNSALVYLYSSLVRLGWFDSEDSQ--YSSLGWSDVGTTASQQLANRAAVEGI 519
Query: 395 VLLKNQGPS-LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIH 453
VLLKN LPLS +T+A+IGP ++ T + GNY G T + G + T+
Sbjct: 520 VLLKNDHKKVLPLSQ-HGQTIALIGPYANATTQLQGNYYGTPAYIRTLVWGAEQMGYTVQ 578
Query: 454 QQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSK 513
+ + D F AA+ A++ AD I G+D SIEAEA+DR + G Q +L+ +
Sbjct: 579 YEAGTGINSTDTSGFAAAVAAAKTADIVIYAGGIDNSIEAEAMDRNTIAWTGNQLQLIDQ 638
Query: 514 VSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGK 573
+S K P +++ GG +D + + + A++W GYP Q GG A+ DIL G S P G+
Sbjct: 639 LSQVGK-PLVVLQFGGGQLDDSALLQNENVNALLWCGYPSQTGGQAVFDILTGQSAPAGR 697
Query: 574 LPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVAN 633
LP+T YP Y +PMT+M++RP+ S PGRTYR+Y V+ PFG G+ YT F + A
Sbjct: 698 LPVTQYPANYTNAIPMTDMSLRPNGST--PGRTYRWYDDAVI-PFGFGLHYTTFDASWA- 753
Query: 634 APTVVAVPLDGRHGSIN-ATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF- 691
D + G N A++ KA K + V+VKN G L+F
Sbjct: 754 ---------DKKFGPYNTASLVAKASKSKYQDTAPFD-SFHVNVKNTGKVTSDFVALLFA 803
Query: 692 STPPAG-HWAPHKQLVAFEKV-HVPAGAQQRVGINIHV 727
ST AG P K L+++ + + G + V I++ +
Sbjct: 804 STDNAGPKPYPIKTLISYARASSIKPGETRTVSIDVTI 841
>gi|378730020|gb|EHY56479.1| beta-glucosidase, variant [Exophiala dermatitidis NIH/UT8656]
gi|378730021|gb|EHY56480.1| beta-glucosidase [Exophiala dermatitidis NIH/UT8656]
Length = 783
Score = 451 bits (1159), Expect = e-124, Method: Compositional matrix adjust.
Identities = 283/734 (38%), Positives = 413/734 (56%), Gaps = 39/734 (5%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C + + R L+ L+ +EK L + + VPRLG+ Y+WW EALHGV++ PG
Sbjct: 35 CNTNASVADRAKALVAALTNEEKFNLTGNTSPGVPRLGLYSYQWWQEALHGVAS-SPGVN 93
Query: 103 FG--GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIF 160
F GDF ATSFPQ I +++F+ L A+ VVS EARA N +GL +W+PN+N +
Sbjct: 94 FSTSGDFSHATSFPQPILMSAAFDDALINAVATVVSTEARAFNNVNRSGLDFWTPNINPY 153
Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLK-VAASCKHFTAYDLDNWNGVD 219
+DPRWGRGQETPGED Y A+ + GLQG +K V A+CKHF AYDL++W D
Sbjct: 154 KDPRWGRGQETPGEDTFHLKSYVAALIDGLQGGLNPPIKKVIATCKHFVAYDLEDWITTD 213
Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
R++F+A VS QD+ + + PF+ C + +V S+MCSYN +NGVPTCADP IL+ +R W
Sbjct: 214 RYNFDAIVSTQDLAEYYMQPFQTCARDARVGSIMCSYNAMNGVPTCADPYILQTVLREHW 273
Query: 280 RLNG---YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRG 336
Y+ SDCD++ Y ++ T E+A ADA+ AG DL+CG + H +A G
Sbjct: 274 NWTDDGQYVTSDCDAIQNIYAPHYYEPTREQAVADALTAGTDLNCGTYYQTHLPAAFSEG 333
Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
L ++ I+ + + ++LG FD PS+ PY L DV TP + LAL+AA +GIVL
Sbjct: 334 LFNQTVIDQTITRLYSALIKLGYFD-PPSATPYRSLNWSDVSTPAAEALALKAAEEGIVL 392
Query: 397 LKNQGPSLPLSH--IRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYAR-TIH 453
LKN G LPLS ++ TVA+IG ++ T TM GNY GIA +PL + +
Sbjct: 393 LKNDG-LLPLSFPTDKNTTVAIIGGWANATTTMQGNYFGIAPYLHSPLYALQQLPNINAV 451
Query: 454 QQGCKDVACAD--DQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELV 511
G V D D+L GAA +AD I+ GL S E+E+ DR + +++
Sbjct: 452 YGGGFGVPTTDGWDELLGAA----GEADLIIIADGLTTSDESESNDRYTIGWQPAAIDII 507
Query: 512 SKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPG 571
+++S K PT+ + M G +D N+P I+A+IW GYPG AGG A+ +IL G + P
Sbjct: 508 NQLSGMGK-PTVFLQM-GDQLDNTPLLNNPNISALIWGGYPGMAGGDALINILTGKAAPA 565
Query: 572 GKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTV 631
G+LP+T YP +Y+ + MT+M +RP+ + PGRTY++Y V+ PFG+G+ YTNF
Sbjct: 566 GRLPVTQYPADYVNQVNMTDMELRPNATSGNPGRTYKWYNNAVL-PFGYGLHYTNF-SVA 623
Query: 632 ANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNR-----LTL----GVQVDVKNVGSK 682
A+A S + I + C+R L L V+V N GSK
Sbjct: 624 ASAQGQAQTQSGPSSNSSQGQGTSYNISSLVSSCDRSQYAYLDLCPFESFNVNVTNTGSK 683
Query: 683 DGAHTLLVFSTPPAGHWAPH----KQLVAFEKV-HVPAGAQQRVGINIHVCKYLSVVDRS 737
+ + + +G + P KQLVA++++ ++ AGA +N+ + L+ D +
Sbjct: 684 LASDFVALGFI--SGSYGPQPYPIKQLVAYQRLFNISAGASATATLNLTLGS-LARHDEN 740
Query: 738 GTRRIPLGEHNIHI 751
G + G++ + I
Sbjct: 741 GNAVLYPGDYGLLI 754
>gi|297039776|gb|ADH95739.1| beta-xylosidase [Aspergillus fumigatus]
Length = 771
Score = 450 bits (1157), Expect = e-123, Method: Compositional matrix adjust.
Identities = 257/594 (43%), Positives = 351/594 (59%), Gaps = 15/594 (2%)
Query: 40 LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGP 99
L C SL + R L+ ++ +EKV + VPRLG+ Y WWSEALHGV+ P
Sbjct: 40 LAVCDTSLDVTTRAQSLVNAMTFEEKVNNTQYNSPGVPRLGLPAYNWWSEALHGVAG-SP 98
Query: 100 GTKFG--GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNV 157
G +F G F ATSFPQ I ++F+ L + + VVS E RA N G +GL +W+PN+
Sbjct: 99 GVEFADSGPFSYATSFPQPILLGATFDDDLIKQVATVVSTEGRAFGNAGRSGLDFWTPNI 158
Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD-RLKVAASCKHFTAYDLDNWN 216
N FRD RWGRGQETPGEDP+ +Y V GLQ G KV A+CKHF AY L++WN
Sbjct: 159 NPFRDARWGRGQETPGEDPLHVSRYVYHLVDGLQNGIGPANPKVVATCKHFAAYGLEDWN 218
Query: 217 GVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIR 276
GV R FNA+VS QD+ + + PF+ C + +V +VMCSYN +NGVP CAD +L+ +R
Sbjct: 219 GVVRHSFNAEVSTQDLSEFYLPPFKSCARDARVDAVMCSYNALNGVPACADSYLLQTILR 278
Query: 277 GEWRLNG---YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAV 333
W+ + +I SDC ++ Y+ +FT+TP EAAA A+ AG DLDCG + A
Sbjct: 279 EHWKWDEPGRWITSDCGAIDDIYNGHNFTTTPAEAAATALNAGTDLDCGTVFPKYLGQAA 338
Query: 334 QRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQG 393
GL S ++ ALV + ++LG FD QPY +G DV TP + LA +AA +G
Sbjct: 339 DEGLYSNQTLDRALVRLYSSFVKLGYFD-PAEDQPYRSIGWTDVDTPAVEALAHKAAGEG 397
Query: 394 IVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIH 453
IVLLKN +LPL T+A+IGP ++ T M GNY G A T L + +
Sbjct: 398 IVLLKNDK-TLPLK--AKGTLALIGPYANATKQMQGNYEGPAKYIRTLLWAATQAGYDVK 454
Query: 454 QQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSK 513
+ F AA+ A++QAD + G+D +IEAE DR + PG Q L+ +
Sbjct: 455 YAAGTAINTNSTAGFDAALSAAKQADVVVYAGGIDNTIEAEGRDRTTIAWPGNQVNLIDQ 514
Query: 514 VSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGK 573
+S K P ++V GG +D + ++PR+ A++WAGYP Q GG+AI DIL G + P G+
Sbjct: 515 LSKIGK-PLVVVQFGGGQVDDSSLLSNPRVNALLWAGYPSQEGGSAIFDILTGKTAPAGR 573
Query: 574 LPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNF 627
LP+T YP +Y+ +PMT+MA+RP + PGRTYR+Y V+ PFG G+ YT F
Sbjct: 574 LPVTQYPADYVNQVPMTDMALRPGSNT--PGRTYRWYDKAVL-PFGFGLHYTTF 624
>gi|296439595|sp|A1CCL9.2|BXLB_ASPCL RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
Flags: Precursor
Length = 771
Score = 447 bits (1151), Expect = e-123, Method: Compositional matrix adjust.
Identities = 287/697 (41%), Positives = 389/697 (55%), Gaps = 32/697 (4%)
Query: 40 LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGP 99
L C S + R L+ +S EKV A VPRLG+ Y WWSEALHGV+ P
Sbjct: 40 LAVCDTSRDVTTRAQSLVDAMSFAEKVNNTQYEAPGVPRLGLPAYNWWSEALHGVAGA-P 98
Query: 100 GTKFG--GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNV 157
G F G F ATSF Q I +SF+ L + + VV E RA N G AGL YW+PN+
Sbjct: 99 GVHFADSGPFSYATSFAQPILLGASFDDELVKQVATVVGTEGRAFGNAGRAGLDYWTPNI 158
Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD-RLKVAASCKHFTAYDLDNWN 216
N FRDPRWGRGQETPGEDP+ +Y V GLQG G R ++AA+CKHF AYD+++WN
Sbjct: 159 NPFRDPRWGRGQETPGEDPLHVSRYVYHLVDGLQGGIGPARPQIAATCKHFAAYDMEDWN 218
Query: 217 GVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIR 276
GV R F+A+VS QD+ + + F+ CV + +V +VMCSYN +NGVPTCADP +L+ +R
Sbjct: 219 GVSRHEFDARVSTQDLAEFYLPSFKSCVRDAQVDAVMCSYNALNGVPTCADPYLLQTLLR 278
Query: 277 GEWRLNG---YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAV 333
W + ++VSDC ++ Y ++T T EAAA A+ AG DLDCG H A
Sbjct: 279 EHWDWDQPGHWVVSDCGAIDDIYIGHNYTKTGAEAAAVALNAGTDLDCGTVFPKHLGEAA 338
Query: 334 QRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQG 393
++GL + ++ ALV + ++LG FD QPYG +G KDV TP ++LA +AA +G
Sbjct: 339 EQGLYTNQTLDRALVRLYSSLVKLGYFD-PAEKQPYGSIGWKDVDTPAAEQLAHKAAVEG 397
Query: 394 IVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIH 453
IVLLKN +LPL T+A+IGP ++ T M GNY G T ++ +
Sbjct: 398 IVLLKND-QTLPLK--AKGTLALIGPYANATKQMQGNYQGPPKYIRTLEWAATQHGYQVQ 454
Query: 454 QQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSK 513
+ + F AA+ A++ AD + G+D +IE+E LDR + PG Q L+S+
Sbjct: 455 YSPGTAINNSSTAGFAAALAAAKDADVVLYAGGIDNTIESETLDRTTITWPGNQLSLISE 514
Query: 514 VSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGK 573
+S K P I++ GG +D +P + A++WAGYP Q GG AI DIL G + P G+
Sbjct: 515 LSNLHK-PLIVIQFGGGQVDDTPLLTNPHVNALLWAGYPSQEGGAAIFDILTGKAAPAGR 573
Query: 574 LPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVAN 633
LP+T YP Y +PMTEM +R PGRTYR+Y VV PFG G+ YT+F
Sbjct: 574 LPITQYPAAYTAQVPMTEMGLRAGGDN--PGRTYRWYDKAVV-PFGFGLHYTSF------ 624
Query: 634 APTVVAVPLD-GRHGSIN-ATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF 691
V D GR G N A + +A +H L +V V+N G+ + L+F
Sbjct: 625 -----EVSWDRGRLGPYNTAALVNRAPGGSHVD-RALFDTFRVQVQNTGTVTSDYVALLF 678
Query: 692 -STPPAG-HWAPHKQLVAFEKV-HVPAGAQQRVGINI 725
T AG P K LV + +V V G ++ V I +
Sbjct: 679 VKTEDAGPEPYPLKTLVGYTRVQQVKPGERRSVEIEV 715
>gi|121712174|ref|XP_001273702.1| beta-xylosidase [Aspergillus clavatus NRRL 1]
gi|119401854|gb|EAW12276.1| beta-xylosidase [Aspergillus clavatus NRRL 1]
Length = 803
Score = 447 bits (1151), Expect = e-123, Method: Compositional matrix adjust.
Identities = 292/713 (40%), Positives = 390/713 (54%), Gaps = 64/713 (8%)
Query: 40 LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGP 99
L C S + R L+ +S EKV A VPRLG+ Y WWSEALHGV+ P
Sbjct: 72 LAVCDTSRDVTTRAQSLVDAMSFAEKVNNTQYEAPGVPRLGLPAYNWWSEALHGVAGA-P 130
Query: 100 GTKFG--GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNV 157
G F G F ATSF Q I +SF+ L + + VV E RA N G AGL YW+PN+
Sbjct: 131 GVHFADSGPFSYATSFAQPILLGASFDDELVKQVATVVGTEGRAFGNAGRAGLDYWTPNI 190
Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD-RLKVAASCKHFTAYDLDNWN 216
N FRDPRWGRGQETPGEDP+ +Y V GLQG G R ++AA+CKHF AYD+++WN
Sbjct: 191 NPFRDPRWGRGQETPGEDPLHVSRYVYHLVDGLQGGIGPARPQIAATCKHFAAYDMEDWN 250
Query: 217 GVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIR 276
GV R F+A+VS QD+ + + F+ CV + +V +VMCSYN +NGVPTCADP +L+ +R
Sbjct: 251 GVSRHEFDARVSTQDLAEFYLPSFKSCVRDAQVDAVMCSYNALNGVPTCADPYLLQTLLR 310
Query: 277 GEWRLNG---YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAV 333
W + ++VSDC ++ Y ++T T EAAA A+ AG DLDCG H A
Sbjct: 311 EHWDWDQPGHWVVSDCGAIDDIYIGHNYTKTGAEAAAVALNAGTDLDCGTVFPKHLGEAA 370
Query: 334 QRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQG 393
++GL + ++ ALV + ++LG FD QPYG +G KDV TP ++LA +AA +G
Sbjct: 371 EQGLYTNQTLDRALVRLYSSLVKLGYFD-PAEKQPYGSIGWKDVDTPAAEQLAHKAAVEG 429
Query: 394 IVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIH 453
IVLLKN +LPL T+A+IGP ++ T M GNY QG +Y RT+
Sbjct: 430 IVLLKND-QTLPLK--AKGTLALIGPYANATKQMQGNY-----------QGPPKYIRTLE 475
Query: 454 ----QQGCKDVACADDQLFGAAIDASR------------QADATILVMGLDQSIEAEALD 497
Q G + G AI+ S AD + G+D +IE+E LD
Sbjct: 476 WAATQHGYQVQYSP-----GTAINNSSTAGFAAALAAAKDADVVLYAGGIDNTIESETLD 530
Query: 498 RAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGG 557
R + PG Q L+S++S K P I++ GG +D +P + A++WAGYP Q GG
Sbjct: 531 RTTITWPGNQLSLISELSNLHK-PLIVIQFGGGQVDDTPLLTNPHVNALLWAGYPSQEGG 589
Query: 558 TAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYP 617
AI DIL G + P G+LP+T YP Y +PMTEM +R PGRTYR+Y VV P
Sbjct: 590 AAIFDILTGKAAPAGRLPITQYPAAYTAQVPMTEMGLRAGGDN--PGRTYRWYDKAVV-P 646
Query: 618 FGHGMSYTNFVHTVANAPTVVAVPLD-GRHGSIN-ATISGKAIKVTHAKCNRLTLGVQVD 675
FG G+ YT+F V D GR G N A + +A +H L +V
Sbjct: 647 FGFGLHYTSF-----------EVSWDRGRLGPYNTAALVNRAPGGSHVD-RALFDTFRVQ 694
Query: 676 VKNVGSKDGAHTLLVF-STPPAG-HWAPHKQLVAFEKV-HVPAGAQQRVGINI 725
V+N G+ + L+F T AG P K LV + +V V G ++ V I +
Sbjct: 695 VQNTGTVTSDYVALLFVKTEDAGPEPYPLKTLVGYTRVQQVKPGERRSVEIEV 747
>gi|119473971|ref|XP_001258861.1| beta-xylosidase [Neosartorya fischeri NRRL 181]
gi|292495290|sp|A1DJS5.1|XYND_NEOFI RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|119407014|gb|EAW16964.1| beta-xylosidase [Neosartorya fischeri NRRL 181]
Length = 771
Score = 446 bits (1147), Expect = e-122, Method: Compositional matrix adjust.
Identities = 253/594 (42%), Positives = 350/594 (58%), Gaps = 15/594 (2%)
Query: 40 LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGP 99
L C SL + R L+ ++ +EKV + VPRLG+ Y WWSEALHGV+ P
Sbjct: 40 LAVCDTSLDVTTRARSLVNAMTFEEKVNNTQYNSPGVPRLGLPAYNWWSEALHGVAG-SP 98
Query: 100 GTKFG--GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNV 157
G +F G F ATSFPQ I ++F+ L + + VVS E RA N G AGL +W+PN+
Sbjct: 99 GVEFADSGPFSYATSFPQPILLGATFDDDLIKQVATVVSTEGRAFGNAGRAGLDFWTPNI 158
Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD-RLKVAASCKHFTAYDLDNWN 216
N FRD RWGRGQETPGEDP+ +Y V GLQ G KV A+CKHF AYDL++WN
Sbjct: 159 NPFRDARWGRGQETPGEDPLHVSRYVYHLVDGLQNGIGPANPKVVATCKHFAAYDLEDWN 218
Query: 217 GVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIR 276
GV R FNA+VS QD+ + + PF+ C + KV +VMCSYN +NGVP CAD +L+ +R
Sbjct: 219 GVVRHSFNAEVSTQDLSEFYLPPFKSCARDAKVDAVMCSYNALNGVPACADSYLLQTILR 278
Query: 277 GEWRLNG---YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAV 333
W+ + +I DC ++ Y+ ++T TP EAAA A+ AG DLDCG + A
Sbjct: 279 EHWKWDEPGHWITGDCGAIDDIYNGHNYTKTPAEAAATALNAGTDLDCGTVFPKYLGQAA 338
Query: 334 QRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQG 393
GL + ++ ALV + ++LG FD QPY +G KDV +P + LA +AA +G
Sbjct: 339 DEGLYTNKTLDKALVRLYSSLVKLGYFD-PAEDQPYRSIGWKDVDSPAAEALAHKAAVEG 397
Query: 394 IVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIH 453
IVLLKN +LPL T+A+IGP ++ T M GNY G T L + +
Sbjct: 398 IVLLKNDK-TLPLK--AKGTLALIGPYANATKQMQGNYEGPPKYIRTLLWAATQAGYDVK 454
Query: 454 QQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSK 513
+ F AA+ A++QAD + G+D +IEAE DR ++ PG Q +L+ +
Sbjct: 455 YVAGTAINANSTAGFDAALSAAKQADVVVYAGGIDNTIEAEGHDRTTIVWPGNQLDLIDQ 514
Query: 514 VSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGK 573
+S K P ++V GG +D + ++P + A++W GYP Q GG+AI DIL G + P G+
Sbjct: 515 LSKIGK-PLVVVQFGGGQVDDSSLLSNPHVNALLWTGYPSQEGGSAIFDILTGKTAPAGR 573
Query: 574 LPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNF 627
LP+T YP +Y+ +P+T+MA+RP + PGRTYR+Y V+ PFG G+ YT F
Sbjct: 574 LPVTQYPADYVNQVPLTDMALRPGSNT--PGRTYRWYDKAVL-PFGFGLHYTTF 624
>gi|70986056|ref|XP_748529.1| beta-xylosidase [Aspergillus fumigatus Af293]
gi|74668295|sp|Q4WFI6.1|BXLB_ASPFU RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
Flags: Precursor
gi|296439536|sp|B0Y0I4.1|BXLB_ASPFC RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
Flags: Precursor
gi|66846158|gb|EAL86491.1| beta-xylosidase, putative [Aspergillus fumigatus Af293]
gi|159128339|gb|EDP53454.1| beta-xylosidase [Aspergillus fumigatus A1163]
Length = 771
Score = 445 bits (1145), Expect = e-122, Method: Compositional matrix adjust.
Identities = 258/594 (43%), Positives = 352/594 (59%), Gaps = 15/594 (2%)
Query: 40 LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGP 99
L C SL + R L+ ++ +EKV + VPRLG+ Y WWSEALHGV+ P
Sbjct: 40 LAVCDTSLDVTTRAQSLVNAMTFEEKVNNTQYNSPGVPRLGLPAYNWWSEALHGVAG-SP 98
Query: 100 GTKFG--GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNV 157
G +F G F ATSFPQ I ++F+ L + + VVS E RA N G +GL +W+PN+
Sbjct: 99 GVEFADSGPFSYATSFPQPILLGATFDDDLIKQVATVVSTEGRAFGNAGRSGLDFWTPNI 158
Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD-RLKVAASCKHFTAYDLDNWN 216
N FRD RWGRGQETPGEDP+ +Y V GLQ G KV A+CKHF AYDL++WN
Sbjct: 159 NPFRDARWGRGQETPGEDPLHVSRYVYHLVDGLQNGIGPANPKVVATCKHFAAYDLEDWN 218
Query: 217 GVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIR 276
GV R FNA+VS QD+ + + PF+ C + +V +VMCSYN +NGVP CAD +L+ +R
Sbjct: 219 GVVRHSFNAEVSTQDLSEFYLPPFKSCARDARVDAVMCSYNALNGVPACADSYLLQTILR 278
Query: 277 GEWRLNG---YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAV 333
W+ + +I SDC ++ Y+ +FT+TP EAAA A+ AG DLDCG + A
Sbjct: 279 EHWKWDEPGRWITSDCGAIDDIYNGHNFTTTPAEAAATALNAGTDLDCGTVFPKYLGQAA 338
Query: 334 QRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQG 393
GL S ++ ALV + ++LG FD QPY +G DV TP + LA +AA +G
Sbjct: 339 DEGLYSNQTLDRALVRLYSSLVKLGYFD-PAEDQPYRSIGWTDVDTPAAEALAHKAAGEG 397
Query: 394 IVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIH 453
IVLLKN +LPL T+A+IGP ++ T M GNY G A T L + +
Sbjct: 398 IVLLKNDK-TLPLKA--KGTLALIGPYANATKQMQGNYEGPAKYIRTLLWAATQAGYDVK 454
Query: 454 QQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSK 513
+ F AA+ A++QAD + G+D +IEAE DR + PG Q L+ +
Sbjct: 455 YAAGTAINTNSTAGFDAALSAAKQADVVVYAGGIDNTIEAEGRDRTTIAWPGNQVNLIDQ 514
Query: 514 VSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGK 573
+S K P ++V GG +D + ++PR+ A++WAGYP Q GG+AI DIL G + P G+
Sbjct: 515 LSKIGK-PLVVVQFGGGQVDDSSLLSNPRVNALLWAGYPSQEGGSAIFDILTGKTAPAGR 573
Query: 574 LPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNF 627
LP+T YP +Y+ +PMT+MA+RP + PGRTYR+Y V+ PFG G+ YT F
Sbjct: 574 LPVTQYPADYVNQVPMTDMALRPGSNT--PGRTYRWYDKAVL-PFGFGLHYTTF 624
>gi|23304843|emb|CAD48309.1| beta-xylosidase B [Clostridium stercorarium]
Length = 715
Score = 444 bits (1143), Expect = e-122, Method: Compositional matrix adjust.
Identities = 274/741 (36%), Positives = 394/741 (53%), Gaps = 88/741 (11%)
Query: 46 SLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGG 105
S +R DL+ R++++EKV ++ + A+ RLGI Y WW+EALHGV+ G
Sbjct: 11 SYSFEERAKDLVSRMTIEEKVSQMLYNSPAIERLGIPAYNWWNEALHGVARAGT------ 64
Query: 106 DFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNV 157
AT FPQ I A++F+ L + V+S E RA Y+ + GLT+WSPN+
Sbjct: 65 ----ATMFPQAIGMAATFDEELIYKVADVISTEGRAKYHASSKKGDRGIYKGLTFWSPNI 120
Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
NIFRDPRWGRGQET GEDP L+ + ++V+GLQG+ LK CK+ + + +
Sbjct: 121 NIFRDPRWGRGQETYGEDPYLTARLGVAFVKGLQGNHPKYLKAGGMCKNILPFTVVPESL 180
Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
R FNA VSK+D+ +T+ F+ V E KV SVM +YN+ NG P C +L +RG
Sbjct: 181 --RHEFNAVVSKKDLYETYLPAFKALVQEAKVESVMGAYNRTNGEPCCGSKTLLSDILRG 238
Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGL 337
EW G++VSDC ++ ++ H T+T E+AA A+R G DL+CG G + A++ GL
Sbjct: 239 EWGFKGHVVSDCWAIRDFHMHHHVTATAPESAALAVRNGCDLNCGNMFG-NLLIALKEGL 297
Query: 338 LSEIDINNALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAARQGIVL 396
++E +I+ A+ + +M+LGMFD P Q PY + C +H+ELAL+ A++ IVL
Sbjct: 298 ITEEEIDRAVTRLMITRMKLGMFD--PEDQVPYASISSFVDCK-EHRELALDVAKKSIVL 354
Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA----RTI 452
LKN G LPL + R++AVIGPN+D +IGNY G A Y T L GI A R
Sbjct: 355 LKNDG-LLPLDRKKIRSIAVIGPNADSRQALIGNYEGTASEYVTVLDGIREMAGDDVRIY 413
Query: 453 HQQGC-------KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL--------- 496
+ GC +++ D++ A+ + AD I+ +GLD +IE E +
Sbjct: 414 YSVGCHLYKDRVENLGEPGDRI-AEAVTCAEHADVVIMCLGLDSTIEGEEMHESNIYGSG 472
Query: 497 DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAG 556
D+ L LPG+QQEL+ V K P +LVL++G + V +A D I AI+ A YPG G
Sbjct: 473 DKPDLNLPGQQQELLEAVYATGK-PIVLVLLTGSALAVTWA--DEHIPAILNAWYPGALG 529
Query: 557 GTAIADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVV 615
G AIA +LFG +NP GKLP+T+Y LP T+ +M RTYRF K +
Sbjct: 530 GRAIASVLFGETNPSGKLPVTFY--RTTEELPDFTDYSME--------NRTYRFMKNEAL 579
Query: 616 YPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVD 675
YPFG G+SYT F ++ + +S I+ V V
Sbjct: 580 YPFGFGLSYTTFDYS-------------------DLKLSKDTIRAGEG------FNVSVK 614
Query: 676 VKNVGSKDGAHTLLVFSTPPAGHW-APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVV 734
V N G G + V+ W P+ QL ++V + +G + I + L+VV
Sbjct: 615 VTNTGKMAGEEVVQVYIKDLEASWRVPNWQLSGMKRVRLESGETAEITFEIR-PEQLAVV 673
Query: 735 DRSGTRRIPLGEHNIHIGGTK 755
G I GE I++GG++
Sbjct: 674 TDEGKSVIEPGEFEIYVGGSQ 694
>gi|242786966|ref|XP_002480909.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
gi|218721056|gb|EED20475.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
Length = 757
Score = 444 bits (1142), Expect = e-122, Method: Compositional matrix adjust.
Identities = 262/712 (36%), Positives = 398/712 (55%), Gaps = 35/712 (4%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGG---DF 107
QRV LI L+L+EK+ L+ +A RLG+ YEWW+EA HGV + PG +F +F
Sbjct: 24 QRVKSLIDSLTLEEKILNLVDASAGSERLGLPSYEWWNEATHGVGS-APGVQFTEKPVNF 82
Query: 108 PGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGR 167
ATSFP I TA+SF+ L I V+ E RA N G +G +W+PN+N FRDPRWGR
Sbjct: 83 SYATSFPAPILTAASFDDALVREIASVIGREGRAFGNNGFSGFDFWAPNINPFRDPRWGR 142
Query: 168 GQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
GQETPGED + Y +++ GLQG D + +V A+CKH+ AYDL+ R+ +
Sbjct: 143 GQETPGEDSFVVQSYIRNFIPGLQGDDPEDKQVIATCKHYAAYDLE----TGRYGNDYNP 198
Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNG---Y 284
++QD+ D F PF+ CV + V S+MC+YN V+G+PTCA +L + +R W Y
Sbjct: 199 TQQDLADYFLAPFKTCVRDTGVGSIMCAYNAVDGIPTCASEYLLDQVLRKHWNFTADYNY 258
Query: 285 IVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEIDI 343
+VSDC +V + +FT T E AA+ ++ AG+DL+CG +L L+ A + + +D
Sbjct: 259 VVSDCGAVTDIWQYHNFTDTEEAAASVSLNAGVDLECGSSYLKLNESLAANQTTVQALD- 317
Query: 344 NNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
AL + +G FDG Y LG DV TP+ Q LA EAA +G+ LLKN
Sbjct: 318 -QALTRLYSALFTVGFFDG----GKYTALGFADVSTPEAQSLAYEAAVEGMTLLKNDKRL 372
Query: 404 LPL-SHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVAC 462
LP+ S ++++VA+IGP ++ T M G+Y+GI +PL+ + ++ +
Sbjct: 373 LPIRSSHKYKSVALIGPFANATTQMQGDYSGIPPFLISPLEAFKGHDWEVNYAMGTGINN 432
Query: 463 ADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPT 522
F +A+ A+ ++D I + G+D SIEAE LDR L PG Q +LV+++S K P
Sbjct: 433 QTTTGFASALAAAEKSDLVIYLGGIDNSIEAETLDRTSLTWPGNQLDLVTQLSKLHK-PL 491
Query: 523 ILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQE 582
I+V GG +D + + + A++WAGYP Q+GG+A+ D+L G + G+LP+T YP
Sbjct: 492 IVVQFGGGQLDDSALLQNEGVQALVWAGYPSQSGGSALLDVLLGKRSIAGRLPVTQYPAS 551
Query: 583 YITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPL 642
Y + + ++ +RP+ S YPGRTY++Y G V PFG+G+ YT F A L
Sbjct: 552 YADQVSIFDINIRPNDS--YPGRTYKWYTGMPVVPFGYGLHYTKFEFEWAQT-------L 602
Query: 643 DGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHT-LLVFSTPPAGHWA- 700
+ + S ++ T V+ VKN+G + + LL S+P AG
Sbjct: 603 NHEYNIQQLVASCQSTGPISDNTPFTT--VKAHVKNIGPEASDYVGLLFLSSPDAGPAPR 660
Query: 701 PHKQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHI 751
P+K LV++ ++H + +G+Q + + + + ++ D +G I G + I +
Sbjct: 661 PNKSLVSYLRLHNITSGSQGTLDLPLTLGS-MARADENGNLVIFPGHYKIAL 711
>gi|402225863|gb|EJU05924.1| hypothetical protein DACRYDRAFT_113532 [Dacryopinax sp. DJM-731
SS1]
Length = 778
Score = 442 bits (1138), Expect = e-121, Method: Compositional matrix adjust.
Identities = 264/701 (37%), Positives = 385/701 (54%), Gaps = 35/701 (4%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C +L R L+G L++ EK ++ + VPRLG+ Y WWSE LHGV++ PG
Sbjct: 42 CDSALDPLTRARALVGMLTMAEKFNNTVNASPGVPRLGLPPYNWWSEGLHGVAS-SPGVT 100
Query: 103 F---GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNI 159
F G +F ATSFP+ I ++F+ L I ++S EARA N +GL +W+PN+N
Sbjct: 101 FAPAGQNFSYATSFPEPILMGAAFDDNLIYDIATIISTEARAFNNFNHSGLDFWTPNINP 160
Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQ--GSDGDRLKVAASCKHFTAYDLDNWNG 217
RDPRWGR ETPGEDP Y A V GLQ G D K+ A+CKH+ YDL+NW G
Sbjct: 161 VRDPRWGRSLETPGEDPFHLASYVAKLVTGLQFGGDDPKYQKLVATCKHYAGYDLENWGG 220
Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
R+ F+A +S QD+ + F PF+ C + V SVMCSYN VNG+P+CA+ +L+ +R
Sbjct: 221 YARYGFDAVISNQDLVEYFLPPFQTCARDVNVTSVMCSYNAVNGIPSCANDYLLQSLLRT 280
Query: 278 EW-------RLNG-YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHT 329
W LN Y+ SDCD+V Y ++T TPE+A A +++AG DLDCG F
Sbjct: 281 YWGWEPDSESLNAHYVTSDCDAVSNIYYPHNYTITPEQAVAVSLKAGTDLDCGTFYAEWL 340
Query: 330 ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEA 389
S+ ++GL + DI+ AL+ + LG FD Q Y ++ T Q+LA A
Sbjct: 341 PSSYEQGLFHQTDIDRALIRSYAALFLLGYFD-PAEGQIYRQYNWANINTDYAQQLAYTA 399
Query: 390 ARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA 449
A +GI LLKN LPL +A+IGP ++ T M GNY GIA +PL + +
Sbjct: 400 AWEGITLLKNIDDMLPLPSTM-TNIALIGPWANATTQMQGNYQGIAPFLHSPLYALQQRG 458
Query: 450 RTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQE 509
+ ++ F AA+ A++ AD T+ + G+D ++EAEA+DR + PG Q +
Sbjct: 459 INVTYVLGTNITSNSTAGFAAALAAAQTADLTLYIGGIDITVEAEAMDRVNITWPGNQLD 518
Query: 510 LVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSN 569
L+++++ S I+ M GG ID +P++ ++W GYPGQ GGTA+ DIL+G+
Sbjct: 519 LIAQLANVSTH-LIVYQMGGGQIDDTVLLENPKVHGLLWGGYPGQDGGTAMIDILYGSRA 577
Query: 570 PGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVH 629
P G+LP++ YP +I +PMT+M + P+ PGRTY++Y G +V PFG+G+ YT F
Sbjct: 578 PAGRLPLSQYPANFINEVPMTDMRLHPALGT--PGRTYKWYSGDLVLPFGYGLHYTTFAK 635
Query: 630 TVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGV-QVDVKNVGSKDGAHTL 688
A+ S AT+ +A K + A ++ V +V N GS +
Sbjct: 636 --------AALKDHSPRSSDIATLVNEA-KQSSAWLDKAFFDVFAAEVTNTGSLTSDYVA 686
Query: 689 LVFST---PPAGHWAPHKQLVAFEKV-HVPAGAQQRVGINI 725
L + T PA + P LV++ ++ V G Q V ++
Sbjct: 687 LGYLTGEFGPAPY--PKSSLVSYTRLSQVTPGETQVVNFDL 725
>gi|115387056|ref|XP_001210069.1| predicted protein [Aspergillus terreus NIH2624]
gi|114191067|gb|EAU32767.1| predicted protein [Aspergillus terreus NIH2624]
Length = 908
Score = 442 bits (1136), Expect = e-121, Method: Compositional matrix adjust.
Identities = 275/734 (37%), Positives = 402/734 (54%), Gaps = 51/734 (6%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C SL I +RVN L+ L+L+EK+ L+ AA RLG+ YEWW+EA HGV + PG +
Sbjct: 163 CDTSLSIAERVNSLVKSLTLEEKILNLVDAAAGSTRLGLPFYEWWNEATHGVGS-APGVQ 221
Query: 103 FG---GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNI 159
F +F ATSFP I A+SF+ L I V+ E RA N G +G +W+PN+N
Sbjct: 222 FTSKPANFSYATSFPAPILIAASFDNALIRKIAEVIGKEGRAFANNGFSGFDFWAPNING 281
Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD 219
FRDPRWGRGQETPGED ++ Y +++ GLQG D +V A+CKH+ YDL+
Sbjct: 282 FRDPRWGRGQETPGEDTFVAQNYIRNFIPGLQGDDPKNKQVIATCKHYAVYDLE----TG 337
Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
R+ N ++QD+ D F PF+ CV + V S+MCSYN V+G+P CA+ +L +R W
Sbjct: 338 RYGNNYNPTQQDLSDYFLAPFKTCVRDTDVGSIMCSYNSVSGIPACANEYLLDEVLRKHW 397
Query: 280 RLNG---YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQR 335
N Y+VSDC++V + +FT T E AAA A+ AG+DL+CG +L L+ A +
Sbjct: 398 GFNADYHYVVSDCNAVTDIWQYHNFTDTEEAAAAVALNAGVDLECGSSYLKLNESLAANQ 457
Query: 336 GLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIV 395
+ +D +L + +G FDG Y HL DV P Q LA EAA +G+
Sbjct: 458 TSVKAMD--QSLARLYSALFTIGFFDG----GKYDHLDFSDVSIPAAQALAYEAAVEGMT 511
Query: 396 LLKNQGPSLPL-SHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYAR-TIH 453
LLKN G LPL S ++++VAVIGP ++ T M G Y+G A +PL R ++
Sbjct: 512 LLKNDG-LLPLHSQHKYKSVAVIGPFANATTQMQGGYSGNAPYLISPLVAFESDHRWKVN 570
Query: 454 QQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSK 513
+ + F A++ A++++D + + G+D SIE+E +DR L PG Q +L+
Sbjct: 571 YAVGTAINDQNTTGFEASLAAAKKSDLIVYLGGIDNSIESETIDRTSLAWPGNQLDLIKS 630
Query: 514 VSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGK 573
+S SK P ++V GG +D + + I A+IWAGYP Q+GGTA+ DIL G +P G+
Sbjct: 631 LSNLSK-PMVVVQFGGGQVDDSALLENKDIQALIWAGYPSQSGGTALLDILVGKRSPAGR 689
Query: 574 LPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVAN 633
LP+T YP Y + + ++ +RP+ +PGRTY++Y G V PFGHG+ YT F
Sbjct: 690 LPVTQYPASYADQINIFDINLRPNSKDSHPGRTYKWYTGKPVIPFGHGLHYTKFKF---- 745
Query: 634 APTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLG----------VQVDVKNVGSKD 683
G ++N S I+ A C R + G V+ V+NVG +
Sbjct: 746 ----------GWEETLNREYS---IQELVASCQRSSGGPIKDNTPFTTVKARVRNVGHET 792
Query: 684 GAH-TLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRR 741
+ +LL S+ AG P+K LV+++++H A RV L+ D +G+
Sbjct: 793 SDYVSLLFLSSKNAGPAPRPNKSLVSYKRLHNIAPGSDRVADLPLTLGSLARADENGSLY 852
Query: 742 IPLGEHNIHIGGTK 755
I G + I + ++
Sbjct: 853 IFPGRYKIALDNSE 866
>gi|336435507|ref|ZP_08615222.1| hypothetical protein HMPREF0988_00807 [Lachnospiraceae bacterium
1_4_56FAA]
gi|336000960|gb|EGN31106.1| hypothetical protein HMPREF0988_00807 [Lachnospiraceae bacterium
1_4_56FAA]
Length = 717
Score = 442 bits (1136), Expect = e-121, Method: Compositional matrix adjust.
Identities = 278/735 (37%), Positives = 401/735 (54%), Gaps = 87/735 (11%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
++ +L+ +++L EK L A A+PRL I Y WW+E+LHGV+ G A
Sbjct: 12 KQAEELVDQMTLMEKASQLRYDAPAIPRLHIPAYNWWNESLHGVARGGT----------A 61
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRD 162
T FPQ I A+SF+ + E IG ++ E RA YN GLT+W+PNVNIFRD
Sbjct: 62 TVFPQAIGLAASFDREMLEEIGEAIALEGRAKYNAAVKLDDRDIYKGLTFWAPNVNIFRD 121
Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--R 220
PRWGRG ET GEDP LS + SY+RGLQG DG+ +K AA KHF + +G + R
Sbjct: 122 PRWGRGHETYGEDPYLSSRLGVSYIRGLQG-DGETMKAAACAKHFAVH-----SGPEALR 175
Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
F+A+VS++D+ +T+ F+ CV EG V +VM +YN VNG P C +LK+ +R EW
Sbjct: 176 HEFDAEVSEKDLRETYLPAFQACVQEGHVEAVMGAYNCVNGEPCCGSETLLKKILREEWG 235
Query: 281 LNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSE 340
+G++VSDC ++ +++ T TP ++AA A+ AG DL+CG LH A Q GL++E
Sbjct: 236 FDGHVVSDCWAIKDFHENHLVTGTPVQSAALAMEAGCDLNCG-VTYLHLVHACQEGLVTE 294
Query: 341 IDINNALVNTLTVQMRLGMFDG-EPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKN 399
I A + T + LGMFDG E S PY + K +H++L+ AAR+ IVLLKN
Sbjct: 295 AQITEAAIRLFTTRFLLGMFDGSEYDSVPYTVVECK-----EHRDLSERAARESIVLLKN 349
Query: 400 QGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY----ARTIHQQ 455
G LPL + +T+ +IGPN+D +IGNY G + Y T L+G+ R R ++
Sbjct: 350 NGI-LPLDREKLKTIGIIGPNADSRKALIGNYHGTSSEYITVLEGVRRLVGDEVRILYSD 408
Query: 456 GC-------KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRA 499
GC +++A D+L A I +R++D IL +GLD+++E E + D+
Sbjct: 409 GCHLYENKTENLAREQDRLSEARI-VARESDVVILCLGLDETLEGEEGDTGNSYASGDKV 467
Query: 500 GLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTA 559
L LP Q+ L+ V+M K PT+L LM+G ID++FA + AI+ YPG GG A
Sbjct: 468 DLRLPKSQRMLMEAVAMEKK-PTVLCLMAGSDIDLSFA--EKHFDAIVDLWYPGAYGGAA 524
Query: 560 IADILFGTSNPGGKLPMTWYPQEYITNLPMTE-MAMRPSQSKRYPGRTYRFYKGPVVYPF 618
ADILFG +P GKLP+T+Y E + LP E +MR GRTYR+ + YPF
Sbjct: 525 AADILFGKCSPSGKLPITFY--ESLEVLPSFEDYSMR--------GRTYRYLEQKAQYPF 574
Query: 619 GHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKN 678
G+G++YT + V L+ + G+ + C +V+N
Sbjct: 575 GYGLTYTKM--------KIRNVWLENAEKDMKEVTDGENAEAAVIVC--------AEVEN 618
Query: 679 VGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRS 737
G D L ++ + H PH L FE++ V G ++ V I ++ + +VVD S
Sbjct: 619 CGGMDSQEVLQIYIRDTESEHETPHPHLAGFERIFVEKGVKKLVKIPVNRSAF-TVVDES 677
Query: 738 GTRRIPLGEHNIHIG 752
G R G++ I G
Sbjct: 678 GRRFTDSGKYEIFAG 692
>gi|323447708|gb|EGB03620.1| hypothetical protein AURANDRAFT_72703 [Aureococcus anophagefferens]
Length = 744
Score = 441 bits (1135), Expect = e-121, Method: Compositional matrix adjust.
Identities = 276/775 (35%), Positives = 393/775 (50%), Gaps = 101/775 (13%)
Query: 34 DATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHG 93
+AT LPFC +L I R D + R+++ EK+ L + + LG+ Y WWSEA G
Sbjct: 31 NATFEALPFCDATLAIDLRAADAVSRMTIPEKIDALDTKTGPIASLGLPAYNWWSEASSG 90
Query: 94 VSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYW 153
V P TKF ++P +TTA SFN TLW A G + EARA+ N G A TYW
Sbjct: 91 VMGSRPTTKF--------AYP--VTTAMSFNRTLWRATGAAIGREARALMNAGAAYSTYW 140
Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD--RLKVAASCKHFTAYD 211
+P VN+ R+PRWGR E PGEDP L+G+YA +V G Q + D L+ +A CKH+ A +
Sbjct: 141 APVVNLAREPRWGRNIEVPGEDPYLTGEYATEFVGGFQAAPEDPYHLQASACCKHYVANE 200
Query: 212 LDN--------WNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVP 263
L+N W DR H ++ V+++D+ D++ VPF+ CV +GKV+S+MCSYN VNGVP
Sbjct: 201 LENTRQPDGEQW---DRQHVDSNVTQRDLVDSYMVPFQACVEKGKVSSLMCSYNAVNGVP 257
Query: 264 TCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGP 323
+CA+ +L+ R W +GYI SDCD+ YD H+ +TPEEA AD ++AG D+DC
Sbjct: 258 SCANDWLLRTVARDAWHFDGYITSDCDADSNVYDAHHYAATPEEAVADVLKAGTDVDCQS 317
Query: 324 FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFD-GEPSSQPYGHLGPKD----VC 378
F+G H SA+ +GL++E D++ LVN V++RLG FD +++P G L D VC
Sbjct: 318 FVGQHARSALDKGLITEADMDARLVNLFKVRLRLGHFDLSFDAAKPRGPLDEIDADAVVC 377
Query: 379 TPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGY 438
+ H + ++E Q LLKN G +LPL T AV+GPN+ ++ G Y
Sbjct: 378 SDAHLDASMEGLAQSATLLKNDG-ALPLKP--SGTAAVVGPNALLSKADAGYYG------ 428
Query: 439 TTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDR 498
+ ADA +L +G D + AE D
Sbjct: 429 -----------------------------------PTDAADAVVLAVGTDLTWAAEGKDA 453
Query: 499 AGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA--FAKNDPRIAAIIWAGYPGQAG 556
++ Q EL+ V+ AS P ++V+ S P+D+ A++D ++ A++ G P
Sbjct: 454 TSIVFTAAQLELIDAVATASATPVVVVVFSATPLDLTPLLARSDGKVGAVVHVGQP-SVT 512
Query: 557 GTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSK---------------- 600
+ D+L+G + G+ T YP Y + + + MRP S
Sbjct: 513 VKGLGDLLYGRRSFAGRAVQTVYPAAYADQISIFDFNMRPGPSAFARPDCATNESACPRG 572
Query: 601 RYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV-PLDGRHGSINATIS--GKA 657
PGRTYRFY V PFG G+SYT F + V +APT V + PL + + A G A
Sbjct: 573 TNPGRTYRFYVDEPVVPFGFGLSYTTFAYAVRSAPTTVDLAPLRAAYAGVAAARGDGGPA 632
Query: 658 IKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG--HWAPHKQLVAFEKVHVPA 715
H T VDV N G D +L F TPP P K+L FE+VHV A
Sbjct: 633 FLSLHDDAAAATYA--VDVTNTGDIDADDVVLGFVTPPGAGVDGVPLKELFGFERVHVKA 690
Query: 716 GAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG--GTKHSVSLHAATLGVI 768
G + V + + K+ +V + P G++ I G T+ + AATL
Sbjct: 691 GETKTVYLYPALSKFKTVAEDGALAARP-GDYAIEFGIPETRGRMGYAAATLAAF 744
>gi|333379783|ref|ZP_08471502.1| hypothetical protein HMPREF9456_03097 [Dysgonomonas mossii DSM
22836]
gi|332884929|gb|EGK05184.1| hypothetical protein HMPREF9456_03097 [Dysgonomonas mossii DSM
22836]
Length = 737
Score = 441 bits (1135), Expect = e-121, Method: Compositional matrix adjust.
Identities = 265/745 (35%), Positives = 396/745 (53%), Gaps = 85/745 (11%)
Query: 38 RTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNV 97
+ PF +L I +RVNDL+ +L+L+EKV +++ A+ RL I Y WW+E LHG+
Sbjct: 23 QNYPFQNTNLSIDERVNDLVSKLTLEEKVAQMLNNTPAIERLNIPAYNWWNECLHGIGRT 82
Query: 98 GPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------G 149
D+ T FPQ I A+++N L + + +SDE RA+YN T+ G
Sbjct: 83 --------DYK-VTVFPQAIGMAAAWNKELMKEVASAISDEGRAIYNDATSKGNREIYYG 133
Query: 150 LTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTA 209
LTYW+PN+NIFRDPRWGRGQET GEDP L+G S+V GLQG D LK AA KH+
Sbjct: 134 LTYWTPNINIFRDPRWGRGQETYGEDPFLTGVLGKSFVAGLQGDDTKYLKAAACAKHYAV 193
Query: 210 YDLDNWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCAD 267
+ +G + R FN V+ D+ DT+ FR V+E KVA VMC+YN NG P C +
Sbjct: 194 H-----SGPENTRHTFNTFVTDYDLWDTYLPAFRNLVVEAKVAGVMCAYNAYNGEPCCGN 248
Query: 268 PNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEE--AAADAIRAGLDLDCGPFL 325
+++ +R +W GY+ SDC ++ +Y QH + P+ AAADA+ G D+DCG
Sbjct: 249 NFLMQEILREKWNFTGYVTSDCGAIDDFY--QHHKTHPDAKYAAADAVYNGTDIDCGNEA 306
Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
AV+ G+++E I+ +L T++ RLGMFD + + Y + + + H++L
Sbjct: 307 YKALVDAVKTGIITEKQIDISLKRLFTIRFRLGMFDPAENVK-YSQISTSVLESQKHKDL 365
Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
AL+ R+ IVLLKN+ +LPLS + + VAV+GPN++ V+++GNY G TP + +
Sbjct: 366 ALKITRESIVLLKNENNTLPLSK-KLKKVAVVGPNANNEVSVLGNYNGFPTEIVTPYEAV 424
Query: 446 GRY---ARTIHQQGCKDV--ACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL---- 496
+ A I+++G V + + A + + D I V G+ +E E +
Sbjct: 425 KQKLKGAEVIYEKGIDFVTPSTNSKEEVSALVKRLKDVDVVIFVGGISPELEGEEMPVKI 484
Query: 497 ------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
DR + LP Q + + K +A K PT+ V+M+G I + + I AI+ A
Sbjct: 485 EGFTGGDRTSIKLPKIQTDFM-KALVAEKIPTVFVMMTGSAIATEWESQN--IPAIVNAW 541
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFY 610
Y GQ GTAIAD+LFG NP GKLP+T+Y ++ ++LP S RTYR++
Sbjct: 542 YGGQDAGTAIADVLFGDYNPSGKLPVTFYAKD--SDLPAF-------NSYEMKNRTYRYF 592
Query: 611 KGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTL 670
G V+YPFG+G+SYT F ++ P+ + +G KV+
Sbjct: 593 NGEVLYPFGYGLSYTKFEYSPIQVPSTID--------------TGNNAKVS--------- 629
Query: 671 GVQVDVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCK 729
V +KN G +G + L S P P L F +V + AG + V N+ +
Sbjct: 630 ---VSIKNTGKVEGEEVVQLYISYPDTKGQKPLYALKGFNRVSLKAGESKTVEFNLS-PR 685
Query: 730 YLSVVDRSGTRRIPLGEHNIHIGGT 754
L +VD +G ++ G+ I IGG+
Sbjct: 686 ELGLVDDAGILKVSAGKRKIFIGGS 710
>gi|429850127|gb|ELA25427.1| glycoside hydrolase family 3 protein [Colletotrichum
gloeosporioides Nara gc5]
Length = 918
Score = 441 bits (1134), Expect = e-121, Method: Compositional matrix adjust.
Identities = 271/729 (37%), Positives = 389/729 (53%), Gaps = 40/729 (5%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C SL QR L+ L++ EK+ L++ A +PRL + YEWWSE LHGV+ PGTK
Sbjct: 170 CDESLSDKQRAAALVAELTIWEKLDNLVNEAPGIPRLRVPPYEWWSEGLHGVAR-SPGTK 228
Query: 103 F--GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIF 160
F G+F ATSFPQ I S+F+ L A+G VVS EARA N G +GL +SPN+N F
Sbjct: 229 FTSKGNFSYATSFPQPILLGSAFDDELVRAVGEVVSREARAFSNAGRSGLDLYSPNINAF 288
Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDR 220
+DPRWGRGQETPGED KY ++ + GL+G D D+ K+ A+CKH+ A D +N+ GVDR
Sbjct: 289 KDPRWGRGQETPGEDTFHLQKYVSAMLSGLEGDDPDK-KLIATCKHYAANDFENYKGVDR 347
Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
FNA +S QD+ + + PF+ C +E V S MCSYN +NG P CA+ +++ +R W
Sbjct: 348 SGFNAVISTQDLSEYYLPPFKTCAVEKNVGSFMCSYNGINGTPLCANSYLIEDILRKHWG 407
Query: 281 LNG---YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHT-ESAVQRG 336
NG Y+ +DCD V + H+ AAA +++AG DL+C F G +SA +
Sbjct: 408 WNGDGQYVSTDCDCVALMVSYHHYAPDLGHAAAWSMQAGTDLECNAFPGSEALQSAWNQS 467
Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
L+SE D++ AL T + +G+FD + P LG +V T + Q+LA AA +G VL
Sbjct: 468 LISEKDVDKALTRMYTSLVSVGLFDLD-RKDPLRSLGWDEVNTKEAQDLAYRAAVEGAVL 526
Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQG 456
+KN G LPLS + A+IGP T M GNY G A +P ++
Sbjct: 527 MKNDG-ILPLSPDSSKKYALIGPWVSATTQMQGNYFGPAPYLISP------------RKA 573
Query: 457 CKDVACADDQLFGAAIDASRQA-----------DATILVMGLDQSIEAEALDRAGLLLPG 505
KD+ G+ + S + D I + G+D ++E E LDR L P
Sbjct: 574 AKDLGLDFTYFLGSRTNKSDSSFAQAIKAAQAADVVIFMGGVDNTLEQETLDRNTLAWPE 633
Query: 506 RQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILF 565
Q +L+ +S K P +++ GG +D + + AI+W GYPGQ+GG AI DI+F
Sbjct: 634 PQLQLLRALSEVGK-PLVVLQFGGGQVDDTELLANDSVNAILWGGYPGQSGGKAILDIVF 692
Query: 566 GTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYT 625
G + P G+L +T YP Y +P T+M +RP GRTYR+Y G P+G G+ YT
Sbjct: 693 GRAAPAGRLSVTQYPASYNDAVPATDMNLRPGPGNSGLGRTYRWYTGETPVPYGFGLHYT 752
Query: 626 NFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGA 685
F + A V + + N + + I R + V V KN G+
Sbjct: 753 KFSVDMKPASNVHNIDIAQMAAEANDDAASE-IPSWQRGLERRMVTVTVSAKNEGNVISD 811
Query: 686 HTLLVFSTPPAG--HWAPHKQLVAFEKV-HVPAGAQQRVGINIHVCKYLSVVDRSGTRRI 742
+ LVF AG W P K LV + ++ ++ G +++ I I + + L VD G R +
Sbjct: 812 YVALVFLRSEAGPKPW-PQKTLVGYTRLRNIKPGEERKEEIIIKM-EQLVRVDEVGNRVL 869
Query: 743 PLGEHNIHI 751
G +++ +
Sbjct: 870 YEGLYSLFL 878
>gi|7671419|emb|CAB89360.1| beta-glucosidase-like protein [Arabidopsis thaliana]
gi|9758998|dbj|BAB09525.1| unnamed protein product [Arabidopsis thaliana]
Length = 411
Score = 441 bits (1133), Expect = e-121, Method: Compositional matrix adjust.
Identities = 216/410 (52%), Positives = 278/410 (67%), Gaps = 5/410 (1%)
Query: 355 MRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTV 414
MRLG FDG P +QPYG LGPKDVCT +++ELA+E ARQGIVLLKN SLPLS +T+
Sbjct: 1 MRLGFFDGNPKNQPYGGLGPKDVCTVENRELAVETARQGIVLLKNSAGSLPLSPSAIKTL 60
Query: 415 AVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTI-HQQGCKDVACADDQLFGAAID 473
AVIGPN++VT TMIGNY G+AC YTTPLQG+ R T + +GC +V C + L A
Sbjct: 61 AVIGPNANVTKTMIGNYEGVACKYTTPLQGLERTVLTTKYHRGCFNVTCTEADLDSAKTL 120
Query: 474 ASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPID 533
A+ ADAT+LVMG DQ+IE E LDR L LPG+QQELV++V+ A++GP +LV+MSGG D
Sbjct: 121 AA-SADATVLVMGADQTIEKETLDRIDLNLPGKQQELVTQVAKAARGPVVLVIMSGGGFD 179
Query: 534 VAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMA 593
+ FAKND +I +I+W GYPG+AGG AIAD++FG NP GKLPMTWYPQ Y+ +PMT M
Sbjct: 180 ITFAKNDEKITSIMWVGYPGEAGGIAIADVIFGRHNPSGKLPMTWYPQSYVEKVPMTNMN 239
Query: 594 MRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSIN--- 650
MRP +S Y GRTYRFY G VY FG G+SYTNF H + AP V++ LD +
Sbjct: 240 MRPDKSNGYLGRTYRFYIGETVYAFGDGLSYTNFSHQLIKAPKFVSLNLDESQSCRSPEC 299
Query: 651 ATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEK 710
++ A R VQ+ V+NVG ++G T+ +F+TPP H +P KQL+ FEK
Sbjct: 300 QSLDAIGPHCEKAVGERSDFEVQLKVRNVGDREGTETVFLFTTPPEVHGSPRKQLLGFEK 359
Query: 711 VHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
+ + + V + VCK L VVD G R++ LG H +H+G KHS ++
Sbjct: 360 IRLGKKEETVVRFKVDVCKDLGVVDEIGKRKLALGHHLLHVGSLKHSFNI 409
>gi|67523807|ref|XP_659963.1| hypothetical protein AN2359.2 [Aspergillus nidulans FGSC A4]
gi|74597492|sp|Q5BAS1.1|XYND_EMENI RecName: Full=Exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|40745314|gb|EAA64470.1| hypothetical protein AN2359.2 [Aspergillus nidulans FGSC A4]
gi|259487761|tpe|CBF86686.1| TPA: Beta-xylosidase (EC 3.2.1.37)
[Source:UniProtKB/TrEMBL;Acc:O42810] [Aspergillus
nidulans FGSC A4]
Length = 803
Score = 439 bits (1130), Expect = e-120, Method: Compositional matrix adjust.
Identities = 266/730 (36%), Positives = 390/730 (53%), Gaps = 36/730 (4%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
P C SL R L+ + E V + V RLG+ Y+ W EALHGV
Sbjct: 59 PVCDRSLSPKDRATALVSLFTFDELVNNTGNTGLGVSRLGLPNYQVWGEALHGVGRAN-- 116
Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIF 160
G+F ATSFP IT ++ N TL IG +VS + RA N G G+ +SPN+N F
Sbjct: 117 FVESGNFSWATSFPMPITMMAALNKTLIHQIGTIVSTQLRAFSNAGLGGVDVYSPNINTF 176
Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVD 219
R P WGRGQETPGED L+ Y Y+ LQG D + LK+ A+ KH+ YD+++WN
Sbjct: 177 RHPVWGRGQETPGEDAFLTSVYGYEYITALQGGVDPETLKIIATAKHYAGYDIESWNNHS 236
Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG-- 277
R + ++++Q++ + + PF + + KV SVMCSYN VNGVP+CA+ L+ +R
Sbjct: 237 RLGNDMQITQQELSEYYTPPFIVASRDAKVRSVMCSYNAVNGVPSCANKFFLQTLLRDTF 296
Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGL 337
E+ +GY+ DC +V ++ + S A+AD+I AG D+DCG H+E A + L
Sbjct: 297 EFSEDGYVSGDCGAVYNVWNPHGYASNEAAASADSILAGTDIDCGTSYQWHSEDAFEDSL 356
Query: 338 LSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLL 397
+S DI ++ + ++ G FDGE + PY + DV + D +A EAA +GIVLL
Sbjct: 357 VSRSDIERGVIRLYSNLVQAGYFDGEDA--PYRDITWDDVLSTDAWNIAYEAAVEGIVLL 414
Query: 398 KNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGC 457
KN +LPLS ++VAVIGP ++VT + GNY G A +PL G +H
Sbjct: 415 KND-ETLPLSK-DIKSVAVIGPWANVTEELQGNYFGPAPYLISPLTGFRDSGLDVHYALG 472
Query: 458 KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMA 517
++ F A+ A++QADA I G+D +IEAEA+DR + PG Q +L+SK+S
Sbjct: 473 TNLTSHSTSGFEEALTAAKQADAIIFAGGIDNTIEAEAMDRENITWPGNQLDLISKLSEL 532
Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
K P +++ M GG +D + K++ + A+IW GYPGQ+GG A+ADI+ G P G+L T
Sbjct: 533 GK-PLVVLQMGGGQVDSSSLKDNDNVNALIWGGYPGQSGGHALADIITGKRAPAGRLVTT 591
Query: 578 WYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVA----- 632
YP EY P +M +RP+++ PG+TY +Y G VY FGHG+ YT F +
Sbjct: 592 QYPAEYAEVFPAIDMNLRPNETSGNPGQTYMWYTGTPVYEFGHGLFYTTFEESTETTDAG 651
Query: 633 --NAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLV 690
N TV+ P G HA+ L L VKN G ++ +T LV
Sbjct: 652 SFNIQTVLTTPHSGYE---------------HAQQKTL-LNFTATVKNTGERESDYTALV 695
Query: 691 FSTPPAGHWA-PHKQLVAFEKV-HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHN 748
+ AG P K +V F+++ + G Q + + + V + ++ D G R + G +
Sbjct: 696 YVNTTAGPAPYPKKWVVGFDRLGGLEPGDSQTLTVPVTV-ESVARTDEQGNRVLYPGSYE 754
Query: 749 IHIGGTKHSV 758
+ + + V
Sbjct: 755 LALNNERSVV 764
>gi|343428088|emb|CBQ71612.1| related to Beta-xylosidase [Sporisorium reilianum SRZ2]
Length = 698
Score = 439 bits (1130), Expect = e-120, Method: Compositional matrix adjust.
Identities = 253/619 (40%), Positives = 344/619 (55%), Gaps = 22/619 (3%)
Query: 39 TLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVG 98
TLP C SL R L+ + + E + ++ A VPRLGI Y+WW+EALHGV+
Sbjct: 31 TLPVCDTSLDFYTRATSLVAQFTTAELINNTVNHAPGVPRLGIPQYQWWTEALHGVAR-S 89
Query: 99 PGTKF----GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWS 154
PG F G+F ATSFPQVI ++F+ L+EA+ +++E RA N G AGL +S
Sbjct: 90 PGVNFNPDAAGEFGCATSFPQVINLGATFDDALYEAVAAHIANETRAFSNAGRAGLNMYS 149
Query: 155 P-NVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD-----RLKVAASCKHFT 208
P N+N FRDPRWGRGQET GEDP+ +YA VRGLQG RL +AA+CKH+
Sbjct: 150 PLNINAFRDPRWGRGQETVGEDPLHLSRYAVRVVRGLQGPAAQDEANPRLTLAATCKHYL 209
Query: 209 AYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADP 268
AYDL+ GV+R+ F+A VS QD+ D FR CV +G ++M SYN VNGVP A
Sbjct: 210 AYDLEASAGVERYQFDALVSNQDLADLHLPQFRACVRDGGATTLMTSYNAVNGVPPSASK 269
Query: 269 NILKRTIRGEWRLN---GYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFL 325
L+ R W L+ Y+ SDCD+V YD H+ + AAA ++ AG DLDCG
Sbjct: 270 YYLETLARDTWGLDKHHNYVTSDCDAVANVYDAHHYAADYVHAAAASLNAGTDLDCGATY 329
Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
+A+ + L I A+ +RLG FD +QP LG KDV P Q+L
Sbjct: 330 RDSLAAALAQNLTDVATIRRAVTRMYGSLVRLGYFDAA-EAQPLRQLGWKDVNAPAAQKL 388
Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
A EAA I LLKN+ +LPL +T+A+IGP ++ T + GNYAG + TP
Sbjct: 389 AYEAAAASITLLKNRQSTLPLRETAGKTIALIGPYTNATFALRGNYAGPSPLVITPFDAA 448
Query: 446 GRY---ARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLL 502
R A + G D AA+ ++ AD + G+D ++E E+LDR +
Sbjct: 449 RRTFSDAHIVSANGTSIAGPYDTATASAALATAKSADIIVYAGGIDPTVEGESLDRRDIA 508
Query: 503 LPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIAD 562
P Q L+ +++ K ++V GG +D A K D + A++WAGYPGQ+G A+ D
Sbjct: 509 WPANQLRLIQELAALGKV-LVVVQFGGGQVDGALLKGDDGVGALVWAGYPGQSGALALMD 567
Query: 563 ILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGM 622
IL G P G+LP+T YP Y L T MA+RP+ + YPGRTY++Y G +PFG G+
Sbjct: 568 ILAGKRAPAGRLPITQYPANYTHALRETTMALRPTAT--YPGRTYKWYTGTPTFPFGFGL 625
Query: 623 SYTNFVHTVANAPTVVAVP 641
YT F ++A P +P
Sbjct: 626 HYTTFRASIA-PPATYTIP 643
>gi|156062754|ref|XP_001597299.1| hypothetical protein SS1G_01493 [Sclerotinia sclerotiorum 1980]
gi|154696829|gb|EDN96567.1| hypothetical protein SS1G_01493 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 758
Score = 439 bits (1130), Expect = e-120, Method: Compositional matrix adjust.
Identities = 274/712 (38%), Positives = 392/712 (55%), Gaps = 62/712 (8%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C + R L+ +L EK+ + + VPR+G+ Y+WW+EALHG++ GT
Sbjct: 34 CDTTADPYTRATALVSLFTLAEKINNTGNTSPGVPRIGLPAYQWWNEALHGIAY---GTH 90
Query: 103 F---GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNI 159
F G ++ ATSFPQ I ++F+ L + +S EARA N GL +W+PN+N
Sbjct: 91 FAAAGSNYSYATSFPQPILMGAAFDDALIHDVASQISTEARAFSNANRYGLNFWTPNINP 150
Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD--RLKVAASCKHFTAYDLDNWNG 217
++DPRWGRGQETPGEDP Y + V GLQG D K A+CKH+ YDL+N G
Sbjct: 151 YKDPRWGRGQETPGEDPFHVSSYVNALVTGLQGGLDDLPYKKGVATCKHYAGYDLENGGG 210
Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
+ R+ F+A ++ QD+ D + F+ C + V S+MCSYN VNGVPTCAD +L+ +R
Sbjct: 211 IQRYAFDAIINSQDLRDYYLPSFQQCARDSNVQSIMCSYNAVNGVPTCADDWLLQSLLRE 270
Query: 278 EW---RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQ 334
W + ++ SDCD+V +D+ ++TSTPE+AAADA+ AG DLDCG F + SA
Sbjct: 271 HWGWVEEDQWVTSDCDAVQNIWDSHNYTSTPEQAAADALNAGTDLDCGGFWPTYLGSAYN 330
Query: 335 RGLLSEIDINNALVNTLTVQMRLGMFDGEPSS-QPYGHLGPKDVCTPDHQELALEAARQG 393
+ L + ++ +L +RLG FD P+S QPY LG DV TP ++LAL+AA G
Sbjct: 331 QSLYNISTLDRSLTRRYASLVRLGYFD--PASIQPYRQLGWSDVSTPSAEQLALQAAEDG 388
Query: 394 IVLLKNQGPSLPL-SHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL---QGIGRYA 449
IVLLKN G LPL S+I + VA+IGP ++ T M GNY G A +PL Q G +
Sbjct: 389 IVLLKNDG-ILPLPSNITN--VALIGPWANATTQMQGNYYGQAPYLHSPLIAAQNAGFHV 445
Query: 450 RTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQE 509
+ QG D+ + F AAI A+++AD I + G+D SIEAEA DR + P Q
Sbjct: 446 TYV--QGA-DIDSTNTTEFTAAIAAAKKADVIIYIGGIDNSIEAEAKDRKTIAWPSSQIS 502
Query: 510 LVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSN 569
LV++++ S P I+ M G ID + + + IIWAGYPGQ GGTAI +IL G +
Sbjct: 503 LVNQLANLSI-PLIISQM-GTMIDSSSLLTNRGVNGIIWAGYPGQDGGTAIFNILTGKTA 560
Query: 570 PGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVH 629
P G+LP+T YP +Y+ + M M + P + PGRTY+++ G ++ FG G+ YT F
Sbjct: 561 PAGRLPITQYPSDYVNEVSMNNMNLHPGANN--PGRTYKWFNGTSIFDFGFGLHYTTF-- 616
Query: 630 TVANAPTVVAVPLDGRHGSINATI---SGKAIKVTHAKCNRLT-------LGVQVDVKNV 679
NA I S +++H N T L + + + N
Sbjct: 617 --------------------NAKITPPSSNTFEISHLTSNTSTHKDLTPFLTLPISISNT 656
Query: 680 GSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVH-VPAGAQQRVGINIHVCK 729
G+ + L+F T G P K LVA+ ++H + GA + +++
Sbjct: 657 GTTTSDYVALLFLTGSFGPTPYPKKSLVAYTRLHDIKGGASSTAQLKLNLAS 708
>gi|2920706|emb|CAA73902.1| beta-xylosidase [Emericella nidulans]
Length = 802
Score = 439 bits (1128), Expect = e-120, Method: Compositional matrix adjust.
Identities = 265/730 (36%), Positives = 391/730 (53%), Gaps = 36/730 (4%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
P C SL R L+ + E V + V RLG+ Y+ W EALHGV
Sbjct: 58 PVCDRSLSPKDRATALVSLFTFDELVNNTGNTGLGVSRLGLPNYQVWGEALHGVGRAN-- 115
Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIF 160
G+F ATSFP IT ++ N TL IG +VS + RA N G G+ +SPN+N F
Sbjct: 116 FVESGNFSWATSFPMPITMMAALNKTLIHQIGTIVSTQLRAFSNAGLGGVDVYSPNINTF 175
Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVD 219
R P WGRGQETPGED L+ Y Y+ LQG+ D + K+ A+ KH+ YD+++WN
Sbjct: 176 RHPVWGRGQETPGEDAFLTSVYGYEYITALQGAVDPETSKIIATAKHYAGYDIESWNNHS 235
Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG-- 277
R + ++++Q++ + + PF + + KV SVMCSYN VNGVP+CA+ L+ +R
Sbjct: 236 RLGNDMQITQQELSEYYTPPFIVASRDAKVRSVMCSYNAVNGVPSCANKFFLQTLLRDTF 295
Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGL 337
E+ +GY+ DC +V ++ + S A+AD+I AG D+DCG H+E A + L
Sbjct: 296 EFSEDGYVSGDCGAVYNVWNPHGYASNEAAASADSILAGTDIDCGTSYQWHSEDAFEDSL 355
Query: 338 LSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLL 397
+S DI ++ + ++ G FDGE + PY + DV + D +A EAA +GIVLL
Sbjct: 356 VSRSDIERGVIRLYSNLVQAGYFDGEDA--PYRDITWDDVLSTDAWNIAYEAAVEGIVLL 413
Query: 398 KNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGC 457
KN +LPLS ++VAVIGP ++VT + GNY G A +PL G +H
Sbjct: 414 KND-ETLPLSK-DIKSVAVIGPWANVTEELQGNYFGPAPYLISPLTGFRDSGLDVHYALG 471
Query: 458 KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMA 517
++ F A+ A++QADA I G+D +IEAEA+DR + PG Q +L+SK+S
Sbjct: 472 TNLTSHSTSGFEEALTAAKQADAIIFAGGIDNTIEAEAMDRENITWPGNQLDLISKLSEL 531
Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
K P +++ M GG +D + K++ + A+IW GYPGQ+GG A+ADI+ G P G+L T
Sbjct: 532 GK-PLVVLQMGGGQVDSSSLKDNDNVNALIWGGYPGQSGGHALADIITGKRAPAGRLVTT 590
Query: 578 WYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVA----- 632
YP EY P +M +RP+++ PG+TY +Y G VY FGHG+ YT F +
Sbjct: 591 QYPAEYAEVFPAIDMNLRPNETSGNPGQTYMWYTGTPVYEFGHGLFYTTFEESTETTDAG 650
Query: 633 --NAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLV 690
N TV+ P G HA+ L L VKN G ++ +T LV
Sbjct: 651 SFNIQTVLTTPHSGYE---------------HAQQKTL-LNFTATVKNTGERESDYTALV 694
Query: 691 FSTPPAGHWA-PHKQLVAFEKV-HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHN 748
+ AG P K +V F+++ + G Q + + + V + ++ D G R + G ++
Sbjct: 695 YVNTTAGPAPYPKKWVVGFDRLGGLEPGDSQTLTVPVTV-ESVARTDEQGNRVLYPGSYD 753
Query: 749 IHIGGTKHSV 758
+ + + V
Sbjct: 754 VALNNERSVV 763
>gi|240146254|ref|ZP_04744855.1| beta-glucosidase [Roseburia intestinalis L1-82]
gi|257201613|gb|EEU99897.1| beta-glucosidase [Roseburia intestinalis L1-82]
gi|291539969|emb|CBL13080.1| Beta-glucosidase-related glycosidases [Roseburia intestinalis
XB6B4]
Length = 710
Score = 438 bits (1127), Expect = e-120, Method: Compositional matrix adjust.
Identities = 263/735 (35%), Positives = 392/735 (53%), Gaps = 87/735 (11%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
+R +L+G+++L+EKV + A AV RL IK Y WW+EALHGV+ G A
Sbjct: 15 KRAAELVGKMTLEEKVAQTLYQAPAVERLNIKAYNWWNEALHGVARAGT----------A 64
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG--------LTYWSPNVNIFRD 162
T FPQ I A++F+ L E +G VS EARA +N G LT+W+PNVNIFRD
Sbjct: 65 TVFPQAIGLAATFDEDLLEQVGDAVSTEARAKFNMQQEGKDTDIYKGLTFWAPNVNIFRD 124
Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--R 220
PRWGRG ET GEDP L+ + Y+ GLQG D + LK AA KHF + +G + R
Sbjct: 125 PRWGRGHETFGEDPYLTSRLGVRYIEGLQGHDENYLKAAACAKHFAVH-----SGPEAVR 179
Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
F+A+V++QD+ +T+ F CV EGKV +VM +YN+ NGVP C + +L +R EW
Sbjct: 180 HEFDAEVTEQDLRETYLPAFEACVKEGKVEAVMGAYNRTNGVPCCGNKRLLIDILRKEWG 239
Query: 281 LNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSE 340
+G++ SDC ++ +++ H T T E+ A A+ G DL+CG G + AV++GL+ E
Sbjct: 240 FSGHVTSDCWAIRDFHEGHHVTGTAIESVAMAMNNGCDLNCGTLFGFLVQ-AVRQGLVKE 298
Query: 341 IDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
++ A+ N +M+LG+FD + PY + + + ++L AR+ +VLLKN+
Sbjct: 299 ERLDEAVTNLFMARMKLGVFD-KKEENPYDKIPYLAADSREMKKLNEAVARRTVVLLKNK 357
Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY----ARTIHQQG 456
LPL + +TV VIGPN+D ++GNY G A Y T L+GI Y R ++ +G
Sbjct: 358 EHILPLDKNKIKTVGVIGPNADSRRALVGNYEGTASRYITVLEGIEDYVGDDVRVLYSEG 417
Query: 457 C-------KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRAG 500
C ++A +D++ + +++D + V+GLD IE E + D+
Sbjct: 418 CHLYKDRTSNLAQENDRM-SEVLGVCKESDVVVAVLGLDAGIEGEEGDAGNEYGSGDKPD 476
Query: 501 LLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
L LPG Q+E++ K P ILVL+SG + V +A D + AI+ YPG GG AI
Sbjct: 477 LNLPGLQEEILEAAVSCGK-PVILVLLSGSALAVNWA--DEHVDAIVQGWYPGARGGAAI 533
Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGH 620
ADILFG +NP GKLP+T+Y T + + GRTYR+ + +YPFG+
Sbjct: 534 ADILFGEANPEGKLPVTFY---------RTTEELPDFEDYSMQGRTYRYMEQEALYPFGY 584
Query: 621 GMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVG 680
G+SYT + + V +G +T+G+ VKN G
Sbjct: 585 GLSYTEYAYQNVRFLEQEPVVSEG-----------------------VTIGLS--VKNTG 619
Query: 681 SKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTR 740
DG T+ V+ PH QL K+ + AG ++ + I + ++ + D +G +
Sbjct: 620 KMDGTETVQVYVKAEHSK-MPHGQLKKIVKLPLCAGEEKEINIRLESEAFM-LYDENGEK 677
Query: 741 RIPLGEHNIHIGGTK 755
+P G I +GG +
Sbjct: 678 ILPSGHFEIFVGGMQ 692
>gi|291537442|emb|CBL10554.1| Beta-glucosidase-related glycosidases [Roseburia intestinalis
M50/1]
Length = 710
Score = 438 bits (1126), Expect = e-120, Method: Compositional matrix adjust.
Identities = 262/735 (35%), Positives = 392/735 (53%), Gaps = 87/735 (11%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
+R +L+G+++L+EKV + A AV RL IK Y WW+EALHGV+ G A
Sbjct: 15 KRAAELVGKMTLEEKVAQTLYQAPAVERLNIKAYNWWNEALHGVARAGT----------A 64
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG--------LTYWSPNVNIFRD 162
T FPQ I A++F+ L E +G VS EARA +N G LT+W+PNVNIFRD
Sbjct: 65 TVFPQAIGLAATFDEDLLEQVGDAVSTEARAKFNMQQEGKDTDIYKGLTFWAPNVNIFRD 124
Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--R 220
PRWGRG ET GEDP L+ + Y+ GLQG D + LK AA KHF + +G + R
Sbjct: 125 PRWGRGHETFGEDPYLTSRLGVRYIEGLQGHDENYLKAAACAKHFAVH-----SGPEAVR 179
Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
F+A+V++QD+ +T+ F CV EGKV +VM +YN+ NGVP C + +L +R EW
Sbjct: 180 HEFDAEVTEQDLRETYLPAFEACVKEGKVEAVMGAYNRTNGVPCCGNKRLLIDILRKEWG 239
Query: 281 LNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSE 340
+G++ SDC ++ +++ H T T E+ A A+ G DL+CG G + AV++GL+ E
Sbjct: 240 FSGHVTSDCWAIRDFHEGHHVTGTAIESVAMAMNNGCDLNCGTLFGFLVQ-AVRQGLVKE 298
Query: 341 IDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
++ A+ N +M+LG+FD + PY + + + ++L AR+ +VLLKN+
Sbjct: 299 ERLDEAVTNLFMARMKLGVFD-KKEENPYDKIPYLAADSREMKKLNEAVARRTVVLLKNK 357
Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY----ARTIHQQG 456
LPL + +T+ VIGPN+D ++GNY G A Y T L+GI Y R ++ +G
Sbjct: 358 EHILPLDKNKIKTIGVIGPNADSRRALVGNYEGTASRYITVLEGIEDYVGDDVRVLYSEG 417
Query: 457 C-------KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRAG 500
C ++A +D++ + +++D + V+GLD IE E + D+
Sbjct: 418 CHLYKDRTSNLAQENDRM-SEVLGVCKESDVVVAVLGLDAGIEGEEGDAGNEYGSGDKPD 476
Query: 501 LLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
L LPG Q+E++ K P ILVL+SG + V +A D + AI+ YPG GG AI
Sbjct: 477 LNLPGLQEEILEAAVSCGK-PVILVLLSGSALAVNWA--DEHVDAIVQGWYPGARGGAAI 533
Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGH 620
ADILFG +NP GKLP+T+Y T + + GRTYR+ + +YPFG+
Sbjct: 534 ADILFGEANPEGKLPVTFY---------RTTEELPDFEDYSMQGRTYRYMEQEALYPFGY 584
Query: 621 GMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVG 680
G+SYT + + V +G +T+G+ VKN G
Sbjct: 585 GLSYTEYAYQNVRFLEQEPVVSEG-----------------------VTIGLS--VKNTG 619
Query: 681 SKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTR 740
DG T+ V+ PH QL K+ + AG ++ + I + ++ + D +G +
Sbjct: 620 KMDGTETVQVYVKAEHSK-MPHGQLKKIVKLPLCAGEEKEINIRLESEAFM-LYDENGEK 677
Query: 741 RIPLGEHNIHIGGTK 755
+P G I +GG +
Sbjct: 678 ILPSGHFEIFVGGMQ 692
>gi|60729621|pir||JC7966 xylan 1,4-beta-xylosidase (EC 3.2.1.37) - Talaromyces emersonii
gi|21326570|gb|AAL32053.2|AF439746_1 beta-xylosidase [Rasamsonia emersonii]
Length = 796
Score = 438 bits (1126), Expect = e-120, Method: Compositional matrix adjust.
Identities = 271/740 (36%), Positives = 393/740 (53%), Gaps = 40/740 (5%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C S R L+ +L+E + + A VPRLG+ Y+ W+EALHG+
Sbjct: 64 CNTSADPWARAEALVSLFTLEELINNTQNTAPGVPRLGLPQYQVWNEALHGLDRANFSDS 123
Query: 103 FGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRD 162
G++ ATSFP I + +SFN TL I +++ +ARA N G GL ++PN+N FR
Sbjct: 124 --GEYSWATSFPMPILSMASFNRTLINQIASIIATQARAFNNAGRYGLDSYAPNINGFRS 181
Query: 163 PRWGRGQETPGEDP-VLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDR 220
P WGRGQETPGED LS YA Y+ GLQG D + +K+ A+ KHF YDL+NW V R
Sbjct: 182 PLWGRGQETPGEDAFFLSSAYAYEYITGLQGGVDPEHVKIVATAKHFAGYDLENWGNVSR 241
Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
NA +++QD+ + + F K S+MCSYN VNGVP+C++ L+ +R +
Sbjct: 242 LGSNAIITQQDLSEYYTPQFLASARYAKTRSLMCSYNAVNGVPSCSNSFFLQTLLRESFN 301
Query: 281 L--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLL 338
+GY+ SDCD+V ++ + AAAD++ AG D+DCG + H + +
Sbjct: 302 FVDDGYVSSDCDAVYNVFNPHGYALNQSGAAADSLLAGTDIDCGQTMPWHLNESFYERYV 361
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
S DI +L +RLG FDG S Y +L DV T D ++ EAA +GI LLK
Sbjct: 362 SRGDIEKSLTRLYANLVRLGYFDGNNSV--YRNLNWNDVVTTDAWNISYEAAVEGITLLK 419
Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCK 458
N G +LPLS + R++A+IGP ++ TV M GNY G +PL+ T++
Sbjct: 420 NDG-TLPLSK-KVRSIALIGPWANATVQMQGNYYGTPPYLISPLEAAKASGFTVNYAFGT 477
Query: 459 DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMAS 518
+++ Q F AI A++++D I G+D +IEAE DR L PG Q +L+ ++S
Sbjct: 478 NISTDSTQWFAEAISAAKKSDVIIYAGGIDNTIEAEGQDRTDLKWPGNQLDLIEQLSKVG 537
Query: 519 KGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTW 578
K P +++ M GG +D + K + + A++W GYPGQ+GG A+ DIL G P G+L T
Sbjct: 538 K-PLVVLQMGGGQVDSSSLKANKNVNALVWGGYPGQSGGAALFDILTGKRAPAGRLVSTQ 596
Query: 579 YPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
YP EY T P +M +RP+ S PG+TY +Y G VY FGHG+ YT F + A A T
Sbjct: 597 YPAEYATQFPANDMNLRPNGSN--PGQTYIWYTGTPVYEFGHGLFYTEFQESAA-AGTNK 653
Query: 639 AVPLDGRHGSINATISGKAIKVTHAKCNRLT----LGVQVDVKNVGSKDGAHTLLVFSTP 694
LD + H + L V VDVKNVG +T L+F+
Sbjct: 654 TSTLDIL----------DLVPTPHPGYEYIELVPFLNVTVDVKNVGHTPSPYTGLLFANT 703
Query: 695 PAG-HWAPHKQLVAFEK---VHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIH 750
AG P+K LV F++ +H AQ + + ++ D +G + I GE+ +
Sbjct: 704 TAGPKPYPNKWLVGFDRLATIHPAKTAQVTFPVPLGA---IARADENGNKVIFPGEYELA 760
Query: 751 IGGTK-----HSVSLHAATL 765
+ + S++ +AATL
Sbjct: 761 LNNERSVVVSFSLTGNAATL 780
>gi|347832625|emb|CCD48322.1| glycoside hydrolase family 3 protein [Botryotinia fuckeliana]
Length = 772
Score = 436 bits (1122), Expect = e-119, Method: Compositional matrix adjust.
Identities = 276/704 (39%), Positives = 393/704 (55%), Gaps = 43/704 (6%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C S R LI +L EKV + + VPR+G+ YEWW+EALHG++ PGT
Sbjct: 34 CDTSSDPYTRAAALISLFTLAEKVNNTGNTSPGVPRIGLPSYEWWNEALHGIAR-SPGTT 92
Query: 103 F---GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNI 159
F G ++ +TSFPQ I ++F+ L + VS EARA N GL +W+PN+N
Sbjct: 93 FAATGSNYSYSTSFPQPILMGATFDDELIHKVATQVSTEARAFNNVNRFGLNFWTPNINP 152
Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD--RLKVAASCKHFTAYDLDNWNG 217
++DPRWGRGQETPGEDP + Y + + GLQG D K A+CKHF YDL++ +G
Sbjct: 153 YKDPRWGRGQETPGEDPFHTSSYVNALITGLQGGLDDLPYKKGVATCKHFAGYDLESSDG 212
Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
R+ F+A + QD+ D + PF+ C + V SVMCSYN +NGVPTCAD +L+ +R
Sbjct: 213 AIRYGFDAIIKSQDLRDYYLPPFQQCARDSNVQSVMCSYNAMNGVPTCADDWLLQTLLRE 272
Query: 278 EW---RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQ 334
W + ++ SDCD+V +D ++T TPE++AADA+ AG DLDCG F + SA
Sbjct: 273 HWGWTEEDQWVTSDCDAVKNIWDYHNYTLTPEQSAADALNAGTDLDCGTFWPTYLGSAYD 332
Query: 335 RGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGI 394
+GL ++ +L +RLG FD PS QPY L +V TP Q+LAL+AA GI
Sbjct: 333 QGLYDISTLDRSLARRYASLVRLGYFD-PPSVQPYRQLNWDNVSTPAAQQLALQAAEDGI 391
Query: 395 VLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL---QGIGRYART 451
VLLKN G LPLS VA+IGP ++ T M GNY G A +PL Q G +
Sbjct: 392 VLLKNDG-ILPLSS-NITNVALIGPLANATKQMQGNYYGTAPYLRSPLIAAQNAG--FKV 447
Query: 452 IHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELV 511
+ QG D+ + F AAI A++ AD I V G+D SIEAE +DR + P Q L+
Sbjct: 448 TYVQGA-DIDSQNTTDFSAAISAAQSADLVIYVGGIDNSIEAEEIDRTSISWPSSQLSLI 506
Query: 512 SKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPG 571
++++ S P I+ M G ID + ++ + A++WAGYPGQ GGTAI +IL G + P
Sbjct: 507 NQLANLST-PLIISQM-GCMIDSSSLLSNTGVNALLWAGYPGQDGGTAIFNILTGKTAPA 564
Query: 572 GKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTV 631
G+LP+T YP Y+ + MT+M ++PS+ PGRTY++Y G V+ +G+G+ YT F +
Sbjct: 565 GRLPITQYPSNYVNQVTMTDMNLQPSRFN--PGRTYKWYNGEPVFEYGYGLQYTTFDAKI 622
Query: 632 ANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT--LGVQVDVKNVGSKDGAHTLL 689
+ S N T + + LT + + + V N G+ + L
Sbjct: 623 TPS-------------SPNNTFEISELLANASNYKDLTPFVKIPITVSNTGTTTSDYVAL 669
Query: 690 VF---STPPAGHWAPHKQLVAFEKVH-VPAGAQQRVGINIHVCK 729
F + PA H P K LVA+ ++H + GA +++++
Sbjct: 670 FFLSGTFGPAPH--PKKSLVAYTRLHDITGGANATAEVSLNLAS 711
>gi|219887077|gb|ACL53913.1| unknown [Zea mays]
gi|224035251|gb|ACN36701.1| unknown [Zea mays]
gi|413919685|gb|AFW59617.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 405
Score = 436 bits (1122), Expect = e-119, Method: Compositional matrix adjust.
Identities = 219/407 (53%), Positives = 282/407 (69%), Gaps = 5/407 (1%)
Query: 355 MRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTV 414
MRLG FDG+P P+G+LGP DVCTP +QELA EAARQGIVLLKN G LPLS +++
Sbjct: 1 MRLGFFDGDPRELPFGNLGPSDVCTPSNQELAREAARQGIVLLKNTG-KLPLSATSIKSM 59
Query: 415 AVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQL-FGAAID 473
AVIGPN++ + TMIGNY G C YTTPLQG+G T++Q GC +V C+ + L AA
Sbjct: 60 AVIGPNANASFTMIGNYEGTPCKYTTPLQGLGANVATVYQPGCTNVGCSGNSLQLDAATK 119
Query: 474 ASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPID 533
A+ AD T+LV+G DQSIE E+LDR LLLPG+Q +LVS V+ AS GP ILV+MSGGP D
Sbjct: 120 AAASADVTVLVVGADQSIERESLDRTSLLLPGQQPQLVSAVANASSGPCILVVMSGGPFD 179
Query: 534 VAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMA 593
++FAK+ +IAAI+W GYPG+AGG AIAD+LFG NP G+LP+TWYP+ + T +PMT+M
Sbjct: 180 ISFAKSSDKIAAILWVGYPGEAGGAAIADVLFGYHNPSGRLPVTWYPESF-TKVPMTDMR 238
Query: 594 MRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATI 653
MRP S YPGRTYRFY G VY FG G+SYT+F H + +AP +A+ L H + T
Sbjct: 239 MRPDPSTGYPGRTYRFYTGDTVYAFGDGLSYTSFAHHLVSAPKQLALQLAEGHACL--TE 296
Query: 654 SGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHV 713
+++ A C L V + V+N G + G HT+ +FS+PPA H AP K L+ FEKV +
Sbjct: 297 QCPSVEAEGAHCEGLAFDVHLRVRNAGERSGGHTVFLFSSPPAVHNAPAKHLLGFEKVSL 356
Query: 714 PAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
G V + VCK LSVVD G R++ LG H +H+G KH+++L
Sbjct: 357 EPGQAGVVAFKVDVCKDLSVVDELGNRKVALGSHTLHVGDLKHTLNL 403
>gi|348604625|dbj|BAK96214.1| beta-xylosidase [Acremonium cellulolyticus]
Length = 797
Score = 436 bits (1121), Expect = e-119, Method: Compositional matrix adjust.
Identities = 262/723 (36%), Positives = 393/723 (54%), Gaps = 37/723 (5%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C S R LI +L+E + + A VPRLG+ Y+ WSEALHG+ T
Sbjct: 64 CDTSANYVDRAEGLIALFTLEELINNTQNTAPGVPRLGLPPYQVWSEALHGLDRANFATS 123
Query: 103 FGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRD 162
G ++ ATSFP I + ++ N TL I ++ +ARA N G GL ++PN+N FR
Sbjct: 124 -GDEWTWATSFPMPILSMAALNRTLINQIAGIIGTQARAFNNAGRYGLDAYAPNINGFRS 182
Query: 163 PRWGRGQETPGEDP-VLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDR 220
P WGRGQETPGED LS YA Y+ GLQG D D LKV A+ KHF YDL+NW G R
Sbjct: 183 PLWGRGQETPGEDANFLSSSYAYEYITGLQGGVDPDHLKVVATAKHFAGYDLENWGGNSR 242
Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
F+A +++QD+ + + F K S MCSYN VNGVP+C+ +L+ +R W
Sbjct: 243 LGFDASITQQDLAEYYTPQFLAASRYAKARSFMCSYNSVNGVPSCSSSFLLQTLLRDNWD 302
Query: 281 L--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLL 338
GY+ SDCD+V ++ + S AAAD++RAG D+DCG + + G +
Sbjct: 303 FPEYGYVSSDCDAVYNVFNPHGYASNQSAAAADSLRAGTDIDCGQTYPWNLNQSFIEGSV 362
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
+ +I ++V + ++LG FDG+ S Y LG DV T D ++ EAA +GIVLLK
Sbjct: 363 TRGEIERSIVRLYSNLVKLGYFDGDKSE--YRQLGWNDVVTTDAWNISYEAAVEGIVLLK 420
Query: 399 NQGPSLPLS-HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGC 457
N G LPLS H+ +++A+IGP ++ T + GNY G A TPLQG ++
Sbjct: 421 NDG-ILPLSKHV--KSIALIGPWANATEQLQGNYYGTAPYLITPLQGASDAGYKVNYALG 477
Query: 458 KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMA 517
++ + F A+ A++++D + + G+D +IEAE DR + PG Q +L+ ++S
Sbjct: 478 TNILGNTTEGFADALSAAKKSDVIVYLGGIDNTIEAEGTDRMNVTWPGNQLDLIQQLSQT 537
Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
K P +++ M GG +D + K + ++ A++W GYPGQ+GGTAI DIL G P G+L T
Sbjct: 538 GK-PLVVLQMGGGQVDSSSIKANSKVNALVWGGYPGQSGGTAIFDILSGKRVPAGRLVTT 596
Query: 578 WYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTV 637
YP EY T P T+M +RP + PG+TY +Y G VY FG+G+ YT F T
Sbjct: 597 QYPAEYATQFPATDMNLRPDGASN-PGQTYMWYTGTPVYDFGYGLFYTTFKETAQ----- 650
Query: 638 VAVPLDGRHGSINATISG--KAIKVTHAKCNRLT--LGVQVDVKNVGSKDGAHTLLVFST 693
+ GS + IS A + + + L + + +KN G +T ++F+
Sbjct: 651 -------KLGSSSFDISEIVAAPRSPSYEYSELVPFVNITATIKNTGKTASPYTAMLFAN 703
Query: 694 P----PAGHWAPHKQLVAFEKV-HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHN 748
PA + P+K LV ++++ + G + I + + ++ VD +G R + G++
Sbjct: 704 TTNAGPAPY--PNKWLVGYDRLASIEPGKSADLVIPVPIGA-IARVDENGNRIVYPGDYQ 760
Query: 749 IHI 751
+ +
Sbjct: 761 LAL 763
>gi|421077748|ref|ZP_15538711.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
JBW45]
gi|392524151|gb|EIW47314.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
JBW45]
Length = 750
Score = 436 bits (1120), Expect = e-119, Method: Compositional matrix adjust.
Identities = 270/752 (35%), Positives = 398/752 (52%), Gaps = 85/752 (11%)
Query: 34 DATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHG 93
D + +L QR DL+ R++L+EKV ++ + A+PRLG+ Y WWSEALHG
Sbjct: 24 DERMEIFDYQDETLSFEQRAKDLVSRMTLEEKVTQMVYISPAIPRLGVPAYNWWSEALHG 83
Query: 94 VSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------G 145
V+ G AT FPQ I A++F+ L + V+S E RA ++ G
Sbjct: 84 VARAGV----------ATVFPQAIGLAATFDEKLIHDVAEVISIEGRAKFHEFQRKGDHG 133
Query: 146 GTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCK 205
GLT+WSPNVNIFRDPRWGRGQET GEDP L+G+ S+++GLQG D L+ AA K
Sbjct: 134 IYKGLTFWSPNVNIFRDPRWGRGQETFGEDPYLTGRLGVSFIKGLQGQDKKYLRAAACAK 193
Query: 206 HFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTC 265
HF + +R F+A VS +D+ +T+ F+ CV E V +VM +YN+VNG P C
Sbjct: 194 HFAVHSGPE---SERHSFDAVVSPKDLRETYLPAFKECVKEANVEAVMGAYNRVNGEPCC 250
Query: 266 ADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFL 325
+LK T+R EW G++VSDC ++ +++ TS+ E+ A A+ G DL+CG +
Sbjct: 251 GSNMLLKETLRQEWGFTGHVVSDCWAIKDFHENHRVTSSAPESVALALNNGCDLNCGN-M 309
Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
L+ A Q GL++E IN A+ + +M+LG+FD + PY ++G +H+E
Sbjct: 310 YLNLLIAYQEGLVTEEAINTAVTRLMLTRMKLGLFD-TAENVPYTNIGFHQNDCQEHREF 368
Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
ALE +++ +VLLKN+ LPL ++AVIGPN++ + GNY G A Y T L+GI
Sbjct: 369 ALEVSKKTLVLLKNENNLLPLDRNTISSIAVIGPNANSREALTGNYCGTASNYITVLEGI 428
Query: 446 ----GRYARTIHQQGC-------KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE 494
G+ + QGC +++ A D+ F A+ + +AD ++ MGLD SIE E
Sbjct: 429 REAVGKDTIVSYAQGCHLYRDKAENLGEARDR-FAEAVSTAERADIVVMCMGLDASIEGE 487
Query: 495 ---------ALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAA 545
+ D+ GL LPG QQEL+ + K P ILVL++G + V +A ++ A
Sbjct: 488 EGDVSNEYASGDKLGLNLPGLQQELLEVIYQTGK-PIILVLLAGSALAVTWAAE--KVPA 544
Query: 546 IIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPG 604
II A YPG GG A+A +FG +P GKLP+T+Y LP T+ +M+
Sbjct: 545 IIQAWYPGAEGGKALASAIFGEYSPVGKLPITFYRT--TEELPEFTDYSMK--------N 594
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
RTYR+ +YPFG+G+ YT F + R +N T V +
Sbjct: 595 RTYRYMTKEALYPFGYGLGYTTFAY---------------RQLQLNRTQISAGENV---Q 636
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGI 723
C+ L VKN G+ T+ L A P +L +KVH+ G +Q V
Sbjct: 637 CSVL-------VKNTGNFASDETVQLYIKDVKASVEVPILELQGIQKVHLLPGTEQEVFF 689
Query: 724 NIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTK 755
+ + L++++ G + G I++GG +
Sbjct: 690 TL-TPRQLALINEEGNCILEPGAFEIYVGGCQ 720
>gi|340519849|gb|EGR50086.1| glycoside hydrolase family 3 [Trichoderma reesei QM6a]
Length = 796
Score = 435 bits (1119), Expect = e-119, Method: Compositional matrix adjust.
Identities = 277/726 (38%), Positives = 404/726 (55%), Gaps = 51/726 (7%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C + I +R ++ ++L EKV + S A+ RLG+ Y+W +EALHGV+ G +
Sbjct: 75 CDTTKSIAERAAAIVKPMTLNEKVANVGSSASGSARLGLPAYQWQNEALHGVAG-STGVQ 133
Query: 103 F----GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVN 158
F G +F ATSFP I +++F+ L +++ +S EARA N G AGL +W+PN+N
Sbjct: 134 FQSPLGANFSAATSFPMPILLSAAFDDALVKSVATAISTEARAFANYGFAGLDFWTPNIN 193
Query: 159 IFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNG 217
FRDPRWGRG ETPGED Y + V GLQG D D + ++CKHF AYD++N
Sbjct: 194 PFRDPRWGRGMETPGEDAFRIQGYVLALVDGLQGGIDPDFYRTLSTCKHFAAYDIEN--- 250
Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
R N ++QD+ D + F CV + KVAS+MC+YN V+GVP CAD +L+ +R
Sbjct: 251 -GRTANNLSPTQQDMADYYLPMFETCVRDAKVASIMCAYNAVDGVPACADSYLLQDVLRD 309
Query: 278 EWRLN---GYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQ 334
+ Y+VSDCD+V +D H+ + +AAA +I AG DLDCG + S VQ
Sbjct: 310 TYGFTEDFNYVVSDCDAVENVFDPHHYAANLTQAAAMSINAGTDLDCGSSYNVLNAS-VQ 368
Query: 335 RGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGI 394
GL +E ++ +L+ + +++G FD +P+ Y LG +V T Q LA +AA +G+
Sbjct: 369 AGLTTEATLDKSLIRLYSALVKVGYFD-QPAE--YNSLGWGNVNTTQSQALAHDAATEGM 425
Query: 395 VLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQ 454
LLKN G +LPLS VAVIGP ++VT M GNYAG A PL + R +
Sbjct: 426 TLLKNDG-TLPLSRTLSN-VAVIGPWANVTTQMQGNYAGTAPLLVNPLSVFQQKWRNVKY 483
Query: 455 QGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKV 514
+ D F AA+ A+ +D + + G+D S+E E DR+ + PG Q L+S++
Sbjct: 484 AQGTAINSQDTSGFNAALSAASSSDVIVYLGGIDISVENEGFDRSSITWPGNQLNLISQL 543
Query: 515 SMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKL 574
+ K P ++V GG ID + ++ ++ +I+WAGYPGQ GG AI D+L G + P G+L
Sbjct: 544 ANLGK-PLVIVQFGGGQIDDSALLSNSKVNSILWAGYPGQDGGNAIFDVLTGANPPAGRL 602
Query: 575 PMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANA 634
P+T YP Y+ N + +M +RPS PGRTY +Y G V PFG+G+ YTNF + +
Sbjct: 603 PVTQYPANYVNNNNIQDMNLRPSNG--IPGRTYAWYTGTPVLPFGYGLHYTNFSLSFQST 660
Query: 635 PTVVAVPLDGRHGSINATISGKAIKVTHAKCNR---LTLGVQVDVKNVGSK-----DGAH 686
T GS AT+ V +A N+ + V+VKN G K D
Sbjct: 661 KTA---------GSDIATL------VNNAGSNKDLATFATIVVNVKNTGGKANLASDYVG 705
Query: 687 TLLVFSTP--PAGHWAPHKQLVAFEKV-HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIP 743
L + ST PA H P+KQL A+ +V +V GA Q++ + +++ L+ D +G R I
Sbjct: 706 LLFLKSTNAGPAPH--PNKQLAAYGRVRNVGVGATQQLTLTVNLGS-LARADTNGDRWIY 762
Query: 744 LGEHNI 749
G + +
Sbjct: 763 PGAYTL 768
>gi|171678585|ref|XP_001904242.1| hypothetical protein [Podospora anserina S mat+]
gi|170937362|emb|CAP62020.1| unnamed protein product [Podospora anserina S mat+]
Length = 800
Score = 434 bits (1117), Expect = e-119, Method: Compositional matrix adjust.
Identities = 275/766 (35%), Positives = 403/766 (52%), Gaps = 48/766 (6%)
Query: 9 FLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKL 68
F G L SA + F + D T C +L P+R L+ L+ +EK++
Sbjct: 6 FWGASLTSAVVRAIDLPFQFYPNCVDDLLSTNQVCNTTLSPPERAAALVAALTPEEKLQN 65
Query: 69 LISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKF---GGDFPGATSFPQVITTASSFNA 125
++S + PR+G+ Y WWSEALHGV+ PGT+F G F +TSFP + A++F+
Sbjct: 66 IVSKSLGAPRIGLPAYNWWSEALHGVA-YAPGTQFWQGDGPFNSSTSFPMPLLMAATFDD 124
Query: 126 TLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAAS 185
L E I V+ E RA N G +GL YW+PNVN F+DPRWGRG ETPGED +L +YAA+
Sbjct: 125 ELLEKIAEVIGIEGRAFGNAGFSGLDYWTPNVNPFKDPRWGRGSETPGEDVLLVKRYAAA 184
Query: 186 YVRGLQGSDGDR-LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCV 244
++GL+G ++ +V A+CKH+ A D ++WNG R +FNAK+S QD+ + + +PF+ CV
Sbjct: 185 MIKGLEGPVPEKERRVVATCKHYAANDFEDWNGATRHNFNAKISLQDMAEYYFMPFQQCV 244
Query: 245 MEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRL---NGYIVSDCDSVGVYYDTQHF 301
+ +V S+MC+YN VNGVP+CA P +L+ +R W N YI SDC++V +
Sbjct: 245 RDSRVGSIMCAYNAVNGVPSCASPYLLQTILREHWNWTEHNNYITSDCEAVLDVSLNHKY 304
Query: 302 TSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFD 361
+T E A + AG+D C A +GLL E ++ AL+ +R G FD
Sbjct: 305 AATNAEGTAISFEAGMDTSCEYEGSSDIPGAWSQGLLKESTVDRALLRLYEGIVRAGYFD 364
Query: 362 GEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHI----RHRTVAVI 417
G+ S Y LG DV P Q+L+L+AA G VLLKN G +LPLS + R + VA+I
Sbjct: 365 GKQSL--YSSLGWADVNKPSAQKLSLQAAVDGTVLLKNDG-TLPLSDLLDKSRPKKVAMI 421
Query: 418 GPNSDVTVTMIGNYAGIACGYTTPLQGIGRY--------ARTIHQQGCKDVACADDQLFG 469
G SD + G Y+G A TP + +H + + D+
Sbjct: 422 GFWSDAKDKLRGGYSGTAAYLHTPAYAASQLGIPFSTASGPILHSDLASNQSWTDN---- 477
Query: 470 AAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSG 529
A+ A++ AD + G+D S E DR L PG Q L++ ++ SK ++VL G
Sbjct: 478 -AMAAAKDADYILYFGGIDTSAAGETKDRYDLDWPGAQLSLINLLTTLSK--PLIVLQMG 534
Query: 530 GPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM 589
+D ++P+I AI+WA +PGQ GGTA+ +++ G +P G+LP+T YP + +PM
Sbjct: 535 DQLDNTPLLSNPKINAILWANWPGQDGGTAVMELVTGLKSPAGRLPVTQYPSNFTELVPM 594
Query: 590 TEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANA-PTVVAVPLDGRHGS 648
T+MA+RPS GRTYR+YK P V FG G+ YT F P V+
Sbjct: 595 TDMALRPSAGNSQLGRTYRWYKTP-VQAFGFGLHYTTFSPKFGKKFPAVI---------D 644
Query: 649 INATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA---GHWAPHKQL 705
++ + G K C L V V+ N G++ + L F + P G W P K L
Sbjct: 645 VDEVLEGCDDKYLDT-CPLPDLPVVVE--NRGNRTSDYVALAFVSAPGVGPGPW-PIKTL 700
Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHI 751
AF ++ G ++R G L+ D G + G++ + +
Sbjct: 701 GAFTRLRGVKGGEKREGGLKWNLGNLARHDEEGNTVVYPGKYEVSL 746
>gi|392962219|ref|ZP_10327666.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
DSM 17108]
gi|392452977|gb|EIW29882.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
DSM 17108]
Length = 724
Score = 434 bits (1117), Expect = e-119, Method: Compositional matrix adjust.
Identities = 267/739 (36%), Positives = 395/739 (53%), Gaps = 83/739 (11%)
Query: 46 SLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGG 105
+L QR DL+ R++++EKV ++ + A+ RLGI Y WWSEALHGV+ G
Sbjct: 10 TLSFEQRAKDLVSRMTIEEKVTQMVYSSPAISRLGIPAYNWWSEALHGVARAGV------ 63
Query: 106 DFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYWSPNV 157
AT FPQ I A++F+ L + ++S EARA ++ G GLT+WSPNV
Sbjct: 64 ----ATVFPQAIGLAATFDEKLIYDVAEIISIEARAKFHEFQRKGDHGIYKGLTFWSPNV 119
Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
NIFRDPRWGRGQET GEDP L+G+ S+++GLQG D L+ AA KHF +
Sbjct: 120 NIFRDPRWGRGQETFGEDPYLTGRLGVSFIKGLQGQDKKYLRAAACAKHFAVHSGPE--- 176
Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
+R F+A VS +D+ +T+ F+ CV E V +VM +YN+VNG P C +LK T+R
Sbjct: 177 SERHRFDAVVSPKDLRETYLPAFKECVKEANVEAVMGAYNRVNGEPCCGSNILLKETLRQ 236
Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGL 337
EW G++VSDC ++ +++ TS+ E+ A A+ G DL+CG + L+ A Q GL
Sbjct: 237 EWGFTGHVVSDCWAIKDFHENHRVTSSAPESVALALNNGCDLNCGN-MYLNLLIAYQEGL 295
Query: 338 LSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLL 397
++E IN A+ + +M+LG+FD + PY ++G +H+E ALE +++ +VLL
Sbjct: 296 VTEEAINTAVTRLMLTRMKLGLFDA-AENVPYTNIGFHQNDCQEHREFALEVSKKTLVLL 354
Query: 398 KNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIH 453
KN+ LPL ++AVIGPN++ + GNY G A Y T L+GI G+ +
Sbjct: 355 KNENHLLPLDRNTISSIAVIGPNANSREALTGNYFGTASNYITVLEGIREAVGKDTMVSY 414
Query: 454 QQGC---KDVA---CADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDR 498
QGC +D A + F A+ + +AD ++ MGLD SIE E + D+
Sbjct: 415 AQGCHLYRDKAENLGEERDRFAEAVSTAERADLVVMCMGLDASIEGEEGDVSNEYASGDK 474
Query: 499 AGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGT 558
GL LPG QQEL+ + K P ILVL++G + V +A ++ AII A YPG GG
Sbjct: 475 LGLNLPGLQQELLEVIYKTGK-PIILVLLAGSALAVTWAAE--KVPAIIQAWYPGAEGGK 531
Query: 559 AIADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYP 617
A+A +FG +P GKLP+T+Y LP T+ +M+ RTYR+ +YP
Sbjct: 532 ALASAIFGEYSPVGKLPITFYRT--TEELPEFTDYSMK--------NRTYRYMTKEALYP 581
Query: 618 FGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVK 677
FG+G+ YT F + R +N T K + +C+ L VK
Sbjct: 582 FGYGLGYTTFAY---------------RQLQLNRT---KICAGENVQCSIL-------VK 616
Query: 678 NVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDR 736
N G+ T+ L A P L +K+H+ GA+Q + + + L++++
Sbjct: 617 NTGNFASDETVQLYIKDVKASVEVPIWALQGIQKIHLLPGAEQEISFTL-TSRQLALINE 675
Query: 737 SGTRRIPLGEHNIHIGGTK 755
G + G I++GG +
Sbjct: 676 KGNCILEPGIFEIYVGGCQ 694
>gi|421060771|ref|ZP_15523202.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
B3]
gi|421065248|ref|ZP_15527033.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
A12]
gi|421073214|ref|ZP_15534285.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
A11]
gi|392444242|gb|EIW21677.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
A11]
gi|392454445|gb|EIW31278.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
B3]
gi|392459366|gb|EIW35779.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
A12]
Length = 724
Score = 434 bits (1115), Expect = e-118, Method: Compositional matrix adjust.
Identities = 267/740 (36%), Positives = 393/740 (53%), Gaps = 85/740 (11%)
Query: 46 SLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGG 105
+L QR DL+ R++L+EKV ++ + A+PRLG+ Y WWSEALHGV+ G
Sbjct: 10 TLSFEQRAKDLVSRMTLEEKVTQMVYISPAIPRLGVPAYNWWSEALHGVARAGV------ 63
Query: 106 DFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYWSPNV 157
AT FPQ I A++F+ L + V+S E RA ++ G GLT+WSPNV
Sbjct: 64 ----ATVFPQAIGLAATFDEKLIFNVAEVISIEGRAKFHEFQRKGDHGIYKGLTFWSPNV 119
Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
NIFRDPRWGRGQET GEDP L+G+ S+++GLQG D L+ AA KHF +
Sbjct: 120 NIFRDPRWGRGQETFGEDPYLTGRLGVSFIKGLQGQDKKYLRAAACAKHFAVHSGPE--- 176
Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
+R F+A VS +D+ +T+ F+ CV E V +VM +YN+VNG P C +LK T+R
Sbjct: 177 SERHSFDAVVSPKDLRETYLPAFKECVKEANVEAVMGAYNRVNGEPCCGSNMLLKETLRR 236
Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGL 337
EW G++VSDC ++ +++ TS+ E+ A A+ G DL+CG + L+ A Q GL
Sbjct: 237 EWGFTGHVVSDCWAIKDFHENHRVTSSAPESVAMALNNGCDLNCGN-MYLNLLIAYQEGL 295
Query: 338 LSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLL 397
++E IN A+ + +M+LG+FD + PY +G +H+E ALE +++ +VLL
Sbjct: 296 VTEEAINTAVTRLMLTRMKLGLFD-TAENVPYTKIGFHQNDCQEHREFALEVSKKTLVLL 354
Query: 398 KNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIH 453
KN+ LPL ++AVIGPN++ + GNY G A Y T L+GI G+ +
Sbjct: 355 KNENNLLPLDRNTISSIAVIGPNANSREALTGNYCGTASNYITVLEGIREAVGKDTMVSY 414
Query: 454 QQGC-------KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALD 497
QGC +++ A D+ F A+ + +AD ++ MGLD SIE E + D
Sbjct: 415 AQGCHLYRDKAENLGEARDR-FAEAVSTAERADIVVMCMGLDASIEGEEGDVSNEYASGD 473
Query: 498 RAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGG 557
+ GL LPG QQEL+ + K P ILVL++G + V +A +I AII A YPG GG
Sbjct: 474 KLGLNLPGLQQELLEVIYQTGK-PIILVLLAGSALAVTWAAE--KIPAIIQAWYPGAEGG 530
Query: 558 TAIADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVY 616
A+A +FG +P GKLP+T+Y LP T+ +M+ RTYR+ +Y
Sbjct: 531 KALASAIFGEYSPVGKLPITFY--RTTEELPEFTDYSMK--------NRTYRYMTKEALY 580
Query: 617 PFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDV 676
PFG+G+ YT F + + L+ S+ + G + V
Sbjct: 581 PFGYGLGYTTFAYR--------QLQLNRTQISVGENVQGSVL-----------------V 615
Query: 677 KNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD 735
KN G+ T+ L A P L +KVH+ G +Q V + + L++++
Sbjct: 616 KNTGNFASDETVQLYIKDVKASVEVPIWALQGIQKVHLLPGTEQEVFFTL-TPRQLALIN 674
Query: 736 RSGTRRIPLGEHNIHIGGTK 755
G + G I++GG +
Sbjct: 675 EEGNCILEPGVFEIYVGGCQ 694
>gi|310792973|gb|EFQ28434.1| glycosyl hydrolase family 3 N terminal domain-containing protein
[Glomerella graminicola M1.001]
Length = 728
Score = 434 bits (1115), Expect = e-118, Method: Compositional matrix adjust.
Identities = 258/709 (36%), Positives = 389/709 (54%), Gaps = 32/709 (4%)
Query: 60 LSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGD---FPGATSFPQV 116
+S++EKV+ L+ +A V LG+ + WW+E LHGV PG F D F ATSFP
Sbjct: 1 MSVEEKVRNLVDASAGVKSLGLPPHGWWNEGLHGVG-FSPGVLFAQDSEPFGYATSFPLP 59
Query: 117 ITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDP 176
I TA+SF+ L+ AIG+V+ E RA N G AG +W+PN+N FRDPRWGRGQETPGED
Sbjct: 60 ILTAASFDDDLFNAIGQVIGREGRAFSNYGYAGFNFWTPNMNAFRDPRWGRGQETPGEDV 119
Query: 177 VLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTF 236
++ Y SYV GLQGSD + A+CKHF AYD++ R N ++QD++D +
Sbjct: 120 LVVSNYVQSYVTGLQGSDPTDKVIIAACKHFAAYDIE----TARRANNYNPTQQDLQDYY 175
Query: 237 DVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNG---YIVSDCDSVG 293
FR CV + V +VMCSYN V+G+P C+ +LK +R W ++VSDC +V
Sbjct: 176 LPAFRRCVRDSHVGTVMCSYNSVDGIPACSSEYLLKEVLRDTWGFTNDYQFVVSDCGAVT 235
Query: 294 VYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTV 353
+ +FT+T ++AA+ ++ AG DL+CG LH ++ +++ ++ AL
Sbjct: 236 DVWLLHNFTNTEQDAASVSMAAGTDLECGSSY-LHLNGSLADKQVTQERVDEALTRLYKA 294
Query: 354 QMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRT 413
+G FDG S LG DV T D Q++A EAAR G+ LLKN G LPL+ ++++
Sbjct: 295 LFTVGYFDGSSHSS----LGWSDVSTIDAQQIACEAARAGMTLLKNDG-VLPLADGKYKS 349
Query: 414 VAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYAR-TIHQQGCKDVACADDQLFGAAI 472
VA+IGP ++ T M GNY G A +PL + + ++ D+ D F A+
Sbjct: 350 VALIGPFANATTQMQGNYFGRAPFVRSPLWAFTQQSSLQVNYAAGTDINSTSDSGFADAL 409
Query: 473 DASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPI 532
A++ +D I G+D +IEAE LDR + PG Q +L+S++SM K P ++ GG +
Sbjct: 410 AAAKNSDIVIFCGGIDTTIEAETLDRVSITWPGNQLDLISQLSMLGK-PLVVAQFGGGQV 468
Query: 533 DVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEM 592
D ++ + A+ WAG PGQAGG A+ D++ G ++ G+LP T YP Y + + +
Sbjct: 469 DDTALVDNANVNALFWAGLPGQAGGLAMYDLVVGKASFAGRLPTTQYPASYADLVSIFNI 528
Query: 593 AMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINAT 652
+RP+ + +PGRTY++Y G V+PFG G+ YT F T + L+ + N
Sbjct: 529 NLRPNGT--FPGRTYKWYIGEPVFPFGFGLHYTKFNFTWKDT-------LEPTYDISNII 579
Query: 653 ISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHT-LLVFSTPPAGHWA-PHKQLVAFEK 710
++ H V V VKNVG+ + LL S+ AG P+K L ++ +
Sbjct: 580 SWARSQNNGHVTDTTPFTSVNVTVKNVGNVRSDYVGLLFLSSKNAGPVPRPNKSLASYSR 639
Query: 711 VH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSV 758
H + GA ++ + + + + + D G I G++ + + K V
Sbjct: 640 AHDIETGASDQLTLKLTLGSF-ARSDSQGNLTIFPGDYKLELDNDKSLV 687
>gi|67902828|ref|XP_681670.1| hypothetical protein AN8401.2 [Aspergillus nidulans FGSC A4]
gi|74592887|sp|Q5ATH9.1|BXLB_EMENI RecName: Full=Exo-1,4-beta-xylosidase bxlB; AltName:
Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
Flags: Precursor
gi|40747867|gb|EAA67023.1| hypothetical protein AN8401.2 [Aspergillus nidulans FGSC A4]
gi|259484335|tpe|CBF80465.1| TPA: beta-1,4-xylosidase (Eurofung) [Aspergillus nidulans FGSC A4]
Length = 763
Score = 433 bits (1114), Expect = e-118, Method: Compositional matrix adjust.
Identities = 267/719 (37%), Positives = 386/719 (53%), Gaps = 29/719 (4%)
Query: 40 LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNV-G 98
LP C SL +R L+ L+L+EK+ AA RLG+ Y WW+EALHGV+ G
Sbjct: 36 LPICDTSLSPLERAKSLVSALTLEEKINNTGHEAAGSSRLGLPAYNWWNEALHGVAEKHG 95
Query: 99 PGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVN 158
+ GDF ATSFP I ++FN L + ++S EARA N AG+ YW+PNVN
Sbjct: 96 VSFEESGDFSYATSFPAPIVLGAAFNDALIRRVAEIISTEARAFSNSDHAGIDYWTPNVN 155
Query: 159 IFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGV 218
F+DPRWGRGQETPGEDP+ +Y +V GLQG D ++ KV A+CKH AYDL+ W GV
Sbjct: 156 PFKDPRWGRGQETPGEDPLHCSRYVKEFVGGLQGDDPEKPKVVATCKHLAAYDLEEWGGV 215
Query: 219 DRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGE 278
RF F+AKVS D+ + + PF+ C ++ V + MCSYN +NGVP CAD +L+ +R
Sbjct: 216 SRFEFDAKVSAVDLLEYYLPPFKTCAVDASVGAFMCSYNALNGVPACADRYLLQTVLREH 275
Query: 279 WRLNG---YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQR 335
W G ++ DC +V H+ + EAAA A+ AG+DLDCG +L + A ++
Sbjct: 276 WGWEGPGHWVTGDCGAVERIQTYHHYVESGPEAAAAALNAGVDLDCGTWLPSYLGEAERQ 335
Query: 336 GLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIV 395
GL+S ++ AL T ++LG FD QP LG DV T + +ELA A QG V
Sbjct: 336 GLISNETLDAALTRLYTSLVQLGYFD-PAEGQPLRSLGWDDVATSEAEELAKTVAIQGTV 394
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQ 455
LLKN +LPL + T+A+IGP + T + NYAG A T ++ R +
Sbjct: 395 LLKNIDWTLPLK--ANGTLALIGPFINFTTELQSNYAGPAKHIPTMIEAAERLGYNVLTA 452
Query: 456 GCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVS 515
+V F A+ + +ADA I G+D ++E E+LDR + PG Q+EL+ +++
Sbjct: 453 PGTEVNSTSTDGFDDALAIAAEADALIFFGGIDNTVEEESLDRTRIDWPGNQEELILELA 512
Query: 516 MASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLP 575
+ P +V GG +D + + AI+WAGYP QAGG + D+L G + P G+LP
Sbjct: 513 ELGR-PLTVVQFGGGQVDDSALLASAGVGAIVWAGYPSQAGGAGVFDVLTGKAAPAGRLP 571
Query: 576 MTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAP 635
+T YP+ Y+ +PMT+M ++P PGRTYR+Y+ V+ PFG G+ YT F + A
Sbjct: 572 ITQYPKSYVDEVPMTDMNLQPGTDN--PGRTYRWYEDAVL-PFGFGLHYTTFNVSWAKK- 627
Query: 636 TVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPP 695
G +A + + + +L V N G + LVF++ P
Sbjct: 628 ---------AFGPYDAATLARGKNPSSNIVDTFSLA----VTNTGDVASDYVALVFASAP 674
Query: 696 --AGHWAPHKQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHI 751
AP K LV + + + G ++V + + V L+ G + GE+ + +
Sbjct: 675 ELGAQPAPIKTLVGYSRASLIKPGETRKVDVEVTVAP-LTRATEDGRVVLYPGEYTLLV 732
>gi|291528382|emb|CBK93968.1| Beta-glucosidase-related glycosidases [Eubacterium rectale M104/1]
Length = 714
Score = 433 bits (1113), Expect = e-118, Method: Compositional matrix adjust.
Identities = 249/604 (41%), Positives = 357/604 (59%), Gaps = 63/604 (10%)
Query: 56 LIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQ 115
L+ ++++ EK+ ++ + A+ RLGI Y WW+EALHGV+ G AT FPQ
Sbjct: 12 LVSQMTIDEKISQMLYESPAIERLGIPEYNWWNEALHGVARAGV----------ATVFPQ 61
Query: 116 VITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYWSPNVNIFRDPRWGR 167
I A++F+A L E IG VVS E R +N G GLT+W+PNVNIFRDPRWGR
Sbjct: 62 AIGLAAAFDADLIEKIGDVVSTEGRGKFNEFSKKGDHGIYKGLTFWAPNVNIFRDPRWGR 121
Query: 168 GQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--RFHFNA 225
G ET GEDP L+GK +Y+RGLQG D D LK AA KHF + +G + R F+A
Sbjct: 122 GHETYGEDPYLTGKLGCAYIRGLQGDDPDHLKSAACAKHFAVH-----SGPEAIRHEFDA 176
Query: 226 KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYI 285
K SK D+ DT+ F+ CV + KV +VM +YN+VNG P C +LK +R E+ G++
Sbjct: 177 KASKHDMYDTYLYAFKRCVKDAKVEAVMGAYNRVNGEPACGSRTLLKDILRDEFGFEGHV 236
Query: 286 VSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINN 345
VSDC ++ +++ H T T EE+AA A+ G DL+CG LH + A +GL+S+ I
Sbjct: 237 VSDCWAILDFHEHHHVTDTVEESAAMAVNNGCDLNCGSAF-LHLKDAYDKGLVSDEAITA 295
Query: 346 ALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLP 405
A+ + V++RLGM PS PY + + V +H EL++EAAR+ +VLLKN+ LP
Sbjct: 296 AVERLMEVRIRLGMMKDYPS--PYEDISYEVVECKEHVELSVEAARRSLVLLKNKDNFLP 353
Query: 406 LSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY----ARTIHQQGC---K 458
L +T+AVIGPN++ +IGNY G + Y TPL+G+ +Y R ++ +GC K
Sbjct: 354 LDRKNVKTIAVIGPNANSRDALIGNYYGTSSRYITPLEGLQQYLGDDTRVLYAEGCHLYK 413
Query: 459 DVA---CADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRAGLLLPGR 506
D + F A+ + Q+D ++ +GLD +IE E + D+ GL+LPG
Sbjct: 414 DKVQGLAEEKDRFKEALIMAEQSDVVVMCLGLDATIEGEEGDAGNEYASGDKLGLMLPGL 473
Query: 507 QQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFG 566
Q+EL+ V+ K P ILVL +G ID+++A+ + AII + YPG GG A+A+ +FG
Sbjct: 474 QEELLEAVAAVGK-PVILVLSAGSAIDLSWAEE--HVDAIIDSWYPGARGGKAVAEAIFG 530
Query: 567 TSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSY- 624
+P GKLP+T+Y + NLP T+ +M RTYR+ V+YPFG+G+ Y
Sbjct: 531 EYSPSGKLPVTFY--QGTENLPEFTDYSM--------AHRTYRYTNENVLYPFGYGLHYG 580
Query: 625 -TNF 627
TN+
Sbjct: 581 ETNY 584
>gi|367046937|ref|XP_003653848.1| glycoside hydrolase family 3 protein [Thielavia terrestris NRRL
8126]
gi|347001111|gb|AEO67512.1| glycoside hydrolase family 3 protein [Thielavia terrestris NRRL
8126]
Length = 923
Score = 432 bits (1112), Expect = e-118, Method: Compositional matrix adjust.
Identities = 275/729 (37%), Positives = 397/729 (54%), Gaps = 37/729 (5%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
P C SLPI RV L+G+L+LQEK+ L+ GA+ R+G+ YEWWSEALHGV+ PG
Sbjct: 163 PACNTSLPIADRVRWLVGQLTLQEKITNLVDGASGSARVGLPPYEWWSEALHGVA-ASPG 221
Query: 101 TKFGGD----FPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPN 156
F G F ATSFP IT +++F+ L I VV E RA N G +G +W+PN
Sbjct: 222 VTFAGPNGTAFSYATSFPMPITISAAFDDDLVSQIAAVVGREGRAFANHGLSGFDFWTPN 281
Query: 157 VNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWN 216
+N FRDPRWGRG ETPGED +Y + GLQGSD ++ A+CKH+ YD++
Sbjct: 282 INPFRDPRWGRGPETPGEDAFRIQQYIRHLIPGLQGSDPLDKQIIATCKHYAVYDVE--- 338
Query: 217 GVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIR 276
R+ ++ D+ + + PF+ CV + + SVMCSYN V+G+P CA +L+ +R
Sbjct: 339 -TGRYEYDYDPQPHDLAEYYLAPFKTCVRDVGIGSVMCSYNAVDGIPACASEYLLQSVLR 397
Query: 277 GEWRLN---GYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAV 333
W Y+VSDCD+V Y +FT +P AAA A+ AG DL+CG L+ ++
Sbjct: 398 DHWGFTEPYQYVVSDCDAVRFIYSPHNFTDSPAAAAAVALNAGTDLECGSTY-LNLNQSL 456
Query: 334 QRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQG 393
+ +E ++ AL T +G FDG S YG LG V T D Q LA +AA G
Sbjct: 457 ASNMTTEAALDRALTRLYTALHTIGFFDG---SARYGGLGWDAVGTGDAQVLAYQAAVDG 513
Query: 394 IVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY--ART 451
VLLKN+ LPL R R +AVIGP ++ T M GNY G A +PL A
Sbjct: 514 AVLLKNEKSLLPLDSKRLRKLAVIGPWANATTQMQGNYFGQAAYLVSPLAAFQSAWGADN 573
Query: 452 IHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELV 511
+ +A F AA+ A++ ADA + + G+D S+E+E+LDR + PG Q +L+
Sbjct: 574 VLFANGTGIAGNSTAGFAAALAAAKAADAVVFLGGVDNSVESESLDRTAISWPGNQLDLI 633
Query: 512 SKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPG 571
++++ K P ++V GG +D + +PR+ A++WAGYPGQAGG AIAD+L G P
Sbjct: 634 AQLAAVGK-PLVVVQCGGGQLDDSALLANPRVGALLWAGYPGQAGGAAIADLLTGKQAPA 692
Query: 572 GKLPMTWYPQEYITNLPMTEMAMRPSQS------KRYPGRTYRFYKGPVVYPFGHGMSYT 625
G+LP+T Y Y + + + + ++RP +S +PGRTY++Y G V PFG+G+ YT
Sbjct: 693 GRLPVTQYAASYTSEVSLFDPSLRPRRSGGSKSHSTFPGRTYKWYTGKPVLPFGYGLHYT 752
Query: 626 NFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGA 685
F A+ P A + G + N T + A A L V V V N G
Sbjct: 753 TFRTAWADEPRGRAYDIAGLFPA-NTTTTSSAFS---AADTYPVLNVSVTVTNTGRGASD 808
Query: 686 HTLLVF----STPPAGHWAPHKQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTR 740
+ L+F + PA + P+K LV + + + G+ R+ + + + L+ D G R
Sbjct: 809 YVGLLFLRTRNAGPAPY--PNKWLVGYARARGLAPGSSARLELAVALGS-LARADEDGRR 865
Query: 741 RIPLGEHNI 749
+ G++ +
Sbjct: 866 VVYPGDYEL 874
>gi|367053033|ref|XP_003656895.1| glycoside hydrolase family 3 protein [Thielavia terrestris NRRL
8126]
gi|347004160|gb|AEO70559.1| glycoside hydrolase family 3 protein [Thielavia terrestris NRRL
8126]
Length = 758
Score = 432 bits (1111), Expect = e-118, Method: Compositional matrix adjust.
Identities = 273/727 (37%), Positives = 402/727 (55%), Gaps = 40/727 (5%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLI------SGAAAVPRLGIKGYEWWSEALHGVSN 96
C + P+R L+ +++ EK+ L+ S + PRLG+ YEWWSEALHGV+
Sbjct: 11 CDTTASPPKRAAALVEAMNITEKLANLVEYVMARSSSKGAPRLGLPPYEWWSEALHGVA- 69
Query: 97 VGPGTKF---GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYW 153
PG F GG F ATSF IT +++F+ L + + V+S EARA N G+AGL +W
Sbjct: 70 ASPGVSFNWSGGPFSYATSFANPITLSAAFDDELVQKVADVISTEARAFANAGSAGLDFW 129
Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLD 213
+PN+N +RDPRWGRG ETPGEDPV Y S +RGL+G + + KV A+CKH+ AYDL+
Sbjct: 130 TPNINPWRDPRWGRGSETPGEDPVRIKGYVRSLLRGLEGEESIK-KVIATCKHYAAYDLE 188
Query: 214 NWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKR 273
W+ + R+ F+A VS QD+ + + PF+ C + KV S+MCSYN +NG P CA+ ++
Sbjct: 189 RWHNITRYEFDAIVSLQDLSEYYLPPFQQCARDSKVGSIMCSYNSLNGTPACANTYLMDD 248
Query: 274 TIRGEWRL---NGYIVSDCDSVGVYYDTQH-FTSTPEEAAADAIRAGLDLDC---GPFLG 326
+R WR N YI SDC+++ + +H FT T EAAA A AG D C G
Sbjct: 249 ILRKHWRWTEDNNYITSDCNAIKDFLPDEHNFTQTAAEAAAAAYTAGTDTVCEVAGSPPY 308
Query: 327 LHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELA 386
A + LLSE I+ AL +R G FD S+ PY +G DV T + Q LA
Sbjct: 309 TDVVGAYDQKLLSEEVIDRALRRLYEGLVRAGYFDPA-SASPYRDIGWSDVNTAEAQALA 367
Query: 387 LEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIG 446
L++A G+VLLKN G +LP+ + +TVA+IG + T +M+G Y+GI Y +P+ G
Sbjct: 368 LQSASDGLVLLKNDG-TLPI-KLEGKTVALIGHWASGTRSMLGGYSGIPPYYHSPVYAAG 425
Query: 447 RYARTIHQQG--CKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLP 504
+ T + A D A+ A+ ++D + GLDQS+ +E DR + P
Sbjct: 426 QLNLTYKYASGPVAPASAARDTWTADALSAANKSDVILYFGGLDQSVASEDKDRDSIAWP 485
Query: 505 GRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADIL 564
Q L+ ++A G ++V+ G +D +P ++AI+WAGYPGQ+GGTA+ + +
Sbjct: 486 PAQLTLIQ--TLAGLGKPLVVIQLGDQVDDTPLLTNPNVSAILWAGYPGQSGGTAVLNAI 543
Query: 565 FGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFY-KGPVVYPFGHGMS 623
G S P G+LP+T YP Y + LP+T+M++RP + PGRTYR+ + V PFG+G+
Sbjct: 544 TGVSPPAGRLPVTQYPSSYTSQLPLTDMSLRPDPASGRPGRTYRWLPRNATVLPFGYGLH 603
Query: 624 YTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKD 683
YTNF A P ++ ++ + K+ H L V V+V N G++
Sbjct: 604 YTNF----------TARPNPAQNFTLTPSALLAPCKLAHRDLCPLPYPVTVEVTNTGART 653
Query: 684 GAHTLLVFSTPPAGHWAPH--KQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTR 740
+ LVF+T PH K LVA+ ++ + G R + + + L+ VD +G R
Sbjct: 654 SDYVGLVFATTRDAGPPPHPLKTLVAYARLRGIAPGRTARAQVQVALGD-LARVDAAGNR 712
Query: 741 RIPLGEH 747
+ G +
Sbjct: 713 VLYPGRY 719
>gi|291525508|emb|CBK91095.1| Beta-glucosidase-related glycosidases [Eubacterium rectale DSM
17629]
Length = 714
Score = 431 bits (1109), Expect = e-118, Method: Compositional matrix adjust.
Identities = 248/604 (41%), Positives = 356/604 (58%), Gaps = 63/604 (10%)
Query: 56 LIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQ 115
L+ ++++ EK+ ++ + A+ RLGI Y WW+EALHGV+ G AT FPQ
Sbjct: 12 LVSQMTIDEKISQMLYESPAIERLGIPEYNWWNEALHGVARAGV----------ATVFPQ 61
Query: 116 VITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYWSPNVNIFRDPRWGR 167
I A++F+ L E IG VVS E R +N G GLT+W+PNVNIFRDPRWGR
Sbjct: 62 AIGLAATFDTDLIEKIGDVVSTEGRGKFNEFSKKGDHGIYKGLTFWAPNVNIFRDPRWGR 121
Query: 168 GQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--RFHFNA 225
G ET GEDP L+GK +Y+RGLQG D D LK AA KHF + +G + R F+A
Sbjct: 122 GHETYGEDPYLTGKLGCAYIRGLQGDDPDHLKSAACAKHFAVH-----SGPEAIRHEFDA 176
Query: 226 KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYI 285
K SK D+ DT+ F+ CV + KV +VM +YN+VNG P C +LK +R E+ G++
Sbjct: 177 KASKHDMYDTYLYAFKRCVKDAKVEAVMGAYNRVNGEPACGSRTLLKDILRDEFGFEGHV 236
Query: 286 VSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINN 345
VSDC ++ +++ H T T EE+AA A+ G DL+CG LH + A +GL+S+ I
Sbjct: 237 VSDCWAILDFHEHHHVTDTVEESAAMAVNNGCDLNCGSAF-LHLKDAYDKGLVSDEAITA 295
Query: 346 ALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLP 405
A+ + V++RLGM PS PY + + V +H EL++EAAR+ +VLLKN+ LP
Sbjct: 296 AVERLMEVRIRLGMMKDYPS--PYEDISYEVVECKEHVELSVEAARRSLVLLKNKDNFLP 353
Query: 406 LSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY----ARTIHQQGC---K 458
L +T+AVIGPN++ +IGNY G + Y TPL+G+ +Y R ++ +GC K
Sbjct: 354 LDRKNVKTIAVIGPNANSRDALIGNYYGTSSRYITPLEGLQQYLGEDTRVLYAEGCHLYK 413
Query: 459 DVA---CADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRAGLLLPGR 506
D + F A+ + Q+D ++ +GLD +IE E + D+ GL+LPG
Sbjct: 414 DKVQGLAEEKDRFKEALIMAEQSDVVVMCLGLDATIEGEEGDAGNEYASGDKLGLMLPGL 473
Query: 507 QQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFG 566
Q+EL+ V+ K P ILVL +G ID+++A+ + AII + YPG GG A+A+ +FG
Sbjct: 474 QEELLEAVAAVGK-PVILVLSAGSAIDLSWAEE--HVDAIIDSWYPGARGGKAVAEAIFG 530
Query: 567 TSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSY- 624
+P GKLP+T+Y + NLP T+ +M RTYR+ V+YPFG+G+ Y
Sbjct: 531 EYSPSGKLPVTFY--QGTENLPEFTDYSM--------AHRTYRYTNENVLYPFGYGLHYG 580
Query: 625 -TNF 627
TN+
Sbjct: 581 ETNY 584
>gi|212531051|ref|XP_002145682.1| beta-xylosidase XylA [Talaromyces marneffei ATCC 18224]
gi|210071046|gb|EEA25135.1| beta-xylosidase XylA [Talaromyces marneffei ATCC 18224]
Length = 799
Score = 431 bits (1108), Expect = e-118, Method: Compositional matrix adjust.
Identities = 255/720 (35%), Positives = 392/720 (54%), Gaps = 29/720 (4%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C S R LI +L+E + + VPRLG+ YE WSE LHG+ K
Sbjct: 64 CDTSANYVDRAEGLIALFTLEELINNTQNSGPGVPRLGLPPYEVWSEGLHGLDRAH-FVK 122
Query: 103 FGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRD 162
G ++ ATSFP I + ++ N TL I +++ +ARA N G GL ++PN+N FR
Sbjct: 123 SGDEWTWATSFPMPILSMAALNRTLINQIASIIATQARAFNNVGRYGLDAYAPNINGFRS 182
Query: 163 PRWGRGQETPGEDP-VLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDR 220
P WGRGQETPGED L+ YA Y+ GLQG D D LK+AA+ KHF YDL+NW G R
Sbjct: 183 PLWGRGQETPGEDANFLTSSYAYEYITGLQGGIDPDNLKIAATAKHFAGYDLENWGGNSR 242
Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
F+A++++QD+ + + F K S MCSYN VN +P+C+ +L+ +R +W
Sbjct: 243 LGFDARITQQDLAEYYTPQFLAASRYAKARSFMCSYNSVNAIPSCSSSFLLQTLLREQWD 302
Query: 281 L--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLL 338
GY+ SDCD+V ++ + S AAA+++RAG D+DCG H + G +
Sbjct: 303 FPEYGYVSSDCDAVYNVFNPHGYASNQSSAAAESLRAGTDIDCGQTYSWHLNQSFIEGSV 362
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
+ +I +++ + ++LG FDG+ + Y LG DV T D ++ EAA +GIVLLK
Sbjct: 363 TRGEIERSILRLYSNLVKLGYFDGDKNE--YRQLGWNDVVTTDAWNISYEAAVEGIVLLK 420
Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCK 458
N G LPLS ++VA++GP ++ T + GNY G A TPLQG ++
Sbjct: 421 NDG-VLPLSK-NVKSVALVGPWANATKQLQGNYFGTAPYLITPLQGASDAGYKVNYALGT 478
Query: 459 DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMAS 518
+++ F A+ A++++D + + G+D +IEAE DR + P Q +L+ ++S
Sbjct: 479 NISGNTTDGFANALSAAKKSDVIVYLGGIDNTIEAEGTDRMNVTWPRNQLDLIQQLSQTG 538
Query: 519 KGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTW 578
K P +++ M GG +D + K++ ++ A+IW GYPGQ+GG AI DIL G P G+L T
Sbjct: 539 K-PLVVLQMGGGQVDSSSIKSNSKVNALIWGGYPGQSGGKAIFDILKGKRAPAGRLVSTQ 597
Query: 579 YPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
YP EY T P T+M++RP K PG+TY +Y G VY FG+G+ YT F T
Sbjct: 598 YPAEYATQFPATDMSLRP-DGKSNPGQTYMWYIGKPVYEFGYGLFYTTFKET-------- 648
Query: 639 AVPLDGRHGSINATISGKAIKVTHAKCNRLT--LGVQVDVKNVGSKDGAHTLLVFSTP-- 694
A L S + + + + + + L L V +KN G +T ++F+
Sbjct: 649 AKKLGSSSSSFDISEIVSSPRSPSYEYSELVPFLNVTATIKNTGKTASPYTAMLFANTTN 708
Query: 695 --PAGHWAPHKQLVAFEKV-HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHI 751
PA + P+K LV ++++ + G + I + + ++ VD++G R + G++ + +
Sbjct: 709 AGPAPY--PNKWLVGYDRLPSIEPGKSADLVIPVPIGA-IARVDKNGNRIVYPGDYQLTL 765
>gi|292495634|sp|A1CND4.2|XYND_ASPCL RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
Length = 792
Score = 431 bits (1108), Expect = e-118, Method: Compositional matrix adjust.
Identities = 256/712 (35%), Positives = 386/712 (54%), Gaps = 23/712 (3%)
Query: 52 RVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGAT 111
R LI +L+E V + + VPRLG+ Y+ W+EALHG+ + G F +T
Sbjct: 72 RAAALISLFTLEELVNATGNTSPGVPRLGLPPYQVWNEALHGLDRAYFTDE--GQFSWST 129
Query: 112 SFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQET 171
SFP I T S+ N TL + ++S + RA N G GL +SPN+N FR P WGRGQET
Sbjct: 130 SFPMPILTMSALNRTLINQVASIISTQGRAFSNAGRYGLDVYSPNINSFRHPVWGRGQET 189
Query: 172 PGEDP-VLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSK 229
PGED LS YA Y+ G+QG D LK+ A+ KH+ YD++NW+G R + +++
Sbjct: 190 PGEDAYCLSSAYAYEYITGIQGGVDPKSLKLVATAKHYAGYDIENWDGHSRLGNDMNITQ 249
Query: 230 QDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRL--NGYIVS 287
QD+ + + F + + KV SVMCSYN VNGVP+CA+ L+ +R + +GYI S
Sbjct: 250 QDLSEYYTPQFLVAARDAKVRSVMCSYNAVNGVPSCANSFFLQTLLRDTFGFVEDGYISS 309
Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
DCDS ++ + + AAAD+IRAG D+DCG + + AV + LLS DI +
Sbjct: 310 DCDSAYNVFNPHEYAANVSSAAADSIRAGTDIDCGTTYQYYFDEAVDQNLLSRADIERGV 369
Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
+ + MRLG FDG S+ Y +L DV T + ++ E +G VLLKN G +LPLS
Sbjct: 370 IRLYSNLMRLGYFDGNSSA--YRNLTWNDVVTTNSWNISYEV--EGTVLLKNDG-TLPLS 424
Query: 408 HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQL 467
R++A++GP +V+ + GNY G A +PL ++ +++
Sbjct: 425 E-SIRSIALVGPWMNVSTQLQGNYFGPAPYLISPLDAFRDSHLDVNYAFGTNISSNSTDG 483
Query: 468 FGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLM 527
F A+ A++++DA I G+D S+EAE LDR + PG+Q EL+ ++S K P I++ M
Sbjct: 484 FSKALSAAKKSDAIIFAGGIDNSLEAETLDRMNITWPGKQLELIDQLSQLGK-PLIVLQM 542
Query: 528 SGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL 587
GG +D + K++ + ++IW GYPGQ+GG A+ DI+ G P G+L +T YP EY T
Sbjct: 543 GGGQVDSSLLKSNKNVNSLIWGGYPGQSGGQALLDIITGKRAPAGRLVVTQYPAEYATQF 602
Query: 588 PMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHG 647
P T+M++RP + PG+TY +Y G VY FGHG+ YT F V++A V P
Sbjct: 603 PATDMSLRPHGNN--PGQTYMWYTGTPVYEFGHGLFYTTF--RVSHARAVKIKPT----Y 654
Query: 648 SINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLV 706
+I ++ H + L VD+ N G +T ++F+ AG P K LV
Sbjct: 655 NIQDLLAQPHPGYIHVEQMPF-LNFTVDITNTGKASSDYTAMLFANTTAGPAPYPKKWLV 713
Query: 707 AFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSV 758
F+++ + ++ ++ D G R + G++ + + + V
Sbjct: 714 GFDRLPTLGPSTSKLMTIPVTINSMARTDELGNRVLYPGKYELALNNERSVV 765
>gi|347531439|ref|YP_004838202.1| beta-glucosidase [Roseburia hominis A2-183]
gi|345501587|gb|AEN96270.1| beta-glucosidase [Roseburia hominis A2-183]
Length = 716
Score = 431 bits (1108), Expect = e-118, Method: Compositional matrix adjust.
Identities = 266/747 (35%), Positives = 400/747 (53%), Gaps = 96/747 (12%)
Query: 45 VSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFG 104
+SL + L+ +++L+EK+ + + A+ RL I Y WW+EALHGV+ G
Sbjct: 1 MSLETKEYAKRLVEQMTLEEKISQMRYESPAIERLHIPAYNWWNEALHGVARSGV----- 55
Query: 105 GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYWSPN 156
AT FPQ I A++F+ L E IG VVS E RA + G GLT+W+PN
Sbjct: 56 -----ATMFPQAIALAATFDEELIEKIGDVVSTEGRAKFEAYSGRGDRGIYKGLTFWAPN 110
Query: 157 VNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWN 216
+NIFRDPRWGRG ET GEDP L+ K +Y+RG+QG D D LK AA KHF + +
Sbjct: 111 INIFRDPRWGRGHETYGEDPCLTAKLGCAYIRGIQGKDPDHLKAAACAKHFAVH-----S 165
Query: 217 GVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRT 274
G + R F+AKVS D+ DT+ F+ CV + V +VM +YN+VNG P C +L+
Sbjct: 166 GPEALRHEFDAKVSLHDLYDTYLYAFKRCVKDAGVEAVMGAYNRVNGEPACGSKTLLQDI 225
Query: 275 IRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQ 334
+R ++ G++VSDC ++ +++ H T T EE+AA A+ G DL+CG L+ A +
Sbjct: 226 LREQFGFEGHVVSDCWAILDFHEHHHVTKTVEESAAMAVNHGCDLNCGKAF-LYLSRACE 284
Query: 335 RGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGI 394
+GL+ E I A+ + V++RLGM + PS PY ++ V P+H L+LEA+++ +
Sbjct: 285 QGLVEEKTITEAVERLMDVRIRLGMMEDYPS--PYANIPYDVVECPEHIALSLEASKRSM 342
Query: 395 VLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY----AR 450
VLLKN LPL + T+AVIGPN++ ++GNY G + Y TPL+GI Y R
Sbjct: 343 VLLKNDNHFLPLKQEQVHTIAVIGPNANSRAALVGNYEGTSSRYITPLEGIQEYTGEKTR 402
Query: 451 TIHQQGCKDVACADDQL---------FGAAIDASRQADATILVMGLDQSIEAE------- 494
++ QGC DQ+ F A+ A+ +AD ++ +GLD IE E
Sbjct: 403 VLYAQGCH---LYKDQVEFLGEPKDRFKEALIAAERADVIVMCLGLDAGIEGEEGDAGNE 459
Query: 495 --ALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYP 552
+ D+ GL LPG QQEL+ V+ K P +L +++G +D+++A+ +I AI+ YP
Sbjct: 460 YASGDKLGLKLPGLQQELLEAVAAVGK-PIVLTVLAGSALDLSWAQEHAQIRAILDCWYP 518
Query: 553 GQAGGTAIADILFGTSNPGGKLPMTWYP-QEYITNLPMTEMAMRPSQSKRYPGRTYRFYK 611
G GG AIA+ LFG +P GKLP+T+Y E++ + MA GRTYR+
Sbjct: 519 GARGGKAIAEALFGEFSPCGKLPVTFYEGTEFLPDFTDYSMA----------GRTYRYTD 568
Query: 612 GPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLG 671
V+YPFG+G++Y+ ++ A+A + T G VT
Sbjct: 569 RHVLYPFGYGLTYSQIRYSDAHA---------------DVTDFGILEPVT---------- 603
Query: 672 VQVDVKNVGS---KDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVC 728
V V V+N G+ ++ + FS A + P QL V + G ++ V I +
Sbjct: 604 VHVTVENTGTYPVQEAVQVYVRFSEREA--YDPGYQLKGIRSVALECGEKKEVCITLSP- 660
Query: 729 KYLSVVDRSGTRRIPLGEHNIHIGGTK 755
+ +++ G + G + I +GG +
Sbjct: 661 RDFALISEEGKCLVHPGSYEIAVGGQQ 687
>gi|238923424|ref|YP_002936940.1| beta-glucosidase [Eubacterium rectale ATCC 33656]
gi|238875099|gb|ACR74806.1| beta-glucosidase [Eubacterium rectale ATCC 33656]
Length = 714
Score = 431 bits (1107), Expect = e-118, Method: Compositional matrix adjust.
Identities = 247/604 (40%), Positives = 356/604 (58%), Gaps = 63/604 (10%)
Query: 56 LIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQ 115
L+ ++++ EK+ ++ + A+ RLGI Y WW+EALHGV+ G AT FPQ
Sbjct: 12 LVSQMTIDEKISQMLYESPAIERLGIPEYNWWNEALHGVARAGV----------ATVFPQ 61
Query: 116 VITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYWSPNVNIFRDPRWGR 167
I A++F+ L E IG VVS E R +N G GLT+W+PNVNIFRDPRWGR
Sbjct: 62 AIGLAATFDTDLIEKIGDVVSTEGRGKFNEFSKKGDHGIYKGLTFWAPNVNIFRDPRWGR 121
Query: 168 GQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--RFHFNA 225
G ET GEDP L+GK +Y+RGLQG D D LK AA KHF + +G + R F+A
Sbjct: 122 GHETYGEDPYLTGKLGCAYIRGLQGDDPDHLKSAACAKHFAVH-----SGPEAIRHEFDA 176
Query: 226 KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYI 285
K SK D+ DT+ F+ CV + KV +VM +YN+VNG P C +LK +R E+ G++
Sbjct: 177 KASKHDMYDTYLYAFKRCVKDAKVEAVMGAYNRVNGEPACGSRTLLKDILRDEFGFEGHV 236
Query: 286 VSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINN 345
VSDC ++ +++ H T T EE+AA A+ G DL+CG LH + A +G++S+ I
Sbjct: 237 VSDCWAILDFHEHHHVTDTVEESAAMAVNNGCDLNCGSAF-LHLKDAYDKGMVSDEAITA 295
Query: 346 ALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLP 405
A+ + V++RLGM PS PY + + V +H EL++EAAR+ +VLLKN+ LP
Sbjct: 296 AVERLMEVRIRLGMMKDYPS--PYEDISYEVVECKEHVELSVEAARRSLVLLKNKDNFLP 353
Query: 406 LSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY----ARTIHQQGC---K 458
L +T+AVIGPN++ +IGNY G + Y TPL+G+ +Y R ++ +GC K
Sbjct: 354 LDRKNVKTIAVIGPNANSRDALIGNYYGTSSRYITPLEGLQQYLGEDTRVLYAEGCHLYK 413
Query: 459 DVA---CADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRAGLLLPGR 506
D + F A+ + Q+D ++ +GLD +IE E + D+ GL+LPG
Sbjct: 414 DKVQGLAEEKDRFKEALIMAEQSDVVVMCLGLDATIEGEEGDAGNEYASGDKLGLMLPGL 473
Query: 507 QQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFG 566
Q+EL+ V+ K P ILVL +G ID+++A+ + AII + YPG GG A+A+ +FG
Sbjct: 474 QEELLEAVAAVGK-PVILVLSAGSAIDLSWAEE--HVDAIIDSWYPGARGGKAVAEAIFG 530
Query: 567 TSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSY- 624
+P GKLP+T+Y + NLP T+ +M RTYR+ V+YPFG+G+ Y
Sbjct: 531 EYSPNGKLPVTFY--QGTENLPEFTDYSM--------AHRTYRYTNENVLYPFGYGLHYG 580
Query: 625 -TNF 627
TN+
Sbjct: 581 ETNY 584
>gi|310795958|gb|EFQ31419.1| glycosyl hydrolase family 3 N terminal domain-containing protein
[Glomerella graminicola M1.001]
Length = 824
Score = 429 bits (1104), Expect = e-117, Method: Compositional matrix adjust.
Identities = 267/742 (35%), Positives = 388/742 (52%), Gaps = 44/742 (5%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C +L +R L+ L++ EK+ L++ A VPRL I YEWWSE LHGV++ PGTK
Sbjct: 65 CDETLSPKERAAALVAELTIWEKLDNLVNEAPGVPRLAIPPYEWWSEGLHGVAS-SPGTK 123
Query: 103 FG--GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYW------- 153
F G+F ATSFPQ I S+F+ L +AIG VVS EARA N G +GL +
Sbjct: 124 FAKSGNFSYATSFPQPIVLGSAFDDDLVKAIGEVVSKEARAFSNRGRSGLDLYVSSISRH 183
Query: 154 --------------SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLK 199
SPN+N F+DPRWGRGQETPGEDP Y A+ + GL+G D + K
Sbjct: 184 IEPEVRDDMLTEPESPNINAFKDPRWGRGQETPGEDPFHLQNYVAAMLTGLEGGDPSK-K 242
Query: 200 VAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQV 259
+ A+CKH+ A D +N+ GVDR F+A ++ QD+ + + PF+ C ++ KV S MCSYN +
Sbjct: 243 LIATCKHYAANDFENYKGVDRAGFDANITTQDLSEYYLPPFKTCAVDKKVGSFMCSYNAI 302
Query: 260 NGVPTCADPNILKRTIRGEWRLNG---YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAG 316
NG P CA+P +L+ +R W NG Y+ +DCD V + H+ AAA A++AG
Sbjct: 303 NGEPLCANPYLLEDILRQHWGWNGDGQYVSTDCDCVALMVSHHHYAPDLGHAAAWAMKAG 362
Query: 317 LDLDCGPFLGLHT-ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPK 375
DL+C F G + A + L+SE +++ +L T + +G FD QP L
Sbjct: 363 TDLECNAFPGSEALQLAWNQSLISEKEVDKSLTRMYTALVSVGQFDSA-RGQPLRSLSWD 421
Query: 376 DVCTPDHQELALEAARQGIVLLKNQGPSLPLSHI-RHRTVAVIGPNSDVTVTMIGNYAGI 434
DV T + Q+LA +A +G VLLKN G LPLS R + A+IGP + T M GNY G
Sbjct: 422 DVNTKEAQKLAYQAVIEGAVLLKNDG-ILPLSAAWREKKYALIGPWINATTQMQGNYFGP 480
Query: 435 ACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE 494
A + Q + + DD F A+D++ A + G+D ++EAE
Sbjct: 481 APYLISLYQAAKEFGLDFTYSLGSRINSTDDS-FKQALDSAHAAALIVFAGGVDNTLEAE 539
Query: 495 ALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQ 554
DR L P Q +L+ VS K P I++ GG +D + I A++W GYPGQ
Sbjct: 540 TRDRKTLAWPESQLDLLRAVSALGK-PVIVLQFGGGQVDDTELLANHSINALLWGGYPGQ 598
Query: 555 AGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPV 614
+GG A+ D+LFG + P G+L +T YP Y ++P T+M +RP GRTY +Y G
Sbjct: 599 SGGKAVIDLLFGRAAPAGRLSVTQYPASYNEDVPSTDMNLRPGPGNSGLGRTYMWYNGDA 658
Query: 615 VYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT---LG 671
V P+G G+ YT F + + + +++ +S + T LT +
Sbjct: 659 VVPYGFGLHYTTFDAKLKARQASALI----KTEEVSSLLSNDYVSGTLVWQQILTKPVVS 714
Query: 672 VQVDVKNVGSKDGAHTLLVFSTPPAGHW-APHKQLVAFEKV-HVPAGAQQRVGINIHVCK 729
V + V N G+ + L+F AG P K L + + ++ G + ++I + +
Sbjct: 715 VLITVSNTGNVASDYVALLFLRSNAGPTPQPTKTLAGYHRFRNIQPGDRSEREVSITIER 774
Query: 730 YLSVVDRSGTRRIPLGEHNIHI 751
L VD G R + G + + +
Sbjct: 775 -LVRVDELGNRVLHPGSYELFV 795
>gi|255957137|ref|XP_002569321.1| Pc21g23540 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211591032|emb|CAP97251.1| Pc21g23540 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 791
Score = 428 bits (1100), Expect = e-117, Method: Compositional matrix adjust.
Identities = 268/729 (36%), Positives = 393/729 (53%), Gaps = 36/729 (4%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C + R LI + +E V + A+PRLG+ Y+ W+EALHG+ T+
Sbjct: 61 CDTTAKPHDRAAALIAMFTFEELVNSTGNVMPAIPRLGLPPYQVWNEALHGLDRANL-TE 119
Query: 103 FGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRD 162
FG D+ ATSFP I T ++ N TL IG +VS + RA NGG GL +SPN+N FR
Sbjct: 120 FG-DYSWATSFPSPILTMAALNRTLINQIGGIVSTQGRAFNNGGRYGLDVYSPNINSFRH 178
Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDRF 221
P WGRGQETPGED L Y Y+ GLQG D LK+AA+ KHF YD++NW R
Sbjct: 179 PVWGRGQETPGEDIQLCSVYGLEYITGLQGGLDPKELKLAATAKHFAGYDIENWGNHSRL 238
Query: 222 HFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRL 281
+ +S D + F V + +V SVM SYN VNGVP A+ +L+ +R W
Sbjct: 239 GNDMSISAFDFASYYAPQFVTAVRDARVHSVMASYNAVNGVPASANSFLLQTLLRDTWNF 298
Query: 282 --NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLS 339
+GY+ SDCDSV ++ + S+ AAA +I+AG D+DCG L+ + +G +S
Sbjct: 299 VEDGYVSSDCDSVYNVFNPHGYASSASLAAAKSIQAGTDIDCGATYQLYLNQSFTQGEIS 358
Query: 340 EIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKN 399
+I A + + LG FDG+ S Y L DV D ++ EAA +GIVLLKN
Sbjct: 359 RSEIERAATRFYSNLVSLGYFDGDNSK--YRDLDWSDVVATDAWNISYEAAVEGIVLLKN 416
Query: 400 QGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKD 459
G +LPLS H +VA+IGP ++VT TM GNY G A T PL + ++ +
Sbjct: 417 DG-TLPLSKDTH-SVALIGPWANVTTTMQGNYYGAAPYLTGPLAALQASDLDVNYAFGTN 474
Query: 460 VACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASK 519
++ F AA+ A+R++D I G+D S+EAE +DR + PG Q +L+ ++S K
Sbjct: 475 ISSETTSGFEAALSAARKSDVVIFAGGIDNSVEAEGVDRETITWPGNQLQLIEQLSELGK 534
Query: 520 GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWY 579
P +++ M GG +D + K + + +++W GYPGQ+GG AI DIL G P G+L +T Y
Sbjct: 535 -PLVVLQMGGGQVDSSSLKANKNVNSLVWGGYPGQSGGPAILDILTGKRAPAGRLTVTQY 593
Query: 580 PQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
P EY P T+M++RP S PG+TY +Y G VY FGHG+ YT F ++AN+
Sbjct: 594 PAEYALQFPATDMSLRPKGSN--PGQTYMWYTGKPVYEFGHGLFYTTFETSLANS----- 646
Query: 640 VPLDGRHGSINATISG--KAIKVTHAKCNRLT----LGVQVDVKNVGSKDGAHTLLVFST 693
HG+ N K + ++A N + + ++V+N G+ +T + F
Sbjct: 647 ------HGANNGASFDIVKLLSRSNAGYNVIEQVPFMNYTIEVENTGTVTSDYTAMAFVN 700
Query: 694 PPAG---HWAPHKQLVAFEKV-HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNI 749
AG H P+K LV F+++ + A Q + I + + ++ D G R + G++ +
Sbjct: 701 TKAGPSPH--PNKWLVGFDRLGGIEPHATQTMTIPVSLDN-VARTDEDGNRIVYPGKYEL 757
Query: 750 HIGGTKHSV 758
+ + +V
Sbjct: 758 ALNNERSAV 766
>gi|375150455|ref|YP_005012896.1| Beta-glucosidase [Niastella koreensis GR20-10]
gi|361064501|gb|AEW03493.1| Beta-glucosidase [Niastella koreensis GR20-10]
Length = 711
Score = 427 bits (1098), Expect = e-116, Method: Compositional matrix adjust.
Identities = 265/738 (35%), Positives = 384/738 (52%), Gaps = 76/738 (10%)
Query: 38 RTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNV 97
+T F P+ RVNDL+ +L+L EK+ LL + V RLGI Y WW+EALHGV+
Sbjct: 13 QTSVFRNPQQPMEARVNDLLHQLTLPEKISLLGYRSKEVERLGIPAYNWWNEALHGVARA 72
Query: 98 GPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------G 149
G AT FPQ I A++FN L + V+S EARA YN A G
Sbjct: 73 GV----------ATVFPQAIGMAATFNDDLLKEAATVISTEARAKYNLSLAQGRHLQYMG 122
Query: 150 LTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTA 209
LT+WSPN+NIFRDPRWGRGQET GEDP L+ ++V+GLQG+D LK +A KHF
Sbjct: 123 LTFWSPNINIFRDPRWGRGQETYGEDPFLTAHMGTAFVKGLQGNDPRYLKASACAKHFAV 182
Query: 210 YDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
+ NG R FNA V ++D+ +T+ F V G V SVMC+YN+VN P C+
Sbjct: 183 HSGPE-NG--RHTFNAIVDEKDLRETYLYAFHALVDAG-VESVMCAYNRVNDQPCCSGNF 238
Query: 270 ILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHT 329
+L +R EW+ G++V+DC ++ + + E AA AI+AG++LDC L
Sbjct: 239 LLNSILRNEWKFKGHVVTDCGALDDIFMRHKVMPSGVEVAAAAIKAGVNLDCSNVLQKDV 298
Query: 330 ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEA 389
E AV++ LL+E DI+++L + L Q++LG +D +P++ P+ G V H LA
Sbjct: 299 EKAVEQKLLNEKDIDSSLAHLLRTQIKLGFYD-DPTANPFYKYGADSVANTAHATLARAM 357
Query: 390 ARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY- 448
A+Q +VLLKN LPL ++ + V+G NS ++GNY G++ + ++GI
Sbjct: 358 AQQSMVLLKNSNQLLPLDKKKYPAIMVVGTNSASMDALLGNYHGVSNRAVSFVEGITNAV 417
Query: 449 ---ARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGL---------DQSIEAEAL 496
R + QG D FG I A+ AD T+ V+GL D + A+
Sbjct: 418 DAGTRVEYDQGSD---YNDTTHFG-GIWAAGNADITVAVIGLTPVYEGEEGDAFLAAKGG 473
Query: 497 DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAG 556
D+ + LP + + A+K P I V+ +G +D++ +P AI+ A YPG+ G
Sbjct: 474 DKPDMSLPAAHIAFMKALRKANKKPIIAVITAGSAVDISAI--EPYADAILLAWYPGEQG 531
Query: 557 GTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVY 616
G A+ADILFG +P G+LP+T+Y Q + AM+ GRTYR++ G V Y
Sbjct: 532 GNALADILFGKVSPAGRLPVTFY-QSFADVPAYDNYAMK--------GRTYRYFNGKVQY 582
Query: 617 PFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDV 676
PFG+G+SYT+F + P + D ++ + V
Sbjct: 583 PFGYGLSYTSFAYEWQQMPANIRTAKD-------------------------SVSFSIKV 617
Query: 677 KNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDR 736
KN GS DG + V+ PA P K+L AF++VHV AG ++ V + I
Sbjct: 618 KNTGSMDGDEVVQVYVEYPAVERMPLKELKAFKRVHVKAGGEETVQLTIPASDLQKWDLA 677
Query: 737 SGTRRIPLGEHNIHIGGT 754
+ + ++ G +NI GG
Sbjct: 678 TSSWKLYPGSYNIFAGGN 695
>gi|376259588|ref|YP_005146308.1| beta-glucosidase-like glycosyl hydrolase [Clostridium sp. BNL1100]
gi|373943582|gb|AEY64503.1| beta-glucosidase-like glycosyl hydrolase [Clostridium sp. BNL1100]
Length = 712
Score = 426 bits (1096), Expect = e-116, Method: Compositional matrix adjust.
Identities = 270/747 (36%), Positives = 393/747 (52%), Gaps = 94/747 (12%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
+ SL +R DL+ R++L+EK L A V RLGI Y WW+EALHGV+ G
Sbjct: 6 YLDKSLSFKERAADLVSRMTLEEKASQLRYDAQPVERLGIPRYNWWNEALHGVARAGV-- 63
Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYW 153
AT FPQ I A+ F+ E I V++ E RA YN G+T+W
Sbjct: 64 --------ATVFPQAIGMAAIFDDEFLEKIADVIATEGRAKYNENAKKGDRDIYKGITFW 115
Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLD 213
SPNVNIFRDPRWGRG ET GEDP L+ + ++V+GLQG DG LK AA KHF +
Sbjct: 116 SPNVNIFRDPRWGRGHETYGEDPYLTSRLGVAFVKGLQG-DGKYLKTAACAKHFAVH--- 171
Query: 214 NWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKR 273
+ DR HF+A VS++D+ +T+ F V E KV SVM +YN+ NG P +LK
Sbjct: 172 SGPEDDRHHFDAVVSQKDLYETYLPAFEALVKEAKVESVMGAYNRTNGEPCNGSKTLLKD 231
Query: 274 TIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAV 333
+R W +G++VSDC ++ +++ T TP E+ A A+++G DL+CG L A+
Sbjct: 232 ILRDGWGFDGHVVSDCWAIKDFHEGHGVTKTPTESVALALKSGCDLNCGNMY-LLILLAL 290
Query: 334 QRGLLSEIDINNALVNTLTVQMRLGMFDG--EPSSQPYGHLGPKDVCTPDHQELALEAAR 391
+ G ++E DI+ A + +T +MRLGMFD E PY L + +H +L+LEAA+
Sbjct: 291 KEGRITEEDIDRAAIRLMTTRMRLGMFDDDCEFDKIPY-ELND----SVEHNKLSLEAAK 345
Query: 392 QGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY--- 448
+ +VLLKN G LPL + + +AVIGPN+D ++ + NY+G T L GI +
Sbjct: 346 KSMVLLKNDG-LLPLDSKKIKNIAVIGPNADSSLALRANYSGTPSQNITILDGIRKRVSE 404
Query: 449 -ARTIHQQGC-------KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL---- 496
R + G +D+A DD+L A+ + ++D +L +GLD S+E E
Sbjct: 405 DTRVWYSVGSHLFMNREEDLAQPDDRL-KEAVSVAERSDVVVLCLGLDASVEGEQNDQGT 463
Query: 497 -------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWA 549
D+A L LP Q+ L++ V +A+ PTI+ L+SG + + A + + AAI+
Sbjct: 464 VILDAGGDKADLNLPESQRNLLNAV-LATGKPTIVALLSGSALSIGDAAD--KAAAIVQC 520
Query: 550 GYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRF 609
YPG GG A A+++FG +P G+LP+T+Y LP P RTY+F
Sbjct: 521 WYPGSRGGLAFAEMIFGDYSPAGRLPVTFYKS--TEELP-------PFADYSMENRTYKF 571
Query: 610 YKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT 669
KG +YPFG G+SYTNF ++ P V N
Sbjct: 572 MKGEALYPFGFGLSYTNFEYSNIVCPQNVN--------------------------NGEN 605
Query: 670 LGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVC 728
L V VDV+N GS D + V+ A P L F+++H+ +G ++ V I
Sbjct: 606 LSVSVDVQNAGSVDSDEVVQVYIKDMDASVRVPKYSLCGFKRIHLKSGEKKTVTFEID-S 664
Query: 729 KYLSVVDRSGTRRIPLGEHNIHIGGTK 755
+++VD +G R I GE +++GG++
Sbjct: 665 NAMTIVDEAGKRYIENGEFTLYVGGSQ 691
>gi|291518645|emb|CBK73866.1| Beta-glucosidase-related glycosidases [Butyrivibrio fibrisolvens
16/4]
Length = 713
Score = 426 bits (1095), Expect = e-116, Method: Compositional matrix adjust.
Identities = 260/733 (35%), Positives = 390/733 (53%), Gaps = 83/733 (11%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
+R +L+ +++++EK ++ A A+ RLGI Y WW+EALHGV+ G A
Sbjct: 7 KRAKELVSQMTIEEKCSQMLHHAEAIDRLGIPKYCWWNEALHGVARAGD----------A 56
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRD 162
T FPQ I ++F+ L E + V S E RA YN T GLTYW+PNVNIFRD
Sbjct: 57 TVFPQAIGLGATFDEELVEKVADVTSTEGRAKYNEFTKHGDRDIYKGLTYWAPNVNIFRD 116
Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFH 222
PRWGRG ET GEDP L+G+ +YVRGLQG D D K AA KHF + + +R H
Sbjct: 117 PRWGRGHETYGEDPYLTGQLGMAYVRGLQGDDLDNPKSAACAKHFAVH---SGPEAERHH 173
Query: 223 FNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLN 282
F+AKV+ QD+ DT+ F+ V + KV +VM +YN+VNG P C +LK +RG+W
Sbjct: 174 FDAKVNDQDLYDTYLYAFKRLVKDAKVEAVMGAYNRVNGEPACGSKRLLKDILRGDWGFE 233
Query: 283 GYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEID 342
G++VSDC ++ +++ T E+AA A+ G DL+CG A + L++E
Sbjct: 234 GHVVSDCWAIRDFHENHKVTGCEVESAALAVNNGCDLNCGCVYE-KLLYAYKANLVTEET 292
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGP 402
I ++ + +++RLG S Y + + V +H+ELA+EAA++ +VLLKN G
Sbjct: 293 ITESVERLIELRLRLGTLPERRSK--YDDIPYEVVECKEHKELAIEAAKRSMVLLKNDGL 350
Query: 403 SLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY----ARTIHQQGC- 457
LPL +T+ VIGPNS+ + ++GNY GI+ Y T L+GI +Y R H G
Sbjct: 351 -LPLKKDEIKTIGVIGPNSNSRMALVGNYEGISSEYITVLEGIQQYVGDDVRVFHSDGTP 409
Query: 458 --KD---VACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRAGLLL 503
KD V F A+ + +D +L MGLD +IE E + D+ GL L
Sbjct: 410 LWKDRMHVLSEARDTFAEAMAVAEHSDVVVLAMGLDSTIEGEEGDAGNEFGSGDKKGLKL 469
Query: 504 PGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADI 563
PG QQEL+ K++ K P +L++++G +D+++A + + AI+ YPG GG AIA +
Sbjct: 470 PGLQQELLEKITAIGK-PVVLLVLAGSAMDLSWANEN--VNAIMHCWYPGARGGKAIAQV 526
Query: 564 LFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMS 623
LFG +P GKLP+T+Y + +LP P + GRTYR++KG +YPFG+G+S
Sbjct: 527 LFGEDSPSGKLPLTFYKSD--ADLP-------PFEDYSMEGRTYRYFKGTPLYPFGYGLS 577
Query: 624 YTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKD 683
Y++ ++ A +D G+I + V+V VKN G
Sbjct: 578 YSDIQYSNAG--------IDKTEGAIGDKFT-----------------VKVTVKNAGDYK 612
Query: 684 GAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRI 742
T+ V+ A + L KV + G + V + + + +++D G +
Sbjct: 613 AHETVQVYVKDVEASTRVANCSLRKIAKVELLPGESKEVSLELS-ARDFAIIDEKGHCIV 671
Query: 743 PLGEHNIHIGGTK 755
G+ + +GG +
Sbjct: 672 EPGKFKVFVGGQQ 684
>gi|358382857|gb|EHK20527.1| hypothetical protein TRIVIDRAFT_192759 [Trichoderma virens Gv29-8]
Length = 860
Score = 426 bits (1094), Expect = e-116, Method: Compositional matrix adjust.
Identities = 273/726 (37%), Positives = 400/726 (55%), Gaps = 47/726 (6%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C + I R ++ ++L EKV + S A+ RLG+ Y+W +EALHGV+ G +
Sbjct: 139 CDTTKSIAARAAAIVKPMTLNEKVANVGSSASGSGRLGLPAYQWQNEALHGVAG-STGVQ 197
Query: 103 F----GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVN 158
F G +F ATSFP I +++F+ L +++ +S EARA N G AGL +W+PN+N
Sbjct: 198 FQSPLGANFSAATSFPMPILLSAAFDDALVQSVATAISTEARAFANYGFAGLDFWTPNIN 257
Query: 159 IFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNG 217
FRDPRWGRG ETPGED Y S + GLQG D D + ++CKHF AYD++N
Sbjct: 258 PFRDPRWGRGMETPGEDAFRIQGYVLSLINGLQGGIDPDFFRTISTCKHFAAYDIEN--- 314
Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
R N ++QD+ D + F CV + KV S+MC+YN VNGVP CAD +L+ +R
Sbjct: 315 -GRTANNLSPTQQDMADYYLPMFETCVRDAKVGSIMCAYNSVNGVPACADSYLLQSVLRD 373
Query: 278 EWRLN---GYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQ 334
+ Y+VSDCD+V YD H+ + +AAA ++ AG DLDCG + S VQ
Sbjct: 374 GYGFTEDFNYVVSDCDAVENVYDPHHYAANLTQAAAMSLNAGTDLDCGSSYNVLNAS-VQ 432
Query: 335 RGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGI 394
G+ +E ++ +L+ + +++G FD +P+ Y LG +V T + LA +AA G+
Sbjct: 433 AGMTTEATLDKSLIRLYSALIKVGWFD-QPAK--YSSLGWGNVNTTQTRALAHDAATGGM 489
Query: 395 VLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQ 454
LLKN G +LPLS + VAVIGP + T + GNYAG A PL + R +
Sbjct: 490 TLLKNDG-TLPLSPTL-QNVAVIGPWVNATTQLQGNYAGTAPVLVNPLTVFQQKWRNVKY 547
Query: 455 QGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKV 514
+ D F AAI A+ +D + + G+D S+E E DR + PG Q L+S++
Sbjct: 548 AQGTAINSQDTSGFNAAISAASSSDVIVYLGGIDISVENEGFDRTAITWPGNQLSLISQL 607
Query: 515 SMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKL 574
+ K P ++V GG ID + ++ ++ +I+WAGYPGQ GG A+ D+L G + P G+L
Sbjct: 608 ANLGK-PLVIVQFGGGQIDDSSLLSNSKVNSILWAGYPGQEGGNALFDVLTGANPPAGRL 666
Query: 575 PMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANA 634
P+T YP Y+ N + +M +RPS S PGRTY +Y G V PFG+G+ YTNF + +
Sbjct: 667 PITQYPANYVNNNNIQDMNLRPSGS--IPGRTYAWYTGTPVLPFGYGLHYTNFSVSFQST 724
Query: 635 PTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLG-VQVDVKNVGSK-----DGAHTL 688
T G+ ATI A + +R T + V+VKN G K D L
Sbjct: 725 KT---------SGTDVATIVNNA----GSNKDRATFATLVVNVKNTGGKANLASDYVGLL 771
Query: 689 LVFSTP--PAGHWAPHKQLVAFEKV-HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLG 745
+ ST PA H P+KQL A+ +V V GA Q++ + +++ L+ D +G R + G
Sbjct: 772 FLKSTNAGPAPH--PNKQLAAYGRVKKVGVGATQQLTLTVNLGS-LARADTNGDRWVYPG 828
Query: 746 EHNIHI 751
+ + +
Sbjct: 829 AYTLTL 834
>gi|449303062|gb|EMC99070.1| glycoside hydrolase family 3 protein [Baudoinia compniacensis UAMH
10762]
Length = 786
Score = 425 bits (1093), Expect = e-116, Method: Compositional matrix adjust.
Identities = 239/605 (39%), Positives = 335/605 (55%), Gaps = 12/605 (1%)
Query: 34 DATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHG 93
++T T P C SL R + L+ +L+E + A VPRLG+ YE W+EALHG
Sbjct: 51 NSTLSTTPVCNRSLSAWDRAHALVQLFTLEELANNTGNTAPGVPRLGLPAYEVWNEALHG 110
Query: 94 VSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYW 153
+S+ T G + ATSFP I + +S N TL IG ++S + RA N G GL +
Sbjct: 111 ISHGHFATN--GTWSWATSFPSPILSMASMNRTLINQIGDIISTQGRAFSNAGRYGLDSY 168
Query: 154 SPNVNIFRDPRWGRGQETPGEDP-VLSGKYAASYVRGLQGSDGDRL-KVAASCKHFTAYD 211
+PN+N FR P WGRGQETPGED LS YA Y+ G+QG + K+ A KHF YD
Sbjct: 169 APNINGFRSPVWGRGQETPGEDAFFLSSLYAYEYITGMQGGKAPAVPKLVAVPKHFAGYD 228
Query: 212 LDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
++NWN R + +++QD+ + FR + K +MCSYN VNGVP+C++ L
Sbjct: 229 IENWNNNSRLGLDVNITQQDLAGYYTPQFRSAIQNAKALGLMCSYNAVNGVPSCSNSFFL 288
Query: 272 KRTIRGEWRL-NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE 330
+ R W NG++ SDCD+V Y+ + + A AD++RAG D+DCG +
Sbjct: 289 QTLARDTWGFGNGFVSSDCDAVYNVYNPHGYAANTTGAVADSLRAGTDIDCGTSYPFYLV 348
Query: 331 SAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAA 390
A GL+S DI AL + + G FDG +S Y +LG DV T D ++ EAA
Sbjct: 349 PAFNAGLVSRNDIELALTRYYSGLVMQGYFDG--NSSLYRNLGWNDVLTTDAWNISYEAA 406
Query: 391 RQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYAR 450
+GI LLKN G +LPLS R+VA+IGP ++ T+ + GNY A +PLQ
Sbjct: 407 VEGITLLKNDG-TLPLSK-STRSVALIGPWANATLQLQGNYYAAAPYLISPLQAFRASGM 464
Query: 451 TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQEL 510
T++ ++ + F AI ++Q+D I G+D SIEAE LDR + PG Q +L
Sbjct: 465 TVNFVNGTTISSTNTSGFAEAITLAQQSDVIIYAGGIDNSIEAEGLDRQNITWPGNQLDL 524
Query: 511 VSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNP 570
+ ++S K P +++ M GG +D + KN+ ++ A++W GYPGQ+GG A+ DI+ G P
Sbjct: 525 IYQLSQVGK-PLVVLQMGGGQVDSSALKNNSKVNALVWGGYPGQSGGQALFDIIMGNRAP 583
Query: 571 GGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHT 630
G+L T YP Y T+ M M P G+TY +Y G VYPFGHG+ YTNF T
Sbjct: 584 AGRLVTTQYPASYATSFNQLNMNMAPVNGSL--GQTYMWYTGTPVYPFGHGLFYTNFTTT 641
Query: 631 VANAP 635
P
Sbjct: 642 STMGP 646
>gi|2791278|emb|CAA93248.1| beta-xylosidase [Trichoderma reesei]
gi|340519464|gb|EGR49702.1| glycoside hydrolase family 3 [Trichoderma reesei QM6a]
Length = 797
Score = 424 bits (1091), Expect = e-116, Method: Compositional matrix adjust.
Identities = 267/733 (36%), Positives = 399/733 (54%), Gaps = 29/733 (3%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C S +R LI +L+E + + VPRLG+ Y+ W+EALHG+ TK
Sbjct: 63 CDSSAGYVERAQALISLFTLEELILNTQNSGPGVPRLGLPNYQVWNEALHGLDRANFATK 122
Query: 103 FGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRD 162
GG F ATSFP I T ++ N TL I ++S +ARA N G GL ++PNVN FR
Sbjct: 123 -GGQFEWATSFPMPILTTAALNRTLIHQIADIISTQARAFSNSGRYGLDVYAPNVNGFRS 181
Query: 163 PRWGRGQETPGEDPV-LSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDR 220
P WGRGQETPGED LS Y Y+ G+QG D + LKVAA+ KHF YDL+NWN R
Sbjct: 182 PLWGRGQETPGEDAFFLSSAYTYEYITGIQGGVDPEHLKVAATVKHFAGYDLENWNNQSR 241
Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
F+A +++QD+ + + F K S+MC+YN VNGVP+CA+ L+ +R W
Sbjct: 242 LGFDAIITQQDLSEYYTPQFLAAARYAKSRSLMCAYNSVNGVPSCANSFFLQTLLRESWG 301
Query: 281 LN--GYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLL 338
GY+ SDCD+V ++ + S AAA ++RAG D+DCG H + G +
Sbjct: 302 FPEWGYVSSDCDAVYNVFNPHDYASNQSSAAASSLRAGTDIDCGQTYPWHLNESFVAGEV 361
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
S +I ++ +RLG FD Y LG KDV D ++ EAA +GIVLLK
Sbjct: 362 SRGEIERSVTRLYANLVRLGYFD---KKNQYRSLGWKDVVKTDAWNISYEAAVEGIVLLK 418
Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCK 458
N G +LPLS + R++A+IGP ++ T M GNY G A +PL+ + ++ +
Sbjct: 419 NDG-TLPLSK-KVRSIALIGPWANATTQMQGNYYGPAPYLISPLEAAKKAGYHVNFELGT 476
Query: 459 DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMAS 518
++A F AI A++++DA I + G+D +IE E DR + PG Q +L+ ++S
Sbjct: 477 EIAGNSTTGFAKAIAAAKKSDAIIYLGGIDNTIEQEGADRTDIAWPGNQLDLIKQLSEVG 536
Query: 519 KGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTW 578
K P +++ M GG +D + K++ ++ +++W GYPGQ+GG A+ DIL G P G+L T
Sbjct: 537 K-PLVVLQMGGGQVDSSSLKSNKKVNSLVWGGYPGQSGGVALFDILSGKRAPAGRLVTTQ 595
Query: 579 YPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
YP EY+ P +M +RP K PG+TY +Y G VY FG G+ YT F T+A+ P +
Sbjct: 596 YPAEYVHQFPQNDMNLRPD-GKSNPGQTYIWYTGKPVYEFGSGLFYTTFKETLASHPKSL 654
Query: 639 AVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF----STP 694
+ ++ +S T+++ + ++KN G + +T ++F +
Sbjct: 655 KF-------NTSSILSAPHPGYTYSE-QIPVFTFEANIKNSGKTESPYTAMLFVRTSNAG 706
Query: 695 PAGHWAPHKQLVAFEKV-HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
PA + P+K LV F+++ + G ++ I I V L+ VD G R + G++ + +
Sbjct: 707 PAPY--PNKWLVGFDRLADIKPGHSSKLSIPIPVSA-LARVDSHGNRIVYPGKYELAL-N 762
Query: 754 TKHSVSLHAATLG 766
T SV L +G
Sbjct: 763 TDESVKLEFELVG 775
>gi|220927661|ref|YP_002504570.1| glycoside hydrolase [Clostridium cellulolyticum H10]
gi|219997989|gb|ACL74590.1| glycoside hydrolase family 3 domain protein [Clostridium
cellulolyticum H10]
Length = 712
Score = 424 bits (1091), Expect = e-116, Method: Compositional matrix adjust.
Identities = 265/746 (35%), Positives = 394/746 (52%), Gaps = 92/746 (12%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
+ SL +R DL+ R++L+EK L A V RLGI Y WW+EALHGV+ G
Sbjct: 6 YLDKSLSFKERAVDLVSRMTLEEKASQLRYDAQPVERLGIPRYNWWNEALHGVARAGV-- 63
Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYW 153
AT FPQ I A+ F+ E I V++ E RA YN + G+T+W
Sbjct: 64 --------ATVFPQAIGLAAIFDDEFLEKIADVIATEGRAKYNESSKKGDRDIYKGITFW 115
Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLD 213
SPNVNIFRDPRWGRG ET GEDP L+ + ++V+GLQG DG LK AA KHF +
Sbjct: 116 SPNVNIFRDPRWGRGHETYGEDPYLTSRLGVAFVKGLQG-DGKYLKSAACAKHFAVH--- 171
Query: 214 NWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKR 273
+ DR HFNA S++D+ +T+ F V E KV SVM +YN+ NG P +LK
Sbjct: 172 SGPEDDRHHFNAVASQKDMYETYLPAFEALVKEAKVESVMGAYNRTNGEPCNGSKTLLKD 231
Query: 274 TIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAV 333
+R +W +G++VSDC ++ +++ T TP E+ A A++ G DL+CG + L A+
Sbjct: 232 ILRDDWGFDGHVVSDCWAIKDFHEGHGVTKTPTESVALALKNGCDLNCGN-MYLLILLAL 290
Query: 334 QRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVC-TPDHQELALEAARQ 392
+ G ++E DI+ A + +T +M+LGMFD + P +V + +H +L+LEAAR+
Sbjct: 291 KEGKITEEDIDRAAIRLMTTRMKLGMFDDDCEFDKI----PYEVNDSIEHNKLSLEAARK 346
Query: 393 GIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG----IGRY 448
+VLLKN G LPL + + +AVIGPN+D ++ + NY+G T L G +
Sbjct: 347 SMVLLKNNGL-LPLDSKKIKNIAVIGPNADSSLALRANYSGTPSHNITILDGVRSRVSED 405
Query: 449 ARTIHQQGC-------KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL----- 496
R + G +D+A DD+L A+ + ++D +L +GLD S+E E
Sbjct: 406 TRVWYSLGSHLFMNREEDLAQPDDRL-KEAVSMAERSDVVVLCLGLDASVEGEQNDQGTV 464
Query: 497 ------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
D+A L LP Q+ L++ V +A+ PTI+ L+SG + + A + + AAI+
Sbjct: 465 ILDAGGDKADLNLPESQRNLLNAV-LATGKPTIVALLSGSALSIGDAAD--KAAAIVQCW 521
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFY 610
YPG GG A A+++FG +P G+LP+T+Y LP P + RTY+F
Sbjct: 522 YPGSKGGLAFAEMIFGDYSPAGRLPVTFYKS--TEELP-------PFEDYSMENRTYKFM 572
Query: 611 KGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTL 670
KG +YPFG G+SYTNF ++ P V N +L
Sbjct: 573 KGEALYPFGFGLSYTNFEYSNIVCPQAVN--------------------------NGESL 606
Query: 671 GVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCK 729
V VDV+N GS D + V+ A P+ L F+++ + +G ++ V I +
Sbjct: 607 SVSVDVQNAGSVDSDEVVQVYIKDMEASVRVPNHSLCGFKRIFLKSGEKKTVTFEID-SR 665
Query: 730 YLSVVDRSGTRRIPLGEHNIHIGGTK 755
+++VD G R I G+ +++GG +
Sbjct: 666 AMTIVDEEGKRYIENGDFTLYVGGAQ 691
>gi|164429277|ref|XP_958209.2| hypothetical protein NCU09923 [Neurospora crassa OR74A]
gi|16945419|emb|CAB91343.2| related to xylan 1, 4-beta-xylosidase [Neurospora crassa]
gi|157073010|gb|EAA28973.2| hypothetical protein NCU09923 [Neurospora crassa OR74A]
Length = 774
Score = 424 bits (1089), Expect = e-115, Method: Compositional matrix adjust.
Identities = 264/713 (37%), Positives = 381/713 (53%), Gaps = 32/713 (4%)
Query: 14 LLSASSSGLAARE-PFACDPK--DATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLI 70
LLS +S+ ++A + PF P + +L C +L PQR L+ ++ +EK++ L+
Sbjct: 10 LLSCTSALVSAIDLPFQTYPDCVNGPLASLKVCDATLSPPQRAAALVAAMTTEEKLQNLV 69
Query: 71 SGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKF---GGDFPGATSFPQVITTASSFNATL 127
S + PR+G+ Y WWSEALHGV+ PGT+F G F +TSFP + A++F+ L
Sbjct: 70 SKSKGAPRIGLPAYNWWSEALHGVA-YAPGTQFRSGDGPFNSSTSFPMPLLMAATFDDEL 128
Query: 128 WEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYV 187
E +G V+ E RA N G +G YW+PNVN F+DPRWGRG ETPGED + +YAAS +
Sbjct: 129 IEKVGEVIGTEGRAFGNAGFSGFDYWTPNVNPFKDPRWGRGSETPGEDILRIKRYAASMI 188
Query: 188 RGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEG 247
RGLQG +R +V A+CKH+ A D ++WNG R F+AKV+ QD+ + + PF+ C +
Sbjct: 189 RGLQGPLPER-RVVATCKHYAANDFEDWNGSTRHDFDAKVTLQDLAEYYLSPFQQCARDS 247
Query: 248 KVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNG---YIVSDCDSVGVYYDTQHFTST 304
KV S+MCSYN VNGVP CA+ +++ +R W YI SDC++V + H+ T
Sbjct: 248 KVGSIMCSYNAVNGVPACANTYLMQTILREHWNWTAPGNYITSDCEAVLDIFANHHYAKT 307
Query: 305 PEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP 364
E A A AG D C A +GLL + ++ AL +R+G FDG
Sbjct: 308 NAEGTALAFEAGTDSSCEYESSSDIPGAWTQGLLEQSTVDRALTRLYEGLVRVGYFDGNH 367
Query: 365 SSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKN-QGPSLPLSHIRHRTVAVIGPNSDV 423
S Y LG KDV +P QE+AL+ A +GIVLLKN Q L L +A+IG ++
Sbjct: 368 SE--YASLGWKDVNSPKSQEVALQTAVEGIVLLKNDQTLPLGLKTDPKSKLAMIGFWAND 425
Query: 424 TVTMIGNYAGIACGYTTPLQGIGRYARTIHQQG--CKDVACADDQLFGAAIDASRQADAT 481
T+ G Y+G +P+ + G + ++D AA++A++ A+
Sbjct: 426 PKTLSGGYSGKPAFEHSPVYAAEAMGFNVTTAGGPVLQNSTSNDTWTQAALEAAQDANYI 485
Query: 482 ILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDP 541
+ GLD S E DR + P Q +L+ ++ K P ++V M G +D
Sbjct: 486 LYFGGLDTSAAGETKDRTTINWPEAQLQLIKTLTKLGK-PLVVVQM-GDQLDNTPLLATK 543
Query: 542 RIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKR 601
+ +I+WA +PGQ GGTA+ IL G +P G+LP+T YP Y +PMT+M +RPS R
Sbjct: 544 TVNSILWANWPGQDGGTAVMQILTGLKSPAGRLPVTQYPANYTAAVPMTDMNLRPSD--R 601
Query: 602 YPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANA-PTVVAVPLDGRHGSINATISGKAIKV 660
PGRTYR+Y V PFG G+ YT F +A P + L R G NA +
Sbjct: 602 LPGRTYRWYPT-AVQPFGFGLHYTTFQAKIAAPLPRLAIQDLLSRCGGDNANAYPDTCAL 660
Query: 661 THAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG-HWAPHKQLVAFEKVH 712
K V+V N G++ + +L F AG P K LV++ ++
Sbjct: 661 PPLK---------VEVTNSGNRSSDYVVLAFLAGDAGPRPYPIKTLVSYTRLR 704
>gi|358397360|gb|EHK46735.1| glycoside hydrolase family 3 protein [Trichoderma atroviride IMI
206040]
Length = 865
Score = 423 bits (1088), Expect = e-115, Method: Compositional matrix adjust.
Identities = 264/722 (36%), Positives = 395/722 (54%), Gaps = 39/722 (5%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C +L + +R ++ ++L EKV + S A+ RLG+ Y+W +EALHGV+ G +
Sbjct: 144 CDTTLSMAERAAAIVKPMTLDEKVANVGSSASGSARLGLPAYQWQNEALHGVAG-STGVQ 202
Query: 103 F----GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVN 158
F G +F ATSFP I +++F+ L + + +S EARA N G AGL +W+PN+N
Sbjct: 203 FQSPLGANFSAATSFPMPILLSAAFDDALVQNVATAISTEARAFANYGFAGLDFWTPNIN 262
Query: 159 IFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNG 217
FRDPRWGRG ETPGED Y + + GLQG + D ++ A+CKHF AYD++N
Sbjct: 263 PFRDPRWGRGMETPGEDAFRIQGYVLALISGLQGGINPDFFRIIATCKHFAAYDIEN--- 319
Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
R N ++QD+ D + F CV + KV SVMC+YN V+G+P CA +L+ +R
Sbjct: 320 -GRTGNNLNPTQQDMADYYLPMFETCVRDAKVGSVMCAYNAVDGIPACASEYLLQDVLRD 378
Query: 278 EWRLN---GYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQ 334
+ Y+VSDCD+V +D H+ S EAAA ++ AG DLDCG + S V+
Sbjct: 379 GFGFTEDFNYVVSDCDAVDNVFDPHHYASNLTEAAALSLNAGTDLDCGSSYNVLNAS-VE 437
Query: 335 RGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGI 394
L SE +N +LV + +++G FD +PS Y L +V T +Q LA +AA G+
Sbjct: 438 AALTSEAALNQSLVRLYSALIKVGYFD-QPSE--YKSLSWANVNTTQNQALAHDAATGGM 494
Query: 395 VLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI-GRYARTIH 453
LLKN G +LPLS VA+IGP + T M GNYAG A PL ++ +
Sbjct: 495 TLLKNDG-TLPLSRTLSN-VAIIGPWVNATTQMQGNYAGTAPFLVNPLDVFQQKWGNVKY 552
Query: 454 QQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSK 513
QG + D F AA+ A+ +D + + G+D ++E E DR ++ PG Q +L+S+
Sbjct: 553 AQGTA-INSQDTSGFSAALSAASSSDVIVYLGGIDITVENEGFDRGSIVWPGNQLDLISQ 611
Query: 514 VSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGK 573
++ K P ++V GG ID + ++P + +I+WAGYPGQ GG A+ D+L G + P G+
Sbjct: 612 LANLGK-PLVIVQFGGGQIDDSSLLSNPNVRSILWAGYPGQDGGNAVFDVLTGANPPAGR 670
Query: 574 LPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNF---VHT 630
LP+T YP YI N + +M +RPS PGRTY +Y G V PFG+G+ YTNF +
Sbjct: 671 LPITQYPASYINNNNIQDMNLRPSNG--IPGRTYAWYTGTPVLPFGYGLHYTNFSVSFQS 728
Query: 631 VANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLV 690
+ A T VA ++ I+ ++ + H + L + D + +
Sbjct: 729 INTAGTDVATIVNNAGAVIDTSVFATLVVSVHNTGGKANL----------ASDYVGLVFL 778
Query: 691 FSTPPAGHWAPHKQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNI 749
ST P+KQL A+ + V GA Q++ + I++ L+ D +G R I G++ +
Sbjct: 779 SSTNAGPSPYPNKQLAAYGRAKSVGVGATQQLTLKINLGS-LARADTNGDRWIYPGDYKL 837
Query: 750 HI 751
+
Sbjct: 838 TL 839
>gi|336471692|gb|EGO59853.1| hypothetical protein NEUTE1DRAFT_99999 [Neurospora tetrasperma FGSC
2508]
gi|350292807|gb|EGZ74002.1| glycoside hydrolase [Neurospora tetrasperma FGSC 2509]
Length = 770
Score = 423 bits (1087), Expect = e-115, Method: Compositional matrix adjust.
Identities = 270/716 (37%), Positives = 387/716 (54%), Gaps = 38/716 (5%)
Query: 14 LLSASSSGLAARE-PFACDPK--DATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLI 70
LLS SS ++A + PF P + +L C V+L PQR L+ ++ +EK++ L+
Sbjct: 10 LLSCSSVLVSAIDLPFQTYPDCVNGPLASLKVCDVTLSPPQRAAALVAAMTTEEKLQNLV 69
Query: 71 SGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKF---GGDFPGATSFPQVITTASSFNATL 127
S + PR+G+ Y WWSEALHGV+ PGT+F G F +TSFP + A++F+ L
Sbjct: 70 SKSKGAPRIGLPAYNWWSEALHGVA-YAPGTQFWSGDGPFNASTSFPMPLLMAATFDDEL 128
Query: 128 WEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYV 187
E +G V+ E RA N G +G YW+PNVN F+DPRWGRG ETPGED + +YAAS +
Sbjct: 129 IEKVGEVIGTEGRAFGNAGFSGFDYWTPNVNPFKDPRWGRGSETPGEDILRIKRYAASMI 188
Query: 188 RGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEG 247
RGLQG +R +V A+CKH+ A D ++WNG R FNAKV+ QD+ + + PF+ C +
Sbjct: 189 RGLQGPARER-RVVATCKHYAANDFEDWNGSTRHDFNAKVTLQDLAEYYLSPFQQCARDS 247
Query: 248 KVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNG---YIVSDCDSVGVYYDTQHFTST 304
KV S+MCSYN VNGVP CA+ +++ +R W YI SDC++V H+ T
Sbjct: 248 KVGSIMCSYNAVNGVPACANTYLMQTILREHWNWTAPGNYITSDCEAVLDISANHHYAET 307
Query: 305 PEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP 364
E A A AG+D C A +GLL + ++ AL +R+G FDG
Sbjct: 308 NAEGTALAFEAGIDSSCEYESSSDIPGAWTQGLLEQSTVDRALKRIYEGLVRVGYFDGNH 367
Query: 365 SSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIR---HRTVAVIGPNS 421
S Y LG KDV +P QE+AL+AA +GIVLLKN +LPL +R +A+IG +
Sbjct: 368 SE--YASLGWKDVNSPKSQEVALQAAVEGIVLLKNDK-TLPLD-LRTDPKSKLAMIGFWA 423
Query: 422 DVTVTMIGNYAGIACGYTTPL---QGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQA 478
+ T+ G Y+G +P+ Q +G ++ T + ++D AA++A++ A
Sbjct: 424 NDPKTLSGGYSGKPAFEHSPVYAAQAMG-FSVTTAGGPVLQNSTSNDTWTQAALEAAKDA 482
Query: 479 DATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAK 538
+ + G D S E DR + P Q +L++ +S K P ++V M G +D
Sbjct: 483 NYILYFGGQDTSAAGETKDRTTINWPEAQLQLITTLSKLGK-PLVVVQM-GDQLDNTPLL 540
Query: 539 NDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQ 598
+ AI+WA + GQ GGTA+ IL G NP G+LP+T YP Y +PMT+M +RPS
Sbjct: 541 AAKAVNAILWANWLGQDGGTAVMQILTGLKNPAGRLPVTQYPANYTAAVPMTDMNLRPSD 600
Query: 599 SKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVA-NAPTVVAVPLDGRHGSINATISGKA 657
+ PGRTYR+Y V PFG G+ YT F +A P + L R G NA
Sbjct: 601 --KLPGRTYRWYPT-AVQPFGFGLHYTTFQTKIAVPLPRLAIQDLLSRCGGDNANAYPDT 657
Query: 658 IKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG-HWAPHKQLVAFEKVH 712
+ K V+V N G++ + +L F G P K LV++ ++
Sbjct: 658 CALPPLK---------VEVTNSGNRSSDYVVLAFLAGDVGPKPYPIKTLVSYTRLR 704
>gi|326202986|ref|ZP_08192853.1| glycoside hydrolase family 3 domain protein [Clostridium
papyrosolvens DSM 2782]
gi|325987063|gb|EGD47892.1| glycoside hydrolase family 3 domain protein [Clostridium
papyrosolvens DSM 2782]
Length = 712
Score = 422 bits (1086), Expect = e-115, Method: Compositional matrix adjust.
Identities = 265/747 (35%), Positives = 395/747 (52%), Gaps = 94/747 (12%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
+ SL +R DL+ +++L+EK L A V RLGI Y WW+EALHGV+ G
Sbjct: 6 YLDKSLSFKERAADLVSKMTLEEKASQLRYDAQPVERLGIPRYNWWNEALHGVARAGV-- 63
Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYW 153
AT FPQ I A+ F+ E I V++ E RA YN G+T+W
Sbjct: 64 --------ATVFPQAIGMAAMFDDEFLEKIADVIATEGRAKYNESAKKGDRDIYKGITFW 115
Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLD 213
SPNVNIFRDPRWGRG ET GEDP L+ + ++V+GLQG DG LK AA KH+ +
Sbjct: 116 SPNVNIFRDPRWGRGHETYGEDPYLTSRLGVAFVKGLQG-DGKYLKTAACAKHYAVH--- 171
Query: 214 NWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKR 273
+ DR F+A VS++D+ +T+ F V E KV S+M +YN+ NG P +LK
Sbjct: 172 SGPEDDRHFFDAIVSQKDLYETYLPAFEALVKEAKVESIMGAYNRTNGEPCNGSKTLLKD 231
Query: 274 TIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAV 333
+R W +G++VSDC ++ +++ T TP E+ A A+++G DL+CG L A+
Sbjct: 232 ILRDGWGFDGHVVSDCWAIKDFHEGHGVTKTPTESVALALKSGCDLNCGNMY-LLILLAL 290
Query: 334 QRGLLSEIDINNALVNTLTVQMRLGMFDG--EPSSQPYGHLGPKDVCTPDHQELALEAAR 391
+ GL++E DI+ A + +T +M+LGMFD E + PY L + +H +++LEAA+
Sbjct: 291 KEGLITEEDIDRAAIRLMTTRMKLGMFDDDCEFDNIPY-ELND----SAEHNKISLEAAK 345
Query: 392 QGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY--- 448
+ +VLLKN G LPL + + VAVIGPN+D ++ + NY+G T ++GI +
Sbjct: 346 KSMVLLKNDGL-LPLDSKKIKNVAVIGPNADSSLALRANYSGTPSQNVTIIEGIRKRVSE 404
Query: 449 -ARTIHQQGC-------KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL---- 496
R + G +D+A DD+L A+ A+ ++D +L +GLD S+E E
Sbjct: 405 NTRVWYAMGSHLFLNRDEDLAQPDDRL-KEAVSAAERSDVVVLCLGLDASVEGEQNDQGT 463
Query: 497 -------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWA 549
D+A L LP Q+ L++ V +A+ PTI+ L+SG + + A + + AAI+
Sbjct: 464 VILDAGGDKADLNLPESQRNLLNAV-LATGKPTIVALLSGSALSIGDAAD--KAAAIVQC 520
Query: 550 GYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRF 609
YPG GG A A+++FG +P G+LP+T+Y LP P RTY+F
Sbjct: 521 WYPGAIGGLAFAEMIFGDYSPAGRLPVTFYKS--TEELP-------PFADYSMENRTYKF 571
Query: 610 YKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT 669
KG +YPFG G+SYT+F ++ P V N
Sbjct: 572 MKGDALYPFGFGLSYTSFEYSNMVCPQTVN--------------------------NGEN 605
Query: 670 LGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVC 728
L V VDV+N GS D + V+ A P L F+++H+ +G ++ V +
Sbjct: 606 LSVSVDVQNTGSVDSDEVVQVYIKDMDASVRVPKYSLCGFKRIHLKSGEKKTVTFEV-AS 664
Query: 729 KYLSVVDRSGTRRIPLGEHNIHIGGTK 755
+S+VD +G R I GE ++ GG++
Sbjct: 665 NAMSIVDEAGKRHIENGEFTLYAGGSQ 691
>gi|255690205|ref|ZP_05413880.1| xylosidase/arabinosidase [Bacteroides finegoldii DSM 17565]
gi|260624224|gb|EEX47095.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
finegoldii DSM 17565]
Length = 1425
Score = 422 bits (1084), Expect = e-115, Method: Compositional matrix adjust.
Identities = 264/749 (35%), Positives = 392/749 (52%), Gaps = 77/749 (10%)
Query: 38 RTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNV 97
+ PF L I QRV+DL+ RL+L+EKV+ +++ A A+ RLGI Y WW+E LHGV
Sbjct: 710 KRYPFRNPQLSIEQRVDDLVSRLTLEEKVRQMLNNAPAIKRLGIPAYNWWNECLHGVGR- 768
Query: 98 GPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------G 149
TK+ T FPQ I A+S+N L + + ++DE RA+YN
Sbjct: 769 ---TKYH-----VTVFPQAIGMAASWNDVLMKEVASSIADEGRAIYNDAQKRGDYSQYHA 820
Query: 150 LTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTA 209
LTYW+PN+NIFRDPRWGRGQET GEDP L+ K ++V GLQG D LK +A KH+
Sbjct: 821 LTYWTPNINIFRDPRWGRGQETYGEDPYLTSKIGKAFVLGLQGDDPRYLKASACAKHYAV 880
Query: 210 YDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
+ N R FN+ VS D+ DT+ FR V++ V+ VMC+YN G P C +
Sbjct: 881 HSGPEKN---RHSFNSDVSTYDLWDTYLPAFRTLVVDANVSGVMCAYNAFKGQPCCGNDL 937
Query: 270 ILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHT 329
+++ +R +W GY+ SDC ++ ++ AAADA+ G DLDCG L
Sbjct: 938 LMQSILRDKWNFKGYVTSDCGAIDDIFNHHKAHPDAATAAADAVFHGTDLDCGQSAYLAL 997
Query: 330 ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALE 388
AV+ G+++E ++ ++ T++ RLG+FD P+ Q Y H+ + HQ+LA +
Sbjct: 998 VKAVKNGIITEKQLDVSVKRLFTIRFRLGLFD--PAEQVDYAHIPISVLECKKHQDLAKQ 1055
Query: 389 AARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGR- 447
AR+ +VLLKN LPL + + V V+GPN+D ++GNY G TPLQ I
Sbjct: 1056 LARESMVLLKND-RLLPLQKNKLKKVVVMGPNADCKDALLGNYNGHPSRMLTPLQAIRER 1114
Query: 448 ---YARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL-------- 496
A ++ G + + ++ ++ ADA I + G+ +E E +
Sbjct: 1115 LKGVAEVVYVSGIDYINTVSEDELKRYVNQAKGADAVIFIGGISPRLEGEEMSVNKDGFD 1174
Query: 497 --DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQ 554
DR + LP Q +L+ K +A + PT+ V+M+G + + + + AI+ A Y GQ
Sbjct: 1175 GGDRTSIALPTVQTQLM-KALVAGRIPTVFVMMTGSALAIPWEAK--HVPAILNAWYGGQ 1231
Query: 555 AGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPV 614
GG AIAD+LFG NP GKLP+T+Y ++ ++LP E S GRTYR++KG
Sbjct: 1232 YGGEAIADVLFGDYNPSGKLPVTFYAKD--SDLPDFE-------SYDMQGRTYRYFKGKA 1282
Query: 615 VYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQV 674
+YPFG+G+SYT+F ++ PT + K I VT V
Sbjct: 1283 LYPFGYGLSYTDFRYSSLKMPTA-------------CNTTDKEIPVT------------V 1317
Query: 675 DVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSV 733
VKN G DG + L S P P L F+++++ AG +++ ++ + LS
Sbjct: 1318 TVKNTGKMDGEEVVQLYVSHPDKKILVPVTALKGFKRIYLKAGEAKQITFSLS-SEDLSC 1376
Query: 734 VDRSGTRRIPLGEHNIHIGGTKHSVSLHA 762
VD +G R++ G I +GG +L A
Sbjct: 1377 VDENGIRKVLPGTVKIQVGGCSPVATLTA 1405
>gi|292495632|sp|Q0CMH8.2|XYND_ASPTN RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
Length = 793
Score = 420 bits (1079), Expect = e-114, Method: Compositional matrix adjust.
Identities = 266/724 (36%), Positives = 394/724 (54%), Gaps = 27/724 (3%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C S R L+ +L+E V + VPRLG+ Y+ WSE+LHGV ++
Sbjct: 63 CDKSARPHDRAAALVSMFTLEELVNNTGNTGTGVPRLGLPKYQVWSESLHGVYRANWASE 122
Query: 103 FGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRD 162
GD+ ATSFPQ I T ++ N TL IG ++S +ARA N G GL ++PN+N FR
Sbjct: 123 --GDYSWATSFPQPILTMAALNRTLIHQIGDILSTQARAFSNVGRYGLDTYAPNINSFRH 180
Query: 163 PRWGRGQETPGEDP-VLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDR 220
P WGRGQETPGED L+ YA Y+ G+QG D + LK+ A+ KH+ YD++NW+G R
Sbjct: 181 PVWGRGQETPGEDAYYLASTYAYEYITGIQGGVDPETLKLVATAKHYAGYDIENWDGHSR 240
Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
+ ++++QD+ + + F + + KV SVMCSYN VNGVP+C++ L+ +R +
Sbjct: 241 LGNDMQITQQDLSEYYTPQFLVSARDAKVHSVMCSYNAVNGVPSCSNSFFLQTLLRETFG 300
Query: 281 L--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLL 338
+GY+ DC +V ++ + + A+AD+IRAG D+DCG H +A G +
Sbjct: 301 FVEDGYVSGDCGAVYNAFNPHEYAANESSASADSIRAGTDIDCGTSYQYHFTNAFDEGEI 360
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
S DI ++ T +RLG FDG +S Y L DV T D ++ EAA +G VLLK
Sbjct: 361 SRQDIERGVIRLYTNLVRLGYFDG--NSSQYRDLTWSDVQTTDAWNISHEAAVEGTVLLK 418
Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCK 458
N G +LPL+ R+VA+IGP ++ T M GNY G A T+PL + +H
Sbjct: 419 NDG-TLPLAD-SIRSVALIGPWANATTQMQGNYYGPAPYLTSPLAALEASDLDVHYAFGT 476
Query: 459 DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMAS 518
+++ F A+ A+R+ADA I G+D +IE EALDR + PG Q +L++++S
Sbjct: 477 NISSTTTAGFADALAAARKADAIIFAGGIDNTIEGEALDRMNITWPGNQLDLINQLSALG 536
Query: 519 KGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTW 578
K P +++ M GG +D + K++ ++A++W GYPGQ+GGTA+ DI+ G P G+L T
Sbjct: 537 K-PLVVLQMGGGQVDSSALKHNTNVSALLWGGYPGQSGGTALLDIIRGVRAPAGRLVTTQ 595
Query: 579 YPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
YP Y T P +M +RP+ + PG+TY +Y G VY FGHG+ YT F A+ T
Sbjct: 596 YPAGYATQFPAIDMGLRPNGTN--PGQTYMWYTGTPVYEFGHGLFYTTFEAKRASTAT-- 651
Query: 639 AVPLDGRHGSINATISGKAIKVTHAKCN-RLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG 697
H S N A +A R L + N G +T ++F+ AG
Sbjct: 652 ------NHSSFNIEDLLTAPHPGYAYPQLRPFLNFTAHITNTGRTTSDYTAMLFANTTAG 705
Query: 698 HWAPH--KQLVAFEKV-HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGT 754
APH K LV F+++ + GA Q + I + ++ D G R + G + + +
Sbjct: 706 P-APHPNKWLVGFDRLGALEPGASQTMTFPITIDN-VARTDELGNRVLYPGRYELALNNE 763
Query: 755 KHSV 758
+ V
Sbjct: 764 RSVV 767
>gi|154313073|ref|XP_001555863.1| hypothetical protein BC1G_05538 [Botryotinia fuckeliana B05.10]
Length = 755
Score = 420 bits (1079), Expect = e-114, Method: Compositional matrix adjust.
Identities = 273/704 (38%), Positives = 386/704 (54%), Gaps = 60/704 (8%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C S R LI +L EKV + + VPR+G+ YEWW+EALHG++ PGT
Sbjct: 34 CDTSSDPYTRAAALISLFTLAEKVNNTGNTSPGVPRIGLPSYEWWNEALHGIAR-SPGTT 92
Query: 103 F---GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNI 159
F G ++ +TSFPQ I ++F+ L + VS EARA N GL +W+PN+N
Sbjct: 93 FAATGSNYSYSTSFPQPILMGATFDDELIHKVATQVSTEARAFNNVNRFGLNFWTPNINP 152
Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD--RLKVAASCKHFTAYDLDNWNG 217
++DPRWGRGQETPGEDP + Y + + GLQG D K A+CKHF YDL+N +G
Sbjct: 153 YKDPRWGRGQETPGEDPFHTSSYVNALITGLQGGLDDLPYKKGVATCKHFAGYDLENSDG 212
Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
R+ F+A + QD+ D + PF+ C + V SVMCSYN +NGVPTCAD +L+ +R
Sbjct: 213 AIRYGFDAIIKSQDLRDYYLPPFQQCARDSNVQSVMCSYNAMNGVPTCADDWLLQTLLRE 272
Query: 278 EW---RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQ 334
W + ++ SDCD+V +D ++T TPE++AADA+ AG DLDCG F + SA
Sbjct: 273 HWGWTEEDQWVTSDCDAVKNIWDYHNYTLTPEQSAADALNAGTDLDCGTFWPTYLGSAYD 332
Query: 335 RGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGI 394
+GL ++ +L +RLG FD PS QPY L +V TP Q+LAL+AA GI
Sbjct: 333 QGLYDISTLDRSLARRYASLVRLGYFD-PPSVQPYRQLNWDNVSTPAAQQLALQAAEDGI 391
Query: 395 VLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL---QGIGRYART 451
VLLKN G LPLS VA+IGP ++ T M GNY G A +PL Q G +
Sbjct: 392 VLLKNDG-ILPLSS-NITNVALIGPLANATKQMQGNYYGTAPYLRSPLIAAQNAG--FKV 447
Query: 452 IHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELV 511
+ QG D+ + F AAI A++ AD I V G+D SIEAE E++
Sbjct: 448 TYVQGA-DIDSQNTTDFSAAISAAQSADLVIYVGGIDNSIEAE--------------EIL 492
Query: 512 SKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPG 571
+ +S P I+ M G ID + ++ + A++WAGYPGQ GGTAI +IL G + P
Sbjct: 493 ANLST----PLIISQM-GCMIDSSSLLSNTGVNALLWAGYPGQDGGTAIFNILTGKTAPA 547
Query: 572 GKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTV 631
G+LP+T YP Y+ + MT+M ++PS+ PGRTY++Y G V+ +G+G+ YT F +
Sbjct: 548 GRLPITQYPSNYVNQVTMTDMNLQPSRFN--PGRTYKWYNGEPVFEYGYGLQYTTFDAKI 605
Query: 632 ANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT--LGVQVDVKNVGSKDGAHTLL 689
+ S N T + + LT + + + V N G+ + L
Sbjct: 606 TPS-------------SPNNTFEISELLANASNYKDLTPFVKIPITVSNTGTTTSDYVAL 652
Query: 690 VF---STPPAGHWAPHKQLVAFEKVH-VPAGAQQRVGINIHVCK 729
F + PA H P K LVA+ ++H + GA +++++
Sbjct: 653 FFLSGTFGPAPH--PKKSLVAYTRLHDITGGANATAEVSLNLAS 694
>gi|238483831|ref|XP_002373154.1| beta-xylosidase XylA [Aspergillus flavus NRRL3357]
gi|292495283|sp|B8MYV0.1|XYND_ASPFN RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|220701204|gb|EED57542.1| beta-xylosidase XylA [Aspergillus flavus NRRL3357]
Length = 797
Score = 418 bits (1074), Expect = e-114, Method: Compositional matrix adjust.
Identities = 258/729 (35%), Positives = 383/729 (52%), Gaps = 32/729 (4%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C S R L+ L+ +E V + PR+G+ Y+ W+EALHGV++
Sbjct: 63 CDTSAKPHDRAAALVSLLTFEELVNNTANTGHGAPRIGLPAYQVWNEALHGVAHADFSD- 121
Query: 103 FGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRD 162
GDF +TSFPQ I+T ++ N TL I ++S + RA N G GL +SPN+N FR
Sbjct: 122 -AGDFSWSTSFPQPISTMAALNRTLIHQIATIISTQGRAFMNAGRYGLDVYSPNINTFRH 180
Query: 163 PRWGRGQETPGEDP-VLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDR 220
P WGRGQETPGED L+ YA Y+ G+QG D + LK+ A+ KH+ YD++NW+ R
Sbjct: 181 PVWGRGQETPGEDAYCLASTYAYEYITGIQGGVDANPLKLIATAKHYAGYDIENWDNHSR 240
Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
+ ++++QD+ + + F + + KV SVMCSYN VNGVP+C++ L+ +R +
Sbjct: 241 LGNDMQITQQDLAEYYTPQFLVASRDAKVHSVMCSYNAVNGVPSCSNSFFLQTLLRDTFD 300
Query: 281 L--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLL 338
+GY+ DC +V ++ + + AAAD+IRAG D+DCG H + + +
Sbjct: 301 FVEDGYVSGDCGAVYNVFNPHGYATNESSAAADSIRAGTDIDCGVSYPRHFQESFHDQEV 360
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
S D+ + +R G FDG+ S PY ++ DV + + Q L+ EAA Q IVLLK
Sbjct: 361 SRQDLERGVTRLYASLIRAGYFDGKTS--PYRNITWSDVVSTNAQNLSYEAAAQSIVLLK 418
Query: 399 NQGPSLPL-SHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGC 457
N G LPL + +T+A+IGP ++ T M+GNY G A +PLQ I
Sbjct: 419 NDG-ILPLTTSSSTKTIALIGPWANATTQMLGNYYGPAPYLISPLQAFQDSEYKITYTIG 477
Query: 458 KDVACADDQLF-GAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSM 516
+ D A+ +++AD I G+D ++E EA DR+ + P Q L++K++
Sbjct: 478 TNTTTDPDSTSQSTALTTAKEADLIIFAGGIDNTLETEAQDRSNITWPSNQLSLITKLAD 537
Query: 517 ASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPM 576
K P I++ M GG +D + KN+ + A+IW GYPGQ+GG A+ADI+ G P +L
Sbjct: 538 LGK-PLIVLQMGGGQVDSSALKNNKNVNALIWGGYPGQSGGQALADIITGKRAPAARLVT 596
Query: 577 TWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
T YP EY P +M +RP+ S PG+TY +Y G VY FGHG+ YTNF T
Sbjct: 597 TQYPAEYAEVFPAIDMNLRPNGSN--PGQTYMWYTGTPVYEFGHGLFYTNF--------T 646
Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRLT----LGVQVDVKNVGSKDGAHTLLVFS 692
A G + I + + H + L VDVKN G + +T + F
Sbjct: 647 ASASAGSGTKNRTSFNID-EVLGRPHPGYKLVEQMPLLNFTVDVKNTGDRVSDYTAMAFV 705
Query: 693 TPPAGHWAPH--KQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNI 749
AG APH K LV F+++ V G+ + + I + V L+ D G R + G + +
Sbjct: 706 NTTAGP-APHPNKWLVGFDRLSAVEPGSAKTMVIPVTV-DSLARTDEEGNRVLYPGRYEV 763
Query: 750 HIGGTKHSV 758
+ + V
Sbjct: 764 ALNNEREVV 772
>gi|440472411|gb|ELQ41274.1| beta-xylosidase [Magnaporthe oryzae Y34]
gi|440484691|gb|ELQ64724.1| beta-xylosidase [Magnaporthe oryzae P131]
Length = 792
Score = 417 bits (1073), Expect = e-114, Method: Compositional matrix adjust.
Identities = 266/716 (37%), Positives = 381/716 (53%), Gaps = 50/716 (6%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C + +R L+ + L EK++ L++ + PR+G+ YEWWSEALHGV+ PG
Sbjct: 41 CDQAATPAERAAGLVDIMELDEKLENLVNKSPGAPRIGLPAYEWWSEALHGVAK-SPGVT 99
Query: 103 F----GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVN 158
F G F ATSF I +++F+ L EA+ +S EARA N G AGL +W+PN+N
Sbjct: 100 FNKSSGAAFSSATSFSNPIVLSAAFDDELVEAVATQISTEARAFSNAGLAGLDWWTPNIN 159
Query: 159 IFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGV 218
++DPRWGRG ETPGED + KY + +RGL+GSD K+ A+CKH+ A DL+ WNGV
Sbjct: 160 PYKDPRWGRGMETPGEDALRISKYVKALLRGLEGSDPTTRKMVANCKHYAANDLERWNGV 219
Query: 219 DRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQV---------NGVPTCADPN 269
R++F+A V+ QD+ + + F+ C + V S MC+YN + NG P CA
Sbjct: 220 TRYNFDAPVTLQDLSEYYLPAFKQCARDSNVGSFMCAYNAMSIKGKDLSWNGTPVCASKY 279
Query: 270 ILKRTIRGEW---RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC--GPF 324
++ +R W N +I SDC++V ++ H++ T EEAA A AG D C +
Sbjct: 280 LMNDILREHWGWKEHNNWITSDCNAVLHMWNQHHWSDTREEAAGSAYTAGTDTVCEVSNY 339
Query: 325 LGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQE 384
+ A RGLL E ++ AL +R G FDG PY ++ DV TP+ ++
Sbjct: 340 DKTAVKGAFDRGLLDEDVVDRALKRLYEGLVRAGYFDG--PDAPYRNITWADVNTPEARK 397
Query: 385 LALEAARQGIVLLKNQGPSLPLS----HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTT 440
LA +A +G+VL KN G LP+ + +TVA+IG D M+G Y+GIA T
Sbjct: 398 LAHRSAVEGMVLTKNNG-VLPIKLEELQKKGKTVALIGNWVDNGEQMLGTYSGIAPFRNT 456
Query: 441 PLQGIGRYARTIHQQG--CKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDR 498
PL + G + D A++A+ QAD + G+D S+EAE DR
Sbjct: 457 PLAAAKALNLKMVTAGGPVNQSTGSRDSWTRPALNAAIQADVVLYFGGIDLSVEAEDRDR 516
Query: 499 AGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGT 558
L P Q +L+S +S K PT++V + D A N I+AIIWAGYPGQ GGT
Sbjct: 517 YSLAWPSAQAKLLSDISALGK-PTVVVQLGTMLDDTALLDNK-NISAIIWAGYPGQDGGT 574
Query: 559 AIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKR-----YPGRTYRFYKGP 613
A DI+ G + P G+LP+T YP +Y +PMT+M +RPS+ + PGRTYR+Y
Sbjct: 575 AAFDIITGKTAPSGRLPVTQYPAKYANQVPMTDMEVRPSKDTKGGAASNPGRTYRWYD-E 633
Query: 614 VVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKC---NRLTL 670
V+PFG G+ +TNF +VA + + D G S K I KC + L +
Sbjct: 634 AVHPFGFGLHFTNFTTSVAVSSSSAISTSDLESG----CKSEKHID----KCSFPSSLEV 685
Query: 671 GVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVH-VPAGAQQRVGINI 725
V D K+ S A + P + P K LVA+ K+H + G ++V + +
Sbjct: 686 SVTNDGKSTTSSYAALAFVRGEYGPKPY--PLKTLVAYGKLHDIAPGQTKKVKLEL 739
>gi|389632743|ref|XP_003714024.1| beta-xylosidase [Magnaporthe oryzae 70-15]
gi|351646357|gb|EHA54217.1| beta-xylosidase [Magnaporthe oryzae 70-15]
Length = 847
Score = 417 bits (1073), Expect = e-114, Method: Compositional matrix adjust.
Identities = 266/716 (37%), Positives = 382/716 (53%), Gaps = 50/716 (6%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C + +R L+ + L EK++ L++ + PR+G+ YEWWSEALHGV+ PG
Sbjct: 96 CDQAATPAERAAGLVDIMELDEKLENLVNKSPGAPRIGLPAYEWWSEALHGVAK-SPGVT 154
Query: 103 F----GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVN 158
F G F ATSF I +++F+ L EA+ +S EARA N G AGL +W+PN+N
Sbjct: 155 FNKSSGAAFSSATSFSNPIVLSAAFDDELVEAVATQISTEARAFSNAGLAGLDWWTPNIN 214
Query: 159 IFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGV 218
++DPRWGRG ETPGED + KY + +RGL+GSD K+ A+CKH+ A DL+ WNGV
Sbjct: 215 PYKDPRWGRGMETPGEDALRISKYVKALLRGLEGSDPTTRKMVANCKHYAANDLERWNGV 274
Query: 219 DRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQV---------NGVPTCADPN 269
R++F+A V+ QD+ + + F+ C + V S MC+YN + NG P CA
Sbjct: 275 TRYNFDAPVTLQDLSEYYLPAFKQCARDSNVGSFMCAYNAMSIKGKDLSWNGTPVCASKY 334
Query: 270 ILKRTIRGEW---RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC--GPF 324
++ +R W N +I SDC++V ++ H++ T EEAA A AG D C +
Sbjct: 335 LMNDILREHWGWKEHNNWITSDCNAVLHMWNQHHWSDTREEAAGSAYTAGTDTVCEVSNY 394
Query: 325 LGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQE 384
+ A RGLL E ++ AL +R G FDG + PY ++ DV TP+ ++
Sbjct: 395 DKTAVKGAFDRGLLDEDVVDRALKRLYEGLVRAGYFDGPDA--PYRNITWADVNTPEARK 452
Query: 385 LALEAARQGIVLLKNQGPSLPLS----HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTT 440
LA +A +G+VL KN G LP+ + +TVA+IG D M+G Y+GIA T
Sbjct: 453 LAHRSAVEGMVLTKNNG-VLPIKLEELQKKGKTVALIGNWVDNGEQMLGTYSGIAPFRNT 511
Query: 441 PLQGIGRYARTIHQQG--CKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDR 498
PL + G + D A++A+ QAD + G+D S+EAE DR
Sbjct: 512 PLAAAKALNLKMVTAGGPVNQSTGSRDSWTRPALNAAIQADVVLYFGGIDLSVEAEDRDR 571
Query: 499 AGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGT 558
L P Q +L+S +S K PT++V + D A N I+AIIWAGYPGQ GGT
Sbjct: 572 YSLAWPSAQAKLLSDISALGK-PTVVVQLGTMLDDTALLDNK-NISAIIWAGYPGQDGGT 629
Query: 559 AIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKR-----YPGRTYRFYKGP 613
A DI+ G + P G+LP+T YP +Y +PMT+M +RPS+ + PGRTYR+Y
Sbjct: 630 AAFDIITGKTAPSGRLPVTQYPAKYANQVPMTDMEVRPSKDTKGGAASNPGRTYRWYD-E 688
Query: 614 VVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKC---NRLTL 670
V+PFG G+ +TNF +VA + + D G S K I KC + L +
Sbjct: 689 AVHPFGFGLHFTNFTTSVAVSSSSAISTSDLESG----CKSEKHID----KCSFPSSLEV 740
Query: 671 GVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVH-VPAGAQQRVGINI 725
V D K+ S A + P + P K LVA+ K+H + G ++V + +
Sbjct: 741 SVTNDGKSTTSSYAALAFVRGEYGPKPY--PLKTLVAYGKLHDIAPGQTKKVKLEL 794
>gi|115397385|ref|XP_001214284.1| hypothetical protein ATEG_05106 [Aspergillus terreus NIH2624]
gi|114192475|gb|EAU34175.1| hypothetical protein ATEG_05106 [Aspergillus terreus NIH2624]
Length = 776
Score = 417 bits (1073), Expect = e-114, Method: Compositional matrix adjust.
Identities = 264/706 (37%), Positives = 387/706 (54%), Gaps = 27/706 (3%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C S R L+ +L+E V + VPRLG+ Y+ WSE+LHGV ++
Sbjct: 81 CDKSARPHDRAAALVSMFTLEELVNNTGNTGTGVPRLGLPKYQVWSESLHGVYRANWASE 140
Query: 103 FGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRD 162
GD+ ATSFPQ I T ++ N TL IG ++S +ARA N G GL ++PN+N FR
Sbjct: 141 --GDYSWATSFPQPILTMAALNRTLIHQIGDILSTQARAFSNVGRYGLDTYAPNINSFRH 198
Query: 163 PRWGRGQETPGEDP-VLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDR 220
P WGRGQETPGED L+ YA Y+ G+QG D + LK+ A+ KH+ YD++NW+G R
Sbjct: 199 PVWGRGQETPGEDAYYLASTYAYEYITGIQGGVDPETLKLVATAKHYAGYDIENWDGHSR 258
Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
+ ++++QD+ + + F + + KV SVMCSYN VNGVP+C++ L+ +R +
Sbjct: 259 LGNDMQITQQDLSEYYTPQFLVSARDAKVHSVMCSYNAVNGVPSCSNSFFLQTLLRETFG 318
Query: 281 L--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLL 338
+GY+ DC +V ++ + + A+AD+IRAG D+DCG H +A G +
Sbjct: 319 FVEDGYVSGDCGAVYNAFNPHEYAANESSASADSIRAGTDIDCGTSYQYHFTNAFDEGEI 378
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
S DI ++ T +RLG FDG +S Y L DV T D ++ EAA +G VLLK
Sbjct: 379 SRQDIERGVIRLYTNLVRLGYFDG--NSSQYRDLTWSDVQTTDAWNISHEAAVEGTVLLK 436
Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCK 458
N G +LPL+ R+VA+IGP ++ T M GNY G A T+PL + +H
Sbjct: 437 NDG-TLPLAD-SIRSVALIGPWANATTQMQGNYYGPAPYLTSPLAALEASDLDVHYAFGT 494
Query: 459 DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMAS 518
+++ F A+ A+R+ADA I G+D +IE EALDR + PG Q +L++++S
Sbjct: 495 NISSTTTAGFADALAAARKADAIIFAGGIDNTIEGEALDRMNITWPGNQLDLINQLSALG 554
Query: 519 KGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTW 578
K P +++ M GG +D + K++ ++A++W GYPGQ+GGTA+ DI+ G P G+L T
Sbjct: 555 K-PLVVLQMGGGQVDSSALKHNTNVSALLWGGYPGQSGGTALLDIIRGVRAPAGRLVTTQ 613
Query: 579 YPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
YP Y T P +M +RP+ + PG+TY +Y G VY FGHG+ YT F A+ T
Sbjct: 614 YPAGYATQFPAIDMGLRPNGTN--PGQTYMWYTGTPVYEFGHGLFYTTFEAKRASTAT-- 669
Query: 639 AVPLDGRHGSINATISGKAIKVTHAKCN-RLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG 697
H S N A +A R L + N G +T ++F+ AG
Sbjct: 670 ------NHSSFNIEDLLTAPHPGYAYPQLRPFLNFTAHITNTGRTTSDYTAMLFANTTAG 723
Query: 698 HWAPH--KQLVAFEKV-HVPAGAQQRVGINIHVCKYLSVVDRSGTR 740
APH K LV F+++ + GA Q + I + ++ D G R
Sbjct: 724 P-APHPNKWLVGFDRLGALEPGASQTMTFPITIDN-VARTDELGNR 767
>gi|333381510|ref|ZP_08473192.1| hypothetical protein HMPREF9455_01358 [Dysgonomonas gadei ATCC
BAA-286]
gi|332830480|gb|EGK03108.1| hypothetical protein HMPREF9455_01358 [Dysgonomonas gadei ATCC
BAA-286]
Length = 738
Score = 417 bits (1072), Expect = e-113, Method: Compositional matrix adjust.
Identities = 263/745 (35%), Positives = 393/745 (52%), Gaps = 89/745 (11%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
PF L +RV+DL+ RL+L+EKV +++ A+ RL I Y WW+E LHG+
Sbjct: 25 PFRDTKLSTDKRVSDLVSRLTLEEKVLQMLNNTPAIERLNIPAYNWWNECLHGIGR---- 80
Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTY 152
T++ T FPQ I A++++A L + + +SDE RA+YN +A GLTY
Sbjct: 81 TEYK-----VTVFPQAIGMAAAWDARLLKDVANAISDEGRAIYNDASAKGNYSIYHGLTY 135
Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL 212
W+PNVNIFRDPRWGRGQET GEDP L+G S+V GLQG D LK AA KH+ +
Sbjct: 136 WTPNVNIFRDPRWGRGQETYGEDPYLTGALGKSFVAGLQGDDSQYLKAAACAKHYAVH-- 193
Query: 213 DNWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNI 270
+G + R FN V+ D+ DT+ FR V++ KVA VMC+YN +G P C + +
Sbjct: 194 ---SGPENTRHTFNTFVTTFDLWDTYLPAFRDLVVDAKVAGVMCAYNAFSGEPCCGNNLL 250
Query: 271 LKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEE--AAADAIRAGLDLDCGPFLGLH 328
++ +R +W GY+ SDC ++ +Y +H + P+ AAADA+ +G D+DCG
Sbjct: 251 MQEILRDKWGFTGYVTSDCGAIDDFY--RHHKTHPDAKYAAADAVYSGTDIDCGNEAYKA 308
Query: 329 TESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-HQELAL 387
AV+ GL++E I+ +L ++ RLGMFD P+ P V H++LAL
Sbjct: 309 LVDAVKTGLITEEQIDISLKRLFEIRFRLGMFD--PAEDVKFSKIPLSVLESQPHKDLAL 366
Query: 388 EAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIG- 446
+ R+ IVLLKN+ LPLS + + VAVIGPN+D V+++GNY G TP + I
Sbjct: 367 KITRESIVLLKNENNFLPLSK-KLKKVAVIGPNADNEVSVLGNYNGFPTQIITPYKAIKN 425
Query: 447 --RYARTIHQQGCKDVACADDQLFGAAIDASRQA--DATILVMGLDQSIEAEAL------ 496
+ I+++G V +++ A A R D I G+ +E E +
Sbjct: 426 KLKNTEVIYEKGIDFVKPSENSKEEIAALAKRLKGMDVVIFAGGISPELEGEEMPVKIEG 485
Query: 497 ----DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYP 552
DR + LP Q EL+ + A + PT+ V+M+G I + + + AI+ A Y
Sbjct: 486 FTGGDRTSIKLPKIQTELMQALK-AERIPTVFVMMTGSAIAAEWESQN--VPAILNAWYG 542
Query: 553 GQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKG 612
GQ GTAIAD+LFG NP GKLP+T+Y ++ ++LP S RTYR++ G
Sbjct: 543 GQDAGTAIADVLFGDYNPSGKLPVTFYTKD--SDLPAF-------NSYEMKNRTYRYFDG 593
Query: 613 PVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGV 672
V+YPFG+G+SYT F ++ P SI A G+ ++V+
Sbjct: 594 QVLYPFGYGLSYTKFEYSPIQMP-----------ASIKA---GENMEVS----------- 628
Query: 673 QVDVKNVGSKDGAHTLLVF---STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCK 729
+ VKN G DG + ++ P L +FE++ + AG + V + +
Sbjct: 629 -ITVKNTGKTDGEEVVQLYISHDNNGTNRQLPLYALKSFERISLKAGESKSVTFKLS-PR 686
Query: 730 YLSVVDRSGTRRIPLGEHNIHIGGT 754
+++ D G ++ G+ ++IGGT
Sbjct: 687 EMALADEDGVLKMTKGKSKLYIGGT 711
>gi|380293100|gb|AFD50200.1| beta-xylosidase [Hypocrea orientalis]
Length = 797
Score = 417 bits (1072), Expect = e-113, Method: Compositional matrix adjust.
Identities = 266/733 (36%), Positives = 400/733 (54%), Gaps = 29/733 (3%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C S +R LI +L+E + + VPRLG+ Y+ W+EALHG+ TK
Sbjct: 63 CDSSAGYVERAQALISLFTLEELILNTQNSGPGVPRLGLPNYQVWNEALHGLDRANFATK 122
Query: 103 FGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRD 162
GG F ATSFP I T ++ N TL I ++S +ARA N G GL ++PNVN FR
Sbjct: 123 -GGQFEWATSFPMPILTTAALNRTLIHQIADIISTQARAFSNSGRYGLDVYAPNVNGFRS 181
Query: 163 PRWGRGQETPGEDPV-LSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDR 220
P WGRGQETPGED LS Y Y+ G+QG D ++LKVAA+ KHF YDL+NWN R
Sbjct: 182 PLWGRGQETPGEDAFFLSSAYTYEYITGIQGGVDPEQLKVAATVKHFAGYDLENWNNQSR 241
Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
F+A +++QD+ + + F K S+MCSYN VNGVP+CA+ L+ +R W
Sbjct: 242 LGFDAIITQQDLSEYYTPQFLAAARYAKSRSLMCSYNSVNGVPSCANSFFLQTLLRESWG 301
Query: 281 LN--GYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLL 338
GY+ SDCD+V ++ + S AAA ++RAG D+DCG H + G +
Sbjct: 302 FPEWGYVSSDCDAVYNVFNPHDYASNQSSAAASSLRAGTDIDCGQTYPWHLNESFVAGEV 361
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
+ +I ++ +RLG FD Y LG KDV D ++ EAA +GIVLLK
Sbjct: 362 TRGEIERSVTRLYANLVRLGYFD---KKNQYRSLGWKDVVKTDAWNISYEAAVEGIVLLK 418
Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCK 458
N G +LPLS + R++A+IGP ++ T M GNY G A +PL+ + ++ +
Sbjct: 419 NDG-TLPLSK-KVRSIALIGPWANATTQMQGNYFGPAPYLISPLEAAKKAGYHVNFELGT 476
Query: 459 DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMAS 518
++A F AI A++++DA + + G+D +IE E DR + PG Q +L+ ++S
Sbjct: 477 EIAGNSTAGFAKAIAAAKKSDAIVYLGGIDNTIEQEGADRTDIAWPGNQLDLIKQLSEVG 536
Query: 519 KGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTW 578
K P +++ M GG +D + K++ ++ +++W GYPGQ+GG A+ DIL G P G+L T
Sbjct: 537 K-PLVVLQMGGGQVDSSSLKSNKKVNSLVWGGYPGQSGGVALFDILSGKRAPAGRLITTQ 595
Query: 579 YPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
YP EY+ P +M +RP K PG+TY +Y G VY FG G+ YT F T+A+ P +
Sbjct: 596 YPAEYVHQFPQNDMNLRPD-GKSNPGQTYIWYTGKPVYEFGSGLFYTTFKETLASHPKCL 654
Query: 639 AVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF----STP 694
+ ++ +S T+++ + ++KN G + +T ++F +
Sbjct: 655 KF-------NTSSILSAPHPGYTYSE-QIPVFTFEANIKNSGKTESPYTAMLFVRTSNAG 706
Query: 695 PAGHWAPHKQLVAFEKV-HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
PA + P+K LV F+++ + G ++ I I V L+ VD G R + G++ + +
Sbjct: 707 PAPY--PNKWLVGFDRLADIKPGHSSKLSIPIPVSA-LARVDSYGNRIVYPGKYELAL-N 762
Query: 754 TKHSVSLHAATLG 766
T SV L +G
Sbjct: 763 TDESVKLEFELVG 775
>gi|242771939|ref|XP_002477942.1| beta-xylosidase XylA [Talaromyces stipitatus ATCC 10500]
gi|218721561|gb|EED20979.1| beta-xylosidase XylA [Talaromyces stipitatus ATCC 10500]
Length = 797
Score = 417 bits (1071), Expect = e-113, Method: Compositional matrix adjust.
Identities = 259/717 (36%), Positives = 387/717 (53%), Gaps = 25/717 (3%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C S+ +R LI +L+E + + A VPRLG+ Y+ WSE LHG+ K
Sbjct: 64 CNTSVNYVERAEGLISLFTLEELINNTQNSAPGVPRLGLPPYQVWSEGLHGLDRAN-WAK 122
Query: 103 FGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRD 162
G ++ ATSFP I + ++ N TL I +++ +ARA N G GL ++PN+N FR
Sbjct: 123 SGEEWKWATSFPMPILSMAALNRTLINQIASIIATQARAFNNVGRYGLDAYAPNINGFRS 182
Query: 163 PRWGRGQETPGEDP-VLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDR 220
P WGRGQETPGED LS YA Y+ GLQG D + LK+ A+ KHF YDL+NWN R
Sbjct: 183 PLWGRGQETPGEDAGFLSSSYAYEYITGLQGGVDPEHLKIVATAKHFAGYDLENWNNNSR 242
Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
F+A +++QD+ + + F K S MCSYN VNGVP+C+ +L+ +R W
Sbjct: 243 LGFDASITQQDLAEYYTPQFLAASRYAKARSFMCSYNSVNGVPSCSSSFLLQTLLRENWD 302
Query: 281 L--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLL 338
GY+ SDCD+ ++ + AAAD++RAG D+DCG + + G +
Sbjct: 303 FPDYGYVSSDCDAAYNVFNPHGYAINISAAAADSLRAGTDIDCGQTYPWYLNQSFIEGSV 362
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
+ +I +L+ + ++LG FDG S Y LG DV D ++ EAA +GIVLLK
Sbjct: 363 TRGEIERSLIRLYSNLVKLGYFDGNQSE--YRQLGWNDVVATDAWNISYEAAVEGIVLLK 420
Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCK 458
N G LPLS + ++VAVIGP ++ T + GNY G A TPLQ ++
Sbjct: 421 NDG-VLPLSE-KLKSVAVIGPWANATQQLQGNYFGPAPYLITPLQAARDAGYKVNYAFGT 478
Query: 459 DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMAS 518
++ F AA+ A++++D I + G+D +IEAE DR + PG Q +L+ ++S
Sbjct: 479 NILGNTTDGFAAALSAAKKSDVIIYLGGIDNTIEAEGTDRMNVTWPGNQLDLIQQLSQTG 538
Query: 519 KGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTW 578
K P +++ M GG +D + K++ + A++W GYPGQ+GG AI DIL G P G+L T
Sbjct: 539 K-PLVVLQMGGGQVDSSSLKSNNNVNALVWGGYPGQSGGKAIFDILSGKRAPAGRLVTTQ 597
Query: 579 YPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
YP EY T P T+M +RP K PG+TY +Y G VY FG+ + YT F T
Sbjct: 598 YPAEYATQFPATDMNLRP-DGKSNPGQTYIWYTGKPVYEFGYALFYTTFKET-------- 648
Query: 639 AVPLDGRHGSINATI-SGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFS-TPPA 696
A L I+ I S ++ +++ + V +KN G +T ++F+ T A
Sbjct: 649 AEKLASSSFDISDIIASPRSSSYAYSELVPF-VNVTATIKNTGKTASPYTAMLFANTTNA 707
Query: 697 GHWA-PHKQLVAFEKV-HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHI 751
G P+K LV ++++ + G + I + + +S VD +G R + G++ + +
Sbjct: 708 GPTPYPNKWLVGYDRLPSIEPGKSTELVIPVPIGA-ISRVDENGNRIVYPGDYQLAL 763
>gi|292495285|sp|B6EY09.1|XYND_ASPJA RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|211970990|dbj|BAG82824.1| 1,4-beta-D-xylosidase [Aspergillus japonicus]
Length = 804
Score = 416 bits (1070), Expect = e-113, Method: Compositional matrix adjust.
Identities = 252/723 (34%), Positives = 387/723 (53%), Gaps = 25/723 (3%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C + R L+ +L+E + + + VPRLG+ Y+ WSEALHG++
Sbjct: 60 CDSTASPYDRAAALVSLFTLEELIANTGNTSPGVPRLGLPPYQVWSEALHGLARANFTDN 119
Query: 103 FGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRD 162
G + ATSFP I +A++FN TL I ++S + RA N G GL +SPN+N FR
Sbjct: 120 --GAYSWATSFPSPILSAAAFNRTLINQIASIISTQGRAFNNAGRFGLDVYSPNINTFRH 177
Query: 163 PRWGRGQETPGEDP-VLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDR 220
P WGRGQETPGED L+ YA Y+ G+QG + + LK+AA+ KHF YD++NW+ R
Sbjct: 178 PVWGRGQETPGEDAYTLTAAYAYEYITGIQGGVNPEHLKLAATAKHFAGYDIENWDNHSR 237
Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
+ +++QD+ + + F + + V S MCSYN VNGVP+C++ L+ +R +
Sbjct: 238 LGNDVNITQQDLAEYYTPQFLVAARDAHVHSFMCSYNAVNGVPSCSNTFFLQTLLRDTFS 297
Query: 281 L--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLL 338
+GY+ DC +V ++ + + AAADAI AG D+DCG H ++ G +
Sbjct: 298 FVDHGYVSGDCGAVYGVFNPHGYAANEPSAAADAILAGTDIDCGTSYQYHFNESITTGAV 357
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSS-QPYGHLGPKDVCTPDHQELALEAARQGIVLL 397
+ DI + + LG FDG SS PY LG DV D ++ EAA +GIVLL
Sbjct: 358 ARDDIERGFIRLYANLVELGYFDGNSSSSNPYRSLGWPDVQKTDAWNISYEAAVEGIVLL 417
Query: 398 KNQGPSLPL---SHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQ 454
KN G +LPL S +++++A+IGP ++ T + GNY G A +P+ T+H
Sbjct: 418 KNDG-TLPLASPSEGKNKSIALIGPWANATTQLQGNYYGDAPYLISPVDAFTAAGYTVHY 476
Query: 455 QGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKV 514
+++ F AA+ A+R AD + + G+D +IEAEA DR+ + PG Q EL+S++
Sbjct: 477 APGTEISTNSTANFSAALSAARAADTIVFLGGIDNTIEAEAQDRSSIAWPGNQLELISQL 536
Query: 515 SM--ASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGG 572
+ + P ++ M GG +D + K++ ++ A++W GYPGQ+GG A+ DIL G P G
Sbjct: 537 AAQKSDDQPLVVYQMGGGQVDSSALKSNAKVNALLWGGYPGQSGGLALRDILTGARAPAG 596
Query: 573 KLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVA 632
+L T YP Y + +M +RP+++ + PG+TY +Y G VY FGHG+ YT F + A
Sbjct: 597 RLTTTQYPAAYAESFSALDMNLRPNETTQNPGQTYMWYTGEPVYAFGHGLFYTTFNASSA 656
Query: 633 NAPTVVAVPLDGRHGSINAT-ISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF 691
A + N T ++ A T R + N G +D +T LV+
Sbjct: 657 QAAKT--------KYTFNITDLTSAAHPDTTTVGQRTLFNFTASITNSGQRDSDYTALVY 708
Query: 692 S-TPPAG-HWAPHKQLVAFEKVHVPAGAQQRVGINIHVC-KYLSVVDRSGTRRIPLGEHN 748
+ T AG P+K LV F+++ A +N+ V L+ VD +G + G +
Sbjct: 709 ANTSTAGPSPYPNKWLVGFDRLAAVAKEGGTAELNVPVAVDRLARVDEAGNTVLFPGRYE 768
Query: 749 IHI 751
+ +
Sbjct: 769 VAL 771
>gi|380696433|ref|ZP_09861292.1| glycoside hydrolase [Bacteroides faecis MAJ27]
Length = 739
Score = 416 bits (1069), Expect = e-113, Method: Compositional matrix adjust.
Identities = 271/779 (34%), Positives = 401/779 (51%), Gaps = 89/779 (11%)
Query: 13 ILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISG 72
IL + S L A+E F PF LP+ QRV DL+ RL+L+EKVK +++
Sbjct: 10 ILACSMSLHLLAQEKF------------PFRDPQLPVEQRVEDLVSRLTLEEKVKQMLNS 57
Query: 73 AAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIG 132
V RLGI Y WW+E LHG+ TK+ T FPQ I A+++N L + +
Sbjct: 58 TPPVERLGIPAYNWWNECLHGIGR----TKYH-----VTVFPQAIGMAAAWNDALIKEVA 108
Query: 133 RVVSDEARAMYNGGTA--------GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAA 184
++DE RA+YN LTYW+PN+NIFRDPRWGRGQET GEDP L+ +
Sbjct: 109 SSIADEGRAIYNDTQRKEDYSQYHALTYWTPNINIFRDPRWGRGQETYGEDPYLTARIGE 168
Query: 185 SYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCV 244
++V+GLQG + LK +A KH+ + N R FN+ VS D+ DT+ FR V
Sbjct: 169 AFVQGLQGDNPRYLKASACAKHYAVHSGPEKN---RHSFNSDVSTYDLWDTYLPAFRTLV 225
Query: 245 MEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTST 304
++ KV+ VMC+YN G P C + +++ +R +W GY+ SDC ++ ++
Sbjct: 226 VDAKVSGVMCAYNAFQGQPCCGNDLLMQSILRDKWNFTGYVTSDCGAIDDIFNHHKTHPD 285
Query: 305 PEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP 364
AAADA+ G DLDCG L AV+ G+++E ++ ++ T++ RLG+FD
Sbjct: 286 AATAAADAVFHGTDLDCGHSAYLALVKAVKDGIITEKQLDVSVKRLFTIRFRLGLFD-PV 344
Query: 365 SSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVT 424
Y + + HQ+LA + AR+ +VLLKN LPL + + V V+GPN+D
Sbjct: 345 ELVDYARIPISILECRKHQDLAKQLARESMVLLKND-QLLPLQKNKLKKVVVMGPNADSR 403
Query: 425 VTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGCKDVACADDQLFGAAIDASRQADA 480
+++GNY G TPLQ I G + + +G V ++ ++ ADA
Sbjct: 404 ESLLGNYNGNPSRMLTPLQAIRERLGGWTEVEYIEGVDHVNTISADDLKQYVNRAKGADA 463
Query: 481 TILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGG 530
I + G+ +E E + DR + LP Q +++ K +A PT+ V+M+G
Sbjct: 464 VIFIGGISPRLEGEEMPVSKDGFDGGDRTTIALPAVQTQMM-KAWVAEHIPTVFVMMTGS 522
Query: 531 PIDVAF-AKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM 589
+ + + A+N P AI+ A Y GQ GG AIAD+LFG NP GKLP+T+Y ++ ++LP
Sbjct: 523 ALAIPWEAQNVP---AILNAWYGGQYGGEAIADVLFGDYNPSGKLPVTFYAKD--SDLPD 577
Query: 590 TEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSI 649
E S GRTYR++ G +YPFG+G+SYT+F ++ P V
Sbjct: 578 FE-------SYDMQGRTYRYFNGKALYPFGYGLSYTSFAYSSLKLPKVCRT--------- 621
Query: 650 NATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAF 708
+ K I+VT V VKN G +G + L S P P L F
Sbjct: 622 ----TDKEIEVT------------VTVKNTGHTEGEEVVQLYVSHPDKKILVPLTALKGF 665
Query: 709 EKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGV 767
+++ + AG QRV ++ + LS VD +G R++ G I +GG+ +L A GV
Sbjct: 666 KRIQLKAGEAQRVTFSLS-SEDLSCVDENGIRKVWAGTVKIQVGGSSPVATLAAPFKGV 723
>gi|367028614|ref|XP_003663591.1| glycoside hydrolase family 3 protein [Myceliophthora thermophila
ATCC 42464]
gi|347010860|gb|AEO58346.1| glycoside hydrolase family 3 protein [Myceliophthora thermophila
ATCC 42464]
Length = 760
Score = 415 bits (1066), Expect = e-113, Method: Compositional matrix adjust.
Identities = 263/724 (36%), Positives = 392/724 (54%), Gaps = 42/724 (5%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C S R L+ ++ EK+ L++ + V RLG+ Y+WW+EALHGV++ G
Sbjct: 39 CDTSASPGARAAALVSVMNNNEKLANLVNNSPGVSRLGLSAYQWWNEALHGVAH-NRGIT 97
Query: 103 FGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRD 162
+GG+F AT FPQ ITT+++F+ L E IG ++S EARA N G A L +W+PNVN FRD
Sbjct: 98 WGGEFSAATQFPQAITTSATFDDALIEQIGTIISTEARAFANNGRAHLDFWTPNVNPFRD 157
Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFH 222
PRWGRG ETPGED + K+A ++V+G+QG G +V A+CKH+ AYDL+N RF+
Sbjct: 158 PRWGRGHETPGEDAFKNKKWAEAFVKGMQGP-GPTHRVIATCKHYAAYDLENSGSTTRFN 216
Query: 223 FNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLN 282
F+AKVS QD+ + + PF+ C + KV S+MCSYN VN +P CA+P ++ +R W
Sbjct: 217 FDAKVSTQDLAEYYLPPFQQCARDSKVGSIMCSYNAVNEIPACANPYLMDTILRKHWNWT 276
Query: 283 G---YIVSDCDSVGVYY-----DTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE--SA 332
YIVSDCD+ VYY + + A ++ AG D C G + SA
Sbjct: 277 DEHQYIVSDCDA--VYYLGNANGGHRYKPSYAAAIGASLEAGCDNMCWATGGTAPDPASA 334
Query: 333 VQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQ 392
G S+ ++ A++ + + G FDG P Y +L DV T Q+ AL+AA
Sbjct: 335 FNSGQFSQTTLDTAILRQMQGLVLAGYFDG-PGGM-YRNLSVADVNTQTAQDTALKAAEG 392
Query: 393 GIVLLKNQGPSLPLS-HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYART 451
GIVLLKN G LPLS + + VA+IG ++ M+G Y+G P+ T
Sbjct: 393 GIVLLKNDG-ILPLSVNGSNFQVAMIGFWANAADKMLGGYSGSPPFNHDPVTAARSMGIT 451
Query: 452 IHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELV 511
++ D AA++A+++++A + G+D ++E E+ DR + P Q L+
Sbjct: 452 VNYVNGPLTQPNGDT--SAALNAAQKSNAVVFFGGIDNTVEKESQDRTSIEWPSGQLALI 509
Query: 512 SKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPG 571
+ +A G ++V+ G +D + P + AI+WAGYPGQ GGTA+ I+ G ++P
Sbjct: 510 RR--LAETGKPVIVVRLGTHVDDTPLLSIPNVRAILWAGYPGQDGGTAVVKIITGLASPA 567
Query: 572 GKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTV 631
G+LP T YP Y + P T MA+RPS S YPGRTYR+Y V+PFGHG+ YTNF +V
Sbjct: 568 GRLPATVYPSSYTSQAPFTNMALRPSSS--YPGRTYRWYSN-AVFPFGHGLHYTNFSVSV 624
Query: 632 ANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF 691
+ P A I ++ V + V ++V N G++ + L F
Sbjct: 625 RDFPASFA---------IADLLASCGDSVAYLDLCPFP-SVSLNVTNTGTRVSDYVALGF 674
Query: 692 ST---PPAGHWAPHKQLVAFEKV-HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEH 747
+ P+ H P K L +++V ++ G Q ++ + + L VD G R + G +
Sbjct: 675 LSGDFGPSPH--PIKTLATYKRVFNIEPGETQVAELDWKL-ESLVRVDEKGNRVLYPGTY 731
Query: 748 NIHI 751
+ +
Sbjct: 732 TLLV 735
>gi|3135209|dbj|BAA28267.1| beta-xylosidase A [Aspergillus oryzae]
Length = 798
Score = 415 bits (1066), Expect = e-113, Method: Compositional matrix adjust.
Identities = 258/730 (35%), Positives = 384/730 (52%), Gaps = 33/730 (4%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C S R L+ L+ +E V + PR+G+ Y+ W+EALHGV++
Sbjct: 63 CDTSAKPHDRAAALVSLLTFEELVNNTANTGHGAPRIGLPAYQVWNEALHGVAHADFSD- 121
Query: 103 FGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRD 162
GDF +TSFPQ I+T ++ N TL I ++S + RA N G GL +SPN+N FR
Sbjct: 122 -AGDFSWSTSFPQPISTMAALNRTLIHQIATIISTQGRAFMNAGRYGLDVYSPNINTFRH 180
Query: 163 PRWGRGQETPGEDP-VLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDR 220
P WGRGQETPGED L+ YA Y+ G+QG D + LK+ A+ KH+ YD++NW+ R
Sbjct: 181 PVWGRGQETPGEDAYCLASTYAYEYITGIQGGVDANPLKLIATAKHYAGYDIENWDNHSR 240
Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
+ ++++QD+ + + F + + KV SVMCSYN VNGVP+C++ L+ +R +
Sbjct: 241 LGNDMQITQQDLAEYYTPQFLVASRDAKVHSVMCSYNAVNGVPSCSNSFFLQTLLRDTFD 300
Query: 281 L--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLL 338
+GY+ DC +V ++ + + AAAD+IRAG D+DCG H + + +
Sbjct: 301 FVEDGYVSGDCGAVYNVFNPHGYATNESSAAADSIRAGTDIDCGVSYPRHFQESFHDQEV 360
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
S D+ ++ +R G FDG+ S PY ++ DV + + Q L+ EAA Q IVLLK
Sbjct: 361 SRQDLERGVIRLYASLIRAGYFDGKTS--PYRNITWSDVVSTNAQNLSYEAAAQSIVLLK 418
Query: 399 NQGPSLPL--SHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQG 456
N G LPL + +T+A+IGP ++ T M+GNY G A +PLQ I
Sbjct: 419 NDG-ILPLTSTSSSTKTIALIGPWANATTQMLGNYYGPAPYLISPLQAFQDSEYKITYTI 477
Query: 457 CKDVACADDQLF-GAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVS 515
+ D A+ +++AD I G+D ++E EA DR+ + P Q L++K++
Sbjct: 478 GTNTTTDPDSTSQSTALTTAKEADLIIFAGGIDNTLETEAQDRSNITWPSNQLSLITKLA 537
Query: 516 MASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLP 575
K P I++ M GG +D + KN+ + A+IW GYPGQ+GG A+ADI+ G P +L
Sbjct: 538 DLGK-PLIVLQMGGGQVDSSALKNNKNVNALIWGGYPGQSGGQALADIITGKRAPAARLV 596
Query: 576 MTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAP 635
T YP EY P +M +RP+ S PG+TY +Y G VY FGHG+ YTNF
Sbjct: 597 TTQYPAEYAEVFPAIDMNLRPNGSN--PGQTYMWYTGTPVYEFGHGLFYTNF-------- 646
Query: 636 TVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT----LGVQVDVKNVGSKDGAHTLLVF 691
T A G + I + + H + L VDVKN G + +T + F
Sbjct: 647 TASASAGSGTKNRTSFNID-EVLGRPHPGYKLVEQMPLLNFTVDVKNTGDRVSDYTAMAF 705
Query: 692 STPPAGHWAPH--KQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHN 748
AG APH K LV F+++ V G+ + + I + V L+ D G R + G +
Sbjct: 706 VNTTAGP-APHPNKWLVGFDRLSAVEPGSAKTMVIPVTV-DSLARTDEEGNRVLYPGRYE 763
Query: 749 IHIGGTKHSV 758
+ + + V
Sbjct: 764 VALNNEREVV 773
>gi|2723496|dbj|BAA24107.1| beta-1,4-xylosidase [Aspergillus oryzae]
Length = 798
Score = 414 bits (1065), Expect = e-113, Method: Compositional matrix adjust.
Identities = 258/730 (35%), Positives = 384/730 (52%), Gaps = 33/730 (4%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C S R L+ L+ +E V + PR+G+ Y+ W+EALHGV++
Sbjct: 63 CDTSAKPHDRAAALVSLLTFEELVNNTANTGHGAPRIGLPAYQVWNEALHGVAHADFSD- 121
Query: 103 FGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRD 162
GDF +TSFPQ I+T ++ N TL I ++S + RA N G GL +SPN+N FR
Sbjct: 122 -AGDFSWSTSFPQPISTMAALNRTLIHQIATIISTQGRAFMNAGRYGLDVYSPNINTFRH 180
Query: 163 PRWGRGQETPGEDP-VLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDR 220
P WGRGQETPGED L+ YA Y+ G+QG D + LK+ A+ KH+ YD++NW+ R
Sbjct: 181 PVWGRGQETPGEDAYCLASTYAYEYITGIQGGVDANPLKLIATAKHYAGYDIENWDNHSR 240
Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
+ ++++QD+ + + F + + KV SVMCSYN VNGVP+C++ L+ +R +
Sbjct: 241 LGNDMQITQQDLAEYYTPQFLVASRDAKVHSVMCSYNAVNGVPSCSNSFFLQTLLRDTFD 300
Query: 281 L--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLL 338
+GY+ DC +V ++ + + AAAD+IRAG D+DCG H + + +
Sbjct: 301 FVEDGYVSGDCGAVYNVFNPHGYATNESSAAADSIRAGTDIDCGVSYPRHFQESFHDQEV 360
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
S D+ ++ +R G FDG+ S PY ++ DV + + Q L+ EAA Q IVLLK
Sbjct: 361 SRQDLERGVIRLYASLIRAGYFDGKTS--PYRNITWSDVVSTNAQNLSYEAAAQSIVLLK 418
Query: 399 NQGPSLPL--SHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQG 456
N G LPL + +T+A+IGP ++ T M+GNY G A +PLQ I
Sbjct: 419 NDG-ILPLTSTSSSTKTIALIGPWANATTQMLGNYYGPAPYLISPLQAFQDSEYKITYTI 477
Query: 457 CKDVACADDQLF-GAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVS 515
+ D A+ +++AD I G+D ++E EA DR+ + P Q L++K++
Sbjct: 478 GTNTTTDPDSTSQSTALTTAKEADLIIFAGGIDNTLETEAQDRSNITWPSNQLSLITKLA 537
Query: 516 MASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLP 575
K P I++ M GG +D + KN+ + A+IW GYPGQ+GG A+ADI+ G P +L
Sbjct: 538 DLGK-PLIVLQMGGGQVDSSALKNNKNVNALIWGGYPGQSGGQALADIITGKRAPAARLV 596
Query: 576 MTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAP 635
T YP EY P +M +RP+ S PG+TY +Y G VY FGHG+ YTNF
Sbjct: 597 TTQYPAEYAEVFPAIDMNLRPNGSN--PGQTYMWYTGTPVYEFGHGLFYTNF-------- 646
Query: 636 TVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT----LGVQVDVKNVGSKDGAHTLLVF 691
T A G + I + + H + L VDVKN G + +T + F
Sbjct: 647 TASASAGSGTKNRTSFNID-EVLGRPHPGYKLVEQMPLLNFTVDVKNTGDRVSDYTAMAF 705
Query: 692 STPPAGHWAPH--KQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHN 748
AG APH K LV F+++ V G+ + + I + V L+ D G R + G +
Sbjct: 706 VNTTAGP-APHPNKWLVGFDRLSAVEPGSAKTMVIPVTV-DSLARTDEEGNRVLYPGRYE 763
Query: 749 IHIGGTKHSV 758
+ + + V
Sbjct: 764 VALNNEREVV 773
>gi|373955483|ref|ZP_09615443.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
gi|373892083|gb|EHQ27980.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
Length = 738
Score = 414 bits (1063), Expect = e-112, Method: Compositional matrix adjust.
Identities = 273/784 (34%), Positives = 400/784 (51%), Gaps = 104/784 (13%)
Query: 4 TIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQ 63
++ FL LIL + P++ T + PF +L + +RV DL+GR++L+
Sbjct: 8 SLGMLFLTLILF------------LSFKPQNHTYK-YPFNNPALSMDERVADLVGRMTLE 54
Query: 64 EKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSF 123
EKV +++ A A+ RLG+ Y WW+E LHGV+ T F T +PQ I A+++
Sbjct: 55 EKVSQMLNSAPAIERLGVPAYNWWNECLHGVAR----TPFK-----VTVYPQAIAMAATW 105
Query: 124 NATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRDPRWGRGQETPGED 175
+ T +G ++E RA+YN GLTYW+PN+NIFRDPRWGRGQET GED
Sbjct: 106 DKTSMHVMGDYTAEEGRAVYNESIKNDKHDIYLGLTYWTPNINIFRDPRWGRGQETYGED 165
Query: 176 PVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAY----DLDNWNGVDRFHFNAKVSKQD 231
P L+G+ +++V+GLQG D LK A KH+ + DL R FN +S D
Sbjct: 166 PFLTGEMGSAFVKGLQGDDPRYLKAAGCAKHYAVHSGPEDL-------RHKFNTDISDYD 218
Query: 232 IEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDS 291
+ DT+ FR V++ KV VMC+YN G P C ++ + +W+ GY+ SDC
Sbjct: 219 LWDTYLPAFRKLVVDAKVTGVMCAYNAFKGQPCCGSDLLMNSILHDKWKFTGYVTSDCGG 278
Query: 292 VGVYYDTQHFTSTP--EEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVN 349
+ +Y + P E AAADA+ G D++CG AV+ G LSE I+ +L
Sbjct: 279 IDDFYRENTHQTQPDAESAAADAVLHGTDVECGNVTYKSLVKAVKDGKLSEKQIDQSLKR 338
Query: 350 TLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHI 409
+V+ +LGMFD + + Y +G + P H AL+ A Q IVLLKN+G LPLS
Sbjct: 339 LFSVRFKLGMFDPADAVK-YNQIGKDALEAPAHGAQALKMAHQSIVLLKNEGNLLPLSK- 396
Query: 410 RHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY----ARTIHQQGCKDVACADD 465
+ +AV+GPN+D V+++GNY G T LQGI I+ + VA +
Sbjct: 397 NLKKIAVLGPNADNAVSVLGNYNGTPSRIVTALQGIKNKLPAGTEVIYDKAVDYVADSAA 456
Query: 466 QLFGAAIDAS-RQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKV 514
+ AA+ A + ADA I + G+ +E E + DR+ +LLPG Q EL+ K
Sbjct: 457 RYNYAAMAAKVKDADAIIYIGGISPELEGEEMPVSKPGFHGGDRSTILLPGVQTELL-KA 515
Query: 515 SMASKGPTILVLMSGGPIDVAF-AKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGK 573
A+ P + V+M+G I + A+N P AI+ A Y GQA GTAIAD+LFG NP G+
Sbjct: 516 LKATGKPVVFVMMTGSAIATPWEAENLP---AIVNAWYGGQAAGTAIADVLFGDYNPAGR 572
Query: 574 LPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVA 632
LP+T+Y + +LP T+ +M RTYR++KG +Y FG+G+SY+ F +
Sbjct: 573 LPVTFYGSD--KDLPSFTDYSMD--------NRTYRYFKGKPLYAFGYGLSYSKFEYAPL 622
Query: 633 NAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAH-TLLVF 691
+AP + +G+A L V V V N DG T L
Sbjct: 623 DAPLTLK--------------AGEA------------LTVHVKVTNKSKMDGEEVTELYL 656
Query: 692 STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHI 751
S + L FE+ + AG + + + LS+ D +G G+ I +
Sbjct: 657 SHIGIKQKTAIRALKGFERTLIKAGETKDITFKLSSAD-LSITDLNGNLVKASGKIAISV 715
Query: 752 GGTK 755
GG++
Sbjct: 716 GGSQ 719
>gi|391872736|gb|EIT81831.1| beta-glucosidase-related glycosidase [Aspergillus oryzae 3.042]
Length = 798
Score = 413 bits (1062), Expect = e-112, Method: Compositional matrix adjust.
Identities = 258/730 (35%), Positives = 383/730 (52%), Gaps = 33/730 (4%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C S R L+ L+ +E V + PR+G+ Y+ W+EALHGV++
Sbjct: 63 CDTSAKPHDRAAALVSLLTFEELVNNTANTGHGAPRIGLPAYQVWNEALHGVAHADFSD- 121
Query: 103 FGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRD 162
GDF +TSFPQ I+T ++ N TL I ++S + RA N G GL +SPN+N FR
Sbjct: 122 -AGDFSWSTSFPQPISTMAALNRTLIHQIATIISTQGRAFMNAGRYGLDVYSPNINTFRH 180
Query: 163 PRWGRGQETPGEDP-VLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDR 220
P WGRGQETPGED L+ YA Y+ G+QG D + LK+ A+ KH+ YD++NW+ R
Sbjct: 181 PVWGRGQETPGEDAYCLASTYAYEYITGIQGGVDANPLKLIATAKHYAGYDIENWDNHSR 240
Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
+ ++++QD+ + + F + + KV SVMCSYN VNGVP+C++ L+ +R +
Sbjct: 241 LGNDMQITQQDLAEYYTPQFLVASRDAKVHSVMCSYNAVNGVPSCSNSFFLQTLLRDTFD 300
Query: 281 L--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLL 338
+GY+ DC +V ++ + + AAAD+IRAG D+DCG H + + +
Sbjct: 301 FVEDGYVSGDCGAVYNVFNPHGYATNESSAAADSIRAGTDIDCGVSYPRHFQESFHDQEV 360
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
S D+ + +R G FDG+ S PY ++ DV + + Q L+ EAA Q IVLLK
Sbjct: 361 SRQDLERGVTRLYASLIRAGYFDGKTS--PYRNITWSDVVSTNAQNLSYEAAAQSIVLLK 418
Query: 399 NQGPSLPL--SHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQG 456
N G LPL + +T+A+IGP ++ T M+GNY G A +PLQ I
Sbjct: 419 NDG-ILPLTSTSSSTKTIALIGPWANATTQMLGNYYGPAPYLISPLQAFQDSEYKITYTI 477
Query: 457 CKDVACADDQLF-GAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVS 515
+ D A+ +++AD I G+D ++E EA DR+ + P Q L++K++
Sbjct: 478 GTNTTTDPDSTSQSTALTTAKEADLIIFAGGIDNTLETEAQDRSNITWPSNQLSLITKLA 537
Query: 516 MASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLP 575
K P I++ M GG +D + KN+ + A+IW GYPGQ+GG A+ADI+ G P +L
Sbjct: 538 DLGK-PLIVLQMGGGQVDSSALKNNKNVNALIWGGYPGQSGGQALADIITGKRAPAARLV 596
Query: 576 MTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAP 635
T YP EY P +M +RP+ S PG+TY +Y G VY FGHG+ YTNF
Sbjct: 597 TTQYPAEYAEVFPAIDMNLRPNGSN--PGQTYMWYTGTPVYEFGHGLFYTNF-------- 646
Query: 636 TVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT----LGVQVDVKNVGSKDGAHTLLVF 691
T A G + I + + H + L VDVKN G + +T + F
Sbjct: 647 TASASAGSGTKNRTSFNID-EVLGRPHPGYKLVEQMPLLNFTVDVKNTGDRVSDYTAMAF 705
Query: 692 STPPAGHWAPH--KQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHN 748
AG APH K LV F+++ V G+ + + I + V L+ D G R + G +
Sbjct: 706 VNTTAGP-APHPNKWLVGFDRLSAVEPGSAKTMVIPVTV-DSLARTDEEGNRVLYPGRYE 763
Query: 749 IHIGGTKHSV 758
+ + + V
Sbjct: 764 VALNNEREVV 773
>gi|365120422|ref|ZP_09338009.1| hypothetical protein HMPREF1033_01355 [Tannerella sp.
6_1_58FAA_CT1]
gi|363647477|gb|EHL86692.1| hypothetical protein HMPREF1033_01355 [Tannerella sp.
6_1_58FAA_CT1]
Length = 735
Score = 413 bits (1062), Expect = e-112, Method: Compositional matrix adjust.
Identities = 267/766 (34%), Positives = 412/766 (53%), Gaps = 85/766 (11%)
Query: 30 CDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSE 89
C T PF L +RV+DL+ RL+L+EK+ +++ A A+ RLGI Y+WW+E
Sbjct: 17 CAHTGKAQNTFPFQNPDLSFEKRVDDLVSRLTLEEKISQMLNKAPAIERLGIPAYDWWNE 76
Query: 90 ALHGVSNVGPGTKFGGDFP-GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA 148
LHGV G P T FPQ I A++++ L++ + ++DE RA+Y+ +
Sbjct: 77 CLHGV----------GRTPYKVTVFPQAIGMAATWDDALFQQVASSIADEGRAIYHDAIS 126
Query: 149 --------GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKV 200
GLTYW+PN+NIFRDPRWGRGQET GEDP L+G ++V GLQG D LK
Sbjct: 127 KGVHEIYHGLTYWTPNINIFRDPRWGRGQETYGEDPYLTGTLGKAFVNGLQGDDPKYLKA 186
Query: 201 AASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVN 260
+A KH+ + + + R FN +VS D+ DT+ FR V++ KV+SVMC+YN +
Sbjct: 187 SACAKHYAVH---SGPEISRHFFNTEVSMYDLWDTYLPAFRDLVVDAKVSSVMCAYNALA 243
Query: 261 GVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDL 319
G P C + +++ +R +W+ GY+ SDC ++ + +H T + A+ADA+ G DL
Sbjct: 244 GQPCCGNDLLMQDILRKQWKFTGYVTSDCGAIDDF--LKHKTHADAAHASADAVLHGTDL 301
Query: 320 DCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCT 379
+CG + + AV++GL++E I+ ++ + RLG+FD P+ + P V
Sbjct: 302 ECGQNIYVKLVDAVKQGLITEAQIDESVKRLFMTRFRLGLFD--PADRVKYADTPLSVLE 359
Query: 380 PD-HQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGY 438
D H+ LAL+ +R+ +VLLKN LPL + +AVIGPN+D + ++GNY G
Sbjct: 360 CDEHKALALKMSRESVVLLKNDN-VLPLRK-NLKKIAVIGPNADDSTVVLGNYNGFPSKV 417
Query: 439 TTPLQGI----GRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE 494
TPL+ I G+ + I+ + V +D++ A I+ + D I V G+ +E E
Sbjct: 418 ITPLEAIRSKVGKRTQVIYDRAIDCVKPSDEKTLNALIERLKGVDQVIFVGGISPRLEGE 477
Query: 495 AL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIA 544
L DR + LP Q EL+ K+ A P I V+M+G + + + + I
Sbjct: 478 ELPISVDGFRGGDRTTIALPEVQTELMKKMKEAGL-PVIFVMMTGSALGIEWESQN--IP 534
Query: 545 AIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPG 604
AI+ A Y GQ G AIAD+LFG NP GKLP+T+Y + ++LP P +
Sbjct: 535 AILNAWYGGQFAGQAIADVLFGDYNPSGKLPVTFYRSD--SDLP-------PFGAFSMAN 585
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
RTYR++KG +YPFG G+SYT F ++V P VV+ G + G+ IKV+
Sbjct: 586 RTYRYFKGEALYPFGFGLSYTMFDYSV---PQVVS------GGKV-----GEPIKVS--- 628
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGIN 724
V VKN+G K+G + ++ + AP L F++V++ AG ++ +
Sbjct: 629 ---------VKVKNIGKKNGDEVVQLYLSHEGVEKAPITALKGFKRVYLKAGEEKTLSFE 679
Query: 725 IHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIKS 770
I + +S+ D +G + G+ I+ GG S + A + G++KS
Sbjct: 680 IS-PRDMSLPDDNGIITVFPGKKTIYAGGM--SPTSEAKSKGLVKS 722
>gi|292495281|sp|C0STH4.1|XYND_ASPAC RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|225878711|dbj|BAH30675.1| beta-xylosidase [Aspergillus aculeatus]
Length = 805
Score = 413 bits (1061), Expect = e-112, Method: Compositional matrix adjust.
Identities = 252/723 (34%), Positives = 384/723 (53%), Gaps = 24/723 (3%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C + R L+ +L+E + + + VPRLG+ Y+ WSEALHG+ T
Sbjct: 60 CDSTASPYDRAAALVSLFTLEELIANTGNTSPGVPRLGLPPYQVWSEALHGLGRAN-FTD 118
Query: 103 FGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRD 162
G G SFP I +A++FN TL I ++S + RA N G GL +SPN+N FR
Sbjct: 119 NGALHAGRPSFPSPILSAAAFNRTLINQIASIISTQGRAFNNAGRFGLDVYSPNINTFRH 178
Query: 163 PRWGRGQETPGEDP-VLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDR 220
P WGRGQETPGED L+ YA Y+ G+QG + + LK+AA+ KHF YD++NW+ R
Sbjct: 179 PVWGRGQETPGEDAYTLTAAYAYEYITGIQGGVNPEHLKLAATAKHFAGYDIENWDNHSR 238
Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
+ +++QD+ + + F + + V S MCSYN VNGVP+C++ L+ +R +
Sbjct: 239 LGNDVNITQQDLAEYYTPQFLVAARDAHVHSFMCSYNAVNGVPSCSNTFFLQTLLRDTFS 298
Query: 281 L--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLL 338
+GY+ DC +V ++ + + AAADAI AG D+DCG H ++ G +
Sbjct: 299 FVDHGYVSGDCGAVYGVFNPHGYAANEPSAAADAILAGTDIDCGTSYQYHFNESITTGAV 358
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSS-QPYGHLGPKDVCTPDHQELALEAARQGIVLL 397
+ DI + + LG FDG SS PY LG DV D ++ EAA +GIVLL
Sbjct: 359 ARDDIERGFIRLYANLVELGYFDGNSSSSNPYRSLGWPDVQKTDAWNISYEAAVEGIVLL 418
Query: 398 KNQGPSLPL---SHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQ 454
KN G +LPL S +++++A+IGP ++ T + GNY G A +P+ T+H
Sbjct: 419 KNDG-TLPLASPSEGKNKSIALIGPWANATTQLQGNYYGDAPYLISPVDAFTAAGYTVHY 477
Query: 455 QGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKV 514
+++ F AA+ A+R AD + + G+D +IEAEA DR+ + PG Q EL+S++
Sbjct: 478 APGTEISTNSTANFSAALSAARAADTIVFLGGIDNTIEAEAQDRSSIAWPGNQLELISQL 537
Query: 515 SM--ASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGG 572
+ + P ++ M GG +D + K + ++ A++W GYPGQ+GG A+ DIL G P G
Sbjct: 538 AAQKSDDQPLVVYQMGGGQVDSSSLKFNAKVNALLWGGYPGQSGGLALRDILTGARAPAG 597
Query: 573 KLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVA 632
+L T YP Y + +M +RP+++ + PG+TY +Y G VY FGHG+ YT F + A
Sbjct: 598 RLTTTQYPAAYAESFSALDMNLRPNETTQNPGQTYMWYTGEPVYAFGHGLFYTTFNASSA 657
Query: 633 NAPTVVAVPLDGRHGSINAT-ISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF 691
A + N T ++ A T R + N G +D +T LV+
Sbjct: 658 QAAKT--------KYTFNITDLTSAAHPDTTTVGQRTLFNFTASITNSGQRDSDYTALVY 709
Query: 692 S-TPPAG-HWAPHKQLVAFEKVHVPAGAQQRVGINIHVC-KYLSVVDRSGTRRIPLGEHN 748
+ T AG P+K LV F+++ A +N+ V L+ VD +G + G +
Sbjct: 710 ANTSTAGPSPYPNKWLVGFDRLAAVAKEGGTAELNVPVAVDRLARVDEAGNTVLFPGRYE 769
Query: 749 IHI 751
+ +
Sbjct: 770 VAL 772
>gi|425780840|gb|EKV18836.1| Beta-xylosidase XylA [Penicillium digitatum PHI26]
gi|425783077|gb|EKV20946.1| Beta-xylosidase XylA [Penicillium digitatum Pd1]
Length = 792
Score = 412 bits (1060), Expect = e-112, Method: Compositional matrix adjust.
Identities = 259/729 (35%), Positives = 383/729 (52%), Gaps = 36/729 (4%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C + R L +L+E V + AVPRLG+ Y+ WSEALHG+
Sbjct: 62 CDTTAKPHDRAAALTSMFTLEELVNSTGNVIPAVPRLGLPPYQVWSEALHGLDRAN--LT 119
Query: 103 FGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRD 162
GD+ ATSFP I ++ N TL IG ++S + RA NGG GL ++PN+N FR
Sbjct: 120 ESGDYSWATSFPSPILIMAALNRTLINQIGEIISTQGRAFNNGGRYGLDVYAPNINSFRH 179
Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR-LKVAASCKHFTAYDLDNWNGVDRF 221
P WGRGQETPGED L Y Y+ G+QG R LK+AA+ KHF YDL+NW R
Sbjct: 180 PVWGRGQETPGEDVQLCSIYGVEYITGIQGGLNPRDLKLAATAKHFAGYDLENWGNHSRL 239
Query: 222 HFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRL 281
N +S D+ + F V + +V SVM SYN VNGVP+ A+ +L+ +R W
Sbjct: 240 GNNVAISSFDLASYYTPQFITAVRDARVHSVMSSYNAVNGVPSSANSFLLQTLLRETWNF 299
Query: 282 --NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLS 339
+GY+ SDCD+V ++ + S+ AAA +I+AG D+DCG L+ ++ +S
Sbjct: 300 VEDGYVSSDCDAVFNVFNPHGYASSASLAAAKSIQAGTDIDCGATYQLYLNESLSHDEIS 359
Query: 340 EIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKN 399
+I A+ + + LG FDG+ S Y HL DV D ++ EAA +GIVLLKN
Sbjct: 360 RSEIERAVTRFYSTLVSLGYFDGDNSK--YRHLHWPDVVATDAWNISYEAAVEGIVLLKN 417
Query: 400 QGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKD 459
G +LPLS+ R+VA+IGP ++VT T+ GNY G A T PL + ++ +
Sbjct: 418 DG-TLPLSN-NTRSVALIGPWANVTTTLQGNYYGAAPYLTGPLAALQASNLDVNYAFGTN 475
Query: 460 VACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASK 519
++ F AA+ A+ +++ I G+D ++EAE +DR + PG Q +L+ ++S K
Sbjct: 476 ISSDSTSGFEAALSAAGKSEVIIFAGGIDNTVEAEGVDRESITWPGNQLQLIEQLSKLGK 535
Query: 520 GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWY 579
P +++ M GG +D + K + + +++W GYPGQ+GG AI DIL G P G+L +T Y
Sbjct: 536 -PLVVLQMGGGQVDSSSLKANKNVNSLVWGGYPGQSGGPAILDILTGKRAPAGRLTVTQY 594
Query: 580 PQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
P EY P T+M++RP + PG+TY +Y G VY FGHG+ YT F
Sbjct: 595 PAEYALQFPATDMSLRPKGNN--PGQTYMWYTGKPVYEFGHGLFYTTF-----------K 641
Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLT------LGVQVDVKNVGSKDGAHTLLVFST 693
V L HG+ N T ++ + V+V N G+ +T + F
Sbjct: 642 VSLAHFHGAENGTSFDIVQLLSRPNAGYSVVEQIPFINYTVEVMNTGNVTSDYTAMAFVN 701
Query: 694 PPAG---HWAPHKQLVAFEKV-HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNI 749
AG H P+K LV F+++ + Q + I I + ++ D G R + G++ +
Sbjct: 702 TKAGPSPH--PNKWLVGFDRLGGISPRTTQTMTIPITLDN-VARTDERGNRIVYPGKYEL 758
Query: 750 HIGGTKHSV 758
+ + +V
Sbjct: 759 TLNNERSAV 767
>gi|116197206|ref|XP_001224415.1| hypothetical protein CHGG_05201 [Chaetomium globosum CBS 148.51]
gi|88181114|gb|EAQ88582.1| hypothetical protein CHGG_05201 [Chaetomium globosum CBS 148.51]
Length = 735
Score = 412 bits (1059), Expect = e-112, Method: Compositional matrix adjust.
Identities = 271/713 (38%), Positives = 395/713 (55%), Gaps = 65/713 (9%)
Query: 65 KVKLLISGAAA----VPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
K ++++G A+ V RLG+ Y+WW+EALHGV++ G +GG F AT FPQ IT++
Sbjct: 33 KSNVVLTGPASNSPGVSRLGLSAYQWWNEALHGVAH-NRGITWGGQFSAATQFPQAITSS 91
Query: 121 SSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSG 180
++F+ L E IG ++S EARA N G A L +W+PNVN FRDPRWGRG ETPGED +
Sbjct: 92 AAFDDHLIERIGVIISTEARAFANNGRAHLDFWTPNVNPFRDPRWGRGHETPGEDAFRNK 151
Query: 181 KYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPF 240
K+A ++V+G+QG++ +V A+CKH+ AYDL+N RF+F+AKVS QD+ + + PF
Sbjct: 152 KWAEAFVQGMQGTESTH-RVIATCKHYAAYDLENSGSTTRFNFDAKVSTQDLAEYYLPPF 210
Query: 241 RMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRL---NGYIVSDCDSVGVYY- 296
+ C + KV S+MCSYN VNGVP CA P ++ +R W N Y+VSDCD+ VYY
Sbjct: 211 QQCARDSKVGSIMCSYNAVNGVPACASPYLMDTILRKHWNWTDQNQYVVSDCDA--VYYL 268
Query: 297 ----DTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE--SAVQRGLLSEIDINNALVNT 350
+ S+ A ++ AG D C G + SA ++ ++ A++
Sbjct: 269 GNANGGHRYKSSYAAAIGASLEAGCDNMCWATGGTTPDPASAFNSRQFTQATLDKAMLRQ 328
Query: 351 LTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIR 410
+ ++ G FDG P+S Y +L DV T ++ AL+AA +GIVLLKN LPL+
Sbjct: 329 MQGLVKAGYFDG-PNSL-YRNLTAADVNTQVARDTALKAAEEGIVLLKNDN-ILPLTLGG 385
Query: 411 HRT-VAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTI-HQQGCKDVACADDQLF 468
T VA+IG ++ M+G Y+G P+ T+ + G AD
Sbjct: 386 SNTQVAMIGFWANAADKMLGGYSGSPPFSHDPVTAARSMGITVNYVNGPLTQTNADTS-- 443
Query: 469 GAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMS 528
AA++A++++ I G+D ++E E+ DR + P Q ++ +++ K P I+V M
Sbjct: 444 -AAVNAAQKSSVVIFFGGIDNTVEKESQDRTSIAWPSGQLTMIQRLAQTGK-PVIVVRM- 500
Query: 529 GGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLP 588
G +D + P + AI+WAGYPGQ GGTA+ +++ G ++P G+LP+T YP Y P
Sbjct: 501 GTHVDDTPLLSIPNVKAILWAGYPGQDGGTAVMNLITGLASPAGRLPVTVYPSSYTNQAP 560
Query: 589 MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGS 648
T MA+RPS S YPGRTYR+YK P V+PFGHG+ YTNF +VA PLD
Sbjct: 561 YTNMALRPSSS--YPGRTYRWYKDP-VFPFGHGLHYTNF--SVA--------PLD----- 602
Query: 649 INATISGKAIKVTHAKCNRLTL-------GVQVDVKNVGSKDGAHTLLVFSTPPAGHWAP 701
AT S I A C +T V V V N GS+ + +L F AG + P
Sbjct: 603 FPATFS---IADLLASCKGVTYLELCPFPSVSVSVTNTGSRASDYVVLGFL---AGDFGP 656
Query: 702 H----KQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNI 749
K L +++V V G Q ++ + + L+ VD G R + G + +
Sbjct: 657 TPRPIKSLATYKRVFDVQPGKTQSAELDWKL-ESLARVDGKGNRVLYPGTYTL 708
>gi|150019484|ref|YP_001311738.1| glycoside hydrolase family protein [Clostridium beijerinckii NCIMB
8052]
gi|149905949|gb|ABR36782.1| glycoside hydrolase, family 3 domain protein [Clostridium
beijerinckii NCIMB 8052]
Length = 709
Score = 412 bits (1059), Expect = e-112, Method: Compositional matrix adjust.
Identities = 256/732 (34%), Positives = 387/732 (52%), Gaps = 90/732 (12%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
++ +L+G+++L+EK + L ++AV RL + Y WW+E LHGV+ G A
Sbjct: 14 EKAKELVGKMTLEEKAEQLTYKSSAVKRLNVPRYNWWNEGLHGVARAGT----------A 63
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRD 162
T FPQ I A+ F+ L I +V+S E RA YN + G+T+WSPNVNIFRD
Sbjct: 64 TVFPQAIGLAAMFDDELLNYIAKVISTEGRAKYNENSKKDDRDIYKGITFWSPNVNIFRD 123
Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFH 222
PRWGRG ET GEDP L+ + ++V+GLQG +G LK AA KHF + G+ R
Sbjct: 124 PRWGRGHETYGEDPYLTSRLGVAFVKGLQG-EGKYLKAAACAKHFAVHS--GPEGL-RHE 179
Query: 223 FNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLN 282
F+A VSK+D+ +T+ F CV EG V +VM +YN+ NG P C +L+ +RG+W
Sbjct: 180 FDAVVSKKDLYETYLPAFEACVKEGDVEAVMGAYNRTNGEPCCGSKTLLRDILRGKWNFK 239
Query: 283 GYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEID 342
G++VSDC ++ ++ TST E+AA A++ G DL+CG L A + GL++E D
Sbjct: 240 GHVVSDCWAIADFHLHHRVTSTATESAALAMKNGCDLNCGNVY-LQLLLAYKEGLVTEED 298
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGP 402
I A + ++RLGMFD E Y + + +H EL+L+AAR +VLLKN G
Sbjct: 299 ITTAAERLMATRIRLGMFDEECE---YNKIPYELNDCKEHNELSLKAARNSMVLLKNNGI 355
Query: 403 SLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGC- 457
LPL+ +++AVIGPN+D + + GNY+G A Y T L+GI G R + +GC
Sbjct: 356 -LPLNKNNLKSIAVIGPNADSQIMLKGNYSGTASRYITVLEGIHEAVGEDVRVYYSEGCH 414
Query: 458 ------KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRAGLL 502
+++A +D+L AI + ++D IL +GLD +IE E A D+A L
Sbjct: 415 LFRDRVEELAEPNDRL-KEAISIAERSDVAILCLGLDSTIEGEQGDAGNSEGAGDKASLN 473
Query: 503 LPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIAD 562
LPGRQQEL+ K+ + + P ILV+ +G + F + + +AI+ A YPG GG A+AD
Sbjct: 474 LPGRQQELLEKI-IETGTPVILVIGAGSAL--TFNNAEDKCSAILDAWYPGSRGGRAVAD 530
Query: 563 ILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHG 621
++FG +P GKLP+T+Y +LP + +M+ RTYR+ +YPFG+G
Sbjct: 531 LIFGKCSPSGKLPITFYRN--TKDLPEFIDYSMKD--------RTYRYMSCESLYPFGYG 580
Query: 622 MSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGS 681
++Y+ T+ + V K + + V V + N G+
Sbjct: 581 LTYS--------------------------TVKLSELHVPDVKSDFEDVEVSVKITNTGN 614
Query: 682 KDGAHTLLVFSTPPAGHWAPHKQ-LVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTR 740
D + + +A L F++V + G + + I + VV+ G R
Sbjct: 615 FDIEEVIQCYIKDLESKYAVRNHSLAGFKRVRLKIGESKIAKMKIKKSSF-EVVNDDGER 673
Query: 741 RIPLGEHNIHIG 752
+ + +G
Sbjct: 674 ILDSKRFKLFVG 685
>gi|436410475|gb|AGB57183.1| beta-xylosidase [Aspergillus sp. BCC125]
Length = 804
Score = 412 bits (1058), Expect = e-112, Method: Compositional matrix adjust.
Identities = 256/720 (35%), Positives = 376/720 (52%), Gaps = 32/720 (4%)
Query: 31 DPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEA 90
D ++ R+ C S R LI +L E + + V RLG+ Y+ WSEA
Sbjct: 57 DCQNGPLRSHLICDESATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEA 116
Query: 91 LHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGL 150
LHG+ G + ATSFPQ I T ++ N TL I ++S + RA N G GL
Sbjct: 117 LHGLDRANFSDL--GSYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGL 174
Query: 151 TYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD-RLKVAASCKHFTA 209
++PN+N FR P WGRGQETPGED L+ YA Y+ G+QG D D LK+AA+ KH+
Sbjct: 175 DVYAPNINTFRHPVWGRGQETPGEDVSLAAIYAYEYITGIQGPDPDSNLKLAATAKHYAG 234
Query: 210 YDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
YD++NW+ R + +++QD+ + + F + + KV SVMC+YN VNGVP CAD
Sbjct: 235 YDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVHSVMCAYNAVNGVPACADSY 294
Query: 270 ILKRTIRGEWRL--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGL 327
L+ +R + +GY+ SDCD+ Y+ + S+ AAA+AI AG D+DCG
Sbjct: 295 FLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYASSQAAAAAEAILAGTDIDCGTTYQW 354
Query: 328 HTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPS--SQPYGHLGPKDVCTPDHQEL 385
H ++ G LS DI ++ T ++ G FD + + PY L DV D +
Sbjct: 355 HLNESITAGDLSRDDIEKGVIRLYTTLVQAGYFDSNTTKANNPYRDLTWSDVLETDAWNI 414
Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRH----RTVAVIGPNSDVTVTMIGNYAGIACGYTTP 441
+ +AA QGIVLLKN LPL+ + TVA+IGP ++ T ++GNY G A +P
Sbjct: 415 SYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYMISP 474
Query: 442 LQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGL 501
++ ++ F AA+ A++ AD I G+D ++EAEALDR +
Sbjct: 475 RAAFEEAGYNVNFAEGTGISSTSTSGFAAALSAAQSADVIIYAGGIDNTLEAEALDRESI 534
Query: 502 LLPGRQQELVSKV-SMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
PG Q +L+ K+ S A P I++ M GG +D + KN+ ++A++W GYPGQ+GG A+
Sbjct: 535 AWPGNQLDLIQKLASSAGSKPLIVLQMGGGQVDSSSLKNNTNVSALLWGGYPGQSGGFAL 594
Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGH 620
DI+ G NP G+L T YP Y P T+M +RP PG+TY++Y G VY FGH
Sbjct: 595 RDIITGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGDN--PGQTYKWYTGEAVYEFGH 652
Query: 621 GMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT----LGVQVDV 676
G+ YT F + +N T R +N + TH +T L ++
Sbjct: 653 GLFYTTFAESSSNTTT--------REIKLNIQ---DILSQTHEDLASITQLPVLNFTANI 701
Query: 677 KNVGSKDGAHTLLVFSTPPAGHWAPH--KQLVAFEKV-HVPAGAQQRVGINIHVCKYLSV 733
KN G + +T +VF+ AP+ K LV ++++ V G + + + I V + V
Sbjct: 702 KNTGKVESDYTAMVFANTSDAGPAPYPVKWLVGWDRLGDVKVGETRELRVPIEVGSFARV 761
>gi|121809149|sp|Q4AEG8.1|XYND_ASPAW RecName: Full=Exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|73486695|dbj|BAE19756.1| beta-xylosidase [Aspergillus awamori]
Length = 804
Score = 411 bits (1057), Expect = e-112, Method: Compositional matrix adjust.
Identities = 253/720 (35%), Positives = 380/720 (52%), Gaps = 32/720 (4%)
Query: 31 DPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEA 90
D ++ R+ C + R LI +L E + + V RLG+ Y+ WSEA
Sbjct: 57 DCQNGPLRSHLICDETATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEA 116
Query: 91 LHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGL 150
LHG+ G + ATSFPQ I T ++ N TL I ++S + RA N G GL
Sbjct: 117 LHGLDRANFSDS--GAYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGL 174
Query: 151 TYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD-RLKVAASCKHFTA 209
++PN+N FR P WGRGQETPGED L+ YA Y+ G+QG D + LK+AA+ KH+
Sbjct: 175 DVYAPNINTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQGPDPESNLKLAATAKHYAG 234
Query: 210 YDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
YD++NW+ R + +++QD+ + + F + + KV SVMC+YN VNGVP CAD
Sbjct: 235 YDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVQSVMCAYNAVNGVPACADSY 294
Query: 270 ILKRTIRGEWRL--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGL 327
L+ +R + +GY+ SDCD+ Y+ + S+ AAA+AI AG D+DCG
Sbjct: 295 FLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYASSQAAAAAEAILAGTDIDCGTTYQW 354
Query: 328 HTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPS--SQPYGHLGPKDVCTPDHQEL 385
H ++ G LS DI ++ T ++ G FD + + PY L DV D +
Sbjct: 355 HLNESITAGDLSRDDIEQGVIRLYTTLVQAGYFDSNTTKANNPYRDLSWSDVLETDAWNI 414
Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRH----RTVAVIGPNSDVTVTMIGNYAGIACGYTTP 441
+ +AA QGIVLLKN LPL+ + TVA+IGP ++ T ++GNY G A +P
Sbjct: 415 SYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYMISP 474
Query: 442 LQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGL 501
++ ++ F AA+ A++ AD I G+D ++EAEALDR +
Sbjct: 475 RAAFEEAGYKVNFAEGTGISSTSTSGFAAALSAAQSADVIIYAGGIDNTLEAEALDRESI 534
Query: 502 LLPGRQQELVSKV-SMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
PG Q +L+ K+ S A K P I++ M GG +D + KN+ +++A++W GYPGQ+GG A+
Sbjct: 535 AWPGNQLDLIQKLASAAGKKPLIVLQMGGGQVDSSSLKNNTKVSALLWGGYPGQSGGFAL 594
Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGH 620
DI+ G NP G+L T YP Y P T+M +RP PG+TY++Y G VY FGH
Sbjct: 595 RDIITGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGDN--PGQTYKWYTGEAVYEFGH 652
Query: 621 GMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT----LGVQVDV 676
G+ YT F + +N T V L+ + + TH + +T L ++
Sbjct: 653 GLFYTTFAESSSNT-TTKEVKLNIQ----------DILSRTHEELASITQLPVLNFTANI 701
Query: 677 KNVGSKDGAHTLLVFSTPPAGHWAPH--KQLVAFEKV-HVPAGAQQRVGINIHVCKYLSV 733
+N G + +T +VF+ AP+ K LV ++++ V G + + + + V + V
Sbjct: 702 RNTGKLESDYTAMVFANTSDAGPAPYPKKWLVGWDRLGEVKVGETRELRVPVEVGSFARV 761
>gi|76160898|gb|ABA40420.1| Xld [Aspergillus fumigatus]
Length = 792
Score = 411 bits (1057), Expect = e-112, Method: Compositional matrix adjust.
Identities = 252/724 (34%), Positives = 387/724 (53%), Gaps = 27/724 (3%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C S R L+ + +E V + + VPRLG+ Y+ WSEALHG+ +
Sbjct: 63 CDTSARPHDRAAALVSMFTFEELVNNTGNTSPGVPRLGLPPYQVWSEALHGLDRANFTDE 122
Query: 103 FGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRD 162
G++ ATSFP I T S+ N TL I +++ + RA N G GL ++PN+N FR
Sbjct: 123 --GEYSWATSFPMPILTMSALNRTLINQIATIIATQGRAFNNVGRYGLDVYAPNINAFRS 180
Query: 163 PRWGRGQETPGEDP-VLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDR 220
WGRGQETPGED L+ YA Y+ G+QG D + LK+ A+ KH+ YDL+NW+G R
Sbjct: 181 AMWGRGQETPGEDAYCLASAYAYEYITGIQGGVDPEHLKLVATAKHYAGYDLENWDGHSR 240
Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
+ +++Q++ + + F + + KV SVMCSYN VNGVP+CA+ L+ +R +
Sbjct: 241 LGNDMNITQQELSEYYTPQFLVAARDAKVHSVMCSYNAVNGVPSCANSFFLQTLLRDTFG 300
Query: 281 L--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLL 338
+GY+ SDCDS ++ F + AAAD+IRAG D+DCG + A +
Sbjct: 301 FVEDGYVSSDCDSAYNVWNPHEFAANITGAAADSIRAGTDIDCGTTYQYYFGEAFDEQEV 360
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
+ +I ++ + +RLG FDG S Y L DV T D ++ EAA +GIVLLK
Sbjct: 361 TRAEIERGVIRLYSNLVRLGYFDGNGSV--YRDLTWNDVVTTDAWNISYEAAVEGIVLLK 418
Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCK 458
N G +LPL+ R+VA+IGP +VT + GNY G A +PL ++
Sbjct: 419 NDG-TLPLAK-SVRSVALIGPWMNVTTQLQGNYFGPAPYLISPLNAFQNSDFDVNYAFGT 476
Query: 459 DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMAS 518
+++ F A+ A++++D I G+D ++EAEA+DR + PG Q +L+ ++S
Sbjct: 477 NISSHSTDGFSEALSAAKKSDVIIFAGGIDNTLEAEAMDRMNITWPGNQLQLIDQLSQLG 536
Query: 519 KGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTW 578
K P I++ M GG +D + K++ + ++IW GYPGQ+GG A+ DI+ G P G+L +T
Sbjct: 537 K-PLIVLQMGGGQVDSSSLKSNKNVNSLIWGGYPGQSGGQALLDIITGKRAPAGRLVVTQ 595
Query: 579 YPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
YP EY T P T+M++RP + PG+TY +Y G VY FGHG+ YT F H
Sbjct: 596 YPAEYATQFPATDMSLRPHGNN--PGQTYMWYTGTPVYEFGHGLFYTTF-H--------A 644
Query: 639 AVPLDGR-HGSINATISGKAIKVTHAKCNRL-TLGVQVDVKNVGSKDGAHTLLVFSTPPA 696
++P G+ S N A ++ L V + N G +T ++F+ A
Sbjct: 645 SLPGTGKDKTSFNIQDLLTQPHPGFANVEQMPLLNFTVTITNTGKVASDYTAMLFANTTA 704
Query: 697 GHWA-PHKQLVAFEKV-HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGT 754
G P+K LV F+++ + Q + I + + ++ D +G R + G++ + +
Sbjct: 705 GPAPYPNKWLVGFDRLASLEPHRSQTMTIPVTIDS-VARTDEAGNRVLYPGKYELALNNE 763
Query: 755 KHSV 758
+ V
Sbjct: 764 RSVV 767
>gi|169767016|ref|XP_001817979.1| exo-1,4-beta-xylosidase xlnD [Aspergillus oryzae RIB40]
gi|121805502|sp|Q2UR38.1|XYND_ASPOR RecName: Full=Exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|83765834|dbj|BAE55977.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 798
Score = 411 bits (1057), Expect = e-112, Method: Compositional matrix adjust.
Identities = 257/730 (35%), Positives = 382/730 (52%), Gaps = 33/730 (4%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C S R L+ L+ +E V + PR+G+ Y+ W+EALHGV++
Sbjct: 63 CDTSAKPHDRAAALVSLLTFEELVNNTANTGHGAPRIGLPAYQVWNEALHGVAHADFSD- 121
Query: 103 FGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRD 162
G F +TSFPQ I+T ++ N TL I ++S + RA N G GL +SPN+N FR
Sbjct: 122 -AGGFSWSTSFPQPISTMAALNRTLIHQIATIISTQGRAFMNAGRYGLDVYSPNINTFRH 180
Query: 163 PRWGRGQETPGEDP-VLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDR 220
P WGRGQETPGED L+ YA Y+ G+QG D + LK+ A+ KH+ YD++NW+ R
Sbjct: 181 PVWGRGQETPGEDAYCLASTYAYEYITGIQGGVDANPLKLIATAKHYAGYDIENWDNHSR 240
Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
+ ++++QD+ + + F + + KV SVMCSYN VNGVP+C++ L+ +R +
Sbjct: 241 LGNDMQITQQDLAEYYTPQFLVASRDAKVHSVMCSYNAVNGVPSCSNSFFLQTLLRDTFD 300
Query: 281 L--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLL 338
+GY+ DC +V ++ + + AAAD+IRAG D+DCG H + + +
Sbjct: 301 FVEDGYVSGDCGAVYNVFNPHGYATNESSAAADSIRAGTDIDCGVSYPRHFQESFHDQEV 360
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
S D+ + +R G FDG+ S PY ++ DV + + Q L+ EAA Q IVLLK
Sbjct: 361 SRQDLERGVTRLYASLIRAGYFDGKTS--PYRNITWSDVVSTNAQNLSYEAAAQSIVLLK 418
Query: 399 NQGPSLPL--SHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQG 456
N G LPL + +T+A+IGP ++ T M+GNY G A +PLQ I
Sbjct: 419 NDG-ILPLTSTSSSTKTIALIGPWANATTQMLGNYYGPAPYLISPLQAFQDSEYKITYTI 477
Query: 457 CKDVACADDQLF-GAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVS 515
+ D A+ +++AD I G+D ++E EA DR+ + P Q L++K++
Sbjct: 478 GTNTTTDPDSTSQSTALTTAKEADLIIFAGGIDNTLETEAQDRSNITWPSNQLSLITKLA 537
Query: 516 MASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLP 575
K P I++ M GG +D + KN+ + A+IW GYPGQ+GG A+ADI+ G P +L
Sbjct: 538 DLGK-PLIVLQMGGGQVDSSALKNNKNVNALIWGGYPGQSGGQALADIITGKRAPAARLV 596
Query: 576 MTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAP 635
T YP EY P +M +RP+ S PG+TY +Y G VY FGHG+ YTNF
Sbjct: 597 TTQYPAEYAEVFPAIDMNLRPNGSN--PGQTYMWYTGTPVYEFGHGLFYTNF-------- 646
Query: 636 TVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT----LGVQVDVKNVGSKDGAHTLLVF 691
T A G + I + + H + L VDVKN G + +T + F
Sbjct: 647 TASASASSGTKNRTSFNID-EVLGRPHLGYKLVEQMPLLNFTVDVKNTGDRVSDYTAMAF 705
Query: 692 STPPAGHWAPH--KQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHN 748
AG APH K LV F+++ V G+ + + I + V L+ D G R + G +
Sbjct: 706 VNTTAGP-APHPNKWLVGFDRLSAVEPGSAKTMVIPVTV-DSLARTDEEGNRVLYPGRYE 763
Query: 749 IHIGGTKHSV 758
+ + + V
Sbjct: 764 VALNNEREVV 773
>gi|194400335|gb|ACF61038.1| beta-xylosidase [Aspergillus awamori]
Length = 804
Score = 411 bits (1057), Expect = e-112, Method: Compositional matrix adjust.
Identities = 256/720 (35%), Positives = 377/720 (52%), Gaps = 32/720 (4%)
Query: 31 DPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEA 90
D ++ R+ C S R LI +L E + + V RLG+ Y+ WSEA
Sbjct: 57 DCQNGPLRSHLICDESATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEA 116
Query: 91 LHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGL 150
LHG+ G + ATSFPQ I T ++ N TL I ++S + RA N G GL
Sbjct: 117 LHGLDRANFSDS--GSYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGL 174
Query: 151 TYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD-RLKVAASCKHFTA 209
++PN+N FR P WGRGQETPGED L+ YA Y+ G+QG D D LK+AA+ KH+
Sbjct: 175 DVYAPNINTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQGPDPDSNLKLAATAKHYAG 234
Query: 210 YDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
YD++NW+ R + +++QD+ + + F + + KV SVMC+YN VNGVP CAD
Sbjct: 235 YDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVHSVMCAYNAVNGVPACADSY 294
Query: 270 ILKRTIRGEWRL--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGL 327
L+ +R + +GY+ SDCD+ Y+ + S+ AAA+AI AG D+DCG
Sbjct: 295 FLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYASSQAAAAAEAILAGTDIDCGTTYQW 354
Query: 328 HTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPS--SQPYGHLGPKDVCTPDHQEL 385
H ++ G LS DI ++ T ++ G FD + + PY L DV D +
Sbjct: 355 HLNESITAGDLSRDDIEKGVIRLYTTLVQAGYFDSNTTKANNPYRDLTWSDVLETDAWNI 414
Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRH----RTVAVIGPNSDVTVTMIGNYAGIACGYTTP 441
+ +AA QGIVLLKN LPL+ + TVA+IGP ++ T ++GNY G A +P
Sbjct: 415 SYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYMISP 474
Query: 442 LQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGL 501
++ ++ F AA+ A+R AD I G+D ++EAEALDR +
Sbjct: 475 RAAFEEAGYKVNFAEGTGISSTSTSGFAAALSAARSADVIIYAGGIDNTLEAEALDRESI 534
Query: 502 LLPGRQQELVSKV-SMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
PG Q +L+ K+ S A P I++ M GG +D + KN+ + A++W GYPGQ+GG A+
Sbjct: 535 AWPGNQLDLIQKLASSAGSKPLIVLQMGGGQVDSSSLKNNTNVTALLWGGYPGQSGGFAL 594
Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGH 620
DI+ G NP G+L T YP Y P T+M +RP PG+TY++Y G VY FGH
Sbjct: 595 RDIITGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGDN--PGQTYKWYTGEAVYEFGH 652
Query: 621 GMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT----LGVQVDV 676
G+ YT F + +N T V L+ + + TH + +T L ++
Sbjct: 653 GLFYTTFAESSSNT-TTKEVKLNIQ----------DILSQTHEELASITQLPVLNFTANI 701
Query: 677 KNVGSKDGAHTLLVFSTPPAGHWAPH--KQLVAFEKV-HVPAGAQQRVGINIHVCKYLSV 733
KN G + +T +VF+ AP+ K LV ++++ V G + + + + V + V
Sbjct: 702 KNTGKLESDYTAMVFANTSDAGPAPYPVKWLVGWDRLGDVKVGETRELRVPVEVGSFARV 761
>gi|70996610|ref|XP_753060.1| beta-xylosidase XylA [Aspergillus fumigatus Af293]
gi|74672055|sp|Q4WRB0.1|XYND_ASPFU RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|66850695|gb|EAL91022.1| beta-xylosidase XylA [Aspergillus fumigatus Af293]
Length = 792
Score = 411 bits (1057), Expect = e-112, Method: Compositional matrix adjust.
Identities = 252/724 (34%), Positives = 387/724 (53%), Gaps = 27/724 (3%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C S R L+ + +E V + + VPRLG+ Y+ WSEALHG+ +
Sbjct: 63 CDTSARPHDRAAALVSMFTFEELVNNTGNTSPGVPRLGLPPYQVWSEALHGLDRANFTDE 122
Query: 103 FGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRD 162
G++ ATSFP I T S+ N TL I +++ + RA N G GL ++PN+N FR
Sbjct: 123 --GEYSWATSFPMPILTMSALNRTLINQIATIIATQGRAFNNVGRYGLDVYAPNINAFRS 180
Query: 163 PRWGRGQETPGEDP-VLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDR 220
WGRGQETPGED L+ YA Y+ G+QG D + LK+ A+ KH+ YDL+NW+G R
Sbjct: 181 AMWGRGQETPGEDAYCLASAYAYEYITGIQGGVDPEHLKLVATAKHYAGYDLENWDGHSR 240
Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
+ +++Q++ + + F + + KV SVMCSYN VNGVP+CA+ L+ +R +
Sbjct: 241 LGNDMNITQQELSEYYTPQFLVAARDAKVHSVMCSYNAVNGVPSCANSFFLQTLLRDTFG 300
Query: 281 L--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLL 338
+GY+ SDCDS ++ F + AAAD+IRAG D+DCG + A +
Sbjct: 301 FVEDGYVSSDCDSAYNVWNPHEFAANITGAAADSIRAGTDIDCGTTYQYYFGEAFDEQEV 360
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
+ +I ++ + +RLG FDG S Y L DV T D ++ EAA +GIVLLK
Sbjct: 361 TRAEIERGVIRLYSNLVRLGYFDGNGSV--YRDLTWNDVVTTDAWNISYEAAVEGIVLLK 418
Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCK 458
N G +LPL+ R+VA+IGP +VT + GNY G A +PL ++
Sbjct: 419 NDG-TLPLAK-SVRSVALIGPWMNVTTQLQGNYFGPAPYLISPLNAFQNSDFDVNYAFGT 476
Query: 459 DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMAS 518
+++ F A+ A++++D I G+D ++EAEA+DR + PG Q +L+ ++S
Sbjct: 477 NISSHSTDGFSEALSAAKKSDVIIFAGGIDNTLEAEAMDRMNITWPGNQLQLIDQLSQLG 536
Query: 519 KGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTW 578
K P I++ M GG +D + K++ + ++IW GYPGQ+GG A+ DI+ G P G+L +T
Sbjct: 537 K-PLIVLQMGGGQVDSSSLKSNKNVNSLIWGGYPGQSGGQALLDIITGKRAPAGRLVVTQ 595
Query: 579 YPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
YP EY T P T+M++RP + PG+TY +Y G VY FGHG+ YT F H
Sbjct: 596 YPAEYATQFPATDMSLRPHGNN--PGQTYMWYTGTPVYEFGHGLFYTTF-H--------A 644
Query: 639 AVPLDGR-HGSINATISGKAIKVTHAKCNRL-TLGVQVDVKNVGSKDGAHTLLVFSTPPA 696
++P G+ S N A ++ L V + N G +T ++F+ A
Sbjct: 645 SLPGTGKDKTSFNIQDLLTQPHPGFANVEQMPLLNFTVTITNTGKVASDYTAMLFANTTA 704
Query: 697 GHWA-PHKQLVAFEKV-HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGT 754
G P+K LV F+++ + Q + I + + ++ D +G R + G++ + +
Sbjct: 705 GPAPYPNKWLVGFDRLASLEPHRSQTMTIPVTIDS-VARTDEAGNRVLYPGKYELALNNE 763
Query: 755 KHSV 758
+ V
Sbjct: 764 RSVV 767
>gi|292495282|sp|B0XP71.1|XYND_ASPFC RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|159131796|gb|EDP56909.1| beta-xylosidase XylA [Aspergillus fumigatus A1163]
Length = 792
Score = 411 bits (1057), Expect = e-112, Method: Compositional matrix adjust.
Identities = 252/724 (34%), Positives = 387/724 (53%), Gaps = 27/724 (3%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C S R L+ + +E V + + VPRLG+ Y+ WSEALHG+ +
Sbjct: 63 CDTSARPHDRAAALVSMFTFEELVNNTGNTSPGVPRLGLPPYQVWSEALHGLDRANFTDE 122
Query: 103 FGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRD 162
G++ ATSFP I T S+ N TL I +++ + RA N G GL ++PN+N FR
Sbjct: 123 --GEYSWATSFPMPILTMSALNRTLINQIATIIATQGRAFNNVGRYGLDVYAPNINAFRS 180
Query: 163 PRWGRGQETPGEDP-VLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDR 220
WGRGQETPGED L+ YA Y+ G+QG D + LK+ A+ KH+ YDL+NW+G R
Sbjct: 181 AMWGRGQETPGEDAYCLASAYAYEYITGIQGGVDPEHLKLVATAKHYAGYDLENWDGHSR 240
Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
+ +++Q++ + + F + + KV SVMCSYN VNGVP+CA+ L+ +R +
Sbjct: 241 LGNDMNITQQELSEYYTPQFLVAARDAKVHSVMCSYNAVNGVPSCANSFFLQTLLRDTFG 300
Query: 281 L--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLL 338
+GY+ SDCDS ++ F + AAAD+IRAG D+DCG + A +
Sbjct: 301 FVEDGYVSSDCDSAYNVWNPHEFAANITGAAADSIRAGTDIDCGTTYQYYFGEAFDEQEV 360
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
+ +I ++ + +RLG FDG S Y L DV T D ++ EAA +GIVLLK
Sbjct: 361 TRAEIERGVIRLYSNLVRLGYFDGNGSV--YRDLTWNDVVTTDAWNISYEAAVEGIVLLK 418
Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCK 458
N G +LPL+ R+VA+IGP +VT + GNY G A +PL ++
Sbjct: 419 NDG-TLPLAK-SVRSVALIGPWMNVTTQLQGNYFGPAPYLISPLNAFQNSDFDVNYAFGT 476
Query: 459 DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMAS 518
+++ F A+ A++++D I G+D ++EAEA+DR + PG Q +L+ ++S
Sbjct: 477 NISSHSTDGFSEALSAAKKSDVIIFAGGIDNTLEAEAMDRMNITWPGNQLQLIDQLSQLG 536
Query: 519 KGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTW 578
K P I++ M GG +D + K++ + ++IW GYPGQ+GG A+ DI+ G P G+L +T
Sbjct: 537 K-PLIVLQMGGGQVDSSSLKSNKNVNSLIWGGYPGQSGGQALLDIITGKRAPAGRLVVTQ 595
Query: 579 YPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
YP EY T P T+M++RP + PG+TY +Y G VY FGHG+ YT F H
Sbjct: 596 YPAEYATQFPATDMSLRPHGNN--PGQTYMWYTGTPVYEFGHGLFYTTF-H--------A 644
Query: 639 AVPLDGR-HGSINATISGKAIKVTHAKCNRL-TLGVQVDVKNVGSKDGAHTLLVFSTPPA 696
++P G+ S N A ++ L V + N G +T ++F+ A
Sbjct: 645 SLPGTGKDKTSFNIQDLLTQPHPGFANVEQMPLLNFTVTITNTGKVASDYTAMLFANTTA 704
Query: 697 GHWA-PHKQLVAFEKV-HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGT 754
G P+K LV F+++ + Q + I + + ++ D +G R + G++ + +
Sbjct: 705 GPAPYPNKWLVGFDRLASLEPHRSQTMTIPVTIDS-VARTDEAGNRVLYPGKYELALNNE 763
Query: 755 KHSV 758
+ V
Sbjct: 764 RSVV 767
>gi|410723195|ref|ZP_11362440.1| beta-glucosidase-like glycosyl hydrolase [Clostridium sp.
Maddingley MBC34-26]
gi|410603399|gb|EKQ57833.1| beta-glucosidase-like glycosyl hydrolase [Clostridium sp.
Maddingley MBC34-26]
Length = 709
Score = 411 bits (1056), Expect = e-112, Method: Compositional matrix adjust.
Identities = 260/732 (35%), Positives = 385/732 (52%), Gaps = 98/732 (13%)
Query: 55 DLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFP 114
+L+ +++LQE+ + L + A+ L + Y WW+E LHGV+ G AT FP
Sbjct: 18 ELVSKMTLQERAEQLTYQSPAIKHLNVPEYNWWNEGLHGVARAGT----------ATVFP 67
Query: 115 QVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYWSPNVNIFRDPRWG 166
Q I A+ F+ I ++S E RA YN G GLTYWSPNVNIFRDPRWG
Sbjct: 68 QAIGLAAIFDEEFLGEIADIISTEGRAKYNEYSKKDDRGIYKGLTYWSPNVNIFRDPRWG 127
Query: 167 RGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--RFHFN 224
RG ET GEDP L+ + ++++GLQG +G LK+AA KHF + +G + R FN
Sbjct: 128 RGHETYGEDPYLTSRLGVAFIKGLQG-EGKYLKLAACAKHFAVH-----SGPEGLRHEFN 181
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
A V K+D+ +T+ F CV E V SVM +YN+ NG P C +LK +RG+W G+
Sbjct: 182 AVVEKKDLYETYLPAFEACVKEANVESVMGAYNRTNGEPCCGSKTLLKDILRGKWGFKGH 241
Query: 285 IVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDIN 344
+VSDC ++ ++ TST E+ A AI G DL+CG + L+ A + GL++E I
Sbjct: 242 VVSDCWALADFHLHHMITSTATESVALAIENGCDLNCGN-MYLNLLLAYKEGLVTEEQIT 300
Query: 345 NALVNTLTVQMRLGMFDG--EPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGP 402
A +T + +LGMFD E + PY + C +H E+AL A+R+ +VLLKN G
Sbjct: 301 TAAERLMTTRFKLGMFDEDCEYNRIPY----EVNDC-KEHNEIALIASRKSMVLLKNDG- 354
Query: 403 SLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGC- 457
+LPL +++AVIGPN++ + + GNY+G A YTT L+GI G R + +GC
Sbjct: 355 TLPLDKSSLKSIAVIGPNANSEIMLKGNYSGTASKYTTILEGIHNAVGDNIRVYYSEGCH 414
Query: 458 ------KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRAGLL 502
+D+A DD+L AI + ++D IL +GLD +IE E A D+ L
Sbjct: 415 LFKDKVEDLAGPDDRL-SEAISVAERSDVVILCLGLDSTIEGEQGDAGNSYGAGDKESLN 473
Query: 503 LPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIAD 562
LPGRQQ L+ KV K P I+VL +G + F + + AAI+ A YPG GGTA+AD
Sbjct: 474 LPGRQQNLLEKVLEVGK-PVIVVLGAGSAL--TFNGAEEKCAAILNAWYPGSHGGTAVAD 530
Query: 563 ILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHG 621
ILFG +P GKLP+T+Y NLP T+ +M+ GRTYR+ + +YPFG+G
Sbjct: 531 ILFGKCSPSGKLPVTFYKD--TANLPEFTDYSMK--------GRTYRYLEHESLYPFGYG 580
Query: 622 MSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGS 681
++Y+ + ++V K + + + +D++N G+
Sbjct: 581 LTYSK--------------------------VELSNLQVPFVKADFESFDISIDIRNTGN 614
Query: 682 KDGAHTLLVFSTPPAGHWAP-HKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTR 740
+ + +A + L F++V + G + V I + + V+ G R
Sbjct: 615 YGIEEVVQCYVKDLKSKYAVLNHSLAGFKRVSLKKGESKTVTIELSK-RSFEAVNNDGER 673
Query: 741 RIPLGEHNIHIG 752
+ + +G
Sbjct: 674 LLDSKSFKLFVG 685
>gi|322512556|gb|ADX05682.1| putative carbohydrate-active enzyme [uncultured organism]
Length = 717
Score = 411 bits (1056), Expect = e-112, Method: Compositional matrix adjust.
Identities = 262/735 (35%), Positives = 385/735 (52%), Gaps = 92/735 (12%)
Query: 49 IPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFP 108
+R L+ ++L+EKV + A A+ RLG+ Y +W+EALHGV+ G
Sbjct: 13 FEERAQALVCEMTLEEKVFQTLFNAPAIERLGVPAYNYWNEALHGVARAGV--------- 63
Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIF 160
AT FPQ I A+SF+ L + +S EARA +N GLT+WSPNVNIF
Sbjct: 64 -ATVFPQAIGLAASFDEELLGQVADTISTEARAKFNMQQKFGDRDIYKGLTFWSPNVNIF 122
Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD- 219
RDPRWGRG ET GEDP LSG+ S++RG+QG D +KVAA KHF + +G +
Sbjct: 123 RDPRWGRGHETFGEDPFLSGRLGVSFIRGMQGDDERYMKVAACAKHFAVH-----SGPED 177
Query: 220 -RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGE 278
R FNA VS+QD+ +T+ F CV E V +VM +YN+ NG C +L +RGE
Sbjct: 178 QRHSFNAVVSEQDLRETYLPAFHACVTEAGVEAVMGAYNRTNGEACCGSKKLLVDILRGE 237
Query: 279 WRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLL 338
W G++ SDC ++ +++ T EE A A+ +G DL+CG L +H AV+ GL+
Sbjct: 238 WGFRGHVTSDCWALKDFHEFHMVTKNQEETVALAMNSGCDLNCGN-LYVHLLQAVRDGLV 296
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
E I+ A+ T +M+LG+FD PY +G V T +++L EA+R+ + LLK
Sbjct: 297 EESVIDRAVTRLFTTRMKLGLFD-RSEEVPYNGIGYDRVDTEANRKLNREASRRTVCLLK 355
Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA----RTIHQ 454
N LPL + RT+ V+GPN+D ++GNY G A Y T L GI A R ++
Sbjct: 356 NADGLLPLDISKLRTIGVVGPNADNRKALVGNYEGTASEYVTVLDGIRELAGDDVRVVYS 415
Query: 455 QGC-------KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDR 498
+GC + + +D++ A + +D I VMGLD +E E + D+
Sbjct: 416 EGCHLFRDRVQGLGQPNDRI-AEARAVAELSDVVIAVMGLDPGLEGEEGDQGNEFASGDK 474
Query: 499 AGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGT 558
L LPG Q E++ K + S P +LVL+ G + + +A+ + AI+ A YPG GG
Sbjct: 475 PNLELPGLQGEVL-KALVESGKPVVLVLLGGSALAIPWAEE--HVPAILDAWYPGAQGGR 531
Query: 559 AIADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYP 617
A+AD+LFG + P GKLP+T+Y LP T+ +M+ RTYR+ K P +YP
Sbjct: 532 AVADVLFGRACPEGKLPVTFYRTS--EELPAFTDYSMK--------NRTYRYMKQPALYP 581
Query: 618 FGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVK 677
FG+G+SYT++ T A GS++ + +A+ ++
Sbjct: 582 FGYGLSYTSWELTNTTA-----------EGSVDDGVVCRAV-----------------LR 613
Query: 678 NVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRS 737
N G+ GA T+ V+ P P+ QL K+ + G V I++ + V +
Sbjct: 614 NTGAMAGAQTVQVYVKAPLAT-GPNAQLKGLRKIRLQPGESAEVAISLDK-EAFGVYNEK 671
Query: 738 GTRRIPLGEHNIHIG 752
G R + GE+ I+IG
Sbjct: 672 GLRVLLPGEYKIYIG 686
>gi|150019782|ref|YP_001312036.1| glycoside hydrolase family protein [Clostridium beijerinckii NCIMB
8052]
gi|149906247|gb|ABR37080.1| glycoside hydrolase, family 3 domain protein [Clostridium
beijerinckii NCIMB 8052]
Length = 709
Score = 410 bits (1055), Expect = e-111, Method: Compositional matrix adjust.
Identities = 258/730 (35%), Positives = 386/730 (52%), Gaps = 94/730 (12%)
Query: 55 DLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFP 114
+L+ +++LQEK + L + A+ L + Y WW+E LHGV+ G AT FP
Sbjct: 18 ELVSKMTLQEKAEQLTYQSPAIKHLNVPEYNWWNEGLHGVARAGT----------ATVFP 67
Query: 115 QVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYWSPNVNIFRDPRWG 166
Q I A+ F+ + +++ E RA YN G GLTYWSPN+NIFRDPRWG
Sbjct: 68 QAIGLAAIFDDEFLGKVANIIATEGRAKYNEYSKKDDRGIYKGLTYWSPNINIFRDPRWG 127
Query: 167 RGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--RFHFN 224
RG ET GEDP L+ + ++++GLQG +G LK+AA KHF + +G + R FN
Sbjct: 128 RGHETYGEDPYLTSRLGVAFIKGLQG-EGKYLKLAACAKHFAVH-----SGPEGLRHEFN 181
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
A V+K+D+ +T+ F CV E V SVM +YN+ NG P C +LK +RG+W G+
Sbjct: 182 AVVNKKDLYETYLPAFEACVKEANVESVMGAYNRTNGEPCCGSKTLLKDILRGKWGFKGH 241
Query: 285 IVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDIN 344
+VSDC ++ ++ TST E+ A AI G DL+CG L+ A + GL++E I
Sbjct: 242 VVSDCWALADFHLHHMVTSTATESVALAIENGCDLNCGNMY-LNLLLAYKEGLVTEEQIT 300
Query: 345 NALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSL 404
A +T + +LGMFD E Y + + + +H E+AL A+R+ +VLLKN G +L
Sbjct: 301 TAAERLMTTRFKLGMFDEECE---YNKIPYEVNDSREHNEVALIASRKSMVLLKNNG-TL 356
Query: 405 PLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGC--- 457
PL +++AVIGPN++ + + GNY+G A YTT L+GI G R + +GC
Sbjct: 357 PLDKSNLKSIAVIGPNANSEIMLKGNYSGTASKYTTILEGIHDAVGNDVRVYYSEGCHLF 416
Query: 458 ----KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRAGLLLP 504
+D+A DD+L AI + ++D +L +GLD +IE E A D+ L LP
Sbjct: 417 KDKVEDLARPDDRL-SEAISVAERSDVVVLCLGLDSTIEGEQGDAGNSYGAGDKENLNLP 475
Query: 505 GRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADIL 564
GRQQ L+ KV K P I+VL +G + + A + + AAI+ A YPG GGTA+ADIL
Sbjct: 476 GRQQNLLEKVLEVGK-PVIVVLGAGSALTLNGA--EEKCAAILNAWYPGSHGGTAVADIL 532
Query: 565 FGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMS 623
FG +P GKLP+T+Y LP T+ +M+ GRTYR+ +YPFG+G++
Sbjct: 533 FGKCSPSGKLPVTFYKD--TAKLPDFTDYSMK--------GRTYRYLGHESLYPFGYGLT 582
Query: 624 YTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKD 683
Y+ T+ ++V K + + +++KN G D
Sbjct: 583 YS--------------------------TVELSNLQVPSVKQGFGSFDISIEIKNTGEYD 616
Query: 684 GAHTLLVFSTPPAGHWAP-HKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRI 742
+ + +A + L F++V + G + V I ++ K VV+ G R +
Sbjct: 617 IEEVVQCYVKDIESKYAVLNHSLAGFKRVSLKKGESKIVTIKLNK-KSFEVVNDDGERLL 675
Query: 743 PLGEHNIHIG 752
+ + +G
Sbjct: 676 DSKKFKLFVG 685
>gi|145230215|ref|XP_001389416.1| exo-1,4-beta-xylosidase xlnD [Aspergillus niger CBS 513.88]
gi|74626559|sp|O00089.2|XYND_ASPNG RecName: Full=Exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|292495287|sp|A2QA27.1|XYND_ASPNC RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|2181180|emb|CAB06417.1| xylosidase [Aspergillus niger]
gi|134055533|emb|CAK37179.1| xylosidase xlnD-Aspergillus niger
gi|350638468|gb|EHA26824.1| hypothetical protein ASPNIDRAFT_205670 [Aspergillus niger ATCC
1015]
Length = 804
Score = 410 bits (1053), Expect = e-111, Method: Compositional matrix adjust.
Identities = 253/720 (35%), Positives = 378/720 (52%), Gaps = 32/720 (4%)
Query: 31 DPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEA 90
D ++ R+ C + R LI +L E + + V RLG+ Y+ WSEA
Sbjct: 57 DCQNGPLRSHLICDETATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEA 116
Query: 91 LHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGL 150
LHG+ G + ATSFPQ I T ++ N TL I ++S + RA N G GL
Sbjct: 117 LHGLDRANFSDS--GAYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGL 174
Query: 151 TYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD-RLKVAASCKHFTA 209
++PN+N FR P WGRGQETPGED L+ YA Y+ G+QG D + LK+AA+ KH+
Sbjct: 175 DVYAPNINTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQGPDPESNLKLAATAKHYAG 234
Query: 210 YDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
YD++NW+ R + +++QD+ + + F + + KV SVMC+YN VNGVP CAD
Sbjct: 235 YDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVQSVMCAYNAVNGVPACADSY 294
Query: 270 ILKRTIRGEWRL--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGL 327
L+ +R + +GY+ SDCD+ Y+ + S+ AAA+AI AG D+DCG
Sbjct: 295 FLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYASSQAAAAAEAILAGTDIDCGTTYQW 354
Query: 328 HTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPS--SQPYGHLGPKDVCTPDHQEL 385
H ++ G LS DI ++ T ++ G FD + + PY L DV D +
Sbjct: 355 HLNESIAAGDLSRDDIEQGVIRLYTTLVQAGYFDSNTTKANNPYRDLSWSDVLETDAWNI 414
Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRH----RTVAVIGPNSDVTVTMIGNYAGIACGYTTP 441
+ +AA QGIVLLKN LPL+ + TVA+IGP ++ T ++GNY G A +P
Sbjct: 415 SYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYMISP 474
Query: 442 LQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGL 501
++ ++ F AA+ A++ AD I G+D ++EAEALDR +
Sbjct: 475 RAAFEEAGYKVNFAEGTGISSTSTSGFAAALSAAQSADVIIYAGGIDNTLEAEALDRESI 534
Query: 502 LLPGRQQELVSKV-SMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
PG Q +L+ K+ S A K P I++ M GG +D + KN+ ++A++W GYPGQ+GG A+
Sbjct: 535 AWPGNQLDLIQKLASAAGKKPLIVLQMGGGQVDSSSLKNNTNVSALLWGGYPGQSGGFAL 594
Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGH 620
DI+ G NP G+L T YP Y P T+M +RP PG+TY++Y G VY FGH
Sbjct: 595 RDIITGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGDN--PGQTYKWYTGEAVYEFGH 652
Query: 621 GMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT----LGVQVDV 676
G+ YT F + +N T V L+ + + TH +T L ++
Sbjct: 653 GLFYTTFAESSSNT-TTKEVKLNIQ----------DILSQTHEDLASITQLPVLNFTANI 701
Query: 677 KNVGSKDGAHTLLVFSTPPAGHWAPH--KQLVAFEKV-HVPAGAQQRVGINIHVCKYLSV 733
+N G + +T +VF+ AP+ K LV ++++ V G + + + + V + V
Sbjct: 702 RNTGKLESDYTAMVFANTSDAGPAPYPKKWLVGWDRLGEVKVGETRELRVPVEVGSFARV 761
>gi|290889355|gb|ADD69953.1| xylosidase HistTag [synthetic construct]
Length = 810
Score = 410 bits (1053), Expect = e-111, Method: Compositional matrix adjust.
Identities = 253/720 (35%), Positives = 378/720 (52%), Gaps = 32/720 (4%)
Query: 31 DPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEA 90
D ++ R+ C + R LI +L E + + V RLG+ Y+ WSEA
Sbjct: 57 DCQNGPLRSHLICDETATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEA 116
Query: 91 LHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGL 150
LHG+ G + ATSFPQ I T ++ N TL I ++S + RA N G GL
Sbjct: 117 LHGLDRANFSDS--GAYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGL 174
Query: 151 TYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD-RLKVAASCKHFTA 209
++PN+N FR P WGRGQETPGED L+ YA Y+ G+QG D + LK+AA+ KH+
Sbjct: 175 DVYAPNINTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQGPDPESNLKLAATAKHYAG 234
Query: 210 YDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
YD++NW+ R + +++QD+ + + F + + KV SVMC+YN VNGVP CAD
Sbjct: 235 YDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVQSVMCAYNAVNGVPACADSY 294
Query: 270 ILKRTIRGEWRL--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGL 327
L+ +R + +GY+ SDCD+ Y+ + S+ AAA+AI AG D+DCG
Sbjct: 295 FLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYASSQAAAAAEAILAGTDIDCGTTYQW 354
Query: 328 HTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPS--SQPYGHLGPKDVCTPDHQEL 385
H ++ G LS DI ++ T ++ G FD + + PY L DV D +
Sbjct: 355 HLNESIAAGDLSRDDIEQGVIRLYTTLVQAGYFDSNTTKANNPYRDLSWSDVLETDAWNI 414
Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRH----RTVAVIGPNSDVTVTMIGNYAGIACGYTTP 441
+ +AA QGIVLLKN LPL+ + TVA+IGP ++ T ++GNY G A +P
Sbjct: 415 SYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYMISP 474
Query: 442 LQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGL 501
++ ++ F AA+ A++ AD I G+D ++EAEALDR +
Sbjct: 475 RAAFEEAGYKVNFAEGTGISSTSTSGFAAALSAAQSADVIIYAGGIDNTLEAEALDRESI 534
Query: 502 LLPGRQQELVSKV-SMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
PG Q +L+ K+ S A K P I++ M GG +D + KN+ ++A++W GYPGQ+GG A+
Sbjct: 535 AWPGNQLDLIQKLASAAGKKPLIVLQMGGGQVDSSSLKNNTNVSALLWGGYPGQSGGFAL 594
Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGH 620
DI+ G NP G+L T YP Y P T+M +RP PG+TY++Y G VY FGH
Sbjct: 595 RDIITGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGDN--PGQTYKWYTGEAVYEFGH 652
Query: 621 GMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT----LGVQVDV 676
G+ YT F + +N T V L+ + + TH +T L ++
Sbjct: 653 GLFYTTFAESSSNT-TTKEVKLNIQ----------DILSQTHEDLASITQLPVLNFTANI 701
Query: 677 KNVGSKDGAHTLLVFSTPPAGHWAPH--KQLVAFEKV-HVPAGAQQRVGINIHVCKYLSV 733
+N G + +T +VF+ AP+ K LV ++++ V G + + + + V + V
Sbjct: 702 RNTGKLESDYTAMVFANTSDAGPAPYPKKWLVGWDRLGEVKVGETRELRVPVEVGSFARV 761
>gi|160881137|ref|YP_001560105.1| glycoside hydrolase family 3 [Clostridium phytofermentans ISDg]
gi|160429803|gb|ABX43366.1| glycoside hydrolase family 3 domain protein [Clostridium
phytofermentans ISDg]
Length = 717
Score = 410 bits (1053), Expect = e-111, Method: Compositional matrix adjust.
Identities = 260/737 (35%), Positives = 392/737 (53%), Gaps = 91/737 (12%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
QR +L+ +++L+EKV + A ++PRL IK Y +W+EALHGV+ G A
Sbjct: 12 QRATELVKKMTLEEKVFQTLHSAPSIPRLDIKAYNYWNEALHGVARAGV----------A 61
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRD 162
T FPQ I A++F+ L E I +S E R +N GLT+WSPNVNIFRD
Sbjct: 62 TVFPQAIGLAATFDEDLIEEIADTISTEGRGKFNAQQKYGDHDIYKGLTFWSPNVNIFRD 121
Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--R 220
PRWGRG ET GEDP LSG +V G+QG D LK AA KHF + +G + R
Sbjct: 122 PRWGRGHETFGEDPFLSGTLGGRFVDGIQGHDETYLKAAACAKHFAVH-----SGPEDIR 176
Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
FNA+VS+QD+ +T+ F+ V E KV +VM +YN+ NG P C +L+ +RGEW
Sbjct: 177 HSFNAEVSEQDLRETYLPAFKKLVKEHKVEAVMGAYNRTNGEPCCGSKTLLEDILRGEWE 236
Query: 281 LNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSE 340
G++ SDC ++ +++ TS E+ A A+ G DL+CG L ++ AV+ GL+ E
Sbjct: 237 FVGHVTSDCWAIKDFHEHHMVTSNAVESVALAMNRGCDLNCGN-LYVNLLQAVRDGLVEE 295
Query: 341 IDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
I+ AL+ T +M+LG+FD E S P+ + V T +EL ++A+++ +VLLKN+
Sbjct: 296 ETIDTALIRLFTTRMKLGLFDKE-ESIPFNTITYDQVDTKSSKELNIKASKKCVVLLKNE 354
Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY----ARTIHQQG 456
LPL+ + +V VIGPN++ ++GNY G A Y T L+GI + R +G
Sbjct: 355 DNILPLNPKKITSVGVIGPNANNRNALVGNYEGTASEYITVLEGIKQVVPEDVRVYFSEG 414
Query: 457 C-------KDVACADDQLFGAAIDA-SRQADATILVMGLDQSIEAE---------ALDRA 499
C +++ +D++ A + A +D I +GLD +E E + D+
Sbjct: 415 CHLFKNKLSNLSQENDRI--AEVRAVCEHSDVVIACLGLDPGLEGEEGDQGNQFASGDKK 472
Query: 500 GLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTA 559
L LPG Q++++ + K P IL+L+SG + V +A D I AI+ YPG GG A
Sbjct: 473 TLALPGIQEDVLKTIYECGK-PVILILLSGSALAVPWA--DEHIPAILQGWYPGAQGGRA 529
Query: 560 IADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPF 618
IA+++FG NP GKLP+T+Y LP T+ AM+ RTYR+ K +YPF
Sbjct: 530 IAELIFGDGNPEGKLPVTFY--RTTEELPEFTDYAMK--------NRTYRYMKNEALYPF 579
Query: 619 GHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKN 678
G+G+SYT F HT+ +N GK ++ +C V VKN
Sbjct: 580 GYGLSYTTFEHTLL---------------YVNTDTLGKG---SNVEC-------MVRVKN 614
Query: 679 VGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSG 738
G +G+ T + G AP+ QL +KV + G ++ + I + + + + G
Sbjct: 615 TGDYEGSVTTQAY-VKYVGEDAPNCQLKGLKKVSLLPGEEKDIMIELDD-RAFGLYNEEG 672
Query: 739 TRRIPLGEHNIHIGGTK 755
+ GE+ +++ ++
Sbjct: 673 EFILNQGEYELYLSDSQ 689
>gi|354508473|gb|AER26905.1| beta-xylosidase 3 [synthetic construct]
Length = 778
Score = 409 bits (1052), Expect = e-111, Method: Compositional matrix adjust.
Identities = 255/720 (35%), Positives = 377/720 (52%), Gaps = 32/720 (4%)
Query: 31 DPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEA 90
D ++ R+ C S R LI +L E + + V RLG+ Y+ WSEA
Sbjct: 31 DCQNGPLRSHLICDESATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEA 90
Query: 91 LHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGL 150
LHG+ G + ATSFPQ I T ++ N TL I ++S + RA N G GL
Sbjct: 91 LHGLDRANFSDS--GSYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGL 148
Query: 151 TYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD-RLKVAASCKHFTA 209
++PN+N FR P WGRGQETPGED L+ YA Y+ G+QG D D LK+AA+ KH+
Sbjct: 149 DVYAPNINTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQGPDPDSNLKLAATAKHYAG 208
Query: 210 YDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
YD++NW+ R + +++QD+ + + F + + KV SVMC+YN V+GVP CAD
Sbjct: 209 YDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVHSVMCAYNAVDGVPACADSY 268
Query: 270 ILKRTIRGEWRL--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGL 327
L+ +R + +GY+ SDCD+ Y+ + S+ AAA+AI AG D+DCG
Sbjct: 269 FLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYASSQAAAAAEAILAGTDIDCGTTYQW 328
Query: 328 HTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPS--SQPYGHLGPKDVCTPDHQEL 385
H ++ G LS DI ++ T ++ G FD + + PY L DV D +
Sbjct: 329 HLNESITAGDLSRDDIEKGVIRLYTTLVQAGYFDSNTTKANNPYRDLTWSDVLETDAWNI 388
Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRH----RTVAVIGPNSDVTVTMIGNYAGIACGYTTP 441
+ +AA QGIVLLKN LPL+ + TVA+IGP ++ T ++GNY G A +P
Sbjct: 389 SYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYMISP 448
Query: 442 LQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGL 501
++ ++ F AA+ A+R AD I G+D ++EAEALDR +
Sbjct: 449 RAAFEEAGYKVNFAEGTGISSTSTSGFAAALSAARSADVIIYAGGIDNTLEAEALDRESI 508
Query: 502 LLPGRQQELVSKV-SMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
PG Q +L+ K+ S A P I++ M GG +D + KN+ + A++W GYPGQ+GG A+
Sbjct: 509 AWPGNQLDLIQKLASSAGSKPLIVLQMGGGQVDSSSLKNNTNVTALLWGGYPGQSGGFAL 568
Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGH 620
DI+ G NP G+L T YP Y P T+M +RP PG+TY++Y G VY FGH
Sbjct: 569 RDIITGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGDN--PGQTYKWYTGEAVYEFGH 626
Query: 621 GMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT----LGVQVDV 676
G+ YT F + +N T V L+ + + TH + +T L ++
Sbjct: 627 GLFYTTFAESSSNT-TTKEVKLNIQ----------DILSQTHEELASITQLPVLNFTANI 675
Query: 677 KNVGSKDGAHTLLVFSTPPAGHWAPH--KQLVAFEKV-HVPAGAQQRVGINIHVCKYLSV 733
KN G + +T +VF+ AP+ K LV ++++ V G + + + + V + V
Sbjct: 676 KNTGKLESDYTAMVFANTSDAGPAPYPVKWLVGWDRLGDVKVGETRELRVPVEVGSFARV 735
>gi|4235093|gb|AAD13106.1| beta-xylosidase [Aspergillus niger]
Length = 804
Score = 409 bits (1052), Expect = e-111, Method: Compositional matrix adjust.
Identities = 255/720 (35%), Positives = 377/720 (52%), Gaps = 32/720 (4%)
Query: 31 DPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEA 90
D ++ R+ C S R LI +L E + + V RLG+ Y+ WSEA
Sbjct: 57 DCQNGPLRSHLICDESATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEA 116
Query: 91 LHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGL 150
LHG+ G + ATSFPQ I T ++ N TL I ++S + RA N G GL
Sbjct: 117 LHGLDRANFSDS--GSYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGL 174
Query: 151 TYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD-RLKVAASCKHFTA 209
++PN+N FR P WGRGQETPGED L+ YA Y+ G+QG D D LK+AA+ KH+
Sbjct: 175 DVYAPNINTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQGPDPDSNLKLAATAKHYAG 234
Query: 210 YDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
YD++NW+ R + +++QD+ + + F + + KV SVMC+YN V+GVP CAD
Sbjct: 235 YDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVHSVMCAYNAVDGVPACADSY 294
Query: 270 ILKRTIRGEWRL--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGL 327
L+ +R + +GY+ SDCD+ Y+ + S+ AAA+AI AG D+DCG
Sbjct: 295 FLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYASSQAAAAAEAILAGTDIDCGTTYQW 354
Query: 328 HTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPS--SQPYGHLGPKDVCTPDHQEL 385
H ++ G LS DI ++ T ++ G FD + + PY L DV D +
Sbjct: 355 HLNESITAGDLSRDDIEKGVIRLYTTLVQAGYFDSNTTKANNPYRDLTWSDVLETDAWNI 414
Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRH----RTVAVIGPNSDVTVTMIGNYAGIACGYTTP 441
+ +AA QGIVLLKN LPL+ + TVA+IGP ++ T ++GNY G A +P
Sbjct: 415 SYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYMISP 474
Query: 442 LQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGL 501
++ ++ F AA+ A+R AD I G+D ++EAEALDR +
Sbjct: 475 RAAFEEAGYKVNFAEGTGISSTSTSGFAAALSAARSADVIIYAGGIDNTLEAEALDRESI 534
Query: 502 LLPGRQQELVSKV-SMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
PG Q +L+ K+ S A P I++ M GG +D + KN+ + A++W GYPGQ+GG A+
Sbjct: 535 AWPGNQLDLIQKLASSAGSKPLIVLQMGGGQVDSSSLKNNTNVTALLWGGYPGQSGGFAL 594
Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGH 620
DI+ G NP G+L T YP Y P T+M +RP PG+TY++Y G VY FGH
Sbjct: 595 RDIITGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGDN--PGQTYKWYTGEAVYEFGH 652
Query: 621 GMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT----LGVQVDV 676
G+ YT F + +N T V L+ + + TH + +T L ++
Sbjct: 653 GLFYTTFAESSSNT-TTKEVKLNIQ----------DILSQTHEELASITQLPVLNFTANI 701
Query: 677 KNVGSKDGAHTLLVFSTPPAGHWAPH--KQLVAFEKV-HVPAGAQQRVGINIHVCKYLSV 733
KN G + +T +VF+ AP+ K LV ++++ V G + + + + V + V
Sbjct: 702 KNTGKLESDYTAMVFANTSDAGPAPYPVKWLVGWDRLGDVKVGETRELRVPVEVGSFARV 761
>gi|261368518|ref|ZP_05981401.1| beta-glucosidase [Subdoligranulum variabile DSM 15176]
gi|282569400|gb|EFB74935.1| glycosyl hydrolase family 3 C-terminal domain protein
[Subdoligranulum variabile DSM 15176]
Length = 717
Score = 409 bits (1050), Expect = e-111, Method: Compositional matrix adjust.
Identities = 259/735 (35%), Positives = 379/735 (51%), Gaps = 88/735 (11%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
+R L+ +++L+EK+ ++S A A+PRLGI Y WW+E +HGV G A
Sbjct: 13 ERARALVAQMTLKEKISQMLSWAPAIPRLGIPAYNWWNEGIHGVGRAGT----------A 62
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRD 162
T FPQ I A+SF+ L +G V EAR YN + GLT W+PNVNIFRD
Sbjct: 63 TVFPQAIGLAASFDEDLLGQVGEAVGVEARGKYNMYRSYQDRDIYKGLTIWAPNVNIFRD 122
Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFH 222
PRWGRG ET GEDP L+ + +V G+QG D D L+ AA KHF + R +
Sbjct: 123 PRWGRGHETYGEDPYLTSRLGVRFVEGMQGDDPDYLRAAACAKHFAVHSGPE---DQRHY 179
Query: 223 FNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLN 282
F+AKVS+QD+ +T+ FR V E V +VM +YN+ NG P C +L +RG+W
Sbjct: 180 FDAKVSQQDLWETYLPAFRALVKEAGVEAVMGAYNRTNGEPCCGSKTLLVDILRGKWNFQ 239
Query: 283 GYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEID 342
G++ SDC ++ +++ TS P ++ A A+ G DL+CG + E AV G + E
Sbjct: 240 GHVTSDCWAIKDFHEGHMVTSGPVDSVALAVNNGCDLNCGDLYA-YLEEAVAEGKVKEET 298
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGP 402
I+ +LV T +M+LGMFD E PY +G V + + Q L LE A + +VLLKN+
Sbjct: 299 IDRSLVRLFTTRMKLGMFDAEEKV-PYNKIGYDAVDSREMQALNLEVAEKILVLLKNENH 357
Query: 403 SLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY----ARTIHQQGCK 458
+LPL + VAV+GPN+D ++GNY G A Y T L GI Y + + +GC
Sbjct: 358 TLPLDKSKLHRVAVVGPNADNRKALVGNYEGTASRYVTVLDGIQEYLGEDVQVRYSEGCH 417
Query: 459 DVA------CADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRAGLLL 503
A ++L + D I +GLD +E E + D+ L L
Sbjct: 418 LYADKIQGLAKSNELISEVRGVCAECDVVICCLGLDAGLEGEEGDQGNQFASGDKQSLSL 477
Query: 504 PGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADI 563
PG QE V K + S P ++V++SG + + A+ AA++ A YPG GG A+A
Sbjct: 478 PG-NQESVLKACIESGKPVVVVVLSGSALALGTAQEGA--AAVLQAWYPGAQGGRAVARA 534
Query: 564 LFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGM 622
LFG NP GKLP+T+Y + +LP T+ AM+ GRTYR+ + +YPFG+G+
Sbjct: 535 LFGECNPQGKLPVTFYHSD--EDLPAFTDYAMK--------GRTYRYMEKEPLYPFGYGL 584
Query: 623 SYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSK 682
SY++F A A P DG + V+V V N G
Sbjct: 585 SYSHFTFRDAKADAAQIGP-DG-------------------------VDVRVTVVNDGQY 618
Query: 683 DGAHTLLVF--STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTR 740
G T+ V+ + P P+ QL A KV + G ++ V +++ C + ++ + G
Sbjct: 619 RGRETVEVYVKAERPG---TPNAQLKALAKVDLMPGEEKCVTLHLPQCAF-ALCNEEGIS 674
Query: 741 RIPLGEHNIHIGGTK 755
+ GE+ + +GG++
Sbjct: 675 EVLPGEYTVWLGGSQ 689
>gi|358385386|gb|EHK22983.1| glycoside hydrolase family 3 protein [Trichoderma virens Gv29-8]
Length = 795
Score = 408 bits (1049), Expect = e-111, Method: Compositional matrix adjust.
Identities = 259/733 (35%), Positives = 398/733 (54%), Gaps = 29/733 (3%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C S +R LI +L+E + + VPRLG+ Y+ W+EALHG+ TK
Sbjct: 63 CDSSAGYAERAQALISLFTLEELILNTQNSGPGVPRLGLPNYQVWNEALHGLDRANFATK 122
Query: 103 FGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRD 162
GG F ATSFP I + ++ N TL I ++S +ARA N G GL ++PN+N FR
Sbjct: 123 -GGQFQWATSFPMPILSMAALNRTLIHQIADIISTQARAFSNSGRYGLDVYAPNINGFRS 181
Query: 163 PRWGRGQETPGEDP-VLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDR 220
P WGRGQETPGED VL+ Y Y+ G+QG D + LK+AA+ KHF YDL+NWN R
Sbjct: 182 PLWGRGQETPGEDANVLTSAYTYEYITGMQGGVDPENLKIAATAKHFAGYDLENWNNQSR 241
Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
F+A +++QD+ + + F K S MC+YN VNGVP+CA+ L+ +R W
Sbjct: 242 LGFDAIITQQDLSEYYTPQFLAASRYAKSHSFMCAYNSVNGVPSCANSFFLQTLLRESWG 301
Query: 281 LN--GYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLL 338
GY+ SDCD+V ++ + S AAA ++RAG D+DCG H + G +
Sbjct: 302 FPEWGYVSSDCDAVYNVWNPHDYASNQSSAAASSLRAGTDIDCGQTYPWHLNESFVAGEV 361
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
S +I ++ +RLG FD Y LG KDV D ++ EAA +GIVLLK
Sbjct: 362 SRGEIERSVTRLYANLVRLGYFD---KKNEYRSLGWKDVVKTDAWNISYEAAVEGIVLLK 418
Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCK 458
N G +LPLS + R++A+IGP ++ T M GNY G A +PL+ + ++ +
Sbjct: 419 NDG-TLPLSK-KVRSIALIGPWANATTQMQGNYFGAAPYLISPLEAAKKAGYQVNFELGT 476
Query: 459 DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMAS 518
+ A F AI A++++DA I G+D ++E E DR + PG Q +L+ ++S
Sbjct: 477 ETASTSTAGFAKAIAAAKKSDAIIFAGGIDNTVEQEGADRTDIAWPGNQLDLIKQLSELG 536
Query: 519 KGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTW 578
K P +++ M GG +D + K++ ++ +++W GYPGQ+GG A+ DIL G P G+L T
Sbjct: 537 K-PLVVLQMGGGQVDSSSLKSNKKVNSLVWGGYPGQSGGVALFDILSGKRAPAGRLVSTQ 595
Query: 579 YPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
YP +Y+ P +M +RP K PG+TY +Y G VY FG G+ YT F T++ +
Sbjct: 596 YPADYVHQFPQNDMNLRPD-GKSNPGQTYIWYTGKPVYQFGDGIFYTTFKETLSGSS--- 651
Query: 639 AVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF----STP 694
G ++++ ++ T+++ + L +++N G D ++ ++F +
Sbjct: 652 ----KGLKFNVSSVLAAPHPGYTYSEQTPV-LTFTANIENSGKTDSPYSAMLFVRTANAG 706
Query: 695 PAGHWAPHKQLVAFEKV-HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
PA + P+K LV F+++ + G ++ I I V L+ VD G R + G++ + +
Sbjct: 707 PAPY--PNKWLVGFDRLATIKPGHSSKLSIPIPVSA-LARVDSLGNRIVYPGKYELAL-N 762
Query: 754 TKHSVSLHAATLG 766
T S+ L +G
Sbjct: 763 TDESIKLEFELVG 775
>gi|288870210|ref|ZP_06113312.2| beta-glucosidase [Clostridium hathewayi DSM 13479]
gi|288868024|gb|EFD00323.1| beta-glucosidase [Clostridium hathewayi DSM 13479]
Length = 730
Score = 408 bits (1049), Expect = e-111, Method: Compositional matrix adjust.
Identities = 257/736 (34%), Positives = 389/736 (52%), Gaps = 89/736 (12%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
++ L+ +++L+EKV ++ A A+ RLGIK Y WW+E LHGV+ G A
Sbjct: 23 EKAEYLVKQMTLEEKVFQTMNQAPAIERLGIKAYNWWNEGLHGVARAGV----------A 72
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN-----GGT---AGLTYWSPNVNIFRD 162
T FPQ I A++F+ L E +G VS EARA Y+ G T GLT W+PN+NIFRD
Sbjct: 73 TIFPQAIGLAATFDEDLIETVGEAVSTEARAKYHMQQRYGDTDIYKGLTLWAPNINIFRD 132
Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--R 220
PRWGRG ET GEDP L+ + Y+RGLQGS LK AA KHF + +G + R
Sbjct: 133 PRWGRGHETYGEDPWLTSRLGIRYIRGLQGSHEKYLKTAACVKHFAVH-----SGPEELR 187
Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
F+A+VS++D+ +T+ F CV +G V +VM +YN+VNGVP C + +L+ +R EW
Sbjct: 188 HSFDAEVSEKDLRETYLPAFEACVKDGDVEAVMGAYNRVNGVPCCGNEYLLETILRKEWG 247
Query: 281 LNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSE 340
+G++VSDC ++ +++ T +P E+ + A+ G DL+CG + AV+ G + E
Sbjct: 248 FHGHVVSDCWAIKDFHEGHGVTDSPVESVSMAMNHGCDLNCGNLF-TYLIQAVKEGKVKE 306
Query: 341 IDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
++ A++ T +++LG PY + +V +P ++L AA + +VLLKN
Sbjct: 307 ERLDEAVIRLFTTRLKLGALGKMEEDDPYAGISYLEVDSPAMKKLNRSAAGKSVVLLKNT 366
Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGR----YARTIHQQG 456
LP+ R++T+ VIGPN+D ++GNY G A Y T L+GI AR ++ +G
Sbjct: 367 EGLLPIDTKRYKTIGVIGPNADSRRALVGNYEGTASEYVTVLEGIREAAEPEARVLYSEG 426
Query: 457 C-------KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRAG 500
C + +D+L R++D I MGLD ++E E D+
Sbjct: 427 CHLYKSNVSGLGARNDRL-SEVKGICRESDIVIACMGLDSTLEGEQGDTGNIYAGGDKPD 485
Query: 501 LLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
L+LPG QQ+++ + + S P +LVL++G + V +A D + AI+ A YPG GG +
Sbjct: 486 LMLPGLQQKIL-ETAYDSGKPVVLVLLAGSAMAVTWA--DEHLPAILTAWYPGAEGGRGV 542
Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFG 619
AD+LFGT NP G+LP+T+Y LP T +M GRTYRF K +YPFG
Sbjct: 543 ADVLFGTVNPEGRLPVTFYRT--TEELPDFTNYSME--------GRTYRFMKQKALYPFG 592
Query: 620 HGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNV 679
G+SYT F S ++V+ + V++ V N
Sbjct: 593 FGLSYTEF--------------------------SCSGLEVSERDSVDNGVEVKLCVANC 626
Query: 680 GSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGT 739
G + G T+ V+ H + QL A KV + G ++ + I++ + +V D +G
Sbjct: 627 GERWGRETIQVYVGRKEDH-DRNPQLKAAVKVGLEPGEEKTISIHLPADAF-AVYDENGK 684
Query: 740 RRIPLGEHNIHIGGTK 755
R I + I GG++
Sbjct: 685 RYIDACTYQIFAGGSQ 700
>gi|358365439|dbj|GAA82061.1| beta-xylosidase [Aspergillus kawachii IFO 4308]
Length = 788
Score = 408 bits (1048), Expect = e-111, Method: Compositional matrix adjust.
Identities = 251/695 (36%), Positives = 369/695 (53%), Gaps = 32/695 (4%)
Query: 56 LIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQ 115
LI +L E + + V RLG+ Y+ WSEALHG+ G + ATSFPQ
Sbjct: 66 LISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHGLDRANFSDS--GSYNWATSFPQ 123
Query: 116 VITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGED 175
I T ++ N TL I ++S + RA N G GL ++PN+N FR P WGRGQETPGED
Sbjct: 124 PILTTAALNRTLIHQIASIISTQGRAFNNAGRYGLDVYAPNINTFRHPVWGRGQETPGED 183
Query: 176 PVLSGKYAASYVRGLQGSDGD-RLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIED 234
L+ YA Y+ G+QG D D LK+AA+ KH+ YD++NW+ R + +++QD+ +
Sbjct: 184 VSLAAVYAYEYITGIQGPDPDSNLKLAATAKHYAGYDIENWHNHSRLGNDMNITQQDLSE 243
Query: 235 TFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRL--NGYIVSDCDSV 292
+ F + + KV SVMC+YN VNGVP CAD L+ +R + +GY+ SDCD+
Sbjct: 244 YYTPQFHVAARDAKVHSVMCAYNAVNGVPACADSYFLQTLLRDTFGFVDHGYVSSDCDAA 303
Query: 293 GVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLT 352
Y+ + S+ AAA+AI AG D+DCG H ++ G LS DI ++ T
Sbjct: 304 YNIYNPHGYASSQAAAAAEAILAGTDIDCGTTYQWHLNESITAGDLSRDDIEKGVIRLYT 363
Query: 353 VQMRLGMFDGEPS--SQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIR 410
++ G FD + + PY L DV D ++ +AA QGIVLLKN LPL+
Sbjct: 364 TLVQAGYFDSNTTKANNPYRDLTWSDVLETDAWNISYQAATQGIVLLKNSNNVLPLTEKA 423
Query: 411 H----RTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQ 466
+ TVA+IGP ++ T ++GNY G A +P ++ ++
Sbjct: 424 YPPSNTTVALIGPWANATTQLLGNYYGNAPYMISPRAAFEEAGYKVNFAEGTGISSTSTS 483
Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKV-SMASKGPTILV 525
F AA+ A+R AD I G+D ++EAEALDR + PG Q +L+ K+ S A P I++
Sbjct: 484 GFAAALSAARSADVIIYAGGIDNTLEAEALDRESIAWPGNQLDLIQKLASSAGSKPLIVL 543
Query: 526 LMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYIT 585
M GG +D + KN+ + A++W GYPGQ+GG A+ DI+ G NP G+L T YP Y
Sbjct: 544 QMGGGQVDSSSLKNNTNVTALLWGGYPGQSGGFALRDIITGKKNPAGRLVTTQYPASYAE 603
Query: 586 NLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGR 645
P T+M +RP PG+TY++Y G VY FGHG+ YT F + +N T V L+ +
Sbjct: 604 EFPATDMNLRPEGDN--PGQTYKWYTGEAVYEFGHGLFYTTFAESSSNT-TTKEVKLNIQ 660
Query: 646 HGSINATISGKAIKVTHAKCNRLT----LGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAP 701
+ TH + +T L ++KN G + +T +VF+ AP
Sbjct: 661 ----------DILSQTHEELASITQLPVLNFTANIKNTGKLESDYTAMVFANTSDAGPAP 710
Query: 702 H--KQLVAFEKV-HVPAGAQQRVGINIHVCKYLSV 733
+ K LV ++++ V G + + + + V + V
Sbjct: 711 YPVKWLVGWDRLGDVKVGETRELRVPVEVGSFARV 745
>gi|366163035|ref|ZP_09462790.1| glycoside hydrolase family 3 [Acetivibrio cellulolyticus CD2]
Length = 705
Score = 408 bits (1048), Expect = e-111, Method: Compositional matrix adjust.
Identities = 263/725 (36%), Positives = 392/725 (54%), Gaps = 101/725 (13%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
++ +L+ +++L+EK L + A+ RLGI Y WW+EALHGV+ G A
Sbjct: 9 KKAEELVAQMTLEEKASQLTYNSPAIERLGIPAYNWWNEALHGVARAGT----------A 58
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRD 162
T FPQ I A+ F+ I ++ EARA YN + GLT WSPN+NIFRD
Sbjct: 59 TVFPQAIGLAAMFDDEFLMKIANAIAIEARAKYNESSKHGDRDIYKGLTIWSPNINIFRD 118
Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAY----DLDNWNGV 218
PRWGRG ET GEDP LSGK ++++GLQG D D + AA KHF AY DL
Sbjct: 119 PRWGRGHETYGEDPFLSGKLGVAFIKGLQG-DKDVMMTAACVKHFAAYSGPEDL------ 171
Query: 219 DRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGE 278
R FNA+V+K+D+ +T+ F CV + KV +VM YN+ NG P C +L+ +R +
Sbjct: 172 -RHGFNAEVTKKDLWETYLPAFETCVKDAKVEAVMGGYNRTNGEPCCGSYTLLRDILREK 230
Query: 279 WRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLL 338
W G++VSDC ++ ++ T TPEE+ A AI AG DL+CG L A+Q GL+
Sbjct: 231 WGFEGHVVSDCWAIKDFHTDHMVTKTPEESVALAIDAGCDLNCGNMY-LMLLIALQEGLI 289
Query: 339 SEIDINNALVNTLTVQMRLGMFDG-EPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLL 397
+E I A V T + +LG+F+G E + PY + V +H+E+A+EAAR+ VLL
Sbjct: 290 TEEHITRAAVRIFTTRFKLGLFEGSEFDNIPY-----EVVECSEHKEMAIEAARKSAVLL 344
Query: 398 KNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIH 453
KN G LP++ +T+ VIGPN++ + + GNY G + Y T L+GI G R ++
Sbjct: 345 KNDG-ILPINKGAIKTIGVIGPNANSRIALKGNYHGTSSRYITLLEGIQDEVGDEVRVLY 403
Query: 454 QQGCKDV-------ACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALD 497
GC+ V A A+D+L A+ + +D +L +GLD++IE E + D
Sbjct: 404 SNGCELVKDRTEVLAYANDRL-AEAVTVAEHSDLVVLCLGLDETIEGEQSDEGNNGGSGD 462
Query: 498 RAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGG 557
+ L LP Q+ L+ K+ +A+ PT+L LM+G I++++A + W YPG GG
Sbjct: 463 KKDLDLPEVQKSLLEKI-VATGKPTVLCLMAGSAINLSYAHEHCNGILLTW--YPGARGG 519
Query: 558 TAIADILFGTSNPGGKLPMTWYPQEYITNL-PMTEMAMRPSQSKRYPGRTYRFYKGPVVY 616
A+ADILFG ++P GKLP+T+Y + NL P+T+ +M+ RTYR+ + +Y
Sbjct: 520 KAVADILFGNASPSGKLPVTFYRS--LDNLPPITDYSMK--------NRTYRYIEEAPLY 569
Query: 617 PFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATIS-GKAIKVTHAKCNRLTLGVQVD 675
PFG+G++Y + ++ +H I T+ K I +T NR ++ V+ +
Sbjct: 570 PFGYGLTYGD---------------VELKHVEIKGTVEIEKDIYITVTLQNRGSVAVE-E 613
Query: 676 VKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD 735
V KD V +T L AF +V + A +++V + I L VV+
Sbjct: 614 VVQAYIKDEQSMYAVTNT----------SLCAFMRVGLGANEEKQVSMRIPF-DSLKVVN 662
Query: 736 RSGTR 740
G +
Sbjct: 663 LDGEK 667
>gi|330947691|ref|XP_003306937.1| hypothetical protein PTT_20252 [Pyrenophora teres f. teres 0-1]
gi|311315273|gb|EFQ84970.1| hypothetical protein PTT_20252 [Pyrenophora teres f. teres 0-1]
Length = 756
Score = 407 bits (1047), Expect = e-111, Method: Compositional matrix adjust.
Identities = 256/725 (35%), Positives = 382/725 (52%), Gaps = 28/725 (3%)
Query: 31 DPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEA 90
D K+ ++ C V+ +R L+ + QEK++ L+S + V RLG+ Y WW EA
Sbjct: 23 DCKNGPLKSNAICDVTASPAKRAAALVAAMQTQEKLENLVSKSKGVARLGLPAYNWWGEA 82
Query: 91 LHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGL 150
LHGV+ PG F G + ATSFP + +++F+ L I V+ +EARA NGG A +
Sbjct: 83 LHGVAG-APGINFTGSYRTATSFPMPLLMSAAFDDDLIHQIAIVIGNEARAFGNGGIAPV 141
Query: 151 TYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAY 210
+W+P++N FRDPRWGRG ETPGED + Y S + GL+G R K+ A+CKH+ Y
Sbjct: 142 DFWTPDINPFRDPRWGRGSETPGEDILRIKGYTKSLLSGLEGDKAQR-KIIATCKHYVGY 200
Query: 211 DLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNI 270
D++NWNG DR HF+AK++ QD+ + F PF+ C + KV S MCSYN VNGVPTCAD +
Sbjct: 201 DVENWNGTDRHHFDAKITTQDLAEYFMPPFQQCARDSKVGSFMCSYNAVNGVPTCADTYV 260
Query: 271 LKRTIRGEWRL---NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGL 327
L+ +R W N YI SDC++V + +T +EA A A G+DL C
Sbjct: 261 LEDILRKHWNWTDSNNYITSDCEAVKDISLRHKYVATLQEATAIAFNNGMDLSCEYSGTS 320
Query: 328 HTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELAL 387
A +GLL+ I+ AL + G FDG ++ Y HLG +D+ TP+ Q+L L
Sbjct: 321 DIPGAFSQGLLNVSVIDRALTRQYEGLVHAGYFDGAAAT--YAHLGVQDINTPEAQKLVL 378
Query: 388 EAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGR 447
+ A +G+ LLKN +LPLS VA++G ++ T + G Y+G A TP+ +
Sbjct: 379 QVAAEGLTLLKNDD-TLPLSLKSGSKVAMVGFWANTTSKLSGIYSGPAPYLHTPVYAGNK 437
Query: 448 YA--RTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPG 505
+ + A D A++A++++D + GLD S AE DR + P
Sbjct: 438 LGLDMAVATGPILQTSGAADNWTTTALNAAKKSDFILYFGGLDPSAAAEGSDRTDISWPS 497
Query: 506 RQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILF 565
Q +L++K +A+ G ++V+ G +D + ++IWA +PGQ GGTA+ ++
Sbjct: 498 AQIDLITK--LAALGKPLVVIALGDMVDHTPILKMKGVNSLIWANWPGQDGGTAVMQVIT 555
Query: 566 GTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYT 625
G G+LP+T YP EY T L M +M MRP + PGRTYR+Y V PFG G+ YT
Sbjct: 556 GEHAIAGRLPITQYPAEY-TQLSMLDMNMRPGGNN--PGRTYRWYN-ESVQPFGFGLHYT 611
Query: 626 NFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGA 685
F ++ + ++N K+ H + ++V V N G++
Sbjct: 612 KFAAKFGSSSGL----------TVNIQDIMKSCTKDHPDLCDVP-PIEVAVTNEGNRTSD 660
Query: 686 HTLLVFSTPPAG-HWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPL 744
L F G P K LV++ ++ +G+Q ++ LS VD+SG
Sbjct: 661 FIALAFIKGEVGPKPYPLKTLVSYARLRDISGSQTKMASLALTLGALSRVDQSGNLVAYP 720
Query: 745 GEHNI 749
GE+ +
Sbjct: 721 GEYTL 725
>gi|359409694|ref|ZP_09202159.1| Beta-glucosidase [Clostridium sp. DL-VIII]
gi|357168578|gb|EHI96752.1| Beta-glucosidase [Clostridium sp. DL-VIII]
Length = 723
Score = 407 bits (1045), Expect = e-110, Method: Compositional matrix adjust.
Identities = 265/733 (36%), Positives = 386/733 (52%), Gaps = 99/733 (13%)
Query: 55 DLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFP 114
+L+ +++LQEK + L + AV RL I Y WW+E LHGV+ G AT FP
Sbjct: 31 ELVAKMTLQEKAEQLTYNSPAVKRLNIPEYNWWNEGLHGVARAGT----------ATVFP 80
Query: 115 QVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRDPRWG 166
Q I A+ F+ + +++ E RA YN + GLTYWSPNVNIFRDPRWG
Sbjct: 81 QAIGLAAMFDEEFLGKVAGIIATEGRAKYNENSKKEDRDIYKGLTYWSPNVNIFRDPRWG 140
Query: 167 RGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--RFHFN 224
RG ET GEDP L+ + ++V+GLQG DG LK++A KHF + +G + R FN
Sbjct: 141 RGHETYGEDPYLTSRLGVAFVKGLQG-DGKYLKLSACAKHFAVH-----SGPESLRHEFN 194
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
A VS++D+ +T+ F CV E V SVM +YN+ NG P C +LK +RG+W G+
Sbjct: 195 AVVSQKDLHETYLPAFEACVKEANVESVMGAYNRTNGEPCCGSKALLKDILRGKWGFKGH 254
Query: 285 IVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDIN 344
+VSDC ++ ++ TST E+ A AI G DL+CG L+ A + GL++E I
Sbjct: 255 VVSDCWALADFHMHHKVTSTATESVALAIENGCDLNCGNMY-LNLLLAYKEGLVTEEQIT 313
Query: 345 NALVNTLTVQMRLGMFDG--EPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGP 402
A +T + +LGMFD E + PY + C +H +++LEA+R+ +VLLKN G
Sbjct: 314 TAAERLMTTRFKLGMFDEDCEYNQIPY----EVNDC-KEHNQVSLEASRKSMVLLKNNGI 368
Query: 403 SLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY----ARTIHQQGC- 457
LPL + + VAVIGPN++ + + GNY+G A YTT L GI R + +GC
Sbjct: 369 -LPLDKSKLKAVAVIGPNANSEIMLKGNYSGTASKYTTILDGIHDVLDDDVRVYYSEGCH 427
Query: 458 ------KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRAGLL 502
+D+A DD+L A+ + +AD IL +GLD +IE E A D+ L
Sbjct: 428 LYKEKVEDLARRDDRL-AEAVSVAERADVVILCLGLDSTIEGEQGDAGNGYGAGDKLDLN 486
Query: 503 LPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIAD 562
LPG QQEL+ KV K P ++VL +G + + A + R AAI+ A YPG GGTA AD
Sbjct: 487 LPGIQQELLEKVLETGK-PVVVVLGTGSGLTLNGA--EERCAAILNAWYPGSHGGTAAAD 543
Query: 563 ILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFY-KGPVVYPFGH 620
ILFG +P GKLP+T+Y LP T+ AM+ GRTYR+ + +YPFG+
Sbjct: 544 ILFGKCSPSGKLPVTFYKD--TDKLPEFTDYAMK--------GRTYRYMDESNCLYPFGY 593
Query: 621 GMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVG 680
G++Y+ + P V DG + + V+++N G
Sbjct: 594 GLTYSTVELSNLQVPAVRG-EFDG-------------------------IDISVEIENTG 627
Query: 681 SKDGAHTLLVFSTPPAGHWAP-HKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGT 739
S D + + +A + L F++V + G + V + ++ + VD +G
Sbjct: 628 SYDIEEVVQCYIKDLESKYAVLNHSLAGFKRVSLKKGESKTVTMKLNR-RAFEAVDDAGE 686
Query: 740 RRIPLGEHNIHIG 752
R + + + +G
Sbjct: 687 RILDSKKFKLFVG 699
>gi|330836687|ref|YP_004411328.1| Beta-glucosidase [Sphaerochaeta coccoides DSM 17374]
gi|329748590|gb|AEC01946.1| Beta-glucosidase [Sphaerochaeta coccoides DSM 17374]
Length = 709
Score = 407 bits (1045), Expect = e-110, Method: Compositional matrix adjust.
Identities = 235/600 (39%), Positives = 339/600 (56%), Gaps = 62/600 (10%)
Query: 56 LIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQ 115
++ R++L EK+ + A+A+PRL I Y WW+EALHGV+ G AT FPQ
Sbjct: 17 IVSRMTLDEKISQIDYRASAIPRLDIPEYNWWNEALHGVARAGI----------ATVFPQ 66
Query: 116 VITTASSFNATLWEAIGRVVSDEARAMYNGGT--------AGLTYWSPNVNIFRDPRWGR 167
I A+ F++ + E IG V+S E RA YN GLT+WSPNVNIFRDPRWGR
Sbjct: 67 AIGLAAMFDSDMMERIGAVISTEGRAKYNEAVRHGDRDIYKGLTFWSPNVNIFRDPRWGR 126
Query: 168 GQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--RFHFNA 225
GQET GEDP L+ + A +++RG+QG DG LK AA KHF + +G + R F+A
Sbjct: 127 GQETYGEDPYLTARLAVAFIRGIQG-DGKYLKAAACAKHFAVH-----SGPEALRHEFDA 180
Query: 226 KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYI 285
+VS++D+ +T+ F+ V E +V VM +YN+VNGVP CA +L +R EW G++
Sbjct: 181 RVSQKDLHETYLSAFKAAVKEAQVEIVMGAYNRVNGVPACASHELLSDILRSEWGFEGHV 240
Query: 286 VSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINN 345
VSD +++ + H+ + A A++AG +L C + H S+V GL+SE +I
Sbjct: 241 VSDYEALEDIFKHHHYVADEAHTMAVALKAGCNL-CAGKIARHLRSSVDEGLISEDEITE 299
Query: 346 ALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLP 405
A+ T ++ +GM PY +G ++ TP+H +LA+EAA + VLLKN G LP
Sbjct: 300 AVERLFTTRIMMGMM---ADDCPYDSIGYEENDTPEHHQLAVEAASRSFVLLKNDGL-LP 355
Query: 406 LSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGC---- 457
L + ++AVIGPN++ + GNY G A Y T L+GI G R + +GC
Sbjct: 356 LEMEKISSIAVIGPNANSRKMLEGNYNGTASRYVTVLEGIQDLVGDSVRVWYSEGCHLYK 415
Query: 458 ---KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRAGLLLPG 505
++ +D+L A+ A++ AD +L +GLD ++E E + D+ L LPG
Sbjct: 416 NFHSSLSGRNDRL-AEAVSAAQHADVVVLCLGLDATLEGEEGDVEVGFGSGDKPNLSLPG 474
Query: 506 RQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILF 565
RQQ L+ + K P IL+L SG + + +ND + AI+ YPG GG A+AD+LF
Sbjct: 475 RQQLLLDTMLTVGK-PVILLLASGSALTLGGRENDENLKAILQIWYPGAMGGKAVADVLF 533
Query: 566 GTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYT 625
G P GKLP+T+Y LP E GRTYR+ KG +YPFG+G++Y+
Sbjct: 534 GRRAPAGKLPVTFYAS--ADELPAFE-------DYSMAGRTYRYMKGNALYPFGYGLTYS 584
>gi|329745495|gb|AEB98984.1| xylosidase precursor [synthetic construct]
Length = 804
Score = 406 bits (1044), Expect = e-110, Method: Compositional matrix adjust.
Identities = 254/720 (35%), Positives = 376/720 (52%), Gaps = 32/720 (4%)
Query: 31 DPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEA 90
D ++ R+ C S R LI +L E + + V RLG+ Y+ WSEA
Sbjct: 57 DCQNGPLRSHLICDESATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPVYQVWSEA 116
Query: 91 LHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGL 150
LHG+ G + ATSFPQ I T ++ N TL I ++S + RA N G GL
Sbjct: 117 LHGLDRANFSDS--GSYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGL 174
Query: 151 TYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD-RLKVAASCKHFTA 209
++PN+N FR P GRGQETPGED L+ YA Y+ G+QG D D LK+AA+ KH+
Sbjct: 175 DVYAPNINTFRHPVRGRGQETPGEDVSLAAVYAYEYITGIQGPDPDSNLKLAATAKHYAG 234
Query: 210 YDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
YD++NW+ R + +++QD+ + + F + + KV SVMC+YN VNGVP CAD
Sbjct: 235 YDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVHSVMCAYNAVNGVPACADSY 294
Query: 270 ILKRTIRGEWRL--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGL 327
L+ +R + +GY+ SDCD+ Y+ + S+ AAA+AI AG D+DCG
Sbjct: 295 FLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYASSQAAAAAEAILAGTDIDCGTTYQW 354
Query: 328 HTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPS--SQPYGHLGPKDVCTPDHQEL 385
H ++ G LS DI ++ T ++ G FD + + PY L DV D +
Sbjct: 355 HLNESITAGDLSRDDIEKGVIRLYTTLVQAGYFDSNTTKANNPYRDLTWSDVLETDAWNI 414
Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRH----RTVAVIGPNSDVTVTMIGNYAGIACGYTTP 441
+ +AA QGIVLLKN LPL+ + TVA+IGP ++ T ++GNY G A +P
Sbjct: 415 SYQAATQGIVLLKNSNKVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYMISP 474
Query: 442 LQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGL 501
++ ++ + F AA+ A++ AD I G+D ++EAEALDR +
Sbjct: 475 RVAFEEAGYNVNFAERTGISSTNTSGFAAALSAAQSADVIIYAGGIDNTLEAEALDRESI 534
Query: 502 LLPGRQQELVSKV-SMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
PG Q +L+ K+ S A P I++ M GG +D + KN+ ++A++W GYPGQ+GG A+
Sbjct: 535 AWPGNQLDLIQKLASSAGSKPLIVLQMGGGQVDSSSLKNNTNVSALLWGGYPGQSGGFAL 594
Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGH 620
DI+ G NP G+L T YP Y P T+M +RP PG+TY++Y G VY FGH
Sbjct: 595 RDIITGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGDN--PGQTYKWYTGEAVYEFGH 652
Query: 621 GMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT----LGVQVDV 676
G+ YT F + +N T R +N + TH +T L ++
Sbjct: 653 GLFYTTFAESSSNTTT--------REIKLNIQ---DILSQTHEDLASITQLPVLNFTANI 701
Query: 677 KNVGSKDGAHTLLVFSTPPAGHWAPH--KQLVAFEKV-HVPAGAQQRVGINIHVCKYLSV 733
KN G + +T +VF+ AP+ K LV ++++ V G + + + + V + V
Sbjct: 702 KNTGKVESDYTAMVFANTSDAGPAPYPVKWLVGWDRLGEVKVGETRELRVPVEVGSFARV 761
>gi|336425135|ref|ZP_08605165.1| hypothetical protein HMPREF0994_01171 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336013044|gb|EGN42933.1| hypothetical protein HMPREF0994_01171 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 705
Score = 406 bits (1044), Expect = e-110, Method: Compositional matrix adjust.
Identities = 258/745 (34%), Positives = 392/745 (52%), Gaps = 96/745 (12%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
++ ++L+ +++L+EK L A A+PRLG+ Y WW+EALHGV+ G A
Sbjct: 9 EKAHELVSQMTLEEKASQLRYDAPAIPRLGVPTYNWWNEALHGVARAGV----------A 58
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT--------AGLTYWSPNVNIFRD 162
TSFPQ I A++F+ L + +G V+ E RA YN + GLT+WSPNVNIFRD
Sbjct: 59 TSFPQAIAMAAAFDDELLKTVGDAVAAEGRAKYNEYSRHDDRDIYKGLTFWSPNVNIFRD 118
Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVD-- 219
PRWGRG ET GEDP L+ + +YV GLQGS D D +K AA KHF + +G +
Sbjct: 119 PRWGRGHETYGEDPYLTSRLGVAYVEGLQGSQDDDFMKTAACAKHFAVH-----SGPESV 173
Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
R F+A+ SK+D+ +T+ F CV E V +VM +YN+ NG P C P +++ +R EW
Sbjct: 174 RHEFDAQASKKDMYETYLPAFEACVKEAGVEAVMGAYNRTNGEPCCGSPTLIQNILREEW 233
Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLS 339
G+ VSDC ++ ++ T TPEE+AA A+++G D++CG LH A Q+GL++
Sbjct: 234 DFQGHYVSDCWAIADFHMHHMVTKTPEESAALALKSGCDVNCG-VTYLHLLKAYQQGLVT 292
Query: 340 EIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKN 399
E +I A T + LG FD Y + + V +H ELA + A++ +VLLKN
Sbjct: 293 EEEITQAAERLFTTRFLLGCFD----KNEYDDIPYEVVECKEHLELAQKMAKESMVLLKN 348
Query: 400 QGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQ 455
G LPL+ +T+ VIGPN+D ++GNY G + Y T L+GI G R + +
Sbjct: 349 DG-ILPLNKDGLKTIGVIGPNADSRTPLVGNYHGTSSRYITLLEGIQDFVGEDVRVYYSE 407
Query: 456 GC---KD----VACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRA 499
GC KD + D++ A+ + +D +L +GLD+++E E + D+
Sbjct: 408 GCHIYKDRVEGLGWKQDRI-SEALTVAEHSDVVVLCLGLDENLEGEEGDTGNSYASGDKK 466
Query: 500 GLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTA 559
L LP Q+EL+ V+ K P +L +MSG ID+ FA + AI+ YPG GG A
Sbjct: 467 DLELPESQRELLEAVAGCGK-PVVLCMMSGSAIDMQFAAE--HVNAILQVWYPGARGGKA 523
Query: 560 IADILFGTSNPGGKLPMTWYPQEYITNLPMTE-MAMRPSQSKRYPGRTYRFYKGPVVYPF 618
A+ILFG +P GKLP+T+Y + P E +M+ GRTYR+ + +YPF
Sbjct: 524 AAEILFGACSPSGKLPVTFYKD--LEGFPAFEDYSMK--------GRTYRYLEKEPLYPF 573
Query: 619 GHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKN 678
G+G++Y A + +G+ +I A + T +QV +K+
Sbjct: 574 GYGLTYG---QVCVKAAELTGAVEEGKELTIKAMVENSGKYDTDDV-------IQVYIKD 623
Query: 679 VGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSG 738
+ SK+ P+ L AF++V + G + + + + + L VD G
Sbjct: 624 LDSKNA---------------VPNHSLCAFKRVSLKKGEKAEILLKVPY-EALMAVDEEG 667
Query: 739 TRRIPLGEHNIHIGGTK---HSVSL 760
+ + + +G ++ S+SL
Sbjct: 668 KKYVDSSHFVLSVGTSQPDDRSISL 692
>gi|358393086|gb|EHK42487.1| glycoside hydrolase family 3 protein [Trichoderma atroviride IMI
206040]
Length = 794
Score = 405 bits (1041), Expect = e-110, Method: Compositional matrix adjust.
Identities = 255/732 (34%), Positives = 389/732 (53%), Gaps = 28/732 (3%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C + +R LI +L+E + + VPRLG+ Y+ W+EALHG+ TK
Sbjct: 64 CDSTAGYVERAQALISLFTLEELILNTQNSGPGVPRLGLPNYQVWNEALHGLDRANFATK 123
Query: 103 FGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRD 162
GG+F TSFP I + ++ N TL I ++S +ARA N G GL ++PN+N FR
Sbjct: 124 -GGEFEWGTSFPMPILSMAALNRTLIHQIADIISTQARAFSNNGRYGLDVYAPNINGFRS 182
Query: 163 PRWGRGQETPGEDP-VLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDR 220
P WGRGQETPGED VL+ Y Y+ G+QG D + LK+AA+ KHF YDL+N+N R
Sbjct: 183 PLWGRGQETPGEDANVLTSAYTYEYITGMQGGVDPENLKIAATAKHFAGYDLENYNNQSR 242
Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
F+A +++QD+ + + F K S MC+YN VNGVP+C++ L+ +R W
Sbjct: 243 LGFDAIITQQDLSEYYTPQFLAASRYAKSHSFMCAYNSVNGVPSCSNSFFLQTLLRESWG 302
Query: 281 L--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLL 338
GY+ SDCD++ ++ ++ ++ AAAD+++AG D+DCG H + G +
Sbjct: 303 FPEYGYVSSDCDAIYNVWNPHNYANSQSSAAADSLKAGTDIDCGQTYPWHLNESFVAGTV 362
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
S +I ++ +RLG FD Y LG KDV D ++ EAA +GIVLLK
Sbjct: 363 SRGEIERSVTRLYANLVRLGYFD---KKNEYRSLGWKDVVKTDAWNISYEAAVEGIVLLK 419
Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCK 458
N G +LPLS + R++A+IGP + T + GNY G A +PLQ + ++ +
Sbjct: 420 NDG-TLPLSK-KVRSIALIGPWVNATEQLQGNYFGTAPYLISPLQAAKKAGYEVNYELGT 477
Query: 459 DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMAS 518
+ F AI A++++DA I + G+D +IE E DR + PG Q +L+ ++S
Sbjct: 478 GINNQTTAGFAKAIAAAKKSDAIIFIGGIDNTIEQEGADRTDIAWPGNQLDLIKQLSEVG 537
Query: 519 KGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTW 578
K P +++ M GG +D + K++ ++ +++W GYPGQ+GG A+ DIL G P G+L T
Sbjct: 538 K-PLVVLQMGGGQVDSSSIKSNKKVNSLVWGGYPGQSGGYALFDILSGKRAPAGRLVSTQ 596
Query: 579 YPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
YP EY+ +M +RP K+ PG+TY +Y G VY FG G+ YT F T+ T+
Sbjct: 597 YPAEYVHQFAQNDMNLRPD-GKKNPGQTYIWYTGKPVYQFGDGLFYTTFKETLGKQSTL- 654
Query: 639 AVPLDGRHGSINAT-ISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPA 696
NA+ I G +++N G ++ + F T A
Sbjct: 655 ---------KFNASQILGAGHPGYTYSEQTPVFTFTANIQNSGKTASPYSAMAFVRTSNA 705
Query: 697 G-HWAPHKQLVAFEKV-HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGT 754
G P+K LV F+++ + G + I I + LS VD +G + + G++ + + T
Sbjct: 706 GPKPYPNKWLVGFDRLATIKPGHSSTLSIPIPLNA-LSRVDSNGNKIVYPGKYEL-VLNT 763
Query: 755 KHSVSLHAATLG 766
SV L +G
Sbjct: 764 DESVKLEFELVG 775
>gi|410098444|ref|ZP_11293422.1| hypothetical protein HMPREF1076_02600 [Parabacteroides goldsteinii
CL02T12C30]
gi|409222318|gb|EKN15263.1| hypothetical protein HMPREF1076_02600 [Parabacteroides goldsteinii
CL02T12C30]
Length = 738
Score = 404 bits (1037), Expect = e-109, Method: Compositional matrix adjust.
Identities = 262/750 (34%), Positives = 383/750 (51%), Gaps = 86/750 (11%)
Query: 35 ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
A T PF LP+ RV D+I RL+L+EKV+L+ A AVPRLGI Y WW+EALHGV
Sbjct: 21 AQTYDYPFRNPDLPLDVRVQDIISRLTLEEKVQLMKHAAPAVPRLGIPAYNWWNEALHGV 80
Query: 95 SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN-----GGTA- 148
+ T FPQ I A++F+ + +G + S E RA++N G T
Sbjct: 81 ARTKEK---------VTVFPQAIGMAATFDTEALQKMGDMTSSEGRALFNEDLKAGKTGE 131
Query: 149 ---GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCK 205
GLTYW+PN+NIFRDPRWGRGQET GEDP L+ K ++ V GL+G++ + LK A K
Sbjct: 132 IYRGLTYWTPNINIFRDPRWGRGQETYGEDPYLTAKMGSAIVHGLEGNNPEYLKSVACAK 191
Query: 206 HFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTC 265
H+ + N R ++A+VS D+ DT+ FR V + KV VMC+YN+ G P C
Sbjct: 192 HYAVHSGPEHN---RHSYDARVSMYDLWDTYLPAFRELVTKAKVHGVMCAYNRFEGTPCC 248
Query: 266 ADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFL 325
+L+ +R +W+ +GY+ SDC +V + S EA ADA+ G DL+CG L
Sbjct: 249 GHNELLQDILRNQWKFDGYVTSDCWAVSDFAKYHKTHSNDTEAVADAVLNGTDLECGN-L 307
Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQE 384
+ V++GL+SE DIN +L +Q +LGM+D P+ + PY +G + + H++
Sbjct: 308 YQKLQQGVEKGLISEKDINVSLARLFEIQFKLGMYD--PADRVPYASIGREVIECDAHKK 365
Query: 385 LALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG 444
A E A++ +VLLKN LPL+ + + +A+IGPN D T++ NY G TP +
Sbjct: 366 HAYEMAQKSMVLLKNNKNILPLNASKIKRIALIGPNMDNGSTLLANYFGTPSEIITPYKS 425
Query: 445 I-GRYARTIHQQGCKDVACADDQL----FGAAIDASRQADATILVMGLDQSIE------- 492
+ R+ +I V F +++AD I V G+ E
Sbjct: 426 LQKRFGNSIQIDTLTGVGIVQKLEGAPSFAQVAAQAKKADIIIFVGGISADYEGEAGDAG 485
Query: 493 ------AEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAI 546
+ DR + LP Q EL+ ++ + P ILV MSG + +F AI
Sbjct: 486 AAGYGGFASGDRTTMKLPPVQTELMKELKKTGR-PLILVNMSGSVM--SFDWESRNADAI 542
Query: 547 IWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRT 606
+ A Y GQA G AI D+LFG NP G++P+T Y M + + + RT
Sbjct: 543 LQAWYGGQAAGDAITDVLFGDYNPAGRMPLTTY---------MNDEDLPDFEDYSMANRT 593
Query: 607 YRFYKGPVVYPFGHGMSYTNFVHT-VANAPTVVAVPLDGRHGSINATISGKAIKVTHAKC 665
YR++KG V YPFG+G+SYT F + + NA TV +G++I+VT
Sbjct: 594 YRYFKGDVRYPFGYGLSYTTFGYAPLQNASTVK---------------TGESIQVT---- 634
Query: 666 NRLTLGVQVDVKNVGSKDGAHTL-LVFSTPPAGH-WAPHKQLVAFEKVHVPAGAQQRVGI 723
V N G + G + L S P G+ P + L F+++H+ G ++V
Sbjct: 635 --------TTVTNTGKRAGDEVVQLYISHPQNGNTRVPLRALKGFKRIHLDTGESRQVTF 686
Query: 724 NIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
+ + LS+VD G + G ++IGG
Sbjct: 687 TLS-PEELSLVDEKGNQVEKEGTVELYIGG 715
>gi|398406144|ref|XP_003854538.1| hypothetical protein MYCGRDRAFT_38178 [Zymoseptoria tritici IPO323]
gi|339474421|gb|EGP89514.1| hypothetical protein MYCGRDRAFT_38178 [Zymoseptoria tritici IPO323]
Length = 884
Score = 403 bits (1035), Expect = e-109, Method: Compositional matrix adjust.
Identities = 260/723 (35%), Positives = 380/723 (52%), Gaps = 37/723 (5%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSN---VGP 99
C SL R+ LI +++++EK L+ GA +PR+G+ YEWW+EALHGV+ V
Sbjct: 146 CDTSLSQDDRIAALISQMTVEEKATNLVDGALGLPRIGLPPYEWWNEALHGVAGSRGVSF 205
Query: 100 GTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNI 159
+ G DF ATSFP I ++F+ L + ++ EARA N +G +W+PN+N
Sbjct: 206 DSPNGSDFSYATSFPLPILMGAAFDDPLIYDVASIIGKEARAFANYAHSGYDFWTPNMNT 265
Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG--SDGDRLKVAASCKHFTAYDLDNWNG 217
F DPRWGRG E P ED + +Y AS V GLQG D ++ A+CKHF YD++
Sbjct: 266 FLDPRWGRGLEVPTEDSFHAQRYVASLVPGLQGGKEKTDHKQIIATCKHFAVYDVE---- 321
Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
+R N + + QD+ + + F+ CV + V S+MCSYN V GVP CA L+ +R
Sbjct: 322 TNRHAQNYEPTPQDLGEYYLPAFKTCVRDVNVGSIMCSYNAVYGVPACASEYFLQDVLRD 381
Query: 278 EWRLNG---YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAV 333
+W N Y+ SDC++V + +FT T AAA A+ AG D +CG +L L+T A
Sbjct: 382 QWNFNEPYHYVTSDCEAVKDIWTPHNFTDTEPAAAAVALNAGTDTNCGTSYLQLNTSVAN 441
Query: 334 QRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQG 393
+++DI +L +G FDG+P Y L DV TP Q A AA +G
Sbjct: 442 NWTTEAQMDI--SLTRLYNALFTVGYFDGQPE---YDGLSFADVSTPFAQATAYRAASEG 496
Query: 394 IVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIH 453
I LLKN G LPL + +VA+IGP ++ T M G Y GIA +PL I
Sbjct: 497 ITLLKNDG-LLPLKK-SYNSVALIGPWANATTQMQGIYQGIAPYLVSPLAAAQAQWGHIS 554
Query: 454 QQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSK 513
+ + F +A+ A+R AD I G+D SIE E+ DR + PG Q +LV +
Sbjct: 555 FTNGTAINSTNTTGFASALSAARDADVIIYAGGIDSSIEKESRDRTSISWPGNQLDLVQQ 614
Query: 514 VSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGK 573
+S K P ++V GG +D + + + +++WAGYPGQ GG+A+ D+L G +P G+
Sbjct: 615 LSELGK-PLVVVQFGGGQVDDSALLRNKNVNSLVWAGYPGQDGGSALIDVLVGKQSPAGR 673
Query: 574 LPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVAN 633
L +T YP +YI + + + +RPS S PGRTY++Y V PFG+G+ YT F A
Sbjct: 674 LTITQYPADYINQISLFDPNLRPSDSS--PGRTYKWYNKEPVLPFGYGLHYTTFEFDWAK 731
Query: 634 APTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVK--NVGSKDGAHTLLVF 691
AP I + + A T K N + ++ +K N GS + LVF
Sbjct: 732 APQASY--------DIASLVDSTASYTTSPKKNDASPWTELSIKVHNSGSLGSDYVGLVF 783
Query: 692 -STPPAGHWA-PHKQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHN 748
TP AG P+K L ++ ++H + AGA + ++ + L+ D G I G++
Sbjct: 784 LRTPNAGPAPYPNKWLASYARLHGLSAGASAELSFSLSLGA-LARGDEHGDLIIYPGDYE 842
Query: 749 IHI 751
+ I
Sbjct: 843 VQI 845
>gi|451821678|ref|YP_007457879.1| periplasmic beta-glucosidase BglX [Clostridium
saccharoperbutylacetonicum N1-4(HMT)]
gi|451787657|gb|AGF58625.1| periplasmic beta-glucosidase BglX [Clostridium
saccharoperbutylacetonicum N1-4(HMT)]
Length = 710
Score = 403 bits (1035), Expect = e-109, Method: Compositional matrix adjust.
Identities = 258/735 (35%), Positives = 384/735 (52%), Gaps = 95/735 (12%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
++ +L+ +++LQE+ + L A A+ L I Y WW+E LHGV+ G A
Sbjct: 14 EKAKELVSKMTLQERAEQLTYKAPAIKHLNISRYNWWNEGLHGVARAGT----------A 63
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRD 162
T FPQ I A+ F+ L E I +++ E RA YN + GLT+WSPNVNIFRD
Sbjct: 64 TVFPQAIGLAAIFDDELLEKIAGIIATEGRAKYNENSKKEDKDIYKGLTFWSPNVNIFRD 123
Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFH 222
PRWGRG ET GEDP L+ + ++V+GLQG D LK+AA KHF + G+ R
Sbjct: 124 PRWGRGHETYGEDPYLTSRLGVAFVKGLQG-DEKYLKIAACAKHFAVHS--GPEGL-RHE 179
Query: 223 FNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLN 282
FNA VSK+D+ +T+ F CV E V +VM +YN+ N P C +LK +RG+W+
Sbjct: 180 FNAVVSKKDLYETYLPAFEACVKEADVEAVMGAYNRTNDEPCCGSSLLLKDILRGKWQFK 239
Query: 283 GYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEID 342
G++VSDC ++ ++ TST E+AA AI+ G DL+CG L A + GL++E D
Sbjct: 240 GHVVSDCWAIADFHLYHGVTSTATESAALAIKNGCDLNCGNVY-LQMLLAYKEGLVTEED 298
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQ--PYGHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
I A + ++RLGMFD E PY K +H E++L A+R+ IV+L+N
Sbjct: 299 ITRAAERLMATRIRLGMFDEECEFNKIPYTMNDCK-----EHHEVSLMASRKSIVMLRNN 353
Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI-----GRYARTIHQQ 455
G LPL + +++ +IGPN+D + + GNY G A Y T L+GI R + +
Sbjct: 354 GL-LPLDKSKLKSIGIIGPNADSELMLKGNYFGTASKYITVLEGIHEAVDSENIRIFYSE 412
Query: 456 GC-------KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRA 499
GC +D+A DD++ A+ + +D IL +GLD SIE E A D+
Sbjct: 413 GCHLYKDRVQDLAEPDDRM-AEAVTVAEHSDVVILCLGLDSSIEGEQGDAGNSDGAGDKL 471
Query: 500 GLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTA 559
L LPG+QQEL+ KV +A+ P I+VL +G + + AAI+ A YPG GG A
Sbjct: 472 NLNLPGKQQELLEKV-IATGKPVIVVLGAGSAL--TLQGQEENCAAILNAWYPGSFGGRA 528
Query: 560 IADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPF 618
IAD++FG +P GKLP+T+Y LP T+ +M+ RTYR+ K +YPF
Sbjct: 529 IADLIFGKCSPSGKLPVTFYKT--TEELPEFTDYSMK--------NRTYRYMKNESLYPF 578
Query: 619 GHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKN 678
G G++Y+ V + + ++ +G + V + + N
Sbjct: 579 GFGLTYSK-VQLSDLSVSDISKDFEG-------------------------VEVSIKISN 612
Query: 679 VGSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRS 737
VG+ D L + +A + L AF++V + G + V + I+ + VV+
Sbjct: 613 VGNFDIEEVLQCYIKDLESKYAVDNHSLSAFKRVALNKGESKVVKMTINK-RAFEVVNDE 671
Query: 738 GTRRIPLGEHNIHIG 752
G R + + + +G
Sbjct: 672 GDRILDSKKFKLFVG 686
>gi|253579611|ref|ZP_04856880.1| glycoside hydrolase, family 3 domain-containing protein
[Ruminococcus sp. 5_1_39B_FAA]
gi|251849112|gb|EES77073.1| glycoside hydrolase, family 3 domain-containing protein
[Ruminococcus sp. 5_1_39BFAA]
Length = 706
Score = 401 bits (1031), Expect = e-109, Method: Compositional matrix adjust.
Identities = 260/733 (35%), Positives = 391/733 (53%), Gaps = 104/733 (14%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
++ L+ +++L EK L AA V RLG+ Y +W+EALHGV+ G A
Sbjct: 13 KKAEKLVSQMTLLEKASQLKYDAAPVKRLGVPAYNYWNEALHGVARAGV----------A 62
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRD 162
T FPQ I A+ F+ + +G +++ E RA YN +A GLT+WSPNVNIFRD
Sbjct: 63 TMFPQAIAMAAVFDDEEMKKVGDIIATEGRAKYNAYSAKEDRDIYKGLTFWSPNVNIFRD 122
Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--R 220
PRWGRG ET GEDP L+ + +V G+QG DG +K AA KH+ + +G + R
Sbjct: 123 PRWGRGHETYGEDPYLTSRLGVKFVEGIQG-DGPVMKAAACAKHYAVH-----SGPESLR 176
Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
F+A+ S +D+ +T+ F V E V +VM +YN+ NG P CA +++ +RG+W+
Sbjct: 177 HEFDAQASMKDMWETYLPAFEALVTEADVEAVMGAYNRTNGEPCCAHKYLMEDVLRGKWK 236
Query: 281 LNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSE 340
G+ SDC ++ +++ TSTP ++AA A+ AG DL+CG LH A Q GL++E
Sbjct: 237 FEGHYTSDCWAIRDFHEHHMVTSTPRQSAAMALNAGCDLNCGNTY-LHMMGAYQDGLVTE 295
Query: 341 IDINNALVNTLTVQMRLGMFDG-EPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKN 399
I + V LT + LG+FDG E PY + K +H + AL+ AR+ VLLKN
Sbjct: 296 EKITESAVRLLTTRYLLGLFDGSEYDKIPYSVVECK-----EHIDEALKMARKSCVLLKN 350
Query: 400 QGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQ 455
G LP+ + T+ VIGPN+D +IGNY G + Y T L+GI G R ++ Q
Sbjct: 351 DG-VLPIDKTKVNTIGVIGPNADSRAALIGNYHGTSSEYITVLEGIREEAGDDVRILYSQ 409
Query: 456 GC-------KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRA 499
GC +++A D++ A I A +D IL +GL++++E E + D+
Sbjct: 410 GCDLYKDKVENLAWDQDRISEAVITA-ENSDVVILCVGLNETLEGEEGDTGNSDASGDKV 468
Query: 500 GLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTA 559
L LP Q+EL+ KV+ K PTI+VLM+G ID+ +A+++ I+ A YPG GG A
Sbjct: 469 DLHLPKVQEELIEKVTAVGK-PTIVVLMAGSAIDLNYAQDN--CNGILLAWYPGARGGRA 525
Query: 560 IADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPF 618
IAD+LFG +P GKLP+T+Y + +P T+ +M+ RTYR+ + +YPF
Sbjct: 526 IADLLFGKESPSGKLPITFYKD--LEGMPEFTDYSMK--------NRTYRYMEKEALYPF 575
Query: 619 GHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKN 678
G+G++Y++ T A V+ D + ++ VKN
Sbjct: 576 GYGLTYSDTCVTEAEVVGEVSAESD--------------------------IVLKATVKN 609
Query: 679 VGSKDGAHTLLVF----STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVV 734
G+ D + V+ +P A + L F++V + AG ++ V I K +++V
Sbjct: 610 NGTVDTDEVVQVYIKDLDSPLA---VRNYSLCGFKRVSLKAGEEKSVEFTIS-NKAMNIV 665
Query: 735 DRSGTRRIPLGEH 747
D G R I G+H
Sbjct: 666 DEDGNRYIA-GKH 677
>gi|373952439|ref|ZP_09612399.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
gi|373889039|gb|EHQ24936.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
Length = 721
Score = 400 bits (1029), Expect = e-109, Method: Compositional matrix adjust.
Identities = 259/751 (34%), Positives = 376/751 (50%), Gaps = 87/751 (11%)
Query: 42 FCQVSLPIPQ--------RVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHG 93
FCQ +PI + RV DLI RL+L EKV LL + AVPRL I Y WW+E LHG
Sbjct: 23 FCQ-QIPIYRNPDKKLSTRVQDLISRLTLAEKVSLLGYRSQAVPRLNIPAYNWWNEGLHG 81
Query: 94 VSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA----- 148
V+ G AT FPQ I A++F+ L + + VVS EARA YN TA
Sbjct: 82 VARAGE----------ATIFPQAIAMAATFDDNLVKQVANVVSTEARAKYNLSTAMGRHL 131
Query: 149 ---GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCK 205
GLT+WSPN+NIFRDPRWGRGQET GEDP L+ K +YV GLQG+D LK +A+ K
Sbjct: 132 QYMGLTFWSPNINIFRDPRWGRGQETYGEDPFLTSKMGNAYVHGLQGTDPLHLKTSATAK 191
Query: 206 HFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTC 265
HF A+ +R +F+A V ++D+ DT+ F+ V +G V S+M +YN+VNGVP
Sbjct: 192 HFVAHSGPEG---ERDYFDALVDEKDLRDTYLYAFKSLV-DGGVESIMTAYNRVNGVPNS 247
Query: 266 ADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFL 325
+ ++ + EW G++V+DC ++ Y T E AA AI+AG+DLDC
Sbjct: 248 INKTLVNDIVIKEWGFKGHVVTDCGALDDVYKTHKVLPNRMEVAAAAIKAGVDLDCSSIF 307
Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
+A+ LL+E ++ AL L+ Q +LG FD PSS P+ G + H L
Sbjct: 308 QTDIINAINNKLLTEKQVDAALAAVLSTQFKLGFFDA-PSSSPFYSFGADSIHNDSHVML 366
Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
A + A++ +VLLKN LPL + ++ V+GPN+ ++ +Y G++ ++GI
Sbjct: 367 ARQMAQKSMVLLKNDKQILPLKMQNYSSIMVVGPNAASLDALVASYHGVSSKAVNFVEGI 426
Query: 446 ----GRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEA------ 495
+ R + G D FG I + AD T+ V+GL +E EA
Sbjct: 427 TAAVDKGTRVEYDLGAD---YRDTTHFG-GIWGAGNADVTVAVIGLTPVLEGEAGDAFLS 482
Query: 496 ---LDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYP 552
D+ L LP + + + K P I V+ SG +D+A P A+I A YP
Sbjct: 483 QTGGDKKDLSLPAGDIAFMKALRKSVKKPIIAVVTSGSDVDIAAIA--PYADAVILAWYP 540
Query: 553 GQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYK 611
G+ GG A+ADILFG +P G LP+T+Y + +LP +M+ GRTYR++
Sbjct: 541 GEQGGNALADILFGKISPSGHLPLTFYNS--VNDLPAYNNYSMK--------GRTYRYFA 590
Query: 612 GPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLG 671
G V YPFG G+SYT F + P K +++ + T+
Sbjct: 591 GAVQYPFGFGLSYTTFNYQWQQQP-----------------------KTSYSAKD--TIQ 625
Query: 672 VQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYL 731
+ V VKN G+ + + P + P K+L F+++ + G+ I+I V +
Sbjct: 626 LSVVVKNTGNISADEVVQAYIGYPTLNRMPLKELKGFKRITLNKGSTSLASISIPVTELQ 685
Query: 732 SVVDRSGTRRIPLGEHNIHIGGTKHSVSLHA 762
+ G + +++G L A
Sbjct: 686 KWNSSKHQFELYPGNYTVYLGSDSEDKKLSA 716
>gi|410628680|ref|ZP_11339398.1| beta-glucosidase [Glaciecola mesophila KMM 241]
gi|410151684|dbj|GAC26167.1| beta-glucosidase [Glaciecola mesophila KMM 241]
Length = 732
Score = 400 bits (1029), Expect = e-108, Method: Compositional matrix adjust.
Identities = 256/749 (34%), Positives = 391/749 (52%), Gaps = 84/749 (11%)
Query: 46 SLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGG 105
L R L+ +++ EK+ L A+PRL + Y WW+EALHG++ G
Sbjct: 34 ELSFETRAQALVNAMTIDEKITQLSHSTPAIPRLEVPQYNWWNEALHGIARNGK------ 87
Query: 106 DFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY--------NGGTAGLTYWSPNV 157
AT FPQ I ++F+ L + + +SDEARA Y G AGLT+W+PNV
Sbjct: 88 ----ATIFPQAIGLGATFDPELAQEVANAISDEARAKYAIAQSIGNQGQYAGLTFWTPNV 143
Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
NIFRDPRWGRGQET GEDP+L+ + ++V+GLQG D LK A KHF + +G
Sbjct: 144 NIFRDPRWGRGQETYGEDPLLTSQMGTAFVKGLQGDDPKYLKSAGVAKHFAVH-----SG 198
Query: 218 VD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTI 275
+ R F+ + SK+D+ +T+ F V + KVA VMC+YN V G P+CA +L +
Sbjct: 199 PESLRHQFDVEPSKKDLYETYLPAFEALVTQAKVAGVMCAYNGVYGQPSCASEFLLGEML 258
Query: 276 RGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQR 335
+ +W+ NGY+VSDC ++ ++ T E+AA A+RAG+DL+CG ++A +
Sbjct: 259 KKKWQFNGYVVSDCGALHDFHSGHKVTHNRVESAALALRAGVDLNCGFTYEKSLKAAFEE 318
Query: 336 GLLSEIDINNALVNTLTVQMRLGMFDGEPSS-QPYGHLGPKDVCTPDHQELALEAARQGI 394
GL+++ I+ L N L ++ RLG+FD PS P+ +G + + + +H ELA + A + I
Sbjct: 319 GLITQSLIDQRLKNLLMIRFRLGLFD--PSELNPHNAIGQEVIHSLEHIELARKVAAKSI 376
Query: 395 VLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIG---RYART 451
VLLKN+ LPLS + V GP + + ++GNY GI+ T L+GI +
Sbjct: 377 VLLKNEKQVLPLSK-DIKVPYVTGPFAASSDMLMGNYYGISDSLVTVLEGIAGKVSLGSS 435
Query: 452 IHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL---------DRAGLL 502
++ + ++ A + ++ ADA I V+G+ +E E + DR +
Sbjct: 436 LNYRAGALPFHSNINPLNWAPEVAKTADAVIAVVGISADMEGEEVDAIASADRGDRVAIT 495
Query: 503 LPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIAD 562
LP Q + V +++ KGP ILV+ +G P+D+ ++ DP AI+W YPG+ GG A+AD
Sbjct: 496 LPQNQVDYVKQLAENKKGPLILVVAAGSPVDI--SELDPLADAILWIWYPGEQGGNAVAD 553
Query: 563 ILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGM 622
++FG +NP G LP+T+ + I +LP P GRTY+F K +YPFG G+
Sbjct: 554 VIFGDTNPSGHLPLTFV--KTIDDLP-------PFDDYTMTGRTYKFLKKLPLYPFGFGL 604
Query: 623 SYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSK 682
SYT F GK A + + V+V+N +
Sbjct: 605 SYTQFKF-------------------------GKLSLSKRAPQEGENINISVEVENSTAL 639
Query: 683 DGAHTLLVFSTP--PAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTR 740
DG + V+ +P P + A L AF++VH+ A ++ + I K L V+ +G
Sbjct: 640 DGETVVQVYLSPQVPLKNEAI-TNLKAFKRVHIGAYEKRLIEFTIE-GKNLYRVNDAGEN 697
Query: 741 RIPLGEHNIHIGG---TKHSVSLHAATLG 766
P G + + +G +K S+ L AA G
Sbjct: 698 VWPSGAYTLAVGDSLPSKRSIELGAAPHG 726
>gi|125534110|gb|EAY80658.1| hypothetical protein OsI_35835 [Oryza sativa Indica Group]
Length = 511
Score = 400 bits (1029), Expect = e-108, Method: Compositional matrix adjust.
Identities = 221/490 (45%), Positives = 299/490 (61%), Gaps = 15/490 (3%)
Query: 284 YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDI 343
Y+ SDCD+V D H+T +PE+ A +I+AG+D++CG + +H +AVQ+G L+E DI
Sbjct: 16 YVASDCDAVATIRDAHHYTLSPEDTVAVSIKAGMDVNCGNYTQVHAMAAVQKGNLTEKDI 75
Query: 344 NNALVNTLTVQMRLGMFDGEPSSQP-YGHLGPKDVCTPDHQELALEAARQGIVLLKNQGP 402
+ ALVN V+MRLG FDG+P S YGHLG DVC+P H+ LALEAA+ GIVLLKN
Sbjct: 76 DRALVNLFAVRMRLGHFDGDPRSNAVYGHLGAADVCSPAHKSLALEAAQDGIVLLKNDAG 135
Query: 403 SLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA--RTIHQQGCKDV 460
+LPL ++AVIGPN+D + GNY G C TTPLQGI Y R GC
Sbjct: 136 ALPLQPSAVTSLAVIGPNADNLGALHGNYFGPPCETTTPLQGIKGYLGDRARFLAGCDSP 195
Query: 461 ACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKG 520
ACA AA AS +D +L MGL Q E E LDR LLLPG QQ L++ V+ A++
Sbjct: 196 ACAVAATNEAAALAS-SSDHVVLFMGLSQKQEQEGLDRTSLLLPGEQQGLITAVANAARR 254
Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
P ILVL++GGP+DV FAK++P+I AI+ AGYPGQAGG AIA +LFG NP G+LP+TWYP
Sbjct: 255 PVILVLLTGGPVDVTFAKDNPKIGAILLAGYPGQAGGLAIAKVLFGDHNPSGRLPVTWYP 314
Query: 581 QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTV--------A 632
+E+ T +PMT+M MR + YPGR+YRFY+G VY FG+G+SY+ F + A
Sbjct: 315 EEF-TKVPMTDMRMRADPATGYPGRSYRFYQGNTVYNFGYGLSYSKFSRRMFSSFSTSNA 373
Query: 633 NAPTVVAVPLDGRHGSINATISGKAIK-VTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF 691
+++A + R G +S +K + +C+RL V+V+N G DG H++L++
Sbjct: 374 GNLSLLAGVMARRAGDDGGGMSSYLVKEIGVERCSRLVFPAVVEVQNHGPMDGKHSVLMY 433
Query: 692 ST-PPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIH 750
P P +QL+ F HV G + V + C++ S V G R I G H +
Sbjct: 434 LRWPTKSGGRPARQLIGFRSQHVKVGEKAMVSFEVSPCEHFSWVGEDGERVIDGGAHFLM 493
Query: 751 IGGTKHSVSL 760
+G + SL
Sbjct: 494 VGDEELETSL 503
>gi|255284060|ref|ZP_05348615.1| beta-glucosidase [Bryantella formatexigens DSM 14469]
gi|255265405|gb|EET58610.1| glycosyl hydrolase family 3 C-terminal domain protein
[Marvinbryantia formatexigens DSM 14469]
Length = 700
Score = 400 bits (1028), Expect = e-108, Method: Compositional matrix adjust.
Identities = 265/721 (36%), Positives = 376/721 (52%), Gaps = 95/721 (13%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
+R L+ +++++EK L A A+ RLGI Y WW+EALHGV+ G A
Sbjct: 8 KRAEALVAQMTVEEKASQLKYDAPAIKRLGIPAYNWWNEALHGVARAGQ----------A 57
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRD 162
T FPQ I ++F+ L I V++ E RA YN A GLT+WSPNVNIFRD
Sbjct: 58 TVFPQAIGLGATFDEALLGEIADVIATEGRAKYNAYAAKEDRDIYKGLTFWSPNVNIFRD 117
Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--R 220
PRWGRG ET GEDP L+ + ++V+GLQG DG+ +K AA KHF + +G + R
Sbjct: 118 PRWGRGHETYGEDPCLTSRLGVAFVKGLQG-DGETMKAAACAKHFAVH-----SGPEAVR 171
Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
FNA+ S +D+E+T+ F V E V +VM +YN+ NG CA P +L++ +R +W
Sbjct: 172 HEFNAEASAKDMEETYLPAFEALVKEADVEAVMGAYNRTNGEACCASP-VLQKILREDWG 230
Query: 281 LNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSE 340
G+ VSDC ++ +++ T+T +E+AA AI +G DL+CG LH A + GL+SE
Sbjct: 231 FEGHFVSDCWAIRDFHEHHMLTATAKESAAMAINSGCDLNCGNTY-LHILHAYRDGLVSE 289
Query: 341 IDINNALVNTLTVQMRLGMFDG-EPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKN 399
I A V T + LG+FDG E PY + K +H LA +AA + VLLKN
Sbjct: 290 ETITEAAVRLFTTRFLLGLFDGSEYDDIPYTVVESK-----EHLALAEKAALESAVLLKN 344
Query: 400 QGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY----ARTIHQQ 455
G LPL R RTV VIGPN+D + GNY G A Y T QG+ Y R +
Sbjct: 345 NGI-LPLKKERLRTVGVIGPNADSRAALAGNYHGTASRYETIQQGLQDYLGEDVRVLTSV 403
Query: 456 GC-------KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRA 499
GC + +A A D+L A I + +D IL +GLD+++E E + D+
Sbjct: 404 GCALSEDRTEKLALAGDRLAEAQI-VAENSDVVILCLGLDETLEGEEGDTGNSYASGDKE 462
Query: 500 GLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTA 559
LLLP Q++L+ V+ K P +L +MSG +D+++A AI+ YPG GG+A
Sbjct: 463 TLLLPEAQRDLMEAVAATGK-PVVLCMMSGSDLDMSYAAE--HFDAILQLWYPGSQGGSA 519
Query: 560 IADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFG 619
A +LFG +P GKLP+T+Y E + LP E GRTYR+ P YPFG
Sbjct: 520 AAKLLFGEVSPSGKLPVTFY--ETLEELPAFE-------DYSMKGRTYRYMGHPAQYPFG 570
Query: 620 HGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNV 679
G++Y + T AN I G + + LTL V +
Sbjct: 571 FGLTYGDVRVTDAN-------------------IRGASAE------GDLTLAVTAENAGN 605
Query: 680 GSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGT 739
D + V T A + P+ L AF ++H+ AG ++ + + + + +VVD +G
Sbjct: 606 AVTDEVLQIYVKCTDSA-NAVPNPALAAFGRIHLEAGEKKTIEMTVP-ARAFTVVDEAGV 663
Query: 740 R 740
R
Sbjct: 664 R 664
>gi|320161274|ref|YP_004174498.1| beta-D-xylosidase [Anaerolinea thermophila UNI-1]
gi|319995127|dbj|BAJ63898.1| beta-D-xylosidase [Anaerolinea thermophila UNI-1]
Length = 712
Score = 400 bits (1027), Expect = e-108, Method: Compositional matrix adjust.
Identities = 249/728 (34%), Positives = 393/728 (53%), Gaps = 75/728 (10%)
Query: 48 PIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDF 107
P+ +RVNDLI R++L+EK+ + + AA+PRLGI Y++WSEALHGV+ G
Sbjct: 14 PLEERVNDLISRMTLEEKISQMCNSCAAIPRLGIPAYDYWSEALHGVARNGK-------- 65
Query: 108 PGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN------GGT---AGLTYWSPNVN 158
AT FPQ I A++++ L E + ++ EARA ++ G T GLT WSPN+N
Sbjct: 66 --ATVFPQAIGMAATWDTELIERVADAIASEARAKFHETLRKFGKTDIYQGLTMWSPNIN 123
Query: 159 IFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGV 218
IFRDPRWGRGQET GEDP L+G+ A++VRGLQG D LK AA KH+T + +
Sbjct: 124 IFRDPRWGRGQETWGEDPYLTGEMGAAFVRGLQGKDPHYLKTAACAKHYTVH---SGPEK 180
Query: 219 DRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGE 278
+R FNA V+++++ DT+ F+ V E KV +VM +YN+ G P C P +LK +R +
Sbjct: 181 ERHTFNAIVTRRELFDTYLPAFKKLVTEAKVEAVMGAYNRTLGEPCCGSPYLLKEILRNQ 240
Query: 279 WRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC-GPFLGLHTESAVQRGL 337
W G++VSDC ++ ++ T E+AA I+ G D+ C + + A+ RGL
Sbjct: 241 WGFKGHVVSDCGAINDFHLHHQVTKDGAESAALGIKNGCDMACICTYSYENLTEALNRGL 300
Query: 338 LSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLL 397
++E DI++AL NTL + +LG+FD + PY H+ V H++LA E A + VLL
Sbjct: 301 ITEEDIDHALRNTLRTRFKLGLFDPQ-EKVPYAHISMSVVGCEAHRKLAYETAVKSAVLL 359
Query: 398 KNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG-IGRYARTIHQQG 456
KN LP+ +++ ++GPN+ ++GNY G++ TT ++G +GR + +
Sbjct: 360 KNHNHILPVKP-DVKSILIVGPNAGNVHVLLGNYYGLSDSMTTFMEGLVGRLPEGVRMEF 418
Query: 457 CKDVACADDQLFGA--AIDASRQADATILVMGLD---QSIEAEAL-----DRAGLLLPGR 506
D + ++ ++ D I MGL + E EA+ DR + LP
Sbjct: 419 MPGSLLTDSKKIKNDWSVASAASFDLVIAFMGLSPLLEGEEGEAILSDNGDREDIALPKA 478
Query: 507 QQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFG 566
QQE + +A+ G I+++++GG +A + + AI+W GYPGQ GG AIAD++FG
Sbjct: 479 QQEYIR--DLAATGAKIVLVLTGGSA-IALNGIEDLVEAILWVGYPGQEGGRAIADLIFG 535
Query: 567 TSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTN 626
+P GKLP+T+ P++ + P + RTYR+ ++PFG G+SYT
Sbjct: 536 DHSPSGKLPITF---------PVSTDQLPPFREYSMKERTYRYMTSSPLFPFGFGLSYTQ 586
Query: 627 FVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAH 686
F + L H ++A G+A++ T ++ NVG +G
Sbjct: 587 FEYK----------NLQLEHPVLSA---GEALRGT------------FELANVGEYEGEE 621
Query: 687 TLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLG 745
+ V+ S A P ++L++F++V + G ++ I + + ++D G + + G
Sbjct: 622 VVQVYLSDLEASTIVPLQKLISFQRVRLKPGETVQLSFAIQP-EAMMMIDDEGNQVLEPG 680
Query: 746 EHNIHIGG 753
+ + IGG
Sbjct: 681 KFKLTIGG 688
>gi|266619450|ref|ZP_06112385.1| beta-glucosidase [Clostridium hathewayi DSM 13479]
gi|288869013|gb|EFD01312.1| beta-glucosidase [Clostridium hathewayi DSM 13479]
Length = 714
Score = 399 bits (1026), Expect = e-108, Method: Compositional matrix adjust.
Identities = 266/719 (36%), Positives = 379/719 (52%), Gaps = 91/719 (12%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
+RV DL+ +++L+EKV L A AV RLGI Y WW+EALHGV+ G A
Sbjct: 14 ERVRDLVSQMTLEEKVSQLRYDAPAVERLGIPSYNWWNEALHGVARAG----------AA 63
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYWSPNVNIFRD 162
T FPQ I A+ F+ L E IG V + E RA Y+ G G+T+WSPN+NIFRD
Sbjct: 64 TVFPQAIGLAAMFDEALLEKIGDVTALEGRAKYHEAVRNGDRGLYKGITFWSPNINIFRD 123
Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFH 222
PRWGRG ET GEDP L+G+ +Y++G+QG +G RLK AA KHF A+ R
Sbjct: 124 PRWGRGHETYGEDPCLTGRMGTAYIKGMQG-NGKRLKAAACVKHFAAHSGPEKG---RHS 179
Query: 223 FNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLN 282
FN+ VSK+D+ +T+ F CV E V VM YN++NG C +++ +R +W +
Sbjct: 180 FNSVVSKKDLTETYFPAFERCVKEAGVEGVMGGYNRLNGEAACGSHHLITEILREKWGFD 239
Query: 283 GYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEID 342
GY VSDC ++ ++ T TP+E+AA A+++G DL+CG LH SA +GL+S D
Sbjct: 240 GYYVSDCGAIKDFHMHHGLTDTPQESAALALKSGCDLNCGAVY-LHVMSAYNQGLVSAED 298
Query: 343 INNALVNTLTVQMRLGMFD--GEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
I+ A+ + + +MRLGMFD E PY + C +H LAL+AA + +VLLKN
Sbjct: 299 IDRAVTHLMMTRMRLGMFDQHTEFDEIPY----EINDCA-EHHGLALKAAEESMVLLKND 353
Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQG 456
G LPL +TVAVIGPN D + GNY G A T L+GI G+ R +G
Sbjct: 354 GI-LPLDKTALKTVAVIGPNGDSEEILKGNYNGTATEKYTILEGIRAVLGKETRIFCSEG 412
Query: 457 -------CKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRAG 500
+++A ADD+L A+ + ++D L +GL+ ++E E D+A
Sbjct: 413 SHLYRDNVENLAEADDRL-KEAVSMAVRSDVVFLCLGLNGTLEGEEGDANNSYAGADKAD 471
Query: 501 LLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
L LP Q L+ V + P IL+L +G + + +A IW YPGQ GG A
Sbjct: 472 LNLPESQMRLLKAVC-GTGTPVILLLAAGSAMAINYAAEHCSAILHIW--YPGQMGGLAA 528
Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFG 619
A +L G + P G+LP+T+Y + LP T+ +M+ GRTYR+ + +YPFG
Sbjct: 529 ARLLTGEAVPSGRLPVTFY--QTTEELPEFTDYSMK--------GRTYRYMEREALYPFG 578
Query: 620 HGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNV 679
+G+SY +F ++ A A P DG+ S K + A+C+ + +V V+
Sbjct: 579 YGLSYGDFEYSNFKAEQTEAGP-DGQ-----VRFSVKITNRSKAECDEI---AEVYVRIA 629
Query: 680 GSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSG 738
S+ AP L F ++H+ AG V + V K VV+ G
Sbjct: 630 DSELA---------------APGGSLADFRRIHMKAGESVTVPFTLPV-KAFMVVNEEG 672
>gi|182415033|ref|YP_001820099.1| Beta-glucosidase [Opitutus terrae PB90-1]
gi|177842247|gb|ACB76499.1| Beta-glucosidase [Opitutus terrae PB90-1]
Length = 905
Score = 398 bits (1023), Expect = e-108, Method: Compositional matrix adjust.
Identities = 258/714 (36%), Positives = 372/714 (52%), Gaps = 92/714 (12%)
Query: 46 SLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGG 105
S P+ R +DLI R+SL EKV L + A +PRLG+ Y++W+EA HG++N G
Sbjct: 209 SKPLRVRADDLIRRMSLAEKVSQLKNAAPGIPRLGLPAYDYWNEAAHGIANNGI------ 262
Query: 106 DFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT----------AGLTYWSP 155
AT FPQ I A+++N L G V+ E RA +N GLTYW+P
Sbjct: 263 ----ATVFPQAIGAAAAWNPALLHQEGTVIGIEGRAKFNDYANRHNGDSKWWTGLTYWAP 318
Query: 156 NVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNW 215
N+N+FRDPRWGRGQET GEDP L+ + +V+G+QG D + A KH+ +
Sbjct: 319 NINLFRDPRWGRGQETYGEDPFLTAEIGIEFVKGVQGDDPRYMLAMACAKHYAVHSGPER 378
Query: 216 NGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTI 275
R FNA++ ++D+ DT+ F V EGKVA VM +YN VNGVP A+ +L +
Sbjct: 379 T---RHSFNAEIPERDLFDTYLPHFERVVREGKVAGVMSAYNAVNGVPASANSFLLTELL 435
Query: 276 RGEWRLNGYIVSDCDSV-GVYYDTQ-HFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAV 333
R W GY+ SDCD++ +Y + Q H+ T EEAAA A++AG +L CG AV
Sbjct: 436 RKRWGFEGYVPSDCDAIRDIYGEKQHHYVKTAEEAAALAVKAGCNLCCGGDYNALVR-AV 494
Query: 334 QRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAARQ 392
Q+GL++E D++ AL +TL + RLG+FD P+ Q P+ KD P H ++ALE ARQ
Sbjct: 495 QQGLVTEKDLDGALYHTLWTRFRLGLFD--PAEQVPFSGYTLKDNDLPAHSQVALELARQ 552
Query: 393 GIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRY 448
IVLLKN G +LPL + + +AVIGPN+ + GNY G A + L I G
Sbjct: 553 AIVLLKNDG-TLPLDRTKLKQIAVIGPNAASKSMLEGNYHGSASRSISILDDIRNLVGSE 611
Query: 449 ARTIHQQGCKDVACADDQLFGAAIDASRQ---------------ADATILVMGLDQSIEA 493
+ H G + + + + ADA I V G+ + E
Sbjct: 612 IKITHAMGSPVTTKPGTAPWSGQDNTTDRPVAELKAEALKLAAEADAIIYVGGITPAQEG 671
Query: 494 EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPG 553
E+ DR + LP Q++L+ + K P ++V SG + A D + AI+ A YPG
Sbjct: 672 ESFDRESIELPSEQEDLIRALHATGK-PVVMVNCSGSAM--ALTWQDENLPAIVQAWYPG 728
Query: 554 QAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKG 612
Q GG A+A++LFG +NP G LP+T+Y +LP ++ +M+ RTYR++ G
Sbjct: 729 QEGGRAVAEVLFGETNPSGHLPITFY--RSTADLPDFSDYSMK--------NRTYRYFTG 778
Query: 613 PVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGV 672
+Y FGHG+SY+ F + AN ++V A LT V
Sbjct: 779 RPLYAFGHGLSYSTFEY--AN------------------------LRVAPAANGALT--V 810
Query: 673 QVDVKNVGSKDGAHTLLVFSTPPAGHWAPH-KQLVAFEKVHVPAGAQQRVGINI 725
+D+ N G +DG + +++TPPA + L F + HV AG + V + +
Sbjct: 811 TLDLTNSGKRDGDDVVQLYATPPASSQPQELRALCGFRRTHVKAGETRTVTVTV 864
>gi|374372635|ref|ZP_09630297.1| Beta-glucosidase [Niabella soli DSM 19437]
gi|373235166|gb|EHP54957.1| Beta-glucosidase [Niabella soli DSM 19437]
Length = 734
Score = 398 bits (1023), Expect = e-108, Method: Compositional matrix adjust.
Identities = 257/761 (33%), Positives = 379/761 (49%), Gaps = 88/761 (11%)
Query: 35 ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
A LPF L RVNDL+ RL+L+EKVK +++ A A+PRLGI Y+WWSE LHGV
Sbjct: 21 ADKSQLPFWNYKLSFEARVNDLVSRLTLEEKVKQMLNHAPAIPRLGIPAYDWWSEVLHGV 80
Query: 95 SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT------- 147
+ T T +PQ I A++++ + + E RA++N T
Sbjct: 81 ARTPYHT---------TVYPQAIAMAATWDTVALYTMADQSAREGRAIHNKATEEGKNGD 131
Query: 148 --AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCK 205
GLTYW+PN+NIFRDPRWGRGQET GEDP L+ ++VRGLQG D LK AA K
Sbjct: 132 RYVGLTYWTPNINIFRDPRWGRGQETYGEDPFLTAMLGRAFVRGLQGEDPKYLKAAACAK 191
Query: 206 HFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTC 265
H Y + + R F+ VS D+ +T+ F+ V KVA VMC+YN P C
Sbjct: 192 H---YAIHSGPEAVRHSFDVDVSDYDLWNTYLPAFKELVTHAKVAGVMCAYNAFRKKPCC 248
Query: 266 ADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFL 325
++ +R +W GY+ SDC ++ +++ E AA DA+ G D++CG
Sbjct: 249 GSDLLMTDILRRQWGFTGYVTSDCGAIDDFFNYHKTHPNAEAAAIDAVTNGTDVECGNRA 308
Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQE 384
L AV+ G ++E +I+ ++ ++MRLGMFD P S Y P + + H+
Sbjct: 309 YLTLTDAVKTGRIAEKEIDRSVKRLFMIRMRLGMFD--PVSMVSYAQTSPAVLESAPHKA 366
Query: 385 LALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG 444
AL+ A++ IVLLKN+ LPLS + +AV+GPN+D ++ ++GNY G T L G
Sbjct: 367 QALKMAQESIVLLKNENHLLPLSK-SIKKIAVVGPNADNSIAVLGNYNGTPSKIVTALDG 425
Query: 445 I----GRYARTIHQQGCKDVACADDQL------FGAAIDASRQADATILVMGLDQSIEAE 494
I G +++ K V + L F A + ADA I V G+ +E E
Sbjct: 426 IKAKLGTNGSVVYE---KAVNFTNAMLPEGKTDFAALTSRVKDADAIIFVGGISPQLEGE 482
Query: 495 AL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIA 544
+ DR +LLP Q E + K A+ P + V+M+G + + + + + I
Sbjct: 483 EMKVNEPGFNSGDRTTILLPTVQTEAM-KALKATGKPVVFVMMTGSALAIPWEQEN--IP 539
Query: 545 AIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPG 604
AI+ A Y GQA GTAIAD+LFG NP G+LP+T+Y + +LP + R
Sbjct: 540 AIVNAWYGGQAAGTAIADVLFGDYNPSGRLPVTFYKSD--ADLPAFD-------DYRMEN 590
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
RTYR++ G +YPFG+G+SYT F + PT V
Sbjct: 591 RTYRYFSGQALYPFGYGLSYTTFRYEGLKVPTTVK------------------------- 625
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGI 723
N++ + V + + N G+K G + L S P K L F++V + G + +
Sbjct: 626 -NKVRIPVSIQLTNTGAKGGEEVVQLYISYQGQPIKKPLKALKGFQRVWLNRGQTKTIKF 684
Query: 724 NIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAAT 764
+ L++ +G P G+ I +GG + V+ A +
Sbjct: 685 -LLTPDALAIAGENGKLLNPKGKLRISVGGGQPDVNTPATS 724
>gi|310797011|gb|EFQ32472.1| glycosyl hydrolase family 3 N terminal domain-containing protein
[Glomerella graminicola M1.001]
Length = 767
Score = 398 bits (1022), Expect = e-108, Method: Compositional matrix adjust.
Identities = 237/601 (39%), Positives = 333/601 (55%), Gaps = 18/601 (2%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C +L P+R L+ L+++EK++ L+S A PR+G+ Y WWSEALHGV+ PGT
Sbjct: 43 CDRTLSPPERAAALVKALTVEEKLQNLVSKAQGAPRIGLPAYNWWSEALHGVA-YAPGTY 101
Query: 103 F-GGD--FPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNI 159
F GD F +TS+P + A++F+ L E IG + EARA N G AGL YW+PNVN
Sbjct: 102 FPEGDVEFNSSTSYPMPLLMAAAFDDELIEQIGAAIGIEARAWGNAGWAGLDYWTPNVNP 161
Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGV 218
F+DPRWGRG ETPGED + +YA RGL G G++ +V ++CKH+ D ++WNG
Sbjct: 162 FKDPRWGRGSETPGEDVLRVKRYAEYITRGLDGPVPGEQRRVISTCKHYAGNDFEDWNGT 221
Query: 219 DRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGE 278
R F+AK++ QD+ + + +PF+ C + KV S+MC+YN VNGVP+CA+ +L+ +R
Sbjct: 222 SRHDFDAKITAQDLAEYYLMPFQQCARDSKVGSIMCAYNAVNGVPSCANEYLLQNILREH 281
Query: 279 WRL---NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQR 335
W N Y+ SDC++V + T A AG+D C A +
Sbjct: 282 WNWTEHNNYVTSDCEAVLDVSANHKYAPTNAAGTAICFEAGMDTSCEYTGSSDIPGAWSQ 341
Query: 336 GLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIV 395
GLL E ++ AL+ +R G FDG + Y LG KDV + + Q LAL+AA +GIV
Sbjct: 342 GLLKEETVDRALLRLYEGLVRAGYFDGHEAI--YAKLGWKDVNSAEAQSLALQAAVEGIV 399
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYAR--TIH 453
LLKN G +LPL VA+IG +D + G Y+G A TP + T+
Sbjct: 400 LLKNNG-TLPLDLKPSHKVAMIGFWADAPDKLQGGYSGRAAHLHTPAYAARQLGLDITLA 458
Query: 454 QQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSK 513
A D AA++A+ AD + GLD S E LDR L P Q L+ K
Sbjct: 459 SGPVLQRNNASDNWTAAALEAAEGADYILYFGGLDTSAAGETLDRTDLEWPEAQLMLIKK 518
Query: 514 VSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGK 573
+S K P ++ L+ D + D +++I+WA +PGQ GG AI ++ G +P G+
Sbjct: 519 LSALGK-PLVVNLLGDQLDDTPLLQLD-EVSSILWANWPGQDGGVAIMKLITGEKSPAGR 576
Query: 574 LPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVAN 633
LP+T YP Y +PMT M +RP + +YPGRTYR+Y P+ FG G+ YT F V
Sbjct: 577 LPVTQYPSNYTDLIPMTSMDLRP--TSQYPGRTYRWYDKPIKR-FGFGLHYTTFKAEVGG 633
Query: 634 A 634
A
Sbjct: 634 A 634
>gi|238578959|ref|XP_002388893.1| hypothetical protein MPER_12044 [Moniliophthora perniciosa FA553]
gi|215450599|gb|EEB89823.1| hypothetical protein MPER_12044 [Moniliophthora perniciosa FA553]
Length = 658
Score = 397 bits (1020), Expect = e-107, Method: Compositional matrix adjust.
Identities = 252/672 (37%), Positives = 356/672 (52%), Gaps = 53/672 (7%)
Query: 84 YEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY 143
Y WWSEAL+ F ATSFP IT ++F+ L AI V+S EARA
Sbjct: 1 YNWWSEALN--------------FSSATSFPAPITMGATFDDGLIHAIATVISTEARAFN 46
Query: 144 NGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDG-DRLKVAA 202
N GL +++PN+N F+DPRWGRGQETPGEDP +Y V GLQG G LK+AA
Sbjct: 47 NVNRGGLDFFTPNINPFKDPRWGRGQETPGEDPFHISQYVYQLVTGLQGGVGPTNLKIAA 106
Query: 203 SCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGV 262
CKH+ AYDL+N GV RF F+AKV+ QD+ + + F+ C+ + KVAS+MCSYN VNG+
Sbjct: 107 DCKHWAAYDLENL-GVSRFEFDAKVTMQDLAEFYSPSFQSCIRDAKVASIMCSYNAVNGI 165
Query: 263 PTCADPNILKRTIRGEWRL--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLD 320
P+CA+ +L+ R W L +I DC +VG + H+T P A A+ AG D+D
Sbjct: 166 PSCANRYLLQTLARDFWGLGEEQWITGDCGAVGNIFARHHYTDDPANGTAVALNAGTDID 225
Query: 321 CGPFLGLHTES---AVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDV 377
C ++++ A+ R L+SE + A+ +RL DV
Sbjct: 226 CDSGAAAYSQNLGQALNRSLVSEDQLRTAVTRQYNSLVRLSW---------------DDV 270
Query: 378 CTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACG 437
T Q+LA +AA +GIVLLKN G LPL+ + VAV+GP ++ T M NY GIA
Sbjct: 271 NTEPAQQLAYQAAVEGIVLLKNDG-ILPLAS-SVKKVAVVGPMANATTQMQSNYNGIAPF 328
Query: 438 YTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALD 497
+P Q + + +D F AAI A+ AD V G+D +IE E D
Sbjct: 329 LVSPQQAFRNAGFNVTFANGTGLNSSDTSGFSAAIAAADDADVVFYVGGIDTTIEREDRD 388
Query: 498 RAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGG 557
R + G Q LV +++ K P I++ M GG +D + +++ + A+IW GYPGQ+GG
Sbjct: 389 RPEISWTGNQLALVQQLASLGK-PLIVLQMGGGQVDSSSLRDNTSVNALIWGGYPGQSGG 447
Query: 558 TAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYP 617
TA+ D++ G P G+LP+T YP Y+ PMT+M +RPS S PGRTY++Y G ++
Sbjct: 448 TALVDLITGKQAPAGRLPITQYPASYVDGFPMTDMTLRPSSSN--PGRTYKWYTGAPIFE 505
Query: 618 FGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVK 677
FG G+ YT F A+ +V + S K V H L V V
Sbjct: 506 FGFGLHYTTFDAEWASGGDSFSVQ--------DLVSSAKNSGVAHVDLGVLDT-FNVTVT 556
Query: 678 NVGSKDGAHTLLVFSTPPAG-HWAPHKQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVD 735
N G+ + L+FS AG AP+K+LV++ +V + GA + + + ++ D
Sbjct: 557 NSGTVASDYVALLFSRTTAGPSPAPNKELVSYTRVKGIEPGASSAASLKVTLGA-VARTD 615
Query: 736 RSGTRRIPLGEH 747
G R + GE+
Sbjct: 616 EQGNRVLYPGEY 627
>gi|255572559|ref|XP_002527213.1| Thermostable beta-glucosidase B, putative [Ricinus communis]
gi|223533389|gb|EEF35139.1| Thermostable beta-glucosidase B, putative [Ricinus communis]
Length = 454
Score = 397 bits (1020), Expect = e-107, Method: Compositional matrix adjust.
Identities = 202/459 (44%), Positives = 293/459 (63%), Gaps = 19/459 (4%)
Query: 317 LDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKD 376
+D++CG + + +SAV +G L E DI+ AL+N +VQ+RLG+FDG+ + + LGP+D
Sbjct: 1 MDINCGSYAIRNAQSAVDKGKLREEDIDRALLNLFSVQLRLGLFDGDRINGHFSKLGPED 60
Query: 377 VCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIAC 436
VCT +H++LALEAARQGIVLLKN+ LPL+ ++A+IGP ++ ++ G+Y G +C
Sbjct: 61 VCTEEHKKLALEAARQGIVLLKNEKKFLPLNKKAVSSLAIIGPLANNGGSLGGDYTGYSC 120
Query: 437 GYTTPLQGIGRY-ARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEA 495
+ G+ Y RT + GC +V+C D F AI ++ AD I+V G+D S E E
Sbjct: 121 NPQSLFDGVQAYIKRTSYAVGCSNVSCDSDDQFPEAIHIAKTADFVIVVAGIDLSQETED 180
Query: 496 LDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQA 555
DR LLLPG+Q LVS V+ ASK P ILVL GGP+DV+FAK D RIA+I+W GYPG+A
Sbjct: 181 RDRISLLLPGKQMALVSYVAAASKKPVILVLTGGGPVDVSFAKRDSRIASILWIGYPGEA 240
Query: 556 GGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVV 615
G A+ADI+FG NPGG+LPMTWYP+ + TN+PM +M MR + ++ YPGRTYRFY G V
Sbjct: 241 GAKALADIIFGEYNPGGRLPMTWYPESF-TNVPMNDMNMRANPNRGYPGRTYRFYTGERV 299
Query: 616 YPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAI-----KVTH------AK 664
Y FG G+SYTN+ + +AP+ +++ GS+ AT + + ++ + +
Sbjct: 300 YGFGEGLSYTNYAYKFLSAPSKLSLS-----GSLTATSRKRILHQRGDRLDYIFIDEISS 354
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVFS-TPPAGHWAPHKQLVAFEKVHVPAGAQQRVGI 723
CN L VQ+ V NVG DG+H +++FS P P KQLV FE+++ + I
Sbjct: 355 CNSLRFTVQISVMNVGDMDGSHVVMLFSRVPQVSEGTPEKQLVGFERINTVSHKSTETSI 414
Query: 724 NIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHA 762
+ CK+LS+ + G R +P+G H + +G +H V++ A
Sbjct: 415 LLDPCKHLSIANGQGKRIMPVGSHVLLLGDLQHFVTIEA 453
>gi|189201569|ref|XP_001937121.1| beta-xylosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
gi|187984220|gb|EDU49708.1| beta-xylosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
Length = 756
Score = 397 bits (1019), Expect = e-107, Method: Compositional matrix adjust.
Identities = 257/748 (34%), Positives = 385/748 (51%), Gaps = 32/748 (4%)
Query: 9 FLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKL 68
LGL+L AS + + D + ++ C V+ +R L+ + QEK+
Sbjct: 3 LLGLVLFGASL--VPSAYAIGPDCVNGPLKSNAICDVTASPAKRAAALVAAMQTQEKLDN 60
Query: 69 LISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLW 128
L+S + V RLG+ Y WW EALHGV+ PG F G + ATSFP + +++F+ L
Sbjct: 61 LVSKSKGVARLGLPAYNWWGEALHGVAG-APGINFTGPYRTATSFPMPLLMSAAFDDDLI 119
Query: 129 EAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVR 188
I V+ +EARA NGG A + +W+P++N FRDPRWGRG ETPGED + Y S +
Sbjct: 120 HQIAIVIGNEARAFGNGGIAPVDFWTPDINPFRDPRWGRGSETPGEDILRIKGYTKSLLS 179
Query: 189 GLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGK 248
GL+G R K+ A+CKH+ YD+++WNG DR F+AK++ QD+ + F PF+ C + K
Sbjct: 180 GLEGDKAQR-KIIATCKHYVGYDMEDWNGTDRHSFDAKITTQDLAEYFMPPFQQCARDSK 238
Query: 249 VASVMCSYNQVNGVPTCADPNILKRTIRGEWRL---NGYIVSDCDSVGVYYDTQHFTSTP 305
V S MCSYN VNGVPTCAD +L+ +R W N YI SDC++V + +T
Sbjct: 239 VGSFMCSYNAVNGVPTCADTYVLEDILRKHWNWTDSNNYITSDCEAVKDISLRHKYVATL 298
Query: 306 EEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPS 365
+EA A A G+DL C A +GLL+ I+ AL + G FDG +
Sbjct: 299 QEATAIAFNNGMDLSCEYSGSSDIPGAFSQGLLNVSVIDRALTRQYEGLVHAGYFDGAAA 358
Query: 366 SQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTV 425
+ Y +LG +D+ TP+ Q+L L+ A +G+ LLKN +LPLS VA++G ++ +
Sbjct: 359 T--YANLGVQDINTPEAQKLVLQVAAEGLTLLKNDD-TLPLSLKSGSKVAMVGFWANDSS 415
Query: 426 TMIGNYAGIACGYTTPLQGIGRYA--RTIHQQGCKDVACADDQLFGAAIDASRQADATIL 483
+ G Y+G A P+ + + + A D A+DA++++D +
Sbjct: 416 KLSGIYSGPAPYLHNPVYAGNKLGLDMAVATGPILQKSGAADNWTTKALDAAKKSDTILY 475
Query: 484 VMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRI 543
GLD S AE DR + P Q +L++K +A+ G ++V+ G +D N +
Sbjct: 476 FGGLDPSAAAEGSDRTDISWPSAQIDLITK--LAALGKPLVVIALGDMVDHMPILNMKGV 533
Query: 544 AAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYP 603
++IWA +PGQ GGTA+ ++ G G+LP+T YP +Y T L M +M +RP + P
Sbjct: 534 NSLIWANWPGQDGGTAVMQVITGEHAIAGRLPITQYPAKY-TQLSMLDMNLRPGGNN--P 590
Query: 604 GRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHA 663
GRTYR+Y V PFG G+ YT F G + S+ I T
Sbjct: 591 GRTYRWYN-ESVQPFGFGLHYTKFAAKF------------GSNSSLTVNIQDIMKSCTKD 637
Query: 664 KCNRLTL-GVQVDVKNVGSKDGAHTLLVFSTPPAG-HWAPHKQLVAFEKVHVPAGAQQRV 721
+ + ++V V N G++ L F G P K LV++ ++ +G+Q +
Sbjct: 638 HPDLCDVPPIEVAVTNKGNRTSDFIALAFIKGEVGPKPYPLKTLVSYARLRDISGSQTKT 697
Query: 722 GINIHVCKYLSVVDRSGTRRIPLGEHNI 749
LS VD+SG GE+ +
Sbjct: 698 ASLALTLGTLSRVDQSGNLVAYPGEYTL 725
>gi|358061481|ref|ZP_09148135.1| hypothetical protein HMPREF9473_00197 [Clostridium hathewayi
WAL-18680]
gi|356700240|gb|EHI61746.1| hypothetical protein HMPREF9473_00197 [Clostridium hathewayi
WAL-18680]
Length = 695
Score = 397 bits (1019), Expect = e-107, Method: Compositional matrix adjust.
Identities = 241/599 (40%), Positives = 339/599 (56%), Gaps = 65/599 (10%)
Query: 56 LIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQ 115
L+ +++L+E+ + A AVPRLGI Y WW E LHGV+ G AT FPQ
Sbjct: 13 LVEQMTLEERASQMRYDAPAVPRLGIPAYNWWGEGLHGVARAGT----------ATMFPQ 62
Query: 116 VITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRDPRWGR 167
I A+ F+ L E I VVS E RA YN GLT+WSPNVNIFRDPRWGR
Sbjct: 63 AIAMAAMFDVELTEEIANVVSTEGRAKYNQFCEEGDRDIYKGLTFWSPNVNIFRDPRWGR 122
Query: 168 GQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--RFHFNA 225
G ET GEDP L+ + ++VRGLQG DG+ LK+AA KHF + +G + R F A
Sbjct: 123 GHETYGEDPYLTSRLGTAFVRGLQG-DGEHLKIAACAKHFAVH-----SGPEALRHEFWA 176
Query: 226 KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYI 285
SK+D+ +T+ F CV E V SVM +YN +G P CA+ +++ +RG+W G+
Sbjct: 177 DTSKKDLWETYLPAFEACVKEAHVESVMGAYNSYHGEPCCANTLLMEEILRGQWGFEGHF 236
Query: 286 VSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINN 345
VSDC ++ ++ T T E+AA A++ G DL+CG L A + GLL + +
Sbjct: 237 VSDCWAIRDFHMNYMVTDTAMESAALAVKKGCDLNCGNTY-LQVLKACEEGLLDDACVTE 295
Query: 346 ALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLP 405
A+V T + LGM GE + Y + + V +H+ELA+EAAR+ +VLLKN G LP
Sbjct: 296 AVVRLFTTRYLLGM--GEETE--YDDIPYEVVECKEHRELAVEAARRSMVLLKNDG-LLP 350
Query: 406 LSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGC---K 458
L + T+AVIGPN+D +IGNY G + YTT L+GI G R ++ +GC K
Sbjct: 351 LHAEKLNTIAVIGPNADNRTALIGNYHGTSSCYTTILEGIQDAVGEDVRVLYAEGCHLFK 410
Query: 459 D----VACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRAGLLLPG 505
D +A A D+L A I ++ +D +L +GLD+++E E + D+ LLLP
Sbjct: 411 DRVEHLAVAGDRLSEARI-VAKHSDVVVLCVGLDETLEGEEGDTGNSHASGDKKDLLLPE 469
Query: 506 RQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILF 565
Q+ L+ ++ K P ++ MSG ID++ A+ + A+I YPG GG A+AD+LF
Sbjct: 470 SQRRLMEEILNLGK-PVVVCNMSGSAIDLSLAQE--KAGAVIQVWYPGAEGGRALADLLF 526
Query: 566 GTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSY 624
G ++P GKLP+T+Y + NLP P + GRTYR+ +YPFG G++Y
Sbjct: 527 GKASPSGKLPVTFYKD--LENLP-------PFEDYSMDGRTYRYLTAEPLYPFGFGLTY 576
>gi|291548352|emb|CBL21460.1| Beta-glucosidase-related glycosidases [Ruminococcus sp. SR1/5]
Length = 697
Score = 397 bits (1019), Expect = e-107, Method: Compositional matrix adjust.
Identities = 252/737 (34%), Positives = 385/737 (52%), Gaps = 99/737 (13%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
++ L+ R++L+EK L A A+ RLGI Y WW+E LHGV+ G A
Sbjct: 8 KKAEALVARMTLEEKASQLRYDAPAIKRLGIPAYNWWNEGLHGVARAGQ----------A 57
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRD 162
T FPQ I A++F+ + +V+ E RA YN + GLT+WSPNVNIFRD
Sbjct: 58 TVFPQAIGMAAAFDRKSVAEMAGIVATEGRAKYNAYSVNGDRDIYKGLTFWSPNVNIFRD 117
Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--R 220
PRWGRG ET GEDP L+ + S+V+ LQG +GD +K AA KHF + +G + R
Sbjct: 118 PRWGRGHETYGEDPYLTKELGVSFVKALQG-NGDTMKAAACAKHFAVH-----SGPEALR 171
Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
F+A+ S +D+E+T+ F V E KV +VM +YN+ NG P C P L++ +RGEW+
Sbjct: 172 HEFDAEASAKDMEETYLPAFEGLVKEAKVEAVMGAYNRTNGEPCCGSPT-LQKKLRGEWK 230
Query: 281 LNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSE 340
G+ VSDC ++ +++ T T E+AA AI G DL+CG LH A ++GL++E
Sbjct: 231 FQGHFVSDCWAIRDFHEHHMVTDTAVESAALAINNGCDLNCGNTY-LHIMKAYEKGLVTE 289
Query: 341 IDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
I A V T + LG+FDG Y +L +V +P H + A +AA + VLLKN
Sbjct: 290 ETITRAAVRLFTTRYLLGLFDG----SEYDNLSYMEVESPRHLDAAEKAAEKSFVLLKNN 345
Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY----ARTIHQQG 456
G LPL + +T+ +IGPN+D +IGNY G A Y T +GI Y R + +G
Sbjct: 346 G-ILPLDKEKLKTIGIIGPNADSRQALIGNYHGTASRYITIQEGIQDYVGDDVRILTSRG 404
Query: 457 C-------KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRAG 500
C + +A D++ A + + +D IL MGLD+++E E + D+
Sbjct: 405 CDLFRDRTEHLAFTRDRIAEAKV-VAENSDVVILCMGLDETLEGEEGDTGNSYVSGDKED 463
Query: 501 LLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
+ LPG Q+EL+ ++ K P + L++G +D+ +A ++W YPG GG A
Sbjct: 464 IELPGVQRELMEAIADTGK-PVVFCLLAGSDLDLKYAAEKFDAVMMLW--YPGCQGGKAA 520
Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFG 619
A +LFG +P GKLP+T+Y E + LP T+ +M+ GRTYR+ + +PFG
Sbjct: 521 AKVLFGEISPSGKLPVTFY--ESLEELPDFTDYSMK--------GRTYRYMERKAQFPFG 570
Query: 620 HGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNV 679
+G++Y+ + +K K N V+V+V+N
Sbjct: 571 YGLTYS------------------------KVAVDKAEVKTCGQKIN-----VEVEVQNN 601
Query: 680 GSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSG 738
G+ D + ++ + + P+ L F+++ + AG +++ I I K +VVD +G
Sbjct: 602 GAYDTEDVVQIYVKNIDSKNAIPNPMLAGFQRIFLKAGECRKIEIPIWE-KAFTVVDETG 660
Query: 739 TRRIPLGEHNIHIGGTK 755
R + I+ G ++
Sbjct: 661 KRMEEGKKFEIYAGCSQ 677
>gi|365135698|ref|ZP_09343911.1| hypothetical protein HMPREF1032_03710 [Subdoligranulum sp.
4_3_54A2FAA]
gi|363612160|gb|EHL63713.1| hypothetical protein HMPREF1032_03710 [Subdoligranulum sp.
4_3_54A2FAA]
Length = 643
Score = 396 bits (1018), Expect = e-107, Method: Compositional matrix adjust.
Identities = 239/613 (38%), Positives = 347/613 (56%), Gaps = 60/613 (9%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
QR L+ +++L+EKV + A A+ RLGI Y WW+E LHGV G A
Sbjct: 6 QRARALVAQMTLEEKVSQMRYDAPAIERLGIPAYNWWNECLHGVGRSGT----------A 55
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYWSPNVNIFRD 162
T FPQ I A+SF+ +L E + + +SDEARA YN G GLT+WSPN+N+FRD
Sbjct: 56 TVFPQPIGMAASFDESLLEHVAQAISDEARAKYNQYKTFGETGIYQGLTFWSPNINLFRD 115
Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQ-GSDGDRLKVAASCKHFTAYDLDNWNGVDRF 221
PRWGRG ET GEDP+L+G+ +++RGLQ G D K+ A+ KHF A+ R
Sbjct: 116 PRWGRGHETYGEDPLLTGRMGTAFIRGLQEGEDSQYRKLDATVKHFAAHSGPE---AGRH 172
Query: 222 HFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRL 281
FNA+VS +D+ D++ FR C+ K A+VM +YN++NG P CA LK + EW+
Sbjct: 173 SFNAEVSAEDMADSYLWAFRYCIEHAKPAAVMGAYNRINGEPACASSTYLKGVLYEEWKF 232
Query: 282 NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLH-TESAVQRGLLSE 340
+GY+VSDC ++ + H T +E+AA A+ G L+CG H ++AV+ GL+SE
Sbjct: 233 DGYVVSDCGAIQDINENHHVTKNEKESAALAVNNGCQLNCGK--AYHWVKAAVEDGLISE 290
Query: 341 IDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
+ A+ + RLGMFD S Y + + H+EL + A++ IVLLKN
Sbjct: 291 DTVTCAVERLFEARFRLGMFD---SDCVYDSIPMNVIECRKHRELNRKMAQESIVLLKNN 347
Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYAR--TIHQQGC- 457
G LPL+ +T+AVIGPN+D ++GNY G +TT L+GI AR + +G
Sbjct: 348 G-ILPLN--PEKTIAVIGPNADDKTVLLGNYNGTPSHWTTLLRGIQDQARGEVYYARGSV 404
Query: 458 ---KDVACADDQLFGAAIDASRQADATILVMGLDQSIE---------AEALDRAGLLLPG 505
K+ ++ AI ++ AD +L +GL +E A++ DR + LP
Sbjct: 405 LVEKEALPWAEKPLHEAIYTAKAADVVVLCLGLSPLLEGEEGDAYNGADSGDRKDISLPD 464
Query: 506 RQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILF 565
QQ+L+ + + ++ P +LV +SGG +D+ A D R AAI+ YPG GG A+ADILF
Sbjct: 465 IQQQLLCAI-LDTEKPVVLVNVSGGCVDLRQA--DERCAAILQCFYPGAEGGNALADILF 521
Query: 566 GTSNPGGKLPMTWYPQEYITNL-PMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSY 624
G +P G+LP+T+Y + +L P T+ +M+ GRTYRF+ G +YPFGHG++Y
Sbjct: 522 GRVSPSGRLPVTFY--RTVEDLPPFTDYSMK--------GRTYRFFDGKPLYPFGHGLTY 571
Query: 625 TNFVHTVANAPTV 637
+ + TV
Sbjct: 572 ADIKEQWTDPYTV 584
>gi|225878709|dbj|BAH30674.1| beta-xylosidase [Aspergillus aculeatus]
Length = 785
Score = 395 bits (1015), Expect = e-107, Method: Compositional matrix adjust.
Identities = 258/726 (35%), Positives = 381/726 (52%), Gaps = 32/726 (4%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C + R L ++L+E + + A+PRLG+ Y+ W+EALHG+ + T+
Sbjct: 62 CDRTASAHDRAAALTSMMTLEELMNSTGNRIPAIPRLGLPPYQIWNEALHGLY-LANFTE 120
Query: 103 FGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRD 162
G F +TSFP I T ++ N TL I ++++ + RA N G GL +SPN+N FR
Sbjct: 121 -SGPFSWSTSFPSPILTMATLNRTLIHQIAQIIATQGRAFNNAGRYGLNAFSPNINAFRH 179
Query: 163 PRWGRGQETPGEDP-VLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRF 221
P WGRGQETPGED L YA Y+ GLQG + K+ A+ KH+ YD++NW RF
Sbjct: 180 PVWGRGQETPGEDANCLCSAYAYEYITGLQG-NATNPKIIATAKHYAGYDIENWRQRSRF 238
Query: 222 HFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRL 281
+ +++QD+ + F F + V + +V SVM SYN VNGVP+ A+ +L+ +R W
Sbjct: 239 GNDLNITQQDLAEYFTPQFVVAVRDAQVRSVMPSYNAVNGVPSSANTFLLQTLVRDSWGF 298
Query: 282 --NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLL 338
+GY+ SDCD+V ++ + + A+A ++RAG D+DCG +L ES Q G +
Sbjct: 299 IQDGYMASDCDAVYNVFNPHGYAANLSSASAMSLRAGTDIDCGISYLTTLNESLTQ-GQI 357
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
S +I A+ + + G FDG + PY L DV + +A EAA G+VLLK
Sbjct: 358 SRSEIERAVTRFYSNLVSAGYFDGPDA--PYRDLSWSDVVRTNRWNVAYEAAVAGVVLLK 415
Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCK 458
N G LPLS R VA+IGP ++ T M GNY G+A T+PL + ++
Sbjct: 416 NDG-VLPLSKSVQR-VALIGPWANATEQMQGNYHGVAPYLTSPLAAVQASGLEVNYAFGT 473
Query: 459 DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMAS 518
++ F AA+ A+ ++D I G+D ++EAE LDRA + PG Q EL+ ++
Sbjct: 474 NITSNVTNCFAAALAAAEKSDIIIFAGGIDNTLEAEELDRANITWPGNQLELIHRLGELG 533
Query: 519 KGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTW 578
K P +++ M GG +D + K ++ A++W GYPGQAGG A+ DIL G P G+L T
Sbjct: 534 K-PLVVLQMGGGQVDSSALKASEKVGALLWGGYPGQAGGQALWDILTGQRAPAGRLTTTQ 592
Query: 579 YPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
YP EY P T+M++RP PG+TY +Y G VY FGHG+ YT F T +
Sbjct: 593 YPAEYALQFPATDMSLRPRGDN--PGQTYMWYTGEPVYAFGHGLFYTTFA-------TAL 643
Query: 639 AVPLDGRHGSINATISGKAIKVTHAKCNRLT----LGVQVDVKNVGSKDGAHTLLVFSTP 694
A P S + G + HA N + L V V N G +T + F+
Sbjct: 644 AGPGQEPERSFDI---GALLARPHAGYNLVEQLPFLNFTVKVTNTGEVISDYTAMAFANT 700
Query: 695 PAG-HWAPHKQLVAFEKVHVPAGAQQRVGINIHVC-KYLSVVDRSGTRRIPLGEHNIHIG 752
AG P+K LV F+++ P + +++ V L+ D G R I G + + +
Sbjct: 701 TAGPRPHPNKWLVGFDRIG-PLDPRVSARMSVPVSLDSLARTDAQGNRVIYPGPYELALN 759
Query: 753 GTKHSV 758
+ V
Sbjct: 760 NERSPV 765
>gi|6573772|gb|AAF17692.1|AC009243_19 F28K19.27 [Arabidopsis thaliana]
Length = 696
Score = 395 bits (1014), Expect = e-107, Method: Compositional matrix adjust.
Identities = 200/473 (42%), Positives = 305/473 (64%), Gaps = 9/473 (1%)
Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
DCD+V + YD Q + +PE+A AD ++AG+D++CG +L HT+SA+Q+ +SE DI+ AL
Sbjct: 221 DCDAVSIIYDAQGYAKSPEDAVADVLKAGMDVNCGSYLQKHTKSALQQKKVSETDIDRAL 280
Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
+N +V++RLG+F+G+P+ PYG++ P +VC+P HQ LAL+AAR GIVLLKN LP S
Sbjct: 281 LNLFSVRIRLGLFNGDPTKLPYGNISPNEVCSPAHQALALDAARNGIVLLKNNLKLLPFS 340
Query: 408 HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYART-IHQQGCKDVACADDQ 466
++AVIGPN+ V T++GNYAG C TPL + Y + ++ QGC VAC++
Sbjct: 341 KRSVSSLAVIGPNAHVVKTLLGNYAGPPCKTVTPLDALRSYVKNAVYHQGCDSVACSNAA 400
Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
+ A+ ++ AD +L+MGLDQ+ E E DR L LPG+QQEL++ V+ A+K P +LVL
Sbjct: 401 I-DQAVAIAKNADHVVLIMGLDQTQEKEDFDRVDLSLPGKQQELITSVANAAKKPVVLVL 459
Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
+ GGP+D++FA N+ +I +IIWAGYPG+AGG AI++I+FG NPGG+LP+TWYPQ ++ N
Sbjct: 460 ICGGPVDISFAANNNKIGSIIWAGYPGEAGGIAISEIIFGDHNPGGRLPVTWYPQSFV-N 518
Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVAN-APTVVAVPLDGR 645
+ MT+M MR + YPGRTY+FYKGP VY FGHG+SY+ + + A T + +
Sbjct: 519 IQMTDMRMRSATG--YPGRTYKFYKGPKVYEFGHGLSYSAYSYRFKTLAETNLYLNQSKA 576
Query: 646 HGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW---APH 702
+ ++ ++ C+ V V+V+N G G H +L+F+ G
Sbjct: 577 QTNSDSVRYTLVSEMGKEGCDVAKTKVTVEVENQGEMAGKHPVLMFARHERGGEDGKRAE 636
Query: 703 KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTK 755
KQLV F+ + + G + + I +C++LS + G + G++ + +G ++
Sbjct: 637 KQLVGFKSIVLSNGEKAEMEFEIGLCEHLSRANEFGVMVLEEGKYFLTVGDSE 689
Score = 249 bits (637), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 112/205 (54%), Positives = 144/205 (70%), Gaps = 8/205 (3%)
Query: 23 AAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIK 82
+A P +CDP + TT+ FC+ LPI +R DL+ RL++ EK+ L++ A +PRLG+
Sbjct: 19 SAPPPHSCDPSNPTTKLYQFCRTDLPIGKRARDLVSRLTIDEKISQLVNTAPGIPRLGVP 78
Query: 83 GYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAM 142
YEWWSEALHGV+ GPG +F G ATSFPQVI TA+SF++ W I +V+ EAR +
Sbjct: 79 AYEWWSEALHGVAYAGPGIRFNGTVKAATSFPQVILTAASFDSYEWFRIAQVIGKEARGV 138
Query: 143 YNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD----- 196
YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP+++G YA +YVRGLQG D
Sbjct: 139 YNAGQANGMTFWAPNINIFRDPRWGRGQETPGEDPMMTGTYAVAYVRGLQGDSFDGRKTL 198
Query: 197 --RLKVAASCKHFTAYDLDNWNGVD 219
L+ +A CKHFTAYDLD W D
Sbjct: 199 SNHLQASACCKHFTAYDLDRWKDCD 223
>gi|94442924|emb|CAJ91136.1| beta-xylosidase [Platanus x acerifolia]
Length = 231
Score = 393 bits (1010), Expect = e-106, Method: Compositional matrix adjust.
Identities = 181/232 (78%), Positives = 208/232 (89%), Gaps = 3/232 (1%)
Query: 183 AASYVRGLQGSDG--DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPF 240
AASYVRGLQ G RLKVAA CKH+TAYDLDNW G+DRFHFNA+VSKQD+EDTF+VPF
Sbjct: 1 AASYVRGLQQPYGRDSRLKVAACCKHYTAYDLDNWKGIDRFHFNARVSKQDLEDTFNVPF 60
Query: 241 RMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQH 300
R CV+EGKVASVMCSYNQVNG+PTCADPN+L+ TIRGEWRLNGYIV+DCDS+GV YDTQH
Sbjct: 61 RECVVEGKVASVMCSYNQVNGIPTCADPNLLRNTIRGEWRLNGYIVTDCDSIGVLYDTQH 120
Query: 301 FTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMF 360
+TSTPEE+AADAI+AG+DLDCGPFL +HT+ A+ R +LSE+ ++ AL NTL VQMRLGMF
Sbjct: 121 YTSTPEESAADAIKAGVDLDCGPFLAVHTQEAITRRMLSEVYVDGALANTLAVQMRLGMF 180
Query: 361 DGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHR 412
DGEPS+QP+GHLGP+DVCT HQ+LALEAARQGIVL+KNQG SLPLS RHR
Sbjct: 181 DGEPSAQPFGHLGPRDVCTSAHQQLALEAARQGIVLMKNQG-SLPLSTARHR 231
>gi|358380569|gb|EHK18247.1| glycoside hydrolase family 3 protein, partial [Trichoderma virens
Gv29-8]
Length = 722
Score = 393 bits (1010), Expect = e-106, Method: Compositional matrix adjust.
Identities = 258/710 (36%), Positives = 379/710 (53%), Gaps = 44/710 (6%)
Query: 60 LSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG-------TKFGGDFPGATS 112
L+L EK L++ A V RLG+ YEW +EALHG++ V PG T+ F +T
Sbjct: 12 LTLDEKAANLVNNAPGVKRLGLPPYEWRNEALHGLAGVSPGQGINSTFTQGNVAFNSSTQ 71
Query: 113 FPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETP 172
FP I ++F+ L I VS EARA N AGL YW+PN+N +RDPRWGRGQETP
Sbjct: 72 FPSPIVLGAAFDDHLVHDIATAVSTEARAFSNHLKAGLDYWAPNINPYRDPRWGRGQETP 131
Query: 173 GEDPVLSGKYAASYVRGLQGSDGD-RLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQD 231
GEDP +YA +YV GL+G G + KV ++CKHF YD+++ +GV R +NA +S QD
Sbjct: 132 GEDPYHVAQYAYNYVVGLKGGVGPAKSKVVSTCKHFAGYDIEDSDGVVRGSYNAIISTQD 191
Query: 232 IEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNG---YIVSD 288
+ + + FR C + K +VMCSYN VNG P+CA+ +L +R W ++ D
Sbjct: 192 LAEYYLPSFRSCFRDAKTGAVMCSYNAVNGHPSCANSYMLDTVLRDHWGWGSSAHWVTGD 251
Query: 289 CDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALV 348
C +V ++ H + + A AI G DLDCG + SAVQ +E ++ AL
Sbjct: 252 CGAVDGVFNQHHVGQSAAQGVAFAINNGTDLDCGTAYASNIASAVQNNYTTEAQLDQALS 311
Query: 349 NTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSH 408
+ + LG FD P Q Y LG DV TP Q+LA A +GI +L P P+
Sbjct: 312 RLYSSLIVLGYFD-PPEGQEYRTLGVSDVNTPSTQKLAYTALVEGINIL----PIRPMG- 365
Query: 409 IRHRTVAVIGP-NSDVTVTMIGNYAGIACGYTTPLQGIGRYA---RTIHQQGCKDVACAD 464
+TV +GP ++ +V+M GNY G+A T P+ A + QG + V D
Sbjct: 366 ---QTVLFVGPWANNASVSMFGNYNGVAPYKTIPVPTANSSAYNWNVTYSQGLQYVLSND 422
Query: 465 DQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTIL 524
F AA+ A+++AD + + G+D+ +EAEA DR + PG Q L+ + +A+ P ++
Sbjct: 423 TSQFAAAVSAAQEADVVVYIGGIDEQVEAEAHDRTSIDWPGAQLNLIKQ--LAAVKPVVV 480
Query: 525 VLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYI 584
V + GG +D + + + ++W GYPGQ G+ + DIL G S P G+LP+T YP YI
Sbjct: 481 VQVGGGQVDDSSLLQNKNVKGLLWMGYPGQEFGSGLIDILSGASAPAGRLPVTQYPANYI 540
Query: 585 TNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDG 644
T +PMT+ ++RPS S PGRTYR+Y G V+ PFG G+ YT F + + G
Sbjct: 541 TQVPMTDQSLRPSSSN--PGRTYRWYNGSVI-PFGTGIHYTKFNISWKTGGS-------G 590
Query: 645 RHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF--STPPAGHWAPH 702
R A A+ + Q++V+NVGS + L+F S+ P
Sbjct: 591 RGTYDTADFINAEDPKDLAEFDVF----QINVENVGSTTSDYVALLFVKSSDSGPQPYPL 646
Query: 703 KQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHI 751
K LV++ + H G ++ + ++V + ++ D SG + G + + I
Sbjct: 647 KTLVSYARAHGTQPGETTKIDLRVNVGQ-IARNDSSGNLVLYPGAYTLEI 695
>gi|115436096|ref|NP_001042806.1| Os01g0296700 [Oryza sativa Japonica Group]
gi|113532337|dbj|BAF04720.1| Os01g0296700, partial [Oryza sativa Japonica Group]
Length = 522
Score = 393 bits (1009), Expect = e-106, Method: Compositional matrix adjust.
Identities = 227/522 (43%), Positives = 309/522 (59%), Gaps = 30/522 (5%)
Query: 259 VNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYY-DTQHFTSTPEEAAADAIRAGL 317
+NGVP CAD +L T+R +W+L+GYIVSDCDSV V D + T EA A A++AGL
Sbjct: 1 INGVPACADARLLTETVRRDWQLHGYIVSDCDSVRVMVRDAKWLGYTGVEATAAAMKAGL 60
Query: 318 DLDCGPFL-GLHT------ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYG 370
DLDCG F G+H AV++G L E ++NAL N MRLG FDG P +
Sbjct: 61 DLDCGMFWEGVHDFFTTYGVDAVRQGKLKESAVDNALTNLYLTLMRLGFFDGIPELE--- 117
Query: 371 HLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVT--MI 428
LG DVCT +H+ELA +AARQG+VLLKN LPLS + +VA+ G + T M+
Sbjct: 118 SLGAADVCTEEHKELAADAARQGMVLLKNDAALLPLSPEKVNSVALFGQLQHINATDVML 177
Query: 429 GNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLD 488
G+Y G C TP G+ + + C +C A A++ DATI+V GL+
Sbjct: 178 GDYRGKPCRVVTPYDGVRKVVSSTSVHACDKGSC------DTAAAAAKTVDATIVVAGLN 231
Query: 489 QSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIW 548
S+E E+ DR LLLP Q ++ V+ AS P +LV+MS G +DV+FA+++P+I A++W
Sbjct: 232 MSVERESNDREDLLLPWSQASWINAVAEASPSPIVLVIMSAGGVDVSFAQDNPKIGAVVW 291
Query: 549 AGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYR 608
AGYPG+ GGTAIAD+LFG NPGG+LP+TWY EY++ +PMT MA+RP YPGRTY+
Sbjct: 292 AGYPGEEGGTAIADVLFGKYNPGGRLPLTWYKNEYVSKIPMTSMALRPDAEHGYPGRTYK 351
Query: 609 FYKGP-VVYPFGHGMSYTNFVHTVANAPTVVAVPLDG----RHGSINATISG----KAIK 659
FY G V+YPFGHG+SYTNF + A A V V + + + A +S A+
Sbjct: 352 FYGGADVLYPFGHGLSYTNFTYASATAAAPVTVKVGAWEYCKQLTYKAGVSSPPACPAVN 411
Query: 660 VTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG-HWAPHKQLVAFEKVHVPAGAQ 718
V C + V V N G +DG H + +++ PPA AP KQLVAF +V V AGA
Sbjct: 412 VASHACQE-EVSFAVTVANTGGRDGTHVVPMYTAPPAEVDGAPRKQLVAFRRVRVAAGAA 470
Query: 719 QRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
V ++VCK ++V+ + +P G + +G S+S
Sbjct: 471 VEVAFALNVCKAFAIVEETAYTVVPSGVSRVLVGDDALSLSF 512
>gi|167519969|ref|XP_001744324.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163777410|gb|EDQ91027.1| predicted protein [Monosiga brevicollis MX1]
Length = 721
Score = 393 bits (1009), Expect = e-106, Method: Compositional matrix adjust.
Identities = 232/611 (37%), Positives = 336/611 (54%), Gaps = 31/611 (5%)
Query: 40 LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISG-------AAAVPRLGIKGYEWWSEALH 92
LPFC +SL R DL RL+L E + L + A VPRLG++ Y + +E LH
Sbjct: 42 LPFCDLSLDFRDRAWDLAQRLTLDELAQQLNTYSFTPQAYAPGVPRLGLRNYSYHAEGLH 101
Query: 93 GVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN-------- 144
G+ + ++P AT +PQV A++ NA+L + ++ E RA+ N
Sbjct: 102 GIRDANVV-----NYP-ATLYPQVTAMAATANASLIHEMSTIMGTELRAVNNRAQELGEI 155
Query: 145 -GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR-LKVAA 202
G L+ + P +NI RD RWGR QE+ EDP L+G YA ++V GL+ + + L+ A
Sbjct: 156 FGRGGALSIYGPTMNIIRDGRWGRSQESVSEDPWLNGLYAVNFVLGLEQRNSSKYLQAAT 215
Query: 203 SCKHFTAYDLDNWNG-VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNG 261
SCKH AY + +N + R FNA + + DI DT+ FR CV G V +MCSYN VNG
Sbjct: 216 SCKHLFAYSFEGYNNTLTRHSFNAVIDELDIHDTYLPAFRACVELGHVQQIMCSYNSVNG 275
Query: 262 VPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC 321
+P CA ++ +R W G IVSDCD+V Y+T ++T TPE+A A++ G DLDC
Sbjct: 276 IPACARGDVQNDRVRKAWGFEGLIVSDCDAVADIYNTHNYTRTPEDAVTVALQGGCDLDC 335
Query: 322 GPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD 381
G F H SAVQ+ L + + ++ L ++ LG FD + +S PY LG + + TP
Sbjct: 336 GDFYSQHLASAVQQNLTTLAALQQSMTRVLEMRFLLGEFDPD-TSVPYRQLGREAIDTPF 394
Query: 382 HQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGY-TT 440
++ +L A+R+ +VLL+N+ LP++ VA+IGP ++T M+G + TT
Sbjct: 395 ARDSSLRASRESVVLLENRIKLLPVTLSADIKVALIGPYVNLTTIMMGGKLDYTPSFITT 454
Query: 441 PLQGIGRYART--IHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDR 498
QG T GC A L A+ + QAD +L +GL IE E DR
Sbjct: 455 YFQGFQAIGITHLTSSPGCNITAPLPGAL-DKAVQIATQADLVVLTLGLSSDIEHEGGDR 513
Query: 499 AGLLLPGRQQELVSKVSMA-SKGPTILVLMSGGPIDVAFAKND-PRIAAIIWAGYPGQAG 556
L LP QQ+L +S A ++VL++GGP+ V K R II A Y GQ+
Sbjct: 514 ETLGLPTPQQDLYDAISAAIPSSKLVVVLVNGGPVSVDRIKYGIARTPTIIEAFYGGQSA 573
Query: 557 GTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVY 616
GTA+A+ +FG +NP G LP T + ++P T+M +RP + +PGRT+RF+ PV++
Sbjct: 574 GTALAETIFGQNNPSGTLPYTVFFSNITAHVPFTDMHLRPDAATGFPGRTHRFFDAPVMW 633
Query: 617 PFGHGMSYTNF 627
PFGHG+SY+ F
Sbjct: 634 PFGHGLSYSTF 644
>gi|345519864|ref|ZP_08799275.1| beta-glucosidase [Bacteroides sp. 4_3_47FAA]
gi|254836262|gb|EET16571.1| beta-glucosidase [Bacteroides sp. 4_3_47FAA]
Length = 736
Score = 392 bits (1008), Expect = e-106, Method: Compositional matrix adjust.
Identities = 260/768 (33%), Positives = 384/768 (50%), Gaps = 90/768 (11%)
Query: 35 ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
A LPF LP+ RV DL+ RL+L+EKV L+ + A+PRLGI Y+WW+EALHGV
Sbjct: 19 AQVENLPFRNADLPLEVRVKDLVARLTLEEKVLLMQHHSPAIPRLGIPAYDWWNEALHGV 78
Query: 95 SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN-----GGTA- 148
+ T FPQ I A++F+ + +G + S E RA++N G T
Sbjct: 79 ART---------LEKVTVFPQAIGMAATFDTEALQKMGDITSTEGRALFNEDWKAGKTGT 129
Query: 149 ---GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCK 205
GLTYW+PN+NIFRDPRWGRGQET GEDP L+ K A+ VRGL+G D LK A K
Sbjct: 130 RYRGLTYWTPNINIFRDPRWGRGQETYGEDPYLTAKMGAAIVRGLEGEDPHYLKSVACAK 189
Query: 206 HFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTC 265
H+ + +N R F+A+ S D+ DT+ FR V + KV VMC+YN++NG P C
Sbjct: 190 HYAVHSGPEYN---RHSFDARPSVFDLWDTYMPAFRELVTKAKVHGVMCAYNRLNGQPCC 246
Query: 266 ADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEE---AAADAIRAGLDLDCG 322
+ +L +R +W +GY+ SDC ++ D F T E A +DA+ AG DL+CG
Sbjct: 247 GNDPLLVDILRNQWHFDGYVTSDCWALK---DFAEFHKTHPEHTIAMSDALLAGTDLECG 303
Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPD 381
L E V++GL SE DIN +L T+ ++GMFD P+ + PY +G + +
Sbjct: 304 NLYHLLAE-GVKKGLHSERDINVSLSRLFTILFKIGMFD--PAERVPYSSIGREVLECEA 360
Query: 382 HQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTP 441
H++ A A++ IVLL+N+ LPL + +++A+IGPN+D T + NY G TP
Sbjct: 361 HKQHAERMAKESIVLLENKNHILPLDASKIKSIALIGPNADNGQTQLANYFGTPSEIVTP 420
Query: 442 LQGIGRY----ARTIHQQGCKDV-ACADDQLFGAAIDASRQADATILVMGLDQSIE---- 492
+ R + + G V D F + Q+D + V G+ E
Sbjct: 421 YMSLKRRLGDKIKINYLPGVGIVDKLKDAPSFVQVAHKAAQSDVIVFVSGISADYEGEAG 480
Query: 493 ---------AEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRI 543
+ DR + LP Q EL+ K+ + P I+V MSG + +F
Sbjct: 481 DAGAAGYGGFASGDRTTMQLPLVQIELLKKLKKTGR-PLIIVNMSGSVM--SFEWESQNA 537
Query: 544 AAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYP 603
A++ A Y GQA G AI D+LFG NP G++P+T Y + +LP P ++
Sbjct: 538 DALLQAWYGGQAAGDAIVDVLFGHCNPAGRMPLTTYKSD--NDLP-------PFENYSML 588
Query: 604 GRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHA 663
GRTYR++KG YPFG+G+SYT F ++ ++ T +G +VT
Sbjct: 589 GRTYRYFKGEPRYPFGYGLSYTTFAYSDVQC--------------VDETHTGDTARVT-- 632
Query: 664 KCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG--HWAPHKQLVAFEKVHVPAGAQQRV 721
V V N G DG + ++ P P L F+++H+ G V
Sbjct: 633 ----------VTVSNTGDCDGDEVVQLYVVHPQDGRKQIPLCALKGFKRIHLKRGESTSV 682
Query: 722 GINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIK 769
+ + L++ + G G+ + +GG + + + ++L IK
Sbjct: 683 SFTL-TPEELALTETDGNLVEKNGQVTLFVGGGQPNYAAGVSSLLTIK 729
>gi|316980598|dbj|BAJ51947.1| putative beta-D-xylosidase [Glycyrrhiza uralensis]
Length = 285
Score = 392 bits (1008), Expect = e-106, Method: Compositional matrix adjust.
Identities = 180/278 (64%), Positives = 222/278 (79%), Gaps = 1/278 (0%)
Query: 485 MGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIA 544
MGLDQSIEAE DR GLLLPG QQELVS+V+ ++GP ILVLMSGGPIDV+FAKNDP+I+
Sbjct: 1 MGLDQSIEAEFRDRVGLLLPGHQQELVSRVARVARGPVILVLMSGGPIDVSFAKNDPKIS 60
Query: 545 AIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPG 604
AI+W GYPGQAGGTAIAD++FGT+NPGG+LPMTWYPQ Y+ +PMT M MRP+ + YPG
Sbjct: 61 AILWVGYPGQAGGTAIADVIFGTTNPGGRLPMTWYPQNYLAKVPMTNMDMRPNPATGYPG 120
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATIS-GKAIKVTHA 663
RTYRFYKGPVV+PFGHG+SYT F H++A AP V+VP N+T+S KA++V+HA
Sbjct: 121 RTYRFYKGPVVFPFGHGLSYTRFTHSLAIAPKQVSVPFATLQAFTNSTVSTSKAVRVSHA 180
Query: 664 KCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGI 723
C+ + +G VDVKN GS DG +TLLVFS PP G W+ KQLV+F K +VPAG++QRV +
Sbjct: 181 NCDAMEVGFHVDVKNEGSMDGTNTLLVFSKPPPGKWSATKQLVSFHKTYVPAGSKQRVKV 240
Query: 724 NIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
+HVCK+LSVVD G RRIP+GEH + IG KHS+S+
Sbjct: 241 GVHVCKHLSVVDEFGIRRIPMGEHELQIGDLKHSISVQ 278
>gi|302669556|ref|YP_003829516.1| beta-xylosidase [Butyrivibrio proteoclasticus B316]
gi|302394029|gb|ADL32934.1| beta-xylosidase Xyl3A [Butyrivibrio proteoclasticus B316]
Length = 709
Score = 391 bits (1004), Expect = e-106, Method: Compositional matrix adjust.
Identities = 256/726 (35%), Positives = 391/726 (53%), Gaps = 93/726 (12%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
+R +L+ +++++EK L A A+ RLGI Y WW+EALHGV+ G A
Sbjct: 8 KRAKELVAKMTVEEKASQLRYDAPAIDRLGIPAYNWWNEALHGVARAGT----------A 57
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRD 162
T FPQ I A++F+ L +G V+++EARA YN + GLT+W+PNVNIFRD
Sbjct: 58 TMFPQAIGLAAAFDEELMSEVGEVIAEEARAKYNEQSKREDRDIYKGLTFWAPNVNIFRD 117
Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFH 222
PRWGRG ET GEDP L+ + A +V+ +QG DG+ +K AA KHF + +R
Sbjct: 118 PRWGRGHETYGEDPFLTSRLAVPFVKAMQG-DGEYMKAAACAKHFAVHSGPEG---ERHF 173
Query: 223 FNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLN 282
F+AK SK+D+E+T+ F V E +V +VM +YN+ NG P CA+ ++ T+RG+W
Sbjct: 174 FDAKASKKDLEETYLPAFEALVKEAEVEAVMGAYNRTNGEPCCANKPLMVDTLRGKWGFQ 233
Query: 283 GYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEID 342
G+ VSDC ++ +++ TS+PEE+A A+ G DL+CG + V+ GL+ E
Sbjct: 234 GHFVSDCWAIKDFHENHKVTSSPEESAKLALEMGCDLNCGCTYQ-SIMNGVRAGLIDEKL 292
Query: 343 INNALVNTLTVQMRLGMFDG-EPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQG 401
I + T + LGMFD E PY + V +H +A AAR+ +VLLKN G
Sbjct: 293 ITESCERLFTTRFLLGMFDKTEFDEIPY-----EKVECKEHLAVAKRAARESVVLLKNDG 347
Query: 402 PSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGC 457
LPL+ +T+ V+GPN++ +++IGNY G + Y T L+GI G R ++ +GC
Sbjct: 348 -LLPLNKDSIKTIGVVGPNANSRLSLIGNYHGTSSRYITVLEGIQDKVGDDVRVLYSEGC 406
Query: 458 KDV------ACADDQL---FGAAIDASRQADATILVMGLDQSIEAE---------ALDRA 499
D+ AD L A + +D ++V+GLD+++E E + D+
Sbjct: 407 -DIFQNNISNLADPNLPDRLSEAQAVADHSDVVVVVVGLDENLEGEEGDAGNQFASGDKI 465
Query: 500 GLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTA 559
L LP Q++L++ V K PTI++ M+G ID++ A+++ A++ A YPG GG
Sbjct: 466 NLNLPLSQRQLLNAVLDCGK-PTIVIDMAGSAIDLSKAQDEAN--AVLQAFYPGARGGAD 522
Query: 560 IADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPF 618
+ADILFG +P GKLP+T+Y +LP + +M+ RTY+++ G +YPF
Sbjct: 523 VADILFGDVSPSGKLPVTFYKS--ADDLPDFKDYSMK--------NRTYKYFTGTPLYPF 572
Query: 619 GHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLG--VQVDV 676
G+G++Y + V D +SG I VT +L VQ+ +
Sbjct: 573 GYGLTYGDCYVKPDYDFNVKYADAD--------KVSGAEITVTVVNDGKLDTDEVVQLYI 624
Query: 677 KNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDR 736
K++ S F+T + LV F++VHVPAG + RV + + K + V+
Sbjct: 625 KDMDS--------YFAT-------TNPSLVGFKRVHVPAGGETRVTLTVSE-KAFTSVNE 668
Query: 737 SGTRRI 742
G R +
Sbjct: 669 EGERAV 674
>gi|402074909|gb|EJT70380.1| hypothetical protein GGTG_11406 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 793
Score = 390 bits (1003), Expect = e-105, Method: Compositional matrix adjust.
Identities = 254/705 (36%), Positives = 369/705 (52%), Gaps = 59/705 (8%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C SL +R L+ L++ EK+ L+S A R+G+ Y WWSEALHGV+ PGT+
Sbjct: 44 CDRSLSPSERAAALVAALNVTEKMANLVSNANGSARIGLPKYNWWSEALHGVA-YAPGTQ 102
Query: 103 FG---GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNI 159
F GDF +TSFP + A+SF+ +L E IG V+ E+RA NG +GL YW+PNVN
Sbjct: 103 FRRGPGDFNSSTSFPMPLLLAASFDDSLIEKIGDVIGTESRAFGNGRWSGLDYWTPNVNP 162
Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR-LKVAASCKHFTAYDLDNWNGV 218
F+DPRWGRG ETPGED + +YAAS ++GL+G ++ +V ++CKH+ A D ++WNG
Sbjct: 163 FKDPRWGRGSETPGEDILRIKRYAASMIKGLEGPHPEKERRVVSTCKHYAANDFEDWNGT 222
Query: 219 DRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGE 278
R F+A++S QD+ + + +PF+ C + +V S+MC+YN VNGVP+CA+ +L +R
Sbjct: 223 SRHDFDARISAQDLAEYYLMPFQQCARDSRVGSIMCAYNAVNGVPSCANSYLLDTVLRKH 282
Query: 279 WRL---NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQR 335
W N Y+ SDC++V + T E A AG D C A +
Sbjct: 283 WGWTGHNNYVTSDCEAVLDVSAGHKYARTNAEGTAMCFEAGTDTSCEYTPSSDIRGAYAQ 342
Query: 336 GLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIV 395
GLL E ++ AL+ +R+G FDG S+ + + DV P Q+L+L++A +GIV
Sbjct: 343 GLLREETMDRALLRLYEGLVRVGYFDGNSSA--FSDISWADVNAPAAQDLSLQSAVEGIV 400
Query: 396 LLKNQGP-SLPLSHIRHRT-----------VAVIGPNSDVTVTMIGNYAGIACGYTTPLQ 443
+LKN G LPL +A+IG +D + G Y+G A TP
Sbjct: 401 MLKNDGTLPLPLGAKCSSKSKKRSSSGGPKLAMIGFWADAPEKLRGGYSGTAAYLRTP-- 458
Query: 444 GIGRYARTIHQQGCKDVACADDQLFGAAIDASRQ-----------ADATILVMGLDQSIE 492
A Q G V L GAA A+ AD + GLD++
Sbjct: 459 -----AYAARQMGLDVVTAGGPVLQGAAAAAADNWTAPALAAAEGADYIVYFGGLDETAA 513
Query: 493 AEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYP 552
E DR + PG Q LV + +A+ G ++V+ G +D + + A++WA +P
Sbjct: 514 GENKDRWDVEWPGAQLALVKR--LAALGKPLVVVQMGDQLDGTPLLANAGVGAVLWASWP 571
Query: 553 GQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKG 612
GQ GG A+ +L G ++P G+LP+T YP Y +PMTEMA+RPS S PGRTYR+Y
Sbjct: 572 GQDGGPAVMRLLSGAASPAGRLPVTQYPANYTRLVPMTEMALRPSASGSRPGRTYRWYST 631
Query: 613 PVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTH-AKCNRLTLG 671
PV+ PFG G+ YTNF V P + A + + +A + H +C L
Sbjct: 632 PVL-PFGFGLHYTNFTPAVTVPPALAAA------SGVTTSSLLEACRDPHPERCALPPL- 683
Query: 672 VQVDVKNVGSKDGAHTLLVFSTPPAGHWAPH----KQLVAFEKVH 712
+V V N G + + L F + G + P K L A+ ++
Sbjct: 684 -RVAVANTGRRASDYVALAFVS---GDYGPRPRPIKTLAAYARLR 724
>gi|346225847|ref|ZP_08846989.1| beta-glucosidase [Anaerophaga thermohalophila DSM 12881]
gi|346227016|ref|ZP_08848158.1| beta-glucosidase [Anaerophaga thermohalophila DSM 12881]
Length = 718
Score = 390 bits (1003), Expect = e-105, Method: Compositional matrix adjust.
Identities = 250/744 (33%), Positives = 393/744 (52%), Gaps = 89/744 (11%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
F + + +R ++ +L+++EK+ L++ A AV RL I Y+WW+E LHGV+ G
Sbjct: 17 FRNPDISLDERAECIVKQLTVEEKINQLMNAAPAVDRLEIPEYDWWNECLHGVARAGR-- 74
Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYW 153
AT FPQ I A++++ TL +G +S EARA YN G GLT+W
Sbjct: 75 --------ATVFPQAIGMAATWDTTLVYRVGDAISTEARAKYNVFSKHGYRGQYKGLTFW 126
Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLD 213
+PNVNIFRDPRWGRGQET GEDP L+ + S+V+GLQG+ LKVAA KH+ +
Sbjct: 127 TPNVNIFRDPRWGRGQETYGEDPFLTSRIGVSFVKGLQGNHPKYLKVAALAKHYAVH--- 183
Query: 214 NWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
NG + R F+AKVS +D+ +T+ F V E V VM +YN+ NG P CA P ++
Sbjct: 184 --NGPEALRHEFDAKVSMKDLWETYLPAFEALVKEAGVEGVMGAYNRTNGDPCCAHPYLM 241
Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTES 331
+ +R +W +GY VSDC ++ +Y TPEEAAA A+ AG +L+CG +S
Sbjct: 242 QEVLREKWGFDGYYVSDCGAIMDFYTGHKIVDTPEEAAAMALNAGCNLNCGDTYASLLKS 301
Query: 332 AVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAAR 391
+++GL +E +I+ ++ ++RLG+F E + PY + + + +HQ+LALEAAR
Sbjct: 302 -LEKGLTTEEEIDRSVKQLFKTRLRLGLFAPE-GAVPYDTISTDVIRSKEHQKLALEAAR 359
Query: 392 QGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI-GRYA- 449
+ +VLLKN+ +LP++ + V V GP + ++ NY G++ TT L+GI G+ +
Sbjct: 360 KSVVLLKNEANTLPVAR-DVKKVYVTGPTATHVQALLANYYGVSEDMTTILEGIVGKVSP 418
Query: 450 --RTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEA---------LDR 498
++QG + A+ A+ AD T+ +G+ Q IE E DR
Sbjct: 419 QTSVQYRQGAL-LYEANRNTMDWFSGAAASADVTVACLGISQLIEGEEGEAIASEHRGDR 477
Query: 499 AGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIA----AIIWAGYPGQ 554
LP Q + + ++ ++K ++V+ SG I + P I A+++ YPG+
Sbjct: 478 ERTRLPQNQIDFLKRIRASAK-KLVVVITSGSAISL------PEIYDMADALLYVWYPGE 530
Query: 555 AGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPV 614
GG A+AD+LFG + P G+LP+T + +LP P ++ GRTYR+ +
Sbjct: 531 QGGKAVADVLFGDAVPSGRLPVTVVKS--VDDLP-------PYENYDMKGRTYRYMEVSP 581
Query: 615 VYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQV 674
+PFG G+SYT+F ++ N T+ KV + RL+
Sbjct: 582 QFPFGFGLSYTDFTYS-------------------NLTLESN--KVKSGESVRLSF---- 616
Query: 675 DVKNVGSKDGAHTLLVFSTP-PAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSV 733
D+ N G D + + T A P + L+ F++V + AG ++ + + +
Sbjct: 617 DLTNEGEYDADEVVQFYITDVEASVNVPKQSLIGFKRVGLAAGESTKIEFTV-TPDMMKI 675
Query: 734 VDRSGTRRIPLGEHNIHIGGTKHS 757
VD +G + + GE I+IGG+ +S
Sbjct: 676 VDNNGEKILESGEFKIYIGGSSYS 699
>gi|371776901|ref|ZP_09483223.1| beta-glucosidase [Anaerophaga sp. HS1]
Length = 720
Score = 390 bits (1002), Expect = e-105, Method: Compositional matrix adjust.
Identities = 255/733 (34%), Positives = 377/733 (51%), Gaps = 80/733 (10%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
F +L I R L+ L+L+EK+ LL V RL I Y WW+EALHGV+ G
Sbjct: 29 FRDEALDIETRAKALLSELTLKEKISLLGYNNPPVERLQIPAYNWWNEALHGVARAGE-- 86
Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYW 153
AT FPQ I A++F+ TL I +S EAR+ YN + G+T+W
Sbjct: 87 --------ATVFPQAIALAATFDTTLVYRIADAISTEARSKYNINRSKGFQNQYLGITFW 138
Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR-LKVAASCKHFTAYDL 212
+PN+NIFRDPRWGRGQET GEDP L+ ++V+GLQGS+ +R LK AA KHF +
Sbjct: 139 TPNINIFRDPRWGRGQETYGEDPFLTASMGKAFVKGLQGSEPERRLKTAAGAKHFAVHSG 198
Query: 213 DNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILK 272
DR HFNA V ++D+ +T+ F+ V G V ++MC+YN+VNG P C +L+
Sbjct: 199 PE---ADRHHFNAVVDEKDLRETYLPAFKALVENG-VTTIMCAYNRVNGEPCCTGKTLLQ 254
Query: 273 RTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESA 332
+R EW G +V+DC ++ + T E AA A++AG++LDC L + A
Sbjct: 255 DILRDEWGFKGQVVTDCWALDDIWLRHKTIPTRVEVAAAAVKAGVNLDCANILQEDVQDA 314
Query: 333 VQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQ 392
+++ LL+ +++AL+ TL Q++LG +D +PS PY H G V H LA EAA +
Sbjct: 315 IEKRLLTLEQVDSALLPTLQTQLKLGFYD-DPSHSPYRHYGIDSVNNSYHISLAKEAAEK 373
Query: 393 GIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRY 448
+VLLKN G LPL ++ V+G N+ + GNY G++ T ++G+ G
Sbjct: 374 SMVLLKNDG-ILPLKKDTISSIMVVGENAASISALTGNYHGLSGNMVTFVEGLVKAGGPG 432
Query: 449 ARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRA 499
+ GC + AD FG I A+ D TI V+GL +E E D+
Sbjct: 433 MSVQYDYGC---SFADTSHFG-GIWAAGFTDVTIAVIGLSPLLEGEHGDAFLSNWGGDKK 488
Query: 500 GLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTA 559
L +P + + K+ + P I V+ G +D++ +P AII+A YPG+ GGTA
Sbjct: 489 DLRMPRSHEIYLKKLRESHNHPVIAVVTGGSALDISAI--EPYADAIIYAWYPGEQGGTA 546
Query: 560 IADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFG 619
+AD++FG +P G+LP+T+Y I +LP P RTYR+++G V+YPFG
Sbjct: 547 LADLIFGEVSPSGRLPITFYKD--IKDLP-------PYHDYNMTNRTYRYFQGDVLYPFG 597
Query: 620 HGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNV 679
+G+SYT+F + + P+ D + V + V N
Sbjct: 598 YGLSYTSFHYEWLSKPSTKVSEDD-------------------------IISVNIAVTNT 632
Query: 680 GSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGT 739
G+ D + V+ P P ++L F ++H+ AG Q I I V K L D
Sbjct: 633 GTMDADEVIQVYIVYPDIERMPLRELKGFSRIHIKAGQTQNTDIQIPV-KNLKKWDSKNN 691
Query: 740 R-RIPLGEHNIHI 751
R ++ G++ I +
Sbjct: 692 RWKLYKGKYKIQV 704
>gi|171695518|ref|XP_001912683.1| hypothetical protein [Podospora anserina S mat+]
gi|170948001|emb|CAP60165.1| unnamed protein product [Podospora anserina S mat+]
Length = 805
Score = 390 bits (1002), Expect = e-105, Method: Compositional matrix adjust.
Identities = 267/767 (34%), Positives = 384/767 (50%), Gaps = 89/767 (11%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLI--------------------SGAAAVPRLGIK 82
C + P R L+ L++ EK+ L+ S + R+G+
Sbjct: 36 CDTTASPPARAAALVQALNITEKLVNLVEYVKSREAPLGISIQLITPHSMSLGAERIGLP 95
Query: 83 GYEWWSEALHGVSNVGPGTKF---GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEA 139
Y WW+EALHGV+ PG F G +F ATSF IT A++F+ L + +S EA
Sbjct: 96 AYAWWNEALHGVA-ASPGVSFNQAGQEFSHATSFANTITLAAAFDNDLVYEVADTISTEA 154
Query: 140 RAMYNGGTAGLTYWSPNVNIFRDPRWGRG------------------QETPGEDPVLSGK 181
RA N AGL YW+PN+N ++DPRWGRG Q+TPGEDPV
Sbjct: 155 RAFSNAELAGLDYWTPNINPYKDPRWGRGHEVCYLSLLFRAVQLLRTQKTPGEDPVHIKG 214
Query: 182 YAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFR 241
Y + + GL+G D R KV A+CKHF AYDL+ W G R+ FNA V+ QD+ + + PF+
Sbjct: 215 YVQALLEGLEGRDKIR-KVIATCKHFAAYDLERWQGALRYRFNAVVTSQDLSEYYLQPFQ 273
Query: 242 MCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRL---NGYIVSDCDSVGVYYDT 298
C + KV S MCSYN +NG P CA ++ +R W N YI SDC+++ +
Sbjct: 274 QCARDSKVGSFMCSYNALNGTPACASTYLMDDILRKHWNWTEHNNYITSDCNAIQDFLPN 333
Query: 299 QH-FTSTPEEAAADAIRAGLDLDCG-PFLGLHTE--SAVQRGLLSEIDINNALVNTLTVQ 354
H F+ TP +AAADA AG D C P T+ A + LLSE I+ AL
Sbjct: 334 FHNFSQTPAQAAADAYNAGTDTVCEVPGYPPLTDVIGAYNQSLLSEEIIDRALRRLYEGL 393
Query: 355 MRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTV 414
+R G D S PY + V TP Q LAL++A GIVLLKN G LPL + ++T+
Sbjct: 394 IRAGYLDSA-SPHPYTKISWSQVNTPKAQALALQSATDGIVLLKNNG-LLPLD-LTNKTI 450
Query: 415 AVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQG---CKDVACADDQLFGAA 471
A+IG ++ T M+G Y+GI Y P+ + T H + +D A
Sbjct: 451 ALIGHWANATRQMLGGYSGIPPYYANPIYAATQLNVTFHHAPGPVNQSSPSTNDTWTSPA 510
Query: 472 IDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGP 531
+ A+ ++D + + G D SI AE DR + P Q L++ S+A G +V G
Sbjct: 511 LSAASKSDIILYLGGTDLSIAAEDRDRDSIAWPSAQLSLLT--SLAQMGKPTIVARLGDQ 568
Query: 532 IDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTE 591
+D ++P I++I+W GYPGQ+GGTA+ +I+ G S+P +LP+T YP+ Y + +P+T
Sbjct: 569 VDDTPLLSNPNISSILWVGYPGQSGGTALLNIITGVSSPAARLPVTVYPETYTSLIPLTA 628
Query: 592 MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINA 651
M++RP+ ++ PGRTYR+Y PV+ PFGHG+ YT F + G +
Sbjct: 629 MSLRPTSAR--PGRTYRWYPSPVL-PFGHGLHYTTFT---------------AKFGVFES 670
Query: 652 TISGKAIKVTHAKCNRLTL----GVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPH----K 703
A V++ L L V V V N G + LVF G + P K
Sbjct: 671 LTINIAELVSNCNERYLDLCRFPQVSVWVSNTGELKSDYVALVFVR---GEYGPEPYPIK 727
Query: 704 QLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNI 749
LV ++++ + G + + V L+ VD G R + G++
Sbjct: 728 TLVGYKRIRDIEPGTTGAAPVGV-VVGDLARVDLGGNRVLFPGKYEF 773
>gi|295134875|ref|YP_003585551.1| beta-glucosidase [Zunongwangia profunda SM-A87]
gi|294982890|gb|ADF53355.1| beta-glucosidase [Zunongwangia profunda SM-A87]
Length = 735
Score = 388 bits (997), Expect = e-105, Method: Compositional matrix adjust.
Identities = 245/739 (33%), Positives = 391/739 (52%), Gaps = 85/739 (11%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
F L + +R++DLI RL+L+EK + +++ + A+ RLGI Y+WW+EALHG+ G
Sbjct: 34 FYDTDLSMDERIDDLISRLTLEEKAQQMLNASPAIERLGIPAYDWWNEALHGLGRSGV-- 91
Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT--------AGLTYW 153
AT FPQ I ++F+ L + +SDEARA +N GLT+W
Sbjct: 92 --------ATVFPQAIGMGATFDDDLILKVSTAISDEARANFNNAVKHGYHRKYGGLTFW 143
Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLD 213
+PNVNIFRDPRWGRGQET GEDP L+ K ++V+GLQG + LK AA+ KH+ +
Sbjct: 144 TPNVNIFRDPRWGRGQETYGEDPYLTSKLGEAFVKGLQGDNDKYLKTAAAAKHYAVH--- 200
Query: 214 NWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
+G + R FNA VS++D+ +T+ F+ V + V ++MC+YN NG P CA+ ++
Sbjct: 201 --SGPEKLRHEFNADVSEKDLWETYLPAFKTLV-DANVETIMCAYNSTNGEPCCANNRLI 257
Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTES 331
+R +W NG++VSDC ++ + +PE AAA A+ G++L+CG +
Sbjct: 258 NDILRDKWGFNGHVVSDCWALQDFVSGHDIVESPEAAAALAVEVGIELNCGDTYNFLAK- 316
Query: 332 AVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAAR 391
AV+ GL+SE ++ L L + +LG+FD E S PY +G + + + +H+ LA E AR
Sbjct: 317 AVEDGLVSEELVDKRLHKLLETRFKLGLFDPE-ESNPYNKIGVEVMNSDEHRALARETAR 375
Query: 392 QGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGR---- 447
+ IVLLKN G LPL + + + GPN+ ++GNY G+ T L+GI +
Sbjct: 376 KSIVLLKNDG-VLPLKNNLSKYF-ITGPNATNIEVLLGNYHGVNPDMVTVLEGIAKAIKP 433
Query: 448 YARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEA---------LDR 498
++ ++ G + ++ A+ +A +DAT +VMG+ +E E DR
Sbjct: 434 ESQLQYRMGTRLNLPNENPQDWASPNAG-NSDATFVVMGISGLLEGEEGESIASPTFGDR 492
Query: 499 AGLLLPGRQQELVSKVSMASKG-PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGG 557
LP Q + + KVS A++ P + ++ G P+++ ++W YPG+ GG
Sbjct: 493 MDYNLPQNQIDYLQKVSEAAEDRPVVAIVTGGSPMNLTEVHKLADAVLLVW--YPGEEGG 550
Query: 558 TAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYP 617
A+ADI+FG ++P G+LP+T+ PMT + + GRTY++ +YP
Sbjct: 551 NAVADIIFGKNSPSGRLPITF---------PMTIEDLPAYEDYTMEGRTYKYMDVVPMYP 601
Query: 618 FGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG--KAIKVTHAKCNRLTLGVQVD 675
FG+G+SYT+F ++ + I+ T +G +A +V VQV
Sbjct: 602 FGYGLSYTDFEYSEIKLSKDKIKKKESVEARISVTNTGDFEADEV-----------VQVY 650
Query: 676 VKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD 735
+K+V A P+ +LVAF+ +H+ G + + I + LS +D
Sbjct: 651 LKDV---------------KASSRVPNFELVAFKNIHLKRGESKELTFEI-TPEMLSFID 694
Query: 736 RSGTRRIPLGEHNIHIGGT 754
+G ++ G I+IGG+
Sbjct: 695 DNGKEKLEKGAFEIYIGGS 713
>gi|280977785|gb|ACZ98610.1| glucosidase [Cellulosilyticum ruminicola]
Length = 711
Score = 388 bits (997), Expect = e-105, Method: Compositional matrix adjust.
Identities = 254/738 (34%), Positives = 381/738 (51%), Gaps = 87/738 (11%)
Query: 49 IPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFP 108
+L+ ++ L EK L A A+ RLGI Y WW+EALHGV+ G
Sbjct: 4 FKNEAKELVRQMDLLEKASQLRYDAPAIKRLGIPTYNWWNEALHGVARAGV--------- 54
Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT--------AGLTYWSPNVNIF 160
AT FPQ I A+ F+ I +++ E RA YN + G+T+W+PN+NIF
Sbjct: 55 -ATVFPQAIGLAAMFDEEKLGEIADIIAIEGRAKYNQFSQKEDRDIYKGMTFWAPNINIF 113
Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDG-DRLKVAASCKHFTAYDLDNWNGVD 219
RDPRWGRG ET GEDP L+ + ++++GLQG + D LK AA KHF + D
Sbjct: 114 RDPRWGRGHETYGEDPYLTARLGVAFIKGLQGDENEDYLKAAACAKHFAVHSGPE---ED 170
Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
R HF+A VSK+D+ +T+ F V E V VM +YN+VNG P C +L ++ +W
Sbjct: 171 RHHFDAIVSKKDLYETYLPAFEAAVKEANVIGVMGAYNRVNGEPACGSKTLLVDILKKDW 230
Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLS 339
+GYIVSDC ++ ++ T T E+AA AI G +L+CG LH A Q GL+
Sbjct: 231 GFDGYIVSDCWAIRDFHTEHMVTHTAAESAALAINNGCELNCGNTY-LHMLEAHQEGLVK 289
Query: 340 EIDINNALVNTLTVQMRLGMFDG--EPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLL 397
E I A + ++M+LG+FD + + PY + C H+E+ALEA+R+ +V+L
Sbjct: 290 EEIITEAAEKLMRIRMQLGLFDKNCKYNEIPYA----VNDCKV-HREVALEASRRSMVML 344
Query: 398 KNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY----ARTIH 453
KN G LPL+ + +++ +IGP ++ + GNY G A YTT ++GI Y R +
Sbjct: 345 KNDGI-LPLNKDKLKSIGIIGPTANNRTVLEGNYNGTASRYTTFVEGIQDYVGDDVRVYY 403
Query: 454 QQGC-------KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALD 497
+GC ++A +D+ A I + Q+D +L +GLD +IE E D
Sbjct: 404 SEGCHLFANGMSNLAWENDREAEALI-VAEQSDVVVLCLGLDSTIEGEQGDTGNAFAGGD 462
Query: 498 RAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGG 557
+ L L GRQQ+L+ KV K P ILVL +G + + +A D AI YPG GG
Sbjct: 463 KLSLNLIGRQQQLLEKVVAVGK-PVILVLSTGSAMAINYA--DEHCNAIFQTWYPGAQGG 519
Query: 558 TAIADILFGTSNPGGKLPMTWYPQEYITNLPMTE-MAMRPSQSKRYPGRTYRFYKGPVVY 616
A+A +LFG +P GKLP+T+Y LP E +M+ RTYR+ +Y
Sbjct: 520 KALAQLLFGEYSPSGKLPVTFYKT--TEELPAFEDYSMKD--------RTYRYMPNEALY 569
Query: 617 PFGHGMSYTNF-VHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVD 675
PFG+G+SY + V +V LDG G ++T+ + V+V+
Sbjct: 570 PFGYGLSYADIKVQSVK--------VLDGAKGE----------EITNFSAGQTKYKVKVE 611
Query: 676 VKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVV 734
++N + D + ++ +A P+ L +F+ V + AG + V +N+ K +V+
Sbjct: 612 LENKSNVDSYDVVQIYIKDMESQYAVPNFSLCSFKSVFLKAGESKEVTLNVGE-KAFTVI 670
Query: 735 DRSGTRRIPLGEHNIHIG 752
+ G R + + + IG
Sbjct: 671 NEEGKRIVDSKKFKLFIG 688
>gi|288924872|ref|ZP_06418809.1| beta-glucosidase [Prevotella buccae D17]
gi|288338659|gb|EFC77008.1| beta-glucosidase [Prevotella buccae D17]
Length = 721
Score = 387 bits (994), Expect = e-104, Method: Compositional matrix adjust.
Identities = 256/735 (34%), Positives = 384/735 (52%), Gaps = 94/735 (12%)
Query: 49 IPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFP 108
+ + ++I R+++ EK+ L++ + A+ LGIK Y+WWSE LHGV G
Sbjct: 30 MSRHAKEIIARMTVSEKISQLMNESPAIEHLGIKPYDWWSEGLHGVGRDGR--------- 80
Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYWSPNVNIF 160
AT FPQ I ++F+ L IG V+ E RA +N AGLT+WSPNVNIF
Sbjct: 81 -ATVFPQPIALGATFDEALVREIGDAVATEGRAKFNVARKLKNYSRNAGLTFWSPNVNIF 139
Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDR 220
RDPRWGRG ET GEDP+LSG +YVRGLQG D LK A KH+ + +G +
Sbjct: 140 RDPRWGRGMETYGEDPLLSGMLGTAYVRGLQGDDAFYLKTGACAKHYAVH-----SGPEG 194
Query: 221 FHFNAKV--SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGE 278
A + S++D+ +T+ F+M V +G+V +VM +YN+V G P +L +R
Sbjct: 195 TRHEADIHPSRRDLFETYLPQFKMLVQQGRVEAVMSAYNRVYGEPCGGSKYLLTDILRKS 254
Query: 279 WRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQ---- 334
W NG+IVSDCD++ +Y + TPEEA A AI+AGL+++CG HT A+Q
Sbjct: 255 WGFNGHIVSDCDAINDFYGGHRYVKTPEEACAAAIKAGLNVECG-----HTFKAMQGALD 309
Query: 335 RGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAARQG 393
+GLL+E D++ AL + +++LG+ EP S PY ++C+P H LAL AA +
Sbjct: 310 QGLLAEADLDRALFPLVMTRLKLGIL--EPDSACPYNSYDESEICSPAHTALALRAADEA 367
Query: 394 IVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI-GRYARTI 452
+VLLKN G LPL RT+ V GP + ++GNY G++ Y+T LQGI R +
Sbjct: 368 MVLLKNNG-ILPLDK-NIRTLFVAGPGASDAFYLMGNYFGLSNRYSTYLQGIVSRVSSGT 425
Query: 453 HQQGCKDVACADDQL--FGAAIDASRQADATILVMGLDQSIEAEA---------LDRAGL 501
++L A++ + A+ I+VMG + ++E E DR G+
Sbjct: 426 SVNFRPAFMQITEELNDMNWAVNEACAAEVAIVVMGNNGNMEGEEGEAIASASRGDRVGI 485
Query: 502 LLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIA-AIIWAGYPGQAGGTAI 560
LP Q + +V G ++VL G PID+ + ++A A++ A YPGQ GG A+
Sbjct: 486 GLPASQMNYLRRVKARKGGRIVVVLTGGSPIDL---REISKLADAVVMAWYPGQEGGEAL 542
Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFG 619
D+LFG N G+LP+T +P + + +LP + +M GRTY++ G V+YPFG
Sbjct: 543 GDLLFGDKNFSGRLPIT-FPAD-VDSLPAFDDYSMN--------GRTYKYMSGNVMYPFG 592
Query: 620 HGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNV 679
+G+SY +T +A + G+ K L V+V + N
Sbjct: 593 YGLSYGRVTYT-------------------DARVVGRIKKGE-------PLAVEVVLTNN 626
Query: 680 GSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSG 738
G + + +TP AG +P LV F +V +P + + I V + L + G
Sbjct: 627 GDRTIDEVAQAYIATPTAGKGSPMASLVGFRRVSIPPKSSVKAVFKI-VPERLMTIQSDG 685
Query: 739 TRRIPLGEHNIHIGG 753
+ ++ G + + IGG
Sbjct: 686 SSKLLKGNYTLTIGG 700
>gi|373954937|ref|ZP_09614897.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
gi|373891537|gb|EHQ27434.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
Length = 723
Score = 387 bits (994), Expect = e-104, Method: Compositional matrix adjust.
Identities = 248/741 (33%), Positives = 386/741 (52%), Gaps = 82/741 (11%)
Query: 48 PIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDF 107
P RV DLI +L+L+EKV ++ + +VPRL + Y WW+EALHGV+ G
Sbjct: 30 PTDVRVRDLISKLTLEEKVHQMMDVSPSVPRLNLPKYNWWNEALHGVARSGV-------- 81
Query: 108 PGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNI 159
AT FPQ I ++F+ L + +SDEARAMYN GLT+W+PN+NI
Sbjct: 82 --ATIFPQAIALGATFDQDLAKRESTAISDEARAMYNAAMVNGYNEKYGGLTFWTPNINI 139
Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD 219
FRDPRWGRGQET GEDP L+ + ++++GLQG D + LKVAA KHF + +G +
Sbjct: 140 FRDPRWGRGQETYGEDPFLTSQIGVAFIQGLQGDDPEHLKVAACAKHFAVH-----SGPE 194
Query: 220 RFH--FNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
R FNA S +D+ +T+ F+ V +V +VMC+YN+ N C +L + +R
Sbjct: 195 RLRHSFNAIASPKDLRETYLPAFKALV-NARVEAVMCAYNRTNSEVCCGSNLLLDQILRD 253
Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGL 337
EW G++VSDC ++ +Y EA A A++ G+DL+CG E AV+RGL
Sbjct: 254 EWHFTGHVVSDCGAIVDFYMGHKVVPGQPEAVALAVKHGVDLNCGDEYPALIE-AVKRGL 312
Query: 338 LSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLL 397
++E +I+ AL L + +LG+FD + +S PY ++ + + DH+ LA E A + IVLL
Sbjct: 313 ITEKEIDKALATLLKTRFKLGLFDPKQNS-PYNNIPVSVINSTDHRALAKEVALKSIVLL 371
Query: 398 KNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGC 457
KN+ LPL + + + GPN+ ++GNY G+ +T L+GI + Q
Sbjct: 372 KNE-KCLPLKNNLSKYY-ITGPNAASVDALMGNYYGVNPHMSTILEGIAGAIQPGSQMQY 429
Query: 458 KDVACAD---DQLFGAAIDASRQADATILVM---GLDQSIEAEAL------DRAGLLLPG 505
K D + ++ +D T +VM GL + E EA+ DR LP
Sbjct: 430 KPGILLDRDNNNPIDWTTGDAKASDVTFVVMGITGLLEGEEGEAIASPNYGDRLDYNLPK 489
Query: 506 RQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILF 565
Q + + K+ +K + ++ G P++++ A++ A YPG+ GG A+ADILF
Sbjct: 490 NQIDFLRKIRKGNKNKVVAIITGGSPMNLSEVHE--LADAVLLAWYPGEEGGNAVADILF 547
Query: 566 GTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYT 625
G +P G+LP+T +P+ + P + +M+ GRTYR+ +Y FG+G+SY+
Sbjct: 548 GKVSPSGRLPVT-FPKSFAQLPPYEDYSMK--------GRTYRYMTAEPMYTFGYGLSYS 598
Query: 626 NFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGA 685
+ ++ + T+S K IK +T+ + V N G +G
Sbjct: 599 TYTYS-------------------SLTLSEKQIK------KNMTIIAETMVTNTGKMEGE 633
Query: 686 HTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLG 745
+ ++ T P P L F++V++ AG ++V I + VD +G+ + G
Sbjct: 634 EVVQLYITVPQTEKNPQYSLKGFKRVNLKAGESRKVQFQI-TPDLMKSVDANGSEVLLSG 692
Query: 746 EHNIHIGG---TKHSVSLHAA 763
+ + IGG +K S+SL AA
Sbjct: 693 SYVVRIGGASPSKRSLSLGAA 713
>gi|348684872|gb|EGZ24687.1| family 3 glycoside hydrolase [Phytophthora sojae]
Length = 805
Score = 387 bits (994), Expect = e-104, Method: Compositional matrix adjust.
Identities = 246/656 (37%), Positives = 356/656 (54%), Gaps = 53/656 (8%)
Query: 18 SSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAA-- 75
+S+G AA P AC D + PFC SL +RV DL+ RL L EKV LL + A+
Sbjct: 14 ASAGHAASIPRAC--SDTEHQKFPFCDASLSTSERVEDLLRRLPLDEKVTLLTARASPKG 71
Query: 76 -VPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRV 134
+ +G+ Y W + +HGV + GT ATSFP + + F+ + +V
Sbjct: 72 NMSSIGLPEYNWGANCVHGVQSTC-GTN------CATSFPNPVNLGAIFDPQAVFDMAQV 124
Query: 135 VSDEARAMYNGGT---------AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAAS 185
+ E RA++ G GL WSPN+NI RDPRWGR ETP EDP+++ KY +
Sbjct: 125 IGWELRALWLEGARENYAAGPHLGLDCWSPNININRDPRWGRNMETPSEDPLVNSKYGVA 184
Query: 186 YVRGLQ-GSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCV 244
Y RGLQ G D L+ + KH+ AY ++++G+DR FNA+VS+ D DT+ F V
Sbjct: 185 YTRGLQEGKDKRFLQAVVTLKHYAAYSYEHYDGIDRMAFNAQVSRYDFADTYLPAFHASV 244
Query: 245 MEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTST 304
+EGK VMCSYN VNG+P CA+ + + +R +GYI SD ++ Y +H+T +
Sbjct: 245 VEGKAKGVMCSYNSVNGMPMCANEQLNTKLLREALGFDGYITSDSGAIEGIYRQRHYTKS 304
Query: 305 PEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP 364
EA AI +G D++ G V G L E +++A+ TL ++ LG+FD
Sbjct: 305 LCEAGRLAIMSGTDVNSGSVYKKCLADLVTSGQLPEKAVDDAMRRTLKLRFELGLFD-PI 363
Query: 365 SSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVT 424
QPY H+ P +V + ++L+LE R+ IVLL+N G LPL + + +AVIGP++
Sbjct: 364 DDQPYWHVAPSEVGKTESKQLSLELTRKSIVLLQNHGNVLPLR--KGKKLAVIGPHAKAK 421
Query: 425 VTMIGNYAGIACG--------YTTPLQGI---GRYARTIHQQGCKDVACADDQLFGAAID 473
++GNY G C TPL+ I + T++ +G + F AA
Sbjct: 422 RALLGNYLGQMCHGDYLEVGCVQTPLEAITAANGASNTVYAKG-SGINDTSTADFDAAEA 480
Query: 474 ASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPI- 532
A+R ADA +L +G+D SIE EA DR + +P Q +L+ +V A K PT++VL +GG +
Sbjct: 481 AARGADAVVLFLGIDTSIEREAWDRENIDMPNIQMQLLKRVRRAGK-PTVVVLFNGGVVG 539
Query: 533 -DVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTE 591
+ D + A YPG G A++DILFG + P GKLP+T YP YI ++ M
Sbjct: 540 AEELILHTD----GVAEAFYPGFFGAQAVSDILFGDAIPSGKLPVTMYPSNYINSVDMKS 595
Query: 592 MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTV----ANAPTVVAVPLD 643
M+M +YPGR+YR+YK V+PFG G+SYT F + + P V+ LD
Sbjct: 596 MSM-----TKYPGRSYRYYKEVPVFPFGWGLSYTKFTLALDGEMPDDPIVITRDLD 646
>gi|367032987|ref|XP_003665776.1| glycoside hydrolase family 3 protein [Myceliophthora thermophila
ATCC 42464]
gi|347013048|gb|AEO60531.1| glycoside hydrolase family 3 protein [Myceliophthora thermophila
ATCC 42464]
Length = 835
Score = 387 bits (993), Expect = e-104, Method: Compositional matrix adjust.
Identities = 238/615 (38%), Positives = 336/615 (54%), Gaps = 38/615 (6%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C +LP +R L+ L+ +EK++ L+S A PR+G+ Y WWSEALHGV++ PGT+
Sbjct: 29 CDRTLPEAERAAALVAALTDEEKLQNLVSKAPGAPRIGLPAYNWWSEALHGVAHA-PGTQ 87
Query: 103 F---GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNI 159
F GDF +TSFP + A++F+ L EA+G V+ EARA N G +GL YW+PNVN
Sbjct: 88 FRDGPGDFNSSTSFPMPLLMAAAFDDELIEAVGDVIGTEARAFGNAGWSGLDYWTPNVNP 147
Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQ-----------GSDGDRLKVAASCKHFT 208
FRDPRWGRG ETPGED V +YAAS +RGL+ GS G+ +V ++CKH+
Sbjct: 148 FRDPRWGRGSETPGEDVVRLKRYAASMIRGLEGRSSSSSSCSFGSGGEPPRVISTCKHYA 207
Query: 209 AYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADP 268
D ++WNG R F+A +S QD+ + + PF+ C + +V SVMC+YN VNGVP+CA+
Sbjct: 208 GNDFEDWNGTTRHDFDAVISAQDLAEYYLAPFQQCARDSRVGSVMCAYNAVNGVPSCANS 267
Query: 269 NILKRTIRGEWRL---NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFL 325
++ +RG W + Y+ SDC++V H+ T E AG+D C
Sbjct: 268 YLMNTILRGHWNWTEHDNYVTSDCEAVLDVSAHHHYADTNAEGTGLCFEAGMDTSCEYEG 327
Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
A G L+ ++ AL +R+G FDG S P+ LG DV P+ QEL
Sbjct: 328 SSDIPGASAGGFLTWPAVDRALTRLYRSLVRVGYFDGPES--PHASLGWADVNRPEAQEL 385
Query: 386 ALEAARQGIVLLKNQGPSLPL---------SHIRHRTVAVIGPNSDVTVTMIGNYAGIAC 436
AL AA +GIVLLKN +LPL + R VA+IG +D + G Y+G
Sbjct: 386 ALRAAVEGIVLLKNDNDTLPLPLPDDVVVTADGGRRRVAMIGFWADAPDKLFGGYSGAPP 445
Query: 437 GYTTPLQGIGRYARTIHQQGCK----DVACADDQLFGAAIDASRQADATILVMGLDQSIE 492
+P + + G D +D A++A+ AD + GLD S
Sbjct: 446 FARSPASAARQLGWNVTVAGGPVLEGDSDEEEDTWTAPAVEAAADADYIVYFGGLDTSAA 505
Query: 493 AEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYP 552
E DR + P Q L+S+++ K P ++V M D + D + A++WA +P
Sbjct: 506 GETKDRMTIGWPAAQLALISELARLGK-PVVVVQMGDQLDDTPLFELD-GVGAVLWANWP 563
Query: 553 GQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKG 612
GQ GGTA+ +L G +P G+LP+T YP Y +P+T+M +RPS + PGRTYR+Y
Sbjct: 564 GQDGGTAVVRLLSGAESPAGRLPVTQYPANYTDAVPLTDMTLRPSATN--PGRTYRWYPT 621
Query: 613 PVVYPFGHGMSYTNF 627
P V PFG G+ YT F
Sbjct: 622 P-VRPFGFGLHYTTF 635
>gi|169611757|ref|XP_001799296.1| hypothetical protein SNOG_08993 [Phaeosphaeria nodorum SN15]
gi|160702362|gb|EAT83185.2| hypothetical protein SNOG_08993 [Phaeosphaeria nodorum SN15]
Length = 755
Score = 387 bits (993), Expect = e-104, Method: Compositional matrix adjust.
Identities = 260/756 (34%), Positives = 384/756 (50%), Gaps = 49/756 (6%)
Query: 8 FFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVK 67
F L++L AS + A D + + C V+ +R L+ + EK+
Sbjct: 3 FLSALVVLGASLANAA----IGPDCANGPLKDNKICDVTAAPAERAAALVEAMQTNEKLD 58
Query: 68 LLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATL 127
L+ G V RLG+ Y WW EALHGV+ PG F G + ATSFP + +++F+ L
Sbjct: 59 NLMRG---VTRLGLPKYNWWGEALHGVAG-APGINFTGAYKTATSFPMPLLMSAAFDDDL 114
Query: 128 WEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYV 187
I ++ +EARA NGG A + +W+P++N FRDPRWGRG ETPGED V Y +
Sbjct: 115 IFKIANIIGNEARAFGNGGVAPVDFWTPDINPFRDPRWGRGSETPGEDIVRIKGYTKHLL 174
Query: 188 RGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEG 247
GL+G R K+ A+CKH+ YD++ W G+DR FNAK++ QD+ + + PF+ C +
Sbjct: 175 AGLEGDKPQR-KIIATCKHYVGYDMEAWGGIDRHSFNAKINMQDLAEYYMPPFQQCARDS 233
Query: 248 KVASVMCSYNQVNGVPTCADPNILKRTIRGEWRL---NGYIVSDCDSVGVYYDTQHFTST 304
KV S MCSYN VNGVPTCAD +L+ +R W N YI SDC++V + T
Sbjct: 234 KVGSFMCSYNAVNGVPTCADTYVLQTILRDHWNWTESNNYITSDCEAVKDISLKHKYAKT 293
Query: 305 PEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP 364
E A AG+D C A + LS I+ AL +R G FDG
Sbjct: 294 NAEGTGLAFTAGMDNSCEYTGSSDIPGAFNQSYLSIPTIDRALKRQYEGLVRAGYFDGAA 353
Query: 365 SSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVT 424
++ Y +LG KD+ TP+ Q+L+L+ A +G+VLLKN +LPLS VA++G ++ T
Sbjct: 354 AT--YANLGVKDINTPEAQQLSLQVASEGLVLLKNDD-TLPLSLTNGSKVAMLGFWANDT 410
Query: 425 VTMIGNYAGIACGYTTPL---QGIGR----YARTIHQQGCKDVACADDQLFGAAIDASRQ 477
+ G Y+G A +P+ Q +G + I QQ + D A+ A+ +
Sbjct: 411 SKLSGIYSGPAPYLRSPVWAGQKLGLDMAIASGPILQQSN---SSTRDNWTTNALAAAEK 467
Query: 478 ADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFA 537
+D + GLD S AE DR + P Q +L+ K +A+ G ++VL+ G +D +
Sbjct: 468 SDYILYFGGLDPSAAAEGFDRNSIAWPTAQVDLIKK--LAAIGKPLVVLVLGDLMDNSPL 525
Query: 538 KNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPS 597
+ ++IWA +PGQ GG+A+ ++ G G+LP+T YP Y T L M +M MRPS
Sbjct: 526 LELDGVNSVIWANWPGQDGGSAVMQVVTGAVAVAGRLPITQYPANY-TELSMLDMNMRPS 584
Query: 598 QSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKA 657
S PGRTYR++ G V PFG G+ YT F A + +I IS
Sbjct: 585 SSS--PGRTYRWFNG-AVQPFGTGLHYTTFDAKFA------------ANSTIEYDISNIT 629
Query: 658 IKVTHAKCNRLTL-GVQVDVKNVGSKDGAHTLLVF---STPPAGHWAPHKQLVAFEKVHV 713
+ T+ + ++ + V V N G++ L F PA + P K L+++ +V
Sbjct: 630 KECTNQYPDTCSVPSIPVAVTNSGNRTSDFIALAFIKGENGPAPY--PLKTLISYTRVRD 687
Query: 714 PAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNI 749
G Q + L+ VD+ G + GE+ +
Sbjct: 688 VKGGQTKSAEMQLTLGNLARVDQMGNTVLYPGEYTV 723
>gi|402308386|ref|ZP_10827395.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
sp. MSX73]
gi|400375830|gb|EJP28725.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
sp. MSX73]
Length = 721
Score = 386 bits (992), Expect = e-104, Method: Compositional matrix adjust.
Identities = 256/734 (34%), Positives = 381/734 (51%), Gaps = 92/734 (12%)
Query: 49 IPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFP 108
+ + ++I R+++ EK+ L++ + A+ LGIK Y+WWSE LHGV G
Sbjct: 30 MSRHAKEIIARMTVSEKISQLMNESPAIEHLGIKPYDWWSEGLHGVGRDGR--------- 80
Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYWSPNVNIF 160
AT FPQ I ++F+ L IG V+ E RA +N AGLT+WSPNVNIF
Sbjct: 81 -ATVFPQPIALGATFDEALVREIGDAVATEGRAKFNVAQKLKNYSRNAGLTFWSPNVNIF 139
Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDR 220
RDPRWGRG ET GEDP+LSG +YVRGLQG D LK A KH+ + +G +
Sbjct: 140 RDPRWGRGMETYGEDPLLSGMLGTAYVRGLQGDDAFYLKTGACAKHYAVH-----SGPEG 194
Query: 221 FHFNAKV--SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGE 278
A + S++D+ +T+ F+M V +G+V +VM +YN+V G P +L +R
Sbjct: 195 TRHEADIHPSRRDLFETYLPQFKMLVQQGRVEAVMSAYNRVYGEPCGGSKYLLTDILRKS 254
Query: 279 WRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQ---- 334
W NG+IVSDCD++ +Y + TPEEA A AI+AGL+++CG HT A+Q
Sbjct: 255 WGFNGHIVSDCDAINDFYGGHRYVKTPEEACAAAIKAGLNVECG-----HTFKAMQGALD 309
Query: 335 RGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAARQG 393
+GLL+E D++ AL + +++LG+ EP S PY ++C+P H LAL AA +
Sbjct: 310 QGLLAEADLDRALFPLVMTRLKLGIL--EPDSACPYNSYDESEICSPAHTALALRAADEA 367
Query: 394 IVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI-GRYARTI 452
+VLLKN G LPL RT+ V GP + ++GNY G++ Y+T LQGI R +
Sbjct: 368 MVLLKNNG-ILPLDK-NIRTLFVAGPGASDAFYLMGNYFGLSNRYSTYLQGIVSRVSSGT 425
Query: 453 HQQGCKDVACADDQL--FGAAIDASRQADATILVMGLDQSIEAEA---------LDRAGL 501
++L A++ + A+ I+VMG + ++E E DR G+
Sbjct: 426 SVNFRPAFMQITEELNDMNWAVNEACAAEVAIVVMGNNGNMEGEEGEAIASASRGDRVGI 485
Query: 502 LLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIA 561
LP Q + +V G ++VL G PID+ K A++ A YPGQ GG A+
Sbjct: 486 GLPASQLNYLRRVKARKGGRIVVVLTGGSPIDL--RKISKLADAVVMAWYPGQEGGEALG 543
Query: 562 DILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGH 620
D+LFG N G+LP+T +P + + +LP + +M GRTY++ G V+YPFG+
Sbjct: 544 DLLFGDKNFSGRLPIT-FPAD-VDSLPAFDDYSMN--------GRTYKYMSGNVMYPFGY 593
Query: 621 GMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVG 680
G+SY +T +A + G+ K L V+V + N G
Sbjct: 594 GLSYGRVTYT-------------------DARVVGRIKKGE-------PLAVEVVLTNNG 627
Query: 681 SKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGT 739
+ + +TP AG +P LV F +V +P + + I V + L + G+
Sbjct: 628 DRTIDEVAQAYIATPTAGKGSPMASLVGFRRVSIPPKSSVKAVFKI-VPERLMTIQSDGS 686
Query: 740 RRIPLGEHNIHIGG 753
++ G + + IGG
Sbjct: 687 SKLLKGNYTLTIGG 700
>gi|326791674|ref|YP_004309495.1| beta-glucosidase [Clostridium lentocellum DSM 5427]
gi|326542438|gb|ADZ84297.1| Beta-glucosidase [Clostridium lentocellum DSM 5427]
Length = 696
Score = 386 bits (992), Expect = e-104, Method: Compositional matrix adjust.
Identities = 247/705 (35%), Positives = 374/705 (53%), Gaps = 98/705 (13%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
++ L+ ++L+E+ L + A+ RLG+ Y WW+EALHGV+ G A
Sbjct: 8 KKAKALVAEMTLEERASQLKYDSPAIKRLGVPAYNWWNEALHGVARAGV----------A 57
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT--------AGLTYWSPNVNIFRD 162
TSFPQ I A++F+ L + + V+++E RA YN + GLT+WSPNVNIFRD
Sbjct: 58 TSFPQAIGMAATFDDELLKRVAEVIAEEGRAKYNAYSQEGDRDIYKGLTFWSPNVNIFRD 117
Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFH 222
PRWGRG ET GEDP L+ + ++V+GLQG +G LK AA KHF + DR H
Sbjct: 118 PRWGRGHETYGEDPYLTSRLGVAFVKGLQGEEG--LKTAACAKHFAVHSGPE---ADRHH 172
Query: 223 FNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLN 282
F+A+VS++D+ +T+ F V E +V SVM +YN+ NG P C P ++K +R +W
Sbjct: 173 FDARVSQKDLWETYLPAFEALVKEAEVESVMGAYNRTNGEPCCGSPTLMKDILREKWGFQ 232
Query: 283 GYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEID 342
G+ VSDC ++ +++ TST +E+AA A+++G DL+CG LH A Q GL++E +
Sbjct: 233 GHYVSDCWAIKDFHEHHMVTSTAQESAALALKSGCDLNCGNTY-LHILMAYQNGLVTEEE 291
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGP 402
I A T + LG+FDG Y + + V + H +A EA + IVLLKN G
Sbjct: 292 ITTAAERLFTTRYLLGLFDG----STYDAIPYEVVESKPHLSVADEATAKSIVLLKNNG- 346
Query: 403 SLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG----IGRYARTIHQQG-- 456
LPL+ +T+ VIGPN++ +IGNY G + Y T L+G +G R ++ +G
Sbjct: 347 LLPLNKESIKTIGVIGPNANSRKALIGNYHGTSSQYITILEGLQKEVGDEVRILYSEGSH 406
Query: 457 -----CKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRAGLL 502
+ +A D+L A I ++ +D I+ +GLD+++E E + D+ L
Sbjct: 407 LYADRVEPLAYQRDRLSEAKI-VAKHSDVVIVCVGLDETLEGEEGDTGNAYASGDKRDLA 465
Query: 503 LPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIAD 562
LP QQELV ++ K P IL L +G ID+ +A D A++ A YPG GG IA
Sbjct: 466 LPEPQQELVEAMAKMGK-PVILCLSAGSAIDLQYA--DAHYDAVLQAWYPGARGGQVIAK 522
Query: 563 ILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGM 622
L G P GKLP+T+Y ++ LP E GRTYR+ + +YPFG+G+
Sbjct: 523 ALLGEIVPSGKLPVTFYRD--LSGLPAFE-------DYSMQGRTYRYMQEEALYPFGYGL 573
Query: 623 SYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKV-THAKCN-RLTLGVQVDVKNVG 680
+Y G+ A+ +++V H + + +L VQ+ +KN+
Sbjct: 574 TY-------------------GKCRIEEASYDQGSLRVLVHNEVDFKLEEVVQLYIKNLD 614
Query: 681 SKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINI 725
S+ F+ P+ L F++V + AG + + IN+
Sbjct: 615 SE--------FAV-------PNHSLCGFKRVSLEAGETKEIQINV 644
>gi|443695317|gb|ELT96258.1| hypothetical protein CAPTEDRAFT_179825 [Capitella teleta]
Length = 750
Score = 386 bits (991), Expect = e-104, Method: Compositional matrix adjust.
Identities = 262/774 (33%), Positives = 396/774 (51%), Gaps = 84/774 (10%)
Query: 12 LILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLIS 71
L L+S +S FA P + PF VSLPI R+NDLI RL++++ + ++
Sbjct: 11 LTLISTASFSQDNVLRFA--PSSHALDSFPFRNVSLPIETRLNDLISRLTIEDAINQTVA 68
Query: 72 G----AAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATL 127
+ RLGIK E+ +E L GV AT FPQ + A+SF+ L
Sbjct: 69 RYGKFTPGIERLGIKPIEYITECLRGVRR-----------ENATGFPQALGLAASFSRDL 117
Query: 128 WEAIGRVVSDEARAMYNG-------GTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSG 180
+ + VS E RA YN G G+T +SP +NI R P WGR QET GEDP LSG
Sbjct: 118 MQRVATAVSVEVRAFYNHDIQRETYGAHGITCFSPVINILRHPLWGRNQETYGEDPYLSG 177
Query: 181 KYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPF 240
+ A+ YV GLQG D L+V+A CKHF A+ + V +F F+AK+ ++D++ TF F
Sbjct: 178 ELASQYVSGLQGDDPRYLRVSAGCKHFDAHGGPDTIPVRKFGFDAKIEERDLQMTFLPAF 237
Query: 241 RMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQH 300
+ C+ K +VMCS+N +NGVP+CA+ +L +R +W G++VSD +V + H
Sbjct: 238 KKCIA-AKPYNVMCSFNSINGVPSCANKRLLTDVLRAQWGYEGFVVSDDAAVEYIFTEHH 296
Query: 301 FTSTPEEAAADAIRAGLDLD-CGPFLGLHTE--SAVQRGLLSEIDINNALVNTLTVQMRL 357
+ S+ E AA +AI++G +++ G F + + A+ L+++ ++ + + L
Sbjct: 297 YNSSFETAAVEAIKSGCNMELVGKFDPSYWQLTKALNEHLITKDELMENVRPVFLTRFLL 356
Query: 358 GMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVI 417
G FD P+ P+ + V + +HQ LALEAA + VLLKN LPL +TVAV+
Sbjct: 357 GEFD-PPALNPFNQITKDVVLSAEHQRLALEAAVKSFVLLKNDRNFLPLLKNSLKTVAVV 415
Query: 418 GPNSDVTVTMIGNYAGIA--CGYTTPLQGIGRYARTIH-QQGCKDVACADDQL--FGAAI 472
GP S+ T +IG+Y+ TPL GI + A + GC + C D + AA+
Sbjct: 416 GPMSNYTDGLIGDYSTDTDPSLILTPLHGIKKLAPNVQFASGCSNSTCTDYRATDVAAAV 475
Query: 473 DASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKG-PTILVLMSGGP 531
D A + +G +EAE DR+ ++LPG Q +L+ + G P +L+L +GGP
Sbjct: 476 DG---AQVVFVALGTGFIVEAENNDRSDIVLPGAQLQLLKDAVYHANGRPVVLLLFNGGP 532
Query: 532 IDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILF---GTSNPGGKLPMTWYPQEYITNLP 588
+DV FA+ I +I+ +P G AI +L G S+P G+LP+TW Y+ +P
Sbjct: 533 LDVTFAQLTSGIVSIVECFFPAMMTGEAIYRMLINNEGISSPAGRLPLTW--PAYLNQVP 590
Query: 589 -MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHG 647
+T+ M+ GRTYR+Y +YPFG+G+SYT F ++ + PL+
Sbjct: 591 NITDYTMK--------GRTYRYYTEDPLYPFGYGLSYTQFKYS-----DLKVTPLE---- 633
Query: 648 SINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLV-----FSTPPAGHWAPH 702
G+ I+ V+V V N+G D ++ S P P
Sbjct: 634 ----VTKGQEIR------------VKVKVTNIGLYDADEVRIIVVQAYVSWPKTEIPVPR 677
Query: 703 KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGT-RRIPLGEHNIHIGGTK 755
QLVAF+++H+ +G + V + I L V T I GE ++IGG +
Sbjct: 678 WQLVAFDRIHIASGKSETVELTIE-ASLLEVWQNPETGFDILEGEMTLYIGGQQ 730
>gi|297740661|emb|CBI30843.3| unnamed protein product [Vitis vinifera]
Length = 401
Score = 385 bits (990), Expect = e-104, Method: Compositional matrix adjust.
Identities = 188/427 (44%), Positives = 274/427 (64%), Gaps = 34/427 (7%)
Query: 335 RGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGI 394
+G E D++ +L N V ++G FDG PS Y L KD+CT +H ELA +AARQGI
Sbjct: 2 QGKAREEDVDTSLRNLYIVLTQVGFFDGIPS---YESLDKKDLCTKEHIELAADAARQGI 58
Query: 395 VLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQ 454
VLLKN +LPL + + +A+IGP+++ T+ M+GNYAG+ C Y++PL G Y + ++
Sbjct: 59 VLLKNINETLPLDPAKLKNLALIGPHANATIEMLGNYAGVPCQYSSPLDGFSAYGKVTYE 118
Query: 455 QGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKV 514
GC +V C + A++AS+ ADATIL++GLD+++E E LDR LLLPG Q EL+ +V
Sbjct: 119 MGCNNVTCDNKTFIMPAVEASKNADATILLVGLDKTVEGEGLDRNDLLLPGYQTELILQV 178
Query: 515 SMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKL 574
+ASKGP ILV+MSG +D++F+K D R+ AI+WAGYPG+ GG AIAD+++G NPGG+L
Sbjct: 179 IVASKGPIILVIMSGSAVDISFSKTDDRVKAILWAGYPGEEGGRAIADVVYGKYNPGGRL 238
Query: 575 PMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANA 634
P+TW+ +Y++ LPMT M++RP + YPGRTY+F+ G VVYPFGHG+SYT F +T
Sbjct: 239 PLTWHQNDYLSMLPMTSMSLRPVNN--YPGRTYKFFNGSVVYPFGHGLSYTKFNYT---- 292
Query: 635 PTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP 694
++ ++ C + + ++VKN+G+K G +LV+S P
Sbjct: 293 -----------------------LRSSNMSC-KDHFELDIEVKNIGAKHGNEVVLVYSKP 328
Query: 695 PAGHWAPH-KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
P G H KQ++ F++V VPAG Q V +VCK L +V + + +P GEH I IG
Sbjct: 329 PTGIVGTHAKQVIGFKRVFVPAGGSQNVKFEFNVCKSLGIVGYNAYKLLPSGEHKIIIGD 388
Query: 754 TKHSVSL 760
+ S+ +
Sbjct: 389 SPTSLPI 395
>gi|301090543|ref|XP_002895482.1| beta-glucosidase, putative [Phytophthora infestans T30-4]
gi|262098232|gb|EEY56284.1| beta-glucosidase, putative [Phytophthora infestans T30-4]
Length = 809
Score = 385 bits (990), Expect = e-104, Method: Compositional matrix adjust.
Identities = 266/773 (34%), Positives = 399/773 (51%), Gaps = 77/773 (9%)
Query: 18 SSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAA-- 75
S+ + A P AC + + FC SL +RV DL+ RL L EKV LL + A+
Sbjct: 15 STPTVDASIPRACSSPEH--QQFAFCNASLSTAERVEDLLRRLPLDEKVTLLTARASPKG 72
Query: 76 -VPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRV 134
+ +G+ Y W + +HGV + GT ATSFP + + F+ + +V
Sbjct: 73 NMSSIGLPEYNWGANCVHGVQSTC-GTN------CATSFPNPVNLGAIFDPRAVFDMAQV 125
Query: 135 VSDEARAMY---------NGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAAS 185
V E RA++ G GL WSPN+NI RDPRWGR ETP EDP+++ KY +
Sbjct: 126 VGWELRALWLEGARENYATGPHLGLDCWSPNININRDPRWGRNMETPSEDPLVNSKYGVA 185
Query: 186 YVRGLQ-GSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCV 244
Y +GLQ G D L+ + KH+ AY ++++G+DR FNA VS+ D DT+ F V
Sbjct: 186 YTKGLQEGKDKRFLQAVVTLKHYAAYSYEHYDGIDRMAFNAVVSRYDFADTYLPAFEASV 245
Query: 245 MEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTST 304
+ GK VMCSYN VNG+P CA+ + + +R +GYI SD ++ Y +H+T T
Sbjct: 246 VHGKAKGVMCSYNSVNGMPMCANEQLNSKLLRDALGFDGYITSDSGAIAGIYHQRHYTKT 305
Query: 305 PEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP 364
EA AI +G D++ G V G L E +++A+ TL ++ LG+FD
Sbjct: 306 LCEAGRLAILSGTDVNSGSVYKQCLAELVTSGQLPEKAVDDAMRRTLKLRFELGLFD-PI 364
Query: 365 SSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVT 424
QPY H+ P +V T + ++L+L+ +R+ IVLL+N G LPL+ + + +AVIGP++
Sbjct: 365 DDQPYWHVAPNEVNTAESKQLSLDLSRKSIVLLQNHGNILPLA--KGKKLAVIGPHAAAK 422
Query: 425 VTMIGNYAGIACG--------YTTPLQGI---GRYARTIHQQGC--KDVACADDQLFGAA 471
++GNY G C TPL+ I + T++ +G D + A F A
Sbjct: 423 RALLGNYLGQMCHGDYLEVGCVQTPLEAITIANGASNTLYAKGSGINDTSTAG---FDEA 479
Query: 472 IDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGP 531
A+R+A+ +L +G+D SIE EA DR + +P Q +L+ +V A K PT++VL +GG
Sbjct: 480 EAAARKAETVVLFLGIDTSIEREAWDRENIDMPNIQMQLLKRVRRAGK-PTVVVLFNGGV 538
Query: 532 I--DVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM 589
+ + D ++ A YPG G A++DILFG + P GKLP+T YP Y+T++ M
Sbjct: 539 VGAEELILHTD----GVVEAFYPGFFGAQAVSDILFGDAIPSGKLPVTMYPSNYVTSVDM 594
Query: 590 TEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSI 649
M+M +YPGR+YR+YK V+PFG G+SYT F + LD G
Sbjct: 595 KSMSM-----TKYPGRSYRYYKEVPVFPFGWGLSYTRFT-----------MALDSSSG-- 636
Query: 650 NATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP----PAGHWA-PHKQ 704
T + I VT +L V V + N G+ G + F P G+ A ++Q
Sbjct: 637 -VTDPSEPIVVTR----QLDQTVTVILSNDGNLVGDEVVFAFFRPLKVNATGNAALLNEQ 691
Query: 705 LVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHS 757
L + +V + +++ I L++VD SG + G + + I H
Sbjct: 692 LFDYRRVSLRPTQYRKLKFRIQQST-LAMVDDSGNQASFPGFYEVIITNGVHE 743
>gi|326789672|ref|YP_004307493.1| beta-glucosidase [Clostridium lentocellum DSM 5427]
gi|326540436|gb|ADZ82295.1| Beta-glucosidase [Clostridium lentocellum DSM 5427]
Length = 704
Score = 385 bits (990), Expect = e-104, Method: Compositional matrix adjust.
Identities = 258/745 (34%), Positives = 383/745 (51%), Gaps = 96/745 (12%)
Query: 52 RVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGAT 111
+ L+ ++ L EK +L + A+ RLG+ Y WWSEALHGV+ G AT
Sbjct: 8 KAGQLVAQMDLLEKASMLRYDSPAIKRLGVPTYNWWSEALHGVARAGV----------AT 57
Query: 112 SFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRDP 163
FPQ I A+ F+ I +++ EARA YN G+T W+PN+NIFRDP
Sbjct: 58 VFPQAIGMAAMFDEEYLYEIADIIATEARAKYNEFAKKEDRDIYKGMTLWAPNINIFRDP 117
Query: 164 RWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASC-KHFTAYDLDNWNGVDRFH 222
RWGRG ET GEDP L+ + +++ GLQG + AA+C KHF + +R H
Sbjct: 118 RWGRGHETYGEDPYLTSRLGVAFIHGLQGDENHHYWKAAACAKHFAVHSGPE---EERHH 174
Query: 223 FNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLN 282
F+A VSK+D+ +T+ F V +GKVA +M +YN+VNG P C +L+ ++ EW +
Sbjct: 175 FDAVVSKKDLYETYLPAFEAAVTKGKVAGMMGAYNRVNGEPACGSKVLLQDILKEEWGFD 234
Query: 283 GYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEID 342
GY+VSDC ++ ++ T T E+AA AI G L+CG LH A + GL++E
Sbjct: 235 GYVVSDCWAIRDFHTEHMVTHTATESAALAINNGCQLNCGNTY-LHMLQAYKEGLVTEET 293
Query: 343 INNALVNTLTVQMRLGMFDG--EPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
I + + ++M+LG+FD E + PY + C H+++AL+ AR+ +VLLKN
Sbjct: 294 ITKSAQKLMAIRMKLGLFDKNCEYNKIPY----EVNDCKV-HRDIALDVARRSMVLLKNN 348
Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY----ARTIHQQG 456
G LPL+ + + + VIGP ++ + GNY G A YTT L+GI Y AR + +G
Sbjct: 349 GI-LPLNLKQTKAIGVIGPTANSRTVLQGNYFGTASRYTTFLEGIQDYVGDAARVYYAEG 407
Query: 457 C-------KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRAG 500
C ++ +D+L A I + Q+D IL +GLD SIE E A D++
Sbjct: 408 CHLFKNSISGLSWENDRLSEALI-VAEQSDVVILCLGLDASIEGEQGDTGNAFAAGDKSD 466
Query: 501 LLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
L L GRQQ L+ +V K PTIL+L SG + + A+ AI+ YPGQ+GG A+
Sbjct: 467 LNLIGRQQLLLEEVLKIGK-PTILILSSGSAMAIHTAQE--YCEAILETWYPGQSGGKAL 523
Query: 561 ADILFGTSNPGGKLPMTWYP-QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFG 619
A +LFG +P GKLP+T+Y E + + MA GRTYR+ K +YPFG
Sbjct: 524 AQLLFGEYSPSGKLPITFYKTTEELPDFRDYSMA----------GRTYRYMKNEALYPFG 573
Query: 620 HGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNV 679
+G++Y V V +A I + I+ N + +Q+ V N
Sbjct: 574 YGLNYAK-----------VEVK--------DAVIKERNIE------NEIIYEIQLQVTNQ 608
Query: 680 GSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSG 738
+ V+ WA P+ L A++ +++ A + ++ + I + +VD G
Sbjct: 609 SEVCTYDVVQVYIKDMESRWAVPNYSLCAYKSIYLAAYDEPQITLQIKQSAF-EIVDEEG 667
Query: 739 TRRIPLGEHNIHIGGT---KHSVSL 760
R I + IG + K SV L
Sbjct: 668 KRYIDSHHFKLFIGTSGPDKRSVEL 692
>gi|449299051|gb|EMC95065.1| glycoside hydrolase family 3 protein [Baudoinia compniacensis UAMH
10762]
Length = 849
Score = 385 bits (989), Expect = e-104, Method: Compositional matrix adjust.
Identities = 257/764 (33%), Positives = 382/764 (50%), Gaps = 67/764 (8%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C + QR + +I +++ EK+ L+ + RLG+ YEWWSEALHGV+ PG
Sbjct: 43 CDTNATPYQRASAIINAMNITEKLANLLDVSYGSARLGLPPYEWWSEALHGVAG-SPGVN 101
Query: 103 F--GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIF 160
F G++ ATSFP IT +S+F+ + I V+S EARA N GL Y++PN+N F
Sbjct: 102 FTSSGNYSYATSFPMPITFSSAFDDPSVQNIASVISTEARAYSNAARGGLDYFTPNINPF 161
Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL--------KVAASCKHFTAYDL 212
+DPRWGRG ETPGEDP+ Y + + GL+G+D K+ A+CKHF YDL
Sbjct: 162 KDPRWGRGSETPGEDPLRIQGYVKNLLIGLEGTDDGYFNTSHSGYKKMIATCKHFAGYDL 221
Query: 213 DNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILK 272
++W+G R+ ++A+++ QD+ + + PF+ C + VAS+MCSYN VN VP CA+ + +
Sbjct: 222 EDWDGYIRYGYDAEITTQDLAEYYLPPFQTCARDQNVASIMCSYNSVNSVPACANSYLQE 281
Query: 273 RTIRGEWRL---NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC-GPFLGLH 328
+R W N YI SDC+++ Y +++ AA ++ G+D C G+
Sbjct: 282 TILREHWGWTIDNNYITSDCNAISDIYYNHNYSVNNAAAAGLSLSNGMDTACIVANTGVM 341
Query: 329 TE--SAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELA 386
T+ + G ++E I AL+ + G FD SS PY +G V TP Q LA
Sbjct: 342 TDVNGSYYGGYVTEATITTALIRQYEALVIAGYFD-PASSNPYRSIGWSSVNTPAAQTLA 400
Query: 387 LEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIG 446
+AA +G LLKN G LP VA+IG ++ T M G Y+G A +PL
Sbjct: 401 RQAATEGTTLLKNTG-LLPYKFTSQTKVAMIGMWANGTSQMQGGYSGPAPYLHSPLYAAS 459
Query: 447 RYARTI-HQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPG 505
+ + + G + A A++ AD + G+D S+EAEA+DR + PG
Sbjct: 460 QLGLSYNYANGPINQTTLTSNYSQNATAAAQNADVILFFGGIDWSVEAEAMDRYQIAWPG 519
Query: 506 RQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILF 565
QQ L+++ +A+ G ++VL G +D ++ I+A++W GYPGQ GG A DIL
Sbjct: 520 AQQALIAQ--LAALGKPMIVLQMGSMLDATPILSNNNISALVWVGYPGQDGGVAAFDILT 577
Query: 566 GTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYT 625
G P G+LP+T YP +Y+ +PMT M++RP PGRTY++Y V+ PF +G+ YT
Sbjct: 578 GAVAPAGRLPVTMYPADYVNQVPMTNMSLRPGPGN--PGRTYKWYNNAVL-PFAYGLHYT 634
Query: 626 NFVHTV--------------ANA------------------------PTVVAVPLDGRHG 647
F T NA P+ VA G
Sbjct: 635 TFKATFNGGPPGPGSPWSPPWNAPWSAKVRRGWGWGNWGPPNWGWTQPSQVAPGNGGLSS 694
Query: 648 SINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLV 706
S N + H V + V+N G LVFS AG P+K L
Sbjct: 695 SYNIQSLLSSCTAAHPDLCAFP-SVAISVQNAGQTTSDFVALVFSNTTAGPAPYPYKSLA 753
Query: 707 AFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNI 749
++ ++H V AG +N+ + L+ D G + + G +N+
Sbjct: 754 SYTRLHSVAAGQTVTASLNMTL-GVLARRDDQGNQILYPGTYNL 796
>gi|339499234|ref|YP_004697269.1| beta-glucosidase [Spirochaeta caldaria DSM 7334]
gi|338833583|gb|AEJ18761.1| Beta-glucosidase [Spirochaeta caldaria DSM 7334]
Length = 699
Score = 385 bits (989), Expect = e-104, Method: Compositional matrix adjust.
Identities = 272/735 (37%), Positives = 380/735 (51%), Gaps = 105/735 (14%)
Query: 52 RVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGAT 111
++ LI +SL+EK+ L+I A +PRLGI Y WW+EALHGV+N G AT
Sbjct: 11 QIETLISNMSLEEKIGLMIHRAKGIPRLGIPDYNWWNEALHGVANNGE----------AT 60
Query: 112 SFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRDP 163
FPQ I ++F+ L + +S EARA +N GLT+W+PN+NIFRDP
Sbjct: 61 VFPQAIALGATFDEDLVHRVAEAISIEARAKFNAVGKEKAEQYHRGLTFWAPNINIFRDP 120
Query: 164 RWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--RF 221
RWGRGQET GEDPVL+ + +YVRGLQGSD L+ AA KHF + +G + R
Sbjct: 121 RWGRGQETYGEDPVLTSRLGTAYVRGLQGSDPYYLRAAACAKHFAVH-----SGPEGLRH 175
Query: 222 HFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRL 281
FNA+VS++D+E+T+ F+ V G V SVM +YN+VNG P C +LK+ +R EW+
Sbjct: 176 TFNAEVSQKDLEETYLPAFKALVKSG-VESVMGAYNRVNGEPACGSTYLLKQKLREEWQF 234
Query: 282 NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEI 341
G++VSDC ++ ++ T+ E+ A A+R+G DL+CG E AV +G ++E
Sbjct: 235 QGHVVSDCWAICDFHKNHKVTNDILESIALALRSGCDLNCGDAYNYLAE-AVLKGYVTED 293
Query: 342 DINNALVNTLTVQMRLGMF--DGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKN 399
DIN A+V L +LG+ DG PY + + H LALEAA + IVLLKN
Sbjct: 294 DINRAVVRLLITLDKLGLIHDDG-----PYQGITIHQIDWKKHDSLALEAAEKSIVLLKN 348
Query: 400 QGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQ 455
G LPL + + V GPN+ + ++GNYAG++ T L+ I G +++
Sbjct: 349 NG-VLPLKKDKISYIYVTGPNATNSDALLGNYAGVSSRLLTVLEAIVEEAGPEITVTYKK 407
Query: 456 GCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDR---------AGLLLPGR 506
GC +A A ++ AD TI VMG D S+E E D L L
Sbjct: 408 GCP-LAERRVNPNDWASGVTKYADVTIAVMGRDTSVEGEEGDAILSSTYGDFEDLNLNDE 466
Query: 507 QQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIA-AIIWAGYPGQAGGTAIADILF 565
Q + K+ + K P I+VLM G PI + IA AI+ A YPGQAGGTA+++I+F
Sbjct: 467 QLSYLHKLKESGK-PLIVVLMGGAPI---CSPELHEIADAILVAWYPGQAGGTAVSNIVF 522
Query: 566 GTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYT 625
G +NP GKLP+T +P+ + LP E + GRTYR+ +YPFG G+SYT
Sbjct: 523 GKTNPSGKLPVT-FPKS-VRQLPEFE-------NYSMQGRTYRYMTEEPLYPFGFGLSYT 573
Query: 626 --NFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKD 683
F H GR S + L V ++ N G+ D
Sbjct: 574 KMEFKHVT------------GRWKS----------------PEKDELIVSTELYNQGTID 605
Query: 684 GAHTLLVFSTPPAGHW------APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRS 737
G + ++ HW P+ L+ F++V V AGA I + K L +D S
Sbjct: 606 GEEVVQLYY-----HWKDAPFAVPNWSLIDFKRVLVAAGASCICEFKIPLEK-LQCIDPS 659
Query: 738 GTRRIPLGEHNIHIG 752
G IP G ++G
Sbjct: 660 GKGVIPTGTLQFYVG 674
>gi|255590044|ref|XP_002535159.1| Thermostable beta-glucosidase B, putative [Ricinus communis]
gi|223523880|gb|EEF27223.1| Thermostable beta-glucosidase B, putative [Ricinus communis]
Length = 449
Score = 385 bits (988), Expect = e-104, Method: Compositional matrix adjust.
Identities = 192/450 (42%), Positives = 288/450 (64%), Gaps = 9/450 (2%)
Query: 317 LDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKD 376
+D++CG +L +T+SAV++ +SE +I+ AL N +++MRLG+F+G P+ PYG +
Sbjct: 1 MDVNCGNYLKNYTKSAVEKKKVSESEIDRALHNLFSIRMRLGLFNGNPTKLPYGDISADQ 60
Query: 377 VCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIAC 436
VC+ +HQ +ALEAAR GIVLLKN LPLS + ++A+IGPN+D + ++GNYAG C
Sbjct: 61 VCSQEHQAVALEAARDGIVLLKNSNQLLPLSKSKTTSLAIIGPNADNSTILVGNYAGPPC 120
Query: 437 GYTTPLQGIGRYARTI-HQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEA 495
TP QG+ Y +T + GC VAC+ + AI +++AD +LVMGLDQ+ E E
Sbjct: 121 KTVTPFQGLQNYIKTTKYHPGCSTVACSSAAI-DQAIKIAKEADQVVLVMGLDQTQEREE 179
Query: 496 LDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQA 555
DR L+LPG+QQEL+ V+ A+K P +LVL+ GGP+D++FAK D I I+WAGYPG+A
Sbjct: 180 HDRVDLVLPGKQQELIISVARAAKKPVVLVLLCGGPVDISFAKYDRNIGGILWAGYPGEA 239
Query: 556 GGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVV 615
GG A+A+I+FG NPGG+LP+TWYPQ++ T +PMT+M MRP S YPGRTYRFYKG V
Sbjct: 240 GGIALAEIIFGNHNPGGRLPVTWYPQDF-TKVPMTDMRMRPQPSSGYPGRTYRFYKGKKV 298
Query: 616 YPFGHGMSYTNFVH---TVANAPTVVAVPLDGRHGSINATISGKAI-KVTHAKCNRLTLG 671
+ FG+G+SY+N+ + +V + +D + + ++ I K I ++ C R
Sbjct: 299 FEFGYGLSYSNYSYELVSVTQNKISLRSSIDQKAEN-SSPIGYKTISEIEEELCERSKFS 357
Query: 672 VQVDVKNVGSKDGAHTLLVFS-TPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKY 730
V V VKN G G H +L+F+ G P K+L+AF+ V + AG + ++ C++
Sbjct: 358 VTVRVKNQGEMTGKHPVLLFARQDKPGSGGPIKKLIAFQSVKLNAGENAEIEYKVNPCEH 417
Query: 731 LSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
LS + G + G + +G ++ +++
Sbjct: 418 LSRANEDGLMVMEEGSQYLLVGDKEYPINI 447
>gi|372208556|ref|ZP_09496358.1| beta-glucosidase [Flavobacteriaceae bacterium S85]
Length = 729
Score = 385 bits (988), Expect = e-104, Method: Compositional matrix adjust.
Identities = 252/748 (33%), Positives = 380/748 (50%), Gaps = 80/748 (10%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
+ SL +R++ L+ ++L+EK+ L SG+ V RL I Y WW+EALHGV+ G
Sbjct: 26 WLDTSLTFEERIHHLVKAMTLKEKIAQLDSGSPEVKRLDIPEYNWWNEALHGVARNGK-- 83
Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYW 153
+T FPQ I A++F+ L + + +SDEARA +N G AGLT+W
Sbjct: 84 --------STVFPQAIGLAATFDPVLAKQVASAISDEARAKFNISQSIGNRGQYAGLTFW 135
Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLD 213
+PNVNIFRDPRWGRGQET GEDP L+ + ++V+GLQG+ LK AA KHF +
Sbjct: 136 TPNVNIFRDPRWGRGQETYGEDPYLTSQMGVAFVKGLQGNHPKYLKSAACAKHFAVH--- 192
Query: 214 NWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
+G + R HFNA SK+D+ +T+ F V + V VM +YN V GVP + +L
Sbjct: 193 --SGPEELRHHFNANPSKKDLYETYLPAFEALVKQANVEGVMSAYNAVYGVPAGSSEFLL 250
Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTES 331
K T+R W +GYIVSDC ++G + T EAAA A++AG++L+CG E
Sbjct: 251 KETLRKSWGFDGYIVSDCGALGDIFKGHKQVKTMPEAAAVALKAGVNLNCGYVYNGALEK 310
Query: 332 AVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAAR 391
AVQ+GL+SE I+ L L + +LG FD + + PY + + + DH LA + A+
Sbjct: 311 AVQQGLVSEELIDTRLKQLLKTRFKLGFFDPK-EANPYNAIPTSVIHSDDHIALARKTAQ 369
Query: 392 QGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYART 451
+ IVLLKN+ +LPL + V GP + + ++ NY G+ + L+GI
Sbjct: 370 KSIVLLKNKNHTLPLDK-NIKVPYVTGPFASSSDVLLANYYGMTTNLVSVLEGIADKVSL 428
Query: 452 IHQQGCKDVACADDQLFGA---AIDASRQADATILVMGLDQSIEAEAL---------DRA 499
+ A ++ A + ++ ADA I V+GL E E + D+
Sbjct: 429 GTSLNYRMGALPFNKNLNPKNWAPNVAKTADAVIAVVGLSADFEGEEVDAIASPNKGDKK 488
Query: 500 GLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTA 559
L LP Q + V +++ KGP ILV+ SG VA + AI+ YPG+ GG A
Sbjct: 489 DLKLPQNQIDYVKEMAAKKKGPLILVVASGSA--VALGELYDLADAIVLMWYPGEQGGNA 546
Query: 560 IADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFG 619
+AD+LFG +P G LP+T +P+ P + +M+ GRTY++ + ++PFG
Sbjct: 547 VADVLFGDVSPSGHLPVT-FPKSVAQLPPFEDYSMQ--------GRTYKYMEEEPLFPFG 597
Query: 620 HGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNV 679
G+SYT+F + N IS + IK + + V V N
Sbjct: 598 FGLSYTDFKFS-------------------NVQISEEKIK------KKDSFTVSCSVANN 632
Query: 680 GSKDGAHTLLVFSTP-PAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSG 738
G DG + ++ P + P QL+ F+++ + + V N+ K L V++ G
Sbjct: 633 GKVDGEEVVQLYLVPLNSNKDLPKYQLLKFKRIEIQKNTSKTVSFNLE-AKDLFQVNKEG 691
Query: 739 TRRIPLGEHNIHIGG---TKHSVSLHAA 763
+ G++ + + +K S L AA
Sbjct: 692 KKTWIKGKYKLVVANALPSKRSQDLGAA 719
>gi|301118693|ref|XP_002907074.1| glycoside hydrolase, putative [Phytophthora infestans T30-4]
gi|262105586|gb|EEY63638.1| glycoside hydrolase, putative [Phytophthora infestans T30-4]
Length = 809
Score = 384 bits (987), Expect = e-104, Method: Compositional matrix adjust.
Identities = 265/773 (34%), Positives = 398/773 (51%), Gaps = 77/773 (9%)
Query: 18 SSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAA-- 75
S+ + A P AC + + FC SL +RV DL+ RL L EKV LL + A+
Sbjct: 15 STPTVDASIPRACSSPEH--QQFAFCNASLSTAERVEDLLRRLPLDEKVTLLTARASPKG 72
Query: 76 -VPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRV 134
+ +G+ Y W + +HGV + GT ATSFP + + F+ + +V
Sbjct: 73 NMSSIGLPEYNWGANCVHGVQSTC-GTN------CATSFPNPVNLGAIFDPRAVFDMAQV 125
Query: 135 VSDEARAMY---------NGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAAS 185
V E RA++ G GL WSPN+NI RDPRWGR ETP EDP+++ KY +
Sbjct: 126 VGWELRALWLEGARENYATGPHLGLDCWSPNININRDPRWGRNMETPSEDPLVNSKYGVA 185
Query: 186 YVRGLQ-GSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCV 244
Y +GLQ G D L+ + KH+ AY ++++G+DR FNA VS+ D DT+ F V
Sbjct: 186 YTKGLQEGKDKRFLQAVVTLKHYAAYSYEHYDGIDRMAFNAVVSRYDFADTYLPAFEASV 245
Query: 245 MEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTST 304
+ GK VMCSYN VNG+P CA+ + + +R +GYI SD ++ Y +H+T T
Sbjct: 246 VHGKAKGVMCSYNSVNGMPMCANEQLNSKLLRDALGFDGYITSDSGAIAGIYHQRHYTKT 305
Query: 305 PEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP 364
EA AI +G D++ G V G L E +++A+ TL ++ LG+FD
Sbjct: 306 LCEAGRLAILSGTDVNSGSVYKQCLAELVTSGQLPEKAVDDAMRRTLKLRFELGLFD-PI 364
Query: 365 SSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVT 424
QPY H+ P +V T + ++L+L+ +R+ IVLL+N G LPL+ + + +AVIGP++
Sbjct: 365 DDQPYWHVAPNEVNTAESKQLSLDLSRKSIVLLQNHGNILPLA--KGKKLAVIGPHAAAK 422
Query: 425 VTMIGNYAGIACG--------YTTPLQGI---GRYARTIHQQGC--KDVACADDQLFGAA 471
++GNY G C TPL+ I + T++ +G D + F A
Sbjct: 423 RALLGNYLGQMCHGDYLEVGCVQTPLEAITIANGASNTLYAKGSGINDTSTGG---FDEA 479
Query: 472 IDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGP 531
A+R+A+ +L +G+D SIE EA DR + +P Q +L+ +V A K PT++VL +GG
Sbjct: 480 EAAARKAETVVLFLGIDTSIEREAWDRENIDMPNIQMQLLKRVRRAGK-PTVVVLFNGGV 538
Query: 532 I--DVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM 589
+ + D ++ A YPG G A++DILFG + P GKLP+T YP Y+T++ M
Sbjct: 539 VGAEELILHTD----GVVEAFYPGFFGAQAVSDILFGDAIPSGKLPVTMYPSNYVTSVDM 594
Query: 590 TEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSI 649
M+M +YPGR+YR+YK V+PFG G+SYT F + LD G
Sbjct: 595 KSMSM-----TKYPGRSYRYYKEVPVFPFGWGLSYTRFT-----------MALDSSSG-- 636
Query: 650 NATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP----PAGHWA-PHKQ 704
T + I VT +L V V + N G+ G + F P G+ A ++Q
Sbjct: 637 -VTDPSEPIVVTR----QLDQTVTVILSNDGNLVGDEVVFAFFRPLKVNATGNAALLNEQ 691
Query: 705 LVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHS 757
L + +V + +++ I L++VD SG + G + + I H
Sbjct: 692 LFDYRRVSLRPTQYRKLKFRIQQST-LAMVDDSGNQASFPGFYEVIITNGVHE 743
>gi|373460527|ref|ZP_09552278.1| hypothetical protein HMPREF9944_00542 [Prevotella maculosa OT 289]
gi|371955145|gb|EHO72949.1| hypothetical protein HMPREF9944_00542 [Prevotella maculosa OT 289]
Length = 699
Score = 384 bits (986), Expect = e-104, Method: Compositional matrix adjust.
Identities = 248/727 (34%), Positives = 376/727 (51%), Gaps = 79/727 (10%)
Query: 49 IPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFP 108
+ Q+ LI ++L EK+ +++ +PRLGIK Y+WW+E LHGV G
Sbjct: 9 MQQKARRLINMMTLDEKISQMMNETPGIPRLGIKPYDWWNEGLHGVGRDGR--------- 59
Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYWSPNVNIF 160
AT FPQ I ++FN L IG ++ E RA YN GLT+WSPN+NIF
Sbjct: 60 -ATVFPQPIGMGATFNPALIRQIGDAIATEGRAKYNVAQRNNNYARYTGLTFWSPNINIF 118
Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDR 220
RDPRWGRG ET GEDP L+G +YV+G+QG+D LKVAA KH+ + R
Sbjct: 119 RDPRWGRGMETYGEDPFLTGTLGIAYVQGMQGNDPFYLKVAACGKHYAVHSGPE---ATR 175
Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
N +K+D+ +T+ F+M V +G V ++M +YN+V G +L +R +W
Sbjct: 176 HEANVSPTKRDLFETYLPAFKMLVQQGHVEAIMGAYNRVYGEACSGSKYLLTDVLRKQWG 235
Query: 281 LNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSE 340
G+IVSDCD+V + T EA A AI+AGL+++CG ++ Q+ LL+E
Sbjct: 236 FRGHIVSDCDAVADIHAGHKIVKTEAEACAIAIKAGLNIECGHTFEAMKQAVAQK-LLTE 294
Query: 341 IDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
+I+ AL+ + +++LG+ + + PY + ++C+P+H LA +AA + +VLLKN
Sbjct: 295 QEIDRALLPLMMTRLKLGILEYDAEC-PYNEVKETEICSPEHIALARKAATESMVLLKNN 353
Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYART---IHQQGC 457
G LPL H T+ + GP + + ++GNY GI+ Y T LQGI + ++ +
Sbjct: 354 G-ILPLDKNLH-TLFIAGPGASDSFWLMGNYFGISNRYCTYLQGIADKVSSGTAVNFRPA 411
Query: 458 KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEA---------LDRAGLLLPGRQQ 508
+ A+D + A+ TI+VMG + ++E E DR + LP Q
Sbjct: 412 FGESTPTKNTINWALDEAIAAEKTIVVMGNNGNLEGEEGESIASETRGDRVSMRLPASQM 471
Query: 509 ELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIA-AIIWAGYPGQAGGTAIADILFGT 567
+ + + A K ++VL G PIDV + R+A A++ A YPGQ GG A+AD+LFG
Sbjct: 472 KFLRDLK-ARKNGIVVVLTGGSPIDV---REISRLADAVVMAWYPGQEGGYALADLLFGD 527
Query: 568 SNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNF 627
N G+LP+T+ P + A+ P + GRTY++ + YPFG+G+SYT
Sbjct: 528 ENFSGRLPVTF---------PESTDALPPFEDYAMKGRTYKYQTAHIQYPFGYGLSYTTV 578
Query: 628 VHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHT 687
+ A T +P GR +++A + KN G+K
Sbjct: 579 TYAHAKVET---MPQKGRGMTVSAVL-----------------------KNTGNKAVDEV 612
Query: 688 LLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGE 746
V+ S P AG A LVAF+++ + G QQ V +I + L+ V GT ++ G
Sbjct: 613 AQVYLSAPGAGTTAALASLVAFKRIGLQPGEQQLVRFDIPFDRLLT-VQEDGTAQLLKGN 671
Query: 747 HNIHIGG 753
+ I +GG
Sbjct: 672 YTITVGG 678
>gi|315607899|ref|ZP_07882892.1| beta-glucosidase [Prevotella buccae ATCC 33574]
gi|315250368|gb|EFU30364.1| beta-glucosidase [Prevotella buccae ATCC 33574]
Length = 721
Score = 383 bits (984), Expect = e-103, Method: Compositional matrix adjust.
Identities = 256/735 (34%), Positives = 383/735 (52%), Gaps = 94/735 (12%)
Query: 49 IPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFP 108
+ + ++I R+++ EK+ L++ + A+ LGIK Y+WWSE LHGV G
Sbjct: 30 MSRHAKEIIARMTVSEKISQLMNESPAIEHLGIKPYDWWSEGLHGVGRDGR--------- 80
Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYWSPNVNIF 160
AT FPQ I ++F+ L IG V+ E RA +N AGLT+WSPNVNIF
Sbjct: 81 -ATVFPQPIALGATFDEALVREIGDAVATEGRAKFNVARKLKNYSRNAGLTFWSPNVNIF 139
Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDR 220
RD RWGRG ET GEDP+LSG +YVRGLQG D LK A KH+ + +G +
Sbjct: 140 RDLRWGRGMETYGEDPLLSGMLGTAYVRGLQGDDAFYLKTGACAKHYAVH-----SGPEG 194
Query: 221 FHFNAKV--SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGE 278
A + S++D+ +T+ F+M V +G+V +VM +YN+V G P +L +R
Sbjct: 195 TRHEADIHPSRRDLFETYLPQFKMLVQQGRVEAVMSAYNRVYGEPCGGSKYLLTDILRKS 254
Query: 279 WRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQ---- 334
W NG+IVSDCD++ +Y + TPEEA A AI+AGL+++CG HT A+Q
Sbjct: 255 WGFNGHIVSDCDAINDFYGGHRYVKTPEEACAAAIKAGLNVECG-----HTFKAMQGALD 309
Query: 335 RGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAARQG 393
+GLL+E D++ AL + +++LG+ EP S PY ++C+P H LAL AA +
Sbjct: 310 QGLLAEADLDRALFPLVMTRLKLGIL--EPDSACPYNSYDESEICSPAHTALALRAADEA 367
Query: 394 IVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI-GRYARTI 452
+VLLKN G LPL RT+ V GP + ++GNY G++ Y+T LQGI R +
Sbjct: 368 MVLLKNNG-ILPLDK-NIRTLFVAGPGASDAFYLMGNYFGLSNRYSTYLQGIVSRVSSGT 425
Query: 453 HQQGCKDVACADDQL--FGAAIDASRQADATILVMGLDQSIEAEA---------LDRAGL 501
++L A++ + A+ I+VMG + ++E E DR G+
Sbjct: 426 SVNFRPAFMQITEELNDMNWAVNEACAAEVAIVVMGNNGNMEGEEGEAIASASRGDRVGI 485
Query: 502 LLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIA-AIIWAGYPGQAGGTAI 560
LP Q + +V G ++VL G PID+ + ++A A++ A YPGQ GG A+
Sbjct: 486 GLPASQLNYLRRVKARKGGRIVVVLTGGSPIDL---REISKLADAVVMAWYPGQEGGEAL 542
Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFG 619
D+LFG N G+LP+T +P + + +LP + +M GRTY++ G V+YPFG
Sbjct: 543 GDLLFGDKNFSGRLPIT-FPAD-VDSLPAFDDYSMN--------GRTYKYMSGNVMYPFG 592
Query: 620 HGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNV 679
+G+SY +T +A + G+ K L V+V + N
Sbjct: 593 YGLSYGRVTYT-------------------DARVVGRIKKGE-------PLAVEVVLTNN 626
Query: 680 GSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSG 738
G + + +TP AG +P LV F +V +P + + I V + L V G
Sbjct: 627 GDRTIDEVAQAYIATPTAGKGSPMASLVGFRRVSIPPKSSVKAVFKI-VPERLMTVQSDG 685
Query: 739 TRRIPLGEHNIHIGG 753
+ ++ G + + IGG
Sbjct: 686 SSKLLKGNYTLTIGG 700
>gi|348684866|gb|EGZ24681.1| hypothetical protein PHYSODRAFT_325770 [Phytophthora sojae]
Length = 805
Score = 383 bits (983), Expect = e-103, Method: Compositional matrix adjust.
Identities = 260/716 (36%), Positives = 377/716 (52%), Gaps = 72/716 (10%)
Query: 13 ILLSASSSGL----AAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKL 68
+LL+ +S+ L AA P AC K + LPFC SL RV DL+ RL LQEK L
Sbjct: 6 LLLAIASASLVTPTAATIPRACVGK--ANQELPFCNTSLSTADRVEDLLSRLPLQEKATL 63
Query: 69 LISGAAAVPR-----LGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSF 123
L A A PR +G+ Y W + +HGV + G + P TSFP + + F
Sbjct: 64 LT--ARASPRGNMSSIGLPEYNWGANCVHGVQ-----STCGTNCP--TSFPNPVNLGAIF 114
Query: 124 NATLWEAIGRVVSDEARAM--------YNGGT-AGLTYWSPNVNIFRDPRWGRGQETPGE 174
+ + + +V+ E RA+ Y GG GL WSPN+NI RDPRWGR ETP E
Sbjct: 115 DPQVVFDMAQVIGWELRALWLEGATENYKGGPHLGLDCWSPNININRDPRWGRNTETPSE 174
Query: 175 DPVLSGKYAASYVRGLQGSDGDR-----LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSK 229
DP+++ KY +Y RGLQ +G R L+ + KH+ AY +N+ GV+R F+A VS
Sbjct: 175 DPLVNSKYGVAYTRGLQ--EGKRQDPRFLQAVVTLKHYAAYSYENYGGVNRMEFDAIVSP 232
Query: 230 QDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDC 289
D DT+ FR V++G VMCSYN VNG+P CA+ +++ +RG +GY+ SD
Sbjct: 233 YDFADTYFPAFRSSVVDGNAKGVMCSYNSVNGIPMCANKELVETLLRGTLGFDGYVTSDS 292
Query: 290 DSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVN 349
+V D H+ + EAA AI AG D++ G ++ V L E +++AL +
Sbjct: 293 GAVEAISDMHHYADSQCEAARLAILAGTDINSGKSYEACLKTLVDDNQLEEKALDDALRH 352
Query: 350 TLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHI 409
TL ++ LG+FD QPY ++ P +V T + L+L A R+ +V+L+N LPL
Sbjct: 353 TLKLRFELGLFD-PIDDQPYWNVTPSEVNTAAAKALSLNATRKSLVMLQNNASVLPLQ-- 409
Query: 410 RHRTVAVIGPNSDVTVTMIGNYAGIAC--------GYTTPLQGI---GRYARTIHQQGCK 458
+ +AV+GP++ ++GNY G C TPL I + T +GC
Sbjct: 410 KGVKLAVLGPHAKSKRGLLGNYLGQMCHGDYDEVGCVQTPLDAIRAANGASNTTFAEGCG 469
Query: 459 DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMAS 518
++ F A+ A+++ADA +L +G+D+SIE E DR + LP Q +L+ +V
Sbjct: 470 -ISGNSTAGFEKAVAAAKEADAVVLFLGIDKSIEGEVGDRNNIDLPNIQMQLLQRVHAVG 528
Query: 519 KGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTW 578
+ PT++VL++GG I + R A++ A YPG G A+AD+LFG +NP GKLP+T
Sbjct: 529 R-PTVVVLINGGVIGA--EEIIERTDALVEAFYPGFFGARAMADVLFGDTNPSGKLPVTM 585
Query: 579 YPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
Y +Y+ + M M M +PGRTYR++KG V+PFG G+SYT F +V +
Sbjct: 586 YRSDYVDQVEMKSMDM-----TAHPGRTYRYFKGEPVFPFGWGLSYTTFSLSVDSG---- 636
Query: 639 AVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP 694
+ S NA SG + T + + V VKN G G +L F P
Sbjct: 637 ---TNSSSHSNNAAFSGGEVSDTA------NVTISVVVKNDGEVAGDEVVLAFFRP 683
>gi|410617070|ref|ZP_11328046.1| beta-glucosidase [Glaciecola polaris LMG 21857]
gi|410163339|dbj|GAC32184.1| beta-glucosidase [Glaciecola polaris LMG 21857]
Length = 731
Score = 383 bits (983), Expect = e-103, Method: Compositional matrix adjust.
Identities = 247/729 (33%), Positives = 379/729 (51%), Gaps = 81/729 (11%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
QR N L+ +++ EK+ L A+ RL + Y WW+EALHG++ G A
Sbjct: 38 QRANLLVNAMTVDEKIAQLSHATPAIARLNVPQYNWWNEALHGIARNGK----------A 87
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMY--------NGGTAGLTYWSPNVNIFRD 162
T FPQ I A++F+ L + +SDEARA Y G AGLT+W+PNVNIFRD
Sbjct: 88 TIFPQAIGLAATFDPDLAHQVASAISDEARAKYAIAQSIGNQGQYAGLTFWTPNVNIFRD 147
Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--R 220
PRWGRGQET GEDP L+ + ++V+GLQG D LK A KHF + +G + R
Sbjct: 148 PRWGRGQETYGEDPFLTAQMGTAFVKGLQGDDPKYLKSAGVAKHFAVH-----SGPESLR 202
Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
HF+ + S++D+ +T+ F V + KVA VMC+YN VNG P CA +L ++ +W
Sbjct: 203 HHFDVEPSQKDLYETYLPAFEALVTQAKVAGVMCAYNAVNGEPACASAQLLDGILKKQWG 262
Query: 281 LNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSE 340
+GYIVSDC ++ + T + E+AA A+++G++L+CG ++A+++ L+
Sbjct: 263 FHGYIVSDCGALNDFQAGHKVTKSGPESAALALQSGVNLNCGSTYEHFLKAALEQNLVPL 322
Query: 341 IDINNALVNTLTVQMRLGMFDGEPSS-QPYGHLGPKDVCTPDHQELALEAARQGIVLLKN 399
I+ L L ++ +LG FD P+ PY + P + +P+H L+ + AR+ IVLLKN
Sbjct: 323 ELIDQRLTQLLMIRFQLGFFD--PAGLNPYNEVTPDVIHSPEHINLSRDVARKSIVLLKN 380
Query: 400 QGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIG---RYARTIHQQG 456
LPLS + V GP + + +IGNY GI+ + L+GI +++ +
Sbjct: 381 DNHVLPLSK-DIKVPYVTGPFAASSDMLIGNYYGISDSLVSVLEGIAGKVSLGSSLNYRS 439
Query: 457 CKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL---------DRAGLLLPGRQ 507
+ A ++ ADA I V+G+ +E E + DR + LP Q
Sbjct: 440 GSLPFHNNINPLNWAPQVAKTADAVIAVVGVSADMEGEEVDAIASADRGDRVAITLPQNQ 499
Query: 508 QELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGT 567
+ V +++ KGP ILV+ +G P+D+ + +P AI+W YPG+ GG A+AD+LFG
Sbjct: 500 VDYVKQLAAHKKGPLILVVAAGSPVDI--SDLEPLADAILWIWYPGEQGGNAVADVLFGD 557
Query: 568 SNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNF 627
+NP G LP+T+ + I +LP P GRTY+F + +YPFG G SYT F
Sbjct: 558 TNPSGHLPLTFV--KSIDDLP-------PFDDYAMTGRTYKFLEKAPLYPFGFGRSYTEF 608
Query: 628 VHTVANAPTVVAVPLDGRHGSINATIS-GKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAH 686
+ T+S GKAI + LTL V+V+ N G G
Sbjct: 609 SFN-------------------DLTVSQGKAI-----EGEALTLSVEVE--NRGDIAGET 642
Query: 687 TLLVFSTPPAG-HWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLG 745
+ + +P A + L +F+++H+ + V + I K L V+ +G P G
Sbjct: 643 VVQAYLSPIARMNNEAISSLKSFKRIHLAPKETRWVELTIQ-GKDLYQVNNAGETVWPQG 701
Query: 746 EHNIHIGGT 754
+++ +G +
Sbjct: 702 RYSLAVGDS 710
>gi|319641744|ref|ZP_07996426.1| beta-glucosidase [Bacteroides sp. 3_1_40A]
gi|317386631|gb|EFV67528.1| beta-glucosidase [Bacteroides sp. 3_1_40A]
Length = 702
Score = 382 bits (980), Expect = e-103, Method: Compositional matrix adjust.
Identities = 255/751 (33%), Positives = 378/751 (50%), Gaps = 90/751 (11%)
Query: 52 RVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGAT 111
RV DL+ RL+L+EKV L+ + A+PRLGI Y+WW+EALHGV+ T
Sbjct: 2 RVKDLVARLTLEEKVLLMQHHSPAIPRLGIPAYDWWNEALHGVART---------LEKVT 52
Query: 112 SFPQVITTASSFNATLWEAIGRVVSDEARAMYN-----GGTA----GLTYWSPNVNIFRD 162
FPQ I A++F+ + +G + S E RA++N G T GLTYW+PN+NIFRD
Sbjct: 53 VFPQAIGMAATFDTEALQKMGDITSTEGRALFNEDWKAGKTGTRYRGLTYWTPNINIFRD 112
Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFH 222
PRWGRGQET GEDP L+ K A+ VRGL+G D LK A KH+ + +N R
Sbjct: 113 PRWGRGQETYGEDPYLTAKMGAAIVRGLEGEDPHYLKSVACAKHYAVHSGPEYN---RHS 169
Query: 223 FNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLN 282
F+A+ S D+ DT+ FR V + KV VMC+YN++NG P C + +L +R +W +
Sbjct: 170 FDARPSVFDLWDTYMPAFRELVTKAKVHGVMCAYNRLNGQPCCGNDPLLVDILRNQWHFD 229
Query: 283 GYIVSDCDSVGVYYDTQHFTST-PEE--AAADAIRAGLDLDCGPFLGLHTESAVQRGLLS 339
GY+ SDC ++ D F T PE A +DA+ AG DL+CG L E V++GL S
Sbjct: 230 GYVTSDCWALK---DFAEFHKTHPEHTIAMSDALLAGTDLECGNLYHLLAE-GVKKGLHS 285
Query: 340 EIDINNALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
E DIN +L T+ ++GMFD P+ + PY +G + + H++ A A++ IVLL+
Sbjct: 286 ERDINVSLSRLFTILFKIGMFD--PAERVPYSSIGREVLECEAHKQHAERMAKESIVLLE 343
Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY----ARTIHQ 454
N+ LPL + +++A+IGPN+D T + NY G TP + R + +
Sbjct: 344 NKNHILPLDASKIKSIALIGPNADNGQTQLANYFGTPSEIVTPYMSLKRRLGDKIKINYL 403
Query: 455 QGCKDV-ACADDQLFGAAIDASRQADATILVMGLDQSIE-------------AEALDRAG 500
G V D F + Q+D + V G+ E + DR
Sbjct: 404 PGVGIVDKLKDAPSFVQVAHKAAQSDVIVFVSGISADYEGEAGDAGAAGYGGFASGDRTT 463
Query: 501 LLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
+ LP Q EL+ K+ + P I+V MSG + +F A++ A Y GQA G AI
Sbjct: 464 MQLPLVQIELLKKLKKTGR-PLIIVNMSGSVM--SFEWESQNADALLQAWYGGQAAGDAI 520
Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGH 620
D+LFG NP G++P+T Y + +LP P ++ GRTYR++KG YPFG+
Sbjct: 521 VDVLFGHCNPAGRMPLTTYKSD--NDLP-------PFENYSMLGRTYRYFKGEPRYPFGY 571
Query: 621 GMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVG 680
G+SYT F ++ ++ T +G +VT V V N G
Sbjct: 572 GLSYTTFAYSDVQC--------------VDETHTGDTARVT------------VTVSNTG 605
Query: 681 SKDGAHTLLVFSTPPAG--HWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSG 738
DG + ++ P P L F+++H+ G V + + L++ + G
Sbjct: 606 DCDGDEVVQLYVVHPQDGRKQIPLCALKGFKRIHLKRGESTSVSFTL-TPEELALTETDG 664
Query: 739 TRRIPLGEHNIHIGGTKHSVSLHAATLGVIK 769
G+ + +GG + + + ++L IK
Sbjct: 665 NLVEKNGQVTLFVGGGQPNYAAGVSSLLTIK 695
>gi|291544853|emb|CBL17962.1| Beta-glucosidase-related glycosidases [Ruminococcus champanellensis
18P13]
Length = 697
Score = 381 bits (979), Expect = e-103, Method: Compositional matrix adjust.
Identities = 256/727 (35%), Positives = 379/727 (52%), Gaps = 105/727 (14%)
Query: 46 SLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGG 105
SL +R DL RL+++E+ L A +PRLGI Y WW+E LHGV+ G
Sbjct: 13 SLTPDERAEDLADRLTVEEQASQLRYDALPIPRLGIPAYNWWNEGLHGVARAGT------ 66
Query: 106 DFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNV 157
AT FPQ I A++F+ L IG + + EARA + GLT W+PN+
Sbjct: 67 ----ATMFPQAIGMAATFDTALLHQIGEITATEARAKHMAAREHGDFDIYKGLTLWAPNI 122
Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
N+FRDPRWGRG ET GEDP L+ + ++V+G+QG +G LK AA KHF + +G
Sbjct: 123 NLFRDPRWGRGHETYGEDPFLTARLGVAFVKGMQG-EGKVLKAAACAKHFAVH-----SG 176
Query: 218 VD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTI 275
+ R F+A+VS +D+E+++ F V E KV VM +YN+VNG P+CA P ++ +
Sbjct: 177 PEALRHSFDAQVSPKDLEESYLPAFHALVAEAKVEGVMGAYNRVNGEPSCASPMLMDKL- 235
Query: 276 RGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQR 335
+W GY VSDC ++ ++ T E+AA A+R G DL+CG L+ +A++
Sbjct: 236 -HQWGFAGYFVSDCWAIQDFHKHHGVTKNVTESAALALRTGCDLNCGNTY-LYVLAALEE 293
Query: 336 GLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIV 395
GL+ DI A + L ++RLG+FD EP + + +P H+ ++L A + +V
Sbjct: 294 GLIDAADIRRACIRVLRTRIRLGLFDPEPH---FAACTYDTIASPAHKAVSLSCAEKSMV 350
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY--ARTIH 453
LLKN G LPL + +AVIGPN+D + GNY G A Y T L+GI R +
Sbjct: 351 LLKNDG-ILPLDLSKLHAIAVIGPNADSRAALEGNYCGTADRYVTFLEGIQDAFPGRVHY 409
Query: 454 QQGC-------KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALD 497
QGC ++A ADD+ + A+ A+ +D IL +GLD ++E E + D
Sbjct: 410 AQGCHLYKDRTSNLAMADDR-YAEALAAAEASDVVILCLGLDATLEGEEGDTGNEFSSGD 468
Query: 498 RAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGG 557
+A L LP Q +L+ K+ K P ILVL +G ++ + N A++ A YPGQ GG
Sbjct: 469 KADLRLPPPQCKLLEKLHAVGK-PVILVLAAGSALNPEISCN-----AVLQAWYPGQCGG 522
Query: 558 TAIADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVY 616
A+A ILFG +P GKLP+T+Y E LP T+ +M+ RTYR+ + V+Y
Sbjct: 523 QALAHILFGKVSPSGKLPVTFY--ETAEQLPDFTDYSMQ--------NRTYRYARNNVLY 572
Query: 617 PFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDV 676
PFG+G++Y +V L +G T++ + I+ T DV
Sbjct: 573 PFGYGLTYGK----------IVCTELSYENGCARMTVTNQGIRFTE------------DV 610
Query: 677 KNVGSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD 735
+ KD ++P WA P+ L F ++ + G +R+ I + + S VD
Sbjct: 611 VQLYIKD--------NSP----WAVPNHSLCGFARIGLEPGETRRLEIPVPDSAFES-VD 657
Query: 736 RSGTRRI 742
G R +
Sbjct: 658 EQGVRAV 664
>gi|325192664|emb|CCA27085.1| unnamed protein product [Albugo laibachii Nc14]
Length = 2278
Score = 381 bits (978), Expect = e-103, Method: Compositional matrix adjust.
Identities = 265/762 (34%), Positives = 387/762 (50%), Gaps = 93/762 (12%)
Query: 11 GLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLI 70
GL+ L SS A PF PFC SL + RV DL+ RL L EKV++L
Sbjct: 14 GLLKLILSSDAHACEYPFD---------YFPFCNSSLSLDLRVEDLLQRLQLDEKVRMLT 64
Query: 71 SGAA---AVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATL 127
+ A+ ++PRLG+ Y W + +HGV + GT ATSFP + + F+
Sbjct: 65 ARASTHGSIPRLGVPEYNWGANCVHGVQSTC-GTH------CATSFPNPVNLGAIFDPNE 117
Query: 128 WEAIGRVVSDEARAMY---------NGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVL 178
+ +V+ E RA+ G GL WSPN+NI RDPRWGR ETP EDP +
Sbjct: 118 IYKMAQVIGKELRALRLEGARENYARGPHIGLDCWSPNININRDPRWGRAMETPSEDPYV 177
Query: 179 SGKYAASYVRGLQ-GSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFD 237
+ KY +Y +GLQ G D L+ + KH+ AY +N+ G DR F+A VS D DT+
Sbjct: 178 NAKYGVAYTKGLQEGQDSRFLQAVVTLKHYLAYSYENYGGTDRTQFDAIVSAYDFADTYF 237
Query: 238 VPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYD 297
F V++GK +MCSYN +NG+PTCA+ L + +R + +GYI SD ++ +D
Sbjct: 238 PAFEASVVDGKAKGIMCSYNSLNGIPTCAN-KWLNQLLRDDLEFDGYITSDTGAIQGIFD 296
Query: 298 TQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMR 356
+T T EA A+ +G+D+ G + + A + ID A+ TL ++ +
Sbjct: 297 GHKYTKTLCEATKIAMESGVDICSGNAYWNCLKQLANSTNFSASID--EAIRRTLKLRFQ 354
Query: 357 LGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSH-IRHRTVA 415
LG+FD QP H GP+DV T +L+L+ AR+ IVLL+N G +LPL +R +A
Sbjct: 355 LGLFDA-IGDQP--HFGPEDVRTAKSLQLSLDLARKSIVLLQNHGNTLPLRLGLR---IA 408
Query: 416 VIGPNSDVTVTMIGNYAGIACG--------YTTPLQGI----GRYARTIHQQGC--KDVA 461
VIGP+S ++GNY G C +PL+ I GR T H GC D +
Sbjct: 409 VIGPHSMTRRGIMGNYYGQLCHGDYDEVRCIQSPLEAIQSVNGR-NNTHHVNGCGINDTS 467
Query: 462 CADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGP 521
A+ F A+ A R AD +L +G+D SIE E+ DR + +P Q EL+ + +A K P
Sbjct: 468 TAE---FDDALQAVRTADVAVLFLGIDISIERESKDRDNIDVPHIQLELLKAIRVAGK-P 523
Query: 522 TILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQ 581
T++VL +GG + + K +++ A YPG G AIA+ILFG+ NP GKLP+T Y
Sbjct: 524 TVVVLFNGGILGI--EKLILYADSVLEAFYPGFFGAQAIAEILFGSINPSGKLPVTMYRS 581
Query: 582 EYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVP 641
+I ++ M M+M YPGR+YR+Y VY FG G+SYT F
Sbjct: 582 NFINDVDMKSMSM-----TLYPGRSYRYYTEVPVYSFGWGLSYTTF-------------- 622
Query: 642 LDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGH--- 698
SI + S + H + + ++ + N G G L F P H
Sbjct: 623 ------SIQSIDSHDTRAMNHVLTAQPKM-YRILITNNGKYYGEEVLFAFFRPLDIHATG 675
Query: 699 --WAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSG 738
+ +QL + +V + G + V +++ + L++ DR+G
Sbjct: 676 PVESLQQQLFNYTRVRLDPGDMREVPLHVKD-ENLALHDRNG 716
>gi|348684865|gb|EGZ24680.1| family 3 glycoside hydrolase [Phytophthora sojae]
Length = 769
Score = 380 bits (977), Expect = e-102, Method: Compositional matrix adjust.
Identities = 268/756 (35%), Positives = 392/756 (51%), Gaps = 75/756 (9%)
Query: 27 PFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPR-----LGI 81
P AC K T + LPFC SL RV DL+ RL LQEK LL A A PR +G+
Sbjct: 23 PRACVGK--TNQELPFCNTSLSTADRVEDLLSRLPLQEKATLLT--ARASPRGNMSSIGL 78
Query: 82 KGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA 141
Y W + +HGV + G + P TSFP + + F+ + + +V+ E RA
Sbjct: 79 PEYNWGANCVHGVQ-----STCGTNCP--TSFPNPVNLGAIFDPQVVFDMAQVIGWELRA 131
Query: 142 M--------YNGGT-AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG 192
+ Y GG GL WSPN+NI RDPRWGR ETP EDP+++ KY +Y RGLQ
Sbjct: 132 LWLEGATENYKGGPHLGLDCWSPNININRDPRWGRNTETPSEDPLVNSKYGVAYTRGLQ- 190
Query: 193 SDGDR-----LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEG 247
+G R L+ + KH+ AY +N+ GV+R F+A VS D DT+ FR V++G
Sbjct: 191 -EGKRQDPRFLQAVVTLKHYAAYSYENYGGVNRMEFDAIVSPYDFADTYFPAFRSSVVDG 249
Query: 248 KVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEE 307
VMCSYN VNG+P CA+ +++ +RG +GY+ SD +V D H+ + E
Sbjct: 250 NAKGVMCSYNSVNGIPMCANKELVETLLRGTLGFDGYVTSDSGAVEAISDMHHYADSQCE 309
Query: 308 AAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQ 367
AA AI AG D++ G ++ V L E +++AL +TL ++ LG+FD Q
Sbjct: 310 AARLAILAGTDINSGKSYEACLKTLVDDNQLEEKALDDALRHTLKLRFELGLFD-PIDDQ 368
Query: 368 PYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTM 427
PY ++ P +V T + L+L A R+ +V+L+N LPL + +AV+GP++ +
Sbjct: 369 PYWNVTPSEVNTAAAKALSLNATRKSLVMLQNNASVLPLQ--KGVKLAVLGPHAKSKRGL 426
Query: 428 IGNYAGIAC--------GYTTPLQGI---GRYARTIHQQGCKDVACADDQLFGAAIDASR 476
+GNY G C TPL I + T +GC ++ F A+ A++
Sbjct: 427 LGNYLGQMCHGDYDEVGCVQTPLDAIRAANGASNTTFAEGCG-ISGNSTAGFEKAVAAAK 485
Query: 477 QADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAF 536
+ADA +L +G+D+SIE E DR + LP Q +L+ +V + PT++VL++GG I
Sbjct: 486 EADAVVLFLGIDKSIEGEVGDRNNIDLPNIQMQLLQRVHAVGR-PTVVVLINGGVIGA-- 542
Query: 537 AKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRP 596
+ R A++ A YPG G A+AD+LFG +NP GKLP+T Y +Y+ + M M M
Sbjct: 543 EEIIERTDALVEAFYPGFFGARAMADVLFGDTNPSGKLPVTMYRSDYVDQVEMKSMDM-- 600
Query: 597 SQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGK 656
+PGRTYR++KG V+PFG G+SYT F +V + + S NA SG
Sbjct: 601 ---TAHPGRTYRYFKGEPVFPFGWGLSYTTFSLSVDSG-------TNSSSHSNNAAFSGG 650
Query: 657 AIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA-PHKQ--LVAFE---K 710
+ T + + V VKN G G L + A P ++ LV+F +
Sbjct: 651 EVSDTA------NVTISVVVKNDGEVAGDEVLGPLDSTEVSTLALPDEEGNLVSFPGSYE 704
Query: 711 VHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGE 746
V V G ++R+ ++ V V+ R + PL E
Sbjct: 705 VIVSNGVKERLRFSVEVAGG-EVILRDQVQPFPLSE 739
>gi|268610157|ref|ZP_06143884.1| glycoside hydrolase family 3 protein [Ruminococcus flavefaciens
FD-1]
Length = 690
Score = 380 bits (975), Expect = e-102, Method: Compositional matrix adjust.
Identities = 258/724 (35%), Positives = 366/724 (50%), Gaps = 103/724 (14%)
Query: 46 SLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGG 105
SL +R DL RL+L+E+ L A AV RL I Y WWSE LHGV+ G
Sbjct: 8 SLSAQERAEDLTNRLTLEEQASQLKYDAPAVDRLDIPAYNWWSEGLHGVARAGT------ 61
Query: 106 DFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNV 157
AT FPQ I A+ F+ +G ++ DEARA YN +A GL WSPNV
Sbjct: 62 ----ATMFPQAIGLAAMFDEEAMNKVGSIIGDEARAKYNEYSAHGDHDIYKGLCLWSPNV 117
Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
NIFRDPRWGRGQET GEDP L+ + ++ +GLQG +G+ LK AA KH + +G
Sbjct: 118 NIFRDPRWGRGQETYGEDPYLTTRLGVAFAKGLQG-EGEVLKTAACAKHLAVH-----SG 171
Query: 218 VD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTI 275
+ R F+A S +D+E+T+ F V E KV VM +YN+VNG P CA ++ +
Sbjct: 172 PEAIRHEFDAVASPKDMEETYLPAFEALVKEAKVEGVMGAYNRVNGEPACASKFLMGKL- 230
Query: 276 RGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQR 335
EW +GY VSDC ++ ++ T T E+AA A++ G DL+CG LH A
Sbjct: 231 -DEWGFDGYFVSDCWAIRDFHTNHMVTKTAPESAAMALKLGCDLNCGNTY-LHLLHAYNE 288
Query: 336 GLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIV 395
GL+++ DI A + + ++RLGMFD E Y L V +++ A + + + +V
Sbjct: 289 GLINDEDIKKACTHLMRTRVRLGMFDDETE---YDKLDYSIVANEENKAYARKCSERSMV 345
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY--ARTIH 453
+LKN G LPL + +T+ VIGPN+D + GNY G A Y T L+GI R ++
Sbjct: 346 MLKNNGI-LPLDPSKIKTIGVIGPNADSRPALEGNYNGRADRYITFLEGIQDAFGGRVLY 404
Query: 454 QQG-------CKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALD 497
+G C +A ADD+L A I + +D +L +GLD +IE E + D
Sbjct: 405 SEGSHLYKDRCMGLAVADDRLSEAEI-VTEHSDVVVLCVGLDATIEGEEGDTGNEFSSGD 463
Query: 498 RAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGG 557
+ L LP Q++LV V M P I+V +G I+V + A+I A YPGQ GG
Sbjct: 464 KNDLRLPEAQRKLVETV-MRKGKPVIIVTAAGSAINV-----EADCDALIHAWYPGQFGG 517
Query: 558 TAIADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVY 616
TA+ADILFG +P GKLP+T+Y T LP T+ +M+ GRTYR+ + ++Y
Sbjct: 518 TALADILFGKISPSGKLPVTFYTD--TTKLPEFTDYSMK--------GRTYRYTQDNILY 567
Query: 617 PFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDV 676
PFG+G++Y+ T V+ +K + K + V V
Sbjct: 568 PFGYGLTYSK---------TEVS-----------------DLKFENGKAS-------VKV 594
Query: 677 KNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDR 736
N G D + + + P L F +V + G V + + + VD
Sbjct: 595 TNTGDFDTEDVVQFYIKGEGSDYVPFYSLCGFRRVFLKKGESTVVEVTLGDSAF-EAVDE 653
Query: 737 SGTR 740
+G R
Sbjct: 654 NGRR 657
>gi|424661938|ref|ZP_18098975.1| hypothetical protein HMPREF1205_02324 [Bacteroides fragilis HMW
616]
gi|404578249|gb|EKA82984.1| hypothetical protein HMPREF1205_02324 [Bacteroides fragilis HMW
616]
Length = 722
Score = 379 bits (974), Expect = e-102, Method: Compositional matrix adjust.
Identities = 250/718 (34%), Positives = 374/718 (52%), Gaps = 78/718 (10%)
Query: 45 VSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFG 104
+S PI RV LI +++L EK L+S + ++PRL + Y +W+E LHGV+ G
Sbjct: 54 LSQPIAVRVKTLIQQMTLAEKASQLVSESDSIPRLNLPAYNYWNECLHGVARAGE----- 108
Query: 105 GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPR 164
T FPQ I AS+++ L + + +S EAR Y GLTYWSP +N+ RDPR
Sbjct: 109 -----VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDPR 163
Query: 165 WGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFN 224
WGR +ET GEDP L+ + ++V+GLQG LK A+ KHF A + +N +RF +
Sbjct: 164 WGRNEETYGEDPYLTSRLGVAFVKGLQGDHPAYLKTVATIKHFVANNEEN----NRFSSS 219
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+++ + + + + + CV E V SVM +YN NGVP +L +R EW +G+
Sbjct: 220 SQIPTKQLYEYYFPAYEACVKEADVQSVMTAYNAFNGVPPSGSRWLLGEVLRKEWGFDGF 279
Query: 285 IVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDIN 344
+VSDC ++GV ++ EEAAA + +G DL+CG AV++GL+SE I+
Sbjct: 280 VVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVKQGLISEATID 339
Query: 345 NALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
AL LT + +LG FD P PY H K + ELA EAA + +VLLKN+
Sbjct: 340 QALTRVLTARFKLGEFD--PMELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLLKNEN-L 396
Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGCKD 459
LPLS + ++VAV+GP +D +G Y+G T L+G+ G+ + + G
Sbjct: 397 LPLSKEKTKSVAVVGPFAD--HNYLGGYSGQPPYSITLLKGVKDLMGKRGKVNYLNG--- 451
Query: 460 VACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASK 519
+ + D + + A + D ++ +G D+ + E D + LP Q++L+ + +
Sbjct: 452 IGASRDSI----VAAVKGVDVVLVALGSDEKMARENHDMTSIYLPEEQEKLLKAIYQVNP 507
Query: 520 GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWY 579
+LV SG P+ +A D I AI+ A YPGQ G A+A++LFG NP GKLPMT Y
Sbjct: 508 -RIVLVFHSGNPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLLFGNENPSGKLPMTIY 564
Query: 580 PQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
E LP + + M + GRTYR+ KG +Y FGHG+SYT+F
Sbjct: 565 KTE--EQLPDILDFDM-------WKGRTYRYMKGEPLYSFGHGLSYTSF----------- 604
Query: 639 AVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF----STP 694
D G N T+ AI L V++ N G G + V+ +TP
Sbjct: 605 --EFDNIQG--NDTLQPDAI-----------LQCSVELSNSGQLAGEEVVQVYVSRENTP 649
Query: 695 PAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
+ P K+LVAF+KV + +G +++V I + LSV + G R+ G++ + IG
Sbjct: 650 VYTY--PLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWE-DGKWRMLSGKYTLFIG 703
>gi|291240563|ref|XP_002740191.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 747
Score = 379 bits (974), Expect = e-102, Method: Compositional matrix adjust.
Identities = 252/699 (36%), Positives = 348/699 (49%), Gaps = 75/699 (10%)
Query: 40 LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISG-------AAAVPRLGIKGYEWWSEALH 92
PF SLP +RV+DL+GRL+L+E V + G A + RLGI Y W +E LH
Sbjct: 25 FPFRNTSLPWSERVDDLVGRLTLEEIVLQMSRGGTGSNGPAPPIDRLGIGPYSWNTECLH 84
Query: 93 GVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN-------- 144
G GP ATSFPQ A++F+A L E I + E RA YN
Sbjct: 85 GDVAAGP----------ATSFPQAFGLAATFDAVLIEQIANATAYEVRAKYNNYAKHKEY 134
Query: 145 GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASC 204
G GL+ +SP +NI R P WGR QET GEDP LSG AASYV GLQG+ + A C
Sbjct: 135 GDHKGLSCFSPVINIARHPLWGRIQETYGEDPYLSGTLAASYVNGLQGNHPRYVTANAGC 194
Query: 205 KHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
KHF AY R F+AKVS +D+ TF F C+ G S+MCSYN +NGVP
Sbjct: 195 KHFDAYAGPEDIPSSRSTFDAKVSDRDLRMTFLPAFHECIQAG-THSLMCSYNSINGVPA 253
Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPF 324
CA+ +L +R EW GY++SD +V YD H+T + A + +GL+L+
Sbjct: 254 CANKKLLTDILRTEWNFTGYVISDQSAVEKVYDAHHYTKDMLDTAIACVNSGLNLELSSN 313
Query: 325 LG----LHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTP 380
L + T AV++G ++ + + +MRLG FD P PY L + +
Sbjct: 314 LEDNVMMQTTKAVKQGNVTMKTVKARVSPLFYTRMRLGEFD-PPEMNPYSKLDLSIIQSQ 372
Query: 381 DHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYT- 439
+HQEL+L+AA + VLLKN+ LPL + +AV+GP +D + G+Y+ YT
Sbjct: 373 EHQELSLKAAAKSFVLLKNENRFLPLKE-KIDKLAVVGPLADNVDALYGDYSATPNNYTV 431
Query: 440 TPLQGIGRYA-RTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDR 498
TP G+ R A T + GC + C G A AD ++ +G IE+E DR
Sbjct: 432 TPRNGLARLAGNTSYASGCDNPKCRKYDS-GQVKSAVSGADMVVVCVGTGTDIESEGNDR 490
Query: 499 AGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGT 558
L LPG+Q L+ P IL+L + GP+DV++A +P + I+ +P QA G
Sbjct: 491 HELALPGKQLSLLQDAVKFGTKPVILLLFNAGPLDVSWAVENPAVQTIVACFFPAQATGD 550
Query: 559 AIADILFGT---SNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVV 615
A+ + T SNP G+LPMTW P+ PMT+ M+ GRTYR+ +
Sbjct: 551 ALYRMFMNTSPESNPAGRLPMTW-PRSMEQVPPMTDYTMK--------GRTYRYSDADPL 601
Query: 616 YPFGHGMSYTNF-VHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQV 674
+PFG G+SYT F + + +PTV+ C+ +T + +
Sbjct: 602 FPFGFGLSYTLFKYYNTSASPTVI------------------------KSCDTVT--IPL 635
Query: 675 DVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVH 712
V NVG G + V+ S A P QLV F +V
Sbjct: 636 TVTNVGDFPGDEVMQVYISWSNASVTVPKLQLVGFRRVR 674
>gi|313145345|ref|ZP_07807538.1| beta-glucosidase [Bacteroides fragilis 3_1_12]
gi|313134112|gb|EFR51472.1| beta-glucosidase [Bacteroides fragilis 3_1_12]
Length = 722
Score = 379 bits (973), Expect = e-102, Method: Compositional matrix adjust.
Identities = 251/714 (35%), Positives = 372/714 (52%), Gaps = 76/714 (10%)
Query: 48 PIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDF 107
PI RV LI +++L EK L+S + ++PRL + Y +W+E LHGV+ G
Sbjct: 57 PIAVRVKTLIQQMTLAEKASQLVSESDSIPRLNLPAYNYWNECLHGVARAGE-------- 108
Query: 108 PGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGR 167
T FPQ I AS+++ L + + +S EAR Y GLTYWSP +N+ RDPRWGR
Sbjct: 109 --VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDPRWGR 166
Query: 168 GQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
+ET GEDP L+ + ++V+GLQG LK A+ KHF A + +N +RF ++++
Sbjct: 167 NEETYGEDPYLTSRLGVAFVKGLQGDHPAYLKTVATIKHFVANNEEN----NRFSSSSQI 222
Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
+ + + + + CV E V SVM +YN NGVP +L +R EW +G++VS
Sbjct: 223 PTKQLYEYYFPAYEACVKEAGVQSVMTAYNAFNGVPPSGSRWLLGEVLRKEWGFDGFVVS 282
Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
DC ++GV ++ EEAAA + +G DL+CG AV++GL+SE I+ AL
Sbjct: 283 DCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVKQGLISEATIDQAL 342
Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
LT + +LG FD PY H K + ELA EAA + +VLLKN+ LPLS
Sbjct: 343 TRVLTARFKLGEFD-PMELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLLKNEN-LLPLS 400
Query: 408 HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGCKDVACA 463
+ ++VAV+GP +D +G Y+G T L+G+ G+ + + G + +
Sbjct: 401 KEKTKSVAVVGPFAD--HNYLGGYSGQPPYSVTLLKGVKDLMGKRGKVNYLNG---IGAS 455
Query: 464 DDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTI 523
D + + A + D ++ +G D+ + E D + LP Q++L+ + + +
Sbjct: 456 RDSI----VAAVKGVDVVLVALGSDEKMARENHDMTSIYLPEEQEKLLKAIYQVNP-RIV 510
Query: 524 LVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEY 583
LV SG P+ +A D I AI+ A YPGQ G A+AD+LFG NP GKLPMT Y E
Sbjct: 511 LVFHSGNPLTSEWA--DVHIPAIMQAWYPGQEAGRALADLLFGNENPSGKLPMTIYRAE- 567
Query: 584 ITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPL 642
LP + + M + GRTYR+ K +Y FGHG+SYT+F
Sbjct: 568 -DQLPDILDFDM-------WKGRTYRYMKEDPLYGFGHGLSYTSF-------------GF 606
Query: 643 DGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF----STPPAGH 698
DG GS + SG TL V++ N G G + V+ +TP +
Sbjct: 607 DGIQGS-DTLKSGT------------TLQCSVELSNTGKWTGEEVVQVYVSRENTPVYTY 653
Query: 699 WAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
P K+LVAF+KV + G ++RV NI + LSV + +G R+ G++ + IG
Sbjct: 654 --PLKKLVAFKKVKLAPGEKKRVEFNIP-PRELSVWE-NGNWRMLTGKYTLFIG 703
>gi|51507369|emb|CAH18932.1| beta-xylosidase [Pyrus communis]
Length = 238
Score = 379 bits (972), Expect = e-102, Method: Compositional matrix adjust.
Identities = 185/238 (77%), Positives = 205/238 (86%)
Query: 333 VQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQ 392
++ G ++EIDIN AL NT+TVQMRLGMFDGEPS+Q YG+LG DVC P ELALEAARQ
Sbjct: 1 MRTGQVNEIDINYALANTITVQMRLGMFDGEPSTQRYGNLGLADVCKPSSNELALEAARQ 60
Query: 393 GIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTI 452
GIVLL+N+G SLPLS IRHRTVAVIGPNSDVT TMIGNYAG+ACGYTTPLQGI RY RTI
Sbjct: 61 GIVLLENRGNSLPLSTIRHRTVAVIGPNSDVTETMIGNYAGVACGYTTPLQGIARYTRTI 120
Query: 453 HQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVS 512
HQ GC DV C +QL GAA A+RQADAT+LV+GLDQSIEAE DR LLLPG QQELVS
Sbjct: 121 HQAGCTDVHCNGNQLIGAAEVAARQADATVLVIGLDQSIEAEFRDRTNLLLPGHQQELVS 180
Query: 513 KVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNP 570
+V+ AS+GPTILV+MSGGPIDV FAKNDPRI AIIW GYPGQAGGTAIAD+LFGT+NP
Sbjct: 181 RVARASRGPTILVIMSGGPIDVMFAKNDPRIGAIIWVGYPGQAGGTAIADVLFGTTNP 238
>gi|423279990|ref|ZP_17258903.1| hypothetical protein HMPREF1203_03120 [Bacteroides fragilis HMW
610]
gi|404584326|gb|EKA88991.1| hypothetical protein HMPREF1203_03120 [Bacteroides fragilis HMW
610]
Length = 722
Score = 378 bits (971), Expect = e-102, Method: Compositional matrix adjust.
Identities = 250/717 (34%), Positives = 375/717 (52%), Gaps = 76/717 (10%)
Query: 45 VSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFG 104
+S PI RV LI +++L EK L+S + ++PRL + Y +W+E LHGV+ G
Sbjct: 54 LSQPIAVRVKTLIQQMTLAEKASQLVSESDSIPRLNLPAYNYWNECLHGVARAGE----- 108
Query: 105 GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPR 164
T FPQ I AS+++ L + + +S EAR Y GLTYWSP +N+ RDPR
Sbjct: 109 -----VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDPR 163
Query: 165 WGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFN 224
WGR +ET GEDP L+ + ++V+GLQG LK A+ KHF A + +N +RF +
Sbjct: 164 WGRNEETYGEDPYLTSRLGVAFVKGLQGDHPAYLKTVATIKHFVANNEEN----NRFSSS 219
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+++ + + + + + CV E V SVM +YN NGVP +L +R EW +G+
Sbjct: 220 SQIPTKQLYEYYFPAYEACVKEAGVQSVMTAYNAFNGVPPSGSRWLLGEVLRKEWGFDGF 279
Query: 285 IVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDIN 344
+VSDC ++GV ++ EEAAA + +G DL+CG AV++GL+SE I+
Sbjct: 280 VVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVKQGLISEATID 339
Query: 345 NALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSL 404
AL LT + +LG FD PY H K + ELA EAA + +VLLKN+ L
Sbjct: 340 QALTRVLTARFKLGEFD-PMELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLLKNEN-LL 397
Query: 405 PLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGCKDV 460
PLS + ++VAV+GP +D +G Y+G T L+G+ G+ + + G +
Sbjct: 398 PLSKEKTKSVAVVGPFAD--HNYLGGYSGQPPYSVTLLKGVKDLMGKRGKVNYLNG---I 452
Query: 461 ACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKG 520
+ D + + A + D ++ +G D+ + E D + LP Q++L+ + +
Sbjct: 453 GASRDSI----VAAVKGVDVVLVALGSDEKMARENHDMTSIYLPEEQEKLLKAIYQVNP- 507
Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
+LV SG P+ +A D I AI+ A YPGQ G A+AD+LFG NP GKLPMT Y
Sbjct: 508 RIVLVFHSGNPLTSEWA--DVHIPAIMQAWYPGQEAGRALADLLFGNENPSGKLPMTIYR 565
Query: 581 QEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
E LP + + M + GRTYR+ K +Y FGHG+SYT+F
Sbjct: 566 AE--DQLPDILDFDM-------WKGRTYRYMKEDPLYGFGHGLSYTSF------------ 604
Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF----STPP 695
DG GS + SG ++ + V++ N G G + V+ +TP
Sbjct: 605 -GFDGIQGS-DTLKSGARLQCS------------VELSNTGKWTGEEVVQVYVSRENTPV 650
Query: 696 AGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
+ P K+LVAF+KV + G ++RV NI + LSV + +G R+ G++ + IG
Sbjct: 651 YTY--PLKKLVAFKKVKLAPGEKKRVEFNIP-PRELSVWE-NGNWRMLTGKYTLFIG 703
>gi|374316077|ref|YP_005062505.1| beta-glucosidase-like glycosyl hydrolase [Sphaerochaeta pleomorpha
str. Grapes]
gi|359351721|gb|AEV29495.1| beta-glucosidase-like glycosyl hydrolase [Sphaerochaeta pleomorpha
str. Grapes]
Length = 701
Score = 378 bits (971), Expect = e-102, Method: Compositional matrix adjust.
Identities = 252/739 (34%), Positives = 378/739 (51%), Gaps = 94/739 (12%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
++ L+ +SL+E L+ A A+PRLG+ Y WW+EALHG + G A
Sbjct: 8 EQAKQLVAHMSLKEMFSQLLHEAPAIPRLGLPRYNWWNEALHGAARSGT----------A 57
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRD 162
T FPQ I A+ F+ + I V+S E RA YN +A GLT WSPNVNIFRD
Sbjct: 58 TVFPQAIGLAAMFDDVFLKEIATVISTEQRAKYNTFSALGDRGIYKGLTLWSPNVNIFRD 117
Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--R 220
PRWGRGQET GEDP L+ + S+++GLQG DG LK AA KHF + +G + R
Sbjct: 118 PRWGRGQETYGEDPYLASQLGVSFIQGLQG-DGPYLKTAACVKHFAVH-----SGPEPLR 171
Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
FNA VS++D+ +T+ F CV EG+V +VM +Y+ VNG P C P ++ +R +W
Sbjct: 172 HDFNAIVSRKDLYETYLPAFEACVKEGEVNAVMGAYSAVNGEPCCGSPFLITDILRNDWG 231
Query: 281 LNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLS 339
G +SDC ++ ++ T ++ A A+ AG DL+CG +L L E A Q+GL+
Sbjct: 232 FEGMYISDCWAIRDFHLNHAVTKNQVDSVALALNAGCDLNCGCEYLSL--EKAYQQGLID 289
Query: 340 EIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKN 399
I A + +T + LG+F + + Y ++G + T +H+++A +A+ +VLLKN
Sbjct: 290 RKTITQACIRVMTTRFALGLFSEDCT---YSNIGYEQNDTEEHRKVAFKASCNSLVLLKN 346
Query: 400 QGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG----IGRYARTIHQQ 455
G LPL +A+IGPN+D + GNY G + YTT L+G +G + + Q
Sbjct: 347 DG-MLPLDSRSLHAIAIIGPNADSREALWGNYHGTSSTYTTVLEGFRKTLGESVKVKYSQ 405
Query: 456 G-------CKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRA 499
G + +A +D++ AI + +D IL +G D+++E E A D+
Sbjct: 406 GSAIQKEKLERLAEPNDRI-AEAIAVATVSDTIILCLGYDETVEGEMHDDGNGGWAGDKQ 464
Query: 500 GLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTA 559
L LP Q+ L+ V+ K P +LVL+SGG ID + P + A++ YPGQ GG A
Sbjct: 465 DLRLPPCQRALLKAVASTGK-PIVLVLLSGGAIDPEIERF-PNVKALLQGWYPGQEGGLA 522
Query: 560 IADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFG 619
IA + G +NP G LP+T+Y E T LP R GRTYR+ + V+YPFG
Sbjct: 523 IAHTILGLNNPSGHLPVTFYRSE--TVLP-------DFCDYRMEGRTYRYVQEKVLYPFG 573
Query: 620 HGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNV 679
G+SYT F + N T G+ N +S V N
Sbjct: 574 FGLSYTTF--SYGNLST-------GKQADGNLELS-------------------FIVSNS 605
Query: 680 GSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGT 739
G+++G + ++ + P+ L F + + G + V I + + S +D G
Sbjct: 606 GNREGREVVQIYCHSDHPFFPPNPVLCGFTSLVLQPGEHKTVTQTI-LAEAFSAIDPEGK 664
Query: 740 RRIPLGEHNIHIGGTKHSV 758
R G ++++G + ++
Sbjct: 665 RIALKGWFDLYVGNHQKAL 683
>gi|383117083|ref|ZP_09937830.1| hypothetical protein BSHG_0813 [Bacteroides sp. 3_2_5]
gi|251947612|gb|EES87894.1| hypothetical protein BSHG_0813 [Bacteroides sp. 3_2_5]
Length = 722
Score = 377 bits (967), Expect = e-101, Method: Compositional matrix adjust.
Identities = 249/718 (34%), Positives = 374/718 (52%), Gaps = 78/718 (10%)
Query: 45 VSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFG 104
+S P+ RV LI +++L EKV L+S + ++PRL + Y +W+E LHGV+ G
Sbjct: 54 LSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE----- 108
Query: 105 GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPR 164
T FPQ I AS+++ L + + +S EAR Y GLTYWSP +N+ RDPR
Sbjct: 109 -----VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDPR 163
Query: 165 WGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFN 224
WGR +ET GEDP L+ + ++V+GLQG LK A+ KHF A + +N +RF +
Sbjct: 164 WGRNEETYGEDPHLTSRLGVAFVKGLQGDHPTYLKTVATIKHFVANNEEN----NRFSSS 219
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+++ + + + + + CV E SVM +YN NGVP +L +R EW +G+
Sbjct: 220 SQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHWLLDDVLRKEWGFDGF 279
Query: 285 IVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDIN 344
+VSDC ++GV ++ EEAAA + +G DL+CG AV++GL+SE+ I+
Sbjct: 280 VVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVEQGLISEVAID 339
Query: 345 NALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
AL LT + +LG FD P PY H K + ELA EAA + +VLLKN
Sbjct: 340 RALTRVLTARFKLGEFD--PMELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLLKNDA-L 396
Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG----IGRYARTIHQQGCKD 459
LPL+ + ++VAV+GP +D +G Y+G + L+G IG+ + + G
Sbjct: 397 LPLNKEKIKSVAVVGPFAD--YNYLGGYSGQPPYSVSLLKGVKELIGKKGKVTYLNGMGT 454
Query: 460 VACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASK 519
A + Q+ + AD ++ +G D+ + E D + LP Q++L+ K+ +
Sbjct: 455 SADSIAQVV-------KGADIVLVALGSDEKMARENHDMPSIYLPEEQEKLLKKIYQVNP 507
Query: 520 GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWY 579
+LV +G P+ +A D I AI+ A YPGQ G A+A++LFG NP GKLPMT Y
Sbjct: 508 -RIVLVFHTGNPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLLFGNENPSGKLPMTIY 564
Query: 580 PQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
E LP + + M + GRTYR+ KG +Y FGHG+SYT+F
Sbjct: 565 KTE--EQLPDILDFDM-------WKGRTYRYMKGEPLYGFGHGLSYTSF----------- 604
Query: 639 AVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF----STP 694
D G N T+ AI L V++ N G G + V+ +TP
Sbjct: 605 --EFDNIQG--NDTLQPDAI-----------LQCSVELSNSGQLAGEEVVQVYVSRENTP 649
Query: 695 PAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
+ P K+LVAF+KV + +G +++V I + LSV + G R+ G++ + IG
Sbjct: 650 VYTY--PLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWE-DGKWRMLSGKYTLFIG 703
>gi|317474362|ref|ZP_07933636.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
gi|316909043|gb|EFV30723.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
Length = 723
Score = 377 bits (967), Expect = e-101, Method: Compositional matrix adjust.
Identities = 259/763 (33%), Positives = 384/763 (50%), Gaps = 103/763 (13%)
Query: 35 ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
A + P+ SL +R DL+ RL+L+EK+ L+ + ++AV RLGIK YEWW+EALHGV
Sbjct: 18 ACSNQAPYQNKSLSPTERAADLVSRLTLEEKITLMQNNSSAVKRLGIKPYEWWNEALHGV 77
Query: 95 SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT------- 147
+ G AT +PQ I +SFN TL + +SDEAR Y
Sbjct: 78 ARNGL----------ATVYPQAIGMGASFNDTLLYQVFTSISDEARVKYRQAREAGNYKR 127
Query: 148 -AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASC-K 205
GLT+W+PN+NIFRDPRWGRGQET GEDP L+ + S V GLQG + +C K
Sbjct: 128 YTGLTFWTPNINIFRDPRWGRGQETYGEDPYLTSRMGLSVVNGLQGPQNTKYNKTHACAK 187
Query: 206 HFTAYDLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
H+ + WN R FNA+ ++ +D+ +T+ F+ V++G V VMC+YN+ G P
Sbjct: 188 HYAVHSGPEWN---RHSFNAENINPRDLWETYLPAFQDLVIQGNVKEVMCAYNRFEGDPC 244
Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSV-GVYYDTQHFT-STPEEAAADAIRAGLDLDCG 322
C +L +R EW G +VSDC ++ Y+ +H T +A+A A+ +G DL+CG
Sbjct: 245 CGSDRLLINILRNEWNYKGLVVSDCGAIDNFYFKGRHETHKNKADASAAAVLSGTDLECG 304
Query: 323 -PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD 381
+ GL SAV+ GL++E I+ +L + + LG D + P+ L +
Sbjct: 305 RSYTGLI--SAVKEGLINESAIDQSLCRLMKARFELGEMD---DTTPWDQLPDSLLSCHA 359
Query: 382 HQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTP 441
HQ+LAL+ AR+ + LL+N LPL + TVA+IGPN++ +V NY G T
Sbjct: 360 HQQLALQMARESMTLLQNHKNILPLD--KEMTVALIGPNANDSVMQWANYNGFPVHTITL 417
Query: 442 LQGIGRY----------ARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSI 491
L+G+ +Y + I Q V + + A I+ + +AD I G+ S+
Sbjct: 418 LEGLTQYLPQERLIYIPQKNIEVQKYPWVNYYPNDI-QAVINQAAKADVIIYAGGISASL 476
Query: 492 EAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDP 541
E E + DR + LP Q++LV K A+ P + V SG + + + +
Sbjct: 477 EGEEMDVDAEGFRGGDRTTIELPNVQRKLV-KALKATGKPIVFVNFSGCAMGL---QPES 532
Query: 542 RIA-AIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSK 600
+I AI+ A YPGQAGGTAIA++LFG NP G+LP+T+Y ++ LP E
Sbjct: 533 QICDAILQAWYPGQAGGTAIAEVLFGDYNPAGRLPITFYKKD--NQLPDFE-------DY 583
Query: 601 RYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKV 660
GRTYR+ +YPFGHG+SYT F ++ P + +G + +KV
Sbjct: 584 NMQGRTYRYLNYEPLYPFGHGLSYTTFSYST---PFI-------ENGKLK-------VKV 626
Query: 661 THAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQR 720
T++ +Q+ +K DG P K L F+++H+PAG
Sbjct: 627 TNSGNYNGDEVIQLYIKRYDDPDG----------------PLKTLRGFQRIHIPAGQTSE 670
Query: 721 VGINIHVCKYLSVVDRSGTRRIPL-GEHNIHIGGTKHSVSLHA 762
V + + D+ PL G + I +GGT L +
Sbjct: 671 VSFPL-TSDTFTWWDKDSNTVHPLQGRYKILVGGTSEESLLKS 712
>gi|386347261|ref|YP_006045510.1| glycoside hydrolase family protein [Spirochaeta thermophila DSM
6578]
gi|339412228|gb|AEJ61793.1| glycoside hydrolase family 3 domain protein [Spirochaeta
thermophila DSM 6578]
Length = 693
Score = 375 bits (964), Expect = e-101, Method: Compositional matrix adjust.
Identities = 249/727 (34%), Positives = 378/727 (51%), Gaps = 86/727 (11%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
+R+ L+ ++S++EK L++ A +PRLGI Y WW+EALHGV+N G A
Sbjct: 5 ERMTSLLSKMSIEEKAGLMLHRAKGIPRLGIPHYNWWNEALHGVANSGE----------A 54
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNG--------GTAGLTYWSPNVNIFRD 162
T FPQ I A++F+ L + +S EARA +N GLT+WSPN+NI+RD
Sbjct: 55 TVFPQAIGLAATFDPDLVRRVAEAISTEARAKFNAIGKERAAEYERGLTFWSPNINIYRD 114
Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--R 220
PRWGRGQET GEDP L+ K S+V+GLQG ++VAA KH+ + +G + R
Sbjct: 115 PRWGRGQETYGEDPFLTSKIGVSFVKGLQGDHPYYMRVAACAKHYAVH-----SGPEGLR 169
Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
F+A+VS++D+ +T+ F V G V +VM +YN+VNG P C +L +R W
Sbjct: 170 HVFDARVSEKDLWETYLPAFEALVKAG-VEAVMGAYNRVNGEPACGSKRLLDEILRKRWG 228
Query: 281 LNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSE 340
G++VSDC ++ ++ T P E+ A A+ AG DL+CG H AV+ G++SE
Sbjct: 229 FKGHVVSDCWAIADFHLHHKVTKDPIESIAMALEAGCDLNCGNTYE-HLLDAVKAGVVSE 287
Query: 341 IDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
++ ++ L+ RLG+F + PY L D+ H+ LA EAA + +VLLKN
Sbjct: 288 ELVDRSVARLLSTLDRLGLFTDD---HPYARLSLSDIDWEAHRALAREAAEKSVVLLKNN 344
Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA----RTIHQQG 456
G LP + R + V GPN+ V ++GNYAG++ T L+GI YA ++ G
Sbjct: 345 G-ILPFDRQKLRYIYVTGPNAANPVALLGNYAGVSSRLVTVLEGITGYAGPGITVTYKIG 403
Query: 457 CKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDR---------AGLLLPGRQ 507
C + + A+ +R AD T+ VMG D ++E E D + L LP Q
Sbjct: 404 CPLQGNKINPIDWAS-GVARYADVTVAVMGRDSTVEGEEGDAIFSDNYGDLSDLDLPREQ 462
Query: 508 QELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGT 567
E + ++ K P ++VL+SG P V + + AI++A YPG+ GG AIA +LFG
Sbjct: 463 IEYLRRIKEIGK-PLVVVLLSGAP--VCSPELEELADAIVYAWYPGEEGGNAIARVLFGE 519
Query: 568 SNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNF 627
+P G+LP+T +P+ P T+ +M GRTYR+ + +YPFG G+SY
Sbjct: 520 ISPSGRLPIT-FPRGVDQLPPFTDYSME--------GRTYRYMREEPLYPFGFGLSY--- 567
Query: 628 VHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCN-RLTLGVQVDVKNVGSKDGAH 686
AT S + ++ + ++ + R TL + +V+N S
Sbjct: 568 -----------------------ATFSYRGLQSSASRWDKRETLELVCEVENTSSIPADE 604
Query: 687 TL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLG 745
+ L A P L F +V + AG +++V + + LS +D G + +P G
Sbjct: 605 VVQLYVRWEDAPFRVPLWSLKGFTRVSLGAGERKQVRF-VLSPEELSFIDEEGRKVLPEG 663
Query: 746 EHNIHIG 752
+ H+G
Sbjct: 664 RLHFHVG 670
>gi|375357164|ref|YP_005109936.1| putative glycosyl hydrolase [Bacteroides fragilis 638R]
gi|301161845|emb|CBW21389.1| putative glycosyl hydrolase [Bacteroides fragilis 638R]
Length = 722
Score = 375 bits (963), Expect = e-101, Method: Compositional matrix adjust.
Identities = 249/718 (34%), Positives = 373/718 (51%), Gaps = 78/718 (10%)
Query: 45 VSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFG 104
+S P+ RV LI +++L EKV L+S + ++PRL + Y +W+E LHGV+ G
Sbjct: 54 LSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE----- 108
Query: 105 GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPR 164
T FPQ I AS+++ L + + +S EAR Y GLTYWSP +N+ RDPR
Sbjct: 109 -----VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDPR 163
Query: 165 WGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFN 224
WGR +ET GEDP L+ + ++V+GLQG LK A+ KHF A + +N +RF +
Sbjct: 164 WGRNEETYGEDPHLTSRLGVAFVKGLQGDHPTYLKTVATIKHFVANNEEN----NRFSSS 219
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+++ + + + + + CV E SVM +YN NGVP +L +R EW +G+
Sbjct: 220 SQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHWLLDDVLRKEWGFDGF 279
Query: 285 IVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDIN 344
+VSDC ++GV ++ EEAAA + +G DL+CG AV++GL+SE I+
Sbjct: 280 VVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVEQGLISEAAID 339
Query: 345 NALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
AL LT + +LG FD P PY H K + ELA EAA + +VLLKN
Sbjct: 340 RALTRVLTARFKLGEFD--PMELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLLKNDA-L 396
Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG----IGRYARTIHQQGCKD 459
LPL+ + ++VAV+GP +D +G Y+G + L+G IG+ + + G
Sbjct: 397 LPLNKEKIKSVAVVGPFAD--YNYLGGYSGQPPYSVSLLKGVKELIGKKGKVTYLNGMGT 454
Query: 460 VACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASK 519
A + Q+ + AD ++ +G D+ + E D + LP Q++L+ K+ +
Sbjct: 455 SADSIAQVV-------KGADIVLVALGSDEKMARENHDMPSIYLPEEQEKLLKKIYQVNP 507
Query: 520 GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWY 579
+LV +G P+ +A D I AI+ A YPGQ G A+A++LFG NP GKLPMT Y
Sbjct: 508 -RIVLVFHTGNPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLLFGNENPSGKLPMTIY 564
Query: 580 PQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
E LP + + M + GRTYR+ KG +Y FGHG+SYT+F
Sbjct: 565 KTE--EQLPDILDFDM-------WKGRTYRYMKGEPLYGFGHGLSYTSF----------- 604
Query: 639 AVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF----STP 694
D G N T+ AI L V++ N G G + V+ +TP
Sbjct: 605 --EFDNIQG--NDTLQPDAI-----------LQCSVELSNSGQLAGEEVVQVYVSRENTP 649
Query: 695 PAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
+ P K+LVAF+KV + +G +++V I + LSV + G R+ G++ + IG
Sbjct: 650 VYTY--PLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWE-DGKWRMLSGKYTLFIG 703
>gi|336408348|ref|ZP_08588841.1| hypothetical protein HMPREF1018_00856 [Bacteroides sp. 2_1_56FAA]
gi|423248801|ref|ZP_17229817.1| hypothetical protein HMPREF1066_00827 [Bacteroides fragilis
CL03T00C08]
gi|423253750|ref|ZP_17234681.1| hypothetical protein HMPREF1067_01325 [Bacteroides fragilis
CL03T12C07]
gi|335937826|gb|EGM99722.1| hypothetical protein HMPREF1018_00856 [Bacteroides sp. 2_1_56FAA]
gi|392655379|gb|EIY49022.1| hypothetical protein HMPREF1067_01325 [Bacteroides fragilis
CL03T12C07]
gi|392657742|gb|EIY51373.1| hypothetical protein HMPREF1066_00827 [Bacteroides fragilis
CL03T00C08]
Length = 722
Score = 375 bits (963), Expect = e-101, Method: Compositional matrix adjust.
Identities = 248/718 (34%), Positives = 373/718 (51%), Gaps = 78/718 (10%)
Query: 45 VSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFG 104
+S P+ RV LI +++L EKV L+S + ++PRL + Y +W+E LHGV+ G
Sbjct: 54 LSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE----- 108
Query: 105 GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPR 164
T FPQ I AS+++ L + + +S EAR Y GLTYWSP +N+ RDPR
Sbjct: 109 -----VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDPR 163
Query: 165 WGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFN 224
WGR +ET GEDP L+ + ++V+GLQG LK A+ KHF A + +N +RF +
Sbjct: 164 WGRNEETYGEDPHLTSRLGVAFVKGLQGDHPTYLKTVATIKHFVANNEEN----NRFSSS 219
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+++ + + + + + CV E SVM +YN NGVP +L +R EW +G+
Sbjct: 220 SQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHWLLDDVLRKEWGFDGF 279
Query: 285 IVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDIN 344
+VSDC ++GV ++ EEAAA + +G DL+CG AV++GL+SE I+
Sbjct: 280 VVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVEQGLISEAAID 339
Query: 345 NALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
AL LT + +LG FD P PY H K + ELA EAA + +VLLKN
Sbjct: 340 RALTRVLTARFKLGEFD--PMELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLLKNDA-L 396
Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG----IGRYARTIHQQGCKD 459
LPL+ + ++VAV+GP +D +G Y+G + L+G IG+ + + G
Sbjct: 397 LPLNKEKIKSVAVVGPFAD--YNYLGGYSGQPPYSVSLLKGVKELIGKKGKVTYLNGMGT 454
Query: 460 VACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASK 519
A + Q+ + AD ++ +G D+ + E D + LP Q++L+ ++ +
Sbjct: 455 SADSIAQVV-------KGADIVLVALGSDEKMARENHDMPSIYLPEEQEKLLKEIYQVNP 507
Query: 520 GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWY 579
+LV +G P+ +A D I AI+ A YPGQ G A+A++LFG NP GKLPMT Y
Sbjct: 508 -RIVLVFHTGNPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLLFGNENPSGKLPMTIY 564
Query: 580 PQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
E LP + + M + GRTYR+ KG +Y FGHG+SYT+F
Sbjct: 565 KTE--EQLPDILDFDM-------WKGRTYRYMKGEPLYGFGHGLSYTSF----------- 604
Query: 639 AVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF----STP 694
D G N T+ AI L V++ N G G + V+ +TP
Sbjct: 605 --EFDNIQG--NDTLQSDAI-----------LQCSVELSNSGQLAGEEVVQVYVSRENTP 649
Query: 695 PAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
+ P K+LVAF+KV + +G +++V I + LSV + G R+ G++ + IG
Sbjct: 650 VYTY--PLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWE-DGKWRMLSGKYTLFIG 703
>gi|109897152|ref|YP_660407.1| beta-glucosidase [Pseudoalteromonas atlantica T6c]
gi|109699433|gb|ABG39353.1| Beta-glucosidase [Pseudoalteromonas atlantica T6c]
Length = 733
Score = 375 bits (963), Expect = e-101, Method: Compositional matrix adjust.
Identities = 258/760 (33%), Positives = 383/760 (50%), Gaps = 85/760 (11%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
P+ LP +R+ LI ++L+EK L++G A+ RLG+ Y++W+EALHGV+ G
Sbjct: 25 PWFDTQLPTNERIESLIDAMTLKEKASQLVNGNVAIERLGLPEYDFWNEALHGVARNGR- 83
Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT--------AGLTY 152
AT FPQ I A++F+ L V+SDEARA +N + +GLT+
Sbjct: 84 ---------ATVFPQAIGMAATFDQDLLLQAATVISDEARAKFNVSSEIGNRSKYSGLTF 134
Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL 212
W+PN+NIFRDPRWGRGQET GEDP L+ + + V GLQG LK AA+ KHF +
Sbjct: 135 WTPNINIFRDPRWGRGQETYGEDPYLTAQMGKAMVNGLQGDHPKYLKTAAAAKHFAVH-- 192
Query: 213 DNWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNI 270
+G + R F+A S++D+ +T+ F V E V +VM +YN+VNG P +
Sbjct: 193 ---SGPEALRHEFDAIASEKDMYETYFPAFEALVTEADVETVMAAYNRVNGHPAGGSDFL 249
Query: 271 LKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE 330
L +R +W +G+IVSDC + +++ T+ E+AA AI G DL+CG +
Sbjct: 250 LNTVLRDKWGFSGHIVSDCWGLADFHEYHKVTANAVESAALAINTGTDLNCGSVYTALPD 309
Query: 331 SAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAA 390
AV+ GL+ E I+ L L + +LG FD + PY + V + H ++A E A
Sbjct: 310 -AVEAGLVDEKTIDTRLHKVLATKFKLGFFDPK-DDNPYNSISADVVNSDAHADVAYEMA 367
Query: 391 RQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIG---R 447
+ IVLL+N+ LPL R V V GP + + ++GNY G++ T L GI
Sbjct: 368 VKSIVLLQNENQVLPLDK-NIRNVYVTGPFASSSEVLLGNYYGLSGKTTNILDGITANVS 426
Query: 448 YARTI-HQQGCKDVACADDQLFGAAIDASRQADATILVMGLD---QSIEAEAL------D 497
TI ++QG + + +A + D I VMGL + E EA+ D
Sbjct: 427 VGTTINYKQGILPYQANVNPIDWTTGEAKQMGDVIIAVMGLSGAYEGEEGEAIASPHKGD 486
Query: 498 RAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIA----AIIWAGYPG 553
R L LP Q E + K+ + P I+VL +G P++V IA AI++A YPG
Sbjct: 487 RLSLDLPEHQIEFLRKLRKDNDKPVIVVLTAGTPVNVT------EIAQLADAIVFAWYPG 540
Query: 554 QAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGP 613
Q GG A+ADILFG +P G+LP+T+ P +E + P GRTYR+
Sbjct: 541 QEGGKAVADILFGERSPSGRLPITF---------PKSEAQLPPYDDYSMQGRTYRYMTEE 591
Query: 614 VVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQ 673
+YPFG G+SY V D N T+ G A ++ + TL V
Sbjct: 592 PMYPFGFGLSY-------------ATVKFD------NITL-GNAEALSSTDGQKGTLDVS 631
Query: 674 VDVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLS 732
V+V N G+++ + L TP AG P + L F+++ + G +V + K L
Sbjct: 632 VNVTNTGTRELEEVVQLYLKTPNAGIDQPIQSLKGFQRIKLAPGQTGQVSFTVS-KKQLY 690
Query: 733 VVDRSGTRRIPLGEHNIHIGGT---KHSVSLHAATLGVIK 769
++ G + G++++ +G K S +L AA + K
Sbjct: 691 SINAKGKPVLLEGDYHVIVGNASPGKRSETLGAAKPQIAK 730
>gi|265765457|ref|ZP_06093732.1| beta-xylosidase [Bacteroides sp. 2_1_16]
gi|263254841|gb|EEZ26275.1| beta-xylosidase [Bacteroides sp. 2_1_16]
Length = 722
Score = 375 bits (963), Expect = e-101, Method: Compositional matrix adjust.
Identities = 249/718 (34%), Positives = 373/718 (51%), Gaps = 78/718 (10%)
Query: 45 VSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFG 104
+S P+ RV LI +++L EKV L+S + ++PRL + Y +W+E LHGV+ G
Sbjct: 54 LSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE----- 108
Query: 105 GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPR 164
T FPQ I AS+++ L + + +S EAR Y GLTYWSP +N+ RDPR
Sbjct: 109 -----VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDPR 163
Query: 165 WGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFN 224
WGR +ET GEDP L+ + ++V+GLQG LK A+ KHF A + +N +RF +
Sbjct: 164 WGRNEETYGEDPHLTSRLGVAFVKGLQGDHPTYLKTVATIKHFVANNEEN----NRFSSS 219
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+++ + + + + + CV E SVM +YN NGVP +L +R EW +G+
Sbjct: 220 SQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHWLLDDVLRKEWGFDGF 279
Query: 285 IVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDIN 344
+VSDC ++GV ++ EEAAA + +G DL+CG AV++GL+SE I+
Sbjct: 280 VVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVEQGLISEAAID 339
Query: 345 NALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
AL LT + +LG FD P PY H K + ELA EAA + +VLLKN
Sbjct: 340 RALTRVLTARFKLGEFD--PMELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLLKNDA-L 396
Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG----IGRYARTIHQQGCKD 459
LPL+ + ++VAV+GP +D +G Y+G + L+G IG+ + + G
Sbjct: 397 LPLNKEKIKSVAVVGPFAD--YNYLGGYSGQPPYSVSLLKGVKELIGKKGKVTYLNGMGT 454
Query: 460 VACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASK 519
A + Q+ + AD ++ +G D+ + E D + LP Q++L+ K+ +
Sbjct: 455 SADSIAQVV-------KGADIVLVALGSDEKMARENHDMPSIYLPEEQEKLLKKIYQVNP 507
Query: 520 GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWY 579
+LV +G P+ +A D I AI+ A YPGQ G A+A++LFG NP GKLPMT Y
Sbjct: 508 -RIVLVFHTGNPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLLFGNENPSGKLPMTIY 564
Query: 580 PQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
E LP + + M + GRTYR+ KG +Y FGHG+SYT+F
Sbjct: 565 KTE--EQLPDILDFDM-------WKGRTYRYMKGEPLYGFGHGLSYTSF----------- 604
Query: 639 AVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF----STP 694
D G N T+ AI L V++ N G G + V+ +TP
Sbjct: 605 --EFDNIQG--NDTLQPDAI-----------LQCSVELSNSGQLAGEEVVQVYVSRENTP 649
Query: 695 PAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
+ P K+LVAF+KV + +G +++V I + LSV + G R+ G++ + IG
Sbjct: 650 VYTY--PLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWE-DGKWRMLSGKYTLFIG 703
>gi|317057539|ref|YP_004106006.1| glycoside hydrolase family protein [Ruminococcus albus 7]
gi|315449808|gb|ADU23372.1| glycoside hydrolase family 3 domain protein [Ruminococcus albus 7]
Length = 691
Score = 375 bits (962), Expect = e-101, Method: Compositional matrix adjust.
Identities = 262/756 (34%), Positives = 390/756 (51%), Gaps = 107/756 (14%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
+ +L +R L ++ +E+ L A AV RLGI Y WW+E +HG++ G
Sbjct: 4 YLDETLSAQERAEALTDEMTTEEQASQLRYDAPAVERLGIPAYNWWNEGIHGLARSGV-- 61
Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYW 153
AT FPQ I A+ F+ L + V S+EARA YN + GLT W
Sbjct: 62 --------ATMFPQAIGLAAMFDDELTKKTAEVTSEEARAKYNAYSGEEDRDIYKGLTLW 113
Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLD 213
+PN+NIFRDPRWGRG ET GEDP L+ K + VRGLQG DG +K AA KHF +
Sbjct: 114 APNINIFRDPRWGRGHETFGEDPYLTTKNGMAVVRGLQG-DGKVIKAAACAKHFAVH--- 169
Query: 214 NWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
+G + R F+AK + +D+E+T+ F V E KV SVM +YN+VNG P CA N L
Sbjct: 170 --SGPEAIRHSFDAKANAKDMEETYLPAFEALVKEAKVESVMGAYNRVNGEPACA-SNFL 226
Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTE 330
++ EW +GY VSDC ++ +++ T+ E+ A A++AG D++CG + L
Sbjct: 227 MDKLK-EWEFDGYFVSDCWAIRDFHENHMVTANAIESTAMALKAGCDVNCGCTYQNLLV- 284
Query: 331 SAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAA 390
A+++G +++ DI A V+ + ++RLGMFD Y + V +H+ ++LE A
Sbjct: 285 -ALEKGAVTKEDIRTACVHLMRTRIRLGMFD---KKTEYDDIPYDKVACKEHKAISLECA 340
Query: 391 RQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI-GRY- 448
+ +V+L+N G LP+ +++T+AVIGPN+D + GNY G++ YTT L GI R+
Sbjct: 341 EKSLVMLENNGI-LPVDTSKYKTIAVIGPNADSRTALEGNYNGLSDRYTTFLNGIQDRFD 399
Query: 449 ARTIHQQGC-------KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE------- 494
R I +GC ++A A D+ + A+ A++ AD TIL +GLD +IE E
Sbjct: 400 GRVIFAEGCHLYKDRVSNLAQAGDR-YAEAVAAAKFADMTILCLGLDATIEGEEGDTGNE 458
Query: 495 --ALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYP 552
+ D+ GL LP Q+ELV K+ MA P + V+ +G I+ + + A+I A YP
Sbjct: 459 FSSGDKNGLTLPPPQRELVKKI-MAVGKPVVTVVCAGSAINT-----ESKPDALIHAFYP 512
Query: 553 GQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYK 611
G GG A+A++LFG +P GKLP+T+Y E LP T+ +M+ GRTYR+
Sbjct: 513 GAEGGKALAEVLFGDVSPSGKLPVTFY--EDTDKLPEFTDYSMK--------GRTYRYTT 562
Query: 612 GPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLG 671
V+YPFG+G++Y + V V + A SGKA +
Sbjct: 563 ENVLYPFGYGLTY--------GSVKVTKVEYKDGKAVVTAENSGKATE------------ 602
Query: 672 VQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYL 731
DV + KD + H P+ L F+++ + G I + K
Sbjct: 603 ---DVIQLYIKDYSE-----------HAVPNVSLCGFKRIKLNEGESAVFEIEVPE-KAF 647
Query: 732 SVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGV 767
+ VD +G R++ G GT +L G+
Sbjct: 648 TAVDDNGVRKV-FGSRFTLFAGTSQPDALSEKLTGI 682
>gi|60680313|ref|YP_210457.1| glycosyl hydrolase [Bacteroides fragilis NCTC 9343]
gi|60491747|emb|CAH06504.1| putative glycosyl hydrolase [Bacteroides fragilis NCTC 9343]
Length = 722
Score = 374 bits (960), Expect = e-100, Method: Compositional matrix adjust.
Identities = 248/718 (34%), Positives = 372/718 (51%), Gaps = 78/718 (10%)
Query: 45 VSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFG 104
+S P+ RV LI +++L EKV L+S + ++PRL + Y +W+E LHGV+ G
Sbjct: 54 LSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE----- 108
Query: 105 GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPR 164
T FPQ I AS+++ L + + +S EAR Y GLTYWSP +N+ RDPR
Sbjct: 109 -----VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDPR 163
Query: 165 WGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFN 224
WGR +ET GEDP L+ + ++V+GLQG LK A+ KHF A + +N +RF +
Sbjct: 164 WGRNEETYGEDPHLTSRLGVAFVKGLQGDHPTYLKTVATIKHFVANNEEN----NRFSSS 219
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+++ + + + + + CV E SVM +YN NGVP +L +R EW +G+
Sbjct: 220 SQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHWLLDDVLRKEWGFDGF 279
Query: 285 IVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDIN 344
+VSDC ++GV ++ EEAAA + +G DL+CG AV++GL+SE I+
Sbjct: 280 VVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVEQGLISEAAID 339
Query: 345 NALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
AL LT + +LG FD P PY H K + ELA EAA + +VLLKN
Sbjct: 340 RALTRVLTARFKLGEFD--PMELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLLKNDA-L 396
Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG----IGRYARTIHQQGCKD 459
LPL+ + ++VAV+GP +D +G Y+G + L+G IG+ + + G
Sbjct: 397 LPLNKEKIKSVAVVGPFAD--YNYLGGYSGQPPYSVSLLKGVKELIGKKGKVTYLNGMGT 454
Query: 460 VACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASK 519
A + Q+ + AD ++ +G D+ + E D + LP Q++ + K+ +
Sbjct: 455 SADSIAQVV-------KGADIVLVALGSDEKMARENHDMPSIYLPEEQEKFLKKIYQVNP 507
Query: 520 GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWY 579
+LV +G P+ +A D I AI+ A YPGQ G A+A++LFG NP GKLPMT Y
Sbjct: 508 -RIVLVFHTGNPLTSEWA--DTHILAIMQAWYPGQEAGRALANLLFGNENPSGKLPMTIY 564
Query: 580 PQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
E LP + + M + GRTYR+ KG +Y FGHG+SYT+F
Sbjct: 565 KTE--EQLPDILDFDM-------WKGRTYRYMKGEPLYGFGHGLSYTSF----------- 604
Query: 639 AVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF----STP 694
D G N T+ AI L V++ N G G + V+ +TP
Sbjct: 605 --EFDNIQG--NDTLQPDAI-----------LQCSVELSNSGQLAGEEVVQVYVSRENTP 649
Query: 695 PAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
+ P K+LVAF+KV + +G +++V I + LSV + G R+ G++ + IG
Sbjct: 650 VYTY--PLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWE-DGKWRMLSGKYTLFIG 703
>gi|423258868|ref|ZP_17239791.1| hypothetical protein HMPREF1055_02068 [Bacteroides fragilis
CL07T00C01]
gi|423264161|ref|ZP_17243164.1| hypothetical protein HMPREF1056_00851 [Bacteroides fragilis
CL07T12C05]
gi|387776448|gb|EIK38548.1| hypothetical protein HMPREF1055_02068 [Bacteroides fragilis
CL07T00C01]
gi|392706427|gb|EIY99550.1| hypothetical protein HMPREF1056_00851 [Bacteroides fragilis
CL07T12C05]
Length = 722
Score = 374 bits (960), Expect = e-100, Method: Compositional matrix adjust.
Identities = 248/718 (34%), Positives = 373/718 (51%), Gaps = 78/718 (10%)
Query: 45 VSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFG 104
+S P+ RV LI +++L EKV L+S + ++PRL + Y +W+E LHGV+ G
Sbjct: 54 LSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE----- 108
Query: 105 GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPR 164
T FPQ I AS+++ L + + +S EAR Y GLTYWSP +N+ RDPR
Sbjct: 109 -----VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDPR 163
Query: 165 WGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFN 224
WGR +ET GEDP L+ + ++V+GLQG LK A+ KHF A + +N +RF +
Sbjct: 164 WGRNEETYGEDPHLTSRLGVAFVKGLQGDHPTYLKTVATIKHFVANNEEN----NRFSSS 219
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+++ + + + + + CV E SVM +YN NGVP +L +R EW +G+
Sbjct: 220 SQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHWLLDDVLRKEWGFDGF 279
Query: 285 IVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDIN 344
+VSDC ++GV ++ EEAAA + +G DL+CG AV++GL+SE I+
Sbjct: 280 VVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVEQGLISEAAID 339
Query: 345 NALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
AL LT + +LG FD P PY H K + ELA EAA + +VLLKN
Sbjct: 340 RALTRVLTARFKLGEFD--PMELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLLKNDA-L 396
Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG----IGRYARTIHQQGCKD 459
LPL+ + ++VAV+GP +D +G Y+G + L+G IG+ + + G
Sbjct: 397 LPLNKEKIKSVAVVGPFAD--YNYLGGYSGQPPYSVSLLKGVKELIGKKGKVTYLNGMGT 454
Query: 460 VACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASK 519
A + Q+ + AD ++ +G D+ + E D + LP Q++L+ ++ +
Sbjct: 455 SADSIAQVV-------KGADIVLVALGSDEKMARENHDMPSIYLPEEQEKLLKEIYQVNP 507
Query: 520 GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWY 579
+LV +G P+ +A D I AI+ A YPGQ G A+A++LFG NP GKLPMT Y
Sbjct: 508 -RIVLVFHTGNPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLLFGNENPSGKLPMTIY 564
Query: 580 PQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
E LP + + M + GRTYR+ KG +Y FGHG+SYT+F
Sbjct: 565 KTE--EQLPDILDFDM-------WKGRTYRYMKGEPLYGFGHGLSYTSF----------- 604
Query: 639 AVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF----STP 694
D G N T+ AI L V++ N G G + V+ +TP
Sbjct: 605 --EFDNIQG--NDTLQPDAI-----------LQCSVELSNSGQLAGEEVVQVYVSRENTP 649
Query: 695 PAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
+ P K+LVAF+KV + +G +++V I + LSV + G R+ G++ + IG
Sbjct: 650 VYTY--PLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWE-DGKWRMLSGKYTLFIG 703
>gi|423281966|ref|ZP_17260851.1| hypothetical protein HMPREF1204_00389 [Bacteroides fragilis HMW
615]
gi|404582453|gb|EKA87147.1| hypothetical protein HMPREF1204_00389 [Bacteroides fragilis HMW
615]
Length = 722
Score = 374 bits (960), Expect = e-100, Method: Compositional matrix adjust.
Identities = 250/719 (34%), Positives = 374/719 (52%), Gaps = 80/719 (11%)
Query: 45 VSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFG 104
+S P+ RV LI +++L EKV L+S + ++PRL + Y +W+E LHGV+ G
Sbjct: 54 LSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE----- 108
Query: 105 GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPR 164
T FPQ I AS+++ L + + +S EAR Y GLTYWSP +N+ RDPR
Sbjct: 109 -----VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDPR 163
Query: 165 WGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFN 224
WGR +ET GEDP L+ + ++V+GLQG LK A+ KHF A + +N +RF +
Sbjct: 164 WGRNEETYGEDPHLTSRLGVAFVKGLQGDHPTYLKTVATIKHFVANNEEN----NRFSSS 219
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+++ + + + + + CV E SVM +YN NGVP +L +R EW +G+
Sbjct: 220 SQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHWLLDDVLRKEWGFDGF 279
Query: 285 IVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDIN 344
+VSDC ++GV ++ EEAAA + +G DL+CG AV++GL+SE I+
Sbjct: 280 VVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVEQGLISEAAID 339
Query: 345 NALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
AL LT + +LG FD P PY H K + ELA EAA + +VLLKN
Sbjct: 340 RALTRVLTARFKLGEFD--PMELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLLKNDA-L 396
Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG----IGRYARTIHQQGCKD 459
LPL+ + ++VAV+GP +D +G Y+G + L+G IG+ + + G
Sbjct: 397 LPLNKEKIKSVAVVGPFAD--YNYLGGYSGQPPYSVSLLKGVKELIGKKGKVTYLNGMGT 454
Query: 460 VACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASK 519
A + Q+ + AD ++ +G D+ + E D + LP Q++L+ ++ +
Sbjct: 455 SADSIAQVV-------KGADIVLVALGSDEKMARENHDMPSIYLPEEQEKLLKEIYQVN- 506
Query: 520 GPTI-LVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTW 578
P I LV +G P+ +A D I AI+ A YPGQ G A+A++LFG NP GKLPMT
Sbjct: 507 -PRIALVFHTGNPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLLFGNENPSGKLPMTI 563
Query: 579 YPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTV 637
Y E LP + + M + GRTYR+ KG +Y FGHG+SYT+F
Sbjct: 564 YKTE--EQLPDILDFDM-------WKGRTYRYMKGEPLYGFGHGLSYTSF---------- 604
Query: 638 VAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF----ST 693
D G N T+ AI L V++ N G G + V+ +T
Sbjct: 605 ---EFDNIQG--NDTLQPDAI-----------LQCSVELSNSGQLAGEEVVQVYVSRENT 648
Query: 694 PPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
P + P K+LVAF+KV + +G +++V I + LSV + G R+ G++ + IG
Sbjct: 649 PVYTY--PLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWE-DGKWRMLSGKYTLFIG 703
>gi|333995841|ref|YP_004528454.1| beta-glucosidase [Treponema azotonutricium ZAS-9]
gi|333737309|gb|AEF83258.1| periplasmic beta-glucosidase (Gentiobiase)(Cellobiase)
(Beta-D-glucoside glucohydrolase) [Treponema
azotonutricium ZAS-9]
Length = 706
Score = 374 bits (959), Expect = e-100, Method: Compositional matrix adjust.
Identities = 248/732 (33%), Positives = 375/732 (51%), Gaps = 103/732 (14%)
Query: 52 RVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGAT 111
R+ ++I +++L+EKV L A AV GI Y WW+E LHGV+ G AT
Sbjct: 6 RIKEMISKMTLEEKVSQLSYDAPAVESAGIPKYNWWNECLHGVARAGL----------AT 55
Query: 112 SFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRDP 163
FPQ I A++F+ ++ +SDE RA YN GLT+W+PNVNIFRDP
Sbjct: 56 VFPQAIALAATFDEAFIRSVADAISDEGRAKYNEAVKRGNRSQYYGLTFWTPNVNIFRDP 115
Query: 164 RWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFH- 222
RWGRGQET GEDP L+G+ ++++GLQG D + LKVAA KH+ + +G ++
Sbjct: 116 RWGRGQETYGEDPYLTGRIGLAFMKGLQGDDTEHLKVAACAKHYAVH-----SGPEKLRH 170
Query: 223 -FNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRL 281
F+A VSK+D+ +T+ F++ V G V +VM +YN+ G P +LK +RG W
Sbjct: 171 TFDAVVSKKDLFETYLPAFKLLVENG-VEAVMGAYNRTLGEPCGGSTYLLKEILRGRWGF 229
Query: 282 NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG---PFLGLHTESAVQRGLL 338
G++ SDC ++ +++ T +PEE+AA A+ AG DL+CG P+L + + ++GL+
Sbjct: 230 KGHVTSDCWAIRDFHENHKVTKSPEESAAMALNAGCDLNCGCTYPYLTV----SHKKGLV 285
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
++ I+ AL L + +LG+FD P PY +LG V H+ LALEAA++ IVLLK
Sbjct: 286 TDETIDTALTRLLRTRFKLGLFD-PPEQDPYRNLGNDIVGCEKHRNLALEAAQKSIVLLK 344
Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTI------ 452
N LPL R + ++GP + +T++ NY G++ T L+G+ +T
Sbjct: 345 NDSNILPLDD-SARKILLMGPGAANILTLLANYYGMSSRLVTILEGLAEKIKTKTAISFE 403
Query: 453 HQQGCKDVACADDQL----FGAA-IDAS------RQADATILVMGLDQSIEAEA------ 495
++QG + + L FG+ +DA + D I V GLD S+E E
Sbjct: 404 YRQGS--LMYEPNHLSNVPFGSTGVDAEAPIYGLDEIDLVIAVYGLDGSMEGEEGDSIAS 461
Query: 496 ---LDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYP 552
DR + LP Q + ++ A K +L+L G PI AF ++ A+++A YP
Sbjct: 462 DANGDRDTIELPSWQLNFLRRIRKAGK-KVVLILTGGSPI--AFPED--LADAVLFAWYP 516
Query: 553 GQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKG 612
G+ GG A+ADILFG +P GKLP+T +PQ P + A++ GRTYR+ K
Sbjct: 517 GEQGGNAVADILFGDVSPSGKLPIT-FPQSTAQLPPYDDYALK--------GRTYRYMKE 567
Query: 613 PVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGV 672
+YPFG G+SYT+F D S + +G ++K
Sbjct: 568 TPLYPFGFGLSYTSF-------------RFDSVELSSSKISAGNSVK------------A 602
Query: 673 QVDVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYL 731
+V V N G +D + L + P L F ++ + AG V I + +
Sbjct: 603 KVQVSNTGKRDAEEVVQLYIAKDNRSEDEPASSLRGFRRLKILAGKSASVEIELPASAFE 662
Query: 732 SVVDRSGTRRIP 743
++ + IP
Sbjct: 663 TINAEGASVLIP 674
>gi|194700280|gb|ACF84224.1| unknown [Zea mays]
Length = 452
Score = 374 bits (959), Expect = e-100, Method: Compositional matrix adjust.
Identities = 199/454 (43%), Positives = 273/454 (60%), Gaps = 10/454 (2%)
Query: 317 LDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKD 376
+D++CG ++ H SA+Q+G ++E DIN AL N V+MRLG+F+G+P YG +GP
Sbjct: 1 MDVNCGSYVQDHGASALQQGKITEQDINRALHNLFAVRMRLGLFNGDPRRNLYGDIGPDQ 60
Query: 377 VCTPDHQELALEAARQGIVLLKNQGPS--LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGI 434
VCT +HQ+LALEAA+ GIVLLKN G + LPLS ++AVIG N++ + + GNY G
Sbjct: 61 VCTQEHQDLALEAAQDGIVLLKNDGGAGALPLSKPNVASLAVIGFNANDAIRLRGNYFGP 120
Query: 435 ACGYTTPLQGIGRYAR-TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEA 493
C TPLQ + Y + T GC AC + A+ A+ AD+ +L MGLDQ E
Sbjct: 121 PCVTVTPLQVLQGYVKDTSFVAGCNSAAC-NVTTIPEAVQAASSADSVVLFMGLDQDQER 179
Query: 494 EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPG 553
E +DR L LPG+QQ L+ V+ A+K P ILVL+ GGP+DV+FAK +P+I AI+WAGYPG
Sbjct: 180 EEVDRLDLTLPGQQQTLIESVANAAKKPVILVLLCGGPVDVSFAKTNPKIGAILWAGYPG 239
Query: 554 QAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGP 613
+AGG AIA +LFG NPGG+LP+TWYPQ++ T +PMT+M MR + YPGRTYRFY+GP
Sbjct: 240 EAGGIAIAQVLFGEHNPGGRLPVTWYPQDF-TRVPMTDMRMRADPATGYPGRTYRFYRGP 298
Query: 614 VVYPFGHGMSYTNFVHTVANA--PTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLG 671
V+ FG+G+SY+ + H A PT L + S + C+RL
Sbjct: 299 TVFNFGYGLSYSKYSHRFATKPPPTSNVAGLKAVEATAGGMASYDVEAIGSETCDRLKFP 358
Query: 672 VQVDVKNVGSKDGAHTLLVFSTPPA---GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVC 728
V V+N G DG H++LVF P G P QL+ F+ +H+ A V + C
Sbjct: 359 AVVRVQNHGPMDGKHSVLVFMRWPNATDGSGRPASQLIGFQSLHLRATQTAHVEFEVSPC 418
Query: 729 KYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHA 762
K+ S G + I G H + +G + +S A
Sbjct: 419 KHFSRATEDGRKVIDQGSHFVMVGEDEFEMSFMA 452
>gi|350295750|gb|EGZ76727.1| glycoside hydrolase [Neurospora tetrasperma FGSC 2509]
Length = 839
Score = 373 bits (957), Expect = e-100, Method: Compositional matrix adjust.
Identities = 241/653 (36%), Positives = 342/653 (52%), Gaps = 75/653 (11%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C V+ P+R L+ +L++ EK+ L+ A R+G+ Y WWSE LHGV+ PG
Sbjct: 37 CDVTGTAPERAASLVDQLTIDEKLVNLVDQALGASRIGLPKYAWWSEGLHGVAG-SPGVT 95
Query: 103 F---GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNI 159
F G F ATSF I +SF+ L +G +S EARA N G GL YW+PNVN
Sbjct: 96 FNTTGYPFSYATSFANAINLGASFDDDLVYEVGTAISTEARAFANFGFGGLDYWTPNVNP 155
Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD 219
++DPRWGRG ETPGEDP+ Y + + GL+G++ R KV A+CKH+ AYDL+ W+G+
Sbjct: 156 YKDPRWGRGAETPGEDPLHIKGYVKAMLAGLEGNETVR-KVIATCKHYAAYDLERWHGLT 214
Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQV------------------NG 261
R+ F A V+ QD+ + + PF+ C + KV S+MCSYN +
Sbjct: 215 RYEFEAIVTLQDLSEYYLPPFQQCARDSKVGSIMCSYNALTIRDMAGGSKPDEIINLTTA 274
Query: 262 VPTCADPNILKRTIRGEWRL---NGYIVSDCDSVGVYY-DTQHFTSTPEEAAADAIRAGL 317
P CA+ L +R W N YI SDC+++ + D +F+ TP EAAA A +AG
Sbjct: 275 QPACAN-TYLMTILRDHWNWTEHNNYITSDCNAILDFLPDNHNFSQTPAEAAAAAYKAGT 333
Query: 318 DLDCGPFLGLHTE--SAVQRGLLSEIDINNALVNTLTVQMRLGMFD-----------GEP 364
D C T+ A + LL E I+ AL +R G D G
Sbjct: 334 DTVCEVSGSPLTDVVGAYNQSLLPEAVIDTALRRLYEGLIRAGYLDHGRSAVAGGDGGSF 393
Query: 365 SSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVT 424
SS Y L DV TP QELAL +A +GIVLLKN G LPL + VA+IG ++ T
Sbjct: 394 SSPAYDALNWNDVNTPSTQELALRSATEGIVLLKNSGSLLPLD-FSGKKVALIGHWANAT 452
Query: 425 VTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACAD-DQLFGAAIDASRQADATIL 483
TM G Y+GI Y PL + ++ V +D D A+ A+ AD +
Sbjct: 453 GTMRGPYSGIPPFYHNPLYAAQQLNLSLSYANGPVVNASDPDTWTAPALAAAEGADVVLY 512
Query: 484 VMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRI 543
G D ++ +E LDR + P Q +L+S+ +A G ++V+ G +D + N+ +
Sbjct: 513 FGGTDTTVASEDLDRESIAWPEAQMKLLSE--LAGLGKPVVVIQLGDQVDDSSLLNNGNV 570
Query: 544 AAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRP------- 596
++I+W GYPGQ+GGTA+ D+L G P G+LP+T YP+ Y+ +P+TEMA+RP
Sbjct: 571 SSILWVGYPGQSGGTAVFDVLTGKKAPAGRLPVTQYPEGYVDEVPLTEMALRPFNHSSSN 630
Query: 597 ----------------------SQSKRYPGRTYRFYKGPVVYPFGHGMSYTNF 627
+++ PGRTY++Y PV+ PFG+G+ YT F
Sbjct: 631 LEEEVSVQGGASLTIQARSTPGNKTLSSPGRTYKWYSTPVL-PFGYGLHYTTF 682
>gi|85813774|emb|CAJ65923.1| xylan 1,4-beta-xylosidase [Populus tremula x Populus alba]
Length = 704
Score = 372 bits (956), Expect = e-100, Method: Compositional matrix adjust.
Identities = 198/481 (41%), Positives = 295/481 (61%), Gaps = 15/481 (3%)
Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
DCD+V V + Q + TPE+A ADA+++G+ +L +T+SAV++ ++ +I+ AL
Sbjct: 229 DCDAVNVLHVEQKYAKTPEDAVADALKSGIS-----YLRNYTKSAVEKKKVTVSEIDRAL 283
Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
N + +MRLG+F+G+P+ Q Y +GP VC+ +HQ LALEAA GIVLLKN LPLS
Sbjct: 284 HNLFSTRMRLGLFNGDPTKQLYSDIGPDQVCSQEHQALALEAALDGIVLLKNADRLLPLS 343
Query: 408 HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTI-HQQGCKDVACADDQ 466
++AVIGPN+ + ++GNY G AC T L+G+ Y + +++GC +V+C
Sbjct: 344 KSGISSLAVIGPNAHNSTNLLGNYFGPACKNVTILEGLRNYVSSASYEKGCNNVSCTSAA 403
Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
++ ++ D ILVMGLDQS E E LDR L+LPG+Q L++ V+ A+K P +LVL
Sbjct: 404 K-KKPVEMAQTEDQVILVMGLDQSQEKERLDRMDLVLPGKQPTLITAVAKAAKRPIVLVL 462
Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNP---GGKLPMTWYPQEY 583
+ G P+DV FAKN+ +I +I+WAGYPGQAG TA+A I+FG NP GG+LPMTWYPQ++
Sbjct: 463 LGGSPMDVTFAKNNRKIGSILWAGYPGQAGATALAQIIFGEHNPGNAGGRLPMTWYPQDF 522
Query: 584 ITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVAN-APTVVAVPL 642
T +PMT+M MRP S PGRTYRFY+G V+ FG+G+SY+++ +T A+ A + V
Sbjct: 523 -TKVPMTDMRMRPQPSTGNPGRTYRFYEGEKVFEFGYGLSYSDYSYTFASVAQNQLNVKD 581
Query: 643 DGRHGSINATISGKAI--KVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFST-PPAGHW 699
N+ G + + +C + V V VKN G G H +L+F+ G
Sbjct: 582 SSNQQPENSETPGYKLVSDIGEEQCENIKFKVTVSVKNEGQMAGKHPVLLFARHAKPGKG 641
Query: 700 APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVS 759
P K+LV F+ V + AG + + + C++LS + G + G + +G +H V+
Sbjct: 642 RPIKKLVGFQTVKLGAGEKTEIEYELSPCEHLSSANEDGVMVMEEGSQILLVGDKEHPVT 701
Query: 760 L 760
+
Sbjct: 702 I 702
Score = 235 bits (600), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 115/221 (52%), Positives = 150/221 (67%), Gaps = 15/221 (6%)
Query: 23 AAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIK 82
A++ P++CD D +T+T FC+ +LPI +R DL+ RL+ +EK L+ + A+PRLGI
Sbjct: 25 ASQPPYSCDSSDPSTKTYDFCKTTLPISRRAEDLVSRLTFEEKATQLVDTSPAIPRLGIP 84
Query: 83 GYEWWSEALHGV---SNVGPGTKF-GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDE 138
YEWWSE LHG+ + V G F ATSFPQVI TA+SF+A +W IG+V E
Sbjct: 85 AYEWWSEGLHGIGFLTRVQQGISFFNRTIQHATSFPQVILTAASFDAHIWYRIGQV-GKE 143
Query: 139 ARAMYNGG-TAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD--- 194
ARA+YN G GL +W+PNVNIFRDPRWGRGQETPGEDP++ GKY AS+VRG+QG
Sbjct: 144 ARALYNAGQVTGLGFWAPNVNIFRDPRWGRGQETPGEDPLVVGKYGASFVRGVQGDSFEG 203
Query: 195 ----GDRLKVAASCKHFTAYDLDNW--NGVDRFHFNAKVSK 229
GD L+ +A CKH+TA+DLDNW + V+ H K +K
Sbjct: 204 ESTLGDHLQASACCKHYTAHDLDNWDCDAVNVLHVEQKYAK 244
>gi|53712125|ref|YP_098117.1| beta-xylosidase [Bacteroides fragilis YCH46]
gi|52214990|dbj|BAD47583.1| beta-xylosidase [Bacteroides fragilis YCH46]
Length = 722
Score = 372 bits (956), Expect = e-100, Method: Compositional matrix adjust.
Identities = 248/718 (34%), Positives = 373/718 (51%), Gaps = 78/718 (10%)
Query: 45 VSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFG 104
+S P+ RV LI +++L EKV L+S + ++PRL + Y +W+E LHGV+ G
Sbjct: 54 LSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE----- 108
Query: 105 GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPR 164
T FPQ I AS+++ L + + +S EAR Y GLTYWSP +N+ RDPR
Sbjct: 109 -----VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDPR 163
Query: 165 WGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFN 224
WGR +ET GEDP L+ + ++V+GLQG LK A+ KHF A + +N +RF +
Sbjct: 164 WGRNEETYGEDPHLTSRLGVAFVKGLQGDHPTYLKTVATIKHFVANNEEN----NRFSSS 219
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+++ + + + + + CV E SVM +YN NGVP +L +R EW +G+
Sbjct: 220 SQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHWLLDDVLRKEWGFDGF 279
Query: 285 IVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDIN 344
+VSDC ++GV ++ EEAAA + +G DL+CG AV++GL+SE I+
Sbjct: 280 VVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVEQGLISEAAID 339
Query: 345 NALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
AL LT + +LG FD P PY H K + ELA EAA + +VLLKN
Sbjct: 340 RALTRVLTARFKLGEFD--PMELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLLKNDA-L 396
Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG----IGRYARTIHQQGCKD 459
LPL+ + ++VAV+GP +D +G Y+G + L+G IG+ + + G
Sbjct: 397 LPLNKEKIKSVAVVGPFAD--YNYLGGYSGQPPYSVSLLKGVKELIGKKGKVTYLNGMGT 454
Query: 460 VACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASK 519
A + Q+ + AD ++ +G D+ + E D + LP Q++L+ ++ +
Sbjct: 455 SADSIAQVV-------KGADIVLVALGSDEKMARENHDMPSIYLPEGQEKLLKEIYQVNP 507
Query: 520 GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWY 579
+LV +G P+ +A D I AI+ A YPGQ G A+A++LFG NP GKLPMT Y
Sbjct: 508 -RIVLVFHTGNPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLLFGNENPSGKLPMTIY 564
Query: 580 PQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
E LP + + M + GRTYR+ KG +Y FGHG+SYT+F
Sbjct: 565 KTE--EQLPDILDFDM-------WKGRTYRYMKGEPLYGFGHGLSYTSF----------- 604
Query: 639 AVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF----STP 694
D G N T+ AI L V++ N G G + V+ +TP
Sbjct: 605 --EFDNIQG--NDTLQPDAI-----------LQCSVELSNSGQLAGEEVVQVYVSRENTP 649
Query: 695 PAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
+ P K+LVAF+KV + +G +++V I + LSV + G R+ G++ + IG
Sbjct: 650 VYTY--PLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWE-DGKWRMLSGKYTLFIG 703
>gi|307719075|ref|YP_003874607.1| glycoside hydrolase family protein [Spirochaeta thermophila DSM
6192]
gi|306532800|gb|ADN02334.1| glycoside hydrolase family 3 [Spirochaeta thermophila DSM 6192]
Length = 693
Score = 371 bits (953), Expect = e-100, Method: Compositional matrix adjust.
Identities = 252/727 (34%), Positives = 372/727 (51%), Gaps = 86/727 (11%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
+R+ L+ R+S++EK L++ A VPRLGI Y WW+EALHGV+N G A
Sbjct: 5 ERMTSLLSRMSIEEKAGLMVHRAKGVPRLGIPNYNWWNEALHGVANSGE----------A 54
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNG--------GTAGLTYWSPNVNIFRD 162
T FPQ I A++F+ L + +S EARA +N GLT+WSPN+NI+RD
Sbjct: 55 TVFPQAIGLAATFDPDLVRRVADAISREARAKFNAVGKERAAEYERGLTFWSPNINIYRD 114
Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--R 220
PRWGRGQET GEDP L+ K ++V+GLQG L+VAA KH+ + +G + R
Sbjct: 115 PRWGRGQETYGEDPFLTSKIGVAFVKGLQGDHPYYLRVAACAKHYAVH-----SGPEGLR 169
Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
F+A+VS++D+ +T+ F V G V +VM +YN+VNG P C +L+ +R +W
Sbjct: 170 HVFDARVSEKDLWETYLPAFEALVKAG-VEAVMGAYNRVNGEPACGSKRLLEEILRKKWG 228
Query: 281 LNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSE 340
G++VSDC ++ ++ T P E+ A A+ AG DL+CG H AV+ G +SE
Sbjct: 229 FKGHVVSDCWAIADFHLHHKVTKDPIESIAMALEAGCDLNCGNTYE-HLLDAVKAGAVSE 287
Query: 341 IDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
++ ++ L+ RLG+F + PY L D+ H+ LA EAA + +VLLKN
Sbjct: 288 ELVDRSVARLLSTLDRLGLFTDD---HPYVRLSLADIDWEAHRALAREAAEKSVVLLKNN 344
Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA----RTIHQQG 456
G LPL + R + V GPN+ V ++GNYAG++ T L+GI YA ++ G
Sbjct: 345 G-ILPLDRRKLRYIYVTGPNAANPVALLGNYAGVSSRLVTVLEGITGYAGPGITVTYKIG 403
Query: 457 CKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDR---------AGLLLPGRQ 507
C + + A+ +R AD T+ VMG D ++E E D + L L Q
Sbjct: 404 CPLQGNKINPIDWAS-GVARYADVTVAVMGRDSAVEGEEGDAIFSDNYGDLSDLNLSREQ 462
Query: 508 QELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGT 567
+ + ++ K P ++VL+SG P V + + AI++A YPG+ GG AIA +LFG
Sbjct: 463 IDYLRRIKEIGK-PLVVVLLSGAP--VCSPELEELADAIVYAWYPGEEGGNAIARVLFGE 519
Query: 568 SNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNF 627
+P G+LP+T +P+ P T+ +M GRTYR+ K +YPFG G+SY F
Sbjct: 520 VSPSGRLPIT-FPKGVDQLPPFTDYSME--------GRTYRYMKEEPLYPFGFGLSYATF 570
Query: 628 VHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHT 687
+ P A D R TL V +V+N S
Sbjct: 571 SY---RDPKSSASRWDKRE----------------------TLEVVCEVENTSSIPADEV 605
Query: 688 L-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVC-KYLSVVDRSGTRRIPLG 745
+ L A P L F +V + G +R+ + + + LS +D G + +P G
Sbjct: 606 VQLYVRWEDAPFRVPLWSLKGFTRVSL--GTGERIQVRFVLSPEDLSFIDEKGRKVLPEG 663
Query: 746 EHNIHIG 752
H+G
Sbjct: 664 RLRFHVG 670
>gi|336261464|ref|XP_003345521.1| hypothetical protein SMAC_07509 [Sordaria macrospora k-hell]
gi|380088197|emb|CCC13872.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 762
Score = 371 bits (952), Expect = e-100, Method: Compositional matrix adjust.
Identities = 250/718 (34%), Positives = 368/718 (51%), Gaps = 56/718 (7%)
Query: 12 LILLSASSSGLAARE-PFACDPK--DATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKL 68
L LS S++ + A + PF P + +L C +L PQR L+ ++ +EK++
Sbjct: 15 LFSLSCSAALVYAIDLPFQTYPDCVNGPLASLKVCDATLSPPQRAAALVAAMTTEEKLQN 74
Query: 69 LISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKF---GGDFPGATSFPQVITTASSFNA 125
L+S + PR+G+ Y WWSEALHGV+ PGT+F G F +TSFP + A++F+
Sbjct: 75 LVSKSKGAPRIGLPAYNWWSEALHGVA-YAPGTQFRSGNGTFNSSTSFPMPLLMAATFDD 133
Query: 126 TLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAAS 185
L E +G V+ E RA N G +G YW+PNVN F+DPRWGRG ETPGED + +YAAS
Sbjct: 134 ELIERVGEVIGIEGRAFGNAGFSGFDYWTPNVNPFKDPRWGRGSETPGEDILRIKRYAAS 193
Query: 186 YVRGLQGSDGDR-LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCV 244
+RGL+G +R ++ A+CKH+ A D ++WNG R FNAKV+ QD+ + + PF+ C
Sbjct: 194 MIRGLEGPVRERERRIVATCKHYAANDFEDWNGSTRHDFNAKVTLQDLAEYYLSPFQQCA 253
Query: 245 MEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNG---YIVSDCDSVGVYYDTQHF 301
+ KV S+MCSYN VNGVP CA+ +++ +R W YI SDC++V H+
Sbjct: 254 RDSKVGSIMCSYNAVNGVPACANTYLMQTILRDHWNWTAPGNYITSDCEAVLDISANHHY 313
Query: 302 TSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFD 361
T E A A AG+D C A +GLL + ++ AL +++G FD
Sbjct: 314 AKTNAEGTALAFEAGIDSSCEYEGSSDILGAWTQGLLKQSTVDRALRRLYEGLVQVGYFD 373
Query: 362 GEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRH---RTVAVIG 418
G S Y LG V P QE+AL+AA +GIVLLKN +LPL ++ +A+IG
Sbjct: 374 GNRSE--YASLGWNHVNRPKSQEVALQAAVEGIVLLKNDK-TLPLGVKKNGPKLKLAMIG 430
Query: 419 PNSDVTVTMIGNYAGIACGYTTPL---QGIGRYARTIHQQGCKDVACADDQLFGAAIDAS 475
++ T+ G Y+G +P+ Q +G T ++ + + D AA+ A+
Sbjct: 431 FWANDPKTLSGGYSGTPAFEHSPVYATQAMGFKVTTAGGPVLQN-STSKDTWTQAALAAA 489
Query: 476 RQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
+ A+ + G D S E DR + P Q +L++ +S K P ++V M G +D
Sbjct: 490 KDANYILYFGGQDTSAAGETKDRTTINWPEAQLQLITDLSKLGK-PLVVVQM-GDQLDNT 547
Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
I +I+WA +P P G+LP+T Y Y +PMT+M +R
Sbjct: 548 PLLASKAINSILWANWP----------------VPAGRLPVTQYHANYTAAVPMTDMTLR 591
Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG 655
PS + PGRTYR+Y P V PFG G+ YT F + P L R G+ G
Sbjct: 592 PSD--KLPGRTYRWYPTP-VQPFGFGLHYTTFKTKIVRLPRFAIKDLLSRCGNAYPDTCG 648
Query: 656 KAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG-HWAPHKQLVAFEKVH 712
++V+V N G + + +L F G P K LV++ ++
Sbjct: 649 LP-------------PLKVEVTNTGKRSSDYVVLAFLKGDVGPKPYPIKTLVSYTRLR 693
>gi|291530120|emb|CBK95705.1| Beta-glucosidase-related glycosidases [Eubacterium siraeum 70/3]
Length = 689
Score = 371 bits (952), Expect = e-99, Method: Compositional matrix adjust.
Identities = 256/741 (34%), Positives = 386/741 (52%), Gaps = 112/741 (15%)
Query: 47 LPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGD 106
L +R L LS +E+ + L A A+ + G+ Y WW+E LHGV+ G
Sbjct: 9 LSAYERAAALADTLSTEEQAQQLKYDAPAIEKAGLPSYNWWNEGLHGVARAGT------- 61
Query: 107 FPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVN 158
AT FPQ I A++F+ + +G VVS EARAMYN GLT W+PN+N
Sbjct: 62 ---ATVFPQAIALAAAFDKDMMCRVGEVVSTEARAMYNSAAKHGDTDIYKGLTLWAPNIN 118
Query: 159 IFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGV 218
IFRDPRWGRG ET GEDP L+ + ++V+G+QG + L+ AA KHF + +G
Sbjct: 119 IFRDPRWGRGHETYGEDPYLTSRLGVNFVKGIQGEE-KYLRAAACAKHFAVH-----SGP 172
Query: 219 D--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIR 276
+ R F+A+VS++D+E+T+ F+ V EG+V VM +YN+VNG P+CA ++ + +R
Sbjct: 173 ESLRHEFDARVSEKDLEETYLPAFKALVKEGRVEGVMGAYNRVNGEPSCASEKLMGK-LR 231
Query: 277 GEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRG 336
EW +GY VSDC ++ ++ T T ++AA A++AG D++CG LH +A++ G
Sbjct: 232 -EWGFDGYFVSDCGAIRDFHTNHKITDTAPQSAAMALKAGCDVNCGNTY-LHILAALEEG 289
Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-HQELALEAARQGIV 395
L+++ DI A ++ L ++RLG D + L P D+ D ++ L+LEAA + +V
Sbjct: 290 LITKQDIRTACIHALRTRIRLGQLD----DNEFDDL-PFDIIACDGNKALSLEAAEKSMV 344
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY--ARTIH 453
LL N G LPL R ++AVIGPN+D ++GNY G T L+GI R +
Sbjct: 345 LLHNDGI-LPLDKSRISSIAVIGPNADSRAALLGNYEGTPDRSVTFLEGIQDAFDGRVYY 403
Query: 454 QQGCKDVACADDQLFGAAIDASR---------QADATILVMGLDQSIEAE-------ALD 497
+GC+ D+ G A+ R AD T++ +GLD ++E E + D
Sbjct: 404 AEGCQ---LFRDRTQGLALPGDRYAEAVAACEAADVTVVCVGLDSTLEGEEGDTENKSGD 460
Query: 498 RAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGG 557
+ L LP Q+ L+ K+ K P I+VL +G ++ N A+I A YPGQ GG
Sbjct: 461 KPDLRLPEVQRVLLQKLKDTGK-PLIIVLAAGSSVNTECEGN-----ALINAWYPGQYGG 514
Query: 558 TAIADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFY--KGPV 614
A+A+ILFG +P GKLP+T+Y + LP T+ +M+ RTYRF + V
Sbjct: 515 KALAEILFGEVSPSGKLPVTFYKSADM--LPDFTDYSMK--------NRTYRFCDDESNV 564
Query: 615 VYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQV 674
+YPFG+G++Y++F G I+ + A+ VT
Sbjct: 565 LYPFGYGLTYSHF-----------------ECGDISYKDNTLAVNVT------------- 594
Query: 675 DVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVV 734
N GS+ L V+ G + L AFE+V + G + + INI + V
Sbjct: 595 ---NTGSRSAEDVLQVYIRSENG--VKNHSLCAFERVSLFDGESRTISINIPEGAF-ETV 648
Query: 735 DRSGTRRIPLGEHNIHIGGTK 755
D +G R + G + ++ G T+
Sbjct: 649 DDNGVRAVRSGRYTLYAGFTQ 669
>gi|423269271|ref|ZP_17248243.1| hypothetical protein HMPREF1079_01325 [Bacteroides fragilis
CL05T00C42]
gi|423273165|ref|ZP_17252112.1| hypothetical protein HMPREF1080_00765 [Bacteroides fragilis
CL05T12C13]
gi|392701693|gb|EIY94850.1| hypothetical protein HMPREF1079_01325 [Bacteroides fragilis
CL05T00C42]
gi|392708197|gb|EIZ01305.1| hypothetical protein HMPREF1080_00765 [Bacteroides fragilis
CL05T12C13]
Length = 722
Score = 371 bits (952), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 247/718 (34%), Positives = 373/718 (51%), Gaps = 78/718 (10%)
Query: 45 VSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFG 104
+S P+ RV LI +++L EKV L+S + ++PRL + Y +W+E LHGV+ G
Sbjct: 54 LSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE----- 108
Query: 105 GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPR 164
T FPQ I AS+++ L + + +S EAR Y GLTYWSP +N+ RDPR
Sbjct: 109 -----VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDPR 163
Query: 165 WGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFN 224
WGR +ET GE+P L+ + ++V+GLQG LK A+ KHF A + +N +RF +
Sbjct: 164 WGRNEETYGEEPHLTSRLGVAFVKGLQGDHPTYLKTVATIKHFVANNEEN----NRFSSS 219
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+++ + + + + + CV E SVM +YN NGVP +L +R EW +G+
Sbjct: 220 SQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHWLLDDVLRKEWGFDGF 279
Query: 285 IVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDIN 344
+VSDC ++GV ++ EEAAA + +G DL+CG AV++GL+SE I+
Sbjct: 280 VVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVEQGLISEAAID 339
Query: 345 NALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
AL LT + +LG FD P PY H K + ELA EAA + +VLLKN
Sbjct: 340 RALTRVLTARFKLGEFD--PMELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLLKNDA-L 396
Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG----IGRYARTIHQQGCKD 459
LPL+ + ++VAV+GP +D +G Y+G + L+G IG+ + + G
Sbjct: 397 LPLNKEKIKSVAVVGPFAD--YNYLGGYSGQPPYSVSLLKGVKELIGKKGKVTYLNGMGT 454
Query: 460 VACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASK 519
A + Q+ + AD ++ +G D+ + E D + LP Q++L+ ++ +
Sbjct: 455 SADSIAQVV-------KGADIVLVALGSDEKMARENHDMPSIYLPEGQEKLLKEIYQVNP 507
Query: 520 GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWY 579
+LV +G P+ +A D I AI+ A YPGQ G A+A++LFG NP GKLPMT Y
Sbjct: 508 -RIVLVFHTGNPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLLFGNENPSGKLPMTIY 564
Query: 580 PQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
E LP + + M + GRTYR+ KG +Y FGHG+SYT+F
Sbjct: 565 KTE--EQLPDILDFDM-------WKGRTYRYMKGEPLYGFGHGLSYTSF----------- 604
Query: 639 AVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF----STP 694
D G N T+ AI L V++ N G G + V+ +TP
Sbjct: 605 --EFDNIQG--NDTLQPDAI-----------LQCSVELSNSGQLAGEEVVQVYVSRENTP 649
Query: 695 PAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
+ P K+LVAF+KV + +G +++V I + LSV + G R+ G++ + IG
Sbjct: 650 VYTY--PLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWE-DGKWRMLSGKYTLFIG 703
>gi|291556907|emb|CBL34024.1| Beta-glucosidase-related glycosidases [Eubacterium siraeum V10Sc8a]
Length = 691
Score = 370 bits (949), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 255/743 (34%), Positives = 387/743 (52%), Gaps = 114/743 (15%)
Query: 47 LPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGD 106
L +R L LS +E+ + L A A+ + G+ Y WW+E LHGV+ G
Sbjct: 9 LSAYERAAALADTLSTEEQAQQLKYDAPAIEKAGLPSYNWWNEGLHGVARAGT------- 61
Query: 107 FPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVN 158
AT FPQ I A++F+ + +G V+S EARAMYN GLT W+PN+N
Sbjct: 62 ---ATVFPQAIALAAAFDKDMMYRVGEVISTEARAMYNSAAKHGDTDIYKGLTLWAPNIN 118
Query: 159 IFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGV 218
IFRDPRWGRG ET GEDP L+ + S+V+G+QG + + L+ AA KHF + +G
Sbjct: 119 IFRDPRWGRGHETYGEDPYLTSRLGVSFVKGIQGEE-EYLRAAACAKHFAVH-----SGP 172
Query: 219 D--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIR 276
+ R F+A+VS++D+E+T+ F+ V EG+V VM +YN+VNG P+CA ++ + +R
Sbjct: 173 ESLRHEFDARVSEKDMEETYLPAFKALVKEGRVEGVMGAYNRVNGEPSCASEKLMGK-LR 231
Query: 277 GEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRG 336
EW +GY VSDC ++ ++ T T T ++AA A++AG D++CG LH +A++ G
Sbjct: 232 -EWGFDGYFVSDCWAIRDFHTTHKITDTAPQSAAMALKAGCDVNCGNTY-LHILAALEEG 289
Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-HQELALEAARQGIV 395
L+++ DI A ++ L ++RLG D + L P D+ D ++ L+LEAA + +V
Sbjct: 290 LITKQDIRTACIHALRTRIRLGQLD----DNEFDDL-PFDIIACDGNKALSLEAAEKSMV 344
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY--ARTIH 453
LL N G LPL R ++AVIGPN+D ++GNY G T L+GI R +
Sbjct: 345 LLHNDGI-LPLDKSRISSIAVIGPNADSRAALLGNYNGTPDRSVTFLEGIQDAFDGRVYY 403
Query: 454 QQGCKDVACADDQLFGAAIDASR---------QADATILVMGLDQSIEAE---------A 495
+GC+ D+ G A+ R AD T++ +GLD ++E E +
Sbjct: 404 AEGCQ---LFRDRTQGLALPGDRYAEAVAACEAADVTVICVGLDATLEGEEGDTGNEFAS 460
Query: 496 LDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQA 555
D+ L LP Q+ L+ + K P I+VL +G ++ N A+I A YPGQ
Sbjct: 461 GDKPDLRLPEVQRVLLQNLKDTGK-PLIIVLAAGSSVNTECEGN-----ALINAWYPGQY 514
Query: 556 GGTAIADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFY--KG 612
GG A+A+ILFG +P GKLP+T+Y + LP T+ +M+ RTYRF +
Sbjct: 515 GGKALAEILFGEVSPSGKLPVTFYKSADM--LPDFTDYSMK--------NRTYRFCDDES 564
Query: 613 PVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGV 672
V+YPFG+G++Y++F G ++ + A+ VT
Sbjct: 565 NVLYPFGYGLTYSHF-----------------ECGDVSYKDNTLAVNVT----------- 596
Query: 673 QVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLS 732
N GS+ L V+ G + L AFE+V + G + + INI +
Sbjct: 597 -----NTGSRSAEDVLQVYIKSENG--VKNHSLCAFERVSLFDGESRTISINIPEGAF-E 648
Query: 733 VVDRSGTRRIPLGEHNIHIGGTK 755
VD +G R + G + ++ G T+
Sbjct: 649 TVDDNGIRAVRSGRYTLYAGFTQ 671
>gi|325970053|ref|YP_004246244.1| beta-glucosidase [Sphaerochaeta globus str. Buddy]
gi|324025291|gb|ADY12050.1| Beta-glucosidase [Sphaerochaeta globus str. Buddy]
Length = 698
Score = 370 bits (949), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 249/736 (33%), Positives = 366/736 (49%), Gaps = 93/736 (12%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
QR +L+ R++L + + L A A+ LGI Y WW+E LHG + G A
Sbjct: 5 QRAQELVERMNLPQMMSQLRHDAPAIESLGIPAYNWWNEGLHGSARSGT----------A 54
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT--------AGLTYWSPNVNIFRD 162
T FPQ I AS F+ A+ VVS E RA YN T GLT WSPNVNIFRD
Sbjct: 55 TVFPQAIGLASLFDPDFLYAVASVVSTEQRAKYNLFTHENDRDIYKGLTVWSPNVNIFRD 114
Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--R 220
PRWGRGQET GEDP L+ + A +++RGLQG +G LK A+ KHF A+ +G + R
Sbjct: 115 PRWGRGQETFGEDPYLTARLAVAFIRGLQG-EGPVLKTASCVKHFAAH-----SGPEPLR 168
Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
FNA V K+D+E+T+ F V E K +VM +Y+ +N P CA +++ T+R W
Sbjct: 169 HGFNAVVGKKDLEETYLPAFASAVKEAKADAVMGAYSALNDEPCCASSFLMEETLRLRWG 228
Query: 281 LNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLS 339
G +SDC ++ ++ T EE+AA A++ G DL CG + L E A Q+GL++
Sbjct: 229 FEGMYISDCWAIRDFHLNHKVTKNEEESAALALKRGCDLACGCEYQSL--EKAFQKGLIT 286
Query: 340 EIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKN 399
I A + +T + +LG FD Y LG + + + +H LA EA+ + +VLLKN
Sbjct: 287 REQIKKAAIRVMTTRFKLGQFD---QGTAYDTLGLESLDSDEHAALAFEASCRSLVLLKN 343
Query: 400 QGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY----ARTIHQQ 455
LPL +AVIGPN+D + GNY G + Y T L+G+ Y R ++ +
Sbjct: 344 DA-LLPLKKEAVSCLAVIGPNADSRQALWGNYHGTSSRYVTILEGLRDYVGSSTRILYSE 402
Query: 456 GCK------DVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRAG 500
G + DD A+ ++ +D +L +GL++++E E A D+
Sbjct: 403 GSNLTKNKVERLAKDDDRLSEAVFMAKASDVVVLCLGLNETVEGEMHDDGNGGWAGDKDD 462
Query: 501 LLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
L LP Q++L+ V+ K P I+VL+SGG +D + + A+I A YPGQ GG AI
Sbjct: 463 LRLPLCQRKLLKAVAETGK-PIIVVLLSGGSLDPEIEQY-ANVKALIQAWYPGQEGGKAI 520
Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGP-VVYPFG 619
A +L+G P GKLP+T+Y E LP P RTYR+ P V+YPFG
Sbjct: 521 AHLLYGALCPSGKLPVTFYKAE--AKLP-------PFTDYSLIRRTYRYCDDPDVLYPFG 571
Query: 620 HGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNV 679
G+SY +F ++ A + + AT+ V+N
Sbjct: 572 FGLSYASFSFCLSAAQ-------ETEQNGVAATVL---------------------VRNT 603
Query: 680 GSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGT 739
+ D + ++ PH L + VH+ AG + ++ + K + V G
Sbjct: 604 SALDARTVVQLYLAMEGKDLPPHPVLCGMKSVHLKAGEETQITFILEE-KQFTAVQEDGN 662
Query: 740 RRIPLGEHNIHIGGTK 755
R G + ++ G ++
Sbjct: 663 RYAVRGGYTLYAGSSQ 678
>gi|451851086|gb|EMD64387.1| glycoside hydrolase family 3 protein [Cochliobolus sativus ND90Pr]
Length = 763
Score = 369 bits (948), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 244/753 (32%), Positives = 379/753 (50%), Gaps = 48/753 (6%)
Query: 10 LGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLL 69
LG L+S S + + D + T C V+ P +R L+ + QEK+ L
Sbjct: 10 LGASLISKSHAAIGP------DCTNGPLSTNAICDVNAPPHERAAALVAAMEPQEKLDNL 63
Query: 70 ISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWE 129
+S + V RLG+ Y WW EALHGV+ PG KF + ATSFP I +++F+ L
Sbjct: 64 VSKSKGVSRLGLPAYNWWGEALHGVAG-APGIKFVEPYKNATSFPMPILMSAAFDDDLIF 122
Query: 130 AIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRG 189
I ++ +EARA NGG A + YW+P++N RD RWGR E+PGED Y + + G
Sbjct: 123 KIANIIGNEARAFGNGGVAPVDYWTPDINPVRDIRWGRASESPGEDIRRIKGYTKALLAG 182
Query: 190 LQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKV 249
L+G R K+ A+CKH+ YD++ W G DR +F+AK++ QD+ + + PF+ C + KV
Sbjct: 183 LEGDQAQR-KIIATCKHYVGYDMEAWGGYDRHNFSAKITMQDLAEYYMPPFQQCARDSKV 241
Query: 250 ASVMCSYNQVNGVPTCADPNILKRTIRGEWRL---NGYIVSDCDSVGVYYDTQHFTSTPE 306
S MCSYN VNG+PTCAD +L+ +R W N YI SDC++V + + T
Sbjct: 242 GSFMCSYNAVNGIPTCADTYVLQTILRDHWNWTDSNNYITSDCEAVADISENHKYVETLA 301
Query: 307 EAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSS 366
+ A A G+DL C A +GLL+ I+ AL + G FDG ++
Sbjct: 302 QGTALAFAKGMDLSCEYTGSSDIPGAWAQGLLNISVIDKALTRQYEGLVHAGYFDGAKAT 361
Query: 367 QPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVT 426
Y +L KD+ TP+ ++L+L+ +G+V+LKN +LPL + VA+IG ++ +
Sbjct: 362 --YANLSYKDINTPEARQLSLQVTSEGLVMLKNDH-TLPLPLTKGSKVAMIGFWANDSSK 418
Query: 427 MIGNYAG--------IACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQA 478
+ G Y+G + G L + I D + A+DA+ ++
Sbjct: 419 LQGIYSGPPPYRHSPVFAGEQMGLDMAIAWGPMIQNSSVPDNWTTN------ALDAAEKS 472
Query: 479 DATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAK 538
D + G D ++ E DR + P Q +L++K +A G ++V+ G D +
Sbjct: 473 DYILYFGGQDWTVAQEGYDRTTISFPQVQIDLLTK--LAKLGKPLVVITLGDMTDHSPLL 530
Query: 539 NDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQ 598
+ + +IIWA +PGQ GG AI +++ G P G+LP+T YP +Y+ L M +M +RP
Sbjct: 531 SMEGVNSIIWANWPGQDGGPAILNVVSGAHAPAGRLPITEYPADYV-KLSMLDMNLRPHT 589
Query: 599 SKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAI 658
PGRTYR++ V PFG G+ YT F + A+ +G I + G
Sbjct: 590 ES--PGRTYRWFN-ESVQPFGFGLHYTTFEASFASE--------EGLTYDIEEILDG-CT 637
Query: 659 KVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG-HWAPHKQLVAFEKVH-VPAG 716
+ C L +V V N G++ L F G P K L+ + ++ + G
Sbjct: 638 QQYKDLCEVAPL--EVTVANKGNRTSDFVALAFIKGEVGPKPYPLKTLITYGRLRDIHGG 695
Query: 717 AQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNI 749
A++ + + + + L+ VD+SG I GE+ +
Sbjct: 696 AKKSASLPLTLGE-LARVDQSGNTVIYPGEYTL 727
>gi|373852136|ref|ZP_09594936.1| Beta-glucosidase [Opitutaceae bacterium TAV5]
gi|372474365|gb|EHP34375.1| Beta-glucosidase [Opitutaceae bacterium TAV5]
Length = 740
Score = 369 bits (948), Expect = 3e-99, Method: Compositional matrix adjust.
Identities = 257/746 (34%), Positives = 380/746 (50%), Gaps = 88/746 (11%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
PF L + RV DL+ RL+L EKV + AAA+PRLGI Y +W+E LHGV+ G
Sbjct: 22 PFRDPDLALDHRVRDLVSRLTLAEKVSQMEHAAAAIPRLGIPAYNYWNECLHGVARNG-- 79
Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA-----------G 149
AT FPQ+I A++++ L + +SDEARA ++ A G
Sbjct: 80 --------RATVFPQIIGLAATWDTDLVYRVATAISDEARAKHHAALARQGFAQTQQYQG 131
Query: 150 LTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD-RLKVAASCKHFT 208
LT+W+PN+N+FRDPRWGRGQET GEDP L+ + AA++VRGLQG D LK+AA KH+
Sbjct: 132 LTFWTPNINLFRDPRWGRGQETWGEDPHLTARLAAAFVRGLQGDTPDTHLKLAACAKHYA 191
Query: 209 AYD-LDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCAD 267
+ +N +R FNA+V+ D+ D++ F V +V SVM +YN+ P CA
Sbjct: 192 VHSGPEN----ERHTFNARVTPHDLWDSYLPAFEHLVRHARVESVMGAYNRTLDEPCCAS 247
Query: 268 PNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGL 327
+L +R W G++VSDC ++ ++T T+ P E+AA A+ G DL CG L
Sbjct: 248 QFLLLDILRERWGFEGHVVSDCWALRDIHETHRITTDPVESAALALTKGCDLACGTTFEL 307
Query: 328 HTESAVQRGLLSEIDINNALVNTLTVQMRLGMFD-GEPSSQPYGH-LGPKDVCT-PDHQE 384
E AVQRGL++E DI+ AL L + +LGMFD + + P+ + P+ + T H
Sbjct: 308 LGE-AVQRGLITEADIDRALSRHLRARFKLGMFDPADDNRNPWSNPPAPEAIVTCAAHTA 366
Query: 385 LALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG 444
LA EAA VLL+N LPL R++ + GP + ++GNY G+ T L G
Sbjct: 367 LACEAAVASCVLLQNHNHILPL-RPDVRSIYITGPLAATQDALLGNYYGLPPRAITLLDG 425
Query: 445 IG----RYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEA----- 495
+ R ++ G + L A D + D TI +GL +E E
Sbjct: 426 LAAALPEGIRADYRPGALLSTPKQNALEWAEFDCA-SCDVTIACLGLTALLEGEEGEAIA 484
Query: 496 ----LDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGY 551
DR + LP Q+ + S+ +G ++V++ GG ++ ++ AI+WAGY
Sbjct: 485 SSLHGDRDDISLPPPQRLFLE--SLIQRGARVIVILFGGSA-LSLGPLADKVEAILWAGY 541
Query: 552 PGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYK 611
PGQ GG A+ADIL G ++P G+LP+T+Y E I +LP P + GRT+R++
Sbjct: 542 PGQEGGRALADILLGRASPSGRLPITFY--ENINDLP-------PYANYSMRGRTHRWFD 592
Query: 612 GPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLG 671
G +PFG G++YT F T S + ++ N L
Sbjct: 593 GTPAWPFGFGLTYTRF------------------------TYSDLRVSDVYSPGNDSPLC 628
Query: 672 VQVDVKNVGSKDGAHTLLVFST---PPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVC 728
V + N G + A + ++ T P P + L F +V + G +RV +I
Sbjct: 629 GSVLLTNTGDHEAAEIVQIYLTDFDAPGNGPVPRENLADFHRVTLAPGQSRRVEFSIPP- 687
Query: 729 KYLSVVDRSGTR-RIPLGEHNIHIGG 753
+++ +VD +G R R PL +H+GG
Sbjct: 688 EHILLVDTNGRRTRAPLA-FTVHVGG 712
>gi|282877070|ref|ZP_06285912.1| glycosyl hydrolase family 3 C-terminal domain protein [Prevotella
buccalis ATCC 35310]
gi|281300752|gb|EFA93079.1| glycosyl hydrolase family 3 C-terminal domain protein [Prevotella
buccalis ATCC 35310]
Length = 721
Score = 369 bits (948), Expect = 3e-99, Method: Compositional matrix adjust.
Identities = 257/762 (33%), Positives = 376/762 (49%), Gaps = 107/762 (14%)
Query: 39 TLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVG 98
T PF L QR +DL RL+L+EK L+ + + VPRLGIK ++WW EALHG + G
Sbjct: 23 TYPFQDARLSFEQRADDLCKRLTLEEKAGLMQNNSKPVPRLGIKQFQWWGEALHGSARTG 82
Query: 99 PGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGL 150
AT FPQ I A+SF+ L + + S EARA YN + +
Sbjct: 83 L----------ATVFPQTIGMAASFDDELLLQVFNIASTEARAKYNVAAKKGYFDTSWSV 132
Query: 151 TYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDG--DRLKVAASCKHFT 208
+ W+PNVNIFRDPRWGRGQET GEDP L+ + + V GLQG G K A KHF
Sbjct: 133 SLWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGCAVVEGLQGGKGPHKYYKAFACAKHFA 192
Query: 209 AYDLDNWNGVDRFHFNA-KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCAD 267
+ WN R + VS +D +T+ F+ V G V VMC+YN ++G P C+D
Sbjct: 193 VHSGPEWN---RHSISIDDVSPRDFHETYLPAFKHLVQVGGVKEVMCAYNSIDGEPCCSD 249
Query: 268 PNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPE--EAAADAIRAGLDLDCGPFL 325
+L++ +R EW G +VSDC ++ + P+ A+A A++ G D+ CG
Sbjct: 250 QRLLEQLLRDEWGFKGIVVSDCGAIDDIWRKGFHEVEPDAAHASARAVKGGTDMSCGQTY 309
Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
G E AV+ G ++E I+ +L + +M+LG FD + ++ + + KDV TP +E+
Sbjct: 310 GSLPE-AVRLGKVTEERIDKSLKRLIVGRMQLGEFDPDSITR-WNAISMKDVSTPASREV 367
Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
AL+ AR+ + LL N +LPLS + + V V+GPN++ +V M GNY G T L GI
Sbjct: 368 ALKMARETMTLLHNPMHALPLSK-QLKQVVVMGPNANDSVMMWGNYNGTPHHTVTILDGI 426
Query: 446 GRY---ARTIHQQGCKDV--------ACADDQLFGAAIDASRQADATILVMGLDQSIEAE 494
R R +GC V A QL D I V G+ +E E
Sbjct: 427 RRKIGAQRVKFIEGCGLVEPHRRGNQALTTQQLVEEVGDNK----TVIFVGGISPQLEGE 482
Query: 495 AL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIA 544
L DR + LP Q+E+++ + A K I+V SG I + P +
Sbjct: 483 QLEVEAKGFKGGDRVTIELPQVQREMIAALHAAGK-QVIMVNCSGSAIGLV-----PEVT 536
Query: 545 ---AIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSK 600
AI+ A YPG+ GG A+AD+LFG NP GKLP+T+Y + + LP + MR
Sbjct: 537 HTDAILQAWYPGERGGEAVADVLFGDYNPAGKLPVTFYRDD--SQLPDYLDYNMR----- 589
Query: 601 RYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKV 660
RTYR++KG ++PFGHG+SYT+F K+
Sbjct: 590 ---NRTYRYFKGKPLFPFGHGLSYTSF-------------------------------KI 615
Query: 661 THAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQR 720
AK L V VKN G +DG + ++ + P K L F+++ + AG Q+
Sbjct: 616 GKAKMRNGKL--TVSVKNTGKRDGEEVVQLYISCLDDPNGPIKSLRGFKRMALQAGEQRT 673
Query: 721 VGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHA 762
V +N+ + +++ T R+ G++ ++ G + L +
Sbjct: 674 VTLNLPRKSFERFDEQTNTIRVVPGKYRVYYGTSSDEADLQS 715
>gi|402493386|ref|ZP_10840139.1| beta-glucosidase [Aquimarina agarilytica ZC1]
Length = 734
Score = 369 bits (947), Expect = 3e-99, Method: Compositional matrix adjust.
Identities = 268/787 (34%), Positives = 395/787 (50%), Gaps = 106/787 (13%)
Query: 10 LGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLL 69
L LI L+ LAA+ K+ T+ + + +R L+ L+L+EK+ L+
Sbjct: 16 LMLIFLTLLPKHLAAQ-------KNENTKNFEWFDTNKSFEKRAKALVASLTLEEKISLM 68
Query: 70 ISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWE 129
+ +A + RL I Y WW+E LHGV+ G AT FPQ I A++F+ L
Sbjct: 69 VDQSAPIDRLNIPEYNWWNECLHGVARNGR----------ATVFPQAIGLAATFDQDLIF 118
Query: 130 AIGRVVSDEARAMYN--------GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGK 181
+ +S EARA +N G AGLT+W+PN+NIFRDPRWGRGQET GEDP L+ +
Sbjct: 119 KVADAISTEARAKFNASIAIGNRGKYAGLTFWTPNINIFRDPRWGRGQETYGEDPYLTSQ 178
Query: 182 YAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--RFHFNAKVSKQDIEDTFDVP 239
++V+GLQG+ LK AA KH+ + +G + R F+A SK+D+ +T+
Sbjct: 179 IGVNFVKGLQGNHPKYLKSAACAKHYAVH-----SGPEELRHEFDAIASKKDMAETYLPA 233
Query: 240 FRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQ 299
F V E KV VM +YN+VNG CA P +L++ ++ W GYIVSDC ++ +
Sbjct: 234 FEALVKEAKVEGVMGAYNRVNGEGACASPYLLEKLLKDTWGFKGYIVSDCWALSDLHKFH 293
Query: 300 HFTSTPEEAAADAIRAGLDLDCG---PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMR 356
T T EE+AA A+ GL+++CG P L + A+++GL SE ++N L + L + +
Sbjct: 294 KVTQTAEESAAAALNVGLNVNCGNVYPAL----DGAIKQGLTSEKQLDNVLQHQLLTRFK 349
Query: 357 LGMFDGEPS-SQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVA 415
LG FD PS + PY + V + H+ +ALEAA++ IVLLKN +L ++V
Sbjct: 350 LGFFD--PSNNNPYNKITTDVVDSEAHRAIALEAAQKSIVLLKNNN-NLLPLKKDLKSVY 406
Query: 416 VIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYART----IHQQGCKDVACADDQLFGAA 471
V GPN+ ++GNY G+ T L GI ++QG + + +
Sbjct: 407 VAGPNAAREDVLLGNYYGVTSKTQTILDGIVSKVSAGTSINYKQGLLPFQKNVNPIDWST 466
Query: 472 IDASRQADATILVMGLD---QSIEAEAL------DRAGLLLPGRQQELVSKVSMASKG-P 521
+ SR AD I+VMGL + E EA+ DR + LP Q + + K+ + G P
Sbjct: 467 GEISR-ADVGIIVMGLSGNYEGEEGEAIASESKGDRVDIRLPQNQIDYIKKIKAKNTGNP 525
Query: 522 TILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQ 581
+LVL G PI A + + AI++A YPG+ GG A+ADILFG P GKLP+T +P+
Sbjct: 526 LVLVLTGGSPI--AMPEVYDLVDAIVFAWYPGEEGGQAVADILFGDVVPSGKLPIT-FPK 582
Query: 582 EYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVP 641
P + AM+ GRTY++ +PFG G+SYT+F +
Sbjct: 583 SVDDLPPYNDYAMK--------GRTYKYMTKTPQFPFGFGLSYTSFKY------------ 622
Query: 642 LDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWA 700
+KV K + + N G+ D V+ S+P AG
Sbjct: 623 --------------DNLKVYKEKAS-------FSITNNGNVDAEEVAQVYVSSPNAGKGD 661
Query: 701 PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
P LV F +V + AGA ++V I ++ D G G + IH+GG S +
Sbjct: 662 PLNTLVGFTRVSLKAGATKQVSIPFSKKAFVQ-FDSDGKEITRKGTYTIHVGGA--SPNT 718
Query: 761 HAATLGV 767
TLG+
Sbjct: 719 ITTTLGI 725
>gi|167751044|ref|ZP_02423171.1| hypothetical protein EUBSIR_02029 [Eubacterium siraeum DSM 15702]
gi|167655962|gb|EDS00092.1| glycosyl hydrolase family 3 C-terminal domain protein [Eubacterium
siraeum DSM 15702]
Length = 691
Score = 369 bits (947), Expect = 4e-99, Method: Compositional matrix adjust.
Identities = 255/743 (34%), Positives = 388/743 (52%), Gaps = 114/743 (15%)
Query: 47 LPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGD 106
L +R L LS +E+ + L A A+ + G+ Y WW+E LHGV+ G
Sbjct: 9 LSAYERAAALADTLSTEEQAQQLKYDAPAIEKAGLPSYNWWNEGLHGVARAGT------- 61
Query: 107 FPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVN 158
AT FPQ I A++F+ + +G V+S EARAMYN GLT W+PN+N
Sbjct: 62 ---ATVFPQAIALAAAFDKDMMYRVGEVISTEARAMYNSAAKHGDTDIYKGLTLWAPNIN 118
Query: 159 IFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGV 218
IFRDPRWGRG ET GEDP L+ + ++V+G+QG + + L+ AA KHF + +G
Sbjct: 119 IFRDPRWGRGHETYGEDPYLTSRLGVNFVKGIQGEE-EYLRAAACAKHFAVH-----SGP 172
Query: 219 D--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIR 276
+ R F+A+VS++D+E+T+ F+ V EG+V VM +YN+VNG P+CA ++ + +R
Sbjct: 173 ESLRHEFDARVSEKDMEETYLPAFKALVKEGRVEGVMGAYNRVNGEPSCASEKLMGK-LR 231
Query: 277 GEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRG 336
EW +GY VSDC ++ ++ T T T ++AA A++AG D++CG LH +A++ G
Sbjct: 232 -EWGFDGYFVSDCWAIRDFHTTHKITDTAPQSAAMALKAGCDVNCGNTY-LHILAALEEG 289
Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-HQELALEAARQGIV 395
L+++ +I A ++ L ++RLG D + L P D+ D ++ L+LEAA + +V
Sbjct: 290 LITKQNIRTACIHALRTRIRLGQLD----DNEFDDL-PFDIIACDGNKALSLEAAEKSMV 344
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY--ARTIH 453
LL N G LPL R ++AVIGPN+D ++GNY G T L+GI R +
Sbjct: 345 LLHNDGI-LPLDKSRISSIAVIGPNADSRAALLGNYNGTPDRSVTFLEGIQDAFDGRVYY 403
Query: 454 QQGCKDVACADDQLFGAAIDASR---------QADATILVMGLDQSIEAE---------A 495
+GC+ D+ G A+ R AD T++ +GLD ++E E +
Sbjct: 404 AEGCQ---LFRDRTQGLALPGDRYAEAVAACEAADVTVVCVGLDATLEGEEGDTGNEFAS 460
Query: 496 LDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQA 555
D+ L LP Q+ L+ K+ K P I+VL +G ++ N A+I A YPGQ
Sbjct: 461 GDKPDLRLPEVQRVLLQKLKDTGK-PLIIVLAAGSSVNTECEGN-----ALINAWYPGQY 514
Query: 556 GGTAIADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFY--KG 612
GG A+A+ILFG +P GKLP+T+Y + LP T+ +M+ RTYRF +
Sbjct: 515 GGKALAEILFGEVSPSGKLPVTFYKSADM--LPDFTDYSMK--------NRTYRFCDDES 564
Query: 613 PVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGV 672
V+YPFG+G++Y++F G I+ + A+ VT
Sbjct: 565 NVLYPFGYGLTYSHF-----------------ECGDISYKDNTLAVNVT----------- 596
Query: 673 QVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLS 732
N GS+ L V+ G + L AFE+V + G + + INI +
Sbjct: 597 -----NTGSRSAEDVLQVYIKSENG--VKNHSLCAFERVSLFDGESRTISINIPEGAF-E 648
Query: 733 VVDRSGTRRIPLGEHNIHIGGTK 755
VD +G R + G + ++ G T+
Sbjct: 649 TVDDNGVRAVISGRYTLYAGFTQ 671
>gi|410648100|ref|ZP_11358515.1| beta-glucosidase [Glaciecola agarilytica NO2]
gi|410132388|dbj|GAC06914.1| beta-glucosidase [Glaciecola agarilytica NO2]
Length = 733
Score = 369 bits (946), Expect = 5e-99, Method: Compositional matrix adjust.
Identities = 259/786 (32%), Positives = 388/786 (49%), Gaps = 89/786 (11%)
Query: 12 LILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLIS 71
+I+ +A+ S LA C A P+ LP +R++ LI ++L+EK L++
Sbjct: 1 MIIRNAAMSLLAL-----CVATPALANDQPWFDTQLPTQKRIDLLIDAMTLKEKTSQLVN 55
Query: 72 GAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAI 131
G A+ RLG+ Y++W+EALHGV+ G AT FPQ I A++F+ L
Sbjct: 56 GNVAIERLGLPEYDFWNEALHGVARNG----------RATVFPQAIGMAATFDQHLLLKA 105
Query: 132 GRVVSDEARAMYNGGT--------AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYA 183
V+SDEARA +N + +GLT+W+PN+NIFRDPRWGRGQET GEDP L+ +
Sbjct: 106 ASVISDEARAKFNVSSEIGNRSKYSGLTFWTPNINIFRDPRWGRGQETYGEDPYLTAQMG 165
Query: 184 ASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--RFHFNAKVSKQDIEDTFDVPFR 241
+ V GLQG LK AA+ KHF + +G + R F+A S +D+ +T+ F
Sbjct: 166 KAMVNGLQGDHPKYLKTAAAAKHFAVH-----SGPEALRHEFDAIASPKDMYETYFPAFE 220
Query: 242 MCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHF 301
V E V +VM +YN+VNG P +L +R +W +G++VSDC + ++
Sbjct: 221 ALVTEANVETVMAAYNRVNGHPAGGSDFLLNTVLRDKWGFSGHVVSDCWGLADFHQYHKV 280
Query: 302 TSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFD 361
T+ E+AA AI G DL+CG + AV+ GL+ E I+ L L + +LG FD
Sbjct: 281 TANAVESAALAINTGTDLNCGAVYNALPD-AVEAGLVDEKTIDKRLSKVLATKFKLGFFD 339
Query: 362 GEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNS 421
+ PY ++ V + H ++A E A + IVLL+N+ LPL R + V GP +
Sbjct: 340 PK-DDNPYNNISADVVNSEAHAQVAYEMAVKSIVLLQNKNNILPLDR-NIRNLYVTGPFA 397
Query: 422 DVTVTMIGNYAGIACGYTTPLQGIG---RYARTI-HQQGCKDVACADDQLFGAAIDASRQ 477
+ ++GNY G++ T L GI TI ++QG + + +A +
Sbjct: 398 SSSEVLLGNYYGLSGKTTNILDGITANVSVGTTINYKQGILPYQANVNPIDWTTGEAKQM 457
Query: 478 ADATILVMGLD---QSIEAEAL------DRAGLLLPGRQQELVSKVSMASKGPTILVLMS 528
D I VMGL + E EA+ DR L LP Q + K+ + P I+VL +
Sbjct: 458 GDVIIAVMGLSGAYEGEEGEAIASPHKGDRLSLDLPEHQIAFLRKLRKDNDKPVIVVLTA 517
Query: 529 GGPIDVAFAKNDPRIA----AIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYI 584
G P+++ IA AI++A YPGQ GG A+ADILFG +P G+LP+T+
Sbjct: 518 GTPVNLT------EIAELADAIVFAWYPGQEGGKAVADILFGERSPSGRLPITF------ 565
Query: 585 TNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDG 644
P +E + P GRTYR+ +YPFG G+SY V D
Sbjct: 566 ---PKSEAQLPPYDDYSMQGRTYRYMTQEPMYPFGFGLSYAQ-------------VKFD- 608
Query: 645 RHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTL-LVFSTPPAGHWAPHK 703
N T+ + + + V V+V N G ++ + L TP AG P
Sbjct: 609 -----NITLGNTQALASKNELQE-NMTVTVNVTNTGEREFEEVVQLYLKTPDAGVSQPLH 662
Query: 704 QLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAA 763
L F ++ + AG ++V NI K+L ++ G + G++++ +G S +
Sbjct: 663 SLKGFTRIKLAAGQTEQVLFNIP-KKHLYSINEQGKPVLLKGQYSVIVGNA--SPGNRSE 719
Query: 764 TLGVIK 769
TLG K
Sbjct: 720 TLGAAK 725
>gi|451996250|gb|EMD88717.1| glycoside hydrolase family 3 protein [Cochliobolus heterostrophus
C5]
Length = 763
Score = 368 bits (945), Expect = 6e-99, Method: Compositional matrix adjust.
Identities = 248/757 (32%), Positives = 379/757 (50%), Gaps = 47/757 (6%)
Query: 8 FFLGLILLSAS--SSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEK 65
F ++L AS S AA P D + T C V+ P +R L+ + QEK
Sbjct: 3 FLSTAVVLGASLVSKSYAAIGP---DCTNGPLSTNAICDVNAPPHERAAALVAAMEPQEK 59
Query: 66 VKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNA 125
+ L+S + V RLG+ Y WW EALHGV+ PG KF + ATSFP I +++F+
Sbjct: 60 LDNLVSKSKGVSRLGLPAYNWWGEALHGVAG-APGIKFVEPYKNATSFPMPILMSAAFDD 118
Query: 126 TLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAAS 185
L I ++ +EARA NGG A + YW+P++N RD RWGR E+PGED Y +
Sbjct: 119 DLIFKIANIIGNEARAFGNGGVAPMDYWTPDINPVRDIRWGRASESPGEDIRRIKGYTKA 178
Query: 186 YVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVM 245
+ GL+G R K+ A+CKH+ YD++ W G DR +F+AK++ QD+ + + PF+ C
Sbjct: 179 LLAGLEGDQAQR-KIIATCKHYVGYDMEAWGGYDRHNFSAKITMQDLAEYYMPPFQQCAR 237
Query: 246 EGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRL---NGYIVSDCDSVGVYYDTQHFT 302
+ KV S MCSYN VNGVPTCAD +L+ +R W N YI SDC++V + +
Sbjct: 238 DSKVGSFMCSYNAVNGVPTCADTYVLQTILRDHWNWTDSNNYITSDCEAVADISENHKYV 297
Query: 303 STPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDG 362
T + A A G+DL C A +GLL+ I+ AL + G FDG
Sbjct: 298 ETLAQGTALAFAKGMDLSCEYSGSSDIPGAWSQGLLNLSVIDKALTRQYEGLVHAGYFDG 357
Query: 363 EPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSD 422
++ Y +L D+ TP+ ++L+L+ +G+V+LKN +LPL + VA+IG ++
Sbjct: 358 AKAT--YANLSYNDINTPEARQLSLQVTSEGLVMLKNDH-TLPLPLTKGSKVAMIGFWAN 414
Query: 423 VTVTMIGNYAG--------IACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDA 474
+ + G Y+G + G L + I D + A+DA
Sbjct: 415 DSSKLQGIYSGPPPYRHSPVFAGEQMGLDMAIAWGPMIQNSSVPDNWTTN------ALDA 468
Query: 475 SRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDV 534
+ ++D + G D ++ E DR + P Q +L++K +A G ++V+ G D
Sbjct: 469 AEKSDYILYFGGQDWTVAQEGYDRTTISFPQVQIDLLAK--LAKLGKPLVVITLGDMTDH 526
Query: 535 AFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM 594
+ + I +IIWA +PGQ GG AI +++ G P G+LP+T YP +Y+ L M +M +
Sbjct: 527 SPLLSMEGINSIIWANWPGQDGGPAILNVISGVHAPAGRLPITEYPADYV-KLSMLDMNL 585
Query: 595 RPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATIS 654
RP PGRTYR++ V PFG G+ YT F A+ +G I T+
Sbjct: 586 RPHAES--PGRTYRWFN-ESVQPFGFGLHYTTFEAGFASE--------EGLTYDIQETLD 634
Query: 655 GKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG-HWAPHKQLVAFEKVH- 712
+ C L +V V N G++ L F G P K L+ + ++
Sbjct: 635 -SCTQQYKDLCEVAPL--EVTVANKGNRTSDFVALAFIKGEVGPKPYPLKTLITYGRLRD 691
Query: 713 VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNI 749
+ GA++ + + + + L+ VD+SG I GE+ +
Sbjct: 692 IHGGAKKSASLPLTLGE-LARVDQSGNTVIYPGEYTL 727
>gi|5690010|emb|CAB51937.1| Family 3 Glycoside Hydrolase [Ruminococcus flavefaciens]
Length = 690
Score = 367 bits (943), Expect = 9e-99, Method: Compositional matrix adjust.
Identities = 251/725 (34%), Positives = 367/725 (50%), Gaps = 104/725 (14%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
+ +L +R D+ RLS +EK + A A RLG Y WWSE LHGV+ G
Sbjct: 6 YLDEALSDLERAEDITDRLSTEEKAEQQKYDAPAEERLGKDAYNWWSEGLHGVARAGT-- 63
Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYW 153
AT FPQ I A+ F+ G S EARA YN +A GLT W
Sbjct: 64 --------ATMFPQTIGMAAMFDDEAVHRAGETTSREARAKYNEYSAHDDRDIYKGLTLW 115
Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLD 213
SPNVNIFRDPRWGRGQET GEDP L+ +Y +GLQG DG L+ AA KHF +
Sbjct: 116 SPNVNIFRDPRWGRGQETYGEDPYLTSCLGVAYAKGLQG-DGKVLRTAACAKHFAVH--- 171
Query: 214 NWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKR 273
+ R F+AK + +D+ +T+ F V + KV SVM +YN+VNG P CA ++ +
Sbjct: 172 SGPEATRHEFDAKANMKDMTETYIAAFEALVKDAKVESVMGAYNRVNGEPACASDFVMNK 231
Query: 274 TIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAV 333
EW +G+ VSDC ++ ++ T T E+AA A++ G DL+CG LH +A
Sbjct: 232 L--EEWGFDGHFVSDCWAIRDFHTNHGVTKTAPESAALALKKGCDLNCGNTY-LHLLAAF 288
Query: 334 QRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQG 393
GL++E D+ + + + ++RLGMFD S Y L V +H+E +L + +
Sbjct: 289 NEGLINEEDLRRSCIKLMRTRVRLGMFD---KSTEYDGLDYDIVACDEHKEFSLRCSERS 345
Query: 394 IVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI-----GRY 448
+VLLKN G LPL +++T+ VIGPN+D + GNY G A Y T L GI GR
Sbjct: 346 MVLLKNNGI-LPLDGSKYKTIGVIGPNADSVPALEGNYNGKADEYITFLSGIREAHDGRV 404
Query: 449 ART----IHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------A 495
T +++ C +A DD+L A I +R + + LD +IE E +
Sbjct: 405 LYTEGSHLYKDRCMGLALPDDRLSEAEI-ITRTLRCSGSLCWLDATIEGEEGDTGNEFSS 463
Query: 496 LDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQA 555
D+ L LP Q++LV V MA P I+V +G I+V + A+I A YPGQ
Sbjct: 464 GDKNDLRLPESQRKLVKTV-MAKGKPVIIVTAAGSAINV-----EADCDALIQAWYPGQL 517
Query: 556 GGTAIADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPV 614
GG A+A+ILFG +P GKLP+T+Y E + LP ++ +M+ RTYR+ +G +
Sbjct: 518 GGRALANILFGKVSPSGKLPVTFY--EDASKLPDFSDYSMK--------NRTYRYSEGNI 567
Query: 615 VYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTL--GV 672
++PFG+G++Y + +C+ L+ GV
Sbjct: 568 LFPFGYGLTY------------------------------------SETECSELSFENGV 591
Query: 673 -QVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYL 731
V V N GS+ + ++ + + P+ L F++V + AG + V I + ++
Sbjct: 592 ATVKVTNTGSRFTEDVVQIYIKGYSENAVPNHSLCGFKRVALDAGESRIVQITLPERAFM 651
Query: 732 SVVDR 736
+V ++
Sbjct: 652 AVNEK 656
>gi|291240559|ref|XP_002740189.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 745
Score = 366 bits (940), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 230/618 (37%), Positives = 325/618 (52%), Gaps = 52/618 (8%)
Query: 40 LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISG-------AAAVPRLGIKGYEWWSEALH 92
PF SLP +RV DL+GRL L+E V + G A + RL I Y W +E L
Sbjct: 25 FPFRNTSLPWNKRVEDLVGRLKLEEIVLQMSRGGRYSNGPAPPIDRLNIGPYSWNTECLR 84
Query: 93 GVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN-------- 144
G + GP ATSFPQ A++F+A L + I + E RA YN
Sbjct: 85 GDLSAGP----------ATSFPQAFGLAATFDAVLIKQIANATAYEVRAKYNNYTKHKEY 134
Query: 145 GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASC 204
G GL+ +SP +NI R P WGR QET GEDP LSG AAS+V GLQG+ + A C
Sbjct: 135 GDHKGLSCFSPVINIARHPLWGRIQETYGEDPYLSGTLAASFVTGLQGNHPRYVTANAGC 194
Query: 205 KHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
KHF AY R F+AKVS +D+ TF F C+ G S+MCSYN +NGVP
Sbjct: 195 KHFDAYAGPENIPSSRSTFDAKVSDRDLRMTFLPAFHECIQAG-TYSLMCSYNSINGVPA 253
Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPF 324
CA+ +L +R EW GY++SD +V YD H+T + A + +GL+L+
Sbjct: 254 CANKKLLTDILRTEWNFTGYVISDQSAVEKVYDAHHYTKDMLDTAIACVNSGLNLELSSN 313
Query: 325 LG----LHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTP 380
L + T AV++G ++ + + +MRLG FD P PY L + +
Sbjct: 314 LTDNVMMQTTKAVKQGNVTMKTVKARVSPLFYTRMRLGEFD-PPEMNPYSKLDLSIIQSQ 372
Query: 381 DHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYT- 439
+HQEL+L+AA + VLLKN+ LPL + +AV+GP D + + G+ + T
Sbjct: 373 EHQELSLKAAAKSFVLLKNENRFLPLKE-KIDKLAVVGPFGDNPIEIYGSKSPDVSNLTV 431
Query: 440 TPLQGIGRYAR--TIHQQGCKDVACA--DDQLFGAAIDASRQADATILVMGLDQSIEAEA 495
TP G+ + AR T GC AC D + AID + D ++ +G +E EA
Sbjct: 432 TPRYGLSKIARLATTFASGCLSPACTEYDPKSTKQAID---RVDMVVVCLGTGNEVENEA 488
Query: 496 LDRAGLLLPGRQQELVSK-VSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQ 554
DR+ L LPG+Q L+ V+ A+ P IL+L + GP+D+ +A ++P I I+ +P Q
Sbjct: 489 HDRSELTLPGQQLRLLQDAVTFAADKPVILLLFNAGPLDITWAVSNPAIPVIVECFFPAQ 548
Query: 555 AGGTAIADILFGT--SNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKG 612
GTA+ + + SNPGG+LP+TW P+ PM + M GRTYR++ G
Sbjct: 549 TTGTALYHLFVNSPGSNPGGRLPITW-PKSMSQVPPMEDYTME--------GRTYRYFNG 599
Query: 613 PVVYPFGHGMSYTNFVHT 630
++PFG+G+SYT F ++
Sbjct: 600 DPLFPFGYGLSYTTFHYS 617
>gi|332307852|ref|YP_004435703.1| glycoside hydrolase family protein [Glaciecola sp. 4H-3-7+YE-5]
gi|332175181|gb|AEE24435.1| glycoside hydrolase family 3 domain protein [Glaciecola sp.
4H-3-7+YE-5]
Length = 733
Score = 366 bits (940), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 258/786 (32%), Positives = 389/786 (49%), Gaps = 89/786 (11%)
Query: 12 LILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLIS 71
+I+ +A+ S LA C A P+ LP +R++ LI ++L+EK L++
Sbjct: 1 MIIRNAAMSLLAL-----CVATPALANDQPWFDTQLPTQERIDLLIDAMTLKEKTSQLVN 55
Query: 72 GAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAI 131
G A+ RLG+ Y++W+EALHGV+ G AT FPQ I A++F+ L
Sbjct: 56 GNVAIERLGLPEYDFWNEALHGVARNG----------RATVFPQAIGMAATFDQHLLLKA 105
Query: 132 GRVVSDEARAMYNGGT--------AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYA 183
V+SDEARA +N + +GLT+W+PN+NIFRDPRWGRGQET GEDP L+ +
Sbjct: 106 ASVISDEARAKFNVSSEIGNRSKYSGLTFWTPNINIFRDPRWGRGQETYGEDPYLTAQMG 165
Query: 184 ASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--RFHFNAKVSKQDIEDTFDVPFR 241
+ V GLQG LK AA+ KHF + +G + R F+A S +D+ +T+ F
Sbjct: 166 KAMVNGLQGDHPKYLKTAAAAKHFAVH-----SGPEALRHEFDAIASPKDMYETYFPAFE 220
Query: 242 MCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHF 301
+ E V +VM +YN+VNG P +L +R +W +G++VSDC + ++
Sbjct: 221 ALITEANVETVMAAYNRVNGHPAGGSDFLLNTVLRDKWGFSGHVVSDCWGLADFHQYHKV 280
Query: 302 TSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFD 361
T+ E+AA AI G DL+CG + AV+ GL+ E I+ L L + +LG FD
Sbjct: 281 TANAVESAALAINTGTDLNCGAVYNALPD-AVEAGLVDEKTIDKRLSKVLATKFKLGFFD 339
Query: 362 GEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNS 421
+ PY ++ V + H ++A E A + IVLL+N+ LPL R + V GP +
Sbjct: 340 PK-DDNPYNNISADVVNSEAHAQVAYEMAVKSIVLLQNKNNILPLDR-NIRNLYVTGPFA 397
Query: 422 DVTVTMIGNYAGIACGYTTPLQGIG---RYARTI-HQQGCKDVACADDQLFGAAIDASRQ 477
+ ++GNY G++ T L GI TI ++QG + + +A +
Sbjct: 398 SSSEVLLGNYYGLSGKTTNILDGITANVSVGTTINYKQGILPYQANVNPIDWTTGEAKQM 457
Query: 478 ADATILVMGLD---QSIEAEAL------DRAGLLLPGRQQELVSKVSMASKGPTILVLMS 528
D I VMGL + E EA+ DR L LP Q + K+ + P I+VL +
Sbjct: 458 GDVIIAVMGLSGAYEGEEGEAIASPHKGDRLSLDLPEHQIAFLRKLRKDNDKPVIVVLTA 517
Query: 529 GGPIDVAFAKNDPRIA----AIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYI 584
G P+++ IA AI++A YPGQ GG A+ADILFG +P G+LP+T+
Sbjct: 518 GTPVNLT------EIAELADAIVFAWYPGQEGGKAVADILFGERSPSGRLPITF------ 565
Query: 585 TNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDG 644
P +E + P GRTYR+ +YPFG G+SY V D
Sbjct: 566 ---PKSEAQLPPYDDYSMQGRTYRYMTQEPMYPFGFGLSYAQ-------------VKFD- 608
Query: 645 RHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTL-LVFSTPPAGHWAPHK 703
N T+ G + + + V V+V N G ++ + L TP AG P
Sbjct: 609 -----NITL-GNTQALASKNEPQENMTVTVNVTNTGEREFEEVVQLYLKTPDAGVSQPLH 662
Query: 704 QLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAA 763
L F ++ + AG ++V +I K+L ++ G + G++++ +G S +
Sbjct: 663 SLKGFTRIKLAAGQTEQVLFSIP-KKHLYSINEQGKPVLLKGQYSVIVGNA--SPGNRSE 719
Query: 764 TLGVIK 769
TLG K
Sbjct: 720 TLGAAK 725
>gi|157676888|emb|CAP07659.1| beta-xylosidase [uncultured rumen bacterium]
Length = 761
Score = 365 bits (936), Expect = 6e-98, Method: Compositional matrix adjust.
Identities = 263/809 (32%), Positives = 376/809 (46%), Gaps = 146/809 (18%)
Query: 29 ACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWS 88
+C P++ + + S P R L+ +LSL+EK L+ + AV RLGIK Y WWS
Sbjct: 23 SCAPQE-----ISYTDKSQPAELRAKALLPKLSLEEKAGLVQYNSPAVERLGIKAYNWWS 77
Query: 89 EALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEAR-----AMY 143
EALHGV+ G AT FPQ I A+SF+ E + VSDEAR A
Sbjct: 78 EALHGVARNG----------SATVFPQPIGMAASFDVEKIETVFTAVSDEARVKNRIAAE 127
Query: 144 NGGT---AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGS-DGDRLK 199
+G AGL++W+PN+NIFRDPRWGRG ET GEDP L G+ + VRGLQG D D LK
Sbjct: 128 DGRVYQYAGLSFWTPNINIFRDPRWGRGMETYGEDPYLMGQLGMAVVRGLQGDPDADVLK 187
Query: 200 VAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQV 259
A KH+ + N R F+A+VS++D+ +T+ F+ V + V VM +YN+
Sbjct: 188 THACAKHYAVHSGLESN---RHRFDAQVSERDLRETYLPAFKDLVTKAGVKEVMTAYNRF 244
Query: 260 NGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYD--TQHFTSTPEEAAADAIRAGL 317
G P A ++++ +R EW G +VSDC ++ +++ F +T EEAAA A+ GL
Sbjct: 245 RGYPCAASEYLVQKILREEWGYKGLVVSDCWAIPDFFEPGRHGFVATGEEAAALAVANGL 304
Query: 318 DLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDV 377
D++CG +A+ +GLL E D++ L+ LT + RLG DGE P+ L P V
Sbjct: 305 DVECGSTFS-KIPAAIDQGLLKEEDLDRNLLRVLTERFRLGEMDGE---SPWDDLDPAIV 360
Query: 378 CTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGI--- 434
P+H+ L+L+ AR+ +VLL+N G LPL +A+IGPN+D GNY +
Sbjct: 361 EGPEHRALSLDIARETMVLLRNNG-VLPLKA--GEKIALIGPNADDAQMQWGNYNPVPKS 417
Query: 435 -------------------ACG------------------------------YTTPLQGI 445
ACG Y + I
Sbjct: 418 TITLLQAMQARVPGLVYDRACGILDAEYAPQGSAYANLIGASEAQLEAAARRYAVSVNDI 477
Query: 446 GRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL--------- 496
Y R +Q + D+ A + D + G+ +E E +
Sbjct: 478 KNYIRRDEEQRRSFMPALDE---AAVLKKLEGVDVVVFAGGISPRLEGEEMRVQVPGFSG 534
Query: 497 -DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQA 555
DR + LPG Q+ L+ + A K +LV SG I AI+ A YPGQ
Sbjct: 535 GDRTDIELPGVQRRLLKALHDAGK-KVVLVNFSGCAI--GLVPETESCDAILQAWYPGQE 591
Query: 556 GGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVV 615
GGTAIAD+LFG NP GKLP+T+Y + LP E G TYR+++G +
Sbjct: 592 GGTAIADVLFGDVNPSGKLPVTFYKN--VDQLPDVE-------DYNMEGHTYRYFRGEPL 642
Query: 616 YPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVD 675
YPFG+G+SYT+F + GK +++D
Sbjct: 643 YPFGYGLSYTSFAFG-------------------EPKVKGK--------------NLEID 669
Query: 676 VKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD 735
V N GS G + ++ P P K L AF +V VPAG +V I + +L +
Sbjct: 670 VTNTGSVAGTEVVQLYVRKPDDTAGPVKTLRAFRRVSVPAGQTVKVSIPLDKETFLWWSE 729
Query: 736 RSGTRRIPLGEHNIHIGGTKHSVSLHAAT 764
+ G + + GG+ + L + +
Sbjct: 730 KDQDMVPVRGRYELLCGGSSAASDLKSVS 758
>gi|410639677|ref|ZP_11350222.1| beta-glucosidase [Glaciecola chathamensis S18K6]
gi|410140558|dbj|GAC08409.1| beta-glucosidase [Glaciecola chathamensis S18K6]
Length = 733
Score = 365 bits (936), Expect = 7e-98, Method: Compositional matrix adjust.
Identities = 259/786 (32%), Positives = 388/786 (49%), Gaps = 89/786 (11%)
Query: 12 LILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLIS 71
+I+ +A+ S LA C A P+ LP +R++ LI ++L+EK L++
Sbjct: 1 MIIRNAAMSLLAL-----CVATPALANDQPWFDTQLPTQKRIDLLIDAMTLKEKTSQLVN 55
Query: 72 GAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAI 131
G A+ RLG+ Y++W+EALHGV+ G AT FPQ I A++F+ L
Sbjct: 56 GNVAIERLGLPEYDFWNEALHGVARNG----------RATVFPQAIGMAATFDQHLLLKA 105
Query: 132 GRVVSDEARAMYNGGT--------AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYA 183
V+SDEARA +N + +GLT+W+PN+NIFRDPRWGRGQET GEDP L+ +
Sbjct: 106 ASVISDEARAKFNVSSEIGNRSKYSGLTFWTPNINIFRDPRWGRGQETYGEDPYLTAQMG 165
Query: 184 ASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--RFHFNAKVSKQDIEDTFDVPFR 241
+ V GLQG LK AA+ KHF + +G + R F+A S +D+ +T+ F
Sbjct: 166 KAMVNGLQGDHPKYLKTAAAAKHFAVH-----SGPEALRHEFDAIASPKDMYETYFPAFE 220
Query: 242 MCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHF 301
V E V +VM +YN+VNG P +L +R +W +G++VSDC + ++
Sbjct: 221 ALVTEANVETVMAAYNRVNGHPAGGSDFLLNTVLRDKWGFSGHVVSDCWGLADFHQYHKV 280
Query: 302 TSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFD 361
T+ E+AA AI G DL+CG + AV+ GL+ E I+ L L + +LG FD
Sbjct: 281 TANAVESAALAINTGTDLNCGAVYNALPD-AVEAGLVDEKTIDKRLSKVLATKFKLGFFD 339
Query: 362 GEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNS 421
+ PY ++ V + H ++A E A + IVLL+N+ LPL R + V GP +
Sbjct: 340 PK-DDNPYNNISADVVNSEAHAQVAYEMAVKSIVLLQNKNNILPLDR-NIRNLYVTGPFA 397
Query: 422 DVTVTMIGNYAGIACGYTTPLQGIG---RYARTI-HQQGCKDVACADDQLFGAAIDASRQ 477
+ ++GNY G++ T L GI TI ++QG + + +A +
Sbjct: 398 SSSEVLLGNYYGLSGKTTNILDGITANVSVGTTINYKQGILPYQANVNPIDWTTGEAKQM 457
Query: 478 ADATILVMGLD---QSIEAEAL------DRAGLLLPGRQQELVSKVSMASKGPTILVLMS 528
D I VMGL + E EA+ DR L LP Q + K+ + P I+VL +
Sbjct: 458 GDVIIAVMGLSGAYEGEEGEAIASPHKGDRLSLDLPEHQIAFLRKLRKDNDKPVIVVLTA 517
Query: 529 GGPIDVAFAKNDPRIA----AIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYI 584
G P+++ IA AI++A YPGQ GG A+ADILFG +P G+LP+T+
Sbjct: 518 GTPVNLT------EIAELADAIVFAWYPGQEGGKAVADILFGERSPSGRLPITF------ 565
Query: 585 TNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDG 644
P +E + P RTYR+ +YPFG G+SY V D
Sbjct: 566 ---PKSEAQLPPYDDYSMQERTYRYMTQEPMYPFGFGLSYAQ-------------VKFD- 608
Query: 645 RHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTL-LVFSTPPAGHWAPHK 703
N T+ G + + + V V+V N G ++ + L TP AG P
Sbjct: 609 -----NITL-GNTQALASKNEPQENMTVTVNVTNTGEREFEEVVQLYLKTPDAGVSQPLH 662
Query: 704 QLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAA 763
L F ++ + AG ++V NI K+L ++ G + G++++ +G S +
Sbjct: 663 SLKGFTRIKLAAGQTEQVLFNIP-KKHLYSINAQGKPVLLKGQYSVIVGNA--SPGNRSE 719
Query: 764 TLGVIK 769
TLG K
Sbjct: 720 TLGAAK 725
>gi|333494646|gb|AEF56854.1| putative glycosyl hydrolase [synthetic construct]
Length = 743
Score = 365 bits (936), Expect = 7e-98, Method: Compositional matrix adjust.
Identities = 253/752 (33%), Positives = 367/752 (48%), Gaps = 113/752 (15%)
Query: 46 SLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGG 105
+L +R DL+ R++L+EK+ + A ++ RLG+ Y WW+EALHGV+ G
Sbjct: 34 NLSFEERARDLVSRMTLEEKIAQMQHEAPSIERLGVPAYNWWNEALHGVARAGV------ 87
Query: 106 DFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT--------AGLTYWSPNV 157
+T FPQ I A++F+A L E V+S E RA Y+ GLT+WSP +
Sbjct: 88 ----STMFPQAIGMAATFDAELIEKTADVISTEGRARYHEFQRKGDRDIYKGLTFWSPTI 143
Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
NI RDPRWGRGQET GEDP L+ + A S++RG+QG G LK AA KHF +
Sbjct: 144 NIDRDPRWGRGQETYGEDPYLTSRLAVSFIRGIQGR-GRYLKAAACAKHFAVHSGPE--- 199
Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
+R FNA+VS++D+ +T+ F V E KVA VM +YN+VNG P C +L +RG
Sbjct: 200 SERHQFNAEVSQKDLWETYLPAFEASVKEAKVAGVMGAYNRVNGEPCCGSGTLLGDVLRG 259
Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGL 337
EW GY+ SDC ++ + T T EE++A A+++G DL+CG + A + GL
Sbjct: 260 EWEFGGYVTSDCWAIKDINEGHGVTKTIEESSALAVKSGCDLNCGCAYASLVK-AYRAGL 318
Query: 338 LSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLL 397
+ E +I+ A+ + +MRLGMFD P PY + + +H+ ALE A + +VLL
Sbjct: 319 IGEKEIDTAVHRLMLTRMRLGMFDA-PEKVPYSSIPYEKNDCAEHRAFALEVAEKSLVLL 377
Query: 398 KNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIH 453
+N+ LPL R R+VAVIGPN+D V + GNY G A Y T L GI G AR +
Sbjct: 378 RNRSGFLPLDRSRIRSVAVIGPNADSRVALEGNYNGTASEYVTVLDGIREAVGDRARVYY 437
Query: 454 QQGCKDVACADDQLFGAAIDASRQAD--------------ATILVMGLDQSIEAE----- 494
+G LF ++ Q + ++ +GL++ IE E
Sbjct: 438 AEGS--------HLFRNSMGGLSQKNDRLAEAAAAAERADVAVVCLGLNRDIEGEEGDPS 489
Query: 495 ----ALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
A D+ L LPG Q+EL+ V A+ P +LVL+SG + V +A D A++ A
Sbjct: 490 NEYPAGDKRDLRLPGLQEELLETVK-ATGTPVVLVLLSGSALAVNWA--DENADAVVQAW 546
Query: 551 YPG-QAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRF 609
YPG QA G A LFG P G P T T R P
Sbjct: 547 YPGAQAEGRRGA--LFGIIRPAGGFPSR------STVRTRTSRIFGTIHENRLP-----L 593
Query: 610 YKGPVVYPFGHGMSYTNFVH-TVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRL 668
+G +YPFG+G+SYT F + + A + + D
Sbjct: 594 LQGDPLYPFGYGLSYTKFQYGDLKLAASEIPAGEDAE----------------------- 630
Query: 669 TLGVQVDVKNVGSKDGAHTLLVF-----STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGI 723
V V V+N G +D + ++ S+ P W QL F +VH+ G V
Sbjct: 631 ---VSVTVRNAGERDSDEVVQLYLQDLESSVPVPKW----QLAGFRRVHLKPGESAGVRF 683
Query: 724 NIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTK 755
+ + ++++D G + G ++ GG++
Sbjct: 684 TV-AARQMALIDEDGRCVLEPGGFRVYAGGSQ 714
>gi|313202830|ref|YP_004041487.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
gi|312442146|gb|ADQ78502.1| glycoside hydrolase family 3 domain protein [Paludibacter
propionicigenes WB4]
Length = 742
Score = 362 bits (930), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 248/760 (32%), Positives = 391/760 (51%), Gaps = 90/760 (11%)
Query: 26 EPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYE 85
+P A + ++A PF S I +RV DL+ RL+L EK ++ A A+ RLGI Y
Sbjct: 26 QPSAVNAQNAVK--YPFQDTSKTIDERVKDLVSRLTLDEKAGQMLHNAPAIKRLGILPYS 83
Query: 86 WWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN- 144
WW+EALHGV+ G AT FP+ + A++F+ L IG+ +SDEA A YN
Sbjct: 84 WWNEALHGVARTG----------RATVFPENVGLAATFDEDLVYRIGQAISDEAWAKYNI 133
Query: 145 -------GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR 197
G +G+T+++PNVNIFRDPRWGRGQET GEDP L+ + +YV+G+QG+D
Sbjct: 134 AQRLENYGQYSGITFYAPNVNIFRDPRWGRGQETYGEDPFLTSRMGVAYVKGMQGNDPKY 193
Query: 198 LKVAASCKHFTAYDLDNWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCS 255
LK AA KH+ + +G + R ++A+ +D +T+ F V EGKV SVMC+
Sbjct: 194 LKTAACAKHYVVH-----SGPEALRHSYDAEPPMKDFMETYVPAFETLVKEGKVESVMCA 248
Query: 256 YNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRA 315
YN+ G P C +L +R +W GY+ +DC ++ +Y EA A AI++
Sbjct: 249 YNRTFGKPCCGSSFLLHDLLREKWGFTGYVTTDCWAIQNFYLHHGAAKDSLEACALAIKS 308
Query: 316 GLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPK 375
G++L+CG + +AV++GL++E +++ AL L + RLG+FD P+ PY + +
Sbjct: 309 GVNLNCGNEFN-YLPAAVRKGLVTEKEVDEALSQLLRTRFRLGLFDS-PNENPYAKIKEE 366
Query: 376 DVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIA 435
+ + + +LA EAA + +VLL+N+ +LPL +++ V+GP + ++GNY G+
Sbjct: 367 VIGSQQNIDLAYEAAAKSLVLLQNKNNTLPLKK-DMKSLYVVGPYAANQDILLGNYNGVN 425
Query: 436 CGYTTPLQGI-GRYA--RTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIE 492
TT +Q I G+ + +++ + + + + +I + ADA + V G+ E
Sbjct: 426 SRLTTIMQAIVGKVSAGTSVNYRIGVEPSAPNKNSMNYSIGEAADADAVVAVFGISGVFE 485
Query: 493 AEA---------LDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRI 543
E DR L LP Q + + ++ K P ILVL G PI P +
Sbjct: 486 GEEGESTASTSRGDRLDLNLPQNQLDYLRELKKKCKKPIILVLTGGSPICT------PEL 539
Query: 544 A----AIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTE-MAMRPSQ 598
A AI++ YPGQ GG A+AD++FG NP G+L +T +P+ ++ LP E +M+
Sbjct: 540 ADMVDAILFVWYPGQEGGHAVADVIFGDVNPSGRLCIT-FPKS-VSQLPAFEDYSMK--- 594
Query: 599 SKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAI 658
GRTYR+ +YPFG G+SYTN+ ++ N T G+ + AT+S
Sbjct: 595 -----GRTYRYMTEEPLYPFGFGLSYTNYSYS--NIKTDKDKIKKGQSVHVTATVS---- 643
Query: 659 KVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP-PAGHWAPHKQLVAFEKVHVPAGA 717
N G G ++ T A P L ++V + AG
Sbjct: 644 -------------------NTGKTAGEEVAQLYITDVKASAPTPLYALKGTKRVKLAAGE 684
Query: 718 QQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHS 757
+ V + + + +V +G + I G+ ++I G+ S
Sbjct: 685 SKEVSFEV-TPQMMELVTVTGEKVIEPGDFKVYIAGSTPS 723
>gi|390956994|ref|YP_006420751.1| beta-glucosidase-like glycosyl hydrolase [Terriglobus roseus DSM
18391]
gi|390411912|gb|AFL87416.1| beta-glucosidase-like glycosyl hydrolase [Terriglobus roseus DSM
18391]
Length = 742
Score = 362 bits (929), Expect = 5e-97, Method: Compositional matrix adjust.
Identities = 250/739 (33%), Positives = 365/739 (49%), Gaps = 74/739 (10%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
+ +S PI R+ DLI R +LQEK L VPRLG+ + W++ LHGV + P
Sbjct: 38 YRDMSRPIEDRITDLIKRFTLQEKAMQLNHTNRGVPRLGLPMWGGWNQTLHGVWSKQP-- 95
Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG------LTYWSP 155
T FP ++++ L + +SDEARA+YN G L Y SP
Sbjct: 96 --------TTLFPIPTAMGATWDPELVHTVADAMSDEARALYNAHAEGPRTPHGLVYRSP 147
Query: 156 NVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNW 215
+NI RDPRWGR QE EDP+L+G+ +YVRGLQG D LK+AA+ KHF ++++
Sbjct: 148 VINISRDPRWGRIQEVFSEDPLLTGRMGVAYVRGLQGDDLQHLKLAATVKHFAVNNVES- 206
Query: 216 NGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTI 275
R H NA V ++++ + + +R +ME SVM SYN +NG+P + +L +
Sbjct: 207 ---GRQHLNADVDERNLFEFWLPHWRAAIMEAHAQSVMSSYNAINGMPDAVNHWLLTDVL 263
Query: 276 RGEWRLNGYIVSDCDSVGVYYDT---------QHFTSTPEEAAADAIRAGLDLDCGPFLG 326
R +W +G++ D +V + T QHF+ P AAA AIRAG D D F
Sbjct: 264 RKKWGFDGFVTDDLGAVALLSGTRATNTSEPGQHFSEDPVVAAAAAIRAGNDSDDVEF-E 322
Query: 327 LHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELA 386
+ AVQRGLL+E D++ AL N L V RLG +D P + Y +G V + H++L+
Sbjct: 323 TNLPLAVQRGLLTEKDVDGALRNVLRVGFRLGAYD-PPQASKYSRIGMDVVRSQAHRDLS 381
Query: 387 LEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG-- 444
A + + LL N+ LPL + ++VAVIGP GNY G T+ +G
Sbjct: 382 QRVAEESMTLLLNRRQFLPLQRDQVKSVAVIGPAGGEAYE-TGNYYGTPAVKTSVTEGLR 440
Query: 445 --IGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLL 502
+G + +++G V ADD+ A + +R++D +L +G + +EAE DR L
Sbjct: 441 ALLGSGVKVEYEKGAGYVDLADDKEIERAANLARKSDVVVLCLGTNLQVEAEGRDRRDLN 500
Query: 503 LPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIAD 562
LPG QQ L+ V A+ LVLM+ GP+ V +A + + AI+ A YPG+ GG AIA
Sbjct: 501 LPGAQQRLLEAV-YAANPKVALVLMNAGPLGVTWAHD--HVPAILSAWYPGELGGAAIAR 557
Query: 563 ILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGM 622
LFG +NPGG LP T Y NL S+ Y TY+++KG +YPFGHG+
Sbjct: 558 TLFGLNNPGGHLPYTVY-----ANLDGVPPQNEYDVSRGY---TYQYFKGVPLYPFGHGL 609
Query: 623 SYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSK 682
SYT+F ++ +KVT + + V + N G
Sbjct: 610 SYTHFDYS--------------------------KLKVTQTSGDHANVTVSFTLTNTGQS 643
Query: 683 DGAHTLLVFSTP-PAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRR 741
GA ++S + P + L FE+V + G + V I+I R
Sbjct: 644 AGAEVTQLYSHQVKSSEVQPLRTLRGFERVTLQPGESKAVAISIPTSALGWYDTAVHNFR 703
Query: 742 IPLGEHNIHIGGTKHSVSL 760
+ G N +G + + L
Sbjct: 704 VEPGAFNFMVGSSSEDIRL 722
>gi|332377068|gb|AEE64772.1| Xyl3A [Ruminococcus albus 8]
Length = 691
Score = 360 bits (925), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 255/754 (33%), Positives = 386/754 (51%), Gaps = 105/754 (13%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
+ SL +R L ++ +E+ L A A+ RLGI Y WW+E +HG++ G
Sbjct: 4 YLDESLSAEERAEALTDEMTTEEQASQLRYDAPAIERLGIPAYNWWNEGIHGLARSGV-- 61
Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYW 153
AT FPQ I A+ F+ L + + S+EARA YN T GLT W
Sbjct: 62 --------ATMFPQAIGLAAMFDDELTKRTAEITSEEARAKYNAYTVEGDRDIYKGLTLW 113
Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLD 213
+PN+NIFRDPRWGR ET GEDP L+ + + VRGLQG DG +K AA KHF +
Sbjct: 114 APNINIFRDPRWGRSHETFGEDPYLTAQNGKAVVRGLQG-DGKVMKAAACAKHFAVH--- 169
Query: 214 NWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
+G + R F+AK +D+E+T+ F V E KV SVM +YN+VNG P CA ++
Sbjct: 170 --SGPEALRHSFDAKADAKDMEETYLPAFEALVKEAKVESVMGAYNRVNGEPACASDYLM 227
Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTES 331
++ EW +GY VSDC ++ +++ T+ E+AA A++AG D++CG + +
Sbjct: 228 EKL--KEWEFDGYFVSDCWAIRDFHEHHMVTANAVESAAMALKAGCDVNCGCTYQ-NLLA 284
Query: 332 AVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAAR 391
A+ +GL+++ I A V+ + ++RLGMFD + + V +H+ ++LE A
Sbjct: 285 ALDKGLITKEQIRTACVHLMRTRIRLGMFD---KHTDFDDIPYSKVACAEHKAVSLECAE 341
Query: 392 QGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI-GRY-A 449
+ +VLLKN G LPL +++T+AVIGPN+D + GNY G++ YTT L GI R+
Sbjct: 342 KSLVLLKNNG-ILPLDDKKYKTIAVIGPNADSRTALEGNYNGLSDRYTTFLNGIQDRFEG 400
Query: 450 RTIHQQGC-------KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE-------- 494
R I +GC +A A D+ + A+ A++ AD I+ +GLD +IE E
Sbjct: 401 RVIFAEGCHLYKKSISGLAQAGDR-YAEAVAAAKNADLVIMCVGLDATIEGEEGDTGNEF 459
Query: 495 -ALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPG 553
+ D+ GL LP Q+ LV K+ M+ P + V+ +G I+ + + A+I A YPG
Sbjct: 460 SSGDKNGLTLPPPQKILVEKI-MSVGKPVVTVVCAGSAINT-----ESQPDALIHAFYPG 513
Query: 554 QAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKG 612
GG A+A++LFG +P GKLP+T+Y E LP T+ +M+ GRTYR+
Sbjct: 514 AEGGKALAEVLFGDVSPSGKLPVTFY--EDTDKLPEFTDYSMK--------GRTYRYTTD 563
Query: 613 PVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGV 672
+++PFG+G++Y V AV ++ SG+A +
Sbjct: 564 NILFPFGYGLTY--------GGVKVNAVEYKDGKAVVSVENSGRATE------------- 602
Query: 673 QVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLS 732
DV + KD + P+ L F++V + G + V I I K +
Sbjct: 603 --DVIELYLKDYCEQAV-----------PNVSLCGFKRVKLGEGEKATVEIAIPE-KAFT 648
Query: 733 VVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLG 766
VD +G R++ G + GT +L G
Sbjct: 649 AVDNNGVRKV-FGSKFTLLAGTHQPDALSEKLTG 681
>gi|325679939|ref|ZP_08159508.1| glycosyl hydrolase family 3 C-terminal domain protein [Ruminococcus
albus 8]
gi|324108377|gb|EGC02624.1| glycosyl hydrolase family 3 C-terminal domain protein [Ruminococcus
albus 8]
Length = 691
Score = 360 bits (925), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 255/754 (33%), Positives = 386/754 (51%), Gaps = 105/754 (13%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
+ SL +R L ++ +E+ L A A+ RLGI Y WW+E +HG++ G
Sbjct: 4 YLDESLSAEERAEALTDEMTTEEQASQLRYDAPAIERLGIPAYNWWNEGIHGLARSGV-- 61
Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYW 153
AT FPQ I A+ F+ L + + S+EARA YN T GLT W
Sbjct: 62 --------ATMFPQAIGLAAMFDDELTKRTAEITSEEARAKYNAYTVEGDRDIYKGLTLW 113
Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLD 213
+PN+NIFRDPRWGRG ET GEDP L+ + + VRGLQG DG +K AA KHF +
Sbjct: 114 APNINIFRDPRWGRGHETFGEDPYLTAQNGKAVVRGLQG-DGKVMKAAACAKHFAVH--- 169
Query: 214 NWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
+G + R F+AK +D+E+T+ F V E KV SVM +YN+VNG P CA ++
Sbjct: 170 --SGPEALRHSFDAKADAKDMEETYLPAFEALVKEAKVESVMGAYNRVNGEPACASDYLM 227
Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTES 331
++ EW +GY VSDC ++ +++ T+ E+AA A++AG D++CG + +
Sbjct: 228 EKL--KEWEFDGYFVSDCWAIRDFHEHHMVTANAVESAAMALKAGCDVNCGCTYQ-NLLA 284
Query: 332 AVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAAR 391
A+ +GL+++ I A V+ + ++RLGMFD + + V +H+ ++LE A
Sbjct: 285 ALDKGLITKEQIRTACVHLMRTRIRLGMFD---KHTDFDDIPYSKVACAEHKAVSLECAE 341
Query: 392 QGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI-GRY-A 449
+ +VLLKN G LPL +++T+AVIGPN+D + GNY G++ YTT L GI R+
Sbjct: 342 KSLVLLKNNG-ILPLDDKKYKTIAVIGPNADSRTALEGNYNGLSDRYTTFLNGIQDRFEG 400
Query: 450 RTIHQQGC-------KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE-------- 494
R I +GC +A A D+ + A+ A++ AD I+ +GLD +IE E
Sbjct: 401 RVIFAEGCHLYKKSISGLAQAGDR-YAEAVAAAKNADLVIMCVGLDATIEGEEGDTGNEF 459
Query: 495 -ALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPG 553
+ D+ GL LP Q+ LV K+ M+ P + V+ +G I+ + + A+I A YPG
Sbjct: 460 SSGDKNGLTLPPPQKILVEKI-MSVGKPVVTVVCAGSAINT-----ESQPDALIHAFYPG 513
Query: 554 QAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKG 612
G A+A++LFG +P GKLP+T+Y E LP T+ +M+ GRTYR+
Sbjct: 514 AEGSKALAEVLFGDVSPSGKLPVTFY--EDTDKLPEFTDYSMK--------GRTYRYTTD 563
Query: 613 PVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGV 672
+++PFG+G++Y V AV ++ SG+A +
Sbjct: 564 NILFPFGYGLTY--------GGVKVNAVEYKDGKAVVSVENSGRATE------------- 602
Query: 673 QVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLS 732
DV + KD + P+ L F++V + G + V I I K +
Sbjct: 603 --DVIELYLKDYCEQAV-----------PNVSLCGFKRVKLGEGEKATVEIAIPE-KAFT 648
Query: 733 VVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLG 766
VD +G R++ G + GT +L G
Sbjct: 649 AVDNNGVRKV-FGSKFTLLAGTHQPDALSEKLTG 681
>gi|332638085|ref|ZP_08416948.1| glycoside hydrolase family 3 protein [Weissella cibaria KACC 11862]
Length = 713
Score = 360 bits (924), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 241/729 (33%), Positives = 360/729 (49%), Gaps = 94/729 (12%)
Query: 56 LIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQ 115
++ ++++ EK+ + A A+ RL I Y +W+EALHGV+ G AT FPQ
Sbjct: 17 IVDQMTIDEKIGQIKYEAPAIERLNIPEYNYWNEALHGVARAGV----------ATVFPQ 66
Query: 116 VITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRDPRWGR 167
I A++F+ L I V+ E RA YN T GLT+WSPNVNIFRDPRWGR
Sbjct: 67 AIGLAATFDDQLINDIADVIGTEGRAKYNEFTKHEDRDIYKGLTFWSPNVNIFRDPRWGR 126
Query: 168 GQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
G ET GEDP L+ K+ ++++GLQG LK+AA+ KHF + G+ R F+A V
Sbjct: 127 GHETYGEDPFLTSKFGVAFIKGLQGQ-AKYLKLAATAKHFAVH--SGPEGL-RHGFDAVV 182
Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
S +D+ +T+ F+ V E V S+M +YN V+GVP +L+ + +W G++VS
Sbjct: 183 SDKDLYETYLPAFKAAVEEADVESIMTAYNAVDGVPASVSEMLLRDILHDKWSFEGHVVS 242
Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPF-LGLHTESAVQRGLLSEIDINNA 346
D + ++ +T E AI+AGL+L G LH A+ RGL++E +I NA
Sbjct: 243 DYMAPEDVHENHKYTKDAAETMGLAIKAGLNLVAGHIEQSLH--EALNRGLVTEEEITNA 300
Query: 347 LVNTLTVQMRLGMF--DGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSL 404
+++ ++RLGMF D E + PY + T H L+ AA + VLLKN G L
Sbjct: 301 VISLYATRVRLGMFATDNEYDAIPY-----EANDTKAHNNLSEIAAEKSFVLLKNDG-VL 354
Query: 405 PLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYART-------IH 453
PL +AV+GPN+ + ++GNY G T L+GI G R +
Sbjct: 355 PLRKETMEAIAVVGPNAHSEIALLGNYFGTPSRSYTILEGIQERLGDDVRVHYSIGSGVF 414
Query: 454 QQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRAGLLLP 504
Q + D+ AI A+ +D + V+GLD +IE E A D+ L LP
Sbjct: 415 QDHAAEPLAKADERESEAIIAAEHSDVIVAVLGLDSTIEGEEGDAGNSQGAGDKPNLSLP 474
Query: 505 GRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADIL 564
GRQ++L+ ++ +A P +++L SG + + +N P + AI+ YPG GG A+AD+L
Sbjct: 475 GRQRQLLERL-LAVGKPVVVLLASGSSLQLDGLENHPNLRAIMQIWYPGARGGLAVADVL 533
Query: 565 FGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSY 624
FGT +P GKLP+T+Y NLP E GRTYR+ +YPFG+G++Y
Sbjct: 534 FGTVSPSGKLPVTFYKN--TDNLPAFE-------DYNMAGRTYRYMTEEALYPFGYGLTY 584
Query: 625 TNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDG 684
S + K T V ++N G+ D
Sbjct: 585 -----------------------------SSVELSDLQVKSYEETATATVTIQNTGNFDT 615
Query: 685 AHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIP 743
+ V+ +A P+ QL F++V + G++Q + ++ + V D G I
Sbjct: 616 DEVVQVYVKDLESEFAVPNAQLKGFKRVFLGKGSKQTITFDLRPQDF-EVFDEQGHNFID 674
Query: 744 LGEHNIHIG 752
I +G
Sbjct: 675 SNRFEISVG 683
>gi|167537541|ref|XP_001750439.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163771117|gb|EDQ84789.1| predicted protein [Monosiga brevicollis MX1]
Length = 834
Score = 359 bits (921), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 236/748 (31%), Positives = 369/748 (49%), Gaps = 69/748 (9%)
Query: 32 PKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEK-VKLLISGAAAVPRLGIKGYEWWSEA 90
P + + PFC L + R+ DL+ R+S + +L +A + +G+ Y W + A
Sbjct: 99 PCMGASSSYPFCDTKLSVDDRLKDLVSRVSTADAATQLRARESAQIDNIGLPAYYWGTNA 158
Query: 91 LHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT-AG 149
+HG+ N T D TSFP +++FN +L + +GR++ E RA YN G
Sbjct: 159 IHGMQN----TACLADGQCPTSFPAPNGLSATFNYSLVKDMGRIIGRELRAYYNTKFHNG 214
Query: 150 LTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQ-GSDGDRLKVAASCKHFT 208
L WSP +N RDPRWGR E+PGE P + G+Y A+Y GLQ G D D + + KH+
Sbjct: 215 LDTWSPTINPSRDPRWGRNVESPGESPFVCGQYGAAYTEGLQNGDDKDYTQAVVTLKHWV 274
Query: 209 AYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADP 268
AY +++++ V R+ +NA VS+ D+ DT+ + V K VMCSYN +NGVPTC +P
Sbjct: 275 AYSVEDYDNVTRYEYNAIVSEYDLMDTYFPGWEYVVKNAKPLGVMCSYNSLNGVPTCGNP 334
Query: 269 NILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLH 328
L +R +W GYI SD DS+ + H+ S A D + G D+D G +
Sbjct: 335 -ALTAYLREDWGFEGYITSDSDSIHCIWADHHYESNAVLATRDGLLGGCDIDSGDTYADN 393
Query: 329 TESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALE 388
E+AV + L++ ++ AL N+ ++ LG+FD ++ Y + +V QE +L
Sbjct: 394 LEAAVNQSLVNRSAVDAALTNSYRMRFNLGLFDPNVTN-AYDRISADEVGMSSSQETSLL 452
Query: 389 AARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY 448
AAR+ + LLKN G +LP + + VAVIG +S+ ++GNY G C G
Sbjct: 453 AARKSMTLLKNDGQTLPFA--TGKKVAVIGKSSNSAEDILGNYVGPICP-----SGAFDC 505
Query: 449 ARTIHQ------QGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLL 502
+T++Q QG D AI + AD +L + + E DR +
Sbjct: 506 VQTLYQGVAAANQGGATTLSDDVADINTAIQLAMDADQVVLTIS-NYGQAGEGKDRTYIG 564
Query: 503 LPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIAD 562
L QQELV+ V K PT +V+++GG I + + K++ + AI+ A PG GG A+A+
Sbjct: 565 LDTDQQELVAAVLKVGK-PTAIVMLNGGLISLDWIKDEAQ--AILVAFAPGVHGGQAVAE 621
Query: 563 ILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRP-----------SQSKRYPGRTYRFYK 611
+FG +NPGGKLP+T Y +Y+ ++ M+M+ + PGR+Y++Y
Sbjct: 622 TIFGANNPGGKLPVTMYASDYVNDVDFLNMSMQAVAVLHLMNVNGERDDTGPGRSYKYYT 681
Query: 612 GPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLG 671
G +YPF +G+SYT F + + AP + + +T+ T
Sbjct: 682 GEPLYPFAYGLSYTTFNLSWSPAPPMT---------TFTSTLRSTTYTATVTN------- 725
Query: 672 VQVDVKNVGSKDGAHTLLVFSTP--------PAGHWAPHKQLVAFEKVHVPAGAQQRVGI 723
GS G + F P P G+ P K++ F++V + G +V
Sbjct: 726 -------TGSVGGDEVVFAFYKPKSESLKTLPVGNPVPIKEIFGFQRVALGPGQSTQVTF 778
Query: 724 NIHVCKYLSVVDRSGTRRIPLGEHNIHI 751
++ + L+ V G R + GE I +
Sbjct: 779 ELNA-ETLAQVTLDGHRELHSGEFEIEL 805
>gi|405955586|gb|EKC22647.1| Putative beta-D-xylosidase 2 [Crassostrea gigas]
Length = 745
Score = 359 bits (921), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 247/747 (33%), Positives = 378/747 (50%), Gaps = 85/747 (11%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISG--------AAAVPRLGIKGYEWWSEALH 92
PF SLP RV DL+ RL+++E V + G A AVPRLG+ + W +E L
Sbjct: 27 PFRNTSLPWDARVKDLVDRLTIEEIVVQMSRGGSGPRASPAPAVPRLGVGPFSWNTECLR 86
Query: 93 GVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN-------- 144
G G ATSFPQ + A++F+ + + S E RA +N
Sbjct: 87 GDVYAG----------NATSFPQALGLAATFSTEVICDVASATSIEVRAKFNDYQRRKIY 136
Query: 145 GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASC 204
G G++ +SP +NI R P WGR QET GEDP LSG+ AA +V+ LQG D ++ A C
Sbjct: 137 GDHKGISCFSPVINIMRHPLWGRNQETYGEDPFLSGELAAIFVKCLQGDDPTYIRANAGC 196
Query: 205 KHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
KHF + V RF F+AKVS++D TF F+ CV G S+MCS+N++NGVP
Sbjct: 197 KHFDVHGGPENIPVSRFSFDAKVSERDWRLTFLPAFKRCVQAGSY-SLMCSFNRINGVPA 255
Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPF 324
C + +L +R EW GY+VSD +++ H+T+ + AA ++AG +L+
Sbjct: 256 CGNKRLLTDILRTEWGFTGYVVSDQEAIENIMTYHHYTNNSVDTAALCVKAGCNLELSTN 315
Query: 325 ----LGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTP 380
+ A++ G L + D+ ++ +MRLG FD P PY + + +
Sbjct: 316 EVKPTYFYIIDALKAGKLDKEDLVKSVSPLFYTRMRLGEFD-PPDHNPYNFIDLSVIQSE 374
Query: 381 DHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYA-GIACGY- 438
+H+ ++L AA + VLLKN+G LP++ + T++V+GP +D IG+YA + Y
Sbjct: 375 EHRAISLNAAMKSFVLLKNKGGFLPITKL-FDTISVLGPMADNKYQQIGSYAPDVMPSYT 433
Query: 439 TTPLQGIGRYARTI-HQQGCKDVACA--DDQLFGAAIDASRQADATILVMGLDQSIEAEA 495
TTPLQG+ + ++ + + GC D AC+ + A+++S D + +G IE E
Sbjct: 434 TTPLQGLSKLSKRVQYAAGCNDNACSKYNRTEIQRAVNSS---DIFFVCLGTGPMIENED 490
Query: 496 LDRAGLLLPGRQQELVSKVSMAS-KG-PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPG 553
DRA + LPG+Q +L+ M S KG P +L+L +GGP+++ +A R+ AI+ +P
Sbjct: 491 HDRASMELPGQQAQLLKDAIMFSAKGVPIVLLLFNGGPVNITWADRSDRVVAIMECFFPA 550
Query: 554 QAGGTAIADILFGT---SNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRF 609
Q G A+ ++ T SNP G+LP TW +Y +P M +M GRTYR+
Sbjct: 551 QETGEAVLRVVTNTGNSSNPAGRLPYTW--PKYQDQIPSMVNYSME--------GRTYRY 600
Query: 610 YKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT 669
+ G +YPFG+G+SY+ F T A +++ D
Sbjct: 601 FHGDPLYPFGYGLSYSTFNFTNAWMNPIISQGQD-------------------------- 634
Query: 670 LGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVC 728
L V+V+V N G DG + V+ P QLV FE+V + A I +
Sbjct: 635 LTVRVEVCNEGPTDGDEVIQVYLKWLDTNETMPIHQLVGFERVSLRAKETLSWLITVRA- 693
Query: 729 KYLSVVDRSGTRRIPLGEHNIHIGGTK 755
+ ++V + S I G + ++IGG +
Sbjct: 694 ENMAVWNESRGFYIEPGRYRLYIGGQQ 720
>gi|164428543|ref|XP_964543.2| hypothetical protein NCU00709 [Neurospora crassa OR74A]
gi|157072187|gb|EAA35307.2| hypothetical protein NCU00709 [Neurospora crassa OR74A]
Length = 786
Score = 359 bits (921), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 235/624 (37%), Positives = 329/624 (52%), Gaps = 76/624 (12%)
Query: 73 AAAVPRLGIKGYEWWSEALHGVSNVGPGTKF---GGDFPGATSFPQVITTASSFNATLWE 129
A RLG+ Y WWSE LHGV+ PG KF G F ATSF I +SF+ L
Sbjct: 8 ALGASRLGLPKYAWWSEGLHGVAG-SPGVKFNTTGYPFSYATSFANAINLGASFDDDLVY 66
Query: 130 AIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRG 189
+G +S EARA N G GL YW+PNVN ++DPRWGRG ETPGEDP+ Y + + G
Sbjct: 67 EVGTAISTEARAFANFGFGGLDYWTPNVNPYKDPRWGRGAETPGEDPLHIKGYVKAILAG 126
Query: 190 LQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKV 249
L+G++ R KV A+CKH+ AYDL+ W+G+ R+ F A V+ QD+ + + PF+ C + KV
Sbjct: 127 LEGNETVR-KVIATCKHYAAYDLERWHGLTRYEFEAIVTLQDLSEYYLPPFQQCARDSKV 185
Query: 250 ASVMCSYN--------------QVN---GVPTCADPNILKRTIRGEWRL---NGYIVSDC 289
S+MCSYN ++N P CA P ++ +R W N YI SDC
Sbjct: 186 GSIMCSYNALTIRDMASGKPDEEINLTTAQPACAKPYLMT-ILRDHWNWTEHNNYITSDC 244
Query: 290 DSVGVYY-DTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE--SAVQRGLLSEIDINNA 346
+++ + D +F+ TP EAAA A +AG D C T+ A + LL E I+ A
Sbjct: 245 NAILDFLPDNHNFSQTPAEAAAAAYKAGTDTVCEVSGSPLTDVVGAYNQSLLPEAVIDTA 304
Query: 347 LVNTLTVQMRLGMFD------------GEPSSQPYGHLGPKDVCTPDHQELALEAARQGI 394
L +R G D G SS Y L +DV TP QELAL +A +GI
Sbjct: 305 LRRLYEGLIRAGYLDHGRSSAVAGGDGGSFSSPAYDALNWEDVNTPSTQELALRSATEGI 364
Query: 395 VLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQ 454
VLLKN G LPL + VA+IG ++ T TM G Y+GI Y PL + +
Sbjct: 365 VLLKNAGSLLPLD-FSGKKVALIGHWANATGTMRGPYSGIPPFYHNPLYAAQQLNLSFSY 423
Query: 455 QGCKDVACAD-DQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSK 513
V +D D A+ A+ AD + G D ++ +E LDR + P Q +L+S+
Sbjct: 424 ANGPVVNASDPDTWTAPALAAAEGADVVLYFGGTDTTVASEDLDRESIAWPETQMQLLSE 483
Query: 514 VSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGK 573
+A G ++V+ G +D + N+ +++I+W GYPGQ+GGTA+ D+L G P G+
Sbjct: 484 --LAGLGKPLVVIQLGDQVDDSSLLNNGNVSSILWVGYPGQSGGTAVFDVLTGKKAPAGR 541
Query: 574 LPMTWYPQEYITNLPMTEMAMRP------------------------------SQSKRYP 603
LP+T YP+ Y+ +P+TEMA+RP +++ P
Sbjct: 542 LPVTQYPEGYVDEVPLTEMALRPFNYSSSSNLEQEVSVQGRGSLTIQPRSTPGNKTLSSP 601
Query: 604 GRTYRFYKGPVVYPFGHGMSYTNF 627
GRTY++Y PV+ PFG+G+ YT F
Sbjct: 602 GRTYKWYSSPVL-PFGYGLHYTTF 624
>gi|336463686|gb|EGO51926.1| hypothetical protein NEUTE1DRAFT_125528 [Neurospora tetrasperma
FGSC 2508]
Length = 788
Score = 356 bits (913), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 232/628 (36%), Positives = 325/628 (51%), Gaps = 79/628 (12%)
Query: 73 AAAVPRLGIKGYEWWSEALHGVSNVGPGTKF---GGDFPGATSFPQVITTASSFNATLWE 129
A R+G+ Y WWSE LHGV+ PG F G F ATSF I +SF+ L
Sbjct: 8 ALGASRIGLPKYAWWSEGLHGVAG-SPGVTFNTTGYPFSYATSFANAINLGASFDDDLVY 66
Query: 130 AIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRG 189
+G +S EARA N G GL YW+PNVN ++DPRWGRG ETPGEDP+ Y + + G
Sbjct: 67 EVGTAISTEARAFANFGFGGLDYWTPNVNPYKDPRWGRGAETPGEDPLHIKGYVKAMLAG 126
Query: 190 LQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKV 249
L+G++ R KV A+CKH+ AYDL+ W+G+ R+ F A V+ QD+ + + PF+ C + KV
Sbjct: 127 LEGNETVR-KVIATCKHYAAYDLERWHGLTRYEFEAIVTLQDLSEYYLPPFQQCARDSKV 185
Query: 250 ASVMCSYNQV-----------------NGVPTCADPNILKRTIRGEWRL---NGYIVSDC 289
S+MCSYN + P CA+ L +R W N YI SDC
Sbjct: 186 GSIMCSYNALTIRDMAGGNPDEIINLTTAQPACAN-TYLMTILRDHWNWTEHNNYITSDC 244
Query: 290 DSVGVYY-DTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE--SAVQRGLLSEIDINNA 346
+++ + D +F+ TP EAAA A +AG D C T+ A + LL E I+ A
Sbjct: 245 NAILDFLPDNHNFSQTPAEAAAAAYKAGTDTVCEVSGSPLTDVVGAYNQSLLPEAVIDTA 304
Query: 347 LVNTLTVQMRLGMFD------------GEPSSQPYGHLGPKDVCTPDHQELALEAARQGI 394
L +R G D G SS Y L +DV TP QELAL +A +GI
Sbjct: 305 LRRLYEGLIRAGYLDHGRSSAVAGGDGGSFSSPAYDALNWEDVNTPSTQELALRSATEGI 364
Query: 395 VLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQ 454
VLLKN G LPL + VA+IG ++ T TM G Y+GI Y PL + +
Sbjct: 365 VLLKNSGSLLPLDFSSGKKVALIGHWANATGTMRGPYSGIPPFYHNPLYAAQQLNLSFSY 424
Query: 455 QGCKDVACAD-DQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSK 513
V +D D A+ A+ AD + G D ++ +E LDR + P Q +L+S+
Sbjct: 425 ANGPVVNASDPDTWTAPALAAAEGADVVLYFGGTDTTVASEDLDRESIAWPKAQMKLLSE 484
Query: 514 VSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGK 573
+A G ++V+ G +D +F + +++I+W GYPGQ+GGTA+ D+L G P G+
Sbjct: 485 --LAGLGKPLVVIQLGDQVDDSFLLENGNVSSILWVGYPGQSGGTAVFDVLTGKKAPAGR 542
Query: 574 LPMTWYPQEYITNLPMTEMAMRP----------------------------------SQS 599
LP+T YP+ Y+ +P+TEMA+RP +++
Sbjct: 543 LPVTQYPEGYVDEVPLTEMALRPFNHSSSTSSSSNPEEEVSVQGSGSLTIQPRSTPGNKT 602
Query: 600 KRYPGRTYRFYKGPVVYPFGHGMSYTNF 627
PGRTY++Y PV+ PFG+G+ YT F
Sbjct: 603 LSSPGRTYKWYSNPVL-PFGYGLHYTTF 629
>gi|110737298|dbj|BAF00595.1| xylosidase [Arabidopsis thaliana]
Length = 303
Score = 355 bits (910), Expect = 7e-95, Method: Compositional matrix adjust.
Identities = 165/222 (74%), Positives = 188/222 (84%), Gaps = 1/222 (0%)
Query: 12 LILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLIS 71
+ LL S + R FACDP + TRTL FC+ ++PI RV DL+GRL+LQEK++ L++
Sbjct: 19 VFLLCLVHSSESLRPLFACDPANGLTRTLRFCRANVPIHVRVQDLLGRLTLQEKIRNLVN 78
Query: 72 GAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAI 131
AAAVPRLGI GYEWWSEALHG+S+VGPG KFGG FPGATSFPQVITTA+SFN +LWE I
Sbjct: 79 NAAAVPRLGIGGYEWWSEALHGISDVGPGAKFGGAFPGATSFPQVITTAASFNQSLWEEI 138
Query: 132 GRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQ 191
GRVVSDEARAMYNGG AGLTYWSPNVNI RDPRWGRGQETPGEDP+++ KYAASYVRGLQ
Sbjct: 139 GRVVSDEARAMYNGGVAGLTYWSPNVNILRDPRWGRGQETPGEDPIVAAKYAASYVRGLQ 198
Query: 192 GS-DGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDI 232
G+ G+RLKVAA CKH+TAYDLDNWNGVDRFHFNAKV+ I
Sbjct: 199 GTAAGNRLKVAACCKHYTAYDLDNWNGVDRFHFNAKVNLLHI 240
>gi|429738050|ref|ZP_19271875.1| glycosyl hydrolase family 3 protein [Prevotella saccharolytica
F0055]
gi|429161155|gb|EKY03583.1| glycosyl hydrolase family 3 protein [Prevotella saccharolytica
F0055]
Length = 722
Score = 354 bits (908), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 242/729 (33%), Positives = 378/729 (51%), Gaps = 86/729 (11%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
Q+ +I +L+L EK+ L A + RLGIK Y W +EALHGV G A
Sbjct: 33 QKAKSIISQLTLDEKISQLTQDAKGIDRLGIKPYYWLNEALHGVGRDGR----------A 82
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMY--------NGGTAGLTYWSPNVNIFRD 162
T FPQ I ++F+ + IG ++ E RA + AGLT+W+PNVNIFRD
Sbjct: 83 TVFPQPINLGATFDPKIVHQIGDAIATEGRAKFIVAQRQKNYSMYAGLTFWAPNVNIFRD 142
Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFH 222
PRWGRG ET GEDP L+G ++V+G+QG D LK AA KHF + +G +R
Sbjct: 143 PRWGRGMETYGEDPFLTGTLGTAFVKGMQGDDPFYLKAAACGKHFAVH-----SGPERTR 197
Query: 223 FNAKV--SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
A V +K+D+ +T+ F+M V +GKV S+M +Y ++ G +L +R +W
Sbjct: 198 HTANVEPTKRDLYETYLPAFKMLVQKGKVESIMGAYQRLYGESCSGSKYLLTDILRKDWG 257
Query: 281 LNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSE 340
G++VSDC +V Y+ + EA A AI+AGL+L+CG + + A+Q+ L++E
Sbjct: 258 FKGHVVSDCGAVTDMYEGHKLVKSEAEAVAFAIKAGLNLECGNSM-RTMKDAIQQKLITE 316
Query: 341 IDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
D++ AL+ + +++LG+ + + PY + + ++++A +AA + +VLLKN
Sbjct: 317 KDLDKALLPLMMTRLKLGILQPDAAC-PYNEFPESVIGSEANRKIAEQAAEESMVLLKNN 375
Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI-GRYARTI---HQQG 456
G LP++ RT+ V GP + ++GNY G++ Y+T L+GI G+ + ++QG
Sbjct: 376 G-VLPIAK-DIRTLFVTGPGATDAYYLMGNYFGLSNRYSTYLEGIVGKVSNGTSVNYKQG 433
Query: 457 CKDVACADDQLFGAAIDASRQADATILVMGLDQSIE---------AEALDRAGLLLPGRQ 507
V + + ++ SR A+ +IL+MG + E AE DR L LP Q
Sbjct: 434 FMQVFKNLNDV-NWSVSESRGAEVSILIMGNSGNTEGEEGDAIASAERGDRVNLRLPDSQ 492
Query: 508 QELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIA-AIIWAGYPGQAGGTAIADILFG 566
E + +VS ++VL G PIDV K +A A++ A YPGQ GG A+A++LFG
Sbjct: 493 MEYLREVSKDRTNKLVVVLTGGSPIDV---KEITELADAVVMAWYPGQEGGVALANLLFG 549
Query: 567 TSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYT 625
+N G+LP+T+ E LP + +M+ GRTY++ ++YPFG+G+SY+
Sbjct: 550 DANFSGRLPVTF--PESADRLPAFDDYSMK--------GRTYKYMTDNILYPFGYGLSYS 599
Query: 626 NFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGA 685
++ NA ++ K T + V VDV N G
Sbjct: 600 KVTYS-------------------NAAVTKMPTKTT-------PMTVYVDVTNNGDMPVD 633
Query: 686 HTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPL 744
+ V+ STP AG+ +P + L+ F++V + I + + L V GT ++
Sbjct: 634 EVVQVYLSTPGAGNTSPIESLIGFKRVKIYPHITVTKDFQIPM-ELLETVQADGTSKLLK 692
Query: 745 GEHNIHIGG 753
GE+ I I G
Sbjct: 693 GEYQIKISG 701
>gi|121700633|ref|XP_001268581.1| beta-xylosidase XylA [Aspergillus clavatus NRRL 1]
gi|119396724|gb|EAW07155.1| beta-xylosidase XylA [Aspergillus clavatus NRRL 1]
Length = 743
Score = 353 bits (907), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 237/719 (32%), Positives = 354/719 (49%), Gaps = 86/719 (11%)
Query: 52 RVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGAT 111
R LI +L+E V + + VPRLG+ Y+ W+EALHG+ + G F +T
Sbjct: 72 RAAALISLFTLEELVNATGNTSPGVPRLGLPPYQVWNEALHGLDRAYFTDE--GQFSWST 129
Query: 112 SFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQET 171
SFP I T S+ N TL + ++S + RA N G GL +SPN+N FR P WGRGQET
Sbjct: 130 SFPMPILTMSALNRTLINQVASIISTQGRAFSNAGRYGLDVYSPNINSFRHPVWGRGQET 189
Query: 172 PGEDP-VLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSK 229
PGED LS YA Y+ G+QG D LK+ A+ KH+ YD++NW+G R + +++
Sbjct: 190 PGEDAYCLSSAYAYEYITGIQGGVDPKSLKLVATAKHYAGYDIENWDGHSRLGNDMNITQ 249
Query: 230 QDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRL--NGYIVS 287
QD+ + + F + + KV SVMCSYN VNGVP+CA+ L+ +R + +GYI S
Sbjct: 250 QDLSEYYTPQFLVAARDAKVRSVMCSYNAVNGVPSCANSFFLQTLLRDTFGFVEDGYISS 309
Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
DCDS ++ + + AAAD+IRAG D+DCG + + AV + LLS DI +
Sbjct: 310 DCDSAYNVFNPHEYAANVSSAAADSIRAGTDIDCGTTYQYYFDEAVDQNLLSRADIERGV 369
Query: 348 VNTLTVQMRLGMFDGEP----SSQPYG-HLGPKD-VCTPDHQELALEAARQGIVLLKNQG 401
+ + MRLG FD P S+Q G + GP + +P L+A R
Sbjct: 370 IRLYSNLMRLGYFDVGPWMNVSTQLQGNYFGPAPYLISP------LDAFRD--------- 414
Query: 402 PSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIAC-GYTTPLQGIGRYARTIHQQGCKDV 460
SH+ DV N + + G++ L +
Sbjct: 415 -----SHL------------DVNYAFGTNISSNSTDGFSKALSAAKK------------- 444
Query: 461 ACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKG 520
+D +F ID S +A E LDR + PG+Q EL+ ++S K
Sbjct: 445 --SDAIIFAGGIDNSLEA---------------ETLDRMNITWPGKQLELIDQLSQLGK- 486
Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
P I++ M GG +D + K++ + ++IW GYPGQ+GG A+ DI+ G P G+L +T YP
Sbjct: 487 PLIVLQMGGGQVDSSLLKSNKNVNSLIWGGYPGQSGGQALLDIITGKRAPAGRLVVTQYP 546
Query: 581 QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
EY T P T+M++RP + PG+TY +Y G VY FGHG+ YT F V++A V
Sbjct: 547 AEYATQFPATDMSLRPHGNN--PGQTYMWYTGTPVYEFGHGLFYTTF--RVSHARAVKIK 602
Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA 700
P +I ++ H + L VD+ N G +T ++F+ AG
Sbjct: 603 PT----YNIQDLLAQPHPGYIHVEQMPF-LNFTVDITNTGKASSDYTAMLFANTTAGPAP 657
Query: 701 -PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSV 758
P K LV F+++ + ++ ++ D G R + G++ + + + V
Sbjct: 658 YPKKWLVGFDRLPTLGPSTSKLMTIPVTINSMARTDELGNRVLYPGKYELALNNERSVV 716
>gi|323344407|ref|ZP_08084632.1| beta-glucosidase [Prevotella oralis ATCC 33269]
gi|323094534|gb|EFZ37110.1| beta-glucosidase [Prevotella oralis ATCC 33269]
Length = 722
Score = 353 bits (907), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 243/729 (33%), Positives = 380/729 (52%), Gaps = 86/729 (11%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
Q+ +I +L+L EK+ L A + RLGIK Y W +EALHGV G A
Sbjct: 33 QKAKSIISQLTLDEKISQLTQDAKGIDRLGIKPYYWLNEALHGVGRDGR----------A 82
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMY--------NGGTAGLTYWSPNVNIFRD 162
T FPQ I+ ++F+ + + IG ++ E RA + AGLT+W+PNVNIFRD
Sbjct: 83 TVFPQPISLGATFDPEIVQQIGDAIATEGRAKFIVAQRQKNYSMYAGLTFWAPNVNIFRD 142
Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFH 222
PRWGRG ET GEDP L+G ++V+G+QG+D LK AA KHF + +G +R
Sbjct: 143 PRWGRGMETYGEDPFLTGVLGTAFVKGMQGNDPFYLKAAACGKHFAVH-----SGPERTR 197
Query: 223 FNAKV--SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
A V +K D+ +T+ F+M V +GKV S+M +Y ++ G +L +R +W
Sbjct: 198 HTANVEPTKHDLYETYLPAFKMLVQQGKVESIMGAYQRLYGESCSGSKYLLTDILRKDWG 257
Query: 281 LNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSE 340
G++VSDC +V Y+ + EA A AI+AGL+L+CG + + A+++ L++E
Sbjct: 258 FKGHVVSDCGAVTDMYEGHKLVKSEAEAVAFAIKAGLNLECGNSM-RTMKDALKQKLITE 316
Query: 341 IDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
D++ AL+ + +++LG+ + + PY + + D++ +A AA + +VLLKN
Sbjct: 317 KDLDKALLPLMMTRLKLGILQPDVAC-PYNEFPESVIGSIDNRNIAQRAAEESMVLLKND 375
Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI-GRYARTI---HQQG 456
G LP++ RT+ V GP + ++GNY G++ Y+T L+GI G+ + ++QG
Sbjct: 376 G-VLPIAK-DIRTLFVTGPGATDAYYLMGNYFGLSDRYSTYLEGIVGKVSNGTSVNYKQG 433
Query: 457 CKDVACADDQLFGAAIDASRQADATILVMGLDQSIE---------AEALDRAGLLLPGRQ 507
V + + ++ SR A+ +I++MG + E +E DR L LP Q
Sbjct: 434 FMQVFKNLNDV-NWSVSESRGAEVSIIIMGNSGNTEGEEGDAIASSERGDRVDLRLPEPQ 492
Query: 508 QELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIA-AIIWAGYPGQAGGTAIADILFG 566
+ + +VS ++VL G PIDV K +A A++ A YPGQ GG A+A++LFG
Sbjct: 493 MQYLREVSKDRTNKLVVVLTGGSPIDV---KEITELADAVVMAWYPGQEGGVALANLLFG 549
Query: 567 TSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYT 625
+N G+LP+T+ E LP + +M+ GRTY++ ++YPFG+G+SY
Sbjct: 550 DANFSGRLPVTF--PETTDKLPSFDDYSMK--------GRTYKYMTDNILYPFGYGLSY- 598
Query: 626 NFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGA 685
G+ NAT VT ++ V VD+ N G+
Sbjct: 599 ------------------GKVAYGNAT-------VTKLPTKHSSMTVSVDLSNDGNMPVD 633
Query: 686 HTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPL 744
+ V+ STP AG +P + LVAF++V + A I V + L V GT ++
Sbjct: 634 EVVQVYLSTPSAGVTSPIESLVAFKRVKIAPHATVTTDFEIPVER-LETVQEDGTSKLLK 692
Query: 745 GEHNIHIGG 753
GE+ + I G
Sbjct: 693 GEYRVMISG 701
>gi|443692971|gb|ELT94448.1| hypothetical protein CAPTEDRAFT_221920 [Capitella teleta]
Length = 757
Score = 353 bits (906), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 249/710 (35%), Positives = 363/710 (51%), Gaps = 91/710 (12%)
Query: 40 LPFCQVSLPIPQRVNDLIGRLSLQE---KVKLLISG--AAAVPRLGIKGYEWWSEALHGV 94
PF SL R +DL+ RL+L+E + + G A+ RLGIK Y W +E L G
Sbjct: 20 FPFQDPSLSWDDRADDLVARLTLEEIAPQTQASYGGQHTPAIERLGIKPYVWITECLAGQ 79
Query: 95 SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA------ 148
N AT++PQ I A+SF+ L + R +S E RA +N A
Sbjct: 80 VNTN-----------ATAYPQPIGMAASFSEELLFNVSRDISYEVRAHWNANRAVGKYST 128
Query: 149 --GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKH 206
GL+ +SP +NI R P WGR QET GEDP+LSG A S+VRGLQG D L+ A CKH
Sbjct: 129 KVGLSCFSPVINIMRHPLWGRNQETYGEDPLLSGTLAQSFVRGLQGDDPRYLRANAGCKH 188
Query: 207 FTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCA 266
F + V RF F+AKV+ +D TF F+MCV G S+MCSYN++NG+P CA
Sbjct: 189 FDVHGGPEDIPVSRFSFDAKVNMRDWRMTFLPQFKMCVDAGSY-SLMCSYNRINGIPACA 247
Query: 267 DPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLG 326
+ +L R EW +GYIVSD ++ + H+T++ AI+AG +L+ G
Sbjct: 248 NKQLLTDITRDEWGFHGYIVSDSGAISNIKEQHHYTNSTVATVVAAIKAGTNLELGGGSN 307
Query: 327 LHTES---AVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQ 383
++ A+++GLL+E +I + + L ++RLG FD E + Y +G + +P+H+
Sbjct: 308 MYYPKQLDAMKQGLLTEKEIRDNVRPLLYTRLRLGEFDPE-AMVDYNKIGVDVIQSPEHR 366
Query: 384 ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG-IACGYT--- 439
E A++AA G VLLKN LP+ ++ +A++GP ++ T + G Y+ + +T
Sbjct: 367 EQAVKAAYMGFVLLKNHNNLLPIKK-QYSKLAIVGPFTNATSELFGTYSSEVNLKFTSTI 425
Query: 440 ----TPLQGIGRYARTIHQQGCKDVACA----DDQLFGAAIDASRQADATILVMGLDQSI 491
+PL G R A GC + AC+ DD A AD I+ +G Q
Sbjct: 426 FEGLSPLGGSTRSA-----NGCTNSACSGYVRDDVETAVA-----GADLVIVALGSGQRF 475
Query: 492 EAEALDRAGLLLPGRQQELVSKVSMASKG-PTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
E+E DRA L L G Q +++ S G P ILVL++ GP+D+ +AK DP + AI+ G
Sbjct: 476 ESEGNDRAYLDLHGHQLDILKDAVFFSNGAPVILVLINAGPLDITWAKLDPGVTAILSCG 535
Query: 551 YPGQAGGTAIADILFGT---SNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTY 607
YP Q+ G A+ L + + P G+L TW P +T+ M+ GRTY
Sbjct: 536 YPAQSTGEALRRSLTMSEPQAAPAGRLQATW-PLNLDQVPKITDYTMQ--------GRTY 586
Query: 608 RFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNR 667
R+Y G +YPFG G+SYT+F +T SI+ ++ + VT
Sbjct: 587 RYYVGEPLYPFGFGLSYTSFSYT---------------RLSISPSVITQGDNVT------ 625
Query: 668 LTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW-APHKQLVAFEKVHVPAG 716
V+V +KN GS D + V+ + P + P L AF + + AG
Sbjct: 626 ----VEVCLKNTGSYDSDEVVQVYMSWPQTPFPLPKWTLAAFARPFISAG 671
>gi|390630430|ref|ZP_10258413.1| Beta-xylosidase B [Weissella confusa LBAE C39-2]
gi|390484359|emb|CCF30761.1| Beta-xylosidase B [Weissella confusa LBAE C39-2]
Length = 674
Score = 350 bits (898), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 241/706 (34%), Positives = 357/706 (50%), Gaps = 94/706 (13%)
Query: 79 LGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDE 138
+ I Y +W+EALHGV+ G AT FPQ I A++F+ L I V+ E
Sbjct: 1 MNIPEYNYWNEALHGVARAGV----------ATVFPQAIGLAATFDDHLINEIADVIGTE 50
Query: 139 ARAMYNGGTA--------GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGL 190
RA YN T GLT+WSPNVNIFRDPRWGRG ET GEDP L+ K+ ++++GL
Sbjct: 51 GRAKYNEFTKHDDRDIYKGLTFWSPNVNIFRDPRWGRGHETYGEDPFLTSKFGVAFIKGL 110
Query: 191 QGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVA 250
QG LK+AA+ KHF + G+ R F+A VS +D+ +T+ F+ V E V
Sbjct: 111 QGQ-AKYLKLAATAKHFAVH--SGPEGL-RHGFDAVVSDKDLYETYLPAFKAAVEEADVE 166
Query: 251 SVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAA 310
S+M +YN V+GVP +LK + +W G++VSD + ++ +T E
Sbjct: 167 SIMTAYNAVDGVPASVSEMLLKDILHDKWSFEGHVVSDYMAPEDVHENHKYTKDAAETMG 226
Query: 311 DAIRAGLDLDCGPF-LGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMF--DGEPSSQ 367
AI+AGL+L G LH A+ RGL++E +I NA+++ ++RLGMF D E +
Sbjct: 227 LAIKAGLNLVAGHIEQSLH--EALDRGLVTEEEITNAVISLYATRVRLGMFATDNEYDAI 284
Query: 368 PYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTM 427
PY + T H L+ AA + VLLKN G LPL +AV+GPN+ + +
Sbjct: 285 PY-----EANDTKAHNNLSEIAAEKSFVLLKNDG-VLPLRKETMEAIAVVGPNAHSEIAL 338
Query: 428 IGNYAGIACGYTTPLQGI----GRYART--------IHQQGCKDVACADDQLFGAAIDAS 475
+GNY G T L+GI G R + +A AD++ A I A+
Sbjct: 339 LGNYFGTPSRSYTILEGIQERLGDDVRVHYSIGSGLFQDHAAEPLAKADERESEAVI-AA 397
Query: 476 RQADATILVMGLDQSIEAE---------ALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
+D + V+GLD +IE E A D+ L LPGRQ++L+ ++ +A P +++L
Sbjct: 398 EHSDVVVAVLGLDSTIEGEEGDAGNSQGAGDKPNLSLPGRQRQLLERL-LAVGKPVVVLL 456
Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
SG + + +N P + AI+ YPG GG A+AD+LFG +P GKLP+T+Y + N
Sbjct: 457 ASGSSLQLDGLENHPNLRAIMQIWYPGARGGLAVADVLFGAVSPSGKLPVTFYKN--VDN 514
Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
LP E GRTYR+ +YPFG+G++Y+ +V L +
Sbjct: 515 LPAFE-------DYNMAGRTYRYMTDEALYPFGYGLTYS----------SVELSDLQVKS 557
Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLV 706
AT++ + + + VQV VK++GS+ F+ P A QL
Sbjct: 558 YEDTATVTATIQNTGNFDTDEV---VQVYVKDLGSE--------FAVPNA-------QLK 599
Query: 707 AFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
F++V++ GA+Q + ++ + V D G I I +G
Sbjct: 600 GFKRVYLGKGAKQTITFDLRPQDF-EVFDAQGRNFIDSDRFEISVG 644
>gi|427385932|ref|ZP_18882239.1| hypothetical protein HMPREF9447_03272 [Bacteroides oleiciplenus YIT
12058]
gi|425726971|gb|EKU89834.1| hypothetical protein HMPREF9447_03272 [Bacteroides oleiciplenus YIT
12058]
Length = 732
Score = 349 bits (896), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 249/788 (31%), Positives = 382/788 (48%), Gaps = 91/788 (11%)
Query: 7 FFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKV 66
+ ++ L++L S+G AA+ F + + RV DL+ RL+L++K
Sbjct: 5 YKYMNLLVLIFLSTGAAAQSI-------GIQNNPAFLNQEMSMEARVADLMSRLTLEQKA 57
Query: 67 KLLISGAAAVPRLGIK-GYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNA 125
+LL V G + W++ LHGV P T+FP I ++++
Sbjct: 58 QLLNHRGKTVVVDGFSIRADQWNQCLHGVKWTEP----------TTNFPTSIALGATWDT 107
Query: 126 TLWEAIGRVVSDEARAMYNGGT---------AGLTYWSPNVNIFRDPRWGRGQETPGEDP 176
L + V+SDEARA+YNG GL Y SP +NI R+P WGR E GEDP
Sbjct: 108 ELIHRVATVISDEARAIYNGWKQDPEFRGEHKGLIYRSPVINISRNPYWGRINEIFGEDP 167
Query: 177 VLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTF 236
+G+ +YV+GLQG D LK+A++ KH+ +++ VDR +A+V ++ + + +
Sbjct: 168 YHTGRMGVAYVKGLQGDDSHYLKLASTLKHYAVNNVE----VDRMKLSAQVPERMLYEYW 223
Query: 237 DVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYY 296
F+ C++EGK SVM SYN +NGVP + +L ++ +W G++VSD V
Sbjct: 224 LPHFKDCIVEGKAQSVMASYNAINGVPNNINKLLLTDILKNQWGHEGFVVSDLGGVKTMV 283
Query: 297 DTQHFTSTP-EEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQM 355
+ H EEA +I AG D + + A+++G L+E +N+AL L V+
Sbjct: 284 EGHHQRQISCEEAVGRSIMAGCDFSDAEYEK-YIPDALRKGYLTEERLNDALRRVLLVRF 342
Query: 356 RLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVA 415
RLG FD + S PY + P + +H+ L+LEAAR+ IVLLKN+ LP+ + VA
Sbjct: 343 RLGEFD-DFKSVPYSRISPDVIGCKEHRNLSLEAARKSIVLLKNEKKLLPIDRSIIKRVA 401
Query: 416 VIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGCKDVACA-------- 463
VIGP +D+ GNY G+ TPLQGI G ++ +G +
Sbjct: 402 VIGPYADLFNQ--GNYGGVPKDPVTPLQGIKNAVGNNVEVLYCKGAQITPVKVRKGQPIP 459
Query: 464 ----DDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASK 519
+ A++ +R +D L +G IE E DR L+LPG Q ELV V +K
Sbjct: 460 PRFDKEAEMKKAVEMARNSDVVFLFVGTTADIEVEGRDRKTLVLPGNQNELVKAVYEVNK 519
Query: 520 GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWY 579
++VLMS GP+ V K + I A++ A +PG GG AIAD+LFG NPGGKLP T Y
Sbjct: 520 -KVVVVLMSAGPVAVPEVKKN--IPAVLQAWWPGDEGGNAIADVLFGDYNPGGKLPYTMY 576
Query: 580 PQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
+ +P T+ + G TY + K ++ FGHG+SY+ F ++ + V
Sbjct: 577 ASD--EQVPSTD------EYDISKGFTYMYLKKKPLFAFGHGLSYSKFHYSDLQISSPVV 628
Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTL-LVFSTPPAGH 698
S+N T+S V + VKN+G + G + L A
Sbjct: 629 --------SVNDTVS-----------------VVLKVKNMGKRTGEEVVQLYVRDVKAKV 663
Query: 699 WAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRS-GTRRIPLGEHNIHIGGTKHS 757
P K+L F+++ + +Q + + + V K L+ D S G + G I +G
Sbjct: 664 VRPTKELRGFKRIALQPNEEQEIRLMLPV-KSLAFYDESIGDFLVEPGSFEILLGSASDD 722
Query: 758 VSLHAATL 765
+ L + +
Sbjct: 723 IRLQSKLI 730
>gi|340369765|ref|XP_003383418.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
queenslandica]
Length = 748
Score = 349 bits (896), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 247/748 (33%), Positives = 366/748 (48%), Gaps = 76/748 (10%)
Query: 40 LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAA-------VPRLGIKGYEWWSEALH 92
PF SLPI +RV D++ +LSL + V+ + G A +P+ IK Y+W +E L
Sbjct: 27 FPFRDPSLPIEERVKDIVDQLSLDQLVEQMAHGGAGSNGPAPGIPKFNIKPYQWGTECLS 86
Query: 93 GVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA-----MYNGGT 147
G N G ATSFP I A+SFN L + + + E RA + NG
Sbjct: 87 GDVNAG----------DATSFPMSIGMAASFNYDLLKQVSNATAYEVRAKNTAAVLNGSY 136
Query: 148 A---GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASC 204
A GL+ WSP +NI RDPRWGR QET GEDP LSG ++V GLQG D + A C
Sbjct: 137 AFHTGLSCWSPVLNIMRDPRWGRNQETYGEDPYLSGYLGQAFVTGLQGDDPTYVIANAGC 196
Query: 205 KHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
KHF + + R F+A V+ D TF F+ CV G + S+MCSYN++NGVP
Sbjct: 197 KHFDVHGGPEDTPLPRASFDANVTMIDWRMTFLPQFKACVEAGAL-SLMCSYNRINGVPA 255
Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPF 324
CA+ +L +R EW GY+VSD ++ H+ AAADA AG L+ G
Sbjct: 256 CANKKLLTDILRNEWNFKGYVVSDQGALENIVTQHHYAPDFVTAAADAANAGTCLEDGNS 315
Query: 325 LGL------HTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVC 378
G + + AV++GL+S + +A+ V+ +LG FD ++ PY ++ +
Sbjct: 316 EGKGGNVFDNLDDAVEKGLVSVDTLKDAVSRLFYVRTKLGEFDPPDNNNPYANIPLSIIQ 375
Query: 379 TPDHQELALEAARQGIVLLKNQG---PSLPLSHIRHRTVAVIGPNSDVTVTMIGNYA-GI 434
+ +H +L+++AA + IVL+KN P LPL+ + V+GP + TM G+Y+ +
Sbjct: 376 SDEHIKLSIQAAMETIVLMKNDNDGSPFLPLAADDFKKACVVGPFIENADTMFGDYSPTM 435
Query: 435 ACGY-TTPLQGIGRYA----RTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQ 489
Y TPL GI ++ GC D + A D I+ GL +
Sbjct: 436 MTDYIVTPLAGIKTTQIGSDLLNYEDGCTDGPACEIYDGYKVRTACEGVDLVIVTAGLSR 495
Query: 490 SIEAEALDRAGLLLPGRQQELVSKVSMAS-KGPTILVLMSGGPIDVAFAKNDPRIAAIIW 548
+E E D + + LPG Q L++ AS P IL+L + P+D+++AK++PR AAI+
Sbjct: 496 YLEHEGHDISDIYLPGHQMSLLTDAESASGSAPIILLLFNANPLDISYAKSNPRFAAILE 555
Query: 549 AGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYR 608
A YPGQ G AIA++L G+ NP G+LP TW P M + M+ RTYR
Sbjct: 556 AYYPGQEAGVAIANVLTGSYNPAGRLPNTW-PASLDQVPDMIDYTMKE--------RTYR 606
Query: 609 FYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRL 668
++ +YPFG+G+S+T F ++ N + A +G+
Sbjct: 607 YFTQEPLYPFGYGLSFTTFNYSDLNVAST-------------ANTNGEG----------- 642
Query: 669 TLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVC 728
++ V V V N G+ DG + AP+ QLV + + G V I
Sbjct: 643 SIAVSVTVMNTGTMDGDEVTQAYVKWDNVAEAPNIQLVGVSRKFISKGQSITVSFTIKPE 702
Query: 729 KYLSVVD-RSGTRRIPLGEHNIHIGGTK 755
+ ++ G IP G +++ +GG +
Sbjct: 703 QLQVWINGDDGKWSIPGGTYSLFVGGQQ 730
>gi|348688508|gb|EGZ28322.1| family 3 glycoside hydrolase [Phytophthora sojae]
Length = 701
Score = 349 bits (895), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 235/670 (35%), Positives = 341/670 (50%), Gaps = 90/670 (13%)
Query: 10 LGLILLSAS--SSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVK 67
L L+LL AS ++ + A P AC + + L FC SLP+ RV DL+ RL L EK
Sbjct: 3 LPLLLLVASVVAASVDAEIPRACVGEH---QKLQFCNTSLPVSARVEDLLARLPLDEKAI 59
Query: 68 LLISGAAAVPR-----LGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASS 122
LL A A PR +G+ Y W + +HGV + G + P TSFP + +
Sbjct: 60 LLT--ARASPRGNMSSIGLPEYNWGANCVHGVR-----STCGTNCP--TSFPNPVNLS-- 108
Query: 123 FNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKY 182
++ RDPRWGR ETP EDP+++ KY
Sbjct: 109 ----------------------------------IHRRRDPRWGRNTETPSEDPLVNSKY 134
Query: 183 AASYVRGLQG---SDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVP 239
+Y +GLQ D L+ + KH+ AY +N+ G +R FNA VS D DT+
Sbjct: 135 GVAYTKGLQEGKHEDPRYLQAVVTLKHYVAYSYENYGGGNRKTFNAIVSPYDFADTYFPA 194
Query: 240 FRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQ 299
FR +++G VMCSYN VNGVP CA+ + + +RG +GYI SD ++ D
Sbjct: 195 FRSSIVDGNAKGVMCSYNSVNGVPACANNELENKLLRGMLGFDGYITSDSGAIEAISDWL 254
Query: 300 HFTSTPEEAAADAIRAGLDLDCGPFLGLHT--ESAVQRGLLSEIDINNALVNTLTVQMRL 357
H+ T EAA AI AG D++ G G + V+ L +++ L +TL ++ L
Sbjct: 255 HYVPTRCEAARLAILAGTDVNSGRGFGYMACLKELVESNQLDVKVVDDVLRHTLKLRFEL 314
Query: 358 GMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVI 417
G+FD QPY + P DV T ++L+L+ AR+ IVLL+N P LPL R +AV+
Sbjct: 315 GLFD-PIEDQPYWKVTPNDVNTDAAKKLSLDLARKSIVLLQNNQPVLPLR--RGVKLAVV 371
Query: 418 GPNSDVTVTMIGNYAGIAC--------GYTTPLQGIGRY---ARTIHQQGCKDVACADDQ 466
GP++ ++GNY G C TP + + + T + GC +V
Sbjct: 372 GPHAQAKRALLGNYLGQMCHGDYNEVGCIKTPFEAVSASNGDSSTTYALGC-NVTGNSTA 430
Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
F A+ A + A+A +L +G+D+S+EAE DR + LP Q +L+ +V K PT++VL
Sbjct: 431 GFVEAVKAVQGAEAVVLFLGIDKSVEAEVRDRNNIDLPAIQVQLLQRVRAVGK-PTVVVL 489
Query: 527 MSGGPIDVA--FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYI 584
M+GG + + D A++ A YPG G A+ DILFG +NPGGKLP+T Y +Y+
Sbjct: 490 MNGGVLTAEDIIGQTD----ALVEAFYPGFFGAQAMTDILFGDANPGGKLPVTMYRSDYV 545
Query: 585 TNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDG 644
+ M M + YPGR+YR++KG V+PFG G+SYT+F +A A +
Sbjct: 546 NTVDMKSMNV-----TAYPGRSYRYFKGEPVFPFGWGLSYTSFSLKADDATATTAKSVS- 599
Query: 645 RHGSINATIS 654
++N TIS
Sbjct: 600 --ATMNTTIS 607
>gi|363742357|ref|XP_003642627.1| PREDICTED: probable beta-D-xylosidase 5-like [Gallus gallus]
Length = 748
Score = 348 bits (894), Expect = 5e-93, Method: Compositional matrix adjust.
Identities = 243/728 (33%), Positives = 366/728 (50%), Gaps = 93/728 (12%)
Query: 35 ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAA-------AVPRLGIKGYEWW 87
A + PF +LP +R+ DL+GRL+ E V + G A +PRLGI Y W
Sbjct: 22 AEAQPFPFRDPTLPWHRRLEDLLGRLTPAEMVLQMARGGALGNGPAPPIPRLGIAPYNWN 81
Query: 88 SEALHGVSNVGPGTKFGGDFPG-ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGG 146
+E L G + PG AT+FPQ + A++F+ L + + E RA +N
Sbjct: 82 TECLRG----------DAEAPGWATAFPQALGLAAAFSPELVYRVANATATEVRAKHNSF 131
Query: 147 TA--------GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL 198
A GL+ +SP +NI R P WGR QET GEDP L+ + A S+V+GLQG +
Sbjct: 132 VAAGRYDDHTGLSCFSPVLNIMRHPLWGRNQETYGEDPYLTAELATSFVQGLQGQHPRYI 191
Query: 199 KVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQ 258
K +A CKHF+ + V R F+AKV ++D TF F+ CV G S MCSYN+
Sbjct: 192 KASAGCKHFSVHGGPENIPVSRLSFDAKVLERDWHTTFLPQFQACVRAGSY-SFMCSYNR 250
Query: 259 VNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLD 318
+NGVP CA+ +L +RGEW GY+VSD +V + +T T E A ++ AGL+
Sbjct: 251 INGVPACANKKLLTDILRGEWGFEGYVVSDEGAVELILLGHRYTHTFLETAIASVNAGLN 310
Query: 319 LDCGPFLG----LHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGP 374
L+ + +H A+ G ++ + + + ++RLG FD P+ PY L
Sbjct: 311 LELSYGMRNNVFMHIPKALAMGNITLEMLRDRVRPLFYTRLRLGEFD-PPAMNPYNALEL 369
Query: 375 KDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGI 434
V + +H+ L+LEAA + VLLKNQ +LPL + + +AV+GP +D + G+YA +
Sbjct: 370 SVVQSSEHRNLSLEAAIKSFVLLKNQRDTLPLRELHGKRLAVVGPFADNPRVLFGDYAPV 429
Query: 435 --ACGYTTPLQGIGRY-ARTIHQQGCKDVAC---ADDQLFGAAIDASRQADATILVMGLD 488
TP +G+ A GC++ C + D++ +A R AD ++ +G
Sbjct: 430 PEPQYIYTPRRGLQTLPANVSFAAGCREPRCWVYSRDEV----ENAVRGADVVLVCLGTG 485
Query: 489 QSIEAEALDRAGLLLPGRQQELVSKVSMASKG-PTILVLMSGGPIDVAFAKNDPRIAAII 547
+E EA DR L LPG Q +L+ A+ G P IL+L + GP+DV++A+ + AI+
Sbjct: 486 IDVEMEARDRKDLSLPGHQLQLLQDAVRAAAGHPVILLLFNAGPLDVSWAQLHDGVGAIL 545
Query: 548 WAGYPGQAGGTAIADILFGT--SNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGR 605
+P QA G AIA +L G ++P G+LP TW P + P ++ GR
Sbjct: 546 ACFFPAQATGLAIASVLLGKQGASPAGRLPATW---------PAGMHQVPPMENYTMEGR 596
Query: 606 TYRFY-KGPVVYPFGHGMSYTNFVHT-VANAPTVVAVPLDGRHGSINATISGKAIKVTHA 663
TYR+Y + +YPFG+G+SYT F + + +P V+ +
Sbjct: 597 TYRYYGQEAPLYPFGYGLSYTTFHYRDLVLSPPVLPI----------------------- 633
Query: 664 KCNRLTLGVQVDVKNVGSKDGAHTLLVF------STPPAGHWAPHKQLVAFEKVHVPAGA 717
C L+ V V ++N G +D + ++ S P P QLVAF +V VPAG
Sbjct: 634 -CANLS--VSVVLENTGPRDSEEVVQLYLRWEQPSVP-----VPRWQLVAFRRVAVPAGG 685
Query: 718 QQRVGINI 725
++ +
Sbjct: 686 ATKLSFGV 693
>gi|340368019|ref|XP_003382550.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
queenslandica]
Length = 742
Score = 348 bits (893), Expect = 7e-93, Method: Compositional matrix adjust.
Identities = 264/776 (34%), Positives = 378/776 (48%), Gaps = 92/776 (11%)
Query: 28 FAC--DPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAA-------VPR 78
FAC D PF SL I RV D++ L+L+E V+ + G A +PR
Sbjct: 10 FACCLAISDHVEALFPFQNTSLSIEDRVKDIVDNLTLEELVEQMAHGGATLNGPAPGIPR 69
Query: 79 LGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDE 138
L I Y+W +E L G NV G ATSFP I A+SFN L + + + E
Sbjct: 70 LHINPYQWGTECLSG--NVSAG--------DATSFPMPIGMAASFNYDLLKRVTNATAYE 119
Query: 139 ARAMY-----NGGTA---GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGL 190
RA + +G A GL+ WSP +NI RDPRWGR QET GEDP LSG +YV GL
Sbjct: 120 VRAKHAAAVKDGSYAFHTGLSCWSPVLNIMRDPRWGRNQETYGEDPYLSGYLGQAYVNGL 179
Query: 191 QGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVA 250
QG++ + A CKHF + RF F+AKVS +D TF F+ CV G +
Sbjct: 180 QGNNSRYIIANAGCKHFDVHGGPENIPTSRFSFDAKVSMRDWRMTFLPQFKACVEAGAL- 238
Query: 251 SVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAA 310
S+MCSYN++NGVP CA+ +L +R EW GY+VSD ++ H+ +AAA
Sbjct: 239 SLMCSYNRINGVPACANKALLTDILRNEWDFKGYVVSDQGALEFIVIEHHYAPDFMKAAA 298
Query: 311 DAIRAGLDLDCG----PFLGL--HTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP 364
DA AG L+ G F + H AV+ L+S + NA+ V+M+LG FD P
Sbjct: 299 DAANAGTCLEDGNIGRKFFNVFEHLVDAVKNNLVSVDTLKNAVSRLFYVRMKLGEFD-PP 357
Query: 365 SSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKN----QGPSLPLSHIRHRTVAVIGPN 420
+ PY ++ + + H L+L+AA + IVL+KN + P LP+++ + ++GP
Sbjct: 358 DNNPYANIPLSVIQSDAHINLSLQAAMESIVLMKNDDGFRSPFLPITN-EVKKACMVGPF 416
Query: 421 SDVTVTMIGNYA-GIACGYT-TPLQGIGRY---ARTI-HQQGCKDVACADDQLFGAAIDA 474
SD + G+Y+ + Y T L G+ T+ + GC+D + A
Sbjct: 417 SDDPEVLFGDYSPTLMRDYVITSLAGLKNANIGTDTLNYAVGCEDGPACRNYDSAKVRSA 476
Query: 475 SRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPT-ILVLMSGGPID 533
+ I+ GL + +E+E D + + LPG Q +L+ ASK + IL+L + P+D
Sbjct: 477 CDGVELIIVTAGLSKHLESEGKDLSDINLPGHQLDLMQDAEAASKNASVILILFNASPLD 536
Query: 534 VAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEY-----ITNLP 588
+ +AK DPRI I+ A YPGQ G AIA++L G NP G+LP TW P ITN
Sbjct: 537 IRYAKTDPRIVGILEAYYPGQTAGKAIANVLTGEYNPSGRLPNTW-PASLDQVPGITNYT 595
Query: 589 MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGS 648
M E RTYR++ +YPFG+G+SYT F ++ N S
Sbjct: 596 MKE-------------RTYRYFTQEPLYPFGYGLSYTTFHYSNLNI-------------S 629
Query: 649 INATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAF 708
AT SG + + V V V N GS DG V+ +AP QLV
Sbjct: 630 STATASGAGM-----------IAVSVLVTNTGSMDGTEVTQVYVWCNIS-YAPKLQLVGV 677
Query: 709 EKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAAT 764
K + G V +I + L V IP+ +++ + G + + ++ A++
Sbjct: 678 NKDFISKGKTLEVSFSIKP-EQLQVWTDDDKWVIPVSTYDLSVSGQQPNQTVLASS 732
>gi|409385818|ref|ZP_11238358.1| Beta-glucosidase [Lactococcus raffinolactis 4877]
gi|399206850|emb|CCK19273.1| Beta-glucosidase [Lactococcus raffinolactis 4877]
Length = 695
Score = 348 bits (892), Expect = 9e-93, Method: Compositional matrix adjust.
Identities = 220/600 (36%), Positives = 330/600 (55%), Gaps = 63/600 (10%)
Query: 56 LIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQ 115
++ +++L EK+ + A+A+ RL I Y +W+E LHGV+ G AT FPQ
Sbjct: 15 IVSQMTLAEKISQIDFDASAIERLNIPHYNYWNEGLHGVARAGV----------ATVFPQ 64
Query: 116 VITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRDPRWGR 167
I A++F+ L + I V+S E RA YN T GLT+WSPN+N+FRDPRWGR
Sbjct: 65 AIGLAATFDTELVKHIAEVISIEGRAKYNAYTKHGDRDIYKGLTFWSPNINLFRDPRWGR 124
Query: 168 GQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
GQET GEDP L+ + ++++GLQG +G L++AA KHF + + DR +F+A V
Sbjct: 125 GQETYGEDPFLTAQIGVAFIKGLQG-EGKYLRLAACTKHFAVH---SGPEADRHYFDAVV 180
Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
+ +D+ + + F+ + E V S M +YN +NG P C + ++ +T+ G+W G++VS
Sbjct: 181 NPKDLNEFYLPQFKAAIEEADVESFMGAYNAINGQPACVNEELIAKTLLGKWGFEGHVVS 240
Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
D ++ ++ H+T T E A A++ G +L G E AV +GL++E +I ++
Sbjct: 241 DYAALEDVHENHHYTQTAAETMALAMKIGTNLCAGKISDALFE-AVGKGLVTETEITASV 299
Query: 348 VNTLTVQMRLGMF--DGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLP 405
V T +RLGMF D + + PY + + +H+ L+L+AA + +VLLKN LP
Sbjct: 300 VKLYTTHVRLGMFAEDNDYDTIPY-----EVNASAEHEMLSLKAAEKSMVLLKNDN-FLP 353
Query: 406 LSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGR----YARTIHQQGCKDVA 461
LS ++VAVIGP + + GNYAG A Y T + GI + AR + GC A
Sbjct: 354 LSQSEIKSVAVIGPTARNIGALEGNYAGTANHYETFVSGIQQALSNQARVTYALGCHLYA 413
Query: 462 -CADDQLFGA------AIDASRQADATILVMGLDQSIEAE---------ALDRAGLLLPG 505
A+ L A AI A+ AD +L +GLD +IE E + D+ L LPG
Sbjct: 414 DHAESSLSRANERESEAIIAAEHADIAVLCVGLDPTIEGEQGDAGNVYGSGDKPSLSLPG 473
Query: 506 RQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILF 565
+Q+ L+ KV K ILVL SG + + + + AII A YPG GGTA+A+IL
Sbjct: 474 QQKRLIEKVLETGK-TVILVLTSGSALSLEGLEKHTGVKAIIQAWYPGAHGGTALANILL 532
Query: 566 GTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSY 624
G +P GKLP+T+ LP ++ +M RTY+ + V+YPFG+G++Y
Sbjct: 533 GKVSPSGKLPVTFCKD--TQGLPDFSDYSMAE--------RTYQNTQLEVLYPFGYGLTY 582
>gi|449527525|ref|XP_004170761.1| PREDICTED: beta-D-xylosidase 1-like [Cucumis sativus]
Length = 241
Score = 346 bits (888), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 160/199 (80%), Positives = 179/199 (89%), Gaps = 1/199 (0%)
Query: 30 CDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSE 89
CD ++ TR + FCQ SL I +RV DLIGRL+L EK++LL++ A AVPRLGI+GYEWWSE
Sbjct: 34 CDKRNVGTRNMGFCQESLGIEERVKDLIGRLTLGEKIRLLVNNAIAVPRLGIRGYEWWSE 93
Query: 90 ALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG 149
ALHGVSNVGPGTKFGG FPGATSFPQVITTA+SFN +LW IGRVVSDEARAMYNGGTAG
Sbjct: 94 ALHGVSNVGPGTKFGGTFPGATSFPQVITTAASFNQSLWLLIGRVVSDEARAMYNGGTAG 153
Query: 150 LTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDG-DRLKVAASCKHFT 208
LTYWSPNVNIFRDPRWGRGQETPGEDP+L+ KYAA+YV+GLQG+DG RLKVAA CKH+T
Sbjct: 154 LTYWSPNVNIFRDPRWGRGQETPGEDPILAAKYAANYVQGLQGNDGKKRLKVAACCKHYT 213
Query: 209 AYDLDNWNGVDRFHFNAKV 227
AYDLDNWNGVDR+HFNAKV
Sbjct: 214 AYDLDNWNGVDRYHFNAKV 232
>gi|291240561|ref|XP_002740190.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 763
Score = 345 bits (886), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 252/717 (35%), Positives = 358/717 (49%), Gaps = 81/717 (11%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVP--RLGIKGYEWWSEALHGVSNVG 98
PF SL +RV+DL+ RL+L E V + + A P RLGIK Y W SE LHGV V
Sbjct: 27 PFQNTSLSWEERVDDLVSRLTLDEMVLQMARTSPAPPIDRLGIKPYVWNSECLHGV--VP 84
Query: 99 PGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT--------AGL 150
P AT+FPQ I A+SF+ L + + + E RA +N GL
Sbjct: 85 PDGL-------ATAFPQSIGLAASFSPDLLSDVAKAIGLEVRAKHNDYVQRGVYQEHTGL 137
Query: 151 TYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAY 210
+ +SP +NI R P WGR QET GEDP L G+ ++YVRGLQG + A CKHF +
Sbjct: 138 SCFSPVINIARHPLWGRNQETYGEDPFLIGELGSAYVRGLQGDHPRYVLANAGCKHFDVH 197
Query: 211 DLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNI 270
V RF F+AKV ++D + TF F CV G V SVMCSYN++N VP CA+ +
Sbjct: 198 GGPEDIPVSRFSFDAKVFERDWQMTFLPAFHECVKAG-VYSVMCSYNRINEVPACANTRL 256
Query: 271 LKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLD----CGPFLG 326
L +R EW +GY+VSD +V + H+T + + A A+ AG +LD G +
Sbjct: 257 LTDILRKEWGFDGYVVSDEGAVEFIMTSHHYTDSIVDTVASAVNAGCNLDLAFPVGDGMY 316
Query: 327 LHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELA 386
+ AV G + E + + +MRLG FD P PY +L V + +H+ELA
Sbjct: 317 IKIGDAVTAGKIKEKTVVERVKPLFYTRMRLGEFD-PPELNPYANLNLSVVQSEEHRELA 375
Query: 387 LEAARQ-----GIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACG--YT 439
++AA Q VLLK +G LPL + ++ +AVIGP +D + G+Y+
Sbjct: 376 VKAALQSFVLLNFVLLKREGRVLPLDTLVNK-LAVIGPFADNPSYLFGDYSPNPDKEFVV 434
Query: 440 TPLQGIGRYAR-TIHQQGCKDVACAD--DQLFGAAIDASRQADATILVMGLDQSIEAEAL 496
TP +G+ AR T GC C ++ AA+ AD ++ +G IEAE +
Sbjct: 435 TPCKGLSNAARDTRCTPGCLTAPCTTYFSEMVKAAVTG---ADLIVVCLGTGVKIEAEFV 491
Query: 497 DRAGLLLPGRQQELVSKVSMASKG-PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQA 555
DR+ L LPG+Q +L+ V + G P IL+L + GP+D+ +A +P I I+ +P QA
Sbjct: 492 DRSDLSLPGKQFQLLQDVVKYANGKPIILLLFNAGPLDIVWAVENPAIQVIVACFFPSQA 551
Query: 556 GGTAIADILF-------GTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYR 608
G A+ + G NPGG+LP+TW P+ PMT M GRTYR
Sbjct: 552 TGDALYRMFMNTHGVDTGNGNPGGRLPITW-PRSMNQVPPMTNYTME--------GRTYR 602
Query: 609 FYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRL 668
++ G ++PFG+G+SY +F ++ ++V P +TI CN +
Sbjct: 603 YFNGDPLFPFGYGLSYGSFSYS-----SLVIWP---------STIPA---------CNGV 639
Query: 669 TLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINI 725
V V V +G T + S A P QLVAF++ ++ V I
Sbjct: 640 K--VSVTVYKLGPGGDEVTQVYMSWNNASVVVPKLQLVAFKRFYLETNGVTEVHFTI 694
>gi|372209074|ref|ZP_09496876.1| glycoside hydrolase family protein [Flavobacteriaceae bacterium
S85]
Length = 727
Score = 343 bits (881), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 238/743 (32%), Positives = 370/743 (49%), Gaps = 84/743 (11%)
Query: 40 LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGP 99
L F I +R L+ +++L+EK+ L + A A+ RL + Y+WW+EALHGV+ G
Sbjct: 19 LSFLDTDKSIEERAEILVSQMTLKEKIAQLKNTAPAISRLKVPDYDWWNEALHGVARNGK 78
Query: 100 GTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY--------NGGTAGLT 151
AT FPQ I ++F+ L + +S EARA Y + AGLT
Sbjct: 79 ----------ATIFPQGIGIGATFDPDLALRVASAISTEARAKYTISQQMGNHSRYAGLT 128
Query: 152 YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYD 211
+W+PNVNIFRDPRWGRGQET GEDP L + ++V+GLQG D + LK AA KH+ +
Sbjct: 129 FWTPNVNIFRDPRWGRGQETFGEDPYLMTQMGVAFVKGLQGDDPNYLKSAACAKHYAVH- 187
Query: 212 LDNWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
+G + R FNA ++QD+ +T+ F V + V VM ++N V G P A+
Sbjct: 188 ----SGPESLRLEFNAVPTQQDLYETYLPAFEALVKDANVEGVMPAHNAVFGAPMAANKF 243
Query: 270 ILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHT 329
+L +R W +GY+V+DC ++ + + AAA A++AG +L+CG
Sbjct: 244 LLTDVLRDRWGFDGYVVTDCGAIKQIKVGHKYVDSEVAAAAVALKAGTNLNCGATYK-EL 302
Query: 330 ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEA 389
+ A+ +GL++E ++ + RLGMFD + S PY +GP+ + + +H ELA EA
Sbjct: 303 KKAIDQGLVTEELVHERTKQLFKTRFRLGMFDKDLSKNPYSKIGPELIHSKEHIELAREA 362
Query: 390 ARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA 449
A++ IV+LKN+ LPL + V GP ++ + ++G+Y G++ G T L GI
Sbjct: 363 AQKSIVMLKNKNNLLPLP-TDIKVPYVTGPFANSSDMLMGSYYGVSPGVVTILAGITDAV 421
Query: 450 RTIHQQGCKDVACADDQLFGA---AIDASRQADATILVMGLDQSIEAEAL---------D 497
+ A + A + + +D TI V+GL E E + D
Sbjct: 422 SLGTSLNYRSGALPFQKNINPKNWAPNVAGMSDVTICVVGLTADREGEGVDAIASNHKGD 481
Query: 498 RAGLLLPGRQQELVSKVSMASKG-PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAG 556
R L LP Q V +++ K P +LV+ SG P V+ + AI+ YPG+ G
Sbjct: 482 RLDLKLPENQINYVKQLAAKKKDKPLVLVIASGSP--VSLEGIEEHCDAILQIWYPGEQG 539
Query: 557 GTAIADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVV 615
G A+AD+LFG +P G LPMT +P+ + LP + +M+ GRTY++ +
Sbjct: 540 GNAVADVLFGKVSPTGHLPMT-FPKS-VAQLPDYKDYSMK--------GRTYKYMTEEPM 589
Query: 616 YPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKC-NRLTLGVQV 674
+PFG G++Y+ K + V AK + +L V V
Sbjct: 590 FPFGFGLTYSK--------------------------TEFKNLVVEDAKLRKKESLKVSV 623
Query: 675 DVKNVGSKDGAHTLLVFSTPPA---GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYL 731
+V NVG D + ++ +P + G P L AF++V + G Q+V IH + L
Sbjct: 624 EVTNVGDFDIDEIVQLYISPKSQKEGEGLPFTTLKAFKRVALKKGETQKVEFTIH-PESL 682
Query: 732 SVVDRSGTRRIPLGEHNIHIGGT 754
V++ G + G + + +G +
Sbjct: 683 KVINVKGQKVWRKGAYKVTVGNS 705
>gi|110740481|dbj|BAF02134.1| xylosidase [Arabidopsis thaliana]
Length = 284
Score = 343 bits (880), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 169/282 (59%), Positives = 209/282 (74%), Gaps = 12/282 (4%)
Query: 485 MGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIA 544
MGLDQSIEAE DR GLLLPG QQ+LV++V+ AS+GP ILVLMSGGPIDV FAKNDPR+A
Sbjct: 1 MGLDQSIEAETRDRTGLLLPGYQQDLVTRVAQASRGPVILVLMSGGPIDVTFAKNDPRVA 60
Query: 545 AIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPG 604
AIIWAGYPGQAGG AIA+I+FG +NPGGKLPMTWYPQ+Y+ +PMT MAMR S YPG
Sbjct: 61 AIIWAGYPGQAGGAAIANIIFGAANPGGKLPMTWYPQDYVAKVPMTVMAMR--ASGNYPG 118
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAP----TVVAVPLDGRHGSINATISGKAIKV 660
RTYRFYKGPVV+PFG G+SYT F H++A +P +V L+ + +N+ S +IKV
Sbjct: 119 RTYRFYKGPVVFPFGFGLSYTTFTHSLAKSPLAQLSVSLSNLNSANTILNS--SSHSIKV 176
Query: 661 THAKCNRL-TLGVQVDVKNVGSKDGAHTLLVFSTPPAG---HWAPHKQLVAFEKVHVPAG 716
+H CN + + V+V N G DG HT+ VF+ PP +KQL+AFEKVHV AG
Sbjct: 177 SHTNCNSFPKMPLHVEVSNTGEFDGTHTVFVFAEPPINGIKGLGVNKQLIAFEKVHVMAG 236
Query: 717 AQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSV 758
A+Q V +++ CK+L VVD G RRIP+GEH +HIG KH++
Sbjct: 237 AKQTVQVDVDACKHLGVVDEYGKRRIPMGEHKLHIGDLKHTI 278
>gi|326427096|gb|EGD72666.1| hypothetical protein PTSG_04397 [Salpingoeca sp. ATCC 50818]
Length = 614
Score = 340 bits (872), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 216/587 (36%), Positives = 317/587 (54%), Gaps = 53/587 (9%)
Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQ-GSDGDRLKVAASCKHFTAYDL 212
SPN+NI RDPRWGR QE P EDP+L+G++ Y GLQ G D KV + KH+ AY L
Sbjct: 11 SPNININRDPRWGRNQEVPSEDPLLNGEFGKLYTMGLQQGEDSRYTKVVVTLKHWDAYSL 70
Query: 213 DNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILK 272
++ +G R +F+AKVS + DT+ FR VMEG VMCSYN +NG PTC P +L
Sbjct: 71 EDSDGFTRHNFDAKVSNFALMDTYWPAFRKAVMEGNAKGVMCSYNALNGRPTCTHP-LLT 129
Query: 273 RTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAG-LDLDCGPFLGLHTES 331
+ +R W+ +GY+ SD ++ Y H+T+ A A A+R G D+D G
Sbjct: 130 KVLRDIWKFDGYVTSDTGAIEDIYAKHHYTANASAAVAAALRDGRCDMDSGAVYHDALLD 189
Query: 332 AVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAAR 391
AV G S D++ AL NTL ++ LG+FD QPY + + T Q+L ++
Sbjct: 190 AVNSGECSMDDVDRALYNTLKLRFELGLFD-PIEDQPYWRINASSINTTYAQDLNMKITL 248
Query: 392 QGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIAC------GYTTPL--- 442
+ ++LL+N +LP + R VAVIGP+ + ++GNY G C T+PL
Sbjct: 249 ESMILLQNHNNALPFK--KGRKVAVIGPHINAQEALVGNYLGQLCPDDSFDCITSPLAAI 306
Query: 443 QGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLL 502
+ I + T+ G +AC D + A++ ++ AD +L++G++ +IEAE+ DR +
Sbjct: 307 EAINGMSNTVSAMGSGVLACTDASI-QEAVNVAKDADYVVLLIGINDTIEAESNDRTSID 365
Query: 503 LPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIAD 562
LP Q +L + ++ +K T VL++GG + A + ++ AII AGYPG GG AIA
Sbjct: 366 LPQCQHKLTAAIAHLNK-TTAAVLINGGML--AIEQEKKQLPAIIEAGYPGFYGGAAIAK 422
Query: 563 ILFGTSNP-GGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHG 621
+FG +N GGKLP T YP +YI + M++M M S PGR+YR+Y G ++PFG G
Sbjct: 423 TIFGDNNHLGGKLPYTVYPADYIHKINMSDMEMTNS-----PGRSYRYYTGQPLWPFGFG 477
Query: 622 MSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGS 681
++YT F +V G S AT S + + V V N G
Sbjct: 478 LAYTTF-----------SVQSPGPSASTFATGSNT------------SFSLPVHVVNTGK 514
Query: 682 KDGAHTLLVFSTP---PAGHWAPHKQLVAFEKVHVPAGAQQRVGINI 725
+ G + V+ P P ++ KQL+AFE+VH+ QR+G+ I
Sbjct: 515 RTGDTVVQVYMAPVSLPHRSFSLKKQLIAFERVHLT--PNQRLGVTI 559
>gi|372209036|ref|ZP_09496838.1| glycoside hydrolase [Flavobacteriaceae bacterium S85]
Length = 859
Score = 340 bits (871), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 248/782 (31%), Positives = 386/782 (49%), Gaps = 93/782 (11%)
Query: 6 AFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEK 65
AF F + L S SS + P A I RVNDL+ ++L+EK
Sbjct: 7 AFLFAIVFLFSFSSIAQTWKNPNAS------------------IEDRVNDLLANMTLEEK 48
Query: 66 VKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNA 125
+ S + RLGI +EW+ EALHG+ + T FPQ I +++N
Sbjct: 49 ISYCGSRIPEIKRLGIPYFEWYGEALHGIISWN-----------CTQFPQNIAMGATWNP 97
Query: 126 TLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAAS 185
L + +S+EARA+ N G + +SP VN+ RDPRWGR E EDP L + A
Sbjct: 98 DLMFDVATAISNEARALKNAGKKEVMMFSPTVNMARDPRWGRNGECYAEDPHLMSEMARM 157
Query: 186 YVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVM 245
YVRG+QG+D +K + KH+ A +++ R ++ + K+D+ + + ++ C++
Sbjct: 158 YVRGMQGNDPKYVKTVTTVKHYVANNVE----TKREWIHSNIGKKDLYEYYFPAYKTCIV 213
Query: 246 EGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP 305
+ + +M + N +NG+P A ++ +R EW GY+++D +V + S+
Sbjct: 214 DEEATGIMTALNGLNGIPCSAHDWLVNGVLRNEWGFKGYVIADWAAVQGLEKRMKYASSQ 273
Query: 306 EEAAADAIRAGLDLDC-------GPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLG 358
+AAA AI+AG+D +C P + + A+Q+GL++E +++ + L ++ G
Sbjct: 274 AQAAAMAIKAGVDQECFRNKVRQAPMVQALPD-ALQQGLITEKELDVTVKRLLRLRFMTG 332
Query: 359 MFDGEPSSQPYGHLGPKDVCTPD-HQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVI 417
FD +PS PY + P V D H++LAL+AA Q IVLLKN LPL +++A+I
Sbjct: 333 DFD-DPSLNPYSAI-PTSVLECDAHKQLALKAAEQSIVLLKNDA-VLPLKK-DLKSIAMI 388
Query: 418 GPNSDVTVTMIGNYAGIACGYTTPLQGIGRY--ARTIHQQGCKDVACADD-QLFGAAIDA 474
GP +D +G Y+G +PL GI Y A+ QGC+ A DD Q A+
Sbjct: 389 GPFAD--RCWMGIYSGHPKSKVSPLDGIKAYTNAKVSFAQGCEVTAKEDDEQKIAEAVAL 446
Query: 475 SRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDV 534
+++++ ILV+G D++ E DR + LPG Q +L+ V +K ILVL+ GP V
Sbjct: 447 AKKSEQVILVVGNDETTSTENTDRKSIKLPGNQHQLIKAVQAVNKN-VILVLVPSGPTAV 505
Query: 535 AFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEY-ITNLPMTEMA 593
+ + + I I+ A GQ GTA+A +LFG NPGGKL TWY + + N +MA
Sbjct: 506 TWEQKN--IPGIVCAWPNGQEQGTALAKVLFGDVNPGGKLNATWYQSDKDLPNFHDYKMA 563
Query: 594 MRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATI 653
RTY ++KG +YPFG+G+SYTNF T+ V ++ + N +
Sbjct: 564 G--------GNRTYMYFKGKPLYPFGYGLSYTNF--------TISDVSINKKTLQANEYV 607
Query: 654 SGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHV 713
+ KA KV + VQV +++V SK+ P K L F+++ V
Sbjct: 608 TVKA-KVNNTGAVAGDEVVQVYIRDVKSKEK---------------TPLKALKGFQRISV 651
Query: 714 PAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG------TKHSVSLHAATLGV 767
AGA + V I I + + + GE I +G T ++ L TL
Sbjct: 652 AAGASKWVEIKIPYEAFSHYNTKKEALMVAKGEFEILVGNASDAIVTTKTIELKKGTLPK 711
Query: 768 IK 769
+K
Sbjct: 712 LK 713
>gi|405968899|gb|EKC33925.1| Putative beta-D-xylosidase 5 [Crassostrea gigas]
Length = 748
Score = 339 bits (870), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 245/737 (33%), Positives = 363/737 (49%), Gaps = 95/737 (12%)
Query: 28 FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAA--------AVPRL 79
FA P + PF VSL +RV+DL+GRL+L + V+ L G A A+ L
Sbjct: 14 FALTP--LASSNFPFQNVSLSWSERVDDLVGRLTLDQIVQQLARGGAGLNGGPAPAIENL 71
Query: 80 GIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEA 139
GI Y+W +E L G G ATSFPQ I A++F+ L + + + E
Sbjct: 72 GIGPYQWNTECLRGDVEAG----------NATSFPQAIGLAAAFSKDLIFNVSKAAATEV 121
Query: 140 RAMYNGGT--------AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQ 191
RA +N GL+ +SP VNI R P WGR QET GEDP LSG YA+ +V+GLQ
Sbjct: 122 RAKHNDFVKRGIFTDHTGLSCFSPVVNIMRHPLWGRNQETYGEDPYLSGTYASYFVQGLQ 181
Query: 192 GSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVAS 251
G ++ A CKHF A+ R F+AKVS +D+ TF F+ CV G S
Sbjct: 182 GDHDRYIQANAGCKHFDAHGGPEDIPESRMGFDAKVSMRDLRLTFLPAFQKCVQAGAY-S 240
Query: 252 VMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAAD 311
+MCSYN +NGVP C++ ++ +RGEW GY+VSD ++ H+ + E+AAA
Sbjct: 241 LMCSYNSINGVPACSNKLLMMDILRGEWNFTGYVVSDEGAIENQISFHHYYNNSEDAAAG 300
Query: 312 AIRAGLDLDCGPFLG----LHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQ 367
++ AG +L+ L + AV+ G L E + N + +MRLG FD P
Sbjct: 301 SVNAGCNLELSGNLTEPVFMKIGDAVKSGKLEESVVRNRVKPLFYTRMRLGEFD-PPEMN 359
Query: 368 PYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS-LPLSHI----RHRTVAVIGPNSD 422
PY + + + +H+ L+L AA + +VLLK PS H+ +AVIGP ++
Sbjct: 360 PYSSVNLSVIQSEEHRNLSLTAAAKSLVLLKR--PSKFSKRHLIGGFPSERMAVIGPMAN 417
Query: 423 VTVTMIGNYAGIA--CGYTTPLQGIGRYARTI-HQQGCKD----VACADDQLFGAAIDAS 475
T + G+Y+ TPL+G+ ++ + GC D + + D + A +
Sbjct: 418 NTDQIFGDYSPTTDPRFVKTPLKGLTELNFSMNYAAGCVDGTRCLNYSQDDVKTALVG-- 475
Query: 476 RQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
AD ++ +G + +E+E +DR ++LPG+Q +L+ V + L++ S GP+++
Sbjct: 476 --ADLVVVCLGTGKDLESENVDRKDMMLPGKQLQLLQDVVSMTNKAVYLLVFSAGPVNIT 533
Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILF---GTSNPGGKLPMTWYPQEYITNLP-MTE 591
+A+ R+ I+ YP Q+ G AI L G NP G+LP TWY Y +P MT+
Sbjct: 534 WAQESERVLIILQCFYPAQSAGDAITQALIMRDGRFNPAGRLPYTWY--RYTEQIPEMTD 591
Query: 592 MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINA 651
+M +TYR++ G +YPFG+G+SY+ FV + + +P
Sbjct: 592 YSM--------ARKTYRYFTGVPLYPFGYGLSYSTFVFS-----KLYFLP---------- 628
Query: 652 TISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEK 710
KV N VQV V N G DG L V+ P QLVAFE+
Sbjct: 629 -------KVNAGDPNV----VQVRVFNEGPFDGDEVLQVYIKWMSTKERMPRVQLVAFER 677
Query: 711 VHVPAGAQQRVGINIHV 727
V + +QQ V I+ +
Sbjct: 678 VFI--RSQQYVDISTAI 692
>gi|167524198|ref|XP_001746435.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163775197|gb|EDQ88822.1| predicted protein [Monosiga brevicollis MX1]
Length = 834
Score = 339 bits (869), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 245/721 (33%), Positives = 357/721 (49%), Gaps = 86/721 (11%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAA-------VPRLGIKGYEWWSEALHG 93
PF LP R++DL+GRL+L+EK++ L G AA V RLGI + W SE + G
Sbjct: 35 PFRNPDLPWAARLDDLVGRLTLEEKLQQLQHGGAAQMTPAPAVERLGIGPFVWGSECVTG 94
Query: 94 VSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA----- 148
+ GT G+ P T+FPQ + A++F+ L + ++ E RA N
Sbjct: 95 L-----GTD--GNDPHGTAFPQPLGMAATFDPALLKRAAGTIALELRAQRNFDRENGVVK 147
Query: 149 ---GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCK 205
GL+ WSP VNI R P WGR ET GE PVLS A S+V G+QG+ AA+CK
Sbjct: 148 FHHGLSCWSPVVNINRHPLWGRNDETFGECPVLSSFMARSFVEGIQGNHTRYYAAAAACK 207
Query: 206 HFTAYDLDNWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVP 263
H LD + G D R+ F+A VS+ D+ TF + F C G V MCSYN + GVP
Sbjct: 208 H-----LDVYGGPDNLRYVFDADVSQADLTGTFLMAFEECAAAG-VMGYMCSYNSIRGVP 261
Query: 264 TCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDL---D 320
CA+ + R +W GY+VSD +V ++ ++T+ A A+ AG D+ D
Sbjct: 262 ACANYRTMTFFAREQWGFEGYVVSDQGAVFRITESHNYTANQTLGAVAALNAGCDMEDSD 321
Query: 321 CGPFLGLHTES-AVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCT 379
+ + S A+ L I+ ++ V+MRLG FD P + P+ L V +
Sbjct: 322 DAQHVAYYNLSLALDLKLTDMATIDASVSRLFYVRMRLGEFD-PPENDPWRSLNMSIVSS 380
Query: 380 PDHQELALEAARQGIVLLKNQGPSLPLS-HIRHRTVAVIGPNSDVTVTMIGNYA------ 432
P H E+A + A IVLLKNQ +LPLS ++ + ++GP +D M+G Y+
Sbjct: 381 PAHVEMARDVATASIVLLKNQNETLPLSAAAKNASYCLLGPFADNADLMMGKYSPHGSTN 440
Query: 433 ---GIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQ--ADATILVMGL 487
G LQ + A + +GC C D L AA+ Q D +L +G
Sbjct: 441 VTVTYRAGLAAALQNASQTASFQYLEGCTGPFC--DGLDTAAVTTFIQQGCDTVLLAVGT 498
Query: 488 DQSIEAEALDRAGLLLPGRQQELVSKV--SMASKGPTILVLMSGGPIDVAFAKNDPRIAA 545
+E+E+LDR+ + PG Q LV V ++ +K +L++ + GP+D+A + D R+AA
Sbjct: 499 SYHVESESLDRSNMSFPGAQPTLVQTVLEALGTKQRLVLLVSTAGPVDLAALEQDTRVAA 558
Query: 546 IIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGR 605
I+ Y GQ GTA+ADIL G ++P G+LP +W P + P+ + M+ GR
Sbjct: 559 ILDLIYLGQTAGTALADILLGETSPSGRLPFSW-PNKVSDVPPIDDYTMQ--------GR 609
Query: 606 TYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKC 665
TYRF + V++PFG+G+SYT F + AP ++ V C
Sbjct: 610 TYRFAQADVLFPFGYGLSYTQFNLSHLAAPYILPV------------------------C 645
Query: 666 NRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINI 725
L L V+V N G GA L V+ P P +QL +V V A + + V ++I
Sbjct: 646 QALRLS--VNVTNTGRLSGAIPLQVYVEWPNAVGGPIRQLATTTRVFVDAASSKTVQLSI 703
Query: 726 H 726
Sbjct: 704 R 704
>gi|390340546|ref|XP_001186857.2| PREDICTED: probable beta-D-xylosidase 2-like [Strongylocentrotus
purpuratus]
Length = 623
Score = 337 bits (865), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 215/608 (35%), Positives = 323/608 (53%), Gaps = 54/608 (8%)
Query: 31 DPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAA-------AVPRLGIKG 83
D + A LPF SLP QR++DL+ RL + + L G A A+ RL I
Sbjct: 21 DDEFAQKSQLPFWNQSLPWDQRLDDLLSRLKVDDMTYQLARGGADPNGPAPAIGRLQIGK 80
Query: 84 YEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY 143
Y W +E L G + G AT+FPQ + +++F+ L + E RA Y
Sbjct: 81 YVWNTECLRGDAQAG----------NATAFPQALGLSAAFSRDLLFEVANATGYEVRAKY 130
Query: 144 N--------GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDG 195
N GL +SP +NI R P WGR QET GEDP L+G+ A S+V GLQG+
Sbjct: 131 NYYLQKGDFNNHQGLNCFSPVINIMRHPYWGRNQETYGEDPYLTGELAKSFVWGLQGNHP 190
Query: 196 DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCS 255
L A CKHF AY RF F+AKVS +D++ TF F+ C+ G SVMCS
Sbjct: 191 RYLLTNAGCKHFAAYSGPENYPSSRFSFDAKVSDKDLQVTFFPAFKECIKAGTY-SVMCS 249
Query: 256 YNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRA 315
YN VNG+P CA+ +L +R EW GY+VSD ++ + ++T++ + A +++A
Sbjct: 250 YNSVNGIPACANSYLLNDVLRTEWGFKGYVVSDQRALELEELAHNYTTSYLDTAIKSLKA 309
Query: 316 GLDLDCG---PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHL 372
G +LD G P + + AV+ G+L+ D+ +++ ++RLG FD P PY L
Sbjct: 310 GCNLDLGTTKPAVYDYLAEAVELGMLTAQDLRDSIAPLFYTRLRLGEFD-PPDHNPYVKL 368
Query: 373 GPKDVC-TPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY 431
V +P+HQE+AL+AA + VL+KN G +LP+ H T+AV+GP ++ + + G+Y
Sbjct: 369 NVDQVVESPEHQEIALKAALKSFVLVKNDGSTLPIEGTIH-TLAVVGPFANNSKLLFGDY 427
Query: 432 AGIACG--YTTPLQGIGRYA-RTIHQQGCKDVACA--DDQLFGAAIDASRQADATILVMG 486
A TT L+G+ A +T H GC C D Q ++A AD ++ +G
Sbjct: 428 APNPDPRFVTTVLEGLSPMATKTRHASGCPSPKCVTYDQQ---GVLNAVTGADVVVVCLG 484
Query: 487 LDQSIEAEALDRAGLLLPGRQQELVSKVSMASKG-PTILVLMSGGPIDVAFAKNDPRIAA 545
+E+E DR +LLPG+Q++L+ + + G P IL+L + GP+++ +A + P + A
Sbjct: 485 TGIELESEGNDRRDMLLPGKQEQLLQDAARYAAGKPVILLLFNAGPLNITWALSSPSVQA 544
Query: 546 IIWAGYPGQAGGTAIADILFGT---SNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRY 602
I+ +P QA G A+ ++F +NPGG+LP TW P T + P ++
Sbjct: 545 IVECFFPAQATGVALR-MMFQNAPGANPGGRLPSTW---------PATVAQIPPMENYSM 594
Query: 603 PGRTYRFY 610
GRTYR++
Sbjct: 595 DGRTYRYF 602
>gi|413925161|gb|AFW65093.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 323
Score = 335 bits (860), Expect = 4e-89, Method: Compositional matrix adjust.
Identities = 155/296 (52%), Positives = 209/296 (70%), Gaps = 3/296 (1%)
Query: 22 LAAREPFACDPKDA-TTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLG 80
+A PF+C P A + L FC V+L QR DL+ RL+ EK+ L A VPRLG
Sbjct: 25 VAGDPPFSCGPSSAEASEGLAFCDVTLAPAQRAADLVSRLTAAEKIAQLGDQAPGVPRLG 84
Query: 81 IKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEAR 140
+ GY+WW+EALHG++ G G F ATSFPQV+ TA++F+ LW IG+ + EAR
Sbjct: 85 VPGYKWWNEALHGLATSGKGLHFDAAVRAATSFPQVLLTAAAFDDDLWLRIGQAIGREAR 144
Query: 141 AMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRL 198
A++N G A GLT WSPNVNIFRDPRWGRGQETPGEDP ++ +YA ++VRG+QG S L
Sbjct: 145 ALFNVGQAEGLTIWSPNVNIFRDPRWGRGQETPGEDPAVASRYAVAFVRGIQGNSSSSLL 204
Query: 199 KVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQ 258
+ +A CKH TAYDL++WNGV R+ F A+V++QD+EDTF+ PFR CV+E K + VMC+Y
Sbjct: 205 QTSACCKHATAYDLEDWNGVARYSFVARVTEQDLEDTFNPPFRSCVVEAKASCVMCAYTA 264
Query: 259 VNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIR 314
+NGVP CA+ ++L T+RG+W L+GY+ SDCD+V + D Q + TPE+A A +++
Sbjct: 265 INGVPACANSDLLTGTVRGDWGLDGYVASDCDAVAIMRDAQRYAPTPEDAVAVSLK 320
>gi|361127339|gb|EHK99311.1| putative exo-1,4-beta-xylosidase bxlB [Glarea lozoyensis 74030]
Length = 569
Score = 333 bits (853), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 198/538 (36%), Positives = 283/538 (52%), Gaps = 50/538 (9%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C + P R L+ + EK++ +IS +A V RLG+ Y WWSEALHGV+ PG +
Sbjct: 65 CDTTAPPADRAAALVKAMQSSEKLQNIISKSAGVSRLGLPPYNWWSEALHGVAGA-PGIQ 123
Query: 103 FGGDFPG--ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIF 160
F P ATS P I A++F+ L E +G ++ EARA NG +G+ +W+PN+N F
Sbjct: 124 FSSSSPWNYATSLPMPILMAAAFDDDLIEKVGTLIGTEARAFGNGNHSGIDFWTPNINPF 183
Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDR 220
+DPRWGRG ETPGED + Y A+ +RGL+G+ R ++ A+CKH+ A DL++WNGV R
Sbjct: 184 KDPRWGRGSETPGEDTLRLKGYVAALLRGLEGNKAQR-RIIATCKHYAANDLESWNGVTR 242
Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
F+AK+S QD+ + + PF+ C + KV S MCSYN VNGVP CA+ +L+ +R W
Sbjct: 243 HDFDAKISMQDLAEYYLQPFQQCARDSKVGSFMCSYNSVNGVPACANKYLLQTILRDHWN 302
Query: 281 L---NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGL 337
N Y+ SDC++V H+ ST A A AG D C
Sbjct: 303 WTSENQYVTSDCEAVQDISLNHHYASTNAAGTALAFNAGTDSSC---------------- 346
Query: 338 LSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLL 397
G FDG S Y LG DV TP Q+LAL+A GIV+L
Sbjct: 347 ------------------EAGYFDG--SKALYSSLGWSDVNTPQAQQLALQATVDGIVML 386
Query: 398 KNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL---QGIGRYARTIHQ 454
KN G +LPL VA+IG + + + G Y+G A TP+ Q +G + +
Sbjct: 387 KNDG-TLPLKLDSKSKVAMIGFWASDSSKLQGGYSGKAPYLRTPVYAAQQLG-FTPNVAT 444
Query: 455 QGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKV 514
+ A A D A+ A+ ++D + GLD S AE +DR L P Q L+ K+
Sbjct: 445 GPVQQSASATDNWTTNALAAASKSDYILYFGGLDTSAAAEGVDRTSLEWPSAQLALIKKL 504
Query: 515 SMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGG 572
S + G ++++ G +D + +++I+WA +PGQ GG A+ I+ G +P G
Sbjct: 505 S--ALGKPLIIIQEGDQMDNTPLLTNKGVSSILWASWPGQDGGPAVMQIISGAKSPAG 560
>gi|336275603|ref|XP_003352555.1| hypothetical protein SMAC_01389 [Sordaria macrospora k-hell]
gi|380094444|emb|CCC07823.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 833
Score = 333 bits (853), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 239/759 (31%), Positives = 356/759 (46%), Gaps = 113/759 (14%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C + P R L+ +L++ EK+ L+ + PRLG+ Y WWSE LHGV+ PG
Sbjct: 37 CDSTASAPDRAASLVEQLTIDEKLVNLVDQSKGAPRLGLPPYAWWSEGLHGVAG-SPGVV 95
Query: 103 F---GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNI 159
F G F ATSF VIT ++ + L +G +S EARA G GL YW+PN+N
Sbjct: 96 FNTSGYPFSYATSFANVITLGAALDDDLVYEVGTAISTEARAFAKFGFGGLDYWTPNINP 155
Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD 219
++DPRWGRG ETPGEDP+ Y + V GL+G +G KV A+CKHF AYDL+ W G+
Sbjct: 156 YKDPRWGRGAETPGEDPLRIKGYVKAMVAGLEG-NGTVRKVIATCKHFAAYDLERWRGLT 214
Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMC------------------------- 254
R+ F+A VS QD+ + + PF+ C + +V S+MC
Sbjct: 215 RYDFDAVVSLQDLSEYYLPPFQQCARDSRVGSIMCRYVSFFLPPFPSFPRLVTRQSGNQV 274
Query: 255 -------SYNQVNGVPTCADPNILKRTIRGEWRL---NGYIVSDCDSVGVYY-DTQHFTS 303
SYN +NG P CA ++ +R W N YI SDC+++ + D +F+
Sbjct: 275 DIVDNFRSYNALNGTPACASTYLMTNILRDHWNWTNHNNYITSDCNAIQDFLPDNHNFSQ 334
Query: 304 TPEEAAADAIRAGLDLDCG-----PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLG 358
TP EAAA A AG D C P+ + A + LLSE I+ AL +R G
Sbjct: 335 TPAEAAAAAYIAGTDTVCEVSGWPPYTDV--VGAYNQSLLSESVIDTALRRLYEGLIRAG 392
Query: 359 MFD-GEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVI 417
D G P+S +PD + P + +TVA+I
Sbjct: 393 YLDHGRPASS-----------SPDKAPFS--------------SPDFLPLDLTGKTVALI 427
Query: 418 GPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACAD-DQLFGAAIDASR 476
G ++ T T+ G Y+G+ Y P+ + + + + V D D AA+ A+
Sbjct: 428 GHWANATRTIRGPYSGLPPFYHNPMYAVRQLKLSFYYANGPVVNSTDADTWTAAAMLAAE 487
Query: 477 QADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAF 536
AD + G D ++ +E LDR + P Q L+ K +A G ++V+ G +D
Sbjct: 488 SADVVLYFGGTDTTVASEDLDRESIAWPKTQLTLIEK--LAQVGKPMVVIQLGDQVDDTP 545
Query: 537 AKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRP 596
N+ I++I+W GYPGQ+GGTA+ D+L G G+LP+T YP Y+ +P+TEM +RP
Sbjct: 546 LLNNKNISSILWVGYPGQSGGTAVFDVLTGKKASAGRLPVTQYPAGYVDEVPLTEMGLRP 605
Query: 597 --------------------------------SQSKRYPGRTYRFYKGPVVYPFGHGMSY 624
+++ PGRTY++Y PV+ PFG+G+ Y
Sbjct: 606 FNHSSSTTSSDVSQSGVEEGNGLTIQTRSTRGNKTLSSPGRTYKWYPRPVL-PFGYGLHY 664
Query: 625 TNF-VHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKD 683
T F + + + + D SI + ++ + H + V + N GS
Sbjct: 665 TPFNISLSLSTSSNASSTTDNTSISIRSLLTSQTCTAIHLDLCPFS-PFSVSITNTGSHT 723
Query: 684 GAHTLLVFSTPPAG-HWAPHKQLVAFEKVHVPAGAQQRV 721
+ L+F + G P K LV +++V + RV
Sbjct: 724 SDYVALLFLSGKFGPKPDPLKTLVGYKRVKDIKPGETRV 762
>gi|397642422|gb|EJK75223.1| hypothetical protein THAOC_03061, partial [Thalassiosira oceanica]
Length = 534
Score = 332 bits (852), Expect = 4e-88, Method: Compositional matrix adjust.
Identities = 213/565 (37%), Positives = 299/565 (52%), Gaps = 89/565 (15%)
Query: 192 GSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCV------ 244
G G R ++AA+CKH AY L+ DRF+F+A + + D E T+ F CV
Sbjct: 2 GGAGLRPRIAATCKHLAAYSLE----TDRFNFSADGIDRTDWEGTYLPAFDACVHAERFL 57
Query: 245 MEGKVAS-------------VMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDS 291
+E AS VMCSYN ++GVP CADP +LK +R +W G +VSDC +
Sbjct: 58 LEHYNASGGGGGGQDRGALGVMCSYNAIDGVPACADPALLKDMLRRDWNFTGLVVSDCWA 117
Query: 292 VGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTL 351
V + F ++ EEA A+R+G+DLDCG A LL E DI+ AL
Sbjct: 118 VDNIHSNHRFVASYEEAVGLALRSGVDLDCGNTFQDFGRLAYDESLLDEDDIDEALSRLF 177
Query: 352 TVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKN------QGPSLP 405
V M LG FD + +P ++ +H +LALEAA Q IVLLKN GP LP
Sbjct: 178 RVLMDLGYFD--ETDEPDAKSSDDEM---EHDQLALEAALQSIVLLKNGINEDEPGP-LP 231
Query: 406 LSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADD 465
LS +H+ +A+ GP +D ++GNY G+ TPL G+ + V A
Sbjct: 232 LSLAKHKEIALFGPLADNQTVLLGNYHGLPSTIVTPLMGLAKMG----------VEVAFR 281
Query: 466 QLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKG---PT 522
Q A++ ATILV+GLDQS+EAE DR LLLP Q++L+ +S SK P
Sbjct: 282 Q--RASVCDFHGESATILVVGLDQSLEAEDQDRTTLLLPVEQRDLIKTISRCSKVRDLPV 339
Query: 523 ILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQE 582
+LV++SGG +D++ KN I A+I YPGQ GG+A+A +L+G NP GKL T YP+
Sbjct: 340 VLVVVSGGMVDLSRYKNSSDIDAMIHMSYPGQNGGSALAQVLYGAYNPSGKLVGTMYPES 399
Query: 583 YITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPL 642
Y+ + + +M MRP ++PGRT+R+Y+G V+YPFG+G+SYT+F + +
Sbjct: 400 YLNEVSLHDMRMRP--DGKFPGRTHRYYRGDVIYPFGYGLSYTSFRYAM----------- 446
Query: 643 DGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGH-WA 700
G +KVT V N GS DG+ +L+F S P AG+
Sbjct: 447 ---------EFLGGTVKVT--------------VSNSGSMDGSVAVLLFHSAPQAGNEQE 483
Query: 701 PHKQLVAFEKVHVPAGAQQRVGINI 725
P + L+ FEK++V G Q V ++
Sbjct: 484 PFRSLIGFEKIYVSVGDSQLVSFDV 508
>gi|198425898|ref|XP_002119549.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 754
Score = 331 bits (849), Expect = 8e-88, Method: Compositional matrix adjust.
Identities = 248/755 (32%), Positives = 380/755 (50%), Gaps = 90/755 (11%)
Query: 36 TTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISG-------AAAVPRLGIKGYEWWS 88
T+ PF SLPI +R+ DL+ RL+++E + L G A A+ RLGI Y+W +
Sbjct: 21 TSEEFPFRNFSLPIEERLEDLVNRLTIEEVILQLSRGGVRDNGPAPAITRLGIGPYQWNT 80
Query: 89 EALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN---- 144
E L G + GD AT FPQ I A++F+ L + + V+ EARA +N
Sbjct: 81 ECLRGYA-------MNGD---ATCFPQPIGLAATFDQGLIYKMAKTVALEARAKHNNFTK 130
Query: 145 ----GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKV 200
G GL+ +SP +NI R P WGR QET GEDPVL+ A +YV GLQG D L
Sbjct: 131 NGNFGDHTGLSCFSPVINILRHPLWGRNQETYGEDPVLTSLMARAYVTGLQG-DEIYLPA 189
Query: 201 AASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVN 260
A CKHF AY RF F+A VS DI TF FR CV G VMCSYN +N
Sbjct: 190 TAVCKHFVAYGGPENIPTTRFSFSANVSDHDIGTTFYPAFRECVHAG-AQGVMCSYNAIN 248
Query: 261 GVPTCADPNILKRTIRGEWRLNGYIVSD---CDSVGVYYDTQHFTSTPEEAAADAIRAGL 317
GVP+CA+P +L+ T+R ++ +GY+VSD +++ +Y++ FT + E AA A+ AG+
Sbjct: 249 GVPSCANP-MLETTLRKKFHFDGYVVSDENALENIDLYFN---FTKSKLETAAVALNAGV 304
Query: 318 DLDCGPFLGLHTES----AVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLG 373
DL+ F + S AV++GL++E + + +M LG FD P + ++
Sbjct: 305 DLELTGFGKTNRYSLLNQAVEQGLVTEAALRRSAKRLFRTRMALGEFD-PPEFNHWLNVP 363
Query: 374 PKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY-A 432
V + H++ A+E A + VLLKN G LPL + + V+++GP + + + G+Y A
Sbjct: 364 IDVVQSLAHRKQAVEVAAKSFVLLKNDG-ILPLKQLYDK-VSIVGPFINNSEALTGDYPA 421
Query: 433 GIACGY-TTPL--------QGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATIL 483
Y ++PL G+ R+ +++ + +D ++
Sbjct: 422 EFNLKYFSSPLFAANSLSSSGVARFTTGCVGTNNQNLPICATYNSTNVKEVVTGSDIVLV 481
Query: 484 VMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRI 543
+G + +EAE+ DR + LPG+Q +L+ V + GP I+VL + GP+DV++ +
Sbjct: 482 TLGTGRGVEAESNDRRDINLPGKQLQLIQDVVKYANGPVIVVLFNAGPLDVSWVMGN--T 539
Query: 544 AAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYP 603
AA+I + Q G A+ ++L G NP G+LP TW P PMT+ +M
Sbjct: 540 AAVIACHFSAQMTGEAMLEVLTGVVNPAGRLPNTW-PASMEQVPPMTDYSMHE------- 591
Query: 604 GRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHA 663
RTYR+ ++PFG+G+SYT F + A V P TI I V
Sbjct: 592 -RTYRYSTSSPLFPFGYGLSYTKFWYLDA-----VVEP---------TTIQRCQIPV--- 633
Query: 664 KCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGH-WAPHKQLVAFEKVHVPAGAQQRVG 722
V+V ++N G DG + ++ T +QLVAF++V + AG + +
Sbjct: 634 --------VRVLIQNTGHLDGEEVVQIYMTSKKKRDRELLRQLVAFQRVPIKAGEEVSIS 685
Query: 723 INI-HVCKYLSVV-DRSGTRRIPLGEHNIHIGGTK 755
+ I H+ + + + +RS E +++GG +
Sbjct: 686 LPIPHLSRAIWMKHNRSFDYYFDADEFTLYVGGQQ 720
>gi|256394020|ref|YP_003115584.1| glycoside hydrolase family 3 [Catenulispora acidiphila DSM 44928]
gi|256360246|gb|ACU73743.1| glycoside hydrolase family 3 domain protein [Catenulispora
acidiphila DSM 44928]
Length = 1212
Score = 331 bits (848), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 253/758 (33%), Positives = 370/758 (48%), Gaps = 110/758 (14%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKV-KLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
+ Q +R DL+ R++L EKV +L + A A+PRLG++ Y +WSE HG++ +G
Sbjct: 45 YLQTRYSFAERAADLVSRMTLPEKVAQLHTNSAPAIPRLGVQSYTYWSEGQHGINLLGAD 104
Query: 101 TKFGGDF--PGATSFPQVITTASSFNATLWEAIGRVVSDEAR----------AMYNGGTA 148
+ GG P ATSFP +++ S++ L +SDE R A N G +
Sbjct: 105 SNNGGAAGGPHATSFPTNLSSTMSWDPALVYQETTAISDEVRGELDKSLWGVAQNNIGPS 164
Query: 149 G-----LTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD------R 197
LTYW+P VN+ RDPRWGR E GEDP L GK A ++V G QG D
Sbjct: 165 ADDYGSLTYWAPTVNMDRDPRWGRTDEAFGEDPYLVGKMAGAFVAGYQGETIDGTPTSPY 224
Query: 198 LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYN 257
LKVAA+ KHF L+N N DR +A S+ DI D + FR V + VA +M SYN
Sbjct: 225 LKVAATAKHFA---LNN-NENDRHADSADASESDIRDYYTAQFRSLVEDSHVAGLMTSYN 280
Query: 258 QVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVG------------------VYYDTQ 299
+NG P+ AD + W +GYI SDC +VG V T
Sbjct: 281 AINGTPSPADTYTTDALAQRTWGFDGYITSDCGAVGDVTASSSHDWAPPGWTVSVVNGTS 340
Query: 300 HFTST------PEEAA--ADAIRAGLDLDC--GPFLGLHTESAVQRGLLSEIDINNALVN 349
+T+T P +A A A+RAG D +C G + E+A++ +LSE I++ALV
Sbjct: 341 TWTNTATGVQVPADAGGQAYALRAGTDANCTGGDATLGNIEAAIKAEILSEGVIDHALVQ 400
Query: 350 TLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAARQGIVLLKNQ------GP 402
TV+M+ G FD P+++ Y + + +P+HQ LA + A +VLLKN
Sbjct: 401 LFTVRMQTGEFD--PANKVAYTRITKAQIQSPEHQALAEKVAANSLVLLKNDPMPGSAAK 458
Query: 403 SLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYAR--------TIHQ 454
LP + V V+G ++ TVT+ G Y+G +QGI + T
Sbjct: 459 VLPANPASLNNVVVVGDLAN-TVTL-GGYSGDPTLQVNAVQGITSAVKAANPNATVTFDA 516
Query: 455 QGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKV 514
G A A A A + AD ++ +G D S E+ DRA L +PG L+S+V
Sbjct: 517 CGTSTTATAAASCSAATQAAIKTADLVVVFVGTDGSTAGESNDRASLAMPGNYDSLISQV 576
Query: 515 SMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKL 574
+ T+L + + GP+D+ K D AI+++ Y G++ GTA+AD+LFG NP G L
Sbjct: 577 AALGNPRTVLSMQTDGPVDIENVKGD--FPAIVYSAYNGESQGTALADVLFGKQNPSGHL 634
Query: 575 PMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVAN 633
TWY + + LP + + P+ + GRTY+++ G YPFG+G+SYT+F ++
Sbjct: 635 DFTWYKDD--SQLPSIKNYGLNPADTGGL-GRTYQYFTGTPTYPFGYGLSYTDFAYSKVQ 691
Query: 634 APTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFST 693
A + +A GKA V+ DV N G GA ++ T
Sbjct: 692 A-------------TDHADAQGKAT-------------VRFDVTNTGKTPGATVAQLYIT 725
Query: 694 PPA--GHWAPHKQLVAFEKVHV-PAGAQQRVGINIHVC 728
PP+ G P +QL F K V G Q + +++++
Sbjct: 726 PPSVPGTQQPAEQLEGFAKTAVLKPGQTQHLSVSVNIA 763
>gi|323451996|gb|EGB07871.1| hypothetical protein AURANDRAFT_71699 [Aureococcus anophagefferens]
Length = 1202
Score = 329 bits (843), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 266/776 (34%), Positives = 378/776 (48%), Gaps = 102/776 (13%)
Query: 31 DPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEA 90
D + + P+C +LPI RV DL R ++ E + + + AAAVPRLG+ + EA
Sbjct: 332 DVESPSAAAYPYCDRALPIRARVADLAARFTVNETISQMGTMAAAVPRLGLPALNYGGEA 391
Query: 91 LHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN------ 144
LHGV + T G P T FP +SF+ LW A+G EARA++
Sbjct: 392 LHGVWS----TCAAGRCP--TQFPAPHAMGASFDRDLWRAVGAASGLEARALFRWNQRHN 445
Query: 145 --------GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD 196
G GLT+++PNVN+ RDPRWGR +E P EDP+L+G Y A +VRG QG DG
Sbjct: 446 ASDCARSLEGCLGLTFYAPNVNLARDPRWGRIEEVPSEDPLLNGVYGAEFVRGFQG-DGA 504
Query: 197 RLKVAASCKHFTAYDLD---------NWNGV-------DRFHFNAKVSKQDIEDTFDVPF 240
A KHF Y+L+ +W G DR F+A+VS +D E+T+ PF
Sbjct: 505 YRVANAVVKHFAVYNLEVDVEDTPPADWCGSAACAPPNDRHSFDARVSPRDFEETYVGPF 564
Query: 241 RMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQH 300
+ A+ MCSYN VNG P C D +L+ +RG G + +DC ++
Sbjct: 565 VA-PVAAGAAAAMCSYNAVNGEPACTDGALLRGALRGALNFTGVLATDCGALEDAVARHK 623
Query: 301 FTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMF 360
+T EAAA AI AG+D +CG L A+ GL+ + L L ++RLG+
Sbjct: 624 RYATEAEAAAAAIAAGVDSNCGKVLTSALPEALAAGLVRPDALRPPLERLLEARLRLGLL 683
Query: 361 DGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPN 420
D + P V +P H+ LAL AAR+G+VLL+N LPL T+AVIGPN
Sbjct: 684 DDWDADAPVPRPDVDAVDSPAHRALALRAAREGLVLLQNPNQILPLDG--RGTLAVIGPN 741
Query: 421 SDVTVTMIGNYAGIACG--YTTPLQGIG---RYARTIHQQGCKDVACADDQLFGAAIDAS 475
++ ++ ++ Y G +PLQ + R + ++ GC A L A+D +
Sbjct: 742 ANASMNLLSGYHGTPPPDLLRSPLQELEARWRGGKVVYAVGCNASGAATAAL-DEAVDLA 800
Query: 476 RQADATILVMGL------------DQSI----EAEALDRAGLLLPGRQQELVSKVSMASK 519
+ AD +L +GL D + EAE++DR L LPG Q+ L SK+ K
Sbjct: 801 KTADVVVLGLGLCGDNYGGGPPKEDATCFSIDEAESVDRTSLKLPGAQEALFSKIWALGK 860
Query: 520 GPTILV-LMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTW 578
+ V L+S G +D +FAK+ AA++ AGY G+ GG A+AD L G NPGG L T
Sbjct: 861 PVAVAVFLVSAGAVDASFAKDK---AALLLAGYGGEFGGVAVADALLGAYNPGGALTATM 917
Query: 579 YPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYP---FGHGMSYTNFVHTVANAP 635
P + P +MAMRPS + PGRTYRF V P FG G+SYT F ++A P
Sbjct: 918 LPDAGLP--PFRDMAMRPSAAS--PGRTYRFLDERRVAPLWRFGFGLSYTAFAVSLAG-P 972
Query: 636 TVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPP 695
T +V R + V V+NVG+ G + F
Sbjct: 973 T----------------------RVPRRAATRFS----VVVRNVGAVSGDVVVACFVAAV 1006
Query: 696 AGHWAPHKQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIH 750
AP ++L F +V + A +V + + + LS+VD +G R G +++
Sbjct: 1007 GRPDAPLRELFDFARVRDLAPAASTKVSMELRP-RSLSLVDEAGVRSTTAGAYDVR 1061
>gi|296081549|emb|CBI20072.3| unnamed protein product [Vitis vinifera]
Length = 333
Score = 328 bits (842), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 160/334 (47%), Positives = 219/334 (65%), Gaps = 3/334 (0%)
Query: 427 MIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMG 486
MIGNY G YTTPLQG+ T + GC +VAC Q+ A + ADAT+L++G
Sbjct: 1 MIGNYEGTPGKYTTPLQGLTALVATTYLPGCSNVACGTAQI-DEAKKIAAAADATVLIVG 59
Query: 487 LDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAI 546
+DQSIEAE DR + LPG+Q L+++V+ ASKG ILV+MSGG D++FAKND +I +I
Sbjct: 60 IDQSIEAEGRDRVNIQLPGQQPLLITEVAKASKGNVILVVMSGGGFDISFAKNDDKITSI 119
Query: 547 IWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRT 606
+W GYPG+AGG AIAD++FG NP G+LP TWYPQ Y+ +PMT M MRP + YPGRT
Sbjct: 120 LWVGYPGEAGGAAIADVIFGFYNPSGRLPTTWYPQSYVDKVPMTNMNMRPDPASGYPGRT 179
Query: 607 YRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCN 666
YRFY G +Y FG G+SYT F H + AP V++P++ H ++ K++ C
Sbjct: 180 YRFYTGETIYTFGDGLSYTQFNHHLIQAPKSVSIPIEEGHSCHSSKC--KSVDAVQESCQ 237
Query: 667 RLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIH 726
L + + V N G+ G+HT+ +FS+PP+ H +P K L+ FEKV V A A+ V +
Sbjct: 238 NLAFDIHLRVNNAGNISGSHTVFLFSSPPSVHNSPQKHLLGFEKVFVTAKAEALVRFKVD 297
Query: 727 VCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
VCK LS+VD GTR++ LG H +H+G KHS+++
Sbjct: 298 VCKDLSIVDELGTRKVALGLHVLHVGNLKHSLNV 331
>gi|374313710|ref|YP_005060140.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
gi|358755720|gb|AEU39110.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
Length = 883
Score = 327 bits (839), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 176/444 (39%), Positives = 254/444 (57%), Gaps = 31/444 (6%)
Query: 29 ACDPKDATTRT--LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEW 86
A P A TRT LP+ +LP QR DL+GRL+L EK L++ A +PRLG+ Y++
Sbjct: 22 AQSPSPAGTRTPLLPYQDTTLPAEQRAADLVGRLTLDEKAAQLVTSAPGIPRLGVPAYDF 81
Query: 87 WSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGG 146
WSE LHG++ G AT FPQ + A++F+ L IG V+S EARA YN
Sbjct: 82 WSEGLHGIARSG----------YATLFPQAVGMAATFDEPLLHQIGEVISTEARAKYNDA 131
Query: 147 TA--------GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL 198
A GLT WSPN+NIFRDPRWGRGQET GEDP L+ + ++V GLQG D +
Sbjct: 132 VAHDLRSIFYGLTIWSPNINIFRDPRWGRGQETYGEDPFLTARLGTAFVEGLQGDDPNYY 191
Query: 199 KVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQ 258
+ + KHF + + +R FNA S D+ DT+ FR ++EGK S+MC+YN
Sbjct: 192 RAIGTPKHFAVH---SGPESERHRFNADPSPHDLWDTYLPAFRATIVEGKAGSIMCAYNA 248
Query: 259 VNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYY--DTQHFTSTPEEAAADAIRAG 316
+ G P CA +L +R +W G++ SDC ++ ++ D H++ E+A+ D IRAG
Sbjct: 249 IEGKPACASDLLLDEVLRKDWAFKGFVTSDCGAIDNFFEKDGHHYSKDAEQASVDGIRAG 308
Query: 317 LDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKD 376
D +CG + SAV++G++ E +++ L + +LG+FD PS Y + +
Sbjct: 309 TDTNCGGTY-RNLASAVRKGMIQESELDVPLRRLFLARFKLGLFD-PPSQVKYASMPITE 366
Query: 377 VCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIAC 436
+ H ELAL+AAR+ +VLLKN+ +LPL R +T+AVIGPN+ +++ GNY I
Sbjct: 367 NMSSSHTELALQAAREAVVLLKNEHHTLPLD-ARVKTIAVIGPNASSLISLEGNYNAIPK 425
Query: 437 GYTTPLQGIGRY---ARTIHQQGC 457
+ GI R A+ ++ QG
Sbjct: 426 NPVMQVDGIAREFRDAKVLYAQGS 449
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 98/302 (32%), Positives = 152/302 (50%), Gaps = 50/302 (16%)
Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALD----------RAGLLLPGRQQELVSKVSM 516
L A++A +QADA + +GL +E E +D R L+LP QQ+L+ + +
Sbjct: 602 LRAQAMEAVKQADAVVAFVGLSPELEGEEMDVHIPGFSGGDRTDLVLPAAQQQLL-EAAK 660
Query: 517 ASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPM 576
AS P ++VL++G + V +A+ AI+ A YPGQAG AIA+ L G +NP G+LP+
Sbjct: 661 ASGKPLVVVLLNGSALAVNWAQE--HADAILEAWYPGQAGAQAIAETLSGKNNPSGRLPV 718
Query: 577 TWYPQEYITNL-PMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAP 635
T+Y + +L P T+ AM RTYR++KG +Y FG+G+SY+ F ++
Sbjct: 719 TFY--RSVNDLPPFTDYAM--------ANRTYRYFKGKPLYEFGYGLSYSTFSYS----- 763
Query: 636 TVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPP 695
NA +S + + TL V+ DVKN + G ++ TPP
Sbjct: 764 --------------NAHLSKERLDAGD------TLRVEADVKNTSTLAGDEVAELYLTPP 803
Query: 696 AGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTK 755
P + L FE VH+ G + V + + LS VD G R + G +++ +GG +
Sbjct: 804 QNGVYPLRSLEGFEHVHLLPGQSKHVSFTLD-PRQLSEVDEKGIRAVRAGVYSVTVGGGQ 862
Query: 756 HS 757
S
Sbjct: 863 PS 864
>gi|256393789|ref|YP_003115353.1| beta-glucosidase [Catenulispora acidiphila DSM 44928]
gi|256360015|gb|ACU73512.1| Beta-glucosidase [Catenulispora acidiphila DSM 44928]
Length = 1548
Score = 327 bits (838), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 239/768 (31%), Positives = 357/768 (46%), Gaps = 129/768 (16%)
Query: 52 RVNDLIGRLSLQEKVKLLISGAA-AVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPG- 109
R DL+ R++L EKV+ L + + A+PRLG++ Y +WSE HGV+ +G GG+
Sbjct: 74 RAADLVSRMTLPEKVEQLSTNSGPAIPRLGVQQYTYWSEGQHGVNTLGANQDNGGNGGAV 133
Query: 110 -ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG---------------LTYW 153
+TSFP + S++ +L +SDEAR + G LT+W
Sbjct: 134 HSTSFPTNFASTMSWDPSLIYQETTAISDEARGFLDKSLFGVNQNNLGPSAADYGSLTFW 193
Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG------SDGDRLKVAASCKHF 207
+P VN+ RDPRWGR E GEDP L+ A ++V G +G S LKVAA+ KH+
Sbjct: 194 APTVNMDRDPRWGRTDEAFGEDPYLTSTMAGAFVNGYEGNTPTGQSKTGTLKVAATAKHY 253
Query: 208 TAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCAD 267
D++ DR ++ VS D+ D + F + V+ +M SYN +NG P+ AD
Sbjct: 254 ALNDVEQ----DRTGISSNVSDTDLHDYYTKQFASLIENAHVSGLMTSYNAINGTPSVAD 309
Query: 268 PNILKRTIRGEWRLNGYIVSDCDSVGVYY--------------------DTQHFTST--- 304
+ + ++ NGY+ SDC ++G Y T +T+T
Sbjct: 310 TYTANQLAQRQFGFNGYVTSDCGAIGTAYQSFPSGHDWAPPGWTTDGKSSTGTWTNTATG 369
Query: 305 ---PEEAA--ADAIRAGLDLDC--GPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRL 357
P +A A A+RAG DL+C G +A+ G+LSE I+NALV TV++
Sbjct: 370 ATVPAQAGGQAYALRAGTDLNCAGGENTYAQITAAISAGVLSEGVIDNALVKIFTVRVET 429
Query: 358 GMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS-------------- 403
G FD S PY + + +P HQ LA + A +VLLKNQ P+
Sbjct: 430 GEFD-PAGSNPYTGITKAQIQSPAHQALATKVADNSLVLLKNQPPAASGTSTTPPAASSA 488
Query: 404 -----------LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYAR-- 450
LPLS + ++G ++ +GNY+ +P+QGI R
Sbjct: 489 ASSAAAAAKPLLPLSAAATAKIVIVGDMANAVT--LGNYSSDPALKVSPVQGITAAVRKA 546
Query: 451 ------TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLP 504
T G A A + ADA I+ +G +Q I E DR + +P
Sbjct: 547 NPGASVTFDACGTSTTASAAASCSAQTLADVAGADAVIVFVGTNQQIADEGKDRTSIAMP 606
Query: 505 GRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADIL 564
G L+S+V+ +L + SGGP+ + + D A+I+++G+ G++ GTA+AD+L
Sbjct: 607 GNYDSLISQVAAVGNPRMVLAVQSGGPVRIDDVQKD--FASIVFSGFNGESQGTALADVL 664
Query: 565 FGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMS 623
FG NP G L TWY + + LP M+ + P+Q+ GRTY ++ G YPFG+G+S
Sbjct: 665 FGAQNPDGHLDFTWYADD--SQLPAMSNYGLTPAQTGGL-GRTYMYFTGTPTYPFGYGLS 721
Query: 624 YTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKD 683
Y+ F + V +GR N + S V V VKN G
Sbjct: 722 YSTF--------SFSGVHAEGRSVDANGSQS-----------------VSVTVKNTGKTA 756
Query: 684 GAHTLLVFSTPP---AGHWAPHKQLVAFEKVHV-PAGAQQRVGINIHV 727
G+ +++ P AG P++QLV F K V G Q + I H+
Sbjct: 757 GSTVAQLYAQPKFTVAGQTFPNEQLVGFAKSKVLKPGESQHLTITAHI 804
>gi|308208211|gb|ADO20356.1| putative beta-D-xylosidase/alpha-L-arabinosidase [uncultured rumen
bacterium]
Length = 780
Score = 325 bits (833), Expect = 6e-86, Method: Compositional matrix adjust.
Identities = 258/796 (32%), Positives = 365/796 (45%), Gaps = 147/796 (18%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
P+ SLP +R DL+ RL+L+EK L + +A V LGIK Y WWSEALHGV+ G
Sbjct: 45 PYKDRSLPPEERAKDLVSRLTLEEKASLSMHPSAPVEALGIKAYNWWSEALHGVARNG-- 102
Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTY 152
AT FPQ I A+SF+ L + VSDEAR Y G G+T+
Sbjct: 103 --------AATVFPQPIGMAASFDEPLLYEVFTAVSDEARVKYKIAKESGHIGQYQGVTF 154
Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYD 211
W+PN+NIFRDPRWGRG ET GEDP L+G+ + VRGLQG SD LK A KH+ +
Sbjct: 155 WTPNINIFRDPRWGRGMETYGEDPYLTGQMGMAVVRGLQGPSDSPVLKAHACAKHYAVHS 214
Query: 212 LDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
WN R ++A+VS++D+ +T+ F+ V + V VM +YN+ G P A ++
Sbjct: 215 GPEWN---RHSYDAEVSERDLRETYLPAFKDLVTKANVQEVMTAYNRFRGEPCGASDYLI 271
Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPE--EAAADAIRAGLDLDCGPFLGLHT 329
+RGEW G I SDC +V +Y +P+ AAA A+ AG+D +CG H
Sbjct: 272 NTILRGEWGYKGLITSDCWAVEDFYVQGRHGYSPDVASAAAAAVHAGVDTECGQAY-RHI 330
Query: 330 ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEA 389
AV+RGLL E D++ L+ T + +LG D + L + P+H L+ +
Sbjct: 331 PEAVERGLLDEKDLDRNLIRLFTARYQLGEMD---DISLWDDLPASILEGPEHLALSRKM 387
Query: 390 ARQGIVLLKNQGPSLPLS-HIRHRTVAVIGPNSDVTVTMIGNYAGI-------------- 434
A++ +VLL+N+G LPL+ +R VA++GPN D GNY +
Sbjct: 388 AQESMVLLQNKGGILPLAPDVR---VALVGPNGDDREMQWGNYNPVPGRTVTLYDALKER 444
Query: 435 --------ACG---------------------------------YTTPLQGIGRYARTIH 453
CG Y +Q I Y R
Sbjct: 445 FPGIKYVRGCGIVGAEFAPKPDPNNPLSQALGKSREEMEAIARQYAIGVQDILNYVRRQE 504
Query: 454 QQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLL 503
+ + D Q + + D I G+ E E + DR + L
Sbjct: 505 RMQASFLPELDVQ---SVLKELEGIDVVIFAGGISPRFEGEEMPVNLPGFKGGDRTDIQL 561
Query: 504 PGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADI 563
P Q++L+ + A K ILV SG I AI+ A YPG+ GG AI D+
Sbjct: 562 PQVQRDLMKALHDAGKK-VILVNFSGCAI--GLVPETESCDAILQAWYPGEEGGLAITDV 618
Query: 564 LFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMS 623
LFG NP GKLP+T+Y + +LP E + G TYR++KG ++PFG+G+S
Sbjct: 619 LFGDVNPSGKLPVTFY--RSVEDLPDFE-------NYDMKGHTYRYFKGKPLFPFGYGLS 669
Query: 624 YTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKD 683
Y+ F + A R+ S+ I V + T VQV V+ G D
Sbjct: 670 YSTFRYKRAKV----------RNNSL-------IIPVKNTGKREATEVVQVYVRRKGDPD 712
Query: 684 GAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIP 743
G P K L AF +V +PAG +V I + +L + + +P
Sbjct: 713 G----------------PVKTLRAFRRVTIPAGKTVKVCIPLEDETFLWWSEEA-QDMVP 755
Query: 744 L-GEHNIHIGGTKHSV 758
L G++ + GG+ V
Sbjct: 756 LPGKYELLYGGSSDQV 771
>gi|443717728|gb|ELU08656.1| hypothetical protein CAPTEDRAFT_228276 [Capitella teleta]
Length = 731
Score = 325 bits (832), Expect = 8e-86, Method: Compositional matrix adjust.
Identities = 209/628 (33%), Positives = 332/628 (52%), Gaps = 57/628 (9%)
Query: 40 LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLIS--GAAAVP--RLGIKGYEWWSEALHGVS 95
PF V+L +RV+DL+ RL+++E V + ++ G + +P RLG+K Y++ +E + GV
Sbjct: 20 FPFEDVTLSWDKRVDDLVQRLTIEEVVNISVAQYGKSTIPVDRLGVKPYQFINECITGVR 79
Query: 96 NVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNG-------GTA 148
+ +T+FPQ I +SF+ L + + ++ E R YN G
Sbjct: 80 -----------WENSTAFPQAIGLGASFSPDLAFNMSQAIARELRGFYNTEVKSQIYGHR 128
Query: 149 GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFT 208
G+ ++P +NI R P WGR QET GEDP LSG+ + +V+GLQG ++ + CKHF
Sbjct: 129 GVNCFTPVINIMRHPLWGRNQETYGEDPWLSGQLSVGFVKGLQGDHPRYIQASGGCKHFD 188
Query: 209 AYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADP 268
++ V RF F+AKVS++D TF F+ CV G + ++MCSYN++NGVP CA+
Sbjct: 189 VHNGPENIPVSRFGFDAKVSERDWRMTFLPQFKTCVEAGSI-NIMCSYNRINGVPACANK 247
Query: 269 NILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLG-- 326
+L +R EW NGY++SD ++ +T T EAAAD+++AG +++ G
Sbjct: 248 KLLTDILRKEWGFNGYVISDSGAIENIVYHHKYTKTLAEAAADSVKAGCNVELTGATGSG 307
Query: 327 ---LHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQ 383
+ +AV++ L+SE ++ L + +MR G FD P+ + V + +HQ
Sbjct: 308 VAYFNLLNAVKQNLISEEELRENLKKPMYSRMRQGEFD-PVDMNPFTKIDMSVVLSQEHQ 366
Query: 384 ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYA--GIACGYTTP 441
+LA++A+ VL+KN LPL R +A+IGP +D T+ G+Y +TP
Sbjct: 367 DLAVKASAMSFVLMKNLNRVLPLKK-RFDRLAIIGPFADNAETLFGDYIPNWDPKFVSTP 425
Query: 442 LQGIGRYARTI-HQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAG 500
+G+ + + GC D +C + A A + A + +G+ ++E E DRA
Sbjct: 426 YEGLKSLGDDVRYASGCDDPSCTNYDP-KAIEKAVKGAQFVFVCLGVGSNLEREGHDRAD 484
Query: 501 LLLPGRQQELVSKVSMASK-GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTA 559
L LPG Q +++ S+ P +LVL + GP+D+ + K P + II YP G A
Sbjct: 485 LDLPGYQLQILKDAEFFSREAPLVLVLFNAGPVDLTWPKLSPEVDGIIECFYPAMGTGKA 544
Query: 560 IADILFGTSN---PGGKLPMTWYPQEY----ITNLPMTEMAMRPSQSKRYPGRTYRFYK- 611
+ ++ T + P +LP TW Q + IT+ MT G TYR++
Sbjct: 545 LYQVVTATGDDGVPAARLPSTWPAQLHQVPSITDYNMT-------------GHTYRYFDG 591
Query: 612 GPVVYPFGHGMSYTNF-VHTVANAPTVV 638
G +YPFG+G+SYT+F TV+ +PT V
Sbjct: 592 GDPLYPFGYGLSYTSFHYQTVSVSPTSV 619
>gi|116621778|ref|YP_823934.1| glycoside hydrolase family 3 protein [Candidatus Solibacter
usitatus Ellin6076]
gi|116224940|gb|ABJ83649.1| glycoside hydrolase, family 3 domain protein [Candidatus Solibacter
usitatus Ellin6076]
Length = 850
Score = 321 bits (823), Expect = 7e-85, Method: Compositional matrix adjust.
Identities = 177/419 (42%), Positives = 253/419 (60%), Gaps = 27/419 (6%)
Query: 36 TTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVS 95
TT LPF L +R DL+ R++L EKV + + A A+PRLGI Y+WW+EALHGV+
Sbjct: 20 TTSQLPFMDPDLSAERRAADLVARMTLDEKVLQMQNSAPAIPRLGIPAYDWWNEALHGVA 79
Query: 96 NVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA------- 148
G AT FPQ I A++++ATL I +S EARA YN
Sbjct: 80 RAGL----------ATVFPQAIGLAATWDATLMHRIAETISTEARAKYNEAIRNDDHSRY 129
Query: 149 -GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHF 207
GLT+WSPN+NIFRDPRWGRGQET GEDP L+ + A ++++G+QG D KV A+ KH
Sbjct: 130 RGLTFWSPNINIFRDPRWGRGQETYGEDPFLTSRMAVAFIKGMQGEDPHYYKVIATAKH- 188
Query: 208 TAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCAD 267
Y + + R F+ K S +D+ DT+ FR ++E + S+MC+YN+V+G+P CA
Sbjct: 189 --YAVHSGPESSRHQFDVKPSPRDLADTYLPAFRASIVEARADSLMCAYNRVDGIPACAS 246
Query: 268 PNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGL 327
++L++ +RGEW G++VSDC +V + H+ A+A A++AG DL CG
Sbjct: 247 TDLLEKRLRGEWGFQGFVVSDCGAVSDIFRGHHYQPDAASASAVAVKAGTDLTCGNEYRA 306
Query: 328 HTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELAL 387
+ AV+ GL++E +IN +L + +LGMFD P P+ ++ +V + H+++AL
Sbjct: 307 LVD-AVKTGLITEPEINRSLERLFVARFKLGMFD-PPERVPFSNIPYSEVDSAGHRKIAL 364
Query: 388 EAARQGIVLLKNQGPSLPL-SHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
EAAR+ IVLLKN G +LPL S I+ +AVIGP +D ++GNY G + TPL GI
Sbjct: 365 EAARKSIVLLKNDG-TLPLKSSIKK--IAVIGPAADDAEALLGNYNGFSSLQVTPLAGI 420
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 80/272 (29%), Positives = 127/272 (46%), Gaps = 53/272 (19%)
Query: 461 ACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQEL 510
A D L AAI+A AD T+ +GL+ S+E E + DR L LP Q++L
Sbjct: 587 APPDAPLLAAAIEAVSNADVTLAFVGLNPSLEGEEMPVSVPGFQGGDRTNLELPEPQEKL 646
Query: 511 VSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNP 570
+ + ++A+ P ++VL SG + + FA +A++ Y G+ GTAIAD L G +NP
Sbjct: 647 I-EAAIATGKPVVVVLASGSAVAMNFAAQ--HASALLETWYNGEETGTAIADTLAGINNP 703
Query: 571 GGKLPMTWYPQEYITNL-PMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVH 629
G+LP+T+Y + L P E AM+ GRTYR++ G +Y FG G+SY+ F +
Sbjct: 704 SGRLPVTFY--RSVDQLPPFEEYAMK--------GRTYRYFNGDALYSFGFGLSYSKFQY 753
Query: 630 TVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLL 689
+ A+K A + V V+N S +G +
Sbjct: 754 S--------------------------ALKTRRAGSGTI---VASRVRNASSIEGDEVVQ 784
Query: 690 VFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRV 721
++ P + L F+++H+ G + V
Sbjct: 785 LYVNGSGADGDPIRSLRGFQRIHLRPGESREV 816
>gi|397690575|ref|YP_006527829.1| glucan 1,4-beta-glucosidase [Melioribacter roseus P3M]
gi|395812067|gb|AFN74816.1| glucan 1,4-beta-glucosidase [Melioribacter roseus P3M]
Length = 860
Score = 320 bits (820), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 175/428 (40%), Positives = 253/428 (59%), Gaps = 29/428 (6%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
+ V+LP +R DL+ RLSL EK+ L++ + A+ RLGI Y WW+EALHGV+ G
Sbjct: 23 YLNVNLPFEERAEDLLQRLSLDEKISLMVHQSPAIERLGIPEYNWWNEALHGVARNGR-- 80
Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYW 153
AT FP I A++++ L I V+S+EARA YN G G++ W
Sbjct: 81 --------ATVFPMPIGLAATWDRDLIYRIADVISNEARAKYNSALKKNQRGIYQGISLW 132
Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLD 213
+PN+NIFRDPRWGRG ET GEDP L+G+ A S+++GLQG D LK A+ KH +
Sbjct: 133 APNINIFRDPRWGRGMETYGEDPYLTGELAVSFIKGLQGQDKKYLKTIATPKHLAVHSGP 192
Query: 214 NWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKR 273
+R HFNA VS D+ +T+ F+ +M+GK SVMC+YN++ G C +L
Sbjct: 193 E---PERHHFNALVSNYDLNETYLPHFKKSIMKGKAYSVMCAYNRLRGKACCGHDTLLTD 249
Query: 274 TIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESA 332
+R +W G +VSDC +V +++ +PE+AAA A+ +G DL+CG FL L ++A
Sbjct: 250 ILRNKWGFEGIVVSDCWAVYDIFNSHKIVDSPEKAAALAVSSGTDLECGNTFLSL--KNA 307
Query: 333 VQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQ 392
+ GL++E +I++AL L + +LGMFD P Y + + ++E+ALEAAR+
Sbjct: 308 YRDGLITEKEIDSALRRVLLARFKLGMFD-PPEIVSYSQIDESYLDNSYNREIALEAARK 366
Query: 393 GIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYART- 451
IVLLKN LPL ++ +AVIGPN+D +++GNY G Y TPLQ I R +
Sbjct: 367 SIVLLKNDNKLLPLDSSINK-IAVIGPNADNLESLLGNYHGFPSEYITPLQAIRRVLKNG 425
Query: 452 --IHQQGC 457
+++GC
Sbjct: 426 EVFYEKGC 433
Score = 130 bits (327), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 98/292 (33%), Positives = 140/292 (47%), Gaps = 49/292 (16%)
Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
A + ++DA I+ MGL +E EAL DR L LP Q +L+ K+ K
Sbjct: 591 AYKTALKSDAVIMFMGLCPRMEGEALKIKLDGFKGGDRLKLSLPANQLKLIKKIHSTGK- 649
Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
P ILVL++GGPI + + I AI+ A YPGQAGG AI D+++G NP GKLP+T Y
Sbjct: 650 PVILVLLNGGPISTVWESEN--IPAILEAWYPGQAGGRAITDVIWGKYNPSGKLPVTIYK 707
Query: 581 QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
E +LP P ++ GRTYR++KG V+YPFG G++YT+
Sbjct: 708 SE--NDLP-------PFENYDMEGRTYRYFKGEVLYPFGWGLNYTDI------------- 745
Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA 700
TIS + K N T+ V V +KN G+ G T+ +++ +
Sbjct: 746 -----------TISNIELSANEIKDND-TIRVVVKLKNNGNLAGEETVQLYTKALKDN-R 792
Query: 701 PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
K L FEK+ + G + V + VD G +P G + I +G
Sbjct: 793 TIKTLRGFEKIKLEPGTEGMVEFYLSKSDLAVWVDGLGFETMP-GVYEIIVG 843
>gi|296081550|emb|CBI20073.3| unnamed protein product [Vitis vinifera]
Length = 309
Score = 320 bits (820), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 158/248 (63%), Positives = 184/248 (74%), Gaps = 18/248 (7%)
Query: 133 RVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG 192
+VVS EARAMYN G AGLT+WSPNVNIF+DPRWGRGQETPGEDP+LS KYA+ YVRGLQ
Sbjct: 77 QVVSTEARAMYNVGLAGLTFWSPNVNIFQDPRWGRGQETPGEDPLLSSKYASGYVRGLQQ 136
Query: 193 SDG---DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKV 249
SD DRLKVAA CKH+TAYDLDNW GVD FHFNA V+ QD++DTF PF+ CV++G V
Sbjct: 137 SDDGSPDRLKVAACCKHYTAYDLDNWKGVDCFHFNAVVTNQDMDDTFQPPFKSCVIDGNV 196
Query: 250 ASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAA 309
ASVMCSYNQ I K ++ Y+VSDCDSV V+Y++QH+T TPEEA
Sbjct: 197 ASVMCSYNQFYL--------IYKYHLQ-------YLVSDCDSVDVFYNSQHYTKTPEEAV 241
Query: 310 ADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPY 369
A AI AGLDL+CG FLG HTE+AV+ GL+ E ++ A+ N MRLG FDG PS Y
Sbjct: 242 AKAILAGLDLNCGSFLGQHTEAAVKGGLVDESAVDKAVSNNFATLMRLGFFDGNPSKAIY 301
Query: 370 GHLGPKDV 377
G LGPKDV
Sbjct: 302 GKLGPKDV 309
>gi|255572557|ref|XP_002527212.1| beta-glucosidase, putative [Ricinus communis]
gi|223533388|gb|EEF35138.1| beta-glucosidase, putative [Ricinus communis]
Length = 349
Score = 319 bits (818), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 156/340 (45%), Positives = 215/340 (63%), Gaps = 31/340 (9%)
Query: 6 AFFFLGLILLSAS-SSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQE 64
+ FF ++L S S+ + + + C P + T FC SL +P R + LI L+L+E
Sbjct: 7 SLFFSCIVLFELSVSNSESHKLQYPCQPPLHNSYT--FCNQSLSVPTRAHSLISLLTLEE 64
Query: 65 KVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFG-GDFPGATSFPQVITTASSF 123
K+K L A+ +PR GI YEWWSE+LHG++ GPG F G AT FPQVI +A++F
Sbjct: 65 KIKQLSDNASGIPRFGIPPYEWWSESLHGIAINGPGVSFTIGPVSAATGFPQVIISAAAF 124
Query: 124 NATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYA 183
N TLW IG ++ EARAM+N G +GLT+W+PNVNIFRDPRWGRGQETPGEDP+L+ YA
Sbjct: 125 NRTLWFLIGSAIAIEARAMHNVGQSGLTFWAPNVNIFRDPRWGRGQETPGEDPMLTSAYA 184
Query: 184 ASYVRGLQGSDGD--------------------------RLKVAASCKHFTAYDLDNWNG 217
+V+G QG + L ++A CKH TAYDL+ W
Sbjct: 185 IEFVKGFQGGNWKSGVSGSGSGRYGFGEKRMLRDDDGDDGLMLSACCKHLTAYDLEKWGN 244
Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
R+ FNA V++QD+EDT+ PFR C+ EGK + +MCSYN+VNGVP CA ++L++ R
Sbjct: 245 FSRYSFNAVVTEQDLEDTYQPPFRSCIEEGKASCLMCSYNEVNGVPACAREDLLQKA-RE 303
Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGL 317
EW GYIVSDCD+V ++ Q+++ + E+A A A++AG+
Sbjct: 304 EWGFEGYIVSDCDAVATIFEYQNYSKSAEDAVAIALKAGM 343
>gi|413925165|gb|AFW65097.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 412
Score = 318 bits (816), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 157/316 (49%), Positives = 206/316 (65%), Gaps = 11/316 (3%)
Query: 11 GLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLI 70
L+L+ + A PF+C + LPFC LP QR DL+ R++ EK L
Sbjct: 69 ALLLMRVVAVVAAGAPPFSCG--GGPSLGLPFCNTKLPAAQRAADLVSRMTPAEKASQLG 126
Query: 71 SGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFG-GDFPGATSFPQVITTASSFNATLWE 129
A VPRLG+ Y+WW+EALHGV+ G G G ATSFPQV+ TA+SFN LW
Sbjct: 127 DVANGVPRLGVPSYKWWNEALHGVAISGKGIHMDRGAVRSATSFPQVLLTAASFNDNLWF 186
Query: 130 AIGRVVSDEARAMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVR 188
IG+ EARA YN G A GLT WSPNVNIFRDPRWGRGQETPGEDP ++ +YAA++VR
Sbjct: 187 RIGQATGKEARAFYNIGQAEGLTMWSPNVNIFRDPRWGRGQETPGEDPAVASRYAAAFVR 246
Query: 189 GLQGSDGDR------LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRM 242
GLQGS + L +A CKH TAYDL++W GV R+ F A V+ QD+ DTF+ PFR
Sbjct: 247 GLQGSSSNTKSVPPVLLTSACCKHATAYDLEDWKGVTRYSFRATVTVQDLADTFNPPFRS 306
Query: 243 CVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNG-YIVSDCDSVGVYYDTQHF 301
CV++GK + VMC+Y VNGVP+CA+ ++L +T RG W L+G Y+ +DCD+V + ++Q +
Sbjct: 307 CVVDGKASCVMCAYTSVNGVPSCANADLLTKTFRGSWGLDGRYVAADCDAVSIMRNSQFY 366
Query: 302 TSTPEEAAADAIRAGL 317
T E+ A ++AG+
Sbjct: 367 RPTAEDTVATTLKAGM 382
>gi|189467437|ref|ZP_03016222.1| hypothetical protein BACINT_03826 [Bacteroides intestinalis DSM
17393]
gi|189435701|gb|EDV04686.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 863
Score = 318 bits (816), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 181/448 (40%), Positives = 254/448 (56%), Gaps = 30/448 (6%)
Query: 39 TLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVG 98
T F LPI +RVNDL+GRL+L+EK+ +++ A A+ RLGI Y WW+E LHGV+
Sbjct: 23 TCKFLNPELPIVERVNDLVGRLTLEEKISQMLNNAPAIDRLGIPAYNWWNECLHGVAR-- 80
Query: 99 PGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GL 150
+P TSFPQ I A++++ + SDE RA+Y+ T GL
Sbjct: 81 ------SPYP-VTSFPQAIAMAATWDTESVHQMAVYASDEGRAIYHDATRKGTPGIFRGL 133
Query: 151 TYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAY 210
TYWSPN+NIFRDPRWGRGQET GEDP L+ S+V+GLQG D LK +A KH+ +
Sbjct: 134 TYWSPNINIFRDPRWGRGQETYGEDPFLTASIGVSFVKGLQGDDPVYLKSSACAKHYAVH 193
Query: 211 DLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNI 270
WN R ++AKV+ D+ DT+ F+ V+EGKV VMC+YN G P C + +
Sbjct: 194 SGPEWN---RHTYDAKVNNHDLWDTYLPAFKELVVEGKVTGVMCAYNSFFGQPCCGNDLL 250
Query: 271 LKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE 330
+ +R W+ GY+ SDC +V +Y+T A+ADA+ G D +CG
Sbjct: 251 MMDILRNHWKFGGYVTSDCGAVEDFYNTHKTHQDAAAASADAVLHGTDCECGNGAYRALA 310
Query: 331 SAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-HQELALEA 389
AV RGL++E I+ +L ++ RLGMFD + PY ++ P V D H+ AL+
Sbjct: 311 DAVLRGLITEKQIDESLKKLFEIRFRLGMFDPD-DRVPYSNI-PLSVLECDAHKAHALKI 368
Query: 390 ARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI---- 445
ARQ IVLLKNQ LPL+ + + +AV+GPN+D ++ NY G TT L+GI
Sbjct: 369 ARQSIVLLKNQDQLLPLNKNKIKKIAVVGPNADDKSVLLANYYGYPSHITTALEGIQKKV 428
Query: 446 GRYARTIHQQGCKDVACADDQLFGAAID 473
G ++++G V DD +F +A +
Sbjct: 429 GNQVEVVYEKG---VNLTDDFVFTSAYE 453
Score = 121 bits (303), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 88/298 (29%), Positives = 140/298 (46%), Gaps = 52/298 (17%)
Query: 474 ASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTI 523
A + AD I V GL +E E + DR + +P QQ L+ ++ K P +
Sbjct: 595 AVKDADVIIFVGGLSAKVEGEEMGVEIEGFKRGDRTSISIPSVQQNLLKELYATGK-PVV 653
Query: 524 LVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEY 583
V+M+G + + + + AI+ A Y GQAGG AIAD+LFG NP G+LP+T+Y
Sbjct: 654 FVMMTGSALGLEW--ESAHLPAILNAWYGGQAGGQAIADVLFGDYNPSGRLPLTFYKS-- 709
Query: 584 ITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLD 643
+ +LP E RTYR++ G VYPFG+G+SYT F ++
Sbjct: 710 VNDLPDFE-------DYSMENRTYRYFTGTPVYPFGYGLSYTTFQYSSL----------- 751
Query: 644 GRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHK 703
+ + +++KVT + N G +G ++ + P P +
Sbjct: 752 ----KLQPSPDKRSVKVT------------AKITNTGKMEGEEVAQLYVSNPRDFVTPIR 795
Query: 704 QLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPL-GEHNIHIGGTKHSVSL 760
L F+++++ G Q V + K LSVVD SG + +P+ G+ I +GG + S +
Sbjct: 796 ALKGFKRINLKPGESQTVEF-VLTSKELSVVDISG-KSVPMKGKVQISLGGGQPSTQM 851
>gi|297738404|emb|CBI27605.3| unnamed protein product [Vitis vinifera]
Length = 581
Score = 318 bits (815), Expect = 8e-84, Method: Compositional matrix adjust.
Identities = 169/406 (41%), Positives = 237/406 (58%), Gaps = 50/406 (12%)
Query: 198 LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYN 257
LKV++ CKH+ YD+D+W VS+QD+++TF PF
Sbjct: 19 LKVSSCCKHYATYDIDSW---------LNVSEQDMKETFFSPFE---------------- 53
Query: 258 QVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPE-EAAADAIRAG 316
R EW L+GYIVSDC + V D Q++ + + +A A ++AG
Sbjct: 54 ------------------RDEWDLHGYIVSDCYGLEVIVDNQNYLNESKVDAVAKTLQAG 95
Query: 317 LDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKD 376
LDL+CG + +V G +S+ +++ AL N + MR+G FDG P+ Y LG KD
Sbjct: 96 LDLECGHYYTDALNESVLTGKVSQYELDRALKNIYVLLMRVGYFDGIPA---YESLGLKD 152
Query: 377 VCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIAC 436
+C DH ELA EAARQGIVLLKN LPL + + ++GP+++ T MIGNYAG+
Sbjct: 153 ICAADHIELAREAARQGIVLLKNDYEVLPLK--PGKKLVLVGPHANATEVMIGNYAGLPY 210
Query: 437 GYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL 496
Y +PL+ + GC D +C++D F A +A++ A+ TI+ +G D SIEAE +
Sbjct: 211 KYVSPLEAFSAIGNVTYATGCLDASCSNDTYFSEAKEAAKFAEVTIIFVGTDLSIEAEFV 270
Query: 497 DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAG 556
DR LLPG Q EL+ +V+ S GP ILV++SG ID+ FAKN+PRI+AI+W G+PG+ G
Sbjct: 271 DRVDFLLPGNQTELIKQVAEVSSGPVILVVLSGSNIDITFAKNNPRISAILWVGFPGEQG 330
Query: 557 GTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEM-AMRPSQSKR 601
G AIAD++FG NPGG+LP+TWY +Y+ L M A PS KR
Sbjct: 331 GHAIADVVFGKYNPGGRLPVTWYEADYVACLETHIMDAKTPSPQKR 376
>gi|90021134|ref|YP_526961.1| Beta-glucosidase [Saccharophagus degradans 2-40]
gi|89950734|gb|ABD80749.1| b-xylosidase-like protein [Saccharophagus degradans 2-40]
Length = 893
Score = 318 bits (814), Expect = 8e-84, Method: Compositional matrix adjust.
Identities = 183/462 (39%), Positives = 266/462 (57%), Gaps = 44/462 (9%)
Query: 8 FFLGLILLSASSSGLAA----REPFACDPKD--ATTRTLPFCQVSLPIPQRVNDLIGRLS 61
F +G+I L GL A P A P+D A+T T PF SL + RV+DL+ RL+
Sbjct: 5 FLVGVITLGVILQGLTACSKSAAPNANQPQDTAASTATYPFRDASLSVDARVDDLVSRLT 64
Query: 62 LQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTAS 121
EK+ + + A+ RLGI Y WW+E+LHGV+ G AT +PQ I AS
Sbjct: 65 TTEKIAQMFNDTPAIERLGIPAYNWWNESLHGVARAGK----------ATVYPQAIGLAS 114
Query: 122 SFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRDPRWGRGQETPG 173
+F+ L + +SDE RA Y+ + GLT+WSPN+NIFRDPRWGRGQET G
Sbjct: 115 TFDEDLMLRVATSISDEGRAKYHDFLSKDVRTIYGGLTFWSPNINIFRDPRWGRGQETYG 174
Query: 174 EDPVLSGKYAASYVRGLQGS--DGDRLKVAASCKHFTAYD-LDNWNGVDRFHFNAKVSKQ 230
EDP L+G+ A ++V+G+QG + D LK A+ KH+ + + D +H +++
Sbjct: 175 EDPFLTGRMAINFVKGIQGENDNSDYLKAVATIKHYAVHSGPEKTRHSDDYH----PTRK 230
Query: 231 DIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCD 290
D+ +T+ FRM + E V S+MC+YN+V+G P C + +++ +RG+ NGY+VSDC
Sbjct: 231 DLFETYLPAFRMAIAETNVQSLMCAYNRVDGAPACGNNELMQEILRGDMGFNGYVVSDCG 290
Query: 291 SVGVYYDTQ--HFTSTPEEAAADAIRAGLDLDCG-----PFLGLHTESAVQRGLLSEIDI 343
++ +Y+++ H +P EAAA A+++G DL+CG + LH A+Q+GL++E I
Sbjct: 291 AIADFYESRSHHVVDSPAEAAAWAVKSGTDLNCGDSHGNTYTNLHY--ALQQGLITEDYI 348
Query: 344 NNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
+ A+ +++LGMFD E PY +G V +P H L EAA + IVLLKN G
Sbjct: 349 DIAVKRLFKARIKLGMFD-EQDRVPYSEIGMDVVGSPKHLALTQEAAEKSIVLLKNNG-V 406
Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
LPL VAVIGPN+ ++GNY G+ PL+GI
Sbjct: 407 LPLK--AGVKVAVIGPNAVDEDVLVGNYHGVPVKPVLPLEGI 446
Score = 106 bits (264), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 90/313 (28%), Positives = 143/313 (45%), Gaps = 55/313 (17%)
Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
A+ A+R+AD I + G+D +E E + DR + LP Q L+ ++ K
Sbjct: 620 ALAAARKADVIIFMGGIDAHLEGEEMPLELDGFTHGDRTHINLPKVQTNLLKQLKATGK- 678
Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
P ++V SG + A ++ AI+ A YPG+A GTA+A+IL+G +P G+LP+T+Y
Sbjct: 679 PVVMVNFSGSAM--ALNWESEKLDAILQAFYPGEATGTALANILWGDVSPSGRLPVTFYK 736
Query: 581 QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
+ +LP RTY+FY+G +Y FGHG+ Y +F + VVA
Sbjct: 737 G--VDDLPAF-------NDYHMENRTYKFYRGEPLYAFGHGLGYVDFAYN----NLVVA- 782
Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVG---SKDGAHTLLVFSTPPAG 697
N +GKA L + V V N G ++D A + PA
Sbjct: 783 ---------NTAEAGKA------------LPIAVSVTNTGKMQAEDVAQVYISLLDAPAN 821
Query: 698 HWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG-GTKH 756
P + L AF++ + AG + N+ + L+ +D +G + G + +G G K
Sbjct: 822 --TPIRDLKAFKRTKLAAGESTELEFNLP-ARVLTYIDDNGKTQTYTGRVEVTVGSGQKG 878
Query: 757 SVSLHAATLGVIK 769
V +A + I
Sbjct: 879 YVKENAIAVATIN 891
>gi|449489074|ref|XP_002195511.2| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
[Taeniopygia guttata]
Length = 685
Score = 316 bits (809), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 226/683 (33%), Positives = 342/683 (50%), Gaps = 88/683 (12%)
Query: 73 AAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPG-ATSFPQVITTASSFNATLWEAI 131
A +PRLGI Y W +E L G G+ PG AT+FPQ + A++F+ L +
Sbjct: 6 APPIPRLGIAPYNWNTECLRG----------DGEAPGWATAFPQALGLAAAFSPELIYRV 55
Query: 132 GRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYA 183
+ E RA +N A GL+ +SP +NI R P WGR QET GEDP LSG+ A
Sbjct: 56 ANATATEVRAKHNSFAAAGRYSDHTGLSCFSPVLNIMRHPLWGRNQETYGEDPFLSGELA 115
Query: 184 ASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMC 243
S+V+GLQG +K +A CKHF+ + + + V ++D TF F+ C
Sbjct: 116 RSFVQGLQGPHPRYVKASAGCKHFSVHGGHE----NILLYLLTVLERDWRMTFLPQFQAC 171
Query: 244 VMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTS 303
V G S MCSYN++NGVP CA+ +L +RGEW +GY+VSD +V + H+T
Sbjct: 172 VRAGSY-SFMCSYNRINGVPACANKKLLTDILRGEWGFDGYVVSDEGAVELIMLGHHYTR 230
Query: 304 TPEEAAADAIRAGLDLDCGPFLG----LHTESAVQRGLLSEIDINNALVNTLTVQMRLGM 359
+ E A ++ AG +L+ + + A+ G ++ + + + +MRLG
Sbjct: 231 SFLETAVASVNAGCNLELSYGMRNNVFMRIPEALAMGNITLQMLRDRVRPLFYTRMRLGE 290
Query: 360 FDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS--HIRHRTVAVI 417
FD P+ PY L V +P+H+ L+LEAA + VLLKN +LPL + + +AV+
Sbjct: 291 FD-PPAMNPYSSLDLSVVQSPEHRNLSLEAAVKSFVLLKNVRGTLPLKAQDLSSQHLAVV 349
Query: 418 GPNSDVTVTMIGNYAGIACG--YTTPLQGIGRY-ARTIHQQGCKDVACADDQLFGAA--I 472
GP +D + G+YA + TP +G+ A GC + C Q + A +
Sbjct: 350 GPFADNPRVLFGDYAPVPEPRYIYTPRRGLEMLGANVSFAAGCSEPRC---QRYSRAELV 406
Query: 473 DASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKG-PTILVLMSGGP 531
AD ++ +G +E EA DR+ L LPG Q EL+ A+ G P IL+L + GP
Sbjct: 407 KVVGAADVVLVCLGTGVDVETEAKDRSDLSLPGHQLELLQDAVQAAAGRPVILLLFNAGP 466
Query: 532 IDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGT--SNPGGKLPMTWYPQEYITNLPM 589
+DV++A+ + AI+ +P QA G AIA +L G ++P G+LP TW P
Sbjct: 467 LDVSWAQAHDGVGAILACFFPAQATGLAIARVLLGEAGASPAGRLPATW---------PA 517
Query: 590 TEMAMRPSQSKRYPGRTYRFY-KGPVVYPFGHGMSYTNFVHT-VANAPTVVAVPLDGRHG 647
+ P ++ GRTYR+Y + +YPFG+G+SYT F + + +P V+ +
Sbjct: 518 GMHQVPPMENYTMEGRTYRYYGQEAPLYPFGYGLSYTTFRYRDLVLSPPVLPL------- 570
Query: 648 SINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-----STPPAGHWAPH 702
C L+ V V ++N G +D + ++ S+ P W
Sbjct: 571 -----------------CANLS--VSVVLENTGLRDSEEVVQLYLRWEHSSVPVPRW--- 608
Query: 703 KQLVAFEKVHVPAGAQQRVGINI 725
QLVAF +V VPAG + ++ +
Sbjct: 609 -QLVAFRRVAVPAGREAKLSFQV 630
>gi|167521708|ref|XP_001745192.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776150|gb|EDQ89770.1| predicted protein [Monosiga brevicollis MX1]
Length = 614
Score = 311 bits (798), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 209/610 (34%), Positives = 307/610 (50%), Gaps = 56/610 (9%)
Query: 41 PFCQVSLPIP-QRVNDLIGRLSLQEKVKLLIS---------GAAAVPRLGIKGYEWWSEA 90
P C +P + LI + EK LL + V R+G+ Y+W A
Sbjct: 27 PHCLTMFRVPIGWMQSLISLIHDDEKPPLLTAREGGGGSPGPPGNVSRIGLPEYDWGMNA 86
Query: 91 LHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA-- 148
+HGV + G + TSFP + ++N + + +GR++ E RA++ G
Sbjct: 87 IHGVQSSCIKDDDGTVY-CPTSFPNPVNYGFTWNYSAYLELGRIIGVETRALWLAGAVEA 145
Query: 149 ---------GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLK 199
GL WSPN+NI R P WGR QE PGEDP ++G++ +Y GLQG D L+
Sbjct: 146 STWSGRPHIGLDTWSPNINIARSPLWGRNQEVPGEDPFMNGQFGKAYTLGLQGDDDTYLQ 205
Query: 200 VAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQV 259
+ KH+ AY L++ +G R +FNA VS + DT+ FR+ V EGK VMCSYN V
Sbjct: 206 AIVTLKHWDAYSLEDSDGATRHNFNAIVSNFSLMDTYWPAFRVAVTEGKAKGVMCSYNAV 265
Query: 260 NGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAG-LD 318
NG+PTCA P +L+ +R W+ +GY+ SD +V D +T + AA AIR G D
Sbjct: 266 NGIPTCAHP-LLRTVLRDLWKFDGYVSSDTGAVEDISDNHKYTPSWATAACAAIRDGQTD 324
Query: 319 LDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVC 378
+D G V G D++NAL NTL ++ LG+FD +Q Y H+ V
Sbjct: 325 IDSGAVYMKSLLQGVSEGHCRMEDVDNALRNTLRLRFELGLFD-PVENQSYWHVPLAAVN 383
Query: 379 TPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIAC-- 436
T + + + +VLL+N+ LPL+ + VA+IGP++ M+GNY G C
Sbjct: 384 TNASRATNMLHTLESMVLLQNKNNVLPLA--SNTKVALIGPHAKAQEDMVGNYLGQLCPD 441
Query: 437 --------GYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLD 488
+ + +G A T + G C+ + A+ + AD +L++G+D
Sbjct: 442 NNFDCVVSPHDALVSILGTDAVT-YAPGTNVTTCSQSHI-DEAVSVATAADVAVLMLGID 499
Query: 489 QSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIW 548
+SIEAE+ DR + LP Q +L S + K PT++VL++GG + + K + AII
Sbjct: 500 ESIEAESNDRKSIDLPECQHQLASAIFAVGK-PTVIVLLNGGMLAIENEKQ--QADAIIE 556
Query: 549 AGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYR 608
AGYPG GGTAIA L G + G +YI + M++M M PGRTYR
Sbjct: 557 AGYPGFYGGTAIAQTLTGQNEHLG---------DYINWINMSDMEMTSG-----PGRTYR 602
Query: 609 FYKGPVVYPF 618
+YK ++ F
Sbjct: 603 YYKNETLWAF 612
>gi|389636381|ref|XP_003715843.1| beta-xylosidase [Magnaporthe oryzae 70-15]
gi|351648176|gb|EHA56036.1| beta-xylosidase [Magnaporthe oryzae 70-15]
gi|440480767|gb|ELQ61414.1| beta-xylosidase [Magnaporthe oryzae P131]
Length = 517
Score = 311 bits (798), Expect = 7e-82, Method: Compositional matrix adjust.
Identities = 187/517 (36%), Positives = 268/517 (51%), Gaps = 13/517 (2%)
Query: 7 FFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKV 66
+F LG + +R D + T C +L P+R L+ LS++EK+
Sbjct: 5 WFILGASAFVSGQEDPESRRQSYPDCVNGPLSTNNVCDRTLSPPERAAALVEALSIEEKL 64
Query: 67 KLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKF---GGDFPGATSFPQVITTASSF 123
+ L+S + PR+G+ Y WWSEALHGV+ PGT F +F +TS+P + A+ F
Sbjct: 65 QNLVSKSQGAPRIGLPAYNWWSEALHGVA-YAPGTYFPQGNVEFNSSTSYPMPLLMAAGF 123
Query: 124 NATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYA 183
+ L E IG + EARA N G AG YW+PNVN F+DPRWGRG ETPGED + +YA
Sbjct: 124 DDNLIEKIGTAIGIEARAWGNSGWAGFDYWTPNVNAFKDPRWGRGSETPGEDVLRIKRYA 183
Query: 184 ASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRM 242
RGL G ++ ++ ++CKH+ D ++WNG R FNAK++ QD+ + + PF+
Sbjct: 184 EYITRGLDGPVPNEQRRIISTCKHYAGNDFEDWNGTTRHDFNAKITMQDLAEYYLKPFQQ 243
Query: 243 CVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRL---NGYIVSDCDSVGVYYDTQ 299
C + KV S+MC+YN VNGVP+CA+ +L+ +R W+ N Y+ SDC++V
Sbjct: 244 CARDSKVGSIMCAYNAVNGVPSCANKYLLQTILRDHWKWTEHNNYVTSDCEAVLDVSANH 303
Query: 300 HFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGM 359
H+ T A AG+D C A +GLL E ++ AL+ +R G
Sbjct: 304 HYAPTNAAGTAICFEAGMDTSCEYTGSSDIPGAWSQGLLKEETVDRALLRLYEGLVRAGY 363
Query: 360 FDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGP 419
FDGE + Y L + V + + Q LAL+AA +G+VLLKN G +LPL +A+IG
Sbjct: 364 FDGEEAM--YADLDWQHVNSAEAQSLALQAAVEGMVLLKNNG-TLPLDLDPSHKIAMIGF 420
Query: 420 NSDVTVTMIGNYAGIACGYTTPLQGIGRYAR--TIHQQGCKDVACADDQLFGAAIDASRQ 477
+D + G Y+G A +P + T+ A D A++A+
Sbjct: 421 WADAPEKLQGGYSGRAHHLYSPAFAARQLGLDITVASGPVLQDNNASDNWTTNALEAASG 480
Query: 478 ADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKV 514
AD + GLD S E LDR L P Q LV V
Sbjct: 481 ADYILYFGGLDTSAAGETLDRTDLDWPEAQLTLVKVV 517
>gi|329850151|ref|ZP_08264997.1| beta-xylosidase B [Asticcacaulis biprosthecum C19]
gi|328842062|gb|EGF91632.1| beta-xylosidase B [Asticcacaulis biprosthecum C19]
Length = 877
Score = 311 bits (797), Expect = 8e-82, Method: Compositional matrix adjust.
Identities = 198/511 (38%), Positives = 269/511 (52%), Gaps = 62/511 (12%)
Query: 29 ACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWS 88
A DP A + + +L R DL+ R++L+EK L A A+PRLG+ Y WW+
Sbjct: 10 ALDPVPADVAAMAYRDTALDPKARAADLVSRMTLEEKAAQLGHTAPAIPRLGVPKYNWWN 69
Query: 89 EALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY----- 143
E LHGV+ G AT FPQ I A++++ + +G VVS E RA Y
Sbjct: 70 EGLHGVARAGV----------ATVFPQAIGMAATWDEPMMTTVGDVVSTEFRAKYVERVH 119
Query: 144 -NGGT---AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLK 199
+GGT GLT WSPN+NIFRDPRWGRGQET GEDP L+ + Y+ GLQG+D K
Sbjct: 120 PDGGTDWYRGLTVWSPNINIFRDPRWGRGQETYGEDPYLTSRIGIGYIHGLQGNDPKFFK 179
Query: 200 VAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQV 259
A+ KHF + N R + SK D+EDT+ FR V EGK SVMC YN V
Sbjct: 180 TVATSKHFAVHSGPESN---RHKEDVYPSKFDLEDTYLPAFRATVTEGKAYSVMCVYNAV 236
Query: 260 NGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYY--DTQHFTSTPEEAAADAIRAGL 317
GVP CA +++ +R W G++VSDC + + D H+T T EE A ++AG+
Sbjct: 237 YGVPGCASDFLMEEKLRQNWGFPGFVVSDCGAAANIFREDALHYTKTAEEGVAVGLKAGM 296
Query: 318 DLDCGPFLG-LHTE-----SAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGH 371
DL CG + + TE +AV+ G L ++ ALV ++RLGMFD P+S P+ H
Sbjct: 297 DLICGDYRNKMSTEVQPIINAVKAGQLPIAVVDQALVRLFEGRIRLGMFD-PPASLPFAH 355
Query: 372 LGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY 431
+ D TP H +AL+ A++ +VLLKN G LPL +T+AVIGPN+D ++GNY
Sbjct: 356 ITADDSDTPAHHAVALDMAKKSMVLLKNDG-LLPLK-AEPKTIAVIGPNADSLDALVGNY 413
Query: 432 AGIACGYTTPLQGI-GRY--ARTIHQQG----------------CKDVAC---------- 462
G T L GI R+ A+ ++ +G C D AC
Sbjct: 414 YGKPSKPVTVLDGIRARFPTAKIVYAEGTGLIGPAEPPVPDSVLCLDGACTKQGLTAEHF 473
Query: 463 ADDQLFGAAIDASRQADATILVMGLDQSIEA 493
A D L GA + +A+A + G D+S A
Sbjct: 474 AGDDLQGAPVATRTEANARLDWQGEDKSSSA 504
Score = 116 bits (291), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 94/298 (31%), Positives = 146/298 (48%), Gaps = 52/298 (17%)
Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSM 516
+ G A+D ++ AD + V GL +E E + DR + LP QQ+L+ KV +
Sbjct: 587 MAGQAVDVAKTADFVVFVGGLSARVEGEEMKVEAEGFAGGDRTSIDLPKPQQQLLEKV-I 645
Query: 517 ASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPM 576
+ PT+LVLMSG + V +A D + AII A YPG GG A+A ++ G +P G+LP+
Sbjct: 646 GTGKPTVLVLMSGSALGVNWA--DKHVPAIIEAWYPGGEGGHAVAQLIAGDYSPAGRLPV 703
Query: 577 TWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAP 635
T+Y + LP ++ M+ RTYR++ G V+YPFGHG+SYT F + AN
Sbjct: 704 TFY--RSVDALPGFSDYTMK--------NRTYRYFNGEVLYPFGHGLSYTTFAY--ANPK 751
Query: 636 TVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPP 695
+ ++ V VDV N G+ D + ++ + P
Sbjct: 752 V-----------------------SAASVAAGSSVTVSVDVSNSGAMDSDEVVQLYVSHP 788
Query: 696 AGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
G + L F++V + G + V + + LSVVD G R++ G+ ++ IGG
Sbjct: 789 GG--TAIRSLQGFQRVSLKKGETKTVQFKLD-DRALSVVDEHGGRKVQAGQVDLWIGG 843
>gi|325919363|ref|ZP_08181395.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas gardneri ATCC
19865]
gi|325550152|gb|EGD20974.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas gardneri ATCC
19865]
Length = 876
Score = 311 bits (797), Expect = 9e-82, Method: Compositional matrix adjust.
Identities = 178/444 (40%), Positives = 249/444 (56%), Gaps = 29/444 (6%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
P+ P R DL+ R++L+EK + + A A+PRL + Y+WW+EALHGV+ G
Sbjct: 19 PYLDTQRPFDARAADLVARMTLEEKAAQMQNAAPAIPRLQVPEYDWWNEALHGVARAG-- 76
Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTY 152
GAT FPQ I A++F+ L + +SDEARA ++ A GLT+
Sbjct: 77 --------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLARGEYKRYQGLTF 128
Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL 212
WSPN+NIFRDPRWGRGQET GEDP L+ + ++V+GLQ G K+ A+ KHF +
Sbjct: 129 WSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQQGPYRKLDATAKHFAVH-- 186
Query: 213 DNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILK 272
+ DR HF+ S++D+ +T+ F+ V EGKVA+VM +YN+VNG A L+
Sbjct: 187 -SGPEADRHHFDVHPSERDLHETYLPAFQALVQEGKVAAVMGAYNRVNGESASASTR-LE 244
Query: 273 RTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESA 332
+R +W +GYIVSDC ++ + TPE AAA ++ G DLDCG +A
Sbjct: 245 GILRRDWGFDGYIVSDCAAIRDIWQNHKIVPTPEAAAALGVKHGTDLDCGDTYAA-LPAA 303
Query: 333 VQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQ 392
V+ GL+ E I+ AL +T +MRLGMFD P+ P+ + +P H LA AR+
Sbjct: 304 VRAGLIDEATIDTALKRLMTTRMRLGMFD-PPAKVPWAQIPASANQSPQHDALARRTARE 362
Query: 393 GIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI---GRYA 449
+VLLKN G LPL R +AVIGP +D ++++GNY G T LQGI A
Sbjct: 363 SLVLLKNDG-VLPLKPTLKR-IAVIGPTADDPMSLLGNYYGTPAAPVTILQGIRDAAPQA 420
Query: 450 RTIHQQGCKDVACADDQLFGAAID 473
+ I+ +G V +D A ID
Sbjct: 421 QVIYARGSDLVEGREDPNAAAPID 444
Score = 120 bits (300), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 88/302 (29%), Positives = 143/302 (47%), Gaps = 53/302 (17%)
Query: 471 AIDASRQADATILVMGLDQSIEAEALD----------RAGLLLPGRQQELVSKVSMASKG 520
A+DA+R A+ + V GL +E E +D R LP Q+EL+ + A+
Sbjct: 603 AVDAARDAEVVVFVGGLTGDVEGEEMDVNYPGFAGGDRTDTRLPKPQRELLQALQ-ATGT 661
Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
P + VL +G + + +A+ + AI+ A YPGQ GG+A+ D+LFG ++PGG+LP+T+Y
Sbjct: 662 PVVAVLTTGSALAIDWAQQ--HVPAILLAWYPGQRGGSAVGDVLFGQASPGGRLPVTFYK 719
Query: 581 QEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
+ LP + AMR GRTYR+++G +YPFGHG+SYT F ++
Sbjct: 720 E--AERLPAFDDYAMR--------GRTYRYFQGKPLYPFGHGLSYTQFAYS--------- 760
Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP--PAG 697
D R G TL V +KN G + G + ++ P P
Sbjct: 761 ---DLRLDRTTVAADG-------------TLTATVTLKNTGQRAGDEVVQLYLHPLKPQR 804
Query: 698 HWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGTKH 756
A K+L +++ + G Q+++ I L + D + + G + + IG +
Sbjct: 805 ERA-LKELHGLQRITLQPGEQRQLRFTIKAQDALRIYDEQRKAYAVDPGAYEVQIGASSA 863
Query: 757 SV 758
+
Sbjct: 864 DI 865
>gi|440476402|gb|ELQ45004.1| beta-xylosidase, partial [Magnaporthe oryzae Y34]
Length = 515
Score = 311 bits (796), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 186/514 (36%), Positives = 267/514 (51%), Gaps = 13/514 (2%)
Query: 7 FFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKV 66
+F LG + +R D + T C +L P+R L+ LS++EK+
Sbjct: 5 WFILGASAFVSGQEDPESRRQSYPDCVNGPLSTNNVCDRTLSPPERAAALVEALSIEEKL 64
Query: 67 KLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKF---GGDFPGATSFPQVITTASSF 123
+ L+S + PR+G+ Y WWSEALHGV+ PGT F +F +TS+P + A+ F
Sbjct: 65 QNLVSKSQGAPRIGLPAYNWWSEALHGVA-YAPGTYFPQGNVEFNSSTSYPMPLLMAAGF 123
Query: 124 NATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYA 183
+ L E IG + EARA N G AG YW+PNVN F+DPRWGRG ETPGED + +YA
Sbjct: 124 DDNLIEKIGTAIGIEARAWGNSGWAGFDYWTPNVNAFKDPRWGRGSETPGEDVLRIKRYA 183
Query: 184 ASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRM 242
RGL G ++ ++ ++CKH+ D ++WNG R FNAK++ QD+ + + PF+
Sbjct: 184 EYITRGLDGPVPNEQRRIISTCKHYAGNDFEDWNGTTRHDFNAKITMQDLAEYYLKPFQQ 243
Query: 243 CVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRL---NGYIVSDCDSVGVYYDTQ 299
C + KV S+MC+YN VNGVP+CA+ +L+ +R W+ N Y+ SDC++V
Sbjct: 244 CARDSKVGSIMCAYNAVNGVPSCANKYLLQTILRDHWKWTEHNNYVTSDCEAVLDVSANH 303
Query: 300 HFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGM 359
H+ T A AG+D C A +GLL E ++ AL+ +R G
Sbjct: 304 HYAPTNAAGTAICFEAGMDTSCEYTGSSDIPGAWSQGLLKEETVDRALLRLYEGLVRAGY 363
Query: 360 FDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGP 419
FDGE + Y L + V + + Q LAL+AA +G+VLLKN G +LPL +A+IG
Sbjct: 364 FDGEEAM--YADLDWQHVNSAEAQSLALQAAVEGMVLLKNNG-TLPLDLDPSHKIAMIGF 420
Query: 420 NSDVTVTMIGNYAGIACGYTTPLQGIGRYAR--TIHQQGCKDVACADDQLFGAAIDASRQ 477
+D + G Y+G A +P + T+ A D A++A+
Sbjct: 421 WADAPEKLQGGYSGRAHHLYSPAFAARQLGLDITVASGPVLQDNNASDNWTTNALEAASG 480
Query: 478 ADATILVMGLDQSIEAEALDRAGLLLPGRQQELV 511
AD + GLD S E LDR L P Q LV
Sbjct: 481 ADYILYFGGLDTSAAGETLDRTDLDWPEAQLTLV 514
>gi|433677589|ref|ZP_20509555.1| beta-glucosidase [Xanthomonas translucens pv. translucens DSM
18974]
gi|430817300|emb|CCP39963.1| beta-glucosidase [Xanthomonas translucens pv. translucens DSM
18974]
Length = 913
Score = 310 bits (795), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 173/444 (38%), Positives = 250/444 (56%), Gaps = 36/444 (8%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
+ QR DL+ R++L+EK + + A A+PRLG+ Y+WW+EALHGV+ G
Sbjct: 37 YLDTQRSFEQRAADLVARMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 93
Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT--------AGLTYW 153
GAT FPQ I A++F+ L + +SDEARA ++ GLT+W
Sbjct: 94 -------GATVFPQAIGMAATFDLPLMHEVSTAISDEARAKHHEALRHDQHARYQGLTFW 146
Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD---------RLKVAASC 204
SPN+NIFRDPRWGRGQET GEDP L+ + ++V+GLQG D D K+ A+
Sbjct: 147 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEDVDVPKNAQGEAYRKLDATA 206
Query: 205 KHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
KHF + + DR HF+A S++D+ +T+ F V EGKV +VM +YN+V G
Sbjct: 207 KHFAVH---SGPEADRHHFDAHPSQRDLYETYLPAFEALVKEGKVDAVMGAYNRVYGESA 263
Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPF 324
A +L+ +R W +GY+VSDC ++ + +T EEAAA A++ G +L+CG
Sbjct: 264 SASKFLLRDVLRDRWGFDGYVVSDCWAIVDIWKNHKIVATREEAAALAVKHGTELECGAE 323
Query: 325 LGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQE 384
+AV++GL+SE D++NAL + +MRLGMFD P + + +P+H
Sbjct: 324 YST-LPTAVRKGLISEADVDNALQKLMYSRMRLGMFD-PPEKLAWAQIPLSANQSPEHDA 381
Query: 385 LALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG 444
LA AR+ +VLLKN G LPLS + + +AV+GP +D T+ ++GNY G T LQG
Sbjct: 382 LARRTARESLVLLKNDG-VLPLSRAKIKRIAVVGPTADDTMALLGNYYGTPAAPVTVLQG 440
Query: 445 IGRY---ARTIHQQGCKDVACADD 465
I A ++ +G V DD
Sbjct: 441 IREAAPDAEVLYARGADLVEGRDD 464
Score = 123 bits (308), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 91/301 (30%), Positives = 142/301 (47%), Gaps = 51/301 (16%)
Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
A+DA+R+AD + V GL +E E + DR L LP Q+ L+ + K
Sbjct: 631 ALDAARRADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRALLEALHGTGK- 689
Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
P + VL +G + + +A+ + AI+ A YPGQ GG+A+AD+LFG +NPGG+LP+T+Y
Sbjct: 690 PVVAVLTTGSALAIDWAQQ--HVPAILLAWYPGQRGGSAVADVLFGDANPGGRLPVTFY- 746
Query: 581 QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
+E T + AMR GRTYR++ G +YPFGHG+SYT F ++
Sbjct: 747 KESETLPAFDDYAMR--------GRTYRYFAGTALYPFGHGLSYTQFAYS---------- 788
Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP--PAGH 698
D R G+ L + VKN G + G + ++ P P
Sbjct: 789 --DLRLDRSKLAADGR-------------LHATLKVKNTGQRAGDEVVQLYLQPLSPQRE 833
Query: 699 WAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRR-IPLGEHNIHIGGTKHS 757
A K L F+++ + G + V I L + D + + G++ + +G +
Sbjct: 834 RA-SKDLRGFQRIALQPGETREVRFAISPQSDLRLYDEARKAYVVDPGDYELQVGASSSD 892
Query: 758 V 758
V
Sbjct: 893 V 893
>gi|389736853|ref|ZP_10190363.1| glucan 1,4-beta-glucosidase [Rhodanobacter sp. 115]
gi|388438821|gb|EIL95541.1| glucan 1,4-beta-glucosidase [Rhodanobacter sp. 115]
Length = 868
Score = 310 bits (794), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 172/406 (42%), Positives = 243/406 (59%), Gaps = 33/406 (8%)
Query: 52 RVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGAT 111
R L+ +++L EKV + + A A+PRLG+ Y+WWSE LHG++ G AT
Sbjct: 32 RAVALVAKMTLPEKVAQMQNDAPAIPRLGVPAYDWWSEGLHGIARNG----------YAT 81
Query: 112 SFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYWSPNVNIFRDP 163
FPQ I A+S++ +L A+G V+S EARA +N G GLT WSPN+NIFRDP
Sbjct: 82 VFPQAIGLAASWDTSLLHAVGTVISTEARAKFNASGSGRAHGLFQGLTLWSPNINIFRDP 141
Query: 164 RWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHF 223
RWGRGQET GEDP L+G+ A ++VRG+QG D + A+ KHF A+ + R F
Sbjct: 142 RWGRGQETYGEDPYLTGQLAVAFVRGIQGDDPQHPRAIATPKHFVAH---SGPEAGRDSF 198
Query: 224 NAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNG 283
+ VS D+EDT+ FR V++G SVMC+YN ++G P CA+ +L +R +W G
Sbjct: 199 DVDVSPHDLEDTYLPAFRTAVVDGHAGSVMCAYNALHGTPACANAGLLDTRLRKDWGFAG 258
Query: 284 YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTES----AVQRGLLS 339
Y+VSDCD+VG +F +A+ A++AG DLDCG HT + AV++G ++
Sbjct: 259 YVVSDCDAVGDIASYHYFKPDDVQASVAAVQAGTDLDCG-----HTYASLAQAVRQGDIA 313
Query: 340 EIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKN 399
E ++ +LV T + RLG G + PY +G + +P H++LAL+AA + +VLLKN
Sbjct: 314 ESALDASLVRLFTARYRLGEL-GSRGNDPYARIGADQIDSPAHRKLALQAALESLVLLKN 372
Query: 400 QGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
+LPL H R +AVIGP++D T+ NY G A TPLQG+
Sbjct: 373 AHSTLPL-HAGMR-LAVIGPDADALETLEANYHGTARHPVTPLQGL 416
Score = 139 bits (351), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 100/285 (35%), Positives = 144/285 (50%), Gaps = 48/285 (16%)
Query: 478 ADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLM 527
ADA + +GL +E E L DR + LP Q+ L+ + + AS P I+VL+
Sbjct: 598 ADAVVAFIGLSPDVEGEQLRIDVPGFDGGDRTDIGLPAPQRALLER-ARASGKPLIVVLL 656
Query: 528 SGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL 587
SG + + +A+ AI+ A YPGQAGGTAIA +L G NPGG+LP+T+Y +L
Sbjct: 657 SGSAVALDWAQQ--HADAILAAWYPGQAGGTAIAQVLAGDYNPGGRLPVTFY--RSTRDL 712
Query: 588 PMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHG 647
P P S GRTYR++ G +YPFG+G+SYT F + APT+ A
Sbjct: 713 P-------PYVSYAMQGRTYRYFDGRPLYPFGYGLSYTRFTYA---APTLSA-------- 754
Query: 648 SINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVA 707
AT+ TL V +V+N G + G + V+ P AP LV
Sbjct: 755 ---ATLKAGG-----------TLQVSAEVRNAGQRAGDEVVQVYLDTPPSPLAPRHALVG 800
Query: 708 FEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
F ++H+ AG Q+ V + + LS VD +G R + G++ + IG
Sbjct: 801 FRRIHLAAGEQRLVRFTL-APRQLSSVDAAGARAVEPGQYRVFIG 844
>gi|289668505|ref|ZP_06489580.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. musacearum
NCPPB 4381]
Length = 902
Score = 309 bits (791), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 174/445 (39%), Positives = 249/445 (55%), Gaps = 36/445 (8%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
P+ QR DL+ R++L+EK + + A A+PRLG+ Y+WW+EALHGV+ G
Sbjct: 34 PYLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVAAYDWWNEALHGVARAG-- 91
Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNG--------GTAGLTY 152
GAT FPQ I A++F+ L + +SDEARA ++ GLT+
Sbjct: 92 --------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHERYQGLTF 143
Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVA---------AS 203
WSPN+NIFRDPRWGRGQET GEDP L+ + ++VRGLQG GD K A A+
Sbjct: 144 WSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVRGLQGEGGDAPKNAQGESYRKLDAT 203
Query: 204 CKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVP 263
KHF + DR HF+A+ S++D+ +T+ F V +GKV +VM +YN+V G
Sbjct: 204 AKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGES 260
Query: 264 TCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGP 323
A +L+ +R +W GY+VSDC ++ + +T E+AAA A++ G +L+CG
Sbjct: 261 ASASKFLLQDVLRQQWGFKGYVVSDCWAIVDIWKHHKIVATREQAAALAVKHGTELECGE 320
Query: 324 FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQ 383
+AV +GL+ E I+ +L +T +MRLGMFD P P+ + +P H
Sbjct: 321 EYST-LPAAVHQGLIEEAQIDTSLQTLMTARMRLGMFD-PPGQLPWSKIPASVNQSPAHD 378
Query: 384 ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQ 443
LA AR+ +VLLKN G LPLS + + +AVIGP +D T+ ++GNY G T LQ
Sbjct: 379 ALARRTARESLVLLKNDG-LLPLSRTKLKRIAVIGPTADDTMALLGNYYGTPAAPVTVLQ 437
Query: 444 GI---GRYARTIHQQGCKDVACADD 465
GI A+ ++ +G V DD
Sbjct: 438 GIRAAAPNAQVLYARGADLVEGRDD 462
Score = 124 bits (310), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 95/304 (31%), Positives = 146/304 (48%), Gaps = 57/304 (18%)
Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
A+D +R A+ + V GL +E E + DR L LP Q+EL+ + K
Sbjct: 629 ALDVARSAEVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRELLEALQATGK- 687
Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
P + VL +G + + +A+ + AI+ A YPGQ GGTA+AD LFG +NPGG+LP+T+Y
Sbjct: 688 PVVAVLTAGSALAIDWAQQ--HVPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK 745
Query: 581 QEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHT--VANAPTV 637
+ LP + AM GRTYR++ G +YPFGHG+SYT F ++ + TV
Sbjct: 746 ES--EALPAFDDYAMH--------GRTYRYFGGTPLYPFGHGLSYTQFAYSDLRLDRNTV 795
Query: 638 VAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP--P 695
A GS AT++ VKN G + G ++ P P
Sbjct: 796 AA------DGSFTATVT---------------------VKNTGQRAGDEVAQLYLHPLTP 828
Query: 696 AGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGT 754
A K+L F++V + G Q+ + I+ + L + D + T + G + + IG +
Sbjct: 829 QRERA-GKELRGFQRVALHPGEQRELRFPINAKEALRIYDEQRKTYTVDPGAYEVQIGAS 887
Query: 755 KHSV 758
+
Sbjct: 888 SADI 891
>gi|285016879|ref|YP_003374590.1| beta-glucosidase [Xanthomonas albilineans GPE PC73]
gi|283472097|emb|CBA14604.1| putative beta-glucosidase protein [Xanthomonas albilineans GPE
PC73]
Length = 914
Score = 308 bits (790), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 169/412 (41%), Positives = 242/412 (58%), Gaps = 33/412 (8%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
QR +DL+ R++L+EKV + + A A+PRLG+ Y+WW+E LHGV+ G GA
Sbjct: 43 QRADDLVARMTLEEKVAQMQNAAPAIPRLGVPAYDWWNEGLHGVARAG----------GA 92
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYWSPNVNIFRD 162
T FPQ I A++F+ L + +SDEARA ++ G GLT+WSPN+NIFRD
Sbjct: 93 TVFPQAIGLAATFDLPLMHEVSTAISDEARAKHHEALRRGEHGRYQGLTFWSPNINIFRD 152
Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVA---------ASCKHFTAYDLD 213
PRWGRGQET GEDP L+ + ++V+G+QG D K A A+ KHF +
Sbjct: 153 PRWGRGQETYGEDPFLTARMGVTFVQGMQGEGADAPKNAQGETYRKLDATAKHFAVH--- 209
Query: 214 NWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKR 273
+ +R HF+A+ S++D+ +T+ F V EGKV +VM +YN++ G A +L+
Sbjct: 210 SGPESERHHFDARPSQRDLYETYLPAFEALVKEGKVDAVMGAYNRLFGESASASKFLLRD 269
Query: 274 TIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAV 333
+R W +GY+VSDC ++ + +T E+AAA A++ G L+CG +AV
Sbjct: 270 VLRERWGFHGYVVSDCWAIVDIWKNHKIVATREQAAALAVKNGTQLECGQEYAT-LPAAV 328
Query: 334 QRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQG 393
Q+GL+ E DI+ AL +T +MRLGMFD P + L +P+H LA AR+
Sbjct: 329 QQGLIGETDIDAALRTLMTARMRLGMFD-PPGQLRWAQLPISVNQSPEHDALARRTARES 387
Query: 394 IVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
+VLLKN G LPLS +H+ +AVIGP +D T+ ++GNY G T LQGI
Sbjct: 388 LVLLKNDG-LLPLSRAKHKRIAVIGPTADDTMALLGNYYGTPATPVTILQGI 438
Score = 127 bits (318), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 93/303 (30%), Positives = 149/303 (49%), Gaps = 51/303 (16%)
Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
A+D +R+AD + V GL +E E + DR L LP Q+EL+ +S K
Sbjct: 628 ALDTARRADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRELLQALSATGK- 686
Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
P + VL +G + + +A+ + AI+ A YPGQ GG+A+AD+LFG +NPGG+LP+T+Y
Sbjct: 687 PVVAVLTTGSALAIDWAQE--HVPAILLAWYPGQRGGSAVADVLFGDTNPGGRLPVTFYK 744
Query: 581 QEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
LP + AMR GRTYR++ G +YPFGHG+SYT F ++
Sbjct: 745 AS--ETLPAFDDYAMR--------GRTYRYFAGTPLYPFGHGLSYTQFAYS--------D 786
Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW 699
+ LD R + + +S +KVT N G++ G + ++ P A
Sbjct: 787 LRLDRRKVAADGQLSA-TLKVT----------------NTGTRAGDEVVQLYLHPLAPTR 829
Query: 700 APH-KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRR-IPLGEHNIHIGGTKHS 757
A K+L F+++ + G + V I L + D + + G++ + +G +
Sbjct: 830 ARAIKELRGFQRIALAPGESRDVHFTISPQTDLRIYDEAQKHYVVDPGDYELQVGASSAD 889
Query: 758 VSL 760
V +
Sbjct: 890 VRV 892
>gi|289666226|ref|ZP_06487807.1| beta-glucosidase precursor [Xanthomonas campestris pv. vasculorum
NCPPB 702]
Length = 902
Score = 308 bits (790), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 174/445 (39%), Positives = 249/445 (55%), Gaps = 36/445 (8%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
P+ QR DL+ R++L+EK + + A A+PRLG+ Y+WW+EALHGV+ G
Sbjct: 34 PYLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVAAYDWWNEALHGVARAG-- 91
Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNG--------GTAGLTY 152
GAT FPQ I A++F+ L + +SDEARA ++ GLT+
Sbjct: 92 --------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHERYQGLTF 143
Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVA---------AS 203
WSPN+NIFRDPRWGRGQET GEDP L+ + ++VRGLQG GD K A A+
Sbjct: 144 WSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVRGLQGEGGDAPKNAQGESYRKLDAT 203
Query: 204 CKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVP 263
KHF + DR HF+A+ S++D+ +T+ F V +GKV +VM +YN+V G
Sbjct: 204 AKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGES 260
Query: 264 TCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGP 323
A +L+ +R +W GY+VSDC ++ + +T E+AAA A++ G +L+CG
Sbjct: 261 ASASKFLLQDLLRQQWGFKGYVVSDCWAIVDIWKHHKIVATREQAAALAVKHGTELECGE 320
Query: 324 FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQ 383
+AV +GL+ E I+ +L +T +MRLGMFD P P+ + +P H
Sbjct: 321 EYST-LPAAVHQGLIEEAQIDTSLQTLMTARMRLGMFD-PPGQLPWSKIPASVNQSPAHD 378
Query: 384 ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQ 443
LA AR+ +VLLKN G LPLS + + +AVIGP +D T+ ++GNY G T LQ
Sbjct: 379 ALARRTARESLVLLKNDG-LLPLSRTKLKRIAVIGPTADDTMALLGNYYGTPAAPVTVLQ 437
Query: 444 GI---GRYARTIHQQGCKDVACADD 465
GI A+ ++ +G V DD
Sbjct: 438 GIRAAAPNAQVLYARGADLVEGRDD 462
Score = 124 bits (312), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 95/304 (31%), Positives = 146/304 (48%), Gaps = 57/304 (18%)
Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
A+D +R A+ + V GL +E E + DR L LP Q+EL+ + K
Sbjct: 629 ALDVARSAEVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRELLEALQATGK- 687
Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
P + VL +G + + +A+ + AI+ A YPGQ GGTA+AD LFG +NPGG+LP+T+Y
Sbjct: 688 PVVAVLTAGSALAIDWAQQ--HVPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK 745
Query: 581 QEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHT--VANAPTV 637
+ LP + AM GRTYR++ G +YPFGHG+SYT F ++ + TV
Sbjct: 746 ES--EALPAFDDYAMH--------GRTYRYFGGTPLYPFGHGLSYTQFAYSDLRLDRNTV 795
Query: 638 VAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP--P 695
A GS AT++ VKN G + G ++ P P
Sbjct: 796 AA------DGSFTATVT---------------------VKNTGQRAGDEVAQLYLHPLTP 828
Query: 696 AGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGT 754
A K+L F++V + G Q+ + I+ + L + D + T + G + + IG +
Sbjct: 829 QRERA-GKELRGFQRVALHPGEQRELSFPINAKEALRIYDEQRKTYTVDPGAYEVQIGAS 887
Query: 755 KHSV 758
+
Sbjct: 888 SADI 891
>gi|256394357|ref|YP_003115921.1| glycoside hydrolase family 3 [Catenulispora acidiphila DSM 44928]
gi|256360583|gb|ACU74080.1| glycoside hydrolase family 3 domain protein [Catenulispora
acidiphila DSM 44928]
Length = 1357
Score = 308 bits (790), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 237/814 (29%), Positives = 379/814 (46%), Gaps = 121/814 (14%)
Query: 35 ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKV-KLLISGAAAVPRLGIKGYEWWSEALHG 93
A + T + S P R DL+ R++L EK +L + A A+PRLG++ Y + +EA HG
Sbjct: 42 AASSTPIYLNTSYPFEARAADLVSRMTLAEKAAQLNTTSAPAIPRLGVQQYTYQAEAQHG 101
Query: 94 VSNVGPGTKFG---GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG- 149
++ +G G G+ P ATSFP ++ S++ L VSDEAR + + G
Sbjct: 102 INYLGGDQNSGSVAGNPPVATSFPTNFASSMSWDPALVYQETTAVSDEARGLVDKSLFGT 161
Query: 150 --------------LTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGS-- 193
LT+W+P VN+ RDPRWGR E GEDP L G+ A ++V G QG+
Sbjct: 162 GQNNLGPSASDYGSLTFWAPTVNLDRDPRWGRTDEAFGEDPYLVGQMAGAFVNGFQGNSM 221
Query: 194 -----DGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGK 248
DG LK AA+ KH+ D++ +R ++ VS D+ D + F +
Sbjct: 222 TGQSLDG-YLKAAATAKHYALNDVEQ----NRTGISSNVSDTDLRDYYTKQFADLIENSH 276
Query: 249 VASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYY------------ 296
VA +M SYN +NG P+ AD + + + NGY+ SDC +VG Y
Sbjct: 277 VAGLMTSYNAINGTPSVADTYTANQLAQRTYGFNGYVTSDCGAVGTAYRNFPAGHAWAPP 336
Query: 297 -------DTQHFTSTPEEAA---------ADAIRAGLDLDCG--PFLGLHTESAVQRGLL 338
DT + A A ++RAG ++CG F + ++A+ G+L
Sbjct: 337 GWTTDGGDTNSIWTNTSTGAKISGAAGGEAYSLRAGTQVNCGGDEFSLQNIQAAISAGIL 396
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAARQGIVLL 397
SE I++ L T++M G FD P+S+ PY + + +P HQ LA A +VLL
Sbjct: 397 SEGVIDSDLTKLFTIRMETGEFD--PASKVPYTSITKAQIQSPAHQALATSVADNSLVLL 454
Query: 398 KN------QGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIAC-------GYTTPLQG 444
KN P LP S + V ++G ++ +G+Y+G G TT ++
Sbjct: 455 KNANVSGTSAPLLPASASKLANVVILGDMANQVT--LGDYSGAPSLQVNAVQGLTTAIKA 512
Query: 445 IGRYARTIHQQGCKDVACADDQLFGAAIDAS-RQADATILVMGLDQSIEAEALDRAGLLL 503
A + +A A+ ++AD ++ +G +Q+ E DR L +
Sbjct: 513 ANPSANILFDAAGTSSTTTSAATLSSATQAAIKKADLVVMFVGTNQNNAQEGNDRTTLNM 572
Query: 504 PGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADI 563
PG L+++ + T LV+ S GP+ ++ + + A++++GY G++ GTA+AD+
Sbjct: 573 PGNYDSLITQTTALGNPKTALVVQSDGPVKISDVQG--SVPAVVFSGYNGESQGTALADV 630
Query: 564 LFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGM 622
L G NP G L TWY + + LP M+ + P + GRTY+++ G YPFG+G+
Sbjct: 631 LLGKQNPSGHLNFTWYADD--SQLPAMSNYGLTPGDTSGL-GRTYQYFTGTPTYPFGYGL 687
Query: 623 SYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRL-TLGVQVDVKNVGS 681
SY+ F ++ A V +A N T+ V V N GS
Sbjct: 688 SYSAFTYSAA--------------------------TVDNASPNADGTVNVSFKVTNSGS 721
Query: 682 KDGAHTLLVFST---PPAGHWAPHKQLVAFEKVHV-PAGAQQRVGINIHVCKYLSVVDRS 737
GA +++ +G P K+LV F+K V GA Q++ I + + LS + +
Sbjct: 722 TAGATVAQLYAATQFTESGVQLPTKRLVGFQKTGVLNPGAAQQITIPVKISD-LSFWNAT 780
Query: 738 GTRRIPL-GEHNIHIGGTKHSV--SLHAATLGVI 768
+ + G + + +G + + S++ A G I
Sbjct: 781 TMKSVVYDGTYALQVGASASDIRTSVNVAVSGAI 814
>gi|392537607|ref|ZP_10284744.1| Beta-glucosidase [Pseudoalteromonas marina mano4]
Length = 870
Score = 308 bits (789), Expect = 7e-81, Method: Compositional matrix adjust.
Identities = 170/426 (39%), Positives = 247/426 (57%), Gaps = 31/426 (7%)
Query: 30 CDPKDATTRTLP-FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWS 88
CD + + + P + S I +RVNDL+ RL+L+EKV L + A+ RL I Y WW+
Sbjct: 21 CDNDEQSMQPPPLYLNESASIDERVNDLVTRLTLEEKVAQLFDKSPAIERLNIPEYNWWN 80
Query: 89 EALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA 148
EALHGV+ G AT FPQ I A++F+ L +G +SDE RA ++ A
Sbjct: 81 EALHGVARAGK----------ATVFPQAIGLAATFDEDLMLRVGTAISDEGRAKHHAFLA 130
Query: 149 --------GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKV 200
GLTYWSPN+NIFRDPRWGRGQET GEDP L+ + A +++ GLQG + + LK
Sbjct: 131 ENNRSMYTGLTYWSPNINIFRDPRWGRGQETYGEDPYLTTRIAVNFINGLQGDNTEYLKS 190
Query: 201 AASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVN 260
A+ KH+ + + V R + SK+D+ +T+ F+ + + KVASVMC+YN VN
Sbjct: 191 VATLKHYAVH---SGPEVSRHSDDYTASKKDLAETYLPAFKDVIAQTKVASVMCAYNSVN 247
Query: 261 GVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYD--TQHFTSTPEEAAADAIRAGLD 318
G P C + +++ +R E+ +GYIVSDC ++ +YD + + +T +AAA A++ G D
Sbjct: 248 GTPACGNDELIQNKLRDEFNFDGYIVSDCGAIADFYDVKSHNIVNTEAKAAAMALKTGTD 307
Query: 319 LDCGPFLG---LHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPK 375
L+CG G + AV+ GL+ E D++ AL + + +LGMFD P + PY
Sbjct: 308 LNCGDHHGNTYSYLSQAVKEGLVEEKDVDKALKRLMYARFKLGMFDN-PENVPYSDTSID 366
Query: 376 DVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIA 435
V + H L EAA++ +VLLKN+ LPL + VA+IGPN+D ++GNY G+
Sbjct: 367 IVGSNKHLALTQEAAKKSLVLLKNE-QVLPLK--GNEKVALIGPNADNEAILLGNYNGMP 423
Query: 436 CGYTTP 441
TP
Sbjct: 424 IVPITP 429
Score = 105 bits (263), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 81/279 (29%), Positives = 131/279 (46%), Gaps = 51/279 (18%)
Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
A++ + +AD + V G+ ++E E + DR + LP Q L+ K+ K
Sbjct: 599 ALNNANEADVIVFVGGISANLEGEEMPLQIDGFSHGDRTNINLPKSQLNLLKKLKQTGK- 657
Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
P +LV MSG + A + I AII YPG+A G+A+ +L+G +P GKLP+T+Y
Sbjct: 658 PIVLVNMSGSAM--ALNWENENIDAIIQGFYPGEAAGSALVSLLYGEYSPSGKLPITFYK 715
Query: 581 QEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
+++LP + +M+ RTY++Y+G V+YPFG G+SY +F +
Sbjct: 716 S--VSDLPDFKDYSMK--------NRTYKYYEGEVLYPFGFGLSYADFKYKNTRH----- 760
Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW 699
+D G +N T + +T+ VQV V S P A
Sbjct: 761 -SIDAGSGDLNLTTT-----ITNQSSFSADDVVQVYV---------------SMPDAPIK 799
Query: 700 APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSG 738
P+KQLV F+ + + ++ + I K LS ++ G
Sbjct: 800 TPNKQLVGFKHITLKNESKNDIKFTIPKNK-LSYINEQG 837
>gi|440731995|ref|ZP_20911965.1| glucan 1,4-beta-glucosidase [Xanthomonas translucens DAR61454]
gi|440370332|gb|ELQ07251.1| glucan 1,4-beta-glucosidase [Xanthomonas translucens DAR61454]
Length = 913
Score = 308 bits (788), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 173/444 (38%), Positives = 248/444 (55%), Gaps = 36/444 (8%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
+ QR DL+ R++L+EK + + A A+PRLG+ Y+WW+EALHGV+ G
Sbjct: 37 YLDTQRSFEQRAADLVARMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 93
Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT--------AGLTYW 153
GAT FPQ I A++F+ L + +SDEARA ++ GLT+W
Sbjct: 94 -------GATVFPQAIGMAATFDVPLMHEVSTAISDEARAKHHEALRHDQHARYQGLTFW 146
Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVA---------ASC 204
SPN+NIFRDPRWGRGQET GEDP L+ + ++V+GLQG D K A A+
Sbjct: 147 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGADAPKNAQGEAYRKLDATA 206
Query: 205 KHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
KHF + + DR HF+A S++D+ +T+ F V EGKV +VM +YN+V G
Sbjct: 207 KHFAVH---SGPEADRHHFDAHPSQRDLYETYLPAFEALVKEGKVDAVMGAYNRVYGESA 263
Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPF 324
A +L+ +R W +GY+VSDC ++ + +T EEAAA A++ G +L+CG
Sbjct: 264 SASKFLLRDVLRDRWGFDGYVVSDCWAIVDIWKNHKIVATREEAAALAVKHGTELECGAE 323
Query: 325 LGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQE 384
SAV++GL+SE D++ AL + +MRLGMFD P + + +P+H
Sbjct: 324 YST-LPSAVRKGLISEADVDKALQKLMYSRMRLGMFD-PPEKLAWAQIPLSANQSPEHDA 381
Query: 385 LALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG 444
LA AR+ +VLLKN G LPLS + + +AV+GP +D T+ ++GNY G T LQG
Sbjct: 382 LARRTARESLVLLKNDG-VLPLSRAKIKRIAVVGPTADDTMALLGNYYGTPAAPVTVLQG 440
Query: 445 IGRY---ARTIHQQGCKDVACADD 465
I A ++ +G V DD
Sbjct: 441 IREAAPDAEVLYARGADLVEGRDD 464
Score = 123 bits (308), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 94/305 (30%), Positives = 144/305 (47%), Gaps = 53/305 (17%)
Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
A+DA+R+AD + V GL +E E + DR L LP Q+ L+ + K
Sbjct: 631 ALDAARRADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRALLEALHGTGK- 689
Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
P + VL +G + + +A+ + AI+ A YPGQ GG+A+AD+LFG +NPGG+LP+T+Y
Sbjct: 690 PVVAVLTTGSALAIDWAQQ--HVPAILLAWYPGQRGGSAVADVLFGDANPGGRLPVTFY- 746
Query: 581 QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
+E T + AMR GRTYR++ G +YPFGHG+SYT F ++
Sbjct: 747 KESETLPAFDDYAMR--------GRTYRYFAGTPLYPFGHGLSYTQFAYS---------- 788
Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP--PAGH 698
D R G+ L + VKN G + G + ++ P P
Sbjct: 789 --DLRLDRSKLAADGR-------------LHATLKVKNTGQRAGDEVVQLYLQPLSPQRE 833
Query: 699 WAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD--RSGTRRIPLGEHNIHIGGTKH 756
A K L F+++ + G + V I L + D R G P G++ + +G +
Sbjct: 834 RA-SKDLRGFQRIALQPGETREVRFAISPQSDLRLYDEARKGYVVDP-GDYELQVGASSS 891
Query: 757 SVSLH 761
V +
Sbjct: 892 DVRVR 896
>gi|116181370|ref|XP_001220534.1| hypothetical protein CHGG_01313 [Chaetomium globosum CBS 148.51]
gi|88185610|gb|EAQ93078.1| hypothetical protein CHGG_01313 [Chaetomium globosum CBS 148.51]
Length = 549
Score = 308 bits (788), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 191/500 (38%), Positives = 275/500 (55%), Gaps = 25/500 (5%)
Query: 43 CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
C P+R L+ L+++EK++ L+ + RLG+ Y WWSEALHGV+ PG +
Sbjct: 39 CDPKATPPERAAALVKALNIEEKLQNLVDMSKGAERLGLPAYAWWSEALHGVA-ASPGVR 97
Query: 103 F----GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVN 158
F GG F ATSF IT +++F+ L + +S EARA N G AGL YW+PN+N
Sbjct: 98 FNRTAGGRFSSATSFANSITLSAAFDDELVYKVADTISTEARAFANAGLAGLDYWTPNIN 157
Query: 159 IFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGV 218
++DPRWGRG ETPGEDPV Y + + GL+G D KV A+CKH+ AYDL+ W G
Sbjct: 158 PYKDPRWGRGHETPGEDPVRIKGYVKALLAGLEGDDPSIRKVVATCKHYAAYDLERWQGT 217
Query: 219 DRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGE 278
R F+A VS QD+ + + PF+ C + KV S MCSYN +NG P CA ++ +R
Sbjct: 218 TRHRFDAVVSLQDLSEYYLPPFQQCARDSKVGSFMCSYNALNGTPACASTYLMDDILRKH 277
Query: 279 W---RLNGYIVSDCDSVGVYY---DTQHFTSTPEE--AAADAIRAGLDLDCG-PFLGLHT 329
W N YI SDC+++ + +F+ST E AAA A +AG D C P +T
Sbjct: 278 WGWTEHNNYITSDCNAIQDFLPGPKWHNFSSTQTEAEAAAVAYQAGTDTVCEVPGWPPYT 337
Query: 330 E--SAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELAL 387
+ A + LLSE I+ AL +R+G FD S PY +G +DV TP+ QELAL
Sbjct: 338 DVIGAYNQTLLSEEVIDTALKRLYEGLVRVGYFD-PASGSPYRSIGWEDVNTPEAQELAL 396
Query: 388 EAARQGIVLLKNQGPSLPLSHIRHRTVAVIG--PNSDVTVTMIGNYAGIACGYTTPLQGI 445
++ G+VLLKN G +LPL ++ +TVA+IG NS ++G Y+G +P+
Sbjct: 397 QSGTDGLVLLKNDG-TLPL-NLEDKTVALIGFWANSTNGGRILGGYSGFPPYIHSPVDAA 454
Query: 446 GRYARTIHQQG---CKDVA-CADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGL 501
+ T H +++ A D A++ +++++ + G D SI AE LDR +
Sbjct: 455 EKLNLTYHYASGPLAENITQAAIDDWVAKALEPAKKSNVILYFGGTDTSIAAEDLDRDSI 514
Query: 502 LLPGRQQELVSKVSMASKGP 521
P Q ++ +S + P
Sbjct: 515 AWPEIQLAVIEALSALRQAP 534
>gi|386819249|ref|ZP_10106465.1| beta-glucosidase-like glycosyl hydrolase [Joostella marina DSM
19592]
gi|386424355|gb|EIJ38185.1| beta-glucosidase-like glycosyl hydrolase [Joostella marina DSM
19592]
Length = 878
Score = 307 bits (787), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 172/421 (40%), Positives = 245/421 (58%), Gaps = 29/421 (6%)
Query: 35 ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
+ + PF LP +RVNDLI RL++ EK+ L+ + A+ RLGI Y WW+E+LHGV
Sbjct: 19 SQSEKYPFQNTELPEDERVNDLINRLTVDEKIAQLLYQSPAIERLGIPAYNWWNESLHGV 78
Query: 95 SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT------- 147
+ G AT FPQ IT A+S++ L + V+SDEARA ++
Sbjct: 79 ARAG----------YATVFPQSITIAASWDDELVAEVANVISDEARAKHHEYLRRGQHDI 128
Query: 148 -AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKH 206
GLT+WSPN+NIFRDPRWGRG ET GEDP L+G YV+GLQG++ LKV A+ KH
Sbjct: 129 YQGLTFWSPNINIFRDPRWGRGHETYGEDPYLTGVLGTEYVKGLQGNNAKYLKVVATAKH 188
Query: 207 FTAYDLDNWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
F + +G + R F+ S++D+ +T+ FR V +G V S+M +YN++ G
Sbjct: 189 FAVH-----SGPEPLRHEFDVAPSQRDLWETYLPAFRTLVKDGNVYSIMTAYNRIYGEAA 243
Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPF 324
A N L +R +W NGY+VSDC ++ + T H EA+A A++ G DL+CG
Sbjct: 244 SAS-NSLYSILRDKWGFNGYVVSDCGAIADMWKTHHVAKDAAEASAMAVKEGCDLNCGNS 302
Query: 325 LGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQE 384
T+ A+Q GL++E D++ AL + + +LGMFD + PY + P H+
Sbjct: 303 YEKLTD-ALQDGLITEADLDVALHRLMRARFKLGMFDSD-EKVPYAKIPFSVNNNPKHKV 360
Query: 385 LALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG 444
LAL+AA++ IVLLKN+ LPLS + +AVIGPN+D ++ GNY G+ T L+G
Sbjct: 361 LALKAAQKSIVLLKNENAILPLSK-NLKNIAVIGPNADNIQSLWGNYNGMPKNPVTVLEG 419
Query: 445 I 445
I
Sbjct: 420 I 420
Score = 143 bits (360), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 94/296 (31%), Positives = 157/296 (53%), Gaps = 50/296 (16%)
Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
A+ A+ ++D +L +GL++ +E E + DR L LP +Q EL+ +V +A+
Sbjct: 592 AVLAANKSDVVVLALGLNERLEGEEMKVEVEGFADGDRTSLNLPKKQVELMKEV-VATGK 650
Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
P +LVL++G + + +A + I AII AGYPGQ GG AIA++LFG NP G+LP+T+Y
Sbjct: 651 PVVLVLLNGSALSINWASEN--IPAIISAGYPGQEGGNAIANVLFGDYNPAGRLPVTYYK 708
Query: 581 QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
+ +LP P + GRTY+++K +YPFG+G+SYT F + + + +
Sbjct: 709 S--VDDLP-------PFEDYNMDGRTYKYFKKEPLYPFGYGLSYTKFKY------SNLEI 753
Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW- 699
PL+ IK+ + V V V N G DG + ++ G
Sbjct: 754 PLE--------------IKINEP------IKVSVQVANEGDFDGDEVVQLYVRDEEGSTP 793
Query: 700 APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTK 755
P +LV F+++H+ GA+Q+V I + L+++++ I G +I +GG++
Sbjct: 794 RPICELVGFKRIHLKKGARQKVEFTIQ-PRELAMINKDDKFVIEPGWFSISVGGSQ 848
>gi|320105647|ref|YP_004181237.1| glycoside hydrolase family protein [Terriglobus saanensis SP1PR4]
gi|319924168|gb|ADV81243.1| glycoside hydrolase family 3 domain protein [Terriglobus saanensis
SP1PR4]
Length = 885
Score = 306 bits (785), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 185/466 (39%), Positives = 265/466 (56%), Gaps = 46/466 (9%)
Query: 9 FLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKL 68
L L + SGLA ++P DP +L P R DL+ R++L+EK
Sbjct: 11 LLSLTFAAFGQSGLA-QKPAYLDP-------------TLSPPARARDLVHRMTLEEKTAQ 56
Query: 69 LISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLW 128
+I+ A A+ RLG+ Y++WSE LHGV+ G AT FPQ I A++++ L
Sbjct: 57 MINTAPAIDRLGVPAYDFWSEGLHGVARSG----------YATLFPQAIGMAATWDEPLM 106
Query: 129 EAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSG 180
IG VVS EARA YN GLT WSPN+NIFRDPRWGRGQET GEDP L+
Sbjct: 107 HEIGTVVSTEARAKYNDAVQHGVHSIYFGLTIWSPNINIFRDPRWGRGQETYGEDPFLTA 166
Query: 181 KYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPF 240
+ ++VRG+QG D + + A+ KHF + R FN VS+ D+ DT+ F
Sbjct: 167 RMGTAFVRGIQGDDPNYFRTIATPKHFAVHSGPE---STRHTFNVDVSQHDLWDTYLPAF 223
Query: 241 RMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDT-- 298
R ++EGK S+MC+YN+++G P CA +LK+ +RG+W G++ SDC ++ +Y
Sbjct: 224 RSTIIEGKADSIMCAYNRIDGQPACASDLLLKQILRGDWGFRGFVTSDCGAIDDFYTKIG 283
Query: 299 QHFTSTPEEAAADAIRAGLDLDCGP-FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRL 357
HF+ E+A+A ++AG D CG +LGL SAV+ GL++E +++ +L ++RL
Sbjct: 284 HHFSKEKEDASAAGVKAGTDTACGKTYLGL--TSAVKSGLITEHEMDISLERLFEARIRL 341
Query: 358 GMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVI 417
G+FD +P+ PY L +V +P H+ LAL AAR+ IVLLKN LPL +++ +AVI
Sbjct: 342 GLFD-DPARMPYARLTMAEVNSPAHRALALRAARESIVLLKNANNLLPLHGVKN--IAVI 398
Query: 418 GPNSDVTVTMIGNYAGIACGYTTPLQGIGRY---ARTIHQQGCKDV 460
GPN+ + GNY IA P+ GI A+ ++ QG V
Sbjct: 399 GPNAASLDALEGNYNAIARDPAMPVDGIAAAFPGAKVVYAQGAPYV 444
Score = 126 bits (317), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 94/295 (31%), Positives = 135/295 (45%), Gaps = 50/295 (16%)
Query: 479 DATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMS 528
D + +GL +E E + DR + LP Q EL+ V K P I+VLM+
Sbjct: 620 DVVVAFVGLSPELEGEEMPIKVKGFAGGDRTDIELPQTQLELLRAVKATGK-PLIVVLMN 678
Query: 529 GGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLP 588
G A A D A++ A YPG+AG AIA+ L G +NP G+LP+T+Y I LP
Sbjct: 679 GS----AIALKDSETDALLEAWYPGEAGAQAIAETLAGKNNPSGRLPLTFYSN--IDQLP 732
Query: 589 MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGS 648
+ RTYR++KG +Y FG G+SYT F R+G
Sbjct: 733 AFD-------DYSMANRTYRYFKGQPLYAFGGGLSYTTF-----------------RYGK 768
Query: 649 INATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAF 708
+ ++ TH L V+ +V N G G V+ TPP AP LV +
Sbjct: 769 V-------SLSATHLHAGE-DLTVEAEVTNTGKVAGDEVAQVYLTPPQTSIAPRFALVGY 820
Query: 709 EKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAA 763
++VH+ G + + +H + LS VD G R G + I +GG+ + +L AA
Sbjct: 821 QRVHLLPGQSKPMRFTLH-PRELSQVDAQGVRAASAGHYEIKVGGSSNVTTLSAA 874
>gi|389737578|ref|ZP_10190998.1| beta-glucosidase [Rhodanobacter sp. 115]
gi|388434298|gb|EIL91245.1| beta-glucosidase [Rhodanobacter sp. 115]
Length = 898
Score = 306 bits (784), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 181/453 (39%), Positives = 252/453 (55%), Gaps = 29/453 (6%)
Query: 35 ATTRTLP-FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHG 93
A +T P + + +R DL+ R++L EKV + + A A+PRLG+ Y+WW+EALHG
Sbjct: 35 AAKQTQPLYLDTAHSFQERAADLVSRMTLAEKVAQMQNSAPAIPRLGVPAYDWWNEALHG 94
Query: 94 VSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------G 145
V+ G AT FPQ I A++F+ L +SDEARA YN G
Sbjct: 95 VARAGE----------ATVFPQAIGLAATFDPALLHHEATAISDEARAKYNDFQRRGMRG 144
Query: 146 GTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCK 205
GLT+WSPN NIFRDPRWGRGQET GEDP L+ + ++VRGL+G D K+ A+ K
Sbjct: 145 RYEGLTFWSPNTNIFRDPRWGRGQETYGEDPYLTSRMGVAFVRGLEGDDPTYQKLDATAK 204
Query: 206 HFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTC 265
HF + + +R F+ S++D+ +T+ F+ V +G V +VM +YN+V+GVP
Sbjct: 205 HFAVH---SGPESERHRFDVHPSERDLHETYLPAFQALVQQGGVDAVMGAYNRVDGVPAT 261
Query: 266 ADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFL 325
A +L+ +R +W GY+VSDCD+V Y T E+AAA A+ G DL+CG
Sbjct: 262 ASHRLLQDILRRDWGFKGYVVSDCDAVADIYQFHKVVPTAEQAAALAVNNGDDLNCGTTY 321
Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
+ AV GL++E I+ A+ + + RLGMFD P P+ L V +P H L
Sbjct: 322 ATLVK-AVHDGLVNEHTIDTAVTRLMLARFRLGMFD-PPGRVPWSTLPMSVVQSPQHDAL 379
Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
AL A++ +VLLKN G LPLSH R +AVIGP +D ++GNY G T LQGI
Sbjct: 380 ALRTAQESMVLLKNDG-LLPLSH-NVRRIAVIGPTADNVTALLGNYHGTPKAPVTILQGI 437
Query: 446 GRY---ARTIHQQGCKDVACADDQLFGAAIDAS 475
A+ + QG + V D G+ I A+
Sbjct: 438 REAVPNAQVTYVQGTELVQGFDGPDVGSVIGAA 470
Score = 121 bits (303), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 90/307 (29%), Positives = 145/307 (47%), Gaps = 53/307 (17%)
Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
F AA+DA+R AD I GL +E E + DR L LP Q++L+ + +
Sbjct: 623 FEAALDAARHADVVIFAGGLSSDLEGEEMPVDYPGFAGGDRTTLALPATQRKLLQALQVT 682
Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
K P +LVL +G + + +AK + AI+ A YPGQ GG A+AD LFG +P G+LP+T
Sbjct: 683 GK-PVVLVLTTGSALAIDWAKQ--HLPAILLAWYPGQDGGHAVADALFGNVDPAGRLPVT 739
Query: 578 WYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTV 637
+Y P + AM+ GRTYR++ G ++PFG G+SYT F ++
Sbjct: 740 FYKSARQLP-PFDDYAMK--------GRTYRYFTGQPLFPFGFGLSYTRFAYS------- 783
Query: 638 VAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP-PA 696
+ LD + T +R+ + ++ VKN G + G + ++ P A
Sbjct: 784 -DLQLD---------------RDTLGPSDRMRISLR--VKNTGQRAGDEVVQLYLRPLRA 825
Query: 697 GHWAPHKQLVAFEKVHVPAGAQQRVGINIHV---CKYLSVVDRSGTRRIPLGEHNIHIGG 753
H K L F+++ + G ++ V +I KY V + + G + + +G
Sbjct: 826 PHARAIKSLRGFQRISLKPGEERSVSFDISPQTDLKYYDVAHHAYA--VAPGRYQVQVGA 883
Query: 754 TKHSVSL 760
+ + L
Sbjct: 884 SSADIRL 890
>gi|365118446|ref|ZP_09337032.1| hypothetical protein HMPREF1033_00378 [Tannerella sp.
6_1_58FAA_CT1]
gi|363649697|gb|EHL88801.1| hypothetical protein HMPREF1033_00378 [Tannerella sp.
6_1_58FAA_CT1]
Length = 1283
Score = 306 bits (783), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 241/767 (31%), Positives = 372/767 (48%), Gaps = 120/767 (15%)
Query: 30 CDPKDATTRTLPFCQVS--------------LPIPQRVNDLIGRLSLQEKVKLLIS--GA 73
DP++A R++ VS +PI +R++DL+ RL+L+EKV L G+
Sbjct: 46 MDPEEAYNRSMANAGVSQQVSKETPKYLNPNIPIEERIDDLLPRLTLEEKVIQLSDSWGS 105
Query: 74 AAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGR 133
+ RL I +E LHG S G+T FP I S+F+ L + +G+
Sbjct: 106 KGIARLKIPAM-LKTEGLHGQSYA----------TGSTIFPHGINMGSTFDTELIQEVGK 154
Query: 134 VVSDEARAMYNGGTAGL-TYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG 192
+ EA+A A L WSP +++ RD RWGR +ET GEDP L G+ ++++G QG
Sbjct: 155 ATAIEAKA------ANLRVSWSPVLDVARDARWGRVEETYGEDPYLVGRIGVAWIKGFQG 208
Query: 193 SDGDRLKVAASCKHFTAY-------DLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVM 245
+ A KHF + D ++ DR N ++ PFR +
Sbjct: 209 E-----HMFACPKHFAGHGQPVGGRDSHDYGLSDRVMRNIHLA----------PFRDVIK 253
Query: 246 EGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP 305
E VM +Y NGVP +L++ +R EW G++VSDC Q T
Sbjct: 254 EANAFGVMAAYGLWNGVPDNGSKELLQKILREEWGFEGFVVSDCSGPENIQRKQSVVGTM 313
Query: 306 EEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPS 365
EEAAA A+RAG+D++CG SAV++G++ E +++ L +MRLG+FD PS
Sbjct: 314 EEAAAMAVRAGVDIECGSAYKKALASAVKKGIIKESELDANLRRVFRAKMRLGLFD-RPS 372
Query: 366 SQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTV 425
+ + TP+H+ LA + A + VLLKN+ LPL +T+AVIGPN+D
Sbjct: 373 IENMVWNKLPEYDTPEHRALARKVAVKSTVLLKNENNLLPLDK-NIKTIAVIGPNADQGQ 431
Query: 426 TMIGNY-AGIACG-YTTPLQGIGRY----ARTIHQQGCKDVACADDQLFGAAIDASRQAD 479
T G+Y A A G + L+G+ + + ++ QGC + D F A++ ++QAD
Sbjct: 432 T--GDYSAKYAPGQIISVLEGVKNHVSPSTKVLYAQGCTQLDM-DTTGFAEAVNIAKQAD 488
Query: 480 ATILVMGLD---------QSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGG 530
A ILV+G + +S E +D A L +PG Q++L+ V K P +LVL++G
Sbjct: 489 AVILVVGDNSNRHENGNKKSTTGENVDGATLEIPGVQRQLIKAVEATGK-PVVLVLVNGK 547
Query: 531 PIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMT 590
P + + D I +I+ YPG+ GG A ADI+FG NP G+LP++ +P+ + LP+
Sbjct: 548 PFTLTW--EDENIESILETWYPGEEGGNATADIIFGDENPSGRLPIS-FPR-HPGQLPLW 603
Query: 591 EMAMRPSQSKRYPGRTYRFYKGPV--VYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGS 648
+ GR Y +Y P +Y FGHG+SYT F R+ +
Sbjct: 604 -------YNYETSGRNYDYYDMPFTPLYRFGHGLSYTTF-----------------RYSN 639
Query: 649 INATI-SGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP-PAGHWAPHKQLV 706
+ AT SG VT V VD++N G + G ++ T A L
Sbjct: 640 LKATTKSGDPGFVT----------VSVDIENTGKRPGEEVAQLYITDLVASVNTAVIDLK 689
Query: 707 AFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
F++V + G ++ V ++ LS+++ R + G+ +H+GG
Sbjct: 690 GFKRVFLKPGEKKTVTFELNPY-LLSLLNPDMKRVLEAGKFRMHVGG 735
>gi|427386425|ref|ZP_18882622.1| hypothetical protein HMPREF9447_03655 [Bacteroides oleiciplenus YIT
12058]
gi|425726465|gb|EKU89330.1| hypothetical protein HMPREF9447_03655 [Bacteroides oleiciplenus YIT
12058]
Length = 864
Score = 306 bits (783), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 175/445 (39%), Positives = 252/445 (56%), Gaps = 30/445 (6%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
F LP+ +RVNDL+G L+L+EK+ +++ A A+ RLGI Y WW+E LHGV+
Sbjct: 26 FQNPDLPVEERVNDLVGHLTLEEKISQMMNNAPAIERLGIPAYNWWNECLHGVAR----- 80
Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYW 153
+P TSFPQ I A++++ + SDE RA+Y+ G GLTYW
Sbjct: 81 ---SPYP-VTSFPQAIAMAATWDTKSVYQMAEYASDEGRAIYHDAARKGTPGIFRGLTYW 136
Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLD 213
SPN+NIFRDPRWGRGQET GEDP L+ ++V+GLQG D LK +A KH+ +
Sbjct: 137 SPNINIFRDPRWGRGQETYGEDPYLTAAIGVAFVKGLQGDDPVYLKSSACAKHYAVHSGP 196
Query: 214 NWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKR 273
WN R +NA+VS D+ DT+ FR V++ KV VMC+YN P C + ++
Sbjct: 197 EWN---RHTYNAEVSNHDLWDTYLPAFRELVVDAKVTGVMCAYNSFFEQPCCGNDLLMMD 253
Query: 274 TIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAV 333
+R +W+ +GY+ SDC ++ +Y+T + EA+ADA+ G D +CG A+
Sbjct: 254 ILRNQWKFDGYVTSDCGAIEDFYNTHNTHEDAAEASADAVLHGTDCECGNGAYRALADAI 313
Query: 334 QRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-HQELALEAARQ 392
RGL++E ++ +L ++ RLGMFD + PY + P V D H+ AL+ ARQ
Sbjct: 314 VRGLITEEQVDVSLKKLFEIRFRLGMFDPD-DRVPYSDI-PISVLECDAHKAHALKMARQ 371
Query: 393 GIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRY 448
IVLLKN+ LPL + + +AV+GPN+D ++ NY G TT L+GI G
Sbjct: 372 SIVLLKNEKQLLPLDMNKIKKIAVVGPNADDKSVLLANYYGYPSCVTTVLEGIKGKVGDQ 431
Query: 449 ARTIHQQGCKDVACADDQLFGAAID 473
I+++G V DD +F +A D
Sbjct: 432 VEVIYEKG---VNLTDDFIFTSAYD 453
Score = 127 bits (320), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 90/293 (30%), Positives = 140/293 (47%), Gaps = 52/293 (17%)
Query: 476 RQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILV 525
+ AD + V GL +E E + DR + +P QQ L+ ++ K P I +
Sbjct: 597 KDADVVVFVGGLSAKVEGEEMKVEIDGFKRGDRTSISIPVVQQNLLKELYATGK-PVIFI 655
Query: 526 LMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYIT 585
LM+G + + + + AI+ A Y GQAGG AIAD+LFG NP G+LP+T+Y +
Sbjct: 656 LMTGSAVGLEWESE--HLPAILNAWYGGQAGGQAIADVLFGDYNPSGRLPLTFYKN--VN 711
Query: 586 NLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDG 644
+LP + +M+ RTYR++ G VYPFG+G+SYT+F +
Sbjct: 712 DLPDFEDYSMK--------NRTYRYFTGIPVYPFGYGLSYTDFQYN-------------- 749
Query: 645 RHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQ 704
IKV ++L++ V +V NVG +G + ++ + P P +
Sbjct: 750 ------------TIKV-QPSLDKLSVKVTAEVSNVGKYEGEEVVQLYVSNPRDFVTPIRA 796
Query: 705 LVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHS 757
L F ++++ G Q V + K LSVVD +G GE I +GG + S
Sbjct: 797 LKGFRRINLKPGESQMVEF-VLTSKELSVVDVAGNFVPMKGEVQISLGGGQPS 848
>gi|300785890|ref|YP_003766181.1| beta-glucosidase [Amycolatopsis mediterranei U32]
gi|384149201|ref|YP_005532017.1| beta-glucosidase [Amycolatopsis mediterranei S699]
gi|399537773|ref|YP_006550435.1| beta-glucosidase [Amycolatopsis mediterranei S699]
gi|299795404|gb|ADJ45779.1| beta-glucosidase [Amycolatopsis mediterranei U32]
gi|340527355|gb|AEK42560.1| beta-glucosidase [Amycolatopsis mediterranei S699]
gi|398318543|gb|AFO77490.1| beta-glucosidase [Amycolatopsis mediterranei S699]
Length = 1218
Score = 306 bits (783), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 238/785 (30%), Positives = 356/785 (45%), Gaps = 122/785 (15%)
Query: 51 QRVNDLIGRLSLQEKV-KLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPG 109
+R DL+ R++L EKV +L + A A+PRLG++ Y +WSE HG++ +G T G G
Sbjct: 54 ERAADLVARMTLPEKVLQLRTNSAPAIPRLGVQQYTYWSEGQHGLNTLGANTNDGTVTGG 113
Query: 110 --ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG---------------LTY 152
ATSFP + + S++ L + +SDEAR M + G LTY
Sbjct: 114 VHATSFPTNLASTMSWDPELIQQETTAISDEARGMLDKSLWGVAQNNIGPDKNNYGSLTY 173
Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD------RLKVAASCKH 206
W+P VN+ RDPRWGR E GEDP L K A ++V G QG LKVAA+ KH
Sbjct: 174 WAPTVNLDRDPRWGRTDEGFGEDPYLVAKMAGAFVNGYQGQTASGRPATPYLKVAATAKH 233
Query: 207 FTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCA 266
+ +++N DR ++ ++ ++ D + FR + + V+ +M SYN +NG P+ +
Sbjct: 234 YALNNVEN----DRHADSSDTTEANLRDYYTKQFRNLIQDAHVSGLMTSYNAINGTPSPS 289
Query: 267 DPNILKRTIRGEWRLNGYIVSDCDSVG-VYYDTQHFTSTPEEAAAD-------------- 311
D + + +GY SDC +VG VY H + P A
Sbjct: 290 DTYTANAIAQRTYGFDGYTTSDCGAVGDVYAPGSHNWAPPGWTTATSNGGTQWTNTATGQ 349
Query: 312 -----------AIRAGLDLDCGPFLGL--HTESAVQRGLLSEIDINNALVNTLTVQMRLG 358
A+RAG L+C + + A++ G+LSE ++NALV+ T +M+ G
Sbjct: 350 QVAGAAGGQAYALRAGTQLNCTGTEATVANIQEAIKAGVLSEGVLDNALVHVFTTRMQTG 409
Query: 359 MFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQ------GPSLPLSHIRHR 412
FD P Y + + +P+HQ LA + A +VLLKN P LP +
Sbjct: 410 EFD-PPDRVAYTKITKDVIQSPEHQALAAKVAAHSLVLLKNDPVPGTAAPLLPADPAKLG 468
Query: 413 TVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYAR--------TIHQQGCKDVACAD 464
TV V+G + +G Y+G +QGI + T G
Sbjct: 469 TVVVVGDLAGKVT--LGGYSGEPALQVNAVQGITSAVKAANPAATVTFDACGTSTATTTA 526
Query: 465 DQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTIL 524
+ A + AD ++ G D ++ E DR + +PG L+ +V A T L
Sbjct: 527 ASCSAETLAALKTADLVVVFAGTDGNVATEGRDRTTIAMPGNYDSLIDQVKAAGNPRTAL 586
Query: 525 VLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYI 584
+ +GG + + A P I+++GY G++ GTA+AD+LFG NP G L TWY +
Sbjct: 587 AVQAGGAVSLGHAAGIP---GIVFSGYNGESQGTALADVLFGKQNPSGHLNFTWYADD-- 641
Query: 585 TNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLD 643
+ LP + + PSQ+ GRTY+++ G YPFG+G+SYT F ++ +A T A
Sbjct: 642 SQLPAIKNYGLTPSQTGGL-GRTYQYFTGTPAYPFGYGLSYTKFAYSRVHADTWAA---- 696
Query: 644 GRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGA-----HTLLVFSTPPAGH 698
N ++ V VDV N GS GA + F P G
Sbjct: 697 ----DANGQVT-----------------VHVDVTNTGSTPGATVAQLYAATAFGVP--GV 733
Query: 699 WAPHKQLVAFEKVHVPA-GAQQRVGINIHV--------CKYLSVVDRSGTRRIPLGEHNI 749
P ++L F K V A G Q + I + + K VV G R +G +
Sbjct: 734 ELPRQRLAGFAKTDVLAPGRTQHLAIPVRIGDLAFWDEGKRREVV-HPGAYRFGVGADSA 792
Query: 750 HIGGT 754
H+ GT
Sbjct: 793 HLAGT 797
>gi|294627323|ref|ZP_06705909.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 11122]
gi|292598405|gb|EFF42556.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 11122]
Length = 886
Score = 306 bits (783), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 173/407 (42%), Positives = 237/407 (58%), Gaps = 32/407 (7%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
QR L+ ++S +EKV ++ A A+PRLGI YEWWSE LHG++ G A
Sbjct: 36 QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------A 85
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--GGT-------AGLTYWSPNVNIFR 161
T FPQ I A+S+N +L + +G VVS EARA +N GG AGLT WSPN+NIFR
Sbjct: 86 TVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSPNINIFR 145
Query: 162 DPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD-- 219
DPRWGRG ET GEDP L+G+ A ++RGLQG D D + A+ KH + +G +
Sbjct: 146 DPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLDHPRTIATPKHIAVH-----SGPEPG 200
Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
R F+ VS D+E T+ FR ++EG+ SVMC+YN ++G P CA +L +RG+W
Sbjct: 201 RHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACAADWLLNGRVRGDW 260
Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLL 338
G++VSDCD+V +F ++A A++AG DL+CG + L T A++RG +
Sbjct: 261 GFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAYRDLGT--AIERGDV 318
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
E ++ +LV + RLG + P PY LG KDV H+ LAL+AA + IVLLK
Sbjct: 319 DEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRALALQAAAESIVLLK 377
Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
N +LPL +AVIGPN+D + NY G + TPL G+
Sbjct: 378 NDANTLPLKA--GTRLAVIGPNADALAALEANYQGTSSAPVTPLLGL 422
Score = 136 bits (343), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 94/279 (33%), Positives = 143/279 (51%), Gaps = 48/279 (17%)
Query: 485 MGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDV 534
+GL +E E L DR + LP QQ L+ + + K P ++VLMSG + +
Sbjct: 622 VGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLERAKASGK-PLVVVLMSGSAVAL 680
Query: 535 AFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM 594
+AK AI+ A YPGQ+GGTAIA +L G NPGG+LP+T+Y +LP
Sbjct: 681 NWAKT--HADAIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFY--RSTKDLPAY---- 732
Query: 595 RPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATIS 654
S GRTYR++KG ++PFG+G+SYT F + +AP + L +
Sbjct: 733 ---VSYDMKGRTYRYFKGEPLFPFGYGLSYTRFAY---DAPQLSTTTLQ----------A 776
Query: 655 GKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVP 714
G ++VT V+N G++ G V+ P +P + LV F++VH+
Sbjct: 777 GNPLQVT------------TTVRNTGARAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLA 824
Query: 715 AGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
AG Q+ + ++ + LS VDRSG R + G++ + +GG
Sbjct: 825 AGEQRTLTFHLD-ARALSDVDRSGQRAVEAGDYTLFVGG 862
>gi|294665226|ref|ZP_06730524.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
gi|292605014|gb|EFF48367.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
Length = 886
Score = 305 bits (782), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 173/407 (42%), Positives = 237/407 (58%), Gaps = 32/407 (7%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
QR L+ ++S +EKV ++ A A+PRLGI YEWWSE LHG++ G A
Sbjct: 36 QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------A 85
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--GGT-------AGLTYWSPNVNIFR 161
T FPQ I A+S+N +L + +G VVS EARA +N GG AGLT WSPN+NIFR
Sbjct: 86 TVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSPNINIFR 145
Query: 162 DPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD-- 219
DPRWGRG ET GEDP L+G+ A ++RGLQG D D + A+ KH + +G +
Sbjct: 146 DPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLDHPRTIATPKHIAVH-----SGPEPG 200
Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
R F+ VS D+E T+ FR ++EG+ SVMC+YN ++G P CA +L +RG+W
Sbjct: 201 RHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACAADWLLNGRVRGDW 260
Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLL 338
G++VSDCD+V +F ++A A++AG DL+CG + L T A++RG +
Sbjct: 261 GFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAYRDLGT--AIERGDV 318
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
E ++ +LV + RLG + P PY LG KDV H+ LAL+AA + IVLLK
Sbjct: 319 DEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRALALQAAAESIVLLK 377
Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
N +LPL +AVIGPN+D + NY G + TPL G+
Sbjct: 378 NDANTLPLKA--GTRLAVIGPNADALAALEANYQGTSSAPVTPLLGL 422
Score = 135 bits (341), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 93/279 (33%), Positives = 143/279 (51%), Gaps = 48/279 (17%)
Query: 485 MGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDV 534
+GL +E E L DR + LP QQ L+ + + K P ++VLMSG + +
Sbjct: 622 VGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLERAKASGK-PLVVVLMSGSAVAL 680
Query: 535 AFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM 594
+AK AI+ A YPGQ+GGTA+A +L G NPGG+LP+T+Y +LP
Sbjct: 681 NWAKT--HADAIVAAWYPGQSGGTAMARMLAGDDNPGGRLPVTFY--RSTKDLPAY---- 732
Query: 595 RPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATIS 654
S GRTYR++KG ++PFG+G+SYT F + +AP + L +
Sbjct: 733 ---VSYDMKGRTYRYFKGEPLFPFGYGLSYTRFAY---DAPQLSTTTLQ----------A 776
Query: 655 GKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVP 714
G ++VT V+N G++ G V+ P +P + LV F++VH+
Sbjct: 777 GNPLQVT------------TTVRNTGARAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLA 824
Query: 715 AGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
AG Q+ + ++ + LS VDRSG R + G++ + +GG
Sbjct: 825 AGEQRTLTFHLD-ARALSDVDRSGQRAVEAGDYTLFVGG 862
>gi|182415162|ref|YP_001820228.1| glycoside hydrolase family 3 [Opitutus terrae PB90-1]
gi|177842376|gb|ACB76628.1| glycoside hydrolase family 3 domain protein [Opitutus terrae
PB90-1]
Length = 747
Score = 305 bits (782), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 229/716 (31%), Positives = 364/716 (50%), Gaps = 64/716 (8%)
Query: 40 LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGP 99
LPF LP QR++DLIGR++L+EK+ + + AAVPRLG+KG E HGV+ GP
Sbjct: 32 LPFQDPELPAEQRIDDLIGRMTLEEKIDCM-AMRAAVPRLGVKGSRH-IEGYHGVAQGGP 89
Query: 100 GTKFGGDFPGATS-FPQVITTASSFNATLWEAIGRVVSDEARAMYNG---GTAGLTYWSP 155
+ +G P AT+ FPQ ++++ L + ++EAR ++ AGL +P
Sbjct: 90 -SNWGRRNPTATTQFPQAYGLGATWDPELIRQVAAQEAEEARYLFQSPRYDRAGLIVRAP 148
Query: 156 NVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNW 215
N ++ RDPRWGR +E GEDP +G A ++VRGLQG D K + KHF A N
Sbjct: 149 NADLARDPRWGRTEEVYGEDPFHAGTLATAFVRGLQGDDPRYFKAVSLVKHFLA----NS 204
Query: 216 NGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTI 275
N R ++ S++ + + PF M +++G ++M +YN VNG P P +L+ +
Sbjct: 205 NEDGRESSSSNFSERQWREYYAKPFEMAIVDGGAPALMAAYNAVNGTPAHVHP-MLRDIV 263
Query: 276 RGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTES---A 332
EW+LNG + +D + + + H AAA ++AG++ FL H ++ A
Sbjct: 264 MAEWKLNGILCTDGGGLRLLVEKHHAFPDLPSAAAACVKAGINH----FLDRHKDAVTEA 319
Query: 333 VQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLG----PKDVCTPDHQELALE 388
V RG ++E D++ AL V ++LG+ D + PY +G + PD Q L +
Sbjct: 320 VARGSITERDLDAALRGLFRVSLKLGLLDPD-ERVPYAAIGRNGEAEPWLRPDTQALVRK 378
Query: 389 AARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY 448
++ IVLLKN G LPL + +TVA++GP + T++ ++ G YT P IG
Sbjct: 379 VTQRSIVLLKNSGALLPLDRTKVKTVALVGP---LVNTVLPDWYGGTPPYTVP-PSIG-- 432
Query: 449 ARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLD------------QSIEAEAL 496
+ +G K AD + AA++ +R ++ I+ +G D S EA+
Sbjct: 433 VEKVAGEGVKVGWLAD--MGDAAVELARTSEIAIVCVGNDPISAGGWELVRTPSEGKEAV 490
Query: 497 DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAG 556
DR L LP Q++ + +V +A+ TI+VL+S P + + + AI+ + Q
Sbjct: 491 DRKDLALPRDQEKFIRRV-LAANPRTIVVLISNFPYAMPWVVK--HVPAIVHLTHASQEL 547
Query: 557 GTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVY 616
G A+ D+L+G NP GKL TW P+ PM + + GRTY+++KG +
Sbjct: 548 GHALGDVLWGEVNPDGKLAQTW-PKSLKQLPPMMDYDLT-------HGRTYQYFKGEPQF 599
Query: 617 PFGHGMSYTNFVHTVANAPTVVAVPLD-GRHGSINATISGKAIKVTHAKCNRLTLGVQVD 675
PFG G+SYT F ++N + V LD RH A ++ N + L + V+
Sbjct: 600 PFGFGLSYTTF--NLSN----LRVGLDVARHVGAGAETPAESPAPRTFAPNAI-LSIAVE 652
Query: 676 VKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVGINIHVCKY 730
V N G++ G + V++ P + P KQL F+++ V AG V + + ++
Sbjct: 653 VTNTGTRAGDEVVQVYARYPHSKVSRPLKQLCGFQRISVAAGETAHVRLQLPASRF 708
>gi|325916103|ref|ZP_08178390.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
ATCC 35937]
gi|325537647|gb|EGD09356.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
ATCC 35937]
Length = 896
Score = 305 bits (781), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 176/451 (39%), Positives = 249/451 (55%), Gaps = 29/451 (6%)
Query: 35 ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
A T P+ LP R DL+ R++L+EK + + A A+PRL + Y+WW+EALHGV
Sbjct: 33 AATPPPPYLDTQLPFETRAADLVSRMTLEEKAAQMQNAAPAIPRLRVPAYDWWNEALHGV 92
Query: 95 SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA------ 148
+ G GAT FPQ I A++F+ L + +SDEARA ++ A
Sbjct: 93 ARAG----------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLARDEHKR 142
Query: 149 --GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKH 206
GLT+WSPN+NIFRDPRWGRGQET GEDP L+ + ++V+GLQ G K+ A+ KH
Sbjct: 143 YQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQQGPYRKLDATAKH 202
Query: 207 FTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCA 266
+ + + DR HF+ S++D+ +T+ F+ V EG VA+VM +YN+VNG A
Sbjct: 203 YAVH---SGPEADRHHFDVHPSERDLHETYLPAFQALVQEGHVAAVMGAYNRVNGESASA 259
Query: 267 DPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLG 326
L+ +R +W +GYIVSDC ++ + TPE AAA ++ G DLDCG
Sbjct: 260 STR-LEGILRRDWGFDGYIVSDCAAIRDIWQNHKIVPTPEAAAALGVKHGTDLDCGDTYA 318
Query: 327 LHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELA 386
AV+ GL+ E I+ +L +T +MRLGMFD P+ + + +P H LA
Sbjct: 319 A-LPKAVRAGLIDEATIDTSLKRLMTTRMRLGMFD-PPAKVAWAQIPASVNQSPQHDALA 376
Query: 387 LEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI- 445
AR+ +VLLKN G LPL R +AV+GP +D ++++GNY G T LQGI
Sbjct: 377 RRTARESLVLLKNDG-LLPLKPTLKR-IAVVGPTADDPMSLLGNYYGTPAAPVTILQGIR 434
Query: 446 --GRYARTIHQQGCKDVACADDQLFGAAIDA 474
A ++ +G V +D A IDA
Sbjct: 435 DAAPQAEVVYARGSDLVEGREDPNAAAPIDA 465
Score = 125 bits (314), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 89/302 (29%), Positives = 144/302 (47%), Gaps = 53/302 (17%)
Query: 471 AIDASRQADATILVMGLDQSIEAEALD----------RAGLLLPGRQQELVSKVSMASKG 520
A+DA+R A+ + V GL +E E +D R LP Q+EL+ + A+
Sbjct: 623 AVDAARNAEVVVFVGGLTGDVEGEEMDVNYPGFAGGDRTDTRLPKPQRELLQALQ-ATGT 681
Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
P + VL +G + V +A+ + AI+ A YPGQ GG+A+ D+LFG ++PGG+LP+T+Y
Sbjct: 682 PVVAVLTTGSALAVDWAQQ--HVPAILLAWYPGQRGGSAVGDVLFGQASPGGRLPITFYK 739
Query: 581 QEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
+ LP + AMR GRTYR++ G +YPFGHG+SYT F ++
Sbjct: 740 E--AERLPAFDDYAMR--------GRTYRYFTGTALYPFGHGLSYTQFAYSDLRLDRTTL 789
Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP--PAG 697
G G++ AT+ V+N G + G + ++ P P
Sbjct: 790 ----GADGTLRATLK---------------------VRNTGKRAGDEVVQLYLHPLDPKR 824
Query: 698 HWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGTKH 756
A K+L F+++ + G Q+ V + L + D + T + G + + IG +
Sbjct: 825 ERA-GKELRGFQRMTLQPGEQREVAFTLKAADALRIYDEQRKTYAVDPGAYEVQIGASSS 883
Query: 757 SV 758
+
Sbjct: 884 DI 885
>gi|371777036|ref|ZP_09483358.1| glycoside hydrolase [Anaerophaga sp. HS1]
Length = 890
Score = 305 bits (781), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 176/430 (40%), Positives = 245/430 (56%), Gaps = 30/430 (6%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
+ +LP +R DL+ +++L+EKV + A A+ RLGI Y WW+E LHGV G
Sbjct: 40 YLDPTLPFEERAADLVSKMTLEEKVSQMQHAAPAIERLGIPEYNWWNECLHGVGRAGI-- 97
Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYW 153
AT FPQ I A+ ++ I VSDEARA ++ G GLT+W
Sbjct: 98 --------ATVFPQAIGMAAMWDDEEMYRIATAVSDEARAKHHDFARRGKRGIYQGLTFW 149
Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLD 213
+PN+NIFRDPRWGRG ET GEDP L+G+ A Y++GLQG D LK+ A+ KHF +
Sbjct: 150 TPNINIFRDPRWGRGMETYGEDPFLTGELAVDYIKGLQGDDDRYLKLVATSKHFLVHSGP 209
Query: 214 NWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKR 273
DR HF+A+ S +D T+ F+ + E V SVMC+YN+ NG+P C + +
Sbjct: 210 E---PDRHHFDARTSARDSLMTYTPHFKKTIQEAGVYSVMCAYNRYNGLPCCGSKPV-EN 265
Query: 274 TIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCGPFLGLHTES 331
+R EW GYIVSDC +V +Y H P EEAAA A++AG DL+CG +
Sbjct: 266 LLRNEWGFKGYIVSDCWAVADFYKKGHHEVVPTVEEAAAMAVKAGTDLNCGNSYPALVD- 324
Query: 332 AVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAAR 391
AV++GL+SE +I+ + + ++RLGMFD P PY ++ V + +H+ELAL AAR
Sbjct: 325 AVKQGLVSEEEIDVLVKRLMEARLRLGMFD-PPEMVPYTNIPYSVVDSKEHRELALIAAR 383
Query: 392 QGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY--- 448
+ +VLLKN +LPL + VAVIGPN++ ++ NY G TPL GI +
Sbjct: 384 KSMVLLKNDNNTLPLDK-NVKNVAVIGPNANNLDVLLANYNGYPSNPVTPLDGIRQKLPN 442
Query: 449 ARTIHQQGCK 458
A + GC+
Sbjct: 443 ANVQYALGCR 452
Score = 133 bits (335), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 90/295 (30%), Positives = 147/295 (49%), Gaps = 49/295 (16%)
Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
AI + +D ++ MGL ++E E + DR + LP Q +LV K M+
Sbjct: 610 AIQIAAASDVVLMFMGLSPNLEGEEMPVNVPGFSGGDRVDIKLPQIQTDLV-KAIMSLGK 668
Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
P +LVL++G + + + + + AI+ A YPGQAGGTAIAD+LFG NP G+LP+T+Y
Sbjct: 669 PVVLVLLNGSALAINWEAEN--VPAILEAWYPGQAGGTAIADVLFGDYNPAGRLPVTFY- 725
Query: 581 QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
+ +T LP P + GRTY+++KG ++PFG+G+SYT+F + P +
Sbjct: 726 -KSVTQLP-------PFEDYSMDGRTYQYFKGEALFPFGYGLSYTSFKYDNLVVPDKLE- 776
Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA 700
+GK + V VDV N G++DG + ++ + P A
Sbjct: 777 -------------AGKEVT------------VHVDVTNTGNRDGDEVVQLYVSHPDVESA 811
Query: 701 PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTK 755
P + L F+++ + AG + V + + L+V +P G + +GG +
Sbjct: 812 PIRSLQGFDRIALKAGETKTVSFTLK-PEQLAVYQPQNGLVVPAGNLKLSVGGRQ 865
>gi|294667502|ref|ZP_06732718.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
gi|292602731|gb|EFF46166.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
Length = 901
Score = 305 bits (781), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 176/458 (38%), Positives = 250/458 (54%), Gaps = 36/458 (7%)
Query: 28 FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
C A T P+ R DL+ R++L+EK + + A A+PRL + Y+WW
Sbjct: 20 LGCATGAAQAATPPYLDTQRSFEARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWW 79
Query: 88 SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNG-- 145
+EALHGV+ G GAT FPQ I A++F+ L + +SDEARA ++
Sbjct: 80 NEALHGVARAG----------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFL 129
Query: 146 ------GTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLK 199
GLT+WSPN+NIFRDPRWGRGQET GEDP L+ + ++V+GLQG GD K
Sbjct: 130 RQNQHERYQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGGDAPK 189
Query: 200 VA---------ASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVA 250
A A+ KHF + DR HF+A S++D+ +T+ F V +GKV
Sbjct: 190 NAQGERYRKLDATAKHFAVHSGPE---ADRHHFDAHPSQRDLYETYLPAFEALVKDGKVD 246
Query: 251 SVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAA 310
+VM +YN+V G A +L+ +R +W GY+VSDC ++ + +T E+AAA
Sbjct: 247 AVMGAYNRVYGESASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVATREQAAA 306
Query: 311 DAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYG 370
A++ G +L+CG +AV++GL+ E I+ AL +T +MRLGMFD P P+
Sbjct: 307 LAVKHGTELECGEEYST-LPAAVRQGLIDEAQIDTALKTLMTARMRLGMFD-PPGQLPWS 364
Query: 371 HLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGN 430
+ +P H LA AR+ +VLLKN G LPLS R + +AVIGP +D T+ ++GN
Sbjct: 365 QIPASVNQSPAHDALARRTARESLVLLKNDG-LLPLSRARLKRIAVIGPTADDTMALLGN 423
Query: 431 YAGIACGYTTPLQGI---GRYARTIHQQGCKDVACADD 465
Y G T LQGI A+ ++ +G V DD
Sbjct: 424 YYGTPAAPVTVLQGIRAAAPNAQVLYARGADLVEGRDD 461
Score = 126 bits (317), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 93/302 (30%), Positives = 148/302 (49%), Gaps = 53/302 (17%)
Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
A+D +R A+ + V GL +E E + DR L LP Q++L+ + K
Sbjct: 628 ALDVARSAEVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEALHATGK- 686
Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
P + VL +G + + +A+ + AI+ A YPGQ GGTA+AD LFG +NPGG+LP+T+Y
Sbjct: 687 PVVAVLTTGSALAIDWAQQ--HLPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFY- 743
Query: 581 QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHT-VANAPTVVA 639
+E T + AMR GRTYR++ G +YPFGHG+SYT F ++ + T +A
Sbjct: 744 KESETLPAFDDYAMR--------GRTYRYFGGTPLYPFGHGLSYTQFAYSGLRLDRTTIA 795
Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP--PAG 697
GS+ AT++ VKN G + G + ++ P P
Sbjct: 796 T-----DGSLTATVT---------------------VKNTGQRAGDEVVQLYLHPLTPQR 829
Query: 698 HWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGTKH 756
A K+L F+++ + G Q+ +G I+ L + D + + G + + IG +
Sbjct: 830 ERA-GKELHGFQRIALTPGEQRELGFTINAKDALRLYDEQRKAYVVDPGAYEVQIGASSA 888
Query: 757 SV 758
+
Sbjct: 889 DI 890
>gi|188574621|ref|YP_001911550.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae PXO99A]
gi|188519073|gb|ACD57018.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae PXO99A]
Length = 904
Score = 305 bits (781), Expect = 7e-80, Method: Compositional matrix adjust.
Identities = 169/422 (40%), Positives = 241/422 (57%), Gaps = 33/422 (7%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
P+ Q QR DL+ R++L+EK + + A A+PRLG+ Y+WW+EALHGV+ G
Sbjct: 36 PYLQTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG-- 93
Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTY 152
GAT FPQ I A++F+ L + +SDEARA ++ GLT+
Sbjct: 94 --------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHRFLRQHQHARYQGLTF 145
Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG--------SDGDRL-KVAAS 203
WSPN+NIFRDPRWGRGQET GEDP L+ + ++V+GLQG + G+R K+ A+
Sbjct: 146 WSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGSDAPKNAQGERYRKLDAT 205
Query: 204 CKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVP 263
KHF + DR HF+A+ S++D+ +T+ F V +GKV +VM +YN+V G
Sbjct: 206 AKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGES 262
Query: 264 TCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGP 323
A +L+ +R +W GY+VSDC ++ + +T E+AAA A+ G +L+CG
Sbjct: 263 ASASKFLLQDVLRQQWGFKGYVVSDCWAIVDVWKHHKIVATREQAAALAVTHGTELECGE 322
Query: 324 FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQ 383
+AV +GL+ E I+ AL +T +MRLGMFD P P+ + +P H
Sbjct: 323 EYST-LPAAVHQGLIDEAQIDTALQTLMTARMRLGMFD-PPGQLPWSKIPASVNQSPAHD 380
Query: 384 ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQ 443
LA AR+ +VLLKN G LPLS + +AVIGP +D T+ ++GNY G T LQ
Sbjct: 381 ALARRTARESLVLLKNDG-LLPLSRATLKRIAVIGPTADDTMALLGNYYGTPAAPVTVLQ 439
Query: 444 GI 445
GI
Sbjct: 440 GI 441
Score = 124 bits (312), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 93/301 (30%), Positives = 141/301 (46%), Gaps = 51/301 (16%)
Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
A+D +R AD + V GL +E E + DR L LP Q+EL+ + K
Sbjct: 631 ALDVARSADVVVFVGGLTGDVEGEEMKVSYPGFAGGDRTDLRLPKPQRELLEALQATGK- 689
Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
P + VL +G + + +A+ + AI+ A YPGQ GGTA+AD LFG +NPGG+LP+T+Y
Sbjct: 690 PVVAVLTAGSALAIDWAQQ--HVPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFY- 746
Query: 581 QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
+E T + AM GRTYR++ G +YPFGHG+SYT F ++
Sbjct: 747 KESETLPAFDDYAMH--------GRTYRYFGGTPLYPFGHGLSYTQFAYS---------- 788
Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP--PAGH 698
D R T G L V VKN G + G + ++ P P
Sbjct: 789 --DLRLDRSTLTADG-------------ALTATVAVKNTGQRAGDEVVQLYLHPLKPQRE 833
Query: 699 WAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGTKHS 757
A K+L F+++ + G Q+ + I+ L + D + + G + + IG +
Sbjct: 834 RA-GKELRGFQRLALQPGQQRELRFTINAKDALRIYDAQRKAYTVDPGAYEVQIGASSAD 892
Query: 758 V 758
+
Sbjct: 893 I 893
>gi|58584046|ref|YP_203062.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae KACC
10331]
gi|84625823|ref|YP_453195.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae MAFF
311018]
gi|58428640|gb|AAW77677.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae KACC
10331]
gi|84369763|dbj|BAE70921.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae MAFF
311018]
Length = 904
Score = 305 bits (781), Expect = 7e-80, Method: Compositional matrix adjust.
Identities = 169/422 (40%), Positives = 241/422 (57%), Gaps = 33/422 (7%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
P+ Q QR DL+ R++L+EK + + A A+PRLG+ Y+WW+EALHGV+ G
Sbjct: 36 PYLQTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG-- 93
Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTY 152
GAT FPQ I A++F+ L + +SDEARA ++ GLT+
Sbjct: 94 --------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHRFLRQHQHARYQGLTF 145
Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG--------SDGDRL-KVAAS 203
WSPN+NIFRDPRWGRGQET GEDP L+ + ++V+GLQG + G+R K+ A+
Sbjct: 146 WSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGSDAPKNAQGERYRKLDAT 205
Query: 204 CKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVP 263
KHF + DR HF+A+ S++D+ +T+ F V +GKV +VM +YN+V G
Sbjct: 206 AKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGES 262
Query: 264 TCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGP 323
A +L+ +R +W GY+VSDC ++ + +T E+AAA A+ G +L+CG
Sbjct: 263 ASASKFLLQDVLRQQWGFKGYVVSDCWAIVDVWKHHKIVATREQAAALAVTHGTELECGE 322
Query: 324 FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQ 383
+AV +GL+ E I+ AL +T +MRLGMFD P P+ + +P H
Sbjct: 323 EYST-LPAAVHQGLIDEAQIDTALQTLMTARMRLGMFD-PPGQLPWSKIPASVNQSPAHD 380
Query: 384 ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQ 443
LA AR+ +VLLKN G LPLS + +AVIGP +D T+ ++GNY G T LQ
Sbjct: 381 ALARRTARESLVLLKNDG-LLPLSRATLKRIAVIGPTADDTMALLGNYYGTPAAPVTVLQ 439
Query: 444 GI 445
GI
Sbjct: 440 GI 441
Score = 125 bits (313), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 94/301 (31%), Positives = 141/301 (46%), Gaps = 51/301 (16%)
Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
A+D +R AD + V GL +E E + DR L LP Q+EL+ + K
Sbjct: 631 ALDVARSADVVVFVGGLTGDVEGEEMKVSYPGFAGGDRTDLRLPKPQRELLEALQATGK- 689
Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
P + VL +G + V +A+ + AI+ A YPGQ GGTA+AD LFG +NPGG+LP+T+Y
Sbjct: 690 PVVAVLTAGSALAVDWAQQ--HVPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFY- 746
Query: 581 QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
+E T + AM GRTYR++ G +YPFGHG+SYT F ++
Sbjct: 747 KESETLPAFDDYAMH--------GRTYRYFGGTPLYPFGHGLSYTQFAYS---------- 788
Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP--PAGH 698
D R T G L V VKN G + G + ++ P P
Sbjct: 789 --DLRLDRSTLTADG-------------ALTATVAVKNTGQRAGDEVVQLYLHPLKPQRE 833
Query: 699 WAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGTKHS 757
A K+L F+++ + G Q+ + I+ L + D + + G + + IG +
Sbjct: 834 RA-GKELRGFQRLALQPGQQRELRFTINAKDALRIYDAQRKAYTVDPGAYEVQIGASSAD 892
Query: 758 V 758
+
Sbjct: 893 I 893
>gi|418518550|ref|ZP_13084692.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB1386]
gi|418522850|ref|ZP_13088880.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB2388]
gi|410700720|gb|EKQ59264.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB2388]
gi|410703176|gb|EKQ61671.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB1386]
Length = 901
Score = 305 bits (781), Expect = 7e-80, Method: Compositional matrix adjust.
Identities = 175/458 (38%), Positives = 254/458 (55%), Gaps = 36/458 (7%)
Query: 28 FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
C A T T P+ R DL+ R++L+EK + + A A+PRL + Y+WW
Sbjct: 20 LGCATGAAQTATPPYLDTQRSFEARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWW 79
Query: 88 SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--- 144
+EALHGV+ G GAT FPQ I A++F+ L + +SDEARA ++
Sbjct: 80 NEALHGVARAG----------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFL 129
Query: 145 -----GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG------- 192
GLT+WSPN+NIFRDPRWGRGQET GEDP L+ + ++V+GLQG
Sbjct: 130 RQNQHARYQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGADAPK 189
Query: 193 -SDGDRL-KVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVA 250
+ G+R K+ A+ KHF + DR HF+A+ S++D+ +T+ F V +GKV
Sbjct: 190 NAQGERYRKLDATAKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVKDGKVD 246
Query: 251 SVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAA 310
+VM +YN+V G A +L+ +R +W GY+VSDC ++ + +T E+AAA
Sbjct: 247 AVMGAYNRVYGESASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVATREQAAA 306
Query: 311 DAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYG 370
A++ G +L+CG +AV++GL+ E I+ AL +T +MRLGMFD P P+
Sbjct: 307 LAVKHGTELECGEEYAT-LPAAVRQGLIDEAQIDTALKTLMTARMRLGMFD-PPGQLPWS 364
Query: 371 HLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGN 430
+ +P H LA AR+ +VLLKN G LPLS + + +AVIGP +D T+ ++GN
Sbjct: 365 TIPASVNQSPAHDALARRTARESLVLLKNDG-LLPLSRAKLKRIAVIGPTADDTMALLGN 423
Query: 431 YAGIACGYTTPLQGI---GRYARTIHQQGCKDVACADD 465
Y G T LQGI A+ ++ +G V DD
Sbjct: 424 YYGTPAAPVTVLQGIRAAAPNAQVLYARGADLVEGRDD 461
Score = 129 bits (324), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 93/301 (30%), Positives = 147/301 (48%), Gaps = 51/301 (16%)
Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
A+D +R AD + V GL +E E + DR L LP Q++L+ + K
Sbjct: 628 ALDVARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEALQATGK- 686
Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
P + VL +G + + +A+ + AI+ A YPGQ GGTA+AD LFG +NPGG+LP+T+Y
Sbjct: 687 PVVAVLTAGSALAIDWAQQ--HLPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFY- 743
Query: 581 QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHT-VANAPTVVA 639
+E T + AMR GRTYR++ G +YPFGHG+SYT F ++ + T +A
Sbjct: 744 KESETLPAFDDYAMR--------GRTYRYFGGTPLYPFGHGLSYTQFAYSGLRLDRTTIA 795
Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW 699
GS+ AT++ VKN G + G + ++ P A
Sbjct: 796 T-----DGSLTATVT---------------------VKNTGQRAGDEVVQLYLHPLAPQR 829
Query: 700 A-PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGTKHS 757
K+L F+++ + G Q+ +G I+ L + D + + G + + IG +
Sbjct: 830 ERAGKELHGFQRIALQPGEQRELGFTINAKDALRLYDEQRKAYGVDPGAYEVQIGASSAD 889
Query: 758 V 758
+
Sbjct: 890 I 890
>gi|225873995|ref|YP_002755454.1| beta-xylosidase B [Acidobacterium capsulatum ATCC 51196]
gi|225792796|gb|ACO32886.1| beta-xylosidase B [Acidobacterium capsulatum ATCC 51196]
Length = 896
Score = 305 bits (780), Expect = 8e-80, Method: Compositional matrix adjust.
Identities = 173/417 (41%), Positives = 245/417 (58%), Gaps = 32/417 (7%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
P+ + PI +RV++L+ +++LQE+ +++ A A+PRLG+ Y WWSE LHG++ G
Sbjct: 38 PWDNPNQPIQKRVHELVSQMTLQEEAAQMMNTAPAIPRLGVPAYNWWSEGLHGIARSG-- 95
Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTY 152
AT FPQ I +++F+ +G VS EARA YN GLT
Sbjct: 96 --------YATVFPQAIGMSATFDPAAIHQMGTTVSTEARAKYNWAIRHDIHSIYFGLTL 147
Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL 212
W+PN+NI RDPRWGRGQET GEDP L+G AA YV GLQG++ LK A+ KHF+ Y
Sbjct: 148 WAPNINIVRDPRWGRGQETYGEDPFLTGTMAAEYVSGLQGNNPKYLKTVATPKHFSVY-- 205
Query: 213 DNWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNI 270
NG + R NA S D++DT+ FRM + +G S+MCSYN V GVP+CA+ +
Sbjct: 206 ---NGPESMRHKINANPSAHDMQDTYLAAFRMAITKGHADSMMCSYNAVYGVPSCAN-KL 261
Query: 271 LKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPE--EAAADAIRAGLDLDCGPFLGLH 328
L +RG+W +GYI SDC ++ +Y +P+ AAA A+ AG D DCG +
Sbjct: 262 LADVVRGKWGFDGYITSDCGAISDFYRPGAHGYSPDAVHAAASAVLAGTDTDCGTGYKVL 321
Query: 329 TESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALE 388
+S VQ+GL+S+ I+ A+ T + RLGMFD + + PY + V + H+ ALE
Sbjct: 322 PQS-VQQGLISKAAIDRAVERLFTARFRLGMFDPK-ADVPYNSIPYSVVDSAAHRAQALE 379
Query: 389 AARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
A + +VLLKN+G LPL + RT+AV+GPN+ ++ GNY I + P+ GI
Sbjct: 380 DASKSMVLLKNEGGILPLRNA--RTIAVVGPNAANLNSIEGNYNAIPSHPSLPVDGI 434
Score = 130 bits (327), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 89/282 (31%), Positives = 147/282 (52%), Gaps = 50/282 (17%)
Query: 485 MGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDV 534
MGL + +E + DR L LP QQ+L+ + +A+ P +LVL++G + +
Sbjct: 633 MGLSPTFVSEEMPIKIPGFDGGDRTRLSLPQTQQDLLHAL-VATGKPVVLVLLNGSALSI 691
Query: 535 AFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL-PMTEMA 593
+AK + I+ A YPG+AGG AI + L G ++PGGKLP+T+Y + +L P T+ +
Sbjct: 692 DWAKQ--HVQGILEAWYPGEAGGEAIGETLSGQNDPGGKLPITFYTS--VKDLPPFTDYS 747
Query: 594 MRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATI 653
M+ GRTYR+Y G ++PFG+G+SYT F ++ + +
Sbjct: 748 MK--------GRTYRYYTGKPLFPFGYGLSYTTFEYS-------------------HVRL 780
Query: 654 SGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHV 713
S +K L V+ +VKN G G V+ TPP P K+L F++VH+
Sbjct: 781 STSNLKAGEP------LTVEAEVKNTGHVAGDAVTEVYVTPPQNGVNPLKELKGFDRVHL 834
Query: 714 PAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTK 755
G +++ ++ + LS+VD +G R + G ++I +GG++
Sbjct: 835 APGQSRQLTFTLN-PRDLSLVDEAGKRSVQPGVYSIFVGGSQ 875
>gi|289670678|ref|ZP_06491753.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. musacearum
NCPPB 4381]
Length = 886
Score = 304 bits (779), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 175/425 (41%), Positives = 243/425 (57%), Gaps = 38/425 (8%)
Query: 39 TLPFCQVSLPIP------QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALH 92
TLP ++ P QR L+ ++S +EKV ++ A A+PRLGI YEWWSE LH
Sbjct: 18 TLPCLALAAPTERAATPEQRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLH 77
Query: 93 GVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--GGT--- 147
G++ G AT FPQ I A+S+N L + +G VVS EARA +N GG
Sbjct: 78 GIARNG----------HATVFPQAIGLAASWNTNLMQQVGTVVSTEARAKFNQAGGPGKD 127
Query: 148 ----AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAAS 203
AGLT WSPN+NIFRDPRWGRG ET GEDP L+G+ A ++RGLQG D + + A+
Sbjct: 128 HKRYAGLTIWSPNINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLNHPRTIAT 187
Query: 204 CKHFTAYDLDNWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNG 261
KH + +G + R F+ VS D+E T+ FR +++G+ SVMC+YN ++G
Sbjct: 188 PKHLAVH-----SGPEPGRHGFDVDVSPHDVEATYTPAFRAALVQGQAGSVMCAYNSLHG 242
Query: 262 VPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC 321
P CA +L +RG+W G++VSDCD+V +F ++A A++AG DL+C
Sbjct: 243 TPACAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNC 302
Query: 322 G-PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTP 380
G + L T A++RG + E ++ +LV + RLG + P PY LG KDV
Sbjct: 303 GHAYRELGT--AIERGDVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNA 359
Query: 381 DHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTT 440
H+ LAL+AA + IVLLKN +LPL+ +AVIGPN+D + NY G + T
Sbjct: 360 AHRALALQAAAESIVLLKNDANTLPLNA--GTRLAVIGPNADALAALEANYQGTSSAPVT 417
Query: 441 PLQGI 445
PL G+
Sbjct: 418 PLLGL 422
Score = 144 bits (362), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 98/287 (34%), Positives = 147/287 (51%), Gaps = 48/287 (16%)
Query: 477 QADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVL 526
Q+DA + +GL +E E L DR + LP QQ L+ + + K P ++VL
Sbjct: 614 QSDAVVAFVGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLERAKASGK-PLVVVL 672
Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
MSG + + +AK AI+ A YPGQ+GGTAIA +L G NPGG+LP+T+Y +
Sbjct: 673 MSGSAVALNWAKT--HADAIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFY--RSTKD 728
Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
LP S GRTYR++KG ++PFG+G+SYT F + +AP + + L
Sbjct: 729 LPAY-------VSYDMKGRTYRYFKGEPLFPFGYGLSYTRFAY---DAPQLSSTTLQ--- 775
Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLV 706
+G ++VT V+N G+ G V+ P +P + LV
Sbjct: 776 -------AGNPLQVT------------TTVRNTGTHAGDEVAQVYLQYPDRPQSPLRSLV 816
Query: 707 AFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
F++VH+ AG Q+ + N+ + LS VDRSG R + G + + +GG
Sbjct: 817 GFQRVHLAAGEQRTLTFNLD-ARALSDVDRSGQRAVEAGNYTLFVGG 862
>gi|359450637|ref|ZP_09240068.1| beta-glucosidase [Pseudoalteromonas sp. BSi20480]
gi|358043611|dbj|GAA76317.1| beta-glucosidase [Pseudoalteromonas sp. BSi20480]
Length = 468
Score = 304 bits (779), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 167/426 (39%), Positives = 247/426 (57%), Gaps = 31/426 (7%)
Query: 30 CDPKDATTRTLP-FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWS 88
C + +T+T P + S I +RVNDL+ RL+L+EKV L + A+ RL + Y WW+
Sbjct: 21 CAKDEQSTQTPPLYLNKSASIDERVNDLVTRLTLEEKVAQLFDKSPAIERLNMPEYNWWN 80
Query: 89 EALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT- 147
EALHGV+ G AT FPQ I A++F+ L +G +SDE RA ++
Sbjct: 81 EALHGVARAGK----------ATVFPQAIGLAATFDEDLMLRVGTAISDEGRAKHHAFLE 130
Query: 148 -------AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKV 200
GLTYWSPN+NIFRDPRWGRGQET GEDP L+ + A +++ GLQG + + LK
Sbjct: 131 ENNRSMYTGLTYWSPNINIFRDPRWGRGQETYGEDPYLTTRIAVNFINGLQGDNAEYLKS 190
Query: 201 AASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVN 260
A+ KH+ + + V R + S++D+ +T+ F+ + + KVASVMC+YN VN
Sbjct: 191 VATLKHYAVH---SGPEVSRHSDDYTASEKDLAETYLPAFKDVIAQTKVASVMCAYNSVN 247
Query: 261 GVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYD--TQHFTSTPEEAAADAIRAGLD 318
G P C + +++ +R E+ +GYIVSDC ++ +YD + + +T +AAA A++ G D
Sbjct: 248 GTPACGNDELIQNKLRDEFNFDGYIVSDCGAIADFYDVKSHNIVNTGAKAAAMALKTGTD 307
Query: 319 LDCGPFLG---LHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPK 375
L+CG G + AV+ GL+ E D++ AL + + +LGMFD P + PY
Sbjct: 308 LNCGDHHGNTYSYLTQAVKEGLVEEKDVDKALKRLMYARFKLGMFDN-PENVPYSDTSID 366
Query: 376 DVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIA 435
V + H L EAA++ +VLLKN+ LPL + +A+IGPN+D ++GNY G+
Sbjct: 367 VVGSNKHLALTQEAAQKSLVLLKNE-QVLPLK--GNEKIALIGPNADNEAILLGNYNGMP 423
Query: 436 CGYTTP 441
TP
Sbjct: 424 IVPITP 429
>gi|325925754|ref|ZP_08187127.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas perforans
91-118]
gi|325543811|gb|EGD15221.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas perforans
91-118]
Length = 874
Score = 304 bits (779), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 172/407 (42%), Positives = 236/407 (57%), Gaps = 32/407 (7%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
QR L+ ++S +EKV ++ A A+PRLGI YEWWSE LHG++ G A
Sbjct: 24 QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------YA 73
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--GGT-------AGLTYWSPNVNIFR 161
T FPQ I A+S+N L + +G VVS EARA +N GG AGLT WSPN+NIFR
Sbjct: 74 TVFPQAIGLAASWNTRLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFR 133
Query: 162 DPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD-- 219
DPRWGRG ET GEDP L+G+ A ++RGLQG D + + A+ KH + +G +
Sbjct: 134 DPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLNHPRTIATPKHIAVH-----SGPEPG 188
Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
R F+ VS +D+E T+ FR ++EG+ SVMC+YN ++G P CA +L +RG+W
Sbjct: 189 RHGFDVDVSPRDVEATYTPAFRAAIVEGQAGSVMCAYNSLHGTPACAADWLLNGRVRGDW 248
Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLL 338
G++VSDCD+V +F ++A A++AG DL+CG + L T A+ RG +
Sbjct: 249 GFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAYRELGT--AIARGEV 306
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
E ++ +LV + RLG + P PY LG KDV H+ LAL+AA + IVLLK
Sbjct: 307 DEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRALALQAAAESIVLLK 365
Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
N +LPL +AVIGPN+D + NY G + TPL G+
Sbjct: 366 NDANTLPLKA--GTRLAVIGPNADALAALEANYQGTSSAPVTPLLGL 410
Score = 135 bits (340), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 94/278 (33%), Positives = 141/278 (50%), Gaps = 48/278 (17%)
Query: 486 GLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
GL +E E L DR + LP QQ L+ + + K P ++VLMSG + +
Sbjct: 611 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLERAKASGK-PLVVVLMSGSAVALN 669
Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
+AK AI+ A YPGQ+GGTAIA +L G NPGG+LP+T+Y +LP
Sbjct: 670 WAKT--HADAIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFY--RSTKDLPAY----- 720
Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG 655
S GRTYR++KG ++ FG+G+SYT F + +AP + L +G
Sbjct: 721 --VSYDMKGRTYRYFKGEPLFAFGYGLSYTRFAY---DAPQLSTTTLQ----------AG 765
Query: 656 KAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPA 715
+++VT V+N G++ G V+ P +P + LV F++VH+ A
Sbjct: 766 SSLQVT------------TTVRNTGARAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLAA 813
Query: 716 GAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
G Q+ + N+ + LS VDRSG R + G + + +GG
Sbjct: 814 GEQRTLTFNLD-ARALSDVDRSGQRAVEAGNYTLFVGG 850
>gi|58581402|ref|YP_200418.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae KACC
10331]
gi|58425996|gb|AAW75033.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae KACC
10331]
Length = 889
Score = 304 bits (778), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 169/407 (41%), Positives = 234/407 (57%), Gaps = 32/407 (7%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
QR DL+ +S +EKV ++ A A+PRLGI YEWWSE LHG++ G A
Sbjct: 39 QRAADLVAHMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------YA 88
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGG---------TAGLTYWSPNVNIFR 161
T FPQ I A+S+N L + +G VVS EARA +N AGLT WSPN+NIFR
Sbjct: 89 TVFPQAIGLAASWNTHLMQQVGTVVSTEARAKFNQAGRPGNDHKRYAGLTIWSPNINIFR 148
Query: 162 DPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD-- 219
DPRWGRG ET GEDP L+G+ A ++ GLQG D D + A+ KH + +G +
Sbjct: 149 DPRWGRGMETYGEDPFLTGQMAVGFIHGLQGEDLDHPRTIATPKHLAVH-----SGPEPG 203
Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
R F+ VS +D+E T+ FR ++EG+ +VMC+YN ++G P CA ++ +RG+W
Sbjct: 204 RHGFDVDVSPRDVEATYTPAFRAAIVEGQAGAVMCAYNSLHGTPACAADWLINGRVRGDW 263
Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLL 338
G++VSDCD+V +F ++A A++AG DL+CG + L T A+ RG +
Sbjct: 264 GFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAYRELGT--AIARGEV 321
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
E ++ +LV + RLG + P PY LG KDV H+ LAL+AA + IVLLK
Sbjct: 322 DEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAQHRALALQAAAESIVLLK 380
Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
N +LPL+ +AVIGPN+D + NY G + TPL G+
Sbjct: 381 NNANTLPLNA--GTRLAVIGPNADALAALEANYQGTSSAPVTPLLGL 425
Score = 139 bits (350), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 96/278 (34%), Positives = 142/278 (51%), Gaps = 48/278 (17%)
Query: 486 GLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
GL +E E L DR + LP QQ L+ + + K P ++VLMSG + +
Sbjct: 626 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLERAKASGK-PLVVVLMSGSAVALN 684
Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
+AK AI+ A YPGQ+GGTAIA +L G NPGG+LP+T+Y +LP
Sbjct: 685 WAKT--HADAIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFY--RSTKDLPAY----- 735
Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG 655
S GRTYR++KG ++PFG+G+SYT F + +AP + S A +G
Sbjct: 736 --VSYDMKGRTYRYFKGEPLFPFGYGLSYTRFAY---DAPQL----------SSTAVQAG 780
Query: 656 KAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPA 715
++VT V+N G++ G V+ P +P + LV F++VH+ A
Sbjct: 781 STLQVT------------TTVRNTGARAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLAA 828
Query: 716 GAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
G Q+ + N+ + LS VDRSG R + G + + +GG
Sbjct: 829 GEQRTLTFNLD-ARALSDVDRSGQRAVEAGNYTLFVGG 865
>gi|399025517|ref|ZP_10727513.1| beta-glucosidase-like glycosyl hydrolase [Chryseobacterium sp.
CF314]
gi|398077894|gb|EJL68841.1| beta-glucosidase-like glycosyl hydrolase [Chryseobacterium sp.
CF314]
Length = 875
Score = 304 bits (778), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 168/446 (37%), Positives = 252/446 (56%), Gaps = 32/446 (7%)
Query: 28 FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
F C PF +LP+ QR+ +L+G L++ EK+ +++ + AVPRL I Y WW
Sbjct: 11 FICAFSFGQNYKYPFRNPNLPVEQRIENLLGLLTVDEKIGMMMDNSKAVPRLEIPAYGWW 70
Query: 88 SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--- 144
+EALHGV+ G AT FPQ I A++++ ++SDEARA YN
Sbjct: 71 NEALHGVARAGT----------ATVFPQAIGMAAAWDVPEHLKTFEMISDEARAKYNKSF 120
Query: 145 ------GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL 198
G GLT+W+PN+NIFRDPRWGRGQET GEDP L+ + V+GLQG+D
Sbjct: 121 DEASKTGRYEGLTFWTPNINIFRDPRWGRGQETYGEDPYLTSVLGVAAVKGLQGNDPKYF 180
Query: 199 KVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQ 258
K A KHF + WN R +NA+VSK+D+ +T+ F+ V+EG V VMC+YN
Sbjct: 181 KTHACAKHFAVHSGPEWN---RHSYNAEVSKRDLYETYLPAFKSLVLEGNVREVMCAYNA 237
Query: 259 VNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEE--AAADAIRAG 316
+G P CA +L +RG+W+ +G +VSDC ++ +Y ++ + P+E AADA++
Sbjct: 238 FDGQPCCASNTLLNEILRGKWKYDGMVVSDCWALADFYQEKYHGTHPDEKSTAADALKHS 297
Query: 317 LDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKD 376
DL+CG + ++ GL++E DI+ ++ L LGM D P S + P
Sbjct: 298 TDLECGDTYN-NLNKSLAGGLITEKDIDISMRRILKGWFELGMLD--PKSSVLWNQIPYS 354
Query: 377 VCTPD-HQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIA 435
V D H++ AL+ A++ IVL+KN+ LP + + +AV+GPN+D + +GNY G
Sbjct: 355 VVDSDEHKKQALKMAQKSIVLMKNENNILPFNK-NIKKIAVVGPNADDEMMQLGNYNGTP 413
Query: 436 CGYTTPLQGI-GRYART--IHQQGCK 458
T L+GI ++ T I+++G +
Sbjct: 414 SSIVTILEGIKAKFPNTEIIYEKGSE 439
Score = 105 bits (263), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 80/311 (25%), Positives = 137/311 (44%), Gaps = 39/311 (12%)
Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
F + + + AD + GL S+E E + D+ + LP Q+EL++++
Sbjct: 592 FASVKEKVKDADVIVFAGGLSPSLEGEEMLVNAEGFKGGDKTSIELPKVQRELLAELRKT 651
Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
K P + VL +G + +++ ++ A Y GQ+GGTA+AD+L G NP G+LP+T
Sbjct: 652 GK-PVVFVLCTGSSL--GLEQDEKNYDVLLNAWYGGQSGGTAVADVLAGDYNPSGRLPVT 708
Query: 578 WYPQ-EYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
+Y E + N + ++ GRTYR+ +Y FGHG+SY+ F +
Sbjct: 709 FYKNLEQLDNALSKTSKHQGFENYDMQGRTYRYMTENPLYAFGHGLSYSKFNYG------ 762
Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA 696
NA +S +I + + V V N+ +DG + V+
Sbjct: 763 -------------NAKLSKNSISPNE------DIIITVPVTNISDRDGEEVVQVYVKRNN 803
Query: 697 GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKH 756
AP K L AFE+V + + + + + I + +++ G++ I GGT
Sbjct: 804 DVLAPVKTLRAFERVLIRSKETKNIQLTISKESFKFYDEKADDLISKSGDYTILYGGTSA 863
Query: 757 SVSLHAATLGV 767
+ L L V
Sbjct: 864 NSGLKNMQLKV 874
>gi|381170979|ref|ZP_09880130.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
citri pv. mangiferaeindicae LMG 941]
gi|380688543|emb|CCG36617.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
citri pv. mangiferaeindicae LMG 941]
Length = 901
Score = 304 bits (778), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 175/458 (38%), Positives = 253/458 (55%), Gaps = 36/458 (7%)
Query: 28 FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
C A T P+ QR DL+ R++L+EK + + A A+PRL + Y+WW
Sbjct: 20 LGCATGAAQAATPPYLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWW 79
Query: 88 SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--- 144
+EALHGV+ G GAT FPQ I A++F+ L + +SDEARA ++
Sbjct: 80 NEALHGVARAG----------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFL 129
Query: 145 -----GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG------- 192
GLT+WSPN+NIFRDPRWGRGQET GEDP L+ + ++V+GLQG
Sbjct: 130 RQNQHARYQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGAAAPK 189
Query: 193 -SDGDRL-KVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVA 250
+ G+ K+ A+ KHF + DR HF+A+ S++D+ +T+ F V EGKV
Sbjct: 190 NAQGEPYRKLDATAKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVKEGKVD 246
Query: 251 SVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAA 310
+VM +YN+V G A +L+ +R +W GY+VSDC ++ + +T E+AAA
Sbjct: 247 AVMGAYNRVYGESASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVATREQAAA 306
Query: 311 DAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYG 370
A++ G +L+CG +AV++GL+ E I+ AL +T +MRLGMFD P P+
Sbjct: 307 LAVKHGTELECGEEYAT-LPAAVRQGLIDEAQIDTALKTLMTARMRLGMFD-PPGQLPWS 364
Query: 371 HLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGN 430
+ +P H LA AR+ +VLLKN G LPLS + + +AVIGP +D T+ ++GN
Sbjct: 365 TIPASVNQSPAHDALARRTARESLVLLKNDG-LLPLSRAKLKRIAVIGPTADDTMALLGN 423
Query: 431 YAGIACGYTTPLQGI---GRYARTIHQQGCKDVACADD 465
Y G T LQGI A+ ++ +G V DD
Sbjct: 424 YYGTPAAPVTVLQGIRAAAPNAQVLYARGADLVEGRDD 461
Score = 128 bits (322), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 93/301 (30%), Positives = 148/301 (49%), Gaps = 51/301 (16%)
Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
A+D +R AD + V GL +E E + DR L LP Q++L+ + A+
Sbjct: 628 ALDVARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEALQ-ATGR 686
Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
P + VL +G + + +A+ + AI+ A YPGQ GGTA+AD LFG +NPGG+LP+T+Y
Sbjct: 687 PVVAVLTTGSALAIDWAQQ--HLPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFY- 743
Query: 581 QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHT-VANAPTVVA 639
+E T + AMR GRTYR++ G +YPFGHG+SYT F ++ + T +A
Sbjct: 744 KESETLPAFDDYAMR--------GRTYRYFGGTPLYPFGHGLSYTQFAYSGLRLDRTTIA 795
Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW 699
GS+ AT++ VKN G + G + ++ P A
Sbjct: 796 T-----DGSLTATVT---------------------VKNTGQRAGDEVVQLYLHPLAPQR 829
Query: 700 A-PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGTKHS 757
K+L F+++ + G Q+ +G I+ L + D + + G + + IG +
Sbjct: 830 ERAGKELHGFQRIALQPGEQRELGFTINAKDALRLYDEQRKAYGVDPGAYEVQIGASSAD 889
Query: 758 V 758
+
Sbjct: 890 I 890
>gi|84623339|ref|YP_450711.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae MAFF
311018]
gi|188577358|ref|YP_001914287.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae PXO99A]
gi|84367279|dbj|BAE68437.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae MAFF
311018]
gi|188521810|gb|ACD59755.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae PXO99A]
Length = 889
Score = 304 bits (778), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 169/407 (41%), Positives = 234/407 (57%), Gaps = 32/407 (7%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
QR DL+ +S +EKV ++ A A+PRLGI YEWWSE LHG++ G A
Sbjct: 39 QRAADLVAHMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------YA 88
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGG---------TAGLTYWSPNVNIFR 161
T FPQ I A+S+N L + +G VVS EARA +N AGLT WSPN+NIFR
Sbjct: 89 TVFPQAIGLAASWNTHLMQQVGTVVSTEARAKFNQAGRPGKDHKRYAGLTIWSPNINIFR 148
Query: 162 DPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD-- 219
DPRWGRG ET GEDP L+G+ A ++ GLQG D D + A+ KH + +G +
Sbjct: 149 DPRWGRGMETYGEDPFLTGQMAVGFIHGLQGDDLDHPRTIATPKHLAVH-----SGPEPG 203
Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
R F+ VS +D+E T+ FR ++EG+ +VMC+YN ++G P CA ++ +RG+W
Sbjct: 204 RHGFDVDVSPRDVEATYTPAFRAAIVEGQAGAVMCAYNSLHGTPACAADWLINGRVRGDW 263
Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLL 338
G++VSDCD+V +F ++A A++AG DL+CG + L T A+ RG +
Sbjct: 264 GFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAYRELGT--AIARGEV 321
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
E ++ +LV + RLG + P PY LG KDV H+ LAL+AA + IVLLK
Sbjct: 322 DEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAQHRALALQAAAESIVLLK 380
Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
N +LPL+ +AVIGPN+D + NY G + TPL G+
Sbjct: 381 NNANTLPLNA--GTRLAVIGPNADALAALEANYQGTSSAPVTPLLGL 425
Score = 139 bits (350), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 96/278 (34%), Positives = 142/278 (51%), Gaps = 48/278 (17%)
Query: 486 GLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
GL +E E L DR + LP QQ L+ + + K P ++VLMSG + +
Sbjct: 626 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLERAKASGK-PLVVVLMSGSAVALN 684
Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
+AK AI+ A YPGQ+GGTAIA +L G NPGG+LP+T+Y +LP
Sbjct: 685 WAKT--HADAIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFY--RSTKDLPAY----- 735
Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG 655
S GRTYR++KG ++PFG+G+SYT F + +AP + S A +G
Sbjct: 736 --VSYDMKGRTYRYFKGEPLFPFGYGLSYTRFAY---DAPQL----------SSTAVQAG 780
Query: 656 KAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPA 715
++VT V+N G++ G V+ P +P + LV F++VH+ A
Sbjct: 781 STLQVT------------TTVRNTGARAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLAA 828
Query: 716 GAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
G Q+ + N+ + LS VDRSG R + G + + +GG
Sbjct: 829 GEQRTLTFNLD-ARALSDVDRSGQRAVEAGNYTLFVGG 865
>gi|384420163|ref|YP_005629523.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzicola
BLS256]
gi|353463076|gb|AEQ97355.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzicola
BLS256]
Length = 889
Score = 304 bits (778), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 169/407 (41%), Positives = 233/407 (57%), Gaps = 32/407 (7%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
QR DL+ +S +EKV ++ A A+PRLGI YEWWSE LHG++ G A
Sbjct: 39 QRAADLVAHMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------YA 88
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGG---------TAGLTYWSPNVNIFR 161
T FPQ I A+S+N L + +G VVS EARA +N AGLT WSPN+NIFR
Sbjct: 89 TVFPQAIGLAASWNTHLMQQVGTVVSTEARAKFNQAGRPGKDHKRYAGLTIWSPNINIFR 148
Query: 162 DPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD-- 219
DPRWGRG ET GEDP L+G+ A ++ GLQG D D + A+ KH + +G +
Sbjct: 149 DPRWGRGMETYGEDPFLTGQMAVGFIHGLQGDDLDHPRTIATPKHLAVH-----SGPEPG 203
Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
R F+ VS +D+E T+ FR ++EG+ +VMC+YN ++G P CA ++ +RG+W
Sbjct: 204 RHGFDVDVSPRDVEATYTPAFRAAIVEGQAGAVMCAYNSLHGTPACAADWLINGRVRGDW 263
Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLL 338
G++VSDCD+V +F ++A A++AG DL+CG + L T A+ RG +
Sbjct: 264 GFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAYRELGT--AIARGEV 321
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
E ++ +LV + RLG + P PY LG KDV H+ LAL+AA + IVLLK
Sbjct: 322 DEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAQHRALALQAAAESIVLLK 380
Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
N +LPL +AVIGPN+D + NY G + TPL G+
Sbjct: 381 NNANTLPLKA--GTRLAVIGPNADALAALEANYQGTSSAPVTPLLGL 425
Score = 135 bits (340), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 95/278 (34%), Positives = 141/278 (50%), Gaps = 48/278 (17%)
Query: 486 GLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
GL +E E L DR + LP QQ L+ + + K P ++VLMSG + +
Sbjct: 626 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLERAKASGK-PLVVVLMSGSAVALN 684
Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
+AK AI+ A YPGQ+GGTAIA +L G NPGG+LP+T+Y +LP
Sbjct: 685 WAKT--HADAIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFY--RSTKDLPAY----- 735
Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG 655
S GRTYR++KG ++PFG+G+SYT F + +AP + S A +G
Sbjct: 736 --VSYDMKGRTYRYFKGEPLFPFGYGLSYTCFAY---DAPQL----------SSTAVQAG 780
Query: 656 KAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPA 715
++VT V+N G++ G V+ P +P + LV F++VH+ A
Sbjct: 781 STLQVT------------TTVRNTGARAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLAA 828
Query: 716 GAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
G Q+ + N+ + LS VD SG R + G + + +GG
Sbjct: 829 GEQRTLTFNLD-ARALSDVDPSGQRAVEAGNYTLFVGG 865
>gi|289664871|ref|ZP_06486452.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. vasculorum
NCPPB 702]
Length = 886
Score = 303 bits (777), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 175/425 (41%), Positives = 242/425 (56%), Gaps = 38/425 (8%)
Query: 39 TLPFCQVSLPIP------QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALH 92
TLP ++ P QR L+ +S +EKV ++ A A+PRLGI YEWWSE LH
Sbjct: 18 TLPCLALAAPTERAATPEQRAAALVAHMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLH 77
Query: 93 GVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--GGT--- 147
G++ G AT FPQ I A+S+N L + +G VVS EARA +N GG
Sbjct: 78 GIARNG----------HATVFPQAIGLAASWNTNLMQQVGTVVSTEARAKFNQAGGPGKD 127
Query: 148 ----AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAAS 203
AGLT WSPN+NIFRDPRWGRG ET GEDP L+G+ A ++RGLQG D + + A+
Sbjct: 128 HKRYAGLTIWSPNINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLNHPRTIAT 187
Query: 204 CKHFTAYDLDNWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNG 261
KH + +G + R F+ VS D+E T+ FR +++G+ SVMC+YN ++G
Sbjct: 188 PKHLAVH-----SGPEPGRHGFDVDVSPHDVEATYTPAFRAALVQGQAGSVMCAYNSLHG 242
Query: 262 VPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC 321
P CA +L +RG+W G++VSDCD+V +F ++A A++AG DL+C
Sbjct: 243 TPACAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNC 302
Query: 322 G-PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTP 380
G + L T A++RG + E ++ +LV + RLG + P PY LG KDV
Sbjct: 303 GHAYRELGT--AIERGDVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNA 359
Query: 381 DHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTT 440
H+ LAL+AA + IVLLKN +LPL+ +AVIGPN+D + NY G + T
Sbjct: 360 AHRALALQAAAESIVLLKNDANTLPLNA--GTRLAVIGPNADALAALEANYQGTSSAPVT 417
Query: 441 PLQGI 445
PL G+
Sbjct: 418 PLLGL 422
Score = 146 bits (368), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 99/287 (34%), Positives = 148/287 (51%), Gaps = 48/287 (16%)
Query: 477 QADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVL 526
Q+DA + +GL +E E L DR + LP QQ L+ + + K P ++VL
Sbjct: 614 QSDAVVAFVGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLERAKASGK-PLVVVL 672
Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
MSG + + +AK AI+ A YPGQ+GGTAIA +L G NPGG+LP+T+Y +
Sbjct: 673 MSGSAVALNWAKT--HADAIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFY--RSTKD 728
Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
LP S GRTYR++KG ++PFG+G+SYT F + +AP +
Sbjct: 729 LPAY-------VSYDMKGRTYRYFKGEPLFPFGYGLSYTRFAY---DAPQL--------- 769
Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLV 706
S A +G ++VT V+N G++ G V+ P +P + LV
Sbjct: 770 -STTALQAGNPLQVT------------TTVRNTGTRAGDEVAQVYLQYPDRPQSPLRSLV 816
Query: 707 AFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
F++VH+ AG Q+ + N+ + LS VDRSG R + G + + +GG
Sbjct: 817 GFQRVHLAAGEQRTLTFNLD-ARALSDVDRSGQRAVEAGNYTLFVGG 862
>gi|389794400|ref|ZP_10197553.1| beta-glucosidase-related glycosidase [Rhodanobacter fulvus Jip2]
gi|388432423|gb|EIL89432.1| beta-glucosidase-related glycosidase [Rhodanobacter fulvus Jip2]
Length = 902
Score = 303 bits (777), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 170/414 (41%), Positives = 238/414 (57%), Gaps = 29/414 (7%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
+ +S R DL+ ++L+EK + + A A+PRLG+ Y+WW+E LHGV+ G
Sbjct: 47 YRDLSRSFHDRAADLVAHMTLEEKAAQMQNTAPAIPRLGVAAYDWWNEGLHGVARAGQ-- 104
Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYW 153
AT FPQ I A++F+ L + +SDEARA YN G GLTYW
Sbjct: 105 --------ATVFPQAIGLAATFDVPLMHEVATAISDEARAKYNEFQRKGSHGRYEGLTYW 156
Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLD 213
SPN+NIFRDPRWGRGQET GEDP L+ + ++V GLQG + K+ A+ KHF +
Sbjct: 157 SPNINIFRDPRWGRGQETYGEDPYLTERMGVAFVTGLQGDNPTYRKLDATAKHFAVH--- 213
Query: 214 NWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKR 273
+ DR HF+ S++D+ +T+ F+ V E V +VM +YN+VNG P P +L +
Sbjct: 214 SGPEADRHHFDVHPSERDLYETYLPAFQTLVQEADVDAVMSAYNRVNGEPATGSPRLLGQ 273
Query: 274 TIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAV 333
+R +W GY+VSDC +V Y T E A+A A++ G+DLDCG + AV
Sbjct: 274 ILRKDWGFKGYVVSDCGAVEDIYKHHKVVDTVEAASALAVKNGVDLDCGTEYAALVK-AV 332
Query: 334 QRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAARQ 392
GL+ E +I+ AL + +MRLGMFD P+S+ P+ + +P H LA AAR+
Sbjct: 333 HDGLIKESEIDAALTRLMQARMRLGMFD--PASKVPWSDVPYSVNQSPQHDALARRAARE 390
Query: 393 GIVLLKNQGPSLPLSH-IRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
+VLLKN G LPLS I+H +AVIGP +D + ++GNY G T L+GI
Sbjct: 391 SMVLLKNDG-VLPLSKDIKH--IAVIGPTADDVMALVGNYHGTPADPVTILRGI 441
Score = 109 bits (273), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 81/293 (27%), Positives = 129/293 (44%), Gaps = 57/293 (19%)
Query: 482 ILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGP 531
+ GL +E E + DR L LP Q++L+ + K P +LVL SG
Sbjct: 642 VFAGGLTSDVEGEEMKVNYPGFAGGDRTDLRLPATQRKLLEALQATGK-PVVLVLTSGSA 700
Query: 532 IDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTE 591
+ V +A + A++ A YPGQ GG A+AD+LFG ++P G+LP+T+Y LP +
Sbjct: 701 LAVDWANQ--HLPAVLLAWYPGQRGGNAVADVLFGKADPAGRLPVTFYKAS--EKLPAFD 756
Query: 592 MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINA 651
R GRTYR++KG +YPFG+G+SYT F +
Sbjct: 757 -------DYRMDGRTYRYFKGEPLYPFGYGLSYTKFTYA--------------------- 788
Query: 652 TISGKAIKVTHAKCNRL-TLGVQVDVKNVGSKDGAHTLLVF----STPPAGHWAPHKQLV 706
+K+ H K + L V V V N G + G + ++ TP H +K L
Sbjct: 789 -----DLKLDHNKIGKNDKLHVTVKVHNAGKRAGDEVVQLYLRGVGTP---HERSNKDLR 840
Query: 707 AFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGTKHSV 758
+++ + G + V ++ L D + + G + + IG + +
Sbjct: 841 GIQRITLQPGQTRDVSFDVSPATDLRYYDTKKAAYAVDAGRYEVQIGASSADI 893
>gi|346726970|ref|YP_004853639.1| beta-glucosidase-related glycosidase [Xanthomonas axonopodis pv.
citrumelo F1]
gi|346651717|gb|AEO44341.1| Beta-glucosidase-related glycosidase [Xanthomonas axonopodis pv.
citrumelo F1]
Length = 902
Score = 303 bits (777), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 172/435 (39%), Positives = 246/435 (56%), Gaps = 36/435 (8%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
QR DL+ R++L+EK + + A A+PRLG+ Y+WW+EALHGV+ G GA
Sbjct: 44 QRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG----------GA 93
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYWSPNVNIFRD 162
T FPQ I A++F+ L + +SDEARA ++ GLT+WSPN+NIFRD
Sbjct: 94 TVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHARYQGLTFWSPNINIFRD 153
Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVA---------ASCKHFTAYDLD 213
PRWGRGQET GEDP L+ + ++V+GLQG D K A A+ KHF +
Sbjct: 154 PRWGRGQETYGEDPFLTARMGVTFVQGLQGEGADAPKNAQGEPYRKLDATAKHFAVHSGP 213
Query: 214 NWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKR 273
DR HF+A+ S++D+ +T+ F V +GKV +VM +YN+V G A +L+
Sbjct: 214 E---ADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGESASASKFLLQD 270
Query: 274 TIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAV 333
+R +W GY+VSDC ++ + +T E+AAA A++ G +L+CG +AV
Sbjct: 271 VLRQQWGFKGYVVSDCWAIVDIWKHHKIVATREQAAALAVKHGTELECGEEYST-LPAAV 329
Query: 334 QRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQG 393
+GL+ E I+ AL +T +MRLGMFD P P+ + +P H LA AR+
Sbjct: 330 HQGLIDEAQIDTALKTLMTARMRLGMFD-PPGQLPWSTIPASVNQSPAHDALARRTARES 388
Query: 394 IVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI---GRYAR 450
+VLLKN G LPLS + + +AVIGP +D T+ ++GNY G T LQGI A+
Sbjct: 389 LVLLKNDG-LLPLSRAKLKRIAVIGPTADDTMALLGNYYGTPAAPVTVLQGIRAAAPNAQ 447
Query: 451 TIHQQGCKDVACADD 465
++ +G V DD
Sbjct: 448 VLYARGADLVEGRDD 462
Score = 124 bits (312), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 92/303 (30%), Positives = 146/303 (48%), Gaps = 55/303 (18%)
Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
A+D +R AD + V GL +E E + DR L LP Q++L+ + K
Sbjct: 629 ALDVARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEALQATGK- 687
Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
P + VL +G + + +A+ + AI+ A YPGQ GGTA+AD LFG +NPGG+LP+T+Y
Sbjct: 688 PVVAVLTTGSALAIDWAQQ--HLPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFY- 744
Query: 581 QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHT--VANAPTVV 638
+E T + AMR GRTYR++ G +YPFGHG+SYT F ++ + T+
Sbjct: 745 KESETLPAFDDYAMR--------GRTYRYFGGTPLYPFGHGLSYTQFAYSGLRLDRTTIA 796
Query: 639 AVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP--PA 696
A GS+ AT++ VKN G + G + ++ P P
Sbjct: 797 A------DGSLTATVT---------------------VKNTGQRAGDEVVQLYLHPLTPQ 829
Query: 697 GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGTK 755
A K+L F+++ + G Q+ + + L + D + + G + + IG +
Sbjct: 830 RERA-GKELHGFQRIALQPGEQRALHFTLDAKNALRIYDAQRKAYAVDPGAYEVQIGASS 888
Query: 756 HSV 758
+
Sbjct: 889 ADI 891
>gi|188993706|ref|YP_001905716.1| beta-glucosidase [Xanthomonas campestris pv. campestris str. B100]
gi|167735466|emb|CAP53681.1| exported beta-glucosidase [Xanthomonas campestris pv. campestris]
Length = 896
Score = 303 bits (777), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 174/451 (38%), Positives = 251/451 (55%), Gaps = 29/451 (6%)
Query: 35 ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
A T P+ + P+ R DL+ R++L+EK + + A A+PRL + Y+WW+EALHGV
Sbjct: 33 AATPPPPYLDPTQPLQARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPEYDWWNEALHGV 92
Query: 95 SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG----- 149
+ G GAT FPQ I A++F+ L + +SDEARA ++ AG
Sbjct: 93 ARAG----------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLAGGEHKR 142
Query: 150 ---LTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKH 206
LT+WSPN+NIFRDPRWGRGQET GEDP L+ + ++V+GLQ G K+ A+ KH
Sbjct: 143 YQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQQGPYRKLDATAKH 202
Query: 207 FTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCA 266
+ + + DR HF+ S++D+ +T+ F+ V EG VA+VM +YN+VNG A
Sbjct: 203 YAVH---SGPEADRHHFDVHPSERDLYETYLPAFQALVQEGHVAAVMGAYNRVNGESASA 259
Query: 267 DPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLG 326
L+ +R +W +GYIVSDC ++ + TPE AAA ++ G DLDCG
Sbjct: 260 STR-LEGILRRDWGFDGYIVSDCAAIRDIWQNHKIVPTPEAAAALGVKHGTDLDCGDTYA 318
Query: 327 LHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELA 386
+AV+ GL+ E I+ +L + ++RLGMFD P+ P+ + +P H LA
Sbjct: 319 A-LPAAVRAGLIDEATIDRSLTRLMAARLRLGMFD-PPAKVPWAQIPASANQSPQHDALA 376
Query: 387 LEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI- 445
AR+ +VLLKN G LPL R +AV+GP +D ++++GNY G T LQGI
Sbjct: 377 RRTARESLVLLKNDG-LLPLKPTLKR-IAVVGPTADDPMSLLGNYYGTPAAPVTILQGIR 434
Query: 446 --GRYARTIHQQGCKDVACADDQLFGAAIDA 474
A ++ +G V +D A IDA
Sbjct: 435 DAAPQAEVVYARGSDLVEGREDPNAAAPIDA 465
Score = 127 bits (319), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 93/304 (30%), Positives = 149/304 (49%), Gaps = 57/304 (18%)
Query: 471 AIDASRQADATILVMGLDQSIEAEALD----------RAGLLLPGRQQELVSKVSMASKG 520
A+DA+R AD + V GL +E E +D R LP Q+EL+ + A+
Sbjct: 623 AVDAARNADVVVFVGGLTGDVEGEEMDVNYPGFAGGDRTDTRLPKPQRELLQALQ-ATGT 681
Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
P + VL +G + + +A+ + AI+ A YPGQ GGTA+ D+LFG ++PGG+LP+T+Y
Sbjct: 682 PVVAVLTTGSALAIDWAQQ--HVPAILLAWYPGQRGGTAVGDVLFGQASPGGRLPITFYK 739
Query: 581 QEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHT--VANAPTV 637
++ LP + AMR GRTYR++ G +YPFGHG++YT F ++ + TV
Sbjct: 740 ED--ERLPAFDDYAMR--------GRTYRYFDGKPLYPFGHGLAYTQFAYSNLRLDRTTV 789
Query: 638 VAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP--P 695
A G++ AT+S VKN G + G + ++ P P
Sbjct: 790 AA------DGTLRATVS---------------------VKNTGQRAGDEVVQLYLHPLNP 822
Query: 696 AGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGT 754
A K+L F+++ + G + V NI + L + D + + G + + IG +
Sbjct: 823 QRERA-RKELRGFQRITLQPGEHREVSFNITPREALRIYDEQRKAYAVDPGAYELQIGAS 881
Query: 755 KHSV 758
+
Sbjct: 882 SADI 885
>gi|346725879|ref|YP_004852548.1| beta-glucosidase-related glycosidase [Xanthomonas axonopodis pv.
citrumelo F1]
gi|346650626|gb|AEO43250.1| Beta-glucosidase-related glycosidase [Xanthomonas axonopodis pv.
citrumelo F1]
Length = 889
Score = 303 bits (777), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 172/407 (42%), Positives = 236/407 (57%), Gaps = 32/407 (7%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
QR L+ ++S +EKV ++ A A+PRLGI YEWWSE LHG++ G A
Sbjct: 39 QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------YA 88
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--GGT-------AGLTYWSPNVNIFR 161
T FPQ I A+S+N L + +G VVS EARA +N GG AGLT WSPN+NIFR
Sbjct: 89 TVFPQSIGLAASWNTRLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFR 148
Query: 162 DPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD-- 219
DPRWGRG ET GEDP L+G+ A ++RGLQG D + + A+ KH + +G +
Sbjct: 149 DPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLNHPRTIATPKHIAVH-----SGPEPG 203
Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
R F+ VS +D+E T+ FR ++EG+ SVMC+YN ++G P CA +L +RG+W
Sbjct: 204 RHGFDVDVSPRDVEATYTPAFRAAIVEGQAGSVMCAYNSLHGTPACAADWLLNGRVRGDW 263
Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLL 338
G++VSDCD+V +F ++A A++AG DL+CG + L T A+ RG +
Sbjct: 264 GFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAYRELGT--AIARGEV 321
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
E ++ +LV + RLG + P PY LG KDV H+ LAL+AA + IVLLK
Sbjct: 322 DEALLDQSLVRLFATRYRLGELEA-PRKDPYARLGAKDVDNAAHRALALQAAAESIVLLK 380
Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
N +LPL +AVIGPN+D + NY G + TPL G+
Sbjct: 381 NDANTLPLKA--GTRLAVIGPNADALAALEANYQGTSSAPVTPLLGL 425
Score = 135 bits (341), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 94/278 (33%), Positives = 141/278 (50%), Gaps = 48/278 (17%)
Query: 486 GLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
GL +E E L DR + LP QQ L+ + + K P ++VLMSG + +
Sbjct: 626 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLERAKASGK-PLVVVLMSGSAVALN 684
Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
+AK AI+ A YPGQ+GGTAIA +L G NPGG+LP+T+Y +LP
Sbjct: 685 WAKT--HADAIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFY--RSTKDLPAY----- 735
Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG 655
S GRTYR++KG ++ FG+G+SYT F + +AP + L +G
Sbjct: 736 --VSYDMKGRTYRYFKGEPLFAFGYGLSYTRFAY---DAPQLSTTTLQ----------AG 780
Query: 656 KAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPA 715
+++VT V+N G++ G V+ P +P + LV F++VH+ A
Sbjct: 781 SSLQVT------------TTVRNTGARAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLAA 828
Query: 716 GAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
G Q+ + N+ + LS VDRSG R + G + + +GG
Sbjct: 829 GEQRTLTFNLD-ARALSDVDRSGQRAVEAGNYTLFVGG 865
>gi|78048767|ref|YP_364942.1| beta-glucosidase precursor [Xanthomonas campestris pv. vesicatoria
str. 85-10]
gi|78037197|emb|CAJ24942.1| beta-glucosidase precursor [Xanthomonas campestris pv. vesicatoria
str. 85-10]
Length = 889
Score = 303 bits (777), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 172/407 (42%), Positives = 236/407 (57%), Gaps = 32/407 (7%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
QR L+ ++S +EKV ++ A A+PRLGI YEWWSE LHG++ G A
Sbjct: 39 QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------YA 88
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--GGT-------AGLTYWSPNVNIFR 161
T FPQ I A+S+N L + +G VVS EARA +N GG AGLT WSPN+NIFR
Sbjct: 89 TVFPQSIGLAASWNTRLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFR 148
Query: 162 DPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD-- 219
DPRWGRG ET GEDP L+G+ A ++RGLQG D + + A+ KH + +G +
Sbjct: 149 DPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLNHPRTIATPKHIAVH-----SGPEPG 203
Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
R F+ VS +D+E T+ FR ++EG+ SVMC+YN ++G P CA +L +RG+W
Sbjct: 204 RHGFDVDVSPRDVEATYTPAFRAAIVEGQAGSVMCAYNSLHGTPACAADWLLNGRVRGDW 263
Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLL 338
G++VSDCD+V +F ++A A++AG DL+CG + L T A+ RG +
Sbjct: 264 GFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAYRELGT--AIARGEV 321
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
E ++ +LV + RLG + P PY LG KDV H+ LAL+AA + IVLLK
Sbjct: 322 DEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRALALQAAAESIVLLK 380
Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
N +LPL +AVIGPN+D + NY G + TPL G+
Sbjct: 381 NDANTLPLKA--GTRLAVIGPNADALAALEANYQGTSSAPVTPLLGL 425
Score = 135 bits (341), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 94/278 (33%), Positives = 141/278 (50%), Gaps = 48/278 (17%)
Query: 486 GLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
GL +E E L DR + LP QQ L+ + + K P ++VLMSG + +
Sbjct: 626 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLERAKASGK-PLVVVLMSGSAVALN 684
Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
+AK AI+ A YPGQ+GGTAIA +L G NPGG+LP+T+Y +LP
Sbjct: 685 WAKT--HADAIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFY--RSTKDLPAY----- 735
Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG 655
S GRTYR++KG ++ FG+G+SYT F + +AP + L +G
Sbjct: 736 --VSYDMKGRTYRYFKGEPLFAFGYGLSYTRFAY---DAPQLSTTTLQ----------AG 780
Query: 656 KAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPA 715
+++VT V+N G++ G V+ P +P + LV F++VH+ A
Sbjct: 781 SSLQVT------------TTVRNTGARAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLAA 828
Query: 716 GAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
G Q+ + N+ + LS VDRSG R + G + + +GG
Sbjct: 829 GEQRTLTFNLD-ARALSDVDRSGQRAVEAGNYTLFVGG 865
>gi|21243803|ref|NP_643385.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. citri str.
306]
gi|21109396|gb|AAM37921.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. citri str.
306]
Length = 886
Score = 303 bits (776), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 176/425 (41%), Positives = 242/425 (56%), Gaps = 38/425 (8%)
Query: 39 TLPFCQVSLPIP------QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALH 92
TLP ++ P QR L+ ++S +EKV ++ A A+PRLGI YEWWSE LH
Sbjct: 18 TLPCLALATPAKPAGSPEQRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLH 77
Query: 93 GVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--GGT--- 147
G++ G AT FPQ I A+S+N +L + +G VVS EARA +N GG
Sbjct: 78 GIARNG----------YATVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKD 127
Query: 148 ----AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAAS 203
AGLT WSPN+NIFRDPRWGRG ET GEDP L+G+ A ++RGLQG D + + A+
Sbjct: 128 HQRYAGLTIWSPNINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLNHPRTIAT 187
Query: 204 CKHFTAYDLDNWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNG 261
KH + +G + R F+ VS D+E T+ FR ++EG+ SVMC+YN ++G
Sbjct: 188 PKHIAVH-----SGPEPGRHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNALHG 242
Query: 262 VPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC 321
P CA +L +RG+W G++VSDCD+V +F ++A A++AG DL+C
Sbjct: 243 TPVCAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNC 302
Query: 322 G-PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTP 380
G + L T A+ RG + E ++ +LV + RLG + P PY LG KDV
Sbjct: 303 GHAYRELGT--AIARGEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNA 359
Query: 381 DHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTT 440
H+ LAL+AA + IVLLKN +LPL +AVIGPN+D + NY G + T
Sbjct: 360 AHRALALQAAAESIVLLKNDANTLPLRA--GTRLAVIGPNADALAALEANYQGTSSAPVT 417
Query: 441 PLQGI 445
PL G+
Sbjct: 418 PLLGL 422
Score = 139 bits (349), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 95/279 (34%), Positives = 144/279 (51%), Gaps = 48/279 (17%)
Query: 485 MGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDV 534
+GL +E E L DR + LP QQ L+ + + K P ++VLMSG + +
Sbjct: 622 VGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLERAKASGK-PLVVVLMSGSAVAL 680
Query: 535 AFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM 594
+AK AI+ A YPGQ+GGTAIA +L G NPGG+LP+T+Y +LP
Sbjct: 681 NWAKM--HADAIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFY--RSTKDLP------ 730
Query: 595 RPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATIS 654
P S GRTYR++KG ++PFG+G+SYT F + +AP + L +
Sbjct: 731 -PYVSYDMKGRTYRYFKGEPLFPFGYGLSYTRFAY---DAPQLSTTTLQ----------A 776
Query: 655 GKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVP 714
G ++VT V+N G++ G V+ P +P + LV F++VH+
Sbjct: 777 GNPLQVT------------ATVRNTGARAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLA 824
Query: 715 AGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
AG Q+ + ++ + LS VDRSG R + G++ + +GG
Sbjct: 825 AGEQRTLTFHLD-ARALSDVDRSGQRAVEAGDYTLFVGG 862
>gi|197106390|ref|YP_002131767.1| glucan 1,4-beta-glucosidase [Phenylobacterium zucineum HLK1]
gi|196479810|gb|ACG79338.1| glucan 1,4-beta-glucosidase [Phenylobacterium zucineum HLK1]
Length = 888
Score = 303 bits (776), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 185/479 (38%), Positives = 263/479 (54%), Gaps = 41/479 (8%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
+ LP +R DL+ R++L+EK + + A A+PRLG+ Y WW+E LHGV+ G
Sbjct: 38 YRDTRLPAERRAADLVARMTLEEKSRQIGHTAPAIPRLGVPAYNWWNEGLHGVARAGI-- 95
Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY------NGGT---AGLTY 152
AT FPQ I A++++ V+ E RA Y +G T GLT
Sbjct: 96 --------ATVFPQAIGMAATWDVDRMRGTADVIGTEFRAKYAERVHPDGSTDWYRGLTV 147
Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL 212
WSPN+NIFRDPRWGRGQET GEDP L+G+ +++RGLQG D + K A+ KH+ +
Sbjct: 148 WSPNINIFRDPRWGRGQETYGEDPYLTGRMGVAFIRGLQGQDPNFFKTIATAKHYAVHSG 207
Query: 213 DNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILK 272
N R + S D+EDT+ FR V EGKV +VMC+YN V+GVP CA +++
Sbjct: 208 PESN---RHREDVHPSAYDLEDTYLPAFRAAVTEGKVQAVMCAYNAVDGVPACASEDLMD 264
Query: 273 RTIRGEWRLNGYIVSDCDSVGVYY--DTQHFTSTPEEAAADAIRAGLDLDCGPFLG-LHT 329
+ +R +W +G++VSDC + Y D+ + TPEE A+ AG+DL CG + +T
Sbjct: 265 QRLRRDWGFSGHVVSDCGAAANIYREDSLAYVKTPEEGITRALNAGMDLVCGDYRADWNT 324
Query: 330 E-----SAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQE 384
E SAV++G+L E ++ ALV ++RLG+FD P+ P+ + TP+H+
Sbjct: 325 EAEATVSAVRKGMLDETVLDGALVRLFADRIRLGLFD-PPAEVPFSKITAAQNDTPEHRA 383
Query: 385 LALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG 444
++LE A+ + LLKN G LPL R +AV+GPN+D +IGNY G T L G
Sbjct: 384 MSLEMAKASMTLLKNDG-VLPLKG-EPRRIAVVGPNADSVDALIGNYYGTPSNPVTVLAG 441
Query: 445 I-GRY--ARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSI-EAEALDRA 499
I R+ A ++ +G V A + DA ADA GL Q + E AL+ A
Sbjct: 442 IRARFPKAEVVYAEGTGLVGPASLPV----PDAVLCADAACRTKGLKQEVFEGVALEGA 496
Score = 122 bits (307), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 91/286 (31%), Positives = 136/286 (47%), Gaps = 50/286 (17%)
Query: 479 DATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMS 528
D + V GL +E E + DR L LP QQ+L+ ++ K P +LVLM+
Sbjct: 613 DLVVFVGGLTARVEGEEMKLQVPGFAGGDRTSLDLPAPQQDLLRRLHATGK-PVVLVLMN 671
Query: 529 GGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL- 587
G + V +A D + AI+ A YPG GG A+A +L G +P G+LP+T+Y +L
Sbjct: 672 GSALSVNWA--DANLPAIVEAWYPGGEGGHAVAQLLAGDYSPAGRLPVTFY--RSAGDLP 727
Query: 588 PMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHG 647
P + AM+ GRTYR++ G V+YPFG+G+SYT F + AP L R
Sbjct: 728 PFADYAMK--------GRTYRYFGGEVLYPFGYGLSYTRFSY---GAPQ-----LSARSV 771
Query: 648 SINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVA 707
S + I+ V V N G DG + ++ + P P + L
Sbjct: 772 SADGEIT-----------------VTTQVTNTGGMDGEEVVQLYVSHPGRDGTPIRALQG 814
Query: 708 FEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
F+++ + G + V + + LSVVD G RR+ G + +GG
Sbjct: 815 FQRIGLKRGETRPVSFTLK-DRQLSVVDAEGNRRVEPGRVEVWVGG 859
>gi|383114360|ref|ZP_09935124.1| hypothetical protein BSGG_1469 [Bacteroides sp. D2]
gi|313693934|gb|EFS30769.1| hypothetical protein BSGG_1469 [Bacteroides sp. D2]
Length = 863
Score = 303 bits (776), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 172/417 (41%), Positives = 244/417 (58%), Gaps = 31/417 (7%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
P+ L + QR +DL+ RL+L+EKV L+ + + A+PRLGIK YEWW+EALHGV+ G
Sbjct: 27 PYQDTKLTVEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARAGL- 85
Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA---MYNGG-----TAGLTY 152
AT FPQ I A+SFN L + VSDEARA +N GLT
Sbjct: 86 ---------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNEKGQYKRYQGLTM 136
Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYD 211
W+PNVNIFRDPRWGRGQET GEDP LSG+ + VRGLQG D + K+ A KHF +
Sbjct: 137 WTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYDKLHACAKHFAVHS 196
Query: 212 LDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNI 270
WN R FNA+ ++ +D+ +T+ F+ V + V VMC+YN+ G P C +
Sbjct: 197 GPEWN---RHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYNRFEGDPCCGSNRL 253
Query: 271 LKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPE--EAAADAIRAGLDLDCGPFLGLH 328
L + +R +W G +V+DC ++G ++ + + P+ A+ADA+ +G DL+CG
Sbjct: 254 LTQILRNDWGFKGIVVTDCGAIGDFFQRKKHETHPDAAHASADAVLSGTDLECGGNFKSI 313
Query: 329 TESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALE 388
T+ AV++GL+SE IN ++ L + LG + S+ P+ ++ + P H+ELAL+
Sbjct: 314 TD-AVKKGLISEEKINTSVKRLLKARFELGEMN---STHPWSNIPFSVIDCPKHKELALK 369
Query: 389 AARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
A + +VLL+N LPL+ R VAVIGPN++ +V GNY G T L+GI
Sbjct: 370 MAHESLVLLQNNNNILPLN--RQMKVAVIGPNANDSVMQWGNYNGFPSHTVTLLEGI 424
Score = 114 bits (286), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 90/307 (29%), Positives = 140/307 (45%), Gaps = 49/307 (15%)
Query: 472 IDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGP 521
++ + AD I G+ +E E++ DR + LP Q+E+++ + K
Sbjct: 594 LNKLQSADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGK-K 652
Query: 522 TILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQ 581
T+ V SG + A AI+ A YPGQAGGTA+AD+LFG NP G+LP+T+Y
Sbjct: 653 TVFVNFSGSAM--AIVPETQNCDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITFY-- 708
Query: 582 EYITNLPMTE-MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
+ + LP E +M+ GRTYRF +YPFG+G+SYT F + A
Sbjct: 709 KSMQQLPDYEDYSMK--------GRTYRFMTETPLYPFGYGLSYTRFSYGKA-------- 752
Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA 700
++N + K+T + LT + V NVG +DG + V+ P
Sbjct: 753 -------TLNQS------KLTKGEKAILT----IPVSNVGQRDGEEVVQVYICRPDDKEG 795
Query: 701 PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
P K L F++V + G Q V I + + + T R G + I G + + L
Sbjct: 796 PQKTLRGFQRVSIAKGKTQNVQIELPYDSFEWFDAATNTIRPLNGTYKILYGNSSNEKDL 855
Query: 761 HAATLGV 767
++ +
Sbjct: 856 QTCSIQI 862
>gi|418519424|ref|ZP_13085476.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB2388]
gi|410704868|gb|EKQ63347.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB2388]
Length = 886
Score = 303 bits (776), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 176/425 (41%), Positives = 242/425 (56%), Gaps = 38/425 (8%)
Query: 39 TLPFCQVSLPIP------QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALH 92
TLP ++ P QR L+ ++S +EKV ++ A A+PRLGI YEWWSE LH
Sbjct: 18 TLPCLALAAPAKSAGSPDQRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLH 77
Query: 93 GVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--GGT--- 147
G++ G AT FPQ I A+S+N +L + +G VVS EARA +N GG
Sbjct: 78 GIARNG----------YATVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKD 127
Query: 148 ----AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAAS 203
AGLT WSPN+NIFRDPRWGRG ET GEDP L+G+ A ++RGLQG D + + A+
Sbjct: 128 HQRYAGLTIWSPNINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLNHPRTIAT 187
Query: 204 CKHFTAYDLDNWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNG 261
KH + +G + R F+ VS D+E T+ FR ++EG+ SVMC+YN ++G
Sbjct: 188 PKHIAVH-----SGPEPGRHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNALHG 242
Query: 262 VPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC 321
P CA +L +RG+W G++VSDCD+V +F ++A A++AG DL+C
Sbjct: 243 TPVCAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNC 302
Query: 322 G-PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTP 380
G + L T A+ RG + E ++ +LV + RLG + P PY LG KDV
Sbjct: 303 GHAYRELGT--AIARGEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNA 359
Query: 381 DHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTT 440
H+ LAL+AA + IVLLKN +LPL +AVIGPN+D + NY G + T
Sbjct: 360 AHRALALQAAAESIVLLKNDANTLPLRA--GTRLAVIGPNADALAALEANYQGTSSAPVT 417
Query: 441 PLQGI 445
PL G+
Sbjct: 418 PLLGL 422
Score = 136 bits (343), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 94/279 (33%), Positives = 143/279 (51%), Gaps = 48/279 (17%)
Query: 485 MGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDV 534
+GL +E E L DR + LP QQ L+ + + K P ++VLMSG + +
Sbjct: 622 VGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQTLLERAKASGK-PLVVVLMSGSAVAL 680
Query: 535 AFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM 594
+AK AI+ A YPGQ+GGTAIA +L G NPGG+LP+T+Y +LP
Sbjct: 681 NWAKT--HADAIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFY--RSTKDLPAY---- 732
Query: 595 RPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATIS 654
S GRTYR++KG ++PFG+G+SYT F + +AP + L +
Sbjct: 733 ---VSYDMKGRTYRYFKGEPLFPFGYGLSYTRFAY---DAPQLSTTTLQ----------A 776
Query: 655 GKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVP 714
G ++VT V+N G++ G V+ P +P + LV F++VH+
Sbjct: 777 GNPLQVT------------ATVRNTGARAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLA 824
Query: 715 AGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
AG Q+ + ++ + LS VDRSG R + G++ + +GG
Sbjct: 825 AGEQRTLTFHLD-ARALSDVDRSGQRAVEAGDYTLFVGG 862
>gi|78049893|ref|YP_366068.1| beta-glucosidase precursor [Xanthomonas campestris pv. vesicatoria
str. 85-10]
gi|78038323|emb|CAJ26068.1| beta-glucosidase precursor [Xanthomonas campestris pv. vesicatoria
str. 85-10]
Length = 902
Score = 303 bits (776), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 171/435 (39%), Positives = 247/435 (56%), Gaps = 36/435 (8%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
QR DL+ R++L+EK + + A A+PRLG+ Y+WW+EALHGV+ G GA
Sbjct: 44 QRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG----------GA 93
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYWSPNVNIFRD 162
T FPQ I A++F+ L + +SDEARA ++ GLT+WSPN+NIFRD
Sbjct: 94 TVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHARYQGLTFWSPNINIFRD 153
Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVA---------ASCKHFTAYDLD 213
PRWGRGQET GEDP L+ + ++V+GL+G D K A A+ KHF +
Sbjct: 154 PRWGRGQETYGEDPFLTARMGVTFVQGLRGEGADAPKNAQGEPYRKLDATAKHFAVHSGP 213
Query: 214 NWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKR 273
DR HF+A+ S++D+ +T+ F V +GKV +VM +YN+V G A +L+
Sbjct: 214 E---ADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGESASASKFLLQD 270
Query: 274 TIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAV 333
+R +W GY+VSDC ++ + +T E+AAA A++ G +L+CG +AV
Sbjct: 271 VLRQQWGFKGYVVSDCWAIVDIWKHHKIVATREQAAALAVKHGTELECGEEYST-LPAAV 329
Query: 334 QRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQG 393
++GL+ E I+ AL +T +MRLGMFD P P+ + +P H LA AR+
Sbjct: 330 RQGLIDEAQIDTALTTLMTARMRLGMFD-PPGQLPWSTIPASVNQSPAHDALARRTARES 388
Query: 394 IVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI---GRYAR 450
+VLLKN G LPLS + + +AVIGP +D T+ ++GNY G T LQGI A+
Sbjct: 389 LVLLKNDG-LLPLSRAKLKRIAVIGPTADDTMALLGNYYGTPAAPVTVLQGIRAAAPNAQ 447
Query: 451 TIHQQGCKDVACADD 465
++ +G V DD
Sbjct: 448 VLYARGADLVEGRDD 462
Score = 122 bits (307), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 92/303 (30%), Positives = 146/303 (48%), Gaps = 55/303 (18%)
Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
A+D + AD + V GL +E E + DR L LP Q++L+ + K
Sbjct: 629 ALDVASSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEALQATGK- 687
Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
P + VL +G + + +A+ + AI+ A YPGQ GGTA+AD LFG +NPGG+LP+T+Y
Sbjct: 688 PVVAVLTTGSALAIDWAQQ--HLPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFY- 744
Query: 581 QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHT--VANAPTVV 638
+E T + AMR GRTYR++ G +YPFGHG+SYT F ++ + T+
Sbjct: 745 KESETLPAFDDYAMR--------GRTYRYFGGTPLYPFGHGLSYTQFAYSGLRLDRTTIA 796
Query: 639 AVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP--PA 696
A GS+ AT++ VKN G + G + ++ P P
Sbjct: 797 A------DGSLTATVT---------------------VKNTGQRAGDEVVQLYLHPLTPQ 829
Query: 697 GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGTK 755
A K+L F+++ + AG Q+ + + L + D + + G + + IG +
Sbjct: 830 RERA-GKELHGFQRITLQAGEQRALHFILDAKNALRIYDAQRKAYAVDPGAYEVQIGASS 888
Query: 756 HSV 758
+
Sbjct: 889 ADI 891
>gi|386718620|ref|YP_006184946.1| glucan 1,4-beta-glucosidase [Stenotrophomonas maltophilia D457]
gi|384078182|emb|CCH12773.1| Glucan 1,4-beta-glucosidase [Stenotrophomonas maltophilia D457]
Length = 897
Score = 303 bits (775), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 175/436 (40%), Positives = 246/436 (56%), Gaps = 28/436 (6%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
P+ VS QR L+ +++L EK + + A A+ RLG+ Y+WW+E LHGV+ G
Sbjct: 37 PWLDVSASFEQRAASLVAQMTLDEKAAQMQNAAPAIERLGVPAYDWWNEGLHGVARAGQ- 95
Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTY 152
AT FPQ I A++F+ L + +SDEARA ++ G GLT+
Sbjct: 96 ---------ATVFPQAIGLAATFDVPLMGQVATTISDEARAKHHQFLRQGAHGRYQGLTF 146
Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL 212
WSPNVNIFRDPRWGRGQET GEDP L+ + ++VRGLQG D K+ A+ KH +
Sbjct: 147 WSPNVNIFRDPRWGRGQETYGEDPYLTARMGVAFVRGLQGDDPVYRKLDATAKHLAVHSG 206
Query: 213 DNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILK 272
DR HF+A+ S++D+ DT+ F V EG V +VM +YN+V G A +L+
Sbjct: 207 PE---ADRHHFDARPSRRDLYDTYLPAFEALVKEGDVDAVMGAYNRVYGESASASRFLLR 263
Query: 273 RTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESA 332
+R +W GY+VSDC ++ + H +T E AAA A+R G +L+CG SA
Sbjct: 264 DVLRRDWGFKGYVVSDCWAIVDIWKHHHIVTTREAAAALAVRNGTELECGQEYAT-LPSA 322
Query: 333 VQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQ 392
V++GL+SE +I++A+ T +MRLGMFD P + + P H LAL+AA+
Sbjct: 323 VRQGLISEAEIDDAVTRLFTARMRLGMFD-PPERVRWARIPASVNQAPSHDALALKAAQA 381
Query: 393 GIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTI 452
+VLLKN G LPLS R +AV+GP +D T+ ++GNY G T LQGI A+ +
Sbjct: 382 SLVLLKNDG-ILPLSRDIKR-IAVVGPTADDTMALLGNYFGTPAAPVTILQGIREAAKGV 439
Query: 453 ---HQQGCKDVACADD 465
+ +G V DD
Sbjct: 440 EVRYARGVDLVEGRDD 455
Score = 125 bits (313), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 94/300 (31%), Positives = 143/300 (47%), Gaps = 51/300 (17%)
Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
F A+DA+R+AD + V GL +E E + DR L LP Q+ L+ +
Sbjct: 619 FDEALDAAREADVVVFVGGLTGDVEGEEMTVNYPGFAGGDRTDLRLPAPQRTLLEALHAT 678
Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
K P ++VL G I V +A++ + AI+ + YPGQ GGTA+ LFG NP G+LP+T
Sbjct: 679 GK-PVVMVLTGGSAIAVDWAQS--HLPAILMSWYPGQRGGTAVGQALFGDVNPAGRLPVT 735
Query: 578 WYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
+Y LP + AM GRTYR+++G +YPFGHG+SYT F +
Sbjct: 736 FYKAS--EALPAFDDYAME--------GRTYRYFRGTPLYPFGHGLSYTRFDYGT----- 780
Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTL-LVFSTPP 695
+ LD GS+ A G+ LGV VDV N G++ G + L
Sbjct: 781 ---LRLDA--GSLRA--DGR-------------LGVAVDVTNAGTRSGDEVVQLYVRREH 820
Query: 696 AGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRS-GTRRIPLGEHNIHIGGT 754
AG ++L F+++H+ G + V + + L D + + G + + +G +
Sbjct: 821 AGSGDAVQELRGFQRIHLAPGEHRTVTFTLEAAQALRHYDEARAAYEVRPGAYEVRVGAS 880
>gi|21233528|ref|NP_639445.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
str. ATCC 33913]
gi|66770493|ref|YP_245255.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
str. 8004]
gi|21115383|gb|AAM43327.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
str. ATCC 33913]
gi|66575825|gb|AAY51235.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
str. 8004]
Length = 896
Score = 302 bits (774), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 174/451 (38%), Positives = 250/451 (55%), Gaps = 29/451 (6%)
Query: 35 ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
A T P+ + P+ R DL+ R++L+EK + + A A+PRL + Y+WW+EALHGV
Sbjct: 33 AATPPPPYLDPTQPLQARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPEYDWWNEALHGV 92
Query: 95 SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG----- 149
+ G GAT FPQ I A++F+ L + +SDEARA ++ AG
Sbjct: 93 ARAG----------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLAGGEHKR 142
Query: 150 ---LTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKH 206
LT+WSPN+NIFRDPRWGRGQET GEDP L+ + ++V+GLQ G K+ A+ KH
Sbjct: 143 YQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQQGPYRKLDATAKH 202
Query: 207 FTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCA 266
+ + + DR HF+ S++D+ +T+ F+ V EG VA+VM +YN+VNG A
Sbjct: 203 YAVH---SGPEADRHHFDVHPSERDLYETYLPAFQALVQEGHVAAVMGAYNRVNGESASA 259
Query: 267 DPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLG 326
L+ +R +W +GYIVSDC ++ + TPE AAA ++ G DLDCG
Sbjct: 260 S-TRLEGILRRDWGFDGYIVSDCAAIRDIWQNHKIVPTPEAAAALGVKHGTDLDCGDTYA 318
Query: 327 LHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELA 386
+AV+ GL+ E I+ +L + ++RLGMFD P+ P+ +P H LA
Sbjct: 319 A-LPAAVRAGLIDEATIDRSLTRLMAARLRLGMFD-PPAKVPWAQTPASANQSPQHDALA 376
Query: 387 LEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI- 445
AR+ +VLLKN G LPL R +AV+GP +D ++++GNY G T LQGI
Sbjct: 377 RRTARESLVLLKNDG-LLPLKPTLKR-IAVVGPTADDPMSLLGNYYGTPAAPVTILQGIR 434
Query: 446 --GRYARTIHQQGCKDVACADDQLFGAAIDA 474
A ++ +G V +D A IDA
Sbjct: 435 DAAPQAEVVYARGSDLVEGREDPNAAAPIDA 465
Score = 127 bits (319), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 93/304 (30%), Positives = 149/304 (49%), Gaps = 57/304 (18%)
Query: 471 AIDASRQADATILVMGLDQSIEAEALD----------RAGLLLPGRQQELVSKVSMASKG 520
A+DA+R AD + V GL +E E +D R LP Q+EL+ + A+
Sbjct: 623 AVDAARNADVVVFVGGLTGDVEGEEMDVNYPGFAGGDRTDTRLPKPQRELLQALQ-ATGT 681
Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
P + VL +G + + +A+ + AI+ A YPGQ GGTA+ D+LFG ++PGG+LP+T+Y
Sbjct: 682 PVVAVLTTGSALAIDWAQQ--HVPAILLAWYPGQRGGTAVGDVLFGQASPGGRLPITFYK 739
Query: 581 QEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHT--VANAPTV 637
++ LP + AMR GRTYR++ G +YPFGHG++YT F ++ + TV
Sbjct: 740 ED--ERLPAFDDYAMR--------GRTYRYFDGKPLYPFGHGLAYTQFAYSNLRLDRTTV 789
Query: 638 VAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP--P 695
A G++ AT+S VKN G + G + ++ P P
Sbjct: 790 AA------DGTLRATVS---------------------VKNTGQRAGDEVVQLYLHPLNP 822
Query: 696 AGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGT 754
A K+L F+++ + G + V NI + L + D + + G + + IG +
Sbjct: 823 QRERA-RKELRGFQRITLQPGEHREVSFNITPREALRIYDEQRKAYAVDPGAYELQIGAS 881
Query: 755 KHSV 758
+
Sbjct: 882 SADI 885
>gi|374312362|ref|YP_005058792.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
gi|358754372|gb|AEU37762.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
Length = 874
Score = 302 bits (774), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 184/467 (39%), Positives = 252/467 (53%), Gaps = 34/467 (7%)
Query: 44 QVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKF 103
Q I R+++LI ++++ E++ L A A+ RLG+ Y WW+E LHG++ G
Sbjct: 30 QTPQQIDARIDELIAKMTVSERIAQLQDRAPAIERLGLPSYNWWNEGLHGLARDG----- 84
Query: 104 GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY--NGGT-----AGLTYWSPN 156
AT FPQ I A++++A L +G VVS EARA + +GG GLT WSPN
Sbjct: 85 -----YATVFPQAIGLAATWDAPLLHEVGDVVSTEARAKFYSHGGENTPRFGGLTVWSPN 139
Query: 157 VNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWN 216
+NIFRDPRWGRGQET GEDP L+ +V G+QG+D LK A+ KHF A+
Sbjct: 140 INIFRDPRWGRGQETYGEDPFLTATLGTQFVEGVQGNDPFYLKADATPKHFAAHSGPEEG 199
Query: 217 GVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIR 276
R FNA VS D+ DT+ F A++MCSYN+++G P+CA N L+ +R
Sbjct: 200 ---RDSFNAVVSPHDLADTYLPAFHALTTNAHAAALMCSYNEIDGTPSCASGNNLQDLVR 256
Query: 277 GEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRG 336
W GY+VSDCD+VG HF + AADA+ AG+DLDCG ++S + +
Sbjct: 257 ERWGFKGYVVSDCDAVGNIAGYHHFATDNAHGAADALNAGVDLDCGNTYAALSKS-LDQN 315
Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
L +E +N AL L ++RLGM D S PY +G +++ +P H LAL AA + IVL
Sbjct: 316 LTTEAKLNQALHRLLLARVRLGMLD-PLSCSPYRDIGAEELDSPAHHTLALRAAEESIVL 374
Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQG 456
LKN G LPL + V+VIGP +D+ + NY G A TPL G + H
Sbjct: 375 LKNDG-VLPLQASTQK-VSVIGPTADMVKVLEANYHGTALHPITPLDG---FRSRFH--- 426
Query: 457 CKDVACADDQLFGAAIDA--SRQADATILVMGLDQSIEAEALDRAGL 501
DV+ A L + A R A G ++AE D+A L
Sbjct: 427 --DVSYAQGSLLAEGVSAPVPRNALRVAAAPGSSAGLQAEYFDKASL 471
Score = 123 bits (309), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 90/300 (30%), Positives = 140/300 (46%), Gaps = 50/300 (16%)
Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSM 516
L A+ + ++D + +GL +E EAL DR L LP Q+ L+S+++
Sbjct: 594 LLDQAVQTAAKSDVIVAFVGLSPDLEGEALQLRLKGFNGGDRTSLDLPEAQRTLLSRLTQ 653
Query: 517 ASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPM 576
K P I+VL SG VA A ++ A YPG+AGG A+A IL G NP G+LP+
Sbjct: 654 LHK-PVIIVLTSGS--GVALGPEAKDAAGVLEAWYPGEAGGEALAGILAGNVNPSGRLPV 710
Query: 577 TWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAP 635
T+Y + +LP T+ +M RTYR++ GPV++PFG+G+SY++F +
Sbjct: 711 TFY--RSVDDLPAFTDYSM--------AHRTYRYFDGPVLFPFGYGLSYSHFQY------ 754
Query: 636 TVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPP 695
G+ TH L V V N ++G ++ PP
Sbjct: 755 -------------------GQLRLSTHMLKTSEPLVAMVTVHNESQREGTEVAELYLQPP 795
Query: 696 AGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTK 755
AP L ++V + G + + + + LS VD SG R + GE+ + +GG +
Sbjct: 796 QASGAPRLTLQGVQRVALRPGETRELTFKLAPGQ-LSTVDTSGARTVRAGEYKLFVGGVQ 854
>gi|384430040|ref|YP_005639401.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. raphani
756C]
gi|341939144|gb|AEL09283.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. raphani
756C]
Length = 896
Score = 302 bits (774), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 174/451 (38%), Positives = 249/451 (55%), Gaps = 29/451 (6%)
Query: 35 ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
A T P+ + P+ R DL+ R++L+EK + + A A+PRL + Y+WW+EALHGV
Sbjct: 33 AATPPPPYLDPTQPLQARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPEYDWWNEALHGV 92
Query: 95 SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA------ 148
+ G GAT FPQ I A++F+ L + +SDEARA ++ A
Sbjct: 93 ARAG----------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLARGEHKR 142
Query: 149 --GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKH 206
GLT+WSPN+NIFRDPRWGRGQET GEDP L+ + ++V+GLQ G K+ A+ KH
Sbjct: 143 YQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQQGPYRKLDATAKH 202
Query: 207 FTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCA 266
+ + DR HF+ S++D+ +T+ F+ V EG VA+VM +YN+VNG A
Sbjct: 203 YAVHSGPE---ADRHHFDVHPSERDLYETYLPAFQALVQEGHVAAVMGAYNRVNGESASA 259
Query: 267 DPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLG 326
L+ +R +W +GYIVSDC ++ + TPE AAA ++ G DLDCG
Sbjct: 260 STR-LEGILRRDWGFDGYIVSDCAAIRDIWQNHKIVPTPEAAAALGVKHGTDLDCGDTYA 318
Query: 327 LHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELA 386
+AV+ GL+ E I+ +L + ++RLGMFD P+ P+ +P H LA
Sbjct: 319 A-LPAAVRAGLIDEATIDRSLTRLMAARLRLGMFD-PPAKVPWAQTPASANQSPQHDALA 376
Query: 387 LEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI- 445
AR+ +VLLKN G LPL R +AV+GP +D ++++GNY G T LQGI
Sbjct: 377 RRTARESLVLLKNDG-LLPLKPTLKR-IAVVGPTADDPMSLLGNYYGTPAAPVTILQGIR 434
Query: 446 --GRYARTIHQQGCKDVACADDQLFGAAIDA 474
A ++ +G V +D A IDA
Sbjct: 435 DAAPQAEVVYARGSDLVEGREDPNAAAPIDA 465
Score = 122 bits (306), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 91/302 (30%), Positives = 143/302 (47%), Gaps = 53/302 (17%)
Query: 471 AIDASRQADATILVMGLDQSIEAEALD----------RAGLLLPGRQQELVSKVSMASKG 520
A+DA+R AD + V GL +E E +D R LP Q+EL+ + A+
Sbjct: 623 AVDAARNADVVVFVGGLTGDVEGEEMDVNYPGFAGGDRTDTRLPKPQRELLQALQ-ATGT 681
Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
P + VL +G + + +A+ + AI+ A YPGQ GGTA+ D+LFG ++PGG+LP+T+Y
Sbjct: 682 PVVAVLTTGSALAIDWAQQ--HVPAILLAWYPGQRGGTAVGDVLFGQASPGGRLPITFYK 739
Query: 581 QEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
++ LP + AMR GRTYR++ G +YPFGHG++YT F ++
Sbjct: 740 ED--ERLPAFDDYAMR--------GRTYRYFDGKPLYPFGHGLAYTQFAYSNLRLDRTT- 788
Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP--PAG 697
V DG TL V VKN G + G + ++ P P
Sbjct: 789 VAADG------------------------TLRATVWVKNTGQRAGDEVVQLYLHPLNPQR 824
Query: 698 HWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGTKH 756
A K+L F+++ + G + V I + L + D + + G + + IG +
Sbjct: 825 ERA-RKELRGFQRITLQPGEHREVSFTITPREALRIYDEQRKAYAVDPGAYELQIGASSA 883
Query: 757 SV 758
+
Sbjct: 884 DI 885
>gi|424796589|ref|ZP_18222299.1| exported beta-glucosidase [Xanthomonas translucens pv. graminis
ART-Xtg29]
gi|422794891|gb|EKU23686.1| exported beta-glucosidase [Xanthomonas translucens pv. graminis
ART-Xtg29]
Length = 913
Score = 302 bits (774), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 172/444 (38%), Positives = 248/444 (55%), Gaps = 36/444 (8%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
+ QR DL+ R++L+EK + + A A+PRLG+ Y+WW+EALHGV+ G
Sbjct: 37 YLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 93
Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT--------AGLTYW 153
GAT FPQ I A++F+ L + +SDEARA ++ GLT+W
Sbjct: 94 -------GATVFPQAIGMAATFDLPLMHEVSTAISDEARAKHHEALRHDQHARYQGLTFW 146
Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVA---------ASC 204
SPN+NIFRDPRWGRGQET GEDP L+ + ++V+GLQG D K A A+
Sbjct: 147 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGADAPKNAQGDAYRKLDATA 206
Query: 205 KHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
KHF + + DR HF+A S++D+ +T+ F V EGKV +VM +YN+V G
Sbjct: 207 KHFAVH---SGPEADRHHFDAHPSQRDLYETYLPAFEALVKEGKVDAVMGAYNRVYGESA 263
Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPF 324
A +L+ +R W +GY+VSDC ++ + +T E+AAA A+ G +L+CG
Sbjct: 264 SASKFLLRDVLRDTWGFDGYVVSDCWAIVDIWKNHKIVATREQAAALAVNNGTELECGEE 323
Query: 325 LGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQE 384
+AV++GL+SE D++ AL + +MRLGMFD P + + + +P+H
Sbjct: 324 YST-LPAAVRKGLISEADVDKALQKLMYSRMRLGMFD-PPDTLRWAQIPLSANQSPEHDA 381
Query: 385 LALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG 444
LA AR+ +VLLKN G LPLS + + +AVIGP +D T+ ++GNY G T LQG
Sbjct: 382 LARRTARESLVLLKNDG-VLPLSRGKIKRIAVIGPTADDTMALLGNYYGTPAAPVTVLQG 440
Query: 445 IGRY---ARTIHQQGCKDVACADD 465
I A ++ +G V DD
Sbjct: 441 IREAAPDAEVLYARGADLVEGRDD 464
Score = 127 bits (319), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 94/306 (30%), Positives = 148/306 (48%), Gaps = 61/306 (19%)
Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
A+DA+R+AD + V GL +E E + DR L LP Q+EL+ + K
Sbjct: 631 ALDAARRADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRELLEALQGTGK- 689
Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
P + VL +G + + +A+ + AI+ A YPGQ GG+A+AD+LFG +NPGG+LP+T+Y
Sbjct: 690 PVVAVLTTGSALAIDWAQQ--HVPAILLAWYPGQRGGSAVADVLFGDANPGGRLPVTFYK 747
Query: 581 QEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
+ LP + AMR GRTYR++ G +YPFGHG+SYT F ++
Sbjct: 748 ES--EKLPAFDDYAMR--------GRTYRYFAGTALYPFGHGLSYTQFAYS--------D 789
Query: 640 VPLD----GRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP- 694
+ LD GS++AT+ VKN G + G + ++ P
Sbjct: 790 LRLDRSKLATDGSLHATLK---------------------VKNTGQRAGDEVVQLYLHPL 828
Query: 695 -PAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRR-IPLGEHNIHIG 752
P A K+L F+++ + G + V I L + D + + G++ + +G
Sbjct: 829 SPQRERA-RKELRGFQRIALQPGETREVSFAISPQTDLRLYDEARKAYVVDPGDYELQVG 887
Query: 753 GTKHSV 758
+ V
Sbjct: 888 ASSSDV 893
>gi|381169747|ref|ZP_09878910.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
citri pv. mangiferaeindicae LMG 941]
gi|380689765|emb|CCG35397.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
citri pv. mangiferaeindicae LMG 941]
Length = 874
Score = 302 bits (774), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 172/407 (42%), Positives = 236/407 (57%), Gaps = 32/407 (7%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
QR L+ ++S +EKV ++ A A+PRLGI YEWWSE LHG++ G A
Sbjct: 24 QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------YA 73
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--GGT-------AGLTYWSPNVNIFR 161
T FPQ I A+S+N +L + +G VVS EARA +N GG AGLT WSPN+NIFR
Sbjct: 74 TVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSPNINIFR 133
Query: 162 DPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD-- 219
DPRWGRG ET GEDP L+G+ A ++RGLQG D + + A+ KH + +G +
Sbjct: 134 DPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLNHPRTIATPKHIAVH-----SGPEPG 188
Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
R F+ VS D+E T+ FR ++EG+ SVMC+YN ++G P CA +L +RG+W
Sbjct: 189 RHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNALHGTPVCAADWLLNGRVRGDW 248
Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLL 338
G++VSDCD+V +F ++A A++AG DL+CG + L T A+ RG +
Sbjct: 249 GFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAYRELGT--AIARGEV 306
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
E ++ +LV + RLG + P PY LG KDV H+ LAL+AA + IVLLK
Sbjct: 307 DEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRALALQAAAESIVLLK 365
Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
N +LPL +AVIGPN+D + NY G + TPL G+
Sbjct: 366 NDANTLPLRA--GTRLAVIGPNADALAALEANYQGTSSAPVTPLLGL 410
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 95/279 (34%), Positives = 144/279 (51%), Gaps = 48/279 (17%)
Query: 485 MGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDV 534
+GL +E E L DR + LP QQ L+ + + K P ++VLMSG + +
Sbjct: 610 VGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLERAKASGK-PLVVVLMSGSAVAL 668
Query: 535 AFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM 594
+AK AI+ A YPGQ+GGTAIA +L G NPGG+LP+T+Y +LP
Sbjct: 669 NWAKM--HADAIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFY--RSTKDLP------ 718
Query: 595 RPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATIS 654
P S GRTYR++KG ++PFG+G+SYT F + +AP + L +
Sbjct: 719 -PYVSYDMKGRTYRYFKGEPLFPFGYGLSYTRFAY---DAPQLSTTTLQ----------A 764
Query: 655 GKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVP 714
G ++VT V+N G++ G V+ P +P + LV F++VH+
Sbjct: 765 GNPLQVT------------ATVRNTGARAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLA 812
Query: 715 AGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
AG Q+ + ++ + LS VDRSG R + G++ + +GG
Sbjct: 813 AGEQRTLTFHLD-ARALSDVDRSGQRAVEAGDYTLFVGG 850
>gi|390992294|ref|ZP_10262532.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
axonopodis pv. punicae str. LMG 859]
gi|372552957|emb|CCF69507.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
axonopodis pv. punicae str. LMG 859]
Length = 886
Score = 302 bits (773), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 172/407 (42%), Positives = 236/407 (57%), Gaps = 32/407 (7%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
QR L+ ++S +EKV ++ A A+PRLGI YEWWSE LHG++ G A
Sbjct: 36 QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------A 85
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--GGT-------AGLTYWSPNVNIFR 161
T FPQ I A+S+N +L + +G VVS EARA +N GG AGLT WSPN+NIFR
Sbjct: 86 TVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSPNINIFR 145
Query: 162 DPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD-- 219
DPRWGRG ET GEDP L+G+ A ++RGLQG D + + A+ KH + +G +
Sbjct: 146 DPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLNHPRTIATPKHIAVH-----SGPEPG 200
Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
R F+ VS D+E T+ FR ++EG+ SVMC+YN ++G P CA +L +RG+W
Sbjct: 201 RHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNALHGTPVCAADWLLNGRVRGDW 260
Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLL 338
G++VSDCD+V +F ++A A++AG DL+CG + L T A+ RG +
Sbjct: 261 GFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAYRELGT--AIARGEV 318
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
E ++ +LV + RLG + P PY LG KDV H+ LAL+AA + IVLLK
Sbjct: 319 DEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRALALQAAAESIVLLK 377
Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
N +LPL +AVIGPN+D + NY G + TPL G+
Sbjct: 378 NDANTLPLRA--GTRLAVIGPNADALAALEANYQGTSSAPVTPLLGL 422
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 95/279 (34%), Positives = 142/279 (50%), Gaps = 48/279 (17%)
Query: 485 MGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDV 534
+GL +E E L DR + LP QQ L+ + + K P ++VLMSG + +
Sbjct: 622 VGLSPDVEGEELRIDVPGFDGGDRNDIALPAAQQTLLERAKASGK-PLVVVLMSGSAVAL 680
Query: 535 AFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM 594
+AK AI+ A YPGQ+GGTAIA +L G NPGG+LP+T+Y +LP
Sbjct: 681 NWAKT--HADAIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFY--RSTKDLPAY---- 732
Query: 595 RPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATIS 654
S GRTYR++KG ++PFG+G+SYT F + +AP + L +
Sbjct: 733 ---VSYDMKGRTYRYFKGEPLFPFGYGLSYTRFAY---DAPQLSTTTLQ----------A 776
Query: 655 GKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVP 714
G ++VT V+N G++ G V+ P +P + LV F++VH+
Sbjct: 777 GNPLQVT------------ATVRNTGARAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLA 824
Query: 715 AGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
AG Q+ + N+ + LS VDRSG R + G + + +GG
Sbjct: 825 AGEQRTLTFNLD-ARALSDVDRSGQRAVEAGNYTLFVGG 862
>gi|325922365|ref|ZP_08184139.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas gardneri ATCC
19865]
gi|325547147|gb|EGD18227.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas gardneri ATCC
19865]
Length = 889
Score = 302 bits (773), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 173/416 (41%), Positives = 241/416 (57%), Gaps = 33/416 (7%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
QR L+ ++S +EKV ++ A A+PRLGI YEWWSE LHG++ G A
Sbjct: 39 QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------YA 88
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--GGT-------AGLTYWSPNVNIFR 161
T FPQ I A+S+N L + +G VVS EARA +N GG AGLT WSPN+NIFR
Sbjct: 89 TVFPQAIGLAASWNTQLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFR 148
Query: 162 DPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD-- 219
DPRWGRG ET GEDP L+G+ A ++RGLQG D D + A+ KH + +G +
Sbjct: 149 DPRWGRGMETYGEDPFLTGQLAVGFIRGLQGDDLDHPRTIATPKHIAVH-----SGPEPG 203
Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
R F+ VS +D+E T+ FR +++G+ SVMC+YN ++G P CA +L +RG+W
Sbjct: 204 RHSFDVDVSPRDVEATYTPAFRAALIDGQAGSVMCAYNSLHGTPACAADWLLNGRVRGDW 263
Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLL 338
G++VSDCD+V +F ++A +++AG DL+CG + L T A++RG +
Sbjct: 264 GFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAASLKAGHDLNCGYAYRALGT--AIERGEV 321
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
E ++ +LV + RLG + P PY LG KD+ ++ LAL+AA Q IVLLK
Sbjct: 322 DEALLDQSLVRLFAARYRLGELEA-PHKDPYATLGAKDIDNTANRALALKAAAQSIVLLK 380
Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQ 454
N +LPL +AVIGPN+D + NY G + TPL G+ R +HQ
Sbjct: 381 NDANTLPLKA--GARLAVIGPNADALAALEANYQGTSSTPVTPLLGL-RQRFGVHQ 433
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 95/278 (34%), Positives = 142/278 (51%), Gaps = 48/278 (17%)
Query: 486 GLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
GL +E E L DR + LP QQ L+ + + K P ++VLMSG + +
Sbjct: 626 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLERAKASGK-PLVVVLMSGSAVALN 684
Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
+AK AI+ A YPGQ+GGTAIA +L G NPGG+LP+T+Y +LP
Sbjct: 685 WAKT--HADAIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFY--RSTKDLP------- 733
Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG 655
P S GRTYR++KG ++PFG+G+SYT+F + AP + + L + GS
Sbjct: 734 PYVSYDMKGRTYRYFKGEPLFPFGYGLSYTSFAY---GAPQLSSTTL--QAGS------- 781
Query: 656 KAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPA 715
TL V V+N G++ G V+ P +P + LV F++VH+
Sbjct: 782 -------------TLQVTTTVRNTGTRAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLKP 828
Query: 716 GAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
G Q+ + + + LS VDR+G R + G++ + +GG
Sbjct: 829 GEQRTLTFTLD-ARALSDVDRTGQRAVEAGDYTLFVGG 865
>gi|298386950|ref|ZP_06996504.1| beta-glucosidase [Bacteroides sp. 1_1_14]
gi|298260100|gb|EFI02970.1| beta-glucosidase [Bacteroides sp. 1_1_14]
Length = 846
Score = 302 bits (773), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 166/403 (41%), Positives = 245/403 (60%), Gaps = 28/403 (6%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
+ ++ PI +R+ DL+ +L+++EK+ LL + + + R+GI Y +EALHG+ + PG
Sbjct: 23 YKNMNAPIHERIQDLLSKLTIEEKISLLRATSPGIERMGIDKYYMGNEALHGI--IRPG- 79
Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG----------LT 151
KF T FPQ I AS +N L I V+SDEARA +N G LT
Sbjct: 80 KF-------TVFPQAIGLASMWNPELHHIIASVISDEARARWNELERGKKQKDQFSDLLT 132
Query: 152 YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYD 211
+WSP VN+ RDPRWGR ET GEDP LSG ++V+GLQG LK ++ KHF A +
Sbjct: 133 FWSPTVNMARDPRWGRTPETYGEDPYLSGVLGTAFVKGLQGDHPRYLKSVSTPKHFAANN 192
Query: 212 LDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
++ +RF+ +A +++ D+ + + F C+ EGK S+M +YN +NGVP A+ +L
Sbjct: 193 EEH----NRFYCDAAITETDMREYYLPAFEKCIREGKAESIMTAYNAINGVPCTANNWLL 248
Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPF-LGLHTE 330
+ ++ +W NGYIVSDC + G+ + TPE AA AI+AGLDL+CG + G
Sbjct: 249 NKVLKQDWGFNGYIVSDCGAPGLLMTDHRYVKTPEAAAMIAIKAGLDLECGDYVFGAPLL 308
Query: 331 SAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAA 390
+A ++ ++S +I++A + L +MRLGMFD +P PY HL P+ V H+ELALEAA
Sbjct: 309 NAYKQYMVSTAEIDSAAYHVLRARMRLGMFD-DPEKNPYNHLSPEIVGCEKHKELALEAA 367
Query: 391 RQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG 433
RQ IVLLKNQ +LPL+ + +++AV+G N+ G+Y+G
Sbjct: 368 RQSIVLLKNQKNTLPLNAKKIKSIAVVGINA--ANCEFGDYSG 408
Score = 139 bits (349), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 97/296 (32%), Positives = 146/296 (49%), Gaps = 41/296 (13%)
Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
++G A R++D I VMG++QSIE E DR+ + LP QQ + + A+ TI+VL
Sbjct: 586 MYGDASKVIRESDVVIAVMGINQSIEREGQDRSSIELPKDQQIFIREAYKANPN-TIVVL 644
Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
++G + V + D I AII A YPG+ GGTAIA++LFG NP G+LP+T+Y I +
Sbjct: 645 VAGSSMAVGWM--DQNIPAIIDAWYPGEQGGTAIAEVLFGDYNPAGRLPLTFYNS--IED 700
Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
LP A K RTY +++G +Y FG+G+SYT F D R+
Sbjct: 701 LP----AFNDYNVKN--NRTYMYFEGKPLYAFGYGLSYTKF---------------DYRN 739
Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQL 705
+I K + + + VKN G +G V+ P G P KQL
Sbjct: 740 LNI--------------KQDSQNITLNFSVKNSGKYNGDEVAQVYVQFPDLGIKTPLKQL 785
Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
F++VH+ GA +++ I I + D+ P G +N +G + ++ L
Sbjct: 786 KGFKRVHIKKGATEQISIEIPKEELRLWDDQKKQFYTPSGTYNFMVGKSSDNICLQ 841
>gi|46127231|ref|XP_388169.1| hypothetical protein FG07993.1 [Gibberella zeae PH-1]
Length = 712
Score = 301 bits (772), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 201/599 (33%), Positives = 295/599 (49%), Gaps = 74/599 (12%)
Query: 38 RTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNV 97
R+ C + +R L+ L+ +EKV L+S A PR+G+ Y WW+EALHGV+
Sbjct: 37 RSNGICDTTASPAERAAALVSALTPREKVNNLVSNATGAPRIGLPRYNWWNEALHGVAG- 95
Query: 98 GPGTKFGGDFP--GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG-LTYWS 154
PG + P ATSFP + S+F+ L IG V+ EARA NGG G + YW+
Sbjct: 96 APGNDYNDKPPYDSATSFPMPLLMGSTFDDDLIHDIGEVIGTEARAWNNGGWGGGVDYWT 155
Query: 155 PNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDN 214
PNVN F+DPRWGRG ETPGED + + + Y R ++
Sbjct: 156 PNVNPFKDPRWGRGSETPGEDAL----HVSRYARAME----------------------- 188
Query: 215 WNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRT 274
C + KV S+MCSYN VNG+P CA+ + +
Sbjct: 189 ----------------------------CTRDAKVGSIMCSYNAVNGIPACANSYLQETL 220
Query: 275 IRGEWRL---NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTES 331
+R W N +I SDC ++ + ++T T EAA A G D C
Sbjct: 221 LRKHWNWTHTNNWITSDCGAMQDIWQHHNYTKTGAEAAKAAFENGQDSSCEYTTTKDISD 280
Query: 332 AVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAAR 391
+ ++GLL+E ++ AL + G FDG+ S + L DV T Q+LAL++A
Sbjct: 281 SYEQGLLTEKVMDRALKRLFEGLVHTGFFDGDKSE--WSSLDFDDVNTRHAQDLALQSAV 338
Query: 392 QGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL---QGIGRY 448
+G VLLKN +LPL+ + +VA+IG +D + G Y+G A TP + +G
Sbjct: 339 RGAVLLKNDN-TLPLNIKKKESVALIGFWADDKTKLQGGYSGPAPHVRTPAYAAKMLGLN 397
Query: 449 ARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQ 508
++ + D+ A ++A++++D + + GLD + E DR L P Q
Sbjct: 398 TNVAWGPTLQNSSVPDNWTTNA-LEAAKKSDYIVYLGGLDATAAGEERDRTDLDWPSTQL 456
Query: 509 ELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTS 568
L+ K+S K P ++V + D KN + +I+W YPGQ GGTA+ +++ G
Sbjct: 457 TLLKKLSNLGK-PLVVVQLGDQVDDTPLLKNK-GVNSILWVNYPGQEGGTAVMELITGRK 514
Query: 569 NPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNF 627
P G+LP+T YP +Y + M EM +RP++S PGRTYR+Y V+ PFG G YT F
Sbjct: 515 GPAGRLPLTQYPSKYTEQVGMLEMELRPTKSS--PGRTYRWYSDSVL-PFGFGKHYTTF 570
>gi|160886913|ref|ZP_02067916.1| hypothetical protein BACOVA_04927 [Bacteroides ovatus ATCC 8483]
gi|423288977|ref|ZP_17267828.1| hypothetical protein HMPREF1069_02871 [Bacteroides ovatus
CL02T12C04]
gi|423294866|ref|ZP_17272993.1| hypothetical protein HMPREF1070_01658 [Bacteroides ovatus
CL03T12C18]
gi|156107324|gb|EDO09069.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus ATCC 8483]
gi|392668741|gb|EIY62235.1| hypothetical protein HMPREF1069_02871 [Bacteroides ovatus
CL02T12C04]
gi|392676057|gb|EIY69498.1| hypothetical protein HMPREF1070_01658 [Bacteroides ovatus
CL03T12C18]
Length = 863
Score = 301 bits (772), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 172/417 (41%), Positives = 243/417 (58%), Gaps = 31/417 (7%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
P+ L QR +DL+ RL+L+EKV L+ + + A+PRLGIK YEWW+EALHGV+ G
Sbjct: 27 PYQDTKLTAEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARAGL- 85
Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA---MYNGG-----TAGLTY 152
AT FPQ I A+SFN L + VSDEARA +N GLT
Sbjct: 86 ---------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNEKGQYKRYQGLTM 136
Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYD 211
W+PNVNIFRDPRWGRGQET GEDP LSG+ + VRGLQG D + K+ A KHF +
Sbjct: 137 WTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYDKLHACAKHFAVHS 196
Query: 212 LDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNI 270
WN R FNA+ ++ +D+ +T+ F+ V + V VMC+YN+ G P C +
Sbjct: 197 GPEWN---RHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYNRFEGDPCCGSNRL 253
Query: 271 LKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPE--EAAADAIRAGLDLDCGPFLGLH 328
L + +R +W G +V+DC ++G ++ + + P+ A+ADA+ +G DL+CG
Sbjct: 254 LTQILRNDWGFKGIVVTDCGAIGDFFQRKKHETHPDAAHASADAVLSGTDLECGGNFKSI 313
Query: 329 TESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALE 388
T+ AV++GL+SE IN ++ L + LG + S+ P+ ++ + P H+ELAL+
Sbjct: 314 TD-AVKKGLISEEKINTSVKRLLKARFELGEMN---STHPWSNIPFSVIDCPKHKELALK 369
Query: 389 AARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
A + +VLL+N LPL+ R VAVIGPN++ +V GNY G T L+GI
Sbjct: 370 MAHESLVLLQNNNNILPLN--RQMKVAVIGPNANDSVMQWGNYNGFPSHTVTLLEGI 424
Score = 115 bits (287), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 90/307 (29%), Positives = 140/307 (45%), Gaps = 49/307 (15%)
Query: 472 IDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGP 521
++ + AD I G+ +E E++ DR + LP Q+E+++ + K
Sbjct: 594 LNKLQSADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGK-K 652
Query: 522 TILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQ 581
T+ V SG + A AI+ A YPGQAGGTA+AD+LFG NP G+LP+T+Y
Sbjct: 653 TVFVNFSGSAM--AIVPETQNCDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITFY-- 708
Query: 582 EYITNLPMTE-MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
+ + LP E +M+ GRTYRF +YPFG+G+SYT F + A
Sbjct: 709 KSMQQLPDYEDYSMK--------GRTYRFMTETPLYPFGYGLSYTRFSYGKA-------- 752
Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA 700
++N + K+T + LT + V NVG +DG + V+ P
Sbjct: 753 -------TLNQS------KLTKGEKAILT----IPVSNVGQRDGEEVVQVYICRPDDKEG 795
Query: 701 PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
P K L F++V + G Q V I + + + T R G + I G + + L
Sbjct: 796 PQKTLRGFQRVSIAKGKTQNVQIELPYDSFEWFDAATNTIRPLNGTYKILYGNSSNEKDL 855
Query: 761 HAATLGV 767
++ +
Sbjct: 856 QTCSIQI 862
>gi|293370402|ref|ZP_06616956.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
gi|292634550|gb|EFF53085.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
Length = 863
Score = 301 bits (771), Expect = 8e-79, Method: Compositional matrix adjust.
Identities = 171/417 (41%), Positives = 243/417 (58%), Gaps = 31/417 (7%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
P+ L + QR +DL+ RL+L+EKV L+ + + A+PRLGIK YEWW+EALHGV+ G
Sbjct: 27 PYQDTKLTVEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARAGL- 85
Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA---MYNGG-----TAGLTY 152
AT FPQ I A+SFN L + VSDEARA +N GLT
Sbjct: 86 ---------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNEKGQYKRYQGLTM 136
Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYD 211
W+PNVNIFRDPRWGRGQET GEDP LSG+ + VRGLQG D + K+ A KHF +
Sbjct: 137 WTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYDKLHACAKHFAVHS 196
Query: 212 LDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNI 270
WN R FNA+ ++ +D+ +T+ F+ V + V VMC+YN+ G P C +
Sbjct: 197 GPEWN---RHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYNRFEGDPCCGSNRL 253
Query: 271 LKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPE--EAAADAIRAGLDLDCGPFLGLH 328
L + +R +W G +V+DC ++G ++ + + P+ A+ADA+ +G DL+CG
Sbjct: 254 LTQILRNDWGFKGIVVTDCGAIGDFFQRKKHETHPDAAHASADAVLSGTDLECGGNFKSI 313
Query: 329 TESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALE 388
T+ AV++ L+SE IN ++ L + LG + S+ P+ ++ + P H+ELAL+
Sbjct: 314 TD-AVKKDLISEEKINTSVKRVLKARFELGEMN---STHPWSNIPFSVIDCPKHKELALK 369
Query: 389 AARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
A + +VLL+N LPL+ R VAVIGPN++ +V GNY G T L+GI
Sbjct: 370 MAHESLVLLQNNNNILPLN--RQMKVAVIGPNANDSVMQWGNYNGFPSHTVTLLEGI 424
Score = 115 bits (287), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 90/307 (29%), Positives = 140/307 (45%), Gaps = 49/307 (15%)
Query: 472 IDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGP 521
++ + AD I G+ +E E++ DR + LP Q+E+++ + K
Sbjct: 594 LNKLQSADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGK-K 652
Query: 522 TILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQ 581
T+ V SG + A AI+ A YPGQAGGTA+AD+LFG NP G+LP+T+Y
Sbjct: 653 TVFVNFSGSAM--AIVPETQNCDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITFY-- 708
Query: 582 EYITNLPMTE-MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
+ + LP E +M+ GRTYRF +YPFG+G+SYT F + A
Sbjct: 709 KSMQQLPDYEDYSMK--------GRTYRFMTETPLYPFGYGLSYTRFSYGKA-------- 752
Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA 700
++N + K+T + LT + V NVG +DG + V+ P
Sbjct: 753 -------TLNQS------KLTKGEKAILT----IPVSNVGQRDGEEVVQVYICRPDDKEG 795
Query: 701 PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
P K L F++V + G Q V I + + + T R G + I G + + L
Sbjct: 796 PQKTLRGFQRVSIAKGKTQNVQIELPYDSFEWFDAATNTIRPLNGTYKILYGNSSNEKDL 855
Query: 761 HAATLGV 767
++ +
Sbjct: 856 QTCSIQI 862
>gi|408824590|ref|ZP_11209480.1| Glucan 1,4-beta-glucosidase [Pseudomonas geniculata N1]
Length = 897
Score = 300 bits (769), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 174/436 (39%), Positives = 246/436 (56%), Gaps = 28/436 (6%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
P+ VS QR L+ +++L+EK + + A A+ RLG+ Y+WW+E LHGV+ G
Sbjct: 37 PWLDVSASFEQRAAALVAQMTLEEKAAQMQNAAPAIERLGVPAYDWWNEGLHGVARAGQ- 95
Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTY 152
AT FPQ I A++F+ L + +SDEARA ++ G GLT+
Sbjct: 96 ---------ATVFPQAIGLAATFDVPLMGQVAATISDEARAKHHQFLREGAHGRYQGLTF 146
Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL 212
WSPNVNIFRDPRWGRGQET GEDP L+ + ++VRGLQG D K+ A+ KH +
Sbjct: 147 WSPNVNIFRDPRWGRGQETYGEDPYLTARMGVAFVRGLQGDDPVYRKLDATAKHLAVHSG 206
Query: 213 DNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILK 272
DR HF+A+ S++D+ DT+ F V EG V +VM +YN+V G A +L+
Sbjct: 207 PE---ADRHHFDARPSRRDLYDTYLPAFEALVKEGDVDAVMGAYNRVYGESASASRFLLR 263
Query: 273 RTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESA 332
+R +W GY+VSDC ++ + +T E AAA A+R G +L+CG SA
Sbjct: 264 DVLRRDWGFKGYVVSDCWAIVDIWKHHRIVTTREAAAALAVRNGTELECGQEYAT-LPSA 322
Query: 333 VQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQ 392
V++GL+SE +I++A+ T +MRLGMFD P + + P H LAL+AA+
Sbjct: 323 VRQGLISEAEIDDAVTRLFTARMRLGMFD-PPERVRWARIPASVNQAPAHDALALKAAQA 381
Query: 393 GIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTI 452
+VLLKN G LPLS R +AV+GP +D T+ ++GNY G T LQGI A+ +
Sbjct: 382 SLVLLKNDG-ILPLSR-NTRRIAVVGPTADDTMALLGNYFGTPAAPVTILQGIREAAKGV 439
Query: 453 ---HQQGCKDVACADD 465
+ +G V DD
Sbjct: 440 EVRYARGVDLVEGRDD 455
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 84/291 (28%), Positives = 129/291 (44%), Gaps = 53/291 (18%)
Query: 482 ILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGP 531
+ V GL +E E + DR L LP Q+ L+ + K P ++VL G
Sbjct: 633 VFVGGLTGDVEGEEMTVNYPGFAGGDRTDLRLPAPQRTLLEALHGTGK-PVVMVLTGGSA 691
Query: 532 IDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTE 591
I V +A+ + AI+ + YPGQ GGTA+ LFG NP G+LP+T+Y
Sbjct: 692 IAVDWAQA--HLPAILMSWYPGQRGGTAVGQALFGDVNPSGRLPVTFY---------KAG 740
Query: 592 MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTV--ANAPTVVAVPLDGRHGSI 649
AM GRTYR+++G +YPFGHG+SYT F + +A ++ A DGR
Sbjct: 741 EAMPAFDDYAMEGRTYRYFRGTPLYPFGHGLSYTRFDYGTLRLDADSLRA---DGR---- 793
Query: 650 NATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAF 708
LGV VDV N G++ G + L AG ++L F
Sbjct: 794 --------------------LGVAVDVANTGTRSGDEVVQLYVRREHAGSGDAVQELRGF 833
Query: 709 EKVHVPAGAQQRVGINIHVCKYLSVVDRS-GTRRIPLGEHNIHIGGTKHSV 758
++V + G ++ V + + L D + + G + + +G + +
Sbjct: 834 QRVQLAPGERRTVTFTLEAAQALRHYDEARAAYAVQPGAYEVRVGASSADI 884
>gi|329851587|ref|ZP_08266344.1| beta-xylosidase B [Asticcacaulis biprosthecum C19]
gi|328840433|gb|EGF90005.1| beta-xylosidase B [Asticcacaulis biprosthecum C19]
Length = 883
Score = 300 bits (768), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 177/446 (39%), Positives = 250/446 (56%), Gaps = 40/446 (8%)
Query: 12 LILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLIS 71
L+ LSA ++ A+ P TT+T QR DL+ R+SL+EK LI+
Sbjct: 18 LVCLSAPTA--QAQNPLESPAYQDTTKT---------AEQRAADLVSRMSLEEKAAQLIN 66
Query: 72 GAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAI 131
A A+PRLG++ Y WW+E LHGV+ G AT FPQ + A++F+ L +
Sbjct: 67 DAPAIPRLGVREYNWWNEGLHGVAAHG----------YATVFPQAVGMAATFDEPLIHRV 116
Query: 132 GRVVSDEARAMY------NGGT---AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKY 182
+S E RA Y GG+ GLT WSPN+NIFRDPRWGRGQET GEDP L+ +
Sbjct: 117 ADTISVEFRAKYVASRHRFGGSDWFRGLTVWSPNINIFRDPRWGRGQETYGEDPYLTARI 176
Query: 183 AASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRM 242
++V+GLQG D + A+ KH+ + R N S+ D+EDT+ FR
Sbjct: 177 GVAFVKGLQGEDPVYYRTIATPKHYAVHSGPE---ASRHRDNINPSRYDLEDTYLPAFRA 233
Query: 243 CVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVG-VYYDT-QH 300
++EGK S+MC+YN ++G P CA+ ++L + +R +W G++VSDCD+VG +YY T H
Sbjct: 234 TIVEGKAVSIMCAYNAIDGQPACANDDLLVKHLRQDWGFKGFVVSDCDAVGDIYYKTSHH 293
Query: 301 FTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGM 359
+ TPEE A +AG DL CG H SAV++G+L E ++ ALV + + +LG
Sbjct: 294 YRPTPEEGVTVAYQAGTDLICGNANEADHVASAVRKGILPESLVDTALVRLFSARFKLGQ 353
Query: 360 FDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGP 419
FD P +Q + + D T +++ + A +VLLKN G LPL RT+AVIGP
Sbjct: 354 FD--PPAQVFPAITADDYDTQANRDFSQHVAESAMVLLKNDG-LLPLKS-EPRTIAVIGP 409
Query: 420 NSDVTVTMIGNYAGIACGYTTPLQGI 445
N+D +++GNY G T L GI
Sbjct: 410 NADTMDSLVGNYNGDPSHPVTVLAGI 435
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 95/294 (32%), Positives = 154/294 (52%), Gaps = 50/294 (17%)
Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
A+ A++++D I V GL Q +E E + DR L LP QQ+++ +VS K
Sbjct: 600 AVAAAKESDLVIFVAGLSQRVEGEEMRVETPGFSGGDRTSLDLPPVQQKVLEQVSATGK- 658
Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
P +LVL++G + V +A D + AI+ A YPG GG A+A ++ G +P G+LP+T+Y
Sbjct: 659 PVVLVLINGSALSVNWA--DKNVPAIVEAWYPGGQGGAAVARLIAGDFSPAGRLPVTFY- 715
Query: 581 QEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
+P T+ M+ GRTYR++KG +YPFG+G+SYT F + AP ++
Sbjct: 716 -RSADQIPAFTDYTMK--------GRTYRYFKGEALYPFGYGLSYTKFSY----APAKLS 762
Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW 699
A ++G +VT V VDV N G++DG + ++ + P
Sbjct: 763 A----------AKVAGNG-EVT----------VSVDVTNSGARDGDEVVQLYLSHPGQKD 801
Query: 700 APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
P + L F+++H+ AG + V + + LS V+ G+R + G+ N+ +GG
Sbjct: 802 TPIRALARFDRIHLKAGETKTVTFTLD-SRALSTVNADGSRSVKPGKVNLWLGG 854
>gi|390991557|ref|ZP_10261819.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
axonopodis pv. punicae str. LMG 859]
gi|372553724|emb|CCF68794.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
axonopodis pv. punicae str. LMG 859]
Length = 901
Score = 300 bits (767), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 173/458 (37%), Positives = 252/458 (55%), Gaps = 36/458 (7%)
Query: 28 FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
C A T P+ R DL+ R++L+EK + + A A+PRL + Y+WW
Sbjct: 20 LGCATGAAHAATPPYLDTQRSFEARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWW 79
Query: 88 SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--- 144
+EALHGV+ G GAT FPQ I A++F+ L + +SDEARA ++
Sbjct: 80 NEALHGVARAG----------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFL 129
Query: 145 -----GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG------- 192
GLT+WSPN+NIFRDPRWGRGQET GEDP L+ + ++V+GLQG
Sbjct: 130 RQNQHARYQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGAAAPK 189
Query: 193 -SDGDRL-KVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVA 250
+ G+ K+ A+ KHF + DR HF+A+ S++D+ +T+ F V +GKV
Sbjct: 190 NAQGEPYRKLDATAKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVKDGKVD 246
Query: 251 SVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAA 310
+VM +YN+V G A +L+ +R +W GY+VSDC ++ + +T E+AAA
Sbjct: 247 AVMGAYNRVYGESASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVATREQAAA 306
Query: 311 DAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYG 370
A++ G +L+CG +AV++GL+ E I+ AL +T +MRLGMFD P P+
Sbjct: 307 LAVKHGTELECGEEYAT-LPAAVRQGLIDEAQIDTALKTLMTARMRLGMFD-PPGQLPWS 364
Query: 371 HLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGN 430
+ +P H LA AR+ +VLLKN G LPLS + + +AVIGP +D T+ ++GN
Sbjct: 365 TIPASVNQSPAHDALARRTARESLVLLKNDG-LLPLSRAKLKRIAVIGPTADDTMALLGN 423
Query: 431 YAGIACGYTTPLQGI---GRYARTIHQQGCKDVACADD 465
Y G T LQGI A+ ++ +G V DD
Sbjct: 424 YYGTPAAPVTVLQGIRAAAPKAQVLYARGADLVEGRDD 461
Score = 128 bits (321), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 94/305 (30%), Positives = 149/305 (48%), Gaps = 59/305 (19%)
Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
A+D +R AD + V GL +E E + DR L LP Q++L+ + A+
Sbjct: 628 ALDVARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEALQ-ATGR 686
Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
P + VL +G + + +A+ + AI+ A YPGQ GGTA+AD LFG +NPGG+LP+T+Y
Sbjct: 687 PVVAVLTTGSALAIDWAQQ--HLPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFY- 743
Query: 581 QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHT-VANAPTVVA 639
+E T + AMR GRTYR++ G +YPFGHG+SYT F ++ + T +A
Sbjct: 744 KESETLPAFDDYAMR--------GRTYRYFGGTPLYPFGHGLSYTQFAYSGLRLDRTTIA 795
Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW 699
GS+ AT++ VKN G + G + ++ P
Sbjct: 796 T-----DGSLTATVT---------------------VKNTGQRAGDEVVQLYLHP----L 825
Query: 700 APH-----KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGG 753
AP K+L F+++ + G Q+ +G I+ L + D + + G + + IG
Sbjct: 826 APQRERAGKELHGFQRIALQPGEQRELGFTINAKDALRLYDEQRKAYGVDPGAYEVQIGA 885
Query: 754 TKHSV 758
+ +
Sbjct: 886 SSADI 890
>gi|418518029|ref|ZP_13084183.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB1386]
gi|410705279|gb|EKQ63755.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB1386]
Length = 886
Score = 300 bits (767), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 170/407 (41%), Positives = 235/407 (57%), Gaps = 32/407 (7%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
QR L+ ++S +EKV ++ A A+PRLGI YEWWSE LHG++ G A
Sbjct: 36 QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------A 85
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--GGT-------AGLTYWSPNVNIFR 161
T FPQ I A+S+N +L + +G VVS EARA +N GG AGLT WSPN+NIFR
Sbjct: 86 TVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSPNINIFR 145
Query: 162 DPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD-- 219
DPRWGRG ET GEDP L+G+ A ++RGLQG D + + A+ KH + +G +
Sbjct: 146 DPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLNHPRTIATPKHIAVH-----SGPEPG 200
Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
R F+ VS D+E T+ FR ++EG+ SVMC+YN ++G P CA +L +RG+W
Sbjct: 201 RHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNALHGTPVCAADWLLNGRVRGDW 260
Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLL 338
G++VSDCD++ +F ++ A++AG DL+CG + L T A+ RG +
Sbjct: 261 GFKGFVVSDCDAIDDMTQFHYFRPDNAGSSVAALKAGHDLNCGHAYRELGT--AIARGEV 318
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
E ++ +LV + RLG + P PY LG KDV H+ LAL+AA + IVLLK
Sbjct: 319 DEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNVAHRALALQAAAESIVLLK 377
Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
N +LPL +AVIGPN+D + NY G + TPL G+
Sbjct: 378 NDANTLPLRA--GTRLAVIGPNADALAALEANYQGTSSAPVTPLLGL 422
Score = 133 bits (335), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 93/278 (33%), Positives = 141/278 (50%), Gaps = 48/278 (17%)
Query: 486 GLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
GL +E E L DR + LP QQ L+ + + K P ++VLMSG + +
Sbjct: 623 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLERAKASGK-PLVVVLMSGSAVALN 681
Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
+AK AI+ A YPGQ+GGTAIA +L G NPGG+LP+T+Y +LP
Sbjct: 682 WAKT--HADAIMAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFY--RSTKDLPAY----- 732
Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG 655
S GRTYR++KG ++PFG+G+SYT F + +AP + L +G
Sbjct: 733 --VSYDMKGRTYRYFKGEPLFPFGYGLSYTRFAY---DAPQLSTTTLQ----------AG 777
Query: 656 KAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPA 715
++V V+N G++ G V+ P +P + LV F++VH+ A
Sbjct: 778 NPLQVI------------ATVRNTGARAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLAA 825
Query: 716 GAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
G Q+ + ++ + LS VDRSG R + G++ + +GG
Sbjct: 826 GEQRTLTFHLD-ARALSDVDRSGQRAVEAGDYTLFVGG 862
>gi|298482082|ref|ZP_07000270.1| beta-glucosidase [Bacteroides sp. D22]
gi|298271639|gb|EFI13212.1| beta-glucosidase [Bacteroides sp. D22]
Length = 863
Score = 300 bits (767), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 171/417 (41%), Positives = 242/417 (58%), Gaps = 31/417 (7%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
P+ L QR +DL+ RL+L+EKV L+ + + A+PRLGIK YEWW+EALHGV+ G
Sbjct: 27 PYQDTKLTAEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARAGL- 85
Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA---MYN-----GGTAGLTY 152
AT FPQ I A+SFN L + VSDEARA +N GLT
Sbjct: 86 ---------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNERGQYKRYQGLTM 136
Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYD 211
W+PNVNIFRDPRWGRGQET GEDP LSG+ + VRGLQG D + K+ A KHF +
Sbjct: 137 WTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAVVRGLQGPEDAEYDKLHACAKHFAVHS 196
Query: 212 LDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNI 270
WN R FNA+ ++ +D+ +T+ F+ V + V VMC+YN+ G P C +
Sbjct: 197 GPEWN---RHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYNRFEGDPCCGSNRL 253
Query: 271 LKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPE--EAAADAIRAGLDLDCGPFLGLH 328
L + +R +W G +V+DC ++G ++ + + P+ A+ADA+ G DL+CG
Sbjct: 254 LTQILRNDWGFKGIVVTDCGAIGDFFQRKKHETHPDAVHASADAVLNGTDLECGGNFKSI 313
Query: 329 TESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALE 388
T+ AV++GL+SE IN ++ L + LG + + P+ ++ + P H+ELAL+
Sbjct: 314 TD-AVKKGLISEEKINTSVKRLLKARFELGEMN---PTHPWSNIPYSVIDCPKHKELALK 369
Query: 389 AARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
A + +VLL+N+ LPL+ R VAVIGPN++ +V GNY G T L+GI
Sbjct: 370 MAHESLVLLQNKNNILPLN--RQMKVAVIGPNANDSVMQWGNYNGFPSHTVTLLEGI 424
Score = 118 bits (295), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 90/308 (29%), Positives = 139/308 (45%), Gaps = 51/308 (16%)
Query: 472 IDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGP 521
++ + AD I G+ +E E++ DR + LP Q+E+++ + K
Sbjct: 594 LNKLKNADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGK-K 652
Query: 522 TILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQ 581
T+ V SG + A AI+ A YPGQAGGTA+AD+LFG NP G+LP+T+Y
Sbjct: 653 TVFVNFSGSAM--AIVPETQSCDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITFY-- 708
Query: 582 EYITNLPMTE-MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
+ I LP E +M+ GRTYRF +YPFG+G+SYT F +
Sbjct: 709 KSIQQLPDYEDYSMK--------GRTYRFMTETPLYPFGYGLSYTRFSY----------- 749
Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGV-QVDVKNVGSKDGAHTLLVFSTPPAGHW 699
GKA + +K N+ + + V NVG +DG + V+ P
Sbjct: 750 --------------GKAT-LNQSKLNKGEKAILTIPVSNVGQRDGEEVVQVYICRPDDKE 794
Query: 700 APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVS 759
P K L F++V++ G Q V I + + + T R G + I G + +
Sbjct: 795 GPQKTLRGFQRVNIAKGKTQNVSIELPYDSFEWFDTATNTIRPLSGTYKILYGNSSNEND 854
Query: 760 LHAATLGV 767
L ++ +
Sbjct: 855 LQTCSIQI 862
>gi|315500297|ref|YP_004089100.1| glycoside hydrolase family 3 domain protein [Asticcacaulis
excentricus CB 48]
gi|315418309|gb|ADU14949.1| glycoside hydrolase family 3 domain protein [Asticcacaulis
excentricus CB 48]
Length = 882
Score = 300 bits (767), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 173/450 (38%), Positives = 249/450 (55%), Gaps = 33/450 (7%)
Query: 8 FFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVK 67
+F +L + S+G A + A +P DA + S P R DL+ R++L+EK
Sbjct: 6 YFTAAVLAALMSAG-ALQPALAQNPLDAPG----YQDASKPPEARAADLVSRMTLEEKTA 60
Query: 68 LLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATL 127
LI+ A A+PRL ++ Y WW+E LHGV+ G AT FPQ + A++++ L
Sbjct: 61 QLINDAPAIPRLNVREYNWWNEGLHGVAAAG----------YATVFPQAVGLAATWDEPL 110
Query: 128 WEAIGRVVSDEARAMY------NGGT---AGLTYWSPNVNIFRDPRWGRGQETPGEDPVL 178
+ +S E RA Y GG+ GLT WSPN+NIFRDPRWGRGQET GEDP L
Sbjct: 111 IHRVAETISVEFRAKYLKERHRFGGSDWFGGLTVWSPNINIFRDPRWGRGQETYGEDPYL 170
Query: 179 SGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDV 238
+ + ++VRGLQG D + A+ KH+ + R N S D+ DT+
Sbjct: 171 TARMGVAFVRGLQGDDPVYYRTVATPKHYAVHSGPE---AGRHRDNVNPSPYDLADTYLP 227
Query: 239 PFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVG-VYYD 297
FR + EG+ S+MC+YN +NG P CA+ ++L + +R +W GY+VSDCD+VG +YY
Sbjct: 228 AFRATITEGQAGSIMCAYNAINGQPACANEDLLVKYLRKDWGFKGYVVSDCDAVGDIYYK 287
Query: 298 TQH-FTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEIDINNALVNTLTVQM 355
T H + TPEE A + G DL CG H AV++GLL E ++ AL+ T +
Sbjct: 288 TSHAYRPTPEEGVTAAYQVGTDLICGNANEADHLTRAVRQGLLPEKTLDTALIRLFTARF 347
Query: 356 RLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVA 415
+LG FD P ++ + + +D TP +++ + + A +VLLKN+ LPL R +A
Sbjct: 348 KLGQFD--PPAKVFPKITAEDYDTPANRDFSQKVAESAMVLLKNENNLLPLKG-EPRQIA 404
Query: 416 VIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
VIGPN+D +++GNY G T L GI
Sbjct: 405 VIGPNADSMDSLVGNYNGDPSHPVTVLSGI 434
Score = 135 bits (339), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 94/295 (31%), Positives = 151/295 (51%), Gaps = 50/295 (16%)
Query: 470 AAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASK 519
+A+ A+++AD + V GL Q +E E + DR L LP QQ+++ +VS A K
Sbjct: 598 SAVAAAKEADLVVFVAGLSQRVEGEEMRVETEGFSGGDRTTLNLPPAQQKVLEQVSAAGK 657
Query: 520 GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWY 579
P +LVL++G + + +A D + AII A YPG GG A+A ++ G +P G+LP+T+Y
Sbjct: 658 -PVVLVLINGSALGINWA--DKNVPAIIEAWYPGGQGGAAVARLIAGDYSPAGRLPVTFY 714
Query: 580 PQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
LP + M+ GRTYR++KG +YPFG+G+S+T F + AP
Sbjct: 715 --RSADQLPAFNDYNMK--------GRTYRYFKGEALYPFGYGLSFTTFRY----AP--- 757
Query: 639 AVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGH 698
+ L R + + +S V DV N GS+D + ++ + P
Sbjct: 758 -LTLSARQVAGDGQVS-----------------VSADVTNSGSRDSDEVVQLYVSYPGQK 799
Query: 699 WAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
AP + L FE++H+ AG + V + + LS V+ G+R + G+ + +GG
Sbjct: 800 LAPIRALARFERIHLKAGETKTVRFTLD-PQALSTVNADGSRSVKPGKVELWLGG 853
>gi|336415490|ref|ZP_08595829.1| hypothetical protein HMPREF1017_02937 [Bacteroides ovatus
3_8_47FAA]
gi|335940369|gb|EGN02236.1| hypothetical protein HMPREF1017_02937 [Bacteroides ovatus
3_8_47FAA]
Length = 863
Score = 300 bits (767), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 171/417 (41%), Positives = 242/417 (58%), Gaps = 31/417 (7%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
P+ L QR +DL+ RL+L+EKV L+ + + A+PRLGIK YEWW+EALHGV+ G
Sbjct: 27 PYQDTKLTAEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARAGL- 85
Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA---MYN-----GGTAGLTY 152
AT FPQ I A+SFN L + VSDEARA +N GLT
Sbjct: 86 ---------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNERGQYKRYQGLTM 136
Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYD 211
W+PNVNIFRDPRWGRGQET GEDP LSG+ + VRGLQG D + K+ A KHF +
Sbjct: 137 WTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYDKLHACAKHFAVHS 196
Query: 212 LDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNI 270
WN R FNA+ ++ +D+ +T+ F+ V + V VMC+YN+ G P C +
Sbjct: 197 GPEWN---RHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYNRFEGDPCCGSNRL 253
Query: 271 LKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPE--EAAADAIRAGLDLDCGPFLGLH 328
L + +R +W G +V+DC ++G ++ + + P+ A+ADA+ G DL+CG
Sbjct: 254 LTQILRNDWGFKGIVVTDCGAIGDFFQRKKHETHPDAAHASADAVLNGTDLECGGNFKSI 313
Query: 329 TESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALE 388
T+ AV++GL+SE IN ++ L + LG + + P+ ++ + P H+ELAL+
Sbjct: 314 TD-AVKKGLISEEKINTSVKRLLKARFELGEMN---PTHPWSNIPYSVINCPKHKELALK 369
Query: 389 AARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
A + +VLL+N+ LPL+ R VAVIGPN++ +V GNY G T L+GI
Sbjct: 370 MAHESLVLLQNKNNILPLN--RQMKVAVIGPNANDSVMQWGNYNGFPSHTVTLLEGI 424
Score = 118 bits (295), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 89/307 (28%), Positives = 138/307 (44%), Gaps = 49/307 (15%)
Query: 472 IDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGP 521
++ + AD I G+ +E E++ DR + LP Q+E+++ + K
Sbjct: 594 LNKLKNADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGK-K 652
Query: 522 TILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQ 581
T+ V SG + A AI+ A YPGQAGGTA+AD+LFG NP G+LP+T+Y
Sbjct: 653 TVFVNFSGSAM--AIVPETQSCDAILQAWYPGQAGGTAVADVLFGNYNPAGRLPITFY-- 708
Query: 582 EYITNLPMTE-MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
+ I LP E +M+ GRTYRF +YPFG+G+SYT F + A
Sbjct: 709 KSIQQLPDYEDYSMK--------GRTYRFMTETPLYPFGYGLSYTRFSYGKA-------- 752
Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA 700
++N + K K + + V NVG +DG + V+ P
Sbjct: 753 -------TLNQSKLAKGEKAI----------LTIPVSNVGQRDGEEVVQVYICRPDDKGG 795
Query: 701 PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
P K L F++V++ G Q V I + + + T R G + I G + + L
Sbjct: 796 PQKTLRGFQRVNIAKGKTQNVNIELPYDSFEWFDTATNTIRPLSGTYKILYGNSSNENDL 855
Query: 761 HAATLGV 767
++ +
Sbjct: 856 QTCSIQI 862
>gi|393786911|ref|ZP_10375043.1| hypothetical protein HMPREF1068_01323 [Bacteroides nordii
CL02T12C05]
gi|392658146|gb|EIY51776.1| hypothetical protein HMPREF1068_01323 [Bacteroides nordii
CL02T12C05]
Length = 863
Score = 300 bits (767), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 178/440 (40%), Positives = 249/440 (56%), Gaps = 43/440 (9%)
Query: 40 LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGP 99
LPF LP+ +RV DL+ RL+L EKV L+ +++VPRLGIK Y WW+EALHGV G
Sbjct: 22 LPFNNPDLPVEERVEDLVRRLTLHEKVLLMCDYSSSVPRLGIKQYNWWNEALHGVGRAGL 81
Query: 100 GTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT--------AGLT 151
AT FPQ I A++F+ + + VSDEARA Y+ GLT
Sbjct: 82 ----------ATVFPQAIGMAATFDDCAVKQVFECVSDEARAKYHHSENKDGSERYRGLT 131
Query: 152 YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAY 210
+W+PNVNIFRDPRWGRGQET GEDP L+ + + VRGLQG S+ K+ A KH+ +
Sbjct: 132 FWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGLAVVRGLQGPSESKYDKLHACAKHYALH 191
Query: 211 DLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
WN R F+ + +S +D+ +T+ F+ V +G V VMC+YN+ G P C
Sbjct: 192 SGPEWN---RHRFDVENISPRDLWETYLPAFKALVQQGGVKEVMCAYNRFEGEPCCGSNR 248
Query: 270 ILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFT--STPEEAAADAIRAGLDLDCG-PFLG 326
+L +R EW +G +VSDC ++ +Y H ST E A A A++AG DLDCG +
Sbjct: 249 LLYNILREEWGFDGLVVSDCGAISDFYLKGHHETHSTKESAVAAAVKAGTDLDCGVDYQS 308
Query: 327 LHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP----SSQPYGHLGPKDVCTPDH 382
L E AV++G+++E I+ +L L + LG+ D E S PY V + H
Sbjct: 309 L--EKAVEKGIITEKQIDVSLSRLLKARFELGLMDEEHLVSWSDIPY-----TVVDSEKH 361
Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRH-RTVAVIGPNSDVTVTMIGNYAGIACGYTTP 441
+ ALE AR+ + LLKN+ +LPLS +H + VIGPN++ ++ M GNY G T
Sbjct: 362 RAKALEVARKSMTLLKNKNGTLPLS--KHCGKIVVIGPNANDSIMMWGNYNGFPSHTVTI 419
Query: 442 LQGIGRY---ARTIHQQGCK 458
L+GI + I+ +GC+
Sbjct: 420 LEGITHKLDAGQVIYDKGCE 439
Score = 125 bits (315), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 89/293 (30%), Positives = 140/293 (47%), Gaps = 47/293 (16%)
Query: 478 ADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLM 527
A+A + V G+ +E E L DR + LP Q++L+ ++ K P IL+L
Sbjct: 600 AEAIVFVGGISPKVEGEELPVSFPGFKGGDRTVIELPQVQRDLLQELYKTGK-PIILILC 658
Query: 528 SGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL 587
SG I ++ A+ D AII A YPGQAGGTA+AD+LFG NP G+LP+T+Y
Sbjct: 659 SGSAIGLS-AEVD-LADAIIQAWYPGQAGGTAVADVLFGDYNPAGRLPVTFYK------- 709
Query: 588 PMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHG 647
T + + GRTYR++KG ++PFG+G+SYT+F
Sbjct: 710 --TTEQLPDFEDYNMQGRTYRYFKGEALFPFGYGLSYTSFEIG----------------- 750
Query: 648 SINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVA 707
A +S K I ++ + + +KN G +DG + V+ P K L A
Sbjct: 751 --KAQLSKKRIHANE------SVNLDLWIKNTGERDGEEVIQVYIRKLKDKEGPLKTLRA 802
Query: 708 FEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
F++VHV +G ++++ I++ + R+ GE+ + G + L
Sbjct: 803 FKRVHVKSGEKKQISIHLPNDSFEFFDPEFNVMRVMAGEYEVLYGTSSEGKDL 855
>gi|299148437|ref|ZP_07041499.1| beta-glucosidase [Bacteroides sp. 3_1_23]
gi|298513198|gb|EFI37085.1| beta-glucosidase [Bacteroides sp. 3_1_23]
Length = 863
Score = 300 bits (767), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 171/417 (41%), Positives = 242/417 (58%), Gaps = 31/417 (7%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
P+ L QR +DL+ RL+L+EKV L+ + + A+PRLGIK YEWW+EALHGV+ G
Sbjct: 27 PYQDTKLTAEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARAGL- 85
Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA---MYN-----GGTAGLTY 152
AT FPQ I A+SFN L + VSDEARA +N GLT
Sbjct: 86 ---------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNERGQYKRYQGLTM 136
Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYD 211
W+PNVNIFRDPRWGRGQET GEDP LSG+ + VRGLQG D + K+ A KHF +
Sbjct: 137 WTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYDKLHACAKHFAVHS 196
Query: 212 LDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNI 270
WN R FNA+ ++ +D+ +T+ F+ V + V VMC+YN+ G P C +
Sbjct: 197 GPEWN---RHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYNRFEGDPCCGSNRL 253
Query: 271 LKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPE--EAAADAIRAGLDLDCGPFLGLH 328
L + +R +W G +V+DC ++G ++ + + P+ A+ADA+ G DL+CG
Sbjct: 254 LTQILRNDWGFKGIVVTDCGAIGDFFQRKKHETHPDAAHASADAVLNGTDLECGGNFKSI 313
Query: 329 TESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALE 388
T+ AV++GL+SE IN ++ L + LG + + P+ ++ + P H+ELAL+
Sbjct: 314 TD-AVKKGLISEEKINTSVKRLLKARFELGEMN---PTHPWSNIPYSVINCPKHKELALK 369
Query: 389 AARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
A + +VLL+N+ LPL+ R VAVIGPN++ +V GNY G T L+GI
Sbjct: 370 MAHESLVLLQNKNNILPLN--RQMKVAVIGPNANDSVMQWGNYNGFPSHTVTLLEGI 424
Score = 117 bits (294), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 89/307 (28%), Positives = 138/307 (44%), Gaps = 49/307 (15%)
Query: 472 IDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGP 521
++ + AD I G+ +E E++ DR + LP Q+E+++ + K
Sbjct: 594 LNKLKNADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGK-K 652
Query: 522 TILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQ 581
T+ V SG + A AI+ A YPGQAGGTA+AD+LFG NP G+LP+T+Y
Sbjct: 653 TVFVNFSGSAM--AIVPETQSCDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITFY-- 708
Query: 582 EYITNLPMTE-MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
+ I LP E +M+ GRTYRF +YPFG+G+SYT F + A
Sbjct: 709 KSIQQLPDYEDYSMK--------GRTYRFMTETPLYPFGYGLSYTRFSYGKA-------- 752
Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA 700
++N + K K + + V NVG +DG + V+ P
Sbjct: 753 -------TLNQSKLAKGEKAI----------LTIPVSNVGQRDGEEVVQVYICRPDDKGG 795
Query: 701 PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
P K L F++V++ G Q V I + + + T R G + I G + + L
Sbjct: 796 PQKTLRGFQRVNIAKGKTQNVNIELPYDSFEWFDTATNTIRPLSGTYKILYGNSSNENDL 855
Query: 761 HAATLGV 767
++ +
Sbjct: 856 QTCSIQI 862
>gi|21244948|ref|NP_644530.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. citri str.
306]
gi|21110666|gb|AAM39066.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. citri str.
306]
Length = 901
Score = 299 bits (766), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 173/458 (37%), Positives = 251/458 (54%), Gaps = 36/458 (7%)
Query: 28 FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
C A T P+ R DL+ R++L+EK + + A A+PRL + Y+WW
Sbjct: 20 LGCATGAAHAATPPYLDTQRSFEARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWW 79
Query: 88 SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--- 144
+EALHGV+ G GAT FPQ I A++F+ L + +SDEARA ++
Sbjct: 80 NEALHGVARAG----------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFL 129
Query: 145 -----GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG------- 192
GLT+WSPN+NIFRDPRWGRGQET GEDP L+ + ++V+GLQG
Sbjct: 130 RQNQHARYQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGAAAPK 189
Query: 193 -SDGDRL-KVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVA 250
+ G+ K+ A+ KH + DR HF+A+ S++D+ +T+ F V EGKV
Sbjct: 190 NAQGEPYRKLDATAKHLAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVKEGKVD 246
Query: 251 SVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAA 310
+VM +YN+V G A +L+ +R +W GY+VSDC ++ + +T E+AAA
Sbjct: 247 AVMGAYNRVYGESASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVATREQAAA 306
Query: 311 DAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYG 370
A++ G +L+CG +AV++GL+ E I+ AL +T +MRLGMFD P P+
Sbjct: 307 LAVKHGTELECGEEYAT-LPAAVRQGLIDEAQIDTALKTLMTARMRLGMFD-PPGQLPWS 364
Query: 371 HLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGN 430
+ +P H LA AR+ +VLLKN G LPLS + + +AVIGP +D T+ ++GN
Sbjct: 365 TIPASVNQSPAHDALARRTARESLVLLKNDG-LLPLSRAKFKRIAVIGPTADDTMALLGN 423
Query: 431 YAGIACGYTTPLQGI---GRYARTIHQQGCKDVACADD 465
Y G T LQGI A+ ++ +G V DD
Sbjct: 424 YYGTPAAPVTVLQGIRAAAPNAQVLYARGADLVEGRDD 461
Score = 127 bits (320), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 93/301 (30%), Positives = 148/301 (49%), Gaps = 51/301 (16%)
Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
A+D +R AD + V GL +E E + DR L LP Q++L+ + A+
Sbjct: 628 ALDVARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEALQ-ATGR 686
Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
P + VL +G + + +A+ + AI+ A YPGQ GGTA+AD LFG +NPGG+LP+T+Y
Sbjct: 687 PVVAVLTTGSALAIDWAQQ--HLPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFY- 743
Query: 581 QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHT-VANAPTVVA 639
+E T + AMR GRTYR++ G +YPFGHG+SYT F ++ + T +A
Sbjct: 744 KESETLPAFDDYAMR--------GRTYRYFGGTPLYPFGHGLSYTQFAYSGLRLDRTTIA 795
Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW 699
GS+ AT++ VKN G + G + ++ P A
Sbjct: 796 T-----DGSLAATVT---------------------VKNTGQRAGDEVVQLYLHPLAPQR 829
Query: 700 A-PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGTKHS 757
K+L F+++ + G Q+ +G I+ L + D + + G + + IG +
Sbjct: 830 ERAGKELHGFQRIALQPGEQRELGFTINAKDALRLYDEQRKAYGVDPGAYEVQIGASSAD 889
Query: 758 V 758
+
Sbjct: 890 I 890
>gi|189468358|ref|ZP_03017143.1| hypothetical protein BACINT_04755 [Bacteroides intestinalis DSM
17393]
gi|189436622|gb|EDV05607.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 865
Score = 299 bits (766), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 174/447 (38%), Positives = 259/447 (57%), Gaps = 41/447 (9%)
Query: 35 ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
AT + P+ +L +R DL+ R++L+EK+ + +G+ A+ RLGI Y+WW+EALHGV
Sbjct: 18 ATAQNEPYRNPNLSPSERAWDLLKRMTLEEKISQMKNGSPAIERLGIPAYDWWNEALHGV 77
Query: 95 SNVGPGTKFGGDFPGATSFPQVITTASSF-NATLWEAIGRVVSDEARAMYN--------G 145
+ G AT FPQ I A++F N ++E +VSDEARA Y+ G
Sbjct: 78 ARAGK----------ATVFPQAIGLAATFDNQAVYETFD-IVSDEARAKYHDFQRKGERG 126
Query: 146 GTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASC- 204
G GLT+W+PN+NI+RDPRWGRG ET GEDP L+ + V+GLQG+ + A +C
Sbjct: 127 GYKGLTFWTPNINIYRDPRWGRGMETYGEDPYLTSLMGLAVVKGLQGNGAGKYDKAHACA 186
Query: 205 KHFTAYDLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVP 263
KH+ + WN R F++K +S++D+ +T+ F+ V EGKV VMC+YN+ G P
Sbjct: 187 KHYAVHSGPEWN---RHSFDSKNISQRDLWETYLPAFKTLVTEGKVKEVMCAYNRFEGEP 243
Query: 264 TCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDC 321
C++ +L R +R +W + +VSDC ++G +Y H + P E A+ADA+ +G DL+C
Sbjct: 244 CCSNKQLLIRILREDWGYDDIVVSDCGAIGDFYYPNHHETHPSAEAASADAVVSGTDLEC 303
Query: 322 GPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP----SSQPYGHLGPKDV 377
G E AV++GL++E IN ++ L + +LGMFD + S PY + K
Sbjct: 304 GGSYSSLNE-AVKKGLITEDKINESVFRLLRARFQLGMFDDDTLVSWSEIPYSVVESK-- 360
Query: 378 CTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACG 437
+H + ALE AR+ +VLL N+ SLPLS R VAV+GPN++ +V + NY G
Sbjct: 361 ---EHVDKALEMARKSMVLLTNKNNSLPLSK-SIRKVAVLGPNANDSVMLWANYNGFPTK 416
Query: 438 YTTPLQGIGRY---ARTIHQQGCKDVA 461
T L+GI +++GC V+
Sbjct: 417 SVTILEGIRSKLPEGAVYYEKGCDFVS 443
Score = 124 bits (311), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 86/298 (28%), Positives = 137/298 (45%), Gaps = 47/298 (15%)
Query: 482 ILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGP 531
I V GL ++E E + DR + LP Q+E++ + K P I V+ SG
Sbjct: 605 IFVGGLSSALEGEEMPVDLPGFKKGDRTNIDLPRVQEEMLKALKKTGK-PVIFVVCSGST 663
Query: 532 IDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTE 591
+ + + + + A++ A YPGQ GGTA+AD+LFG NP G+LP+T+Y + ++LP E
Sbjct: 664 LALPWEAEN--LDAMLEAWYPGQQGGTAVADVLFGDYNPAGRLPLTFYASD--SDLPDFE 719
Query: 592 MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINA 651
RTYR++KG ++PFG+G+SYT F + A
Sbjct: 720 -------DYNMSNRTYRYFKGKPLFPFGYGLSYTTFDYG-------------------KA 753
Query: 652 TISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKV 711
+ K+IK ++ + + +KN G DG + V+ PA P K L AF +V
Sbjct: 754 KVDKKSIKTGD------SMTLTIPLKNTGKMDGDEVVQVYLRNPADKEGPIKMLRAFRRV 807
Query: 712 HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIK 769
+ AG + + I + + + I G + + GGT SL + + K
Sbjct: 808 SLKAGQAENIQIELPASTFECFNPATNRMEILPGNYELLYGGTSDGKSLQKVAVTLKK 865
>gi|380509734|ref|ZP_09853141.1| beta-glucosidase-related glycosidase [Xanthomonas sacchari NCPPB
4393]
Length = 883
Score = 299 bits (766), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 176/448 (39%), Positives = 252/448 (56%), Gaps = 32/448 (7%)
Query: 29 ACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWS 88
A D DA P+ S R L+ +++L+EK + + A A+ RLG+ Y+WW+
Sbjct: 15 AADAVDAK----PWQDTSASFEARAAALVAQMTLEEKAAQMQNAAPAIERLGVPAYDWWN 70
Query: 89 EALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN---- 144
EALHGV+ G AT FPQ I A++F+ L + +SDEARA ++
Sbjct: 71 EALHGVARAGQ----------ATVFPQAIGLAATFDVPLMGQVATTISDEARAKHHQFLR 120
Query: 145 ----GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKV 200
G GLT+WSPN+NIFRDPRWGRGQET GEDP L+ + ++V+GLQG D K+
Sbjct: 121 EGAHGRYQGLTFWSPNINIFRDPRWGRGQETYGEDPYLTARMGVAFVQGLQGDDPVYRKL 180
Query: 201 AASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVN 260
A+ KHF + + DR HF+A+ SK+D+ DT+ F V EGKV +VM +YN+V
Sbjct: 181 DATAKHFAVH---SGPEADRHHFDARPSKRDLYDTYLPAFEALVKEGKVDAVMGAYNRVY 237
Query: 261 GVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLD 320
G A +L+ +R +W GY+VSDC ++ + H + E AAA A++ G +L+
Sbjct: 238 GESASASQFLLRDVLRRDWGFTGYVVSDCWAIVDIWKHHHLAPSREAAAALAVKNGTELE 297
Query: 321 CGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTP 380
CG +AV++GL+ E +I++A+ T +MRLGMFD P + + P
Sbjct: 298 CGQEYAT-LPAAVRQGLIGEAEIDDAVTRLFTARMRLGMFD-PPERVRWARIPASVNQVP 355
Query: 381 DHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTT 440
H LAL+AA++ +VLLKN G LPLS R +AV+GP +D T+ ++GNY G T
Sbjct: 356 AHDALALQAAQESLVLLKNDG-VLPLSRTLKR-IAVVGPTADDTMALLGNYFGTPAAPVT 413
Query: 441 PLQGIGRYARTI---HQQGCKDVACADD 465
LQGI A+ I + +G V DD
Sbjct: 414 ILQGIRDAAKGIEVRYARGVDLVEGRDD 441
Score = 123 bits (309), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 95/312 (30%), Positives = 145/312 (46%), Gaps = 59/312 (18%)
Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
F A+DA+R AD + V GL +E E + DR L LP Q+ L+ +
Sbjct: 605 FDEALDAARNADVVVFVGGLTGDVEGEEMKVDYPGFAGGDRTDLRLPAPQRALLEALHAT 664
Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
K P ++VL G + V +A+ + AI+ + YPGQ GGTA+ LFG NP G+LP+T
Sbjct: 665 GK-PVVMVLTGGSALAVDWAQA--HLPAILMSWYPGQRGGTAVGQALFGEVNPAGRLPVT 721
Query: 578 WYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTV--ANA 634
+Y + LP + AM GRTYR+++G +YPFGHG+SYT F + +A
Sbjct: 722 FYRAD--QALPAFDDYAME--------GRTYRYFRGTPLYPFGHGLSYTRFDYGKLHLDA 771
Query: 635 PTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP 694
P + DGR L +QV+V N G + G ++
Sbjct: 772 PRIAD---DGR------------------------LKLQVEVANTGKRAGDEVAQLYVRR 804
Query: 695 PAGHWAP---HKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRS-GTRRIPLGEHNIH 750
A AP + L F++VH+ G ++ + + + L D + G +P G + +
Sbjct: 805 LAA--APGDAQQTLRGFQRVHLAPGERRTLTFELDAQQALRQYDDARGAYVVPAGRYEVR 862
Query: 751 IGGTKHSVSLHA 762
IGG+ + A
Sbjct: 863 IGGSSADARVRA 874
>gi|315499711|ref|YP_004088514.1| beta-glucosidase [Asticcacaulis excentricus CB 48]
gi|315417723|gb|ADU14363.1| Beta-glucosidase [Asticcacaulis excentricus CB 48]
Length = 869
Score = 299 bits (766), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 179/415 (43%), Positives = 238/415 (57%), Gaps = 37/415 (8%)
Query: 57 IGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQV 116
I R+++++K + + A +P G+ YEWW+E LHGV+ G AT FPQ
Sbjct: 40 IARMTVEQKAAQMQNRAPDLPSAGLTAYEWWNEGLHGVARAGE----------ATVFPQA 89
Query: 117 ITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRDPRWGRG 168
I A+++N L + +G VVS EARA +N GLT WSPN+NIFRDPRWGRG
Sbjct: 90 IGLAATWNPALLKQVGDVVSTEARAKFNSTDPAGDHQRYYGLTLWSPNINIFRDPRWGRG 149
Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVS 228
QET GEDP L+ + A +V GLQG D KV AS KH + R F A VS
Sbjct: 150 QETYGEDPFLTSRLAEGFVTGLQGPDPQHPKVVASVKHLAVHSGPE---AGRHGFAASVS 206
Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
D+E T+ FR VM K SVMC+YN V GVP CA +LK +R W GY+V+D
Sbjct: 207 PYDLEMTYLPAFRYSVMTTKAQSVMCAYNAVGGVPACASDLLLKTYVREAWGFKGYVVTD 266
Query: 289 CDSVGVYYDTQHF---TSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINN 345
CD++ YD F E++A++++AG+DL+CG E AVQ+GL+ E ++
Sbjct: 267 CDAI---YDMTRFHFYRLNDAESSAESLKAGVDLNCGNAYAALPE-AVQKGLIPESLMDQ 322
Query: 346 ALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLP 405
+L L V+ RLG+ DG PS P+ + P+ + TP Q LAL+AA Q +VLLKN G LP
Sbjct: 323 SLNRLLDVRKRLGI-DGAPS--PWARISPEAINTPQAQGLALQAAEQSLVLLKNNG-VLP 378
Query: 406 LSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY---ARTIHQQGC 457
L +TVAVIGPN+D T+ GNY GIA TPL G+ A+ ++ QG
Sbjct: 379 LKP--GQTVAVIGPNADTEETLRGNYNGIARQPVTPLTGLRAQLGAAKVLYAQGA 431
Score = 114 bits (284), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 88/272 (32%), Positives = 133/272 (48%), Gaps = 45/272 (16%)
Query: 482 ILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDP 541
ILV G D+ DR L LP Q++L+ V K P ++VL+SG + + +A
Sbjct: 620 ILVPGFDRG------DRTDLGLPRTQEDLLKAVKATGK-PLVVVLLSGSAVALNWADAHA 672
Query: 542 RIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKR 601
W YPG+AGGTAIA L G +NP G+LP+T+Y + +LP P R
Sbjct: 673 DAVVAAW--YPGEAGGTAIARTLTGEANPSGRLPVTFY--RSVQDLP-------PFIDYR 721
Query: 602 YPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVT 661
GRTYR++KG +YPFGHG+SYT F ++ + LD + +G+ ++V+
Sbjct: 722 MEGRTYRYFKGKPLYPFGHGLSYTQFSYS--------DLKLD-----TSTLTAGQPLRVS 768
Query: 662 HAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRV 721
V V+N G + G + ++ P + + L AF +V + AG + V
Sbjct: 769 ------------VRVRNNGQRAGDEVVQLYVKRP-DTFGLNASLAAFARVSLKAGESRTV 815
Query: 722 GINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
+ I + LS V G R I G + + +GG
Sbjct: 816 VMTID-PRDLSTVTLEGERAIRAGAYGLSVGG 846
>gi|373951852|ref|ZP_09611812.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
gi|373888452|gb|EHQ24349.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
Length = 871
Score = 298 bits (764), Expect = 7e-78, Method: Compositional matrix adjust.
Identities = 169/446 (37%), Positives = 246/446 (55%), Gaps = 32/446 (7%)
Query: 12 LILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLIS 71
+ +LS +A PFA A T P+ L RVNDL+ R++L+EKV +++
Sbjct: 1 MKILSKICILVATMLPFAVI---AQTSDYPYQNYHLDFTTRVNDLVKRMTLEEKVSQMLN 57
Query: 72 GAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAI 131
+ A+PRL I Y+WW+E LHGV+ T F T +PQ I A++F+ +
Sbjct: 58 SSPAIPRLKIPAYDWWNEVLHGVAR----TPFK-----VTVYPQAIAMAATFDRQSLNQM 108
Query: 132 GRVVSDEARAMYNGGTA---------GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKY 182
+ E RA++N GLTYW+PN+NIFRDPRWGRGQET GEDP L+G
Sbjct: 109 ADYAALEGRAVHNKALQMRKPGEKYLGLTYWTPNINIFRDPRWGRGQETYGEDPFLTGAM 168
Query: 183 AASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--RFHFNAKVSKQDIEDTFDVPF 240
+++V GLQG+D LK AA KH+ + +G + R FNA +S D+ DT+ F
Sbjct: 169 GSAFVSGLQGNDPKYLKAAACAKHYAVH-----SGPEPLRHVFNADISTYDLWDTYLPAF 223
Query: 241 RMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQH 300
+ V++ KVA VMC+YN P C ++ +R +W+ +GY+ SDC + ++
Sbjct: 224 KKLVVDDKVAGVMCAYNAFKTQPCCGSDLLMVDILRNQWKFSGYVTSDCGGIDDFFKNHK 283
Query: 301 FTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMF 360
+T E+A+ DA+ G D++CG +AV+ G +SE I+ ++ ++ RLGMF
Sbjct: 284 THATAEDASTDAVLHGTDIECGTDAYKSLVAAVKEGKISETQIDISVKRLFMIRFRLGMF 343
Query: 361 DGEPSSQPYGHLGPKDVC-TPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGP 419
D PS P V +P+HQ AL+ ARQ +VLLKN +LPLS R + V+GP
Sbjct: 344 D--PSDVVKYAQTPVSVLESPEHQAHALKMARQSVVLLKNANHTLPLSKTI-RKIVVLGP 400
Query: 420 NSDVTVTMIGNYAGIACGYTTPLQGI 445
N+D + ++GNY G TT QGI
Sbjct: 401 NADNPIAILGNYNGTPSNLTTVYQGI 426
Score = 103 bits (256), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 91/298 (30%), Positives = 139/298 (46%), Gaps = 57/298 (19%)
Query: 478 ADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLM 527
ADA + V G+ +E E + DR + LP Q L+ K A+ P + V+M
Sbjct: 603 ADAIVYVGGISPQLEGEEMQVNYPGFNGGDRTSIQLPAAQTNLM-KTLQATGKPVVFVMM 661
Query: 528 SGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL 587
+G + + + I AI+ A Y GQA GTA+AD+LFG NP G+LP+T+Y + T+L
Sbjct: 662 TGSALATPWEAEN--IPAIVNAWYGGQAAGTAVADVLFGDYNPAGRLPVTFYKSD--TDL 717
Query: 588 P-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
P T+ +M RTYR++KG +Y FG+G+SYT F + P V
Sbjct: 718 PDFTDYSMT--------NRTYRYFKGIPLYGFGYGLSYTQFKYDKLIVPATVK------- 762
Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW-APHKQL 705
SGKAI ++ V V N G G + ++ + P K L
Sbjct: 763 -------SGKAIHLS------------VTVTNSGQIAGDEVVQIYMKHHSQRIKVPLKAL 803
Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPL-GEHNIHIGGTK---HSVS 759
F +V++ AG +R +N + V S +P+ G+ I GG++ H+V+
Sbjct: 804 KGFARVYLKAG--ERRTLNFILSPDDLAVTSSNGGLVPIKGKITISAGGSQPDEHNVT 859
>gi|409197445|ref|ZP_11226108.1| glycoside hydrolase family protein [Marinilabilia salmonicolor JCM
21150]
Length = 737
Score = 298 bits (763), Expect = 7e-78, Method: Compositional matrix adjust.
Identities = 230/716 (32%), Positives = 341/716 (47%), Gaps = 85/716 (11%)
Query: 39 TLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVG 98
+ PF L + RV+DL+ R++L+EKV L S +VPRLGIKG E HGV+ G
Sbjct: 39 SYPFQNADLDMETRVDDLLSRMTLEEKVSAL-STDPSVPRLGIKGAPH-IEGYHGVAMGG 96
Query: 99 PGT--KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNG---GTAGLTYW 153
P G + T FPQ +++N L G + S EAR ++ GL
Sbjct: 97 PANWAPKGDERVPTTQFPQAYGMGATWNPELIRKAGEIESIEARYIFQNPEISKGGLVVR 156
Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLD 213
+PN ++ RDPRWGR +E GEDP L G + ++ +GLQG D + A+ KHF A
Sbjct: 157 APNADLGRDPRWGRTEEVLGEDPFLVGTLSTAFTKGLQGDDEKYWRTASLLKHFLA---- 212
Query: 214 NWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKR 273
N N R ++ Q + + FR ++EG + M +YN VNGVP P + K
Sbjct: 213 NSNENTRDSSSSNFDTQLFYEYYGATFRRAILEGGSNAYMTAYNAVNGVPAHIHP-MHKE 271
Query: 274 TIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE--- 330
W +NG I +D + AA I+AGL+ FL + E
Sbjct: 272 ISMARWGVNGIICTDGGGYTLLVRAHKAYDDYYRAAEGVIKAGLN----QFLDNYREGVW 327
Query: 331 SAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLG----PKDVCTPDHQELA 386
A+ G L+E D++ L V ++LG D + PY +G P +P+HQE A
Sbjct: 328 GALAHGYLAEEDLDEVLKGVYRVMIKLGQLDPQ-DKVPYASIGRDGKPAPWTSPEHQEAA 386
Query: 387 LEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIG 446
L+ AR+ +VLLKN+ +LPL+ VAVIG +D ++ Y+G+ +TPL GI
Sbjct: 387 LQMARESVVLLKNEKQTLPLAGDELGKVAVIGHLAD--TILLDWYSGMPPFMSTPLDGIK 444
Query: 447 RYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGL-------------DQSIEA 493
+ G V A D + AA++A+ QAD I+V+G D +
Sbjct: 445 ------EKMGADKVLFAPDNDYNAAVEAASQADVAIVVLGNHPYCDSERWGDCPDPGMGR 498
Query: 494 EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPG 553
EA+DR L L E +++ + TILVL S P + +++ + + AI+ + G
Sbjct: 499 EAVDRKTLRL---TDEWLAQRVFEANPNTILVLQSSFPYGINWSQEN--LPAIVHITHNG 553
Query: 554 QAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKG 612
Q+ GTA+AD+LFG NPGGKL TW E LP M E +R G TY ++ G
Sbjct: 554 QSTGTALADVLFGDYNPGGKLTQTWPKSE--EQLPDMMEYDIR-------KGHTYMYFNG 604
Query: 613 PVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGV 672
+YPFG G+SYT+F ++ I+G ++ K N + V
Sbjct: 605 EPLYPFGFGLSYTSFEW-------------------VDMEITGSSV-----KSNEEEVIV 640
Query: 673 QVDVKNVGSKDGAHTLLVFST-PPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHV 727
V +KNVG G + ++++ P P K L F++V + G + V I + +
Sbjct: 641 TVKLKNVGQVKGDEVIQLYASFPETSSRRPDKALKGFKRVTLEPGESKNVQIPVKL 696
>gi|380692997|ref|ZP_09857856.1| beta-glucosidase [Bacteroides faecis MAJ27]
Length = 837
Score = 298 bits (762), Expect = 9e-78, Method: Compositional matrix adjust.
Identities = 166/403 (41%), Positives = 244/403 (60%), Gaps = 28/403 (6%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
+ ++ PI +RV DL+ +L+++EKV LL + + + R+GI Y +EALHG+ + PG
Sbjct: 14 YKNMNAPIHERVQDLLSKLTIEEKVSLLRATSPGIERMGIDKYYMGNEALHGI--IRPG- 70
Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG----------LT 151
KF T FPQ I AS +N L I V+SDEARA +N G LT
Sbjct: 71 KF-------TVFPQAIGLASMWNPELHHIIAGVISDEARARWNELERGKKQKDQFSDLLT 123
Query: 152 YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYD 211
+WSP VN+ RDPRWGR ET GEDP LSG ++V+GLQG LK A+ KHF A +
Sbjct: 124 FWSPTVNMARDPRWGRTPETYGEDPYLSGVLGTAFVKGLQGDHPRYLKAVATPKHFAANN 183
Query: 212 LDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
++ +RF+ +A +++ D+ + + F C+ EGK S+M +YN +NGVP A+ +L
Sbjct: 184 EEH----NRFYCDAAITETDLREYYFPAFEKCIREGKAESIMTAYNAINGVPCTANNWLL 239
Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLH-TE 330
+ ++ +W NGYIVSDC + G+ + TPE AA AI+AGLD++CG ++ +
Sbjct: 240 NKVLKQDWGFNGYIVSDCGAPGLLMTDHRYVKTPEAAAMIAIKAGLDVECGDYVFANPLL 299
Query: 331 SAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAA 390
+A ++ ++S +I++A L +MRLGMFD +P PY HL P+ V H +LALEAA
Sbjct: 300 NAYKQYMVSAAEIDSAAYRVLRARMRLGMFD-DPEKNPYNHLSPEIVGCKKHHDLALEAA 358
Query: 391 RQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG 433
RQ IVLLKNQ +LPL+ + +++AV+G N+ G+Y+G
Sbjct: 359 RQSIVLLKNQQNTLPLNAQKIKSIAVVGINA--ANCEFGDYSG 399
Score = 134 bits (336), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 94/296 (31%), Positives = 146/296 (49%), Gaps = 41/296 (13%)
Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
++G A R++D I VMG++QSIE E DR + LP QQ + + A+ TI+VL
Sbjct: 577 MYGDASKIIRESDVVIAVMGINQSIEREGQDRNSIELPKDQQIFIREAYKANPN-TIVVL 635
Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
++G + + + D I AII A YPG+ GGTAIA++LFG NP G+LP+T+Y I +
Sbjct: 636 VAGSSMAIGWM--DQHIPAIIDAWYPGEQGGTAIAEVLFGDYNPAGRLPLTFYNS--IED 691
Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
LP A K RTY +++G +Y FG+G+SYT F + N + D ++
Sbjct: 692 LP----AFDDYNVKN--NRTYMYFEGKPLYAFGYGLSYTKFDYRNLN------IKQDTQN 739
Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQL 705
++N +I KN G +G V+ P G P KQL
Sbjct: 740 VTLNFSI-----------------------KNSGKYNGDEVAQVYVKFPDQGIKTPLKQL 776
Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
F++VH+ GA +++ I I + D+ P G ++ +G + ++ L
Sbjct: 777 KGFKRVHIKKGATEQISIEIPKEELRLWDDQKKQFYTPSGTYHFMVGKSSDNICLQ 832
>gi|423290405|ref|ZP_17269254.1| hypothetical protein HMPREF1069_04297 [Bacteroides ovatus
CL02T12C04]
gi|392665792|gb|EIY59315.1| hypothetical protein HMPREF1069_04297 [Bacteroides ovatus
CL02T12C04]
Length = 861
Score = 298 bits (762), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 175/441 (39%), Positives = 245/441 (55%), Gaps = 34/441 (7%)
Query: 35 ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
A + LP+ SL QR DL+ RL+L+EKV L+ + + A+PRLGIK YEWW+EALHGV
Sbjct: 19 ACKQLLPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGV 78
Query: 95 SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GG 146
G AT FPQ I +SFN +L + SDEAR
Sbjct: 79 GRAGL----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKR 128
Query: 147 TAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL-KVAASCK 205
GLT+W+PNVNIFRDPRWGRGQET GEDP L+G+ + VRGLQG + R K+ A K
Sbjct: 129 YQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLHACAK 188
Query: 206 HFTAYDLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
HF + WN R F+A+ + +D+ +T+ F+ V + V VMC+YN+ G P
Sbjct: 189 HFAVHSGPEWN---RHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPC 245
Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCG 322
C +L + +R EW G +VSDC ++ +Y + P E A+ADA+RAG DL+CG
Sbjct: 246 CGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHETHPDKEHASADAVRAGTDLECG 305
Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
AV+ GL+ E +I+ +L LT + LG D +P+ + + + + +H
Sbjct: 306 SEYA-SLADAVKAGLIDEKEIDISLKRLLTARFELGEMDEQPA---WAEIPTSVLNSKEH 361
Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
Q LAL AR+ +VLL+N+ LPL+ H VAV+GPN++ +V GNY GI T L
Sbjct: 362 QALALRMARESLVLLQNKNNILPLN--THLKVAVMGPNANDSVMQWGNYNGIPAHTVTLL 419
Query: 443 QGIGRY---ARTIHQQGCKDV 460
+ + + I++ GC V
Sbjct: 420 EAVRAKLPEGQIIYEPGCDRV 440
Score = 116 bits (290), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 92/308 (29%), Positives = 135/308 (43%), Gaps = 49/308 (15%)
Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
A+ AD + G+ S+E E + DR + LP Q++L+ + A K
Sbjct: 591 AVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGKK 650
Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
+ + SG I AI+ A YPGQAGGTAI D L+G NPGG+LP+T+Y
Sbjct: 651 -VVFINYSGSAI--GLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707
Query: 581 QEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
+ LP + +M+ GRTYR+ + ++PFGHG+SYT F + A
Sbjct: 708 D--VNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTTFTYGEAKL----- 752
Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW 699
S N G+ + +T + V NVG +DG + V+ P
Sbjct: 753 --------SKNTIAKGENVVLT------------IPVSNVGQRDGEEVVQVYLRRPGDKE 792
Query: 700 APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVS 759
P L AF++VH+PAG + V I + + S T R G + + GGT
Sbjct: 793 GPRYTLRAFKRVHIPAGKTESVAIPLTGENFEWFDAESNTMRPLEGTYELLYGGTSDRNK 852
Query: 760 LHAATLGV 767
L + V
Sbjct: 853 LKTIVMNV 860
>gi|300777563|ref|ZP_07087421.1| beta-glucosidase [Chryseobacterium gleum ATCC 35910]
gi|300503073|gb|EFK34213.1| beta-glucosidase [Chryseobacterium gleum ATCC 35910]
Length = 896
Score = 297 bits (761), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 162/433 (37%), Positives = 248/433 (57%), Gaps = 32/433 (7%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
PF LP+ +R+ +L+ L+ +EK+ +++ + AVPRL I Y WW+EALHGV+ G
Sbjct: 45 PFRNPDLPVNERIENLLTLLTTEEKIGMMMDNSQAVPRLEIPAYGWWNEALHGVARAGI- 103
Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN---------GGTAGLT 151
AT FPQ I A++++ ++SDEARA YN G GLT
Sbjct: 104 ---------ATVFPQAIGMAATWDVPEHFKTFEMISDEARAKYNRSFDEALKTGRYEGLT 154
Query: 152 YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYD 211
+W+PN+NIFRDPRWGRGQET GEDP L+ + V+GLQG+D K A KHF +
Sbjct: 155 FWTPNINIFRDPRWGRGQETYGEDPYLTSVLGVAAVKGLQGNDPKFFKTHACAKHFAVHS 214
Query: 212 LDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
WN R +NA++SK+D+ +T+ F+ V EG V VMC+YN +G P CA+ +L
Sbjct: 215 GPEWN---RHSYNAEISKRDLYETYLPAFKALVQEGNVREVMCAYNAFDGQPCCANNTLL 271
Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEE--AAADAIRAGLDLDCGPFLGLHT 329
+RG+W+ +G +VSDC ++ ++ ++ + P+E AADA++ DL+CG +
Sbjct: 272 TEILRGKWKYDGMVVSDCWALADFFQKKYHGTHPDEKTTAADALKHSTDLECGDTYN-NL 330
Query: 330 ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVC-TPDHQELALE 388
++ GL++E DI+ ++ L LGM D P S + + P V + +H++ AL+
Sbjct: 331 NKSLASGLITEKDIDESMRRILKGWFELGMLD--PKSSVHWNTIPYSVVDSEEHKKQALK 388
Query: 389 AARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY 448
A++ IVL+KN+ LPL+ + +AV+GPN+D + +GNY G T L GI
Sbjct: 389 MAQKSIVLMKNEKNILPLNR-NIKKIAVVGPNADDGLMQLGNYNGTPSSIVTILDGIKTK 447
Query: 449 ---ARTIHQQGCK 458
A I+++G +
Sbjct: 448 FPNAEIIYEKGSE 460
Score = 109 bits (273), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 78/311 (25%), Positives = 138/311 (44%), Gaps = 39/311 (12%)
Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
F + + + AD + GL S+E E + D+ + LP Q++L++++
Sbjct: 613 FASVREKVKNADVIVFAGGLSPSLEGEEMMVNAEGFKGGDKTSIALPKVQRDLLAELRKT 672
Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
K P + VL +G + +++ A++ A Y GQ+GGTA+AD+L G NP GKLP+T
Sbjct: 673 GK-PVVFVLCTGSAL--GLEQDEKNYDALLNAWYGGQSGGTAVADVLAGDYNPSGKLPIT 729
Query: 578 WYPQ-EYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
+Y E + N ++ GRTYR+ +YPFGHG+SY+ FV+
Sbjct: 730 FYKNLEQLDNALSKTSKHEGFENYDMQGRTYRYMTEKPLYPFGHGLSYSKFVYG------ 783
Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA 696
++ +S +I V + + + V N+ ++G + V+
Sbjct: 784 -------------DSKLSKNSISVNE------NVTITIPVTNISEREGEEVVQVYIKRNN 824
Query: 697 GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKH 756
AP K L AFE+ + + + + + + + +++ G++ I GGT
Sbjct: 825 DAQAPVKTLRAFERTPIKSKETKNIQLILSKDSFAFYDEKADDLVSKPGDYTIFYGGTSD 884
Query: 757 SVSLHAATLGV 767
L L V
Sbjct: 885 DAGLKNVLLKV 895
>gi|299147288|ref|ZP_07040353.1| beta-glucosidase [Bacteroides sp. 3_1_23]
gi|298514566|gb|EFI38450.1| beta-glucosidase [Bacteroides sp. 3_1_23]
Length = 861
Score = 297 bits (761), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 175/441 (39%), Positives = 245/441 (55%), Gaps = 34/441 (7%)
Query: 35 ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
A + LP+ SL QR DL+ RL+L+EKV L+ + + A+PRLGIK YEWW+EALHGV
Sbjct: 19 ACKQLLPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGV 78
Query: 95 SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GG 146
G AT FPQ I +SFN +L + SDEAR
Sbjct: 79 GRAGL----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKR 128
Query: 147 TAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL-KVAASCK 205
GLT+W+PNVNIFRDPRWGRGQET GEDP L+G+ + VRGLQG + R K+ A K
Sbjct: 129 YQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLHACAK 188
Query: 206 HFTAYDLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
HF + WN R F+A+ + +D+ +T+ F+ V + V VMC+YN+ G P
Sbjct: 189 HFAVHSGPEWN---RHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPC 245
Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCG 322
C +L + +R EW G +VSDC ++ +Y + P E A+ADA+RAG DL+CG
Sbjct: 246 CGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPDKEHASADAVRAGTDLECG 305
Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
AV+ GL+ E +I+ +L LT + LG D +P+ + + + + +H
Sbjct: 306 SEYA-SLADAVKAGLIDEKEIDISLKRLLTARFELGEMDEQPA---WSEIPTSVLNSKEH 361
Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
Q LAL AR+ +VLL+N+ LPL+ H VAV+GPN++ +V GNY GI T L
Sbjct: 362 QALALRMARESLVLLQNKNNILPLN--THLKVAVMGPNANDSVMQWGNYNGIPAHTVTLL 419
Query: 443 QGIGRY---ARTIHQQGCKDV 460
+ + + I++ GC V
Sbjct: 420 EAVRAKLPEGQIIYEPGCDRV 440
Score = 116 bits (291), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 92/308 (29%), Positives = 134/308 (43%), Gaps = 49/308 (15%)
Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
A+ AD + G+ S+E E + DR + LP Q+ L+ + A K
Sbjct: 591 AVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRNLLKALKKAGKK 650
Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
+ + SG I AI+ A YPGQAGGTAI D L+G NPGG+LP+T+Y
Sbjct: 651 -VVFINYSGSAI--GLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707
Query: 581 QEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
+ LP + +M+ GRTYR+ + ++PFGHG+SYT F + A
Sbjct: 708 N--VNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTTFTYGEAKL----- 752
Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW 699
S N G+ + +T + V NVG +DG + V+ P
Sbjct: 753 --------SKNTIAKGENVVLT------------IPVSNVGQRDGEEVVQVYLRRPGDKE 792
Query: 700 APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVS 759
P L AF++VH+PAG + V I + + S T R G + + GGT
Sbjct: 793 GPRYTLRAFKRVHIPAGKTESVAIPLTGVNFEWFDAESNTMRPLEGTYELLYGGTSDRNK 852
Query: 760 LHAATLGV 767
L + V
Sbjct: 853 LKTIVMNV 860
>gi|325914134|ref|ZP_08176487.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
ATCC 35937]
gi|325539637|gb|EGD11280.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
ATCC 35937]
Length = 874
Score = 297 bits (760), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 170/410 (41%), Positives = 237/410 (57%), Gaps = 32/410 (7%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
QR L+ ++S EKV ++ A A+PRL I YEWWSE LHG++ G A
Sbjct: 24 QRAAALVAQMSRDEKVAQAMNDAPAIPRLDIPAYEWWSEGLHGIARNG----------YA 73
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--GGT-------AGLTYWSPNVNIFR 161
T FPQ I A+S+N L + +G VVS EARA +N GG AGLT WSPN+NIFR
Sbjct: 74 TVFPQAIGLAASWNTALMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFR 133
Query: 162 DPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD-- 219
DPRWGRG ET GEDP L+G+ A ++RGLQG D + + A+ KH + +G +
Sbjct: 134 DPRWGRGMETYGEDPFLTGQLAVGFIRGLQGDDLNHPRTIATPKHIAVH-----SGPEPG 188
Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
R F+ VS +D+E T+ FR +++G+ SVMC+YN ++G P CA +L +RG+W
Sbjct: 189 RHGFDVDVSPRDMEATYTPAFRAALVDGQAWSVMCAYNSLHGTPACAADWLLNGRVRGDW 248
Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLL 338
G++VSDCD+V +F ++A A++AG DL+CG + L T A++RG +
Sbjct: 249 GFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAYRELGT--AIERGEV 306
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
E ++ +LV + RLG + P PY LG KDV H+ LAL+AA + IVLLK
Sbjct: 307 DEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRALALQAAAESIVLLK 365
Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY 448
N +LPL +AVIGPN+D + NY G + TPL G+ ++
Sbjct: 366 NTATTLPLKA--GTRLAVIGPNADALAALEANYQGTSATPITPLLGLRQH 413
Score = 144 bits (362), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 98/287 (34%), Positives = 147/287 (51%), Gaps = 48/287 (16%)
Query: 477 QADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVL 526
Q+DA + +GL +E E L DR + LP QQ L+ + + K P ++VL
Sbjct: 602 QSDAVVAFVGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLERAKASGK-PLVVVL 660
Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
MSG + + +AK + AI+ A YPGQ+GGTAIA L G NPGG+LP+T+Y +
Sbjct: 661 MSGSAVALNWAKAN--ADAIVAAWYPGQSGGTAIARALAGDDNPGGRLPVTFY--RSTKD 716
Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
LP S GRTYR++KG ++PFG+G+SYT+F + +AP +
Sbjct: 717 LPAY-------VSYDMKGRTYRYFKGEPLFPFGYGLSYTSFAY---DAPRL--------- 757
Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLV 706
S +G ++VT V+N GS+ G V+ P +P + LV
Sbjct: 758 -STRTLQAGNPLQVT------------TTVRNTGSRAGDEVAQVYLQYPDRPQSPLRSLV 804
Query: 707 AFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
F++VH+ G Q+ + + + LS VDRSG R + GE+ + +GG
Sbjct: 805 GFQRVHLKPGEQRELTFTLD-ARALSDVDRSGQRAVEAGEYRVFVGG 850
>gi|298481648|ref|ZP_06999839.1| beta-glucosidase [Bacteroides sp. D22]
gi|298272189|gb|EFI13759.1| beta-glucosidase [Bacteroides sp. D22]
Length = 861
Score = 297 bits (760), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 175/441 (39%), Positives = 245/441 (55%), Gaps = 34/441 (7%)
Query: 35 ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
A + LP+ SL QR DL+ RL+L+EKV L+ + + A+PRLGIK YEWW+EALHGV
Sbjct: 19 ACKQLLPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGV 78
Query: 95 SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GG 146
G AT FPQ I +SFN +L + SDEAR
Sbjct: 79 GRAGL----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKR 128
Query: 147 TAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL-KVAASCK 205
GLT+W+PNVNIFRDPRWGRGQET GEDP L+G+ + VRGLQG + R K+ A K
Sbjct: 129 YQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLHACAK 188
Query: 206 HFTAYDLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
HF + WN R F+A+ + +D+ +T+ F+ V + V VMC+YN+ G P
Sbjct: 189 HFAVHSGPEWN---RHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPC 245
Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCG 322
C +L + +R EW G +VSDC ++ +Y + P E A+ADA+RAG DL+CG
Sbjct: 246 CGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPDKEHASADAVRAGTDLECG 305
Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
AV+ GL+ E +I+ +L LT + LG D +P+ + + + + +H
Sbjct: 306 SEYA-SLADAVKAGLIDEKEIDISLKRLLTARFELGEMDEQPA---WSEIPTSVLNSKEH 361
Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
Q LAL AR+ +VLL+N+ LPL+ H VAV+GPN++ +V GNY GI T L
Sbjct: 362 QALALRMARESLVLLQNKNNILPLN--THLKVAVMGPNANDSVMQWGNYNGIPAHTVTLL 419
Query: 443 QGIGRY---ARTIHQQGCKDV 460
+ + + I++ GC V
Sbjct: 420 EAVRAKLPEGQIIYEPGCDRV 440
Score = 114 bits (284), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 92/308 (29%), Positives = 134/308 (43%), Gaps = 49/308 (15%)
Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
A+ AD + G+ S+E E + DR + LP Q+ L+ + A K
Sbjct: 591 AVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRNLLKALKKAGKK 650
Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
+ + SG I AI+ A YPGQAGGTAI D L+G NPGG+LP+T+Y
Sbjct: 651 -VVFINYSGSAI--GLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707
Query: 581 QEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
+ LP + +M+ GRTYR+ + ++PFGHG+SYT+F + A
Sbjct: 708 D--VNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTDFTYGEAKL----- 752
Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW 699
S N G+ + +T + V NVG DG + V+ P
Sbjct: 753 --------SKNTIAKGENVVLT------------IPVSNVGQCDGEEVVQVYLRRPGDKE 792
Query: 700 APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVS 759
P L AF++VH+PAG + V I + + S T R G + + GGT
Sbjct: 793 GPRYTLRAFKRVHIPAGKTESVAIPLTGENFEWFDVESNTMRPLEGTYELLYGGTSDRNK 852
Query: 760 LHAATLGV 767
L + V
Sbjct: 853 LKTIVMNV 860
>gi|94970273|ref|YP_592321.1| Beta-glucosidase [Candidatus Koribacter versatilis Ellin345]
gi|94552323|gb|ABF42247.1| Beta-glucosidase [Candidatus Koribacter versatilis Ellin345]
Length = 881
Score = 297 bits (760), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 160/419 (38%), Positives = 237/419 (56%), Gaps = 32/419 (7%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
+ SL +R DL+ R++++EKV L + + AVPRL + Y+WWSEALHGV+
Sbjct: 30 YLNPSLAPEKRAADLVHRMTVEEKVSQLTNDSRAVPRLNVPDYDWWSEALHGVAQ----- 84
Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYW 153
PG T +PQ + A++F+ + + R + E R + G GL +W
Sbjct: 85 ------PGVTEYPQPVALAATFDNDKVQRMARFIGIEGRIKHEEGMKDGHSDIFQGLDFW 138
Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLD 213
+PN+NIFRDPRWGRGQET GEDP L+ + +YV+GLQG D ++ KH+ +
Sbjct: 139 APNINIFRDPRWGRGQETYGEDPFLTARMGVAYVKGLQGDDPKYYLAISTPKHYAVH--- 195
Query: 214 NWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKR 273
+ R + KVSK D DT+ FR V E K SVMC+YN +NG P C + +L+
Sbjct: 196 SGPETTRHFADVKVSKHDELDTYLPAFRATVTEAKAGSVMCAYNSINGQPACVNEFLLQD 255
Query: 274 TIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE--- 330
+RG+W GY+VSDC+++ Y FT T EA+A A++ G+D +C F +
Sbjct: 256 QLRGKWNFQGYVVSDCEAIINIYRDHKFTKTQAEASALAVQRGMDNECVDFGKQKDDHDY 315
Query: 331 ----SAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELA 386
A ++G+L E +I+ ALV T +M+LGMFD P PY + PK++ + +H+ELA
Sbjct: 316 RPYFDAYKQGILKESEIDTALVRLFTARMKLGMFD-PPEMVPYSKIDPKELESAEHRELA 374
Query: 387 LEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
A + +VLLKN G +LPL + +AVIGP ++ T ++GNY G + L+G+
Sbjct: 375 RTLANESMVLLKNDG-TLPLKKSGLK-IAVIGPLAEQTRYLLGNYNGTPSHTVSVLEGL 431
Score = 131 bits (329), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 94/295 (31%), Positives = 147/295 (49%), Gaps = 50/295 (16%)
Query: 470 AAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASK 519
AA+ A++ AD I V+G+ +E E + DR L LP +Q+L+ +S A K
Sbjct: 602 AAVTAAKNADVVIAVLGITSDLEGEEMPVSEEGFNGGDRTSLDLPKPEQQLLESISAAGK 661
Query: 520 GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWY 579
P +LVL +G + V +A+ AI+ YPG+ GGTAIA L G +NP G+LP+T+Y
Sbjct: 662 -PVVLVLSNGSALSVNWAQQ--HANAILEGWYPGEEGGTAIAQTLSGKNNPAGRLPVTFY 718
Query: 580 P-QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
E + P + AM+ GRTYR+++G +YPFG+G+SYT F + P
Sbjct: 719 TGTEQLP--PFEDYAMK--------GRTYRYFEGKPLYPFGYGLSYTTFSYRDLALPK-- 766
Query: 639 AVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGH 698
PL+ +G + QV V N G +G ++ + P
Sbjct: 767 -APLN----------AGDPVT------------AQVTVTNTGKVEGDEVAQLYLSFPNIA 803
Query: 699 WAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
AP + L F ++H+ AG Q + + + LS+V+ +G I GE+++ +GG
Sbjct: 804 GAPLRALRGFRRIHLKAGESQTIKFELK-DRDLSMVNEAGDPIIAEGEYSVSVGG 857
>gi|423215029|ref|ZP_17201557.1| hypothetical protein HMPREF1074_03089 [Bacteroides xylanisolvens
CL03T12C04]
gi|392692292|gb|EIY85530.1| hypothetical protein HMPREF1074_03089 [Bacteroides xylanisolvens
CL03T12C04]
Length = 861
Score = 297 bits (760), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 175/441 (39%), Positives = 245/441 (55%), Gaps = 34/441 (7%)
Query: 35 ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
A + LP+ SL QR DL+ RL+L+EKV L+ + + A+PRLGIK YEWW+EALHGV
Sbjct: 19 ACKQLLPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGV 78
Query: 95 SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GG 146
G AT FPQ I +SFN +L + SDEAR
Sbjct: 79 GRAGL----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKR 128
Query: 147 TAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL-KVAASCK 205
GLT+W+PNVNIFRDPRWGRGQET GEDP L+G+ + VRGLQG + R K+ A K
Sbjct: 129 YQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLHACAK 188
Query: 206 HFTAYDLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
HF + WN R F+A+ + +D+ +T+ F+ V + V VMC+YN+ G P
Sbjct: 189 HFAVHSGPEWN---RHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPC 245
Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCG 322
C +L + +R EW G +VSDC ++ +Y + P E A+ADA+RAG DL+CG
Sbjct: 246 CGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPDKEHASADAVRAGTDLECG 305
Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
AV+ GL+ E +I+ +L LT + LG D +P+ + + + + +H
Sbjct: 306 SEYA-SLADAVKAGLIDEKEIDISLKRLLTARFELGEMDEQPA---WSEIPTSVLNSKEH 361
Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
Q LAL AR+ +VLL+N+ LPL+ H VAV+GPN++ +V GNY GI T L
Sbjct: 362 QALALRMARESLVLLQNKNNILPLN--THLKVAVMGPNANDSVMQWGNYNGIPAHTVTLL 419
Query: 443 QGIGRY---ARTIHQQGCKDV 460
+ + + I++ GC V
Sbjct: 420 EAVRAKLPEGQIIYEPGCDRV 440
Score = 117 bits (293), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 92/308 (29%), Positives = 136/308 (44%), Gaps = 49/308 (15%)
Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
A+ AD + G+ S+E E + DR + LP Q++L+ + A K
Sbjct: 591 AVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGKK 650
Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
+ + SG I AI+ A YPGQAGGTAI D L+G NPGG+LP+T+Y
Sbjct: 651 -VVFINYSGSAI--GLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707
Query: 581 QEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
+ LP + +M+ GRTYR+ + ++PFGHG+SYT+F + A
Sbjct: 708 D--VNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTDFTYGEAKL----- 752
Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW 699
S N G+ + +T + V NVG +DG + V+ P
Sbjct: 753 --------SKNTIAKGENVVLT------------IPVSNVGQRDGEEVVQVYLRRPGDKE 792
Query: 700 APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVS 759
P L AF++VH+PAG + V I + + S T R G + + GGT
Sbjct: 793 GPRYTLRAFKRVHIPAGKTESVAIPLTGVNFEWFDVESNTMRPLEGTYELLYGGTSDRNK 852
Query: 760 LHAATLGV 767
L + V
Sbjct: 853 LKTIVMNV 860
>gi|384428895|ref|YP_005638255.1| beta-glucosidase [Xanthomonas campestris pv. raphani 756C]
gi|341937998|gb|AEL08137.1| beta-glucosidase [Xanthomonas campestris pv. raphani 756C]
Length = 888
Score = 297 bits (760), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 168/407 (41%), Positives = 235/407 (57%), Gaps = 32/407 (7%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
QR L+ ++S +EKV ++ A A+PRLGI YEWW+E LHG++ G A
Sbjct: 38 QRAAALVAQMSREEKVAQAMNAAPAIPRLGIPAYEWWNEGLHGIARNG----------YA 87
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--GGT-------AGLTYWSPNVNIFR 161
T FPQ I A+S+N L + +G VVS EARA +N GG AGLT WSPN+NIFR
Sbjct: 88 TVFPQAIGLAASWNTQLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFR 147
Query: 162 DPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD-- 219
DPRWGRG ET GEDP L+G+ A ++RGLQG D + + A+ KH + +G +
Sbjct: 148 DPRWGRGMETYGEDPFLTGQLAVGFIRGLQGDDLEHPRTIATPKHIAVH-----SGPEPG 202
Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
R F+ VS +D+E T+ FR ++EG+ SVMC+YN ++G P CA +L +RG+W
Sbjct: 203 RHGFDVDVSPRDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACAADWLLNGRVRGDW 262
Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLL 338
G++VSDCD+V +F ++A A++AG DL+CG + L T A++RG +
Sbjct: 263 GFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGTAYRALGT--AIERGEV 320
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
E ++ +LV + RLG P Y LG KD+ ++ LAL+AA + IVLLK
Sbjct: 321 DEALLDQSLVRLFAARYRLGELQA-PRKDRYARLGAKDIDNAGNRALALQAAAESIVLLK 379
Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
N +LPL +AVIGPN+D + NY G + TPL G+
Sbjct: 380 NANATLPLKASTR--LAVIGPNADALAALEANYQGTSSQPVTPLLGL 424
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 97/287 (33%), Positives = 145/287 (50%), Gaps = 48/287 (16%)
Query: 477 QADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVL 526
Q+DA + +GL +E E L DR + LP QQ L+ + + K P ++VL
Sbjct: 616 QSDAVVAFVGLSPDVEGEELRIDVPGFDGGDRNDIALPAAQQALLERAKASGK-PLVVVL 674
Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
MSG + + +AK AI+ A YPGQ+GGTAIA L G NPGG+LP+T+Y +
Sbjct: 675 MSGSAVALNWAKT--HADAIVAAWYPGQSGGTAIARALAGDDNPGGRLPVTFY--RSTKD 730
Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
LP P S GRTYR++KG ++PFG+G+SYT F + P + A L
Sbjct: 731 LP-------PYVSYDMKGRTYRYFKGEALFPFGYGLSYTRFAY---ETPRLSATTLQ--- 777
Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLV 706
+G ++VT V+N G + G V+ P +P + LV
Sbjct: 778 -------AGSPLQVT------------TTVRNTGERAGDEVAQVYLQYPERPQSPLRSLV 818
Query: 707 AFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
F++VH+ G Q+ + + + LS VDR+GTR + G++ + +GG
Sbjct: 819 GFQRVHLQPGEQRTLTFTLD-ARALSDVDRTGTRAVEAGDYRLFVGG 864
>gi|21232323|ref|NP_638240.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
str. ATCC 33913]
gi|21114093|gb|AAM42164.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
str. ATCC 33913]
Length = 888
Score = 296 bits (758), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 168/407 (41%), Positives = 235/407 (57%), Gaps = 32/407 (7%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
QR L+ ++S +EKV ++ A A+PRLGI YEWWSE LHG++ G A
Sbjct: 38 QRAAALVAQMSREEKVAQAMNAAPAIPRLGIPAYEWWSEGLHGIARNG----------YA 87
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--GGT-------AGLTYWSPNVNIFR 161
T FPQ I A+S+N L + +G VVS EARA +N GG AGLT WSPN+NIFR
Sbjct: 88 TVFPQAIGLAASWNTQLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFR 147
Query: 162 DPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD-- 219
DPRWGRG ET GEDP L+G+ A ++RGLQG D + + A+ KH + +G +
Sbjct: 148 DPRWGRGMETYGEDPFLTGQLAVGFIRGLQGDDLEHPRTIATPKHIAVH-----SGPEPG 202
Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
R F+ VS +D+E T+ FR ++EG+ SVMC+YN ++G P CA +L +RG+W
Sbjct: 203 RHGFDVDVSPRDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACAADWLLNGRVRGDW 262
Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLL 338
G++VSDCD+V +F ++A +++AG DL+CG + L T A++RG +
Sbjct: 263 GFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAASLKAGHDLNCGTAYRALGT--AIERGEV 320
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
E ++ +LV + RLG P Y LG KD+ ++ LAL+AA + IVLLK
Sbjct: 321 DEALLDQSLVRLFAARYRLGELQA-PRKDRYARLGAKDIDNAGNRALALQAAAESIVLLK 379
Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
N +LPL +AVIGPN+D + NY G + TPL G+
Sbjct: 380 NANATLPLKA--GTRLAVIGPNADALAALEANYQGTSSQPVTPLLGL 424
Score = 139 bits (349), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 94/278 (33%), Positives = 143/278 (51%), Gaps = 48/278 (17%)
Query: 486 GLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
GL +E E L DR + LP QQ L+ + + K P ++VLMSG + +
Sbjct: 625 GLSPDVEGEELRIDVPGFDGGDRNDIALPAAQQALLERAKASGK-PLVVVLMSGSAVALN 683
Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
+AK AI+ A YPGQ+GGTAIA L G NPGG+LP+T+Y +LP
Sbjct: 684 WAKT--HADAIVAAWYPGQSGGTAIARALAGDDNPGGRLPVTFY--RSTKDLP------- 732
Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG 655
P S GRTYR++KG ++PFG+G+SYT+F + +AP + + L +G
Sbjct: 733 PYVSYDMKGRTYRYFKGEALFPFGYGLSYTSFAY---DAPQLSSTTLQ----------AG 779
Query: 656 KAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPA 715
++VT V+N G++ G V+ P +P + LV F++VH+
Sbjct: 780 SPLQVT------------TTVRNTGTRAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLQP 827
Query: 716 GAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
G Q+ + + + LS VDR+GTR + G++ + +GG
Sbjct: 828 GEQRTLTFTLD-ARALSDVDRTGTRAVEAGDYRLFVGG 864
>gi|188990656|ref|YP_001902666.1| beta-glucosidase [Xanthomonas campestris pv. campestris str. B100]
gi|167732416|emb|CAP50610.1| exported beta-glucosidase [Xanthomonas campestris pv. campestris]
Length = 888
Score = 296 bits (758), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 168/407 (41%), Positives = 235/407 (57%), Gaps = 32/407 (7%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
QR L+ ++S +EKV ++ A A+PRLGI YEWWSE LHG++ G A
Sbjct: 38 QRAAALVAQMSREEKVAQAMNAAPAIPRLGIPAYEWWSEGLHGIARNG----------YA 87
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--GGT-------AGLTYWSPNVNIFR 161
T FPQ I A+S+N L + +G VVS EARA +N GG AGLT WSPN+NIFR
Sbjct: 88 TVFPQAIGLAASWNTQLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFR 147
Query: 162 DPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD-- 219
DPRWGRG ET GEDP L+G+ A ++RGLQG D + + A+ KH + +G +
Sbjct: 148 DPRWGRGMETYGEDPFLTGQLAVGFIRGLQGDDLEHPRTIATPKHIAVH-----SGPEPG 202
Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
R F+ VS +D+E T+ FR ++EG+ SVMC+YN ++G P CA +L +RG+W
Sbjct: 203 RHGFDVDVSPRDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACAADWLLNGRVRGDW 262
Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLL 338
G++VSDCD+V +F ++A +++AG DL+CG + L T A++RG +
Sbjct: 263 GFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAASLKAGHDLNCGTAYRALGT--AIERGEV 320
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
E ++ +LV + RLG P Y LG KD+ ++ LAL+AA + IVLLK
Sbjct: 321 DEALLDQSLVRLFAARYRLGELQA-PRKDRYARLGAKDIDNAGNRALALQAAAESIVLLK 379
Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
N +LPL +AVIGPN+D + NY G + TPL G+
Sbjct: 380 NANATLPLKA--GTRLAVIGPNADALAALEANYQGTSSQPVTPLLGL 424
Score = 136 bits (342), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 92/278 (33%), Positives = 137/278 (49%), Gaps = 48/278 (17%)
Query: 486 GLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
GL +E E L DR + LP QQ L+ + + K P ++VLMSG + +
Sbjct: 625 GLSPDVEGEELRIDVPGFDGGDRNDIALPAAQQALLERAKASGK-PLVVVLMSGSAVALN 683
Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
+AK AI+ A YPGQ+GGTAIA L G NPGG+LP+T+Y +LP
Sbjct: 684 WAKT--HADAIVAAWYPGQSGGTAIARALAGDDNPGGRLPVTFY--RSTKDLP------- 732
Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG 655
P S GRTYR++KG ++PFG+G+SYT F + S+ +G
Sbjct: 733 PYVSYDMKGRTYRYFKGEALFPFGYGLSYTRFAYETPRL-------------SVTTLQAG 779
Query: 656 KAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPA 715
++VT V+N G + G V+ P +P + LV F++VH+
Sbjct: 780 SPLQVT------------TTVRNTGERAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLQP 827
Query: 716 GAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
G Q+ + + + LS VDR+GTR + G++ + +GG
Sbjct: 828 GEQRTLTFTLD-ARALSDVDRTGTRVVEAGDYRLFVGG 864
>gi|408369545|ref|ZP_11167326.1| glycoside hydrolase [Galbibacter sp. ck-I2-15]
gi|407745291|gb|EKF56857.1| glycoside hydrolase [Galbibacter sp. ck-I2-15]
Length = 881
Score = 296 bits (757), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 181/466 (38%), Positives = 262/466 (56%), Gaps = 51/466 (10%)
Query: 7 FFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKV 66
FF+ +L + S+G + PF L RV DL+ RL+++EK+
Sbjct: 10 LFFIAFVLSMSISNG----------------QQYPFQNPELDDSARVADLLERLTVEEKI 53
Query: 67 KLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNAT 126
L+ + A+ RLGI Y WW+E+LHGV+ G AT FPQ IT A+++++
Sbjct: 54 DQLLYTSPAIERLGIPEYNWWNESLHGVARAG----------YATVFPQSITIAAAWDSD 103
Query: 127 LWEAIGRVVSDEARAMYN--------GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVL 178
L + + +SDEARA ++ G GLT+WSPN+NIFRDPRWGRG ET GEDP L
Sbjct: 104 LLKEVADAISDEARAKHHEYIRRGQRGIYQGLTFWSPNINIFRDPRWGRGHETYGEDPYL 163
Query: 179 SGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--RFHFNAKVSKQDIEDTF 236
+G+ +YV+GLQG+D + LK+ A+ KHF + +G + R F+ SK+D+ +T+
Sbjct: 164 TGQLGIAYVKGLQGNDPNYLKLVATAKHFAVH-----SGPEPLRHEFDVSPSKRDLWETY 218
Query: 237 DVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYY 296
FR V +G V SVM +YN+V G A + L +R W +GY+VSDC ++ +
Sbjct: 219 LPAFRYLVKQGDVKSVMTAYNRVYGEAASAS-DTLFTILRDYWDFDGYVVSDCFAISDIW 277
Query: 297 DTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMR 356
EA+A A+ G DL+CG A Q+G+++E DI+ AL + +++
Sbjct: 278 KYHKIAKDAAEASAMAVIEGCDLNCGDSYE-KLNQAYQQGMVTEKDIDIALSRLMEARIK 336
Query: 357 LGMFDGEPSSQPYGHLGPKDVCTPD-HQELALEAARQGIVLLKNQGPSLPLSHIRHRTVA 415
LGMFD E PY + P +V T + H +LAL+AA++ IVLLKNQG LPLS ++VA
Sbjct: 337 LGMFDPE-QLVPYAQI-PFNVNTSEKHNQLALKAAKESIVLLKNQGDLLPLSK-DLKSVA 393
Query: 416 VIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGC 457
VIGPN+D ++ GNY G T LQGI G ++Q+G
Sbjct: 394 VIGPNADNIQSLWGNYNGNPKDPITVLQGIQNALGPQTTVVYQEGS 439
Score = 133 bits (335), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 88/284 (30%), Positives = 144/284 (50%), Gaps = 50/284 (17%)
Query: 483 LVMGLDQSIEAEALD----------RAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPI 532
+V+GL++ +E E +D R L LP Q+ L+ +V+ K P +LVL++G +
Sbjct: 607 MVLGLNERLEGEEMDVVVEGFAGGDRTALDLPASQRTLLKEVAKTGK-PIVLVLLNGSAL 665
Query: 533 DVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEM 592
+ +A + I AI+ AGY GQ GG A+A++LFG NP +LP+T+Y + +LP E
Sbjct: 666 SINWAAEN--IPAIMTAGYAGQQGGNAVAEVLFGDYNPAARLPVTYYKS--VEDLPDFE- 720
Query: 593 AMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINAT 652
GRTYR+++ +YPFG+G+SYT F ++ P+ + +N +
Sbjct: 721 ------DYNMDGRTYRYFEKEPLYPFGYGLSYTTFDYSKFQLPSKI---------DMNES 765
Query: 653 ISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW-APHKQLVAFEKV 711
I + V+V N G+ DG + V+ T G P ++LV F+++
Sbjct: 766 IE-----------------LSVEVTNTGAYDGDEVVQVYLTDEKGSTPRPIRELVGFKRI 808
Query: 712 HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTK 755
H+ G Q+V I + LS++D G I G +I +GG +
Sbjct: 809 HLKKGESQKVQFTIE-PRQLSMIDDKGDLVIEPGVFSISVGGEQ 851
>gi|322437617|ref|YP_004219707.1| glycoside hydrolase family protein [Granulicella tundricola
MP5ACTX9]
gi|321165510|gb|ADW71213.1| glycoside hydrolase family 3 domain protein [Granulicella
tundricola MP5ACTX9]
Length = 892
Score = 296 bits (757), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 169/436 (38%), Positives = 251/436 (57%), Gaps = 33/436 (7%)
Query: 38 RTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNV 97
+ LP+ +L QRV+DL+ R++L+EKV I+ A A+ RL + Y++WSE LHG++
Sbjct: 30 KPLPYMDPALTTQQRVDDLVSRMTLEEKVSQTINSAPAISRLNVPEYDYWSEGLHGIARS 89
Query: 98 GPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------G 149
G AT FPQ I A++++A L + IG V+S EARA +N G
Sbjct: 90 G----------YATMFPQAIGMAATWDAPLLQQIGDVISIEARAKFNEAIRHNIHSIYYG 139
Query: 150 LTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTA 209
LT WSPN+NIFRDPRWGRGQET GEDP L+G+ ++V+G+QG D + + A+ KHF
Sbjct: 140 LTIWSPNINIFRDPRWGRGQETYGEDPFLTGRLGVAFVKGIQGPDPNYFRAIATPKHFAV 199
Query: 210 YDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
+ + R N + + D+ DT+ FR + E S+MC+YN V G P CA
Sbjct: 200 H---SGPESTRHSANIEPTPHDLHDTYLPAFRATITEAHADSIMCAYNAVEGSPACASKL 256
Query: 270 ILKRTIRGEWRLNGYIVSDCDSVGVYYDT----QHFTSTPEEAAADAIRAGLDLDCGPFL 325
+L+ T+R +W G++ SDC ++ +Y T H + E AAA I+AG D +CG
Sbjct: 257 LLQDTLRRDWGFKGFVTSDCGAIDDFYATDYPSHHTSPDKEAAAAAGIKAGTDSNCGQTY 316
Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQE 384
L SAV++GL++E +I+ AL + T + +LG+FD P+++ + + +V +P HQ
Sbjct: 317 -LTLGSAVKKGLVTEAEIDTALKHLFTARFQLGLFD--PAAKVAFNAIPFSEVNSPAHQA 373
Query: 385 LALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG 444
LAL+AA + IVLLKN +LP RT+AVIGP++ + GNY I PL G
Sbjct: 374 LALKAAEESIVLLKNDAHTLPFKP-SVRTIAVIGPSAATLNNLEGNYNAIPLHPVLPLDG 432
Query: 445 I---GRYARTIHQQGC 457
I + ++ ++ QG
Sbjct: 433 ILTQFKSSKVLYAQGS 448
Score = 120 bits (302), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 87/281 (30%), Positives = 136/281 (48%), Gaps = 48/281 (17%)
Query: 485 MGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDV 534
+GL +E E + DR + LP QQ+++ V+ K P ++VL++G + V
Sbjct: 630 LGLSPELEGEEMPIHIEGFAGGDRTDIKLPAAQQQMLEAVAATGK-PLVVVLLNGSALAV 688
Query: 535 AFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM 594
+A + AAI+ A YPGQAGGTAIA+ L G +NP G+LP+T+Y I +P +
Sbjct: 689 NWAND--HAAAILEAWYPGQAGGTAIAETLAGKNNPAGRLPVTFYSS--IDQIPAFD--- 741
Query: 595 RPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATIS 654
RTYR+ K ++ FG+G+SYT F ++ N +S
Sbjct: 742 ----DYSMANRTYRYSKAKPLFEFGYGLSYTTFTYS-------------------NIKLS 778
Query: 655 GKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVP 714
+ + HA L V+ DV+N G G ++ TPP +P + L AF +VH+
Sbjct: 779 TQTL---HAGD---PLTVEADVRNTGRVAGDEVAELYLTPPHTAVSPQRALSAFTRVHLA 832
Query: 715 AGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTK 755
G + V + + LS VD G R + G + + + GT+
Sbjct: 833 PGELRHVTFTLD-PRTLSQVDEKGARAVTPGNYTLSVSGTQ 872
>gi|66767544|ref|YP_242306.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
str. 8004]
gi|66572876|gb|AAY48286.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
str. 8004]
Length = 888
Score = 296 bits (757), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 168/407 (41%), Positives = 235/407 (57%), Gaps = 32/407 (7%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
QR L+ ++S +EKV ++ A A+PRLGI YEWWSE LHG++ G A
Sbjct: 38 QRAAALVAQMSREEKVAQSMNAAPAIPRLGIPAYEWWSEGLHGIARNG----------YA 87
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--GGT-------AGLTYWSPNVNIFR 161
T FPQ I A+S+N L + +G VVS EARA +N GG AGLT WSPN+NIFR
Sbjct: 88 TVFPQAIGLAASWNTQLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFR 147
Query: 162 DPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD-- 219
DPRWGRG ET GEDP L+G+ A ++RGLQG D + + A+ KH + +G +
Sbjct: 148 DPRWGRGMETYGEDPFLTGQLAVGFIRGLQGDDLEHPRTIATPKHIAVH-----SGPEPG 202
Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
R F+ VS +D+E T+ FR ++EG+ SVMC+YN ++G P CA +L +RG+W
Sbjct: 203 RHGFDVDVSPRDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACAADWLLNGRVRGDW 262
Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLL 338
G++VSDCD+V +F ++A +++AG DL+CG + L T A++RG +
Sbjct: 263 GFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAASLKAGHDLNCGTAYRALGT--AIERGEV 320
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
E ++ +LV + RLG P Y LG KD+ ++ LAL+AA + IVLLK
Sbjct: 321 DEALLDQSLVRLFAARYRLGELQA-PRKDRYARLGAKDIDNAGNRALALQAAAESIVLLK 379
Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
N +LPL +AVIGPN+D + NY G + TPL G+
Sbjct: 380 NANATLPLKA--GTRLAVIGPNADALAALEANYQGTSSQPVTPLLGL 424
Score = 139 bits (349), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 94/278 (33%), Positives = 143/278 (51%), Gaps = 48/278 (17%)
Query: 486 GLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
GL +E E L DR + LP QQ L+ + + K P ++VLMSG + +
Sbjct: 625 GLSPDVEGEELRIDVPGFDGGDRNDIALPAAQQALLERAKASGK-PLVVVLMSGSAVALN 683
Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
+AK AI+ A YPGQ+GGTAIA L G NPGG+LP+T+Y +LP
Sbjct: 684 WAKT--HADAIVAAWYPGQSGGTAIARALAGDDNPGGRLPVTFY--RSTKDLP------- 732
Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG 655
P S GRTYR++KG ++PFG+G+SYT+F + +AP + + L +G
Sbjct: 733 PYVSYDMKGRTYRYFKGEALFPFGYGLSYTSFAY---DAPQLSSTTLQ----------AG 779
Query: 656 KAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPA 715
++VT V+N G++ G V+ P +P + LV F++VH+
Sbjct: 780 SPLQVT------------TTVRNTGTRAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLQP 827
Query: 716 GAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
G Q+ + + + LS VDR+GTR + G++ + +GG
Sbjct: 828 GEQRTLTFTLD-ARALSDVDRTGTRAVEAGDYRLFVGG 864
>gi|62321294|dbj|BAD94522.1| beta-xylosidase - like protein [Arabidopsis thaliana]
Length = 287
Score = 296 bits (757), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 137/287 (47%), Positives = 192/287 (66%), Gaps = 7/287 (2%)
Query: 479 DATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAK 538
D+ +LV+G DQSIE E DR L LPG+QQELV++V+MA++GP +LV+MSGG D+ FAK
Sbjct: 1 DSVVLVVGADQSIEREGHDRVDLYLPGKQQELVTRVAMAARGPVVLVIMSGGGFDITFAK 60
Query: 539 NDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQ 598
ND +I +I+W GYPG+AGG AI D++FG NP G LPMTWYPQ Y+ +PM+ M MRP +
Sbjct: 61 NDKKITSIMWVGYPGEAGGLAIPDVIFGRHNPSGNLPMTWYPQSYVEKVPMSNMNMRPDK 120
Query: 599 SKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAI 658
SK YPGR+YRFY G VY F ++YT F H + AP +V++ LD H ++ +++
Sbjct: 121 SKGYPGRSYRFYTGETVYAFADALTYTKFDHQLIKAPRLVSLSLDENHPCRSSEC--QSL 178
Query: 659 KVTHAKCNRLTLG-----VQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHV 713
C G V ++VKN G + G+HT+ +F+T P H +P KQL+ FEK+ +
Sbjct: 179 DAIGPHCENAVEGGSDFEVHLNVKNTGDRAGSHTVFLFTTSPQVHGSPIKQLLGFEKIRL 238
Query: 714 PAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
+ V N++VCK LSVVD +G R+I LG H +H+G KHS+++
Sbjct: 239 GKSEEAVVRFNVNVCKDLSVVDETGKRKIALGHHLLHVGSLKHSLNI 285
>gi|384421334|ref|YP_005630694.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzicola
BLS256]
gi|353464247|gb|AEQ98526.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzicola
BLS256]
Length = 904
Score = 295 bits (756), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 165/422 (39%), Positives = 239/422 (56%), Gaps = 33/422 (7%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
P+ + QR DL+ R++L+EK + + A A+PRL + Y+WW+EALHGV+ G
Sbjct: 36 PYLDTARSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWWNEALHGVARAG-- 93
Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTY 152
GAT FPQ I A++F+ L + +SDEARA ++ GLT+
Sbjct: 94 --------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHARYQGLTF 145
Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG--------SDGDRL-KVAAS 203
WSPN+NIFRDPRWGRGQET GEDP L+ + ++V+GLQG + G+ K+ A+
Sbjct: 146 WSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGAAAPKNAQGEPYRKLDAT 205
Query: 204 CKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVP 263
KHF + +R HF+A+ S++D+ +T+ F V +GKV +VM +YN+V G
Sbjct: 206 AKHFAVHSGPE---AERHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGES 262
Query: 264 TCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGP 323
A +L+ +R +W GY+VSDC ++ + +T E+AAA A+ G +L+CG
Sbjct: 263 ASASKFLLQDVLRQQWGFKGYVVSDCWAIVDVWKHHKIVATREQAAALAVTHGTELECGE 322
Query: 324 FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQ 383
+AV +GL+ E I+ AL +T +MRLGMFD P P+ + +P H
Sbjct: 323 EYST-LPAAVHQGLIDEAQIDTALQTLMTARMRLGMFD-PPGQLPWSKIPASVNQSPAHD 380
Query: 384 ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQ 443
LA AR+ +VLLKN G LPLS + +AVIGP +D T+ ++GNY G T LQ
Sbjct: 381 ALARRTARESLVLLKNDG-LLPLSRATLKRIAVIGPTADDTMALLGNYYGTPAAPVTVLQ 439
Query: 444 GI 445
GI
Sbjct: 440 GI 441
Score = 124 bits (310), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 92/301 (30%), Positives = 140/301 (46%), Gaps = 51/301 (16%)
Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
A+D +R AD + V GL +E E + DR L LP Q+EL+ + K
Sbjct: 631 ALDVARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRELLEALQATGK- 689
Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
P + VL +G + + +A+ + AI+ A YPGQ GGTA+AD LFG +NPGG+LP+T+Y
Sbjct: 690 PVVAVLTAGSALAIDWAQQ--HVPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFY- 746
Query: 581 QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
+E T + M GRTYR++ G +YPFGHG+SYT F ++
Sbjct: 747 KESETLPAFDDYTMH--------GRTYRYFGGTPLYPFGHGLSYTQFAYS---------- 788
Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP--PAGH 698
D R T G L V VKN G + G + ++ P P
Sbjct: 789 --DLRLDRSTLTADG-------------ALTATVAVKNTGQRAGDEVVQLYLHPLKPQRE 833
Query: 699 WAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGTKHS 757
A K+L F+++ + G Q+ + I+ L + D + + G + + IG +
Sbjct: 834 RA-GKELRGFQRLALQPGEQRELRFTINATDALRIYDAQRKAYTVDPGAYEVQIGASSAD 892
Query: 758 V 758
+
Sbjct: 893 I 893
>gi|256393466|ref|YP_003115030.1| glycoside hydrolase family 3 [Catenulispora acidiphila DSM 44928]
gi|256359692|gb|ACU73189.1| glycoside hydrolase family 3 domain protein [Catenulispora
acidiphila DSM 44928]
Length = 1343
Score = 295 bits (755), Expect = 6e-77, Method: Compositional matrix adjust.
Identities = 230/733 (31%), Positives = 339/733 (46%), Gaps = 112/733 (15%)
Query: 51 QRVNDLIGRLSLQEKV-KLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPG 109
+R DL+ R++L EK +L + A A+PRLG++ Y +WSE HGV+ +G + G D G
Sbjct: 58 ERAADLVSRMTLPEKAAQLQTNSAPAIPRLGVQEYTYWSEGQHGVNTLGADSNRG-DVTG 116
Query: 110 ---ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG---------------LT 151
ATSFP S++ L VSDE R + G LT
Sbjct: 117 GVHATSFPVNFAATMSWDPALTYKETTAVSDEVRGFLDKSLWGTGQNNLGPSASDYGALT 176
Query: 152 YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD--GDR----LKVAASCK 205
+W+PNVN+ RDP WGR E+ GEDP L+ A ++V G QG G + LKVAA+ K
Sbjct: 177 FWAPNVNMDRDPLWGRTNESFGEDPYLTSTMAGAFVDGYQGQSMTGQQQTPYLKVAATAK 236
Query: 206 HFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTC 265
H++ ++++ R ++ + +I D + F V + V+ +M SYN VNG P+
Sbjct: 237 HYSLNNIED----SRHTGSSDTTDANIRDYYTKQFASLVRDAHVSGIMTSYNAVNGTPSP 292
Query: 266 ADPNILKRTIRGEWRLNGYIVSDCDSVGVYYD------------------TQHFTSTPEE 307
AD + ++ + GY SDC ++G Y T + T
Sbjct: 293 ADTYTVDELLQATYGFAGYTTSDCGAIGDVYGAASHGWAPPGWTSNGTSWTNNATGRQIS 352
Query: 308 AAAD----AIRAGLDLDC--GPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFD 361
AAA AIRAG L+C G + +A+ GLLS ++ L TV+M G FD
Sbjct: 353 AAAGGQAFAIRAGTQLNCAGGEMTAQNISAAIDLGLLSNGVVDATLTRLFTVRMETGEFD 412
Query: 362 GEPSSQPYGHLG----PKD-VCTPDHQELALEAARQGIVLLKNQGPS------LPLSHIR 410
P G +G KD + +P HQ LA + A IVLL+N S LP+ +
Sbjct: 413 ------PAGKVGYTKITKDQIESPAHQALAEQVAANDIVLLQNGAVSGTSAKLLPVDPAK 466
Query: 411 HRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDV-ACADDQLF- 468
+V ++G ++ +G Y+G +QGI + + AC
Sbjct: 467 TDSVVIVGDLANKVT--LGGYSGEPTHEVNAVQGITAAVQAANPSATVTFDACGTGTQIT 524
Query: 469 -----GAAIDAS-RQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPT 522
AA A+ + A ++V G D S+ EA DR+ L LPG L+S+VS T
Sbjct: 525 TPASCSAATQAAIKSASLVLVVAGSDLSVADEANDRSTLALPGNYDSLISQVSALGNPRT 584
Query: 523 ILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQE 582
LV+ + GP D+ A+ D AI+++GY GQ+ GTA+A +LFG NP G L TWY +
Sbjct: 585 ALVMQADGPYDIQDAQKD--FPAIVFSGYNGQSQGTALAQVLFGQQNPAGHLDFTWYSGD 642
Query: 583 YITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPL 642
PM + PSQ+ GRTY+++ G YPFG+G SY++F ++
Sbjct: 643 SQL-APMDNYGLTPSQTGGL-GRTYQYFTGTPTYPFGYGQSYSSFAYSHVQV-------- 692
Query: 643 DGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG--HWA 700
G N G T+ V DVKN G+ G +++ PP +
Sbjct: 693 ----GPQNTNADG-------------TVHVSFDVKNTGTVAGTTVAQLYAAPPGAGTNDT 735
Query: 701 PHKQLVAFEKVHV 713
+QL F+K +
Sbjct: 736 TREQLAGFQKTNT 748
>gi|423226625|ref|ZP_17213090.1| hypothetical protein HMPREF1062_05276 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392628884|gb|EIY22909.1| hypothetical protein HMPREF1062_05276 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 863
Score = 295 bits (755), Expect = 7e-77, Method: Compositional matrix adjust.
Identities = 170/432 (39%), Positives = 247/432 (57%), Gaps = 32/432 (7%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
P+ +L +R NDL+GRL+L+EK L+ + + A+PRLGIK Y+WW+EALHGV G
Sbjct: 25 PYKNPALSPEERANDLVGRLTLEEKAALMQNTSPAIPRLGIKAYDWWNEALHGVGRAGL- 83
Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAM-----YNGGTA---GLTY 152
AT FPQ I +SFN L + VSDEARA GG GLT
Sbjct: 84 ---------ATVFPQAIGMGASFNNELLYDVFTAVSDEARAKNTEFSKEGGLKRYQGLTM 134
Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL-KVAASCKHFTAYD 211
W+PN+NIFRDPRWGRGQET GEDP L+G+ + VRGLQG +G++ K+ A KH+ +
Sbjct: 135 WTPNINIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEGEKYDKLHACAKHYAVHS 194
Query: 212 LDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNI 270
WN R FNA+ + +D+ +T+ F+ V + V VMC+YN+ G P C +
Sbjct: 195 GPEWN---RHSFNAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPCCGSNRL 251
Query: 271 LKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEE--AAADAIRAGLDLDCGPFLGLH 328
L + +R EW +VSDC ++ +Y+ + P++ A+A A+ +G D++CG
Sbjct: 252 LMQILRDEWGYKEIVVSDCWAISDFYNKDAHETDPDKQHASAKAVLSGTDVECGDSYASL 311
Query: 329 TESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALE 388
E AV+ GL+ E I+ +L + + LG D EPS + + V + +H+ELAL
Sbjct: 312 PE-AVKEGLIDEKQIDISLKRLMKARFELGEMD-EPSQVSWAQIPYSVVDSKEHRELALR 369
Query: 389 AARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY 448
AR+ +VLL+N LPL+ ++ VAV+GPN++ +V GNY G T L+GI Y
Sbjct: 370 MARESLVLLQNNQSLLPLN--KNLKVAVVGPNANDSVMQWGNYNGFPSHTITLLEGIREY 427
Query: 449 ---ARTIHQQGC 457
++ I++ GC
Sbjct: 428 LPESQIIYEPGC 439
Score = 117 bits (292), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 91/302 (30%), Positives = 135/302 (44%), Gaps = 49/302 (16%)
Query: 472 IDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGP 521
+D ++AD I G+ ++E E + DR + LP Q L++++ A K
Sbjct: 594 VDKVKEADVIIFAGGISPAVEGEEMHVNIPGFKGGDRETIELPSIQSRLLAELKKAGK-K 652
Query: 522 TILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQ 581
+ V SG I A AI+ A YPGQAGGTAIA++LFG NP G+LP+T+Y
Sbjct: 653 IVFVNFSGSAI--ALTPESKTCDAILQAWYPGQAGGTAIANVLFGDYNPAGRLPVTFYKS 710
Query: 582 EYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
LP + +M+ GRTYR+ ++PFGHG+SYT F +
Sbjct: 711 --TKQLPDFEDYSMK--------GRTYRYMTENPLFPFGHGLSYTTFQYG---------- 750
Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA 700
NA+++ IK ++TL + V N G DG + V+ P
Sbjct: 751 ---------NASLNTSEIK----DGEQVTL--TIPVSNTGKYDGEEVVQVYLRHPGDKEG 795
Query: 701 PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
P L AF++V + GA V I + + + T R G++ I GGT L
Sbjct: 796 PSHALRAFKRVAIAKGATNNVTIPLSKENFEWFDTSTNTMRPIEGDYEILYGGTSELKQL 855
Query: 761 HA 762
+
Sbjct: 856 KS 857
>gi|224537384|ref|ZP_03677923.1| hypothetical protein BACCELL_02262 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521009|gb|EEF90114.1| hypothetical protein BACCELL_02262 [Bacteroides cellulosilyticus
DSM 14838]
Length = 863
Score = 294 bits (753), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 170/432 (39%), Positives = 247/432 (57%), Gaps = 32/432 (7%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
P+ +L +R NDL+GRL+L+EK L+ + + A+PRLGIK Y+WW+EALHGV G
Sbjct: 25 PYKNPALSPEERANDLVGRLTLEEKAALMQNTSPAIPRLGIKAYDWWNEALHGVGRAGL- 83
Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAM-----YNGGTA---GLTY 152
AT FPQ I +SFN L + VSDEARA GG GLT
Sbjct: 84 ---------ATVFPQAIGMGASFNNELLYDVFTAVSDEARAKNTEFSKEGGLKRYQGLTM 134
Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL-KVAASCKHFTAYD 211
W+PN+NIFRDPRWGRGQET GEDP L+G+ + VRGLQG +G++ K+ A KH+ +
Sbjct: 135 WTPNINIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEGEKYDKLHACAKHYAVHS 194
Query: 212 LDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNI 270
WN R FNA+ + +D+ +T+ F+ V + V VMC+YN+ G P C +
Sbjct: 195 GPEWN---RHSFNAENIDPRDLWETYLPAFKNLVQKAHVKEVMCAYNRFEGEPCCGSNRL 251
Query: 271 LKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEE--AAADAIRAGLDLDCGPFLGLH 328
L + +R EW +VSDC ++ +Y+ + P++ A+A A+ +G D++CG
Sbjct: 252 LMQILRDEWGYKEIVVSDCWAISDFYNKGAHETDPDKQHASAKAVLSGTDVECGDSYASL 311
Query: 329 TESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALE 388
E AV+ GL+ E I+ +L + + LG D EPS + + V + +H+ELAL
Sbjct: 312 PE-AVKEGLIDEKQIDISLKRLMKARFELGEMD-EPSQVSWAQIPYSVVDSKEHRELALR 369
Query: 389 AARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY 448
AR+ +VLL+N LPL+ ++ VAV+GPN++ +V GNY G T L+GI Y
Sbjct: 370 MARESLVLLQNNQSLLPLN--KNLKVAVVGPNANDSVMQWGNYNGFPSHTITLLEGIREY 427
Query: 449 ---ARTIHQQGC 457
++ I++ GC
Sbjct: 428 LPESQIIYEPGC 439
Score = 116 bits (291), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 91/302 (30%), Positives = 135/302 (44%), Gaps = 49/302 (16%)
Query: 472 IDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGP 521
+D ++AD I G+ ++E E + DR + LP Q L++++ A K
Sbjct: 594 VDKVKEADVIIFAGGISPAVEGEEMHVNIPGFKGGDRETIELPSIQSRLLAELKKAGK-K 652
Query: 522 TILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQ 581
+ V SG I A AI+ A YPGQAGGTAIA++LFG NP G+LP+T+Y
Sbjct: 653 IVFVNFSGSAI--ALTPESKTCDAILQAWYPGQAGGTAIANVLFGDYNPAGRLPVTFYKS 710
Query: 582 EYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
LP + +M+ GRTYR+ ++PFGHG+SYT F +
Sbjct: 711 --TKQLPDFEDYSMK--------GRTYRYMTENPLFPFGHGLSYTTFQYG---------- 750
Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA 700
NA+++ IK ++TL + V N G DG + V+ P
Sbjct: 751 ---------NASLNTSEIK----DGEQVTL--TIPVSNTGKYDGEEVVQVYLRHPGDKEG 795
Query: 701 PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
P L AF++V + GA V I + + + T R G++ I GGT L
Sbjct: 796 PSHALRAFKRVAIAKGATNNVTIPLSKENFEWFDTSTNTMRPIEGDYEILYGGTSELKQL 855
Query: 761 HA 762
+
Sbjct: 856 KS 857
>gi|229580225|ref|YP_002838625.1| glycoside hydrolase family protein [Sulfolobus islandicus
Y.G.57.14]
gi|229581131|ref|YP_002839530.1| glycoside hydrolase family protein [Sulfolobus islandicus
Y.N.15.51]
gi|228010941|gb|ACP46703.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
Y.G.57.14]
gi|228011847|gb|ACP47608.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
Y.N.15.51]
Length = 754
Score = 294 bits (752), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 216/690 (31%), Positives = 349/690 (50%), Gaps = 98/690 (14%)
Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQ 169
+T+FPQ I AS++N L I V+ + R + G L SP +++ +DPRWGR +
Sbjct: 101 STAFPQAIGLASTWNPELVMDIASVIRSQGRLV--GVNQCL---SPVLDVCKDPRWGRCE 155
Query: 170 ETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL-DNWNGVDRFHFNAKVS 228
ET GEDP L +Y+ GLQG + ++ A+ KHF A+ + + + H V
Sbjct: 156 ETYGEDPYLVASMGLAYITGLQGDN----QLVATAKHFAAHGFPEGGRNIAQVH----VG 207
Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
+++ +TF PF + V GKV S+M +Y++++G+P +P +L +R EW +G +VSD
Sbjct: 208 NRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGIPCHGNPQLLTNILRQEWGFDGIVVSD 267
Query: 289 CDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE---SAVQRGLLSEIDINN 345
D + S EAA A+ +G+D++ P + + E +A++ GL+ E I+
Sbjct: 268 YDGIRQLETIHRVASNKMEAAILALESGVDIEF-PTIDCYGEPLVNALKEGLVPESLIDR 326
Query: 346 ALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQ--ELALEAARQGIVLLKNQGPS 403
A+ L ++ RLG+ D P+ + DH+ ELAL+ AR+ IVLLKN+
Sbjct: 327 AVERVLRIKDRLGLLD-----NPFVNENSVPEKLDDHKSRELALKTARESIVLLKNENNI 381
Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYA-----GIACG--YTTPLQGIGRY---ARTIH 453
LPLS ++ +AVIGPN++ M+G+Y I G T LQGI + ++ ++
Sbjct: 382 LPLSKNVNK-IAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQGIVKKVGESKVLY 440
Query: 454 QQGCKDVACADDQLFGAAIDASRQADATILVMG---------LD----------QSIEAE 494
+GC D+A + F AI+ +RQAD I +MG +D Q++ E
Sbjct: 441 AKGC-DIASESKEGFAEAIEIARQADVIIAIMGEKSGLPLSWMDIPSEEEFKKYQAVTGE 499
Query: 495 ALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQ 554
DR+ L LPG Q+EL+ ++ K P ILVL++G P+ ++ N + A+I A +PG+
Sbjct: 500 GNDRSSLRLPGVQEELLKELYKTGK-PIILVLINGRPLVLSSIIN--YVKAVIEAWFPGE 556
Query: 555 AGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPG--RTYRFYKG 612
GG AIAD++FG NPGG+LP+T+ PM + P R P R Y +
Sbjct: 557 EGGNAIADVIFGDYNPGGRLPITF---------PM-DTGQIPLYYNRKPSSFRPYVMLRS 606
Query: 613 PVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGV 672
++ FG+G+SYT F ++ N ++ K I + +
Sbjct: 607 SPLFTFGYGLSYTQFEYS-------------------NLEVTPKEIGPNS------NIAI 641
Query: 673 QVDVKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVGINIHVCKYL 731
+DVKNVG +G + ++ + A P K+L F K+H+ G ++RV I + L
Sbjct: 642 SIDVKNVGKMEGDDVVQLYVSKTFSSVARPVKELKGFAKIHLKPGEKRRVKF-ILPTEAL 700
Query: 732 SVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
+ D + GE+ + IG + ++ L
Sbjct: 701 AFYDSFMRLVVEKGEYQLLIGNSSENIILR 730
>gi|325929067|ref|ZP_08190221.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas perforans
91-118]
gi|325540562|gb|EGD12150.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas perforans
91-118]
Length = 850
Score = 293 bits (751), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 167/426 (39%), Positives = 241/426 (56%), Gaps = 36/426 (8%)
Query: 60 LSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITT 119
++L+EK + + A A+PRLG+ Y+WW+EALHGV+ G GAT FPQ I
Sbjct: 1 MTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG----------GATVFPQAIGM 50
Query: 120 ASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYWSPNVNIFRDPRWGRGQET 171
A++F+ L + +SDEARA ++ GLT+WSPN+NIFRDPRWGRGQET
Sbjct: 51 AATFDLPLMHEVATAISDEARAKHHQFLRQNQHARYQGLTFWSPNINIFRDPRWGRGQET 110
Query: 172 PGEDPVLSGKYAASYVRGLQGSDGDRLKVA---------ASCKHFTAYDLDNWNGVDRFH 222
GEDP L+ + ++V+GLQG D K A A+ KHF + DR H
Sbjct: 111 YGEDPFLTARMGVTFVQGLQGEGADAPKNAQGEPYRKLDATAKHFAVHSGPE---ADRHH 167
Query: 223 FNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLN 282
F+A+ S++D+ +T+ F V +GKV +VM +YN+V G A +L+ +R +W
Sbjct: 168 FDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGESASASKFLLQDVLRQQWGFK 227
Query: 283 GYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEID 342
GY+VSDC ++ + +T E+AAA A++ G +L+CG +AV++GL+ E
Sbjct: 228 GYVVSDCWAIVDIWKHHKIVATREQAAALAVKHGTELECGEEYST-LPAAVRQGLIDEAQ 286
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGP 402
I+ AL +T +MRLGMFD P P+ + +P H LA AR+ +VLLKN G
Sbjct: 287 IDTALTTLMTARMRLGMFD-PPGQLPWSTIPASVNQSPAHDALARRTARESLVLLKNDG- 344
Query: 403 SLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI---GRYARTIHQQGCKD 459
LPLS + + +AVIGP +D T+ ++GNY G T LQGI A+ ++ +G
Sbjct: 345 LLPLSRAKLKRIAVIGPTADDTMALLGNYYGTPAAPVTVLQGIRAAAPNAQVLYARGADL 404
Query: 460 VACADD 465
V DD
Sbjct: 405 VEGRDD 410
Score = 124 bits (312), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 92/303 (30%), Positives = 146/303 (48%), Gaps = 55/303 (18%)
Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
A+D +R AD + V GL +E E + DR L LP Q++L+ + K
Sbjct: 577 ALDVARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEALQATGK- 635
Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
P + VL +G + + +A+ + AI+ A YPGQ GGTA+AD LFG +NPGG+LP+T+Y
Sbjct: 636 PVVAVLTTGSALAIDWAQQ--HLPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFY- 692
Query: 581 QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHT--VANAPTVV 638
+E T + AMR GRTYR++ G +YPFGHG+SYT F ++ + T+
Sbjct: 693 KESETLPAFDDYAMR--------GRTYRYFGGTPLYPFGHGLSYTQFAYSGLRLDRTTIA 744
Query: 639 AVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP--PA 696
A GS+ AT++ VKN G + G + ++ P P
Sbjct: 745 A------DGSLTATVT---------------------VKNTGQRAGDEVVQLYLHPLTPQ 777
Query: 697 GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGTK 755
A K+L F+++ + G Q+ + + L + D + + G + + IG +
Sbjct: 778 RERA-GKELHGFQRIALQPGEQRALHFTLDAKNALRIYDAQRKAYAVDPGAYEVQIGASS 836
Query: 756 HSV 758
+
Sbjct: 837 ADI 839
>gi|385774250|ref|YP_005646817.1| glycoside hydrolase family protein [Sulfolobus islandicus HVE10/4]
gi|323478365|gb|ADX83603.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
HVE10/4]
Length = 754
Score = 293 bits (751), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 215/690 (31%), Positives = 347/690 (50%), Gaps = 98/690 (14%)
Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQ 169
+T+FPQ I AS++N L I V+ + R + G L SP +++ +DPRWGR +
Sbjct: 101 STAFPQAIGLASTWNLELVMDIASVIRSQGRLV--GVNQCL---SPVLDVCKDPRWGRCE 155
Query: 170 ETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL-DNWNGVDRFHFNAKVS 228
ET GEDP L +Y+ GLQG + ++ A+ KHF A+ + + + H V
Sbjct: 156 ETYGEDPYLVASMGLAYITGLQGDN----QLVATAKHFAAHGFPEGGRNIAQVH----VG 207
Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
+++ +TF PF + V GKV S+M +Y++++G+P +P +L +R EW +G +VSD
Sbjct: 208 NRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGIPCHGNPQLLTNILRQEWGFDGIVVSD 267
Query: 289 CDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE---SAVQRGLLSEIDINN 345
D + S EAA A+ +G+D++ P + ++E +A+ GL+ E I+
Sbjct: 268 YDGIRQLETIHRVASNKMEAAILALESGVDIEF-PTIDCYSEPLVNALTEGLVPESLIDR 326
Query: 346 ALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQ--ELALEAARQGIVLLKNQGPS 403
A+ L ++ RLG+ D P+ + DH+ ELAL+ AR+ IVLLKN+
Sbjct: 327 AVERVLRIKDRLGLLD-----NPFVNENSVPEKLDDHKSRELALKTARESIVLLKNENNI 381
Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYA-----GIACG--YTTPLQGIGRY---ARTIH 453
LPLS ++ +AVIGPN++ M+G+Y I G T LQG+ + ++ ++
Sbjct: 382 LPLSKNVNK-IAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQGVVKKVGESKVLY 440
Query: 454 QQGCKDVACADDQLFGAAIDASRQADATILVMGLD-------------------QSIEAE 494
+GC D+A + F AI+ +RQAD I VMG Q++ E
Sbjct: 441 AKGC-DIASESKEGFAEAIEIARQADVIIAVMGEKSGLPLSWTDIPSEEEFKKYQAVTGE 499
Query: 495 ALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQ 554
DR+ L LPG Q+EL+ ++ K P ILVL++G P+ ++ N + A+I A +PG+
Sbjct: 500 GNDRSSLRLPGVQEELLKELYKTGK-PIILVLINGRPLVLSPIIN--YVKAVIEAWFPGE 556
Query: 555 AGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPG--RTYRFYKG 612
GG AIAD++FG NPGG+LP+T+ PM + P R P R Y +
Sbjct: 557 EGGNAIADVIFGDYNPGGRLPITF---------PM-DTGQIPLYYNRKPSSFRPYVMLRS 606
Query: 613 PVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGV 672
++ FG+G+SYT F ++ N ++ K I + +
Sbjct: 607 SPLFTFGYGLSYTQFEYS-------------------NLEVTPKEIGPNS------NIAI 641
Query: 673 QVDVKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVGINIHVCKYL 731
+DVKNVG +G + ++ + A P K+L F K+H+ G ++RV I + L
Sbjct: 642 SIDVKNVGKMEGDDVVQLYVSKTFSSVARPVKELKGFAKIHLKPGEKRRVKF-ILPTEAL 700
Query: 732 SVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
+ D + GE+ + IG + ++ L
Sbjct: 701 AFYDSFMRLVVEKGEYQLLIGNSSENIILR 730
>gi|218186207|gb|EEC68634.1| hypothetical protein OsI_37026 [Oryza sativa Indica Group]
Length = 1241
Score = 293 bits (751), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 161/328 (49%), Positives = 203/328 (61%), Gaps = 30/328 (9%)
Query: 133 RVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG 192
+ VS EARAMYN G GLTYWSPN+N+ RDPRWGR ETPGEDP + G+YA ++VRG+Q
Sbjct: 916 QAVSTEARAMYNMGKGGLTYWSPNINVVRDPRWGRALETPGEDPYVVGRYAVNFVRGMQD 975
Query: 193 SDGDR------------LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPF 240
G LK +A CKH+ AYDLD+W+ RF F+A+V ++D+ +TF PF
Sbjct: 976 IPGHEAVAAGGDPNTRPLKTSACCKHYAAYDLDDWHNHTRFEFDARVDERDMVETFQRPF 1035
Query: 241 RMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQH 300
MCV +G V+SVMCSYN+VNG+P CAD +L +TIR +W L+GYIVSDCD+V V D
Sbjct: 1036 EMCVRDGDVSSVMCSYNRVNGIPACADARLLSQTIRRDWGLHGYIVSDCDAVRVMTDNAT 1095
Query: 301 FTS-TPEEAAADAIRAGLDLDCGP-------------FLGLHTESAVQRGLLSEIDINNA 346
+ T EA+A A++AGLDLDCG FL + AV +G + E DI+NA
Sbjct: 1096 WLGYTGAEASAAALKAGLDLDCGESWKNETDGHPLMDFLTTYGMEAVNKGKMRESDIDNA 1155
Query: 347 LVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPL 406
L N MRLG FD Y LG +D+CT H+ LAL+ ARQGIVLLKN LPL
Sbjct: 1156 LTNQYMTLMRLGYFD---DIAQYSSLGRQDICTDQHKTLALDGARQGIVLLKNDNKLLPL 1212
Query: 407 SHIRHRTVAVIGPNSDV-TVTMIGNYAG 433
+ V V GP+ M G+Y G
Sbjct: 1213 DANKVGFVNVRGPHVQAPEKIMDGDYTG 1240
>gi|189464583|ref|ZP_03013368.1| hypothetical protein BACINT_00926 [Bacteroides intestinalis DSM
17393]
gi|189436857|gb|EDV05842.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 879
Score = 293 bits (750), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 169/432 (39%), Positives = 246/432 (56%), Gaps = 32/432 (7%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
P+ +L +R NDL+GRL+L+EK L+ + + A+PRLGIK Y+WW+EALHGV G
Sbjct: 41 PYKNPALSPEERANDLVGRLTLEEKAALMQNTSPAIPRLGIKAYDWWNEALHGVGRAGL- 99
Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAM-----YNGGTA---GLTY 152
AT FPQ I +SFN L + +SDEARA GG GLT
Sbjct: 100 ---------ATVFPQAIGMGASFNNELLYDVFTAISDEARAKNTEFSKEGGLKRYQGLTM 150
Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL-KVAASCKHFTAYD 211
W+PN+NIFRDPRWGRGQET GEDP L+ + + VRGLQG +G++ K+ A KH+ +
Sbjct: 151 WTPNINIFRDPRWGRGQETYGEDPYLTSQMGMAVVRGLQGPEGEKYDKLHACAKHYAVHS 210
Query: 212 LDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNI 270
WN R FNA+ + +D+ +T+ F+ V + V VMC+YN+ G P C +
Sbjct: 211 GPEWN---RHSFNAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPCCGSNRL 267
Query: 271 LKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEE--AAADAIRAGLDLDCGPFLGLH 328
L +R EW +VSDC ++ +Y+ + P++ A+A A+ +G D++CG G
Sbjct: 268 LMHILRDEWGYKEIVVSDCWAISDFYNKGAHETDPDKQHASAKAVLSGTDIECGDSYGSL 327
Query: 329 TESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALE 388
E AV+ GL+ E I+ +L + + LG D EPS + + V + +H+ELAL
Sbjct: 328 PE-AVKEGLIDEKQIDISLKRLMKARFELGEMD-EPSQVSWAQIPYSVVDSKEHRELALR 385
Query: 389 AARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY 448
AR+ +VLL+N LPL+ ++ VAV+GPN++ +V GNY G T L+GI Y
Sbjct: 386 MARESLVLLQNNQSLLPLN--KNLKVAVVGPNANDSVMQWGNYNGFPSHTITLLEGIREY 443
Query: 449 ---ARTIHQQGC 457
++ I++ GC
Sbjct: 444 LPESQIIYEPGC 455
Score = 115 bits (288), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 87/302 (28%), Positives = 137/302 (45%), Gaps = 49/302 (16%)
Query: 472 IDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGP 521
++ ++AD I G+ ++E E + DR + LP Q L++++ A K
Sbjct: 610 VNKVKEADVIIFAGGISPAVEGEEMHVNIPGFKGGDRETIELPSIQSRLLAELKKAGK-K 668
Query: 522 TILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQ 581
+ V SG I A AI+ A YPGQAGGTAIA++LFG NP G+LP+T+Y
Sbjct: 669 IVFVNFSGSAI--ALTPESKTCDAILQAWYPGQAGGTAIANVLFGDYNPAGRLPVTFYKS 726
Query: 582 EYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
+ LP + +M+ RTYR+ ++PFGHG+SYT F
Sbjct: 727 --TSQLPGFEDYSMKE--------RTYRYMTEAPLFPFGHGLSYTTF------------- 763
Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA 700
R+G +A+++ + +K + + V NVG DG + V+ P
Sbjct: 764 ----RYG--DASLNTQEVKDGEQTI------LTIPVSNVGEYDGEEVVQVYLRRPGDKEG 811
Query: 701 PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
P L AF++ ++ GA V +++ + + T R G++ I GGT L
Sbjct: 812 PSHALRAFKRANIAKGATSNVTVSLSKEDFEWFDTETNTMRPIEGDYEILYGGTSELKQL 871
Query: 761 HA 762
A
Sbjct: 872 KA 873
>gi|319788503|ref|YP_004147978.1| glycoside hydrolase [Pseudoxanthomonas suwonensis 11-1]
gi|317467015|gb|ADV28747.1| glycoside hydrolase family 3 domain protein [Pseudoxanthomonas
suwonensis 11-1]
Length = 916
Score = 293 bits (749), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 175/445 (39%), Positives = 248/445 (55%), Gaps = 37/445 (8%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
P+ SL +R L+ R++L+EK + + + A+ RLG+ Y+WW+EALHGV+ G
Sbjct: 49 PWLDTSLSFEERAAALVSRMTLEEKAAQMQNDSPAIERLGLPAYDWWNEALHGVARAG-- 106
Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTY 152
GAT FPQ I A+SF+ L + + +SDEARA ++ G GLT+
Sbjct: 107 --------GATVFPQAIGMAASFDVPLMDQVSAAISDEARAKHHDFLRKGEHGRYQGLTF 158
Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD---GDRL-----KVAASC 204
WSPN+NIFRDPRWGRGQET GEDP L+ + S+VRGLQG D G L K+ A+
Sbjct: 159 WSPNINIFRDPRWGRGQETYGEDPFLTTRMGVSFVRGLQGMDPQTGQPLDPKYRKLDATA 218
Query: 205 KHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
KHF + + DR F+ SKQD+ DT+ F V E V +VM +YN+V G
Sbjct: 219 KHFAVH---SGPEADRHTFDVHPSKQDLYDTYLPAFESLVKEADVYAVMGAYNRVYGESA 275
Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPF 324
+L T+R +W +GY++SDC ++ + TPEEAAA A++ G +L+CG
Sbjct: 276 SGSKFLLLDTLRRDWGFDGYVMSDCWAIVDIWKNHKIVETPEEAAALAVKNGTELNCGST 335
Query: 325 LGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVC-TPDHQ 383
H AV++GL+SE ++++AL +M LGMFD P Q P V + +H
Sbjct: 336 YADHLPVAVKKGLISEAELDDALTRLFVARMELGMFD--PPEQVRWAQVPYSVNQSAEHD 393
Query: 384 ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQ 443
LA + A++ +VLLKN G LPLS R +AV+GP +D T+ ++GNY G T L+
Sbjct: 394 ALARKMAQESLVLLKNDG-VLPLSKDIRR-LAVVGPTADDTMALLGNYYGTPADPVTILR 451
Query: 444 GIGRYA---RTIHQQGCKDVACADD 465
GI A ++ +G V DD
Sbjct: 452 GIREAAPGVDVVYARGVDLVEGRDD 476
Score = 127 bits (320), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 97/314 (30%), Positives = 154/314 (49%), Gaps = 55/314 (17%)
Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
F A++A+ ADA + V GL +E E + DR + LP QQ+L+ V
Sbjct: 640 FETALEAANSADAVVFVGGLTGDVEGEEMKVDYPGFAGGDRTDIRLPATQQKLLEAVHAT 699
Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
K P ++VL +G + + +A+ + + I+ A YPGQ GGTA+ + LFG NPGG+LP+T
Sbjct: 700 GK-PVVMVLTTGSALGIDWARRN--VPGILVAWYPGQRGGTAVGEALFGDYNPGGRLPVT 756
Query: 578 WYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTV 637
+Y + P + AM+ RTYR++ G ++PFGHG+SYT+F ++
Sbjct: 757 FYSADEKLP-PFDDYAMKE--------RTYRYFTGQPLFPFGHGLSYTSFGYS------- 800
Query: 638 VAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP--P 695
+ LD + +G +VT V V VKN G + G + ++ P P
Sbjct: 801 -GLKLDRKR-------AGAGDEVT----------VSVTVKNQGKRAGDEVVQLYLAPVKP 842
Query: 696 AGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTR-RIPLGEHNIHIGGT 754
A K+L F++VH+ G + V +I + L V D + R + G + + +G +
Sbjct: 843 QRERA-LKELRGFQRVHLQPGESRTVTFSIVPERDLRVYDEAAGRYTVDPGRYEVQVGAS 901
Query: 755 ----KHSVSLHAAT 764
+ SV L AT
Sbjct: 902 SADIRASVPLEVAT 915
>gi|15837447|ref|NP_298135.1| family 3 glycoside hydrolase [Xylella fastidiosa 9a5c]
gi|9105751|gb|AAF83655.1|AE003924_1 family 3 glycoside hydrolase [Xylella fastidiosa 9a5c]
Length = 882
Score = 293 bits (749), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 166/407 (40%), Positives = 236/407 (57%), Gaps = 32/407 (7%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
Q L+ +++LQEK+ ++ A A+PRLGI Y+WWSE LHG++ G A
Sbjct: 32 QHAAALVAKMTLQEKITQTMNAAPAIPRLGIPAYDWWSEGLHGIARNG----------YA 81
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--GGT-------AGLTYWSPNVNIFR 161
T FPQ I A+S+N L + +G V S EARA +N GG AGLT WSPN+NIFR
Sbjct: 82 TVFPQAIGLAASWNTDLLQHVGTVTSTEARAKFNLAGGPGKDHPRYAGLTLWSPNINIFR 141
Query: 162 DPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD-- 219
DPRWGRG ET GEDP L+G+ A S++RGLQG+ D + A+ KHF + +G +
Sbjct: 142 DPRWGRGMETYGEDPYLTGQLAVSFIRGLQGNIPDHPRTIATPKHFAVH-----SGPEPG 196
Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
R F+ VS D+E T+ FR +++G SVMC+YN ++G P CA +L +R +W
Sbjct: 197 RHSFDVDVSAYDLEATYTPAFRAAIVDGHAGSVMCAYNALHGTPACASDWLLNTRLRNDW 256
Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLL 338
NG++VSDCD++ F A+A A+++G DL+CG + L+ A+ RG +
Sbjct: 257 GFNGFVVSDCDAIDDMTRFHFFRQDNASASAAALKSGNDLNCGNTYRDLN--QAIARGDI 314
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
E ++ AL+ + RLG PY +G K + TP H+ LAL+AA Q +VLLK
Sbjct: 315 DEALLDQALIRLFAARQRLGTLQPR-EHDPYATIGIKHIDTPAHRALALQAAVQSLVLLK 373
Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
N G +LPL+ T+AV+GP++D + NY G + TPL G+
Sbjct: 374 NSGNTLPLT--PGTTLAVLGPDADSLTALEANYQGTSSTPVTPLTGL 418
Score = 132 bits (331), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 94/290 (32%), Positives = 139/290 (47%), Gaps = 48/290 (16%)
Query: 474 ASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTI 523
A ADA + +GL +E E L DR + LP Q+ L+ V K P I
Sbjct: 607 AVAHADAIVAFVGLSPEVEGEELHIDTPGFSGGDRTTIDLPATQETLLQHVKTTGK-PLI 665
Query: 524 LVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEY 583
+VLMSG + + +A++ AI+ A YPGQ+GGTAIA L G NPGG+LP+T+Y
Sbjct: 666 VVLMSGSAVALNWAQH--HANAILAAWYPGQSGGTAIAQALAGDVNPGGRLPVTFY--RS 721
Query: 584 ITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLD 643
+LP P S GRTYR++KG +YPFG+G+SYT F + T D
Sbjct: 722 TQDLP-------PYISYDMTGRTYRYFKGQPLYPFGYGLSYTQFTYEAPQLSTATLKAGD 774
Query: 644 GRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHK 703
TL V V+N G++ G + ++ PP AP +
Sbjct: 775 -------------------------TLTVTAHVRNTGTRAGDEVVQLYLEPPHSPQAPLR 809
Query: 704 QLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
LV F++V + G + + + + LS V ++G R + G +++ +GG
Sbjct: 810 NLVGFKRVTLRPGESRLLTFTLD-TRQLSSVQQTGQRSVEAGHYHLFVGG 858
>gi|385776908|ref|YP_005649476.1| glycoside hydrolase family protein [Sulfolobus islandicus REY15A]
gi|323475656|gb|ADX86262.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
REY15A]
Length = 754
Score = 293 bits (749), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 215/690 (31%), Positives = 347/690 (50%), Gaps = 98/690 (14%)
Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQ 169
+T+FPQ I AS++N L I V+ +AR + G L SP +++ +DPRWGR +
Sbjct: 101 STAFPQAIGLASTWNLELVMDIASVIRSQARLV--GVNQCL---SPVLDVCKDPRWGRCE 155
Query: 170 ETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL-DNWNGVDRFHFNAKVS 228
ET GEDP L +Y+ GLQG + ++ A+ KHF A+ + + + H V
Sbjct: 156 ETYGEDPYLVASMGLAYITGLQGDN----QLVATAKHFAAHGFPEGGRNIAQVH----VG 207
Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
+++ +TF PF + V GKV S+M +Y++++G+P +P +L +R EW +G +VSD
Sbjct: 208 NRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGIPCHGNPQLLTNILRQEWGFDGIVVSD 267
Query: 289 CDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE---SAVQRGLLSEIDINN 345
D + S EAA A+ +G+D++ P + ++E +A+ GL+ E I+
Sbjct: 268 YDGIRQLETIHRVASNKMEAAILALESGVDIEF-PTIDCYSEPLVNALTEGLVPESLIDR 326
Query: 346 ALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQ--ELALEAARQGIVLLKNQGPS 403
A+ L ++ RLG+ D P+ + DH+ ELAL+ AR+ IVLLKN+
Sbjct: 327 AVERVLRIKDRLGLLDN-----PFVNENSVPEKLDDHKSRELALKTARESIVLLKNENNI 381
Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYA-----GIACG--YTTPLQGIGRY---ARTIH 453
LPLS ++ +AVIGPN++ M+G+Y I G T LQG+ + ++ ++
Sbjct: 382 LPLSKNVNK-IAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQGVVKKVGESKVLY 440
Query: 454 QQGCKDVACADDQLFGAAIDASRQADATILVMGLD-------------------QSIEAE 494
+GC D+A + F AI+ +RQAD I VMG Q++ E
Sbjct: 441 AKGC-DIASESKEGFAEAIEIARQADVIIAVMGEKSGLPLSWTDIPSEEEFKKYQAVTGE 499
Query: 495 ALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQ 554
DR+ L LPG Q+EL+ ++ K P ILVL++G P+ ++ N + A+I A +PG+
Sbjct: 500 GNDRSSLRLPGVQEELLKELYKTGK-PIILVLINGRPLVLSPIIN--YVKAVIEAWFPGE 556
Query: 555 AGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPG--RTYRFYKG 612
GG AIAD++FG NP G+LP+T+ PM + P R P R Y +
Sbjct: 557 EGGNAIADVIFGDYNPSGRLPITF---------PM-DTGQIPLYYNRKPSSFRPYVMLRS 606
Query: 613 PVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGV 672
++ FG+G+SYT F ++ N ++ K I + +
Sbjct: 607 SPLFTFGYGLSYTQFEYS-------------------NLEVTPKEIGPNS------NIAI 641
Query: 673 QVDVKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVGINIHVCKYL 731
+DVKNVG +G + ++ + A P K+L F K+H+ G ++RV I + L
Sbjct: 642 SIDVKNVGKMEGDDVVQLYVSKTFSSVARPVKELKGFAKIHLKPGEKRRVKF-ILPTEAL 700
Query: 732 SVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
+ D + GE+ + IG + ++ L
Sbjct: 701 AFYDSFMRLVVEKGEYQLLIGNSSENIILR 730
>gi|431798021|ref|YP_007224925.1| beta-glucosidase-like glycosyl hydrolase [Echinicola vietnamensis
DSM 17526]
gi|430788786|gb|AGA78915.1| beta-glucosidase-like glycosyl hydrolase [Echinicola vietnamensis
DSM 17526]
Length = 906
Score = 292 bits (748), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 178/463 (38%), Positives = 258/463 (55%), Gaps = 37/463 (7%)
Query: 9 FLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKL 68
+LG+ LL +G+ + +D T F + +RV+ L+ ++SL+EKV
Sbjct: 25 YLGMGLLIYLMAGI-----MRVNAQDGTL-DFSFLDMEKNFEERVDILVDQMSLEEKVSQ 78
Query: 69 LISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLW 128
+++ + A+PRL + Y WW+E LHGV+ G AT FPQ I+ A+SF+ L
Sbjct: 79 MMNASPAIPRLKVPEYNWWNECLHGVARAGY----------ATVFPQSISVAASFDKNLM 128
Query: 129 EAIGRVVSDEARAMYN--------GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSG 180
+ IG V+SDEARA ++ G GL +WSPN+NIFRDPRWGRG ET GEDP L+G
Sbjct: 129 KDIGSVISDEARAKHHEFIRNGKRGIYTGLDFWSPNINIFRDPRWGRGHETYGEDPYLTG 188
Query: 181 KYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--RFHFNAKVSKQDIEDTFDV 238
+ A+ ++ GLQ SDG LK A+ KHF + +G + R F+ VS +D+ +T+
Sbjct: 189 ELASQFIEGLQDSDGKYLKTIATSKHFAVH-----SGPEPLRHTFDVDVSDRDLYETYLP 243
Query: 239 PFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDT 298
FR V E KV S+M +YN+ G +L + +R +W GY+VSDC ++ +
Sbjct: 244 AFRKTVKEAKVYSIMGAYNRFRGESCSGHDFLLNQLLREQWGFEGYVVSDCGAIQDIHTG 303
Query: 299 QHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLG 358
ST EAAA + G DL+CG + H AV GL+SE +I+ A+ + RLG
Sbjct: 304 HKIASTAAEAAAIGVSGGCDLNCGNYY-THLTEAVAEGLISEEEIDIAVKRLFLARFRLG 362
Query: 359 MFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIG 418
MFD E + Y + VC+ H LA +AA++ +VLLKNQ LPLS + + +AVIG
Sbjct: 363 MFDPEEAVS-YAQIPFGIVCSEAHNTLARQAAQKSMVLLKNQKNLLPLSVDKIKRIAVIG 421
Query: 419 PNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGC 457
PN+D +++GNY GI T L GI G A ++ +G
Sbjct: 422 PNADNVESLLGNYHGIPKKPVTFLDGIKHKVGPKAEVLYTEGV 464
Score = 146 bits (369), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 99/321 (30%), Positives = 160/321 (49%), Gaps = 52/321 (16%)
Query: 461 ACADDQLFGAAIDASRQADATILVMGLDQSIEAEALD----------RAGLLLPGRQQEL 510
A D A+ ++ AD ++V+GL Q +E E++D R + LP +Q+ L
Sbjct: 610 AMPDVSKIDEAVAMAKSADLAVVVLGLSQRLEGESMDVVTPGFDRGDRTAITLPAQQEAL 669
Query: 511 VSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNP 570
+ V K P ILVL +G + + +AK + + AII AGYPG+ GG A+AD++FG NP
Sbjct: 670 LKAVKETGK-PVILVLNAGSAMAINWAKEN--VDAIISAGYPGEEGGNALADVVFGDYNP 726
Query: 571 GGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHT 630
G+LP+T+Y + + +LP P + GRTYR+++G +YPFG+G+SYT F +
Sbjct: 727 AGRLPITYY--QSVEDLP-------PFEDYDMKGRTYRYFEGKPLYPFGYGLSYTRFSYK 777
Query: 631 VANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTL-L 689
P +NA G ++++ V V N+GS+ G + L
Sbjct: 778 DLEVP-----------AKVNA---GDPVQIS------------VTVTNIGSRAGDEVVQL 811
Query: 690 VFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNI 749
+ A P +QL F+++H+ G + V + + LS+++ R I G +I
Sbjct: 812 YLNDKEASTMRPIRQLEGFQRIHLKPGESKVVNFTLS-ARQLSMINGESKRVIEEGVFSI 870
Query: 750 HIGGTKHSV--SLHAATLGVI 768
H+GG + L A T V+
Sbjct: 871 HVGGEQPGFDGKLQAETTEVV 891
>gi|423294294|ref|ZP_17272421.1| hypothetical protein HMPREF1070_01086 [Bacteroides ovatus
CL03T12C18]
gi|392675485|gb|EIY68926.1| hypothetical protein HMPREF1070_01086 [Bacteroides ovatus
CL03T12C18]
Length = 861
Score = 292 bits (747), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 173/441 (39%), Positives = 246/441 (55%), Gaps = 34/441 (7%)
Query: 35 ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
A + LP+ SL QR DL+ RL+L+EKV L+ + + A+PRLGIK YEWW+EALHGV
Sbjct: 19 ACKQLLPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGV 78
Query: 95 SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA---MYNGGTA--- 148
G AT FPQ I +SFN +L + SDEAR ++ A
Sbjct: 79 GRAGL----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGALKR 128
Query: 149 --GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL-KVAASCK 205
GLT+W+PNVNIFRDPRWGRGQET GEDP L+G+ + VRGLQG + + K+ A K
Sbjct: 129 YQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDTKYDKLHACAK 188
Query: 206 HFTAYDLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
HF + WN R F+A+ + +D+ +T+ F+ V + V VMC+YN+ G P
Sbjct: 189 HFAVHSGPEWN---RHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPC 245
Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCG 322
C +L + +R EW G +VSDC ++ +Y + P E A+A A+R G DL+CG
Sbjct: 246 CGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPDKEHASAAAVRTGTDLECG 305
Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
AV+ GL+ E +I+ +L LT + LG D +P+ + + + + +H
Sbjct: 306 SEYA-SLADAVKAGLIDEKEIDISLKRLLTARFELGEMDEQPA---WSEIPASVLNSKEH 361
Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
Q LAL AR+ +VLL+N+ LPL+ H VAV+GPN++ +V GNY GI T L
Sbjct: 362 QALALRMARESLVLLQNKNNILPLN--THLKVAVMGPNANDSVMQWGNYNGIPAHTVTLL 419
Query: 443 QGIGRY---ARTIHQQGCKDV 460
+ + + I++ GC V
Sbjct: 420 EAVRAKLPEGQIIYEPGCDRV 440
Score = 115 bits (287), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 92/308 (29%), Positives = 134/308 (43%), Gaps = 49/308 (15%)
Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
A+ AD + G+ S+E E + DR + LP Q+ L+ + A K
Sbjct: 591 AVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRNLLKALKKAGKK 650
Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
+ + SG I AI+ A YPGQAGGTAI D L+G NPGG+LP+T+Y
Sbjct: 651 -VVFINYSGSAI--GLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707
Query: 581 QEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
+ LP + +M+ GRTYR+ + ++PFGHG+SYT F + A
Sbjct: 708 D--VNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTTFTYGEAKL----- 752
Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW 699
S N G+ + +T + V NVG +DG + V+ P
Sbjct: 753 --------SKNTIAKGENVVLT------------IPVSNVGQRDGEEVVQVYLRRPGDKE 792
Query: 700 APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVS 759
P L AF++VH+PAG + V I + + S T R G + + GGT
Sbjct: 793 GPRYTLRAFKRVHIPAGKTESVAIPLTGENFEWFDVESNTMRPLEGTYELLYGGTSDRNK 852
Query: 760 LHAATLGV 767
L + V
Sbjct: 853 LKTIVMNV 860
>gi|217967241|ref|YP_002352747.1| glycoside hydrolase family 3 [Dictyoglomus turgidum DSM 6724]
gi|217336340|gb|ACK42133.1| glycoside hydrolase family 3 domain protein [Dictyoglomus turgidum
DSM 6724]
Length = 762
Score = 292 bits (747), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 220/687 (32%), Positives = 340/687 (49%), Gaps = 90/687 (13%)
Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARA--MYNGGTAGLTYWSPNVNIFRDPRWG 166
GAT FPQ I AS+F L + V+ RA ++ G SP ++I RDPRWG
Sbjct: 106 GATVFPQAIGMASTFEPELIRRVSDVIRQHMRAANVHQG-------LSPVLDIPRDPRWG 158
Query: 167 RGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAK 226
R +ET GEDP L + AA YV+GLQG D R + A+ KHFTAY + R AK
Sbjct: 159 RTEETFGEDPYLVSRMAAEYVKGLQGEDW-REGIIATVKHFTAYGISEGA---RNLGPAK 214
Query: 227 VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIV 286
V ++++ + F PF + + EG+ S+M +Y++++GVP + +L + +R EW GY+V
Sbjct: 215 VGERELREVFLFPFEVAIKEGQAGSLMNAYHEIDGVPCASSKFLLTKILRWEWGFKGYVV 274
Query: 287 SDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE---SAVQRGLLSEIDI 343
SD ++ + + +EAA A+ AG+D++ P + + E AV+ GL+SE I
Sbjct: 275 SDYIAIRMLENFHRVAKDAKEAAVLALEAGIDIEL-PSVDCYGEPLIQAVKEGLISEEVI 333
Query: 344 NNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
N ++ L + LG+FDG+ P D P+ +EL+ E AR+ IVLLKN G
Sbjct: 334 NASVERVLRAKFMLGLFDGDLEKDPKKVYDIFD--KPEFRELSREVARRSIVLLKNDG-I 390
Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIA-----------------CGYTTP--LQG 444
LPLS RTVAVIGPN+D + G+Y+ A C T L+G
Sbjct: 391 LPLSK-NIRTVAVIGPNADNPRNLHGDYSYTAHIPSVSETLEGVKIPEECAVRTVSILEG 449
Query: 445 IGRYA----RTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLD-----QSIEAEA 495
I + ++ +GC ++ + F AI+ +++AD I VMG + + I E
Sbjct: 450 IKNKVSAETQVLYAKGC-EILSDSKEGFDEAIEIAKRADVIIAVMGEESGLFHRGISGEG 508
Query: 496 LDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQA 555
DR L L G Q++L+ ++ K P +LVL++G P + + + + AI+ A YPG+
Sbjct: 509 NDRTTLELFGIQRDLLRELHKLGK-PIVLVLVNGRPQALKWEHEN--LNAILEAWYPGEE 565
Query: 556 GGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVV 615
GG A+AD++FG NP GKLP++ +P +T +PS Y + + +
Sbjct: 566 GGDAVADVIFGDYNPSGKLPIS-FPA--VTGQVPVYYNRKPSAFTDYVEESAK-----PL 617
Query: 616 YPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRL-TLGVQV 674
YPFGHG+SYT F ++ +K+ K N L + +
Sbjct: 618 YPFGHGLSYTTFEYS--------------------------NLKIHPEKVNALEKVEISF 651
Query: 675 DVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSV 733
+KN G ++G + L A P K+L F+K+H+ G +RV ++ + L+
Sbjct: 652 TIKNTGVREGEEVVQLYVHDQVASLERPVKELKGFKKIHLKPGESKRVTFILY-PEQLAF 710
Query: 734 VDRSGTRRIPLGEHNIHIGGTKHSVSL 760
D + G I IG + + L
Sbjct: 711 YDEFMRFVVEKGIFEIMIGSSSEDIRL 737
>gi|227831319|ref|YP_002833099.1| glycoside hydrolase family protein [Sulfolobus islandicus L.S.2.15]
gi|227457767|gb|ACP36454.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
L.S.2.15]
Length = 754
Score = 291 bits (746), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 215/690 (31%), Positives = 348/690 (50%), Gaps = 98/690 (14%)
Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQ 169
+T+FPQ I AS++N L I V+ + R + G L SP +++ +DPRWGR +
Sbjct: 101 STAFPQAIGLASTWNPELVMDIASVIRSQGRLV--GVNQCL---SPVLDVCKDPRWGRCE 155
Query: 170 ETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL-DNWNGVDRFHFNAKVS 228
ET GEDP L +Y+ GLQG + ++ A+ KHF A+ + + + H V
Sbjct: 156 ETYGEDPYLVASMGLAYITGLQGDN----QLVATAKHFAAHGFPEGGRNIAQVH----VG 207
Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
+++ +TF PF + V GKV S+M +Y++++G+P +P +L +R EW +G +VSD
Sbjct: 208 NRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGIPCHGNPQLLTNILRQEWGFDGIVVSD 267
Query: 289 CDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE---SAVQRGLLSEIDINN 345
D + S EAA A+ +G+D++ P + + E +A++ GL+ E I+
Sbjct: 268 YDGIRQLETIHRVASNKMEAAILALESGVDIEF-PTIDCYGEPLVNALKEGLVPESLIDR 326
Query: 346 ALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQ--ELALEAARQGIVLLKNQGPS 403
A+ L ++ RLG+ D P+ + DH+ ELAL+ AR+ IVLLKN+
Sbjct: 327 AVERVLRIKDRLGLLD-----NPFVNENSVPEKLDDHKSRELALKTARESIVLLKNENNI 381
Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYA-----GIACG--YTTPLQGIGRY---ARTIH 453
LPLS ++ +AVIGPN++ M+G+Y I G T LQGI + ++ ++
Sbjct: 382 LPLSKNVNK-IAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQGIVKKVGESKVLY 440
Query: 454 QQGCKDVACADDQLFGAAIDASRQADATILVMG---------LD----------QSIEAE 494
+GC D+A + F AI+ +RQAD I +MG +D Q++ E
Sbjct: 441 AKGC-DIASESKEGFAEAIEIARQADVIIAIMGEKSGLPLSWMDIPSKEEFKKYQAVTGE 499
Query: 495 ALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQ 554
DR+ L LPG Q+EL+ ++ K P ILVL++G P+ ++ N + A+I A +PG+
Sbjct: 500 GNDRSSLRLPGVQEELLKELYKTGK-PIILVLINGRPLVLSSIIN--YVKAVIEAWFPGE 556
Query: 555 AGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPG--RTYRFYKG 612
GG AIAD++FG NP G+LP+T+ PM + P R P R Y +
Sbjct: 557 EGGNAIADVIFGDYNPSGRLPITF---------PM-DTGQIPLYYNRKPSSFRPYVMLRS 606
Query: 613 PVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGV 672
++ FG+G+SYT F ++ N ++ K I + +
Sbjct: 607 SPLFTFGYGLSYTQFEYS-------------------NLEVTPKEIGPNS------NIAI 641
Query: 673 QVDVKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVGINIHVCKYL 731
+DVKNVG +G + ++ + A P K+L F K+H+ G ++RV I + L
Sbjct: 642 SIDVKNVGKMEGDDVVQLYVSKTFSSVARPVKELKGFAKIHLKPGEKRRVKF-ILPTEAL 700
Query: 732 SVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
+ D + GE+ + IG + ++ L
Sbjct: 701 AFYDSFMRLVVEKGEYQLLIGNSSENIILR 730
>gi|237719778|ref|ZP_04550259.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
gi|229451047|gb|EEO56838.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
Length = 861
Score = 291 bits (746), Expect = 7e-76, Method: Compositional matrix adjust.
Identities = 173/441 (39%), Positives = 243/441 (55%), Gaps = 34/441 (7%)
Query: 35 ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
A + LP+ SL QR DL+ RL+L+EKV L+ + + A+PRLGIK YEWW+EALHGV
Sbjct: 19 ACKQLLPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGV 78
Query: 95 SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GG 146
G AT FPQ I +SFN +L + SDEAR
Sbjct: 79 GRAGL----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKR 128
Query: 147 TAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL-KVAASCK 205
GLT+W+PNVNIFRDPRWGRGQET GEDP L+G+ + VRGLQG + R K+ A K
Sbjct: 129 YQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLHACAK 188
Query: 206 HFTAYDLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
HF + WN R F+A+ + +D+ +T+ F+ V + V VMC+YN+ G P
Sbjct: 189 HFAVHSGPEWN---RHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPC 245
Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCG 322
C +L + +R EW G +VSDC ++ +Y + P E A+A A+RAG DL+CG
Sbjct: 246 CGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHETYPDKEHASAGAVRAGTDLECG 305
Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
AV+ GL+ E +I+ +L LT + LG D + + + + + + +H
Sbjct: 306 SEYA-SLADAVKAGLIDEKEIDISLKRLLTARFELGEMDEQSA---WSEIPTSVLNSKEH 361
Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
Q LAL AR+ +VLL+N+ LPL+ H VAV+GPN++ +V GNY GI T L
Sbjct: 362 QALALRMARESLVLLQNKNNILPLN--THLKVAVMGPNANDSVMQWGNYNGIPAHTVTLL 419
Query: 443 QGIGRY---ARTIHQQGCKDV 460
+ + + I++ GC V
Sbjct: 420 EAVRAKLPEGQIIYEPGCDRV 440
Score = 115 bits (287), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 92/309 (29%), Positives = 136/309 (44%), Gaps = 51/309 (16%)
Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
A+ AD + G+ S+E E + DR + LP Q++L+ + A K
Sbjct: 591 AVKKVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGKK 650
Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
+ + SG I AI+ A YPGQAGGTAI D L+G NPGG+LP+T+Y
Sbjct: 651 -VVFINYSGSAI--GLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707
Query: 581 QEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
+ LP + +M+ GRTYR+ + ++PFGHG+SYT+F + A
Sbjct: 708 D--VNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTDFTYGEAKL----- 752
Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW 699
S N G+ + +T + V NVG +DG + V+ P
Sbjct: 753 --------SKNTIAKGENVVLT------------IPVSNVGQRDGEEVVQVYLRRPGDKE 792
Query: 700 APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPL-GEHNIHIGGTKHSV 758
P L AF++VH+PAG + V I + + D PL G + + GGT
Sbjct: 793 GPRYTLRAFKRVHIPAGKTESVAIPL-TGENFEWFDVESNTMCPLEGTYELLYGGTSDRN 851
Query: 759 SLHAATLGV 767
L + V
Sbjct: 852 KLKTIVMNV 860
>gi|284998833|ref|YP_003420601.1| glycoside hydrolase family protein [Sulfolobus islandicus L.D.8.5]
gi|284446729|gb|ADB88231.1| glycoside hydrolase, family 3 domain protein [Sulfolobus islandicus
L.D.8.5]
Length = 754
Score = 291 bits (746), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 215/690 (31%), Positives = 348/690 (50%), Gaps = 98/690 (14%)
Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQ 169
+T+FPQ I AS++N L I V+ + R + G L SP +++ +DPRWGR +
Sbjct: 101 STAFPQAIGLASTWNPELVMDIASVIRSQGRLV--GVNQCL---SPVLDVCKDPRWGRCE 155
Query: 170 ETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL-DNWNGVDRFHFNAKVS 228
ET GEDP L +Y+ GLQG + ++ A+ KHF A+ + + + H V
Sbjct: 156 ETYGEDPYLVASMGLAYITGLQGDN----QLVATAKHFAAHGFPEGGRNIAQVH----VG 207
Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
+++ +TF PF + V GKV S+M +Y++++G+P +P +L +R EW +G +VSD
Sbjct: 208 NRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGIPCHGNPQLLTNILRQEWGFDGIVVSD 267
Query: 289 CDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE---SAVQRGLLSEIDINN 345
D + S EAA A+ +G+D++ P + + E +A++ GL+ E I+
Sbjct: 268 YDGIRQLETIHRVASNKMEAAILALESGVDIEF-PTIDCYGEPLVNALKEGLVPESLIDR 326
Query: 346 ALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQ--ELALEAARQGIVLLKNQGPS 403
A+ L ++ RLG+ D P+ + DH+ ELAL+ AR+ IVLLKN+
Sbjct: 327 AVERVLRIKDRLGLLD-----NPFVNENSVPEKLDDHKSRELALKTARESIVLLKNENNI 381
Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYA-----GIACG--YTTPLQGIGRY---ARTIH 453
LPLS ++ +AVIGPN++ M+G+Y I G T LQGI + ++ ++
Sbjct: 382 LPLSKNVNK-IAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQGIVKKVGESKVLY 440
Query: 454 QQGCKDVACADDQLFGAAIDASRQADATILVMG---------LD----------QSIEAE 494
+GC D+A + F AI+ +RQAD I +MG +D Q++ E
Sbjct: 441 AKGC-DIASESKEGFAEAIEIARQADVIIAIMGEKSGLPLSWMDIPSEEEFKKYQAVTGE 499
Query: 495 ALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQ 554
DR+ L LPG Q+EL+ ++ K P ILVL++G P+ ++ N + A+I A +PG+
Sbjct: 500 GNDRSSLRLPGVQEELLKELYKTGK-PIILVLINGRPLVLSSIIN--YVKAVIEAWFPGE 556
Query: 555 AGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPG--RTYRFYKG 612
GG AIAD++FG NP G+LP+T+ PM + P R P R Y +
Sbjct: 557 EGGNAIADVIFGDYNPSGRLPITF---------PM-DTGQIPLYYNRKPSSFRPYVMLRS 606
Query: 613 PVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGV 672
++ FG+G+SYT F ++ N ++ K I + +
Sbjct: 607 SPLFTFGYGLSYTQFEYS-------------------NLEVTPKEIGPNS------NIAI 641
Query: 673 QVDVKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVGINIHVCKYL 731
+DVKNVG +G + ++ + A P K+L F K+H+ G ++RV I + L
Sbjct: 642 SIDVKNVGKMEGDDVVQLYVSKTFSSVARPVKELKGFAKIHLKPGEKRRVKF-ILPTEAL 700
Query: 732 SVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
+ D + GE+ + IG + ++ L
Sbjct: 701 AFYDSFMRLVVEKGEYQLLIGNSSENIILR 730
>gi|295086418|emb|CBK67941.1| Beta-glucosidase-related glycosidases [Bacteroides xylanisolvens
XB1A]
Length = 861
Score = 291 bits (745), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 173/441 (39%), Positives = 243/441 (55%), Gaps = 34/441 (7%)
Query: 35 ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
A + LP+ SL QR DL+ RL+L+EKV L+ + + A+PRLGIK YEWW+EALHGV
Sbjct: 19 ACKQLLPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGV 78
Query: 95 SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GG 146
G AT FPQ I +SFN +L + SDEAR
Sbjct: 79 GRAGL----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKR 128
Query: 147 TAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL-KVAASCK 205
GLT+W+PNVNIFRDPRWGRGQET GEDP L+G+ + VRGLQG + R K+ A K
Sbjct: 129 YQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLHACAK 188
Query: 206 HFTAYDLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
HF + WN R F+A+ + +D+ +T+ F+ V + V VMC+YN+ G P
Sbjct: 189 HFAVHSGPEWN---RHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPC 245
Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCG 322
C +L + +R EW G +VSDC ++ +Y + P E A+A A+RAG DL+CG
Sbjct: 246 CGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHETHPDKEHASAGAVRAGTDLECG 305
Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
AV+ GL+ E +I+ +L LT + LG D + + + + + + +H
Sbjct: 306 SEYA-SLADAVKAGLIDEKEIDISLKRLLTARFELGEMDEQSA---WSEIPTSVLNSKEH 361
Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
Q LAL AR+ +VLL+N+ LPL+ H VAV+GPN++ +V GNY GI T L
Sbjct: 362 QALALRMARESLVLLQNKNNILPLN--THLKVAVMGPNANDSVMQWGNYNGIPAHTVTLL 419
Query: 443 QGIGRY---ARTIHQQGCKDV 460
+ + + I++ GC V
Sbjct: 420 EAVRAKLPEGQIIYEPGCDRV 440
Score = 114 bits (286), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 92/309 (29%), Positives = 136/309 (44%), Gaps = 51/309 (16%)
Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
A+ AD + G+ S+E E + DR + LP Q++L+ + A K
Sbjct: 591 AVKKVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGKK 650
Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
+ + SG I AI+ A YPGQAGGTAI D L+G NPGG+LP+T+Y
Sbjct: 651 -VVFINYSGSAI--GLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707
Query: 581 QEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
+ LP + +M+ GRTYR+ + ++PFGHG+SYT+F + A
Sbjct: 708 D--VNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTDFTYGEAKL----- 752
Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW 699
S N G+ + +T + V NVG +DG + V+ P
Sbjct: 753 --------SKNTIAKGENVVLT------------IPVSNVGQRDGEEVVQVYLRRPGDKE 792
Query: 700 APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPL-GEHNIHIGGTKHSV 758
P L AF++VH+PAG + V I + + D PL G + + GGT
Sbjct: 793 GPRYTLRAFKRVHIPAGKTESVAIPL-TGENFEWFDVESNTMCPLEGTYELLYGGTSDRN 851
Query: 759 SLHAATLGV 767
L + V
Sbjct: 852 KLKTIVMNV 860
>gi|262405256|ref|ZP_06081806.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|294644754|ref|ZP_06722499.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CC 2a]
gi|294810589|ref|ZP_06769241.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
xylanisolvens SD CC 1b]
gi|345508031|ref|ZP_08787672.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D1]
gi|229444722|gb|EEO50513.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D1]
gi|262356131|gb|EEZ05221.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|292639876|gb|EFF58149.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CC 2a]
gi|294442250|gb|EFG11065.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
xylanisolvens SD CC 1b]
Length = 861
Score = 291 bits (745), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 173/441 (39%), Positives = 243/441 (55%), Gaps = 34/441 (7%)
Query: 35 ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
A + LP+ SL QR DL+ RL+L+EKV L+ + + A+PRLGIK YEWW+EALHGV
Sbjct: 19 ACKQLLPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGV 78
Query: 95 SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GG 146
G AT FPQ I +SFN +L + SDEAR
Sbjct: 79 GRAGL----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKR 128
Query: 147 TAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL-KVAASCK 205
GLT+W+PNVNIFRDPRWGRGQET GEDP L+G+ + VRGLQG + R K+ A K
Sbjct: 129 YQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLHACAK 188
Query: 206 HFTAYDLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
HF + WN R F+A+ + +D+ +T+ F+ V + V VMC+YN+ G P
Sbjct: 189 HFAVHSGPEWN---RHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPC 245
Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCG 322
C +L + +R EW G +VSDC ++ +Y + P E A+A A+RAG DL+CG
Sbjct: 246 CGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPDKEHASAGAVRAGTDLECG 305
Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
AV+ GL+ E +I+ +L LT + LG D + + + + + + +H
Sbjct: 306 SEYA-SLADAVKAGLIDEKEIDISLKRLLTARFELGEMDEQSA---WSEIPTSVLNSKEH 361
Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
Q LAL AR+ +VLL+N+ LPL+ H VAV+GPN++ +V GNY GI T L
Sbjct: 362 QALALRMARESLVLLQNKNNILPLN--THLKVAVMGPNANDSVMQWGNYNGIPAHTVTLL 419
Query: 443 QGIGRY---ARTIHQQGCKDV 460
+ + + I++ GC V
Sbjct: 420 EAVRAKLPEGQIIYEPGCDRV 440
Score = 115 bits (287), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 90/308 (29%), Positives = 137/308 (44%), Gaps = 49/308 (15%)
Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
A+ AD + G+ S+E E + DR + LP Q++L+ + A K
Sbjct: 591 AVKKVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGKK 650
Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
+ + SG I AI+ A YPGQAGGTAI D L+G NPGG+LP+T+Y
Sbjct: 651 -VVFINYSGSAI--GLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707
Query: 581 QEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
+ LP + +M+ GRTYR+ + ++PFGHG+SYT F + A
Sbjct: 708 D--VNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTTFTYGEAKL----- 752
Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW 699
S N G+ + +T + V NVG +DG + V+ P
Sbjct: 753 --------SKNTIAKGENVVLT------------IPVSNVGQRDGEEVVQVYLRRPGDKE 792
Query: 700 APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVS 759
P L AF++VH+PAG + V I++ + + + T + + GGT
Sbjct: 793 GPRYTLRAFKRVHIPAGKTESVAISLTHESFEWFDEATNTMHPVADTYELLYGGTSEQNQ 852
Query: 760 LHAATLGV 767
L + T+ V
Sbjct: 853 LKSVTVHV 860
>gi|336415363|ref|ZP_08595703.1| hypothetical protein HMPREF1017_02811 [Bacteroides ovatus
3_8_47FAA]
gi|335940959|gb|EGN02821.1| hypothetical protein HMPREF1017_02811 [Bacteroides ovatus
3_8_47FAA]
Length = 861
Score = 291 bits (745), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 173/441 (39%), Positives = 243/441 (55%), Gaps = 34/441 (7%)
Query: 35 ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
A + LP+ SL QR DL+ RL+L+EKV L+ + + A+PRLGIK YEWW+EALHGV
Sbjct: 19 ACKQLLPYQDTSLAAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGV 78
Query: 95 SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GG 146
G AT FPQ I +SFN +L + SDEAR
Sbjct: 79 GRAGL----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKR 128
Query: 147 TAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL-KVAASCK 205
GLT+W+PNVNIFRDPRWGRGQET GEDP L+G+ + VRGLQG + R K+ A K
Sbjct: 129 YQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLHACAK 188
Query: 206 HFTAYDLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
HF + WN R F+A+ + +D+ +T+ F+ V + V VMC+YN+ G P
Sbjct: 189 HFAVHSGPEWN---RHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPC 245
Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCG 322
C +L + +R EW G +VSDC ++ +Y + P E A+A A+RAG DL+CG
Sbjct: 246 CGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPDKEHASAGAVRAGTDLECG 305
Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
AV+ GL+ E +I+ +L LT + LG D + + + + + + +H
Sbjct: 306 SEYA-SLADAVKAGLIDEKEIDISLKRLLTARFELGEMDEQSA---WSEIPTSVLNSKEH 361
Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
Q LAL AR+ +VLL+N+ LPL+ H VAV+GPN++ +V GNY GI T L
Sbjct: 362 QALALRMARESLVLLQNKNNILPLN--THLKVAVMGPNANDSVMQWGNYNGIPAHTVTLL 419
Query: 443 QGIGRY---ARTIHQQGCKDV 460
+ + + I++ GC V
Sbjct: 420 EAVRAKLPEGQIIYEPGCDRV 440
Score = 123 bits (309), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 90/301 (29%), Positives = 134/301 (44%), Gaps = 49/301 (16%)
Query: 478 ADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLM 527
AD + G+ S+E E + DR + LP Q++L+ + K + +
Sbjct: 598 ADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKVGK-KVVFINY 656
Query: 528 SGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL 587
SG I AI+ A YPGQAGGTAI D L+G NPGG+LP+T+Y + L
Sbjct: 657 SGSAI--GLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYKD--VNQL 712
Query: 588 P-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
P + +M+ GRTYR+ + ++PFGHG+SYT+F + A
Sbjct: 713 PDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTDFTYGEAKL------------ 752
Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLV 706
S N G+ + +T + V NVG +DG + V+ P P L
Sbjct: 753 -SKNTIAKGENVVLT------------IPVSNVGQRDGEEVVQVYLRRPGDKEGPRYTLR 799
Query: 707 AFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLG 766
AF++VH+PAG + V I++ + S T R G + + GGT L +
Sbjct: 800 AFKRVHIPAGKTESVAISLTGENFEWFDVESNTMRPLEGTYELLYGGTSDRNKLKTIVMN 859
Query: 767 V 767
V
Sbjct: 860 V 860
>gi|160885419|ref|ZP_02066422.1| hypothetical protein BACOVA_03419 [Bacteroides ovatus ATCC 8483]
gi|156109041|gb|EDO10786.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus ATCC 8483]
Length = 861
Score = 291 bits (744), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 174/441 (39%), Positives = 245/441 (55%), Gaps = 34/441 (7%)
Query: 35 ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
A + LP+ SL QR DL+ RL+L+EKV L+ + + A+PRLGIK YEWW+EALHGV
Sbjct: 19 ACKQLLPYQDTSLAAEQRTEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGV 78
Query: 95 SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY----NGGT--- 147
G AT FPQ I +SFN +L + SDEAR + G
Sbjct: 79 GRAGL----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGDSGVLKR 128
Query: 148 -AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL-KVAASCK 205
GLT+W+PNVNIFRDPRWGRGQET GEDP L+G+ + VRGLQG + R K+ A K
Sbjct: 129 YQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLHACAK 188
Query: 206 HFTAYDLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
HF + WN R F+A+ + +D+ +T+ F+ V + V VMC+YN+ G P
Sbjct: 189 HFAVHSGPEWN---RHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPC 245
Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCG 322
C +L + +R EW G +VSDC ++ +Y + P E A+A A+RAG DL+CG
Sbjct: 246 CGSNRLLMQILRDEWGYEGIVVSDCGAISDFYRPGTHGTHPDKEHASAGAVRAGTDLECG 305
Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
AV+ GL+ E +I+ +L LT + LG D + + + + + + +H
Sbjct: 306 SEYA-SLADAVKAGLIDEKEIDISLKRLLTARFELGEMDEQSA---WSEIPTSVLNSKEH 361
Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
Q LAL AR+ +VLL+N+ LPL+ H VAV+GPN++ +V GNY GI T L
Sbjct: 362 QALALRMARESLVLLQNKNNILPLN--THLKVAVMGPNANDSVMQWGNYNGIPAHTVTLL 419
Query: 443 QGIGRY---ARTIHQQGCKDV 460
+ + + I++ GC V
Sbjct: 420 EAVRAKLPEGQIIYEPGCDRV 440
Score = 114 bits (284), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 91/302 (30%), Positives = 134/302 (44%), Gaps = 51/302 (16%)
Query: 478 ADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLM 527
AD + G+ S+E E + DR + LP Q++L+ + A K + +
Sbjct: 598 ADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGKK-VVFINY 656
Query: 528 SGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL 587
SG I AI+ A YPGQAGGTAI D L+G NPGG+LP+T+Y + L
Sbjct: 657 SGSAI--GLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYKD--VNQL 712
Query: 588 P-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
P + +M+ GRTYR+ + ++PFGHG+SYT+F + A
Sbjct: 713 PDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTDFTYGEAKL------------ 752
Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLV 706
S N G+ + +T + V NVG +DG + V+ P P L
Sbjct: 753 -SKNTIAKGENVVLT------------IPVSNVGQRDGEEVVQVYLRRPGDKEGPRYTLR 799
Query: 707 AFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPL-GEHNIHIGGTKHSVSLHAATL 765
AF++VH+PAG + V I + + D PL G + + GGT L +
Sbjct: 800 AFKRVHIPAGKTESVAIPL-TGENFEWFDVESNTMCPLEGTYELLYGGTSDRNKLKTIVM 858
Query: 766 GV 767
V
Sbjct: 859 NV 860
>gi|15899739|ref|NP_344344.1| Beta-xylosidase [Sulfolobus solfataricus P2]
gi|13816430|gb|AAK43134.1| Beta-xylosidase [Sulfolobus solfataricus P2]
Length = 754
Score = 290 bits (743), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 218/691 (31%), Positives = 347/691 (50%), Gaps = 100/691 (14%)
Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQ 169
+T+FPQ I AS++N L + + + R + G L SP +++ RDPRWGR +
Sbjct: 101 STAFPQAIGLASTWNPELLTNVASTIRSQGRLI--GVNQCL---SPVLDVCRDPRWGRCE 155
Query: 170 ETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL-DNWNGVDRFHFNAKVS 228
ET GEDP L +Y+ GLQG ++ A+ KHF A+ + + + H V
Sbjct: 156 ETYGEDPYLVASMGLAYITGLQGET----QLVATAKHFAAHGFPEGGRNIAQVH----VG 207
Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
+++ +TF PF + V GKV S+M +Y++++GVP +P +L +R EW +G +VSD
Sbjct: 208 NRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGVPCHGNPQLLTNILRQEWGFDGIVVSD 267
Query: 289 CDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE---SAVQRGLLSEIDINN 345
D + S EAA A+ +G+D++ P + + E +A++ GL+SE I+
Sbjct: 268 YDGIRQLEAIHKVASNKMEAAILALESGVDIEF-PTIDCYGEPLVTAIKEGLVSEAIIDR 326
Query: 346 ALVNTLTVQMRLGMFDGEPSSQPYGHLG--PKDVCTPDHQELALEAARQGIVLLKNQGPS 403
A+ L ++ RLG+ D P+ P+ + +ELAL+AAR+ IVLLKN+
Sbjct: 327 AVERVLRIKERLGLLD-----NPFVDESAVPERLDDRKSRELALKAARESIVLLKNENNM 381
Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYA-----GIACG--YTTPLQGIGRYA---RTIH 453
LPLS ++ +AVIGPN++ M+G+Y I G T LQGI + + ++
Sbjct: 382 LPLSKNINK-IAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQGIAKKVGEGKVLY 440
Query: 454 QQGCKDVACADDQLFGAAIDASRQADATILVMGLD-------------------QSIEAE 494
+GC D+A + F AI+ ++QAD I VMG Q++ E
Sbjct: 441 AKGC-DIAGESKEGFSEAIEIAKQADVIIAVMGEKSGLPLSWTDIPSEEEFKKYQAVTGE 499
Query: 495 ALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQ 554
DRA L L G Q+EL+ ++ K P ILVL++G P+ ++ N + AII A +PG+
Sbjct: 500 GNDRASLRLLGVQEELLKELYKTGK-PIILVLINGRPLVLSPIIN--YVKAIIEAWFPGE 556
Query: 555 AGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPG--RTYRFYKG 612
GG AIADI+FG NP G+LP+T+ PM + P R P R Y
Sbjct: 557 EGGNAIADIIFGDYNPSGRLPITF---------PM-DTGQIPLYYSRKPSSFRPYVMLHS 606
Query: 613 PVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT-LG 671
++ FG+G+SYT F ++ ++VT + L+ +
Sbjct: 607 SPLFTFGYGLSYTQFEYS--------------------------NLEVTPKEVGPLSYIT 640
Query: 672 VQVDVKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVGINIHVCKY 730
+ +DVKNVG+ +G + ++ + A P K+L F KVH+ G ++RV + + +
Sbjct: 641 ILLDVKNVGNMEGDEVVQLYISKSFSSVARPVKELKGFAKVHLKPGEKRRVKFALPM-EA 699
Query: 731 LSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
L+ D + GE+ I IG + ++ L
Sbjct: 700 LAFYDNFMRLVVEKGEYQILIGNSSENIILK 730
>gi|284174578|ref|ZP_06388547.1| Beta-xylosidase [Sulfolobus solfataricus 98/2]
gi|356934752|gb|AET42953.1| beta-xylosidase-like protein [Sulfolobus solfataricus 98/2]
Length = 754
Score = 290 bits (742), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 218/691 (31%), Positives = 347/691 (50%), Gaps = 100/691 (14%)
Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQ 169
+T+FPQ I AS++N L + + + R + G L SP +++ RDPRWGR +
Sbjct: 101 STAFPQAIGLASTWNPELLTNVASTIRSQGRLI--GVNQCL---SPVLDVCRDPRWGRCE 155
Query: 170 ETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL-DNWNGVDRFHFNAKVS 228
ET GEDP L +Y+ GLQG ++ A+ KHF A+ + + + H V
Sbjct: 156 ETYGEDPYLVASMGLAYITGLQGET----QLVATAKHFAAHGFPEGGRNIAQVH----VG 207
Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
+++ +TF PF + V GKV S+M +Y++++GVP +P +L +R EW +G +VSD
Sbjct: 208 NRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGVPCHGNPQLLTNILRQEWGFDGIVVSD 267
Query: 289 CDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE---SAVQRGLLSEIDINN 345
D + S EAA A+ +G+D++ P + + E +A++ GL+SE I+
Sbjct: 268 YDGIRQLEAIHKVASNKMEAAILALESGVDIEF-PTIDCYGEPLVTAIKEGLVSEAIIDR 326
Query: 346 ALVNTLTVQMRLGMFDGEPSSQPYGHLG--PKDVCTPDHQELALEAARQGIVLLKNQGPS 403
A+ L ++ RLG+ D P+ P+ + +ELAL+AAR+ IVLLKN+
Sbjct: 327 AVERVLRIKERLGLLD-----NPFVDESAVPERLDDRKSRELALKAARESIVLLKNENNM 381
Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYA-----GIACG--YTTPLQGIGRYA---RTIH 453
LPLS ++ +AVIGPN++ M+G+Y I G T LQGI + + ++
Sbjct: 382 LPLSKNINK-IAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQGIAKKVGEGKVLY 440
Query: 454 QQGCKDVACADDQLFGAAIDASRQADATILVMGLD-------------------QSIEAE 494
+GC D+A + F AI+ ++QAD I VMG Q++ E
Sbjct: 441 AKGC-DIAGESKEGFSEAIEIAKQADVIIAVMGEKSGLPLSWTDIPSEEEFKKYQAVTGE 499
Query: 495 ALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQ 554
DRA L L G Q+EL+ ++ K P ILVL++G P+ ++ N + AII A +PG+
Sbjct: 500 GNDRASLRLLGVQEELLKELYKTGK-PIILVLINGRPLVLSPIIN--YVKAIIEAWFPGE 556
Query: 555 AGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPG--RTYRFYKG 612
GG AIADI+FG NP G+LP+T+ PM + P R P R Y
Sbjct: 557 EGGNAIADIIFGDYNPSGRLPITF---------PM-DTGQIPLYYSRKPSSFRPYVMLHS 606
Query: 613 PVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT-LG 671
++ FG+G+SYT F ++ ++VT + L+ +
Sbjct: 607 SPLFTFGYGLSYTQFEYS--------------------------NLEVTPKEVGPLSYIT 640
Query: 672 VQVDVKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVGINIHVCKY 730
+ +DVKNVG+ +G + ++ + A P K+L F KVH+ G ++RV + + +
Sbjct: 641 ILLDVKNVGNMEGDEVVQLYISKSFSSVARPVKELKGFAKVHLKPGEKRRVKFALPM-EA 699
Query: 731 LSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
L+ D + GE+ I IG + ++ L
Sbjct: 700 LAFYDNFMRLVVEKGEYQILIGNSSENIILK 730
>gi|399029285|ref|ZP_10730258.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
gi|398072895|gb|EJL64089.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
Length = 871
Score = 290 bits (741), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 174/419 (41%), Positives = 238/419 (56%), Gaps = 31/419 (7%)
Query: 38 RTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNV 97
F +L QRV+DL+ R+S+ EK+ L+ + A+ RLG+ Y WW+E+LHGV+
Sbjct: 22 ENFAFKNPNLTTEQRVDDLVSRMSIDEKISQLMDSSPAIERLGVPEYNWWNESLHGVARA 81
Query: 98 GPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAG 149
G AT FPQ I+ ASS++ L + V+SDEARA ++ G G
Sbjct: 82 G----------YATVFPQSISIASSWDRQLIFDVANVISDEARAKHHEYLRRGQHGMYQG 131
Query: 150 LTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTA 209
LT+WSPNVNIFRDPRWGRG ET GEDP L+G+ YV GLQG++ LKV A+ KH+
Sbjct: 132 LTFWSPNVNIFRDPRWGRGHETYGEDPFLTGQLGLKYVNGLQGTNEKYLKVIATAKHYAV 191
Query: 210 YDLDNWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCAD 267
+ +G + R FNA+ S D+ +T+ FR V EG V SVM +YN+ G A
Sbjct: 192 H-----SGPEPSRHLFNAETSDIDLYETYLPAFRTLVKEGHVYSVMGAYNRFRGESCSAS 246
Query: 268 PNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLG 326
P L +R W +GYIVSDC +V + T A+A A++ GLDL+CG F
Sbjct: 247 P-FLFNILRNVWGFDGYIVSDCGAVTDIWKYHKITGDAATASALALKDGLDLECGSSFKS 305
Query: 327 LHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELA 386
L + A+ R L+SE DI+ A+ T + +LGMFD E Y + H LA
Sbjct: 306 L--KEAIDRKLISEADIDIAVKRLFTARFKLGMFDPEEIVS-YAQIPYSVNNNSAHDWLA 362
Query: 387 LEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
A+++ IVLLKNQ +LPLS +TVAVIGPN++ ++ GNY+G+ T L+GI
Sbjct: 363 RVASQKSIVLLKNQNNTLPLSR-DIKTVAVIGPNANDVQSLWGNYSGVPSNPITVLKGI 420
Score = 150 bits (379), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 106/332 (31%), Positives = 157/332 (47%), Gaps = 57/332 (17%)
Query: 442 LQGIGRYARTIHQQGCKDVACAD-------DQLFGAAIDASRQADATILVMGLDQSIEAE 494
L+ +Y T+ Q A A + + A+ + QADA +LV+GL++ +E E
Sbjct: 556 LEAEKKYKITVKYQNFYGDAIAQLLWAEPQENVLQEAVQVAGQADAIVLVLGLNERLEGE 615
Query: 495 AL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIA 544
+ DR L LP Q+EL+ K A+ P ILVL++G + + +A + +
Sbjct: 616 EMKVEADGFEGGDRTSLDLPSNQEELM-KAMTATGKPVILVLINGSALSINWAND--HVP 672
Query: 545 AIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPG 604
AI+ AGYPGQ GG AIAD+LFG NP G+LP+T+Y LP E + G
Sbjct: 673 AILTAGYPGQQGGNAIADVLFGDYNPAGRLPVTYYKS--TEQLPAFE-------NYDMKG 723
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
RTYR+++ +YPFG G+SYT F ++ PT V D
Sbjct: 724 RTYRYFQKKPLYPFGFGLSYTKFKYSNLKLPTNVTPEKD--------------------- 762
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGI 723
+ VDV N+G +DG + L A P QL FE+V++ G + V
Sbjct: 763 -----FEILVDVTNIGERDGDEVIELYLKDEKASTPRPILQLEGFERVNLKKGETKTVRF 817
Query: 724 NIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTK 755
I + LS++++ G R I G I +GG +
Sbjct: 818 TI-TPRQLSLINKKGQRVIEPGWFTISVGGKQ 848
>gi|336404627|ref|ZP_08585320.1| hypothetical protein HMPREF0127_02633 [Bacteroides sp. 1_1_30]
gi|335941531|gb|EGN03384.1| hypothetical protein HMPREF0127_02633 [Bacteroides sp. 1_1_30]
Length = 861
Score = 289 bits (740), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 171/441 (38%), Positives = 243/441 (55%), Gaps = 34/441 (7%)
Query: 35 ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
A + LP+ SL QR DL+ RL+L+EKV L+ + + A+PRLGIK YEWW+EALHGV
Sbjct: 19 ACKQLLPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGV 78
Query: 95 SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GG 146
G AT FPQ I +SFN +L + SDEAR
Sbjct: 79 GRAGL----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKR 128
Query: 147 TAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCK 205
GLT+W+PNVNIFRDPRWGRGQET GEDP L+G+ + VRGLQG D K+ A K
Sbjct: 129 YQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDAGYDKLHACAK 188
Query: 206 HFTAYDLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
HF + WN R F+A+ ++ +D+ +T+ F+ V + V VMC+YN+ G P
Sbjct: 189 HFAVHSGPEWN---RHSFDAENIAPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPC 245
Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCG 322
C +L + +R EW G +VSDC ++ +Y + P E A+A A+R G DL+CG
Sbjct: 246 CGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHETHPDKEHASAAAVRTGTDLECG 305
Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
AV+ GL+ E +I+ +L LT + LG D +P+ + + + + +H
Sbjct: 306 SEYA-SLADAVKAGLIDEKEIDISLKRLLTARFELGEMDEQPA---WAEIPTSVLNSKEH 361
Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
Q LAL AR+ +VLL+N+ LPL+ + +AV+GPN++ +V GNY GI T L
Sbjct: 362 QALALRMARESLVLLQNKNNILPLN--TNLKIAVMGPNANDSVMQWGNYNGIPAHTVTLL 419
Query: 443 QGIGRY---ARTIHQQGCKDV 460
+ + + I++ GC V
Sbjct: 420 EAVRAKLPEGQIIYEPGCDRV 440
Score = 117 bits (293), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 92/308 (29%), Positives = 136/308 (44%), Gaps = 49/308 (15%)
Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
A+ AD + G+ S+E E + DR + LP Q++L+ + A K
Sbjct: 591 AVKKVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGKK 650
Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
+ + SG I AI+ A YPGQAGGTAI D L+G NPGG+LP+T+Y
Sbjct: 651 -VVFINYSGSAI--GLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707
Query: 581 QEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
+ LP + +M+ GRTYR+ + ++PFGHG+SYT+F + A
Sbjct: 708 D--VNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTDFTYGEAKL----- 752
Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW 699
S N G+ + +T + V NVG +DG + V+ P
Sbjct: 753 --------SKNTIAKGENVVLT------------IPVSNVGQRDGEEVVQVYLRRPGDKE 792
Query: 700 APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVS 759
P L AF++VH+PAG + V I + + S T R G + + GGT
Sbjct: 793 GPRYTLRAFKRVHIPAGKTESVAIPLTGENFEWFDAESNTMRPLEGTYELLYGGTSDRNK 852
Query: 760 LHAATLGV 767
L + V
Sbjct: 853 LKTIVMNV 860
>gi|189463167|ref|ZP_03011952.1| hypothetical protein BACCOP_03878 [Bacteroides coprocola DSM 17136]
gi|189430146|gb|EDU99130.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
coprocola DSM 17136]
Length = 865
Score = 289 bits (739), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 174/451 (38%), Positives = 248/451 (54%), Gaps = 39/451 (8%)
Query: 38 RTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNV 97
+ P+ SL QR +DL+ RL+L+EKV L+ + + A+PRLGIK Y+WW+EALHGV
Sbjct: 23 QQFPYQNTSLTPEQRASDLLERLTLEEKVSLMQNASPAIPRLGIKAYDWWNEALHGVGRA 82
Query: 98 GPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------G 149
G AT FPQ I A+SF+ L + VSDEARA Y + G
Sbjct: 83 GI----------ATVFPQTIGMAASFDDELIYKVFTAVSDEARAKYTEFSKSGNLKRYQG 132
Query: 150 LTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL-KVAASCKHFT 208
LT+W+PN+NIFRDPRWGRGQET GEDP L+ + + VRGLQG D + K+ A KH+
Sbjct: 133 LTFWTPNINIFRDPRWGRGQETYGEDPYLTSRMGVAVVRGLQGPDNMKYDKLHACAKHYA 192
Query: 209 AYDLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCAD 267
+ WN R FNA+ ++ +D+ +T+ F+ V E V VMC+YN+ G P C
Sbjct: 193 VHSGPEWN---RHSFNAENIAPRDLWETYLPAFKALVQEADVKEVMCAYNRFEGEPCCGS 249
Query: 268 PNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCGPFL 325
+L + +R EW+ G IVSDC ++ ++ + P E A+A A+ +G DL+CG
Sbjct: 250 NRLLMQILRDEWKYKGIIVSDCGAISDFWRKGDHETHPDKETASAGAVLSGTDLECGNNY 309
Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP--SSQPYGHLGPKDVCTPDHQ 383
E AVQ+GL+ E I+ ++ LT + LG D S PY + K H+
Sbjct: 310 KSLPE-AVQKGLIDEKQIDISVKRLLTARFELGEMDEHVCWDSIPYSVVDSK-----AHK 363
Query: 384 ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQ 443
+LALE AR+ IVLL+N+ LPL +A+IGPN++ +V GNY G +T +
Sbjct: 364 DLALEIARKSIVLLQNRNNILPLKE--DMKIALIGPNANDSVMQWGNYNGFPSHTSTLYE 421
Query: 444 GIGRYARTIHQQGCKDVACADDQLFGAAIDA 474
+ R Q D C D+ G ++++
Sbjct: 422 ALKE--RIPANQLIYDFGC--DRTSGISLES 448
Score = 129 bits (325), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 96/312 (30%), Positives = 144/312 (46%), Gaps = 51/312 (16%)
Query: 470 AAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASK 519
A+ID + AD + G+ S+E E + DR + LP Q+ L+S++ K
Sbjct: 593 ASIDKVKAADVIVFAGGISPSLEGEEMPVNAEGFKGGDRTTIELPAIQRRLISELKKLGK 652
Query: 520 GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWY 579
P I V SG V AI+ A YPGQAGGTA+AD+LFG NP GKLP+T+Y
Sbjct: 653 -PIIFVNYSGSA--VGLEPESKICDAILQAWYPGQAGGTAVADVLFGDYNPSGKLPVTFY 709
Query: 580 PQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
++ LP + +M+ GRTYR+ +Y FGHG+SYTNF + P +
Sbjct: 710 --KHTDQLPDFQDYSMK--------GRTYRYMTESPLYSFGHGLSYTNFTY----GPATL 755
Query: 639 AVPLDGRHGSINATIS-GKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG 697
+ TIS GK + +T + V+N G+ DG + V+ +
Sbjct: 756 S----------QQTISQGKEVTLT------------IPVQNTGNYDGEEVVQVYLSCSGD 793
Query: 698 HWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHS 757
P L AF++VH+ G + V + + + T R+ G + + GGT
Sbjct: 794 KEGPSHTLRAFKRVHIAKGQRANVSFTLDSETFQWFDTNTNTMRMVEGNYELLYGGTSRI 853
Query: 758 VSLHAATLGVIK 769
L ++ VI+
Sbjct: 854 DQLQKISIKVIE 865
>gi|383125190|ref|ZP_09945844.1| hypothetical protein BSIG_4346 [Bacteroides sp. 1_1_6]
gi|251838523|gb|EES66609.1| hypothetical protein BSIG_4346 [Bacteroides sp. 1_1_6]
Length = 853
Score = 289 bits (739), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 165/403 (40%), Positives = 245/403 (60%), Gaps = 40/403 (9%)
Query: 48 PIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDF 107
P+ +RV DLI RL+++EK+ LL + + +PRLGI Y +EALHGV V PG +F
Sbjct: 36 PVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGV--VRPG-RF---- 88
Query: 108 PGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG----------LTYWSPNV 157
T FPQ I A+++N L + + V+SDEARA +N G LT+WSP V
Sbjct: 89 ---TVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQFSDVLTFWSPTV 145
Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
N+ RDPRWGR ET GEDP LSG ++V GLQG D LK+ ++ KHF A + ++
Sbjct: 146 NMARDPRWGRTPETYGEDPFLSGIMGTAFVNGLQGDDPHYLKIVSTPKHFAANNEEH--- 202
Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
+RF N ++S++ + + + F MCV EGK AS+M +YN +N VP +P +L++ +R
Sbjct: 203 -NRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMSAYNALNDVPCTLNPWLLQKVLRQ 261
Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC------GPFLGLHTES 331
+W GY+VSDC + + + T E AA +I+AGLDL+C GP L +
Sbjct: 262 DWGFQGYVVSDCGGPALLVNAHKYVKTKEAAATLSIKAGLDLECGDDVYDGPLL-----N 316
Query: 332 AVQRGLLSEIDINNALVNTLTVQMRLGMFD-GEPSSQPYGHLGPKDVCTPDHQELALEAA 390
A ++ ++S+ DI++A + LT +M+LG+FD GE PY + P + + +HQ++AL+AA
Sbjct: 317 AYKQYMVSDADIDSAAYHVLTARMKLGLFDSGE--RNPYTKISPSVIGSKEHQQIALDAA 374
Query: 391 RQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG 433
RQ IVLLKNQ LPL+ + +++AV+G N+ G+Y+G
Sbjct: 375 RQCIVLLKNQKNRLPLNADKLKSIAVVGINA--GKCEFGDYSG 415
Score = 137 bits (345), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 91/298 (30%), Positives = 147/298 (49%), Gaps = 45/298 (15%)
Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
L+G A A R+ + + VMG+++SIE E DR + LP Q+E + ++ + P I+V+
Sbjct: 593 LYGEAGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVN--PNIIVV 650
Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
+ G +A D I AI+ A YPG+ GGTA+A++LFG NP G+LP+T+Y +
Sbjct: 651 LVAGS-SLAINWMDEHIPAIVNAWYPGEQGGTAVAEVLFGDYNPAGRLPLTYYKS--LDE 707
Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
LP + GRTY+++KG V+YPFG+G+SY++F ++ L +
Sbjct: 708 LPPFD------DYDITKGRTYKYFKGDVLYPFGYGLSYSSFTYS----------DLQVKD 751
Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQL 705
G T+S + +KN G ++G V+ P G P K+L
Sbjct: 752 GVGEVTVSFR-------------------LKNTGKRNGDEVAQVYVRIPETGGIVPLKEL 792
Query: 706 VAFEKVHVPAGAQQRVGI--NIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
F +V + +G +RV I N +Y V G +P G ++ +G + + L
Sbjct: 793 KGFRRVPLKSGESRRVEIKLNKEQLRYWDV--EKGQFVVPKGAFDVMVGASSKDIRLQ 848
>gi|433679952|ref|ZP_20511614.1| beta-glucosidase [Xanthomonas translucens pv. translucens DSM
18974]
gi|430814928|emb|CCP42243.1| beta-glucosidase [Xanthomonas translucens pv. translucens DSM
18974]
Length = 909
Score = 288 bits (738), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 161/399 (40%), Positives = 231/399 (57%), Gaps = 32/399 (8%)
Query: 59 RLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVIT 118
+++ +EKV ++ A A+PRLG+ YEWW+E LHG++ G AT FPQ I
Sbjct: 67 KMTREEKVAQAMNAAPAIPRLGVPAYEWWNEGLHGIARNGY----------ATVFPQAIG 116
Query: 119 TASSFNATLWEAIGRVVSDEARAMYN--GGT-------AGLTYWSPNVNIFRDPRWGRGQ 169
A+++N L E +G V S EARA +N GG AGLT WSPN+NIFRDPRWGRG
Sbjct: 117 LAATWNTALLEQVGTVTSTEARAKFNLAGGPGKDHPRYAGLTIWSPNINIFRDPRWGRGM 176
Query: 170 ETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--RFHFNAKV 227
ET GEDP L+G+ A ++RGLQG D + A+ KH + +G + R F+ V
Sbjct: 177 ETYGEDPYLTGQLAVGFIRGLQGDDLTHPRTIATPKHLAVH-----SGPEPGRHGFDVDV 231
Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
S D+E T+ FR +++G+ +VMC+YN ++G P CA +L +RG+W G++VS
Sbjct: 232 SPHDLEATYTPAFRAAIVDGRAGAVMCAYNSLHGTPACAADWLLNGRLRGDWGFTGFVVS 291
Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
DCD+V +F + ++A A++AG DL+CG + A+ RG E ++ +L
Sbjct: 292 DCDAVDDMTQFHYFRADNAGSSAAALKAGHDLNCG-YAYRDLGKAIARGDADEAVLDQSL 350
Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
V + RLG + PY LG KDV + H+ LAL+AA+Q IVLL+N+ +LPL
Sbjct: 351 VRLFAARYRLGELQPQ-RKDPYARLGAKDVDSAAHRALALQAAQQSIVLLQNRNATLPLR 409
Query: 408 -HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
+R +AVIGPN+D + NY G + TPL G+
Sbjct: 410 PGLR---LAVIGPNADALAALEANYQGTSAAPVTPLLGL 445
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 96/278 (34%), Positives = 139/278 (50%), Gaps = 48/278 (17%)
Query: 486 GLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
GL +E E L DR L LP QQ L+ + + K P ++VLMSG + +
Sbjct: 646 GLSPDVEGEELRIDVPGFDGGDRNDLALPAAQQALLERAKASGK-PLVVVLMSGSAVALN 704
Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
+AK AI+ A YPGQ+GGTAIA +L G NPGG+LP+T+Y +LP
Sbjct: 705 WAKQ--HADAIVAAWYPGQSGGTAIAQVLAGDVNPGGRLPVTFY--RSTKDLPAYV---- 756
Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG 655
S GRTYR++KG ++ FG G+SYT F + AP + A L
Sbjct: 757 ---SYDMKGRTYRYFKGEPLFAFGSGLSYTRFTYA---APQLSATTL------------- 797
Query: 656 KAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPA 715
A N L V+ V+N G++ G + V+ PP G +P + LV F++V +
Sbjct: 798 ------QAGAN---LQVRTQVRNSGTRAGDEVVQVYLQPPQGAQSPLRTLVGFQRVTLQP 848
Query: 716 GAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
G + VG + + LS VDR+G R + G++ + +GG
Sbjct: 849 GEAREVGFEL-TPRQLSDVDRAGQRAVQPGDYRVFVGG 885
>gi|146301622|ref|YP_001196213.1| glycoside hydrolase family 3 protein [Flavobacterium johnsoniae
UW101]
gi|146156040|gb|ABQ06894.1| Candidate beta-xylosidase; Glycoside hydrolase family 3
[Flavobacterium johnsoniae UW101]
Length = 875
Score = 288 bits (738), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 161/418 (38%), Positives = 229/418 (54%), Gaps = 29/418 (6%)
Query: 40 LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGP 99
F SL QRV+DL+ RL+L+EKV +++ + + RLGI Y+WW+E LHGV+
Sbjct: 27 FQFQNPSLSFEQRVDDLVSRLTLEEKVSQMLNSSPEIARLGIPAYDWWNETLHGVARTPF 86
Query: 100 GTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA---------GL 150
T T +PQ I A++F+ + + E RA+YN GL
Sbjct: 87 KT---------TVYPQAIGMAATFDKNSLFTMADYSALEGRAIYNKAVELKRTNERYLGL 137
Query: 151 TYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAY 210
TYW+PN+NIFRDPRWGRGQET GEDP L+ ++V+GLQG D LK AA KH+ +
Sbjct: 138 TYWTPNINIFRDPRWGRGQETYGEDPYLTAVLGDAFVKGLQGDDPKYLKAAACAKHYAVH 197
Query: 211 DLDNWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADP 268
+G + R F+ V+ ++ DT+ FR + E VA VMC+YN P CA
Sbjct: 198 -----SGPESLRHTFDVDVTPYELWDTYLPAFRKLITESNVAGVMCAYNAFRTQPCCASD 252
Query: 269 NILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLH 328
++ +R EW+ +GY+ SDC ++ ++ E AAADA+ G D+DCG
Sbjct: 253 ILMNDILRKEWKFDGYVTSDCWAIDDFFKNHKTHPDAESAAADAVFHGTDIDCGTDAYKA 312
Query: 329 TESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVC-TPDHQELAL 387
AV+ G +SE I+ ++ ++ RLGMFD P S P V + +HQ AL
Sbjct: 313 LVQAVKNGKISEKQIDISVKRLFMIRFRLGMFD--PVSMVKYAQTPSSVLESKEHQLHAL 370
Query: 388 EAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
+ ARQ IVLLKN+ LPL+ + + V+GPN+D ++++GNY G TT LQGI
Sbjct: 371 KMARQSIVLLKNEKNILPLNK-NLKKIVVLGPNADNAISILGNYNGTPSKLTTVLQGI 427
Score = 103 bits (257), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 84/301 (27%), Positives = 137/301 (45%), Gaps = 54/301 (17%)
Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
F I+ + ADA I G+ +E E + DR +L P Q +L+ K +
Sbjct: 596 FADLIEHHKNADAFIFAGGISPQLEGEEMPVDFPGFKGGDRTSILFPEVQTKLL-KALQS 654
Query: 518 SKGPTILVLMSGGPIDVAF-AKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPM 576
S P + +M+G I + + A+N P I I W G GQ+ GTA AD++FG NP G+LP+
Sbjct: 655 SGKPVVFAMMTGSAIAIPWEAENIPAILNI-WYG--GQSAGTAAADVIFGDYNPAGRLPV 711
Query: 577 TWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAP 635
T+Y + ++LP + M +TYR++KG +Y FG+G+SYT+F ++ P
Sbjct: 712 TFYKND--SDLPSFVDYKM--------DNKTYRYFKGTPLYGFGYGLSYTSFKYSDLKTP 761
Query: 636 TVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTL-LVFSTP 694
+K+ + ++ + V V N G +G L
Sbjct: 762 ----------------------VKIKKGQ----SVSILVKVANTGKTEGEEVAQLYLINQ 795
Query: 695 PAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGT 754
P K L FE+ ++ G + + N+ + LS V G+ + G+ I IGG+
Sbjct: 796 DTAIKTPLKSLKGFERFNLKPGENKTITFNLS-PEDLSYVTPEGSLKQYEGKIKISIGGS 854
Query: 755 K 755
+
Sbjct: 855 Q 855
>gi|29347188|ref|NP_810691.1| beta-glucosidase [Bacteroides thetaiotaomicron VPI-5482]
gi|29339087|gb|AAO76885.1| beta-glucosidase (gentiobiase) [Bacteroides thetaiotaomicron
VPI-5482]
Length = 853
Score = 288 bits (738), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 164/403 (40%), Positives = 245/403 (60%), Gaps = 40/403 (9%)
Query: 48 PIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDF 107
P+ +RV DLI RL+++EK+ LL + + +PRLGI Y +EALHGV V PG +F
Sbjct: 36 PVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGV--VRPG-RF---- 88
Query: 108 PGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG----------LTYWSPNV 157
T FPQ I A+++N L + + V+SDEARA +N G LT+WSP V
Sbjct: 89 ---TVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQFSDVLTFWSPTV 145
Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
N+ RDPRWGR ET GEDP LSG ++V GLQG D LK+ ++ KHF A + ++
Sbjct: 146 NMARDPRWGRTPETYGEDPFLSGIMGTAFVNGLQGDDPHYLKIVSTPKHFAANNEEH--- 202
Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
+RF N ++S++ + + + F MCV EGK AS+M +YN +N VP +P +L++ +R
Sbjct: 203 -NRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMSAYNALNDVPCTLNPWLLQKVLRQ 261
Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC------GPFLGLHTES 331
+W GY+VSDC + + + T E AA +I+AGLDL+C GP L +
Sbjct: 262 DWGFQGYVVSDCGGPALLVNAHKYVKTKEAAATLSIKAGLDLECGDDVYDGPLL-----N 316
Query: 332 AVQRGLLSEIDINNALVNTLTVQMRLGMFD-GEPSSQPYGHLGPKDVCTPDHQELALEAA 390
A ++ ++S+ DI++A + LT +M+LG+FD GE PY + P + + +HQ++AL+AA
Sbjct: 317 AYKQYMVSDADIDSAAYHVLTARMKLGLFDSGE--RNPYTKISPSVIGSKEHQQIALDAA 374
Query: 391 RQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG 433
RQ +VLLKNQ LPL+ + +++AV+G N+ G+Y+G
Sbjct: 375 RQCVVLLKNQKNRLPLNADKLKSIAVVGINA--GKCEFGDYSG 415
Score = 137 bits (345), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 90/298 (30%), Positives = 147/298 (49%), Gaps = 45/298 (15%)
Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
L+G A A R+ + + VMG+++SIE E DR + LP Q+E + ++ + P I+V+
Sbjct: 593 LYGEAGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVN--PNIIVV 650
Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
+ G +A D I AI+ A YPG+ GGTA+A++LFG NP G+LP+T+Y +
Sbjct: 651 LVAGS-SLAINWMDEHIPAIVNAWYPGEQGGTAVAEVLFGDYNPAGRLPLTYYKS--LDE 707
Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
LP + GRTY+++KG V+YPFG+G+SY++F ++ L +
Sbjct: 708 LPPFD------DYDITKGRTYKYFKGDVLYPFGYGLSYSSFTYS----------DLQVKD 751
Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQL 705
G T+S + +KN G ++G V+ P G P K+L
Sbjct: 752 GGGEVTVSFR-------------------LKNTGKRNGDEVAQVYVRIPETGGIVPLKEL 792
Query: 706 VAFEKVHVPAGAQQRVGINI--HVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
F +V + +G +RV I + +Y V G +P G ++ +G + + L
Sbjct: 793 KGFRRVPLKSGESRRVEIKLDKEQLRYWDV--EKGQFVVPKGAFDVMVGASSKDIRLQ 848
>gi|94969405|ref|YP_591453.1| Beta-glucosidase [Candidatus Koribacter versatilis Ellin345]
gi|94551455|gb|ABF41379.1| Beta-glucosidase [Candidatus Koribacter versatilis Ellin345]
Length = 902
Score = 288 bits (738), Expect = 7e-75, Method: Compositional matrix adjust.
Identities = 173/437 (39%), Positives = 248/437 (56%), Gaps = 43/437 (9%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
+ + P +R +DL+ R++L EK L A A+PRLG+ Y+ WSEALHGV+ G
Sbjct: 38 YRDATRPANERAHDLVQRMTLDEKAAQLEDWATAIPRLGVPDYQTWSEALHGVARAG--- 94
Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYW 153
AT FPQ I A++++ + + +G V+S EAR YN GLT+W
Sbjct: 95 -------HATVFPQAIGMAATWDTEMVKQMGDVISTEARGKYNEAQREGNHRIFWGLTFW 147
Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLD 213
SPN+NIFRDPRWGRGQET GEDP L+GK +++ G+QG D K A+ KHF +
Sbjct: 148 SPNINIFRDPRWGRGQETYGEDPFLTGKMGIAFIDGVQGPDAAHPKAVATSKHFAVH--- 204
Query: 214 NWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
+G + R F+ KVS +D+E+T+ FR V +G V SVMC+YN V+G+ CA+ +L
Sbjct: 205 --SGPESLRHGFDVKVSPRDLEETYLAAFRATVTDGHVKSVMCAYNAVDGMGACANKMLL 262
Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYD-TQHFTSTPE--EAAADAIRAGLDLDCGPFL-GL 327
+ ++ W G++VSDC G D TQ + P+ AAA ++ AG DL C + G
Sbjct: 263 EEHLKQAWGFKGFVVSDC---GAIMDVTQGHKNAPDIVHAAAISLAAGTDLSCSIWEPGF 319
Query: 328 HT-ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELA 386
+T AV++GL++E + A + LGMFD EP S P + V + +H+ A
Sbjct: 320 NTLADAVRKGLVTEDMVTRAAERLYAARFELGMFD-EPGSNPNDKIDMSQVASEEHRAEA 378
Query: 387 LEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIG 446
L+AA + IVLLKN G LPL + +T+AVIGP +++ ++ GNY G TPL GI
Sbjct: 379 LKAAEESIVLLKNDG-LLPLKNA--KTIAVIGPTAELLASLEGNYNGQPVRPVTPLDGI- 434
Query: 447 RYARTIHQQGCKDVACA 463
+ Q G ++V A
Sbjct: 435 -----VKQFGAENVRYA 446
Score = 108 bits (270), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 91/283 (32%), Positives = 133/283 (46%), Gaps = 55/283 (19%)
Query: 486 GLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
GL ++E E + DR + LP Q++L+ + A K P ++V +SG + +
Sbjct: 640 GLSPNLEGEEMPIKIEGFSGGDRTSIDLPATQEKLLEALGAAGK-PVVVVNLSGSAVALN 698
Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAM 594
+A AI+ A YPG GGTAIA L G SNP G+LP+T+Y + +LP TE AM
Sbjct: 699 WANQ--HAGAILQAWYPGVEGGTAIAKTLAGESNPAGRLPVTFYAS--VQDLPAFTEYAM 754
Query: 595 RPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSIN-ATI 653
+ RTYR+Y G ++ FG G+SY+ F ++G + A+
Sbjct: 755 K--------NRTYRYYAGKPLWGFGFGLSYSTF-----------------KYGEVKLAST 789
Query: 654 SGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVH 712
S A K +L V V N G + + TP G P LV F++V
Sbjct: 790 SVDAGK---------SLTATVTVTNTSQVAGDEVVEAYLKTPQKG--GPSHSLVGFQRVP 838
Query: 713 VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTK 755
+ G + V I + + LS VD SG R I GE+ + IG T+
Sbjct: 839 LNPGESREVAIEVS-PRSLSAVDDSGKRSILAGEYRLSIGSTQ 880
>gi|399030621|ref|ZP_10730998.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
gi|398071229|gb|EJL62496.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
Length = 876
Score = 288 bits (737), Expect = 7e-75, Method: Compositional matrix adjust.
Identities = 165/453 (36%), Positives = 241/453 (53%), Gaps = 41/453 (9%)
Query: 5 IAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQE 64
+ F + +LL+ S+ ++ F F L +RV+DL+ RL+L+E
Sbjct: 6 LIFQIVLFVLLAIQSTAQKKQKEFL------------FQNPDLSFEKRVDDLVNRLTLEE 53
Query: 65 KVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFN 124
KV +++ + A+PRL I Y+WW+E LHGV+ T F T +PQ I A++F+
Sbjct: 54 KVSQMLNSSPAIPRLDIPAYDWWNETLHGVAR----TPFK-----VTVYPQAIAMAATFD 104
Query: 125 ATLWEAIGRVVSDEARAMYNGGTA---------GLTYWSPNVNIFRDPRWGRGQETPGED 175
+ + E RA+YN GLTYW+PN+NIFRDPRWGRGQET GED
Sbjct: 105 KNSLYKMADFSALEGRAIYNKAVESGRTNERYLGLTYWTPNINIFRDPRWGRGQETYGED 164
Query: 176 PVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--RFHFNAKVSKQDIE 233
P L+G S+V+GLQG D LK AA KH+ + +G + R F+ V+ ++
Sbjct: 165 PYLTGVLGDSFVKGLQGDDPKYLKAAACAKHYAVH-----SGPEPLRHTFDVDVTPYELW 219
Query: 234 DTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVG 293
DT+ F+ V E KVA VMC+YN P CA ++ +R +W+ GY+ SDC ++
Sbjct: 220 DTYLPAFQKLVTESKVAGVMCAYNAFRTQPCCASDILMTDILRNQWKFEGYVTSDCWAID 279
Query: 294 VYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTV 353
++ E A+ADA+ G D+DCG AV+ G +SE I+ ++ +
Sbjct: 280 DFFKNHKTHPDAESASADAVFHGTDIDCGTDAYKALVQAVKDGKISEKQIDISVKRLFMI 339
Query: 354 QMRLGMFDGEPSSQPYGHLGPKDVCTPD-HQELALEAARQGIVLLKNQGPSLPLSHIRHR 412
+ RLGMFD P P V D H+ AL+ ARQ IVLL+N+ +LPLS + +
Sbjct: 340 RFRLGMFD--PVEMVKYAQTPTSVLENDEHKAHALKMARQSIVLLRNENKTLPLSK-KLK 396
Query: 413 TVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
+ V+GPN D + ++GNY G TT L+GI
Sbjct: 397 KIVVLGPNVDNAIAILGNYNGTPSKLTTVLEGI 429
Score = 116 bits (290), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 89/301 (29%), Positives = 139/301 (46%), Gaps = 54/301 (17%)
Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
F ++ + ADA + V G+ +E E + DR +LLP Q +L+ +
Sbjct: 598 FNTLVNRVKDADAFVFVGGISPQLEGEEMKVNFPGFKGGDRTSILLPKIQTDLMKALKTT 657
Query: 518 SKGPTILVLMSGGPIDVAF-AKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPM 576
K P + V+M+G I + + A+N P IA A Y GQA GTA+AD+LFG NP G+LP+
Sbjct: 658 GK-PIVFVMMTGSAIAIPWEAENIPAIAN---AWYGGQAAGTAVADVLFGNYNPAGRLPV 713
Query: 577 TWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVH-TVANAP 635
T+Y ++ + P + RTYR++KG +Y FG+G+SYT F + + AP
Sbjct: 714 TFYK---------SDADLSPFVDYKMDNRTYRYFKGKPLYGFGYGLSYTTFKYDNLKIAP 764
Query: 636 TVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFS-TP 694
+V I GK + +T V V N G G + ++
Sbjct: 765 SV---------------IKGKNVPIT------------VKVTNTGKVSGEEVVQLYVINQ 797
Query: 695 PAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGT 754
AP K L FE++ + AG + + + + LS + G + G+ I IGG
Sbjct: 798 NTAIKAPLKTLKGFERISLKAGKSKTITFTLS-PEDLSYITAEGNHQQYNGKIKIAIGGN 856
Query: 755 K 755
+
Sbjct: 857 Q 857
>gi|293370605|ref|ZP_06617157.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
gi|292634339|gb|EFF52876.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
Length = 861
Score = 288 bits (737), Expect = 8e-75, Method: Compositional matrix adjust.
Identities = 172/441 (39%), Positives = 241/441 (54%), Gaps = 34/441 (7%)
Query: 35 ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
A + LP+ SL QR DL+ RL+L+EKV L+ + + A+PRLGIK YEWW+EALHGV
Sbjct: 19 ACKQLLPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGV 78
Query: 95 SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GG 146
G AT FPQ I +SFN +L + SDEAR
Sbjct: 79 GRAGL----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKR 128
Query: 147 TAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCK 205
GLT+W+PNVNIFRDPRWGRGQET GEDP L+G+ + VRGLQG D K+ A K
Sbjct: 129 YQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDAGYDKLHACAK 188
Query: 206 HFTAYDLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
HF + WN R F+A+ + +D+ +T+ F+ V + V VMC+YN+ G P
Sbjct: 189 HFAVHSGPEWN---RHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPC 245
Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCG 322
C +L + +R EW G +VSDC ++ +Y + P E A+A A+R G DL+CG
Sbjct: 246 CGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPDKEHASAAAVRTGTDLECG 305
Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
AV+ GL+ E +I+ +L LT + LG D + + + + + + +H
Sbjct: 306 SEYA-SLADAVKAGLIDEKEIDISLKRLLTARFELGEMDEQSA---WSEIPTSVLNSKEH 361
Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
Q LAL AR+ +VLL+N+ LPL+ H VAV+GPN++ +V GNY GI T L
Sbjct: 362 QALALRMARESLVLLQNKNNILPLN--THLKVAVMGPNANDSVMQWGNYNGIPAHTVTLL 419
Query: 443 QGIGRY---ARTIHQQGCKDV 460
+ + + I++ GC V
Sbjct: 420 EAVRAKLPEGQIIYEPGCDRV 440
Score = 116 bits (291), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 92/308 (29%), Positives = 136/308 (44%), Gaps = 49/308 (15%)
Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
A+ AD + G+ S+E E + DR + LP Q++L+ + A K
Sbjct: 591 AVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGKK 650
Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
+ + SG I AI+ A YPGQAGGTAI D L+G NPGG+LP+T+Y
Sbjct: 651 -VVFINYSGSAI--GLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707
Query: 581 QEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
+ LP + +M+ GRTYR+ + ++PFGHG+SYT+F + A
Sbjct: 708 D--VNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTDFTYGEAKL----- 752
Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW 699
S N G+ + +T + V NVG +DG + V+ P
Sbjct: 753 --------SKNTIAKGENVVLT------------IPVSNVGQRDGEEVVQVYLRRPGDKE 792
Query: 700 APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVS 759
P L AF++VH+PAG + V I + + S T R G + + GGT
Sbjct: 793 GPRYTLRAFKRVHIPAGKTESVAIPLTGENFEWFDVESNTMRPLEGTYELLYGGTSDRNK 852
Query: 760 LHAATLGV 767
L + V
Sbjct: 853 LKTIVMNV 860
>gi|393781488|ref|ZP_10369683.1| hypothetical protein HMPREF1071_00551 [Bacteroides salyersiae
CL02T12C01]
gi|392676551|gb|EIY69983.1| hypothetical protein HMPREF1071_00551 [Bacteroides salyersiae
CL02T12C01]
Length = 850
Score = 288 bits (737), Expect = 8e-75, Method: Compositional matrix adjust.
Identities = 166/419 (39%), Positives = 238/419 (56%), Gaps = 34/419 (8%)
Query: 40 LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGP 99
LP+ L QR DL+ RL+++EK+ L+ + + +PRLGI+ YEWW+EALHGV+ G
Sbjct: 14 LPYQNPDLTPEQRATDLLQRLTVEEKISLMQNNSPGIPRLGIRPYEWWNEALHGVARAGL 73
Query: 100 GTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT--------AGLT 151
AT FPQ I A+SFN +L + + VSDEARA GLT
Sbjct: 74 ----------ATVFPQTIGMAASFNDSLVQKVFTAVSDEARAKNRAFNDQGQYKRYQGLT 123
Query: 152 YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL-KVAASCKHFTAY 210
W+PNVNIFRDPRWGRGQET GEDP L+ + + V+GLQG D R K+ A KHF +
Sbjct: 124 MWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVAVVKGLQGPDSARYDKLHACAKHFAVH 183
Query: 211 DLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
WN R FNA+ ++ +D+ +T+ F+ V E V VMC+YN+ G P C
Sbjct: 184 SGPEWN---RHSFNAENIAPRDLWETYLPAFKTLVQEADVKEVMCAYNRFEGDPCCGSNR 240
Query: 270 ILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPE--EAAADAIRAGLDLDCGPFLGL 327
+L + +R EW NG +VSDC ++ ++ + + P+ A+ADA+ +G DL+CG
Sbjct: 241 LLTQILRDEWGFNGIVVSDCGAISDFWGAKKHNTHPDAAHASADAVLSGTDLECGSNYRK 300
Query: 328 HTESAVQRGLLSEIDINNALVNTLTVQMRLG-MFDGEPSSQPYGHLGPKDVCTPDHQELA 386
T+ AV+ G++SE I+ ++ L + LG M + P + PY V P+H+ LA
Sbjct: 301 LTD-AVKAGIISEEQIDISVKRLLKARFELGEMEESHPWALPYSI-----VDCPEHRHLA 354
Query: 387 LEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
L+ A + + LL+N+ LPL +H VAVIGPN++ +V GNY G +T L +
Sbjct: 355 LQIAHETMTLLQNKENILPLD--KHAKVAVIGPNANDSVMQWGNYNGTPSHTSTLLSAL 411
Score = 113 bits (283), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 97/361 (26%), Positives = 153/361 (42%), Gaps = 56/361 (15%)
Query: 425 VTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQL-------FGAAIDASRQ 477
V GN YT Q Y I + KD A + L A ++ +
Sbjct: 527 VIKTGNIKNPENLYTLQAQAGKSYEIVIEFKQIKDGAYFNFDLVEDIPLNMNATLEKLKD 586
Query: 478 ADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLM 527
+ I G+ +E E + DR + LP Q+ +++ + A K I V
Sbjct: 587 TEIVIFAGGISPLLEGEEMKVSAAGFKGGDRTDIELPAVQRNVLAALKKAGK-KVIFVNF 645
Query: 528 SGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL 587
SG + A AI+ A YPGQ GGTA+AD+LFG NP G+LP+T+Y + L
Sbjct: 646 SGSAM--ALTPETENCDAILQAWYPGQEGGTAVADVLFGDYNPAGRLPVTFYKN--MEQL 701
Query: 588 P-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
P + +M+ GRTYR+ K ++PFG+G+SYT F + A A D +
Sbjct: 702 PDFEDYSMQ--------GRTYRYMKEAPLFPFGYGLSYTTFTYGKARA--------DKKR 745
Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLV 706
S +G+ + +T + V N+GS+DG + V+ P K L
Sbjct: 746 IS-----TGEKMTLT------------IPVSNIGSRDGEEVVQVYLRREDDPEGPTKTLR 788
Query: 707 AFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLG 766
AF++V + G V I + + + + T GE+ + GG+ + L + +
Sbjct: 789 AFKRVEITKGKSLNVKIELPYTAFEWFDNSTHTMHSMKGEYEVLYGGSSRTEDLQSIKMQ 848
Query: 767 V 767
+
Sbjct: 849 I 849
>gi|383110854|ref|ZP_09931672.1| hypothetical protein BSGG_1962 [Bacteroides sp. D2]
gi|313694427|gb|EFS31262.1| hypothetical protein BSGG_1962 [Bacteroides sp. D2]
Length = 861
Score = 288 bits (737), Expect = 9e-75, Method: Compositional matrix adjust.
Identities = 174/445 (39%), Positives = 248/445 (55%), Gaps = 40/445 (8%)
Query: 28 FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
F+C K LP+ SL QR DL+ RL+L+EKV L+ + + A+PRLGIK Y+WW
Sbjct: 18 FSCAQK------LPYQDTSLTAEQRAEDLLPRLTLEEKVALMQNASPAIPRLGIKEYDWW 71
Query: 88 SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAM----- 142
+EALHGV G AT FPQ I +SFN +L + VSDEAR
Sbjct: 72 NEALHGVGRAGL----------ATVFPQSIGMGASFNDSLLYEVFDAVSDEARVKSRIFS 121
Query: 143 YNG---GTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRL 198
NG GLT+W+PNVNIFRDPRWGRGQET GEDP L+G+ + VRGLQG +G
Sbjct: 122 ENGVLKRYQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQLGMAVVRGLQGPENGKYD 181
Query: 199 KVAASCKHFTAYDLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYN 257
K+ A KHF + WN R F+A+ ++ +D+ +T+ F+ V + V VMC+YN
Sbjct: 182 KLHACAKHFAVHSGPEWN---RHSFDAENITPRDLWETYLPAFKDLVQKADVKEVMCAYN 238
Query: 258 QVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRA 315
+ G P C +L + +R EW G +VSDC ++ +Y + P E A+A A+ +
Sbjct: 239 RFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPDKEHASAGAVLS 298
Query: 316 GLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPK 375
G DL+CG G AV+ GL+ E I+ +L LT + LG D +P+ + +
Sbjct: 299 GTDLECGGEYG-SLADAVKAGLIDEKQIDVSLKRLLTARFELGEMDEQPA---WAEIPAS 354
Query: 376 DVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIA 435
+ + +HQ+LAL AR+ +VLL+N+ LPL+ VAV+GPN++ +V GNY GI
Sbjct: 355 TLNSKEHQDLALRMARESLVLLQNKNDILPLN--TDLKVAVMGPNANDSVMQWGNYNGIP 412
Query: 436 CGYTTPLQGIGRY---ARTIHQQGC 457
T L+ + + +++ GC
Sbjct: 413 GHTVTLLEAVRSKLPEGQVMYEPGC 437
Score = 112 bits (280), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 89/308 (28%), Positives = 137/308 (44%), Gaps = 49/308 (15%)
Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
A++ + AD + G+ S+E E + DR + LP Q++L+ + A K
Sbjct: 591 AVEKVKDADVVLFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPAVQRDLLKALKKAGKK 650
Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
+ + SG I + N AI+ YPGQAGGTAI D+LFG NP G+LP+T+Y
Sbjct: 651 -VVFINYSGSAIGLVPESN--TCEAILQGWYPGQAGGTAIVDVLFGDYNPAGRLPVTFYK 707
Query: 581 QEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
LP + +M+ GRTYR+ + ++PFGHG+SYT F + A+
Sbjct: 708 D--AGQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTTFTYGEADL----- 752
Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW 699
S N G + +T + V N G +DG + V+ A
Sbjct: 753 --------SKNTIGDGGTVTLT------------IPVSNAGQRDGDEVVQVYLRCMADKE 792
Query: 700 APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVS 759
PH L AF++VH+PAG ++V I + + + T G + + GG+
Sbjct: 793 GPHYTLRAFKRVHIPAGETKQVTIPLTYESFEWFDTATNTVHPLKGTYELLYGGSSDKNK 852
Query: 760 LHAATLGV 767
L + V
Sbjct: 853 LKTIVMNV 860
>gi|332665860|ref|YP_004448648.1| beta-glucosidase [Haliscomenobacter hydrossis DSM 1100]
gi|332334674|gb|AEE51775.1| Beta-glucosidase [Haliscomenobacter hydrossis DSM 1100]
Length = 887
Score = 288 bits (737), Expect = 9e-75, Method: Compositional matrix adjust.
Identities = 171/482 (35%), Positives = 258/482 (53%), Gaps = 47/482 (9%)
Query: 6 AFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEK 65
A FFL L+ + ++ C + + + P +L RV DL+ RL+L+EK
Sbjct: 17 AIFFLSLLTILIAN---------VCQAQ--SFKGFPMWDTNLSFEVRVKDLVSRLTLEEK 65
Query: 66 VKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNA 125
V +++ A A+PRLGI Y+WW+E LHGV+ T F T +PQ I A+ +++
Sbjct: 66 VGQMLNAAPAIPRLGIPAYDWWNEVLHGVAR----TPFH-----VTVYPQAIGMAAGWDS 116
Query: 126 TLWEAIGRVVSDEARAMYNGGTA---------GLTYWSPNVNIFRDPRWGRGQETPGEDP 176
T + + E RA++N TA GLTYW+PN+NIFRDPRWGRGQET GEDP
Sbjct: 117 TSLAMMAHYSALEGRAVFNKATALGRNNERYLGLTYWTPNINIFRDPRWGRGQETYGEDP 176
Query: 177 VLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--RFHFNAKVSKQDIED 234
L+ ++VRGLQG D LK AA KHF + +G + R N S D+ D
Sbjct: 177 FLTSMLGRAFVRGLQGDDPKYLKAAACAKHFAVH-----SGPEPSRHSDNFSPSNYDLWD 231
Query: 235 TFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGV 294
T+ F+ V + KV VMC+YN +G P C ++ +R +W+ GY+ SDC ++
Sbjct: 232 TYLPAFKELVTKAKVEGVMCAYNAFHGQPCCGSDVLMNDILRKQWQFKGYVTSDCWAIDD 291
Query: 295 YYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQ 354
++ A+ DA+ G D++CG + V++G+++E ++ +L+ T +
Sbjct: 292 FFKFHKTHPDATSASVDAVLHGTDVECGTDVYKSLLDGVKKGMIAEAQLDISLIRLFTTR 351
Query: 355 MRLGMFDGEPSSQPYGHLGPKDVC-TPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRT 413
RLGMFD P S P+ + T +H+ +L+ A+Q IVLLKN+G +LPLS +
Sbjct: 352 YRLGMFD--PVSMVKYAQTPESILETAEHKAHSLKMAQQSIVLLKNEGNTLPLSK-NIKK 408
Query: 414 VAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGCKDVACADDQLFG 469
+AV+GPN+D + ++GNY G T LQGI G+ I++ K + +D L
Sbjct: 409 IAVLGPNADNRIVVLGNYNGQPSEIITALQGIKNKLGQEVELIYE---KAINFTNDTLLA 465
Query: 470 AA 471
A
Sbjct: 466 YA 467
Score = 130 bits (326), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 95/301 (31%), Positives = 142/301 (47%), Gaps = 54/301 (17%)
Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
A ++ + ADA + V G+ +E E + DR +LLP Q EL+ +
Sbjct: 609 LSAIVNRVKDADAIVYVGGISPQLEGEEMRVDFPGFNGGDRTSILLPAVQTELLKMLKGT 668
Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
K P + V+M+G I A D I AI+ A Y GQ+ GTAIAD+LFG NP G+LP+T
Sbjct: 669 GK-PLVFVVMTGSAI--ALPYEDQNIPAIVNAWYGGQSAGTAIADVLFGDYNPAGRLPVT 725
Query: 578 WYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTV 637
+Y + ++LP +S RTYR++KG +YPFGHG+SYT+F ++ P
Sbjct: 726 FYKAD--SDLP-------DFKSYDMNNRTYRYFKGDALYPFGHGLSYTSFQYSKLKTP-- 774
Query: 638 VAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTL-LVFSTPPA 696
G I SG + KV+ + N G KDG + L + P
Sbjct: 775 ---------GKIK---SGASFKVS------------ATLTNTGKKDGDEVVQLYLAYPEV 810
Query: 697 GHWAPHKQLVAFEKVHVPAGAQQRVGINI--HVCKYLSVVDRSGTRRIPLGEHNIHIGGT 754
AP + L F ++ + AG + V + C+ +V+ G P G+ I +GG
Sbjct: 811 AGKAPIRALKGFNRIRLKAGESKTVSFTLSPEQCQ---LVNEEGALYQPKGKMEISLGGA 867
Query: 755 K 755
+
Sbjct: 868 Q 868
>gi|333381842|ref|ZP_08473521.1| hypothetical protein HMPREF9455_01687 [Dysgonomonas gadei ATCC
BAA-286]
gi|332829771|gb|EGK02417.1| hypothetical protein HMPREF9455_01687 [Dysgonomonas gadei ATCC
BAA-286]
Length = 861
Score = 288 bits (736), Expect = 9e-75, Method: Compositional matrix adjust.
Identities = 171/452 (37%), Positives = 247/452 (54%), Gaps = 33/452 (7%)
Query: 38 RTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNV 97
+ +P+ +L +R DL+ RL+L+EKV L+ + V RLG+K + WWSEALHGV+N
Sbjct: 19 QIMPYKDANLTPEERAQDLLSRLTLKEKVGLMGDNSIEVTRLGVKKFAWWSEALHGVANQ 78
Query: 98 GPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------- 148
G G T FP+ I A+SFN L + +SDEARA ++
Sbjct: 79 G----------GVTVFPEPIGMAASFNDELLYHVFDAISDEARARFHFREKKGDERRQDN 128
Query: 149 GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHF 207
GL+ W+PNVNIFRDPRWGRGQET GEDP L+ + S V GLQG D K+ A KH+
Sbjct: 129 GLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGISVVNGLQGPKDAKYKKLLACAKHY 188
Query: 208 TAYDLDNWNGVDRFHFNA-KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCA 266
+ WN R N + + + +T+ F++ V + V+ VMC+Y++ + P C
Sbjct: 189 AVHSGPEWN---RHVLNLNNLDNRHLWETYMPAFQVLVQKADVSQVMCAYHRQDDDPCCG 245
Query: 267 DPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLG 326
+ ++LKR +R EW +VSDC ++ +Y + +S +A + AG D++CG
Sbjct: 246 NNHLLKRILRDEWGFKRMVVSDCGAIADFYTSHKVSSDALHSAVKGVLAGTDVECGFGYT 305
Query: 327 LHT-ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
H AV RGL+ E DI+ +++ LT + RLG FD + S P+ ++ + HQ L
Sbjct: 306 YHELVDAVSRGLIYEADIDKSVLRLLTERFRLGDFD-DNSIVPWANIPDTIINCKKHQAL 364
Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
ALE ARQ + LL+N+ LPLS + +AVIGPN+D M GNY GI T L+GI
Sbjct: 365 ALEMARQSMTLLQNKNNILPLSS--KKKIAVIGPNADDAKLMWGNYNGIPVKTVTILEGI 422
Query: 446 GRYA--RTIHQQGCKDVACADDQLFGAAIDAS 475
A +++GC V DD + + I S
Sbjct: 423 KSIAGKDIFYEKGCDIV---DDMILESYITRS 451
Score = 99.8 bits (247), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 84/313 (26%), Positives = 134/313 (42%), Gaps = 52/313 (16%)
Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
+ +D + D + G+ +E E + DR + LP Q+ + + A
Sbjct: 589 YKGLVDRLKDIDVVVFAGGISGELEGEEMPIEMPGFKGGDRTDIELPASQRNCIKALKKA 648
Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
K I+V SG I AI+ A Y GQ+GG AIA++LFG NP GKLP+T
Sbjct: 649 GK-RVIMVNCSGSAI--GLMPESESCEAILQAWYGGQSGGQAIAEVLFGKYNPSGKLPIT 705
Query: 578 WYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
+Y I LP E M+ GRTYR+ + ++PFG+G+SYT F A A +
Sbjct: 706 FYKN--IDQLPDFEEYDMK--------GRTYRYLEDKPLFPFGYGLSYTTFDIGRATASS 755
Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA 696
+ A +G+ IK+ + VKN G + G+ T+ V+
Sbjct: 756 ISAK-------------AGEKIKLV------------IPVKNTGKRTGSETVQVY-VKKV 789
Query: 697 GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPL-GEHNIHIGGTK 755
P K L +F+++ +P Q + + + D + R L G++ I G +
Sbjct: 790 DSGGPIKTLRSFKRIELPPNVSQDLTFELEPS-FFEWYDPATLRMNVLPGDYEILYGTSS 848
Query: 756 HSVSLHAATLGVI 768
+ L + +I
Sbjct: 849 DNKDLQTLNITLI 861
>gi|380696428|ref|ZP_09861287.1| beta-glucosidase [Bacteroides faecis MAJ27]
Length = 851
Score = 288 bits (736), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 162/402 (40%), Positives = 243/402 (60%), Gaps = 38/402 (9%)
Query: 48 PIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDF 107
P+ +RV DLI RL+++EK+ LL + + +PRLGI Y +EALHGV V PG +F
Sbjct: 34 PVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGV--VRPG-RF---- 86
Query: 108 PGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG----------LTYWSPNV 157
T FPQ I A+++N L + V+SDEARA +N G LT+WSP V
Sbjct: 87 ---TVFPQAIGLAATWNPELQRRVATVISDEARARWNELDQGRAQKEQFSDVLTFWSPTV 143
Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
N+ RDPRWGR ET GEDP LSG ++V+GLQG D LK+ ++ KHF A + ++
Sbjct: 144 NMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQGDDPHYLKIVSTPKHFAANNEEH--- 200
Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
+RF N ++S++ + + + F MCV EGK AS+M +YN +N VP + +L++ +R
Sbjct: 201 -NRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMSAYNALNDVPCTLNAWLLQKVLRK 259
Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC------GPFLGLHTES 331
+W GY+VSDC + + + T E AA +++AGLDL+C GP L +
Sbjct: 260 DWGFQGYVVSDCGGPALLVNAHKYLKTKEAAATLSLKAGLDLECGDDVYDGPLL-----N 314
Query: 332 AVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAAR 391
A ++ ++S+ DI++A + LT +M+LG+FDG PY + P + + +HQ++AL+AAR
Sbjct: 315 AYKQYMVSDADIDSAAYHVLTARMKLGLFDG-VERNPYTKISPSVIGSKEHQQIALDAAR 373
Query: 392 QGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG 433
Q IVLLKNQ LPL+ + +++AV+G N+ G+Y+G
Sbjct: 374 QCIVLLKNQKNMLPLNASKLKSIAVVGINA--GKCEFGDYSG 413
Score = 135 bits (339), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 89/298 (29%), Positives = 148/298 (49%), Gaps = 45/298 (15%)
Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
L+G A A R+ + I VMG+++SIE E DR + LP Q+E + ++ + I++L
Sbjct: 591 LYGEAGKAVRECETVIAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNSN-MIVIL 649
Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
++G + + + D + AI+ A YPG+ GGTA+A++LFG NP G+LP+T+Y +
Sbjct: 650 VAGSSLAINWM--DEHVPAIVNAWYPGEQGGTAVAEVLFGDYNPAGRLPLTYYKS--LDE 705
Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
LP + GRTY+++KG V+YPFG+G+SY++F ++
Sbjct: 706 LPPFD------DYDITKGRTYKYFKGEVLYPFGYGLSYSSFKYS---------------- 743
Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQL 705
+ + +A +V V +KN G ++G V+ P G P K+L
Sbjct: 744 ---DLRVKDEADEV----------AVSFRLKNTGKRNGDEVTQVYVRIPETGGIVPVKEL 790
Query: 706 VAFEKVHVPAGAQQRVGI--NIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
F +V + +G +RV I N +Y V G +P G +I +G + + L
Sbjct: 791 KGFRRVPLKSGESRRVEIRLNKEQLRYWDV--GKGQFVVPKGTFDIMVGASSKDIRLQ 846
>gi|371777646|ref|ZP_09483968.1| beta-glucosidase [Anaerophaga sp. HS1]
Length = 865
Score = 288 bits (736), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 169/453 (37%), Positives = 241/453 (53%), Gaps = 45/453 (9%)
Query: 7 FFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKV 66
F + +IL+S + G + LPF L +R DLI RL++QEK
Sbjct: 5 LFLIAVILISMTVKG----------------QVLPFQNPDLSSEERAKDLISRLTVQEKA 48
Query: 67 KLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNAT 126
+LL + A+PRLGIK + WWSEALHG +N T FPQ I A+SFN
Sbjct: 49 RLLCDQSEAIPRLGIKKFNWWSEALHGYANND----------SVTVFPQPIGMAASFNEE 98
Query: 127 LWEAIGRVVSDEARAMY---------NGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPV 177
L I +SDEARA Y N L+ W+PNVNIFRDPRWGRGQET GEDP
Sbjct: 99 LVFEIFNAISDEARAKYHQAQRRGEENRRFLSLSVWTPNVNIFRDPRWGRGQETYGEDPY 158
Query: 178 LSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAK-VSKQDIEDT 235
L+ + V+GLQG D K+ A KH+T + W+ R N VS ++ +T
Sbjct: 159 LTSRMGVQVVKGLQGPEDAKYRKLLACAKHYTVHSGPEWS---RHELNINDVSPREFYET 215
Query: 236 FDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVY 295
+ F+ V + V VMC+Y++++ P C++ IL+R +R EW +V+DC ++ +
Sbjct: 216 YMPAFKALVQKADVRQVMCAYHRLDDEPCCSNTRILQRILRDEWGYEHMVVADCGAISDF 275
Query: 296 YDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE---SAVQRGLLSEIDINNALVNTLT 352
Y T +STP AAA + AG DL+C + H + A+++ L++E DI+ +L+ L
Sbjct: 276 YTTHGISSTPVHAAATGLLAGTDLEC-IWDNYHYKMLPEALEKDLITEKDIDRSLMRVLK 334
Query: 353 VQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHR 412
+ LG D + S P+ + P + H++LA + A+Q IVLL+N+ LPL
Sbjct: 335 GRFDLGEMD-DNSLVPWAQIPPSVLNCEKHRQLAYKMAQQSIVLLQNKNKVLPLDKSSIN 393
Query: 413 TVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
+AV+GPN+D V + GNY G T L GI
Sbjct: 394 KIAVVGPNADDEVVLWGNYNGTPIRTITVLDGI 426
Score = 100 bits (249), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 86/278 (30%), Positives = 119/278 (42%), Gaps = 50/278 (17%)
Query: 459 DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQ 508
D +D F A I + I V GL +E E + DR + LP Q+
Sbjct: 584 DFGREEDIDFTALIKKLEGIETVIFVGGLSGFLEGEEMPVSYPGFKGGDRTNIELPSVQR 643
Query: 509 ELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTS 568
+ + A K I V SG I A AII A Y G++GG AIAD+LFG
Sbjct: 644 NCLKALKEAGK-TVIFVNCSGSAI--ALEPETESCDAIIQAWYGGESGGQAIADVLFGDY 700
Query: 569 NPGGKLPMTWYPQEYITNL-PMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNF 627
NP GKLP+T+Y NL + +M GRTYR Y ++PFG G+SYTNF
Sbjct: 701 NPSGKLPVTFYRNS--DNLGDFEDYSME--------GRTYR-YTNNHLFPFGFGLSYTNF 749
Query: 628 VHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHT 687
A +S IK T+ +++ VKN G +DG
Sbjct: 750 EIG-------------------KARLSKSTIKADE------TISIKIPVKNTGKRDGTEI 784
Query: 688 LLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINI 725
+ V+ P K L F+++ VPAG ++ I++
Sbjct: 785 VQVYVRKVNDIDGPLKTLKGFQRIAVPAGKTRQANISL 822
>gi|255690202|ref|ZP_05413877.1| beta-glucosidase [Bacteroides finegoldii DSM 17565]
gi|260624221|gb|EEX47092.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
finegoldii DSM 17565]
Length = 853
Score = 287 bits (735), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 162/397 (40%), Positives = 240/397 (60%), Gaps = 28/397 (7%)
Query: 48 PIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDF 107
P+ +RV DLI RL+++EK+ LL + + +PRLGI Y +EALHGV V PG +F
Sbjct: 35 PVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGV--VRPG-RF---- 87
Query: 108 PGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG----------LTYWSPNV 157
T FPQ I A+++N L + I V+SDEARA +N G LT+WSP V
Sbjct: 88 ---TVFPQAIGLAATWNPELQKRIATVISDEARARWNELDQGRNQKEQFSDVLTFWSPTV 144
Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
N+ RDPRWGR ET GEDP LSG ++V+GLQG D LK+ ++ KHF A + ++
Sbjct: 145 NMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQGDDPHYLKIVSTPKHFAANNEEH--- 201
Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
+RF N ++S++ + + + F MCV EGK AS+M +YN +N VP + +L++ +R
Sbjct: 202 -NRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMTAYNALNNVPCTLNSWLLQKVLRR 260
Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRG 336
+W GY+VSDC + + + T E AA +I+AGLDL+CG + +A ++
Sbjct: 261 DWGFQGYVVSDCGGPSLLVNAHKYVKTKEAAATLSIKAGLDLECGDDVYDEYLLNAYKQY 320
Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
+ SE DI++A + LT +M+LG+FDG PY + P + + +HQ +AL AAR+ IVL
Sbjct: 321 MASEADIDSAAYHVLTARMKLGLFDG-VERNPYAKISPSVIGSKEHQTVALNAARECIVL 379
Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG 433
LKNQ LPL+ + +++AV+G N+ G+Y+G
Sbjct: 380 LKNQKNMLPLNVKKLKSIAVVGINAG--KCEFGDYSG 414
Score = 131 bits (330), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 87/300 (29%), Positives = 144/300 (48%), Gaps = 42/300 (14%)
Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
L+G A A R+ + + VMG+++SIE E DR + LP Q+E + ++ + ILVL
Sbjct: 592 LYGEAGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNPN-IILVL 650
Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
++G + V + + + AI+ A YPG+ GGTA+A++LFG NP G+LP+T+Y +
Sbjct: 651 VAGSSLAVNW--ENEHLPAIVNAWYPGEQGGTAVAEVLFGDYNPAGRLPLTYYKS--LEQ 706
Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
LP + GRTY+++K V+YPFG+G+SYT F ++
Sbjct: 707 LPAFD------DYDITKGRTYQYFKKDVLYPFGYGLSYTTF-----------------KY 743
Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPP--AGHWAPHKQ 704
++ +GK T+ V +KN G + G V+ P AG +Q
Sbjct: 744 SNLKVDDAGK------------TVNVSFTLKNTGKRAGDEVAQVYVRLPEIAGSTQAIRQ 791
Query: 705 LVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAAT 764
L F +V + AG ++V I + + ++ +P G +G + + L T
Sbjct: 792 LKGFRRVALKAGESRKVEITLDKEQLRYWDEKQACFVVPQGSFTFMVGASSGDIRLENTT 851
>gi|206901280|ref|YP_002250567.1| xylosidase/arabinosidase [Dictyoglomus thermophilum H-6-12]
gi|206740383|gb|ACI19441.1| xylosidase/arabinosidase [Dictyoglomus thermophilum H-6-12]
Length = 762
Score = 287 bits (735), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 216/690 (31%), Positives = 341/690 (49%), Gaps = 96/690 (13%)
Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARA--MYNGGTAGLTYWSPNVNIFRDPRWG 166
GAT FPQ I AS+F L + V+ +A ++ G SP ++I RDPRWG
Sbjct: 106 GATVFPQAIGMASTFEPELIRRVSDVIRQHMKAANVHQG-------LSPVLDIPRDPRWG 158
Query: 167 RGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFN-- 224
R +ET GEDP L + A YV+GLQG D R + A+ KHFTAY G+ N
Sbjct: 159 RTEETFGEDPYLVSRMATEYVKGLQGEDW-REGIVATVKHFTAY------GISEGARNLG 211
Query: 225 -AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNG 283
AKV ++++ + F PF + + EG+ S+M +Y++++GVP + +L + +R EW G
Sbjct: 212 PAKVGERELREVFLFPFEVAIKEGQAGSLMNAYHEIDGVPCASSKFLLTKILRWEWGFKG 271
Query: 284 YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE---SAVQRGLLSE 340
Y+VSD +V + + +EAA A+ AG+D++ P + + E AV+ GL+SE
Sbjct: 272 YVVSDYIAVRMLENFHKVARDAKEAAVLALEAGIDIEL-PSVDCYGEPLIQAVKEGLISE 330
Query: 341 IDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
IN ++ L + LG+FD P D P+ ++L+ E AR+ IVLLKN
Sbjct: 331 EVINASVERVLRAKFMLGLFDDNLEKDPKKVYEVFD--KPEFRDLSREVARRSIVLLKND 388
Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYA------GIACGY-------------TTP 441
G +LPLS + VAVIGPN+D + G+Y+ IA G +
Sbjct: 389 G-TLPLSK-NLKKVAVIGPNADNPRNLHGDYSYTAHIPSIAEGLEGVKVEEKCVVRTVSI 446
Query: 442 LQGIGRYA----RTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLD-----QSIE 492
L+GI ++ +GC ++ + D F AI+ +++AD I VMG + + I
Sbjct: 447 LEGIRNKVSPETEVLYAKGCDIISDSKDG-FAEAIEMAKEADVIIAVMGEESGLFHRGIS 505
Query: 493 AEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYP 552
E DR L L G Q++L+ ++ K P +LVL++G P + + + + AI+ A YP
Sbjct: 506 GEGNDRTTLELFGVQRDLLKELHKLGK-PIVLVLINGRPQALKWEHEN--LNAILEAWYP 562
Query: 553 GQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKG 612
G+ GG A+AD++FG NP GKLP++ +P +T +PS Y + +
Sbjct: 563 GEEGGNAVADVIFGDYNPSGKLPIS-FPA--VTGQIPVYYNRKPSAFSDYIDESAK---- 615
Query: 613 PVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRL-TLG 671
+YPFGHG+SYT F ++ +K++ K N L +
Sbjct: 616 -PLYPFGHGLSYTTFEYS--------------------------DLKISPEKVNSLEKVE 648
Query: 672 VQVDVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKY 730
+ +KN G++DG + L A P K+L F+K+++ G +RV ++ +
Sbjct: 649 ISFTIKNTGNRDGEEVVQLYIHDQVASLERPVKELKGFKKIYLKPGESKRVTFTLY-PEQ 707
Query: 731 LSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
L+ D + G + IG + + L
Sbjct: 708 LAFYDEFMRFIVEKGVFEVMIGSSSEDIRL 737
>gi|423289663|ref|ZP_17268513.1| hypothetical protein HMPREF1069_03556 [Bacteroides ovatus
CL02T12C04]
gi|423298156|ref|ZP_17276215.1| hypothetical protein HMPREF1070_04880 [Bacteroides ovatus
CL03T12C18]
gi|392663697|gb|EIY57244.1| hypothetical protein HMPREF1070_04880 [Bacteroides ovatus
CL03T12C18]
gi|392667374|gb|EIY60884.1| hypothetical protein HMPREF1069_03556 [Bacteroides ovatus
CL02T12C04]
Length = 850
Score = 287 bits (735), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 158/397 (39%), Positives = 242/397 (60%), Gaps = 28/397 (7%)
Query: 48 PIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDF 107
P+ +RV DL+ RL+++EK+ LL + + +PRLGI Y +EALHGV V PG +F
Sbjct: 33 PVHERVADLLSRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGV--VRPG-RF---- 85
Query: 108 PGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG----------LTYWSPNV 157
T FPQ I A+++N L + + V+SDEARA +N G LT+WSP V
Sbjct: 86 ---TVFPQAIGLAATWNPVLQQKVATVISDEARARWNELDQGRNQKEQFSDVLTFWSPTV 142
Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
N+ RDPRWGR ET GEDP LSG ++V+GLQG D LK+ ++ KHF A + ++
Sbjct: 143 NMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQGEDPRYLKIVSTPKHFVANNEEH--- 199
Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
+RF N ++S++ + + + F MCV +GK AS+M +YN +N VP + +L++ +R
Sbjct: 200 -NRFICNPQISEKQLREYYFPAFEMCVKKGKAASIMTAYNALNDVPCTLNAWLLQKVLRQ 258
Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRG 336
+W GY+VSDC + + + T E AA +I+AGLDL+CG + +A ++
Sbjct: 259 DWGFRGYVVSDCGGPSLLVNAHKYVKTKETAATLSIKAGLDLECGDDVYDEYLLNAYKQY 318
Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
++S+ DI++A + L +M+LGMFD + PY + P + + DHQ++AL+AAR+ IVL
Sbjct: 319 MVSDADIDSAACHVLAARMKLGMFDSK-ERNPYARISPSVIGSKDHQQVALDAARECIVL 377
Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG 433
LKNQ LPL+ + +++AV+G N+ G+Y+G
Sbjct: 378 LKNQKNMLPLNVDKLKSIAVVGINA--GTCEFGDYSG 412
Score = 128 bits (321), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 88/297 (29%), Positives = 143/297 (48%), Gaps = 43/297 (14%)
Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
L+G A A + + + VMG+++SIE E DR + LP Q+E + ++ + P I+V+
Sbjct: 590 LYGEAGKAVSECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVN--PNIIVV 647
Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
+ G +A D I AI+ A YPG+ GGTA+AD+LFG NP G+LP+T+Y +
Sbjct: 648 LVAGS-SLAVNWMDEHIPAIVNAWYPGEQGGTAVADVLFGDYNPAGRLPLTYYKS--LDE 704
Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
LP + GRTY+++KG V+YPFG+G+SY++F ++
Sbjct: 705 LPPFD------DYDITKGRTYKYFKGDVLYPFGYGLSYSSFKYS---------------- 742
Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQL 705
+ + KVT V +KN G + G V+ P G P K+L
Sbjct: 743 ---DLKVKDSTDKVT----------VSFRLKNTGRRKGDEVAQVYVRIPETGGIVPIKEL 789
Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRI-PLGEHNIHIGGTKHSVSLH 761
F +V + G + + I + + L D + + I P G ++ +G + + L
Sbjct: 790 KGFRRVPLEPGESRAIDIELDK-EQLRYWDTTKEQFILPAGTFDVMVGASSKDIRLQ 845
>gi|317477144|ref|ZP_07936385.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
gi|316906687|gb|EFV28400.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
Length = 814
Score = 287 bits (735), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 216/685 (31%), Positives = 333/685 (48%), Gaps = 89/685 (12%)
Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
G T FP I AS++N L +GR ++ EA A G G + P +++ RDPRW R
Sbjct: 168 GTTVFPTSIGQASTWNPELIRRMGRAIATEASA--QGAHIG---YGPVLDLARDPRWSRV 222
Query: 169 QETPGEDPVLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
+ET GED L+G A+ V+G QG + KV A+ KHF AY W +A V
Sbjct: 223 EETYGEDAYLNGVMGAALVKGFQGEFPRTKGKVIATLKHFAAY---GWTEGGHNGGSAHV 279
Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
+++E+ PFR V G + SVM SYN+++G+P A+ N+L ++ W+ G++VS
Sbjct: 280 GNREMEEAIYPPFREAVAAGAL-SVMSSYNEIDGIPCTANSNLLTGLLKKRWQFKGFVVS 338
Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGP--FLGLHTESAVQRGLLSEIDINN 345
D ++G + T EAA A+ AG+D D G + G +AV+RG + E+ IN
Sbjct: 339 DLYAIGGLRE-HGVADTDYEAAVKAVNAGVDSDLGTNVYAG-QLVNAVKRGDVQEVVINK 396
Query: 346 ALVNTLTVQMRLGMFDGEPSSQPY-GHLGPKD-VCTPDHQELALEAARQGIVLLKNQGPS 403
A+ L ++ +G+FD P+ P+ V + +H ELA E ARQ I+LLKN+
Sbjct: 397 AVSRILALKFHMGLFD-----HPFVDEREPEQVVASTEHLELAREVARQSIILLKNKNEL 451
Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYAG--IACGYTTPLQGIGRYART----IHQQGC 457
LPL+ + +T+AVIGPN+D M+G+Y T L GI + I+ +GC
Sbjct: 452 LPLNK-KMKTIAVIGPNADNIYNMLGDYTAPQSESSVVTVLDGIRQKVSNDTHIIYAKGC 510
Query: 458 KDVACADDQLFGAAIDASRQADATILVMG-----------------------LDQSIEAE 494
V + F AI+A+RQ+D ++VMG + E
Sbjct: 511 A-VRDSSKSGFQEAIEAARQSDVVVMVMGGSSARDFSSKYEETGAAKVSDSHISDMESGE 569
Query: 495 ALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQ 554
DR+ L L GRQ+EL+ +V +K P +LVL+ G P+ + + AI+ A YPG
Sbjct: 570 GYDRSTLELLGRQRELIREVGKLNK-PIVLVLIKGRPL--LLEGIEAEVDAIVDAWYPGM 626
Query: 555 AGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPV 614
GG A+AD+LFG NP G+L ++ P+ + LP+ R +Y +G
Sbjct: 627 QGGNAVADVLFGDYNPAGRLTIS-VPRS-VGQLPVYYNTKRKGNRSKYIEE-----EGTP 679
Query: 615 VYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQV 674
YPFG+G+SYT+F ++ A V A C + + V
Sbjct: 680 RYPFGYGLSYTSFNYSDLKAEVVEA----------------------EDSC---LVNISV 714
Query: 675 DVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSV 733
V+N GS+DG + L A P KQL F+++H+ G + + + K L++
Sbjct: 715 KVRNEGSRDGDEVVQLYLRDEVASFTTPFKQLCGFQRIHLKVGETKEITFRLD-KKSLAL 773
Query: 734 VDRSGTRRIPLGEHNIHIGGTKHSV 758
++ + G + +GG+ +
Sbjct: 774 YMQNEEWAVEPGRFTLMLGGSSEQI 798
>gi|393786908|ref|ZP_10375040.1| hypothetical protein HMPREF1068_01320 [Bacteroides nordii
CL02T12C05]
gi|392658143|gb|EIY51773.1| hypothetical protein HMPREF1068_01320 [Bacteroides nordii
CL02T12C05]
Length = 854
Score = 287 bits (734), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 164/418 (39%), Positives = 246/418 (58%), Gaps = 29/418 (6%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
+ ++ P +R+ DL+ +L+++EK+ LL + + +PRL I Y +EALHGV V PG
Sbjct: 28 YLDMNAPRHERILDLLSKLTIEEKISLLRATSPGIPRLHIDKYYHGNEALHGV--VRPGN 85
Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG----------LT 151
T FPQ I A+ +N L I V+SDEARA +N G LT
Sbjct: 86 --------FTVFPQAIGLAAMWNPQLLNEISTVISDEARARWNELEQGKKQLGQFSDLLT 137
Query: 152 YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYD 211
+WSP VN+ RDPRWGR ET GEDP LSGK S+V+GLQG D LK+ ++ KHF A +
Sbjct: 138 FWSPTVNMARDPRWGRTPETYGEDPFLSGKLGVSFVKGLQGDDPRYLKIVSTPKHFAANN 197
Query: 212 LDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
++ +RF N +S++D+ + + F C++EGK AS+M +YN +N VP + +L
Sbjct: 198 EEH----NRFECNPIISEKDLREYYLPAFEKCIIEGKAASIMTAYNAINDVPCTLNNWLL 253
Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLH-TE 330
K+ +R +W +GY+VSDC + T E AAA +I+AGLDL+CG + +
Sbjct: 254 KKVLRHDWGFDGYVVSDCGGPSFLVTHHKYVKTLEAAAALSIQAGLDLECGDEVYMEPLL 313
Query: 331 SAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAA 390
+A ++ ++SE +I++A + L +MRLG+FD +P+ PY + P V H +LALEAA
Sbjct: 314 NAYKQYMVSEAEIDSAAYHVLRARMRLGLFD-DPALNPYNKISPSIVGCEKHSKLALEAA 372
Query: 391 RQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGY-TTPLQGIGR 447
RQ IVLLKN+ LPL + +++AV+G N+ + G+Y+G + L+GI +
Sbjct: 373 RQSIVLLKNEKKFLPLDSKKIKSIAVVGINA--GNSEFGDYSGTPVNQPVSILEGIKK 428
Score = 134 bits (338), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 90/299 (30%), Positives = 149/299 (49%), Gaps = 45/299 (15%)
Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
L+G A D R+ D T+ V+G+++SIE E DR + LP QQ + + + T++VL
Sbjct: 591 LYGEAGDIMRKCDLTVAVLGINKSIEREGQDRYSIELPKDQQIFIEEAYKINPN-TVVVL 649
Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
++G + + + D I AI+ A YPG+AGGTA+A++LFG NPGGKLP+T+Y +
Sbjct: 650 VAGSSLAINWM--DEHIPAIVNAWYPGEAGGTAVAEVLFGDYNPGGKLPLTYYRS--LDE 705
Query: 587 LP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGR 645
LP + +R GRTY+F++G +Y FGHG+SYT F + +
Sbjct: 706 LPAFDDYDIRK-------GRTYQFFEGDPLYAFGHGLSYTTFSY---------------K 743
Query: 646 HGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDG---AHTLLVFSTPPAGHWAPH 702
SI+A +G + V+ +KN G +G A + + + P
Sbjct: 744 KLSIDA--AGDVVSVSFT------------LKNTGKYEGDEVAQLYVKYQGSDSQVKLPL 789
Query: 703 KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
KQL FE++H+ G +++ + + + + G P G++ +G ++ L
Sbjct: 790 KQLKGFERIHLKKGESKQINLTVPKSELRFWNEEKGEFYTPAGDYLFMVGTASDAIQLQ 848
>gi|440733337|ref|ZP_20913088.1| beta-glucosidase [Xanthomonas translucens DAR61454]
gi|440362904|gb|ELQ00083.1| beta-glucosidase [Xanthomonas translucens DAR61454]
Length = 895
Score = 287 bits (734), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 161/399 (40%), Positives = 230/399 (57%), Gaps = 32/399 (8%)
Query: 59 RLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVIT 118
+++ +EKV ++ A A+PRLG+ YEWW+E LHG++ G AT FPQ I
Sbjct: 53 KMTREEKVAQAMNAAPAIPRLGVPAYEWWNEGLHGIARNG----------YATVFPQAIG 102
Query: 119 TASSFNATLWEAIGRVVSDEARAMYN--GGT-------AGLTYWSPNVNIFRDPRWGRGQ 169
A+++N L E +G V S EARA +N GG AGLT WSPN+NIFRDPRWGRG
Sbjct: 103 LAATWNTALLEQVGTVTSTEARAKFNLAGGPGKDHPRYAGLTIWSPNINIFRDPRWGRGM 162
Query: 170 ETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--RFHFNAKV 227
ET GEDP L+G+ A ++ GLQG D + A+ KH + +G + R F+ V
Sbjct: 163 ETYGEDPYLTGQLAVGFIHGLQGDDLTHPRTIATPKHLAVH-----SGPEPGRHGFDVDV 217
Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
S D+E T+ FR +++G+ SVMC+YN ++G P CA +L +RG+W G++VS
Sbjct: 218 SPHDLEATYTPAFRAAIVDGRAGSVMCAYNALHGTPACAADWLLNGRLRGDWGFTGFVVS 277
Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
DCD+V +F + ++A A++AG DL+CG + A+ RG E ++ +L
Sbjct: 278 DCDAVDDMTQFHYFRADNAGSSAAALKAGHDLNCG-YAYRDLGKAIARGDADEALLDQSL 336
Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
V + RLG + PY LG KDV + H+ LAL+AA+Q IVLL+N+ +LPL
Sbjct: 337 VRLFAARYRLGELQPQ-RKDPYAQLGAKDVDSAAHRALALQAAQQSIVLLQNRNATLPLR 395
Query: 408 -HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
+R +AVIGPN+D + NY G + TPL G+
Sbjct: 396 PGLR---LAVIGPNADALAALEANYQGTSAAPVTPLLGL 431
Score = 135 bits (341), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 96/278 (34%), Positives = 138/278 (49%), Gaps = 48/278 (17%)
Query: 486 GLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
GL +E E L DR L LP QQ L+ + + K P ++VLMSG + +
Sbjct: 632 GLSPDVEGEELRIDVPGFDGGDRNDLALPAAQQALLERAKASGK-PLVVVLMSGSAVALN 690
Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
+AK AI+ A YPGQ+GGTAIA +L G NPGG+LP+T+Y +LP
Sbjct: 691 WAKQ--HADAIVAAWYPGQSGGTAIAQVLAGDVNPGGRLPVTFY--RSTKDLPAY----- 741
Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG 655
S GRTYR++KG ++ FG G+SYT F + AP + A L
Sbjct: 742 --VSYDMKGRTYRYFKGEPLFAFGSGLSYTRFTYA---APQLSATTL------------- 783
Query: 656 KAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPA 715
A N L V+ V N G++ G + V+ PP G +P + LV F++V +
Sbjct: 784 ------QAGAN---LQVRTQVSNSGTRAGDEVVQVYLQPPQGAQSPLRTLVGFQRVTLQP 834
Query: 716 GAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
G + VG + + LS VDR+G R + G++ + +GG
Sbjct: 835 GEAREVGFEL-TPRQLSDVDRAGQRAVQPGDYRVFVGG 871
>gi|285018984|ref|YP_003376695.1| beta-glucosidase [Xanthomonas albilineans GPE PC73]
gi|283474202|emb|CBA16703.1| putative beta-glucosidase protein [Xanthomonas albilineans GPE
PC73]
Length = 904
Score = 287 bits (734), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 163/404 (40%), Positives = 230/404 (56%), Gaps = 28/404 (6%)
Query: 52 RVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGAT 111
R L+ +++ EK+ ++ A A+PRLGI YEWWSE LHG++ G AT
Sbjct: 55 RATALVAKMTRAEKIAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGE----------AT 104
Query: 112 SFPQVITTASSFNATLWEAIGRVVSDEARAMYN--GGTA-------GLTYWSPNVNIFRD 162
FPQ I A+S+N L A+G V S EARA +N GG GLT WSPN+NIFRD
Sbjct: 105 VFPQAIGLAASWNTDLLHAVGTVTSTEARAKFNLAGGPGKNHARYGGLTIWSPNINIFRD 164
Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFH 222
PRWGRG ET GEDP L+G+ A ++ GLQG D + A+ KH + + R
Sbjct: 165 PRWGRGMETYGEDPYLTGQLAVGFIHGLQGDDPTHPRTIATPKHLAVH---SGPESGRHG 221
Query: 223 FNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLN 282
F+ VS D E T+ FR ++EG SVMC+YN ++G+P CA ++ +RG W
Sbjct: 222 FDVDVSPHDFEATYSPAFRAAIVEGHAGSVMCAYNALHGIPACAADWLIDGRVRGNWGFK 281
Query: 283 GYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEID 342
G++VSDCD++ ++ + +AA A++AG DL+CG + +A+ RG E
Sbjct: 282 GFVVSDCDAIDDMTQFHYYRADNAGSAAAALKAGHDLNCG-YAYRDLGTALDRGEAEEAM 340
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGP 402
++ +LV + RLG S PY LG KD+ +P H+ LAL+AA+Q +VLL+N+
Sbjct: 341 LDRSLVRLFAARYRLGELQPR-SKDPYARLGAKDIDSPTHRALALQAAQQSLVLLQNRND 399
Query: 403 SLPLS-HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
+LPL +R +AVIGPN+D + NY G + TPLQG+
Sbjct: 400 TLPLRPGLR---LAVIGPNADALAALEANYQGTSVAPVTPLQGL 440
Score = 132 bits (332), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 95/278 (34%), Positives = 138/278 (49%), Gaps = 48/278 (17%)
Query: 486 GLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
GL +E E L DR L LP QQ L+ + + K P I+VLMSG + +
Sbjct: 641 GLSPDVEGEELRIDVPGFDGGDRNDLSLPAAQQALLERAKASGK-PLIVVLMSGSAVALN 699
Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
+AK AI+ A YPGQ+GGTAIA L G NPGG+LP+T+Y +LP
Sbjct: 700 WAKQ--HADAILAAWYPGQSGGTAIAQALAGDINPGGRLPVTFY--RSTKDLP------- 748
Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG 655
P S GRTYR++KG ++PFG+G+SYT+F +T AP + + L
Sbjct: 749 PYVSYDMKGRTYRYFKGEALFPFGYGLSYTHFAYT---APQLSSTTLQAGD--------- 796
Query: 656 KAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPA 715
TL V V+N G++ G + V+ P +P + LV F++V +
Sbjct: 797 -------------TLHVTTTVRNTGARAGDEVVQVYLQYPPRAQSPLRALVGFQRVSLQP 843
Query: 716 GAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
G + + + + LS VDRSG R + G++ + +GG
Sbjct: 844 GEARTLSFALE-PRQLSDVDRSGQRAVEAGDYRLFVGG 880
>gi|365122193|ref|ZP_09339098.1| hypothetical protein HMPREF1033_02444 [Tannerella sp.
6_1_58FAA_CT1]
gi|363642907|gb|EHL82241.1| hypothetical protein HMPREF1033_02444 [Tannerella sp.
6_1_58FAA_CT1]
Length = 853
Score = 287 bits (734), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 162/410 (39%), Positives = 240/410 (58%), Gaps = 29/410 (7%)
Query: 35 ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
A T+ L + ++ P +R+ DL+ RL+++EK+ LL + + +PRLGI Y +EALHGV
Sbjct: 24 AQTKEL-YKDMNAPQHERIMDLLSRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGV 82
Query: 95 SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG----- 149
V PG T FPQ I AS +N L I +SDEAR +N G
Sbjct: 83 --VRPGN--------FTVFPQAIGLASMWNPELLYEISTAISDEARGRWNELNRGKDQKG 132
Query: 150 -----LTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASC 204
LT+WSP VN+ RDPRWGR ET GEDP LSGK ++V+GLQG+D LK+ ++
Sbjct: 133 FFSDLLTFWSPTVNMARDPRWGRTPETYGEDPFLSGKLGVAFVKGLQGNDPRYLKIVSTP 192
Query: 205 KHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
KHF A + ++ +RF N +S++++ + + F C+ EGK S+M +YN +N VP
Sbjct: 193 KHFAANNEEH----NRFECNPHISERNLREYYLPAFESCIKEGKAQSIMSAYNAINDVPC 248
Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPF 324
+P +L + +R EW NGY+VSDC G + TPE AA +I+AGLDL+CG
Sbjct: 249 TLNPWLLTQVLRKEWGFNGYVVSDCGGPGFLVTHHKYVKTPEAAATLSIKAGLDLECGDN 308
Query: 325 LGLH-TESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQ 383
+ + +A ++ ++++ DI+ A L +M LG+FD +P PY + P V H+
Sbjct: 309 VYIEPLMNAYKQCMVTDADIDTAAYRILRARMMLGLFD-DPEKNPYNAISPSIVGCEKHR 367
Query: 384 ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG 433
+LALEAARQ +VLLKN+ LPL+ + +++AV+G N+ G+Y+G
Sbjct: 368 QLALEAARQSLVLLKNEKNFLPLNPKKVKSIAVVGINA--GNCEFGDYSG 415
Score = 133 bits (335), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 89/296 (30%), Positives = 144/296 (48%), Gaps = 41/296 (13%)
Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
L+G A A R+ D T+ V+G+++SIE E DR + LP QQ + + A+ T++VL
Sbjct: 593 LYGEAGRAIRECDVTVAVLGINKSIEREGQDRYTIELPADQQLFIKEAYKANPN-TVVVL 651
Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
++G + + + D I AI+ A YPG+ GGTA+A+ LFG NPGG+LP+T+Y +
Sbjct: 652 VAGSSLAINWI--DENIPAILNAWYPGEQGGTAVAEALFGDYNPGGRLPLTYY--RSLDE 707
Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
LP + GRTY +++ +YPFG+G+SYT F +
Sbjct: 708 LPAFD------DYDIQKGRTYMYFENKPLYPFGYGLSYTRF-----------------DY 744
Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQL 705
++ + +S A+ ++ VKN G G V+ P +G P KQL
Sbjct: 745 KNLKSEVSDDAV------------NLKFTVKNTGKYAGDEVAQVYVRFPESGIKVPLKQL 792
Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
FE+VH+ G +V ++I + ++ G P G + +G + + L
Sbjct: 793 KGFERVHIGKGKSAQVSVSIPKKELRLWDEKDGKFYTPSGNYIFMVGSSSDDIRLQ 848
>gi|427385138|ref|ZP_18881643.1| hypothetical protein HMPREF9447_02676 [Bacteroides oleiciplenus YIT
12058]
gi|425727306|gb|EKU90166.1| hypothetical protein HMPREF9447_02676 [Bacteroides oleiciplenus YIT
12058]
Length = 863
Score = 286 bits (733), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 171/432 (39%), Positives = 242/432 (56%), Gaps = 32/432 (7%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
P+ +L +R DL+GRL+L+EK L+ + + A+PRLGIK Y+WW+EALHGV G
Sbjct: 25 PYKNPALTPEERAADLVGRLTLEEKASLMQNTSPAIPRLGIKAYDWWNEALHGVGRAGL- 83
Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAM-----YNGGTA---GLTY 152
AT FPQ I +SFN L + VSDEARA GG GLT
Sbjct: 84 ---------ATVFPQAIGMGASFNNDLLYDVFTAVSDEARAKTAEFSKEGGLKRYQGLTM 134
Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL-KVAASCKHFTAYD 211
W+PNVNIFRDPRWGRGQET GEDP L+G+ + VRGLQG +G + K+ A KHF +
Sbjct: 135 WTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEGGKYDKLHACAKHFAVHS 194
Query: 212 LDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNI 270
WN R F+A+ V +D+ +T+ F+ V + V VMC+YN+ G P C +
Sbjct: 195 GPEWN---RHSFDAENVDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPCCGSNRL 251
Query: 271 LKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCGPFLGLH 328
L + +R EW +G IVSDC ++ +++ + P E A+A A+ G D++CG
Sbjct: 252 LVQILRDEWAYDGIIVSDCWAINDFFNKGAHETEPDKEHASAKAVLTGTDVECGESYA-S 310
Query: 329 TESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALE 388
AV+ GL+ E I+ +L + + LG D P + + V + +H+ELAL
Sbjct: 311 LPQAVKAGLIDEKKIDISLKRLMKARFELGEMDN-PELVSWAQIPYSVVDSKEHRELALR 369
Query: 389 AARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY 448
AR+ +VLL+N LPL+ + VAV+GPN++ +V GNY G T L+GI +Y
Sbjct: 370 MARESLVLLQNNQNVLPLN--KSLKVAVVGPNANDSVMQWGNYNGFPGHTVTLLEGIRQY 427
Query: 449 ---ARTIHQQGC 457
A+ I++ GC
Sbjct: 428 LPEAQLIYEPGC 439
Score = 117 bits (292), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 89/302 (29%), Positives = 136/302 (45%), Gaps = 49/302 (16%)
Query: 472 IDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGP 521
I + AD + G+ ++E E + DR + LP Q L++++ A K
Sbjct: 594 IQRVKDADIIVFAGGISPAVEGEEMRVTIPGFKGGDRETIELPSIQSRLLAELKKAGK-K 652
Query: 522 TILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQ 581
+ V SG I A AI+ A YPGQAGGTAIA++LFG NP G+LP+T+Y
Sbjct: 653 VVFVNFSGSAI--ALTPETKTCDAILQAWYPGQAGGTAIANVLFGDYNPAGRLPVTFYKS 710
Query: 582 EYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
+ LP + +M+ GRTYR+ ++PFGHG+SYT F
Sbjct: 711 --TSQLPDFEDYSMK--------GRTYRYMAEAPLFPFGHGLSYTTF------------- 747
Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA 700
R+G +A++S + +K + + V N G +DG + V+ P
Sbjct: 748 ----RYG--DASLSTQEVKEGEQAI------LTIPVSNTGERDGEEVVQVYLRRPGDKEG 795
Query: 701 PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
P L AF++V++ G V I++ + + T R G++ I GGT L
Sbjct: 796 PSHALRAFKRVNIAKGTTGNVTISLSKEDFEWFDTETNTMRPIEGDYEILYGGTSELKQL 855
Query: 761 HA 762
A
Sbjct: 856 KA 857
>gi|299149391|ref|ZP_07042448.1| beta-glucosidase [Bacteroides sp. 3_1_23]
gi|298512578|gb|EFI36470.1| beta-glucosidase [Bacteroides sp. 3_1_23]
Length = 853
Score = 286 bits (733), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 159/397 (40%), Positives = 243/397 (61%), Gaps = 28/397 (7%)
Query: 48 PIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDF 107
P+ +RV DL+ RL+++EK+ LL + + +PRLGI Y +EALHGV V PG +F
Sbjct: 36 PVHERVADLLSRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGV--VRPG-RF---- 88
Query: 108 PGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG----------LTYWSPNV 157
T FPQ I A+++N L + + V+SDEARA +N G LT+WSP V
Sbjct: 89 ---TVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQFSDVLTFWSPTV 145
Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
N+ RDPRWGR ET GEDP LSG ++V+GLQG D LK+ ++ KHF A + ++
Sbjct: 146 NMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQGDDPRYLKIVSTPKHFAANNEEH--- 202
Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
+RF N ++S++ + + + F MCV EGK AS+M +YN +N VP + +LK+ +R
Sbjct: 203 -NRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMTAYNALNDVPCTLNAWLLKKVLRQ 261
Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRG 336
+W GY+VSDC + + + T E AA +I+AGLDL+CG + +A ++
Sbjct: 262 DWGFQGYVVSDCGGPSLLVNAHKYVKTKEAAATLSIQAGLDLECGDDVYDEYLLNAYKQY 321
Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
++S+ DI++A + LT +M+LG+FDG PY + P + + +HQ++AL+AAR+ IVL
Sbjct: 322 MVSDADIDSAACHVLTARMKLGLFDGT-ERNPYTRISPSVIGSKEHQQIALDAARECIVL 380
Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG 433
LKN+ LPL+ + +++AV+G N+ G+Y+G
Sbjct: 381 LKNKNNMLPLNVNKVKSIAVVGINA--GKCEFGDYSG 415
Score = 132 bits (333), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 88/296 (29%), Positives = 142/296 (47%), Gaps = 41/296 (13%)
Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
L+G A A R+ + + VMG+++SIE E DR + LP Q+E + ++ + P I+V+
Sbjct: 593 LYGEAGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVN--PNIIVV 650
Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
+ G +A D I AI+ A YPG+ GGTA+AD+LFG NP G+LP+T+Y +
Sbjct: 651 LVAGS-SLAVNWMDEHIPAIVNAWYPGEQGGTAVADVLFGDYNPAGRLPLTYYKS--LDE 707
Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
LP + GRTY+++KG V+YPFG+G+SY++F ++ DG +
Sbjct: 708 LPAFD------DYDITKGRTYKYFKGDVLYPFGYGLSYSSFKYSDLKVK-------DGAN 754
Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQL 705
T+ V +KN G + G V+ P G P K+L
Sbjct: 755 ----------------------TISVSFRLKNTGKRKGDEVAQVYVRIPETGGVVPIKEL 792
Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
F ++ + +G + V I + + G +P G +I +G + + L
Sbjct: 793 KGFRRIPLKSGESRVVDIELDKEQLRYWDAGLGQFIVPQGAFDIMVGASSKDIRLQ 848
>gi|254786805|ref|YP_003074234.1| glycoside hydrolase family 3 domain-containing protein
[Teredinibacter turnerae T7901]
gi|237686035|gb|ACR13299.1| glycoside hydrolase family 3 domain protein [Teredinibacter
turnerae T7901]
Length = 888
Score = 286 bits (733), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 178/458 (38%), Positives = 247/458 (53%), Gaps = 39/458 (8%)
Query: 10 LGLILLSASSSGLAARE---PFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKV 66
LGL L S +G + P + + +L I RV+DL+ R+ L EK+
Sbjct: 12 LGLTLASLLFTGCSPDNNPVPKPVSERSTANEQPAYMDTTLDIDTRVDDLVSRMDLAEKI 71
Query: 67 KLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNAT 126
+ + + A+ LGI Y+WW+EALHGV+ G AT FPQ I A+ ++
Sbjct: 72 SQMYNESPAIEHLGIAEYDWWNEALHGVARAG----------KATVFPQAIGMAAMWDRE 121
Query: 127 LWEAIGRVVSDEARAMY-----NG---GTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVL 178
I VSDEARA + NG GLT+WSPN+NIFRDPRWGRGQET GEDP L
Sbjct: 122 TMFDIAEAVSDEARAKHHYFVENGVHFRYTGLTFWSPNINIFRDPRWGRGQETYGEDPYL 181
Query: 179 SGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDV 238
+G+ A Y+ GLQG + LK AA KHF + R N S +D+ +T+
Sbjct: 182 TGELALPYISGLQGENPKYLKTAAMAKHFAVHSGPE---KSRHSDNYIASPKDLNETYLP 238
Query: 239 PFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYY-- 296
F V+EG V SVMC+YN+VN P C + +LK T+RG+W G++VSDC ++ +Y
Sbjct: 239 AFEKAVVEGDVESVMCAYNRVNDEPACGNDMLLKETLRGKWGFKGHVVSDCGAIADFYAP 298
Query: 297 DTQHFTSTPEEAAADAIRAGLDLDCG-----PFLGLHTESAVQRGLLSEIDINNALVNTL 351
+ H P AAA A+R+G DL+CG F LH A+QR ++++ +I+ ++ +
Sbjct: 299 EAHHVVMAPAAAAAWAVRSGTDLNCGTDRLSTFANLHF--ALQREMITQDEIDQSVKRLM 356
Query: 352 TVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIR 410
+ +LGMFD P Q PY + V + H L +AA + VLLKN G LPL +
Sbjct: 357 KTRFKLGMFD--PDDQVPYSKIPMDVVGSQAHLALTQKAAEKSFVLLKNSG-ILPLK--K 411
Query: 411 HRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY 448
VA+IGPN+ ++GNY G TPL GI +Y
Sbjct: 412 SSKVAIIGPNATNPTVLVGNYFGDPIKPVTPLDGIQQY 449
Score = 115 bits (289), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 86/272 (31%), Positives = 136/272 (50%), Gaps = 43/272 (15%)
Query: 486 GLDQSIEAEALD---RAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPR 542
G + S+E E D R + LP Q++L++ + +K P +LV SG I + +A N+
Sbjct: 636 GEEMSVEIEGFDHGDRTDIRLPEPQRKLLATLKKLNK-PIVLVNFSGSAIALNWANNN-- 692
Query: 543 IAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKR 601
+ AI+ YPG+A GTA+A IL+G +P G+LP+T+Y + +LP + AM
Sbjct: 693 VDAILQGFYPGEATGTALARILWGEVSPSGRLPITFY--RSLDDLPGFKDYAMT------ 744
Query: 602 YPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVT 661
RTY++Y+G V+YPFG+G+SYT F ++ +AP +A SG+ + +T
Sbjct: 745 --NRTYKYYQGDVLYPFGYGLSYTQFAYSELSAPATMA--------------SGEPLAIT 788
Query: 662 HAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRV 721
N V S + + P G P ++L F+++++ GA Q V
Sbjct: 789 AQVSNS---------GKVASDEVVQVYVSMKVP--GLSLPQRELKEFKRIYLEPGASQTV 837
Query: 722 GINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
+I K LS VD G R G + +GG
Sbjct: 838 EFSI-AGKDLSYVDDQGVRHPYHGPLTLSVGG 868
>gi|393781363|ref|ZP_10369562.1| hypothetical protein HMPREF1071_00430 [Bacteroides salyersiae
CL02T12C01]
gi|392676856|gb|EIY70278.1| hypothetical protein HMPREF1071_00430 [Bacteroides salyersiae
CL02T12C01]
Length = 863
Score = 286 bits (732), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 178/440 (40%), Positives = 245/440 (55%), Gaps = 43/440 (9%)
Query: 40 LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGP 99
LPF LP+ +R DL+ RL+LQEKV L+ ++ +PRLGIK Y WW+EALHGV G
Sbjct: 22 LPFNNSDLPVEERAQDLLQRLTLQEKVLLMCDYSSPIPRLGIKRYNWWNEALHGVGRAGL 81
Query: 100 GTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT--------AGLT 151
AT FPQ I A++F+ VSDEARA Y+ GLT
Sbjct: 82 ----------ATVFPQAIGMAATFDDCAVRQAFECVSDEARAKYHHSENKEGSERYQGLT 131
Query: 152 YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAY 210
+W+PNVNIFRDPRWGRGQET GEDP L+ + + VRGLQG S+ K+ A KH+ +
Sbjct: 132 FWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGLAVVRGLQGPSESKYDKLHACAKHYALH 191
Query: 211 DLDNWNGVDRFHFNA-KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
WN R F+ +S +D+ +T+ F+ V +G V VMC+YN+ G P C
Sbjct: 192 SGPEWN---RHSFDVDSISPRDLWETYLPAFKALVQQGGVKEVMCAYNRFEGEPCCGSNR 248
Query: 270 ILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCG-PFLG 326
+L +R EW +G +VSDC ++ +Y H + P E A A A++AG DLDCG +
Sbjct: 249 LLYNILREEWGFDGLVVSDCGAISDFYLKGHHETHPTKEAAVAAAVKAGTDLDCGVDYYA 308
Query: 327 LHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP----SSQPYGHLGPKDVCTPDH 382
L + AV+ G+++E I+ +L L + LG+ D E S PY V + H
Sbjct: 309 L--QKAVEEGIITEKQIDVSLFRLLKARFELGLMDEEHLVSWSDIPY-----TVVDSEKH 361
Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRH-RTVAVIGPNSDVTVTMIGNYAGIACGYTTP 441
+E ALE AR+ + LLKN +LPLS +H +AVIGPN++ +V M GNY G T
Sbjct: 362 REKALEMARKSMTLLKNDHGTLPLS--KHCGKIAVIGPNANDSVMMWGNYNGFPSHTVTI 419
Query: 442 LQGIGRY---ARTIHQQGCK 458
L+GI + I+ +GC+
Sbjct: 420 LEGITHKLGAEQIIYDKGCE 439
Score = 121 bits (303), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 94/307 (30%), Positives = 146/307 (47%), Gaps = 51/307 (16%)
Query: 474 ASRQADATILVM--GLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGP 521
A+R DA ++V G+ +E E L DR + LP Q++L+ ++ K P
Sbjct: 594 AARVGDAEVIVFVGGISPKVEGEELPVSFPGFKGGDRTVIELPQVQRDLLQELHKTGK-P 652
Query: 522 TILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQ 581
IL+L SG I ++ A+ D AII A Y GQAGGTA+AD+LFG NP G+LP+T+Y
Sbjct: 653 VILILCSGSAIGLS-AEVD-LADAIIQAWYLGQAGGTAVADVLFGDYNPAGRLPVTFYKA 710
Query: 582 EYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
LP + +M+ GRTYR+++G ++PFG+G+SYT+F
Sbjct: 711 --TEQLPDFEDYSMQ--------GRTYRYFEGEALFPFGYGLSYTSFEIG---------- 750
Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA 700
A +S K I+ + +LT V+N G DG + ++
Sbjct: 751 ---------KARLSKKRIRENESVSLKLT------VENTGKLDGDEVIQIYIRKLQDKEG 795
Query: 701 PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
P K L AF++ H+ AG ++ V ++ + S T R+ GE+ I G + L
Sbjct: 796 PLKTLRAFKRFHLRAGEKKDVTFHLQNDHFNFFDTESNTMRVMPGEYEILYGASSLEKDL 855
Query: 761 HAATLGV 767
+ +
Sbjct: 856 RRINIKI 862
>gi|383113364|ref|ZP_09934136.1| hypothetical protein BSGG_3068 [Bacteroides sp. D2]
gi|382948729|gb|EFS32368.2| hypothetical protein BSGG_3068 [Bacteroides sp. D2]
Length = 850
Score = 286 bits (732), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 159/397 (40%), Positives = 243/397 (61%), Gaps = 28/397 (7%)
Query: 48 PIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDF 107
P+ +RV DL+ RL+++EK+ LL + + +PRLGI Y +EALHGV V PG +F
Sbjct: 33 PVHERVADLLSRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGV--VRPG-RF---- 85
Query: 108 PGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG----------LTYWSPNV 157
T FPQ I A+++N L + + V+SDEARA +N G LT+WSP V
Sbjct: 86 ---TVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQFSDVLTFWSPTV 142
Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
N+ RDPRWGR ET GEDP LSG ++V+GLQG D LK+ ++ KHF A + ++
Sbjct: 143 NMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQGDDPRYLKIVSTPKHFAANNEEH--- 199
Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
+RF N ++S++ + + + F MCV EGK AS+M +YN +N VP + +LK+ +R
Sbjct: 200 -NRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMTAYNALNDVPCTLNAWLLKKVLRQ 258
Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRG 336
+W GY+VSDC + + + T E AA +I+AGLDL+CG + +A ++
Sbjct: 259 DWGFQGYVVSDCGGPSLLVNAHKYVKTKEAAATLSIQAGLDLECGDDVYDEYLLNAYKQY 318
Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
++S+ DI++A + LT +M+LG+FDG PY + P + + +HQ++AL+AAR+ IVL
Sbjct: 319 MVSDADIDSAACHVLTARMKLGLFDG-TERNPYTRISPSVIGSKEHQQIALDAARECIVL 377
Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG 433
LKN+ LPL+ + +++AV+G N+ G+Y+G
Sbjct: 378 LKNKNNMLPLNVNKVKSIAVVGINA--GKCEFGDYSG 412
Score = 132 bits (332), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 89/296 (30%), Positives = 142/296 (47%), Gaps = 41/296 (13%)
Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
L+G A A R+ + + VMG+++SIE E DR + LP Q+E + ++ + P I+V+
Sbjct: 590 LYGEAGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVN--PNIIVV 647
Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
+ G +A D I AI+ A YPG+ GGTA+AD+LFG NP G+LP+T+Y +
Sbjct: 648 LVAGS-SLAVNWMDEHIPAIVNAWYPGEQGGTAVADVLFGDYNPAGRLPLTYYKS--LDE 704
Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
LP + GRTY+++KG V+YPFG+G+SY++F ++ DG +
Sbjct: 705 LPAFD------DYDITQGRTYKYFKGDVLYPFGYGLSYSSFKYSDLKVK-------DGAN 751
Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQL 705
T+ V +KN G + G V+ P G P K+L
Sbjct: 752 ----------------------TVSVSFRLKNTGKRKGDEVAQVYVRIPETGGVVPIKEL 789
Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
F ++ + +G + V I + + G +P G +I IG + + L
Sbjct: 790 KGFRRIPLKSGESRVVEIELDKEQLRYWDAGLGQFIVPQGAFDIMIGASSKDIRLQ 845
>gi|336417083|ref|ZP_08597412.1| hypothetical protein HMPREF1017_04520 [Bacteroides ovatus
3_8_47FAA]
gi|335936708|gb|EGM98626.1| hypothetical protein HMPREF1017_04520 [Bacteroides ovatus
3_8_47FAA]
Length = 850
Score = 286 bits (732), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 159/397 (40%), Positives = 243/397 (61%), Gaps = 28/397 (7%)
Query: 48 PIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDF 107
P+ +RV DL+ RL+++EK+ LL + + +PRLGI Y +EALHGV V PG +F
Sbjct: 33 PVHERVADLLSRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGV--VRPG-RF---- 85
Query: 108 PGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG----------LTYWSPNV 157
T FPQ I A+++N L + + V+SDEARA +N G LT+WSP V
Sbjct: 86 ---TVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQFSDVLTFWSPTV 142
Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
N+ RDPRWGR ET GEDP LSG ++V+GLQG D LK+ ++ KHF A + ++
Sbjct: 143 NMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQGDDPRYLKIVSTPKHFAANNEEH--- 199
Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
+RF N ++S++ + + + F MCV EGK AS+M +YN +N VP + +LK+ +R
Sbjct: 200 -NRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMTAYNALNDVPCTLNAWLLKKVLRQ 258
Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRG 336
+W GY+VSDC + + + T E AA +I+AGLDL+CG + +A ++
Sbjct: 259 DWGFQGYVVSDCGGPSLLVNAHKYVKTKEAAATLSIQAGLDLECGDDVYDEYLLNAYKQY 318
Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
++S+ DI++A + LT +M+LG+FDG PY + P + + +HQ++AL+AAR+ IVL
Sbjct: 319 MVSDADIDSAACHVLTARMKLGLFDG-TERNPYTRISPSVIGSKEHQQIALDAARECIVL 377
Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG 433
LKN+ LPL+ + +++AV+G N+ G+Y+G
Sbjct: 378 LKNKNNMLPLNVNKVKSIAVVGINA--GKCEFGDYSG 412
Score = 132 bits (331), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 88/296 (29%), Positives = 142/296 (47%), Gaps = 41/296 (13%)
Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
L+G A A R+ + + VMG+++SIE E DR + LP Q+E + ++ + P I+V+
Sbjct: 590 LYGEAGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVN--PNIIVV 647
Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
+ G +A D I AI+ A YPG+ GGTA+AD+LFG NP G+LP+T+Y +
Sbjct: 648 LVAGS-SLAVNWMDEHIPAIVNAWYPGEQGGTAVADVLFGDYNPAGRLPLTYYKS--LDE 704
Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
LP + GRTY+++KG V+YPFG+G+SY++F ++ DG +
Sbjct: 705 LPAFD------DYDITKGRTYKYFKGDVLYPFGYGLSYSSFKYSDLKVK-------DGAN 751
Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQL 705
T+ V +KN G + G V+ P G P K+L
Sbjct: 752 ----------------------TVSVSFRLKNTGKRKGDEVAQVYVRIPETGGVVPIKEL 789
Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
F ++ + +G + V I + + G +P G +I +G + + L
Sbjct: 790 KGFRRIPLKSGESRVVEIELDKEQLRYWDAGLGRFIVPQGAFDIMVGASSKDIRLQ 845
>gi|189464211|ref|ZP_03012996.1| hypothetical protein BACINT_00548 [Bacteroides intestinalis DSM
17393]
gi|189438001|gb|EDV06986.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 814
Score = 286 bits (732), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 216/685 (31%), Positives = 333/685 (48%), Gaps = 89/685 (12%)
Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
G T FP I AS++N L +GR ++ EA A G G + P +++ RDPRW R
Sbjct: 168 GTTVFPTSIGQASTWNPELIRRMGRAIATEASA--QGAHIG---YGPVLDLARDPRWSRV 222
Query: 169 QETPGEDPVLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
+ET GED L+G A+ V+G QG + KV A+ KHF AY W +A V
Sbjct: 223 EETYGEDAYLNGVMGAALVKGFQGEFPRTKGKVIATLKHFAAY---GWTEGGHNGGSAHV 279
Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
+++E+ PFR V G + SVM SYN+++G+P A+ N+L ++ W+ G++VS
Sbjct: 280 GNREMEEAIYPPFREAVAAGAL-SVMSSYNEIDGIPCTANSNLLTGLLKERWQFKGFVVS 338
Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGP--FLGLHTESAVQRGLLSEIDINN 345
D ++G + T EAA A+ AG+D D G + G +AV+RG + E+ IN
Sbjct: 339 DLYAIGGLRE-HGVADTDYEAAVKAVNAGVDSDLGTNVYAG-QLVNAVKRGDVQEVVINK 396
Query: 346 ALVNTLTVQMRLGMFDGEPSSQPY-GHLGPKD-VCTPDHQELALEAARQGIVLLKNQGPS 403
A+ L ++ +G+FD P+ P+ V + +H ELA E ARQ I+LLKN+
Sbjct: 397 AVSRILALKFHMGLFD-----HPFVDEREPEQVVASTEHLELAREVARQSIILLKNKNEL 451
Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYAG--IACGYTTPLQGIGRYART----IHQQGC 457
LPL+ + +T+AVIGPN+D M+G+Y T L GI + I+ +GC
Sbjct: 452 LPLNK-KTKTIAVIGPNADNIYNMLGDYTAPQSESSVVTVLDGIRQKVSNDTHIIYAKGC 510
Query: 458 KDVACADDQLFGAAIDASRQADATILVMG-----------------------LDQSIEAE 494
V + F AI+A+RQ+D ++VMG + E
Sbjct: 511 A-VRDSSKSGFQEAIEAARQSDVVVMVMGGSSARDFSSKYEETGAAKVSDSHISDMESGE 569
Query: 495 ALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQ 554
DR+ L L GRQ+EL+ +V +K P +LVL+ G P+ + + AI+ A YPG
Sbjct: 570 GYDRSTLELLGRQRELIREVGKLNK-PIVLVLIKGRPL--LLEGIEAEVDAIVDAWYPGM 626
Query: 555 AGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPV 614
GG A+AD+LFG NP G+L ++ P+ + LP+ R +Y +G
Sbjct: 627 QGGNAVADVLFGDYNPAGRLTIS-VPRS-VGQLPVYYNTKRKGNRSKYIEE-----EGTP 679
Query: 615 VYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQV 674
YPFG+G+SYT+F ++ A V A C + + V
Sbjct: 680 RYPFGYGLSYTSFNYSDLKAEVVEA----------------------EDSC---LVNISV 714
Query: 675 DVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSV 733
V+N GS+DG + L A P KQL F+++H+ G + + + K L++
Sbjct: 715 KVRNEGSRDGDEVVQLYLRDEVASFTTPFKQLCGFQRIHLKVGETKEITFRLD-KKSLAL 773
Query: 734 VDRSGTRRIPLGEHNIHIGGTKHSV 758
++ + G + +GG+ +
Sbjct: 774 YMQNEEWAVEPGRFTLMLGGSSEQI 798
>gi|424792251|ref|ZP_18218496.1| exported beta-glucosidase [Xanthomonas translucens pv. graminis
ART-Xtg29]
gi|422797157|gb|EKU25539.1| exported beta-glucosidase [Xanthomonas translucens pv. graminis
ART-Xtg29]
Length = 909
Score = 286 bits (732), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 161/399 (40%), Positives = 230/399 (57%), Gaps = 32/399 (8%)
Query: 59 RLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVIT 118
+++ +EKV ++ A A+PRLG+ YEWW+E LHG++ G AT FPQ I
Sbjct: 67 KMTREEKVAQAMNAAPAIPRLGVPAYEWWNEGLHGIARNG----------YATVFPQAIG 116
Query: 119 TASSFNATLWEAIGRVVSDEARAMYN--GGT-------AGLTYWSPNVNIFRDPRWGRGQ 169
A+++N L E +G V S EARA +N GG AGLT WSPN+NIFRDPRWGRG
Sbjct: 117 LAATWNTALLEQVGTVTSTEARAKFNLAGGPGKDHPRYAGLTIWSPNINIFRDPRWGRGM 176
Query: 170 ETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--RFHFNAKV 227
ET GEDP L+G+ A ++ GLQG D + A+ KH + +G + R F+ V
Sbjct: 177 ETYGEDPYLTGQLAVGFIHGLQGDDLTHPRTIATPKHLAVH-----SGPEPGRHGFDVDV 231
Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
S D+E T+ FR +++G+ SVMC+YN ++G P CA +L +RG+W G++VS
Sbjct: 232 SPHDLEATYTPAFRAAIVDGRAGSVMCAYNALHGTPACAADWLLNGRLRGDWGFTGFVVS 291
Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
DCD+V +F + ++A A++AG DL+CG + A+ RG E ++ +L
Sbjct: 292 DCDAVDDMTQFHYFRADNAGSSAAALKAGHDLNCG-YAYRDLGKAIARGDADEALLDKSL 350
Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
V + RLG + PY LG KDV + H+ LAL+AA+Q IVLL+N+ +LPL
Sbjct: 351 VRLFAARYRLGELQPQ-RKDPYARLGAKDVDSAAHRALALQAAQQSIVLLQNRNATLPLR 409
Query: 408 -HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
+R +AVIGPN+D + NY G + TPL G+
Sbjct: 410 PGLR---LAVIGPNADALAALEANYQGTSAAPVTPLLGL 445
Score = 125 bits (315), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 93/279 (33%), Positives = 136/279 (48%), Gaps = 50/279 (17%)
Query: 486 GLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
GL +E E L DR L LP QQ L+ + + K P ++VLMSG + +
Sbjct: 646 GLSPDVEGEELRIDVPGFDGGDRNDLALPAAQQALLERAKASGK-PLVVVLMSGSAVALN 704
Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
+AK AI+ A YPGQ+GGTAIA +L G NPGG+LP+T+Y +LP
Sbjct: 705 WAKQ--HADAIVAAWYPGQSGGTAIAQVLAGDVNPGGRLPVTFY--RSTKDLPAY----- 755
Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLD-GRHGSINATIS 654
S GRTYR++KG ++ FG G+SYT F + AP + A L G H
Sbjct: 756 --VSYDMKGRTYRYFKGEPLFAFGSGLSYTRFTYA---APQLSATTLQAGAH-------- 802
Query: 655 GKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVP 714
L V+ V+N G++ G + V+ P +P + LV F++V +
Sbjct: 803 ---------------LQVRTQVRNSGTRAGDEVVQVYLEFPQRAQSPLRTLVGFQRVTLQ 847
Query: 715 AGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
G + V + + LS VDR+G R + G++ + +GG
Sbjct: 848 PGEARDVSFEL-APRQLSDVDRAGQRAVQPGDYRVFVGG 885
>gi|393789624|ref|ZP_10377744.1| hypothetical protein HMPREF1068_04024 [Bacteroides nordii
CL02T12C05]
gi|392650340|gb|EIY44009.1| hypothetical protein HMPREF1068_04024 [Bacteroides nordii
CL02T12C05]
Length = 855
Score = 286 bits (732), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 170/436 (38%), Positives = 247/436 (56%), Gaps = 43/436 (9%)
Query: 35 ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
A +T F ++ P +R+ DL+ RL+++EKV LL++ A +PRL I Y +EALHGV
Sbjct: 22 AQNKTELFRDMTAPQHERILDLLNRLTVEEKVSLLVNDAREIPRLNIDKYNHGNEALHGV 81
Query: 95 SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN---------- 144
V PG T FPQ I A+++N L + +SDEAR +
Sbjct: 82 --VRPGE--------FTVFPQAIGLAATWNPNLIFRVSTAISDEARGRWKELDYGKKQIA 131
Query: 145 GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASC 204
GG+ LT+WSP VN+ RDPRWGR ET GEDP LSG+ +V+GLQG + LK ++
Sbjct: 132 GGSDLLTFWSPTVNMARDPRWGRTPETYGEDPFLSGRIGCEFVKGLQGDNPRYLKTVSTP 191
Query: 205 KHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
KHF A + ++ +R NA++S++D+ + + F C+++GK S+M +YN VN VP
Sbjct: 192 KHFAANNEEH----NRSSCNARMSERDLREYYLPAFERCIVDGKAQSIMMAYNAVNDVPC 247
Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-- 322
+ ++K+ +RG+W NGYIVSDC + + E AA A++AGLDL+CG
Sbjct: 248 TVNIYLIKKVLRGDWNFNGYIVSDCSAPEWMVTKHKYVKNLEAAATLALKAGLDLECGDR 307
Query: 323 ----PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVC 378
P L + E V SE +I++A + L +M LG+FD +PS PY + P +
Sbjct: 308 VYTAPLLKAYNEYMV-----SEAEIDSAAYHILRGRMLLGLFD-DPSQNPYNKIEPSVIG 361
Query: 379 TPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGY 438
+HQELALE ARQ +VLLKNQ LPL+ + R++AV+G ++ G+Y+G
Sbjct: 362 CKEHQELALETARQSMVLLKNQKNFLPLNRKKIRSIAVVGISA--AHCEFGDYSGNP--K 417
Query: 439 TTP---LQGIGRYART 451
TP L GI +YA
Sbjct: 418 NTPVSVLDGIKKYAEN 433
Score = 141 bits (355), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 88/296 (29%), Positives = 149/296 (50%), Gaps = 42/296 (14%)
Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
LFG A +++ D T+ V+G+++SIE E DR L LP QQE + ++ + T++VL
Sbjct: 592 LFGDAGKVAKECDVTVAVLGINKSIEREGQDRYSLELPIDQQEFIKELYKVNPN-TVVVL 650
Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
++G + + + D + AI+ A YPG+ GG A+A++LFG NPGG+LP+T+Y +
Sbjct: 651 VAGSSMAINWM--DENVPAILNAWYPGEQGGNAVAEVLFGDYNPGGRLPLTYYNS--LDE 706
Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
LP + RTY++++G +Y FG+G+SYTNF +
Sbjct: 707 LPAFD-------DYSVKNRTYQYFEGKPLYEFGYGLSYTNFKY----------------- 742
Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQL 705
K+I ++ T+ + ++ NVG DG V+ P G + P KQL
Sbjct: 743 -------KKKSIMQSND-----TVDITFNLSNVGKYDGDEVAQVYVRYPETGTYMPLKQL 790
Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
F +VH+ G + I+I + +++ P GE+ +GG+ ++S+
Sbjct: 791 KGFSRVHLKKGKSADITISIPKKELRYWDEKTRQFVTPTGEYVFQVGGSSENISIE 846
>gi|298387490|ref|ZP_06997042.1| beta-glucosidase [Bacteroides sp. 1_1_14]
gi|298259697|gb|EFI02569.1| beta-glucosidase [Bacteroides sp. 1_1_14]
Length = 853
Score = 286 bits (732), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 164/403 (40%), Positives = 244/403 (60%), Gaps = 40/403 (9%)
Query: 48 PIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDF 107
P+ +RV DLI RL+++EK+ LL + + +PRLGI Y +EALHGV V PG +F
Sbjct: 36 PVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGV--VRPG-RF---- 88
Query: 108 PGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG----------LTYWSPNV 157
T FPQ I A+++N L + + V+SDEARA +N G LT+WSP V
Sbjct: 89 ---TVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQFSDVLTFWSPTV 145
Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
N+ RDPRWGR ET GEDP LSG ++V GLQG D LK+ ++ KHF A + ++
Sbjct: 146 NMARDPRWGRTPETYGEDPFLSGIMGTAFVNGLQGDDPHYLKIVSTPKHFAANNEEH--- 202
Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
+RF N ++S++ + + + F MCV EGK AS+M +YN +N VP + +L++ +R
Sbjct: 203 -NRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMSAYNALNDVPCTLNSWLLQKVLRQ 261
Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC------GPFLGLHTES 331
+W GY+VSDC + + + T E AA +I+AGLDL+C GP L +
Sbjct: 262 DWGFQGYVVSDCGGPALLVNAHKYVKTKEAAATLSIKAGLDLECGDDVYDGPLL-----N 316
Query: 332 AVQRGLLSEIDINNALVNTLTVQMRLGMFD-GEPSSQPYGHLGPKDVCTPDHQELALEAA 390
A ++ ++S+ DI++A + LT +M+LG+FD GE PY + P + + +HQ++AL+AA
Sbjct: 317 AYKQYMVSDADIDSAACHVLTARMKLGLFDSGE--RNPYTKISPSVIGSKEHQQIALDAA 374
Query: 391 RQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG 433
RQ IVLLKNQ LPL+ + +++AV+G N+ G+Y+G
Sbjct: 375 RQCIVLLKNQKNRLPLNADKLKSIAVVGINA--GKCEFGDYSG 415
Score = 137 bits (345), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 89/298 (29%), Positives = 147/298 (49%), Gaps = 45/298 (15%)
Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
L+G A A R+ + + VMG+++SIE E DR + LP Q+E + ++ + P I+V+
Sbjct: 593 LYGEAGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVN--PNIIVV 650
Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
+ G +A D + AI+ A YPG+ GGTA+A++LFG NP G+LP+T+Y +
Sbjct: 651 LVAGS-SLAVNWMDEHVPAIVNAWYPGEQGGTAVAEVLFGDYNPAGRLPLTYYKS--LDE 707
Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
LP + GRTY+++KG V+YPFG+G+SY++F ++ L +
Sbjct: 708 LPPFD------DYDITKGRTYKYFKGDVLYPFGYGLSYSSFTYS----------DLQVKD 751
Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQL 705
G T+S + +KN G ++G V+ P G P K+L
Sbjct: 752 GGDEVTVSFR-------------------LKNTGKRNGDEVAQVYVRIPETGGIVPLKEL 792
Query: 706 VAFEKVHVPAGAQQRVGINI--HVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
F +V + +G +RV I + +Y V G +P G ++ +G + + L
Sbjct: 793 KGFRRVPLKSGESRRVEIKLDKEQLRYWDV--EKGQFVVPKGAFDVMVGASSKDIRLQ 848
>gi|365121873|ref|ZP_09338785.1| hypothetical protein HMPREF1033_02131 [Tannerella sp.
6_1_58FAA_CT1]
gi|363644185|gb|EHL83481.1| hypothetical protein HMPREF1033_02131 [Tannerella sp.
6_1_58FAA_CT1]
Length = 850
Score = 286 bits (731), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 157/403 (38%), Positives = 239/403 (59%), Gaps = 28/403 (6%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
+ + P +R+ DL+ RL+++EK+ LL + + +PRL I+ Y +EALHG+ V PG
Sbjct: 27 YKDMDAPQHERIMDLLSRLTIEEKISLLRATSPGIPRLEIEKYYHGNEALHGI--VRPGN 84
Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG----------LT 151
T FPQ I AS +N I V+SDEARA +N G LT
Sbjct: 85 --------FTVFPQAIGLASMWNPDFLYEISTVISDEARARWNELNRGKDQKRLFSDLLT 136
Query: 152 YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYD 211
+WSP VN+ RDPRWGR ET GEDP LSGK ++V+GLQG+D LKV ++ KHF A +
Sbjct: 137 FWSPTVNMARDPRWGRTPETYGEDPFLSGKLGVAFVKGLQGNDPRYLKVVSTPKHFAANN 196
Query: 212 LDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
++ +RF N ++S++D+ + + F C+++GK S+M +YN +N VP + +L
Sbjct: 197 EEH----NRFECNPQISERDLREYYLPAFERCIIDGKAQSIMTAYNAINDVPCTLNTWLL 252
Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLH-TE 330
K+ +R +W NGY+VSDC + + + TPE AA A++AGLDL+CG + +
Sbjct: 253 KKVLRTDWGFNGYVVSDCGAPSLLVTHHKYVKTPEAAATLALKAGLDLECGDNVYIEPLM 312
Query: 331 SAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAA 390
+A ++ ++SE +I+ A L +M LG+FD +P+ PY L P V H+ +ALEAA
Sbjct: 313 NAYKQYMVSEAEIDTAAYRILRARMMLGLFD-DPAKNPYNALSPSIVGCEKHKNMALEAA 371
Query: 391 RQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG 433
RQ +VLLKN+ LP++ + +++AV+G N+ G+Y+G
Sbjct: 372 RQSLVLLKNENNFLPINPKKIKSIAVVGINA--GNCEFGDYSG 412
Score = 122 bits (305), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 90/297 (30%), Positives = 139/297 (46%), Gaps = 43/297 (14%)
Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
L+G A A ++ D TI VMG+++SIE E DR + LP + QEL + + +VL
Sbjct: 590 LYGDAKKAIQECDMTIAVMGINKSIEREGRDRDHIELP-KDQELFIEEAYKLNPKMAVVL 648
Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
++G + V + D + AI+ A YPG+ GGTA+A+ LFG NP G+LP+T+Y + +
Sbjct: 649 VAGSSLAVNWM--DEHVPAILNAWYPGEQGGTAVAEALFGDYNPAGRLPLTYYRS--LDD 704
Query: 587 L-PMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGR 645
L P + A++ + RTY ++ G +Y FG+G+SYT F D R
Sbjct: 705 LPPFDDYAVQKN-------RTYMYFTGKPLYAFGYGLSYTKF---------------DYR 742
Query: 646 HGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQ 704
S++ + T +KN G +G V+ P G P KQ
Sbjct: 743 KLSVDQDAENVRLSFT--------------IKNSGKYNGDEVAQVYVQFPEIGVKVPIKQ 788
Query: 705 LVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
L FE+VH+ G V I + + +R G P G + +G + + L
Sbjct: 789 LKGFERVHIAKGKTLPVTITVPKKELRIWNERKGEFFTPSGNYVFMVGASSDDIRLQ 845
>gi|333380551|ref|ZP_08472242.1| hypothetical protein HMPREF9455_00408 [Dysgonomonas gadei ATCC
BAA-286]
gi|332826546|gb|EGJ99375.1| hypothetical protein HMPREF9455_00408 [Dysgonomonas gadei ATCC
BAA-286]
Length = 854
Score = 285 bits (730), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 168/415 (40%), Positives = 239/415 (57%), Gaps = 39/415 (9%)
Query: 48 PIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDF 107
P R+ DL+ RL+++EK+ LL + + +PRL I Y +E+LHGV V PG +F
Sbjct: 36 PTHDRIMDLLSRLTIEEKISLLRATSPGIPRLQIPKYYHGNESLHGV--VRPG-RF---- 88
Query: 108 PGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG----------LTYWSPNV 157
T FPQ I AS +N L I +SDEAR +N G LT+WSP V
Sbjct: 89 ---TVFPQAIGLASMWNPELHHKIATAISDEARGRWNELEQGKLQTQRFTDLLTFWSPTV 145
Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
N+ RDPRWGR ET GEDP LSG ++VRGLQG D LK+ ++ KHF A + ++
Sbjct: 146 NMARDPRWGRTPETYGEDPYLSGILGTAFVRGLQGDDPRYLKIVSTPKHFAANNEEH--- 202
Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
+RF N ++S++ + + + F MCV +GK AS+M +YN +N VP A+P +L + +R
Sbjct: 203 -NRFVCNPQISERQLREYYFPAFEMCVKDGKSASIMSAYNAINDVPCTANPWLLTKVLRH 261
Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG------PFLGLHTES 331
+W NGY+VSDC + + T E AA +I+AGLDL+CG P L + +
Sbjct: 262 DWGFNGYVVSDCGGPSLLVSAMKYVKTKEAAATLSIKAGLDLECGDDVYMQPLLNAYNQY 321
Query: 332 AVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAAR 391
V R DI+ A L +M LG+FD +P PY + P V + +H++LALEAAR
Sbjct: 322 MVSRA-----DIDTAAYRVLRARMHLGLFD-DPDLNPYNKISPSVVGSAEHKQLALEAAR 375
Query: 392 QGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGI-ACGYTTPLQGI 445
Q IVLLKN +LPL+ + +++AV+G N+ + G+Y+GI A + LQGI
Sbjct: 376 QSIVLLKNNNRTLPLNPKKVKSIAVVGINA--GNSEFGDYSGIPANAPVSILQGI 428
Score = 139 bits (349), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 91/296 (30%), Positives = 147/296 (49%), Gaps = 43/296 (14%)
Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
++G A A R+ + I V+G++++IE E DR + LP Q+E + ++ + P I+V+
Sbjct: 593 MYGEAGKAVRECEQVIAVLGINKTIEREGQDRYDIHLPADQEEFIREIYKVN--PNIVVV 650
Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQ-EYIT 585
+ G +A D + AI+ A YPG+ GGTA+A++LFG NPGG+LP+T+Y E I
Sbjct: 651 LVAGS-SLAINWMDEHVPAIVNAWYPGEQGGTAVAEVLFGEYNPGGRLPVTYYNSLEEIP 709
Query: 586 NLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGR 645
+ ++ GRTY+++KG +YPFG+G+SYT F +
Sbjct: 710 SFDDYDITK---------GRTYQYFKGKPLYPFGYGLSYTTFAYK--------------- 745
Query: 646 HGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQ 704
++ +G IKV+ ++KN G DG V+ P +G + P K+
Sbjct: 746 --NLQINDNGNNIKVS------------FELKNTGRMDGDEVSQVYVKIPSSGIFMPIKE 791
Query: 705 LVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
L F++ + GA + V INI D + T P GE+ IG + + L
Sbjct: 792 LKGFQRSTLKKGATKNVEINIRKDLLRYWDDATETFITPKGEYEFMIGTSSQDIQL 847
>gi|218262493|ref|ZP_03476939.1| hypothetical protein PRABACTJOHN_02617 [Parabacteroides johnsonii
DSM 18315]
gi|218223341|gb|EEC95991.1| hypothetical protein PRABACTJOHN_02617 [Parabacteroides johnsonii
DSM 18315]
Length = 868
Score = 285 bits (729), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 170/454 (37%), Positives = 241/454 (53%), Gaps = 37/454 (8%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
PF LPI +R++DL+ RL+ +EKV +++ A+ RLGI Y+WW+EALHGV+ G
Sbjct: 27 PFRNPDLPIDERIDDLLKRLTAEEKVGQMMNTTPAIERLGIPQYDWWNEALHGVARAGK- 85
Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTY 152
AT FPQ I A++F+ +VSDEARA Y+ GLT+
Sbjct: 86 ---------ATVFPQAIAMAATFDDDALYETFTMVSDEARAKYHQYQKDKEYDRYKGLTF 136
Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL 212
W+PN+NIFRDPRWGRG ET GEDP L+ + + V+GLQG D K A KH+ +
Sbjct: 137 WTPNINIFRDPRWGRGMETYGEDPYLTERMGVAVVKGLQGDDPKYFKTHACAKHYAVHSG 196
Query: 213 DNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILK 272
WN R F+ V+ +D+ T+ F V EG V VMC+YN+ G P C+ +L
Sbjct: 197 PEWN---RHEFDVTVTPRDLWQTYLPAFEALVKEGNVQEVMCAYNRYQGKPCCSSDKLLI 253
Query: 273 RTIRGEWRLNGYIVSDCDSVGVYYDTQHFT------STPEEAAADAIRAGLDLDCG-PFL 325
+R W I+SDC ++ +++ T E A+ADA+ G DL+CG +
Sbjct: 254 DILRNSWGYENIILSDCGAINDFWERDERTPRHETHPDAESASADAVLNGTDLECGNSYR 313
Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQE 384
L A++ G +SE D++ +L L + LGMFD P Q PY + V +P+H
Sbjct: 314 AL--VKALKDGKISENDLDVSLRRLLKGRFELGMFD--PDEQVPYAQIPYNVVESPEHVA 369
Query: 385 LALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG 444
ALE A + +VLLKN+ +LPLS R +AV+GPN+ + + NY G T L+G
Sbjct: 370 QALEMAHKSMVLLKNKNNTLPLSKTI-RKIAVVGPNAADSTMLWANYNGFPTHTVTILEG 428
Query: 445 IGRY---ARTIHQQGCKDVACADDQLFGAAIDAS 475
I I++ GC A Q G+ I +S
Sbjct: 429 IRNKVPDTEVIYELGCNHAADFVIQDLGSHITSS 462
Score = 132 bits (331), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 100/311 (32%), Positives = 141/311 (45%), Gaps = 49/311 (15%)
Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
+ A + AD + V G+ +E E + DR + LP QQE+V K A
Sbjct: 595 YAATAAKVKDADVIVYVGGISPRLEGEEMPVNVEGFKKGDRTNIELPKVQQEMV-KALKA 653
Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
+ P + VL +G + A + I AI+ A Y GQ GTA+ADILFG NP G+LP+T
Sbjct: 654 TGKPVVYVLCTGSAL--ALNWEEANIDAILNAWYGGQEAGTAVADILFGDYNPSGRLPVT 711
Query: 578 WYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
+Y I LP + +M+ GRTYR+ +YPFG+G+SYTNF +
Sbjct: 712 FYKS--IDQLPDFEDYSMK--------GRTYRYMTETPLYPFGYGLSYTNFAYR------ 755
Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA 696
NA +S I AK +TL D+ N G DG ++ P
Sbjct: 756 -------------NAKLSSGKI----AKDQSVTL--TFDIANTGKMDGDEVAQIYIKNPN 796
Query: 697 GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKH 756
P K L AF +VHV AG Q V I + + S D + T + G++ I GG+
Sbjct: 797 DPEGPIKALKAFLRVHVKAGDSQEVNIELAPETFHSFNDNTQTMEVRPGKYQILYGGSSD 856
Query: 757 SVSLHAATLGV 767
+L L +
Sbjct: 857 DKALQRLELTI 867
>gi|212692496|ref|ZP_03300624.1| hypothetical protein BACDOR_01992 [Bacteroides dorei DSM 17855]
gi|212664971|gb|EEB25543.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
dorei DSM 17855]
Length = 864
Score = 285 bits (729), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 175/474 (36%), Positives = 254/474 (53%), Gaps = 51/474 (10%)
Query: 1 MASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
M + I + F L++LS + A + +L +R DL+ +L
Sbjct: 1 MRNRILYLFASLVILSGCGNQPAYKNS------------------NLSPEERAEDLLQQL 42
Query: 61 SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
+L+EKV L++ + V RLGIK Y WW+EALHGV+ G AT FPQ I A
Sbjct: 43 TLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL----------ATVFPQPIGMA 92
Query: 121 SSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRDPRWGRGQETP 172
+SF I VSDEARA +A GLT W+P VNI+RDPRWGRG ET
Sbjct: 93 ASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMWTPTVNIYRDPRWGRGIETY 152
Query: 173 GEDPVLSGKYAASYVRGLQGSDGDRL--KVAASCKHFTAYDLDNWNGVDRFHFNAK-VSK 229
GEDP L+ + V+GLQ +D ++ K+ A KHF + WN R FNA+ +
Sbjct: 153 GEDPYLTSVMGVNVVKGLQCTDANQKYDKIHACAKHFAVHSGPEWN---RHEFNAENIKP 209
Query: 230 QDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDC 289
+D+ +T+ VPF V EGKV VMC+YN++ G P C +L + +R EW G ++SDC
Sbjct: 210 RDLHETYLVPFEALVKEGKVKEVMCAYNRLEGDPCCGSDRLLMQILRQEWGYEGIVLSDC 269
Query: 290 DSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
++ +Y + + P E A+A A+ +G DL+CG ESA ++GL+SE DI+ ++
Sbjct: 270 GAIDDFYREKGHKTHPDAESASAAAVLSGTDLECGSSYKALVESA-KKGLISEKDIDVSV 328
Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
L + LG D +PS + + VC+ +H L+L+ AR+ + LL N+ LPL
Sbjct: 329 KRLLKARFELGEMD-DPSKVEWTKIPYSVVCSAEHDSLSLDIARKSMTLLLNKNNILPLK 387
Query: 408 HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGC 457
+T+AV+GPN++ +V GNY G T L+GI G + I++QGC
Sbjct: 388 R-GGQTIAVMGPNANDSVMQWGNYNGTPKHTITLLEGIRSAMGENDKLIYEQGC 440
Score = 126 bits (317), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 97/301 (32%), Positives = 136/301 (45%), Gaps = 50/301 (16%)
Query: 476 RQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILV 525
+ AD I G+ S+E E + DR + LP Q+EL+ + A K I V
Sbjct: 600 KDADIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCDAGK-KVIFV 658
Query: 526 LMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYIT 585
SG PI A AI+ A YPGQ+GG A A++LFG NP G+LP+T+Y I
Sbjct: 659 NFSGSPI--AMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYNPAGRLPVTFYRN--IA 714
Query: 586 NLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGR 645
LP E GRTYR++KG ++PFG+G+SYT F + + LD
Sbjct: 715 QLPDFE-------DYNMTGRTYRYFKGDPLFPFGYGLSYTTFNYD--------NIKLD-- 757
Query: 646 HGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQL 705
+ IKV T + + V N G++DG + V+ P K L
Sbjct: 758 ----------QTIKVGE------TAKMVIPVTNAGNRDGEEVVQVYLKKQEDAEGPAKTL 801
Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGTKHSVSLHAAT 764
AF++V +PAG V + + K L D ++ T R G +I +GG L T
Sbjct: 802 RAFKRVQIPAGKTVNVELEL-TPKQLEWWDAQTNTMRTIAGNFDIMVGGNSKDAELQVKT 860
Query: 765 L 765
L
Sbjct: 861 L 861
>gi|330996729|ref|ZP_08320604.1| glycosyl hydrolase family 3 protein [Paraprevotella xylaniphila YIT
11841]
gi|329572574|gb|EGG54217.1| glycosyl hydrolase family 3 protein [Paraprevotella xylaniphila YIT
11841]
Length = 852
Score = 285 bits (729), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 171/421 (40%), Positives = 242/421 (57%), Gaps = 29/421 (6%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
F + P +R+ DL+ RL+++EK+ LL++ A A+ RLGI Y +EALHGV V PG
Sbjct: 29 FRDMKAPQHERIMDLLSRLTVEEKISLLVNDAPAIGRLGIDKYNHGNEALHGV--VRPG- 85
Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN----------GGTAGLT 151
DF T FPQ I A+ +N L I +SDEAR + G + LT
Sbjct: 86 ----DF---TVFPQAIGMAAMWNPELLYRISSAISDEARGRWKELEYGKKQIAGASDLLT 138
Query: 152 YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYD 211
+WSP VN+ RDPRWGR ET GEDP LSG ++V+GLQG+ LK ++ KHF +
Sbjct: 139 FWSPTVNMARDPRWGRTPETYGEDPYLSGVLGVAFVKGLQGNHPRYLKTVSTPKHFAVNN 198
Query: 212 LDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
++ +R NAKVS++D+ + + F C+ EGK S+M +YN VN VP + ++
Sbjct: 199 EEH----NRSSCNAKVSERDLREYYLPSFERCITEGKAQSIMMAYNAVNDVPCTVNTYLI 254
Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTE 330
K +RG+W NGYIVSDC + H+ T E AA A++AGLDL+CG G
Sbjct: 255 KNVLRGDWGFNGYIVSDCSAPEWMITKHHYVKTREAAATLAVKAGLDLECGNQVYGEGLL 314
Query: 331 SAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAA 390
A ++ ++SE DI++A L +M LG+FD +PS PY + P V HQ+LALEAA
Sbjct: 315 KAYRQYMVSEADIDSAAYRILRGRMMLGLFD-DPSQNPYNQIEPSVVGCKAHQDLALEAA 373
Query: 391 RQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGY-TTPLQGIGRYA 449
RQ +VLLKN+ LPL+ + +++AV+G ++ G+Y+G T L GI +YA
Sbjct: 374 RQSMVLLKNKDNFLPLNPQKVKSIAVVGISA--GHCEFGDYSGTPKNEPVTILDGIKQYA 431
Query: 450 R 450
Sbjct: 432 E 432
Score = 134 bits (338), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 89/296 (30%), Positives = 147/296 (49%), Gaps = 44/296 (14%)
Query: 468 FGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLM 527
+G A + + D T+ V+G+++SIE E DR L LP QQE + ++ + T++VL+
Sbjct: 594 YGDAGKVAAECDVTVAVLGINKSIEREGQDRFTLELPIDQQEFIKELYKVNPN-TVVVLV 652
Query: 528 SGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL 587
+G + V + D + AI+ A YPG+ GG A+A++LFG NPGG+LP+T+Y
Sbjct: 653 AGSSLAVNWM--DENVPAILNAWYPGEQGGNAVAEVLFGDYNPGGRLPLTYYNS------ 704
Query: 588 PMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHG 647
+ E+ + S + GRTY++++G +Y FG+G+SYT F R+
Sbjct: 705 -LDEIPAFDNYSVK--GRTYQYFEGQPLYEFGYGLSYTKF-----------------RYK 744
Query: 648 SINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLV 706
S +++ R T+ V +V N G DG V+ P G + P KQL
Sbjct: 745 SKGVSVA------------RDTVKVSFEVSNTGKYDGDEVAQVYVKYPETGTYMPLKQLH 792
Query: 707 AFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRI-PLGEHNIHIGGTKHSVSLH 761
F++VH+ G +V + + K L D + + P GE+ +G + +
Sbjct: 793 GFKRVHIKKGKTSKVTVGVPK-KDLRYWDEQERKFVTPKGEYTFMVGASSEDIKFR 847
>gi|237709184|ref|ZP_04539665.1| glycoside hydrolase family 3 protein [Bacteroides sp. 9_1_42FAA]
gi|229456880|gb|EEO62601.1| glycoside hydrolase family 3 protein [Bacteroides sp. 9_1_42FAA]
Length = 864
Score = 285 bits (729), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 175/474 (36%), Positives = 254/474 (53%), Gaps = 51/474 (10%)
Query: 1 MASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
M + I + F L++LS + A + +L +R DL+ +L
Sbjct: 1 MRNRILYLFASLVILSGCGNQPAYKNS------------------NLSPEERAEDLLQQL 42
Query: 61 SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
+L+EKV L++ + V RLGIK Y WW+EALHGV+ G AT FPQ I A
Sbjct: 43 TLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL----------ATVFPQPIGMA 92
Query: 121 SSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRDPRWGRGQETP 172
+SF I VSDEARA +A GLT W+P VNI+RDPRWGRG ET
Sbjct: 93 ASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMWTPTVNIYRDPRWGRGIETY 152
Query: 173 GEDPVLSGKYAASYVRGLQGSDGDRL--KVAASCKHFTAYDLDNWNGVDRFHFNAK-VSK 229
GEDP L+ + V+GLQ +D ++ K+ A KHF + WN R FNA+ +
Sbjct: 153 GEDPYLTSVMGVNVVKGLQCTDANQKYDKIHACAKHFAVHSGPEWN---RHEFNAENIKP 209
Query: 230 QDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDC 289
+D+ +T+ VPF V EGKV VMC+YN++ G P C +L + +R EW G ++SDC
Sbjct: 210 RDLHETYLVPFEALVKEGKVKEVMCAYNRLEGDPCCGSDRLLMQILRQEWGYEGIVLSDC 269
Query: 290 DSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
++ +Y + + P E A+A A+ +G DL+CG ESA ++GL+SE DI+ ++
Sbjct: 270 GAIDDFYREKGHKTHPDAESASAAAVLSGTDLECGSSYKALVESA-KKGLISEKDIDVSV 328
Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
L + LG D +PS + + VC+ +H L+L+ AR+ + LL N+ LPL
Sbjct: 329 KRLLKARFELGEMD-DPSKVEWTKIPYSVVCSAEHDSLSLDIARKSMTLLLNKNNILPLK 387
Query: 408 HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGC 457
+T+AV+GPN++ +V GNY G T L+GI G + I++QGC
Sbjct: 388 R-GGQTIAVMGPNANDSVMQWGNYNGTPKHTITLLEGIRSAMGENDKLIYEQGC 440
Score = 126 bits (316), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 97/301 (32%), Positives = 136/301 (45%), Gaps = 50/301 (16%)
Query: 476 RQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILV 525
+ AD I G+ S+E E + DR + LP Q+EL+ + A K I V
Sbjct: 600 KDADIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCDAGK-KVIFV 658
Query: 526 LMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYIT 585
SG PI A AI+ A YPGQ+GG A A++LFG NP G+LP+T+Y I
Sbjct: 659 NFSGSPI--AMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYNPAGRLPVTFYRN--IA 714
Query: 586 NLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGR 645
LP E GRTYR++KG ++PFG+G+SYT F + + LD
Sbjct: 715 QLPDFE-------DYNMTGRTYRYFKGDPLFPFGYGLSYTTFNYD--------NIKLD-- 757
Query: 646 HGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQL 705
+ IKV T + + V N G++DG + V+ P K L
Sbjct: 758 ----------QTIKVGE------TAKMVIPVTNAGNRDGEEVVQVYLKKQEDAEGPAKTL 801
Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGTKHSVSLHAAT 764
AF++V +PAG V + + K L D ++ T R G +I +GG L T
Sbjct: 802 RAFKRVQIPAGKTVNVELEL-TPKQLEWWDAQTNTMRTIAGNFDIMVGGNSKDAELQVKT 860
Query: 765 L 765
L
Sbjct: 861 L 861
>gi|313204584|ref|YP_004043241.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
gi|312443900|gb|ADQ80256.1| glycoside hydrolase family 3 domain protein [Paludibacter
propionicigenes WB4]
Length = 727
Score = 285 bits (729), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 225/725 (31%), Positives = 355/725 (48%), Gaps = 109/725 (15%)
Query: 39 TLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIK--GYEWWSEALHGVSN 96
T PF LP +R+++L+ ++L EKV L S VPRLGI+ G+ SE LHG++
Sbjct: 24 TFPFQNTGLPDNERLDNLLSLMTLDEKVNAL-STNLGVPRLGIRNTGH---SEGLHGMAL 79
Query: 97 VGPGTKFGGDFPG------ATSFPQVITTASSFNATLWEAIGRVVSDEARAMY---NGGT 147
GPG +GG G T FPQ +++ L + + + + E R N
Sbjct: 80 GGPGN-WGGSERGVAKTYPTTIFPQAYGLGETWDTELIQKVADIEATEIRFYAQNANLQK 138
Query: 148 AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHF 207
G+ +PN ++ RDPRWGR +E+ GED L + ++V+GLQG+D K A+ KHF
Sbjct: 139 GGMVMRAPNADLARDPRWGRTEESYGEDAFLGSRLTVAFVKGLQGNDPKYWKSASLMKHF 198
Query: 208 TAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCAD 267
A N N R ++ ++ + + PF + EG + M SYN NGVP +
Sbjct: 199 LA----NSNEDGRDSTSSNFDERLFREYYSFPFYKGITEGGSRAFMASYNAWNGVPMTVN 254
Query: 268 PNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLG- 326
P ILK+ R EW NG I +D ++ + + H T E AA ++A + G FL
Sbjct: 255 P-ILKKIARDEWGNNGIICTDGGALSLLVNAHHAFPTLTEGAAAVVKASV----GQFLDN 309
Query: 327 --LHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTP---- 380
+ A+++GLL+E +I+N + V ++LG+ D + S PY +G D +P
Sbjct: 310 FRSYIYEALKKGLLTEKNIDNVIRGNFYVALKLGLLDADQSKVPYTGIGVTDTVSPWNKQ 369
Query: 381 DHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTT 440
D + + + +VLLKN LPL+ + +++AVIGP ++ ++ Y+G T
Sbjct: 370 DTKAFVRKVTAKSVVLLKNTAGLLPLNKSKIKSIAVIGPRAN--EVLLDWYSG-----TP 422
Query: 441 PLQGIGRYARTIHQQGCKDVACADDQLFGAAID-------ASRQADATILVMGLD----- 488
P YA +I QG K+ D ++F A D A+R+AD I+ +G
Sbjct: 423 P------YAVSI-LQGIKNAVGKDIEVFYAPSDEMDKATLAARKADVAIVCVGNHPYGTD 475
Query: 489 --------QSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKND 540
S EA+DR + L +QE + K+ M + T++VL+S P + +++ +
Sbjct: 476 ARWKISPVPSDGREAVDRKSITL---EQEDLVKLVMQANPKTVMVLVSNFPFAINWSQEN 532
Query: 541 PRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL-PMTEMAMRPSQS 599
+ AI+ Q G +AD++FG +P G+ TW + IT+L PM + +R
Sbjct: 533 --VPAILHVTNNSQELGNGLADVIFGDVSPAGRTTQTWV--KSITDLPPMMDYDIR---- 584
Query: 600 KRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIK 659
GRTY+++K +YPFG G+SYT+F ++ G S N T++
Sbjct: 585 ---HGRTYQYFKSKPLYPFGFGLSYTSFEYS-------------GLETS-NPTLTD---- 623
Query: 660 VTHAKCNRLTLGVQVDVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQ 718
++ V V VKN+G +DG + L S P + P KQL F++V +PAG
Sbjct: 624 ---------SIFVSVKVKNIGKRDGDEVIQLYVSYPDSKVERPMKQLKGFKRVFIPAGKS 674
Query: 719 QRVGI 723
+ V I
Sbjct: 675 KTVEI 679
>gi|423240769|ref|ZP_17221883.1| hypothetical protein HMPREF1065_02506 [Bacteroides dorei
CL03T12C01]
gi|392643731|gb|EIY37480.1| hypothetical protein HMPREF1065_02506 [Bacteroides dorei
CL03T12C01]
Length = 864
Score = 285 bits (728), Expect = 8e-74, Method: Compositional matrix adjust.
Identities = 175/474 (36%), Positives = 254/474 (53%), Gaps = 51/474 (10%)
Query: 1 MASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
M + I + F L++LS + A + +L +R DL+ +L
Sbjct: 1 MRNRILYLFASLVILSGCGNQPAYKNS------------------NLSPEERAEDLLQQL 42
Query: 61 SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
+L+EKV L++ + V RLGIK Y WW+EALHGV+ G AT FPQ I A
Sbjct: 43 TLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL----------ATVFPQPIGMA 92
Query: 121 SSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRDPRWGRGQETP 172
+SF I VSDEARA +A GLT W+P VNI+RDPRWGRG ET
Sbjct: 93 ASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMWTPTVNIYRDPRWGRGIETY 152
Query: 173 GEDPVLSGKYAASYVRGLQGSDGDRL--KVAASCKHFTAYDLDNWNGVDRFHFNAK-VSK 229
GEDP L+ + V+GLQ +D ++ K+ A KHF + WN R FNA+ +
Sbjct: 153 GEDPYLTSVMGVNVVKGLQCTDANQKYDKIHACAKHFAVHSGPEWN---RHEFNAENIKP 209
Query: 230 QDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDC 289
+D+ +T+ VPF V EGKV VMC+YN++ G P C +L + +R EW G ++SDC
Sbjct: 210 RDLHETYLVPFEALVKEGKVKEVMCAYNRLEGDPCCGSDRLLMQILRQEWGYEGIVLSDC 269
Query: 290 DSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
++ +Y + + P E A+A A+ +G DL+CG ESA ++GL+SE DI+ ++
Sbjct: 270 GAIDDFYREKGHKTHPNAESASAAAVLSGTDLECGSSYKALVESA-KKGLISEKDIDVSV 328
Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
L + LG D +PS + + VC+ +H L+L+ AR+ + LL N+ LPL
Sbjct: 329 KRLLKARFELGEMD-DPSKVEWTKIPYSVVCSAEHDSLSLDIARKSMTLLLNKNNILPLK 387
Query: 408 HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGC 457
+T+AV+GPN++ +V GNY G T L+GI G + I++QGC
Sbjct: 388 R-GGQTIAVMGPNANDSVMQWGNYNGTPKHTITLLEGIRSAMGENDKLIYEQGC 440
Score = 126 bits (316), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 97/301 (32%), Positives = 136/301 (45%), Gaps = 50/301 (16%)
Query: 476 RQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILV 525
+ AD I G+ S+E E + DR + LP Q+EL+ + A K I V
Sbjct: 600 KDADIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCDAGK-KVIFV 658
Query: 526 LMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYIT 585
SG PI A AI+ A YPGQ+GG A A++LFG NP G+LP+T+Y I
Sbjct: 659 NFSGSPI--AMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYNPAGRLPVTFYRN--IA 714
Query: 586 NLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGR 645
LP E GRTYR++KG ++PFG+G+SYT F + + LD
Sbjct: 715 QLPDFE-------DYNMTGRTYRYFKGDPLFPFGYGLSYTTFNYD--------NIKLD-- 757
Query: 646 HGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQL 705
+ IKV T + + V N G++DG + V+ P K L
Sbjct: 758 ----------QTIKVGE------TAKMVIPVTNAGNRDGEEVVQVYLKKQEDAEGPAKTL 801
Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGTKHSVSLHAAT 764
AF++V +PAG V + + K L D ++ T R G +I +GG L T
Sbjct: 802 RAFKRVQIPAGKTVNVELEL-TPKQLEWWDAQTNTMRTIAGNFDIMVGGNSKDAELQVKT 860
Query: 765 L 765
L
Sbjct: 861 L 861
>gi|345514226|ref|ZP_08793739.1| glycoside hydrolase family beta-glycosidase [Bacteroides dorei
5_1_36/D4]
gi|229437207|gb|EEO47284.1| glycoside hydrolase family beta-glycosidase [Bacteroides dorei
5_1_36/D4]
Length = 864
Score = 285 bits (728), Expect = 8e-74, Method: Compositional matrix adjust.
Identities = 175/474 (36%), Positives = 254/474 (53%), Gaps = 51/474 (10%)
Query: 1 MASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
M + I + F L++LS + A + +L +R DL+ +L
Sbjct: 1 MRNRILYLFASLVILSGCGNQPAYKNS------------------NLSPEERAEDLLQQL 42
Query: 61 SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
+L+EKV L++ + V RLGIK Y WW+EALHGV+ G AT FPQ I A
Sbjct: 43 TLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL----------ATVFPQPIGMA 92
Query: 121 SSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRDPRWGRGQETP 172
+SF I VSDEARA +A GLT W+P VNI+RDPRWGRG ET
Sbjct: 93 ASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMWTPTVNIYRDPRWGRGIETY 152
Query: 173 GEDPVLSGKYAASYVRGLQGSDGDRL--KVAASCKHFTAYDLDNWNGVDRFHFNAK-VSK 229
GEDP L+ + V+GLQ +D ++ K+ A KHF + WN R FNA+ +
Sbjct: 153 GEDPYLTSVMGVNVVKGLQCTDANQKYDKIHACAKHFAVHSGPEWN---RHEFNAENIKP 209
Query: 230 QDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDC 289
+D+ +T+ VPF V EGKV VMC+YN++ G P C +L + +R EW G ++SDC
Sbjct: 210 RDLHETYLVPFEALVKEGKVKEVMCAYNRLEGDPCCGSDRLLMQILRQEWGYEGIVLSDC 269
Query: 290 DSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
++ +Y + + P E A+A A+ +G DL+CG ESA ++GL+SE DI+ ++
Sbjct: 270 GAIDDFYREKGHKTHPDAESASAAAVLSGTDLECGSSYKALVESA-KKGLISEKDIDVSV 328
Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
L + LG D +PS + + VC+ +H L+L+ AR+ + LL N+ LPL
Sbjct: 329 KRLLKARFELGEMD-DPSKVEWTKIPYSVVCSAEHDSLSLDIARKSMTLLLNKNNILPLK 387
Query: 408 HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGC 457
+T+AV+GPN++ +V GNY G T L+GI G + I++QGC
Sbjct: 388 R-GGQTIAVMGPNANDSVMQWGNYNGTPKHTITLLEGIRSAMGENDKLIYEQGC 440
Score = 126 bits (316), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 97/301 (32%), Positives = 136/301 (45%), Gaps = 50/301 (16%)
Query: 476 RQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILV 525
+ AD I G+ S+E E + DR + LP Q+EL+ + A K I V
Sbjct: 600 KDADIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCDAGK-KVIFV 658
Query: 526 LMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYIT 585
SG PI A AI+ A YPGQ+GG A A++LFG NP G+LP+T+Y I
Sbjct: 659 NFSGSPI--AMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYNPAGRLPVTFYRN--IA 714
Query: 586 NLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGR 645
LP E GRTYR++KG ++PFG+G+SYT F + + LD
Sbjct: 715 QLPDFE-------DYNMTGRTYRYFKGDPLFPFGYGLSYTTFNYD--------NIKLD-- 757
Query: 646 HGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQL 705
+ IKV T + + V N G++DG + V+ P K L
Sbjct: 758 ----------QTIKVGE------TAKMVIPVTNAGNRDGEEVVQVYLKKQEDAEGPAKTL 801
Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGTKHSVSLHAAT 764
AF++V +PAG V + + K L D ++ T R G +I +GG L T
Sbjct: 802 RAFKRVQIPAGKTVNVELEL-TPKQLEWWDAQTNTMRTIAGNFDIMVGGNSKDAELQVKT 860
Query: 765 L 765
L
Sbjct: 861 L 861
>gi|423230604|ref|ZP_17217008.1| hypothetical protein HMPREF1063_02828 [Bacteroides dorei
CL02T00C15]
gi|423244313|ref|ZP_17225388.1| hypothetical protein HMPREF1064_01594 [Bacteroides dorei
CL02T12C06]
gi|392630748|gb|EIY24734.1| hypothetical protein HMPREF1063_02828 [Bacteroides dorei
CL02T00C15]
gi|392642494|gb|EIY36260.1| hypothetical protein HMPREF1064_01594 [Bacteroides dorei
CL02T12C06]
Length = 864
Score = 285 bits (728), Expect = 9e-74, Method: Compositional matrix adjust.
Identities = 175/474 (36%), Positives = 254/474 (53%), Gaps = 51/474 (10%)
Query: 1 MASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
M + I + F L++LS + A + +L +R DL+ +L
Sbjct: 1 MRNRILYLFASLVILSGCGNQPAYKNS------------------NLSPEERAEDLLQQL 42
Query: 61 SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
+L+EKV L++ + V RLGIK Y WW+EALHGV+ G AT FPQ I A
Sbjct: 43 TLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL----------ATVFPQPIGMA 92
Query: 121 SSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRDPRWGRGQETP 172
+SF I VSDEARA +A GLT W+P VNI+RDPRWGRG ET
Sbjct: 93 ASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMWTPTVNIYRDPRWGRGIETY 152
Query: 173 GEDPVLSGKYAASYVRGLQGSDGDRL--KVAASCKHFTAYDLDNWNGVDRFHFNAK-VSK 229
GEDP L+ + V+GLQ +D ++ K+ A KHF + WN R FNA+ +
Sbjct: 153 GEDPYLTSVMGVNVVKGLQCTDANQKYDKIHACAKHFAVHSGPEWN---RHEFNAENIKP 209
Query: 230 QDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDC 289
+D+ +T+ VPF V EGKV VMC+YN++ G P C +L + +R EW G ++SDC
Sbjct: 210 RDLHETYLVPFEALVKEGKVKEVMCAYNRLEGDPCCGSDRLLMQILRQEWGYEGIVLSDC 269
Query: 290 DSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
++ +Y + + P E A+A A+ +G DL+CG ESA ++GL+SE DI+ ++
Sbjct: 270 GAIDDFYREKGHKTHPDAESASAAAVLSGTDLECGSSYKALVESA-KKGLISEKDIDVSV 328
Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
L + LG D +PS + + VC+ +H L+L+ AR+ + LL N+ LPL
Sbjct: 329 KRLLKARFELGEMD-DPSKVEWTKIPYSVVCSAEHDSLSLDIARKSMTLLLNKNNILPLK 387
Query: 408 HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGC 457
+T+AV+GPN++ +V GNY G T L+GI G + I++QGC
Sbjct: 388 R-GGQTIAVMGPNANDSVMQWGNYNGTPKHTITLLEGIRSAMGENDKLIYEQGC 440
Score = 125 bits (313), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 94/301 (31%), Positives = 133/301 (44%), Gaps = 50/301 (16%)
Query: 476 RQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILV 525
+ AD I G+ S+E E + DR + LP Q+EL+ + A K I V
Sbjct: 600 KDADIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCDAGK-KVIFV 658
Query: 526 LMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYIT 585
SG PI A AI+ A YPGQ+GG A A++LFG NP G+LP+T+Y I
Sbjct: 659 NFSGSPI--AMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYNPAGRLPVTFYRN--IA 714
Query: 586 NLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGR 645
LP E GRTYR++KG ++PFG+G+SYT F + + V
Sbjct: 715 QLPDFE-------DYNMTGRTYRYFKGDPLFPFGYGLSYTTFNYDNIKLEQTIKV----- 762
Query: 646 HGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQL 705
G+ K+ + V N G++DG + V+ P K L
Sbjct: 763 ---------GETAKMV------------IPVTNTGNRDGEEVVQVYLKKQEDTEGPTKTL 801
Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGTKHSVSLHAAT 764
AF++V +PAG V + + K L D ++ T R G +I +GG L T
Sbjct: 802 RAFKRVQIPAGKTVNVELEL-TPKQLEWWDAQTNTMRTIAGNFDIMVGGNSKDAELQVKT 860
Query: 765 L 765
L
Sbjct: 861 L 861
>gi|365121914|ref|ZP_09338824.1| hypothetical protein HMPREF1033_02170 [Tannerella sp.
6_1_58FAA_CT1]
gi|363643627|gb|EHL82934.1| hypothetical protein HMPREF1033_02170 [Tannerella sp.
6_1_58FAA_CT1]
Length = 1073
Score = 285 bits (728), Expect = 9e-74, Method: Compositional matrix adjust.
Identities = 163/409 (39%), Positives = 237/409 (57%), Gaps = 38/409 (9%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
PF +L +R+ DL+ RL++ EK+ LL + + A+PRLGI Y +EALHGV V PG
Sbjct: 28 PFRDTTLSHHERIKDLLSRLNVSEKISLLRATSPAIPRLGIDKYYHGNEALHGV--VRPG 85
Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN----------GGTAGL 150
KF T FPQ I AS +N + + +SDEAR +N G + L
Sbjct: 86 -KF-------TVFPQAIGLASMWNPDFLQEVSTAISDEARGRWNELNQGKDQTAGASDLL 137
Query: 151 TYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAY 210
T+WSP +N+ RDPRWGR ET GEDP L+G ++VRGLQG+D +KV ++ KHF A
Sbjct: 138 TFWSPTINMARDPRWGRTPETYGEDPFLTGTLGTAFVRGLQGNDPKYIKVVSTPKHFAAN 197
Query: 211 DLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNI 270
+ ++ +R NA +S++D+ + + F C+ EG+ SVM +YN VNG+P + +
Sbjct: 198 NEEH----NRASGNAVISERDLREYYFPAFEKCIKEGQAQSVMSAYNAVNGIPCTLNKWL 253
Query: 271 LKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG------PF 324
L +R +W +GY+VSDC + H+ T EEAA+ I+AGLDL+CG P
Sbjct: 254 LTDVLRDDWGFDGYVVSDCSAPEYIVSQHHYVDTYEEAASLCIKAGLDLECGDNVYITPL 313
Query: 325 LGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQE 384
L +A RG+++ +I++A L +MRLG+FD +P+ PY + P V H+E
Sbjct: 314 L-----NAYNRGMVTMSEIDSAAYRVLRGRMRLGLFD-DPNENPYNKISPSIVGCEKHRE 367
Query: 385 LALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG 433
LALEAARQ +VLLKN LP+ +++AV+G N+ G+Y+G
Sbjct: 368 LALEAARQSLVLLKNDKDMLPIQTDNIKSIAVVGINA--ANCEFGDYSG 414
Score = 131 bits (329), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 87/296 (29%), Positives = 148/296 (50%), Gaps = 43/296 (14%)
Query: 468 FGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLM 527
+G A + R +D TI V+G+D++IE E DR+ + LP QQ + + A+ T++VL+
Sbjct: 733 YGDAGEIIRGSDLTIAVLGIDRTIEREGQDRSTIELPEDQQIFIEEAYKANPN-TVVVLV 791
Query: 528 SGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL 587
+G + + + D I A++ A YPG+ GGTA+A+ LFG NPGG+LP+T+Y +++L
Sbjct: 792 AGSSLAINWI--DQNIPAVLDAWYPGEQGGTAVAEALFGDYNPGGRLPLTFYNS--LSDL 847
Query: 588 P-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
P + +R + RTY +++G +YPFG+G+SYT+F +
Sbjct: 848 PAFDDYNVRNN-------RTYMYFEGKPLYPFGYGLSYTDFAY----------------- 883
Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQL 705
+ + VT + N + V+ V N G+ DG V+ P G P KQL
Sbjct: 884 ---------RGLDVTQDEEN---VTVKFFVSNTGNYDGDEVAQVYIQFPDQGTTLPLKQL 931
Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
F++VH+ G + + + I + + + P G + +G + + L
Sbjct: 932 KGFKRVHISKGQETEITVRIPKKELRLWSENNSEFYTPEGNYIFLVGASSEDIRLQ 987
>gi|265752711|ref|ZP_06088280.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
3_1_33FAA]
gi|263235897|gb|EEZ21392.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
3_1_33FAA]
Length = 864
Score = 285 bits (728), Expect = 9e-74, Method: Compositional matrix adjust.
Identities = 175/474 (36%), Positives = 254/474 (53%), Gaps = 51/474 (10%)
Query: 1 MASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
M + I + F L++LS + A + +L +R DL+ +L
Sbjct: 1 MRNRILYLFASLVILSGCGNQPAYKNS------------------NLSPEERAEDLLQQL 42
Query: 61 SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
+L+EKV L++ + V RLGIK Y WW+EALHGV+ G AT FPQ I A
Sbjct: 43 TLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL----------ATVFPQPIGMA 92
Query: 121 SSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRDPRWGRGQETP 172
+SF I VSDEARA +A GLT W+P VNI+RDPRWGRG ET
Sbjct: 93 ASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMWTPTVNIYRDPRWGRGIETY 152
Query: 173 GEDPVLSGKYAASYVRGLQGSDGDRL--KVAASCKHFTAYDLDNWNGVDRFHFNAK-VSK 229
GEDP L+ + V+GLQ +D ++ K+ A KHF + WN R FNA+ +
Sbjct: 153 GEDPYLTSVMGVNVVKGLQCTDANQKYDKIHACAKHFAVHSGPEWN---RHEFNAENIKP 209
Query: 230 QDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDC 289
+D+ +T+ VPF V EGKV VMC+YN++ G P C +L + +R EW G ++SDC
Sbjct: 210 RDLHETYLVPFEALVKEGKVKEVMCAYNRLEGDPCCGSDRLLMQILRQEWGYEGIVLSDC 269
Query: 290 DSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
++ +Y + + P E A+A A+ +G DL+CG ESA ++GL+SE DI+ ++
Sbjct: 270 GAIDDFYREKGHKTHPDAESASAAAVLSGTDLECGSSYKALVESA-KKGLISEKDIDVSV 328
Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
L + LG D +PS + + VC+ +H L+L+ AR+ + LL N+ LPL
Sbjct: 329 KRLLKARFELGEMD-DPSKVEWTKIPYSVVCSAEHDSLSLDIARKSMTLLLNKNNILPLK 387
Query: 408 HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGC 457
+T+AV+GPN++ +V GNY G T L+GI G + I++QGC
Sbjct: 388 R-GGQTIAVMGPNANDSVMQWGNYNGTPKHTITLLEGIRSAMGENDKLIYEQGC 440
Score = 126 bits (316), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 97/301 (32%), Positives = 136/301 (45%), Gaps = 50/301 (16%)
Query: 476 RQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILV 525
+ AD I G+ S+E E + DR + LP Q+EL+ + A K I V
Sbjct: 600 KDADIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCDAGK-KVIFV 658
Query: 526 LMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYIT 585
SG PI A AI+ A YPGQ+GG A A++LFG NP G+LP+T+Y I
Sbjct: 659 NFSGSPI--AMEPETQYCQAILQAWYPGQSGGKAAAEVLFGDYNPAGRLPVTFYRN--IA 714
Query: 586 NLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGR 645
LP E GRTYR++KG ++PFG+G+SYT F
Sbjct: 715 QLPDFE-------DYNMTGRTYRYFKGDPLFPFGYGLSYTTF-----------------N 750
Query: 646 HGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQL 705
+G+I + IKV T + + V N G++DG + V+ P K L
Sbjct: 751 YGNIKLE---QTIKVGE------TAKMVIPVTNTGNRDGEEVVQVYLKKQEDTEGPAKTL 801
Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGTKHSVSLHAAT 764
AF++V +PAG V + + K L D ++ T R G +I +GG L T
Sbjct: 802 RAFKRVQIPAGKTVNVELEL-TPKQLEWWDAQTNTMRTIAGNFDIMVGGNSKDAELQVKT 860
Query: 765 L 765
L
Sbjct: 861 L 861
>gi|395803818|ref|ZP_10483061.1| glycoside hydrolase family 3 protein [Flavobacterium sp. F52]
gi|395434089|gb|EJG00040.1| glycoside hydrolase family 3 protein [Flavobacterium sp. F52]
Length = 875
Score = 284 bits (727), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 162/427 (37%), Positives = 235/427 (55%), Gaps = 35/427 (8%)
Query: 31 DPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEA 90
DPK PF L +RV +L+ +L+L+EKV +++ A A+PRLGI Y+WW+E
Sbjct: 24 DPK------FPFQNTDLTFEERVENLVSQLTLEEKVAQMLNAAPAIPRLGIPAYDWWNET 77
Query: 91 LHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA-- 148
LHGV+ T T FPQ I A++F+ + + E RA+YN
Sbjct: 78 LHGVARTPFKT---------TVFPQAIAMAATFDKNSLFKMADYSALEGRAIYNKAVELN 128
Query: 149 -------GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVA 201
GLTYW+PN+NIFRDPRWGRGQET GEDP L+ ++V+GLQG D LK A
Sbjct: 129 RTKERYLGLTYWTPNINIFRDPRWGRGQETYGEDPYLTAVLGDAFVKGLQGDDPKYLKAA 188
Query: 202 ASCKHFTAYDLDNWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQV 259
A KH+ + +G + R F+ V+ ++ DT+ F+ V KVA VMC+YN
Sbjct: 189 ACAKHYAVH-----SGPESLRHTFDVDVTPYELWDTYLPAFKKLVTNSKVAGVMCAYNAF 243
Query: 260 NGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDL 319
P CA ++ +R +W+ GY+ SDC ++ ++ A+ADA+ G D+
Sbjct: 244 RTQPCCASDILMNDILRNQWKFTGYVTSDCWAIDDFFKNHKTHPDAASASADAVLHGTDI 303
Query: 320 DCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVC- 378
DCG AV+ G ++E I+ ++ ++ RLGMFD P S P V
Sbjct: 304 DCGTDAYKSLVQAVKNGQITEKQIDVSVKRLFMIRFRLGMFD--PVSMVKYAQTPSSVLE 361
Query: 379 TPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGY 438
+ +H+E AL+ ARQ IVLLKN+ +LPLS + + + V+GPN+D +++++GNY G
Sbjct: 362 SEEHKEHALKMARQSIVLLKNEKNTLPLSK-KLKKIVVLGPNADNSISILGNYNGTPSKL 420
Query: 439 TTPLQGI 445
TT LQGI
Sbjct: 421 TTVLQGI 427
Score = 100 bits (250), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 84/300 (28%), Positives = 132/300 (44%), Gaps = 52/300 (17%)
Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
F I+ + ADA I G+ +E E + DR +LLP Q L+ K +
Sbjct: 596 FANLIERHKNADAFIFAGGISPQLEGEEMPVDAPGFNGGDRTSILLPEVQTRLL-KALQS 654
Query: 518 SKGPTILVLMSGGPIDVAF-AKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPM 576
S P + ++M+G I V + A+N P I I W G GQ+ GTA AD++FG NP G+LP+
Sbjct: 655 SGKPVVFLIMTGSAIAVPWEAENIPAILNI-WYG--GQSAGTASADVIFGDYNPAGRLPV 711
Query: 577 TWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
T+Y + + + +TYR++KG +Y FG+G+SYT F ++ P
Sbjct: 712 TFYK---------GDSDLSSFVDYKMDNKTYRYFKGIPLYGFGYGLSYTEFKYSGLKTPD 762
Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTL-LVFSTPP 695
+ G+ + ++ V V N G +G L P
Sbjct: 763 KIK--------------KGQPVTIS------------VKVTNTGKMEGEEVAQLYLINPN 796
Query: 696 AGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTK 755
+P K L FE+ ++ G V + + LS V SG + G+ I +GG++
Sbjct: 797 TSIKSPLKSLKGFERFNLKPGQSTVVNFTLS-PEDLSYVTESGNLKPYEGKIQIAVGGSQ 855
>gi|262381651|ref|ZP_06074789.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
2_1_33B]
gi|262296828|gb|EEY84758.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
2_1_33B]
Length = 868
Score = 284 bits (726), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 164/439 (37%), Positives = 239/439 (54%), Gaps = 35/439 (7%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
PF LP+ +R++DL+ RL+ +EK+ +++ A+ RLGI Y+WW+EALHGV+ G
Sbjct: 27 PFRNPELPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVARAG-- 84
Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTY 152
AT FPQ I A++F+ +VSDEARA Y+ GLT+
Sbjct: 85 --------RATVFPQAIAMAATFDDNAVHETFTIVSDEARAKYHQYQKDKEYDRYKGLTF 136
Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL 212
W+PN+NIFRDPRWGRG ET GEDP L+ K + RGLQG D + K A KH+ +
Sbjct: 137 WTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQGDDPNYYKTHACAKHYAVHSG 196
Query: 213 DNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILK 272
WN R FNA+ + +D+ +T+ F V EG V VMC+YN+ G P C+ +L
Sbjct: 197 PEWN---RHEFNAEATPRDLYETYLPAFEALVKEGDVQEVMCAYNRFEGKPCCSSDKLLI 253
Query: 273 RTIRGEWRLNGYIVSDCDSVGVYY----DTQHFTSTP--EEAAADAIRAGLDLDC-GPFL 325
+R W + I+SDC ++ ++ +T + P E A+ADA+ G DL+C G +
Sbjct: 254 DILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETHPDAESASADAVLNGTDLECGGSYR 313
Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
L+ A+ G +SE D++ +L L + LGMFD + PY + V +P+H
Sbjct: 314 ALN--KALADGKISEKDLDVSLRRLLKGRFELGMFDPD-ERVPYSKIPYSVVESPEHIAK 370
Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
AL+ AR+ IVLLKN+ LPL + +AV+GPN+ + + NY G T ++GI
Sbjct: 371 ALDMARKSIVLLKNKNNMLPLDK-NIKKIAVVGPNAADSTMLWANYNGFPSKTVTIVEGI 429
Query: 446 GRY---ARTIHQQGCKDVA 461
A I++ GC A
Sbjct: 430 RNKVPNAEVIYELGCNHTA 448
Score = 127 bits (319), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 90/309 (29%), Positives = 146/309 (47%), Gaps = 49/309 (15%)
Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
F A + AD + V G+ +E E + DR + +P Q+E+V K +A
Sbjct: 595 FQATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEMV-KALVA 653
Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
+ P + V+ +G + + + +ND + AI+ A Y GQ GGTA+AD+LFG NP G+LP+T
Sbjct: 654 TGKPVVYVVCTGSALALNW-END-HVNAILNAWYGGQEGGTAVADVLFGDYNPAGRLPIT 711
Query: 578 WYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
+Y + LP + +M+ GRTYR+ +YPFG+G+SYT F +
Sbjct: 712 FYKS--VDQLPDFQDYSMK--------GRTYRYMTQTPLYPFGYGLSYTTFDYK------ 755
Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA 696
NA +S I + ++ + D+ N G DG ++ P
Sbjct: 756 -------------NAKLSKDKI------ASNESVTLSFDIANTGKMDGDEVAQIYIKNPN 796
Query: 697 GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKH 756
P K + AF++V+V AG++Q V I + + S D + T + G++ I GG+
Sbjct: 797 DPAGPLKAMKAFKRVNVKAGSEQPVSIQLEPKAFQSFNDNTQTMEVRPGKYQILYGGSSD 856
Query: 757 SVSLHAATL 765
+L L
Sbjct: 857 DKTLKKIDL 865
>gi|225873993|ref|YP_002755452.1| beta-xylosidase B [Acidobacterium capsulatum ATCC 51196]
gi|225791521|gb|ACO31611.1| beta-xylosidase B [Acidobacterium capsulatum ATCC 51196]
Length = 894
Score = 284 bits (726), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 156/419 (37%), Positives = 229/419 (54%), Gaps = 31/419 (7%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
+ SLP R DL+ R++L+EK L++ A A+PRL + Y WWSEALHGV+
Sbjct: 39 YLNPSLPPVVRARDLVSRMTLKEKASQLVNAARAIPRLKVPAYNWWSEALHGVA------ 92
Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYW 153
G T FP+ I ++F+ + + E R +Y GL +W
Sbjct: 93 -----VNGTTEFPEPIGLGATFDVPAIHEMAVDIGTEGRVVYEENEKDGSSKIFHGLDFW 147
Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLD 213
+PN+NIFRDPRWGRGQET GEDP L+GK ++V G+QG + +V A+ KHF D+
Sbjct: 148 APNLNIFRDPRWGRGQETYGEDPFLTGKMGVAFVSGMQGDNPKYYRVIATPKHF---DVH 204
Query: 214 NWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKR 273
+ R + VS D DT++ FR +M+G SVMCSYN +NG P CA+ L+
Sbjct: 205 SGPEPTRHFADVDVSLHDQLDTYEPAFRAAIMQGHADSVMCSYNAINGQPACANQFTLQH 264
Query: 274 TIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE--- 330
+RG W GY+VSDCD+V Y + T +AAA ++ G+D DC F +
Sbjct: 265 QLRGAWGFKGYVVSDCDAVHDIYSGHKYRPTLAQAAAISMERGMDNDCADFAQPKGDDDY 324
Query: 331 ----SAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELA 386
AVQ+G LS+ ++ ALV T +++LG+FD + PY ++ +P H+ A
Sbjct: 325 KAYIDAVQQGYLSQQAMDTALVRLFTARIKLGLFDPK-GMDPYADTPHSELNSPAHRAYA 383
Query: 387 LEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
+ A + +VLLKN G +LPL ++AV+GP +D T ++GNY G+ + L+G+
Sbjct: 384 RKLADESMVLLKNDG-TLPLKPGSVHSIAVVGPLADQTAVLLGNYNGVPTHTVSFLEGL 441
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 80/282 (28%), Positives = 133/282 (47%), Gaps = 48/282 (17%)
Query: 482 ILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGP 531
I V+G+ +E E + DR L +P ++ LV V+ K P ++VLM+G
Sbjct: 627 IAVVGITSKLEGEEMPVDQPGFLGGDRTNLQMPEPEEALVEAVAKTGK-PVVVVLMNGSA 685
Query: 532 IDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTE 591
+ V + A++ A Y G+ GG AIAD L G ++P G+LP+T+Y + + LP E
Sbjct: 686 LAVNWISQ--HANAVLEAWYSGEEGGAAIADTLSGKNDPAGRLPVTFY--KSVNQLPNFE 741
Query: 592 MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINA 651
RTYR++KG +YPFG+G+SYT F ++ + P H +++A
Sbjct: 742 -------DYSMENRTYRYFKGKPLYPFGYGLSYTTFRYSDLSIP----------HATVDA 784
Query: 652 TISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKV 711
G+ ++ + V N G G + ++ P AP L F+++
Sbjct: 785 ---GQPVEAS------------ATVTNTGKVAGDEVVQLYLKFPKVDGAPDIALRGFQRI 829
Query: 712 HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
H+ G Q+V + + LS+V G + G++ + IGG
Sbjct: 830 HLEPGQSQQVHFELK-KRDLSMVTALGQIIVAQGDYTLSIGG 870
>gi|86143269|ref|ZP_01061671.1| beta-glucosidase precursor [Leeuwenhoekiella blandensis MED217]
gi|85830174|gb|EAQ48634.1| beta-glucosidase precursor [Leeuwenhoekiella blandensis MED217]
Length = 873
Score = 284 bits (726), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 166/428 (38%), Positives = 241/428 (56%), Gaps = 29/428 (6%)
Query: 28 FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
F C + + PF L + R+NDL+ R++L+EK+ L+S A A+ RL I Y WW
Sbjct: 12 FCCCIQIEAQKQFPFQNEQLDLETRLNDLVSRMTLEEKISQLMSDAPAIERLNIPKYNWW 71
Query: 88 SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT 147
+E+LHGV+ G AT FPQ I+ A+S++A L + +SDEARA ++
Sbjct: 72 NESLHGVARAGY----------ATVFPQSISIAASWDAQLVREVATAISDEARAKHHEYL 121
Query: 148 --------AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLK 199
GLT WSPN+NIFRDPRWGRG ET GEDP L+G A YV+GLQG D + LK
Sbjct: 122 RRDQHDIYQGLTMWSPNINIFRDPRWGRGHETYGEDPFLTGTLGAQYVKGLQGDDPEYLK 181
Query: 200 VAASCKHFTAYDLDNWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYN 257
V A+ KHF + +G + R +F+A S++D+ +T+ FRM V + +V SVM +YN
Sbjct: 182 VVATAKHFAVH-----SGPEESRHYFDANTSERDLWETYLPAFRMLVKDAQVQSVMTAYN 236
Query: 258 QVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGL 317
+ G ++ +L +R +W +GY+VSDC ++ ++ T+ A+A A+ G
Sbjct: 237 RFRGEAASSN-KLLFDILRNKWGFDGYVVSDCGAINDIWEDHKITADAASASALALETGT 295
Query: 318 DLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDV 377
DL+CG E A+ GL++E IN A+ +++LGMFD E + Y +
Sbjct: 296 DLNCGATYKSLKE-AIANGLITEEKINIAIERLFRARLKLGMFDTE-ENLSYATIPFSVN 353
Query: 378 CTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACG 437
H LA +AA++ IVLLKN+ LPLS + +AVIGPN+ ++ GNY G
Sbjct: 354 TNASHTALARKAAQESIVLLKNEAHMLPLSK-DLKQIAVIGPNAHNVQSLWGNYNGTPKN 412
Query: 438 YTTPLQGI 445
T +QGI
Sbjct: 413 PVTVVQGI 420
Score = 147 bits (370), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 101/315 (32%), Positives = 155/315 (49%), Gaps = 59/315 (18%)
Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
A++ + +D TILV+GL++ +E E + DR L LP Q+EL+ + +A+
Sbjct: 592 AVNLAEDSDVTILVLGLNERLEGEEMRIDVEGFSKGDRTALDLPLEQRELM-RALVATGK 650
Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
P +LVL++G + + +A+ + AI+ AGYPGQ GG AIAD+LFG NP G+LP+T+Y
Sbjct: 651 PIVLVLLNGSALAINYAQE--HVPAILSAGYPGQEGGNAIADVLFGDYNPAGRLPVTYYK 708
Query: 581 QEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
+ +LP + +M+ GRTYR+++G +YPFG+G+SYT F
Sbjct: 709 S--VDDLPDFEDYSMK--------GRTYRYFEGEALYPFGYGLSYTQF------------ 746
Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTL-LVFSTPPAGH 698
S AIK + L VQV V N G +DG + L A
Sbjct: 747 --------------SYDAIKTSGRLAADKVLNVQVTVTNSGDRDGDEVVQLYLKDEVAST 792
Query: 699 WAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG----- 753
P QLV F+++H+ G Q V + ++ + D+ P G ++ GG
Sbjct: 793 TRPQVQLVGFKRIHLQKGETQTVEFRLDARQFSMINDQEQLVVEP-GWFTLYAGGGQPNK 851
Query: 754 --TKHSVSLHAATLG 766
T+ SVS+ G
Sbjct: 852 KQTESSVSMRYEVTG 866
>gi|393773722|ref|ZP_10362116.1| glycosyl hydrolase family 3 N terminal domain protein
[Novosphingobium sp. Rr 2-17]
gi|392720897|gb|EIZ78368.1| glycosyl hydrolase family 3 N terminal domain protein
[Novosphingobium sp. Rr 2-17]
Length = 846
Score = 283 bits (725), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 184/465 (39%), Positives = 252/465 (54%), Gaps = 38/465 (8%)
Query: 57 IGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQV 116
I + L EKV L S A A +G+ Y+WW+E LHG++ G AT FPQ
Sbjct: 17 IAKADLAEKVGQLQSTAPAAKAMGLPAYDWWNEGLHGLARNGE----------ATVFPQA 66
Query: 117 ITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRDPRWGRG 168
I A++F+ L E +G V+S EARA +N GLT WSPN+NIFRDPRWGRG
Sbjct: 67 IGLAATFDPVLMEKVGTVISTEARAKFNTRPVNADRRIYEGLTIWSPNINIFRDPRWGRG 126
Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVS 228
QET GEDP LSG A +++GLQG+D +V A+ KHF + R F+ VS
Sbjct: 127 QETYGEDPFLSGTLAVGFIKGLQGTDPAHPRVIATPKHFAVHSGPE---AGRDSFDVDVS 183
Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
QD E T+ FR V EG+ SVMC+YN +G P CAD ++ + +R +W G++VSD
Sbjct: 184 PQDREATYLPAFRKAVTEGRPLSVMCAYNSTHGTPVCADDALINQRLRKDWGFKGFVVSD 243
Query: 289 CDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALV 348
CD++ + H+ + EA+A +I+AG D DCG + AV+RGL+ E I+ AL
Sbjct: 244 CDAIANVWMFHHYAADAAEASAASIKAGTDFDCGNTYAALPQ-AVERGLVDEATIDRALA 302
Query: 349 NTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSH 408
+L + LG+ G ++ P+G + P V T + LALEAAR+ IVLL+N LPL
Sbjct: 303 RSLQARHSLGIAFG--AANPWGRIKPSQVHTLASRALALEAARKSIVLLQNDNARLPLKP 360
Query: 409 IRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLF 468
+AV+G N+D + +Y G A TPLQGI R + G +VA A +
Sbjct: 361 --GTKLAVVGANADDLSVIEASYHGTAADPITPLQGIRR------RFGAANVAYAQGSVL 412
Query: 469 --GAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELV 511
GAA+ A A+ GL + E LD G ++ RQ V
Sbjct: 413 AEGAAVIVPETALASQGKPGL----KGEYLDATGKVVLTRQDRRV 453
Score = 132 bits (331), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 98/301 (32%), Positives = 153/301 (50%), Gaps = 63/301 (20%)
Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
AI A+R ADA + V+GL +E EAL DR+ + LP Q +L+ + K
Sbjct: 565 AIAATRNADAIVAVLGLSPDLEGEALQVQVPGFVGGDRSDIALPRPQAQLLEALHATGK- 623
Query: 521 PTILVLMSGGPIDVAFAKNDPRIA-AIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWY 579
P I+VL SG I + DP++A A++ A YPG+ GGTA+A+ L G +NP G+LP+T+Y
Sbjct: 624 PVIVVLTSGSAIAI-----DPKLADAVLAAWYPGEEGGTALAETLGGINNPSGRLPVTFY 678
Query: 580 PQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
++LP + +M+ R+YR++ G ++ FGHG+SYTNF + +AP V
Sbjct: 679 --RSTSDLPAYVDYSMKE--------RSYRYFTGTPLWGFGHGLSYTNFGY---DAPKAV 725
Query: 639 AVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGH 698
A + G + VT V V N G + G + + PP
Sbjct: 726 ATGI------------GDPVSVT------------VTVHNTGVRAGEDVVQAYVVPPTIE 761
Query: 699 WAP-------HKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHI 751
P +QL AF+++ + G + + +N+ + +SVVDR G RR+ GE+ + +
Sbjct: 762 PEPIMTQAVLQRQLAAFQRIALAPGQARAITLNLD-PREISVVDRRGIRRLVPGEYRVWV 820
Query: 752 G 752
G
Sbjct: 821 G 821
>gi|237721201|ref|ZP_04551682.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
gi|229448997|gb|EEO54788.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
Length = 863
Score = 283 bits (724), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 170/417 (40%), Positives = 241/417 (57%), Gaps = 31/417 (7%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
P+ L QR +DL+ RL+L+EKV L+ + + A+PRLGIK YEWW+EALHGV+ G
Sbjct: 27 PYQDTKLTAEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARAGL- 85
Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA---MYNGG-----TAGLTY 152
AT FPQ I A+SFN L + VSDEARA +N GLT
Sbjct: 86 ---------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNEKGQYKRYQGLTM 136
Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYD 211
W+PNVNIFRDPRWGRGQET GEDP LSG+ + VRGLQG D + K+ A KHF +
Sbjct: 137 WTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYDKLHACAKHFAVHS 196
Query: 212 LDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNI 270
WN R FNA+ ++ +D+ +T+ F+ V + V VMC+YN+ G P C +
Sbjct: 197 GPEWN---RHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYNRFEGDPCCGSNRL 253
Query: 271 LKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPE--EAAADAIRAGLDLDCGPFLGLH 328
L + +R +W G +V+DC ++G ++ + + P+ A+ADA+ +G DL+CG
Sbjct: 254 LTQILRNDWGFKGIVVTDCGAIGDFFQRKKHETHPDAAHASADAVLSGTDLECGGNFKSI 313
Query: 329 TESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALE 388
T+ AV++GL+SE IN ++ L + LG + S+ P+ ++ + P H+ELAL+
Sbjct: 314 TD-AVKKGLISEEKINTSVKRLLKARFELGEMN---STHPWSNIPFSVIDCPKHKELALK 369
Query: 389 AARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
A + +VLL+N + L R VAVIGPN++ +V GNY G T L+GI
Sbjct: 370 MAHESLVLLQNN--NNLLPLNRQMKVAVIGPNANDSVMQWGNYNGFPSHTVTLLEGI 424
Score = 115 bits (287), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 90/307 (29%), Positives = 140/307 (45%), Gaps = 49/307 (15%)
Query: 472 IDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGP 521
++ + AD I G+ +E E++ DR + LP Q+E+++ + K
Sbjct: 594 LNKLQSADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGK-K 652
Query: 522 TILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQ 581
T+ V SG + A AI+ A YPGQAGGTA+AD+LFG NP G+LP+T+Y
Sbjct: 653 TVFVNFSGSAM--AIVPETQNCDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITFY-- 708
Query: 582 EYITNLPMTE-MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
+ + LP E +M+ GRTYRF +YPFG+G+SYT F + A
Sbjct: 709 KSMQQLPDYEDYSMK--------GRTYRFMTKTPLYPFGYGLSYTRFSYGKA-------- 752
Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA 700
++N + K+T + LT + V NVG +DG + V+ P
Sbjct: 753 -------TLNQS------KLTKGEKAILT----IPVSNVGQRDGEEVVQVYICRPDDKEG 795
Query: 701 PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
P K L F++V + G Q V I + + + T R G + I G + + L
Sbjct: 796 PQKTLRGFQRVSIAKGKTQNVQIELPYDSFEWFDAATNTIRPLNGTYKILYGNSSNEKDL 855
Query: 761 HAATLGV 767
++ +
Sbjct: 856 QTCSIQI 862
>gi|150007848|ref|YP_001302591.1| glycoside hydrolase family protein [Parabacteroides distasonis ATCC
8503]
gi|301310124|ref|ZP_07216063.1| beta-glucosidase [Bacteroides sp. 20_3]
gi|423336365|ref|ZP_17314112.1| hypothetical protein HMPREF1059_00064 [Parabacteroides distasonis
CL09T03C24]
gi|149936272|gb|ABR42969.1| glycoside hydrolase family 3, candidate beta-glycosidase
[Parabacteroides distasonis ATCC 8503]
gi|300831698|gb|EFK62329.1| beta-glucosidase [Bacteroides sp. 20_3]
gi|409240840|gb|EKN33614.1| hypothetical protein HMPREF1059_00064 [Parabacteroides distasonis
CL09T03C24]
Length = 868
Score = 283 bits (724), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 164/439 (37%), Positives = 239/439 (54%), Gaps = 35/439 (7%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
PF LP+ +R++DL+ RL+ +EK+ +++ A+ RLGI Y+WW+EALHGV+ G
Sbjct: 27 PFRNPDLPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVARAG-- 84
Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTY 152
AT FPQ I A++F+ +VSDEARA Y+ GLT+
Sbjct: 85 --------RATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQYQKDKEYDRYKGLTF 136
Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL 212
W+PN+NIFRDPRWGRG ET GEDP L+ K + RGLQG D + K A KH+ +
Sbjct: 137 WTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQGDDPNYYKTHACAKHYAVHSG 196
Query: 213 DNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILK 272
WN R FNA+ + +D+ +T+ F V EG V VMC+YN+ G P C+ +L
Sbjct: 197 PEWN---RHEFNAEATPRDLYETYLPAFEALVKEGDVQEVMCAYNRFEGKPCCSSDKLLI 253
Query: 273 RTIRGEWRLNGYIVSDCDSVGVYY----DTQHFTSTP--EEAAADAIRAGLDLDC-GPFL 325
+R W + I+SDC ++ ++ +T + P E A+ADA+ G DL+C G +
Sbjct: 254 DILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETHPDAESASADAVLNGTDLECGGSYR 313
Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
L+ A+ G +SE D++ +L L + LGMFD + PY + V +P+H
Sbjct: 314 ALN--KALADGKISEKDLDVSLRRLLKGRFELGMFDPD-ERVPYSKIPYSVVESPEHIAK 370
Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
AL+ AR+ IVLLKN+ LPL + +AV+GPN+ + + NY G T ++GI
Sbjct: 371 ALDMARKSIVLLKNKNNMLPLDK-NIKKIAVVGPNAADSTMLWANYNGFPTKTVTIVEGI 429
Query: 446 GRY---ARTIHQQGCKDVA 461
A I++ GC A
Sbjct: 430 RNKVPNAEVIYELGCNHTA 448
Score = 127 bits (319), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 90/309 (29%), Positives = 146/309 (47%), Gaps = 49/309 (15%)
Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
F A + AD + V G+ +E E + DR + +P Q+E+V K +A
Sbjct: 595 FQATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEMV-KALVA 653
Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
+ P + V+ +G + + + +ND + AI+ A Y GQ GGTA+AD+LFG NP G+LP+T
Sbjct: 654 TGKPVVYVVCTGSALALNW-END-HVNAILNAWYGGQEGGTAVADVLFGDYNPAGRLPIT 711
Query: 578 WYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
+Y + LP + +M+ GRTYR+ +YPFG+G+SYT F +
Sbjct: 712 FYKS--VDQLPDFQDYSMK--------GRTYRYMTQTPLYPFGYGLSYTTFDYK------ 755
Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA 696
NA +S I + ++ + D+ N G DG ++ P
Sbjct: 756 -------------NAKLSKDKI------ASNESVTLSFDIANTGKMDGDEVAQIYIKNPN 796
Query: 697 GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKH 756
P K + AF++V+V AG++Q V I + + S D + T + G++ I GG+
Sbjct: 797 DPAGPLKAMKAFKRVNVKAGSEQPVSIQLEPKAFQSFNDNTQTMEVRPGKYQILYGGSSD 856
Query: 757 SVSLHAATL 765
+L L
Sbjct: 857 DKTLKKIDL 865
>gi|254295141|ref|YP_003061164.1| glycoside hydrolase [Hirschia baltica ATCC 49814]
gi|254043672|gb|ACT60467.1| glycoside hydrolase family 3 domain protein [Hirschia baltica ATCC
49814]
Length = 897
Score = 283 bits (724), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 167/439 (38%), Positives = 241/439 (54%), Gaps = 45/439 (10%)
Query: 33 KDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALH 92
++A + F SL +R DL+ ++L+EK + AAA+PRLG+ Y WW+EALH
Sbjct: 35 EEAKSSEFRFMDPSLSPKERALDLVSHMTLEEKAAQMYDKAAAIPRLGLHEYNWWNEALH 94
Query: 93 GVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN-------- 144
GV+ G AT FPQ I A++++ L + V+SDE RA ++
Sbjct: 95 GVARAG----------HATVFPQAIGMAATWDEDLMLEVANVISDEGRAKHHFYANEDVY 144
Query: 145 GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASC 204
GLT+WSPN+NIFRDPRWGRGQET GEDP L+G+ A +++ GLQG D K A+
Sbjct: 145 AMYGGLTFWSPNINIFRDPRWGRGQETYGEDPYLTGRMAVNFINGLQGDDDKYFKSVATV 204
Query: 205 KHFTAYDLDNWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGV 262
KH+ + +G + R N + D+ +T+ F+ E +VASVMC+YN V G
Sbjct: 205 KHYAVH-----SGPEPSRHRDNYIATDADLYETYLPAFKTAFDETEVASVMCAYNAVWGD 259
Query: 263 PTCADPNILKRTIRGEWRLNGYIVSDCDSVG-VYYDTQH------------FTSTPEEAA 309
P C ++K +R E +GY+VSDC ++G YYD + T +AA
Sbjct: 260 PACGSERLMKDLLREELGFDGYVVSDCGAIGDFYYDEEKKAEGTAPYAAHDHVDTRAQAA 319
Query: 310 ADAIRAGLDLDCGPFLGLHTES---AVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSS 366
A ++ G DL+CG G ++ AV+ GL++E I+ ++V + +LGM+D +PS
Sbjct: 320 ALSVNMGTDLNCGDGEGNKMDALPQAVKEGLITEETIDQSVVRLYSALFKLGMYD-DPSL 378
Query: 367 QPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVT 426
P+ ++ V +P H E + EAAR +VLLKN G LPL VAVIGPN+D T
Sbjct: 379 VPWSNISIDTVASPSHLEKSEEAARASLVLLKNDG-ILPLK--PDTKVAVIGPNADNWWT 435
Query: 427 MIGNYAGIACGYTTPLQGI 445
++ NY G T L+GI
Sbjct: 436 LVANYYGQPTAPVTALKGI 454
Score = 99.4 bits (246), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 79/286 (27%), Positives = 127/286 (44%), Gaps = 54/286 (18%)
Query: 482 ILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGP 531
+ G+D ++E E + DR + LP Q++L+ ++ K P +LV SG
Sbjct: 634 LFFGGIDANLEGEEMGVELDGFLGGDRTHINLPAPQEKLLKELHATGK-PVVLVNFSGSA 692
Query: 532 IDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLP-MT 590
+ A D + AI+ A YPG+ GTAIAD+L+G +P G+LP+T+Y + +P
Sbjct: 693 M--ALNWEDENLPAIVQAFYPGEKSGTAIADLLWGEFSPSGRLPVTFYKS--LEGMPAFD 748
Query: 591 EMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSIN 650
+ +M RTY++Y+G +YPFGHG+SYT+F ++
Sbjct: 749 DYSME--------NRTYKYYEGEQLYPFGHGLSYTSFEYS-------------------- 780
Query: 651 ATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPP--AGHWAPHKQLVAF 708
+K+ A L V V V N G K + + T A P +L AF
Sbjct: 781 ------DLKLETAYAANENLQVSVKVTNSGDKASREIVQAYVTRDTLANVSTPRVELAAF 834
Query: 709 EKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPL-GEHNIHIGG 753
+ + + Q V ++I + + +G P G + IGG
Sbjct: 835 DAIELAPKESQTVTLSIK-PDAIGYFNENGKLTFPEDGSFTLSIGG 879
>gi|393720890|ref|ZP_10340817.1| beta-glucosidase [Sphingomonas echinoides ATCC 14820]
Length = 563
Score = 283 bits (724), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 168/423 (39%), Positives = 237/423 (56%), Gaps = 34/423 (8%)
Query: 60 LSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITT 119
L+ +K L S A A P G+ Y+WW+E LHG++ G AT FPQ I
Sbjct: 20 LTTAQKAAQLQSSAPADPAAGLPAYDWWNEGLHGLARNG----------YATVFPQAIGM 69
Query: 120 ASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRDPRWGRGQET 171
A++++A L IG VV+ EARA +N GLT WSPN+NIFRDPRWGRGQET
Sbjct: 70 AATWDAPLLNRIGEVVATEARAKFNAQPVAADRKIYQGLTIWSPNINIFRDPRWGRGQET 129
Query: 172 PGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQD 231
GEDP L+G +++ GLQG D KV A+ KHF + R F+ S QD
Sbjct: 130 YGEDPYLTGHLGVAFITGLQGPDPKHPKVIATPKHFVVHSGPE---AGRDGFDVDPSPQD 186
Query: 232 IEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDS 291
+E T+ FR+ V EGK S+MC+YN ++GVP CA+ ++ +R +W G+ VSDCD+
Sbjct: 187 LEATYLPAFRLAVTEGKAQSLMCAYNAIHGVPACANAPLMVDRLRHDWGFTGFTVSDCDA 246
Query: 292 VGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTL 351
V + H+ AAA +++ G DL+CG E A+QRGL++ DI+ A +
Sbjct: 247 VANVHLFHHYRLDAAGAAAASLKGGTDLNCGTTYAALPE-ALQRGLVTSQDIDTAFGRAM 305
Query: 352 TVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRH 411
+ LG+ G ++ P+ + P+D+ TP+H +ALEAAR+ IVLLKN+G LPL
Sbjct: 306 AARKALGIAYG--ATSPWARISPRDIGTPEHHAVALEAARKAIVLLKNEGDRLPLK--TG 361
Query: 412 RTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLF--G 469
+A+IG ++D + GNY G A TPL+G+ + Q G ++V A L G
Sbjct: 362 TRIALIGADADDIGVLQGNYHGTAIAPVTPLEGLRQ------QFGAENVLYAQGSLLADG 415
Query: 470 AAI 472
AA+
Sbjct: 416 AAV 418
>gi|393782428|ref|ZP_10370612.1| hypothetical protein HMPREF1071_01480 [Bacteroides salyersiae
CL02T12C01]
gi|392673256|gb|EIY66719.1| hypothetical protein HMPREF1071_01480 [Bacteroides salyersiae
CL02T12C01]
Length = 596
Score = 283 bits (724), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 195/621 (31%), Positives = 315/621 (50%), Gaps = 64/621 (10%)
Query: 150 LTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTA 209
+TYWSPNVNIFRDPRWGRGQET GEDP L+ + +YVRGLQG+D LK AA KH+
Sbjct: 1 MTYWSPNVNIFRDPRWGRGQETYGEDPYLTAEIGKAYVRGLQGNDPFFLKAAACAKHYAV 60
Query: 210 YDLDNWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCAD 267
+ +G + R FNA SK+D+ +T+ F V E KV +VM +YN+V G
Sbjct: 61 H-----SGPEALRHEFNASPSKRDLFETYLPAFEALVKEAKVEAVMGAYNRVYGESASGS 115
Query: 268 PNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC-GPFLG 326
+L +R +W G++VSDC +V Y EA+A A+++GL+L+C G F
Sbjct: 116 FFLLTDILRKKWGFKGHVVSDCGAVDDIYGGHKIAKDVAEASAIALKSGLNLNCGGSFHA 175
Query: 327 LHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELA 386
L + A++R L++E+D++NAL+ + +++LG + S PY ++ + + H +A
Sbjct: 176 L--KEALERKLITEVDLDNALMPLMMTRLKLGNLTDDDES-PYKNISDSVIASYTHAMVA 232
Query: 387 LEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIG 446
E A++ +VLLKN +LPL +T+ V GP + T M+GNY G++ T LQGI
Sbjct: 233 REVAQKSMVLLKNNNHTLPLKK-DVKTIFVTGPYAADTYVMMGNYYGVSPRSNTFLQGIA 291
Query: 447 RYAR---TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGL---DQSIEAEAL---- 496
+I+ + + + R A+ I+V+GL D+ E +A+
Sbjct: 292 AKVSGGTSINYKIGILPTTPNMNPADWTVGEVRAAEVAIVVIGLSGIDEGEEGDAIASSH 351
Query: 497 --DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQ 554
D+ L LP Q + + +S + V+ G PID+ + A+I A YPGQ
Sbjct: 352 RGDKQNLKLPEHQLKFLRDISRNRWNKLVTVITGGSPIDL--EEVSELSDAVIMAWYPGQ 409
Query: 555 AGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPV 614
GG A+ D+LFG + G++P+T+ P+ + + GRTY++ +
Sbjct: 410 EGGMALGDLLFGDVSFSGRMPVTF---------PINSDWLPAFEDYNMQGRTYKYMTDNI 460
Query: 615 VYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCN-RLTLGVQ 673
+YPFG+G++Y + +S +K+ + K + + + VQ
Sbjct: 461 MYPFGYGLTYGD--------------------------VSYSDVKILNPKYDGKQEIHVQ 494
Query: 674 VDVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLS 732
++N G+ + + L S P AG P L+ F++V + + Q V I + L
Sbjct: 495 ATLRNNGNNEVEEVVQLYLSAPGAGVITPISSLIGFKRVTLESHLSQTVEFIIKPDQ-LK 553
Query: 733 VVDRSGTRRIPLGEHNIHIGG 753
+V G++ + G++ I + G
Sbjct: 554 MVMEDGSKNLLKGKYTIIVSG 574
>gi|256840106|ref|ZP_05545615.1| glycoside hydrolase family beta-glycosidase [Parabacteroides sp.
D13]
gi|256739036|gb|EEU52361.1| glycoside hydrolase family beta-glycosidase [Parabacteroides sp.
D13]
Length = 868
Score = 283 bits (724), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 164/439 (37%), Positives = 239/439 (54%), Gaps = 35/439 (7%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
PF LP+ +R++DL+ RL+ +EK+ +++ A+ RLGI Y+WW+EALHGV+ G
Sbjct: 27 PFRNPDLPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVARAG-- 84
Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTY 152
AT FPQ I A++F+ +VSDEARA Y+ GLT+
Sbjct: 85 --------RATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQYQKDKEYDRYKGLTF 136
Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL 212
W+PN+NIFRDPRWGRG ET GEDP L+ K + RGLQG D + K A KH+ +
Sbjct: 137 WTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQGDDPNYYKTHACAKHYAVHSG 196
Query: 213 DNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILK 272
WN R FNA+ + +D+ +T+ F V EG V VMC+YN+ G P C+ +L
Sbjct: 197 PEWN---RHEFNAEATPRDLYETYLPAFEALVKEGDVQEVMCAYNRFEGKPCCSSDKLLI 253
Query: 273 RTIRGEWRLNGYIVSDCDSVGVYY----DTQHFTSTP--EEAAADAIRAGLDLDC-GPFL 325
+R W + I+SDC ++ ++ +T + P E A+ADA+ G DL+C G +
Sbjct: 254 DILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETHPDAESASADAVLNGTDLECGGSYR 313
Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
L+ A+ G +SE D++ +L L + LGMFD + PY + V +P+H
Sbjct: 314 ALN--KALADGKISEKDLDVSLRRLLKGRFELGMFDPD-ERVPYSKIPYSVVESPEHIAK 370
Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
AL+ AR+ IVLLKN+ LPL + +AV+GPN+ + + NY G T ++GI
Sbjct: 371 ALDMARKSIVLLKNKNNMLPLDK-NIKKIAVVGPNAADSTMLWANYNGFPSKTVTIVEGI 429
Query: 446 GRY---ARTIHQQGCKDVA 461
A I++ GC A
Sbjct: 430 RNKVPNAEVIYELGCNHTA 448
Score = 127 bits (320), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 91/309 (29%), Positives = 146/309 (47%), Gaps = 49/309 (15%)
Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
F A + AD + V G+ +E E + DR + +P Q+E+V K +A
Sbjct: 595 FQATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEMV-KALVA 653
Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
+ P + V+ +G + + + +ND + AI+ A Y GQ GGTA+AD+LFG NP G+LP+T
Sbjct: 654 TGKPVVYVVCTGSALALNW-END-HVNAILNAWYGGQEGGTAVADVLFGDYNPAGRLPIT 711
Query: 578 WYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
+Y + LP + +M+ GRTYR+ +YPFG+G+SYT F +
Sbjct: 712 FYKS--VDQLPDFQDYSMK--------GRTYRYMTQTPLYPFGYGLSYTTFDYK------ 755
Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA 696
NA +S I + ++ + D+ N G DG ++ P
Sbjct: 756 -------------NAKLSKDKI------ASNESVTLSFDIANTGKMDGDEVAQIYIKNPN 796
Query: 697 GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKH 756
P K + AF++V+V AG++Q V I + + S D + T I G++ I GG+
Sbjct: 797 DPAGPLKAMKAFKRVNVKAGSEQPVSIQLEPKAFQSFNDNTQTMEIRPGKYQILYGGSSD 856
Query: 757 SVSLHAATL 765
+L L
Sbjct: 857 DKTLKKIDL 865
>gi|298376791|ref|ZP_06986746.1| beta-glucosidase [Bacteroides sp. 3_1_19]
gi|298266669|gb|EFI08327.1| beta-glucosidase [Bacteroides sp. 3_1_19]
Length = 868
Score = 283 bits (724), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 164/439 (37%), Positives = 239/439 (54%), Gaps = 35/439 (7%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
PF LP+ +R++DL+ RL+ +EK+ +++ A+ RLGI Y+WW+EALHGV+ G
Sbjct: 27 PFRNPELPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVARAG-- 84
Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTY 152
AT FPQ I A++F+ +VSDEARA Y+ GLT+
Sbjct: 85 --------RATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQYQKDKEYDRYKGLTF 136
Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL 212
W+PN+NIFRDPRWGRG ET GEDP L+ K + RGLQG D + K A KH+ +
Sbjct: 137 WTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQGDDPNYYKTHACAKHYAVHSG 196
Query: 213 DNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILK 272
WN R FNA+ + +D+ +T+ F V EG V VMC+YN+ G P C+ +L
Sbjct: 197 PEWN---RHEFNAEATPRDLYETYLPAFEALVKEGDVQEVMCAYNRFEGKPCCSSDKLLI 253
Query: 273 RTIRGEWRLNGYIVSDCDSVGVYY----DTQHFTSTP--EEAAADAIRAGLDLDC-GPFL 325
+R W + I+SDC ++ ++ +T + P E A+ADA+ G DL+C G +
Sbjct: 254 DILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETHPDAESASADAVLNGTDLECGGSYR 313
Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
L+ A+ G +SE D++ +L L + LGMFD + PY + V +P+H
Sbjct: 314 ALN--KALADGKISEKDLDVSLRRLLKGRFELGMFDPD-ERVPYSKIPYSVVESPEHIAK 370
Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
AL+ AR+ IVLLKN+ LPL + +AV+GPN+ + + NY G T ++GI
Sbjct: 371 ALDMARKSIVLLKNKNNMLPLDK-NIKKIAVVGPNAADSTMLWANYNGFPSKTVTIVEGI 429
Query: 446 GRY---ARTIHQQGCKDVA 461
A I++ GC A
Sbjct: 430 RNKVPNAEVIYELGCNHTA 448
Score = 127 bits (318), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 90/309 (29%), Positives = 146/309 (47%), Gaps = 49/309 (15%)
Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
F A + AD + V G+ +E E + DR + +P Q+E+V K +A
Sbjct: 595 FQATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEMV-KALVA 653
Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
+ P + V+ +G + + + +ND + AI+ A Y GQ GGTA+AD+LFG NP G+LP+T
Sbjct: 654 TGKPVVYVVCTGSALALNW-END-HVNAILNAWYGGQEGGTAVADVLFGDYNPAGRLPIT 711
Query: 578 WYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
+Y + LP + +M+ GRTYR+ +YPFG+G+SYT F +
Sbjct: 712 FYKS--VDQLPDFQDYSMK--------GRTYRYMTQTPLYPFGYGLSYTTFDYK------ 755
Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA 696
NA +S I + ++ + D+ N G DG ++ P
Sbjct: 756 -------------NAKLSKDKI------ASNESVTLSFDIANTGKMDGDEVAQIYIKNPN 796
Query: 697 GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKH 756
P K + AF++V+V AG++Q V I + + S D + T + G++ I GG+
Sbjct: 797 DPAGPLKAMKAFKRVNVKAGSEQPVSIQLEPKAFQSFNDNTQTMEVRPGKYQILYGGSSD 856
Query: 757 SVSLHAATL 765
+L L
Sbjct: 857 DKTLKKIDL 865
>gi|393781366|ref|ZP_10369565.1| hypothetical protein HMPREF1071_00433 [Bacteroides salyersiae
CL02T12C01]
gi|392676859|gb|EIY70281.1| hypothetical protein HMPREF1071_00433 [Bacteroides salyersiae
CL02T12C01]
Length = 854
Score = 283 bits (723), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 169/455 (37%), Positives = 256/455 (56%), Gaps = 44/455 (9%)
Query: 5 IAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQE 64
I FF +GL+ + + A KD + ++ P +R+ DL+ +L+++E
Sbjct: 6 ILFFIVGLVCIFTAQ---------AQKGKDV------YLDMNAPQHERILDLLSKLTIEE 50
Query: 65 KVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFN 124
K+ LL + + +PRL I Y +EALHGV V PG T FPQ I A+ +N
Sbjct: 51 KISLLRATSPGIPRLQIDKYYHGNEALHGV--VRPGN--------FTVFPQAIGLAAMWN 100
Query: 125 ATLWEAIGRVVSDEARAMYNGGTAG----------LTYWSPNVNIFRDPRWGRGQETPGE 174
L I +SDEARA +N G LT+WSP VN+ RDPRWGR ET GE
Sbjct: 101 PQLLNEISTAISDEARARWNELEQGKKQLGQFSDLLTFWSPTVNMARDPRWGRTPETYGE 160
Query: 175 DPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIED 234
DP LSGK S+V+GLQG D LK+ ++ KHF A + ++ +RF N +S++D+ +
Sbjct: 161 DPFLSGKLGVSFVKGLQGDDPRYLKIVSTPKHFAANNEEH----NRFECNPIISEKDLRE 216
Query: 235 TFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGV 294
+ F C++EGK AS+M +YN +N VP + +LK+ +R +W +GY+VSDC +
Sbjct: 217 YYLPAFEKCIIEGKAASIMTAYNAINDVPCTLNNWLLKKVLRHDWGFDGYVVSDCGAPDF 276
Query: 295 YYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLH-TESAVQRGLLSEIDINNALVNTLTV 353
+ T E AA +I+AGLDL+CG + + +A ++ +++E +I++A + L
Sbjct: 277 LVTHHKYVKTLEAAATLSIQAGLDLECGDNVYMEPLLNAYKQYMVTEAEIDSAAYHILRA 336
Query: 354 QMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRT 413
+MRLG+FD +P+ PY + P V H +LALEAARQ IVLLKN+ LPL + ++
Sbjct: 337 RMRLGLFD-DPNLNPYNKISPSVVGCEKHSQLALEAARQSIVLLKNEKKFLPLDLKKIKS 395
Query: 414 VAVIGPNSDVTVTMIGNYAGIACGY-TTPLQGIGR 447
+AV+G N+ G+Y+G + L+GI +
Sbjct: 396 IAVVGINA--GNCEFGDYSGTPVNQPVSILEGIKK 428
Score = 147 bits (370), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 98/299 (32%), Positives = 151/299 (50%), Gaps = 45/299 (15%)
Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
L+GAA DA R+ D TI V+G+++SIE E DR + LP QQ + + + T++VL
Sbjct: 591 LYGAAGDAMRKCDLTIAVVGINKSIEREGQDRYSIELPKDQQIFIEEAYKINPN-TVVVL 649
Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
++G + + + D I AI+ A YPG+AGGTA+A++LFG NPGGKLP+T+Y +
Sbjct: 650 VAGSSLAINWM--DEHIPAIVNAWYPGEAGGTAVAEVLFGDYNPGGKLPLTYYRS--LDE 705
Query: 587 LP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGR 645
LP + +R GRTY+F++G +Y FGHG+SYT F
Sbjct: 706 LPAFDDYDIR-------KGRTYQFFEGNPLYAFGHGLSYTTF-----------------S 741
Query: 646 HGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDG---AHTLLVFSTPPAGHWAPH 702
+ +N +G A+KV+ A +KN G DG A + + + P
Sbjct: 742 YKKLNIDSTGDAVKVSFA------------LKNTGKYDGDEVAQLYVKYQGNDSLVKLPL 789
Query: 703 KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
KQL FE+VH+ G +RV + + + + G P G++ +G ++ L
Sbjct: 790 KQLKGFERVHLKKGESKRVTLTVPKSELRFWDEEKGEFYTPAGDYLFMVGTASDAIQLQ 848
>gi|332881172|ref|ZP_08448831.1| glycosyl hydrolase family 3 protein [Capnocytophaga sp. oral taxon
329 str. F0087]
gi|357047867|ref|ZP_09109460.1| glycosyl hydrolase family 3 protein [Paraprevotella clara YIT
11840]
gi|332680886|gb|EGJ53824.1| glycosyl hydrolase family 3 protein [Capnocytophaga sp. oral taxon
329 str. F0087]
gi|355529206|gb|EHG98645.1| glycosyl hydrolase family 3 protein [Paraprevotella clara YIT
11840]
Length = 851
Score = 282 bits (722), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 170/421 (40%), Positives = 239/421 (56%), Gaps = 29/421 (6%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
F + P +R+ DL+ RL+++EK+ LL++ A A+ RLGI Y +EALHGV V PG
Sbjct: 29 FRDMKAPQHERIMDLLSRLTVEEKISLLVNDAPAIGRLGIDKYNHGNEALHGV--VRPG- 85
Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN----------GGTAGLT 151
DF T FPQ I A+ +N L I +SDEAR + G + LT
Sbjct: 86 ----DF---TVFPQAIGMAAMWNPELLYRISSAISDEARGRWKELEYGKKQIAGASDLLT 138
Query: 152 YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYD 211
+WSP VN+ RDPRWGR ET GEDP LSG ++V+GLQG LK ++ KHF +
Sbjct: 139 FWSPTVNMARDPRWGRTPETYGEDPYLSGVLGVAFVKGLQGDHPRYLKTVSTPKHFAVNN 198
Query: 212 LDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
++ +R NAKVS++D+ + + F C+ EGK S+M +YN VN VP + ++
Sbjct: 199 EEH----NRSSCNAKVSERDLREYYLPSFERCITEGKAQSIMMAYNAVNDVPCTVNTYLI 254
Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTE 330
K +RG+W NGYIVSDC + H+ T E AA A++ GLDL+CG G
Sbjct: 255 KNVLRGDWGFNGYIVSDCSAPEWMITKHHYVKTREAAATLAVKVGLDLECGNQVYGEGLL 314
Query: 331 SAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAA 390
A ++ ++SE DI++A L +M LG+FD PS PY + P V HQ+LALEAA
Sbjct: 315 KAYRQYMVSEADIDSAAYRILRGRMMLGLFDA-PSQNPYNQIEPSVVGCKAHQDLALEAA 373
Query: 391 RQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGY-TTPLQGIGRYA 449
RQ +VLLKN+ LPL+ + +++AV+G ++ G+Y+G T L GI +YA
Sbjct: 374 RQSMVLLKNKDNFLPLNPKKVKSIAVVGISAG--HCEFGDYSGTPKNEPVTILDGIKQYA 431
Query: 450 R 450
Sbjct: 432 E 432
Score = 135 bits (339), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 87/295 (29%), Positives = 142/295 (48%), Gaps = 42/295 (14%)
Query: 468 FGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLM 527
FG A + + D T+ V+G+++SIE E DR L LP QQE + ++ + T++VL+
Sbjct: 593 FGDAGKVAAECDVTVAVLGINKSIEREGQDRFSLELPVDQQEFIKELYKVNPN-TVVVLV 651
Query: 528 SGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL 587
+G + V + D + AI+ A YPG+ GG A+A++LFG NPGG+LP+T+Y
Sbjct: 652 AGSSMAVNWM--DENVPAILNAWYPGEQGGNAVAEVLFGDYNPGGRLPLTYYNS------ 703
Query: 588 PMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHG 647
+ E+ + S + GRTY++++G +Y FG+G+SYT F R+
Sbjct: 704 -LDEIPAFDNYSVK--GRTYQYFEGQPLYEFGYGLSYTKF-----------------RYK 743
Query: 648 SINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLV 706
S + +KV+ +V N G DG V+ P G + P KQL
Sbjct: 744 SKGVNVEQDTVKVS------------FEVSNTGKYDGDEVAQVYVKYPETGTYMPLKQLH 791
Query: 707 AFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
F++VH+ G +V I + ++ P GE+ +G + +
Sbjct: 792 GFKRVHIKKGKTSKVTIGVPRKDLRYWYEQERKFITPKGEYTFMVGASSDDIKFR 846
>gi|224535242|ref|ZP_03675781.1| hypothetical protein BACCELL_00103 [Bacteroides cellulosilyticus
DSM 14838]
gi|224523140|gb|EEF92245.1| hypothetical protein BACCELL_00103 [Bacteroides cellulosilyticus
DSM 14838]
Length = 864
Score = 282 bits (722), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 173/447 (38%), Positives = 253/447 (56%), Gaps = 41/447 (9%)
Query: 34 DATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHG 93
+ T + P+ L +R DL+ R++L+EKV + +G+ A+ RLGI Y+WW+EALHG
Sbjct: 16 NVTAQNEPYKNPELSPSERAWDLLKRMTLEEKVSQMKNGSPAIERLGIPAYDWWNEALHG 75
Query: 94 VSNVGPGTKFGGDFPGATSFPQVITTASSF-NATLWEAIGRVVSDEARAMYNG------- 145
V+ G AT FPQ I A++F N ++E +VSDEARA Y+
Sbjct: 76 VARAGK----------ATVFPQAIGLAATFDNQAVYETFD-IVSDEARAKYHDFQRKGER 124
Query: 146 -GTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASC 204
G GLT+W+PN+NI+RDPRWGRG ET GEDP L+ + V+GLQG + A +C
Sbjct: 125 DGYKGLTFWTPNINIYRDPRWGRGMETYGEDPYLTSLMGLAVVKGLQGGGTGKYDKAHAC 184
Query: 205 -KHFTAYDLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGV 262
KH+ + WN R F+AK +S++D+ +T+ F+ V EGKV VMC+YN+ G
Sbjct: 185 AKHYAVHSGPEWN---RHSFDAKNISQRDLWETYLSAFKTLVKEGKVKEVMCAYNRFEGE 241
Query: 263 PTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLD 320
P C++ +L R +R +W + +VSDC ++G +Y H + P A+ADA+ +G DL+
Sbjct: 242 PCCSNKQLLIRILREDWGYDDIVVSDCGAIGDFYYPNHHETHPTAAAASADAVVSGTDLE 301
Query: 321 CGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP----SSQPYGHLGPKD 376
CG E AV++GL+SE IN ++ L + +LGMFD + S PY + K
Sbjct: 302 CGGSYSSLNE-AVRKGLISEEKINESVFRLLRARFQLGMFDDDALVSWSEIPYSVVESK- 359
Query: 377 VCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIAC 436
+H ALE AR+ +VLL N+ +LPLS R VAV+GPN++ +V + NY G
Sbjct: 360 ----EHVTKALEMARKSMVLLTNKNHTLPLSK-SIRKVAVLGPNANDSVMLWANYNGFPT 414
Query: 437 GYTTPLQGIGRY---ARTIHQQGCKDV 460
T L+GI +++GC V
Sbjct: 415 KSVTILEGIKSKLPEGTVYYEKGCDYV 441
Score = 135 bits (339), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 96/321 (29%), Positives = 149/321 (46%), Gaps = 47/321 (14%)
Query: 459 DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQ 508
D+ + + D + +ADA I V GL ++E E + DR + LP Q
Sbjct: 581 DIGIKKEINYKEVADKAAEADAIIFVGGLSPTLEGEEMPVDLPGFRKGDRTNIDLPHVQA 640
Query: 509 ELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTS 568
E++ + K P I VL SG + + + + + AI+ A YPGQ GGTA+AD+LFG
Sbjct: 641 EMLKALKKTGK-PVIFVLCSGSTLALPWEAEN--LDAILEAWYPGQQGGTAVADVLFGDY 697
Query: 569 NPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFV 628
NP G+LP+T+Y +LP E RTYR++KG ++PFGHG+SYT F
Sbjct: 698 NPAGRLPLTFYASS--NDLPDFE-------DYDMSNRTYRYFKGKALFPFGHGLSYTIFD 748
Query: 629 HTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTL 688
+ A +D ++ +G+ + +T + +KN G DG +
Sbjct: 749 YGKAK--------VDKQN-----VRAGEGMTLT------------IPLKNTGKLDGDEVI 783
Query: 689 LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHN 748
V+ PA P K L AF +V +PAG + + I + + + I G++
Sbjct: 784 QVYLRNPADKEGPIKTLRAFRRVSLPAGQTENIRIELPASTFECFNPSTNRMEILPGKYE 843
Query: 749 IHIGGTKHSVSLHAATLGVIK 769
+ GGT +L T+ V K
Sbjct: 844 LLYGGTSDEKALQKLTITVKK 864
>gi|423342048|ref|ZP_17319763.1| hypothetical protein HMPREF1077_01193 [Parabacteroides johnsonii
CL02T12C29]
gi|409219455|gb|EKN12417.1| hypothetical protein HMPREF1077_01193 [Parabacteroides johnsonii
CL02T12C29]
Length = 868
Score = 282 bits (722), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 167/450 (37%), Positives = 238/450 (52%), Gaps = 35/450 (7%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
PF LPI +R++DL+ RL+ +EKV +++ A+ RLGI Y+WW+EALHGV+ G
Sbjct: 27 PFRNPDLPIDERIDDLLKRLTAEEKVGQMMNTTPAIERLGIPQYDWWNEALHGVARAGK- 85
Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTY 152
AT FPQ I A++F+ +VSDEARA Y+ GLT+
Sbjct: 86 ---------ATVFPQAIAMAATFDDDALYETFTMVSDEARAKYHQYQKDKEYDRYKGLTF 136
Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL 212
W+PN+NIFRDPRWGRG ET GEDP L+ + + V+GLQG D K A KH+ +
Sbjct: 137 WTPNINIFRDPRWGRGMETYGEDPYLTERMGVAVVKGLQGDDPKYFKTHACAKHYAVHSG 196
Query: 213 DNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILK 272
WN R F+ V+ +D+ T+ F V EG V VMC+YN+ G P C+ +L
Sbjct: 197 PEWN---RHEFDVTVTPRDLWQTYLPAFEALVKEGNVQEVMCAYNRYQGKPCCSSDKLLI 253
Query: 273 RTIRGEWRLNGYIVSDCDSVGVYYDTQHFT------STPEEAAADAIRAGLDLDCG-PFL 325
+R W I+SDC ++ +++ T E A+ADA+ G DL+CG +
Sbjct: 254 DILRNSWGYENIILSDCGAINDFWERDERTPRHETHPDAESASADAVLNGTDLECGNSYR 313
Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
L A++ G +SE D++ +L L + LGMFD + PY + V +P+H
Sbjct: 314 AL--VKALKDGKISENDLDVSLRRLLKGRFELGMFDPD-ERVPYAQIPYNVVESPEHVAQ 370
Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
ALE A + +VLLKN+ +LPLS R +AV+GPN+ + + NY G T L+GI
Sbjct: 371 ALEMAHKSMVLLKNKNNTLPLSKTI-RKIAVVGPNAADSTMLWANYNGFPTHTVTILEGI 429
Query: 446 GRY---ARTIHQQGCKDVACADDQLFGAAI 472
I++ GC A Q G+ I
Sbjct: 430 RNKVPDTEVIYELGCNHAADFVIQDLGSHI 459
Score = 131 bits (330), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 100/311 (32%), Positives = 141/311 (45%), Gaps = 49/311 (15%)
Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
+ A + AD + V G+ +E E + DR + LP QQE+V K A
Sbjct: 595 YAATAAKVKDADVIVYVGGISPRLEGEEMPVNVEGFKKGDRTNIELPKVQQEMV-KALKA 653
Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
+ P + VL +G + A + I AI+ A Y GQ GTA+ADILFG NP G+LP+T
Sbjct: 654 TGKPVVYVLCTGSAL--ALNWEEANIDAILNAWYGGQEAGTAVADILFGDYNPSGRLPVT 711
Query: 578 WYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
+Y I LP + +M+ GRTYR+ +YPFG+G+SYTNF +
Sbjct: 712 FYKS--IDQLPDFEDYSMK--------GRTYRYMTETPLYPFGYGLSYTNFAYR------ 755
Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA 696
NA +S I AK +TL D+ N G DG ++ P
Sbjct: 756 -------------NAKLSSGKI----AKDQSVTL--TFDIANTGKMDGDEIAQIYIKNPN 796
Query: 697 GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKH 756
P K L AF +VHV AG Q V I + + S D + T + G++ I GG+
Sbjct: 797 DPEGPIKALKAFLRVHVKAGDSQEVNIELAPETFHSFNDNTQTMEVRPGKYQILYGGSSD 856
Query: 757 SVSLHAATLGV 767
+L L +
Sbjct: 857 DKALQRLELTI 867
>gi|423313129|ref|ZP_17291065.1| hypothetical protein HMPREF1058_01677 [Bacteroides vulgatus
CL09T03C04]
gi|392686343|gb|EIY79649.1| hypothetical protein HMPREF1058_01677 [Bacteroides vulgatus
CL09T03C04]
Length = 864
Score = 282 bits (721), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 173/474 (36%), Positives = 254/474 (53%), Gaps = 51/474 (10%)
Query: 1 MASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
M + I + F L++LS ++ A ++ SL +R DL+ +L
Sbjct: 1 MRNKILYLFASLVMLSGCNNQPAYKDS------------------SLSPEERAEDLLQQL 42
Query: 61 SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
+L+EKV L++ + V RLGIK Y WW+EALHGV+ G AT FPQ I A
Sbjct: 43 TLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL----------ATVFPQPIGMA 92
Query: 121 SSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRDPRWGRGQETP 172
+SF I VSDEARA +A GLT W+P VNI+RDPRWGRG ET
Sbjct: 93 ASFEPDAIHTIYTAVSDEARAKNAAYSAAGSYERYQGLTMWTPTVNIYRDPRWGRGIETY 152
Query: 173 GEDPVLSGKYAASYVRGLQGSDGDRL--KVAASCKHFTAYDLDNWNGVDRFHFNAK-VSK 229
GEDP L+ + V+GLQ D ++ K+ A KHF + WN R FNA+ +
Sbjct: 153 GEDPYLTSVMGVNVVKGLQCMDANQKYDKIHACAKHFAVHSGPEWN---RHEFNAENIKP 209
Query: 230 QDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDC 289
+D+ +T+ VPF V E KV VMC+YN++ G P C +L + +R +W +G ++SDC
Sbjct: 210 RDLHETYLVPFEALVKEAKVKEVMCAYNRLEGDPCCGSDRLLMQILRQDWGYDGIVLSDC 269
Query: 290 DSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
++ +Y + + P E A+A A+ +G DL+CG ESA ++GL+SE DI+ ++
Sbjct: 270 GAIDDFYREKGHKTHPDAESASAAAVLSGTDLECGSSYKALVESA-KKGLISEKDIDVSV 328
Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
L + LG D +P + + VC+ +H L+L+ AR+ + LL N+ LPL
Sbjct: 329 KRLLKARFELGEMD-DPDKVEWTKIPYSVVCSAEHDSLSLDIARKSMTLLLNKNNILPLK 387
Query: 408 HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGC 457
+T+AV+GPN++ +V GNY G T L+GI G + I++QGC
Sbjct: 388 R-GGQTIAVMGPNANDSVMQWGNYNGTPKHTITLLEGIRSAMGENDKLIYEQGC 440
Score = 131 bits (330), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 99/301 (32%), Positives = 138/301 (45%), Gaps = 50/301 (16%)
Query: 476 RQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILV 525
+ AD I G+ S+E E + DR + LP Q+EL+ + A K I V
Sbjct: 600 KDADVVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCDAGK-KVIFV 658
Query: 526 LMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYIT 585
SG PI A AI+ A YPGQ+GG A+A++LFG NP G+LP+T+Y IT
Sbjct: 659 NFSGSPI--AMEPETKYCQAILQAWYPGQSGGKAVAEVLFGDYNPAGRLPVTFYRN--IT 714
Query: 586 NLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGR 645
LP E GRTYR++KG ++PFG+G+SYT F
Sbjct: 715 QLPNFE-------DYNMTGRTYRYFKGDPLFPFGYGLSYTTF-----------------N 750
Query: 646 HGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQL 705
+G+I + IKV T + V V N G++DG + V+ P K L
Sbjct: 751 YGNIKLE---QTIKVGE------TAKIIVPVTNTGNRDGEEVVQVYLKKQEDAEGPVKTL 801
Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGTKHSVSLHAAT 764
AF++V +PAG V + + K L D ++ T R G +I +GG L T
Sbjct: 802 RAFKRVQIPAGKTVNVELEL-TPKQLEWWDTQTNTMRTLAGNFDIMVGGNSKDTELQVKT 860
Query: 765 L 765
L
Sbjct: 861 L 861
>gi|150003731|ref|YP_001298475.1| glycoside hydrolase family protein [Bacteroides vulgatus ATCC 8482]
gi|319640047|ref|ZP_07994774.1| glycoside hydrolase family 3 [Bacteroides sp. 3_1_40A]
gi|345517061|ref|ZP_08796539.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
4_3_47FAA]
gi|149932155|gb|ABR38853.1| glycoside hydrolase family 3, candidate beta-glycosidase
[Bacteroides vulgatus ATCC 8482]
gi|254833833|gb|EET14142.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
4_3_47FAA]
gi|317388325|gb|EFV69177.1| glycoside hydrolase family 3 [Bacteroides sp. 3_1_40A]
Length = 864
Score = 282 bits (721), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 173/474 (36%), Positives = 254/474 (53%), Gaps = 51/474 (10%)
Query: 1 MASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
M + I + F L++LS ++ A ++ SL +R DL+ +L
Sbjct: 1 MRNKILYLFASLVMLSGCNNQPAYKDS------------------SLSPEERAEDLLQQL 42
Query: 61 SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
+L+EKV L++ + V RLGIK Y WW+EALHGV+ G AT FPQ I A
Sbjct: 43 TLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL----------ATVFPQPIGMA 92
Query: 121 SSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRDPRWGRGQETP 172
+SF I VSDEARA +A GLT W+P VNI+RDPRWGRG ET
Sbjct: 93 ASFEPDAIHTIYTAVSDEARAKNAAYSAAGSYERYQGLTMWTPTVNIYRDPRWGRGIETY 152
Query: 173 GEDPVLSGKYAASYVRGLQGSDGDRL--KVAASCKHFTAYDLDNWNGVDRFHFNAK-VSK 229
GEDP L+ + V+GLQ D ++ K+ A KHF + WN R FNA+ +
Sbjct: 153 GEDPYLTSVMGVNVVKGLQCMDANQKYDKIHACAKHFAVHSGPEWN---RHEFNAENIKP 209
Query: 230 QDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDC 289
+D+ +T+ VPF V E KV VMC+YN++ G P C +L + +R +W +G ++SDC
Sbjct: 210 RDLHETYLVPFEALVKEAKVKEVMCAYNRLEGDPCCGSDRLLMQILRQDWGYDGIVLSDC 269
Query: 290 DSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
++ +Y + + P E A+A A+ +G DL+CG ESA ++GL+SE DI+ ++
Sbjct: 270 GAIDDFYREKGHKTHPDAESASAAAVLSGTDLECGSSYKALVESA-KKGLISEKDIDVSV 328
Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
L + LG D +P + + VC+ +H L+L+ AR+ + LL N+ LPL
Sbjct: 329 KRLLKARFELGEMD-DPDKVEWTKIPYSVVCSAEHDSLSLDIARKSMTLLLNKNNILPLK 387
Query: 408 HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGC 457
+T+AV+GPN++ +V GNY G T L+GI G + I++QGC
Sbjct: 388 R-GGQTIAVMGPNANDSVMQWGNYNGTPKHTITLLEGIRSAMGENDKLIYEQGC 440
Score = 129 bits (325), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 99/301 (32%), Positives = 137/301 (45%), Gaps = 50/301 (16%)
Query: 476 RQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILV 525
+ AD I G+ S+E E + DR + LP Q+EL+ + A K I V
Sbjct: 600 KDADVVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCDAGK-KVIFV 658
Query: 526 LMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYIT 585
SG PI A AI+ A YPGQ+GG A A++LFG NP G+LP+T+Y IT
Sbjct: 659 NFSGSPI--AMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYNPAGRLPVTFYRN--IT 714
Query: 586 NLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGR 645
LP E GRTYR++KG ++PFG+G+SYT F
Sbjct: 715 QLPDFE-------DYNMTGRTYRYFKGDPLFPFGYGLSYTTF-----------------N 750
Query: 646 HGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQL 705
+G+I + IKV T + V V N G++DG + V+ P K L
Sbjct: 751 YGNIKLE---QTIKVGE------TAKIIVPVTNTGNRDGEEVVQVYLKKQEDAEGPVKTL 801
Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGTKHSVSLHAAT 764
AF++V +PAG V + + K L D ++ T R G +I +GG L T
Sbjct: 802 RAFKRVQIPAGKTVNVELEL-TPKQLEWWDAQTNTMRTIAGNFDIMVGGNSKDAELQVKT 860
Query: 765 L 765
L
Sbjct: 861 L 861
>gi|294777452|ref|ZP_06742903.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
vulgatus PC510]
gi|294448520|gb|EFG17069.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
vulgatus PC510]
Length = 864
Score = 281 bits (720), Expect = 8e-73, Method: Compositional matrix adjust.
Identities = 173/474 (36%), Positives = 254/474 (53%), Gaps = 51/474 (10%)
Query: 1 MASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
M + I + F L++LS ++ A ++ SL +R DL+ +L
Sbjct: 1 MRNKILYLFASLVMLSGCNNQPAYKDS------------------SLSPEERAEDLLQQL 42
Query: 61 SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
+L+EKV L++ + V RLGIK Y WW+EALHGV+ G AT FPQ I A
Sbjct: 43 TLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL----------ATVFPQPIGMA 92
Query: 121 SSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRDPRWGRGQETP 172
+SF I VSDEARA +A GLT W+P VNI+RDPRWGRG ET
Sbjct: 93 ASFEPDAIHTIYTAVSDEARAKNAAYSAAGSYERYQGLTMWTPTVNIYRDPRWGRGIETY 152
Query: 173 GEDPVLSGKYAASYVRGLQGSDGDRL--KVAASCKHFTAYDLDNWNGVDRFHFNAK-VSK 229
GEDP L+ + V+GLQ D ++ K+ A KHF + WN R FNA+ +
Sbjct: 153 GEDPYLTSVMGVNVVKGLQCMDANQKYDKIHACAKHFAVHSGPEWN---RHEFNAENIKP 209
Query: 230 QDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDC 289
+D+ +T+ VPF V E KV VMC+YN++ G P C +L + +R +W +G ++SDC
Sbjct: 210 RDLHETYLVPFEALVKEAKVKEVMCAYNRLEGDPCCGSDRLLMQILRQDWGYDGIVLSDC 269
Query: 290 DSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
++ +Y + + P E A+A A+ +G DL+CG ESA ++GL+SE DI+ ++
Sbjct: 270 GAIDDFYREKGHKTHPDAESASAAAVLSGTDLECGSSYKALVESA-KKGLISEKDIDVSV 328
Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
L + LG D +P + + VC+ +H L+L+ AR+ + LL N+ LPL
Sbjct: 329 KRLLKARFELGEMD-DPDKVEWTKIPYSVVCSAEHDSLSLDIARKSMTLLLNKNNILPLK 387
Query: 408 HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGC 457
+T+AV+GPN++ +V GNY G T L+GI G + I++QGC
Sbjct: 388 R-GGQTIAVMGPNANDSVMQWGNYNGTPKHTITLLEGIRSAMGENDKLIYEQGC 440
Score = 125 bits (313), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 97/301 (32%), Positives = 135/301 (44%), Gaps = 50/301 (16%)
Query: 476 RQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILV 525
+ AD I G+ S+E E + DR + LP Q+EL+ + A K I V
Sbjct: 600 KDADIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCDAGK-KVIFV 658
Query: 526 LMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYIT 585
SG PI A AI+ A YPGQ+GG A A++LFG NP G+LP+T+Y
Sbjct: 659 NFSGSPI--AMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYNPAGRLPVTFYRN--TA 714
Query: 586 NLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGR 645
LP E GRTYR++KG ++PFG+G+SYT F
Sbjct: 715 QLPDFE-------DYNMTGRTYRYFKGDPLFPFGYGLSYTTF-----------------N 750
Query: 646 HGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQL 705
+G+I + IKV T + V V N G++DG + V+ P K L
Sbjct: 751 YGNIKLE---QTIKVGE------TAKIIVPVTNTGNRDGEEVVQVYLKKQEDAEGPVKTL 801
Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGTKHSVSLHAAT 764
AF++V +PAG V + + K L D ++ T R G +I +GG L T
Sbjct: 802 RAFKRVQIPAGKTVNVELEL-TPKQLEWWDAQTNTMRTIAGNFDIMVGGNSKDAELQVKT 860
Query: 765 L 765
L
Sbjct: 861 L 861
>gi|380512525|ref|ZP_09855932.1| beta-glucosidase [Xanthomonas sacchari NCPPB 4393]
Length = 885
Score = 281 bits (720), Expect = 8e-73, Method: Compositional matrix adjust.
Identities = 158/405 (39%), Positives = 229/405 (56%), Gaps = 38/405 (9%)
Query: 56 LIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQ 115
L+ +++ EK+ ++ A A+PRLG+ YEWWSE LHG++ G AT FPQ
Sbjct: 40 LVAKMTRAEKIAQAMNAAPAIPRLGVPAYEWWSEGLHGIARNGE----------ATVFPQ 89
Query: 116 VITTASSFNATLWEAIGRVVSDEARAMYN--GGT-------AGLTYWSPNVNIFRDPRWG 166
I A+++N L +G V S EARA +N GG AGLT WSPN+NIFRDPRWG
Sbjct: 90 AIGLAATWNPELLHDVGTVTSTEARAKFNLAGGPGKDHPRYAGLTIWSPNINIFRDPRWG 149
Query: 167 RGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--RFHFN 224
RG ET GEDP L+G+ A ++ GLQG D + A+ KH + +G + R F+
Sbjct: 150 RGMETYGEDPYLTGRLAVGFIHGLQGDDPAHPRTIATPKHLAVH-----SGPEPGRHGFD 204
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
VS D E T+ FR +++G+ SVMC+YN ++G P CA ++ +RG+W G+
Sbjct: 205 VDVSPHDFEATYSPAFRAAIVDGQAGSVMCAYNSLHGTPACAADWLIDGRVRGDWGFKGF 264
Query: 285 IVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPF---LGLHTESAVQRGLLSEI 341
+VSDCD++ ++ ++A A++AG DL+CG LG+ A RG E
Sbjct: 265 VVSDCDAIDDMTQFHYYRPDNAGSSAAALKAGHDLNCGTAYRELGI----AFDRGEADEA 320
Query: 342 DINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQG 401
++ +LV + RLG + PY LG +D+ + H+ LAL+AA+Q +VLLKN
Sbjct: 321 LLDRSLVRLFAARYRLGELQPR-RNDPYARLGARDIDSAAHRALALQAAQQSLVLLKNAN 379
Query: 402 PSLPLS-HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
+LPL +R +AV+GPN+D + NY G + TPLQG+
Sbjct: 380 ATLPLRPGLR---LAVLGPNADALAALEANYQGTSVQPVTPLQGL 421
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 93/278 (33%), Positives = 141/278 (50%), Gaps = 48/278 (17%)
Query: 486 GLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
GL +E E L DR L LP QQ L+ + + K P ++VLMSG + +
Sbjct: 622 GLSPDVEGEELRIDVPGFDGGDRNDLALPAAQQALLERAKASGK-PLVVVLMSGSAVALN 680
Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
+A+ AII A YPGQ+GGTAIA L G NPGG+LP+T+Y +LP
Sbjct: 681 WAEQ--HADAIIAAWYPGQSGGTAIAQALAGDINPGGRLPVTFY--RSTKDLP------- 729
Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG 655
P S GRTYR++KG ++PFG+G+SYT F + +AP + S +G
Sbjct: 730 PYVSYDMKGRTYRYFKGEPLFPFGYGLSYTQFAY---DAPQL----------STTTLQAG 776
Query: 656 KAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPA 715
+ ++V+ V+N G++ G + V+ P +P + LV F++VH+
Sbjct: 777 QPLQVS------------TTVRNTGARAGDEVVQVYLQYPQRAQSPLRSLVGFQRVHLQP 824
Query: 716 GAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
G + + + + LS VDRSG R + G++ + +GG
Sbjct: 825 GEARTLSFALD-ARQLSDVDRSGQRAVEAGDYRLFVGG 861
>gi|423227459|ref|ZP_17213920.1| hypothetical protein HMPREF1062_06106 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392623089|gb|EIY17195.1| hypothetical protein HMPREF1062_06106 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 864
Score = 281 bits (719), Expect = 9e-73, Method: Compositional matrix adjust.
Identities = 173/447 (38%), Positives = 253/447 (56%), Gaps = 41/447 (9%)
Query: 34 DATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHG 93
+ T + P+ L +R DL+ R++L+EKV + +G+ A+ RLGI Y+WW+EALHG
Sbjct: 16 NVTAQNEPYKNPELSPSERAWDLLKRMTLEEKVSQMKNGSPAIERLGIPAYDWWNEALHG 75
Query: 94 VSNVGPGTKFGGDFPGATSFPQVITTASSF-NATLWEAIGRVVSDEARAMYNG------- 145
V+ G AT FPQ I A++F N ++E +VSDEARA Y+
Sbjct: 76 VARAGK----------ATVFPQAIGLAATFDNQAVYETFD-IVSDEARAKYHDFQRKGER 124
Query: 146 -GTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASC 204
G GLT+W+PN+NI+RDPRWGRG ET GEDP L+ + V+GLQG + A +C
Sbjct: 125 DGYKGLTFWTPNINIYRDPRWGRGMETYGEDPYLTSLMGLAVVKGLQGGGTGKYDKAHAC 184
Query: 205 -KHFTAYDLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGV 262
KH+ + WN R F+AK +S++D+ +T+ F+ V EGKV VMC+YN+ G
Sbjct: 185 AKHYAVHSGPEWN---RHSFDAKNISQRDLWETYLPAFKTLVKEGKVKEVMCAYNRFEGE 241
Query: 263 PTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLD 320
P C++ +L R +R +W + +VSDC ++G +Y H + P A+ADA+ +G DL+
Sbjct: 242 PCCSNKQLLIRILREDWGYDDIVVSDCGAIGDFYYPNHHETHPTAAAASADAVVSGTDLE 301
Query: 321 CGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP----SSQPYGHLGPKD 376
CG E AV++GL+SE IN ++ L + +LGMFD + S PY + K
Sbjct: 302 CGGSYSSLNE-AVRKGLISEEKINESVFRLLRARFQLGMFDDDALVSWSEIPYSVVESK- 359
Query: 377 VCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIAC 436
+H ALE AR+ +VLL N+ +LPLS R VAV+GPN++ +V + NY G
Sbjct: 360 ----EHVAKALEMARKSMVLLTNKNHTLPLSK-SIRKVAVLGPNANDSVMLWANYNGFPT 414
Query: 437 GYTTPLQGIGRY---ARTIHQQGCKDV 460
T L+GI +++GC V
Sbjct: 415 KSVTILEGIKSKLPEGTVYYEKGCDYV 441
Score = 134 bits (338), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 96/321 (29%), Positives = 149/321 (46%), Gaps = 47/321 (14%)
Query: 459 DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQ 508
D+ + + D + +ADA I V GL ++E E + DR + LP Q
Sbjct: 581 DIGIKKEINYKEVADKAAEADAIIFVGGLSPTLEGEEMPVDLPGFRKGDRTNIDLPHVQA 640
Query: 509 ELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTS 568
E++ + K P I VL SG + + + + + AI+ A YPGQ GGTA+AD+LFG
Sbjct: 641 EMLKALKKTGK-PVIFVLCSGSTLALPWEAEN--LDAILEAWYPGQQGGTAVADVLFGDY 697
Query: 569 NPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFV 628
NP G+LP+T+Y +LP E RTYR++KG ++PFGHG+SYT F
Sbjct: 698 NPAGRLPLTFYASS--DDLPDFE-------DYDMSNRTYRYFKGKALFPFGHGLSYTIFD 748
Query: 629 HTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTL 688
+ A +D ++ +G+ + +T + +KN G DG +
Sbjct: 749 YGKAK--------VDKQN-----VRAGEGMTLT------------IPLKNTGKLDGDEVI 783
Query: 689 LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHN 748
V+ PA P K L AF +V +PAG + + I + + + I G++
Sbjct: 784 QVYLRNPADKEGPIKTLRAFRRVSLPAGQTENIRIELPASTFECFNPSTNRMEILPGKYE 843
Query: 749 IHIGGTKHSVSLHAATLGVIK 769
+ GGT +L T+ V K
Sbjct: 844 LLYGGTSDEKALQKLTITVKK 864
>gi|383649536|ref|ZP_09959942.1| beta-glucosidase [Sphingomonas elodea ATCC 31461]
Length = 853
Score = 281 bits (719), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 171/420 (40%), Positives = 235/420 (55%), Gaps = 29/420 (6%)
Query: 50 PQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPG 109
P V I ++++++K L S A A P +G+ Y+WW+E LHG++ G
Sbjct: 13 PDPVAAAIAKMTIEQKAAQLQSTAPADPVIGLPAYDWWNEGLHGLARDG----------Y 62
Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFR 161
AT FPQ I A++++ L +G V+ EARA +N GLT WSPN+NIFR
Sbjct: 63 ATVFPQAIGLAATWDVPLLHKVGDTVATEARAKFNAKPVTADRKIYEGLTIWSPNINIFR 122
Query: 162 DPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRF 221
DPRWGRGQET GEDP L+G A +++GLQG D KV A+ KH + R
Sbjct: 123 DPRWGRGQETYGEDPFLTGHLAVGFIQGLQGPDPAHPKVLATPKHLAVHSGPE---AGRD 179
Query: 222 HFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRL 281
F+ S QD+E T+ FR+ + EGK SVMC+YN ++G P CA +L +R +W
Sbjct: 180 GFDVDPSPQDLESTYTPAFRLALTEGKAQSVMCAYNSIHGTPACASGGLLNDRLRKDWGF 239
Query: 282 NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEI 341
G IVSDCD+V + H+ EA+A AI+AG+DL+CG E AV+RGL+SE
Sbjct: 240 TGLIVSDCDAVANIHLFHHYRLDAAEASAAAIKAGMDLNCGTTYAALPE-AVKRGLVSEA 298
Query: 342 DINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQG 401
+++ AL L + LG+ G + P+ + P + TP + LALEAAR+ IVLLKN G
Sbjct: 299 EVDVALKRALDARRALGIAFG--GANPWSRIKPGERGTPAQRALALEAARKAIVLLKNDG 356
Query: 402 PSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY---ARTIHQQGCK 458
LPL+ +AVIG N+D + GNY G A TPL GI R R ++ QG +
Sbjct: 357 DRLPLAP--GGRIAVIGANADDLGVLEGNYHGTARDPVTPLDGIRRQFGADRVVYAQGAQ 414
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 89/292 (30%), Positives = 132/292 (45%), Gaps = 63/292 (21%)
Query: 479 DATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMS 528
D + V+GL +E EAL DR + LP Q EL+ + K P +LVL S
Sbjct: 580 DTIVAVLGLSPDLEGEALSVSIPGFVGGDRTDIALPRPQLELLKALRKTGK-PLVLVLTS 638
Query: 529 GGPIDVAFAKNDPRIA-AIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL 587
G + V DP +A AI+ A YPG+ GGTAIA+ L G +NP G+LP+T+Y + +L
Sbjct: 639 GSAVAV-----DPSLADAILEAWYPGEEGGTAIAETLAGKNNPSGRLPLTFYAS--VDDL 691
Query: 588 P-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
P + M+ RTYRF+ G ++ FGHG+SYT F +
Sbjct: 692 PAFVDYGMKE--------RTYRFFTGKPLWGFGHGLSYTKFA-----------------Y 726
Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA-----GHWAP 701
G++ +G V V + N G++ G V+ PA G P
Sbjct: 727 GNVAVKAAGIGQPVQ----------VSATLTNAGNRSGEEVAQVYVVTPAAGKPGGLTTP 776
Query: 702 --HKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHI 751
+QL F++ + G + + + +S V R GTRR+ G + + I
Sbjct: 777 VLQRQLAGFQRTALAPGKSASLSFTLD-PRSISSVARDGTRRVLPGTYRVWI 827
>gi|222635353|gb|EEE65485.1| hypothetical protein OsJ_20901 [Oryza sativa Japonica Group]
Length = 481
Score = 281 bits (719), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 136/186 (73%), Positives = 155/186 (83%), Gaps = 2/186 (1%)
Query: 142 MYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD--GDRLK 199
MYNGG AGLT+WSPNVNIFRDPRWGRGQETPGEDP ++ +YAA+YVRGLQ RLK
Sbjct: 1 MYNGGQAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVAARYAAAYVRGLQQQQPSSGRLK 60
Query: 200 VAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQV 259
+AA CKHFTAYDLDNW+G DRFHFNA V++QD+EDTF+VPFR CV++G+ ASVMCSYNQV
Sbjct: 61 LAACCKHFTAYDLDNWSGTDRFHFNAVVTRQDLEDTFNVPFRSCVVDGRAASVMCSYNQV 120
Query: 260 NGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDL 319
NGVPTCAD L+ TIR W L GYIVSDCDSV V+Y QH+T T E+A A +RAGLDL
Sbjct: 121 NGVPTCADAAFLRGTIRRRWGLAGYIVSDCDSVDVFYSDQHYTRTREDAVAATLRAGLDL 180
Query: 320 DCGPFL 325
DCGPFL
Sbjct: 181 DCGPFL 186
Score = 265 bits (676), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 145/264 (54%), Positives = 182/264 (68%), Gaps = 2/264 (0%)
Query: 504 PGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADI 563
PGRQ EL+S V+ ASKGP ILVLMSGGPID+ FA+NDP+IA I+WAGYPGQAGG AIAD+
Sbjct: 220 PGRQAELISSVAKASKGPVILVLMSGGPIDIGFAQNDPKIAGILWAGYPGQAGGQAIADV 279
Query: 564 LFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMS 623
+FG NPGGKLP+TWYPQ+Y+ +PMT MAMR + +K YPGRTYRFY GP ++PFGHG+S
Sbjct: 280 IFGHHNPGGKLPVTWYPQDYLQKVPMTNMAMRANPAKGYPGRTYRFYTGPTIHPFGHGLS 339
Query: 624 YTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKD 683
YT+F H++A+AP+ + V L H + +A+ S A G + S +
Sbjct: 340 YTSFTHSIAHAPSQLTVRLSAHHAAASASASLNA--TARLSRAAAERGRTRRRRTRCSCN 397
Query: 684 GAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIP 743
A L + AGH AP +QLVAFEKVHV AG RV + I VC LSV DR+G RRIP
Sbjct: 398 AAAPALSAAEAAAGHGAPVRQLVAFEKVHVGAGGTARVEMGIDVCDGLSVADRNGVRRIP 457
Query: 744 LGEHNIHIGGTKHSVSLHAATLGV 767
+GEH + IG H+V++ LGV
Sbjct: 458 VGEHRLIIGELTHTVTIALEQLGV 481
>gi|325105296|ref|YP_004274950.1| beta-glucosidase [Pedobacter saltans DSM 12145]
gi|324974144|gb|ADY53128.1| Beta-glucosidase [Pedobacter saltans DSM 12145]
Length = 884
Score = 281 bits (719), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 168/430 (39%), Positives = 239/430 (55%), Gaps = 31/430 (7%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
F LP+ +R+ +L+G L+L+EKV L+++ + V RLGI Y+WW+EALHGV+ G
Sbjct: 32 FRNPDLPVNERIENLLGLLTLEEKVGLMMNSSKPVGRLGIPAYDWWNEALHGVARSGK-- 89
Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYW 153
AT FPQ I A+++N + + ++SDEARA YN G GL++W
Sbjct: 90 --------ATVFPQAIGMAATWNESGHKQTFDLISDEARAKYNEAIRNGERGRYYGLSFW 141
Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLD 213
+PN+NIFRDPRWGRGQET GEDP L+ + + VRGLQG D K A KHF +
Sbjct: 142 TPNINIFRDPRWGRGQETYGEDPYLTARLGVAAVRGLQGDDPKYFKTHACAKHFAVHSGP 201
Query: 214 NWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKR 273
WN R ++A S +D+ +T+ F+ V E V VMC+YN G P C +L
Sbjct: 202 EWN---RHSYDATASGRDLWETYLPAFKALVKEANVQEVMCAYNAYEGQPCCGSDRLLTD 258
Query: 274 TIRGEWRLNGYIVSDCDSVGVYYDTQHFTS--TPEEAAADAIRAGLDLDCGPFLGLHTES 331
+R W G +VSDC ++ ++ H + AAADA+ DL+CG E
Sbjct: 259 ILRNRWEYKGIVVSDCWAIDDFFRKGHHETHKDAAAAAADAVIHSTDLECGSAYTNLLE- 317
Query: 332 AVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAA 390
AV++GL+S+ I+ +L L LGM D P+ + P+ L + V + +H + AL+ A
Sbjct: 318 AVRQGLISQQQIDISLRRVLRGWFELGMLD--PAERLPWSQLPYQIVASKEHVQQALKVA 375
Query: 391 RQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGR--- 447
R+ + LLKN G LPLS + +AVIGPN+ +V + GNY G T LQGI
Sbjct: 376 RESMTLLKNNGSILPLSK-SIKKIAVIGPNAADSVMLWGNYNGTPNSTVTILQGIKNKLP 434
Query: 448 YARTIHQQGC 457
+A I+ +GC
Sbjct: 435 HAEIIYDKGC 444
Score = 110 bits (274), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 94/374 (25%), Positives = 161/374 (43%), Gaps = 50/374 (13%)
Query: 416 VIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTI-HQQGCK------DVACADDQLF 468
V+ N V +T G A YT + +Y + H+Q K V D F
Sbjct: 539 VLYVNDKVLLTKKGREASADTEYTMKAEKNKKYNIVLEHKQVGKYVNIDFSVLKKDKADF 598
Query: 469 GAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMAS 518
I ++ DA + GL +E E + D+ + LP Q+EL+S +
Sbjct: 599 TNLIKRLKEVDAIVYAGGLSPQLEGEEMPVNADGFRGGDKISIDLPKIQRELLSSLKSTG 658
Query: 519 KGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTW 578
K P + VL +G + A +++ A++ A Y GQ GTA+AD+LFG NP G+LP+T+
Sbjct: 659 K-PVVFVLCTGSSL--ALEQDEKNYNALLCAWYGGQEAGTAVADVLFGDYNPAGRLPITF 715
Query: 579 YPQ-EYITN--LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAP 635
Y + N L ++ + + ++ GRTYR+ +Y FGHG+SY+ F +
Sbjct: 716 YKSLSQLDNALLKTSDTSRQDFENYSMQGRTYRYMTEKPLYAFGHGLSYSKFNYG----- 770
Query: 636 TVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPP 695
A ++ +K+ + TL + + + N+ + G + V+
Sbjct: 771 --------------EAKLTSGTVKIGN------TLNISIPLTNISNNKGEEVVQVYVKRN 810
Query: 696 AGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIP-LGEHNIHIGGT 754
AP K L F++V + AG + + + + D S P G + I GG+
Sbjct: 811 GDPDAPVKSLKGFKRVAIAAGETKHLDFQL-TAEAFEFYDPSKDELGPKAGNYTIMYGGS 869
Query: 755 KHSVSLHAATLGVI 768
+ L + ++ V+
Sbjct: 870 SNEDRLKSISINVV 883
>gi|315607027|ref|ZP_07882031.1| beta-glucosidase [Prevotella buccae ATCC 33574]
gi|315251081|gb|EFU31066.1| beta-glucosidase [Prevotella buccae ATCC 33574]
Length = 866
Score = 281 bits (718), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 164/440 (37%), Positives = 240/440 (54%), Gaps = 41/440 (9%)
Query: 28 FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
F+C A + P+ + L +R DL RL+L+EK KL+ + + A+PRLGI +EWW
Sbjct: 14 FSCV---AGAQQYPYQNLQLSSQERAEDLCSRLTLEEKTKLMRNSSPAIPRLGIPQFEWW 70
Query: 88 SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT 147
SEALHG++ G AT FPQ A+S++ L + SDEA A N
Sbjct: 71 SEALHGIARNG----------FATVFPQTTAMAASWDDELLYRVFCAASDEAVAKNNLAR 120
Query: 148 A--------GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR-- 197
G++ W+PN+NIFRDPRWGRGQET GEDP L+ + + V GLQG R
Sbjct: 121 KSGDIKRYQGVSIWTPNINIFRDPRWGRGQETYGEDPYLTSRMGLAVVNGLQGQPFRRDM 180
Query: 198 ---------LKVAASCKHFTAYDLDNWNGVDRFHFNA-KVSKQDIEDTFDVPFRMCVMEG 247
K A KH+ + WN R F+ ++ ++D+ +T+ F+ V EG
Sbjct: 181 RPFTERPRYYKTLACAKHYAVHSGPEWN---RHVFDVERLPERDLWETYLPAFKSLVQEG 237
Query: 248 KVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDT--QHFTSTP 305
V VMC+Y +++G P C + L + +RGEW NG +VSDC ++ +Y H TP
Sbjct: 238 NVREVMCAYQRIDGSPCCGNTRYLHQILRGEWGYNGLVVSDCGAISDFYREGHHHVVETP 297
Query: 306 EEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPS 365
EA+A +RAG D++CG AV++GL+S I+ ++V L + +G FD E
Sbjct: 298 AEASAMGVRAGTDVECGAVYAT-LPRAVEQGLISREAIDTSVVRLLKARFEVGDFDSE-K 355
Query: 366 SQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTV 425
P+ GP+ + + H+ LAL+ AR+ + LL+N+ LPLS R +AV+GPN++ +V
Sbjct: 356 LVPWKLTGPEVIASETHRRLALDMARESMTLLQNRNRLLPLSKNGLR-IAVMGPNANDSV 414
Query: 426 TMIGNYAGIACGYTTPLQGI 445
+ GNY G TT L+GI
Sbjct: 415 MLWGNYTGYPISTTTILKGI 434
Score = 117 bits (294), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 86/300 (28%), Positives = 134/300 (44%), Gaps = 55/300 (18%)
Query: 478 ADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLM 527
AD + V G+ +E E + DR + LP Q+E++ + A K + V
Sbjct: 611 ADVVVFVGGISPRLEGEEMKVDAPGFNGGDRTSIELPEAQREVIRLLRQAGK-LVVFVNC 669
Query: 528 SGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL 587
SGG VA A++ A Y G+AGG A+AD+LFG NP GKLP+T+Y + +L
Sbjct: 670 SGGA--VALVPEAEACDAVLQAWYAGEAGGQAVADVLFGDYNPSGKLPVTFYKSD--ADL 725
Query: 588 PMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHG 647
P R GRTYR+++G ++PFG G+SYT+FV
Sbjct: 726 P-------DFLDYRMTGRTYRYFRGTPLFPFGFGLSYTSFVFGTPRYE------------ 766
Query: 648 SINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVA 707
+GK + V+V N G +DGA + V+ PA P K L
Sbjct: 767 ------NGK---------------LYVEVTNTGKRDGAEVVQVYVKNPADADGPVKTLRG 805
Query: 708 FEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGV 767
F ++ + AG ++RV I + ++ + T R+ G H + +G + L ++ +
Sbjct: 806 FARIDLKAGERRRVEIAMPRERFEGWDATTNTMRVKPGNHLLMVGSSSRDADLQTISVSI 865
>gi|71731103|gb|EAO33170.1| Beta-glucosidase [Xylella fastidiosa subsp. sandyi Ann-1]
Length = 882
Score = 280 bits (717), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 165/405 (40%), Positives = 231/405 (57%), Gaps = 28/405 (6%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
Q L+ +++ QEK+ ++ A A+PRLGI Y+WWSE LHG++ G A
Sbjct: 32 QHAAALVAQMTRQEKIAQTMNAAPAIPRLGIPAYDWWSEGLHGIARNG----------YA 81
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--GGT-------AGLTYWSPNVNIFR 161
T FPQ I A+S+N L + +G V S EARA +N GG AGLT WSPN+NIFR
Sbjct: 82 TVFPQAIGLAASWNTDLLQHVGTVTSTEARAKFNLTGGPGKDHPRYAGLTLWSPNINIFR 141
Query: 162 DPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRF 221
DPRWGRG ET GEDP L+G+ A S++RGLQG D + A+ KHF + R
Sbjct: 142 DPRWGRGMETYGEDPYLTGQLAVSFIRGLQGDTPDHPRTIATPKHFAVHSGPEQG---RH 198
Query: 222 HFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRL 281
F+ VS D+E T+ FR +++G SVMC+YN ++G P CA +L +R +W
Sbjct: 199 SFDVDVSAYDLEATYTPAFRAAIVDGHAGSVMCAYNALHGTPACASDWLLNTRLRNDWGF 258
Query: 282 NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSE 340
NG++VSDCD++ F A+A A+++G DL+CG + L+ A+ RG + E
Sbjct: 259 NGFVVSDCDAIEDMTRFHFFRQDNASASAAALKSGDDLNCGNTYRDLN--QAIARGDIDE 316
Query: 341 IDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
++ AL+ T + RLG PY +G K + TP H+ LAL+AA Q +VLLKN
Sbjct: 317 STLDQALIRLFTARQRLGTLQPR-EHDPYAAIGIKHIDTPAHRALALQAAAQSLVLLKNS 375
Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
G +LPL T+AV+GP++D + NY G + TPL G+
Sbjct: 376 GNTLPLP--PETTLAVLGPDADSLTALEANYQGTSSTPVTPLTGL 418
Score = 131 bits (330), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 95/290 (32%), Positives = 142/290 (48%), Gaps = 48/290 (16%)
Query: 474 ASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTI 523
A ADA + +GL +E E L DR + LP Q+ L+ V K P I
Sbjct: 607 AVAHADAIVAFVGLSPEVEGEELHIDTPGFSGGDRTTIDLPATQETLLQHVKTTGK-PLI 665
Query: 524 LVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEY 583
+VLMSG + + +A++ AI+ A YPGQ+GGTAIA L G NPGG+LP+T+Y
Sbjct: 666 VVLMSGSAVALNWAQH--HADAILAAWYPGQSGGTAIAQALAGDVNPGGRLPVTFY--RS 721
Query: 584 ITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLD 643
+LP P S GRTYR++KG +YPFG+G+SYT F + AP + L
Sbjct: 722 TQDLP-------PYISYDMTGRTYRYFKGQPLYPFGYGLSYTQFAY---EAPQLSTATLK 771
Query: 644 GRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHK 703
+ TL V V+N G++ G + ++ PP AP +
Sbjct: 772 AGN----------------------TLTVTAHVRNTGTRAGDEVVQLYLEPPYSPQAPLR 809
Query: 704 QLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
LV F++V + G + + + + LS V ++G R + G +++ +GG
Sbjct: 810 SLVGFKRVTLRPGESRLLTFTLD-ARQLSGVQQTGQRSVEAGHYHLFVGG 858
>gi|288927072|ref|ZP_06420962.1| beta-glucosidase [Prevotella buccae D17]
gi|288336152|gb|EFC74543.1| beta-glucosidase [Prevotella buccae D17]
Length = 866
Score = 280 bits (717), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 164/440 (37%), Positives = 239/440 (54%), Gaps = 41/440 (9%)
Query: 28 FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
F+C A + P+ L +R DL RL+L+EK KL+ + + A+PRLGI +EWW
Sbjct: 14 FSCV---AGAQQYPYQNPRLSSQERAEDLCSRLTLEEKTKLMRNSSPAIPRLGIPQFEWW 70
Query: 88 SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT 147
SEALHG++ G AT FPQ A+S++ L + SDEA A N
Sbjct: 71 SEALHGIARNG----------FATVFPQTTAMAASWDDELLYHVFCAASDEAVAKNNLAR 120
Query: 148 A--------GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR-- 197
G++ W+PN+NIFRDPRWGRGQET GEDP L+ + + V GLQG R
Sbjct: 121 KSGDIKRYQGVSIWTPNINIFRDPRWGRGQETYGEDPYLTSRMGLAVVNGLQGQPFRRDM 180
Query: 198 ---------LKVAASCKHFTAYDLDNWNGVDRFHFNA-KVSKQDIEDTFDVPFRMCVMEG 247
K A KH+ + WN R F+ ++ ++D+ +T+ F+ V EG
Sbjct: 181 RPFTERPRYYKTLACAKHYAVHSGPEWN---RHVFDVERLPERDLWETYLPAFKSLVQEG 237
Query: 248 KVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDT--QHFTSTP 305
V VMC+Y +++G P C + L + +RGEW NG +VSDC ++ +Y H TP
Sbjct: 238 NVREVMCAYQRIDGSPCCGNTRYLHQILRGEWEYNGLVVSDCGAISDFYREGHHHVVETP 297
Query: 306 EEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPS 365
EA+A +RAG D++CG AV++GL+S I+ ++V L + +G FD E
Sbjct: 298 AEASAMGVRAGTDVECGAVYAT-LPRAVEQGLISREAIDTSVVRLLKARFEVGDFDSE-K 355
Query: 366 SQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTV 425
P+ GP+ + + H+ LAL+ AR+ + LL+N+ LPLS R +AV+GPN++ +V
Sbjct: 356 LVPWKLTGPEVIASETHRRLALDMARESMTLLQNRNRLLPLSKNGLR-IAVMGPNANDSV 414
Query: 426 TMIGNYAGIACGYTTPLQGI 445
+ GNY G TT L+GI
Sbjct: 415 MLWGNYTGYPISTTTILKGI 434
Score = 115 bits (289), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 85/300 (28%), Positives = 133/300 (44%), Gaps = 55/300 (18%)
Query: 478 ADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLM 527
AD + V G+ +E E + DR + LP Q+E++ + A K + V
Sbjct: 611 ADVVVFVGGISPRLEGEEMKVDAPGFKGGDRTSIELPEAQREVIRLLRQAGK-LVVFVNC 669
Query: 528 SGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL 587
SGG VA A++ A Y G+AGG A+AD+LFG NP GKLP+T+Y + +L
Sbjct: 670 SGGA--VALVPETEACDAVLQAWYAGEAGGQAVADVLFGDYNPSGKLPVTFYKSD--ADL 725
Query: 588 PMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHG 647
P R GRTYR+++G ++PFG G+SYT+F G+
Sbjct: 726 P-------DFLDYRMTGRTYRYFRGIPLFPFGFGLSYTSFAF--------------GKPR 764
Query: 648 SINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVA 707
N + V+V N G +DGA + V+ PA P K L
Sbjct: 765 YENGKL-------------------YVEVTNTGKRDGAEVVQVYVKNPADADGPVKTLRG 805
Query: 708 FEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGV 767
F ++ + AG ++RV I + ++ + T R+ G H + +G + L ++ +
Sbjct: 806 FARIDLKAGERRRVEIAMPRERFEGWDATTNTMRVKPGNHLLMVGSSSRDADLQTISVSI 865
>gi|154493680|ref|ZP_02033000.1| hypothetical protein PARMER_03021 [Parabacteroides merdae ATCC
43184]
gi|423723902|ref|ZP_17698051.1| hypothetical protein HMPREF1078_02038 [Parabacteroides merdae
CL09T00C40]
gi|154086890|gb|EDN85935.1| glycosyl hydrolase family 3 C-terminal domain protein
[Parabacteroides merdae ATCC 43184]
gi|409240709|gb|EKN33484.1| hypothetical protein HMPREF1078_02038 [Parabacteroides merdae
CL09T00C40]
Length = 868
Score = 280 bits (716), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 163/440 (37%), Positives = 234/440 (53%), Gaps = 37/440 (8%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
PF LPI +R++DL+ RL+ +EK+ +++ A+ RLGI Y+WW+EALHGV+ G
Sbjct: 27 PFRNPDLPIDERIDDLLKRLTAEEKIGQMMNTTPAIERLGIPEYDWWNEALHGVARAGK- 85
Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTY 152
AT FPQ I A++F+ +VSDEARA Y+ GLT+
Sbjct: 86 ---------ATVFPQAIAMAATFDDDALYETFTMVSDEARAKYHQYQKNKEYDRYKGLTF 136
Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL 212
W+PN+NIFRDPRWGRG ET GEDP L+ + + V+GLQG D K A KH+ +
Sbjct: 137 WTPNINIFRDPRWGRGMETYGEDPYLTERMGVAVVKGLQGDDPKYFKTHACAKHYAVHSG 196
Query: 213 DNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILK 272
WN R F+ V+ +D+ T+ F V +G V VMC+YN+ G P C+ +L
Sbjct: 197 PEWN---RHEFDVTVTPRDLWQTYLPAFEALVKKGNVQEVMCAYNRYQGKPCCSSDKLLI 253
Query: 273 RTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--------EEAAADAIRAGLDLDCGPF 324
+R W I+SDC ++ ++ Q TP E A+ADA+ G DL+CG
Sbjct: 254 DILRNSWGYENIILSDCGAINDFW--QRDERTPRHETHPDAESASADAVLNGTDLECGNS 311
Query: 325 LGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQE 384
+ A++ G +SE D++ +L L + LGMFD + PY + V +P+H
Sbjct: 312 YKALIK-ALKEGKISENDLDVSLRRLLKGRFELGMFDPD-ERVPYAQIPYNVVESPEHVA 369
Query: 385 LALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG 444
ALE A + +VLLKN+ +LPLS R +AV+GPN+ + + NY G T L+G
Sbjct: 370 QALEMAHKSMVLLKNKNNTLPLSKTI-RKIAVVGPNAADSTMLWANYNGFPTHTVTILEG 428
Query: 445 IGRY---ARTIHQQGCKDVA 461
I I++ GC A
Sbjct: 429 IRNKVPDTEVIYELGCNHAA 448
Score = 132 bits (331), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 99/311 (31%), Positives = 141/311 (45%), Gaps = 49/311 (15%)
Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
+ A + AD + V G+ +E E + DR + +P QQE+V K A
Sbjct: 595 YAATAAKVKDADVIVYVGGISPRLEGEEMPVNVEGFKKGDRTNIEIPKVQQEMV-KALKA 653
Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
+ P + VL +G + A D I AI+ A Y GQ GTA+ADILFG NP G+LP+T
Sbjct: 654 TGKPVVYVLCTGSAL--ALNWEDANIDAILNAWYGGQEAGTAVADILFGDYNPSGRLPVT 711
Query: 578 WYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
+Y I LP + +M+ GRTYR+ +YPFG+G+SYTNF +
Sbjct: 712 FYKS--IDQLPDFEDYSMK--------GRTYRYMTETPLYPFGYGLSYTNFAYR------ 755
Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA 696
NA +S K+T + LT D+ N G DG ++ P
Sbjct: 756 -------------NAKLSSG--KITKDQSVTLTF----DIANTGKMDGDEVAQIYIKNPN 796
Query: 697 GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKH 756
P K L AF +VHV AG Q V I + + S D + T + G++ I GG+
Sbjct: 797 DPEGPIKALKAFLRVHVKAGDSQEVNIELTPEAFHSFNDNTQTMEVRPGKYQILYGGSSD 856
Query: 757 SVSLHAATLGV 767
+L L +
Sbjct: 857 EKALERLELTI 867
>gi|423344787|ref|ZP_17322476.1| hypothetical protein HMPREF1060_00148 [Parabacteroides merdae
CL03T12C32]
gi|409224378|gb|EKN17311.1| hypothetical protein HMPREF1060_00148 [Parabacteroides merdae
CL03T12C32]
Length = 866
Score = 280 bits (716), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 163/440 (37%), Positives = 234/440 (53%), Gaps = 37/440 (8%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
PF LPI +R++DL+ RL+ +EK+ +++ A+ RLGI Y+WW+EALHGV+ G
Sbjct: 25 PFRNPDLPIDERIDDLLKRLTAEEKIGQMMNTTPAIERLGIPEYDWWNEALHGVARAGK- 83
Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTY 152
AT FPQ I A++F+ +VSDEARA Y+ GLT+
Sbjct: 84 ---------ATVFPQAIAMAATFDDDALYETFTMVSDEARAKYHQYQKNKEYDRYKGLTF 134
Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL 212
W+PN+NIFRDPRWGRG ET GEDP L+ + + V+GLQG D K A KH+ +
Sbjct: 135 WTPNINIFRDPRWGRGMETYGEDPYLTERMGLAVVKGLQGDDPKYFKTHACAKHYAVHSG 194
Query: 213 DNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILK 272
WN R F+ V+ +D+ T+ F V +G V VMC+YN+ G P C+ +L
Sbjct: 195 PEWN---RHEFDVTVTPRDLWQTYLPAFEALVKKGNVQEVMCAYNRYQGKPCCSSDKLLI 251
Query: 273 RTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--------EEAAADAIRAGLDLDCGPF 324
+R W I+SDC ++ ++ Q TP E A+ADA+ G DL+CG
Sbjct: 252 DILRNSWGYENIILSDCGAINDFW--QRDERTPRHETHPDAESASADAVLNGTDLECGNS 309
Query: 325 LGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQE 384
+ A++ G +SE D++ +L L + LGMFD + PY + V +P+H
Sbjct: 310 YKALIK-ALKEGKISENDLDVSLRRLLKGRFELGMFDPD-ERVPYAQIPYNVVESPEHVA 367
Query: 385 LALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG 444
ALE A + +VLLKN+ +LPLS R +AV+GPN+ + + NY G T L+G
Sbjct: 368 QALEMAHKSMVLLKNKNNTLPLSKTI-RKIAVVGPNAADSTMLWANYNGFPTHTVTILEG 426
Query: 445 IGRY---ARTIHQQGCKDVA 461
I I++ GC A
Sbjct: 427 IRNKVPDTEVIYELGCNHAA 446
Score = 132 bits (332), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 99/311 (31%), Positives = 141/311 (45%), Gaps = 49/311 (15%)
Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
+ A + AD + V G+ +E E + DR + +P QQE+V K A
Sbjct: 593 YAATAAKVKDADVIVYVGGISPRLEGEEMPVNVEGFKKGDRTNIEIPKVQQEMV-KALKA 651
Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
+ P + VL +G + A D I AI+ A Y GQ GTA+ADILFG NP G+LP+T
Sbjct: 652 TGKPVVYVLCTGSAL--ALNWEDANIDAILNAWYGGQEAGTAVADILFGDYNPSGRLPVT 709
Query: 578 WYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
+Y I LP + +M+ GRTYR+ +YPFG+G+SYTNF +
Sbjct: 710 FYKS--IDQLPDFEDYSMK--------GRTYRYMTETPLYPFGYGLSYTNFAYR------ 753
Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA 696
NA +S K+T + LT D+ N G DG ++ P
Sbjct: 754 -------------NAKLSSG--KITKDQSVTLTF----DIANTGKMDGDEVAQIYIKNPN 794
Query: 697 GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKH 756
P K L AF +VHV AG Q V I + + S D + T + G++ I GG+
Sbjct: 795 DPEGPIKALKAFLRVHVKAGDSQEVNIELTPEAFHSFNDNTQTMEVRPGKYQILYGGSSD 854
Query: 757 SVSLHAATLGV 767
+L L +
Sbjct: 855 EKALERLELTI 865
>gi|255013451|ref|ZP_05285577.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. 2_1_7]
gi|410103695|ref|ZP_11298616.1| hypothetical protein HMPREF0999_02388 [Parabacteroides sp. D25]
gi|409236424|gb|EKN29231.1| hypothetical protein HMPREF0999_02388 [Parabacteroides sp. D25]
Length = 868
Score = 280 bits (716), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 162/439 (36%), Positives = 238/439 (54%), Gaps = 35/439 (7%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
PF LP+ +R++DL+ RL+ +EK+ +++ A+ RLGI Y+WW+EALHGV+ G
Sbjct: 27 PFRNPDLPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVARAG-- 84
Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTY 152
AT FPQ I A++F+ +VSDEARA Y+ GLT+
Sbjct: 85 --------RATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQYQKDKEYDRYKGLTF 136
Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL 212
W+PN+NIFRDPRWGRG ET GEDP L+ K + RGLQG D + K A KH+ +
Sbjct: 137 WTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQGDDPNYYKTHACAKHYAVHSG 196
Query: 213 DNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILK 272
WN R F+ + + +D+ +T+ F V EG V VMC+YN+ G P C+ +L
Sbjct: 197 PEWN---RHEFDVEATPRDLYETYLPAFEALVKEGDVQEVMCAYNRFEGKPCCSSDKLLI 253
Query: 273 RTIRGEWRLNGYIVSDCDSVGVYY----DTQHFTSTP--EEAAADAIRAGLDLDC-GPFL 325
+R W + I+SDC ++ ++ +T + P E A+ADA+ G DL+C G +
Sbjct: 254 DILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETHPDAESASADAVLNGTDLECGGSYR 313
Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
L+ A+ G +SE D++ +L L + LGMFD + PY + V +P+H
Sbjct: 314 ALN--KALADGKISEKDLDVSLRRLLKGRFELGMFDPD-ERVPYSKIPYSVVESPEHIAK 370
Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
AL+ AR+ IVLLKN+ LPL + +AV+GPN+ + + NY G T ++GI
Sbjct: 371 ALDMARKSIVLLKNKNNMLPLDK-NIKKIAVVGPNAADSTMLWANYNGFPTKTVTIVEGI 429
Query: 446 GRY---ARTIHQQGCKDVA 461
A I++ GC A
Sbjct: 430 RNKVPNAEVIYELGCNHTA 448
Score = 127 bits (319), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 90/309 (29%), Positives = 146/309 (47%), Gaps = 49/309 (15%)
Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
F A + AD + V G+ +E E + DR + +P Q+E+V K +A
Sbjct: 595 FQATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEMV-KALVA 653
Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
+ P + V+ +G + + + +ND + AI+ A Y GQ GGTA+AD+LFG NP G+LP+T
Sbjct: 654 TGKPVVYVVCTGSALALNW-END-HVNAILNAWYGGQEGGTAVADVLFGDYNPAGRLPIT 711
Query: 578 WYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
+Y + LP + +M+ GRTYR+ +YPFG+G+SYT F +
Sbjct: 712 FYKS--VDQLPDFQDYSMK--------GRTYRYMTQTPLYPFGYGLSYTTFDYK------ 755
Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA 696
NA +S I + ++ + D+ N G DG ++ P
Sbjct: 756 -------------NAKLSKDKI------ASNESVTLSFDIANTGKMDGDEVAQIYIKNPN 796
Query: 697 GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKH 756
P K + AF++V+V AG++Q V I + + S D + T + G++ I GG+
Sbjct: 797 DPAGPLKAMKAFKRVNVKAGSEQPVSIQLEPKAFQSFNDNTQTMEVRPGKYQILYGGSSD 856
Query: 757 SVSLHAATL 765
+L L
Sbjct: 857 DKTLKKIDL 865
>gi|423331656|ref|ZP_17309440.1| hypothetical protein HMPREF1075_01453 [Parabacteroides distasonis
CL03T12C09]
gi|409230226|gb|EKN23094.1| hypothetical protein HMPREF1075_01453 [Parabacteroides distasonis
CL03T12C09]
Length = 868
Score = 280 bits (715), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 162/439 (36%), Positives = 238/439 (54%), Gaps = 35/439 (7%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
PF LP+ +R++DL+ RL+ +EK+ +++ A+ RLGI Y+WW+EALHGV+ G
Sbjct: 27 PFRNPDLPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVARAG-- 84
Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTY 152
AT FPQ I A++F+ +VSDEARA Y+ GLT+
Sbjct: 85 --------RATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQYQKDKEYDRYKGLTF 136
Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL 212
W+PN+NIFRDPRWGRG ET GEDP L+ K + RGLQG D + K A KH+ +
Sbjct: 137 WTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQGDDPNYYKTHACAKHYAVHSG 196
Query: 213 DNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILK 272
WN R F+ + + +D+ +T+ F V EG V VMC+YN+ G P C+ +L
Sbjct: 197 PEWN---RHEFDVEATPRDLYETYLPAFEALVKEGDVQEVMCAYNRFEGKPCCSSDKLLI 253
Query: 273 RTIRGEWRLNGYIVSDCDSVGVYY----DTQHFTSTP--EEAAADAIRAGLDLDC-GPFL 325
+R W + I+SDC ++ ++ +T + P E A+ADA+ G DL+C G +
Sbjct: 254 DILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETHPDAESASADAVLNGTDLECGGSYR 313
Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
L+ A+ G +SE D++ +L L + LGMFD + PY + V +P+H
Sbjct: 314 ALN--KALADGKISEKDLDVSLRRLLKGRFELGMFDPD-ERVPYSKIPYSVVESPEHIAK 370
Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
AL+ AR+ IVLLKN+ LPL + +AV+GPN+ + + NY G T ++GI
Sbjct: 371 ALDMARKSIVLLKNKNNMLPLDK-NIKKIAVVGPNAADSTMLWANYNGFPTKTVTIVEGI 429
Query: 446 GRY---ARTIHQQGCKDVA 461
A I++ GC A
Sbjct: 430 RNKVPNAEVIYELGCNHTA 448
Score = 125 bits (315), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 90/309 (29%), Positives = 145/309 (46%), Gaps = 49/309 (15%)
Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
F A + AD + V G+ +E E + DR + +P Q+E+V K +A
Sbjct: 595 FQATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEMV-KALVA 653
Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
+ P + V+ +G + + + +ND + AI+ A Y GQ GGTA+AD+LFG NP G+LP+T
Sbjct: 654 TGKPVVYVVCTGSALALNW-END-HVNAILNAWYGGQEGGTAVADVLFGDYNPAGRLPIT 711
Query: 578 WYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
+Y + LP + +M+ GRTYR+ +YPFG+G+SYT F +
Sbjct: 712 FYKS--VDQLPDFQDYSMK--------GRTYRYMTQTPLYPFGYGLSYTTFDYK------ 755
Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA 696
NA +S I + ++ + D+ N G DG ++ P
Sbjct: 756 -------------NAKLSKDKI------ASNESVTLSFDIANTGKMDGDEVAQIYIKNPN 796
Query: 697 GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKH 756
P K + AF++V+V AG+ Q V I + + S D + T + G++ I GG+
Sbjct: 797 DPAGPLKAMKAFKRVNVKAGSAQPVSIQLEPKAFQSFNDNTQTMEVRPGKYQILYGGSSD 856
Query: 757 SVSLHAATL 765
+L L
Sbjct: 857 DKTLKKIDL 865
>gi|313205375|ref|YP_004044032.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
gi|312444691|gb|ADQ81047.1| glycoside hydrolase family 3 domain protein [Paludibacter
propionicigenes WB4]
Length = 858
Score = 280 bits (715), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 175/445 (39%), Positives = 240/445 (53%), Gaps = 40/445 (8%)
Query: 38 RTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNV 97
+ LP+ L R DL+ RL+L EK L+ + + A+PRLGIK YEWW+EALHGV
Sbjct: 20 QQLPYQNPKLSAEVRATDLLARLTLAEKAALMQNNSPAIPRLGIKAYEWWNEALHGVGRS 79
Query: 98 GPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN-----GGTA---G 149
G AT FPQ I A+SFN L VSDEARA N GG G
Sbjct: 80 GV----------ATVFPQAIGMAASFNNGLLFDAFTAVSDEARAKSNKFSEQGGLKRYQG 129
Query: 150 LTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL-KVAASCKHFT 208
LTYW+PNVNIFRDPRWGRGQET GEDP L+ + V+GLQG D K+ A KHF
Sbjct: 130 LTYWTPNVNIFRDPRWGRGQETYGEDPYLTSLMGVAVVKGLQGPDNAEYDKLHACAKHFA 189
Query: 209 AYDLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCAD 267
+ WN R FNA+ ++ +D+ +T+ F+ V + V VMC+YN+ P C
Sbjct: 190 VHSGPEWN---RHSFNAENINPRDLWETYLPAFKALVQKADVKEVMCAYNRFEDEPCCGS 246
Query: 268 PNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPE--EAAADAIRAGLDLDCGP-F 324
+L + +R +W+ +G +VSDC ++ +Y + P+ AAA+A+ G DL+CG F
Sbjct: 247 NRLLTQILRNDWKFDGLVVSDCWAISDFYKPNAHATQPDATHAAANAVLNGTDLECGSDF 306
Query: 325 LGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQE 384
L AV+ GL+ E I+ +L L + L GE +S + V + HQ
Sbjct: 307 RNL--PEAVKAGLIEEKRIDVSLKRLLKARFEL----GEMNSDQVWPISYSVVNSEKHQN 360
Query: 385 LALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG 444
LAL A + IVLL+N LPLS + +AV+GPN++ +V GNY G T L+
Sbjct: 361 LALRMAEESIVLLQNNNNILPLS--KKLKIAVMGPNANDSVMQWGNYNGFPAHTVTLLEA 418
Query: 445 IGRY---ARTIHQQGCK---DVACA 463
+ + A+ I++ GC DVA +
Sbjct: 419 MRKSFPGAQLIYEPGCDRTMDVAVS 443
Score = 102 bits (255), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 82/309 (26%), Positives = 131/309 (42%), Gaps = 49/309 (15%)
Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
A+I + AD + G+ S+E E + DR + LP Q+ L+ + A
Sbjct: 585 LSASIAKVKDADVVVFAGGIAPSLEGEEMRVTVPGFKGGDRTDIELPAIQRRLLQALKDA 644
Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
K + V SG + AI+ A YPGQAGGTA+A++L G NP G+LP+T
Sbjct: 645 GK-KVVFVNFSGSAM--GLVPETQSCEAILQAWYPGQAGGTAVANVLLGNYNPSGRLPVT 701
Query: 578 WYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
+Y + LP + +M+ GRTYR+ ++ FG+G+SYT FV A
Sbjct: 702 FYKN--VAQLPDFEDYSMK--------GRTYRYMTEKPLFSFGYGLSYTKFVLGTAK--- 748
Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA 696
++ +IK TL + V V N G G L V+
Sbjct: 749 ----------------LNKSSIKANE------TLKITVPVTNAGKVAGTEVLQVYVRKVK 786
Query: 697 GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKH 756
P K L F+KV++ G ++ I++ + + GE+ ++ G +
Sbjct: 787 DVDGPAKTLRGFKKVNIEPGKTSQISIDLTSSAFEFYDWTQRKMMVTPGEYEVYYGTSSD 846
Query: 757 SVSLHAATL 765
+ L A++
Sbjct: 847 AKDLKKASI 855
>gi|393786770|ref|ZP_10374902.1| hypothetical protein HMPREF1068_01182 [Bacteroides nordii
CL02T12C05]
gi|392658005|gb|EIY51635.1| hypothetical protein HMPREF1068_01182 [Bacteroides nordii
CL02T12C05]
Length = 864
Score = 279 bits (714), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 165/425 (38%), Positives = 238/425 (56%), Gaps = 34/425 (8%)
Query: 36 TTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVS 95
T LP+ +L QR DL+ RL+++EKV L+ + + + RLGIK YEWW+EALHGV+
Sbjct: 24 TYSQLPYQDPNLTPEQRATDLLQRLTIEEKVSLMQNNSPGILRLGIKPYEWWNEALHGVA 83
Query: 96 NVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAM---YN-----GGT 147
G AT FPQ I A+SF+ TL + +SDEARA +N
Sbjct: 84 RAGL----------ATVFPQTIGMAASFDDTLIYEVFNAISDEARAKNRHFNTLGQYKRY 133
Query: 148 AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL-KVAASCKH 206
GLT W+PN+NIFRDPRWGRGQET GEDP L+ + + V+GLQG D R K+ A KH
Sbjct: 134 QGLTMWTPNINIFRDPRWGRGQETYGEDPYLTSRMGVAVVKGLQGPDSARYNKLHACAKH 193
Query: 207 FTAYDLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTC 265
F + WN R FNA+ + +D+ +T+ F+ V E V VMC+YN+ G P C
Sbjct: 194 FAVHSGPEWN---RHSFNAENIIPRDLWETYLPAFKTLVQEADVKEVMCAYNRFEGDPCC 250
Query: 266 ADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPE--EAAADAIRAGLDLDCGP 323
+L + +R EW G +VSDC ++ ++ T+ + P+ A+A+A+ G DL+CG
Sbjct: 251 GSNRLLTQILRNEWGFKGIVVSDCGAISDFWGTKKHNTHPDAAHASAEAVLNGTDLECGS 310
Query: 324 FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLG-MFDGEPSSQPYGHLGPKDVCTPDH 382
TE A++ G++SE IN ++ L + LG M + P + PY V +P H
Sbjct: 311 NYRKLTE-AIKAGIISEKQINVSVKRLLKARFELGEMENIHPWTLPYS-----IVDSPKH 364
Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
+ LAL+ A + + LL+N+G LPL + +A+IGPN++ +V GNY G +T L
Sbjct: 365 RCLALKMAHETMTLLQNKGKVLPLD--KQARIAIIGPNANDSVMQWGNYNGTPSHTSTLL 422
Query: 443 QGIGR 447
+
Sbjct: 423 SAFRK 427
Score = 117 bits (292), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 88/307 (28%), Positives = 140/307 (45%), Gaps = 49/307 (15%)
Query: 472 IDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGP 521
++ + D I G+ S+E E + DR + P Q+++++ + A K
Sbjct: 595 LEKLKDIDIIIFAGGISPSLEGEEMNVSATGFKGGDRTDIEFPAVQRKVLAALKEAGK-K 653
Query: 522 TILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQ 581
ILV SG + A AI+ A YPG+ GG AI ++LFG NP G+LP+T+Y
Sbjct: 654 VILVNFSGSAM--ALTPETKSCDAILQAWYPGEEGGMAIVNVLFGDYNPAGRLPITFYKS 711
Query: 582 EYITNLPMTE-MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
I LP E +M+ GRTYR+ + ++PFG+G+SYT F
Sbjct: 712 --IDQLPDFENYSMK--------GRTYRYMQEEPLFPFGYGLSYTTFA------------ 749
Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA 700
H + N+ +G+ KVT + + +KN+G +DG + ++ A
Sbjct: 750 -FGKIHINKNSLSAGE--KVT----------LHIPIKNIGDRDGVEVVQIYIQRQADKEG 796
Query: 701 PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
P K L AF++V +P G Q V I + + + T R GE+NI G + L
Sbjct: 797 PVKTLRAFKRVEIPKGKTQEVKIELPYVAFEWFDPTTNTMRPIQGEYNILYGNSSRMKDL 856
Query: 761 HAATLGV 767
+A + +
Sbjct: 857 QSAKIQI 863
>gi|261408260|ref|YP_003244501.1| glycoside hydrolase family protein [Paenibacillus sp. Y412MC10]
gi|261284723|gb|ACX66694.1| glycoside hydrolase family 3 domain protein [Paenibacillus sp.
Y412MC10]
Length = 763
Score = 279 bits (714), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 218/678 (32%), Positives = 336/678 (49%), Gaps = 86/678 (12%)
Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
GAT FP +T S++N L+ +I R V+ E RA GG+A +SP +++ RDPRWGR
Sbjct: 123 GATVFPVPLTIGSTWNTELFRSISRAVAAETRA--QGGSA---TYSPVLDVVRDPRWGRT 177
Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGD-RLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
+ET GEDP L ++A + V+GLQG D + A+ KHF Y R +
Sbjct: 178 EETFGEDPHLVTEFAVAAVQGLQGERLDSHTSLLATLKHFAGYGASEGG---RNGAPVHM 234
Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
+++ + +PFR V G + SVM +YN+++GVP + +L+ +R W +G++++
Sbjct: 235 GLRELHEVDLLPFRKAVEAGAL-SVMTAYNEIDGVPCTSSGYLLQDVLREAWGFDGFVIT 293
Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLD-CGPFLGLHTESAVQRGLLSEIDINNA 346
DC ++ + + + EAAA +++AG+D++ G H A+++GL++E D+N A
Sbjct: 294 DCGAIHMLACGHNTAGSGVEAAAQSLKAGVDMEMSGTMFRAHLHQALEQGLITEEDLNRA 353
Query: 347 LVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPL 406
L ++ RLG+FD P P + + +H LA +AA +GIVLLKN+G LPL
Sbjct: 354 AGRVLELKFRLGLFD-RPYVDP--AWAEQVIGCKEHIALAYQAAAEGIVLLKNEGNLLPL 410
Query: 407 SHIRHRTVAVIGPNSDVTVTMIGNYAGIA--CGYTTPLQGIGRY---ARTIHQQGCKDVA 461
T+AVIGPN+ +G+Y T L GI R +R ++ GC+ +
Sbjct: 411 DS-SSGTIAVIGPNAHAPYHQLGDYTSPQPPGQIVTVLDGIRRRLGDSRVLYAPGCR-IQ 468
Query: 462 CADDQLFGAAIDASRQADATILVMG------------------------LDQSIE-AEAL 496
+ F A+ + QAD ++V+G + +E E +
Sbjct: 469 GDSREGFPRALACAEQADVIVMVLGGSSARDFGEGTIDLRTGASVVTGHAESDMECGEGI 528
Query: 497 DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAG 556
DR+ L L G Q EL+ ++ K P I+V ++G PI + D I +I+ A YPGQ G
Sbjct: 529 DRSTLTLMGVQLELLQELHKLGK-PVIVVYINGRPITEPWI--DEHIPSIVEAWYPGQEG 585
Query: 557 GTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVY 616
G+AIAD+LFG NP G+LP++ P+E + LP + ++R G+ Y Y
Sbjct: 586 GSAIADMLFGDINPSGRLPLS-IPKE-VGQLPNS------YNARRTRGKRYLETDLAPRY 637
Query: 617 PFGHGMSYTNFVH-TVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVD 675
PFG G+SYT F + + P VV I G+A V++D
Sbjct: 638 PFGFGLSYTEFRYGRLTVEPAVVP-------------IGGEAT-------------VRID 671
Query: 676 VKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVV 734
V N G++DGA + L S A P K L F KV + AG Q V I + L ++
Sbjct: 672 VTNAGARDGAEVVQLYVSDLAASVTRPEKALKGFRKVFLKAGETQEVTFTIG-SEQLELI 730
Query: 735 DRSGTRRIPLGEHNIHIG 752
+ GE I +G
Sbjct: 731 GLDLKPVVEPGEFRIQVG 748
>gi|402304900|ref|ZP_10823963.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
sp. MSX73]
gi|400380686|gb|EJP33499.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
sp. MSX73]
Length = 866
Score = 279 bits (713), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 164/440 (37%), Positives = 239/440 (54%), Gaps = 41/440 (9%)
Query: 28 FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
F+C A + P+ L +R DL RL+L+EK KL+ + + A+PRLGI +EWW
Sbjct: 14 FSCV---AGAQQYPYQNPRLSSQERAEDLCSRLTLEEKTKLMRNSSPAIPRLGIPQFEWW 70
Query: 88 SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT 147
SEALHG++ G AT FPQ A+S++ L + SDEA A N
Sbjct: 71 SEALHGIARNG----------FATVFPQTTAMAASWDDELLYRVFCAASDEAVAKNNLAR 120
Query: 148 A--------GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR-- 197
G++ W+PN+NIFRDPRWGRGQET GEDP L+ + + V GLQG R
Sbjct: 121 KSGDIKRYQGVSIWTPNINIFRDPRWGRGQETYGEDPYLTSRMGLAVVNGLQGQPFRRDM 180
Query: 198 ---------LKVAASCKHFTAYDLDNWNGVDRFHFNA-KVSKQDIEDTFDVPFRMCVMEG 247
K A KH+ + WN R F+ ++ ++D+ +T+ F+ V EG
Sbjct: 181 RPFTERPRYYKTLACAKHYAVHSGPEWN---RHVFDVERLPERDLWETYLPAFKSLVQEG 237
Query: 248 KVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDT--QHFTSTP 305
V VMC+Y +++G P C + L + +RGEW NG +VSDC ++ +Y H TP
Sbjct: 238 NVREVMCAYQRIDGSPCCGNTRYLHQILRGEWGYNGLVVSDCGAISDFYREGHHHVVETP 297
Query: 306 EEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPS 365
EA+A +RAG D++CG AV++GL+S I+ ++V L + +G FD E
Sbjct: 298 AEASAMGVRAGTDVECGAVYAT-LPRAVEQGLISREAIDTSVVRLLKARFEVGDFDSE-K 355
Query: 366 SQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTV 425
P+ GP+ + + H+ LAL+ AR+ + LL+N+ LPLS R +AV+GPN++ +V
Sbjct: 356 LVPWKLTGPEVIASETHRRLALDMARESMTLLQNRNRLLPLSKNGLR-IAVMGPNANDSV 414
Query: 426 TMIGNYAGIACGYTTPLQGI 445
+ GNY G TT L+GI
Sbjct: 415 MLWGNYTGYPISTTTILKGI 434
Score = 115 bits (289), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 85/300 (28%), Positives = 133/300 (44%), Gaps = 55/300 (18%)
Query: 478 ADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLM 527
AD + V G+ +E E + DR + LP Q+E++ + A K + V
Sbjct: 611 ADVVVFVGGISPRLEGEEMKVDAPGFKGGDRTSIELPEAQREVIRLLRQAGK-LVVFVNC 669
Query: 528 SGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL 587
SGG VA A++ A Y G+AGG A+AD+LFG NP GKLP+T+Y + +L
Sbjct: 670 SGGA--VALVPEAEACDAVLQAWYAGEAGGQAVADVLFGDYNPSGKLPVTFYKSD--ADL 725
Query: 588 PMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHG 647
P R GRTYR+++G ++PFG G+SYT+F G+
Sbjct: 726 P-------DFLDYRMTGRTYRYFRGTPLFPFGFGLSYTSFAF--------------GKPR 764
Query: 648 SINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVA 707
N + V+V N G +DGA + V+ PA P K L
Sbjct: 765 YENGML-------------------YVEVTNTGKRDGAEVVQVYVKNPADADGPVKTLRG 805
Query: 708 FEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGV 767
F ++ + AG ++RV I + ++ + T R+ G H + +G + L ++ +
Sbjct: 806 FARIDLKAGERRRVEIAMPRERFEGWDATANTMRVKPGNHLLMVGSSSRDADLQTISVSI 865
>gi|317477153|ref|ZP_07936394.1| glycosyl hydrolase family 3 N terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
gi|316906696|gb|EFV28409.1| glycosyl hydrolase family 3 N terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
Length = 863
Score = 279 bits (713), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 149/403 (36%), Positives = 230/403 (57%), Gaps = 17/403 (4%)
Query: 45 VSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFG 104
+S PI R+ ++I +++L+EKV L + + ++PRL + Y +W+E LHGV+ G
Sbjct: 52 LSQPISVRIENIIRQMTLEEKVAQLSNESDSIPRLNLPSYNYWNECLHGVARAGE----- 106
Query: 105 GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPR 164
T FPQ I AS+++ L + I +S EAR Y GLTYW+P +N+ RDPR
Sbjct: 107 -----VTVFPQAINLASTWDTLLVKRIASAISTEARLKYLDIGKGLTYWAPTINMARDPR 161
Query: 165 WGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFN 224
WGR +ET GEDP L+ + ++V+GLQG + LK A+ KHF A + +N DRF +
Sbjct: 162 WGRNEETYGEDPYLTSRLGVAFVKGLQGDHPNYLKTVATVKHFVANNQEN----DRFSSS 217
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+++ + + + + + CV E V S+M +YN NG+P +L+ +R EW +G+
Sbjct: 218 SQIPTKQLYEYYFPAYEACVKEANVQSIMTAYNAFNGIPPSGSTWLLEDVLRKEWGFDGF 277
Query: 285 IVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDIN 344
+VSDC ++GV ++ EEAAA I +G DL+CG + +AVQRGL+SE I+
Sbjct: 278 VVSDCGAIGVMNWQHRIVNSLEEAAALGINSGCDLECGGTYRENLVAAVQRGLVSEYAID 337
Query: 345 NALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSL 404
AL LT++ +LG FD PY H K + + LA EAA + I+LLKN+ L
Sbjct: 338 RALTRVLTMRFKLGEFD-PIELVPYNHYDKKLLAGEQFRRLAYEAAVKSIILLKNEDNFL 396
Query: 405 PLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGR 447
P+ R++A++GP +D +G Y+G + LQG+ +
Sbjct: 397 PIDKKDVRSIAIVGPFAD--NNYLGGYSGKPVHNISLLQGVKK 437
Score = 103 bits (256), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 84/286 (29%), Positives = 133/286 (46%), Gaps = 43/286 (15%)
Query: 477 QADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTI-LVLMSGGPIDVA 535
+AD ++ +G D + E D + LP Q+ L+ ++ + P I L+L +G P+
Sbjct: 608 RADLVLVALGNDGKLARENRDLPSIYLPMTQELLLKEIYKVN--PRIALILQTGNPLTSQ 665
Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
+A + +I+ A YPGQ GG A+A ILFG NP GKLPMT Y E LP
Sbjct: 666 WAAE--HVPSILQAWYPGQEGGAALAGILFGLENPSGKLPMTIYESE--QQLPNI----- 716
Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG 655
+ GRTY++ +Y FGHG+SY+NF + VV V DG
Sbjct: 717 -LDYDIWKGRTYQYLSSKPLYGFGHGLSYSNFEYADLQCNDVVHV--DG----------- 762
Query: 656 KAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG--HWAPHKQLVAFEKVHV 713
TL + VKN+ G + V+ + + P K+L+AF +V++
Sbjct: 763 -------------TLQCSIKVKNISDVVGEEVIQVYVSREKTPVYTFPLKKLIAFARVNL 809
Query: 714 PAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVS 759
+ V I + LSV + G ++ G++++ +GG + +S
Sbjct: 810 KPNESKTVTFTI-TPRQLSVW-QDGEWKMLSGKYSLFVGGGQKELS 853
>gi|409195436|ref|ZP_11224099.1| glycoside hydrolase family protein [Marinilabilia salmonicolor JCM
21150]
Length = 867
Score = 278 bits (712), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 162/439 (36%), Positives = 247/439 (56%), Gaps = 35/439 (7%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
+R +DL+ L+L+EKV L++ A+ RLGI+ Y WW+EALHGV+ G A
Sbjct: 35 ERADDLLKELTLEEKVSLMVDRNTAIERLGIEEYNWWNEALHGVARAGQ----------A 84
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYWSPNVNIFRD 162
T FPQ + A++F+ + + SDEARA ++ G GLT W+PN+N+FRD
Sbjct: 85 TVFPQPVGMAAAFDRDMVLDVFSAASDEARAKHHFFKERGERGRYQGLTMWTPNINVFRD 144
Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDRF 221
PRWGRG E GEDP ++G + V+GLQG G K+ A KH+ + WN R
Sbjct: 145 PRWGRGMEAYGEDPFMNGVLGTAVVKGLQGDRSGKYDKLHACAKHYAVHSGPEWN---RH 201
Query: 222 HFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
FNA+ + +D+ +T+ F+ V++G V VMC+YN+ G P C + +L+ +R EW
Sbjct: 202 SFNAENIRPRDLHETYLPAFKKLVIDGDVRMVMCAYNRFEGEPCCGNNQLLRDILRNEWG 261
Query: 281 LNGYIVSDCDSVGVYYDTQHFTSTPE--EAAADAIRAGLDLDCGPFLGLHTESAVQRGLL 338
+G +VSDC ++ +++ P+ A+ DA+ AG DL+CG E AV++GL+
Sbjct: 262 FDGVVVSDCWAINDFFNKDAHAMYPDAKTASTDAVLAGTDLNCGDSYPSLVE-AVEQGLI 320
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
+E ++ +L L + LG D + + + + V +P H E+ALEAAR+ + LL
Sbjct: 321 TEEQLDISLRRLLIARFELGEMDPDEEVE-WSKIPHSVVSSPTHSEMALEAARKSMTLLM 379
Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQ 454
N+ +LPL TVAV+GPN++ ++ GNY G TT LQGI G + I++
Sbjct: 380 NKNGALPLKK-EGLTVAVMGPNANDSLMQWGNYNGTPATTTTILQGIRNALGNDDQVIYE 438
Query: 455 QGCKDVACADDQLFGAAID 473
QG + V DD++F + D
Sbjct: 439 QGTQWV---DDRIFKSVFD 454
Score = 114 bits (286), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 86/303 (28%), Positives = 135/303 (44%), Gaps = 49/303 (16%)
Query: 478 ADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLM 527
AD + G+ +E E + DR + LP Q+E++ + A K ILV
Sbjct: 603 ADVVVFASGISPFLEGEEMGVDLPGFKGGDRTDIALPAIQKEMLKALHKAGK-EIILVNC 661
Query: 528 SGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL 587
SG I F + +AI+ A YPGQAGG A+A++LFG NP G+LP+T+Y + L
Sbjct: 662 SGSAI--GFEEATDYSSAILQAWYPGQAGGQAVAEVLFGDYNPAGRLPVTFYKS--VDQL 717
Query: 588 PMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHG 647
P Q RTYR+++G +YPFG+G+SYT F + + P + +
Sbjct: 718 P-------DFQDYNMTNRTYRYFEGEPLYPFGYGLSYTTFSY---DQPELSQTSI----- 762
Query: 648 SINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVA 707
+T ++KV+ V N G DG + ++ P P L
Sbjct: 763 ---STEEEASLKVS--------------VANTGDYDGEEVVQLYLQKPDDTEGPSLTLRG 805
Query: 708 FEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPL-GEHNIHIGGTKHSVSLHAATLG 766
F++V +P G V + + L + R PL G++ + +GG+ L L
Sbjct: 806 FQRVFIPKGETVEVEFQL-TEEVLEWWNADAQRMTPLAGDYRLLVGGSSRMQDLQEVALT 864
Query: 767 VIK 769
V++
Sbjct: 865 VVQ 867
>gi|427387362|ref|ZP_18883418.1| hypothetical protein HMPREF9447_04451 [Bacteroides oleiciplenus YIT
12058]
gi|425725523|gb|EKU88394.1| hypothetical protein HMPREF9447_04451 [Bacteroides oleiciplenus YIT
12058]
Length = 865
Score = 278 bits (711), Expect = 7e-72, Method: Compositional matrix adjust.
Identities = 171/442 (38%), Positives = 245/442 (55%), Gaps = 39/442 (8%)
Query: 35 ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
AT + P+ L +R DL+ R++L+EK+ + +G+ A+ RLGI Y WW+EALHGV
Sbjct: 18 ATAQNEPYKNPDLTPSERAWDLLKRMTLEEKISQMKNGSPAIERLGIPAYNWWNEALHGV 77
Query: 95 SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNG--------G 146
+ G AT FPQ I A++F+ +VSDEARA Y+ G
Sbjct: 78 ARAGK----------ATVFPQAIGLAATFDNQAVHETFSIVSDEARAKYHDFQRKGERDG 127
Query: 147 TAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGS-DGDRLKVAASCK 205
GLT+W+PN+NI+RDPRWGRG ET GEDP L+ + V+GLQG G K A K
Sbjct: 128 YKGLTFWTPNINIYRDPRWGRGMETYGEDPYLTSLMGLAVVKGLQGDGTGKYDKTHACAK 187
Query: 206 HFTAYDLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
H+ + WN R F+AK +S++D+ +T+ F+ V EGKV VMC+YN+ G P
Sbjct: 188 HYAVHSGPEWN---RHSFDAKNISQRDLWETYLPAFKTLVTEGKVKEVMCAYNRYEGEPC 244
Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCG 322
C++ +L R +R +W + +VSDC ++G +Y H + P A+ADA+ +G DL+CG
Sbjct: 245 CSNKQLLIRILREDWGYDDIVVSDCGAIGDFYYPNHHETHPTAAAASADAVVSGTDLECG 304
Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP----SSQPYGHLGPKDVC 378
E AV++GL+SE IN ++ L + +LGMFD S PY + K
Sbjct: 305 GSYSSLNE-AVRKGLISEDKINESVFRLLRARFQLGMFDDNTLVSWSEIPYSVVESK--- 360
Query: 379 TPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGY 438
+H ALE AR+ +VLL N+ LPLS R VAV+GPN++ +V + NY G
Sbjct: 361 --EHVAKALEMARKSMVLLTNKNNILPLSK-SVRKVAVLGPNANDSVMLWANYNGFPTKS 417
Query: 439 TTPLQGIGRY---ARTIHQQGC 457
T L+GI +++GC
Sbjct: 418 VTILEGIRNKLPEGAVYYEKGC 439
Score = 129 bits (325), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 91/321 (28%), Positives = 144/321 (44%), Gaps = 47/321 (14%)
Query: 459 DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQ 508
D+ + + D + +AD I V GL S+E E + DR + LP Q+
Sbjct: 582 DIGIKKEINYKEMADKAAEADVIIFVGGLSSSLEGEEMPVDLPGFRKGDRTNIDLPQVQE 641
Query: 509 ELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTS 568
E++ + K P + VL SG + + + + + AII A YPGQ GGTA+AD+LFG
Sbjct: 642 EMLKALKKTGK-PVVFVLCSGSTLALPWEAEN--LDAIIEAWYPGQQGGTAVADVLFGDY 698
Query: 569 NPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFV 628
NP G+LP+T+Y ++LP E RTYR++KG ++PFGHG+SYT F
Sbjct: 699 NPAGRLPLTFYASS--SDLPDFE-------DYDMSNRTYRYFKGRPLFPFGHGLSYTTFD 749
Query: 629 HTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTL 688
+ A A + +G L + + +KN+G G +
Sbjct: 750 YGKAKADKKILRAGEG-------------------------LTLTIPLKNIGKLSGDEVV 784
Query: 689 LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHN 748
V+ P P K L AF ++ + AG + V + V + + + G++
Sbjct: 785 QVYLRNPGDKEGPIKTLRAFRRISLEAGQAEDVLFELPVSTFEWFNPATNRMEVLPGKYE 844
Query: 749 IHIGGTKHSVSLHAATLGVIK 769
+ GGT +L T+ + K
Sbjct: 845 LLYGGTSDEKALQRLTVTLKK 865
>gi|427384377|ref|ZP_18880882.1| hypothetical protein HMPREF9447_01915 [Bacteroides oleiciplenus YIT
12058]
gi|425727638|gb|EKU90497.1| hypothetical protein HMPREF9447_01915 [Bacteroides oleiciplenus YIT
12058]
Length = 1050
Score = 278 bits (711), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 166/436 (38%), Positives = 253/436 (58%), Gaps = 42/436 (9%)
Query: 28 FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
AC + A + L + + P +R+ DL+ RL+++EK+ LL + + + RL I Y
Sbjct: 16 MACATRTAQAQEL-YKNENAPTHERIMDLLSRLTVEEKISLLRATSPGISRLDIPKYYHG 74
Query: 88 SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT 147
+EALHGV V PG +F T FPQ I A+++N L E + V+SDEARA +N
Sbjct: 75 NEALHGV--VRPG-RF-------TVFPQAIGLAATWNPVLQEQVATVISDEARARWNELD 124
Query: 148 AG----------LTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR 197
G LT+WSP VN+ RDPRWGR ET GEDP LSG ++V+GLQG+D
Sbjct: 125 QGREQKSQFSDLLTFWSPTVNMARDPRWGRTPETYGEDPYLSGIMGTAFVKGLQGNDSRY 184
Query: 198 LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYN 257
LK+ ++ KHF A + ++ +RF N ++S++ + + + F CV +GK AS+M +YN
Sbjct: 185 LKIVSTPKHFAANNEEH----NRFVCNPQISEKQLREYYLPAFEACVKDGKSASIMSAYN 240
Query: 258 QVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGL 317
+N VP + +L + +R +W GY+VSDC + + + T E AA +I+AGL
Sbjct: 241 ALNDVPCTLNAWLLTKVLRNDWGFKGYVVSDCGGPSLLVNAHKYVKTKEAAATLSIKAGL 300
Query: 318 DLDCG------PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFD-GEPSSQPYG 370
DL+CG P L SA ++ ++++ DI++A L +M+LG+FD GE PY
Sbjct: 301 DLECGDDVYDEPLL-----SAYRQYMVTDADIDSAAYRVLRARMQLGLFDSGE--KNPYT 353
Query: 371 HLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGN 430
+ P + + +HQE+AL AAR+ IVLLKNQ LPL+ + +++AV+G N+ + G+
Sbjct: 354 KISPAVIGSKEHQEVALNAARECIVLLKNQKKMLPLNAKKIKSIAVVGINA--GSSEFGD 411
Query: 431 YAGI-ACGYTTPLQGI 445
Y+G+ + LQGI
Sbjct: 412 YSGLPVIAPVSVLQGI 427
Score = 133 bits (334), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 89/300 (29%), Positives = 148/300 (49%), Gaps = 45/300 (15%)
Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
L+G A A R+ + + V+G+++SIE E DR + LP Q+E + ++ + P I+V+
Sbjct: 592 LYGEAGKAVRECETVVAVLGINKSIEREGQDRYDIQLPADQREFLQEIYKVN--PNIVVV 649
Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
+ G +A D + AI+ A YPG++GG A+A++LFG NPGG+LP+T+Y +
Sbjct: 650 LVAGS-SLAVNWMDEHVPAIVNAWYPGESGGKAVAEVLFGDYNPGGRLPLTYY--RSLDE 706
Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
LP + GRTY+++KG V+YPFG+G+SYT+F ++
Sbjct: 707 LPPFD------DYDITKGRTYKYFKGDVLYPFGYGLSYTSFKYS---------------- 744
Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQL 705
N ++ +V+ V +KN G G V+ P P K+L
Sbjct: 745 ---NLQVADGEEEVS----------VSFQLKNTGRYAGDEVAQVYVKLPEREEVMPVKEL 791
Query: 706 VAFEKVHVPAGAQQRVGINIH--VCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAA 763
FE+V + +G ++V I + + +Y + G P G +NI +G + + L A
Sbjct: 792 KGFERVSLKSGESKKVTIKLRKDLLRYWD--EAKGKFIYPSGNYNIMVGASSADIRLQKA 849
>gi|28199699|ref|NP_780013.1| family 3 glycoside hydrolase [Xylella fastidiosa Temecula1]
gi|182682443|ref|YP_001830603.1| beta-glucosidase [Xylella fastidiosa M23]
gi|417557804|ref|ZP_12208815.1| Beta-glucosidase [Xylella fastidiosa EB92.1]
gi|28057820|gb|AAO29662.1| family 3 glycoside hydrolase [Xylella fastidiosa Temecula1]
gi|182632553|gb|ACB93329.1| Beta-glucosidase [Xylella fastidiosa M23]
gi|338179587|gb|EGO82522.1| Beta-glucosidase [Xylella fastidiosa EB92.1]
Length = 882
Score = 278 bits (711), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 164/405 (40%), Positives = 230/405 (56%), Gaps = 28/405 (6%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
Q L+ +++ QEK+ ++ A A+PRLGI Y+WWSE LHG++ G A
Sbjct: 32 QHAAALVAQMTRQEKIAQTMNAAPAIPRLGIPAYDWWSEGLHGIARNG----------YA 81
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--GGT-------AGLTYWSPNVNIFR 161
T FPQ I A+S+N L + +G V S EARA +N GG AGLT WSPN+NIFR
Sbjct: 82 TVFPQAIGLAASWNTDLLQHVGTVTSTEARAKFNLTGGPGKDHPRYAGLTLWSPNINIFR 141
Query: 162 DPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRF 221
DPRWGRG ET GEDP L+ + A S++RGLQG D + A+ KHF + R
Sbjct: 142 DPRWGRGMETYGEDPYLTSQLAVSFIRGLQGDTPDHPRTIATPKHFAVHSGPEQG---RH 198
Query: 222 HFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRL 281
F+ VS D+E T+ FR +++G SVMC+YN ++G P CA +L +R +W
Sbjct: 199 SFDVDVSAYDLEATYTPAFRAAIVDGHAGSVMCAYNALHGTPACASDWLLNTRLRNDWGF 258
Query: 282 NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSE 340
NG++VSDCD++ F A+A A+++G DL+CG + L+ A+ RG + E
Sbjct: 259 NGFVVSDCDAIEDMTRFHFFRQDNASASAAALKSGNDLNCGNTYRDLN--QAIARGDIDE 316
Query: 341 IDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
++ AL+ T + RLG PY +G K + TP H+ LAL+AA Q +VLLKN
Sbjct: 317 STLDQALIRLFTARQRLGTLQPR-EHDPYAAIGIKHIDTPAHRALALQAAAQSLVLLKNS 375
Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
G +LPL T+AV+GP++D + NY G + TPL G+
Sbjct: 376 GNTLPLP--PETTLAVLGPDADSLTALEANYQGTSSTPVTPLTGL 418
Score = 132 bits (332), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 95/290 (32%), Positives = 142/290 (48%), Gaps = 48/290 (16%)
Query: 474 ASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTI 523
A ADA + +GL +E E L DR + LP Q+ L+ V K P I
Sbjct: 607 AVAHADAIVAFVGLSPEVEGEELHIDTPGFSGGDRTTIDLPATQETLLQHVKTTGK-PLI 665
Query: 524 LVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEY 583
+VLMSG + + +A++ AI+ A YPGQ+GGTAIA L G NPGG+LP+T+Y
Sbjct: 666 VVLMSGSAVALNWAQH--HADAILAAWYPGQSGGTAIAQALAGDVNPGGRLPVTFY--RS 721
Query: 584 ITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLD 643
+LP P S GRTYR++KG +YPFG+G+SYT F + AP + L
Sbjct: 722 TQDLP-------PYISYDMTGRTYRYFKGQPLYPFGYGLSYTQFAY---EAPQLSTATLK 771
Query: 644 GRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHK 703
+ TL V V+N G++ G + ++ PP AP +
Sbjct: 772 AGN----------------------TLTVTTHVRNTGTRAGDEVVQLYLEPPYSPQAPLR 809
Query: 704 QLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
LV F++V + G + + + + LS V ++G R + G +++ +GG
Sbjct: 810 SLVGFKRVTLRPGESRLLTFTLD-ARQLSSVQQTGQRSVEAGHYHLFVGG 858
>gi|219118959|ref|XP_002180246.1| beta-xylosidase [Phaeodactylum tricornutum CCAP 1055/1]
gi|217408503|gb|EEC48437.1| beta-xylosidase [Phaeodactylum tricornutum CCAP 1055/1]
Length = 682
Score = 278 bits (711), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 200/614 (32%), Positives = 303/614 (49%), Gaps = 57/614 (9%)
Query: 33 KDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLL---------ISGAAAVPRLGIKG 83
++ + LP+C +SL I +R+ DL+ L+L EKV ++ ++ V R+G+
Sbjct: 63 QNDVAKALPYCDMSLSIDERLEDLLSHLTLDEKVDMIGADPTQDVCMTHTMNVSRIGLPD 122
Query: 84 YEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY 143
Y W E +N G+ + AT F ++ A+SFN + W G V E RA+
Sbjct: 123 YYWLVE-----TNTAVGSACIAENKCATEFSGPLSIAASFNRSSWFLKGSVFGTEQRALM 177
Query: 144 N----------GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGS 193
N G GLT + PN+N RDPR+GR E PGEDP LSG+YAA V+G+Q
Sbjct: 178 NVHGERFHTHSGRHIGLTAFGPNINQQRDPRFGRSSELPGEDPFLSGQYAAHMVQGMQER 237
Query: 194 DGDRL-KVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASV 252
D + KV A KHFTAY + G D ++ +S D+ DT+ + M +++G V
Sbjct: 238 DANGYPKVLAYLKHFTAYSREEGRGNDDYN----ISMYDLFDTYLPQYEMGMVQGGATGV 293
Query: 253 MCSYNQVNGVPTCADPNILKRTIRGEW-RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAAD 311
MCSYN VNG+P CA+ +L + +R W R + ++ +DC +V + +AAA
Sbjct: 294 MCSYNAVNGIPACANDYLLNKILRQRWNRSDAHVTTDCGAVNNLRGKPIQAADEAQAAAM 353
Query: 312 AIRAGLDLDCGPFLGLHT-ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYG 370
A+ G D++ G L +H +A+ G +E +N A+ + G FD +P+ +
Sbjct: 354 ALMNGADIEMGSTLFVHNLTTAITLGYATEEAVNQAIRRSYRPHFIAGRFD-DPTLSEWF 412
Query: 371 HLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGN 430
LG D+ + HQE+ LEAA QG+VLLK++ LP++ +AV+GP ++ +
Sbjct: 413 SLGLDDIQSKKHQEIQLEAALQGLVLLKHEDSILPIA--AGTKLAVLGPLGMTRSGLMSD 470
Query: 431 YAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADAT--------- 481
Y + I A +I K+ A G +D+ +D
Sbjct: 471 YESDQSCFGGGHDCIPTLAESIGFINGKEFTVA---AAGVDVDSRNTSDVERILQLAADR 527
Query: 482 ---ILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAK 538
+L +G ++ E E DR LPG+Q L V + + P +LVL++GG I +
Sbjct: 528 DLIVLCLGNTKTQEQEGFDRKDTALPGQQYALFEAV-LTLRKPVVLVLVNGGQIALDGMT 586
Query: 539 NDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQ 598
P +AII A P GGTA+A LFG N GKLP T YP + + M + +M
Sbjct: 587 GYP--SAIIEAFNPNGIGGTALAASLFGQENRWGKLPYTIYPYSVMQSFDMKDHSMSAP- 643
Query: 599 SKRYPGRTYRFYKG 612
PGRTYR++ G
Sbjct: 644 ----PGRTYRYFTG 653
>gi|390945417|ref|YP_006409177.1| beta-glucosidase-like glycosyl hydrolase [Alistipes finegoldii DSM
17242]
gi|390421986|gb|AFL76492.1| beta-glucosidase-like glycosyl hydrolase [Alistipes finegoldii DSM
17242]
Length = 771
Score = 278 bits (711), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 208/687 (30%), Positives = 328/687 (47%), Gaps = 94/687 (13%)
Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
GAT+FP AS++N L E +G+V++ E R G + P ++I RDPRW R
Sbjct: 139 GATTFPTAPGQASTWNPELIERMGKVIAAEIRLQ-----GGHICYGPVLDIVRDPRWSRT 193
Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASC-KHFTAYDLDN--WNGVDRFHFNA 225
+E+ GED L+ + +YVRG D + + A S KHF AY NG
Sbjct: 194 EESYGEDCYLTARIGEAYVRGTGSGDLSQSRHALSTLKHFIAYGASEGGQNGGSNL---- 249
Query: 226 KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYI 285
+ ++++ +T+ PF V G SVM +YN V+G+P A+ +L +RGEW +G++
Sbjct: 250 -LGERELRETYLPPFEAAVKAG-ARSVMTAYNSVDGIPCTANRRMLTDILRGEWGFDGFV 307
Query: 286 VSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC--GPFLGLHTESAVQRGLLSEIDI 343
VSD S+ ++T + EAA A+RAG+D D G F L A + G ++E +I
Sbjct: 308 VSDLLSIEGLHETHGVAGSVREAAVQALRAGVDADLKGGAFASLR--EAAEAGDVAEAEI 365
Query: 344 NNALVNTLTVQMRLGMFDGEPSSQPY-GHLGPKDVCTPDHQELALEAARQGIVLLKNQGP 402
+ A+ L ++ +G+F+ PY +V H ELALEAARQ + LL+N+
Sbjct: 366 DRAVERVLALKFEMGLFE-----NPYIDEAAAAEVGCAAHSELALEAARQSVTLLENRSG 420
Query: 403 SLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA---RTIHQQGCKD 459
+LPL R R VAVIGPN+D +G+Y T G+ + R ++ +GC
Sbjct: 421 TLPLDPRRLRRVAVIGPNADNIYNQLGDYTAQQTAANTVRDGLEKLLGRDRVVYSRGCT- 479
Query: 460 VACADDQLFGAAIDASRQADATILVMGLDQSIE-----------------------AEAL 496
V D AA+ A+R DA ++V+G + + E
Sbjct: 480 VRGGDRSEIAAAVSAARGTDAAVVVIGGSSARDFDTEFLQTGAAKAAHDEVRDMECGEGF 539
Query: 497 DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAG 556
DRA L L G Q+EL+ ++ A+ P I+V ++G P+D+ A + A++ A YPG G
Sbjct: 540 DRATLALLGEQEELLRRIK-ATGTPLIVVCIAGRPLDLRRASE--QADALLMAWYPGARG 596
Query: 557 GTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGR-TYRFYKGPVV 615
G A+A+ + G +NP G+LP+T +P E + +K+ P Y +
Sbjct: 597 GDAVAETILGHNNPAGRLPIT---------IPRAEGQIPVYYNKKRPANHDYTDLTAAPL 647
Query: 616 YPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVD 675
YPFG+G+SY+ F +GS+ A SG + L V
Sbjct: 648 YPFGYGLSYSTF-----------------EYGSLEARQSGDNV-----------LEVSCR 679
Query: 676 VKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVV 734
++N ++G + L S A P +QL F ++ + G Q++V + + LS++
Sbjct: 680 IRNTSDREGDEVVQLYISDMVASTVRPPRQLGGFRRIRLAPGEQRQVSFTLG-DEALSLI 738
Query: 735 DRSGTRRIPLGEHNIHIGGTKHSVSLH 761
D G R + G+ I +G + + L
Sbjct: 739 DPQGRRVVEKGDFVIAVGSSSQDIRLQ 765
>gi|402307522|ref|ZP_10826545.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
sp. MSX73]
gi|400378572|gb|EJP31427.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
sp. MSX73]
Length = 858
Score = 278 bits (710), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 174/457 (38%), Positives = 250/457 (54%), Gaps = 30/457 (6%)
Query: 35 ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
AT + LP+C L +R DL+ RL+L+EK +L++ + A+PRLGIK + WWSEALHG
Sbjct: 17 ATAQLLPYCNPDLSARERARDLLSRLTLEEKARLMLDESPAIPRLGIKKFFWWSEALHGA 76
Query: 95 SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY-----NGGTA- 148
+N+G G T FP+ + A+SFN L + SDE RA Y NGG
Sbjct: 77 ANMG----------GVTVFPEPVGMAASFNDGLLRRVFDAASDEMRAQYNRRMLNGGEDE 126
Query: 149 ---GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASC- 204
L+ W+PNVNIFRDPRWGRGQET GEDP L+ + VRGLQG + + + +C
Sbjct: 127 KFHSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSVMGTAVVRGLQGPETAKYRKLWACA 186
Query: 205 KHFTAYDLDNWNGVDRFHFN-AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVP 263
KH+ + + R N A VS +D+ +T+ F+ V E KV VMC+Y +++ P
Sbjct: 187 KHYAVHSGPEYT---RHTANVADVSPRDLWETYLPAFKTLVTEAKVREVMCAYQRLDDDP 243
Query: 264 TCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGP 323
C++ +L++ +R EW N +VSDC +V Y +S AAA A AG D++CG
Sbjct: 244 CCSNNRLLQQILRDEWGFNYLVVSDCGAVTDIYANHKTSSDAVHAAAKAAVAGTDVECGF 303
Query: 324 FLGLHT-ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
T AV+RGL++E +++ ++ L + LG D +P + + + + H
Sbjct: 304 GYAYKTIPEAVRRGLITEAEVDKHVLRLLEGRFDLGEMD-DPKLVEWSKIPASVMDSKAH 362
Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
++LAL+ ARQ +VLL+N+G LPL +AVIGPN+D M GNY G T L
Sbjct: 363 RQLALDMARQSLVLLQNKGGVLPLK-AGGEPIAVIGPNADDGPMMWGNYNGTPNRTVTIL 421
Query: 443 QGI-GRYARTIHQQGCK--DVACADDQLFGAAIDASR 476
GI R+ R + +GC D + L AID +
Sbjct: 422 DGIKARHKRVTYLKGCDLTDTKTVNSLLPQCAIDGRK 458
Score = 99.0 bits (245), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 84/299 (28%), Positives = 129/299 (43%), Gaps = 54/299 (18%)
Query: 482 ILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGP 531
+ V G+ ++E E + DR + LP Q++ + + A K + V SG
Sbjct: 603 VFVGGISAALEGEEMPVDIDGFKGGDRTNIELPKVQRDFLRALHEAGK-TVVFVNCSGSA 661
Query: 532 IDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTE 591
I A AI+ A Y GQ GGTA++D+LFGT NP GKLP+T+Y + LP E
Sbjct: 662 I--ALEPEMETCDAILQAWYAGQEGGTAVSDVLFGTVNPSGKLPVTFYKR--TDQLPDYE 717
Query: 592 -MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSIN 650
+MR GRTYR++ P ++ FG+G+SYT F R G
Sbjct: 718 DYSMR--------GRTYRYFSDP-LFAFGYGLSYTTF-----------------RFGR-- 749
Query: 651 ATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEK 710
HA+ + V + N G++ G + V+ A P K L AF +
Sbjct: 750 ----------AHAEAAEGGYRLSVPLTNTGTRPGEEVVQVYIRRVADTNGPLKSLRAFRR 799
Query: 711 VHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIK 769
V + AG V I + + + + T R G++ + G + L T+ V++
Sbjct: 800 VALKAGESTTVEIPLSRKSFECFDESTNTMRTLPGDYELMYGNSSRPEDLQTLTVSVVE 858
>gi|224536538|ref|ZP_03677077.1| hypothetical protein BACCELL_01413 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521794|gb|EEF90899.1| hypothetical protein BACCELL_01413 [Bacteroides cellulosilyticus
DSM 14838]
Length = 863
Score = 277 bits (709), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 168/432 (38%), Positives = 237/432 (54%), Gaps = 32/432 (7%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
+ SL +R L+ L+L+EK L++ G+ +V RLGIK Y WW+EALHGV+ G
Sbjct: 23 YKDASLSPERRAELLVKELTLEEKAHLMMDGSRSVERLGIKPYNWWNEALHGVARAGL-- 80
Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA--MYNGGT------AGLTYW 153
AT FPQ I A+SFN + + VSDEARA Y GLT W
Sbjct: 81 --------ATVFPQPIGMAASFNPEMVYEVFNAVSDEARAKNTYYASQDSRERYQGLTMW 132
Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL 212
+P VNI+RDPRWGRG ET GEDP L+ + V+GLQG +DG K+ A KHF +
Sbjct: 133 TPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGPADGKYDKLHACAKHFAVHSG 192
Query: 213 DNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
WN R FNA+ + +D+ +T+ PF V EGKV VMC+YN+ G P C +L
Sbjct: 193 PEWN---RHSFNAENIKPRDLYETYLPPFEALVKEGKVEEVMCAYNRFEGDPCCGSDRLL 249
Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCGPFLGLHT 329
+ +RGEW +G +VSDC ++ +Y+ + + P E A+A A+ +G DL+CG
Sbjct: 250 MQILRGEWGFDGIVVSDCGAIADFYNDRGHHTHPDAESASAAAVISGTDLECGSSYKALI 309
Query: 330 ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEA 389
ES V++GL+SE ++ ++ + + LG D EP + + V + H LAL
Sbjct: 310 ES-VKKGLISEETVDTSVKRLMKARFALGEMD-EPEKVSWTKIPFSVVASAAHDSLALNM 367
Query: 390 ARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI---- 445
AR+ + LL N+ LPL TVAV+GPN++ +V GNY G+ T L G+
Sbjct: 368 ARESMTLLMNKDNFLPLKR-GGLTVAVMGPNANDSVMQWGNYNGMPAHTVTILDGVRNLL 426
Query: 446 GRYARTIHQQGC 457
G + I++QGC
Sbjct: 427 GTDDKLIYEQGC 438
Score = 131 bits (329), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 100/344 (29%), Positives = 156/344 (45%), Gaps = 48/344 (13%)
Query: 434 IACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEA 493
+ G + L+ Y R+ Q D+ D +++ + AD I G+ S+E
Sbjct: 557 VQAGKSYDLELDFEYLRSDAQLNF-DLGFKKDVDIRKSVERVKDADIVIFASGISPSLEG 615
Query: 494 EAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRI 543
E + DR + LP Q+EL+ + A K ILV SG PI +
Sbjct: 616 EEMGVNLPGFKKGDRTDIELPAVQRELIDALHRAGK-KIILVNCSGSPI--GLEPETQKC 672
Query: 544 AAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYP 603
AI+ A YPGQ GG A+A++LFG NP GKLP+T+Y ++ LP E
Sbjct: 673 EAILQAWYPGQQGGKAVAEVLFGDYNPAGKLPVTFYRN--VSQLPDFE-------DYNMT 723
Query: 604 GRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHA 663
GRTYR+ + ++PFG+G+SYT F + TV+ N +G+++K+T
Sbjct: 724 GRTYRYMQDVPLFPFGYGLSYTTFGY----GKTVL---------DKNELTAGQSLKLT-- 768
Query: 664 KCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGI 723
V V N G ++G + V+ P K L AF++V +PAG V
Sbjct: 769 ----------VPVTNTGKRNGEEVVQVYLRKQGDAEGPIKTLRAFKRVSIPAGKTVNVEF 818
Query: 724 NIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGV 767
++ + D+S T R+ G ++I +GG+ L T+ +
Sbjct: 819 DLKDKELEWWDDQSNTVRVCPGNYDIMVGGSSKEEDLQRTTIAI 862
>gi|325103214|ref|YP_004272868.1| glycoside hydrolase family protein [Pedobacter saltans DSM 12145]
gi|324972062|gb|ADY51046.1| glycoside hydrolase family 3 domain protein [Pedobacter saltans DSM
12145]
Length = 866
Score = 277 bits (709), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 167/439 (38%), Positives = 241/439 (54%), Gaps = 32/439 (7%)
Query: 35 ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
A + PF LP +RV+DL+ RL+++EKV L+ + + RLGIK Y WW+EALHGV
Sbjct: 19 AQNKLYPFQDNRLPFDKRVDDLLQRLTVEEKVLLMQDVSRPIERLGIKQYNWWNEALHGV 78
Query: 95 SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GG 146
+ G AT FPQ I A+SF+ + VSDEARA +N G
Sbjct: 79 ARAGL----------ATVFPQPIGMAASFDRDALFNVFNAVSDEARAKHNYHLSQGSYGR 128
Query: 147 TAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCK 205
GLT W+P +NIFRDPRWGRG ET GEDP L+ V+GLQG S+G K+ A K
Sbjct: 129 YEGLTMWTPTINIFRDPRWGRGIETYGEDPYLTAVMGVQAVKGLQGPSNGKYDKLHACAK 188
Query: 206 HFTAYDLDNWNGVDRFHFN-AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
HF + WN R F+ A + ++D+ +T+ F V E KV VMC+YN+ G P
Sbjct: 189 HFAVHSGPEWN---RHSFDAANIKQRDLYETYLPAFEALVKEAKVQEVMCAYNRFEGDPC 245
Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPE--EAAADAIRAGLDLDCG 322
C +L++ +R +W G +V+DC ++ ++ + P+ A+A A+ +G DLDCG
Sbjct: 246 CGSDRLLQQILRKKWGFEGIVVADCGAIADFFKENAHKTHPDAASASAAAVYSGTDLDCG 305
Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
TE AV++GL+ E DI+ ++ L + RLG D + S P+ + V + H
Sbjct: 306 SSYKALTE-AVKKGLIEEKDIDVSVRRLLMARFRLGEMDDQ-SLVPWSKISYNVVASKAH 363
Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
++AL+ AR+ I LL+N+ LPL + +AV+GPN+ +V GNY G T L
Sbjct: 364 NQIALDMARKSITLLQNKNNILPLKSGGLK-IAVMGPNAQDSVMQWGNYNGTPANTITIL 422
Query: 443 QGI----GRYARTIHQQGC 457
+GI G + I++Q C
Sbjct: 423 EGIKAALGPKDKLIYEQAC 441
Score = 110 bits (275), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 87/319 (27%), Positives = 135/319 (42%), Gaps = 47/319 (14%)
Query: 459 DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQ 508
D+ ++ +I AD + V G+ S+E E + DR + LP Q+
Sbjct: 584 DIGYKEEANINKSIKNIAGADLVVFVGGISPSLEGEEMGVKLPGFRGGDRTDIQLPTIQR 643
Query: 509 ELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTS 568
+ V + A K I + SG PI +A + AI+ A YPGQAGG A+AD+LFG
Sbjct: 644 QFVKALKEAGK-RVIFINCSGSPIGLADEMANSE--AIVQAWYPGQAGGQAVADVLFGKY 700
Query: 569 NPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFV 628
NP G+LP+T+Y T LP E + GRTYR+ + ++PFG+G+SYT F
Sbjct: 701 NPSGRLPITFYRD--TTQLPDFE-------NYDMAGRTYRYMQDKPLFPFGYGLSYTQFQ 751
Query: 629 HTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTL 688
+ G I N T+ + V V N G + G +
Sbjct: 752 Y-------------------------GNPILNQQVITNGQTIQLTVPVTNTGKRSGDEVV 786
Query: 689 LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHN 748
V+ P K L F ++ AG Q+V I + ++S ++ G++
Sbjct: 787 QVYLRKKGDATGPVKTLRDFRRLSFNAGQTQQVVFKITPKQLEWWNEQSKAMQVQSGDYE 846
Query: 749 IHIGGTKHSVSLHAATLGV 767
+ +G + L T +
Sbjct: 847 LLVGKSSADEDLKMVTFTI 865
>gi|334365132|ref|ZP_08514098.1| glycosyl hydrolase family 3 N-terminal domain protein [Alistipes
sp. HGB5]
gi|313158675|gb|EFR58064.1| glycosyl hydrolase family 3 N-terminal domain protein [Alistipes
sp. HGB5]
Length = 771
Score = 277 bits (709), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 207/687 (30%), Positives = 328/687 (47%), Gaps = 94/687 (13%)
Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
GAT+FP AS++N L E +G+V++ E R G + P ++I RDPRW R
Sbjct: 139 GATTFPTAPGQASTWNPELIERMGKVIAAEIRLQ-----GGHICYGPVLDIVRDPRWSRT 193
Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASC-KHFTAYDLDN--WNGVDRFHFNA 225
+E+ GED L+ + +YVRG D + + A S KHF AY NG
Sbjct: 194 EESYGEDCYLTARIGEAYVRGTGSGDLSQSRHALSTLKHFIAYGASEGGQNGGSNL---- 249
Query: 226 KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYI 285
+ ++++ +T+ PF V G SVM +YN V+G+P A+ +L +RGEW +G++
Sbjct: 250 -LGERELRETYLPPFEAAVKAG-ARSVMTAYNSVDGIPCTANRRMLTDILRGEWGFDGFV 307
Query: 286 VSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC--GPFLGLHTESAVQRGLLSEIDI 343
VSD S+ ++T + EAA A+RAG+D D G F L A + G ++E +I
Sbjct: 308 VSDLLSIEGLHETHGVAGSVREAAVQALRAGVDADLKGGAFASLR--EAAEAGDVAEAEI 365
Query: 344 NNALVNTLTVQMRLGMFDGEPSSQPY-GHLGPKDVCTPDHQELALEAARQGIVLLKNQGP 402
+ A+ L ++ +G+F+ PY +V H ELALEAARQ + LL+N+
Sbjct: 366 DRAVERVLALKFEMGLFE-----NPYIDEAAAAEVGCAAHSELALEAARQSVTLLENRSG 420
Query: 403 SLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA---RTIHQQGCKD 459
+LPL R R VAVIGPN+D +G+Y T G+ + R ++ +GC
Sbjct: 421 TLPLDPRRLRRVAVIGPNADNIYNQLGDYTAQQTAANTVRDGLEKLLGRDRVVYSRGCT- 479
Query: 460 VACADDQLFGAAIDASRQADATILVMGLDQSIE-----------------------AEAL 496
V D AA+ A+R DA ++V+G + + E
Sbjct: 480 VRGGDRSEIAAAVSAARGTDAAVVVIGGSSARDFDTEFLQTGAAKAAHDEVRDMECGEGF 539
Query: 497 DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAG 556
DRA L L G Q+EL+ ++ A+ P I+V ++G P+D+ A + A++ A YPG G
Sbjct: 540 DRATLALLGEQEELLRRIK-ATGTPLIVVCIAGRPLDLRRASE--QADALLMAWYPGARG 596
Query: 557 GTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGR-TYRFYKGPVV 615
G A+A+ + G +NP G+LP+T +P E + +K+ P Y +
Sbjct: 597 GDAVAETILGRNNPAGRLPIT---------IPRAEGQIPVYYNKKRPANHDYTDLTAAPL 647
Query: 616 YPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVD 675
YPFG+G+SY+ F +GS+ A SG + L V
Sbjct: 648 YPFGYGLSYSTF-----------------EYGSLEARQSGDNV-----------LEVSCR 679
Query: 676 VKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVV 734
++N ++G + L S A P +QL F ++ + G Q++V + + L+++
Sbjct: 680 IRNTSDREGDEVVQLYISDMVASTVRPPRQLGGFRRIRLAPGEQRQVSFTLG-DEALALI 738
Query: 735 DRSGTRRIPLGEHNIHIGGTKHSVSLH 761
D G R + G+ I +G + + L
Sbjct: 739 DPQGRRVVEKGDFVIAVGSSSQDIRLQ 765
>gi|325918730|ref|ZP_08180824.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
ATCC 35937]
gi|325535054|gb|EGD06956.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
ATCC 35937]
Length = 391
Score = 277 bits (709), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 156/368 (42%), Positives = 216/368 (58%), Gaps = 30/368 (8%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
QR L+ ++S EKV ++ A A+PRL I YEWWSE LHG++ G A
Sbjct: 34 QRAAALVAQMSRDEKVAQAMNDAPAIPRLDIPAYEWWSEGLHGIARNGY----------A 83
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--GG-------TAGLTYWSPNVNIFR 161
T FPQ I A+S+N L + +G VVS EARA +N GG AGLT WSPN+NIFR
Sbjct: 84 TVFPQAIGLAASWNTALMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFR 143
Query: 162 DPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD-- 219
DPRWGRG ET GEDP L+G+ A ++RGLQG D + + A+ KH + +G +
Sbjct: 144 DPRWGRGMETYGEDPFLTGQLAVGFIRGLQGDDLNHPRTIATPKHIAVH-----SGPEPG 198
Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
R F+ VS +D+E T+ FR +++G+ SVMC+YN ++G P CA +L +RG+W
Sbjct: 199 RHGFDVDVSPRDMEATYTPAFRAALVDGQAWSVMCAYNSLHGTPACAADWLLNGRVRGDW 258
Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLL 338
G++VSDCD+V +F ++A A++AG DL+CG + L T A++RG +
Sbjct: 259 GFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAYRELGT--AIERGEV 316
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
E ++ +LV + RLG + P PY LG KDV H+ LAL+AA + IVLLK
Sbjct: 317 DEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRALALQAAAESIVLLK 375
Query: 399 NQGPSLPL 406
N +LPL
Sbjct: 376 NTATTLPL 383
>gi|288925400|ref|ZP_06419334.1| beta-glucosidase [Prevotella buccae D17]
gi|288337871|gb|EFC76223.1| beta-glucosidase [Prevotella buccae D17]
Length = 858
Score = 277 bits (708), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 174/457 (38%), Positives = 250/457 (54%), Gaps = 30/457 (6%)
Query: 35 ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
AT + LP+C L +R DL+ RL+L+EK +L++ + A+PRLGIK + WWSEALHG
Sbjct: 17 ATAQLLPYCNPDLSARERARDLLSRLTLEEKARLMLDESPAIPRLGIKKFFWWSEALHGA 76
Query: 95 SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY-----NGGTA- 148
+N+G G T FP+ + A+SFN L + SDE RA Y NGG
Sbjct: 77 ANMG----------GVTVFPEPVGMAASFNDGLLRRVFDAASDEMRAQYNRRMLNGGEDE 126
Query: 149 ---GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASC- 204
L+ W+PNVNIFRDPRWGRGQET GEDP L+ + VRGLQG + + + +C
Sbjct: 127 KFHSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSVMGTAVVRGLQGPETAKYRKLWACA 186
Query: 205 KHFTAYDLDNWNGVDRFHFN-AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVP 263
KH+ + + R N A VS +D+ +T+ F+ V E KV VMC+Y +++ P
Sbjct: 187 KHYAVHSGPEYT---RHTANVADVSPRDLWETYLPAFKTLVTEAKVREVMCAYQRLDDDP 243
Query: 264 TCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGP 323
C++ +L++ +R EW N +VSDC +V Y +S AAA A AG D++CG
Sbjct: 244 CCSNNRLLQQILRDEWGFNYLVVSDCGAVTDIYANHKTSSDAVHAAAKAAVAGTDVECGF 303
Query: 324 FLGLHT-ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
T AV+RGL++E +++ ++ L + LG D +P + + + + H
Sbjct: 304 GYAYKTIPEAVRRGLITEAEVDKHVLRLLEGRFDLGEMD-DPKLVEWSKIPASVMDSKAH 362
Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
++LAL+ ARQ +VLL+N+G LPL +AVIGPN+D M GNY G T L
Sbjct: 363 RQLALDMARQSLVLLQNKGGVLPLK-AGGEPIAVIGPNADDGPMMWGNYNGTPNRTVTIL 421
Query: 443 QGIG-RYARTIHQQGCK--DVACADDQLFGAAIDASR 476
GI R+ R + +GC D + L AID +
Sbjct: 422 NGIKVRHKRVTYLKGCDLTDTKTVNSLLPQCAIDGRK 458
Score = 98.6 bits (244), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 83/299 (27%), Positives = 130/299 (43%), Gaps = 54/299 (18%)
Query: 482 ILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGP 531
+ V G+ ++E E + DR + LP Q++ + + A K + V SG
Sbjct: 603 VFVGGISAALEGEEMPVDIDGFKGGDRTNIELPKVQRDFLRALHEAGK-TVVFVNCSGSA 661
Query: 532 IDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTE 591
I A AI+ A Y GQ GGTA++D+LFGT NP GKLP+T+Y + LP E
Sbjct: 662 I--ALEPEMETCDAILQAWYAGQEGGTAVSDVLFGTVNPSGKLPVTFYKR--TDQLPDYE 717
Query: 592 -MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSIN 650
+MR GRTYR++ P ++ FG+G+SYT F R G
Sbjct: 718 DYSMR--------GRTYRYFSDP-LFAFGYGLSYTTF-----------------RFGRAR 751
Query: 651 ATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEK 710
A + +++ V + N G++ G + V+ A P K L AF +
Sbjct: 752 AEAAEGGYRLS------------VPLTNTGTRPGEEVVQVYIRRVADTNGPLKSLRAFRR 799
Query: 711 VHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIK 769
V + AG V I + + + + T R G++ + G + L T+ V++
Sbjct: 800 VALKAGESTTVEIPLSRKSFECFDESTNTMRTLPGDYELMYGNSSRPEDLQTLTVSVVE 858
>gi|170731072|ref|YP_001776505.1| beta-glucosidase [Xylella fastidiosa M12]
gi|167965865|gb|ACA12875.1| Beta-glucosidase [Xylella fastidiosa M12]
Length = 882
Score = 277 bits (708), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 165/407 (40%), Positives = 235/407 (57%), Gaps = 32/407 (7%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
Q L+ +++ QEK+ ++ A A+PRLGI Y+WWSE LHG++ G A
Sbjct: 32 QHAAALVAQMTRQEKIAQTMNAAPAIPRLGIPAYDWWSEGLHGIARNG----------YA 81
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--GGT-------AGLTYWSPNVNIFR 161
T FPQ I A+S+N L + +G V S EARA +N GG AGLT WSPN+NIFR
Sbjct: 82 TVFPQAIGLAASWNTDLLQHVGTVTSTEARAKFNLTGGPGKDHPRYAGLTLWSPNINIFR 141
Query: 162 DPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD-- 219
DPRWGRG ET GEDP L+ + A S++RGLQG+ D + A+ KHF + +G +
Sbjct: 142 DPRWGRGMETYGEDPYLTSQLAVSFIRGLQGNIPDHPRTIATPKHFAVH-----SGPEPG 196
Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
R F+ VS D+E T+ FR +++G SVMC+YN ++G P CA +L +R +W
Sbjct: 197 RHSFDVDVSAYDLEATYTPAFRAAIVDGHAGSVMCAYNALHGTPACASDWLLNTRLRNDW 256
Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLL 338
NG++VSDCD++ F A+A A+++G DL+CG + L+ A+ RG +
Sbjct: 257 GFNGFVVSDCDAIEDMTRFHFFRQDNASASAAALKSGDDLNCGNTYRDLN--QAIARGDI 314
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
E ++ AL+ T + RLG PY +G K + TP H+ LAL+AA Q +VLLK
Sbjct: 315 DESTLDQALIRLFTARQRLGTLQPR-EHDPYAAIGIKHIDTPAHRALALQAAAQSLVLLK 373
Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
N G +LPL+ T+AV+GP++D + NY G + TPL G+
Sbjct: 374 NSGNTLPLT--PGTTLAVLGPDADSLTALEANYQGTSSTPVTPLIGL 418
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 95/290 (32%), Positives = 139/290 (47%), Gaps = 48/290 (16%)
Query: 474 ASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTI 523
A ADA + +GL +E E L DR + LP Q+ L+ V K P I
Sbjct: 607 AVAHADAIVAFVGLSPEVEGEELHIDTPGFSGGDRTTIDLPATQETLLQHVKTTGK-PLI 665
Query: 524 LVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEY 583
+VLMSG + + +A++ AI+ A YPGQ+GGTAIA L G NPGG+LPMT+Y
Sbjct: 666 VVLMSGSAVALNWAQH--HADAILAAWYPGQSGGTAIAQALAGDVNPGGRLPMTFY--RS 721
Query: 584 ITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLD 643
+LP P S GRTYR++KG +YPFG+G+SYT F + T D
Sbjct: 722 TQDLP-------PYISYDMTGRTYRYFKGQPLYPFGYGLSYTQFAYEAPQLSTATLKAGD 774
Query: 644 GRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHK 703
TL V V+N G++ G + ++ PP AP +
Sbjct: 775 -------------------------TLTVTAHVRNTGTRAGDEVVQLYLEPPHSPQAPLR 809
Query: 704 QLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
LV F++V + G + + + + LS V ++G R + G +++ +GG
Sbjct: 810 NLVGFKRVTLRPGESRLLTFTLD-ARQLSSVQQTGQRSVEAGHYHLFVGG 858
>gi|395492941|ref|ZP_10424520.1| glycoside hydrolase family protein [Sphingomonas sp. PAMC 26617]
Length = 865
Score = 277 bits (708), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 168/411 (40%), Positives = 242/411 (58%), Gaps = 33/411 (8%)
Query: 48 PIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDF 107
PI RV+DL+ R++L+EK + + A A+PRLGI Y++W+EALHGV+ G
Sbjct: 20 PIEARVDDLMRRMTLEEKAAQMQNVAPAIPRLGIPPYDYWNEALHGVARAGE-------- 71
Query: 108 PGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNI 159
AT FPQ I A++++ + A G+ V+ E RA YN A GLT+WSPN+NI
Sbjct: 72 --ATVFPQAIGMAATWDRDMMLAEGQTVATEGRAKYNQAQAQKNYDRYYGLTFWSPNINI 129
Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD 219
FRDPRWGRGQET GEDP L+G A +V G+QG+D + LK A+ KHF + +G +
Sbjct: 130 FRDPRWGRGQETLGEDPYLTGTMAVPFVHGVQGTDANYLKAIATPKHFAVH-----SGPE 184
Query: 220 --RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
R FN S +D+ +T+ FR +++G+ S+MC+YN V+ CA+ +LK T+RG
Sbjct: 185 QLRHQFNVDPSPRDLSETYLPAFRRAIVDGRAESLMCAYNAVDTKAACANTMLLKDTLRG 244
Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLG--LHTESAVQR 335
W G++ SDC ++ H + T E AA A++AG D C F L AV+
Sbjct: 245 AWGFKGFVTSDCGAIDDITTGHHNSPTNPEGAALAVKAGTDTGCD-FKDEMLDLPRAVKA 303
Query: 336 GLLSEIDINNALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAARQGI 394
G L+E D++ AL T +M+LGMFD P+++ P+ + + +P H+ LAL AAR+ I
Sbjct: 304 GYLTEGDMDVALRRLFTARMKLGMFD--PAARVPFSTISIAENHSPAHRALALRAARESI 361
Query: 395 VLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
VLLKN G LPL+ R +AV+GP + + + GNY G G P+ G+
Sbjct: 362 VLLKNDG-VLPLA-AGARRIAVVGPTAASLIALEGNYNGTPVGAVLPVDGM 410
Score = 105 bits (262), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 74/241 (30%), Positives = 114/241 (47%), Gaps = 47/241 (19%)
Query: 486 GLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
GL+ +E E + DR + LP Q +L+ + A+ P ++VL SG I A
Sbjct: 602 GLNAWLEGEEMPLQVPGFAGGDRTAIALPAAQSQLLDAL-FATGKPLVIVLQSGSAI--A 658
Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
+ + A++ A YPG+AGG AIA++L GT NP G+LP+T+Y LP +
Sbjct: 659 LGAQEAKARAVLEAWYPGEAGGQAIAEVLSGTVNPSGRLPVTFYAS--TDQLPAFD---- 712
Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG 655
R RTYR++ G V YPFGHG+SYT F ++ T ++ +G
Sbjct: 713 ---DYRMANRTYRYFAGRVEYPFGHGLSYTRFAYSALRPAT-------------SSVAAG 756
Query: 656 KAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPA 715
+ V+ V V+N G G ++ + P AP + L +++VH+ A
Sbjct: 757 QGTSVS------------VAVRNTGVLAGDEVAQLYLSVPGREGAPIRSLKGYQRVHLAA 804
Query: 716 G 716
G
Sbjct: 805 G 805
>gi|423223593|ref|ZP_17210062.1| hypothetical protein HMPREF1062_02248 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392638218|gb|EIY32065.1| hypothetical protein HMPREF1062_02248 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 863
Score = 276 bits (707), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 168/432 (38%), Positives = 237/432 (54%), Gaps = 32/432 (7%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
+ SL +R L+ L+L+EK L++ G+ +V RLGIK Y WW+EALHGV+ G
Sbjct: 23 YKDASLSPERRAELLVKELTLEEKAHLMMDGSRSVERLGIKPYNWWNEALHGVARAGL-- 80
Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA--MYNGGT------AGLTYW 153
AT FPQ I A+SFN + + VSDEARA Y GLT W
Sbjct: 81 --------ATVFPQPIGMAASFNPEMVYEVFNAVSDEARAKNTYYASQDSRERYQGLTMW 132
Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL 212
+P VNI+RDPRWGRG ET GEDP L+ + V+GLQG +DG K+ A KHF +
Sbjct: 133 TPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGPADGKYDKLHACAKHFAVHSG 192
Query: 213 DNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
WN R FNA+ + +D+ +T+ PF V EGKV VMC+YN+ G P C +L
Sbjct: 193 PEWN---RHSFNAENIKPRDLYETYLPPFEALVKEGKVEEVMCAYNRFEGDPCCGSDRLL 249
Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCGPFLGLHT 329
+ +RGEW +G +VSDC ++ +Y+ + + P E A+A A+ +G DL+CG
Sbjct: 250 MQILRGEWGFDGIVVSDCGAIADFYNDRGHHTHPDAESASAAAVISGTDLECGSSYKALI 309
Query: 330 ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEA 389
ES V++GL+SE ++ ++ + + LG D EP + + V + H LAL
Sbjct: 310 ES-VKKGLISEETVDTSVKRLMKARFALGEMD-EPEKVSWTKIPFSVVASAAHDSLALNM 367
Query: 390 ARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI---- 445
AR+ + LL N+ LPL TVAV+GPN++ +V GNY G+ T L G+
Sbjct: 368 ARESMTLLMNKDNFLPLKR-GGLTVAVMGPNANDSVMQWGNYNGMPAHTVTILDGVRNLL 426
Query: 446 GRYARTIHQQGC 457
G + I++QGC
Sbjct: 427 GTDDKLIYEQGC 438
Score = 130 bits (328), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 95/319 (29%), Positives = 147/319 (46%), Gaps = 47/319 (14%)
Query: 459 DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQ 508
D+ D +++ + AD I G+ S+E E + DR + LP Q+
Sbjct: 581 DLGFKKDVDIRKSVERVKDADIVIFASGISPSLEGEEMGVNLPGFKKGDRTDIELPAVQR 640
Query: 509 ELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTS 568
EL+ + A K ILV SG PI + AI+ A YPGQ GG A+A++LFG
Sbjct: 641 ELIDALHRAGK-KIILVNCSGSPI--GLEPETQKCEAILQAWYPGQQGGKAVAEVLFGDY 697
Query: 569 NPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFV 628
NP GKLP+T+Y ++ LP E GRTYR+ + ++PFG+G+SYT F
Sbjct: 698 NPAGKLPVTFYRN--VSQLPDFE-------DYNMTGRTYRYMQDVPLFPFGYGLSYTTFG 748
Query: 629 HTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTL 688
+ TV+ N +G+++K+T V V N G ++G +
Sbjct: 749 Y----GKTVL---------DKNELTAGQSLKLT------------VPVTNTGKRNGEEVV 783
Query: 689 LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHN 748
V+ P K L AF++V +PAG V ++ + D+S T R+ G ++
Sbjct: 784 QVYLRKQGDAEGPIKTLRAFKRVSIPAGKTVNVEFDLKDKELEWWDDQSNTVRVCPGNYD 843
Query: 749 IHIGGTKHSVSLHAATLGV 767
I +GG+ L T+ +
Sbjct: 844 IMVGGSSKEEDLQRTTIAI 862
>gi|354580734|ref|ZP_08999639.1| glycoside hydrolase family 3 domain protein [Paenibacillus lactis
154]
gi|353203165|gb|EHB68614.1| glycoside hydrolase family 3 domain protein [Paenibacillus lactis
154]
Length = 766
Score = 276 bits (707), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 224/683 (32%), Positives = 344/683 (50%), Gaps = 93/683 (13%)
Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
GAT FP +T S++N L+ ++ R V+ E R+ GG A +SP +++ RDPRWGR
Sbjct: 123 GATVFPVPLTIGSTWNPELFRSMCRAVAAETRS--QGGAA---TYSPVLDVVRDPRWGRT 177
Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDRL----KVAASCKHFTAYDLDNWNGVDRFHFN 224
+ET GEDP L ++A + V+GLQG DRL + A+ KHF Y R
Sbjct: 178 EETFGEDPHLVAEFAVAAVQGLQG---DRLDAEDSLLATLKHFAGYGASEGG---RNGAP 231
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+ +++ + +PFR V E SVM +YN+++GVP + +L +R W +G+
Sbjct: 232 VHMGLRELHEIDLLPFRKAV-EAGAQSVMTAYNEIDGVPCTSSRYLLHDVLREAWGFDGF 290
Query: 285 IVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLD-CGPFLGLHTESAVQRGLLSEIDI 343
+++DC ++ + + ++ EEAAA A+ AG+D++ G ++ A+++G ++E D+
Sbjct: 291 VITDCGAIDMLKSGHNTAASGEEAAAQALTAGVDMEMSGSMFRVYLRQALEQGHITEDDL 350
Query: 344 NNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
N A+ L ++ RLG+FD P + P K + +H ELA A +GIVLLKN+G
Sbjct: 351 NTAVGRVLAMKFRLGLFD-RPYTDP--ERAEKVIGCEEHIELARRVAAEGIVLLKNEGNV 407
Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIA--CGYTTPLQGIGRY------ARTIHQQ 455
LPL+ + +AVIGPN++ +G+Y T L+GI R+ R ++
Sbjct: 408 LPLNP-KTGKIAVIGPNANAPYNQLGDYTSPQPPGQIITVLEGIRRHIGEDADTRVLYAP 466
Query: 456 GCK-----------DVAC-ADDQLFGAAIDASRQAD---ATI-------LVMGLDQS-IE 492
GC+ +AC A+ + AI S D TI +V GL QS +E
Sbjct: 467 GCRIQGDSREGLSHALACAAEADVIVMAIGGSSARDFGEGTIDLRTGASVVTGLAQSDME 526
Query: 493 -AEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGY 551
E +DR+ L L G Q EL+ ++ K P ++V ++G PI + D I AI+ A Y
Sbjct: 527 CGEGIDRSTLHLMGVQLELLQEIHKLGK-PVVVVYINGRPITEPWI--DEHIPAILEAWY 583
Query: 552 PGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYK 611
PGQ GG+AIADILFG NP G+L +T P+E + LP+ +KR G+ Y
Sbjct: 584 PGQEGGSAIADILFGDVNPSGRLTLT-IPKE-VGQLPIN------YNAKRTRGKRYLETD 635
Query: 612 GPVVYPFGHGMSYTNFVH-TVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTL 670
YPFG+G+SYT+F + ++ P V +P DG +
Sbjct: 636 LEPRYPFGYGLSYTDFHYGNLSVEPAV--IPADG------------------------SA 669
Query: 671 GVQVDVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCK 729
V++ V N G +DGA + L S A P K L AF KV + AG + V + +
Sbjct: 670 AVRIVVTNTGPRDGAEVVQLYVSDLAASVTRPEKALKAFSKVFLKAGESREVTFTVG-PE 728
Query: 730 YLSVVDRSGTRRIPLGEHNIHIG 752
L ++ + GE I +G
Sbjct: 729 QLELIGPDMKAVVEPGEFRIRVG 751
>gi|423248809|ref|ZP_17229825.1| hypothetical protein HMPREF1066_00835 [Bacteroides fragilis
CL03T00C08]
gi|423253758|ref|ZP_17234689.1| hypothetical protein HMPREF1067_01333 [Bacteroides fragilis
CL03T12C07]
gi|392655387|gb|EIY49030.1| hypothetical protein HMPREF1067_01333 [Bacteroides fragilis
CL03T12C07]
gi|392657750|gb|EIY51381.1| hypothetical protein HMPREF1066_00835 [Bacteroides fragilis
CL03T00C08]
Length = 805
Score = 276 bits (707), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 222/689 (32%), Positives = 329/689 (47%), Gaps = 87/689 (12%)
Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
G T FP I AS++N L +GRV++ EA A G G + P +++ RDPRW R
Sbjct: 159 GTTVFPTSIGQASTWNPELIRQMGRVIAIEASA--QGAHIG---YGPVLDLARDPRWSRV 213
Query: 169 QETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
+ET GEDP L+G + VRG QG D V A+ KHF +Y W A +
Sbjct: 214 EETYGEDPYLNGVMGTALVRGFQGETLNDGKSVIATLKHFASY---GWTEGGHNGGTAHI 270
Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
++++E+ PFR V G + SVM SYN+++G P +L ++ W+ G++VS
Sbjct: 271 GERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVS 329
Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE---SAVQRGLLSEIDIN 344
D +VG + EAA A+ AG+D D G ++ E +AV+RG ++ I+
Sbjct: 330 DLYAVGGLRE-HGVAGNDYEAAIKAVNAGVDSDLGT--NVYAEQLVAAVKRGDVAVATID 386
Query: 345 NALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSL 404
A+ L+++ ++G+FD + + V + +H LA E ARQ IVLLKN+ L
Sbjct: 387 KAVRRILSLKFQMGLFDDPFVDEKQ---AVQLVASSEHTGLAREVARQSIVLLKNKDKLL 443
Query: 405 PLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYT--TPLQGI----GRYARTIHQQGCK 458
PL RT+AVIGPN+D M+G+Y T T L GI + R ++ +GC
Sbjct: 444 PLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCA 502
Query: 459 DVACADDQLFGAAIDASRQADATILVMG----LDQSIE-------------------AEA 495
V + F AI+ +R ADA ++VMG D S E E
Sbjct: 503 -VRDSSRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMESGEG 561
Query: 496 LDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQA 555
DRA L L GRQ EL+ ++S K P +LVL+ G P+ + A + AI+ A YPG
Sbjct: 562 YDRATLHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAE--AIVDAWYPGMQ 618
Query: 556 GGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVV 615
GG A+AD+LFG NP G+L ++ P+ + LP+ R RY G
Sbjct: 619 GGNAVADVLFGDYNPAGRLTLS-VPRS-VGQLPVYYNTRRKGNRSRYVEE-----PGTPR 671
Query: 616 YPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVD 675
YPFG+G+SYT F +T ++VT + + V V
Sbjct: 672 YPFGYGLSYTTFSYT------------------------DMKVQVTEG-SDDCRVDVTVT 706
Query: 676 VKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVV 734
++N G+ DG L F + P KQL AF ++H+ AG + V + K L++
Sbjct: 707 IQNQGTADGDEVAQLYFQDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTLD-KKSLALY 765
Query: 735 DRSGTRRIPLGEHNIHIGGTKHSVSLHAA 763
+ G + G I +GG+ ++ A
Sbjct: 766 MQEGEWVVEPGRFTIMVGGSSEDIACRQA 794
>gi|218130696|ref|ZP_03459500.1| hypothetical protein BACEGG_02285 [Bacteroides eggerthii DSM 20697]
gi|217987040|gb|EEC53371.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
eggerthii DSM 20697]
Length = 858
Score = 276 bits (707), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 162/416 (38%), Positives = 248/416 (59%), Gaps = 41/416 (9%)
Query: 48 PIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDF 107
PI +R+ DL+ RL+++EK+ LL + + + RL I Y +EALHGV V PG +F
Sbjct: 35 PIHERIMDLLSRLTVEEKISLLRATSPGISRLDIPKYYHGNEALHGV--VRPG-RF---- 87
Query: 108 PGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG----------LTYWSPNV 157
T FPQ I A+++N L + + V+SDEARA +N G LT+WSP V
Sbjct: 88 ---TVFPQAIGLAATWNPVLQKQVATVISDEARARWNELDQGREQNSQFSDLLTFWSPTV 144
Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
N+ RDPRWGR ET GEDP LSG ++V+GLQG+D LK+ ++ KHF A + ++
Sbjct: 145 NMARDPRWGRTPETYGEDPYLSGIMGTAFVKGLQGNDSRYLKIVSTPKHFAANNEEH--- 201
Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
+RF N ++S++ + + + F CV EGK AS+M +YN +N VP + +L + +R
Sbjct: 202 -NRFVCNPQISEKQLREYYLPAFEACVKEGKSASIMSAYNALNDVPCTLNAWLLTKVLRE 260
Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG------PFLGLHTES 331
+W GY+VSDC + + + T E AA +I+AGLDL+CG P L +
Sbjct: 261 DWGFKGYVVSDCGGPALLVNAHKYVKTKEAAATLSIKAGLDLECGDDVYDAPLL-----N 315
Query: 332 AVQRGLLSEIDINNALVNTLTVQMRLGMFD-GEPSSQPYGHLGPKDVCTPDHQELALEAA 390
A ++ ++++ DI++A L +M+LG+FD GE + PY + PK + + +HQ++AL+AA
Sbjct: 316 AYRQYMVTDADIDSAAYRVLRARMQLGLFDSGE--NNPYTKISPKVIGSKEHQKVALDAA 373
Query: 391 RQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGI-ACGYTTPLQGI 445
R+ IVLLKNQ LPL + +++AV+G N+ + G+Y+G+ + LQGI
Sbjct: 374 RECIVLLKNQNKMLPLDAKKIKSIAVVGINA--GRSEFGDYSGLPVIAPVSILQGI 427
Score = 127 bits (320), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 83/301 (27%), Positives = 146/301 (48%), Gaps = 45/301 (14%)
Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
L+G A R+ + + V+G++++IE E DR+ + LP Q+E + ++ + ++VL
Sbjct: 592 LYGEAGRVVRECEKVVAVLGINKAIEREGQDRSDIQLPADQREFLKEIYKVNPN-IVVVL 650
Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
++G + + + D I AII A YPG++GG A+A++LFG NPGG+LP+T+Y +
Sbjct: 651 VAGSSLSINWM--DEHIPAIINAWYPGESGGKAVAEVLFGDYNPGGRLPLTYYRS--LDE 706
Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
LP + GRTY+++KG V+YPFG+G+SYT+F ++
Sbjct: 707 LPPFD------DYDITKGRTYQYFKGNVLYPFGYGLSYTSF-----------------KY 743
Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQL 705
+ T + + V+ +KNVG G ++ P P K+L
Sbjct: 744 SDLQVTDGNQEVNVSFC------------LKNVGKYAGDEVAQIYVKLPERDKIMPIKEL 791
Query: 706 VAFEKVHVPAGAQQRVGINIH--VCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAA 763
FE++ + G ++V I + + +Y P G++ I IG + + L
Sbjct: 792 KGFERISLKRGESRKVTIRLKKDLLRYWDEEKECFVH--PSGDYTIMIGASSADIRLQKV 849
Query: 764 T 764
+
Sbjct: 850 S 850
>gi|317474225|ref|ZP_07933501.1| glycosyl hydrolase family 3 N terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
gi|316909535|gb|EFV31213.1| glycosyl hydrolase family 3 N terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
Length = 858
Score = 276 bits (707), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 162/416 (38%), Positives = 248/416 (59%), Gaps = 41/416 (9%)
Query: 48 PIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDF 107
PI +R+ DL+ RL+++EK+ LL + + + RL I Y +EALHGV V PG +F
Sbjct: 35 PIHERIMDLLSRLTVEEKISLLRATSPGISRLDIPKYYHGNEALHGV--VRPG-RF---- 87
Query: 108 PGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG----------LTYWSPNV 157
T FPQ I A+++N L + + V+SDEARA +N G LT+WSP V
Sbjct: 88 ---TVFPQAIGLAATWNPVLQKQVATVISDEARARWNELDQGREQNSQFSDLLTFWSPTV 144
Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
N+ RDPRWGR ET GEDP LSG ++V+GLQG+D LK+ ++ KHF A + ++
Sbjct: 145 NMARDPRWGRTPETYGEDPYLSGIMGTAFVKGLQGNDSRYLKIVSTPKHFAANNEEH--- 201
Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
+RF N ++S++ + + + F CV EGK AS+M +YN +N VP + +L + +R
Sbjct: 202 -NRFVCNPQISEKQLREYYLPAFEACVKEGKSASIMSAYNALNDVPCTLNAWLLTKVLRE 260
Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG------PFLGLHTES 331
+W GY+VSDC + + + T E AA +I+AGLDL+CG P L +
Sbjct: 261 DWGFKGYVVSDCGGPALLVNAHKYVKTKEAAATLSIKAGLDLECGDDVYDAPLL-----N 315
Query: 332 AVQRGLLSEIDINNALVNTLTVQMRLGMFD-GEPSSQPYGHLGPKDVCTPDHQELALEAA 390
A ++ ++++ DI++A L +M+LG+FD GE + PY + PK + + +HQ++AL+AA
Sbjct: 316 AYRQYMVTDADIDSAAYRVLRARMQLGLFDSGE--NNPYTKISPKVIGSKEHQKVALDAA 373
Query: 391 RQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGI-ACGYTTPLQGI 445
R+ IVLLKNQ LPL + +++AV+G N+ + G+Y+G+ + LQGI
Sbjct: 374 RECIVLLKNQNKMLPLDAKKIKSIAVVGINA--GRSEFGDYSGLPVIAPVSILQGI 427
Score = 129 bits (325), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 83/301 (27%), Positives = 148/301 (49%), Gaps = 45/301 (14%)
Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
L+G A R+ + + V+G++++IE E DR+ + LP Q+E + ++ + ++VL
Sbjct: 592 LYGEAGRVVRECEKVVAVLGINKAIEREGQDRSDIQLPADQREFLKEIYKVNPN-IVVVL 650
Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
++G + + + D I AII A YPG++GG A+A++LFG NPGG+LP+T+Y +
Sbjct: 651 VAGSSLSINWM--DEHIPAIINAWYPGESGGKAVAEVLFGDYNPGGRLPLTYY--RSLDE 706
Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
LP + GRTY+++KG V+YPFG+G+SYT+F ++
Sbjct: 707 LPPFD------DYDITKGRTYQYFKGNVLYPFGYGLSYTSF-----------------KY 743
Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQL 705
+ T + + V+ +KNVG G ++ P P K+L
Sbjct: 744 SDLQVTEGNQEVNVSFC------------LKNVGKYAGDEVAQIYVKLPERDKIMPIKEL 791
Query: 706 VAFEKVHVPAGAQQRVGINIH--VCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAA 763
FE++ + G ++V I + + +Y + G P G++ I +G + + L
Sbjct: 792 KGFERISLKRGGSRKVTIRLKKDLLRYWD--EEKGCFVHPSGDYTIMVGASSADIRLQKV 849
Query: 764 T 764
+
Sbjct: 850 S 850
>gi|380694609|ref|ZP_09859468.1| periplasmic beta-glucosidase , xylosidase/arabinosidase
[Bacteroides faecis MAJ27]
Length = 804
Score = 276 bits (707), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 215/694 (30%), Positives = 335/694 (48%), Gaps = 108/694 (15%)
Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
G T FP I +++++ TL E +G+ ++ E R+ A ++Y P +++ RDPRW R
Sbjct: 161 GTTVFPTGIGMSATWSPTLIEEVGKAIAKEIRSQ----GAHISY-GPVLDLSRDPRWSRV 215
Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDRLKVA-ASCKHFTAYDLDNWNGVDRFHFNAKV 227
+ET GEDPVLSG+ A+ V GL D R A+ KHF AY + + A V
Sbjct: 216 EETFGEDPVLSGRLGAAMVTGLGSGDLSREHATIATLKHFLAYAVPEGGQNGNY---ASV 272
Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
+D+ + F PFR + G + SVM SYN ++G+P A+ +L + +R EW+ G++VS
Sbjct: 273 GARDLHENFLPPFREAIEAGAL-SVMTSYNSIDGIPCTANHYLLTQLLRNEWKFRGFVVS 331
Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
D S+ +++ ST EEAA A+ AG+D+D G ++ AV+ G L E IN A+
Sbjct: 332 DLYSIEGIHESHFVASTMEEAAVQALSAGVDIDLGGDAFMNLLQAVRSGKLDETQINAAV 391
Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
L ++ +G+F+ P P K V +H +LA + A+ +VLL+N+ LPLS
Sbjct: 392 DRILRMKFEMGLFE-HPYVNP--KTTTKMVRNKEHVKLARKVAQSSVVLLENKNSILPLS 448
Query: 408 HIRHRTVAVIGPNSDVTVTMIGNYAGIA--CGYTTPLQGIGRY---ARTIHQQGC--KDV 460
+ + VAV+GPN+D M+G+Y T L G+ +R + +GC +D
Sbjct: 449 K-KIKRVAVVGPNADNRYNMLGDYTAPQEDKDIRTVLDGVISKLSPSRVEYVRGCAIRDT 507
Query: 461 ACADDQLFGAAIDASRQADATILVMG-------------------LDQSIE----AEALD 497
+ A++A+ +++ I V+G ++SI E D
Sbjct: 508 TVNE---IAEAVEAAHRSEVIIAVVGGSSARDFKTSYQETGAAIADEKSISDMECGEGFD 564
Query: 498 RAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGG 557
RA L L G+QQ+L++ + K P I+V + G P+D +A A++ A YPGQAGG
Sbjct: 565 RATLTLLGKQQDLLNALKTTGK-PLIVVYIEGRPLDKVWASECAD--ALLTASYPGQAGG 621
Query: 558 TAIADILFGTSNPGGKLPMT----------WYPQEYITNLPMTEMAMRPSQSKRYPGRTY 607
AIAD+LFG NP G+LP++ +Y ++ N EMA P
Sbjct: 622 DAIADVLFGDYNPAGRLPVSVPRSVGQIPVYYNKKAPRNHDYVEMAASP----------- 670
Query: 608 RFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNR 667
+Y FG+G+SYT F ++ + I+ K + C+
Sbjct: 671 -------LYGFGYGLSYTTFEYS-------------------DLQITQK------SPCH- 697
Query: 668 LTLGVQVDVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIH 726
V VKN G+ DG L A P KQL FE+ + G ++ + +
Sbjct: 698 --FEVSFKVKNTGNYDGEEVAQLYLKDEYASVVQPLKQLKHFERFFLRKGEEKEILFTL- 754
Query: 727 VCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
K LS++DRS R + G+ I IG + + L
Sbjct: 755 TEKDLSIIDRSMKRVVETGDFRIMIGASSDDIRL 788
>gi|423269263|ref|ZP_17248235.1| hypothetical protein HMPREF1079_01317 [Bacteroides fragilis
CL05T00C42]
gi|423273173|ref|ZP_17252120.1| hypothetical protein HMPREF1080_00773 [Bacteroides fragilis
CL05T12C13]
gi|392701685|gb|EIY94842.1| hypothetical protein HMPREF1079_01317 [Bacteroides fragilis
CL05T00C42]
gi|392708205|gb|EIZ01313.1| hypothetical protein HMPREF1080_00773 [Bacteroides fragilis
CL05T12C13]
Length = 805
Score = 276 bits (706), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 222/689 (32%), Positives = 329/689 (47%), Gaps = 87/689 (12%)
Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
G T FP I AS++N L +GRV++ EA A G G + P +++ RDPRW R
Sbjct: 159 GTTVFPTSIGQASTWNPELIRQMGRVIAIEASA--QGAHIG---YGPVLDLARDPRWSRV 213
Query: 169 QETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
+ET GEDP L+G + VRG QG D V A+ KHF +Y W A +
Sbjct: 214 EETYGEDPYLNGVMGTALVRGFQGETLNDGKSVIATLKHFASY---GWTEGGHNGGTAHI 270
Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
++++E+ PFR V G + SVM SYN+++G P +L ++ W+ G++VS
Sbjct: 271 GERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVS 329
Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE---SAVQRGLLSEIDIN 344
D +VG + EAA A+ AG+D D G ++ E +AV+RG ++ I+
Sbjct: 330 DLYAVGGLRE-HGVAGNDYEAAIKAVNAGVDSDLGT--NVYAEQLVAAVKRGDVAVATID 386
Query: 345 NALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSL 404
A+ L+++ ++G+FD + + V + +H LA E ARQ IVLLKN+ L
Sbjct: 387 KAVRRILSLKFQMGLFDDPFVDEKQ---AVQLVASSEHTGLAREVARQSIVLLKNKDKLL 443
Query: 405 PLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYT--TPLQGI----GRYARTIHQQGCK 458
PL RT+AVIGPN+D M+G+Y T T L GI + R ++ +GC
Sbjct: 444 PLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCA 502
Query: 459 DVACADDQLFGAAIDASRQADATILVMG----LDQSIE-------------------AEA 495
V + F AI+ +R ADA ++VMG D S E E
Sbjct: 503 -VRDSSRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMESGEG 561
Query: 496 LDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQA 555
DRA L L GRQ EL+ ++S K P +LVL+ G P+ + A + AI+ A YPG
Sbjct: 562 YDRATLHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAE--AIVDAWYPGMQ 618
Query: 556 GGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVV 615
GG A+AD+LFG NP G+L ++ P+ + LP+ R RY G
Sbjct: 619 GGNAVADVLFGDYNPAGRLTLS-VPRS-VGQLPVYYNTRRKGNRSRYIEE-----PGTPR 671
Query: 616 YPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVD 675
YPFG+G+SYT F +T ++VT + + V V
Sbjct: 672 YPFGYGLSYTTFSYT------------------------DMKVQVTEG-SDDCRVDVTVT 706
Query: 676 VKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVV 734
++N G+ DG L F + P KQL AF ++H+ AG + V + K L++
Sbjct: 707 IQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTLD-KKSLALY 765
Query: 735 DRSGTRRIPLGEHNIHIGGTKHSVSLHAA 763
+ G + G I +GG+ ++ A
Sbjct: 766 MQEGEWVVEPGRFTIMVGGSSEDIACRQA 794
>gi|265765465|ref|ZP_06093740.1| periplasmic beta-glucosidase [Bacteroides sp. 2_1_16]
gi|263254849|gb|EEZ26283.1| periplasmic beta-glucosidase [Bacteroides sp. 2_1_16]
Length = 814
Score = 276 bits (706), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 222/689 (32%), Positives = 329/689 (47%), Gaps = 87/689 (12%)
Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
G T FP I AS++N L +GRV++ EA A G G + P +++ RDPRW R
Sbjct: 168 GTTVFPTSIGQASTWNPELIRQMGRVIAIEASA--QGAHIG---YGPVLDLARDPRWSRV 222
Query: 169 QETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
+ET GEDP L+G + VRG QG D V A+ KHF +Y W A +
Sbjct: 223 EETYGEDPYLNGAMGTALVRGFQGETLNDGKSVIATLKHFASY---GWTEGGHNGGTAHI 279
Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
++++E+ PFR V G + SVM SYN+++G P +L ++ W+ G++VS
Sbjct: 280 GERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVS 338
Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE---SAVQRGLLSEIDIN 344
D +VG + EAA A+ AG+D D G ++ E +AV+RG ++ I+
Sbjct: 339 DLYAVGGLRE-HGVAGNDYEAAIKAVNAGVDSDLGT--NVYAEQLVAAVKRGDVAVATID 395
Query: 345 NALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSL 404
A+ L+++ ++G+FD + + V + +H LA E ARQ IVLLKN+ L
Sbjct: 396 KAVRRILSLKFQMGLFDDPFVDEKQ---AVQLVASSEHTGLAREVARQSIVLLKNKDKLL 452
Query: 405 PLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYT--TPLQGI----GRYARTIHQQGCK 458
PL RT+AVIGPN+D M+G+Y T T L GI + R ++ +GC
Sbjct: 453 PLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCA 511
Query: 459 DVACADDQLFGAAIDASRQADATILVMG----LDQSIE-------------------AEA 495
V + F AI+ +R ADA ++VMG D S E E
Sbjct: 512 -VRDSSRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMESGEG 570
Query: 496 LDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQA 555
DRA L L GRQ EL+ ++S K P +LVL+ G P+ + A + AI+ A YPG
Sbjct: 571 YDRATLHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAE--AIVDAWYPGMQ 627
Query: 556 GGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVV 615
GG A+AD+LFG NP G+L ++ P+ + LP+ R RY G
Sbjct: 628 GGNAVADVLFGDYNPAGRLTLS-VPRS-VGQLPVYYNTRRKGNRSRYIEE-----PGTPR 680
Query: 616 YPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVD 675
YPFG+G+SYT F +T ++VT + + V V
Sbjct: 681 YPFGYGLSYTTFSYT------------------------DMKVQVTEG-SDDCRVDVTVT 715
Query: 676 VKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVV 734
++N G+ DG L F + P KQL AF ++H+ AG + V + K L++
Sbjct: 716 IQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTLD-KKSLALY 774
Query: 735 DRSGTRRIPLGEHNIHIGGTKHSVSLHAA 763
+ G + G I +GG+ ++ A
Sbjct: 775 MQEGEWVVEPGRFTIMVGGSSEDIACRQA 803
>gi|423258860|ref|ZP_17239783.1| hypothetical protein HMPREF1055_02060 [Bacteroides fragilis
CL07T00C01]
gi|423264169|ref|ZP_17243172.1| hypothetical protein HMPREF1056_00859 [Bacteroides fragilis
CL07T12C05]
gi|387776440|gb|EIK38540.1| hypothetical protein HMPREF1055_02060 [Bacteroides fragilis
CL07T00C01]
gi|392706435|gb|EIY99558.1| hypothetical protein HMPREF1056_00859 [Bacteroides fragilis
CL07T12C05]
Length = 805
Score = 276 bits (706), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 222/689 (32%), Positives = 329/689 (47%), Gaps = 87/689 (12%)
Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
G T FP I AS++N L +GRV++ EA A G G + P +++ RDPRW R
Sbjct: 159 GTTVFPTSIGQASTWNPELIRQMGRVIAIEASA--QGAHIG---YGPVLDLARDPRWSRV 213
Query: 169 QETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
+ET GEDP L+G + VRG QG D V A+ KHF +Y W A +
Sbjct: 214 EETYGEDPYLNGAMGTALVRGFQGETLNDGKSVIATLKHFASY---GWTEGGHNGGTAHI 270
Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
++++E+ PFR V G + SVM SYN+++G P +L ++ W+ G++VS
Sbjct: 271 GERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVS 329
Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE---SAVQRGLLSEIDIN 344
D +VG + EAA A+ AG+D D G ++ E +AV+RG ++ I+
Sbjct: 330 DLYAVGGLRE-HGVAGNDYEAAIKAVNAGVDSDLGT--NVYAEQLVAAVKRGDVAVATID 386
Query: 345 NALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSL 404
A+ L+++ ++G+FD + + V + +H LA E ARQ IVLLKN+ L
Sbjct: 387 KAVRRILSLKFQMGLFDDPFVDEKQ---AVQLVASSEHTGLAREVARQSIVLLKNKDKLL 443
Query: 405 PLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYT--TPLQGI----GRYARTIHQQGCK 458
PL RT+AVIGPN+D M+G+Y T T L GI + R ++ +GC
Sbjct: 444 PLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCA 502
Query: 459 DVACADDQLFGAAIDASRQADATILVMG----LDQSIE-------------------AEA 495
V + F AI+ +R ADA ++VMG D S E E
Sbjct: 503 -VRDSSRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMESGEG 561
Query: 496 LDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQA 555
DRA L L GRQ EL+ ++S K P +LVL+ G P+ + A + AI+ A YPG
Sbjct: 562 YDRATLHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAE--AIVDAWYPGMQ 618
Query: 556 GGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVV 615
GG A+AD+LFG NP G+L ++ P+ + LP+ R RY G
Sbjct: 619 GGNAVADVLFGDYNPAGRLTLS-VPRS-VGQLPVYYNTRRKGNRSRYIEE-----PGTPR 671
Query: 616 YPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVD 675
YPFG+G+SYT F +T ++VT + + V V
Sbjct: 672 YPFGYGLSYTTFSYT------------------------DMKVQVTEG-SDDCRVDVTVT 706
Query: 676 VKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVV 734
++N G+ DG L F + P KQL AF ++H+ AG + V + K L++
Sbjct: 707 IQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTLD-KKSLALY 765
Query: 735 DRSGTRRIPLGEHNIHIGGTKHSVSLHAA 763
+ G + G I +GG+ ++ A
Sbjct: 766 MQEGEWVVEPGRFTIMVGGSSEDIACRQA 794
>gi|325299987|ref|YP_004259904.1| Beta-glucosidase [Bacteroides salanitronis DSM 18170]
gi|324319540|gb|ADY37431.1| Beta-glucosidase [Bacteroides salanitronis DSM 18170]
Length = 864
Score = 276 bits (705), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 165/448 (36%), Positives = 242/448 (54%), Gaps = 54/448 (12%)
Query: 35 ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
A + LP+ L +R NDL+GRL+L+EK L+ + + A+PRLGIK Y+WW+EALHGV
Sbjct: 20 AMGQQLPYQNPKLTPEERANDLVGRLTLEEKASLMQNTSPAIPRLGIKAYDWWNEALHGV 79
Query: 95 SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GG 146
G AT FPQ I A+SF+ L + VSDEARA Y
Sbjct: 80 GRAGI----------ATVFPQTIGMAASFDDELLYQVFTAVSDEARAKYTQFRKEGDLKR 129
Query: 147 TAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCK 205
GLT+W+PNVNIFRDPRWGRGQET GEDP L+ + + VRGLQG D K+ A K
Sbjct: 130 YQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGMAVVRGLQGPEDAPYDKLHACAK 189
Query: 206 HFTAYDLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
HF + WN R FNA+ ++ +D+ +T+ F+ V + V VMC+YN++ G P
Sbjct: 190 HFAVHSGPEWN---RHEFNAENIAPRDLWETYMPAFKDLVQKAHVKEVMCAYNRLEGEPC 246
Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEE--AAADAIRAGLDLDCG 322
C + +L +R EW G +VSDC ++ ++ + P++ A+A A+ +G DL+CG
Sbjct: 247 CGNNRLLTHILRDEWGYQGIVVSDCGAISDFWRKGDHETHPDKAHASAGAVLSGTDLECG 306
Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVC---- 378
E AV+ GL++E ++ ++ L + LG D KDVC
Sbjct: 307 SNYKSLPE-AVKAGLIAESQLDISVKRLLKARFELGEMD-------------KDVCWDTI 352
Query: 379 ------TPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYA 432
H++LAL AR+ IVLL+N+ LPL + +A++GPN++ ++ GNY
Sbjct: 353 PYSVVDCQAHKDLALRMARESIVLLQNRNNILPLR--KDMKIALVGPNANDSIMHWGNYN 410
Query: 433 GIACGYTTPLQGIGRY---ARTIHQQGC 457
G T + + + ++ I++ GC
Sbjct: 411 GFPSHTETLYEALKKRLPASQLIYEFGC 438
Score = 124 bits (310), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 91/309 (29%), Positives = 136/309 (44%), Gaps = 49/309 (15%)
Query: 470 AAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASK 519
A D + AD + G+ ++E E + DR + LP Q++LV ++ K
Sbjct: 593 ATADKVKDADVILFAGGISPTLEGEEMPVDAEGFRGGDRTSIELPAIQRQLVGELKKLGK 652
Query: 520 GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWY 579
P + + SG + A +I A YPGQAGGTAIAD+LFG NP GKLP+T+Y
Sbjct: 653 -PIVFINYSGSAM--GLAPESEICDGMIQAWYPGQAGGTAIADVLFGDYNPAGKLPVTFY 709
Query: 580 PQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
LP + AM+ GRTYR+ ++ FGHG+SYT F +
Sbjct: 710 RN--TEQLPDFEDYAMK--------GRTYRYMTETPLFRFGHGLSYTTFDY--------- 750
Query: 639 AVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGH 698
GKA + TL + + V N G++DG T+ V+ P
Sbjct: 751 ----------------GKARLSQNTFSKGETLTLTIPVSNTGTRDGEETVQVYLRRPGDA 794
Query: 699 WAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSV 758
AP L AF++V+VP G + + + +L + + GE+ + GGT +
Sbjct: 795 DAPSHTLRAFKRVYVPKGGTKEIKFTLSDDNFLWFDTSTNNMNLISGEYELLYGGTSDTN 854
Query: 759 SLHAATLGV 767
L L +
Sbjct: 855 QLKKIRLMI 863
>gi|375357172|ref|YP_005109944.1| putative beta-glucosidase [Bacteroides fragilis 638R]
gi|301161853|emb|CBW21397.1| putative beta-glucosidase [Bacteroides fragilis 638R]
Length = 814
Score = 276 bits (705), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 222/689 (32%), Positives = 329/689 (47%), Gaps = 87/689 (12%)
Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
G T FP I AS++N L +GRV++ EA A G G + P +++ RDPRW R
Sbjct: 168 GTTVFPTSIGQASTWNPELIRQMGRVIAIEASA--QGAHIG---YGPVLDLARDPRWSRV 222
Query: 169 QETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
+ET GEDP L+G + VRG QG D V A+ KHF +Y W A +
Sbjct: 223 EETYGEDPYLNGVMGTALVRGFQGETLNDGKSVIATLKHFASY---GWTEGGHNGGTAHI 279
Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
++++E+ PFR V G + SVM SYN+++G P +L ++ W+ G++VS
Sbjct: 280 GERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVS 338
Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE---SAVQRGLLSEIDIN 344
D +VG + EAA A+ AG+D D G ++ E +AV+RG ++ I+
Sbjct: 339 DLYAVGGLRE-HGVAGNDYEAAIKAVNAGVDSDLGT--NVYAEQLVAAVKRGDVAVATID 395
Query: 345 NALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSL 404
A+ L+++ ++G+FD + + V + +H LA E ARQ IVLLKN+ L
Sbjct: 396 KAVRRILSLKFQMGLFDDPFVDEKQ---AVQLVASSEHTGLAREVARQSIVLLKNKDKLL 452
Query: 405 PLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYT--TPLQGI----GRYARTIHQQGCK 458
PL RT+AVIGPN+D M+G+Y T T L GI + R ++ +GC
Sbjct: 453 PLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCA 511
Query: 459 DVACADDQLFGAAIDASRQADATILVMG----LDQSIE-------------------AEA 495
V + F AI+ +R ADA ++VMG D S E E
Sbjct: 512 -VRDSSRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMESGEG 570
Query: 496 LDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQA 555
DRA L L GRQ EL+ ++S K P +LVL+ G P+ + A + AI+ A YPG
Sbjct: 571 YDRATLHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAE--AIVDAWYPGMQ 627
Query: 556 GGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVV 615
GG A+AD+LFG NP G+L ++ P+ + LP+ R RY G
Sbjct: 628 GGNAVADVLFGDYNPAGRLTLS-VPRS-VGQLPVYYNTRRKGNRSRYIEE-----PGTPR 680
Query: 616 YPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVD 675
YPFG+G+SYT F +T ++VT + + V V
Sbjct: 681 YPFGYGLSYTTFSYT------------------------DMKVQVTEG-SDDCRVDVTVT 715
Query: 676 VKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVV 734
++N G+ DG L F + P KQL AF ++H+ AG + V + K L++
Sbjct: 716 IQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTLD-KKSLALY 774
Query: 735 DRSGTRRIPLGEHNIHIGGTKHSVSLHAA 763
+ G + G I +GG+ ++ A
Sbjct: 775 MQEGEWVVEPGRFTIMVGGSSEDIACRQA 803
>gi|315606832|ref|ZP_07881841.1| beta-glucosidase [Prevotella buccae ATCC 33574]
gi|315251497|gb|EFU31477.1| beta-glucosidase [Prevotella buccae ATCC 33574]
Length = 858
Score = 276 bits (705), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 173/457 (37%), Positives = 250/457 (54%), Gaps = 30/457 (6%)
Query: 35 ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
AT + LP+C +L +R DL+ RL+L+EK +L++ + A+PRLGIK + WWSEALHG
Sbjct: 17 ATAQLLPYCNPALSARERARDLLSRLTLEEKARLMLDESPAIPRLGIKKFFWWSEALHGA 76
Query: 95 SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY-----NGGTA- 148
+N+G G T FP+ + A+SFN L + SDE RA Y NGG
Sbjct: 77 ANMG----------GVTVFPEPVGMAASFNDGLLRRVFDAASDEMRAQYNRRMLNGGEDE 126
Query: 149 ---GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASC- 204
L+ W+PNVNIFRDPRWGRGQET GEDP L+ + VRGLQG + + + +C
Sbjct: 127 KFHSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSVMGTAVVRGLQGPETAKYRKLWACA 186
Query: 205 KHFTAYDLDNWNGVDRFHFN-AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVP 263
KH+ + + R N A VS +D+ +T+ F+ V E KV VMC+Y +++ P
Sbjct: 187 KHYAVHSGPEYT---RHTANVADVSPRDLWETYLPAFKTLVTEAKVREVMCAYQRLDDDP 243
Query: 264 TCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGP 323
C++ +L++ +R EW N +VSDC +V Y +S AAA A AG D++CG
Sbjct: 244 CCSNNRLLQQILRDEWGFNYLVVSDCGAVTDIYANHKTSSDAVHAAAKAAVAGTDVECGF 303
Query: 324 FLGLHT-ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
T AV+RGL++E +++ ++ L + LG D +P + + + + H
Sbjct: 304 GYAYKTIPEAVRRGLITEAEVDKHVLRLLEGRFDLGEMD-DPKLVEWSKIPASVMDSKAH 362
Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
++LAL+ ARQ +VLL+N+G LPL + VIGPN+D M GNY G T L
Sbjct: 363 RQLALDMARQSLVLLQNKGGVLPLK-AGGDPITVIGPNADDGPMMWGNYNGTPNRTVTIL 421
Query: 443 QGI-GRYARTIHQQGCK--DVACADDQLFGAAIDASR 476
GI R+ R + +GC D + L AID +
Sbjct: 422 DGIKARHTRVTYLKGCDLTDTKTVNSLLPQCAIDGRK 458
Score = 98.6 bits (244), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 83/299 (27%), Positives = 130/299 (43%), Gaps = 54/299 (18%)
Query: 482 ILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGP 531
+ V G+ ++E E + DR + LP Q++ + + A K + V SG
Sbjct: 603 VFVGGISAALEGEEMPVDIDGFKGGDRTNIELPKVQRDFLRALHEAGK-TVVFVNCSGSA 661
Query: 532 IDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTE 591
I A AI+ A Y GQ GGTA++D+LFGT NP GKLP+T+Y + LP E
Sbjct: 662 I--ALEPEMETCDAILQAWYAGQEGGTAVSDVLFGTVNPSGKLPVTFYKR--TDQLPDYE 717
Query: 592 -MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSIN 650
+MR GRTYR++ P ++ FG+G+SYT F R G
Sbjct: 718 DYSMR--------GRTYRYFSDP-LFAFGYGLSYTTF-----------------RFGRAR 751
Query: 651 ATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEK 710
A + +++ V + N G++ G + V+ A P K L AF +
Sbjct: 752 AEAAEGGYRLS------------VPLTNTGTRPGEEVVQVYIRRVADTNGPLKSLRAFRR 799
Query: 711 VHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIK 769
V + AG V I + + + + T R G++ + G + L T+ V++
Sbjct: 800 VALKAGESTTVEIPLSRKSFECFDESTNTMRTLPGDYELMYGNSSRPKDLQTLTVSVVE 858
>gi|60680320|ref|YP_210464.1| beta-glucosidase [Bacteroides fragilis NCTC 9343]
gi|60491754|emb|CAH06512.1| putative beta-glucosidase [Bacteroides fragilis NCTC 9343]
Length = 814
Score = 276 bits (705), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 222/689 (32%), Positives = 329/689 (47%), Gaps = 87/689 (12%)
Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
G T FP I AS++N L +GRV++ EA A G G + P +++ RDPRW R
Sbjct: 168 GTTVFPTSIGQASTWNPELIRQMGRVIAIEASA--QGAHIG---YGPVLDLARDPRWSRV 222
Query: 169 QETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
+ET GEDP L+G + VRG QG D V A+ KHF +Y W A +
Sbjct: 223 EETYGEDPYLNGVMGTALVRGFQGETLNDGKSVIATLKHFASY---GWTEGGHNGGTAHI 279
Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
++++E+ PFR V G + SVM SYN+++G P +L ++ W+ G++VS
Sbjct: 280 GERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVS 338
Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE---SAVQRGLLSEIDIN 344
D +VG + EAA A+ AG+D D G ++ E +AV+RG ++ I+
Sbjct: 339 DLYAVGGLRE-HGVAGNDYEAAIKAVNAGVDSDLGT--NVYAEQLVAAVKRGDVAVATID 395
Query: 345 NALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSL 404
A+ L+++ ++G+FD + + V + +H LA E ARQ IVLLKN+ L
Sbjct: 396 KAVRRILSLKFQMGLFDDPFVDEKQ---AVQLVASSEHTGLAREVARQSIVLLKNKDKLL 452
Query: 405 PLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYT--TPLQGI----GRYARTIHQQGCK 458
PL RT+AVIGPN+D M+G+Y T T L GI + R ++ +GC
Sbjct: 453 PLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCT 511
Query: 459 DVACADDQLFGAAIDASRQADATILVMG----LDQSIE-------------------AEA 495
V + F AI+ +R ADA ++VMG D S E E
Sbjct: 512 -VRDSSRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMESGEG 570
Query: 496 LDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQA 555
DRA L L GRQ EL+ ++S K P +LVL+ G P+ + A + AI+ A YPG
Sbjct: 571 YDRATLHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAE--AIVDAWYPGMQ 627
Query: 556 GGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVV 615
GG A+AD+LFG NP G+L ++ P+ + LP+ R RY G
Sbjct: 628 GGNAVADVLFGDYNPAGRLTLS-VPRS-VGQLPVYYNTRRKGNRSRYIEE-----PGTPR 680
Query: 616 YPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVD 675
YPFG+G+SYT F +T ++VT + + V V
Sbjct: 681 YPFGYGLSYTTFSYT------------------------DMKVQVTEG-SDDCRVDVTVT 715
Query: 676 VKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVV 734
++N G+ DG L F + P KQL AF ++H+ AG + V + K L++
Sbjct: 716 IQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTLD-KKSLALY 774
Query: 735 DRSGTRRIPLGEHNIHIGGTKHSVSLHAA 763
+ G + G I +GG+ ++ A
Sbjct: 775 MQEGEWVVEPGRFTIMVGGSSEDIACRQA 803
>gi|393782348|ref|ZP_10370533.1| hypothetical protein HMPREF1071_01401 [Bacteroides salyersiae
CL02T12C01]
gi|392673619|gb|EIY67078.1| hypothetical protein HMPREF1071_01401 [Bacteroides salyersiae
CL02T12C01]
Length = 852
Score = 275 bits (704), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 164/426 (38%), Positives = 239/426 (56%), Gaps = 39/426 (9%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
F ++ P +R+ DL+ RL+++EK+ LL++ A +PRL I Y +EALHG+ V PG
Sbjct: 30 FRDMNAPQHERLLDLLSRLTIEEKISLLVNDAREIPRLNIDKYYHGNEALHGI--VRPGE 87
Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN----------GGTAGLT 151
T FPQ I A+++N L + +SDEAR + G + LT
Sbjct: 88 --------FTVFPQAIGLAATWNPGLIFEVSSAISDEARGRWKELDYGKKQIAGASDLLT 139
Query: 152 YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYD 211
+WSP VN+ RDPRWGR ET GEDP L+G +V+GLQG LK ++ KHF A +
Sbjct: 140 FWSPTVNMARDPRWGRTPETYGEDPFLTGVIGCEFVKGLQGDHPRYLKTVSTPKHFAANN 199
Query: 212 LDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
++ +R NA++S++D+ + + F C+++ K S+M +YN VNGVP + ++
Sbjct: 200 EEH----NRSSCNARMSERDLREFYLPSFERCIVDAKAQSIMMAYNAVNGVPCTVNTYLI 255
Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG------PFL 325
K +RG+W NGYIVSDC + + + AA AI+AGLDL+CG P L
Sbjct: 256 KNVLRGDWGFNGYIVSDCSAPEWMVTKHKYVRDLDAAATLAIKAGLDLECGDRVYTAPLL 315
Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
+ ES V S+ DI++A L +M LG+FD +PS PY + P + HQEL
Sbjct: 316 KAYNESMV-----SKADIDSAAYRVLRGRMLLGLFD-DPSQNPYNQIEPSVIGCKKHQEL 369
Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGY-TTPLQG 444
ALE ARQ +VLLKNQ LPL+ + +++AV+G N+ G+Y+GI + L G
Sbjct: 370 ALETARQSMVLLKNQKNFLPLNLKKVKSIAVVGINA--GHCEFGDYSGIPKNAPVSVLDG 427
Query: 445 IGRYAR 450
I +YA
Sbjct: 428 IRKYAE 433
Score = 141 bits (355), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 91/296 (30%), Positives = 145/296 (48%), Gaps = 42/296 (14%)
Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
LFG A A+++ D T+ V+G+++SIE E DR L LP QQE + ++ + T++VL
Sbjct: 593 LFGDAGKAAKECDVTVAVLGINKSIEREGQDRYSLELPTDQQEFIRELYKVNPN-TVVVL 651
Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
++G + + + D + AI+ A YPG+ GGTAIA++LFG NPGG+LP+T+Y +
Sbjct: 652 VAGSSLAINWI--DENVPAILNAWYPGEQGGTAIAEVLFGDYNPGGRLPLTYYNS--LDE 707
Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
LP + + RTY+++KG +Y FG+G+SYT F +
Sbjct: 708 LPSFD-------NYSVQNRTYQYFKGKPLYEFGYGLSYTKF-----------------NY 743
Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQL 705
N +I+ I +T V N G DG V+ P G + P KQL
Sbjct: 744 KKKNVSIANDTIDITFK------------VSNAGKYDGDEVAQVYVQYPETGTYMPLKQL 791
Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
F +VH+ G V I++ + +++ P G++ IG + + L
Sbjct: 792 RGFSRVHIKKGKSADVTISVPKKELRYWDEKTRQFVTPEGKYVFLIGSSSEDIKLQ 847
>gi|238620766|ref|YP_002915592.1| glycoside hydrolase family protein [Sulfolobus islandicus M.16.4]
gi|238381836|gb|ACR42924.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
M.16.4]
Length = 755
Score = 275 bits (704), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 207/691 (29%), Positives = 347/691 (50%), Gaps = 100/691 (14%)
Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQ 169
AT+FPQ I AS+++ L + + +A+ + GT SP +++ RDPRWGR +
Sbjct: 101 ATAFPQAIGLASTWDPDLIREVSSTIRYQAKLI---GTN--QCLSPVLDVCRDPRWGRCE 155
Query: 170 ETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL-DNWNGVDRFHFNAKVS 228
ET GED L +YV+GLQG + ++ A+ KHF A+ + + H V
Sbjct: 156 ETYGEDQYLVASIGLAYVKGLQGEN----ELIATVKHFAAHGFPEGGRNIAPVH----VG 207
Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
+++ + F PF + + GK SVM +Y++++G+P ++ +L + +R EW G +VSD
Sbjct: 208 NRELREVFLFPFEVAIKLGKAMSVMPAYHEIDGIPCHSNAELLTKILRQEWGFEGIVVSD 267
Query: 289 CDSVGVYYDTQHFTSTPEEAAADAIRAGLD-----LDCGPFLGLHTESAVQRGLLSEIDI 343
D++ + +EAA A+ AG+D +DC G AV+ GL+SE I
Sbjct: 268 YDAIRQLEAIHKVSLNKKEAAILALEAGVDTEFPNIDC---FGEPLLEAVKEGLISESII 324
Query: 344 NNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
+ A+ L ++ +LG+F+ ++ + P+ + +ELAL+ AR+ IVLLKN
Sbjct: 325 DRAVERVLRIKEKLGLFNDHYINE---NNVPEKLDNSKSRELALDVARKSIVLLKNDN-I 380
Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNY-------AGIACGYTTPLQGIGRYA----RTI 452
LPL+ T+AVIGPN++ ++G+Y A + T L+GI R +
Sbjct: 381 LPLNK-NIGTIAVIGPNANEPRNLLGDYTYTGHLNADVGIEVVTVLEGIMRKVSNNTNVL 439
Query: 453 HQQGCKDVACADDQLFGAAIDASRQADATILVMGLD-------------------QSIEA 493
+ +GC D+A + F AI+ +++ D I VMG Q++
Sbjct: 440 YAKGC-DIAAESKEGFSEAIEIAKKGDIIIAVMGEKSGLPLSWTDVPGKDEFEKYQAVTG 498
Query: 494 EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPG 553
E DR L LPG Q+EL+ ++ K P ILVL++G P+ ++ N+ + AII A +PG
Sbjct: 499 EGNDRTSLRLPGVQEELLKELHKTGK-PIILVLVNGRPLALSSIFNE--VNAIIDAWFPG 555
Query: 554 QAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPG--RTYRFYK 611
+ GG AIAD++FG NP G+LP+++ P+ + P R P R Y K
Sbjct: 556 EEGGNAIADVIFGDYNPSGRLPISF---------PI-DTGQIPIYYNRKPSSLRPYVMMK 605
Query: 612 GPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLG 671
++PFG+G+SYT F ++ L+ +N+ SGK +
Sbjct: 606 SKPLFPFGYGLSYTEFKYS----------NLEVTPKEVNS--SGK-------------IK 640
Query: 672 VQVDVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKY 730
+ ++V+NVG ++G T+ L S +G P K+L F KV++ ++++ ++ + +
Sbjct: 641 ISLEVENVGKREGEETVQLYISKQYSGVSRPIKELKGFAKVYLKPNEKRKITFSLPL-EA 699
Query: 731 LSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
L+ D+ I G++ I IG + + L
Sbjct: 700 LAFYDQYMRLIIDTGDYEILIGKSSEDIVLK 730
>gi|300726322|ref|ZP_07059774.1| beta-xylosidase B [Prevotella bryantii B14]
gi|291292284|gb|ADD92014.1| Xyl3A [Prevotella bryantii B14]
gi|299776347|gb|EFI72905.1| beta-xylosidase B [Prevotella bryantii B14]
Length = 885
Score = 275 bits (704), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 166/443 (37%), Positives = 241/443 (54%), Gaps = 39/443 (8%)
Query: 34 DATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHG 93
D T+ LP+ +L +R DL RL+L+EK L+ + A+PRLGIK + WWSEALHG
Sbjct: 42 DKTSTLLPYQNPNLSAYERAIDLCHRLTLEEKALLMQDESPAIPRLGIKKFFWWSEALHG 101
Query: 94 VSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY-----NGGTA 148
+N+G T+FP+ I ASSFN TL +++ SDE RA Y NGG
Sbjct: 102 AANMG----------NVTNFPEPIAMASSFNPTLLKSVFSAASDEMRAQYHHRMDNGGED 151
Query: 149 ----GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASC 204
L+ W+PNVNIFRDPRWGRGQET GEDP L+ + V GLQG + + + +C
Sbjct: 152 EKFHSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSVMGCAVVEGLQGPESSKYRKLWAC 211
Query: 205 -KHFTAYDLDNWNGVDRFHFNA---KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVN 260
KHF + +G + A +S +D+ +T+ F+ V +G V VMC+Y +++
Sbjct: 212 AKHFAVH-----SGPESTRHTANLNNISPRDLYETYLPAFQSTVQDGHVREVMCAYQRLD 266
Query: 261 GVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLD 320
P C++ +L++ +R EW +VSDC +V + + +S A+ A AG D++
Sbjct: 267 DEPCCSNNRLLQQILREEWGFKYLVVSDCGAVSDIWQSHKTSSDAVHASRQATLAGTDVE 326
Query: 321 CGP-FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDG----EPSSQPYGHLGPK 375
CG + AV+RGLL+E +I+ ++ L + LG D E S PY + K
Sbjct: 327 CGYGYTYAKIPEAVKRGLLTEEEIDKHVIRLLEGRFDLGEMDDSKLVEWSKIPYSIMSCK 386
Query: 376 DVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIA 435
H +LAL+ ARQ IVLL+N+G LPL ++ +AVIGPN+D M GNY G
Sbjct: 387 -----AHAQLALDMARQSIVLLQNKGNILPLQLKKNERIAVIGPNADNKPMMWGNYNGTP 441
Query: 436 CGYTTPLQGIGR-YARTIHQQGC 457
+ L+GI + Y ++ C
Sbjct: 442 NHTVSILEGIRKQYKNVVYLPAC 464
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 87/304 (28%), Positives = 134/304 (44%), Gaps = 53/304 (17%)
Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
+ A I + D I V G+ S+E E + DR + +P Q++ + ++ A
Sbjct: 616 YAANIAQLKGIDKVIFVGGIAPSLEGEEMPVNIPGFKGGDRTDIEMPQVQRDFIKALAEA 675
Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
K ILV SG I A R AII A YPGQ GGTA+ADIL G NP GKLP+T
Sbjct: 676 GK-QIILVNCSGSAI--ALTPEAQRCQAIIQAWYPGQEGGTAVADILMGKVNPMGKLPVT 732
Query: 578 WYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
+Y LP + +M+ RTYR+++ +YPFG+G+SYT+F A T
Sbjct: 733 FYKS--TQQLPDFEDYSMK--------NRTYRYFED-ALYPFGYGLSYTSFEIGTAKLQT 781
Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA 696
+ N +TL Q+ V N G ++G + V+
Sbjct: 782 LTN--------------------------NSITL--QIPVTNTGKREGTELVQVYLRRDD 813
Query: 697 GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKH 756
P K L +F + + AG ++ + ++ ++ + T R+ G++ I G +
Sbjct: 814 DVEGPSKTLRSFAHITLKAGETKKAILKLNRNQFECWDASTNTMRVIPGKYTIFYGNSSK 873
Query: 757 SVSL 760
L
Sbjct: 874 KEDL 877
>gi|390957160|ref|YP_006420917.1| beta-glucosidase-like glycosyl hydrolase [Terriglobus roseus DSM
18391]
gi|390412078|gb|AFL87582.1| beta-glucosidase-like glycosyl hydrolase [Terriglobus roseus DSM
18391]
Length = 908
Score = 275 bits (703), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 153/405 (37%), Positives = 232/405 (57%), Gaps = 26/405 (6%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
QR DL+GR++L+EK +++GAAA+PRL + Y++W+E LHGV+ G A
Sbjct: 33 QRAADLVGRMTLEEKSLQMVNGAAAIPRLNVPAYDYWNEGLHGVARSG----------YA 82
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRD 162
T FPQ I A++++A L + IG V++ EARA N GLT+WSPN+NIFRD
Sbjct: 83 TMFPQAIGMAATWDAPLLKQIGDVIATEARAKNNEALRRNNHDIYFGLTFWSPNINIFRD 142
Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFH 222
PRWGRGQET GEDP L+ + +++ GLQG+D KV A+ KHF + R
Sbjct: 143 PRWGRGQETYGEDPHLTTQLGVNFIEGLQGTDPKFYKVIATPKHFAVHSGPEEG---RHK 199
Query: 223 FNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLN 282
F+ + + D+ DT+ FR +++ K S+MC+YN+++G P C +L +R +W+
Sbjct: 200 FDVEPTPHDLWDTYLPQFRAAIVDAKADSIMCAYNRIDGQPACGSKLLLVDILRNDWKFQ 259
Query: 283 GYIVSDCDSVGVYYDTQHFTSTPEEAAAD--AIRAGLDLDCGPFLGLHTESAVQRGLLSE 340
G++ SDC ++ ++ + P+ AD A+ AG D +CG AV+ GL+ E
Sbjct: 260 GFVTSDCGAIDDFFRPNTHQTEPDAEHADKAALLAGTDTNCGSTY-RKLGDAVKSGLIKE 318
Query: 341 IDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
DI+ +L ++RLG+FD S PY + V +P + +A AA + +VLLKN
Sbjct: 319 SDIDVSLRRLFEARVRLGLFD-PAGSVPYAQIPFSQVNSPANAAVAKRAAEESMVLLKND 377
Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
G LPL +++T+AVIGPN ++ GNY G+A P+ +
Sbjct: 378 G-ILPLKAGKYKTIAVIGPNGASLSSLEGNYNGMAHDPRMPVDAL 421
Score = 135 bits (340), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 96/304 (31%), Positives = 153/304 (50%), Gaps = 55/304 (18%)
Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSM 516
L A++A+ ++D + ++GL +E E + DR + LP QQ L+ + +
Sbjct: 620 LLPEALEAANKSDLVVAMLGLSPDLEGEEMPVKLPGFVGGDRTDISLPASQQALLQGL-I 678
Query: 517 ASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPM 576
A+ PTI+VL++G + + A D + AI+ + YPG+AG TA+AD L G +NP G+LP+
Sbjct: 679 ATGKPTIVVLLNGSALAINLA--DEKANAILESWYPGEAGSTALADTLVGRNNPSGRLPI 736
Query: 577 TWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAP 635
T+Y E ++LP + +M+ RTYR++KG +Y FG G+SYT F ++
Sbjct: 737 TFYKSE--SDLPGFEDYSMQ--------NRTYRYFKGAPLYGFGFGLSYTKFAYS----- 781
Query: 636 TVVAVPLDGRHGSINATISGKAIKVTHAKCNR-LTLGVQVDVKNVGSKDGAHTLLVFSTP 694
+K+ AK N TL +V VKN G G ++ P
Sbjct: 782 ---------------------GLKLAKAKLNAGDTLTAEVTVKNTGKVAGEEVAELYLLP 820
Query: 695 PA---GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHI 751
PA +P +QL F++V + G +++ + + LS VD GTR I G + I I
Sbjct: 821 PAEGNAGLSPKQQLEGFQRVMLKPGESRKLTFTL-TPRQLSEVDAKGTRAIQPGTYAIAI 879
Query: 752 GGTK 755
GG +
Sbjct: 880 GGAQ 883
>gi|365121645|ref|ZP_09338561.1| hypothetical protein HMPREF1033_01907 [Tannerella sp.
6_1_58FAA_CT1]
gi|363645135|gb|EHL84409.1| hypothetical protein HMPREF1033_01907 [Tannerella sp.
6_1_58FAA_CT1]
Length = 868
Score = 275 bits (703), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 170/454 (37%), Positives = 256/454 (56%), Gaps = 46/454 (10%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
P+ L +R DL+ R++L+EK + + + RLG++ Y+WW+EALHG++ G
Sbjct: 25 PYKNPELSPDERALDLLNRMTLKEKFAQMHNNTGGIERLGVRPYDWWNEALHGIARAGK- 83
Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA---------MYNGGTAGLT 151
AT FPQ I A++F+ T + +VSDE RA MYNG GLT
Sbjct: 84 ---------ATVFPQAIGLAATFDDTAVYEMFDMVSDEGRAKYHDFQRKGMYNG-YKGLT 133
Query: 152 YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASC-KHFTAY 210
+W+PN+NIFRDPRWGRG ET GEDP L+ K + V+GLQG + A +C KH+ +
Sbjct: 134 FWTPNINIFRDPRWGRGMETYGEDPFLTTKMGLAVVKGLQGDGTQKYDKAHACAKHYAVH 193
Query: 211 DLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
WN R +NA+ +S +D+ +T+ F+ V EGKV VMC+YN+ G P C++
Sbjct: 194 SGPEWN---RHSYNAENISIRDLRETYLPAFKALVTEGKVKEVMCAYNRFEGEPCCSNKT 250
Query: 270 ILKRTIRGEWRLNGYIVSDCDSVGVYYDT-QHFT-STPEEAAADAIRAGLDLDC-GPFLG 326
+L ++ EW + IVSDC ++ +Y +H T ++ +A+ADA+ +G DL+C G +
Sbjct: 251 LLINILKDEWGFDDVIVSDCGAIADFYTKGRHETHASAADASADAVISGTDLECGGSYWA 310
Query: 327 LHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP----SSQPYGHLGPKDVCTPDH 382
L + A+++GL++E IN ++ L + LGMFD + SS PY VC H
Sbjct: 311 L--DEALEKGLITETKINESVFRLLRARFELGMFDDDSLVSWSSIPYSV-----VCCDKH 363
Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
+ ALE AR+ +VLL N+ +LPLS + VAV+GPN++ +V + NY G T L
Sbjct: 364 KAKALEMARKSMVLLSNKNNTLPLSK-SIKKVAVMGPNANDSVMLWANYNGTPDRSVTIL 422
Query: 443 QGIGRY---ARTIHQQGCKDVACADDQLFGAAID 473
+GI I+++GC V D ++F + D
Sbjct: 423 EGIKAKLPEGSVIYEKGCDYV---DTEVFFSYFD 453
Score = 134 bits (336), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 97/320 (30%), Positives = 149/320 (46%), Gaps = 46/320 (14%)
Query: 459 DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQ 508
DV + + A + + ADA I V G+ S+E E + DR + LP Q+
Sbjct: 583 DVGLSRQIDYKAVAEKVKDADAIIFVGGISSSLEGEEMGVKYPGFRNGDRTNIDLPQVQK 642
Query: 509 ELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTS 568
++ + K P I VL SG + A + D + AI+ A YPGQ GGTA+AD+LFG
Sbjct: 643 NMMKALKETGK-PVIFVLCSGSTM--ALSWEDKNMDAILQAWYPGQEGGTAVADVLFGDY 699
Query: 569 NPGGKLPMTWYPQEYITNLPMTE-MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNF 627
NP G+LP+T+Y +LP E M Q GRTYR++KG +YPFGHG+SYT F
Sbjct: 700 NPAGRLPLTFYASS--DDLPDFENYNMSEGQ-----GRTYRYFKGKPLYPFGHGLSYTGF 752
Query: 628 VHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHT 687
++ A L+ + S+N ++ + +++KN G +DG
Sbjct: 753 SYSKAK--------LNKKSMSVNDSVF-----------------LSLNLKNTGLRDGDEV 787
Query: 688 LLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEH 747
+ V+ P K L +++V V AG V I++ + + + G++
Sbjct: 788 VQVYIRNLQDPEGPSKSLRGYKRVSVKAGQTVPVKIDLPASSFEFFNPVTEKMEVRPGKY 847
Query: 748 NIHIGGTKHSVSLHAATLGV 767
I GGT + L L +
Sbjct: 848 EILYGGTSDNKQLKKLGLEI 867
>gi|423281958|ref|ZP_17260843.1| hypothetical protein HMPREF1204_00381 [Bacteroides fragilis HMW
615]
gi|404582445|gb|EKA87139.1| hypothetical protein HMPREF1204_00381 [Bacteroides fragilis HMW
615]
Length = 805
Score = 275 bits (703), Expect = 7e-71, Method: Compositional matrix adjust.
Identities = 222/689 (32%), Positives = 329/689 (47%), Gaps = 87/689 (12%)
Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
G T FP I AS++N L +GRV++ EA A G G + P +++ RDPRW R
Sbjct: 159 GTTVFPTSIGQASTWNPELIRQMGRVIAIEASA--QGAHIG---YGPVLDLARDPRWSRV 213
Query: 169 QETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
+ET GEDP L+G + VRG QG D V A+ KHF +Y W A +
Sbjct: 214 EETYGEDPYLNGVMGTALVRGFQGETLNDGKSVIATLKHFASY---GWTEGGHNGGTAHI 270
Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
++++E+ PFR V G + SVM SYN+++G P +L ++ W+ G++VS
Sbjct: 271 GERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVS 329
Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE---SAVQRGLLSEIDIN 344
D +VG + EAA A+ AG+D D G ++ E +AV+RG ++ I+
Sbjct: 330 DLYAVGGLRE-HGVAGNDYEAAIKAVNAGVDSDLGT--NVYAEQLVAAVKRGDVAVATID 386
Query: 345 NALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSL 404
A+ L+++ ++G+FD + + V + +H LA E ARQ IVLLKN+ L
Sbjct: 387 KAVRRILSLKFQMGLFDDPFVDEKQ---AAQLVASSEHTGLAREVARQSIVLLKNKDKLL 443
Query: 405 PLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYT--TPLQGI----GRYARTIHQQGCK 458
PL RT+AVIGPN+D M+G+Y T T L GI + R ++ +GC
Sbjct: 444 PLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCA 502
Query: 459 DVACADDQLFGAAIDASRQADATILVMG----LDQSIE-------------------AEA 495
V + F AI+ +R ADA ++VMG D S E E
Sbjct: 503 -VRDSSRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMESGEG 561
Query: 496 LDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQA 555
DRA L L GRQ EL+ ++S K P +LVL+ G P+ + A + AI+ A YPG
Sbjct: 562 YDRATLHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAE--AIVDAWYPGMQ 618
Query: 556 GGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVV 615
GG A+AD+LFG NP G+L ++ P+ + LP+ R RY G
Sbjct: 619 GGNAVADVLFGDYNPAGRLTLS-VPRS-VGQLPVYYNTRRKGNRSRYIEE-----PGTPR 671
Query: 616 YPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVD 675
YPFG+G+SYT F +T ++VT + + V V
Sbjct: 672 YPFGYGLSYTTFSYT------------------------DMKVQVTEG-SDDCRVDVTVT 706
Query: 676 VKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVV 734
++N G+ DG L F + P KQL AF ++H+ AG + V + K L++
Sbjct: 707 IQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTLD-KKSLALY 765
Query: 735 DRSGTRRIPLGEHNIHIGGTKHSVSLHAA 763
+ G + G I +GG+ ++ A
Sbjct: 766 MQEGEWVVEPGLFTIMVGGSSEDIACRQA 794
>gi|160901716|ref|YP_001567297.1| glycoside hydrolase family 3 protein [Petrotoga mobilis SJ95]
gi|160359360|gb|ABX30974.1| glycoside hydrolase family 3 domain protein [Petrotoga mobilis
SJ95]
Length = 777
Score = 275 bits (703), Expect = 7e-71, Method: Compositional matrix adjust.
Identities = 202/697 (28%), Positives = 354/697 (50%), Gaps = 102/697 (14%)
Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
GAT FPQ+I AS++ L E + + ++ +A+ G GL SP V++ RDPRWGR
Sbjct: 110 GATIFPQMIGAASTWEPPLIERMTTSIRNQMKAL--GIHQGL---SPVVDVTRDPRWGRT 164
Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFN---A 225
+ET GEDP L K +YV+GLQ SD + + A+ KHF Y GV N A
Sbjct: 165 EETFGEDPYLIAKMGVAYVKGLQ-SDDLKNGIVATLKHFVGY------GVSEGGMNWAPA 217
Query: 226 KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYI 285
+ ++++++TF PF + EGKV SVM +Y++++G+P A +L+R +R EW +G +
Sbjct: 218 HIPERELKETFLFPFEAAIKEGKVKSVMNAYHEIDGIPCGASETLLRRILREEWGFDGIV 277
Query: 286 VSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE---SAVQRGLLSEID 342
VSD ++ + EEAA A++AG+D++ P + E +A++ G SE
Sbjct: 278 VSDYFAINSLMEYHKIALNKEEAAIKALKAGIDVEL-PSFDCYKEPLKNAIENGEFSEAF 336
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLG--PKDVCTPDHQELALEAARQGIVLLKNQ 400
I+ ++ N L ++ +G+F+ PY L P ++ TP+ ++LA E A++ IVLLKN
Sbjct: 337 IDKSVRNILRLKFEMGLFE-----NPYVDLEKVPDNLDTPEDRKLAYEIAKKSIVLLKND 391
Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGI---------ACGYTTPLQGIGRYA-- 449
G + + + VAVIGPN++ + G+Y + A G T+ ++GI
Sbjct: 392 GIVPLKKNSKIKKVAVIGPNANSARNLTGDYTYLTHLETLKQGAFG-TSAMEGITFSESE 450
Query: 450 ---RTIHQ----------------QGCKDVACADDQLFGAAIDASRQADATILVMGLDQS 490
+TI++ +GC ++ + ++ A++ + +D +LV+G D+S
Sbjct: 451 LPIKTIYESLKEKLEKLNVETSYAKGC-EINDDNKEMIKEAVELAENSDVALLVLG-DKS 508
Query: 491 ------IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIA 544
E+ D + L+LPG Q +L+ V + + P I+VL++G P + + + ++
Sbjct: 509 GLTLDCTTGESRDSSTLILPGVQLDLLKSV-INTGTPVIVVLVNGRPYSLDWVSKN--VS 565
Query: 545 AIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPG 604
AI A PG+ GG A+ADI+ G +P GKLP++ +P+ ++ +P+ +PS +
Sbjct: 566 AIFEAWLPGEEGGNALADIILGDESPSGKLPIS-FPR-HVGQIPVY-YNHKPSGGRSQWW 622
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
Y +YPFGHG+SYT F +G++ + + +K++
Sbjct: 623 GDYTDSPAKPLYPFGHGLSYTQF-----------------EYGNLQIENNDRIVKIS--- 662
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGI 723
+DVKN+G + G + L + A P K+L F++V + ++R+
Sbjct: 663 ---------MDVKNIGEETGDEIVQLYMNDEVASVTRPVKELKGFQRVTLKPSEKKRIIF 713
Query: 724 NIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
N+ + + L++ + + G + +G + + L
Sbjct: 714 NLPI-ETLALYNEKMEFLVEKGYFKVMVGSSSEDIRL 749
>gi|332185848|ref|ZP_08387595.1| glycosyl hydrolase family 3 N terminal domain protein [Sphingomonas
sp. S17]
gi|332014206|gb|EGI56264.1| glycosyl hydrolase family 3 N terminal domain protein [Sphingomonas
sp. S17]
Length = 838
Score = 275 bits (703), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 173/452 (38%), Positives = 249/452 (55%), Gaps = 37/452 (8%)
Query: 59 RLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVIT 118
+L++++K L++ A A P L + Y+WWSE LHG++ GP AT FPQ +
Sbjct: 24 KLTVEQKAAQLVNEAPAEPALDLPAYDWWSEGLHGIARNGP----------ATVFPQAMG 73
Query: 119 TASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYWSPNVNIFRDPRWGRGQE 170
A++++ L E +G VV+ EARA +N G GLT WSPN+NIFRDPRWGRGQE
Sbjct: 74 MAATWDPALIEKVGDVVAVEARAKFNTKPINADRGLFEGLTIWSPNINIFRDPRWGRGQE 133
Query: 171 TPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--RFHFNAKVS 228
T GEDP L+ + ++VRGLQG D KV A+ KH + +G + R F+ VS
Sbjct: 134 TYGEDPFLTSRIGIAFVRGLQGPDLKHPKVIATVKHLAVH-----SGPEGGRDSFDVMVS 188
Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
+D+E T+ FR V EGK S+MC+YN ++G P CA+P ++ +R +W G VSD
Sbjct: 189 PRDLEATYLPAFRATVTEGKALSLMCAYNAIHGTPVCANPMLMTERLRTDWGFKGLTVSD 248
Query: 289 CDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALV 348
CD+VG + H AAA A+RAG DL+CG E A++RGL++E +I+ AL
Sbjct: 249 CDAVGNIWMFHHAQPDAPAAAAAALRAGTDLNCGNTYRALPE-ALKRGLITEGEIDTALA 307
Query: 349 NTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSH 408
L V+ L + P+ + P + TP ++ LALE AR+GI+LL N LPL
Sbjct: 308 RALAVRRMLTV------DSPWNRIKPSQLGTPANRALALETARKGIILLDNPNDLLPL-- 359
Query: 409 IRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI-GRYARTI-HQQGCKDVACADDQ 466
+ + +AV G ++D + GNY G A TPL GI ++ R + + QG A
Sbjct: 360 -KGKKLAVFGADADDLGVLEGNYHGSAIDPVTPLDGIRAKFGRDVRYAQGSVLAEGAFIN 418
Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDR 498
+ A+ A+ + + MG + A DR
Sbjct: 419 VPETALSANGKPGLAVETMGANGQWIAAGQDR 450
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 91/307 (29%), Positives = 137/307 (44%), Gaps = 64/307 (20%)
Query: 465 DQLFGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKV 514
+ L A+ + AD + ++GL +E EAL DR ++LP Q+ L++ +
Sbjct: 550 EALQAEALKVAGTADVLVAIVGLSPDLEGEALGVSVPGFAGGDRTEVVLPEPQRNLLAAL 609
Query: 515 SMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKL 574
K P + V++SG A + D + AA + A YPG GGTA+A+IL G NP G+L
Sbjct: 610 QATGK-PVVAVIVSGS----AVSLGDIKPAATLAAFYPGAEGGTALAEILSGDVNPSGRL 664
Query: 575 PMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVAN 633
P+T Y +LP + M+ RTYR++ G + FGHG+SYT F + +
Sbjct: 665 PVTIY--HSAADLPAFADYGMKE--------RTYRYFTGKPAWGFGHGLSYTKFDYGQPS 714
Query: 634 APTVVAV--PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF 691
VAV P+D V V ++N+G + G +
Sbjct: 715 VTASVAVGQPVD----------------------------VAVQLRNIGQRGGEEVAQAY 746
Query: 692 STPPAGHWAP-------HKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPL 744
PPA P +QLV F +V + G I + +S V R GTRRI
Sbjct: 747 LVPPATDEKPVLTDPILQRQLVGFTRVALKPGQTGTTRFTID-PRMMSQVWRDGTRRILP 805
Query: 745 GEHNIHI 751
G + ++I
Sbjct: 806 GTYKLYI 812
>gi|87200432|ref|YP_497689.1| beta-glucosidase [Novosphingobium aromaticivorans DSM 12444]
gi|87136113|gb|ABD26855.1| Beta-glucosidase [Novosphingobium aromaticivorans DSM 12444]
Length = 849
Score = 274 bits (701), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 166/384 (43%), Positives = 219/384 (57%), Gaps = 26/384 (6%)
Query: 81 IKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEAR 140
+ Y++W+EALHG++ G AT FPQ I A++++A L E IG VVS EAR
Sbjct: 46 LPAYDYWNEALHGLARNGV----------ATVFPQAIGLAATWDAPLLERIGTVVSTEAR 95
Query: 141 AMYNG-------GTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGS 193
A YN GLT WSPN+NIFRDPRWGRGQET GEDPVL+G A +YVRGLQG
Sbjct: 96 AKYNALPGKDRRRYQGLTIWSPNINIFRDPRWGRGQETYGEDPVLTGTLATAYVRGLQGP 155
Query: 194 DGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVM 253
D D +V A+ KH A+ R FN + S D+E T+ FR + EGK SVM
Sbjct: 156 DLDHPRVIATPKHLVAHSGPE---AGRDSFNVQSSAYDMEATYLPAFRRALTEGKALSVM 212
Query: 254 CSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAI 313
CSYN V+GVP C +L + +R +W +G +VSDCD++G Q + T A+A AI
Sbjct: 213 CSYNSVHGVPVCGADWLLNQRVRKDWGFDGLVVSDCDAIGNINHYQRYRQTNAAASAAAI 272
Query: 314 RAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLG 373
AG+DL+CG E+ Q GL + ++ AL T + +LG D ++ P+ +
Sbjct: 273 NAGMDLNCGRTYAALPEALAQ-GLTTREVVDRALARTFAARRKLG--DAFGATSPWATIP 329
Query: 374 PKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG 433
V T +H+ LALEAAR+ +VLLKN G LPL + +AV+GPN+D T+ NY G
Sbjct: 330 ANTVDTAEHRALALEAARKSLVLLKNNG-VLPLR--KGARIAVVGPNADSLDTLEANYHG 386
Query: 434 IACGYTTPLQGIGRYARTIHQQGC 457
A TPL G R + QG
Sbjct: 387 TAAQPVTPLDGFAARYRMSYAQGA 410
Score = 116 bits (291), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 88/288 (30%), Positives = 140/288 (48%), Gaps = 59/288 (20%)
Query: 485 MGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDV 534
+GL ++E EAL DR + LP QQ+L+ + K P ++VL+SG + +
Sbjct: 579 VGLSPAVEGEALQIEVPGFSGGDRTDIALPRAQQDLLETLKATGK-PLVVVLLSGSAVAM 637
Query: 535 AFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMA 593
+ K + A++ A YPGQ+GGTAIAD++ GT NP G+LP+T+Y + +LP +
Sbjct: 638 PWVKEN--ADAVVAAWYPGQSGGTAIADLVDGTLNPSGRLPVTFYAR--TRDLPAFVDYN 693
Query: 594 MRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATI 653
MR RTYR++ G ++ FG G+SYT+F + A+AP +
Sbjct: 694 MRE--------RTYRYFHGTPLWSFGEGLSYTSFAYGKASAPARIK-------------- 731
Query: 654 SGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA-----GHWAP---HKQL 705
+G+ TL V + NVG+++G + PP G + L
Sbjct: 732 AGE------------TLTATVSLANVGARNGEEVAQAYLVPPEHLRTIGEFNDPVLRHSL 779
Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
VA+ + + G R+ + + LS VDR+G R + G + + IGG
Sbjct: 780 VAYRRAALAKGETTRLSFTLD-PRSLSTVDRNGVRAVRPGTYRLFIGG 826
>gi|189464219|ref|ZP_03013004.1| hypothetical protein BACINT_00556 [Bacteroides intestinalis DSM
17393]
gi|189438009|gb|EDV06994.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 865
Score = 274 bits (700), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 154/417 (36%), Positives = 228/417 (54%), Gaps = 21/417 (5%)
Query: 44 QVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKF 103
+S PI RV +LI +++L+EKV L + ++PRL + Y +W+E LHGV+ G
Sbjct: 51 NLSQPISARVENLISKMTLEEKVAQLSNETDSIPRLNLPSYNYWNECLHGVARAGE---- 106
Query: 104 GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDP 163
T FPQ I AS+++ L + + +S EAR Y GLTYWSP +N+ RDP
Sbjct: 107 ------VTVFPQAINLASTWDTLLIKKVASAISTEARLKYLEIGKGLTYWSPTINMARDP 160
Query: 164 RWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHF 223
RWGR +ET GEDP L+ + ++V+GLQG D LK A+ KHF A + +N DRF
Sbjct: 161 RWGRNEETYGEDPYLTSRLGVAFVKGLQGDHPDYLKTVATIKHFVANNQEN----DRFSS 216
Query: 224 NAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNG 283
++++ + + + + + CV E SVM +YN NGV +L +R EW +G
Sbjct: 217 SSQIPTKQLYEYYFPAYEACVKEADAQSVMTAYNAFNGVAPSGSTWLLGDVLRKEWGFDG 276
Query: 284 YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDI 343
++VSDC ++GV ++ EEAAA I +G DL+CG +AV+ GL+SE I
Sbjct: 277 FVVSDCGAIGVMNWQHRVVNSLEEAAALGINSGCDLECGGTYREKLVAAVKMGLVSEQAI 336
Query: 344 NNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
+ AL LT + +LG FD PY H K + +LA EAA + IVLLKN
Sbjct: 337 DKALTRVLTARFKLGEFD-PIELVPYNHYDKKLLAGEKFGKLAYEAAVKSIVLLKNDNDF 395
Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG----IGRYARTIHQQG 456
LP+ + R+VA++GP +D +G Y+G + LQG +G+ + + +G
Sbjct: 396 LPVDKKKIRSVAIVGPFAD--NNYLGGYSGKPVHNVSLLQGVKDLVGKKVKVSYMEG 450
Score = 107 bits (266), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 84/286 (29%), Positives = 135/286 (47%), Gaps = 45/286 (15%)
Query: 478 ADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFA 537
AD ++ +G D+ + E D + LP Q+ L+ ++ + T L+L +G P+ +A
Sbjct: 609 ADLVLVALGNDEKLARENRDLPSIYLPMTQELLLKEIYKVNP-RTALILHTGNPLTSKWA 667
Query: 538 KNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPS 597
+ + AI+ A YPGQ GG A+A ILFG+ NP GKLPMT Y E E
Sbjct: 668 AEN--VPAILQAWYPGQEGGKALAGILFGSENPSGKLPMTIYESE--------EQLPDIL 717
Query: 598 QSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKA 657
+ GRTY++ +Y FGHG+SY+NF +T + VV DG
Sbjct: 718 DYDIWKGRTYQYLSSKPLYGFGHGLSYSNFEYTHLQSDDVVRP--DG------------- 762
Query: 658 IKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF----STPPAGHWAPHKQLVAFEKVHV 713
TL +++KN+ G + V+ +TP + P K+LVAF +V +
Sbjct: 763 -----------TLQCSIEIKNISDVAGEEVVQVYISRENTPV--YTFPLKKLVAFARVDL 809
Query: 714 PAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVS 759
G + V I + LS+ + +P G++++ +G + +S
Sbjct: 810 KPGESKTVTFTI-APRQLSIWQEGIWKMLP-GKYSLFVGSGQEGLS 853
>gi|423223731|ref|ZP_17210200.1| hypothetical protein HMPREF1062_02386 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392638106|gb|EIY31959.1| hypothetical protein HMPREF1062_02386 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 854
Score = 274 bits (700), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 172/463 (37%), Positives = 259/463 (55%), Gaps = 55/463 (11%)
Query: 1 MASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
M + F GL+ L A + A+E + KD P+ +R+ DL+ RL
Sbjct: 1 MKTLKKIVFTGLLALIACAGVAQAQELY----KDEKA----------PMHERIMDLLSRL 46
Query: 61 SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
+++EK+ LL + + + RL I Y +EALHGV V PG +F T FPQ I A
Sbjct: 47 TVEEKISLLRATSPGISRLDIPKYYHGNEALHGV--VRPG-RF-------TVFPQAIGLA 96
Query: 121 SSFNATLWEAIGRVVSDEARAMYNGGTAG----------LTYWSPNVNIFRDPRWGRGQE 170
+++N L + V+SDEARA +N G LT+WSP VN+ RDPRWGR E
Sbjct: 97 ATWNPELQLQVATVISDEARARWNELDQGREQKSQFSDLLTFWSPTVNMARDPRWGRTPE 156
Query: 171 TPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQ 230
T GEDP LSG ++V+GLQG D LK+ ++ KHF A + ++ +RF N ++S++
Sbjct: 157 TYGEDPYLSGIMGTAFVKGLQGDDDRYLKIVSTPKHFAANNEEH----NRFVCNPQISEK 212
Query: 231 DIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCD 290
+ + + F CV +GK AS+M +YN +N VP + +L + +R +W GY+VSDC
Sbjct: 213 QLREYYLPAFEACVKDGKSASIMSAYNALNDVPCTLNAWLLTKVLRKDWGFKGYVVSDCG 272
Query: 291 SVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG------PFLGLHTESAVQRGLLSEIDIN 344
+ + + T E AAA +I+AGLDL+CG P L SA ++ ++++ DI+
Sbjct: 273 GPSLLVNAHKYVKTKEAAAALSIKAGLDLECGDDVYDQPLL-----SAYRQYMVTDADID 327
Query: 345 NALVNTLTVQMRLGMFD-GEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
+A L +M LG+FD GE PY + P + + +HQE+AL AAR+ IVLLKNQ
Sbjct: 328 SAAYRVLRARMELGLFDSGE--QNPYTKISPAVIGSAEHQEVALNAARECIVLLKNQKKM 385
Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGI-ACGYTTPLQGI 445
LPL+ + +++AV+G N+ + G+Y+G+ + LQGI
Sbjct: 386 LPLNAKKVKSIAVVGINA--GSSEFGDYSGLPVIAPISVLQGI 426
Score = 131 bits (330), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 88/297 (29%), Positives = 143/297 (48%), Gaps = 43/297 (14%)
Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
L+G A A R+ + + V+G+++SIE E DR + LP QQE + ++ + P I+V+
Sbjct: 591 LYGEAGKAVRECETVVAVLGINKSIEREGQDRYDIQLPADQQEFLQEIYKVN--PNIVVV 648
Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
+ G +A D I AI+ A YPG++GG A+A++LFG NPGG+LP+T+Y +
Sbjct: 649 LVAGS-SLAINWMDEHIPAIVNAWYPGESGGKAVAEVLFGDYNPGGRLPLTYY--RSLDE 705
Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
LP + GRTY+++KG V+YPFG+G+SYT F ++ DG
Sbjct: 706 LPPFD------DYDITKGRTYKYFKGDVLYPFGYGLSYTTFKYSNLQVA-------DGEE 752
Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQL 705
+ V +KN G G V+ P P K+L
Sbjct: 753 ----------------------EINVSFQLKNSGKYAGDEVAQVYVKLPERDEVMPIKEL 790
Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRI-PLGEHNIHIGGTKHSVSLH 761
FE+V + +G ++V + + L D + + + P G++ I +G + + L
Sbjct: 791 KGFERVTLKSGENKKVTLKLR-KDLLRYWDEAKDKFVCPSGDYTIMVGASSADIRLQ 846
>gi|383117091|ref|ZP_09937838.1| hypothetical protein BSHG_0805 [Bacteroides sp. 3_2_5]
gi|382973702|gb|EES87886.2| hypothetical protein BSHG_0805 [Bacteroides sp. 3_2_5]
Length = 805
Score = 274 bits (700), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 221/689 (32%), Positives = 328/689 (47%), Gaps = 87/689 (12%)
Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
G T FP I AS++N L +GRV++ EA A G G + P +++ RDPRW R
Sbjct: 159 GTTVFPTSIGQASTWNPELIRQMGRVIAIEASA--QGAHIG---YGPVLDLARDPRWSRV 213
Query: 169 QETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
+ET GEDP L+G + VRG QG D V A+ KHF +Y W A +
Sbjct: 214 EETYGEDPYLNGVMGTALVRGFQGETLNDGKSVIATLKHFASY---GWTEGGHNGGTAHI 270
Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
++++E+ PFR V G + SVM SYN+++G P +L ++ W+ G++VS
Sbjct: 271 GERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVS 329
Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE---SAVQRGLLSEIDIN 344
D +VG + EAA A+ AG+D D G ++ E +AV+RG ++ I+
Sbjct: 330 DLYAVGGLRE-HGVAGNDYEAAIKAVNAGVDSDLGT--NVYAEQLVAAVKRGDVAVATID 386
Query: 345 NALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSL 404
A+ L+++ ++G+FD + + V + +H LA E ARQ IVLLKN+ L
Sbjct: 387 KAVRRILSLKFQMGLFDDPFVDEKQ---AVQLVASSEHTGLAREVARQSIVLLKNKDKLL 443
Query: 405 PLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYT--TPLQGI----GRYARTIHQQGCK 458
PL RT+AVIGPN+D M+G+Y T T L GI + R ++ +GC
Sbjct: 444 PLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCA 502
Query: 459 DVACADDQLFGAAIDASRQADATILVMG----LDQSIE-------------------AEA 495
V + F AI+ +R AD ++VMG D S E E
Sbjct: 503 -VRDSSRTGFKDAIETARNADTVVMVMGGSSARDFSSEYEETGAAKVTINQISDMESGEG 561
Query: 496 LDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQA 555
DRA L L GRQ EL+ ++S K P +LVL+ G P+ + A + AI+ A YPG
Sbjct: 562 YDRATLHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAE--AIVDAWYPGMQ 618
Query: 556 GGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVV 615
GG A+AD+LFG NP G+L ++ P+ + LP+ R RY G
Sbjct: 619 GGNAVADVLFGDYNPAGRLTLS-VPRS-VGQLPVYYNTRRKGNRSRYVEE-----PGTPR 671
Query: 616 YPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVD 675
YPFG+G+SYT F +T ++VT + + V V
Sbjct: 672 YPFGYGLSYTTFSYT------------------------DMKVQVTEG-SDDCWVDVTVT 706
Query: 676 VKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVV 734
++N G+ DG L F + P KQL AF ++H+ AG + V + K L++
Sbjct: 707 IQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTLD-KKSLALY 765
Query: 735 DRSGTRRIPLGEHNIHIGGTKHSVSLHAA 763
+ G + G I +GG+ ++ A
Sbjct: 766 MQEGEWVVEPGRFTIMVGGSSEDITCRQA 794
>gi|423221630|ref|ZP_17208100.1| hypothetical protein HMPREF1062_00286 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392645869|gb|EIY39591.1| hypothetical protein HMPREF1062_00286 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 864
Score = 274 bits (700), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 163/424 (38%), Positives = 238/424 (56%), Gaps = 32/424 (7%)
Query: 40 LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGP 99
P+ SL +R +DL+ RL+L+EK L+++G+ A+PRL IK Y WW+EALHG++ G
Sbjct: 27 FPYQDTSLTAEERADDLLKRLTLEEKASLMMNGSPAIPRLSIKAYGWWNEALHGLARTGL 86
Query: 100 GTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA----MYNGGT----AGLT 151
AT FPQ I +SF+ +L + VSDEARA + + G LT
Sbjct: 87 ----------ATVFPQAIGMGASFDDSLLYEVFTAVSDEARAKSRRLDSKGNLTRYQALT 136
Query: 152 YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL-KVAASCKHFTAY 210
W+PNVNIFRDPRWGRGQET GEDP L+ + + V GLQG D R K+ A KH+ +
Sbjct: 137 VWTPNVNIFRDPRWGRGQETYGEDPYLTSRLGVAVVNGLQGPDTARYNKLHACAKHYAVH 196
Query: 211 DLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
WN R FNA+ +S +D+ +T+ F+ V E KV VMC+YN+ G P C
Sbjct: 197 SGPEWN---RHSFNAENISPRDLWETYLPAFKTLVQEAKVKEVMCAYNRFEGEPCCGSNR 253
Query: 270 ILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCGPFLGL 327
+L + +R EW +G +VSDC +V ++ + + P A+ADA+ G D++CG
Sbjct: 254 LLTQILRDEWGFDGVVVSDCGAVSDFWQKRKHETHPDAASASADAVLNGTDVECGNSYK- 312
Query: 328 HTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELAL 387
AV+ GL++E I+ ++ L + LG D + + V +P H++LAL
Sbjct: 313 SLPDAVKAGLITENQIDISVKRLLKARFELGEMD----ENVWTGISSDVVDSPKHRQLAL 368
Query: 388 EAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGR 447
+ AR+ + LL+N LPLS + +A+IGPN++ +V GNY G+ T L+G+ R
Sbjct: 369 QMARETMTLLQNNNNILPLS--KQAKIALIGPNANDSVMQWGNYNGLPSHTITLLEGMQR 426
Query: 448 YART 451
Y T
Sbjct: 427 YLPT 430
Score = 117 bits (294), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 88/299 (29%), Positives = 133/299 (44%), Gaps = 47/299 (15%)
Query: 472 IDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGP 521
++ + D I G+ ++E E + DR + LP Q+ +V + A K
Sbjct: 595 LERIKDVDVVIFAGGISPALEGEEMPVDAAGFRGGDRTEIELPAVQRRVVEALKTAGK-R 653
Query: 522 TILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQ 581
+ V SG I A AI+ A YPGQAGG A+A++LFG NP GKLP+T+Y
Sbjct: 654 IVFVNFSGAAI--ALEPESLNCEAILQAWYPGQAGGQAVAEVLFGDYNPAGKLPLTFYRN 711
Query: 582 EYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVP 641
+ +P E GRTYR+ K ++PFGHG+SYT F
Sbjct: 712 --LAQIPDFE-------DYNMTGRTYRYMKETPLFPFGHGLSYTTF-------------- 748
Query: 642 LDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAP 701
++G + K+ + L + + V N GS+DG + V+ P
Sbjct: 749 ---KYGKLKMNDD----KIAAGQ----NLNLVIPVTNTGSRDGDEVVQVYLKKMDDTEGP 797
Query: 702 HKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
K L AF++V +PAG V ++ + ++S T R+ G + + IGGT SL
Sbjct: 798 VKTLRAFKRVRIPAGKTVEVKFSLDDTQLEWWDEQSNTMRVCPGNYTVMIGGTSCEKSL 856
>gi|336408356|ref|ZP_08588849.1| hypothetical protein HMPREF1018_00864 [Bacteroides sp. 2_1_56FAA]
gi|335937834|gb|EGM99730.1| hypothetical protein HMPREF1018_00864 [Bacteroides sp. 2_1_56FAA]
Length = 805
Score = 273 bits (699), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 221/689 (32%), Positives = 328/689 (47%), Gaps = 87/689 (12%)
Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
G T FP I AS++N L +GRV++ EA A G G + P +++ RDPRW R
Sbjct: 159 GTTVFPTSIGQASTWNPELIRQMGRVIAIEASA--QGAHIG---YGPVLDLARDPRWSRV 213
Query: 169 QETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
+ET GEDP L+G + VRG QG D V A+ KHF +Y W A +
Sbjct: 214 EETYGEDPYLNGVMGTALVRGFQGETLNDGKSVIATLKHFASY---GWTEGGHNGGTAHI 270
Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
++++E+ PFR V G + SVM SYN+++G P +L ++ W+ G++VS
Sbjct: 271 GERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVS 329
Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE---SAVQRGLLSEIDIN 344
D +VG + EAA A+ AG+D D G ++ E +AV+RG ++ I+
Sbjct: 330 DLYAVGGLRE-HGVAGNDYEAAIKAVNAGVDSDLGT--NVYAEQLVAAVKRGDVAVATID 386
Query: 345 NALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSL 404
A+ L+++ ++G+FD + + V + +H LA E ARQ IVLLKN+ L
Sbjct: 387 KAVRRILSLKFQMGLFDDPFVDEKQ---AVQLVASSEHTGLAREVARQSIVLLKNKDKLL 443
Query: 405 PLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYT--TPLQGI----GRYARTIHQQGCK 458
PL RT+AVIGPN+D M+G+Y T T L GI + R ++ +GC
Sbjct: 444 PLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCA 502
Query: 459 DVACADDQLFGAAIDASRQADATILVMG----LDQSIE-------------------AEA 495
V + F AI+ +R ADA ++VMG D S E E
Sbjct: 503 -VRDSSRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMESGEG 561
Query: 496 LDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQA 555
DRA L L GRQ EL+ ++S K P +LVL+ G P+ + A + AI+ A YPG
Sbjct: 562 YDRATLHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAE--AIVDAWYPGMQ 618
Query: 556 GGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVV 615
GG A+AD+LFG NP G+L ++ P+ + LP+ R RY G
Sbjct: 619 GGNAVADVLFGDYNPAGRLTLS-VPRS-VGQLPVYYNTRRKGNRSRYIEE-----PGTPR 671
Query: 616 YPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVD 675
YPFG+G+SYT F +T ++VT + + V V
Sbjct: 672 YPFGYGLSYTTFSYT------------------------DMKVQVTEG-SDDCRVDVTVT 706
Query: 676 VKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVV 734
++N G+ DG L F + P KQL AF ++H+ A + V + K L++
Sbjct: 707 IQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAAESREVTFTLD-KKSLALY 765
Query: 735 DRSGTRRIPLGEHNIHIGGTKHSVSLHAA 763
+ G + G I +GG+ ++ A
Sbjct: 766 MQEGEWVVEPGRFTIMVGGSSEDIACRQA 794
>gi|224538282|ref|ZP_03678821.1| hypothetical protein BACCELL_03173 [Bacteroides cellulosilyticus
DSM 14838]
gi|224520107|gb|EEF89212.1| hypothetical protein BACCELL_03173 [Bacteroides cellulosilyticus
DSM 14838]
Length = 864
Score = 273 bits (699), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 163/424 (38%), Positives = 238/424 (56%), Gaps = 32/424 (7%)
Query: 40 LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGP 99
P+ SL +R +DL+ RL+L+EK L+++G+ A+PRL IK Y WW+EALHG++ G
Sbjct: 27 FPYQDTSLTAEERADDLLKRLTLEEKASLMMNGSPAIPRLSIKAYGWWNEALHGLARTGL 86
Query: 100 GTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA----MYNGGT----AGLT 151
AT FPQ I +SF+ +L + VSDEARA + + G LT
Sbjct: 87 ----------ATVFPQAIGMGASFDDSLLYEVFTAVSDEARAKSRRLDSKGNLTRYQALT 136
Query: 152 YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL-KVAASCKHFTAY 210
W+PNVNIFRDPRWGRGQET GEDP L+ + + V GLQG D R K+ A KH+ +
Sbjct: 137 VWTPNVNIFRDPRWGRGQETYGEDPYLTSRLGVAVVNGLQGPDTARYNKLHACAKHYAVH 196
Query: 211 DLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
WN R FNA+ +S +D+ +T+ F+ V E KV VMC+YN+ G P C
Sbjct: 197 SGPEWN---RHSFNAENISPRDLWETYLPAFKTLVQEAKVKEVMCAYNRFEGEPCCGSNR 253
Query: 270 ILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCGPFLGL 327
+L + +R EW +G +VSDC +V ++ + + P A+ADA+ G D++CG
Sbjct: 254 LLTQILRDEWGFDGVVVSDCGAVSDFWQKRKHETHPDAASASADAVLNGTDVECGNSYK- 312
Query: 328 HTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELAL 387
AV+ GL++E I+ ++ L + LG D + + V +P H++LAL
Sbjct: 313 SLPDAVKAGLITENQIDISVKRLLKARFELGEMD----ENVWTGISSDVVDSPKHRQLAL 368
Query: 388 EAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGR 447
+ AR+ + LL+N LPLS + +A+IGPN++ +V GNY G+ T L+G+ R
Sbjct: 369 QMARETMTLLQNNNNILPLS--KQAKIALIGPNANDSVMQWGNYNGLPSHTITLLEGMQR 426
Query: 448 YART 451
Y T
Sbjct: 427 YLPT 430
Score = 118 bits (296), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 88/299 (29%), Positives = 133/299 (44%), Gaps = 47/299 (15%)
Query: 472 IDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGP 521
++ + D I G+ ++E E + DR + LP Q+ +V + A K
Sbjct: 595 LERIKDVDVVIFAGGISPALEGEEMPVDAAGFRGGDRTEIELPAVQRRVVEALKTAGK-R 653
Query: 522 TILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQ 581
+ V SG I A AI+ A YPGQAGG A+A++LFG NP GKLP+T+Y
Sbjct: 654 IVFVNFSGAAI--ALEPESQNCEAILQAWYPGQAGGQAVAEVLFGDYNPAGKLPLTFYRN 711
Query: 582 EYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVP 641
+ +P E GRTYR+ K ++PFGHG+SYT F
Sbjct: 712 --LAQIPDFE-------DYNMTGRTYRYMKETPLFPFGHGLSYTTF-------------- 748
Query: 642 LDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAP 701
++G + K+ + L + + V N GS+DG + V+ P
Sbjct: 749 ---KYGKLKMNDD----KIAAGQ----NLNLAIPVTNTGSRDGDEVVQVYLKKMDDTEGP 797
Query: 702 HKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
K L AF++V +PAG V ++ + ++S T R+ G + + IGGT SL
Sbjct: 798 VKTLRAFKRVRIPAGKTVEVKFSLDDTQLEWWDEQSNTMRVCPGNYTVMIGGTSCEKSL 856
>gi|294675412|ref|YP_003576028.1| family 3 glycosyl hydrolase [Prevotella ruminicola 23]
gi|294472176|gb|ADE81565.1| glycosyl hydrolase, family 3 [Prevotella ruminicola 23]
Length = 875
Score = 273 bits (699), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 163/437 (37%), Positives = 239/437 (54%), Gaps = 39/437 (8%)
Query: 40 LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGP 99
LP+ +L QR +DL+ RL+L EKV L++ + A+PRLGI ++WW+EALHG+ G
Sbjct: 23 LPYQNANLSAAQRADDLLSRLTLDEKVSLMMDTSPAIPRLGIPQFQWWNEALHGIGRNG- 81
Query: 100 GTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLT 151
AT FP + A+S++ L + VSDEAR L+
Sbjct: 82 ---------FATVFPITMAMAASWDDALLHQVFTAVSDEARVKAQQAKCTGDIKRYQSLS 132
Query: 152 YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQ--GSDGDRL------KVAAS 203
+W+PN+NIFRDPRWGRGQET GEDP L+ K + VRGLQ G +G+ L K+ A
Sbjct: 133 FWTPNINIFRDPRWGRGQETYGEDPYLTAKMGLAVVRGLQGVGYNGEDLGVSKYRKLLAC 192
Query: 204 CKHFTAYDLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGV 262
KHF + WN R FN + + ++D+ +T+ F+ V EGKVA VMC+Y +++G
Sbjct: 193 AKHFAVHSGPEWN---RHEFNIENLPERDLWETYLPAFKALVQEGKVAEVMCAYQRIDGQ 249
Query: 263 PTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP-EEAAADAIRAGLDLDC 321
CA ++ +R EW +G I SDC ++ + H S EA+A A+ AG D++C
Sbjct: 250 ACCAQTRYEQQILRDEWGFDGLITSDCGAIRDFLPRWHNVSKDGAEASAKAVLAGTDVEC 309
Query: 322 GPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD 381
G H AV+RG + E DI+ +L L + LG D + + + V +
Sbjct: 310 GSEYK-HLPEAVRRGDVKEADIDRSLRRLLIARFELGDMDSD-DLNAWTKIPETVVASQA 367
Query: 382 HQELALEAARQGIVLLKNQGPSLPLSHIRH------RTVAVIGPNSDVTVTMIGNYAGIA 435
H++LAL+ A + IVLL+N+ LPL + + + + V+GPN++ +V M GNYAG
Sbjct: 368 HKDLALKMALKSIVLLQNKIKVLPLGNPLNAGAGSDKDIVVMGPNANDSVMMWGNYAGYP 427
Query: 436 CGYTTPLQGIGRYARTI 452
T L GI R A+T+
Sbjct: 428 THTVTALDGITRMAKTL 444
Score = 89.0 bits (219), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 80/297 (26%), Positives = 129/297 (43%), Gaps = 57/297 (19%)
Query: 482 ILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGP 531
I V G+ ++E E + DR + LP Q++L++ + A K I V SG
Sbjct: 624 IFVGGISPNLEGEEMRVNEPGFKGGDRTSIELPQAQRDLLAVLHKAGK-KVIFVNCSGSA 682
Query: 532 IDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLP-MT 590
+ A A AI+ Y G+ GG A+A LFG P GKLP+T+Y + LP
Sbjct: 683 M--ALAPELETCDAILQWWYGGEQGGAALATTLFGMVAPSGKLPVTFY--KSTDELPDFL 738
Query: 591 EMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSIN 650
+ M+ RTYR+Y+G ++PFG G+ YT F +I+
Sbjct: 739 DYTMK--------NRTYRYYEGEPLFPFGFGLGYTTF--------------------NID 770
Query: 651 ATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEK 710
I K N+ VQV VKN+G+ G T+ V+ A P K L A+++
Sbjct: 771 KPIY---------KNNK----VQVRVKNLGTTAGTETVQVYIRHLADKEGPKKSLRAYQQ 817
Query: 711 VHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGV 767
V + A + + I + + ++ T R+ G++ + +G + L + +
Sbjct: 818 VTLNAAEAKTISIELPRKSFEGWDVKTNTMRVVPGKYEVMVGNSSADKDLKKIIVNI 874
>gi|254445290|ref|ZP_05058766.1| Glycosyl hydrolase family 3 C terminal domain protein
[Verrucomicrobiae bacterium DG1235]
gi|198259598|gb|EDY83906.1| Glycosyl hydrolase family 3 C terminal domain protein
[Verrucomicrobiae bacterium DG1235]
Length = 730
Score = 273 bits (699), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 222/714 (31%), Positives = 338/714 (47%), Gaps = 91/714 (12%)
Query: 35 ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
AT PF LP +R++DLI ++L+EKV L+ +PRL +K Y SE HGV
Sbjct: 22 ATDYDYPFQDPDLPNEERIDDLITCMTLEEKVDLM-GFVPGIPRLDVK-YTRISEGYHGV 79
Query: 95 SNVGPGTKFGGDFPGATS-FPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT---AGL 150
+ GP + +G P T+ FPQ A++++ L + + E R +Y +GL
Sbjct: 80 AQGGP-SNWGKRNPTPTTQFPQAYGLAATWDPALISRVSANQATEVRYLYQSPKYQRSGL 138
Query: 151 TYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAY 210
+PN ++ RDPRWGR +E GEDP L+G AA++ GL G LK + KHF A
Sbjct: 139 VVMAPNADLARDPRWGRTEEVYGEDPFLTGTLAAAFASGLAGDHPRYLKATSLLKHFLA- 197
Query: 211 DLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNI 270
N N DRF ++ ++ + + PF M + +G S+M +YN +NG P P +
Sbjct: 198 ---NSNEDDRFFSSSDFDERLWREYYAKPFEMAIRDGGARSMMAAYNAINGTPAHVHP-M 253
Query: 271 LKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE 330
L+ + GEW L+G I +D + + A A I+AG++L FL HT+
Sbjct: 254 LRDIVMGEWGLDGTICTDGGGLAHLVNQHKTYPDLPTATAACIKAGINL----FLDNHTQ 309
Query: 331 S---AVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLG------PKDVCTPD 381
+ AV++ L++E +I++ + + + + LG+ D P PY ++G P ++ P+
Sbjct: 310 AALDAVEQSLVTEAEIDDVIRGRIRLFLDLGLLD-PPELVPYSNIGHEPGLEPWEL--PE 366
Query: 382 HQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTP 441
E R+ IVLLKN+ LPL + +VA++GP ++ T++ Y+G P
Sbjct: 367 THAFVREVTRKSIVLLKNENNILPLDPSKINSVAIVGPLAN--TTLLDWYSGTPPYAIPP 424
Query: 442 LQGIGRYAR-----TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEA--- 493
GI YA + + G VA D A++ + D I+V+G A
Sbjct: 425 RDGIEGYANSGPFPSPAKFGSNWVADMSD----TALEVAASRDVAIVVVGNHPESNAGWG 480
Query: 494 ---------EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAF-AKNDPRI 543
EA+DR ++L Q+E + KV A+ TI+VL+S P + + A+N P I
Sbjct: 481 VVTSPSEGKEAVDRQEIILQPDQEEFIQKVYAANPN-TIVVLVSNFPYAMPWAAENAPAI 539
Query: 544 AAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYP 603
I A Q G A+AD+LFG NPGGK TW P+ PM + +R
Sbjct: 540 VHITHA---SQEQGNALADVLFGDYNPGGKTVQTW-PKSLDQLPPMMDYDIR-------R 588
Query: 604 GRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHA 663
GRTY + + YPFG+G+SYT F + K
Sbjct: 589 GRTYMYSQHEPQYPFGYGLSYTTFELSKL--------------------------KAPKK 622
Query: 664 KCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAG 716
T ++V V N G +DG + ++ P P KQL F++V VPAG
Sbjct: 623 LKADATATIKVRVANTGERDGDEVVQLYVRYPNSKVERPSKQLKGFQRVTVPAG 676
>gi|337749758|ref|YP_004643920.1| glycoside hydrolase family protein [Paenibacillus mucilaginosus
KNP414]
gi|336300947|gb|AEI44050.1| glycoside hydrolase family 3 domain protein [Paenibacillus
mucilaginosus KNP414]
Length = 937
Score = 273 bits (699), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 166/430 (38%), Positives = 244/430 (56%), Gaps = 32/430 (7%)
Query: 36 TTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVS 95
T +PF LP+ +RV DL+ RL+L+EK+ L+ +PRLGI Y+ +E HGV+
Sbjct: 3 TETKMPFQDTRLPLEERVQDLVSRLTLEEKINLMCQYQEEIPRLGIAKYKHGTEGAHGVA 62
Query: 96 NVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY--NGGTAGLTYW 153
+G AT FPQ + A ++N L IG V+ DEAR Y N GLT W
Sbjct: 63 WLGE----------ATVFPQNVGLACTWNPELMREIGSVIGDEARVYYQRNPEINGLTIW 112
Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLD 213
+P V++ RDPRWGR +E GEDP L+G+ + V+G+QG LK AA+ KHF L
Sbjct: 113 APTVDMERDPRWGRTEEAYGEDPYLTGRLSTELVKGMQGDHPFYLKTAATLKHF----LG 168
Query: 214 NWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKR 273
N N +DR +A + +++ + + F EG S+M +YN VNG P +P++
Sbjct: 169 NNNEMDRGECSASIDPRNMREYYLKAFEPAFREGGAQSMMTAYNSVNGTPCNLNPDV-NA 227
Query: 274 TIRGEWRLNGYIVSDC-DSVGVYYDTQHFTSTPEEAAADAIRAGLDL---DCGPFLGLHT 329
++GEW ++G++VSD D +G + ++F S E AA ++R+G+D D G L
Sbjct: 228 IVKGEWGMDGFVVSDAGDVLGTVNEHRYFASYAEAVAA-SVRSGIDSITDDAGITL-RAV 285
Query: 330 ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEA 389
A++ GLL+E D++ A+ N V++RLG FDG P PY H+ +C P+H L+L+A
Sbjct: 286 RDALEGGLLAEEDLDRAVGNAFRVRIRLGEFDG-PEENPYAHVPEAKLCAPEHAALSLQA 344
Query: 390 ARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY- 448
AR+G VLLKN+G LPL+ + ++AVIGP +DV T Y+G TPL+G+
Sbjct: 345 AREGFVLLKNEG-LLPLA--KPASIAVIGPLADVVHTDW--YSGTPPYRITPLEGVAERM 399
Query: 449 --ARTIHQQG 456
IH+ G
Sbjct: 400 APGTVIHRTG 409
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 85/299 (28%), Positives = 143/299 (47%), Gaps = 48/299 (16%)
Query: 470 AAIDASRQADATILVMGLDQSIEA-EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMS 528
AA++A+RQA+ I+ +G I E +DR L L Q+ L+ V +A+ T+ VL+
Sbjct: 546 AAVEAARQAETAIVFLGNSPFINGKECVDRPDLTLAPAQEALLQAV-LAANPRTVAVLVG 604
Query: 529 GGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLP 588
P V +A+ + AI++ + GQ G A AD+LFG +P G+L MTWY +LP
Sbjct: 605 SYPFAVNWAQ--AHVPAILYTSHAGQELGRAAADVLFGDCSPAGRLNMTWYKS--AADLP 660
Query: 589 -MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVH-TVANAPTVVAVPLDGRH 646
+ + + + RTY+++ G V+YPFGHG+SYT F + V AP+ +
Sbjct: 661 DLLDYDIIKGK------RTYQYFDGEVLYPFGHGLSYTEFRYLEVQAAPS--------EN 706
Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPH--KQ 704
G+ N T++ V+N GS+ G + ++ W P K
Sbjct: 707 GAFNVTVT---------------------VQNAGSRAGDEVVQLY-VKAGESWVPRPLKT 744
Query: 705 LVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRR-IPLGEHNIHIGGTKHSVSLHA 762
L F ++H+ G ++V + + L+ D + + + G + GG+ ++ L A
Sbjct: 745 LAGFRRIHLLPGQTEQVQFTVEASQ-LAFWDVTREKYCVEEGTYRFMAGGSSGNLPLAA 802
>gi|189461690|ref|ZP_03010475.1| hypothetical protein BACCOP_02354 [Bacteroides coprocola DSM 17136]
gi|189431577|gb|EDV00562.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
coprocola DSM 17136]
Length = 499
Score = 273 bits (698), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 166/441 (37%), Positives = 248/441 (56%), Gaps = 52/441 (11%)
Query: 9 FLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKL 68
F GL+L+ A + A+E + + DA P+ +R+ DL+ RL+++EKV L
Sbjct: 9 FAGLLLIIAYVIPMQAQELYKNE--DA------------PLHERIMDLLSRLTVEEKVSL 54
Query: 69 LISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLW 128
L + + + RL I Y +EALHGV V PG +F T FPQ I A+++N L
Sbjct: 55 LRATSPGISRLDIPKYYHGNEALHGV--VRPG-RF-------TVFPQAIGLAATWNPELQ 104
Query: 129 EAIGRVVSDEARAMYNGGTAG----------LTYWSPNVNIFRDPRWGRGQETPGEDPVL 178
+ V+SDEARA +N G LT+WSP VN+ RDPRWGR ET GEDP L
Sbjct: 105 YQVATVISDEARARWNELDQGKLQKGQFSDLLTFWSPTVNMARDPRWGRTPETYGEDPYL 164
Query: 179 SGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDV 238
SG ++VRGLQG D LKV ++ KHF A + ++ +RF N ++S++ + + +
Sbjct: 165 SGTMGTAFVRGLQGDDARYLKVVSTPKHFAANNEEH----NRFECNPQISEKQLREYYLP 220
Query: 239 PFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDT 298
F C+ +GK AS+M +YN +N VP + +L + +R +W GY+VSDC + +
Sbjct: 221 AFEACIKDGKAASIMSAYNAINNVPCTLNSWLLTKVLRHDWGFQGYVVSDCGGPSLLVNA 280
Query: 299 QHFTSTPEEAAADAIRAGLDLDCG------PFLGLHTESAVQRGLLSEIDINNALVNTLT 352
+ T E AA +I+AGLDL+CG P L +A ++ ++S+ DI++ + L
Sbjct: 281 HKYVKTKEAAATLSIKAGLDLECGDDVYYEPLL-----NAYKQYMVSDADIDSTAYHVLK 335
Query: 353 VQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHR 412
+MRLG+FD + PY + P + + HQ +ALEAARQ IVLLKN LPL + +
Sbjct: 336 ARMRLGLFDNG-KNNPYTKISPSIIGSKLHQRVALEAARQCIVLLKNHNWVLPLDTKKLK 394
Query: 413 TVAVIGPNSDVTVTMIGNYAG 433
++AV+G N+ G+Y+G
Sbjct: 395 SIAVVGINAG--NCEFGDYSG 413
>gi|373952814|ref|ZP_09612774.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
gi|373889414|gb|EHQ25311.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
Length = 862
Score = 273 bits (698), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 165/462 (35%), Positives = 246/462 (53%), Gaps = 30/462 (6%)
Query: 35 ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
A+ + LP+ +L R DL+ RL+L+EKV L+ + AVPRLGIK + WWSEALHG
Sbjct: 17 ASAQQLPYQNPALSSEARAKDLVTRLTLKEKVGLMKDVSEAVPRLGIKKFNWWSEALHGY 76
Query: 95 SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA------ 148
+N GP T FP+ + A+SF+ + VSDEARA N
Sbjct: 77 ANQGP----------VTVFPEPVGMAASFDDQKLFHVFDAVSDEARAKNNEYRKQVESQR 126
Query: 149 --GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCK 205
L+ W+PNVNIFRDPRWGRGQET GEDP L+ + S V+GLQG +D K+ A K
Sbjct: 127 FHDLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVSVVKGLQGPADAKYRKLLACAK 186
Query: 206 HFTAYDLDNWNGVDRFHFN-AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
H+ + W+ R N V+ +D+ +T+ F+ V + V VMC+Y +++ P
Sbjct: 187 HYAVHSGPEWS---RHEMNVTDVTPRDLWETYLPAFKSLVQDADVREVMCAYQRLDDEPC 243
Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPF 324
C + +L + +R +W +VSDC ++ +Y++ H +S A+A A+ +G D++C +
Sbjct: 244 CGNSRLLGQILREDWGFKYLVVSDCGAITDFYNSHHSSSDATHASAKAVLSGTDVECVGY 303
Query: 325 LGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQE 384
AV RGL+ E DIN ++V +T + LG D + P+ + V + DHQ+
Sbjct: 304 AFDKIPDAVYRGLIKEKDINTSVVRLMTQRFELGEMDKD-ELVPWTKIPLSVVNSEDHQK 362
Query: 385 LALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG 444
LAL+ AR+ + LL+N LPLS + +AVIGPN++ + + GNY G L+G
Sbjct: 363 LALDMARETMTLLQNNNNILPLSKSIGK-LAVIGPNANDSQMLSGNYNGTPLRTINILEG 421
Query: 445 IGRYA---RTIHQQGCK--DVACADDQLFGAAIDASRQADAT 481
I I+ GC D + + +ID + AT
Sbjct: 422 IKTKLGADHVIYDAGCDLTDDKITESLIGNCSIDGKKGIRAT 463
Score = 94.7 bits (234), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 81/311 (26%), Positives = 130/311 (41%), Gaps = 49/311 (15%)
Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
+ I+ + AD + V G+ +E E + DR + LP Q+ + + A
Sbjct: 589 YDGIIEKVKDADIVVFVGGISPKLEGEEMPVQLPGFKGGDRTDIELPAVQRNCIEALRKA 648
Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
K + V SG I A AI+ A Y G++GG A+AD+LFG NP G LP+T
Sbjct: 649 GK-KIVFVNCSGSAI--AMVPETQNCDAILQAWYAGESGGQAVADVLFGDYNPSGHLPVT 705
Query: 578 WYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
+Y + LP ++ +M+ GRTYR+ K ++PFG G+SYT F + A
Sbjct: 706 FYRN--VQQLPDFSDYSMK--------GRTYRYLKSAPLFPFGFGLSYTTF--NIGEAKL 753
Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA 696
+ N G+AI+ ++V V N G DG L V+
Sbjct: 754 -----------TKNNITKGEAIQ------------LRVPVANAGKTDGTELLQVYIRKVD 790
Query: 697 GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKH 756
K L F+++ V AG + V +++ + R+ G + + G +
Sbjct: 791 DPDGASKTLRGFKRIPVSAGKTEMVTLDLPPKTFEFFDPTDAVVRVSPGNYQLLYGESSD 850
Query: 757 SVSLHAATLGV 767
L + L +
Sbjct: 851 PKDLKSIQLTI 861
>gi|393784569|ref|ZP_10372732.1| hypothetical protein HMPREF1071_03600 [Bacteroides salyersiae
CL02T12C01]
gi|392665550|gb|EIY59074.1| hypothetical protein HMPREF1071_03600 [Bacteroides salyersiae
CL02T12C01]
Length = 929
Score = 273 bits (698), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 153/412 (37%), Positives = 229/412 (55%), Gaps = 21/412 (5%)
Query: 38 RTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNV 97
+T PF SL +R +L+ L+L+EK+ + A+PRL IKGY +W+EA+HGV+
Sbjct: 38 QTHPFQDESLSFHERAKNLVSLLTLEEKINQVGHQTLAIPRLNIKGYNYWNEAIHGVARS 97
Query: 98 GPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNV 157
G ATSFP +S+++ L SDEAR N GL YW P +
Sbjct: 98 GL----------ATSFPVSKAMSSTWDLPLIFDCAVATSDEARVYSNTKDKGLIYWCPTI 147
Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
N+ RDPRWGR +E GEDP L+GK A Y++G+QG D K A+ KHF A + +
Sbjct: 148 NMSRDPRWGRDEENYGEDPFLTGKIAVEYIKGMQGDDPKYYKTIATAKHFAANNYEK--- 204
Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
R ++ + +++ + + F M V EG V SVM +YN +NG+P A+ +L +R
Sbjct: 205 -GRHSTSSDMDARNLREYYLPAFEMAVKEGNVRSVMSAYNALNGIPCGANHELLIDILRT 263
Query: 278 EWRLNGYIVSDCDSVGVYYDT--QHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQR 335
EW NG++ SDC +V Y + HF +T EA+A +I G DL+CG + + A+++
Sbjct: 264 EWGFNGFVTSDCGAVDDVYQSNRHHFVNTAAEASAVSIVNGEDLNCGNTFQDYCKEAIEK 323
Query: 336 GLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIV 395
G + E D++ ALV + +G FD S+ P+ + + +H++LA +AA++ IV
Sbjct: 324 GYMQEADLDTALVRVFEARFSVGEFDN-ASNVPWRSISDDVLDCEEHRQLAYKAAQEAIV 382
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGR 447
LLKN LPL + ++VAVIGP + T+T+ G Y+G TTP GI
Sbjct: 383 LLKNDNNILPLD--KTKSVAVIGPFGN-TITL-GGYSGSPTALTTPFGGIAE 430
Score = 114 bits (286), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 90/308 (29%), Positives = 141/308 (45%), Gaps = 39/308 (12%)
Query: 455 QGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKV 514
+GC A+ L A + + +AD I G D ++ E+ DR L LPG QQ+L+ V
Sbjct: 591 KGCAVTGTAETNL-ERAKEIAAKADVVIFAAGTDLTVSDESHDRTNLNLPGDQQKLLEAV 649
Query: 515 SMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKL 574
A+ IL+L + + + +AK + AII A Y GQA G AIAD+L+G NP GKL
Sbjct: 650 YSANPN-VILLLQTCSSVTINWAK--EHVPAIIEAWYGGQAQGKAIADVLYGDYNPSGKL 706
Query: 575 PMTWYPQEYITNLP--MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVA 632
TWY +++LP M +R ++ TY ++ +YPFG+GMSYT F +
Sbjct: 707 TSTWYNA--LSDLPNGMLNYDIRDAK------YTYMYHDKTPLYPFGYGMSYTTFEYQKL 758
Query: 633 NAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFS 692
N IS + L V D+ N G GA + +++
Sbjct: 759 N-------------------ISKSRLAAGEE------LIVSADITNTGKYAGAEIVQLYA 793
Query: 693 TPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
+ P KQLV F +V + G + V + + + +++ T + G +I++
Sbjct: 794 HVNSSIERPLKQLVGFARVELEPGETKTVTMPLKHEQLSYFNEQTNTFDVEEGSVDIYVA 853
Query: 753 GTKHSVSL 760
+ V L
Sbjct: 854 ASSADVRL 861
>gi|430749766|ref|YP_007212674.1| beta-glucosidase-like glycosyl hydrolase [Thermobacillus composti
KWC4]
gi|430733731|gb|AGA57676.1| beta-glucosidase-like glycosyl hydrolase [Thermobacillus composti
KWC4]
Length = 971
Score = 273 bits (698), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 170/410 (41%), Positives = 231/410 (56%), Gaps = 25/410 (6%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
PF LP+ RVNDLIGRL+ +EK+ L+ AVPRLGI Y+ +EA HG++ +GP
Sbjct: 8 PFRDPDLPLETRVNDLIGRLTTEEKINLMAQYQDAVPRLGIAPYKHGTEAAHGIAWLGP- 66
Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY--NGGTAGLTYWSPNVN 158
AT+FPQ I A +++A L +G V+ DEAR Y N GLT W+P V+
Sbjct: 67 ---------ATTFPQPIGLACTWDADLLRRVGSVIGDEARVWYRRNPAVNGLTLWAPTVD 117
Query: 159 IFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGV 218
+ RDPRWGR +E GEDP L GK A++ +RG+QG LK A+ KHF L N N
Sbjct: 118 LERDPRWGRTEEAYGEDPFLVGKLASALIRGIQGDHPFYLKAVATLKHF----LGNNNEA 173
Query: 219 DRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGE 278
R + + +++ + + F EG S+M +YN VNGVP P ++ ++ E
Sbjct: 174 GRGDKSVSIDPRNMREYYLKVFETAFKEGGALSMMTAYNAVNGVPANLLP-MITSVVKQE 232
Query: 279 WRLNGYIVSDC-DSVGVYYDTQHFTSTPEEAAADAIRAGLD--LDCGPFLGLHTESAVQR 335
W +NG++VSD D G D + S P AA +IRAG+D D + A++
Sbjct: 233 WGMNGFVVSDAFDVTGTVRDHGYMASLPAAVAA-SIRAGIDSITDDAKVVKQAIREALEM 291
Query: 336 GLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIV 395
GLL+E D++ AL NT V+ RLG FD + PY +G + P+H+ L+LEAAR+ IV
Sbjct: 292 GLLTENDLDVALRNTFRVRFRLGEFDPD-DRNPYAEIGESHMMRPEHEALSLEAARKSIV 350
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
LLKN G LPLS R R +AVIGP +DV Y+G TPL GI
Sbjct: 351 LLKNDG-VLPLSADRIRKIAVIGPLADVVYR--DWYSGTLPYAVTPLAGI 397
Score = 77.4 bits (189), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 70/234 (29%), Positives = 105/234 (44%), Gaps = 38/234 (16%)
Query: 494 EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAF-AKNDPRIAAIIWAGYP 552
E +DR + LPG Q+ L+ +V +A TI+V++ P + + A++ P AI++ +
Sbjct: 605 ETVDRPDIALPGPQERLIREV-LAVNPNTIVVVIGSYPFALGWTAEHAP---AILYLAHA 660
Query: 553 GQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKG 612
GQ G A+A LFG NP G+L MTWY LP M Q K RTY ++ G
Sbjct: 661 GQELGRAVAGTLFGRFNPAGRLNMTWY--RSADQLP-DFMDYDIIQGK----RTYLYFDG 713
Query: 613 PVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGV 672
+YPFG+G+SY+ F + + TIS AI R+ +G
Sbjct: 714 EPLYPFGYGLSYSRFEYA-------------------DLTISADAI----GYGERVKIGF 750
Query: 673 QVDVKNVGSKDGAHT-LLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINI 725
V NV G L + P +QL F ++ + G +RV +
Sbjct: 751 T--VTNVSDVYGEEVPQLYIRCEDSRVQRPLRQLAGFARIGLAPGESKRVTFEL 802
>gi|227828570|ref|YP_002830350.1| glycoside hydrolase [Sulfolobus islandicus M.14.25]
gi|229585800|ref|YP_002844302.1| glycoside hydrolase family protein [Sulfolobus islandicus M.16.27]
gi|227460366|gb|ACP39052.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
M.14.25]
gi|228020850|gb|ACP56257.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
M.16.27]
Length = 755
Score = 273 bits (697), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 207/691 (29%), Positives = 346/691 (50%), Gaps = 100/691 (14%)
Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQ 169
AT+FPQ I AS+++ L + + +A+ + GT SP +++ RDPRWGR +
Sbjct: 101 ATAFPQAIGLASTWDPDLIREVSSTIRYQAKLI---GTN--QCLSPVLDVCRDPRWGRCE 155
Query: 170 ETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL-DNWNGVDRFHFNAKVS 228
ET GED L +YV+GLQG + ++ A+ KHF A+ + + H V
Sbjct: 156 ETYGEDQYLVASIGLAYVKGLQGEN----ELIATVKHFAAHGFPEGGRNIAPVH----VG 207
Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
+++ + F PF + + GK SVM +Y++++G+P ++ +L + +R EW G +VSD
Sbjct: 208 NRELREVFLFPFEVAIKLGKAMSVMPAYHEIDGIPCHSNAELLTKILRQEWGFEGIVVSD 267
Query: 289 CDSVGVYYDTQHFTSTPEEAAADAIRAGLD-----LDCGPFLGLHTESAVQRGLLSEIDI 343
D++ + +EAA A+ AG+D +DC G AV+ GL+SE I
Sbjct: 268 YDAIRQLEAIHKVSLNKKEAAILALEAGVDTEFPNIDC---FGEPLLEAVKEGLISESII 324
Query: 344 NNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
+ A+ L ++ +LG+F+ ++ + P+ + +ELAL+ AR+ IVLLKN
Sbjct: 325 DRAVERVLRIKEKLGLFNNHYINE---NNVPEKLDNSKSRELALDVARKSIVLLKNDN-I 380
Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNY-------AGIACGYTTPLQGIGRYA----RTI 452
LPL+ T+AVIGPN++ ++G+Y A T L+GI R +
Sbjct: 381 LPLNK-NIGTIAVIGPNANEPRNLLGDYTYTGHLNADGGIEVVTVLEGIMRKVSNNTNVL 439
Query: 453 HQQGCKDVACADDQLFGAAIDASRQADATILVMGLD-------------------QSIEA 493
+ +GC D+A + F AI+ +++ D I VMG Q++
Sbjct: 440 YAKGC-DIAAESKEGFSEAIEIAKKGDIIIAVMGEKSGLPLSWTDVPGKDEFEKYQAVTG 498
Query: 494 EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPG 553
E DR L LPG Q+EL+ ++ K P ILVL++G P+ ++ N+ + AII A +PG
Sbjct: 499 EGNDRTSLRLPGVQEELLKELHKTGK-PIILVLVNGRPLALSSIFNE--VNAIIDAWFPG 555
Query: 554 QAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPG--RTYRFYK 611
+ GG AIAD++FG NP G+LP+++ P+ + P R P R Y K
Sbjct: 556 EEGGNAIADVIFGDYNPSGRLPISF---------PI-DTGQIPIYYNRKPSSLRPYVMMK 605
Query: 612 GPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLG 671
++PFG+G+SYT F ++ L+ +N+ SGK +
Sbjct: 606 SKPLFPFGYGLSYTEFKYS----------NLEVTPKEVNS--SGK-------------IK 640
Query: 672 VQVDVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKY 730
+ ++V+NVG ++G T+ L S +G P K+L F KV++ ++++ ++ + +
Sbjct: 641 ISLEVENVGKREGEETVQLYISKQYSGVSRPIKELKGFAKVYLKPNEKRKITFSLPL-EA 699
Query: 731 LSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
L+ D+ I G++ I IG + + L
Sbjct: 700 LAFYDQYMRLIIDTGDYEILIGKSSEDIVLK 730
>gi|270340289|ref|ZP_06007688.2| periplasmic beta-glucosidase [Prevotella bergensis DSM 17361]
gi|270331970|gb|EFA42756.1| periplasmic beta-glucosidase [Prevotella bergensis DSM 17361]
Length = 867
Score = 272 bits (696), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 168/469 (35%), Positives = 249/469 (53%), Gaps = 37/469 (7%)
Query: 20 SGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRL 79
S L A PF P + LP+ +L +R DL GRL+L+EK L++ + A+PRL
Sbjct: 9 SLLMAVLPFCNMP----AQQLPYQNPALSAHERAVDLCGRLTLEEKASLMLDDSPAIPRL 64
Query: 80 GIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEA 139
GIK ++WWSEALHGV+N+G T FPQ I A+SFN L + SDE
Sbjct: 65 GIKRFQWWSEALHGVANMG----------DVTVFPQPIGMAASFNDNLVYQVFNATSDEM 114
Query: 140 RAMYNGGTA---------GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGL 190
RA +N A GL+ W+PNVNIFRDPRWGRGQET GEDP L+ + + VRGL
Sbjct: 115 RAKWNELQASGKEVTRFHGLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGCAVVRGL 174
Query: 191 QGSDGDRLKVAASC-KHFTAYDLDNWNGVDRFHFN-AKVSKQDIEDTFDVPFRMCVMEGK 248
QG + + + +C KH+ + W R N VS +D+ +T+ F+ V +
Sbjct: 175 QGPETAKYRKLWACAKHYAIHSGPEWA---RHTDNVTDVSPRDLWETYMPAFKSLVQDAN 231
Query: 249 VASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEA 308
V VMC+Y + + P C + +L+R +R EW +VSDC +V ++ + +S A
Sbjct: 232 VREVMCAYQRWDDDPCCGNNRLLQRILRDEWGFKYLVVSDCGAVSDFWTSHKSSSNARNA 291
Query: 309 AADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQ 367
A A+ AG D++CG ++ AV+ G ++E +++ +V L + LG D + S
Sbjct: 292 ATKAVLAGTDVECGYNYVYKSIPEAVKYGAMTEAEMDEHVVRLLEGRFDLGEMD-DNSIV 350
Query: 368 PYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTM 427
P+ + +C+ H++L+L+ + Q + LL+N LPL+ + + +AVIGPN D M
Sbjct: 351 PWSSIPSSALCSKKHRQLSLDMSLQTMTLLQNSNDVLPLNK-KEKKIAVIGPNVDNEPMM 409
Query: 428 IGNYAGIACGYTTPLQGIG---RYARTIHQQGCKDVACADDQLFGAAID 473
GNY G T L GI R + + +GC V +DQ + D
Sbjct: 410 WGNYNGTPRSTVTILDGIKSRLRKNQLVTFRGCDLV---NDQTLNSYYD 455
Score = 113 bits (282), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 88/310 (28%), Positives = 134/310 (43%), Gaps = 52/310 (16%)
Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
+ +AI + + + V G+ +E E + DR + LP Q+E + + A
Sbjct: 594 YTSAIKQLKDCETVVFVGGISPQLEGEEMPINVDGFKGGDRTDIELPKVQREFLKALKQA 653
Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
K + V SG I A AI+ A YPGQ GG A+A +LFG NP GKLP+T
Sbjct: 654 GKS-VVFVCCSGSAI--ALTPETKTCDAILQAWYPGQEGGEAVARVLFGEYNPSGKLPIT 710
Query: 578 WYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
+Y LP + +M+ RTYR Y ++PFG+G+SYT+F
Sbjct: 711 FYKN--TEQLPDFKDYSMK--------NRTYR-YMNDALFPFGYGLSYTDF--------- 750
Query: 637 VVAVPLDGRHGSI-NATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPP 695
SI +AT+SGK + T+ ++V VKN+G +DG + ++ P
Sbjct: 751 -----------SIGDATLSGKTLTPG------ATITMKVPVKNIGKRDGTEVIQIYVKDP 793
Query: 696 AGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTK 755
P K L F++V V AG I + + S T R G + ++ G +
Sbjct: 794 TDSEGPLKSLKGFQRVPVKAGQTAEATITLDSRTFELFDAGSNTVRAKAGNYEVYYGSSS 853
Query: 756 HSVSLHAATL 765
L +
Sbjct: 854 ADKDLKKVNV 863
>gi|146301613|ref|YP_001196204.1| beta-glucosidase [Flavobacterium johnsoniae UW101]
gi|146156031|gb|ABQ06885.1| Candidate beta-xylosidase; Glycoside hydrolase family 3
[Flavobacterium johnsoniae UW101]
Length = 864
Score = 272 bits (695), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 163/435 (37%), Positives = 238/435 (54%), Gaps = 35/435 (8%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
P+ SL +R DLI RL+L+EK L+ + A+PRLGIK + WWSEALHG +N
Sbjct: 23 PYKDPSLSSKKRAEDLISRLTLEEKAALMCDQSDAIPRLGIKKFNWWSEALHGYANND-- 80
Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG---------LT 151
T FP+ I A+SF+ L + VSDE RA YN L+
Sbjct: 81 --------NVTVFPEPIGMAASFDDQLVFRVFDAVSDEGRAKYNQWIQNGNENKRFLSLS 132
Query: 152 YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAY 210
W+PNVNIFRDPRWGRGQET GEDP L+ + S V+GLQG +D K+ A KHF +
Sbjct: 133 VWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVSVVKGLQGPADAKYRKLLACAKHFAVH 192
Query: 211 DLDNWNGVDRFHFNA-KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
W+ R N V +++ +T+ F+ V E V VMC+Y +++ P C++
Sbjct: 193 SGPEWS---RHELNLNNVKPRELYETYLPAFKALVQEADVRQVMCAYQRLDDEPCCSNTR 249
Query: 270 ILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC----GPFL 325
+L+R +R EW +VSDC +V +Y T +S AA+ A+ AG D++C PF
Sbjct: 250 LLQRILRDEWGFQYLVVSDCGAVTDFYTTHKVSSDEVHAASKAVLAGTDVECVWDKYPFK 309
Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
L AV++ L+ E +IN +L+ L + LG D + S P+ + + + +HQ+L
Sbjct: 310 KL--PEAVEKDLIKEEEINKSLLRVLIGRFDLGEMDDD-SIVPWAQIPASVLNSKEHQQL 366
Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
ALE A++ + LL+N+ LPL+ ++ VAVIGPN+D + GNY G T +GI
Sbjct: 367 ALEMAQKSMTLLQNKNNILPLNKNVNK-VAVIGPNADNEPMLWGNYNGTPNKTITIKKGI 425
Query: 446 -GRYA--RTIHQQGC 457
G+ A + ++ + C
Sbjct: 426 EGKLAANKVLYDKSC 440
Score = 101 bits (251), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 80/304 (26%), Positives = 125/304 (41%), Gaps = 48/304 (15%)
Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
F A I + D I GL +E E + DR + LP Q++ + ++ A
Sbjct: 592 FKALIQKLKGIDTVIFAGGLSTLLEGEEMPVSFPGFKGGDRTNIELPAVQRKCLKELKAA 651
Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
K I V SG I A AI+ A YPG++GG A+AD+LFG NP GKLP++
Sbjct: 652 GK-KVIFVNCSGSAI--ALTPETESCDAILQAWYPGESGGQAVADVLFGDYNPAGKLPIS 708
Query: 578 WYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTV 637
+Y + + GRTYR Y V++PFG G+SY+ F N
Sbjct: 709 FYK---------NSDKLGDFEDYSLKGRTYR-YTTDVLFPFGFGLSYSKFDIQAGN---- 754
Query: 638 VAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG 697
+S IK ++ +KN + G + V+
Sbjct: 755 ---------------LSKNKIKPNE------DTQLKFSIKNSSKRSGTEIVQVYVRKLND 793
Query: 698 HWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHS 757
P K L AF+++ + AG +Q V I++ + ++ I GE+ ++ G + +
Sbjct: 794 TEGPLKTLKAFKRIELKAGEKQNVTIDLPASSFEFYDAKTRGMNIASGEYEVYYGNSSDT 853
Query: 758 VSLH 761
L
Sbjct: 854 KDLK 857
>gi|329923020|ref|ZP_08278536.1| glycosyl hydrolase family 3 N-terminal domain protein
[Paenibacillus sp. HGF5]
gi|328941793|gb|EGG38078.1| glycosyl hydrolase family 3 N-terminal domain protein
[Paenibacillus sp. HGF5]
Length = 763
Score = 272 bits (695), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 218/678 (32%), Positives = 334/678 (49%), Gaps = 86/678 (12%)
Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
GAT FP +T S++N L+ +I R V+ E RA GG A +SP +++ RDPRWGR
Sbjct: 123 GATVFPVPLTIGSTWNTELFRSISRAVAAETRA--QGGAA---TYSPVLDVVRDPRWGRT 177
Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGD-RLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
+ET GEDP L ++A + V+GLQG D + A+ KHF Y R +
Sbjct: 178 EETFGEDPHLVAEFAVAAVQGLQGERLDSHTSLLATLKHFAGYGASEGG---RNGAPVHM 234
Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
+++ + +PFR V G + S+M +YN+++GVP + +L+ +R W +G++++
Sbjct: 235 GLRELHEVDLLPFRKAVESGAL-SIMTAYNEIDGVPCTSSRYLLQNVLREAWGFDGFVIT 293
Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLD-CGPFLGLHTESAVQRGLLSEIDINNA 346
DC ++ + + + EAA +++AG+D++ G H + A+++GL++E D+N A
Sbjct: 294 DCGAIHMLACGHNTAGSGVEAATQSLKAGVDMEMSGTMFRAHLQQALEQGLITEDDLNRA 353
Query: 347 LVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPL 406
L ++ RLG+FD P P + + +H LA +AA +GIVLLKN+G LPL
Sbjct: 354 AGRVLELKFRLGLFD-RPYVDP--AWAEQVIGCKEHIALAYQAAAEGIVLLKNEGNLLPL 410
Query: 407 SHIRHRTVAVIGPNSDVTVTMIGNYAGIA--CGYTTPLQGIGRY---ARTIHQQGCKDVA 461
T+AVIGPN+ +G+Y T L GI R +R ++ GC+ +
Sbjct: 411 DS-SSGTIAVIGPNAHTPYHQLGDYTSPQPPGQIVTVLDGIRRRLGDSRVLYAPGCR-IQ 468
Query: 462 CADDQLFGAAIDASRQADATILVMGLDQSIE-------------------------AEAL 496
+ F A+ + QAD ++V+G + + E +
Sbjct: 469 GDSREGFPRALACAEQADVIVMVLGGSSARDFGEGTIDLRTGASVVTGDAKSDMECGEGI 528
Query: 497 DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAG 556
DR+ L L G Q EL+ ++ K P I+V ++G PI + D I AII A YPGQ G
Sbjct: 529 DRSTLTLMGVQLELLQELQKLGK-PVIVVYINGRPITEPWI--DEFIPAIIEAWYPGQEG 585
Query: 557 GTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVY 616
G AIAD+LFG NP G+LP++ P+E + LP++ A +R G+ Y Y
Sbjct: 586 GGAIADMLFGDINPSGRLPLS-IPKE-VGQLPISYNA------RRTRGKRYLETDLAPRY 637
Query: 617 PFGHGMSYTNFVH-TVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVD 675
PFG G+SYT F + + P VV I G+A V++D
Sbjct: 638 PFGFGLSYTEFRYGRLTVEPAVVP-------------IGGEAT-------------VRID 671
Query: 676 VKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVV 734
V N G++DGA + L S A P K L F KV + AG Q V I + L ++
Sbjct: 672 VTNAGARDGAEVVQLYVSDLAASVTRPEKALKGFRKVFLKAGETQEVTFTIG-SEQLELI 730
Query: 735 DRSGTRRIPLGEHNIHIG 752
+ GE I +G
Sbjct: 731 GLDLKPVVEPGEFRIQVG 748
>gi|313204104|ref|YP_004042761.1| beta-glucosidase [Paludibacter propionicigenes WB4]
gi|312443420|gb|ADQ79776.1| Beta-glucosidase [Paludibacter propionicigenes WB4]
Length = 871
Score = 272 bits (695), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 158/436 (36%), Positives = 238/436 (54%), Gaps = 32/436 (7%)
Query: 38 RTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNV 97
+ LP+ +L +R DLI RL+L EK L+ + AVPRLGIK + WWSEALHG +N
Sbjct: 28 QQLPYKNPNLSSEERAKDLISRLTLSEKATLMCDISDAVPRLGIKKFNWWSEALHGFANN 87
Query: 98 GPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------- 148
G T FP+ + A+SF+ L I VSDEARA YN T
Sbjct: 88 G----------NVTVFPEPVGMAASFDDNLVYKIFDAVSDEARAKYNEATKKGLENARFL 137
Query: 149 GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHF 207
L+ W+PNVNIFRDPRWGRGQET GEDP L+ + S V+GLQG +D K+ A KHF
Sbjct: 138 SLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVSVVKGLQGPADAKYRKLLACAKHF 197
Query: 208 TAYDLDNWNGVDRFHFNA-KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCA 266
+ W+ R N + +D+ +T+ F+ V + V VMC+Y +++ P C
Sbjct: 198 AVHSGPEWS---RHSLNINNLDPRDLWETYLPAFKSLVQKSDVRQVMCAYQRLDDEPCCG 254
Query: 267 DPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLG 326
+ +L+R +R EW +V+DC +V +Y + +S AA+ + AG D++C
Sbjct: 255 NTQLLQRILRDEWGYKYMVVADCGAVSDFYTSHKVSSDAVHAASKGVWAGTDVECQWDNH 314
Query: 327 LHTE--SAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQE 384
++ + AV +GL++E +IN L+N L + LG D + + P+ + V +H++
Sbjct: 315 IYKQLPDAVAKGLITEAEINKHLLNVLIGRFDLGEMDDD-ALVPWSKIPMSVVNNEEHRK 373
Query: 385 LALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG 444
LAL+ A + + LL+N+ LPLS + + +AV+GPN++ + GNY G T L G
Sbjct: 374 LALDMALKSMTLLQNKNNILPLS--KSKKIAVVGPNANDKPMLWGNYNGTPVRTITILDG 431
Query: 445 IGRYA---RTIHQQGC 457
I + ++++GC
Sbjct: 432 ITSKLSANKVLYEKGC 447
Score = 89.4 bits (220), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 73/272 (26%), Positives = 113/272 (41%), Gaps = 38/272 (13%)
Query: 497 DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAG 556
DR + LP Q+ + + A K I V SG I A AI+ A Y G++G
Sbjct: 638 DRTDIELPAVQRNCLKALKEAGK-QVIFVNCSGSAI--ALIPETESCDAILQAWYGGESG 694
Query: 557 GTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVY 616
G A+AD+LFG NP GKLP+T+Y + + + GRTYR+ P ++
Sbjct: 695 GQAVADVLFGDYNPSGKLPITFYK---------SVKQLSDFEDYSMKGRTYRYMSDP-LF 744
Query: 617 PFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDV 676
PFG G+SYT F NA +S IK ++ + + V
Sbjct: 745 PFGFGLSYTTFAIG-------------------NAKLSNPEIKSNQ------SVELTIPV 779
Query: 677 KNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDR 736
N G + G + V+ P K L F++V V AG + IN+ +
Sbjct: 780 SNKGKRSGTEIVQVYVRMVNDTDGPLKTLKGFQRVEVAAGKTAQAVINLPYNSFEFFNRN 839
Query: 737 SGTRRIPLGEHNIHIGGTKHSVSLHAATLGVI 768
+ + GE+ + G + + L A + ++
Sbjct: 840 TVKMEVVPGEYEVLYGNSSATKDLKTAKVRIL 871
>gi|323344052|ref|ZP_08084278.1| beta-glucosidase [Prevotella oralis ATCC 33269]
gi|323094781|gb|EFZ37356.1| beta-glucosidase [Prevotella oralis ATCC 33269]
Length = 779
Score = 271 bits (694), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 208/690 (30%), Positives = 328/690 (47%), Gaps = 97/690 (14%)
Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
GAT FP + A++++ + E G +++ E R G + P +++ +PRW R
Sbjct: 147 GATVFPTGLGMAATWSTDVIEQAGVIIAKEIRLQ-----GGHISYGPVLDLAHEPRWSRV 201
Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDR-LKVAASCKHFTAYDL--DNWNGVDRFHFNA 225
+ET GEDPVLSG A + V+GL D + A+ KHF AY + NG
Sbjct: 202 EETMGEDPVLSGTIAVAQVKGLGAGDITKPFATIATLKHFIAYGIPESGQNGAPSI---- 257
Query: 226 KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYI 285
+ +D+ D F PFR + G + SVM SYN ++G+P ++ ++L +R +W G++
Sbjct: 258 -IGTRDLLDNFLPPFRRAIDAGAL-SVMTSYNSMDGIPCTSNGHLLTEILRNQWGFKGFV 315
Query: 286 VSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINN 345
VSD S+ Y T H S+ +EA +A+RAG+D+D G AV++G +SE I+
Sbjct: 316 VSDLYSIDGIYGTHHTVSSLQEAGIEALRAGVDVDLGANAFALLCDAVRQGRVSEAAIDE 375
Query: 346 ALVNTLTVQMRLGMFDGEPSSQPYGHLGPKD----VCTPDHQELALEAARQGIVLLKNQG 401
A++ L +++ +G+F+ PY + PK V T ++ ++A A + I LLKN
Sbjct: 376 AVLRILRMKIEMGLFE-----HPY--VNPKTAKTGVRTAENIQVAKRVAEESITLLKNSN 428
Query: 402 PSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIA--CGYTTPLQGIGRY---ARTIHQQG 456
LPLS ++ +AVIGPN+D M+G+Y T L GI ++ + +G
Sbjct: 429 KLLPLS--KNIKIAVIGPNADNRYNMLGDYTAPQQDSNVKTILDGIRSKLSPSQITYVKG 486
Query: 457 CKDVACADDQLFGAAIDASRQADATILVMG-----------------------LDQSIEA 493
C +++ G A+ A+R+AD ++ +G +
Sbjct: 487 CSIRDTVFNEI-GEAVRAAREADVIVVAVGGSSARDFKTSYQETGAAITSSKVVSDMESG 545
Query: 494 EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPG 553
E DRA L L G Q L+ + K P +++ + G P+D +A + A++ A YPG
Sbjct: 546 EGFDRASLSLMGIQSRLLQSLKETGK-PMVVIYIEGRPLDKTWASE--QADALLTAYYPG 602
Query: 554 QAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGP 613
Q GG AIA++LFG NP G+LP+T P+ + LP+ RP Y
Sbjct: 603 QEGGNAIANVLFGDYNPAGRLPIT-VPRS-VGQLPVYYNKKRPVV------HNYVEMAST 654
Query: 614 VVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQ 673
+YPFG+G+SYT+F D H +I K + V
Sbjct: 655 PLYPFGYGLSYTSF---------------DYSHLNIT-------------KKSEEEYEVS 686
Query: 674 VDVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLS 732
D++N G +DG L S A P KQL F ++H+ G +R+ + I LS
Sbjct: 687 FDIRNSGERDGDEVAQLYISDKVASVVQPVKQLKGFARIHLKKGETKRITL-ILKKDDLS 745
Query: 733 VVDRSGTRRIPLGEHNIHIGGTKHSVSLHA 762
+ DR+ R + G+ I IG + + L A
Sbjct: 746 ITDRNMERVVEAGDFEIQIGSSSEDIRLKA 775
>gi|393782347|ref|ZP_10370532.1| hypothetical protein HMPREF1071_01400 [Bacteroides salyersiae
CL02T12C01]
gi|392673618|gb|EIY67077.1| hypothetical protein HMPREF1071_01400 [Bacteroides salyersiae
CL02T12C01]
Length = 862
Score = 271 bits (694), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 167/432 (38%), Positives = 242/432 (56%), Gaps = 31/432 (7%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
P+ L QR DL+GRL+L+EKV+L+ + + V RLGI Y+WW+EALHGV+ G
Sbjct: 23 PYKNPDLSPEQRAEDLLGRLTLEEKVRLMQNESFPVERLGIAHYDWWNEALHGVARAGI- 81
Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGG---------TAGLT 151
AT FP + AS+F+ L E + VVSDEARA Y+ GLT
Sbjct: 82 ---------ATVFPITMGMASTFDDKLVEDVFTVVSDEARAKYHNAHREGRRGVRCEGLT 132
Query: 152 YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAY 210
+W+PNVNIFRDPRWGRGQET GEDP L+ + + V GLQG +D K+ A KH+ +
Sbjct: 133 FWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVAVVNGLQGPADAKYDKLHACAKHYAVH 192
Query: 211 DLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
R +FNA+ +S +D+ +T+ F+ V E V VMC+YN+ G P C
Sbjct: 193 SGPE---AKRHYFNAENISPRDLWETYLPAFKDLVQEADVKEVMCAYNRFEGDPCCGSNR 249
Query: 270 ILKRTIRGEWRLNGYIVSDCDSVG-VYYDTQHFT-STPEEAAADAIRAGLDLDCGPFLGL 327
+L + +R EW +VSDC ++ +Y +H T +A+A A+ +G DL+CG
Sbjct: 250 LLTQILRDEWGYKHMVVSDCGAISDFFYKDRHATHKDAADASAAAVLSGTDLECGIEYA- 308
Query: 328 HTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELAL 387
H E AV++GL+SE IN +L L + LG D + + P+ + V H+++AL
Sbjct: 309 HLEEAVKKGLISEERINTSLRRLLKARFELGEMDDD-ALVPWSKISIDTVDCETHKQMAL 367
Query: 388 EAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI-G 446
+ R+ +VLL N G LPL+ R +AV+GPN+ +V GNY G +T L+GI
Sbjct: 368 DVTRKSMVLLHNNG-VLPLAKTGTR-IAVMGPNAVDSVMQWGNYKGTPSHTSTILEGIRN 425
Query: 447 RYARTIHQQGCK 458
+ +++GC+
Sbjct: 426 KIGNVPYEKGCE 437
Score = 108 bits (269), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 89/292 (30%), Positives = 125/292 (42%), Gaps = 48/292 (16%)
Query: 478 ADATILVMGLDQSIEAEAL-----------DRAGLLLPGRQQELVSKVSMASKGPTILVL 526
AD I V G+ +E E DR + LP Q+ ++ + A K I V
Sbjct: 598 ADVVIFVGGISPDLEGEDKYFVNCPGFSGGDRTTIELPEVQRNILKALKQAGK-KVIFVN 656
Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
SG VA AI+ A YPGQAGGTA+AD+LFG NP GKLP+T+Y
Sbjct: 657 CSGSA--VALVPETKSCDAILQAWYPGQAGGTAVADVLFGDYNPSGKLPVTFYKN--TDQ 712
Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
LP E + GRTYR+ +YPFG+G+SYT F + V
Sbjct: 713 LPDFE-------NYDMKGRTYRYMTETPLYPFGYGLSYTTFDISKGRLSKNVI------- 758
Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLV 706
S N T++ KA +VKN G ++G + V+ P K L
Sbjct: 759 -STNQTVTFKA-----------------NVKNTGKREGTEVVQVYVRKVGDKEGPVKTLR 800
Query: 707 AFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSV 758
AF + + AG V I++ + + T RI G++ I G + ++
Sbjct: 801 AFRSIPLKAGKSSVVSIDLPPRTFEFFDPETNTMRILPGDYEIMYGNSSETL 852
>gi|346226088|ref|ZP_08847230.1| glycoside hydrolase family 3 domain protein [Anaerophaga
thermohalophila DSM 12881]
Length = 749
Score = 271 bits (692), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 224/728 (30%), Positives = 337/728 (46%), Gaps = 100/728 (13%)
Query: 35 ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
+ + PF L R++DL+ R++L EKV L S +VPRLG+KG E HGV
Sbjct: 47 SAQESYPFQNPELDSEARIDDLLSRMTLDEKVSAL-STDPSVPRLGVKGAPH-IEGYHGV 104
Query: 95 SNVGPGT--KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNG---GTAG 149
+ GP G + T+FPQ +++N L G + S EAR ++ G
Sbjct: 105 AMGGPANWAPKGDEAVPTTTFPQAYGMGATWNPELIRLAGEIESIEARYIFQNPEIAKGG 164
Query: 150 LTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTA 209
L +PN ++ RDPRWGR +E GEDP L G A ++ +GLQG D + A+ KHF A
Sbjct: 165 LVVRAPNADLGRDPRWGRTEECFGEDPFLVGTSATAFTKGLQGDDDQYWRTASLLKHFLA 224
Query: 210 YDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
N N R ++ Q + + FR +EG + M +YN +NGVP +
Sbjct: 225 ----NSNENGRESSSSDFDMQLYHEYYGASFRRAFIEGGSNAYMAAYNAINGVPAHVH-D 279
Query: 270 ILKRTIRGEWRLNGYIVSDCDSVGV------YYDTQHFTSTPEEAAADAIRAGLDLDCGP 323
+ K W ++G +D + YYD + AA I+AGL+
Sbjct: 280 MHKEITERMWGVDGIKCTDGGGYQLLVYGHKYYDDLYL------AAEGVIKAGLN----Q 329
Query: 324 FLGLHTE---SAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLG----PKD 376
FL + E A+ G ++E DI+ L V ++LG D + PY +G P
Sbjct: 330 FLDNYREGVYGALAHGYITEADIDEVLRGVYRVMIKLGQLDPQ-EKVPYSAIGRDGKPAP 388
Query: 377 VCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIAC 436
T H++ AL AR+ IVLLKN +LPL+ + VAVIG +D ++ Y+G+
Sbjct: 389 WTTQKHKDAALRMARESIVLLKNNNKTLPLNADKLNKVAVIGYLAD--TVLLDWYSGLPP 446
Query: 437 GYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGL--------- 487
TPL+GI R K V A D + AA++A+ +AD I+++G
Sbjct: 447 YRITPLEGI----REKLGNDSK-VLYAPDNDYNAAVEAASEADVAIVILGNYPTCNSEIW 501
Query: 488 ----DQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRI 543
D + EA+DR L L E + K+ M + TI VL S P + +++ + +
Sbjct: 502 ADCPDPGMGREAIDRKTLRL---TDEYLVKLVMEANPNTIFVLQSSFPYAINWSQQN--V 556
Query: 544 AAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRY 602
AI+ + GQ G+A+AD+LFG NPGGKL TW E LP M E +R
Sbjct: 557 PAILHLTHNGQETGSALADVLFGDYNPGGKLTQTWPKSE--DQLPDMMEYDIR------- 607
Query: 603 PGRTYRFYKGPVVYPFGHGMSYTNFV--HTVANAPTVVAVPLDGRHGSINATISGKAIKV 660
G TY +++ +YPFGHG+SYT F N P V A
Sbjct: 608 KGHTYMYFEDKPLYPFGHGLSYTTFAWEDISINKPVVSA--------------------- 646
Query: 661 THAKCNRLTLGVQVDVKNVGSKDGAHTLLVFST-PPAGHWAPHKQLVAFEKVHVPAGAQQ 719
+ + + V +KN G G + ++++ P + P K L F++V + G ++
Sbjct: 647 -----DDEEVIITVKLKNTGDVKGDEVVQLYASFPESTVRRPAKALKGFKRVTLEPGEKK 701
Query: 720 RVGINIHV 727
++ I I +
Sbjct: 702 KIEIPIKL 709
>gi|225872720|ref|YP_002754177.1| xylan 1,4-beta-xylosidase [Acidobacterium capsulatum ATCC 51196]
gi|225793233|gb|ACO33323.1| xylann 1,4-beta-xylosidase [Acidobacterium capsulatum ATCC 51196]
Length = 721
Score = 271 bits (692), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 231/755 (30%), Positives = 347/755 (45%), Gaps = 96/755 (12%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
PF +L QR++DL+ R++LQEK++ L VPRLGI G E LHG + GP
Sbjct: 25 PFQNPALSPDQRIDDLLSRMTLQEKIQAL-GDDPGVPRLGIPG-ALTEEGLHGAAIGGPA 82
Query: 101 --TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNG-GTAGLTYWSPNV 157
G T FPQ +++ L + V + E R N GL +PN
Sbjct: 83 HWEGRGRAVVPTTQFPQNHGLGQTWDPALLQKAANVEAYETRWAVNKYHDGGLIVRAPNA 142
Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
N+ RDPRWGR +E+ GEDP L G A ++++GLQG++ + AA KHF AY N
Sbjct: 143 NLSRDPRWGRTEESYGEDPYLVGTLAVAWIKGLQGNNPRYWETAALMKHFDAYS----NE 198
Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
+R ++ K+ + + VPFRM + +G + M SYN NG+P A+P +LK +
Sbjct: 199 ANRDGSSSNFGKRLFYEYYSVPFRMGIEQGHSDAFMTSYNAWNGIPMTANP-VLKSVVMK 257
Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLH---TESAVQ 334
+W NG I +D ++ H+ T EAAA A+ AG++ FL + E A+Q
Sbjct: 258 KWGFNGIICTDAGALSNMVTHFHYYKTMPEAAAGAVHAGINQ----FLDRYQQPVEEALQ 313
Query: 335 RGLLSEIDINNALVNTLTVQMRLGMFDGEPSS-QPYGHLG----------PKDVCTPDHQ 383
+ LL+E I+ L V +RLG+ D PSS PY +G P D P H
Sbjct: 314 QKLLTEQQIDQDLKGVYRVVLRLGLMD--PSSMSPYSMIGLTNDNPAKGDPWD--WPSHI 369
Query: 384 ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQ 443
L + + IVLLKNQ +LPL + ++AVIGP +++ + Y+G TP++
Sbjct: 370 ALDRKVTDESIVLLKNQNHALPLDAKKLHSIAVIGPWANIVA--LDWYSGTPPFGVTPVE 427
Query: 444 GIGRYARTIHQQGCKDVACA--DDQLFGAAIDASRQADATILVMGLDQSIEA-------- 493
G I Q+ DV D AA ++Q+D I+++G + +A
Sbjct: 428 G-------IRQRVGPDVKVTFNDGSNLQAAAALAKQSDEAIVIIGNHPTCDAGWGKCALP 480
Query: 494 ----EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWA 549
EA DR L LP E ++K A+ T++VL + P + + I AI+
Sbjct: 481 SEGKEAFDRTALNLP---DESIAKAVYAANPHTVVVLQTSFPYTTDWTQ--AHIPAILEM 535
Query: 550 GYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL-PMTEMAMRPSQSKRYPGRTYR 608
+ + GTA+AD+LFG +P G+L TW I L PM + +R GRTY
Sbjct: 536 AHNSEEQGTALADVLFGDYDPAGRLAQTWVAS--IGQLPPMMDYNIR-------DGRTYM 586
Query: 609 FYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRL 668
+ K +YPFG G+SYT F ++ N +S +H
Sbjct: 587 YLKSKPLYPFGFGLSYTTFKYS-------------------NLRLS------SHTLPAGG 621
Query: 669 TLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVGINIHV 727
L V VDV N G +G + ++ + P + L F++V +P G + V + +
Sbjct: 622 QLTVSVDVTNTGKYNGDEVVQMYVKHLDSKVSRPLEALKGFDRVSIPVGQTRTVTLPLKA 681
Query: 728 CKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHA 762
+ R+ I IG + + A
Sbjct: 682 SALAYWDKNTHAFRVEPDHVQIRIGASSADIRQKA 716
>gi|379722647|ref|YP_005314778.1| glycoside hydrolase family protein [Paenibacillus mucilaginosus
3016]
gi|378571319|gb|AFC31629.1| glycoside hydrolase family 3 domain-containing protein
[Paenibacillus mucilaginosus 3016]
Length = 937
Score = 270 bits (691), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 165/430 (38%), Positives = 243/430 (56%), Gaps = 32/430 (7%)
Query: 36 TTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVS 95
T +PF LP+ +RV DL+ RL+L+EK+ L+ +PRLGI Y+ +E HGV+
Sbjct: 3 TETKMPFQDTRLPLEERVQDLVSRLTLEEKINLMCQYQEEIPRLGIAKYKHGTEGAHGVA 62
Query: 96 NVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY--NGGTAGLTYW 153
+G AT FPQ + A ++N L IG V+ DEAR Y N GLT W
Sbjct: 63 WLGE----------ATVFPQNVGLACTWNPELMREIGSVIGDEARVYYQRNPEINGLTIW 112
Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLD 213
+P V++ RDPRWGR +E GEDP L+G+ + V+G+QG LK AA+ KHF L
Sbjct: 113 APTVDMERDPRWGRTEEAYGEDPYLTGRLSTELVKGMQGDHPFYLKTAATLKHF----LG 168
Query: 214 NWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKR 273
N N +DR +A + +++ + + F EG S+M +YN VNG P +P++
Sbjct: 169 NNNEMDRGECSASIDPRNMREYYLKAFEPAFREGGAQSMMTAYNSVNGTPCNLNPDV-NA 227
Query: 274 TIRGEWRLNGYIVSDC-DSVGVYYDTQHFTSTPEEAAADAIRAGLDL---DCGPFLGLHT 329
++GEW ++G++VSD D +G + ++F S E AA ++R+G+D D G L
Sbjct: 228 IVKGEWGMDGFVVSDAGDVLGTVNEHRYFASYAEAVAA-SVRSGIDSITDDVGITL-RAI 285
Query: 330 ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEA 389
A++ GLL+E D++ A+ N V++RLG FD P PY H+ +C P+H L+L+A
Sbjct: 286 RDALEGGLLAEEDLDRAVGNAFRVRIRLGEFDS-PEENPYAHVPEAKLCAPEHAALSLQA 344
Query: 390 ARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY- 448
AR+G VLLKN+G LPL+ + ++AVIGP +DV T Y+G TPL+G+
Sbjct: 345 AREGFVLLKNEG-LLPLA--KPASIAVIGPLADVVHTDW--YSGTPPYRITPLEGVAERM 399
Query: 449 --ARTIHQQG 456
IH+ G
Sbjct: 400 APGTVIHRTG 409
Score = 99.4 bits (246), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 83/298 (27%), Positives = 143/298 (47%), Gaps = 46/298 (15%)
Query: 470 AAIDASRQADATILVMGLDQSIEA-EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMS 528
AA++A+RQA+ I+ +G I E +DR L L ++ L+ V +A+ T+ VL+
Sbjct: 546 AAVEAARQAETAIVFLGNSPFINGKECVDRPDLTLAPAKEALLQAV-LAANPRTVAVLVG 604
Query: 529 GGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLP 588
P V +A+ + AI++ + GQ G A AD+LFG +P G+L MTWY + +LP
Sbjct: 605 SYPFAVNWAQ--AHVPAILYTSHAGQELGRAAADVLFGDCSPAGRLNMTWY--KSAADLP 660
Query: 589 -MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVH-TVANAPTVVAVPLDGRH 646
+ + + + RTY+++ G V+YPFGHG+SYT F + V AP+ +
Sbjct: 661 DLLDYDIIKGK------RTYQYFDGEVLYPFGHGLSYTEFRYLEVQAAPS--------EN 706
Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW-APHKQL 705
G+ N T++ V+N GS+ G + ++ P K L
Sbjct: 707 GAFNVTVT---------------------VQNAGSRAGDEVVQLYVKAGESRVPRPLKTL 745
Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRR-IPLGEHNIHIGGTKHSVSLHA 762
F ++H+ G ++V + + L+ D + + + G + GG+ ++ L A
Sbjct: 746 AGFRRIHLLPGQTEQVEFTVEAPQ-LAFWDVTREKYCVEEGTYRFMAGGSSGNLPLAA 802
>gi|189464310|ref|ZP_03013095.1| hypothetical protein BACINT_00651 [Bacteroides intestinalis DSM
17393]
gi|189438100|gb|EDV07085.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 864
Score = 270 bits (691), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 169/450 (37%), Positives = 250/450 (55%), Gaps = 54/450 (12%)
Query: 1 MASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
M + F GL L A + A+E + KD P+ +R+ DL+ RL
Sbjct: 1 MRTLKKIVFTGLFALIACAGVAQAQELY----KDEKA----------PMHERIMDLLSRL 46
Query: 61 SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
+++EK+ LL + + +PRL I Y +EALHGV V PG +F T FPQ I A
Sbjct: 47 TVEEKISLLRATSPGIPRLDIPKYYHGNEALHGV--VRPG-RF-------TVFPQAIGLA 96
Query: 121 SSFNATLWEAIGRVVSDEARAMYNGGTAG----------LTYWSPNVNIFRDPRWGRGQE 170
+++N L + V+SDEARA +N G LT+WSP VN+ RDPRWGR E
Sbjct: 97 ATWNPELQLQVATVISDEARARWNELDQGREQKSQFSDLLTFWSPTVNMARDPRWGRTPE 156
Query: 171 TPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQ 230
T GEDP LSG ++V+GLQG D LK+ ++ KHF A + ++ +RF N ++S++
Sbjct: 157 TYGEDPYLSGVMGTAFVKGLQGDDDRYLKIVSTPKHFAANNEEH----NRFVCNPQISEK 212
Query: 231 DIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCD 290
+ + + F CV +GK AS+M +YN +N VP + +L + +R +W GY+VSDC
Sbjct: 213 QLREYYLPAFEACVKDGKSASIMSAYNALNDVPCTLNAWLLTKVLREDWGFKGYVVSDCG 272
Query: 291 SVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG------PFLGLHTESAVQRGLLSEIDIN 344
+ + + T E AA +I+AGLDL+CG P L SA ++ +++ DI+
Sbjct: 273 GPSLLVNAHKYVKTKEAAATLSIKAGLDLECGDDVFDEPLL-----SAYRQYMVTNADID 327
Query: 345 NALVNTLTVQMRLGMFD-GEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
+A L +M+LG+FD GE PY + P V + HQE+AL AAR+ IVLLKNQ
Sbjct: 328 SAAYRVLRARMQLGLFDSGE--KNPYTKISPAVVGSAKHQEVALNAARECIVLLKNQKKM 385
Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYAG 433
LPL+ + +++AV+G N+ G+Y+G
Sbjct: 386 LPLNAKKVKSIAVVGINA--GNCEFGDYSG 413
Score = 128 bits (321), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 85/298 (28%), Positives = 142/298 (47%), Gaps = 45/298 (15%)
Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
L+G A A R+ + + V+G+++SIE E DR + LP Q E + ++ + P I+V+
Sbjct: 591 LYGEAGKAVRECETVVAVLGINKSIEREGQDRYDIQLPADQMEFLQEIYKVN--PNIVVV 648
Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
+ G +A D + AI+ A YPG++GG A+A++LFG NPGG+LP+T+Y +
Sbjct: 649 LVAGS-SLAVNWMDEHVPAIVNAWYPGESGGKAVAEVLFGDYNPGGRLPLTYY--RSLDE 705
Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
LP + GRTY+++KG V+YPFG+G+SYT F ++ DG
Sbjct: 706 LPPFD------DYDITKGRTYKYFKGDVLYPFGYGLSYTTFKYSNLQVA-------DGEE 752
Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQL 705
+ V +KN G G V+ P P K+L
Sbjct: 753 ----------------------EINVSFQLKNAGKYAGDEVAQVYVKLPERDEVMPVKEL 790
Query: 706 VAFEKVHVPAGAQQRVGINIH--VCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
FE+V + +G +++ + + + +Y + G P G++ I +G + + L
Sbjct: 791 KGFERVALKSGENKKMTLKLRKDLLRYWD--EAKGKFVYPSGDYTIMVGASSADIRLQ 846
>gi|295135996|ref|YP_003586672.1| beta-glucosidase [Zunongwangia profunda SM-A87]
gi|294984011|gb|ADF54476.1| putative beta-glucosidase [Zunongwangia profunda SM-A87]
Length = 796
Score = 270 bits (690), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 219/717 (30%), Positives = 336/717 (46%), Gaps = 111/717 (15%)
Query: 88 SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT 147
EA+HG VG T FP I AS++N L + + V++ E RA +
Sbjct: 135 EEAMHGHMAVG-----------TTVFPTAIGQASTWNPDLIKKMAHVIAKEIRAQGSN-- 181
Query: 148 AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLK----VAAS 203
T + P ++I R+PRW R +ET GEDP L + S V G QGS LK VAA+
Sbjct: 182 ---TAYGPIIDIAREPRWSRVEETFGEDPYLIAEMGKSMVTGFQGSHESDLKSNEHVAAT 238
Query: 204 CKHFTAYDLDNWNGVDRFHFN---AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVN 260
KHF AY GV N + ++D+ + P + V G V SVM +Y+ ++
Sbjct: 239 LKHFAAY------GVSEGGHNGAAVHIGQRDLFQNYMYPVKEAVDNG-VMSVMTAYSSID 291
Query: 261 GVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLD 320
GVP+ A N+L ++ +W G+++SD S+ H T E+AAA A+ AG+D+D
Sbjct: 292 GVPSTAHKNLLTNILKEKWGFKGFVISDLASIEGLLGDHHIVDTEEDAAAMAMNAGVDVD 351
Query: 321 CGPFLGLHTE--SAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGH--LGPKD 376
G G AV G ++E I+ A+ LTV+ +LG+F+ PY + K
Sbjct: 352 LGGN-GYDDALIDAVNAGKVAEERIDEAVRRILTVKFKLGLFE-----NPYANEKQAEKI 405
Query: 377 VCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIAC 436
V +H ELA E ARQ I +LKN+ LPL+ + +AVIG N+D+ +G+Y
Sbjct: 406 VRNSEHIELAREVARQSITMLKNEDNILPLNK-ELQNIAVIGSNADMQYNQLGDYTAPQS 464
Query: 437 --GYTTPLQGIGRYARTIHQQGCKDVACADDQLFG--AAIDASRQADATILVMG------ 486
T L+GI + + K A D AA++A++ A+ I+V+G
Sbjct: 465 EENIITVLEGIQHKMPNANIEYVKGTAVRDTTQTNIPAAVEAAKNAEVAIVVLGGSSARD 524
Query: 487 --------------------LDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
L E DR+ L L G+Q EL+ V +A+ PT+LVL
Sbjct: 525 FKTEYLETGAATISSKEDQVLSDMESGEGYDRSTLNLMGKQLELLQAV-VATGTPTVLVL 583
Query: 527 MSGGPIDVAF-AKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYIT 585
+ G P+ + + A+N P I+ A YPGQ GG+AIAD++FG NP G+LP++ P+ +
Sbjct: 584 IKGRPLLLNWPAENVP---VILDAWYPGQEGGSAIADVIFGDFNPAGRLPVS-VPKS-LG 638
Query: 586 NLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGR 645
+P+ P++ R Y +YPFG+G+SY+ F +
Sbjct: 639 QIPVYYNYWFPNR------RDYVETDAKPLYPFGYGLSYSEF-----------------K 675
Query: 646 HGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQ 704
+ + SGK + + + + N DG + L + +P KQ
Sbjct: 676 YSDLKVATSGKG--------RNTKIEISLKISNTSKVDGDEVIQLYIRDMVSTVLSPVKQ 727
Query: 705 LVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
L AFE+V + AG + V + + K LS+ D +++ GE + IG + + L
Sbjct: 728 LRAFERVSIKAGETKTVQFEL-LPKELSLFDTEMKQKVQAGEFKLMIGASSEDIRLE 783
>gi|294462719|gb|ADE76904.1| unknown [Picea sitchensis]
Length = 304
Score = 270 bits (690), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 131/297 (44%), Positives = 195/297 (65%), Gaps = 5/297 (1%)
Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
+FG A++ +++ D I+V+GLDQ+ E E DR L LPG+Q+ LV +VS A+K P + V+
Sbjct: 1 MFGEAVETAKKVDEVIVVVGLDQTQEKEERDRIKLTLPGQQENLVYQVSRAAKRPVVFVI 60
Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
+SGGP+DV+FA NDP+I++IIWAGYPGQAGG A+A+I+FG NPGG+LPMTWYPQ+++
Sbjct: 61 LSGGPVDVSFAVNDPQISSIIWAGYPGQAGGQALAEIIFGDYNPGGRLPMTWYPQDFV-K 119
Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
+PMT+M MRP+ YPGRTYRFY G V+ FG G+SY+ + + ++ T+ + L+
Sbjct: 120 IPMTDMNMRPNHDTGYPGRTYRFYTGKKVFEFGQGLSYSAYSYNFSST-TIQKIDLNVTM 178
Query: 647 GSINATIS-GKA-IKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW-APHK 703
A + GK ++V + C +L + V+N DG H +L++S PA H AP K
Sbjct: 179 EHFEALGNRGKGHVRVENTPCRKLKFRSSIFVRNHDKMDGRHAVLLYSKSPATHKGAPQK 238
Query: 704 QLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
QL+ F VHV +V + C + S V+ +G R + +G H++ +G T++ VSL
Sbjct: 239 QLIGFRSVHVQGKHTAKVTFVVKPCDHFSTVEENGQRLLAIGSHSLIVGDTQYPVSL 295
>gi|156933295|ref|YP_001437211.1| hypothetical protein ESA_01107 [Cronobacter sakazakii ATCC BAA-894]
gi|156531549|gb|ABU76375.1| hypothetical protein ESA_01107 [Cronobacter sakazakii ATCC BAA-894]
Length = 757
Score = 270 bits (690), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 212/694 (30%), Positives = 337/694 (48%), Gaps = 112/694 (16%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN + +GRV + Y GL W+P V++ RDPRWGRG
Sbjct: 110 TIFPNSLGLASSFNLDAVKTVGRV------SAYEAADDGLNMTWAPMVDVTRDPRWGRGS 163
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V +QG S DR V S KHF AY +N VD
Sbjct: 164 EGFGEDTYLTSMMGKTMVESMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 218
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G VM + N +NG P +D +LK +RG+W G
Sbjct: 219 --MSSQRLFNDYMPPYKEALDAGS-GGVMIALNSLNGTPAASDGWLLKDLLRGDWGFKGI 275
Query: 285 IVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
+SD ++ +H T S PE+A AI++G+D+ + + + ++ G ++ +
Sbjct: 276 TISDHGAIKEL--IKHGTASDPEDAVRVAIKSGVDMSMADEYYSKYLPNLIKSGKVTMAE 333
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPKD D H++ A + AR+ +V
Sbjct: 334 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKDSDPKDTNAESRLHRDDARKVARESLV 388
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + T+AV+GP +D M+G++ AG+A T LQG+ G A
Sbjct: 389 LLKNRLETLPLK--KSGTIAVVGPLADSKRDMMGSWSAAGVADQSVTLLQGMKNVAGHKA 446
Query: 450 RTIHQQGCKDVACADDQ----------------------LFGAAIDASRQADATILVMGL 487
+ ++ +G DD+ + A++ ++Q+D + V+G
Sbjct: 447 KILYAKGAN---VTDDKGIVDFLNLYEPAVVVDKRTPKAMIDEAVNVAKQSDVVVAVVGE 503
Query: 488 DQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAII 547
Q + EA R L +P Q++L+S + K P +LVLM+G P+ A K D + AI+
Sbjct: 504 AQGMAHEASSRTDLTIPQSQRDLISALKATGK-PLVLVLMNGRPL--ALVKEDQQADAIL 560
Query: 548 WAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKR 601
+ G GG AIAD+LFG NP GKLP++ +P+ + +P+ T P + +
Sbjct: 561 ETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNK 618
Query: 602 YPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVT 661
Y R + GP +YPFG+G+SYT F +V++ ++ P R G + A+++
Sbjct: 619 YTSRYFDEANGP-LYPFGYGLSYTTF--SVSDVK--LSAPTMKRDGKVTASVT------- 666
Query: 662 HAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQR 720
V N G ++GA + ++ A P KQL FEK+ + G Q
Sbjct: 667 --------------VTNTGKREGATAVQMYLQDVTASMSRPVKQLRGFEKITLKPGESQT 712
Query: 721 VG--INIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
V I+I K+ + R P G+ N+ IG
Sbjct: 713 VSFPIDIDALKFWN--QRMKYDAEP-GKFNVFIG 743
>gi|429087308|ref|ZP_19150040.1| Periplasmic beta-glucosidase [Cronobacter universalis NCTC 9529]
gi|426507111|emb|CCK15152.1| Periplasmic beta-glucosidase [Cronobacter universalis NCTC 9529]
Length = 765
Score = 270 bits (690), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 213/691 (30%), Positives = 335/691 (48%), Gaps = 106/691 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN + +GRV + Y GL W+P V++ RDPRWGRG
Sbjct: 118 TIFPNSLGLASSFNLDAVKTVGRV------SAYEAADDGLNMTWAPMVDVTRDPRWGRGS 171
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V +QG S DR V S KHF AY +N VD
Sbjct: 172 EGFGEDTYLTSMMGKTMVESMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G VM + N +NG P +D +LK +RG+W G
Sbjct: 227 --MSSQRLFNDYMPPYKEALDAGS-GGVMIALNSLNGTPAASDGWLLKDLLRGDWGFKGI 283
Query: 285 IVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
+SD ++ +H T S PE+A AI++G+D+ + + ++ G +S +
Sbjct: 284 TISDHGAIKEL--IKHGTASDPEDAVRVAIKSGVDMSMADEYYSKYLPDLIKSGKVSMEE 341
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPKD D H++ A AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKDSDPKDTNAESRLHRDDARNVARESLV 396
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + T+AV+GP +D M+G++ AG+A T LQG+ G A
Sbjct: 397 LLKNRLETLPLK--KSGTIAVVGPLADSKRDMMGSWSAAGVADQSVTLLQGMKNVAGDKA 454
Query: 450 RTIHQQGCK---DVACAD----------------DQLFGAAIDASRQADATILVMGLDQS 490
+ ++ +G D D ++ A++ ++Q+D + V+G Q
Sbjct: 455 KILYAKGANVTDDKGIVDFLNLYEPAVVVDKRTPKEMIDEAVNVAKQSDVVVAVVGEAQG 514
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R L +P Q++L+S + K P +LVLM+G P+ A K D + AI+
Sbjct: 515 MAHEASSRTDLTIPQSQRDLISALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
+ G GG AIAD+LFG NP GKLP++ +P+ + +P+ T P + +Y
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 629
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F +V++ ++ P R G + A+++
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--SVSDVK--LSAPTMKRDGKVTASVT---------- 674
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
V N G ++GA + ++ A P KQL FEK+ + G Q V
Sbjct: 675 -----------VTNTGKREGATAVQMYLQDVTASMSRPVKQLRGFEKITLKPGESQTVSF 723
Query: 723 -INIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
I+I K+ + R P G+ N+ IG
Sbjct: 724 PIDIDALKFWN--QRMKYDAEP-GKFNVFIG 751
>gi|397691073|ref|YP_006528327.1| beta-glucosidase [Melioribacter roseus P3M]
gi|395812565|gb|AFN75314.1| beta-glucosidase [Melioribacter roseus P3M]
Length = 923
Score = 270 bits (689), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 151/413 (36%), Positives = 236/413 (57%), Gaps = 26/413 (6%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
+R+NDLI ++ +EK+K L + A ++PRLG++ Y +W+E+LHGV GA
Sbjct: 115 ERLNDLISLMTTEEKIKQLTNQADSIPRLGLRAYNYWNESLHGVLA-----------EGA 163
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQE 170
TSFPQ I ++++ L + VSDEARA+ GLTYWSP +NI RDPRWGR +E
Sbjct: 164 TSFPQAIALGATWDPRLVNRVATAVSDEARALNRLYGKGLTYWSPTINIARDPRWGRNEE 223
Query: 171 TPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHF-NAKVSK 229
+ EDP L + ++++G+QG LK A+ KHF A N +R H ++ V
Sbjct: 224 SYSEDPYLLSRMGVAFIKGMQGDHPYYLKTVATPKHFIAN-----NEEERRHTGSSDVDM 278
Query: 230 QDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDC 289
+++ + + F+ ++E + S+M +YN++N VP+ A+ ++ +R +W GY+VSDC
Sbjct: 279 RNLYEYYLPAFKSAIVEARAYSIMGAYNELNHVPSNANMFLMTDLLRRQWGFEGYVVSDC 338
Query: 290 DSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVN 349
++ F T EA A +I AG DL+CG + A+ GLL E DI++AL
Sbjct: 339 GAIHDMLYGHKFFKTGAEAVARSILAGCDLNCGQAYREFIKDALDEGLLREKDIDSALFR 398
Query: 350 TLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHI 409
L+ + RLG FD P PY +G + + +++ LAL+AAR+ IVLLKN LP+
Sbjct: 399 VLSARFRLGEFD-PPELVPYSSIGKDKLDSKENRRLALDAARKSIVLLKNND-ILPIDKS 456
Query: 410 RHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTI-----HQQGC 457
+ +++AVIGPN+ +G Y+G +PL+GI A ++ + +GC
Sbjct: 457 KIKSIAVIGPNA--REAQLGIYSGFPNVLISPLEGIKNKADSLDIRVGYVKGC 507
Score = 116 bits (291), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 94/320 (29%), Positives = 140/320 (43%), Gaps = 50/320 (15%)
Query: 445 IGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLP 504
+G AR G KD F A + + D ILV+G+ I E LDR + LP
Sbjct: 643 LGATARLTWDLGQKD--------FEKAKKIAAENDLVILVLGITPGISQEELDRKEIELP 694
Query: 505 GRQQELVSKVSMASKGPTI-LVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADI 563
Q+ELV + A P I +VL++GGP VA A + AI+ Y G+ GG A+AD+
Sbjct: 695 SVQRELVKQT--AEVNPNIVIVLVNGGP--VALAGAEKYAKAIVENWYNGEFGGQALADV 750
Query: 564 LFGTSNPGGKLPMTWYPQ-EYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGM 622
LFG NPGGKLP T+Y E + + ++ P RTY + ++PFGHG+
Sbjct: 751 LFGDYNPGGKLPQTFYASTEQLPPMSDYDIINNP--------RTYMYLNEQALFPFGHGL 802
Query: 623 SYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRL-TLGVQVDVKNVGS 681
SYT F + ++K+ N TL +Q + NVG+
Sbjct: 803 SYTTFKY--------------------------DSLKIVSNTLNETDTLSLQFRLTNVGN 836
Query: 682 KDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTR 740
++G + ++ S A P KQL F ++ + G + + I V +
Sbjct: 837 RNGDEVVQIYASCKDAKFKVPRKQLKRFRRLTLQTGESKVLEFKIPVDELAFYSTYENDF 896
Query: 741 RIPLGEHNIHIGGTKHSVSL 760
+ G I IG + + L
Sbjct: 897 VVEKGAWEILIGSSSEDIRL 916
>gi|336399370|ref|ZP_08580170.1| Beta-glucosidase [Prevotella multisaccharivorax DSM 17128]
gi|336069106|gb|EGN57740.1| Beta-glucosidase [Prevotella multisaccharivorax DSM 17128]
Length = 862
Score = 270 bits (689), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 158/431 (36%), Positives = 233/431 (54%), Gaps = 30/431 (6%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
P+ L R DL RL+L+EK L+ + A+PRLGIK + WWSEALHG +N G
Sbjct: 24 PYQDPGLSFEARAKDLCSRLTLEEKASLMCDVSPAIPRLGIKPFNWWSEALHGYANNG-- 81
Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA---------GLT 151
T FP+ I A+SFN T+ + SDEAR YN A L+
Sbjct: 82 --------DVTVFPEPIGMAASFNPTMVYQVFTATSDEARGKYNQSMAEGKEDTRFHSLS 133
Query: 152 YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASC-KHFTAY 210
W+PNVNIFRDPRWGRGQET GEDP L+ V+GLQG + + + +C KHF +
Sbjct: 134 VWTPNVNIFRDPRWGRGQETYGEDPYLTSVMGVEVVKGLQGPESTKYRKLYACAKHFAVH 193
Query: 211 DLDNWNGVDRFHFN-AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
+ R N A +S +D+ +T+ F+ V + V VMC+Y +++ P C +
Sbjct: 194 SGPEYT---RHTANLADISPRDLWETYLPAFKATVQQAGVREVMCAYQRLDDEPCCGNSR 250
Query: 270 ILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLH 328
+L++ +R EW +VSDC ++ +Y H +S AAA AG D++CG + +
Sbjct: 251 LLQQILRDEWGFRHMVVSDCGAIADFYTNHHVSSDAVHAAAKGTLAGTDVECGFGYAYMK 310
Query: 329 TESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALE 388
AV+RGL+SE +++ ++ L + LG+ D +P + + PK V + H++LAL
Sbjct: 311 LPEAVRRGLVSEAEVDKHVIRLLKGRFELGVMD-DPKLVSWTKISPKVVDSDAHRQLALN 369
Query: 389 AARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI-GR 447
ARQ + LL+N+ LPL+ + +AV+GPN+ + GNY G TT L+GI +
Sbjct: 370 MARQTMTLLQNRNNVLPLA--KGEKIAVVGPNAADGPMLWGNYNGTPSRTTTILEGIRAK 427
Query: 448 YARTI-HQQGC 457
+ I + QGC
Sbjct: 428 AGKDIPYLQGC 438
Score = 100 bits (249), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 86/311 (27%), Positives = 133/311 (42%), Gaps = 50/311 (16%)
Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
+ AI R + V G+ +E E + DR + LP Q++ + + A
Sbjct: 590 YATAIRQLRGVRTVVFVGGISSKLEGEEMPVHVEGFKGGDRTSIELPAVQRDFLKALKAA 649
Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
K + V SG I A AI+ A Y G+ GG A+AD+L+G NPGGKLP+T
Sbjct: 650 GK-TVVFVNCSGSAI--ALTPEVESCDAILQAWYAGEEGGRAVADVLYGDYNPGGKLPVT 706
Query: 578 WYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
+Y T LP + +M+ GRTYR++ ++PFG+G+SYT F
Sbjct: 707 FY--RSTTQLPAFDDYSMK--------GRTYRYFSD-ALFPFGYGLSYTRFAIG------ 749
Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA 696
GS++A KVT + V V NVG + G + V+
Sbjct: 750 ---------KGSLSAPAMKADGKVT----------LTVPVSNVGKRTGDEVVQVYVRDVN 790
Query: 697 GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKH 756
P K L AF +V + AG ++V I + + S T R G++ ++ G +
Sbjct: 791 DADGPLKSLKAFRRVSLKAGESRKVTIPLTAETFSLFDSASNTVRTKPGKYVVYYGTSSR 850
Query: 757 SVSLHAATLGV 767
L ++ +
Sbjct: 851 DSDLKQLSMTI 861
>gi|299141953|ref|ZP_07035087.1| beta-glucosidase [Prevotella oris C735]
gi|298576415|gb|EFI48287.1| beta-glucosidase [Prevotella oris C735]
Length = 858
Score = 270 bits (689), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 177/490 (36%), Positives = 253/490 (51%), Gaps = 39/490 (7%)
Query: 3 STIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSL 62
+ I L + + +SS LA +P T P+ +L R DL+ RL+L
Sbjct: 2 NKIKTILLSFVTVMTASSALA--QP------SLTGMAYPYQNPNLSALTRAQDLLSRLTL 53
Query: 63 QEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASS 122
+EK L++ + A+PRLGIK + WWSEALHG +N+G T FP+ I A+S
Sbjct: 54 EEKALLMLDESPAIPRLGIKKFFWWSEALHGAANMG----------NVTVFPEPIAMAAS 103
Query: 123 FNATLWEAIGRVVSDEARAMY-----NGGTA----GLTYWSPNVNIFRDPRWGRGQETPG 173
FN L + SDE RA Y NGG L+ W+PNVNIFRDPRWGRGQET G
Sbjct: 104 FNDALLYKVFSAASDEMRAQYHHRIRNGGEDEKFHSLSVWTPNVNIFRDPRWGRGQETYG 163
Query: 174 EDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNA-KVSKQD 231
EDP L+ + VRGLQG D K+ A KH+ + + R N VS +D
Sbjct: 164 EDPYLTAVMGTAVVRGLQGPEDSKYRKLWACAKHYAVHSGPEYT---RHTANLNNVSPRD 220
Query: 232 IEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDS 291
+ +T+ F+ V E KV VMC+Y ++ P C + +L++ +R EW +VSDC +
Sbjct: 221 LWETYLPAFKTLVEEAKVREVMCAYQALDDEPCCGNSRLLQQILRDEWGFQYLVVSDCGA 280
Query: 292 VGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEIDINNALVNT 350
V + +S A A A AG D++CG + AVQRGL+SE +++ ++
Sbjct: 281 VSDIWQNHKTSSDAVHATAKAALAGTDVECGFNYTYKCIPEAVQRGLISEKEVDKHVLRL 340
Query: 351 LTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIR 410
L + LG D +P+ P+ + + + H++L+L+ ARQ IVLL+N+ LPL
Sbjct: 341 LEGRFDLGEMD-DPALVPWSKIPYSVMDSKAHRQLSLDMARQSIVLLQNKQNMLPLKKNN 399
Query: 411 HRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI-GRYARTIHQQGCK--DVACADDQL 467
R +AVIGPN+D M GNY G T L GI ++ + +GC D D L
Sbjct: 400 ER-IAVIGPNADNVPMMWGNYNGTPNRTVTILDGIRAKHKNVKYIKGCDLTDTMVV-DPL 457
Query: 468 FGAAIDASRQ 477
F A + A ++
Sbjct: 458 FDACMIAGKR 467
Score = 101 bits (252), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 84/261 (32%), Positives = 117/261 (44%), Gaps = 52/261 (19%)
Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
+ I A + + + V G+ ++E E + DR + LP Q++ + + A
Sbjct: 598 YQEVIRALKGIEKVVFVGGISPALEGEEMPVDIPGFKGGDRTDIELPRVQRDFIKALHAA 657
Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
K + V SG I A AI+ A Y GQ GGTA+AD+LFG NP GKLP+T
Sbjct: 658 GK-QLVYVNCSGSAI--ALEPETTACDAIVQAWYAGQEGGTAVADVLFGDYNPSGKLPVT 714
Query: 578 WYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTV 637
+Y LP E + GRTYR++ P+ + FGHG+SYT F N T
Sbjct: 715 FYKNS--NQLPDYE-------NYSMKGRTYRYFSDPL-FAFGHGLSYTTF-----NMGT- 758
Query: 638 VAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG 697
A I KA + V++ V+NVGSKDG T+L++
Sbjct: 759 -------------AEIIKKADSIV----------VRIPVENVGSKDGTETVLLYIKNHQD 795
Query: 698 HWAPHKQLVAFEKVHVPAGAQ 718
P K L F +V V AG Q
Sbjct: 796 PNGPIKSLRGFSRVFVKAGHQ 816
>gi|373460605|ref|ZP_09552356.1| hypothetical protein HMPREF9944_00620 [Prevotella maculosa OT 289]
gi|371955223|gb|EHO73027.1| hypothetical protein HMPREF9944_00620 [Prevotella maculosa OT 289]
Length = 858
Score = 270 bits (689), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 177/490 (36%), Positives = 253/490 (51%), Gaps = 39/490 (7%)
Query: 3 STIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSL 62
+ I L + + +SS LA +P T P+ +L R DL+ RL+L
Sbjct: 2 NKIKTILLSFVTVMTASSALA--QP------SLTGMAYPYQNPNLSALTRAQDLLSRLTL 53
Query: 63 QEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASS 122
+EK L++ + A+PRLGIK + WWSEALHG +N+G T FP+ I A+S
Sbjct: 54 EEKALLMLDESPAIPRLGIKKFFWWSEALHGAANMG----------NVTVFPEPIAMAAS 103
Query: 123 FNATLWEAIGRVVSDEARAMY-----NGGTA----GLTYWSPNVNIFRDPRWGRGQETPG 173
FN L + SDE RA Y NGG L+ W+PNVNIFRDPRWGRGQET G
Sbjct: 104 FNDALLYKVFSAASDEMRAQYHHRIRNGGEDEKFHSLSVWTPNVNIFRDPRWGRGQETYG 163
Query: 174 EDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNA-KVSKQD 231
EDP L+ + VRGLQG D K+ A KH+ + + R N VS +D
Sbjct: 164 EDPYLTAVMGTAVVRGLQGPEDSKYRKLWACAKHYAVHSGPEYT---RHTANLNNVSPRD 220
Query: 232 IEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDS 291
+ +T+ F+ V E KV VMC+Y ++ P C + +L++ +R EW +VSDC +
Sbjct: 221 LWETYLPAFKTLVEEAKVREVMCAYQALDDEPCCGNSRLLQQILRDEWGFQYLVVSDCGA 280
Query: 292 VGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEIDINNALVNT 350
V + +S A A A AG D++CG + AVQRGL+SE +++ ++
Sbjct: 281 VSDIWQNHKTSSDAVHATAKAALAGTDVECGFNYTYKCIPEAVQRGLISEKEVDKHVLRL 340
Query: 351 LTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIR 410
L + LG D +P+ P+ + + + H++L+L+ ARQ IVLL+N+ LPL
Sbjct: 341 LEGRFDLGEMD-DPALVPWSKIPYSVMDSKAHRQLSLDMARQSIVLLQNKQNMLPLKKNN 399
Query: 411 HRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI-GRYARTIHQQGCK--DVACADDQL 467
R +AVIGPN+D M GNY G T L GI ++ + +GC D D L
Sbjct: 400 ER-IAVIGPNADNVPMMWGNYNGTPNRTVTILDGIRAKHKNVKYIKGCDLTDTMVV-DPL 457
Query: 468 FGAAIDASRQ 477
F A + A ++
Sbjct: 458 FDACMIAGKR 467
Score = 100 bits (248), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 83/259 (32%), Positives = 116/259 (44%), Gaps = 52/259 (20%)
Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
+ I A + + + V G+ ++E E + DR + LP Q++ + + A
Sbjct: 598 YQEVIRALKGIEKVVFVGGISPALEGEEMPVDIPGFKGGDRTDIELPRVQRDFIKALHAA 657
Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
K + V SG I A AI+ A Y GQ GGTA+AD+LFG NP GKLP+T
Sbjct: 658 GK-QLVYVNCSGSAI--ALEPETTACDAIVQAWYAGQEGGTAVADVLFGDYNPSGKLPVT 714
Query: 578 WYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTV 637
+Y LP E + GRTYR++ P+ + FGHG+SYT F N T
Sbjct: 715 FYKNS--NQLPDYE-------NYSMKGRTYRYFSDPL-FAFGHGLSYTTF-----NMGT- 758
Query: 638 VAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG 697
A I KA + V++ V+NVGSKDG T+L++
Sbjct: 759 -------------AEIIKKADSIV----------VRIPVENVGSKDGTETVLLYIKNHQD 795
Query: 698 HWAPHKQLVAFEKVHVPAG 716
P K L F +V V AG
Sbjct: 796 PNGPIKSLRGFSRVFVKAG 814
>gi|260598593|ref|YP_003211164.1| beta-D-glucoside glucohydrolase [Cronobacter turicensis z3032]
gi|260217770|emb|CBA32207.1| Periplasmic beta-glucosidase [Cronobacter turicensis z3032]
Length = 765
Score = 270 bits (689), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 214/691 (30%), Positives = 335/691 (48%), Gaps = 106/691 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN + +GR+ + Y GL W+P V++ RDPRWGRG
Sbjct: 118 TIFPNSLGLASSFNLDAVKTVGRI------SAYEAADDGLNMTWAPMVDVTRDPRWGRGS 171
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V +QG S DR V S KHF AY +N VD
Sbjct: 172 EGFGEDTYLTSMMGKTMVESMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G VM + N +NG P +D +LK +RG+W G
Sbjct: 227 --MSSQRLFNDYMPPYKEALDAGS-GGVMIALNSLNGTPAASDGWLLKDLLRGDWGFKGI 283
Query: 285 IVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
+SD ++ +H T S PE+A AI++G+D+ + + + ++ G +S +
Sbjct: 284 TISDHGAIKEL--IKHGTASDPEDAVRVAIKSGVDMSMADEYYSKYLPNLIKSGKVSMEE 341
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPKD D H++ A AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKDSDPKDTNAESRLHRDDARNVARESLV 396
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + T+AV+GP +D M+G++ AG+A T LQG+ G A
Sbjct: 397 LLKNRLETLPLK--KSGTIAVVGPLADSKRDMMGSWSAAGVADQSVTLLQGMKNVAGDKA 454
Query: 450 RTIHQQGCK---DVACAD----------------DQLFGAAIDASRQADATILVMGLDQS 490
+ + +G D D ++ A++ ++Q+D + V+G Q
Sbjct: 455 KILFAKGANVTDDKGIVDFLNLYEPAVVVDKRTPKEMIDEAVNVAKQSDVVVAVVGEAQG 514
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R L LP Q++L+S + K P +LVLM+G P+ A K D + AI+
Sbjct: 515 MAHEASSRTDLTLPQSQRDLISALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
+ G GG AIAD+LFG NP GKLP++ +P+ + +P+ T P + +Y
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 629
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F +V++ ++ P R G + A+++
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--SVSDVK--LSAPTMKRDGKVTASVT---------- 674
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
V N G ++GA + ++ A P KQL FEKV + G Q V
Sbjct: 675 -----------VTNTGKREGATAVQMYLQDVTASMSRPVKQLRGFEKVTLKPGETQTVSF 723
Query: 723 -INIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
I+I K+ + R P G+ N+ IG
Sbjct: 724 PIDIDALKFWN--QRMKYDAEP-GKFNVFIG 751
>gi|313203744|ref|YP_004042401.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
gi|312443060|gb|ADQ79416.1| glycoside hydrolase family 3 domain protein [Paludibacter
propionicigenes WB4]
Length = 1286
Score = 269 bits (688), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 158/425 (37%), Positives = 231/425 (54%), Gaps = 18/425 (4%)
Query: 32 PKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEAL 91
P +T+ + S +R DLI RL+L+EK LL + AA+PRLGIK WSEAL
Sbjct: 23 PAKVSTKKPIYLNTSYSFEERAADLISRLTLEEKESLLGNSMAAIPRLGIKSMNVWSEAL 82
Query: 92 HGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLT 151
HG+ +G + G G TSFP + S+++ L + ++DEARA+ GT GLT
Sbjct: 83 HGI--LGGANQSVG-ISGPTSFPNSVALGSAWDPALMQREAMAIADEARAINQTGTKGLT 139
Query: 152 YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYD 211
YWSP V RDPRWGR E+ GEDP L+ + A +VRG+ G+D LK KH+ A
Sbjct: 140 YWSPVVEPIRDPRWGRTGESYGEDPFLAAEIAGGFVRGMVGNDPTYLKSVPCAKHYFA-- 197
Query: 212 LDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
N + DR ++ + +D+ + + P++ + + + S+M SYN VNGVPT A L
Sbjct: 198 --NNSEFDRHVSSSNMDSRDMREFYLAPYKKLIEQDNLPSIMSSYNAVNGVPTSASQLYL 255
Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTES 331
R + L GYI DC ++ Y ++ T EEA A ++AG+D DCG + +
Sbjct: 256 DTIARRTYGLKGYITGDCAAIEDIYTGHYYVKTAEEATAKGLKAGVDSDCGSIYQRYAIA 315
Query: 332 AVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAAR 391
A+++GL++ DI+ AL+N V+MR G FD P+ Y P V +P ++ LA E A
Sbjct: 316 ALKKGLITMADIDRALLNIFIVRMRTGEFD-PPAKVLYAQFQPNIVNSPANKALAKEIAT 374
Query: 392 QGIVLLKN------QGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIAC--GYTTPLQ 443
+ VLLKN +LPL+ + +A+IGP++D +G Y+G TP
Sbjct: 375 KTPVLLKNNISLKTNRKALPLNPADLKKIALIGPHAD--KVELGPYSGRPAQENMITPFA 432
Query: 444 GIGRY 448
GI +Y
Sbjct: 433 GIKKY 437
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 108/297 (36%), Positives = 148/297 (49%), Gaps = 36/297 (12%)
Query: 461 ACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKG 520
A AD ++ I + AD ++ +G D+ E DR LLLPG Q EL+ V+ A
Sbjct: 594 APADAKIDEQTIKTAAAADVAVVFVGTDEKTATEEADRLTLLLPGNQVELIKAVA-AVNP 652
Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
TI+V+ + G ++V KN I IIW GY GQA G AIA +LFG NPGGKL TWY
Sbjct: 653 NTIVVMQTLGCVEVEEFKNLQNIPGIIWVGYNGQAQGDAIASVLFGEVNPGGKLNGTWYK 712
Query: 581 QEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
+ +LP +T+ +R K GRT+ ++ V Y FG GMSYT F ++
Sbjct: 713 S--VKDLPEITDYTLRGGNGKN--GRTFWYFDKDVSYEFGFGMSYTTFEYS--------- 759
Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STP--PA 696
N IS +I + H K + V VDVKN G +G + V+ TP PA
Sbjct: 760 ----------NFRISKNSI-IPHDK-----ITVSVDVKNTGKVEGDEVIQVYMKTPDSPA 803
Query: 697 GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDR-SGTRRIPLGEHNIHIG 752
P K+L F++V +PAG + V I+I+ C L D T G++ IG
Sbjct: 804 SLQRPIKRLKGFKRVTLPAGQTKTVNIDIN-CADLWFWDMDKNTTTFDQGKYVFEIG 859
>gi|429101023|ref|ZP_19162997.1| Periplasmic beta-glucosidase [Cronobacter turicensis 564]
gi|426287672|emb|CCJ89110.1| Periplasmic beta-glucosidase [Cronobacter turicensis 564]
Length = 759
Score = 269 bits (688), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 214/691 (30%), Positives = 335/691 (48%), Gaps = 106/691 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN + +GRV + Y GL W+P V++ RDPRWGRG
Sbjct: 112 TIFPNSLGLASSFNLDAVKTVGRV------SAYEAADDGLNMTWAPMVDVTRDPRWGRGS 165
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V +QG S DR V S KHF AY +N VD
Sbjct: 166 EGFGEDTYLTSMMGKTMVESMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 220
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G VM + N +NG P +D +LK +RG+W G
Sbjct: 221 --MSSQRLFNDYMPPYKEALDAGS-GGVMIALNSLNGTPAASDGWLLKDLLRGDWGFKGI 277
Query: 285 IVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
+SD ++ +H T S PE+A AI++G+D+ + + + ++ G +S +
Sbjct: 278 TISDHGAIKEL--IKHGTASDPEDAVRVAIKSGVDMSMADEYYSKYLPNLIKSGKVSMEE 335
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPKD D H++ A AR+ +V
Sbjct: 336 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKDSDPKDTNAESRLHRDDARNVARESLV 390
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + T+AV+GP +D M+G++ AG+A T LQG+ G A
Sbjct: 391 LLKNRLETLPLK--KSGTIAVVGPLADSKRDMMGSWSAAGVADQSVTLLQGMKNVAGDKA 448
Query: 450 RTIHQQGCK---DVACAD----------------DQLFGAAIDASRQADATILVMGLDQS 490
+ + +G D D ++ A++ ++Q+D + V+G Q
Sbjct: 449 KILFAKGANVTDDKGIVDFLNLYEPAVVVDKRTPKEMIDEAVNVAKQSDVVVAVVGEAQG 508
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R L +P Q++L+S + K P +LVLM+G P+ A K D + AI+
Sbjct: 509 MAHEASSRTDLTIPQSQRDLISALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 565
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
+ G GG AIAD+LFG NP GKLP++ +P+ + +P+ T P + +Y
Sbjct: 566 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 623
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F +V++ ++ P R G + A+++
Sbjct: 624 RYFDEANGP-LYPFGYGLSYTTF--SVSDVK--LSAPTMKRDGKVTASVT---------- 668
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
V N G ++GA + ++ A P KQL FEKV + G Q V
Sbjct: 669 -----------VTNTGKREGATAVQMYLQDVTASMSRPVKQLRGFEKVTLKPGETQTVSF 717
Query: 723 -INIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
I+I K+ + R P G+ N+ IG
Sbjct: 718 PIDIDALKFWN--QRMKYDAEP-GKFNVFIG 745
>gi|429105352|ref|ZP_19167221.1| Periplasmic beta-glucosidase [Cronobacter malonaticus 681]
gi|426292075|emb|CCJ93334.1| Periplasmic beta-glucosidase [Cronobacter malonaticus 681]
Length = 765
Score = 269 bits (687), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 212/691 (30%), Positives = 337/691 (48%), Gaps = 106/691 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN + +GRV + Y GL W+P V++ RDPRWGRG
Sbjct: 118 TIFPNSLGLASSFNLDAVKTVGRV------SAYEAADDGLNMTWAPMVDVTRDPRWGRGS 171
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V +QG S DR V S KHF AY +N VD
Sbjct: 172 EGFGEDTYLTSMMGKTMVESMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G VM + N +NG P +D +LK +RG+W G
Sbjct: 227 --MSSQRLFNDYMPPYKEALDAGS-GGVMIALNSLNGTPAASDGWLLKDLLRGDWGFKGI 283
Query: 285 IVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
+SD ++ +H T S PE+A AI++G+D+ + + + ++ G ++ +
Sbjct: 284 TISDHGAIKEL--IKHGTASDPEDAVRVAIKSGVDMSMADEYYSKYLPNLIKSGKVTMAE 341
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPKD D H++ A + AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKDSDPQDTNAESRLHRDDARKVARESLV 396
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + T+AV+GP +D M+G++ AG+A T LQG+ G A
Sbjct: 397 LLKNRLETLPLK--KSGTIAVVGPLADSKRDMMGSWSAAGVADQSVTLLQGMKNVAGDKA 454
Query: 450 RTIHQQGCK---DVACAD----------------DQLFGAAIDASRQADATILVMGLDQS 490
+ ++ +G D D ++ A++ ++Q+D + V+G Q
Sbjct: 455 KILYAKGANVTDDKGIVDFLNLYEPAVVVDKRTPKEMIDEAVNVAKQSDVVVAVVGEAQG 514
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R L +P Q++L+S + K P +LVLM+G P+ A K D + AI+
Sbjct: 515 MAHEASSRTDLTIPQSQRDLISALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
+ G GG AIAD+LFG NP GKLP++ +P+ + +P+ T P + +Y
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 629
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F +V++ ++ P R G + A+++
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--SVSDVK--LSSPTMKRDGKVTASVT---------- 674
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
V N G ++GA + ++ A P KQL FEK+ + G Q V
Sbjct: 675 -----------VTNTGKREGATAVQMYLQDVTASMSRPVKQLRGFEKITLKPGESQTVSF 723
Query: 723 -INIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
I+I K+ + R P G+ N+ IG
Sbjct: 724 PIDIDALKFWN--QRMKYDAEP-GKFNVFIG 751
>gi|429110963|ref|ZP_19172733.1| Periplasmic beta-glucosidase [Cronobacter malonaticus 507]
gi|426312120|emb|CCJ98846.1| Periplasmic beta-glucosidase [Cronobacter malonaticus 507]
Length = 759
Score = 269 bits (687), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 212/691 (30%), Positives = 337/691 (48%), Gaps = 106/691 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN + +GRV + Y GL W+P V++ RDPRWGRG
Sbjct: 112 TIFPNSLGLASSFNLDAVKTVGRV------SAYEAADDGLNMTWAPMVDVTRDPRWGRGS 165
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V +QG S DR V S KHF AY +N VD
Sbjct: 166 EGFGEDTYLTSMMGKTMVESMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 220
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G VM + N +NG P +D +LK +RG+W G
Sbjct: 221 --MSSQRLFNDYMPPYKEALDAGS-GGVMIALNSLNGTPAASDGWLLKDLLRGDWGFKGI 277
Query: 285 IVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
+SD ++ +H T S PE+A AI++G+D+ + + + ++ G ++ +
Sbjct: 278 TISDHGAIKEL--IKHGTASDPEDAVRVAIKSGVDMSMADEYYSKYLPNLIKSGKVTMAE 335
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPKD D H++ A + AR+ +V
Sbjct: 336 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKDSDPQDTNAESRLHRDDARKVARESLV 390
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + T+AV+GP +D M+G++ AG+A T LQG+ G A
Sbjct: 391 LLKNRLETLPLK--KSGTIAVVGPLADSKRDMMGSWSAAGVADQSVTLLQGMKKVAGDKA 448
Query: 450 RTIHQQGCK---DVACAD----------------DQLFGAAIDASRQADATILVMGLDQS 490
+ ++ +G D D ++ A++ ++Q+D + V+G Q
Sbjct: 449 KILYAKGANVTDDKGIVDFLNLYEPAVVVDKRTPKEMIDEAVNVAKQSDMVVAVVGEAQG 508
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R L +P Q++L+S + K P +LVLM+G P+ A K D + AI+
Sbjct: 509 MAHEASSRTDLTIPQSQRDLISALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 565
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
+ G GG AIAD+LFG NP GKLP++ +P+ + +P+ T P + +Y
Sbjct: 566 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 623
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F +V++ ++ P R G + A+++
Sbjct: 624 RYFDEANGP-LYPFGYGLSYTTF--SVSDVK--LSSPTMKRDGKVTASVT---------- 668
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
V N G ++GA + ++ A P KQL FEK+ + G Q V
Sbjct: 669 -----------VTNTGKREGATAVQMYLQDVTASMSRPVKQLRGFEKITLKPGESQTVSF 717
Query: 723 -INIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
I+I K+ + R P G+ N+ IG
Sbjct: 718 PIDIDALKFWN--QRMKYDAEP-GKFNVFIG 745
>gi|389840350|ref|YP_006342434.1| beta-D-glucoside glucohydrolase [Cronobacter sakazakii ES15]
gi|417792062|ref|ZP_12439471.1| beta-D-glucoside glucohydrolase [Cronobacter sakazakii E899]
gi|429121663|ref|ZP_19182278.1| Periplasmic beta-glucosidase [Cronobacter sakazakii 680]
gi|333953864|gb|EGL71757.1| beta-D-glucoside glucohydrolase [Cronobacter sakazakii E899]
gi|387850826|gb|AFJ98923.1| beta-D-glucoside glucohydrolase [Cronobacter sakazakii ES15]
gi|426323856|emb|CCK13015.1| Periplasmic beta-glucosidase [Cronobacter sakazakii 680]
Length = 765
Score = 269 bits (687), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 212/694 (30%), Positives = 337/694 (48%), Gaps = 112/694 (16%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN + +GRV + Y GL W+P V++ RDPRWGRG
Sbjct: 118 TIFPNSLGLASSFNLDAVKTVGRV------SAYEAADDGLNMTWAPMVDVTRDPRWGRGS 171
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V +QG S DR V S KHF AY +N VD
Sbjct: 172 EGFGEDTYLTSMMGKTMVESMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G VM + N +NG P +D +LK +RG+W G
Sbjct: 227 --MSSQRLFNDYMPPYKEALDAGS-GGVMIALNSLNGTPAASDGWLLKDLLRGDWGFKGI 283
Query: 285 IVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
+SD ++ +H T S PE+A AI++G+D+ + + + ++ G ++ +
Sbjct: 284 TISDHGAIKEL--IKHGTASDPEDAVRVAIKSGVDMSMADEYYSKYLPNLIKSGKVTMAE 341
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPKD D H++ A + AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKDSDPKDTNAESRLHRDDARKVARESLV 396
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + T+AV+GP +D M+G++ AG+A T LQG+ G A
Sbjct: 397 LLKNRLETLPLK--KSGTIAVVGPLADSKRDMMGSWSAAGVADQSVTLLQGMKNVAGDKA 454
Query: 450 RTIHQQGCKDVACADDQ----------------------LFGAAIDASRQADATILVMGL 487
+ ++ +G DD+ + A++ ++Q+D + V+G
Sbjct: 455 KILYAKGAN---VTDDKGIVDFLNLYEPAVVVDKRTPKAMIDEAVNVAKQSDVVVAVVGE 511
Query: 488 DQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAII 547
Q + EA R L +P Q++L+S + K P +LVLM+G P+ A K D + AI+
Sbjct: 512 AQGMAHEASSRTDLTIPQSQRDLISALKATGK-PLVLVLMNGRPL--ALVKEDQQADAIL 568
Query: 548 WAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKR 601
+ G GG AIAD+LFG NP GKLP++ +P+ + +P+ T P + +
Sbjct: 569 ETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNK 626
Query: 602 YPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVT 661
Y R + GP +YPFG+G+SYT F +V++ ++ P R G + A+++
Sbjct: 627 YTSRYFDEANGP-LYPFGYGLSYTTF--SVSDVK--LSAPTMKRDGKVTASVT------- 674
Query: 662 HAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQR 720
V N G ++GA + ++ A P KQL FEK+ + G Q
Sbjct: 675 --------------VTNTGKREGATAVQMYLQDVTASMSRPVKQLRGFEKITLKPGESQT 720
Query: 721 VG--INIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
V I+I K+ + R P G+ N+ IG
Sbjct: 721 VSFPIDIDALKFWN--QRMKYDAEP-GKFNVFIG 751
>gi|429084349|ref|ZP_19147356.1| Periplasmic beta-glucosidase [Cronobacter condimenti 1330]
gi|426546667|emb|CCJ73397.1| Periplasmic beta-glucosidase [Cronobacter condimenti 1330]
Length = 759
Score = 269 bits (687), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 210/689 (30%), Positives = 333/689 (48%), Gaps = 102/689 (14%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN + +GR+ + Y GL W+P V++ RDPRWGRG
Sbjct: 112 TIFPNSLGLASSFNLDAVKTVGRI------SAYEAADDGLNMTWAPMVDVTRDPRWGRGS 165
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V +QG S DR V S KHF AY +N VD
Sbjct: 166 EGFGEDTYLTSMMGKTMVESMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 220
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G VM + N +NG P +D +LK +RGEW G
Sbjct: 221 --MSSQRLFNDYMPPYKEALDAGS-GGVMIALNSLNGTPAASDSWLLKDLLRGEWGFKGI 277
Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
+SD ++ +H T++ PE+A AI++G+D+ + + ++ G ++ +
Sbjct: 278 TISDHGAIKEL--IKHGTASGPEDAVRVAIKSGVDMSMADEYYSKYLPDLIKNGSVTMAE 335
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPKD D H++ A + AR+ +V
Sbjct: 336 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKDSDPQDTNAESRLHRDDARKVARESLV 390
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + T+AV+GP +D M+G++ AG+A T LQG+ G A
Sbjct: 391 LLKNRLDTLPLK--KSGTIAVVGPLADSKRDMMGSWSAAGVADQSITLLQGMKNVAGDKA 448
Query: 450 RTIHQQGCK---DVACAD----------------DQLFGAAIDASRQADATILVMGLDQS 490
+ ++ +G D D ++ A++ ++Q+D + V+G Q
Sbjct: 449 KILYAKGANVTDDKGIVDFLNLYEPAVVVDKRTPKEMIDEAVNVAKQSDVVVAVVGEAQG 508
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R L +P Q++L+S + K P +LVLM+G P+ A K + + AI+
Sbjct: 509 MAHEASSRTDLTIPQSQRDLISALKATGK-PLVLVLMNGRPL--ALVKENQQADAILETW 565
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
+ G GG AIAD+LFG NP GKLP++ +P+ + LP+ T P + +Y
Sbjct: 566 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQLPVYYSHLNTGRPYNPEKPNKYTS 623
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F +V++ ++ P R G + A+++
Sbjct: 624 RYFDEANGP-LYPFGYGLSYTTF--SVSDVK--LSSPTMKRDGKVTASVT---------- 668
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGI 723
V N G ++GA + ++ A P KQL FEK+ + G Q V
Sbjct: 669 -----------VTNTGKREGATAVQMYLQDVTASMSRPVKQLRGFEKITLKPGESQTVSF 717
Query: 724 NIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
I V R P G+ N+ IG
Sbjct: 718 PIDVDALKFWNQRMKYDAEP-GKFNVFIG 745
>gi|53712134|ref|YP_098126.1| beta-glucosidase [Bacteroides fragilis YCH46]
gi|52214999|dbj|BAD47592.1| periplasmic beta-glucosidase precursor [Bacteroides fragilis YCH46]
Length = 812
Score = 268 bits (685), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 220/689 (31%), Positives = 327/689 (47%), Gaps = 89/689 (12%)
Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
G T FP I AS++N L +GRV++ EA A G G + P +++ RDPRW R
Sbjct: 168 GTTVFPTSIGQASTWNPELIRQMGRVIAIEASA--QGAHIG---YGPVLDLARDPRWSRV 222
Query: 169 QETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
+ET GEDP L+G + VRG QG D V A+ KHF +Y W A +
Sbjct: 223 EETYGEDPYLNGVMGTALVRGFQGETLNDGKSVIATLKHFASY---GWTEGGHNGGTAHI 279
Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
++++E+ PFR V G + SVM SYN+++G P +L ++ W+ G++VS
Sbjct: 280 GERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVS 338
Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE---SAVQRGLLSEIDIN 344
D +VG + EAA A+ AG+D D G ++ E +AV+RG ++ I+
Sbjct: 339 DLYAVGGLRE-HGVAGNDYEAAIKAVNAGVDSDLGT--NVYAEQLVAAVKRGDVAVATID 395
Query: 345 NALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSL 404
A+ L+++ ++G+FD + + V + +H LA E ARQ IVLLKN+ L
Sbjct: 396 KAVRRILSLKFQMGLFDDPFVDEKQ---AVQLVASSEHTGLAREVARQSIVLLKNKDKLL 452
Query: 405 PLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYT--TPLQGI----GRYARTIHQQGCK 458
PL RT+AVIGPN+D M+G+Y T T L GI + R ++ +GC
Sbjct: 453 PLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCA 511
Query: 459 DVACADDQLFGAAIDASRQADATILVMG----LDQSIE-------------------AEA 495
V + F AI+ +R ADA ++VMG D S E E
Sbjct: 512 -VRDSSRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMESGEG 570
Query: 496 LDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQA 555
DRA L L GRQ EL+ ++S K +VL+ G P+ + A + AI+ A YPG
Sbjct: 571 YDRATLHLMGRQLELLEEISRLGKP---VVLIKGRPLLMEGAIQEAE--AIVDAWYPGMQ 625
Query: 556 GGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVV 615
GG A+AD+LFG NP G+L ++ P+ + LP+ R RY G
Sbjct: 626 GGNAVADVLFGDYNPAGRLTLS-VPRS-VGQLPVYYNTRRKGNRSRYVEE-----PGTPR 678
Query: 616 YPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVD 675
YPFG+G+SYT F +T ++VT + + V V
Sbjct: 679 YPFGYGLSYTTFSYT------------------------DMKVQVTEG-SDDCWVDVTVT 713
Query: 676 VKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVV 734
++N G+ DG L F + P KQL AF ++H+ AG + V + K L++
Sbjct: 714 IQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTLD-KKSLALY 772
Query: 735 DRSGTRRIPLGEHNIHIGGTKHSVSLHAA 763
+ G + G I +GG+ ++ A
Sbjct: 773 MQEGEWVVEPGRFTIMVGGSSEDITCRQA 801
>gi|383125188|ref|ZP_09945842.1| hypothetical protein BSIG_4348 [Bacteroides sp. 1_1_6]
gi|382983435|gb|EES66611.2| hypothetical protein BSIG_4348 [Bacteroides sp. 1_1_6]
Length = 954
Score = 268 bits (685), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 226/760 (29%), Positives = 362/760 (47%), Gaps = 105/760 (13%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAA--AVPRLGIKGYEWWSEALHGVSNVGP 99
+ SLP+ +RV L+ ++ ++K++L+ G +P L + EA+HG S
Sbjct: 170 YMDASLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFS---- 224
Query: 100 GTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNI 159
+G GAT FPQ + +++N L E + V+ DE A N A WSP +++
Sbjct: 225 ---YGS---GATIFPQALAMGATWNRKLTEEVAMVIGDETVAA-NTKQA----WSPVLDV 273
Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD 219
+D RWGR +ET GEDPVL + ++++G Q + + KHF + G D
Sbjct: 274 AQDARWGRCEETFGEDPVLVSQIGGAWIKGYQSRG-----LFTTPKHFGGHGAP-LGGRD 327
Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
+S++++ + VPFR + S+M +Y+ GVP +L++ +R EW
Sbjct: 328 SHDIG--LSEREMREIHLVPFRHAIRNYDCQSLMMAYSDYMGVPVAKSKELLQQILRQEW 385
Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPE-EAAADAIRAGLDLDCG-PFLGLHTESAVQRGL 337
NG+IVSDC ++G +H+T+ + EAA A+ AG+ +CG + A + G
Sbjct: 386 GFNGFIVSDCGAIGNLTARKHYTAKDKIEAANQALAAGIATNCGDTYNNKEVIQAAKDGR 445
Query: 338 LSEIDINNALVNTLTVQMRLGMFDGEPSS-QPYGHLGPKDVCTPDHQELALEAARQGIVL 396
++ D++N L R +F+ P + + P + H+E+A +AAR+ IV+
Sbjct: 446 INMEDLDNVCRTMLGTMFRNELFEKNPCKPLDWKKIYP-GWNSDSHKEMARQAARESIVM 504
Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYA-----GIACGYTTPLQG-IGRYAR 450
L+N+ LPLS RT+AV+GP +D G+Y G T ++G +G+ +
Sbjct: 505 LENKDNLLPLSKTL-RTIAVLGPGAD--DLQPGDYTPKLLPGQLKSVLTGIKGAVGKQTK 561
Query: 451 TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEA---------EALDRAGL 501
+++QGC D D+ A+ A+ Q+D I+V+G + EA E D A L
Sbjct: 562 VLYEQGC-DFTNPDETNIPKAVKAASQSDVVIMVLGDCSTSEATNDVRKTCGENNDWATL 620
Query: 502 LLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIA 561
+LPG+QQEL+ V K P IL+L +G P D+ A + + W PGQ GG A+A
Sbjct: 621 ILPGKQQELLEAVCATGK-PVILILQAGRPYDILKASEMCKAILVNW--LPGQEGGPAMA 677
Query: 562 DILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHG 621
D+LFG NP G+LPMT +P+ ++ LP+ + +RY +Y +Y FG G
Sbjct: 678 DVLFGDYNPAGRLPMT-FPR-HVGQLPL--YYNFKTSGRRYEYVDMEYYP---LYRFGFG 730
Query: 622 MSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGS 681
+SYT+F ++ N I KA + VQ VKNVGS
Sbjct: 731 LSYTSFEYS-------------------NLKIQEKA---------NGNVEVQATVKNVGS 762
Query: 682 KDGAHTLLVFSTPPAGHWAPHK----QLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRS 737
+ G ++ T +A K +L F ++H+ G + V + +S+++
Sbjct: 763 RAGDEVAQLYVT---DMYASVKTRVMELKDFARIHLQPGESKTVSFEMTPYD-ISLLNDR 818
Query: 738 GTRRIPLGEHNIHIGGT----------KHSVSLHAATLGV 767
R + GE I +GG KHSV GV
Sbjct: 819 MDRVVEKGEFKIMVGGMSPDYVAKNEIKHSVGYSDNKKGV 858
>gi|449307585|ref|YP_007439941.1| beta-D-glucoside glucohydrolase [Cronobacter sakazakii SP291]
gi|449097618|gb|AGE85652.1| beta-D-glucoside glucohydrolase [Cronobacter sakazakii SP291]
Length = 765
Score = 268 bits (685), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 212/694 (30%), Positives = 337/694 (48%), Gaps = 112/694 (16%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN + +GRV + Y GL W+P V++ RDPRWGRG
Sbjct: 118 TIFPNSLGLASSFNLDAVKTVGRV------SAYEAADDGLNMTWAPMVDVTRDPRWGRGS 171
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V +QG S DR V S KHF AY +N VD
Sbjct: 172 EGFGEDTYLTSMMGKTMVESMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G VM + N +NG P +D +LK +RG+W G
Sbjct: 227 --MSSQRLFNDYMPPYKEALDAGS-GGVMIALNSLNGTPAASDGWLLKDLLRGDWGFKGI 283
Query: 285 IVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
+SD ++ +H T S PE+A AI++G+D+ + + + ++ G ++ +
Sbjct: 284 TISDHGAIKEL--IKHGTASDPEDAVRVAIKSGVDMSMADEYYSKYLPNLIKSGKVTMAE 341
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPKD D H++ A + AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKDSDPKDTNAESRLHRDDARKVARESLV 396
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + T+AV+GP +D M+G++ AG+A T LQG+ G A
Sbjct: 397 LLKNRLETLPLK--KSGTIAVVGPLADSKRDMMGSWSAAGVADQSVTLLQGMKNVAGDKA 454
Query: 450 RTIHQQGCKDVACADDQ----------------------LFGAAIDASRQADATILVMGL 487
+ ++ +G DD+ + A++ ++Q+D + V+G
Sbjct: 455 KILYAKGAN---VTDDKGIVDFLNLYEPAVVVDKRTPKAMIDEAVNVAKQSDVVVAVVGE 511
Query: 488 DQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAII 547
Q + EA R L +P Q++L+S + K P +LVLM+G P+ A K D + AI+
Sbjct: 512 AQGMAHEASSRTDLTIPQSQRDLISALKATGK-PLVLVLMNGRPL--ALVKEDLQADAIL 568
Query: 548 WAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKR 601
+ G GG AIAD+LFG NP GKLP++ +P+ + +P+ T P + +
Sbjct: 569 ETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNK 626
Query: 602 YPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVT 661
Y R + GP +YPFG+G+SYT F +V++ ++ P R G + A+++
Sbjct: 627 YTSRYFDEANGP-LYPFGYGLSYTTF--SVSDVK--LSAPTMKRDGKVTASVT------- 674
Query: 662 HAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQR 720
V N G ++GA + ++ A P KQL FEK+ + G Q
Sbjct: 675 --------------VTNTGKREGATAVQMYLQDVTASMSRPVKQLRGFEKITLKPGESQT 720
Query: 721 VG--INIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
V I+I K+ + R P G+ N+ IG
Sbjct: 721 VSFPIDIDALKFWN--QRMKYDAEP-GKFNVFIG 751
>gi|29347190|ref|NP_810693.1| beta-glucosidase [Bacteroides thetaiotaomicron VPI-5482]
gi|29339089|gb|AAO76887.1| periplasmic beta-glucosidase precursor [Bacteroides
thetaiotaomicron VPI-5482]
Length = 950
Score = 268 bits (685), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 226/760 (29%), Positives = 362/760 (47%), Gaps = 105/760 (13%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAA--AVPRLGIKGYEWWSEALHGVSNVGP 99
+ SLP+ +RV L+ ++ ++K++L+ G +P L + EA+HG S
Sbjct: 166 YMDASLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFS---- 220
Query: 100 GTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNI 159
+G GAT FPQ + +++N L E + V+ DE A N A WSP +++
Sbjct: 221 ---YGS---GATIFPQALAMGATWNRKLTEEVAMVIGDETVAA-NTKQA----WSPVLDV 269
Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD 219
+D RWGR +ET GEDPVL + ++++G Q + + KHF + G D
Sbjct: 270 AQDARWGRCEETFGEDPVLVSQIGGAWIKGYQSRG-----LFTTPKHFGGHGAP-LGGRD 323
Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
+S++++ + VPFR + S+M +Y+ GVP +L++ +R EW
Sbjct: 324 SHDIG--LSEREMREIHLVPFRHAIRNYDCQSLMMAYSDYMGVPVAKSKELLQQILRQEW 381
Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPE-EAAADAIRAGLDLDCG-PFLGLHTESAVQRGL 337
NG+IVSDC ++G +H+T+ + EAA A+ AG+ +CG + A + G
Sbjct: 382 GFNGFIVSDCGAIGNLTARKHYTAKDKIEAANQALAAGIATNCGDTYNNKEVIQAAKDGR 441
Query: 338 LSEIDINNALVNTLTVQMRLGMFDGEPSS-QPYGHLGPKDVCTPDHQELALEAARQGIVL 396
++ D++N L R +F+ P + + P + H+E+A +AAR+ IV+
Sbjct: 442 INMEDLDNVCRTMLGTMFRNELFEKNPCKPLDWKKIYP-GWNSDSHKEMARQAARESIVM 500
Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYA-----GIACGYTTPLQG-IGRYAR 450
L+N+ LPLS RT+AV+GP +D G+Y G T ++G +G+ +
Sbjct: 501 LENKDNLLPLSKTL-RTIAVLGPGAD--DLQPGDYTPKLLPGQLKSVLTGIKGAVGKQTK 557
Query: 451 TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEA---------EALDRAGL 501
+++QGC D D+ A+ A+ Q+D I+V+G + EA E D A L
Sbjct: 558 VLYEQGC-DFTNPDETNIPKAVKAASQSDVVIMVLGDCSTSEATNDVRKTCGENNDWATL 616
Query: 502 LLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIA 561
+LPG+QQEL+ V K P IL+L +G P D+ A + + W PGQ GG A+A
Sbjct: 617 ILPGKQQELLEAVCATGK-PVILILQAGRPYDILKASEMCKAILVNW--LPGQEGGPAMA 673
Query: 562 DILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHG 621
D+LFG NP G+LPMT +P+ ++ LP+ + +RY +Y +Y FG G
Sbjct: 674 DVLFGDYNPAGRLPMT-FPR-HVGQLPL--YYNFKTSGRRYEYVDMEYYP---LYRFGFG 726
Query: 622 MSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGS 681
+SYT+F ++ N I KA + VQ VKNVGS
Sbjct: 727 LSYTSFEYS-------------------NLKIQEKA---------NGNVEVQATVKNVGS 758
Query: 682 KDGAHTLLVFSTPPAGHWAPHK----QLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRS 737
+ G ++ T +A K +L F ++H+ G + V + +S+++
Sbjct: 759 RAGDEVAQLYVT---DMYASVKTRVMELKDFARIHLQPGESKTVSFEMTPYD-ISLLNDR 814
Query: 738 GTRRIPLGEHNIHIGGT----------KHSVSLHAATLGV 767
R + GE I +GG KHSV GV
Sbjct: 815 MDRVVEKGEFKIMVGGMSPDYVAKNEIKHSVGYSDNKKGV 854
>gi|253574420|ref|ZP_04851761.1| glycoside hydrolase [Paenibacillus sp. oral taxon 786 str. D14]
gi|251846125|gb|EES74132.1| glycoside hydrolase [Paenibacillus sp. oral taxon 786 str. D14]
Length = 782
Score = 268 bits (685), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 233/809 (28%), Positives = 364/809 (44%), Gaps = 146/809 (18%)
Query: 35 ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEK-------------------VKLLISGAAA 75
+ L + S PIP+RV L+G ++L+EK +KL + A
Sbjct: 13 SEVEMLLYKDSSKPIPERVEHLLGLMTLEEKAGQLVQPFGWQTYEHKDGEIKLTEAFKAQ 72
Query: 76 VPRLGI-------KGYEWW-------------SEALHGVSN-------------VGPGTK 102
V G+ + W +EA++ + +G
Sbjct: 73 VKNGGVGSLYGVLRADPWTGVTLETGLSPREGTEAVNAIQRYAIENSRLGIPILIGEECS 132
Query: 103 FGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRD 162
G GAT FP ++ S++N L+ + R V+ E RA G +SP +++ RD
Sbjct: 133 HGHMAIGATVFPVPLSLGSTWNVELYREMCRAVARETRAQ-----GGAVTYSPVLDVVRD 187
Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGS--DGDRLKVAASCKHFTAYDLDNWNGVDR 220
PRWGR +E GED L + A + V GLQG DG+ VAA+ KHF Y G
Sbjct: 188 PRWGRTEECFGEDAYLISEMAVASVEGLQGESLDGED-SVAATLKHFVGY------GSSE 240
Query: 221 FHFNA---KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
NA + ++++ + +PFR V E AS+M +YN+++GVP + +L +RG
Sbjct: 241 GGRNAGPVHMGRRELLEVDLLPFRKAV-EAGAASIMPAYNEIDGVPCTTNEELLDGVLRG 299
Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLD-CGPFLGLHTESAVQRG 336
EW +G +++DC ++ + +AA AIRAG+D++ G G H AV+ G
Sbjct: 300 EWGFDGMVITDCGAIDMLASGHDVAEDGRDAAIQAIRAGIDMEMSGVMFGKHLVEAVRSG 359
Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
L E ++ A+ LT++ RLG+F+ P + P + + + +H ELA + A +G+VL
Sbjct: 360 QLEEEVLDRAVRRVLTLKFRLGLFE-RPYADP--ERAERVIGSAEHVELARQLASEGVVL 416
Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIA--CGYTTPLQGIGRY-----A 449
LKN+ LPLS T+AVIGPN+D +G+Y TT L GI
Sbjct: 417 LKNKDGVLPLS-ADAGTIAVIGPNADAGYNQLGDYTSPQPRSKVTTVLGGIRSKLAETPE 475
Query: 450 RTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIE----------------- 492
R ++ GC+ + + F A+ + +AD ++V+G + +
Sbjct: 476 RVLYAPGCR-INGNSREGFDVALSCAEKADTVVMVVGGSSARDFGEGTIDLRTGASKVTD 534
Query: 493 --------AEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIA 544
E +DR L L G Q EL+ ++ K P ++V ++G PI + D
Sbjct: 535 NAESDMDCGEGIDRMNLSLSGVQLELIQEIHKLGK-PLVVVYINGRPIAEPWI--DEHAD 591
Query: 545 AIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPG 604
AI+ A YPGQ GG AIADILFG NP G+L ++ P+ ++ +P+ KR G
Sbjct: 592 AILEAWYPGQEGGHAIADILFGDVNPSGRLTIS-IPK-HVGQVPVY------YHGKRSRG 643
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
+ Y YPFG+G+SYT F + L +IN S K
Sbjct: 644 KRYLEGDSQPRYPFGYGLSYTEFTYN----------NLKLESDTINKDGSTK-------- 685
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVGI 723
V V+V NVG + GA + ++ T A P K+L F K+ + G Q V
Sbjct: 686 -------VTVEVTNVGERAGAEVIQLYITDVASKVTRPAKELKGFRKIFLQPGETQTVEF 738
Query: 724 NIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
+ + L + ++ + GE +H+G
Sbjct: 739 TVG-PEQLQYIGQNYKPVVEPGEFRVHVG 766
>gi|167517889|ref|XP_001743285.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163778384|gb|EDQ91999.1| predicted protein [Monosiga brevicollis MX1]
Length = 521
Score = 268 bits (685), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 192/535 (35%), Positives = 270/535 (50%), Gaps = 33/535 (6%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYWSPNVNIFRD 162
++FP + A+SFNA+LW A G V++ E RA N T + PN+NI RD
Sbjct: 1 STFPGPLGMAASFNASLWFAKGDVLATELRAFSNTNWHRSNVDNQIQYTGFGPNINIARD 60
Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRF 221
PR+GR E PGEDP LSG YA VRG+ +D K+ A KHFTAY + +R
Sbjct: 61 PRFGRTSELPGEDPYLSGTYATHMVRGMMQADAAGHPKMLAYLKHFTAYSTE----TNRQ 116
Query: 222 HFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRL 281
H + VS D+ DT+ + M + A MCSYN VNG P+CA+ IL+ +R +W+
Sbjct: 117 HSDFNVSSHDLWDTYLPQYEMAFRSAQPAGAMCSYNAVNGRPSCANGYILRDVLRNQWQQ 176
Query: 282 -NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSE 340
N +I SDC ++ F AAA A+ G DL+ G + AV R L S
Sbjct: 177 PNAHITSDCGAISSLRGAPVFAPDDATAAAVALNNGTDLEMGSQVYASLAEAVARNLTSS 236
Query: 341 IDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
+ A + R G FD P++ + G +D+ + Q L EA Q +VLL+N+
Sbjct: 237 TLVEEAFRRAARILFRGGRFD-PPATVEWNAYGVQDINSSATQALVHEATAQSLVLLQNR 295
Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIAC--GYTTPLQGIGRYARTIHQQGCK 458
LPL+ + VAV+GP + ++ +YA + C G + +G ++ G
Sbjct: 296 HGILPLAP--GQRVAVVGPLVERGDALLSDYASLVCYDGTYDCIPTLGASVTAANKGGAT 353
Query: 459 ------DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVS 512
DV + AA+ A++ AD + +G D++IE E LDR L LP Q L+
Sbjct: 354 TVVPGVDVNSNNSSGLAAAVAAAQAADVVVAFLGTDKTIEREGLDRVNLTLP-GLQGLLL 412
Query: 513 KVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGG 572
+A+ P +L+L +GGP+ + N R AA++ GQ G T +A LFG N G
Sbjct: 413 DQLLATGTPVVLLLNNGGPLAIESYLN--RTAAVMETFNAGQFGATVMAKALFGQVNNFG 470
Query: 573 KLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNF 627
KLP T YP Y+ TE AM YPGRTYR+ V+PFG G+SYT F
Sbjct: 471 KLPYTVYPAGYV-----TEQAMNNYDMALYPGRTYRYLVQAPVFPFGFGLSYTTF 520
>gi|261880245|ref|ZP_06006672.1| beta-glucosidase [Prevotella bergensis DSM 17361]
gi|270333079|gb|EFA43865.1| beta-glucosidase [Prevotella bergensis DSM 17361]
Length = 854
Score = 268 bits (684), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 157/420 (37%), Positives = 234/420 (55%), Gaps = 29/420 (6%)
Query: 38 RTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNV 97
+ P+ L +R DL RL+L+EK K++ +G+ A+PRLGI +EWWSEALHG+
Sbjct: 21 QQFPYQNTDLSPKERAADLCSRLTLEEKSKIMQNGSPAIPRLGIPQFEWWSEALHGIGRN 80
Query: 98 GPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAM----YNGGT----AG 149
G AT FP + ASS++ L + + VSDE R GT G
Sbjct: 81 G----------FATVFPITMGMASSWDDALLQKVFDAVSDEGRVKAQQAKRSGTIKRYQG 130
Query: 150 LTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFT 208
L++W+PN+NIFRDPRWGRGQET GEDP L+ + + VRGLQG SD K+ A KHF
Sbjct: 131 LSFWTPNINIFRDPRWGRGQETYGEDPYLTSRMGLAVVRGLQGPSDSKYRKLLACAKHFA 190
Query: 209 AYDLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCAD 267
+ WN R FN + + ++D+ +T+ F+ V +G VA VMC+Y +++G P C +
Sbjct: 191 VHSGPEWN---RHTFNVEDLPERDLWETYLPAFKALVQQGDVAEVMCAYQRIDGQPCCGN 247
Query: 268 PNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPE--EAAADAIRAGLDLDCGPFL 325
LK +R EW G +VSDC +V ++ H +P+ A+A A+ +G D++CG
Sbjct: 248 NRFLKSILRNEWNYQGMVVSDCWAVPDFWKKGHHEVSPDATHASAKAVLSGTDVECGSDY 307
Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
+ AV+ G++ E D++ ++ L + LG FD + P+ + V + H++L
Sbjct: 308 S-NLPEAVRAGIIKEADVDVSVRRLLEARFALGDFDPD-ELVPWTKISESVVASKAHKQL 365
Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
AL+ AR+ +VLL+N LPL + V V+G N+ + M GNY+G T LQG+
Sbjct: 366 ALDMARKSMVLLQNND-ILPLKRSGQKIV-VVGANAIDSTMMWGNYSGYPTQTVTILQGL 423
Score = 113 bits (282), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 84/300 (28%), Positives = 131/300 (43%), Gaps = 55/300 (18%)
Query: 478 ADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLM 527
AD I V G+ +E E + DR + LP Q+E++ +S A + + V
Sbjct: 599 ADVVIFVGGISPRLEGEEMEVSDPGFKGGDRTTIELPQAQREVIKALSEAGRR-IVFVNC 657
Query: 528 SGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL 587
SG I A R+ AI+ A YPG+ GGTA+AD+LFG NP GKLP+T+Y + L
Sbjct: 658 SGSAI--ALTPESQRVDAILQAWYPGEQGGTAVADVLFGDYNPSGKLPVTFYKND--AQL 713
Query: 588 PMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHG 647
P R GRTYR++K ++PFG+G+SYT F
Sbjct: 714 P-------DFLDYRMAGRTYRYFKETPLFPFGYGLSYTQF-------------------- 746
Query: 648 SINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVA 707
T + + VQV V N G +DG + V+ P K L
Sbjct: 747 -------------TIGQPRYINNQVQVSVSNTGKRDGDEVVQVYIRRTDDAAGPIKTLRG 793
Query: 708 FEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGV 767
F++V + G ++V +++ + S T R+ G + + +G + + +L + +
Sbjct: 794 FQRVSLKVGETKQVSVSLPRESFEWWDASSNTMRVIPGNYEVMVGSSSMAKNLKTIMVNL 853
>gi|402299776|ref|ZP_10819350.1| glycoside hydrolase family protein [Bacillus alcalophilus ATCC
27647]
gi|401725066|gb|EJS98379.1| glycoside hydrolase family protein [Bacillus alcalophilus ATCC
27647]
Length = 937
Score = 268 bits (684), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 155/425 (36%), Positives = 240/425 (56%), Gaps = 29/425 (6%)
Query: 40 LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGP 99
PF L + RV DL+ R +L EK++L++ + RLGIK ++ +EA HG++ +G
Sbjct: 7 FPFQNTELLVEDRVKDLVNRFTLDEKIELMMQHQPEIKRLGIKAHKQGTEAAHGIAWLGE 66
Query: 100 GTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--GLTYWSPNV 157
AT FPQ I +S++N L IG V+ DEAR Y + GLT W+P V
Sbjct: 67 ----------ATVFPQNIGLSSTWNKDLMNKIGTVIGDEARVFYQKDPSKHGLTLWAPTV 116
Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
++ RDPRWGR +E GEDP L+G+ V+G+QG+D D LK A+ KHF L N N
Sbjct: 117 DMERDPRWGRTEEAYGEDPELTGQLTTELVKGMQGNDPDYLKSVATLKHF----LGNNNE 172
Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
++R +A + +++ + + F+ ++EGK S+M +YN +NG P P + + ++
Sbjct: 173 INRDKCSASIDPRNMHEYYLKAFKPAIVEGKAKSIMTAYNSINGTPALLHPYV-EEVVKQ 231
Query: 278 EWRLNGYIVSDC-DSVGVYYDTQHFTSTPEEAAADAIRAGLD--LDCGPFLGLHTESAVQ 334
EW ++G+IVSD D VG+ D +++ T EA AD I++G+D D A+
Sbjct: 232 EWGMDGFIVSDAGDVVGIVNDHKYY-ETYSEALADTIKSGIDSITDDAEISLQAMRDALS 290
Query: 335 RGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGI 394
+GLL E D++ AL+NT V+ RLG FD E + PY + + +H++L+ +AA++ +
Sbjct: 291 KGLLVEADLDKALINTFKVRFRLGEFDEERN--PYAFVSESKLLAKEHKQLSYQAAQEQV 348
Query: 395 VLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIG---RYART 451
VLLKN G LPL+ ++AVIGP SD ++ Y+G T L+ I + +
Sbjct: 349 VLLKNDGL-LPLNEGTTESIAVIGPLSD--DVLVDWYSGTPSYRITALEAIEHKLQNKKV 405
Query: 452 IHQQG 456
IH+ G
Sbjct: 406 IHKSG 410
Score = 105 bits (263), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 79/292 (27%), Positives = 141/292 (48%), Gaps = 39/292 (13%)
Query: 471 AIDASRQADATILVMGLDQSIE-AEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSG 529
A+ A+ + D ++V+G + I E +DR ++LP Q++L+ V + TI VL+S
Sbjct: 547 AVKAAAEQDYAVVVLGNNPFINGKECIDREDIVLPPEQEKLLQAVKQVNPN-TIAVLVSS 605
Query: 530 GPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM 589
P + +A + ++ I++ + G G A+AD+LFG NP G+L MTWY + LP
Sbjct: 606 YPYALNWANEN--LSGIMYTSHAGPELGKAVADVLFGDYNPAGRLSMTWYRS--VDQLPD 661
Query: 590 TEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSI 649
M + K RTY++++G V+YPFGHG+SYT F ++ P D
Sbjct: 662 I-MDYDIIKGK----RTYQYFEGEVLYPFGHGLSYTTFNYSNIQVNETTLSPDD------ 710
Query: 650 NATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLVAF 708
+KVT + ++N G G + ++ P K L+ F
Sbjct: 711 -------QVKVT------------ISIENTGRFAGDEVVQLYGRAERSRVKRPLKTLLDF 751
Query: 709 EKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRR-IPLGEHNIHIGGTKHSVS 759
+VH+ +G ++++ + + K L D + R + G +++ IG + +++
Sbjct: 752 TRVHLQSGEKKQIEFTVPISK-LEFWDVTQERYCVEKGVYSLLIGSSSKNIA 802
>gi|333380553|ref|ZP_08472244.1| hypothetical protein HMPREF9455_00410 [Dysgonomonas gadei ATCC
BAA-286]
gi|332826548|gb|EGJ99377.1| hypothetical protein HMPREF9455_00410 [Dysgonomonas gadei ATCC
BAA-286]
Length = 957
Score = 267 bits (682), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 217/739 (29%), Positives = 361/739 (48%), Gaps = 101/739 (13%)
Query: 46 SLPIPQRVNDLIGRLSLQEKVKLLISG--AAAVPRLGIKGYEWWSEALHGVSNVGPGTKF 103
+LP+ RV DL+ +++++K++LL G +P LG+ EA+HG S +
Sbjct: 174 NLPLESRVEDLLSVMTVEDKMELLREGWGIPGIPHLGVPAIHK-VEAIHGFS-------Y 225
Query: 104 GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDP 163
G GAT FPQ I +++N L EA + DE + + WSP +++ +D
Sbjct: 226 GS---GATIFPQSIGMGATWNKRLIEAAAMAIGDETVS-----ANAVQAWSPVLDVAQDA 277
Query: 164 RWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHF 223
RWGR +ET GEDPVL + ++++G Q S G + + KHF A+ G D
Sbjct: 278 RWGRCEETYGEDPVLVTEIGGAWIKGYQ-SKG----LMTTPKHFAAHGAP-LGGRDSHDI 331
Query: 224 NAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNG 283
+S++++ + VPFR + K S+M SY+ GVP +LK +R EW +G
Sbjct: 332 G--LSEREMREIHLVPFRDIYKKYKYQSIMMSYSDFLGVPVAKSKELLKGILRDEWGFDG 389
Query: 284 YIVSDCDSVGVYYDTQHFTSTPE-EAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEI 341
+IVSDC ++G +H+T+ + EAA A+ AG+ +CG + +A +RG L+
Sbjct: 390 FIVSDCGAIGNLTARKHYTAVDKVEAARQALAAGIATNCGDTYNDPDVIAAAKRGELNMD 449
Query: 342 DINNALVNTLTVQMRLGMFDGEPSS-QPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
D++ L R G+F+ P + + P +P+HQ LA + A++ IVLL+N+
Sbjct: 450 DLDFTCKTLLRTLFRNGLFENNPCKPLDWNKIYP-GWNSPEHQALARKTAQESIVLLENK 508
Query: 401 GPSLPLSHIRHRTVAVIGPNSD-------VTVTMIGNYAGIACGYTTPLQGIGRYARTIH 453
G LPLS +T+AVIGP +D + G + G + + ++
Sbjct: 509 GNILPLSK-SLKTIAVIGPGADNLQPGDYTSKPQPGQLKSVLTGIKA---AVNSSTKVLY 564
Query: 454 QQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEA---------EALDRAGLLLP 504
++GC+ + + A+ A+ AD +LV+G + EA E D A L+LP
Sbjct: 565 EEGCRFIGTEGTDI-AKAVKAAENADVAVLVLGDCSTSEALKGITNTSGENHDLATLILP 623
Query: 505 GRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADIL 564
G QQ+L+ V K P +L+L +G P ++++A + + + W PGQ GG A AD+L
Sbjct: 624 GEQQKLLEAVCKTGK-PVVLILQAGRPYNLSYAAENCQAVLVNW--LPGQEGGYATADVL 680
Query: 565 FGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPV--VYPFGHGM 622
FG NP G+LPMT +P++ LP+ + + GR Y + P +Y FG+G+
Sbjct: 681 FGDYNPAGRLPMT-FPRD-AAQLPLY-------YNFKTSGRVYDYVDMPYYPLYQFGYGL 731
Query: 623 SYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSK 682
SYT+F ++ N +++ +G + S+NAT++ N G
Sbjct: 732 SYTSFNYSDLN----ISLEKNG-NVSVNATVT-----------------------NTGKV 763
Query: 683 DGAHTLLVFSTPPAGHWAPHK----QLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSG 738
G + ++ T +A K +L F++V++ G ++V + LS+++
Sbjct: 764 AGDEVVQLYIT---DMYASVKTRVMELKDFDRVYLNPGESKKVSF-VLTPYQLSLLNDEM 819
Query: 739 TRRIPLGEHNIHIGGTKHS 757
R + G I +GG S
Sbjct: 820 DRVVEKGLFKIMVGGKSPS 838
>gi|330995911|ref|ZP_08319806.1| glycosyl hydrolase family 3 protein [Paraprevotella xylaniphila YIT
11841]
gi|329574250|gb|EGG55825.1| glycosyl hydrolase family 3 protein [Paraprevotella xylaniphila YIT
11841]
Length = 865
Score = 267 bits (682), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 176/472 (37%), Positives = 256/472 (54%), Gaps = 48/472 (10%)
Query: 1 MASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
M +T+ +LGL + + G A EP+ +P D + R +R DL+ RL
Sbjct: 1 MKNTVFVLWLGL---AVADIGWAQNEPYK-NP-DLSPR------------ERAEDLLKRL 43
Query: 61 SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
+L+EKV L+I+ + AV RLGIK Y WWSEALHGV+ G AT +P + A
Sbjct: 44 TLKEKVSLMINSSPAVERLGIKPYNWWSEALHGVARSGI----------ATVYPITMGMA 93
Query: 121 SSFNATLWEAIGRVVSDEARAMYNGG---------TAGLTYWSPNVNIFRDPRWGRGQET 171
S F+ EA+ VSDEARA Y+ GLT+W+PNVNIFRDPRWGRGQET
Sbjct: 94 SVFDDEAVEAVYTTVSDEARAKYHDAHKKGRYGQMNEGLTFWTPNVNIFRDPRWGRGQET 153
Query: 172 PGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNA-KVSK 229
GEDP L+ + + VRGLQG +D K+ A KH+ + R +F+ ++
Sbjct: 154 WGEDPYLTSRMGVAVVRGLQGPADAKYDKLHACAKHYAVHSGPE---AKRHYFDVEQLDP 210
Query: 230 QDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDC 289
+D+ +T+ F++ V E V VMC+Y + G P C +L + +R EW +VSDC
Sbjct: 211 RDLWETYLPAFKVLVQEADVKEVMCAYQRFEGEPCCGSKRLLHQILRDEWGFKYVVVSDC 270
Query: 290 DSVGVYYDTQHFTSTPE--EAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
++G +++ + P+ A+A A+ +G DL+CG + E+AV RGL++E I+ +L
Sbjct: 271 GAIGDFFNPGLHETHPDAATASASAVTSGTDLECGWGDYMQLEAAVDRGLITEHRIDTSL 330
Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
L + LG D + S P+ +G V H+++AL+ AR+ +VLL N G LPL
Sbjct: 331 CRLLEARFALGEMD-DDSLVPWSRIGIDTVDCQTHKQMALDIARKSLVLLHNDG-VLPLD 388
Query: 408 HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI-GRYARTI-HQQGC 457
R V V+GPN+ +V GNY G T L GI R R + +++GC
Sbjct: 389 KTRG-DVVVMGPNAVDSVMQWGNYEGTPSHTYTVLDGIRERLGRDVRYEKGC 439
Score = 108 bits (271), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 86/294 (29%), Positives = 130/294 (44%), Gaps = 50/294 (17%)
Query: 476 RQADATILVMGLDQSIEAEAL-----------DRAGLLLPGRQQELVSKVSMASKGPTIL 524
+ A + V G+ +E E DR + LP Q++++ + A K +
Sbjct: 599 KDAGTVVFVGGISPDLEGEEKNHVNCPGFAGGDRTSIELPQVQRDILKALKAAGK-KVVF 657
Query: 525 VLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYI 584
V SG + A AI+ A YPGQAGG A+AD+LFG NP GKLP+T+Y
Sbjct: 658 VNCSGSAM--ALVPELESCDAILQAWYPGQAGGLAVADVLFGDFNPSGKLPVTFYKN--T 713
Query: 585 TNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLD 643
LP + +M+ GRTYR+ ++PFG+G+SYT F +
Sbjct: 714 EQLPDFEDYSMK--------GRTYRYMTDKPLFPFGYGLSYTTFAIS------------K 753
Query: 644 GRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHK 703
GR S ++ +G+ +K T VKN G +DGA + V+ P K
Sbjct: 754 GRL-SKSSVEAGEGVKFT------------AQVKNTGKRDGAEVVQVYVRKVGDTGGPLK 800
Query: 704 QLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHS 757
L F +V + AG +RV I + + + T R+ GE+ I G + +
Sbjct: 801 SLRGFRRVELKAGESRRVSIELSPEAFEFFDTSTNTMRVVPGEYEIMYGNSSDT 854
>gi|380696432|ref|ZP_09861291.1| beta-glucosidase [Bacteroides faecis MAJ27]
Length = 954
Score = 267 bits (682), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 224/761 (29%), Positives = 358/761 (47%), Gaps = 107/761 (14%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISG--AAAVPRLGIKGYEWWSEALHGVSNVGP 99
+ VSLP+ +RV L+ ++ ++K++L+ G +P L + EA+HG S
Sbjct: 170 YMDVSLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFS---- 224
Query: 100 GTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNI 159
+G GAT FPQ + +++N L E + V+ DE A N A WSP +++
Sbjct: 225 ---YGS---GATIFPQALAMGATWNKKLTEEVAMVIGDETVAA-NTKQA----WSPVLDV 273
Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD 219
+D RWGR +ET GEDPVL + ++++G Q + + KHF + G D
Sbjct: 274 AQDARWGRCEETFGEDPVLVSQMGGAWIKGYQSRG-----LFTTPKHFGGHGAP-LGGRD 327
Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
+S++++ + VPFR + S+M +Y+ GVP +L++ +R EW
Sbjct: 328 SHDIG--LSEREMREIHLVPFRHAIRNYDCQSLMMAYSDYMGVPVAKSKELLQQILRQEW 385
Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPE-EAAADAIRAGLDLDCG-PFLGLHTESAVQRGL 337
NG+IVSDC ++G +H+T+ + EAA A+ AG+ +CG + A + G
Sbjct: 386 GFNGFIVSDCGAIGNLTARKHYTAKDKIEAANQALAAGIATNCGDTYNNKEVIQAAKDGR 445
Query: 338 LSEIDINNALVNTLTVQMRLGMFDGEPSS-QPYGHLGPKDVCTPDHQELALEAARQGIVL 396
++ D++N L+ R +F+ P + + P + H+E+A +AAR+ IV+
Sbjct: 446 INMEDLDNVCRTMLSTMFRNELFEKNPCKPLDWKKIYP-GWNSDSHKEMARQAARESIVM 504
Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSD-------VTVTMIGNYAGIACGYTTPLQGIGRYA 449
L+N+ LPLS RT+AV+GP +D + G + G + +G+
Sbjct: 505 LENKENLLPLSKTL-RTIAVVGPGADDLQPGDYTPKLLPGQLKSVLTGIKS---AVGKQT 560
Query: 450 RTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEA---------EALDRAG 500
+ +++QGC D D A+ + Q+D I+V+G + EA E D A
Sbjct: 561 KVLYEQGC-DFTNPDATNIPKAVKTASQSDVVIMVLGDCSTSEATNDVRKTCGENNDWAT 619
Query: 501 LLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
L+LPG+QQEL+ V K P IL+L +G P D+ A + + W PGQ GG A+
Sbjct: 620 LILPGKQQELLEAVCATGK-PVILILQAGRPYDILKASEMCKAILVNW--LPGQEGGPAM 676
Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGH 620
AD+LFG NP G+LPMT +P+ ++ LP+ + +RY +Y +Y FG
Sbjct: 677 ADVLFGDYNPAGRLPMT-FPR-HVGQLPL--YYNFKTSGRRYEYVDMEYYP---LYRFGF 729
Query: 621 GMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVG 680
G+SYT+F ++ N I KA + VQ VKNVG
Sbjct: 730 GLSYTSFEYS-------------------NLKIQEKA---------NGNVEVQATVKNVG 761
Query: 681 SKDGAHTLLVFSTPPAGHWAPHK----QLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDR 736
S G ++ T +A K +L F ++H+ G + V + +S+++
Sbjct: 762 SCAGDEVAQLYVT---DMYASVKTRVMELKDFTRIHLQPGESKTVSFEMTPYD-ISLLND 817
Query: 737 SGTRRIPLGEHNIHIGGT----------KHSVSLHAATLGV 767
R + GE I IGG KHSV GV
Sbjct: 818 RMDRVVEKGEFKIMIGGMSPDYVAKNEIKHSVGYSDNKKGV 858
>gi|294146655|ref|YP_003559321.1| putative beta-glucosidase [Sphingobium japonicum UT26S]
gi|292677072|dbj|BAI98589.1| putative beta-glucosidase [Sphingobium japonicum UT26S]
Length = 874
Score = 267 bits (682), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 152/393 (38%), Positives = 218/393 (55%), Gaps = 33/393 (8%)
Query: 56 LIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQ 115
++ RL+L EKV L++ A A+PRLGI Y WW+E+LHG P T+FP+
Sbjct: 40 IVDRLTLDEKVAQLLNVAPAIPRLGIPAYNWWTESLHGAIGAVP----------TTNFPE 89
Query: 116 VITTASSFNATLWEAIGRVVSDEARAMYNGGT---------AGLTYWSPNVNIFRDPRWG 166
I A++F+A L + + + E RA++ G GL WSPN+NIFRDPRWG
Sbjct: 90 PIGLAATFDAPLIQQVAAAIGTEVRALHTLGRRTGHLGRIGTGLDTWSPNINIFRDPRWG 149
Query: 167 RGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAK 226
RGQET GEDP L+ + ++V+G+QG+D D V A+ KHF + + R N
Sbjct: 150 RGQETYGEDPYLTARIGVAFVQGMQGNDPDLPLVVATPKHFAVHSGPEPS---RHTDNIF 206
Query: 227 VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIV 286
+ +D+EDT+ FR ++EGK S+MC+YN+V+G P C +LK +RG W GY+V
Sbjct: 207 ATPRDLEDTYLPAFRAAIVEGKAGSIMCAYNRVDGQPACGSHMLLKDYLRGAWGFKGYVV 266
Query: 287 SDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-------PFLGLHTESAVQRGLLS 339
SDCD+V Y+ + A A+R G+D +C P LG + ++ RG ++
Sbjct: 267 SDCDAVVDIYEHHKYAPDAATGVAVALRHGVDSECNNATLGGRPDLGDRYKESLARGHIA 326
Query: 340 EIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKN 399
DI+ ALV + + R G G S +P + P + TPDHQ LAL A + +VLLKN
Sbjct: 327 MGDIDAALVRLFSARYRNGDLPGLSSRKP-NAIPPSAIGTPDHQALALSTAEKSLVLLKN 385
Query: 400 QGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYA 432
G LPL +A+ GP +D T + GNY+
Sbjct: 386 DG-ILPLRP--GARIALAGPLADATRVLRGNYS 415
Score = 125 bits (313), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 93/278 (33%), Positives = 142/278 (51%), Gaps = 50/278 (17%)
Query: 486 GLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
GL +E E + DR L LP Q+ + K K P ++V M+G ID++
Sbjct: 611 GLTSDLEGEEMPVKVEGFDGGDRTTLDLPADQRAFLEKARALGK-PLVIVAMNGSAIDLS 669
Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL-PMTEMAM 594
+AK + AAI+ A YPGQ+GG A+ ++L G ++PGG+LP+T+Y + +L P T+ M
Sbjct: 670 WAKEN--AAAIVEAWYPGQSGGLAVGNVLSGRADPGGRLPVTFY--RNVNDLPPFTDYGM 725
Query: 595 RPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATIS 654
GRTYR+++G VYPFGHG+SYT+F + AP V P+D GS+
Sbjct: 726 E--------GRTYRYFRGTPVYPFGHGLSYTSFRY----APLTVE-PVD---GSVE---- 765
Query: 655 GKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVP 714
K +KV A + N G++ G ++ TPP AP L F++V +
Sbjct: 766 -KGLKVRTA------------ITNTGARAGDDVAQLYITPPRFEGAPRLALRGFQRVTLK 812
Query: 715 AGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
G + V + + LS V +G R + G++ + IG
Sbjct: 813 PGETRNVEFTLS-PRDLSFVTMAGERGLIPGDYGLSIG 849
>gi|357046682|ref|ZP_09108302.1| glycosyl hydrolase family 3 protein, partial [Paraprevotella clara
YIT 11840]
gi|355530484|gb|EHG99896.1| glycosyl hydrolase family 3 protein, partial [Paraprevotella clara
YIT 11840]
Length = 677
Score = 267 bits (682), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 174/472 (36%), Positives = 254/472 (53%), Gaps = 48/472 (10%)
Query: 1 MASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
M +T+ +LGL++ + G A EP+ +P L +R DL+ RL
Sbjct: 1 MKNTVFVLWLGLVV---ADIGWAQNEPYK-NP-------------DLSPRERAEDLLKRL 43
Query: 61 SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
+L+EK L+I+ + AV RLGIK Y WWSEALHGV+ G AT +P + A
Sbjct: 44 TLKEKASLMINSSPAVERLGIKPYNWWSEALHGVARSGV----------ATVYPITMGMA 93
Query: 121 SSFNATLWEAIGRVVSDEARAMYNGG---------TAGLTYWSPNVNIFRDPRWGRGQET 171
S F+ EA+ VSDEARA Y+ GLT+W+PNVNIFRDPRWGRGQET
Sbjct: 94 SVFDDKAVEAVYTTVSDEARAKYHDAHKKGRYGQMNEGLTFWTPNVNIFRDPRWGRGQET 153
Query: 172 PGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNA-KVSK 229
GEDP L+ + + VRGLQG +D K+ A KH+ + R +F+ ++
Sbjct: 154 WGEDPYLTSRMGVAVVRGLQGPADAKYDKLHACAKHYAVHSGPE---AKRHYFDVEQLDP 210
Query: 230 QDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDC 289
+D+ +T+ F++ V E V VMC+Y + G P C +L + +R EW +VSDC
Sbjct: 211 RDLWETYLPAFKVLVQEADVKEVMCAYQRFEGEPCCGSERLLHQILRDEWGFKYVVVSDC 270
Query: 290 DSVGVYYDTQHFTSTPE--EAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
++G +++ + P+ A+A A+ +G DL+CG + E+AV RGL++E I+ +L
Sbjct: 271 GAIGDFFNPGLHETHPDAATASASAVTSGTDLECGWGDYMQLEAAVDRGLITEHRIDTSL 330
Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
L + LG D + S P+ +G V H+++AL+ AR+ +VLL N G LPL
Sbjct: 331 CRLLEARFALGEMD-DDSLVPWSRIGIDTVDCQTHKQMALDIARKSLVLLHNDG-VLPLD 388
Query: 408 HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI-GRYARTI-HQQGC 457
R V V+GPN+ +V GNY G T L GI R R + +++GC
Sbjct: 389 KTRG-DVVVMGPNAVDSVMQWGNYEGTPSHTYTVLDGIRERLGRDVRYEKGC 439
>gi|329956868|ref|ZP_08297436.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
gi|328523625|gb|EGF50717.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
Length = 864
Score = 266 bits (681), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 165/424 (38%), Positives = 233/424 (54%), Gaps = 33/424 (7%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
+R DL+ +L+L+EKV L++ + V RLGIK Y WW+EALHGV+ G A
Sbjct: 33 ERAEDLVKQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSG----------WA 82
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRD 162
T FPQ I A+SF+ VSDEARA +A GLT W+P VNI+RD
Sbjct: 83 TVFPQPIGMAASFSPEALHTAFVAVSDEARAKNAAYSAEGSYKRYQGLTIWTPTVNIYRD 142
Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL--KVAASCKHFTAYDLDNWNGVDR 220
PRWGRG ET GEDP L+ S V+GLQ D + KV A KHF + WN R
Sbjct: 143 PRWGRGIETYGEDPYLASVMGVSVVKGLQCLDENEKYDKVHACAKHFAVHSGPEWN---R 199
Query: 221 FHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
FNA+ +S +D+ +T+ PF V EGKV VMC+YN+ G P C +L +R EW
Sbjct: 200 HSFNAENISPRDLYETYLPPFEALVKEGKVKEVMCAYNRFEGEPCCGSNRLLNHILRREW 259
Query: 280 RLNGYIVSDCDSVGVYY-DTQHFT-STPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGL 337
+G +V+DC ++ ++ D H T + A++ A+ +G DL+CG TE V++G
Sbjct: 260 GYDGIVVADCSAISDFHNDKGHKTHADAASASSAAVLSGTDLECGSNYRSLTE-GVKKGF 318
Query: 338 LSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLL 397
+ E DI+ ++ L + LG D EP + + VC+ H L+L+ AR+ + LL
Sbjct: 319 IDEADIDRSVKRLLQARFELGEMD-EPDQVRWAQIPYSVVCSDKHDSLSLDMARKSMTLL 377
Query: 398 KNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIH 453
N+ +LPL T+AV+GPN++ +V GNY G+ T L GI G+ + I+
Sbjct: 378 LNKNNALPLER-GGTTIAVMGPNANDSVMQWGNYNGLPKRTITILDGIRSAMGKDDKLIY 436
Query: 454 QQGC 457
+QGC
Sbjct: 437 EQGC 440
Score = 120 bits (300), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 91/321 (28%), Positives = 142/321 (44%), Gaps = 50/321 (15%)
Query: 459 DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQ 508
D+ ++ ++ + AD I G+ +E E + DR + LP Q+
Sbjct: 583 DLGFKEEADIQRSVAKVKDADVVIFAGGISPQLEGEEMGVKLPGFRGGDRTDIELPAVQR 642
Query: 509 ELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTS 568
E++ + A K I V SG PI A AI+ A YPGQ+GG A+A++LFG
Sbjct: 643 EMIKALHDAGK-KVIFVNCSGSPI--AMEPETEYCQAILQAWYPGQSGGKAVAEVLFGDY 699
Query: 569 NPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFV 628
NP G+LP T+Y + LP E G TYRF+ G ++PFG+G+SYT F
Sbjct: 700 NPAGRLPATFYRN--LAQLPDFE-------DYNMAGHTYRFFNGEPLFPFGYGLSYTTF- 749
Query: 629 HTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTL 688
++G I S + + T+ + V V N GS++G +
Sbjct: 750 ----------------KYGKIQLKSSAQTDE---------TVKITVPVTNTGSRNGEEVV 784
Query: 689 LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDR-SGTRRIPLGEH 747
V+ P K L AF++V++PAG +V + + K L D + T R
Sbjct: 785 QVYLKKQGETDGPVKTLRAFKRVYIPAGKTVKVELEL-TPKQLEWWDSATNTMRTMPNTF 843
Query: 748 NIHIGGTKHSVSLHAATLGVI 768
++ +GG+ L TL ++
Sbjct: 844 DVMVGGSSKDSDLQKKTLKLL 864
>gi|404406439|ref|ZP_10998023.1| glycoside hydrolase 3 [Alistipes sp. JC136]
Length = 925
Score = 266 bits (681), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 199/686 (29%), Positives = 329/686 (47%), Gaps = 82/686 (11%)
Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLT-YWSPNVNIFRDPRWGRG 168
AT+FP + ++N L GR+V EAR + G T ++P +++ RD RWGR
Sbjct: 180 ATNFPSQLGMGHTWNRELLRKTGRIVGREARLL------GYTNIYAPVLDVGRDQRWGRY 233
Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVS 228
+E GE P L + + G+Q +VA++ KHF AY + + ++
Sbjct: 234 EEVFGESPYLVAELGVAMASGMQTD----YQVASTAKHFAAYSNNKGAREGMSRVDPQMP 289
Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
+++E+ +PFR + + VM SYN +GVP L +RGE GY+VSD
Sbjct: 290 PREVENIHLMPFREVIRRAGILGVMSSYNDYDGVPIQGSRYWLTERLRGEMGFRGYVVSD 349
Query: 289 CDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAV-------QRGLLSEI 341
SV ++ H +A +I AGL++ C + H E+ V + GL++E
Sbjct: 350 SGSVEYLHNKHHTAVNQLDAVRQSIEAGLNVRCNFW---HPETYVMPLRQLLREGLITEE 406
Query: 342 DINNALVNTLTVQMRLGMFDGEPSSQPYGH---LGPKDVCTPDHQELALEAARQGIVLLK 398
+++ + + L V+ +G+FD +PY ++V P+H E+AL+A+R+ IVLLK
Sbjct: 407 LLDSRVRDVLRVKFLVGLFD-----RPYQTDLAAADREVDGPEHNEVALQASRESIVLLK 461
Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGR----YARTIHQ 454
N+ +LPL + R +AV+GPN+D +G+Y +A T+ L G+ R +++
Sbjct: 462 NENSTLPLDARKIRRIAVLGPNADARGFALGHYGPLAVEVTSVLDGLKRNLGARCEIVYE 521
Query: 455 QGCK--DVACADDQLFGA------------AIDASRQADATILVMGLDQSIEAEALDRAG 500
+GC+ D A ++F A +A+ ++D ++V+G E R+
Sbjct: 522 KGCELVDAAWPLSEIFREEMTPEEKAGIRRAAEAASESDVAVVVLGGGSRTCGENCSRSS 581
Query: 501 LLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
L LPGRQ+EL+ V K PT+LV+++G P + +A D + AI+ A YPG GG A+
Sbjct: 582 LDLPGRQEELLRAVEATGK-PTVLVMINGRPNSINWA--DAHVDAIVEAWYPGAHGGQAV 638
Query: 561 ADILFGTSNPGGKLPMTWYPQ---EYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYP 617
++LFG NPGGKL +T +P+ + N P A PG G +Y
Sbjct: 639 YEVLFGEYNPGGKLTVT-FPRHVGQIPFNFPYKPAANTDGGLTPGPGGNQTRING-ALYD 696
Query: 618 FGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVK 677
FG+G+SYT F + + I + I+ V DV
Sbjct: 697 FGYGLSYTTFEYA-------------------DLRIEPQTIRQDEP------FRVSFDVT 731
Query: 678 NVGSKDGAHTLLVFSTPPAGHWAPH-KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDR 736
N G +DG + ++ + K L F++VH+ AG +RV + + + LS+++
Sbjct: 732 NTGQRDGDEVVQLYIHDVLSSVTTYEKNLRGFDRVHLKAGETRRVTMQVR-PQDLSLLNE 790
Query: 737 SGTRRIPLGEHNIHIGGTKHSVSLHA 762
R + G+ ++ IG + + L A
Sbjct: 791 RMERVVEPGDFDVLIGASSTDIRLKA 816
>gi|332877556|ref|ZP_08445303.1| glycosyl hydrolase family 3 protein, partial [Capnocytophaga sp.
oral taxon 329 str. F0087]
gi|332684662|gb|EGJ57512.1| glycosyl hydrolase family 3 protein [Capnocytophaga sp. oral taxon
329 str. F0087]
Length = 676
Score = 266 bits (681), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 174/472 (36%), Positives = 254/472 (53%), Gaps = 48/472 (10%)
Query: 1 MASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
M +T+ +LGL++ + G A EP+ +P L +R DL+ RL
Sbjct: 1 MKNTVFVLWLGLVV---ADIGWAQNEPYK-NP-------------DLSPRERAEDLLKRL 43
Query: 61 SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
+L+EK L+I+ + AV RLGIK Y WWSEALHGV+ G AT +P + A
Sbjct: 44 TLKEKASLMINSSPAVERLGIKPYNWWSEALHGVARSGV----------ATVYPITMGMA 93
Query: 121 SSFNATLWEAIGRVVSDEARAMYNGG---------TAGLTYWSPNVNIFRDPRWGRGQET 171
S F+ EA+ VSDEARA Y+ GLT+W+PNVNIFRDPRWGRGQET
Sbjct: 94 SVFDDKAVEAVYTTVSDEARAKYHDAHKKGRYGQMNEGLTFWTPNVNIFRDPRWGRGQET 153
Query: 172 PGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNA-KVSK 229
GEDP L+ + + VRGLQG +D K+ A KH+ + R +F+ ++
Sbjct: 154 WGEDPYLTSRMGVAVVRGLQGPADAKYDKLHACAKHYAVHSGPE---AKRHYFDVEQLDP 210
Query: 230 QDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDC 289
+D+ +T+ F++ V E V VMC+Y + G P C +L + +R EW +VSDC
Sbjct: 211 RDLWETYLPAFKVLVQEADVKEVMCAYQRFEGEPCCGSERLLHQILRDEWGFKYVVVSDC 270
Query: 290 DSVGVYYDTQHFTSTPE--EAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
++G +++ + P+ A+A A+ +G DL+CG + E+AV RGL++E I+ +L
Sbjct: 271 GAIGDFFNPGLHETHPDAATASASAVTSGTDLECGWGDYMQLEAAVDRGLITEHRIDTSL 330
Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
L + LG D + S P+ +G V H+++AL+ AR+ +VLL N G LPL
Sbjct: 331 CRLLEARFALGEMD-DDSLVPWSRIGIDTVDCQTHKQMALDIARKSLVLLHNDG-VLPLD 388
Query: 408 HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI-GRYARTI-HQQGC 457
R V V+GPN+ +V GNY G T L GI R R + +++GC
Sbjct: 389 KTRG-DVVVMGPNAVDSVMQWGNYEGTPSHTYTVLDGIRERLGRDVRYEKGC 439
>gi|427384502|ref|ZP_18881007.1| hypothetical protein HMPREF9447_02040 [Bacteroides oleiciplenus YIT
12058]
gi|425727763|gb|EKU90622.1| hypothetical protein HMPREF9447_02040 [Bacteroides oleiciplenus YIT
12058]
Length = 862
Score = 266 bits (679), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 164/432 (37%), Positives = 238/432 (55%), Gaps = 32/432 (7%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
+ SL +R L+ L+L+EK +L++ + +V RLGIK Y WW+EALHGV+ G
Sbjct: 22 YKNTSLSPEERAELLVKELTLEEKARLMMDASQSVERLGIKPYNWWNEALHGVARAGL-- 79
Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYW 153
AT FPQ I A+SF+ + + VSDE RA T+ GLT W
Sbjct: 80 --------ATVFPQPIGMAASFDPEMVYEVFNAVSDEVRAKNTYYTSRDSRERYQGLTMW 131
Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL 212
+P VNI+RDPRWGRG ET GEDP L+ + V+GLQG +DG K+ A KHF +
Sbjct: 132 TPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGPADGKYDKLHACAKHFAVHSG 191
Query: 213 DNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
WN R FNA+ ++ +D+ +T+ PF V EG+V VMC+YN+ G P C +L
Sbjct: 192 PEWN---RHSFNAENINPRDLYETYLPPFEALVKEGEVKEVMCAYNRFEGEPCCGSDRLL 248
Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPE--EAAADAIRAGLDLDCGPFLGLHT 329
+ +RGEW +G +V+DC ++ +Y+ + + P A+A A+ +G DLDCG
Sbjct: 249 MQILRGEWGFDGIVVADCGAIADFYNDRGHHTHPNAESASAAAVVSGTDLDCGSSYKALV 308
Query: 330 ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEA 389
ES V++GL++E ++ ++ L + LG D EP + + V + H LALE
Sbjct: 309 ES-VKKGLIAEETVDVSVKRLLKARFELGEMD-EPEKVSWTKIPFSVVASAAHDSLALEI 366
Query: 390 ARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI---- 445
AR+ + LL N+ LPL TVAV+GPN++ +V GNY G+ T L G+
Sbjct: 367 ARKSMTLLMNKDNFLPLKR-GGLTVAVMGPNANDSVMQWGNYNGMPPHTVTILDGVRNIL 425
Query: 446 GRYARTIHQQGC 457
G + I++QGC
Sbjct: 426 GAEDKLIYEQGC 437
Score = 123 bits (309), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 91/319 (28%), Positives = 143/319 (44%), Gaps = 47/319 (14%)
Query: 459 DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQ 508
D+ D +++ + AD I G+ S+E E + DR + LP Q+
Sbjct: 580 DLGFKKDVDIRKSVERVKDADVVIFASGISPSLEGEEMGVNLPGFKKGDRTDIELPAVQR 639
Query: 509 ELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTS 568
EL+ + A K ILV SG PI + A++ A YPGQ GGTA+A++LFGT
Sbjct: 640 ELIDALHRAGK-KIILVNCSGSPI--GLEPETKKCEALLQAWYPGQQGGTAVAEVLFGTY 696
Query: 569 NPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFV 628
NP G+LP+T+Y Y LP E GRTYR+ + ++PFG+G+SYT F
Sbjct: 697 NPAGRLPVTFYRNIY--QLPDFE-------DYNMTGRTYRYMQDIPLFPFGYGLSYTTFN 747
Query: 629 HTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTL 688
+ +S ++T + +LT V V N G + G +
Sbjct: 748 Y---------------------GKVSLNKGEITDGQTVKLT----VSVTNTGERGGEEVV 782
Query: 689 LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHN 748
V+ P K L AF+++ +P+G V + + ++S T R+ G ++
Sbjct: 783 QVYLKKQGDTKGPVKTLRAFKRISIPSGQTVNVEFELKDKELEWWDEQSNTVRVCPGNYD 842
Query: 749 IHIGGTKHSVSLHAATLGV 767
I +GG+ L ++ V
Sbjct: 843 IMVGGSSRDGDLQQISIIV 861
>gi|261340566|ref|ZP_05968424.1| periplasmic beta-glucosidase [Enterobacter cancerogenus ATCC 35316]
gi|288316974|gb|EFC55912.1| periplasmic beta-glucosidase [Enterobacter cancerogenus ATCC 35316]
Length = 765
Score = 265 bits (678), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 212/689 (30%), Positives = 336/689 (48%), Gaps = 102/689 (14%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN + +GR+ + Y GL W+P V++ RDPRWGR
Sbjct: 118 TVFPISLGLASSFNLDAVKTVGRI------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V +QG S DR V S KHF AY +N VD
Sbjct: 172 EGFGEDTYLTATMGKTMVEAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R +W G
Sbjct: 227 --MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDAWLLKDVLRDQWGFKGI 283
Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T+ PE+A A+++G+++ + + V+ G ++ +
Sbjct: 284 TVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYYSKYLPGLVKSGKVTMAE 341
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPKD D H++ A E AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKDSDPADTNAESRLHRKEAREVARESLV 396
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + T+AV+GP +D ++G++ AG+A T L GI G A
Sbjct: 397 LLKNRLDTLPLK--KSGTIAVVGPLADSKRDVMGSWSAAGVADQSVTVLTGIKSAVGDNA 454
Query: 450 RTIHQQGC-----KDVACADDQ--------------LFGAAIDASRQADATILVMGLDQS 490
+ ++ +G KD+ +Q + A++A++Q+D + V+G Q
Sbjct: 455 KVVYAKGANVTNDKDIVTFLNQYEEAVKVDPRTPKEMIDEAVNAAKQSDVVVAVVGEAQG 514
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 515 MAHEASSRTDITIPQSQRDLIAALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM---RP---SQSKRYPG 604
+ G GG AIAD+LFG NP GKLPM+ +P+ + +P+ + RP + +Y
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPMS-FPRS-VGQIPVYYSHLNTGRPYHADKPNKYTS 629
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F TV++ ++ P R GS+ A+
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSDVK--MSAPTMTRDGSVTAS------------ 672
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGI 723
VDV N G +DGA + ++ A P KQL FEKV + G + V
Sbjct: 673 ---------VDVTNSGKRDGATVIQMYVQDVTASMSRPVKQLRGFEKVDLKPGETKTVSF 723
Query: 724 NIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
I V L ++ G+ N+ IG
Sbjct: 724 PIDVNA-LKFWNQQMKYDAEPGKFNVFIG 751
>gi|336412663|ref|ZP_08593016.1| hypothetical protein HMPREF1017_00124 [Bacteroides ovatus
3_8_47FAA]
gi|335942709|gb|EGN04551.1| hypothetical protein HMPREF1017_00124 [Bacteroides ovatus
3_8_47FAA]
Length = 735
Score = 265 bits (678), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 230/773 (29%), Positives = 363/773 (46%), Gaps = 101/773 (13%)
Query: 29 ACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLI--------------SGAA 74
A D KD P PI +RVNDL+ R++L+EKV L
Sbjct: 21 AKDKKDVALYKDP----KAPIEKRVNDLLSRMTLEEKVMQLNQYTLGRNNNVNNVGEEVK 76
Query: 75 AVP-RLGIKGYEWWSEALHGV--------SNVGPGTKFGGDFPGA--TSFPQVITTASSF 123
VP +G Y + AL S +G FG D T +P + A S+
Sbjct: 77 KVPAEIGSLIYFETNPALRNSMQKKAMEKSRLGIPIIFGYDAIHGFRTVYPISLAQACSW 136
Query: 124 NATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQETPGEDPVLSGKY 182
N L E V + EAR +G+ + +SP +++ RDPRWGR E GEDP +G +
Sbjct: 137 NPDLVEQACAVSAQEAR------MSGVDWTFSPMIDVARDPRWGRVAEGYGEDPYTNGVF 190
Query: 183 AASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFR 241
A+ V+G QG D ++AA KH+ Y R + ++SKQ + DT+ +P+
Sbjct: 191 GAASVKGYQGDDLSAENRMAACLKHYVGYGASE---AGRDYVYTEISKQTLWDTYLLPYE 247
Query: 242 MCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHF 301
M V G A++M S+N ++GVP A+P I+ ++ W +G+IVSD ++ Q
Sbjct: 248 MGVKAG-AATLMSSFNDISGVPGSANPYIMTEILKKRWGHDGFIVSDWGAIE-QLKNQGL 305
Query: 302 TSTPEEAAADAIRAGLDLDCGPF-LGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMF 360
+T +EAA A AGL++D H + V+ G +S ++ A+ L ++ RLG+F
Sbjct: 306 AATKKEAAWHAFTAGLEMDMMSHAYDRHLQELVEEGRVSVAQVDEAVRRVLLLKFRLGLF 365
Query: 361 DGEPSSQPY--GHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIG 418
+ +PY + P ++A A + +VLLKN+ +LPL+ + +AVIG
Sbjct: 366 E-----RPYTPATSEKERFFRPQSMDIAARLAAESMVLLKNENKTLPLTD--KKKIAVIG 418
Query: 419 PNSDVTVTMIGNYAG------IACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAI 472
P + ++G++ G +A Y A + GC + + F A+
Sbjct: 419 PMAKNGWDLLGSWCGHGKDTDVAMLYNGLATEFAGKAELRYAAGCA-TKGDNREGFAEAL 477
Query: 473 DASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPI 532
+A+R +D +L +G + E R+ + LP Q+EL +++ A K P +LVL++G P+
Sbjct: 478 EAARWSDVVVLCLGEMMTWSGENASRSSIALPQIQEELAAELKKAGK-PIVLVLVNGRPL 536
Query: 533 DVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEM 592
++ + +P AI+ PG G +A IL G NP GKL MT P +
Sbjct: 537 EL--NRLEPISDAILEIWQPGVNGALPMAGILSGRINPSGKLAMT---------FPYSTG 585
Query: 593 AMRPSQSKRYPGRTYR-FYKGPV---VYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGS 648
+ ++R GR ++ FYK +YPFGHG+SYT F +
Sbjct: 586 QIPIYYNRRKSGRGHQGFYKDITSDPLYPFGHGLSYTEFKY------------------- 626
Query: 649 INATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVA 707
T++ A KV + +RL+ V+V V NVG++DGA T+ F S P P K+L
Sbjct: 627 --GTVTPSATKVK--RGDRLS--VEVTVTNVGARDGAETVHWFISDPYCSITRPVKELKH 680
Query: 708 FEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
FEK + AG + +I + + V+ G R + GE++I + G + L
Sbjct: 681 FEKQLIKAGETKTFRFDIDMERDFGFVNEDGKRFLEAGEYHILVQGQTVKIEL 733
>gi|313145353|ref|ZP_07807546.1| periplasmic beta-glucosidase [Bacteroides fragilis 3_1_12]
gi|313134120|gb|EFR51480.1| periplasmic beta-glucosidase [Bacteroides fragilis 3_1_12]
Length = 802
Score = 265 bits (677), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 217/687 (31%), Positives = 330/687 (48%), Gaps = 91/687 (13%)
Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
G T FP I AS++N L +GRV++ EA A G G + P +++ RDPRW R
Sbjct: 156 GTTVFPTSIGQASTWNPELIRQMGRVIATEASA--QGAHIG---YGPVLDLARDPRWSRV 210
Query: 169 QETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
+ET GEDP L+G A+ VRG QG R V A+ KHF +Y W A +
Sbjct: 211 EETYGEDPYLNGVMGAALVRGFQGDTLRGRKSVIATLKHFASY---GWTEGGHNGGTAHL 267
Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
++++E+ PFR V G + SVM SYN+++G P +L ++ W+ G++VS
Sbjct: 268 GERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVS 326
Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE---SAVQRGLLSEIDIN 344
D ++G + + EAA A+ AG+D D G ++ E +AV++G ++ ++
Sbjct: 327 DLYAIGGLRE-HGVAGSDYEAAVKAVNAGVDSDLGT--NVYAEQLVAAVRKGDVAMETVD 383
Query: 345 NALVNTLTVQMRLGMFDGE--PSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGP 402
A+ L ++ +G+FD +P + V +P+H LA E ARQ IVLLKN+
Sbjct: 384 KAVRRILFLKFHMGLFDAPFVDDKRP-----AQLVASPEHIGLAREVARQSIVLLKNEDK 438
Query: 403 SLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIAC--GYTTPLQGI----GRYARTIHQQG 456
LPL RT+AVIGPN+D M+G+Y T L+GI + R ++ +G
Sbjct: 439 LLPLKK-DIRTLAVIGPNADNGYNMLGDYTAPQADGSVVTVLEGIRQKVSKDTRVLYAKG 497
Query: 457 CKDVACADDQLFGAAIDASRQADATILVMG----LDQSIE-------------------A 493
C V + F AI+A+R AD ++V+G D S E
Sbjct: 498 CA-VRDSSRTGFADAIEAARSADVVVMVVGGSSARDFSSEYEETGAAKVSANRVSDMESG 556
Query: 494 EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPG 553
E DRA L L GRQ EL+ +V K P +LVL+ G P+ + + AI+ A YPG
Sbjct: 557 EGYDRATLHLMGRQLELLEEVRKLGK-PMVLVLIKGRPLLMEGVIQEA--DAILDAWYPG 613
Query: 554 QAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGP 613
GG A+AD+LFG NP G+L ++ P+ + LP+ R RY G
Sbjct: 614 MQGGNAVADVLFGDYNPAGRLTLS-VPRS-VGQLPVYYNTKRKGNRSRYIEEA-----GT 666
Query: 614 VVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQ 673
YPFG+G+SYT F +T G ++V+ + N + V
Sbjct: 667 PRYPFGYGLSYTTFSYT------------------------GMKVRVSE-ESNHCRVDVS 701
Query: 674 VDVKNVGSKDGAHTLLVFSTPPAGHW-APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLS 732
V V+N G+ DG + ++ G + P +QL AF +V + AG + + + K L+
Sbjct: 702 VTVRNQGTVDGDEVVQLYLRDEVGSFTTPDRQLRAFSRVRLKAGETREITFTLD-KKSLA 760
Query: 733 VVDRSGTRRIPLGEHNIHIGGTKHSVS 759
+ R G + G + GG+ ++
Sbjct: 761 LYMRDGEWAVEPGRFTVMAGGSSEDIA 787
>gi|116621797|ref|YP_823953.1| glycoside hydrolase family protein [Candidatus Solibacter usitatus
Ellin6076]
gi|116224959|gb|ABJ83668.1| glycoside hydrolase, family 3 domain protein [Candidatus Solibacter
usitatus Ellin6076]
Length = 765
Score = 265 bits (677), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 223/730 (30%), Positives = 345/730 (47%), Gaps = 109/730 (14%)
Query: 78 RLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSD 137
RLGI + E LHG + +G TSFPQ I ++F+ L E++ + +
Sbjct: 104 RLGIPVI-FHEECLHGHAAIG-----------GTSFPQPIGLGATFDPELVESLFAMTAA 151
Query: 138 EARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDG-- 195
EARA G LT P V++ R+PRWGR +ET GEDP L + + VRG QG
Sbjct: 152 EARA--RGTHQALT---PVVDVAREPRWGRVEETYGEDPFLVSRMGIAAVRGFQGDATFR 206
Query: 196 DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCS 255
D+ +V A+ KHF A+ +G + N VS + + +TF PF+ + +G SVM S
Sbjct: 207 DKTRVIATLKHFAAHGQPE-SGTNCAPVN--VSMRVLRETFLFPFKEALDKGCAISVMAS 263
Query: 256 YNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPE--------- 306
YN+++GVP+ A +L+ +R EW G++VSD YY + PE
Sbjct: 264 YNEIDGVPSHASRWLLRDVLRKEWGFKGFVVSD------YYAIYELSYRPESHGHFVAKD 317
Query: 307 --EAAADAIRAGLDLDC-GPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGE 363
EA A A++AG++++ P LH V +G+L E ++ + L + ++G+FD +
Sbjct: 318 KREACALAVQAGVNIELPEPDCYLHLVDLVHKGVLQESQLDELVEPMLRWKFQMGLFD-D 376
Query: 364 PSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDV 423
P P + H+ELA++AAR+ I LLKN GP +PL +T+AVIGPN++
Sbjct: 377 PYVDPAE--AERIAGCDAHRELAMQAARETITLLKNDGPVVPLDLSAIKTIAVIGPNAN- 433
Query: 424 TVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGCKDV--------------ACADD 465
+++G Y+G+ T L GI G A+ ++ +GCK D
Sbjct: 434 -RSLLGGYSGVPKHDVTVLDGIRERVGSRAKVVYAEGCKITIGGSWVQDEVTPSDPAEDR 492
Query: 466 QLFGAAIDASRQADATILVMGLDQSIEAEAL------DRAGLLLPGRQQELVSKVSMASK 519
+ A+ +++AD +L +G ++ EA DR L L GRQ+ELV + +A+
Sbjct: 493 RQIAEAVKVAKRADVIVLAIGGNEQTSREAWSPKHLGDRPSLDLVGRQEELV-RAMVATG 551
Query: 520 GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWY 579
P I L +G PI + + + AI Y GQ G A+A++LFG +NPGGKLP+T
Sbjct: 552 KPVIAFLFNGRPISINYLAQS--VPAIFECWYLGQETGRAVAEVLFGDTNPGGKLPIT-I 608
Query: 580 PQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
P+ +LP +PS R Y F + +Y FG+G+SYT F
Sbjct: 609 PRS-AGHLPAF-YNHKPSAR-----RGYLFDEVGPLYAFGYGLSYTTFAF---------- 651
Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRL-TLGVQVDVKNVGSKDGAHTLLVFSTPPAGH 698
+ +++ K +R T V VDV N G+++G + ++
Sbjct: 652 ----------------QNLRLAKKKMHRESTARVLVDVTNTGAREGREVVQLYIRDLVSS 695
Query: 699 WA-PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHS 757
P K+L F K+ + G Q V I L+ + + G+ I +G +
Sbjct: 696 VTRPIKELKGFRKITLQPGQTQTVEFEI-TPDLLAFYNVDMKFVVEPGDFEIMVGSSSRD 754
Query: 758 VSLHAATLGV 767
L L V
Sbjct: 755 ADLQKVILRV 764
>gi|409196594|ref|ZP_11225257.1| beta-glucosidase [Marinilabilia salmonicolor JCM 21150]
Length = 876
Score = 265 bits (677), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 154/419 (36%), Positives = 224/419 (53%), Gaps = 29/419 (6%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
PF SL +RV DLI RL+++EK LL + A+PRLGI+ + WWSEALHG +N
Sbjct: 34 PFQNPSLSFEERVKDLISRLTVEEKAALLCDQSDAIPRLGIEKFNWWSEALHGYANND-- 91
Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY---------NGGTAGLT 151
T FPQ + A+SFN L I SDE RA Y N L+
Sbjct: 92 --------SVTVFPQPVGMAASFNDELVFDIFNATSDEGRAKYHQAQRRGEENRRFLSLS 143
Query: 152 YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAY 210
W+PNVNIFRDPRWGRGQET GEDP L+ + V+GLQG D K+ A KH+T +
Sbjct: 144 VWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVQVVKGLQGPEDAKYRKLLACAKHYTVH 203
Query: 211 DLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
W+ R N VS ++ +T+ F+ V + V VMC+Y++++ P C++
Sbjct: 204 SGPEWS---RHELNVNDVSPREFYETYMPAFKALVQDADVRQVMCAYHRLDDEPCCSNTR 260
Query: 270 ILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHT 329
IL+R +R EW + +DC ++ ++ T +STP AAA + AG DL+C + H
Sbjct: 261 ILQRILRDEWGYEHLVAADCGAITDFHTTHGISSTPVHAAATGLLAGTDLEC-IWDNYHY 319
Query: 330 E---SAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELA 386
+ A+++ L+ E I+++L L + LG D + + P+ + +H+ELA
Sbjct: 320 KMLPEALEKDLIKEEHIDSSLFRVLMGRFELGEMDDD-AIVPWAQTPASVLNNKEHRELA 378
Query: 387 LEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
+ A+Q + LL+N+ LPL + +AVIGPN+D + GNY G T L G+
Sbjct: 379 YKMAQQSMTLLQNKNNVLPLDKSSTQKIAVIGPNADDEEVLWGNYNGTPIRTITILDGM 437
Score = 98.6 bits (244), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 82/258 (31%), Positives = 113/258 (43%), Gaps = 50/258 (19%)
Query: 479 DATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMS 528
+ + GL S+E E + DR + LP Q+ + + A K I V S
Sbjct: 615 ETVVFAGGLSGSLEGEEMPVSYPGFKGGDRTHIELPSVQRNALKALKEAGK-TVIFVNFS 673
Query: 529 GGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL- 587
G I A AI+ A Y G++GG AIAD+LFG NP GKLP+T+Y NL
Sbjct: 674 GSAI--ALEPETESCDAILQAWYAGESGGQAIADVLFGDYNPSGKLPLTFYRNS--DNLG 729
Query: 588 PMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHG 647
+ +M GRTYR Y ++PFG G+SYTNF + NA
Sbjct: 730 DFEDYSME--------GRTYR-YTNDHLFPFGFGLSYTNF--QIGNAKLS---------- 768
Query: 648 SINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVA 707
ATI+ T+ + + VKN G +DG + V+ P K L
Sbjct: 769 --KATITPDE-----------TISINIPVKNTGKRDGTEIIQVYVRKVNDIDGPIKTLKG 815
Query: 708 FEKVHVPAGAQQRVGINI 725
FE+V VPAG + I++
Sbjct: 816 FERVDVPAGKTTQANISL 833
>gi|375149998|ref|YP_005012439.1| Beta-glucosidase [Niastella koreensis GR20-10]
gi|361064044|gb|AEW03036.1| Beta-glucosidase [Niastella koreensis GR20-10]
Length = 875
Score = 265 bits (677), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 164/462 (35%), Positives = 233/462 (50%), Gaps = 43/462 (9%)
Query: 1 MASTIAFFF--LGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIG 58
M T AFF+ +G +LL C A PF L RVNDL+
Sbjct: 1 MKITRAFFYPLIGCLLL--------------CIHLQAQNSKFPFQNYRLSFEDRVNDLVS 46
Query: 59 RLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVIT 118
RL+L+EKV +++ A +PRL I Y+WW+E LHGV+ T + T FPQ I
Sbjct: 47 RLTLEEKVAQMLNAAPGIPRLDIPAYDWWNETLHGVAR----TPYN-----VTVFPQAIA 97
Query: 119 TASSFNATLWEAIGRVVSDEARAMYNGGTA---------GLTYWSPNVNIFRDPRWGRGQ 169
A++++ + + E R ++N A GLTYW+PN+NIFRDPRWGRGQ
Sbjct: 98 MAATWDTAALYRMADCSALEGRVIHNKAIAAGKEKDRYLGLTYWTPNINIFRDPRWGRGQ 157
Query: 170 ETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--RFHFNAKV 227
ET GEDP L+ A ++VRGLQG+D LK AA KH+ + +G + R F+ V
Sbjct: 158 ETYGEDPYLTAALADAFVRGLQGNDPKYLKAAACAKHYAVH-----SGPEPSRHVFDVDV 212
Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
+ D+ DT+ F+ V VA VMC+YN P CA ++ +R +W GY+ S
Sbjct: 213 TPYDLWDTYLPSFKKLVTVSNVAGVMCAYNAFRKQPCCASDVLMTDILRNQWSFKGYVTS 272
Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
DC ++ +Y A+ADA+ G D+DCG AV+ ++E I+ ++
Sbjct: 273 DCGAIDDFYRNHKTHPDAAAASADAVFHGTDIDCGNEAYRALVQAVKENKITEKQIDISV 332
Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
++ RLGMFD PS Y ++ + H + AL A + IVLLKN +LPL
Sbjct: 333 KRLFMIRFRLGMFD-PPSMVKYAQTPATELESAAHAKHALLMAHESIVLLKNANNTLPLK 391
Query: 408 HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA 449
+ V V+GPN+ + +GNY+G T QGI A
Sbjct: 392 KGLKKIV-VLGPNATNVIAPLGNYSGTPSKLITLFQGIKEKA 432
Score = 107 bits (266), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 88/300 (29%), Positives = 132/300 (44%), Gaps = 52/300 (17%)
Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
F A + ADA I G+ +E E + DR +LLP Q EL+ K A
Sbjct: 597 FNALVKKYSDADAFIFAGGISPQLEGEEMKVSDPGFKGGDRTTILLPAIQTELM-KALQA 655
Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
S P + V+M+G + + + I AI+ A Y GQA GTA+AD+LFG NP G+LP+T
Sbjct: 656 SGKPVVFVMMTGSALATPWESEN--IPAIVNAWYGGQAAGTALADVLFGDYNPSGRLPVT 713
Query: 578 WYPQEYITNLPMTE-MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
+Y + +LP E +M+ RTYR++ G +Y FG+G+SYT F + P
Sbjct: 714 FYGSD--NDLPSFEDYSMK--------NRTYRYFTGKPLYGFGYGLSYTTFRYDQLTMPV 763
Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPP 695
+GK +KVT V V N G G ++
Sbjct: 764 TAQ--------------NGKPVKVT------------VRVTNTGKTTGDEVAQIYVVNEN 797
Query: 696 AGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTK 755
K L F+++ + + V + L+ VD G R+ G+ I +GG++
Sbjct: 798 TSIQTALKTLKGFQRISLRPAESKMVSFVLQ-SDDLTYVDADGQRKPLTGKIQICVGGSQ 856
>gi|298387489|ref|ZP_06997041.1| periplasmic beta-glucosidase [Bacteroides sp. 1_1_14]
gi|298259696|gb|EFI02568.1| periplasmic beta-glucosidase [Bacteroides sp. 1_1_14]
Length = 950
Score = 265 bits (677), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 225/761 (29%), Positives = 362/761 (47%), Gaps = 105/761 (13%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISG--AAAVPRLGIKGYEWWSEALHGVSNVGP 99
+ SLP+ +RV L+ ++ ++K++L+ G +P L + EA+HG S
Sbjct: 166 YMDASLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFS---- 220
Query: 100 GTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNI 159
+G GAT FPQ + +++N L E + V+ DE A N A WSP +++
Sbjct: 221 ---YGS---GATIFPQALAMGATWNRKLTEEVAMVIGDETVAA-NTKQA----WSPVLDV 269
Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD 219
+D RWGR +ET GEDPVL + ++++G Q + + KHF + G D
Sbjct: 270 AQDARWGRCEETFGEDPVLVSQIGGAWIKGYQSRG-----LFTTPKHFGGHGAP-LGGRD 323
Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
+S++++ + VPFR + S+M +Y+ GVP +L++ +R EW
Sbjct: 324 SHDIG--LSEREMREIHLVPFRHAIRNYDCQSLMMAYSDYMGVPVAKSKELLQQILRQEW 381
Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPE-EAAADAIRAGLDLDCG-PFLGLHTESAVQRGL 337
NG+IVSDC ++G +H+T+ + EAA A+ AG+ +CG + A + G
Sbjct: 382 GFNGFIVSDCGAIGNLTARKHYTAKDKIEAANQALAAGIATNCGDTYNNKEVIQAAKDGR 441
Query: 338 LSEIDINNALVNTLTVQMRLGMFDGEPSS-QPYGHLGPKDVCTPDHQELALEAARQGIVL 396
++ D++N L R +F+ P + + P + H+E+A +AAR+ IV+
Sbjct: 442 INMEDLDNVCRTMLGTMFRNELFEKNPCKPLDWKKIYP-GWNSDSHKEMARQAARESIVM 500
Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYA-----GIACGYTTPLQG-IGRYAR 450
L+N+ LPLS T+AV+GP +D G+Y G T ++G +G+ +
Sbjct: 501 LENKENLLPLSKTLC-TIAVLGPGAD--DLQPGDYTPKLLPGQLKSVLTGIKGAVGKQTK 557
Query: 451 TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEA---------EALDRAGL 501
+++QGC D D+ A+ A+ Q+D I+V+G + EA E D A L
Sbjct: 558 VLYEQGC-DFTNPDETNIPKAVKAASQSDVVIMVLGDCSTSEATNDVRKTCGENNDWATL 616
Query: 502 LLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIA 561
+LPG+QQEL+ V K P IL+L +G P D+ A + + W PGQ GG A+A
Sbjct: 617 ILPGKQQELLEAVCATGK-PVILILQAGRPYDILKASEMCKAILVNW--LPGQEGGPAMA 673
Query: 562 DILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHG 621
D+LFG NP G+LPMT +P+ ++ LP+ + +RY +Y +Y FG G
Sbjct: 674 DVLFGDYNPAGRLPMT-FPR-HVGQLPL--YYNFKTSGRRYEYVDMEYYP---LYRFGFG 726
Query: 622 MSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGS 681
+SYT+F ++ N I KA + VQ VKNVGS
Sbjct: 727 LSYTSFEYS-------------------NLKIQEKA---------NGNVEVQATVKNVGS 758
Query: 682 KDGAHTLLVFSTPPAGHWAPHK----QLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRS 737
+ G ++ T +A K +L F ++H+ G + V + +S+++
Sbjct: 759 RAGDEVAQLYVT---DMYASVKTRVMELKDFARIHLQPGESKTVSFEMTPYD-ISLLNDR 814
Query: 738 GTRRIPLGEHNIHIGGT----------KHSVSLHAATLGVI 768
R + GE I +GG KHSV GV+
Sbjct: 815 MDRVVEKGEFKIMVGGMSPDYVAKNEIKHSVGYSDNKKGVM 855
>gi|354582345|ref|ZP_09001247.1| glycoside hydrolase family 3 domain protein [Paenibacillus lactis
154]
gi|353199744|gb|EHB65206.1| glycoside hydrolase family 3 domain protein [Paenibacillus lactis
154]
Length = 765
Score = 265 bits (677), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 206/683 (30%), Positives = 325/683 (47%), Gaps = 97/683 (14%)
Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
GAT FP ++ S++N L+ + R V+ E R+ G +SP +++ RDPRWGR
Sbjct: 122 GATVFPVPLSLGSTWNTELYREMCRAVALETRSQ-----GGAVTYSPVLDVVRDPRWGRT 176
Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGD-RLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
+E GEDP L G++AA+ V GLQG D VAA+ KHF Y G NA
Sbjct: 177 EECFGEDPYLIGEFAAASVEGLQGESLDGEASVAATLKHFVGY------GSSEGGRNAGP 230
Query: 228 SKQDIEDTFDV---PFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+ +V PF+ V E AS+M +YN+++GVP + +L +R EW +G
Sbjct: 231 VHMGTRELMEVDMYPFKKAV-EAGAASIMPAYNEIDGVPCTVNEELLDGVLRKEWGFDGM 289
Query: 285 IVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLD-CGPFLGLHTESAVQRGLLSEIDI 343
+++DC ++ + +AA AI AG+D++ G G++ E AVQ L +
Sbjct: 290 VITDCGAINMLAAGHDTAEDGMDAAVSAISAGIDMEMSGEMFGMYLERAVQEKRLDVSVL 349
Query: 344 NNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
+ A+ LT++ +LG+F+ P + P + + H+E+A + A +GIVLLKN+G +
Sbjct: 350 DEAVRRVLTLKFKLGLFE-NPYADP--ARAEQVIGCSRHREMARQLAAEGIVLLKNEGST 406
Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIA--CGYTTPLQGI-----GRYARTIHQQG 456
LPLS +AVIGPN+D +G+Y T L+GI G R ++ G
Sbjct: 407 LPLSK-EDGVIAVIGPNADQGYNQLGDYTSPQPPSRVVTVLEGIRAKLGGDKGRVLYAPG 465
Query: 457 CKDVACADDQLFGAAIDASRQADATILVMGLDQSIE------------------------ 492
C+ + + F A+ + QAD +LV+G + +
Sbjct: 466 CR-INGDSREGFELALSCAGQADTVVLVLGGSSARDFGEGTIDLRTGASKVTGNDWSDMD 524
Query: 493 -AEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGY 551
E +DR L L G Q EL ++ K ++V ++G PI + D AI+ A Y
Sbjct: 525 CGEGIDRMTLQLSGVQLELAREIHKLGK-RLVVVYINGRPIAEPWI--DRHADAILEAWY 581
Query: 552 PGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYK 611
PGQ GG A+ADILFG NP GKL ++ P+ ++ LP+ KR G+ Y
Sbjct: 582 PGQEGGHAVADILFGDVNPSGKLTIS-IPK-HVGQLPVY------YNGKRSRGKRYLEED 633
Query: 612 GPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLG 671
YPFG+G+SYT F ++ + ++ + I+ T
Sbjct: 634 SQPQYPFGYGLSYTEFRYS-------------------DLQVTPQTIRTGE------TAV 668
Query: 672 VQVDVKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVGINI----- 725
V V+V+N GS GA + ++ A + P K+L F K+++ G +QR+ +
Sbjct: 669 VTVNVENSGSVAGAEVVQLYINDAASRFTRPAKELKGFRKIYLEPGEKQRIEFTVGPEQL 728
Query: 726 -HVCKYLSVVDRSGTRRIPLGEH 747
++ + V G R+ +G+H
Sbjct: 729 QYIGQNYQPVVEPGLFRVMVGKH 751
>gi|224583333|ref|YP_002637131.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Paratyphi C strain RKS4594]
gi|224467860|gb|ACN45690.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Paratyphi C strain RKS4594]
Length = 765
Score = 265 bits (677), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 206/669 (30%), Positives = 326/669 (48%), Gaps = 103/669 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN +GRV + Y GL W+P V++ RDPRWGR
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V+ +QG S DR V S KHF AY +N VD
Sbjct: 172 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R EW G
Sbjct: 227 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 283
Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T+ PE+A A++AG+D+ + + ++ G ++ +
Sbjct: 284 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 341
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPK+ D H++ A E AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 396
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + T+AV+GP +D ++G++ AG+A T L GI G A
Sbjct: 397 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 454
Query: 450 RTIHQQGC-----KDVAC--------------ADDQLFGAAIDASRQADATILVMGLDQS 490
+ +H +G KD+ + + A+ A++QAD + V+G Q
Sbjct: 455 KILHAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 514
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 515 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
+ G GG AIAD+LFG NP GKLP++ +P+ + +P+ T P + +Y
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 629
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F TV++ ++ P R G + A+
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 672
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
V+V N G ++GA + ++ A P KQL FEK+ + G + V
Sbjct: 673 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGESKTVSF 723
Query: 723 -INIHVCKY 730
I+I K+
Sbjct: 724 PIDIEALKF 732
>gi|383113360|ref|ZP_09934132.1| hypothetical protein BSGG_3064 [Bacteroides sp. D2]
gi|382948727|gb|EFS32364.2| hypothetical protein BSGG_3064 [Bacteroides sp. D2]
Length = 954
Score = 265 bits (677), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 216/761 (28%), Positives = 362/761 (47%), Gaps = 107/761 (14%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISG--AAAVPRLGIKGYEWWSEALHGVSNVGP 99
+ SLP+ +RV L+ ++ ++K++L+ G +P L + EA+HG S
Sbjct: 170 YMDASLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFS---- 224
Query: 100 GTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNI 159
+G GAT FPQ + +++N L E + V+ DE A N A WSP +++
Sbjct: 225 ---YGS---GATIFPQALAMGATWNRKLTEEVAMVIGDETVAA-NTKQA----WSPVLDV 273
Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD 219
+D RWGR +ET GEDPVL + ++++G Q + + KHF + G D
Sbjct: 274 AQDARWGRCEETFGEDPVLVSQIGGAWIKGYQSRG-----LFTTPKHFGGHGAP-LGGRD 327
Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
+S++++ + VPFR + S+M +Y+ G+P +L++ +R EW
Sbjct: 328 SHDIG--LSEREMREVHLVPFRHAIRNYDCQSLMMAYSDYMGIPVAKSTELLQQILRQEW 385
Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPE-EAAADAIRAGLDLDCG-PFLGLHTESAVQRGL 337
NG+IVSDC ++G +H+T+ + EAA A+ AG+ +CG + A + G
Sbjct: 386 GFNGFIVSDCGAIGNLTARKHYTAQDKIEAANQALAAGIATNCGDTYNNKEVIQAAKDGR 445
Query: 338 LSEIDINNALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAARQGIVL 396
++ +++N L+ R +F+ P + + P + H+E+A +AAR+ IV+
Sbjct: 446 INMENLDNVCRTMLSTMFRNELFEKNPCKPLDWKKIYP-GWNSDSHKEMARQAARESIVM 504
Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSD-------VTVTMIGNYAGIACGYTTPLQGIGRYA 449
L+N+ LPL+ RT+AV+GP +D + G + G + +G+
Sbjct: 505 LENKENLLPLTK-NLRTIAVLGPGADDLQPGDYTPKLLPGQLKSVLTGIK---EAVGKQT 560
Query: 450 RTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEA---------EALDRAG 500
+ +++QGC D D+ A+ A+ Q+D ++V+G + EA E D A
Sbjct: 561 KVLYEQGC-DFTNPDETNIPKAVKAASQSDVVVMVLGDCSTSEATNDVRKTCGENNDWAT 619
Query: 501 LLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
L+LPG+QQEL+ V K P IL+L +G P D+ A + + W PGQ GG A+
Sbjct: 620 LILPGKQQELLEAVCATGK-PVILILQAGRPYDILKASEMCKAILVNW--LPGQEGGPAM 676
Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGH 620
AD+LFG NPGG+LPMT +P+ ++ LP+ + +RY +Y +Y FG
Sbjct: 677 ADVLFGDYNPGGRLPMT-FPR-HVGQLPL--YYNFKTSGRRYEYVDMEYYP---LYRFGF 729
Query: 621 GMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVG 680
G+SYT+F ++ +K+ +T VQ VKN+G
Sbjct: 730 GLSYTSFEYS--------------------------DLKIQEKPNGNVT--VQATVKNIG 761
Query: 681 SKDGAHTLLVFSTPPAGHWAPHK----QLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDR 736
S+ G ++ T +A K +L F+++++ G + V + +S+++
Sbjct: 762 SRAGDEVAQLYVT---DMYASVKTRVMELKDFDRIYLQPGESKTVSFELTPYD-ISLLND 817
Query: 737 SGTRRIPLGEHNIHIGGT----------KHSVSLHAATLGV 767
R + GE I +GG KHSV GV
Sbjct: 818 HMDRVVEKGEFKICVGGMSPDYVAKNEIKHSVGYSDKKKGV 858
>gi|317474349|ref|ZP_07933623.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
gi|316909030|gb|EFV30710.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
Length = 877
Score = 265 bits (677), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 166/450 (36%), Positives = 243/450 (54%), Gaps = 42/450 (9%)
Query: 29 ACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWS 88
AC +A PF L + +RV++L+ L+LQEKV ++I+ + A+ RLGI Y WW
Sbjct: 8 AC--SEAEKYQYPFQNPKLKVEERVDNLMSLLTLQEKVSMMINRSLAIERLGIPAYNWWG 65
Query: 89 EALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN---- 144
EA HG+ G G T FPQ I A++F+ + +VSDEARA YN
Sbjct: 66 EACHGL--------IAG---GVTVFPQSIALAATFDDSSQLTTYTMVSDEARARYNTLPL 114
Query: 145 --------GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD 196
LT+W+PN+NIFRDPRWGRGQET GEDP L + + V G+QG D
Sbjct: 115 DGDIGPYVSAIPNLTFWAPNINIFRDPRWGRGQETYGEDPFLMSRMGLNVVLGMQGDDEH 174
Query: 197 RLKVAASCKHFTAYDLDNWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMC 254
K A KH+ + +G + R FNA VS +D+ +T+ F V++G V VMC
Sbjct: 175 YYKTHACAKHYGVH-----SGPEPLRHEFNAVVSMRDLWETYLPAFETLVVKGNVREVMC 229
Query: 255 SYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADA 312
+Y+ G P CA +L +R W +G +VSDCD++ +Y + P A+ADA
Sbjct: 230 AYSAYEGEPCCASNRLLVDILRNRWGFDGMVVSDCDAINDFYVKGRHETHPDAAAASADA 289
Query: 313 IRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQ-PYGH 371
+ G DL+CG E AV++G++ E D++ +L LT + RLG+ D P+ PY
Sbjct: 290 VLTGTDLECGRSYNALIE-AVEKGIIKEQDLDVSLRRILTERFRLGLLD--PAKYVPYST 346
Query: 372 LGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY 431
+ + +H++ AL+ A + VLLKN+G LPL +++AV+GPN + ++ M GNY
Sbjct: 347 IPGSVIDCQEHRDHALKMAHESQVLLKNEGNILPLDK-NIKSIAVVGPNINDSIMMRGNY 405
Query: 432 AGIACGYTTPLQGIGRY---ARTIHQQGCK 458
+G T LQG+ R I ++G +
Sbjct: 406 SGSPTHCITILQGLKNKLPNTRIISERGSE 435
Score = 114 bits (286), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 91/303 (30%), Positives = 136/303 (44%), Gaps = 41/303 (13%)
Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
+F + + +A I V G+ + E E +RA + LP Q+ + + K P I V
Sbjct: 609 VFEELKEKIKDVEAIIYVGGITPTQEGEGHERAKIELPDVQKRFLKAMHETGK-PVIYVN 667
Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
SG I A A D A++ A YPGQ GGTA+AD+LFG NP GKLP+T+Y
Sbjct: 668 CSGSAI--ALADIDYAYDALLQAWYPGQEGGTAVADVLFGDYNPSGKLPVTFYKS--TEQ 723
Query: 587 LP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGR 645
LP T+ +M RTYR++KG Y FG+G+SYT+F
Sbjct: 724 LPEFTDYSME--------NRTYRYFKGEPQYAFGYGLSYTDFEFG--------------- 760
Query: 646 HGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQL 705
A +S +IK + + + + NVG DGA + V+ AP K L
Sbjct: 761 ----EALLSSSSIKAGEK------VEITIPLTNVGKMDGAEVVQVYVKSLTNPDAPIKSL 810
Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSV-VDRSGTRRIPLGEHNIHIGGTKHSVSLHAAT 764
+ + + AG ++V I + + D G + P G++ I G + + L +
Sbjct: 811 KGYVRQEIKAGKSEKVRITLEPESFAYYNADVDGLKVFP-GKYQILYGNSSRDMDLSSLD 869
Query: 765 LGV 767
L V
Sbjct: 870 LEV 872
>gi|393789623|ref|ZP_10377743.1| hypothetical protein HMPREF1068_04023 [Bacteroides nordii
CL02T12C05]
gi|392650339|gb|EIY44008.1| hypothetical protein HMPREF1068_04023 [Bacteroides nordii
CL02T12C05]
Length = 863
Score = 265 bits (676), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 169/465 (36%), Positives = 250/465 (53%), Gaps = 45/465 (9%)
Query: 8 FFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVK 67
+ L L +A+ G+A EP+ +P +L +R DL+GRL+L+EKV
Sbjct: 5 LIITLCLSTATLVGMAQNEPYR-NP-------------NLSPEERAEDLLGRLTLKEKVT 50
Query: 68 LLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATL 127
L+ + + + RLGI Y+WW+EALHGV+ G AT FP + ASSF+
Sbjct: 51 LMQNESFPIERLGIAHYDWWNEALHGVARAGI----------ATVFPITMGMASSFDDKA 100
Query: 128 WEAIGRVVSDEARAMYNGG---------TAGLTYWSPNVNIFRDPRWGRGQETPGEDPVL 178
E I VSDE RA Y+ GLT+W+PN+NIFRDPRWGRGQET GEDP L
Sbjct: 101 IEDIFTAVSDEVRAKYHDAHRQGRRGVRCEGLTFWTPNINIFRDPRWGRGQETYGEDPYL 160
Query: 179 SGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAK-VSKQDIEDTF 236
+ + + V+GLQG +D K+ A KH+ + R FNA+ +S +D+ +T+
Sbjct: 161 TSRMGVAVVKGLQGPADAKYDKLHACAKHYAVHSGPE---AKRHSFNAENISPRDLWETY 217
Query: 237 DVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVG-VY 295
F+ V E V VMC+YN+ G P C +L + +R +W +VSDC ++ +
Sbjct: 218 LPAFKTLVQEADVKEVMCAYNRFEGDPCCGSNRLLTQILRDDWGYKHVVVSDCGAISDFF 277
Query: 296 YDTQHFT-STPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQ 354
Y +H T +A+A A+ +G DL+CG H + AV+ GL+SE IN +L L +
Sbjct: 278 YSDRHATHKNAADASAAAVLSGTDLECGIEYA-HLDKAVKEGLISEERINTSLFRLLKAR 336
Query: 355 MRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTV 414
LG D + + P+ + V H+++AL+ R+ +VLL N G LPL R +
Sbjct: 337 FELGEMDDD-ALVPWSKISIDTVDCQMHKQMALDITRKSMVLLHNNG-VLPLGKTGAR-I 393
Query: 415 AVIGPNSDVTVTMIGNYAGIACGYTTPLQGI-GRYARTIHQQGCK 458
AV+GPN++ +V GNY G T L+GI + I+++GC+
Sbjct: 394 AVMGPNANDSVMQWGNYEGTPSHTVTVLEGIRNKIGNVIYEKGCE 438
Score = 110 bits (276), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 90/308 (29%), Positives = 135/308 (43%), Gaps = 50/308 (16%)
Query: 470 AAIDASRQADATILVMGLDQSIEAEAL-----------DRAGLLLPGRQQELVSKVSMAS 518
A + ++AD I V G+ ++E E DR + LP Q+ + K +
Sbjct: 591 AVAERVKEADVVIFVGGISPNLEGEEKNFVNCPGFVGGDRTSIELPEVQRN-ILKALKKA 649
Query: 519 KGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTW 578
I V SG + A AI+ A YPGQAGGTA+ADI+FG NP GKLP+T+
Sbjct: 650 GKKVIFVNCSGSAM--ALVPETQSCDAILQAWYPGQAGGTAVADIIFGDYNPSGKLPVTF 707
Query: 579 YPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTV 637
Y LP + +M+ GRTYR+ ++PFG+G+SYT F + A
Sbjct: 708 YKN--TEQLPDFEDYSMK--------GRTYRYMTESPLFPFGYGLSYTTFQFSKAG---- 753
Query: 638 VAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG 697
L+ R +IN + K V+VKN G +DG + ++
Sbjct: 754 ----LNKRVATINQPVQFK-----------------VNVKNTGKRDGTEVVQIYIRKADD 792
Query: 698 HWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHS 757
P K L AF V + AG V I + + + T RI G++ I G + ++
Sbjct: 793 EEGPVKSLRAFRPVTLKAGKSATVNITLSPDAFEFFDPETNTMRIVPGDYEIMYGNSSNT 852
Query: 758 VSLHAATL 765
+ + +L
Sbjct: 853 LPENKLSL 860
>gi|336415919|ref|ZP_08596257.1| hypothetical protein HMPREF1017_03365 [Bacteroides ovatus
3_8_47FAA]
gi|335939822|gb|EGN01694.1| hypothetical protein HMPREF1017_03365 [Bacteroides ovatus
3_8_47FAA]
Length = 782
Score = 265 bits (676), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 210/699 (30%), Positives = 325/699 (46%), Gaps = 108/699 (15%)
Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
GAT FP I A++++ L + +G+V++ E R+ G + P +++ RDPRW R
Sbjct: 148 GATVFPTGIGMAATWSPELVKEVGQVIAKEIRSQ-----GGHISYGPVLDLTRDPRWSRV 202
Query: 169 QETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
+ET GEDPVLSG AS V GL G + + A+ KHF AY + + A V
Sbjct: 203 EETFGEDPVLSGILGASMVDGLGGGNLSQKYATIATLKHFLAYAVPEGGQNGNY---ASV 259
Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
+D+ F PFR + G + SVM SYN ++G+P ++ N+L + +R EW+ G++VS
Sbjct: 260 GIRDLHQNFLPPFRKAIDAGAL-SVMTSYNSIDGIPCTSNHNLLTQLLRNEWKFRGFVVS 318
Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
D S+ +++ T E AA ++ AG+D+D G + AVQ G + + I+ A+
Sbjct: 319 DLYSIEGIHESHFVAPTKENAAIQSVTAGVDVDLGGDAYTNLCHAVQSGQMDKAVIDTAV 378
Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
L ++ +G+F+ P P + K V +H ELA + A+ I LLKN+ LPLS
Sbjct: 379 CRVLRMKFEMGLFE-HPYVDP--KIAAKTVRRKEHIELARKIAQSSITLLKNENSILPLS 435
Query: 408 HIRHRTVAVIGPNSDVTVTMIGNYAGIA--CGYTTPLQGIGRY---ARTIHQQGC--KDV 460
++ VAVIGPN+D M+G+Y T L GI +R + +GC +D
Sbjct: 436 KTINK-VAVIGPNADNRYNMLGDYTAPQEDSNVKTVLDGIITKLSPSRVEYVRGCAIRDT 494
Query: 461 ACADDQLFGAAIDASRQAD----------------------ATILVMGLDQSIE-AEALD 497
+ + AI+A+R+++ A + G +E E D
Sbjct: 495 TVNEIE---QAIEAARRSEVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFD 551
Query: 498 RAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGG 557
RA L L GRQQEL+ + K P I+V + G P++ +A A++ A YPGQ GG
Sbjct: 552 RASLSLLGRQQELLESLQKTGK-PLIVVYIEGRPLEKNWASE--YADALLTAYYPGQEGG 608
Query: 558 TAIADILFGTSNPGGKLPMT----------WYPQEYITNLPMTEMAMRPSQSKRYPGRTY 607
AIAD+LFG NP G+LP++ +Y Q+ N E++ P
Sbjct: 609 NAIADVLFGDYNPSGRLPISVPRSVGQIPVYYNQKAPRNHDYVEVSSSP----------- 657
Query: 608 RFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNR 667
+Y FG+GMSYT F + S + A+C
Sbjct: 658 -------LYSFGYGMSYTTFEY------------------------SDLQVVQKSARC-- 684
Query: 668 LTLGVQVDVKNVGSKDGAH-TLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIH 726
V VKN G DG + L A P KQL FE+ H+ G +++V +
Sbjct: 685 --FEVSFKVKNTGKYDGEEVSQLYMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTFVLT 742
Query: 727 VCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATL 765
+ +V+ + + + G ++ IG + + L L
Sbjct: 743 EEDFF-LVNYTLKKVVESGNFHLMIGAASNDIRLQNVIL 780
>gi|194449077|ref|YP_002046219.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Heidelberg str. SL476]
gi|205359117|ref|ZP_02667180.2| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
subsp. enterica serovar Heidelberg str. SL486]
gi|386591978|ref|YP_006088378.1| Periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Heidelberg str. B182]
gi|417384271|ref|ZP_12149694.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Johannesburg str. S5-703]
gi|194407381|gb|ACF67600.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Heidelberg str. SL476]
gi|205338504|gb|EDZ25268.1| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
subsp. enterica serovar Heidelberg str. SL486]
gi|353609401|gb|EHC62716.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Johannesburg str. S5-703]
gi|383799022|gb|AFH46104.1| Periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Heidelberg str. B182]
Length = 755
Score = 265 bits (676), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 205/669 (30%), Positives = 327/669 (48%), Gaps = 103/669 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN +GRV + Y GL W+P V++ RDPRWGR
Sbjct: 108 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 161
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V+ +QG S DR V S KHF AY +N VD
Sbjct: 162 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 216
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R EW G
Sbjct: 217 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 273
Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T+ PE+A A++AG+D+ + + ++ G ++ +
Sbjct: 274 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 331
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPK+ D H++ A E AR+ +V
Sbjct: 332 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPADTNAESRLHRKEAREVARESVV 386
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + T+AV+GP +D ++G++ AG+A T L GI G A
Sbjct: 387 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 444
Query: 450 RTIHQQGC-----KDVAC--------------ADDQLFGAAIDASRQADATILVMGLDQS 490
+ ++ +G KD+ + + A+ A++QAD + V+G Q
Sbjct: 445 KILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 504
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 505 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 561
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
+ G GG AIAD+LFG NP GKLP++ +P+ + +P+ T P + +Y
Sbjct: 562 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 619
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F TV++ ++ P R G + A+
Sbjct: 620 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 662
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
V+V N G ++GA + ++ A P KQL FEK+ + G ++ V
Sbjct: 663 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGERKTVSF 713
Query: 723 -INIHVCKY 730
I+I K+
Sbjct: 714 PIDIEALKF 722
>gi|323451833|gb|EGB07709.1| hypothetical protein AURANDRAFT_64764 [Aureococcus anophagefferens]
Length = 819
Score = 264 bits (675), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 246/763 (32%), Positives = 348/763 (45%), Gaps = 126/763 (16%)
Query: 16 SASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAA 75
+A +GLAA E +C D T + SLP R+ L + L++ + L++ A A
Sbjct: 52 TACVAGLAAYE--SCPGLDGT-----YLDASLPEADRLAWLADNVPLEDMIGQLVNAAPA 104
Query: 76 VPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVV 135
V + + Y W ++ HGV GT AT +P + +S++ L +G +
Sbjct: 105 VDAVDLPAYNWLNDNEHGVK----GTAH------ATVYPMGASLGASWSVDLAWRVGAAI 154
Query: 136 SDEARAMYNG-----GTA--------------GLTYWSPNVNIFRDPRWGRGQETPGEDP 176
+E+RA +NG G A G+T ++PNVN+ RDPRWGR +E GEDP
Sbjct: 155 GNESRATHNGLADKSGNACGSTSTGEVVANGCGITLYAPNVNLVRDPRWGRAEEVYGEDP 214
Query: 177 VLSGKYAASYVRGLQGS--------DGDRLKVAASCKHFTAYDLDNWNG---VDRFHFNA 225
L+ + A V GLQG+ G L A CKHF A+ N DR +A
Sbjct: 215 HLTAELAVGMVTGLQGNAEGSTSGPGGGPLVTGACCKHFAAHFAVYQNEDLPADRMVLDA 274
Query: 226 KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYI 285
VS +D+ +T+ + CV+ K VNG PTCA P +L +R W +G++
Sbjct: 275 NVSSRDLWETYLPVMKACVVRAKA-------THVNGKPTCAHPELLNDVLRESWGFDGFV 327
Query: 286 VSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-----PFLGLHTESAVQRGLLSE 340
VSD D+ T + ST EEAAA I AG+D + G P L AV+ G ++
Sbjct: 328 VSDYDAWSNLVTTHKYVSTWEEAAAAGINAGMDQEGGFGDYSPVDAL--PDAVRNGTVAA 385
Query: 341 IDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELAL--EAARQGIVLLK 398
+ + + V++RLGMFD S+ YG D +LAL EAAR+GIVL K
Sbjct: 386 ATVRRSFERLMRVRLRLGMFDPPASTAVYGEAYQCDYQCETAAKLALAREAAREGIVLFK 445
Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIG--NYA---GIACGYTTPLQGIGRYARTIH 453
N G +LPL+ + +A++GP D ++G NYA G T +G+ A
Sbjct: 446 NAGGALPLA--KGARIALVGPQVDDWRVLLGAVNYAFEDGPDVAPVTIQKGLEAVANVSV 503
Query: 454 QQGCKDVACADDQLFGAAI--------------DASRQADATILVMGL-DQSIEAEALDR 498
GC VACA A D+ D L G D E+E+ DR
Sbjct: 504 AAGCDSVACAALVDVDGAKRLAAAADATVVVLGDSFGATDGWPLCRGTRDDGCESESHDR 563
Query: 499 AGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGT 558
A + LPG Q LV+ + AS + VL+ GG + + A +D +W PGQ GG
Sbjct: 564 ATIELPGEQVALVAALRAASS-RLVCVLVHGGAVALGAAADDCDAVLDLWV--PGQMGGA 620
Query: 559 AIADILFGTSNPGGKLPMTWYPQEYITNLP----MTEMAMRPSQSKRYPGRTYRFYKGPV 614
A+AD+LFG +P G+ P+T Y ++LP E A S G TYR+Y GP
Sbjct: 621 ALADVLFGDYSPAGRSPITMYAA--TSDLPPMGVFDEYAGESSN-----GTTYRYYAGPA 673
Query: 615 -VYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQ 673
Y FG G+SY +F + A AP T C + L +
Sbjct: 674 PTYAFGDGLSYASFSYAWAAAP-----------------------PTTVDACGAIRL--R 708
Query: 674 VDVKNVGSKDGAHTLLVFS-TPPAGHWAPHKQLVAFEKVHVPA 715
V V N GS + V++ P A AP +LVAF++V A
Sbjct: 709 VAVTNTGSVASDEVVQVYARVPDATVPAPAIRLVAFDRVRAIA 751
>gi|218132025|ref|ZP_03460829.1| hypothetical protein BACEGG_03650 [Bacteroides eggerthii DSM 20697]
gi|217985785|gb|EEC52125.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
eggerthii DSM 20697]
Length = 888
Score = 264 bits (675), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 165/450 (36%), Positives = 243/450 (54%), Gaps = 42/450 (9%)
Query: 29 ACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWS 88
AC +A PF L + +RV++L+ L+LQEKV ++I+ + A+ RLGI Y WW
Sbjct: 19 AC--SEAEKYQYPFQNPKLKVEERVDNLMSLLTLQEKVSMMINRSLAIERLGIPAYNWWG 76
Query: 89 EALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN---- 144
EA HG+ G G T FPQ I A++F+ + +VSDEARA YN
Sbjct: 77 EACHGL--------IAG---GVTVFPQSIALAATFDDSSQLTTYTMVSDEARARYNTLPL 125
Query: 145 --------GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD 196
LT+W+PN+NIFRDPRWGRGQET GEDP L + + V G+QG D
Sbjct: 126 DGDIGPYVSAIPNLTFWAPNINIFRDPRWGRGQETYGEDPFLMSRMGLNVVLGMQGDDEH 185
Query: 197 RLKVAASCKHFTAYDLDNWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMC 254
K A KH+ + +G + R FNA VS +D+ +T+ F V++G V VMC
Sbjct: 186 YYKTHACAKHYGVH-----SGPEPLRHEFNAVVSMRDLWETYLPAFETLVVKGNVREVMC 240
Query: 255 SYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADA 312
+Y+ G P CA +L +R W +G +VSDCD++ +Y + P A+ADA
Sbjct: 241 AYSAYEGEPCCASNRLLVDILRNRWGFDGMVVSDCDAINDFYVKGRHETHPDAAAASADA 300
Query: 313 IRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQ-PYGH 371
+ G DL+CG E AV++G++ E D++ +L LT + RLG+ D P+ PY
Sbjct: 301 VLTGTDLECGRSYNALIE-AVEKGIIKEQDLDVSLRRILTERFRLGLLD--PAKYVPYST 357
Query: 372 LGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY 431
+ + +H++ AL+ A + VLLKN+G LPL +++A++GPN + ++ M GNY
Sbjct: 358 IPGSVIDCQEHRDHALKMAHESQVLLKNEGNILPLDK-NIKSIAIVGPNINDSIMMRGNY 416
Query: 432 AGIACGYTTPLQGIGRY---ARTIHQQGCK 458
+G T LQG+ R I ++G +
Sbjct: 417 SGSPTHCITILQGLKNKLPNTRIISERGSE 446
Score = 115 bits (288), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 91/303 (30%), Positives = 136/303 (44%), Gaps = 41/303 (13%)
Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
+F + + +A I V G+ + E E +RA + LP Q+ + + K P I V
Sbjct: 620 VFEELKEKIKDVEAIIYVGGITPTQEGEGHERAKIELPDVQKRFLKAMHETGK-PVIYVN 678
Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
SG I A A D A++ A YPGQ GGTA+AD+LFG NP GKLP+T+Y
Sbjct: 679 CSGSAI--ALADIDYAYDALLQAWYPGQEGGTAVADVLFGDYNPSGKLPVTFYKS--TEQ 734
Query: 587 LP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGR 645
LP T+ +M RTYR++KG Y FG+G+SYT+F
Sbjct: 735 LPEFTDYSME--------NRTYRYFKGEPQYAFGYGLSYTDFEFG--------------- 771
Query: 646 HGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQL 705
A +S +IK + + + + NVG DGA + V+ AP K L
Sbjct: 772 ----EALLSSSSIKAGEK------VEITIPLTNVGKMDGAEVVQVYVKSLTNPDAPIKSL 821
Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSV-VDRSGTRRIPLGEHNIHIGGTKHSVSLHAAT 764
+ + + AG ++V I + + D G + P G++ I G + + L +
Sbjct: 822 KGYVRQEIKAGKSEKVRITLEPESFAYYNADVDGLKVFP-GKYQILYGNSSRDMDLRSLD 880
Query: 765 LGV 767
L V
Sbjct: 881 LEV 883
>gi|336417087|ref|ZP_08597416.1| hypothetical protein HMPREF1017_04524 [Bacteroides ovatus
3_8_47FAA]
gi|335936712|gb|EGM98630.1| hypothetical protein HMPREF1017_04524 [Bacteroides ovatus
3_8_47FAA]
Length = 954
Score = 264 bits (675), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 216/761 (28%), Positives = 361/761 (47%), Gaps = 107/761 (14%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISG--AAAVPRLGIKGYEWWSEALHGVSNVGP 99
+ SLP+ +RV L+ ++ ++K++L+ G +P L + EA+HG S
Sbjct: 170 YMDASLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFS---- 224
Query: 100 GTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNI 159
+G GAT FPQ + +++N L E + V+ DE A N A WSP +++
Sbjct: 225 ---YGS---GATIFPQALAMGATWNRKLTEEVAMVIGDETVAA-NTKQA----WSPVLDV 273
Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD 219
+D RWGR +ET GEDPVL + ++++G Q + + KHF + G D
Sbjct: 274 AQDARWGRCEETFGEDPVLVSQIGGAWIKGYQSRG-----LFTTPKHFGGHGAP-LGGRD 327
Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
+S++++ + VPFR + S+M +Y+ G+P +L++ +R EW
Sbjct: 328 SHDIG--LSEREMREVHLVPFRHAIRNYDCQSLMMAYSDYMGIPVAKSTELLQQILRQEW 385
Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPE-EAAADAIRAGLDLDCG-PFLGLHTESAVQRGL 337
NG+IVSDC ++G +H+T+ + EAA A+ AG+ +CG + A + G
Sbjct: 386 GFNGFIVSDCGAIGNLTARKHYTAQDKIEAANQALAAGIATNCGDTYNNKEVIQAAKDGR 445
Query: 338 LSEIDINNALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAARQGIVL 396
+ +++N L+ R +F+ P + + P + H+E+A +AAR+ IV+
Sbjct: 446 IDMENLDNVCRTMLSTMFRNELFEKNPCKPLDWKKIYP-GWNSDSHKEMARQAARESIVM 504
Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSD-------VTVTMIGNYAGIACGYTTPLQGIGRYA 449
L+N+ LPL+ RT+AV+GP +D + G + G + +G+
Sbjct: 505 LENKENLLPLTK-NLRTIAVLGPGADDLQPGDYTPKLLPGQLKSVLTGIK---EAVGKQT 560
Query: 450 RTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEA---------EALDRAG 500
+ +++QGC D D+ A+ A+ Q+D ++V+G + EA E D A
Sbjct: 561 KVLYEQGC-DFTNPDETNIPKAVKAASQSDVVVMVLGDCSTSEATNDVRKTCGENNDWAT 619
Query: 501 LLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
L+LPG+QQEL+ V K P IL+L +G P D+ A + + W PGQ GG A+
Sbjct: 620 LILPGKQQELLEAVCATGK-PVILILQAGRPYDILKASEMCKAILVNW--LPGQEGGPAM 676
Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGH 620
AD+LFG NPGG+LPMT +P+ ++ LP+ + +RY +Y +Y FG
Sbjct: 677 ADVLFGDYNPGGRLPMT-FPR-HVGQLPL--YYNFKTSGRRYEYVDMEYYP---LYRFGF 729
Query: 621 GMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVG 680
G+SYT+F ++ +K+ +T VQ VKN+G
Sbjct: 730 GLSYTSFEYS--------------------------DLKIQEKPNGNVT--VQATVKNIG 761
Query: 681 SKDGAHTLLVFSTPPAGHWAPHK----QLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDR 736
S+ G ++ T +A K +L F+++++ G + V + +S+++
Sbjct: 762 SRAGDEVAQLYVT---DMYASVKTRVMELKDFDRIYLQPGESKTVSFELTPYD-ISLLND 817
Query: 737 SGTRRIPLGEHNIHIGGT----------KHSVSLHAATLGV 767
R + GE I +GG KHSV GV
Sbjct: 818 HMDRVVEKGEFKICVGGMSPDYVAKNEIKHSVGYSDKKKGV 858
>gi|150002739|ref|YP_001297483.1| glycoside hydrolase family protein [Bacteroides vulgatus ATCC 8482]
gi|294776994|ref|ZP_06742455.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
vulgatus PC510]
gi|149931163|gb|ABR37861.1| glycoside hydrolase family 3, candidate beta-glycosidase
[Bacteroides vulgatus ATCC 8482]
gi|294449242|gb|EFG17781.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
vulgatus PC510]
Length = 788
Score = 264 bits (675), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 201/679 (29%), Positives = 314/679 (46%), Gaps = 82/679 (12%)
Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLT-YWSPNVNIFRDPRWGRG 168
AT FP +++N L IG V + EA A+ G T +SP ++I +DPRWGR
Sbjct: 158 ATYFPAQCGQGATWNKKLIARIGEVEAKEAVAL------GYTNIYSPILDIAQDPRWGRC 211
Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVS 228
ET GEDP L G+ + LQ + + A+ KHF Y + + + V+
Sbjct: 212 VETYGEDPYLVGELGKQMITSLQ-----KYNLVATPKHFAVYSIPIGGRDGKTRTDPHVA 266
Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
+++ + PFRM E VM SYN +G P + L +R EW GY+VSD
Sbjct: 267 PREMRTLYIEPFRMAFQEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKGYVVSD 326
Query: 289 CDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC-----GPFLGLHTESAVQRGLLSEIDI 343
++V + T E+ A A+ AGL++ F+ L AV G +S+ +
Sbjct: 327 SEAVEFISNKHKVADTYEDGIAQAVNAGLNIRTHFTPPADFI-LPLRKAVDNGKISQETL 385
Query: 344 NNALVNTLTVQMRLGMFDGEPSSQPY---GHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
+ + L ++ RLG+FD PY G + V + +HQ ++LEAARQ +VLLKN+
Sbjct: 386 DKRVAEILRIKFRLGLFD-----NPYRGNGKQAEQIVHSKEHQAVSLEAARQSLVLLKNE 440
Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGR---YARTIHQQGC 457
LPLS R++AVIGPN++ +I Y T QGI + I+++GC
Sbjct: 441 TNLLPLSK-SIRSIAVIGPNANEQTQLICRYGPANAPIKTVYQGIKELLPHTEVIYKKGC 499
Query: 458 KDV--------------ACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLL 503
+ + QL AI A++QA+ ++V+G ++ E R L L
Sbjct: 500 DIIDPHFPESEILDFPKTAEEVQLMEEAIRAAKQAEVVVMVLGGNELTVREDRSRTSLNL 559
Query: 504 PGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADI 563
PGRQ+EL+ V K P ILV++ G + +A I AI+ A +PG+ G A+A+
Sbjct: 560 PGRQEELLKAVCATGK-PIILVMLDGRASSINYAA--AHIPAILHAWFPGEFCGQAVAEA 616
Query: 564 LFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMS 623
LFG NPGG+L +T +P+ + +P +P + Y +YPFGHG+S
Sbjct: 617 LFGDYNPGGRLAVT-FPKS-VGQIPFA-FPFKPGSDESSSTSVY-----GALYPFGHGLS 668
Query: 624 YTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKD 683
YT F ++ + ++ G G I H C +KN G
Sbjct: 669 YTTFTYSDLH----ISPSHQGVQGDI------------HVSCK---------IKNTGKIK 703
Query: 684 GAHTLLVFSTPPAGHWAPH-KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRI 742
G + ++ + K L FE++ + AG +Q V + + L + D++ R+
Sbjct: 704 GDEVVQLYLRDEISSVTTYTKVLRGFERISLKAGEEQTVHFRLR-PQDLGLWDKNMNFRV 762
Query: 743 PLGEHNIHIGGTKHSVSLH 761
LG + +G + + LH
Sbjct: 763 ELGSFKVMLGASSTDIRLH 781
>gi|270296173|ref|ZP_06202373.1| conserved hypothetical protein [Bacteroides sp. D20]
gi|270273577|gb|EFA19439.1| conserved hypothetical protein [Bacteroides sp. D20]
Length = 942
Score = 264 bits (675), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 207/681 (30%), Positives = 333/681 (48%), Gaps = 74/681 (10%)
Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLT-YWSPNVNIFRDPRWGRG 168
AT+FP + ++N L +G + EAR + G T ++P +++ RD RWGR
Sbjct: 173 ATNFPTQLGLGHTWNRELIRQVGLITGREARML------GYTNVYAPILDVGRDQRWGRY 226
Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVS 228
+E GE P L + VRGLQ + +VAA+ KHF AY + + ++S
Sbjct: 227 EEVYGESPYLVAELGIEMVRGLQHNH----QVAATGKHFAAYSNNKGAREGMARVDPQMS 282
Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
+++E+ PF+ + E + VM SYN +G+P L +RGE GY+VSD
Sbjct: 283 PREVENIHIYPFKRVIREAGMLGVMSSYNDYDGIPVQGSYYWLTTRLRGEMGFRGYVVSD 342
Query: 289 CDSVGVYYDTQHFTSTP-EEAAADAIRAGLDLDC-----GPFLGLHTESAVQRGLLSEID 342
D+V Y T+H T+ +EA ++ AGL++ C F+ L V+ G LSE
Sbjct: 343 SDAVEYLY-TKHGTAKDMKEAVRQSVEAGLNVRCTFRSPDSFV-LPLRELVKEGGLSEEV 400
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGP 402
IN+ + + L V+ +G+FD + G ++V +++ +AL+A+R+ IVLLKN G
Sbjct: 401 INDRVRDILRVKFLIGLFDAPYQTDLAG--ADREVEKEENEAIALQASRESIVLLKNAGE 458
Query: 403 SLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGR----YARTIHQQGCK 458
LPL + +AV GPN++ + +Y +A TT L+GI A ++ +GC
Sbjct: 459 LLPLDINSTKKIAVCGPNANEEGYALTHYGPLAVEVTTVLEGIQEKTKGKAEVLYTKGCD 518
Query: 459 ------------DVACADDQL--FGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLP 504
D DD+ A++ +RQAD I+V+G Q E R L LP
Sbjct: 519 LVDAHWPESEIIDYPLTDDEQAEIDKAVENARQADVAIVVLGGGQRTCGENKSRTSLDLP 578
Query: 505 GRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADIL 564
GRQ +L+ + K P +L+L++G P+ + +A D + AI+ A YPG GGTA+ADIL
Sbjct: 579 GRQLQLLQAIQATGK-PVVLILINGRPLSINWA--DKFVPAILEAWYPGSKGGTALADIL 635
Query: 565 FGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQS---KRYPGRTYRFYK-GPVVYPFGH 620
FG NPGGKL +T +P+ + +P +PS + PG T + +YPFG+
Sbjct: 636 FGDYNPGGKLTVT-FPKT-VGQIPFN-FPCKPSSQIDGGKNPGPTGNMSRINGALYPFGY 692
Query: 621 GMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVG 680
G+SYT F ++ + V P + +AT+ +KVT+ VQ+ +++V
Sbjct: 693 GLSYTTFEYSDLDITPRVITPNE------SATVR---LKVTNTGKRAGDEVVQLYIRDVL 743
Query: 681 SKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTR 740
S + K L F+++H+ G Q + I K+L ++D
Sbjct: 744 SSITTY---------------EKNLAGFQRIHLEPGEAQELSFTID-RKHLELLDADMKW 787
Query: 741 RIPLGEHNIHIGGTKHSVSLH 761
+ G+ + G + + L+
Sbjct: 788 VVEPGDFVLMAGASSEDIRLN 808
>gi|397691065|ref|YP_006528319.1| glycoside hydrolase family 3 protein [Melioribacter roseus P3M]
gi|395812557|gb|AFN75306.1| glycoside hydrolase family 3 protein [Melioribacter roseus P3M]
Length = 769
Score = 264 bits (675), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 223/734 (30%), Positives = 343/734 (46%), Gaps = 119/734 (16%)
Query: 78 RLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSD 137
RLGI + E LHG++ ATS+P I A++FN L E I +++
Sbjct: 110 RLGIPVI-FHEECLHGLAA-----------KDATSYPVPIGLAATFNPELIEKIFSAIAE 157
Query: 138 EARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG--SDG 195
+AR+ G LT P V++ RDPRWGR +ET GED L + + V+GLQG S
Sbjct: 158 DARS--RGAHQALT---PVVDVVRDPRWGRVEETFGEDTYLVSQMGIASVKGLQGDGSLN 212
Query: 196 DRLKVAASCKHFTAYDLDNWNGVDRFHFN---AKVSKQDIEDTFDVPFRMCVMEGKVASV 252
+ KV A+ KHF A+ G N A S++ + DTF +PF+ + + V SV
Sbjct: 213 NNNKVIATLKHFAAH------GQPESGTNCAPANFSERFLRDTFLMPFKEAIDKAGVISV 266
Query: 253 MCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTS-----TPEE 307
M SYN+++G+P+ A+ +L++ +R EW G++VSD ++ + + S E
Sbjct: 267 MASYNEIDGIPSHANKWLLRKVLRDEWNFKGFVVSDYYAITELFHKEETVSHGVAANKVE 326
Query: 308 AAADAIRAGLDL-----DCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDG 362
AA A+ AG+++ DC P L TE V+ GL E DI+ ++ L + LG+FD
Sbjct: 327 AAKLALEAGVNIEFPNPDCYPNL---TE-MVKGGLADESDIDALVLPMLKYKFELGLFD- 381
Query: 363 EPSSQPYGHLGPKDVCTPDHQ--ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPN 420
PY P Q ELAL+AAR+ I LLKN+G LPL + +AVIGPN
Sbjct: 382 ----NPYVEAEPGQFENKLEQDRELALQAARETITLLKNEGNLLPLKDFKK--IAVIGPN 435
Query: 421 SDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGCKDVA--------------C 462
+D T++G Y G YT+ QGI G+ + +GCK
Sbjct: 436 AD--RTLLGGYHGTPKYYTSVYQGIKDKVGKNGEVFYSEGCKITVGGSWNDDEVILPDPA 493
Query: 463 ADDQLFGAAIDASRQADATILVMGLDQSIEAEAL------DRAGLLLPGRQQELVSKVSM 516
D++L A+ ++++D +LV+G ++ EA DR L L GRQ +LV ++
Sbjct: 494 EDEKLINEAVAVAQKSDVAVLVLGGNEQTSREAWNKKHLGDRPSLELVGRQNKLVEEILK 553
Query: 517 ASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPM 576
K P +++L +G P + F K++ + AI+ Y GQ G A+AD+LFG NP GKLP+
Sbjct: 554 TGK-PVVVLLFNGRPNSIGFIKDN--VPAILECWYLGQETGRAVADVLFGDYNPSGKLPV 610
Query: 577 TWYPQEYITNLPMTEMAMRPSQSKRYPG--RTYRFYKGPVVYPFGHGMSYTNFVHTVANA 634
+ +P + + P+ P R Y F ++ FG+G+SYT F
Sbjct: 611 S---------IPRSAGHI-PAHYSHKPSARRGYLFDDVSPLFAFGYGLSYTKFSFD---- 656
Query: 635 PTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP 694
N +S I + V ++VKN G+ G + ++
Sbjct: 657 ---------------NLRLSKDTISADE------KVSVSIEVKNEGAIAGEEVVQLYIRD 695
Query: 695 PAGHWA-PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
P K+L F K+ + G V + + ++L+ + + GE I +G
Sbjct: 696 KVSSVTRPVKELKGFRKITLAPGQTSTVVFEL-LPEHLAFTNVDMKFTVEPGEFEIMVGN 754
Query: 754 TKHSVSLHAATLGV 767
+ L L V
Sbjct: 755 SSRDEDLIKTILTV 768
>gi|423303655|ref|ZP_17281654.1| hypothetical protein HMPREF1072_00594 [Bacteroides uniformis
CL03T00C23]
gi|423307623|ref|ZP_17285613.1| hypothetical protein HMPREF1073_00363 [Bacteroides uniformis
CL03T12C37]
gi|392688019|gb|EIY81310.1| hypothetical protein HMPREF1072_00594 [Bacteroides uniformis
CL03T00C23]
gi|392689492|gb|EIY82769.1| hypothetical protein HMPREF1073_00363 [Bacteroides uniformis
CL03T12C37]
Length = 801
Score = 264 bits (675), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 199/678 (29%), Positives = 321/678 (47%), Gaps = 82/678 (12%)
Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLT-YWSPNVNIFRDPRWGRG 168
AT FP +++N L IG + EA + G T +SP ++I +DPRWGR
Sbjct: 171 ATYFPAQCGQGATWNKELIAQIGEAEAREASVL------GYTNIYSPILDIAQDPRWGRC 224
Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVS 228
ET GEDP +G+ + LQ + K+ ++ KHF Y + + + V+
Sbjct: 225 VETYGEDPYHAGQMGKQMILSLQKN-----KLVSTPKHFAVYSIPVGGRDGKTRTDPHVA 279
Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
+++ + PFR+ E VM SYN +G P + L +R EW GY+VSD
Sbjct: 280 PREMRTLYLDPFRVAFHEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKGYVVSD 339
Query: 289 CDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC-----GPFLGLHTESAVQRGLLSEIDI 343
++V + E+A A A+ AGL++ F+ L SAV++G +S+ +
Sbjct: 340 SEAVEFISTKHQVANGYEDAVAQAVNAGLNIRTHFTPPADFI-LPLRSAVKKGKISQETL 398
Query: 344 NNALVNTLTVQMRLGMFDGEPSSQPY---GHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
N + L V+ LG+FD PY + V +P+HQ+LALEAARQ +VLLKN+
Sbjct: 399 NQRVAEILRVKFWLGLFDN-----PYRGDEKRAGQIVHSPEHQQLALEAARQSLVLLKNE 453
Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY---ARTIHQQGC 457
+LPLS R+VAVIGPN+D +I Y TT +GI + A ++++GC
Sbjct: 454 HQTLPLSK-SIRSVAVIGPNADERQQLICRYGPANAHITTIYEGIKKMLPQADVVYKKGC 512
Query: 458 KDV--------------ACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLL 503
+ A + Q+ AI+A++ A+ T++V+G ++ E R L L
Sbjct: 513 DIIDPHFPESEVLEFPKAAQEAQMMEEAIEAAKGAEVTVMVLGGNELTVREDRSRTSLDL 572
Query: 504 PGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADI 563
PGRQ+EL+ K+ K P +LV++ G + FA + AII A +PG+ GG AIA+
Sbjct: 573 PGRQEELLKKICQLGK-PVVLVMIDGRASSINFAAT--HVPAIIHAWFPGEFGGQAIAEA 629
Query: 564 LFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMS 623
LFG NPGG+L +T +P+ + +P +P + Y +YPFGHG+S
Sbjct: 630 LFGDYNPGGRLAVT-FPKS-VGQIPFA-FPFKPGSDESSETSVY-----GALYPFGHGLS 681
Query: 624 YTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKD 683
YT F ++ V++ G G+I+ + +KN+G ++
Sbjct: 682 YTTFQYS----DLVISPSKQGVQGNIS---------------------ISCTIKNIGQRE 716
Query: 684 GAHTLLVFSTPPAGHWAPHKQLV-AFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRI 742
G + ++ + Q++ FE++ + A V + + L + D+ +
Sbjct: 717 GDEVVQLYLRDEVSSVTTYTQVLRGFERITLKPEASHTVHFEL-TPQELGIWDKQMNFTV 775
Query: 743 PLGEHNIHIGGTKHSVSL 760
G + IG + + L
Sbjct: 776 EPGMFKVMIGSSSKDIRL 793
>gi|299149395|ref|ZP_07042452.1| periplasmic beta-glucosidase [Bacteroides sp. 3_1_23]
gi|298512582|gb|EFI36474.1| periplasmic beta-glucosidase [Bacteroides sp. 3_1_23]
Length = 950
Score = 264 bits (675), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 216/761 (28%), Positives = 362/761 (47%), Gaps = 107/761 (14%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISG--AAAVPRLGIKGYEWWSEALHGVSNVGP 99
+ SLP+ +RV L+ ++ ++K++L+ G +P L + EA+HG S
Sbjct: 166 YMDASLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFS---- 220
Query: 100 GTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNI 159
+G GAT FPQ + +++N L E + V+ DE A N A WSP +++
Sbjct: 221 ---YGS---GATIFPQALAMGATWNRKLTEEVAMVIGDETVAA-NTKQA----WSPVLDV 269
Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD 219
+D RWGR +ET GEDPVL + ++++G Q + + KHF + G D
Sbjct: 270 AQDARWGRCEETFGEDPVLVSQIGGAWIKGYQSRG-----LFTTPKHFGGHGAP-LGGRD 323
Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
+S++++ + VPFR + S+M +Y+ G+P +L++ +R EW
Sbjct: 324 SHDIG--LSEREMREVHLVPFRHAIRNYDCQSLMMAYSDYMGIPVAKSTELLQQILRQEW 381
Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPE-EAAADAIRAGLDLDCG-PFLGLHTESAVQRGL 337
NG+IVSDC ++G +H+T+ + EAA A+ AG+ +CG + A + G
Sbjct: 382 GFNGFIVSDCGAIGNLTARKHYTAQDKIEAANQALAAGIATNCGDTYNNKEVIQAAKDGR 441
Query: 338 LSEIDINNALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAARQGIVL 396
++ +++N L+ R +F+ P + + P + H+E+A +AAR+ IV+
Sbjct: 442 INMENLDNVCRTMLSTMFRNELFEKNPCKPLDWKKIYP-GWNSDSHKEMARQAARESIVM 500
Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSD-------VTVTMIGNYAGIACGYTTPLQGIGRYA 449
L+N+ LPL+ RT+AV+GP +D + G + G + +G+
Sbjct: 501 LENKENLLPLTK-NLRTIAVLGPGADDLQPGDYTPKLLPGQLKSVLTGIK---EAVGKQT 556
Query: 450 RTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEA---------EALDRAG 500
+ +++QGC D D+ A+ A+ Q+D ++V+G + EA E D A
Sbjct: 557 KVLYEQGC-DFTNPDETNIPKAVKAASQSDVVVMVLGDCSTSEATNDVRKTCGENNDWAT 615
Query: 501 LLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
L+LPG+QQEL+ V K P IL+L +G P D+ A + + W PGQ GG A+
Sbjct: 616 LILPGKQQELLEAVCATGK-PVILILQAGRPYDILKASEMCKAILVNW--LPGQEGGPAM 672
Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGH 620
AD+LFG NPGG+LPMT +P+ ++ LP+ + +RY +Y +Y FG
Sbjct: 673 ADVLFGDYNPGGRLPMT-FPR-HVGQLPL--YYNFKTSGRRYEYVDMEYYP---LYRFGF 725
Query: 621 GMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVG 680
G+SYT+F ++ +K+ +T VQ VKN+G
Sbjct: 726 GLSYTSFEYS--------------------------DLKIQEKPNGNVT--VQATVKNIG 757
Query: 681 SKDGAHTLLVFSTPPAGHWAPHK----QLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDR 736
S+ G ++ T +A K +L F+++++ G + V + +S+++
Sbjct: 758 SRAGDEVAQLYVT---DMYASVKTRVMELKDFDRIYLQPGESKTVSFELTPYD-ISLLND 813
Query: 737 SGTRRIPLGEHNIHIGGT----------KHSVSLHAATLGV 767
R + GE I +GG KHSV GV
Sbjct: 814 HMDRVVEKGEFKICVGGMSPDYVAKNEIKHSVGYSDNKKGV 854
>gi|424661946|ref|ZP_18098983.1| hypothetical protein HMPREF1205_02332 [Bacteroides fragilis HMW
616]
gi|404578257|gb|EKA82992.1| hypothetical protein HMPREF1205_02332 [Bacteroides fragilis HMW
616]
Length = 814
Score = 264 bits (675), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 217/687 (31%), Positives = 330/687 (48%), Gaps = 91/687 (13%)
Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
G T FP I AS++N L +GRV++ EA A G G + P +++ RDPRW R
Sbjct: 168 GTTVFPTSIGQASTWNPELIRQMGRVIAIEASA--QGAHIG---YGPVLDLARDPRWSRV 222
Query: 169 QETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
+ET GEDP L+G A+ VRG QG R V A+ KHF +Y W A +
Sbjct: 223 EETYGEDPYLNGVMGAALVRGFQGDTLRGRKSVIATLKHFASY---GWTEGGHNGGTAHL 279
Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
++++E+ PFR V G + SVM SYN+++G P +L ++ W G++VS
Sbjct: 280 GERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLLTDILKDRWLFKGFVVS 338
Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE---SAVQRGLLSEIDIN 344
D ++G + + EAA A+ AG+D D G ++ E +AV++G ++ ++
Sbjct: 339 DLYAIGGLRE-HGVAGSDYEAAVKAVNAGVDSDLGT--NVYAEQLVAAVRKGDVAMETVD 395
Query: 345 NALVNTLTVQMRLGMFDGE--PSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGP 402
A+ L+++ +G+FD +P + V +P+H LA E ARQ IVLLKN+
Sbjct: 396 KAVRRILSLKFHMGLFDAPFVDDKRP-----AQLVASPEHIGLAREVARQSIVLLKNEDK 450
Query: 403 SLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIAC--GYTTPLQGI----GRYARTIHQQG 456
LPL RT+AVIGPN+D M+G+Y T L+GI + R ++ +G
Sbjct: 451 LLPLKK-DIRTLAVIGPNADNGYNMLGDYTAPQADGSVVTVLEGIRQKVSKDTRVLYAKG 509
Query: 457 CKDVACADDQLFGAAIDASRQADATILVMG----LDQSIE-------------------A 493
C V + F AI+A+R AD ++V+G D S E
Sbjct: 510 CA-VRDSSRTGFADAIEAARSADVVVMVVGGSSARDFSSEYEETGAAKVSANRVSDMESG 568
Query: 494 EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPG 553
E DRA L L GRQ EL+ +V K P +LVL+ G P+ + + AI+ A YPG
Sbjct: 569 EGYDRATLHLMGRQLELLEEVRKLGK-PMVLVLIKGRPLLMEGVIQEA--DAILDAWYPG 625
Query: 554 QAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGP 613
GG A+AD+LFG NP G+L ++ P+ + LP+ R RY G
Sbjct: 626 MQGGNAVADVLFGDYNPAGRLTLS-VPRS-VGQLPVYYNTKRKGNRSRYIEEA-----GT 678
Query: 614 VVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQ 673
YPFG+G+SYT F +T G ++V+ + N + V
Sbjct: 679 PRYPFGYGLSYTMFSYT------------------------GMKVRVSE-ESNHCRVDVS 713
Query: 674 VDVKNVGSKDGAHTLLVFSTPPAGHW-APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLS 732
V V+N G+ DG + ++ G + P +QL AF +V + AG + + + K L+
Sbjct: 714 VTVRNQGTVDGDEVVQLYLRDEVGSFTTPDRQLRAFSRVRLKAGETREITFTLD-KKSLA 772
Query: 733 VVDRSGTRRIPLGEHNIHIGGTKHSVS 759
+ R G + G + GG+ ++
Sbjct: 773 LYMRDGEWAVEPGRFTVMAGGSSEDIA 799
>gi|161613146|ref|YP_001587111.1| hypothetical protein SPAB_00854 [Salmonella enterica subsp.
enterica serovar Paratyphi B str. SPB7]
gi|416529744|ref|ZP_11744511.1| hypothetical protein SEEM010_09961 [Salmonella enterica subsp.
enterica serovar Montevideo str. LQC 10]
gi|417466542|ref|ZP_12165045.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Montevideo str. S5-403]
gi|161362510|gb|ABX66278.1| hypothetical protein SPAB_00854 [Salmonella enterica subsp.
enterica serovar Paratyphi B str. SPB7]
gi|353629977|gb|EHC77663.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Montevideo str. S5-403]
gi|363552147|gb|EHL36453.1| hypothetical protein SEEM010_09961 [Salmonella enterica subsp.
enterica serovar Montevideo str. LQC 10]
Length = 755
Score = 264 bits (675), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 205/669 (30%), Positives = 325/669 (48%), Gaps = 103/669 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN +GRV + Y GL W+P V++ RDPRWGR
Sbjct: 108 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 161
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V+ +QG S DR V S KHF AY +N VD
Sbjct: 162 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 216
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R EW G
Sbjct: 217 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 273
Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T+ PE+A A++AG+D+ + + ++ G ++ +
Sbjct: 274 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 331
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPK+ D H++ A E AR+ +V
Sbjct: 332 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPADTNAESRLHRKEAREVARESVV 386
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + T+AV+GP +D ++G++ AG+A T L GI G A
Sbjct: 387 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 444
Query: 450 RTIHQQGCK---DVACAD----------------DQLFGAAIDASRQADATILVMGLDQS 490
+ ++ +G D D + A+ A++QAD + V+G Q
Sbjct: 445 KILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 504
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 505 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 561
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
+ G GG AIAD+LFG NP GKLP++ +P+ + +P+ T P + +Y
Sbjct: 562 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 619
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F TV++ ++ P R G + A+
Sbjct: 620 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 662
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
V+V N G ++GA + ++ A P KQL FEK+ + G ++ V
Sbjct: 663 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGERKTVSF 713
Query: 723 -INIHVCKY 730
I+I K+
Sbjct: 714 PIDIEALKF 722
>gi|375001990|ref|ZP_09726330.1| glycosyl hydrolase family 3 protein [Salmonella enterica subsp.
enterica serovar Infantis str. SARB27]
gi|353076678|gb|EHB42438.1| glycosyl hydrolase family 3 protein [Salmonella enterica subsp.
enterica serovar Infantis str. SARB27]
Length = 751
Score = 264 bits (674), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 205/669 (30%), Positives = 327/669 (48%), Gaps = 103/669 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN +GRV + Y GL W+P V++ RDPRWGR
Sbjct: 104 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 157
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V+ +QG S DR V S KHF AY +N VD
Sbjct: 158 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 212
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R EW G
Sbjct: 213 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 269
Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T+ PE+A A++AG+D+ + + ++ G ++ +
Sbjct: 270 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 327
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPK+ D H++ A E AR+ +V
Sbjct: 328 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPADTNAESRLHRKEAREVARESVV 382
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + T+AV+GP +D ++G++ AG+A T L GI G A
Sbjct: 383 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 440
Query: 450 RTIHQQGC-----KDVAC--------------ADDQLFGAAIDASRQADATILVMGLDQS 490
+ ++ +G KD+ + + A+ A++QAD + V+G Q
Sbjct: 441 KILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 500
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 501 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 557
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
+ G GG AIAD+LFG NP GKLP++ +P+ + +P+ T P + +Y
Sbjct: 558 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 615
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F TV++ ++ P R G + A+
Sbjct: 616 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 658
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
V+V N G ++GA + ++ A P KQL FEK+ + G ++ V
Sbjct: 659 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGERKTVSF 709
Query: 723 -INIHVCKY 730
I+I K+
Sbjct: 710 PIDIEALKF 718
>gi|365121891|ref|ZP_09338802.1| hypothetical protein HMPREF1033_02148 [Tannerella sp.
6_1_58FAA_CT1]
gi|363644131|gb|EHL83433.1| hypothetical protein HMPREF1033_02148 [Tannerella sp.
6_1_58FAA_CT1]
Length = 855
Score = 264 bits (674), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 162/421 (38%), Positives = 245/421 (58%), Gaps = 35/421 (8%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
+ ++ PI RV DL+ R++++EKV L+I A +PRL I Y +EALHG+ V PG
Sbjct: 32 YKDMNAPIHDRVMDLLSRMTVEEKVSLMIHNAPGIPRLEIDKYYHGNEALHGI--VRPG- 88
Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN----------GGTAGLT 151
KF T FPQ I A+S+N L I +SDEAR +N G + L+
Sbjct: 89 KF-------TVFPQAIGMAASWNPELIYKISTAISDEARGKWNALGLGKKQLDGSSDLLS 141
Query: 152 YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYD 211
+WSP VN+ RDPRWGR ET GEDP L+G ++V+GLQG+ LK A+ KHF A +
Sbjct: 142 FWSPTVNMARDPRWGRTPETYGEDPHLTGTLGCAFVKGLQGNHPKYLKAVATPKHFAANN 201
Query: 212 LDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
++ +R H NA +S++D+ + + F C++EGK S+M +YN VNG+P + ++
Sbjct: 202 EEH----NRAHCNAVISERDLREYYLPSFEKCIVEGKAQSIMTAYNAVNGIPCTVNTYLI 257
Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQH-FTSTPEEAAADAIRAGLDLDCGPFLGLHTE 330
K+ +R +W GY+V+DC S + TQH + E AA +AG D++C ++T+
Sbjct: 258 KKVLREDWGFQGYVVTDC-SAPAWMVTQHKYVKDYETAAVLMAKAGSDMECAD--NVYTQ 314
Query: 331 ---SAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELAL 387
+A +S+ DI++ + L +M LG+FD +P PY + P+ V +HQELAL
Sbjct: 315 PLLNAYYNYRVSDADIDSIAYHLLRGRMLLGLFD-DPEKNPYNKISPEKVGCKEHQELAL 373
Query: 388 EAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYT-TPLQGIG 446
E ARQ +VLLKN+ LP++ + +++AV+G N+D G+Y+G + T L+GI
Sbjct: 374 ETARQSLVLLKNENNFLPINPKKIKSIAVVGINAD--RCEFGDYSGTPVNESVTVLEGIK 431
Query: 447 R 447
R
Sbjct: 432 R 432
Score = 148 bits (374), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 101/300 (33%), Positives = 151/300 (50%), Gaps = 47/300 (15%)
Query: 466 QLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILV 525
LFG A A+++ D T+ V+G+D+SIE E DR L LP QQE + ++ + T++V
Sbjct: 594 NLFGEAGKAAKECDITVAVLGIDKSIEREGQDRYTLELPADQQEFIREIYKINPK-TVVV 652
Query: 526 LMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYIT 585
L++G I + + D I AII A YPG+ GGTA+A+ LFG NPGG+LP+T+Y +
Sbjct: 653 LVAGSSIAINWI--DENIPAIIDAWYPGEQGGTAVAEALFGKYNPGGRLPLTFYNS--MD 708
Query: 586 NL-PMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDG 644
L P + A++ GRTY+++ G +Y FG+G+SYT F + N +A D
Sbjct: 709 ELPPFDDYAVK-------KGRTYQYFTGKPLYEFGYGLSYTKFNYRKLN----IASKQD- 756
Query: 645 RHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHK 703
T+ +Q + N G DG V+ P G + P K
Sbjct: 757 ------------------------TINIQFSISNTGKYDGDEVAQVYVQYPETGTYMPIK 792
Query: 704 QLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRI--PLGEHNIHIGGTKHSVSLH 761
QL F++VH+ G Q V I+I K L D TR+ P G + +G + ++L
Sbjct: 793 QLKGFKRVHIKKGQTQNVSISIPK-KELRYWDEK-TRKFVTPSGNYIFQVGSSSQRINLQ 850
>gi|416539671|ref|ZP_11750078.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
subsp. enterica serovar Montevideo str. SARB30]
gi|416551719|ref|ZP_11756625.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
subsp. enterica serovar Montevideo str. 29N]
gi|416566632|ref|ZP_11763924.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
subsp. enterica serovar Montevideo str. 4441 H]
gi|417327454|ref|ZP_12112866.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Adelaide str. A4-669]
gi|418868407|ref|ZP_13422850.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
subsp. enterica serovar Newport str. CVM 4176]
gi|353570549|gb|EHC34783.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Adelaide str. A4-669]
gi|363559751|gb|EHL43903.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
subsp. enterica serovar Montevideo str. SARB30]
gi|363566134|gb|EHL50153.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
subsp. enterica serovar Montevideo str. 29N]
gi|363579468|gb|EHL63250.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
subsp. enterica serovar Montevideo str. 4441 H]
gi|392838313|gb|EJA93877.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
subsp. enterica serovar Newport str. CVM 4176]
Length = 765
Score = 264 bits (674), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 205/669 (30%), Positives = 325/669 (48%), Gaps = 103/669 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN +GRV + Y GL W+P V++ RDPRWGR
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V+ +QG S DR V S KHF AY +N VD
Sbjct: 172 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R EW G
Sbjct: 227 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 283
Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T+ PE+A A++AG+D+ + + ++ G ++ +
Sbjct: 284 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 341
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPK+ D H++ A E AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPADTNAESRLHRKEAREVARESVV 396
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + T+AV+GP +D ++G++ AG+A T L GI G A
Sbjct: 397 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 454
Query: 450 RTIHQQGCK---DVACAD----------------DQLFGAAIDASRQADATILVMGLDQS 490
+ ++ +G D D + A+ A++QAD + V+G Q
Sbjct: 455 KILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 514
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 515 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
+ G GG AIAD+LFG NP GKLP++ +P+ + +P+ T P + +Y
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 629
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F TV++ ++ P R G + A+
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 672
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
V+V N G ++GA + ++ A P KQL FEK+ + G ++ V
Sbjct: 673 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGERKTVSF 723
Query: 723 -INIHVCKY 730
I+I K+
Sbjct: 724 PIDIEALKF 732
>gi|329963634|ref|ZP_08301109.1| glycosyl hydrolase family 3 protein [Bacteroides fluxus YIT 12057]
gi|328528179|gb|EGF55158.1| glycosyl hydrolase family 3 protein [Bacteroides fluxus YIT 12057]
Length = 863
Score = 264 bits (674), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 173/454 (38%), Positives = 245/454 (53%), Gaps = 35/454 (7%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
+ SLP +R L+ L+L+EKV L++ GA V RLGIK Y WW+EALHGV+ G
Sbjct: 23 YKTTSLPPEERAELLLKELTLEEKVSLMMDGAKPVERLGIKPYNWWNEALHGVARAGL-- 80
Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYW 153
AT FPQ I A+SF+ + VSDEARA + + GLT W
Sbjct: 81 --------ATVFPQPIGMAASFSPEAVYGVFDAVSDEARAKNSYYASQGSYERYQGLTMW 132
Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL 212
+P VNI+RDPRWGRG ET GEDP L+ + V+GLQG +DG K+ A KHF +
Sbjct: 133 TPTVNIYRDPRWGRGIETYGEDPYLTSRMGVEVVKGLQGGNDGRYDKLHACAKHFAVHSG 192
Query: 213 DNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
WN R FNA+ + +D+ +T+ PF V EGKV VMC+YN+ G P C +L
Sbjct: 193 PEWN---RHEFNAENIKPRDLYETYLPPFEALVKEGKVKEVMCAYNRFEGDPCCGSNRLL 249
Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCGPFLGLHT 329
+ +R EW +G ++SDC ++ +Y+ + + P E A+A A+ +G DL+CG
Sbjct: 250 MQILRDEWGFDGIVLSDCGAIADFYNDRGHKTHPDAESASAAAVLSGTDLECGSSYKALV 309
Query: 330 ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEA 389
E AV+ G + E ++ A+ LT + LG D EP + + V + H LAL+
Sbjct: 310 E-AVKLGKIEEKAVDVAVKRLLTARFALGEMD-EPDKVSWTKIPFSVVASAKHDSLALDI 367
Query: 390 ARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI---- 445
AR+ + LL N+ LPL TVAV+GPN++ +V GNY G+ T L+G+
Sbjct: 368 ARKSMTLLMNKNDILPLKQ-GGLTVAVMGPNANDSVMQWGNYNGMPSHTVTILEGVRKAL 426
Query: 446 GRYARTIHQQGCKDVACADDQLFGAAIDASRQAD 479
G + I++QGC D L +A + + AD
Sbjct: 427 GADDKLIYEQGC---GWVDRSLIQSAFNLCKSAD 457
Score = 119 bits (297), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 89/289 (30%), Positives = 137/289 (47%), Gaps = 47/289 (16%)
Query: 482 ILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGP 531
I V G+ S+E E + DR + LP Q+E+++ + + K I V SG P
Sbjct: 604 IFVGGISPSLEGEEMGVNLPGFKKGDRTDIELPAVQREMIAALHRSGK-KVIFVNCSGSP 662
Query: 532 IDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTE 591
I AI+ A YPGQAGGTA+A++LFG NP G+LP+T+Y ++ LP E
Sbjct: 663 I--GLEPETKECEAILQAWYPGQAGGTAVAEVLFGKYNPAGRLPVTFYRN--VSQLPDFE 718
Query: 592 MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINA 651
GRTYR+ ++PFG+G+SYT F + + LD INA
Sbjct: 719 -------DYDMSGRTYRYMTQEPLFPFGYGLSYTAFEYG--------QMVLD--QNKINA 761
Query: 652 TISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKV 711
G+ +++T V V N G +DG + V+ P+K L AF++V
Sbjct: 762 ---GQTLELT------------VPVTNAGKRDGEEVVQVYLRKRDDTDGPNKTLRAFKRV 806
Query: 712 HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
++PAG V + + + T R+ G+++I +GG+ + L
Sbjct: 807 NIPAGKTVNVDFELKDKELEWWDTATNTMRVCAGDYDIMVGGSSQTEDL 855
>gi|255545654|ref|XP_002513887.1| hypothetical protein RCOM_1034150 [Ricinus communis]
gi|223546973|gb|EEF48470.1| hypothetical protein RCOM_1034150 [Ricinus communis]
Length = 246
Score = 264 bits (674), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 126/229 (55%), Positives = 162/229 (70%), Gaps = 15/229 (6%)
Query: 12 LILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLIS 71
LI + A S A + PF+CD + +T++ FCQ SLPI QR DL+ RL+L EK+ L+
Sbjct: 18 LIFIFADS---APQPPFSCDYSNPSTKSYLFCQTSLPISQRAKDLVSRLTLDEKISQLVD 74
Query: 72 GAAAVPRLGIKGYEWWSEALHGVSNV-----GPGTKFGGDFPGATSFPQVITTASSFNAT 126
A ++PRLG+ Y WWSEALHGV+ + G F G ATSFPQVI TA+SF+
Sbjct: 75 TAPSIPRLGVPAYRWWSEALHGVAFLPDVSERQGIHFNGTIRSATSFPQVIITAASFDTK 134
Query: 127 LWEAIGRVVSDEARAMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAAS 185
LW IG+V EARA+YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP+++GKYA S
Sbjct: 135 LWYRIGQVTGKEARAIYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPMVTGKYAVS 194
Query: 186 YVRGLQGSD------GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVS 228
+VRG+QG G++L+ +A CKHFTAYDL+ WNG +RF FNA+VS
Sbjct: 195 FVRGVQGDSFEGGVLGEKLQASACCKHFTAYDLEKWNGTNRFIFNAQVS 243
>gi|419728314|ref|ZP_14255280.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41579]
gi|419736497|ref|ZP_14263337.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41563]
gi|419741060|ref|ZP_14267772.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41573]
gi|419745173|ref|ZP_14271816.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41566]
gi|419746816|ref|ZP_14273391.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41565]
gi|421574444|ref|ZP_16020065.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Heidelberg str. CFSAN00325]
gi|421579948|ref|ZP_16025510.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Heidelberg str. CFSAN00326]
gi|421585722|ref|ZP_16031214.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Heidelberg str. CFSAN00328]
gi|381291806|gb|EIC33035.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41563]
gi|381293004|gb|EIC34177.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41573]
gi|381299851|gb|EIC40919.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41579]
gi|381306597|gb|EIC47470.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41566]
gi|381320690|gb|EIC61232.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41565]
gi|402521423|gb|EJW28761.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Heidelberg str. CFSAN00326]
gi|402524644|gb|EJW31941.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Heidelberg str. CFSAN00325]
gi|402529572|gb|EJW36805.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Heidelberg str. CFSAN00328]
Length = 765
Score = 264 bits (674), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 205/669 (30%), Positives = 327/669 (48%), Gaps = 103/669 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN +GRV + Y GL W+P V++ RDPRWGR
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V+ +QG S DR V S KHF AY +N VD
Sbjct: 172 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R EW G
Sbjct: 227 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 283
Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T+ PE+A A++AG+D+ + + ++ G ++ +
Sbjct: 284 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 341
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPK+ D H++ A E AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPADTNAESRLHRKEAREVARESVV 396
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + T+AV+GP +D ++G++ AG+A T L GI G A
Sbjct: 397 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 454
Query: 450 RTIHQQGC-----KDVAC--------------ADDQLFGAAIDASRQADATILVMGLDQS 490
+ ++ +G KD+ + + A+ A++QAD + V+G Q
Sbjct: 455 KILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 514
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 515 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
+ G GG AIAD+LFG NP GKLP++ +P+ + +P+ T P + +Y
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 629
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F TV++ ++ P R G + A+
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 672
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
V+V N G ++GA + ++ A P KQL FEK+ + G ++ V
Sbjct: 673 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGERKTVSF 723
Query: 723 -INIHVCKY 730
I+I K+
Sbjct: 724 PIDIEALKF 732
>gi|334124834|ref|ZP_08498829.1| periplasmic beta-glucosidase [Enterobacter hormaechei ATCC 49162]
gi|333387905|gb|EGK59096.1| periplasmic beta-glucosidase [Enterobacter hormaechei ATCC 49162]
Length = 771
Score = 264 bits (674), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 212/689 (30%), Positives = 336/689 (48%), Gaps = 102/689 (14%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN + +GRV + Y GL W+P V++ RDPRWGR
Sbjct: 124 TVFPISLGLASSFNLDAVKTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 177
Query: 170 ETPGEDPVLSGKYAASYVRGLQ-GSDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V +Q S DR V S KHF AY +N VD
Sbjct: 178 EGFGEDTFLTATMGKTMVEAMQRKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 232
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R +W G
Sbjct: 233 --MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDQWGFKGI 289
Query: 285 IVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T S PE+A A+++G+++ + + V+ G ++ +
Sbjct: 290 TVSDHGAIKEL--IKHGTASDPEDAVRVALKSGINMSMSDEYYSKYLPGLVKSGKVTMAE 347
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPKD D H++ A E AR+ +V
Sbjct: 348 LDDAARHVLNVKYDMGLFN-----DPYSHLGPKDSDPTDTNAESRLHRKEAREVARESLV 402
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + T+AV+GP +D ++G++ AG+A T L GI G A
Sbjct: 403 LLKNRLDTLPLK--KSGTIAVVGPLADSKRDVMGSWSAAGVADQSVTVLTGIKSAVGDNA 460
Query: 450 RTIHQQGC-----KDVACADDQ--------------LFGAAIDASRQADATILVMGLDQS 490
+ ++ +G KD+ +Q + A++A++Q+D + V+G Q
Sbjct: 461 KVVYAKGANVTDDKDIVTFLNQYEEAVKVDPRTPKEMIDEAVNAAKQSDVVVAVVGEAQG 520
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 521 MAHEASSRTDITIPQSQRDLIAALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 577
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM---RP---SQSKRYPG 604
+ G GG AIAD+LFG NP GKLPM+ +P+ + +P+ + RP + +Y
Sbjct: 578 FAGTEGGNAIADVLFGDYNPSGKLPMS-FPRS-VGQIPVYYSHLNTGRPYNADKPNKYTS 635
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F TV++ ++ P R GS+ A+
Sbjct: 636 RYFDEANGP-LYPFGYGLSYTTF--TVSDVK--MSAPTMKRDGSVTAS------------ 678
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGI 723
VDV N G ++GA + ++ A P KQL FEKV++ G + V
Sbjct: 679 ---------VDVTNSGKREGATVIQMYVQDVTASMSRPVKQLRGFEKVNLKPGETRTVSF 729
Query: 724 NIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
I V L ++ G+ N+ IG
Sbjct: 730 PIDVNA-LKFWNQQMKYDAEPGKFNVFIG 757
>gi|189464698|ref|ZP_03013483.1| hypothetical protein BACINT_01042 [Bacteroides intestinalis DSM
17393]
gi|189436972|gb|EDV05957.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 862
Score = 264 bits (674), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 157/451 (34%), Positives = 240/451 (53%), Gaps = 33/451 (7%)
Query: 38 RTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNV 97
+TLP+ L +R DL+ RL+L+EK L+ + A+PRLGIK + WWSEALHGV+N
Sbjct: 19 QTLPYQNPELSPAERAKDLVKRLTLEEKALLMCDDSEAIPRLGIKKFNWWSEALHGVANQ 78
Query: 98 GPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG-------- 149
G T FP+ + A+SFN L I VSDE RA +N
Sbjct: 79 G----------NVTVFPEPVGMAASFNDKLVFEIFNAVSDEMRAKHNERVRNGLEDVRFH 128
Query: 150 -LTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASC-KHF 207
L+ W+PNVNIFRDPRWGRGQET GEDP L+ + + V+GLQG + ++ + +C KH+
Sbjct: 129 SLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGIAVVKGLQGPENEKYRKLLACAKHY 188
Query: 208 TAYDLDNWNGVDRFHFNA-KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCA 266
+ W+ R N VS +D+ +T+ F+ V + V VMC+Y +++ P C
Sbjct: 189 AVHSGPEWS---RHTANLNNVSPRDLWETYLPAFKALVQKADVREVMCAYQRLDDDPCCG 245
Query: 267 DPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFL 325
+ +L++ +R EW +VSDC ++ ++ + +S AA AG D++CG +
Sbjct: 246 NTRLLQQILRDEWGFKYLVVSDCGAIADFWTSHKSSSDAVHAAVKGTMAGTDVECGYGYA 305
Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
AV RGL++E ++N ++ + + LG D +PS + + V H++L
Sbjct: 306 YQKLPEAVSRGLITEEEVNKHVLRLMEGRFELGEMD-DPSLVNWTKIPMSVVNCKAHKDL 364
Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
+L +RQ + LL+N+ LPLS R +AVIGPN+D + GNY G T L G
Sbjct: 365 SLNMSRQTMTLLQNKNNVLPLSK-SIRKIAVIGPNADDKPMLWGNYNGTPNQTITILDGF 423
Query: 446 G---RYARTIHQQGCKDVACADDQLFGAAID 473
+ + ++ +GC V +DQ + +D
Sbjct: 424 KSKLKKNQIVYMKGCDLV---NDQTLESYLD 451
Score = 98.6 bits (244), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 83/315 (26%), Positives = 129/315 (40%), Gaps = 62/315 (19%)
Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
+ +I + D + V G+ +E E + DR + LP Q+ + + A
Sbjct: 590 YSESISKLKGIDVVVFVGGISPQLEGEEMPVNIPGFKGGDRTDIELPAVQRNFLKALKDA 649
Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
K + V SG + A AI+ A Y G+ GG A+AD+LFG NP GKLP+T
Sbjct: 650 GK-QVVFVNCSGSSM--ALLPETESCDAILQAWYGGELGGYAVADVLFGDYNPSGKLPVT 706
Query: 578 WYPQEYITNLPMTE-MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
+Y LP E +M+ GRTYR+ P ++PFG G+SYT+F
Sbjct: 707 FYKN--TKQLPDYEDYSMK--------GRTYRYMSDP-LFPFGFGLSYTDFA-------- 747
Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRL------TLGVQVDVKNVGSKDGAHTLLV 690
+ A CN+ +L + V V N G + G + V
Sbjct: 748 -----------------------IGTASCNKTQLRTDESLTLTVPVSNTGKRSGTEIVQV 784
Query: 691 FSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIH 750
+ P K L A+ +V + GAQQ V I + + + T R+ GE+ +
Sbjct: 785 YIRKTDDADGPLKSLKAYARVELATGAQQDVKIELPSESFECFDPSTNTMRVAPGEYELF 844
Query: 751 IGGTKHSVSLHAATL 765
G + + L + +
Sbjct: 845 YGTSSAAQDLQSVKV 859
>gi|378984729|ref|YP_005247884.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
subsp. enterica serovar Typhimurium str. T000240]
gi|312913157|dbj|BAJ37131.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
subsp. enterica serovar Typhimurium str. T000240]
Length = 765
Score = 263 bits (673), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 205/669 (30%), Positives = 325/669 (48%), Gaps = 103/669 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN +GRV + Y GL W+P V++ RDPRWGR
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V+ +QG S DR V S KHF AY +N VD
Sbjct: 172 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R EW G
Sbjct: 227 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 283
Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T+ PE+A A++AG+D+ + + ++ G ++ +
Sbjct: 284 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 341
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPK+ D H++ A E AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 396
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + T+AV+GP +D ++G++ AG+A T L GI G A
Sbjct: 397 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 454
Query: 450 RTIHQQGCK---DVACAD----------------DQLFGAAIDASRQADATILVMGLDQS 490
+ ++ +G D D + A+ A++QAD + V+G Q
Sbjct: 455 KILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQE 514
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 515 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
+ G GG AIAD+LFG NP GKLP++ +P+ + +P+ T P + +Y
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 629
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F TV++ ++ P R G + A+
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 672
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
V+V N G ++GA + ++ A P KQL FEK+ + G ++ V
Sbjct: 673 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGERKTVSF 723
Query: 723 -INIHVCKY 730
I+I K+
Sbjct: 724 PIDIEALKF 732
>gi|198245676|ref|YP_002216250.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Dublin str. CT_02021853]
gi|375119733|ref|ZP_09764900.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Dublin str. SD3246]
gi|197940192|gb|ACH77525.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Dublin str. CT_02021853]
gi|326624000|gb|EGE30345.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Dublin str. SD3246]
Length = 755
Score = 263 bits (673), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 205/669 (30%), Positives = 327/669 (48%), Gaps = 103/669 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN +GRV + Y GL W+P V++ RDPRWGR
Sbjct: 108 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 161
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V+ +QG S DR V S KHF AY +N VD
Sbjct: 162 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 216
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R EW G
Sbjct: 217 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 273
Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T+ PE+A A++AG+D+ + + ++ G ++ +
Sbjct: 274 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 331
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPK+ D H++ A E AR+ +V
Sbjct: 332 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 386
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + T+AV+GP +D ++G++ AG+A T L GI G A
Sbjct: 387 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 444
Query: 450 RTIHQQGC-----KDVAC--------------ADDQLFGAAIDASRQADATILVMGLDQS 490
+ ++ +G KD+ + + A+ A++QAD + V+G Q
Sbjct: 445 KILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADMVVAVVGESQG 504
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 505 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 561
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
+ G GG AIAD+LFG NP GKLP++ +P+ + +P+ T P + +Y
Sbjct: 562 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 619
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F TV++ ++ P R G + A+
Sbjct: 620 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 662
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
V+V N G ++GA + ++ A P KQL FEK+ + G ++ V
Sbjct: 663 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGERKTVSF 713
Query: 723 -INIHVCKY 730
I+I K+
Sbjct: 714 PIDIEALKF 722
>gi|194736410|ref|YP_002115255.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Schwarzengrund str. CVM19633]
gi|197300833|ref|ZP_02661390.2| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
subsp. enterica serovar Schwarzengrund str. SL480]
gi|204929066|ref|ZP_03220209.1| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
subsp. enterica serovar Javiana str. GA_MM04042433]
gi|205360240|ref|ZP_02681470.2| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
subsp. enterica serovar Hadar str. RI_05P066]
gi|416491454|ref|ZP_11727088.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Montevideo str. OH_2009072675]
gi|416496191|ref|ZP_11729048.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Montevideo str. CASC_09SCPH15965]
gi|416586610|ref|ZP_11775622.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Montevideo str. MD_MDA09249507]
gi|416674556|ref|ZP_11821234.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Montevideo str. MB102109-0047]
gi|417359373|ref|ZP_12133763.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar Give
str. S5-487]
gi|417391848|ref|ZP_12154881.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Minnesota str. A4-603]
gi|417476332|ref|ZP_12170882.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Rubislaw str. A4-653]
gi|417511967|ref|ZP_12176427.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Senftenberg str. A4-543]
gi|194711912|gb|ACF91133.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Schwarzengrund str. CVM19633]
gi|197290594|gb|EDY29949.1| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
subsp. enterica serovar Schwarzengrund str. SL480]
gi|204321610|gb|EDZ06809.1| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
subsp. enterica serovar Javiana str. GA_MM04042433]
gi|205350836|gb|EDZ37467.1| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
subsp. enterica serovar Hadar str. RI_05P066]
gi|322649375|gb|EFY45811.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Montevideo str. OH_2009072675]
gi|322656495|gb|EFY52784.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Montevideo str. CASC_09SCPH15965]
gi|322667165|gb|EFY63332.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Montevideo str. MD_MDA09249507]
gi|323222876|gb|EGA07232.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Montevideo str. MB102109-0047]
gi|353589745|gb|EHC48461.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar Give
str. S5-487]
gi|353614040|gb|EHC65987.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Minnesota str. A4-603]
gi|353641236|gb|EHC86018.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Rubislaw str. A4-653]
gi|353641325|gb|EHC86084.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Senftenberg str. A4-543]
Length = 755
Score = 263 bits (673), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 205/669 (30%), Positives = 327/669 (48%), Gaps = 103/669 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN +GRV + Y GL W+P V++ RDPRWGR
Sbjct: 108 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 161
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V+ +QG S DR V S KHF AY +N VD
Sbjct: 162 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 216
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R EW G
Sbjct: 217 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 273
Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T+ PE+A A++AG+D+ + + ++ G ++ +
Sbjct: 274 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 331
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPK+ D H++ A E AR+ +V
Sbjct: 332 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 386
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + T+AV+GP +D ++G++ AG+A T L GI G A
Sbjct: 387 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 444
Query: 450 RTIHQQGC-----KDVAC--------------ADDQLFGAAIDASRQADATILVMGLDQS 490
+ ++ +G KD+ + + A+ A++QAD + V+G Q
Sbjct: 445 KILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 504
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 505 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 561
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
+ G GG AIAD+LFG NP GKLP++ +P+ + +P+ T P + +Y
Sbjct: 562 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 619
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F TV++ ++ P R G + A+
Sbjct: 620 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 662
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
V+V N G ++GA + ++ A P KQL FEK+ + G ++ V
Sbjct: 663 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGERKTVSF 713
Query: 723 -INIHVCKY 730
I+I K+
Sbjct: 714 PIDIEALKF 722
>gi|423300729|ref|ZP_17278753.1| hypothetical protein HMPREF1057_01894 [Bacteroides finegoldii
CL09T03C10]
gi|408472616|gb|EKJ91142.1| hypothetical protein HMPREF1057_01894 [Bacteroides finegoldii
CL09T03C10]
Length = 735
Score = 263 bits (673), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 220/754 (29%), Positives = 352/754 (46%), Gaps = 97/754 (12%)
Query: 48 PIPQRVNDLIGRLSLQEKVKLLI--------------SGAAAVP-RLGIKGYEWWSEALH 92
PI +RV+DL+ R++L+EKV L VP +G Y + L
Sbjct: 36 PIEKRVDDLLSRMTLEEKVMQLNQYTLGRNNNVNNVGEEVKKVPAEIGSLIYFETNPELR 95
Query: 93 G--------VSNVGPGTKFGGDFPGA--TSFPQVITTASSFNATLWEAIGRVVSDEARAM 142
S +G FG D T +P + A S+N L E V + EAR
Sbjct: 96 NNMQKKAMEESRLGIPIIFGYDAIHGFRTVYPISLAQACSWNPDLVEQACAVSAQEAR-- 153
Query: 143 YNGGTAGLTY-WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD-GDRLKV 200
+G+ + +SP +++ RDPRWGR E GEDP +G + A+ VRG QG + +V
Sbjct: 154 ----MSGVDWTFSPMIDVARDPRWGRVAEGYGEDPYANGVFGAASVRGYQGDNMSAENRV 209
Query: 201 AASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVN 260
AA KH+ Y R + ++SKQ + DT+ +P++M V G A++M S+N ++
Sbjct: 210 AACLKHYVGYGASE---AGRDYVYTEISKQTLWDTYLLPYKMGVKAG-AATLMSSFNDIS 265
Query: 261 GVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLD 320
GVP A+P + ++ WR +G+IVSD ++ Q +T +EAA A AGL++D
Sbjct: 266 GVPGSANPYTMTEILKNRWRHDGFIVSDWGAIE-QLKNQGLAATKKEAARHAFTAGLEMD 324
Query: 321 CGPF-LGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKD--V 377
H + V+ G +S ++ A+ L ++ RLG+F+ +PY + +
Sbjct: 325 MMSHAYDRHLQELVEEGKVSMAQVDEAVRRVLLLKFRLGLFE-----RPYTPVTTEKERF 379
Query: 378 CTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG---- 433
P ++A A + +VLLKN+ LPL+ + +AVIGP + ++G++ G
Sbjct: 380 LRPQSMDIAARLAAESMVLLKNENNVLPLAD--KKKIAVIGPMAKNGWDLLGSWRGHGKD 437
Query: 434 --IACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSI 491
+ Y A + GC + + + F A+ A+R +D +L +G +
Sbjct: 438 TDVVMLYDGLAAEFAGKAELRYALGC-NTKGDNREGFAEALGAARWSDVVVLCLGEMMTW 496
Query: 492 EAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGY 551
E R+ + LP Q+EL ++ K P +L+L++G P+++ + +P AI+
Sbjct: 497 SGENASRSSIALPQMQEELAKELKKVGK-PVVLILVNGRPLEL--NRLEPVSDAILEIWQ 553
Query: 552 PGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYR-FY 610
PG G +A IL G NP GKL MT P + + ++R GR ++ FY
Sbjct: 554 PGVNGALPMAGILSGRINPSGKLAMT---------FPYSTGQIPIYYNRRKSGRGHQGFY 604
Query: 611 KGPV---VYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNR 667
K +YPFGHG+SYT F + T++ A KV +
Sbjct: 605 KDMTSDPLYPFGHGLSYTEFKY---------------------GTVTPSATKVKRGE--- 640
Query: 668 LTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIH 726
L +V V N+G++DGA T+ F S P P K+L FEK + AG + +I
Sbjct: 641 -KLSAEVTVTNIGARDGAETVHWFISDPYCSITRPVKELKHFEKQLIKAGETKTFRFDID 699
Query: 727 VCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
+ + V+ G R + GE+NIH+ G + L
Sbjct: 700 LERDFGFVNEDGKRFLETGEYNIHVLGQTVKIEL 733
>gi|16765496|ref|NP_461111.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
subsp. enterica serovar Typhimurium str. LT2]
gi|378445601|ref|YP_005233233.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Typhimurium str. D23580]
gi|378450850|ref|YP_005238209.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
subsp. enterica serovar Typhimurium str. 14028S]
gi|378700080|ref|YP_005182037.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Typhimurium str. SL1344]
gi|378989553|ref|YP_005252717.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
subsp. enterica serovar Typhimurium str. UK-1]
gi|379701394|ref|YP_005243122.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
subsp. enterica serovar Typhimurium str. ST4/74]
gi|422026439|ref|ZP_16372832.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm1]
gi|422031463|ref|ZP_16377632.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm2]
gi|427551058|ref|ZP_18928136.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm8]
gi|427567333|ref|ZP_18932851.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm9]
gi|427587543|ref|ZP_18937641.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm3]
gi|427611175|ref|ZP_18942507.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm4]
gi|427634848|ref|ZP_18947401.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm6]
gi|427656473|ref|ZP_18952166.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm10]
gi|427661625|ref|ZP_18957079.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm11]
gi|427670100|ref|ZP_18961878.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm12]
gi|20141276|sp|Q56078.2|BGLX_SALTY RecName: Full=Periplasmic beta-glucosidase; AltName:
Full=Beta-D-glucoside glucohydrolase; AltName:
Full=Cellobiase; AltName: Full=Gentiobiase; AltName:
Full=T-cell inhibitor; Flags: Precursor
gi|16420702|gb|AAL21070.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
subsp. enterica serovar Typhimurium str. LT2]
gi|261247380|emb|CBG25205.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Typhimurium str. D23580]
gi|267994228|gb|ACY89113.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
subsp. enterica serovar Typhimurium str. 14028S]
gi|301158728|emb|CBW18240.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Typhimurium str. SL1344]
gi|323130493|gb|ADX17923.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
subsp. enterica serovar Typhimurium str. ST4/74]
gi|332989100|gb|AEF08083.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
subsp. enterica serovar Typhimurium str. UK-1]
gi|414017758|gb|EKT01456.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm1]
gi|414018535|gb|EKT02182.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm8]
gi|414020221|gb|EKT03810.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm2]
gi|414032204|gb|EKT15215.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm9]
gi|414033625|gb|EKT16574.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm3]
gi|414036785|gb|EKT19597.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm4]
gi|414046773|gb|EKT29088.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm10]
gi|414047776|gb|EKT30042.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm6]
gi|414052520|gb|EKT34556.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm11]
gi|414059327|gb|EKT40912.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm12]
Length = 765
Score = 263 bits (673), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 205/669 (30%), Positives = 325/669 (48%), Gaps = 103/669 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN +GRV + Y GL W+P V++ RDPRWGR
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V+ +QG S DR V S KHF AY +N VD
Sbjct: 172 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R EW G
Sbjct: 227 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 283
Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T+ PE+A A++AG+D+ + + ++ G ++ +
Sbjct: 284 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 341
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPK+ D H++ A E AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 396
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + T+AV+GP +D ++G++ AG+A T L GI G A
Sbjct: 397 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 454
Query: 450 RTIHQQGCK---DVACAD----------------DQLFGAAIDASRQADATILVMGLDQS 490
+ ++ +G D D + A+ A++QAD + V+G Q
Sbjct: 455 KILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 514
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 515 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
+ G GG AIAD+LFG NP GKLP++ +P+ + +P+ T P + +Y
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 629
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F TV++ ++ P R G + A+
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 672
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
V+V N G ++GA + ++ A P KQL FEK+ + G ++ V
Sbjct: 673 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGERKTVSF 723
Query: 723 -INIHVCKY 730
I+I K+
Sbjct: 724 PIDIEALKF 732
>gi|417519247|ref|ZP_12181440.1| Periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Uganda str. R8-3404]
gi|353647113|gb|EHC90324.1| Periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Uganda str. R8-3404]
Length = 755
Score = 263 bits (673), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 205/669 (30%), Positives = 324/669 (48%), Gaps = 103/669 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN +GRV + Y GL W+P V++ RDPRWGR
Sbjct: 108 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 161
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V+ +QG S DR V S KHF AY +N VD
Sbjct: 162 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 216
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R EW G
Sbjct: 217 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 273
Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T+ PE+A A++AG+D+ + + ++ G ++ +
Sbjct: 274 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 331
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPK+ D H++ A E AR+ +V
Sbjct: 332 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPADTNAESRLHRKEAREVARESVV 386
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + T+AV+GP +D ++G++ AG+A T L GI G A
Sbjct: 387 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 444
Query: 450 RTIHQQGCK---DVACAD----------------DQLFGAAIDASRQADATILVMGLDQS 490
+ ++ +G D D + A+ A++QAD + V+G Q
Sbjct: 445 KILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 504
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 505 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 561
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
+ G GG AIAD+LFG NP GKLP++ +P+ + +P+ T P + +Y
Sbjct: 562 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 619
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F TV++ ++ P R G + A+
Sbjct: 620 RYFDEVNGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 662
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
V+V N G ++GA + ++ A P KQL FEK+ + G + V
Sbjct: 663 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGESKTVSF 713
Query: 723 -INIHVCKY 730
I+I K+
Sbjct: 714 PIDIEALKF 722
>gi|416424053|ref|ZP_11691321.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 315996572]
gi|416434178|ref|ZP_11697512.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 495297-1]
gi|416438422|ref|ZP_11699509.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 495297-3]
gi|416447728|ref|ZP_11705981.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 495297-4]
gi|416450525|ref|ZP_11707600.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 515920-1]
gi|416460485|ref|ZP_11714793.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 515920-2]
gi|416470222|ref|ZP_11718747.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 531954]
gi|416476726|ref|ZP_11721214.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. NC_MB110209-0054]
gi|416508101|ref|ZP_11735884.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. SARB31]
gi|416513698|ref|ZP_11738019.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. ATCC BAA710]
gi|416540687|ref|ZP_11750492.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 19N]
gi|416561671|ref|ZP_11761668.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 42N]
gi|416575337|ref|ZP_11768369.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 81038-01]
gi|416594582|ref|ZP_11780414.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 414877]
gi|416602160|ref|ZP_11785217.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 366867]
gi|416607231|ref|ZP_11788413.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 413180]
gi|416611818|ref|ZP_11791047.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 446600]
gi|416623896|ref|ZP_11797678.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 609458-1]
gi|416630098|ref|ZP_11800505.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 556150-1]
gi|416636573|ref|ZP_11803136.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 609460]
gi|416652084|ref|ZP_11811486.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Montevideo str. 507440-20]
gi|416655624|ref|ZP_11812629.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 556152]
gi|416669115|ref|ZP_11819140.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. MB101509-0077]
gi|416700997|ref|ZP_11829262.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. MB110209-0055]
gi|416705382|ref|ZP_11830863.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. MB111609-0052]
gi|416713638|ref|ZP_11837193.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 2009083312]
gi|416720123|ref|ZP_11841889.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 2009085258]
gi|416721559|ref|ZP_11842724.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 315731156]
gi|416729877|ref|ZP_11848338.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2009159199]
gi|416738151|ref|ZP_11853179.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008282]
gi|416744242|ref|ZP_11856524.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008283]
gi|416759732|ref|ZP_11864557.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Montevideo str. IA_2010008284]
gi|416760387|ref|ZP_11864780.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008285]
gi|416767537|ref|ZP_11869997.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008287]
gi|418484908|ref|ZP_13053898.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 80959-06]
gi|418492113|ref|ZP_13058613.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035278]
gi|418496941|ref|ZP_13063366.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035318]
gi|418500305|ref|ZP_13066703.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035320]
gi|418503319|ref|ZP_13069684.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035321]
gi|418509304|ref|ZP_13075600.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035327]
gi|418512924|ref|ZP_13079159.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Pomona str. ATCC 10729]
gi|418527981|ref|ZP_13093934.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008286]
gi|452119629|ref|YP_007469877.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Javiana str. CFSAN001992]
gi|322615194|gb|EFY12116.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 315996572]
gi|322617785|gb|EFY14681.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 495297-1]
gi|322624621|gb|EFY21452.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 495297-3]
gi|322626928|gb|EFY23724.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 495297-4]
gi|322634114|gb|EFY30850.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 515920-1]
gi|322635695|gb|EFY32405.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 515920-2]
gi|322640170|gb|EFY36834.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 531954]
gi|322646407|gb|EFY42919.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. NC_MB110209-0054]
gi|322661471|gb|EFY57695.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 19N]
gi|322665641|gb|EFY61825.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 81038-01]
gi|322671171|gb|EFY67299.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 414877]
gi|322675188|gb|EFY71265.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 366867]
gi|322680797|gb|EFY76832.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 413180]
gi|322686972|gb|EFY82949.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 446600]
gi|323192736|gb|EFZ77963.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 609458-1]
gi|323198832|gb|EFZ83931.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 556150-1]
gi|323205158|gb|EFZ90136.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 609460]
gi|323213417|gb|EFZ98214.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 556152]
gi|323215774|gb|EGA00517.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. MB101509-0077]
gi|323223860|gb|EGA08163.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. MB110209-0055]
gi|323231220|gb|EGA15335.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. MB111609-0052]
gi|323233654|gb|EGA17746.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 2009083312]
gi|323237724|gb|EGA21784.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 2009085258]
gi|323245707|gb|EGA29701.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 315731156]
gi|323248929|gb|EGA32853.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2009159199]
gi|323253000|gb|EGA36833.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008282]
gi|323258676|gb|EGA42338.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008283]
gi|323259508|gb|EGA43143.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Montevideo str. IA_2010008284]
gi|323268291|gb|EGA51766.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008285]
gi|323271920|gb|EGA55336.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008287]
gi|363552716|gb|EHL36999.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. SARB31]
gi|363567278|gb|EHL51278.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. ATCC BAA710]
gi|363573864|gb|EHL57738.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 42N]
gi|366055944|gb|EHN20277.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035318]
gi|366057212|gb|EHN21516.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Montevideo str. 80959-06]
gi|366058491|gb|EHN22779.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035278]
gi|366069158|gb|EHN33284.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035320]
gi|366073161|gb|EHN37236.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035321]
gi|366078494|gb|EHN42495.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035327]
gi|366082861|gb|EHN46791.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Pomona str. ATCC 10729]
gi|366827184|gb|EHN54093.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Montevideo str. 507440-20]
gi|372204105|gb|EHP17636.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008286]
gi|451908633|gb|AGF80439.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Javiana str. CFSAN001992]
Length = 765
Score = 263 bits (673), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 205/669 (30%), Positives = 327/669 (48%), Gaps = 103/669 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN +GRV + Y GL W+P V++ RDPRWGR
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V+ +QG S DR V S KHF AY +N VD
Sbjct: 172 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R EW G
Sbjct: 227 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 283
Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T+ PE+A A++AG+D+ + + ++ G ++ +
Sbjct: 284 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 341
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPK+ D H++ A E AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 396
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + T+AV+GP +D ++G++ AG+A T L GI G A
Sbjct: 397 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 454
Query: 450 RTIHQQGC-----KDVAC--------------ADDQLFGAAIDASRQADATILVMGLDQS 490
+ ++ +G KD+ + + A+ A++QAD + V+G Q
Sbjct: 455 KILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 514
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 515 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
+ G GG AIAD+LFG NP GKLP++ +P+ + +P+ T P + +Y
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 629
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F TV++ ++ P R G + A+
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 672
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
V+V N G ++GA + ++ A P KQL FEK+ + G ++ V
Sbjct: 673 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGERKTVSF 723
Query: 723 -INIHVCKY 730
I+I K+
Sbjct: 724 PIDIEALKF 732
>gi|421883572|ref|ZP_16314801.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
subsp. enterica serovar Senftenberg str. SS209]
gi|379986798|emb|CCF87074.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
subsp. enterica serovar Senftenberg str. SS209]
Length = 765
Score = 263 bits (673), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 205/669 (30%), Positives = 327/669 (48%), Gaps = 103/669 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN +GRV + Y GL W+P V++ RDPRWGR
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V+ +QG S DR V S KHF AY +N VD
Sbjct: 172 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R EW G
Sbjct: 227 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 283
Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T+ PE+A A++AG+D+ + + ++ G ++ +
Sbjct: 284 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 341
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPK+ D H++ A E AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFNA-----PYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 396
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + T+AV+GP +D ++G++ AG+A T L GI G A
Sbjct: 397 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 454
Query: 450 RTIHQQGC-----KDVAC--------------ADDQLFGAAIDASRQADATILVMGLDQS 490
+ ++ +G KD+ + + A+ A++QAD + V+G Q
Sbjct: 455 KILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 514
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 515 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
+ G GG AIAD+LFG NP GKLP++ +P+ + +P+ T P + +Y
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 629
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F TV++ ++ P R G + A+
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 672
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
V+V N G ++GA + ++ A P KQL FEK+ + G ++ V
Sbjct: 673 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGERKTVSF 723
Query: 723 -INIHVCKY 730
I+I K+
Sbjct: 724 PIDIEALKF 732
>gi|223936933|ref|ZP_03628842.1| Beta-glucosidase [bacterium Ellin514]
gi|223894502|gb|EEF60954.1| Beta-glucosidase [bacterium Ellin514]
Length = 774
Score = 263 bits (672), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 218/693 (31%), Positives = 342/693 (49%), Gaps = 85/693 (12%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQE 170
TSFPQ I ++FN L E + + + E R GG LT P V++ RD RWGR +E
Sbjct: 133 TSFPQPIGLGATFNPALVEKLYAMTAHETRV--RGGHQALT---PVVDVARDARWGRVEE 187
Query: 171 TPGEDPVLSGKYAASYVRGLQG--SDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVS 228
T GEDP L+ + + VRG QG S D+ V A+ KHF A+ +G + N VS
Sbjct: 188 TYGEDPFLNTQLGIAAVRGFQGDASFKDKKHVIATLKHFAAHGQPE-SGQNCAPVN--VS 244
Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
++ + +TF PFR C+ +G SVM SYN+++GVP+ A +L+ +R EW G++VSD
Sbjct: 245 ERLLRETFLHPFRDCLKKGGAISVMASYNEIDGVPSHASRWLLRDVLRKEWGFKGFVVSD 304
Query: 289 CDSVGVYYDT-----QHFTSTPEEAAADAIRAGLDLDC-GPFLGLHTESAVQRGLLSEID 342
++ H + +EA A++AG++++ P H V++ +L E +
Sbjct: 305 YYAIWELSHRPDSHGHHVAADKKEACVLAVKAGVNIEFPEPDCYRHLVELVRKKVLHETE 364
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGP 402
++ + L + ++G+FD +P P + V H+ELA EAAR+ I LLKN+
Sbjct: 365 LDELIAPMLLWKFKMGLFD-DPYVDP--EEAARVVGCEVHRELASEAARETITLLKNEND 421
Query: 403 SLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGCK 458
LPL+ + +TVAVIGPN++ +++G Y+G+ T L GI G + +H +GCK
Sbjct: 422 LLPLNPAKLKTVAVIGPNAN--RSLLGGYSGVPAHNVTVLDGIKARLGGAVKVVHAEGCK 479
Query: 459 ---------DVACADD-----QLFGAAIDASRQADATILVMGLDQSIEAEAL------DR 498
D A D + A+ + AD I+ +G ++ EA DR
Sbjct: 480 ITVGGSWQQDEVLASDPAEDRKQIDEAVKVAWSADVVIVAIGGNEQTSREAWSLKHMGDR 539
Query: 499 AGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA-FAKNDPRIAAIIWAGYPGQAGG 557
L L G Q EL+ + +A+ P + ++ +G P+ + A+N P AI+ Y GQ G
Sbjct: 540 TSLDLIGHQDELI-RALLATGKPVVALVFNGRPLAINHVAQNVP---AILECWYLGQECG 595
Query: 558 TAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYP 617
+A+A +LFG NPGGKLP++ P+ + LP+ +PS R + + + ++P
Sbjct: 596 SAVAAVLFGDHNPGGKLPIS-IPRS-VGQLPVF-YNHKPSAR-----RGFLWDEATPLFP 647
Query: 618 FGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVK 677
FG G+SYT F T N +A + R GS TH V VDV
Sbjct: 648 FGFGLSYTKF--TFKN--VRLAKKIISRTGS------------TH---------VSVDVT 682
Query: 678 NVGSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDR 736
N G + G + V+ P K+L F+K+ + G + V +++ + L+ D
Sbjct: 683 NAGKRAGTEVVQVYVRDLISSVTRPVKELKVFQKITLAPGETKTVSLDL-TPESLAFYDV 741
Query: 737 SGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIK 769
+ + GE I +G + V L L V+K
Sbjct: 742 NMKYVVEPGEFEIMVGNSSRDVDLQKVVLRVVK 774
>gi|378954476|ref|YP_005211963.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Gallinarum/pullorum str. RKS5078]
gi|357205087|gb|AET53133.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Gallinarum/pullorum str. RKS5078]
Length = 755
Score = 263 bits (672), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 203/669 (30%), Positives = 325/669 (48%), Gaps = 103/669 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN +GRV + Y GL W+P V++ RDPRWGR
Sbjct: 108 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 161
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V+ +QG S DR V S KHF AY +N VD
Sbjct: 162 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 216
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R EW G
Sbjct: 217 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 273
Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T+ PE+A A++AG+D+ + + ++ G ++ +
Sbjct: 274 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 331
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPK+ D H++ A E AR+ +V
Sbjct: 332 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 386
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGIGR------ 447
LLKN+ +LPL + T+AV+GP +D ++G++ AG+A T L GI
Sbjct: 387 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVDDGA 444
Query: 448 ---YAR--------------TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQS 490
YA+ +H++ K + + A+ A++QA+ + V+G Q
Sbjct: 445 KILYAKGANITNDKDIVDFLNLHEEAVKIDPRSPQAMIDEAVQAAKQANVVVAVVGESQG 504
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 505 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 561
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
+ G GG AIAD+LFG NP GKLP++ +P+ + +P+ T P + +Y
Sbjct: 562 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 619
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F TV++ ++ P R G + A+
Sbjct: 620 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 662
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
V+V N G ++GA + ++ A P KQL FEK+ + G ++ V
Sbjct: 663 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGERKTVSF 713
Query: 723 -INIHVCKY 730
I+I K+
Sbjct: 714 PIDIEALKF 722
>gi|205357644|ref|ZP_02572188.2| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar 4,[5],12:i:- str. CVM23701]
gi|374981198|ref|ZP_09722528.1| Periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Typhimurium str. TN061786]
gi|383496875|ref|YP_005397564.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Typhimurium str. 798]
gi|205330573|gb|EDZ17337.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar 4,[5],12:i:- str. CVM23701]
gi|321224818|gb|EFX49881.1| Periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Typhimurium str. TN061786]
gi|380463696|gb|AFD59099.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Typhimurium str. 798]
Length = 755
Score = 263 bits (672), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 205/669 (30%), Positives = 325/669 (48%), Gaps = 103/669 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN +GRV + Y GL W+P V++ RDPRWGR
Sbjct: 108 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 161
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V+ +QG S DR V S KHF AY +N VD
Sbjct: 162 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 216
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R EW G
Sbjct: 217 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 273
Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T+ PE+A A++AG+D+ + + ++ G ++ +
Sbjct: 274 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 331
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPK+ D H++ A E AR+ +V
Sbjct: 332 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 386
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + T+AV+GP +D ++G++ AG+A T L GI G A
Sbjct: 387 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 444
Query: 450 RTIHQQGCK---DVACAD----------------DQLFGAAIDASRQADATILVMGLDQS 490
+ ++ +G D D + A+ A++QAD + V+G Q
Sbjct: 445 KILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 504
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 505 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 561
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
+ G GG AIAD+LFG NP GKLP++ +P+ + +P+ T P + +Y
Sbjct: 562 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 619
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F TV++ ++ P R G + A+
Sbjct: 620 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 662
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
V+V N G ++GA + ++ A P KQL FEK+ + G ++ V
Sbjct: 663 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGERKTVSF 713
Query: 723 -INIHVCKY 730
I+I K+
Sbjct: 714 PIDIEALKF 722
>gi|238912609|ref|ZP_04656446.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Tennessee str. CDC07-0191]
Length = 765
Score = 263 bits (672), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 205/669 (30%), Positives = 326/669 (48%), Gaps = 103/669 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN +GRV + Y GL W+P V++ RDPRWGR
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V+ +QG S DR V S KHF AY +N VD
Sbjct: 172 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R EW G
Sbjct: 227 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 283
Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T+ PE+A A++AG+D+ + + ++ G ++ +
Sbjct: 284 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 341
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPK+ D H++ A E AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPADTNAESRLHRKEAREVARESVV 396
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + T+AV+GP +D ++G++ AG+A T L GI G A
Sbjct: 397 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 454
Query: 450 RTIHQQGC-----KDVAC--------------ADDQLFGAAIDASRQADATILVMGLDQS 490
+ ++ +G KD+ + + A+ A++QAD + V+G Q
Sbjct: 455 KILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 514
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 515 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
+ G GG AIAD+LFG NP GKLP++ +P+ + +P+ T P + +Y
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 629
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F TV++ ++ P R G + A+
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 672
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
V+V N G ++GA + ++ A P KQL FEK+ + G + V
Sbjct: 673 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGESKTVSF 723
Query: 723 -INIHVCKY 730
I+I K+
Sbjct: 724 PIDIEALKF 732
>gi|445146604|ref|ZP_21387818.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Dublin str. SL1438]
gi|445154484|ref|ZP_21391811.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Dublin str. HWS51]
gi|444845501|gb|ELX70711.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Dublin str. SL1438]
gi|444850729|gb|ELX75826.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Dublin str. HWS51]
Length = 765
Score = 263 bits (672), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 205/669 (30%), Positives = 327/669 (48%), Gaps = 103/669 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN +GRV + Y GL W+P V++ RDPRWGR
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V+ +QG S DR V S KHF AY +N VD
Sbjct: 172 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R EW G
Sbjct: 227 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 283
Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T+ PE+A A++AG+D+ + + ++ G ++ +
Sbjct: 284 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 341
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPK+ D H++ A E AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 396
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + T+AV+GP +D ++G++ AG+A T L GI G A
Sbjct: 397 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 454
Query: 450 RTIHQQGC-----KDVAC--------------ADDQLFGAAIDASRQADATILVMGLDQS 490
+ ++ +G KD+ + + A+ A++QAD + V+G Q
Sbjct: 455 KILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADMVVAVVGESQG 514
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 515 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
+ G GG AIAD+LFG NP GKLP++ +P+ + +P+ T P + +Y
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 629
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F TV++ ++ P R G + A+
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 672
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
V+V N G ++GA + ++ A P KQL FEK+ + G ++ V
Sbjct: 673 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGERKTVSF 723
Query: 723 -INIHVCKY 730
I+I K+
Sbjct: 724 PIDIEALKF 732
>gi|194446334|ref|YP_002041437.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Newport str. SL254]
gi|194469038|ref|ZP_03075022.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Kentucky str. CVM29188]
gi|195874002|ref|ZP_02700028.2| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. SL317]
gi|205358223|ref|ZP_02654575.2| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
subsp. enterica serovar Kentucky str. CDC 191]
gi|205359498|ref|ZP_02830128.2| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
subsp. enterica serovar Weltevreden str. HI_N05-537]
gi|417374418|ref|ZP_12144175.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Inverness str. R8-3668]
gi|418765082|ref|ZP_13321175.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 35199]
gi|418772136|ref|ZP_13328140.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 21539]
gi|418825678|ref|ZP_13380951.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 22462]
gi|194404997|gb|ACF65219.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. SL254]
gi|194455402|gb|EDX44241.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Kentucky str. CVM29188]
gi|195631424|gb|EDX49984.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. SL317]
gi|205335699|gb|EDZ22463.1| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
subsp. enterica serovar Kentucky str. CDC 191]
gi|205344851|gb|EDZ31615.1| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
subsp. enterica serovar Weltevreden str. HI_N05-537]
gi|353599534|gb|EHC55680.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Inverness str. R8-3668]
gi|392732250|gb|EIZ89461.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 21539]
gi|392742031|gb|EIZ99126.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 35199]
gi|392812566|gb|EJA68550.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 22462]
Length = 755
Score = 263 bits (672), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 205/669 (30%), Positives = 326/669 (48%), Gaps = 103/669 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN +GRV + Y GL W+P V++ RDPRWGR
Sbjct: 108 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 161
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V+ +QG S DR V S KHF AY +N VD
Sbjct: 162 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 216
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R EW G
Sbjct: 217 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 273
Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T+ PE+A A++AG+D+ + + ++ G ++ +
Sbjct: 274 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 331
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPK+ D H++ A E AR+ +V
Sbjct: 332 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 386
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + T+AV+GP +D ++G++ AG+A T L GI G A
Sbjct: 387 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 444
Query: 450 RTIHQQGC-----KDVAC--------------ADDQLFGAAIDASRQADATILVMGLDQS 490
+ ++ +G KD+ + + A+ A++QAD + V+G Q
Sbjct: 445 KILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 504
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 505 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 561
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
+ G GG AIAD+LFG NP GKLP++ +P+ + +P+ T P + +Y
Sbjct: 562 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 619
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F TV++ ++ P R G + A+
Sbjct: 620 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 662
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
V+V N G ++GA + ++ A P KQL FEK+ + G + V
Sbjct: 663 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGESKTVSF 713
Query: 723 -INIHVCKY 730
I+I K+
Sbjct: 714 PIDIEALKF 722
>gi|197248682|ref|YP_002147137.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar Agona
str. SL483]
gi|440763077|ref|ZP_20942125.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Agona str. SH11G1113]
gi|440769230|ref|ZP_20948190.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Agona str. SH08SF124]
gi|440771374|ref|ZP_20950293.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Agona str. SH10GFN094]
gi|197212385|gb|ACH49782.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Agona str. SL483]
gi|436415218|gb|ELP13139.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Agona str. SH08SF124]
gi|436421310|gb|ELP19157.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Agona str. SH11G1113]
gi|436422409|gb|ELP20247.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Agona str. SH10GFN094]
Length = 765
Score = 263 bits (672), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 205/669 (30%), Positives = 326/669 (48%), Gaps = 103/669 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN +GRV + Y GL W+P V++ RDPRWGR
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V+ +QG S DR V S KHF AY +N VD
Sbjct: 172 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R EW G
Sbjct: 227 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 283
Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T+ PE+A A++AG+D+ + + ++ G ++ +
Sbjct: 284 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 341
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPK+ D H++ A E AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPADTNAESRLHRKEAREVARESVV 396
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + T+AV+GP +D ++G++ AG+A T L GI G A
Sbjct: 397 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 454
Query: 450 RTIHQQGC-----KDVAC--------------ADDQLFGAAIDASRQADATILVMGLDQS 490
+ ++ +G KD+ + + A+ A++QAD + V+G Q
Sbjct: 455 KILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 514
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 515 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
+ G GG AIAD+LFG NP GKLP++ +P+ + +P+ T P + +Y
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 629
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F TV++ ++ P R G + A+
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 672
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
V+V N G ++GA + ++ A P KQL FEK+ + G + V
Sbjct: 673 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGESKTVSF 723
Query: 723 -INIHVCKY 730
I+I K+
Sbjct: 724 PIDIEALKF 732
>gi|270296098|ref|ZP_06202298.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D20]
gi|270273502|gb|EFA19364.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D20]
Length = 798
Score = 263 bits (672), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 200/679 (29%), Positives = 323/679 (47%), Gaps = 84/679 (12%)
Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLT-YWSPNVNIFRDPRWGRG 168
AT FP +++N L IG + EA + G T +SP ++I +DPRWGR
Sbjct: 168 ATYFPAQCGQGATWNKELIAQIGEAEAREASVL------GYTNIYSPILDIAQDPRWGRC 221
Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVS 228
ET GEDP +G+ + LQ + K+ ++ KHF Y + + + V+
Sbjct: 222 VETYGEDPYHAGQMGKQMILSLQKN-----KLVSTPKHFAVYSIPVGGRDGKTRTDPHVA 276
Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
+++ + PFR+ E VM SYN +G P + L +R EW GY+VSD
Sbjct: 277 PREMRTLYLDPFRVAFHEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKGYVVSD 336
Query: 289 CDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC-----GPFLGLHTESAVQRGLLSEIDI 343
++V + E+A A A+ AGL++ F+ L SAV++G +S+ +
Sbjct: 337 SEAVEFISTKHQVANGYEDAVAQAVNAGLNIRTHFTPPADFI-LPLRSAVKKGKISQETL 395
Query: 344 NNALVNTLTVQMRLGMFDGEPSSQPY---GHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
N + L V+ LG+FD PY + V +P+HQ+LALEAARQ +VLLKN+
Sbjct: 396 NQRVAEILRVKFWLGLFDN-----PYRGDEKRAGQIVHSPEHQQLALEAARQSLVLLKNE 450
Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY---ARTIHQQGC 457
+LPLS R+VAVIGPN+D +I Y TT +GI + A ++++GC
Sbjct: 451 HQTLPLSK-SIRSVAVIGPNADERQQLICRYGPANAHITTIYEGIKKMLPQADVVYKKGC 509
Query: 458 KDV--------------ACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLL 503
+ A + Q+ AI+A++ A+ T++V+G ++ E R L L
Sbjct: 510 DIIDPHFPESEVLEFPKAAQEAQMMEEAIEAAKGAEVTVMVLGGNELTVREDRSRTSLDL 569
Query: 504 PGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADI 563
PGRQ+EL+ K+ K P +LV++ G + FA + AII A +PG+ GG AIA+
Sbjct: 570 PGRQKELLKKICQLGK-PVVLVMIDGRASSINFAAT--HVPAIIHAWFPGEFGGQAIAEA 626
Query: 564 LFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMS 623
LFG NPGG+L +T +P+ + +P +P + Y +YPFGHG+S
Sbjct: 627 LFGDYNPGGRLAVT-FPKS-VGQIPFA-FPFKPGSDESSETSVY-----GALYPFGHGLS 678
Query: 624 YTNFVHT-VANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSK 682
YT F ++ +A +P+ G G+I+ + +KN+G +
Sbjct: 679 YTTFQYSDLAISPSK-----QGVQGNIS---------------------ISCTIKNIGQR 712
Query: 683 DGAHTLLVFSTPPAGHWAPHKQLV-AFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRR 741
+G + ++ + Q++ FE++ + A V + + L + D+
Sbjct: 713 EGDEVVQLYLRDEVSSVTTYTQVLRGFERITLKPEASHTVHFEL-TPQELGIWDKQMNFT 771
Query: 742 IPLGEHNIHIGGTKHSVSL 760
+ G + IG + + L
Sbjct: 772 VEPGMFKVMIGSSSKDIRL 790
>gi|429092960|ref|ZP_19155568.1| Periplasmic beta-glucosidase [Cronobacter dublinensis 1210]
gi|426742256|emb|CCJ81681.1| Periplasmic beta-glucosidase [Cronobacter dublinensis 1210]
Length = 759
Score = 263 bits (672), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 201/664 (30%), Positives = 323/664 (48%), Gaps = 101/664 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN + +GR+ + Y GL W+P V++ RDPRWGRG
Sbjct: 112 TIFPNSLGLASSFNLDAVKTVGRI------SAYEAADDGLNMTWAPMVDVTRDPRWGRGS 165
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V +QG S DR V S KHF AY +N VD
Sbjct: 166 EGFGEDTYLTSMMGKTMVESMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 220
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G VM + N +NG P +D +LK +R +W G
Sbjct: 221 --MSSQRLFNDYMPPYKEALDAGS-GGVMIALNSLNGTPAASDGWLLKDLLRDDWGFKGI 277
Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
+SD ++ +H T++ PE+A AI++G+D+ + + ++ G ++ +
Sbjct: 278 TISDHGAIKEL--IKHGTASGPEDAVRVAIKSGVDMSMADEYYSKYLPDLIKSGKVTMAE 335
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPK+ D H++ A AR+ +V
Sbjct: 336 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPKDTNAESRLHRDDARRVARESLV 390
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + T+AV+GP +D M+G++ AG+A T LQG+ G A
Sbjct: 391 LLKNRLETLPLK--KSGTIAVVGPLADSKRDMMGSWSAAGVADQAVTLLQGMKNVAGDKA 448
Query: 450 RTIHQQGCK---DVACAD----------------DQLFGAAIDASRQADATILVMGLDQS 490
+ ++ +G D D ++ A++ ++Q+D + V+G Q
Sbjct: 449 KILYAKGANVTDDKGIVDFLNLYEPAVVVDKRTPKEMIDEAVNVAKQSDVVVAVVGEAQG 508
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R L +P Q++L+S + K P +LVLM+G P+ A K D + A++
Sbjct: 509 MAHEASSRTDLTIPQSQRDLISALKATGK-PLVLVLMNGRPL--ALVKEDQQADAMLETW 565
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
+ G GG AIAD+LFG NP GKLP++ +P+ + +P+ T P + +Y
Sbjct: 566 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 623
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F +V++ ++ P R G + A+++
Sbjct: 624 RYFDEANGP-LYPFGYGLSYTTF--SVSDVK--LSAPTLKRDGKVTASVT---------- 668
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGI 723
V N G ++GA + ++ A P KQL FEKV + G Q V
Sbjct: 669 -----------VTNTGKREGATAVQMYLQDVTASMSRPVKQLRGFEKVTLKPGESQTVSF 717
Query: 724 NIHV 727
I V
Sbjct: 718 PIDV 721
>gi|62180752|ref|YP_217169.1| beta-D-glucoside glucohydrolase, periplasmic [Salmonella enterica
subsp. enterica serovar Choleraesuis str. SC-B67]
gi|375115081|ref|ZP_09760251.1| beta-D-glucoside glucohydrolase, periplasmic [Salmonella enterica
subsp. enterica serovar Choleraesuis str. SCSA50]
gi|62128385|gb|AAX66088.1| beta-D-glucoside glucohydrolase, periplasmic [Salmonella enterica
subsp. enterica serovar Choleraesuis str. SC-B67]
gi|322715227|gb|EFZ06798.1| beta-D-glucoside glucohydrolase, periplasmic [Salmonella enterica
subsp. enterica serovar Choleraesuis str. SCSA50]
Length = 765
Score = 263 bits (671), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 205/669 (30%), Positives = 325/669 (48%), Gaps = 103/669 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN +GRV + Y GL W+P V++ RDPRWGR
Sbjct: 118 TVFPISLGLASSFNQDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V+ +QG S DR V S KHF AY +N VD
Sbjct: 172 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R EW G
Sbjct: 227 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 283
Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T+ PE+A A++AG+D+ + + ++ G ++ +
Sbjct: 284 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 341
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPK+ D H++ A E AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 396
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + +AV+GP +D ++G++ AG+A T L GI G A
Sbjct: 397 LLKNRLETLPLK--KSGAIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 454
Query: 450 RTIHQQGC-----KDVAC--------------ADDQLFGAAIDASRQADATILVMGLDQS 490
+ +H +G KD+ + + A+ A++QAD + V+G Q
Sbjct: 455 KILHAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 514
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 515 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
+ G GG AIAD+LFG NP GKLP++ +P+ + +P+ T P + +Y
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 629
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F TV++ ++ P R G + A+
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 672
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
V+V N G ++GA + ++ A P KQL FEK+ + G + V
Sbjct: 673 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGESKTVSF 723
Query: 723 -INIHVCKY 730
I+I K+
Sbjct: 724 PIDIEALKF 732
>gi|437730260|ref|ZP_20831186.1| beta-D-glucoside glucohydrolase, partial [Salmonella enterica
subsp. enterica serovar Enteritidis str. 648901 16-16]
gi|435289521|gb|ELO66481.1| beta-D-glucoside glucohydrolase, partial [Salmonella enterica
subsp. enterica serovar Enteritidis str. 648901 16-16]
Length = 764
Score = 263 bits (671), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 203/669 (30%), Positives = 325/669 (48%), Gaps = 103/669 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN +GRV + Y GL W+P V++ RDPRWGR
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V+ +QG S DR V S KHF AY +N VD
Sbjct: 172 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R EW G
Sbjct: 227 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 283
Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T+ PE+A A++AG+D+ + + ++ G ++ +
Sbjct: 284 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 341
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPK+ D H++ A E AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 396
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGIGR------ 447
LLKN+ +LPL + T+AV+GP +D ++G++ AG+A T L GI
Sbjct: 397 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVDDGA 454
Query: 448 ---YAR--------------TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQS 490
YA+ ++++ K + + A+ A++QAD + V+G Q
Sbjct: 455 KILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 514
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 515 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
+ G GG AIAD+LFG NP GKLP++ +P+ + +P+ T P + +Y
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 629
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F TV++ ++ P R G + A+
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 672
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
V+V N G ++GA + ++ A P KQL FEK+ + G ++ V
Sbjct: 673 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGERKTVSF 723
Query: 723 -INIHVCKY 730
I+I K+
Sbjct: 724 PIDIEALKF 732
>gi|423295566|ref|ZP_17273693.1| hypothetical protein HMPREF1070_02358 [Bacteroides ovatus
CL03T12C18]
gi|392672275|gb|EIY65744.1| hypothetical protein HMPREF1070_02358 [Bacteroides ovatus
CL03T12C18]
Length = 782
Score = 263 bits (671), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 209/699 (29%), Positives = 325/699 (46%), Gaps = 108/699 (15%)
Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
GAT FP I A++++ L + +G+V++ E R+ G + P +++ RDPRW R
Sbjct: 148 GATVFPTGIGMAATWSPELVKEVGQVIAKEIRSQ-----GGHISYGPVLDLTRDPRWSRV 202
Query: 169 QETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
+ET GEDPVLSG AS V GL G + + A+ KHF AY + + A V
Sbjct: 203 EETFGEDPVLSGTLGASMVDGLGGGNLSQKYATIATLKHFLAYAVPEGGQNGNY---ASV 259
Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
+D+ F PFR + G + SVM SYN ++G+P ++ +L + +R EW+ G++VS
Sbjct: 260 GIRDLHQNFLPPFRKAIDAGAL-SVMTSYNSIDGIPCTSNHYLLTQLLRNEWKFRGFVVS 318
Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
D S+ +++ T E AA ++ AG+D+D G + AVQ G + + I+ A+
Sbjct: 319 DLYSIEGIHESHFVAPTKENAAIQSVMAGVDVDLGGDAYTNLCHAVQSGQMDKAVIDTAV 378
Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
L ++ +G+F+ P P + K V +H ELA + A+ I LLKN+ LPLS
Sbjct: 379 CRVLRMKFEMGLFE-HPYVDP--KIAAKTVRRKEHIELARKIAQSSITLLKNENSILPLS 435
Query: 408 HIRHRTVAVIGPNSDVTVTMIGNYAGIA--CGYTTPLQGIGRY---ARTIHQQGC--KDV 460
+ ++ VAVIGPN+D M+G+Y T L GI +R + +GC +D
Sbjct: 436 KMINK-VAVIGPNADNRYNMLGDYTAPQEDSNVKTVLDGIITKLSPSRVEYVRGCAIRDT 494
Query: 461 ACADDQLFGAAIDASRQAD----------------------ATILVMGLDQSIEA-EALD 497
+ + AI+A+R+++ A + G +E E D
Sbjct: 495 TVNEIE---QAIEAARRSEVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFD 551
Query: 498 RAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGG 557
RA L L GRQQEL+ + K P I+V + G P++ +A A++ A YPGQ GG
Sbjct: 552 RASLSLLGRQQELLESLQKTGK-PLIVVYIEGRPLEKNWASE--YADALLTAYYPGQEGG 608
Query: 558 TAIADILFGTSNPGGKLPMT----------WYPQEYITNLPMTEMAMRPSQSKRYPGRTY 607
AIAD+LFG NP G+LP++ +Y Q+ N E++ P
Sbjct: 609 NAIADVLFGDYNPSGRLPISVPRSVGQIPVYYNQKAPRNHDYVEVSSSP----------- 657
Query: 608 RFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNR 667
+Y FG+GMSYT F + S + A+C
Sbjct: 658 -------LYSFGYGMSYTTFEY------------------------SDLQVVQKSARC-- 684
Query: 668 LTLGVQVDVKNVGSKDGAH-TLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIH 726
V VKN G DG + L A P KQL FE+ H+ G +++V +
Sbjct: 685 --FEVSFKVKNTGKYDGEEVSQLYMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTFVLT 742
Query: 727 VCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATL 765
+ +V+ + + + G ++ IG + + L L
Sbjct: 743 EEDFF-LVNYTLKKVVESGNFHLMIGAASNDIRLQNVIL 780
>gi|409250776|ref|YP_006886584.1| beta-D-glucoside glucohydrolase, periplasmic [Salmonella enterica
subsp. enterica serovar Weltevreden str. 2007-60-3289-1]
gi|418760521|ref|ZP_13316675.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 35185]
gi|418775352|ref|ZP_13331310.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 33953]
gi|418781475|ref|ZP_13337358.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 35188]
gi|418783741|ref|ZP_13339586.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 21559]
gi|418789285|ref|ZP_13345072.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 19447]
gi|418794086|ref|ZP_13349808.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 19449]
gi|418796683|ref|ZP_13352374.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 19567]
gi|418803316|ref|ZP_13358937.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 35202]
gi|418809719|ref|ZP_13365271.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 21550]
gi|418813873|ref|ZP_13369394.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 22513]
gi|418815139|ref|ZP_13370647.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Newport str. CVM 21538]
gi|418822928|ref|ZP_13378339.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Newport str. CVM 22425]
gi|418830026|ref|ZP_13384989.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM N18486]
gi|418836408|ref|ZP_13391292.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM N1543]
gi|418840973|ref|ZP_13395796.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 21554]
gi|418844732|ref|ZP_13399518.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 19443]
gi|418850848|ref|ZP_13405564.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 37978]
gi|418857905|ref|ZP_13412528.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 19470]
gi|418862200|ref|ZP_13416744.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 19536]
gi|419789701|ref|ZP_14315381.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Newport str. Levine 1]
gi|419792237|ref|ZP_14317879.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Newport str. Levine 15]
gi|320086604|emb|CBY96375.1| beta-D-glucoside glucohydrolase, periplasmic [Salmonella enterica
subsp. enterica serovar Weltevreden str. 2007-60-3289-1]
gi|392615486|gb|EIW97925.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Newport str. Levine 1]
gi|392618961|gb|EIX01347.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Newport str. Levine 15]
gi|392742601|gb|EIZ99688.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 35185]
gi|392747243|gb|EJA04244.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 35188]
gi|392748425|gb|EJA05411.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 33953]
gi|392756753|gb|EJA13648.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 21559]
gi|392760526|gb|EJA17361.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 19447]
gi|392762081|gb|EJA18897.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 19449]
gi|392770056|gb|EJA26784.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 19567]
gi|392772845|gb|EJA29542.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 35202]
gi|392773804|gb|EJA30500.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 21550]
gi|392775099|gb|EJA31794.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 22513]
gi|392787185|gb|EJA43733.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Newport str. CVM 22425]
gi|392793518|gb|EJA49962.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Newport str. CVM 21538]
gi|392800769|gb|EJA56999.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM N1543]
gi|392802259|gb|EJA58473.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM N18486]
gi|392809639|gb|EJA65673.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 21554]
gi|392813541|gb|EJA69505.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 19443]
gi|392818698|gb|EJA74582.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 37978]
gi|392834583|gb|EJA90187.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 19470]
gi|392836652|gb|EJA92232.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 19536]
Length = 765
Score = 263 bits (671), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 205/669 (30%), Positives = 326/669 (48%), Gaps = 103/669 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN +GRV + Y GL W+P V++ RDPRWGR
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V+ +QG S DR V S KHF AY +N VD
Sbjct: 172 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R EW G
Sbjct: 227 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 283
Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T+ PE+A A++AG+D+ + + ++ G ++ +
Sbjct: 284 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 341
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPK+ D H++ A E AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 396
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + T+AV+GP +D ++G++ AG+A T L GI G A
Sbjct: 397 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 454
Query: 450 RTIHQQGC-----KDVAC--------------ADDQLFGAAIDASRQADATILVMGLDQS 490
+ ++ +G KD+ + + A+ A++QAD + V+G Q
Sbjct: 455 KILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 514
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 515 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
+ G GG AIAD+LFG NP GKLP++ +P+ + +P+ T P + +Y
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 629
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F TV++ ++ P R G + A+
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 672
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
V+V N G ++GA + ++ A P KQL FEK+ + G + V
Sbjct: 673 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGESKTVSF 723
Query: 723 -INIHVCKY 730
I+I K+
Sbjct: 724 PIDIEALKF 732
>gi|197263061|ref|ZP_03163135.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Saintpaul str. SARA23]
gi|207857601|ref|YP_002244252.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Enteritidis str. P125109]
gi|421359420|ref|ZP_15809713.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 622731-39]
gi|421364010|ref|ZP_15814248.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 639016-6]
gi|421367050|ref|ZP_15817252.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 640631]
gi|421373030|ref|ZP_15823175.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 77-0424]
gi|421375395|ref|ZP_15825508.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607307-6]
gi|421382177|ref|ZP_15832228.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 485549-17]
gi|421386891|ref|ZP_15836897.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 596866-22]
gi|421391230|ref|ZP_15841201.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 596866-70]
gi|421395600|ref|ZP_15845536.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 629164-26]
gi|421397961|ref|ZP_15847870.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 629164-37]
gi|421402484|ref|ZP_15852342.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 639672-46]
gi|421409068|ref|ZP_15858863.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 639672-50]
gi|421411752|ref|ZP_15861516.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 77-1427]
gi|421418166|ref|ZP_15867872.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 77-2659]
gi|421422724|ref|ZP_15872392.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 78-1757]
gi|421424918|ref|ZP_15874555.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 22510-1]
gi|421433210|ref|ZP_15882778.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 8b-1]
gi|421434152|ref|ZP_15883702.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648905 5-18]
gi|421441895|ref|ZP_15891355.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 6-18]
gi|421443218|ref|ZP_15892660.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 50-3079]
gi|436600421|ref|ZP_20512957.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 22704]
gi|436686654|ref|ZP_20517957.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE30663]
gi|436802328|ref|ZP_20525319.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CHS44]
gi|436809476|ref|ZP_20528856.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1882]
gi|436814772|ref|ZP_20532323.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1884]
gi|436844194|ref|ZP_20537952.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1594]
gi|436854476|ref|ZP_20544110.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1566]
gi|436855800|ref|ZP_20544925.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1580]
gi|436864301|ref|ZP_20550268.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1543]
gi|436870232|ref|ZP_20554038.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1441]
gi|436877723|ref|ZP_20558651.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1810]
gi|436886662|ref|ZP_20563082.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1558]
gi|436893559|ref|ZP_20567466.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1018]
gi|436901307|ref|ZP_20572217.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1010]
gi|436912656|ref|ZP_20578485.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1729]
gi|436920338|ref|ZP_20582934.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0895]
gi|436926675|ref|ZP_20586501.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0899]
gi|436936605|ref|ZP_20592045.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1457]
gi|436940618|ref|ZP_20594562.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1747]
gi|436950717|ref|ZP_20599772.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0968]
gi|436961959|ref|ZP_20605333.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1444]
gi|436969147|ref|ZP_20608268.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1445]
gi|436976823|ref|ZP_20612073.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1559]
gi|436989589|ref|ZP_20616596.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1565]
gi|437001001|ref|ZP_20620797.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1808]
gi|437022365|ref|ZP_20628334.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1811]
gi|437036112|ref|ZP_20633844.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0956]
gi|437044148|ref|ZP_20637101.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1455]
gi|437052068|ref|ZP_20641628.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1575]
gi|437059125|ref|ZP_20645972.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1725]
gi|437064832|ref|ZP_20648606.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1745]
gi|437077112|ref|ZP_20655320.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1791]
gi|437083638|ref|ZP_20659292.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1795]
gi|437092016|ref|ZP_20663616.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 576709]
gi|437113855|ref|ZP_20669057.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 635290-58]
gi|437122048|ref|ZP_20672085.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607308-16]
gi|437128670|ref|ZP_20675357.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607308-19]
gi|437139171|ref|ZP_20681653.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607307-2]
gi|437144157|ref|ZP_20684771.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607308-9]
gi|437151251|ref|ZP_20689128.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 629163]
gi|437162727|ref|ZP_20696289.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE15-1]
gi|437169975|ref|ZP_20700070.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_N202]
gi|437179039|ref|ZP_20705157.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_56-3991]
gi|437181201|ref|ZP_20706372.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_76-3618]
gi|437239027|ref|ZP_20714239.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 13183-1]
gi|437261953|ref|ZP_20718699.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_81-2490]
gi|437267405|ref|ZP_20721157.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. SL909]
gi|437280676|ref|ZP_20728053.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. SL913]
gi|437289916|ref|ZP_20731294.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_69-4941]
gi|437311898|ref|ZP_20736006.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 638970-15]
gi|437330435|ref|ZP_20741599.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 17927]
gi|437346849|ref|ZP_20747003.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CHS4]
gi|437411031|ref|ZP_20752807.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 543463 22-17]
gi|437442646|ref|ZP_20757878.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 543463 40-18]
gi|437461006|ref|ZP_20761959.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 561362 1-1]
gi|437479748|ref|ZP_20768095.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 642044 4-1]
gi|437494385|ref|ZP_20772414.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 642046 4-7]
gi|437505366|ref|ZP_20775420.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648898 4-5]
gi|437535206|ref|ZP_20781440.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648899 3-17]
gi|437559607|ref|ZP_20785823.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648900 1-16]
gi|437572273|ref|ZP_20789035.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 1-17]
gi|437583575|ref|ZP_20792569.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 39-2]
gi|437599978|ref|ZP_20797137.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648902 6-8]
gi|437616848|ref|ZP_20802600.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648903 1-6]
gi|437631373|ref|ZP_20806367.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648904 3-6]
gi|437662149|ref|ZP_20813366.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 653049 13-19]
gi|437676329|ref|ZP_20816941.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 642044 8-1]
gi|437695929|ref|ZP_20822252.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 561362 9-7]
gi|437711846|ref|ZP_20826864.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 543463 42-20]
gi|437813102|ref|ZP_20841687.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 33944]
gi|438082703|ref|ZP_20857889.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 81-2625]
gi|438099351|ref|ZP_20863367.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 62-1976]
gi|438109484|ref|ZP_20867447.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 53-407]
gi|445165173|ref|ZP_21394056.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE8a]
gi|445182314|ref|ZP_21398520.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 20037]
gi|445229380|ref|ZP_21405024.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE10]
gi|445333762|ref|ZP_21414961.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 18569]
gi|445344304|ref|ZP_21417576.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 13-1]
gi|445361577|ref|ZP_21423869.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. PT23]
gi|197241316|gb|EDY23936.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Saintpaul str. SARA23]
gi|206709404|emb|CAR33745.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Enteritidis str. P125109]
gi|395985043|gb|EJH94216.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 622731-39]
gi|395985485|gb|EJH94655.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 639016-6]
gi|395989705|gb|EJH98839.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 640631]
gi|395998658|gb|EJI07685.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 77-0424]
gi|395999277|gb|EJI08299.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 485549-17]
gi|396005381|gb|EJI14360.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607307-6]
gi|396011532|gb|EJI20442.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 596866-22]
gi|396012239|gb|EJI21137.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 629164-26]
gi|396012640|gb|EJI21536.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 596866-70]
gi|396025999|gb|EJI34772.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 639672-50]
gi|396032021|gb|EJI40746.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 639672-46]
gi|396032157|gb|EJI40881.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 629164-37]
gi|396039333|gb|EJI47961.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 77-2659]
gi|396042039|gb|EJI50662.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 78-1757]
gi|396045253|gb|EJI53847.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 77-1427]
gi|396049426|gb|EJI57969.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 8b-1]
gi|396058514|gb|EJI66975.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 22510-1]
gi|396060758|gb|EJI69199.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648905 5-18]
gi|396062572|gb|EJI70983.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 6-18]
gi|396071002|gb|EJI79329.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 50-3079]
gi|434957421|gb|ELL51060.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CHS44]
gi|434967293|gb|ELL60128.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1882]
gi|434972888|gb|ELL65276.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1884]
gi|434975403|gb|ELL67694.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 22704]
gi|434978780|gb|ELL70772.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1594]
gi|434983279|gb|ELL75087.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1566]
gi|434992014|gb|ELL83484.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1580]
gi|434995336|gb|ELL86652.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1543]
gi|435002560|gb|ELL93625.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1441]
gi|435003859|gb|ELL94860.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE30663]
gi|435008607|gb|ELL99430.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1810]
gi|435011905|gb|ELM02608.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1558]
gi|435018555|gb|ELM09017.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1018]
gi|435020741|gb|ELM11130.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1010]
gi|435026901|gb|ELM17032.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1729]
gi|435027857|gb|ELM17949.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0895]
gi|435036516|gb|ELM26335.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0899]
gi|435039443|gb|ELM29224.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1457]
gi|435045523|gb|ELM35151.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1747]
gi|435051098|gb|ELM40602.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1444]
gi|435051184|gb|ELM40686.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0968]
gi|435059248|gb|ELM48538.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1445]
gi|435071156|gb|ELM60106.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1565]
gi|435071299|gb|ELM60247.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1559]
gi|435074446|gb|ELM63278.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0956]
gi|435075555|gb|ELM64369.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1808]
gi|435076993|gb|ELM65767.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1811]
gi|435081352|gb|ELM69994.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1455]
gi|435091470|gb|ELM79861.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1575]
gi|435094938|gb|ELM83277.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1725]
gi|435100574|gb|ELM88742.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1745]
gi|435103950|gb|ELM92024.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1791]
gi|435107381|gb|ELM95366.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1795]
gi|435112922|gb|ELN00787.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 576709]
gi|435116153|gb|ELN03904.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 635290-58]
gi|435124194|gb|ELN11661.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607308-16]
gi|435128186|gb|ELN15537.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607308-19]
gi|435132693|gb|ELN19891.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607307-2]
gi|435139148|gb|ELN26152.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607308-9]
gi|435142647|gb|ELN29534.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE15-1]
gi|435145134|gb|ELN31963.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 629163]
gi|435147755|gb|ELN34507.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_N202]
gi|435152456|gb|ELN39086.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_56-3991]
gi|435162324|gb|ELN48508.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_81-2490]
gi|435166749|gb|ELN52715.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_76-3618]
gi|435170164|gb|ELN55920.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. SL913]
gi|435175921|gb|ELN61323.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. SL909]
gi|435182180|gb|ELN67212.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_69-4941]
gi|435182531|gb|ELN67539.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 13183-1]
gi|435183030|gb|ELN68005.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 638970-15]
gi|435189277|gb|ELN73922.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. CHS4]
gi|435189619|gb|ELN74243.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 17927]
gi|435204236|gb|ELN87933.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 543463 22-17]
gi|435211340|gb|ELN94479.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 543463 40-18]
gi|435219425|gb|ELO01787.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 642044 4-1]
gi|435220698|gb|ELO02980.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 561362 1-1]
gi|435227283|gb|ELO08792.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 642046 4-7]
gi|435235423|gb|ELO16226.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648898 4-5]
gi|435238918|gb|ELO19527.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648900 1-16]
gi|435240567|gb|ELO20958.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648899 3-17]
gi|435247913|gb|ELO27842.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 1-17]
gi|435261576|gb|ELO40730.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648903 1-6]
gi|435262896|gb|ELO41978.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648902 6-8]
gi|435263473|gb|ELO42520.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 39-2]
gi|435267795|gb|ELO46460.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 653049 13-19]
gi|435276722|gb|ELO54719.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 642044 8-1]
gi|435277229|gb|ELO55183.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 561362 9-7]
gi|435283446|gb|ELO61011.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648904 3-6]
gi|435288066|gb|ELO65157.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 543463 42-20]
gi|435297184|gb|ELO73479.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 33944]
gi|435320361|gb|ELO93000.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 81-2625]
gi|435325905|gb|ELO97749.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 62-1976]
gi|435332633|gb|ELP03544.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 53-407]
gi|444865473|gb|ELX90243.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE10]
gi|444867011|gb|ELX91716.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE8a]
gi|444871369|gb|ELX95805.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 20037]
gi|444875449|gb|ELX99648.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 18569]
gi|444880476|gb|ELY04551.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 13-1]
gi|444884583|gb|ELY08407.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. PT23]
Length = 765
Score = 263 bits (671), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 203/669 (30%), Positives = 325/669 (48%), Gaps = 103/669 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN +GRV + Y GL W+P V++ RDPRWGR
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V+ +QG S DR V S KHF AY +N VD
Sbjct: 172 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R EW G
Sbjct: 227 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 283
Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T+ PE+A A++AG+D+ + + ++ G ++ +
Sbjct: 284 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 341
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPK+ D H++ A E AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 396
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGIGR------ 447
LLKN+ +LPL + T+AV+GP +D ++G++ AG+A T L GI
Sbjct: 397 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVDDGA 454
Query: 448 ---YAR--------------TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQS 490
YA+ ++++ K + + A+ A++QAD + V+G Q
Sbjct: 455 KILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 514
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 515 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
+ G GG AIAD+LFG NP GKLP++ +P+ + +P+ T P + +Y
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 629
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F TV++ ++ P R G + A+
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 672
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
V+V N G ++GA + ++ A P KQL FEK+ + G ++ V
Sbjct: 673 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGERKTVSF 723
Query: 723 -INIHVCKY 730
I+I K+
Sbjct: 724 PIDIEALKF 732
>gi|427384989|ref|ZP_18881494.1| hypothetical protein HMPREF9447_02527 [Bacteroides oleiciplenus YIT
12058]
gi|425728250|gb|EKU91109.1| hypothetical protein HMPREF9447_02527 [Bacteroides oleiciplenus YIT
12058]
Length = 862
Score = 263 bits (671), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 156/451 (34%), Positives = 239/451 (52%), Gaps = 33/451 (7%)
Query: 38 RTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNV 97
+TLP+ L +R DL+ RL+L+EK L+ + A+PRLGIK + WWSEALHGV+N
Sbjct: 19 QTLPYQNPELSPAERAKDLVSRLTLEEKALLMCDDSEAIPRLGIKKFNWWSEALHGVANQ 78
Query: 98 GPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG-------- 149
G T FP+ + A+SFN L I SDE RA +N
Sbjct: 79 G----------NVTVFPEPVGMAASFNEKLVFEIFNATSDEMRAKHNERVRNGLEDTRFH 128
Query: 150 -LTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASC-KHF 207
L+ W+PNVNIFRDPRWGRGQET GEDP L+ + + V+GLQG + ++ + +C KH+
Sbjct: 129 SLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGIAVVKGLQGPEDEKYRKLLACAKHY 188
Query: 208 TAYDLDNWNGVDRFHFNA-KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCA 266
+ W+ R N +S +D+ +T+ F+ V + V VMC+Y +++ P C
Sbjct: 189 AVHSGPEWS---RHSANLNNISPRDLWETYLPAFKALVQKADVREVMCAYQRLDDDPCCG 245
Query: 267 DPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFL 325
+L++ +R EW +VSDC ++ ++ + +S AA AG D++CG +
Sbjct: 246 STRLLQQILRDEWGFKYLVVSDCGAIADFWTSHKSSSDAVHAAVKGTMAGTDVECGYGYA 305
Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
AV RGL++E +I+ ++ L + LG D +PS + + V + H++L
Sbjct: 306 YQKLPEAVSRGLITEEEIDKHVLRLLEGRFELGEMD-DPSLVKWSQIPMSVVNSKAHKDL 364
Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
+L +RQ + LL+N+ LPLS R +AVIGPN+D + GNY G T L G
Sbjct: 365 SLNMSRQTMTLLQNKNNVLPLSK-SIRKIAVIGPNADDKPMLWGNYNGTPNQTITILDGF 423
Query: 446 G---RYARTIHQQGCKDVACADDQLFGAAID 473
+ + I+ +GC V +D+ + +D
Sbjct: 424 KTKLKKNQIIYMKGCDLV---NDKTLESYLD 451
Score = 100 bits (250), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 83/312 (26%), Positives = 132/312 (42%), Gaps = 50/312 (16%)
Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
+ +I + D + V G+ +E E + DR + LP Q+ + + A
Sbjct: 590 YDESISKLKNVDMVVFVGGISPQLEGEEMPLNLPGFKNGDRTDIELPAVQRNFLKALKEA 649
Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
SK + V SG + A AI+ A Y G+ GG A+AD+LFG NP GKLP+T
Sbjct: 650 SK-QVVFVNCSGSSM--ALLPETESCDAILQAWYGGELGGQAVADVLFGDYNPSGKLPVT 706
Query: 578 WYPQEYITNLPMTE-MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
+Y LP E +M+ GRTYR+ P ++PFG G+SYT+F TV
Sbjct: 707 FYKS--TKQLPDYEDYSMK--------GRTYRYMSDP-LFPFGFGLSYTDF--TVG---- 749
Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA 696
A S ++ A L + V + N G + G + V+
Sbjct: 750 -------------TAQCSKTQLRTEEA------LTLTVPISNTGKRSGTEVIQVYIRKTD 790
Query: 697 GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKH 756
P K L A+ + + AGA Q + I + + + T R+ GE+ + G +
Sbjct: 791 DTGGPLKSLKAYARAELAAGATQDIEIQLPAESFECFDPSTNTMRVAPGEYELFYGTSSA 850
Query: 757 SVSLHAATLGVI 768
+ L + + ++
Sbjct: 851 AKDLQSVKVTLL 862
>gi|421449803|ref|ZP_15899183.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 58-6482]
gi|396068619|gb|EJI76965.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Enteritidis str. 58-6482]
Length = 765
Score = 262 bits (670), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 203/669 (30%), Positives = 325/669 (48%), Gaps = 103/669 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN +GRV + Y GL W+P V++ RDPRWGR
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V+ +QG S DR V S KHF AY +N VD
Sbjct: 172 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R EW G
Sbjct: 227 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 283
Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T+ PE+A A++AG+D+ + + ++ G ++ +
Sbjct: 284 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 341
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPK+ D H++ A E AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 396
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGIGR------ 447
LLKN+ +LPL + T+AV+GP +D ++G++ AG+A T L GI
Sbjct: 397 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVDDGA 454
Query: 448 ---YAR--------------TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQS 490
YA+ ++++ K + + A+ A++QAD + V+G Q
Sbjct: 455 KILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 514
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 515 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
+ G GG AIAD+LFG NP GKLP++ +P+ + +P+ T P + +Y
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 629
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F TV++ ++ P R G + A+
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 672
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
V+V N G ++GA + ++ A P KQL FEK+ + G ++ V
Sbjct: 673 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGERKTVSF 723
Query: 723 -INIHVCKY 730
I+I K+
Sbjct: 724 PIDIEALKF 732
>gi|437849166|ref|ZP_20847216.1| beta-D-glucoside glucohydrolase, partial [Salmonella enterica
subsp. enterica serovar Enteritidis str. 6.0562-1]
gi|435338617|gb|ELP07830.1| beta-D-glucoside glucohydrolase, partial [Salmonella enterica
subsp. enterica serovar Enteritidis str. 6.0562-1]
Length = 792
Score = 262 bits (670), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 203/669 (30%), Positives = 325/669 (48%), Gaps = 103/669 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN +GRV + Y GL W+P V++ RDPRWGR
Sbjct: 145 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 198
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V+ +QG S DR V S KHF AY +N VD
Sbjct: 199 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 253
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R EW G
Sbjct: 254 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 310
Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T+ PE+A A++AG+D+ + + ++ G ++ +
Sbjct: 311 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 368
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPK+ D H++ A E AR+ +V
Sbjct: 369 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 423
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGIGR------ 447
LLKN+ +LPL + T+AV+GP +D ++G++ AG+A T L GI
Sbjct: 424 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVDDGA 481
Query: 448 ---YAR--------------TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQS 490
YA+ ++++ K + + A+ A++QAD + V+G Q
Sbjct: 482 KILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 541
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 542 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 598
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
+ G GG AIAD+LFG NP GKLP++ +P+ + +P+ T P + +Y
Sbjct: 599 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 656
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F TV++ ++ P R G + A+
Sbjct: 657 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 699
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
V+V N G ++GA + ++ A P KQL FEK+ + G ++ V
Sbjct: 700 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGERKTVSF 750
Query: 723 -INIHVCKY 730
I+I K+
Sbjct: 751 PIDIEALKF 759
>gi|160892207|ref|ZP_02073210.1| hypothetical protein BACUNI_04671 [Bacteroides uniformis ATCC 8492]
gi|156858685|gb|EDO52116.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
uniformis ATCC 8492]
Length = 990
Score = 262 bits (670), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 207/681 (30%), Positives = 332/681 (48%), Gaps = 74/681 (10%)
Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLT-YWSPNVNIFRDPRWGRG 168
AT+FP + ++N L +G + EAR + G T ++P +++ RD RWGR
Sbjct: 221 ATNFPTQLGLGHTWNRELIRQVGLITGREARML------GYTNVYAPILDVGRDQRWGRY 274
Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVS 228
+E GE P L + VRGLQ + +VAA+ KHF AY + + ++S
Sbjct: 275 EEVYGESPYLVAELGIEMVRGLQHNH----QVAATGKHFAAYSNNKGAREGMARVDPQMS 330
Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
+++E+ PF+ + E + VM SYN +G+P L +RGE GY+VSD
Sbjct: 331 PREVENIHIYPFKRVIREAGMLGVMSSYNDYDGIPVQGSYYWLTTRLRGEMGFRGYVVSD 390
Query: 289 CDSVGVYYDTQHFTSTP-EEAAADAIRAGLDLDC-----GPFLGLHTESAVQRGLLSEID 342
D+V Y T+H T+ +EA ++ AGL++ C F+ L V+ G LSE
Sbjct: 391 SDAVEYLY-TKHGTAKDMKEAVRQSVEAGLNVRCTFRSPDSFV-LPLRELVKEGGLSEEV 448
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGP 402
IN+ + + L V+ +G+FD P ++V +++ +AL+A+R+ IVLLKN G
Sbjct: 449 INDRVRDILRVKFLIGLFDA-PYQTDLAD-ADREVEKEENEAIALQASRESIVLLKNAGE 506
Query: 403 SLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGR----YARTIHQQGCK 458
LPL + +AV GPN++ + +Y +A TT L+GI A ++ +GC
Sbjct: 507 LLPLDINSTKKIAVCGPNANEEGYALTHYGPLAVEVTTVLEGIQEKTKGKAEVLYTKGCD 566
Query: 459 ------------DVACADDQL--FGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLP 504
D DD+ A++ +RQAD I+V+G Q E R L LP
Sbjct: 567 LVDAHWPESEIIDYPLTDDEQAEIDKAVENARQADVAIVVLGGGQRTCGENKSRTSLDLP 626
Query: 505 GRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADIL 564
GRQ +L+ + K P +L+L++G P+ + +A D + AI+ A YPG GGTA+ADIL
Sbjct: 627 GRQLQLLQAIQATGK-PVVLILINGRPLSINWA--DKFVPAILEAWYPGSKGGTALADIL 683
Query: 565 FGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQS---KRYPGRTYRFYK-GPVVYPFGH 620
FG NPGGKL +T +P+ + +P +PS + PG T + +YPFG+
Sbjct: 684 FGDYNPGGKLTVT-FPKT-VGQIPFN-FPCKPSSQIDGGKNPGPTGNMSRINGALYPFGY 740
Query: 621 GMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVG 680
G+SYT F ++ + V P + +AT+ +KVT+ VQ+ +++V
Sbjct: 741 GLSYTTFEYSDLDITPRVITPNE------SATVR---LKVTNTGKRAGDEVVQLYIRDVL 791
Query: 681 SKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTR 740
S + K L F+++H+ G Q + I K+L ++D
Sbjct: 792 SSITTY---------------EKNLAGFQRIHLEPGEAQELSFTID-RKHLELLDADMKW 835
Query: 741 RIPLGEHNIHIGGTKHSVSLH 761
+ G+ + G + + L+
Sbjct: 836 VVEPGDFVLMAGASSEDIRLN 856
>gi|417366389|ref|ZP_12138693.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Hvittingfoss str. A4-620]
gi|353592073|gb|EHC50188.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Hvittingfoss str. A4-620]
Length = 755
Score = 262 bits (670), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 205/669 (30%), Positives = 324/669 (48%), Gaps = 103/669 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN +GRV + Y GL W+P V++ RDPRWGR
Sbjct: 108 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 161
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V+ +QG S DR V S KHF AY +N VD
Sbjct: 162 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 216
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R EW G
Sbjct: 217 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 273
Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T+ PE+A A++AG+D+ + + ++ G ++ +
Sbjct: 274 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 331
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPK+ D H++ A E AR+ +V
Sbjct: 332 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 386
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + T+AV+GP +D ++G++ AG+A T L GI G A
Sbjct: 387 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 444
Query: 450 RTIHQQGCK---DVACAD----------------DQLFGAAIDASRQADATILVMGLDQS 490
+ ++ +G D D + A+ A++QAD + V+G Q
Sbjct: 445 KILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 504
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 505 MAHEASSRTDITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 561
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
+ G GG AIAD+LFG NP GKLP++ +P+ + +P+ T P + +Y
Sbjct: 562 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 619
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F TV++ ++ P R G + A+
Sbjct: 620 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 662
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
V+V N G ++GA + ++ A P KQL FEK+ + G + V
Sbjct: 663 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGESKTVSF 713
Query: 723 -INIHVCKY 730
I+I K+
Sbjct: 714 PIDIEALKF 722
>gi|317480750|ref|ZP_07939836.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides sp. 4_1_36]
gi|316903091|gb|EFV24959.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides sp. 4_1_36]
Length = 942
Score = 262 bits (670), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 208/684 (30%), Positives = 333/684 (48%), Gaps = 80/684 (11%)
Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLT-YWSPNVNIFRDPRWGRG 168
AT+FP + ++N L +G + EAR + G T ++P +++ RD RWGR
Sbjct: 173 ATNFPTQLGLGHTWNRELIRQVGLITGREARML------GYTNVYAPILDVGRDQRWGRY 226
Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVS 228
+E GE P L + VRGLQ + +VAA+ KHF AY + + ++S
Sbjct: 227 EEVYGESPYLVAELGIEMVRGLQHNH----QVAATGKHFAAYSNNKGAREGMARVDPQMS 282
Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
+++E+ PF+ + E + VM SYN +G+P L +RGE GY+VSD
Sbjct: 283 PREVENIHIYPFKRVIREAGMLGVMSSYNDYDGIPVQGSYYWLTTRLRGEMGFRGYVVSD 342
Query: 289 CDSVGVYYDTQHFTSTP-EEAAADAIRAGLDLDC-----GPFLGLHTESAVQRGLLSEID 342
D+V Y T+H T+ +EA ++ AGL++ C F+ L V+ G LSE
Sbjct: 343 SDAVEYLY-TKHGTAKDMKEAVRQSVEAGLNVRCTFRSPDSFV-LPLRELVKEGGLSEEV 400
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGH---LGPKDVCTPDHQELALEAARQGIVLLKN 399
IN+ + + L V+ +G+FD PY ++V +++ +AL+A+R+ IVLLKN
Sbjct: 401 INDRVRDILRVKFLIGLFDA-----PYQTDLADADREVEKEENEAIALQASRESIVLLKN 455
Query: 400 QGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGR----YARTIHQQ 455
G LPL + +AV GPN++ + +Y +A TT L+GI A ++ +
Sbjct: 456 AGELLPLDINSTKKIAVCGPNANEEGYALTHYGPLAVEVTTVLEGIQEKTKGKAEVLYTK 515
Query: 456 GCK------------DVACADDQL--FGAAIDASRQADATILVMGLDQSIEAEALDRAGL 501
GC D DD+ A++ +RQAD I+V+G Q E R L
Sbjct: 516 GCDLVDAHWPESEIIDYPLTDDEQAEIDKAVENARQADVAIVVLGGGQRTCGENKSRTSL 575
Query: 502 LLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIA 561
LPGRQ +L+ + K P +L+L++G P+ + +A D + AI+ A YPG GGTA+A
Sbjct: 576 DLPGRQLQLLQAIQATGK-PVVLILINGRPLSINWA--DKFVPAILEAWYPGSKGGTALA 632
Query: 562 DILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQS---KRYPGRTYRFYK-GPVVYP 617
DILFG NPGGKL +T +P+ + +P +PS + PG T + +YP
Sbjct: 633 DILFGDYNPGGKLTVT-FPKT-VGQIPFN-FPCKPSSQIDGGKNPGPTGNMSRINGALYP 689
Query: 618 FGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVK 677
FG+G+SYT F ++ + V P + +AT+ +KVT+ VQ+ ++
Sbjct: 690 FGYGLSYTTFEYSDLDITPRVITPNE------SATVR---LKVTNTGKRAGDEVVQLYIR 740
Query: 678 NVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRS 737
+V S + K L F+++H+ G Q + I K+L ++D
Sbjct: 741 DVLSSITTY---------------EKNLAGFQRIHLEPGEAQELSFTID-RKHLELLDAD 784
Query: 738 GTRRIPLGEHNIHIGGTKHSVSLH 761
+ G+ + G + + L+
Sbjct: 785 MKWVVEPGDFVLMAGASSEDIRLN 808
>gi|293372493|ref|ZP_06618877.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
gi|299144770|ref|ZP_07037838.1| periplasmic beta-glucosidase [Bacteroides sp. 3_1_23]
gi|292632676|gb|EFF51270.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
gi|298515261|gb|EFI39142.1| periplasmic beta-glucosidase [Bacteroides sp. 3_1_23]
Length = 735
Score = 262 bits (669), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 228/773 (29%), Positives = 355/773 (45%), Gaps = 101/773 (13%)
Query: 29 ACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLI--------------SGAA 74
A D KD P PI +RVNDL+ R++L+EKV L
Sbjct: 21 AKDKKDVALYKDP----KAPIEKRVNDLLSRMTLEEKVMQLNQYTLGRNNNVNNVGEEVK 76
Query: 75 AVP-RLGIKGYEWWSEALHGV--------SNVGPGTKFGGDFPGA--TSFPQVITTASSF 123
VP +G Y + AL S +G FG D T +P + A S+
Sbjct: 77 KVPAEIGSLIYFETNPALRNSMQKKAMEESRLGIPIIFGYDAIHGFRTVYPISLAQACSW 136
Query: 124 NATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQETPGEDPVLSGKY 182
N L E V + EAR +G+ + +SP +++ RDPRWGR E GEDP +G +
Sbjct: 137 NPDLVEQACAVSAQEAR------MSGVDWTFSPMIDVARDPRWGRVAEGYGEDPYTNGVF 190
Query: 183 AASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFR 241
A+ V+G QG D ++AA KH+ Y R + ++SKQ + DT+ +P+
Sbjct: 191 GAASVKGYQGDDLSAENRMAACLKHYVGYGASE---AGRDYVYTEISKQTLWDTYLLPYE 247
Query: 242 MCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHF 301
M V G A++M S+N ++GVP A+P I+ ++ W +G+IVSD ++ Q
Sbjct: 248 MGVKAG-AATLMSSFNDISGVPGSANPYIMTEILKKRWGHDGFIVSDWGAIE-QLKNQGL 305
Query: 302 TSTPEEAAADAIRAGLDLDCGPF-LGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMF 360
+T +EAA A AGL++D H + V+ G +S ++ A+ L ++ RLG+F
Sbjct: 306 AATKKEAAWHAFTAGLEMDMMSHAYDRHLQELVEEGRVSVAQVDEAVRRVLLLKFRLGLF 365
Query: 361 DGEPSSQPY--GHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIG 418
+ +PY + P ++A A + +VLLKN+ +LPL+ + +AVIG
Sbjct: 366 E-----RPYTPATSEKERFFRPQSMDIAARLAAESMVLLKNENKTLPLTD--KKKIAVIG 418
Query: 419 PNSDVTVTMIGNYAG------IACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAI 472
P + ++G++ G +A Y A + GC + + F A+
Sbjct: 419 PMAKNGWDLLGSWCGHGKDTDVAMLYNGLATEFAGKAELRYAAGCA-TKGDNKEGFAEAL 477
Query: 473 DASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPI 532
+A+R +D +L +G + E R+ + LP Q+EL +++ A K P +LVL++G P+
Sbjct: 478 EAARWSDVVVLCLGEMMTWSGENASRSSIALPQIQEELAAELKKAGK-PIVLVLVNGRPL 536
Query: 533 DVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEM 592
++ + IW PG G +A IL G NP GKL MT P +
Sbjct: 537 ELNRLELISDAILEIWQ--PGVNGALPMAGILSGRINPSGKLAMT---------FPYSTG 585
Query: 593 AMRPSQSKRYPGRTYR-FYKGPV---VYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGS 648
+ ++R GR ++ FYK +YPFGHG+SYT F + P+V V R
Sbjct: 586 QIPIYYNRRKSGRGHQGFYKDITSDPLYPFGHGLSYTEFKYGTVT-PSVTKVKRGDR--- 641
Query: 649 INATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVA 707
L V+V V NVG++DGA T+ F S P P K+L
Sbjct: 642 ---------------------LSVEVTVTNVGARDGAETVHWFISDPYCSITRPVKELKH 680
Query: 708 FEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
FEK + AG + +I + + V+ G R + GE++I + G + L
Sbjct: 681 FEKQLIRAGETKTFRFDIDLERDFGFVNEDGKRFLEAGEYHILVQGQTVKIEL 733
>gi|401676349|ref|ZP_10808334.1| glucosidase [Enterobacter sp. SST3]
gi|400216388|gb|EJO47289.1| glucosidase [Enterobacter sp. SST3]
Length = 765
Score = 262 bits (669), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 212/691 (30%), Positives = 337/691 (48%), Gaps = 106/691 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN + +GRV + Y GL W+P V++ RDPRWGRG
Sbjct: 118 TVFPISLGLASSFNLDAVKTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRGS 171
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V +QG S DR V S KHF AY +N VD
Sbjct: 172 EGFGEDTFLTSTMGKTMVEAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R +W G
Sbjct: 227 --MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDQWGFKGI 283
Query: 285 IVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T S PE+A A+++G+++ + + V+ G ++ +
Sbjct: 284 TVSDHGAIKEL--IKHGTASDPEDAVRVALKSGINMSMSDEYYSKYLPGLVKSGKVTMAE 341
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPKD D H++ A + AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKDSDPADTNAESRLHRKEARDVARESLV 396
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + T+AV+GP +D ++G++ AG+A T L GI G A
Sbjct: 397 LLKNRLDTLPLK--KSGTIAVVGPLADSKRDVMGSWSAAGVADQSVTVLTGIKSAVGENA 454
Query: 450 RTIHQQGC-----KDVACADDQ--------------LFGAAIDASRQADATILVMGLDQS 490
+ ++ +G KD+ +Q + A++A++Q+D + V+G Q
Sbjct: 455 KVVYAKGANVTNDKDIVTFLNQYEEAVKVDPRTPKEMIDEAVNAAKQSDVVVAVVGEAQG 514
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 515 MAHEASSRTDITIPQSQRDLIAALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM---RP---SQSKRYPG 604
+ G GG AIAD+LFG NP GKLPM+ +P+ + +P+ + RP + +Y
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPMS-FPRS-VGQIPVYYSHLNTGRPYNADKPNKYTS 629
Query: 605 RTYRFYKGPVVYPFGHGMSYTNF-VHTVA-NAPTVVAVPLDGRHGSINATISGKAIKVTH 662
R + GP +YPFG+G+SYT F V V +APT+ R G + A+
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTFKVSDVKMSAPTL------KRDGKVTAS---------- 672
Query: 663 AKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRV 721
V+V N G ++GA + ++ A P KQL FEKV++ G + +
Sbjct: 673 -----------VEVTNTGKREGATVIQMYVQDVTASMSRPVKQLRGFEKVNLKPGETKTI 721
Query: 722 GINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
I V L ++ G+ N+ IG
Sbjct: 722 SFPIDV-DALKFWNQQMKYDAEPGKFNVFIG 751
>gi|296103779|ref|YP_003613925.1| putative periplasmic beta-glucosidase [Enterobacter cloacae subsp.
cloacae ATCC 13047]
gi|295058238|gb|ADF62976.1| putative periplasmic beta-glucosidase precursor [Enterobacter
cloacae subsp. cloacae ATCC 13047]
Length = 765
Score = 262 bits (669), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 209/689 (30%), Positives = 336/689 (48%), Gaps = 102/689 (14%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN + +GRV + Y GL W+P V++ RDPRWGR
Sbjct: 118 TVFPISLGLASSFNLDAVKTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V +QG S DR V S KHF AY +N VD
Sbjct: 172 EGFGEDTYLTATMGKTMVEAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R +W G
Sbjct: 227 --MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDAWLLKDVLRDQWGFKGI 283
Query: 285 IVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T S PE+A A+++G+++ + + V+ G ++ +
Sbjct: 284 TVSDHGAIKEL--IKHGTASDPEDAVRVALKSGINMSMSDEYYSKYLPGLVKSGKVTMAE 341
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPK+ D H++ A E AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPADTNAESRLHRKEAREVARESLV 396
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + T+AV+GP +D ++G++ AG+A T L GI G A
Sbjct: 397 LLKNRLDTLPLK--KSGTIAVVGPLADSKRDVMGSWSAAGVADQSVTVLTGIKNAVGENA 454
Query: 450 RTIHQQGC-----KDVACADDQ--------------LFGAAIDASRQADATILVMGLDQS 490
+ ++ +G KD+ +Q + A++A++Q+D + V+G Q
Sbjct: 455 KVVYAKGANVTDDKDIVTFLNQYEEAVKVDPRSPKEMIDEAVNAAKQSDVVVAVVGEAQG 514
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 515 MAHEASSRTDITIPQSQRDLIAALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM---RP---SQSKRYPG 604
+ G GG AIAD+LFG NP GKLPM+ +P+ + +P+ + RP + +Y
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPMS-FPRS-VGQIPVYYSHLNTGRPYNADKPNKYTS 629
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F V++ ++ P R G + A+
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--KVSDVK--MSAPSLKRDGKVTAS------------ 672
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGI 723
VDV N G ++GA + ++ A P KQL FEKV++ G + +
Sbjct: 673 ---------VDVTNTGKREGATVIQMYVQDVTASMSRPVKQLRGFEKVNLKPGETKTISF 723
Query: 724 NIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
I V L ++ G+ N++IG
Sbjct: 724 PIDV-DALKFWNQQMKYDAEPGKFNVYIG 751
>gi|423313768|ref|ZP_17291703.1| hypothetical protein HMPREF1058_02315 [Bacteroides vulgatus
CL09T03C04]
gi|392684303|gb|EIY77631.1| hypothetical protein HMPREF1058_02315 [Bacteroides vulgatus
CL09T03C04]
Length = 788
Score = 262 bits (669), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 200/679 (29%), Positives = 314/679 (46%), Gaps = 82/679 (12%)
Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLT-YWSPNVNIFRDPRWGRG 168
AT FP +++N L IG V + EA A+ G T +SP ++I +DPRWGR
Sbjct: 158 ATYFPAQCGQGATWNKKLIARIGEVEAKEAVAL------GYTNIYSPILDIAQDPRWGRC 211
Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVS 228
ET GEDP L G+ + LQ + + A+ KHF Y + + + V+
Sbjct: 212 VETYGEDPYLVGELGKQMITSLQ-----KYNLVATPKHFAVYSIPIGGRDGKTRTDPHVA 266
Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
+++ + PFRM E VM SYN +G P + L +R EW GY+VSD
Sbjct: 267 PREMRTLYIEPFRMAFQEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKGYVVSD 326
Query: 289 CDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC-----GPFLGLHTESAVQRGLLSEIDI 343
++V + T E+ A A+ AGL++ F+ L AV G +S+ +
Sbjct: 327 SEAVEFISNKHKVADTYEDGIAQAVNAGLNIRTHFTPPADFI-LPLRKAVDNGKISQETL 385
Query: 344 NNALVNTLTVQMRLGMFDGEPSSQPY---GHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
+ + L ++ RLG+FD PY G + V + +HQ ++LEAARQ +VLLKN+
Sbjct: 386 DKRVAEILRIKFRLGLFD-----NPYRGNGKQAEQIVHSKEHQAVSLEAARQSLVLLKNE 440
Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGR---YARTIHQQGC 457
LPLS R++AVIGPN++ +I Y T QGI +A I+++GC
Sbjct: 441 TNLLPLSK-SIRSIAVIGPNANEQTQLICRYGPANAPIKTVYQGIKELLPHAEVIYKKGC 499
Query: 458 KDV--------------ACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLL 503
+ + QL AI A++QA+ ++V+G ++ E R L L
Sbjct: 500 DIIDPHFPESEILDFPKTAEEVQLMEEAIRAAKQAEVVVMVLGGNELTVREDRSRTSLNL 559
Query: 504 PGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADI 563
PGRQ+EL+ V K P ILV++ G + +A + AI+ A +PG+ G A+A+
Sbjct: 560 PGRQEELLKAVCATGK-PVILVMLDGRASSINYAA--AHVPAILHAWFPGEFCGQAVAEA 616
Query: 564 LFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMS 623
LFG NPGG+L +T +P+ + +P +P + Y +YPFGHG+S
Sbjct: 617 LFGDYNPGGRLAVT-FPKS-VGQIPFA-FPFKPGSDESSSTSVY-----GALYPFGHGLS 668
Query: 624 YTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKD 683
YT F ++ + ++ G G I H C +KN G
Sbjct: 669 YTTFTYSDLH----ISPSHQGVQGDI------------HVSCK---------IKNTGKIK 703
Query: 684 GAHTLLVFSTPPAGHWAPH-KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRI 742
G + ++ + K L FE++ + AG +Q V + + L + D++ R+
Sbjct: 704 GDEVVQLYLRDEISSVTTYTKVLRGFERISLKAGEEQTVHFRLR-PQDLGLWDKNMNFRV 762
Query: 743 PLGEHNIHIGGTKHSVSLH 761
G + +G + + LH
Sbjct: 763 EPGSFKVMLGASSTDIRLH 781
>gi|383115340|ref|ZP_09936096.1| hypothetical protein BSGG_2785 [Bacteroides sp. D2]
gi|313695250|gb|EFS32085.1| hypothetical protein BSGG_2785 [Bacteroides sp. D2]
Length = 735
Score = 262 bits (669), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 228/773 (29%), Positives = 357/773 (46%), Gaps = 101/773 (13%)
Query: 29 ACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLI--------------SGAA 74
A D KD P PI +RVNDL+ R++L+EKV L
Sbjct: 21 AKDKKDVALYKDP----KAPIEKRVNDLLSRMTLEEKVMQLNQYTLGRNNNVNNVGEEVK 76
Query: 75 AVP-RLGIKGYEWWSEALHGV--------SNVGPGTKFGGDFPGA--TSFPQVITTASSF 123
VP +G Y + AL S +G FG D T +P + A S+
Sbjct: 77 KVPAEIGSLIYFETNPALRNSMQKKAMEESRLGIPIIFGYDAIHGFRTVYPISLAQACSW 136
Query: 124 NATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQETPGEDPVLSGKY 182
N L E V + EAR +G+ + +SP +++ RDPRWGR E GEDP +G +
Sbjct: 137 NPDLVEQACAVSAQEAR------MSGVDWTFSPMIDVARDPRWGRVAEGYGEDPYTNGVF 190
Query: 183 AASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFR 241
A+ V+G QG D ++AA KH+ Y R + ++SKQ + DT+ +P+
Sbjct: 191 GAASVKGYQGDDLSAENRMAACLKHYVGYGASE---AGRDYVYTEISKQTLWDTYLLPYE 247
Query: 242 MCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHF 301
M V G A++M S+N ++GVP A+ I+ ++ W +G+IVSD ++ Q
Sbjct: 248 MGVKAG-AATLMSSFNDISGVPGSANSYIMTEILKKRWGHDGFIVSDWGAIE-QLKNQGL 305
Query: 302 TSTPEEAAADAIRAGLDLDCGPF-LGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMF 360
+T +EAA A AGL++D H + V+ G +S ++ A+ L ++ RLG+F
Sbjct: 306 AATKKEAAWHAFTAGLEMDMMSHAYDRHLQELVEEGRVSVAQVDEAVRRVLLLKFRLGLF 365
Query: 361 DGEPSSQPY--GHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIG 418
+ +PY + P ++A A + +VLLKN+ +LPL+ + +AVIG
Sbjct: 366 E-----RPYTPATSEKERFFRPQSMDIAARLAAESMVLLKNENKTLPLTD--KKKIAVIG 418
Query: 419 PNSDVTVTMIGNYAG------IACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAI 472
P + ++G++ G +A Y A + GC + + F A+
Sbjct: 419 PMAKNGWDLLGSWCGHGKDTDVAMLYNGLATEFAGKAELRYAAGCA-TKGDNKEGFAEAL 477
Query: 473 DASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPI 532
+A+R +D +L +G + E R+ + LP Q+EL +++ A K P +LVL++G P+
Sbjct: 478 EAARWSDVVVLCLGEMMTWSGENASRSSIALPQIQEELAAELKKAGK-PIVLVLVNGRPL 536
Query: 533 DVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEM 592
++ + +P AI+ PG G +A IL G NP GKL MT P +
Sbjct: 537 EL--NRLEPISDAILEIWQPGVNGALPMAGILSGRINPSGKLAMT---------FPYSTG 585
Query: 593 AMRPSQSKRYPGRTYR-FYKGPV---VYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGS 648
+ ++R GR ++ FYK +YPFGHG+SYT F + P+V V R
Sbjct: 586 QIPIYYNRRKSGRGHQGFYKDITSDPLYPFGHGLSYTEFKYGTVT-PSVTKVKRGDR--- 641
Query: 649 INATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVA 707
L V+V V NVG++DGA T+ F S P P K+L
Sbjct: 642 ---------------------LSVEVTVTNVGARDGAETVHWFISDPYCSITRPVKELKH 680
Query: 708 FEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
FEK + AG + +I + + V+ G R + GE++I + G + L
Sbjct: 681 FEKQLIKAGETKTFRFDIDLERDFGFVNEDGKRFLEAGEYHILVQGQTVKIEL 733
>gi|205357092|ref|ZP_02344874.2| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Saintpaul str. SARA29]
gi|213650030|ref|ZP_03380083.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Typhi str. J185]
gi|289824263|ref|ZP_06543858.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Typhi str. E98-3139]
gi|378958841|ref|YP_005216327.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar Typhi
str. P-stx-12]
gi|205324021|gb|EDZ11860.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Saintpaul str. SARA29]
gi|374352713|gb|AEZ44474.1| Periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Typhi str. P-stx-12]
Length = 755
Score = 262 bits (669), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 204/669 (30%), Positives = 326/669 (48%), Gaps = 103/669 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN +GRV + Y GL W+P V++ RDPRWGR
Sbjct: 108 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 161
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V+ +QG S DR V S KHF AY +N VD
Sbjct: 162 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 216
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R EW G
Sbjct: 217 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 273
Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T+ PE+A A++AG+D+ + + ++ G ++ +
Sbjct: 274 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 331
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPK+ D H++ A E AR+ +V
Sbjct: 332 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 386
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + T+AV+GP +D ++G++ AG+A T L GI G A
Sbjct: 387 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 444
Query: 450 RTIHQQGC-----KDVAC--------------ADDQLFGAAIDASRQADATILVMGLDQS 490
+ ++ +G KD+ + + A+ A++QAD + V+G Q
Sbjct: 445 KILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 504
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 505 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 561
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
+ G GG AIAD+LFG NP GKLP++ +P+ + +P+ T P + +Y
Sbjct: 562 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 619
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F TV++ ++ P R G + A+
Sbjct: 620 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 662
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
V+V N G ++G + ++ A P KQL FEK+ + G ++ V
Sbjct: 663 ---------VEVTNTGKREGETVIQMYLQDVTASMSRPVKQLKGFEKITLKPGERKTVSF 713
Query: 723 -INIHVCKY 730
I+I K+
Sbjct: 714 PIDIEALKF 722
>gi|379996382|gb|AFD23677.1| glucosidase [uncultured organism]
Length = 765
Score = 262 bits (669), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 212/691 (30%), Positives = 338/691 (48%), Gaps = 106/691 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN + +GRV + Y GL W+P V++ RDPRWGRG
Sbjct: 118 TVFPISLGLASSFNLDAVKTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRGS 171
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V +QG S DR V S KHF AY +N VD
Sbjct: 172 EGFGEDTYLTSTMGKTMVEAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R +W G
Sbjct: 227 --MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDQWGFKGI 283
Query: 285 IVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T S PE+A A+++G+++ + + V+ G ++ +
Sbjct: 284 TVSDHGAIKEL--IKHGTASDPEDAVRVALKSGINMSMSDEYYSKYLPGLVKSGKVTMAE 341
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPKD D H++ A + AR+ +V
Sbjct: 342 LDDAARHVLNVKYDMGLFN-----DPYSHLGPKDSDPADTNAESRLHRKEARDVARESLV 396
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + T+AV+GP +D ++G++ AG+A T L GI G A
Sbjct: 397 LLKNRLDTLPLK--KSGTIAVVGPLADSKRDVMGSWSAAGVADQSVTVLTGIKSAVGDNA 454
Query: 450 RTIHQQGC-----KDVACADDQ--------------LFGAAIDASRQADATILVMGLDQS 490
+ ++ +G KD+ +Q + A++ ++Q+D + V+G Q
Sbjct: 455 KVVYAKGANVTNDKDIVTFLNQYEEAVKVDPRTPKEMIDEAVNTAKQSDVVVAVVGEAQG 514
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 515 MAHEASSRTDITIPQSQRDLIAALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM---RP---SQSKRYPG 604
+ G GG AIAD+LFG NP GKLPM+ +P+ + +P+ + RP + +Y
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPMS-FPRS-VGQIPVYYSHLNTGRPYNADKPNKYTS 629
Query: 605 RTYRFYKGPVVYPFGHGMSYTNF-VHTVA-NAPTVVAVPLDGRHGSINATISGKAIKVTH 662
R + GP +YPFG+G+SYT+F V V +APT+ R G + A+
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTSFKVSDVKMSAPTL------KRDGKVTAS---------- 672
Query: 663 AKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRV 721
V+V N G ++GA + ++ A P KQL FEKV++ G Q +
Sbjct: 673 -----------VEVTNTGKREGATVIQMYVQDVTASMSRPVKQLRGFEKVNLKPGETQTI 721
Query: 722 GINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
I V + L ++ G+ N+ IG
Sbjct: 722 SFPIDV-EALKFWNQQMKYDAEPGKFNVFIG 751
>gi|200387658|ref|ZP_03214270.1| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
subsp. enterica serovar Virchow str. SL491]
gi|199604756|gb|EDZ03301.1| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
subsp. enterica serovar Virchow str. SL491]
Length = 765
Score = 262 bits (669), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 205/669 (30%), Positives = 324/669 (48%), Gaps = 103/669 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN +GRV + Y GL W+P V++ RDPRWGR
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V+ +QG S DR V S KHF AY +N VD
Sbjct: 172 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R EW G
Sbjct: 227 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 283
Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T+ PE+A A++AG+D+ + + ++ G ++ +
Sbjct: 284 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 341
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPK+ D H++ A E AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 396
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + T+AV+GP +D ++G++ AG+A T L GI G A
Sbjct: 397 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 454
Query: 450 RTIHQQGCK---DVACAD----------------DQLFGAAIDASRQADATILVMGLDQS 490
+ ++ +G D D + A+ A++QAD + V+G Q
Sbjct: 455 KILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 514
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 515 MAHEASSRTDITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
+ G GG AIAD+LFG NP GKLP++ +P+ + +P+ T P + +Y
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 629
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F TV++ ++ P R G + A+
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 672
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
V+V N G ++GA + ++ A P KQL FEK+ + G + V
Sbjct: 673 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKLTLKPGESKTVSF 723
Query: 723 -INIHVCKY 730
I+I K+
Sbjct: 724 PIDIEALKF 732
>gi|161502686|ref|YP_001569798.1| hypothetical protein SARI_00733 [Salmonella enterica subsp.
arizonae serovar 62:z4,z23:- str. RSK2980]
gi|160864033|gb|ABX20656.1| hypothetical protein SARI_00733 [Salmonella enterica subsp.
arizonae serovar 62:z4,z23:-]
Length = 771
Score = 262 bits (669), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 206/689 (29%), Positives = 335/689 (48%), Gaps = 102/689 (14%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN +GRV + Y GL W+P V++ RDPRWGR
Sbjct: 124 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 177
Query: 170 ETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V+ +QG + DR V S KHF AY +N VD
Sbjct: 178 EGFGEDTYLTSIMGETMVKAMQGKNPADRYSVMTSVKHFAAYGAVEGGKEYNSVD----- 232
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R EW G
Sbjct: 233 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKEVLRDEWGFKGI 289
Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T+ PE+A A+++G+D+ + + ++ G ++ +
Sbjct: 290 TVSDHGAIKEL--IKHGTAADPEDAVRVALKSGVDMSMADEYYSKYLPGLIKSGKVTMAE 347
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPK+ D H++ A E AR+ +V
Sbjct: 348 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESLV 402
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGIGR------ 447
LLKN+ +LPL + T+AV+GP +D ++G++ AG+A T L GI
Sbjct: 403 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVADQSVTVLAGIQNAVGDGA 460
Query: 448 ---YAR--------------TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQS 490
YA+ ++++ K + + A+ A++QAD + V+G Q
Sbjct: 461 KILYAKGANITNNKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 520
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 521 MAHEASSRTDITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 577
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM---RPSQSK---RYPG 604
+ G GG AIAD+LFG NP GKLPM+ +P+ + +P+ + RP ++ +Y
Sbjct: 578 FAGTEGGNAIADVLFGDYNPSGKLPMS-FPRS-VGQIPVYYSHLNTGRPYDTEKPNKYTS 635
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F TV++ ++ P R G++ A+
Sbjct: 636 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSAPTMKRDGNVTAS------------ 678
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGI 723
V+V N G ++GA + ++ A P KQL FEK+ + G ++ V
Sbjct: 679 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGERKTVSF 729
Query: 724 NIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
I + + L ++ G+ NI IG
Sbjct: 730 PIDI-EALKFWNQQMKYDAEPGKFNIFIG 757
>gi|423222970|ref|ZP_17209439.1| hypothetical protein HMPREF1062_01625 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392640546|gb|EIY34345.1| hypothetical protein HMPREF1062_01625 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 862
Score = 261 bits (668), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 156/451 (34%), Positives = 240/451 (53%), Gaps = 33/451 (7%)
Query: 38 RTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNV 97
+TLP+ L +R DL+ RL+L+EK L+ + A+PRLGIK + WWSEALHGV+N
Sbjct: 19 QTLPYQNPELSPAERAKDLVKRLTLEEKALLMCDDSEAIPRLGIKKFNWWSEALHGVANQ 78
Query: 98 GPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG-------- 149
G T FP+ + A+SFN L I VSDE RA +N
Sbjct: 79 G----------NVTVFPEPVGMAASFNDKLVFEIFNAVSDEMRAKHNERVRNGLEDVRFH 128
Query: 150 -LTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASC-KHF 207
L+ W+PNVNIFRDPRWGRGQET GEDP L+ + + V+GLQG + ++ + +C KH+
Sbjct: 129 SLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGIAVVKGLQGPENEKYRKLLACAKHY 188
Query: 208 TAYDLDNWNGVDRFHFNA-KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCA 266
+ W+ R N VS +D+ +T+ F+ V + V VMC+Y +++ P C
Sbjct: 189 AVHSGPEWS---RHTANLNNVSPRDLWETYLPAFKALVQKADVREVMCAYQRLDDDPCCG 245
Query: 267 DPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGP-FL 325
+ +L++ +R EW +VSDC ++ ++ + +S AA AG D++CG +
Sbjct: 246 NTRLLQQILRDEWGFKYLVVSDCGAIADFWTSHKSSSDAVHAAVKGTMAGTDVECGYGYA 305
Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
AV RGL++E +++ ++ + + LG D +PS + + V H++L
Sbjct: 306 YQKLPEAVSRGLITEEEVDKHVLRLMEGRFELGEMD-DPSLVNWTKIPMSVVNCKAHKDL 364
Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
+L +RQ + LL+N+ LPLS R +AVIGPN+D + GNY G T L G
Sbjct: 365 SLNMSRQTMTLLQNKNNVLPLSK-SIRKIAVIGPNADDKPMLWGNYNGTPNQTITILDGF 423
Query: 446 G---RYARTIHQQGCKDVACADDQLFGAAID 473
+ + ++ +GC V +DQ + +D
Sbjct: 424 KSKLKKNQIVYMKGCDLV---NDQTLESYLD 451
Score = 93.2 bits (230), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 77/279 (27%), Positives = 118/279 (42%), Gaps = 52/279 (18%)
Query: 497 DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAG 556
DR + LP Q+ + + A K + V SG + A AI+ A Y G+ G
Sbjct: 629 DRTDIELPAVQRNFLKALKDAGK-QVVFVNCSGSSM--ALLPETESCDAILQAWYGGELG 685
Query: 557 GTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTE-MAMRPSQSKRYPGRTYRFYKGPVV 615
G A+AD+LFG NP GKLP+T+Y LP E +M+ GRTYR+ P +
Sbjct: 686 GYAVADVLFGDYNPSGKLPVTFYKS--TKQLPDYEDYSMK--------GRTYRYMSDP-L 734
Query: 616 YPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRL------T 669
+PFG G+SYT+F V A CN+ +
Sbjct: 735 FPFGFGLSYTDFA-------------------------------VGTASCNKTQLRTDES 763
Query: 670 LGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCK 729
L + V V N G + G + V+ P K L A+ +V + AGA+Q V I +
Sbjct: 764 LTLTVPVSNTGKRSGTEVVQVYIRKTDDADGPLKSLKAYARVELAAGAKQDVKIELPSES 823
Query: 730 YLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVI 768
+ + T R+ G++ + G + L + + ++
Sbjct: 824 FECFDPSTNTMRVAPGKYELFYGTSSADRDLQSVKVTLL 862
>gi|383114908|ref|ZP_09935668.1| hypothetical protein BSGG_5166 [Bacteroides sp. D2]
gi|382948422|gb|EIC71783.1| hypothetical protein BSGG_5166 [Bacteroides sp. D2]
Length = 782
Score = 261 bits (668), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 209/692 (30%), Positives = 327/692 (47%), Gaps = 94/692 (13%)
Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
GAT FP I A++++ L + +G+V++ E R+ G + P +++ RDPRW R
Sbjct: 148 GATVFPTGIGMAATWSPELVKEVGQVIAKEIRSQ-----GGHISYGPVLDLTRDPRWSRV 202
Query: 169 QETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
+ET GEDPVLSG AS V GL G + + A+ KHF AY + + A V
Sbjct: 203 EETFGEDPVLSGILGASMVDGLGGGNLSQKYATIATLKHFLAYAVPEGGQNGNY---ASV 259
Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
+D+ F PFR + G + SVM SYN ++G+P ++ +L + +R EW+ G++VS
Sbjct: 260 GIRDLHQNFLPPFRKAIDSGAL-SVMTSYNSIDGIPCTSNHYLLTQLLRNEWKFRGFVVS 318
Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
D S+ +++ T E AA ++ AG+D+D G + AVQ G + + I+ A+
Sbjct: 319 DLYSIEGIHESHFVALTKENAAIQSVTAGVDVDLGGDAYTNLCHAVQSGQMDKAVIDTAV 378
Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
L ++ +G+F+ P P + K V +H ELA + A+ I LLKN+ LPLS
Sbjct: 379 CRVLRMKFEMGLFE-HPYVDP--KIAAKTVRRKEHIELARKIAQSSITLLKNENSILPLS 435
Query: 408 HIRHRTVAVIGPNSDVTVTMIGNYAGIA--CGYTTPLQGIGRY---ARTIHQQGC--KDV 460
++ VAVIGPN+D M+G+Y T L GI +R + +GC +D
Sbjct: 436 KTINK-VAVIGPNADNRYNMLGDYTAPQEDSNVKTVLDGIITKLSPSRVEYVRGCAIRDT 494
Query: 461 ACADDQLFGAAIDASRQAD----------------------ATILVMGLDQSIE-AEALD 497
+ + AI+A+R+++ A + G +E E D
Sbjct: 495 TVNEIE---QAIEAARRSEVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFD 551
Query: 498 RAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGG 557
RA L L GRQQEL+ + K P I+V + G P++ +A A++ A YPGQ GG
Sbjct: 552 RASLSLLGRQQELLESLQKTGK-PLIVVYIEGRPLEKNWASE--YADALLTAYYPGQEGG 608
Query: 558 TAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYP-GRTYRFYKGPVVY 616
AIAD+LFG NP G+LP++ +P + + +K+ P Y +Y
Sbjct: 609 NAIADVLFGDYNPSGRLPIS---------VPRSVGQIPVYYNKKAPRNHDYVEVSSSPLY 659
Query: 617 PFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTH--AKCNRLTLGVQV 674
FG+GMSYT F ++ A++V A+C V
Sbjct: 660 SFGYGMSYTTFEYS--------------------------ALQVVQKSARC----FEVSF 689
Query: 675 DVKNVGSKDGAH-TLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSV 733
VKN G DG + L A P KQL FE+ H+ G +++V + + +
Sbjct: 690 KVKNTGKYDGEEVSQLYMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTFVLTEEDFF-L 748
Query: 734 VDRSGTRRIPLGEHNIHIGGTKHSVSLHAATL 765
V+ + + + G ++ IG + + L L
Sbjct: 749 VNYTLKKVVESGNFHLMIGAASNDIRLQNVIL 780
>gi|213852437|ref|ZP_03381969.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Typhi str. M223]
Length = 721
Score = 261 bits (668), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 204/669 (30%), Positives = 326/669 (48%), Gaps = 103/669 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN +GRV + Y GL W+P V++ RDPRWGR
Sbjct: 74 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 127
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V+ +QG S DR V S KHF AY +N VD
Sbjct: 128 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 182
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R EW G
Sbjct: 183 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 239
Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T+ PE+A A++AG+D+ + + ++ G ++ +
Sbjct: 240 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 297
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPK+ D H++ A E AR+ +V
Sbjct: 298 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 352
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + T+AV+GP +D ++G++ AG+A T L GI G A
Sbjct: 353 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 410
Query: 450 RTIHQQGC-----KDVAC--------------ADDQLFGAAIDASRQADATILVMGLDQS 490
+ ++ +G KD+ + + A+ A++QAD + V+G Q
Sbjct: 411 KILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 470
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 471 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 527
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
+ G GG AIAD+LFG NP GKLP++ +P+ + +P+ T P + +Y
Sbjct: 528 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 585
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F TV++ ++ P R G + A+
Sbjct: 586 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 628
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
V+V N G ++G + ++ A P KQL FEK+ + G ++ V
Sbjct: 629 ---------VEVTNTGKREGETVIQMYLQDVTASMSRPVKQLKGFEKITLKPGERKTVSF 679
Query: 723 -INIHVCKY 730
I+I K+
Sbjct: 680 PIDIEALKF 688
>gi|16761109|ref|NP_456726.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Typhi str. CT18]
gi|29141195|ref|NP_804537.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Typhi str. Ty2]
gi|213428232|ref|ZP_03360982.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Typhi str. E02-1180]
gi|25289432|pir||AD0778 beta-glucosidase (EC 3.2.1.21) - Salmonella enterica subsp.
enterica serovar Typhi (strain CT18)
gi|16503407|emb|CAD02546.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Typhi]
gi|29136821|gb|AAO68386.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Typhi str. Ty2]
Length = 765
Score = 261 bits (667), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 204/669 (30%), Positives = 326/669 (48%), Gaps = 103/669 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN +GRV + Y GL W+P V++ RDPRWGR
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V+ +QG S DR V S KHF AY +N VD
Sbjct: 172 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R EW G
Sbjct: 227 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 283
Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T+ PE+A A++AG+D+ + + ++ G ++ +
Sbjct: 284 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 341
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPK+ D H++ A E AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 396
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + T+AV+GP +D ++G++ AG+A T L GI G A
Sbjct: 397 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 454
Query: 450 RTIHQQGC-----KDVAC--------------ADDQLFGAAIDASRQADATILVMGLDQS 490
+ ++ +G KD+ + + A+ A++QAD + V+G Q
Sbjct: 455 KILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 514
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 515 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
+ G GG AIAD+LFG NP GKLP++ +P+ + +P+ T P + +Y
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 629
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F TV++ ++ P R G + A+
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 672
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
V+V N G ++G + ++ A P KQL FEK+ + G ++ V
Sbjct: 673 ---------VEVTNTGKREGETVIQMYLQDVTASMSRPVKQLKGFEKITLKPGERKTVSF 723
Query: 723 -INIHVCKY 730
I+I K+
Sbjct: 724 PIDIEALKF 732
>gi|289805550|ref|ZP_06536179.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Typhi str. AG3]
Length = 657
Score = 261 bits (667), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 204/669 (30%), Positives = 326/669 (48%), Gaps = 103/669 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN +GRV + Y GL W+P V++ RDPRWGR
Sbjct: 10 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 63
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V+ +QG S DR V S KHF AY +N VD
Sbjct: 64 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 118
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R EW G
Sbjct: 119 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 175
Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T+ PE+A A++AG+D+ + + ++ G ++ +
Sbjct: 176 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 233
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPK+ D H++ A E AR+ +V
Sbjct: 234 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 288
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + T+AV+GP +D ++G++ AG+A T L GI G A
Sbjct: 289 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 346
Query: 450 RTIHQQGC-----KDVAC--------------ADDQLFGAAIDASRQADATILVMGLDQS 490
+ ++ +G KD+ + + A+ A++QAD + V+G Q
Sbjct: 347 KILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 406
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 407 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 463
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
+ G GG AIAD+LFG NP GKLP++ +P+ + +P+ T P + +Y
Sbjct: 464 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 521
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F TV++ ++ P R G + A+
Sbjct: 522 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 564
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
V+V N G ++G + ++ A P KQL FEK+ + G ++ V
Sbjct: 565 ---------VEVTNTGKREGETVIQMYLQDVTASMSRPVKQLKGFEKITLKPGERKTVSF 615
Query: 723 -INIHVCKY 730
I+I K+
Sbjct: 616 PIDIEALKF 624
>gi|299146513|ref|ZP_07039581.1| beta-glucosidase [Bacteroides sp. 3_1_23]
gi|298517004|gb|EFI40885.1| beta-glucosidase [Bacteroides sp. 3_1_23]
Length = 736
Score = 261 bits (667), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 208/699 (29%), Positives = 324/699 (46%), Gaps = 108/699 (15%)
Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
G T FP I A++++ L + +G+V++ E R+ G + P +++ RDPRW R
Sbjct: 102 GTTVFPTGIGMAATWSPELVKEVGQVIAKEIRSQ-----GGHISYGPVLDLTRDPRWSRV 156
Query: 169 QETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
+ET GEDPVLSG AS V GL G + + A+ KHF AY + + A V
Sbjct: 157 EETFGEDPVLSGILGASMVDGLGGGNLSQKYATIATLKHFLAYAVPEGGQNGNY---ASV 213
Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
+D+ F PFR + G + SVM SYN ++G+P ++ +L + +R EW+ G++VS
Sbjct: 214 GIRDLHQNFLPPFRKAIDAGAL-SVMTSYNSIDGIPCTSNHYLLTKLLRNEWKFRGFVVS 272
Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
D S+ +++ T E AA ++ AG+D+D G + AVQ G + + I+ A+
Sbjct: 273 DLYSIEGIHESHFVAPTKENAAIQSVMAGVDVDLGGDAYTNLCHAVQSGQMDKTVIDTAV 332
Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
L ++ +G+F+ P P + K V +H ELA + A+ I LLKN+ LPLS
Sbjct: 333 CRVLRMKFEMGLFE-HPYVDP--KIAAKTVRRKEHIELARKIAQSSITLLKNENSILPLS 389
Query: 408 HIRHRTVAVIGPNSDVTVTMIGNYAGIA--CGYTTPLQGIGRY---ARTIHQQGC--KDV 460
+ ++ VAVIGPN+D M+G+Y T L GI +R + +GC +D
Sbjct: 390 KMINK-VAVIGPNADNRYNMLGDYTAPQEDSNVKTVLDGIITKLSPSRVEYVRGCAIRDT 448
Query: 461 ACADDQLFGAAIDASRQAD----------------------ATILVMGLDQSIE-AEALD 497
+ + AI+A+R+++ A + G +E E D
Sbjct: 449 TVNEIE---QAIEAARRSEVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFD 505
Query: 498 RAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGG 557
RA L L GRQQEL+ + K P I+V + G P++ +A A++ A YPGQ GG
Sbjct: 506 RASLSLLGRQQELLESLQKTGK-PLIVVYIEGRPLEKNWASE--YADALLTAYYPGQEGG 562
Query: 558 TAIADILFGTSNPGGKLPMT----------WYPQEYITNLPMTEMAMRPSQSKRYPGRTY 607
AIAD+LFG NP G+LP++ +Y Q+ N E++ P
Sbjct: 563 NAIADVLFGDYNPSGRLPISVPRSVGQIPVYYNQKAPRNHDYVEVSSSP----------- 611
Query: 608 RFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNR 667
+Y FG+GMSYT F + S + A+C
Sbjct: 612 -------LYSFGYGMSYTTFEY------------------------SDLQVVQKSARC-- 638
Query: 668 LTLGVQVDVKNVGSKDGAH-TLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIH 726
V VKN G DG + L A P KQL FE+ H+ G +++V +
Sbjct: 639 --FEVSFKVKNTGKYDGEEVSQLYMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTFVLT 696
Query: 727 VCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATL 765
+ +V+ + + + G ++ IG + + L L
Sbjct: 697 EEDFF-LVNYTLKKVVESGNFHLMIGAASNDIRLQNVIL 734
>gi|392979938|ref|YP_006478526.1| beta-D-glucoside glucohydrolase [Enterobacter cloacae subsp.
dissolvens SDM]
gi|392325871|gb|AFM60824.1| beta-D-glucoside glucohydrolase [Enterobacter cloacae subsp.
dissolvens SDM]
Length = 765
Score = 261 bits (667), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 210/689 (30%), Positives = 335/689 (48%), Gaps = 102/689 (14%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN + +GRV + Y GL W+P V++ RDPRWGR
Sbjct: 118 TVFPISLGLASSFNLDAVKTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V +QG S DR V S KHF AY +N VD
Sbjct: 172 EGFGEDTYLTATMGKTMVEAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R +W G
Sbjct: 227 --MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDQWGFKGI 283
Query: 285 IVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T S PE+A A+++G+++ + + V+ G ++ +
Sbjct: 284 TVSDHGAIKEL--IKHGTASDPEDAVRVALKSGINMSMSDEYYSKYLPGLVKSGKVTMAE 341
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPK+ D H++ A E AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPADTNAESRLHRKEAREVARESLV 396
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + T+AV+GP +D ++G++ AG+A T L GI G A
Sbjct: 397 LLKNRLDTLPLK--KSGTIAVVGPLADSKRDVMGSWSAAGVADQSVTVLTGIKSAVGENA 454
Query: 450 RTIHQQGC-----KDVACADDQ--------------LFGAAIDASRQADATILVMGLDQS 490
+ I+ +G KD+ +Q + A++A++Q+D + V+G Q
Sbjct: 455 KVIYAKGANVTDDKDIVTFLNQYEEAVKVDPRTPKEMIDEAVNAAKQSDVVVAVVGEAQG 514
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 515 MAHEASSRTDITIPQSQRDLIAALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM---RP---SQSKRYPG 604
+ G GG AIAD+LFG NP GKLPM+ +P+ + +P+ + RP + +Y
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPMS-FPRS-VGQIPVYYSHLNTGRPYNADKPNKYTS 629
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F V++ ++ P R G + A+
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--KVSDVK--MSAPSLKRDGKVTAS------------ 672
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGI 723
V+V N G ++GA L ++ A P KQL FEKV++ G + +
Sbjct: 673 ---------VEVTNTGKREGATVLQMYVQDVTASMSRPVKQLRGFEKVNLKPGETKTISF 723
Query: 724 NIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
I V L ++ G+ N+ IG
Sbjct: 724 PIDV-DALKFWNQQMKYDAEPGKFNVFIG 751
>gi|56412920|ref|YP_149995.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Paratyphi A str. ATCC 9150]
gi|197361851|ref|YP_002141487.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Paratyphi A str. AKU_12601]
gi|56127177|gb|AAV76683.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Paratyphi A str. ATCC 9150]
gi|197093327|emb|CAR58775.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Paratyphi A str. AKU_12601]
Length = 768
Score = 261 bits (667), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 205/671 (30%), Positives = 324/671 (48%), Gaps = 104/671 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN +GRV + Y GL W+P V++ RDPRWGR
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V+ +QG S DR V S KHF AY +N VD
Sbjct: 172 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226
Query: 225 AKVSKQDIEDTFDVPFR--MCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLN 282
+S Q + + + P++ + G V + S N +NG P +D +LK +R EW
Sbjct: 227 --MSSQRLFNDYMPPYKAGLDAGSGAVMVALNSLNSLNGTPATSDSWLLKDVLRDEWGFK 284
Query: 283 GYIVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSE 340
G VSD ++ +H T+ PE+A A++AG+D+ + + ++ G ++
Sbjct: 285 GITVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTM 342
Query: 341 IDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQG 393
++++A + L V+ +G+F+ PY HLGPK+ D H++ A E AR+
Sbjct: 343 AELDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARES 397
Query: 394 IVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GR 447
+VLLKN+ +LPL + T+AV+GP +D ++G++ AG+A T L GI G
Sbjct: 398 VVLLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGD 455
Query: 448 YARTIHQQGCK---DVACAD----------------DQLFGAAIDASRQADATILVMGLD 488
A+ ++ +G D D + A+ A++QAD + V+G
Sbjct: 456 GAKILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGES 515
Query: 489 QSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIW 548
Q + EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 516 QGMAHEASSRTDITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILE 572
Query: 549 AGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRY 602
+ G GG AIAD+LFG NP GKLP++ +P+ + +P+ T P + +Y
Sbjct: 573 TWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKY 630
Query: 603 PGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTH 662
R + GP +YPFG+G+SYT F TV++ ++ P R G + A+
Sbjct: 631 TSRYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS---------- 675
Query: 663 AKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRV 721
V+V N G ++GA + ++ A P KQL FEK+ + G ++ V
Sbjct: 676 -----------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGERKTV 724
Query: 722 G--INIHVCKY 730
I+I K+
Sbjct: 725 SFPIDIEALKF 735
>gi|237721771|ref|ZP_04552252.1| glycoside hydrolase [Bacteroides sp. 2_2_4]
gi|229448640|gb|EEO54431.1| glycoside hydrolase [Bacteroides sp. 2_2_4]
Length = 735
Score = 261 bits (666), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 227/773 (29%), Positives = 355/773 (45%), Gaps = 101/773 (13%)
Query: 29 ACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLI--------------SGAA 74
A D KD P PI +RVNDL+ R++L+EK+ L
Sbjct: 21 AKDKKDVALYKDP----KAPIEKRVNDLLSRMTLEEKMMQLNQYTLGRNNNVNNVGEEVK 76
Query: 75 AVP-RLGIKGYEWWSEALHGV--------SNVGPGTKFGGDFPGA--TSFPQVITTASSF 123
VP +G Y + AL S +G FG D T +P + A S+
Sbjct: 77 KVPAEIGSLIYFETNPALRNSMQKKAMEESRLGIPIIFGYDAIHGFRTVYPISLAQACSW 136
Query: 124 NATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQETPGEDPVLSGKY 182
N L E V + EAR +G+ + +SP +++ RDPRWGR E GEDP +G +
Sbjct: 137 NPDLVEQACAVSAQEAR------MSGVDWTFSPMIDVARDPRWGRVAEGYGEDPYTNGVF 190
Query: 183 AASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFR 241
A+ V+G QG D ++AA KH+ Y R + ++SKQ + DT+ +P+
Sbjct: 191 GAASVKGYQGDDLSAENRMAACLKHYVGYGASE---AGRDYVYTEISKQTLWDTYLLPYE 247
Query: 242 MCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHF 301
M V G A++M S+N ++GVP A+P I+ ++ W +G+IVSD ++ Q
Sbjct: 248 MGVKAG-AATLMSSFNDISGVPGSANPYIMTEILKKRWGHDGFIVSDWGAIE-QLKNQGL 305
Query: 302 TSTPEEAAADAIRAGLDLDCGPF-LGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMF 360
+T +EAA A AGL++D H + V+ G +S ++ A+ L ++ RLG+F
Sbjct: 306 AATKKEAAWHAFTAGLEMDMMSHAYDRHLQELVEEGRVSVAQVDEAVRRVLLLKFRLGLF 365
Query: 361 DGEPSSQPY--GHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIG 418
+ +PY + P ++A A + +VLLKN+ +LPL+ + +AVIG
Sbjct: 366 E-----RPYTPATSEKERFFRPQSMDIAARLAAESMVLLKNENKTLPLTD--KKKIAVIG 418
Query: 419 PNSDVTVTMIGNYAG------IACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAI 472
P + ++G++ G +A Y A + GC + + F A+
Sbjct: 419 PMAKNGWDLLGSWCGHGKDTDVAMLYNGLATEFAGKAELRYAAGCA-TKGDNKEGFAEAL 477
Query: 473 DASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPI 532
+A+R +D +L +G + E R+ + LP Q+EL +++ A K P +LVL++G P+
Sbjct: 478 EAARWSDVVVLCLGEMMTWSGENASRSSIALPQIQEELAAELKKAGK-PIVLVLVNGRPL 536
Query: 533 DVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEM 592
++ + IW PG G +A IL G NP GKL MT P +
Sbjct: 537 ELNRLELISDAILEIWQ--PGVNGALPMAGILSGRINPSGKLAMT---------FPYSTG 585
Query: 593 AMRPSQSKRYPGRTYR-FYKGPV---VYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGS 648
+ ++R GR ++ FYK +YPFGHG+SYT F + P+V V R
Sbjct: 586 QIPIYYNRRKSGRGHQGFYKDITSDPLYPFGHGLSYTEFKYGTVT-PSVTKVKRGDR--- 641
Query: 649 INATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVA 707
L V+V V NVG++DGA T+ F S P P K+L
Sbjct: 642 ---------------------LSVEVTVTNVGARDGAETVHWFISDPYCSITRPVKELKH 680
Query: 708 FEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
FEK + AG + +I + + V+ G R + GE++I + G + L
Sbjct: 681 FEKQLIRAGETKTFRFDIDLERDFGFVNEDGKRFLEAGEYHILVQGQTVKIEL 733
>gi|205353304|ref|YP_002227105.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
serovar Gallinarum str. 287/91]
gi|375124143|ref|ZP_09769307.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Gallinarum str. SG9]
gi|445128447|ref|ZP_21380240.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Gallinarum str. 9184]
gi|205273085|emb|CAR38038.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Gallinarum str. 287/91]
gi|326628393|gb|EGE34736.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Gallinarum str. SG9]
gi|444854909|gb|ELX79964.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
enterica serovar Gallinarum str. 9184]
Length = 765
Score = 261 bits (666), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 202/669 (30%), Positives = 325/669 (48%), Gaps = 103/669 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN +GRV + Y GL W+P V++ RDPRWGR
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V+ +QG S DR V S KHF AY +N VD
Sbjct: 172 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R EW G
Sbjct: 227 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 283
Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T+ PE+A A++AG+D+ + + ++ G ++ +
Sbjct: 284 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 341
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPK+ D H++ A E AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 396
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGIGR------ 447
LLKN+ +LPL + T+AV+GP +D ++G++ AG+A T L GI
Sbjct: 397 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVDDGA 454
Query: 448 ---YAR--------------TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQS 490
YA+ ++++ K + + A+ A++QA+ + V+G Q
Sbjct: 455 KILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQANVVVAVVGESQG 514
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 515 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
+ G GG AIAD+LFG NP GKLP++ +P+ + +P+ T P + +Y
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 629
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F TV++ ++ P R G + A+
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 672
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
V+V N G ++GA + ++ A P KQL FEK+ + G ++ V
Sbjct: 673 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGERKTVSF 723
Query: 723 -INIHVCKY 730
I+I K+
Sbjct: 724 PIDIEALKF 732
>gi|427384392|ref|ZP_18880897.1| hypothetical protein HMPREF9447_01930 [Bacteroides oleiciplenus YIT
12058]
gi|425727653|gb|EKU90512.1| hypothetical protein HMPREF9447_01930 [Bacteroides oleiciplenus YIT
12058]
Length = 954
Score = 261 bits (666), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 217/736 (29%), Positives = 353/736 (47%), Gaps = 91/736 (12%)
Query: 40 LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISG--AAAVPRLGIKGYEWWSEALHGVSNV 97
L + +LP+ +RV L+ ++ ++K++L+ G +P L + EA+HG S
Sbjct: 168 LRYMDPTLPVEERVESLLSVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFS-- 224
Query: 98 GPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNV 157
+G GAT FPQ + +++N L E I V DE A GT + WSP +
Sbjct: 225 -----YGS---GATIFPQALAMGATWNKKLTEEIAMAVGDETLA---AGT--MQAWSPVL 271
Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
++ +D RWGR +ET GEDPVL + ++++G Q S G + + KHF + G
Sbjct: 272 DVAQDARWGRCEETFGEDPVLVSQIGGAWIKGYQ-SKG----LFTTPKHFGGHGAP-LGG 325
Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
D +S++++ + VPFR + S+M +Y+ GVP +L +R
Sbjct: 326 RDSHDIG--LSEREMREVHLVPFRHVIRNYDCQSLMMAYSDFLGVPVAKSKELLHNILRE 383
Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPE-EAAADAIRAGLDLDCG-PFLGLHTESAVQR 335
EW +G+IVSDC ++G +H+T+ + EAA A+ AG+ +CG + A +
Sbjct: 384 EWGFDGFIVSDCGAIGNLTARKHYTAKDKIEAANQALAAGIATNCGDTYNDKEVIQAAKD 443
Query: 336 GLLSEIDINNALVNTLTVQMRLGMFDGEPSS-QPYGHLGPKDVCTPDHQELALEAARQGI 394
G L+ +++N L + R +F+ P+ + + P + +H+E+A +AAR+ I
Sbjct: 444 GRLNMENLDNVCRTMLRMMFRNELFEKAPNKPLDWNKIYP-GWNSDNHKEMARQAARESI 502
Query: 395 VLLKNQGPSLPLSHIRHRTVAVIGPNSD-------VTVTMIGNYAGIACGYTTPLQGIGR 447
V+L+N+ LPL R++AV+GP +D + G + G Q +G+
Sbjct: 503 VMLENKENILPLDK-GIRSIAVLGPGADDLQPGDYTPKLLPGQLKSVLTGIK---QAVGK 558
Query: 448 YARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEA---------EALDR 498
+ I++QGC D + A+ A+ Q+D ++V+G + EA E D
Sbjct: 559 QTKVIYEQGC-DFTNLSETNIPKAVKAASQSDVVVMVLGDCSTSEATTDVYKTSGENHDY 617
Query: 499 AGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGT 558
A L+LPG+QQEL+ V K P ILVL +G P ++ K AII PGQ GG
Sbjct: 618 ATLILPGKQQELLEAVCATGK-PVILVLQAGRPYNL--TKASKLCKAIIVNWLPGQEGGP 674
Query: 559 AIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPF 618
A AD+LFG NP G+LPMT +PQ ++ LP+ + +RY +Y P+ Y F
Sbjct: 675 ATADVLFGDYNPAGRLPMT-FPQ-HVGQLPL--YYNFKTSGRRYEYSDLEYY--PLYY-F 727
Query: 619 GHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKN 678
G+G+SYT+F ++ +KV +T VQ VKN
Sbjct: 728 GYGLSYTSFEYS--------------------------GLKVQEKDNGNIT--VQATVKN 759
Query: 679 VGSKDGAHTLLVFSTPPAGHWAPH-KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRS 737
VG + G + ++ T +L F ++++ G + V + LS+++
Sbjct: 760 VGQRAGDEVVQLYVTDMYASVKTRITELKDFTRINLKPGESKTVSFELTPYD-LSLLNDH 818
Query: 738 GTRRIPLGEHNIHIGG 753
R + GE I +GG
Sbjct: 819 MDRVVEKGEFKILVGG 834
>gi|419958177|ref|ZP_14474242.1| beta-D-glucoside glucohydrolase [Enterobacter cloacae subsp.
cloacae GS1]
gi|388606860|gb|EIM36065.1| beta-D-glucoside glucohydrolase [Enterobacter cloacae subsp.
cloacae GS1]
Length = 765
Score = 261 bits (666), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 213/691 (30%), Positives = 336/691 (48%), Gaps = 106/691 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN + +GRV + Y GL W+P V++ RDPRWGR
Sbjct: 118 TVFPISLGLASSFNLDAVKTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V +QG S DR V S KHF AY +N VD
Sbjct: 172 EGFGEDTYLTATMGKTMVEAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R +W G
Sbjct: 227 --MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDQWGFKGI 283
Query: 285 IVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T S PE+A A+++G+++ + + V+ G ++ +
Sbjct: 284 TVSDHGAIKEL--IKHGTASDPEDAVRVALKSGINMSMSDEYYSKYLPGLVKSGKVTMAE 341
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPKD D H++ A E AR+ +V
Sbjct: 342 LDDAARHVLNVKYDMGLFN-----DPYSHLGPKDSDPADTNAESRLHRKEAREVARESLV 396
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + T+AV+GP +D ++G++ AG+A T L GI G A
Sbjct: 397 LLKNRLDTLPLK--KSGTIAVVGPLADSKRDVMGSWSAAGVADQSVTVLTGIKSAVGDNA 454
Query: 450 RTIHQQGC-----KDVACADDQ--------------LFGAAIDASRQADATILVMGLDQS 490
+ ++ +G KD+ +Q + A++A++Q+D + V+G Q
Sbjct: 455 KVVYAKGANVTDDKDIVTFLNQYEEAVKVDARTPKEMLDEAVNAAKQSDVVVAVVGEAQG 514
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 515 MAHEASSRTDITIPQSQRDLIAALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM---RP---SQSKRYPG 604
+ G GG AIAD+LFG NP GKLPM+ +P+ + +P+ + RP + +Y
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPMS-FPRS-VGQIPVYYSHLNTGRPYNADKPNKYTS 629
Query: 605 RTYRFYKGPVVYPFGHGMSYTNF-VHTVA-NAPTVVAVPLDGRHGSINATISGKAIKVTH 662
R + GP +YPFG+G+SYT F V V +APT+ R G + A+
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTFKVSDVKMSAPTL------KRDGKVTAS---------- 672
Query: 663 AKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRV 721
V+V N G ++GA + ++ A P KQL FEKV++ G + V
Sbjct: 673 -----------VEVTNSGKREGATVIQMYVQDVTASMSRPVKQLRGFEKVNLKPGETRTV 721
Query: 722 GINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
I V L ++ G+ N+ IG
Sbjct: 722 SFPIDVNA-LKFWNQQMKYDAEPGKFNVFIG 751
>gi|255545660|ref|XP_002513890.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
gi|223546976|gb|EEF48473.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
Length = 336
Score = 261 bits (666), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 139/325 (42%), Positives = 200/325 (61%), Gaps = 13/325 (4%)
Query: 425 VTMIGNYAGIACGYTTPLQGIGRYAR-TIHQQGCKDVACADDQLFGAAIDASRQADATIL 483
+T++GNYAG C TPLQG+ Y + T + +GC VAC+ + A+ +R+ D +L
Sbjct: 1 MTLVGNYAGPPCKTVTPLQGLQNYIKNTKYHRGCNTVACSKATI-REAVQIAREVDQVVL 59
Query: 484 VMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRI 543
VM LDQ+ EAE +D L LPG QQ+L+ V+ A+ P +LVL+ GG +DV+FA +P+I
Sbjct: 60 VMELDQTQEAERIDLLNLRLPGNQQKLIISVARAANKPVVLVLICGGLVDVSFAITEPKI 119
Query: 544 AAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYP 603
+I+WAGYPG+AGGTA+A+I+FG NPGGKLP+TWYPQ+Y T +PMT++ MRP + YP
Sbjct: 120 GSILWAGYPGEAGGTALAEIIFGDHNPGGKLPVTWYPQQY-TKIPMTDVRMRPQIASGYP 178
Query: 604 GRTYRFYKGPVVYPFGHGMSYTNFVHTVANAP---TVVAVP--LDGRHGSINATISGKAI 658
GR+YRFY+G V+ FG+G+SY+N + +A+ P + P + G S +S
Sbjct: 179 GRSYRFYEGKKVFEFGYGLSYSNCSYEIASIPQDKIFLRSPSSIKGVKTSSYTLVS---- 234
Query: 659 KVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGA 717
++ C R V V VKN G G H +LVF P G P K+LVAF+ V + AG
Sbjct: 235 ELGKELCERSKFSVTVKVKNEGKIIGKHPVLVFLRQPKPGSGRPVKKLVAFQTVRLNAGQ 294
Query: 718 QQRVGINIHVCKYLSVVDRSGTRRI 742
+ + C+ L+ + G+ I
Sbjct: 295 NAEIQRKLSPCEPLTRANEDGSMVI 319
>gi|423279982|ref|ZP_17258895.1| hypothetical protein HMPREF1203_03112 [Bacteroides fragilis HMW
610]
gi|404584318|gb|EKA88983.1| hypothetical protein HMPREF1203_03112 [Bacteroides fragilis HMW
610]
Length = 812
Score = 260 bits (665), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 216/687 (31%), Positives = 327/687 (47%), Gaps = 91/687 (13%)
Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
G T FP I AS++N L +GRV++ EA A G G + P +++ RDPRW R
Sbjct: 166 GTTVFPTSIGQASTWNPELIRQMGRVIAIEASA--QGAHIG---YGPVLDLARDPRWSRV 220
Query: 169 QETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
+ET GEDP L+G A+ VRG QG R V A+ KHF +Y W A +
Sbjct: 221 EETYGEDPYLNGVMGAALVRGFQGDTLRGRKSVIATLKHFASY---GWTEGGHNGGTAHL 277
Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
++++E+ PFR V G + SVM SYN+++G P +L + W G++VS
Sbjct: 278 GERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLLTDILEDRWLFKGFVVS 336
Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE---SAVQRGLLSEIDIN 344
D ++G + + EAA A+ AG+D D G ++ E +AV++G ++ ++
Sbjct: 337 DLYAIGGLRE-HGVAGSDYEAAVKAVNAGVDSDLGT--NVYAEQLVAAVRKGDVAMETVD 393
Query: 345 NALVNTLTVQMRLGMFDGE--PSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGP 402
A+ L+++ +G+FD +P + V +P+H LA E ARQ IVLLKN+
Sbjct: 394 KAVRRILSLKFHMGLFDAPFVDDKRP-----AQLVASPEHIGLAREVARQSIVLLKNEDK 448
Query: 403 SLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIAC--GYTTPLQGI----GRYARTIHQQG 456
LPL RT+AVIGPN+D M+G+Y T L+GI + R + +G
Sbjct: 449 LLPLKK-DIRTLAVIGPNADNGYNMLGDYTAPQADGSVVTVLEGIRQKVSKDTRVFYAKG 507
Query: 457 CKDVACADDQLFGAAIDASRQADATILVMG----LDQSIE-------------------A 493
C V + F AI+++R AD ++V+G D S E
Sbjct: 508 CA-VRDSSRTGFADAIESARSADVVVMVVGGSSARDFSSEYEETGAAKVSANRVSDMESG 566
Query: 494 EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPG 553
E DRA L L GRQ EL+ +V K P +LVL+ G P+ + + AI+ A YPG
Sbjct: 567 EGYDRATLHLMGRQLELLEEVRKLGK-PMVLVLIKGRPLLMEGVIQEA--DAILDAWYPG 623
Query: 554 QAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGP 613
GG A+AD+LFG NP G+L ++ P+ + LP+ R RY G
Sbjct: 624 MQGGNAVADVLFGDYNPAGRLTLS-VPRS-VGQLPVYYNTKRKGNRSRYIEEA-----GT 676
Query: 614 VVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQ 673
YPFG+G+SYT F +T G ++V+ + N + V
Sbjct: 677 PRYPFGYGLSYTTFSYT------------------------GMKVRVSE-ESNHCRVDVS 711
Query: 674 VDVKNVGSKDGAHTLLVFSTPPAGHW-APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLS 732
V V+N G+ DG + ++ G + P +QL AF +V + AG + + K L+
Sbjct: 712 VTVRNQGTVDGDEVVQLYLRDEVGSFTTPDRQLRAFRRVRLKAGETWEITFTLD-KKSLA 770
Query: 733 VVDRSGTRRIPLGEHNIHIGGTKHSVS 759
+ R G + G + GG+ ++
Sbjct: 771 LYMRDGEWAVEPGRFTVMAGGSSEDIA 797
>gi|324497317|gb|ADY39467.1| putative periplasmic beta-glucosidase precursor [bacterium
enrichment culture clone P69-9E]
Length = 765
Score = 260 bits (665), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 212/691 (30%), Positives = 337/691 (48%), Gaps = 106/691 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN + +GRV + Y GL W+P V++ RDPRWGRG
Sbjct: 118 TVFPISLGLASSFNLDAVKTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRGS 171
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V +QG S DR V S KHF AY +N VD
Sbjct: 172 EGFGEDTYLTSTMGKTMVEAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R +W G
Sbjct: 227 --MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDQWGFKGI 283
Query: 285 IVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T S PE+A A+++G+++ + + V+ G ++ +
Sbjct: 284 TVSDHGAIKEL--IKHGTASDPEDAVRVALKSGINMSMSDEYYSKYLPGLVKSGKVTMAE 341
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPKD D H++ A + AR+ +V
Sbjct: 342 LDDAARHVLNVKYDMGLFN-----DPYSHLGPKDSDPADTNAESRLHRKEARDVARESLV 396
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + T+AV+GP +D ++G++ AG+A T L GI G A
Sbjct: 397 LLKNRLDTLPLK--KSGTIAVVGPLADSKRDVMGSWSAAGVADQSVTVLTGIKSAVGDNA 454
Query: 450 RTIHQQGC-----KDVACADDQ--------------LFGAAIDASRQADATILVMGLDQS 490
+ ++ +G KD+ +Q + A++ ++Q+D + V+G Q
Sbjct: 455 KVVYAKGANVTNDKDIVTFLNQYEEAVKVDPRTPKEMIDEAVNTAKQSDVVVAVVGEAQG 514
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 515 MAHEASSRTDITIPQSQRDLIAALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM---RP---SQSKRYPG 604
+ G GG AIAD+LFG NP GKLPM+ +P+ + +P+ + RP + +Y
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPMS-FPRS-VGQIPVYYSHLNTGRPYNADKPNKYTS 629
Query: 605 RTYRFYKGPVVYPFGHGMSYTNF-VHTVA-NAPTVVAVPLDGRHGSINATISGKAIKVTH 662
R + GP +YPFG+G+SYT F V V +APT+ R G + A+
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTFKVSDVKMSAPTL------KRDGKVIAS---------- 672
Query: 663 AKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRV 721
V+V N G ++GA + ++ A P KQL FEKV++ G Q +
Sbjct: 673 -----------VEVTNTGKREGATVIQMYVQDVTASMSRPVKQLRGFEKVNLKPGETQTI 721
Query: 722 GINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
I V + L ++ G+ N+ IG
Sbjct: 722 SFPIDV-EALKFWNQQMKYDAEPGKFNVFIG 751
>gi|393719789|ref|ZP_10339716.1| glycosyl hydrolase family protein [Sphingomonas echinoides ATCC
14820]
Length = 896
Score = 260 bits (665), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 168/451 (37%), Positives = 246/451 (54%), Gaps = 47/451 (10%)
Query: 1 MASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
MAST++F L+L+SAS + +A++ A D A ++ + + L+ ++
Sbjct: 1 MASTVSFASRALLLVSASVATIASQ---AQDLHGAAQPSVARAEAD----RLAATLVAQM 53
Query: 61 SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
+ +EK+ L++ A A+PRL I Y WW+E+LHG +GP P T+FP+ I A
Sbjct: 54 TTEEKLPQLLNVAPAIPRLKIPAYNWWTESLHGA--LGP-------IP-TTNFPEPIGLA 103
Query: 121 SSFNATLWEAIGRVVSDEARAMYNGGT---------AGLTYWSPNVNIFRDPRWGRGQET 171
+SF+A L + +S E RA++ G GL WSPN+NIFRDPRWGRGQET
Sbjct: 104 ASFDAPLVHQVADAISTEVRALHTQGRQTGKLGRIGTGLDTWSPNINIFRDPRWGRGQET 163
Query: 172 PGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQD 231
GEDP L+ + ++V G+QG + D +V ++ KHF A+ R N VS D
Sbjct: 164 YGEDPYLAARMGVAFVTGMQGPNPDLPRVISTPKHFAAHSGPEST---RHAANVYVSPHD 220
Query: 232 IEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDS 291
+EDT+ FR ++EG+ S+MC+YN++ G P CA +LK +RG W GY+VSDCD+
Sbjct: 221 LEDTYLPAFRAAIVEGRAGSIMCAYNRIEGQPACASDLLLKDHLRGAWGFTGYVVSDCDA 280
Query: 292 VGVYYDTQHFTSTPEEAAADAIRAGLDLDC-----GPFLGLHTE--SAVQRGLLSEIDIN 344
V D + A A A++AG+D +C G GL A++R L+S+ DI+
Sbjct: 281 VKDIADNHKYAPDQATAVAAAMKAGVDNECNTQTIGDIGGLPDRFGDALKRNLISQGDID 340
Query: 345 NALVNTLTVQMRLGMFD---GEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQG 401
LV + ++R G G P+ P + TPDH LAL+AA + +VLLKN G
Sbjct: 341 RTLVRLFSARLRNGDLPGVAGAPAVVPVSA-----ILTPDHIALALDAAEKSLVLLKNDG 395
Query: 402 PSLPLSHIRHRTVAVIGPNSDVTVTMIGNYA 432
LPL +AV+GP D T + GNY+
Sbjct: 396 -VLPLRP--GARIAVVGPLGDATRVLRGNYS 423
Score = 119 bits (297), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 85/295 (28%), Positives = 136/295 (46%), Gaps = 48/295 (16%)
Query: 486 GLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
GL +EAE D+ L LP QQ L+ K P I+V M+G PI++A
Sbjct: 633 GLTSDMEAEETGTDIPGFKGGDKTSLDLPAEQQALLEHAKATGK-PLIVVAMNGSPINLA 691
Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
+AK++ AAI+ A YPGQ+GG A+A++L G +NP G+LP+T+Y + +LP
Sbjct: 692 WAKDN--AAAIVEAWYPGQSGGLAVANVLTGKTNPAGRLPLTFY--RSVADLP------- 740
Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG 655
P GRTYR++ G VYPFG+G+S+T F + AP V G + T
Sbjct: 741 PFDDYAMTGRTYRYFTGTPVYPFGYGLSFTRFAY----APLTVTPAAGGAETGLRVT--- 793
Query: 656 KAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPA 715
+V N G + G ++ P P L ++++ +
Sbjct: 794 ------------------TEVSNTGQRSGDEVAQLYLNFPDQPGTPRIALRGYQRITLKP 835
Query: 716 GAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIKS 770
G + + ++ + LS V GTR++ G++ + +G + + +T S
Sbjct: 836 GEHRVISFDLS-PRDLSAVAVDGTRQVMAGQYRVSVGSGQPDTGVPTSTASFTTS 889
>gi|339999930|ref|YP_004730813.1| periplasmic beta-glucosidase [Salmonella bongori NCTC 12419]
gi|339513291|emb|CCC31041.1| periplasmic beta-glucosidase precursor [Salmonella bongori NCTC
12419]
Length = 765
Score = 260 bits (665), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 207/691 (29%), Positives = 335/691 (48%), Gaps = 106/691 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + AS+FN +GRV + Y GL W+P V++ RDPRWGR
Sbjct: 118 TVFPISLGLASTFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V+ +QG S DR V S KHF AY +N VD
Sbjct: 172 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R +W G
Sbjct: 227 --MSPQRLFNDYMPPYKAGLDAGS-GAVMIALNSLNGTPATSDAWLLKDVLRHQWGFKGI 283
Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T+T PE+A A+++G+D+ + + ++ G ++ +
Sbjct: 284 TVSDHGAIKEL--IKHGTATDPEDAVRVALKSGVDMSMADEYYSKYLPGLIKSGKVTMAE 341
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPK+ D H++ A E AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESLV 396
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPLS ++ T+AV+GP +D ++G++ AG+A T L GI G
Sbjct: 397 LLKNRLNTLPLS--KNATIAVVGPLADSKRDIMGSWSAAGVADQSVTVLTGIKNALGERG 454
Query: 450 RTIHQQGC---------------KDVACADDQ----LFGAAIDASRQADATILVMGLDQS 490
+ ++ +G ++V D + + A+ A++QAD + V+G Q
Sbjct: 455 KVLYAKGANITNDKDIVDFLNLYEEVVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 514
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 515 MAHEASSRTDITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
+ G GG AIAD+LFG NP GKLPM+ +P+ + +P+ T P + +Y
Sbjct: 572 FAGTEGGNAIADVLFGDDNPSGKLPMS-FPRS-VGQIPVYYSHLNTGRPYNPEKPDKYTS 629
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SY F + ++ P R G + A+
Sbjct: 630 RYFDEANGP-LYPFGYGLSYNTF----KVSDVTLSAPTMKRDGKVTAS------------ 672
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
V+V N G ++GA + ++ A P KQL FEK+ + G + V
Sbjct: 673 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKIILKPGETKTVSF 723
Query: 723 -INIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
I+I+ K+ + R P G+ N+ IG
Sbjct: 724 PIDINALKFWN--QRMKYDAEP-GKFNVFIG 751
>gi|255689965|ref|ZP_05413640.1| beta-glucosidase [Bacteroides finegoldii DSM 17565]
gi|260624572|gb|EEX47443.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
finegoldii DSM 17565]
Length = 688
Score = 260 bits (665), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 198/657 (30%), Positives = 319/657 (48%), Gaps = 72/657 (10%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T +P + A S+N L E V + EAR +G+ + +SP +++ RDPRWGR
Sbjct: 77 TVYPISLAQACSWNPDLVEQACAVSAQEAR------MSGVDWTFSPMIDVARDPRWGRVA 130
Query: 170 ETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVS 228
E GEDP +G + A+ VRG QG + +VAA KH+ Y R + ++S
Sbjct: 131 EGYGEDPYANGVFGAASVRGYQGDNMSAENRVAACLKHYVGYGASE---AGRDYVYTEIS 187
Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
+Q + DT+ +P+ M V G A++M S+N ++GVP A+P + ++ WR +G+IVSD
Sbjct: 188 QQTLWDTYLLPYEMGVKAG-AATLMSSFNDISGVPGSANPYTMTEILKNRWRHDGFIVSD 246
Query: 289 CDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPF-LGLHTESAVQRGLLSEIDINNAL 347
++ Q +T +EAA A AGL++D H + V+ G +S ++ A+
Sbjct: 247 WGAIE-QLKNQGLAATKKEAARYAFTAGLEMDMMSHAYDRHLQELVEEGKVSMAQVDEAV 305
Query: 348 VNTLTVQMRLGMFDGEPSSQPY--GHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLP 405
L ++ RLG+F+ +PY + P ++A A + +VLLKN+ LP
Sbjct: 306 RRVLLLKFRLGLFE-----RPYTPATTEKERFFRPKSMDIAARLAAESMVLLKNENNVLP 360
Query: 406 LSHIRHRTVAVIGPNSDVTVTMIGNYAG------IACGYTTPLQGIGRYARTIHQQGCKD 459
L+ + +AVIGP + ++G++ G +A Y A + GC +
Sbjct: 361 LTD--KKKIAVIGPMAKNGWDLLGSWRGHGKDTDVAMLYDGLAAEFAGKAELRYALGC-N 417
Query: 460 VACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASK 519
+ + F A++A+R +D +L +G + E R+ + LP Q+EL ++ A K
Sbjct: 418 TQGDNREGFAEALEAARWSDVVVLCLGEMMTWSGENASRSSIALPQMQEELAKELKKAGK 477
Query: 520 GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWY 579
P +LVL++G P+++ + +P AI+ PG G +A IL G NP GKL MT
Sbjct: 478 -PVVLVLVNGRPLEL--NRLEPVSDAILEIWQPGVNGALPMAGILSGRINPSGKLAMT-- 532
Query: 580 PQEYITNLPMTEMAMRPSQSKRYPGRTYR-FYKGPV---VYPFGHGMSYTNFVHTVANAP 635
P + + ++R GR ++ FYK +YPFGHG+SYT F +
Sbjct: 533 -------FPYSTGQIPIYYNRRKSGRGHQGFYKDITSDPLYPFGHGLSYTEFKY------ 579
Query: 636 TVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STP 694
T++ A KV + L +V V N+G++DGA T+ F S P
Sbjct: 580 ---------------GTVTPSATKVKRGE----KLSAEVTVTNIGARDGAETVHWFISDP 620
Query: 695 PAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHI 751
P K+L FEK + AG + +I + + V+ G R + GE+NIH+
Sbjct: 621 YCSITRPVKELKHFEKQLIKAGETKTFRFDIDLERDFGFVNEDGKRFLETGEYNIHV 677
>gi|295132888|ref|YP_003583564.1| beta-glucosidase [Zunongwangia profunda SM-A87]
gi|294980903|gb|ADF51368.1| beta-glucosidase [Zunongwangia profunda SM-A87]
Length = 855
Score = 260 bits (665), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 150/424 (35%), Positives = 227/424 (53%), Gaps = 32/424 (7%)
Query: 37 TRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSN 96
++ P+ +L +R DL+ RL+L+EK L+ + A+PRLGIK + WWSEALHG +N
Sbjct: 9 SQQKPYQNPNLSPEERAEDLVNRLTLEEKASLMFDVSEAIPRLGIKKFNWWSEALHGFAN 68
Query: 97 VGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY---------NGGT 147
T FP+ + A+SF+ L + SDE RA Y N
Sbjct: 69 ND----------DVTVFPEPVGMAASFDDELVYQVFDATSDEVRAKYHEALRNGEENKRF 118
Query: 148 AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKH 206
L+ W+PNVNIFRDPRWGRGQET GEDP L+ + V+GLQG D K+ A KH
Sbjct: 119 LSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVQVVKGLQGPEDAKYKKLLACAKH 178
Query: 207 FTAYDLDNWNGVDRFHFNA-KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTC 265
+ + W+ R N VS++D+ +T+ F++ V + V VMC+Y +++ P C
Sbjct: 179 YAVHSGPEWS---RHELNLNNVSQRDLWETYLPAFKVLVQDANVRQVMCAYQRLDDEPCC 235
Query: 266 ADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFL 325
+L++ +R +W +VSDC ++ +Y + + +S AAA A+ AG D++C
Sbjct: 236 GSDRLLQQILREKWGFEHLVVSDCGAIQDFYTSHNVSSDAVHAAAKAVLAGTDVECQ--W 293
Query: 326 GLHT----ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD 381
H AV++GL+ E DI+ ++ L + LG D + PY + + +
Sbjct: 294 DKHNYKLLPEAVEKGLVKEEDIDRSVKRVLIGRFELGEMDPD-EIVPYAQIPASVINNEE 352
Query: 382 HQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTP 441
H++LAL+ AR+ + LL+N+ LPLS + R +AVIGPN+D + GNY G +
Sbjct: 353 HRQLALKMARESMTLLQNKNNILPLSKGQDR-IAVIGPNADDEPMLWGNYNGTPVRTISI 411
Query: 442 LQGI 445
L GI
Sbjct: 412 LDGI 415
Score = 101 bits (251), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 79/300 (26%), Positives = 125/300 (41%), Gaps = 49/300 (16%)
Query: 479 DATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMS 528
+ I V GL +E E + DR + LP Q+ + + A K I V S
Sbjct: 593 ETVIFVGGLSTKLEGEEMPVSYPGFKGGDRTDIALPSVQRNCLKTLKDAGK-KVIFVNNS 651
Query: 529 GGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLP 588
G I AI+ A Y G++GG A+AD+LFG NP GKLP+T+Y T LP
Sbjct: 652 GSAI--GLVPETTSCDAILQAWYGGESGGQAVADVLFGDYNPSGKLPVTFYKD--TTQLP 707
Query: 589 -MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHG 647
+ +M GRTYRF K ++PFGHG+SYTNF + LD
Sbjct: 708 DFEDYSMN--------GRTYRFMKAEPLFPFGHGLSYTNF--------KIGEAQLDKSEI 751
Query: 648 SINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVA 707
+++++ + + + N G +G + V+ P K L
Sbjct: 752 DTSSSVN-----------------ITISISNEGKTEGVEIIQVYVHKQGLEEGPIKTLKG 794
Query: 708 FEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGV 767
F++V++ + V IN+ + ++ + ++ G + I G + L L +
Sbjct: 795 FKRVNLKPNEMKNVTINLPSNSFEFYDKKARSMKVMPGNYEIFYGNSSDDKQLKKTELKI 854
>gi|319787180|ref|YP_004146655.1| glycoside hydrolase [Pseudoxanthomonas suwonensis 11-1]
gi|317465692|gb|ADV27424.1| glycoside hydrolase family 3 domain protein [Pseudoxanthomonas
suwonensis 11-1]
Length = 903
Score = 260 bits (664), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 153/399 (38%), Positives = 219/399 (54%), Gaps = 35/399 (8%)
Query: 56 LIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQ 115
L+ +L+ +EK+ L++ A A+PRLG+ Y WW+E+LHG P T+F +
Sbjct: 54 LVAKLTTEEKLDQLLNTAPAIPRLGVPHYNWWTESLHGAMGTLP----------TTNFAE 103
Query: 116 VITTASSFNATLWEAIGRVVSDEARAMY-----NGGTA----GLTYWSPNVNIFRDPRWG 166
I A++F+ L + +G+V++ E R ++ G T GL WSPN+NIFRDPRWG
Sbjct: 104 PIGLAATFDDGLVKRVGQVIATEVRGLHALARQTGRTGRMGTGLNTWSPNINIFRDPRWG 163
Query: 167 RGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAK 226
RGQET GEDP L+ + +YV+G+QG D + A+ KH+ + R N
Sbjct: 164 RGQETYGEDPFLAARMGVAYVKGVQGEDPRWYDIIATPKHYAVHSGPEST---RHSANVY 220
Query: 227 VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIV 286
VS+ D+EDT+ FR ++EG SVMC+YN+V+G P CA+ +LK +RG W GY+V
Sbjct: 221 VSRHDLEDTYLPAFRAAIVEGGAGSVMCAYNRVDGQPACANDLLLKDYLRGAWDFKGYVV 280
Query: 287 SDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-------PFLGLHTESAVQRGLLS 339
SDCD+V H+ A A A+R+G+D +C L + A++RGL+S
Sbjct: 281 SDCDAVTDINRHHHYAPDAASAVAAAMRSGVDNECNGATLVDMAGLARPYQEALERGLIS 340
Query: 340 EIDINNALVNTLTVQMRLGMFDG-EPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
D++ ALV + R G G P S P DV P+H LALE A + +VLLK
Sbjct: 341 MADVDRALVRLFAARYRNGDLPGLRPLSTETA--SPADVGKPEHAALALEVAEKSLVLLK 398
Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACG 437
N G LPL +AV+GP +D T + GNY+ G
Sbjct: 399 NSG-VLPLRP--QAKLAVVGPLADATRVLRGNYSSPQSG 434
Score = 124 bits (311), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 92/300 (30%), Positives = 149/300 (49%), Gaps = 50/300 (16%)
Query: 465 DQLFGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKV 514
D G A+ Q+D + V+GL +EAE D+ L L QQ L+ +
Sbjct: 619 DDPIGQMRAAAAQSDVVVAVVGLTSDLEAEEAPVEVPGFKGGDKTSLDLLADQQALL-EA 677
Query: 515 SMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKL 574
+ A+ P ++V+M+G P+++A+A+ AAII A YPGQ+GG AI ++L G +NP G+L
Sbjct: 678 ARATGKPLVVVVMNGSPVNLAWARQ--HAAAIIEAWYPGQSGGLAIGNVLAGHANPAGRL 735
Query: 575 PMTWYPQEYITNL-PMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVAN 633
P+T+Y + +L P + MR GRTYR+++G VYPFGHG+SYT+F +
Sbjct: 736 PLTFY--RSVDDLPPFDDYDMR--------GRTYRYFEGKPVYPFGHGLSYTSFGY---- 781
Query: 634 APTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFST 693
P + G H + +KVT +++N G + G ++
Sbjct: 782 GPLKLEPAAGGAH---------EGLKVT------------TELRNTGKRAGDEVAQLYLD 820
Query: 694 PPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
P P L F++VH+ G ++ + + + LS V G R++ G + + +GG
Sbjct: 821 FPDVDGLPRIALRGFQRVHLAPGERRVLEFQLD-PRDLSAVTPEGQRQVFAGNYRVFVGG 879
>gi|418855797|ref|ZP_13410448.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19593]
gi|392820838|gb|EJA76679.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19593]
Length = 765
Score = 260 bits (664), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 204/669 (30%), Positives = 325/669 (48%), Gaps = 103/669 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN +GRV + Y GL W+P V++ RDPRWGR
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V+ +QG S DR V S KHF AY +N VD
Sbjct: 172 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R EW G
Sbjct: 227 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 283
Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T+ PE+A A++AG+D+ + + ++ G ++ +
Sbjct: 284 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 341
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPK+ D H++ A E AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 396
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + T+AV+GP +D ++G++ AG+A T L GI G A
Sbjct: 397 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 454
Query: 450 RTIHQQGC-----KDVAC--------------ADDQLFGAAIDASRQADATILVMGLDQS 490
+ ++ +G KD+ + + A+ A++QAD + V+G Q
Sbjct: 455 KILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 514
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 515 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
+ G GG AIAD+LFG NP GKLP++ +P+ + +P+ T P + +Y
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 629
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +Y FG+G+SYT F TV++ ++ P R G + A+
Sbjct: 630 RYFDEANGP-LYSFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 672
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
V+V N G ++GA + ++ A P KQL FEK+ + G + V
Sbjct: 673 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGESKTVSF 723
Query: 723 -INIHVCKY 730
I+I K+
Sbjct: 724 PIDIEALKF 732
>gi|224536377|ref|ZP_03676916.1| hypothetical protein BACCELL_01251 [Bacteroides cellulosilyticus
DSM 14838]
gi|224522015|gb|EEF91120.1| hypothetical protein BACCELL_01251 [Bacteroides cellulosilyticus
DSM 14838]
Length = 954
Score = 260 bits (664), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 213/737 (28%), Positives = 357/737 (48%), Gaps = 91/737 (12%)
Query: 39 TLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISG--AAAVPRLGIKGYEWWSEALHGVSN 96
+L + +LP+ +RV L+ ++ ++K++L+ G +P L + EA+HG S
Sbjct: 167 SLRYMDPTLPVEERVESLLSVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFS- 224
Query: 97 VGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPN 156
+G GAT FPQ + +++N L E + V DE A GT + WSP
Sbjct: 225 ------YGS---GATIFPQALAMGATWNKKLTEDVAMAVGDETLA---AGT--MQAWSPV 270
Query: 157 VNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWN 216
+++ +D RWGR +ET GEDPVL + ++++G Q S G + + KHF +
Sbjct: 271 LDVAQDARWGRCEETFGEDPVLVSQIGGAWIKGYQ-SKG----LFTTPKHFGGHGAP-LG 324
Query: 217 GVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIR 276
G D +S++++ + VPFR + SVM +Y+ GVP +L +R
Sbjct: 325 GRDSHDIG--LSEREMREVHLVPFRHVIRNYDCQSVMMAYSDYLGVPVAKSRELLHSILR 382
Query: 277 GEWRLNGYIVSDCDSVGVYYDTQHFTSTPE-EAAADAIRAGLDLDCG-PFLGLHTESAVQ 334
EW +G+IVSDC ++G +H+T+ + EAA A+ AG+ +CG + A +
Sbjct: 383 EEWGFDGFIVSDCGAIGNLTARKHYTAKDKIEAANQALAAGIATNCGDTYNDKEVIQAAK 442
Query: 335 RGLLSEIDINNALVNTLTVQMRLGMFDGEPSS-QPYGHLGPKDVCTPDHQELALEAARQG 393
G ++ +++ L + R +F+ P+ + + P + H+E+A +AAR+
Sbjct: 443 DGRINMENLDEVCRTMLRMMFRNELFEKTPNKPLDWNKIYP-GWNSDSHKEMARQAARES 501
Query: 394 IVLLKNQGPSLPLSHIRHRTVAVIGPNSD-------VTVTMIGNYAGIACGYTTPLQGIG 446
IV+L+N+ LPL+ RT+AV+GP +D + G + G Q +G
Sbjct: 502 IVMLENKDNILPLAK-DMRTIAVVGPGADDLQPGDYTPKLLPGQLKSVLTGIK---QAVG 557
Query: 447 RYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEA---------EALD 497
+ + +++QGC D ++ A+ A+ Q+D +LV+G + E+ E D
Sbjct: 558 KQTKVVYEQGC-DFTSSNGTNIPKAVKAASQSDVVVLVLGDCSTSESTTDVYKTSGENHD 616
Query: 498 RAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGG 557
A L+LPG+QQEL+ V K P IL+L +G P +++ A + + W PGQ GG
Sbjct: 617 YATLILPGKQQELLEAVCATGK-PVILILQAGRPYNLSKASELCKAILVNW--LPGQEGG 673
Query: 558 TAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYP 617
A AD+LFG NP G+LPMT +P+ ++ LP+ + +RY FY P+ Y
Sbjct: 674 PATADVLFGDYNPAGRLPMT-FPR-HVGQLPL--YYNFKTSGRRYEYSDMEFY--PLYY- 726
Query: 618 FGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVK 677
FG+G+SYT+F + SG I+ + + + +Q VK
Sbjct: 727 FGYGLSYTSFEY------------------------SGLKIQ----EKDNGNVAIQATVK 758
Query: 678 NVGSKDGAHTLLVFSTPPAGHWAPH-KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDR 736
NVG + G + ++ T +L F +VH+ G + V + + LS+++
Sbjct: 759 NVGQRAGDEVVQLYITDMYASVKTRITELKDFTRVHLQPGESKIVSFELTPYE-LSLLND 817
Query: 737 SGTRRIPLGEHNIHIGG 753
R + GE I +GG
Sbjct: 818 RMDRVVEKGEFKILVGG 834
>gi|423140736|ref|ZP_17128374.1| glycosyl hydrolase family 3 protein [Salmonella enterica subsp.
houtenae str. ATCC BAA-1581]
gi|379053290|gb|EHY71181.1| glycosyl hydrolase family 3 protein [Salmonella enterica subsp.
houtenae str. ATCC BAA-1581]
Length = 765
Score = 260 bits (664), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 207/669 (30%), Positives = 327/669 (48%), Gaps = 103/669 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN +GRV + Y GL W+P V++ RDPRWGR
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V+ +QG S DR V S KHF AY +N VD
Sbjct: 172 EGFGEDTWLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R EW G
Sbjct: 227 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 283
Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T+ PE+A A+++G+D+ + + ++ G ++ +
Sbjct: 284 TVSDHGAIKEL--IKHGTAADPEDAVRVALKSGVDMSMADEYYSKYLPGLIKSGKVTMAE 341
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPK+ D H++ A E AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESLV 396
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + T+AV+GP +D ++G++ AG+A T L GI G A
Sbjct: 397 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVADQSVTVLAGIQNAVGDGA 454
Query: 450 RTIHQQGCK---DVACAD----------------DQLFGAAIDASRQADATILVMGLDQS 490
+ ++ +G D D + A+ A++QAD + V+G Q
Sbjct: 455 KILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 514
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + LP Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 515 MAHEASSRTDITLPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM---RPSQSKRYPGRTY 607
+ G GG AIAD+LFG NP GKLPM+ +P+ + +P+ + RP S++ T
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPMS-FPRS-VGQIPVYYSHLNTGRPYVSEKPNKYTS 629
Query: 608 RFY---KGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R++ GP +YPFG+G+SYT F TV++ ++ P R G + A+
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSAPTMKRDGKVTAS------------ 672
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
V+V N G ++GA + ++ A P KQL FEK+ + G + V
Sbjct: 673 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKIALKPGESKTVSF 723
Query: 723 -INIHVCKY 730
I+I K+
Sbjct: 724 PIDIDALKF 732
>gi|386821036|ref|ZP_10108252.1| beta-glucosidase-like glycosyl hydrolase [Joostella marina DSM
19592]
gi|386426142|gb|EIJ39972.1| beta-glucosidase-like glycosyl hydrolase [Joostella marina DSM
19592]
Length = 725
Score = 259 bits (663), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 217/712 (30%), Positives = 341/712 (47%), Gaps = 90/712 (12%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVS---NV 97
PF + +RV++L+ +++ EKV L S VPRLG+KG E LHG++
Sbjct: 31 PFQNPKIATEKRVDNLLSLMTIDEKVNAL-STNPEVPRLGVKG-TGHVEGLHGLALGGPA 88
Query: 98 GPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEAR-AMYNGGTAGLTYWSPN 156
G G K P T+FPQ +++ L + I ++ EAR A+ G GL +PN
Sbjct: 89 GWGGKGKEPLP-TTTFPQAYGLGETWDTELLKEIAKIEGYEARYALQKYGRGGLVIRAPN 147
Query: 157 VNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWN 216
++ RDPRWGR +E+ GED +GK ++V+GLQGSD + A+ KHF A N N
Sbjct: 148 ADLARDPRWGRTEESYGEDAFFNGKMTVAFVKGLQGSDKTYWQTASLMKHFLA----NSN 203
Query: 217 GVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIR 276
R + ++ ++ + + +PF+M V+EG + M +YN+VNG+P P +LK
Sbjct: 204 EDGRTYTSSDFDERLWREYYALPFKMGVVEGGSRAYMAAYNKVNGIPAMVHP-MLKDITV 262
Query: 277 GEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE---SAV 333
EW NG I +D + + + AA I+AG++ FL TE A+
Sbjct: 263 DEWGQNGIICTDGGAYKLLLSDHKYYKDKYLGAAATIKAGIN----QFLDDFTEGVYGAL 318
Query: 334 QRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPK-DVCTP----DHQELALE 388
G L+E D++ L V ++LGM D ++ PY +G + D P H++LALE
Sbjct: 319 ANGYLTEADLDEVLRGNYRVMIKLGMLDSS-ANNPYAKIGAEADSMDPWELEAHKKLALE 377
Query: 389 AARQGIVLLKNQGPS--LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI- 445
A + IVLLKN LPL + + +A+IG +D ++ Y+G +PLQGI
Sbjct: 378 ATEKSIVLLKNDPAKRLLPLQKKKVKKIAIIGEYAD--AVLLDWYSGTPPYTISPLQGIK 435
Query: 446 GRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEA------------ 493
+ + +V A + G A++ ++ AD I+ +G + A
Sbjct: 436 NKVGENV------EVLFAKNNADGKAVEIAKNADVAIVFIGNHPTCNAGWAQCPVPSNGK 489
Query: 494 EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPG 553
EA+DR L + E + K+ + T++ L+S P + + + + I AI
Sbjct: 490 EAVDRQAL---NSEYEDLVKLVYKANPNTVVGLISSFPYTINWTQEN--IPAIFHVTQNS 544
Query: 554 QAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL-PMTEMAMRPSQSKRYPGRTYRFYKG 612
Q GTAIA++LFG NP G+L TW I++L P+ + +R GRTY ++KG
Sbjct: 545 QELGTAIANVLFGAYNPAGRLTQTWVKD--ISDLPPLMDYNIR-------NGRTYMYFKG 595
Query: 613 PVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGV 672
+Y FGHG+SYT F + P K IK + V
Sbjct: 596 KPLYAFGHGLSYTTFKYKDMEIP--------------------KQIKENEE------VSV 629
Query: 673 QVDVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGI 723
+V++ N G DG + L + P K+L +F+++H+ AG + V +
Sbjct: 630 KVNITNAGEVDGDEVVQLYVKHINSTVERPIKELKSFKRIHIKAGETKTVSL 681
>gi|218132023|ref|ZP_03460827.1| hypothetical protein BACEGG_03648 [Bacteroides eggerthii DSM 20697]
gi|217985783|gb|EEC52123.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
eggerthii DSM 20697]
Length = 762
Score = 259 bits (663), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 218/779 (27%), Positives = 352/779 (45%), Gaps = 123/779 (15%)
Query: 48 PIPQRVNDLIGRLSLQEKVKLL---------ISGAAAVPRLG--IKGYEWWS-------- 88
P+ RV DL+ R++L+EK+ + + G ++ +KG + S
Sbjct: 34 PVEVRVADLLKRMTLEEKIAQMQDLKFKDFSVDGKVDTVKMDSVLKGMSYASVFGSRLSV 93
Query: 89 ----EALH------------GVSNVGPGTKFGGDF-PGATSFPQVITTASSFNATLWEAI 131
E++ G+ +G G GAT FPQ I +S+FN + +
Sbjct: 94 EQMQESMFAINKYMAEHNRLGIPVLGEAESLHGLIHDGATIFPQSIALSSTFNPDITHRV 153
Query: 132 GRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQ 191
V++ EA+A T SP +++ R+ RWGR +ET GEDP L G+ +YV
Sbjct: 154 ATVIAQEAKA-----TGVDQVLSPVLDLARELRWGRVEETYGEDPYLVGRMGVAYVSAF- 207
Query: 192 GSDGDRLKVAASCKHFTAY-------DLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCV 244
++ V + KHF A+ +L + G +R D+ + PF+ +
Sbjct: 208 ----NKEGVMTTLKHFLAHGSPTGGLNLASVTGCER----------DLRSLYLKPFQDVM 253
Query: 245 MEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTST 304
E SVM SYN VP A IL +RGE GYI SD SV + H
Sbjct: 254 REAMPYSVMNSYNSYESVPVAASHWILDDILRGEMGFKGYISSDWGSVEMLRSLHHTAKD 313
Query: 305 PEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP 364
+AA A+ AG+D++ +S V+ G+L E +I+ + LT + +G+FD +
Sbjct: 314 KADAACQAVIAGVDVEVDGDCYETLDSLVRSGVLPEKEIDKCVSRVLTAKFAMGLFDKDY 373
Query: 365 SSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVT 424
+ + +L + V TP+ ELAL AAR+ +L+KN+ LPL + R+VAVIGPN+
Sbjct: 374 TKR--ANLS-QTVHTPEAVELALVAARESAILVKNENSLLPLDANKLRSVAVIGPNA--A 428
Query: 425 VTMIGNYAGIACGY--TTPLQGIGRY----ARTIHQQGCKDVACADDQLFGAAIDASRQA 478
G+Y TPLQGI + + +GC ++ D F A+ A+R +
Sbjct: 429 QVQFGDYMWTNSNEYGITPLQGIEAVTQGKVKINYAKGC-EIHTQDRSGFSQAVTAARNS 487
Query: 479 DATILVMG---------LDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSG 529
D +L +G S+ E+ D + + LPG Q+ L+ V K PTI+VL++G
Sbjct: 488 DVALLFVGAMSGSPGRPWPNSVSGESFDLSDISLPGCQEALIRAVKATGK-PTIVVLVAG 546
Query: 530 GPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQE------Y 583
P + + K++ + W Y G+ G AIA+ILFG NP G+L ++ +PQ +
Sbjct: 547 KPFAIPWVKDNCEAVIVQW--YGGEQEGRAIAEILFGEVNPSGRLNVS-FPQSTGHLPVF 603
Query: 584 ITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLD 643
P + + PGR Y F V+ FGHG+SYT F +
Sbjct: 604 YNYYPSDKGFYHDHGTLEKPGRDYVFSSPDPVWAFGHGLSYTTFKY-------------- 649
Query: 644 GRHGSINATISGKAIKVTHAK-CNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA-P 701
K++++++ + + T + V+V N G +DG + ++ P
Sbjct: 650 ------------KSMQISNKEFTDDDTCEITVEVANTGKRDGKEVVQLYVNDIVSSVVTP 697
Query: 702 HKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
K+L FEKV +PAG + V N+ + K L++ + + G+ + +G + L
Sbjct: 698 VKELRRFEKVFIPAGETRTVKFNLPI-KELALWNTDMKEVVEPGDFELQVGAASDDIRL 755
>gi|224537265|ref|ZP_03677804.1| hypothetical protein BACCELL_02142 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521119|gb|EEF90224.1| hypothetical protein BACCELL_02142 [Bacteroides cellulosilyticus
DSM 14838]
Length = 885
Score = 259 bits (663), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 155/451 (34%), Positives = 240/451 (53%), Gaps = 33/451 (7%)
Query: 38 RTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNV 97
+TLP+ L +R DL+ RL+L+EK L+ + A+PRLGIK + WWSEALHGV+N
Sbjct: 19 QTLPYQNPELSPAERAKDLVKRLTLEEKALLMCDDSEAIPRLGIKKFNWWSEALHGVANQ 78
Query: 98 GPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG-------- 149
G T FP+ + A+SFN L I VSDE RA +N
Sbjct: 79 G----------NVTVFPEPVGMAASFNDKLVFDIFNAVSDEMRAKHNERVRNGLEDVRFH 128
Query: 150 -LTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASC-KHF 207
L+ W+PNVNIFRDPRWGRGQET GEDP L+ + + V+GLQG + ++ + +C KH+
Sbjct: 129 SLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGIAVVKGLQGPENEKYRKLLACAKHY 188
Query: 208 TAYDLDNWNGVDRFHFNA-KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCA 266
+ W+ R N VS +D+ +T+ F+ V + V VMC+Y +++ P C
Sbjct: 189 AVHSGPEWS---RHTANLNNVSPRDLWETYLPAFKALVQKADVREVMCAYQRLDDDPCCG 245
Query: 267 DPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFL 325
+ +L++ +R EW +VSDC ++ ++ + +S AA AG D++CG +
Sbjct: 246 NTRLLQQILRDEWGFKYLVVSDCGAIADFWTSHKSSSDAVHAAVKGTMAGTDVECGYGYA 305
Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
AV +GL++E +++ ++ + + LG D +PS + + V H++L
Sbjct: 306 YQKLPEAVSKGLITEEEVDKHVLRLMEGRFELGEMD-DPSLVNWTKIPMSVVNCKAHKDL 364
Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
+L +RQ + LL+N+ LPLS R +AVIGPN+D + GNY G T L G
Sbjct: 365 SLNMSRQTMTLLQNKNNVLPLSK-SIRKIAVIGPNADDKPMLWGNYNGTPNQTITILDGF 423
Query: 446 G---RYARTIHQQGCKDVACADDQLFGAAID 473
+ + ++ +GC V +DQ + +D
Sbjct: 424 KSKLKKNQIVYMKGCDLV---NDQTLESYLD 451
Score = 99.4 bits (246), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 85/320 (26%), Positives = 132/320 (41%), Gaps = 62/320 (19%)
Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
+ +I + D + V G+ +E E + DR + LP Q+ + + A
Sbjct: 590 YSESISKLKGIDVVVFVGGISPQLEGEEMPVNIPGFKGGDRTDIELPAVQRNFLKALKDA 649
Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
K + V SG + A AI+ A Y G+ GG A+AD+LFG NP GKLP+T
Sbjct: 650 GK-QVVFVNCSGSSM--ALLPETESCDAILQAWYGGELGGYAVADVLFGDYNPSGKLPVT 706
Query: 578 WYPQEYITNLPMTE-MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
+Y LP E +M+ GRTYR+ P ++PFG G+SYT+F
Sbjct: 707 FYKS--TKQLPDYEDYSMK--------GRTYRYMSDP-LFPFGFGLSYTDFA-------- 747
Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRL------TLGVQVDVKNVGSKDGAHTLLV 690
V A CN+ +L + V V N G + G + V
Sbjct: 748 -----------------------VGTASCNKTQLHTDESLTLTVPVSNTGKRSGTEVVQV 784
Query: 691 FSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIH 750
+ P K L A+ +V + AGA+Q V I + + + T R+ GE+ +
Sbjct: 785 YIRKTDDADGPLKSLKAYARVELAAGAKQDVKIELPSESFECFDPSTNTMRVAPGEYELF 844
Query: 751 IGGTKHSVSLHAATLGVIKS 770
G + + L + + + S
Sbjct: 845 YGTSSAARDLQSVKVTLRDS 864
>gi|365971332|ref|YP_004952893.1| beta-glucosidase [Enterobacter cloacae EcWSU1]
gi|365750245|gb|AEW74472.1| Periplasmic beta-glucosidase [Enterobacter cloacae EcWSU1]
Length = 765
Score = 259 bits (663), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 211/691 (30%), Positives = 336/691 (48%), Gaps = 106/691 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN + +GRV + Y GL W+P V++ RDPRWGRG
Sbjct: 118 TVFPISLGLASSFNLDAVKTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRGS 171
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V +QG S DR V S KHF AY +N VD
Sbjct: 172 EGFGEDTYLTATMGKTMVEAMQGKSPADRYAVMTSVKHFAAYGAVEGGKEYNTVD----- 226
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R +W G
Sbjct: 227 --MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDQWGFKGI 283
Query: 285 IVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T S PE+A A+++G+++ + + V+ G ++ +
Sbjct: 284 TVSDHGAIKEL--IKHGTASDPEDAVRVALKSGINMSMSDEYYSKYLPGLVKSGKVTMAE 341
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPKD D H++ A + AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKDSDPVDTNAESRLHRKEARDVARESLV 396
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + T+AV+GP +D ++G++ AG+A T L GI G A
Sbjct: 397 LLKNRLDTLPLK--KSGTIAVVGPLADSKRDVMGSWSAAGVAEQSVTVLTGIKNAVGENA 454
Query: 450 RTIHQQGC-----KDVACADDQ--------------LFGAAIDASRQADATILVMGLDQS 490
+ ++ +G KD+ +Q + A++ ++Q+D + V+G Q
Sbjct: 455 KVVYAKGANVTNDKDIVTFLNQYEEAVKVDPRSPKEMIDEAVNTAKQSDVVVAVVGEAQG 514
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 515 MAHEASSRTDITIPQSQRDLIAALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM---RP---SQSKRYPG 604
+ G GG AIAD+LFG NP GKLPM+ +P+ + +P+ + RP + +Y
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPMS-FPRS-VGQIPVYYSHLNTGRPYNADKPNKYTS 629
Query: 605 RTYRFYKGPVVYPFGHGMSYTNF-VHTVA-NAPTVVAVPLDGRHGSINATISGKAIKVTH 662
R + GP +YPFG+G+SYT F V V +APT+ R G + A+
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTFKVSDVKMSAPTL------KRDGKVTAS---------- 672
Query: 663 AKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRV 721
V+V N G ++GA + ++ A P KQL FEKV++ G + +
Sbjct: 673 -----------VEVSNTGKREGATVIQMYVQDVTASMSRPVKQLRGFEKVNLKPGETKTI 721
Query: 722 GINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
I V L ++ G+ N+ IG
Sbjct: 722 SFPIDV-DALKFWNQQMKYDAEPGKFNVFIG 751
>gi|206901921|ref|YP_002251428.1| xylosidase/arabinosidase [Dictyoglomus thermophilum H-6-12]
gi|206741024|gb|ACI20082.1| xylosidase/arabinosidase [Dictyoglomus thermophilum H-6-12]
Length = 756
Score = 259 bits (663), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 217/690 (31%), Positives = 337/690 (48%), Gaps = 96/690 (13%)
Query: 89 EALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA 148
EALHG G+T FPQ I AS++N L + + E R +
Sbjct: 138 EALHGCMA-----------KGSTIFPQAIGMASTWNPELIYQVATAIGKETR------SR 180
Query: 149 GL-TYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHF 207
G+ SP +NI RDPR GR +ET GEDP L+ + A +Y++G+Q V A+ KHF
Sbjct: 181 GIHQVLSPTINIARDPRCGRTEETYGEDPYLASRMAVAYIKGVQEQG-----VIATPKHF 235
Query: 208 TAYDLDNWNGVDRF--HFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTC 265
A + + G D + HF+ ++ + + + FR + E S+M +YN ++G+P
Sbjct: 236 VANFVGD-GGRDSYPIHFSERL----LREIYFPAFRASIEEAGALSLMAAYNSLDGIPCS 290
Query: 266 ADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDL-----D 320
++ +L R +R EW GY+VSD SV + EAA ++ AGLD+ D
Sbjct: 291 SNKWLLTRILRKEWGFKGYVVSDYFSVLHLMTKHKVAESKAEAAKLSLEAGLDMELPDSD 350
Query: 321 CGPFL-GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCT 379
C + GL ES LS+ ++ A+ L V+ +G+FD P P + + C+
Sbjct: 351 CFEEIPGLIRESK-----LSQDTLDEAVRRVLRVKFWIGLFDN-PFVDP-DYAERINDCS 403
Query: 380 PDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYT 439
+H+ELAL AR+ IVLLKN+G LPL+ R++AVIGPN+ V +G Y+G
Sbjct: 404 -EHRELALRVARESIVLLKNEG-ILPLNK-DIRSIAVIGPNA--AVPRLGGYSGYGVKVV 458
Query: 440 TPLQGI----GRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLD-QSIEAE 494
TPL+GI G + +GC + F AI ++++D IL MG E E
Sbjct: 459 TPLEGIKNKLGDKVKVYFAEGC-GLNDTSKSGFDEAIKIAQKSDVAILFMGNSVPETEGE 517
Query: 495 ALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQ 554
DR L LPG Q++L+ ++ + P I+VL++G I + + ++ A+I A YPG+
Sbjct: 518 QRDRHNLNLPGVQEDLIKEIC-NTNTPVIVVLINGSAITMMNWID--KVQAVIEAWYPGE 574
Query: 555 AGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRT--YRFYKG 612
GG AIAD+LFG NPGGKLP++ +P+ Y + LP+ +PS GR Y +G
Sbjct: 575 EGGNAIADVLFGDYNPGGKLPIS-FPK-YSSQLPLY-YNHKPS------GRVDDYVDLRG 625
Query: 613 -PVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLG 671
++PFG+G+SYT+F ++ T +P DG +
Sbjct: 626 NQYLFPFGYGLSYTDFKYSNLRI-TPEEIPRDGE------------------------VV 660
Query: 672 VQVDVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKY 730
+ D++N+G G + L A P K+L FE+V + G ++ V ++ +
Sbjct: 661 ITFDIENIGKYKGDEVVQLYLHDEFASVARPIKELKRFERVTLDVGERKTVSFKLN-RRD 719
Query: 731 LSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
L + + G + IG + + L
Sbjct: 720 LEFLSMDMELVVEPGRFEVLIGSSSEDIRL 749
>gi|359473580|ref|XP_003631325.1| PREDICTED: protein BRASSINOSTEROID INSENSITIVE 1-like [Vitis
vinifera]
Length = 785
Score = 259 bits (663), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 134/302 (44%), Positives = 189/302 (62%), Gaps = 12/302 (3%)
Query: 284 YIVSDCDSVGVYYDTQHFTSTPE-EAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEID 342
YIVSDC + V D Q++ + + +A A ++AGLDL+CG + +V G +S+ +
Sbjct: 10 YIVSDCYGLEVIVDNQNYLNESKVDAVAKTLQAGLDLECGHYYTDALNESVLTGKVSQYE 69
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGP 402
++ AL N + MR+G FDG P+ Y LG KD+C DH ELA EAARQGIVLLKN
Sbjct: 70 LDRALKNIYVLLMRVGYFDGIPA---YESLGLKDICAADHIELAREAARQGIVLLKNDYE 126
Query: 403 SLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVAC 462
LPL + + ++GP+++ T MIGNYAG+ Y +PL+ + GC D +C
Sbjct: 127 VLPLK--PGKKLVLVGPHANATEVMIGNYAGLPYKYVSPLEAFSAIGNVTYATGCLDASC 184
Query: 463 ADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPT 522
++D F A +A++ A+ TI+ +G D SIEAE +DR LLPG Q EL+ +V+ S GP
Sbjct: 185 SNDTYFSEAKEAAKFAEVTIIFVGTDLSIEAEFVDRVDFLLPGNQTELIKQVAEVSSGPV 244
Query: 523 ILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGG------KLPM 576
ILV++SG ID+ FAKN+PRI+AI+W G+PG+ GG AIAD++FG NP KL
Sbjct: 245 ILVVLSGSNIDITFAKNNPRISAILWVGFPGEQGGHAIADVVFGKYNPDTIPEWLWKLDF 304
Query: 577 TW 578
+W
Sbjct: 305 SW 306
>gi|308080460|ref|NP_001183746.1| uncharacterized protein LOC100502339 [Zea mays]
gi|238014360|gb|ACR38215.1| unknown [Zea mays]
Length = 344
Score = 259 bits (663), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 140/338 (41%), Positives = 196/338 (57%), Gaps = 16/338 (4%)
Query: 425 VTMIGNYAGIACGYTTPLQGIGRYARTIH-QQGCKDVAC---ADDQLFGAAIDASRQADA 480
+ +I NY G C TTPL+G+ Y + GC AC A DQ A+ + D
Sbjct: 1 MALIANYFGPPCESTTPLKGLQSYVNDVRFLAGCNSAACDVAATDQ----AVALAGSEDY 56
Query: 481 TILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKND 540
L MGL Q E+E DR LLLPG QQ L++ V+ ASK P ILVL+SGGP+D+ FA+++
Sbjct: 57 VFLFMGLSQKQESEGKDRTSLLLPGMQQSLITAVADASKRPVILVLLSGGPVDITFAQSN 116
Query: 541 PRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSK 600
P+I AI+WAGYPGQAGG AIA +LFG NP G+LP+TWYP+E+ T +PMT+M MR +
Sbjct: 117 PKIGAILWAGYPGQAGGLAIAKVLFGDHNPSGRLPVTWYPEEF-TKVPMTDMRMRADPTS 175
Query: 601 RYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVP---LDGRHGSINATISGKA 657
YPGR+YRFY+G VY FG+G+SY+ F + + +V A+ L G ++ ++
Sbjct: 176 GYPGRSYRFYQGNTVYKFGYGLSYSTFSRRLVHGTSVPALSSTLLTGLRETMTPQDGDRS 235
Query: 658 IKVT---HAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHV 713
V C +L V+V+N G DG H++L+F P P QL+ F H+
Sbjct: 236 YHVDAIGTEGCEQLKFPAMVEVQNHGPMDGKHSVLMFLRWPNTKQGRPASQLIGFRSQHL 295
Query: 714 PAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHI 751
AG ++ +I CK+ S V G + I +G H + +
Sbjct: 296 KAGETAKLRFDISPCKHFSRVRADGRKVIDIGSHFLMV 333
>gi|119476117|ref|ZP_01616469.1| periplasmic beta-glucosidase [marine gamma proteobacterium
HTCC2143]
gi|119450744|gb|EAW31978.1| periplasmic beta-glucosidase [marine gamma proteobacterium
HTCC2143]
Length = 748
Score = 259 bits (663), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 202/671 (30%), Positives = 334/671 (49%), Gaps = 71/671 (10%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + A+S+N L E RV + EA T G+ + ++P ++I RDPRWGR
Sbjct: 101 TIFPIPLGQAASWNPELIEQGARVAALEA------ATVGVNWTFAPMIDITRDPRWGRIA 154
Query: 170 ETPGEDPVLSGKYAASYVRGLQGSDGDRL-KVAASCKHFTAYDLDNWNGVDRFHFNAKVS 228
E+ GEDP L G+ A+ VRG QG D + +AA KHF Y GVD + A ++
Sbjct: 155 ESLGEDPYLCGELGAAMVRGFQGKDLSAIGSIAACAKHFAGYGAAE-GGVD--YNTAIIA 211
Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
+ ++ + + PF+ + G VAS M ++N +NGVP + +LK+ +R EW G +VSD
Sbjct: 212 ENELRNVYLPPFKAALDSG-VASFMTAFNDLNGVPASGNEFLLKQILREEWCYQGMVVSD 270
Query: 289 CDSVGVYYDTQHFTSTPEEAAADAIRAGLDLD-CGPFLGLHTESAVQRGLLSEIDINNAL 347
+S+ V FT+ +EAA +A AG+D++ H ES + G +S ++ +
Sbjct: 271 WESI-VQLTEHGFTANDKEAAFEAANAGIDMEMVSNTYSQHLESLIIEGRISLAQVDEMV 329
Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
N L ++ RLG+F+ P QP + DH++ A + A + +VLLKN SLPL
Sbjct: 330 KNILRLKFRLGLFEN-PYPQPDKL---PALVNHDHRQAAKKLALESVVLLKNSHQSLPLR 385
Query: 408 HIRHRTVAVIGPNSDVTVTMIGN--YAGIACGYTTPLQGIGRYA---RTIHQQGCKDVAC 462
++A+IGP +D +G + G A T LQ I +A T++ +
Sbjct: 386 LSALSSIALIGPLADDAYEQLGTWIFDGDADDSETVLQAINAFAGDSLTVNVDRALETTR 445
Query: 463 ADDQL-FGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGP 521
++ + + A++ +DA +L +G + + EA RA + LPG Q++L+ ++ +K P
Sbjct: 446 SNTFIDIDRTMAAAQSSDAIVLCLGEESILSGEAHSRADISLPGAQEQLIHLLAKTAK-P 504
Query: 522 TILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT---- 577
IL++M+G P+ + + + AI++A +PG GTA+ D+LFG +P GKLP+T
Sbjct: 505 MILIVMAGRPLTLEPIID--HVDAILYAWHPGTMAGTALTDLLFGEVSPSGKLPITFPRM 562
Query: 578 ------WYPQEYITNLP-------MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSY 624
+Y ++ P M ++A R +Q+ + ++PFG G+SY
Sbjct: 563 VGQVPIYYGKKNTGKPPSAESVVHMNDIAPRAAQTSLGMSAFHLDAGFTPLFPFGFGLSY 622
Query: 625 TNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDG 684
T+F + N +S + + + V VDV N G ++G
Sbjct: 623 TSFTYE-------------------NLHLSSSTMNIDG------VITVTVDVINCGEREG 657
Query: 685 AHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIP 743
+ +++ A + P K+L F+KVH+ AG +Q+V + L+ DR R I
Sbjct: 658 QEVVQLYTRDLAANVTRPVKELKQFQKVHLSAGERQQVKFLLKASA-LAFYDRKMNRIIE 716
Query: 744 LGEHNIHIGGT 754
G ++ GG+
Sbjct: 717 PGVFHLWTGGS 727
>gi|398386387|ref|ZP_10544389.1| beta-glucosidase-like glycosyl hydrolase [Sphingobium sp. AP49]
gi|397718418|gb|EJK79007.1| beta-glucosidase-like glycosyl hydrolase [Sphingobium sp. AP49]
Length = 791
Score = 259 bits (662), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 238/726 (32%), Positives = 338/726 (46%), Gaps = 109/726 (15%)
Query: 78 RLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSD 137
RLGI + E LHG + VG ATSFPQ I ASS++ T+ + +V+
Sbjct: 138 RLGIP-ILFHEEGLHGYAAVG-----------ATSFPQSIAMASSWDPTMLRQVNQVIGR 185
Query: 138 EARAMYNGGTAGL-TYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD 196
E RA G+ SP V+I RDPRWGR +ET GEDP L G+ + V GLQG
Sbjct: 186 EIRAR------GVPMVLSPVVDIARDPRWGRIEETYGEDPYLVGEMGVAAVEGLQGEGRS 239
Query: 197 RL----KVAASCKHFTAYDLDNWNGVDRFHFN---AKVSKQDIEDTFDVPFRMCVMEGKV 249
RL V A+ KH T + G N A VS++++ + F PF V +
Sbjct: 240 RLLRPGHVFATLKHLTGH------GQPESGTNVGPAPVSERELRENFFPPFEQVVKRTGI 293
Query: 250 ASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAA 309
+VM SYN+++GVP+ A+ +L +R EW G +VSD +V H + EEAA
Sbjct: 294 EAVMASYNEIDGVPSHANRWLLDNVLRQEWGFRGAVVSDYSAVDQLMSIHHIAANLEEAA 353
Query: 310 ADAIRAGLDLDCGPFLGLHT-ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQP 368
A+ AG+D D L T V+ G +SE ++ A+ L ++ R G+F+ P
Sbjct: 354 MRALDAGVDADLPEGLSYATLGKLVREGKVSEAKVDLAVRRMLELKFRAGLFE-----NP 408
Query: 369 YGHLGPKDVCTPDHQ--ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVT 426
Y T + + LA AA++ I LLKN G LPL T+AVIGP++ V
Sbjct: 409 YADANAAAAITNNDEARALARTAAQRSITLLKNDG-MLPLKP--EGTIAVIGPSA--AVA 463
Query: 427 MIGNYAGIACGYTTPLQGI----GRYARTIHQQGCK---------DVACADD-----QLF 468
+G Y G + L+GI G A + QG K D D +L
Sbjct: 464 RLGGYYGQPPHSVSILEGIKARVGTKANIVFAQGVKITENDDWWEDKVVKSDPAENRKLI 523
Query: 469 GAAIDASRQADATILVMGLDQSIEAEAL------DRAGLLLPGRQQELVSKVSMASKGPT 522
A++A+R D IL +G + E DR L L G QQEL + K P
Sbjct: 524 AQAVEAARNVDRIILTLGDTEQSSREGWADNHLGDRPSLDLVGEQQELFDALKALGK-PI 582
Query: 523 ILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQE 582
+VL++G P + K + AI+ Y G+ GG A+ADILFG NPGGKLP+T P+
Sbjct: 583 TVVLINGRP--ASTVKVSEQANAILEGWYLGEQGGNAVADILFGDVNPGGKLPVT-VPRS 639
Query: 583 YITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPL 642
+ LPM M+PS R Y F +YPFG G+SYTNF + AP
Sbjct: 640 -VGQLPMF-YNMKPSAR-----RGYLFDTTDPLYPFGFGLSYTNFSLS---AP------- 682
Query: 643 DGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA-P 701
++AT G K + V VDV+N G+++G + ++ P
Sbjct: 683 -----RLSATKIGTGGKTS----------VSVDVRNTGAREGDEVVQLYIRDKVSSVTRP 727
Query: 702 HKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
K+L F++V + G + V + + L + + R + G+ I G + SV+L
Sbjct: 728 VKELKGFQRVTLKPGESRTVTFTVG-PEALQMWNDQMRRVVEPGDFEIMTGNS--SVALQ 784
Query: 762 AATLGV 767
+ TL V
Sbjct: 785 STTLTV 790
>gi|334144838|ref|YP_004538047.1| beta-glucosidase [Novosphingobium sp. PP1Y]
gi|333936721|emb|CCA90080.1| beta-glucosidase [Novosphingobium sp. PP1Y]
Length = 889
Score = 259 bits (662), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 151/394 (38%), Positives = 216/394 (54%), Gaps = 32/394 (8%)
Query: 55 DLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFP 114
DL+ +++L EK+ L++ A A+PRL I Y WW+E+LHG G P T+FP
Sbjct: 36 DLVAKMTLDEKLGQLLNTAPAIPRLDIPAYNWWTESLHGAL---------GSLP-TTNFP 85
Query: 115 QVITTASSFNATLWEAIGRVVSDEARAMYNGGT---------AGLTYWSPNVNIFRDPRW 165
+ I A++F+A+L + + +S E R ++ GL WSPN+NIFRDPRW
Sbjct: 86 EPIGLAATFDASLVKDVAGAISTEVRGLHALARKTGRMGRIGTGLDTWSPNINIFRDPRW 145
Query: 166 GRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNA 225
GRGQET GEDP L+ + S+V G+QG D D V A+ KHF + N R H N
Sbjct: 146 GRGQETYGEDPYLTARMGVSFVEGMQGPDPDLPDVIATPKHFAVH---NGPESTRHHANV 202
Query: 226 KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYI 285
VS+ D+EDT+ FR ++EG+ SVMC+YN+V+G P CA +L+ + W GY+
Sbjct: 203 FVSRHDLEDTYLPAFRAAIVEGRAGSVMCAYNRVDGQPACASQELLQEHLVDAWGFQGYV 262
Query: 286 VSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE-------SAVQRGLL 338
VSDCD+V D + A A A+R G+D +C + T+ A++RGL+
Sbjct: 263 VSDCDAVKDISDNHKYAPDGAAAVAAAMRMGVDSECHTWTLSDTDGLTDRYREALERGLI 322
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
+ D++ L+ + ++R G G + DV TP H LAL+AA + +VLLK
Sbjct: 323 TVSDVDRTLIRLFSARLRNGDLPGVRKLSTFTS-SAADVGTPAHGALALKAAEESLVLLK 381
Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYA 432
N G LP + VAVIGP D T + GNY+
Sbjct: 382 NDG-ILPFQTAGMK-VAVIGPFGDATRVLRGNYS 413
Score = 124 bits (311), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 91/289 (31%), Positives = 136/289 (47%), Gaps = 48/289 (16%)
Query: 474 ASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTI 523
A++ AD + V+GL +EAE D+ L +P QQEL+ + K P I
Sbjct: 614 AAQAADVLVAVVGLTSDLEAEESPIEIPGFKGGDKTTLDIPADQQELLEQAKATGK-PLI 672
Query: 524 LVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEY 583
+V M+G PI++ +AK + AI+ A YPGQ+GG AIA++L G +NP GKLP+T+Y
Sbjct: 673 VVAMNGSPINLHWAKEN--ADAILEAWYPGQSGGLAIANVLTGKANPTGKLPLTFY--RS 728
Query: 584 ITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLD 643
+ +LP P GRTYR++ G VYPFG+G+SYT F + P V
Sbjct: 729 VEDLP-------PFDDYDMKGRTYRYFTGKAVYPFGYGLSYTTFGY----GPVAVEPASG 777
Query: 644 GRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHK 703
G I T V N G + G + ++ P P+
Sbjct: 778 GAQDGIRVTTQ---------------------VSNTGQRAGGDAVQLYLDFPDAPGTPNI 816
Query: 704 QLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
L F+KV + G ++V + + LS V G R++ G + + +G
Sbjct: 817 ALRGFQKVSLQPGETRQVTFTLS-PRDLSSVTPDGVRKVLKGHYRVTVG 864
>gi|393724082|ref|ZP_10344009.1| beta-glucosidase [Sphingomonas sp. PAMC 26605]
Length = 900
Score = 259 bits (662), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 170/452 (37%), Positives = 239/452 (52%), Gaps = 45/452 (9%)
Query: 1 MASTIAFFFLGLILLSASSSGLAAREPFACD-PKDATTRTLPFCQVSLPIPQRVNDLIGR 59
MAST + ++LL SG+AA A D P T P V + ++ R
Sbjct: 1 MASTFSVLSRTVLLLG---SGMAAIASQAQDAPPQPTVLAQP-SAVRADADRLARAIVAR 56
Query: 60 LSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITT 119
++ EK+ L++ A A+PRL I Y WW+E+LHG P T+FP+ I
Sbjct: 57 MTPAEKLPQLLNVAPAIPRLNIPAYNWWTESLHGAIGTVP----------TTNFPEPIGL 106
Query: 120 ASSFNATLWEAIGRVVSDEARAMYNGGT---------AGLTYWSPNVNIFRDPRWGRGQE 170
A+SF+A L + +S E RA++ GL WSPN+NIFRDPRWGRGQE
Sbjct: 107 AASFDAPLVHRVAGAISTEVRALHTLARETGKLGRIGTGLDTWSPNINIFRDPRWGRGQE 166
Query: 171 TPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQ 230
T GEDP L+ ++V G+QG + D +V A+ KHF + R N VS
Sbjct: 167 TYGEDPYLTAHQGVAFVTGMQGPNPDLPEVIATPKHFAVHSGPEST---RHAANVYVSPH 223
Query: 231 DIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCD 290
D+EDT+ FR ++EG+ S+MC+YN+++G P CA+ +LK +RG W NGY+VSDCD
Sbjct: 224 DLEDTYLPAFRAAIVEGRAGSIMCAYNRIDGQPACANDLLLKDHLRGAWGFNGYVVSDCD 283
Query: 291 SVGVYYDTQHFTSTPEEAAADAIRAGLDLDC-----GPFLGLHTE--SAVQRGLLSEIDI 343
+V D + A A A++AG+D +C G GL A++R L+SE D+
Sbjct: 284 AVKDIADNHKYAPDQATAVAAALKAGVDNECNTQTIGDIGGLPDRFGEALKRNLISEGDV 343
Query: 344 NNALVNTLTVQMRLGMFDG---EPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
+ LV + ++R G G P+ P G + TPDH LALEAA++ +V+LKN
Sbjct: 344 DRTLVRLFSARIRNGDLPGIAAAPAVVPVGA-----ILTPDHAALALEAAQKSLVMLKND 398
Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYA 432
G LPL +AVIGP D T + GNY+
Sbjct: 399 G-VLPLRP--GARIAVIGPLGDATRVLRGNYS 427
Score = 117 bits (294), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 87/278 (31%), Positives = 136/278 (48%), Gaps = 50/278 (17%)
Query: 486 GLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
GL +EAE D+ L LP QQ L+ + K P I++ M+G PI++A
Sbjct: 637 GLTSDLEAEETGVDVPGFKGGDKTTLDLPAEQQALLERAHATGK-PLIVIAMNGSPINLA 695
Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
+AK++ AAI+ A YPGQAGG A+A++L G ++PGG+LP+T+Y + +LP
Sbjct: 696 WAKDN--AAAIVEAWYPGQAGGLAVANVLTGKADPGGRLPLTFY--RSVADLP------- 744
Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG 655
P GRTYR++ G VYPFG+G+S+T+F + AP
Sbjct: 745 PFDDYAMTGRTYRYFTGTPVYPFGYGLSFTSFAY----AP-------------------- 780
Query: 656 KAIKVTHAKCNRLT-LGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVP 714
+ VT A T L V +V N G + G + ++ P P L + +V +
Sbjct: 781 --LTVTPAAGGAETGLRVTTEVSNTGQRVGDEVVQLYLNFPDQPGVPRVALRGYRRVTLK 838
Query: 715 AGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
G + V ++ + LS V G R++ G++ + +G
Sbjct: 839 PGEHRAVTFDLS-PRDLSAVGIDGKRQVMAGQYRVSVG 875
>gi|189464498|ref|ZP_03013283.1| hypothetical protein BACINT_00840 [Bacteroides intestinalis DSM
17393]
gi|189438288|gb|EDV07273.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 862
Score = 259 bits (662), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 164/432 (37%), Positives = 239/432 (55%), Gaps = 32/432 (7%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
+ SL +R + L+ L+L+EKV+L++ G+ V RL IK Y WW+EALHGV+ G
Sbjct: 22 YKNTSLSPEERADLLVKELTLEEKVQLMMDGSRPVERLDIKPYNWWNEALHGVARAGL-- 79
Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA---MYNGGTA-----GLTYW 153
AT FPQ I A+SF+ + VSDEARA Y+ + GLT W
Sbjct: 80 --------ATVFPQPIGMAASFSPETVYDVFTAVSDEARAKNTYYSSKDSRERYQGLTMW 131
Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL 212
+P VNI+RDPRWGRG ET GEDP L+ + V+GLQG +DG+ K+ A KHF +
Sbjct: 132 TPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGPADGNYDKLHACAKHFAVHSG 191
Query: 213 DNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
WN R FNA+ + +D+ +T+ PF V EG+V VMC+YN+ G P C +L
Sbjct: 192 PEWN---RHSFNAENIKPRDLYETYLPPFEALVKEGEVKEVMCAYNRFEGEPCCGSNRLL 248
Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPE--EAAADAIRAGLDLDCGPFLGLHT 329
+ +R EW +G +VSDC ++ +Y+ + + P+ A+A A+ +G DL+CG
Sbjct: 249 MQILRDEWGFDGIVVSDCGAIADFYNDRGHHTHPDAESASAAAVVSGTDLECGSSYKALV 308
Query: 330 ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEA 389
ES V++GL+SE +N ++ L + LG D + + + V + H LAL+
Sbjct: 309 ES-VKKGLISEEKVNTSVRRLLKARFELGEMD-DLEKVSWAKIPFSVVASAAHDSLALKI 366
Query: 390 ARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI---- 445
AR+ + LL N+ LPL TVAV+GPN++ +V GNY G+ T L G+
Sbjct: 367 ARKSMTLLMNKDNFLPLRR-GGLTVAVMGPNANDSVMQWGNYNGMPPHTVTILDGVRNLL 425
Query: 446 GRYARTIHQQGC 457
G + I++QGC
Sbjct: 426 GADDKLIYEQGC 437
Score = 130 bits (328), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 94/309 (30%), Positives = 147/309 (47%), Gaps = 48/309 (15%)
Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
+++ ++AD I G+ S+E E + DR + LP Q+EL+ + A K
Sbjct: 592 SVEQVKEADIVIFASGISPSLEGEEMGVNLPGFKKGDRTDIELPAVQRELIDALYRAGK- 650
Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
+LV SG PI + AI+ A YPGQ GGTA+A++LFG NP G+LP+T+Y
Sbjct: 651 KIVLVNCSGSPI--GLEPETKKCEAILQAWYPGQQGGTAVAEVLFGDYNPAGRLPVTFYR 708
Query: 581 QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
++ LP E GRTYR+ + ++PFG+G+SYT F + V
Sbjct: 709 N--VSQLPDFE-------DYNMAGRTYRYMQDTPLFPFGYGLSYTTFSYD--------KV 751
Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA 700
LD + +G+ +K+T V V N G +DG + V+
Sbjct: 752 VLDK-----SEVTAGQTLKLT------------VPVTNTGKRDGEEVVQVYLKKQGDAEG 794
Query: 701 PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
P K L AF++V +PAG + ++ + +++ T R+ G ++I +GG+ SL
Sbjct: 795 PVKTLRAFKRVFIPAGQTTNIEFDLKDKELEWWDEQTNTVRVCSGTYDIMVGGSSEDESL 854
Query: 761 HAATLGVIK 769
TL IK
Sbjct: 855 Q-QTLVTIK 862
>gi|410097219|ref|ZP_11292201.1| hypothetical protein HMPREF1076_01379 [Parabacteroides goldsteinii
CL02T12C30]
gi|409224537|gb|EKN17469.1| hypothetical protein HMPREF1076_01379 [Parabacteroides goldsteinii
CL02T12C30]
Length = 805
Score = 259 bits (662), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 232/844 (27%), Positives = 375/844 (44%), Gaps = 151/844 (17%)
Query: 10 LGLI-LLSASSSGLAAREPFACDPKDATT-------------RTLPFCQVSLPIPQRVND 55
GL+ LL+A++S L A +P A PK+ + + +S PI +RV D
Sbjct: 17 FGLLPLLNANTSSLQASKPDA--PKNQKKIYQKGWIDFNKNGKKDIYEDLSQPIDKRVED 74
Query: 56 LIGRLSLQEKVKLL--ISGAAAVPRLGIKGYEW----WSEALHGVSNV-----GPGTKFG 104
L+ +++++EK L I G AV + + EW W + G+ N+ G +
Sbjct: 75 LLKQMTVEEKTCQLGTIYGYGAVLKDTLPTDEWKTRIWKD---GIGNIDEHLNGEWKRTS 131
Query: 105 GDFP--------------------------------------GATSFPQVITTASSFNAT 126
DFP +T FP I +++
Sbjct: 132 LDFPYSNHAEAMNKVQAFFVEETRLGIPADLTNEGIRGLKHEKSTFFPAQIGQGCTWDKE 191
Query: 127 LWEAIGRVVSDEARAMYNGGTAGLT-YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAAS 185
L IGR+ +EA+A+ G T +SP +++ RDPRWGR E+ GED L+G+
Sbjct: 192 LIYEIGRITGEEAKAL------GYTNIYSPILDLSRDPRWGRTVESYGEDSYLAGELGRQ 245
Query: 186 YVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRF-HFNAKVSKQDIEDTFDVPFRMCV 244
V G+Q + +V ++ KHF Y + G D + + S Q++ + PFR+
Sbjct: 246 QVLGIQSN-----RVVSTPKHFAIYGIPG-GGRDCYSRTDPHASPQEVHELHLEPFRIAF 299
Query: 245 MEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTST 304
E MCS+N NG P A ++ +R +W GY+VSD ++ H T
Sbjct: 300 QEAGALGTMCSHNDYNGTPVSASHYLMTELLRNQWGFKGYVVSDSWAIDKNVKFYHIVDT 359
Query: 305 PEEAAADAIRAGLD----LDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMF 360
EEA A + AGL+ + A+Q+GL+ E ++ + L V+ LG+F
Sbjct: 360 EEEAVASELNAGLNVRTFFEQSEVFIEALRRALQKGLVEESTLDQRVREVLYVKFWLGLF 419
Query: 361 DGEPSSQPY---GHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVI 417
D PY L K V + ++E++L AAR+ IVLLKN+ +LPLS + +AVI
Sbjct: 420 D-----DPYVKDTKLADKIVNSDKNREVSLRAARESIVLLKNENNTLPLSKTL-KNIAVI 473
Query: 418 GPNSDVTVTMIGNYAGIACGYTTPLQG----IGRYARTIHQQGCK--------------D 459
GP +D ++ Y T LQG +G ++ +GC +
Sbjct: 474 GPQADEVKSLTSRYGSHNPNVITGLQGLKNLLGENVNLMYAKGCNVRDKNFPQSDVMYFE 533
Query: 460 VACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASK 519
++ + + A++ +++A+ I+ +G D E+ R L L GRQ+ELV V A+
Sbjct: 534 LSDKEKEEIDEAVEIAKKAEVAIIYVGDDFRTIGESRSRVNLDLSGRQKELVRAV-QATG 592
Query: 520 GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWY 579
P +LVL +G P+ + + D + AI+ A YPG+ G A+A++LFG NPGGKL T +
Sbjct: 593 TPVVLVLFNGRPVTLNW--EDANLPAIVEAWYPGEFSGQAVAEVLFGDYNPGGKLSTT-F 649
Query: 580 PQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
P+ + +P +P+ + G+ + G +YPFG+G+SYT F ++N
Sbjct: 650 PKS-VGQIPWA-FPFKPNAT----GKGFARVDGE-LYPFGYGLSYTTF--EISNLQPSAT 700
Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW 699
DG TL V VKN GS G + ++
Sbjct: 701 KIADGD-----------------------TLTVTCKVKNTGSVKGDEVVQLYLNDETSSI 737
Query: 700 AP-HKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSV 758
+ K+L FE+V + G ++ V ++ Y D++ P G+ + G + S
Sbjct: 738 SRFEKELCGFERVALEPGEEKTVTFKVNRRAYGMYNDKNEFVVEP-GKFFLFAGNSSKST 796
Query: 759 SLHA 762
L+A
Sbjct: 797 PLNA 800
>gi|224025503|ref|ZP_03643869.1| hypothetical protein BACCOPRO_02243 [Bacteroides coprophilus DSM
18228]
gi|224018739|gb|EEF76737.1| hypothetical protein BACCOPRO_02243 [Bacteroides coprophilus DSM
18228]
Length = 787
Score = 259 bits (662), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 234/824 (28%), Positives = 368/824 (44%), Gaps = 166/824 (20%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKV----------------------------------- 66
+ Q S PI +RV+DL+GR++L+EK+
Sbjct: 24 YRQASAPIEERVSDLLGRMTLEEKIGQICCPLGWEMYTKVSPDSVTISDKYRQQMDEAPI 83
Query: 67 --------------KLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFP---- 108
K L +G PRL K + + +G F + P
Sbjct: 84 GSYWAVLRADPWTQKTLETGLN--PRLAAKALNALQKYAVEHTRLGIPVLFAEECPHGHM 141
Query: 109 --GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWG 166
GAT FP + AS++N +L +G V+ EAR G G + P ++I R+PRW
Sbjct: 142 AIGATVFPTSMGQASTWNESLIRQMGEVIGLEAR--LQGANIG---YGPVLDIAREPRWS 196
Query: 167 RGQETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFN- 224
R +ET GEDP L+G ++V+G+QG D D V ++ KH AY GV R N
Sbjct: 197 RVEETFGEDPYLTGILGTAFVQGMQGKDFKDGRHVYSTLKHLAAY------GVPRGGHNG 250
Query: 225 --AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLN 282
A + + + D + F+ V GK A+VM SYN ++GVP ++ ++ +R W +
Sbjct: 251 GPADMGLRALLDEYLPGFQRAVEVGKAATVMTSYNSIDGVPCTSNKFLIDSLLRKRWGFD 310
Query: 283 GYIVSDCDSV-GVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEI 341
G++ SD S+ G+ H + E+AA A+ AG D+D G AVQ G + E
Sbjct: 311 GFVYSDLASIDGIA--GAHVAANLEDAAIQAVEAGTDMDLGANAYRRLVKAVQTGKVKES 368
Query: 342 DINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKD----VCTPDHQELALEAARQGIVLL 397
IN A+ N L ++ R+G+F+ QPY + P++ V DH+ LA + AR+G VLL
Sbjct: 369 AINRAVSNVLRLKFRMGLFE-----QPY--VSPEEAARLVNCEDHRMLARKIAREGTVLL 421
Query: 398 KNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG--IACGYTTPLQGIGRY---ARTI 452
KN G LPL ++ +AVIGPN+DV +G+Y T L + R
Sbjct: 422 KNNG-ILPLGKVKR--IAVIGPNADVMYNYLGDYTAPQERSKVVTLLDALRNRMPDVRID 478
Query: 453 HQQGC--KDVACADDQLFGAAIDASRQADATILVMG------------------------ 486
+ +GC +D ++ A++A+R+AD IL +G
Sbjct: 479 YVKGCAIRDTTQSN---IKEAVEAARKADLVILAVGGSSARDFKTKYINTGAATVDSENS 535
Query: 487 --LDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIA 544
L E DRA L L G Q++L+ ++ A++ P + V ++G P+++ A
Sbjct: 536 GILSDMECGEGFDRATLDLLGDQEKLIRAIA-ATEKPLVTVYIAGRPLNMNLASEVSD-- 592
Query: 545 AIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPG 604
A++ A YPG+ GG I D+L G NP G+LPM+ P+ ++ +P+ Y
Sbjct: 593 ALLTAWYPGEQGGNGIVDVLTGEYNPSGRLPMS-VPR-HVGQIPV-----------HYSQ 639
Query: 605 RTYRFYK---GPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVT 661
T R Y G +Y FG+G+SYT F ++ N +S A +
Sbjct: 640 GTLRDYMDCPGKPLYTFGYGLSYTTFAYS-------------------NLKLSATAKAAS 680
Query: 662 H-AKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQ 719
A N + + V N G +DG + ++ A P +L F+K+ + G +
Sbjct: 681 QPAGDNEVMQTITCTVTNTGDRDGDEVVQLYLNDEVSSVAVPPIRLKGFQKIFLKKGESR 740
Query: 720 RVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAA 763
V + + LS+ DR+ G N+ IGG+ ++ L +
Sbjct: 741 EVTFQL-TRQDLSIYDRNMNFTAEPGRFNVMIGGSSDNLPLKGS 783
>gi|399070782|ref|ZP_10749848.1| beta-glucosidase-like glycosyl hydrolase [Caulobacter sp. AP07]
gi|398043993|gb|EJL36849.1| beta-glucosidase-like glycosyl hydrolase [Caulobacter sp. AP07]
Length = 900
Score = 259 bits (661), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 158/418 (37%), Positives = 228/418 (54%), Gaps = 38/418 (9%)
Query: 56 LIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQ 115
L+G+++L EKV+ L++ A A+PRLGI Y WW+E+LHG P T+FP+
Sbjct: 52 LVGQMTLDEKVEQLLNTAPAIPRLGIPAYNWWTESLHGALGALP----------TTNFPE 101
Query: 116 VITTASSFNATLWEAIGRVVSDEARAMYNGGTA---------GLTYWSPNVNIFRDPRWG 166
I A++F+ L + V E R ++ + GL WSPN+NIFRDPRWG
Sbjct: 102 PIGLAATFDPALVHDVAGAVGSEVRGLHTLARSTGRMGRIGGGLDTWSPNINIFRDPRWG 161
Query: 167 RGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAK 226
RGQET GEDP L+ + ++V+G+QG + D +V A+ KHF + R N
Sbjct: 162 RGQETYGEDPFLTARVGVAFVQGVQGPNPDLPQVIATPKHFAVHSGPEST---RHSANVF 218
Query: 227 VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIV 286
VS+ D+EDT+ FR + EG+ SVMC+YN+++G P CA +LK +R WR +GY+V
Sbjct: 219 VSRHDLEDTYLPAFRAAITEGRAGSVMCAYNRIDGQPACASDLLLKDHLRQAWRFDGYVV 278
Query: 287 SDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-------PFLGLHTESAVQRGLLS 339
SDCD+V D + A A A+RAG+D +C L A++R L+S
Sbjct: 279 SDCDAVKDISDHHRYAPDAAAAVAAAMRAGVDNECNGATLTDTDGLADRYREALRRDLIS 338
Query: 340 EIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKN 399
D++ ALV + ++R G G +P + V TP+H LAL AA + +VLLKN
Sbjct: 339 VADVDRALVRLFSARLRTGDLPG--VRKPETPVALASVMTPEHAVLALAAAEKSLVLLKN 396
Query: 400 QGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY-AGIACGYTTPLQGIGR---YARTIH 453
G LPL +AVIGP D T + GNY + ++ + L+G+ R AR +H
Sbjct: 397 DG-VLPLKA--DTRIAVIGPLGDATRVLRGNYSSALSAPPISVLEGLRRAMPSARIVH 451
Score = 122 bits (307), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 86/290 (29%), Positives = 142/290 (48%), Gaps = 50/290 (17%)
Query: 474 ASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTI 523
A+ QAD + V+GL +EAE D+ L +P QQ L+ + K P +
Sbjct: 625 AAAQADVLVAVVGLTSDLEAEESPVEIPGFKGGDKTTLDIPSDQQALLEQAKALGK-PLV 683
Query: 524 LVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEY 583
+V M+G P+++++AK++ +AI+ A YPGQ+GG A+A++L G +NP G+LP+T+Y
Sbjct: 684 VVTMNGSPLNLSWAKDN--ASAILEAWYPGQSGGLAVANVLSGKTNPSGRLPLTFY--RS 739
Query: 584 ITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLD 643
+ +LP P GRTYR+++G VY FG+G+SYT F +
Sbjct: 740 VDDLP-------PFGDYAMAGRTYRYFEGTPVYSFGYGLSYTRFAY-------------- 778
Query: 644 GRHGSINATISGKAIKV-THAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPH 702
+ +KV A L V V+N+G++ G ++ P AP
Sbjct: 779 ------------EPLKVEAAAGGAGKGLRVTTSVRNLGTRSGDEVAQLYLDFPDAPGAPR 826
Query: 703 KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
L F+++ + G + + + + LS VD G RR+ G + + +G
Sbjct: 827 LALRGFQRLTLKPGESRAITFALS-SRDLSSVDPDGQRRVASGRYRVSVG 875
>gi|224538725|ref|ZP_03679264.1| hypothetical protein BACCELL_03619 [Bacteroides cellulosilyticus
DSM 14838]
gi|224519667|gb|EEF88772.1| hypothetical protein BACCELL_03619 [Bacteroides cellulosilyticus
DSM 14838]
Length = 942
Score = 259 bits (661), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 205/680 (30%), Positives = 325/680 (47%), Gaps = 74/680 (10%)
Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLT-YWSPNVNIFRDPRWGRG 168
AT+FP + ++N L IG + EAR + G T ++P +++ RD RWGR
Sbjct: 173 ATNFPTQLGLGHTWNRELIRQIGLITGREARML------GYTNVYAPILDVGRDQRWGRY 226
Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVS 228
+E GE P L + VRG+Q + +VAA+ KHF AY + + ++S
Sbjct: 227 EEVYGESPYLVAELGIEMVRGMQHNH----QVAATGKHFVAYSNNKGAREGMARVDPQMS 282
Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
+++E PF+ + E + VM SYN +GVP L +RGE GY+VSD
Sbjct: 283 PREVEMIHVYPFKRVIKEAGLLGVMSSYNDYDGVPIQGSYYWLTTRLRGEMGFRGYVVSD 342
Query: 289 CDSVGVYYDTQHFTSTP-EEAAADAIRAGLDLDCG----PFLGLHTESAVQRGLLSEIDI 343
D+V Y T+H T+ +EA ++ AGL++ C L V+ G LSE I
Sbjct: 343 SDAVEYLY-TKHSTAKDMKEAVRQSVEAGLNVRCTFRSPDSYVLPLRELVKEGGLSEEVI 401
Query: 344 NNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
N+ + + L V+ +G+FD + G K+V +++ LAL+A+R+ +VLLKN+
Sbjct: 402 NDRVRDILRVKFLVGLFDTPYQTDLAG--ADKEVEKAENESLALQASRESLVLLKNENNV 459
Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGR----YARTIHQQGCK- 458
LPL + +AV GPN+D + +Y +A TT L+GI + A ++ +GC
Sbjct: 460 LPLDINNVKKIAVCGPNADEEGYALTHYGPLAVEVTTVLEGIRQKAEGKAEVLYTKGCDL 519
Query: 459 -DVACADDQL------------FGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPG 505
D + +L A++ +RQAD ++V+G Q E R+ L LPG
Sbjct: 520 VDANWPESELIDYPMTDSEQAEIDKAVENARQADVAVVVLGGGQRTCGENKSRSSLELPG 579
Query: 506 RQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILF 565
RQ +L+ V K P +LVL++G P+ + +A D + AI+ A YPG GGTA+AD+LF
Sbjct: 580 RQLKLLQAVQATGK-PVVLVLINGRPLSINWA--DKFVPAILEAWYPGSKGGTAVADVLF 636
Query: 566 GTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQS---KRYPGRTYRFYK-GPVVYPFGHG 621
G NPGGKL +T +P+ + +P +PS + PG + +Y FG+G
Sbjct: 637 GDYNPGGKLTVT-FPKS-VGQIPFN-FPCKPSSQIDGGKNPGLDGNMSRVNGALYSFGYG 693
Query: 622 MSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGS 681
+SYT F ++ + IS K I V+ V N G
Sbjct: 694 LSYTTFEYS-------------------DIEISPKVITPNQKAT------VRCKVTNTGK 728
Query: 682 KDGAHTLLVFSTPPAGHWAPH-KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTR 740
+ G + ++ + K L FE++H+ G + V + K L ++D+
Sbjct: 729 RAGDEVVQLYVRDILSSVTTYEKNLAGFERIHLQPGETKEVVFTLD-RKQLELLDKHMEW 787
Query: 741 RIPLGEHNIHIGGTKHSVSL 760
+ G+ +I IG + + L
Sbjct: 788 VVEPGDFSIMIGASSEDIRL 807
>gi|386724230|ref|YP_006190556.1| beta-glucosidase [Paenibacillus mucilaginosus K02]
gi|384091355|gb|AFH62791.1| beta-glucosidase [Paenibacillus mucilaginosus K02]
Length = 973
Score = 259 bits (661), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 161/440 (36%), Positives = 237/440 (53%), Gaps = 23/440 (5%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
+ + P+ RVNDLI +L+L EK++L+ A+ RLGI+ Y+ +EA HG++ +G
Sbjct: 9 YWDFNAPLEDRVNDLIAQLTLDEKIELMPQYQIAIERLGIQAYKHGTEAAHGMAWLGE-- 66
Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY--NGGTAGLTYWSPNVNI 159
ATS+PQ I A ++++ L + IG + DEAR Y N GLT W+P V++
Sbjct: 67 --------ATSYPQPIGLACTWDSELLKRIGSAIGDEARGFYKQNPNFNGLTLWAPTVDL 118
Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD 219
RDPRWGR +E GEDPVL+GK AA+ +G+QG LK A+ KHF + N N
Sbjct: 119 ERDPRWGRTEEAYGEDPVLAGKLAAALTQGIQGDHPFYLKAVATLKHF----IGNNNEAG 174
Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
R + + +++ + + F + EG S+M +YN VNGVP ++ ++GEW
Sbjct: 175 RGDTSVSLDPRNLREYYLKVFEITFKEGGAQSMMTAYNAVNGVPANLSELVID-IVKGEW 233
Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIR-AGLD--LDCGPFLGLHTESAVQRG 336
+NG++VSD V H+ T +EA A +IR G+D D + A++ G
Sbjct: 234 GMNGFVVSDAFDVTGTMRDHHYVETLKEAVARSIREGGIDSITDDAAVVTEAIHEALKDG 293
Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
LL+E D++ AL NT V+ RLG FD P PY + + P+H +LA EA+++ +VL
Sbjct: 294 LLTENDLDTALRNTFRVRFRLGEFD-PPERNPYATIDESAILHPEHAKLAREASQKAVVL 352
Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQG 456
LKN G +LPL + VAVIGP +DV Y+G TPLQGI +
Sbjct: 353 LKNDGKTLPLQADKLSKVAVIGPLADVVYQ--DWYSGSLPYAVTPLQGIRERLAAEGKDA 410
Query: 457 CKDVACADDQLFGAAIDASR 476
A D++ AI + R
Sbjct: 411 VTSYAAGTDRMRIKAISSGR 430
Score = 89.7 bits (221), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 85/313 (27%), Positives = 141/313 (45%), Gaps = 55/313 (17%)
Query: 454 QQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEA-EALDRAGLLLPGRQQELVS 512
Q ++ C L AA+ ++R ADA I+ +G I E +DR + L Q++L
Sbjct: 560 QADTFEIECVSSGL-DAAVASARGADAAIVFVGNHPLINGKETMDRPDITLAESQEKLA- 617
Query: 513 KVSMASKGP-TILVLMSGGPIDVAFAKN--DPRIAAIIWAGYPGQAGGTAIADILFGTSN 569
+++ K P TI+V++ P FA N + AII+ + GQ G A+A +LFG N
Sbjct: 618 -LAVIEKNPNTIIVVVGSYP----FALNALQEKAKAIIYTSHAGQELGRAVAAVLFGDVN 672
Query: 570 PGGKLPMTWYPQEYITNLP--MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNF 627
P G+L MTWY + LP M +R RTY++++G +YPFGHG+SY++F
Sbjct: 673 PAGRLNMTWYKS--VDQLPPFMDYDIIRGC-------RTYQYFEGEPLYPFGHGLSYSDF 723
Query: 628 VHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHT 687
+ + +P G+ A + V V N+ +G
Sbjct: 724 RY-----EALQLLPNRAEAGTDTA------------------VQVSCRVTNISDCEGEEV 760
Query: 688 LLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD--------RSG 738
+ ++ A P QL F+++ + AG V + + + L++ D SG
Sbjct: 761 VQLYVRADASRVKRPRLQLADFKRIALAAGESAEVTFQLQL-ESLAIWDVTRDRFCLESG 819
Query: 739 TRRIPLGEHNIHI 751
T + LG ++ +
Sbjct: 820 TYTVMLGASSVDL 832
>gi|389696043|ref|ZP_10183685.1| beta-glucosidase-like glycosyl hydrolase [Microvirga sp. WSM3557]
gi|388584849|gb|EIM25144.1| beta-glucosidase-like glycosyl hydrolase [Microvirga sp. WSM3557]
Length = 751
Score = 259 bits (661), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 223/739 (30%), Positives = 354/739 (47%), Gaps = 88/739 (11%)
Query: 49 IPQRVNDLIGRLSLQEKVKLL-------------ISGAAAVPRLGIKGYE--WWSEALHG 93
I +RVN+L+GR++L+EKV L IS A L E ++AL
Sbjct: 36 IERRVNELLGRMTLEEKVGQLNLVSHGPPLRWEDISEGKAGAVLNFNSAEDVARAQALVR 95
Query: 94 VSNVGPGTKFGGDFPGA--TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLT 151
S++ FG D T FP + A++F+ + R+ S+ A G+
Sbjct: 96 ESHLKIPLLFGLDVLHGFRTQFPLPLGEAAAFSPR----VSRLASE--WAAREASYVGVN 149
Query: 152 Y-WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAY 210
+ ++P ++ RD RWGR E GEDP L A+ V G + + +AA+ KHF Y
Sbjct: 150 WTFAPMADLSRDSRWGRIVEGFGEDPTLGAALTAARVEGFR-----KGGLAAAAKHFAGY 204
Query: 211 DLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNI 270
R + + + ++ DT+ PFR V E AS M ++N +NG P+ A+P +
Sbjct: 205 GAPQGG---RDYDTTYIPRAEMYDTYLPPFRAAV-EAGTASFMAAFNALNGEPSTANPWL 260
Query: 271 LKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC-GPFLGLHT 329
L +R +W +G++ SD +G + + EAA AI AG+D+D G H
Sbjct: 261 LTDVLRTQWGFDGFVTSDWVGIGELVN-HGIAADGAEAARKAILAGVDMDMMGQLYINHL 319
Query: 330 ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEA 389
V+ G + E I+ ++ L + RLG+FD P HL + +P+ ++ A E
Sbjct: 320 PDEVRAGRVPESVIDESVRRVLRTKFRLGLFD-RPDVD-SSHLD-SEFPSPESRQAAREV 376
Query: 390 ARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYA--GIACGYTTPLQGIGR 447
AR+ VLL+N+ LP+ + R++AV+GP +D +G +A G T L+GI R
Sbjct: 377 ARETFVLLQNRDDVLPIPS-KVRSIAVVGPLADAPQDQMGPHAARGHKEDSVTILEGIRR 435
Query: 448 YARTI-----HQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLL 502
A++ H GC D+ C + A++A+RQ+D I V G Q + EA RA +
Sbjct: 436 RAQSAGIAVRHAPGC-DLFCRNTDALPGALEAARQSDFVIAVFGEPQELSGEAASRANME 494
Query: 503 LPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIAD 562
L G+Q E++ +++ K P LV+M G P + + RI +I+ A YPG G A+AD
Sbjct: 495 LNGKQIEVLEELAKTGK-PVALVIMGGRPQVLGPVAD--RIPSILMAWYPGTEAGPAVAD 551
Query: 563 ILFGTSNPGGKLPMTWYPQE------YITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVY 616
+LFG +P GKLP+TW P+ Y LP RP+ + Y +Y
Sbjct: 552 VLFGDVSPSGKLPLTW-PRATGQLPLYYNRLPTG----RPTLANNRFTLHYIDESIAPLY 606
Query: 617 PFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDV 676
PFG G+SYT+F ++ +A I+ + + L V +DV
Sbjct: 607 PFGWGLSYTHFAYS-------------------DARIASRQLDEGQ------VLEVSLDV 641
Query: 677 KNVGSKDGAHTLLVFSTPP-AGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD 735
KN G++DG + +++ P A P ++L AFEK+ + +G +RV + + V +D
Sbjct: 642 KNTGARDGQEVVQLYTRDPVASRSRPLRELKAFEKIALKSGETKRVTLRVPVESLGFHLD 701
Query: 736 RSGTRRIPLGEHNIHIGGT 754
GT + G + +GG+
Sbjct: 702 -DGTYLVEAGAIQVFVGGS 719
>gi|423212854|ref|ZP_17199383.1| hypothetical protein HMPREF1074_00915 [Bacteroides xylanisolvens
CL03T12C04]
gi|392694712|gb|EIY87939.1| hypothetical protein HMPREF1074_00915 [Bacteroides xylanisolvens
CL03T12C04]
Length = 782
Score = 259 bits (661), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 205/685 (29%), Positives = 318/685 (46%), Gaps = 84/685 (12%)
Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
GAT FP I A++++ L + +G+V++ E R+ G + P +++ RDPRW R
Sbjct: 148 GATVFPTGIGMAATWSPELVKEVGQVIAKEIRSQ-----GGHISYGPVLDLTRDPRWSRV 202
Query: 169 QETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
+ET GEDPVLSG AS V GL G + + A+ KHF AY + + A V
Sbjct: 203 EETFGEDPVLSGILGASMVDGLGGGNLSQKYATIATLKHFLAYAVPEGGQNGNY---ASV 259
Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
+D+ F PFR + G + SVM SYN ++G+P ++ +L + +R EW+ G++VS
Sbjct: 260 GIRDLHQNFLPPFRKAIDAGAL-SVMTSYNSIDGIPCTSNHYLLTQLLRNEWKFRGFVVS 318
Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
D S+ +++ T E AA ++ AG+D+D G + AVQ G + + I+ A+
Sbjct: 319 DLYSIEGIHESHFVAPTKENAAIQSVTAGVDVDLGGDAYTNLCHAVQSGQMDKTVIDTAV 378
Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
L ++ +G+F+ P P + K V +H ELA + A+ I LLKN+ LPLS
Sbjct: 379 CRVLRMKFEMGLFE-HPYVDP--KIAAKTVRRKEHIELARKIAQSSITLLKNENSILPLS 435
Query: 408 HIRHRTVAVIGPNSDVTVTMIGNYAGIA--CGYTTPLQGIGRYARTIHQQGCKDVACADD 465
++ VAVIGPN+D M+G+Y T L GI + + A D
Sbjct: 436 KTINK-VAVIGPNADNRYNMLGDYTAPQEDSNVKTVLDGILTKLSPFRVEYVRGCAIRDT 494
Query: 466 QL--FGAAIDASRQAD----------------------ATILVMGLDQSIE-AEALDRAG 500
+ AI+A+R+++ A + G +E E DRA
Sbjct: 495 TVNEIEQAIEAARRSEVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFDRAS 554
Query: 501 LLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
L L GRQQEL+ + K P I+V + G P++ +A A++ A YPGQ GG AI
Sbjct: 555 LSLLGRQQELLESLQKTGK-PLIVVYIEGRPLEKNWASE--YADALLTAYYPGQEGGNAI 611
Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYP-GRTYRFYKGPVVYPFG 619
AD+LFG NP G+LP++ +P + + +K+ P Y +Y FG
Sbjct: 612 ADVLFGDYNPSGRLPIS---------VPRSVGQIPVYYNKKAPRNHDYVEVSSSPLYSFG 662
Query: 620 HGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNV 679
+GMSYT F + S + A+C V VKN
Sbjct: 663 YGMSYTTFEY------------------------SDLQVVQKSARC----FEVSFKVKNT 694
Query: 680 GSKDGAH-TLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSG 738
G DG + L A P KQL FE+ H+ G +++V + + +V+ +
Sbjct: 695 GKYDGEEVSQLYMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTFVLTEEDFF-LVNYTL 753
Query: 739 TRRIPLGEHNIHIGGTKHSVSLHAA 763
+ + G + IG + + L +
Sbjct: 754 KKVVESGTFQVMIGSSSDDIRLEKS 778
>gi|358342292|dbj|GAA27551.2| probable beta-D-xylosidase 7 [Clonorchis sinensis]
Length = 826
Score = 259 bits (661), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 231/827 (27%), Positives = 356/827 (43%), Gaps = 159/827 (19%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAA--------VPRLGIKGYEWWSEALH 92
PF SLP RV+DL+ RL+ +E ++ + +G A + RL I Y+W
Sbjct: 28 PFRNPSLPANFRVDDLLARLTNEELIQQVSNGGAGPQHGPAPGIARLNISAYQW------ 81
Query: 93 GVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY 152
+N G G T FPQ + ++F+ + R E RA +N A TY
Sbjct: 82 -RTNPGDGR--------ITPFPQPVNLGATFDVHTVYRVARATGLEMRARWNRAKAKKTY 132
Query: 153 --------WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG------------ 192
++P VN+ R P WGR QET GEDP + GK A ++VRGL G
Sbjct: 133 RDGNGIHLFAPVVNLLRHPLWGRNQETFGEDPFMIGKLARTFVRGLGGWKNAEPQSLDEQ 192
Query: 193 ---SDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKV 249
S D L V A+CKHF + V R F A V+ D+ T+ FR C+ G V
Sbjct: 193 NLSSQPDVLLVGANCKHFAVHTGPEDFPVSRLSFEANVTDVDLWQTYLPAFRACLEAGAV 252
Query: 250 ASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVG-VYYDTQHFTSTPEEA 308
SVMC+Y+ +NG P C + +L +R +W+ G++V+DC ++ V + Q F E A
Sbjct: 253 -SVMCAYSGINGTPDCINHWLLTELLRQKWKFKGFVVTDCGALQFVIWKHQIFNHYNETA 311
Query: 309 AADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGE---PS 365
A A+RAG++L+ S + L S + L+ + GE
Sbjct: 312 MA-AVRAGVNLENSVVYATEVFSTLPHLLASGSLSRDQLIEMARPLFLTRLMQGEFNPVE 370
Query: 366 SQPYGHLGPKD-VCTPDHQELALEAARQGIVLLKNQGPSLPLSH--------IRHRTVAV 416
PY L P++ + DH+ +AL + IVLL+N+ LPL + +RH +A+
Sbjct: 371 MDPYRLLAPEEAILNEDHRRVALATTARSIVLLQNRDRFLPLKNNMSDSGGPLRH--IAI 428
Query: 417 IGPNSDVTVTMIGNY-AGIACGYTTPL-QGIGRYARTIHQQG-CKD----VACADDQLFG 469
+GP + + G+Y PL +G+ + +R +H C D + DD L
Sbjct: 429 VGPFATSVTELYGHYRTAPEPEIEVPLSKGLSQLSRRMHASDICTDGGRCSSLNDDALHS 488
Query: 470 AAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKG--------- 520
D +L +G +E E +DR + LPG+Q EL+ + S G
Sbjct: 489 TL--GYDDLDLIVLSLGTGSEVEGENVDRQNITLPGKQPELLEETLKLSSGLGNSGLSKR 546
Query: 521 --PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSN--------- 569
P IL++ S GPI+++ A + + AI W G+PG G A+ +L G+S
Sbjct: 547 TVPIILLVFSAGPINISRAVENENVKAIFWCGFPGPLVGDAMRHLLLGSSGELFGPSKPI 606
Query: 570 ------------------------PGGKLPMTWYPQ-EYITNLPMTEMAMRPSQSKRY-P 603
P +LP TWY + + N+ + EM +Q+ RY P
Sbjct: 607 SVGFHSFQEAYRWDVTPDDGYWWIPAARLPFTWYESIDQLANITVYEMT---NQTYRYLP 663
Query: 604 GRTY---RFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKV 660
+ + K PV+YPFG+G+SY NF L G G + + + I
Sbjct: 664 TQCHMSSEDCKIPVLYPFGYGLSY-NF-------------NLSGASGFVYSDL----IAP 705
Query: 661 THAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW-----------APHKQLVAFE 709
+ A + + V V+N G + V++ W P QL FE
Sbjct: 706 SSAVSSNQRIVFYVTVQNEGPIACEEVVQVYT-----KWLNRTENDNSRNGPLIQLAGFE 760
Query: 710 KVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIP-LGEHNIHIGGTK 755
+V + G +++ + ++L+V S IP G I +GG +
Sbjct: 761 RVRLDVGEYKQLKFTLIPSEHLAVWSLSENTMIPGRGVLQISVGGQQ 807
>gi|336404202|ref|ZP_08584900.1| hypothetical protein HMPREF0127_02213 [Bacteroides sp. 1_1_30]
gi|335943530|gb|EGN05369.1| hypothetical protein HMPREF0127_02213 [Bacteroides sp. 1_1_30]
Length = 735
Score = 259 bits (661), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 232/795 (29%), Positives = 369/795 (46%), Gaps = 109/795 (13%)
Query: 8 FFLGLILLSASSSGLAAREPFACDPKDATTRTLP-FCQVSLPIPQRVNDLIGRLSLQEKV 66
LGL LL + S L++++ +++P + PI +R++DLI R++L+EK+
Sbjct: 6 LILGLCLLGVTHS-LSSKD----------KKSIPLYKDAKAPIEKRIDDLISRMTLEEKI 54
Query: 67 KLLI--------------SGAAAVP-RLGIKGYEWWSEALHGV--------SNVGPGTKF 103
L VP +G Y + L S +G F
Sbjct: 55 LQLNQYTLGRNNNVNNVGEEVKKVPSEIGSLIYFDINPELRNSMQKKAMEESRLGIPIIF 114
Query: 104 GGDFPGA--TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIF 160
G D T +P + A S+N L E V + EAR +G+ + +SP +++
Sbjct: 115 GYDAIHGFRTIYPISLGQACSWNPGLVEQACAVSAQEAR------MSGVDWTFSPMIDVA 168
Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVD 219
RDPRWGR E GEDP +G +AA+ VRG QG D ++AA KH+ Y
Sbjct: 169 RDPRWGRVAEGYGEDPYTNGVFAAASVRGYQGDDMSAENRMAACLKHYVGYGASE---AG 225
Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
R + ++S Q + DT+ +P+ M V G A++M S+N ++GVP A+P I+ ++ W
Sbjct: 226 RDYVYTEISAQTLWDTYLLPYEMGVKAG-AATLMSSFNDISGVPGSANPYIMTEILKKRW 284
Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPF-LGLHTESAVQRGLL 338
+ +G+IVSD +V Q +T ++AA A AGL++D H + V+ G +
Sbjct: 285 KHDGFIVSDWGAVE-QLKNQGLAATKKDAAQYAFNAGLEMDMMSHAYDRHLKELVEEGKV 343
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHL-GPKD-VCTPDHQELALEAARQGIVL 396
+ ++ ++ L V+ RLG+F+ +PY + KD P +A + A + +VL
Sbjct: 344 TMAQVDESVRRVLRVKFRLGLFE-----RPYTPVTNEKDRFFRPQSMAVAAQLAAESMVL 398
Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG------IACGYTTPLQGIGRYAR 450
LKN LPL++ + +AV+GP + ++G++ G + Y G A
Sbjct: 399 LKNDNQILPLTN--KKKIAVVGPMAKNGWDLLGSWCGHGKDTDVEMLYDGLTAEFGGDAE 456
Query: 451 TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQEL 510
+ GCK D F A+D +R +D I+ +G + E R+ + LP Q+EL
Sbjct: 457 LRYAMGCKPQG-NDRSGFAGALDVARWSDVVIVCLGEMLTWSGENASRSTIALPQIQEEL 515
Query: 511 VSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNP 570
V ++ A K P ILVL +G P+++ + +P AI+ PG G ++A IL G NP
Sbjct: 516 VKELKEAGK-PVILVLSNGRPLEL--NRMEPLCDAILEIWQPGINGARSMAGILSGRINP 572
Query: 571 GGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYR-FYKGPV---VYPFGHGMSYTN 626
GKL MT P + + ++R GR ++ FYK +YPFGHG+SYT
Sbjct: 573 SGKLAMT---------FPYSTGQIPIYYNRRKSGRGHQGFYKDITSDPLYPFGHGLSYTE 623
Query: 627 FVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAH 686
F + T++ A KV L +V V N G++DGA
Sbjct: 624 FKY---------------------GTVTPSATKVKRGD----KLSAEVTVTNTGARDGAE 658
Query: 687 TLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLG 745
T+ F S P P K+L FEK + AG + +I + + V+ G R + G
Sbjct: 659 TVHWFISDPYCSITRPVKELKHFEKQLIKAGETKTFRFDIDLERDFGFVNEDGKRFLEAG 718
Query: 746 EHNIHIGGTKHSVSL 760
E++I + G + L
Sbjct: 719 EYHILVQGQTVKIEL 733
>gi|423289665|ref|ZP_17268515.1| hypothetical protein HMPREF1069_03558 [Bacteroides ovatus
CL02T12C04]
gi|423298158|ref|ZP_17276217.1| hypothetical protein HMPREF1070_04882 [Bacteroides ovatus
CL03T12C18]
gi|392663699|gb|EIY57246.1| hypothetical protein HMPREF1070_04882 [Bacteroides ovatus
CL03T12C18]
gi|392667376|gb|EIY60886.1| hypothetical protein HMPREF1069_03558 [Bacteroides ovatus
CL02T12C04]
Length = 955
Score = 259 bits (661), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 215/759 (28%), Positives = 357/759 (47%), Gaps = 99/759 (13%)
Query: 20 SGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISG--AAAVP 77
S L A + + + T R + SLP+ +RV L+ ++ ++K++L+ G +P
Sbjct: 151 SVLVAEKVLKDNKGEVTDRR--YMDASLPVEERVESLLAVMTPEDKMELIREGWGIPGIP 208
Query: 78 RLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSD 137
L + EA+HG S +G GAT FPQ + +++N L E + V+ D
Sbjct: 209 HLYVPPITK-VEAVHGFS-------YGS---GATIFPQALAMGATWNRKLTEEVAMVIGD 257
Query: 138 EARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR 197
E + N A WSP +++ +D RWGR +ET GEDPVL + ++++G Q
Sbjct: 258 ET-VVANTKQA----WSPVLDVAQDARWGRCEETFGEDPVLVSQIGGAWIKGYQSRG--- 309
Query: 198 LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYN 257
+ + KHF + G D +S++++ + VPFR V S+M +Y+
Sbjct: 310 --LFTTPKHFGGHGAP-LGGRDSHDIG--LSEREMREVHLVPFRHVVRNYDCQSLMMAYS 364
Query: 258 QVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPE-EAAADAIRAG 316
G+P +L++ +R EW NG+IVSDC ++G +H+T+ + EAA A+ AG
Sbjct: 365 DYMGIPVAGSTELLQQILRQEWGFNGFIVSDCGAIGNLTARKHYTAKDKIEAANQALAAG 424
Query: 317 LDLDCG-PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQ-PYGHLGP 374
+ +CG + A + G ++ ++++N L R +F+ P + + P
Sbjct: 425 IATNCGDTYNDKEVIQAAKDGRINMVNLDNVCRTMLATMFRNELFEKNPCKPLDWNKIYP 484
Query: 375 KDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSD-------VTVTM 427
+ H+E+A +AAR+ IV+L+N+ LPLS +T+AV+GP +D
Sbjct: 485 -GWNSDRHREMARQAARESIVMLENKDNLLPLSKTL-KTIAVLGPGADDLQPGDYTPKLQ 542
Query: 428 IGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGL 487
G + G +G+ + +++QGC D D A+ A+ Q+D ++V+G
Sbjct: 543 PGQLKSVLSGIKA---AVGKQTKVLYEQGC-DFTTPDATNIPKAVKAASQSDVVVMVLGD 598
Query: 488 DQSIEA---------EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAK 538
+ EA E D A L+LPG+QQEL+ V K P +L+L +G P D+ A
Sbjct: 599 CSTSEATNNVRKTCGENNDWATLILPGKQQELLEAVCATGK-PVVLILQAGRPYDLLKAS 657
Query: 539 NDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQ 598
+ + W PGQ GG A AD+LFG NPGG+LPMT +P+ ++ LP+ +
Sbjct: 658 EMCKAILVNW--LPGQEGGPATADVLFGDYNPGGRLPMT-FPR-HVGQLPL--YYNFKTS 711
Query: 599 SKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAI 658
+RY FY +Y FG+G+SYT+F ++ I +G +
Sbjct: 712 GRRYEYVDMEFYP---LYRFGYGLSYTSFEYS---------------DLKIQEKSNGNVM 753
Query: 659 KVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHK----QLVAFEKVHVP 714
VQ VKNVG G ++ T +A K +L F ++H+
Sbjct: 754 -------------VQATVKNVGGCAGDEVAQLYIT---DMYASVKTRVMELKDFTRIHLQ 797
Query: 715 AGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
G + V + +S+++ R + GE + +GG
Sbjct: 798 PGESKNVSFELTPYD-ISLLNDRMDRVVEKGEFKVMVGG 835
>gi|329954674|ref|ZP_08295734.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
gi|328527215|gb|EGF54219.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
Length = 855
Score = 259 bits (661), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 212/699 (30%), Positives = 324/699 (46%), Gaps = 95/699 (13%)
Query: 88 SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT 147
+EALHG ++ G+T +PQ I S+FN L ++++ E A G
Sbjct: 120 AEALHGSAH-----------EGSTIYPQNIALGSTFNPDLAYRKAKMIAGELHAQ---GM 165
Query: 148 AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHF 207
+ +P V++ RD RWGR +E+ GEDP L G + + V+G Q ++ KHF
Sbjct: 166 KQIL--APCVDVVRDLRWGRVEESYGEDPFLCGLFGIAEVKGYQEQG-----ISPMLKHF 218
Query: 208 TAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCAD 267
+ + G++ V +D+ D + PF M + + +VM +YN N +P A
Sbjct: 219 GPHG-NPLGGLNLASVECGV--RDLHDIYLKPFEMIIKSQPLKAVMSTYNSWNRIPNSAS 275
Query: 268 PNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDL----DCGP 323
+L +R +W GY+ SD ++ + H ++PEE A A+ AGLD+ +C P
Sbjct: 276 HYLLTDVLRKQWGFTGYVYSDWGAIEMLQTFHHTANSPEECALQALMAGLDVEASSECYP 335
Query: 324 FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQ 383
L + AV+ G L I+ A+ LT + G+F+ +P + Y V + +
Sbjct: 336 AL----KQAVKEGRLPVSYIDEAVRRVLTAKFETGLFE-DPFGEKY---RAGKVHSEESV 387
Query: 384 ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGY--TTP 441
LA E A + +VLLKN+G LPL R VAVIGPN+D G+Y TP
Sbjct: 388 RLAREIADESVVLLKNEGKLLPLDEKRLDAVAVIGPNAD--QVQFGDYTWSRSNKDGVTP 445
Query: 442 LQGI----GRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEA---- 493
LQGI G + + +GC D+ D L A++A+R++D IL +G + A
Sbjct: 446 LQGIRRLVGNKVKVHYAKGC-DMMSPDTSLIAKAVEAARKSDVAILFVGSSSASLARDYS 504
Query: 494 -----EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIW 548
E D L L G Q L+ V A+ P +LVL+SG P +++ K I AI+
Sbjct: 505 NTNCGEGFDLNDLTLAGAQSGLIKAV-YATGTPVVLVLVSGKPFVLSWEKE--HIPAILA 561
Query: 549 AGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQE------YITNLPMTEMAMRPSQSKRY 602
Y G+ G +IADILFG NP G L + +PQ Y +LP + + S
Sbjct: 562 QWYAGEQEGNSIADILFGRVNPSGHLTFS-FPQSTGHLPVYYNHLPSDKGFYKKPGSYSE 620
Query: 603 PGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTH 662
PGR Y F ++ FGHG+SYT+FV I+ ++ +I V
Sbjct: 621 PGRDYVFSSPAALWSFGHGLSYTDFVFE-----------------GIDTSLRNDSIYV-- 661
Query: 663 AKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRV 721
KC ++NVG++ G + ++ P +QL AFEKV + AG V
Sbjct: 662 -KCT---------LRNVGNRSGKEVVQLYVRDLVSSVVMPVQQLKAFEKVGLKAGESSEV 711
Query: 722 GINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
+++ V + L++ D G R I G +G + L
Sbjct: 712 QLSLPVSE-LAISDNDGKRVIEPGAFEFQLGNASDRILL 749
>gi|160887545|ref|ZP_02068548.1| hypothetical protein BACOVA_05565 [Bacteroides ovatus ATCC 8483]
gi|156107956|gb|EDO09701.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus ATCC 8483]
Length = 736
Score = 258 bits (660), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 207/696 (29%), Positives = 317/696 (45%), Gaps = 102/696 (14%)
Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
GAT FP I A++++ L + +G+V++ E R+ G + P +++ RDPRW R
Sbjct: 102 GATVFPTGIGMAATWSPELVKEVGQVIAKEIRSQ-----GGHISYGPVLDLTRDPRWSRV 156
Query: 169 QETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
+ET GEDPVLSG AS V GL G + + A+ KHF AY + + A V
Sbjct: 157 EETFGEDPVLSGILGASMVDGLGGGNLSQKYATIATLKHFLAYAVPEGGQNGNY---ASV 213
Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
+D+ F PFR + G + SVM SYN ++G P ++ +L + +R EW+ G++VS
Sbjct: 214 GIRDLHQNFLPPFRKAIDAGAL-SVMTSYNSIDGTPCTSNHYLLTQLLRNEWKFRGFVVS 272
Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
D S+ +++ T E AA ++ AG+D+D G + AVQ G + + I+ A+
Sbjct: 273 DLYSIEGIHESHFVAPTKENAAIQSVMAGVDVDLGGDAYTNLCHAVQSGQMDKTVIDTAV 332
Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
L ++ +G+F+ P P + K V +H ELA + A+ I LLKN+ LPLS
Sbjct: 333 CRVLRMKFEMGLFE-HPYVDP--KIAAKTVRRKEHIELARKIAQSSITLLKNENSILPLS 389
Query: 408 HIRHRTVAVIGPNSDVTVTMIGNYAGIA--CGYTTPLQGIGRYARTIHQQGCKDVACADD 465
++ VAVIGPN+D M+G+Y T L GI + + A D
Sbjct: 390 KTINK-VAVIGPNADNRYNMLGDYTAPQEDSNVKTVLDGILTKLSPFRVEYVRGCAIRDT 448
Query: 466 QL--FGAAIDASRQAD----------------------ATILVMGLDQSIE-AEALDRAG 500
+ AI A+R+++ A + G +E E DRA
Sbjct: 449 TVNEIEQAIKAARRSEVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFDRAS 508
Query: 501 LLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
L L GRQQEL+ + K P I+V + G P++ +A A++ A YPGQ GG AI
Sbjct: 509 LSLLGRQQELLESLQKTGK-PLIVVYIEGRPLEKNWASE--YADALLTAYYPGQEGGNAI 565
Query: 561 ADILFGTSNPGGKLPMT----------WYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFY 610
AD+LFG NP G+LP++ +Y ++ N EM+ P
Sbjct: 566 ADVLFGDYNPSGRLPISVPRSVGQIPVYYNKKAPRNHDYVEMSSFP-------------- 611
Query: 611 KGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTL 670
+Y FG+GMSYT F + S + A+C
Sbjct: 612 ----LYSFGYGMSYTTFEY------------------------SDLQVVQKSARC----F 639
Query: 671 GVQVDVKNVGSKDGAH-TLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCK 729
V VKN G DG + L A P KQL FE+ H+ G +++V +
Sbjct: 640 EVSFKVKNTGKYDGEEVSQLYMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTFVLTEED 699
Query: 730 YLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATL 765
+ +V+ + + + G ++ IG + + L L
Sbjct: 700 FF-LVNYTLKKVVESGNFHLMIGAASNDIRLQNVIL 734
>gi|354724530|ref|ZP_09038745.1| beta-D-glucoside glucohydrolase [Enterobacter mori LMG 25706]
Length = 765
Score = 258 bits (660), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 208/689 (30%), Positives = 334/689 (48%), Gaps = 102/689 (14%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN + +GRV + Y GL W+P V++ RDPRWGR
Sbjct: 118 TVFPISLGLASSFNLDAVKTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V +QG S DR V S KHF AY +N VD
Sbjct: 172 EGFGEDTYLTATMGKTMVEAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R +W G
Sbjct: 227 --MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDQWGFKGI 283
Query: 285 IVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T S PE+A A+++G+++ + + V+ G ++ +
Sbjct: 284 TVSDHGAIKEL--IKHGTASDPEDAVRVALKSGINMSMSDEYYSKYLPGLVKSGKVTMAE 341
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPKD D H++ A + AR+ +V
Sbjct: 342 LDDAARHVLNVKYDMGLFN-----DPYSHLGPKDSDPADTNAESRLHRKEARDVARESLV 396
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + T+AV+GP +D ++G++ AG+A T L GI G A
Sbjct: 397 LLKNRLDTLPLK--KSGTIAVVGPLADSKRDVMGSWSAAGVADQSVTVLTGIKNAVGENA 454
Query: 450 RTIHQQGC-----KDVACADDQ--------------LFGAAIDASRQADATILVMGLDQS 490
+ ++ +G KD+ +Q + A++ ++Q+D + V+G Q
Sbjct: 455 KVVYAKGANVTDDKDIVTFLNQYEEAVKVDPRTPKEMIDEAVNTAKQSDVVVAVVGEAQG 514
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 515 MAHEASSRTDIPIPQSQRDLIAALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM---RP---SQSKRYPG 604
+ G GG AIAD+LFG NP GKLPM+ +P+ + +P+ + RP + +Y
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPMS-FPRS-VGQIPVYYSHLNTGRPYNADKPNKYTS 629
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F V++ ++ P R G + A+
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--KVSDVK--MSAPTMKRDGKVTAS------------ 672
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGI 723
VDV N G ++GA + ++ A P KQL F+KV + G + V
Sbjct: 673 ---------VDVTNTGKREGATVIQMYVQDVTASMSRPVKQLRGFDKVSLKPGETKTVSF 723
Query: 724 NIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
I V L ++ + G+ N+ IG
Sbjct: 724 PIDV-DALKFWNQQMKYVVEPGKFNVFIG 751
>gi|295098160|emb|CBK87250.1| beta-glucosidase [Enterobacter cloacae subsp. cloacae NCTC 9394]
Length = 765
Score = 258 bits (660), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 214/692 (30%), Positives = 337/692 (48%), Gaps = 108/692 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN + +GRV + Y GL W+P V++ RDPRWGR
Sbjct: 118 TVFPISLGLASSFNLDAVKTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V +QG S DR V S KHF AY +N VD
Sbjct: 172 EGFGEDTYLTATMGKTMVEAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R +W G
Sbjct: 227 --MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDQWGFKGI 283
Query: 285 IVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T S PE+A A+++G+++ + + V+ G ++ +
Sbjct: 284 TVSDHGAIKEL--IKHGTASDPEDAVRVALKSGINMSMSDEYYSKYLPGLVKSGKVTMAE 341
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPKD D H++ A E AR+ +V
Sbjct: 342 LDDAARHVLNVKYDMGLFN-----DPYSHLGPKDSDPADTNAESRLHRKEAREVARESLV 396
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + T+AV+GP +D ++G++ AG+A T L GI G A
Sbjct: 397 LLKNRLDTLPLK--KSGTIAVVGPLADSKRDVMGSWSAAGVADQSVTVLTGIKSAVGDNA 454
Query: 450 RTIHQQGC-----KDVACADDQ--------------LFGAAIDASRQADATILVMGLDQS 490
+ ++ +G KD+ +Q + A++A++Q+D I V+G Q
Sbjct: 455 KVVYAKGANVTNDKDIVTFLNQYEEAVKVDPRTPKEMIDEAVNAAKQSDVVIAVVGEAQG 514
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 515 MAHEASSRTDITIPQSQRDLIAALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM---RP---SQSKRYPG 604
+ G GG AIAD+LFG NP GKLPM+ +P+ + +P+ + RP + +Y
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPMS-FPRS-VGQIPVYYSHLNTGRPYNADKPNKYTS 629
Query: 605 RTYRFYKGPVVYPFGHGMSYTNF-VHTVA-NAPTVVAVPLDGRH-GSINATISGKAIKVT 661
R + GP +YPFG+G+SYT F V V +APT+ +H G + A+
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTFKVSDVKMSAPTL-------KHDGKVTAS--------- 672
Query: 662 HAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQR 720
V+V N G ++GA + ++ A P KQL FEKV++ G +
Sbjct: 673 ------------VEVTNSGKREGATVIQMYIQDVTASMSRPVKQLRGFEKVNLKPGETRT 720
Query: 721 VGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
V I V L ++ G+ N+ IG
Sbjct: 721 VSFPIDVNA-LKFWNQQMKYDAEPGKFNVFIG 751
>gi|423287910|ref|ZP_17266761.1| hypothetical protein HMPREF1069_01804 [Bacteroides ovatus
CL02T12C04]
gi|392671925|gb|EIY65396.1| hypothetical protein HMPREF1069_01804 [Bacteroides ovatus
CL02T12C04]
Length = 782
Score = 258 bits (660), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 207/696 (29%), Positives = 318/696 (45%), Gaps = 102/696 (14%)
Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
GAT FP I A++++ L + +G+V++ E R+ G + P +++ RDPRW R
Sbjct: 148 GATVFPTGIGMAATWSLELVKEVGQVIAKEIRSQ-----GGHISYGPVLDLTRDPRWSRV 202
Query: 169 QETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
+ET GEDPVLSG AS V GL G + + A+ KHF AY + + A V
Sbjct: 203 EETFGEDPVLSGILGASMVDGLGGGNLSQKYATIATLKHFLAYAVPEGGQNGNY---ASV 259
Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
+D+ F PFR + G + SVM SYN ++G+P ++ +L + +R EW+ G++VS
Sbjct: 260 GIRDLHQNFLPPFRKAIDSGAL-SVMTSYNSIDGIPCTSNHYLLTQLLRNEWKFCGFVVS 318
Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
D S+ +++ T E AA ++ AG+D+D G + AVQ G + + I+ A+
Sbjct: 319 DLYSIEGIHESHFVALTKENAAIQSVTAGVDVDLGGDAYTNLCHAVQSGQMDKAVIDTAV 378
Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
L ++ +G+F+ P P + K V +H ELA + A+ I LLKN+ LPLS
Sbjct: 379 CRVLRMKFEMGLFE-HPYVDP--KIAAKTVRRKEHIELARKIAQSSITLLKNENSILPLS 435
Query: 408 HIRHRTVAVIGPNSDVTVTMIGNYAGIA--CGYTTPLQGIGRYARTIHQQGCKDVACADD 465
++ VAVIGPN+D M+G+Y T L GI + + A D
Sbjct: 436 KTINK-VAVIGPNADNRYNMLGDYTAPQEDSNVKTVLDGILTKLSPFRVEYVRGCAIRDT 494
Query: 466 QL--FGAAIDASRQAD----------------------ATILVMGLDQSIEA-EALDRAG 500
+ AI A+R+++ A + G +E E DRA
Sbjct: 495 TVNEIEQAIKAARRSEVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFDRAS 554
Query: 501 LLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
L L GRQQEL+ + K P I+V + G P++ +A A++ A YPGQ GG AI
Sbjct: 555 LSLLGRQQELLESLQKTGK-PLIVVYIEGRPLEKNWASE--YADALLTAYYPGQEGGNAI 611
Query: 561 ADILFGTSNPGGKLPMT----------WYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFY 610
AD+LFG NP G+LP++ +Y ++ N EM+ P
Sbjct: 612 ADVLFGDYNPSGRLPISVPRSVGQIPVYYNKKAPRNHDYVEMSSFP-------------- 657
Query: 611 KGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTL 670
+Y FG+GMSYT F + S + A+C
Sbjct: 658 ----LYSFGYGMSYTTFEY------------------------SDLQVVQKSARC----F 685
Query: 671 GVQVDVKNVGSKDGAH-TLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCK 729
V VKN G DG + L A P KQL FE+ H+ G +++V +
Sbjct: 686 EVSFKVKNTGKYDGEEVSQLYMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTFVLTEED 745
Query: 730 YLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATL 765
+ +V+ + + + G ++ IG + + L L
Sbjct: 746 FF-LVNYTLKKVVESGNFHLMIGAASNDIRLQNVIL 780
>gi|281422547|ref|ZP_06253546.1| beta-glucosidase [Prevotella copri DSM 18205]
gi|281403371|gb|EFB34051.1| beta-glucosidase [Prevotella copri DSM 18205]
Length = 869
Score = 258 bits (660), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 157/453 (34%), Positives = 238/453 (52%), Gaps = 28/453 (6%)
Query: 34 DATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHG 93
+A+ + LP+ +L +R DL GRL+L+EK L++ + A+PRLGIK ++WWSEALHG
Sbjct: 20 NASAQQLPYQNPALSAHERAVDLCGRLTLEEKASLMLDDSPAIPRLGIKRFQWWSEALHG 79
Query: 94 VSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN-----GGTA 148
V+N+G T FP+ I A+SFN + + SDE RA +N GG
Sbjct: 80 VANMG----------DVTVFPEPIGMAASFNDRMVYRVFDATSDEMRAKWNELQQKGGDV 129
Query: 149 ----GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAAS 203
L+ W+PNVNIFRDPRWGRGQET GEDP L+ + + VRGLQG D K+ A
Sbjct: 130 TRFHALSVWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGCAVVRGLQGPEDTKYRKLWAC 189
Query: 204 CKHFTAYDLDNWNGVDRFHFN-AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGV 262
KH+ + W R N V+ +D+ +T+ F+ V + KV VMC+Y + +
Sbjct: 190 AKHYAIHSGPEWA---RHTDNITDVTPRDLWETYMPAFKSLVQDAKVREVMCAYQRWDDE 246
Query: 263 PTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG 322
P C + +L++ +R EW +VSDC +V +++ +S AAA + AG D++CG
Sbjct: 247 PCCGNTRLLQQILRDEWGFKYLVVSDCGAVTDFWENHKVSSNARNAAAKGVLAGTDVECG 306
Query: 323 -PFLGLHTESAVQRGLLS-EIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTP 380
++ AV+ G L+ E +++ ++ L + LG D + + + +C+
Sbjct: 307 FNYIYKSVPEAVKYGALTEEEEVDKHVIRLLEGRFDLGEMD-DNKIVSWSKIPVSVLCSK 365
Query: 381 DHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTT 440
H++L+L+ A Q + LL+N+ LPL + + +A IGPN D M GNY G T
Sbjct: 366 AHRQLSLDMALQTMTLLQNKNEVLPLDK-KVKKIAFIGPNVDNEPMMWGNYNGTPRQTIT 424
Query: 441 PLQGIGRYARTIHQQGCKDVACADDQLFGAAID 473
L GI + K +DQ + D
Sbjct: 425 ILDGIKSRLKKNQVVTFKGCDLVNDQTLDSYFD 457
Score = 116 bits (290), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 92/311 (29%), Positives = 137/311 (44%), Gaps = 50/311 (16%)
Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
+ A++ + + + V G+ +E E + DR + LP Q+ + + A
Sbjct: 596 YQASLKQLKDCETVVFVGGISPQLEGEEMPIEISGFKGGDRTNIELPKVQRNFLKALKEA 655
Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
K + V SG I A AI+ A YPGQ GG A+A +LFG NP GKLP+T
Sbjct: 656 GK-KVVFVNCSGSAI--ALTPETESCDAILQAWYPGQEGGEAVARVLFGEYNPAGKLPIT 712
Query: 578 WYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
+Y LP + +M+ GRTYR Y ++PFG+G+SYT+F
Sbjct: 713 FYKNS--EQLPDFKDYSMK--------GRTYR-YMNDALFPFGYGLSYTSF--------- 752
Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA 696
R G +AT+S +K K ++TL +V V NVG KDG + V+ PA
Sbjct: 753 --------RMG--DATLSNSILK----KGEKITL--KVPVSNVGKKDGTEIVQVYVKDPA 796
Query: 697 GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKH 756
P K L AFE+V V AG I + + + T R G + ++ G +
Sbjct: 797 DTEGPLKSLKAFERVEVKAGKTAEAVITLDSRNFELFDAATNTVRAKAGNYEVYYGSSSA 856
Query: 757 SVSLHAATLGV 767
L + +
Sbjct: 857 DKDLKKLDVSI 867
>gi|160891087|ref|ZP_02072090.1| hypothetical protein BACUNI_03534 [Bacteroides uniformis ATCC 8492]
gi|156859308|gb|EDO52739.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
uniformis ATCC 8492]
Length = 865
Score = 258 bits (660), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 168/432 (38%), Positives = 235/432 (54%), Gaps = 32/432 (7%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
+ SLP +R L+ L+L+EKV L++ + V RLGIK Y WW+EALHGV+ G
Sbjct: 23 YKNASLPAEERAGLLLKELTLEEKVSLMMDSSKPVERLGIKPYNWWNEALHGVARAGL-- 80
Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYW 153
AT FPQ I A+SF+ + VSDEARA T+ GLT W
Sbjct: 81 --------ATVFPQPIGMAASFSPETVYEVFTAVSDEARAKNAYYTSQESHERYQGLTMW 132
Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDL 212
+P VNI+RDPRWGRG ET GEDP L+ + V+GLQG+ DG K+ A KHF +
Sbjct: 133 TPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGTNDGKYDKLHACAKHFAVHSG 192
Query: 213 DNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
WN R FNA+ + +D+ +T+ PF V EGKV VMC+YN+ G P C +L
Sbjct: 193 PEWN---RHEFNAENIKPRDLYETYLPPFEALVKEGKVKEVMCAYNRFEGDPCCGSDRLL 249
Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCGPFLGLHT 329
+ +R EW +G ++SDC ++ +Y + P E A+A A+ +G DL+CG
Sbjct: 250 MQILRDEWGFDGIVLSDCGAIADFYRDYGHKTHPDAESASAAAVLSGTDLECGSSYEALV 309
Query: 330 ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEA 389
E AV++G + E ++ A+ LT + LG D EP + + V + H LAL+
Sbjct: 310 E-AVKQGKIDEKAVDVAVKRLLTARFALGEMD-EPEKVSWTGIPFSVVASAGHDSLALDM 367
Query: 390 ARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI---- 445
AR+ + LL N+ +LPL T+AV+GPN++ +V GNY G+ T L GI
Sbjct: 368 ARKSMTLLMNKDNTLPLKR-GGLTIAVMGPNANDSVMQWGNYNGMPPHTVTILDGIRKAL 426
Query: 446 GRYARTIHQQGC 457
G R I++QGC
Sbjct: 427 GSDDRLIYEQGC 438
Score = 124 bits (311), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 93/300 (31%), Positives = 144/300 (48%), Gaps = 47/300 (15%)
Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
+++ + AD + V G+ ++E E + DR + LP Q+EL++ + A K
Sbjct: 594 SVERVKDADIVVFVGGVSPNLEGEEMGVELPGFRGGDRTDIELPAVQRELIAALHHAGK- 652
Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
+LV SG PI R AI+ A YPGQAGGTA+A++LFG NP G+LP+T+Y
Sbjct: 653 KVVLVNCSGSPI--GLEPETGRCGAILQAWYPGQAGGTAVAEVLFGDYNPAGRLPVTFYR 710
Query: 581 QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
++ LP E GRTYR+ ++PFGHG+SYT+F + AV
Sbjct: 711 N--VSQLPDFE-------DYNMTGRTYRYMTQEPLFPFGHGLSYTSFSYG--------AV 753
Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA 700
L GS N SG+ ++++ V V N G DG + V+
Sbjct: 754 VL----GSDNIK-SGEKLRLS------------VPVTNTGKCDGEEVVQVYLKKNDDVEG 796
Query: 701 PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
P K L AF++VH+PAG V ++ + + +S + G + + +GG+ + L
Sbjct: 797 PSKALRAFKRVHIPAGKTVDVEFDLGDKELVWWNPQSNAMCVSEGSYELMVGGSSQTADL 856
>gi|298480647|ref|ZP_06998843.1| beta-glucosidase [Bacteroides sp. D22]
gi|298273081|gb|EFI14646.1| beta-glucosidase [Bacteroides sp. D22]
Length = 862
Score = 258 bits (659), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 168/454 (37%), Positives = 242/454 (53%), Gaps = 35/454 (7%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
+ +SL +R L+ L+L+EKV L++ G+ AV RLGIK Y WW+EALHGV+ G
Sbjct: 22 YKTISLSPEKRAELLLKELTLEEKVSLMMDGSKAVERLGIKPYNWWNEALHGVARAGL-- 79
Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA--MYNGGTA------GLTYW 153
AT FPQ I A+SF+ + VSDEARA Y GLT W
Sbjct: 80 --------ATVFPQPIGMAASFSPQTVYGVFDAVSDEARAKNTYYASQGSYERYQGLTMW 131
Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL 212
+P VN++RDPRWGRG ET GEDP L+ + V+GLQG +DG K+ A KHF +
Sbjct: 132 TPTVNVYRDPRWGRGIETYGEDPYLTSRMGVEVVKGLQGGNDGKYDKLHACAKHFAVHSG 191
Query: 213 DNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
WN R FN + + +D+ +T+ PF V E KV VMC+YN+ G P C +L
Sbjct: 192 PEWN---RHEFNVENIKPRDLYETYLPPFEALVKEAKVKEVMCAYNRFEGDPCCGSDRLL 248
Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCGPFLGLHT 329
+ +R EW +G ++SDC ++ +Y+ + + P E A+A A+ +G DL+CG
Sbjct: 249 MQILRNEWGFDGIVLSDCGAIADFYNDRGHKTHPDAESASAAAVLSGTDLECGSSYKALI 308
Query: 330 ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEA 389
E AV+ G + E D++ A+ L + LG D EP + + V + H LAL+
Sbjct: 309 E-AVKLGKIDEKDVDVAVKRLLAARFALGEMD-EPDKVSWTKIPFSVVASEKHDSLALDI 366
Query: 390 ARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA 449
AR+ + LL N+ LPL T+AV+GPN++ +V GNY G+ T L G+ +
Sbjct: 367 ARKSMTLLMNKNDILPLKR-GGLTIAVMGPNANDSVMQWGNYNGMPSHTVTVLDGVRKAL 425
Query: 450 ----RTIHQQGCKDVACADDQLFGAAIDASRQAD 479
+ I++QGC D L +A + + AD
Sbjct: 426 DANDKLIYEQGC---GWVDRSLIQSAFNLCKSAD 456
Score = 128 bits (322), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 95/300 (31%), Positives = 143/300 (47%), Gaps = 47/300 (15%)
Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
+++ + AD I V G+ S+E E + DR + LP Q+EL++ + + K
Sbjct: 592 SVEKVKGADVVIFVGGISPSLEGEEMGVNLPGFKRGDRTDIELPAVQRELIAALHHSGK- 650
Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
ILV SG PI + AI+ A YPGQAGGTA+A++LFG NP G+LP+T+Y
Sbjct: 651 KVILVNCSGSPI--GLEPETKKCEAILQAWYPGQAGGTAVAEVLFGKYNPAGRLPVTFYR 708
Query: 581 QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
++ LP E GRTYRF ++PFG+G+SYT F + +
Sbjct: 709 N--VSQLPDFE-------DYDMSGRTYRFMTQEPLFPFGYGLSYTTFEYG--------EM 751
Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA 700
LD INA G+ +K+T V + N G +DG + V+
Sbjct: 752 TLD--KNKINA---GEVLKLT------------VPITNTGKRDGEEVVQVYLRKHGDTDG 794
Query: 701 PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
P K L AFE+V++PAG V + + S T R+ G+++I +G + + L
Sbjct: 795 PQKTLRAFERVNIPAGETVNVDFELKDRELEWWDVTSNTVRVCPGDYDIMVGSSSQTEDL 854
>gi|423303577|ref|ZP_17281576.1| hypothetical protein HMPREF1072_00516 [Bacteroides uniformis
CL03T00C23]
gi|423307700|ref|ZP_17285690.1| hypothetical protein HMPREF1073_00440 [Bacteroides uniformis
CL03T12C37]
gi|392687941|gb|EIY81232.1| hypothetical protein HMPREF1072_00516 [Bacteroides uniformis
CL03T00C23]
gi|392689569|gb|EIY82846.1| hypothetical protein HMPREF1073_00440 [Bacteroides uniformis
CL03T12C37]
Length = 942
Score = 258 bits (659), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 203/681 (29%), Positives = 331/681 (48%), Gaps = 74/681 (10%)
Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLT-YWSPNVNIFRDPRWGRG 168
AT+FP + ++N L +G + EAR + G T ++P +++ RD RWGR
Sbjct: 173 ATNFPTQLGLGHTWNRELIRQVGLITGREARML------GYTNVYAPILDVGRDQRWGRY 226
Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVS 228
+E GE P L + VRGLQ + +VAA+ KHF AY + + ++S
Sbjct: 227 EEVYGESPYLVAELGIEMVRGLQHNH----QVAATGKHFAAYSNNKGAREGMARVDPQMS 282
Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
+++E+ PF+ + E + VM SYN +G+P L +RGE GY+VSD
Sbjct: 283 PREVENIHIYPFKRVIREAGMLGVMSSYNDYDGIPVQGSYYWLTTRLRGEMGFRGYVVSD 342
Query: 289 CDSVGVYYDTQHFTSTP-EEAAADAIRAGLDLDC-----GPFLGLHTESAVQRGLLSEID 342
D+V Y T+H T+ +EA ++ AGL++ C F+ L V+ G LSE
Sbjct: 343 SDAVEYLY-TKHGTAKDMKEAVRQSVEAGLNVRCTFRSPDSFV-LPLRELVKEGGLSEEV 400
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGP 402
IN+ + + L V+ +G+FD + G ++V +++ +AL+A+ + +VLLKN
Sbjct: 401 INDRVRDILRVKFLIGLFDAPYQTDLAG--ADREVEKEENEAIALQASHESVVLLKNADE 458
Query: 403 SLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGR----YARTIHQQGCK 458
LPL + +AV GPN++ + +Y +A TT L+GI A ++ +GC
Sbjct: 459 LLPLDINSTKKIAVCGPNANEEGYALTHYGPLAVEVTTVLEGIQEKTKSKAEVLYTKGCD 518
Query: 459 ------------DVACADDQL--FGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLP 504
D DD+ A++ +RQAD ++V+G Q E R L LP
Sbjct: 519 LVDAHWPESEIIDYPLTDDEQAEIDKAVENARQADVAVVVLGGGQRTCGENKSRTSLDLP 578
Query: 505 GRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADIL 564
GRQ +L+ + K P +L+L++G P+ + +A D + AI+ A YPG GGTA+ADIL
Sbjct: 579 GRQLQLLQAIQATGK-PVVLILINGRPLSINWA--DKFVPAILEAWYPGSKGGTALADIL 635
Query: 565 FGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQS---KRYPGRTYRFYK-GPVVYPFGH 620
FG NPGGKL +T +P+ + +P +PS + PG T + +YPFG+
Sbjct: 636 FGDYNPGGKLTVT-FPKT-VGQIPFN-FPCKPSSQIDGGKNPGPTGNMSRINGALYPFGY 692
Query: 621 GMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVG 680
G+SYT F ++ + V P + +AT+ +KVT+ VQ+ +++V
Sbjct: 693 GLSYTTFEYSDLDITPRVITPNE------SATVR---LKVTNTGKRAGDEVVQLYIRDVL 743
Query: 681 SKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTR 740
S + K L F+++H+ G Q + I K+L ++D
Sbjct: 744 SSITTY---------------EKNLAGFQRIHLEPGEAQELSFTID-RKHLELLDADMKW 787
Query: 741 RIPLGEHNIHIGGTKHSVSLH 761
+ G+ + G + + L+
Sbjct: 788 VVEPGDFVLMAGASSEDIRLN 808
>gi|332184951|ref|ZP_08386700.1| glycosyl hydrolase family 3 N terminal domain protein [Sphingomonas
sp. S17]
gi|332014675|gb|EGI56731.1| glycosyl hydrolase family 3 N terminal domain protein [Sphingomonas
sp. S17]
Length = 886
Score = 258 bits (659), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 153/394 (38%), Positives = 216/394 (54%), Gaps = 37/394 (9%)
Query: 56 LIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQ 115
L+G+++ EK+ L++ A A+PRLG+ Y WW+E+LHG P T+FP+
Sbjct: 37 LVGKMTTDEKLYQLLNTAPAIPRLGVPAYNWWTESLHGALGPVP----------TTNFPE 86
Query: 116 VITTASSFNATLWEAIGRVVSDEARAMY----------NGGTAGLTYWSPNVNIFRDPRW 165
+ A++F+ L + +S E R ++ GT GL WSPN+NIFRDPRW
Sbjct: 87 PVGLAATFDDALVGRAAQAISREVRGLHALARQTGRLGRIGT-GLDSWSPNINIFRDPRW 145
Query: 166 GRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNA 225
GRGQET GEDP L+ + ++VRG+QG+D +R+ V A+ KH+ + R N
Sbjct: 146 GRGQETYGEDPFLTARMGVAFVRGMQGTDPNRIDVVATPKHYAVHSGPEST---RHEANV 202
Query: 226 KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYI 285
VS+ D+ DT+ FR ++EG S+MC+YN+++G P CA +LK +RG W GY+
Sbjct: 203 YVSQHDLVDTYLPAFRAAIVEGGAGSIMCAYNRIDGQPACASDLLLKDYLRGRWGFRGYV 262
Query: 286 VSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE-------SAVQRGLL 338
VSDCD+V H+ P A A A+RAG+D +C T+ A+ RGL+
Sbjct: 263 VSDCDAVKDIDANHHYAPDPATAVAAAMRAGVDNECNNATLTDTDGLTNPYRDALARGLI 322
Query: 339 SEIDINNALVNTLTVQMRLGMFDG-EPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLL 397
S DI+ +L + R G G P S P D+ TP HQ LALE A + +VLL
Sbjct: 323 SVADIDRSLTRLFAARYRTGDLPGVRPLSMQSSS--PADIGTPAHQALALETAEKSLVLL 380
Query: 398 KNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY 431
KN+G LPL +AVIGP D T + GNY
Sbjct: 381 KNKG-ILPLKA--QARIAVIGPLGDATRVLRGNY 411
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 95/291 (32%), Positives = 147/291 (50%), Gaps = 52/291 (17%)
Query: 474 ASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTI 523
A++ +DA + V+GL +EAE D+ L LP QQ L+ + K P I
Sbjct: 611 AAKDSDALVAVVGLTSDLEAEEAPISVPGFKGGDKTTLDLPADQQALLEQAKATGK-PLI 669
Query: 524 LVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEY 583
+VLM+G P+++A+AK + AI+ A YPGQAGG A+ ++L G +NP G+LP+T+Y
Sbjct: 670 VVLMNGSPVNLAWAKAN--ADAIVEAWYPGQAGGLAVGNVLSGKTNPAGRLPLTFY--RS 725
Query: 584 ITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHT-VANAPTVVAVP 641
+ +LP + AM+ GRTYR++ G VYPFG+G+SYT+F + V P V V
Sbjct: 726 VEDLPAFGDYAMK--------GRTYRYFTGTPVYPFGYGLSYTSFAYAPVKLTPAVTGV- 776
Query: 642 LDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAP 701
DG L V +V+N G + G ++ P AP
Sbjct: 777 QDG-------------------------LTVTTEVRNTGRRAGDEVAQLYLNFPKVDGAP 811
Query: 702 HKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
L F++VH+ G ++ + + + LS VD G R + G++ + +G
Sbjct: 812 RVALRGFQRVHLAPGERRMLRFTLD-PRDLSAVDADGHRLVMKGDYRVSVG 861
>gi|304406707|ref|ZP_07388362.1| glycoside hydrolase family 3 domain protein [Paenibacillus
curdlanolyticus YK9]
gi|304344240|gb|EFM10079.1| glycoside hydrolase family 3 domain protein [Paenibacillus
curdlanolyticus YK9]
Length = 733
Score = 258 bits (659), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 207/672 (30%), Positives = 331/672 (49%), Gaps = 77/672 (11%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + A+++N + S EA T G+T+ ++P +++ RDPRWGR
Sbjct: 112 TVFPIPLAMAAAWNPEVARQTSAAASTEAL------TDGVTWVFAPMIDVSRDPRWGRIA 165
Query: 170 ETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFNA 225
E+ GEDP L+ Y ++V G Q +G A+ KHF Y + ++N VD
Sbjct: 166 ESIGEDPYLTAAYGRAWVEGSQIDNGPGRATASCPKHFAGYGMAEAGRDYNTVD------ 219
Query: 226 KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYI 285
+S +++ D PF+ V G + S+M S+N++NG+P CA+ +LK +R EW G +
Sbjct: 220 -LSDRELRDIILPPFQDAVEAGAL-SIMASFNEINGIPACANEYLLKTILRDEWGFEGVV 277
Query: 286 VSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE---SAVQRGLLSEID 342
SD +++ V + EEA + AG D+D G+ T V+ G + E
Sbjct: 278 ASDYNAL-VELIVHGVAANEEEACEMTVLAGCDMDMHS--GIFTRQLPKLVRAGRVPESV 334
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGP 402
+++++ L ++++LG+ E S ++ ELA EAARQ IVLL+N+
Sbjct: 335 VDDSVRRILAMKIKLGLL--EQSKSDVSQSAATQPLKSEYVELAREAARQSIVLLQNKEQ 392
Query: 403 SLPLSHIRHRTVAVIGPNSDVTVTMIGNYA--GIACGYTTPLQGIGRYA----RTIHQQG 456
LPLS ++AVIGP +D +G +A G + T L+GI + A + QG
Sbjct: 393 VLPLSKA-GASIAVIGPLADNATDPLGCWALDGRSDEVVTALEGIRQAAAEGTSIRYAQG 451
Query: 457 CKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSM 516
C D+ ++ F AA++A+R +D ++++G ++ E+ RA L LPG+Q+ LV V+
Sbjct: 452 C-DIDSDSEEGFEAALEAARSSDVVVMLLGESATMSGESRSRAALDLPGKQRALVEAVAK 510
Query: 517 ASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPM 576
K P + V++SG P+ FA + +AI+ A + G G AIAD+LFG NP G+LP+
Sbjct: 511 LGK-PIVAVILSGRPL--TFAWLPEQASAIVQAWHLGVQSGNAIADVLFGDFNPSGRLPV 567
Query: 577 TWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFY----KGPVVYPFGHGMSYTNFVHTVA 632
T +PQ N+ + ++ R P Y Y +YPFG+G++YT F
Sbjct: 568 T-FPQ----NVGQIPIYHYRKKTGRPPAGAYSSYYIDSTTEPLYPFGYGLTYTEF----- 617
Query: 633 NAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF- 691
+G+I + S L V V ++NVG+ G + +
Sbjct: 618 ------------EYGAIQTSKSSIGADE--------QLDVTVSIRNVGNLAGEEVVQCYV 657
Query: 692 STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHI 751
A P K+LVAF KV V AG V I + L+++D+ R + G+ + I
Sbjct: 658 RDEVASVTQPLKRLVAFRKVKVAAGESVDVTFTIGAAE-LAILDKHMKRTVEPGDFTLWI 716
Query: 752 G---GTKHSVSL 760
G G+ SVS+
Sbjct: 717 GPSAGSGESVSI 728
>gi|262405981|ref|ZP_06082531.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|345510488|ref|ZP_08790055.1| beta-glucosidase [Bacteroides sp. D1]
gi|262356856|gb|EEZ05946.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|345454434|gb|EEO48987.2| beta-glucosidase [Bacteroides sp. D1]
Length = 735
Score = 258 bits (659), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 233/795 (29%), Positives = 368/795 (46%), Gaps = 109/795 (13%)
Query: 8 FFLGLILLSASSSGLAAREPFACDPKDATTRTLP-FCQVSLPIPQRVNDLIGRLSLQEKV 66
LGL LL + S L++++ +++P + PI +R++DLI R++L+EKV
Sbjct: 6 LILGLCLLGVTHS-LSSKD----------KKSIPLYKDAKAPIEKRIDDLISRMTLEEKV 54
Query: 67 KLLI--------------SGAAAVP-RLGIKGYEWWSEALHGV--------SNVGPGTKF 103
L VP +G Y + L S +G F
Sbjct: 55 LQLNQYTLGRNNNVNNVGEEVKKVPSEIGSLIYFDINPELRNSMQKKAMEESRLGIPIIF 114
Query: 104 GGDFPGA--TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIF 160
G D T +P + A S+N L E V + EAR +G+ + +SP +++
Sbjct: 115 GYDAIHGFRTIYPISLGQACSWNPGLVEQACAVSAQEAR------MSGVDWTFSPMIDVA 168
Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVD 219
RDPRWGR E GEDP +G +AA+ VRG QG D ++AA KH+ Y
Sbjct: 169 RDPRWGRVAEGYGEDPYTNGVFAAASVRGYQGDDMSAENRMAACLKHYVGYGASE---AG 225
Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
R + ++S Q + DT+ +P+ M V G ++M S+N ++GVP A+P I+ ++ W
Sbjct: 226 RDYVYTEISAQTLWDTYLLPYEMGVKAG-APTLMSSFNDISGVPGSANPYIMTEILKKRW 284
Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPF-LGLHTESAVQRGLL 338
+ +G+IVSD +V Q +T ++AA A AGL++D H + V+ G +
Sbjct: 285 KHDGFIVSDWGAVE-QLKNQGLAATKKDAARYAFNAGLEMDMMSHAYDRHLKELVEEGKV 343
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHL-GPKD-VCTPDHQELALEAARQGIVL 396
+ ++ ++ L V+ RLG+F+ +PY + KD P +A + A + +VL
Sbjct: 344 TMAQVDESVRRVLRVKFRLGLFE-----RPYTPVTNEKDRFFRPQSMAVAAQLAAESMVL 398
Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG------IACGYTTPLQGIGRYAR 450
LKN LPL++ + +AV+GP + ++G++ G + Y G A
Sbjct: 399 LKNDNQILPLTN--KKKIAVVGPMAKNGWDLLGSWCGHGKDTDVEMLYDGLTAEFGGDAE 456
Query: 451 TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQEL 510
+ GCK D F A+D +R +D I+ +G + E R+ + LP Q+EL
Sbjct: 457 LRYAMGCKPQG-NDRSGFAGALDVARWSDVVIVCLGEMLTWSGENASRSTIALPQIQEEL 515
Query: 511 VSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNP 570
V ++ A K P ILVL +G P+++ + +P AI+ PG G ++A IL G NP
Sbjct: 516 VKELKEAGK-PVILVLSNGRPLEL--NRMEPLCDAILEIWQPGINGARSMAGILSGRINP 572
Query: 571 GGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYR-FYKGPV---VYPFGHGMSYTN 626
GKL MT P + + ++R GR ++ FYK +YPFGHG+SYT
Sbjct: 573 SGKLAMT---------FPYSTGQIPIYYNRRKSGRGHQGFYKDITSDPLYPFGHGLSYTE 623
Query: 627 FVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAH 686
F + T++ A KV L +V V N GS+DGA
Sbjct: 624 FKY---------------------GTVTPSATKVKRGD----KLSAEVTVTNTGSRDGAE 658
Query: 687 TLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLG 745
T+ F S P P K+L FEK + AG + +I + + V+ G R + G
Sbjct: 659 TVHWFISDPYCSITRPVKELKHFEKQLIKAGETKTFRFDIDLERDFGFVNEDGKRFLEAG 718
Query: 746 EHNIHIGGTKHSVSL 760
E++I + G + L
Sbjct: 719 EYHILVQGQTVKIEL 733
>gi|261405721|ref|YP_003241962.1| glycoside hydrolase family protein [Paenibacillus sp. Y412MC10]
gi|261282184|gb|ACX64155.1| glycoside hydrolase family 3 domain protein [Paenibacillus sp.
Y412MC10]
Length = 765
Score = 258 bits (659), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 209/686 (30%), Positives = 324/686 (47%), Gaps = 103/686 (15%)
Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
G T FP ++ S++N L+ + R V+ E R+ G +SP +++ RDPRWGR
Sbjct: 122 GGTVFPVPLSIGSTWNVDLYRDMCRAVALETRSQ-----GGAVTYSPVLDVVRDPRWGRT 176
Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDR-LKVAASCKHFTAYDLDNWNGVDRFHFNA-- 225
+E GEDP L +YA + V GLQG D VAA+ KHF Y G NA
Sbjct: 177 EECFGEDPYLISEYAVASVEGLQGESLDSPSSVAATLKHFVGY------GSSEGGRNAGP 230
Query: 226 -KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+ +++ + +PF+ V E AS+M +YN+++GVP + +L +R EW +G
Sbjct: 231 VHMGTRELMEVDMLPFKKAV-EAGAASIMPAYNEIDGVPCTVNTELLDGILRKEWGFDGM 289
Query: 285 IVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLD-CGPFLGLHTESAVQRGLLSEIDI 343
+++DC ++ + +AA AIRAG+D++ G G H + AV+ L +
Sbjct: 290 VITDCGAIDMLASGHDTAEDGMDAAVQAIRAGIDMEMSGEMFGKHLQKAVESNKLEVSVL 349
Query: 344 NNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
+ A+ LT++ +LG+F+ P P + + H LA + A +GIVLLKN+ +
Sbjct: 350 DEAVRRVLTLKFKLGLFE-NPYVDP--QTAENVIGSEQHVGLARQLAAEGIVLLKNEAKA 406
Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIA--CGYTTPLQGI----GRYA-RTIHQQG 456
LPLS +AVIGPN+D +G+Y TT L GI G A R ++ G
Sbjct: 407 LPLSK-EGGVIAVIGPNADQGYNQLGDYTSPQPPAAVTTVLGGIRAKLGEEAQRVLYAPG 465
Query: 457 CKDVACADD--QLFGAAIDASRQADATILVMG---------------------LDQSIE- 492
C+ DD + F A+ + QAD ++V+G D ++
Sbjct: 466 CR---IKDDSREGFEFALTCAEQADTVVMVLGGSSARDFGEGTIDLRTGASKVTDDALSD 522
Query: 493 ---AEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWA 549
E +DR L L G Q ELV ++ K I+V ++G PI + D AI+ A
Sbjct: 523 MDCGEGIDRMTLQLSGVQLELVQEIHKLGK-RMIVVYINGRPIAEPWI--DEHADAILEA 579
Query: 550 GYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRF 609
YPGQ GG A+ADILFG NP GKL M+ P+ ++ LP+ KR G+ Y
Sbjct: 580 WYPGQEGGHAVADILFGDVNPSGKLTMS-IPK-HVGQLPVY------YNGKRSRGKRYLE 631
Query: 610 YKGPVVYPFGHGMSYTNFVHT-VANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRL 668
YPFG+G+SYT F ++ + P V+ G A+
Sbjct: 632 EDSQPRYPFGYGLSYTEFSYSDIQMTPEVIGT-------------DGTAV---------- 668
Query: 669 TLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVGINI-- 725
V V+V N G +G+ + ++ + A + P ++L F+K+ + G +++V I
Sbjct: 669 ---VSVNVTNSGDCEGSEVVQLYVSDAASKYTRPARELKGFQKIFLQPGERRKVEFTIGP 725
Query: 726 ----HVCKYLSVVDRSGTRRIPLGEH 747
++ + V G R+ LG H
Sbjct: 726 EQLQYIGQDYRQVVEPGLFRVMLGRH 751
>gi|167646366|ref|YP_001684029.1| beta-glucosidase [Caulobacter sp. K31]
gi|167348796|gb|ABZ71531.1| Beta-glucosidase [Caulobacter sp. K31]
Length = 898
Score = 258 bits (658), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 156/396 (39%), Positives = 223/396 (56%), Gaps = 37/396 (9%)
Query: 55 DLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFP 114
+L+G+++L EK++ L++ A A+PRLG+ Y WW+E+LHG P T+FP
Sbjct: 48 ELVGKMTLDEKLEQLLNTAPAIPRLGVPAYNWWTESLHGALGALP----------TTNFP 97
Query: 115 QVITTASSFNATLWEAIGRVVSDE-------ARAMYNGGT--AGLTYWSPNVNIFRDPRW 165
+ I A++F+A L + + E ARA G GL WSPN+NIFRDPRW
Sbjct: 98 EPIGLAATFDAPLVHDVAGAIGAEMQDLHALARATGRMGRIGTGLNTWSPNINIFRDPRW 157
Query: 166 GRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNA 225
GRGQET GEDP L+ + ++V G+QG D D ++ A+ KHF + R N
Sbjct: 158 GRGQETYGEDPFLTARMGVAFVEGIQGDDPDHPRIIATPKHFAVHSGPEST---RHGANV 214
Query: 226 KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYI 285
VS++D+EDT+ FR V+EG+ S+MC+YN+++G P CA +LK +RG W+ +GY+
Sbjct: 215 FVSRRDLEDTYLPAFRAAVVEGRAGSIMCAYNRIDGQPACASDLLLKEHLRGAWKFDGYV 274
Query: 286 VSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE-------SAVQRGLL 338
VSDCD+V D + A A A+RAG+D +C T+ A+ RGL+
Sbjct: 275 VSDCDAVKDISDHHKYAPEAASAVAAALRAGVDNECNGATLTDTDGLAGRYREALDRGLI 334
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHL-GPKDVCTPDHQELALEAARQGIVLL 397
S I+ ALV + + R G + S G L GP V T +H+ LAL A+ + +VLL
Sbjct: 335 STAQIDTALVRLFSARFRNGDLPAKGGSD--GRLAGPSVVTTHEHEALALAASEKSLVLL 392
Query: 398 KNQGPSLPLS-HIRHRTVAVIGPNSDVTVTMIGNYA 432
KN G LPL +R +AVIGP D T + GNY+
Sbjct: 393 KNDG-VLPLKPGLR---IAVIGPLGDATRVLRGNYS 424
Score = 120 bits (302), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 85/284 (29%), Positives = 140/284 (49%), Gaps = 48/284 (16%)
Query: 479 DATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMS 528
D + V+GL +EAE D+ L LP Q+ ++ + K P I+V M+
Sbjct: 628 DVLVAVVGLTSDMEAEEAPIEIPGFKGGDKTTLDLPADQRAMLEQAKALGK-PLIVVAMN 686
Query: 529 GGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLP 588
G P++ A+AK++ +A++ A YPGQ+GG AIA++L G +NP G+LP+T+Y + +LP
Sbjct: 687 GSPLNFAWAKDN--ASALLEAWYPGQSGGLAIANVLTGKTNPAGRLPLTFY--RSVDDLP 742
Query: 589 MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGS 648
P GRTYR+++G VYPFG+G+SYT F + P + G
Sbjct: 743 -------PFDDYAMAGRTYRYFEGTPVYPFGYGLSYTRFDY----GPLKIEPATKG---- 787
Query: 649 INATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAF 708
+G+ ++VT +KNVG++ G ++ P AP L F
Sbjct: 788 -----AGQGLRVT------------TTIKNVGTRPGEEVAQLYLDFPKTPGAPRLALRGF 830
Query: 709 EKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
+++ + G + + + + LS VD G R+ G + + +G
Sbjct: 831 QRIALKPGETRDITFALS-PRDLSSVDLDGEHRVSAGLYRVSVG 873
>gi|1483154|dbj|BAA13102.1| T-cell inhibitor(STI) [Salmonella enterica subsp. enterica serovar
Typhimurium]
Length = 765
Score = 258 bits (658), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 204/669 (30%), Positives = 323/669 (48%), Gaps = 103/669 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN +GRV + Y GL W+P V++ RDPRWGR
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V+ +QG S DR V S KHF AY +N VD
Sbjct: 172 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R EW G
Sbjct: 227 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 283
Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T+ PE+A A++AG+D+ + + ++ G ++ +
Sbjct: 284 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 341
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPK+ D H++ A E AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 396
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + T+AV+GP +D ++G++ AG+A T L GI G A
Sbjct: 397 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 454
Query: 450 RTIHQQGCK---DVACAD----------------DQLFGAAIDASRQADATILVMGLDQS 490
+ ++ +G D D + A+ A++QAD + V+G Q
Sbjct: 455 KILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 514
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 515 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
+ G GG AIAD+LFG NP GKLP++ +P+ + +P+ T P + +Y
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 629
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
+ GP +YPFG+G+SYT F TV++ ++ P R G + A+
Sbjct: 630 LYFDKPNGP-LYPFGYGLSYTTF--TVSD--FTLSSPTMQRDGKVTAS------------ 672
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
V V N G ++GA + ++ A P KQL FEK+ + G ++ V
Sbjct: 673 ---------VKVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGKRKTVSF 723
Query: 723 -INIHVCKY 730
I+I K+
Sbjct: 724 PIDIEALKF 732
>gi|329962030|ref|ZP_08300041.1| glycosyl hydrolase family 3 protein [Bacteroides fluxus YIT 12057]
gi|328530678|gb|EGF57536.1| glycosyl hydrolase family 3 protein [Bacteroides fluxus YIT 12057]
Length = 941
Score = 258 bits (658), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 204/681 (29%), Positives = 319/681 (46%), Gaps = 76/681 (11%)
Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLT-YWSPNVNIFRDPRWGRG 168
AT+FP + ++N L +G + EAR + G T ++P +++ RD RWGR
Sbjct: 173 ATNFPTQLGLGHTWNRALIHKVGLITGREARML------GYTNVYAPILDVGRDQRWGRY 226
Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVS 228
+E GE P L + VRGLQ VAA+ KHF AY + + + S
Sbjct: 227 EEVYGESPYLVAELGIEMVRGLQQH------VAATGKHFAAYSNNKGAREGMARVDPQTS 280
Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
++E+ PFR + E + VM SYN +G+P L +R E GY+VSD
Sbjct: 281 PHEVENIHIYPFRRVIKEAGLLGVMSSYNDYDGIPIQGSYYWLTTRLRDEMGFRGYVVSD 340
Query: 289 CDSVGVYYDTQHFTSTP-EEAAADAIRAGLDLDC-----GPFLGLHTESAVQRGLLSEID 342
D+V Y T+H T+ +EA ++ AGL++ C F+ L V+ G L E
Sbjct: 341 SDAVEYLY-TKHGTAKDMKEAVRQSVEAGLNVRCTFRSPDSFV-LPLRELVKEGGLDEET 398
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGP 402
+N+ + + L V+ +G+FD + G K+V +++ +AL+A+R+ +VLLKN+
Sbjct: 399 VNDRVRDILRVKFLIGLFDAPYQTDLAG--ADKEVEKEENEAVALQASRESVVLLKNENS 456
Query: 403 SLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGCK 458
+LPL+ + +AV GPN+D + +Y +A TT L+GI A ++ +GC
Sbjct: 457 TLPLNINTVKKIAVCGPNADEDGYALTHYGPLAVEVTTVLKGIQDKVNGKAEVLYTKGCD 516
Query: 459 DVACA-------------DDQL-FGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLP 504
V D+Q A++ +R+AD ++V+G Q E R+ L LP
Sbjct: 517 LVDANWPESEIIDYPLTPDEQAEINKAVENARRADVAVVVLGGGQRTCGENKSRSSLDLP 576
Query: 505 GRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADIL 564
GRQ +L+ V K P +L+L++G P+ V +A D + AI+ A YPG GG A+ADIL
Sbjct: 577 GRQLQLLQAVQATGK-PVVLILINGRPLSVNWA--DKYVPAILEAWYPGSKGGVALADIL 633
Query: 565 FGTSNPGGKLPMTWYPQ---EYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHG 621
FG NPGGKL +T +P+ + N P + P G +YPFG+G
Sbjct: 634 FGDYNPGGKLTVT-FPKTVGQIPFNFPCKPASQIDGGKNAGPDGNMSRING-ALYPFGYG 691
Query: 622 MSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGS 681
+SYT F ++ N I+ K I RL V N G
Sbjct: 692 LSYTTFEYS-------------------NLEITPKVITPNEKATVRLK------VTNTGK 726
Query: 682 KDGAHTLLVFSTPPAGHWAPH-KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTR 740
G + +++ + K L FE++H+ G + V I K+L ++D R
Sbjct: 727 YAGDEVVQLYTRDVLSSVTTYEKNLAGFERIHLEPGETKEVTF-ILDRKHLELLDADMKR 785
Query: 741 RIPLGEHNIHIGGTKHSVSLH 761
+ G+ I G + + L+
Sbjct: 786 VVEPGDFAIMAGASSEDIRLN 806
>gi|237712573|ref|ZP_04543054.1| glycoside hydrolase family 3 protein [Bacteroides sp. 9_1_42FAA]
gi|345512524|ref|ZP_08792050.1| glycoside hydrolase family beta-glycosidase [Bacteroides dorei
5_1_36/D4]
gi|423239901|ref|ZP_17221016.1| hypothetical protein HMPREF1065_01639 [Bacteroides dorei
CL03T12C01]
gi|229435409|gb|EEO45486.1| glycoside hydrolase family beta-glycosidase [Bacteroides dorei
5_1_36/D4]
gi|229453894|gb|EEO59615.1| glycoside hydrolase family 3 protein [Bacteroides sp. 9_1_42FAA]
gi|392644890|gb|EIY38624.1| hypothetical protein HMPREF1065_01639 [Bacteroides dorei
CL03T12C01]
Length = 788
Score = 258 bits (658), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 202/680 (29%), Positives = 313/680 (46%), Gaps = 84/680 (12%)
Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLT-YWSPNVNIFRDPRWGRG 168
AT FP +++N L IG V + EA A+ G T +SP ++I +DPRWGR
Sbjct: 158 ATYFPAQCGQGATWNKELIARIGEVEAKEAVAL------GYTNIYSPILDIAQDPRWGRC 211
Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVS 228
ET GEDP L G+ + LQ + + A+ KHF Y + + + V+
Sbjct: 212 VETYGEDPYLVGELGKQMITSLQ-----KHNLVATPKHFAVYSIPVGGRDGKTRTDPHVA 266
Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
+++ + PFRM E VM SYN +G P + L +R EW GY+VSD
Sbjct: 267 PREMRTLYIEPFRMAFQEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKGYVVSD 326
Query: 289 CDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC-----GPFLGLHTESAVQRGLLSEIDI 343
++V +T E+ A A+ AGL++ F+ L AV G +S+ +
Sbjct: 327 SEAVEFISSKHKVANTYEDGIAQAVNAGLNIRTHFTPPADFI-LPLRKAVADGKISQETL 385
Query: 344 NNALVNTLTVQMRLGMFDGEPSSQPY---GHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
+ + L V+ LG+FD PY G + V + +HQ ++LEAARQ +VLLKN+
Sbjct: 386 DKRVAEILRVKFWLGLFD-----NPYRGNGKQAEQIVHSKEHQAVSLEAARQSLVLLKNE 440
Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGR---YARTIHQQGC 457
LPLS R++AVIGPN+D +I Y T QGI + I+++GC
Sbjct: 441 MNLLPLSK-SLRSIAVIGPNADERTQLICRYGPANAPIKTVYQGIKERLPHTEVIYRKGC 499
Query: 458 KDV--------------ACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLL 503
+ + +L AI A++QA+ ++V+G ++ E R L L
Sbjct: 500 DIIDPHFPESEVLDFPKTTEEARLMEEAIHAAKQAEVVVMVLGGNELTVREDRSRTSLNL 559
Query: 504 PGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADI 563
PGRQ+EL+ V K P +LVL+ G + +A + AI+ A +PG+ G A+A+
Sbjct: 560 PGRQEELLKAVCATGK-PVVLVLLDGRASSINYAA--AHVPAILHAWFPGEFCGQAVAEA 616
Query: 564 LFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMS 623
LFG NPGG+L +T +P+ + +P +P + Y V+YPFGHG+S
Sbjct: 617 LFGDYNPGGRLAVT-FPKS-VGQIPFA-FPFKPGSDESSSTSVY-----GVLYPFGHGLS 668
Query: 624 YTNFVHTVANAPTVVAVPL-DGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSK 682
YT F + + PL G G IN + +KN G
Sbjct: 669 YTTFSYG-----DLKISPLRQGVQGDIN---------------------ISCKIKNTGKI 702
Query: 683 DGAHTLLVFSTPPAGHWAPH-KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRR 741
G + ++ + K L FE++ + AG +Q V + + L + D++ R
Sbjct: 703 KGDEVVQLYLRDEVSSVTTYTKVLRGFERISLEAGEEQMVHFRLR-PQDLGLWDKNMNFR 761
Query: 742 IPLGEHNIHIGGTKHSVSLH 761
+ G+ + IG + + LH
Sbjct: 762 VEPGKFKVMIGSSSTDIRLH 781
>gi|395233504|ref|ZP_10411744.1| beta-D-glucoside glucohydrolase [Enterobacter sp. Ag1]
gi|394732231|gb|EJF31938.1| beta-D-glucoside glucohydrolase [Enterobacter sp. Ag1]
Length = 765
Score = 258 bits (658), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 209/691 (30%), Positives = 328/691 (47%), Gaps = 106/691 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN + +GR+ + Y GL W+P V++ RDPRWGR
Sbjct: 118 TVFPISLGLASSFNLDAVKTVGRI------SAYEAADDGLNMTWAPMVDVSRDPRWGRVS 171
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V +QG S DR V S KHF AY +N VD
Sbjct: 172 EGFGEDTYLTSMMGRTMVEAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G VM + N +NG P AD +LK +R EW+ G
Sbjct: 227 --MSPQRLFNDYMPPYKAALDAGS-GGVMVALNSLNGTPASADSWLLKDLLRDEWKFKGI 283
Query: 285 IVSDCDSVGVYYDTQH-FTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
+SD ++ +H S P++A A+ +G+++ + + V+ G + +
Sbjct: 284 TISDHGAIKEL--IKHGVASDPKDAVRIALNSGINMSMSDEYYSKYLPELVKSGAVPMKE 341
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPKD D H++ A E AR+ +V
Sbjct: 342 LDDAARHVLNVKYDMGLFN-----DPYSHLGPKDSDPVDTNAESRLHRKEAREVARESMV 396
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + T+AVIGP +D +G++ AG+A TPL+GI G+
Sbjct: 397 LLKNRLQTLPLK--KTSTIAVIGPLADSKRDAMGSWSAAGVAAQSVTPLEGIKNAVGQQG 454
Query: 450 RTIHQQGC-----KDVACADDQ--------------LFGAAIDASRQADATILVMGLDQS 490
+ + +G KD+ +Q + A+ ++ AD + V+G Q
Sbjct: 455 KVVFAKGANVTDEKDIVEFLNQYEPAVTVDPRTPQAMIDEAVKTAKNADVVVAVVGEAQG 514
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R L LP Q++L++ + K P +LVLM+G P+ A K + + A++ A
Sbjct: 515 MAHEASSRTNLTLPQSQRDLIAALKATGK-PLVLVLMNGRPL--ALVKEEQQADAMLEAW 571
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQE------YITNLPMTEMAMRPSQSKRYPG 604
Y G GG A+ADILFG NP GKLPM+ +P+ Y ++L T P + +Y
Sbjct: 572 YGGTEGGNAVADILFGDYNPSGKLPMS-FPRSVGQIPTYYSHL-NTGRPYDPEKPNKYTS 629
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F V + ++ P R G + A+++
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--KVGDVK--LSAPSMKRDGKVEASVT---------- 674
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
V N G + G + ++ A P K+L F+KV + AG Q V
Sbjct: 675 -----------VTNTGDRAGETVVQMYLQDVTASMSRPVKELKGFKKVALKAGESQTVSF 723
Query: 723 -INIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
I+I K+ + + G+ N+ IG
Sbjct: 724 PIDISALKFWNAKMKYDAEP---GKFNVFIG 751
>gi|423293434|ref|ZP_17271561.1| hypothetical protein HMPREF1070_00226 [Bacteroides ovatus
CL03T12C18]
gi|392678377|gb|EIY71785.1| hypothetical protein HMPREF1070_00226 [Bacteroides ovatus
CL03T12C18]
Length = 735
Score = 258 bits (658), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 231/795 (29%), Positives = 368/795 (46%), Gaps = 109/795 (13%)
Query: 8 FFLGLILLSASSSGLAAREPFACDPKDATTRTLP-FCQVSLPIPQRVNDLIGRLSLQEKV 66
LGL LL + S L++++ +++P + PI +R++DLI R++L+EK+
Sbjct: 6 LILGLCLLGVTHS-LSSKD----------KKSIPLYKDAKAPIEKRIDDLISRMTLEEKI 54
Query: 67 KLLI--------------SGAAAVP-RLGIKGYEWWSEALHGV--------SNVGPGTKF 103
L VP +G Y + L S +G F
Sbjct: 55 LQLNQYTLGRNNNVNNVGEEVKKVPSEIGSLIYFDINPELRNSMQKKAMEESRLGIPIIF 114
Query: 104 GGDFPGA--TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIF 160
G D T +P + A S+N L E V + EAR +G+ + +SP +++
Sbjct: 115 GYDAIHGFRTIYPISLGQACSWNPGLVEQACAVSAQEAR------MSGVDWTFSPMIDVA 168
Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVD 219
RDPRWGR E GEDP +G +AA+ VRG QG D ++AA KH+ Y
Sbjct: 169 RDPRWGRVAEGYGEDPYTNGVFAAASVRGYQGDDMSAENRMAACLKHYVGYGASE---AG 225
Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
R + ++S Q + DT+ +P+ M V G A++M S+N ++GVP A+P I+ ++ W
Sbjct: 226 RDYVYTEISAQTLWDTYLLPYEMGVKAG-AATLMSSFNDISGVPGSANPYIMTEILKKRW 284
Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPF-LGLHTESAVQRGLL 338
+ +G+IVSD +V Q +T ++AA A AGL++D H + V+ G +
Sbjct: 285 KHDGFIVSDWGAVE-QLKNQGLAATKKDAARYAFNAGLEMDMMSHAYDRHLKELVEEGKV 343
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHL-GPKD-VCTPDHQELALEAARQGIVL 396
+ ++ ++ L V+ RLG+F+ +PY + KD P +A + A + +VL
Sbjct: 344 TMAQVDESVRRVLRVKFRLGLFE-----RPYTPVTNEKDRFFRPQSMAVAAQLAAESMVL 398
Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG------IACGYTTPLQGIGRYAR 450
LKN LPL++ + +AV+GP + ++G++ G + Y G A
Sbjct: 399 LKNNNQILPLTN--KKKIAVVGPMAKNGWDLLGSWCGHGKDTDVEMLYDGLTAEFGGDAE 456
Query: 451 TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQEL 510
+ GCK D F A+D +R +D I+ +G + E R+ + LP Q+EL
Sbjct: 457 LRYAMGCKPQG-NDRSGFAGALDVARWSDVVIVCLGEMLTWSGENASRSTIALPQIQEEL 515
Query: 511 VSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNP 570
V ++ A K P ILVL +G P+++ + +P AI+ PG G ++A IL G NP
Sbjct: 516 VKELKEAGK-PIILVLSNGRPLEL--NRMEPLCDAILEIWQPGINGARSMAGILSGRINP 572
Query: 571 GGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYR-FYKGPV---VYPFGHGMSYTN 626
GKL MT P + + ++R GR ++ FYK +YPFGHG+SYT
Sbjct: 573 SGKLAMT---------FPYSTGQIPIYYNRRKSGRGHQGFYKDITSDPLYPFGHGLSYTE 623
Query: 627 FVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAH 686
F + T++ A KV L +V V N G++DGA
Sbjct: 624 FKY---------------------GTVTPSATKVKRGD----KLSAEVTVTNTGARDGAE 658
Query: 687 TLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLG 745
T+ F S P P K+L FEK + G + +I + + V+ G R + G
Sbjct: 659 TVHWFISDPYCSITRPVKELKHFEKQFIKVGETKTFRFDIDLERDFGFVNEDGKRFLEAG 718
Query: 746 EHNIHIGGTKHSVSL 760
E++I + G + L
Sbjct: 719 EYHILVQGQTVKIEL 733
>gi|381405212|ref|ZP_09929896.1| beta-D-glucoside glucohydrolase [Pantoea sp. Sc1]
gi|380738411|gb|EIB99474.1| beta-D-glucoside glucohydrolase [Pantoea sp. Sc1]
Length = 765
Score = 258 bits (658), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 205/665 (30%), Positives = 330/665 (49%), Gaps = 103/665 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASS++ + +GRV + Y GL W+P V++ R+PRWGRG
Sbjct: 118 TIFPIPLALASSWDLDAVKEVGRV------SAYEAADDGLNMTWAPMVDVTREPRWGRGS 171
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + S V+ +QG S DR V S KHF Y ++N VD
Sbjct: 172 EGFGEDTYLTSEMGRSMVQAMQGKSAADRYSVMTSVKHFALYGAVEGGRDYNTVD----- 226
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + P++ + G VM + N +NGVP AD +LK +RG+W+ G
Sbjct: 227 --MSPQRMFQDYLPPYKASLDAGS-GGVMVALNSINGVPATADSWLLKDLLRGQWKFKGI 283
Query: 285 IVSDCDSVGVYYDTQH-FTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
+SD ++ +H S P++A A+++G+D+ + + V+RG +S +
Sbjct: 284 TISDHGAIKEL--IKHGVASDPQDAVRIALKSGVDMSMSDEYYSKYLPDLVKRGEVSMAE 341
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
I++A + L V+ +G+F+ PY HLGPK+ D H+ A + AR+ IV
Sbjct: 342 IDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPQDTNAESRLHRAEARDVARKSIV 396
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI-------- 445
LLKN+ +LPL + T+A+IGP +D ++G++ AG+A + LQG+
Sbjct: 397 LLKNRLETLPLK--KSGTIALIGPLADSQRDIMGSWSAAGVAKQSVSLLQGMRNATEGKA 454
Query: 446 ----GRYARTIHQQGCKDV------ACADD-----QLFGAAIDASRQADATILVMGLDQS 490
+ A +G +D A + D Q+ A+ ++QAD + +G Q
Sbjct: 455 TLLYEKGANVSDNKGIQDFLNLYEPAVSIDTRSPQQMIDDAVAKAKQADVIVAAVGEAQG 514
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIA-AIIWA 549
+ EA R+ L++P QQ+L++ + K P ++VLM+G + V N+ R+A A++
Sbjct: 515 MAHEASSRSDLVIPPSQQKLLAALKATGK-PLVIVLMNGRALSVV---NEDRMADAMLET 570
Query: 550 GYPGQAGGTAIADILFGTSNPGGKLPMTWYPQE------YITNLPMTEMAMRPSQSKRYP 603
+ G GG AIAD+LFG NP GKLP++ +P+ Y +LP T ++ +Y
Sbjct: 571 WFSGTEGGNAIADVLFGDYNPSGKLPVS-FPRSVGQIPIYYNHLP-TGRPYNFAKPNKYT 628
Query: 604 GRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHA 663
Y GP +YPFG+G+SYT F TV +P ++ P R+GSI A+++
Sbjct: 629 SHYYDAVNGP-LYPFGYGLSYTRF--TV--SPVKMSSPTMPRNGSIEASVT--------- 674
Query: 664 KCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVG 722
V N G +DGA + ++ P + P ++L F+++ + AG Q V
Sbjct: 675 ------------VTNSGQRDGATVVQMYLNDPVASISRPVQELRGFKRIMLKAGESQTVT 722
Query: 723 INIHV 727
I V
Sbjct: 723 FKIDV 727
>gi|146312373|ref|YP_001177447.1| beta-galactosidase [Enterobacter sp. 638]
gi|145319249|gb|ABP61396.1| beta-glucosidase [Enterobacter sp. 638]
Length = 772
Score = 258 bits (658), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 208/690 (30%), Positives = 330/690 (47%), Gaps = 104/690 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN +GR+ + Y GL W+P V++ RDPRWGR
Sbjct: 125 TVFPISLGLASSFNLDAVRTVGRI------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 178
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V +QG S DR V S KHF AY +N VD
Sbjct: 179 EGFGEDTYLTATLGKTMVEAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 233
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R +W G
Sbjct: 234 --MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDQWGFKGI 290
Query: 285 IVSDCDSVGVYYDTQH-FTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H S PE+A A++AG+++ + + V+ G ++ +
Sbjct: 291 TVSDHGAIKEL--IKHGAASDPEDAVRVALKAGINMSMSDEYYSKYLPDLVKTGKVTMTE 348
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPKD D H++ A E AR+ +V
Sbjct: 349 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKDSDPADTNAESRLHRKDAREVARESLV 403
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + T+AV+GP +D ++G++ AG+A T L GI G
Sbjct: 404 LLKNRLDTLPLK--KSGTIAVVGPLADSKRDVMGSWSAAGVADQSVTVLTGIKNALGEDG 461
Query: 450 RTIHQQGC-----KDVACADDQ--------------LFGAAIDASRQADATILVMGLDQS 490
+ ++ +G KD+ +Q + A++A++Q+D + V+G Q
Sbjct: 462 KVVYAKGANVTNDKDIVTFLNQYEEAVKVDPRSAQAMIDEAVNAAKQSDVVVAVVGEAQG 521
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + A++
Sbjct: 522 MAHEASSRTDITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADALLETW 578
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM---RP---SQSKRYPG 604
+ G GG AIAD+LFG NP GKLPM+ +P+ + +P+ + RP + +Y
Sbjct: 579 FAGTEGGNAIADVLFGDYNPSGKLPMS-FPRS-VGQIPVYYSHLNTGRPYNADKPNKYTS 636
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTV-VAVPLDGRHGSINATISGKAIKVTHA 663
R + GP +YPFG+G+SYT F N V ++ P R G + A+
Sbjct: 637 RYFDEANGP-LYPFGYGLSYTTF-----NVSDVKMSAPSLKRDGKVTAS----------- 679
Query: 664 KCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG 722
V+V N G ++GA + ++ A P KQL FEKV + G + V
Sbjct: 680 ----------VEVTNTGKREGATVIQMYVQDVTASMSRPVKQLRGFEKVDLKPGETKTVS 729
Query: 723 INIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
I V L ++ G+ N+ IG
Sbjct: 730 FPIDV-DALKFWNQQMKYDAEAGKFNVFIG 758
>gi|417419081|ref|ZP_12159904.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Mississippi str. A4-633]
gi|353619740|gb|EHC70042.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Mississippi str. A4-633]
Length = 769
Score = 258 bits (658), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 205/681 (30%), Positives = 322/681 (47%), Gaps = 113/681 (16%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN +GRV + Y GL W+P V++ RDPRWGR
Sbjct: 108 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 161
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V+ +QG S DR V S KHF AY +N VD
Sbjct: 162 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 216
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R EW G
Sbjct: 217 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 273
Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T+ PE+A A++AG+D+ + + ++ G ++ +
Sbjct: 274 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 331
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPK+ D H++ A E AR+ +V
Sbjct: 332 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 386
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + T+AV+GP +D ++G++ AG+A T L GI G A
Sbjct: 387 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 444
Query: 450 RTIHQQGCK---DVACAD----------------DQLFGAAIDASRQADATILVMGLDQS 490
+ ++ +G D D + A+ A++QAD + V+G Q
Sbjct: 445 KILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 504
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 505 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 561
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTW---------YPQEYITNLPMTEMAM------- 594
+ G GG AIAD+LFG NP GKLP+++ Y T P +
Sbjct: 562 FAGTEGGNAIADVLFGDYNPSGKLPISFPRSVGQIPVYYSHLNTGRPYYYSHLNTGRPGR 621
Query: 595 --RPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINAT 652
P + +Y R + GP +YPFG+G+SYT F TV++ ++ P R G + A+
Sbjct: 622 PYNPEKPNKYTSRYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS 676
Query: 653 ISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKV 711
V+V N G ++GA + ++ A P KQL FEK+
Sbjct: 677 ---------------------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKI 715
Query: 712 HVPAGAQQRVG--INIHVCKY 730
+ G ++ V I+I K+
Sbjct: 716 TLKPGERKTVSFPIDIEALKF 736
>gi|427383551|ref|ZP_18880271.1| hypothetical protein HMPREF9447_01304 [Bacteroides oleiciplenus YIT
12058]
gi|425728735|gb|EKU91590.1| hypothetical protein HMPREF9447_01304 [Bacteroides oleiciplenus YIT
12058]
Length = 939
Score = 257 bits (657), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 197/680 (28%), Positives = 323/680 (47%), Gaps = 72/680 (10%)
Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLT-YWSPNVNIFRDPRWGRG 168
AT+FP + ++N L +G + EAR + G T ++P +++ RD RWGR
Sbjct: 170 ATNFPTQLGLGHTWNRKLIHQVGLITGREARML------GYTNVYAPILDVGRDQRWGRY 223
Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVS 228
+E GE P L + VRG+Q + +VAA+ KHF AY + + ++S
Sbjct: 224 EEVYGESPYLVAELGIEMVRGMQHNH----QVAATGKHFVAYSNNKGAREGMARVDPQMS 279
Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
+++E PF+ + E + VM SYN +G+P L + +RGE GY+VSD
Sbjct: 280 PREVEMIHVYPFKRVIKEAGMLGVMSSYNDYDGIPIQGSYYWLTKRLRGEMGFRGYVVSD 339
Query: 289 CDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG----PFLGLHTESAVQRGLLSEIDIN 344
D+V Y +EA ++ AGL++ C L V+ G LSE IN
Sbjct: 340 SDAVEYLYTKHSTAKDMKEAVRQSVEAGLNVRCTFRSPDSYVLPLRELVKEGGLSEDIIN 399
Query: 345 NALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSL 404
+ + + L V+ +G+FD + G K+V +++ +AL+A+R+ ++LLKN+ L
Sbjct: 400 DRVRDILRVKFLIGLFDAPYQTDLAG--ADKEVEKAENEAVALQASRESLILLKNENNVL 457
Query: 405 PLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGR----YARTIHQQGCK-- 458
PL +T+AV GPN++ + +Y +A T L+GI + A ++ +GC
Sbjct: 458 PLDINNIKTIAVCGPNANEEGYALTHYGPLAVEVITVLEGIRQKAEGKAEVLYAKGCDLV 517
Query: 459 DVACADDQL------------FGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGR 506
D + +L A++ +R+AD ++V+G Q E R+ L LPGR
Sbjct: 518 DANWPESELIEYPMTNEEQAEINKAVENARKADVAVVVLGGGQRTCGENKSRSSLDLPGR 577
Query: 507 QQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFG 566
Q +L+ V K P +LVL++G P+ + +A D + AI+ YPG GGTA+AD+LFG
Sbjct: 578 QLKLLQAVQATGK-PVVLVLINGRPLSINWA--DKFVPAILETWYPGSKGGTAVADVLFG 634
Query: 567 TSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQS---KRYPGRTYRFYK-GPVVYPFGHGM 622
NPGGKL +T +P+ + +P +PS + PG + +YPFG+G+
Sbjct: 635 DYNPGGKLTVT-FPKS-VGQIPFN-FPCKPSSQIDGGKNPGPDGNMSRVNGSLYPFGYGL 691
Query: 623 SYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSK 682
SYT F ++ N IS K + V+ V N G +
Sbjct: 692 SYTTFEYS-------------------NIEISPKMMTANQKAT------VRCKVTNTGKR 726
Query: 683 DGAHTLLVFSTPPAGHWAPH-KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRR 741
G + ++ + K L FE+VH+ G + V I K+L ++D+
Sbjct: 727 AGDEVVQLYIRDMLSSVTTYEKNLAGFERVHLQPGETKEVTF-ILDRKHLELLDKHMEWV 785
Query: 742 IPLGEHNIHIGGTKHSVSLH 761
+ G+ +I +G + + L+
Sbjct: 786 VEPGDFSIMVGASSEDIRLN 805
>gi|189464325|ref|ZP_03013110.1| hypothetical protein BACINT_00666 [Bacteroides intestinalis DSM
17393]
gi|189438115|gb|EDV07100.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 935
Score = 257 bits (657), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 208/737 (28%), Positives = 356/737 (48%), Gaps = 91/737 (12%)
Query: 39 TLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISG--AAAVPRLGIKGYEWWSEALHGVSN 96
+L + +LP+ +RV L+ ++ ++K++L+ G +P L + EA+HG S
Sbjct: 148 SLRYMDPTLPVEERVESLLSVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFS- 205
Query: 97 VGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPN 156
+G GAT FPQ + +++N L E + V DE + GT + WSP
Sbjct: 206 ------YGS---GATIFPQALAMGATWNKKLTEEVAMAVGDET---LSAGT--MQAWSPV 251
Query: 157 VNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWN 216
+++ +D RWGR +ET GEDPVL + ++++G Q + + + KHF +
Sbjct: 252 LDVAQDARWGRCEETFGEDPVLVSQIGGAWIKGYQS-----MGLYTTPKHFGGHGAP-LG 305
Query: 217 GVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIR 276
G D +S++++ + VPFR + S+M +Y+ GVP +L +R
Sbjct: 306 GRDSHDIG--LSEREMREVHLVPFRHVIRNYDCQSLMMAYSDFLGVPVAKSRELLHNILR 363
Query: 277 GEWRLNGYIVSDCDSVGVYYDTQHFTSTPE-EAAADAIRAGLDLDCG-PFLGLHTESAVQ 334
EW +G+IVSDC ++G +H+T+ + EAA A+ AG+ +CG + A +
Sbjct: 364 EEWGFSGFIVSDCGAIGNLTARKHYTAKNKIEAANQALAAGIATNCGDTYNDKEVIQAAK 423
Query: 335 RGLLSEIDINNALVNTLTVQMRLGMFDGEPSS-QPYGHLGPKDVCTPDHQELALEAARQG 393
G ++ +++ L + R +F+ P+ + + P + H+E+A +AAR+
Sbjct: 424 DGRINMENLDEVCRTMLRMMFRNELFEKAPNKPLDWNKIYP-GWNSDSHKEMARQAARES 482
Query: 394 IVLLKNQGPSLPLSHIRHRTVAVIGPNS------DVTVTMI-GNYAGIACGYTTPLQGIG 446
IVLL+N+ LPLS RT+AV+GP + D T + G + G Q +G
Sbjct: 483 IVLLENKDNILPLSK-DMRTIAVLGPGANDLQPGDYTPKLQPGQLKSVLTGIK---QAVG 538
Query: 447 RYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEA---------EALD 497
+ + I++QGC + ++ + A+ + Q+D +LV+G + EA E D
Sbjct: 539 KQTKVIYEQGCDFTSLGENNI-AKAVKVASQSDVVLLVLGDCSTSEATTDVYKTSGENHD 597
Query: 498 RAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGG 557
A L+LPG+QQEL+ V K P IL+L +G P +++ A + + W PGQ GG
Sbjct: 598 YATLILPGKQQELLEAVCATGK-PVILILQAGRPYNLSKASELCKAILVNW--LPGQEGG 654
Query: 558 TAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYP 617
A AD+LFG NP G+LPMT +P+ ++ LP+ + +RY +Y P+ Y
Sbjct: 655 PATADVLFGDYNPAGRLPMT-FPR-HVGQLPL--YYNFKTSGRRYEYSDMEYY--PLYY- 707
Query: 618 FGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVK 677
FG+G+SYT+F ++ +K+ + +T VQ VK
Sbjct: 708 FGYGLSYTSFEYS--------------------------GLKIQEKENGNIT--VQATVK 739
Query: 678 NVGSKDGAHTLLVFSTPPAGHWAPH-KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDR 736
N+G + G + ++ T +L F ++H+ G + V + + LS+++
Sbjct: 740 NIGQRAGDEVVQLYVTDMYASVKTRITELKDFTRIHLKPGEAKTVSFELTPYE-LSLLND 798
Query: 737 SGTRRIPLGEHNIHIGG 753
R + G I +GG
Sbjct: 799 HMDRVVEKGAFKILVGG 815
>gi|224536364|ref|ZP_03676903.1| hypothetical protein BACCELL_01238, partial [Bacteroides
cellulosilyticus DSM 14838]
gi|224522024|gb|EEF91129.1| hypothetical protein BACCELL_01238 [Bacteroides cellulosilyticus
DSM 14838]
Length = 808
Score = 257 bits (657), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 156/403 (38%), Positives = 234/403 (58%), Gaps = 41/403 (10%)
Query: 61 SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
+++EK+ LL + + + RL I Y +EALHGV V PG +F T FPQ I A
Sbjct: 1 TVEEKISLLRATSPGISRLDIPKYYHGNEALHGV--VRPG-RF-------TVFPQAIGLA 50
Query: 121 SSFNATLWEAIGRVVSDEARAMYNGGTAG----------LTYWSPNVNIFRDPRWGRGQE 170
+++N L + V+SDEARA +N G LT+WSP VN+ RDPRWGR E
Sbjct: 51 ATWNPELQLQVATVISDEARARWNELDQGREQKSQFSDLLTFWSPTVNMARDPRWGRTPE 110
Query: 171 TPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQ 230
T GEDP LSG ++V+GLQG D LK+ ++ KHF A + ++ +RF N ++S++
Sbjct: 111 TYGEDPYLSGIMGTAFVKGLQGDDDRYLKIVSTPKHFAANNEEH----NRFVCNPQISEK 166
Query: 231 DIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCD 290
+ + + F CV +GK AS+M +YN +N VP + +L + +R +W GY+VSDC
Sbjct: 167 QLREYYLPAFEACVKDGKSASIMSAYNALNDVPCTLNAWLLTKVLRKDWGFKGYVVSDCG 226
Query: 291 SVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG------PFLGLHTESAVQRGLLSEIDIN 344
+ + + T E AAA +I+AGLDL+CG P L SA ++ ++++ DI+
Sbjct: 227 GPSLLVNAHKYVKTKEAAAALSIKAGLDLECGDDVYDQPLL-----SAYRQYMVTDADID 281
Query: 345 NALVNTLTVQMRLGMFD-GEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
+A L +M LG+FD GE PY + P + + +HQE+AL AAR+ IVLLKNQ
Sbjct: 282 SAAYRVLRARMELGLFDSGE--QNPYTKISPAVIGSAEHQEVALNAARECIVLLKNQKKM 339
Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGI-ACGYTTPLQGI 445
LPL+ + +++AV+G N+ + G+Y+G+ + LQGI
Sbjct: 340 LPLNARKVKSIAVVGINA--GSSEFGDYSGLPVIAPISVLQGI 380
Score = 131 bits (330), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 88/297 (29%), Positives = 143/297 (48%), Gaps = 43/297 (14%)
Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
L+G A A R+ + + V+G+++SIE E DR + LP QQE + ++ + P I+V+
Sbjct: 545 LYGEAGKAVRECETVVAVLGINKSIEREGQDRYDIQLPADQQEFLQEIYKVN--PNIVVV 602
Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
+ G +A D I AI+ A YPG++GG A+A++LFG NPGG+LP+T+Y +
Sbjct: 603 LVAGS-SLAINWMDEHIPAIVNAWYPGESGGKAVAEVLFGDYNPGGRLPLTYY--RSLDE 659
Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
LP + GRTY+++KG V+YPFG+G+SYT F ++ DG
Sbjct: 660 LPPFD------DYDITKGRTYKYFKGDVLYPFGYGLSYTTFKYSNLQVA-------DGEE 706
Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQL 705
+ V +KN G G V+ P P K+L
Sbjct: 707 ----------------------EINVSFQLKNSGKYAGDEVAQVYVKLPERDEIMPIKEL 744
Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRI-PLGEHNIHIGGTKHSVSLH 761
FE+V + +G ++V + + L D + + + P G++ I +G + + L
Sbjct: 745 KGFERVTLKSGENKKVTLKLR-KDLLRYWDEAKDKFVCPSGDYTIMVGASSADIRLQ 800
>gi|354583533|ref|ZP_09002432.1| glycoside hydrolase family 3 domain protein [Paenibacillus lactis
154]
gi|353198174|gb|EHB63648.1| glycoside hydrolase family 3 domain protein [Paenibacillus lactis
154]
Length = 931
Score = 257 bits (657), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 170/467 (36%), Positives = 251/467 (53%), Gaps = 35/467 (7%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
PF LP+ +RVNDL+ R +L+EK++L+ A +PRLG++ Y+ +E HGV+ +G
Sbjct: 4 PFQNPDLPLEERVNDLVSRFTLEEKIELMCQYQAEIPRLGVQKYKHGTEGAHGVAWLGE- 62
Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--GLTYWSPNVN 158
AT FPQ A ++N L IG V++DEAR Y A GLT W+P V+
Sbjct: 63 ---------ATVFPQNTGLACTWNPELMRQIGSVIADEARVYYQRDKAVNGLTIWAPTVD 113
Query: 159 IFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGV 218
+ RDPRWGR +E GEDP L+G+ + V+G+QG K A+ KHF N N V
Sbjct: 114 MERDPRWGRTEEAYGEDPHLTGELSTGLVQGMQGDHPFYYKTVATLKHFYG----NNNEV 169
Query: 219 DRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGE 278
DR + + + ++ + + F +G S+M +YN +NG P C + + R ++ E
Sbjct: 170 DRGNASVSIDPRNKREYYLKAFEPAFRKGGAKSMMTAYNGINGTP-CNLNDEVNRIVKQE 228
Query: 279 WRLNGYIVSDC-DSVGVYYDTQHFTSTPEEAAADAIRAGLD--LDCGPFLGLHTESAVQR 335
W ++G++V D D +G D H+ + EA A +++AG+D D A+++
Sbjct: 229 WGMDGFVVGDAGDVLGTVMD-HHYVDSYAEAVARSVKAGIDSITDDQEISFRALRDALEQ 287
Query: 336 GLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIV 395
GLL+E D+++AL NT V+ RLG FD E PY H+ +C P+H EL+L AAR+ +V
Sbjct: 288 GLLTEQDLDHALRNTFRVRFRLGEFDPE-DRNPYSHVPESKLCAPEHAELSLRAARESVV 346
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQ 455
LLKN G LPL++ + AVIGP +D T Y+G TPLQGI T
Sbjct: 347 LLKNDG-LLPLNN-NMPSAAVIGPLADEAFT--DWYSGTPPYRITPLQGI-----TEKMN 397
Query: 456 GCK-DVACADDQL---FGAAIDASRQADATILVMGLDQSIEAEALDR 498
G K + DQ+ A+ AD L+ G ++S EA +R
Sbjct: 398 GRKVNFHTGLDQVRLRSAASGQYVTLADDNRLLFGTNESSEAALFER 444
Score = 102 bits (253), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 78/261 (29%), Positives = 130/261 (49%), Gaps = 38/261 (14%)
Query: 494 EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPG 553
E +DR ++LP QQ+L+ V A+ T++V++ P V + + + +I++ + G
Sbjct: 562 ETIDRPDIVLPPSQQQLIQAVHAANPN-TVVVIVGSYPFAVNWEQE--HVPSIVFTSHAG 618
Query: 554 QAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGP 613
Q G A+AD+L+G NPGG+L MTWY + LP M + K RTY+++ G
Sbjct: 619 QELGRAVADVLYGDYNPGGRLNMTWYKS--VDQLPDL-MDYDIIKGK----RTYQYFDGD 671
Query: 614 VVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQ 673
V+YPFGHG+SYT F + GR +IS +K ++T+ VQ
Sbjct: 672 VLYPFGHGLSYTRFAY--------------GR-----LSISEPVMKAD----GQVTVSVQ 708
Query: 674 VDVKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLS 732
V+N G + G + ++ + P K L F ++H+ GA + V + + L+
Sbjct: 709 --VRNEGDRAGDEVVQLYVSANNSRVVRPLKTLKGFRRIHLDPGASETVSFEL-AAEELA 765
Query: 733 VVDRSGTRR-IPLGEHNIHIG 752
D + R I GE+++ +G
Sbjct: 766 FWDVTRERYCIESGEYSVMVG 786
>gi|317480996|ref|ZP_07940076.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides sp. 4_1_36]
gi|316902889|gb|EFV24763.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides sp. 4_1_36]
Length = 864
Score = 257 bits (657), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 169/432 (39%), Positives = 234/432 (54%), Gaps = 32/432 (7%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
+ SLP +R L+ L+L+EKV L++ + V RLGIK Y WW+EALHGV+ G
Sbjct: 22 YKNASLPAEERAGLLLKELTLEEKVSLMMDSSKPVERLGIKPYNWWNEALHGVARAGL-- 79
Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAM--YNGGTA------GLTYW 153
AT FPQ I A+SF+ + VSDEARA Y GLT W
Sbjct: 80 --------ATVFPQPIGMAASFSPETVYEVFTAVSDEARAKNAYYASQGSRERYQGLTMW 131
Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDL 212
+P VNI+RDPRWGRG ET GEDP L+ + V+GLQG+ DG K+ A KHF +
Sbjct: 132 TPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGTNDGKYDKLHACAKHFAVHSG 191
Query: 213 DNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
WN R FNA+ + +D+ +T+ PF V EGKV VMC+YN+ G P C +L
Sbjct: 192 PEWN---RHEFNAENIKPRDLYETYLPPFEALVKEGKVKEVMCAYNRFEGDPCCGSDRLL 248
Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCGPFLGLHT 329
+ +R EW +G ++SDC ++ +Y + P E A+A A+ +G DL+CG
Sbjct: 249 MQILRDEWGFDGIVLSDCGAIADFYRDYGHKTHPDAESASAAAVLSGTDLECGSSYEALV 308
Query: 330 ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEA 389
E AV++G + E ++ A+ LT + LG D EP + + V + H LAL+
Sbjct: 309 E-AVKQGKIDEKAVDVAVKRLLTARFALGEMD-EPEKVSWTGIPFSVVASAGHDSLALDM 366
Query: 390 ARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI---- 445
AR+ + LL N+ +LPL TVAV+GPN++ +V GNY G+ T L GI
Sbjct: 367 ARKSMTLLMNKDNTLPLKR-GGLTVAVMGPNANDSVMQWGNYNGMPPHTVTILDGIRKAL 425
Query: 446 GRYARTIHQQGC 457
G R I++QGC
Sbjct: 426 GTDDRLIYEQGC 437
Score = 126 bits (317), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 94/300 (31%), Positives = 144/300 (48%), Gaps = 47/300 (15%)
Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
+++ + AD + V G+ ++E E + DR + LP Q+EL++ + A K
Sbjct: 593 SVERVKDADVVVFVGGVSPNLEGEEMGVELPGFRGGDRTDIELPAVQRELIAALHHAGK- 651
Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
+LV SG PI R AI+ A YPGQAGGTA+A++LFG NP G+LP+T+Y
Sbjct: 652 KVVLVNCSGSPI--GLEPETGRCGAILQAWYPGQAGGTAVAEVLFGDYNPAGRLPVTFYR 709
Query: 581 QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
++ LP E GRTYR+ ++PFGHG+SYT+F + AV
Sbjct: 710 N--VSQLPDFE-------DYNMTGRTYRYMTQEPLFPFGHGLSYTSFCYG--------AV 752
Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA 700
L GS N SG+ ++ + V V N G DG + V+
Sbjct: 753 VL----GSDNIK-SGEKLR------------LNVPVTNTGKCDGEEVVQVYLKKNDDVEG 795
Query: 701 PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
P K L AF++VH+PAG V ++ + + +S T + G + + +GG+ + L
Sbjct: 796 PSKALRAFKRVHIPAGKTVDVEFDLGDKELVWWNPQSNTMCVSEGSYELMVGGSSQTAGL 855
>gi|157144933|ref|YP_001452252.1| hypothetical protein CKO_00662 [Citrobacter koseri ATCC BAA-895]
gi|157082138|gb|ABV11816.1| hypothetical protein CKO_00662 [Citrobacter koseri ATCC BAA-895]
Length = 823
Score = 257 bits (657), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 211/691 (30%), Positives = 335/691 (48%), Gaps = 106/691 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN + +GRV + Y GL W+P V++ RDPRWGR
Sbjct: 176 TVFPISLGLASSFNLEAVKTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 229
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V +QG S DR V S KHF AY +N VD
Sbjct: 230 EGFGEDTYLTAIMGKTMVEAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 284
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R EW G
Sbjct: 285 --MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKEVLRDEWGFKGI 341
Query: 285 IVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T S PE+A A+++G+++ + + V+ G ++ +
Sbjct: 342 TVSDHGAIKEL--IKHGTASDPEDAVRVALKSGINMSMSDEYYSKYLPGLVKSGKVTMAE 399
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLG K+ D H++ A + AR+ +V
Sbjct: 400 LDDAARHVLNVKYDMGLFN-----DPYSHLGAKESDPQDTNAESRLHRKDARDVARESLV 454
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL+ + T+AV+GP +D ++G++ AG+A T L GI G
Sbjct: 455 LLKNRLETLPLN--KSGTIAVVGPLADSQRDVMGSWSAAGVASQSVTVLTGIKNALGDKG 512
Query: 450 RTIHQQGCK---DVACAD----------------DQLFGAAIDASRQADATILVMGLDQS 490
+ I+ +G D D + + A++A++Q+D + V+G Q
Sbjct: 513 KVIYAKGANVTNDKGIVDFLNQYEEAVKVAPRSPEAMIDEAVNAAKQSDVVVAVVGEAQG 572
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 573 MAHEASSRTDIQIPQSQRDLIAALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 629
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM---RP---SQSKRYPG 604
+ G GG AIAD+LFG NP GKLPM+ +P+ + +P+ + RP + +Y
Sbjct: 630 FAGTEGGNAIADVLFGDYNPSGKLPMS-FPRS-VGQIPVYYSHLNTGRPYNADKPNKYTS 687
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F TV++ ++ P R G++ A+
Sbjct: 688 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSAPTMKRDGAVTAS------------ 730
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
V V N G ++GA + ++ A P KQL FEKV + G Q V
Sbjct: 731 ---------VRVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKVTLKPGETQTVSF 781
Query: 723 -INIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
I+I K+ + R P G+ N+ IG
Sbjct: 782 PIDIEALKFWN--QRMKYDAEP-GKFNVFIG 809
>gi|283832442|ref|ZP_06352183.1| periplasmic beta-glucosidase [Citrobacter youngae ATCC 29220]
gi|291072102|gb|EFE10211.1| periplasmic beta-glucosidase [Citrobacter youngae ATCC 29220]
Length = 765
Score = 257 bits (657), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 212/691 (30%), Positives = 332/691 (48%), Gaps = 106/691 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN +GRV + Y GL W+P V++ RDPRWGR
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V +QG S DR V S KHF AY +N VD
Sbjct: 172 EGFGEDTYLTATMGKTMVEAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM N +NG P +D +LK +R EW G
Sbjct: 227 --MSPQRLFNDYMPPYKAGLDAGS-GAVMVGLNSLNGTPATSDAWLLKDVLRDEWGFKGI 283
Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T+ PE+A AI++G+++ + + ++ G ++ +
Sbjct: 284 TVSDHGAIKEL--IKHGTAADPEDAVRVAIKSGINMSMSDEYYSKYLPGLIKSGKVTMAE 341
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLG K+ D H++ A E AR+ +V
Sbjct: 342 LDDAARHVLNVKYDMGLFN-----DPYSHLGAKESDPVDTNAESRLHRKEAREVARESLV 396
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + TVAV+GP +D ++G++ AG+A T L GI G
Sbjct: 397 LLKNRLDTLPLK--KSATVAVVGPLADSKRDVMGSWSAAGVADQSVTVLTGIKNAIGEQG 454
Query: 450 RTIHQQGCK---DVACAD----------------DQLFGAAIDASRQADATILVMGLDQS 490
+ I+ +G D D + A++A++Q+D + V+G Q
Sbjct: 455 KVIYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVNAAKQSDVVVAVVGEAQG 514
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 515 MAHEASSRTDITIPQSQRDLIAALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM---RP---SQSKRYPG 604
+ G GG AIAD+LFG NP GKLPM+ +P+ + +P+ + RP + +Y
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPMS-FPRS-VGQIPVYYSHLNTGRPYNADKPNKYTS 629
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F TV++ ++ P R G++ A+
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSDVK--LSSPTMKRDGTVTAS------------ 672
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
V V N G ++GA + ++ A P KQL FEKV++ G Q V
Sbjct: 673 ---------VQVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKVNLKPGETQTVSF 723
Query: 723 -INIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
I+I K+ + R P G+ N+ IG
Sbjct: 724 PIDIEALKFWN--QRMKYDAEP-GKFNVFIG 751
>gi|423229063|ref|ZP_17215468.1| hypothetical protein HMPREF1063_01288 [Bacteroides dorei
CL02T00C15]
gi|423244903|ref|ZP_17225977.1| hypothetical protein HMPREF1064_02183 [Bacteroides dorei
CL02T12C06]
gi|392634816|gb|EIY28728.1| hypothetical protein HMPREF1063_01288 [Bacteroides dorei
CL02T00C15]
gi|392640944|gb|EIY34735.1| hypothetical protein HMPREF1064_02183 [Bacteroides dorei
CL02T12C06]
Length = 788
Score = 257 bits (657), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 200/679 (29%), Positives = 311/679 (45%), Gaps = 82/679 (12%)
Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQ 169
AT FP +++N L IG V + EA A+ +SP ++I +DPRWGR
Sbjct: 158 ATYFPAQCGQGATWNKELIARIGEVEAKEAVAL-----EYTNIYSPILDIAQDPRWGRCV 212
Query: 170 ETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSK 229
ET GEDP L G+ + LQ + + A+ KHF Y + + + V+
Sbjct: 213 ETYGEDPYLVGELGKQMITSLQ-----KHNLVATPKHFAVYSIPVGGRDGKTRTDPHVAP 267
Query: 230 QDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDC 289
+++ + PFRM E VM SYN +G P + L +R EW GY+VSD
Sbjct: 268 REMRTLYIEPFRMAFQEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKGYVVSDS 327
Query: 290 DSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC-----GPFLGLHTESAVQRGLLSEIDIN 344
++V +T E+ A A+ AGL++ F+ L AV G +S+ ++
Sbjct: 328 EAVEFISSKHKVANTYEDGIAQAVNAGLNIRTHFTPPADFI-LPLRKAVADGKISQETLD 386
Query: 345 NALVNTLTVQMRLGMFDGEPSSQPY---GHLGPKDVCTPDHQELALEAARQGIVLLKNQG 401
+ L V+ LG+FD PY G + V + +HQ ++LEAARQ +VLLKN+
Sbjct: 387 KRVAEILRVKFWLGLFD-----NPYRGNGKQAEQIVHSKEHQAVSLEAARQSLVLLKNEM 441
Query: 402 PSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGR---YARTIHQQGCK 458
LPLS R++AVIGPN+D +I Y T QGI + I+++GC
Sbjct: 442 NLLPLSK-SLRSIAVIGPNADERTQLICRYGPANAPIKTVYQGIKERLPHTEVIYRKGCD 500
Query: 459 DV--------------ACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLP 504
+ + +L AI A++QA+ ++V+G ++ E R L LP
Sbjct: 501 IIDPHFPESEVLDFPKTTEEARLMEEAIHAAKQAEVVVMVLGGNELTVREDRSRTSLNLP 560
Query: 505 GRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADIL 564
GRQ+EL+ V K P +LVL+ G + +A + AI+ A +PG+ G A+A+ L
Sbjct: 561 GRQEELLKAVCATGK-PVVLVLLDGRASSINYAA--AHVPAILHAWFPGEFCGQAVAEAL 617
Query: 565 FGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSY 624
FG NPGG+L +T +P+ + +P +P + Y V+YPFGHG+SY
Sbjct: 618 FGDYNPGGRLAVT-FPKS-VGQIPFA-FPFKPGSDESSSTSVY-----GVLYPFGHGLSY 669
Query: 625 TNFVHTVANAPTVVAVPL-DGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKD 683
T F + + PL G G IN + +KN G
Sbjct: 670 TTFSYG-----DLKISPLRQGVQGDIN---------------------ISCKIKNTGKIK 703
Query: 684 GAHTLLVFSTPPAGHWAPH-KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRI 742
G + ++ + K L FE++ + AG +Q V + + L + D++ R+
Sbjct: 704 GDEVVQLYLRDEVSSVTTYTKVLRGFERISLEAGEEQMVHFRLR-PQDLGLWDKNMNFRV 762
Query: 743 PLGEHNIHIGGTKHSVSLH 761
G+ + IG + + LH
Sbjct: 763 EPGKFKVMIGSSSTDIRLH 781
>gi|389794138|ref|ZP_10197297.1| beta-glucosidase [Rhodanobacter fulvus Jip2]
gi|388432924|gb|EIL89908.1| beta-glucosidase [Rhodanobacter fulvus Jip2]
Length = 869
Score = 257 bits (657), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 160/403 (39%), Positives = 224/403 (55%), Gaps = 36/403 (8%)
Query: 56 LIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQ 115
L+ R++L EK+ L++ A A+PRLGI Y WW+E+LHG P T+FP+
Sbjct: 25 LVARMTLAEKLPQLLNVAPAIPRLGIPRYNWWTESLHGALGTLP----------TTNFPE 74
Query: 116 VITTASSFNATLWEAIGRVVSDEARAMYNGGT---------AGLTYWSPNVNIFRDPRWG 166
I +SF+A L + + +S E R ++ GL WSPN+NIFRDPRWG
Sbjct: 75 PIGLGASFDAPLVKRVAAAISTEVRGLHTLARQTGRLGRIGTGLDTWSPNLNIFRDPRWG 134
Query: 167 RGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAK 226
RGQET GEDP L+ + ++V G+QG + DR +V A+ KHF + N R N
Sbjct: 135 RGQETYGEDPFLTARMGVAFVEGMQGPNADRPEVIATPKHFAVH---NGPESTRHEANVF 191
Query: 227 VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIV 286
VS+ D+ DT+ FR + E K S+MC+YN+++G P CA+ +LK +RG W GY+V
Sbjct: 192 VSRHDLVDTYLPAFRAAITEAKAGSIMCAYNRIDGQPACANDLLLKHYLRGAWGFTGYVV 251
Query: 287 SDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC-GPFL----GLHTE--SAVQRGLLS 339
SDCD+V D H+ A A A+RAG+D +C G L GL A+QR L+S
Sbjct: 252 SDCDAVKDISDHHHYAPDAATAVAAAVRAGVDNECNGATLSDTAGLEDRYSEALQRELIS 311
Query: 340 EIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKN 399
DI+ ALV + R+G G P + P + P + + + +LAL+AA++ +VLLKN
Sbjct: 312 VADIDRALVRLFAARYRVGDLAGLPGTAP--AVPPSAIGSAANSQLALDAAQKTLVLLKN 369
Query: 400 QGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
G LPL VAVIGP D T + GNY+ G TP+
Sbjct: 370 DG-VLPLKA--GVKVAVIGPLGDATRVLRGNYSSPLSG--TPI 407
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 108/332 (32%), Positives = 161/332 (48%), Gaps = 52/332 (15%)
Query: 434 IACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAA-ID-ASRQADATILVMGLDQSI 491
+A T L+ RY T+ G D+A A +D A+ QAD + V+GL +
Sbjct: 552 LATMKTVTLERGHRYPITVDATGGVDLAWKRISTTPEADLDRAAAQADVLVAVVGLTSDL 611
Query: 492 EAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDP 541
EAE D+ L LP Q ++ K + A+ P I+V M+G PID+A++K +
Sbjct: 612 EAEETSVTVPGFAGGDKTTLDLPADQIAML-KQARATGKPLIVVAMNGSPIDLAWSKQN- 669
Query: 542 RIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKR 601
AAI+ A YPGQ+GG AIA++L G ++PGG+LP+T+Y + +LP P R
Sbjct: 670 -AAAILEAWYPGQSGGLAIANVLSGKTDPGGRLPLTFY--RSVADLP-------PFGDYR 719
Query: 602 YPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINAT-ISGKAIKV 660
GRTYR++KG VYPFG G+SYT F +G +N T +G A
Sbjct: 720 MEGRTYRYFKGEPVYPFGFGLSYTEFT-----------------YGPLNVTPAAGGAANG 762
Query: 661 THAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQR 720
H V V N G ++G ++ PA AP L F++V + G +
Sbjct: 763 LH---------VSTVVTNAGQREGGDVAQLYLRFPAIPGAPQVALRGFQRVALKPGETRT 813
Query: 721 VGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
V ++ + LS V G R + G++ + +G
Sbjct: 814 VSFDLS-PRDLSAVTADGQREVMAGDYTVSVG 844
>gi|423305378|ref|ZP_17283377.1| hypothetical protein HMPREF1072_02317 [Bacteroides uniformis
CL03T00C23]
gi|423311197|ref|ZP_17289166.1| hypothetical protein HMPREF1073_03916 [Bacteroides uniformis
CL03T12C37]
gi|392679729|gb|EIY73108.1| hypothetical protein HMPREF1073_03916 [Bacteroides uniformis
CL03T12C37]
gi|392681368|gb|EIY74727.1| hypothetical protein HMPREF1072_02317 [Bacteroides uniformis
CL03T00C23]
Length = 864
Score = 257 bits (657), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 170/432 (39%), Positives = 235/432 (54%), Gaps = 32/432 (7%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
+ SLP +R L+ L+L+EKV L++ + V RLGIK Y WW+EALHGV+ G
Sbjct: 22 YKNASLPAEERAGLLLKELTLEEKVSLMMDSSKPVERLGIKPYNWWNEALHGVARAGL-- 79
Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAM--YNGGTA------GLTYW 153
AT FPQ I A+SF+ + VSDEARA Y GLT W
Sbjct: 80 --------ATVFPQPIGMAASFSPETVYEVFTAVSDEARAKNAYYASQGSRERYQGLTMW 131
Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDL 212
+P VNI+RDPRWGRG ET GEDP L+ + V+GLQG+ DG K+ A KHF +
Sbjct: 132 TPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGTNDGKYDKLHACAKHFAVHSG 191
Query: 213 DNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
WN R FNA+ + +D+ +T+ PF V EGKV VMC+YN+ G P C +L
Sbjct: 192 PEWN---RHEFNAENIKPRDLYETYLPPFEALVKEGKVKEVMCAYNRFEGDPCCGSDRLL 248
Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYY-DTQHFTS-TPEEAAADAIRAGLDLDCGPFLGLHT 329
+ +R EW +G ++SDC ++ +Y D H T E A+A A+ +G DL+CG
Sbjct: 249 MQILRDEWGFDGIVLSDCGAIADFYRDYGHKTHLDAESASAAAVLSGTDLECGSSYEALV 308
Query: 330 ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEA 389
E AV++G + E ++ A+ LT + LG D EP + + V + H LAL+
Sbjct: 309 E-AVKQGKIDEKAVDVAVKRLLTARFALGEMD-EPEKVSWTGIPFSVVASAGHDSLALDM 366
Query: 390 ARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI---- 445
AR+ + LL N+ +LPL T+AV+GPN++ +V GNY G+ T L GI
Sbjct: 367 ARKSMTLLMNKDNTLPLKR-GGLTIAVMGPNANDSVMQWGNYNGMPPHTVTILDGIRKAL 425
Query: 446 GRYARTIHQQGC 457
G R I++QGC
Sbjct: 426 GSDDRLIYEQGC 437
Score = 126 bits (317), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 94/300 (31%), Positives = 145/300 (48%), Gaps = 47/300 (15%)
Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
+++ + AD + V G+ ++E E + DR + LP Q+EL++ + A K
Sbjct: 593 SVERVKDADIVVFVGGVSPNLEGEEMGVELPGFRGGDRTDIELPAVQRELIAALHRAGK- 651
Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
+LV SG PI R AI+ A YPGQAGGTA+A++LFG NP G+LP+T+Y
Sbjct: 652 KVVLVNCSGSPI--GLEPETGRCGAILQAWYPGQAGGTAVAEVLFGDYNPAGRLPVTFYR 709
Query: 581 QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
++ LP E GRTYR+ ++PFGHG+SYT+F + AV
Sbjct: 710 N--VSQLPDFE-------DYNMTGRTYRYMTQEPLFPFGHGLSYTSFSYG--------AV 752
Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA 700
L GS N SG+ ++++ V V N G DG + V+
Sbjct: 753 VL----GSDNIK-SGEKLRLS------------VPVTNTGKCDGEEVVQVYLKKNDDVEG 795
Query: 701 PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
P K L AF++VH+PAG V ++ + + +S T + G + + +GG+ + L
Sbjct: 796 PSKALRAFKRVHIPAGKTVDVEFDLGDKELVWWNPQSNTMCVSEGSYELMVGGSSQTADL 855
>gi|294675359|ref|YP_003575975.1| 1,4-beta-xylosidase [Prevotella ruminicola 23]
gi|225016052|gb|ACN78955.1| xylosidase/arabinofuranosidase [Prevotella ruminicola]
gi|294472720|gb|ADE82109.1| putative 1,4-beta-xylosidase [Prevotella ruminicola 23]
Length = 861
Score = 257 bits (657), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 153/437 (35%), Positives = 231/437 (52%), Gaps = 29/437 (6%)
Query: 35 ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
A+ +TLP+ +L +R DL RL+L+EK L++ + A+PRLGIK + WWSEALHG
Sbjct: 17 ASAQTLPYQNPNLSAKERAVDLCSRLTLEEKAMLMLDESPAIPRLGIKKFFWWSEALHGA 76
Query: 95 SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNG--------- 145
+N+G T+FP+ + A+SFN L + + S E RA YN
Sbjct: 77 ANMG----------NVTNFPEPVGMAASFNPHLLFKVFDIASTEFRAQYNHRMYDLNGED 126
Query: 146 -GTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASC 204
L+ W+PNVNIFRDPRWGRGQET GEDP L+ V+GLQG + R + +C
Sbjct: 127 MKMRSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSVMGVQVVKGLQGPEDARYRKLWAC 186
Query: 205 -KHFTAYDLDNWNGVDRFHFN-AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGV 262
KH+ + + R N VS +D +T+ F+ V + KV VMC+Y +++
Sbjct: 187 AKHYAVHSGPEYT---RHTANLTDVSARDFWETYMPAFKTLVKDAKVREVMCAYQRLDDD 243
Query: 263 PTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG 322
P C +L++ +R EW +VSDC +V +Y+ +S + A+ AG D++CG
Sbjct: 244 PCCGSTRLLQQILRDEWGFEYLVVSDCGAVSDFYENHKSSSDAVHGTSKAVLAGTDVECG 303
Query: 323 PFLGLHT-ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD 381
+ AV++GLLSE +++ ++ L + LG D +PS + + + T
Sbjct: 304 FNYAYKSLPEAVRKGLLSEKEVDKHVIRLLEGRFDLGEMD-DPSLVEWSKIPYSAMSTKA 362
Query: 382 HQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTP 441
+AL+ ARQ IVLL+N+ LPL + +A+IGPN+ M GNY G T
Sbjct: 363 SANVALDMARQTIVLLQNKNNILPLKKNAEK-IAIIGPNAHNEPMMWGNYNGTPNHTVTI 421
Query: 442 LQGI-GRYARTIHQQGC 457
L G+ + + ++ GC
Sbjct: 422 LDGVKAKQKKLVYIPGC 438
Score = 99.4 bits (246), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 93/323 (28%), Positives = 137/323 (42%), Gaps = 62/323 (19%)
Query: 459 DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQ 508
DVA + + I + + I G+ S+E E + DR + LP Q+
Sbjct: 583 DVARELNIDYQETIAQLKGINKVIFCGGIAPSLEGEEMPVNIEGFKGGDRTSIELPKVQR 642
Query: 509 ELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTS 568
E + + A K I V SG I A AI+ A YPGQ GGTA+AD+LFG
Sbjct: 643 EFLKALKAAGK-QVIYVNCSGSAI--ALQPETESCDAIVQAWYPGQEGGTAVADVLFGDY 699
Query: 569 NPGGKLPMTWYPQEYITNLPMTE-MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNF 627
NPGGKL +T+Y + LP E +M+ GRTYR++ ++PFG+G+SYT F
Sbjct: 700 NPGGKLSVTFYKND--QQLPDYEDYSMK--------GRTYRYFDD-ALFPFGYGLSYTTF 748
Query: 628 VHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLG----VQVDVKNVGSKD 683
+V AK T G VQ+ V N G+K+
Sbjct: 749 -------------------------------EVGEAKVEAATDGALYNVQIPVTNTGTKN 777
Query: 684 GAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRI 742
G+ T+ ++ P K L FE++ + AG + + + L D + T R
Sbjct: 778 GSETIQLYIRNLQDPDGPLKSLRGFERLDIKAGKTATANLKL-TKESLEFWDAETNTMRT 836
Query: 743 PLGEHNIHIGGTKHSVSLHAATL 765
G++ I G + L T+
Sbjct: 837 KPGKYEILYGTSSLDKDLKKLTI 859
>gi|160884133|ref|ZP_02065136.1| hypothetical protein BACOVA_02110 [Bacteroides ovatus ATCC 8483]
gi|423291392|ref|ZP_17270240.1| hypothetical protein HMPREF1069_05283 [Bacteroides ovatus
CL02T12C04]
gi|156110475|gb|EDO12220.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
ovatus ATCC 8483]
gi|392663392|gb|EIY56942.1| hypothetical protein HMPREF1069_05283 [Bacteroides ovatus
CL02T12C04]
Length = 735
Score = 257 bits (656), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 231/795 (29%), Positives = 369/795 (46%), Gaps = 109/795 (13%)
Query: 8 FFLGLILLSASSSGLAAREPFACDPKDATTRTLP-FCQVSLPIPQRVNDLIGRLSLQEKV 66
LGL LL + S L++++ +++P + PI +R++DLI R++L+EK+
Sbjct: 6 LILGLCLLGVTHS-LSSKD----------KKSIPLYKDAKAPIEKRIDDLISRMTLEEKI 54
Query: 67 KLLI--------------SGAAAVP-RLGIKGYEWWSEALHGV--------SNVGPGTKF 103
L VP +G Y + L S +G F
Sbjct: 55 LQLNQYTLGRNNNVNNVGEEVKKVPSEIGSLIYFDINPELRNSMQKKAMEESRLGIPIIF 114
Query: 104 GGDFPGA--TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIF 160
G D T +P + A S+N L E V + EAR +G+ + +SP +++
Sbjct: 115 GYDAIHGFRTIYPISLGQACSWNPGLVEQACAVSAQEAR------MSGVDWTFSPMIDVA 168
Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVD 219
RDPRWGR E GEDP +G +AA+ VRG QG D ++AA KH+ Y
Sbjct: 169 RDPRWGRVAEGYGEDPYTNGVFAAASVRGYQGDDMSAENRMAACLKHYVGYGASE---AG 225
Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
R + ++S Q + DT+ +P+ M V G A++M S+N ++GVP A+P I+ ++ W
Sbjct: 226 RDYVYTEISAQTLWDTYLLPYEMGVKAG-AATLMSSFNDISGVPGSANPYIMTEILKKRW 284
Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPF-LGLHTESAVQRGLL 338
+ +G+IVSD +V Q +T ++AA A AGL++D + + V+ G +
Sbjct: 285 KHDGFIVSDWGAVE-QLKNQGLAATKKDAARYAFNAGLEMDMMSHAYDRYLKELVEEGKV 343
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHL-GPKD-VCTPDHQELALEAARQGIVL 396
+ ++ ++ L V+ RLG+F+ +PY + KD P +A + A + +VL
Sbjct: 344 TMAQVDESVRRVLRVKFRLGLFE-----RPYTPVTNEKDRFFRPQSMAVAAQLAAESMVL 398
Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG------IACGYTTPLQGIGRYAR 450
LKN LPL++ + +AV+GP + ++G++ G + Y G A
Sbjct: 399 LKNDNQILPLTN--KKKIAVVGPMAKNGWDLLGSWCGHGKDTDVEMLYDGLTAEFGGDAE 456
Query: 451 TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQEL 510
+ GCK D F A+D +R +D I+ +G + E R+ + LP Q+EL
Sbjct: 457 LRYAMGCKPQG-NDRSGFAGALDVARWSDVVIVCLGEMLTWSGENASRSTIALPQIQEEL 515
Query: 511 VSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNP 570
V ++ A K P ILVL +G P+++ + +P AI+ PG G ++A IL G NP
Sbjct: 516 VKELKEAGK-PIILVLSNGRPLEL--NRMEPLCDAILEIWQPGINGARSMAGILSGRINP 572
Query: 571 GGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYR-FYKGPV---VYPFGHGMSYTN 626
GKL MT P + + ++R GR ++ FYK +YPFGHG+SYT
Sbjct: 573 SGKLAMT---------FPYSTGQIPIYYNRRKSGRGHQGFYKDITSDPLYPFGHGLSYTE 623
Query: 627 FVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAH 686
F + T++ A KV L +V V N G++DGA
Sbjct: 624 FKY---------------------GTVTPSATKVKRGD----KLSAEVTVTNTGARDGAE 658
Query: 687 TLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLG 745
T+ F S P P K+L FEK + AG + +I + + V+ G R + G
Sbjct: 659 TVHWFISDPYCSITRPVKELKHFEKQFIKAGETKTFRFDIDLERDFGFVNEDGKRFLEAG 718
Query: 746 EHNIHIGGTKHSVSL 760
E++I + G + L
Sbjct: 719 EYHILVQGQTVKIEL 733
>gi|255693560|ref|ZP_05417235.1| periplasmic beta-glucosidase [Bacteroides finegoldii DSM 17565]
gi|260620625|gb|EEX43496.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
finegoldii DSM 17565]
Length = 770
Score = 257 bits (656), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 203/685 (29%), Positives = 330/685 (48%), Gaps = 78/685 (11%)
Query: 105 GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDP 163
G + G+T +P I A++F+ L E + R + E RA +G+ + ++PNV + RD
Sbjct: 141 GLYRGSTIYPTPIGQAATFDPALVERMSRETAIEMRA------SGMHWTFTPNVEVARDA 194
Query: 164 RWGRGQETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFT--AYDLDNWNGVDR 220
RWGR ET GEDP L G+ A+ VRG Q D KV A KH + + NG
Sbjct: 195 RWGRVGETFGEDPYLVGQMGAATVRGFQTKDFTGNDKVIACAKHLVGGSQPANGINGAP- 253
Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
A++S++ +++ F PF+ C +E V +VM ++N++NG+P + ++ +R +W+
Sbjct: 254 ----AELSERTLQEVFFPPFKDC-LEAGVFTVMTAHNELNGIPCHGNKYLMTEVLRNQWK 308
Query: 281 LNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC-GPFLGLHTESAVQRGLLS 339
+G++VSD + +D + T ++A ++ AG+ + GP V+ G +
Sbjct: 309 FDGFVVSDWMDIERMHDYHNVAETLKDAYQISVDAGMGMHMHGPEFYEAIIECVKEGSIP 368
Query: 340 EIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKD--VCTPDHQELALEAARQGIVLL 397
E I+ A+ L V+ RLG+F+ P+ L KD V HQ+ ALE AR+ IVLL
Sbjct: 369 EKQIDAAVSKILEVKFRLGLFEN-----PFIDLKKKDEIVFNEKHQQTALEGARKSIVLL 423
Query: 398 KNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYA--GIACGYTTPLQGIGRYARTIHQQ 455
KN+G LPL +++ V V G N++ +++G++A TT L+G+ + +
Sbjct: 424 KNEGNMLPLDASKYKKVFVTGHNAN-NQSILGDWAMEQPEEHVTTVLKGLKAISPETNYN 482
Query: 456 GCK---DVACADDQLFGAAIDASRQADATILVMG-------LDQSIEAEALDRAGLLLPG 505
+V D A+ +R +D ILV+G ++ E DR L LPG
Sbjct: 483 FLDLGWNVRLLSDNQIKEAVQQARSSDLAILVVGENSMRYHWNEKTCGENSDRYELSLPG 542
Query: 506 RQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILF 565
RQQELV V+ A+ PT+++L++G P+ + D + II A PG AGG A+A+IL+
Sbjct: 543 RQQELVEAVA-ATGVPTVVILVNGRPLTTEWI--DENMPCIIEAWEPGVAGGQALAEILY 599
Query: 566 GTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPV--VYPFGHGMS 623
G NP GKLP+T +P + ++ + ++ + + G +Y FG+G+S
Sbjct: 600 GKVNPSGKLPIT---------IPRSTGQIQCMYNHKFTNHWFPYATGNSLPLYEFGYGLS 650
Query: 624 YTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKD 683
YT + + P K++KVT VDV N G D
Sbjct: 651 YTTYKYENLKLSEATITP-------------DKSVKVT------------VDVTNTGKMD 685
Query: 684 GAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRI 742
G T+ L + P K+L F ++ + AG + V + + LS D + +
Sbjct: 686 GEETVQLYIRDEYSSATRPVKELKDFARIPLKAGETKEVSFTL-TPEMLSYYDANMHYGV 744
Query: 743 PLGEHNIHIGGTKHSVSLHAATLGV 767
G I +G + L + L V
Sbjct: 745 EKGTFKIMVGASSRDTDLQSIILTV 769
>gi|390433227|ref|ZP_10221765.1| beta-D-glucoside glucohydrolase [Pantoea agglomerans IG1]
Length = 765
Score = 257 bits (656), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 205/665 (30%), Positives = 330/665 (49%), Gaps = 103/665 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASS++ + +GRV + Y GL W+P V++ R+PRWGRG
Sbjct: 118 TIFPIPLALASSWDLDAVKEVGRV------SAYEAADDGLNMTWAPMVDVSREPRWGRGS 171
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + S V+ +QG S DR V S KHF Y ++N VD
Sbjct: 172 EGFGEDTYLTSEMGRSMVQSMQGKSAADRYSVMTSVKHFALYGAVEGGRDYNTVD----- 226
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + P++ + G VM + N +NGVP AD +LK +R +W+ G
Sbjct: 227 --MSPQRMFQDYLPPYKASLDAGS-GGVMVALNSINGVPATADSWLLKDLLRDKWKFKGI 283
Query: 285 IVSDCDSVGVYYDTQH-FTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
+SD ++ +H S P+EA A+++G+D+ + + + V+ G ++ +
Sbjct: 284 TISDHGAIKEL--IKHGVASDPQEAVRIALKSGVDMSMSDEYYSKYLPALVKSGDVTMAE 341
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
I++A + L V+ +G+F+ PY HLGPK+ D H+ A + AR+ IV
Sbjct: 342 IDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPQDTNAESRLHRAEARDVARKSIV 396
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI-------- 445
LLKN+ +LPL + T+A+IGP +D ++G++ AG+A + LQG+
Sbjct: 397 LLKNRLETLPLK--KSGTIALIGPLADSQRDIMGSWSAAGVAKQSVSLLQGMRNATEGKA 454
Query: 446 ----GRYARTIHQQGCKDV------ACADD-----QLFGAAIDASRQADATILVMGLDQS 490
+ A +G +D A + D QL A+ ++QAD + +G Q
Sbjct: 455 TLLYEKGANVTDNKGIQDFLNLYEQAVSVDTRSPQQLIDDAVAKAKQADVVVAAVGEAQG 514
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIA-AIIWA 549
+ EA R+ L++P QQ+L++ + K P ++VLM+G P+ V N+ R+A A++
Sbjct: 515 MAHEASSRSDLVIPPSQQKLLAALKATGK-PLVIVLMNGRPLSVV---NEDRMADAMLET 570
Query: 550 GYPGQAGGTAIADILFGTSNPGGKLPMTWYPQE------YITNLPMTEMAMRPSQSKRYP 603
+ G GG AIAD+LFG NP GKLP++ +P+ Y +LP T ++ +Y
Sbjct: 571 WFSGTEGGNAIADVLFGDYNPSGKLPVS-FPRSVGQIPIYYNHLP-TGRPYNFAKPNKYT 628
Query: 604 GRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHA 663
Y GP +YPFG+G+SYT F TV +P ++ P R+GSI A+++
Sbjct: 629 SHYYDAANGP-LYPFGYGLSYTTF--TV--SPVKMSSPTMPRNGSIEASVT--------- 674
Query: 664 KCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVG 722
V N G +DGA + ++ P + P ++L F+++ + AG Q V
Sbjct: 675 ------------VTNSGKRDGATVVQMYLNDPVASISRPVQELRGFKRIMLKAGESQTVT 722
Query: 723 INIHV 727
I V
Sbjct: 723 FKIDV 727
>gi|298479985|ref|ZP_06998184.1| periplasmic beta-glucosidase [Bacteroides sp. D22]
gi|298273794|gb|EFI15356.1| periplasmic beta-glucosidase [Bacteroides sp. D22]
Length = 735
Score = 257 bits (656), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 233/795 (29%), Positives = 368/795 (46%), Gaps = 109/795 (13%)
Query: 8 FFLGLILLSASSSGLAAREPFACDPKDATTRTLP-FCQVSLPIPQRVNDLIGRLSLQEKV 66
LGL LL + S L++++ +++P + PI +R++DLI R++L+EKV
Sbjct: 6 LILGLCLLGVTHS-LSSKD----------KKSIPLYKDAKAPIEKRIDDLISRMTLEEKV 54
Query: 67 KLLI--------------SGAAAVP-RLGIKGYEWWSEALHGV--------SNVGPGTKF 103
L VP +G Y + L S +G F
Sbjct: 55 LQLNQYTLGRNNNVNNVGEEVKKVPSEIGSLIYFDINPELRNSMQKKAMEESRLGIPIIF 114
Query: 104 GGDFPGA--TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIF 160
G D T +P + A S+N L E V + EAR +G+ + +SP +++
Sbjct: 115 GYDAIHGFRTIYPISLGQACSWNPGLVEQACAVSAQEAR------MSGVDWTFSPMIDVA 168
Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVD 219
RDPRWGR E GEDP +G +AA+ VRG QG D ++AA KH+ Y
Sbjct: 169 RDPRWGRVAEGYGEDPYTNGVFAAASVRGYQGDDMSAENRMAACLKHYVGYGASE---AG 225
Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
R + ++S Q + DT+ +P+ M V G A++M S+N ++GVP A+P I+ ++ W
Sbjct: 226 RDYVYTEISAQTLWDTYLLPYEMGVKAG-AATLMSSFNDISGVPGSANPYIMTEILKKRW 284
Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPF-LGLHTESAVQRGLL 338
+ +G+IVSD +V Q +T ++AA A AGL++D H + V+ G +
Sbjct: 285 KHDGFIVSDWGAVE-QLKNQGLAATKKDAAQYAFNAGLEMDMMSHAYDRHLKELVEEGKV 343
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHL-GPKD-VCTPDHQELALEAARQGIVL 396
+ ++ ++ L V+ LG+F+ +PY + KD P +A + A + +VL
Sbjct: 344 TMAQVDESVRRVLRVKFCLGLFE-----RPYTPVTNEKDRFFRPQSMAVAAQLAAESMVL 398
Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG------IACGYTTPLQGIGRYAR 450
LKN LPL++ + +AV+GP + ++G++ G + Y G A
Sbjct: 399 LKNDNQILPLTN--KKKIAVVGPMAKNGWDLLGSWCGHGKDTDVEMLYDGLTAEFGGDAE 456
Query: 451 TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQEL 510
+ GCK D F A+D +R +D I+ +G + E R+ + LP Q+EL
Sbjct: 457 LRYAMGCKPQG-NDRSGFAGALDVARWSDVVIVCLGEMLTWSGENASRSTIALPQIQEEL 515
Query: 511 VSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNP 570
V ++ A K P ILVL +G P+++ + +P AI+ PG G ++A IL G NP
Sbjct: 516 VKELKEAGK-PVILVLSNGRPLEL--NRMEPLCDAILEIWQPGINGARSMAGILSGRINP 572
Query: 571 GGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYR-FYKGPV---VYPFGHGMSYTN 626
GKL MT P + + ++R GR ++ FYK +YPFGHG+SYT
Sbjct: 573 SGKLAMT---------FPYSTGQIPIYYNRRKSGRGHQGFYKDITSDPLYPFGHGLSYTE 623
Query: 627 FVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAH 686
F + T++ A KV L +V V N GS+DGA
Sbjct: 624 FKY---------------------GTVTPSATKVKRGD----KLSAEVTVTNTGSRDGAE 658
Query: 687 TLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLG 745
T+ F S P P K+L FEK + AG + +I + + V+ G R + G
Sbjct: 659 TVHWFISDPYCSITRPVKELRHFEKQLIKAGETKTFRFDIDLERDFGFVNEDGKRFLEAG 718
Query: 746 EHNIHIGGTKHSVSL 760
E++I + G + L
Sbjct: 719 EYHILVQGQTVKIEL 733
>gi|365108332|ref|ZP_09336233.1| periplasmic beta-glucosidase [Citrobacter freundii 4_7_47CFAA]
gi|363640688|gb|EHL80138.1| periplasmic beta-glucosidase [Citrobacter freundii 4_7_47CFAA]
Length = 765
Score = 257 bits (656), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 204/669 (30%), Positives = 324/669 (48%), Gaps = 103/669 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN +GR+ + Y GL W+P V++ RDPRWGR
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRI------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V +QG S DR V S KHF AY +N VD
Sbjct: 172 EGFGEDTYLTATMGKTMVEAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM N +NG P +D +LK +R EW G
Sbjct: 227 --MSPQRLFNDYMPPYKAGLDAGS-GAVMVGLNSLNGTPATSDAWLLKDVLRDEWGFKGI 283
Query: 285 IVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T S PE+A AI++G+++ + + ++ G ++ +
Sbjct: 284 TVSDHGAIKEL--IKHGTASDPEDAVRVAIKSGINMSMSDEYYSKYLPGLIKSGKVTMAE 341
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLG K+ D H++ A E AR+ +V
Sbjct: 342 LDDAARHVLNVKYDMGLFN-----DPYSHLGAKESDPQDTNAESRLHRKEAREVARESLV 396
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGIGR------ 447
LLKN+ +LPL + TVAV+GP +D ++G++ AG+A T L GI
Sbjct: 397 LLKNRLETLPLK--KSATVAVVGPLADSKRDVMGSWSAAGVADQSVTVLTGIKNAVGDQG 454
Query: 448 ---YAR--------------TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQS 490
YA+ ++++ K + + A++A++Q+D + V+G Q
Sbjct: 455 KVLYAKGANITNDKGIVDFLNLYEEAVKVDPRSPQAMIDEAVNAAKQSDVVVAVVGEAQG 514
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 515 MAHEASSRTDITIPQSQRDLIAALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM---RP---SQSKRYPG 604
+ G GG AIAD+LFG NP GKLPM+ +P+ + +P+ + RP + +Y
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPMS-FPRS-VGQIPVYYSHLNTGRPYNADKPNKYTS 629
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F TV++ ++ P R G++ A+
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSDVK--LSSPTMKRDGTVTAS------------ 672
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
V V N G ++GA + ++ A P KQL FEKV++ G Q V
Sbjct: 673 ---------VQVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKVNLKPGETQTVSF 723
Query: 723 -INIHVCKY 730
I+I K+
Sbjct: 724 PIDIEALKF 732
>gi|270294390|ref|ZP_06200592.1| conserved hypothetical protein [Bacteroides sp. D20]
gi|270275857|gb|EFA21717.1| conserved hypothetical protein [Bacteroides sp. D20]
Length = 864
Score = 257 bits (656), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 169/432 (39%), Positives = 234/432 (54%), Gaps = 32/432 (7%)
Query: 42 FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
+ SLP +R L+ L+L+EKV L++ + V RLGIK Y WW+EALHGV+ G
Sbjct: 22 YKNASLPAEERAGLLLKELTLEEKVSLMMDSSKPVERLGIKPYNWWNEALHGVARAGL-- 79
Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAM--YNGGTA------GLTYW 153
AT FPQ I A+SF+ + VSDEARA Y GLT W
Sbjct: 80 --------ATVFPQPIGMAASFSPETVYEVFTAVSDEARAKNAYYASQGSRERYQGLTMW 131
Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDL 212
+P VNI+RDPRWGRG ET GEDP L+ + V+GLQG+ DG K+ A KHF +
Sbjct: 132 TPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGTNDGKYDKLHACAKHFAVHSG 191
Query: 213 DNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
WN R FNA+ + +D+ +T+ PF V EGKV VMC+YN+ G P C +L
Sbjct: 192 PEWN---RHEFNAENIKPRDLYETYLPPFEALVKEGKVKEVMCAYNRFEGDPCCGSDRLL 248
Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCGPFLGLHT 329
+ +R EW +G ++SDC ++ +Y + P E A+A A+ +G DL+CG
Sbjct: 249 MQILRDEWGFDGIVLSDCGAIADFYRDYGHKTHPDAESASAVAVLSGTDLECGSSYEALV 308
Query: 330 ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEA 389
E AV++G + E ++ A+ LT + LG D EP + + V + H LAL+
Sbjct: 309 E-AVKQGKIDEKAVDVAVKRLLTARFALGEMD-EPEKVSWTGIPFSVVASAGHDSLALDM 366
Query: 390 ARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI---- 445
AR+ + LL N+ +LPL TVAV+GPN++ +V GNY G+ T L GI
Sbjct: 367 ARKSMTLLMNKDNTLPLKR-GGLTVAVMGPNANDSVMQWGNYNGMPPHTVTILDGIRKAL 425
Query: 446 GRYARTIHQQGC 457
G R I++QGC
Sbjct: 426 GTDDRLIYEQGC 437
Score = 126 bits (316), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 94/300 (31%), Positives = 145/300 (48%), Gaps = 47/300 (15%)
Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
+++ + AD + V G+ ++E E + DR + LP Q+EL++ + A K
Sbjct: 593 SVERVKDADIVVFVGGVSPNLEGEEMGVELPGFRGGDRTDIELPAVQRELIAALHRAGK- 651
Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
+LV SG PI R AI+ A YPGQAGGTA+A++LFG NP G+LP+T+Y
Sbjct: 652 KVVLVNCSGSPI--GLEPETGRCGAILQAWYPGQAGGTAVAEVLFGDYNPAGRLPVTFYR 709
Query: 581 QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
++ LP E GRTYR+ ++PFGHG+SYT+F + AV
Sbjct: 710 N--VSQLPDFE-------DYNMTGRTYRYMTQEPLFPFGHGLSYTSFSYG--------AV 752
Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA 700
L GS N SG+ ++++ V V N G DG + V+
Sbjct: 753 VL----GSDNIK-SGEKLRLS------------VPVTNTGKCDGEEVVQVYLKKNDDVEG 795
Query: 701 PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
P K L AF++VH+PAG V ++ + + +S T + G + + +GG+ + L
Sbjct: 796 PSKALRAFKRVHIPAGKTVDVEFDLGDKELVWWNPQSNTMCVSEGSYELMVGGSSQTADL 855
>gi|423223721|ref|ZP_17210190.1| hypothetical protein HMPREF1062_02376 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392638096|gb|EIY31949.1| hypothetical protein HMPREF1062_02376 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 954
Score = 257 bits (656), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 212/737 (28%), Positives = 356/737 (48%), Gaps = 91/737 (12%)
Query: 39 TLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISG--AAAVPRLGIKGYEWWSEALHGVSN 96
+L + +LP+ +RV L+ ++ ++K++L+ G +P L + EA+HG S
Sbjct: 167 SLRYMDPTLPVEERVESLLSVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFS- 224
Query: 97 VGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPN 156
+G GAT FPQ + +++N L E + V DE A GT + WSP
Sbjct: 225 ------YGS---GATIFPQALAMGATWNKKLTEDVAMAVGDETLA---AGT--MQAWSPV 270
Query: 157 VNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWN 216
+++ +D RWGR +ET GEDPVL + ++++G Q S G + + KHF +
Sbjct: 271 LDVAQDARWGRCEETFGEDPVLVSQIGGAWIKGYQ-SKG----LFTTPKHFGGHGAP-LG 324
Query: 217 GVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIR 276
G D +S++++ + VPFR + SVM +Y+ GVP +L +R
Sbjct: 325 GRDSHDIG--LSEREMREVHLVPFRHVIRNYDCQSVMMAYSDYLGVPVAKSRELLHSILR 382
Query: 277 GEWRLNGYIVSDCDSVGVYYDTQHFTSTPE-EAAADAIRAGLDLDCG-PFLGLHTESAVQ 334
EW +G+IVSDC ++G +H+T+ + EAA A+ AG+ +CG + A +
Sbjct: 383 EEWGFDGFIVSDCGAIGNLTARKHYTAKDKIEAANQALAAGIATNCGDTYNDKEVIQAAK 442
Query: 335 RGLLSEIDINNALVNTLTVQMRLGMFDGEPSS-QPYGHLGPKDVCTPDHQELALEAARQG 393
G ++ +++ L + R +F+ P+ + + P + H+E+A +AAR+
Sbjct: 443 DGRINMENLDEVCRTMLRMMFRNELFEKTPNKPLDWNKIYP-GWNSDSHKEMARQAARES 501
Query: 394 IVLLKNQGPSLPLSHIRHRTVAVIGPNSD-------VTVTMIGNYAGIACGYTTPLQGIG 446
IV+L+N+ LPL+ RT+AV+GP +D + G + G Q +G
Sbjct: 502 IVMLENKDNILPLAK-DMRTIAVVGPGADDLQPGDYTPKLLPGQLKSVLTGIK---QAVG 557
Query: 447 RYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEA---------EALD 497
+ + +++QGC D ++ A+ A+ Q+D +LV+G + E+ E D
Sbjct: 558 KQTKVVYEQGC-DFTSSNGTDIPKAVKAASQSDVVVLVLGDCSTSESTTDVYKTSGENHD 616
Query: 498 RAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGG 557
A L+LPG+QQEL+ V K P IL+L +G P +++ A + + W PGQ GG
Sbjct: 617 YATLILPGKQQELLEAVCATGK-PVILILQAGRPYNLSKASELCKAILVNW--LPGQEGG 673
Query: 558 TAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYP 617
A AD+LFG NP G+LPMT +P+ ++ LP+ + +RY FY P+ Y
Sbjct: 674 PATADVLFGDYNPAGRLPMT-FPR-HVGQLPL--YYNFKTSGRRYEYSDMEFY--PLYY- 726
Query: 618 FGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVK 677
FG+G+SYT+F + SG I+ + + + +Q VK
Sbjct: 727 FGYGLSYTSFEY------------------------SGLKIQ----EKDNGNVAIQATVK 758
Query: 678 NVGSKDGAHTLLVFSTPPAGHWAPH-KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDR 736
NVG + G + ++ T +L F +VH+ + V + + LS+++
Sbjct: 759 NVGQRAGDEVVQLYITDMYASVKTRITELKDFTRVHLQPDESKIVSFELTPYE-LSLLND 817
Query: 737 SGTRRIPLGEHNIHIGG 753
R + GE I +GG
Sbjct: 818 RMDRVVEKGEFKILVGG 834
>gi|319643197|ref|ZP_07997825.1| glycoside hydrolase family 3 [Bacteroides sp. 3_1_40A]
gi|345520511|ref|ZP_08799899.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
4_3_47FAA]
gi|254835034|gb|EET15343.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
4_3_47FAA]
gi|317385101|gb|EFV66052.1| glycoside hydrolase family 3 [Bacteroides sp. 3_1_40A]
Length = 788
Score = 257 bits (656), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 198/679 (29%), Positives = 312/679 (45%), Gaps = 82/679 (12%)
Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLT-YWSPNVNIFRDPRWGRG 168
AT FP +++N L IG V + EA A+ G T +SP ++I +DPRWGR
Sbjct: 158 ATYFPAQCGQGATWNKKLIARIGEVEAKEAVAL------GYTNIYSPILDIAQDPRWGRC 211
Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVS 228
ET GEDP L G+ + LQ + + A+ KHF Y + + + V+
Sbjct: 212 VETYGEDPYLVGELGKQMITSLQ-----KYNLVATPKHFAVYSIPIGGRDGKTRTDPHVA 266
Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
+++ + PFRM E VM SYN +G P + L +R EW GY+VSD
Sbjct: 267 PREMRTLYIEPFRMAFQEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKGYVVSD 326
Query: 289 CDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC-----GPFLGLHTESAVQRGLLSEIDI 343
++V + T E+ A A+ AGL++ F+ L AV G +S+ +
Sbjct: 327 SEAVEFISNKHKVADTYEDGIAQAVNAGLNIRTHFTPPADFI-LPLRKAVDDGKISQETL 385
Query: 344 NNALVNTLTVQMRLGMFDGEPSSQPY---GHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
+ + L ++ LG+FD PY G + V + +HQ ++LEAARQ +VLLKN+
Sbjct: 386 DKRVAEILRIKFWLGLFD-----NPYRGNGKQAEQIVHSKEHQAVSLEAARQSLVLLKNE 440
Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGR---YARTIHQQGC 457
LPLS R++AVIGPN+D +I Y T QGI +A I+++GC
Sbjct: 441 THLLPLSK-SIRSIAVIGPNADEQTQLICRYGPANAPIKTVYQGIKELLPHAEVIYKKGC 499
Query: 458 KDV--------------ACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLL 503
+ + +L I A++QA+ ++V+G ++ E R L L
Sbjct: 500 DIIDPHFPESEILDFPKTAEEVRLMQEVIRAAKQAEVVVMVLGGNELTVREDRSRTSLNL 559
Query: 504 PGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADI 563
PGRQ+EL+ V K P ILV++ G + +A + AI+ A +PG+ G A+A+
Sbjct: 560 PGRQEELLKAVCATGK-PVILVMLDGRASSINYAA--AHVPAILHAWFPGEFCGQAVAEA 616
Query: 564 LFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMS 623
LFG NPGG+L +T +P+ + +P +P + Y +YPFGHG+S
Sbjct: 617 LFGDYNPGGRLAVT-FPKS-VGQIPFA-FPFKPGSDESSSTSVY-----GALYPFGHGLS 668
Query: 624 YTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKD 683
YT F ++ + ++ G G I H C +KN G
Sbjct: 669 YTTFTYSDLH----ISPSHQGVQGDI------------HVSCK---------IKNTGKIK 703
Query: 684 GAHTLLVFSTPPAGHWAPH-KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRI 742
G + ++ + K L FE++ + AG +Q V + + L + D++ R+
Sbjct: 704 GDEVVQLYLRDEISSVTTYTKVLRGFERISLKAGEEQTVHFRLR-PQDLGLWDKNMNFRV 762
Query: 743 PLGEHNIHIGGTKHSVSLH 761
G + +G + + LH
Sbjct: 763 EPGSFKVMLGASSTDIRLH 781
>gi|440286925|ref|YP_007339690.1| beta-glucosidase-like glycosyl hydrolase [Enterobacteriaceae
bacterium strain FGI 57]
gi|440046447|gb|AGB77505.1| beta-glucosidase-like glycosyl hydrolase [Enterobacteriaceae
bacterium strain FGI 57]
Length = 765
Score = 257 bits (656), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 211/689 (30%), Positives = 330/689 (47%), Gaps = 102/689 (14%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN +GRV + Y GL W+P V++ RDPRWGR
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V +QG S DR V S KHF AY +N VD
Sbjct: 172 EGFGEDTYLTSIMGKTMVEAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R +W G
Sbjct: 227 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDAWLLKEILRDKWGFKGI 283
Query: 285 IVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T S PE+A A+++G+++ + + V+ G ++ +
Sbjct: 284 TVSDHGAIKEL--IKHGTASDPEDAVRVALKSGINMSMSDEYYSKYLPGLVKSGKVTMAE 341
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPK+ D H++ A + AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPQDTNAESRLHRKDARDVARESLV 396
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + TVAV+GP +D ++G++ AG+A T L GI G
Sbjct: 397 LLKNRLETLPLK--KSGTVAVVGPLADSKRDVMGSWSAAGVADQSVTVLTGIKNALGDKG 454
Query: 450 RTIHQQGCK---DVACAD----------------DQLFGAAIDASRQADATILVMGLDQS 490
+ ++ +G D D + A+ A++Q+D + V+G Q
Sbjct: 455 KVVYAKGANVTNDKGIVDFLNLYEEAVKVDPRSPQAMIDEAVAAAKQSDVVVAVVGEAQG 514
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 515 MAHEASSRTNIEIPQSQRDLIAALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM---RP---SQSKRYPG 604
+ G GG AIAD+LFG NP GKLPM+ +P+ + +P+ + RP + +Y
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPMS-FPRS-VGQIPVYYSHLNTGRPYNADKPNKYTS 629
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F TV++ ++ P R G + A+
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSDVK--MSAPSMTRDGKVTAS------------ 672
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGI 723
VDV N G ++GA + ++ A P KQL FEKV + G Q V
Sbjct: 673 ---------VDVTNSGKREGATVIQMYLQDVTASMSRPVKQLKGFEKVTLKPGETQTVSF 723
Query: 724 NIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
I V L ++S G+ N+ IG
Sbjct: 724 PIDV-DALKFWNQSMKYDAEPGKFNVFIG 751
>gi|237732180|ref|ZP_04562661.1| yohA [Citrobacter sp. 30_2]
gi|226907719|gb|EEH93637.1| yohA [Citrobacter sp. 30_2]
Length = 765
Score = 256 bits (655), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 204/669 (30%), Positives = 324/669 (48%), Gaps = 103/669 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN +GR+ + Y GL W+P V++ RDPRWGR
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRI------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V +QG S DR V S KHF AY +N VD
Sbjct: 172 EGFGEDTYLTATMGKTMVEAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM N +NG P +D +LK +R EW G
Sbjct: 227 --MSPQRLFNDYMPPYKAGLDAGS-GAVMVGLNSLNGTPATSDAWLLKDVLRDEWGFKGI 283
Query: 285 IVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T S PE+A AI++G+++ + + ++ G ++ +
Sbjct: 284 TVSDHGAIKEL--IKHGTASDPEDAVRVAIKSGINMSMSDEYYSKYLPGLIKSGKVTMAE 341
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLG K+ D H++ A E AR+ +V
Sbjct: 342 LDDAARHVLNVKYDMGLFN-----DPYSHLGAKESDPQDTNAESRLHRKEAREVARESLV 396
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGIGR------ 447
LLKN+ +LPL + TVAV+GP +D ++G++ AG+A T L GI
Sbjct: 397 LLKNRLETLPLK--KSATVAVVGPLADSKRDVMGSWSAAGVADQSVTVLTGIKNAVGDQG 454
Query: 448 ---YAR--------------TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQS 490
YA+ ++++ K + + A++A++Q+D + V+G Q
Sbjct: 455 KVLYAKGANITNDKGIVDFLNLYEEAVKVDPRSPQAMIDEAVNAAKQSDVVVAVVGEAQG 514
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 515 MAHEASSRTDITIPQSQRDLIAALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM---RP---SQSKRYPG 604
+ G GG AIAD+LFG NP GKLPM+ +P+ + +P+ + RP + +Y
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPMS-FPRS-VGQIPVYYSHLNTGRPYNADKPNKYTS 629
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F TV++ ++ P R G++ A+
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSDVK--LSSPTMKRDGTVTAS------------ 672
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
V V N G ++GA + ++ A P KQL FEKV++ G Q V
Sbjct: 673 ---------VQVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKVNLKPGETQTVSF 723
Query: 723 -INIHVCKY 730
I+I K+
Sbjct: 724 PIDIEALKF 732
>gi|455646191|gb|EMF25234.1| beta-D-glucoside glucohydrolase [Citrobacter freundii GTC 09479]
Length = 765
Score = 256 bits (655), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 210/691 (30%), Positives = 333/691 (48%), Gaps = 106/691 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN +GR+ + Y GL W+P V++ RDPRWGR
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRI------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V +QG S DR V S KHF AY +N VD
Sbjct: 172 EGFGEDTYLTATMGKTMVEAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM N +NG P +D +LK +R EW G
Sbjct: 227 --MSPQRLFNDYMPPYKAGLDAGS-GAVMVGLNSLNGTPATSDAWLLKDVLRDEWGFKGI 283
Query: 285 IVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T S PE+A AI++G+++ + + ++ G ++ +
Sbjct: 284 TVSDHGAIKEL--IKHGTASDPEDAVRVAIKSGINMSMSDEYYSKYLPGLIKSGKVTMAE 341
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLG K+ D H++ A E AR+ +V
Sbjct: 342 LDDAARHVLNVKYDMGLFN-----DPYSHLGAKESDPQDTNAESRLHRKEAREVARESLV 396
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGIGR------ 447
LLKN+ +LPL + TVAV+GP +D ++G++ AG+A T L GI
Sbjct: 397 LLKNRLETLPLK--KSATVAVVGPLADSKRDVMGSWSAAGVADQSVTVLTGIKNAVGDQG 454
Query: 448 ---YAR--------------TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQS 490
YA+ ++++ K + + A++A++Q+D + V+G Q
Sbjct: 455 KVLYAKGANITNDKGIVDFLNLYEEAVKVDPRSPQAMIDEAVNAAKQSDVVVAVVGEAQG 514
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 515 MAHEASSRTDITIPQSQRDLIAALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM---RP---SQSKRYPG 604
+ G GG AIAD+LFG NP GKLPM+ +P+ + +P+ + RP + +Y
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPMS-FPRS-VGQIPVYYSHLNTGRPYNADKPNKYTS 629
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F TV++ ++ P R G++ A+
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSDVK--LSSPTMKRDGTVTAS------------ 672
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
V V N G ++GA + ++ A P KQL FEKV++ G Q V
Sbjct: 673 ---------VQVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKVNLKPGETQTVSF 723
Query: 723 -INIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
I+I K+ + R P G+ N+ IG
Sbjct: 724 PIDIEALKFWN--QRMQYDAEP-GKFNVFIG 751
>gi|383115540|ref|ZP_09936296.1| hypothetical protein BSGG_2590 [Bacteroides sp. D2]
gi|313695055|gb|EFS31890.1| hypothetical protein BSGG_2590 [Bacteroides sp. D2]
Length = 770
Score = 256 bits (655), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 203/685 (29%), Positives = 330/685 (48%), Gaps = 78/685 (11%)
Query: 105 GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDP 163
G + G+T +P I A++F+ L E + R + E RA +G+ + ++PNV + RD
Sbjct: 141 GLYRGSTIYPTPIGQAATFDPALVERMSRETAIEMRA------SGMHWTFTPNVEVARDA 194
Query: 164 RWGRGQETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFT--AYDLDNWNGVDR 220
RWGR ET GEDP L G+ A+ VRG Q D KV A KH + + NG
Sbjct: 195 RWGRVGETFGEDPYLVGQMGAATVRGFQTKDFTGNDKVIACAKHLVGGSQPANGINGAP- 253
Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
A++S++ +++ F PF+ C +E V +VM ++N++NG+P + ++ +R +W+
Sbjct: 254 ----AELSERTLQEVFFPPFKDC-LEAGVFTVMTAHNELNGIPCHGNKYLMTEVLRNQWK 308
Query: 281 LNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC-GPFLGLHTESAVQRGLLS 339
+G++VSD + +D + T ++A ++ AG+ + GP V+ G +
Sbjct: 309 FDGFVVSDWMDIERMHDYHNVAETLKDAYRISVDAGMGMHMHGPEFYEAIIECVKEGSIP 368
Query: 340 EIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKD--VCTPDHQELALEAARQGIVLL 397
E I+ A+ L V+ RLG+F+ P+ L KD V HQ+ ALE AR+ IVLL
Sbjct: 369 EKQIDAAVSKILEVKFRLGLFEN-----PFIDLKKKDEIVFNEKHQQTALEGARKSIVLL 423
Query: 398 KNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYA--GIACGYTTPLQGIGRYARTIHQQ 455
KN+G LPL +++ V V G N++ +++G++A TT L+G+ + +
Sbjct: 424 KNEGNMLPLDASKYKKVFVTGHNAN-NQSILGDWAMEQPEEHVTTVLKGLKAISPETNYN 482
Query: 456 GCK---DVACADDQLFGAAIDASRQADATILVMG-------LDQSIEAEALDRAGLLLPG 505
+V D A+ +R +D ILV+G ++ E DR L LPG
Sbjct: 483 FLDLGWNVRLLSDNQIKEAVQQARNSDLAILVVGENSMRYHWNEKTCGENSDRYELSLPG 542
Query: 506 RQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILF 565
RQQELV V+ A+ PT+++L++G P+ + D + II A PG AGG A+A+IL+
Sbjct: 543 RQQELVKAVA-ATGVPTVVILVNGRPLTTEWI--DENMPCIIEAWEPGVAGGQALAEILY 599
Query: 566 GTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPV--VYPFGHGMS 623
G NP GKLP+T +P + ++ + ++ + + G +Y FG+G+S
Sbjct: 600 GKVNPSGKLPIT---------IPRSTGQIQCMYNHKFTNHWFPYATGNSLPLYEFGYGLS 650
Query: 624 YTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKD 683
YT + + P K++KVT VDV N G D
Sbjct: 651 YTTYKYENLKLSEATITP-------------DKSVKVT------------VDVTNTGKMD 685
Query: 684 GAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRI 742
G T+ L + P K+L F ++ + AG + V + + LS D + +
Sbjct: 686 GEETVQLYIRDEYSSATRPVKELKDFARIPLKAGETKEVSFTL-TPEMLSYYDANMHYGV 744
Query: 743 PLGEHNIHIGGTKHSVSLHAATLGV 767
G I +G + L + L V
Sbjct: 745 EKGTFKIMVGASSRDTDLQSIILTV 769
>gi|229818089|ref|ZP_04448371.1| hypothetical protein BIFANG_03380 [Bifidobacterium angulatum DSM
20098 = JCM 7096]
gi|229784693|gb|EEP20807.1| hypothetical protein BIFANG_03380 [Bifidobacterium angulatum DSM
20098 = JCM 7096]
Length = 775
Score = 256 bits (655), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 240/815 (29%), Positives = 366/815 (44%), Gaps = 133/815 (16%)
Query: 40 LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLG-IKGYEWWSEALH------ 92
LP+ LPI +R+ DL+GR++L+EKV ++ A L I ++ LH
Sbjct: 6 LPYKNPDLPIQERIADLLGRMTLEEKVGQMMQLDARNGNLDDIIINKYVGSILHTSPADL 65
Query: 93 --GVSNVGPGTKFG-----GD--------FPGATSFPQVITTASSFNATLWEAIGRVVSD 137
V V T+ G GD +PG+T FP + A+SF+ +A GR ++
Sbjct: 66 PRAVETVNTKTRLGIPLIIGDDCIHGYSFWPGSTIFPSQLGMATSFDPAKVQAAGRATAE 125
Query: 138 EARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG--SD 194
E T G+ + +SP + I RD RWGR ET GEDP L G+ A++ V+G QG
Sbjct: 126 EV------STTGVHWTFSPVLCIARDTRWGRVDETFGEDPYLIGEMASAIVKGYQGGAKA 179
Query: 195 GDRLK---VAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVAS 251
G+ L + A KHF Y + G D A +S + +E F PF EG +
Sbjct: 180 GEPLAKDAILACAKHFAGYS-ETQGGRDASE--ADLSHRKLESWFLPPFERVAKEG-CGT 235
Query: 252 VMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAAD 311
M Y ++GVP + +L +RG+W G +++D D+VG Q + +AAAD
Sbjct: 236 FMLGYESIDGVPVTFNKWLLSDKLRGDWNYQGTLITDWDNVGRAVWEQKVKADYVQAAAD 295
Query: 312 AIRAGLDL-DCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYG 370
A++AG DL P AV+RGLL E I+ A+ L ++ RLG+F E P
Sbjct: 296 AVKAGNDLIMTTPKFYEGAIEAVKRGLLDESLIDEAVSRILALKFRLGLF--EDPRLPDE 353
Query: 371 HLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGN 430
+ + + +HQEL L+ AR+ I LLKN G SLP + + +AV+GP +D +G+
Sbjct: 354 QRMKEAIGSSEHQELNLQIARESIALLKNNG-SLPFTATSGKRIAVVGPLADDAQEQLGD 412
Query: 431 YAG-------IACG-----YTTPLQGIGRYA----RTIHQQGCKDVACADD--------- 465
+AG I G TT L G + A ++ +G + DD
Sbjct: 413 WAGNSGQVNWIPDGQPRNMITTVLDGFKQLAPEGCEVVYSRGANILDLVDDPEGKFYPDG 472
Query: 466 --------------QLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELV 511
L A++ +RQ+D + V+G + E A L L G Q L+
Sbjct: 473 QPRPKLGVSAAIDQALLDEAVENARQSDLVVAVVGDVVQLVGETCSTATLELLGGQNALL 532
Query: 512 SKVSMASK---GPTILVLMSGGP--------------IDVAFAKNDPRIAAIIWAGYPGQ 554
+S +K P ++VLMS P +D + A +A +WA PG
Sbjct: 533 EALSDVAKETGKPLVVVLMSSKPQVMPACVIGTNGVIVDESAADG---TSAFMWAPNPGM 589
Query: 555 AGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPV 614
GG AIA+I+ G +NP G+LP+T +P+ + LP+ +R RY T + P
Sbjct: 590 KGGQAIAEIILGKTNPSGRLPIT-FPR-HAGQLPVYYNQIRGQHGNRYADLT----QDP- 642
Query: 615 VYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQV 674
+ FG G+SYT F + PT+ VP G T K + T+ ++
Sbjct: 643 AFAFGEGLSYTTFAY---GEPTITNVPDSG----------------TFTKTD--TVHAEI 681
Query: 675 DVKNVGSKDGAHTLLVF--STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLS 732
+ N G + G + + + W ++L AF++V + G + V +I V +
Sbjct: 682 TLTNTGERRGTEVVQAYIGDIVTSYSWT-DRELKAFQRVELEPGETRTVAFDIPVAD-CT 739
Query: 733 VVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGV 767
+VD R + GE + IG + L T V
Sbjct: 740 IVDPDARRIVEPGEFEVLIGHSSRRKDLRRTTFTV 774
>gi|294674604|ref|YP_003575220.1| family 3 glycosyl hydrolase [Prevotella ruminicola 23]
gi|294474051|gb|ADE83440.1| glycosyl hydrolase, family 3 [Prevotella ruminicola 23]
Length = 869
Score = 256 bits (655), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 155/447 (34%), Positives = 237/447 (53%), Gaps = 49/447 (10%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
P+ L +RV +L+ L+ +EKV L+++ + ++ RLGI Y WWSEA HGV
Sbjct: 24 PYLDTKLSNKERVENLLSLLTPEEKVGLMMNKSISIDRLGIPSYNWWSEACHGVRQ---- 79
Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNG--------------- 145
G T +PQ I A++FNA L+ + VSDEARA +N
Sbjct: 80 -------DGYTVYPQPIGMAAAFNAQLFYDVFSQVSDEARANWNRTDHNDPKLFNVPMGV 132
Query: 146 ----GTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVA 201
G LT+W PNVNIFRDPRWGRGQET GEDP L+ V G+QG++ K
Sbjct: 133 TYYPGNPELTFWCPNVNIFRDPRWGRGQETCGEDPYLNAVLGVQTVLGMQGNNDKYFKTH 192
Query: 202 ASCKHFTAYDLDNWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQV 259
A KH+ + +G + R N + + +D+ +T+ F+ V +G V VMC+Y +
Sbjct: 193 ACAKHYAVH-----SGPEPLRHSMNVEPTNRDLWETYLPAFKALVKKGNVREVMCAYQRF 247
Query: 260 NGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDT-QHFTSTPE-EAAADAIRAGL 317
G P C +L +R +W + +++DCD++ +++ QH T A+ DA+ G
Sbjct: 248 EGKPCCTSDRLLIDILRNKWGYDAIVLTDCDAINNFFNRGQHETHKDGLSASVDAVLNGT 307
Query: 318 DLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKD 376
DL+CG + +++GL+ E D++N L TL + LGMFD P+ P+ LG
Sbjct: 308 DLECGKVF-MSLVEGLKKGLIKESDLDNHLRKTLMGRFELGMFD--PADMLPWAKLGADV 364
Query: 377 VCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG--I 434
+ + + +A++AAR+ +VLL+N+ LPLS +T+AV+GPN+D + GNY G
Sbjct: 365 ISSEKNDAMAVQAARESMVLLENKAGILPLSK-SIKTLAVLGPNADDVNMLNGNYGGTPT 423
Query: 435 ACGYTTPLQGIGRY---ARTIHQQGCK 458
A + L GI A+ I+ + C+
Sbjct: 424 AAHQHSLLSGIKAAVPGAKIIYNKACE 450
Score = 114 bits (285), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 83/251 (33%), Positives = 123/251 (49%), Gaps = 39/251 (15%)
Query: 476 RQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
++ADA +++ G+ +E E D+ + LP QQ LV + K P I V SG I A
Sbjct: 614 KEADAIVIIGGISAQMEGEGGDKQDIELPKVQQMLVKAMHKTGK-PVIFVNCSGSAI--A 670
Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAM 594
F + A++ A Y GQ G A+A++LFG NPGGKLP+T+Y +LP + +M
Sbjct: 671 FGSVEGEYDALLQAWYAGQGGAKALAEVLFGDYNPGGKLPVTFYRSN--NDLPDFLDYSM 728
Query: 595 RPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATIS 654
+ RTYR++ G Y FG+G+SYT F +A IS
Sbjct: 729 K--------NRTYRYFTGVPQYAFGYGLSYTTFALG-------------------DAKIS 761
Query: 655 GKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVP 714
K +K K ++TL V V N G ++G T+ V+ AP K L F+K+++
Sbjct: 762 AKQMK----KNGKVTL--TVPVTNTGKREGTETVQVYVKRLDDAGAPIKALKGFQKLNLK 815
Query: 715 AGAQQRVGINI 725
AG Q+ I +
Sbjct: 816 AGETQKATITL 826
>gi|304395778|ref|ZP_07377661.1| glycoside hydrolase family 3 domain protein [Pantoea sp. aB]
gi|304357072|gb|EFM21436.1| glycoside hydrolase family 3 domain protein [Pantoea sp. aB]
Length = 765
Score = 256 bits (655), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 203/665 (30%), Positives = 330/665 (49%), Gaps = 103/665 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASS++ + +GRV + Y GL W+P V++ R+PRWGRG
Sbjct: 118 TIFPIPLALASSWDLDAVKEVGRV------SAYEAADDGLNMTWAPMVDVSREPRWGRGS 171
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + S V+ +QG S DR V S KHF Y ++N VD
Sbjct: 172 EGFGEDTYLTSEMGRSMVQSMQGKSAADRYSVMTSVKHFALYGAVEGGRDYNTVD----- 226
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + P++ + G VM + N +NGVP AD +LK +R +W+ G
Sbjct: 227 --MSPQRMFQDYLPPYKASLDAGS-GGVMVALNSINGVPATADSWLLKDLLRDKWKFKGI 283
Query: 285 IVSDCDSVGVYYDTQH-FTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
+SD ++ +H S P+EA A+++G+D+ + + + V+ G ++ +
Sbjct: 284 TISDHGAIKEL--IKHGVASDPQEAVRIALKSGVDMSMSDEYYSKYLPALVKSGDVTMAE 341
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
I++A + L V+ +G+F+ PY HLGPK+ D H+ A + AR+ IV
Sbjct: 342 IDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPQDTNAESRLHRAEARDVARKSIV 396
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI-------- 445
LLKN+ +LPL + T+A+IGP +D ++G++ AG+A + LQG+
Sbjct: 397 LLKNRLETLPLK--KSGTIALIGPLADSQRDIMGSWSAAGVAKQSVSLLQGMRNATEGKA 454
Query: 446 ----GRYARTIHQQGCKDV------ACADD-----QLFGAAIDASRQADATILVMGLDQS 490
+ A +G +D A + D Q+ A+ ++QAD + +G Q
Sbjct: 455 TLLYEKGANVTDNKGIQDFLNLYEQAVSVDTRSPQQMIDDAVAKAKQADVVVAAVGEAQG 514
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIA-AIIWA 549
+ EA R+ L++P QQ+L++ + K P ++VLM+G P+ + N+ R+A A++
Sbjct: 515 MAHEASSRSDLVIPPSQQKLLAALKATGK-PLVIVLMNGRPLSIV---NEDRMADAVLET 570
Query: 550 GYPGQAGGTAIADILFGTSNPGGKLPMTWYPQE------YITNLPMTEMAMRPSQSKRYP 603
+ G GG AIAD+LFG NP GKLP++ +P+ Y +LP T ++ +Y
Sbjct: 571 WFSGTEGGNAIADVLFGDYNPSGKLPVS-FPRSVGQIPIYYNHLP-TGRPYNFAKPNKYT 628
Query: 604 GRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHA 663
Y GP +YPFG+G+SYT F TV +P ++ P R+GSI A+++
Sbjct: 629 SHYYDAVNGP-LYPFGYGLSYTTF--TV--SPVKMSSPTMPRNGSIKASVT--------- 674
Query: 664 KCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVG 722
V N G +DGA + ++ P + P ++L F+++ + AG Q V
Sbjct: 675 ------------VTNSGKRDGATVVQMYLNDPVASISRPVQELRGFKRIMLKAGESQTVT 722
Query: 723 INIHV 727
I V
Sbjct: 723 FKIDV 727
>gi|372277745|ref|ZP_09513781.1| beta-D-glucoside glucohydrolase [Pantoea sp. SL1_M5]
Length = 765
Score = 256 bits (655), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 205/665 (30%), Positives = 329/665 (49%), Gaps = 103/665 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASS++ + +GRV + Y GL W+P V++ R+PRWGRG
Sbjct: 118 TIFPIPLALASSWDLDAVKEVGRV------SAYEAADDGLNMTWAPMVDVSREPRWGRGS 171
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + S V+ +QG S DR V S KHF Y ++N VD
Sbjct: 172 EGFGEDTYLTSEMGRSMVQSMQGKSAADRYSVMTSVKHFALYGAVEGGRDYNTVD----- 226
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + P++ + G VM + N +NGVP AD +LK +R +W+ G
Sbjct: 227 --MSPQRMFQDYLPPYKASLDAGS-GGVMVALNSINGVPATADSWLLKDLLRDKWKFKGI 283
Query: 285 IVSDCDSVGVYYDTQH-FTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
+SD ++ +H S P+EA A+++G+D+ + + + V+ G ++ +
Sbjct: 284 TISDHGAIKEL--IKHGVASDPQEAVRIALKSGVDMSMSDEYYSKYLPALVKSGDVTMAE 341
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
I++A + L V+ +G+F+ PY HLGPK+ D H+ A + AR+ IV
Sbjct: 342 IDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPQDTNAESRLHRAEARDVARKSIV 396
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI-------- 445
LLKN+ +LPL + T+A+IGP +D ++G++ AG+A + LQG+
Sbjct: 397 LLKNRLETLPLK--KSGTIALIGPLADSQRDIMGSWSAAGVAKQSVSLLQGMRNATEGKA 454
Query: 446 ----GRYARTIHQQGCKDV------ACADD-----QLFGAAIDASRQADATILVMGLDQS 490
+ A +G +D A + D QL A+ ++QAD + +G Q
Sbjct: 455 TLLYEKGANVTDNKGIQDFLNLYEQAVSVDTRSPQQLIDDAVAKAKQADVVVAAVGEAQG 514
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIA-AIIWA 549
+ EA R+ L++P QQ+L++ + K P ++VLM+G P+ V N+ R+A A++
Sbjct: 515 MAHEASSRSDLVIPPSQQKLLAALKATGK-PLVIVLMNGRPLSVV---NEDRMADAMLET 570
Query: 550 GYPGQAGGTAIADILFGTSNPGGKLPMTWYPQE------YITNLPMTEMAMRPSQSKRYP 603
+ G GG AIAD+LFG NP GKLP++ +P+ Y +LP T ++ +Y
Sbjct: 571 WFSGTEGGNAIADVLFGDYNPSGKLPVS-FPRSVGQIPIYYNHLP-TGRPYNFAKPNKYT 628
Query: 604 GRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHA 663
Y GP +YPFG+G+SYT F TV +P ++ P R GSI A+++
Sbjct: 629 SHYYDAANGP-LYPFGYGLSYTTF--TV--SPVKMSSPTMPRDGSIEASVT--------- 674
Query: 664 KCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVG 722
V N G +DGA + ++ P + P ++L F+++ + AG Q V
Sbjct: 675 ------------VTNSGKRDGATVVQMYLNDPVASISRPVQELRGFKRIMLKAGESQTVT 722
Query: 723 INIHV 727
I V
Sbjct: 723 FKIDV 727
>gi|429116223|ref|ZP_19177141.1| Periplasmic beta-glucosidase [Cronobacter sakazakii 701]
gi|426319352|emb|CCK03254.1| Periplasmic beta-glucosidase [Cronobacter sakazakii 701]
Length = 625
Score = 256 bits (654), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 201/662 (30%), Positives = 322/662 (48%), Gaps = 106/662 (16%)
Query: 143 YNGGTAGLTY-WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKV 200
Y GL W+P V++ RDPRWGRG E GED L+ + V +QG S DR V
Sbjct: 4 YEAADDGLNMTWAPMVDVTRDPRWGRGSEGFGEDTYLTSMMGKTMVESMQGKSPADRYSV 63
Query: 201 AASCKHFTAYDL----DNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSY 256
S KHF AY +N VD +S Q + + + P++ + G VM +
Sbjct: 64 MTSVKHFAAYGAVEGGKEYNTVD-------MSSQRLFNDYMPPYKEALDAGS-GGVMIAL 115
Query: 257 NQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFT-STPEEAAADAIRA 315
N +NG P +D +LK +RG+W G +SD ++ +H T S PE+A AI++
Sbjct: 116 NSLNGTPAASDGWLLKDLLRGDWGFKGITISDHGAIKEL--IKHGTASDPEDAVRVAIKS 173
Query: 316 GLDLDCG-PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGP 374
G+D+ + + + ++ G ++ ++++A + L V+ +G+F+ PY HLGP
Sbjct: 174 GVDMSMADEYYSKYLPNLIKSGKVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGP 228
Query: 375 KDVCTPD-------HQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTM 427
KD D H++ A + AR+ +VLLKN+ +LPL + T+AV+GP +D M
Sbjct: 229 KDSDPKDTNAESRLHRDDARKVARESLVLLKNRLETLPLK--KSGTIAVVGPLADSKRDM 286
Query: 428 IGNY--AGIACGYTTPLQGI----GRYARTIHQQGCKDVACADDQ--------------- 466
+G++ AG+A T LQG+ G A+ ++ +G DD+
Sbjct: 287 MGSWSAAGVADQSVTLLQGMKNVAGDKAKILYAKGAN---VTDDKGIVDFLNLYEPAVVV 343
Query: 467 -------LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASK 519
+ A++ ++Q+D + V+G Q + EA R L +P Q++L+S + K
Sbjct: 344 DKRTPKAMIDEAVNVAKQSDVVVAVVGEAQGMAHEASSRTDLTIPQSQRDLISALKATGK 403
Query: 520 GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWY 579
P +LVLM+G P+ A K D + AI+ + G GG AIAD+LFG NP GKLP++ +
Sbjct: 404 -PLVLVLMNGRPL--ALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-F 459
Query: 580 PQEYITNLPM------TEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVAN 633
P+ + +P+ T P + +Y R + GP +YPFG+G+SYT F +V++
Sbjct: 460 PRS-VGQIPVYYSHLNTGRPYNPEKPNKYTSRYFDEANGP-LYPFGYGLSYTTF--SVSD 515
Query: 634 APTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-S 692
++ P R G + A+++ V N G ++GA + ++
Sbjct: 516 VK--LSAPTMKRDGKVTASVT---------------------VTNTGKREGATAVQMYLQ 552
Query: 693 TPPAGHWAPHKQLVAFEKVHVPAGAQQRVG--INIHVCKYLSVVDRSGTRRIPLGEHNIH 750
A P KQL FEK+ + G Q V I+I K+ + R P G+ N+
Sbjct: 553 DVTASMSRPVKQLHGFEKITLKPGESQTVSFPIDIDALKFWN--QRMKYDAEP-GKFNVF 609
Query: 751 IG 752
IG
Sbjct: 610 IG 611
>gi|417342342|ref|ZP_12123176.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Baildon str. R6-199]
gi|357956757|gb|EHJ82061.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
Baildon str. R6-199]
Length = 765
Score = 256 bits (654), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 205/683 (30%), Positives = 324/683 (47%), Gaps = 117/683 (17%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN +GRV + Y GL W+P V++ RDPRWGR
Sbjct: 104 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 157
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V+ +QG S DR V S KHF AY +N VD
Sbjct: 158 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 212
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R EW G
Sbjct: 213 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 269
Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLD---------------CGPFLGLH 328
VSD ++ +H T+ PE+A A++AG+D+ + +
Sbjct: 270 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSIMSMADEYYRVDMSMADEYYSKY 327
Query: 329 TESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD------- 381
++ G ++ ++++A + L V+ +G+F+ PY HLGPK+ D
Sbjct: 328 LPGLIKSGTVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRL 382
Query: 382 HQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYT 439
H++ A E AR+ +VLLKN+ +LPL + T+AV+GP +D ++G++ AG+A
Sbjct: 383 HRKEAREVARESVVLLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSV 440
Query: 440 TPLQGI----GRYARTIHQQGCK---DVACAD----------------DQLFGAAIDASR 476
T L GI G A+ ++ +G D D + A+ A++
Sbjct: 441 TVLAGIQNAVGDGAKILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVLAAK 500
Query: 477 QADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAF 536
QAD + V+G Q + EA R + +P Q++L++ + K P +LVLM+G P+ A
Sbjct: 501 QADVVVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--AL 557
Query: 537 AKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------T 590
K D + AI+ + G GG AIAD+LFG NP GKLP++ +P+ + +P+ T
Sbjct: 558 VKEDQQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNT 615
Query: 591 EMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSIN 650
P + +Y R + GP +YPFG+G+SYT F TV++ ++ P R G +
Sbjct: 616 GRPYNPEKPNKYTSRYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVT 670
Query: 651 ATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFE 709
A+ V+V N G ++GA + ++ A P KQL FE
Sbjct: 671 AS---------------------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFE 709
Query: 710 KVHVPAGAQQRVG--INIHVCKY 730
K+ + G + V I+I K+
Sbjct: 710 KITLKPGESKTVSFPIDIEALKF 732
>gi|440759115|ref|ZP_20938268.1| Periplasmic beta-glucosidase [Pantoea agglomerans 299R]
gi|436427131|gb|ELP24815.1| Periplasmic beta-glucosidase [Pantoea agglomerans 299R]
Length = 737
Score = 256 bits (654), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 203/665 (30%), Positives = 330/665 (49%), Gaps = 103/665 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASS++ + +GRV + Y GL W+P V++ R+PRWGRG
Sbjct: 90 TIFPIPLALASSWDLDAVKEVGRV------SAYEAADDGLNMTWAPMVDVSREPRWGRGS 143
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + S V+ +QG S DR V S KHF Y ++N VD
Sbjct: 144 EGFGEDTYLTSEMGRSMVQSMQGKSAADRYSVMTSVKHFALYGAVEGGRDYNTVD----- 198
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + P++ + G VM + N +NGVP AD +LK +R +W+ G
Sbjct: 199 --MSPQRMFQDYLPPYKASLDAGS-GGVMVALNSINGVPATADSWLLKDLLRDKWKFKGI 255
Query: 285 IVSDCDSVGVYYDTQH-FTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
+SD ++ +H S P+EA A+++G+D+ + + + V+ G ++ +
Sbjct: 256 TISDHGAIKEL--IKHGVASDPQEAVRIALKSGVDMSMSDEYYSKYLPALVKSGDVTMAE 313
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
I++A + L V+ +G+F+ PY HLGPK+ D H+ A + AR+ IV
Sbjct: 314 IDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPQDTNAESRLHRAEARDVARKSIV 368
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI-------- 445
LLKN+ +LPL + T+A+IGP +D ++G++ AG+A + LQG+
Sbjct: 369 LLKNRLETLPLK--KSGTIALIGPLADSQRDIMGSWSAAGVAKQSVSLLQGMRNATEGKA 426
Query: 446 ----GRYARTIHQQGCKDV------ACADD-----QLFGAAIDASRQADATILVMGLDQS 490
+ A +G +D A + D Q+ A+ ++QAD + +G Q
Sbjct: 427 TLLYEKGANVTDNKGIQDFLNLYEQAVSVDTRSPQQMIDDAVAKAKQADVVVAAVGEAQG 486
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIA-AIIWA 549
+ EA R+ L++P QQ+L++ + K P ++VLM+G P+ + N+ R+A A++
Sbjct: 487 MAHEASSRSDLVIPPSQQKLLAALKATGK-PLVIVLMNGRPLSIV---NEDRMADAVLET 542
Query: 550 GYPGQAGGTAIADILFGTSNPGGKLPMTWYPQE------YITNLPMTEMAMRPSQSKRYP 603
+ G GG AIAD+LFG NP GKLP++ +P+ Y +LP T ++ +Y
Sbjct: 543 WFSGTEGGNAIADVLFGDYNPSGKLPVS-FPRSVGQIPIYYNHLP-TGRPYNFAKPNKYT 600
Query: 604 GRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHA 663
Y GP +YPFG+G+SYT F TV +P ++ P R+GSI A+++
Sbjct: 601 SHYYDAVNGP-LYPFGYGLSYTTF--TV--SPVKMSSPTMPRNGSIEASVT--------- 646
Query: 664 KCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVG 722
V N G +DGA + ++ P + P ++L F+++ + AG Q V
Sbjct: 647 ------------VTNSGKRDGATVVQMYLNDPVASISRPVQELRGFKRIMLKAGESQTVT 694
Query: 723 INIHV 727
I V
Sbjct: 695 FKIDV 699
>gi|423222018|ref|ZP_17208488.1| hypothetical protein HMPREF1062_00674 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392644204|gb|EIY37946.1| hypothetical protein HMPREF1062_00674 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 942
Score = 256 bits (654), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 201/680 (29%), Positives = 320/680 (47%), Gaps = 74/680 (10%)
Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLT-YWSPNVNIFRDPRWGRG 168
AT+FP + ++N L +G + EAR + G T ++P +++ RD RWGR
Sbjct: 173 ATNFPTQLGLGHTWNRELIRQVGLITGREARML------GYTNVYAPILDVGRDQRWGRY 226
Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVS 228
+E GE P L + VRG+Q S +VAA+ KHF AY + + ++S
Sbjct: 227 EEVYGESPYLVAELGIEMVRGMQHSH----QVAATGKHFVAYSNNKGAREGMARVDPQMS 282
Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
+++E PF+ + E + VM SYN +GVP L +RGE GY+VSD
Sbjct: 283 PREVEMIHVYPFKRVIKEAGLLGVMSSYNDYDGVPIQGSYYWLTTRLRGEMGFRGYVVSD 342
Query: 289 CDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLG-----LHTESAVQRGLLSEIDI 343
D+V Y +EA ++ AGL++ C F L V+ G LSE I
Sbjct: 343 SDAVEYLYTKHSTAKDMKEAVRQSVEAGLNVRC-TFRSPDSYVLPLRELVKEGGLSEEVI 401
Query: 344 NNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
N+ + + L V+ +G+FD + G K+V +++ LAL+A+R+ +VLLKN+
Sbjct: 402 NDRVRDILRVKFLVGLFDTPYQTDLAG--ADKEVEKAENESLALQASRESLVLLKNENNV 459
Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGR----YARTIHQQGCK- 458
LPL + +AV GPN+D + +Y +A TT L+GI + A ++ +GC
Sbjct: 460 LPLDINNVKKIAVCGPNADEEGYALTHYGPLAVEVTTVLEGIRQKAEGKAEVLYTKGCDL 519
Query: 459 -DVACADDQL------------FGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPG 505
D + +L A++ +RQAD ++V+G Q E R+ L LPG
Sbjct: 520 VDANWPESELIDYPMTDSEQAEIDKAVENARQADVAVVVLGGGQRTCGENKSRSSLDLPG 579
Query: 506 RQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILF 565
RQ +L+ V K P +LVL++G P+ + +A D + I+ A YPG GGTA+AD+LF
Sbjct: 580 RQLKLLQAVQATGK-PVVLVLINGRPLSINWA--DKFVPVILEAWYPGSKGGTAVADVLF 636
Query: 566 GTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQS---KRYPGRTYRFYK-GPVVYPFGHG 621
G NPGGKL +T +P+ + +P +PS + PG + +Y FG+G
Sbjct: 637 GDYNPGGKLTVT-FPKS-VGQIPFN-FPCKPSSQIDGGKNPGLDGNMSRVNGALYSFGYG 693
Query: 622 MSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGS 681
+SYT F ++ + IS K I V+ V N G
Sbjct: 694 LSYTTFEYS-------------------DIEISPKVITPNQKAT------VRCKVTNTGK 728
Query: 682 KDGAHTLLVFSTPPAGHWAPH-KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTR 740
+ G + ++ + K L FE++H+ G + V + K L ++D+
Sbjct: 729 RAGDEVVQLYVRDILSSVTTYEKNLAGFERIHLQPGETKEVVFTLD-RKQLELLDKHMEW 787
Query: 741 RIPLGEHNIHIGGTKHSVSL 760
+ G+ +I +G + + L
Sbjct: 788 VVEPGDFSIMVGASSEDIRL 807
>gi|251795943|ref|YP_003010674.1| beta-glucosidase [Paenibacillus sp. JDR-2]
gi|247543569|gb|ACT00588.1| Beta-glucosidase [Paenibacillus sp. JDR-2]
Length = 940
Score = 256 bits (654), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 155/424 (36%), Positives = 233/424 (54%), Gaps = 25/424 (5%)
Query: 36 TTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVS 95
+ + PF LP+ QRVNDL+ R +L+EK+ L+ + LG+K Y+ +EA HG++
Sbjct: 5 SEQKYPFQNPDLPLEQRVNDLVNRFTLEEKINLMCQYQDEITHLGVKAYKHGTEAAHGMA 64
Query: 96 NVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY--NGGTAGLTYW 153
+G AT+FPQ I A +++ L + IG V+ DEAR Y N GLT W
Sbjct: 65 WLGE----------ATTFPQPIGLACTWDKDLMKKIGSVIGDEARGFYSQNPTHNGLTLW 114
Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLD 213
+P V++ RDPRWGR +E GEDP L+GK +A+ +G+QG +K AS KHF +
Sbjct: 115 APTVDMERDPRWGRTEEAYGEDPHLTGKLSAALTQGIQGDHPFYVKAVASLKHF----IG 170
Query: 214 NWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKR 273
N N ++R + + ++ + + F + EG S+M +YN VNGVP +P++
Sbjct: 171 NNNEINRGECSVSLDPRNKNEYYLKAFEIPFKEGGALSMMTAYNSVNGVPANINPDV-NN 229
Query: 274 TIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLD--LDCGPFLGLHTES 331
++ +W +NG++VSD V + H+ T +EA A I+AG+D D P
Sbjct: 230 IVKRDWGMNGFVVSDAGDVLGTVNDHHYVDTYKEAVALTIKAGVDSITDDHPISKQAIRD 289
Query: 332 AVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAA 390
A+ G+L+E D++ AL NT V+ RLG FD P+ + PY + + P+H ELAL+A
Sbjct: 290 ALAEGMLTENDLDIALRNTFRVRFRLGEFD--PADRNPYAAIDESVIMKPEHAELALDAV 347
Query: 391 RQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYAR 450
R+ IVLLKN G LPLS + AVIGP + + Y+G+ +P +GI +
Sbjct: 348 RKSIVLLKNDG-ILPLSAEKLSKAAVIGPLAGIVYRDW--YSGLMPYSISPFEGIRKKMN 404
Query: 451 TIHQ 454
HQ
Sbjct: 405 GAHQ 408
Score = 83.6 bits (205), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 68/258 (26%), Positives = 117/258 (45%), Gaps = 43/258 (16%)
Query: 471 AIDASRQADATILVMGLDQSIEA-EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSG 529
AI A+R + ++V+G I E +DR L L Q++L+ +V + P +V++ G
Sbjct: 552 AIAAARNNEVAVVVVGNHPLINGKETIDRPDLTLAAYQEKLIREVYAVN--PNTVVVVVG 609
Query: 530 G-PIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLP 588
P + + + + I AI++ + GQ G A+AD+LFG NP G++ MTWY
Sbjct: 610 SYPFAMPWVQEN--IPAIVYLSHAGQELGHALADVLFGDYNPAGRVNMTWYESADQLGEF 667
Query: 589 MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGS 648
M ++ RTY+++ G +YPFGHG+SY+ F ++ +
Sbjct: 668 MDYDIIKSE-------RTYQYFAGKPLYPFGHGLSYSRF-----------------QYDA 703
Query: 649 INATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLVA 707
+ SG +L V V N DG + +++ P +QL +
Sbjct: 704 LAQEQSGN------------SLTVSFRVTNTSDVDGEEVVQLYTRALESRVKRPLRQLQS 751
Query: 708 FEKVHVPAGAQQRVGINI 725
FE++ + AG + V +
Sbjct: 752 FERIALAAGESREVSFKL 769
>gi|198274480|ref|ZP_03207012.1| hypothetical protein BACPLE_00628 [Bacteroides plebeius DSM 17135]
gi|198272682|gb|EDY96951.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
plebeius DSM 17135]
Length = 912
Score = 256 bits (654), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 206/715 (28%), Positives = 336/715 (46%), Gaps = 92/715 (12%)
Query: 78 RLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSD 137
RLGI ++ +E + GV N AT+FP + ++N L IG +
Sbjct: 118 RLGIPA-DFTNEGIRGVENYI-----------ATNFPTQLALGHTWNRELIRQIGYITGR 165
Query: 138 EARAMYNGGTAGLT-YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD 196
EAR + G T ++P +++ RD RWGR +E GE P L + + +GLQ
Sbjct: 166 EARLL------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIAMGKGLQTD--- 216
Query: 197 RLKVAASCKHFTAYDLDNWNGVDRF-HFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCS 255
++VA++ KHF AY +N + F + ++S +++E+ PF + E + VM S
Sbjct: 217 -MQVASTAKHFIAYS-NNKGAREGFARVDPQMSWREVENIHAYPFTRVIQEAGILGVMSS 274
Query: 256 YNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRA 315
YN +G P + L + +RG GY+VSD D+V Y +EA ++ A
Sbjct: 275 YNDYDGFPIQSSYYWLTQRLRGTMGFRGYVVSDSDAVEYLYSKHKTAKDMKEAVRQSVEA 334
Query: 316 GLDLDCG----PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGH 371
GL++ C L +Q G LS I+N + + L V+ G+FD PY
Sbjct: 335 GLNVRCTFRSPESYVLPLRELIQEGGLSMETIDNRVRDILRVKFLTGLFD-----TPYQT 389
Query: 372 ---LGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMI 428
L K+V + HQ++AL+A+R+G+VLLKN LPL + + +AV GPN+D +
Sbjct: 390 DLALADKEVNSEAHQQVALQASREGLVLLKNANNLLPLDKSQIKRIAVCGPNADEASFAL 449
Query: 429 GNYAGIACGYTTPLQGIGRY----ARTIHQQGCKDV--------------ACADDQLFGA 470
+Y +A TT L+GI + + + +GC V +
Sbjct: 450 THYGPVAVEVTTVLEGIKQQVKEGTKVTYTKGCDLVDANWPESEIISYPLTAEEKTEIQK 509
Query: 471 AIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGG 530
A+D +++D ++V+G E R L LPG QQ+L+ + +A+ P +LVL++G
Sbjct: 510 AVDNVKESDVAVVVLGGGIRTCGENKSRTSLDLPGHQQQLLEAI-VATGKPVVLVLINGR 568
Query: 531 PIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMT 590
P+ + +A D + AI+ A YPG GGTAIA+ LFG NPGGKL +T +P+ + +P
Sbjct: 569 PLSINWA--DKFVPAILEAWYPGSQGGTAIAEALFGDYNPGGKLTVT-FPKT-VGQIPFN 624
Query: 591 EMAMRPSQSK--RYPGR--TYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
A SQ + PG GP +YPFG+G+SYT F ++
Sbjct: 625 FPAKPASQVDGGQTPGMKGNQSRINGP-LYPFGYGLSYTTFEYS---------------- 667
Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPH-KQL 705
N +S I ++ + V +KN G++ G + +++ + K L
Sbjct: 668 ---NLQLSSPVI------TDKEPVTVTCKIKNTGTRSGDEVVQLYTRDVISSVTTYEKNL 718
Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
FE+VH+ G ++V + + + ++++ + G I IG + + L
Sbjct: 719 RGFERVHLEPGETKKVSFQL-LPRDFQLLNKDNHWVVEPGMFQIMIGASSEDIRL 772
>gi|189467715|ref|ZP_03016500.1| hypothetical protein BACINT_04107 [Bacteroides intestinalis DSM
17393]
gi|189435979|gb|EDV04964.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 943
Score = 256 bits (653), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 199/680 (29%), Positives = 324/680 (47%), Gaps = 74/680 (10%)
Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLT-YWSPNVNIFRDPRWGRG 168
AT+FP + ++N L +G + EAR + G T ++P +++ RD RWGR
Sbjct: 173 ATNFPTQLGLGHTWNRELIRQVGLITGREARIL------GYTNVYAPILDVGRDQRWGRY 226
Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVS 228
+E GE P L + VRG+Q + +VAA+ KHF AY + + ++S
Sbjct: 227 EEVYGESPYLVAELGIEMVRGMQHNH----QVAATGKHFVAYSNNKGAREGMARVDPQMS 282
Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
+++E PF+ + E + VM SYN +GVP L +RGE GY+VSD
Sbjct: 283 PREVEMIHVYPFKRVIKEAGLLGVMSSYNDYDGVPIQGSYYWLTTRLRGEMGFRGYVVSD 342
Query: 289 CDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG----PFLGLHTESAVQRGLLSEIDIN 344
D+V Y +EA ++ AGL++ C L V+ G LSE IN
Sbjct: 343 SDAVEYLYTKHSTAKDMKEAVRQSVEAGLNVRCTFRSPDSYVLPLRELVKEGGLSEEVIN 402
Query: 345 NALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSL 404
+ + + L V+ +G+FD + G ++V +++ LAL+A+R+ +VLLKN+ L
Sbjct: 403 DRVRDILRVKFLIGLFDAPYQTDLAG--ADREVEKAENESLALQASRESLVLLKNENNVL 460
Query: 405 PLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGR----YARTIHQQGCK-- 458
PL + +AV GPN+D + +Y +A TT L+GI + A ++ +GC
Sbjct: 461 PLDINNVKKIAVCGPNADEEGYALTHYGPLAVEVTTVLEGIRQKSEGKAEVLYTKGCDLV 520
Query: 459 ----------DVACADDQL--FGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGR 506
D D++ A++ +RQAD ++V+G Q E R+ L LPGR
Sbjct: 521 DANWPESELIDYPMTDNEQAEIDKAVENARQADVAVVVLGGGQRTCGENKSRSSLDLPGR 580
Query: 507 QQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFG 566
Q +L+ V K P +LVL++G P+ + +A D + AI+ A YPG GGTA+AD+LFG
Sbjct: 581 QLKLLQAVQATGK-PVVLVLINGRPLSINWA--DKFVPAILEAWYPGSKGGTAVADVLFG 637
Query: 567 TSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQS---KRYPGRTYRFYK-GPVVYPFGHGM 622
NPGGK+ +T +P+ + +P +PS + PG + +Y FG+G+
Sbjct: 638 DYNPGGKMTVT-FPKS-VGQIPFN-FPCKPSSQIDGGKNPGLDGNMSRVNGALYSFGYGL 694
Query: 623 SYTNFVHT-VANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGS 681
SYT F ++ + +P V+ T + KA V+ V N G
Sbjct: 695 SYTTFEYSGIEISPKVI-------------TPNQKAT-------------VRCKVTNTGK 728
Query: 682 KDGAHTLLVFSTPPAGHWAPH-KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTR 740
+ G + ++ + K L FE++H+ G + V + K L ++D+
Sbjct: 729 RAGDEVVQLYVRDILSSVTTYEKNLAGFERIHLQPGETKEVVFTLD-RKQLELLDKHMEW 787
Query: 741 RIPLGEHNIHIGGTKHSVSL 760
+ G+ +I +G + + L
Sbjct: 788 VVEPGDFSIMVGASSEDIRL 807
>gi|332881173|ref|ZP_08448832.1| glycosyl hydrolase family 3 protein, partial [Capnocytophaga sp.
oral taxon 329 str. F0087]
gi|332680887|gb|EGJ53825.1| glycosyl hydrolase family 3 protein [Capnocytophaga sp. oral taxon
329 str. F0087]
Length = 675
Score = 256 bits (653), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 157/432 (36%), Positives = 229/432 (53%), Gaps = 30/432 (6%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
P+ L +R +DL+ RL+L+EK+ L+ + + V RLGIK Y WWSEALHGV+ G
Sbjct: 24 PYKNPDLSPQERADDLLKRLTLKEKISLMQNQSPGVERLGIKPYNWWSEALHGVARNGL- 82
Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAM---------YNGGTAGLT 151
AT +P + AS F+ L E I VSDE RA Y G GLT
Sbjct: 83 ---------ATVYPITMGMASVFDDKLIEDIYVTVSDEGRAKFHDARRHGRYGRGNEGLT 133
Query: 152 YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAY 210
+W+PNVNIFRDPRWGRGQET GEDP L+ + + VRG+QG +D K A KH+ +
Sbjct: 134 FWNPNVNIFRDPRWGRGQETWGEDPYLTTRMGVAVVRGMQGPADAKYDKTHACAKHYAVH 193
Query: 211 DLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
R F+ + + +D+ +T+ F+ V E V VMC+Y + G P C
Sbjct: 194 SGPE---AKRHSFDVENLEPRDLWETYLPAFKALVQEADVKEVMCAYQRFEGEPCCGSNR 250
Query: 270 ILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEA--AADAIRAGLDLDCGPFLGL 327
+L + +R EW +VSDC ++ ++ + P+ A +A A+ G DL+CG
Sbjct: 251 LLTQILRDEWGYKHLVVSDCGAISDFFYQGRHETHPDAATSSASAVINGTDLECGVEYA- 309
Query: 328 HTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELAL 387
H + AV+RGL++E I+ +L L + LG D + + P+ + V H+ +AL
Sbjct: 310 HLDEAVERGLITEHRIDTSLRRLLEARFALGEMDDD-ALVPWSRISIDTVDCGTHRRMAL 368
Query: 388 EAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI-G 446
+ R+ +VLL N G LPL +AV+GPN+ +V GNY G+ T L+GI G
Sbjct: 369 DVTRKSMVLLHNNG-ILPLDKGDAGKIAVMGPNAVDSVMQWGNYKGVPAHTYTILEGIRG 427
Query: 447 RYARTIHQQGCK 458
+++GC+
Sbjct: 428 AIGNVPYEKGCE 439
>gi|365122063|ref|ZP_09338970.1| hypothetical protein HMPREF1033_02316 [Tannerella sp.
6_1_58FAA_CT1]
gi|363643257|gb|EHL82578.1| hypothetical protein HMPREF1033_02316 [Tannerella sp.
6_1_58FAA_CT1]
Length = 819
Score = 256 bits (653), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 202/712 (28%), Positives = 320/712 (44%), Gaps = 81/712 (11%)
Query: 78 RLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSD 137
RLGI ++ +E +HG+++ AT P I S++NA L G +
Sbjct: 151 RLGIP-VDFSNETIHGLNHTK-----------ATPLPAPIGIGSTWNAPLVYKAGSIAGK 198
Query: 138 EARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR 197
EA+A+ G T ++P +++ RDPRWGR E GEDP L V+G+Q
Sbjct: 199 EAKAL--GYT---NIYAPILDLARDPRWGRVLECYGEDPFLVATLGTQMVKGIQEQG--- 250
Query: 198 LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYN 257
VAA+ KHF Y + + V+ +++ PF+ + + VM SYN
Sbjct: 251 --VAATLKHFAVYSVPKGGRDGSVRTDPHVAPREMHQMHLYPFKKVIQDAHPMGVMSSYN 308
Query: 258 QVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGL 317
+GVP A L + +R E+ +GY+VSD D+V Y+ H T EEA + AGL
Sbjct: 309 DWDGVPVTASYYFLTQLLRQEFGFDGYVVSDSDAVEYVYNKHHVAETYEEAVRMVLEAGL 368
Query: 318 DLDCG----PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLG 373
++ L V+ G LS I+ + + L V+ RLG+FD + P
Sbjct: 369 NVRTTFAAPDIFILPARKLVKEGRLSMKVIDERVADVLRVKFRLGLFDQPFVADPKA--A 426
Query: 374 PKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG 433
K V +++ L+ RQ +VLLKN+ LPL + + + GP + M+ Y
Sbjct: 427 DKIVGADKNKDFVLDIQRQSLVLLKNENNLLPLDKNKLSRILITGPLAKEENYMVSRYGP 486
Query: 434 IACGYTTPLQGIGRY----ARTIHQQGCK--------------DVACADDQLFGAAIDAS 475
T +GI Y + GCK + + Q A++ +
Sbjct: 487 QELENITVYEGIKNYLGNKVAVDYALGCKVKDAKWPESEIIHSPLTTEEQQEIQNAVEKA 546
Query: 476 RQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
+ +D I V+G D+ E+ R+GL LPGRQQ+L+ + K P +LVL++G P+ +
Sbjct: 547 KLSDIVIAVLGEDEESTGESKSRSGLDLPGRQQQLLEALYATGK-PVVLVLINGQPLTIN 605
Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQ---EYITNLPMTEM 592
+A D I AI+ A +PGQ GGTAIA+ LFG NPGGKLP+T +P+ + N P +
Sbjct: 606 WA--DRYIPAILEAWFPGQMGGTAIAETLFGDYNPGGKLPVT-FPKTLGQIELNFPF-KP 661
Query: 593 AMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINAT 652
A + Q + P + +YPFG G+SYT F ++ V+ G G I
Sbjct: 662 ASQSKQPEAGPNGYGKTRVNGALYPFGFGLSYTTFEYSNLK----VSPERQGPKGDIQ-- 715
Query: 653 ISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLV-AFEKV 711
V D+ N G + G + ++ ++ L+ FE+V
Sbjct: 716 -------------------VSFDITNTGKRAGDEIVQLYVKDKVSSVISYESLLRGFERV 756
Query: 712 HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAA 763
+ G + + +H + L ++D + + GE + IG + + L +
Sbjct: 757 SLQPGETKNIQFTLH-PEDLEILDINMNWNVEPGEFEVRIGASSEDIKLKKS 807
>gi|357047866|ref|ZP_09109459.1| glycosyl hydrolase family 3 protein, partial [Paraprevotella clara
YIT 11840]
gi|355529205|gb|EHG98644.1| glycosyl hydrolase family 3 protein, partial [Paraprevotella clara
YIT 11840]
Length = 676
Score = 256 bits (653), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 157/432 (36%), Positives = 229/432 (53%), Gaps = 30/432 (6%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
P+ L +R +DL+ RL+L+EK+ L+ + + V RLGIK Y WWSEALHGV+ G
Sbjct: 24 PYKNPDLSPQERADDLLKRLTLKEKISLMQNQSPGVERLGIKPYNWWSEALHGVARNGL- 82
Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAM---------YNGGTAGLT 151
AT +P + AS F+ L E I VSDE RA Y G GLT
Sbjct: 83 ---------ATVYPITMGMASVFDDKLIEDIYVTVSDEGRAKFHDARRHGRYGRGNEGLT 133
Query: 152 YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAY 210
+W+PNVNIFRDPRWGRGQET GEDP L+ + + VRG+QG +D K A KH+ +
Sbjct: 134 FWNPNVNIFRDPRWGRGQETWGEDPYLTTRMGVAVVRGMQGPADAKYDKTHACAKHYAVH 193
Query: 211 DLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
R F+ + + +D+ +T+ F+ V E V VMC+Y + G P C
Sbjct: 194 SGPE---AKRHSFDVENLEPRDLWETYLPAFKALVQEADVKEVMCAYQRFEGEPCCGSNR 250
Query: 270 ILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEA--AADAIRAGLDLDCGPFLGL 327
+L + +R EW +VSDC ++ ++ + P+ A +A A+ G DL+CG
Sbjct: 251 LLTQILRDEWGYKHLVVSDCGAISDFFYQGRHETHPDAATSSASAVINGTDLECGVEYA- 309
Query: 328 HTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELAL 387
H + AV+RGL++E I+ +L L + LG D + + P+ + V H+ +AL
Sbjct: 310 HLDEAVERGLITEHRIDTSLRRLLEARFALGEMDDD-ALVPWSRISIDTVDCGTHRRMAL 368
Query: 388 EAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI-G 446
+ R+ +VLL N G LPL +AV+GPN+ +V GNY G+ T L+GI G
Sbjct: 369 DVTRKSMVLLHNNG-ILPLDKGDAGKIAVMGPNAVDSVMQWGNYKGVPAHTYTILEGIRG 427
Query: 447 RYARTIHQQGCK 458
+++GC+
Sbjct: 428 AIGNVPYEKGCE 439
>gi|393784338|ref|ZP_10372503.1| hypothetical protein HMPREF1071_03371 [Bacteroides salyersiae
CL02T12C01]
gi|392666114|gb|EIY59631.1| hypothetical protein HMPREF1071_03371 [Bacteroides salyersiae
CL02T12C01]
Length = 857
Score = 256 bits (653), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 221/786 (28%), Positives = 358/786 (45%), Gaps = 122/786 (15%)
Query: 38 RTLPFCQVSLPIPQRVNDLIGRLSLQEKV---------KLLISGAAAVPRLG--IKGYEW 86
+ L + Q SLPI +RV+DL+GR++L+EK+ + + +LG G W
Sbjct: 24 QQLSYRQSSLPISERVDDLLGRMTLEEKIAQIRHIHSWNVFNGQDLDMEKLGKFTGGVSW 83
Query: 87 -WSEA--LHGVS---NVGPGTKF-------------------GGDFPGATSFPQVITTAS 121
+ E L GV+ N+ KF G G+T +PQ I S
Sbjct: 84 GFVEGFPLTGVNCKKNMQLIQKFMVENTRLGIPVFTVAESLHGSVHEGSTIYPQNIAMGS 143
Query: 122 SFNATL-WEAIGRVVSD-EARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLS 179
+F L + + D A+ M+ +P +++ RD RWGR +E+ GEDPVL
Sbjct: 144 TFRPELAYRKAAMITKDLHAQGMHQ-------VLAPCIDVVRDLRWGRVEESFGEDPVLC 196
Query: 180 GKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHF-NAKVSKQDIEDTFDV 238
G + + V+G + ++ KH+ + N + + + + +D+ + +
Sbjct: 197 GLFGIAEVKGYMDNG-----ISPMLKHYGPHG----NPLSGLNLASVECGLRDLHEVYLK 247
Query: 239 PFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDT 298
PF M + V +VM +YN N VP A +L +RG++ GY+ SD ++ +
Sbjct: 248 PFEMVIRNTPVLAVMSTYNSWNHVPNSASHYLLTEVLRGQFGFKGYVYSDWGAIEMLKTL 307
Query: 299 QHFTSTPEEAAADAIRAGLDLD----CGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQ 354
EEAA A AGLD++ C P L +Q+G L E +N ++ L +
Sbjct: 308 HRVAHNSEEAAMQAFTAGLDVEASSNCYPLLA----GLIQKGKLDEEVLNESVRRVLYAK 363
Query: 355 MRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTV 414
++G+F+ +P + Y H ++ + L+ E A + +VLLKN+ LPL+ + ++V
Sbjct: 364 FKMGLFE-DPYGEQYSH---SEMHGAESIRLSKEIADESVVLLKNENGLLPLNADKLKSV 419
Query: 415 AVIGPNSDVTVTMIGNYAGIACGY--TTPLQGIGRY----ARTIHQQGCKDVACADDQLF 468
AVIGPN+D G+Y TPL+GI R A + +GC D+ +
Sbjct: 420 AVIGPNAD--QVQFGDYTWSRNNKDGVTPLEGIRRLLGGKATVRYAKGC-DLVSLNAGGI 476
Query: 469 GAAIDASRQADATILVMG---------LDQSIEAEALDRAGLLLPGRQQELVSKVSMASK 519
A++A+R+++ IL G S E D L L G Q +L+ +V +
Sbjct: 477 KEAVEAARKSEVAILFCGSASAALARDYKSSTCGEGFDLNDLNLTGVQGQLIKEV-YETG 535
Query: 520 GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWY 579
P +LVL++G P +++ K I AI+ Y G+ G +IADILFG+ +P G+L + Y
Sbjct: 536 TPVVLVLVTGKPFAISWEKK--HIPAILTQWYAGEQAGNSIADILFGSISPSGRLTFS-Y 592
Query: 580 PQE------YITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVAN 633
PQ Y LP + + S PGR Y F ++ FGHG++YT+FV+ N
Sbjct: 593 PQTTGHLPVYYNYLPSDKGFYKNPGSYESPGRDYVFSSPDALWAFGHGLTYTSFVY--KN 650
Query: 634 APTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFST 693
T D H +N TI + VD+KN G ++G + ++
Sbjct: 651 LRT------DKEHYGLNDTIY-----------------IDVDIKNTGKREGKEVVQLYVN 687
Query: 694 PPAGHWA-PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
P KQL F+KV V AG + V + + V L +V+ R + GE + +G
Sbjct: 688 DKVSTVVTPVKQLRDFKKVDVEAGKTETVKLKVAVND-LYIVNAGNKRVVEPGEFELQVG 746
Query: 753 GTKHSV 758
++
Sbjct: 747 AASDNI 752
>gi|333377782|ref|ZP_08469515.1| hypothetical protein HMPREF9456_01110 [Dysgonomonas mossii DSM
22836]
gi|332883802|gb|EGK04082.1| hypothetical protein HMPREF9456_01110 [Dysgonomonas mossii DSM
22836]
Length = 727
Score = 255 bits (652), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 212/722 (29%), Positives = 341/722 (47%), Gaps = 110/722 (15%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIK--GYEWWSEALHGVSNVG 98
PF SL +R+++L+ +++ EK+ L S VPRLGI+ G+ SE LHG++ G
Sbjct: 25 PFQNTSLSDEKRLDNLLSIMTIDEKINAL-STNLGVPRLGIRNTGH---SEGLHGMALGG 80
Query: 99 PGTKFGG----------DFPGATSFPQVITTASSFNATLWEAIGRVVSDEAR-----AMY 143
PG +GG D T+FPQ +++ L + + + + E R Y
Sbjct: 81 PGN-WGGFKMVNYQRVPDVYPTTTFPQAYGLGETWDTELIKKVADIEATEIRYYTQNERY 139
Query: 144 NGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAAS 203
G GL +PN ++ RDPRWGR +E+ GEDP L + A ++++GLQG + K A+
Sbjct: 140 TKG--GLVMRAPNADLARDPRWGRTEESFGEDPFLVSEMAVAFIKGLQGENPRYWKSASL 197
Query: 204 CKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVP 263
KHF A N N R ++ + + + PFR + +G + M +YN N +P
Sbjct: 198 MKHFLA----NSNEDGRDSTSSNFDNRLFHEYYSYPFRKGIEKGGSQAFMAAYNSWNEIP 253
Query: 264 TCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGP 323
P ILK+ IR +W G I +D ++ + T E +A ++AG+ G
Sbjct: 254 MTIHP-ILKK-IRKDWNFKGIICTDGGALDLLIKAHKTFPTHTEGSAAIVKAGV----GQ 307
Query: 324 FLGL---HTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCT- 379
FL + A+++G+L+E +I+ A+ + ++LG+ DG+ + PY H+G D +
Sbjct: 308 FLDNFRPYIYQALEKGMLTEAEIDKAIRGNFYIALKLGLLDGDQTKLPYAHIGVTDTVSV 367
Query: 380 ---PDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIAC 436
+ Q+ + +VLLKN+ LPL+ + +AVIGP ++ ++ Y+G
Sbjct: 368 WRNKEIQDFVRLVTAKSVVLLKNEKKLLPLNKGNIKRIAVIGPRAN--EVLLDWYSGTPP 425
Query: 437 GYTTPLQGI----GRYARTIHQQGCKDVACADDQLFGAAIDASRQADATIL-----VMGL 487
+ LQGI G I++ + A A+++AD I+ V G
Sbjct: 426 YTVSILQGIKNAVGNNVEVIYESSNE---------IDKAYLAAQKADIAIVCVGNHVYGT 476
Query: 488 DQSIE--------AEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKN 539
D + EA+DR L L Q++LV V A+ T++VL+S P + +++
Sbjct: 477 DPKWKYSPVPSDGREAVDRKALSL--EQEDLVKIVHKANPN-TVMVLVSSFPFAINWSQE 533
Query: 540 DPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL-PMTEMAMRPSQ 598
+ I AI+ Q G +AD++FG NP G+ TW I +L PM + +R
Sbjct: 534 N--IPAILHITNNSQELGNGLADVIFGNYNPAGRTNQTWVKS--IADLPPMMDYDIR--- 586
Query: 599 SKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAI 658
GRTY + K +YPFG+G+SYTNF ++ S +A GK +
Sbjct: 587 ----NGRTYMYAKEKPLYPFGYGLSYTNFTYSDMAL-------------SSSALSKGKNL 629
Query: 659 KVTHAKCNRLTLGVQVDVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGA 717
KV+ V+VKN G DG L S P + P KQL F+++ + G
Sbjct: 630 KVS------------VNVKNTGDMDGEEVAQLYVSFPQSKVVRPIKQLKGFDRISIKKGE 677
Query: 718 QQ 719
+
Sbjct: 678 SK 679
>gi|329922637|ref|ZP_08278189.1| glycosyl hydrolase family 3 N-terminal domain protein
[Paenibacillus sp. HGF5]
gi|328941979|gb|EGG38262.1| glycosyl hydrolase family 3 N-terminal domain protein
[Paenibacillus sp. HGF5]
Length = 765
Score = 255 bits (652), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 208/686 (30%), Positives = 322/686 (46%), Gaps = 103/686 (15%)
Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
G T FP ++ S++N L+ + R V+ E R+ G +SP +++ RDPRWGR
Sbjct: 122 GGTVFPVPLSIGSTWNLDLYRDMCRAVALETRSQ-----GGAVTYSPVLDVVRDPRWGRT 176
Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDR-LKVAASCKHFTAYDLDNWNGVDRFHFNA-- 225
+E GEDP L +YA + V GLQG D VAA+ KHF Y G NA
Sbjct: 177 EECFGEDPYLISEYAVASVEGLQGESLDSPSSVAATLKHFVGY------GSSEGGRNAGP 230
Query: 226 -KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+ +++ + +PF+ V E AS+M +YN+++GVP + +L +R EW +G
Sbjct: 231 VHMGTRELMEVDMLPFKKAV-EAGAASIMPAYNEIDGVPCTVNTELLDGILRKEWGFDGM 289
Query: 285 IVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLD-CGPFLGLHTESAVQRGLLSEIDI 343
+++DC ++ + +AA AIRAG+DL+ G G H + AV+ L +
Sbjct: 290 VITDCGAIDMLASGHDTAEDGMDAAVQAIRAGIDLEMSGEMFGKHLQKAVESNKLEVSVL 349
Query: 344 NNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
+ A+ LT++ +LG+F+ P P + + H LA + A +GIVLLKN+ +
Sbjct: 350 DEAVRRVLTLKFKLGLFE-NPYVDP--QTAENVIGSGQHIGLARQLAAEGIVLLKNEAKA 406
Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIA--CGYTTPLQGI----GRYA-RTIHQQG 456
LPLS +AVIGPN+D +G+Y TT L GI G A R ++ G
Sbjct: 407 LPLSK-EGGVIAVIGPNADQGYNQLGDYTSPQPPAAVTTVLGGIRAKLGEEAQRVLYAPG 465
Query: 457 CKDVACADD--QLFGAAIDASRQADATILVMG---------------------LDQSIE- 492
C+ DD + F A+ + QAD ++V+G D ++
Sbjct: 466 CR---IKDDSREGFEFALSCAEQADTVVMVLGGSSARDFGEGTIDLRTGASKVTDDALSD 522
Query: 493 ---AEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWA 549
E +DR L L G Q +L ++ K I+V ++G PI + D AI+ A
Sbjct: 523 MDCGEGIDRMTLQLSGVQLDLAQEIHKLGK-RMIVVYINGRPIAEPWI--DEHADAILEA 579
Query: 550 GYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRF 609
YPGQ GG AIADILFG NP GKL M+ +++ LP+ KR G+ Y
Sbjct: 580 WYPGQEGGHAIADILFGDVNPSGKLTMSI--PKHVGQLPVY------YNGKRSRGKRYLE 631
Query: 610 YKGPVVYPFGHGMSYTNFVHT-VANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRL 668
YPFG+G+SYT F ++ + P V+ G A+
Sbjct: 632 EDSQPRYPFGYGLSYTEFSYSDIQMTPEVIGT-------------DGTAV---------- 668
Query: 669 TLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVGINI-- 725
V V+V N G +G+ + ++ + A + P ++L F+K+ + G +++V I
Sbjct: 669 ---VSVNVTNSGDCEGSEVVQLYVSDAASKYTRPARELKGFQKISLQPGERRKVEFTIGP 725
Query: 726 ----HVCKYLSVVDRSGTRRIPLGEH 747
++ + V G R+ LG H
Sbjct: 726 EQLQYIGQDYRQVVEPGLFRVMLGRH 751
>gi|404254492|ref|ZP_10958460.1| glycosyl hydrolase family protein [Sphingomonas sp. PAMC 26621]
Length = 898
Score = 255 bits (652), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 155/414 (37%), Positives = 226/414 (54%), Gaps = 41/414 (9%)
Query: 56 LIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQ 115
++ R++ +EK+ L++ A A+PR+ I Y WW+E+LHG P T+FP+
Sbjct: 51 IVARMTTEEKLPQLLNVAPAIPRIKIPAYNWWTESLHGALGAVP----------TTNFPE 100
Query: 116 VITTASSFNATLWEAIGRVVSDEARAMYNGGT---------AGLTYWSPNVNIFRDPRWG 166
I A+SF+A L + +S E RA++ G GL WSPN+NIFRDPRWG
Sbjct: 101 PIGLAASFDAPLVHQVAGAISTEVRALHTMGRQTGKLGRIGTGLDTWSPNINIFRDPRWG 160
Query: 167 RGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAK 226
RGQET GEDP L+ ++V G+QG + D V A+ KHF + R N
Sbjct: 161 RGQETYGEDPYLAAHMGVAFVTGMQGPNPDLPDVIATPKHFAVHSGPEST---RHGANVF 217
Query: 227 VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIV 286
VS D+EDT+ FR ++EG+ S+MC+YN+++G P CA+ +LK +RG W GY+V
Sbjct: 218 VSAHDLEDTYLPAFRAAIVEGRAGSIMCAYNRIDGQPACANDLLLKDHLRGAWGFTGYVV 277
Query: 287 SDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC-----GPFLGLHTE--SAVQRGLLS 339
SDCD+V D + A A A++AG D +C G GL +++RGL+S
Sbjct: 278 SDCDAVKDIADGHKYAPDQATAIAAAMKAGTDNECHTQTIGDMGGLPERFGDSLRRGLIS 337
Query: 340 EIDINNALVNTLTVQMRLGMFD---GEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
+ DI+ +LV + ++R G G P++ P + + +PDH LAL+AA + +VL
Sbjct: 338 QDDIDRSLVRLFSARLRNGDLPGIAGAPAAVPV-----RAILSPDHMALALQAAEKSLVL 392
Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL-QGIGRYA 449
LKN G LPL +AVIGP D T + GNY+ P+ +G+ R A
Sbjct: 393 LKNDG-VLPLRP--GARIAVIGPLGDATRVLRGNYSSTQSAPPVPIVEGLRRAA 443
Score = 125 bits (314), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 88/277 (31%), Positives = 141/277 (50%), Gaps = 48/277 (17%)
Query: 486 GLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
GL +EAE D+ L LP Q ++ + K P I+V M+G PI++A
Sbjct: 635 GLTSDLEAEETGTDIPGFQGGDKTSLDLPAEQMAMLERAKATGK-PVIVVAMNGSPINLA 693
Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
+AK++ AAI+ A YPGQ+GG A+A++L G ++PGG+LP+T+Y + +LP
Sbjct: 694 WAKDN--AAAIVEAWYPGQSGGLAVANVLTGKTDPGGRLPLTFY--RSVADLP------- 742
Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG 655
P + GRTYR++ G VYPFGHG+SYT+F +T PL + SG
Sbjct: 743 PFDNYAMTGRTYRYFTGKPVYPFGHGLSYTSFAYT----------PL------VLNPASG 786
Query: 656 KAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPA 715
A + L V DV N G + G + ++ + P AP L ++++ +
Sbjct: 787 GAEQ---------GLRVTTDVSNTGKRAGDEVVQLYLSFPDVPGAPRIALRGYQRITLAP 837
Query: 716 GAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
G ++ V ++ + LS V +G R + G++ + +G
Sbjct: 838 GERRSVTFDLS-PRDLSAVRANGVREVMQGQYRVSVG 873
>gi|317476310|ref|ZP_07935559.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
gi|316907336|gb|EFV29041.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
Length = 862
Score = 255 bits (651), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 160/409 (39%), Positives = 223/409 (54%), Gaps = 34/409 (8%)
Query: 66 VKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNA 125
V L++ + AV RLGIK Y WW+EALHGV+ G AT FPQ I A+SF+
Sbjct: 46 VALMMDNSQAVERLGIKPYNWWNEALHGVARAGL----------ATVFPQPIGMAASFSP 95
Query: 126 TLWEAIGRVVSDEARAMYNGGTA---------GLTYWSPNVNIFRDPRWGRGQETPGEDP 176
+ VSDEARA N A GLT W+P VNI+RDPRWGRG ET GEDP
Sbjct: 96 ESVYEVFSAVSDEARAK-NANYASQGSYARYQGLTMWTPTVNIYRDPRWGRGIETYGEDP 154
Query: 177 VLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAK-VSKQDIED 234
L+ + V+GLQG+ DG K+ A KHF + WN R FN + +S +D+ +
Sbjct: 155 YLTSRMGVMVVKGLQGTNDGKYDKLHACAKHFAVHSGPEWN---RHSFNVEDLSARDLYE 211
Query: 235 TFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGV 294
T+ PF V E KV VMC+YN G P C +L + +R +W +G I+SDC ++
Sbjct: 212 TYLPPFEALVKEAKVKEVMCAYNSFEGEPCCGSNRLLMQILRNDWGFDGIILSDCGAIAD 271
Query: 295 YYDTQHFTSTP--EEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLT 352
+Y+ + P E A+A A+ G DL+CG + AV+ G ++E DI+ A++ L
Sbjct: 272 FYNEHGHKAYPDAESASAAAVLNGTDLECGSSYKALVK-AVRAGQINEKDIDKAVMRLLE 330
Query: 353 VQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHR 412
+ LG D +P + + + V + +H LAL+ AR+ + LL+N G LPL
Sbjct: 331 ARFALGEMD-DPDNVSWTKIPFSVVASAEHDSLALDMARKSMTLLQNTGNILPLKR-GGL 388
Query: 413 TVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGC 457
TVAV+GPN++ +V GNY G+ T L GI G + I++QGC
Sbjct: 389 TVAVMGPNANDSVMQWGNYNGMPPHTVTILGGIRKALGADDKLIYEQGC 437
Score = 121 bits (304), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 91/306 (29%), Positives = 140/306 (45%), Gaps = 47/306 (15%)
Query: 470 AAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASK 519
A++ + AD + G+ +E E + DR + LP Q+EL+S + A K
Sbjct: 591 ASVARVKDADIVVFASGVSPVLEGEEMGVNLPGFKGGDRTDIELPAIQRELISALHRAGK 650
Query: 520 GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWY 579
ILV SG PI A AI+ A YPGQ GGTA+A++LFG NPGG+LP+T+Y
Sbjct: 651 -KIILVNCSGSPI--ALEPETKNCEAILQAWYPGQQGGTAVAEVLFGDYNPGGRLPVTFY 707
Query: 580 PQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
++ LP E GRTYR+ ++PFG+G+SYT F
Sbjct: 708 RN--MSQLPDFE-------DYNMTGRTYRYMTQQPLFPFGYGLSYTTF------------ 746
Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW 699
+G + ++ + IK L + V V N G +DG + V+
Sbjct: 747 -----DYGKL--SLGKEQIKQGEP------LKLAVSVTNSGQRDGEEVVQVYLKKQDDAE 793
Query: 700 APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVS 759
P K L AF++V++PAG V ++ + +S T R+ G+++I +G +
Sbjct: 794 GPGKTLRAFKRVYIPAGKSVDVEFDLKDKELEWWDAQSNTMRVCAGKYDIMVGRSSQDQD 853
Query: 760 LHAATL 765
L +
Sbjct: 854 LQRGSF 859
>gi|365836152|ref|ZP_09377549.1| glycosyl hydrolase family 3 protein [Hafnia alvei ATCC 51873]
gi|364564272|gb|EHM42040.1| glycosyl hydrolase family 3 protein [Hafnia alvei ATCC 51873]
Length = 810
Score = 255 bits (651), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 205/689 (29%), Positives = 332/689 (48%), Gaps = 102/689 (14%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASS+N +GR+ + Y GL W+P V++ RDPRWGR
Sbjct: 163 TVFPISLGLASSWNRDAVTTVGRI------SAYEASEDGLNMTWAPMVDVTRDPRWGRTS 216
Query: 170 ETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V +QG++ DR V S KHF AY ++N VD
Sbjct: 217 EGFGEDTYLTSEMGRLMVEAMQGNNPADRHSVMTSVKHFAAYGAVEGGRDYNTVD----- 271
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + P++ + G VM S N +NG P ++ +LK +R EW G
Sbjct: 272 --MSPQRLFQDYMPPYKAALDAGS-GGVMVSLNSINGTPATSNSWLLKDVLRDEWNFKGI 328
Query: 285 IVSDCDSVGVYYDTQH-FTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
+SD ++ +H S PE+A A+++G+D+ + + S V+ G +SE +
Sbjct: 329 TISDHGAIKEL--IKHGVASDPEDAVRVAVKSGIDMSMSDEYYSKYLPSLVKSGRVSEKE 386
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F + Y HLGP D H+ A AR+ +V
Sbjct: 387 VDDAARHVLNVKYDMGLF-----TDAYSHLGPVGSDPVDTNAESRLHRPEARSVARESMV 441
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQG----IGRYA 449
LLKN+ +LPL+ + T+A+IGP +D ++G++ AG+ T LQG +G A
Sbjct: 442 LLKNRLNTLPLA--KSGTIALIGPLADSKRDVMGSWSAAGVVDQSVTVLQGLRNAVGYKA 499
Query: 450 RTIHQQGCK---DVACAD----------------DQLFGAAIDASRQADATILVMGLDQS 490
+ ++ +G D D + A+ A++++D + V+G Q
Sbjct: 500 QIVYAKGANVSNDPGITDFLNLYEKAVTVDPRSPQAMIDEAVAAAKKSDVIVAVVGEAQG 559
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R+ + +P Q++L++ + K P +LVLM+G P+ A K D + AI+ +
Sbjct: 560 MAHEASSRSDITIPQSQRDLIAALKQTGK-PLVLVLMNGRPL--ALEKEDQQADAILESW 616
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
+ G GG AIAD+LFG NP GKLPM+ +P+ + +P+ T P + ++Y
Sbjct: 617 FSGTEGGNAIADVLFGDYNPSGKLPMS-FPRS-VGQIPIYYSHLNTGRPYNPEKPEKYTS 674
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
Y GP +YPFG+G+SYT F +V++ ++ P+ R GS+ A+++
Sbjct: 675 HYYDAANGP-LYPFGYGLSYTTF--SVSDVK--MSSPIMKRDGSVTASVT---------- 719
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGI 723
VKN G +DGA + L A P K+L FEKV + +G Q+ +
Sbjct: 720 -----------VKNTGKRDGATVVQLYIQDKTATMSRPVKELKGFEKVMLKSGEQKTISF 768
Query: 724 NIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
I + L + S G+ N+ IG
Sbjct: 769 KIDANQ-LKFWNASMKYVSEPGKFNVFIG 796
>gi|94495010|ref|ZP_01301591.1| family 3 glycoside hydrolase [Sphingomonas sp. SKA58]
gi|94425276|gb|EAT10296.1| family 3 glycoside hydrolase [Sphingomonas sp. SKA58]
Length = 872
Score = 255 bits (651), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 150/398 (37%), Positives = 224/398 (56%), Gaps = 43/398 (10%)
Query: 56 LIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQ 115
++ +L+LQEK + L++ A A+PRLGI Y WW+E+LHG +GP P T+FP+
Sbjct: 38 IVRQLTLQEKTEQLLNTAPAIPRLGIPAYNWWTESLHGA--IGP-------VP-TTNFPE 87
Query: 116 VITTASSFNATLWEAIGRVVSDEARAMYNGGT---------AGLTYWSPNVNIFRDPRWG 166
I A++F+A L + V+S E +A++ G GL WSPN+NIFRDPRWG
Sbjct: 88 PIGLAATFDAPLIHTVASVISTEVQALHTLGRQTGHLGRIGTGLDTWSPNINIFRDPRWG 147
Query: 167 RGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAK 226
RGQET GEDP L+ + ++++G+QG + D V A+ KHF + R N
Sbjct: 148 RGQETYGEDPYLTARMGVAFIQGMQGDNPDLPDVVATPKHFAVHSGPE---PSRHTDNIF 204
Query: 227 VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIV 286
+K+D+EDT+ FR ++EGK S+MC+YN+V+G P C +L +R W GY+V
Sbjct: 205 ATKRDLEDTYLPAFRAAIVEGKAGSIMCAYNRVDGEPACGSAMLLTDYLRNAWGFRGYVV 264
Query: 287 SDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPF-------LGLHTESAVQRGLLS 339
SDCD+V Y + +P + A+R G+D +C LG + A+ +S
Sbjct: 265 SDCDAVVDIYAHHKYAPSPATGVSVALRRGVDSECNNATLSGQAGLGDRYKDALAANHIS 324
Query: 340 EIDINNALVNTLTVQMRLGMFDG----EPSSQPYGHLGPKDVCTPDHQELALEAARQGIV 395
DI++ALV + ++R G G +P+S P +G TP H LAL+AA + +V
Sbjct: 325 IGDIDSALVRLFSARLRNGDLPGLSARKPNSTPVSAIG-----TPAHGALALDAAVKSLV 379
Query: 396 LLKNQGPSLPLSH-IRHRTVAVIGPNSDVTVTMIGNYA 432
LLKN+G +LPL +R + + GP +D T + GNY+
Sbjct: 380 LLKNEG-ALPLKPGVR---IVLAGPLADATRVLRGNYS 413
Score = 149 bits (375), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 104/289 (35%), Positives = 153/289 (52%), Gaps = 50/289 (17%)
Query: 476 RQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILV 525
R AD + VMGL +E E + DR L LP Q + K K P ILV
Sbjct: 599 RDADVIVAVMGLTSDLEGEEMPVKVEGFEGGDRTTLALPADQIAFLEKAKATGK-PLILV 657
Query: 526 LMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYIT 585
+M+G ID+ +AK++ AAI+ A YPGQ+GG AIA++L G ++PGG+LP+T+Y +
Sbjct: 658 MMNGSAIDLGWAKDN--AAAILEAWYPGQSGGLAIANVLSGKADPGGRLPLTFY--HSVD 713
Query: 586 NL-PMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDG 644
+L P T+ +M GRTYR+++G VYPFGHG+SYT F + AP VV P+DG
Sbjct: 714 DLPPFTDYSME--------GRTYRYFRGAPVYPFGHGLSYTRFRY----APLVVE-PIDG 760
Query: 645 RHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQ 704
K ++VT R+T NVG + G ++ TPPA AP
Sbjct: 761 --------AVEKGLRVT----TRIT--------NVGQRPGDEVAQLYITPPAFEGAPRTA 800
Query: 705 LVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
L F+++ + AG + + + + LS V +G R + G +++ +GG
Sbjct: 801 LRGFQRLSLKAGESRAISFTLS-PRDLSFVTMAGDRMLIPGNYDLSVGG 848
>gi|218131278|ref|ZP_03460082.1| hypothetical protein BACEGG_02889 [Bacteroides eggerthii DSM 20697]
gi|217986495|gb|EEC52831.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
eggerthii DSM 20697]
Length = 862
Score = 254 bits (650), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 160/409 (39%), Positives = 222/409 (54%), Gaps = 34/409 (8%)
Query: 66 VKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNA 125
V L++ + AV RLGIK Y WW+EALHGV+ G AT FPQ I A+SF+
Sbjct: 46 VALMMDNSQAVERLGIKPYNWWNEALHGVARAGL----------ATVFPQPIGMAASFSP 95
Query: 126 TLWEAIGRVVSDEARAMYNGGTA---------GLTYWSPNVNIFRDPRWGRGQETPGEDP 176
+ VSDEARA N A GLT W+P VNI+RDPRWGRG ET GEDP
Sbjct: 96 ESVYEVFSAVSDEARAK-NANYASQGSYARYQGLTMWTPTVNIYRDPRWGRGIETYGEDP 154
Query: 177 VLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAK-VSKQDIED 234
L+ + V+GLQG+ DG K+ A KHF + WN R FN + +S +D+ +
Sbjct: 155 YLTSRMGVMVVKGLQGTNDGKYDKLHACAKHFAVHSGPEWN---RHSFNVEDLSARDLYE 211
Query: 235 TFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGV 294
T+ PF V E KV VMC+YN G P C +L + +R +W +G I+SDC ++
Sbjct: 212 TYLPPFEALVKEAKVKEVMCAYNSFEGEPCCGSNRLLMQILRNDWGFDGIILSDCGAIAD 271
Query: 295 YYDTQHFTSTP--EEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLT 352
+Y+ + P E A+A A+ G DL+CG + AV+ G ++E DI+ A++ L
Sbjct: 272 FYNEHGHKAYPDAESASAAAVLNGTDLECGSSYKALVK-AVRAGQINEKDIDKAVMRLLE 330
Query: 353 VQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHR 412
+ LG D +P + + + V + +H LAL AR+ + LL+N G LPL
Sbjct: 331 ARFALGEMD-DPDNVSWTKIPFSVVASAEHDSLALNMARKSMTLLQNTGNILPLKR-GGL 388
Query: 413 TVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGC 457
TVAV+GPN++ +V GNY G+ T L GI G + I++QGC
Sbjct: 389 TVAVMGPNANDSVMQWGNYNGMPPHTVTILGGIRKALGADDKLIYEQGC 437
Score = 122 bits (306), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 91/306 (29%), Positives = 140/306 (45%), Gaps = 47/306 (15%)
Query: 470 AAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASK 519
A++ + AD + G+ +E E + DR + LP Q+EL+S + A K
Sbjct: 591 ASVARVKDADIVVFASGVSPVLEGEEMGVNLPGFKGGDRTDIELPAIQRELISALHRAGK 650
Query: 520 GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWY 579
ILV SG PI A AI+ A YPGQ GGTA+A++LFG NPGG+LP+T+Y
Sbjct: 651 -KIILVNCSGSPI--ALEPETKNCEAILQAWYPGQQGGTAVAEVLFGDYNPGGRLPVTFY 707
Query: 580 PQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
++ LP E GRTYR+ ++PFG+G+SYT F
Sbjct: 708 RN--MSQLPDFE-------DYNMTGRTYRYMTQQPLFPFGYGLSYTTF------------ 746
Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW 699
+G + ++ + IK L + V V N G +DG + V+
Sbjct: 747 -----DYGKL--SLDKEQIKQGEP------LKLAVSVTNSGQRDGEEVVQVYLKKQDDAE 793
Query: 700 APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVS 759
P K L AF++V++PAG V ++ + +S T R+ G+++I +G +
Sbjct: 794 GPGKTLRAFKRVYIPAGKSVNVEFDLKDKELEWWDAQSNTMRVCAGKYDIMVGRSSQDQD 853
Query: 760 LHAATL 765
L +
Sbjct: 854 LQRGSF 859
>gi|421846569|ref|ZP_16279716.1| beta-D-glucoside glucohydrolase [Citrobacter freundii ATCC 8090 =
MTCC 1658]
gi|411772163|gb|EKS55801.1| beta-D-glucoside glucohydrolase [Citrobacter freundii ATCC 8090 =
MTCC 1658]
Length = 765
Score = 254 bits (649), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 209/691 (30%), Positives = 332/691 (48%), Gaps = 106/691 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN +GR+ + Y GL W+P V++ RDPRWGR
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRI------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V +QG S DR V S KHF AY +N VD
Sbjct: 172 EGFGEDTYLTATMGKTMVEAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM N +NG P +D +LK +R EW G
Sbjct: 227 --MSPQRLFNDYMPPYKAGLDAGS-GAVMVGLNSLNGTPATSDAWLLKDVLRDEWGFKGI 283
Query: 285 IVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T S PE+A AI++G+++ + + ++ G ++ +
Sbjct: 284 TVSDHGAIKEL--IKHGTASDPEDAVRVAIKSGINMSMSDEYYSKYLPGLIKSGKVTMAE 341
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLG K+ D H++ A E AR+ +V
Sbjct: 342 LDDAARHVLNVKYDMGLFN-----DPYSHLGAKESDPQDTNAESRLHRKEAREVARESLV 396
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGIGR------ 447
LLKN+ +LPL + TVAV+GP +D ++G++ AG+A T L GI
Sbjct: 397 LLKNRLETLPLK--KSATVAVVGPLADSKRDVMGSWSAAGVADQSVTVLTGIKNAVGDQG 454
Query: 448 ---YAR--------------TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQS 490
YA+ ++++ K + + A++A++Q+D + V+G Q
Sbjct: 455 KVLYAKGANITNDKGIVDFLNLYEEAVKVDPRSPQAMIDEAVNAAKQSDVVVAVVGEAQG 514
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ K D + AI+
Sbjct: 515 MAHEASSRTDITIPQSQRDLIAALKATGK-PLVLVLMNGRPL--VLVKEDQQADAILETW 571
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM---RP---SQSKRYPG 604
+ G GG AIAD+LFG NP GKLPM+ +P+ + +P+ + RP + +Y
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPMS-FPRS-VGQIPVYYSHLNTGRPYNADKPNKYTS 629
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F TV++ ++ P R G++ A+
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSDVK--LSSPTMKRDGTVTAS------------ 672
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
V V N G ++GA + ++ A P KQL FEKV++ G Q V
Sbjct: 673 ---------VQVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKVNLKPGETQTVSF 723
Query: 723 -INIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
I+I K+ + R P G+ N+ IG
Sbjct: 724 PIDIDALKFWN--QRMQYDAEP-GKFNVFIG 751
>gi|395230595|ref|ZP_10408899.1| periplasmic beta-glucosidase [Citrobacter sp. A1]
gi|424731344|ref|ZP_18159928.1| periplasmic beta-glucosidase [Citrobacter sp. L17]
gi|394715980|gb|EJF21765.1| periplasmic beta-glucosidase [Citrobacter sp. A1]
gi|422893995|gb|EKU33810.1| periplasmic beta-glucosidase [Citrobacter sp. L17]
Length = 765
Score = 254 bits (649), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 210/691 (30%), Positives = 332/691 (48%), Gaps = 106/691 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN +GR+ + Y GL W+P V++ RDPRWGR
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRI------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V +QG S DR V S KHF AY +N VD
Sbjct: 172 EGFGEDTYLTATMGKTMVEAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM N +NG P +D +LK +R EW G
Sbjct: 227 --MSPQRLFNDYMPPYKAGLDAGS-GAVMVGLNSLNGTPATSDAWLLKDVLRDEWGFKGI 283
Query: 285 IVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T S PE+A AI++G+++ + + ++ G ++ +
Sbjct: 284 TVSDHGAIKEL--IKHGTASDPEDAVRVAIKSGINMSMSDEYYSKYLPGLIKSGKVTMAE 341
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLG K+ D H++ A E AR+ +V
Sbjct: 342 LDDAARHVLNVKYDMGLFN-----DPYSHLGAKESDPQDTNAESRLHRKEAREVARESLV 396
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGIGR------ 447
LLKN+ +LPL + TVAV+GP +D ++G++ AG+A T L GI
Sbjct: 397 LLKNRLETLPLK--KSATVAVVGPLADSKRDVMGSWSAAGVADQSVTVLTGIKNAVGDQG 454
Query: 448 ---YAR--------------TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQS 490
YA+ ++++ K + + A+ A++Q+D + V+G Q
Sbjct: 455 KVLYAKGANITNDKGIVDFLNLYEEAVKVDPRSPQAMIDEAVIAAKQSDVVVAVVGEAQG 514
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 515 MAHEASSRTDITIPQSQRDLIAALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM---RP---SQSKRYPG 604
+ G GG AIAD+LFG NP GKLPM+ +P+ + +P+ + RP + +Y
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPMS-FPRS-VGQIPVYYSHLNTGRPYNADKPNKYTS 629
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
R + GP +YPFG+G+SYT F TV++ ++ P R G++ A+
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSDVK--LSSPTMKRDGTVTAS------------ 672
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
V V N G ++GA + ++ A P KQL FEKV++ G Q V
Sbjct: 673 ---------VQVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKVNLKPGETQTVSF 723
Query: 723 -INIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
I+I K+ + R P G+ N+ IG
Sbjct: 724 PIDIDALKFWN--QRMQYDAEP-GKFNVFIG 751
>gi|380692851|ref|ZP_09857710.1| beta-glucosidase [Bacteroides faecis MAJ27]
Length = 854
Score = 254 bits (649), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 165/447 (36%), Positives = 241/447 (53%), Gaps = 34/447 (7%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
P+ L +R +DL+GRL+L+EK+ L+ + + AV RLGI Y WWSE LHGV+ G
Sbjct: 15 PYKNHQLSPKERADDLLGRLTLKEKIGLMQNSSNAVERLGIAPYNWWSEGLHGVARNGL- 73
Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAM---------YNGGTAGLT 151
AT FP + AS+F+ E I VSDE RA Y G GLT
Sbjct: 74 ---------ATVFPITMGMASTFDDEAIEKIYTAVSDEGRAKFHDAHRRNRYGCGNEGLT 124
Query: 152 YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAY 210
+W+PNVNIFRDPRWGRGQET GEDP L+ + S V+G+QG +D K A KH+ +
Sbjct: 125 FWNPNVNIFRDPRWGRGQETFGEDPYLTTRMGVSVVKGMQGPADAAYDKTHACAKHYAVH 184
Query: 211 DLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
R F+ + +S +D+ +T+ F+ V E V VMC+Y + G P C
Sbjct: 185 SGPE---AKRHSFDVEHLSPRDLWETYLPAFKALVQEADVKEVMCAYQRFEGEPCCDSNR 241
Query: 270 ILKRTIRGEWRLNGYIVSDCDSVG-VYYDTQHFT-STPEEAAADAIRAGLDLDCGPFLGL 327
+L + +R EW +VSDC ++ +Y +H T +A+A A+ G DL+CG
Sbjct: 242 LLTQILRDEWGYKHLVVSDCGAISDFFYKDRHGTHKDAADASASAVINGTDLECGVEYA- 300
Query: 328 HTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELAL 387
H E AV+RGL++E IN +L L + LG D + S + +G V H+++AL
Sbjct: 301 HLEEAVKRGLITEERINTSLRRLLEARFALGEMDPD-SIVAWSKIGIDTVDCDLHRQMAL 359
Query: 388 EAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI-G 446
+ R+ +VLL N G LPL+ R +AV+GPN+ +V GNY G+ T L+GI
Sbjct: 360 DITRKSMVLLHNNG-ILPLAKTSTR-IAVMGPNAVDSVMQWGNYKGVPSHTYTILEGIRN 417
Query: 447 RYARTIHQQGCKDVACADDQLFGAAID 473
+ +++GC+ ++Q+F + +
Sbjct: 418 KVGNVPYEKGCE---LLENQVFDSYFN 441
Score = 99.0 bits (245), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 86/295 (29%), Positives = 127/295 (43%), Gaps = 50/295 (16%)
Query: 472 IDASRQADATILVMGLDQSIEAEAL-----------DRAGLLLPGRQQELVSKVSMASKG 520
+D + AD I + G+ +E E DR + LP Q++ + K +
Sbjct: 584 VDKVKDADIVIFLGGISPDLEGEEKHFVNCPGFSGGDRTSIELPQVQRD-ILKALKKAGK 642
Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
+ V SG VA AI+ A YPGQAGG A+AD+LFG NP GKLP+T+Y
Sbjct: 643 KVVFVNCSGSA--VALVPEMESCDAILQAWYPGQAGGLAVADVLFGDFNPSGKLPVTFYK 700
Query: 581 QEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
LP + +M+ RTYR+ ++PFG+G+SYT F
Sbjct: 701 S--TDQLPDFEDYSMK--------NRTYRYMVEAPLFPFGYGLSYTTF------------ 738
Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW 699
D G +N K+I L +V+VKN G DGA + V+
Sbjct: 739 ---DISKGRLNK----KSISAGK------DLNFKVNVKNTGKCDGAEVIQVYVRKVDDME 785
Query: 700 APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGT 754
P K L AF +V + AG V I++ + + T R+ G++ + G +
Sbjct: 786 GPVKSLRAFRRVPLKAGESCVVSIDLLPATFEFFDPVTNTMRVMPGKYEVMYGNS 840
>gi|253574725|ref|ZP_04852065.1| glycoside hydrolase [Paenibacillus sp. oral taxon 786 str. D14]
gi|251845771|gb|EES73779.1| glycoside hydrolase [Paenibacillus sp. oral taxon 786 str. D14]
Length = 953
Score = 254 bits (649), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 164/446 (36%), Positives = 238/446 (53%), Gaps = 38/446 (8%)
Query: 36 TTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVS 95
T PF LP+ +RV+DL+ R +L EK++L+ VPRLG+K Y+ +E HGV+
Sbjct: 4 NTYQYPFQNPDLPLAERVDDLVSRFTLDEKIELMCQFQTEVPRLGVKKYKHGTEGAHGVA 63
Query: 96 NVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY--NGGTAGLTYW 153
+G ATSFPQ A ++N L IG V+ DEAR + N GLT W
Sbjct: 64 WLGE----------ATSFPQNTGLACTWNPELMRQIGSVIGDEARVYFQRNPEVNGLTIW 113
Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLD 213
+P V++ RDPRWGR +E GEDP L+G V+GLQG K A+ KHF A
Sbjct: 114 APTVDMERDPRWGRTEEAYGEDPHLTGALTTELVKGLQGDHPFYYKAVATLKHFYA---- 169
Query: 214 NWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKR 273
N N + R + + ++ + + F EG+ S+M +YN +NG P C + +
Sbjct: 170 NNNEIGRGSESVSIDPRNKREYYLKAFERAFREGRAGSMMTAYNGINGTP-CNLNSEVNE 228
Query: 274 TIRGEWRLNGYIVSDC-DSVGVYYDTQHFTSTPEEAAADAIRAGLD--LDCGPFLGLHTE 330
++GEW ++G++V D D +G D ++ S EA A +I+AG+D D P +
Sbjct: 229 IVKGEWEMDGFVVGDAGDVMGTVLDHKYVESYA-EAVAGSIKAGIDSMTDEQPIMFHALR 287
Query: 331 SAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAA 390
+A+++GLL+E D++ AL N V+ RLG FD E PY + + +C P+H LAL+A+
Sbjct: 288 NALEQGLLAESDLDRALRNAFRVRFRLGEFDPE-ERNPYSRVPEEKLCAPEHAALALKAS 346
Query: 391 RQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYAR 450
R+ IVLLKN+G LPL +VAV+GP ++ T Y+G TPLQGI
Sbjct: 347 REAIVLLKNEG-LLPLPQTLG-SVAVVGPLANEAYTDW--YSGTPPYRITPLQGIK---- 398
Query: 451 TIHQQGCKDVACADDQLFGAAIDASR 476
+ G + V LFG +D R
Sbjct: 399 --EKMGDRPV------LFGTGLDRIR 416
Score = 96.3 bits (238), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 80/297 (26%), Positives = 138/297 (46%), Gaps = 47/297 (15%)
Query: 471 AIDASRQADATILVMGLDQSIEA-EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSG 529
A+ A++ A+ ++ +G + I A E +DR ++LP QQ L+ V + T++V++
Sbjct: 559 AVQAAKSAETAVVFVGNNPFINAKETIDRVDIVLPPAQQALIQAVRAVNPN-TVVVIVGS 617
Query: 530 GPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM 589
P V + + + +I++ + Q G A+ D+LFG NP G+L MTWY + + LP
Sbjct: 618 YPFAVNW--ENAHVPSILFTSHSAQELGHAVTDVLFGDYNPAGRLNMTWY--QSVDQLPD 673
Query: 590 TEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSI 649
M + KR TY+++ G V+YPFGHG+SYT F ++ A A+ DG
Sbjct: 674 I-MDYDIIKGKR----TYQYFDGEVLYPFGHGLSYTTFEYSDL-ALDRAAIGQDGE---- 723
Query: 650 NATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-----STPPAGHWAPHKQ 704
+ + DV+N G+ G ++ S P P K
Sbjct: 724 --------------------VQISFDVRNAGNIAGDEVPQLYVRIGQSRVP----RPLKT 759
Query: 705 LVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRR-IPLGEHNIHIGGTKHSVSL 760
L F ++H+ QRV + + L+ D + R + GE+ + +G + ++L
Sbjct: 760 LKGFTRLHLQPDETQRVTFTLK-AEELAFWDVTRDRYCVESGEYTVMVGPSSDEIAL 815
>gi|218130693|ref|ZP_03459497.1| hypothetical protein BACEGG_02282 [Bacteroides eggerthii DSM 20697]
gi|217987037|gb|EEC53368.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
eggerthii DSM 20697]
Length = 954
Score = 254 bits (649), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 213/743 (28%), Positives = 355/743 (47%), Gaps = 103/743 (13%)
Query: 39 TLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISG--AAAVPRLGIKGYEWWSEALHGVSN 96
TL + LP+ +RV L+ ++ ++K++L+ G +P L + EA+HG S
Sbjct: 167 TLRYMDPRLPVNERVESLLSVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFS- 224
Query: 97 VGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPN 156
+G GAT FPQ + +++N L E + + +E A GT + WSP
Sbjct: 225 ------YGS---GATIFPQALAMGATWNKNLTEKVAMAIGEETLA---AGT--MQAWSPV 270
Query: 157 VNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWN 216
+++ +D RWGR +ET GEDPVL + ++++G Q S G + + KHF +
Sbjct: 271 LDVAQDARWGRCEETFGEDPVLVSQIGGAWIKGYQ-SKG----LFTTPKHFGGHGAP-LG 324
Query: 217 GVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIR 276
G D +S++++ + VPFR + S+M +Y+ GVP +L+ +R
Sbjct: 325 GRDSHDIG--LSEREMREVHLVPFRHVIRNYSCQSLMMAYSDFLGVPVAKSKELLRNILR 382
Query: 277 GEWRLNGYIVSDCDSVGVYYDTQHFTSTPE-EAAADAIRAGLDLDCG-PFLGLHTESAVQ 334
EW +G++VSDC ++G +H+T+ + EAA A+ AG+ +CG + A +
Sbjct: 383 EEWGFDGFVVSDCGAIGNLTSRKHYTAKNKIEAANQALAAGIATNCGDTYNDKEVIQAAK 442
Query: 335 RGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKD-------VCTPDHQELAL 387
G L +++N L + R +F+ E H P D + H+E+A
Sbjct: 443 DGRLDMANLDNVCRTMLRMMFRNELFEKE-------HKEPLDWNKIYPGWNSDSHKEIAR 495
Query: 388 EAARQGIVLLKNQGPSLPLS-HIRHRTVAVIGPNSDVTVTMIGNYA------GIACGYTT 440
++AR+ IV+L+N+ LPLS H+ RT+AV+GP +D G+Y + T
Sbjct: 496 QSARESIVMLENKDDVLPLSKHV--RTIAVLGPGAD--NLQPGDYTPKLRPGQLKSVLTG 551
Query: 441 PLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEA------- 493
Q +G + +++QGC + ++ + A+ + Q+D +LV+G + EA
Sbjct: 552 IKQAVGNQTKILYEQGC-EFTGSNGENIPNAVKVAAQSDVVVLVLGDCSTSEATTDVYKT 610
Query: 494 --EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGY 551
E D A L+LPGRQQEL+ V K P +LVL G P ++ K AII
Sbjct: 611 SGENHDYATLILPGRQQELLEAVCATGK-PVVLVLQIGRPYNL--TKESELCKAIIVNWL 667
Query: 552 PGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYK 611
PGQ GG A AD+LFG NP G+LPMT +P+ ++ LP+ + +RY +Y
Sbjct: 668 PGQEGGLATADVLFGDYNPAGRLPMT-FPR-HVGQLPL--YYNFKTSGRRYEYSDLEYYP 723
Query: 612 GPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLG 671
+Y FG+G+SYT+F ++ + +G+I A
Sbjct: 724 ---LYYFGYGLSYTSFEYSDLKVE-------ERENGNIIA-------------------- 753
Query: 672 VQVDVKNVGSKDGAHTLLVFSTPPAGHWAPH-KQLVAFEKVHVPAGAQQRVGINIHVCKY 730
V+VKN+G + G + ++ T +L F +VH+ G + + + +
Sbjct: 754 -HVNVKNIGHRAGDEVVQLYVTDMYASVKTRITELKDFTRVHLRPGESKSISFELTPYE- 811
Query: 731 LSVVDRSGTRRIPLGEHNIHIGG 753
LS+++ + R + G I +GG
Sbjct: 812 LSLLNDNMDRVVEKGTFKILVGG 834
>gi|329928264|ref|ZP_08282164.1| glycosyl hydrolase family 3 C-terminal domain protein
[Paenibacillus sp. HGF5]
gi|328937943|gb|EGG34345.1| glycosyl hydrolase family 3 C-terminal domain protein
[Paenibacillus sp. HGF5]
Length = 934
Score = 254 bits (648), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 163/465 (35%), Positives = 247/465 (53%), Gaps = 28/465 (6%)
Query: 41 PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
PF LP+ +RV+DL+ RL+L EK++L+ +PRLG++ Y+ +E HGV+ +G
Sbjct: 4 PFQDPDLPLEERVHDLVSRLTLDEKIELMCQYQTEIPRLGVQKYKHGTEGAHGVAWLGE- 62
Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--GLTYWSPNVN 158
AT FPQ A ++N L IG V+++EAR Y A GLT W+P V+
Sbjct: 63 ---------ATVFPQNTGLACTWNPELMREIGSVIAEEARVYYQRDRAINGLTIWAPTVD 113
Query: 159 IFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGV 218
+ RDPRWGR +E GEDP L+G+ + V+G+QG K A+ KHF + + G
Sbjct: 114 LERDPRWGRTEEAYGEDPHLTGQLSTGLVKGMQGDHPFYYKTVATLKHFYGNNNEADRGS 173
Query: 219 DRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGE 278
+ + ++ F+ PFR EGK S+M +YN +NG P C + + ++ E
Sbjct: 174 ASVSIDPRNKREYYLKAFEAPFR----EGKAGSMMTAYNGINGTP-CNLNHEVNDIVKQE 228
Query: 279 WRLNGYIVSDC-DSVGVYYDTQHFTSTPEEAAADAIRAGLD--LDCGPFLGLHTESAVQR 335
W ++G++V D D +G D H+ ++ EA AD+++AG+D D A+++
Sbjct: 229 WGMDGFVVGDAGDVLGTVMD-HHYVASYAEAVADSVKAGIDSITDDQDISFRALRDALEQ 287
Query: 336 GLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIV 395
GLL E +++AL NT V+ RLG FD E PY + +C P+H EL+L AAR+ IV
Sbjct: 288 GLLEEQHLDHALRNTFRVRFRLGEFDPE-ERNPYSRVPESKLCAPEHAELSLRAARESIV 346
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA--RTIH 453
LLKN G LPL ++ AVIGP ++ T Y+G TPLQG+ A R++
Sbjct: 347 LLKNDG-LLPLPRDPFKSAAVIGPLANEAFT--DWYSGTPPYRITPLQGVQAKAGDRSVQ 403
Query: 454 QQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDR 498
D + G + S + + IL+ G +S +A +R
Sbjct: 404 FHTGLDQVRLRSAVSGTYVALSSE-EQGILLAGTSESADAAVFER 447
Score = 90.5 bits (223), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 68/235 (28%), Positives = 110/235 (46%), Gaps = 40/235 (17%)
Query: 494 EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPG 553
E +DR + LP QQ L+ V A+ T++V++ G P V + + + AI++ + G
Sbjct: 565 ETIDRPDITLPPAQQALIQAVFEANP-RTVVVIVGGYPFAVNWEQE--HLPAILFTSHAG 621
Query: 554 QAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGP 613
Q G A+AD+L+G NPGG+L MTWY LP M + K RTY+++ G
Sbjct: 622 QELGHAVADVLYGDYNPGGRLNMTWYKS--TEQLPDI-MDYDIIKGK----RTYQYFDGD 674
Query: 614 VVYPFGHGMSYTNFVHT--VANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLG 671
V+YPFGHG+SY+ F ++ + + P + A GS+ T++
Sbjct: 675 VLYPFGHGLSYSRFKYSGLLLSQPKMTA------EGSVTVTVA----------------- 711
Query: 672 VQVDVKNVGSKDGAHTLLVFSTPPAGH-WAPHKQLVAFEKVHVPAGAQQRVGINI 725
V+N + G + ++ + P K L F +VHV + V +
Sbjct: 712 ----VRNESQRAGDEVVQLYVKADSSRVIRPLKTLKGFRRVHVAPDTSETVTFEL 762
>gi|149280000|ref|ZP_01886125.1| putative beta-glucosidase [Pedobacter sp. BAL39]
gi|149229197|gb|EDM34591.1| putative beta-glucosidase [Pedobacter sp. BAL39]
Length = 793
Score = 254 bits (648), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 202/688 (29%), Positives = 330/688 (47%), Gaps = 89/688 (12%)
Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
G T FP I +S+++ L + + ++ E R GG G + P +++ R+PRW R
Sbjct: 152 GTTVFPTSIGQSSTWDPALIKEMAAAIAMETR--LQGGHIG---YGPVLDLAREPRWSRV 206
Query: 169 QETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFN--- 224
+ET GEDPVL+ + + V G QG++ G + + ++ KHFTAY GV N
Sbjct: 207 EETYGEDPVLNSRMGEAMVSGFQGTNIGSGVNILSTLKHFTAY------GVPEGGHNGGS 260
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
V +++ ++ PF+ V G + SVM +YN V+G+P ++ +L +RG+W NG+
Sbjct: 261 VTVGNRELFQSYLPPFKAAVKAGAL-SVMTAYNSVDGIPCSSNRYLLTDILRGQWGFNGF 319
Query: 285 IVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPF-LGLHTESAVQRGLLSEIDI 343
+VSD +S+ H S+ EAAA A+ AGLD D + G AV GL+ +
Sbjct: 320 VVSDLNSISGLEGNHHVASSATEAAALAMNAGLDADLSGYGYGPALVKAVNGGLVKMATV 379
Query: 344 NNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
+ AL L ++ +G+F+ P P K V H LA + A++ +VLLKN+
Sbjct: 380 DTALARVLRLKFNMGLFE-NPYVNP--KQAEKQVMNAKHVTLARKVAQESVVLLKNEKNI 436
Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIAC--GYTTPLQGI----GRYARTIHQQGC 457
LPLS + +AVIGPN+D +G+Y T L GI + +Q+GC
Sbjct: 437 LPLSKAL-KNIAVIGPNADNVYNQLGDYTAPQADGKVITVLNGIRAKVSKETGVFYQKGC 495
Query: 458 -------------------KDVAC-----ADDQLFGAAIDASRQADATILVMGLDQSIEA 493
DVA + + F + A+ + +
Sbjct: 496 AIRDTASAGIAAAVALASKSDVAIVVLGGSSARDFKTEYQNTGAAEVKASAVAVSDMESG 555
Query: 494 EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPG 553
E DR+ L L GRQ EL+ V + + P ++VL+ G P+ + +A + +AA++ A YPG
Sbjct: 556 EGFDRSTLDLMGRQMELLRAV-VKTGTPVVVVLIKGRPLTLNWAAEN--VAAMVDAWYPG 612
Query: 554 QAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGP 613
Q GG AIAD+LFG NP G+L ++ P+ + LP+ RP P + P
Sbjct: 613 QEGGNAIADVLFGDYNPAGRLSVS-VPKS-VGQLPVYYNKKRP-----LPHNYVELDEQP 665
Query: 614 VVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQ 673
+Y FG+G+SY+ F ++ N T V+ GR GK ++V V
Sbjct: 666 -LYSFGYGLSYSTFEYS--NLKTNVS----GR---------GKDVRVQ----------VT 699
Query: 674 VDVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLS 732
D+KN GS+DG + L + P +QL F ++ + +G QQ++ + + L
Sbjct: 700 FDLKNTGSRDGDEVVQLYLRDEQSSVVTPMQQLKQFRRLSLKSGQQQQLSFELS-AEDLQ 758
Query: 733 VVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
++++ ++ G+ ++ +G + + L
Sbjct: 759 LMNQQMEWQVEPGDFSLMVGASSSDIRL 786
>gi|395490413|ref|ZP_10421992.1| glycosyl hydrolase family protein [Sphingomonas sp. PAMC 26617]
Length = 898
Score = 254 bits (648), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 152/396 (38%), Positives = 218/396 (55%), Gaps = 40/396 (10%)
Query: 56 LIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQ 115
++ R++ +EK+ L++ A A+PRL I Y WW+E+LHG P T+FP+
Sbjct: 51 IVARMTTEEKLPQLLNVAPAIPRLKIPAYNWWTESLHGALGAVP----------TTNFPE 100
Query: 116 VITTASSFNATLWEAIGRVVSDEARAMYNGGT---------AGLTYWSPNVNIFRDPRWG 166
I A+SF+A L + +S E RA++ G GL WSPN+NIFRDPRWG
Sbjct: 101 PIGLAASFDAPLVHEVAGAISTEVRALHTMGRQTGKLGRIGTGLDTWSPNINIFRDPRWG 160
Query: 167 RGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAK 226
RGQET GEDP LS ++V G+QG + D V A+ KHF + R N
Sbjct: 161 RGQETYGEDPYLSAHMGVAFVTGMQGPNPDLPDVIATPKHFAVHSGPEST---RHGANVF 217
Query: 227 VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIV 286
VS D+EDT+ FR ++EG+ S+MC+YN+++G P CA+ +LK +RG W GY+V
Sbjct: 218 VSAHDLEDTYLPAFRAAIVEGRAGSIMCAYNRIDGQPACANDLLLKDHLRGAWGFTGYVV 277
Query: 287 SDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC-----GPFLGLHTE--SAVQRGLLS 339
SDCD+V D + A A A++AG D +C G GL +++RGL+S
Sbjct: 278 SDCDAVKDIADGHKYAPDQATAIAAAMKAGTDNECHTQTIGDMGGLPERFGDSLRRGLIS 337
Query: 340 EIDINNALVNTLTVQMRLGMFD---GEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
+ DI+ +LV + ++R G G P++ P + + +PDH LAL+A + +VL
Sbjct: 338 QDDIDRSLVRLFSARLRNGDLPGIAGAPAAVPV-----RAILSPDHMALALQATEKSLVL 392
Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYA 432
LKN G LPL +AVIGP D T + GNY+
Sbjct: 393 LKNDG-VLPLRP--GARIAVIGPLGDATRVLRGNYS 425
Score = 122 bits (306), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 87/277 (31%), Positives = 140/277 (50%), Gaps = 48/277 (17%)
Query: 486 GLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
GL +EAE D+ L LP Q ++ + K P I+V M+G PI++A
Sbjct: 635 GLTSDLEAEETGTDIPGFQGGDKTSLDLPAEQMAMLERAKATGK-PVIVVAMNGSPINLA 693
Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
+AK++ AAI+ A YPGQ+GG A+A++L G ++PGG+LP+T+Y + +LP
Sbjct: 694 WAKDN--AAAIVEAWYPGQSGGVAVANVLTGKTDPGGRLPLTFY--RSVADLP------- 742
Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG 655
P + GRTYR++ G VYPFGHG+SYT+F +T PL + SG
Sbjct: 743 PFDNYTMTGRTYRYFTGKPVYPFGHGLSYTSFAYT----------PL------VLNPASG 786
Query: 656 KAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPA 715
A + L V DV N + G + ++ + P AP L ++++ +
Sbjct: 787 GAEQ---------GLRVTTDVSNTVKRAGDEVVQLYLSFPDVPGAPRIALRGYQRITLAP 837
Query: 716 GAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
G ++ V ++ + LS V +G R + G++ + +G
Sbjct: 838 GERRSVTFDLS-PRDLSAVRANGVREVMQGQYRVSVG 873
>gi|336412679|ref|ZP_08593032.1| hypothetical protein HMPREF1017_00140 [Bacteroides ovatus
3_8_47FAA]
gi|335942725|gb|EGN04567.1| hypothetical protein HMPREF1017_00140 [Bacteroides ovatus
3_8_47FAA]
Length = 735
Score = 254 bits (648), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 233/795 (29%), Positives = 368/795 (46%), Gaps = 109/795 (13%)
Query: 8 FFLGLILLSASSSGLAAREPFACDPKDATTRTLP-FCQVSLPIPQRVNDLIGRLSLQEKV 66
LGL LL + S L++++ +++P + PI +R++DLI R++L+EKV
Sbjct: 6 LILGLCLLGVTHS-LSSKD----------KKSIPLYKDAKAPIEKRIDDLISRMTLEEKV 54
Query: 67 KLLI--------------SGAAAVP-RLGIKGYEWWSEALHGV--------SNVGPGTKF 103
L VP +G Y + L S +G F
Sbjct: 55 LQLNQYTLGRNNNVNNVGEEVKKVPSEIGSLIYFDINPELRNSMQKKAMEESRLGIPIIF 114
Query: 104 GGDFPGA--TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIF 160
G D T +P + A S+N L E V + EAR +G+ + +SP +++
Sbjct: 115 GYDAIHGFRTIYPISLGQACSWNPGLVEQACAVSAQEAR------MSGVDWTFSPMIDVA 168
Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVD 219
RDPRWGR E GEDP +G +AA+ VRG QG D ++AA KH+ Y
Sbjct: 169 RDPRWGRVAEGYGEDPYTNGVFAAASVRGYQGDDMSAENRMAACLKHYVGYGASE---AG 225
Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
R + ++S Q + DT+ +P+ M V G A++M S+N ++GVP A+P I+ ++ W
Sbjct: 226 RDYVYTEISAQTLWDTYLLPYEMGVKAG-AATLMSSFNDISGVPGSANPYIMTEILKKRW 284
Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPF-LGLHTESAVQRGLL 338
+ +G+IVSD +V Q +T ++AA A AGL++D H + V+ G +
Sbjct: 285 KHDGFIVSDWGAVE-QLKNQGLAATKKDAARYAFNAGLEMDMMSHAYDRHLKELVEEGKV 343
Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHL-GPKD-VCTPDHQELALEAARQGIVL 396
+ ++ ++ L V+ RLG+F+ +PY + KD P +A + A + +VL
Sbjct: 344 TMAQVDESVRRVLRVKFRLGLFE-----RPYTPVTNEKDRFFRPQSMAVAAQLAAESMVL 398
Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG------IACGYTTPLQGIGRYAR 450
LKN LPL++ + +AV+GP + ++G++ G + Y G A
Sbjct: 399 LKNDNQILPLTN--KKRIAVVGPMAKNGWDLLGSWCGHGKDTDVEMLYDGLTAEFGGEAE 456
Query: 451 TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQEL 510
+ GCK D F A+D R +D I+ +G + E R+ + LP Q+EL
Sbjct: 457 LRYAMGCKPQG-NDRSGFAGALDVVRWSDVVIVCLGEMLTWSGENASRSTIALPQIQEEL 515
Query: 511 VSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNP 570
V ++ A K P ILVL +G P+++ + +P AI+ PG G ++A IL G NP
Sbjct: 516 VKELKEAGK-PIILVLSNGRPLEL--NRMEPLCDAILEIWQPGINGARSMAGILSGRINP 572
Query: 571 GGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYR-FYKGPV---VYPFGHGMSYTN 626
GKL +T P + + ++R GR ++ FYK Y FG+G+SYT
Sbjct: 573 SGKLAIT---------FPYSTGQIPIYYNRRKSGRWHQGFYKDITSDPFYSFGYGLSYTE 623
Query: 627 FVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAH 686
F + V P+ V + G L V+V V NVG +DGA
Sbjct: 624 FQYGVVT-PSSTTV----KRGE--------------------KLSVEVTVTNVGKRDGAE 658
Query: 687 TLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLG 745
T+ F S P P K+L FEK + G + ++ + + L VD +G R + G
Sbjct: 659 TVHWFISDPYCSITRPVKELKHFEKQFIKVGETRTFRFDVDLERDLGFVDGNGKRFLEAG 718
Query: 746 EHNIHIGGTKHSVSL 760
E+NI + K + L
Sbjct: 719 EYNIWVQDQKVKIEL 733
>gi|295689400|ref|YP_003593093.1| glycosyl hydrolase family protein [Caulobacter segnis ATCC 21756]
gi|295431303|gb|ADG10475.1| glycoside hydrolase family 3 domain protein [Caulobacter segnis
ATCC 21756]
Length = 895
Score = 254 bits (648), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 151/401 (37%), Positives = 224/401 (55%), Gaps = 39/401 (9%)
Query: 51 QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
Q DL+ +++ +EK+ L++ A A+PRLG+ Y WW+E+LHG G P
Sbjct: 41 QMAADLVAKMTTEEKLDQLLNTAPAIPRLGVPAYNWWTESLHGAL---------GSLP-T 90
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA---------GLTYWSPNVNIFR 161
T+FP+ + A++F+ L + +G + E R ++ A L WSPN+NIFR
Sbjct: 91 TNFPEPVGLAATFDTDLVKDVGGAIGAEVRGLHTLARATGRMGRIGTALNTWSPNINIFR 150
Query: 162 DPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRF 221
DPRWGRGQET GEDP L+ + ++V G+QG D D +V A+ KH+ + R
Sbjct: 151 DPRWGRGQETYGEDPHLAARMGVAFVEGVQGPDPDLPQVIATPKHYAVHSGPEST---RH 207
Query: 222 HFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRL 281
H N VS++D+EDT+ FR ++E K S+MC+YN+++G P CA +LK +R W+
Sbjct: 208 HANVYVSRRDLEDTYLPAFRAAIVEAKAGSIMCAYNRIDGQPACASDMLLKDYLRTAWKF 267
Query: 282 NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-------PFLGLHTESAVQ 334
+GY+VSDCD+V D + A A A+RAG+D +C L A+Q
Sbjct: 268 DGYVVSDCDAVKDINDNHKYAPDAATAVAAAVRAGVDNECNGATLTDTAGLSNRYRDALQ 327
Query: 335 RGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPY--GHLGPKDVCTPDHQELALEAARQ 392
+GL++ D++ ALV + ++R+G P +P P V TP+H+ LAL A+ +
Sbjct: 328 QGLITIGDVDQALVRLYSARLRVGDL---PGVRPVDTAPKAPSAVMTPEHEALALAASEK 384
Query: 393 GIVLLKNQGPSLPLSH-IRHRTVAVIGPNSDVTVTMIGNYA 432
+VLLKN G LPL +R VAVIGP D T + GNY+
Sbjct: 385 SLVLLKNDG-LLPLKPGLR---VAVIGPLGDATRVLRGNYS 421
Score = 130 bits (327), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 92/284 (32%), Positives = 146/284 (51%), Gaps = 48/284 (16%)
Query: 479 DATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMS 528
D + V+GL +EAE D+ L +P QQ L+ + K P ++V M+
Sbjct: 625 DVLVAVVGLTSDLEAEESPVEIPGFKGGDKTTLDIPADQQALLEQAKALGK-PLVVVAMN 683
Query: 529 GGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLP 588
G P+++++AK++ AAI+ A YPGQ+GG AIA++L G +NP G+LP+T+Y + + +LP
Sbjct: 684 GSPLNLSWAKDN--AAAILEAWYPGQSGGLAIANVLTGKTNPSGRLPLTFY--KSVEDLP 739
Query: 589 MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGS 648
P R GRTYR++ G VYPFG+G+SYT F + AP V P+ G
Sbjct: 740 -------PFGDYRMDGRTYRYFTGQPVYPFGYGLSYTRFDY----APLKVE-PIKGD--- 784
Query: 649 INATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAF 708
+G+ ++VT V+NVG + G ++ P AP L F
Sbjct: 785 -----AGQGLRVT------------TTVRNVGQRAGDEVAQLYLNFPNSPGAPRVALRGF 827
Query: 709 EKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
++V + G + V ++ + LS VD G R++ G + + +G
Sbjct: 828 QRVSLKPGEAKSVTFSLS-SRDLSSVDPDGVRQVMTGRYKVSVG 870
>gi|380692929|ref|ZP_09857788.1| glycoside hydrolase family protein [Bacteroides faecis MAJ27]
Length = 777
Score = 254 bits (648), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 225/791 (28%), Positives = 351/791 (44%), Gaps = 131/791 (16%)
Query: 46 SLPIPQRVNDLIGRLSLQEKVKLL--ISGAAAVPRLGIKGYEWWSEALH-GVSNV----- 97
S PI +RV DL+ ++++ EK + + G+ V + +W SE G+ N+
Sbjct: 36 SAPIEERVKDLLSQMNMDEKTCQMATLYGSGRVLADALPTEKWKSEIWKDGIGNIDEEHN 95
Query: 98 GPGTKFGGD--FP--------------------------------------GATSFPQVI 117
G G KFG + FP AT FP
Sbjct: 96 GLG-KFGSEYAFPYAKHVKAIHDIQRWFVEETRLGIPVDFTNEGIRGVCHEKATFFPAQC 154
Query: 118 TTASSFNATLWEAIGRVVSDEARAMYNGGTAGLT-YWSPNVNIFRDPRWGRGQETPGEDP 176
S++N L IG V + EA A+ G T +SP ++I +DPRWGR E GEDP
Sbjct: 155 GQGSTWNKELIARIGEVEAKEAVAL------GYTNIYSPILDIAQDPRWGRAVECYGEDP 208
Query: 177 VLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTF 236
L G+ ++ LQ + K+ A+ KHF Y + + V+ +++ +
Sbjct: 209 YLVGQLGKQMIQSLQ-----KHKLVATPKHFAVYSIPVGGRDGGTRTDPHVAPREMRTLY 263
Query: 237 DVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYY 296
PFR+ E VM SYN +G P L + +R EW GY+VSD D+V
Sbjct: 264 LEPFRVAFQEAGALGVMSSYNDYDGEPITGSYRFLTQILRQEWGFKGYVVSDSDAVEFIS 323
Query: 297 DTQHFTSTPEEAAADAIRAGLDLDC-----GPFLGLHTESAVQRGLLSEIDINNALVNTL 351
EEA ++ AGL++ F+ SA+ +G +S+ I+ + L
Sbjct: 324 SKHKVADNNEEAVVQSVNAGLNVRTNFSSPAGFIK-PLRSAIAKGKVSQATIDQRVSEIL 382
Query: 352 TVQMRLGMFDGEPSSQPY---GHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSH 408
V+ LG+FD PY G L K V +HQ +ALEAARQ IVLLKNQ LPL
Sbjct: 383 YVKFWLGLFD-----NPYRGDGKLADKIVHCKEHQAVALEAARQSIVLLKNQDNLLPLQK 437
Query: 409 IRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY---ARTIHQQGCK------- 458
++VAVIGPN+D +I Y T +GI A+ ++++GC+
Sbjct: 438 TL-KSVAVIGPNADEQKELICRYGPSNAPIKTVYKGIKEALPGAKVVYKKGCEIVDPHFP 496
Query: 459 -------DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELV 511
D+ + Q+ AI+A++ A+ I+V+G + E R L LPGRQ+EL+
Sbjct: 497 ESEVLPFDITPKEQQIMDEAIEAAKSAEVVIMVLGGSEVTVREERSRTSLDLPGRQEELL 556
Query: 512 SKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPG 571
V K PTILV++ G + +AK + AI+ A +PG+ G A+A+ +FG +NPG
Sbjct: 557 KAVCKLGK-PTILVMIDGRASSINYAKK--YVPAILHAWFPGEFCGQAVAETIFGDNNPG 613
Query: 572 GKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTV 631
GKL +T +P+ + +P +P S G + ++PFGHG+SYT F +
Sbjct: 614 GKLAVT-FPKS-VGQIPFA-FPFKPG-SDSGCGTSVT----GALFPFGHGLSYTTFEYN- 664
Query: 632 ANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF 691
N IS + V + V VKN G + G + ++
Sbjct: 665 ------------------NLKISPEQQGVLG------EVKVSCTVKNTGKRPGDEVVQLY 700
Query: 692 STPPAGHWAPH-KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIH 750
+ K L FE++ + +++V + + L++ D++ ++ G +
Sbjct: 701 LRDEISSVTTYVKILRGFERITLQPNEEKKVTFTLS-PQDLAIWDKNMKFQVEPGTFKVM 759
Query: 751 IGGTKHSVSLH 761
IG + + L
Sbjct: 760 IGASSKDIRLE 770
>gi|419862653|ref|ZP_14385247.1| beta-D-glucoside glucohydrolase [Escherichia coli O103:H25 str.
CVM9340]
gi|388344793|gb|EIL10617.1| beta-D-glucoside glucohydrolase [Escherichia coli O103:H25 str.
CVM9340]
Length = 765
Score = 254 bits (648), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 200/668 (29%), Positives = 322/668 (48%), Gaps = 101/668 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN + +GRV + Y GL W+P V++ RDPRWGR
Sbjct: 118 TVFPISLGLASSFNLDAVKTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V +QG S DR V S KHF AY +N VD
Sbjct: 172 EGFGEDTYLTSTMGKTMVEAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R +W G
Sbjct: 227 --MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDQWGFKGI 283
Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
VSD ++ +H T+ PE+A A+++G+++ + + ++ G ++ +
Sbjct: 284 TVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYYSKYLPGLIKSGKVTMEE 341
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLGPK+ D H++ A E AR+ +V
Sbjct: 342 LDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESLV 396
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL + T+AV+GP +D ++G++ AG+A T L GI G
Sbjct: 397 LLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVADQSVTVLTGIKNAVGENG 454
Query: 450 RTIHQQGCK---DVACAD----------------DQLFGAAIDASRQADATILVMGLDQS 490
+ ++ +G D D ++ A+ ++Q+D + V+G Q
Sbjct: 455 KVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQTAKQSDVVVAVVGEAQG 514
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L++ + K P +LVLM+G P+ A K D + AI+
Sbjct: 515 MAHEASSRTDITIPQSQRDLIAALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM---RPSQSKRYPGRTY 607
+ G GG AIAD+LFG NP GKLPM+ +P+ + +P+ + RP + + T
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPMS-FPRS-VGQIPVYYSHLNTGRPYNADKPNKYTS 629
Query: 608 RFYKGP--VVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKC 665
R++ +YPFG+G+SYT F TV++ ++ P R G + A+
Sbjct: 630 RYFDEANGALYPFGYGLSYTTF--TVSDVK--LSAPTMKRDGKVTAS------------- 672
Query: 666 NRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG-- 722
V V N G +DGA + ++ A P KQL FEK+ + G Q V
Sbjct: 673 --------VQVTNTGKRDGATVVQMYLQDVTASMSRPVKQLKGFEKITLKPGETQTVSFP 724
Query: 723 INIHVCKY 730
I+I K+
Sbjct: 725 IDIEALKF 732
>gi|423121272|ref|ZP_17108956.1| periplasmic beta-glucosidase [Klebsiella oxytoca 10-5246]
gi|376394107|gb|EHT06758.1| periplasmic beta-glucosidase [Klebsiella oxytoca 10-5246]
Length = 765
Score = 253 bits (647), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 208/688 (30%), Positives = 326/688 (47%), Gaps = 100/688 (14%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN + +GR+ + Y GL W+P V++ RDPRWGR
Sbjct: 118 TVFPISLGLASSFNLDAVKTVGRI------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + + V +QG S DR V S KHF AY +N VD
Sbjct: 172 EGFGEDTYLTTEMGRAMVESMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G +VM + N +NG P +D +LK +R +W G
Sbjct: 227 --MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDAWLLKEVLRDDWGFKGI 283
Query: 285 IVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEIDI 343
VSD ++ + PE+A A+++G+++ + + V+ G ++ ++
Sbjct: 284 TVSDHGAIKELI-KHGVAADPEDAVRVALKSGINMSMSDEYYSKYLPGLVKSGKVTMAEL 342
Query: 344 NNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIVL 396
++A + L V+ +G+F+ PY HLGP+ D H++ A E ARQ +VL
Sbjct: 343 DDATRHVLNVKYDMGLFN-----DPYSHLGPQGSDPQDTNAESRLHRQEAREVARQSLVL 397
Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYAR 450
LKN+ +LPL + TVAVIGP +D M+G++ AG+A T L GI G +
Sbjct: 398 LKNRLETLPLK--KSGTVAVIGPLADSKRDMMGSWSAAGVADQSVTVLTGIKEALGDKGK 455
Query: 451 TIHQQGCK---DVACAD----------------DQLFGAAIDASRQADATILVMGLDQSI 491
I+ +G D D + A+ A++Q+D + V+G Q +
Sbjct: 456 VIYARGANVTNDKGIVDFLNLYEKAVQVDPRSPQAMIDEAVAAAKQSDVVVAVVGEAQGM 515
Query: 492 EAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGY 551
EA R + LP Q++L+S + K P +LVLM+G P+ A K D + AI+ +
Sbjct: 516 AHEASSRTDITLPQSQRDLISALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETWF 572
Query: 552 PGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM---RP---SQSKRYPGR 605
G GG AIAD+LFG +NP GKLPM+ +P+ + +P+ + RP + +Y R
Sbjct: 573 AGTEGGHAIADVLFGDANPSGKLPMS-FPRS-VGQIPVYYSHLNTGRPYNADKPNKYTSR 630
Query: 606 TYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKC 665
+ GP +YPFG+G+SYT F +V++ ++ P R GS+ A+
Sbjct: 631 YFDEANGP-LYPFGYGLSYTTF--SVSDVK--MSAPTLQRDGSVTAS------------- 672
Query: 666 NRLTLGVQVDVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGIN 724
V V N G ++GA + L A P K L F++V + G Q V
Sbjct: 673 --------VQVTNTGKREGATVIQLYLQDVTASMSRPVKMLRGFKRVTLKPGETQTVAFP 724
Query: 725 IHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
I + L ++ G+ N+ IG
Sbjct: 725 IEI-DALKFWNQKMKYDAEPGKFNVFIG 751
>gi|365850358|ref|ZP_09390822.1| glycosyl hydrolase family 3 protein [Yokenella regensburgei ATCC
43003]
gi|364567365|gb|EHM45033.1| glycosyl hydrolase family 3 protein [Yokenella regensburgei ATCC
43003]
Length = 765
Score = 253 bits (647), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 202/669 (30%), Positives = 323/669 (48%), Gaps = 103/669 (15%)
Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
T FP + ASSFN + +GRV + Y GL W+P V++ RDPRWGR
Sbjct: 118 TVFPNSLGLASSFNLDAVKTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRVS 171
Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
E GED L+ + V +QG S DR V S KHF AY +N VD
Sbjct: 172 EGFGEDTYLTAIMGKTMVESMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226
Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
+S Q + + + P++ + G VM + N +NG P +D +LK +R +W G
Sbjct: 227 --MSPQRLFNDYLPPYKAALDAGS-GGVMVALNSLNGTPASSDSWLLKDILRDDWGFKGI 283
Query: 285 IVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
+SD ++ +H T S PE+A A+ +G+++ + + V+ G ++ +
Sbjct: 284 TISDHGAIKEL--IKHGTASDPEDAVRIALTSGINMSMSDEYYSKYLPGLVKSGKVTMAE 341
Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
+++A + L V+ +G+F+ PY HLG K+ D H++ A E AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGAKETDPQDTNAESRLHRKEAREVARESLV 396
Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
LLKN+ +LPL ++ +AV+GP +D M+G++ AG+A T L GI G
Sbjct: 397 LLKNRLETLPLK--KNAIIAVVGPLADSQRDMMGSWSAAGVASQSVTLLTGIQNAVGTEG 454
Query: 450 RTIHQQGC-----KDVACADDQ--------------LFGAAIDASRQADATILVMGLDQS 490
+ ++ +G K + +Q + A+ A++Q+D + V+G Q
Sbjct: 455 KVLYAKGANVTNDKGIVEFLNQYEPAVVVDPRSPQAMIDEAVKAAKQSDVVVAVVGEAQG 514
Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
+ EA R + +P Q++L+S + K P +LVLM+G P+ A K D + AI+
Sbjct: 515 MAHEASSRTDITIPQSQRDLISALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571
Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
+ G GG AIAD+LFG NP GKLPM+ +P+ + +P+ T P + +Y
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPMS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 629
Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
+ GP +YPFG+G+SYT F TV++ ++ P+ R G + A+
Sbjct: 630 HYFDEANGP-LYPFGYGLSYTTF--TVSDVK--MSSPVMKRDGKVTAS------------ 672
Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
V+V N G ++GA + ++ A P KQL FEKV + G Q V
Sbjct: 673 ---------VEVTNTGKREGATVIQMYVQDVTASMSRPVKQLRGFEKVTLKPGETQTVSF 723
Query: 723 -INIHVCKY 730
I+++ K+
Sbjct: 724 PIDVNALKF 732
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.136 0.416
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 12,862,684,050
Number of Sequences: 23463169
Number of extensions: 573245454
Number of successful extensions: 1194741
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 6066
Number of HSP's successfully gapped in prelim test: 1659
Number of HSP's that attempted gapping in prelim test: 1145659
Number of HSP's gapped (non-prelim): 13788
length of query: 770
length of database: 8,064,228,071
effective HSP length: 151
effective length of query: 619
effective length of database: 8,816,256,848
effective search space: 5457262988912
effective search space used: 5457262988912
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 81 (35.8 bits)